TOP ACTIVATIONS
MAX = 0.077

sad . She threw a tant rum and cried loudly .
kept crying and throwing a tant rum . Finally
listen . She threw a tant rum in the store .
. He kept throwing a tant rum . His mom was
his feet and threw a tant rum . "
She cried and threw a tant rum . Her mom tried
surprise . He threw a tant rum . He kicked and
to cry and threw a tant rum . She didn 't
her feet and threw a tant rum . She grabbed the
them and never threw a tant rum again . <|endoftext|> Once
and started to throw a tant rum . Suddenly , she
and so she threw a tant rum when her parents said
stubborn . She throws a tant rum . She kicks and
temper and often threw temper tant rums . One day ,
away , a bright blue spe ck in the sky .
as he opened the ch ocol ates and started eating them
That was the setup that fore shadow ed the bad ending
. When D ott ie arrived at the lab
and started eating the ch ocol ates . The shop keeper
bled up all the ch ocol ates , and she was

MIN ACTIVATIONS
MIN = -0.036

of all â if she only follows the
. But ot ters are happiest in the wild . Let
in here â .' <|endoftext|> Once there was
his elephant the name " BI GG EST ". He was
'm glad you are behaving reasonably and thinking independently !" <|endoftext|>
, the bear became the happiest bear around . Everywhere he
â Everyone nodded in agreement and the family continued to
Now , he was the happiest crab in the ocean !
. He was happiest when he was running around
a big barrel in a farm . The barrel was full
and nodded their heads in agreement . Mom looked
the park , even a farm ! Everywhere they went M
safe ". Everyone nodded in agreement . The
red par rot was the happiest he had ever been .
sand , and a big tree . Tim and
swings , slides , and pige ons . P ige ons
. The triangle was the happiest of all . <|endoftext|> Once
my had no time to dwell <|endoftext|> Once there was a
she played . She was happiest when she heard her music
just yet â but trust me , it

INTERVAL 0.049 - 0.077
CONTAINS 0.000%

and started eating the ch ocol ates . The shop keeper
his feet and threw a tant rum . "
She cried and threw a tant rum . Her mom tried
. He kept throwing a tant rum . His mom was
That was the setup that fore shadow ed the bad ending

INTERVAL 0.020 - 0.049
CONTAINS 0.564%

things . He felt happy and proud of himself . From
It took a lot of searching , but they eventually
noises . When she came out of the tunnel , she
the nurses gave her a big hug .
too high and the man couldn 't reach the others .

INTERVAL -0.008 - 0.020
CONTAINS 87.143%

the pond and fed the reef and then carefully moved it
One day , she asked her mom if they could go
and see ," Ben said . They ran to
mum came over and asked him what he was looking at
it won 't get so dusty !". <|endoftext|> This is a

INTERVAL -0.036 - -0.008
CONTAINS 12.293%

cut the mel on into slices and put them on a
them set up a play area in the living room .
that they would see Comet Hale - B opp again someday
instead . They left the hen and her eggs alone .
. Tim my loved to draw pictures in his notebook .
Token sad
Token ID6507
Feature activation+7.61e-03

Pos logit contributions
at+0.018
imm+0.018
wy+0.018
zzy+0.018
ier+0.017
Neg logit contributions
 have-0.011
 be-0.010
 what-0.009
 I-0.009
 Lily-0.009
Token.
Token ID13
Feature activation-3.87e-03

Pos logit contributions
imm+0.020
ier+0.020
at+0.020
zzy+0.019
iest+0.019
Neg logit contributions
 have-0.008
 be-0.007
 knock-0.006
 I-0.006
 Lily-0.005
Token She
Token ID1375
Feature activation-1.62e-03

Pos logit contributions
 have+0.005
 be+0.004
 knock+0.004
 we+0.004
 do+0.004
Neg logit contributions
at-0.009
imm-0.009
zzy-0.009
tto-0.008
wy-0.008
Token threw
Token ID9617
Feature activation-7.93e-03

Pos logit contributions
 have+0.002
 be+0.002
 I+0.002
 we+0.002
 Lily+0.002
Neg logit contributions
at-0.004
wy-0.004
imm-0.003
zzy-0.003
tto-0.003
Token a
Token ID257
Feature activation+2.67e-03

Pos logit contributions
 have+0.008
 be+0.008
 I+0.007
 we+0.007
 knock+0.006
Neg logit contributions
at-0.021
wy-0.021
imm-0.020
zzy-0.020
iest-0.019
Token tant
Token ID24246
Feature activation+7.44e-02

Pos logit contributions
at+0.005
imm+0.005
wy+0.005
zzy+0.004
iest+0.004
Neg logit contributions
 have-0.005
 be-0.005
 Lily-0.005
 I-0.005
 what-0.005
Tokenrum
Token ID6582
Feature activation-3.62e-03

Pos logit contributions
at+0.103
wy+0.099
zzy+0.098
imm+0.098
vell+0.094
Neg logit contributions
 have-0.166
 be-0.161
 what-0.152
 Lily-0.150
 it-0.145
Token and
Token ID290
Feature activation+4.12e-04

Pos logit contributions
 have+0.004
 be+0.004
 knock+0.003
 Lily+0.003
 do+0.003
Neg logit contributions
at-0.009
imm-0.009
ier-0.009
wy-0.009
zzy-0.009
Token cried
Token ID16896
Feature activation-6.11e-03

Pos logit contributions
at+0.001
wy+0.001
imm+0.001
zzy+0.001
ier+0.001
Neg logit contributions
 have-0.001
 be-0.001
 I-0.000
 knock-0.000
 do-0.000
Token loudly
Token ID23112
Feature activation+1.04e-02

Pos logit contributions
 have+0.006
 be+0.006
 Lily+0.005
 I+0.005
 we+0.004
Neg logit contributions
at-0.017
imm-0.017
wy-0.017
zzy-0.016
ier-0.016
Token.
Token ID13
Feature activation-8.75e-04

Pos logit contributions
at+0.031
imm+0.030
wy+0.030
zzy+0.029
iest+0.028
Neg logit contributions
 have-0.009
 be-0.008
 Lily-0.007
 I-0.006
 do-0.006
Token kept
Token ID4030
Feature activation+2.10e-03

Pos logit contributions
at+0.009
wy+0.009
imm+0.009
zzy+0.009
ier+0.008
Neg logit contributions
 have-0.006
 be-0.005
 Lily-0.005
 I-0.005
 what-0.005
Token crying
Token ID13774
Feature activation-3.23e-04

Pos logit contributions
at+0.005
wy+0.005
imm+0.005
zzy+0.005
ier+0.004
Neg logit contributions
 have-0.003
 be-0.003
 Lily-0.003
 I-0.003
 do-0.002
Token and
Token ID290
Feature activation-1.23e-03

Pos logit contributions
 have+0.000
 be+0.000
 Lily+0.000
 I+0.000
 do+0.000
Neg logit contributions
at-0.001
imm-0.001
wy-0.001
zzy-0.001
ier-0.001
Token throwing
Token ID9644
Feature activation-5.54e-03

Pos logit contributions
 have+0.002
 be+0.002
 I+0.001
 we+0.001
 Lily+0.001
Neg logit contributions
at-0.003
imm-0.003
wy-0.003
zzy-0.003
ier-0.003
Token a
Token ID257
Feature activation+3.86e-03

Pos logit contributions
 have+0.006
 be+0.006
 I+0.005
 Lily+0.005
 we+0.005
Neg logit contributions
at-0.015
wy-0.015
imm-0.015
zzy-0.014
iest-0.014
Token tant
Token ID24246
Feature activation+7.34e-02

Pos logit contributions
at+0.007
imm+0.007
wy+0.007
zzy+0.007
iest+0.006
Neg logit contributions
 have-0.008
 be-0.007
 Lily-0.007
 I-0.007
 what-0.007
Tokenrum
Token ID6582
Feature activation-2.74e-03

Pos logit contributions
at+0.100
wy+0.096
zzy+0.096
imm+0.096
vell+0.092
Neg logit contributions
 have-0.166
 be-0.161
 what-0.150
 Lily-0.150
 it-0.145
Token.
Token ID13
Feature activation-2.33e-03

Pos logit contributions
 have+0.003
 be+0.003
 knock+0.002
 Lily+0.002
 do+0.002
Neg logit contributions
at-0.008
imm-0.007
ier-0.007
wy-0.007
zzy-0.007
Token
Token ID198
Feature activation+1.22e-03

Pos logit contributions
 have+0.003
 be+0.003
 do+0.002
 knock+0.002
 we+0.002
Neg logit contributions
imm-0.006
at-0.005
zzy-0.005
side-0.005
wy-0.005
Token
Token ID198
Feature activation+2.96e-03

Pos logit contributions
at+0.003
imm+0.003
zzy+0.003
wy+0.003
us+0.003
Neg logit contributions
 have-0.001
 be-0.001
 I-0.001
 Lily-0.001
 do-0.001
TokenFinally
Token ID11158
Feature activation-2.71e-03

Pos logit contributions
at+0.005
zzy+0.005
wy+0.005
imm+0.005
tto+0.004
Neg logit contributions
 have-0.006
 be-0.006
 look-0.006
 I-0.005
 what-0.005
Token listen
Token ID6004
Feature activation-1.42e-02

Pos logit contributions
 we+0.020
 I+0.019
 Mom+0.019
 cars+0.019
 mom+0.019
Neg logit contributions
zzy-0.031
at-0.031
wy-0.031
ier-0.030
rene-0.029
Token.
Token ID13
Feature activation-9.62e-04

Pos logit contributions
 have+0.018
 be+0.016
 Lily+0.014
 I+0.014
 knock+0.013
Neg logit contributions
at-0.036
imm-0.035
wy-0.035
zzy-0.034
ier-0.034
Token She
Token ID1375
Feature activation-6.97e-03

Pos logit contributions
 have+0.001
 be+0.001
 knock+0.001
 do+0.001
 we+0.001
Neg logit contributions
imm-0.002
at-0.002
zzy-0.002
ibr-0.002
wy-0.002
Token threw
Token ID9617
Feature activation-9.67e-03

Pos logit contributions
 have+0.009
 be+0.009
 Lily+0.007
 do+0.007
 I+0.007
Neg logit contributions
at-0.018
imm-0.018
wy-0.017
zzy-0.017
iest-0.017
Token a
Token ID257
Feature activation+4.75e-03

Pos logit contributions
 have+0.011
 be+0.010
 I+0.009
 we+0.009
 knock+0.009
Neg logit contributions
at-0.024
wy-0.024
imm-0.023
zzy-0.023
ier-0.022
Token tant
Token ID24246
Feature activation+7.31e-02

Pos logit contributions
at+0.008
imm+0.008
wy+0.008
zzy+0.007
iest+0.007
Neg logit contributions
 have-0.010
 be-0.010
 Lily-0.009
 I-0.009
 what-0.009
Tokenrum
Token ID6582
Feature activation-4.11e-03

Pos logit contributions
at+0.098
imm+0.093
wy+0.092
zzy+0.091
rol+0.084
Neg logit contributions
 have-0.175
 be-0.170
 Lily-0.160
 what-0.158
 do-0.153
Token in
Token ID287
Feature activation-5.11e-03

Pos logit contributions
 have+0.005
 be+0.004
 knock+0.004
 do+0.003
 Lily+0.003
Neg logit contributions
at-0.011
ier-0.011
imm-0.010
wy-0.010
iest-0.010
Token the
Token ID262
Feature activation+3.38e-03

Pos logit contributions
 have+0.005
 be+0.005
 we+0.004
 knock+0.004
 I+0.004
Neg logit contributions
wy-0.013
at-0.012
iest-0.012
zzy-0.012
imm-0.012
Token store
Token ID3650
Feature activation-2.96e-03

Pos logit contributions
at+0.006
wy+0.006
iest+0.006
us+0.006
imm+0.006
Neg logit contributions
 have-0.006
 be-0.005
 I-0.005
 Lily-0.005
 what-0.005
Token.
Token ID13
Feature activation-3.90e-04

Pos logit contributions
 have+0.003
 be+0.003
 Lily+0.002
 I+0.002
 knock+0.002
Neg logit contributions
at-0.008
ier-0.008
imm-0.008
wy-0.007
zzy-0.007
Token.
Token ID13
Feature activation-1.05e-03

Pos logit contributions
 have+0.014
 be+0.012
 Lily+0.010
 I+0.010
 knock+0.009
Neg logit contributions
at-0.037
imm-0.035
zzy-0.035
wy-0.035
ier-0.035
Token He
Token ID679
Feature activation-2.82e-03

Pos logit contributions
 have+0.002
 be+0.002
 we+0.001
 do+0.001
 what+0.001
Neg logit contributions
at-0.002
imm-0.002
zzy-0.002
wy-0.002
ier-0.002
Token kept
Token ID4030
Feature activation+1.63e-03

Pos logit contributions
 have+0.003
 be+0.003
 Lily+0.003
 do+0.002
 I+0.002
Neg logit contributions
at-0.008
imm-0.008
wy-0.008
iest-0.007
rol-0.007
Token throwing
Token ID9644
Feature activation-8.02e-03

Pos logit contributions
at+0.004
wy+0.004
imm+0.003
zzy+0.003
us+0.003
Neg logit contributions
 have-0.002
 be-0.002
 I-0.002
 Lily-0.002
 we-0.002
Token a
Token ID257
Feature activation+3.26e-03

Pos logit contributions
 have+0.009
 be+0.009
 I+0.008
 we+0.008
 knock+0.008
Neg logit contributions
wy-0.020
at-0.020
imm-0.019
ier-0.018
zzy-0.018
Token tant
Token ID24246
Feature activation+7.28e-02

Pos logit contributions
at+0.008
wy+0.008
imm+0.007
iest+0.007
zzy+0.007
Neg logit contributions
 have-0.005
 be-0.004
 Lily-0.004
 I-0.004
 what-0.004
Tokenrum
Token ID6582
Feature activation-5.37e-03

Pos logit contributions
at+0.095
imm+0.093
zzy+0.091
wy+0.090
rol+0.085
Neg logit contributions
 have-0.175
 be-0.171
 what-0.157
 Lily-0.156
 knock-0.153
Token.
Token ID13
Feature activation-2.22e-06

Pos logit contributions
 have+0.006
 be+0.005
 knock+0.004
 Lily+0.004
 do+0.004
Neg logit contributions
at-0.015
ier-0.014
imm-0.014
wy-0.014
iest-0.014
Token His
Token ID2399
Feature activation+3.50e-04

Feature not active on prev token;
no predictions were affected.
Token mom
Token ID1995
Feature activation-6.60e-03

Pos logit contributions
at+0.001
wy+0.001
zzy+0.001
imm+0.001
iest+0.001
Neg logit contributions
 have-0.000
 be-0.000
 Lily-0.000
 I-0.000
 what-0.000
Token was
Token ID373
Feature activation-9.32e-03

Pos logit contributions
 have+0.009
 be+0.008
 Lily+0.007
 I+0.007
 what+0.006
Neg logit contributions
at-0.017
imm-0.017
wy-0.016
zzy-0.016
iest-0.015
Token his
Token ID465
Feature activation+8.73e-03

Pos logit contributions
 have+0.016
 be+0.015
 Lily+0.014
 what+0.013
 I+0.013
Neg logit contributions
at-0.028
imm-0.027
wy-0.027
zzy-0.026
vell-0.025
Token feet
Token ID3625
Feature activation+6.01e-03

Pos logit contributions
at+0.019
wy+0.019
imm+0.018
zzy+0.018
rito+0.017
Neg logit contributions
 have-0.013
 Lily-0.012
 what-0.011
 be-0.011
 I-0.011
Token and
Token ID290
Feature activation-1.28e-03

Pos logit contributions
at+0.019
imm+0.019
wy+0.018
zzy+0.018
iest+0.017
Neg logit contributions
 have-0.004
 be-0.004
 Lily-0.003
 I-0.003
 do-0.003
Token threw
Token ID9617
Feature activation-1.30e-02

Pos logit contributions
 have+0.002
 be+0.002
 Lily+0.002
 I+0.002
 do+0.002
Neg logit contributions
at-0.003
imm-0.003
wy-0.003
zzy-0.003
iest-0.003
Token a
Token ID257
Feature activation+3.98e-03

Pos logit contributions
 have+0.017
 be+0.016
 I+0.014
 do+0.013
 we+0.013
Neg logit contributions
at-0.032
wy-0.032
imm-0.031
zzy-0.030
iest-0.029
Token tant
Token ID24246
Feature activation+7.28e-02

Pos logit contributions
at+0.010
imm+0.010
wy+0.009
zzy+0.009
iest+0.009
Neg logit contributions
 have-0.006
 be-0.005
 Lily-0.005
 I-0.005
 do-0.005
Tokenrum
Token ID6582
Feature activation-6.62e-03

Pos logit contributions
at+0.106
zzy+0.105
imm+0.105
rol+0.101
vell+0.101
Neg logit contributions
 be-0.149
 have-0.148
 what-0.137
 Lily-0.131
 it-0.130
Token.
Token ID13
Feature activation+2.67e-03

Pos logit contributions
 have+0.006
 be+0.005
 Lily+0.004
 knock+0.004
 do+0.004
Neg logit contributions
at-0.020
imm-0.019
wy-0.019
zzy-0.018
ier-0.018
Token
Token ID198
Feature activation-1.70e-03

Pos logit contributions
at+0.007
imm+0.007
zzy+0.007
wy+0.007
us+0.007
Neg logit contributions
 have-0.003
 be-0.002
 do-0.002
 knock-0.002
 what-0.002
Token
Token ID198
Feature activation-1.23e-04

Pos logit contributions
 have+0.002
 be+0.002
 do+0.001
 look+0.001
 knock+0.001
Neg logit contributions
at-0.005
zzy-0.005
imm-0.005
wy-0.005
us-0.004
Token"
Token ID1
Feature activation+4.44e-03

Pos logit contributions
 have+0.000
 be+0.000
 Lily+0.000
 do+0.000
 what+0.000
Neg logit contributions
at-0.000
wy-0.000
zzy-0.000
imm-0.000
us-0.000
Token She
Token ID1375
Feature activation-6.72e-04

Pos logit contributions
 have+0.003
 be+0.003
 knock+0.003
 do+0.003
 what+0.003
Neg logit contributions
at-0.006
imm-0.006
zzy-0.006
wy-0.006
amb-0.006
Token cried
Token ID16896
Feature activation-3.45e-03

Pos logit contributions
 have+0.001
 be+0.001
 Lily+0.001
 I+0.001
 what+0.001
Neg logit contributions
at-0.002
imm-0.002
wy-0.002
zzy-0.002
ier-0.001
Token and
Token ID290
Feature activation+1.19e-02

Pos logit contributions
 have+0.003
 be+0.003
 Lily+0.002
 I+0.002
 do+0.002
Neg logit contributions
at-0.010
imm-0.010
wy-0.010
zzy-0.010
ier-0.009
Token threw
Token ID9617
Feature activation-8.07e-03

Pos logit contributions
at+0.030
imm+0.029
wy+0.029
zzy+0.028
ier+0.027
Neg logit contributions
 have-0.015
 be-0.014
 Lily-0.012
 I-0.012
 what-0.012
Token a
Token ID257
Feature activation+3.50e-03

Pos logit contributions
 have+0.011
 be+0.011
 I+0.010
 we+0.010
 do+0.009
Neg logit contributions
at-0.018
wy-0.018
imm-0.018
zzy-0.017
rito-0.016
Token tant
Token ID24246
Feature activation+7.22e-02

Pos logit contributions
at+0.006
imm+0.006
wy+0.006
zzy+0.006
iest+0.005
Neg logit contributions
 have-0.007
 be-0.007
 Lily-0.006
 I-0.006
 what-0.006
Tokenrum
Token ID6582
Feature activation-1.46e-03

Pos logit contributions
at+0.096
imm+0.092
wy+0.091
zzy+0.091
rol+0.083
Neg logit contributions
 have-0.172
 be-0.167
 Lily-0.157
 what-0.156
 knock-0.151
Token.
Token ID13
Feature activation-6.54e-04

Pos logit contributions
 have+0.002
 be+0.001
 knock+0.001
 Lily+0.001
 do+0.001
Neg logit contributions
at-0.004
imm-0.004
ier-0.004
wy-0.004
zzy-0.004
Token Her
Token ID2332
Feature activation-5.54e-03

Pos logit contributions
 have+0.001
 be+0.001
 do+0.001
 knock+0.001
 what+0.001
Neg logit contributions
at-0.002
imm-0.002
wy-0.001
zzy-0.001
rol-0.001
Token mom
Token ID1995
Feature activation-1.87e-03

Pos logit contributions
 have+0.004
 be+0.003
 I+0.003
 what+0.003
 we+0.003
Neg logit contributions
at-0.016
wy-0.016
imm-0.016
zzy-0.015
iest-0.015
Token tried
Token ID3088
Feature activation+1.03e-02

Pos logit contributions
 have+0.003
 be+0.002
 Lily+0.002
 I+0.002
 what+0.002
Neg logit contributions
at-0.005
imm-0.005
wy-0.004
zzy-0.004
ier-0.004
Token surprise
Token ID5975
Feature activation-4.56e-03

Pos logit contributions
 have+0.011
 be+0.011
 Lily+0.010
 I+0.009
 what+0.009
Neg logit contributions
at-0.018
wy-0.017
iest-0.016
vell-0.016
zzy-0.016
Token.
Token ID13
Feature activation+2.24e-03

Pos logit contributions
 have+0.004
 be+0.004
 Lily+0.003
 I+0.003
 what+0.003
Neg logit contributions
at-0.013
imm-0.013
wy-0.013
iest-0.012
ier-0.012
Token He
Token ID679
Feature activation-5.95e-03

Pos logit contributions
at+0.005
imm+0.005
zzy+0.005
wy+0.005
us+0.005
Neg logit contributions
 have-0.003
 be-0.003
 knock-0.003
 do-0.003
 we-0.003
Token threw
Token ID9617
Feature activation-1.21e-02

Pos logit contributions
 have+0.007
 be+0.007
 Lily+0.006
 I+0.006
 do+0.006
Neg logit contributions
at-0.016
imm-0.016
wy-0.015
zzy-0.015
iest-0.014
Token a
Token ID257
Feature activation+1.99e-03

Pos logit contributions
 have+0.013
 be+0.012
 we+0.011
 I+0.010
 knock+0.010
Neg logit contributions
wy-0.031
at-0.031
imm-0.029
zzy-0.028
iest-0.028
Token tant
Token ID24246
Feature activation+7.14e-02

Pos logit contributions
at+0.005
imm+0.004
wy+0.004
zzy+0.004
iest+0.004
Neg logit contributions
 have-0.003
 be-0.003
 Lily-0.003
 I-0.003
 what-0.003
Tokenrum
Token ID6582
Feature activation-5.94e-03

Pos logit contributions
at+0.096
imm+0.094
zzy+0.093
wy+0.090
rol+0.088
Neg logit contributions
 have-0.165
 be-0.162
 what-0.151
 Lily-0.149
 do-0.144
Token.
Token ID13
Feature activation+1.34e-03

Pos logit contributions
 have+0.006
 be+0.005
 Lily+0.004
 knock+0.004
 do+0.004
Neg logit contributions
at-0.017
imm-0.016
wy-0.016
ier-0.016
zzy-0.016
Token He
Token ID679
Feature activation-3.90e-03

Pos logit contributions
at+0.003
imm+0.003
zzy+0.003
wy+0.003
ier+0.003
Neg logit contributions
 have-0.002
 be-0.002
 do-0.001
 what-0.001
 we-0.001
Token kicked
Token ID12165
Feature activation-1.02e-02

Pos logit contributions
 have+0.005
 be+0.005
 Lily+0.005
 I+0.004
 do+0.004
Neg logit contributions
at-0.010
imm-0.010
wy-0.009
zzy-0.009
iest-0.009
Token and
Token ID290
Feature activation+3.55e-04

Pos logit contributions
 have+0.010
 be+0.009
 I+0.008
 we+0.008
 knock+0.008
Neg logit contributions
wy-0.027
at-0.027
imm-0.026
zzy-0.025
ier-0.025
Token to
Token ID284
Feature activation+3.14e-04

Pos logit contributions
at+0.023
imm+0.022
zzy+0.022
wy+0.021
us+0.020
Neg logit contributions
 have-0.015
 be-0.014
 I-0.013
 we-0.013
 Lily-0.012
Token cry
Token ID3960
Feature activation-5.87e-03

Pos logit contributions
at+0.001
zzy+0.001
wy+0.001
imm+0.001
ited+0.001
Neg logit contributions
 I-0.000
 we-0.000
 have-0.000
 Lily-0.000
 what-0.000
Token and
Token ID290
Feature activation+1.84e-03

Pos logit contributions
 have+0.007
 be+0.006
 I+0.006
 knock+0.005
 we+0.005
Neg logit contributions
ier-0.014
imm-0.014
at-0.014
zzy-0.013
iful-0.013
Token threw
Token ID9617
Feature activation-6.62e-03

Pos logit contributions
zzy+0.004
at+0.003
imm+0.003
wy+0.003
ited+0.003
Neg logit contributions
 have-0.003
 be-0.002
 I-0.002
 we-0.002
 knock-0.002
Token a
Token ID257
Feature activation+5.34e-03

Pos logit contributions
 have+0.009
 be+0.009
 we+0.008
 I+0.008
 knock+0.007
Neg logit contributions
at-0.015
wy-0.015
zzy-0.014
imm-0.014
iest-0.014
Token tant
Token ID24246
Feature activation+7.12e-02

Pos logit contributions
at+0.011
imm+0.011
wy+0.011
zzy+0.011
iest+0.010
Neg logit contributions
 have-0.009
 be-0.009
 Lily-0.008
 I-0.008
 what-0.008
Tokenrum
Token ID6582
Feature activation-1.62e-03

Pos logit contributions
at+0.098
wy+0.094
zzy+0.093
imm+0.093
rol+0.088
Neg logit contributions
 have-0.163
 be-0.159
 what-0.149
 Lily-0.148
 knock-0.143
Token.
Token ID13
Feature activation-3.61e-03

Pos logit contributions
 have+0.002
 be+0.002
 knock+0.001
 Lily+0.001
 we+0.001
Neg logit contributions
at-0.004
ier-0.004
imm-0.004
zzy-0.004
wy-0.004
Token She
Token ID1375
Feature activation+3.09e-03

Pos logit contributions
 have+0.006
 be+0.005
 we+0.005
 what+0.005
 do+0.005
Neg logit contributions
imm-0.007
at-0.007
zzy-0.007
side-0.007
ibr-0.006
Token didn
Token ID1422
Feature activation+2.20e-02

Pos logit contributions
at+0.007
wy+0.007
imm+0.007
zzy+0.007
tto+0.006
Neg logit contributions
 have-0.005
 be-0.004
 I-0.004
 Lily-0.004
 what-0.004
Token't
Token ID470
Feature activation-5.62e-03

Pos logit contributions
at+0.053
imm+0.052
wy+0.051
zzy+0.049
iest+0.048
Neg logit contributions
 have-0.032
 be-0.030
 Lily-0.027
 I-0.026
 what-0.025
Token her
Token ID607
Feature activation+2.26e-03

Pos logit contributions
at+0.000
imm+0.000
wy+0.000
zzy+0.000
rol+0.000
Neg logit contributions
 have-0.000
 be-0.000
 I-0.000
 we-0.000
 do-0.000
Token feet
Token ID3625
Feature activation+6.14e-03

Pos logit contributions
at+0.005
imm+0.005
wy+0.005
zzy+0.005
vell+0.005
Neg logit contributions
 have-0.003
 be-0.003
 Lily-0.003
 I-0.002
 what-0.002
Token and
Token ID290
Feature activation-4.28e-03

Pos logit contributions
at+0.019
imm+0.018
wy+0.018
zzy+0.018
iest+0.017
Neg logit contributions
 have-0.005
 be-0.005
 Lily-0.004
 I-0.003
 do-0.003
Token threw
Token ID9617
Feature activation-1.07e-02

Pos logit contributions
 have+0.006
 be+0.006
 I+0.005
 Lily+0.005
 do+0.005
Neg logit contributions
at-0.010
imm-0.009
wy-0.009
zzy-0.009
ier-0.009
Token a
Token ID257
Feature activation+3.99e-03

Pos logit contributions
 have+0.013
 be+0.012
 I+0.010
 we+0.010
 do+0.010
Neg logit contributions
at-0.027
wy-0.026
imm-0.026
zzy-0.025
rol-0.025
Token tant
Token ID24246
Feature activation+6.97e-02

Pos logit contributions
at+0.005
wy+0.005
imm+0.005
zzy+0.004
iest+0.004
Neg logit contributions
 have-0.010
 be-0.010
 Lily-0.009
 I-0.009
 what-0.009
Tokenrum
Token ID6582
Feature activation-3.77e-03

Pos logit contributions
at+0.097
wy+0.093
imm+0.092
zzy+0.092
vell+0.087
Neg logit contributions
 have-0.156
 be-0.153
 what-0.142
 Lily-0.141
 knock-0.137
Token.
Token ID13
Feature activation-2.83e-03

Pos logit contributions
 have+0.004
 be+0.003
 Lily+0.003
 knock+0.003
 do+0.003
Neg logit contributions
at-0.010
imm-0.010
ier-0.010
wy-0.010
zzy-0.010
Token She
Token ID1375
Feature activation-3.58e-03

Pos logit contributions
 have+0.004
 be+0.003
 do+0.003
 what+0.003
 Lily+0.003
Neg logit contributions
at-0.007
imm-0.007
wy-0.007
zzy-0.007
rol-0.007
Token grabbed
Token ID13176
Feature activation-7.78e-03

Pos logit contributions
 have+0.005
 be+0.005
 Lily+0.004
 I+0.004
 what+0.004
Neg logit contributions
at-0.008
wy-0.008
imm-0.008
zzy-0.008
ier-0.007
Token the
Token ID262
Feature activation+4.27e-03

Pos logit contributions
 have+0.007
 be+0.007
 I+0.005
 do+0.005
 we+0.005
Neg logit contributions
at-0.023
wy-0.022
imm-0.022
zzy-0.021
iest-0.021
Token them
Token ID606
Feature activation-3.63e-03

Pos logit contributions
 have+0.008
 be+0.007
 I+0.006
 Lily+0.006
 knock+0.006
Neg logit contributions
at-0.018
wy-0.018
imm-0.018
zzy-0.017
us-0.017
Token and
Token ID290
Feature activation-5.22e-03

Pos logit contributions
 have+0.004
 be+0.003
 Lily+0.003
 I+0.003
 do+0.003
Neg logit contributions
at-0.010
imm-0.010
wy-0.010
zzy-0.010
iest-0.009
Token never
Token ID1239
Feature activation+1.41e-03

Pos logit contributions
 have+0.005
 be+0.004
 Lily+0.004
 knock+0.004
 I+0.004
Neg logit contributions
at-0.013
imm-0.013
wy-0.013
zzy-0.013
iest-0.012
Token threw
Token ID9617
Feature activation-1.21e-02

Pos logit contributions
at+0.003
imm+0.003
wy+0.003
ited+0.003
ier+0.003
Neg logit contributions
 have-0.001
 mom-0.001
 Lily-0.001
 Mom-0.001
 be-0.001
Token a
Token ID257
Feature activation+6.53e-03

Pos logit contributions
 have+0.015
 be+0.014
 knock+0.013
 do+0.012
 I+0.012
Neg logit contributions
at-0.030
wy-0.029
imm-0.028
zzy-0.027
iest-0.027
Token tant
Token ID24246
Feature activation+6.92e-02

Pos logit contributions
at+0.014
wy+0.013
imm+0.013
zzy+0.013
iest+0.013
Neg logit contributions
 have-0.011
 be-0.010
 Lily-0.010
 I-0.009
 what-0.009
Tokenrum
Token ID6582
Feature activation-6.73e-03

Pos logit contributions
at+0.099
zzy+0.098
imm+0.096
wy+0.093
vell+0.092
Neg logit contributions
 have-0.150
 be-0.149
 what-0.137
 Lily-0.133
 do-0.130
Token again
Token ID757
Feature activation-1.20e-02

Pos logit contributions
 have+0.007
 be+0.006
 knock+0.005
 Lily+0.005
 do+0.004
Neg logit contributions
at-0.019
imm-0.019
wy-0.018
ier-0.018
zzy-0.018
Token.
Token ID13
Feature activation+8.27e-05

Pos logit contributions
 have+0.011
 be+0.010
 Lily+0.008
 knock+0.008
 I+0.007
Neg logit contributions
at-0.034
imm-0.034
wy-0.033
zzy-0.032
iest-0.032
Token<|endoftext|>
Token ID50256
Feature activation+5.45e-03

Pos logit contributions
at+0.000
imm+0.000
zzy+0.000
rol+0.000
amb+0.000
Neg logit contributions
 have-0.000
 be-0.000
 do-0.000
 knock-0.000
 look-0.000
TokenOnce
Token ID7454
Feature activation-2.76e-03

Pos logit contributions
at+0.017
imm+0.017
wy+0.017
zzy+0.016
iest+0.016
Neg logit contributions
 have-0.004
 be-0.004
 Lily-0.003
 I-0.002
 do-0.002
Token and
Token ID290
Feature activation+5.18e-03

Pos logit contributions
 have+0.000
 be+0.000
 knock+0.000
 I+0.000
 Lily+0.000
Neg logit contributions
ier-0.001
at-0.001
imm-0.001
iest-0.001
us-0.001
Token started
Token ID2067
Feature activation+6.22e-03

Pos logit contributions
at+0.012
imm+0.011
wy+0.011
zzy+0.011
ier+0.010
Neg logit contributions
 have-0.008
 be-0.008
 Lily-0.007
 I-0.006
 what-0.006
Token to
Token ID284
Feature activation-1.21e-03

Pos logit contributions
at+0.017
imm+0.017
wy+0.017
zzy+0.016
iest+0.016
Neg logit contributions
 have-0.007
 be-0.007
 Lily-0.006
 what-0.005
 do-0.005
Token throw
Token ID3714
Feature activation-1.28e-02

Pos logit contributions
 have+0.001
 be+0.001
 Lily+0.001
 I+0.001
 we+0.001
Neg logit contributions
at-0.004
wy-0.003
zzy-0.003
imm-0.003
tto-0.003
Token a
Token ID257
Feature activation+4.01e-03

Pos logit contributions
 have+0.014
 be+0.013
 I+0.010
 Lily+0.010
 do+0.010
Neg logit contributions
at-0.036
wy-0.035
imm-0.034
zzy-0.033
iest-0.032
Token tant
Token ID24246
Feature activation+6.81e-02

Pos logit contributions
at+0.009
imm+0.008
wy+0.008
zzy+0.008
iest+0.008
Neg logit contributions
 have-0.007
 be-0.007
 Lily-0.006
 I-0.006
 what-0.006
Tokenrum
Token ID6582
Feature activation-7.46e-03

Pos logit contributions
at+0.093
imm+0.090
wy+0.090
zzy+0.090
rol+0.085
Neg logit contributions
 have-0.155
 be-0.152
 what-0.141
 Lily-0.140
 knock-0.135
Token.
Token ID13
Feature activation-3.43e-03

Pos logit contributions
 have+0.009
 be+0.008
 knock+0.006
 Lily+0.006
 do+0.006
Neg logit contributions
at-0.019
ier-0.019
imm-0.019
zzy-0.018
wy-0.018
Token Suddenly
Token ID24975
Feature activation-6.79e-03

Pos logit contributions
 have+0.005
 be+0.005
 what+0.004
 do+0.004
 we+0.004
Neg logit contributions
at-0.008
imm-0.007
zzy-0.007
wy-0.007
side-0.007
Token,
Token ID11
Feature activation-6.46e-03

Pos logit contributions
 have+0.008
 be+0.007
 knock+0.006
 I+0.006
 do+0.006
Neg logit contributions
at-0.016
ier-0.016
iest-0.016
zzy-0.016
imm-0.015
Token she
Token ID673
Feature activation-2.03e-03

Pos logit contributions
 have+0.008
 be+0.008
 knock+0.006
 I+0.006
 do+0.006
Neg logit contributions
at-0.015
zzy-0.015
wy-0.015
iest-0.015
ier-0.014
Token and
Token ID290
Feature activation+1.81e-02

Pos logit contributions
 have+0.002
 be+0.002
 Lily+0.002
 I+0.001
 do+0.001
Neg logit contributions
at-0.010
imm-0.010
wy-0.009
zzy-0.009
iest-0.009
Token so
Token ID523
Feature activation-1.16e-03

Pos logit contributions
at+0.044
wy+0.042
imm+0.042
zzy+0.041
iest+0.039
Neg logit contributions
 have-0.025
 be-0.023
 Lily-0.021
 what-0.020
 I-0.020
Token she
Token ID673
Feature activation+2.55e-03

Pos logit contributions
 have+0.001
 be+0.001
 knock+0.001
 Lily+0.001
 find+0.001
Neg logit contributions
imm-0.003
at-0.003
iest-0.003
wy-0.003
zzy-0.003
Token threw
Token ID9617
Feature activation-5.44e-03

Pos logit contributions
at+0.007
imm+0.007
wy+0.006
iest+0.006
zzy+0.006
Neg logit contributions
 have-0.003
 be-0.003
 Lily-0.003
 do-0.003
 I-0.003
Token a
Token ID257
Feature activation+4.21e-03

Pos logit contributions
 have+0.007
 be+0.006
 we+0.006
 I+0.006
 knock+0.006
Neg logit contributions
at-0.013
wy-0.013
zzy-0.013
imm-0.013
iest-0.012
Token tant
Token ID24246
Feature activation+6.75e-02

Pos logit contributions
at+0.011
imm+0.011
wy+0.011
zzy+0.010
ier+0.010
Neg logit contributions
 have-0.005
 be-0.005
 Lily-0.004
 do-0.004
 I-0.004
Tokenrum
Token ID6582
Feature activation-2.86e-03

Pos logit contributions
wy+0.093
at+0.093
zzy+0.090
imm+0.089
rol+0.084
Neg logit contributions
 have-0.152
 be-0.148
 what-0.140
 Lily-0.136
 it-0.134
Token when
Token ID618
Feature activation-3.50e-03

Pos logit contributions
 have+0.003
 be+0.002
 Lily+0.002
 knock+0.002
 do+0.002
Neg logit contributions
at-0.008
imm-0.008
wy-0.008
zzy-0.008
ier-0.008
Token her
Token ID607
Feature activation+1.30e-03

Pos logit contributions
 have+0.005
 be+0.004
 knock+0.004
 do+0.004
 what+0.003
Neg logit contributions
at-0.007
zzy-0.007
iest-0.007
side-0.007
dd-0.007
Token parents
Token ID3397
Feature activation+1.50e-02

Pos logit contributions
at+0.004
imm+0.004
wy+0.004
zzy+0.004
iest+0.004
Neg logit contributions
 have-0.001
 be-0.001
 Lily-0.001
 I-0.000
 do-0.000
Token said
Token ID531
Feature activation-1.03e-04

Pos logit contributions
at+0.040
imm+0.040
wy+0.039
zzy+0.038
iest+0.036
Neg logit contributions
 have-0.019
 be-0.017
 Lily-0.015
 do-0.014
 what-0.014
Token stubborn
Token ID24444
Feature activation-5.81e-03

Pos logit contributions
 have+0.010
 be+0.010
 Lily+0.008
 I+0.008
 we+0.008
Neg logit contributions
at-0.024
wy-0.024
imm-0.023
zzy-0.023
ier-0.022
Token.
Token ID13
Feature activation+6.01e-04

Pos logit contributions
 have+0.006
 be+0.005
 knock+0.005
 Lily+0.004
 I+0.004
Neg logit contributions
at-0.015
ier-0.015
wy-0.015
imm-0.015
iest-0.015
Token She
Token ID1375
Feature activation-9.70e-03

Pos logit contributions
zzy+0.001
at+0.001
imm+0.001
ibr+0.001
wy+0.001
Neg logit contributions
 have-0.001
 be-0.001
 knock-0.001
 what-0.001
 we-0.001
Token throws
Token ID12542
Feature activation-5.95e-03

Pos logit contributions
 have+0.011
 be+0.011
 Lily+0.010
 we+0.009
 I+0.009
Neg logit contributions
at-0.025
wy-0.024
zzy-0.023
imm-0.023
ier-0.022
Token a
Token ID257
Feature activation+2.71e-03

Pos logit contributions
 have+0.006
 be+0.005
 I+0.004
 we+0.004
 do+0.004
Neg logit contributions
at-0.017
wy-0.017
imm-0.016
zzy-0.016
iest-0.015
Token tant
Token ID24246
Feature activation+6.71e-02

Pos logit contributions
at+0.005
imm+0.005
wy+0.005
zzy+0.004
iest+0.004
Neg logit contributions
 have-0.006
 be-0.005
 Lily-0.005
 I-0.005
 do-0.005
Tokenrum
Token ID6582
Feature activation-6.70e-03

Pos logit contributions
at+0.090
imm+0.088
zzy+0.086
wy+0.085
rol+0.082
Neg logit contributions
 have-0.156
 be-0.153
 what-0.142
 Lily-0.140
 do-0.136
Token.
Token ID13
Feature activation-8.64e-04

Pos logit contributions
 have+0.007
 be+0.006
 knock+0.006
 Lily+0.005
 do+0.005
Neg logit contributions
at-0.018
ier-0.017
imm-0.017
wy-0.017
iest-0.016
Token She
Token ID1375
Feature activation-7.86e-03

Pos logit contributions
 have+0.001
 be+0.001
 knock+0.001
 do+0.001
 what+0.001
Neg logit contributions
at-0.002
zzy-0.002
imm-0.002
wy-0.002
ibr-0.002
Token kicks
Token ID17364
Feature activation-5.83e-03

Pos logit contributions
 have+0.010
 be+0.009
 Lily+0.009
 what+0.008
 someone+0.008
Neg logit contributions
at-0.019
wy-0.019
zzy-0.017
imm-0.017
tto-0.016
Token and
Token ID290
Feature activation+3.26e-03

Pos logit contributions
 have+0.005
 be+0.005
 I+0.004
 knock+0.004
 we+0.004
Neg logit contributions
at-0.017
wy-0.017
imm-0.016
zzy-0.016
ier-0.015
Token temper
Token ID4124
Feature activation+7.27e-03

Pos logit contributions
at+0.042
imm+0.040
wy+0.040
zzy+0.039
iest+0.038
Neg logit contributions
 have-0.024
 be-0.022
 Lily-0.020
 I-0.019
 we-0.018
Token and
Token ID290
Feature activation-2.69e-03

Pos logit contributions
at+0.021
imm+0.020
wy+0.020
zzy+0.019
iest+0.019
Neg logit contributions
 have-0.007
 be-0.007
 Lily-0.006
 I-0.005
 we-0.005
Token often
Token ID1690
Feature activation+3.98e-03

Pos logit contributions
 have+0.003
 be+0.003
 Lily+0.003
 knock+0.003
 we+0.003
Neg logit contributions
at-0.007
imm-0.006
wy-0.006
zzy-0.006
iest-0.006
Token threw
Token ID9617
Feature activation-8.69e-03

Pos logit contributions
at+0.009
imm+0.009
wy+0.009
zzy+0.009
iest+0.008
Neg logit contributions
 have-0.006
 be-0.005
 Lily-0.005
 I-0.005
 what-0.005
Token temper
Token ID4124
Feature activation+1.12e-02

Pos logit contributions
 have+0.011
 be+0.010
 I+0.009
 we+0.009
 knock+0.009
Neg logit contributions
at-0.022
wy-0.021
imm-0.021
zzy-0.020
iest-0.020
Token tant
Token ID24246
Feature activation+6.28e-02

Pos logit contributions
at+0.026
imm+0.025
wy+0.025
zzy+0.024
wen+0.022
Neg logit contributions
 have-0.018
 be-0.017
 Lily-0.016
 what-0.015
 do-0.014
Tokenrums
Token ID45241
Feature activation+9.56e-04

Pos logit contributions
zzy+0.107
at+0.104
imm+0.101
wy+0.100
rol+0.097
Neg logit contributions
 have-0.117
 be-0.116
 what-0.114
 it-0.104
 Lily-0.103
Token.
Token ID13
Feature activation-3.65e-03

Pos logit contributions
at+0.003
imm+0.003
wy+0.003
zzy+0.002
iest+0.002
Neg logit contributions
 have-0.001
 be-0.001
 Lily-0.001
 I-0.001
 do-0.001
Token One
Token ID1881
Feature activation-1.11e-02

Pos logit contributions
 have+0.004
 be+0.004
 do+0.004
 we+0.004
 Lily+0.003
Neg logit contributions
at-0.009
imm-0.008
zzy-0.008
side-0.008
rol-0.008
Token day
Token ID1110
Feature activation-9.76e-03

Pos logit contributions
 have+0.019
 be+0.018
 Lily+0.017
 I+0.016
 what+0.016
Neg logit contributions
at-0.024
imm-0.023
wy-0.022
zzy-0.022
iest-0.022
Token,
Token ID11
Feature activation-6.84e-03

Pos logit contributions
 have+0.008
 be+0.008
 Lily+0.006
 I+0.006
 do+0.005
Neg logit contributions
at-0.030
imm-0.029
wy-0.029
zzy-0.028
iest-0.027
Token away
Token ID1497
Feature activation+7.30e-03

Pos logit contributions
at+0.007
wy+0.007
imm+0.006
atmeal+0.006
zzy+0.006
Neg logit contributions
 be-0.005
 have-0.005
 Lily-0.005
 it-0.004
 what-0.004
Token,
Token ID11
Feature activation+1.79e-03

Pos logit contributions
at+0.023
wy+0.023
imm+0.022
zzy+0.021
robat+0.021
Neg logit contributions
 have-0.006
 be-0.006
 Lily-0.005
 what-0.004
 do-0.004
Token a
Token ID257
Feature activation-6.49e-03

Pos logit contributions
at+0.004
imm+0.004
wy+0.004
zzy+0.004
rol+0.004
Neg logit contributions
 have-0.002
 be-0.002
 Lily-0.002
 do-0.002
 I-0.002
Token bright
Token ID6016
Feature activation-2.61e-03

Pos logit contributions
 have+0.008
 Lily+0.007
 be+0.007
 what+0.007
 I+0.007
Neg logit contributions
at-0.015
imm-0.015
zzy-0.015
iest-0.014
wy-0.014
Token blue
Token ID4171
Feature activation-5.71e-03

Pos logit contributions
 have+0.003
 be+0.003
 Lily+0.003
 what+0.003
 we+0.003
Neg logit contributions
imm-0.006
at-0.006
iest-0.006
zzy-0.006
wy-0.006
Token spe
Token ID693
Feature activation+5.47e-02

Pos logit contributions
 have+0.009
 be+0.008
 Lily+0.008
 what+0.007
 knock+0.007
Neg logit contributions
at-0.013
imm-0.012
iest-0.012
zzy-0.012
wy-0.012
Tokenck
Token ID694
Feature activation+4.33e-03

Pos logit contributions
at+0.116
imm+0.113
vell+0.113
wy+0.112
ibr+0.111
Neg logit contributions
 be-0.088
 have-0.088
 Lily-0.071
 knock-0.070
 do-0.069
Token in
Token ID287
Feature activation-1.55e-03

Pos logit contributions
at+0.010
imm+0.010
wy+0.010
zzy+0.009
tto+0.009
Neg logit contributions
 have-0.007
 be-0.007
 Lily-0.006
 what-0.006
 do-0.006
Token the
Token ID262
Feature activation-2.78e-03

Pos logit contributions
 have+0.002
 be+0.002
 Lily+0.001
 I+0.001
 do+0.001
Neg logit contributions
at-0.004
imm-0.004
wy-0.004
zzy-0.004
iest-0.004
Token sky
Token ID6766
Feature activation-1.43e-02

Pos logit contributions
 have+0.005
 be+0.004
 Lily+0.004
 what+0.004
 I+0.004
Neg logit contributions
at-0.006
imm-0.006
wy-0.006
zzy-0.006
iest-0.005
Token.
Token ID13
Feature activation-3.11e-03

Pos logit contributions
 have+0.011
 be+0.010
 Lily+0.008
 I+0.007
 do+0.007
Neg logit contributions
at-0.044
imm-0.043
wy-0.043
zzy-0.042
iest-0.041
Token as
Token ID355
Feature activation+1.17e-02

Pos logit contributions
 have+0.001
 be+0.001
 Lily+0.001
 I+0.001
 what+0.001
Neg logit contributions
at-0.005
imm-0.004
wy-0.004
zzy-0.004
iest-0.004
Token he
Token ID339
Feature activation+1.14e-02

Pos logit contributions
at+0.034
imm+0.033
wy+0.033
zzy+0.032
iest+0.031
Neg logit contributions
 have-0.012
 be-0.010
 Lily-0.009
 I-0.008
 what-0.008
Token opened
Token ID4721
Feature activation+4.50e-03

Pos logit contributions
at+0.028
imm+0.028
wy+0.027
zzy+0.026
iest+0.026
Neg logit contributions
 have-0.017
 be-0.016
 Lily-0.013
 do-0.013
 I-0.013
Token the
Token ID262
Feature activation+6.73e-03

Pos logit contributions
at+0.013
imm+0.013
wy+0.012
zzy+0.012
iest+0.012
Neg logit contributions
 have-0.005
 be-0.004
 Lily-0.004
 what-0.003
 I-0.003
Token ch
Token ID442
Feature activation+1.38e-03

Pos logit contributions
at+0.017
wy+0.016
imm+0.016
iest+0.015
zzy+0.015
Neg logit contributions
 have-0.009
 be-0.008
 Lily-0.008
 we-0.007
 I-0.007
Tokenocol
Token ID4668
Feature activation+5.37e-02

Pos logit contributions
at+0.002
imm+0.002
wy+0.002
zzy+0.002
iest+0.001
Neg logit contributions
 have-0.004
 be-0.004
 Lily-0.003
 I-0.003
 do-0.003
Tokenates
Token ID689
Feature activation+5.28e-03

Pos logit contributions
zzy+0.066
ited+0.066
 energ+0.063
tto+0.061
vell+0.061
Neg logit contributions
 be-0.110
 have-0.102
 what-0.099
 one-0.095
 each-0.093
Token and
Token ID290
Feature activation-7.64e-03

Pos logit contributions
at+0.016
imm+0.016
wy+0.016
zzy+0.015
rito+0.015
Neg logit contributions
 have-0.004
 be-0.004
 Lily-0.003
 I-0.003
 what-0.003
Token started
Token ID2067
Feature activation+5.02e-03

Pos logit contributions
 have+0.010
 be+0.010
 Lily+0.009
 I+0.008
 do+0.008
Neg logit contributions
at-0.018
imm-0.018
wy-0.018
zzy-0.017
iest-0.017
Token eating
Token ID6600
Feature activation-1.43e-02

Pos logit contributions
at+0.013
imm+0.013
wy+0.013
zzy+0.012
iest+0.012
Neg logit contributions
 have-0.006
 be-0.006
 Lily-0.005
 I-0.005
 what-0.005
Token them
Token ID606
Feature activation-4.02e-03

Pos logit contributions
 have+0.015
 be+0.014
 Lily+0.012
 I+0.011
 do+0.011
Neg logit contributions
at-0.040
imm-0.039
wy-0.039
zzy-0.038
iest-0.036
Token That
Token ID1320
Feature activation-3.18e-03

Pos logit contributions
 have+0.001
 be+0.001
 do+0.001
 knock+0.001
 Lily+0.001
Neg logit contributions
at-0.002
imm-0.002
zzy-0.002
us-0.002
wy-0.002
Token was
Token ID373
Feature activation+4.66e-03

Pos logit contributions
 have+0.004
 be+0.004
 Lily+0.004
 I+0.003
 what+0.003
Neg logit contributions
at-0.008
imm-0.008
wy-0.007
zzy-0.007
iest-0.007
Token the
Token ID262
Feature activation-1.10e-02

Pos logit contributions
at+0.014
imm+0.013
wy+0.013
zzy+0.013
iest+0.012
Neg logit contributions
 have-0.004
 be-0.004
 Lily-0.003
 I-0.003
 do-0.003
Token setup
Token ID9058
Feature activation+2.24e-03

Pos logit contributions
 have+0.017
 Lily+0.015
 be+0.015
 I+0.014
 what+0.014
Neg logit contributions
at-0.025
wy-0.024
zzy-0.023
imm-0.023
iest-0.023
Token that
Token ID326
Feature activation-2.35e-03

Pos logit contributions
at+0.007
imm+0.007
wy+0.007
zzy+0.007
vell+0.006
Neg logit contributions
 have-0.002
 be-0.002
 Lily-0.001
 do-0.001
 what-0.001
Token fore
Token ID1674
Feature activation+5.08e-02

Pos logit contributions
 have+0.003
 be+0.003
 Lily+0.003
 what+0.002
 knock+0.002
Neg logit contributions
at-0.006
imm-0.006
wy-0.006
iest-0.005
zzy-0.005
Tokenshadow
Token ID19106
Feature activation-5.05e-03

Pos logit contributions
at+0.106
imm+0.102
wy+0.101
zzy+0.098
iest+0.094
Neg logit contributions
 have-0.090
 be-0.088
 Lily-0.077
 what-0.076
 I-0.075
Tokened
Token ID276
Feature activation+1.07e-02

Pos logit contributions
 have+0.013
 be+0.013
 Lily+0.012
 I+0.012
 what+0.012
Neg logit contributions
at-0.006
wy-0.006
imm-0.006
zzy-0.006
vell-0.005
Token the
Token ID262
Feature activation-2.88e-03

Pos logit contributions
at+0.029
imm+0.028
wy+0.028
zzy+0.027
iest+0.026
Neg logit contributions
 have-0.013
 be-0.012
 Lily-0.011
 what-0.010
 I-0.010
Token bad
Token ID2089
Feature activation+1.28e-02

Pos logit contributions
 have+0.004
 Lily+0.004
 be+0.004
 I+0.004
 what+0.004
Neg logit contributions
at-0.006
wy-0.006
zzy-0.006
iest-0.006
imm-0.005
Token ending
Token ID7464
Feature activation-1.50e-03

Pos logit contributions
at+0.023
iest+0.022
imm+0.022
wy+0.022
zzy+0.022
Neg logit contributions
 have-0.023
 Lily-0.021
 what-0.020
 be-0.020
 I-0.020
Token.
Token ID13
Feature activation-8.08e-04

Pos logit contributions
at+0.023
imm+0.022
zzy+0.022
wy+0.022
iest+0.021
Neg logit contributions
 have-0.009
 be-0.008
 knock-0.007
 I-0.006
 Lily-0.006
Token
Token ID198
Feature activation+5.17e-03

Pos logit contributions
 have+0.001
 be+0.001
 Lily+0.000
 I+0.000
 do+0.000
Neg logit contributions
at-0.002
imm-0.002
wy-0.002
zzy-0.002
iest-0.002
Token
Token ID198
Feature activation+6.85e-03

Pos logit contributions
at+0.016
imm+0.015
wy+0.015
zzy+0.015
iest+0.014
Neg logit contributions
 have-0.004
 be-0.004
 Lily-0.003
 I-0.003
 what-0.003
TokenWhen
Token ID2215
Feature activation+1.20e-04

Pos logit contributions
at+0.014
imm+0.014
wy+0.013
zzy+0.013
iest+0.012
Neg logit contributions
 have-0.013
 be-0.012
 Lily-0.011
 I-0.011
 what-0.011
Token D
Token ID360
Feature activation+5.87e-03

Pos logit contributions
at+0.000
imm+0.000
wy+0.000
zzy+0.000
iest+0.000
Neg logit contributions
 have-0.000
 be-0.000
 Lily-0.000
 I-0.000
 do-0.000
Tokenott
Token ID1252
Feature activation+5.01e-02

Pos logit contributions
at+0.009
wy+0.009
imm+0.009
zzy+0.009
iest+0.008
Neg logit contributions
 have-0.013
 be-0.013
 Lily-0.012
 do-0.012
 I-0.011
Tokenie
Token ID494
Feature activation+1.70e-02

Pos logit contributions
vell+0.105
rol+0.101
umm+0.101
imm+0.099
zzy+0.097
Neg logit contributions
 have-0.077
 be-0.076
 what-0.071
 Lily-0.068
 find-0.064
Token arrived
Token ID5284
Feature activation+1.09e-02

Pos logit contributions
at+0.043
imm+0.042
wy+0.041
zzy+0.039
iest+0.039
Neg logit contributions
 have-0.024
 be-0.023
 Lily-0.020
 do-0.019
 I-0.018
Token at
Token ID379
Feature activation-5.63e-03

Pos logit contributions
at+0.031
imm+0.031
wy+0.030
zzy+0.029
iest+0.028
Neg logit contributions
 have-0.011
 be-0.010
 Lily-0.009
 I-0.008
 what-0.008
Token the
Token ID262
Feature activation+4.19e-03

Pos logit contributions
 have+0.004
 be+0.004
 Lily+0.003
 I+0.002
 do+0.002
Neg logit contributions
at-0.018
imm-0.017
wy-0.017
zzy-0.017
iest-0.016
Token lab
Token ID2248
Feature activation+6.52e-03

Pos logit contributions
at+0.010
imm+0.010
wy+0.010
zzy+0.010
iest+0.009
Neg logit contributions
 have-0.006
 be-0.006
 Lily-0.005
 I-0.005
 what-0.005
Token and
Token ID290
Feature activation-1.29e-02

Pos logit contributions
at+0.024
imm+0.023
wy+0.023
zzy+0.022
iest+0.022
Neg logit contributions
 have-0.007
 be-0.006
 Lily-0.005
 I-0.004
 do-0.004
Token started
Token ID2067
Feature activation+7.17e-03

Pos logit contributions
 have+0.017
 be+0.017
 do+0.014
 Lily+0.014
 I+0.013
Neg logit contributions
at-0.031
wy-0.030
imm-0.030
zzy-0.029
iest-0.028
Token eating
Token ID6600
Feature activation-1.40e-02

Pos logit contributions
at+0.018
imm+0.017
wy+0.017
zzy+0.017
rol+0.016
Neg logit contributions
 have-0.009
 be-0.009
 Lily-0.008
 I-0.007
 do-0.007
Token the
Token ID262
Feature activation-2.13e-04

Pos logit contributions
 have+0.013
 be+0.012
 I+0.009
 knock+0.009
 Lily+0.009
Neg logit contributions
at-0.040
wy-0.040
imm-0.039
zzy-0.038
iest-0.037
Token ch
Token ID442
Feature activation-1.81e-03

Pos logit contributions
 have+0.000
 be+0.000
 I+0.000
 Lily+0.000
 we+0.000
Neg logit contributions
at-0.000
wy-0.000
iest-0.000
imm-0.000
vell-0.000
Tokenocol
Token ID4668
Feature activation+4.88e-02

Pos logit contributions
 have+0.005
 be+0.005
 Lily+0.005
 I+0.005
 do+0.005
Neg logit contributions
at-0.002
imm-0.002
wy-0.002
iest-0.001
zzy-0.001
Tokenates
Token ID689
Feature activation+4.21e-04

Pos logit contributions
zzy+0.058
ited+0.057
 energ+0.055
vell+0.053
rol+0.052
Neg logit contributions
 be-0.107
 have-0.105
 what-0.095
 half-0.091
 mom-0.090
Token.
Token ID13
Feature activation-3.10e-03

Pos logit contributions
at+0.001
imm+0.001
wy+0.001
zzy+0.001
iest+0.001
Neg logit contributions
 have-0.000
 be-0.000
 Lily-0.000
 I-0.000
 do-0.000
Token The
Token ID383
Feature activation-1.04e-03

Pos logit contributions
 have+0.004
 be+0.004
 Lily+0.003
 do+0.003
 what+0.003
Neg logit contributions
at-0.008
imm-0.008
wy-0.007
zzy-0.007
iest-0.007
Token shop
Token ID6128
Feature activation-4.08e-03

Pos logit contributions
 have+0.002
 be+0.002
 Lily+0.001
 I+0.001
 we+0.001
Neg logit contributions
at-0.002
wy-0.002
imm-0.002
iest-0.002
zzy-0.002
Tokenkeeper
Token ID13884
Feature activation-1.34e-03

Pos logit contributions
 have+0.007
 be+0.006
 Lily+0.006
 what+0.006
 I+0.006
Neg logit contributions
at-0.008
imm-0.008
zzy-0.008
wy-0.008
iest-0.008
Tokenbled
Token ID9342
Feature activation-1.11e-02

Pos logit contributions
at+0.003
imm+0.003
wy+0.003
zzy+0.003
robat+0.003
Neg logit contributions
 have-0.008
 be-0.008
 Lily-0.008
 mom-0.008
 what-0.008
Token up
Token ID510
Feature activation+1.10e-02

Pos logit contributions
 have+0.014
 be+0.013
 Lily+0.011
 what+0.010
 I+0.010
Neg logit contributions
at-0.030
imm-0.029
wy-0.029
zzy-0.028
iest-0.027
Token all
Token ID477
Feature activation+9.44e-03

Pos logit contributions
at+0.035
imm+0.034
wy+0.034
zzy+0.033
iest+0.032
Neg logit contributions
 have-0.008
 be-0.007
 Lily-0.005
 I-0.005
 do-0.004
Token the
Token ID262
Feature activation+4.62e-03

Pos logit contributions
at+0.034
vell+0.034
imm+0.034
wy+0.033
zzy+0.033
Neg logit contributions
 have-0.004
 Lily-0.003
 be-0.002
 it-0.001
 the-0.001
Token ch
Token ID442
Feature activation+1.81e-03

Pos logit contributions
at+0.011
wy+0.011
imm+0.011
zzy+0.010
iest+0.010
Neg logit contributions
 have-0.007
 be-0.006
 Lily-0.006
 I-0.005
 we-0.005
Tokenocol
Token ID4668
Feature activation+4.76e-02

Pos logit contributions
at+0.002
imm+0.002
wy+0.002
zzy+0.002
iest+0.001
Neg logit contributions
 have-0.005
 be-0.005
 Lily-0.005
 I-0.005
 do-0.005
Tokenates
Token ID689
Feature activation+1.01e-02

Pos logit contributions
zzy+0.057
ited+0.056
 energ+0.054
rol+0.053
vell+0.050
Neg logit contributions
 be-0.101
 have-0.100
 what-0.095
 each-0.087
 half-0.086
Token,
Token ID11
Feature activation+5.58e-03

Pos logit contributions
at+0.032
imm+0.032
wy+0.031
zzy+0.031
rol+0.029
Neg logit contributions
 have-0.007
 be-0.007
 Lily-0.005
 what-0.005
 we-0.004
Token and
Token ID290
Feature activation-7.06e-03

Pos logit contributions
at+0.016
imm+0.015
wy+0.015
zzy+0.015
iest+0.014
Neg logit contributions
 have-0.006
 be-0.006
 Lily-0.005
 I-0.004
 what-0.004
Token she
Token ID673
Feature activation+5.19e-03

Pos logit contributions
 have+0.009
 be+0.008
 Lily+0.007
 I+0.007
 what+0.007
Neg logit contributions
at-0.019
imm-0.018
wy-0.018
zzy-0.017
iest-0.017
Token was
Token ID373
Feature activation-2.89e-03

Pos logit contributions
at+0.015
imm+0.015
wy+0.014
iful+0.014
iest+0.014
Neg logit contributions
 have-0.006
 be-0.006
 Lily-0.005
 mom-0.004
 do-0.004
Token of
Token ID286
Feature activation+3.07e-05

Pos logit contributions
 have+0.003
 be+0.003
 Lily+0.002
 I+0.002
 do+0.002
Neg logit contributions
at-0.006
imm-0.005
wy-0.005
zzy-0.005
iest-0.005
Token all
Token ID477
Feature activation-4.06e-03

Pos logit contributions
at+0.000
imm+0.000
wy+0.000
iest+0.000
zzy+0.000
Neg logit contributions
 have-0.000
 be-0.000
 Lily-0.000
 knock-0.000
 do-0.000
Tokenâ
Token ID22940
Feature activation-1.19e-02

Pos logit contributions
 have+0.005
 be+0.004
 Lily+0.004
 I+0.004
 what+0.003
Neg logit contributions
at-0.011
imm-0.011
wy-0.011
zzy-0.010
iest-0.010
Token
Token ID26391
Feature activation-3.83e-03

Pos logit contributions
 have+0.028
 be+0.027
 Lily+0.024
 do+0.024
 I+0.024
Neg logit contributions
at-0.017
imm-0.017
vell-0.016
wy-0.016
rol-0.016
Token
Token ID126
Feature activation-1.71e-02

Pos logit contributions
 have+0.008
 be+0.008
 Lily+0.007
 what+0.007
 do+0.007
Neg logit contributions
at-0.007
imm-0.007
wy-0.006
zzy-0.006
iest-0.006
Token
Token ID99
Feature activation-3.59e-02

Pos logit contributions
 have+0.034
 be+0.033
 Lily+0.030
 I+0.029
 do+0.029
Neg logit contributions
at-0.032
imm-0.031
wy-0.030
zzy-0.029
iest-0.029
Tokenif
Token ID361
Feature activation+7.36e-03

Pos logit contributions
 have+0.080
 be+0.076
 Lily+0.071
 I+0.069
 do+0.068
Neg logit contributions
at-0.060
imm-0.058
wy-0.057
zzy-0.054
iest-0.052
Token she
Token ID673
Feature activation+4.60e-03

Pos logit contributions
at+0.014
imm+0.014
wy+0.013
zzy+0.013
iest+0.013
Neg logit contributions
 have-0.014
 be-0.014
 Lily-0.013
 what-0.012
 do-0.012
Token only
Token ID691
Feature activation+5.83e-03

Pos logit contributions
at+0.012
imm+0.012
wy+0.012
zzy+0.011
iest+0.011
Neg logit contributions
 have-0.006
 be-0.005
 Lily-0.005
 what-0.004
 I-0.004
Token follows
Token ID5679
Feature activation-3.67e-03

Pos logit contributions
at+0.015
imm+0.015
wy+0.015
zzy+0.014
iest+0.014
Neg logit contributions
 have-0.006
 be-0.006
 Lily-0.006
 mom-0.005
 Mom-0.005
Token the
Token ID262
Feature activation+7.47e-04

Pos logit contributions
 have+0.004
 be+0.003
 Lily+0.003
 knock+0.003
 do+0.003
Neg logit contributions
at-0.010
imm-0.010
wy-0.010
zzy-0.009
iest-0.009
Token.
Token ID13
Feature activation-1.79e-03

Pos logit contributions
at+0.010
imm+0.010
wy+0.010
zzy+0.009
iest+0.009
Neg logit contributions
 have-0.003
 be-0.002
 Lily-0.002
 I-0.002
 do-0.002
Token But
Token ID887
Feature activation+1.05e-04

Pos logit contributions
 have+0.002
 be+0.002
 Lily+0.002
 do+0.002
 what+0.002
Neg logit contributions
at-0.005
imm-0.005
wy-0.005
zzy-0.004
rol-0.004
Token ot
Token ID30972
Feature activation+4.59e-03

Pos logit contributions
at+0.000
imm+0.000
iest+0.000
wy+0.000
iful+0.000
Neg logit contributions
 have-0.000
 be-0.000
 knock-0.000
 Lily-0.000
 They-0.000
Tokenters
Token ID1010
Feature activation+6.10e-03

Pos logit contributions
at+0.007
imm+0.007
wy+0.007
zzy+0.007
iest+0.006
Neg logit contributions
 have-0.010
 be-0.010
 Lily-0.009
 I-0.009
 do-0.009
Token are
Token ID389
Feature activation-4.15e-03

Pos logit contributions
at+0.018
imm+0.017
wy+0.017
zzy+0.016
iest+0.016
Neg logit contributions
 have-0.005
 be-0.005
 Lily-0.005
 I-0.004
 mom-0.004
Token happiest
Token ID49414
Feature activation-3.50e-02

Pos logit contributions
 have+0.004
 be+0.004
 Lily+0.003
 knock+0.003
 do+0.003
Neg logit contributions
wy-0.011
at-0.011
imm-0.010
zzy-0.010
ier-0.010
Token in
Token ID287
Feature activation-6.51e-03

Pos logit contributions
 have+0.036
 be+0.032
 knock+0.026
 do+0.025
 Lily+0.025
Neg logit contributions
at-0.094
imm-0.093
iest-0.092
wy-0.091
ier-0.090
Token the
Token ID262
Feature activation-4.59e-03

Pos logit contributions
 have+0.005
 be+0.005
 knock+0.004
 Lily+0.004
 do+0.004
Neg logit contributions
at-0.018
imm-0.018
wy-0.018
iest-0.017
zzy-0.017
Token wild
Token ID4295
Feature activation-5.00e-03

Pos logit contributions
 have+0.008
 be+0.007
 Lily+0.007
 what+0.007
 I+0.006
Neg logit contributions
at-0.010
imm-0.009
wy-0.009
iest-0.009
zzy-0.008
Token.
Token ID13
Feature activation-1.37e-03

Pos logit contributions
 have+0.008
 be+0.007
 Lily+0.007
 what+0.007
 knock+0.007
Neg logit contributions
at-0.011
imm-0.011
iest-0.010
wy-0.010
zzy-0.010
Token Let
Token ID3914
Feature activation-1.43e-02

Pos logit contributions
 have+0.001
 be+0.001
 do+0.001
 what+0.001
 knock+0.001
Neg logit contributions
at-0.004
imm-0.004
wy-0.003
zzy-0.003
rol-0.003
Token in
Token ID287
Feature activation-1.05e-02

Pos logit contributions
 have+0.002
 be+0.002
 Lily+0.002
 knock+0.002
 mom+0.001
Neg logit contributions
at-0.004
wy-0.004
imm-0.004
zzy-0.003
iest-0.003
Token here
Token ID994
Feature activation-1.74e-03

Pos logit contributions
 have+0.009
 be+0.008
 Lily+0.007
 knock+0.007
 do+0.007
Neg logit contributions
at-0.028
imm-0.027
wy-0.027
zzy-0.027
iest-0.027
Tokenâ
Token ID22940
Feature activation-1.60e-02

Pos logit contributions
 have+0.003
 be+0.002
 Lily+0.002
 knock+0.002
 do+0.002
Neg logit contributions
at-0.004
imm-0.004
wy-0.004
zzy-0.004
iest-0.004
Token
Token ID26391
Feature activation-3.49e-03

Pos logit contributions
 have+0.037
 be+0.036
 Lily+0.033
 I+0.032
 do+0.032
Neg logit contributions
at-0.025
imm-0.024
wy-0.023
zzy-0.022
iest-0.021
Token
Token ID126
Feature activation-1.62e-02

Pos logit contributions
 have+0.007
 be+0.007
 Lily+0.006
 what+0.006
 do+0.006
Neg logit contributions
at-0.006
imm-0.006
zzy-0.006
wy-0.006
lor-0.005
Token
Token ID99
Feature activation-3.49e-02

Pos logit contributions
 have+0.031
 be+0.029
 Lily+0.027
 I+0.026
 what+0.026
Neg logit contributions
at-0.032
wy-0.030
imm-0.030
zzy-0.030
rito-0.027
Token.'
Token ID2637
Feature activation+8.10e-04

Pos logit contributions
 have+0.074
 be+0.070
 Lily+0.066
 what+0.064
 we+0.064
Neg logit contributions
at-0.063
wy-0.059
imm-0.059
zzy-0.058
rol-0.051
Token<|endoftext|>
Token ID50256
Feature activation+1.31e-03

Pos logit contributions
at+0.002
imm+0.002
wy+0.002
zzy+0.002
iest+0.002
Neg logit contributions
 have-0.001
 be-0.001
 Lily-0.001
 do-0.001
 knock-0.001
TokenOnce
Token ID7454
Feature activation-8.37e-03

Pos logit contributions
at+0.003
imm+0.003
wy+0.003
zzy+0.003
iest+0.002
Neg logit contributions
 have-0.002
 be-0.002
 Lily-0.002
 I-0.002
 do-0.002
Token there
Token ID612
Feature activation-2.25e-03

Pos logit contributions
 have+0.015
 be+0.015
 Lily+0.013
 knock+0.013
 do+0.013
Neg logit contributions
at-0.017
imm-0.016
wy-0.016
zzy-0.015
iest-0.015
Token was
Token ID373
Feature activation+5.78e-03

Pos logit contributions
 have+0.002
 be+0.002
 Lily+0.002
 I+0.002
 knock+0.002
Neg logit contributions
at-0.006
imm-0.006
wy-0.006
zzy-0.006
iest-0.006
Token his
Token ID465
Feature activation+1.17e-03

Pos logit contributions
 have+0.004
 be+0.004
 Lily+0.003
 I+0.003
 do+0.003
Neg logit contributions
at-0.008
imm-0.007
wy-0.007
zzy-0.007
iest-0.007
Token elephant
Token ID20950
Feature activation-9.48e-03

Pos logit contributions
at+0.003
imm+0.003
wy+0.003
zzy+0.003
iest+0.003
Neg logit contributions
 have-0.001
 be-0.001
 Lily-0.001
 what-0.001
 I-0.001
Token the
Token ID262
Feature activation-8.69e-04

Pos logit contributions
 have+0.012
 be+0.011
 Lily+0.011
 I+0.010
 what+0.010
Neg logit contributions
at-0.023
imm-0.023
wy-0.022
zzy-0.021
iest-0.021
Token name
Token ID1438
Feature activation-5.80e-06

Pos logit contributions
 have+0.001
 be+0.001
 Lily+0.001
 what+0.001
 I+0.001
Neg logit contributions
at-0.002
wy-0.002
imm-0.002
zzy-0.002
iest-0.002
Token "
Token ID366
Feature activation+4.91e-03

Feature not active on prev token;
no predictions were affected.
TokenBI
Token ID3483
Feature activation-3.46e-02

Pos logit contributions
at+0.009
imm+0.008
wy+0.008
zzy+0.008
iest+0.007
Neg logit contributions
 have-0.010
 be-0.010
 Lily-0.009
 I-0.009
 do-0.009
TokenGG
Token ID11190
Feature activation-1.74e-02

Pos logit contributions
 have+0.069
 be+0.067
 Lily+0.062
 what+0.059
 I+0.059
Neg logit contributions
at-0.065
imm-0.062
wy-0.062
zzy-0.059
iest-0.056
TokenEST
Token ID6465
Feature activation+5.90e-03

Pos logit contributions
 have+0.034
 be+0.033
 Lily+0.030
 I+0.029
 do+0.029
Neg logit contributions
at-0.033
imm-0.033
wy-0.032
zzy-0.031
iest-0.030
Token".
Token ID1911
Feature activation-1.57e-03

Pos logit contributions
at+0.012
imm+0.012
wy+0.012
zzy+0.011
iest+0.011
Neg logit contributions
 have-0.010
 be-0.010
 Lily-0.009
 I-0.009
 do-0.009
Token He
Token ID679
Feature activation+2.25e-03

Pos logit contributions
 have+0.002
 be+0.002
 do+0.002
 what+0.002
 knock+0.002
Neg logit contributions
at-0.004
imm-0.004
zzy-0.004
rol-0.004
wy-0.003
Token was
Token ID373
Feature activation-2.50e-03

Pos logit contributions
at+0.006
imm+0.005
zzy+0.005
wy+0.005
iest+0.005
Neg logit contributions
 have-0.003
 be-0.003
 Lily-0.003
 I-0.002
 what-0.002
Token'm
Token ID1101
Feature activation-7.12e-03

Pos logit contributions
 have+0.004
 be+0.003
 Lily+0.003
 do+0.003
 what+0.003
Neg logit contributions
at-0.006
imm-0.006
wy-0.006
zzy-0.005
iest-0.005
Token glad
Token ID9675
Feature activation-1.88e-02

Pos logit contributions
 have+0.010
 be+0.010
 Lily+0.009
 I+0.008
 what+0.008
Neg logit contributions
at-0.018
imm-0.018
wy-0.017
zzy-0.017
iest-0.016
Token you
Token ID345
Feature activation+6.16e-03

Pos logit contributions
 have+0.016
 be+0.013
 knock+0.011
 do+0.009
 Lily+0.009
Neg logit contributions
at-0.053
imm-0.051
zzy-0.051
iest-0.050
us-0.050
Token are
Token ID389
Feature activation+8.97e-04

Pos logit contributions
at+0.017
imm+0.016
wy+0.016
zzy+0.015
iest+0.015
Neg logit contributions
 have-0.007
 be-0.007
 Lily-0.006
 I-0.006
 what-0.006
Token behaving
Token ID37722
Feature activation+1.69e-03

Pos logit contributions
at+0.002
imm+0.002
wy+0.002
zzy+0.002
iest+0.002
Neg logit contributions
 have-0.001
 be-0.001
 Lily-0.001
 I-0.001
 do-0.001
Token reasonably
Token ID13025
Feature activation-3.46e-02

Pos logit contributions
at+0.005
imm+0.005
wy+0.005
zzy+0.004
iest+0.004
Neg logit contributions
 have-0.002
 be-0.002
 Lily-0.001
 I-0.001
 what-0.001
Token and
Token ID290
Feature activation+3.57e-04

Pos logit contributions
 have+0.031
 be+0.028
 Lily+0.023
 knock+0.021
 what+0.020
Neg logit contributions
at-0.102
imm-0.099
wy-0.099
zzy-0.096
iest-0.094
Token thinking
Token ID3612
Feature activation-8.40e-03

Pos logit contributions
imm+0.001
at+0.001
wy+0.001
iest+0.001
ier+0.001
Neg logit contributions
 have-0.000
 Lily-0.000
 be-0.000
 One-0.000
 knock-0.000
Token independently
Token ID14799
Feature activation+5.60e-03

Pos logit contributions
 have+0.009
 be+0.009
 knock+0.007
 Lily+0.007
 find+0.007
Neg logit contributions
at-0.022
imm-0.021
iest-0.021
wy-0.021
ier-0.021
Token!"
Token ID2474
Feature activation-1.45e-03

Pos logit contributions
at+0.015
imm+0.015
wy+0.015
ier+0.014
iest+0.014
Neg logit contributions
 have-0.006
 be-0.005
 Lily-0.005
 knock-0.004
 do-0.004
Token<|endoftext|>
Token ID50256
Feature activation+1.72e-03

Pos logit contributions
 have+0.001
 be+0.001
 Lily+0.001
 do+0.001
 I+0.001
Neg logit contributions
at-0.004
imm-0.004
wy-0.004
zzy-0.004
rol-0.004
Token,
Token ID11
Feature activation-5.15e-03

Pos logit contributions
at+0.013
imm+0.013
wy+0.012
zzy+0.012
iest+0.012
Neg logit contributions
 have-0.003
 be-0.003
 Lily-0.002
 I-0.002
 what-0.002
Token the
Token ID262
Feature activation-3.19e-03

Pos logit contributions
 have+0.004
 be+0.004
 Lily+0.003
 I+0.003
 do+0.002
Neg logit contributions
at-0.016
imm-0.015
wy-0.015
zzy-0.015
iest-0.015
Token bear
Token ID6842
Feature activation+2.98e-03

Pos logit contributions
 have+0.004
 be+0.004
 Lily+0.004
 we+0.004
 what+0.004
Neg logit contributions
at-0.007
imm-0.007
wy-0.007
iest-0.007
vell-0.007
Token became
Token ID2627
Feature activation-2.88e-03

Pos logit contributions
at+0.008
imm+0.008
wy+0.008
zzy+0.007
iest+0.007
Neg logit contributions
 have-0.004
 be-0.004
 Lily-0.003
 do-0.003
 I-0.003
Token the
Token ID262
Feature activation-7.13e-03

Pos logit contributions
 have+0.003
 be+0.003
 Lily+0.002
 what+0.002
 mom+0.002
Neg logit contributions
at-0.009
imm-0.008
wy-0.008
zzy-0.008
iest-0.008
Token happiest
Token ID49414
Feature activation-3.44e-02

Pos logit contributions
 have+0.012
 be+0.011
 Lily+0.010
 do+0.010
 I+0.010
Neg logit contributions
at-0.016
imm-0.016
wy-0.015
zzy-0.015
iest-0.014
Token bear
Token ID6842
Feature activation-3.69e-03

Pos logit contributions
 have+0.039
 be+0.036
 Lily+0.032
 I+0.029
 do+0.029
Neg logit contributions
at-0.096
wy-0.093
imm-0.093
zzy-0.090
rol-0.086
Token around
Token ID1088
Feature activation-1.82e-02

Pos logit contributions
 have+0.005
 be+0.004
 Lily+0.004
 I+0.004
 we+0.003
Neg logit contributions
at-0.010
imm-0.010
wy-0.010
zzy-0.010
rol-0.009
Token.
Token ID13
Feature activation-1.54e-03

Pos logit contributions
 have+0.019
 be+0.017
 Lily+0.015
 I+0.014
 what+0.013
Neg logit contributions
at-0.052
imm-0.051
wy-0.051
zzy-0.049
iest-0.047
Token Everywhere
Token ID48201
Feature activation+7.06e-03

Pos logit contributions
 have+0.002
 be+0.002
 Lily+0.002
 I+0.001
 do+0.001
Neg logit contributions
at-0.004
imm-0.004
wy-0.004
zzy-0.004
iest-0.004
Token he
Token ID339
Feature activation+6.53e-03

Pos logit contributions
at+0.020
imm+0.019
wy+0.019
zzy+0.019
iest+0.018
Neg logit contributions
 have-0.008
 be-0.007
 Lily-0.006
 I-0.006
 do-0.006
Tokenâ
Token ID22940
Feature activation-9.11e-03

Pos logit contributions
 have+0.002
 be+0.002
 do+0.002
 knock+0.002
 what+0.002
Neg logit contributions
at-0.006
imm-0.005
zzy-0.005
rol-0.005
wy-0.005
Token
Token ID26391
Feature activation+1.76e-03

Pos logit contributions
 have+0.023
 be+0.022
 Lily+0.020
 I+0.020
 do+0.020
Neg logit contributions
at-0.013
imm-0.012
wy-0.012
zzy-0.011
vell-0.011
Token Everyone
Token ID11075
Feature activation+6.78e-03

Pos logit contributions
at+0.004
imm+0.004
zzy+0.004
wy+0.004
wen+0.003
Neg logit contributions
 have-0.003
 be-0.003
 Lily-0.003
 what-0.002
 knock-0.002
Token nodded
Token ID14464
Feature activation-6.88e-03

Pos logit contributions
at+0.017
imm+0.017
wy+0.017
zzy+0.016
iest+0.016
Neg logit contributions
 have-0.009
 be-0.008
 Lily-0.007
 I-0.007
 what-0.007
Token in
Token ID287
Feature activation-3.00e-03

Pos logit contributions
 have+0.006
 be+0.005
 Lily+0.004
 I+0.004
 do+0.004
Neg logit contributions
at-0.021
imm-0.020
wy-0.020
zzy-0.019
iest-0.019
Token agreement
Token ID4381
Feature activation-3.43e-02

Pos logit contributions
 have+0.005
 be+0.004
 Lily+0.004
 we+0.004
 I+0.004
Neg logit contributions
at-0.006
zzy-0.006
imm-0.006
wy-0.006
iest-0.006
Token and
Token ID290
Feature activation-5.33e-04

Pos logit contributions
 have+0.030
 be+0.026
 Lily+0.022
 knock+0.019
 do+0.019
Neg logit contributions
at-0.102
imm-0.099
wy-0.098
zzy-0.096
iest-0.094
Token the
Token ID262
Feature activation-1.26e-03

Pos logit contributions
 have+0.001
 be+0.001
 Lily+0.001
 I+0.001
 what+0.001
Neg logit contributions
at-0.001
imm-0.001
wy-0.001
zzy-0.001
iest-0.001
Token family
Token ID1641
Feature activation+6.00e-04

Pos logit contributions
 have+0.002
 be+0.001
 Lily+0.001
 what+0.001
 I+0.001
Neg logit contributions
at-0.003
iest-0.003
wy-0.003
imm-0.003
zzy-0.003
Token continued
Token ID3767
Feature activation+2.44e-03

Pos logit contributions
at+0.001
imm+0.001
wy+0.001
zzy+0.001
iest+0.001
Neg logit contributions
 have-0.001
 be-0.001
 Lily-0.001
 I-0.001
 what-0.001
Token to
Token ID284
Feature activation-1.76e-03

Pos logit contributions
at+0.007
imm+0.006
wy+0.006
zzy+0.006
iest+0.006
Neg logit contributions
 have-0.003
 be-0.003
 Lily-0.002
 what-0.002
 do-0.002
Token Now
Token ID2735
Feature activation+1.04e-02

Pos logit contributions
 have+0.002
 be+0.002
 Lily+0.002
 I+0.002
 what+0.002
Neg logit contributions
at-0.007
imm-0.007
wy-0.007
zzy-0.007
iest-0.006
Token,
Token ID11
Feature activation+1.36e-03

Pos logit contributions
at+0.031
imm+0.030
wy+0.030
zzy+0.029
iest+0.028
Neg logit contributions
 have-0.009
 be-0.008
 Lily-0.006
 do-0.006
 knock-0.006
Token he
Token ID339
Feature activation-2.95e-05

Pos logit contributions
at+0.004
imm+0.004
wy+0.003
iest+0.003
zzy+0.003
Neg logit contributions
 have-0.001
 be-0.001
 Lily-0.001
 knock-0.001
 what-0.001
Token was
Token ID373
Feature activation+4.65e-04

Pos logit contributions
 have+0.000
 be+0.000
 Lily+0.000
 what+0.000
 I+0.000
Neg logit contributions
at-0.000
imm-0.000
wy-0.000
zzy-0.000
iest-0.000
Token the
Token ID262
Feature activation-1.40e-02

Pos logit contributions
at+0.001
imm+0.001
wy+0.001
zzy+0.001
iest+0.001
Neg logit contributions
 have-0.001
 be-0.001
 Lily-0.001
 I-0.001
 what-0.001
Token happiest
Token ID49414
Feature activation-3.40e-02

Pos logit contributions
 have+0.023
 be+0.022
 Lily+0.020
 I+0.019
 do+0.019
Neg logit contributions
at-0.032
imm-0.031
wy-0.030
zzy-0.029
iest-0.028
Token crab
Token ID32202
Feature activation-4.96e-03

Pos logit contributions
 have+0.059
 be+0.056
 Lily+0.052
 I+0.049
 do+0.048
Neg logit contributions
at-0.074
imm-0.071
wy-0.070
zzy-0.068
iest-0.065
Token in
Token ID287
Feature activation-1.02e-02

Pos logit contributions
 have+0.008
 be+0.007
 I+0.006
 Lily+0.006
 we+0.006
Neg logit contributions
at-0.012
imm-0.012
wy-0.012
zzy-0.012
rol-0.011
Token the
Token ID262
Feature activation-2.61e-03

Pos logit contributions
 have+0.007
 be+0.006
 Lily+0.005
 do+0.004
 I+0.004
Neg logit contributions
at-0.032
imm-0.031
wy-0.031
zzy-0.030
iest-0.029
Token ocean
Token ID9151
Feature activation-6.31e-03

Pos logit contributions
 have+0.005
 be+0.005
 Lily+0.004
 what+0.004
 knock+0.004
Neg logit contributions
at-0.005
wy-0.004
imm-0.004
zzy-0.004
iest-0.004
Token!
Token ID0
Feature activation-4.87e-03

Pos logit contributions
 have+0.006
 be+0.005
 Lily+0.004
 what+0.004
 I+0.004
Neg logit contributions
at-0.019
imm-0.018
wy-0.018
zzy-0.017
iest-0.017
Token.
Token ID13
Feature activation-4.44e-03

Pos logit contributions
 have+0.006
 be+0.005
 Lily+0.004
 what+0.004
 do+0.004
Neg logit contributions
at-0.022
imm-0.022
wy-0.022
zzy-0.021
iest-0.020
Token
Token ID198
Feature activation+4.39e-05

Pos logit contributions
 have+0.004
 be+0.004
 Lily+0.003
 do+0.003
 knock+0.003
Neg logit contributions
at-0.013
imm-0.012
wy-0.012
zzy-0.012
rol-0.012
Token
Token ID198
Feature activation+1.13e-03

Pos logit contributions
at+0.000
imm+0.000
wy+0.000
zzy+0.000
iest+0.000
Neg logit contributions
 have-0.000
 be-0.000
 Lily-0.000
 I-0.000
 do-0.000
TokenHe
Token ID1544
Feature activation-1.30e-03

Pos logit contributions
at+0.003
imm+0.003
wy+0.003
zzy+0.003
iest+0.003
Neg logit contributions
 have-0.001
 be-0.001
 Lily-0.001
 do-0.001
 I-0.001
Token was
Token ID373
Feature activation-5.02e-04

Pos logit contributions
 have+0.002
 be+0.002
 Lily+0.002
 do+0.001
 I+0.001
Neg logit contributions
at-0.003
imm-0.003
wy-0.003
zzy-0.003
iest-0.003
Token happiest
Token ID49414
Feature activation-3.39e-02

Pos logit contributions
 have+0.001
 be+0.001
 Lily+0.000
 what+0.000
 do+0.000
Neg logit contributions
at-0.001
imm-0.001
wy-0.001
zzy-0.001
iest-0.001
Token when
Token ID618
Feature activation-3.24e-03

Pos logit contributions
 have+0.029
 be+0.027
 Lily+0.025
 I+0.022
 we+0.020
Neg logit contributions
at-0.107
wy-0.105
imm-0.102
zzy-0.099
vell-0.096
Token he
Token ID339
Feature activation-5.56e-03

Pos logit contributions
 have+0.004
 be+0.004
 Lily+0.003
 knock+0.003
 do+0.003
Neg logit contributions
at-0.008
imm-0.008
iest-0.008
wy-0.007
zzy-0.007
Token was
Token ID373
Feature activation+2.57e-03

Pos logit contributions
 have+0.007
 be+0.006
 do+0.005
 Lily+0.005
 what+0.005
Neg logit contributions
at-0.015
imm-0.015
wy-0.015
vell-0.014
zzy-0.014
Token running
Token ID2491
Feature activation-5.88e-03

Pos logit contributions
at+0.007
imm+0.006
wy+0.006
zzy+0.006
iest+0.006
Neg logit contributions
 have-0.003
 be-0.003
 Lily-0.003
 I-0.003
 do-0.003
Token around
Token ID1088
Feature activation-1.25e-02

Pos logit contributions
 have+0.007
 be+0.007
 Lily+0.006
 I+0.006
 do+0.006
Neg logit contributions
at-0.015
imm-0.015
wy-0.015
zzy-0.014
iest-0.014
Token a
Token ID257
Feature activation-8.91e-03

Pos logit contributions
at+0.013
imm+0.013
wy+0.013
zzy+0.012
iest+0.012
Neg logit contributions
 have-0.002
 be-0.002
 Lily-0.001
 I-0.001
 do-0.001
Token big
Token ID1263
Feature activation+9.38e-03

Pos logit contributions
 have+0.010
 be+0.009
 what+0.008
 I+0.008
 Lily+0.007
Neg logit contributions
at-0.022
wy-0.022
zzy-0.021
imm-0.021
vell-0.021
Token barrel
Token ID9036
Feature activation+1.20e-03

Pos logit contributions
at+0.022
wy+0.020
imm+0.020
robat+0.019
iest+0.019
Neg logit contributions
 have-0.013
 be-0.011
 what-0.010
 someone-0.010
 I-0.010
Token in
Token ID287
Feature activation-1.89e-02

Pos logit contributions
at+0.003
imm+0.003
iest+0.003
wy+0.003
zzy+0.003
Neg logit contributions
 have-0.001
 be-0.001
 Lily-0.001
 what-0.001
 do-0.001
Token a
Token ID257
Feature activation-3.69e-03

Pos logit contributions
 have+0.016
 be+0.014
 Lily+0.011
 we+0.011
 do+0.010
Neg logit contributions
at-0.054
wy-0.052
imm-0.052
zzy-0.051
iest-0.049
Token farm
Token ID5318
Feature activation-3.39e-02

Pos logit contributions
 have+0.005
 be+0.004
 what+0.004
 Lily+0.004
 we+0.004
Neg logit contributions
at-0.008
zzy-0.008
wy-0.007
us-0.007
eh-0.007
Token.
Token ID13
Feature activation-2.87e-03

Pos logit contributions
 have+0.042
 be+0.038
 Lily+0.034
 what+0.033
 knock+0.033
Neg logit contributions
at-0.083
imm-0.078
zzy-0.073
iest-0.072
side-0.071
Token The
Token ID383
Feature activation+4.14e-04

Pos logit contributions
 be+0.003
 have+0.003
 what+0.003
 we+0.003
 knock+0.003
Neg logit contributions
at-0.007
imm-0.007
zzy-0.006
wy-0.006
iest-0.006
Token barrel
Token ID9036
Feature activation+6.98e-03

Pos logit contributions
at+0.001
imm+0.001
wy+0.001
zzy+0.001
iest+0.001
Neg logit contributions
 have-0.001
 be-0.001
 Lily-0.001
 I-0.001
 what-0.001
Token was
Token ID373
Feature activation-3.56e-03

Pos logit contributions
at+0.017
imm+0.016
iest+0.015
wy+0.015
led+0.014
Neg logit contributions
 have-0.009
 be-0.009
 what-0.008
 Lily-0.008
 someone-0.007
Token full
Token ID1336
Feature activation-2.54e-03

Pos logit contributions
 have+0.004
 be+0.004
 Lily+0.003
 I+0.003
 we+0.003
Neg logit contributions
at-0.009
imm-0.009
wy-0.009
zzy-0.009
iest-0.008
Token and
Token ID290
Feature activation+6.89e-03

Pos logit contributions
 have+0.004
 be+0.004
 Lily+0.003
 what+0.003
 I+0.003
Neg logit contributions
at-0.018
wy-0.017
imm-0.017
zzy-0.017
iest-0.016
Token nodded
Token ID14464
Feature activation-5.54e-03

Pos logit contributions
at+0.017
imm+0.017
wy+0.017
zzy+0.016
iest+0.016
Neg logit contributions
 have-0.009
 be-0.009
 Lily-0.008
 I-0.007
 do-0.007
Token their
Token ID511
Feature activation+8.99e-03

Pos logit contributions
 have+0.005
 be+0.004
 Lily+0.004
 what+0.003
 I+0.003
Neg logit contributions
at-0.017
imm-0.017
wy-0.016
zzy-0.016
iest-0.015
Token heads
Token ID6665
Feature activation-3.03e-03

Pos logit contributions
at+0.020
imm+0.020
wy+0.019
zzy+0.019
iful+0.018
Neg logit contributions
 have-0.014
 Lily-0.012
 be-0.012
 I-0.011
 what-0.011
Token in
Token ID287
Feature activation-6.42e-03

Pos logit contributions
 have+0.002
 be+0.002
 Lily+0.001
 what+0.001
 I+0.001
Neg logit contributions
at-0.010
imm-0.010
wy-0.010
zzy-0.009
iest-0.009
Token agreement
Token ID4381
Feature activation-3.37e-02

Pos logit contributions
 have+0.011
 be+0.010
 Lily+0.010
 I+0.010
 we+0.010
Neg logit contributions
zzy-0.012
at-0.012
imm-0.012
wy-0.012
iest-0.011
Token.
Token ID13
Feature activation-3.35e-03

Pos logit contributions
 have+0.029
 be+0.025
 Lily+0.021
 knock+0.019
 I+0.019
Neg logit contributions
at-0.100
imm-0.098
wy-0.096
zzy-0.095
iest-0.093
Token
Token ID198
Feature activation-1.89e-03

Pos logit contributions
 have+0.003
 be+0.003
 do+0.003
 knock+0.003
 what+0.002
Neg logit contributions
at-0.009
imm-0.009
zzy-0.009
wy-0.009
rol-0.009
Token
Token ID198
Feature activation-5.51e-04

Pos logit contributions
 have+0.002
 be+0.002
 Lily+0.001
 I+0.001
 look+0.001
Neg logit contributions
at-0.005
zzy-0.005
imm-0.005
wy-0.005
us-0.005
TokenMom
Token ID29252
Feature activation+2.77e-03

Pos logit contributions
 have+0.001
 be+0.001
 Lily+0.001
 I+0.001
 what+0.001
Neg logit contributions
at-0.001
imm-0.001
wy-0.001
zzy-0.001
iest-0.001
Token looked
Token ID3114
Feature activation-6.61e-03

Pos logit contributions
at+0.007
imm+0.007
wy+0.007
zzy+0.007
iest+0.006
Neg logit contributions
 have-0.004
 be-0.003
 Lily-0.003
 I-0.003
 do-0.003
Token the
Token ID262
Feature activation-5.53e-03

Pos logit contributions
at+0.011
imm+0.011
wy+0.011
zzy+0.010
iest+0.010
Neg logit contributions
 have-0.005
 be-0.004
 Lily-0.004
 what-0.003
 I-0.003
Token park
Token ID3952
Feature activation-1.23e-02

Pos logit contributions
 have+0.007
 be+0.006
 Lily+0.006
 I+0.005
 what+0.005
Neg logit contributions
at-0.015
imm-0.014
wy-0.014
zzy-0.014
iest-0.013
Token,
Token ID11
Feature activation+3.98e-04

Pos logit contributions
 have+0.013
 be+0.012
 Lily+0.010
 I+0.009
 what+0.009
Neg logit contributions
at-0.035
imm-0.034
wy-0.033
zzy-0.032
iest-0.032
Token even
Token ID772
Feature activation-9.27e-03

Pos logit contributions
at+0.001
imm+0.001
wy+0.001
zzy+0.001
iest+0.001
Neg logit contributions
 have-0.000
 be-0.000
 Lily-0.000
 what-0.000
 I-0.000
Token a
Token ID257
Feature activation-8.50e-03

Pos logit contributions
 have+0.011
 be+0.011
 Lily+0.010
 what+0.009
 do+0.009
Neg logit contributions
at-0.025
imm-0.024
wy-0.024
zzy-0.024
wen-0.023
Token farm
Token ID5318
Feature activation-3.35e-02

Pos logit contributions
 have+0.011
 be+0.010
 Lily+0.009
 I+0.009
 what+0.009
Neg logit contributions
at-0.021
imm-0.020
wy-0.020
zzy-0.020
iest-0.019
Token!
Token ID0
Feature activation-7.54e-03

Pos logit contributions
 have+0.034
 be+0.031
 Lily+0.027
 I+0.024
 what+0.023
Neg logit contributions
at-0.095
imm-0.093
wy-0.090
zzy-0.089
iest-0.088
Token Everywhere
Token ID48201
Feature activation+5.92e-03

Pos logit contributions
 have+0.009
 be+0.008
 Lily+0.007
 I+0.007
 what+0.007
Neg logit contributions
at-0.021
imm-0.020
wy-0.020
zzy-0.019
iest-0.019
Token they
Token ID484
Feature activation+2.31e-03

Pos logit contributions
at+0.019
imm+0.019
wy+0.019
zzy+0.018
iest+0.018
Neg logit contributions
 have-0.004
 be-0.003
 Lily-0.003
 I-0.002
 what-0.002
Token went
Token ID1816
Feature activation-6.22e-04

Pos logit contributions
at+0.007
imm+0.007
wy+0.006
wen+0.006
lor+0.006
Neg logit contributions
 have-0.003
 be-0.002
 Lily-0.002
 look-0.002
 do-0.002
Token M
Token ID337
Feature activation-5.80e-03

Pos logit contributions
 have+0.001
 be+0.001
 Lily+0.000
 knock+0.000
 mom+0.000
Neg logit contributions
at-0.002
imm-0.002
wy-0.002
zzy-0.002
iest-0.002
Token safe
Token ID3338
Feature activation-9.30e-03

Pos logit contributions
 have+0.013
 be+0.012
 Lily+0.011
 I+0.010
 what+0.010
Neg logit contributions
at-0.030
imm-0.029
wy-0.029
zzy-0.028
iest-0.027
Token".
Token ID1911
Feature activation-5.33e-03

Pos logit contributions
 have+0.010
 be+0.009
 Lily+0.008
 I+0.007
 do+0.007
Neg logit contributions
at-0.026
imm-0.025
wy-0.025
zzy-0.024
iest-0.024
Token Everyone
Token ID11075
Feature activation+4.06e-03

Pos logit contributions
 have+0.005
 be+0.005
 do+0.004
 Lily+0.004
 what+0.004
Neg logit contributions
at-0.015
imm-0.015
wy-0.014
zzy-0.014
rol-0.014
Token nodded
Token ID14464
Feature activation-1.11e-02

Pos logit contributions
at+0.010
imm+0.010
wy+0.010
zzy+0.010
iest+0.009
Neg logit contributions
 have-0.005
 be-0.005
 Lily-0.004
 I-0.004
 what-0.004
Token in
Token ID287
Feature activation-4.45e-03

Pos logit contributions
 have+0.009
 be+0.008
 Lily+0.006
 I+0.005
 what+0.005
Neg logit contributions
at-0.035
imm-0.034
wy-0.033
zzy-0.033
iest-0.032
Token agreement
Token ID4381
Feature activation-3.34e-02

Pos logit contributions
 have+0.008
 be+0.007
 Lily+0.007
 I+0.007
 we+0.007
Neg logit contributions
at-0.009
wy-0.008
imm-0.008
zzy-0.008
iest-0.008
Token.
Token ID13
Feature activation+1.34e-03

Pos logit contributions
 have+0.031
 be+0.027
 Lily+0.023
 knock+0.022
 I+0.021
Neg logit contributions
at-0.095
imm-0.093
wy-0.090
zzy-0.090
ier-0.089
Token
Token ID220
Feature activation+1.63e-04

Pos logit contributions
at+0.004
imm+0.003
zzy+0.003
wy+0.003
rol+0.003
Neg logit contributions
 have-0.001
 be-0.001
 do-0.001
 knock-0.001
 look-0.001
Token
Token ID198
Feature activation-1.56e-03

Pos logit contributions
at+0.000
imm+0.000
wy+0.000
zzy+0.000
us+0.000
Neg logit contributions
 have-0.000
 be-0.000
 Lily-0.000
 do-0.000
 knock-0.000
Token
Token ID198
Feature activation-3.53e-04

Pos logit contributions
 have+0.001
 be+0.001
 Lily+0.001
 I+0.001
 do+0.001
Neg logit contributions
at-0.005
imm-0.005
wy-0.004
zzy-0.004
us-0.004
TokenThe
Token ID464
Feature activation-1.67e-03

Pos logit contributions
 have+0.001
 be+0.001
 Lily+0.001
 I+0.001
 what+0.001
Neg logit contributions
at-0.001
imm-0.001
wy-0.001
zzy-0.001
iest-0.001
Token red
Token ID2266
Feature activation-5.14e-04

Pos logit contributions
 have+0.009
 I+0.008
 be+0.008
 knock+0.008
 what+0.008
Neg logit contributions
at-0.012
wy-0.012
vell-0.011
robat-0.011
iest-0.011
Token par
Token ID1582
Feature activation-1.17e-02

Pos logit contributions
 have+0.001
 knock+0.001
 Lily+0.001
 what+0.001
 I+0.001
Neg logit contributions
at-0.001
imm-0.001
robat-0.001
iest-0.001
lor-0.001
Tokenrot
Token ID10599
Feature activation-1.22e-02

Pos logit contributions
 have+0.026
 be+0.025
 Lily+0.024
 I+0.023
 do+0.023
Neg logit contributions
at-0.019
imm-0.018
wy-0.018
zzy-0.017
iest-0.016
Token was
Token ID373
Feature activation-3.33e-03

Pos logit contributions
 have+0.013
 be+0.012
 Lily+0.011
 I+0.010
 do+0.010
Neg logit contributions
at-0.034
imm-0.033
wy-0.033
zzy-0.032
iest-0.031
Token the
Token ID262
Feature activation-1.21e-02

Pos logit contributions
 have+0.004
 be+0.003
 Lily+0.003
 I+0.003
 what+0.003
Neg logit contributions
at-0.009
imm-0.009
wy-0.009
zzy-0.009
iest-0.008
Token happiest
Token ID49414
Feature activation-3.34e-02

Pos logit contributions
 have+0.021
 be+0.020
 Lily+0.019
 I+0.018
 do+0.018
Neg logit contributions
at-0.026
imm-0.025
wy-0.024
zzy-0.024
iest-0.023
Token he
Token ID339
Feature activation+5.23e-05

Pos logit contributions
 have+0.042
 be+0.039
 knock+0.034
 do+0.033
 what+0.032
Neg logit contributions
at-0.083
imm-0.082
iest-0.080
wy-0.079
zzy-0.078
Token had
Token ID550
Feature activation-7.75e-03

Pos logit contributions
at+0.000
imm+0.000
wy+0.000
zzy+0.000
rol+0.000
Neg logit contributions
 have-0.000
 be-0.000
 I-0.000
 Lily-0.000
 we-0.000
Token ever
Token ID1683
Feature activation-3.23e-03

Pos logit contributions
 have+0.014
 be+0.013
 Lily+0.011
 I+0.011
 we+0.011
Neg logit contributions
at-0.017
imm-0.017
wy-0.017
zzy-0.016
rito-0.016
Token been
Token ID587
Feature activation-6.52e-03

Pos logit contributions
 have+0.006
 be+0.005
 Lily+0.005
 I+0.005
 do+0.005
Neg logit contributions
at-0.007
imm-0.007
wy-0.007
zzy-0.006
iest-0.006
Token.
Token ID13
Feature activation-1.64e-04

Pos logit contributions
 have+0.007
 be+0.007
 Lily+0.006
 I+0.005
 what+0.005
Neg logit contributions
at-0.018
imm-0.018
wy-0.017
zzy-0.017
iest-0.016
Token sand
Token ID6450
Feature activation-1.51e-03

Pos logit contributions
at+0.019
wy+0.018
imm+0.017
iest+0.017
ier+0.017
Neg logit contributions
 have-0.007
 be-0.007
 I-0.006
 Lily-0.006
 what-0.006
Token,
Token ID11
Feature activation+5.23e-03

Pos logit contributions
 have+0.001
 be+0.001
 Lily+0.001
 I+0.001
 what+0.001
Neg logit contributions
at-0.005
imm-0.005
wy-0.004
zzy-0.004
iest-0.004
Token and
Token ID290
Feature activation-1.04e-02

Pos logit contributions
at+0.015
wy+0.014
imm+0.014
zzy+0.013
ier+0.013
Neg logit contributions
 have-0.005
 be-0.004
 Lily-0.004
 I-0.004
 what-0.004
Token a
Token ID257
Feature activation-1.72e-02

Pos logit contributions
 have+0.011
 be+0.011
 I+0.011
 One+0.010
 Mom+0.010
Neg logit contributions
iest-0.021
at-0.020
wy-0.020
ier-0.020
rol-0.019
Token big
Token ID1263
Feature activation+2.40e-03

Pos logit contributions
 have+0.023
 be+0.021
 what+0.020
 I+0.020
 Lily+0.020
Neg logit contributions
at-0.037
vell-0.035
zzy-0.035
wy-0.035
iest-0.035
Token tree
Token ID5509
Feature activation-3.33e-02

Pos logit contributions
at+0.005
robat+0.005
iest+0.005
wy+0.004
imm+0.004
Neg logit contributions
 have-0.003
 what-0.003
 someone-0.003
 we-0.003
 I-0.003
Token.
Token ID13
Feature activation-3.25e-04

Pos logit contributions
 have+0.038
 be+0.035
 Lily+0.034
 someone+0.034
 what+0.033
Neg logit contributions
at-0.080
imm-0.075
ier-0.072
zzy-0.071
wy-0.070
Token
Token ID198
Feature activation+3.08e-03

Pos logit contributions
 be+0.000
 have+0.000
 do+0.000
 knock+0.000
 look+0.000
Neg logit contributions
imm-0.001
at-0.001
rol-0.001
zzy-0.001
rito-0.001
Token
Token ID198
Feature activation+4.50e-03

Pos logit contributions
zzy+0.008
at+0.008
imm+0.008
us+0.008
tto+0.007
Neg logit contributions
 be-0.003
 have-0.003
 I-0.003
 look-0.003
 knock-0.003
TokenTim
Token ID14967
Feature activation-6.40e-04

Pos logit contributions
at+0.010
zzy+0.010
imm+0.010
wy+0.010
dd+0.009
Neg logit contributions
 have-0.007
 be-0.006
 look-0.006
 Lily-0.006
 I-0.006
Token and
Token ID290
Feature activation-1.00e-02

Pos logit contributions
 have+0.001
 be+0.000
 Lily+0.000
 do+0.000
 I+0.000
Neg logit contributions
at-0.002
imm-0.002
wy-0.002
zzy-0.002
iest-0.002
Token swings
Token ID26728
Feature activation-1.61e-02

Pos logit contributions
at+0.013
wy+0.012
iest+0.012
imm+0.011
zzy+0.011
Neg logit contributions
 have-0.007
 be-0.006
 Lily-0.006
 knock-0.006
 I-0.006
Token,
Token ID11
Feature activation+6.14e-03

Pos logit contributions
 have+0.015
 be+0.013
 I+0.011
 Lily+0.011
 knock+0.011
Neg logit contributions
at-0.046
imm-0.044
wy-0.043
zzy-0.042
iest-0.041
Token slides
Token ID19392
Feature activation-5.66e-03

Pos logit contributions
at+0.014
wy+0.014
iest+0.013
imm+0.013
ier+0.013
Neg logit contributions
 have-0.008
 be-0.008
 Lily-0.007
 Mom-0.007
 mom-0.007
Token,
Token ID11
Feature activation+3.90e-03

Pos logit contributions
 have+0.005
 be+0.004
 Lily+0.004
 I+0.004
 do+0.003
Neg logit contributions
at-0.017
imm-0.016
wy-0.016
zzy-0.015
iest-0.015
Token and
Token ID290
Feature activation-9.76e-03

Pos logit contributions
at+0.011
wy+0.010
imm+0.010
ier+0.010
iest+0.009
Neg logit contributions
 have-0.004
 be-0.004
 Lily-0.003
 I-0.003
 Mom-0.003
Token pige
Token ID32372
Feature activation-3.32e-02

Pos logit contributions
 mom+0.011
 Mom+0.011
 have+0.011
 be+0.010
 I+0.010
Neg logit contributions
iest-0.019
at-0.019
ier-0.018
rob-0.018
wy-0.018
Tokenons
Token ID684
Feature activation-7.49e-03

Pos logit contributions
 have+0.074
 be+0.071
 Lily+0.067
 I+0.065
 do+0.064
Neg logit contributions
at-0.056
imm-0.053
wy-0.052
zzy-0.049
iest-0.046
Token.
Token ID13
Feature activation-4.55e-03

Pos logit contributions
 have+0.005
 be+0.005
 Lily+0.004
 I+0.003
 do+0.003
Neg logit contributions
at-0.024
imm-0.023
wy-0.023
zzy-0.023
iest-0.022
Token P
Token ID350
Feature activation-9.19e-03

Pos logit contributions
 be+0.005
 have+0.005
 do+0.004
 arrow+0.004
 what+0.004
Neg logit contributions
zzy-0.010
side-0.010
at-0.010
wy-0.009
rol-0.009
Tokenige
Token ID10045
Feature activation-1.84e-02

Pos logit contributions
 have+0.020
 be+0.020
 do+0.019
 Lily+0.018
 what+0.018
Neg logit contributions
imm-0.014
at-0.013
zzy-0.013
wy-0.013
rito-0.013
Tokenons
Token ID684
Feature activation-7.77e-04

Pos logit contributions
 have+0.029
 be+0.028
 Lily+0.027
 Mom+0.026
 I+0.026
Neg logit contributions
at-0.042
wy-0.039
imm-0.038
iest-0.036
zzy-0.036
Token.
Token ID13
Feature activation+2.17e-03

Pos logit contributions
 have+0.015
 be+0.013
 Lily+0.011
 knock+0.011
 do+0.010
Neg logit contributions
imm-0.039
at-0.038
ier-0.037
iest-0.037
zzy-0.037
Token The
Token ID383
Feature activation-8.99e-04

Pos logit contributions
at+0.006
imm+0.006
wy+0.005
zzy+0.005
us+0.005
Neg logit contributions
 have-0.002
 be-0.002
 do-0.002
 knock-0.002
 Lily-0.002
Token triangle
Token ID22950
Feature activation-3.19e-03

Pos logit contributions
 have+0.001
 be+0.001
 Lily+0.001
 I+0.001
 what+0.001
Neg logit contributions
at-0.002
wy-0.002
iest-0.002
vell-0.002
ier-0.002
Token was
Token ID373
Feature activation+2.20e-03

Pos logit contributions
 have+0.004
 be+0.003
 Lily+0.003
 what+0.003
 I+0.003
Neg logit contributions
at-0.009
imm-0.008
wy-0.008
zzy-0.008
iest-0.008
Token the
Token ID262
Feature activation-1.49e-02

Pos logit contributions
at+0.006
wy+0.006
imm+0.006
zzy+0.006
robat+0.006
Neg logit contributions
 have-0.002
 be-0.002
 Lily-0.002
 what-0.002
 I-0.002
Token happiest
Token ID49414
Feature activation-3.31e-02

Pos logit contributions
 have+0.027
 be+0.025
 Lily+0.024
 I+0.022
 what+0.022
Neg logit contributions
at-0.031
imm-0.030
wy-0.030
zzy-0.028
iest-0.027
Token of
Token ID286
Feature activation-1.09e-02

Pos logit contributions
 have+0.042
 be+0.039
 Lily+0.034
 what+0.032
 do+0.032
Neg logit contributions
at-0.084
imm-0.083
wy-0.080
zzy-0.079
iest-0.079
Token all
Token ID477
Feature activation-4.35e-03

Pos logit contributions
 have+0.012
 be+0.011
 knock+0.010
 Lily+0.009
 do+0.009
Neg logit contributions
at-0.028
imm-0.028
wy-0.027
iest-0.026
ier-0.026
Token.
Token ID13
Feature activation+1.02e-03

Pos logit contributions
 have+0.004
 be+0.003
 Lily+0.003
 I+0.003
 do+0.003
Neg logit contributions
at-0.013
imm-0.013
wy-0.013
zzy-0.013
rito-0.012
Token<|endoftext|>
Token ID50256
Feature activation+3.41e-03

Pos logit contributions
at+0.003
imm+0.003
wy+0.003
zzy+0.003
iest+0.003
Neg logit contributions
 have-0.001
 be-0.001
 Lily-0.001
 do-0.001
 knock-0.001
TokenOnce
Token ID7454
Feature activation-5.80e-03

Pos logit contributions
at+0.009
imm+0.009
wy+0.009
zzy+0.009
iest+0.008
Neg logit contributions
 have-0.004
 be-0.004
 Lily-0.003
 I-0.003
 do-0.003
Tokenmy
Token ID1820
Feature activation-5.71e-03

Pos logit contributions
 have+0.000
 be+0.000
 Lily+0.000
 do+0.000
 I+0.000
Neg logit contributions
at-0.000
imm-0.000
wy-0.000
zzy-0.000
iest-0.000
Token had
Token ID550
Feature activation+2.85e-03

Pos logit contributions
 have+0.007
 be+0.007
 Lily+0.007
 I+0.006
 what+0.006
Neg logit contributions
at-0.014
imm-0.014
zzy-0.014
wy-0.014
ier-0.013
Token no
Token ID645
Feature activation+1.67e-03

Pos logit contributions
at+0.007
imm+0.007
wy+0.007
zzy+0.006
iest+0.006
Neg logit contributions
 have-0.004
 be-0.004
 Lily-0.004
 I-0.003
 do-0.003
Token time
Token ID640
Feature activation-1.49e-03

Pos logit contributions
at+0.003
wy+0.003
imm+0.003
iest+0.003
zzy+0.003
Neg logit contributions
 have-0.003
 Lily-0.003
 I-0.003
 we-0.002
 be-0.002
Token to
Token ID284
Feature activation+2.92e-03

Pos logit contributions
 have+0.002
 be+0.002
 Lily+0.002
 knock+0.001
 I+0.001
Neg logit contributions
at-0.004
imm-0.004
wy-0.004
zzy-0.003
ier-0.003
Token dwell
Token ID14046
Feature activation-3.30e-02

Pos logit contributions
at+0.008
zzy+0.008
imm+0.008
wy+0.007
iest+0.007
Neg logit contributions
 have-0.003
 Lily-0.003
 be-0.002
 I-0.002
 we-0.002
Token<|endoftext|>
Token ID50256
Feature activation+2.72e-03

Pos logit contributions
 be+0.042
 have+0.042
 Lily+0.040
 someone+0.035
 Tom+0.035
Neg logit contributions
at-0.085
wy-0.085
imm-0.083
vell-0.082
rito-0.079
TokenOnce
Token ID7454
Feature activation-7.66e-03

Pos logit contributions
at+0.006
imm+0.006
wy+0.006
zzy+0.006
iest+0.005
Neg logit contributions
 have-0.005
 be-0.004
 Lily-0.004
 I-0.004
 do-0.004
Token there
Token ID612
Feature activation-3.96e-03

Pos logit contributions
 have+0.015
 be+0.014
 Lily+0.013
 knock+0.013
 I+0.012
Neg logit contributions
at-0.014
imm-0.014
wy-0.013
zzy-0.013
iest-0.013
Token was
Token ID373
Feature activation+6.19e-03

Pos logit contributions
 have+0.004
 be+0.004
 Lily+0.003
 I+0.003
 do+0.003
Neg logit contributions
at-0.011
imm-0.011
wy-0.011
zzy-0.011
iest-0.010
Token a
Token ID257
Feature activation-4.02e-03

Pos logit contributions
at+0.020
imm+0.020
wy+0.020
zzy+0.019
iest+0.019
Neg logit contributions
 have-0.004
 be-0.003
 Lily-0.002
 I-0.002
 do-0.002
Token she
Token ID673
Feature activation-1.47e-03

Pos logit contributions
 have+0.016
 be+0.015
 Lily+0.012
 knock+0.011
 do+0.011
Neg logit contributions
at-0.044
imm-0.043
wy-0.042
zzy-0.042
ier-0.041
Token played
Token ID2826
Feature activation-5.26e-04

Pos logit contributions
 have+0.002
 be+0.002
 Lily+0.002
 do+0.002
 I+0.001
Neg logit contributions
at-0.004
imm-0.004
wy-0.004
zzy-0.004
iest-0.003
Token.
Token ID13
Feature activation-1.63e-03

Pos logit contributions
 have+0.000
 be+0.000
 Lily+0.000
 I+0.000
 what+0.000
Neg logit contributions
at-0.002
imm-0.002
wy-0.001
zzy-0.001
iest-0.001
Token She
Token ID1375
Feature activation-4.13e-04

Pos logit contributions
 have+0.002
 be+0.002
 do+0.002
 knock+0.001
 what+0.001
Neg logit contributions
at-0.004
imm-0.004
zzy-0.004
wy-0.004
rol-0.004
Token was
Token ID373
Feature activation+1.44e-03

Pos logit contributions
 have+0.000
 be+0.000
 Lily+0.000
 I+0.000
 do+0.000
Neg logit contributions
at-0.001
imm-0.001
wy-0.001
zzy-0.001
iest-0.001
Token happiest
Token ID49414
Feature activation-3.27e-02

Pos logit contributions
at+0.004
imm+0.004
wy+0.004
zzy+0.004
iest+0.004
Neg logit contributions
 have-0.002
 be-0.002
 Lily-0.001
 what-0.001
 I-0.001
Token when
Token ID618
Feature activation-1.53e-03

Pos logit contributions
 have+0.032
 be+0.029
 Lily+0.025
 I+0.022
 what+0.022
Neg logit contributions
at-0.097
imm-0.094
wy-0.094
zzy-0.091
iest-0.087
Token she
Token ID673
Feature activation-1.04e-02

Pos logit contributions
 have+0.002
 be+0.002
 knock+0.001
 Lily+0.001
 do+0.001
Neg logit contributions
at-0.004
iest-0.004
ier-0.004
imm-0.004
zzy-0.004
Token heard
Token ID2982
Feature activation-1.08e-02

Pos logit contributions
 have+0.013
 be+0.012
 Lily+0.010
 do+0.009
 I+0.009
Neg logit contributions
at-0.028
imm-0.028
wy-0.027
zzy-0.026
vell-0.026
Token her
Token ID607
Feature activation-6.54e-03

Pos logit contributions
 have+0.010
 be+0.009
 knock+0.007
 we+0.007
 Lily+0.007
Neg logit contributions
at-0.029
imm-0.029
iest-0.028
ier-0.028
zzy-0.027
Token music
Token ID2647
Feature activation-7.35e-03

Pos logit contributions
 have+0.009
 be+0.008
 Lily+0.008
 I+0.008
 what+0.007
Neg logit contributions
imm-0.015
at-0.015
zzy-0.014
iest-0.014
wy-0.014
Token just
Token ID655
Feature activation-6.47e-03

Pos logit contributions
 have+0.010
 be+0.009
 Lily+0.009
 knock+0.008
 what+0.008
Neg logit contributions
at-0.023
imm-0.022
zzy-0.021
wy-0.021
iest-0.021
Token yet
Token ID1865
Feature activation-1.69e-02

Pos logit contributions
 Lily+0.005
 have+0.005
 mom+0.004
 be+0.004
 Mom+0.004
Neg logit contributions
at-0.018
imm-0.017
wy-0.017
zzy-0.017
iest-0.017
Tokenâ
Token ID22940
Feature activation-1.40e-02

Pos logit contributions
 have+0.020
 be+0.018
 Lily+0.017
 knock+0.016
 mom+0.015
Neg logit contributions
at-0.042
imm-0.041
zzy-0.040
wy-0.040
iest-0.040
Token
Token ID26391
Feature activation-3.60e-05

Pos logit contributions
 have+0.036
 be+0.034
 Lily+0.032
 do+0.031
 I+0.031
Neg logit contributions
at-0.019
imm-0.018
wy-0.018
zzy-0.017
vell-0.016
Token
Token ID126
Feature activation-1.56e-02

Pos logit contributions
 have+0.000
 be+0.000
 Lily+0.000
 what+0.000
 do+0.000
Neg logit contributions
at-0.000
imm-0.000
wy-0.000
zzy-0.000
iest-0.000
Token
Token ID99
Feature activation-3.25e-02

Pos logit contributions
 have+0.029
 be+0.028
 Lily+0.026
 I+0.025
 what+0.025
Neg logit contributions
at-0.031
imm-0.030
wy-0.030
zzy-0.029
iest-0.027
Tokenbut
Token ID4360
Feature activation-1.91e-03

Pos logit contributions
 have+0.070
 be+0.067
 Lily+0.062
 what+0.060
 I+0.060
Neg logit contributions
at-0.057
imm-0.055
wy-0.054
zzy-0.052
iest-0.048
Token trust
Token ID3774
Feature activation-2.08e-03

Pos logit contributions
 have+0.001
 be+0.001
 Lily+0.001
 knock+0.001
 cars+0.001
Neg logit contributions
at-0.005
imm-0.005
zzy-0.005
wy-0.005
iest-0.005
Token me
Token ID502
Feature activation-9.05e-03

Pos logit contributions
 have+0.002
 be+0.002
 Lily+0.002
 knock+0.002
 I+0.002
Neg logit contributions
at-0.006
wy-0.005
imm-0.005
zzy-0.005
iest-0.005
Token,
Token ID11
Feature activation-4.15e-03

Pos logit contributions
 have+0.011
 be+0.010
 Lily+0.009
 I+0.008
 do+0.008
Neg logit contributions
at-0.024
imm-0.023
wy-0.023
zzy-0.023
iest-0.022
Token it
Token ID340
Feature activation-4.69e-03

Pos logit contributions
 have+0.004
 be+0.004
 Lily+0.003
 knock+0.003
 what+0.003
Neg logit contributions
at-0.011
wy-0.010
imm-0.010
zzy-0.010
iest-0.010
Token and
Token ID290
Feature activation-1.29e-02

Pos logit contributions
at+0.024
imm+0.023
wy+0.023
zzy+0.022
iest+0.022
Neg logit contributions
 have-0.007
 be-0.006
 Lily-0.005
 I-0.004
 do-0.004
Token started
Token ID2067
Feature activation+7.17e-03

Pos logit contributions
 have+0.017
 be+0.017
 do+0.014
 Lily+0.014
 I+0.013
Neg logit contributions
at-0.031
wy-0.030
imm-0.030
zzy-0.029
iest-0.028
Token eating
Token ID6600
Feature activation-1.40e-02

Pos logit contributions
at+0.018
imm+0.017
wy+0.017
zzy+0.017
rol+0.016
Neg logit contributions
 have-0.009
 be-0.009
 Lily-0.008
 I-0.007
 do-0.007
Token the
Token ID262
Feature activation-2.13e-04

Pos logit contributions
 have+0.013
 be+0.012
 I+0.009
 knock+0.009
 Lily+0.009
Neg logit contributions
at-0.040
wy-0.040
imm-0.039
zzy-0.038
iest-0.037
Token ch
Token ID442
Feature activation-1.81e-03

Pos logit contributions
 have+0.000
 be+0.000
 I+0.000
 Lily+0.000
 we+0.000
Neg logit contributions
at-0.000
wy-0.000
iest-0.000
imm-0.000
vell-0.000
Tokenocol
Token ID4668
Feature activation+4.88e-02

Pos logit contributions
 have+0.005
 be+0.005
 Lily+0.005
 I+0.005
 do+0.005
Neg logit contributions
at-0.002
imm-0.002
wy-0.002
iest-0.001
zzy-0.001
Tokenates
Token ID689
Feature activation+4.21e-04

Pos logit contributions
zzy+0.058
ited+0.057
 energ+0.055
vell+0.053
rol+0.052
Neg logit contributions
 be-0.107
 have-0.105
 what-0.095
 half-0.091
 mom-0.090
Token.
Token ID13
Feature activation-3.10e-03

Pos logit contributions
at+0.001
imm+0.001
wy+0.001
zzy+0.001
iest+0.001
Neg logit contributions
 have-0.000
 be-0.000
 Lily-0.000
 I-0.000
 do-0.000
Token The
Token ID383
Feature activation-1.04e-03

Pos logit contributions
 have+0.004
 be+0.004
 Lily+0.003
 do+0.003
 what+0.003
Neg logit contributions
at-0.008
imm-0.008
wy-0.007
zzy-0.007
iest-0.007
Token shop
Token ID6128
Feature activation-4.08e-03

Pos logit contributions
 have+0.002
 be+0.002
 Lily+0.001
 I+0.001
 we+0.001
Neg logit contributions
at-0.002
wy-0.002
imm-0.002
iest-0.002
zzy-0.002
Tokenkeeper
Token ID13884
Feature activation-1.34e-03

Pos logit contributions
 have+0.007
 be+0.006
 Lily+0.006
 what+0.006
 I+0.006
Neg logit contributions
at-0.008
imm-0.008
zzy-0.008
wy-0.008
iest-0.008
Token his
Token ID465
Feature activation+8.73e-03

Pos logit contributions
 have+0.016
 be+0.015
 Lily+0.014
 what+0.013
 I+0.013
Neg logit contributions
at-0.028
imm-0.027
wy-0.027
zzy-0.026
vell-0.025
Token feet
Token ID3625
Feature activation+6.01e-03

Pos logit contributions
at+0.019
wy+0.019
imm+0.018
zzy+0.018
rito+0.017
Neg logit contributions
 have-0.013
 Lily-0.012
 what-0.011
 be-0.011
 I-0.011
Token and
Token ID290
Feature activation-1.28e-03

Pos logit contributions
at+0.019
imm+0.019
wy+0.018
zzy+0.018
iest+0.017
Neg logit contributions
 have-0.004
 be-0.004
 Lily-0.003
 I-0.003
 do-0.003
Token threw
Token ID9617
Feature activation-1.30e-02

Pos logit contributions
 have+0.002
 be+0.002
 Lily+0.002
 I+0.002
 do+0.002
Neg logit contributions
at-0.003
imm-0.003
wy-0.003
zzy-0.003
iest-0.003
Token a
Token ID257
Feature activation+3.98e-03

Pos logit contributions
 have+0.017
 be+0.016
 I+0.014
 do+0.013
 we+0.013
Neg logit contributions
at-0.032
wy-0.032
imm-0.031
zzy-0.030
iest-0.029
Token tant
Token ID24246
Feature activation+7.28e-02

Pos logit contributions
at+0.010
imm+0.010
wy+0.009
zzy+0.009
iest+0.009
Neg logit contributions
 have-0.006
 be-0.005
 Lily-0.005
 I-0.005
 do-0.005
Tokenrum
Token ID6582
Feature activation-6.62e-03

Pos logit contributions
at+0.106
zzy+0.105
imm+0.105
rol+0.101
vell+0.101
Neg logit contributions
 be-0.149
 have-0.148
 what-0.137
 Lily-0.131
 it-0.130
Token.
Token ID13
Feature activation+2.67e-03

Pos logit contributions
 have+0.006
 be+0.005
 Lily+0.004
 knock+0.004
 do+0.004
Neg logit contributions
at-0.020
imm-0.019
wy-0.019
zzy-0.018
ier-0.018
Token
Token ID198
Feature activation-1.70e-03

Pos logit contributions
at+0.007
imm+0.007
zzy+0.007
wy+0.007
us+0.007
Neg logit contributions
 have-0.003
 be-0.002
 do-0.002
 knock-0.002
 what-0.002
Token
Token ID198
Feature activation-1.23e-04

Pos logit contributions
 have+0.002
 be+0.002
 do+0.001
 look+0.001
 knock+0.001
Neg logit contributions
at-0.005
zzy-0.005
imm-0.005
wy-0.005
us-0.004
Token"
Token ID1
Feature activation+4.44e-03

Pos logit contributions
 have+0.000
 be+0.000
 Lily+0.000
 do+0.000
 what+0.000
Neg logit contributions
at-0.000
wy-0.000
zzy-0.000
imm-0.000
us-0.000
Token She
Token ID1375
Feature activation-6.72e-04

Pos logit contributions
 have+0.003
 be+0.003
 knock+0.003
 do+0.003
 what+0.003
Neg logit contributions
at-0.006
imm-0.006
zzy-0.006
wy-0.006
amb-0.006
Token cried
Token ID16896
Feature activation-3.45e-03

Pos logit contributions
 have+0.001
 be+0.001
 Lily+0.001
 I+0.001
 what+0.001
Neg logit contributions
at-0.002
imm-0.002
wy-0.002
zzy-0.002
ier-0.001
Token and
Token ID290
Feature activation+1.19e-02

Pos logit contributions
 have+0.003
 be+0.003
 Lily+0.002
 I+0.002
 do+0.002
Neg logit contributions
at-0.010
imm-0.010
wy-0.010
zzy-0.010
ier-0.009
Token threw
Token ID9617
Feature activation-8.07e-03

Pos logit contributions
at+0.030
imm+0.029
wy+0.029
zzy+0.028
ier+0.027
Neg logit contributions
 have-0.015
 be-0.014
 Lily-0.012
 I-0.012
 what-0.012
Token a
Token ID257
Feature activation+3.50e-03

Pos logit contributions
 have+0.011
 be+0.011
 I+0.010
 we+0.010
 do+0.009
Neg logit contributions
at-0.018
wy-0.018
imm-0.018
zzy-0.017
rito-0.016
Token tant
Token ID24246
Feature activation+7.22e-02

Pos logit contributions
at+0.006
imm+0.006
wy+0.006
zzy+0.006
iest+0.005
Neg logit contributions
 have-0.007
 be-0.007
 Lily-0.006
 I-0.006
 what-0.006
Tokenrum
Token ID6582
Feature activation-1.46e-03

Pos logit contributions
at+0.096
imm+0.092
wy+0.091
zzy+0.091
rol+0.083
Neg logit contributions
 have-0.172
 be-0.167
 Lily-0.157
 what-0.156
 knock-0.151
Token.
Token ID13
Feature activation-6.54e-04

Pos logit contributions
 have+0.002
 be+0.001
 knock+0.001
 Lily+0.001
 do+0.001
Neg logit contributions
at-0.004
imm-0.004
ier-0.004
wy-0.004
zzy-0.004
Token Her
Token ID2332
Feature activation-5.54e-03

Pos logit contributions
 have+0.001
 be+0.001
 do+0.001
 knock+0.001
 what+0.001
Neg logit contributions
at-0.002
imm-0.002
wy-0.001
zzy-0.001
rol-0.001
Token mom
Token ID1995
Feature activation-1.87e-03

Pos logit contributions
 have+0.004
 be+0.003
 I+0.003
 what+0.003
 we+0.003
Neg logit contributions
at-0.016
wy-0.016
imm-0.016
zzy-0.015
iest-0.015
Token tried
Token ID3088
Feature activation+1.03e-02

Pos logit contributions
 have+0.003
 be+0.002
 Lily+0.002
 I+0.002
 what+0.002
Neg logit contributions
at-0.005
imm-0.005
wy-0.004
zzy-0.004
ier-0.004
Token.
Token ID13
Feature activation-1.05e-03

Pos logit contributions
 have+0.014
 be+0.012
 Lily+0.010
 I+0.010
 knock+0.009
Neg logit contributions
at-0.037
imm-0.035
zzy-0.035
wy-0.035
ier-0.035
Token He
Token ID679
Feature activation-2.82e-03

Pos logit contributions
 have+0.002
 be+0.002
 we+0.001
 do+0.001
 what+0.001
Neg logit contributions
at-0.002
imm-0.002
zzy-0.002
wy-0.002
ier-0.002
Token kept
Token ID4030
Feature activation+1.63e-03

Pos logit contributions
 have+0.003
 be+0.003
 Lily+0.003
 do+0.002
 I+0.002
Neg logit contributions
at-0.008
imm-0.008
wy-0.008
iest-0.007
rol-0.007
Token throwing
Token ID9644
Feature activation-8.02e-03

Pos logit contributions
at+0.004
wy+0.004
imm+0.003
zzy+0.003
us+0.003
Neg logit contributions
 have-0.002
 be-0.002
 I-0.002
 Lily-0.002
 we-0.002
Token a
Token ID257
Feature activation+3.26e-03

Pos logit contributions
 have+0.009
 be+0.009
 I+0.008
 we+0.008
 knock+0.008
Neg logit contributions
wy-0.020
at-0.020
imm-0.019
ier-0.018
zzy-0.018
Token tant
Token ID24246
Feature activation+7.28e-02

Pos logit contributions
at+0.008
wy+0.008
imm+0.007
iest+0.007
zzy+0.007
Neg logit contributions
 have-0.005
 be-0.004
 Lily-0.004
 I-0.004
 what-0.004
Tokenrum
Token ID6582
Feature activation-5.37e-03

Pos logit contributions
at+0.095
imm+0.093
zzy+0.091
wy+0.090
rol+0.085
Neg logit contributions
 have-0.175
 be-0.171
 what-0.157
 Lily-0.156
 knock-0.153
Token.
Token ID13
Feature activation-2.22e-06

Pos logit contributions
 have+0.006
 be+0.005
 knock+0.004
 Lily+0.004
 do+0.004
Neg logit contributions
at-0.015
ier-0.014
imm-0.014
wy-0.014
iest-0.014
Token His
Token ID2399
Feature activation+3.50e-04

Feature not active on prev token;
no predictions were affected.
Token mom
Token ID1995
Feature activation-6.60e-03

Pos logit contributions
at+0.001
wy+0.001
zzy+0.001
imm+0.001
iest+0.001
Neg logit contributions
 have-0.000
 be-0.000
 Lily-0.000
 I-0.000
 what-0.000
Token was
Token ID373
Feature activation-9.32e-03

Pos logit contributions
 have+0.009
 be+0.008
 Lily+0.007
 I+0.007
 what+0.006
Neg logit contributions
at-0.017
imm-0.017
wy-0.016
zzy-0.016
iest-0.015
Token That
Token ID1320
Feature activation-3.18e-03

Pos logit contributions
 have+0.001
 be+0.001
 do+0.001
 knock+0.001
 Lily+0.001
Neg logit contributions
at-0.002
imm-0.002
zzy-0.002
us-0.002
wy-0.002
Token was
Token ID373
Feature activation+4.66e-03

Pos logit contributions
 have+0.004
 be+0.004
 Lily+0.004
 I+0.003
 what+0.003
Neg logit contributions
at-0.008
imm-0.008
wy-0.007
zzy-0.007
iest-0.007
Token the
Token ID262
Feature activation-1.10e-02

Pos logit contributions
at+0.014
imm+0.013
wy+0.013
zzy+0.013
iest+0.012
Neg logit contributions
 have-0.004
 be-0.004
 Lily-0.003
 I-0.003
 do-0.003
Token setup
Token ID9058
Feature activation+2.24e-03

Pos logit contributions
 have+0.017
 Lily+0.015
 be+0.015
 I+0.014
 what+0.014
Neg logit contributions
at-0.025
wy-0.024
zzy-0.023
imm-0.023
iest-0.023
Token that
Token ID326
Feature activation-2.35e-03

Pos logit contributions
at+0.007
imm+0.007
wy+0.007
zzy+0.007
vell+0.006
Neg logit contributions
 have-0.002
 be-0.002
 Lily-0.001
 do-0.001
 what-0.001
Token fore
Token ID1674
Feature activation+5.08e-02

Pos logit contributions
 have+0.003
 be+0.003
 Lily+0.003
 what+0.002
 knock+0.002
Neg logit contributions
at-0.006
imm-0.006
wy-0.006
iest-0.005
zzy-0.005
Tokenshadow
Token ID19106
Feature activation-5.05e-03

Pos logit contributions
at+0.106
imm+0.102
wy+0.101
zzy+0.098
iest+0.094
Neg logit contributions
 have-0.090
 be-0.088
 Lily-0.077
 what-0.076
 I-0.075
Tokened
Token ID276
Feature activation+1.07e-02

Pos logit contributions
 have+0.013
 be+0.013
 Lily+0.012
 I+0.012
 what+0.012
Neg logit contributions
at-0.006
wy-0.006
imm-0.006
zzy-0.006
vell-0.005
Token the
Token ID262
Feature activation-2.88e-03

Pos logit contributions
at+0.029
imm+0.028
wy+0.028
zzy+0.027
iest+0.026
Neg logit contributions
 have-0.013
 be-0.012
 Lily-0.011
 what-0.010
 I-0.010
Token bad
Token ID2089
Feature activation+1.28e-02

Pos logit contributions
 have+0.004
 Lily+0.004
 be+0.004
 I+0.004
 what+0.004
Neg logit contributions
at-0.006
wy-0.006
zzy-0.006
iest-0.006
imm-0.005
Token ending
Token ID7464
Feature activation-1.50e-03

Pos logit contributions
at+0.023
iest+0.022
imm+0.022
wy+0.022
zzy+0.022
Neg logit contributions
 have-0.023
 Lily-0.021
 what-0.020
 be-0.020
 I-0.020
Token things
Token ID1243
Feature activation+3.80e-03

Pos logit contributions
at+0.032
imm+0.031
wy+0.030
zzy+0.029
iest+0.029
Neg logit contributions
 have-0.013
 be-0.012
 Lily-0.011
 I-0.010
 what-0.010
Token.
Token ID13
Feature activation-1.08e-03

Pos logit contributions
at+0.011
imm+0.011
wy+0.011
zzy+0.011
iest+0.010
Neg logit contributions
 have-0.004
 be-0.003
 Lily-0.003
 I-0.002
 what-0.002
Token He
Token ID679
Feature activation+4.05e-03

Pos logit contributions
 have+0.001
 be+0.001
 do+0.001
 knock+0.001
 Lily+0.001
Neg logit contributions
at-0.003
imm-0.003
wy-0.002
zzy-0.002
rol-0.002
Token felt
Token ID2936
Feature activation+4.15e-03

Pos logit contributions
at+0.011
imm+0.010
wy+0.010
zzy+0.010
iest+0.010
Neg logit contributions
 have-0.005
 be-0.005
 Lily-0.004
 I-0.004
 do-0.004
Token happy
Token ID3772
Feature activation-2.36e-03

Pos logit contributions
at+0.012
imm+0.011
wy+0.011
zzy+0.011
wen+0.010
Neg logit contributions
 have-0.005
 be-0.004
 Lily-0.004
 what-0.004
 do-0.004
Token and
Token ID290
Feature activation+2.09e-02

Pos logit contributions
 have+0.002
 be+0.002
 Lily+0.002
 knock+0.002
 do+0.002
Neg logit contributions
at-0.007
imm-0.007
wy-0.006
zzy-0.006
ier-0.006
Token proud
Token ID6613
Feature activation-4.89e-03

Pos logit contributions
at+0.053
imm+0.051
wy+0.050
zzy+0.049
rol+0.047
Neg logit contributions
 have-0.032
 be-0.030
 Lily-0.027
 I-0.025
 do-0.025
Token of
Token ID286
Feature activation+2.70e-03

Pos logit contributions
 have+0.006
 be+0.005
 Lily+0.004
 do+0.004
 I+0.004
Neg logit contributions
at-0.013
imm-0.013
wy-0.013
zzy-0.013
iest-0.012
Token himself
Token ID2241
Feature activation-7.66e-03

Pos logit contributions
at+0.004
wy+0.004
ier+0.004
imm+0.004
iest+0.004
Neg logit contributions
 have-0.005
 be-0.005
 knock-0.005
 I-0.005
 we-0.005
Token.
Token ID13
Feature activation+2.02e-04

Pos logit contributions
 have+0.007
 be+0.007
 Lily+0.006
 I+0.005
 knock+0.005
Neg logit contributions
at-0.022
imm-0.021
wy-0.021
ier-0.020
zzy-0.020
Token From
Token ID3574
Feature activation+1.20e-02

Pos logit contributions
at+0.000
imm+0.000
wy+0.000
zzy+0.000
rol+0.000
Neg logit contributions
 have-0.000
 be-0.000
 do-0.000
 knock-0.000
 look-0.000
Token
Token ID198
Feature activation+5.47e-03

Pos logit contributions
at+0.013
zzy+0.012
imm+0.012
wy+0.012
us+0.012
Neg logit contributions
 have-0.004
 be-0.004
 Lily-0.004
 I-0.004
 we-0.003
TokenIt
Token ID1026
Feature activation+1.43e-02

Pos logit contributions
at+0.012
imm+0.012
wy+0.012
zzy+0.011
iest+0.011
Neg logit contributions
 have-0.009
 be-0.009
 Lily-0.008
 I-0.007
 do-0.007
Token took
Token ID1718
Feature activation-7.57e-04

Pos logit contributions
at+0.042
wy+0.041
imm+0.040
zzy+0.039
iest+0.038
Neg logit contributions
 have-0.015
 be-0.014
 Lily-0.011
 do-0.010
 what-0.010
Token a
Token ID257
Feature activation+8.73e-03

Pos logit contributions
 have+0.001
 be+0.001
 Lily+0.001
 what+0.001
 do+0.001
Neg logit contributions
at-0.002
imm-0.002
wy-0.002
zzy-0.002
iest-0.002
Token lot
Token ID1256
Feature activation+1.43e-02

Pos logit contributions
at+0.022
zzy+0.021
imm+0.021
wy+0.020
iest+0.020
Neg logit contributions
 have-0.011
 be-0.010
 Lily-0.009
 I-0.009
 what-0.008
Token of
Token ID286
Feature activation+2.49e-02

Pos logit contributions
imm+0.045
at+0.045
wy+0.045
ridge+0.043
atmeal+0.043
Neg logit contributions
 have-0.011
 be-0.009
 Lily-0.009
 what-0.008
 that-0.007
Token searching
Token ID10342
Feature activation+1.44e-02

Pos logit contributions
at+0.064
imm+0.062
wy+0.062
zzy+0.060
iest+0.059
Neg logit contributions
 have-0.031
 be-0.029
 Lily-0.026
 I-0.026
 we-0.024
Token,
Token ID11
Feature activation+1.10e-02

Pos logit contributions
at+0.045
wy+0.043
imm+0.043
zzy+0.042
wen+0.040
Neg logit contributions
 have-0.012
 be-0.011
 Lily-0.010
 what-0.009
 do-0.008
Token but
Token ID475
Feature activation+1.66e-03

Pos logit contributions
at+0.030
imm+0.029
wy+0.028
zzy+0.028
iest+0.027
Neg logit contributions
 have-0.012
 be-0.011
 Lily-0.009
 I-0.009
 do-0.008
Token they
Token ID484
Feature activation+7.44e-04

Pos logit contributions
at+0.005
imm+0.005
wy+0.005
zzy+0.005
iest+0.005
Neg logit contributions
 have-0.001
 be-0.001
 Lily-0.001
 I-0.001
 do-0.001
Token eventually
Token ID4191
Feature activation-9.08e-03

Pos logit contributions
at+0.002
imm+0.002
wy+0.002
zzy+0.002
iest+0.002
Neg logit contributions
 have-0.001
 be-0.001
 Lily-0.001
 do-0.001
 what-0.001
Token noises
Token ID26782
Feature activation+1.21e-02

Pos logit contributions
at+0.021
imm+0.021
ier+0.019
iest+0.019
iful+0.019
Neg logit contributions
 have-0.010
 be-0.009
 I-0.008
 we-0.008
 Lily-0.008
Token.
Token ID13
Feature activation-2.17e-03

Pos logit contributions
at+0.032
imm+0.031
ier+0.030
zzy+0.030
wy+0.030
Neg logit contributions
 have-0.013
 be-0.012
 Lily-0.011
 I-0.010
 we-0.010
Token When
Token ID1649
Feature activation-3.31e-04

Pos logit contributions
 have+0.003
 be+0.003
 what+0.002
 we+0.002
 knock+0.002
Neg logit contributions
imm-0.005
at-0.005
zzy-0.005
ibr-0.005
obb-0.004
Token she
Token ID673
Feature activation+6.22e-03

Pos logit contributions
 have+0.000
 be+0.000
 I+0.000
 do+0.000
 what+0.000
Neg logit contributions
at-0.001
zzy-0.001
iest-0.001
wy-0.001
side-0.001
Token came
Token ID1625
Feature activation+2.22e-02

Pos logit contributions
wy+0.018
at+0.018
imm+0.017
rol+0.017
ier+0.017
Neg logit contributions
 have-0.008
 be-0.008
 do-0.006
 Lily-0.006
 what-0.006
Token out
Token ID503
Feature activation+2.11e-02

Pos logit contributions
at+0.051
imm+0.050
zzy+0.049
wy+0.048
iest+0.047
Neg logit contributions
 have-0.032
 be-0.029
 Lily-0.027
 knock-0.026
 do-0.025
Token of
Token ID286
Feature activation+9.05e-03

Pos logit contributions
at+0.066
wy+0.065
imm+0.064
zzy+0.062
iest+0.059
Neg logit contributions
 have-0.018
 be-0.016
 Lily-0.014
 what-0.012
 I-0.012
Token the
Token ID262
Feature activation+1.34e-03

Pos logit contributions
at+0.030
imm+0.029
wy+0.029
zzy+0.028
ier+0.027
Neg logit contributions
 have-0.006
 be-0.006
 Lily-0.005
 what-0.004
 mom-0.004
Token tunnel
Token ID13275
Feature activation+1.57e-02

Pos logit contributions
at+0.003
imm+0.003
iest+0.003
zzy+0.003
vell+0.003
Neg logit contributions
 have-0.002
 be-0.002
 Lily-0.002
 I-0.002
 what-0.002
Token,
Token ID11
Feature activation+2.26e-03

Pos logit contributions
at+0.044
imm+0.044
iest+0.042
ier+0.042
zzy+0.041
Neg logit contributions
 have-0.015
 be-0.013
 Lily-0.011
 what-0.010
 do-0.010
Token she
Token ID673
Feature activation+3.22e-04

Pos logit contributions
zzy+0.005
iest+0.004
at+0.004
ier+0.004
vell+0.004
Neg logit contributions
 be-0.003
 have-0.003
 knock-0.003
 I-0.002
 do-0.002
Token the
Token ID262
Feature activation-2.81e-03

Pos logit contributions
at+0.007
imm+0.007
wy+0.007
zzy+0.006
iest+0.006
Neg logit contributions
 have-0.004
 be-0.003
 Lily-0.003
 I-0.003
 do-0.003
Token nurses
Token ID20669
Feature activation+1.26e-02

Pos logit contributions
 have+0.004
 be+0.004
 Lily+0.003
 what+0.003
 I+0.003
Neg logit contributions
at-0.006
wy-0.006
zzy-0.006
iest-0.006
imm-0.006
Token gave
Token ID2921
Feature activation-6.86e-03

Pos logit contributions
at+0.031
imm+0.030
wy+0.030
zzy+0.029
iest+0.028
Neg logit contributions
 have-0.018
 be-0.016
 Lily-0.015
 I-0.014
 do-0.014
Token her
Token ID607
Feature activation-1.51e-02

Pos logit contributions
 have+0.007
 be+0.007
 Lily+0.006
 what+0.005
 I+0.005
Neg logit contributions
at-0.019
imm-0.019
wy-0.019
zzy-0.018
iest-0.018
Token a
Token ID257
Feature activation-1.42e-03

Pos logit contributions
 have+0.011
 be+0.010
 Lily+0.007
 I+0.007
 knock+0.006
Neg logit contributions
at-0.047
imm-0.045
wy-0.045
zzy-0.045
iest-0.043
Token big
Token ID1263
Feature activation+2.26e-02

Pos logit contributions
 have+0.002
 be+0.001
 Lily+0.001
 I+0.001
 do+0.001
Neg logit contributions
at-0.004
imm-0.004
wy-0.004
zzy-0.004
iest-0.004
Token hug
Token ID16225
Feature activation+8.65e-03

Pos logit contributions
at+0.055
imm+0.052
wy+0.052
zzy+0.052
iest+0.050
Neg logit contributions
 have-0.030
 be-0.028
 Lily-0.025
 I-0.024
 we-0.023
Token.
Token ID13
Feature activation+3.86e-03

Pos logit contributions
at+0.029
wy+0.028
imm+0.028
zzy+0.027
vell+0.027
Neg logit contributions
 have-0.005
 be-0.005
 Lily-0.004
 do-0.003
 what-0.003
Token
Token ID220
Feature activation+5.19e-03

Pos logit contributions
at+0.010
imm+0.010
zzy+0.010
wy+0.010
rol+0.010
Neg logit contributions
 have-0.004
 be-0.004
 do-0.003
 knock-0.003
 what-0.003
Token
Token ID198
Feature activation+9.73e-05

Pos logit contributions
at+0.015
imm+0.015
wy+0.015
zzy+0.014
rol+0.014
Neg logit contributions
 have-0.005
 be-0.004
 Lily-0.003
 knock-0.003
 do-0.003
Token
Token ID198
Feature activation+1.17e-03

Pos logit contributions
at+0.000
imm+0.000
wy+0.000
zzy+0.000
iest+0.000
Neg logit contributions
 have-0.000
 be-0.000
 Lily-0.000
 I-0.000
 do-0.000
Token too
Token ID1165
Feature activation-8.41e-03

Pos logit contributions
at+0.024
imm+0.024
wy+0.023
zzy+0.023
iest+0.022
Neg logit contributions
 have-0.007
 be-0.006
 Lily-0.005
 I-0.005
 what-0.004
Token high
Token ID1029
Feature activation+1.76e-02

Pos logit contributions
 have+0.011
 be+0.010
 Lily+0.009
 I+0.008
 do+0.008
Neg logit contributions
at-0.022
imm-0.022
wy-0.021
zzy-0.021
rol-0.021
Token and
Token ID290
Feature activation+8.61e-03

Pos logit contributions
at+0.060
zzy+0.058
imm+0.058
wy+0.058
vell+0.056
Neg logit contributions
 have-0.009
 be-0.009
 Lily-0.007
 what-0.006
 it-0.006
Token the
Token ID262
Feature activation-2.43e-05

Pos logit contributions
at+0.024
imm+0.024
wy+0.023
zzy+0.023
iest+0.022
Neg logit contributions
 have-0.009
 be-0.008
 Lily-0.007
 do-0.006
 what-0.006
Token man
Token ID582
Feature activation-4.82e-03

Pos logit contributions
 have+0.000
 be+0.000
 Lily+0.000
 what+0.000
 we+0.000
Neg logit contributions
at-0.000
wy-0.000
imm-0.000
zzy-0.000
ier-0.000
Token couldn
Token ID3521
Feature activation+2.08e-02

Pos logit contributions
 have+0.006
 be+0.006
 Lily+0.005
 do+0.005
 I+0.005
Neg logit contributions
at-0.012
imm-0.012
wy-0.012
zzy-0.012
iest-0.011
Token't
Token ID470
Feature activation-9.07e-03

Pos logit contributions
at+0.049
imm+0.047
wy+0.046
zzy+0.045
iest+0.043
Neg logit contributions
 have-0.031
 be-0.029
 Lily-0.027
 what-0.026
 do-0.025
Token reach
Token ID3151
Feature activation+2.66e-03

Pos logit contributions
 have+0.010
 be+0.009
 Lily+0.008
 do+0.007
 I+0.007
Neg logit contributions
at-0.026
imm-0.026
wy-0.025
rol-0.024
zzy-0.024
Token the
Token ID262
Feature activation-5.14e-04

Pos logit contributions
at+0.008
imm+0.007
wy+0.007
zzy+0.007
iest+0.007
Neg logit contributions
 have-0.003
 be-0.002
 Lily-0.002
 I-0.002
 do-0.002
Token others
Token ID1854
Feature activation+1.30e-02

Pos logit contributions
 have+0.001
 be+0.001
 Lily+0.001
 what+0.001
 we+0.001
Neg logit contributions
at-0.001
wy-0.001
imm-0.001
ier-0.001
iest-0.001
Token.
Token ID13
Feature activation-2.22e-03

Pos logit contributions
at+0.039
imm+0.037
wy+0.037
zzy+0.036
ier+0.035
Neg logit contributions
 have-0.011
 be-0.010
 Lily-0.009
 I-0.008
 knock-0.007
Token the
Token ID262
Feature activation-5.34e-03

Pos logit contributions
 have+0.000
 be+0.000
 I+0.000
 we+0.000
 knock+0.000
Neg logit contributions
at-0.000
zzy-0.000
wy-0.000
imm-0.000
us-0.000
Token pond
Token ID16723
Feature activation-1.44e-02

Pos logit contributions
 have+0.008
 be+0.007
 Lily+0.006
 what+0.006
 I+0.006
Neg logit contributions
at-0.012
wy-0.012
zzy-0.011
vell-0.011
imm-0.011
Token and
Token ID290
Feature activation-1.35e-02

Pos logit contributions
 have+0.013
 be+0.011
 Lily+0.009
 I+0.008
 do+0.008
Neg logit contributions
at-0.043
imm-0.042
wy-0.041
zzy-0.040
iest-0.039
Token fed
Token ID11672
Feature activation-6.55e-03

Pos logit contributions
 have+0.019
 be+0.018
 Lily+0.016
 I+0.015
 do+0.015
Neg logit contributions
at-0.033
imm-0.032
wy-0.031
zzy-0.031
iest-0.029
Token the
Token ID262
Feature activation-6.26e-03

Pos logit contributions
 have+0.006
 be+0.005
 knock+0.004
 I+0.004
 Lily+0.004
Neg logit contributions
at-0.019
wy-0.019
imm-0.019
zzy-0.018
iest-0.018
Token reef
Token ID25088
Feature activation+2.83e-03

Pos logit contributions
 have+0.010
 be+0.009
 Lily+0.008
 what+0.008
 knock+0.008
Neg logit contributions
at-0.014
wy-0.013
imm-0.013
zzy-0.013
iest-0.013
Token and
Token ID290
Feature activation-9.51e-03

Pos logit contributions
at+0.008
imm+0.008
wy+0.008
zzy+0.008
iest+0.007
Neg logit contributions
 have-0.003
 be-0.002
 Lily-0.002
 I-0.002
 do-0.002
Token then
Token ID788
Feature activation+4.22e-04

Pos logit contributions
 have+0.015
 be+0.014
 Lily+0.013
 I+0.012
 do+0.012
Neg logit contributions
at-0.022
imm-0.021
wy-0.021
zzy-0.020
iest-0.019
Token carefully
Token ID7773
Feature activation+2.45e-03

Pos logit contributions
at+0.001
imm+0.001
wy+0.001
zzy+0.001
iest+0.001
Neg logit contributions
 have-0.001
 be-0.001
 Lily-0.000
 I-0.000
 what-0.000
Token moved
Token ID3888
Feature activation-2.38e-03

Pos logit contributions
at+0.006
imm+0.006
wy+0.005
zzy+0.005
iest+0.005
Neg logit contributions
 have-0.004
 be-0.004
 Lily-0.003
 what-0.003
 do-0.003
Token it
Token ID340
Feature activation-1.01e-02

Pos logit contributions
 have+0.003
 be+0.003
 Lily+0.002
 I+0.002
 what+0.002
Neg logit contributions
at-0.006
imm-0.006
wy-0.006
zzy-0.006
iest-0.006
Token One
Token ID1881
Feature activation-8.83e-03

Pos logit contributions
 have+0.002
 be+0.002
 knock+0.002
 we+0.001
 what+0.001
Neg logit contributions
at-0.005
imm-0.005
zzy-0.004
amb-0.004
wy-0.004
Token day
Token ID1110
Feature activation-6.99e-03

Pos logit contributions
 have+0.014
 be+0.013
 Lily+0.012
 do+0.011
 I+0.011
Neg logit contributions
at-0.021
imm-0.021
wy-0.020
zzy-0.020
ier-0.019
Token,
Token ID11
Feature activation-5.11e-03

Pos logit contributions
 have+0.006
 be+0.005
 Lily+0.004
 I+0.004
 do+0.004
Neg logit contributions
at-0.021
imm-0.020
wy-0.020
zzy-0.020
iest-0.019
Token she
Token ID673
Feature activation-2.78e-03

Pos logit contributions
 have+0.006
 be+0.005
 knock+0.004
 do+0.004
 I+0.004
Neg logit contributions
at-0.014
imm-0.013
wy-0.013
zzy-0.013
iest-0.013
Token asked
Token ID1965
Feature activation-2.30e-03

Pos logit contributions
 have+0.003
 be+0.003
 Lily+0.003
 I+0.002
 do+0.002
Neg logit contributions
at-0.008
imm-0.007
wy-0.007
zzy-0.007
iest-0.007
Token her
Token ID607
Feature activation-7.52e-03

Pos logit contributions
 have+0.003
 be+0.003
 I+0.003
 knock+0.003
 we+0.003
Neg logit contributions
at-0.006
imm-0.005
wy-0.005
zzy-0.005
iest-0.005
Token mom
Token ID1995
Feature activation-2.56e-03

Pos logit contributions
 have+0.004
 be+0.003
 I+0.002
 Lily+0.002
 do+0.002
Neg logit contributions
at-0.025
imm-0.024
wy-0.024
zzy-0.024
iest-0.023
Token if
Token ID611
Feature activation-1.52e-02

Pos logit contributions
 have+0.004
 be+0.003
 knock+0.003
 I+0.003
 Lily+0.003
Neg logit contributions
at-0.006
wy-0.005
imm-0.005
iest-0.005
ier-0.005
Token they
Token ID484
Feature activation+4.66e-05

Pos logit contributions
 have+0.017
 be+0.015
 Lily+0.012
 knock+0.012
 do+0.011
Neg logit contributions
at-0.040
imm-0.039
wy-0.038
zzy-0.038
iest-0.037
Token could
Token ID714
Feature activation-5.16e-03

Pos logit contributions
at+0.000
zzy+0.000
wy+0.000
imm+0.000
iest+0.000
Neg logit contributions
 have-0.000
 be-0.000
 Lily-0.000
 knock-0.000
 what-0.000
Token go
Token ID467
Feature activation+9.46e-03

Pos logit contributions
 have+0.003
 Lily+0.003
 be+0.003
 what+0.002
 I+0.002
Neg logit contributions
at-0.016
imm-0.015
zzy-0.015
wy-0.015
iest-0.014
Token and
Token ID290
Feature activation-5.03e-03

Pos logit contributions
 have+0.008
 be+0.007
 knock+0.007
 Lily+0.006
 cry+0.006
Neg logit contributions
at-0.017
iest-0.016
imm-0.015
wy-0.015
ier-0.015
Token see
Token ID766
Feature activation-6.45e-03

Pos logit contributions
 language+0.004
 sad+0.003
 Sharing+0.003
 have+0.003
 be+0.003
Neg logit contributions
at-0.012
zzy-0.011
wy-0.011
atmeal-0.011
ridge-0.011
Token,"
Token ID553
Feature activation+1.49e-02

Pos logit contributions
 have+0.008
 be+0.007
 knock+0.006
 cry+0.005
 do+0.005
Neg logit contributions
at-0.016
wy-0.015
ier-0.015
us-0.015
imm-0.015
Token Ben
Token ID3932
Feature activation+1.91e-03

Pos logit contributions
at+0.037
zzy+0.037
wy+0.037
imm+0.036
dd+0.034
Neg logit contributions
 have-0.016
 be-0.015
 knock-0.013
 what-0.012
 I-0.011
Token said
Token ID531
Feature activation-4.39e-03

Pos logit contributions
at+0.005
imm+0.005
wy+0.005
zzy+0.005
iest+0.005
Neg logit contributions
 have-0.002
 be-0.002
 Lily-0.002
 I-0.002
 what-0.002
Token.
Token ID13
Feature activation-1.99e-03

Pos logit contributions
 have+0.004
 be+0.003
 Lily+0.002
 I+0.002
 do+0.002
Neg logit contributions
at-0.013
imm-0.013
wy-0.013
zzy-0.013
iest-0.012
Token
Token ID198
Feature activation-3.02e-03

Pos logit contributions
 have+0.002
 be+0.001
 Lily+0.001
 what+0.001
 do+0.001
Neg logit contributions
at-0.006
imm-0.006
wy-0.006
zzy-0.006
iest-0.006
Token
Token ID198
Feature activation-1.54e-03

Pos logit contributions
 have+0.002
 be+0.002
 Lily+0.002
 I+0.001
 do+0.001
Neg logit contributions
at-0.009
imm-0.009
wy-0.009
zzy-0.009
iest-0.009
TokenThey
Token ID2990
Feature activation-1.06e-03

Pos logit contributions
 have+0.002
 be+0.002
 Lily+0.002
 I+0.001
 do+0.001
Neg logit contributions
at-0.004
imm-0.004
wy-0.004
zzy-0.004
iest-0.004
Token ran
Token ID4966
Feature activation-5.43e-03

Pos logit contributions
 have+0.002
 be+0.001
 Lily+0.001
 do+0.001
 I+0.001
Neg logit contributions
at-0.003
imm-0.003
wy-0.002
zzy-0.002
iest-0.002
Token to
Token ID284
Feature activation+3.04e-03

Pos logit contributions
 have+0.007
 be+0.007
 Lily+0.006
 I+0.006
 do+0.005
Neg logit contributions
at-0.014
imm-0.013
wy-0.013
zzy-0.013
iest-0.012
Token mum
Token ID25682
Feature activation-1.93e-04

Pos logit contributions
at+0.001
imm+0.001
wy+0.001
zzy+0.001
iest+0.001
Neg logit contributions
 have-0.000
 be-0.000
 Lily-0.000
 I-0.000
 what-0.000
Token came
Token ID1625
Feature activation+8.34e-03

Pos logit contributions
 have+0.000
 be+0.000
 Lily+0.000
 I+0.000
 what+0.000
Neg logit contributions
at-0.001
imm-0.000
wy-0.000
zzy-0.000
iest-0.000
Token over
Token ID625
Feature activation-1.28e-02

Pos logit contributions
at+0.021
imm+0.020
wy+0.020
zzy+0.019
iest+0.018
Neg logit contributions
 have-0.012
 be-0.011
 Lily-0.010
 I-0.009
 do-0.009
Token and
Token ID290
Feature activation-1.21e-02

Pos logit contributions
 have+0.012
 be+0.011
 Lily+0.009
 what+0.008
 I+0.008
Neg logit contributions
at-0.039
imm-0.038
wy-0.038
zzy-0.036
iest-0.035
Token asked
Token ID1965
Feature activation-6.38e-03

Pos logit contributions
 have+0.014
 be+0.013
 Lily+0.011
 I+0.011
 do+0.010
Neg logit contributions
at-0.033
imm-0.032
wy-0.031
zzy-0.031
iest-0.030
Token him
Token ID683
Feature activation-1.94e-03

Pos logit contributions
 have+0.011
 be+0.010
 knock+0.009
 I+0.009
 we+0.009
Neg logit contributions
at-0.013
imm-0.012
wy-0.012
us-0.012
zzy-0.012
Token what
Token ID644
Feature activation+1.01e-03

Pos logit contributions
 have+0.002
 be+0.002
 knock+0.002
 I+0.001
 we+0.001
Neg logit contributions
at-0.005
zzy-0.005
iest-0.005
imm-0.005
ier-0.005
Token he
Token ID339
Feature activation+1.65e-02

Pos logit contributions
at+0.003
imm+0.003
iest+0.002
zzy+0.002
wy+0.002
Neg logit contributions
 have-0.001
 be-0.001
 Lily-0.001
 what-0.001
 knock-0.001
Token was
Token ID373
Feature activation-2.13e-03

Pos logit contributions
at+0.044
imm+0.043
wy+0.042
zzy+0.041
iest+0.039
Neg logit contributions
 have-0.019
 be-0.018
 Lily-0.017
 I-0.015
 we-0.015
Token looking
Token ID2045
Feature activation-1.93e-03

Pos logit contributions
 have+0.003
 be+0.003
 Lily+0.002
 do+0.002
 I+0.002
Neg logit contributions
at-0.006
imm-0.005
wy-0.005
zzy-0.005
iest-0.005
Token at
Token ID379
Feature activation-1.82e-03

Pos logit contributions
 have+0.002
 be+0.002
 Lily+0.002
 I+0.002
 do+0.002
Neg logit contributions
at-0.005
imm-0.005
wy-0.005
zzy-0.005
iest-0.005
Token it
Token ID340
Feature activation-1.71e-03

Pos logit contributions
 have+0.002
 be+0.002
 knock+0.001
 Lily+0.001
 look+0.001
Neg logit contributions
us-0.007
at-0.007
dd-0.007
iest-0.007
ier-0.007
Token won
Token ID1839
Feature activation+1.46e-02

Pos logit contributions
 have+0.003
 be+0.002
 Lily+0.002
 I+0.002
 we+0.002
Neg logit contributions
at-0.004
imm-0.004
wy-0.004
zzy-0.004
iest-0.004
Token't
Token ID470
Feature activation-1.01e-02

Pos logit contributions
at+0.031
imm+0.029
wy+0.028
zzy+0.027
ier+0.027
Neg logit contributions
 have-0.023
 be-0.022
 Lily-0.021
 what-0.020
 knock-0.019
Token get
Token ID651
Feature activation+8.20e-04

Pos logit contributions
 have+0.010
 Lily+0.009
 what+0.009
 be+0.009
 Mom+0.008
Neg logit contributions
at-0.026
imm-0.025
ier-0.024
wy-0.024
iest-0.024
Token so
Token ID523
Feature activation+1.61e-03

Pos logit contributions
at+0.002
wy+0.002
imm+0.002
ier+0.002
iest+0.002
Neg logit contributions
 have-0.001
 be-0.001
 do-0.001
 Lily-0.001
 knock-0.001
Token dusty
Token ID36972
Feature activation+3.13e-03

Pos logit contributions
at+0.004
wy+0.004
imm+0.004
zzy+0.004
iest+0.004
Neg logit contributions
 have-0.002
 be-0.002
 Lily-0.002
 do-0.002
 what-0.002
Token!".
Token ID48220
Feature activation-6.59e-03

Pos logit contributions
at+0.009
imm+0.009
wy+0.009
zzy+0.009
wen+0.008
Neg logit contributions
 have-0.003
 be-0.003
 Lily-0.002
 I-0.002
 what-0.002
Token<|endoftext|>
Token ID50256
Feature activation+3.01e-03

Pos logit contributions
 have+0.006
 be+0.006
 Lily+0.005
 I+0.005
 do+0.005
Neg logit contributions
at-0.019
imm-0.019
wy-0.018
zzy-0.018
iest-0.017
TokenThis
Token ID1212
Feature activation-4.95e-03

Pos logit contributions
at+0.007
imm+0.007
wy+0.007
zzy+0.007
iest+0.007
Neg logit contributions
 have-0.004
 be-0.004
 Lily-0.004
 I-0.004
 do-0.004
Token is
Token ID318
Feature activation-6.45e-03

Pos logit contributions
 have+0.007
 be+0.007
 Lily+0.006
 do+0.006
 I+0.006
Neg logit contributions
at-0.012
imm-0.012
wy-0.012
zzy-0.011
ier-0.011
Token a
Token ID257
Feature activation-7.32e-03

Pos logit contributions
 have+0.005
 be+0.004
 Lily+0.004
 what+0.003
 I+0.003
Neg logit contributions
at-0.020
imm-0.020
wy-0.020
zzy-0.019
vell-0.019
Token cut
Token ID2005
Feature activation-3.69e-03

Pos logit contributions
 have+0.007
 be+0.006
 Lily+0.006
 do+0.005
 I+0.005
Neg logit contributions
at-0.013
imm-0.013
wy-0.013
zzy-0.012
iest-0.012
Token the
Token ID262
Feature activation-5.01e-03

Pos logit contributions
 have+0.003
 be+0.002
 Lily+0.002
 I+0.001
 do+0.001
Neg logit contributions
at-0.012
imm-0.012
wy-0.011
zzy-0.011
iest-0.011
Token mel
Token ID7758
Feature activation-9.61e-03

Pos logit contributions
 have+0.009
 be+0.008
 Lily+0.008
 I+0.007
 what+0.007
Neg logit contributions
at-0.011
wy-0.010
imm-0.010
zzy-0.009
iest-0.009
Tokenon
Token ID261
Feature activation-3.25e-03

Pos logit contributions
 have+0.017
 be+0.016
 I+0.014
 do+0.014
 Lily+0.014
Neg logit contributions
zzy-0.019
imm-0.019
wy-0.018
at-0.018
rol-0.018
Token into
Token ID656
Feature activation-1.28e-02

Pos logit contributions
 have+0.003
 be+0.003
 Lily+0.002
 do+0.002
 what+0.002
Neg logit contributions
at-0.010
imm-0.010
wy-0.010
zzy-0.009
iest-0.009
Token slices
Token ID24314
Feature activation-1.30e-02

Pos logit contributions
 have+0.015
 be+0.013
 I+0.012
 we+0.011
 do+0.011
Neg logit contributions
wy-0.031
at-0.031
imm-0.031
zzy-0.030
iest-0.030
Token and
Token ID290
Feature activation-5.30e-03

Pos logit contributions
 have+0.008
 be+0.007
 Lily+0.005
 do+0.004
 what+0.004
Neg logit contributions
at-0.043
imm-0.043
wy-0.042
zzy-0.041
rol-0.040
Token put
Token ID1234
Feature activation-1.22e-02

Pos logit contributions
 have+0.008
 be+0.008
 Lily+0.007
 I+0.007
 do+0.007
Neg logit contributions
at-0.012
wy-0.011
imm-0.011
zzy-0.011
ier-0.011
Token them
Token ID606
Feature activation+1.12e-03

Pos logit contributions
 have+0.010
 be+0.009
 Lily+0.007
 I+0.006
 do+0.006
Neg logit contributions
at-0.037
imm-0.036
wy-0.036
zzy-0.035
iest-0.034
Token on
Token ID319
Feature activation-5.25e-03

Pos logit contributions
imm+0.003
at+0.003
zzy+0.003
wy+0.003
wen+0.003
Neg logit contributions
 have-0.001
 be-0.001
 Lily-0.001
 what-0.001
 mom-0.001
Token a
Token ID257
Feature activation+5.25e-03

Pos logit contributions
 have+0.004
 be+0.003
 Lily+0.002
 I+0.002
 do+0.002
Neg logit contributions
at-0.017
imm-0.016
wy-0.016
zzy-0.016
iest-0.015
Token them
Token ID606
Feature activation-6.32e-03

Pos logit contributions
 have+0.004
 be+0.004
 I+0.004
 knock+0.003
 do+0.003
Neg logit contributions
at-0.010
wy-0.009
imm-0.009
zzy-0.009
us-0.009
Token set
Token ID900
Feature activation+5.58e-03

Pos logit contributions
 have+0.005
 be+0.005
 Lily+0.004
 I+0.004
 we+0.004
Neg logit contributions
at-0.018
imm-0.018
wy-0.017
zzy-0.017
iest-0.016
Token up
Token ID510
Feature activation-2.49e-03

Pos logit contributions
at+0.013
wy+0.013
imm+0.013
zzy+0.012
us+0.012
Neg logit contributions
 have-0.008
 be-0.007
 I-0.007
 do-0.006
 Lily-0.006
Token a
Token ID257
Feature activation+3.96e-03

Pos logit contributions
 have+0.002
 be+0.002
 Lily+0.002
 I+0.001
 do+0.001
Neg logit contributions
at-0.008
imm-0.007
wy-0.007
zzy-0.007
iest-0.007
Token play
Token ID711
Feature activation+6.65e-03

Pos logit contributions
at+0.009
wy+0.009
imm+0.009
iest+0.009
zzy+0.009
Neg logit contributions
 have-0.005
 be-0.005
 I-0.004
 what-0.004
 Lily-0.004
Token area
Token ID1989
Feature activation-8.19e-03

Pos logit contributions
at+0.016
imm+0.015
wy+0.015
zzy+0.015
iest+0.014
Neg logit contributions
 have-0.010
 be-0.009
 Lily-0.009
 I-0.008
 do-0.008
Token in
Token ID287
Feature activation-7.25e-03

Pos logit contributions
 have+0.008
 be+0.007
 Lily+0.006
 I+0.006
 do+0.005
Neg logit contributions
at-0.023
imm-0.023
zzy-0.022
wy-0.022
iest-0.021
Token the
Token ID262
Feature activation+7.55e-03

Pos logit contributions
 have+0.007
 be+0.006
 I+0.006
 knock+0.005
 do+0.005
Neg logit contributions
zzy-0.018
wy-0.018
at-0.018
imm-0.018
side-0.017
Token living
Token ID2877
Feature activation-1.50e-02

Pos logit contributions
at+0.017
wy+0.016
imm+0.016
zzy+0.015
iest+0.015
Neg logit contributions
 have-0.011
 be-0.010
 what-0.010
 I-0.009
 we-0.009
Token room
Token ID2119
Feature activation-7.44e-03

Pos logit contributions
 have+0.011
 be+0.010
 Lily+0.008
 I+0.006
 do+0.006
Neg logit contributions
at-0.048
imm-0.046
wy-0.046
zzy-0.045
iest-0.044
Token.
Token ID13
Feature activation+1.04e-04

Pos logit contributions
 have+0.006
 be+0.005
 Lily+0.004
 I+0.004
 do+0.004
Neg logit contributions
at-0.022
imm-0.022
wy-0.022
zzy-0.021
ier-0.021
Token that
Token ID326
Feature activation-4.78e-03

Pos logit contributions
 have+0.003
 be+0.002
 Lily+0.002
 I+0.002
 knock+0.002
Neg logit contributions
at-0.008
imm-0.008
zzy-0.008
iest-0.008
ier-0.008
Token they
Token ID484
Feature activation-1.27e-02

Pos logit contributions
 have+0.004
 be+0.004
 knock+0.003
 Lily+0.003
 do+0.003
Neg logit contributions
at-0.013
imm-0.012
iest-0.012
wy-0.012
zzy-0.012
Token would
Token ID561
Feature activation-7.04e-03

Pos logit contributions
 have+0.022
 be+0.022
 Lily+0.021
 I+0.020
 Mom+0.019
Neg logit contributions
at-0.025
wy-0.023
imm-0.023
zzy-0.023
ier-0.022
Token see
Token ID766
Feature activation-1.73e-02

Pos logit contributions
 Lily+0.006
 have+0.005
 kids+0.004
 I+0.004
 we+0.004
Neg logit contributions
at-0.018
zzy-0.018
imm-0.017
ier-0.017
vell-0.017
Token Comet
Token ID36238
Feature activation-1.95e-03

Pos logit contributions
 have+0.017
 be+0.015
 Lily+0.012
 knock+0.012
 I+0.011
Neg logit contributions
at-0.047
imm-0.046
wy-0.046
iest-0.045
ier-0.044
Token Hale
Token ID35056
Feature activation-9.31e-03

Pos logit contributions
 have+0.002
 be+0.002
 Lily+0.002
 I+0.002
 knock+0.002
Neg logit contributions
at-0.005
imm-0.005
wy-0.005
zzy-0.005
ier-0.005
Token-
Token ID12
Feature activation+1.27e-02

Pos logit contributions
 have+0.017
 be+0.016
 Lily+0.015
 what+0.014
 do+0.014
Neg logit contributions
at-0.020
imm-0.019
wy-0.019
zzy-0.018
iest-0.017
TokenB
Token ID33
Feature activation+1.36e-02

Pos logit contributions
at+0.020
imm+0.020
wy+0.019
zzy+0.018
iest+0.017
Neg logit contributions
 have-0.029
 be-0.028
 Lily-0.026
 what-0.026
 I-0.026
Tokenopp
Token ID10365
Feature activation-7.83e-03

Pos logit contributions
at+0.020
imm+0.019
wy+0.019
zzy+0.018
iest+0.017
Neg logit contributions
 have-0.032
 be-0.031
 Lily-0.029
 I-0.028
 do-0.028
Token again
Token ID757
Feature activation-1.44e-02

Pos logit contributions
 have+0.014
 be+0.014
 Lily+0.012
 what+0.012
 we+0.012
Neg logit contributions
at-0.015
imm-0.015
zzy-0.015
wy-0.015
rol-0.014
Token someday
Token ID25580
Feature activation-3.16e-03

Pos logit contributions
 have+0.013
 be+0.012
 Lily+0.010
 I+0.009
 do+0.009
Neg logit contributions
at-0.043
imm-0.042
wy-0.041
zzy-0.040
iest-0.039
Token instead
Token ID2427
Feature activation-6.91e-04

Pos logit contributions
 have+0.002
 be+0.002
 Lily+0.002
 I+0.002
 do+0.002
Neg logit contributions
at-0.008
imm-0.008
wy-0.008
zzy-0.008
iest-0.007
Token.
Token ID13
Feature activation+6.82e-03

Pos logit contributions
 have+0.001
 be+0.001
 Lily+0.000
 knock+0.000
 I+0.000
Neg logit contributions
at-0.002
imm-0.002
wy-0.002
zzy-0.002
iest-0.002
Token They
Token ID1119
Feature activation-7.05e-03

Pos logit contributions
zzy+0.016
imm+0.015
at+0.015
ibr+0.015
amb+0.015
Neg logit contributions
 be-0.008
 have-0.007
 do-0.006
 knock-0.006
 look-0.006
Token left
Token ID1364
Feature activation+7.12e-04

Pos logit contributions
 have+0.009
 be+0.009
 Lily+0.008
 I+0.007
 what+0.007
Neg logit contributions
at-0.018
wy-0.017
imm-0.017
zzy-0.017
iest-0.016
Token the
Token ID262
Feature activation-2.25e-03

Pos logit contributions
at+0.002
wy+0.002
imm+0.002
zzy+0.002
ier+0.002
Neg logit contributions
 have-0.001
 be-0.001
 I-0.001
 do-0.001
 Lily-0.000
Token hen
Token ID30963
Feature activation-1.89e-02

Pos logit contributions
 have+0.003
 be+0.003
 Lily+0.002
 what+0.002
 I+0.002
Neg logit contributions
at-0.005
wy-0.005
imm-0.005
iest-0.005
zzy-0.005
Token and
Token ID290
Feature activation-7.45e-03

Pos logit contributions
 have+0.025
 be+0.024
 Lily+0.021
 I+0.021
 what+0.020
Neg logit contributions
at-0.047
wy-0.044
imm-0.044
zzy-0.042
rito-0.041
Token her
Token ID607
Feature activation-8.87e-03

Pos logit contributions
 have+0.008
 be+0.008
 I+0.007
 Lily+0.007
 do+0.006
Neg logit contributions
at-0.020
wy-0.019
imm-0.019
zzy-0.019
ier-0.017
Token eggs
Token ID9653
Feature activation-2.06e-03

Pos logit contributions
 have+0.010
 be+0.009
 Lily+0.008
 I+0.008
 what+0.007
Neg logit contributions
at-0.023
wy-0.023
imm-0.023
zzy-0.022
iest-0.021
Token alone
Token ID3436
Feature activation-7.67e-03

Pos logit contributions
 have+0.002
 be+0.002
 Lily+0.001
 what+0.001
 do+0.001
Neg logit contributions
at-0.006
imm-0.006
wy-0.006
zzy-0.006
iest-0.006
Token.
Token ID13
Feature activation+4.51e-03

Pos logit contributions
 have+0.007
 be+0.006
 Lily+0.005
 I+0.005
 do+0.004
Neg logit contributions
at-0.023
imm-0.022
wy-0.022
zzy-0.022
ier-0.021
Token.
Token ID13
Feature activation+2.19e-03

Pos logit contributions
 have+0.001
 be+0.001
 Lily+0.001
 I+0.001
 do+0.001
Neg logit contributions
at-0.005
imm-0.005
wy-0.005
zzy-0.005
iest-0.005
Token Tim
Token ID5045
Feature activation+6.58e-03

Pos logit contributions
at+0.006
imm+0.006
wy+0.006
zzy+0.006
iest+0.006
Neg logit contributions
 have-0.002
 be-0.002
 Lily-0.001
 what-0.001
 do-0.001
Tokenmy
Token ID1820
Feature activation-4.48e-03

Pos logit contributions
at+0.018
imm+0.018
wy+0.017
zzy+0.017
vell+0.016
Neg logit contributions
 have-0.008
 be-0.007
 Lily-0.006
 do-0.006
 what-0.006
Token loved
Token ID6151
Feature activation+5.80e-03

Pos logit contributions
 have+0.006
 be+0.005
 Lily+0.005
 I+0.005
 what+0.005
Neg logit contributions
at-0.011
imm-0.011
wy-0.011
zzy-0.011
iest-0.010
Token to
Token ID284
Feature activation+5.54e-03

Pos logit contributions
at+0.016
imm+0.015
wy+0.015
zzy+0.015
iest+0.014
Neg logit contributions
 have-0.007
 be-0.006
 Lily-0.006
 I-0.005
 do-0.005
Token draw
Token ID3197
Feature activation-1.10e-02

Pos logit contributions
at+0.016
imm+0.015
wy+0.015
zzy+0.015
atmeal+0.015
Neg logit contributions
 have-0.004
 be-0.004
 Lily-0.004
 what-0.003
 I-0.003
Token pictures
Token ID5986
Feature activation-5.27e-03

Pos logit contributions
 have+0.012
 be+0.011
 I+0.009
 knock+0.009
 we+0.008
Neg logit contributions
at-0.030
imm-0.030
wy-0.029
iest-0.028
vell-0.028
Token in
Token ID287
Feature activation-1.05e-02

Pos logit contributions
 have+0.006
 be+0.006
 knock+0.005
 cry+0.005
 Lily+0.005
Neg logit contributions
at-0.013
imm-0.013
iest-0.012
ier-0.012
vell-0.011
Token his
Token ID465
Feature activation+4.69e-03

Pos logit contributions
 have+0.019
 be+0.018
 I+0.017
 we+0.017
 do+0.016
Neg logit contributions
wy-0.016
at-0.016
us-0.016
iest-0.015
imm-0.015
Token notebook
Token ID20922
Feature activation-1.34e-02

Pos logit contributions
at+0.013
imm+0.012
wy+0.012
zzy+0.012
iest+0.011
Neg logit contributions
 have-0.005
 be-0.005
 Lily-0.004
 I-0.004
 what-0.004
Token.
Token ID13
Feature activation+2.44e-03

Pos logit contributions
 have+0.011
 be+0.010
 Lily+0.008
 I+0.007
 what+0.007
Neg logit contributions
at-0.041
imm-0.040
wy-0.040
zzy-0.039
iest-0.038