TOP ACTIVATIONS
MAX = 0.062

fly in his mouth . Ho ppy 's friends cheered and
buzzing around the pond . Ho ppy jumped and caught the
and saw a drain . Ho ppy rushed to help him
. When Gr acy got to the house
bug that got away . Ho ppy 's friends told him
reach it . Eventually , Ho ppy was tired , so
family . When Ho ppy got home , his
but it flew away . Ho ppy felt sad and started
shake and rum ble . Ho ppy opened his eyes and
high in the sky . Ho ppy thought it looked like
a bug next time . Ho ppy smiled and felt happy
sun began to set , Ho ppy realized it was time
The next day , Ho ppy saw a fat fly
On his journey back , Ho ppy felt proud of himself
him what was wrong . Ho ppy told them about the
, just like before . Ho ppy was very happy and
One day , Ho ppy hurt his leg .
hop for a week . Ho ppy was sad , but
hopped in the rain . Ho ppy was very happy .
As the farmer scattered the av oc ados , he noticed

MIN ACTIVATIONS
MIN = -0.052

were in the park one sunny day . A ball rolled
and his stubborn ness gave way to understanding . He took
car with her parents one sunny day . They drove for
and soon the darkness gave way to a beautiful underground garden
lovely conversation and made his way back home , happy that
time , there was a do ggy named Bob . Bob
neither was willing to give way . Finally , after much
the dull sky had given way to a bright and sunny
agreed and they made their way to the store . When
park on a bright and sunny day . Sam
playing in the park one sunny day . He saw a
slide and laughed all the way down .
, there was a big do ggie named Max . Max
his friends and made his way to the tower .
friends and skipped all the way back to the car .
, Tommy took the long way to the park so the
in a kind and thoughtful way . His friends soon joined
carefully while they made their way down from the tree .
walking through the park one sunny day . He saw a
let his mum lead the way . They soon got to

INTERVAL 0.033 to 0.062
CONTAINS 0.051%

and looked around . The cater pillar wanted to find a
frog named Ho ppy . Ho ppy lived in a small
days ." Ben smiles . He likes his boots
were sorry . He felt a little sorry too . He
," Ben says . He smiles at L ila too .

INTERVAL 0.005 to 0.033
CONTAINS 28.891%

He asked Mom my , " Can I have
. Then they put some pine cones on the top to
the friends said the cart looked very useful . The other
She saw a door that said " Do not enter ".
nice , but I made a dragon !" Tom said ,

INTERVAL -0.024 to 0.005
CONTAINS 70.728%

is cold and fresh ," Ben said . " It makes
up , she saw that her cereal was spoiled . It
the go . One day , M andy was in her
as new . From that day on , the boy made
The end . <|endoftext|> L ily liked to climb . She

INTERVAL -0.052 to -0.024
CONTAINS 0.331%

the shiny metal and the way it opened and closed .
, � � € � � You â € ™ re
€ ™ s nice and warm . Thank you , Mama
their things . On their way home , they told stories
the belt around Jack 's waist prevented his coat from falling
Token fly
Token ID6129
Feature activation-2.49e-03

Pos logit contributions
epad+0.008
licks+0.008
rou+0.008
 twice+0.008
 Does+0.007
Neg logit contributions
 Ph-0.005
Ph-0.004
 umbrella-0.004
 duck-0.004
 kids-0.004
Token in
Token ID287
Feature activation+4.84e-03

Pos logit contributions
 duck+0.003
 umbrella+0.002
 kids+0.002
 Ph+0.002
 best+0.002
Neg logit contributions
epad-0.005
licks-0.005
rou-0.004
 Does-0.004
minded-0.004
Token his
Token ID465
Feature activation+7.60e-03

Pos logit contributions
licks+0.010
epad+0.010
rou+0.009
 twice+0.009
minded+0.009
Neg logit contributions
 duck-0.005
 kids-0.004
 umbrella-0.004
 Ph-0.004
Ph-0.004
Token mouth
Token ID5422
Feature activation-2.45e-03

Pos logit contributions
licks+0.009
auldron+0.009
 habitats+0.009
rou+0.009
 twice+0.008
Neg logit contributions
 Ph-0.012
Ph-0.011
 Esk-0.011
n-0.011
 kids-0.011
Token.
Token ID13
Feature activation+7.31e-03

Pos logit contributions
 duck+0.001
 best+0.001
 umbrella+0.001
 kids+0.001
 two+0.001
Neg logit contributions
epad-0.006
licks-0.006
 Does-0.006
 habitats-0.006
minded-0.006
Token Ho
Token ID9544
Feature activation+6.15e-02

Pos logit contributions
epad+0.015
licks+0.014
rou+0.014
ited+0.013
 facts+0.013
Neg logit contributions
 duck-0.008
 pig-0.007
 Ph-0.007
 kids-0.007
 je-0.007
Tokenppy
Token ID14097
Feature activation+4.98e-03

Pos logit contributions
epad+0.092
licks+0.085
 twice+0.079
 facts+0.079
 attent+0.076
Neg logit contributions
 duck-0.094
 best-0.085
 kids-0.085
 pig-0.083
n-0.082
Token's
Token ID338
Feature activation-4.82e-03

Pos logit contributions
epad+0.011
licks+0.011
 twice+0.010
rou+0.010
minded+0.009
Neg logit contributions
 duck-0.005
 kids-0.004
 umbrella-0.004
 pig-0.004
 k-0.004
Token friends
Token ID2460
Feature activation+7.45e-03

Pos logit contributions
 Ph+0.003
Ph+0.003
 duck+0.003
 umbrella+0.003
 kids+0.003
Neg logit contributions
epad-0.011
licks-0.011
 twice-0.010
rou-0.010
 habitats-0.010
Token cheered
Token ID34767
Feature activation+2.10e-02

Pos logit contributions
epad+0.015
licks+0.015
 twice+0.014
rou+0.014
 Does+0.014
Neg logit contributions
 duck-0.008
 kids-0.007
 umbrella-0.007
 Ph-0.007
 boys-0.006
Token and
Token ID290
Feature activation+3.89e-03

Pos logit contributions
epad+0.040
licks+0.040
rou+0.036
 habitats+0.036
 Does+0.036
Neg logit contributions
 duck-0.023
 kids-0.022
 umbrella-0.021
 best-0.020
 Ph-0.020
Token buzzing
Token ID41719
Feature activation-9.92e-03

Pos logit contributions
 duck+0.007
 umbrella+0.006
 kids+0.006
 k+0.006
 best+0.006
Neg logit contributions
epad-0.013
licks-0.013
minded-0.011
 Does-0.011
 habitats-0.011
Token around
Token ID1088
Feature activation+6.55e-03

Pos logit contributions
 duck+0.012
 kids+0.011
 umbrella+0.011
 Ph+0.011
Ph+0.010
Neg logit contributions
epad-0.019
licks-0.018
rou-0.016
 Does-0.016
 habitats-0.016
Token the
Token ID262
Feature activation+1.46e-02

Pos logit contributions
epad+0.016
licks+0.015
rou+0.014
auldron+0.014
 twice+0.014
Neg logit contributions
 duck-0.005
 kids-0.004
 Ph-0.004
 pig-0.004
 umbrella-0.003
Token pond
Token ID16723
Feature activation+3.40e-03

Pos logit contributions
licks+0.028
epad+0.028
rou+0.026
 twice+0.026
 Does+0.025
Neg logit contributions
 umbrella-0.014
 Ph-0.014
 duck-0.014
 kids-0.014
Ph-0.013
Token.
Token ID13
Feature activation+7.88e-03

Pos logit contributions
epad+0.008
licks+0.008
 Does+0.008
asper+0.008
ems+0.008
Neg logit contributions
 duck-0.002
 kids-0.002
 umbrella-0.001
 best-0.001
 boys-0.001
Token Ho
Token ID9544
Feature activation+6.11e-02

Pos logit contributions
epad+0.016
licks+0.015
rou+0.014
 twice+0.013
ited+0.013
Neg logit contributions
 duck-0.009
 kids-0.008
 Ph-0.008
 pig-0.008
 umbrella-0.008
Tokenppy
Token ID14097
Feature activation+1.15e-03

Pos logit contributions
epad+0.090
licks+0.084
 twice+0.078
 facts+0.077
 habitats+0.076
Neg logit contributions
 duck-0.094
 best-0.085
 kids-0.085
 pig-0.083
n-0.081
Token jumped
Token ID11687
Feature activation+1.86e-03

Pos logit contributions
epad+0.002
licks+0.002
 twice+0.002
rou+0.002
 Does+0.002
Neg logit contributions
 duck-0.001
 kids-0.001
 umbrella-0.001
 pig-0.001
 Ph-0.001
Token and
Token ID290
Feature activation+8.43e-03

Pos logit contributions
epad+0.004
licks+0.004
 habitats+0.003
 facts+0.003
asper+0.003
Neg logit contributions
 duck-0.002
 umbrella-0.002
 Ph-0.002
 best-0.002
 k-0.002
Token caught
Token ID4978
Feature activation+6.61e-03

Pos logit contributions
epad+0.018
licks+0.017
 habitats+0.016
rou+0.016
 facts+0.016
Neg logit contributions
 duck-0.009
 Ph-0.008
 k-0.008
 kids-0.008
 umbrella-0.008
Token the
Token ID262
Feature activation+4.46e-03

Pos logit contributions
epad+0.014
licks+0.014
 twice+0.013
rou+0.013
 Does+0.013
Neg logit contributions
 duck-0.005
 umbrella-0.005
 kids-0.005
 Ph-0.005
Ph-0.005
Token and
Token ID290
Feature activation+1.10e-02

Pos logit contributions
epad+0.026
licks+0.025
 Does+0.024
rou+0.023
ems+0.023
Neg logit contributions
 duck-0.008
 best-0.006
 kids-0.006
 umbrella-0.006
 boat-0.006
Token saw
Token ID2497
Feature activation+7.73e-03

Pos logit contributions
epad+0.022
licks+0.020
rou+0.019
 facts+0.019
 habitats+0.018
Neg logit contributions
 duck-0.013
 Ph-0.012
 pig-0.012
 kids-0.012
 umbrella-0.012
Token a
Token ID257
Feature activation+1.16e-02

Pos logit contributions
epad+0.018
licks+0.017
rou+0.016
 twice+0.016
minded+0.016
Neg logit contributions
 duck-0.006
 kids-0.005
 umbrella-0.005
 Ph-0.005
 best-0.005
Token drain
Token ID14782
Feature activation-2.21e-03

Pos logit contributions
licks+0.027
epad+0.026
 twice+0.024
minded+0.024
rou+0.024
Neg logit contributions
 duck-0.010
 umbrella-0.009
 kids-0.008
 pig-0.008
 k-0.008
Token.
Token ID13
Feature activation+1.34e-02

Pos logit contributions
 duck+0.002
 umbrella+0.002
 kids+0.002
 Ph+0.002
 best+0.002
Neg logit contributions
epad-0.005
licks-0.005
minded-0.004
rou-0.004
 Does-0.004
Token Ho
Token ID9544
Feature activation+6.05e-02

Pos logit contributions
epad+0.023
licks+0.022
rou+0.020
 twice+0.020
minded+0.019
Neg logit contributions
 duck-0.018
 kids-0.017
 umbrella-0.017
 Ph-0.017
Ph-0.016
Tokenppy
Token ID14097
Feature activation+5.48e-03

Pos logit contributions
epad+0.073
licks+0.071
 twice+0.062
rou+0.061
 Does+0.059
Neg logit contributions
 duck-0.106
 kids-0.102
 umbrella-0.101
 Ph-0.101
Ph-0.100
Token rushed
Token ID13999
Feature activation+3.92e-03

Pos logit contributions
epad+0.011
licks+0.010
rou+0.009
 twice+0.009
 Does+0.009
Neg logit contributions
 duck-0.006
 kids-0.006
 umbrella-0.006
 Ph-0.005
 pig-0.005
Token to
Token ID284
Feature activation+1.40e-02

Pos logit contributions
epad+0.007
licks+0.006
rou+0.006
 facts+0.006
 habitats+0.006
Neg logit contributions
 duck-0.005
 umbrella-0.005
 Ph-0.005
 kids-0.005
Ph-0.005
Token help
Token ID1037
Feature activation+5.01e-03

Pos logit contributions
epad+0.026
licks+0.025
 twice+0.023
rou+0.023
 Does+0.022
Neg logit contributions
 duck-0.015
 umbrella-0.015
 kids-0.015
 Ph-0.015
Ph-0.014
Token him
Token ID683
Feature activation+4.51e-03

Pos logit contributions
licks+0.009
epad+0.008
 twice+0.008
 recite+0.008
 Does+0.007
Neg logit contributions
 umbrella-0.006
 kids-0.006
Ph-0.006
 Ph-0.006
 duck-0.006
Token.
Token ID13
Feature activation+1.92e-02

Pos logit contributions
 duck+0.004
 kids+0.004
 umbrella+0.004
 best+0.003
 Ph+0.003
Neg logit contributions
licks-0.012
epad-0.012
 Does-0.011
rou-0.011
 habitats-0.011
Token " "
Token ID220
Feature activation+1.30e-02

Pos logit contributions
epad+0.041
licks+0.041
rou+0.038
 twice+0.038
minded+0.037
Neg logit contributions
 duck-0.016
 kids-0.015
 Ph-0.015
 umbrella-0.015
Ph-0.014
Token "\n"
Token ID198
Feature activation+1.27e-02

Pos logit contributions
licks+0.025
epad+0.024
 twice+0.023
rou+0.022
to+0.021
Neg logit contributions
 duck-0.015
 umbrella-0.015
 pig-0.013
 Ph-0.013
 kids-0.013
Token "\n"
Token ID198
Feature activation+1.56e-02

Pos logit contributions
epad+0.030
licks+0.030
rou+0.028
 twice+0.028
minded+0.027
Neg logit contributions
 duck-0.008
 kids-0.007
 Ph-0.007
Ph-0.007
 umbrella-0.007
TokenWhen
Token ID2215
Feature activation+9.01e-03

Pos logit contributions
epad+0.025
licks+0.025
rou+0.022
 twice+0.022
 habitats+0.021
Neg logit contributions
 duck-0.021
 kids-0.021
 Ph-0.020
Ph-0.020
 umbrella-0.020
Token Gr
Token ID1902
Feature activation+6.00e-02

Pos logit contributions
epad+0.015
licks+0.015
 twice+0.013
rou+0.013
 Does+0.013
Neg logit contributions
 duck-0.012
 umbrella-0.012
 kids-0.011
 Ph-0.011
Ph-0.011
Tokenacy
Token ID1590
Feature activation+5.59e-03

Pos logit contributions
epad+0.076
licks+0.075
 twice+0.069
 Does+0.068
rou+0.065
Neg logit contributions
 kids-0.098
 best-0.090
 duck-0.090
n-0.089
 Ph-0.088
Token got
Token ID1392
Feature activation-6.15e-03

Pos logit contributions
epad+0.010
licks+0.010
rou+0.009
 twice+0.009
 Does+0.009
Neg logit contributions
 duck-0.007
 kids-0.006
 umbrella-0.006
 Ph-0.006
Ph-0.006
Token to
Token ID284
Feature activation+4.93e-03

Pos logit contributions
 duck+0.006
 kids+0.006
 umbrella+0.006
 Ph+0.006
 best+0.006
Neg logit contributions
epad-0.012
licks-0.012
rou-0.011
 twice-0.011
 habitats-0.011
Token the
Token ID262
Feature activation+3.49e-03

Pos logit contributions
epad+0.011
licks+0.011
rou+0.010
 twice+0.010
minded+0.010
Neg logit contributions
 duck-0.004
 umbrella-0.004
 kids-0.004
 Ph-0.003
Ph-0.003
Token house
Token ID2156
Feature activation+1.15e-02

Pos logit contributions
epad+0.007
licks+0.007
 twice+0.006
rou+0.006
minded+0.006
Neg logit contributions
 duck-0.004
 umbrella-0.004
 kids-0.003
 Ph-0.003
Ph-0.003
Token bug
Token ID5434
Feature activation-5.32e-03

Pos logit contributions
epad+0.023
licks+0.022
rou+0.021
 twice+0.020
minded+0.020
Neg logit contributions
 duck-0.008
 kids-0.007
 umbrella-0.007
 pig-0.006
 Ph-0.006
Token that
Token ID326
Feature activation-1.01e-03

Pos logit contributions
 duck+0.005
 umbrella+0.005
 kids+0.005
 Ph+0.005
 best+0.005
Neg logit contributions
epad-0.011
licks-0.010
rou-0.010
 twice-0.010
minded-0.009
Token got
Token ID1392
Feature activation-1.82e-03

Pos logit contributions
 duck+0.001
 kids+0.001
 umbrella+0.001
 Ph+0.001
 best+0.001
Neg logit contributions
epad-0.002
licks-0.002
rou-0.002
 twice-0.002
minded-0.002
Token away
Token ID1497
Feature activation+1.26e-03

Pos logit contributions
 duck+0.002
 kids+0.001
 umbrella+0.001
 best+0.001
 pig+0.001
Neg logit contributions
epad-0.004
licks-0.004
 habitats-0.004
auldron-0.004
 Does-0.004
Token.
Token ID13
Feature activation+9.23e-03

Pos logit contributions
epad+0.003
licks+0.003
 Does+0.002
asper+0.002
 recite+0.002
Neg logit contributions
 duck-0.001
 kids-0.001
 best-0.001
 umbrella-0.001
 boys-0.001
Token Ho
Token ID9544
Feature activation+5.93e-02

Pos logit contributions
epad+0.018
licks+0.017
rou+0.016
 twice+0.015
 recite+0.015
Neg logit contributions
 duck-0.011
 kids-0.010
 pig-0.010
 Ph-0.010
 umbrella-0.010
Tokenppy
Token ID14097
Feature activation+3.13e-03

Pos logit contributions
epad+0.093
licks+0.085
 facts+0.082
 twice+0.081
 attent+0.079
Neg logit contributions
 duck-0.086
 best-0.075
 kids-0.074
 pig-0.074
n-0.073
Token's
Token ID338
Feature activation-4.34e-03

Pos logit contributions
epad+0.006
licks+0.006
rou+0.005
 twice+0.005
 Does+0.005
Neg logit contributions
 duck-0.004
 kids-0.004
 umbrella-0.003
 Ph-0.003
 pig-0.003
Token friends
Token ID2460
Feature activation+5.50e-03

Pos logit contributions
 Ph+0.003
 duck+0.003
 umbrella+0.002
Ph+0.002
 kids+0.002
Neg logit contributions
epad-0.010
licks-0.010
rou-0.009
 twice-0.009
minded-0.009
Token told
Token ID1297
Feature activation+2.33e-02

Pos logit contributions
epad+0.010
licks+0.010
rou+0.009
 twice+0.009
 habitats+0.009
Neg logit contributions
 duck-0.006
 kids-0.006
 umbrella-0.006
 Ph-0.006
 best-0.006
Token him
Token ID683
Feature activation-7.63e-03

Pos logit contributions
licks+0.032
epad+0.031
 twice+0.028
minded+0.027
rou+0.027
Neg logit contributions
 umbrella-0.035
 duck-0.035
Ph-0.035
 kids-0.034
 Ph-0.034
Token reach
Token ID3151
Feature activation+1.64e-03

Pos logit contributions
epad+0.027
licks+0.027
 twice+0.025
rou+0.024
minded+0.023
Neg logit contributions
 duck-0.022
 umbrella-0.022
 kids-0.022
 Ph-0.021
Ph-0.021
Token it
Token ID340
Feature activation+1.71e-02

Pos logit contributions
epad+0.003
licks+0.003
rou+0.003
 twice+0.003
 Does+0.003
Neg logit contributions
 duck-0.002
 kids-0.002
 umbrella-0.002
 Ph-0.001
Ph-0.001
Token.
Token ID13
Feature activation+1.89e-02

Pos logit contributions
epad+0.036
licks+0.036
rou+0.033
 twice+0.032
minded+0.032
Neg logit contributions
 duck-0.014
 umbrella-0.014
 kids-0.014
 Ph-0.014
 best-0.013
Token Eventually
Token ID16178
Feature activation+1.70e-02

Pos logit contributions
epad+0.031
licks+0.030
rou+0.027
 twice+0.027
minded+0.026
Neg logit contributions
 duck-0.026
 kids-0.025
 Ph-0.025
 umbrella-0.025
Ph-0.024
Token,
Token ID11
Feature activation+1.44e-02

Pos logit contributions
licks+0.030
epad+0.030
 twice+0.029
 Does+0.027
rou+0.027
Neg logit contributions
 umbrella-0.019
 duck-0.018
 kids-0.018
Ph-0.017
 Ph-0.017
Token Ho
Token ID9544
Feature activation+5.90e-02

Pos logit contributions
epad+0.025
licks+0.025
rou+0.023
 habitats+0.022
 twice+0.021
Neg logit contributions
 duck-0.019
 kids-0.018
 Ph-0.018
 umbrella-0.017
 pig-0.017
Tokenppy
Token ID14097
Feature activation+5.04e-03

Pos logit contributions
epad+0.079
licks+0.077
 twice+0.067
 facts+0.066
minded+0.065
Neg logit contributions
 duck-0.096
 Ph-0.090
 kids-0.090
 umbrella-0.089
Ph-0.089
Token was
Token ID373
Feature activation-3.44e-03

Pos logit contributions
epad+0.009
licks+0.009
rou+0.008
 twice+0.008
 Does+0.007
Neg logit contributions
 duck-0.006
 kids-0.006
 umbrella-0.006
 Ph-0.006
Ph-0.006
Token tired
Token ID10032
Feature activation+7.38e-04

Pos logit contributions
 duck+0.004
 kids+0.004
 umbrella+0.004
 Ph+0.004
 best+0.004
Neg logit contributions
epad-0.006
licks-0.006
rou-0.006
 Does-0.006
minded-0.006
Token,
Token ID11
Feature activation+1.79e-03

Pos logit contributions
epad+0.002
licks+0.002
rou+0.001
 Does+0.001
 habitats+0.001
Neg logit contributions
 duck-0.001
 kids-0.001
 umbrella-0.001
 Ph-0.001
 best-0.001
Token so
Token ID523
Feature activation+6.24e-03

Pos logit contributions
licks+0.004
epad+0.004
rou+0.004
 habitats+0.004
minded+0.004
Neg logit contributions
 kids-0.001
 duck-0.001
 umbrella-0.001
 Ph-0.001
 best-0.001
Token family
Token ID1641
Feature activation+4.62e-04

Pos logit contributions
 Ph+0.003
Ph+0.003
 duck+0.003
 umbrella+0.002
 kids+0.002
Neg logit contributions
epad-0.009
licks-0.009
rou-0.009
 twice-0.009
 habitats-0.008
Token.
Token ID13
Feature activation+1.43e-02

Pos logit contributions
epad+0.001
licks+0.001
rou+0.001
 twice+0.001
 Does+0.001
Neg logit contributions
 duck-0.000
 kids-0.000
 umbrella-0.000
 Ph-0.000
 best-0.000
Token "\n"
Token ID198
Feature activation+3.29e-03

Pos logit contributions
licks+0.027
epad+0.027
 twice+0.025
rou+0.024
 Does+0.024
Neg logit contributions
 umbrella-0.015
 duck-0.015
 kids-0.015
 Ph-0.014
Ph-0.014
Token "\n"
Token ID198
Feature activation+1.12e-02

Pos logit contributions
licks+0.007
epad+0.007
 twice+0.006
rou+0.006
 Does+0.006
Neg logit contributions
 umbrella-0.003
 duck-0.003
 kids-0.003
 Ph-0.003
 best-0.003
TokenWhen
Token ID2215
Feature activation+3.76e-03

Pos logit contributions
epad+0.017
licks+0.017
rou+0.015
 twice+0.015
minded+0.014
Neg logit contributions
 duck-0.016
 umbrella-0.016
 kids-0.016
 Ph-0.016
Ph-0.015
Token Ho
Token ID9544
Feature activation+5.89e-02

Pos logit contributions
licks+0.006
epad+0.006
 twice+0.005
 Does+0.005
rou+0.005
Neg logit contributions
 umbrella-0.005
 kids-0.005
 duck-0.005
 Ph-0.005
Ph-0.005
Tokenppy
Token ID14097
Feature activation+1.05e-02

Pos logit contributions
epad+0.092
licks+0.088
 facts+0.082
 twice+0.078
 Does+0.077
Neg logit contributions
 duck-0.085
n-0.078
 pig-0.075
 best-0.074
Ph-0.073
Token got
Token ID1392
Feature activation-7.74e-03

Pos logit contributions
epad+0.019
licks+0.018
rou+0.016
 twice+0.016
 Does+0.016
Neg logit contributions
 duck-0.013
 umbrella-0.012
 kids-0.012
 Ph-0.012
Ph-0.012
Token home
Token ID1363
Feature activation-1.46e-03

Pos logit contributions
 duck+0.008
 umbrella+0.008
 kids+0.008
 Ph+0.008
Ph+0.007
Neg logit contributions
epad-0.015
licks-0.015
 twice-0.014
rou-0.014
minded-0.013
Token,
Token ID11
Feature activation+7.55e-03

Pos logit contributions
 duck+0.002
 umbrella+0.001
 kids+0.001
 Ph+0.001
Ph+0.001
Neg logit contributions
licks-0.003
epad-0.003
 twice-0.003
rou-0.002
minded-0.002
Token his
Token ID465
Feature activation-2.26e-03

Pos logit contributions
epad+0.013
licks+0.013
rou+0.011
 twice+0.011
minded+0.011
Neg logit contributions
 duck-0.010
 umbrella-0.009
 kids-0.009
 Ph-0.009
Ph-0.009
Token but
Token ID475
Feature activation+1.47e-03

Pos logit contributions
epad+0.007
licks+0.007
rou+0.006
 twice+0.006
minded+0.006
Neg logit contributions
 duck-0.003
 kids-0.003
 umbrella-0.003
 Ph-0.003
Ph-0.003
Token it
Token ID340
Feature activation-1.07e-03

Pos logit contributions
licks+0.003
epad+0.003
 twice+0.003
 Does+0.003
rou+0.003
Neg logit contributions
 umbrella-0.001
 kids-0.001
 duck-0.001
 Ph-0.001
Ph-0.001
Token flew
Token ID13112
Feature activation-1.04e-03

Pos logit contributions
 kids+0.001
 umbrella+0.001
 duck+0.001
 Ph+0.001
Ph+0.001
Neg logit contributions
licks-0.002
epad-0.002
 twice-0.002
rou-0.002
 Does-0.002
Token away
Token ID1497
Feature activation+8.47e-03

Pos logit contributions
 umbrella+0.001
 best+0.001
 duck+0.001
 Ph+0.001
 k+0.001
Neg logit contributions
epad-0.002
 facts-0.002
 writer-0.002
 recite-0.002
isdom-0.002
Token.
Token ID13
Feature activation+1.61e-02

Pos logit contributions
licks+0.021
epad+0.021
asper+0.021
 habitats+0.020
 Does+0.020
Neg logit contributions
 duck-0.005
 best-0.004
 k-0.004
 umbrella-0.004
 Ph-0.004
Token Ho
Token ID9544
Feature activation+5.88e-02

Pos logit contributions
licks+0.031
epad+0.031
 twice+0.029
 Does+0.028
rou+0.028
Neg logit contributions
 umbrella-0.016
 duck-0.016
 kids-0.015
 Ph-0.015
Ph-0.014
Tokenppy
Token ID14097
Feature activation+6.16e-03

Pos logit contributions
epad+0.089
licks+0.082
 twice+0.077
 facts+0.076
minded+0.075
Neg logit contributions
 duck-0.088
 best-0.078
 kids-0.078
n-0.078
 pig-0.077
Token felt
Token ID2936
Feature activation+1.15e-02

Pos logit contributions
epad+0.012
licks+0.012
rou+0.011
 twice+0.011
 Does+0.011
Neg logit contributions
 duck-0.006
 kids-0.006
 umbrella-0.006
 Ph-0.006
Ph-0.005
Token sad
Token ID6507
Feature activation+1.54e-02

Pos logit contributions
epad+0.024
licks+0.024
rou+0.022
 twice+0.022
minded+0.021
Neg logit contributions
 duck-0.010
 umbrella-0.010
 kids-0.010
 Ph-0.009
Ph-0.009
Token and
Token ID290
Feature activation+2.36e-02

Pos logit contributions
epad+0.033
licks+0.032
rou+0.029
 habitats+0.029
 recite+0.028
Neg logit contributions
 duck-0.014
 kids-0.014
 umbrella-0.014
 best-0.013
 Ph-0.013
Token started
Token ID2067
Feature activation-2.46e-03

Pos logit contributions
epad+0.048
licks+0.046
rou+0.042
 twice+0.041
asper+0.040
Neg logit contributions
 duck-0.026
 kids-0.025
 umbrella-0.024
 Ph-0.023
Ph-0.022
Token shake
Token ID13279
Feature activation-5.69e-04

Pos logit contributions
epad+0.017
licks+0.017
 twice+0.016
rou+0.015
 recite+0.015
Neg logit contributions
 duck-0.008
 kids-0.008
Ph-0.008
 umbrella-0.008
 Ph-0.008
Token and
Token ID290
Feature activation+8.60e-03

Pos logit contributions
 duck+0.000
 umbrella+0.000
 kids+0.000
 Ph+0.000
 best+0.000
Neg logit contributions
epad-0.001
licks-0.001
rou-0.001
 Does-0.001
 habitats-0.001
Token rum
Token ID7440
Feature activation+3.81e-04

Pos logit contributions
epad+0.019
licks+0.018
rou+0.017
 habitats+0.016
minded+0.016
Neg logit contributions
 duck-0.009
 Ph-0.008
 umbrella-0.008
 kids-0.008
 pig-0.008
Tokenble
Token ID903
Feature activation-3.32e-03

Pos logit contributions
epad+0.001
licks+0.001
rou+0.001
 twice+0.001
minded+0.001
Neg logit contributions
 duck-0.000
 umbrella-0.000
 kids-0.000
 Ph-0.000
Ph-0.000
Token.
Token ID13
Feature activation+1.44e-02

Pos logit contributions
 duck+0.003
 umbrella+0.003
 kids+0.003
 Ph+0.003
Ph+0.003
Neg logit contributions
epad-0.007
licks-0.006
rou-0.006
 Does-0.006
minded-0.006
Token Ho
Token ID9544
Feature activation+5.81e-02

Pos logit contributions
epad+0.024
licks+0.023
 twice+0.021
rou+0.021
 Does+0.020
Neg logit contributions
 duck-0.019
 umbrella-0.019
 kids-0.019
 Ph-0.018
Ph-0.018
Tokenppy
Token ID14097
Feature activation+5.22e-03

Pos logit contributions
epad+0.089
licks+0.083
 facts+0.077
 twice+0.075
 habitats+0.074
Neg logit contributions
 duck-0.086
 pig-0.076
 kids-0.076
 best-0.076
n-0.075
Token opened
Token ID4721
Feature activation+1.04e-02

Pos logit contributions
epad+0.011
licks+0.011
rou+0.010
minded+0.010
 facts+0.010
Neg logit contributions
 duck-0.005
 kids-0.005
 umbrella-0.005
 pig-0.004
 Ph-0.004
Token his
Token ID465
Feature activation+1.10e-02

Pos logit contributions
epad+0.023
licks+0.023
rou+0.021
 habitats+0.021
minded+0.021
Neg logit contributions
 duck-0.009
 umbrella-0.008
 Ph-0.008
 kids-0.008
 pig-0.007
Token eyes
Token ID2951
Feature activation+1.96e-02

Pos logit contributions
licks+0.014
epad+0.014
 twice+0.012
rou+0.012
auldron+0.011
Neg logit contributions
 Ph-0.018
 duck-0.018
Ph-0.018
 kids-0.017
 umbrella-0.017
Token and
Token ID290
Feature activation+6.01e-03

Pos logit contributions
epad+0.045
licks+0.045
 habitats+0.043
 facts+0.042
minded+0.041
Neg logit contributions
 umbrella-0.014
 duck-0.013
 Ph-0.012
 kids-0.012
 best-0.011
Token high
Token ID1029
Feature activation-9.56e-03

Pos logit contributions
epad+0.003
licks+0.003
 Does+0.003
 facts+0.003
 Enjoy+0.003
Neg logit contributions
 duck-0.001
 best-0.001
 k-0.001
 umbrella-0.001
Ph-0.001
Token in
Token ID287
Feature activation+1.07e-02

Pos logit contributions
 duck+0.007
 best+0.007
 k+0.007
 kids+0.006
 umbrella+0.006
Neg logit contributions
epad-0.022
licks-0.020
 recite-0.019
 Does-0.019
ems-0.019
Token the
Token ID262
Feature activation+7.80e-03

Pos logit contributions
licks+0.021
epad+0.021
 twice+0.020
rou+0.019
minded+0.019
Neg logit contributions
 duck-0.009
Ph-0.009
 Ph-0.009
 umbrella-0.009
 kids-0.009
Token sky
Token ID6766
Feature activation+8.16e-03

Pos logit contributions
epad+0.014
licks+0.014
rou+0.013
 twice+0.013
 Does+0.013
Neg logit contributions
 duck-0.009
 umbrella-0.009
 kids-0.008
 Ph-0.008
Ph-0.008
Token.
Token ID13
Feature activation+1.79e-02

Pos logit contributions
licks+0.022
epad+0.022
 Does+0.020
asper+0.020
minded+0.020
Neg logit contributions
 duck-0.003
 kids-0.003
 umbrella-0.002
 best-0.002
 bird-0.002
Token Ho
Token ID9544
Feature activation+5.80e-02

Pos logit contributions
epad+0.037
licks+0.036
rou+0.033
 twice+0.033
minded+0.032
Neg logit contributions
 duck-0.018
 kids-0.017
 Ph-0.016
 umbrella-0.016
 pig-0.016
Tokenppy
Token ID14097
Feature activation+5.33e-03

Pos logit contributions
epad+0.080
licks+0.075
 twice+0.068
minded+0.065
 facts+0.065
Neg logit contributions
 duck-0.095
 kids-0.088
 best-0.087
 pig-0.086
 umbrella-0.086
Token thought
Token ID1807
Feature activation+8.86e-03

Pos logit contributions
epad+0.011
licks+0.011
 twice+0.010
rou+0.010
minded+0.010
Neg logit contributions
 duck-0.005
 kids-0.005
 umbrella-0.005
 Ph-0.004
 pig-0.004
Token it
Token ID340
Feature activation+2.57e-04

Pos logit contributions
licks+0.016
epad+0.016
 twice+0.015
 Does+0.014
rou+0.014
Neg logit contributions
 Ph-0.009
Ph-0.009
 umbrella-0.009
 duck-0.009
 kids-0.009
Token looked
Token ID3114
Feature activation+8.30e-03

Pos logit contributions
licks+0.000
epad+0.000
 twice+0.000
 Does+0.000
rou+0.000
Neg logit contributions
 kids-0.000
 umbrella-0.000
 duck-0.000
 Ph-0.000
Ph-0.000
Token like
Token ID588
Feature activation-3.56e-03

Pos logit contributions
epad+0.019
licks+0.019
oes+0.017
 habitats+0.017
osity+0.017
Neg logit contributions
 best-0.006
 duck-0.006
 umbrella-0.006
 kids-0.006
 fine-0.006
Token a
Token ID257
Feature activation+3.54e-03

Pos logit contributions
 duck+0.003
 kids+0.003
 best+0.003
 umbrella+0.003
 Ph+0.003
Neg logit contributions
epad-0.007
licks-0.006
rou-0.006
 habitats-0.006
 twice-0.006
Token bug
Token ID5434
Feature activation-6.73e-03

Pos logit contributions
licks+0.009
epad+0.008
minded+0.008
 habitats+0.008
 twice+0.008
Neg logit contributions
 duck-0.003
 kids-0.002
 umbrella-0.002
 k-0.002
 best-0.002
Token next
Token ID1306
Feature activation-3.58e-03

Pos logit contributions
 duck+0.007
 umbrella+0.007
 kids+0.007
 best+0.006
 pig+0.006
Neg logit contributions
epad-0.013
licks-0.013
rou-0.012
 Does-0.011
 twice-0.011
Token time
Token ID640
Feature activation-1.04e-02

Pos logit contributions
 duck+0.005
 umbrella+0.005
 kids+0.005
 Ph+0.005
Ph+0.005
Neg logit contributions
epad-0.005
licks-0.005
 twice-0.005
rou-0.005
minded-0.005
Token.
Token ID13
Feature activation+1.04e-02

Pos logit contributions
 duck+0.008
 kids+0.008
 umbrella+0.007
 Ph+0.007
 best+0.007
Neg logit contributions
licks-0.023
epad-0.023
rou-0.021
 Does-0.021
 twice-0.020
Token Ho
Token ID9544
Feature activation+5.66e-02

Pos logit contributions
epad+0.020
licks+0.019
rou+0.018
 twice+0.017
 recite+0.016
Neg logit contributions
 duck-0.012
 kids-0.012
 Ph-0.011
 umbrella-0.011
 pig-0.011
Tokenppy
Token ID14097
Feature activation-3.22e-03

Pos logit contributions
epad+0.088
licks+0.081
 facts+0.077
 twice+0.076
 attent+0.074
Neg logit contributions
 duck-0.083
 best-0.073
 kids-0.072
 pig-0.072
n-0.071
Token smiled
Token ID13541
Feature activation+2.34e-02

Pos logit contributions
 duck+0.004
 kids+0.003
 umbrella+0.003
 Ph+0.003
Ph+0.003
Neg logit contributions
epad-0.006
licks-0.006
rou-0.006
 twice-0.006
 Does-0.005
Token and
Token ID290
Feature activation+1.43e-04

Pos logit contributions
epad+0.052
licks+0.052
 habitats+0.046
rou+0.046
 facts+0.045
Neg logit contributions
 duck-0.020
 best-0.018
 kids-0.018
 umbrella-0.017
 pig-0.016
Token felt
Token ID2936
Feature activation+2.70e-03

Pos logit contributions
epad+0.000
licks+0.000
 twice+0.000
rou+0.000
 Does+0.000
Neg logit contributions
 duck-0.000
 umbrella-0.000
 kids-0.000
 Ph-0.000
Ph-0.000
Token happy
Token ID3772
Feature activation-2.16e-03

Pos logit contributions
epad+0.005
licks+0.005
 twice+0.005
rou+0.004
 Does+0.004
Neg logit contributions
 umbrella-0.003
 duck-0.003
 kids-0.003
 Ph-0.003
 train-0.003
Token sun
Token ID4252
Feature activation-3.36e-03

Pos logit contributions
Ph+0.005
 Ph+0.005
 umbrella+0.005
 duck+0.005
 je+0.004
Neg logit contributions
licks-0.011
epad-0.011
auldron-0.011
 twice-0.011
 habitats-0.011
Token began
Token ID2540
Feature activation-1.59e-02

Pos logit contributions
 kids+0.005
 umbrella+0.005
Ph+0.005
 Ph+0.005
 duck+0.005
Neg logit contributions
licks-0.004
epad-0.004
 twice-0.004
 Does-0.004
rou-0.004
Token to
Token ID284
Feature activation-5.10e-03

Pos logit contributions
 duck+0.023
 umbrella+0.021
 Ph+0.021
 kids+0.020
 best+0.020
Neg logit contributions
epad-0.027
licks-0.027
rou-0.024
 Does-0.023
minded-0.023
Token set
Token ID900
Feature activation+4.64e-04

Pos logit contributions
 duck+0.005
 umbrella+0.005
 kids+0.005
 Ph+0.005
Ph+0.005
Neg logit contributions
epad-0.010
licks-0.010
rou-0.009
 twice-0.009
 Does-0.009
Token,
Token ID11
Feature activation+8.79e-03

Pos logit contributions
epad+0.001
licks+0.001
rou+0.001
 Does+0.001
minded+0.001
Neg logit contributions
 duck-0.000
 umbrella-0.000
 Ph-0.000
 kids-0.000
 best-0.000
Token Ho
Token ID9544
Feature activation+5.65e-02

Pos logit contributions
epad+0.013
licks+0.013
rou+0.011
 twice+0.011
minded+0.011
Neg logit contributions
 duck-0.013
 kids-0.013
 umbrella-0.013
 Ph-0.013
Ph-0.012
Tokenppy
Token ID14097
Feature activation+2.71e-03

Pos logit contributions
epad+0.084
licks+0.081
 facts+0.075
 twice+0.073
minded+0.071
Neg logit contributions
 duck-0.084
 best-0.076
 kids-0.076
 pig-0.075
Ph-0.073
Token realized
Token ID6939
Feature activation+6.58e-03

Pos logit contributions
epad+0.006
licks+0.005
 facts+0.005
rou+0.005
 Does+0.005
Neg logit contributions
 duck-0.003
 kids-0.003
 pig-0.003
 umbrella-0.003
 k-0.003
Token it
Token ID340
Feature activation-8.25e-04

Pos logit contributions
epad+0.012
licks+0.012
 twice+0.011
rou+0.011
 Does+0.010
Neg logit contributions
 duck-0.008
 umbrella-0.008
 kids-0.008
 Ph-0.007
Ph-0.007
Token was
Token ID373
Feature activation-4.45e-03

Pos logit contributions
 duck+0.001
 umbrella+0.001
 kids+0.001
 Ph+0.001
Ph+0.001
Neg logit contributions
epad-0.002
licks-0.002
 twice-0.002
rou-0.002
 Does-0.002
Token time
Token ID640
Feature activation-9.32e-03

Pos logit contributions
 duck+0.005
 umbrella+0.004
 kids+0.004
 Ph+0.004
Ph+0.004
Neg logit contributions
epad-0.009
licks-0.008
 twice-0.008
rou-0.008
 Does-0.008
Token "\n"
Token ID198
Feature activation+1.27e-02

Pos logit contributions
epad+0.026
licks+0.025
rou+0.024
 twice+0.024
minded+0.023
Neg logit contributions
 duck-0.008
 umbrella-0.008
 kids-0.007
 Ph-0.007
Ph-0.007
TokenThe
Token ID464
Feature activation+4.32e-03

Pos logit contributions
epad+0.020
licks+0.019
rou+0.017
 twice+0.016
minded+0.016
Neg logit contributions
 duck-0.019
Ph-0.018
n-0.018
 kids-0.018
 Ph-0.018
Token next
Token ID1306
Feature activation+5.50e-04

Pos logit contributions
epad+0.007
licks+0.006
 twice+0.006
rou+0.006
minded+0.006
Neg logit contributions
 Ph-0.006
Ph-0.005
 umbrella-0.005
n-0.005
 je-0.005
Token day
Token ID1110
Feature activation+1.04e-02

Pos logit contributions
epad+0.001
licks+0.001
 twice+0.001
rou+0.001
minded+0.001
Neg logit contributions
 Ph-0.001
 umbrella-0.001
Ph-0.000
 duck-0.000
 kids-0.000
Token,
Token ID11
Feature activation+8.89e-03

Pos logit contributions
licks+0.019
epad+0.019
 twice+0.018
 Does+0.018
rou+0.018
Neg logit contributions
 umbrella-0.011
 duck-0.010
 kids-0.010
 Ph-0.010
Ph-0.010
Token Ho
Token ID9544
Feature activation+5.65e-02

Pos logit contributions
epad+0.013
licks+0.013
rou+0.012
 twice+0.011
minded+0.011
Neg logit contributions
 duck-0.013
 kids-0.013
 umbrella-0.013
 Ph-0.013
Ph-0.012
Tokenppy
Token ID14097
Feature activation-3.33e-03

Pos logit contributions
epad+0.083
licks+0.080
 facts+0.072
 twice+0.071
 habitats+0.070
Neg logit contributions
 duck-0.086
 best-0.078
 kids-0.078
 pig-0.077
n-0.074
Token saw
Token ID2497
Feature activation+3.56e-03

Pos logit contributions
 duck+0.004
 kids+0.003
 umbrella+0.003
 pig+0.003
 Ph+0.003
Neg logit contributions
epad-0.007
licks-0.006
rou-0.006
 twice-0.006
 Does-0.006
Token a
Token ID257
Feature activation+8.03e-03

Pos logit contributions
epad+0.008
licks+0.008
 twice+0.007
rou+0.007
minded+0.007
Neg logit contributions
 duck-0.003
 kids-0.002
 umbrella-0.002
 Ph-0.002
 best-0.002
Token fat
Token ID3735
Feature activation-2.25e-03

Pos logit contributions
licks+0.019
epad+0.019
 twice+0.017
minded+0.017
 habitats+0.017
Neg logit contributions
 duck-0.006
 umbrella-0.005
 kids-0.005
 best-0.005
 pig-0.005
Token fly
Token ID6129
Feature activation-6.31e-03

Pos logit contributions
 duck+0.002
 kids+0.002
 umbrella+0.002
 pig+0.002
 Ph+0.002
Neg logit contributions
epad-0.005
licks-0.005
minded-0.005
 twice-0.005
rou-0.005
TokenOn
Token ID2202
Feature activation-3.54e-03

Pos logit contributions
epad+0.019
licks+0.019
rou+0.017
 twice+0.017
minded+0.016
Neg logit contributions
 duck-0.018
 kids-0.017
Ph-0.017
 Ph-0.017
 umbrella-0.017
Token his
Token ID465
Feature activation+5.41e-04

Pos logit contributions
 kids+0.003
 umbrella+0.003
 duck+0.003
 Ph+0.003
 boys+0.003
Neg logit contributions
licks-0.007
epad-0.007
 twice-0.006
rou-0.006
 Does-0.006
Token journey
Token ID7002
Feature activation-2.20e-03

Pos logit contributions
by+0.001
 habitats+0.001
 reads+0.001
 recite+0.001
 solar+0.001
Neg logit contributions
 Ph-0.001
ter-0.001
Ph-0.001
ters-0.001
ily-0.001
Token back
Token ID736
Feature activation-6.26e-03

Pos logit contributions
 duck+0.002
 kids+0.002
 umbrella+0.002
 best+0.002
Ph+0.002
Neg logit contributions
epad-0.005
licks-0.005
rou-0.004
 Does-0.004
 habitats-0.004
Token,
Token ID11
Feature activation+6.99e-03

Pos logit contributions
 duck+0.006
 umbrella+0.005
 kids+0.005
 Ph+0.005
 best+0.005
Neg logit contributions
epad-0.013
licks-0.013
rou-0.012
 twice-0.012
 Does-0.012
Token Ho
Token ID9544
Feature activation+5.65e-02

Pos logit contributions
epad+0.011
licks+0.011
rou+0.010
 twice+0.009
minded+0.009
Neg logit contributions
 duck-0.010
 kids-0.010
 Ph-0.010
 umbrella-0.010
 best-0.009
Tokenppy
Token ID14097
Feature activation-1.55e-03

Pos logit contributions
epad+0.084
licks+0.082
 facts+0.075
 twice+0.073
 habitats+0.072
Neg logit contributions
 duck-0.084
 best-0.077
 kids-0.076
 pig-0.075
n-0.072
Token felt
Token ID2936
Feature activation+5.61e-03

Pos logit contributions
 duck+0.002
 kids+0.002
 umbrella+0.002
 Ph+0.002
Ph+0.002
Neg logit contributions
epad-0.003
licks-0.003
rou-0.002
 twice-0.002
 Does-0.002
Token proud
Token ID6613
Feature activation+1.52e-02

Pos logit contributions
epad+0.012
licks+0.012
rou+0.010
 habitats+0.010
minded+0.010
Neg logit contributions
 duck-0.005
 kids-0.005
 Ph-0.005
 umbrella-0.005
Ph-0.005
Token of
Token ID286
Feature activation+2.25e-03

Pos logit contributions
licks+0.035
epad+0.035
 recite+0.030
use+0.030
 habitats+0.030
Neg logit contributions
 kids-0.013
 duck-0.013
 best-0.011
 umbrella-0.011
 boys-0.011
Token himself
Token ID2241
Feature activation+6.26e-03

Pos logit contributions
epad+0.005
licks+0.005
 twice+0.004
rou+0.004
 Does+0.004
Neg logit contributions
 duck-0.002
 umbrella-0.002
 kids-0.002
 Ph-0.002
Ph-0.002
Token him
Token ID683
Feature activation-5.54e-03

Pos logit contributions
epad+0.023
licks+0.022
rou+0.020
 twice+0.020
auldron+0.020
Neg logit contributions
 duck-0.014
 kids-0.014
 Ph-0.013
 umbrella-0.013
 pig-0.013
Token what
Token ID644
Feature activation-1.15e-02

Pos logit contributions
 duck+0.007
 best+0.007
 kids+0.007
 pig+0.006
 umbrella+0.006
Neg logit contributions
epad-0.010
licks-0.010
auldron-0.009
ited-0.009
 habitats-0.009
Token was
Token ID373
Feature activation+3.01e-05

Pos logit contributions
 umbrella+0.014
 Ph+0.013
 duck+0.013
Ph+0.013
 kids+0.012
Neg logit contributions
epad-0.020
licks-0.020
 Does-0.019
rou-0.018
 twice-0.018
Token wrong
Token ID2642
Feature activation-7.73e-03

Pos logit contributions
epad+0.000
licks+0.000
 twice+0.000
rou+0.000
minded+0.000
Neg logit contributions
 umbrella-0.000
 duck-0.000
 Ph-0.000
Ph-0.000
 kids-0.000
Token.
Token ID13
Feature activation+1.31e-02

Pos logit contributions
 umbrella+0.008
Ph+0.008
 Ph+0.008
 duck+0.008
 kids+0.007
Neg logit contributions
epad-0.014
licks-0.014
 twice-0.014
rou-0.013
minded-0.013
Token Ho
Token ID9544
Feature activation+5.63e-02

Pos logit contributions
epad+0.023
licks+0.021
rou+0.020
 twice+0.019
 habitats+0.018
Neg logit contributions
 duck-0.018
 kids-0.017
 Ph-0.017
 pig-0.016
 umbrella-0.016
Tokenppy
Token ID14097
Feature activation-2.96e-03

Pos logit contributions
epad+0.088
licks+0.081
 facts+0.077
 twice+0.076
 attent+0.075
Neg logit contributions
 duck-0.081
 best-0.072
 pig-0.071
 kids-0.070
n-0.070
Token told
Token ID1297
Feature activation+2.19e-02

Pos logit contributions
 duck+0.004
 kids+0.003
 umbrella+0.003
 Ph+0.003
 pig+0.003
Neg logit contributions
epad-0.005
licks-0.005
rou-0.005
 twice-0.005
 Does-0.005
Token them
Token ID606
Feature activation-8.99e-03

Pos logit contributions
licks+0.034
epad+0.033
 twice+0.031
minded+0.030
rou+0.029
Neg logit contributions
 umbrella-0.029
 duck-0.029
Ph-0.029
 kids-0.028
 Ph-0.028
Token about
Token ID546
Feature activation+2.49e-03

Pos logit contributions
 duck+0.009
 kids+0.009
 umbrella+0.008
 boys+0.008
 best+0.008
Neg logit contributions
epad-0.019
licks-0.018
 habitats-0.017
rou-0.017
 Does-0.017
Token the
Token ID262
Feature activation+9.21e-03

Pos logit contributions
epad+0.006
licks+0.006
rou+0.005
 twice+0.005
minded+0.005
Neg logit contributions
 duck-0.002
 kids-0.002
 umbrella-0.002
 Ph-0.002
 pig-0.002
Token,
Token ID11
Feature activation+3.33e-03

Pos logit contributions
 best+0.002
 kids+0.002
 duck+0.002
 umbrella+0.002
 boys+0.002
Neg logit contributions
licks-0.008
epad-0.008
 Does-0.008
ems-0.008
 recite-0.008
Token just
Token ID655
Feature activation+3.61e-03

Pos logit contributions
epad+0.008
licks+0.008
asper+0.008
 facts+0.007
 habitats+0.007
Neg logit contributions
 duck-0.003
 best-0.002
 kids-0.002
 k-0.002
 pig-0.002
Token like
Token ID588
Feature activation+7.11e-04

Pos logit contributions
licks+0.007
epad+0.007
asper+0.007
auldron+0.006
rou+0.006
Neg logit contributions
 best-0.004
 duck-0.004
 kids-0.004
 umbrella-0.004
 Ph-0.004
Token before
Token ID878
Feature activation+9.42e-03

Pos logit contributions
epad+0.001
licks+0.001
 twice+0.001
rou+0.001
 Does+0.001
Neg logit contributions
 duck-0.001
 umbrella-0.001
 kids-0.001
 Ph-0.001
Ph-0.001
Token.
Token ID13
Feature activation+6.89e-03

Pos logit contributions
epad+0.017
licks+0.017
 twice+0.017
rou+0.016
 Does+0.016
Neg logit contributions
 duck-0.010
 umbrella-0.010
 kids-0.009
 Ph-0.009
Ph-0.009
Token Ho
Token ID9544
Feature activation+5.63e-02

Pos logit contributions
epad+0.013
licks+0.012
rou+0.011
 twice+0.011
minded+0.011
Neg logit contributions
 duck-0.008
 kids-0.008
 Ph-0.008
 pig-0.008
 umbrella-0.008
Tokenppy
Token ID14097
Feature activation+6.89e-03

Pos logit contributions
epad+0.088
licks+0.082
 facts+0.079
 twice+0.076
 habitats+0.075
Neg logit contributions
 duck-0.081
 best-0.072
 pig-0.072
 kids-0.070
n-0.069
Token was
Token ID373
Feature activation-2.09e-03

Pos logit contributions
epad+0.015
licks+0.014
rou+0.013
 twice+0.013
 Does+0.013
Neg logit contributions
 duck-0.007
 kids-0.006
 umbrella-0.006
 Ph-0.006
 pig-0.006
Token very
Token ID845
Feature activation+7.08e-03

Pos logit contributions
 umbrella+0.002
 duck+0.002
 kids+0.002
 Ph+0.002
 je+0.002
Neg logit contributions
epad-0.004
licks-0.004
 twice-0.004
rou-0.004
 Does-0.004
Token happy
Token ID3772
Feature activation+8.57e-04

Pos logit contributions
epad+0.017
licks+0.017
 twice+0.016
rou+0.016
minded+0.015
Neg logit contributions
 duck-0.004
 kids-0.004
 umbrella-0.004
 Ph-0.004
 best-0.003
Token and
Token ID290
Feature activation+1.10e-02

Pos logit contributions
epad+0.002
licks+0.002
rou+0.002
asper+0.002
 habitats+0.002
Neg logit contributions
 kids-0.001
 duck-0.001
 best-0.001
 boys-0.001
 umbrella-0.001
Token\n
Token ID198
Feature activation+7.75e-03

Pos logit contributions
epad+0.051
licks+0.051
rou+0.047
 twice+0.046
minded+0.045
Neg logit contributions
 duck-0.018
 kids-0.017
 umbrella-0.017
 Ph-0.017
Ph-0.016
Token\n
Token ID198
Feature activation+1.47e-02

Pos logit contributions
licks+0.016
epad+0.015
 twice+0.014
rou+0.014
 Does+0.014
Neg logit contributions
 umbrella-0.007
 duck-0.007
 Ph-0.007
 kids-0.007
 pig-0.006
TokenOne
Token ID3198
Feature activation+9.82e-03

Pos logit contributions
epad+0.024
licks+0.024
rou+0.021
 twice+0.021
minded+0.020
Neg logit contributions
 duck-0.020
 kids-0.019
Ph-0.019
 Ph-0.019
 umbrella-0.019
Token day
Token ID1110
Feature activation+5.91e-03

Pos logit contributions
epad+0.018
licks+0.018
 twice+0.017
rou+0.017
oes+0.017
Neg logit contributions
 Ph-0.009
 umbrella-0.009
Ph-0.009
 duck-0.009
 kids-0.009
Token,
Token ID11
Feature activation+1.03e-02

Pos logit contributions
epad+0.012
licks+0.012
rou+0.011
 twice+0.011
minded+0.011
Neg logit contributions
 duck-0.005
 kids-0.005
 umbrella-0.005
 Ph-0.005
Ph-0.005
Token Ho
Token ID9544
Feature activation+5.62e-02

Pos logit contributions
epad+0.015
licks+0.015
rou+0.014
 twice+0.013
minded+0.013
Neg logit contributions
 duck-0.015
 kids-0.015
 umbrella-0.015
 Ph-0.015
Ph-0.014
Tokenppy
Token ID14097
Feature activation+4.06e-03

Pos logit contributions
epad+0.087
licks+0.085
 facts+0.076
 twice+0.075
 attent+0.074
Neg logit contributions
 duck-0.081
 best-0.073
 pig-0.073
 kids-0.072
n-0.070
Token hurt
Token ID5938
Feature activation+1.67e-03

Pos logit contributions
epad+0.007
licks+0.007
rou+0.006
 twice+0.006
 Does+0.006
Neg logit contributions
 duck-0.005
 kids-0.005
 umbrella-0.005
 Ph-0.005
Ph-0.005
Token his
Token ID465
Feature activation-1.43e-03

Pos logit contributions
epad+0.003
licks+0.003
rou+0.003
 twice+0.003
minded+0.003
Neg logit contributions
 duck-0.002
 kids-0.002
 umbrella-0.002
 Ph-0.001
 best-0.001
Token leg
Token ID1232
Feature activation-4.99e-03

Pos logit contributions
 Ph+0.002
Ph+0.002
 kids+0.001
n+0.001
 duck+0.001
Neg logit contributions
epad-0.003
rou-0.002
licks-0.002
 twice-0.002
 habitats-0.002
Token.
Token ID13
Feature activation+1.88e-02

Pos logit contributions
 duck+0.003
 umbrella+0.003
 kids+0.003
 best+0.003
 k+0.003
Neg logit contributions
epad-0.012
licks-0.012
asper-0.011
minded-0.011
rou-0.011
Token hop
Token ID1725
Feature activation+1.82e-02

Pos logit contributions
epad+0.020
licks+0.019
 recite+0.018
 twice+0.018
oes+0.017
Neg logit contributions
 Ph-0.021
 kids-0.021
 boys-0.020
 best-0.020
Ph-0.020
Token for
Token ID329
Feature activation+2.17e-02

Pos logit contributions
epad+0.034
licks+0.033
rou+0.031
minded+0.030
 Does+0.030
Neg logit contributions
 duck-0.021
 umbrella-0.020
 kids-0.020
 Ph-0.019
Ph-0.019
Token a
Token ID257
Feature activation+3.91e-03

Pos logit contributions
licks+0.036
epad+0.036
 twice+0.035
 Does+0.034
 habitats+0.033
Neg logit contributions
 umbrella-0.023
 duck-0.023
Ph-0.022
 Ph-0.021
 train-0.021
Token week
Token ID1285
Feature activation+1.95e-03

Pos logit contributions
epad+0.007
licks+0.006
 twice+0.006
rou+0.006
 Does+0.006
Neg logit contributions
 kids-0.005
 duck-0.005
 Ph-0.005
 umbrella-0.005
Ph-0.004
Token.
Token ID13
Feature activation+1.93e-02

Pos logit contributions
licks+0.004
epad+0.004
rou+0.004
 Does+0.004
minded+0.004
Neg logit contributions
 duck-0.002
 kids-0.002
 umbrella-0.002
 Ph-0.001
 best-0.001
Token Ho
Token ID9544
Feature activation+5.60e-02

Pos logit contributions
epad+0.035
licks+0.033
rou+0.031
 twice+0.029
minded+0.029
Neg logit contributions
 duck-0.025
 kids-0.024
 Ph-0.023
 pig-0.023
 umbrella-0.022
Tokenppy
Token ID14097
Feature activation+5.40e-03

Pos logit contributions
epad+0.084
licks+0.080
 twice+0.073
 facts+0.073
 attent+0.072
Neg logit contributions
 duck-0.083
 best-0.076
 kids-0.075
 pig-0.075
n-0.073
Token was
Token ID373
Feature activation-3.52e-03

Pos logit contributions
epad+0.011
licks+0.011
rou+0.010
 twice+0.009
 Does+0.009
Neg logit contributions
 duck-0.006
 kids-0.006
 umbrella-0.005
 pig-0.005
 Ph-0.005
Token sad
Token ID6507
Feature activation+1.03e-02

Pos logit contributions
 duck+0.004
 umbrella+0.003
 kids+0.003
 Ph+0.003
Ph+0.003
Neg logit contributions
epad-0.007
licks-0.007
 twice-0.006
rou-0.006
 Does-0.006
Token,
Token ID11
Feature activation+1.03e-02

Pos logit contributions
epad+0.025
licks+0.024
asper+0.022
 habitats+0.022
 recite+0.021
Neg logit contributions
 kids-0.008
 duck-0.008
 best-0.008
 umbrella-0.007
 boys-0.006
Token but
Token ID475
Feature activation+1.97e-02

Pos logit contributions
epad+0.023
licks+0.023
rou+0.021
minded+0.021
asper+0.021
Neg logit contributions
 duck-0.009
 kids-0.008
 Ph-0.008
 umbrella-0.008
 best-0.007
Token hopped
Token ID45230
Feature activation+1.50e-03

Pos logit contributions
epad+0.012
licks+0.012
rou+0.011
minded+0.010
osity+0.010
Neg logit contributions
 duck-0.007
 kids-0.007
 best-0.007
 pig-0.006
 Ph-0.006
Token in
Token ID287
Feature activation+1.45e-02

Pos logit contributions
epad+0.003
licks+0.003
osity+0.003
isdom+0.003
rou+0.003
Neg logit contributions
 duck-0.001
 best-0.001
 kids-0.001
 umbrella-0.001
 two-0.001
Token the
Token ID262
Feature activation+3.52e-03

Pos logit contributions
epad+0.030
licks+0.030
 twice+0.028
rou+0.028
 Does+0.027
Neg logit contributions
 duck-0.012
 umbrella-0.012
 Ph-0.012
 kids-0.011
Ph-0.011
Token rain
Token ID6290
Feature activation-1.92e-02

Pos logit contributions
epad+0.008
licks+0.008
rou+0.007
 twice+0.007
minded+0.007
Neg logit contributions
 duck-0.003
 umbrella-0.003
 kids-0.003
 best-0.002
 Ph-0.002
Token.
Token ID13
Feature activation+1.84e-02

Pos logit contributions
 duck+0.013
 kids+0.012
 umbrella+0.012
 best+0.010
 Ph+0.010
Neg logit contributions
epad-0.046
licks-0.045
 Does-0.042
 twice-0.041
rou-0.041
Token Ho
Token ID9544
Feature activation+5.60e-02

Pos logit contributions
epad+0.036
licks+0.035
rou+0.033
 twice+0.032
minded+0.032
Neg logit contributions
 duck-0.019
 kids-0.018
 umbrella-0.018
 Ph-0.018
Ph-0.017
Tokenppy
Token ID14097
Feature activation+7.60e-03

Pos logit contributions
epad+0.080
licks+0.075
 twice+0.069
minded+0.067
 facts+0.066
Neg logit contributions
 duck-0.089
 kids-0.081
 best-0.081
 pig-0.080
n-0.080
Token was
Token ID373
Feature activation-5.42e-03

Pos logit contributions
epad+0.015
licks+0.015
rou+0.013
 twice+0.013
 Does+0.013
Neg logit contributions
 duck-0.008
 kids-0.008
 umbrella-0.008
 Ph-0.007
 best-0.007
Token very
Token ID845
Feature activation+4.34e-03

Pos logit contributions
 umbrella+0.005
 duck+0.005
 kids+0.005
 Ph+0.005
Ph+0.005
Neg logit contributions
licks-0.011
epad-0.011
 twice-0.010
rou-0.010
 Does-0.009
Token happy
Token ID3772
Feature activation-5.00e-04

Pos logit contributions
epad+0.010
licks+0.010
 twice+0.009
 Does+0.009
 habitats+0.009
Neg logit contributions
 duck-0.003
 kids-0.003
 best-0.003
 Ph-0.003
 umbrella-0.003
Token.
Token ID13
Feature activation+2.33e-02

Pos logit contributions
 kids+0.000
 duck+0.000
 best+0.000
 boys+0.000
 Ph+0.000
Neg logit contributions
epad-0.001
licks-0.001
asper-0.001
use-0.001
ited-0.001
Token As
Token ID1081
Feature activation+7.17e-04

Pos logit contributions
epad+0.001
licks+0.001
rou+0.001
 twice+0.001
minded+0.001
Neg logit contributions
 duck-0.000
 kids-0.000
 Ph-0.000
 umbrella-0.000
Ph-0.000
Token the
Token ID262
Feature activation-2.53e-03

Pos logit contributions
licks+0.002
epad+0.002
rou+0.001
auldron+0.001
ems+0.001
Neg logit contributions
 duck-0.001
 kids-0.001
 best-0.001
 Ph-0.001
 pig-0.001
Token farmer
Token ID18739
Feature activation+5.27e-03

Pos logit contributions
 duck+0.002
 umbrella+0.002
 kids+0.002
 Ph+0.002
Ph+0.002
Neg logit contributions
epad-0.005
licks-0.005
rou-0.005
 twice-0.005
 Does-0.005
Token scattered
Token ID16830
Feature activation+8.93e-03

Pos logit contributions
epad+0.010
licks+0.010
rou+0.009
 twice+0.009
 Does+0.009
Neg logit contributions
 duck-0.006
 kids-0.006
 umbrella-0.005
 Ph-0.005
 pig-0.005
Token the
Token ID262
Feature activation+6.44e-04

Pos logit contributions
epad+0.019
licks+0.018
rou+0.018
 Does+0.017
minded+0.017
Neg logit contributions
 duck-0.008
 umbrella-0.008
 kids-0.008
 Ph-0.007
 best-0.007
Token av
Token ID1196
Feature activation+5.57e-02

Pos logit contributions
licks+0.001
epad+0.001
rou+0.001
 twice+0.001
 Does+0.001
Neg logit contributions
 duck-0.001
 umbrella-0.001
 kids-0.001
 best-0.001
 pig-0.001
Tokenoc
Token ID420
Feature activation+2.77e-02

Pos logit contributions
epad+0.084
licks+0.079
rou+0.072
 Does+0.072
 twice+0.071
Neg logit contributions
 duck-0.081
 pig-0.079
 k-0.077
 je-0.077
 umbrella-0.076
Tokenados
Token ID22484
Feature activation+2.17e-02

Pos logit contributions
epad+0.052
licks+0.051
 Does+0.046
rou+0.046
 twice+0.046
Neg logit contributions
 duck-0.031
 kids-0.030
 pig-0.030
 umbrella-0.029
 boys-0.028
Token,
Token ID11
Feature activation+2.13e-03

Pos logit contributions
epad+0.041
licks+0.040
rou+0.037
 twice+0.036
 Does+0.036
Neg logit contributions
 duck-0.024
 umbrella-0.024
 kids-0.023
 Ph-0.022
 best-0.022
Token he
Token ID339
Feature activation+9.04e-03

Pos logit contributions
epad+0.004
licks+0.004
rou+0.004
 habitats+0.004
minded+0.004
Neg logit contributions
 duck-0.002
 kids-0.002
 umbrella-0.002
 Ph-0.002
 pig-0.002
Token noticed
Token ID6810
Feature activation+2.74e-03

Pos logit contributions
epad+0.018
licks+0.018
oes+0.017
 Enjoy+0.016
iffs+0.016
Neg logit contributions
 duck-0.010
 kids-0.009
 two-0.009
 pig-0.008
 boys-0.008
Token were
Token ID547
Feature activation-1.35e-03

Pos logit contributions
epad+0.016
licks+0.015
rou+0.014
 twice+0.014
 habitats+0.014
Neg logit contributions
 duck-0.008
 kids-0.008
 umbrella-0.007
 Ph-0.007
 pig-0.007
Token in
Token ID287
Feature activation-3.61e-03

Pos logit contributions
 duck+0.002
 umbrella+0.002
 kids+0.002
 Ph+0.002
Ph+0.002
Neg logit contributions
epad-0.002
licks-0.002
rou-0.002
 twice-0.002
 Does-0.002
Token the
Token ID262
Feature activation-8.95e-03

Pos logit contributions
Ph+0.004
 duck+0.004
 umbrella+0.004
n+0.003
 Ph+0.003
Neg logit contributions
epad-0.007
licks-0.006
 Does-0.006
 early-0.006
 recite-0.006
Token park
Token ID3952
Feature activation-1.44e-02

Pos logit contributions
 duck+0.011
 Ph+0.010
 umbrella+0.010
 kids+0.010
Ph+0.010
Neg logit contributions
epad-0.015
licks-0.015
rou-0.014
 twice-0.014
 Does-0.013
Token one
Token ID530
Feature activation-3.82e-03

Pos logit contributions
 kids+0.010
 duck+0.010
 umbrella+0.009
 best+0.009
 Ph+0.008
Neg logit contributions
licks-0.034
epad-0.033
rou-0.031
 Does-0.031
auldron-0.030
Token sunny
Token ID27737
Feature activation-4.94e-02

Pos logit contributions
 duck+0.003
 kids+0.003
 umbrella+0.003
 best+0.002
 Ph+0.002
Neg logit contributions
epad-0.009
licks-0.009
rou-0.008
 twice-0.008
minded-0.008
Token day
Token ID1110
Feature activation-4.15e-03

Pos logit contributions
 duck+0.042
 Ph+0.041
 umbrella+0.040
Ph+0.040
 kids+0.039
Neg logit contributions
epad-0.099
licks-0.097
 twice-0.091
rou-0.091
 habitats-0.089
Token.
Token ID13
Feature activation-4.54e-03

Pos logit contributions
 kids+0.003
 duck+0.003
 umbrella+0.003
 Ph+0.003
 best+0.003
Neg logit contributions
licks-0.009
epad-0.009
rou-0.008
 Does-0.008
minded-0.008
Token A
Token ID317
Feature activation-4.11e-03

Pos logit contributions
 umbrella+0.006
 duck+0.006
 best+0.005
n+0.005
Ph+0.005
Neg logit contributions
licks-0.007
epad-0.007
 Does-0.007
 twice-0.007
rou-0.006
Token ball
Token ID2613
Feature activation+3.18e-05

Pos logit contributions
 duck+0.002
 umbrella+0.002
 kids+0.002
 Ph+0.002
Ph+0.002
Neg logit contributions
epad-0.010
licks-0.010
rou-0.009
 twice-0.009
 Does-0.009
Token rolled
Token ID11686
Feature activation-9.09e-03

Pos logit contributions
epad+0.000
licks+0.000
rou+0.000
 twice+0.000
minded+0.000
Neg logit contributions
 duck-0.000
 umbrella-0.000
 kids-0.000
 Ph-0.000
Ph-0.000
Token and
Token ID290
Feature activation+2.94e-03

Pos logit contributions
 duck+0.010
 kids+0.010
 umbrella+0.010
 Ph+0.010
Ph+0.009
Neg logit contributions
licks-0.017
epad-0.017
rou-0.016
 habitats-0.015
minded-0.015
Token his
Token ID465
Feature activation-2.06e-02

Pos logit contributions
epad+0.005
licks+0.005
rou+0.005
 twice+0.005
 Does+0.004
Neg logit contributions
 duck-0.004
 kids-0.004
 umbrella-0.004
 Ph-0.003
Ph-0.003
Token stubborn
Token ID24444
Feature activation-1.81e-02

Pos logit contributions
 duck+0.013
 Ph+0.013
Ph+0.012
 umbrella+0.012
 kids+0.012
Neg logit contributions
epad-0.047
licks-0.046
rou-0.044
 twice-0.043
 habitats-0.042
Tokenness
Token ID1108
Feature activation-8.66e-03

Pos logit contributions
 kids+0.024
 duck+0.024
 umbrella+0.024
 best+0.023
 boys+0.022
Neg logit contributions
licks-0.031
epad-0.029
 twice-0.026
 Does-0.026
rou-0.026
Token gave
Token ID2921
Feature activation-3.81e-03

Pos logit contributions
 duck+0.010
 umbrella+0.010
 kids+0.010
 Ph+0.010
Ph+0.010
Neg logit contributions
epad-0.015
licks-0.015
 twice-0.014
rou-0.013
minded-0.013
Token way
Token ID835
Feature activation-4.81e-02

Pos logit contributions
 duck+0.004
 kids+0.004
 umbrella+0.004
 best+0.004
 Ph+0.004
Neg logit contributions
licks-0.007
epad-0.007
rou-0.006
 twice-0.006
 Does-0.006
Token to
Token ID284
Feature activation-5.01e-04

Pos logit contributions
 duck+0.055
 kids+0.054
 umbrella+0.054
 Ph+0.051
 best+0.051
Neg logit contributions
licks-0.088
epad-0.088
rou-0.079
 twice-0.078
 Does-0.077
Token understanding
Token ID4547
Feature activation+1.02e-02

Pos logit contributions
 duck+0.001
 umbrella+0.001
 kids+0.001
 Ph+0.000
Ph+0.000
Neg logit contributions
epad-0.001
licks-0.001
 twice-0.001
rou-0.001
 Does-0.001
Token.
Token ID13
Feature activation-1.18e-02

Pos logit contributions
licks+0.025
epad+0.025
auldron+0.023
rou+0.023
 Does+0.023
Neg logit contributions
 kids-0.007
 boys-0.005
 best-0.005
 duck-0.005
 children-0.005
Token He
Token ID679
Feature activation-8.85e-03

Pos logit contributions
 kids+0.013
 best+0.012
Ph+0.012
 Ph+0.012
 duck+0.012
Neg logit contributions
licks-0.022
epad-0.022
rou-0.020
 habitats-0.019
 facts-0.019
Token took
Token ID1718
Feature activation+4.26e-03

Pos logit contributions
 duck+0.010
 umbrella+0.010
 kids+0.010
 Ph+0.009
 best+0.009
Neg logit contributions
epad-0.017
licks-0.016
rou-0.014
 twice-0.014
 Does-0.014
Token car
Token ID1097
Feature activation-7.61e-04

Pos logit contributions
 Ph+0.007
n+0.006
Ph+0.006
 duck+0.006
 umbrella+0.006
Neg logit contributions
licks-0.010
epad-0.010
rou-0.010
 twice-0.010
 Does-0.009
Token with
Token ID351
Feature activation-6.87e-03

Pos logit contributions
 duck+0.001
 kids+0.001
 umbrella+0.001
 best+0.001
 Ph+0.001
Neg logit contributions
licks-0.002
epad-0.002
rou-0.002
 Does-0.001
 twice-0.001
Token her
Token ID607
Feature activation-1.63e-02

Pos logit contributions
 duck+0.009
 umbrella+0.008
 kids+0.008
 Ph+0.008
Ph+0.008
Neg logit contributions
epad-0.012
licks-0.012
 twice-0.011
rou-0.010
 Does-0.010
Token parents
Token ID3397
Feature activation-6.59e-03

Pos logit contributions
 Ph+0.017
Ph+0.017
n+0.016
 Esk+0.015
add+0.014
Neg logit contributions
epad-0.028
rou-0.027
licks-0.027
 twice-0.026
 Does-0.026
Token one
Token ID530
Feature activation-6.92e-03

Pos logit contributions
 duck+0.006
 umbrella+0.006
 kids+0.006
 Ph+0.005
Ph+0.005
Neg logit contributions
epad-0.014
licks-0.014
 twice-0.013
rou-0.013
 Does-0.012
Token sunny
Token ID27737
Feature activation-4.65e-02

Pos logit contributions
 duck+0.006
 Ph+0.006
 umbrella+0.006
 kids+0.006
Ph+0.006
Neg logit contributions
epad-0.014
licks-0.014
 twice-0.013
rou-0.013
minded-0.013
Token day
Token ID1110
Feature activation-2.33e-03

Pos logit contributions
 Ph+0.044
 duck+0.043
Ph+0.042
 kids+0.039
 pig+0.039
Neg logit contributions
epad-0.085
licks-0.079
 habitats-0.079
 twice-0.078
rou-0.076
Token.
Token ID13
Feature activation+3.97e-03

Pos logit contributions
 duck+0.002
 kids+0.002
 umbrella+0.002
 Ph+0.002
 best+0.002
Neg logit contributions
licks-0.005
epad-0.005
rou-0.005
 twice-0.005
 Does-0.005
Token They
Token ID1119
Feature activation-1.06e-02

Pos logit contributions
epad+0.006
licks+0.006
 twice+0.005
 Does+0.005
rou+0.005
Neg logit contributions
 duck-0.006
 umbrella-0.006
 kids-0.006
 best-0.006
Ph-0.006
Token drove
Token ID10357
Feature activation-2.10e-03

Pos logit contributions
 duck+0.013
 umbrella+0.013
 Ph+0.012
Ph+0.012
 kids+0.012
Neg logit contributions
epad-0.018
licks-0.018
 twice-0.016
rou-0.016
 Does-0.015
Token for
Token ID329
Feature activation-7.48e-04

Pos logit contributions
 duck+0.003
 kids+0.003
 umbrella+0.003
 Ph+0.003
 best+0.003
Neg logit contributions
epad-0.003
licks-0.003
rou-0.003
 twice-0.003
 Does-0.003
Token and
Token ID290
Feature activation+5.80e-03

Pos logit contributions
 duck+0.007
 kids+0.006
 umbrella+0.006
 Ph+0.006
 best+0.006
Neg logit contributions
licks-0.014
epad-0.014
rou-0.013
 Does-0.012
 habitats-0.012
Token soon
Token ID2582
Feature activation+6.35e-03

Pos logit contributions
epad+0.011
licks+0.011
rou+0.010
 habitats+0.009
 Does+0.009
Neg logit contributions
 duck-0.007
 kids-0.006
 Ph-0.006
 umbrella-0.006
Ph-0.006
Token the
Token ID262
Feature activation+8.69e-03

Pos logit contributions
epad+0.010
licks+0.010
 twice+0.010
minded+0.009
rou+0.009
Neg logit contributions
 umbrella-0.008
 duck-0.008
 kids-0.007
 Ph-0.007
 pig-0.007
Token darkness
Token ID11854
Feature activation-7.54e-03

Pos logit contributions
licks+0.019
epad+0.019
rou+0.018
 twice+0.017
minded+0.017
Neg logit contributions
 duck-0.008
 kids-0.008
 umbrella-0.007
 best-0.007
 boys-0.007
Token gave
Token ID2921
Feature activation-5.07e-03

Pos logit contributions
 duck+0.009
 umbrella+0.009
 kids+0.009
 Ph+0.009
Ph+0.009
Neg logit contributions
epad-0.013
licks-0.013
 twice-0.012
rou-0.011
minded-0.011
Token way
Token ID835
Feature activation-4.57e-02

Pos logit contributions
 duck+0.007
 umbrella+0.007
 kids+0.007
 Ph+0.006
Ph+0.006
Neg logit contributions
epad-0.008
licks-0.008
 twice-0.007
rou-0.007
minded-0.007
Token to
Token ID284
Feature activation+8.17e-03

Pos logit contributions
 duck+0.052
 umbrella+0.051
 kids+0.050
 Ph+0.049
Ph+0.048
Neg logit contributions
epad-0.084
licks-0.083
rou-0.075
 twice-0.075
 Does-0.073
Token a
Token ID257
Feature activation+1.58e-02

Pos logit contributions
epad+0.016
licks+0.015
 twice+0.014
rou+0.014
 recite+0.013
Neg logit contributions
 duck-0.009
 umbrella-0.008
 kids-0.008
Ph-0.008
 Ph-0.008
Token beautiful
Token ID4950
Feature activation+3.57e-03

Pos logit contributions
licks+0.036
epad+0.035
rou+0.032
 twice+0.032
minded+0.031
Neg logit contributions
 duck-0.013
 umbrella-0.012
 kids-0.012
 best-0.011
 pig-0.011
Token underground
Token ID11447
Feature activation-4.75e-03

Pos logit contributions
licks+0.008
epad+0.008
minded+0.007
rou+0.007
ems+0.007
Neg logit contributions
 duck-0.004
 umbrella-0.003
 kids-0.003
 pig-0.003
 k-0.003
Token garden
Token ID11376
Feature activation-8.21e-03

Pos logit contributions
 duck+0.005
 umbrella+0.005
 kids+0.005
 Ph+0.005
 best+0.004
Neg logit contributions
epad-0.009
licks-0.009
rou-0.008
 twice-0.008
 Does-0.008
Token lovely
Token ID14081
Feature activation-3.88e-03

Pos logit contributions
 Ph+0.006
Ph+0.006
 umbrella+0.006
n+0.006
 kids+0.005
Neg logit contributions
epad-0.010
licks-0.009
 twice-0.009
rou-0.009
 facts-0.009
Token conversation
Token ID5273
Feature activation+4.16e-03

Pos logit contributions
 Ph+0.004
Ph+0.004
 umbrella+0.004
 kids+0.004
 duck+0.004
Neg logit contributions
epad-0.007
licks-0.007
 twice-0.006
rou-0.006
 recite-0.006
Token and
Token ID290
Feature activation-7.09e-03

Pos logit contributions
licks+0.009
epad+0.009
rou+0.008
 Does+0.008
minded+0.008
Neg logit contributions
 duck-0.004
 kids-0.004
 umbrella-0.003
 best-0.003
 Ph-0.003
Token made
Token ID925
Feature activation-1.99e-02

Pos logit contributions
 duck+0.008
 kids+0.008
 Ph+0.007
 umbrella+0.007
 best+0.007
Neg logit contributions
epad-0.014
licks-0.014
rou-0.012
 Does-0.012
auldron-0.012
Token his
Token ID465
Feature activation-1.36e-02

Pos logit contributions
 umbrella+0.020
 duck+0.020
 Ph+0.019
Ph+0.019
 kids+0.019
Neg logit contributions
epad-0.038
licks-0.038
 twice-0.035
rou-0.035
minded-0.034
Token way
Token ID835
Feature activation-4.50e-02

Pos logit contributions
 Ph+0.015
Ph+0.015
 umbrella+0.014
 duck+0.014
 kids+0.014
Neg logit contributions
epad-0.023
licks-0.022
 twice-0.022
rou-0.022
 recite-0.021
Token back
Token ID736
Feature activation-6.60e-03

Pos logit contributions
 duck+0.039
 umbrella+0.038
 kids+0.037
 Ph+0.036
Ph+0.035
Neg logit contributions
epad-0.095
licks-0.095
rou-0.086
 twice-0.086
 Does-0.085
Token home
Token ID1363
Feature activation-9.59e-03

Pos logit contributions
 duck+0.006
 umbrella+0.006
 kids+0.006
 Ph+0.006
 best+0.006
Neg logit contributions
epad-0.013
licks-0.013
rou-0.012
 twice-0.012
 Does-0.012
Token,
Token ID11
Feature activation-1.37e-02

Pos logit contributions
 duck+0.009
 kids+0.008
 umbrella+0.008
 Ph+0.008
Ph+0.008
Neg logit contributions
licks-0.020
epad-0.020
rou-0.018
 twice-0.018
 Does-0.018
Token happy
Token ID3772
Feature activation-7.08e-03

Pos logit contributions
 duck+0.015
 kids+0.014
 best+0.014
 umbrella+0.014
 Ph+0.014
Neg logit contributions
licks-0.027
epad-0.026
minded-0.024
asper-0.023
auldron-0.023
Token that
Token ID326
Feature activation-1.98e-04

Pos logit contributions
 duck+0.008
 kids+0.008
 umbrella+0.007
 best+0.007
 Ph+0.007
Neg logit contributions
licks-0.014
epad-0.014
rou-0.012
 Does-0.012
ems-0.012
Token time
Token ID640
Feature activation-1.61e-03

Pos logit contributions
 duck+0.003
 umbrella+0.003
 Ph+0.003
 kids+0.003
 pig+0.002
Neg logit contributions
epad-0.011
licks-0.010
 twice-0.010
rou-0.010
 habitats-0.010
Token,
Token ID11
Feature activation+7.47e-03

Pos logit contributions
 duck+0.002
 pig+0.002
 umbrella+0.002
Ph+0.002
 kids+0.002
Neg logit contributions
licks-0.002
 twice-0.002
epad-0.002
 Does-0.002
rou-0.002
Token there
Token ID612
Feature activation-9.91e-03

Pos logit contributions
licks+0.013
 Does+0.012
 twice+0.011
epad+0.011
chard+0.011
Neg logit contributions
 umbrella-0.008
Ph-0.008
 Ph-0.008
 pig-0.008
 kids-0.008
Token was
Token ID373
Feature activation-1.33e-02

Pos logit contributions
 umbrella+0.015
 pig+0.014
Ph+0.014
 stret+0.014
 boys+0.013
Neg logit contributions
licks-0.013
 Does-0.011
 twice-0.011
 desperately-0.010
to-0.010
Token a
Token ID257
Feature activation-2.00e-03

Pos logit contributions
 umbrella+0.017
Ph+0.017
n+0.017
pe+0.017
 duck+0.016
Neg logit contributions
epad-0.019
licks-0.017
 Enjoy-0.017
 early-0.017
 Does-0.016
Token do
Token ID466
Feature activation-4.42e-02

Pos logit contributions
 Ph+0.002
n+0.002
Ph+0.001
 umbrella+0.001
 kids+0.001
Neg logit contributions
epad-0.004
rou-0.004
licks-0.004
 writer-0.004
 Does-0.004
Tokenggy
Token ID19970
Feature activation-1.10e-02

Pos logit contributions
 umbrella+0.044
 Ph+0.042
Ph+0.038
 kids+0.037
 boys+0.037
Neg logit contributions
epad-0.077
auldron-0.075
to-0.075
licks-0.074
rou-0.072
Token named
Token ID3706
Feature activation+2.63e-03

Pos logit contributions
 umbrella+0.015
 Ph+0.015
 duck+0.015
Ph+0.014
 kids+0.014
Neg logit contributions
licks-0.017
epad-0.017
 twice-0.015
rou-0.015
 Does-0.015
Token Bob
Token ID5811
Feature activation-8.60e-04

Pos logit contributions
epad+0.005
licks+0.005
rou+0.005
 twice+0.005
minded+0.005
Neg logit contributions
 duck-0.003
 kids-0.003
 Ph-0.003
 umbrella-0.003
Ph-0.002
Token.
Token ID13
Feature activation+6.89e-03

Pos logit contributions
 umbrella+0.001
 Ph+0.001
Ph+0.001
 duck+0.001
 kids+0.001
Neg logit contributions
licks-0.001
epad-0.001
rou-0.001
 twice-0.001
 Does-0.001
Token Bob
Token ID5811
Feature activation+6.57e-03

Pos logit contributions
epad+0.014
licks+0.014
rou+0.013
 twice+0.013
minded+0.013
Neg logit contributions
 duck-0.006
 kids-0.006
 umbrella-0.006
 Ph-0.006
Ph-0.006
Token neither
Token ID6159
Feature activation-5.51e-03

Pos logit contributions
epad+0.009
licks+0.009
rou+0.008
auldron+0.008
 habitats+0.008
Neg logit contributions
 duck-0.003
 kids-0.003
 Ph-0.003
 best-0.003
 boys-0.003
Token was
Token ID373
Feature activation+2.66e-05

Pos logit contributions
 duck+0.006
 kids+0.006
 umbrella+0.006
 Ph+0.006
 best+0.006
Neg logit contributions
epad-0.010
licks-0.010
rou-0.009
 twice-0.009
minded-0.009
Token willing
Token ID4684
Feature activation+4.69e-03

Pos logit contributions
epad+0.000
licks+0.000
 twice+0.000
rou+0.000
minded+0.000
Neg logit contributions
 duck-0.000
 umbrella-0.000
 Ph-0.000
Ph-0.000
 kids-0.000
Token to
Token ID284
Feature activation+2.01e-03

Pos logit contributions
licks+0.007
epad+0.007
 Does+0.006
rou+0.006
 twice+0.006
Neg logit contributions
 kids-0.007
 duck-0.007
 best-0.006
 umbrella-0.006
 pig-0.006
Token give
Token ID1577
Feature activation-5.18e-03

Pos logit contributions
epad+0.003
licks+0.003
rou+0.003
 twice+0.003
 Does+0.003
Neg logit contributions
 duck-0.003
 umbrella-0.003
 kids-0.003
 Ph-0.003
Ph-0.003
Token way
Token ID835
Feature activation-4.41e-02

Pos logit contributions
 duck+0.004
 kids+0.004
 umbrella+0.004
 Ph+0.004
 best+0.004
Neg logit contributions
epad-0.011
licks-0.011
rou-0.010
 twice-0.010
 Does-0.010
Token.
Token ID13
Feature activation-5.26e-03

Pos logit contributions
 duck+0.048
 umbrella+0.047
 kids+0.047
 Ph+0.044
 best+0.043
Neg logit contributions
epad-0.083
licks-0.083
rou-0.076
 twice-0.075
 Does-0.073
Token Finally
Token ID9461
Feature activation+8.18e-03

Pos logit contributions
 duck+0.006
 umbrella+0.006
 kids+0.006
 Ph+0.006
Ph+0.006
Neg logit contributions
epad-0.009
licks-0.009
rou-0.008
 twice-0.008
 Does-0.008
Token,
Token ID11
Feature activation-4.81e-04

Pos logit contributions
 twice+0.013
epad+0.013
licks+0.013
 Does+0.012
ways+0.011
Neg logit contributions
 umbrella-0.010
 duck-0.010
Ph-0.010
 je-0.009
 Ph-0.009
Token after
Token ID706
Feature activation-6.97e-03

Pos logit contributions
 duck+0.000
 kids+0.000
 Ph+0.000
 umbrella+0.000
 best+0.000
Neg logit contributions
licks-0.001
epad-0.001
rou-0.001
minded-0.001
 habitats-0.001
Token much
Token ID881
Feature activation+6.78e-03

Pos logit contributions
 duck+0.006
 umbrella+0.005
 kids+0.005
 Ph+0.005
Ph+0.005
Neg logit contributions
epad-0.015
licks-0.015
 twice-0.014
rou-0.014
 Does-0.013
Token the
Token ID262
Feature activation-5.55e-03

Pos logit contributions
licks+0.009
epad+0.008
 facts+0.007
iffs+0.007
rou+0.007
Neg logit contributions
 duck-0.004
 kids-0.004
 best-0.004
 happy-0.004
n-0.004
Token dull
Token ID19222
Feature activation-3.71e-03

Pos logit contributions
 Ph+0.005
Ph+0.005
 umbrella+0.005
 duck+0.005
 kids+0.005
Neg logit contributions
epad-0.011
licks-0.011
rou-0.010
 twice-0.010
minded-0.010
Token sky
Token ID6766
Feature activation-3.42e-03

Pos logit contributions
 duck+0.003
 kids+0.003
 umbrella+0.003
 Ph+0.003
Ph+0.003
Neg logit contributions
epad-0.008
licks-0.008
rou-0.007
 twice-0.007
 Does-0.007
Token had
Token ID550
Feature activation+5.97e-03

Pos logit contributions
 duck+0.004
 umbrella+0.004
 kids+0.004
 Ph+0.004
Ph+0.004
Neg logit contributions
epad-0.006
licks-0.006
rou-0.006
 twice-0.006
 Does-0.005
Token given
Token ID1813
Feature activation+6.34e-03

Pos logit contributions
epad+0.012
licks+0.012
rou+0.011
 twice+0.011
 Does+0.011
Neg logit contributions
 duck-0.005
 kids-0.005
 umbrella-0.005
 Ph-0.005
Ph-0.005
Token way
Token ID835
Feature activation-4.40e-02

Pos logit contributions
licks+0.013
epad+0.013
rou+0.012
 Does+0.012
auldron+0.012
Neg logit contributions
 duck-0.005
 kids-0.005
 best-0.005
 umbrella-0.005
 Ph-0.005
Token to
Token ID284
Feature activation+7.12e-03

Pos logit contributions
 umbrella+0.060
 duck+0.059
 kids+0.059
 best+0.058
 je+0.054
Neg logit contributions
licks-0.073
epad-0.072
 Does-0.065
auldron-0.063
rou-0.063
Token a
Token ID257
Feature activation+8.43e-03

Pos logit contributions
licks+0.014
epad+0.014
rou+0.013
 Does+0.013
minded+0.013
Neg logit contributions
 duck-0.007
 umbrella-0.007
 kids-0.007
 Ph-0.006
 best-0.006
Token bright
Token ID6016
Feature activation-2.60e-03

Pos logit contributions
licks+0.020
epad+0.019
rou+0.017
 twice+0.017
minded+0.017
Neg logit contributions
 duck-0.007
 umbrella-0.006
 kids-0.006
 best-0.006
 pig-0.006
Token and
Token ID290
Feature activation+4.57e-04

Pos logit contributions
 duck+0.002
 umbrella+0.002
 kids+0.002
 k+0.002
 pig+0.002
Neg logit contributions
licks-0.006
epad-0.006
minded-0.005
rou-0.005
 twice-0.005
Token sunny
Token ID27737
Feature activation-2.71e-02

Pos logit contributions
licks+0.001
epad+0.001
minded+0.001
rou+0.001
auldron+0.001
Neg logit contributions
 duck-0.000
 umbrella-0.000
 kids-0.000
 best-0.000
 k-0.000
Token agreed
Token ID4987
Feature activation-5.44e-03

Pos logit contributions
 duck+0.012
 umbrella+0.012
 best+0.012
 kids+0.012
 je+0.012
Neg logit contributions
epad-0.012
licks-0.012
 twice-0.011
minded-0.011
rou-0.010
Token and
Token ID290
Feature activation-1.10e-03

Pos logit contributions
 duck+0.006
 umbrella+0.006
 kids+0.006
 Ph+0.006
Ph+0.005
Neg logit contributions
epad-0.010
licks-0.010
 twice-0.009
rou-0.009
minded-0.009
Token they
Token ID484
Feature activation-1.94e-02

Pos logit contributions
 duck+0.001
 umbrella+0.001
 kids+0.001
 Ph+0.001
Ph+0.001
Neg logit contributions
epad-0.002
licks-0.002
rou-0.002
 twice-0.002
 Does-0.002
Token made
Token ID925
Feature activation-2.07e-02

Pos logit contributions
 duck+0.026
 kids+0.025
 umbrella+0.025
Ph+0.024
 Ph+0.024
Neg logit contributions
licks-0.029
epad-0.029
 twice-0.027
rou-0.026
 attent-0.025
Token their
Token ID511
Feature activation-2.06e-02

Pos logit contributions
 duck+0.021
 umbrella+0.021
 Ph+0.021
Ph+0.020
 je+0.020
Neg logit contributions
epad-0.037
licks-0.037
 twice-0.035
rou-0.035
 attent-0.035
Token way
Token ID835
Feature activation-4.29e-02

Pos logit contributions
Ph+0.027
 Ph+0.027
 duck+0.025
 umbrella+0.024
n+0.024
Neg logit contributions
epad-0.031
licks-0.030
 habitats-0.030
minded-0.029
 twice-0.028
Token to
Token ID284
Feature activation-2.07e-03

Pos logit contributions
 duck+0.051
 Ph+0.049
 kids+0.048
Ph+0.047
 pig+0.046
Neg logit contributions
epad-0.075
licks-0.071
 twice-0.068
rou-0.065
 early-0.063
Token the
Token ID262
Feature activation+2.50e-03

Pos logit contributions
 duck+0.002
Ph+0.002
 umbrella+0.002
 boys+0.002
 kids+0.002
Neg logit contributions
epad-0.003
 early-0.003
 recite-0.003
licks-0.003
 Enjoy-0.003
Token store
Token ID3650
Feature activation-1.52e-02

Pos logit contributions
epad+0.004
licks+0.004
 twice+0.004
rou+0.004
minded+0.003
Neg logit contributions
 duck-0.003
 Ph-0.003
Ph-0.003
 umbrella-0.003
 kids-0.003
Token.
Token ID13
Feature activation-1.43e-02

Pos logit contributions
 duck+0.019
 umbrella+0.018
 Ph+0.018
Ph+0.017
n+0.016
Neg logit contributions
epad-0.026
licks-0.024
 twice-0.024
minded-0.023
 early-0.022
Token When
Token ID1649
Feature activation-1.11e-02

Pos logit contributions
 duck+0.017
 kids+0.017
 Ph+0.016
 umbrella+0.016
Ph+0.016
Neg logit contributions
epad-0.026
licks-0.025
rou-0.024
 twice-0.022
minded-0.022
Token park
Token ID3952
Feature activation-1.67e-02

Pos logit contributions
 duck+0.003
 umbrella+0.003
 kids+0.003
 Ph+0.003
Ph+0.003
Neg logit contributions
epad-0.006
licks-0.006
rou-0.006
 twice-0.006
minded-0.006
Token on
Token ID319
Feature activation-2.74e-02

Pos logit contributions
 duck+0.014
 kids+0.014
 umbrella+0.014
 Ph+0.013
Ph+0.013
Neg logit contributions
epad-0.035
licks-0.035
rou-0.032
 twice-0.032
 Does-0.032
Token a
Token ID257
Feature activation-1.01e-02

Pos logit contributions
 duck+0.026
Ph+0.026
 umbrella+0.025
n+0.025
 Ph+0.025
Neg logit contributions
licks-0.047
epad-0.046
 habitats-0.045
 early-0.044
 twice-0.044
Token bright
Token ID6016
Feature activation-2.00e-02

Pos logit contributions
 Ph+0.010
 kids+0.010
 duck+0.010
 boys+0.010
 umbrella+0.010
Neg logit contributions
epad-0.018
licks-0.017
rou-0.017
 twice-0.017
 attent-0.016
Token and
Token ID290
Feature activation-1.26e-02

Pos logit contributions
 Ph+0.020
 duck+0.020
 kids+0.020
Ph+0.019
 best+0.019
Neg logit contributions
epad-0.035
licks-0.035
 twice-0.033
rou-0.033
 habitats-0.032
Token sunny
Token ID27737
Feature activation-4.25e-02

Pos logit contributions
 duck+0.012
 kids+0.011
 umbrella+0.011
 Ph+0.011
Ph+0.010
Neg logit contributions
epad-0.026
licks-0.026
rou-0.023
 twice-0.023
minded-0.023
Token day
Token ID1110
Feature activation-8.76e-03

Pos logit contributions
 duck+0.038
 kids+0.036
 Ph+0.036
 umbrella+0.035
Ph+0.034
Neg logit contributions
epad-0.084
licks-0.081
 twice-0.077
rou-0.076
 habitats-0.075
Token.
Token ID13
Feature activation+3.75e-03

Pos logit contributions
 duck+0.008
 umbrella+0.008
 kids+0.008
 Ph+0.007
Ph+0.007
Neg logit contributions
epad-0.018
licks-0.018
 twice-0.016
rou-0.016
minded-0.016
Token
Token ID198
Feature activation-9.27e-03

Pos logit contributions
epad+0.006
licks+0.006
 twice+0.006
rou+0.006
 Does+0.006
Neg logit contributions
 duck-0.005
 umbrella-0.005
 kids-0.005
 Ph-0.004
Ph-0.004
Token
Token ID198
Feature activation-4.46e-03

Pos logit contributions
 umbrella+0.011
 duck+0.010
 kids+0.010
 best+0.010
 pig+0.010
Neg logit contributions
licks-0.017
epad-0.016
 twice-0.015
rou-0.015
 Does-0.015
TokenSam
Token ID16305
Feature activation+5.67e-03

Pos logit contributions
 duck+0.006
 umbrella+0.006
 kids+0.006
 best+0.006
 Ph+0.006
Neg logit contributions
epad-0.007
licks-0.007
 twice-0.006
rou-0.006
 Does-0.006
Token playing
Token ID2712
Feature activation-1.53e-02

Pos logit contributions
 duck+0.012
 umbrella+0.012
 kids+0.012
 Ph+0.011
Ph+0.011
Neg logit contributions
epad-0.024
licks-0.024
rou-0.022
 twice-0.022
minded-0.021
Token in
Token ID287
Feature activation-3.24e-03

Pos logit contributions
 duck+0.017
 kids+0.017
 umbrella+0.017
 Ph+0.016
Ph+0.016
Neg logit contributions
epad-0.028
licks-0.028
rou-0.025
 twice-0.025
minded-0.025
Token the
Token ID262
Feature activation-4.52e-03

Pos logit contributions
 duck+0.003
Ph+0.003
 Ph+0.003
 umbrella+0.003
 best+0.003
Neg logit contributions
epad-0.006
licks-0.006
 twice-0.005
 Does-0.005
rou-0.005
Token park
Token ID3952
Feature activation-1.26e-02

Pos logit contributions
 Ph+0.005
 duck+0.005
Ph+0.005
 umbrella+0.005
n+0.005
Neg logit contributions
epad-0.008
licks-0.008
rou-0.007
 twice-0.007
 Does-0.007
Token one
Token ID530
Feature activation-1.05e-03

Pos logit contributions
 kids+0.011
 duck+0.010
 best+0.010
 umbrella+0.009
 Ph+0.009
Neg logit contributions
licks-0.028
epad-0.028
rou-0.026
 Does-0.025
minded-0.025
Token sunny
Token ID27737
Feature activation-4.21e-02

Pos logit contributions
 duck+0.001
 kids+0.001
 umbrella+0.001
 Ph+0.001
 best+0.001
Neg logit contributions
epad-0.002
licks-0.002
rou-0.002
 twice-0.002
minded-0.002
Token day
Token ID1110
Feature activation-1.61e-04

Pos logit contributions
 duck+0.033
 umbrella+0.031
 Ph+0.031
 kids+0.030
Ph+0.030
Neg logit contributions
epad-0.090
licks-0.089
 twice-0.083
rou-0.082
 habitats-0.080
Token.
Token ID13
Feature activation-1.11e-02

Pos logit contributions
 kids+0.000
 duck+0.000
 umbrella+0.000
 best+0.000
 Ph+0.000
Neg logit contributions
licks-0.000
epad-0.000
rou-0.000
 Does-0.000
minded-0.000
Token He
Token ID679
Feature activation+1.70e-03

Pos logit contributions
 duck+0.016
 umbrella+0.015
 kids+0.015
 Ph+0.014
Ph+0.014
Neg logit contributions
epad-0.018
licks-0.017
 twice-0.016
rou-0.015
 Does-0.015
Token saw
Token ID2497
Feature activation-5.02e-03

Pos logit contributions
epad+0.004
licks+0.003
rou+0.003
minded+0.003
osity+0.003
Neg logit contributions
 kids-0.002
 duck-0.002
 umbrella-0.002
 Ph-0.002
 best-0.002
Token a
Token ID257
Feature activation-4.74e-03

Pos logit contributions
 duck+0.003
 umbrella+0.003
 kids+0.003
 Ph+0.003
Ph+0.003
Neg logit contributions
epad-0.011
licks-0.011
rou-0.011
 twice-0.011
 Does-0.010
Token slide
Token ID10649
Feature activation+3.66e-03

Pos logit contributions
 duck+0.015
 Ph+0.013
 pig+0.013
 umbrella+0.013
 kids+0.013
Neg logit contributions
licks-0.024
epad-0.023
 twice-0.022
rou-0.022
oes-0.021
Token and
Token ID290
Feature activation-3.52e-04

Pos logit contributions
epad+0.008
licks+0.008
 Does+0.007
 facts+0.007
 recite+0.007
Neg logit contributions
 kids-0.003
 duck-0.003
 best-0.003
 umbrella-0.003
 Ph-0.003
Token laughed
Token ID13818
Feature activation+7.20e-04

Pos logit contributions
 kids+0.000
 duck+0.000
 Ph+0.000
 umbrella+0.000
Ph+0.000
Neg logit contributions
epad-0.001
licks-0.001
rou-0.001
 habitats-0.001
 facts-0.001
Token all
Token ID477
Feature activation-2.00e-02

Pos logit contributions
epad+0.001
licks+0.001
 twice+0.001
rou+0.001
 attent+0.001
Neg logit contributions
 duck-0.001
 umbrella-0.001
 kids-0.001
 Ph-0.001
Ph-0.001
Token the
Token ID262
Feature activation-2.50e-02

Pos logit contributions
 duck+0.024
 pig+0.024
 umbrella+0.023
 Ph+0.022
 kitten+0.021
Neg logit contributions
licks-0.033
 twice-0.032
epad-0.031
rou-0.030
oes-0.030
Token way
Token ID835
Feature activation-4.17e-02

Pos logit contributions
 duck+0.027
 Ph+0.026
 umbrella+0.026
Ph+0.024
 pig+0.024
Neg logit contributions
epad-0.045
licks-0.045
 twice-0.042
rou-0.041
minded-0.039
Token down
Token ID866
Feature activation-1.40e-02

Pos logit contributions
 duck+0.065
 kids+0.061
 Ph+0.061
 umbrella+0.061
 pig+0.061
Neg logit contributions
epad-0.057
licks-0.056
 twice-0.054
rou-0.051
oes-0.048
Token.
Token ID13
Feature activation-1.16e-03

Pos logit contributions
 duck+0.013
 umbrella+0.012
 kids+0.012
 Ph+0.012
Ph+0.012
Neg logit contributions
epad-0.028
licks-0.028
 twice-0.026
rou-0.026
minded-0.025
Token
Token ID220
Feature activation+1.64e-03

Pos logit contributions
 duck+0.001
 kids+0.001
 Ph+0.001
 umbrella+0.001
Ph+0.001
Neg logit contributions
epad-0.002
licks-0.002
rou-0.002
 twice-0.002
minded-0.002
Token
Token ID198
Feature activation-1.65e-03

Pos logit contributions
licks+0.003
epad+0.003
 twice+0.003
rou+0.003
to+0.003
Neg logit contributions
 umbrella-0.002
 duck-0.002
 pig-0.002
 Ph-0.002
 kids-0.002
Token
Token ID198
Feature activation-1.44e-03

Pos logit contributions
 duck+0.001
 umbrella+0.001
 kids+0.001
 Ph+0.001
 pig+0.001
Neg logit contributions
epad-0.004
licks-0.004
 twice-0.003
rou-0.003
 Does-0.003
Token,
Token ID11
Feature activation+5.28e-03

Pos logit contributions
 duck+0.002
 umbrella+0.002
 pig+0.002
Ph+0.002
 train+0.002
Neg logit contributions
licks-0.002
epad-0.002
 twice-0.002
rou-0.001
 Does-0.001
Token there
Token ID612
Feature activation-1.36e-02

Pos logit contributions
licks+0.008
 Does+0.008
epad+0.008
 twice+0.008
chard+0.007
Neg logit contributions
 umbrella-0.007
 Ph-0.006
Ph-0.006
 pig-0.006
 stret-0.006
Token was
Token ID373
Feature activation-1.46e-02

Pos logit contributions
 umbrella+0.021
Ph+0.019
 pig+0.019
 stret+0.019
 duck+0.018
Neg logit contributions
licks-0.018
 Does-0.016
 twice-0.015
to-0.015
epad-0.015
Token a
Token ID257
Feature activation-2.16e-03

Pos logit contributions
 umbrella+0.019
Ph+0.019
 duck+0.018
pe+0.018
chie+0.018
Neg logit contributions
epad-0.022
 early-0.020
 Enjoy-0.020
licks-0.019
 Does-0.018
Token big
Token ID1263
Feature activation-4.91e-03

Pos logit contributions
 Ph+0.002
n+0.002
 umbrella+0.002
Ph+0.002
 themselves+0.001
Neg logit contributions
epad-0.004
rou-0.004
licks-0.004
 writer-0.004
 Does-0.004
Token do
Token ID466
Feature activation-4.16e-02

Pos logit contributions
 Ph+0.004
Ph+0.004
 duck+0.004
 umbrella+0.004
n+0.004
Neg logit contributions
epad-0.010
licks-0.010
rou-0.009
 twice-0.009
 Does-0.009
Tokenggie
Token ID23571
Feature activation-9.07e-03

Pos logit contributions
 umbrella+0.040
 Ph+0.039
Ph+0.035
 duck+0.034
 themselves+0.033
Neg logit contributions
epad-0.079
licks-0.075
auldron-0.074
 twice-0.074
to-0.073
Token named
Token ID3706
Feature activation+1.14e-03

Pos logit contributions
 umbrella+0.011
 duck+0.011
 Ph+0.011
Ph+0.011
 kids+0.011
Neg logit contributions
epad-0.015
licks-0.015
 twice-0.014
rou-0.014
 Does-0.013
Token Max
Token ID5436
Feature activation+5.46e-04

Pos logit contributions
epad+0.002
licks+0.002
rou+0.002
minded+0.002
 twice+0.002
Neg logit contributions
 kids-0.001
 duck-0.001
 Ph-0.001
 umbrella-0.001
n-0.001
Token.
Token ID13
Feature activation+9.71e-03

Pos logit contributions
epad+0.001
licks+0.001
rou+0.001
 twice+0.001
 Does+0.001
Neg logit contributions
 umbrella-0.001
 duck-0.001
 Ph-0.001
 kids-0.001
Ph-0.001
Token Max
Token ID5436
Feature activation+8.06e-03

Pos logit contributions
epad+0.020
licks+0.019
rou+0.019
 twice+0.018
minded+0.018
Neg logit contributions
 kids-0.009
 duck-0.009
 boys-0.009
 best-0.008
 umbrella-0.008
Token his
Token ID465
Feature activation+3.67e-03

Pos logit contributions
Ph+0.005
 duck+0.005
 kids+0.005
 Ph+0.005
 umbrella+0.005
Neg logit contributions
epad-0.009
licks-0.009
 twice-0.009
rou-0.008
 Does-0.008
Token friends
Token ID2460
Feature activation+8.50e-03

Pos logit contributions
epad+0.007
licks+0.007
 twice+0.006
rou+0.006
 habitats+0.006
Neg logit contributions
 Ph-0.004
Ph-0.004
 duck-0.004
 umbrella-0.003
 kids-0.003
Token and
Token ID290
Feature activation-5.83e-03

Pos logit contributions
epad+0.017
licks+0.017
rou+0.015
 twice+0.015
 Does+0.015
Neg logit contributions
 duck-0.008
 umbrella-0.008
 kids-0.008
 Ph-0.008
Ph-0.007
Token made
Token ID925
Feature activation-1.47e-02

Pos logit contributions
 duck+0.006
 kids+0.006
 umbrella+0.006
 Ph+0.006
 best+0.006
Neg logit contributions
licks-0.011
epad-0.011
rou-0.010
 twice-0.010
 Does-0.010
Token his
Token ID465
Feature activation-1.14e-02

Pos logit contributions
pe+0.020
 duck+0.019
 umbrella+0.019
kin+0.019
 kitten+0.018
Neg logit contributions
to-0.020
 twice-0.020
 int-0.019
epad-0.019
rou-0.019
Token way
Token ID835
Feature activation-4.16e-02

Pos logit contributions
 Ph+0.016
Ph+0.015
 duck+0.014
 umbrella+0.013
 kids+0.013
Neg logit contributions
epad-0.017
 twice-0.016
rou-0.016
licks-0.016
 recite-0.015
Token to
Token ID284
Feature activation-1.20e-03

Pos logit contributions
 duck+0.045
 kids+0.043
 Ph+0.042
 umbrella+0.042
Ph+0.041
Neg logit contributions
epad-0.078
licks-0.077
 twice-0.072
rou-0.071
minded-0.068
Token the
Token ID262
Feature activation+9.34e-03

Pos logit contributions
 duck+0.001
Ph+0.001
 Ph+0.001
 umbrella+0.001
 kids+0.001
Neg logit contributions
epad-0.002
licks-0.002
 twice-0.002
 recite-0.002
rou-0.002
Token tower
Token ID10580
Feature activation+7.38e-03

Pos logit contributions
licks+0.019
epad+0.019
 twice+0.017
rou+0.017
 Does+0.017
Neg logit contributions
 kids-0.009
 duck-0.009
 umbrella-0.009
 best-0.009
 je-0.008
Token.
Token ID13
Feature activation+2.12e-03

Pos logit contributions
epad+0.015
licks+0.015
 twice+0.014
rou+0.013
minded+0.013
Neg logit contributions
 duck-0.007
 kids-0.007
 umbrella-0.007
 Ph-0.007
Ph-0.006
Token
Token ID198
Feature activation+6.40e-04

Pos logit contributions
licks+0.003
 Does+0.003
 Enjoy+0.003
 twice+0.003
rou+0.003
Neg logit contributions
 umbrella-0.003
 duck-0.003
 kids-0.003
 boys-0.003
 best-0.003
Token friends
Token ID2460
Feature activation+2.09e-03

Pos logit contributions
 Ph+0.017
Ph+0.017
 duck+0.016
n+0.016
 umbrella+0.015
Neg logit contributions
epad-0.041
licks-0.038
rou-0.038
 habitats-0.037
 twice-0.037
Token and
Token ID290
Feature activation-1.12e-02

Pos logit contributions
epad+0.004
licks+0.004
rou+0.004
 twice+0.004
 Does+0.004
Neg logit contributions
 duck-0.002
 kids-0.002
 umbrella-0.002
 Ph-0.002
 best-0.002
Token skipped
Token ID26684
Feature activation-1.71e-02

Pos logit contributions
 duck+0.011
 kids+0.011
 umbrella+0.011
 Ph+0.010
 best+0.010
Neg logit contributions
licks-0.023
epad-0.023
rou-0.021
 twice-0.020
 Does-0.020
Token all
Token ID477
Feature activation-1.77e-02

Pos logit contributions
 duck+0.019
 umbrella+0.019
 kids+0.019
 Ph+0.018
Ph+0.018
Neg logit contributions
epad-0.031
licks-0.031
 twice-0.028
rou-0.028
minded-0.027
Token the
Token ID262
Feature activation-2.53e-02

Pos logit contributions
 duck+0.018
 umbrella+0.017
 pig+0.017
 Ph+0.016
 kids+0.016
Neg logit contributions
licks-0.034
epad-0.033
 twice-0.031
rou-0.030
minded-0.029
Token way
Token ID835
Feature activation-4.16e-02

Pos logit contributions
 duck+0.034
 Ph+0.032
 umbrella+0.031
Ph+0.030
 pig+0.030
Neg logit contributions
epad-0.038
licks-0.038
 twice-0.036
 habitats-0.034
rou-0.033
Token back
Token ID736
Feature activation-1.02e-02

Pos logit contributions
 duck+0.043
 kids+0.041
 umbrella+0.041
 Ph+0.040
Ph+0.038
Neg logit contributions
epad-0.080
licks-0.078
 twice-0.074
rou-0.073
minded-0.070
Token to
Token ID284
Feature activation-6.65e-03

Pos logit contributions
 duck+0.009
 umbrella+0.009
 kids+0.009
 Ph+0.008
 best+0.008
Neg logit contributions
epad-0.022
licks-0.021
rou-0.020
 twice-0.020
 Does-0.019
Token the
Token ID262
Feature activation-6.91e-03

Pos logit contributions
 duck+0.007
 umbrella+0.006
 kids+0.006
Ph+0.006
 Ph+0.006
Neg logit contributions
epad-0.013
licks-0.012
 twice-0.012
rou-0.011
 recite-0.011
Token car
Token ID1097
Feature activation+1.43e-04

Pos logit contributions
 duck+0.006
 umbrella+0.006
 Ph+0.005
 kids+0.005
Ph+0.005
Neg logit contributions
epad-0.015
licks-0.014
rou-0.013
 twice-0.013
minded-0.013
Token.
Token ID13
Feature activation-6.52e-03

Pos logit contributions
epad+0.000
licks+0.000
rou+0.000
 twice+0.000
 Does+0.000
Neg logit contributions
 duck-0.000
 umbrella-0.000
 kids-0.000
 Ph-0.000
Ph-0.000
Token,
Token ID11
Feature activation-2.14e-02

Pos logit contributions
epad+0.003
licks+0.003
 twice+0.003
 Does+0.003
minded+0.003
Neg logit contributions
 duck-0.002
 umbrella-0.002
Ph-0.002
 Ph-0.002
 kids-0.002
Token Tommy
Token ID19919
Feature activation+3.18e-03

Pos logit contributions
 duck+0.021
 umbrella+0.021
Ph+0.019
 Ph+0.019
 kids+0.019
Neg logit contributions
epad-0.041
licks-0.040
 twice-0.038
rou-0.037
 Does-0.037
Token took
Token ID1718
Feature activation-1.05e-03

Pos logit contributions
epad+0.006
licks+0.006
 Does+0.005
rou+0.005
 twice+0.005
Neg logit contributions
 duck-0.004
 kids-0.003
 pig-0.003
 umbrella-0.003
 best-0.003
Token the
Token ID262
Feature activation-1.15e-02

Pos logit contributions
 umbrella+0.001
 duck+0.001
Ph+0.001
 Ph+0.001
 kids+0.001
Neg logit contributions
licks-0.002
epad-0.002
 twice-0.002
minded-0.002
 Enjoy-0.002
Token long
Token ID890
Feature activation+5.97e-04

Pos logit contributions
Ph+0.012
 Esk+0.012
 Ph+0.012
pe+0.012
n+0.011
Neg logit contributions
epad-0.020
licks-0.019
 twice-0.018
rou-0.018
 early-0.018
Token way
Token ID835
Feature activation-4.16e-02

Pos logit contributions
epad+0.001
licks+0.001
 twice+0.001
ems+0.001
 attent+0.001
Neg logit contributions
Ph-0.001
 Esk-0.001
 Ph-0.001
 duck-0.001
 best-0.001
Token to
Token ID284
Feature activation-3.38e-04

Pos logit contributions
 duck+0.045
 umbrella+0.044
 kids+0.044
 Ph+0.042
 best+0.041
Neg logit contributions
epad-0.079
licks-0.078
rou-0.071
 twice-0.070
 Does-0.069
Token the
Token ID262
Feature activation+1.79e-03

Pos logit contributions
 duck+0.000
Ph+0.000
 umbrella+0.000
 kids+0.000
 Ph+0.000
Neg logit contributions
epad-0.001
licks-0.001
 recite-0.001
 twice-0.001
rou-0.001
Token park
Token ID3952
Feature activation-1.01e-02

Pos logit contributions
epad+0.004
licks+0.004
 twice+0.003
rou+0.003
 Does+0.003
Neg logit contributions
 duck-0.002
 umbrella-0.001
 kids-0.001
 Ph-0.001
Ph-0.001
Token so
Token ID523
Feature activation-1.27e-02

Pos logit contributions
 duck+0.009
 kids+0.009
 umbrella+0.008
 Ph+0.008
 best+0.008
Neg logit contributions
licks-0.021
epad-0.021
rou-0.020
 Does-0.019
 twice-0.019
Token the
Token ID262
Feature activation-7.98e-03

Pos logit contributions
 duck+0.014
 kids+0.013
 Ph+0.013
 umbrella+0.013
Ph+0.012
Neg logit contributions
epad-0.024
licks-0.024
rou-0.022
 twice-0.022
minded-0.021
Token in
Token ID287
Feature activation-1.35e-03

Pos logit contributions
 duck+0.008
 kids+0.008
 umbrella+0.008
 Ph+0.008
Ph+0.008
Neg logit contributions
epad-0.013
licks-0.013
rou-0.012
 twice-0.012
 Does-0.011
Token a
Token ID257
Feature activation+8.02e-03

Pos logit contributions
 duck+0.001
 umbrella+0.001
 kids+0.001
 Ph+0.001
Ph+0.001
Neg logit contributions
epad-0.003
licks-0.003
 twice-0.003
rou-0.003
 Does-0.003
Token kind
Token ID1611
Feature activation-2.84e-03

Pos logit contributions
licks+0.017
epad+0.017
rou+0.015
 twice+0.015
minded+0.015
Neg logit contributions
 duck-0.007
 umbrella-0.007
 kids-0.007
 best-0.006
 pig-0.006
Token and
Token ID290
Feature activation-5.38e-03

Pos logit contributions
 duck+0.003
 kids+0.003
 umbrella+0.003
 pig+0.003
 Ph+0.002
Neg logit contributions
licks-0.006
epad-0.006
rou-0.005
 Does-0.005
 twice-0.005
Token thoughtful
Token ID22677
Feature activation-1.33e-02

Pos logit contributions
 kids+0.005
 best+0.005
 duck+0.005
 umbrella+0.004
n+0.004
Neg logit contributions
licks-0.011
epad-0.011
rou-0.010
auldron-0.010
 habitats-0.010
Token way
Token ID835
Feature activation-4.16e-02

Pos logit contributions
 duck+0.016
 kids+0.016
 umbrella+0.016
 best+0.015
 pig+0.015
Neg logit contributions
licks-0.025
epad-0.024
rou-0.022
 Does-0.021
 twice-0.021
Token.
Token ID13
Feature activation-3.69e-03

Pos logit contributions
 duck+0.046
 umbrella+0.044
 Ph+0.044
 kids+0.043
Ph+0.042
Neg logit contributions
epad-0.076
licks-0.074
 twice-0.070
rou-0.068
minded-0.067
Token His
Token ID2399
Feature activation-9.10e-03

Pos logit contributions
 duck+0.002
 kids+0.002
 best+0.002
Ph+0.001
 Ph+0.001
Neg logit contributions
epad-0.009
licks-0.009
rou-0.009
 habitats-0.008
 twice-0.008
Token friends
Token ID2460
Feature activation-1.20e-03

Pos logit contributions
 duck+0.007
 umbrella+0.007
 Ph+0.006
 kids+0.006
Ph+0.006
Neg logit contributions
epad-0.020
licks-0.020
 twice-0.018
rou-0.018
minded-0.018
Token soon
Token ID2582
Feature activation-2.30e-03

Pos logit contributions
 duck+0.001
 kids+0.001
 umbrella+0.001
 Ph+0.001
 best+0.001
Neg logit contributions
epad-0.002
licks-0.002
 twice-0.002
rou-0.002
 Does-0.002
Token joined
Token ID5399
Feature activation+2.71e-03

Pos logit contributions
 duck+0.003
 kids+0.003
 umbrella+0.003
 Ph+0.003
Ph+0.003
Neg logit contributions
epad-0.004
licks-0.004
rou-0.004
 twice-0.004
 Does-0.003
Token carefully
Token ID7773
Feature activation-1.25e-02

Pos logit contributions
 duck+0.009
 kids+0.008
 best+0.008
 umbrella+0.008
 Ph+0.008
Neg logit contributions
licks-0.014
epad-0.013
 habitats-0.012
auldron-0.012
minded-0.012
Token while
Token ID981
Feature activation-1.30e-02

Pos logit contributions
 duck+0.016
 umbrella+0.015
 kids+0.015
 Ph+0.015
Ph+0.014
Neg logit contributions
licks-0.022
epad-0.021
rou-0.019
 Does-0.019
 twice-0.019
Token they
Token ID484
Feature activation-1.18e-02

Pos logit contributions
 duck+0.017
 umbrella+0.016
Ph+0.016
 kids+0.016
 Ph+0.016
Neg logit contributions
epad-0.022
licks-0.021
 twice-0.019
rou-0.019
 Does-0.018
Token made
Token ID925
Feature activation-8.24e-03

Pos logit contributions
 duck+0.013
 kids+0.013
 umbrella+0.013
 Ph+0.012
Ph+0.012
Neg logit contributions
licks-0.021
epad-0.021
rou-0.019
 twice-0.019
minded-0.018
Token their
Token ID511
Feature activation-2.11e-02

Pos logit contributions
 duck+0.009
 umbrella+0.009
Ph+0.009
 Ph+0.009
 kids+0.008
Neg logit contributions
epad-0.014
licks-0.014
 twice-0.014
rou-0.013
 attent-0.013
Token way
Token ID835
Feature activation-4.15e-02

Pos logit contributions
Ph+0.023
 Ph+0.023
 duck+0.023
 Esk+0.021
n+0.021
Neg logit contributions
epad-0.034
licks-0.033
 habitats-0.033
rou-0.032
 twice-0.032
Token down
Token ID866
Feature activation-8.56e-03

Pos logit contributions
 duck+0.039
 kids+0.037
 umbrella+0.037
 Ph+0.036
Ph+0.035
Neg logit contributions
epad-0.084
licks-0.083
 twice-0.077
rou-0.077
minded-0.074
Token from
Token ID422
Feature activation-1.08e-02

Pos logit contributions
 duck+0.007
 umbrella+0.007
 kids+0.007
 Ph+0.007
Ph+0.007
Neg logit contributions
epad-0.017
licks-0.017
 twice-0.016
rou-0.016
minded-0.015
Token the
Token ID262
Feature activation-1.97e-04

Pos logit contributions
 duck+0.009
 umbrella+0.008
 Ph+0.008
Ph+0.008
 kids+0.008
Neg logit contributions
epad-0.023
licks-0.023
 twice-0.021
rou-0.021
minded-0.021
Token tree
Token ID5509
Feature activation-8.21e-03

Pos logit contributions
 duck+0.000
 Ph+0.000
 kids+0.000
 umbrella+0.000
Ph+0.000
Neg logit contributions
epad-0.000
licks-0.000
rou-0.000
 twice-0.000
 Does-0.000
Token.
Token ID13
Feature activation-1.90e-02

Pos logit contributions
 duck+0.008
 umbrella+0.007
 kids+0.007
 Ph+0.007
Ph+0.007
Neg logit contributions
epad-0.017
licks-0.017
rou-0.015
 twice-0.015
 Does-0.015
Token walking
Token ID6155
Feature activation-1.31e-02

Pos logit contributions
 duck+0.013
 kids+0.013
 best+0.012
 umbrella+0.012
 Ph+0.012
Neg logit contributions
licks-0.031
epad-0.031
minded-0.029
 facts-0.028
 habitats-0.028
Token through
Token ID832
Feature activation+2.52e-04

Pos logit contributions
 kids+0.014
 duck+0.014
 umbrella+0.013
 best+0.013
 je+0.012
Neg logit contributions
licks-0.026
epad-0.026
rou-0.024
 Does-0.023
auldron-0.023
Token the
Token ID262
Feature activation-4.87e-03

Pos logit contributions
epad+0.001
licks+0.001
 twice+0.001
rou+0.001
 Does+0.001
Neg logit contributions
 duck-0.000
 umbrella-0.000
 kids-0.000
 Ph-0.000
Ph-0.000
Token park
Token ID3952
Feature activation-1.08e-02

Pos logit contributions
 duck+0.004
 kids+0.004
 umbrella+0.004
 Ph+0.004
 best+0.004
Neg logit contributions
epad-0.011
licks-0.011
 twice-0.010
rou-0.010
 Does-0.009
Token one
Token ID530
Feature activation-1.79e-03

Pos logit contributions
 kids+0.009
 best+0.009
 duck+0.009
 umbrella+0.008
 Ph+0.008
Neg logit contributions
licks-0.024
epad-0.023
rou-0.022
 Does-0.022
ems-0.021
Token sunny
Token ID27737
Feature activation-4.14e-02

Pos logit contributions
 kids+0.001
 best+0.001
 duck+0.001
 umbrella+0.001
 k+0.001
Neg logit contributions
epad-0.004
licks-0.004
rou-0.004
 Does-0.004
minded-0.004
Token day
Token ID1110
Feature activation+7.39e-04

Pos logit contributions
 duck+0.032
 kids+0.030
 umbrella+0.030
 Ph+0.029
Ph+0.028
Neg logit contributions
epad-0.092
licks-0.091
rou-0.084
 twice-0.084
 Does-0.082
Token.
Token ID13
Feature activation-6.71e-03

Pos logit contributions
licks+0.002
epad+0.002
rou+0.002
 Does+0.001
 habitats+0.001
Neg logit contributions
 kids-0.001
 duck-0.001
 umbrella-0.001
 best-0.001
 Ph-0.001
Token He
Token ID679
Feature activation+4.05e-03

Pos logit contributions
 duck+0.009
 umbrella+0.009
 kids+0.009
 Ph+0.009
Ph+0.009
Neg logit contributions
epad-0.011
licks-0.010
 twice-0.009
rou-0.009
 Does-0.009
Token saw
Token ID2497
Feature activation-3.89e-03

Pos logit contributions
epad+0.008
licks+0.008
rou+0.007
 facts+0.007
osity+0.007
Neg logit contributions
 kids-0.005
 duck-0.005
 Ph-0.004
 umbrella-0.004
Ph-0.004
Token a
Token ID257
Feature activation-4.67e-03

Pos logit contributions
 duck+0.003
 umbrella+0.002
 kids+0.002
 Ph+0.002
Ph+0.002
Neg logit contributions
epad-0.009
licks-0.009
rou-0.008
 twice-0.008
minded-0.008
Token let
Token ID1309
Feature activation-2.60e-02

Pos logit contributions
 duck+0.005
 umbrella+0.005
 kids+0.005
 Ph+0.005
Ph+0.005
Neg logit contributions
epad-0.008
licks-0.007
rou-0.007
 twice-0.007
minded-0.007
Token his
Token ID465
Feature activation-1.16e-02

Pos logit contributions
 umbrella+0.033
 kids+0.032
 boys+0.032
Ph+0.032
 je+0.031
Neg logit contributions
 twice-0.038
epad-0.036
licks-0.036
 recite-0.036
to-0.036
Token mum
Token ID25682
Feature activation-1.40e-02

Pos logit contributions
Ph+0.016
 Ph+0.015
 duck+0.015
 umbrella+0.015
 je+0.015
Neg logit contributions
epad-0.017
licks-0.016
 twice-0.016
rou-0.015
 recite-0.015
Token lead
Token ID1085
Feature activation-2.32e-02

Pos logit contributions
 duck+0.017
 umbrella+0.016
 kids+0.016
 Ph+0.016
Ph+0.016
Neg logit contributions
epad-0.025
licks-0.024
 twice-0.022
rou-0.022
 Does-0.021
Token the
Token ID262
Feature activation-2.98e-02

Pos logit contributions
 duck+0.028
 umbrella+0.028
 kids+0.027
Ph+0.027
 Ph+0.027
Neg logit contributions
epad-0.039
licks-0.039
 twice-0.036
minded-0.034
rou-0.034
Token way
Token ID835
Feature activation-4.14e-02

Pos logit contributions
Ph+0.032
 Ph+0.031
 duck+0.028
 Esk+0.026
n+0.026
Neg logit contributions
epad-0.052
 twice-0.045
rou-0.044
licks-0.044
 recite-0.044
Token.
Token ID13
Feature activation-1.55e-02

Pos logit contributions
 duck+0.054
 kids+0.052
 Ph+0.052
 umbrella+0.051
Ph+0.051
Neg logit contributions
epad-0.068
licks-0.065
 twice-0.062
rou-0.060
minded-0.056
Token They
Token ID1119
Feature activation-1.86e-02

Pos logit contributions
 duck+0.018
 Ph+0.018
 kids+0.018
 umbrella+0.017
Ph+0.017
Neg logit contributions
epad-0.029
licks-0.027
rou-0.026
minded-0.025
 facts-0.025
Token soon
Token ID2582
Feature activation-6.59e-03

Pos logit contributions
 duck+0.023
 kids+0.022
 umbrella+0.022
 Ph+0.022
Ph+0.022
Neg logit contributions
epad-0.031
licks-0.031
 twice-0.028
rou-0.028
minded-0.027
Token got
Token ID1392
Feature activation-7.63e-03

Pos logit contributions
 duck+0.008
 umbrella+0.008
 kids+0.008
 Ph+0.008
Ph+0.008
Neg logit contributions
epad-0.011
licks-0.010
 twice-0.010
rou-0.009
minded-0.009
Token to
Token ID284
Feature activation-9.00e-03

Pos logit contributions
 duck+0.011
 umbrella+0.011
 kids+0.011
 Ph+0.011
Ph+0.011
Neg logit contributions
licks-0.011
epad-0.011
 twice-0.010
rou-0.010
minded-0.009
Token and
Token ID290
Feature activation+1.02e-02

Pos logit contributions
epad+0.003
licks+0.003
 Does+0.002
minded+0.002
rou+0.002
Neg logit contributions
 duck-0.001
 umbrella-0.001
 kids-0.001
 best-0.001
 k-0.001
Token looked
Token ID3114
Feature activation-9.79e-04

Pos logit contributions
epad+0.024
licks+0.023
rou+0.023
asper+0.023
minded+0.022
Neg logit contributions
 duck-0.010
 kids-0.009
 k-0.008
 umbrella-0.008
 pig-0.008
Token around
Token ID1088
Feature activation+6.56e-03

Pos logit contributions
 umbrella+0.001
 fine+0.001
 duck+0.001
 best+0.001
 lucky+0.001
Neg logit contributions
epad-0.002
asper-0.002
 habitats-0.002
licks-0.002
 facts-0.002
Token.
Token ID13
Feature activation+9.86e-03

Pos logit contributions
licks+0.016
epad+0.016
rou+0.015
auldron+0.015
asper+0.015
Neg logit contributions
 duck-0.004
 kids-0.004
 umbrella-0.003
 best-0.003
 Ph-0.003
Token The
Token ID383
Feature activation+1.65e-02

Pos logit contributions
epad+0.020
licks+0.019
rou+0.018
 twice+0.017
minded+0.017
Neg logit contributions
 duck-0.010
 kids-0.010
 Ph-0.010
 umbrella-0.010
Ph-0.009
Token cater
Token ID20825
Feature activation+3.51e-02

Pos logit contributions
epad+0.029
licks+0.029
rou+0.027
 twice+0.027
minded+0.026
Neg logit contributions
Ph-0.018
 Ph-0.018
 duck-0.018
 umbrella-0.017
 kids-0.016
Tokenpillar
Token ID41643
Feature activation+4.15e-03

Pos logit contributions
epad+0.050
licks+0.045
 twice+0.040
 habitats+0.039
rou+0.039
Neg logit contributions
 duck-0.058
 kids-0.056
 umbrella-0.056
 Ph-0.055
Ph-0.054
Token wanted
Token ID2227
Feature activation+6.36e-03

Pos logit contributions
epad+0.008
licks+0.008
rou+0.007
 facts+0.007
minded+0.007
Neg logit contributions
 duck-0.005
 kids-0.004
 umbrella-0.004
 pig-0.004
n-0.004
Token to
Token ID284
Feature activation+8.25e-03

Pos logit contributions
epad+0.011
licks+0.011
rou+0.010
asper+0.010
ited+0.010
Neg logit contributions
 duck-0.009
 kids-0.009
 umbrella-0.008
 best-0.008
 pig-0.008
Token find
Token ID1064
Feature activation+4.36e-03

Pos logit contributions
epad+0.015
licks+0.015
 twice+0.014
rou+0.014
 recite+0.013
Neg logit contributions
 duck-0.009
 kids-0.009
 umbrella-0.009
 Ph-0.009
Ph-0.008
Token a
Token ID257
Feature activation+1.12e-03

Pos logit contributions
epad+0.010
licks+0.010
rou+0.009
 habitats+0.009
 twice+0.009
Neg logit contributions
 duck-0.004
 umbrella-0.003
 kids-0.003
 best-0.003
 pig-0.003
Token frog
Token ID21264
Feature activation+8.87e-03

Pos logit contributions
 duck+0.008
 umbrella+0.007
 kids+0.007
 Ph+0.006
 pig+0.006
Neg logit contributions
epad-0.023
licks-0.023
 twice-0.021
rou-0.021
minded-0.021
Token named
Token ID3706
Feature activation+1.68e-02

Pos logit contributions
epad+0.020
licks+0.019
minded+0.018
 twice+0.018
asper+0.017
Neg logit contributions
 duck-0.009
 kids-0.007
 pig-0.007
 umbrella-0.007
 je-0.007
Token Ho
Token ID9544
Feature activation+3.98e-02

Pos logit contributions
epad+0.032
licks+0.031
rou+0.029
 twice+0.029
 Does+0.028
Neg logit contributions
 duck-0.018
 umbrella-0.018
 kids-0.018
 Ph-0.017
Ph-0.017
Tokenppy
Token ID14097
Feature activation+8.85e-03

Pos logit contributions
epad+0.048
licks+0.047
rou+0.040
 twice+0.040
 Does+0.038
Neg logit contributions
 duck-0.070
 umbrella-0.069
 kids-0.069
 Ph-0.068
Ph-0.067
Token.
Token ID13
Feature activation+2.97e-02

Pos logit contributions
epad+0.019
licks+0.019
 twice+0.018
rou+0.018
 Does+0.017
Neg logit contributions
 duck-0.007
 kids-0.007
 umbrella-0.007
 Ph-0.006
Ph-0.006
Token Ho
Token ID9544
Feature activation+5.13e-02

Pos logit contributions
epad+0.047
licks+0.044
rou+0.041
 twice+0.040
minded+0.039
Neg logit contributions
 duck-0.044
 kids-0.042
 Ph-0.042
 umbrella-0.041
 pig-0.041
Tokenppy
Token ID14097
Feature activation+5.66e-03

Pos logit contributions
epad+0.064
licks+0.062
 twice+0.055
rou+0.053
 Does+0.052
Neg logit contributions
 duck-0.089
 kids-0.085
 umbrella-0.084
 Ph-0.083
 best-0.083
Token lived
Token ID5615
Feature activation-5.72e-03

Pos logit contributions
epad+0.010
licks+0.010
rou+0.009
 twice+0.009
 Does+0.009
Neg logit contributions
 duck-0.006
 umbrella-0.006
 kids-0.006
 Ph-0.006
Ph-0.006
Token in
Token ID287
Feature activation+5.95e-03

Pos logit contributions
 duck+0.004
 best+0.004
 kids+0.004
 pig+0.003
 two+0.003
Neg logit contributions
epad-0.013
licks-0.013
 facts-0.012
asper-0.012
minded-0.012
Token a
Token ID257
Feature activation+3.78e-03

Pos logit contributions
licks+0.011
epad+0.011
 Does+0.010
 recite+0.010
 twice+0.010
Neg logit contributions
 umbrella-0.006
Ph-0.006
 Ph-0.006
 duck-0.005
 kids-0.005
Token small
Token ID1402
Feature activation-1.74e-03

Pos logit contributions
epad+0.008
licks+0.008
rou+0.007
 twice+0.007
 Does+0.007
Neg logit contributions
 duck-0.003
 umbrella-0.003
 kids-0.003
 Ph-0.003
Ph-0.003
Token days
Token ID1528
Feature activation-2.05e-02

Pos logit contributions
Ph+0.023
 duck+0.023
 Ph+0.023
 umbrella+0.022
 best+0.022
Neg logit contributions
licks-0.037
epad-0.036
 twice-0.034
minded-0.033
rou-0.032
Token."
Token ID526
Feature activation+8.77e-03

Pos logit contributions
 duck+0.022
 umbrella+0.021
 Ph+0.021
Ph+0.021
 kids+0.020
Neg logit contributions
epad-0.038
licks-0.038
 twice-0.035
rou-0.034
minded-0.033
Token\n
Token ID198
Feature activation-1.45e-03

Pos logit contributions
epad+0.020
licks+0.019
 twice+0.018
rou+0.018
 Does+0.018
Neg logit contributions
 duck-0.006
 umbrella-0.006
 kids-0.006
 Ph-0.006
Ph-0.006
Token\n
Token ID198
Feature activation-2.29e-03

Pos logit contributions
 duck+0.001
 umbrella+0.001
 Ph+0.001
 kids+0.001
 best+0.001
Neg logit contributions
licks-0.003
epad-0.003
rou-0.003
 twice-0.003
 Does-0.003
TokenBen
Token ID11696
Feature activation+7.19e-03

Pos logit contributions
 duck+0.003
 umbrella+0.003
 kids+0.003
 Ph+0.003
Ph+0.003
Neg logit contributions
epad-0.004
licks-0.004
rou-0.004
 twice-0.004
minded-0.003
Token smiles
Token ID21845
Feature activation+3.50e-02

Pos logit contributions
epad+0.013
licks+0.011
osity+0.011
rou+0.011
 Does+0.011
Neg logit contributions
 kids-0.010
 duck-0.010
 boys-0.009
 umbrella-0.009
 pig-0.009
Token.
Token ID13
Feature activation+7.93e-03

Pos logit contributions
epad+0.070
licks+0.069
 twice+0.064
rou+0.063
minded+0.062
Neg logit contributions
 duck-0.033
 umbrella-0.033
 Ph-0.032
 kids-0.031
Ph-0.031
Token He
Token ID679
Feature activation+5.95e-03

Pos logit contributions
epad+0.015
licks+0.014
rou+0.013
 twice+0.013
minded+0.012
Neg logit contributions
 duck-0.009
 kids-0.009
 umbrella-0.009
 Ph-0.009
Ph-0.008
Token likes
Token ID7832
Feature activation-3.04e-03

Pos logit contributions
epad+0.010
licks+0.009
oes+0.008
osity+0.008
minded+0.008
Neg logit contributions
 duck-0.009
 kids-0.009
 umbrella-0.008
 best-0.008
 pig-0.008
Token his
Token ID465
Feature activation-8.04e-03

Pos logit contributions
 duck+0.004
 kids+0.004
 umbrella+0.004
 Ph+0.004
 pig+0.004
Neg logit contributions
epad-0.005
licks-0.005
rou-0.004
 twice-0.004
minded-0.004
Token boots
Token ID14412
Feature activation-7.52e-03

Pos logit contributions
Ph+0.009
 Ph+0.009
 duck+0.008
n+0.007
ter+0.007
Neg logit contributions
rou-0.013
epad-0.013
 twice-0.013
 habitats-0.013
oes-0.012
Token were
Token ID547
Feature activation+3.44e-03

Pos logit contributions
 duck+0.002
 kids+0.002
 umbrella+0.002
 Ph+0.002
Ph+0.002
Neg logit contributions
epad-0.005
licks-0.005
 twice-0.005
rou-0.005
minded-0.005
Token sorry
Token ID7926
Feature activation+1.54e-02

Pos logit contributions
licks+0.006
epad+0.006
 twice+0.005
rou+0.005
minded+0.005
Neg logit contributions
 umbrella-0.004
 Ph-0.004
Ph-0.004
 duck-0.004
n-0.004
Token.
Token ID13
Feature activation+2.98e-04

Pos logit contributions
epad+0.030
licks+0.030
 twice+0.027
rou+0.027
 Does+0.027
Neg logit contributions
 duck-0.016
 umbrella-0.015
 kids-0.015
 Ph-0.015
Ph-0.014
Token He
Token ID679
Feature activation-6.26e-05

Pos logit contributions
licks+0.000
epad+0.000
 twice+0.000
 Does+0.000
rou+0.000
Neg logit contributions
 duck-0.000
 umbrella-0.000
 kids-0.000
 Ph-0.000
 best-0.000
Token felt
Token ID2936
Feature activation+1.34e-02

Pos logit contributions
 duck+0.000
 kids+0.000
 umbrella+0.000
 Ph+0.000
Ph+0.000
Neg logit contributions
epad-0.000
licks-0.000
rou-0.000
 twice-0.000
minded-0.000
Token a
Token ID257
Feature activation+3.39e-02

Pos logit contributions
licks+0.026
epad+0.025
 twice+0.024
rou+0.023
 Does+0.023
Neg logit contributions
 umbrella-0.013
 Ph-0.013
 duck-0.013
n-0.012
 kids-0.012
Token little
Token ID1310
Feature activation+2.12e-02

Pos logit contributions
epad+0.074
licks+0.073
rou+0.068
 twice+0.068
 Does+0.066
Neg logit contributions
 duck-0.025
 Ph-0.024
 umbrella-0.024
 kids-0.024
Ph-0.022
Token sorry
Token ID7926
Feature activation+1.86e-02

Pos logit contributions
epad+0.036
licks+0.036
 twice+0.033
rou+0.033
 Does+0.032
Neg logit contributions
 Ph-0.025
n-0.024
 umbrella-0.024
Ph-0.024
 kids-0.024
Token too
Token ID1165
Feature activation+1.88e-02

Pos logit contributions
epad+0.027
licks+0.027
 twice+0.024
rou+0.023
 Does+0.023
Neg logit contributions
 duck-0.027
 Ph-0.027
 umbrella-0.027
 kids-0.027
Ph-0.027
Token.
Token ID13
Feature activation-5.95e-04

Pos logit contributions
epad+0.035
licks+0.034
 twice+0.032
rou+0.031
minded+0.030
Neg logit contributions
 duck-0.020
 umbrella-0.020
 Ph-0.020
Ph-0.019
 kids-0.019
Token He
Token ID679
Feature activation-1.40e-03

Pos logit contributions
 umbrella+0.001
 duck+0.001
 Ph+0.001
 kids+0.001
 best+0.001
Neg logit contributions
licks-0.001
epad-0.001
 twice-0.001
 Does-0.001
rou-0.001
Token,"
Token ID553
Feature activation+7.58e-03

Pos logit contributions
 duck+0.012
 kids+0.012
 umbrella+0.011
 Ph+0.011
 best+0.011
Neg logit contributions
epad-0.029
licks-0.029
rou-0.026
 twice-0.026
 Does-0.026
Token Ben
Token ID3932
Feature activation-5.66e-03

Pos logit contributions
epad+0.013
licks+0.012
rou+0.011
 twice+0.011
 Does+0.011
Neg logit contributions
 duck-0.010
 kids-0.010
 umbrella-0.010
 Ph-0.010
Ph-0.009
Token says
Token ID1139
Feature activation+6.05e-03

Pos logit contributions
 duck+0.008
 kids+0.008
 umbrella+0.008
 Ph+0.007
 boys+0.007
Neg logit contributions
epad-0.009
licks-0.009
rou-0.008
 twice-0.008
 habitats-0.008
Token.
Token ID13
Feature activation+1.41e-02

Pos logit contributions
epad+0.013
licks+0.013
 twice+0.012
rou+0.012
 Does+0.011
Neg logit contributions
 duck-0.005
 umbrella-0.005
 kids-0.005
 Ph-0.005
Ph-0.005
Token He
Token ID679
Feature activation+4.33e-03

Pos logit contributions
licks+0.023
epad+0.022
 twice+0.021
 Does+0.020
rou+0.020
Neg logit contributions
 umbrella-0.019
 duck-0.018
 Ph-0.018
 kids-0.018
n-0.017
Token smiles
Token ID21845
Feature activation+3.54e-02

Pos logit contributions
epad+0.008
licks+0.007
minded+0.006
oes+0.006
osity+0.006
Neg logit contributions
 duck-0.006
 kids-0.006
 k-0.006
 umbrella-0.006
 p-0.006
Token at
Token ID379
Feature activation-3.15e-03

Pos logit contributions
epad+0.069
licks+0.068
 twice+0.063
rou+0.063
minded+0.061
Neg logit contributions
 umbrella-0.036
 duck-0.036
 Ph-0.035
 kids-0.034
Ph-0.034
Token L
Token ID406
Feature activation+6.37e-03

Pos logit contributions
 duck+0.004
 umbrella+0.004
Ph+0.004
 pig+0.004
 Ph+0.004
Neg logit contributions
licks-0.005
epad-0.005
 twice-0.005
rou-0.005
minded-0.005
Tokenila
Token ID10102
Feature activation-7.13e-03

Pos logit contributions
licks+0.010
rou+0.009
 twice+0.009
epad+0.009
ems+0.008
Neg logit contributions
 pig-0.009
 je-0.009
 umbrella-0.009
 duck-0.009
 best-0.008
Token too
Token ID1165
Feature activation+1.51e-02

Pos logit contributions
 duck+0.007
 Ph+0.007
 umbrella+0.007
Ph+0.007
 kids+0.007
Neg logit contributions
epad-0.014
licks-0.014
 twice-0.013
rou-0.013
minded-0.012
Token.
Token ID13
Feature activation+5.58e-03

Pos logit contributions
epad+0.032
licks+0.032
rou+0.029
 twice+0.029
 Does+0.029
Neg logit contributions
 duck-0.013
 kids-0.012
 umbrella-0.012
 Ph-0.012
Ph-0.011
Token\n
Token ID198
Feature activation-6.96e-03

Pos logit contributions
licks+0.006
epad+0.005
 twice+0.005
rou+0.005
to+0.005
Neg logit contributions
 duck-0.003
 umbrella-0.003
 pig-0.003
 kids-0.003
 Ph-0.003
Token\n
Token ID198
Feature activation-9.62e-03

Pos logit contributions
 umbrella+0.007
 duck+0.007
 best+0.007
 pig+0.007
 kids+0.007
Neg logit contributions
licks-0.013
epad-0.013
rou-0.012
 twice-0.012
 Does-0.012
TokenHe
Token ID1544
Feature activation-1.22e-03

Pos logit contributions
 umbrella+0.012
 duck+0.012
 pig+0.011
 best+0.011
 kids+0.011
Neg logit contributions
licks-0.016
epad-0.016
rou-0.015
 twice-0.015
 Does-0.015
Token asked
Token ID1965
Feature activation-8.90e-04

Pos logit contributions
 duck+0.001
 kids+0.001
 umbrella+0.001
 Ph+0.001
Ph+0.001
Neg logit contributions
epad-0.002
licks-0.002
rou-0.002
 twice-0.002
minded-0.002
Token Mom
Token ID11254
Feature activation-3.49e-03

Pos logit contributions
 duck+0.001
 kids+0.001
 umbrella+0.001
 Ph+0.001
Ph+0.001
Neg logit contributions
epad-0.002
licks-0.002
rou-0.002
 twice-0.002
minded-0.002
Tokenmy
Token ID1820
Feature activation+8.99e-03

Pos logit contributions
 kids+0.004
 best+0.004
 two+0.004
 duck+0.004
 umbrella+0.004
Neg logit contributions
epad-0.006
licks-0.006
 habitats-0.006
auldron-0.006
 battlefield-0.006
Token,
Token ID11
Feature activation-1.26e-02

Pos logit contributions
epad+0.020
 habitats+0.019
licks+0.019
ited+0.019
auldron+0.019
Neg logit contributions
 kids-0.007
 best-0.007
 duck-0.007
 two-0.006
 pig-0.006
Token "
Token ID366
Feature activation-4.49e-03

Pos logit contributions
 duck+0.014
 umbrella+0.014
 kids+0.014
 Ph+0.013
Ph+0.013
Neg logit contributions
epad-0.023
licks-0.022
 twice-0.021
rou-0.020
 Does-0.020
TokenCan
Token ID6090
Feature activation-1.47e-03

Pos logit contributions
 duck+0.006
 kids+0.006
 umbrella+0.006
 Ph+0.006
Ph+0.006
Neg logit contributions
epad-0.007
licks-0.007
rou-0.006
 twice-0.006
minded-0.006
Token I
Token ID314
Feature activation-7.03e-03

Pos logit contributions
 umbrella+0.002
Ph+0.002
 duck+0.002
 Ph+0.002
 kids+0.002
Neg logit contributions
licks-0.002
epad-0.002
 Does-0.002
 twice-0.002
rou-0.002
Token have
Token ID423
Feature activation+1.13e-03

Pos logit contributions
 duck+0.007
 umbrella+0.007
 kids+0.007
 Ph+0.007
Ph+0.006
Neg logit contributions
epad-0.014
licks-0.014
 twice-0.013
rou-0.012
 Does-0.012
Token.
Token ID13
Feature activation-6.24e-03

Pos logit contributions
epad+0.027
licks+0.027
rou+0.025
 Does+0.024
 twice+0.024
Neg logit contributions
 duck-0.013
 kids-0.012
 umbrella-0.012
 best-0.011
 Ph-0.011
Token Then
Token ID3244
Feature activation+1.89e-03

Pos logit contributions
 duck+0.009
 umbrella+0.008
 best+0.008
 kids+0.008
 pig+0.008
Neg logit contributions
licks-0.010
epad-0.009
 twice-0.009
 Does-0.009
 habitats-0.008
Token they
Token ID484
Feature activation-3.86e-03

Pos logit contributions
epad+0.003
licks+0.003
 twice+0.003
 Does+0.003
minded+0.003
Neg logit contributions
 duck-0.002
Ph-0.002
 umbrella-0.002
 Ph-0.002
 kids-0.002
Token put
Token ID1234
Feature activation+5.67e-03

Pos logit contributions
 duck+0.004
 kids+0.004
 umbrella+0.004
 boys+0.004
 two+0.004
Neg logit contributions
licks-0.007
epad-0.007
rou-0.007
 Does-0.007
minded-0.007
Token some
Token ID617
Feature activation+8.92e-03

Pos logit contributions
licks+0.010
epad+0.009
 twice+0.009
minded+0.008
 attent+0.008
Neg logit contributions
Ph-0.007
 Ph-0.007
n-0.006
 duck-0.006
 umbrella-0.006
Token pine
Token ID20161
Feature activation+6.55e-03

Pos logit contributions
epad+0.019
licks+0.018
rou+0.017
 twice+0.017
 Does+0.016
Neg logit contributions
 duck-0.008
 kids-0.008
 umbrella-0.008
 best-0.007
 pig-0.007
Token cones
Token ID47314
Feature activation+6.47e-03

Pos logit contributions
licks+0.011
epad+0.010
 twice+0.010
rou+0.010
minded+0.010
Neg logit contributions
 Ph-0.008
Ph-0.008
 kids-0.007
 umbrella-0.007
 duck-0.007
Token on
Token ID319
Feature activation-4.79e-03

Pos logit contributions
licks+0.011
epad+0.011
 twice+0.010
rou+0.009
minded+0.009
Neg logit contributions
 Ph-0.008
 duck-0.008
 kids-0.008
Ph-0.008
 umbrella-0.008
Token the
Token ID262
Feature activation+5.85e-03

Pos logit contributions
Ph+0.005
 kids+0.005
 umbrella+0.005
 Ph+0.005
 duck+0.005
Neg logit contributions
licks-0.008
epad-0.008
 twice-0.008
 attent-0.007
 int-0.007
Token top
Token ID1353
Feature activation+1.30e-02

Pos logit contributions
epad+0.010
licks+0.010
 twice+0.009
rou+0.009
minded+0.009
Neg logit contributions
 Ph-0.007
 duck-0.007
Ph-0.007
 kids-0.006
 umbrella-0.006
Token to
Token ID284
Feature activation+4.64e-03

Pos logit contributions
epad+0.025
licks+0.025
rou+0.023
 twice+0.023
 Does+0.022
Neg logit contributions
 duck-0.013
 umbrella-0.013
 kids-0.013
 Ph-0.013
Ph-0.012
Token the
Token ID262
Feature activation-2.10e-03

Pos logit contributions
Ph+0.017
 knee+0.016
 umbrella+0.015
 best+0.015
 Ph+0.015
Neg logit contributions
licks-0.019
 twice-0.017
 Alexa-0.016
 Dean-0.016
epad-0.016
Token friends
Token ID2460
Feature activation+1.96e-03

Pos logit contributions
Ph+0.002
 Ph+0.002
n+0.002
 duck+0.002
 umbrella+0.002
Neg logit contributions
epad-0.004
licks-0.004
oes-0.004
 twice-0.004
 habitats-0.004
Token said
Token ID531
Feature activation+2.03e-03

Pos logit contributions
licks+0.003
epad+0.003
 twice+0.003
rou+0.003
minded+0.003
Neg logit contributions
 duck-0.002
 umbrella-0.002
Ph-0.002
 Ph-0.002
 kids-0.002
Token the
Token ID262
Feature activation-9.91e-03

Pos logit contributions
epad+0.003
licks+0.003
 twice+0.003
 Does+0.003
rou+0.003
Neg logit contributions
 umbrella-0.002
 duck-0.002
 Ph-0.002
Ph-0.002
 kids-0.002
Token cart
Token ID6383
Feature activation-8.76e-03

Pos logit contributions
Ph+0.011
 Ph+0.010
 duck+0.010
n+0.009
ggy+0.009
Neg logit contributions
epad-0.017
oes-0.015
 twice-0.015
minded-0.015
rou-0.015
Token looked
Token ID3114
Feature activation+5.00e-03

Pos logit contributions
 umbrella+0.008
 duck+0.008
Ph+0.008
 Ph+0.008
 kids+0.008
Neg logit contributions
licks-0.017
epad-0.017
 twice-0.016
rou-0.016
 Does-0.015
Token very
Token ID845
Feature activation+7.42e-03

Pos logit contributions
licks+0.011
epad+0.011
rou+0.010
 habitats+0.010
minded+0.010
Neg logit contributions
 kids-0.004
 duck-0.004
 best-0.004
 umbrella-0.004
 Ph-0.004
Token useful
Token ID4465
Feature activation-6.93e-03

Pos logit contributions
epad+0.017
licks+0.016
rou+0.015
 twice+0.015
 Does+0.015
Neg logit contributions
 duck-0.006
 kids-0.005
 umbrella-0.005
 Ph-0.005
Ph-0.005
Token.
Token ID13
Feature activation+3.65e-03

Pos logit contributions
 duck+0.006
 umbrella+0.006
 kids+0.006
 Ph+0.006
Ph+0.005
Neg logit contributions
epad-0.014
licks-0.014
rou-0.013
 twice-0.013
 Does-0.013
Token The
Token ID383
Feature activation-6.97e-03

Pos logit contributions
epad+0.007
licks+0.007
 twice+0.006
rou+0.006
 Does+0.006
Neg logit contributions
 duck-0.004
 umbrella-0.004
 kids-0.004
 Ph-0.004
Ph-0.004
Token other
Token ID584
Feature activation-6.63e-04

Pos logit contributions
Ph+0.007
 Ph+0.007
 duck+0.007
n+0.007
 themselves+0.006
Neg logit contributions
epad-0.012
 twice-0.011
rou-0.011
oes-0.011
minded-0.011
Token She
Token ID1375
Feature activation-8.42e-03

Pos logit contributions
 duck+0.001
 umbrella+0.001
 kids+0.000
Ph+0.000
 best+0.000
Neg logit contributions
licks-0.001
epad-0.001
 twice-0.001
 Does-0.001
rou-0.001
Token saw
Token ID2497
Feature activation+1.48e-03

Pos logit contributions
 duck+0.009
 kids+0.009
 umbrella+0.009
 Ph+0.009
 pig+0.008
Neg logit contributions
epad-0.017
licks-0.016
rou-0.014
 Does-0.014
minded-0.014
Token a
Token ID257
Feature activation+1.15e-02

Pos logit contributions
epad+0.003
licks+0.003
 twice+0.003
rou+0.003
minded+0.003
Neg logit contributions
 duck-0.001
 umbrella-0.001
 kids-0.001
 Ph-0.001
Ph-0.001
Token door
Token ID3420
Feature activation+7.42e-03

Pos logit contributions
licks+0.029
epad+0.027
minded+0.027
 habitats+0.026
 twice+0.026
Neg logit contributions
 duck-0.008
 kids-0.008
 umbrella-0.007
 train-0.007
 best-0.007
Token that
Token ID326
Feature activation+7.17e-03

Pos logit contributions
epad+0.015
licks+0.014
rou+0.013
minded+0.013
 habitats+0.013
Neg logit contributions
 umbrella-0.008
 kids-0.007
 duck-0.007
 Ph-0.007
 best-0.007
Token said
Token ID531
Feature activation+5.97e-03

Pos logit contributions
epad+0.015
licks+0.014
rou+0.013
 twice+0.013
minded+0.013
Neg logit contributions
 duck-0.007
 umbrella-0.007
 kids-0.007
 Ph-0.006
 best-0.006
Token "
Token ID366
Feature activation+1.25e-03

Pos logit contributions
epad+0.011
licks+0.011
 twice+0.010
 Does+0.009
rou+0.009
Neg logit contributions
 duck-0.007
 umbrella-0.007
 kids-0.006
 Ph-0.006
Ph-0.006
TokenDo
Token ID5211
Feature activation+9.29e-03

Pos logit contributions
epad+0.002
licks+0.002
 twice+0.002
rou+0.002
minded+0.002
Neg logit contributions
 duck-0.002
 umbrella-0.002
 kids-0.002
 best-0.002
 Ph-0.002
Token not
Token ID407
Feature activation-4.08e-03

Pos logit contributions
 twice+0.011
licks+0.010
 facts+0.010
epad+0.010
ems+0.009
Neg logit contributions
 duck-0.016
 pig-0.016
 umbrella-0.015
 Esk-0.015
 Ph-0.015
Token enter
Token ID3802
Feature activation-1.70e-02

Pos logit contributions
 duck+0.004
 kids+0.004
 umbrella+0.004
 Ph+0.004
Ph+0.004
Neg logit contributions
licks-0.008
epad-0.008
rou-0.007
 twice-0.007
minded-0.007
Token".
Token ID1911
Feature activation-2.61e-03

Pos logit contributions
 duck+0.020
 umbrella+0.019
 kids+0.019
 Ph+0.019
Ph+0.018
Neg logit contributions
epad-0.030
licks-0.030
 twice-0.027
rou-0.027
minded-0.026
Token nice
Token ID3621
Feature activation-1.52e-02

Pos logit contributions
 duck+0.009
 umbrella+0.008
 pig+0.008
Ph+0.007
 kids+0.007
Neg logit contributions
 twice-0.014
epad-0.014
licks-0.014
rou-0.013
 facts-0.013
Token,
Token ID11
Feature activation-7.43e-03

Pos logit contributions
 duck+0.016
 umbrella+0.016
 Ph+0.015
 kids+0.015
Ph+0.015
Neg logit contributions
epad-0.029
licks-0.028
 twice-0.026
rou-0.026
minded-0.025
Token but
Token ID475
Feature activation-1.02e-02

Pos logit contributions
 duck+0.009
Ph+0.008
 umbrella+0.008
 pig+0.008
 best+0.008
Neg logit contributions
licks-0.013
 twice-0.012
epad-0.012
rou-0.012
 Does-0.011
Token I
Token ID314
Feature activation+6.10e-03

Pos logit contributions
 duck+0.011
 umbrella+0.010
 kids+0.010
 Ph+0.010
Ph+0.010
Neg logit contributions
epad-0.020
licks-0.019
rou-0.018
 twice-0.018
minded-0.017
Token made
Token ID925
Feature activation-1.47e-03

Pos logit contributions
epad+0.012
licks+0.012
 habitats+0.011
 twice+0.011
minded+0.011
Neg logit contributions
 duck-0.006
 kids-0.006
n-0.006
 best-0.006
 umbrella-0.006
Token a
Token ID257
Feature activation+1.67e-02

Pos logit contributions
 duck+0.001
 Ph+0.001
 umbrella+0.001
Ph+0.001
 kids+0.001
Neg logit contributions
epad-0.003
licks-0.003
 twice-0.003
rou-0.003
minded-0.003
Token dragon
Token ID10441
Feature activation+7.97e-03

Pos logit contributions
licks+0.039
epad+0.036
minded+0.034
 Does+0.034
ems+0.034
Neg logit contributions
 duck-0.016
 best-0.014
 kids-0.014
 umbrella-0.014
 train-0.013
Token!"
Token ID2474
Feature activation-8.97e-03

Pos logit contributions
licks+0.014
epad+0.014
rou+0.013
 twice+0.013
minded+0.013
Neg logit contributions
 duck-0.009
 kids-0.009
 umbrella-0.009
 best-0.008
 pig-0.008
Token Tom
Token ID4186
Feature activation-5.10e-03

Pos logit contributions
 duck+0.011
 umbrella+0.011
 best+0.010
 pig+0.010
 kids+0.010
Neg logit contributions
licks-0.015
epad-0.014
 twice-0.014
 Does-0.013
 attent-0.013
Token said
Token ID531
Feature activation-3.43e-03

Pos logit contributions
 duck+0.006
 umbrella+0.006
 kids+0.006
 Ph+0.006
Ph+0.006
Neg logit contributions
epad-0.009
licks-0.009
 twice-0.008
rou-0.008
minded-0.008
Token,
Token ID11
Feature activation+7.33e-03

Pos logit contributions
 duck+0.003
 umbrella+0.003
 kids+0.003
 Ph+0.003
Ph+0.003
Neg logit contributions
epad-0.007
licks-0.006
 twice-0.006
rou-0.006
minded-0.006
Token is
Token ID318
Feature activation+5.80e-03

Pos logit contributions
 Ph+0.004
Ph+0.004
 kids+0.004
 umbrella+0.004
 duck+0.004
Neg logit contributions
licks-0.006
epad-0.005
 twice-0.005
rou-0.005
 Does-0.005
Token cold
Token ID4692
Feature activation-1.20e-02

Pos logit contributions
epad+0.011
licks+0.011
 twice+0.010
rou+0.010
 Does+0.010
Neg logit contributions
 duck-0.006
 kids-0.006
 umbrella-0.006
 Ph-0.006
Ph-0.006
Token and
Token ID290
Feature activation+5.52e-03

Pos logit contributions
 duck+0.014
 Ph+0.014
Ph+0.014
 umbrella+0.014
 kids+0.014
Neg logit contributions
licks-0.020
epad-0.020
 twice-0.018
rou-0.017
minded-0.017
Token fresh
Token ID4713
Feature activation-8.59e-03

Pos logit contributions
epad+0.011
licks+0.011
 twice+0.010
rou+0.010
 Does+0.010
Neg logit contributions
 duck-0.005
 umbrella-0.005
 kids-0.005
 Ph-0.005
Ph-0.004
Token,"
Token ID553
Feature activation+1.02e-02

Pos logit contributions
Ph+0.010
 Ph+0.010
 umbrella+0.010
 duck+0.010
 kids+0.010
Neg logit contributions
licks-0.014
epad-0.013
 twice-0.013
minded-0.012
rou-0.012
Token Ben
Token ID3932
Feature activation-2.25e-03

Pos logit contributions
licks+0.017
epad+0.017
 twice+0.015
 Does+0.015
rou+0.015
Neg logit contributions
 umbrella-0.013
 duck-0.013
 Ph-0.012
 kids-0.012
Ph-0.012
Token said
Token ID531
Feature activation-1.08e-03

Pos logit contributions
 duck+0.003
 umbrella+0.003
 Ph+0.003
 kids+0.003
Ph+0.003
Neg logit contributions
licks-0.004
epad-0.004
 twice-0.003
rou-0.003
minded-0.003
Token.
Token ID13
Feature activation+1.52e-02

Pos logit contributions
 umbrella+0.001
 duck+0.001
 Ph+0.001
Ph+0.001
 kids+0.001
Neg logit contributions
epad-0.002
licks-0.002
 twice-0.002
minded-0.002
rou-0.002
Token "
Token ID366
Feature activation+2.50e-02

Pos logit contributions
licks+0.028
epad+0.028
 twice+0.026
 Does+0.025
rou+0.025
Neg logit contributions
 umbrella-0.017
 duck-0.016
 Ph-0.016
 kids-0.016
Ph-0.016
TokenIt
Token ID1026
Feature activation+2.31e-03

Pos logit contributions
epad+0.040
licks+0.039
 Does+0.037
minded+0.036
rou+0.035
Neg logit contributions
 umbrella-0.034
 Ph-0.032
 best-0.032
 kids-0.031
Ph-0.030
Token makes
Token ID1838
Feature activation-2.36e-03

Pos logit contributions
licks+0.003
epad+0.003
 twice+0.003
rou+0.003
 Does+0.003
Neg logit contributions
 duck-0.003
 umbrella-0.003
 kids-0.003
 Ph-0.003
Ph-0.003
Token up
Token ID510
Feature activation+1.29e-03

Pos logit contributions
 duck+0.013
 umbrella+0.012
 kids+0.012
 Ph+0.012
Ph+0.012
Neg logit contributions
epad-0.012
licks-0.012
rou-0.011
 twice-0.011
 Does-0.010
Token,
Token ID11
Feature activation-1.99e-03

Pos logit contributions
licks+0.003
epad+0.003
 habitats+0.003
rou+0.003
minded+0.003
Neg logit contributions
 umbrella-0.001
 duck-0.001
 kids-0.001
 Ph-0.001
 best-0.001
Token she
Token ID673
Feature activation+2.48e-03

Pos logit contributions
 duck+0.003
 kids+0.002
 umbrella+0.002
 Ph+0.002
Ph+0.002
Neg logit contributions
epad-0.003
licks-0.003
rou-0.003
 twice-0.003
minded-0.003
Token saw
Token ID2497
Feature activation+1.59e-03

Pos logit contributions
epad+0.006
licks+0.006
 facts+0.005
osity+0.005
minded+0.005
Neg logit contributions
 duck-0.002
 kids-0.002
 pig-0.002
 umbrella-0.002
 p-0.002
Token that
Token ID326
Feature activation-2.40e-03

Pos logit contributions
epad+0.004
licks+0.004
 twice+0.003
rou+0.003
minded+0.003
Neg logit contributions
 duck-0.001
 umbrella-0.001
 kids-0.001
 Ph-0.001
 best-0.001
Token her
Token ID607
Feature activation-5.75e-03

Pos logit contributions
 kids+0.002
 duck+0.002
 umbrella+0.002
 best+0.002
 Ph+0.002
Neg logit contributions
licks-0.006
epad-0.005
rou-0.005
 habitats-0.005
minded-0.005
Token cereal
Token ID33158
Feature activation+2.52e-03

Pos logit contributions
 duck+0.005
 kids+0.005
 umbrella+0.004
 best+0.004
 pig+0.004
Neg logit contributions
epad-0.013
licks-0.013
rou-0.012
 twice-0.012
minded-0.011
Token was
Token ID373
Feature activation-3.40e-03

Pos logit contributions
epad+0.005
licks+0.005
rou+0.005
 twice+0.005
minded+0.005
Neg logit contributions
 duck-0.002
 umbrella-0.002
 kids-0.002
 Ph-0.002
Ph-0.002
Token spoiled
Token ID34781
Feature activation-1.47e-02

Pos logit contributions
 duck+0.003
 umbrella+0.003
 kids+0.003
 Ph+0.003
 best+0.003
Neg logit contributions
licks-0.007
epad-0.007
minded-0.006
rou-0.006
 habitats-0.006
Token.
Token ID13
Feature activation+7.70e-03

Pos logit contributions
 duck+0.012
 umbrella+0.012
 kids+0.012
 Ph+0.011
 best+0.011
Neg logit contributions
epad-0.032
licks-0.031
rou-0.029
 twice-0.029
minded-0.028
Token It
Token ID632
Feature activation+6.82e-03

Pos logit contributions
epad+0.012
licks+0.012
rou+0.011
 twice+0.011
minded+0.010
Neg logit contributions
 duck-0.011
 umbrella-0.010
 kids-0.010
 Ph-0.010
Ph-0.010
Token the
Token ID262
Feature activation+1.87e-03

Pos logit contributions
 kids+0.014
 umbrella+0.013
 duck+0.012
 best+0.012
 Ph+0.012
Neg logit contributions
licks-0.032
epad-0.031
rou-0.029
 Does-0.029
 twice-0.028
Token go
Token ID467
Feature activation-5.23e-03

Pos logit contributions
licks+0.004
epad+0.004
rou+0.003
 twice+0.003
 Does+0.003
Neg logit contributions
 umbrella-0.002
 duck-0.002
 kids-0.002
 best-0.002
 Ph-0.002
Token.
Token ID13
Feature activation+1.38e-02

Pos logit contributions
 duck+0.006
 umbrella+0.006
 kids+0.006
 Ph+0.005
 best+0.005
Neg logit contributions
epad-0.010
licks-0.010
rou-0.009
 twice-0.009
 Does-0.009
Token One
Token ID1881
Feature activation+1.37e-03

Pos logit contributions
epad+0.026
licks+0.025
rou+0.023
 twice+0.023
minded+0.022
Neg logit contributions
 duck-0.016
 kids-0.015
 umbrella-0.015
 Ph-0.015
Ph-0.014
Token day
Token ID1110
Feature activation-9.04e-04

Pos logit contributions
epad+0.002
 early+0.002
 twice+0.002
licks+0.002
oes+0.002
Neg logit contributions
 Ph-0.002
Ph-0.001
 duck-0.001
 umbrella-0.001
 kids-0.001
Token,
Token ID11
Feature activation+2.54e-03

Pos logit contributions
 duck+0.001
 kids+0.001
 umbrella+0.001
 Ph+0.001
 best+0.001
Neg logit contributions
epad-0.002
licks-0.002
rou-0.002
minded-0.002
 habitats-0.002
Token M
Token ID337
Feature activation+2.17e-02

Pos logit contributions
epad+0.005
licks+0.005
rou+0.004
minded+0.004
 twice+0.004
Neg logit contributions
 duck-0.003
 kids-0.003
 umbrella-0.003
 Ph-0.003
 best-0.003
Tokenandy
Token ID10757
Feature activation-6.35e-03

Pos logit contributions
licks+0.015
rou+0.015
epad+0.015
 twice+0.013
 habitats+0.012
Neg logit contributions
 duck-0.047
Ph-0.046
 pig-0.046
 kids-0.046
 umbrella-0.046
Token was
Token ID373
Feature activation-8.40e-03

Pos logit contributions
 duck+0.008
 umbrella+0.008
 Ph+0.008
 kids+0.008
Ph+0.007
Neg logit contributions
epad-0.010
licks-0.010
 twice-0.010
rou-0.009
minded-0.009
Token in
Token ID287
Feature activation+1.11e-02

Pos logit contributions
 duck+0.010
 kids+0.009
 umbrella+0.009
 Ph+0.009
 best+0.009
Neg logit contributions
epad-0.015
licks-0.015
rou-0.014
 twice-0.014
minded-0.013
Token her
Token ID607
Feature activation-6.38e-03

Pos logit contributions
epad+0.023
licks+0.023
 twice+0.021
rou+0.021
 Does+0.021
Neg logit contributions
 duck-0.009
 umbrella-0.009
Ph-0.009
 Ph-0.009
 kids-0.008
Token as
Token ID355
Feature activation+7.95e-03

Pos logit contributions
 kids+0.008
 duck+0.007
 umbrella+0.007
 best+0.007
 boys+0.007
Neg logit contributions
licks-0.013
epad-0.012
rou-0.012
 Does-0.011
 twice-0.011
Token new
Token ID649
Feature activation-4.75e-03

Pos logit contributions
licks+0.016
epad+0.016
rou+0.014
 twice+0.014
 Does+0.014
Neg logit contributions
 duck-0.008
 kids-0.008
 umbrella-0.007
 Ph-0.007
 best-0.007
Token.
Token ID13
Feature activation-8.18e-03

Pos logit contributions
 duck+0.004
 kids+0.004
 umbrella+0.004
 Ph+0.004
 best+0.004
Neg logit contributions
epad-0.010
licks-0.010
rou-0.009
 twice-0.009
 Does-0.009
Token From
Token ID3574
Feature activation+7.62e-03

Pos logit contributions
 kids+0.009
 Ph+0.008
 duck+0.008
Ph+0.008
 umbrella+0.008
Neg logit contributions
epad-0.016
licks-0.016
rou-0.014
minded-0.014
ems-0.014
Token that
Token ID326
Feature activation+6.12e-03

Pos logit contributions
epad+0.015
licks+0.015
 twice+0.013
rou+0.013
 Does+0.013
Neg logit contributions
 duck-0.008
 umbrella-0.008
 kids-0.008
 Ph-0.008
Ph-0.007
Token day
Token ID1110
Feature activation+8.51e-04

Pos logit contributions
epad+0.014
licks+0.014
 twice+0.013
rou+0.013
 Does+0.012
Neg logit contributions
 duck-0.004
 umbrella-0.004
 kids-0.004
 Ph-0.004
Ph-0.004
Token on
Token ID319
Feature activation-1.26e-02

Pos logit contributions
licks+0.001
epad+0.001
 twice+0.001
 Does+0.001
rou+0.001
Neg logit contributions
 duck-0.001
 umbrella-0.001
 kids-0.001
 Ph-0.001
Ph-0.001
Token,
Token ID11
Feature activation-7.16e-03

Pos logit contributions
 duck+0.014
 umbrella+0.014
 Ph+0.014
Ph+0.014
 kids+0.014
Neg logit contributions
licks-0.022
epad-0.022
 twice-0.021
 Does-0.020
rou-0.019
Token the
Token ID262
Feature activation-1.21e-02

Pos logit contributions
 umbrella+0.007
 duck+0.007
 Ph+0.007
Ph+0.007
 kids+0.007
Neg logit contributions
epad-0.014
licks-0.014
 twice-0.013
rou-0.012
 Does-0.012
Token boy
Token ID2933
Feature activation+9.99e-04

Pos logit contributions
 Ph+0.010
Ph+0.009
n+0.009
 umbrella+0.008
 duck+0.007
Neg logit contributions
epad-0.025
rou-0.024
licks-0.023
minded-0.023
 Does-0.023
Token made
Token ID925
Feature activation-1.42e-02

Pos logit contributions
epad+0.002
licks+0.002
 twice+0.002
rou+0.002
 Does+0.002
Neg logit contributions
 duck-0.001
 kids-0.001
 umbrella-0.001
 best-0.001
 Ph-0.001
TokenThe
Token ID464
Feature activation-3.67e-03

Pos logit contributions
 umbrella+0.006
 best+0.006
 kids+0.006
 duck+0.006
 Ph+0.006
Neg logit contributions
licks-0.008
epad-0.008
rou-0.007
 twice-0.007
 Does-0.007
Token end
Token ID886
Feature activation+1.02e-02

Pos logit contributions
 Ph+0.003
 duck+0.003
 umbrella+0.003
Ph+0.003
 kids+0.003
Neg logit contributions
epad-0.008
licks-0.007
rou-0.007
 twice-0.007
minded-0.007
Token.
Token ID13
Feature activation-3.35e-03

Pos logit contributions
epad+0.022
licks+0.022
rou+0.020
 twice+0.020
minded+0.020
Neg logit contributions
 duck-0.008
 kids-0.008
 umbrella-0.008
 Ph-0.008
Ph-0.007
Token<|endoftext|>
Token ID50256
Feature activation+4.67e-03

Pos logit contributions
 umbrella+0.003
 duck+0.003
 kids+0.003
 best+0.003
 Ph+0.003
Neg logit contributions
licks-0.007
epad-0.007
 Does-0.006
 twice-0.006
rou-0.006
TokenL
Token ID43
Feature activation+7.55e-03

Pos logit contributions
licks+0.007
epad+0.006
 Does+0.006
 twice+0.005
 Enjoy+0.005
Neg logit contributions
 umbrella-0.008
 Ph-0.007
 duck-0.007
Ph-0.007
 pig-0.007
Tokenily
Token ID813
Feature activation+7.57e-04

Pos logit contributions
licks+0.011
epad+0.011
 twice+0.010
asper+0.010
rou+0.010
Neg logit contributions
 duck-0.011
 tire-0.010
 Ph-0.010
Ph-0.010
 je-0.010
Token liked
Token ID8288
Feature activation-2.87e-02

Pos logit contributions
epad+0.002
licks+0.002
 twice+0.002
rou+0.002
 Does+0.002
Neg logit contributions
 duck-0.001
 kids-0.001
 umbrella-0.001
 pig-0.001
 Ph-0.000
Token to
Token ID284
Feature activation-1.74e-02

Pos logit contributions
 duck+0.046
 kids+0.045
 umbrella+0.045
 Ph+0.043
 pig+0.042
Neg logit contributions
licks-0.041
epad-0.040
 twice-0.035
rou-0.035
 Does-0.034
Token climb
Token ID12080
Feature activation-1.81e-02

Pos logit contributions
 duck+0.023
Ph+0.023
 Ph+0.023
 kids+0.023
 umbrella+0.022
Neg logit contributions
epad-0.028
licks-0.026
 twice-0.024
 recite-0.024
rou-0.023
Token.
Token ID13
Feature activation+9.69e-03

Pos logit contributions
 duck+0.020
 kids+0.020
Ph+0.019
 Ph+0.019
 umbrella+0.019
Neg logit contributions
licks-0.032
epad-0.031
 twice-0.029
rou-0.028
minded-0.028
Token She
Token ID1375
Feature activation-5.95e-03

Pos logit contributions
epad+0.014
licks+0.013
 twice+0.012
rou+0.012
 Does+0.011
Neg logit contributions
 duck-0.015
 umbrella-0.015
 kids-0.015
 Ph-0.015
Ph-0.014
Token the
Token ID262
Feature activation+1.26e-03

Pos logit contributions
epad+0.036
licks+0.035
rou+0.033
 twice+0.032
 Does+0.032
Neg logit contributions
 duck-0.013
 kids-0.012
 umbrella-0.012
 Ph-0.011
 best-0.011
Token shiny
Token ID22441
Feature activation+6.30e-03

Pos logit contributions
epad+0.003
licks+0.003
rou+0.002
 twice+0.002
 Does+0.002
Neg logit contributions
 duck-0.001
 umbrella-0.001
 kids-0.001
 Ph-0.001
Ph-0.001
Token metal
Token ID6147
Feature activation+2.21e-03

Pos logit contributions
licks+0.014
epad+0.014
rou+0.013
 Does+0.012
minded+0.012
Neg logit contributions
 umbrella-0.006
 duck-0.006
 kids-0.005
 pig-0.005
 k-0.005
Token and
Token ID290
Feature activation+2.86e-03

Pos logit contributions
epad+0.005
licks+0.005
rou+0.005
minded+0.005
 Does+0.004
Neg logit contributions
 duck-0.002
 umbrella-0.002
 kids-0.002
 pig-0.002
 k-0.002
Token the
Token ID262
Feature activation-3.88e-03

Pos logit contributions
licks+0.007
epad+0.006
rou+0.006
asper+0.006
oes+0.006
Neg logit contributions
 kids-0.002
 duck-0.002
 best-0.002
 Ph-0.002
 k-0.002
Token way
Token ID835
Feature activation-2.70e-02

Pos logit contributions
 duck+0.004
 kids+0.003
 umbrella+0.003
 Ph+0.003
 best+0.003
Neg logit contributions
licks-0.008
epad-0.008
rou-0.007
 twice-0.007
 Does-0.007
Token it
Token ID340
Feature activation-2.14e-03

Pos logit contributions
 duck+0.041
 kids+0.040
 umbrella+0.040
 Ph+0.039
Ph+0.039
Neg logit contributions
epad-0.039
licks-0.038
 twice-0.035
rou-0.034
 Does-0.032
Token opened
Token ID4721
Feature activation+2.89e-03

Pos logit contributions
 duck+0.002
 umbrella+0.002
 kids+0.002
 Ph+0.002
Ph+0.002
Neg logit contributions
epad-0.004
licks-0.004
rou-0.004
 twice-0.004
 Does-0.003
Token and
Token ID290
Feature activation+6.86e-03

Pos logit contributions
licks+0.007
epad+0.007
 Does+0.006
ems+0.006
auldron+0.006
Neg logit contributions
 duck-0.002
 best-0.002
 umbrella-0.002
 Ph-0.002
 kids-0.002
Token closed
Token ID4838
Feature activation-8.57e-03

Pos logit contributions
epad+0.015
licks+0.015
rou+0.014
 habitats+0.014
 facts+0.014
Neg logit contributions
 duck-0.006
 kids-0.006
 two-0.006
 Ph-0.006
 bird-0.006
Token.
Token ID13
Feature activation+6.21e-03

Pos logit contributions
 duck+0.007
 umbrella+0.006
 kids+0.006
 Ph+0.006
 pig+0.006
Neg logit contributions
epad-0.019
licks-0.019
rou-0.018
 Does-0.017
minded-0.017
Token,
Token ID11
Feature activation-1.12e-03

Pos logit contributions
 duck+0.003
 kids+0.003
 umbrella+0.003
 Ph+0.003
Ph+0.003
Neg logit contributions
epad-0.007
licks-0.007
rou-0.007
 twice-0.007
 habitats-0.006
Token �
Token ID6184
Feature activation+1.55e-02

Pos logit contributions
 duck+0.001
 umbrella+0.001
 kids+0.001
 Ph+0.001
Ph+0.001
Neg logit contributions
epad-0.002
licks-0.002
 twice-0.002
rou-0.002
 Does-0.002
Token�
Token ID95
Feature activation+3.88e-03

Pos logit contributions
licks+0.018
rou+0.018
epad+0.017
 twice+0.017
minded+0.017
Neg logit contributions
 kids-0.027
 umbrella-0.026
 best-0.026
 boys-0.026
 duck-0.025
Token€
Token ID26391
Feature activation-1.64e-02

Pos logit contributions
epad+0.005
licks+0.005
 twice+0.005
minded+0.005
rou+0.005
Neg logit contributions
 umbrella-0.007
 kids-0.006
 duck-0.006
 best-0.006
 boys-0.006
Token�
Token ID129
Feature activation+7.82e-03

Pos logit contributions
 umbrella+0.022
 duck+0.022
 Ph+0.022
 kids+0.022
 best+0.020
Neg logit contributions
epad-0.025
licks-0.025
rou-0.024
 twice-0.023
 Does-0.023
Token�
Token ID241
Feature activation-2.39e-02

Pos logit contributions
epad+0.012
licks+0.012
 twice+0.011
rou+0.011
 attent+0.011
Neg logit contributions
 duck-0.011
 kids-0.011
 umbrella-0.011
 best-0.010
 boys-0.010
TokenYou
Token ID1639
Feature activation+3.66e-03

Pos logit contributions
 umbrella+0.032
 duck+0.030
 kids+0.029
 pig+0.029
 best+0.028
Neg logit contributions
epad-0.040
licks-0.038
rou-0.037
 twice-0.036
 Does-0.035
Tokenâ
Token ID22940
Feature activation-4.37e-03

Pos logit contributions
epad+0.006
licks+0.006
 twice+0.006
rou+0.006
 Does+0.005
Neg logit contributions
 duck-0.004
 umbrella-0.004
 Ph-0.004
 kids-0.004
Ph-0.004
Token€
Token ID26391
Feature activation-2.22e-02

Pos logit contributions
 duck+0.006
 umbrella+0.006
 kids+0.006
 pig+0.006
 best+0.006
Neg logit contributions
licks-0.007
epad-0.007
 twice-0.006
minded-0.006
 Enjoy-0.006
Tokenâ„¢
Token ID8151
Feature activation-1.25e-02

Pos logit contributions
 umbrella+0.030
 duck+0.029
 Ph+0.028
 kids+0.028
 best+0.027
Neg logit contributions
licks-0.035
epad-0.035
rou-0.032
 Does-0.032
 twice-0.031
Tokenre
Token ID260
Feature activation-2.98e-03

Pos logit contributions
 duck+0.019
 umbrella+0.019
 kids+0.019
 Ph+0.018
Ph+0.018
Neg logit contributions
licks-0.018
epad-0.018
rou-0.016
 twice-0.016
 Does-0.015
Token€
Token ID26391
Feature activation-2.18e-02

Pos logit contributions
 duck+0.013
 umbrella+0.012
 kids+0.011
 pig+0.011
 je+0.011
Neg logit contributions
epad-0.017
licks-0.017
 twice-0.016
minded-0.015
ems-0.015
Tokenâ„¢
Token ID8151
Feature activation-1.11e-02

Pos logit contributions
 duck+0.027
 umbrella+0.027
 Ph+0.024
 kids+0.024
 best+0.024
Neg logit contributions
epad-0.037
licks-0.037
 twice-0.035
 Does-0.035
rou-0.034
Tokens
Token ID82
Feature activation-8.95e-03

Pos logit contributions
 duck+0.018
 umbrella+0.018
 kids+0.017
 Ph+0.017
Ph+0.017
Neg logit contributions
licks-0.015
epad-0.015
 twice-0.013
rou-0.013
 Does-0.012
Token nice
Token ID3621
Feature activation-1.01e-02

Pos logit contributions
 duck+0.007
 umbrella+0.007
 kids+0.007
 Ph+0.007
 best+0.007
Neg logit contributions
licks-0.019
epad-0.019
rou-0.018
minded-0.018
 twice-0.017
Token and
Token ID290
Feature activation-8.60e-03

Pos logit contributions
 kids+0.012
 duck+0.011
 umbrella+0.011
 best+0.011
 k+0.010
Neg logit contributions
licks-0.020
epad-0.019
minded-0.017
rou-0.017
 Does-0.017
Token warm
Token ID5814
Feature activation-2.39e-02

Pos logit contributions
 kids+0.009
 duck+0.009
 umbrella+0.008
 best+0.008
 Ph+0.008
Neg logit contributions
licks-0.017
epad-0.017
rou-0.015
minded-0.015
 twice-0.015
Token.
Token ID13
Feature activation-6.47e-03

Pos logit contributions
 duck+0.026
 umbrella+0.025
 kids+0.025
 Ph+0.024
 best+0.023
Neg logit contributions
epad-0.047
licks-0.046
rou-0.042
 twice-0.041
minded-0.041
Token Thank
Token ID6952
Feature activation+1.00e-02

Pos logit contributions
 duck+0.007
 umbrella+0.007
 kids+0.007
 Ph+0.006
Ph+0.006
Neg logit contributions
epad-0.012
licks-0.012
 twice-0.011
rou-0.011
 Does-0.011
Token you
Token ID345
Feature activation+1.03e-02

Pos logit contributions
 twice+0.012
licks+0.012
 Enjoy+0.012
 Does+0.012
epad+0.012
Neg logit contributions
 umbrella-0.016
 tire-0.014
 duck-0.014
 je-0.014
Ph-0.014
Token,
Token ID11
Feature activation-1.29e-02

Pos logit contributions
epad+0.017
licks+0.017
rou+0.015
 habitats+0.015
minded+0.015
Neg logit contributions
 duck-0.013
 kids-0.013
 umbrella-0.013
 Ph-0.013
 best-0.013
Token Mama
Token ID40084
Feature activation-1.23e-02

Pos logit contributions
 duck+0.015
 umbrella+0.014
 kids+0.014
 Ph+0.014
Ph+0.013
Neg logit contributions
epad-0.023
licks-0.023
 twice-0.021
rou-0.021
 Does-0.021
Token their
Token ID511
Feature activation-1.81e-02

Pos logit contributions
 Ph+0.027
 duck+0.026
Ph+0.026
 umbrella+0.025
n+0.024
Neg logit contributions
epad-0.043
licks-0.041
 twice-0.041
rou-0.039
 early-0.038
Token things
Token ID1243
Feature activation-1.23e-02

Pos logit contributions
 Ph+0.019
Ph+0.018
 duck+0.016
 umbrella+0.016
ter+0.015
Neg logit contributions
 habitats-0.032
epad-0.032
 facts-0.031
rou-0.030
licks-0.030
Token.
Token ID13
Feature activation-7.90e-03

Pos logit contributions
 duck+0.013
 kids+0.012
 umbrella+0.012
 Ph+0.012
Ph+0.012
Neg logit contributions
epad-0.023
licks-0.023
 twice-0.021
rou-0.021
minded-0.020
Token On
Token ID1550
Feature activation-1.44e-02

Pos logit contributions
 duck+0.009
 umbrella+0.009
 kids+0.009
 Ph+0.008
Ph+0.008
Neg logit contributions
epad-0.014
licks-0.014
 twice-0.013
rou-0.013
 Does-0.013
Token their
Token ID511
Feature activation-1.26e-02

Pos logit contributions
 duck+0.016
 kids+0.015
 umbrella+0.015
 Esk+0.015
 boys+0.014
Neg logit contributions
 twice-0.024
epad-0.024
licks-0.023
rou-0.022
 early-0.022
Token way
Token ID835
Feature activation-3.00e-02

Pos logit contributions
ter+0.022
 Ph+0.021
ily+0.020
 duck+0.020
aver+0.019
Neg logit contributions
 habitats-0.014
by-0.013
 solar-0.013
 int-0.013
 watches-0.012
Token home
Token ID1363
Feature activation-9.90e-03

Pos logit contributions
 duck+0.035
 Ph+0.034
 kids+0.034
 umbrella+0.033
Ph+0.033
Neg logit contributions
epad-0.049
licks-0.048
 twice-0.047
rou-0.044
oes-0.043
Token,
Token ID11
Feature activation-8.91e-03

Pos logit contributions
 duck+0.009
 kids+0.008
 umbrella+0.008
 Ph+0.008
 best+0.008
Neg logit contributions
epad-0.021
licks-0.021
rou-0.019
 twice-0.019
 Does-0.019
Token they
Token ID484
Feature activation-7.65e-03

Pos logit contributions
 duck+0.007
 kids+0.007
 umbrella+0.007
 Ph+0.007
Ph+0.006
Neg logit contributions
epad-0.019
licks-0.019
rou-0.018
 twice-0.018
minded-0.017
Token told
Token ID1297
Feature activation+1.45e-02

Pos logit contributions
 duck+0.008
 kids+0.008
 umbrella+0.008
 Ph+0.008
Ph+0.008
Neg logit contributions
epad-0.014
licks-0.014
rou-0.013
 twice-0.013
 Does-0.013
Token stories
Token ID3923
Feature activation+1.21e-04

Pos logit contributions
epad+0.029
licks+0.028
 twice+0.027
rou+0.026
minded+0.026
Neg logit contributions
 duck-0.014
 umbrella-0.013
Ph-0.013
 Ph-0.013
 kids-0.013
Token the
Token ID262
Feature activation-8.36e-03

Pos logit contributions
licks+0.001
epad+0.001
rou+0.001
minded+0.001
 twice+0.001
Neg logit contributions
 duck-0.001
 kids-0.001
 umbrella-0.001
 best-0.001
 Ph-0.001
Token belt
Token ID10999
Feature activation-1.96e-02

Pos logit contributions
Ph+0.007
 Ph+0.007
 umbrella+0.006
 duck+0.006
n+0.006
Neg logit contributions
epad-0.017
licks-0.016
 twice-0.016
rou-0.016
minded-0.015
Token around
Token ID1088
Feature activation-7.40e-03

Pos logit contributions
 duck+0.021
 Ph+0.020
 umbrella+0.020
 kids+0.020
Ph+0.020
Neg logit contributions
epad-0.036
licks-0.036
 twice-0.033
rou-0.033
minded-0.031
Token Jack
Token ID3619
Feature activation+4.26e-03

Pos logit contributions
 duck+0.006
 kids+0.006
 umbrella+0.006
 Ph+0.006
 best+0.005
Neg logit contributions
epad-0.016
licks-0.016
rou-0.015
 twice-0.014
 Does-0.014
Token's
Token ID338
Feature activation-1.20e-02

Pos logit contributions
epad+0.009
licks+0.009
rou+0.008
 twice+0.008
 Does+0.008
Neg logit contributions
 duck-0.004
 kids-0.004
 umbrella-0.004
 Ph-0.004
Ph-0.004
Token waist
Token ID16139
Feature activation-2.48e-02

Pos logit contributions
 Ph+0.015
Ph+0.014
 duck+0.013
n+0.013
 umbrella+0.012
Neg logit contributions
epad-0.021
rou-0.020
licks-0.019
 habitats-0.019
 twice-0.019
Token prevented
Token ID13351
Feature activation+7.81e-03

Pos logit contributions
 duck+0.022
 umbrella+0.021
 kids+0.021
 Ph+0.020
 best+0.020
Neg logit contributions
epad-0.051
licks-0.051
 twice-0.046
rou-0.046
minded-0.046
Token his
Token ID465
Feature activation-1.28e-03

Pos logit contributions
epad+0.014
licks+0.013
 twice+0.013
rou+0.012
minded+0.012
Neg logit contributions
 duck-0.009
 umbrella-0.009
 kids-0.009
Ph-0.009
 Ph-0.009
Token coat
Token ID13209
Feature activation-1.80e-02

Pos logit contributions
 Ph+0.001
Ph+0.001
 duck+0.001
 umbrella+0.001
n+0.001
Neg logit contributions
epad-0.003
licks-0.002
rou-0.002
 twice-0.002
minded-0.002
Token from
Token ID422
Feature activation-1.33e-04

Pos logit contributions
 duck+0.022
 kids+0.021
 umbrella+0.021
 Ph+0.021
Ph+0.020
Neg logit contributions
epad-0.032
licks-0.031
 twice-0.029
rou-0.028
minded-0.027
Token falling
Token ID7463
Feature activation-3.39e-03

Pos logit contributions
 duck+0.000
 kids+0.000
 umbrella+0.000
 Ph+0.000
 best+0.000
Neg logit contributions
licks-0.000
epad-0.000
rou-0.000
 twice-0.000
 Does-0.000