TOP ACTIVATIONS
MAX = 0.052

Daisy heard a noise and became very scared . She quickly
heard a peculiar whistle and became very curious . He wanted
with him , and he tre asured it always . <|endoftext|>
storm came and the water became very cold . Fin ny
noise appeared and the cats became very scared . They noticed
climb it and see what is on the roof .
take turns playing with what is inside ." Anna
the ladder and see what is on the roof . Sam
His friends were scared and became angry . One of them
a loud noise and he became very scared . His mom
should be compassionate . Comp assion ate means you care about
he lf and see what is on the top ."
heard a loud noise and became very worried . She didn
a loud noise and he became frightened . He called out
Look at this , Bob ," said his mom my .
an old man . He tre asured it , for it
, Bob jumped forward and snatched it back . Susan was
catch them . The lion became nervous and said , "
. Do you want to replace your car with mine ?"
ed her yellow shell it became very friendly . From then

MIN ACTIVATIONS
MIN = -0.064

tree and picked the biggest , rip est olive she could
decided to buy the biggest , highest cel ery she could
, she found a small , rough purse . The purse
, she drew a big , pretty bird . She showed
up and saw the big , blue sky . Blue was
. He grabbed some big , p uffy snow balls ,
. He saw a big , hairy spider on the ceiling
He put on his big , wide gloves and grabbed a
He had the brightest yellow , the darkest black and the
, Lily found a big , blank paper . She thought
. It included a big , white dove . It was
wanted to explore the big , tall world outside of his
. It was a big , high market . All the
, she found a big , white sheet of paper and
It was a small , wo olly puppy called Max .
. They put a soft , warm blanket on Max .
wanted to design a big , black anchor .
spot and saw a big , beautiful tree with tasty fruits
. They brought a big , red nap kin to sit
They also bought a big , sour , green apple to

INTERVAL 0.023 - 0.052
CONTAINS 0.904%

from L izz ie and inspected it .
and his friends cheered . Then , they all sat down
she heard loud noises coming from her house . Izzy quickly
and a leash . He opened the fence and took out
So Bear y wiped and wiped the fridge with his little

INTERVAL -0.006 - 0.023
CONTAINS 72.091%

Joe thanked the children and gave them a
them . Sara and Tom are scared . They run away
watching him carefully , because this model was very complicated .
box ?" Sam asks . " I don 't
. She never wanted to let it go ever again .

INTERVAL -0.035 - -0.006
CONTAINS 26.926%

to your instincts . You never know when something might split
happy . He held the balloon tight and it lifted him
then he saw something strange . The cat had vanished !
?" Anna asked . She yawn ed and rubbed her eyes
with their ball , happy to share and be friends .

INTERVAL -0.064 - -0.035
CONTAINS 0.078%

their wounds . They gave them some water and ice cream
journey to see the big , beautiful mountains . Her mom
Anna . She was 3 years old . One day ,
. It was a big , brown worm !
would pick up her cr ay ons and start to draw
Token Daisy
Token ID40355
Feature activation+1.08e-02

Pos logit contributions
Wow+0.021
L+0.019
 sounded+0.019
 matter+0.018
Okay+0.018
Neg logit contributions
oured-0.020
umbs-0.020
 sandwiches-0.020
 hind-0.019
ulent-0.019
Token heard
Token ID2982
Feature activation-1.58e-02

Pos logit contributions
 sounded+0.011
L+0.011
Wow+0.011
Okay+0.010
 matter+0.009
Neg logit contributions
 hind-0.021
 sandwiches-0.021
gary-0.020
umbs-0.020
oured-0.020
Token a
Token ID257
Feature activation+7.83e-03

Pos logit contributions
 hind+0.027
oured+0.027
umbs+0.026
ulent+0.026
oops+0.026
Neg logit contributions
 sounded-0.022
L-0.020
Wow-0.020
er-0.018
 matter-0.018
Token noise
Token ID7838
Feature activation-3.75e-03

Pos logit contributions
Wow+0.012
L+0.011
 sounded+0.011
Okay+0.011
cer+0.011
Neg logit contributions
 sandwiches-0.011
 hind-0.010
oured-0.010
 fishing-0.010
ulent-0.010
Token and
Token ID290
Feature activation+2.16e-02

Pos logit contributions
 sandwiches+0.005
oured+0.005
 hind+0.005
ulent+0.005
umbs+0.005
Neg logit contributions
Wow-0.005
 sounded-0.005
L-0.005
Okay-0.005
 matter-0.005
Token became
Token ID2627
Feature activation+5.24e-02

Pos logit contributions
Wow+0.030
L+0.028
Okay+0.028
 matter+0.028
 sounded+0.028
Neg logit contributions
 sandwiches-0.032
oured-0.031
 hind-0.031
ulent-0.030
 tum-0.030
Token very
Token ID845
Feature activation+8.99e-03

Pos logit contributions
Wow+0.105
Okay+0.099
L+0.098
 matter+0.098
 sounded+0.096
Neg logit contributions
 fishing-0.049
 sandwiches-0.046
 hind-0.045
 tum-0.042
 thirsty-0.041
Token scared
Token ID12008
Feature activation-1.15e-02

Pos logit contributions
Wow+0.017
 sounded+0.016
L+0.016
Okay+0.016
 matter+0.016
Neg logit contributions
 fishing-0.009
 sandwiches-0.009
 hind-0.009
 tum-0.008
oured-0.008
Token.
Token ID13
Feature activation+6.10e-03

Pos logit contributions
 sandwiches+0.022
 tum+0.021
 fishing+0.021
umbs+0.021
ulent+0.021
Neg logit contributions
Wow-0.012
Okay-0.011
OK-0.011
 narrative-0.010
Really-0.010
Token She
Token ID1375
Feature activation+2.47e-02

Pos logit contributions
Wow+0.007
 sounded+0.007
L+0.006
Okay+0.006
OK+0.006
Neg logit contributions
 sandwiches-0.011
oured-0.011
gary-0.011
umbs-0.011
ulent-0.011
Token quickly
Token ID2952
Feature activation-4.43e-05

Pos logit contributions
Wow+0.035
cer+0.034
 narrative+0.033
ator+0.033
Okay+0.033
Neg logit contributions
 hind-0.035
oured-0.033
 tum-0.033
 sandwiches-0.032
ulent-0.032
Token heard
Token ID2982
Feature activation-9.91e-03

Pos logit contributions
 sandwiches+0.010
 hind+0.009
oured+0.009
ulent+0.009
umbs+0.009
Neg logit contributions
Wow-0.009
 sounded-0.009
L-0.009
 matter-0.008
Okay-0.008
Token a
Token ID257
Feature activation+8.53e-03

Pos logit contributions
 sandwiches+0.012
oured+0.012
 hind+0.012
umbs+0.012
ulent+0.012
Neg logit contributions
Wow-0.017
 sounded-0.016
L-0.016
 matter-0.015
Okay-0.015
Token peculiar
Token ID19501
Feature activation+1.81e-02

Pos logit contributions
Wow+0.014
cer+0.014
Okay+0.013
Fine+0.013
L+0.013
Neg logit contributions
 hind-0.010
 sandwiches-0.010
 fishing-0.010
 tum-0.010
ulent-0.009
Token whistle
Token ID16121
Feature activation+4.88e-03

Pos logit contributions
Wow+0.019
L+0.018
Okay+0.018
Fine+0.017
OK+0.016
Neg logit contributions
 sandwiches-0.033
 hind-0.032
 fishing-0.032
 tum-0.032
oured-0.031
Token and
Token ID290
Feature activation+2.47e-02

Pos logit contributions
Wow+0.006
Okay+0.006
Fine+0.005
L+0.005
Really+0.005
Neg logit contributions
 fishing-0.008
 sandwiches-0.008
umbs-0.008
oured-0.008
 tum-0.008
Token became
Token ID2627
Feature activation+5.24e-02

Pos logit contributions
 narrative+0.036
Wow+0.036
Okay+0.035
L+0.034
 matter+0.034
Neg logit contributions
 tum-0.032
oured-0.032
 fishing-0.031
 sandwiches-0.031
 hind-0.031
Token very
Token ID845
Feature activation+1.31e-02

Pos logit contributions
Wow+0.102
L+0.098
Okay+0.097
 matter+0.095
 sounded+0.093
Neg logit contributions
 fishing-0.052
 hind-0.048
 sandwiches-0.048
ulent-0.045
 tum-0.044
Token curious
Token ID11040
Feature activation+4.29e-04

Pos logit contributions
Wow+0.025
L+0.024
 sounded+0.024
Okay+0.023
 matter+0.023
Neg logit contributions
 fishing-0.013
 sandwiches-0.013
 hind-0.013
oured-0.012
 tum-0.012
Token.
Token ID13
Feature activation+1.07e-02

Pos logit contributions
Wow+0.000
L+0.000
Okay+0.000
OK+0.000
Fine+0.000
Neg logit contributions
 sandwiches-0.001
 tum-0.001
 fishing-0.001
oured-0.001
 hind-0.001
Token He
Token ID679
Feature activation+1.45e-02

Pos logit contributions
 matter+0.015
 narrative+0.015
Wow+0.015
Okay+0.015
 sounded+0.015
Neg logit contributions
 fishing-0.015
oured-0.015
 tum-0.015
 hind-0.014
ulent-0.014
Token wanted
Token ID2227
Feature activation-1.90e-02

Pos logit contributions
Wow+0.020
 narrative+0.019
Okay+0.019
L+0.018
cer+0.018
Neg logit contributions
oured-0.021
 hind-0.021
 fishing-0.020
ulent-0.020
 sandwiches-0.020
Token with
Token ID351
Feature activation+1.84e-02

Pos logit contributions
 sandwiches+0.020
umbs+0.020
 hind+0.020
oured+0.020
ulent+0.020
Neg logit contributions
L-0.007
 sounded-0.007
er-0.007
cer-0.007
Wow-0.006
Token him
Token ID683
Feature activation-6.38e-04

Pos logit contributions
Wow+0.023
 sounded+0.022
L+0.021
 matter+0.020
Okay+0.020
Neg logit contributions
 sandwiches-0.031
oured-0.030
 hind-0.030
ulent-0.030
umbs-0.030
Token,
Token ID11
Feature activation-2.39e-03

Pos logit contributions
 hind+0.001
umbs+0.001
 sandwiches+0.001
gary+0.001
oured+0.001
Neg logit contributions
 sounded-0.000
L-0.000
Wow-0.000
er-0.000
 matter-0.000
Token and
Token ID290
Feature activation+1.98e-02

Pos logit contributions
umbs+0.004
 sandwiches+0.004
oured+0.004
ulent+0.004
gary+0.003
Neg logit contributions
Wow-0.003
 sounded-0.003
L-0.003
 matter-0.003
Okay-0.003
Token he
Token ID339
Feature activation-6.20e-03

Pos logit contributions
Wow+0.026
 sounded+0.025
L+0.025
Okay+0.024
 matter+0.024
Neg logit contributions
 sandwiches-0.031
oured-0.031
 hind-0.030
umbs-0.030
ulent-0.030
Token tre
Token ID2054
Feature activation+5.21e-02

Pos logit contributions
 sandwiches+0.012
 hind+0.012
ulent+0.012
umbs+0.012
oured+0.011
Neg logit contributions
 sounded-0.006
Wow-0.006
L-0.006
Okay-0.005
 matter-0.005
Tokenasured
Token ID34006
Feature activation+1.81e-02

Pos logit contributions
Wow+0.109
L+0.104
 sounded+0.100
Hey+0.100
OK+0.098
Neg logit contributions
 hind-0.044
oops-0.043
 seen-0.040
oured-0.037
 sandwiches-0.034
Token it
Token ID340
Feature activation+7.40e-03

Pos logit contributions
Wow+0.026
L+0.025
 sounded+0.024
Okay+0.024
 narrative+0.024
Neg logit contributions
 sandwiches-0.027
oured-0.027
 hind-0.026
 fishing-0.025
 tum-0.025
Token always
Token ID1464
Feature activation+8.64e-03

Pos logit contributions
 sounded+0.009
Wow+0.009
L+0.009
 matter+0.008
Okay+0.008
Neg logit contributions
 sandwiches-0.012
umbs-0.012
oured-0.012
 hind-0.012
ulent-0.012
Token.
Token ID13
Feature activation-3.95e-04

Pos logit contributions
Wow+0.013
L+0.012
Okay+0.011
Fine+0.011
E+0.010
Neg logit contributions
oured-0.013
 sandwiches-0.013
 fishing-0.012
ulent-0.012
 hind-0.012
Token<|endoftext|>
Token ID50256
Feature activation+3.72e-04

Pos logit contributions
umbs+0.001
 sandwiches+0.001
 hind+0.001
oured+0.001
 fishing+0.001
Neg logit contributions
Wow-0.001
L-0.001
 matter-0.001
 sounded-0.001
Okay-0.000
Token storm
Token ID6388
Feature activation+1.14e-02

Pos logit contributions
 sounded+0.003
Wow+0.002
 matter+0.002
L+0.002
er+0.002
Neg logit contributions
oured-0.004
umbs-0.004
ulent-0.004
gary-0.003
 sandwiches-0.003
Token came
Token ID1625
Feature activation+1.74e-02

Pos logit contributions
Wow+0.017
Okay+0.016
Really+0.016
 narrative+0.016
Deal+0.016
Neg logit contributions
oured-0.015
umbs-0.015
 sandwiches-0.014
 fishing-0.014
 tum-0.014
Token and
Token ID290
Feature activation+4.62e-03

Pos logit contributions
Wow+0.023
L+0.022
Okay+0.021
 sounded+0.021
 matter+0.021
Neg logit contributions
 sandwiches-0.027
oured-0.027
umbs-0.026
 hind-0.026
ulent-0.026
Token the
Token ID262
Feature activation+1.23e-03

Pos logit contributions
Wow+0.006
 sounded+0.006
L+0.006
Okay+0.006
 matter+0.006
Neg logit contributions
 sandwiches-0.007
oured-0.007
 hind-0.007
umbs-0.007
ulent-0.007
Token water
Token ID1660
Feature activation+4.31e-03

Pos logit contributions
Wow+0.002
 sounded+0.001
L+0.001
Okay+0.001
 matter+0.001
Neg logit contributions
oured-0.002
 sandwiches-0.002
 hind-0.002
ulent-0.002
umbs-0.002
Token became
Token ID2627
Feature activation+5.09e-02

Pos logit contributions
Wow+0.005
L+0.005
 sounded+0.005
Okay+0.005
 matter+0.005
Neg logit contributions
 sandwiches-0.007
oured-0.007
 hind-0.007
umbs-0.007
ulent-0.007
Token very
Token ID845
Feature activation+1.08e-02

Pos logit contributions
Wow+0.096
Okay+0.094
Fine+0.091
L+0.090
OK+0.090
Neg logit contributions
 sandwiches-0.053
 fishing-0.052
 tum-0.050
 hind-0.049
oured-0.045
Token cold
Token ID4692
Feature activation+6.02e-03

Pos logit contributions
Wow+0.022
 sounded+0.020
L+0.020
Okay+0.020
cer+0.020
Neg logit contributions
 sandwiches-0.010
 fishing-0.010
 tum-0.010
 hind-0.010
oured-0.009
Token.
Token ID13
Feature activation+6.78e-03

Pos logit contributions
Wow+0.006
Okay+0.006
L+0.006
Really+0.006
 matter+0.006
Neg logit contributions
 sandwiches-0.012
umbs-0.011
 fishing-0.011
 tum-0.011
oured-0.011
Token Fin
Token ID4463
Feature activation-5.01e-03

Pos logit contributions
Wow+0.007
 sounded+0.006
L+0.006
Okay+0.006
Fine+0.006
Neg logit contributions
gary-0.013
 sandwiches-0.013
oured-0.013
umbs-0.012
 hind-0.012
Tokenny
Token ID3281
Feature activation-7.17e-03

Pos logit contributions
 sandwiches+0.011
oured+0.010
 hind+0.010
ulent+0.010
gary+0.010
Neg logit contributions
Wow-0.004
L-0.004
 sounded-0.004
Okay-0.003
cer-0.003
Token noise
Token ID7838
Feature activation+4.82e-03

Pos logit contributions
 sandwiches+0.006
 hind+0.006
 fishing+0.006
 tum+0.006
ulent+0.006
Neg logit contributions
Wow-0.006
Okay-0.006
L-0.006
cer-0.006
 narrative-0.006
Token appeared
Token ID4120
Feature activation+7.87e-03

Pos logit contributions
Wow+0.006
Okay+0.006
L+0.006
OK+0.006
 narrative+0.006
Neg logit contributions
 sandwiches-0.007
oured-0.007
 fishing-0.007
umbs-0.007
 tum-0.007
Token and
Token ID290
Feature activation+2.62e-03

Pos logit contributions
Wow+0.011
Okay+0.010
 sounded+0.010
L+0.010
 matter+0.010
Neg logit contributions
 sandwiches-0.012
oured-0.012
 hind-0.012
umbs-0.012
ulent-0.012
Token the
Token ID262
Feature activation-2.76e-03

Pos logit contributions
Wow+0.004
L+0.004
Okay+0.004
 sounded+0.004
 matter+0.003
Neg logit contributions
 sandwiches-0.004
oured-0.004
 hind-0.004
ulent-0.004
umbs-0.003
Token cats
Token ID11875
Feature activation-1.17e-02

Pos logit contributions
oured+0.005
 sandwiches+0.005
umbs+0.005
 hind+0.005
gary+0.005
Neg logit contributions
Wow-0.003
L-0.003
 sounded-0.003
 matter-0.003
er-0.003
Token became
Token ID2627
Feature activation+5.01e-02

Pos logit contributions
 sandwiches+0.018
oured+0.017
umbs+0.017
 hind+0.017
ulent+0.017
Neg logit contributions
Wow-0.016
L-0.015
Okay-0.015
 sounded-0.015
 matter-0.015
Token very
Token ID845
Feature activation+9.11e-03

Pos logit contributions
Wow+0.089
L+0.085
Okay+0.084
 sounded+0.084
 matter+0.081
Neg logit contributions
 sandwiches-0.058
 hind-0.055
 fishing-0.055
oured-0.054
ulent-0.053
Token scared
Token ID12008
Feature activation-1.08e-02

Pos logit contributions
Wow+0.016
 sounded+0.016
L+0.015
Okay+0.015
 matter+0.015
Neg logit contributions
 sandwiches-0.010
 hind-0.010
 fishing-0.010
oured-0.010
ulent-0.010
Token.
Token ID13
Feature activation-3.06e-03

Pos logit contributions
 sandwiches+0.021
 fishing+0.021
umbs+0.020
 tum+0.020
oured+0.020
Neg logit contributions
Wow-0.011
Okay-0.010
OK-0.009
Really-0.009
L-0.009
Token They
Token ID1119
Feature activation+3.35e-03

Pos logit contributions
 sandwiches+0.005
oured+0.005
 hind+0.005
gary+0.005
ulent+0.004
Neg logit contributions
Wow-0.004
 sounded-0.004
L-0.004
Okay-0.004
OK-0.004
Token noticed
Token ID6810
Feature activation+8.17e-03

Pos logit contributions
Wow+0.004
L+0.004
Okay+0.004
 sounded+0.004
 matter+0.004
Neg logit contributions
 sandwiches-0.005
oured-0.005
 hind-0.005
ulent-0.005
umbs-0.005
Token climb
Token ID12080
Feature activation+3.50e-03

Pos logit contributions
Wow+0.021
 sounded+0.020
L+0.020
 matter+0.020
Okay+0.019
Neg logit contributions
 sandwiches-0.017
oured-0.017
ulent-0.016
 hind-0.016
umbs-0.016
Token it
Token ID340
Feature activation+1.08e-02

Pos logit contributions
Wow+0.005
 sounded+0.004
L+0.004
Okay+0.004
 matter+0.004
Neg logit contributions
 sandwiches-0.006
oured-0.005
 hind-0.005
ulent-0.005
umbs-0.005
Token and
Token ID290
Feature activation+1.62e-02

Pos logit contributions
Wow+0.009
 sounded+0.008
L+0.008
Okay+0.008
 matter+0.008
Neg logit contributions
 sandwiches-0.022
oured-0.022
 hind-0.022
umbs-0.021
ulent-0.021
Token see
Token ID766
Feature activation+1.69e-02

Pos logit contributions
Wow+0.030
 sounded+0.029
L+0.028
 matter+0.028
Okay+0.028
Neg logit contributions
 sandwiches-0.017
oured-0.017
umbs-0.017
ulent-0.017
 hind-0.017
Token what
Token ID644
Feature activation+1.14e-02

Pos logit contributions
Wow+0.026
L+0.025
Okay+0.025
 sounded+0.024
Fine+0.024
Neg logit contributions
 sandwiches-0.024
 hind-0.022
oured-0.021
 fishing-0.021
oops-0.021
Token is
Token ID318
Feature activation+5.00e-02

Pos logit contributions
Wow+0.011
L+0.011
 sounded+0.010
Okay+0.010
 matter+0.010
Neg logit contributions
 sandwiches-0.022
oured-0.021
 hind-0.021
umbs-0.020
 tum-0.020
Token on
Token ID319
Feature activation+9.02e-03

Pos logit contributions
Wow+0.090
L+0.087
Okay+0.086
 sounded+0.085
cer+0.082
Neg logit contributions
 sandwiches-0.056
 fishing-0.053
 hind-0.051
oops-0.051
oured-0.050
Token the
Token ID262
Feature activation+9.20e-03

Pos logit contributions
Wow+0.016
L+0.015
Okay+0.015
 sounded+0.015
Really+0.014
Neg logit contributions
 sandwiches-0.011
 fishing-0.010
 toast-0.009
 hind-0.009
 several-0.009
Token roof
Token ID9753
Feature activation+1.45e-02

Pos logit contributions
Wow+0.013
 sounded+0.012
L+0.012
Okay+0.012
 matter+0.011
Neg logit contributions
 sandwiches-0.014
oured-0.014
 hind-0.014
ulent-0.013
umbs-0.013
Token.
Token ID13
Feature activation+1.63e-02

Pos logit contributions
Wow+0.013
Okay+0.011
L+0.011
 sounded+0.010
Fine+0.010
Neg logit contributions
 sandwiches-0.030
umbs-0.029
oured-0.029
oops-0.028
 fishing-0.028
Token
Token ID198
Feature activation+5.56e-03

Pos logit contributions
Wow+0.019
 sounded+0.018
 matter+0.018
L+0.017
Okay+0.017
Neg logit contributions
 sandwiches-0.028
oured-0.027
 hind-0.027
umbs-0.027
 tum-0.026
Token take
Token ID1011
Feature activation+1.88e-04

Pos logit contributions
 sandwiches+0.005
oured+0.005
 tum+0.004
umbs+0.004
 hind+0.004
Neg logit contributions
Wow-0.009
 sounded-0.008
Really-0.008
Okay-0.008
L-0.008
Token turns
Token ID4962
Feature activation-5.27e-03

Pos logit contributions
Wow+0.000
L+0.000
 sounded+0.000
 matter+0.000
Okay+0.000
Neg logit contributions
oured-0.000
 hind-0.000
 sandwiches-0.000
ulent-0.000
umbs-0.000
Token playing
Token ID2712
Feature activation+5.07e-03

Pos logit contributions
 hind+0.011
oured+0.011
 sandwiches+0.011
ulent+0.010
umbs+0.010
Neg logit contributions
L-0.004
 sounded-0.004
Wow-0.004
er-0.004
 matter-0.004
Token with
Token ID351
Feature activation+3.43e-03

Pos logit contributions
Wow+0.006
 sounded+0.006
Okay+0.006
 matter+0.006
L+0.005
Neg logit contributions
 sandwiches-0.009
 fishing-0.008
 hind-0.008
oured-0.008
ulent-0.008
Token what
Token ID644
Feature activation+1.41e-03

Pos logit contributions
Wow+0.004
L+0.004
 sounded+0.004
 matter+0.004
Okay+0.004
Neg logit contributions
oured-0.006
 sandwiches-0.006
 hind-0.006
umbs-0.006
ulent-0.006
Token is
Token ID318
Feature activation+4.96e-02

Pos logit contributions
Wow+0.002
 sounded+0.002
L+0.002
Okay+0.001
 matter+0.001
Neg logit contributions
 sandwiches-0.002
oured-0.002
 hind-0.002
ulent-0.002
umbs-0.002
Token inside
Token ID2641
Feature activation+1.98e-02

Pos logit contributions
Wow+0.097
Okay+0.091
 sounded+0.091
L+0.090
Fine+0.087
Neg logit contributions
 sandwiches-0.053
 fishing-0.050
 hind-0.047
oops-0.047
ulent-0.045
Token."
Token ID526
Feature activation+1.10e-03

Pos logit contributions
Wow+0.015
 sounded+0.013
L+0.013
Okay+0.012
 matter+0.012
Neg logit contributions
 sandwiches-0.043
oured-0.042
 hind-0.042
ulent-0.041
umbs-0.041
Token
Token ID198
Feature activation+1.54e-03

Pos logit contributions
Wow+0.001
 sounded+0.001
L+0.001
 matter+0.001
Okay+0.001
Neg logit contributions
 sandwiches-0.002
oured-0.002
 hind-0.002
umbs-0.002
ulent-0.002
Token
Token ID198
Feature activation+4.78e-03

Pos logit contributions
Wow+0.002
 sounded+0.002
 matter+0.002
Okay+0.002
cer+0.002
Neg logit contributions
 sandwiches-0.002
 hind-0.002
umbs-0.002
 fishing-0.002
ulent-0.002
TokenAnna
Token ID31160
Feature activation-1.96e-03

Pos logit contributions
 sounded+0.005
 matter+0.005
ators+0.005
en+0.005
er+0.005
Neg logit contributions
gary-0.008
 sandwiches-0.008
 fishing-0.007
ulent-0.007
 hind-0.007
Token the
Token ID262
Feature activation+7.86e-03

Pos logit contributions
Wow+0.001
 sounded+0.001
L+0.001
Okay+0.001
 matter+0.001
Neg logit contributions
 sandwiches-0.001
oured-0.001
 hind-0.001
ulent-0.001
umbs-0.001
Token ladder
Token ID18002
Feature activation+4.83e-03

Pos logit contributions
Wow+0.014
L+0.013
 sounded+0.013
 matter+0.013
OK+0.013
Neg logit contributions
oured-0.010
ulent-0.010
 sandwiches-0.009
gary-0.009
umbs-0.009
Token and
Token ID290
Feature activation+1.86e-02

Pos logit contributions
Wow+0.004
 sounded+0.004
L+0.004
Okay+0.004
 matter+0.004
Neg logit contributions
 sandwiches-0.010
oured-0.010
 hind-0.009
ulent-0.009
umbs-0.009
Token see
Token ID766
Feature activation+1.80e-02

Pos logit contributions
Wow+0.031
 sounded+0.030
L+0.029
 matter+0.029
Okay+0.029
Neg logit contributions
 sandwiches-0.023
umbs-0.023
ulent-0.023
oured-0.022
 hind-0.022
Token what
Token ID644
Feature activation+1.42e-02

Pos logit contributions
Wow+0.028
Okay+0.026
L+0.026
 sounded+0.026
 matter+0.025
Neg logit contributions
 sandwiches-0.025
oured-0.024
 hind-0.024
 fishing-0.023
ulent-0.023
Token is
Token ID318
Feature activation+4.95e-02

Pos logit contributions
Wow+0.015
L+0.014
 sounded+0.014
Okay+0.013
 matter+0.013
Neg logit contributions
 sandwiches-0.026
oured-0.025
 hind-0.024
umbs-0.024
 tum-0.024
Token on
Token ID319
Feature activation+1.03e-02

Pos logit contributions
Wow+0.089
L+0.086
 sounded+0.085
Okay+0.085
cer+0.082
Neg logit contributions
 sandwiches-0.054
 fishing-0.051
 hind-0.050
oured-0.049
oops-0.049
Token the
Token ID262
Feature activation+1.11e-02

Pos logit contributions
Wow+0.017
Okay+0.017
L+0.017
 sounded+0.017
Really+0.016
Neg logit contributions
 sandwiches-0.012
 fishing-0.011
 toast-0.011
 several-0.011
 hind-0.011
Token roof
Token ID9753
Feature activation+1.59e-02

Pos logit contributions
Wow+0.015
 sounded+0.014
L+0.014
Okay+0.013
 matter+0.013
Neg logit contributions
 sandwiches-0.018
oured-0.018
 hind-0.017
ulent-0.017
umbs-0.017
Token.
Token ID13
Feature activation+7.59e-03

Pos logit contributions
Wow+0.015
Okay+0.013
L+0.013
Really+0.012
Fine+0.012
Neg logit contributions
 sandwiches-0.032
umbs-0.031
oured-0.030
oops-0.030
 tum-0.030
Token Sam
Token ID3409
Feature activation+1.00e-02

Pos logit contributions
Wow+0.009
 sounded+0.009
L+0.009
 matter+0.008
Okay+0.008
Neg logit contributions
 sandwiches-0.013
oured-0.012
 hind-0.012
umbs-0.012
ulent-0.012
TokenHis
Token ID6653
Feature activation+3.00e-03

Pos logit contributions
Wow+0.005
 sounded+0.005
 matter+0.005
cer+0.004
 narrative+0.004
Neg logit contributions
 hind-0.006
 fishing-0.006
ulent-0.006
umbs-0.006
gary-0.006
Token friends
Token ID2460
Feature activation-9.94e-04

Pos logit contributions
Wow+0.003
 sounded+0.003
L+0.003
 matter+0.003
Okay+0.003
Neg logit contributions
oured-0.005
umbs-0.005
 sandwiches-0.005
ulent-0.005
 hind-0.005
Token were
Token ID547
Feature activation+1.75e-02

Pos logit contributions
 sandwiches+0.002
 hind+0.002
oured+0.002
umbs+0.002
ulent+0.002
Neg logit contributions
Wow-0.001
 sounded-0.001
L-0.001
Okay-0.001
 matter-0.001
Token scared
Token ID12008
Feature activation-2.65e-02

Pos logit contributions
 sounded+0.025
Wow+0.023
L+0.023
 matter+0.021
Why+0.021
Neg logit contributions
oured-0.027
umbs-0.026
 sandwiches-0.026
gary-0.026
ulent-0.025
Token and
Token ID290
Feature activation+2.15e-02

Pos logit contributions
 sandwiches+0.038
oured+0.038
 hind+0.038
ulent+0.037
umbs+0.037
Neg logit contributions
Wow-0.039
 sounded-0.038
L-0.037
 matter-0.035
Okay-0.035
Token became
Token ID2627
Feature activation+4.94e-02

Pos logit contributions
Wow+0.028
L+0.025
Okay+0.025
cer+0.025
ate+0.024
Neg logit contributions
 sandwiches-0.034
 hind-0.033
oured-0.033
 fishing-0.033
 tum-0.032
Token angry
Token ID7954
Feature activation+2.39e-03

Pos logit contributions
Wow+0.089
L+0.083
Okay+0.083
 sounded+0.082
OK+0.080
Neg logit contributions
 sandwiches-0.054
 fishing-0.053
 hind-0.053
oured-0.050
ulent-0.050
Token.
Token ID13
Feature activation+1.43e-02

Pos logit contributions
Wow+0.003
Okay+0.002
L+0.002
 matter+0.002
OK+0.002
Neg logit contributions
 sandwiches-0.005
 fishing-0.004
 tum-0.004
oured-0.004
ulent-0.004
Token One
Token ID1881
Feature activation-1.35e-02

Pos logit contributions
Wow+0.015
 sounded+0.014
L+0.014
Okay+0.014
 matter+0.013
Neg logit contributions
 sandwiches-0.026
oured-0.026
 hind-0.026
ulent-0.025
umbs-0.025
Token of
Token ID286
Feature activation-1.40e-03

Pos logit contributions
oured+0.022
 hind+0.022
ulent+0.022
umbs+0.022
 sandwiches+0.022
Neg logit contributions
 sounded-0.018
Wow-0.018
L-0.018
 matter-0.017
Okay-0.016
Token them
Token ID606
Feature activation+3.19e-03

Pos logit contributions
 sandwiches+0.002
 fishing+0.002
 hind+0.002
oured+0.002
ulent+0.002
Neg logit contributions
Okay-0.002
Wow-0.002
OK-0.002
Really-0.002
 sounded-0.002
Token a
Token ID257
Feature activation+1.15e-02

Pos logit contributions
 hind+0.023
oured+0.023
ulent+0.022
umbs+0.022
 sandwiches+0.022
Neg logit contributions
 sounded-0.024
Wow-0.023
L-0.023
 matter-0.021
er-0.021
Token loud
Token ID7812
Feature activation-2.07e-03

Pos logit contributions
Wow+0.017
Okay+0.016
L+0.015
cer+0.015
 sounded+0.015
Neg logit contributions
 hind-0.016
 sandwiches-0.016
ulent-0.015
 tum-0.015
 fishing-0.015
Token noise
Token ID7838
Feature activation-2.37e-04

Pos logit contributions
 sandwiches+0.003
 hind+0.003
ulent+0.003
oured+0.003
 fishing+0.003
Neg logit contributions
Wow-0.003
Okay-0.002
L-0.002
cer-0.002
OK-0.002
Token and
Token ID290
Feature activation+1.57e-02

Pos logit contributions
 sandwiches+0.000
oured+0.000
 hind+0.000
ulent+0.000
umbs+0.000
Neg logit contributions
Wow-0.000
L-0.000
 sounded-0.000
Okay-0.000
 matter-0.000
Token he
Token ID339
Feature activation+1.14e-02

Pos logit contributions
Wow+0.023
Okay+0.022
L+0.022
 narrative+0.022
 matter+0.022
Neg logit contributions
 sandwiches-0.021
oured-0.021
 hind-0.020
 fishing-0.020
 tum-0.020
Token became
Token ID2627
Feature activation+4.92e-02

Pos logit contributions
Wow+0.017
Okay+0.016
 narrative+0.016
Deal+0.016
OK+0.016
Neg logit contributions
oured-0.015
 tum-0.015
 hind-0.015
 sandwiches-0.014
ulent-0.014
Token very
Token ID845
Feature activation+1.45e-02

Pos logit contributions
Wow+0.093
L+0.088
Okay+0.088
 matter+0.086
 sounded+0.085
Neg logit contributions
 fishing-0.050
 sandwiches-0.050
 hind-0.049
oured-0.046
ulent-0.046
Token scared
Token ID12008
Feature activation-1.12e-02

Pos logit contributions
Wow+0.028
 sounded+0.026
L+0.026
Okay+0.025
 matter+0.025
Neg logit contributions
 fishing-0.015
 sandwiches-0.015
 hind-0.015
oured-0.014
 tum-0.014
Token.
Token ID13
Feature activation+4.97e-03

Pos logit contributions
 sandwiches+0.023
 fishing+0.023
 hind+0.022
oured+0.022
umbs+0.022
Neg logit contributions
Wow-0.010
Okay-0.008
L-0.008
OK-0.008
 sounded-0.008
Token His
Token ID2399
Feature activation-2.29e-03

Pos logit contributions
Wow+0.006
 sounded+0.006
L+0.006
Okay+0.005
 matter+0.005
Neg logit contributions
 sandwiches-0.009
oured-0.008
 hind-0.008
ulent-0.008
umbs-0.008
Token mom
Token ID1995
Feature activation+6.63e-03

Pos logit contributions
umbs+0.004
oured+0.004
ulent+0.004
gary+0.004
 sandwiches+0.004
Neg logit contributions
 sounded-0.002
Wow-0.002
L-0.002
 matter-0.002
Okay-0.002
Token should
Token ID815
Feature activation-4.05e-03

Pos logit contributions
Wow+0.000
 sounded+0.000
L+0.000
 matter+0.000
Okay+0.000
Neg logit contributions
 hind-0.000
oured-0.000
 sandwiches-0.000
ulent-0.000
gary-0.000
Token be
Token ID307
Feature activation+1.76e-02

Pos logit contributions
 sandwiches+0.004
 toast+0.004
 tum+0.004
 hind+0.004
 fishing+0.004
Neg logit contributions
Wow-0.008
 sounded-0.007
Really-0.007
cer-0.007
inct-0.007
Token compassionate
Token ID32533
Feature activation+1.47e-02

Pos logit contributions
Wow+0.029
rey+0.027
ator+0.027
 sounded+0.026
ators+0.026
Neg logit contributions
 sandwiches-0.022
 hind-0.021
 fishing-0.020
oops-0.020
 tum-0.020
Token.
Token ID13
Feature activation+6.80e-03

Pos logit contributions
Wow+0.017
 sounded+0.015
 matter+0.014
L+0.014
Really+0.014
Neg logit contributions
 sandwiches-0.027
 tum-0.026
umbs-0.026
 to-0.025
ulent-0.025
Token Comp
Token ID3082
Feature activation+6.05e-03

Pos logit contributions
 matter+0.010
 sounded+0.009
rey+0.009
ator+0.009
ators+0.009
Neg logit contributions
oops-0.010
ulent-0.010
imes-0.009
 Many-0.009
umbs-0.009
Tokenassion
Token ID11857
Feature activation+4.91e-02

Pos logit contributions
Wow+0.010
 sounded+0.009
 matter+0.009
Really+0.009
L+0.008
Neg logit contributions
oured-0.008
shaped-0.007
ives-0.007
colored-0.007
oops-0.007
Tokenate
Token ID378
Feature activation+9.39e-03

Pos logit contributions
 matter+0.053
Okay+0.048
Wow+0.047
 avail+0.047
Really+0.045
Neg logit contributions
 sandwiches-0.080
umbs-0.078
ives-0.076
 to-0.072
 fishing-0.071
Token means
Token ID1724
Feature activation+1.60e-04

Pos logit contributions
Wow+0.009
L+0.008
 sounded+0.008
Okay+0.008
 matter+0.008
Neg logit contributions
 sandwiches-0.018
oured-0.017
umbs-0.017
 hind-0.017
ulent-0.017
Token you
Token ID345
Feature activation+2.45e-03

Pos logit contributions
Wow+0.000
 sounded+0.000
L+0.000
Okay+0.000
cer+0.000
Neg logit contributions
 sandwiches-0.000
oured-0.000
 hind-0.000
ulent-0.000
 fishing-0.000
Token care
Token ID1337
Feature activation-3.55e-03

Pos logit contributions
Wow+0.003
 sounded+0.003
L+0.003
Okay+0.003
 matter+0.003
Neg logit contributions
 sandwiches-0.004
oured-0.004
 hind-0.004
ulent-0.004
umbs-0.004
Token about
Token ID546
Feature activation+9.90e-03

Pos logit contributions
 sandwiches+0.004
 hind+0.004
oured+0.003
umbs+0.003
ulent+0.003
Neg logit contributions
Wow-0.007
Okay-0.006
L-0.006
 sounded-0.006
Fine-0.006
Tokenhe
Token ID258
Feature activation+5.27e-03

Pos logit contributions
ulent+0.000
oured+0.000
 sandwiches+0.000
 hind+0.000
gary+0.000
Neg logit contributions
Wow-0.000
 sounded-0.000
L-0.000
Okay-0.000
er-0.000
Tokenlf
Token ID1652
Feature activation+6.68e-03

Pos logit contributions
Wow+0.005
Okay+0.005
L+0.005
 sounded+0.005
 matter+0.004
Neg logit contributions
 sandwiches-0.010
 hind-0.010
umbs-0.010
oured-0.010
 fishing-0.010
Token and
Token ID290
Feature activation+2.88e-03

Pos logit contributions
 sounded+0.005
Wow+0.005
L+0.005
er+0.005
cer+0.005
Neg logit contributions
oured-0.014
ulent-0.013
 hind-0.013
gary-0.013
 sandwiches-0.013
Token see
Token ID766
Feature activation+4.53e-03

Pos logit contributions
Wow+0.004
 sounded+0.004
L+0.004
 matter+0.004
OK+0.004
Neg logit contributions
oured-0.004
umbs-0.004
ulent-0.004
gary-0.004
 sandwiches-0.004
Token what
Token ID644
Feature activation+3.56e-03

Pos logit contributions
Wow+0.006
 sounded+0.006
L+0.006
Okay+0.006
 matter+0.005
Neg logit contributions
 sandwiches-0.007
oured-0.007
 hind-0.007
ulent-0.007
umbs-0.007
Token is
Token ID318
Feature activation+4.87e-02

Pos logit contributions
Wow+0.004
 sounded+0.004
L+0.004
Okay+0.004
 matter+0.004
Neg logit contributions
oured-0.006
 hind-0.006
ulent-0.006
 sandwiches-0.006
umbs-0.006
Token on
Token ID319
Feature activation+7.45e-03

Pos logit contributions
Wow+0.082
 sounded+0.078
L+0.078
Okay+0.077
 matter+0.074
Neg logit contributions
 sandwiches-0.062
 hind-0.058
oured-0.057
 fishing-0.057
ulent-0.057
Token the
Token ID262
Feature activation+8.71e-03

Pos logit contributions
Wow+0.010
 sounded+0.010
L+0.010
Okay+0.010
 matter+0.009
Neg logit contributions
 sandwiches-0.011
oured-0.011
 hind-0.011
umbs-0.011
ulent-0.011
Token top
Token ID1353
Feature activation+4.35e-03

Pos logit contributions
Wow+0.015
 sounded+0.014
L+0.014
Okay+0.014
 matter+0.013
Neg logit contributions
 sandwiches-0.010
 hind-0.010
oured-0.010
ulent-0.010
umbs-0.010
Token."
Token ID526
Feature activation+7.83e-03

Pos logit contributions
Wow+0.004
L+0.004
 sounded+0.004
Okay+0.004
 matter+0.003
Neg logit contributions
 sandwiches-0.009
umbs-0.008
oured-0.008
 hind-0.008
ulent-0.008
Token\n
Token ID198
Feature activation+2.66e-03

Pos logit contributions
Wow+0.009
 sounded+0.009
L+0.009
Okay+0.008
 matter+0.008
Neg logit contributions
 sandwiches-0.013
oured-0.013
 hind-0.013
ulent-0.013
umbs-0.013
Token heard
Token ID2982
Feature activation-2.14e-02

Pos logit contributions
 hind+0.014
umbs+0.014
 sandwiches+0.014
ulent+0.014
oured+0.013
Neg logit contributions
 sounded-0.009
L-0.009
Wow-0.008
Okay-0.007
 matter-0.007
Token a
Token ID257
Feature activation+5.30e-03

Pos logit contributions
 hind+0.037
ulent+0.037
oured+0.036
umbs+0.036
oops+0.035
Neg logit contributions
 sounded-0.031
L-0.028
Wow-0.027
er-0.026
 matter-0.025
Token loud
Token ID7812
Feature activation-1.54e-03

Pos logit contributions
Wow+0.008
L+0.007
 sounded+0.007
Okay+0.007
cer+0.007
Neg logit contributions
 sandwiches-0.008
 hind-0.007
oured-0.007
 tum-0.007
ulent-0.007
Token noise
Token ID7838
Feature activation-2.13e-03

Pos logit contributions
 sandwiches+0.002
 hind+0.002
 fishing+0.002
oured+0.002
 tum+0.002
Neg logit contributions
Wow-0.002
Okay-0.002
L-0.002
cer-0.002
OK-0.002
Token and
Token ID290
Feature activation+1.89e-02

Pos logit contributions
 sandwiches+0.003
 hind+0.003
oured+0.003
ulent+0.003
umbs+0.003
Neg logit contributions
 sounded-0.003
Wow-0.003
L-0.003
 matter-0.003
Okay-0.003
Token became
Token ID2627
Feature activation+4.86e-02

Pos logit contributions
Wow+0.026
L+0.024
Okay+0.024
 sounded+0.024
 matter+0.023
Neg logit contributions
 sandwiches-0.028
oured-0.027
 hind-0.027
 tum-0.026
ulent-0.026
Token very
Token ID845
Feature activation+9.57e-03

Pos logit contributions
Wow+0.096
Okay+0.091
L+0.090
 matter+0.089
 sounded+0.088
Neg logit contributions
 fishing-0.044
 sandwiches-0.044
 hind-0.043
 tum-0.040
oured-0.039
Token worried
Token ID7960
Feature activation+5.39e-03

Pos logit contributions
Wow+0.019
 sounded+0.017
L+0.017
Okay+0.017
 matter+0.017
Neg logit contributions
 fishing-0.009
 hind-0.009
 sandwiches-0.009
 tum-0.009
oured-0.008
Token.
Token ID13
Feature activation+1.77e-03

Pos logit contributions
Wow+0.005
Okay+0.004
L+0.004
OK+0.004
cer+0.004
Neg logit contributions
 sandwiches-0.011
 tum-0.011
 hind-0.011
umbs-0.010
oured-0.010
Token She
Token ID1375
Feature activation+2.16e-02

Pos logit contributions
Wow+0.003
 sounded+0.002
L+0.002
Okay+0.002
 matter+0.002
Neg logit contributions
 sandwiches-0.003
oured-0.002
 hind-0.002
ulent-0.002
umbs-0.002
Token didn
Token ID1422
Feature activation-1.20e-02

Pos logit contributions
Wow+0.031
cer+0.029
 narrative+0.029
ator+0.029
Okay+0.028
Neg logit contributions
oured-0.029
 hind-0.029
 tum-0.029
 sandwiches-0.027
 fishing-0.026
Token a
Token ID257
Feature activation+1.03e-02

Pos logit contributions
oured+0.016
 hind+0.016
ulent+0.016
umbs+0.016
 sandwiches+0.016
Neg logit contributions
 sounded-0.017
Wow-0.016
L-0.016
 matter-0.015
er-0.015
Token loud
Token ID7812
Feature activation-1.43e-03

Pos logit contributions
Wow+0.015
L+0.014
 sounded+0.013
Okay+0.013
cer+0.013
Neg logit contributions
 sandwiches-0.015
 hind-0.015
oured-0.015
ulent-0.014
 fishing-0.014
Token noise
Token ID7838
Feature activation-8.39e-04

Pos logit contributions
 sandwiches+0.002
 hind+0.002
oured+0.002
ulent+0.002
 fishing+0.002
Neg logit contributions
Wow-0.002
L-0.002
Okay-0.002
 sounded-0.002
OK-0.002
Token and
Token ID290
Feature activation+1.53e-02

Pos logit contributions
 sandwiches+0.001
oured+0.001
 hind+0.001
ulent+0.001
umbs+0.001
Neg logit contributions
Wow-0.001
L-0.001
Okay-0.001
 sounded-0.001
 matter-0.001
Token he
Token ID339
Feature activation+9.98e-03

Pos logit contributions
Wow+0.023
Okay+0.022
L+0.021
 narrative+0.021
 matter+0.021
Neg logit contributions
oured-0.020
 sandwiches-0.020
 hind-0.020
 fishing-0.020
 tum-0.020
Token became
Token ID2627
Feature activation+4.85e-02

Pos logit contributions
Wow+0.015
Okay+0.014
 narrative+0.014
Deal+0.014
 matter+0.014
Neg logit contributions
oured-0.013
 tum-0.013
 hind-0.013
 sandwiches-0.013
ulent-0.012
Token frightened
Token ID24776
Feature activation-1.17e-02

Pos logit contributions
Wow+0.092
L+0.087
Okay+0.086
 matter+0.084
 sounded+0.083
Neg logit contributions
 fishing-0.050
 sandwiches-0.049
 hind-0.049
oured-0.046
ulent-0.046
Token.
Token ID13
Feature activation+7.94e-03

Pos logit contributions
 sandwiches+0.025
oured+0.024
 hind+0.024
 fishing+0.024
umbs+0.024
Neg logit contributions
Wow-0.009
L-0.008
Okay-0.008
 sounded-0.008
OK-0.007
Token He
Token ID679
Feature activation+2.27e-02

Pos logit contributions
Wow+0.009
 sounded+0.009
L+0.009
Okay+0.008
 matter+0.008
Neg logit contributions
 sandwiches-0.014
oured-0.014
 hind-0.013
ulent-0.013
umbs-0.013
Token called
Token ID1444
Feature activation-7.15e-03

Pos logit contributions
Wow+0.033
 narrative+0.031
cer+0.031
Okay+0.031
L+0.031
Neg logit contributions
 hind-0.032
oured-0.031
 tum-0.030
 sandwiches-0.029
 fishing-0.029
Token out
Token ID503
Feature activation-1.53e-02

Pos logit contributions
oured+0.010
umbs+0.009
gary+0.009
ulent+0.009
 sandwiches+0.009
Neg logit contributions
 sounded-0.011
Wow-0.011
L-0.011
 matter-0.010
er-0.010
TokenLook
Token ID8567
Feature activation-5.31e-03

Pos logit contributions
er+0.007
 sounded+0.007
 matter+0.007
 narrative+0.007
ator+0.007
Neg logit contributions
umbs-0.016
gary-0.016
 sandwiches-0.016
ulent-0.016
oured-0.015
Token at
Token ID379
Feature activation+1.50e-03

Pos logit contributions
umbs+0.008
 sandwiches+0.008
ulent+0.008
 hind+0.008
oured+0.008
Neg logit contributions
Wow-0.007
 sounded-0.007
L-0.007
 matter-0.006
Okay-0.006
Token this
Token ID428
Feature activation+1.39e-02

Pos logit contributions
Wow+0.002
 sounded+0.002
L+0.001
 matter+0.001
ate+0.001
Neg logit contributions
ulent-0.003
umbs-0.003
gary-0.003
oured-0.003
 hind-0.003
Token,
Token ID11
Feature activation+1.23e-02

Pos logit contributions
Wow+0.018
 sounded+0.017
L+0.017
 matter+0.016
Okay+0.016
Neg logit contributions
oured-0.022
ulent-0.022
 hind-0.022
 sandwiches-0.021
umbs-0.021
Token Bob
Token ID5811
Feature activation+2.65e-03

Pos logit contributions
Wow+0.017
L+0.017
 sounded+0.017
Okay+0.016
 matter+0.016
Neg logit contributions
 sandwiches-0.018
 hind-0.017
oured-0.017
umbs-0.017
 tum-0.017
Token,"
Token ID553
Feature activation+4.84e-02

Pos logit contributions
Wow+0.003
 sounded+0.003
L+0.003
Okay+0.003
 matter+0.003
Neg logit contributions
 sandwiches-0.005
oured-0.004
 hind-0.004
ulent-0.004
umbs-0.004
Token said
Token ID531
Feature activation+1.48e-02

Pos logit contributions
Wow+0.066
L+0.065
cer+0.063
Okay+0.063
 narrative+0.062
Neg logit contributions
 sandwiches-0.073
 tum-0.069
 hind-0.066
 fishing-0.066
 toast-0.065
Token his
Token ID465
Feature activation-4.72e-03

Pos logit contributions
Wow+0.022
L+0.021
 sounded+0.020
Okay+0.020
 matter+0.020
Neg logit contributions
 sandwiches-0.021
oured-0.021
 hind-0.020
 tum-0.020
umbs-0.020
Token mom
Token ID1995
Feature activation+1.49e-02

Pos logit contributions
ulent+0.008
oured+0.008
umbs+0.008
 sandwiches+0.008
gary+0.008
Neg logit contributions
Wow-0.005
 sounded-0.005
L-0.005
Okay-0.005
 matter-0.005
Tokenmy
Token ID1820
Feature activation+1.60e-02

Pos logit contributions
Wow+0.020
 narrative+0.019
Okay+0.018
Hey+0.018
Fine+0.018
Neg logit contributions
oured-0.025
 sandwiches-0.023
 tum-0.023
ulent-0.022
umbs-0.022
Token.
Token ID13
Feature activation+3.46e-02

Pos logit contributions
Wow+0.017
Okay+0.014
 matter+0.014
L+0.014
 narrative+0.014
Neg logit contributions
oured-0.032
 sandwiches-0.030
 fishing-0.030
ulent-0.030
 tum-0.030
Token an
Token ID281
Feature activation-2.09e-02

Pos logit contributions
Wow+0.003
 sounded+0.003
L+0.003
Okay+0.003
 matter+0.003
Neg logit contributions
 sandwiches-0.002
oured-0.002
 hind-0.002
ulent-0.002
umbs-0.002
Token old
Token ID1468
Feature activation+2.50e-02

Pos logit contributions
oured+0.026
 sandwiches+0.026
umbs+0.026
ulent+0.025
 hind+0.025
Neg logit contributions
Wow-0.034
 sounded-0.033
L-0.033
 matter-0.032
Okay-0.031
Token man
Token ID582
Feature activation+7.06e-03

Pos logit contributions
Wow+0.031
Okay+0.027
 sounded+0.027
L+0.027
 matter+0.027
Neg logit contributions
 sandwiches-0.044
 fishing-0.042
 tum-0.042
 hind-0.040
oured-0.040
Token.
Token ID13
Feature activation-4.21e-03

Pos logit contributions
Wow+0.005
 sounded+0.005
L+0.005
Okay+0.004
 matter+0.004
Neg logit contributions
 sandwiches-0.015
 hind-0.015
oured-0.015
umbs-0.015
ulent-0.015
Token He
Token ID679
Feature activation-2.34e-03

Pos logit contributions
 sandwiches+0.007
oured+0.007
 hind+0.006
ulent+0.006
umbs+0.006
Neg logit contributions
Wow-0.006
 sounded-0.005
L-0.005
Okay-0.005
 matter-0.005
Token tre
Token ID2054
Feature activation+4.83e-02

Pos logit contributions
 sandwiches+0.004
oured+0.004
 hind+0.004
ulent+0.004
 fishing+0.004
Neg logit contributions
Wow-0.003
L-0.002
Okay-0.002
 matter-0.002
 sounded-0.002
Tokenasured
Token ID34006
Feature activation+6.20e-03

Pos logit contributions
Wow+0.090
L+0.085
Really+0.082
 sounded+0.081
Okay+0.081
Neg logit contributions
 hind-0.051
oops-0.051
 seen-0.047
 sandwiches-0.045
oured-0.043
Token it
Token ID340
Feature activation-1.56e-03

Pos logit contributions
Wow+0.009
L+0.008
 sounded+0.008
Okay+0.008
 matter+0.008
Neg logit contributions
 sandwiches-0.010
 hind-0.009
oured-0.009
 tum-0.009
 fishing-0.009
Token,
Token ID11
Feature activation-1.67e-02

Pos logit contributions
umbs+0.003
oured+0.003
gary+0.003
ulent+0.003
 sandwiches+0.003
Neg logit contributions
 sounded-0.002
L-0.002
Wow-0.002
 matter-0.002
er-0.002
Token for
Token ID329
Feature activation-2.94e-03

Pos logit contributions
umbs+0.030
oured+0.030
ulent+0.029
gary+0.028
 hind+0.028
Neg logit contributions
 sounded-0.021
Wow-0.020
 matter-0.020
L-0.019
Okay-0.017
Token it
Token ID340
Feature activation+6.55e-03

Pos logit contributions
 sandwiches+0.004
oured+0.004
 hind+0.004
ulent+0.004
umbs+0.004
Neg logit contributions
Wow-0.004
 sounded-0.004
L-0.004
Okay-0.004
 matter-0.004
Token,
Token ID11
Feature activation+2.13e-02

Pos logit contributions
Wow+0.014
 sounded+0.013
L+0.013
Okay+0.013
 matter+0.013
Neg logit contributions
 sandwiches-0.013
oured-0.013
 hind-0.013
ulent-0.013
umbs-0.012
Token Bob
Token ID5811
Feature activation-9.54e-03

Pos logit contributions
Wow+0.035
L+0.034
 sounded+0.033
Okay+0.033
 matter+0.032
Neg logit contributions
 sandwiches-0.027
oured-0.025
 hind-0.025
 fishing-0.024
 tum-0.024
Token jumped
Token ID11687
Feature activation+5.77e-03

Pos logit contributions
 hind+0.016
 sandwiches+0.016
gary+0.016
umbs+0.015
oured+0.015
Neg logit contributions
 sounded-0.012
L-0.011
Wow-0.011
cer-0.010
er-0.010
Token forward
Token ID2651
Feature activation-1.32e-02

Pos logit contributions
Wow+0.009
 sounded+0.008
L+0.008
Okay+0.008
 matter+0.008
Neg logit contributions
 sandwiches-0.008
oured-0.008
 hind-0.008
ulent-0.008
umbs-0.008
Token and
Token ID290
Feature activation+9.53e-03

Pos logit contributions
gary+0.025
umbs+0.024
 sandwiches+0.024
oured+0.024
ulent+0.023
Neg logit contributions
er-0.013
 sounded-0.013
.-0.012
L-0.012
Wow-0.012
Token snatched
Token ID43276
Feature activation+4.83e-02

Pos logit contributions
Wow+0.014
 sounded+0.014
L+0.014
Okay+0.013
 matter+0.013
Neg logit contributions
 sandwiches-0.013
oured-0.013
 hind-0.013
ulent-0.013
umbs-0.012
Token it
Token ID340
Feature activation-1.16e-03

Pos logit contributions
Wow+0.072
 sounded+0.072
Okay+0.071
L+0.071
 matter+0.067
Neg logit contributions
 sandwiches-0.067
oured-0.063
 hind-0.062
oops-0.061
umbs-0.060
Token back
Token ID736
Feature activation+1.71e-02

Pos logit contributions
oured+0.002
ulent+0.002
 hind+0.002
gary+0.002
 sandwiches+0.002
Neg logit contributions
L-0.001
Wow-0.001
 sounded-0.001
er-0.001
cer-0.001
Token.
Token ID13
Feature activation+6.79e-03

Pos logit contributions
Wow+0.021
 sounded+0.020
L+0.020
Okay+0.019
 matter+0.019
Neg logit contributions
 sandwiches-0.029
oured-0.028
 hind-0.028
ulent-0.027
umbs-0.027
Token Susan
Token ID14919
Feature activation+1.18e-03

Pos logit contributions
Wow+0.010
 sounded+0.009
L+0.009
Okay+0.009
 matter+0.009
Neg logit contributions
 sandwiches-0.010
oured-0.010
 hind-0.010
ulent-0.010
umbs-0.010
Token was
Token ID373
Feature activation+5.55e-03

Pos logit contributions
 sounded+0.001
L+0.001
Wow+0.001
er+0.001
Okay+0.001
Neg logit contributions
 hind-0.003
 sandwiches-0.003
umbs-0.003
gary-0.003
ulent-0.003
Token catch
Token ID4929
Feature activation+4.71e-03

Pos logit contributions
Wow+0.041
Okay+0.039
 sounded+0.039
L+0.039
 narrative+0.037
Neg logit contributions
 sandwiches-0.025
 hind-0.024
 fishing-0.024
ulent-0.024
oured-0.023
Token them
Token ID606
Feature activation+1.11e-02

Pos logit contributions
Wow+0.006
Okay+0.006
 sounded+0.006
L+0.006
 matter+0.005
Neg logit contributions
 sandwiches-0.008
 fishing-0.007
umbs-0.007
ulent-0.007
 hind-0.007
Token.
Token ID13
Feature activation+1.08e-02

Pos logit contributions
Wow+0.007
 sounded+0.006
L+0.006
Okay+0.006
 matter+0.006
Neg logit contributions
 sandwiches-0.025
oured-0.025
 hind-0.025
ulent-0.024
umbs-0.024
Token The
Token ID383
Feature activation-4.02e-03

Pos logit contributions
Wow+0.014
 sounded+0.013
L+0.013
Okay+0.013
 matter+0.012
Neg logit contributions
 sandwiches-0.017
oured-0.017
 hind-0.017
ulent-0.017
umbs-0.017
Token lion
Token ID18744
Feature activation-4.24e-03

Pos logit contributions
oured+0.006
 sandwiches+0.006
 hind+0.006
umbs+0.006
ulent+0.006
Neg logit contributions
Wow-0.005
 sounded-0.005
L-0.005
 matter-0.005
Okay-0.005
Token became
Token ID2627
Feature activation+4.78e-02

Pos logit contributions
 sandwiches+0.007
oured+0.007
 hind+0.007
 tum+0.007
ulent+0.007
Neg logit contributions
Wow-0.005
L-0.004
Okay-0.004
 matter-0.004
 sounded-0.004
Token nervous
Token ID10927
Feature activation-1.48e-02

Pos logit contributions
Wow+0.084
L+0.080
Okay+0.080
 sounded+0.079
 matter+0.077
Neg logit contributions
 sandwiches-0.055
 hind-0.054
 fishing-0.053
ulent-0.051
oured-0.051
Token and
Token ID290
Feature activation+2.34e-02

Pos logit contributions
 sandwiches+0.026
oured+0.025
 hind+0.025
ulent+0.025
umbs+0.025
Neg logit contributions
Wow-0.017
 sounded-0.017
L-0.016
Okay-0.015
 matter-0.015
Token said
Token ID531
Feature activation-4.07e-04

Pos logit contributions
Wow+0.031
L+0.028
Okay+0.027
cer+0.027
 narrative+0.027
Neg logit contributions
 sandwiches-0.037
 hind-0.036
 fishing-0.036
oured-0.035
 tum-0.035
Token,
Token ID11
Feature activation+6.07e-03

Pos logit contributions
oured+0.001
umbs+0.001
 sandwiches+0.001
 hind+0.001
ulent+0.001
Neg logit contributions
Wow-0.001
 sounded-0.001
L-0.001
Okay-0.000
 matter-0.000
Token "
Token ID366
Feature activation+1.37e-02

Pos logit contributions
Wow+0.007
 sounded+0.006
L+0.006
Okay+0.006
 matter+0.006
Neg logit contributions
 sandwiches-0.011
oured-0.011
umbs-0.011
 hind-0.011
gary-0.011
Token.
Token ID13
Feature activation+8.84e-03

Pos logit contributions
L+0.003
 sounded+0.003
Wow+0.003
er+0.003
Okay+0.003
Neg logit contributions
 hind-0.003
ulent-0.003
oured-0.003
 sandwiches-0.003
umbs-0.003
Token Do
Token ID2141
Feature activation+2.12e-02

Pos logit contributions
Wow+0.012
 sounded+0.012
 matter+0.012
L+0.012
Okay+0.012
Neg logit contributions
ulent-0.012
oured-0.012
oops-0.012
 sandwiches-0.012
 hind-0.012
Token you
Token ID345
Feature activation-1.53e-02

Pos logit contributions
Wow+0.033
cer+0.032
Okay+0.031
 sounded+0.030
ator+0.029
Neg logit contributions
 sandwiches-0.029
 hind-0.026
 tum-0.025
oops-0.025
ulent-0.025
Token want
Token ID765
Feature activation-1.34e-02

Pos logit contributions
 sandwiches+0.021
oured+0.021
 hind+0.021
ulent+0.020
umbs+0.020
Neg logit contributions
Wow-0.023
 sounded-0.022
L-0.022
Okay-0.021
 matter-0.021
Token to
Token ID284
Feature activation-1.17e-02

Pos logit contributions
oured+0.011
umbs+0.011
 hind+0.011
ulent+0.011
 sandwiches+0.010
Neg logit contributions
L-0.028
Wow-0.028
 sounded-0.028
er-0.026
 matter-0.026
Token replace
Token ID6330
Feature activation+4.78e-02

Pos logit contributions
 sandwiches+0.015
oured+0.014
 hind+0.014
ulent+0.014
umbs+0.014
Neg logit contributions
Wow-0.019
 sounded-0.018
L-0.018
Okay-0.018
 matter-0.017
Token your
Token ID534
Feature activation+5.90e-03

Pos logit contributions
Okay+0.082
 narrative+0.082
Wow+0.079
 sounded+0.077
OK+0.076
Neg logit contributions
 sandwiches-0.063
 hind-0.054
 toast-0.053
 fishing-0.053
oured-0.052
Token car
Token ID1097
Feature activation+2.87e-03

Pos logit contributions
Wow+0.008
 sounded+0.008
L+0.008
Okay+0.008
 matter+0.008
Neg logit contributions
 sandwiches-0.009
oured-0.009
 hind-0.008
ulent-0.008
umbs-0.008
Token with
Token ID351
Feature activation+7.37e-03

Pos logit contributions
Wow+0.003
L+0.003
 sounded+0.003
Okay+0.003
 matter+0.003
Neg logit contributions
 sandwiches-0.005
oured-0.005
 hind-0.005
ulent-0.005
umbs-0.005
Token mine
Token ID6164
Feature activation+4.27e-03

Pos logit contributions
Wow+0.010
 sounded+0.010
L+0.010
Okay+0.009
 matter+0.009
Neg logit contributions
 sandwiches-0.011
oured-0.011
 hind-0.011
umbs-0.011
ulent-0.011
Token?"
Token ID1701
Feature activation+1.56e-02

Pos logit contributions
Wow+0.005
 sounded+0.004
L+0.004
Okay+0.004
 matter+0.004
Neg logit contributions
 sandwiches-0.008
oured-0.008
 hind-0.008
umbs-0.008
ulent-0.008
Tokened
Token ID276
Feature activation+1.03e-02

Pos logit contributions
Wow+0.001
 sounded+0.001
L+0.001
Okay+0.001
 matter+0.001
Neg logit contributions
 sandwiches-0.001
oured-0.001
umbs-0.001
 hind-0.001
ulent-0.001
Token her
Token ID607
Feature activation+1.04e-02

Pos logit contributions
Wow+0.014
L+0.013
 sounded+0.013
Okay+0.013
 matter+0.013
Neg logit contributions
 sandwiches-0.015
oured-0.015
 hind-0.015
ulent-0.015
 tum-0.014
Token yellow
Token ID7872
Feature activation+6.65e-03

Pos logit contributions
Wow+0.011
 sounded+0.011
 matter+0.011
L+0.011
cer+0.010
Neg logit contributions
umbs-0.018
gary-0.018
oured-0.018
 sandwiches-0.018
ulent-0.018
Token shell
Token ID7582
Feature activation-1.13e-03

Pos logit contributions
Wow+0.010
 sounded+0.010
L+0.010
Okay+0.009
 matter+0.009
Neg logit contributions
 sandwiches-0.009
oured-0.009
 hind-0.009
ulent-0.008
 tum-0.008
Token it
Token ID340
Feature activation+8.13e-04

Pos logit contributions
 sandwiches+0.003
oured+0.002
 hind+0.002
ulent+0.002
umbs+0.002
Neg logit contributions
Wow-0.001
 sounded-0.001
L-0.001
Okay-0.001
 matter-0.001
Token became
Token ID2627
Feature activation+4.75e-02

Pos logit contributions
Wow+0.001
L+0.001
 sounded+0.001
Okay+0.001
 matter+0.001
Neg logit contributions
 sandwiches-0.002
oured-0.002
 hind-0.002
ulent-0.002
umbs-0.002
Token very
Token ID845
Feature activation+9.74e-03

Pos logit contributions
Wow+0.073
Fine+0.067
Okay+0.066
Really+0.064
Te+0.063
Neg logit contributions
 sandwiches-0.069
 fishing-0.067
 hind-0.065
 to-0.063
 tum-0.063
Token friendly
Token ID8030
Feature activation+2.88e-03

Pos logit contributions
Wow+0.019
ators+0.018
ator+0.017
 sounded+0.017
cer+0.017
Neg logit contributions
 tum-0.010
 sandwiches-0.010
 hind-0.010
 fishing-0.010
 long-0.009
Token.
Token ID13
Feature activation-5.68e-03

Pos logit contributions
Wow+0.002
L+0.002
 sounded+0.002
Okay+0.002
 matter+0.002
Neg logit contributions
 sandwiches-0.006
oured-0.006
 hind-0.006
umbs-0.006
ulent-0.006
Token From
Token ID3574
Feature activation+1.63e-02

Pos logit contributions
 sandwiches+0.008
oured+0.008
 hind+0.008
ulent+0.007
umbs+0.007
Neg logit contributions
Wow-0.008
 sounded-0.008
 matter-0.008
L-0.008
Okay-0.008
Token then
Token ID788
Feature activation+4.03e-04

Pos logit contributions
Wow+0.029
 sounded+0.028
L+0.028
Okay+0.027
 matter+0.027
Neg logit contributions
 sandwiches-0.018
oured-0.018
 hind-0.017
umbs-0.017
ulent-0.017
Token tree
Token ID5509
Feature activation-3.04e-03

Pos logit contributions
oured+0.002
umbs+0.002
 hind+0.002
ulent+0.002
 sandwiches+0.002
Neg logit contributions
Wow-0.003
L-0.003
 matter-0.002
Okay-0.002
 sounded-0.002
Token and
Token ID290
Feature activation+2.60e-03

Pos logit contributions
gary+0.006
 hind+0.006
umbs+0.006
 sandwiches+0.006
ulent+0.005
Neg logit contributions
er-0.003
 sounded-0.003
L-0.003
.-0.003
cer-0.002
Token picked
Token ID6497
Feature activation+7.10e-03

Pos logit contributions
 sounded+0.003
Wow+0.003
L+0.003
Okay+0.003
cer+0.003
Neg logit contributions
umbs-0.005
 hind-0.004
ulent-0.004
 sandwiches-0.004
gary-0.004
Token the
Token ID262
Feature activation-2.01e-02

Pos logit contributions
Wow+0.008
L+0.008
 sounded+0.008
er+0.007
 matter+0.007
Neg logit contributions
 hind-0.013
umbs-0.012
ulent-0.012
oured-0.012
gary-0.012
Token biggest
Token ID4094
Feature activation-1.48e-02

Pos logit contributions
 hind+0.022
oured+0.021
umbs+0.020
 sandwiches+0.020
gary+0.019
Neg logit contributions
Wow-0.038
L-0.037
 sounded-0.035
 matter-0.034
Okay-0.033
Token,
Token ID11
Feature activation-6.44e-02

Pos logit contributions
 hind+0.021
oured+0.020
umbs+0.019
 sandwiches+0.019
ulent+0.019
Neg logit contributions
Wow-0.023
L-0.022
 matter-0.022
 sounded-0.021
er-0.021
Token rip
Token ID18344
Feature activation+1.03e-02

Pos logit contributions
oured+0.091
 hind+0.088
umbs+0.086
 sandwiches+0.080
sing+0.079
Neg logit contributions
L-0.104
Wow-0.103
 matter-0.098
 sounded-0.094
You-0.089
Tokenest
Token ID395
Feature activation-1.49e-02

Pos logit contributions
Wow+0.012
Okay+0.011
 sounded+0.011
L+0.011
OK+0.010
Neg logit contributions
 sandwiches-0.018
oured-0.018
umbs-0.018
ulent-0.017
 hind-0.017
Token olive
Token ID19450
Feature activation-3.11e-03

Pos logit contributions
 hind+0.018
oured+0.017
umbs+0.017
 sandwiches+0.017
ulent+0.016
Neg logit contributions
Wow-0.026
L-0.025
 matter-0.025
 sounded-0.024
er-0.024
Token she
Token ID673
Feature activation-1.01e-02

Pos logit contributions
 hind+0.006
oured+0.006
 sandwiches+0.006
ulent+0.006
gary+0.005
Neg logit contributions
Wow-0.003
 sounded-0.003
L-0.003
er-0.003
 matter-0.003
Token could
Token ID714
Feature activation-6.90e-03

Pos logit contributions
 sandwiches+0.018
oured+0.017
 hind+0.017
umbs+0.017
ulent+0.017
Neg logit contributions
Wow-0.012
 sounded-0.011
L-0.011
Okay-0.011
 matter-0.010
Token decided
Token ID3066
Feature activation+1.12e-03

Pos logit contributions
 hind+0.043
 sandwiches+0.041
umbs+0.041
ulent+0.041
gary+0.041
Neg logit contributions
 sounded-0.019
L-0.018
Wow-0.017
Okay-0.016
er-0.016
Token to
Token ID284
Feature activation-5.21e-03

Pos logit contributions
L+0.002
Wow+0.002
 sounded+0.002
Okay+0.002
OK+0.002
Neg logit contributions
umbs-0.001
ulent-0.001
 hind-0.001
oured-0.001
gary-0.001
Token buy
Token ID2822
Feature activation-1.52e-02

Pos logit contributions
 sandwiches+0.006
oured+0.006
 hind+0.005
 tum+0.005
 fishing+0.005
Neg logit contributions
Wow-0.009
 sounded-0.009
L-0.009
Okay-0.009
cer-0.008
Token the
Token ID262
Feature activation-1.61e-02

Pos logit contributions
 hind+0.021
oured+0.020
umbs+0.020
sing+0.019
ulent+0.019
Neg logit contributions
L-0.024
Wow-0.023
 sounded-0.023
er-0.022
 matter-0.021
Token biggest
Token ID4094
Feature activation-8.07e-03

Pos logit contributions
oured+0.019
 hind+0.019
umbs+0.018
ulent+0.017
gary+0.017
Neg logit contributions
Wow-0.027
L-0.027
 sounded-0.026
 matter-0.026
er-0.025
Token,
Token ID11
Feature activation-5.86e-02

Pos logit contributions
 hind+0.010
oured+0.010
umbs+0.010
gary+0.009
ulent+0.009
Neg logit contributions
 matter-0.013
 sounded-0.013
Wow-0.013
L-0.013
er-0.012
Token highest
Token ID4511
Feature activation-4.25e-03

Pos logit contributions
oured+0.087
 hind+0.083
sed+0.077
sing+0.076
umbs+0.072
Neg logit contributions
L-0.091
Wow-0.090
 sounded-0.089
 matter-0.088
You-0.079
Token cel
Token ID18725
Feature activation+1.80e-03

Pos logit contributions
oured+0.006
 sandwiches+0.005
 hind+0.005
umbs+0.005
ulent+0.005
Neg logit contributions
Wow-0.007
 sounded-0.006
L-0.006
 matter-0.006
Okay-0.006
Tokenery
Token ID1924
Feature activation+1.53e-02

Pos logit contributions
Wow+0.002
 sounded+0.002
L+0.002
Okay+0.002
 matter+0.002
Neg logit contributions
 sandwiches-0.003
umbs-0.003
oured-0.003
ulent-0.003
 hind-0.003
Token she
Token ID673
Feature activation-1.05e-02

Pos logit contributions
Wow+0.018
 sounded+0.018
L+0.018
er+0.017
cer+0.017
Neg logit contributions
oured-0.025
 hind-0.025
gary-0.024
ulent-0.024
umbs-0.024
Token could
Token ID714
Feature activation-2.25e-03

Pos logit contributions
 sandwiches+0.020
 hind+0.020
oured+0.020
umbs+0.020
ulent+0.020
Neg logit contributions
Wow-0.010
 sounded-0.010
L-0.010
Okay-0.009
 matter-0.009
Token,
Token ID11
Feature activation+6.01e-03

Pos logit contributions
oured+0.010
umbs+0.010
 hind+0.010
ulent+0.009
 sandwiches+0.009
Neg logit contributions
Wow-0.006
 sounded-0.006
L-0.006
 matter-0.006
Okay-0.005
Token she
Token ID673
Feature activation-2.25e-03

Pos logit contributions
Wow+0.008
 sounded+0.007
L+0.007
 matter+0.007
Okay+0.006
Neg logit contributions
oured-0.011
ulent-0.011
umbs-0.011
 hind-0.010
gary-0.010
Token found
Token ID1043
Feature activation+1.12e-02

Pos logit contributions
 sandwiches+0.004
oured+0.004
 hind+0.004
ulent+0.004
umbs+0.004
Neg logit contributions
Wow-0.003
 sounded-0.002
L-0.002
Okay-0.002
 matter-0.002
Token a
Token ID257
Feature activation-1.78e-02

Pos logit contributions
L+0.014
Wow+0.013
 sounded+0.013
er+0.011
You+0.011
Neg logit contributions
 hind-0.021
umbs-0.020
oured-0.020
ulent-0.020
 contain-0.019
Token small
Token ID1402
Feature activation-2.39e-02

Pos logit contributions
umbs+0.025
oured+0.024
ulent+0.023
oops+0.023
 hind+0.022
Neg logit contributions
Wow-0.028
 sounded-0.028
L-0.027
 matter-0.027
Okay-0.025
Token,
Token ID11
Feature activation-5.83e-02

Pos logit contributions
umbs+0.037
oured+0.036
 hind+0.035
ulent+0.035
gary+0.034
Neg logit contributions
Wow-0.033
 sounded-0.033
 matter-0.032
L-0.032
cer-0.030
Token rough
Token ID5210
Feature activation-2.25e-02

Pos logit contributions
umbs+0.088
oured+0.087
ulent+0.081
 hind+0.081
 sandwiches+0.080
Neg logit contributions
Wow-0.090
 sounded-0.089
 matter-0.087
L-0.086
Okay-0.077
Token purse
Token ID27517
Feature activation-4.81e-03

Pos logit contributions
oured+0.031
 sandwiches+0.031
umbs+0.031
 hind+0.031
ulent+0.030
Neg logit contributions
Wow-0.034
 sounded-0.033
L-0.032
 matter-0.031
Okay-0.031
Token.
Token ID13
Feature activation-1.26e-03

Pos logit contributions
 sandwiches+0.008
 hind+0.008
oured+0.008
ulent+0.007
umbs+0.007
Neg logit contributions
Wow-0.006
 sounded-0.006
L-0.006
Okay-0.006
 matter-0.006
Token The
Token ID383
Feature activation+5.39e-03

Pos logit contributions
oured+0.002
 sandwiches+0.002
 hind+0.002
ulent+0.002
 fishing+0.002
Neg logit contributions
Wow-0.002
 sounded-0.002
L-0.002
Okay-0.001
OK-0.001
Token purse
Token ID27517
Feature activation-9.95e-03

Pos logit contributions
Wow+0.010
 sounded+0.010
L+0.010
Okay+0.010
 matter+0.009
Neg logit contributions
 sandwiches-0.005
oured-0.005
 hind-0.005
ulent-0.005
umbs-0.005
Token,
Token ID11
Feature activation+9.17e-03

Pos logit contributions
oured+0.009
 hind+0.009
 sandwiches+0.009
umbs+0.009
ulent+0.009
Neg logit contributions
Wow-0.008
L-0.008
 sounded-0.007
 matter-0.007
Okay-0.007
Token she
Token ID673
Feature activation+6.11e-03

Pos logit contributions
Wow+0.012
L+0.012
 sounded+0.012
Okay+0.011
 matter+0.011
Neg logit contributions
 sandwiches-0.015
oured-0.015
 hind-0.014
ulent-0.014
umbs-0.014
Token drew
Token ID9859
Feature activation-1.05e-02

Pos logit contributions
Wow+0.008
 sounded+0.007
L+0.007
Okay+0.007
 matter+0.007
Neg logit contributions
 sandwiches-0.010
oured-0.010
 hind-0.009
ulent-0.009
umbs-0.009
Token a
Token ID257
Feature activation-9.81e-03

Pos logit contributions
 hind+0.018
oured+0.018
ulent+0.017
umbs+0.017
gary+0.016
Neg logit contributions
L-0.015
 sounded-0.015
Wow-0.014
er-0.013
ate-0.012
Token big
Token ID1263
Feature activation-1.79e-02

Pos logit contributions
oured+0.011
 sandwiches+0.010
umbs+0.010
ulent+0.010
 hind+0.010
Neg logit contributions
Wow-0.018
L-0.017
 sounded-0.017
 matter-0.017
Okay-0.017
Token,
Token ID11
Feature activation-5.78e-02

Pos logit contributions
oured+0.024
umbs+0.024
ulent+0.023
 hind+0.023
 sandwiches+0.023
Neg logit contributions
Wow-0.028
 sounded-0.027
L-0.027
 matter-0.026
er-0.025
Token pretty
Token ID2495
Feature activation+6.86e-03

Pos logit contributions
oured+0.079
umbs+0.077
 hind+0.075
 sandwiches+0.074
ulent+0.073
Neg logit contributions
Wow-0.095
L-0.092
 sounded-0.091
 matter-0.087
Okay-0.082
Token bird
Token ID6512
Feature activation+3.16e-04

Pos logit contributions
Wow+0.011
 sounded+0.011
Okay+0.011
L+0.011
Fine+0.011
Neg logit contributions
 sandwiches-0.009
oured-0.008
 hind-0.008
 tum-0.008
ulent-0.008
Token.
Token ID13
Feature activation-1.12e-03

Pos logit contributions
Wow+0.000
L+0.000
 sounded+0.000
 matter+0.000
Okay+0.000
Neg logit contributions
 sandwiches-0.001
oured-0.001
 hind-0.001
umbs-0.001
ulent-0.001
Token She
Token ID1375
Feature activation-7.10e-04

Pos logit contributions
 sandwiches+0.001
oured+0.001
umbs+0.001
 hind+0.001
 tum+0.001
Neg logit contributions
Wow-0.002
 sounded-0.002
 matter-0.002
Okay-0.002
L-0.002
Token showed
Token ID3751
Feature activation+5.17e-03

Pos logit contributions
 sandwiches+0.001
oured+0.001
 hind+0.001
ulent+0.001
 tum+0.001
Neg logit contributions
Wow-0.001
L-0.001
Okay-0.001
 sounded-0.001
 matter-0.001
Token up
Token ID510
Feature activation-2.42e-02

Pos logit contributions
umbs+0.002
ulent+0.002
gary+0.002
ives+0.002
earned+0.002
Neg logit contributions
Wow-0.001
 sounded-0.001
 matter-0.001
 just-0.001
 me-0.001
Token and
Token ID290
Feature activation+3.89e-03

Pos logit contributions
ulent+0.048
umbs+0.048
gary+0.047
oured+0.045
 sandwiches+0.045
Neg logit contributions
Wow-0.022
L-0.022
 sounded-0.021
.-0.020
cer-0.019
Token saw
Token ID2497
Feature activation+6.55e-03

Pos logit contributions
Wow+0.006
 sounded+0.006
L+0.006
Okay+0.006
 matter+0.006
Neg logit contributions
 sandwiches-0.005
oured-0.005
 hind-0.005
ulent-0.005
umbs-0.005
Token the
Token ID262
Feature activation-9.86e-03

Pos logit contributions
Wow+0.010
 sounded+0.009
L+0.008
 matter+0.008
You+0.008
Neg logit contributions
oured-0.011
umbs-0.010
ulent-0.010
gary-0.010
 hind-0.010
Token big
Token ID1263
Feature activation-1.63e-02

Pos logit contributions
oured+0.016
 sandwiches+0.016
 hind+0.016
ulent+0.016
umbs+0.015
Neg logit contributions
Wow-0.013
 sounded-0.012
L-0.012
 matter-0.012
Okay-0.011
Token,
Token ID11
Feature activation-5.76e-02

Pos logit contributions
oured+0.026
 sandwiches+0.025
umbs+0.025
 hind+0.025
ulent+0.025
Neg logit contributions
Wow-0.022
 sounded-0.021
 matter-0.020
L-0.020
er-0.019
Token blue
Token ID4171
Feature activation-7.52e-03

Pos logit contributions
oured+0.087
umbs+0.083
ulent+0.080
 hind+0.079
 sandwiches+0.079
Neg logit contributions
Wow-0.091
 sounded-0.085
L-0.083
 matter-0.082
OK-0.075
Token sky
Token ID6766
Feature activation+3.31e-03

Pos logit contributions
 sandwiches+0.011
oured+0.011
 hind+0.011
ulent+0.011
umbs+0.011
Neg logit contributions
Wow-0.010
 sounded-0.010
L-0.010
Okay-0.009
 matter-0.009
Token.
Token ID13
Feature activation+2.35e-03

Pos logit contributions
Wow+0.004
L+0.003
Okay+0.003
 sounded+0.003
 matter+0.003
Neg logit contributions
 sandwiches-0.006
oured-0.006
 hind-0.006
umbs-0.006
ulent-0.006
Token Blue
Token ID4518
Feature activation-7.72e-04

Pos logit contributions
Wow+0.003
 sounded+0.003
L+0.003
Okay+0.002
 matter+0.002
Neg logit contributions
 sandwiches-0.004
oured-0.004
 hind-0.004
ulent-0.004
umbs-0.004
Token was
Token ID373
Feature activation-3.13e-03

Pos logit contributions
 sandwiches+0.001
oured+0.001
 hind+0.001
umbs+0.001
ulent+0.001
Neg logit contributions
Wow-0.001
 sounded-0.001
L-0.001
Okay-0.001
 matter-0.001
Token.
Token ID13
Feature activation-2.65e-03

Pos logit contributions
 sounded+0.009
L+0.009
Wow+0.009
 matter+0.009
er+0.008
Neg logit contributions
 hind-0.019
oured-0.019
umbs-0.018
 sandwiches-0.018
ulent-0.018
Token He
Token ID679
Feature activation-4.46e-03

Pos logit contributions
 sandwiches+0.005
oured+0.005
ulent+0.005
 hind+0.005
umbs+0.005
Neg logit contributions
Wow-0.003
 sounded-0.003
L-0.003
Okay-0.003
 matter-0.002
Token grabbed
Token ID13176
Feature activation-7.71e-03

Pos logit contributions
 sandwiches+0.008
oured+0.008
 hind+0.008
umbs+0.008
ulent+0.008
Neg logit contributions
Wow-0.005
 sounded-0.005
L-0.005
Okay-0.004
 matter-0.004
Token some
Token ID617
Feature activation-2.22e-02

Pos logit contributions
ulent+0.014
umbs+0.014
oured+0.014
 hind+0.014
gary+0.013
Neg logit contributions
Wow-0.009
 sounded-0.009
 matter-0.008
L-0.008
Hey-0.007
Token big
Token ID1263
Feature activation-2.29e-02

Pos logit contributions
oured+0.040
ulent+0.039
umbs+0.037
 hind+0.035
cious+0.035
Neg logit contributions
 sounded-0.025
 matter-0.025
Wow-0.025
L-0.024
er-0.024
Token,
Token ID11
Feature activation-5.73e-02

Pos logit contributions
oured+0.037
umbs+0.036
gary+0.035
ulent+0.035
 hind+0.034
Neg logit contributions
 sounded-0.030
Wow-0.029
 matter-0.028
er-0.027
L-0.027
Token p
Token ID279
Feature activation-1.43e-02

Pos logit contributions
oured+0.093
umbs+0.091
ulent+0.086
ives+0.083
 hind+0.082
Neg logit contributions
 sounded-0.081
Wow-0.079
L-0.079
 matter-0.073
OK-0.068
Tokenuffy
Token ID15352
Feature activation-1.30e-02

Pos logit contributions
ulent+0.018
 hind+0.018
 sandwiches+0.018
umbs+0.017
gary+0.016
Neg logit contributions
L-0.022
Wow-0.021
 sounded-0.021
er-0.021
Okay-0.021
Token snow
Token ID6729
Feature activation+4.31e-03

Pos logit contributions
 hind+0.025
gary+0.025
umbs+0.024
oured+0.024
ulent+0.023
Neg logit contributions
 sounded-0.014
er-0.013
L-0.012
cer-0.012
Wow-0.011
Tokenballs
Token ID21591
Feature activation-8.46e-03

Pos logit contributions
Wow+0.005
 sounded+0.005
L+0.005
Okay+0.004
cer+0.004
Neg logit contributions
oured-0.007
 sandwiches-0.007
 hind-0.007
ulent-0.007
gary-0.007
Token,
Token ID11
Feature activation-2.85e-02

Pos logit contributions
 hind+0.017
gary+0.016
ulent+0.015
earned+0.015
umbs+0.015
Neg logit contributions
 sounded-0.009
er-0.008
Wow-0.008
L-0.008
.-0.007
Token.
Token ID13
Feature activation-5.95e-04

Pos logit contributions
Wow+0.007
 sounded+0.006
L+0.005
Okay+0.005
 matter+0.005
Neg logit contributions
 sandwiches-0.020
oured-0.019
 hind-0.019
umbs-0.019
ulent-0.019
Token He
Token ID679
Feature activation+2.47e-03

Pos logit contributions
 sandwiches+0.001
oured+0.001
 hind+0.001
ulent+0.001
umbs+0.001
Neg logit contributions
Wow-0.001
 sounded-0.001
L-0.001
Okay-0.001
 matter-0.001
Token saw
Token ID2497
Feature activation+1.36e-04

Pos logit contributions
Wow+0.003
 sounded+0.003
L+0.003
Okay+0.002
OK+0.002
Neg logit contributions
oured-0.005
 sandwiches-0.005
umbs-0.005
ulent-0.005
 hind-0.005
Token a
Token ID257
Feature activation-1.28e-02

Pos logit contributions
 sounded+0.000
Wow+0.000
L+0.000
Hey+0.000
Why+0.000
Neg logit contributions
ulent-0.000
oured-0.000
umbs-0.000
 hind-0.000
gary-0.000
Token big
Token ID1263
Feature activation-2.27e-02

Pos logit contributions
 sandwiches+0.012
 hind+0.012
oured+0.011
 fishing+0.011
ulent+0.011
Neg logit contributions
Wow-0.025
 sounded-0.024
L-0.024
Okay-0.023
 matter-0.023
Token,
Token ID11
Feature activation-5.69e-02

Pos logit contributions
oured+0.037
umbs+0.036
ulent+0.036
 hind+0.035
gary+0.035
Neg logit contributions
Wow-0.030
 sounded-0.029
L-0.028
 matter-0.027
cer-0.026
Token hairy
Token ID43873
Feature activation-1.23e-02

Pos logit contributions
oured+0.086
umbs+0.085
ulent+0.081
 hind+0.078
 sandwiches+0.076
Neg logit contributions
Wow-0.089
 sounded-0.086
L-0.084
 matter-0.082
Okay-0.076
Token spider
Token ID19230
Feature activation+4.65e-03

Pos logit contributions
 sandwiches+0.018
 hind+0.017
oured+0.017
 tum+0.017
 fishing+0.017
Neg logit contributions
Wow-0.018
 sounded-0.017
L-0.016
Okay-0.016
 matter-0.016
Token on
Token ID319
Feature activation-1.76e-02

Pos logit contributions
Wow+0.008
Okay+0.007
 matter+0.007
L+0.007
 sounded+0.007
Neg logit contributions
 sandwiches-0.006
 fishing-0.006
ulent-0.006
 hind-0.006
 tum-0.006
Token the
Token ID262
Feature activation-9.56e-03

Pos logit contributions
 sandwiches+0.025
oured+0.025
 hind+0.024
ulent+0.024
umbs+0.024
Neg logit contributions
Wow-0.026
 sounded-0.025
L-0.025
 matter-0.024
Okay-0.024
Token ceiling
Token ID13387
Feature activation+4.70e-03

Pos logit contributions
oured+0.020
gary+0.019
ulent+0.018
umbs+0.018
oops+0.018
Neg logit contributions
L-0.009
 sounded-0.008
er-0.008
Wow-0.008
 matter-0.007
Token He
Token ID679
Feature activation-1.93e-03

Pos logit contributions
L+0.018
Wow+0.018
cer+0.018
Okay+0.018
 narrative+0.017
Neg logit contributions
 fishing-0.017
 hind-0.017
 tum-0.017
 sandwiches-0.017
oured-0.017
Token put
Token ID1234
Feature activation-7.69e-03

Pos logit contributions
oured+0.004
umbs+0.004
 sandwiches+0.004
ulent+0.004
 hind+0.004
Neg logit contributions
 sounded-0.002
L-0.002
Wow-0.001
Okay-0.001
 matter-0.001
Token on
Token ID319
Feature activation-2.83e-02

Pos logit contributions
oured+0.011
ulent+0.011
 hind+0.011
umbs+0.011
gary+0.010
Neg logit contributions
Wow-0.012
 sounded-0.011
L-0.011
 matter-0.010
OK-0.010
Token his
Token ID465
Feature activation-4.99e-03

Pos logit contributions
umbs+0.063
oured+0.063
ulent+0.062
gary+0.060
ives+0.060
Neg logit contributions
 sounded-0.021
L-0.019
Wow-0.019
 matter-0.016
ate-0.016
Token big
Token ID1263
Feature activation-6.82e-03

Pos logit contributions
umbs+0.009
oured+0.009
ives+0.008
ulent+0.008
gary+0.008
Neg logit contributions
 sounded-0.006
L-0.006
OK-0.005
 matter-0.005
Wow-0.005
Token,
Token ID11
Feature activation-5.64e-02

Pos logit contributions
umbs+0.012
ives+0.012
oured+0.012
gary+0.012
ulent+0.011
Neg logit contributions
 sounded-0.008
 matter-0.008
L-0.007
Wow-0.007
er-0.007
Token wide
Token ID3094
Feature activation-9.69e-03

Pos logit contributions
oured+0.093
umbs+0.091
ives+0.085
ulent+0.085
gary+0.083
Neg logit contributions
 sounded-0.079
Wow-0.079
L-0.078
 matter-0.070
OK-0.067
Token gloves
Token ID18051
Feature activation-7.75e-03

Pos logit contributions
umbs+0.017
ives+0.017
gary+0.016
oured+0.016
ulent+0.016
Neg logit contributions
 sounded-0.012
L-0.010
er-0.010
 matter-0.010
Wow-0.010
Token and
Token ID290
Feature activation-1.63e-03

Pos logit contributions
gary+0.016
ulent+0.016
 hind+0.015
 sandwiches+0.015
umbs+0.015
Neg logit contributions
er-0.008
 sounded-0.008
L-0.007
Wow-0.007
ter-0.007
Token grabbed
Token ID13176
Feature activation+9.42e-03

Pos logit contributions
umbs+0.003
oured+0.003
ulent+0.003
gary+0.003
 hind+0.003
Neg logit contributions
 sounded-0.002
Wow-0.002
L-0.002
 matter-0.002
Okay-0.002
Token a
Token ID257
Feature activation+3.38e-03

Pos logit contributions
Wow+0.010
 sounded+0.010
L+0.009
 matter+0.009
er+0.008
Neg logit contributions
umbs-0.017
ulent-0.017
 hind-0.017
oured-0.017
gary-0.017
Token He
Token ID679
Feature activation-7.22e-03

Pos logit contributions
Wow+0.011
cer+0.011
 sounded+0.011
L+0.011
Okay+0.011
Neg logit contributions
 sandwiches-0.010
 tum-0.010
 fishing-0.010
oured-0.010
umbs-0.010
Token had
Token ID550
Feature activation-6.46e-03

Pos logit contributions
 sandwiches+0.014
oured+0.014
 hind+0.014
umbs+0.014
ulent+0.013
Neg logit contributions
Wow-0.007
 sounded-0.007
L-0.007
Okay-0.006
 matter-0.006
Token the
Token ID262
Feature activation-9.72e-03

Pos logit contributions
umbs+0.010
oured+0.010
 hind+0.010
ulent+0.009
gary+0.009
Neg logit contributions
 sounded-0.009
Wow-0.009
L-0.009
 matter-0.008
ate-0.008
Token brightest
Token ID33871
Feature activation-9.36e-03

Pos logit contributions
oured+0.014
umbs+0.014
 sandwiches+0.014
 hind+0.013
ulent+0.013
Neg logit contributions
Wow-0.015
 sounded-0.014
L-0.014
 matter-0.014
Okay-0.013
Token yellow
Token ID7872
Feature activation-1.67e-02

Pos logit contributions
 hind+0.015
oured+0.015
ulent+0.015
umbs+0.015
 sandwiches+0.015
Neg logit contributions
Wow-0.012
 sounded-0.012
L-0.011
 matter-0.011
er-0.011
Token,
Token ID11
Feature activation-5.57e-02

Pos logit contributions
 sandwiches+0.025
 hind+0.024
oured+0.024
ulent+0.024
umbs+0.024
Neg logit contributions
Wow-0.024
 sounded-0.023
L-0.022
 matter-0.021
Okay-0.021
Token the
Token ID262
Feature activation-1.46e-02

Pos logit contributions
oured+0.083
umbs+0.081
 hind+0.080
ulent+0.076
 sandwiches+0.076
Neg logit contributions
Wow-0.084
 sounded-0.084
L-0.080
 matter-0.080
er-0.071
Token darkest
Token ID35995
Feature activation-9.64e-03

Pos logit contributions
 sandwiches+0.021
oured+0.021
 hind+0.020
umbs+0.020
ulent+0.020
Neg logit contributions
Wow-0.022
 sounded-0.021
L-0.020
 matter-0.020
Okay-0.020
Token black
Token ID2042
Feature activation-1.25e-02

Pos logit contributions
 sandwiches+0.014
oured+0.014
 hind+0.014
ulent+0.014
umbs+0.013
Neg logit contributions
Wow-0.014
 sounded-0.013
L-0.013
Okay-0.013
 matter-0.013
Token and
Token ID290
Feature activation-1.35e-02

Pos logit contributions
 sandwiches+0.013
oured+0.012
umbs+0.012
ulent+0.012
 tum+0.012
Neg logit contributions
Wow-0.023
 sounded-0.022
L-0.022
Okay-0.022
 matter-0.021
Token the
Token ID262
Feature activation-5.42e-03

Pos logit contributions
 sandwiches+0.014
oured+0.013
 hind+0.013
ulent+0.013
 fishing+0.012
Neg logit contributions
Wow-0.025
L-0.024
 sounded-0.024
Okay-0.024
Fine-0.023
Token,
Token ID11
Feature activation+6.25e-03

Pos logit contributions
oured+0.009
 sandwiches+0.009
 hind+0.009
umbs+0.009
ulent+0.009
Neg logit contributions
Wow-0.008
L-0.008
 sounded-0.008
 matter-0.008
Okay-0.008
Token Lily
Token ID20037
Feature activation-4.16e-03

Pos logit contributions
Wow+0.009
L+0.008
 sounded+0.008
Okay+0.008
 matter+0.008
Neg logit contributions
oured-0.010
umbs-0.010
 sandwiches-0.010
 hind-0.010
ulent-0.010
Token found
Token ID1043
Feature activation+1.20e-02

Pos logit contributions
 sandwiches+0.007
 hind+0.007
oured+0.007
ulent+0.007
umbs+0.007
Neg logit contributions
Wow-0.005
 sounded-0.004
L-0.004
Okay-0.004
 matter-0.004
Token a
Token ID257
Feature activation-1.62e-02

Pos logit contributions
L+0.015
Wow+0.015
 sounded+0.014
er+0.012
You+0.012
Neg logit contributions
 hind-0.022
umbs-0.022
oured-0.021
ulent-0.021
 contain-0.021
Token big
Token ID1263
Feature activation-2.90e-02

Pos logit contributions
umbs+0.023
oured+0.022
ulent+0.022
oops+0.022
 hind+0.021
Neg logit contributions
Wow-0.025
 sounded-0.025
 matter-0.025
L-0.025
Okay-0.022
Token,
Token ID11
Feature activation-5.57e-02

Pos logit contributions
umbs+0.050
oured+0.048
ulent+0.046
 hind+0.045
gary+0.045
Neg logit contributions
 sounded-0.037
 matter-0.037
Wow-0.037
L-0.035
er-0.033
Token blank
Token ID9178
Feature activation-5.70e-03

Pos logit contributions
umbs+0.084
oured+0.083
ulent+0.078
 sandwiches+0.077
 hind+0.077
Neg logit contributions
Wow-0.086
 sounded-0.084
 matter-0.083
L-0.082
Okay-0.074
Token paper
Token ID3348
Feature activation+1.23e-02

Pos logit contributions
 sandwiches+0.007
 tum+0.007
oured+0.006
 hind+0.006
 toast+0.006
Neg logit contributions
Wow-0.010
Okay-0.009
Fine-0.009
OK-0.009
L-0.009
Token.
Token ID13
Feature activation-1.10e-03

Pos logit contributions
Wow+0.017
 sounded+0.016
L+0.016
Okay+0.015
 matter+0.015
Neg logit contributions
 sandwiches-0.019
oured-0.019
 hind-0.018
ulent-0.018
umbs-0.018
Token She
Token ID1375
Feature activation+6.34e-04

Pos logit contributions
 sandwiches+0.002
oured+0.002
 hind+0.002
umbs+0.002
ulent+0.002
Neg logit contributions
Wow-0.002
 sounded-0.002
L-0.001
Okay-0.001
 matter-0.001
Token thought
Token ID1807
Feature activation-3.38e-02

Pos logit contributions
Wow+0.001
Okay+0.001
L+0.001
 sounded+0.001
cer+0.001
Neg logit contributions
 sandwiches-0.001
oured-0.001
 hind-0.001
 tum-0.001
ulent-0.001
Token.
Token ID13
Feature activation-1.22e-02

Pos logit contributions
Wow+0.004
Okay+0.004
L+0.004
OK+0.003
Fine+0.003
Neg logit contributions
 sandwiches-0.010
oured-0.010
 fishing-0.010
umbs-0.010
 tum-0.009
Token It
Token ID632
Feature activation-3.59e-03

Pos logit contributions
 sandwiches+0.019
oured+0.019
 hind+0.019
ulent+0.018
umbs+0.018
Neg logit contributions
Wow-0.017
 sounded-0.016
L-0.015
Okay-0.015
 matter-0.015
Token included
Token ID3017
Feature activation-1.08e-02

Pos logit contributions
 sandwiches+0.005
 tum+0.005
 fishing+0.005
colored+0.005
 to+0.005
Neg logit contributions
Okay-0.005
Fine-0.004
Deal-0.004
OK-0.004
Wow-0.004
Token a
Token ID257
Feature activation-9.87e-03

Pos logit contributions
oured+0.018
 hind+0.017
umbs+0.017
ulent+0.016
ipop+0.016
Neg logit contributions
Wow-0.015
 sounded-0.015
L-0.013
er-0.013
ate-0.012
Token big
Token ID1263
Feature activation-1.27e-02

Pos logit contributions
oured+0.012
 sandwiches+0.011
umbs+0.011
 hind+0.011
ulent+0.011
Neg logit contributions
Wow-0.017
 sounded-0.016
L-0.016
 matter-0.016
Okay-0.016
Token,
Token ID11
Feature activation-5.53e-02

Pos logit contributions
umbs+0.021
oured+0.020
ulent+0.020
gary+0.019
 hind+0.019
Neg logit contributions
Wow-0.017
 sounded-0.016
 matter-0.016
L-0.015
cer-0.015
Token white
Token ID2330
Feature activation-7.67e-03

Pos logit contributions
umbs+0.080
oured+0.080
 hind+0.075
ulent+0.074
oops+0.073
Neg logit contributions
Wow-0.086
 sounded-0.083
L-0.081
 matter-0.079
Okay-0.073
Token dove
Token ID38862
Feature activation+1.58e-04

Pos logit contributions
oured+0.011
 sandwiches+0.011
 hind+0.011
umbs+0.011
ulent+0.011
Neg logit contributions
Wow-0.011
 sounded-0.010
L-0.010
Okay-0.010
 matter-0.010
Token.
Token ID13
Feature activation-7.71e-03

Pos logit contributions
 sounded+0.000
L+0.000
Wow+0.000
cer+0.000
er+0.000
Neg logit contributions
 hind-0.000
gary-0.000
oured-0.000
umbs-0.000
ulent-0.000
Token It
Token ID632
Feature activation-2.01e-03

Pos logit contributions
 sandwiches+0.011
oured+0.011
 hind+0.011
ulent+0.011
umbs+0.011
Neg logit contributions
Wow-0.011
 sounded-0.010
L-0.010
Okay-0.010
 matter-0.010
Token was
Token ID373
Feature activation-1.07e-02

Pos logit contributions
 sandwiches+0.004
oured+0.003
 tum+0.003
ulent+0.003
 hind+0.003
Neg logit contributions
Wow-0.002
Okay-0.002
Fine-0.002
 narrative-0.002
 matter-0.002
Token wanted
Token ID2227
Feature activation-2.55e-02

Pos logit contributions
Wow+0.000
 sounded+0.000
L+0.000
Okay+0.000
 matter+0.000
Neg logit contributions
 sandwiches-0.001
oured-0.001
 hind-0.001
 tum-0.001
ulent-0.001
Token to
Token ID284
Feature activation-2.28e-03

Pos logit contributions
oured+0.052
umbs+0.050
cases+0.046
 hind+0.043
sing+0.043
Neg logit contributions
 sounded-0.030
 me-0.028
L-0.028
 animals-0.027
.-0.027
Token explore
Token ID7301
Feature activation-1.83e-02

Pos logit contributions
oured+0.004
 sandwiches+0.003
ulent+0.003
umbs+0.003
gary+0.003
Neg logit contributions
L-0.003
 matter-0.003
 sounded-0.003
Wow-0.003
ate-0.003
Token the
Token ID262
Feature activation-5.48e-03

Pos logit contributions
oured+0.030
umbs+0.030
ulent+0.029
 sandwiches+0.029
 hind+0.028
Neg logit contributions
Wow-0.023
 sounded-0.023
L-0.023
 matter-0.021
er-0.020
Token big
Token ID1263
Feature activation-3.97e-03

Pos logit contributions
oured+0.011
umbs+0.011
ulent+0.011
 sandwiches+0.011
gary+0.011
Neg logit contributions
Wow-0.005
L-0.005
 sounded-0.005
 matter-0.004
Okay-0.004
Token,
Token ID11
Feature activation-5.51e-02

Pos logit contributions
oured+0.006
umbs+0.006
ulent+0.006
 sandwiches+0.006
gary+0.006
Neg logit contributions
Wow-0.005
 sounded-0.005
L-0.005
 matter-0.005
er-0.005
Token tall
Token ID7331
Feature activation-1.70e-03

Pos logit contributions
oured+0.092
umbs+0.088
ulent+0.082
 sandwiches+0.082
ives+0.081
Neg logit contributions
Wow-0.077
 sounded-0.075
 matter-0.074
L-0.073
OK-0.063
Token world
Token ID995
Feature activation-6.74e-03

Pos logit contributions
 sandwiches+0.002
oured+0.002
 hind+0.002
ulent+0.002
umbs+0.002
Neg logit contributions
Wow-0.003
 sounded-0.003
L-0.003
Okay-0.002
 matter-0.002
Token outside
Token ID2354
Feature activation+7.18e-04

Pos logit contributions
oured+0.010
 sandwiches+0.010
 hind+0.010
umbs+0.010
ulent+0.010
Neg logit contributions
Wow-0.009
 sounded-0.009
L-0.009
Okay-0.009
 matter-0.008
Token of
Token ID286
Feature activation+1.14e-02

Pos logit contributions
 sounded+0.001
Wow+0.001
L+0.001
er+0.001
You+0.001
Neg logit contributions
oured-0.002
 hind-0.001
ulent-0.001
 sandwiches-0.001
umbs-0.001
Token his
Token ID465
Feature activation+1.75e-02

Pos logit contributions
Wow+0.017
 sounded+0.016
L+0.016
Okay+0.015
 matter+0.015
Neg logit contributions
 sandwiches-0.016
oured-0.016
 hind-0.016
ulent-0.016
umbs-0.016
Token.
Token ID13
Feature activation+7.94e-03

Pos logit contributions
Wow+0.003
 sounded+0.003
L+0.003
Okay+0.002
 matter+0.002
Neg logit contributions
oured-0.012
 hind-0.012
 sandwiches-0.012
ulent-0.012
umbs-0.012
Token It
Token ID632
Feature activation-1.80e-03

Pos logit contributions
Wow+0.011
 sounded+0.011
L+0.011
Okay+0.010
 matter+0.010
Neg logit contributions
 sandwiches-0.012
oured-0.011
 hind-0.011
umbs-0.011
ulent-0.011
Token was
Token ID373
Feature activation-1.77e-02

Pos logit contributions
 sandwiches+0.004
oured+0.004
umbs+0.004
 hind+0.004
ulent+0.004
Neg logit contributions
Wow-0.001
L-0.001
Okay-0.001
 sounded-0.001
 matter-0.001
Token a
Token ID257
Feature activation-2.16e-02

Pos logit contributions
oured+0.031
ives+0.029
umbs+0.028
earned+0.027
ulent+0.027
Neg logit contributions
 sounded-0.024
Wow-0.024
L-0.023
 matter-0.023
er-0.019
Token big
Token ID1263
Feature activation-1.57e-02

Pos logit contributions
oured+0.028
umbs+0.027
 sandwiches+0.025
ulent+0.025
oops+0.025
Neg logit contributions
Wow-0.037
L-0.036
 sounded-0.035
 matter-0.034
Okay-0.034
Token,
Token ID11
Feature activation-5.50e-02

Pos logit contributions
umbs+0.025
oured+0.025
 hind+0.024
ulent+0.024
gary+0.024
Neg logit contributions
Wow-0.021
 sounded-0.020
 matter-0.020
L-0.019
Okay-0.018
Token high
Token ID1029
Feature activation-6.11e-03

Pos logit contributions
oured+0.085
umbs+0.079
ives+0.075
 hind+0.074
ived+0.072
Neg logit contributions
Wow-0.086
L-0.083
 matter-0.082
 sounded-0.081
Okay-0.072
Token market
Token ID1910
Feature activation+3.43e-03

Pos logit contributions
 sandwiches+0.008
oured+0.008
 hind+0.008
ulent+0.007
 fishing+0.007
Neg logit contributions
Wow-0.010
 sounded-0.009
L-0.009
Okay-0.009
 matter-0.009
Token.
Token ID13
Feature activation+1.19e-02

Pos logit contributions
Wow+0.004
 sounded+0.004
L+0.004
 matter+0.003
Okay+0.003
Neg logit contributions
 sandwiches-0.006
oured-0.006
 hind-0.006
ulent-0.006
umbs-0.006
Token All
Token ID1439
Feature activation-5.61e-03

Pos logit contributions
Wow+0.015
 sounded+0.015
L+0.015
 matter+0.015
cer+0.014
Neg logit contributions
oured-0.018
 sandwiches-0.018
 hind-0.017
umbs-0.017
 tum-0.017
Token the
Token ID262
Feature activation-1.19e-02

Pos logit contributions
ulent+0.010
 hind+0.010
 sandwiches+0.010
oured+0.009
umbs+0.009
Neg logit contributions
Wow-0.007
 sounded-0.007
L-0.007
 matter-0.006
Okay-0.006
Token,
Token ID11
Feature activation+9.31e-03

Pos logit contributions
 hind+0.013
oured+0.013
 sandwiches+0.012
umbs+0.012
ulent+0.012
Neg logit contributions
Wow-0.010
L-0.010
 sounded-0.009
 matter-0.009
Okay-0.009
Token she
Token ID673
Feature activation+6.23e-03

Pos logit contributions
Wow+0.012
 sounded+0.011
L+0.011
Okay+0.011
 matter+0.010
Neg logit contributions
 sandwiches-0.015
oured-0.015
 hind-0.015
ulent-0.015
umbs-0.015
Token found
Token ID1043
Feature activation+1.26e-02

Pos logit contributions
Wow+0.008
 sounded+0.007
L+0.007
Okay+0.007
 matter+0.007
Neg logit contributions
 sandwiches-0.010
oured-0.010
 hind-0.010
umbs-0.010
ulent-0.010
Token a
Token ID257
Feature activation-1.40e-02

Pos logit contributions
L+0.016
Wow+0.015
 sounded+0.015
You+0.013
er+0.013
Neg logit contributions
 hind-0.024
umbs-0.023
ulent-0.023
oured-0.022
 contain-0.022
Token big
Token ID1263
Feature activation-2.82e-02

Pos logit contributions
umbs+0.019
oured+0.018
ulent+0.018
oops+0.018
 sandwiches+0.018
Neg logit contributions
Wow-0.022
 sounded-0.022
L-0.022
 matter-0.022
Okay-0.020
Token,
Token ID11
Feature activation-5.48e-02

Pos logit contributions
umbs+0.048
oured+0.045
ulent+0.045
 hind+0.044
gary+0.043
Neg logit contributions
Wow-0.037
 sounded-0.037
 matter-0.036
L-0.035
er-0.032
Token white
Token ID2330
Feature activation-1.57e-02

Pos logit contributions
umbs+0.079
oured+0.078
ulent+0.073
 hind+0.073
 sandwiches+0.073
Neg logit contributions
Wow-0.088
 sounded-0.085
L-0.084
 matter-0.083
Okay-0.076
Token sheet
Token ID9629
Feature activation+5.08e-04

Pos logit contributions
 sandwiches+0.022
oured+0.021
 hind+0.021
umbs+0.021
ulent+0.021
Neg logit contributions
Wow-0.024
 sounded-0.023
L-0.022
Okay-0.022
 matter-0.021
Token of
Token ID286
Feature activation-3.50e-02

Pos logit contributions
 sounded+0.001
Wow+0.001
L+0.001
er+0.001
Okay+0.001
Neg logit contributions
 hind-0.001
oured-0.001
 sandwiches-0.001
ulent-0.001
gary-0.001
Token paper
Token ID3348
Feature activation+8.28e-03

Pos logit contributions
oured+0.063
umbs+0.061
 hind+0.058
oops+0.058
ulent+0.058
Neg logit contributions
Wow-0.045
 sounded-0.044
L-0.043
 matter-0.041
er-0.035
Token and
Token ID290
Feature activation+2.76e-03

Pos logit contributions
Wow+0.009
 sounded+0.009
L+0.008
Okay+0.008
 matter+0.008
Neg logit contributions
oured-0.015
 hind-0.015
 sandwiches-0.015
umbs-0.014
ulent-0.014
Token It
Token ID632
Feature activation-2.77e-03

Pos logit contributions
 sandwiches+0.010
oured+0.010
umbs+0.010
 hind+0.010
ulent+0.010
Neg logit contributions
Wow-0.010
 sounded-0.009
L-0.009
Okay-0.009
 matter-0.009
Token was
Token ID373
Feature activation-2.01e-02

Pos logit contributions
 sandwiches+0.006
oured+0.006
 tum+0.005
ulent+0.005
umbs+0.005
Neg logit contributions
Wow-0.002
Okay-0.002
L-0.002
Fine-0.002
OK-0.002
Token a
Token ID257
Feature activation-2.00e-02

Pos logit contributions
oured+0.033
umbs+0.032
 hind+0.032
ulent+0.031
oops+0.030
Neg logit contributions
 sounded-0.028
 matter-0.027
L-0.026
Wow-0.026
er-0.023
Token small
Token ID1402
Feature activation-1.07e-02

Pos logit contributions
oured+0.026
 sandwiches+0.026
umbs+0.026
 hind+0.025
ulent+0.025
Neg logit contributions
Wow-0.032
 sounded-0.030
L-0.030
 matter-0.029
Okay-0.029
Token,
Token ID11
Feature activation-4.47e-02

Pos logit contributions
umbs+0.017
oured+0.016
 hind+0.016
 sandwiches+0.016
ulent+0.016
Neg logit contributions
Wow-0.015
 sounded-0.014
 matter-0.014
L-0.014
Okay-0.013
Token wo
Token ID24486
Feature activation-5.47e-02

Pos logit contributions
oured+0.063
umbs+0.063
 hind+0.060
ulent+0.059
oops+0.058
Neg logit contributions
Wow-0.070
 sounded-0.069
L-0.067
 matter-0.066
Okay-0.062
Tokenolly
Token ID5098
Feature activation-2.07e-03

Pos logit contributions
 hind+0.069
 sandwiches+0.068
ulent+0.064
 fishing+0.060
 tum+0.060
Neg logit contributions
L-0.099
Wow-0.094
Okay-0.090
OK-0.085
 sounded-0.084
Token puppy
Token ID26188
Feature activation-4.09e-03

Pos logit contributions
 sandwiches+0.003
oured+0.003
 hind+0.003
umbs+0.003
ulent+0.003
Neg logit contributions
Wow-0.003
 sounded-0.003
L-0.003
Okay-0.003
 matter-0.003
Token called
Token ID1444
Feature activation-2.44e-02

Pos logit contributions
 sandwiches+0.008
oured+0.008
 hind+0.008
umbs+0.008
ulent+0.008
Neg logit contributions
Wow-0.004
 sounded-0.003
L-0.003
Okay-0.003
 matter-0.003
Token Max
Token ID5436
Feature activation-2.53e-04

Pos logit contributions
 sandwiches+0.037
 hind+0.036
oured+0.036
umbs+0.035
ulent+0.035
Neg logit contributions
Wow-0.034
 sounded-0.032
L-0.032
 matter-0.030
Okay-0.030
Token.
Token ID13
Feature activation-8.17e-04

Pos logit contributions
 sandwiches+0.001
 hind+0.001
oured+0.001
umbs+0.001
ulent+0.001
Neg logit contributions
Wow-0.000
 sounded-0.000
L-0.000
Okay-0.000
 matter-0.000
Token.
Token ID13
Feature activation-8.89e-03

Pos logit contributions
oured+0.013
ulent+0.013
 hind+0.013
 sandwiches+0.013
gary+0.013
Neg logit contributions
 sounded-0.004
L-0.004
Wow-0.004
er-0.004
Okay-0.003
Token They
Token ID1119
Feature activation-1.73e-02

Pos logit contributions
 sandwiches+0.018
oured+0.018
ulent+0.017
 hind+0.017
gary+0.017
Neg logit contributions
Wow-0.008
 sounded-0.008
L-0.008
Okay-0.007
 matter-0.007
Token put
Token ID1234
Feature activation-3.89e-03

Pos logit contributions
 hind+0.029
 sandwiches+0.029
oured+0.029
ulent+0.029
umbs+0.028
Neg logit contributions
Wow-0.021
 sounded-0.020
L-0.020
Okay-0.019
 matter-0.018
Token a
Token ID257
Feature activation-1.05e-02

Pos logit contributions
oured+0.006
 hind+0.006
ulent+0.006
umbs+0.006
 sandwiches+0.006
Neg logit contributions
Wow-0.005
 sounded-0.005
L-0.005
Okay-0.005
 matter-0.005
Token soft
Token ID2705
Feature activation-9.13e-03

Pos logit contributions
oured+0.020
ulent+0.020
umbs+0.020
oops+0.019
imes+0.019
Neg logit contributions
L-0.011
Wow-0.011
 sounded-0.010
Okay-0.009
 matter-0.009
Token,
Token ID11
Feature activation-5.45e-02

Pos logit contributions
oured+0.013
ulent+0.013
 hind+0.013
umbs+0.013
oops+0.012
Neg logit contributions
Wow-0.014
L-0.013
 sounded-0.013
er-0.012
 matter-0.012
Token warm
Token ID5814
Feature activation-7.55e-03

Pos logit contributions
umbs+0.086
oured+0.086
ulent+0.082
imes+0.081
 hind+0.078
Neg logit contributions
L-0.076
 sounded-0.075
Wow-0.072
 matter-0.067
Okay-0.066
Token blanket
Token ID18447
Feature activation-1.60e-02

Pos logit contributions
oured+0.010
ulent+0.010
 hind+0.010
umbs+0.010
oops+0.010
Neg logit contributions
Wow-0.012
L-0.012
 sounded-0.012
er-0.011
Okay-0.011
Token on
Token ID319
Feature activation-1.32e-02

Pos logit contributions
gary+0.030
ulent+0.030
 hind+0.029
earned+0.029
oured+0.028
Neg logit contributions
er-0.018
 sounded-0.017
L-0.016
.-0.016
Wow-0.015
Token Max
Token ID5436
Feature activation-9.79e-03

Pos logit contributions
 sandwiches+0.020
 hind+0.019
oured+0.019
umbs+0.018
ulent+0.018
Neg logit contributions
Wow-0.019
 sounded-0.018
L-0.018
Okay-0.018
 matter-0.017
Token.
Token ID13
Feature activation-8.23e-03

Pos logit contributions
 hind+0.019
gary+0.019
oured+0.019
imes+0.018
 sandwiches+0.018
Neg logit contributions
 sounded-0.009
L-0.009
er-0.008
.-0.008
 matter-0.007
Token wanted
Token ID2227
Feature activation-2.95e-02

Pos logit contributions
 hind+0.013
umbs+0.013
 sandwiches+0.013
ulent+0.013
oured+0.013
Neg logit contributions
 sounded-0.008
L-0.007
Wow-0.007
Okay-0.007
 matter-0.007
Token to
Token ID284
Feature activation-1.01e-02

Pos logit contributions
umbs+0.043
 hind+0.042
oured+0.040
ulent+0.040
sed+0.038
Neg logit contributions
 sounded-0.052
 matter-0.048
er-0.045
L-0.045
ate-0.044
Token design
Token ID1486
Feature activation-1.28e-03

Pos logit contributions
 sandwiches+0.014
 hind+0.013
oured+0.013
ulent+0.013
umbs+0.013
Neg logit contributions
Wow-0.016
 sounded-0.015
L-0.015
Okay-0.015
OK-0.014
Token a
Token ID257
Feature activation+1.22e-03

Pos logit contributions
oured+0.002
 hind+0.002
umbs+0.002
 sandwiches+0.002
ulent+0.002
Neg logit contributions
Wow-0.002
 sounded-0.002
L-0.002
 matter-0.002
Okay-0.002
Token big
Token ID1263
Feature activation-1.03e-02

Pos logit contributions
Wow+0.002
 sounded+0.002
L+0.002
 matter+0.002
Okay+0.002
Neg logit contributions
oured-0.002
 sandwiches-0.002
umbs-0.002
 hind-0.001
ulent-0.001
Token,
Token ID11
Feature activation-5.44e-02

Pos logit contributions
umbs+0.016
oured+0.016
ulent+0.016
 hind+0.016
gary+0.015
Neg logit contributions
Wow-0.013
 matter-0.013
 sounded-0.013
L-0.012
er-0.012
Token black
Token ID2042
Feature activation-5.90e-03

Pos logit contributions
umbs+0.080
oured+0.079
 hind+0.074
 sandwiches+0.073
ulent+0.073
Neg logit contributions
Wow-0.082
 sounded-0.081
 matter-0.079
L-0.079
Okay-0.071
Token anchor
Token ID18021
Feature activation+7.69e-03

Pos logit contributions
 sandwiches+0.008
oured+0.008
 hind+0.008
ulent+0.008
umbs+0.008
Neg logit contributions
Wow-0.009
 sounded-0.009
L-0.009
Okay-0.008
 matter-0.008
Token.
Token ID13
Feature activation+9.79e-03

Pos logit contributions
 sounded+0.008
er+0.008
 matter+0.007
L+0.007
cer+0.007
Neg logit contributions
 hind-0.014
umbs-0.014
oured-0.014
gary-0.014
ulent-0.014
Token\n
Token ID198
Feature activation-2.35e-03

Pos logit contributions
Wow+0.013
 sounded+0.013
 matter+0.013
cer+0.013
L+0.012
Neg logit contributions
oured-0.014
 sandwiches-0.014
umbs-0.014
 hind-0.014
 tum-0.014
Token\n
Token ID198
Feature activation+1.98e-03

Pos logit contributions
oured+0.005
 sandwiches+0.005
 hind+0.005
ulent+0.005
umbs+0.005
Neg logit contributions
Wow-0.002
L-0.002
 sounded-0.002
Okay-0.002
OK-0.002
Token spot
Token ID4136
Feature activation-2.17e-02

Pos logit contributions
Wow+0.001
L+0.001
 sounded+0.001
Okay+0.001
 matter+0.001
Neg logit contributions
oured-0.001
 sandwiches-0.001
umbs-0.001
 hind-0.001
ulent-0.001
Token and
Token ID290
Feature activation+2.89e-03

Pos logit contributions
ulent+0.040
gary+0.039
 hind+0.038
umbs+0.037
oured+0.036
Neg logit contributions
er-0.025
 sounded-0.025
L-0.023
cer-0.023
.-0.023
Token saw
Token ID2497
Feature activation+1.81e-02

Pos logit contributions
Wow+0.004
 sounded+0.004
L+0.004
Okay+0.004
 matter+0.004
Neg logit contributions
 sandwiches-0.004
umbs-0.004
oured-0.004
 hind-0.004
ulent-0.004
Token a
Token ID257
Feature activation-1.41e-03

Pos logit contributions
Wow+0.026
 sounded+0.025
L+0.023
 matter+0.022
er+0.022
Neg logit contributions
oured-0.029
umbs-0.028
 hind-0.027
gary-0.027
ulent-0.027
Token big
Token ID1263
Feature activation-1.27e-02

Pos logit contributions
 sandwiches+0.002
 hind+0.001
oured+0.001
ulent+0.001
 fishing+0.001
Neg logit contributions
Wow-0.003
 sounded-0.002
L-0.002
Okay-0.002
Fine-0.002
Token,
Token ID11
Feature activation-5.42e-02

Pos logit contributions
oured+0.018
 sandwiches+0.018
 hind+0.018
umbs+0.018
ulent+0.018
Neg logit contributions
Wow-0.018
 sounded-0.017
L-0.017
 matter-0.017
Okay-0.016
Token beautiful
Token ID4950
Feature activation+6.14e-03

Pos logit contributions
oured+0.076
umbs+0.075
 sandwiches+0.070
 hind+0.070
ulent+0.069
Neg logit contributions
Wow-0.085
 sounded-0.081
L-0.081
 matter-0.080
Okay-0.074
Token tree
Token ID5509
Feature activation-2.31e-03

Pos logit contributions
Wow+0.009
 sounded+0.009
L+0.009
Okay+0.009
Fine+0.008
Neg logit contributions
 sandwiches-0.009
oured-0.008
 hind-0.008
 tum-0.008
ulent-0.008
Token with
Token ID351
Feature activation-4.12e-03

Pos logit contributions
 sandwiches+0.005
oured+0.005
 hind+0.005
ulent+0.005
umbs+0.005
Neg logit contributions
Wow-0.002
 sounded-0.002
L-0.002
Okay-0.002
 matter-0.001
Token tasty
Token ID25103
Feature activation-8.80e-03

Pos logit contributions
oured+0.006
 hind+0.006
umbs+0.006
ipop+0.006
ulent+0.006
Neg logit contributions
Wow-0.006
 sounded-0.006
L-0.005
 matter-0.005
You-0.005
Token fruits
Token ID15921
Feature activation+2.25e-03

Pos logit contributions
oured+0.011
 hind+0.010
umbs+0.010
 sandwiches+0.010
ulent+0.010
Neg logit contributions
Wow-0.015
 sounded-0.014
L-0.014
 matter-0.014
er-0.013
Token.
Token ID13
Feature activation-2.52e-03

Pos logit contributions
Wow+0.022
Okay+0.019
 sounded+0.019
Fine+0.018
Really+0.018
Neg logit contributions
 sandwiches-0.049
 fishing-0.047
umbs-0.047
oured-0.047
 tum-0.046
Token They
Token ID1119
Feature activation+3.43e-03

Pos logit contributions
 sandwiches+0.005
oured+0.005
 hind+0.005
ulent+0.005
gary+0.005
Neg logit contributions
Wow-0.002
L-0.002
 sounded-0.002
Okay-0.002
 matter-0.002
Token brought
Token ID3181
Feature activation-1.32e-03

Pos logit contributions
Wow+0.004
 sounded+0.004
L+0.004
Okay+0.003
OK+0.003
Neg logit contributions
umbs-0.006
 hind-0.006
ulent-0.006
 sandwiches-0.006
gary-0.006
Token a
Token ID257
Feature activation-1.10e-03

Pos logit contributions
oured+0.002
 hind+0.002
ulent+0.002
sing+0.002
gary+0.002
Neg logit contributions
Wow-0.002
L-0.002
 sounded-0.002
OK-0.001
ate-0.001
Token big
Token ID1263
Feature activation-6.63e-03

Pos logit contributions
ulent+0.002
umbs+0.002
oured+0.002
oops+0.002
ives+0.001
Neg logit contributions
Wow-0.002
L-0.002
 sounded-0.002
OK-0.002
 matter-0.001
Token,
Token ID11
Feature activation-5.42e-02

Pos logit contributions
ulent+0.011
umbs+0.011
oured+0.010
 hind+0.010
shaped+0.010
Neg logit contributions
Wow-0.009
L-0.008
 sounded-0.008
 matter-0.008
cer-0.008
Token red
Token ID2266
Feature activation-1.43e-03

Pos logit contributions
umbs+0.083
oured+0.082
ulent+0.078
 hind+0.076
oops+0.073
Neg logit contributions
Wow-0.081
L-0.077
 sounded-0.075
 matter-0.071
OK-0.071
Token nap
Token ID25422
Feature activation-1.62e-03

Pos logit contributions
ulent+0.002
oured+0.002
umbs+0.002
 hind+0.002
gary+0.002
Neg logit contributions
Wow-0.002
L-0.002
 sounded-0.002
Okay-0.002
OK-0.002
Tokenkin
Token ID5116
Feature activation+1.63e-03

Pos logit contributions
 sandwiches+0.003
 hind+0.003
oured+0.003
umbs+0.003
ulent+0.003
Neg logit contributions
Wow-0.002
 sounded-0.002
L-0.002
 matter-0.002
Okay-0.002
Token to
Token ID284
Feature activation-2.04e-03

Pos logit contributions
er+0.002
L+0.002
 sounded+0.002
cer+0.002
Wow+0.001
Neg logit contributions
gary-0.003
 hind-0.003
ulent-0.003
oured-0.003
umbs-0.003
Token sit
Token ID1650
Feature activation+3.49e-03

Pos logit contributions
 sandwiches+0.003
oured+0.002
 hind+0.002
ulent+0.002
umbs+0.002
Neg logit contributions
Wow-0.003
 sounded-0.003
L-0.003
Okay-0.003
 matter-0.003
Token They
Token ID1119
Feature activation-6.59e-03

Pos logit contributions
 sandwiches+0.000
ulent+0.000
oured+0.000
 hind+0.000
gary+0.000
Neg logit contributions
Wow-0.000
L-0.000
 sounded-0.000
Okay-0.000
OK-0.000
Token also
Token ID635
Feature activation-7.73e-03

Pos logit contributions
 hind+0.012
 sandwiches+0.011
ulent+0.011
umbs+0.011
oured+0.011
Neg logit contributions
Wow-0.008
 sounded-0.008
L-0.008
Okay-0.007
 matter-0.007
Token bought
Token ID5839
Feature activation-6.18e-03

Pos logit contributions
 hind+0.014
umbs+0.014
 sandwiches+0.013
ulent+0.013
oured+0.013
Neg logit contributions
 sounded-0.009
L-0.009
Wow-0.009
 matter-0.008
Okay-0.008
Token a
Token ID257
Feature activation-2.36e-03

Pos logit contributions
 hind+0.011
ulent+0.011
umbs+0.011
oured+0.010
sing+0.010
Neg logit contributions
 sounded-0.008
Wow-0.007
L-0.007
 matter-0.007
er-0.006
Token big
Token ID1263
Feature activation-1.33e-02

Pos logit contributions
umbs+0.004
ulent+0.004
oops+0.004
oured+0.004
imes+0.003
Neg logit contributions
L-0.003
 sounded-0.003
Wow-0.003
 matter-0.003
OK-0.003
Token,
Token ID11
Feature activation-5.35e-02

Pos logit contributions
umbs+0.024
ulent+0.022
earned+0.022
imes+0.022
shaped+0.022
Neg logit contributions
 sounded-0.016
 matter-0.016
Wow-0.016
L-0.015
er-0.014
Token sour
Token ID11348
Feature activation+3.74e-04

Pos logit contributions
umbs+0.088
oured+0.087
ulent+0.082
oops+0.080
 hind+0.079
Neg logit contributions
 sounded-0.073
L-0.072
Wow-0.072
 matter-0.066
OK-0.059
Token,
Token ID11
Feature activation-3.59e-02

Pos logit contributions
 sounded+0.001
Wow+0.001
 matter+0.001
er+0.000
L+0.000
Neg logit contributions
ulent-0.001
umbs-0.001
oured-0.001
 hind-0.001
gary-0.001
Token green
Token ID4077
Feature activation-3.87e-04

Pos logit contributions
umbs+0.057
oured+0.056
 hind+0.054
ulent+0.054
gary+0.052
Neg logit contributions
Wow-0.051
 sounded-0.050
L-0.050
 matter-0.047
Okay-0.043
Token apple
Token ID17180
Feature activation-4.39e-03

Pos logit contributions
oured+0.001
umbs+0.001
ulent+0.001
gary+0.001
 hind+0.001
Neg logit contributions
Wow-0.001
 sounded-0.001
L-0.001
er-0.001
 matter-0.000
Token to
Token ID284
Feature activation-1.18e-02

Pos logit contributions
oured+0.007
 hind+0.007
gary+0.007
ulent+0.007
needed+0.007
Neg logit contributions
 sounded-0.005
er-0.005
L-0.005
Wow-0.005
 matter-0.005
Token from
Token ID422
Feature activation+1.66e-02

Pos logit contributions
ulent+0.019
oured+0.019
gary+0.018
 hind+0.017
 sandwiches+0.017
Neg logit contributions
er-0.012
L-0.011
 sounded-0.010
.-0.010
Wow-0.009
Token L
Token ID406
Feature activation+1.46e-03

Pos logit contributions
Wow+0.025
 sounded+0.025
Okay+0.024
L+0.024
 matter+0.024
Neg logit contributions
 sandwiches-0.022
oured-0.022
 hind-0.021
 fishing-0.021
ulent-0.020
Tokenizz
Token ID6457
Feature activation+1.55e-03

Pos logit contributions
Wow+0.002
 sounded+0.002
 matter+0.002
Okay+0.002
cer+0.002
Neg logit contributions
oured-0.002
 sandwiches-0.002
imes-0.002
umbs-0.002
gary-0.002
Tokenie
Token ID494
Feature activation-2.98e-03

Pos logit contributions
Wow+0.001
 sounded+0.001
L+0.001
er+0.001
Okay+0.001
Neg logit contributions
 hind-0.003
ulent-0.003
 sandwiches-0.003
umbs-0.003
oured-0.003
Token and
Token ID290
Feature activation+1.52e-03

Pos logit contributions
gary+0.006
 hind+0.006
umbs+0.006
 sandwiches+0.006
ulent+0.005
Neg logit contributions
er-0.003
 sounded-0.003
.-0.003
L-0.002
cer-0.002
Token inspected
Token ID34295
Feature activation+3.68e-02

Pos logit contributions
Wow+0.002
 sounded+0.002
L+0.002
Okay+0.002
 matter+0.002
Neg logit contributions
 sandwiches-0.002
oured-0.002
 hind-0.002
ulent-0.002
umbs-0.002
Token it
Token ID340
Feature activation+1.15e-02

Pos logit contributions
Okay+0.064
 narrative+0.062
Fine+0.060
Deal+0.059
Wow+0.059
Neg logit contributions
oured-0.042
 sandwiches-0.041
 several-0.041
shaped-0.039
colored-0.037
Token.
Token ID13
Feature activation-2.38e-03

Pos logit contributions
Wow+0.010
 sounded+0.009
L+0.009
Okay+0.008
 matter+0.008
Neg logit contributions
 sandwiches-0.024
oured-0.023
 hind-0.023
ulent-0.023
umbs-0.023
Token
Token ID198
Feature activation+6.48e-03

Pos logit contributions
oured+0.004
 sandwiches+0.004
 hind+0.003
ulent+0.003
umbs+0.003
Neg logit contributions
Wow-0.003
 sounded-0.003
 matter-0.003
L-0.003
Okay-0.003
Token
Token ID198
Feature activation+5.88e-03

Pos logit contributions
 matter+0.008
Wow+0.008
 sounded+0.008
Okay+0.007
cer+0.007
Neg logit contributions
 sandwiches-0.010
umbs-0.010
oured-0.010
 hind-0.010
 fishing-0.010
Tokenâ
Token ID22940
Feature activation+7.97e-03

Pos logit contributions
 matter+0.005
 sounded+0.005
cer+0.005
Wow+0.005
ators+0.005
Neg logit contributions
gary-0.011
 sandwiches-0.011
umbs-0.011
oured-0.011
ulent-0.011
Token and
Token ID290
Feature activation+8.79e-04

Pos logit contributions
 hind+0.013
ulent+0.013
gary+0.013
 sandwiches+0.012
oured+0.012
Neg logit contributions
 sounded-0.009
Wow-0.008
er-0.008
cer-0.007
L-0.007
Token his
Token ID465
Feature activation-1.21e-02

Pos logit contributions
Wow+0.001
 sounded+0.001
L+0.001
 matter+0.001
Okay+0.001
Neg logit contributions
oured-0.001
 sandwiches-0.001
 hind-0.001
umbs-0.001
ulent-0.001
Token friends
Token ID2460
Feature activation-6.95e-03

Pos logit contributions
ulent+0.021
umbs+0.021
oured+0.021
gary+0.020
ives+0.020
Neg logit contributions
OK-0.013
Wow-0.013
 sounded-0.013
 matter-0.013
 boy-0.012
Token cheered
Token ID34767
Feature activation+1.64e-02

Pos logit contributions
 hind+0.013
 sandwiches+0.012
ulent+0.012
umbs+0.012
gary+0.012
Neg logit contributions
 sounded-0.008
Wow-0.007
L-0.007
Okay-0.007
 matter-0.007
Token.
Token ID13
Feature activation+6.18e-03

Pos logit contributions
Wow+0.028
Fine+0.026
 narrative+0.026
Okay+0.026
OK+0.025
Neg logit contributions
 sandwiches-0.021
 to-0.019
 fishing-0.019
 tum-0.019
oured-0.019
Token Then
Token ID3244
Feature activation+2.37e-02

Pos logit contributions
 sounded+0.008
 matter+0.008
Wow+0.008
cer+0.008
L+0.008
Neg logit contributions
 sandwiches-0.009
oured-0.009
umbs-0.009
 hind-0.009
 tum-0.009
Token,
Token ID11
Feature activation+1.66e-02

Pos logit contributions
Wow+0.040
Okay+0.040
L+0.039
Fine+0.039
 sounded+0.038
Neg logit contributions
 sandwiches-0.028
 tum-0.027
 fishing-0.025
 several-0.023
oured-0.023
Token they
Token ID484
Feature activation-7.77e-03

Pos logit contributions
Wow+0.023
Okay+0.022
 sounded+0.022
L+0.022
Fine+0.022
Neg logit contributions
 sandwiches-0.025
 fishing-0.023
 tum-0.022
oured-0.022
 hind-0.022
Token all
Token ID477
Feature activation-7.60e-03

Pos logit contributions
 sandwiches+0.012
oured+0.012
 hind+0.012
umbs+0.012
ulent+0.012
Neg logit contributions
Wow-0.010
L-0.010
 sounded-0.009
Okay-0.009
 matter-0.009
Token sat
Token ID3332
Feature activation+1.27e-02

Pos logit contributions
oured+0.011
 sandwiches+0.010
 tum+0.010
oops+0.010
umbs+0.009
Neg logit contributions
Wow-0.011
L-0.011
Okay-0.011
Really-0.010
Fine-0.010
Token down
Token ID866
Feature activation-2.24e-02

Pos logit contributions
Wow+0.022
 matter+0.022
 sounded+0.022
L+0.022
Okay+0.021
Neg logit contributions
 sandwiches-0.015
oured-0.014
 fishing-0.014
 hind-0.013
 tum-0.013
Token she
Token ID673
Feature activation+3.98e-03

Pos logit contributions
Wow+0.017
 sounded+0.016
L+0.016
Okay+0.015
 matter+0.015
Neg logit contributions
 sandwiches-0.016
oured-0.016
 hind-0.016
ulent-0.015
umbs-0.015
Token heard
Token ID2982
Feature activation-6.22e-03

Pos logit contributions
 sounded+0.005
L+0.005
Wow+0.005
 matter+0.004
Okay+0.004
Neg logit contributions
 sandwiches-0.007
 hind-0.007
umbs-0.007
oured-0.006
gary-0.006
Token loud
Token ID7812
Feature activation+8.25e-04

Pos logit contributions
 hind+0.009
oured+0.009
 sandwiches+0.009
umbs+0.009
ulent+0.009
Neg logit contributions
 sounded-0.009
Wow-0.009
L-0.009
 matter-0.008
er-0.008
Token noises
Token ID26782
Feature activation-1.24e-02

Pos logit contributions
 sounded+0.001
Wow+0.001
L+0.001
 matter+0.001
er+0.001
Neg logit contributions
 sandwiches-0.001
oured-0.001
 hind-0.001
umbs-0.001
gary-0.001
Token coming
Token ID2406
Feature activation+3.20e-03

Pos logit contributions
 sandwiches+0.022
 hind+0.022
oured+0.021
ulent+0.021
umbs+0.021
Neg logit contributions
 sounded-0.014
Wow-0.014
L-0.013
 matter-0.013
er-0.012
Token from
Token ID422
Feature activation+2.82e-02

Pos logit contributions
Wow+0.005
Okay+0.005
 sounded+0.005
L+0.005
Fine+0.004
Neg logit contributions
 sandwiches-0.004
 hind-0.004
oured-0.004
ulent-0.004
umbs-0.004
Token her
Token ID607
Feature activation+1.49e-02

Pos logit contributions
Wow+0.046
 sounded+0.044
L+0.043
 matter+0.042
Okay+0.042
Neg logit contributions
 sandwiches-0.037
oured-0.036
umbs-0.035
 hind-0.035
ulent-0.035
Token house
Token ID2156
Feature activation+5.63e-03

Pos logit contributions
er+0.015
 sounded+0.014
L+0.014
cer+0.014
 matter+0.014
Neg logit contributions
gary-0.028
umbs-0.026
oured-0.026
ulent-0.026
oops-0.025
Token.
Token ID13
Feature activation+3.61e-03

Pos logit contributions
Wow+0.003
 sounded+0.003
L+0.003
 matter+0.003
Okay+0.003
Neg logit contributions
 sandwiches-0.013
oured-0.013
 hind-0.013
ulent-0.013
umbs-0.013
Token Izzy
Token ID46359
Feature activation+1.10e-03

Pos logit contributions
Wow+0.005
L+0.004
 sounded+0.004
Okay+0.004
OK+0.004
Neg logit contributions
 sandwiches-0.006
oured-0.006
 hind-0.006
ulent-0.006
umbs-0.006
Token quickly
Token ID2952
Feature activation-9.81e-03

Pos logit contributions
Wow+0.001
 sounded+0.001
L+0.001
Okay+0.001
 matter+0.001
Neg logit contributions
 sandwiches-0.002
oured-0.002
 hind-0.002
umbs-0.002
ulent-0.002
Token and
Token ID290
Feature activation-1.76e-02

Pos logit contributions
 sounded+0.004
er+0.004
cer+0.003
L+0.003
Wow+0.003
Neg logit contributions
 hind-0.007
gary-0.006
ulent-0.006
umbs-0.006
 sandwiches-0.006
Token a
Token ID257
Feature activation-1.94e-03

Pos logit contributions
umbs+0.028
 hind+0.028
gary+0.027
ulent+0.027
oured+0.027
Neg logit contributions
 sounded-0.025
Wow-0.023
L-0.023
 matter-0.021
OK-0.020
Token leash
Token ID40529
Feature activation-7.35e-04

Pos logit contributions
oured+0.003
umbs+0.003
 sandwiches+0.003
ulent+0.003
 hind+0.003
Neg logit contributions
Wow-0.003
 sounded-0.003
L-0.003
 matter-0.003
Okay-0.003
Token.
Token ID13
Feature activation-2.88e-03

Pos logit contributions
 hind+0.002
 sandwiches+0.002
ulent+0.002
gary+0.002
oured+0.002
Neg logit contributions
 sounded-0.001
Wow-0.001
L-0.000
er-0.000
Okay-0.000
Token He
Token ID679
Feature activation-7.75e-03

Pos logit contributions
 sandwiches+0.005
oured+0.005
 hind+0.005
umbs+0.005
ulent+0.005
Neg logit contributions
Wow-0.004
 sounded-0.003
L-0.003
Okay-0.003
 matter-0.003
Token opened
Token ID4721
Feature activation+3.18e-02

Pos logit contributions
 sandwiches+0.013
oured+0.013
 hind+0.013
ulent+0.013
umbs+0.013
Neg logit contributions
Wow-0.009
L-0.008
 sounded-0.008
Okay-0.008
 matter-0.008
Token the
Token ID262
Feature activation+1.08e-02

Pos logit contributions
Wow+0.048
Okay+0.047
L+0.046
 sounded+0.046
 matter+0.045
Neg logit contributions
 sandwiches-0.045
oured-0.043
 fishing-0.042
 hind-0.042
 tum-0.040
Token fence
Token ID13990
Feature activation-8.82e-03

Pos logit contributions
Wow+0.017
 sounded+0.016
L+0.016
Okay+0.016
 matter+0.015
Neg logit contributions
 sandwiches-0.015
oured-0.014
 hind-0.014
umbs-0.014
ulent-0.014
Token and
Token ID290
Feature activation-1.05e-02

Pos logit contributions
 hind+0.017
ulent+0.017
gary+0.017
oured+0.017
 sandwiches+0.016
Neg logit contributions
 sounded-0.009
er-0.009
Wow-0.008
L-0.008
Okay-0.007
Token took
Token ID1718
Feature activation+9.75e-03

Pos logit contributions
 sandwiches+0.016
 hind+0.016
ulent+0.016
umbs+0.016
oured+0.016
Neg logit contributions
Wow-0.014
 sounded-0.014
L-0.013
Okay-0.013
 matter-0.012
Token out
Token ID503
Feature activation-1.93e-02

Pos logit contributions
Wow+0.014
L+0.014
 sounded+0.014
 matter+0.013
Okay+0.013
Neg logit contributions
 hind-0.014
oured-0.014
ulent-0.014
 sandwiches-0.013
umbs-0.013
TokenSo
Token ID2396
Feature activation+1.28e-02

Pos logit contributions
Wow+0.003
 matter+0.003
 sounded+0.003
cer+0.003
Okay+0.003
Neg logit contributions
 sandwiches-0.006
ulent-0.006
 hind-0.006
umbs-0.006
gary-0.006
Token Bear
Token ID14732
Feature activation-8.08e-03

Pos logit contributions
Wow+0.020
Okay+0.019
L+0.019
 sounded+0.018
Fine+0.018
Neg logit contributions
 sandwiches-0.017
 hind-0.017
 tum-0.016
oured-0.016
ulent-0.016
Tokeny
Token ID88
Feature activation-1.36e-02

Pos logit contributions
 sandwiches+0.015
oured+0.015
 hind+0.015
ulent+0.015
umbs+0.015
Neg logit contributions
Wow-0.008
 sounded-0.008
L-0.008
Okay-0.007
 matter-0.007
Token wiped
Token ID21122
Feature activation+2.55e-02

Pos logit contributions
 sandwiches+0.022
 hind+0.022
oured+0.021
ulent+0.021
umbs+0.021
Neg logit contributions
Wow-0.017
 sounded-0.017
L-0.017
Okay-0.016
 matter-0.015
Token and
Token ID290
Feature activation-5.66e-03

Pos logit contributions
Wow+0.042
Okay+0.041
 sounded+0.040
L+0.039
 narrative+0.039
Neg logit contributions
 sandwiches-0.031
oured-0.030
 hind-0.030
 tum-0.029
umbs-0.029
Token wiped
Token ID21122
Feature activation+2.75e-02

Pos logit contributions
 sandwiches+0.008
oured+0.008
 hind+0.008
 tum+0.008
gary+0.008
Neg logit contributions
Wow-0.008
 sounded-0.008
Okay-0.008
L-0.007
 matter-0.007
Token the
Token ID262
Feature activation-1.53e-02

Pos logit contributions
Wow+0.048
Okay+0.048
 narrative+0.046
Really+0.046
Fine+0.045
Neg logit contributions
 sandwiches-0.030
 tum-0.029
oured-0.029
 several-0.028
 hind-0.028
Token fridge
Token ID25772
Feature activation+2.66e-03

Pos logit contributions
oured+0.025
ulent+0.024
 hind+0.023
gary+0.023
umbs+0.023
Neg logit contributions
L-0.020
Wow-0.020
 sounded-0.019
er-0.019
 matter-0.018
Token with
Token ID351
Feature activation+9.85e-03

Pos logit contributions
Wow+0.004
 sounded+0.003
L+0.003
Okay+0.003
 matter+0.003
Neg logit contributions
 sandwiches-0.004
oured-0.004
 hind-0.004
ulent-0.004
umbs-0.004
Token his
Token ID465
Feature activation-1.18e-03

Pos logit contributions
Wow+0.012
 sounded+0.012
 matter+0.011
L+0.011
Okay+0.010
Neg logit contributions
oured-0.016
ulent-0.016
 hind-0.016
 sandwiches-0.016
umbs-0.016
Token little
Token ID1310
Feature activation+3.08e-04

Pos logit contributions
oured+0.002
umbs+0.002
ulent+0.002
gary+0.002
 sandwiches+0.002
Neg logit contributions
L-0.002
 sounded-0.002
 matter-0.002
Wow-0.002
Okay-0.002
Token
Token ID220
Feature activation+4.56e-04

Pos logit contributions
 sandwiches+0.002
oured+0.002
 hind+0.002
ulent+0.002
umbs+0.002
Neg logit contributions
Wow-0.002
 sounded-0.001
L-0.001
Okay-0.001
 matter-0.001
Token
Token ID198
Feature activation-5.80e-03

Pos logit contributions
Wow+0.000
 sounded+0.000
L+0.000
Okay+0.000
 matter+0.000
Neg logit contributions
 sandwiches-0.001
oured-0.001
 hind-0.001
umbs-0.001
ulent-0.001
Token
Token ID198
Feature activation-4.84e-03

Pos logit contributions
 sandwiches+0.011
oured+0.011
 hind+0.011
ulent+0.011
umbs+0.011
Neg logit contributions
Wow-0.006
L-0.005
 sounded-0.005
Okay-0.005
OK-0.005
TokenJoe
Token ID19585
Feature activation-8.68e-03

Pos logit contributions
 sandwiches+0.010
oured+0.009
 hind+0.009
ulent+0.009
umbs+0.009
Neg logit contributions
Wow-0.005
 sounded-0.004
L-0.004
Okay-0.004
 matter-0.004
Token thanked
Token ID26280
Feature activation-1.21e-03

Pos logit contributions
 sandwiches+0.018
 hind+0.018
ulent+0.017
gary+0.017
umbs+0.017
Neg logit contributions
 sounded-0.008
L-0.007
Wow-0.007
er-0.006
Okay-0.006
Token the
Token ID262
Feature activation+9.66e-03

Pos logit contributions
 sandwiches+0.002
oured+0.002
 hind+0.002
ulent+0.002
umbs+0.002
Neg logit contributions
Wow-0.002
 sounded-0.002
L-0.002
Okay-0.002
 matter-0.002
Token children
Token ID1751
Feature activation-1.32e-02

Pos logit contributions
Wow+0.014
 sounded+0.014
L+0.014
Okay+0.013
 matter+0.013
Neg logit contributions
 sandwiches-0.014
 hind-0.014
oured-0.014
ulent-0.013
 fishing-0.013
Token and
Token ID290
Feature activation+3.09e-03

Pos logit contributions
 hind+0.021
umbs+0.021
gary+0.021
oured+0.020
ulent+0.020
Neg logit contributions
er-0.016
 sounded-0.015
.-0.015
L-0.015
cer-0.014
Token gave
Token ID2921
Feature activation+1.52e-02

Pos logit contributions
Wow+0.004
 sounded+0.004
L+0.004
Okay+0.003
 matter+0.003
Neg logit contributions
 sandwiches-0.005
oured-0.005
 hind-0.005
ulent-0.005
umbs-0.005
Token them
Token ID606
Feature activation-2.09e-02

Pos logit contributions
Wow+0.018
L+0.017
Okay+0.017
 sounded+0.017
 matter+0.016
Neg logit contributions
 sandwiches-0.026
oured-0.026
 fishing-0.024
 hind-0.024
ulent-0.024
Token a
Token ID257
Feature activation-8.24e-03

Pos logit contributions
 hind+0.034
ulent+0.033
umbs+0.033
sed+0.032
sing+0.032
Neg logit contributions
L-0.026
 sounded-0.025
Wow-0.024
er-0.022
Okay-0.022
Token them
Token ID606
Feature activation+1.39e-02

Pos logit contributions
Wow+0.045
Okay+0.042
 sounded+0.042
L+0.042
 narrative+0.041
Neg logit contributions
 sandwiches-0.036
 hind-0.035
ulent-0.033
 fishing-0.033
 tum-0.033
Token.
Token ID13
Feature activation+1.85e-02

Pos logit contributions
Wow+0.008
 sounded+0.007
L+0.007
Okay+0.006
 matter+0.006
Neg logit contributions
 sandwiches-0.032
oured-0.032
 hind-0.032
ulent-0.031
umbs-0.031
Token Sara
Token ID24799
Feature activation-3.89e-03

Pos logit contributions
Wow+0.024
 sounded+0.023
L+0.023
Okay+0.022
 matter+0.022
Neg logit contributions
 sandwiches-0.029
oured-0.029
 hind-0.029
ulent-0.028
umbs-0.028
Token and
Token ID290
Feature activation+1.74e-03

Pos logit contributions
 sandwiches+0.006
gary+0.006
oured+0.006
ulent+0.006
 hind+0.006
Neg logit contributions
 sounded-0.005
Wow-0.005
L-0.005
Okay-0.005
OK-0.005
Token Tom
Token ID4186
Feature activation+6.43e-03

Pos logit contributions
Wow+0.002
 sounded+0.002
L+0.002
Okay+0.002
OK+0.002
Neg logit contributions
umbs-0.003
oured-0.003
 hind-0.003
ulent-0.003
gary-0.003
Token are
Token ID389
Feature activation+1.82e-02

Pos logit contributions
Wow+0.010
 sounded+0.010
L+0.009
Okay+0.009
 matter+0.009
Neg logit contributions
 sandwiches-0.009
oured-0.009
 hind-0.008
ulent-0.008
umbs-0.008
Token scared
Token ID12008
Feature activation-1.95e-02

Pos logit contributions
 sounded+0.024
Wow+0.023
L+0.022
 matter+0.021
cer+0.020
Neg logit contributions
oured-0.029
gary-0.029
 sandwiches-0.028
umbs-0.028
ulent-0.028
Token.
Token ID13
Feature activation+1.84e-02

Pos logit contributions
 sandwiches+0.038
 hind+0.037
umbs+0.037
oured+0.036
 fishing+0.036
Neg logit contributions
Wow-0.019
Okay-0.017
 sounded-0.017
L-0.016
 matter-0.016
Token They
Token ID1119
Feature activation+1.42e-02

Pos logit contributions
Wow+0.020
 sounded+0.020
L+0.019
 matter+0.019
Okay+0.019
Neg logit contributions
 sandwiches-0.032
oured-0.031
 hind-0.031
umbs-0.031
 tum-0.031
Token run
Token ID1057
Feature activation-3.37e-03

Pos logit contributions
Wow+0.026
L+0.025
 sounded+0.024
Okay+0.024
cer+0.024
Neg logit contributions
 hind-0.015
 sandwiches-0.015
oured-0.015
 tum-0.014
ulent-0.014
Token away
Token ID1497
Feature activation+7.58e-03

Pos logit contributions
 sandwiches+0.004
gary+0.004
oured+0.004
umbs+0.004
ulent+0.004
Neg logit contributions
Wow-0.005
L-0.005
 sounded-0.005
Okay-0.005
OK-0.005
Token watching
Token ID4964
Feature activation-1.12e-02

Pos logit contributions
 sounded+0.021
Wow+0.019
L+0.019
 matter+0.018
er+0.018
Neg logit contributions
 sandwiches-0.017
umbs-0.017
 hind-0.016
oured-0.016
ulent-0.016
Token him
Token ID683
Feature activation-1.24e-03

Pos logit contributions
oured+0.020
 hind+0.020
umbs+0.020
 sandwiches+0.020
ulent+0.020
Neg logit contributions
 sounded-0.013
Wow-0.012
L-0.012
 matter-0.012
er-0.011
Token carefully
Token ID7773
Feature activation+7.73e-03

Pos logit contributions
gary+0.002
 hind+0.002
umbs+0.002
 sandwiches+0.002
oured+0.002
Neg logit contributions
 sounded-0.001
 matter-0.001
er-0.001
.-0.001
L-0.001
Token,
Token ID11
Feature activation-1.54e-03

Pos logit contributions
 sounded+0.008
Wow+0.007
 matter+0.007
L+0.007
er+0.007
Neg logit contributions
 sandwiches-0.014
umbs-0.014
 hind-0.014
gary-0.014
oured-0.014
Token because
Token ID780
Feature activation+5.35e-03

Pos logit contributions
umbs+0.002
 sandwiches+0.002
oured+0.002
 hind+0.002
gary+0.002
Neg logit contributions
Wow-0.002
 sounded-0.002
L-0.002
 matter-0.002
Okay-0.002
Token this
Token ID428
Feature activation+9.17e-03

Pos logit contributions
Wow+0.008
L+0.007
 sounded+0.007
Okay+0.007
OK+0.007
Neg logit contributions
 sandwiches-0.008
oured-0.007
 hind-0.007
ulent-0.007
 fishing-0.007
Token model
Token ID2746
Feature activation-1.92e-03

Pos logit contributions
Wow+0.012
Fine+0.012
Okay+0.011
Really+0.011
L+0.011
Neg logit contributions
 sandwiches-0.014
oured-0.014
 tum-0.013
 fishing-0.013
colored-0.013
Token was
Token ID373
Feature activation+8.95e-03

Pos logit contributions
 sandwiches+0.004
oured+0.004
 hind+0.004
umbs+0.004
ulent+0.004
Neg logit contributions
Wow-0.002
L-0.001
 sounded-0.001
Okay-0.001
 matter-0.001
Token very
Token ID845
Feature activation-6.04e-04

Pos logit contributions
 sounded+0.014
Wow+0.014
 matter+0.013
L+0.013
Okay+0.012
Neg logit contributions
oured-0.012
umbs-0.012
 sandwiches-0.012
 hind-0.012
ulent-0.012
Token complicated
Token ID8253
Feature activation+8.02e-03

Pos logit contributions
 sandwiches+0.001
 hind+0.001
oured+0.001
 fishing+0.001
ulent+0.001
Neg logit contributions
Wow-0.001
 sounded-0.001
L-0.001
Okay-0.001
Fine-0.001
Token.
Token ID13
Feature activation-7.41e-04

Pos logit contributions
Wow+0.007
 sounded+0.007
L+0.006
 matter+0.006
Okay+0.006
Neg logit contributions
 sandwiches-0.016
oured-0.016
 hind-0.016
umbs-0.016
ulent-0.016
Token box
Token ID3091
Feature activation-3.04e-03

Pos logit contributions
 sandwiches+0.001
oured+0.000
 hind+0.000
ulent+0.000
umbs+0.000
Neg logit contributions
Wow-0.000
 sounded-0.000
L-0.000
Okay-0.000
 matter-0.000
Token?"
Token ID1701
Feature activation+1.33e-02

Pos logit contributions
 hind+0.008
ulent+0.008
umbs+0.008
gary+0.008
oured+0.008
Neg logit contributions
 sounded-0.001
?-0.001
er-0.001
L-0.001
ate-0.001
Token Sam
Token ID3409
Feature activation+1.77e-02

Pos logit contributions
Wow+0.019
Okay+0.019
 sounded+0.018
cer+0.018
L+0.018
Neg logit contributions
 sandwiches-0.018
 tum-0.018
 fishing-0.017
oured-0.017
 hind-0.017
Token asks
Token ID7893
Feature activation-2.45e-03

Pos logit contributions
Wow+0.025
Okay+0.023
cer+0.023
L+0.023
ator+0.022
Neg logit contributions
 fishing-0.025
 sandwiches-0.025
 toast-0.025
 hind-0.024
oops-0.024
Token.
Token ID13
Feature activation+1.26e-02

Pos logit contributions
 sandwiches+0.005
 fishing+0.005
 tum+0.004
oured+0.004
umbs+0.004
Neg logit contributions
Wow-0.003
Okay-0.002
 matter-0.002
Really-0.002
OK-0.002
Token
Token ID198
Feature activation+2.03e-03

Pos logit contributions
Wow+0.016
 matter+0.015
Okay+0.014
 sounded+0.014
L+0.014
Neg logit contributions
 sandwiches-0.020
 fishing-0.020
 hind-0.020
oured-0.020
 tum-0.020
Token
Token ID198
Feature activation+9.68e-03

Pos logit contributions
 matter+0.003
Wow+0.003
 narrative+0.003
ators+0.003
cer+0.003
Neg logit contributions
 fishing-0.003
 to-0.002
 tum-0.002
umbs-0.002
gary-0.002
Token"
Token ID1
Feature activation+4.81e-03

Pos logit contributions
 matter+0.012
Wow+0.011
ators+0.011
 narrative+0.011
ator+0.011
Neg logit contributions
 fishing-0.016
ulent-0.015
gary-0.015
 sandwiches-0.015
 hind-0.015
TokenI
Token ID40
Feature activation+5.15e-03

Pos logit contributions
Wow+0.003
 sounded+0.003
L+0.003
 matter+0.003
Okay+0.003
Neg logit contributions
 sandwiches-0.011
oured-0.010
 hind-0.010
umbs-0.010
ulent-0.010
Token don
Token ID836
Feature activation-1.74e-02

Pos logit contributions
Wow+0.005
 sounded+0.004
L+0.004
Okay+0.004
 matter+0.004
Neg logit contributions
 sandwiches-0.010
oured-0.010
 hind-0.010
ulent-0.010
umbs-0.010
Token't
Token ID470
Feature activation-2.55e-02

Pos logit contributions
oured+0.027
 hind+0.026
 sandwiches+0.026
ulent+0.025
 tum+0.024
Neg logit contributions
 sounded-0.024
Wow-0.024
L-0.023
er-0.021
You-0.021
Token.
Token ID13
Feature activation-9.83e-03

Pos logit contributions
 sounded+0.002
er+0.002
L+0.002
Wow+0.002
Okay+0.001
Neg logit contributions
 hind-0.011
 sandwiches-0.010
ulent-0.010
umbs-0.010
oured-0.010
Token She
Token ID1375
Feature activation-4.58e-03

Pos logit contributions
 sandwiches+0.016
 hind+0.015
oured+0.015
ulent+0.015
umbs+0.015
Neg logit contributions
Wow-0.013
L-0.012
 sounded-0.012
Okay-0.012
OK-0.011
Token never
Token ID1239
Feature activation-2.42e-02

Pos logit contributions
umbs+0.009
ulent+0.009
gary+0.009
 sandwiches+0.009
 hind+0.009
Neg logit contributions
 sounded-0.005
L-0.004
Wow-0.004
Okay-0.004
OK-0.004
Token wanted
Token ID2227
Feature activation-1.72e-02

Pos logit contributions
 sandwiches+0.043
 hind+0.042
umbs+0.041
oured+0.041
ulent+0.041
Neg logit contributions
Wow-0.028
 sounded-0.028
L-0.027
 matter-0.027
Okay-0.026
Token to
Token ID284
Feature activation-1.15e-02

Pos logit contributions
umbs+0.021
oured+0.020
 hind+0.020
ulent+0.019
gary+0.018
Neg logit contributions
 matter-0.030
 sounded-0.029
er-0.029
L-0.028
Wow-0.028
Token let
Token ID1309
Feature activation+5.07e-03

Pos logit contributions
 sandwiches+0.015
oured+0.014
 hind+0.014
ulent+0.014
umbs+0.014
Neg logit contributions
Wow-0.019
 sounded-0.018
L-0.018
Okay-0.017
 matter-0.017
Token it
Token ID340
Feature activation-1.22e-04

Pos logit contributions
Wow+0.007
 sounded+0.006
L+0.006
 matter+0.006
Okay+0.006
Neg logit contributions
 sandwiches-0.008
oured-0.008
 hind-0.008
ulent-0.008
umbs-0.008
Token go
Token ID467
Feature activation-1.65e-02

Pos logit contributions
 sandwiches+0.000
oured+0.000
ulent+0.000
 hind+0.000
umbs+0.000
Neg logit contributions
 sounded-0.000
Wow-0.000
L-0.000
 matter-0.000
Okay-0.000
Token ever
Token ID1683
Feature activation+7.71e-03

Pos logit contributions
 sandwiches+0.030
 hind+0.030
oured+0.030
ulent+0.029
umbs+0.029
Neg logit contributions
Wow-0.018
L-0.017
 sounded-0.017
 matter-0.016
er-0.016
Token again
Token ID757
Feature activation+1.51e-02

Pos logit contributions
Wow+0.009
 sounded+0.009
L+0.008
 matter+0.008
Okay+0.008
Neg logit contributions
 sandwiches-0.013
oured-0.013
 hind-0.013
ulent-0.013
umbs-0.013
Token.
Token ID13
Feature activation-5.12e-03

Pos logit contributions
Wow+0.011
 sounded+0.010
L+0.009
Okay+0.009
 matter+0.009
Neg logit contributions
 sandwiches-0.033
oured-0.033
 hind-0.033
umbs-0.032
ulent-0.032
Token to
Token ID284
Feature activation+2.57e-03

Pos logit contributions
umbs+0.018
 sandwiches+0.018
oured+0.017
 hind+0.017
ulent+0.016
Neg logit contributions
 sounded-0.042
L-0.041
Wow-0.041
er-0.041
Okay-0.040
Token your
Token ID534
Feature activation+8.40e-03

Pos logit contributions
Wow+0.003
 sounded+0.003
L+0.003
Okay+0.003
 matter+0.003
Neg logit contributions
 sandwiches-0.004
oured-0.004
 hind-0.004
ulent-0.004
umbs-0.004
Token instincts
Token ID26744
Feature activation+5.10e-03

Pos logit contributions
Wow+0.013
L+0.012
 sounded+0.012
Okay+0.011
 matter+0.011
Neg logit contributions
 sandwiches-0.012
 hind-0.012
 tum-0.011
oured-0.011
ulent-0.011
Token.
Token ID13
Feature activation-2.46e-03

Pos logit contributions
Wow+0.006
 sounded+0.005
L+0.005
Okay+0.005
 matter+0.005
Neg logit contributions
 sandwiches-0.009
oured-0.009
 hind-0.009
ulent-0.009
umbs-0.009
Token You
Token ID921
Feature activation+1.73e-02

Pos logit contributions
umbs+0.003
ulent+0.003
 fishing+0.003
 tum+0.003
oured+0.003
Neg logit contributions
Wow-0.004
 matter-0.004
 sounded-0.004
Okay-0.004
cer-0.004
Token never
Token ID1239
Feature activation-1.35e-02

Pos logit contributions
Wow+0.024
L+0.022
Okay+0.021
 sounded+0.021
 matter+0.020
Neg logit contributions
 sandwiches-0.026
oured-0.026
 hind-0.026
umbs-0.026
ulent-0.025
Token know
Token ID760
Feature activation+4.37e-03

Pos logit contributions
 sandwiches+0.019
oured+0.018
 hind+0.018
umbs+0.018
ulent+0.018
Neg logit contributions
Wow-0.021
 sounded-0.020
L-0.020
Okay-0.019
 matter-0.019
Token when
Token ID618
Feature activation-6.69e-03

Pos logit contributions
Wow+0.006
 sounded+0.005
L+0.005
 matter+0.005
Okay+0.005
Neg logit contributions
oured-0.007
 sandwiches-0.007
umbs-0.007
 hind-0.007
ulent-0.007
Token something
Token ID1223
Feature activation+2.21e-03

Pos logit contributions
umbs+0.012
oured+0.011
 hind+0.011
 sandwiches+0.011
gary+0.010
Neg logit contributions
 sounded-0.008
 matter-0.008
Wow-0.008
L-0.007
OK-0.007
Token might
Token ID1244
Feature activation-5.31e-03

Pos logit contributions
Wow+0.002
 sounded+0.002
L+0.002
 matter+0.002
Okay+0.002
Neg logit contributions
 sandwiches-0.004
oured-0.004
 hind-0.004
umbs-0.004
ulent-0.004
Token split
Token ID6626
Feature activation+8.59e-03

Pos logit contributions
 sandwiches+0.009
oured+0.009
 hind+0.009
umbs+0.008
ulent+0.008
Neg logit contributions
 sounded-0.007
Wow-0.006
 matter-0.006
L-0.006
Okay-0.006
Token happy
Token ID3772
Feature activation+1.16e-02

Pos logit contributions
oured+0.016
 sandwiches+0.015
ulent+0.015
 hind+0.015
umbs+0.015
Neg logit contributions
Wow-0.018
L-0.017
 sounded-0.017
Okay-0.016
OK-0.016
Token.
Token ID13
Feature activation+1.95e-03

Pos logit contributions
Wow+0.020
 sounded+0.019
L+0.019
Okay+0.018
 matter+0.018
Neg logit contributions
 sandwiches-0.014
oured-0.014
 hind-0.013
umbs-0.013
ulent-0.013
Token He
Token ID679
Feature activation-5.06e-03

Pos logit contributions
Wow+0.002
 sounded+0.001
L+0.001
Okay+0.001
OK+0.001
Neg logit contributions
gary-0.004
 sandwiches-0.004
oured-0.004
ulent-0.004
umbs-0.004
Token held
Token ID2714
Feature activation+5.69e-03

Pos logit contributions
 sandwiches+0.009
umbs+0.009
ulent+0.009
oured+0.009
 hind+0.009
Neg logit contributions
Wow-0.006
 sounded-0.006
L-0.005
Okay-0.005
 matter-0.005
Token the
Token ID262
Feature activation-1.18e-02

Pos logit contributions
Wow+0.007
 sounded+0.007
L+0.007
 matter+0.006
OK+0.006
Neg logit contributions
umbs-0.009
ulent-0.009
 sandwiches-0.009
oured-0.009
 hind-0.009
Token balloon
Token ID21190
Feature activation-1.18e-02

Pos logit contributions
ulent+0.023
oured+0.022
imes+0.021
 hind+0.021
gary+0.021
Neg logit contributions
Wow-0.014
OK-0.012
 matter-0.011
 sounded-0.011
Okay-0.011
Token tight
Token ID5381
Feature activation-8.74e-03

Pos logit contributions
gary+0.022
umbs+0.022
oured+0.021
ulent+0.021
ives+0.020
Neg logit contributions
 sounded-0.011
cer-0.011
er-0.011
Wow-0.011
L-0.011
Token and
Token ID290
Feature activation+4.75e-03

Pos logit contributions
gary+0.015
umbs+0.015
 sandwiches+0.014
 hind+0.014
ulent+0.014
Neg logit contributions
Wow-0.010
 sounded-0.010
er-0.010
L-0.010
Okay-0.009
Token it
Token ID340
Feature activation+5.22e-03

Pos logit contributions
Wow+0.006
L+0.006
 sounded+0.006
Okay+0.006
 matter+0.006
Neg logit contributions
 sandwiches-0.007
oured-0.007
 hind-0.007
 tum-0.007
ulent-0.007
Token lifted
Token ID13663
Feature activation+1.75e-02

Pos logit contributions
Wow+0.006
 sounded+0.006
L+0.006
Okay+0.006
 matter+0.005
Neg logit contributions
 sandwiches-0.009
oured-0.009
 hind-0.009
ulent-0.009
umbs-0.009
Token him
Token ID683
Feature activation+7.22e-03

Pos logit contributions
 narrative+0.029
Wow+0.028
Okay+0.028
 sounded+0.028
Fine+0.027
Neg logit contributions
 sandwiches-0.021
 tum-0.020
oured-0.019
 several-0.019
 hind-0.019
Token then
Token ID788
Feature activation+2.88e-03

Pos logit contributions
Wow+0.006
L+0.006
 sounded+0.006
Okay+0.005
Fine+0.005
Neg logit contributions
 sandwiches-0.006
oured-0.005
 hind-0.005
 fishing-0.005
 tum-0.005
Token he
Token ID339
Feature activation+2.35e-04

Pos logit contributions
Wow+0.005
L+0.004
Okay+0.004
 sounded+0.004
 matter+0.004
Neg logit contributions
 sandwiches-0.004
oured-0.004
 fishing-0.004
 tum-0.004
ulent-0.004
Token saw
Token ID2497
Feature activation+1.23e-02

Pos logit contributions
Wow+0.000
 sounded+0.000
L+0.000
Okay+0.000
 matter+0.000
Neg logit contributions
 sandwiches-0.000
 hind-0.000
oured-0.000
umbs-0.000
ulent-0.000
Token something
Token ID1223
Feature activation-9.28e-03

Pos logit contributions
Wow+0.018
 sounded+0.018
L+0.018
 matter+0.017
Okay+0.016
Neg logit contributions
 hind-0.017
umbs-0.017
 sandwiches-0.017
oured-0.017
ulent-0.017
Token strange
Token ID6283
Feature activation+1.00e-02

Pos logit contributions
umbs+0.013
oured+0.013
 hind+0.013
ulent+0.013
 sandwiches+0.013
Neg logit contributions
 sounded-0.014
Wow-0.013
L-0.013
 matter-0.013
Okay-0.012
Token.
Token ID13
Feature activation-1.10e-02

Pos logit contributions
Wow+0.009
 sounded+0.009
L+0.009
 matter+0.008
Okay+0.008
Neg logit contributions
 sandwiches-0.019
oured-0.019
 hind-0.019
umbs-0.019
ulent-0.019
Token The
Token ID383
Feature activation-7.10e-03

Pos logit contributions
oured+0.016
 sandwiches+0.016
umbs+0.016
ulent+0.015
 hind+0.015
Neg logit contributions
Wow-0.015
 sounded-0.015
 matter-0.015
Okay-0.015
L-0.015
Token cat
Token ID3797
Feature activation-1.13e-02

Pos logit contributions
 sandwiches+0.011
oured+0.011
 hind+0.011
umbs+0.011
ulent+0.010
Neg logit contributions
Wow-0.010
 sounded-0.009
L-0.009
Okay-0.009
 matter-0.009
Token had
Token ID550
Feature activation-4.13e-03

Pos logit contributions
 sandwiches+0.021
oured+0.021
 hind+0.020
umbs+0.020
ulent+0.020
Neg logit contributions
Wow-0.012
 sounded-0.011
L-0.011
Okay-0.010
 matter-0.010
Token vanished
Token ID23717
Feature activation+3.59e-03

Pos logit contributions
 hind+0.007
oured+0.007
 sandwiches+0.007
umbs+0.007
ulent+0.006
Neg logit contributions
 sounded-0.006
 matter-0.005
L-0.005
Wow-0.005
er-0.004
Token!
Token ID0
Feature activation-9.58e-03

Pos logit contributions
Wow+0.004
 sounded+0.004
L+0.004
 matter+0.004
Okay+0.004
Neg logit contributions
 sandwiches-0.006
oured-0.006
 hind-0.006
umbs-0.006
ulent-0.006
Token?"
Token ID1701
Feature activation+1.75e-02

Pos logit contributions
ulent+0.006
oured+0.006
 hind+0.006
umbs+0.006
gary+0.006
Neg logit contributions
L-0.002
 sounded-0.002
Wow-0.002
cer-0.002
ate-0.002
Token Anna
Token ID11735
Feature activation+1.09e-02

Pos logit contributions
Wow+0.022
 sounded+0.021
L+0.020
Okay+0.020
 matter+0.020
Neg logit contributions
 sandwiches-0.029
 hind-0.028
oured-0.028
umbs-0.027
 tum-0.027
Token asked
Token ID1965
Feature activation+5.64e-04

Pos logit contributions
Wow+0.014
Okay+0.012
 matter+0.012
Really+0.012
 narrative+0.012
Neg logit contributions
 tum-0.017
oured-0.017
 fishing-0.017
 sandwiches-0.017
ulent-0.017
Token.
Token ID13
Feature activation+1.02e-02

Pos logit contributions
Wow+0.001
Okay+0.001
 matter+0.001
 sounded+0.000
L+0.000
Neg logit contributions
 sandwiches-0.001
oured-0.001
 tum-0.001
 fishing-0.001
ulent-0.001
Token She
Token ID1375
Feature activation+2.09e-02

Pos logit contributions
Wow+0.013
 matter+0.012
 sounded+0.012
L+0.012
Okay+0.012
Neg logit contributions
 sandwiches-0.016
 hind-0.016
oured-0.016
 tum-0.016
umbs-0.016
Token yawn
Token ID40908
Feature activation-6.66e-03

Pos logit contributions
Wow+0.031
 matter+0.028
 narrative+0.027
cer+0.027
Okay+0.027
Neg logit contributions
oured-0.031
 sandwiches-0.029
 tum-0.029
 hind-0.028
 fishing-0.027
Tokened
Token ID276
Feature activation+1.57e-02

Pos logit contributions
 sandwiches+0.006
oured+0.006
umbs+0.006
 hind+0.006
ulent+0.006
Neg logit contributions
Wow-0.013
 sounded-0.012
L-0.012
 matter-0.012
Okay-0.012
Token and
Token ID290
Feature activation-4.52e-03

Pos logit contributions
Wow+0.022
 matter+0.021
Okay+0.020
Fine+0.020
L+0.020
Neg logit contributions
 sandwiches-0.023
 tum-0.022
 to-0.021
 toast-0.021
 several-0.021
Token rubbed
Token ID31862
Feature activation+1.67e-02

Pos logit contributions
oured+0.006
 sandwiches+0.006
 tum+0.006
 hind+0.006
 fishing+0.006
Neg logit contributions
Wow-0.007
L-0.006
ator-0.006
cer-0.006
Okay-0.006
Token her
Token ID607
Feature activation+4.02e-03

Pos logit contributions
Wow+0.026
 matter+0.025
Okay+0.025
Fine+0.024
Really+0.024
Neg logit contributions
 sandwiches-0.023
 tum-0.022
 toast-0.021
umbs-0.020
oured-0.020
Token eyes
Token ID2951
Feature activation-1.29e-02

Pos logit contributions
 sounded+0.006
Wow+0.006
L+0.006
 matter+0.006
Okay+0.005
Neg logit contributions
ulent-0.006
gary-0.005
oured-0.005
umbs-0.005
 sandwiches-0.005
Token with
Token ID351
Feature activation+1.64e-02

Pos logit contributions
Wow+0.018
 matter+0.017
 sounded+0.017
Okay+0.017
L+0.016
Neg logit contributions
 sandwiches-0.024
 fishing-0.023
 hind-0.022
oured-0.022
 tum-0.022
Token their
Token ID511
Feature activation+1.71e-03

Pos logit contributions
Wow+0.023
 sounded+0.022
L+0.021
Okay+0.021
 matter+0.020
Neg logit contributions
 sandwiches-0.025
oured-0.024
 hind-0.024
ulent-0.024
umbs-0.024
Token ball
Token ID2613
Feature activation-4.60e-03

Pos logit contributions
Wow+0.002
L+0.002
OK+0.002
 matter+0.002
Okay+0.002
Neg logit contributions
oured-0.003
ulent-0.003
umbs-0.003
gary-0.003
oops-0.003
Token,
Token ID11
Feature activation-5.85e-03

Pos logit contributions
oured+0.011
ulent+0.011
 hind+0.011
 sandwiches+0.011
gary+0.011
Neg logit contributions
L-0.002
Wow-0.002
 sounded-0.002
er-0.002
Okay-0.002
Token happy
Token ID3772
Feature activation+1.10e-02

Pos logit contributions
umbs+0.009
oured+0.009
gary+0.009
ulent+0.009
 sandwiches+0.008
Neg logit contributions
Wow-0.008
L-0.008
 sounded-0.008
Okay-0.007
 matter-0.007
Token to
Token ID284
Feature activation-1.01e-02

Pos logit contributions
Wow+0.017
 sounded+0.017
L+0.016
Okay+0.016
 matter+0.016
Neg logit contributions
 sandwiches-0.015
oured-0.015
 hind-0.014
ulent-0.014
umbs-0.014
Token share
Token ID2648
Feature activation+2.95e-02

Pos logit contributions
 sandwiches+0.013
 hind+0.013
ulent+0.012
 fishing+0.012
 tum+0.012
Neg logit contributions
Wow-0.017
 sounded-0.016
Okay-0.015
L-0.015
Fine-0.015
Token and
Token ID290
Feature activation+1.51e-02

Pos logit contributions
Wow+0.051
Okay+0.050
Fine+0.048
 sounded+0.047
 narrative+0.047
Neg logit contributions
 sandwiches-0.040
 fishing-0.035
 tum-0.033
 toast-0.033
 hind-0.032
Token be
Token ID307
Feature activation+1.37e-02

Pos logit contributions
Wow+0.021
 sounded+0.020
L+0.019
Okay+0.019
 matter+0.018
Neg logit contributions
 sandwiches-0.024
 hind-0.022
 fishing-0.022
 tum-0.022
ulent-0.022
Token friends
Token ID2460
Feature activation+2.45e-02

Pos logit contributions
Wow+0.023
 matter+0.021
Okay+0.021
 sounded+0.021
Fine+0.021
Neg logit contributions
 sandwiches-0.018
 fishing-0.017
gary-0.016
ulent-0.016
 satisfied-0.015
Token.
Token ID13
Feature activation-2.66e-03

Pos logit contributions
Wow+0.037
 matter+0.035
inct+0.032
 sounded+0.032
Kids+0.031
Neg logit contributions
 sandwiches-0.038
 to-0.037
 goodbye-0.036
 tum-0.036
 fishing-0.036
Token their
Token ID511
Feature activation+4.09e-05

Pos logit contributions
Wow+0.014
Okay+0.013
cer+0.013
 matter+0.013
Fine+0.013
Neg logit contributions
 sandwiches-0.016
 tum-0.015
umbs-0.015
 toast-0.015
oured-0.014
Token wounds
Token ID14129
Feature activation+5.73e-03

Pos logit contributions
Wow+0.000
OK+0.000
L+0.000
 sounded+0.000
Okay+0.000
Neg logit contributions
oured-0.000
ulent-0.000
umbs-0.000
gary-0.000
 hind-0.000
Token.
Token ID13
Feature activation+8.68e-03

Pos logit contributions
Wow+0.005
L+0.004
 sounded+0.004
Okay+0.004
 matter+0.004
Neg logit contributions
 sandwiches-0.012
oured-0.012
ulent-0.012
 hind-0.012
umbs-0.012
Token They
Token ID1119
Feature activation-1.84e-02

Pos logit contributions
 matter+0.010
Wow+0.010
cer+0.010
 sounded+0.010
Okay+0.010
Neg logit contributions
 sandwiches-0.015
 tum-0.014
 hind-0.014
umbs-0.014
 fishing-0.014
Token gave
Token ID2921
Feature activation+2.65e-03

Pos logit contributions
 sandwiches+0.030
oured+0.029
 hind+0.029
umbs+0.028
ulent+0.028
Neg logit contributions
Wow-0.024
 sounded-0.022
L-0.022
Okay-0.021
 matter-0.021
Token them
Token ID606
Feature activation-4.10e-02

Pos logit contributions
Wow+0.002
 sounded+0.002
L+0.002
 matter+0.002
Okay+0.002
Neg logit contributions
ulent-0.006
 hind-0.006
oured-0.006
umbs-0.006
gary-0.006
Token some
Token ID617
Feature activation-3.14e-02

Pos logit contributions
 hind+0.067
ulent+0.063
oured+0.062
sing+0.061
sed+0.061
Neg logit contributions
L-0.058
 sounded-0.051
Wow-0.049
Mom-0.047
Okay-0.046
Token water
Token ID1660
Feature activation-2.07e-02

Pos logit contributions
 hind+0.052
oured+0.051
ulent+0.051
rolling+0.047
sing+0.047
Neg logit contributions
er-0.038
 sounded-0.038
L-0.037
 matter-0.036
.-0.035
Token and
Token ID290
Feature activation-1.95e-02

Pos logit contributions
 hind+0.040
ulent+0.039
gary+0.038
oured+0.038
shaped+0.036
Neg logit contributions
er-0.022
 sounded-0.020
L-0.019
ter-0.018
Wow-0.018
Token ice
Token ID4771
Feature activation-7.53e-03

Pos logit contributions
 hind+0.029
oured+0.028
ulent+0.028
umbs+0.027
gary+0.026
Neg logit contributions
 sounded-0.031
L-0.028
Wow-0.028
 matter-0.027
OK-0.025
Token cream
Token ID8566
Feature activation-6.11e-03

Pos logit contributions
 hind+0.012
ulent+0.012
oured+0.011
gary+0.011
umbs+0.011
Neg logit contributions
L-0.010
Wow-0.010
 sounded-0.010
er-0.009
OK-0.009
Token journey
Token ID7002
Feature activation-1.74e-02

Pos logit contributions
L+0.010
 sounded+0.010
 matter+0.010
Wow+0.009
rey+0.009
Neg logit contributions
umbs-0.014
oured-0.013
ives-0.013
ulent-0.013
imes-0.012
Token to
Token ID284
Feature activation+2.19e-02

Pos logit contributions
umbs+0.028
oured+0.027
ulent+0.027
 hind+0.027
gary+0.026
Neg logit contributions
L-0.024
.-0.023
 sounded-0.023
er-0.023
 matter-0.022
Token see
Token ID766
Feature activation+8.34e-03

Pos logit contributions
Wow+0.038
 sounded+0.036
L+0.036
Okay+0.035
 matter+0.035
Neg logit contributions
 sandwiches-0.026
oured-0.025
 hind-0.025
ulent-0.025
umbs-0.025
Token the
Token ID262
Feature activation+6.66e-03

Pos logit contributions
Wow+0.013
Okay+0.012
L+0.012
 sounded+0.012
OK+0.012
Neg logit contributions
 sandwiches-0.012
oured-0.011
 hind-0.011
ulent-0.011
 fishing-0.011
Token big
Token ID1263
Feature activation+1.69e-03

Pos logit contributions
 sounded+0.010
Fine+0.010
Wow+0.009
cer+0.009
L+0.009
Neg logit contributions
 sandwiches-0.009
 fishing-0.009
 tum-0.008
 hind-0.008
 leftover-0.008
Token,
Token ID11
Feature activation-4.10e-02

Pos logit contributions
Wow+0.003
L+0.002
 sounded+0.002
Okay+0.002
Fine+0.002
Neg logit contributions
 sandwiches-0.002
 hind-0.002
oured-0.002
 fishing-0.002
 tum-0.002
Token beautiful
Token ID4950
Feature activation+2.14e-02

Pos logit contributions
oured+0.058
umbs+0.057
ulent+0.054
 hind+0.054
 sandwiches+0.054
Neg logit contributions
Wow-0.066
 sounded-0.064
L-0.061
 matter-0.061
Okay-0.056
Token mountains
Token ID12269
Feature activation-4.63e-03

Pos logit contributions
Okay+0.034
Fine+0.034
L+0.033
Deal+0.033
Wow+0.032
Neg logit contributions
 sandwiches-0.030
 fishing-0.029
 tum-0.027
 toast-0.025
colored-0.024
Token.
Token ID13
Feature activation+6.47e-03

Pos logit contributions
oured+0.010
 hind+0.010
ulent+0.010
gary+0.010
 sandwiches+0.010
Neg logit contributions
 sounded-0.003
Wow-0.003
L-0.003
er-0.003
 matter-0.002
Token Her
Token ID2332
Feature activation+5.53e-03

Pos logit contributions
Wow+0.009
 sounded+0.009
cer+0.009
 matter+0.009
 narrative+0.009
Neg logit contributions
 sandwiches-0.009
oured-0.009
 tum-0.009
 hind-0.009
umbs-0.009
Token mom
Token ID1995
Feature activation+6.57e-03

Pos logit contributions
Wow+0.007
 sounded+0.006
L+0.006
Okay+0.006
OK+0.006
Neg logit contributions
umbs-0.009
oured-0.009
ulent-0.009
 sandwiches-0.009
gary-0.009
Token Anna
Token ID11735
Feature activation+1.02e-02

Pos logit contributions
 sandwiches+0.018
oured+0.018
 hind+0.018
ulent+0.018
umbs+0.018
Neg logit contributions
Wow-0.008
 sounded-0.007
L-0.007
Okay-0.007
 matter-0.007
Token.
Token ID13
Feature activation+1.17e-03

Pos logit contributions
Wow+0.014
Fine+0.013
Really+0.013
 narrative+0.013
Okay+0.012
Neg logit contributions
 to-0.016
 tum-0.015
 seen-0.015
 sandwiches-0.015
ulent-0.014
Token She
Token ID1375
Feature activation+7.10e-03

Pos logit contributions
Wow+0.001
 sounded+0.001
L+0.001
Okay+0.001
 matter+0.001
Neg logit contributions
 sandwiches-0.002
oured-0.002
 hind-0.002
ulent-0.002
umbs-0.002
Token was
Token ID373
Feature activation-1.46e-02

Pos logit contributions
Wow+0.010
 matter+0.009
cer+0.009
Okay+0.009
 sounded+0.009
Neg logit contributions
 sandwiches-0.010
 tum-0.010
 hind-0.010
 fishing-0.010
oured-0.010
Token 3
Token ID513
Feature activation-1.27e-02

Pos logit contributions
umbs+0.029
oured+0.027
ives+0.027
 sandwiches+0.026
 hind+0.026
Neg logit contributions
L-0.017
 sounded-0.016
Wow-0.016
You-0.015
 matter-0.015
Token years
Token ID812
Feature activation-4.33e-02

Pos logit contributions
 sandwiches+0.022
oured+0.022
 hind+0.022
ulent+0.021
umbs+0.021
Neg logit contributions
Wow-0.015
 sounded-0.014
L-0.014
Okay-0.013
 matter-0.013
Token old
Token ID1468
Feature activation+1.61e-02

Pos logit contributions
oured+0.077
ulent+0.076
umbs+0.074
 sandwiches+0.070
 hind+0.069
Neg logit contributions
er-0.052
L-0.051
ator-0.043
ate-0.041
Wow-0.040
Token.
Token ID13
Feature activation+4.81e-03

Pos logit contributions
Wow+0.030
 narrative+0.030
 matter+0.029
Okay+0.029
 sounded+0.029
Neg logit contributions
 sandwiches-0.016
 tum-0.015
 hind-0.014
 toast-0.014
 to-0.014
Token One
Token ID1881
Feature activation-1.86e-02

Pos logit contributions
Wow+0.006
 sounded+0.006
L+0.006
Okay+0.006
 matter+0.005
Neg logit contributions
 sandwiches-0.008
oured-0.008
 hind-0.008
ulent-0.008
umbs-0.007
Token day
Token ID1110
Feature activation+1.45e-03

Pos logit contributions
oured+0.035
 hind+0.035
ulent+0.034
 sandwiches+0.034
umbs+0.034
Neg logit contributions
Wow-0.020
 sounded-0.020
L-0.020
 matter-0.018
Okay-0.018
Token,
Token ID11
Feature activation+1.55e-02

Pos logit contributions
Wow+0.002
L+0.002
 sounded+0.002
Okay+0.002
 matter+0.002
Neg logit contributions
oured-0.002
 hind-0.002
ulent-0.002
umbs-0.002
 sandwiches-0.002
Token.
Token ID13
Feature activation-3.99e-03

Pos logit contributions
Wow+0.013
L+0.012
Okay+0.012
OK+0.011
Fine+0.011
Neg logit contributions
 sandwiches-0.024
 fishing-0.023
oured-0.022
 tum-0.022
 hind-0.022
Token It
Token ID632
Feature activation+8.03e-03

Pos logit contributions
 sandwiches+0.005
oured+0.005
 hind+0.005
ulent+0.005
umbs+0.005
Neg logit contributions
Wow-0.006
L-0.006
Okay-0.006
 sounded-0.006
 matter-0.006
Token was
Token ID373
Feature activation-1.40e-02

Pos logit contributions
Okay+0.011
OK+0.010
Deal+0.010
Fine+0.010
 narrative+0.010
Neg logit contributions
 sandwiches-0.012
 tum-0.012
 fishing-0.011
 several-0.011
 to-0.011
Token a
Token ID257
Feature activation-1.88e-02

Pos logit contributions
oured+0.023
umbs+0.022
gary+0.022
 hind+0.021
ulent+0.021
Neg logit contributions
 sounded-0.021
Wow-0.019
L-0.018
 matter-0.018
Why-0.017
Token big
Token ID1263
Feature activation-1.54e-02

Pos logit contributions
 sandwiches+0.022
oured+0.022
 hind+0.021
umbs+0.021
ulent+0.021
Neg logit contributions
Wow-0.032
 sounded-0.031
L-0.031
Okay-0.030
 matter-0.030
Token,
Token ID11
Feature activation-5.09e-02

Pos logit contributions
oured+0.022
 sandwiches+0.022
umbs+0.022
 hind+0.022
ulent+0.022
Neg logit contributions
Wow-0.022
 sounded-0.021
L-0.021
 matter-0.020
Okay-0.020
Token brown
Token ID7586
Feature activation-7.34e-03

Pos logit contributions
oured+0.070
umbs+0.069
 hind+0.065
ulent+0.065
 sandwiches+0.065
Neg logit contributions
Wow-0.083
 sounded-0.081
L-0.077
 matter-0.077
Okay-0.071
Token worm
Token ID23849
Feature activation+1.33e-02

Pos logit contributions
 sandwiches+0.011
oured+0.010
 hind+0.010
ulent+0.010
 tum+0.010
Neg logit contributions
Wow-0.010
 sounded-0.010
L-0.010
Okay-0.010
 matter-0.009
Token!
Token ID0
Feature activation+3.09e-04

Pos logit contributions
Wow+0.017
Okay+0.016
 matter+0.015
OK+0.015
Fine+0.015
Neg logit contributions
 sandwiches-0.023
ulent-0.022
 fishing-0.022
 tum-0.021
umbs-0.021
Token
Token ID220
Feature activation+3.10e-03

Pos logit contributions
Wow+0.000
L+0.000
Okay+0.000
 sounded+0.000
 matter+0.000
Neg logit contributions
 sandwiches-0.000
oured-0.000
 hind-0.000
umbs-0.000
ulent-0.000
Token
Token ID198
Feature activation-3.75e-03

Pos logit contributions
Wow+0.004
 matter+0.004
Okay+0.004
 sounded+0.004
Fine+0.004
Neg logit contributions
umbs-0.005
 sandwiches-0.005
 fishing-0.004
 tum-0.004
oured-0.004
Token would
Token ID561
Feature activation-2.71e-02

Pos logit contributions
 sandwiches+0.024
 hind+0.023
oured+0.023
umbs+0.023
ulent+0.023
Neg logit contributions
Wow-0.017
 sounded-0.016
L-0.016
 matter-0.015
Okay-0.015
Token pick
Token ID2298
Feature activation-2.24e-02

Pos logit contributions
 sandwiches+0.042
oured+0.041
umbs+0.041
 hind+0.041
ulent+0.040
Neg logit contributions
 sounded-0.037
Wow-0.037
 matter-0.036
L-0.036
Okay-0.034
Token up
Token ID510
Feature activation-3.52e-02

Pos logit contributions
 hind+0.038
ulent+0.037
umbs+0.036
oured+0.036
gary+0.035
Neg logit contributions
Wow-0.030
 sounded-0.029
L-0.029
er-0.027
ate-0.026
Token her
Token ID607
Feature activation-1.74e-02

Pos logit contributions
ulent+0.072
umbs+0.072
oured+0.072
 hind+0.071
sing+0.068
Neg logit contributions
L-0.033
Wow-0.032
 sounded-0.031
 matter-0.030
er-0.028
Token cr
Token ID1067
Feature activation-2.57e-02

Pos logit contributions
umbs+0.032
ulent+0.030
 hind+0.029
oured+0.029
gary+0.029
Neg logit contributions
L-0.021
Wow-0.020
 matter-0.019
ate-0.019
er-0.019
Tokenay
Token ID323
Feature activation-3.93e-02

Pos logit contributions
 hind+0.040
 fishing+0.040
 sandwiches+0.040
ulent+0.039
oured+0.038
Neg logit contributions
L-0.034
er-0.033
 sounded-0.032
Wow-0.032
ate-0.032
Tokenons
Token ID684
Feature activation-1.24e-02

Pos logit contributions
 hind+0.059
oured+0.052
ulent+0.052
 fishing+0.049
 sandwiches+0.048
Neg logit contributions
L-0.063
 sounded-0.062
er-0.060
Wow-0.058
 matter-0.056
Token and
Token ID290
Feature activation+3.16e-03

Pos logit contributions
 hind+0.026
ulent+0.025
oured+0.025
gary+0.024
needed+0.023
Neg logit contributions
er-0.012
 sounded-0.011
L-0.011
.-0.010
Wow-0.010
Token start
Token ID923
Feature activation-1.62e-02

Pos logit contributions
Wow+0.004
 sounded+0.004
 matter+0.004
L+0.004
Okay+0.004
Neg logit contributions
umbs-0.005
oured-0.005
 sandwiches-0.005
ulent-0.005
 hind-0.005
Token to
Token ID284
Feature activation-9.24e-03

Pos logit contributions
 hind+0.020
umbs+0.019
oured+0.019
ulent+0.018
 sandwiches+0.017
Neg logit contributions
 sounded-0.028
L-0.028
Wow-0.028
 matter-0.027
ate-0.026
Token draw
Token ID3197
Feature activation-2.48e-03

Pos logit contributions
 sandwiches+0.009
oured+0.009
 hind+0.008
 tum+0.008
ulent+0.008
Neg logit contributions
Wow-0.017
Okay-0.016
 sounded-0.016
cer-0.016
L-0.016