TOP ACTIVATIONS
MAX = 0.127

clown s juggling and ac rob ats doing flips . But
thought they looked like ac rob ats . She enjoyed the
of fun watching the ac rob ats and clown s .
the animals , the ac rob ats and the clown s
saw clown s and ac rob ats and lions and elephants
elephants , lions and ac rob ats . They are amazed
clown s and the ac rob ats and the lions and
because of his big bus hy mustache . One
mom my said , " we 'll be home soon ."
soon nothing but a small spe ck in the sky .
had a big , bus hy tail and sharp teeth .
of the frog . He spe ared the lazy frog and
away , a bright blue spe ck in the sky .
flowers and birds and rain bows . One day
air and looked for cob we bs to play in .
for Santa and his re ind eer ," Anna says to
great at making beautiful cob we bs , and I am
It was covered with cob we bs and the paint was
The eggs were blue and spe ck led . They looked
big elephants and tall gir aff es . Lily and her

MIN ACTIVATIONS
MIN = -0.114

the cars stopped and the pedestrian light turned green . Then
Sam gasped in amaz ement . He had never
Her best friend watched in amaz ement as Amelia gigg led
YOU STEP ON THE LE AF ?!" The embarrassed
wind . Everyone watched in amaz ement . The
looked at each other in amaz ement . They
the little girl watched in amaz ement . Mama 's fingers
watched the k ite in amaz ement , and Little Joe
L ucky watched in amaz ement as the frog zoom
Wow ," Emily gasped in amaz ement . She realized that
the zipper and gasped with amaz ement at all the treasures
baseball ! He gasped in amaz ement and jumped up and
as the woman watched in amaz ement . She saw the
ered in , blinking in amaz ement . But
. To her amaz ement , the clothes came
The little boy watched in amaz ement . The
her eyes grew wide with amaz ement . She wanted to
, her eyes wide with amaz ement . "
and M ummy gasped with amaz ement . "
seen ! She gasped in amaz ement , and her tears

INTERVAL 0.067 - 0.127
CONTAINS 0.002%

of fun watching the ac rob ats and clown s .
. They were blue and spe ck led . L ila
. Do you like rain bows ?" Anna asked .
and had a big bus hy tail . Jack
The eggs were blue and spe ck led . L ila

INTERVAL 0.007 - 0.067
CONTAINS 13.288%

some leaves and mud to make the walls ."
He walked through the grass until he saw a rough stone
there , but he didn 't worry too much .
lot of work to do all day long . But ,
and papers and stickers . She liked to make pictures of

INTERVAL -0.054 - 0.007
CONTAINS 86.703%

to give the rabbit a hug , so he opened his
. He made a clay mask and put it on his
not fair . I have to un pack my bag and
oppy cries . She has no balls left . She sits
of what he had accomplished . He thanked the little girl

INTERVAL -0.114 - -0.054
CONTAINS 0.007%

and dangerous to explore the woods alone . With her pole
were c aw ing in unison to make it sound like
any more without asking . Deal ?" L ily
. She was filled with amaz ement . At
the bush and watched in amaz ement as it flew away
Token clown
Token ID25573
Feature activation+3.82e-03

Pos logit contributions
 unison+0.008
 agreement+0.008
 bur+0.006
 Deal+0.006
 amaz+0.006
Neg logit contributions
w-0.011
ane-0.011
iky-0.011
rob-0.010
ooky-0.010
Tokens
Token ID82
Feature activation-3.87e-03

Pos logit contributions
w+0.005
iky+0.004
uch+0.004
ooky+0.004
ane+0.004
Neg logit contributions
 Mama-0.007
 agreement-0.007
 Mum-0.007
 amaz-0.007
 unison-0.007
Token juggling
Token ID48097
Feature activation-1.67e-03

Pos logit contributions
 agreement+0.006
 unison+0.006
 amaz+0.005
 towels+0.005
 bur+0.005
Neg logit contributions
ane-0.006
iky-0.006
hy-0.006
bows-0.006
z-0.006
Token and
Token ID290
Feature activation-2.54e-03

Pos logit contributions
 Mama+0.003
 Papa+0.003
 son+0.003
 mum+0.003
 amaz+0.003
Neg logit contributions
iky-0.003
w-0.002
hy-0.002
bows-0.002
t-0.002
Token ac
Token ID936
Feature activation+5.92e-03

Pos logit contributions
 agreement+0.005
 unison+0.004
 amaz+0.004
 son+0.004
 towels+0.004
Neg logit contributions
rob-0.004
working-0.004
bows-0.004
z-0.003
whe-0.003
Tokenrob
Token ID22609
Feature activation+9.89e-02

Pos logit contributions
ooky+0.002
working+0.002
iky+0.002
ane+0.002
uch+0.002
Neg logit contributions
 unison-0.014
 Sam-0.014
 Mama-0.014
 Mum-0.014
 towels-0.014
Tokenats
Token ID1381
Feature activation+2.52e-03

Pos logit contributions
t+0.112
ane+0.112
working+0.109
bows+0.108
ooky+0.108
Neg logit contributions
 hands-0.179
 unison-0.173
 agreement-0.171
 amaz-0.170
 dad-0.169
Token doing
Token ID1804
Feature activation-2.27e-03

Pos logit contributions
t+0.004
iky+0.004
w+0.004
ane+0.004
bows+0.004
Neg logit contributions
 amaz-0.004
 unison-0.004
 agreement-0.003
 Mama-0.003
 Mum-0.003
Token flips
Token ID45971
Feature activation-2.05e-02

Pos logit contributions
 Mama+0.003
 Papa+0.003
 son+0.003
 Mum+0.003
 agreement+0.003
Neg logit contributions
iky-0.004
z-0.003
bows-0.003
rob-0.003
ooky-0.003
Token.
Token ID13
Feature activation-1.49e-03

Pos logit contributions
 amaz+0.031
 son+0.031
 agreement+0.030
 unison+0.030
 Mama+0.030
Neg logit contributions
iky-0.034
ooky-0.033
t-0.033
bows-0.033
z-0.032
Token But
Token ID887
Feature activation-7.71e-03

Pos logit contributions
 unison+0.002
 agreement+0.002
 amaz+0.002
 towels+0.002
 son+0.002
Neg logit contributions
ane-0.002
t-0.002
bows-0.002
working-0.002
w-0.002
Token thought
Token ID1807
Feature activation+6.72e-04

Pos logit contributions
w+0.005
iky+0.005
uch+0.004
ooky+0.004
ane+0.004
Neg logit contributions
 Mama-0.003
 Mum-0.003
 unison-0.003
 amaz-0.003
 agreement-0.003
Token they
Token ID484
Feature activation-2.84e-03

Pos logit contributions
iky+0.001
w+0.001
ooky+0.001
ane+0.001
t+0.001
Neg logit contributions
 unison-0.001
 agreement-0.001
 amaz-0.001
 son-0.001
 Mum-0.001
Token looked
Token ID3114
Feature activation+9.53e-05

Pos logit contributions
 Mama+0.003
 son+0.003
 amaz+0.003
 Mum+0.003
 Dad+0.003
Neg logit contributions
w-0.006
rob-0.005
iky-0.005
ane-0.005
uch-0.005
Token like
Token ID588
Feature activation+4.34e-03

Pos logit contributions
w+0.000
ooky+0.000
iky+0.000
uch+0.000
working+0.000
Neg logit contributions
 Mama-0.000
 Mum-0.000
 unison-0.000
 mum-0.000
 son-0.000
Token ac
Token ID936
Feature activation+9.25e-03

Pos logit contributions
bows+0.007
working+0.006
ane+0.006
t+0.006
whe+0.006
Neg logit contributions
 agreement-0.007
 unison-0.006
 amaz-0.006
 Deal-0.006
 hand-0.006
Tokenrob
Token ID22609
Feature activation+9.79e-02

Pos logit contributions
lder+0.006
working+0.006
ane+0.006
iky+0.006
mod+0.006
Neg logit contributions
 water-0.018
 Sam-0.018
 too-0.018
 his-0.018
 towels-0.017
Tokenats
Token ID1381
Feature activation+9.03e-03

Pos logit contributions
ane+0.124
t+0.123
hy+0.122
bows+0.120
working+0.116
Neg logit contributions
 hands-0.166
 ways-0.158
 friends-0.157
 lunch-0.152
 towels-0.151
Token.
Token ID13
Feature activation-7.04e-04

Pos logit contributions
bows+0.016
t+0.015
ane+0.015
w+0.015
iky+0.015
Neg logit contributions
 amaz-0.014
 unison-0.013
 agreement-0.013
 Mum-0.012
 mum-0.012
Token She
Token ID1375
Feature activation+3.88e-03

Pos logit contributions
 unison+0.001
 agreement+0.001
 amaz+0.001
 Mama+0.001
 Mum+0.001
Neg logit contributions
w-0.001
iky-0.001
ooky-0.001
ane-0.001
uch-0.001
Token enjoyed
Token ID8359
Feature activation-9.05e-03

Pos logit contributions
w+0.006
iky+0.006
uch+0.006
z+0.006
working+0.006
Neg logit contributions
 agreement-0.006
 unison-0.006
 amaz-0.006
 Mum-0.006
 Mama-0.006
Token the
Token ID262
Feature activation-1.79e-03

Pos logit contributions
 unison+0.015
 son+0.014
 agreement+0.014
 amaz+0.014
 Mum+0.013
Neg logit contributions
rob-0.013
working-0.013
z-0.013
w-0.013
bows-0.012
Token of
Token ID286
Feature activation-1.09e-02

Pos logit contributions
t+0.008
iky+0.008
ooky+0.007
ane+0.007
hy+0.007
Neg logit contributions
 Mama-0.010
 amaz-0.010
 Papa-0.010
 mum-0.010
 son-0.010
Token fun
Token ID1257
Feature activation-7.37e-04

Pos logit contributions
 Mama+0.014
 unison+0.013
 Papa+0.013
 Mum+0.013
 mum+0.013
Neg logit contributions
w-0.020
rob-0.020
iky-0.020
ooky-0.019
uch-0.019
Token watching
Token ID4964
Feature activation-9.71e-03

Pos logit contributions
 Mama+0.001
 agreement+0.001
 Mum+0.001
 Papa+0.001
 mum+0.001
Neg logit contributions
w-0.001
iky-0.001
rob-0.001
z-0.001
ooky-0.001
Token the
Token ID262
Feature activation-3.87e-03

Pos logit contributions
 amaz+0.013
 agreement+0.013
 unison+0.013
 bur+0.013
 shovel+0.013
Neg logit contributions
hy-0.015
bows-0.015
iky-0.014
ane-0.014
ooky-0.014
Token ac
Token ID936
Feature activation+7.26e-03

Pos logit contributions
 Mama+0.006
 Mum+0.006
 amaz+0.006
 unison+0.006
 agreement+0.006
Neg logit contributions
w-0.006
iky-0.006
ooky-0.006
uch-0.006
ane-0.006
Tokenrob
Token ID22609
Feature activation+9.77e-02

Pos logit contributions
mod+0.004
iky+0.004
working+0.004
w+0.004
bows+0.004
Neg logit contributions
 water-0.015
 his-0.015
 Sam-0.015
 Mama-0.014
 towels-0.014
Tokenats
Token ID1381
Feature activation+6.52e-03

Pos logit contributions
ane+0.104
w+0.104
iky+0.103
bows+0.101
t+0.098
Neg logit contributions
 son-0.177
 unison-0.174
 dad-0.174
 agreement-0.173
 mum-0.173
Token and
Token ID290
Feature activation-1.69e-03

Pos logit contributions
ane+0.011
t+0.011
ooky+0.011
ding+0.011
whe+0.011
Neg logit contributions
 amaz-0.009
 unison-0.009
 Mama-0.009
 agreement-0.009
 Papa-0.009
Token clown
Token ID25573
Feature activation+5.33e-03

Pos logit contributions
 unison+0.002
 agreement+0.002
 towel+0.002
 Deal+0.002
ulp+0.002
Neg logit contributions
bows-0.002
hy-0.002
whe-0.002
working-0.002
rob-0.002
Tokens
Token ID82
Feature activation-1.77e-04

Pos logit contributions
ane+0.005
z+0.005
t+0.005
w+0.005
iky+0.005
Neg logit contributions
 agreement-0.012
 amaz-0.012
 Mama-0.012
 Mum-0.011
 unison-0.011
Token.
Token ID13
Feature activation-4.60e-04

Pos logit contributions
 Papa+0.000
 Mama+0.000
 agreement+0.000
 bur+0.000
 unison+0.000
Neg logit contributions
bows-0.000
hy-0.000
ane-0.000
whe-0.000
z-0.000
Token the
Token ID262
Feature activation-1.85e-03

Pos logit contributions
rob+0.005
iky+0.005
ooky+0.005
ane+0.005
w+0.005
Neg logit contributions
 unison-0.003
 agreement-0.003
 amaz-0.003
 Mum-0.003
 mum-0.003
Token animals
Token ID4695
Feature activation+6.24e-03

Pos logit contributions
 Mama+0.002
 Mum+0.002
 unison+0.002
 amaz+0.002
 agreement+0.002
Neg logit contributions
w-0.004
iky-0.003
ooky-0.003
uch-0.003
ane-0.003
Token,
Token ID11
Feature activation+1.38e-03

Pos logit contributions
ane+0.010
iky+0.010
working+0.010
ooky+0.010
bows+0.010
Neg logit contributions
 unison-0.009
 amaz-0.009
 agreement-0.009
 Mum-0.009
 Mama-0.009
Token the
Token ID262
Feature activation-1.47e-03

Pos logit contributions
hy+0.002
bows+0.002
working+0.002
iky+0.002
t+0.002
Neg logit contributions
 unison-0.002
 amaz-0.002
 agreement-0.002
 mum-0.002
 hands-0.002
Token ac
Token ID936
Feature activation+9.25e-03

Pos logit contributions
 amaz+0.002
 unison+0.002
 agreement+0.002
 Mama+0.002
 Mum+0.002
Neg logit contributions
iky-0.002
w-0.002
rob-0.002
bows-0.002
ooky-0.002
Tokenrob
Token ID22609
Feature activation+9.67e-02

Pos logit contributions
t+0.005
mod+0.005
working+0.005
hy+0.005
iky+0.005
Neg logit contributions
 water-0.020
 his-0.020
 Sam-0.020
 Mama-0.020
 towels-0.019
Tokenats
Token ID1381
Feature activation+9.79e-03

Pos logit contributions
bows+0.123
ane+0.121
hy+0.119
t+0.117
Bloom+0.117
Neg logit contributions
 hands-0.153
 friends-0.152
 dad-0.149
 their-0.149
 towels-0.147
Token and
Token ID290
Feature activation+3.36e-04

Pos logit contributions
ane+0.017
iky+0.017
t+0.017
w+0.017
working+0.016
Neg logit contributions
 amaz-0.014
 unison-0.013
 agreement-0.013
 son-0.013
 Mum-0.013
Token the
Token ID262
Feature activation-1.55e-03

Pos logit contributions
working+0.001
hy+0.000
bows+0.000
grand+0.000
rob+0.000
Neg logit contributions
 unison-0.000
 agreement-0.000
 amaz-0.000
 towel-0.000
ulp-0.000
Token clown
Token ID25573
Feature activation+8.41e-03

Pos logit contributions
 Mama+0.002
 unison+0.002
 Mum+0.002
 amaz+0.002
 agreement+0.002
Neg logit contributions
rob-0.002
bows-0.002
w-0.002
iky-0.002
hy-0.002
Tokens
Token ID82
Feature activation+5.01e-03

Pos logit contributions
ane+0.009
w+0.009
iky+0.009
t+0.009
ooky+0.009
Neg logit contributions
 amaz-0.017
 agreement-0.017
 unison-0.017
 Mama-0.016
 Mum-0.016
Token saw
Token ID2497
Feature activation+9.96e-04

Pos logit contributions
w+0.004
iky+0.003
t+0.003
hy+0.003
rob+0.003
Neg logit contributions
 agreement-0.006
 unison-0.005
 son-0.005
 Mama-0.005
 amaz-0.005
Token clown
Token ID25573
Feature activation+9.69e-03

Pos logit contributions
iky+0.002
bows+0.002
hy+0.002
rob+0.002
ooky+0.002
Neg logit contributions
 agreement-0.001
 unison-0.001
 amaz-0.001
 bur-0.001
 Deal-0.001
Tokens
Token ID82
Feature activation+5.39e-03

Pos logit contributions
w+0.010
iky+0.010
ane+0.010
ooky+0.010
uch+0.009
Neg logit contributions
 agreement-0.020
 amaz-0.020
 Mama-0.020
 Mum-0.020
 unison-0.020
Token and
Token ID290
Feature activation+1.69e-03

Pos logit contributions
ane+0.007
iky+0.007
bows+0.007
hy+0.007
w+0.007
Neg logit contributions
 agreement-0.009
 unison-0.009
 son-0.009
 Papa-0.008
 Mama-0.008
Token ac
Token ID936
Feature activation+1.07e-02

Pos logit contributions
hy+0.002
bows+0.002
working+0.002
rob+0.002
w+0.002
Neg logit contributions
 agreement-0.003
 unison-0.003
 Deal-0.003
 amaz-0.002
 approval-0.002
Tokenrob
Token ID22609
Feature activation+9.54e-02

Pos logit contributions
t+0.006
iky+0.006
working+0.006
ooky+0.005
ane+0.005
Neg logit contributions
 towels-0.026
 unison-0.025
 Mama-0.025
 Sam-0.025
 water-0.024
Tokenats
Token ID1381
Feature activation+1.02e-02

Pos logit contributions
ane+0.106
t+0.104
ooky+0.102
iky+0.101
working+0.100
Neg logit contributions
 hands-0.177
 unison-0.176
 agreement-0.173
 amaz-0.173
 son-0.172
Token and
Token ID290
Feature activation+2.00e-03

Pos logit contributions
t+0.016
ane+0.016
iky+0.016
bows+0.015
w+0.015
Neg logit contributions
 amaz-0.017
 unison-0.016
 agreement-0.016
 mum-0.015
 Mum-0.015
Token lions
Token ID31420
Feature activation+1.02e-02

Pos logit contributions
hy+0.003
rob+0.003
bows+0.002
working+0.002
w+0.002
Neg logit contributions
 agreement-0.003
 unison-0.003
 Mum-0.003
 mum-0.003
 amaz-0.003
Token and
Token ID290
Feature activation+2.22e-03

Pos logit contributions
bows+0.016
iky+0.015
hy+0.015
ane+0.015
w+0.015
Neg logit contributions
 agreement-0.016
 Mum-0.016
 mum-0.016
 Mama-0.016
 amaz-0.015
Token elephants
Token ID25368
Feature activation+6.30e-03

Pos logit contributions
hy+0.003
bows+0.003
rob+0.003
working+0.003
w+0.003
Neg logit contributions
 agreement-0.004
 Mum-0.004
 Mama-0.004
 mum-0.004
 amaz-0.004
Token elephants
Token ID25368
Feature activation+4.56e-04

Pos logit contributions
 agreement+0.001
 amaz+0.001
 unison+0.000
 approval+0.000
 delight+0.000
Neg logit contributions
hy-0.001
w-0.001
rob-0.001
bows-0.001
whe-0.001
Token,
Token ID11
Feature activation-3.50e-04

Pos logit contributions
w+0.001
iky+0.001
ane+0.001
bows+0.001
hy+0.001
Neg logit contributions
 agreement-0.001
 amaz-0.001
 Mum-0.001
 Mama-0.001
 unison-0.001
Token lions
Token ID31420
Feature activation+4.34e-03

Pos logit contributions
 agreement+0.000
 Mum+0.000
 Mama+0.000
 mum+0.000
 unison+0.000
Neg logit contributions
w-0.001
bows-0.001
rob-0.001
iky-0.001
hy-0.001
Token and
Token ID290
Feature activation-1.64e-03

Pos logit contributions
bows+0.006
w+0.006
iky+0.006
ane+0.006
hy+0.006
Neg logit contributions
 agreement-0.007
 amaz-0.007
 Mum-0.007
 mum-0.007
 Mama-0.007
Token ac
Token ID936
Feature activation+5.32e-03

Pos logit contributions
 agreement+0.003
 Mum+0.003
 Mama+0.002
 mum+0.002
 amaz+0.002
Neg logit contributions
rob-0.002
hy-0.002
w-0.002
bows-0.002
iky-0.002
Tokenrob
Token ID22609
Feature activation+9.43e-02

Pos logit contributions
t+0.004
ooky+0.003
working+0.003
uch+0.003
mod+0.003
Neg logit contributions
 towels-0.012
 water-0.012
 Sam-0.012
 his-0.012
 Mama-0.012
Tokenats
Token ID1381
Feature activation+4.21e-03

Pos logit contributions
t+0.108
ane+0.107
ooky+0.104
iky+0.103
working+0.102
Neg logit contributions
 hands-0.172
 unison-0.170
 agreement-0.168
 dad-0.166
 son-0.165
Token.
Token ID13
Feature activation-2.16e-04

Pos logit contributions
ane+0.007
bows+0.007
iky+0.007
w+0.007
z+0.007
Neg logit contributions
 amaz-0.006
 unison-0.006
 agreement-0.006
 son-0.006
 mum-0.006
Token They
Token ID1119
Feature activation-1.10e-03

Pos logit contributions
 amaz+0.000
 agreement+0.000
 unison+0.000
 son+0.000
 towel+0.000
Neg logit contributions
t-0.000
bows-0.000
iky-0.000
hy-0.000
ooky-0.000
Token are
Token ID389
Feature activation-9.57e-04

Pos logit contributions
 son+0.002
 unison+0.002
 Mama+0.002
 agreement+0.002
 Mum+0.002
Neg logit contributions
bows-0.001
hy-0.001
working-0.001
w-0.001
iky-0.001
Token amazed
Token ID24562
Feature activation-7.89e-03

Pos logit contributions
 unison+0.001
 agreement+0.001
 amaz+0.001
 son+0.001
 towel+0.001
Neg logit contributions
iky-0.001
hy-0.001
bows-0.001
ooky-0.001
whe-0.001
Token clown
Token ID25573
Feature activation+4.47e-03

Pos logit contributions
 unison+0.005
 Mama+0.005
 Mum+0.005
 agreement+0.005
 amaz+0.004
Neg logit contributions
w-0.005
ooky-0.005
iky-0.004
bows-0.004
ane-0.004
Tokens
Token ID82
Feature activation-2.24e-04

Pos logit contributions
ane+0.007
w+0.007
iky+0.007
ooky+0.007
z+0.007
Neg logit contributions
 agreement-0.006
 Mama-0.006
 Mum-0.006
 unison-0.006
 mum-0.006
Token and
Token ID290
Feature activation-2.14e-03

Pos logit contributions
 Mama+0.000
 Papa+0.000
 Mum+0.000
 Molly+0.000
 son+0.000
Neg logit contributions
hy-0.000
iky-0.000
ane-0.000
bows-0.000
w-0.000
Token the
Token ID262
Feature activation-3.22e-03

Pos logit contributions
 Deal+0.003
 agreement+0.003
 unison+0.003
 bur+0.003
 Mum+0.003
Neg logit contributions
hy-0.003
bows-0.003
working-0.003
rob-0.003
w-0.003
Token ac
Token ID936
Feature activation+5.50e-03

Pos logit contributions
 Mama+0.005
 unison+0.005
 Mum+0.005
 agreement+0.005
 Sammy+0.005
Neg logit contributions
w-0.005
bows-0.005
hy-0.005
iky-0.004
rob-0.004
Tokenrob
Token ID22609
Feature activation+9.10e-02

Pos logit contributions
working+0.003
iky+0.003
t+0.003
mod+0.002
ooky+0.002
Neg logit contributions
 towels-0.013
 towel-0.013
 water-0.013
 his-0.012
 Mama-0.012
Tokenats
Token ID1381
Feature activation+2.31e-03

Pos logit contributions
ane+0.093
iky+0.092
ooky+0.091
w+0.090
bows+0.089
Neg logit contributions
 unison-0.171
 son-0.170
 agreement-0.170
 hands-0.169
 dad-0.168
Token and
Token ID290
Feature activation-2.11e-03

Pos logit contributions
t+0.004
ane+0.004
iky+0.004
hy+0.004
w+0.004
Neg logit contributions
 amaz-0.003
 unison-0.003
 Mama-0.003
 mum-0.003
 Mum-0.003
Token the
Token ID262
Feature activation-3.25e-03

Pos logit contributions
 unison+0.003
 agreement+0.003
ete+0.003
 Deal+0.003
 Mum+0.003
Neg logit contributions
hy-0.003
working-0.003
rob-0.003
bows-0.003
grand-0.003
Token lions
Token ID31420
Feature activation+4.33e-03

Pos logit contributions
 Mama+0.005
 Mum+0.005
 unison+0.005
 Sammy+0.005
 agreement+0.005
Neg logit contributions
w-0.005
hy-0.005
bows-0.005
rob-0.005
iky-0.004
Token and
Token ID290
Feature activation-2.00e-03

Pos logit contributions
ane+0.007
bows+0.007
iky+0.007
hy+0.007
w+0.007
Neg logit contributions
 Mum-0.007
 Mama-0.007
 mum-0.006
 amaz-0.006
 agreement-0.006
Token because
Token ID780
Feature activation+9.35e-03

Pos logit contributions
w+0.006
uch+0.006
t+0.005
lder+0.005
z+0.005
Neg logit contributions
 amaz-0.006
 unison-0.006
 fins-0.005
 agreement-0.005
 bur-0.005
Token of
Token ID286
Feature activation+3.44e-03

Pos logit contributions
working+0.014
t+0.013
w+0.012
grand+0.011
iky+0.011
Neg logit contributions
 agreement-0.017
 unison-0.017
 amaz-0.015
 approval-0.014
 Deal-0.013
Token his
Token ID465
Feature activation+1.23e-03

Pos logit contributions
w+0.004
working+0.004
bows+0.004
iky+0.004
rob+0.004
Neg logit contributions
 agreement-0.005
 unison-0.005
 amaz-0.005
 son-0.004
 approval-0.004
Token big
Token ID1263
Feature activation+8.87e-03

Pos logit contributions
working+0.002
iky+0.002
w+0.002
uch+0.002
hy+0.002
Neg logit contributions
 unison-0.002
 agreement-0.002
 amaz-0.001
 Judy-0.001
 Rachel-0.001
Token bus
Token ID1323
Feature activation+1.58e-02

Pos logit contributions
working+0.014
ane+0.013
w+0.013
iky+0.013
hy+0.013
Neg logit contributions
 unison-0.013
 agreement-0.012
 themselves-0.011
 amaz-0.011
 towels-0.011
Tokenhy
Token ID12114
Feature activation+8.99e-02

Pos logit contributions
t+0.028
ane+0.028
hy+0.027
w+0.026
bows+0.026
Neg logit contributions
 agreement-0.024
 amaz-0.024
 unison-0.023
 Mama-0.022
 mum-0.022
Token mustache
Token ID49303
Feature activation+9.89e-03

Pos logit contributions
w+0.157
ane+0.155
iky+0.150
uch+0.149
ooky+0.146
Neg logit contributions
 Mum-0.132
 Mama-0.131
 amaz-0.128
 Dad-0.124
 mum-0.124
Token.
Token ID13
Feature activation+7.14e-03

Pos logit contributions
ane+0.018
w+0.017
t+0.017
working+0.017
z+0.017
Neg logit contributions
 agreement-0.014
 unison-0.014
 amaz-0.014
 Mum-0.013
 Mama-0.013
Token
Token ID198
Feature activation+3.36e-03

Pos logit contributions
ane+0.009
t+0.009
 These+0.009
grand+0.009
hy+0.008
Neg logit contributions
 amaz-0.012
 agreement-0.012
 towel-0.012
 towels-0.012
 fear-0.012
Token
Token ID198
Feature activation+3.88e-03

Pos logit contributions
w+0.006
t+0.006
iky+0.006
z+0.006
ooky+0.006
Neg logit contributions
 amaz-0.004
 son-0.004
 eyes-0.004
 unison-0.004
 fins-0.004
TokenOne
Token ID3198
Feature activation+1.54e-02

Pos logit contributions
w+0.005
iky+0.005
t+0.005
uch+0.005
z+0.005
Neg logit contributions
 amaz-0.007
 eyes-0.007
 son-0.007
 Mama-0.007
 Mum-0.007
Token mom
Token ID1995
Feature activation+9.23e-03

Pos logit contributions
 unison+0.000
 amaz+0.000
 agreement+0.000
 Sammy+0.000
 shallow+0.000
Neg logit contributions
hy-0.000
w-0.000
iky-0.000
bows-0.000
working-0.000
Tokenmy
Token ID1820
Feature activation+1.43e-02

Pos logit contributions
w+0.014
uch+0.014
t+0.013
ane+0.013
bows+0.013
Neg logit contributions
 amaz-0.015
 unison-0.014
 fins-0.014
 agreement-0.014
 excitement-0.013
Token said
Token ID531
Feature activation+1.21e-02

Pos logit contributions
w+0.023
working+0.022
uch+0.021
z+0.021
ane+0.021
Neg logit contributions
 amaz-0.022
 unison-0.022
 agreement-0.021
 fins-0.020
 Mum-0.020
Token,
Token ID11
Feature activation+7.99e-03

Pos logit contributions
t+0.019
w+0.017
hy+0.017
working+0.016
free+0.016
Neg logit contributions
 amaz-0.020
 unison-0.019
 bur-0.019
 agreement-0.018
 towels-0.018
Token "
Token ID366
Feature activation+1.43e-02

Pos logit contributions
t+0.011
ding+0.010
free+0.010
z+0.010
bows+0.009
Neg logit contributions
 unison-0.013
 amaz-0.013
 fins-0.012
 ways-0.012
 agreement-0.012
Tokenwe
Token ID732
Feature activation+7.55e-02

Pos logit contributions
w+0.018
uch+0.016
t+0.016
iky+0.016
ane+0.016
Neg logit contributions
 Mama-0.027
 amaz-0.027
 Mum-0.027
 son-0.027
 mum-0.027
Token'll
Token ID1183
Feature activation+1.27e-02

Pos logit contributions
w+0.117
uch+0.108
ane+0.102
working+0.101
t+0.101
Neg logit contributions
 amaz-0.128
 Mum-0.126
 Mama-0.125
 mum-0.124
 unison-0.121
Token be
Token ID307
Feature activation+9.25e-03

Pos logit contributions
t+0.020
uch+0.020
w+0.019
working+0.018
iky+0.018
Neg logit contributions
 amaz-0.019
 unison-0.018
 son-0.017
 agreement-0.017
 fins-0.017
Token home
Token ID1363
Feature activation-1.24e-03

Pos logit contributions
z+0.013
w+0.012
organized+0.012
uch+0.012
working+0.011
Neg logit contributions
 amaz-0.014
 unison-0.013
 eyes-0.013
 fins-0.012
 Mum-0.012
Token soon
Token ID2582
Feature activation+5.71e-03

Pos logit contributions
 amaz+0.002
 unison+0.002
 agreement+0.002
 fins+0.002
 excitement+0.002
Neg logit contributions
t-0.002
ooky-0.002
uch-0.002
w-0.002
working-0.002
Token."
Token ID526
Feature activation+5.87e-03

Pos logit contributions
bows+0.007
w+0.007
t+0.007
working+0.007
uch+0.007
Neg logit contributions
 amaz-0.011
 unison-0.011
 agreement-0.011
 fins-0.010
 mum-0.010
Token soon
Token ID2582
Feature activation+2.29e-03

Pos logit contributions
iky+0.008
t+0.008
w+0.008
uch+0.008
z+0.008
Neg logit contributions
 son-0.010
 unison-0.010
 Dad-0.009
 agreement-0.009
 Papa-0.009
Token nothing
Token ID2147
Feature activation+1.02e-02

Pos logit contributions
bows+0.003
w+0.003
iky+0.003
ooky+0.003
uch+0.003
Neg logit contributions
 unison-0.004
 son-0.004
 agreement-0.004
 Dad-0.004
 Mama-0.004
Token but
Token ID475
Feature activation+4.38e-03

Pos logit contributions
t+0.015
w+0.015
iky+0.014
ooky+0.014
uch+0.014
Neg logit contributions
 unison-0.017
 agreement-0.017
 amaz-0.017
 Mama-0.017
 Mum-0.017
Token a
Token ID257
Feature activation+4.94e-03

Pos logit contributions
t+0.007
bows+0.007
w+0.007
ane+0.007
uch+0.007
Neg logit contributions
 unison-0.006
 agreement-0.006
 towels-0.006
 son-0.006
 fins-0.005
Token small
Token ID1402
Feature activation+8.24e-03

Pos logit contributions
t+0.006
working+0.006
w+0.006
Bloom+0.006
uch+0.006
Neg logit contributions
 themselves-0.007
 Sam-0.007
 towels-0.007
 unison-0.007
 hands-0.007
Token spe
Token ID693
Feature activation+7.50e-02

Pos logit contributions
working+0.011
ane+0.011
bows+0.011
hy+0.010
t+0.010
Neg logit contributions
 towels-0.012
 Deal-0.012
 themselves-0.012
 unison-0.012
 Mum-0.011
Tokenck
Token ID694
Feature activation+2.70e-02

Pos logit contributions
ane+0.148
ooky+0.147
bows+0.145
rob+0.143
uch+0.142
Neg logit contributions
 agreement-0.086
 unison-0.076
 hand-0.072
 amaz-0.070
 son-0.069
Token in
Token ID287
Feature activation+7.23e-03

Pos logit contributions
w+0.038
ane+0.034
t+0.032
uch+0.031
iky+0.031
Neg logit contributions
 amaz-0.052
 Dad-0.051
 Mum-0.050
 unison-0.050
 agreement-0.049
Token the
Token ID262
Feature activation+1.67e-03

Pos logit contributions
hy+0.011
w+0.010
iky+0.010
Bloom+0.010
mod+0.009
Neg logit contributions
 son-0.010
 dad-0.008
 mum-0.008
 Dad-0.008
 supervision-0.008
Token sky
Token ID6766
Feature activation+3.12e-03

Pos logit contributions
w+0.003
uch+0.002
ane+0.002
iky+0.002
ooky+0.002
Neg logit contributions
 Sam-0.003
 son-0.003
 unison-0.003
 amaz-0.002
 Dad-0.002
Token.
Token ID13
Feature activation+2.32e-03

Pos logit contributions
bows+0.005
t+0.005
ane+0.005
w+0.005
z+0.005
Neg logit contributions
 agreement-0.005
 son-0.005
 unison-0.004
 amaz-0.004
 Deal-0.004
Token had
Token ID550
Feature activation-4.03e-03

Pos logit contributions
 his+0.002
 water+0.001
 Mama+0.001
 little+0.001
 dad+0.001
Neg logit contributions
ane-0.003
ooky-0.003
bows-0.003
iky-0.003
uch-0.003
Token a
Token ID257
Feature activation+3.33e-03

Pos logit contributions
 unison+0.003
 agreement+0.003
 Mama+0.003
 amaz+0.003
 Mum+0.003
Neg logit contributions
w-0.010
iky-0.010
uch-0.010
ooky-0.009
ane-0.009
Token big
Token ID1263
Feature activation-5.64e-04

Pos logit contributions
w+0.006
working+0.005
iky+0.005
uch+0.005
ooky+0.005
Neg logit contributions
 unison-0.005
 amaz-0.005
 agreement-0.005
 Mum-0.004
 Dad-0.004
Token,
Token ID11
Feature activation+1.60e-03

Pos logit contributions
 unison+0.001
 amaz+0.001
 Mum+0.001
 agreement+0.001
 Dad+0.001
Neg logit contributions
w-0.001
working-0.001
uch-0.001
iky-0.001
ooky-0.001
Token bus
Token ID1323
Feature activation+5.82e-03

Pos logit contributions
w+0.003
iky+0.003
uch+0.003
ooky+0.003
ane+0.003
Neg logit contributions
 unison-0.002
 amaz-0.002
 agreement-0.002
 Mama-0.002
 Mum-0.002
Tokenhy
Token ID12114
Feature activation+7.35e-02

Pos logit contributions
w+0.011
uch+0.011
ooky+0.011
iky+0.011
mod+0.010
Neg logit contributions
 dad-0.007
 Dad-0.007
 hand-0.007
 Sam-0.007
 Mum-0.007
Token tail
Token ID7894
Feature activation-1.34e-02

Pos logit contributions
w+0.167
iky+0.164
ooky+0.163
uch+0.162
ane+0.160
Neg logit contributions
 Mama-0.060
 Mum-0.058
 unison-0.058
 agreement-0.058
 amaz-0.057
Token and
Token ID290
Feature activation+1.21e-03

Pos logit contributions
 amaz+0.016
 agreement+0.015
 son+0.015
 Mum+0.015
 Mama+0.015
Neg logit contributions
iky-0.026
w-0.026
ane-0.026
t-0.025
ooky-0.025
Token sharp
Token ID7786
Feature activation+1.96e-03

Pos logit contributions
w+0.002
working+0.002
rob+0.002
hy+0.002
z+0.002
Neg logit contributions
 unison-0.002
 amaz-0.002
 agreement-0.002
 son-0.002
 Mum-0.002
Token teeth
Token ID9941
Feature activation+2.40e-03

Pos logit contributions
w+0.004
iky+0.004
ooky+0.004
working+0.004
uch+0.004
Neg logit contributions
 unison-0.002
 Mum-0.002
 amaz-0.002
 agreement-0.002
 Mama-0.002
Token.
Token ID13
Feature activation+1.32e-03

Pos logit contributions
w+0.005
iky+0.005
ooky+0.005
uch+0.005
ane+0.005
Neg logit contributions
 amaz-0.002
 Mama-0.002
 agreement-0.002
 unison-0.002
 Mum-0.002
Token of
Token ID286
Feature activation-8.34e-03

Pos logit contributions
 agreement+0.014
 unison+0.014
 amaz+0.014
 son+0.013
 Papa+0.013
Neg logit contributions
t-0.013
ane-0.012
iky-0.012
z-0.012
hy-0.012
Token the
Token ID262
Feature activation+1.82e-03

Pos logit contributions
 unison+0.011
 Deal+0.011
 son+0.010
 agreement+0.010
aj+0.010
Neg logit contributions
working-0.010
w-0.010
no-0.010
hy-0.009
whe-0.009
Token frog
Token ID21264
Feature activation+6.84e-03

Pos logit contributions
w+0.003
working+0.003
uch+0.003
ooky+0.003
ane+0.003
Neg logit contributions
 amaz-0.002
 agreement-0.002
 Mum-0.002
 Mia-0.002
 Jill-0.002
Token.
Token ID13
Feature activation+5.52e-03

Pos logit contributions
z+0.010
t+0.009
w+0.009
ane+0.009
uch+0.009
Neg logit contributions
 agreement-0.012
 Mum-0.012
 Dad-0.012
 unison-0.012
 Mama-0.012
Token He
Token ID679
Feature activation-8.54e-04

Pos logit contributions
t+0.007
 These+0.007
ane+0.006
ugg+0.006
hy+0.006
Neg logit contributions
 agreement-0.010
 towels-0.010
 towel-0.010
 amaz-0.010
 unison-0.009
Token spe
Token ID693
Feature activation+7.28e-02

Pos logit contributions
 agreement+0.002
 Mama+0.001
 unison+0.001
 Mum+0.001
 Dad+0.001
Neg logit contributions
w-0.001
z-0.001
uch-0.001
t-0.001
iky-0.001
Tokenared
Token ID1144
Feature activation+2.16e-02

Pos logit contributions
ooky+0.134
ane+0.132
iky+0.131
uch+0.129
t+0.129
Neg logit contributions
 agreement-0.088
 unison-0.086
 amaz-0.084
 son-0.083
 Mama-0.082
Token the
Token ID262
Feature activation+1.28e-03

Pos logit contributions
z+0.033
t+0.032
w+0.032
iky+0.032
hy+0.029
Neg logit contributions
 unison-0.034
 agreement-0.033
 Mama-0.030
 son-0.030
 amaz-0.030
Token lazy
Token ID16931
Feature activation+5.25e-03

Pos logit contributions
w+0.002
uch+0.002
opard+0.002
working+0.002
iky+0.002
Neg logit contributions
 Jill-0.002
 amaz-0.002
 Dad-0.002
 Mama-0.002
 Sam-0.002
Token frog
Token ID21264
Feature activation+6.35e-03

Pos logit contributions
working+0.008
w+0.008
iky+0.008
opard+0.008
hy+0.008
Neg logit contributions
 amaz-0.008
 unison-0.008
 agreement-0.007
 towels-0.007
 Mama-0.007
Token and
Token ID290
Feature activation+1.50e-03

Pos logit contributions
z+0.010
t+0.009
w+0.009
uch+0.009
iky+0.009
Neg logit contributions
 Dad-0.011
 agreement-0.010
 Mama-0.010
 Mum-0.010
 unison-0.010
Token away
Token ID1497
Feature activation+4.02e-03

Pos logit contributions
w+0.002
z+0.002
t+0.002
ane+0.001
iky+0.001
Neg logit contributions
 son-0.002
 unison-0.002
 amaz-0.002
 Dad-0.002
 dad-0.002
Token,
Token ID11
Feature activation+1.90e-04

Pos logit contributions
t+0.006
working+0.006
z+0.006
bows+0.006
iky+0.005
Neg logit contributions
 unison-0.007
 agreement-0.006
 son-0.006
 amaz-0.006
 Mama-0.006
Token a
Token ID257
Feature activation-2.87e-04

Pos logit contributions
z+0.000
hy+0.000
ding+0.000
t+0.000
bows+0.000
Neg logit contributions
 unison-0.000
 towels-0.000
 Mum-0.000
 son-0.000
 agreement-0.000
Token bright
Token ID6016
Feature activation+3.75e-03

Pos logit contributions
 themselves+0.000
 ways+0.000
 Sam+0.000
 Dad+0.000
 towels+0.000
Neg logit contributions
t-0.000
hy-0.000
uch-0.000
iky-0.000
w-0.000
Token blue
Token ID4171
Feature activation+9.55e-03

Pos logit contributions
ane+0.005
hy+0.005
iky+0.005
working+0.005
w+0.005
Neg logit contributions
 Dad-0.005
 towels-0.005
 son-0.005
 themselves-0.005
 Sam-0.005
Token spe
Token ID693
Feature activation+7.26e-02

Pos logit contributions
ane+0.015
w+0.015
uch+0.013
t+0.013
iky+0.013
Neg logit contributions
 Dad-0.014
 son-0.014
 amaz-0.014
 Mum-0.014
 agreement-0.013
Tokenck
Token ID694
Feature activation+1.83e-02

Pos logit contributions
ane+0.126
ooky+0.125
uch+0.122
w+0.122
bows+0.121
Neg logit contributions
 agreement-0.095
 unison-0.089
 amaz-0.086
 Mama-0.085
 mum-0.085
Token in
Token ID287
Feature activation+1.64e-03

Pos logit contributions
w+0.023
ane+0.022
iky+0.020
t+0.019
z+0.019
Neg logit contributions
 amaz-0.036
 Mum-0.036
 Dad-0.036
 Mama-0.036
 agreement-0.035
Token the
Token ID262
Feature activation-1.73e-03

Pos logit contributions
hy+0.002
w+0.002
iky+0.002
 spe+0.002
 ind+0.002
Neg logit contributions
 son-0.002
 mum-0.002
 Carlos-0.002
 supervision-0.002
 dad-0.002
Token sky
Token ID6766
Feature activation-2.76e-03

Pos logit contributions
 son+0.003
 Sam+0.003
 unison+0.003
 Dad+0.003
 amaz+0.003
Neg logit contributions
w-0.002
ane-0.002
uch-0.002
hy-0.002
iky-0.002
Token.
Token ID13
Feature activation-6.29e-04

Pos logit contributions
 agreement+0.004
 son+0.004
 mum+0.004
 Dad+0.004
 dad+0.004
Neg logit contributions
bows-0.004
z-0.004
hy-0.004
t-0.004
ane-0.004
Token flowers
Token ID12734
Feature activation+1.34e-03

Pos logit contributions
iky+0.030
ane+0.030
w+0.029
t+0.029
ooky+0.029
Neg logit contributions
 amaz-0.022
 unison-0.022
 agreement-0.021
 Mama-0.021
 Papa-0.021
Token and
Token ID290
Feature activation+7.26e-03

Pos logit contributions
bows+0.002
w+0.002
ane+0.002
t+0.002
working+0.002
Neg logit contributions
 amaz-0.002
 unison-0.002
 agreement-0.002
 son-0.002
 Mama-0.002
Token birds
Token ID10087
Feature activation+2.27e-02

Pos logit contributions
bows+0.011
working+0.011
w+0.011
hy+0.009
uch+0.009
Neg logit contributions
 unison-0.012
 agreement-0.012
 son-0.011
 amaz-0.010
 Deal-0.010
Token and
Token ID290
Feature activation+5.36e-03

Pos logit contributions
bows+0.037
w+0.037
ane+0.037
z+0.036
t+0.035
Neg logit contributions
 amaz-0.036
 agreement-0.035
 son-0.035
 Mum-0.034
 unison-0.034
Token rain
Token ID6290
Feature activation-5.17e-03

Pos logit contributions
bows+0.008
working+0.008
w+0.007
hy+0.007
rob+0.006
Neg logit contributions
 unison-0.009
 agreement-0.008
 son-0.008
 amaz-0.008
 Mum-0.008
Tokenbows
Token ID25435
Feature activation+7.20e-02

Pos logit contributions
 amaz+0.014
 agreement+0.014
 Mama+0.014
 Mum+0.014
 son+0.014
Neg logit contributions
w-0.002
bows-0.002
iky-0.002
ooky-0.002
uch-0.002
Token.
Token ID13
Feature activation+2.92e-03

Pos logit contributions
bows+0.128
iky+0.124
w+0.123
ooky+0.121
working+0.119
Neg logit contributions
 Mama-0.108
 Mum-0.107
 amaz-0.105
 agreement-0.104
 son-0.104
Token
Token ID198
Feature activation+4.34e-04

Pos logit contributions
 These+0.003
ane+0.003
t+0.003
 Did+0.003
grand+0.003
Neg logit contributions
 amaz-0.005
 towel-0.005
 agreement-0.005
 unison-0.005
 towels-0.005
Token
Token ID198
Feature activation+1.28e-03

Pos logit contributions
w+0.001
iky+0.001
t+0.001
ooky+0.001
ane+0.001
Neg logit contributions
 amaz-0.001
 son-0.001
 unison-0.001
 agreement-0.001
 fins-0.001
TokenOne
Token ID3198
Feature activation+1.52e-02

Pos logit contributions
w+0.002
iky+0.002
uch+0.001
t+0.001
ooky+0.001
Neg logit contributions
 son-0.002
 amaz-0.002
 eyes-0.002
 Mum-0.002
 Mama-0.002
Token day
Token ID1110
Feature activation+1.02e-02

Pos logit contributions
w+0.021
iky+0.020
ooky+0.019
uch+0.019
working+0.019
Neg logit contributions
 unison-0.029
 amaz-0.028
 son-0.027
 agreement-0.027
 towels-0.026
Token air
Token ID1633
Feature activation-5.20e-03

Pos logit contributions
 unison+0.001
 amaz+0.001
 agreement+0.001
 Mum+0.001
 Dad+0.001
Neg logit contributions
w-0.002
working-0.002
ooky-0.002
uch-0.002
bows-0.002
Token and
Token ID290
Feature activation+1.47e-03

Pos logit contributions
 agreement+0.008
 unison+0.008
 amaz+0.008
 son+0.007
 mum+0.007
Neg logit contributions
t-0.008
bows-0.008
working-0.008
iky-0.008
w-0.008
Token looked
Token ID3114
Feature activation+2.96e-03

Pos logit contributions
bows+0.002
working+0.002
w+0.002
whe+0.002
organized+0.002
Neg logit contributions
 unison-0.003
 agreement-0.003
 Deal-0.002
 son-0.002
 towels-0.002
Token for
Token ID329
Feature activation+5.11e-03

Pos logit contributions
ooky+0.004
working+0.004
iky+0.004
w+0.004
ane+0.003
Neg logit contributions
 unison-0.005
 Mama-0.005
 towels-0.005
 amaz-0.005
 son-0.005
Token cob
Token ID22843
Feature activation+1.31e-02

Pos logit contributions
hy+0.005
working+0.005
iky+0.005
iest+0.005
whe+0.005
Neg logit contributions
 unison-0.008
 Deal-0.007
 agreement-0.007
die-0.007
 amaz-0.006
Tokenwe
Token ID732
Feature activation+7.17e-02

Pos logit contributions
w+0.008
iky+0.007
uch+0.007
ooky+0.007
ane+0.007
Neg logit contributions
 Mama-0.032
 Mum-0.032
 mum-0.032
 unison-0.031
 Dad-0.031
Tokenbs
Token ID1443
Feature activation+1.19e-02

Pos logit contributions
w+0.078
working+0.063
ane+0.062
uch+0.062
t+0.061
Neg logit contributions
 amaz-0.169
 agreement-0.163
 Mum-0.158
 unison-0.158
 mum-0.157
Token to
Token ID284
Feature activation+2.01e-03

Pos logit contributions
t+0.019
ane+0.018
w+0.018
bows+0.018
working+0.017
Neg logit contributions
 amaz-0.021
 agreement-0.020
 unison-0.019
 excitement-0.018
 Mum-0.018
Token play
Token ID711
Feature activation+3.30e-03

Pos logit contributions
working+0.002
iky+0.002
w+0.002
bows+0.002
uch+0.002
Neg logit contributions
 son-0.004
 amaz-0.003
 unison-0.003
 Dad-0.003
 agreement-0.003
Token in
Token ID287
Feature activation+7.29e-03

Pos logit contributions
ooky+0.005
iky+0.005
bows+0.005
w+0.005
t+0.005
Neg logit contributions
 amaz-0.006
 son-0.005
 Papa-0.005
 Mama-0.005
 mum-0.005
Token.
Token ID13
Feature activation+2.38e-03

Pos logit contributions
iky+0.010
ooky+0.010
t+0.009
bows+0.009
working+0.009
Neg logit contributions
 son-0.013
 agreement-0.012
 unison-0.011
 amaz-0.011
 mum-0.011
Token for
Token ID329
Feature activation+1.40e-02

Pos logit contributions
t+0.002
w+0.002
bows+0.002
iky+0.002
uch+0.002
Neg logit contributions
 agreement-0.003
 amaz-0.003
 unison-0.003
 confusion-0.002
 excitement-0.002
Token Santa
Token ID8909
Feature activation+2.11e-02

Pos logit contributions
w+0.020
lder+0.017
uch+0.017
working+0.016
qu+0.016
Neg logit contributions
 amaz-0.024
 unison-0.021
 agreement-0.021
 son-0.018
 confusion-0.018
Token and
Token ID290
Feature activation+8.67e-03

Pos logit contributions
working+0.033
w+0.033
t+0.032
uch+0.032
bows+0.031
Neg logit contributions
 amaz-0.037
 agreement-0.036
 unison-0.035
 excitement-0.032
 delight-0.031
Token his
Token ID465
Feature activation+2.79e-03

Pos logit contributions
working+0.014
w+0.013
grand+0.013
lder+0.012
rob+0.012
Neg logit contributions
 unison-0.012
 agreement-0.012
 amaz-0.011
 towels-0.010
 shallow-0.010
Token re
Token ID302
Feature activation+8.58e-03

Pos logit contributions
w+0.005
working+0.005
hy+0.005
uch+0.005
ane+0.004
Neg logit contributions
 unison-0.003
 agreement-0.003
 amaz-0.003
 shallow-0.003
 Jill-0.003
Tokenind
Token ID521
Feature activation+7.17e-02

Pos logit contributions
w+0.005
z+0.004
ane+0.004
iky+0.004
uch+0.004
Neg logit contributions
 amaz-0.023
 Mum-0.022
 Mama-0.022
 unison-0.022
 agreement-0.022
Tokeneer
Token ID28153
Feature activation+1.50e-02

Pos logit contributions
w+0.078
working+0.070
ane+0.068
iky+0.067
ooky+0.064
Neg logit contributions
 amaz-0.162
 unison-0.157
 agreement-0.155
 Mum-0.149
 mum-0.148
Token,"
Token ID553
Feature activation+3.51e-03

Pos logit contributions
w+0.023
working+0.022
t+0.022
ane+0.022
z+0.022
Neg logit contributions
 amaz-0.025
 agreement-0.023
 unison-0.023
 Mum-0.021
 Mama-0.020
Token Anna
Token ID11735
Feature activation-1.59e-03

Pos logit contributions
w+0.006
lder+0.006
grand+0.006
rob+0.005
t+0.005
Neg logit contributions
 amaz-0.005
 unison-0.004
 towels-0.004
 ways-0.004
 agreement-0.004
Token says
Token ID1139
Feature activation-1.20e-03

Pos logit contributions
 amaz+0.002
 raft+0.002
 towels+0.002
 unison+0.002
 mud+0.002
Neg logit contributions
w-0.003
t-0.002
lder-0.002
ane-0.002
iky-0.002
Token to
Token ID284
Feature activation+3.74e-03

Pos logit contributions
 amaz+0.002
 unison+0.002
 bur+0.002
 towels+0.002
 shallow+0.002
Neg logit contributions
t-0.002
z-0.002
working-0.002
ane-0.002
free-0.002
Token great
Token ID1049
Feature activation+3.25e-03

Pos logit contributions
working+0.006
bows+0.006
iky+0.006
uch+0.005
ooky+0.005
Neg logit contributions
 unison-0.006
 towels-0.006
 agreement-0.006
 son-0.006
 towel-0.006
Token at
Token ID379
Feature activation+5.13e-03

Pos logit contributions
working+0.006
bows+0.006
ane+0.005
t+0.005
w+0.005
Neg logit contributions
 unison-0.004
 amaz-0.004
 agreement-0.004
 Mum-0.004
 Mama-0.004
Token making
Token ID1642
Feature activation+8.82e-04

Pos logit contributions
working+0.007
uch+0.006
w+0.006
iky+0.006
t+0.006
Neg logit contributions
 unison-0.009
 son-0.009
 Mum-0.009
 agreement-0.009
 Dad-0.009
Token beautiful
Token ID4950
Feature activation-2.24e-04

Pos logit contributions
bows+0.001
ooky+0.001
w+0.001
qu+0.001
iky+0.001
Neg logit contributions
 agreement-0.001
 unison-0.001
 Mum-0.001
 son-0.001
 Dad-0.001
Token cob
Token ID22843
Feature activation+1.31e-02

Pos logit contributions
 unison+0.000
 Dad+0.000
 Mum+0.000
 agreement+0.000
 son+0.000
Neg logit contributions
bows-0.000
ooky-0.000
w-0.000
working-0.000
iky-0.000
Tokenwe
Token ID732
Feature activation+7.16e-02

Pos logit contributions
w+0.008
iky+0.008
uch+0.007
ooky+0.007
ane+0.007
Neg logit contributions
 Mama-0.032
 Mum-0.032
 amaz-0.032
 mum-0.032
 unison-0.032
Tokenbs
Token ID1443
Feature activation+1.26e-02

Pos logit contributions
w+0.100
uch+0.091
ane+0.091
working+0.089
t+0.085
Neg logit contributions
 amaz-0.136
 Mum-0.135
 agreement-0.135
 mum-0.133
 unison-0.132
Token,
Token ID11
Feature activation+3.77e-03

Pos logit contributions
bows+0.018
ane+0.017
t+0.017
w+0.017
ooky+0.017
Neg logit contributions
 amaz-0.023
 unison-0.023
 agreement-0.023
 Mum-0.022
 son-0.022
Token and
Token ID290
Feature activation+1.76e-03

Pos logit contributions
bows+0.005
ane+0.005
w+0.005
t+0.005
iky+0.005
Neg logit contributions
 unison-0.006
 agreement-0.006
 amaz-0.005
 towels-0.005
 Mum-0.005
Token I
Token ID314
Feature activation-4.99e-03

Pos logit contributions
working+0.002
w+0.002
bows+0.002
t+0.002
free+0.002
Neg logit contributions
 unison-0.003
 agreement-0.003
 towels-0.003
 son-0.003
 towel-0.003
Token am
Token ID716
Feature activation+5.37e-03

Pos logit contributions
 unison+0.010
 Dad+0.010
 Mum+0.010
 Mama+0.010
 son+0.010
Neg logit contributions
w-0.006
t-0.006
uch-0.006
working-0.005
iky-0.005
Token It
Token ID632
Feature activation+9.74e-03

Pos logit contributions
 unison+0.005
 agreement+0.005
 amaz+0.005
 mum+0.004
 son+0.004
Neg logit contributions
t-0.006
hy-0.006
w-0.006
iky-0.006
ooky-0.006
Token was
Token ID373
Feature activation+6.82e-03

Pos logit contributions
w+0.017
iky+0.016
uch+0.016
ooky+0.016
t+0.016
Neg logit contributions
 agreement-0.014
 unison-0.014
 amaz-0.014
 son-0.013
 Mum-0.013
Token covered
Token ID5017
Feature activation+6.47e-04

Pos logit contributions
iky+0.011
hy+0.011
t+0.011
ooky+0.011
working+0.010
Neg logit contributions
 agreement-0.010
 unison-0.010
 son-0.009
 amaz-0.009
 approval-0.009
Token with
Token ID351
Feature activation+3.34e-03

Pos logit contributions
w+0.001
iky+0.001
ooky+0.001
uch+0.001
ane+0.001
Neg logit contributions
 Mama-0.001
 amaz-0.001
 unison-0.001
 Mum-0.001
 agreement-0.001
Token cob
Token ID22843
Feature activation+1.45e-02

Pos logit contributions
hy+0.005
iky+0.005
working+0.005
bows+0.005
w+0.005
Neg logit contributions
 unison-0.005
 agreement-0.005
 mum-0.005
 Mum-0.005
 Pete-0.005
Tokenwe
Token ID732
Feature activation+7.15e-02

Pos logit contributions
w+0.018
uch+0.018
ane+0.017
iky+0.016
t+0.016
Neg logit contributions
 Mama-0.026
 Mum-0.025
 towels-0.025
 eyes-0.025
 Dad-0.025
Tokenbs
Token ID1443
Feature activation+1.28e-02

Pos logit contributions
w+0.093
uch+0.085
t+0.084
iky+0.083
ooky+0.082
Neg logit contributions
 agreement-0.139
 amaz-0.139
 Mum-0.137
 unison-0.136
 mum-0.136
Token and
Token ID290
Feature activation+3.12e-03

Pos logit contributions
t+0.020
bows+0.020
ooky+0.020
iky+0.020
ane+0.019
Neg logit contributions
 agreement-0.022
 amaz-0.020
 mum-0.020
 unison-0.020
 Mum-0.020
Token the
Token ID262
Feature activation+3.11e-03

Pos logit contributions
working+0.004
w+0.004
bows+0.004
hy+0.004
t+0.004
Neg logit contributions
 agreement-0.006
 unison-0.005
 Deal-0.005
 approval-0.005
 amaz-0.005
Token paint
Token ID7521
Feature activation+1.92e-03

Pos logit contributions
w+0.005
iky+0.005
ooky+0.005
uch+0.005
t+0.005
Neg logit contributions
 unison-0.004
 Mama-0.004
 Mum-0.004
 amaz-0.004
 agreement-0.004
Token was
Token ID373
Feature activation+7.83e-03

Pos logit contributions
t+0.003
iky+0.003
ooky+0.003
bows+0.003
w+0.003
Neg logit contributions
 agreement-0.003
 unison-0.003
 mum-0.003
 amaz-0.003
 Mum-0.003
Token The
Token ID383
Feature activation+3.21e-03

Pos logit contributions
t+0.002
ane+0.002
 These+0.002
hy+0.002
 Did+0.002
Neg logit contributions
 agreement-0.003
 amaz-0.003
 towels-0.003
 towel-0.003
 son-0.003
Token eggs
Token ID9653
Feature activation+1.07e-04

Pos logit contributions
w+0.005
uch+0.004
iky+0.004
ane+0.004
working+0.004
Neg logit contributions
 unison-0.005
 amaz-0.005
 Pete-0.005
 Sammy-0.005
 Molly-0.005
Token were
Token ID547
Feature activation-5.54e-03

Pos logit contributions
w+0.000
uch+0.000
t+0.000
ane+0.000
iky+0.000
Neg logit contributions
 unison-0.000
 agreement-0.000
 amaz-0.000
 Mum-0.000
 Mama-0.000
Token blue
Token ID4171
Feature activation+8.78e-03

Pos logit contributions
 unison+0.010
 son+0.009
 amaz+0.009
 agreement+0.009
 Pete+0.008
Neg logit contributions
w-0.008
iky-0.007
uch-0.007
ooky-0.006
t-0.006
Token and
Token ID290
Feature activation-2.06e-03

Pos logit contributions
w+0.017
iky+0.017
ane+0.016
uch+0.016
ooky+0.016
Neg logit contributions
 agreement-0.010
 unison-0.010
 Mama-0.010
 amaz-0.010
 Mum-0.010
Token spe
Token ID693
Feature activation+7.13e-02

Pos logit contributions
 son+0.004
 unison+0.004
 agreement+0.004
 amaz+0.004
 ways+0.003
Neg logit contributions
w-0.003
t-0.002
uch-0.002
iky-0.002
working-0.002
Tokenck
Token ID694
Feature activation+1.71e-02

Pos logit contributions
ane+0.144
ooky+0.143
t+0.139
iky+0.137
bows+0.135
Neg logit contributions
 agreement-0.074
 unison-0.069
 son-0.068
 hands-0.068
 Mama-0.068
Tokenled
Token ID992
Feature activation+1.63e-02

Pos logit contributions
w+0.012
iky+0.009
ane+0.009
t+0.009
ooky+0.008
Neg logit contributions
 amaz-0.045
 agreement-0.045
 unison-0.044
 Mum-0.043
 mum-0.043
Token.
Token ID13
Feature activation+8.21e-04

Pos logit contributions
iky+0.020
w+0.019
bows+0.019
star+0.019
z+0.019
Neg logit contributions
 unison-0.027
 son-0.027
 Mama-0.026
 amaz-0.026
 Mum-0.026
Token They
Token ID1119
Feature activation-7.13e-04

Pos logit contributions
ane+0.001
t+0.001
 These+0.001
bows+0.001
iky+0.001
Neg logit contributions
 agreement-0.001
 amaz-0.001
 fear-0.001
 towel-0.001
 son-0.001
Token looked
Token ID3114
Feature activation-3.42e-03

Pos logit contributions
 agreement+0.001
 son+0.001
 unison+0.001
 amaz+0.001
 Mum+0.001
Neg logit contributions
w-0.001
uch-0.001
iky-0.001
t-0.001
bows-0.001
Token big
Token ID1263
Feature activation+8.71e-03

Pos logit contributions
bows+0.027
working+0.025
hy+0.025
iky+0.025
t+0.023
Neg logit contributions
 agreement-0.031
 amaz-0.031
 unison-0.030
 Deal-0.028
 approval-0.027
Token elephants
Token ID25368
Feature activation+1.24e-02

Pos logit contributions
bows+0.013
iky+0.012
ooky+0.012
working+0.012
w+0.012
Neg logit contributions
 unison-0.014
 agreement-0.014
 Mum-0.013
 amaz-0.013
 mum-0.013
Token and
Token ID290
Feature activation+8.45e-03

Pos logit contributions
bows+0.020
iky+0.019
ane+0.019
t+0.019
hy+0.019
Neg logit contributions
 agreement-0.021
 amaz-0.020
 son-0.019
 unison-0.019
 mum-0.018
Token tall
Token ID7331
Feature activation+1.22e-02

Pos logit contributions
working+0.012
hy+0.012
bows+0.011
Bloom+0.010
iky+0.010
Neg logit contributions
 agreement-0.014
 amaz-0.014
 unison-0.014
 towel-0.014
 son-0.013
Token gir
Token ID37370
Feature activation+2.47e-02

Pos logit contributions
bows+0.018
iky+0.017
hy+0.016
rob+0.016
working+0.016
Neg logit contributions
 agreement-0.020
 amaz-0.020
 Mama-0.020
 Mum-0.020
 unison-0.020
Tokenaff
Token ID2001
Feature activation+7.11e-02

Pos logit contributions
w+0.024
uch+0.023
ooky+0.021
iky+0.021
ane+0.021
Neg logit contributions
 unison-0.052
 Mama-0.052
 Mum-0.052
 amaz-0.051
 son-0.051
Tokenes
Token ID274
Feature activation-1.63e-03

Pos logit contributions
w+0.067
iky+0.063
ane+0.062
ooky+0.059
uch+0.057
Neg logit contributions
 agreement-0.162
 amaz-0.161
 Mum-0.160
 Mama-0.160
 son-0.158
Token.
Token ID13
Feature activation+6.36e-03

Pos logit contributions
 amaz+0.003
 agreement+0.003
 son+0.003
 Mum+0.003
 Mama+0.003
Neg logit contributions
bows-0.002
ane-0.002
t-0.002
iky-0.002
hy-0.002
Token Lily
Token ID20037
Feature activation+3.32e-03

Pos logit contributions
 These+0.008
ane+0.008
t+0.007
hy+0.007
bows+0.007
Neg logit contributions
 amaz-0.011
 towel-0.011
 fear-0.011
 towels-0.011
 agreement-0.011
Token and
Token ID290
Feature activation+4.23e-03

Pos logit contributions
w+0.005
iky+0.005
working+0.005
ooky+0.005
uch+0.005
Neg logit contributions
 agreement-0.005
 unison-0.005
 amaz-0.005
 Mum-0.005
 son-0.005
Token her
Token ID607
Feature activation+1.17e-03

Pos logit contributions
working+0.008
bows+0.008
hy+0.008
w+0.007
rob+0.007
Neg logit contributions
 unison-0.006
 agreement-0.005
 amaz-0.005
 towels-0.005
 towel-0.005
Token the
Token ID262
Feature activation+3.85e-03

Pos logit contributions
working+0.011
t+0.010
w+0.010
uch+0.009
ane+0.009
Neg logit contributions
 unison-0.010
 fins-0.009
 agreement-0.009
 amaz-0.009
 towels-0.009
Token cars
Token ID5006
Feature activation+3.63e-03

Pos logit contributions
w+0.006
working+0.006
uch+0.006
t+0.006
ane+0.006
Neg logit contributions
 amaz-0.006
 agreement-0.006
 unison-0.006
 Mama-0.006
 Molly-0.005
Token stopped
Token ID5025
Feature activation+2.29e-03

Pos logit contributions
t+0.006
w+0.006
ane+0.006
uch+0.005
working+0.005
Neg logit contributions
 agreement-0.006
 unison-0.006
 amaz-0.006
 Mama-0.006
 Mum-0.006
Token and
Token ID290
Feature activation-1.89e-04

Pos logit contributions
working+0.003
w+0.003
z+0.003
ane+0.003
bows+0.003
Neg logit contributions
 unison-0.004
 agreement-0.004
 amaz-0.003
 excitement-0.003
 son-0.003
Token the
Token ID262
Feature activation+2.06e-03

Pos logit contributions
 unison+0.000
 agreement+0.000
 son+0.000
 fins+0.000
 amaz+0.000
Neg logit contributions
bows-0.000
w-0.000
t-0.000
z-0.000
whe-0.000
Token pedestrian
Token ID22382
Feature activation-1.14e-01

Pos logit contributions
hy+0.003
w+0.003
bows+0.003
working+0.003
t+0.003
Neg logit contributions
 amaz-0.003
 Mama-0.003
 agreement-0.003
 Mum-0.003
 Papa-0.003
Token light
Token ID1657
Feature activation+2.84e-03

Pos logit contributions
 dad+0.194
 mum+0.188
 Dad+0.187
 son+0.184
 Mum+0.183
Neg logit contributions
bows-0.167
hy-0.162
iky-0.161
ooky-0.157
t-0.157
Token turned
Token ID2900
Feature activation+1.12e-02

Pos logit contributions
bows+0.005
t+0.005
ane+0.004
iky+0.004
z+0.004
Neg logit contributions
 agreement-0.005
 amaz-0.004
 Mum-0.004
 unison-0.004
 Mama-0.004
Token green
Token ID4077
Feature activation+1.50e-03

Pos logit contributions
t+0.018
bows+0.018
ane+0.017
hy+0.017
iky+0.017
Neg logit contributions
 agreement-0.018
 unison-0.017
 Mum-0.016
 amaz-0.016
 mum-0.016
Token.
Token ID13
Feature activation+1.50e-03

Pos logit contributions
bows+0.003
working+0.002
iky+0.002
ane+0.002
hy+0.002
Neg logit contributions
 agreement-0.002
 mum-0.002
 Mum-0.002
 Mama-0.002
 Papa-0.002
Token Then
Token ID3244
Feature activation-2.55e-03

Pos logit contributions
t+0.002
ugg+0.002
bows+0.002
ane+0.002
 These+0.002
Neg logit contributions
 amaz-0.003
 fins-0.003
 unison-0.002
 agreement-0.002
 towels-0.002
Token
Token ID198
Feature activation+9.14e-04

Pos logit contributions
w+0.031
uch+0.030
iky+0.030
ane+0.029
ooky+0.029
Neg logit contributions
 agreement-0.019
 unison-0.018
 Mama-0.018
 amaz-0.018
 Mum-0.018
Token
Token ID198
Feature activation+1.41e-03

Pos logit contributions
t+0.002
iky+0.002
w+0.001
z+0.001
ane+0.001
Neg logit contributions
 amaz-0.001
 son-0.001
 unison-0.001
 agreement-0.001
 eyes-0.001
TokenSam
Token ID16305
Feature activation-4.15e-03

Pos logit contributions
w+0.002
t+0.002
iky+0.002
z+0.002
uch+0.002
Neg logit contributions
 eyes-0.003
 amaz-0.003
 son-0.003
 hands-0.003
 unison-0.003
Token gasped
Token ID45236
Feature activation-2.47e-02

Pos logit contributions
 unison+0.006
 agreement+0.006
 amaz+0.006
 towels+0.006
 towel+0.006
Neg logit contributions
w-0.006
z-0.006
ane-0.006
iky-0.006
hy-0.006
Token in
Token ID287
Feature activation+4.45e-03

Pos logit contributions
 unison+0.039
 amaz+0.037
 towels+0.036
 agreement+0.036
 towel+0.035
Neg logit contributions
w-0.032
iky-0.032
ane-0.032
z-0.032
working-0.031
Token amaz
Token ID40642
Feature activation-1.08e-01

Pos logit contributions
hy+0.008
w+0.007
iky+0.006
z+0.006
 Leave+0.006
Neg logit contributions
 son-0.005
 towels-0.004
 raft-0.004
 thirst-0.004
 bur-0.004
Tokenement
Token ID972
Feature activation+2.51e-02

Pos logit contributions
 Mama+0.204
 unison+0.202
 agreement+0.200
 Mum+0.199
 son+0.198
Neg logit contributions
w-0.134
iky-0.131
ooky-0.130
ane-0.127
uch-0.123
Token.
Token ID13
Feature activation+2.36e-03

Pos logit contributions
z+0.037
working+0.037
t+0.037
ane+0.036
w+0.036
Neg logit contributions
 agreement-0.040
 unison-0.039
 amaz-0.038
 Mama-0.035
 bur-0.034
Token He
Token ID679
Feature activation-5.91e-04

Pos logit contributions
ane+0.003
t+0.003
hy+0.003
 These+0.003
 Did+0.003
Neg logit contributions
 unison-0.004
 agreement-0.004
 towels-0.004
 towel-0.004
 amaz-0.004
Token had
Token ID550
Feature activation-5.91e-03

Pos logit contributions
 agreement+0.001
 towels+0.001
 unison+0.001
 son+0.001
 ways+0.001
Neg logit contributions
w-0.001
z-0.001
iky-0.001
t-0.001
uch-0.001
Token never
Token ID1239
Feature activation+1.00e-02

Pos logit contributions
 agreement+0.012
 unison+0.012
 son+0.011
 Mama+0.011
 Papa+0.011
Neg logit contributions
w-0.007
t-0.006
uch-0.006
iky-0.006
hy-0.006
Token Her
Token ID2332
Feature activation+1.27e-05

Pos logit contributions
 agreement+0.000
 towels+0.000
 unison+0.000
 towel+0.000
 son+0.000
Neg logit contributions
ane-0.000
bows-0.000
t-0.000
working-0.000
w-0.000
Token best
Token ID1266
Feature activation+8.85e-03

Pos logit contributions
z+0.000
uch+0.000
w+0.000
iky+0.000
working+0.000
Neg logit contributions
 agreement-0.000
 unison-0.000
 Jill-0.000
 Judy-0.000
 amaz-0.000
Token friend
Token ID1545
Feature activation-1.31e-03

Pos logit contributions
w+0.014
ane+0.014
t+0.013
z+0.013
working+0.013
Neg logit contributions
 agreement-0.011
 amaz-0.010
 unison-0.010
 groups-0.010
 shallow-0.010
Token watched
Token ID7342
Feature activation-2.02e-02

Pos logit contributions
 agreement+0.002
 unison+0.002
 amaz+0.002
 towels+0.002
 towel+0.002
Neg logit contributions
w-0.002
ane-0.002
z-0.002
bows-0.002
t-0.002
Token in
Token ID287
Feature activation+1.53e-03

Pos logit contributions
 amaz+0.023
 unison+0.023
 son+0.021
 agreement+0.020
 towels+0.020
Neg logit contributions
t-0.036
z-0.036
working-0.036
iky-0.036
ane-0.036
Token amaz
Token ID40642
Feature activation-1.08e-01

Pos logit contributions
hy+0.003
z+0.002
t+0.002
whe+0.002
iky+0.002
Neg logit contributions
 son-0.002
 fins-0.002
 raft-0.002
die-0.002
 Freddy-0.002
Tokenement
Token ID972
Feature activation+2.27e-02

Pos logit contributions
 Mama+0.204
 agreement+0.201
 unison+0.200
 Mum+0.200
 son+0.200
Neg logit contributions
w-0.130
ooky-0.128
iky-0.128
ane-0.124
uch-0.120
Token as
Token ID355
Feature activation-1.28e-02

Pos logit contributions
working+0.040
bows+0.040
z+0.039
t+0.039
ane+0.039
Neg logit contributions
 agreement-0.030
 amaz-0.029
 unison-0.029
 towel-0.027
 Mama-0.027
Token Amelia
Token ID49769
Feature activation+5.69e-03

Pos logit contributions
 agreement+0.017
 unison+0.017
 towels+0.017
 fins+0.016
 amaz+0.016
Neg logit contributions
working-0.022
ane-0.021
iky-0.021
bows-0.020
t-0.020
Token gigg
Token ID30442
Feature activation-1.22e-02

Pos logit contributions
w+0.009
z+0.009
uch+0.009
working+0.008
ane+0.008
Neg logit contributions
 agreement-0.009
 amaz-0.009
 Mum-0.008
 Mama-0.008
 Dad-0.008
Tokenled
Token ID992
Feature activation+1.44e-02

Pos logit contributions
 unison+0.023
 agreement+0.023
 Mama+0.023
 Mum+0.023
 son+0.023
Neg logit contributions
w-0.014
uch-0.014
iky-0.014
ooky-0.013
ane-0.013
Token YOU
Token ID7013
Feature activation-1.82e-02

Pos logit contributions
 Mum+0.027
 Mama+0.026
 mum+0.026
 unison+0.026
 agreement+0.026
Neg logit contributions
w-0.026
iky-0.024
uch-0.024
ooky-0.024
rob-0.024
Token STEP
Token ID49154
Feature activation-8.23e-02

Pos logit contributions
 son+0.029
 amaz+0.028
 Mama+0.028
 Dad+0.028
 Mum+0.028
Neg logit contributions
w-0.030
uch-0.029
t-0.028
iky-0.028
ane-0.028
Token ON
Token ID6177
Feature activation-5.73e-02

Pos logit contributions
 amaz+0.134
 son+0.129
 agreement+0.129
 Mama+0.128
 unison+0.127
Neg logit contributions
w-0.130
iky-0.126
uch-0.125
ooky-0.123
ane-0.121
Token THE
Token ID3336
Feature activation-2.83e-02

Pos logit contributions
 amaz+0.100
 unison+0.093
 agreement+0.089
 Mum+0.086
 son+0.086
Neg logit contributions
w-0.090
iky-0.085
ooky-0.083
uch-0.082
t-0.082
Token LE
Token ID12509
Feature activation-1.35e-02

Pos logit contributions
 Mama+0.052
 Mum+0.052
 unison+0.052
 amaz+0.052
 agreement+0.052
Neg logit contributions
w-0.036
iky-0.034
uch-0.034
ooky-0.034
ane-0.033
TokenAF
Token ID8579
Feature activation-1.08e-01

Pos logit contributions
 Mama+0.024
 unison+0.024
 Mum+0.024
 agreement+0.023
 amaz+0.023
Neg logit contributions
w-0.018
iky-0.017
ooky-0.017
uch-0.017
ane-0.017
Token?!"
Token ID30823
Feature activation-2.86e-03

Pos logit contributions
 unison+0.131
 amaz+0.131
 agreement+0.116
 towels+0.112
 Mama+0.104
Neg logit contributions
lder-0.189
z-0.188
iky-0.184
working-0.180
w-0.179
Token
Token ID198
Feature activation-2.52e-03

Pos logit contributions
 amaz+0.004
 agreement+0.004
 unison+0.004
 fins+0.004
 towels+0.004
Neg logit contributions
t-0.005
ane-0.005
w-0.004
whe-0.004
hy-0.004
Token
Token ID198
Feature activation-2.27e-03

Pos logit contributions
 amaz+0.003
 son+0.003
 unison+0.003
 agreement+0.003
 eyes+0.003
Neg logit contributions
t-0.005
w-0.005
iky-0.005
uch-0.004
ane-0.004
TokenThe
Token ID464
Feature activation+3.56e-04

Pos logit contributions
 amaz+0.004
 Mama+0.004
 son+0.004
 Mum+0.004
 unison+0.004
Neg logit contributions
w-0.003
iky-0.003
uch-0.003
t-0.003
ooky-0.003
Token embarrassed
Token ID21100
Feature activation-9.68e-03

Pos logit contributions
working+0.001
iky+0.001
ane+0.001
t+0.001
ooky+0.001
Neg logit contributions
 unison-0.000
 amaz-0.000
 towels-0.000
 Mama-0.000
 son-0.000
Token wind
Token ID2344
Feature activation-1.03e-03

Pos logit contributions
 amaz+0.007
 Mum+0.007
 Dad+0.007
 Mama+0.007
 unison+0.007
Neg logit contributions
working-0.007
bows-0.007
ooky-0.007
whe-0.007
ane-0.007
Token.
Token ID13
Feature activation+5.47e-04

Pos logit contributions
 agreement+0.002
 amaz+0.002
 unison+0.002
 Mama+0.002
 Mum+0.002
Neg logit contributions
t-0.002
ane-0.002
w-0.002
iky-0.002
z-0.002
Token Everyone
Token ID11075
Feature activation-4.74e-03

Pos logit contributions
ane+0.001
t+0.001
bows+0.001
ugg+0.001
whe+0.001
Neg logit contributions
 towels-0.001
 agreement-0.001
 son-0.001
 unison-0.001
 amaz-0.001
Token watched
Token ID7342
Feature activation-2.09e-02

Pos logit contributions
 Mama+0.007
 Mum+0.007
 unison+0.007
 agreement+0.007
 mum+0.007
Neg logit contributions
w-0.007
ane-0.007
t-0.007
iky-0.007
z-0.007
Token in
Token ID287
Feature activation+1.51e-03

Pos logit contributions
 unison+0.022
 amaz+0.022
 Mum+0.021
 agreement+0.021
 Mama+0.021
Neg logit contributions
w-0.039
t-0.039
z-0.039
iky-0.039
ane-0.039
Token amaz
Token ID40642
Feature activation-1.07e-01

Pos logit contributions
hy+0.003
z+0.002
whe+0.002
ane+0.002
bows+0.002
Neg logit contributions
 son-0.001
 mum-0.001
 Freddy-0.001
die-0.001
 Mum-0.001
Tokenement
Token ID972
Feature activation+2.19e-02

Pos logit contributions
 Mama+0.205
 agreement+0.201
 Mum+0.200
 son+0.199
 unison+0.199
Neg logit contributions
w-0.128
ooky-0.126
iky-0.125
ane-0.122
uch-0.117
Token.
Token ID13
Feature activation-3.89e-04

Pos logit contributions
ane+0.039
z+0.039
t+0.039
working+0.039
bows+0.039
Neg logit contributions
 agreement-0.028
 unison-0.027
 amaz-0.026
 Mama-0.026
 Mum-0.026
Token
Token ID198
Feature activation-3.00e-03

Pos logit contributions
 son+0.001
 agreement+0.001
 towels+0.001
 towel+0.001
 amaz+0.001
Neg logit contributions
ane-0.001
t-0.001
 These-0.001
bows-0.001
ugg-0.001
Token
Token ID198
Feature activation-2.63e-03

Pos logit contributions
 son+0.004
 amaz+0.004
 unison+0.004
 agreement+0.004
 eyes+0.004
Neg logit contributions
t-0.005
iky-0.005
w-0.005
z-0.005
ooky-0.005
TokenThe
Token ID464
Feature activation-9.65e-04

Pos logit contributions
 son+0.005
 eyes+0.005
 amaz+0.005
 Mama+0.005
 Mum+0.005
Neg logit contributions
w-0.003
t-0.003
iky-0.003
z-0.003
uch-0.003
Token looked
Token ID3114
Feature activation-2.78e-03

Pos logit contributions
 agreement+0.004
 unison+0.004
 Deal+0.004
 ways+0.004
 groups+0.004
Neg logit contributions
w-0.003
hy-0.003
whe-0.003
z-0.003
rob-0.003
Token at
Token ID379
Feature activation+2.14e-05

Pos logit contributions
 unison+0.005
 Mama+0.005
 Papa+0.004
 Mum+0.004
 son+0.004
Neg logit contributions
ooky-0.003
z-0.003
iky-0.003
w-0.003
ding-0.003
Token each
Token ID1123
Feature activation+7.27e-03

Pos logit contributions
iky+0.000
ane+0.000
w+0.000
working+0.000
rob+0.000
Neg logit contributions
 Deal-0.000
 agreement-0.000
 bur-0.000
 unison-0.000
 fins-0.000
Token other
Token ID584
Feature activation+3.31e-03

Pos logit contributions
w+0.014
iky+0.013
working+0.013
t+0.013
ooky+0.013
Neg logit contributions
 Dad-0.008
 unison-0.008
 amaz-0.008
 Mum-0.008
 Mama-0.008
Token in
Token ID287
Feature activation+2.27e-03

Pos logit contributions
z+0.005
bows+0.005
t+0.005
ane+0.005
hy+0.005
Neg logit contributions
 Mama-0.005
 amaz-0.004
 unison-0.004
 Papa-0.004
 bur-0.004
Token amaz
Token ID40642
Feature activation-1.07e-01

Pos logit contributions
hy+0.004
z+0.003
working+0.003
iky+0.003
w+0.003
Neg logit contributions
 son-0.002
 raft-0.002
 bur-0.002
 Sammy-0.002
 mum-0.002
Tokenement
Token ID972
Feature activation+2.38e-02

Pos logit contributions
 Mama+0.223
 agreement+0.216
 Papa+0.215
 Mum+0.214
 son+0.214
Neg logit contributions
ooky-0.112
iky-0.110
w-0.108
ane-0.108
t-0.097
Token.
Token ID13
Feature activation+2.05e-03

Pos logit contributions
working+0.035
z+0.035
bows+0.035
t+0.034
ane+0.034
Neg logit contributions
 agreement-0.039
 amaz-0.036
 unison-0.036
 bur-0.035
 towel-0.034
Token
Token ID198
Feature activation-1.91e-03

Pos logit contributions
 These+0.003
ane+0.003
t+0.002
ugg+0.002
 Did+0.002
Neg logit contributions
 agreement-0.004
 towel-0.003
 fear-0.003
 towels-0.003
 breath-0.003
Token
Token ID198
Feature activation-1.60e-03

Pos logit contributions
 amaz+0.003
 eyes+0.003
 son+0.003
 agreement+0.003
 ways+0.003
Neg logit contributions
t-0.003
z-0.003
iky-0.003
w-0.003
ane-0.003
TokenThey
Token ID2990
Feature activation-9.15e-03

Pos logit contributions
 eyes+0.003
 amaz+0.003
 son+0.003
 Mama+0.003
 hands+0.003
Neg logit contributions
w-0.002
z-0.002
t-0.002
iky-0.002
uch-0.002
Token the
Token ID262
Feature activation-2.09e-03

Pos logit contributions
 agreement+0.004
 unison+0.004
 son+0.004
 groups+0.003
 Deal+0.003
Neg logit contributions
w-0.003
working-0.003
z-0.003
bows-0.003
whe-0.003
Token little
Token ID1310
Feature activation-6.33e-03

Pos logit contributions
 Mum+0.003
 Mama+0.003
 Mia+0.003
 Sam+0.003
 amaz+0.003
Neg logit contributions
w-0.003
t-0.003
ane-0.003
ooky-0.003
working-0.003
Token girl
Token ID2576
Feature activation-1.39e-02

Pos logit contributions
 unison+0.011
 agreement+0.010
 amaz+0.010
 Deal+0.009
 Mum+0.009
Neg logit contributions
working-0.008
bows-0.008
w-0.007
ane-0.007
z-0.007
Token watched
Token ID7342
Feature activation-1.98e-02

Pos logit contributions
 agreement+0.023
 unison+0.022
 amaz+0.021
 Dad+0.021
 Mum+0.020
Neg logit contributions
z-0.021
t-0.018
w-0.018
uch-0.018
working-0.017
Token in
Token ID287
Feature activation+1.45e-03

Pos logit contributions
 unison+0.024
 amaz+0.023
 fins+0.022
 shallow+0.021
 agreement+0.021
Neg logit contributions
free-0.033
whe-0.032
working-0.032
z-0.032
t-0.031
Token amaz
Token ID40642
Feature activation-1.07e-01

Pos logit contributions
hy+0.002
bows+0.002
z+0.002
whe+0.002
free+0.002
Neg logit contributions
 son-0.001
 raft-0.001
die-0.001
 fins-0.001
 Freddy-0.001
Tokenement
Token ID972
Feature activation+2.29e-02

Pos logit contributions
 Mama+0.208
 agreement+0.205
 son+0.204
 unison+0.204
 Mum+0.203
Neg logit contributions
ooky-0.121
iky-0.119
w-0.119
ane-0.115
uch-0.109
Token.
Token ID13
Feature activation-1.92e-03

Pos logit contributions
working+0.038
bows+0.037
z+0.037
iky+0.036
t+0.036
Neg logit contributions
 agreement-0.032
 amaz-0.031
 unison-0.031
 Papa-0.028
 Mama-0.028
Token Mama
Token ID40084
Feature activation+7.94e-03

Pos logit contributions
 agreement+0.004
 towels+0.004
 unison+0.004
 towel+0.004
 fins+0.003
Neg logit contributions
 These-0.002
ugg-0.002
 Did-0.002
ane-0.002
t-0.002
Token's
Token ID338
Feature activation-9.18e-03

Pos logit contributions
z+0.013
w+0.012
ane+0.012
t+0.012
uch+0.011
Neg logit contributions
 agreement-0.012
 unison-0.012
 amaz-0.011
 ways-0.010
 towels-0.010
Token fingers
Token ID9353
Feature activation-8.85e-03

Pos logit contributions
 Sam+0.008
 his+0.008
 Jill+0.008
 Freddy+0.008
 shovel+0.008
Neg logit contributions
hy-0.017
z-0.016
lder-0.016
w-0.016
bows-0.016
Token watched
Token ID7342
Feature activation-2.04e-02

Pos logit contributions
w+0.011
ane+0.010
t+0.010
uch+0.010
z+0.010
Neg logit contributions
 agreement-0.011
 Mama-0.011
 Mum-0.011
 son-0.011
 Dad-0.011
Token the
Token ID262
Feature activation-1.42e-03

Pos logit contributions
 unison+0.022
 bur+0.021
 agreement+0.021
 amaz+0.021
 towels+0.020
Neg logit contributions
z-0.035
ane-0.034
whe-0.033
working-0.033
w-0.033
Token k
Token ID479
Feature activation-1.43e-02

Pos logit contributions
 Mama+0.002
 Mum+0.002
 Dad+0.002
 Papa+0.002
 son+0.002
Neg logit contributions
iky-0.002
ane-0.002
w-0.002
ooky-0.002
uch-0.002
Tokenite
Token ID578
Feature activation-4.28e-03

Pos logit contributions
 unison+0.029
 Mum+0.028
 Mama+0.028
 agreement+0.028
 mum+0.028
Neg logit contributions
ooky-0.014
w-0.014
iky-0.014
working-0.013
uch-0.013
Token in
Token ID287
Feature activation+1.07e-03

Pos logit contributions
 Dad+0.007
 son+0.007
 Mum+0.007
 Papa+0.007
 Mama+0.007
Neg logit contributions
z-0.006
ane-0.006
bows-0.006
iky-0.006
working-0.006
Token amaz
Token ID40642
Feature activation-1.06e-01

Pos logit contributions
hy+0.001
bows+0.001
z+0.001
w+0.001
he+0.001
Neg logit contributions
 son-0.001
 potatoes-0.001
aj-0.001
 bur-0.001
 Indians-0.001
Tokenement
Token ID972
Feature activation+2.24e-02

Pos logit contributions
 Mama+0.221
 Papa+0.215
 son+0.215
 agreement+0.214
 Mum+0.213
Neg logit contributions
ooky-0.114
iky-0.113
ane-0.111
w-0.109
uch-0.098
Token,
Token ID11
Feature activation-6.98e-04

Pos logit contributions
ane+0.038
bows+0.038
z+0.037
t+0.037
working+0.037
Neg logit contributions
 agreement-0.032
 amaz-0.029
 Mama-0.029
 Papa-0.029
 unison-0.029
Token and
Token ID290
Feature activation-2.75e-03

Pos logit contributions
 towels+0.001
 unison+0.001
 son+0.001
 agreement+0.001
 Mama+0.001
Neg logit contributions
bows-0.001
z-0.001
ding-0.001
hy-0.001
whe-0.001
Token Little
Token ID7703
Feature activation-7.64e-03

Pos logit contributions
 unison+0.004
 agreement+0.004
 towels+0.004
 Deal+0.004
 towel+0.004
Neg logit contributions
z-0.004
working-0.004
stru-0.004
w-0.004
bows-0.004
Token Joe
Token ID5689
Feature activation-6.75e-03

Pos logit contributions
 agreement+0.007
 amaz+0.007
 unison+0.007
 Mama+0.006
 mum+0.006
Neg logit contributions
w-0.017
ane-0.016
uch-0.016
bows-0.016
iky-0.016
Token
Token ID198
Feature activation-3.92e-04

Pos logit contributions
 amaz+0.002
 eyes+0.002
 son+0.002
 agreement+0.002
 unison+0.002
Neg logit contributions
t-0.002
w-0.002
z-0.002
iky-0.002
wl-0.002
TokenL
Token ID43
Feature activation+5.65e-03

Pos logit contributions
 eyes+0.001
 amaz+0.001
 son+0.001
 Mama+0.001
 hands+0.001
Neg logit contributions
w-0.000
t-0.000
z-0.000
iky-0.000
uch-0.000
Tokenucky
Token ID5309
Feature activation+3.11e-03

Pos logit contributions
w+0.007
iky+0.007
ooky+0.006
ane+0.006
t+0.006
Neg logit contributions
 amaz-0.012
 Mama-0.011
 mum-0.011
 Mum-0.011
 Dad-0.011
Token watched
Token ID7342
Feature activation-1.99e-02

Pos logit contributions
w+0.004
lder+0.004
z+0.004
uch+0.004
t+0.004
Neg logit contributions
 amaz-0.005
 unison-0.005
 agreement-0.005
 towels-0.005
 Dad-0.005
Token in
Token ID287
Feature activation+4.29e-03

Pos logit contributions
 unison+0.028
 amaz+0.028
 towels+0.024
 son+0.022
 shovel+0.022
Neg logit contributions
t-0.029
whe-0.028
working-0.028
free-0.028
z-0.028
Token amaz
Token ID40642
Feature activation-1.06e-01

Pos logit contributions
hy+0.007
 reader+0.006
working+0.006
whe+0.006
uggle+0.005
Neg logit contributions
 son-0.004
 unison-0.004
 towels-0.004
die-0.004
 Papa-0.004
Tokenement
Token ID972
Feature activation+2.58e-02

Pos logit contributions
 Mama+0.204
 agreement+0.201
 unison+0.200
 Mum+0.199
 son+0.199
Neg logit contributions
w-0.127
iky-0.125
ooky-0.125
ane-0.120
uch-0.116
Token as
Token ID355
Feature activation-1.06e-02

Pos logit contributions
working+0.049
t+0.048
z+0.048
iky+0.047
ane+0.047
Neg logit contributions
 agreement-0.031
 unison-0.029
 amaz-0.029
 towel-0.027
 towels-0.027
Token the
Token ID262
Feature activation+1.81e-03

Pos logit contributions
 unison+0.015
 agreement+0.014
 towels+0.014
 amaz+0.014
 hand+0.013
Neg logit contributions
working-0.018
iky-0.017
t-0.016
hy-0.016
w-0.016
Token frog
Token ID21264
Feature activation+6.32e-03

Pos logit contributions
w+0.003
iky+0.003
t+0.003
working+0.003
uch+0.003
Neg logit contributions
 unison-0.003
 amaz-0.003
 Mama-0.002
 son-0.002
 Papa-0.002
Token zoom
Token ID19792
Feature activation+1.31e-02

Pos logit contributions
w+0.008
z+0.008
working+0.008
ane+0.008
t+0.008
Neg logit contributions
 agreement-0.011
 Dad-0.011
 Mum-0.010
 unison-0.010
 towels-0.010
TokenWow
Token ID22017
Feature activation+1.11e-02

Pos logit contributions
w+0.004
uch+0.004
iky+0.004
t+0.004
z+0.004
Neg logit contributions
 son-0.007
 Mama-0.007
 mum-0.007
 eyes-0.007
 Mum-0.007
Token,"
Token ID553
Feature activation-9.65e-04

Pos logit contributions
w+0.019
uch+0.018
ane+0.017
iky+0.017
ooky+0.017
Neg logit contributions
 unison-0.016
 agreement-0.016
 amaz-0.016
 Mama-0.015
 Mum-0.015
Token Emily
Token ID17608
Feature activation-1.98e-02

Pos logit contributions
 unison+0.002
 towels+0.001
 agreement+0.001
 amaz+0.001
 fins+0.001
Neg logit contributions
w-0.001
working-0.001
t-0.001
Bloom-0.001
wl-0.001
Token gasped
Token ID45236
Feature activation-2.44e-02

Pos logit contributions
 agreement+0.030
 unison+0.029
 towels+0.028
 amaz+0.028
 ways+0.027
Neg logit contributions
t-0.030
z-0.029
w-0.029
lder-0.028
hy-0.027
Token in
Token ID287
Feature activation+1.98e-03

Pos logit contributions
 unison+0.032
 amaz+0.031
 towels+0.030
 agreement+0.030
 Mama+0.029
Neg logit contributions
iky-0.040
w-0.040
working-0.039
ooky-0.039
ane-0.039
Token amaz
Token ID40642
Feature activation-1.06e-01

Pos logit contributions
hy+0.004
w+0.003
lder+0.003
iky+0.003
Bloom+0.003
Neg logit contributions
 son-0.002
 towels-0.002
 raft-0.002
 bur-0.002
 towel-0.002
Tokenement
Token ID972
Feature activation+2.32e-02

Pos logit contributions
 Mama+0.202
 unison+0.198
 Mum+0.197
 agreement+0.197
 son+0.196
Neg logit contributions
w-0.128
iky-0.125
ooky-0.125
ane-0.122
uch-0.117
Token.
Token ID13
Feature activation-2.60e-03

Pos logit contributions
t+0.038
z+0.037
working+0.037
w+0.037
iky+0.036
Neg logit contributions
 agreement-0.033
 amaz-0.033
 unison-0.033
 towels-0.031
 Mama-0.030
Token She
Token ID1375
Feature activation+2.93e-03

Pos logit contributions
 towels+0.004
 unison+0.004
 agreement+0.004
 towel+0.004
 fins+0.004
Neg logit contributions
t-0.004
ane-0.004
 Did-0.003
lder-0.003
bows-0.003
Token realized
Token ID6939
Feature activation-4.26e-03

Pos logit contributions
z+0.004
w+0.004
t+0.004
ve+0.003
uch+0.003
Neg logit contributions
 agreement-0.005
 towels-0.005
 ways-0.005
 Mama-0.005
 son-0.005
Token that
Token ID326
Feature activation-4.98e-03

Pos logit contributions
 unison+0.010
 agreement+0.010
 fins+0.010
 towels+0.009
 hand+0.009
Neg logit contributions
ane-0.003
t-0.003
w-0.002
working-0.002
ize-0.002
Token the
Token ID262
Feature activation-3.33e-04

Pos logit contributions
 unison+0.005
 agreement+0.004
 Deal+0.004
 son+0.004
 thirst+0.004
Neg logit contributions
z-0.006
ane-0.005
working-0.005
iky-0.005
w-0.005
Token zipper
Token ID48992
Feature activation-9.09e-04

Pos logit contributions
 unison+0.000
 Mama+0.000
 agreement+0.000
 Mum+0.000
 son+0.000
Neg logit contributions
w-0.001
iky-0.001
ooky-0.001
ane-0.001
uch-0.001
Token and
Token ID290
Feature activation-2.62e-03

Pos logit contributions
 unison+0.001
 agreement+0.001
 amaz+0.001
 Mama+0.001
 Mum+0.001
Neg logit contributions
ane-0.002
t-0.002
w-0.002
z-0.002
iky-0.002
Token gasped
Token ID45236
Feature activation-2.45e-02

Pos logit contributions
 unison+0.005
 agreement+0.004
 ways+0.004
 Deal+0.004
 amaz+0.004
Neg logit contributions
w-0.004
z-0.004
working-0.004
t-0.004
ane-0.004
Token with
Token ID351
Feature activation-3.83e-03

Pos logit contributions
 agreement+0.034
 unison+0.034
 towels+0.033
 Mama+0.032
 amaz+0.032
Neg logit contributions
z-0.036
iky-0.036
working-0.036
ooky-0.036
ane-0.035
Token amaz
Token ID40642
Feature activation-1.06e-01

Pos logit contributions
 bur+0.004
 son+0.004
 towel+0.003
 towels+0.003
 unison+0.003
Neg logit contributions
hy-0.007
whe-0.006
w-0.005
iky-0.005
mod-0.005
Tokenement
Token ID972
Feature activation+2.39e-02

Pos logit contributions
 Mama+0.187
 Mum+0.185
 unison+0.185
 agreement+0.184
 amaz+0.184
Neg logit contributions
w-0.141
iky-0.137
ooky-0.135
uch-0.134
ane-0.133
Token at
Token ID379
Feature activation-3.06e-04

Pos logit contributions
working+0.041
z+0.039
ane+0.039
iky+0.039
t+0.039
Neg logit contributions
 agreement-0.035
 unison-0.032
 amaz-0.031
 Mama-0.030
 towel-0.029
Token all
Token ID477
Feature activation+6.91e-03

Pos logit contributions
 agreement+0.000
 unison+0.000
 Deal+0.000
 towel+0.000
 towels+0.000
Neg logit contributions
ane-0.000
working-0.000
or-0.000
z-0.000
w-0.000
Token the
Token ID262
Feature activation-1.42e-03

Pos logit contributions
ane+0.012
iky+0.012
working+0.012
ooky+0.012
t+0.012
Neg logit contributions
 unison-0.009
 bur-0.007
 fins-0.007
 thirst-0.007
 agreement-0.007
Token treasures
Token ID29561
Feature activation-1.22e-02

Pos logit contributions
 unison+0.002
 Mama+0.002
 Mum+0.002
 Dad+0.002
 agreement+0.002
Neg logit contributions
w-0.002
iky-0.002
ooky-0.002
t-0.002
rob-0.002
Token baseball
Token ID9283
Feature activation-7.77e-04

Pos logit contributions
working+0.000
ane+0.000
bows+0.000
iky+0.000
w+0.000
Neg logit contributions
 unison-0.000
 agreement-0.000
 Papa-0.000
 amaz-0.000
 Mama-0.000
Token!
Token ID0
Feature activation-5.17e-03

Pos logit contributions
 amaz+0.001
 agreement+0.001
 unison+0.001
 Mama+0.001
 Mum+0.001
Neg logit contributions
ane-0.001
t-0.001
w-0.001
working-0.001
iky-0.001
Token He
Token ID679
Feature activation-2.42e-03

Pos logit contributions
 amaz+0.009
 agreement+0.009
 towels+0.009
 towel+0.009
 mud+0.008
Neg logit contributions
ane-0.007
t-0.007
 These-0.007
grand-0.007
hy-0.006
Token gasped
Token ID45236
Feature activation-2.51e-02

Pos logit contributions
 agreement+0.004
 Mama+0.004
 amaz+0.004
 towels+0.004
 unison+0.004
Neg logit contributions
w-0.003
t-0.003
z-0.003
iky-0.003
working-0.003
Token in
Token ID287
Feature activation+2.60e-03

Pos logit contributions
 unison+0.040
 towels+0.038
 agreement+0.038
 amaz+0.038
 Mama+0.038
Neg logit contributions
iky-0.034
working-0.034
ane-0.034
w-0.034
z-0.033
Token amaz
Token ID40642
Feature activation-1.06e-01

Pos logit contributions
hy+0.005
w+0.004
working+0.004
z+0.004
lder+0.004
Neg logit contributions
 son-0.002
 raft-0.002
 towels-0.002
 Bunny-0.002
 thirst-0.002
Tokenement
Token ID972
Feature activation+2.60e-02

Pos logit contributions
 Mama+0.206
 unison+0.201
 agreement+0.201
 Mum+0.201
 son+0.199
Neg logit contributions
w-0.124
ooky-0.123
iky-0.122
ane-0.120
uch-0.113
Token and
Token ID290
Feature activation-6.45e-04

Pos logit contributions
working+0.044
ane+0.043
z+0.043
t+0.042
w+0.041
Neg logit contributions
 agreement-0.037
 unison-0.035
 amaz-0.034
 Mama-0.034
 towel-0.033
Token jumped
Token ID11687
Feature activation-1.72e-02

Pos logit contributions
 unison+0.001
 towels+0.001
 towel+0.001
 agreement+0.001
 bowls+0.001
Neg logit contributions
hy-0.001
working-0.001
w-0.001
t-0.001
z-0.001
Token up
Token ID510
Feature activation+4.27e-03

Pos logit contributions
 unison+0.030
 amaz+0.029
 son+0.029
 Mama+0.028
 agreement+0.027
Neg logit contributions
t-0.023
z-0.023
iky-0.022
ane-0.021
ooky-0.021
Token and
Token ID290
Feature activation-1.02e-03

Pos logit contributions
z+0.006
working+0.006
t+0.005
hy+0.005
iky+0.005
Neg logit contributions
 unison-0.007
 son-0.007
 agreement-0.007
 Mama-0.007
 Papa-0.007
Token as
Token ID355
Feature activation-9.03e-03

Pos logit contributions
 amaz+0.017
 unison+0.017
 son+0.017
 Mama+0.016
 Dad+0.016
Neg logit contributions
z-0.013
t-0.012
iky-0.012
ane-0.012
w-0.012
Token the
Token ID262
Feature activation+4.07e-03

Pos logit contributions
 unison+0.016
 agreement+0.015
 amaz+0.015
 towels+0.015
 son+0.014
Neg logit contributions
w-0.013
working-0.013
t-0.013
iky-0.013
ane-0.012
Token woman
Token ID2415
Feature activation+3.93e-03

Pos logit contributions
w+0.007
working+0.007
hy+0.006
t+0.006
iky+0.006
Neg logit contributions
 amaz-0.006
 unison-0.006
 Dad-0.005
 their-0.005
 Mum-0.005
Token watched
Token ID7342
Feature activation-1.94e-02

Pos logit contributions
z+0.006
t+0.006
ane+0.006
w+0.005
hy+0.005
Neg logit contributions
 agreement-0.006
 unison-0.006
 amaz-0.006
 Dad-0.006
 Mama-0.006
Token in
Token ID287
Feature activation+5.40e-03

Pos logit contributions
 unison+0.026
 amaz+0.025
 towels+0.023
 son+0.022
 towel+0.022
Neg logit contributions
working-0.030
free-0.030
z-0.030
whe-0.030
hy-0.030
Token amaz
Token ID40642
Feature activation-1.06e-01

Pos logit contributions
hy+0.008
z+0.007
whe+0.007
free+0.006
iky+0.006
Neg logit contributions
 son-0.006
 Papa-0.005
die-0.005
 Freddy-0.005
 shovel-0.005
Tokenement
Token ID972
Feature activation+2.52e-02

Pos logit contributions
 Mama+0.218
 agreement+0.213
 son+0.212
 unison+0.211
 Papa+0.210
Neg logit contributions
ooky-0.114
iky-0.114
w-0.112
ane-0.111
uch-0.099
Token.
Token ID13
Feature activation+1.81e-03

Pos logit contributions
working+0.044
z+0.044
ane+0.043
t+0.043
bows+0.042
Neg logit contributions
 agreement-0.034
 unison-0.033
 amaz-0.032
 Mama-0.030
 towel-0.030
Token She
Token ID1375
Feature activation+6.49e-03

Pos logit contributions
ane+0.002
t+0.002
bows+0.002
hy+0.002
z+0.002
Neg logit contributions
 towels-0.004
 towel-0.003
 agreement-0.003
 unison-0.003
 son-0.003
Token saw
Token ID2497
Feature activation-5.30e-03

Pos logit contributions
z+0.008
w+0.008
t+0.008
ane+0.007
iky+0.007
Neg logit contributions
 towels-0.011
 agreement-0.011
 son-0.011
 Dad-0.011
 unison-0.011
Token the
Token ID262
Feature activation+2.53e-03

Pos logit contributions
 unison+0.007
 agreement+0.007
 towel+0.006
 towels+0.006
 Deal+0.006
Neg logit contributions
w-0.009
ane-0.009
bows-0.009
hy-0.009
working-0.009
Tokenered
Token ID1068
Feature activation+1.21e-02

Pos logit contributions
 Mama+0.006
 unison+0.006
 Mum+0.006
 agreement+0.006
 Dad+0.006
Neg logit contributions
w-0.005
uch-0.005
iky-0.005
z-0.005
ooky-0.005
Token in
Token ID287
Feature activation+6.20e-03

Pos logit contributions
iky+0.011
ooky+0.011
w+0.011
working+0.011
t+0.011
Neg logit contributions
 unison-0.025
 agreement-0.023
 son-0.022
 amaz-0.022
 towel-0.021
Token,
Token ID11
Feature activation+5.64e-03

Pos logit contributions
hy+0.007
z+0.006
looking+0.006
t+0.006
iky+0.006
Neg logit contributions
 son-0.009
 unison-0.008
 thirst-0.008
 Indians-0.008
 crew-0.007
Token blinking
Token ID43196
Feature activation-9.66e-03

Pos logit contributions
t+0.008
z+0.007
ding+0.007
working+0.007
w+0.007
Neg logit contributions
 unison-0.010
 agreement-0.009
 amaz-0.009
 towels-0.008
 son-0.008
Token in
Token ID287
Feature activation+5.25e-03

Pos logit contributions
 unison+0.016
 agreement+0.016
 Deal+0.014
 son+0.014
 towels+0.014
Neg logit contributions
iky-0.013
w-0.013
z-0.013
ing-0.012
ooky-0.012
Token amaz
Token ID40642
Feature activation-1.06e-01

Pos logit contributions
hy+0.007
iky+0.005
grown+0.005
bones+0.005
w+0.005
Neg logit contributions
 son-0.008
 Deal-0.006
 raft-0.006
 Indians-0.006
 dads-0.006
Tokenement
Token ID972
Feature activation+2.89e-02

Pos logit contributions
 Mama+0.225
 agreement+0.222
 unison+0.221
 son+0.218
 Papa+0.216
Neg logit contributions
ooky-0.106
ane-0.104
w-0.104
iky-0.104
t-0.090
Token.
Token ID13
Feature activation+4.50e-03

Pos logit contributions
working+0.044
z+0.043
t+0.043
ane+0.042
iky+0.042
Neg logit contributions
 agreement-0.048
 unison-0.045
 amaz-0.042
 towel-0.042
 towels-0.040
Token
Token ID198
Feature activation+5.24e-04

Pos logit contributions
t+0.006
ane+0.005
 These+0.005
 Did+0.005
hy+0.005
Neg logit contributions
 agreement-0.008
 towels-0.008
 towel-0.008
 unison-0.008
 amaz-0.007
Token
Token ID198
Feature activation+6.27e-04

Pos logit contributions
t+0.001
z+0.001
iky+0.001
w+0.001
ane+0.001
Neg logit contributions
 son-0.001
 amaz-0.001
 agreement-0.001
 eyes-0.001
 unison-0.001
TokenBut
Token ID1537
Feature activation+6.15e-03

Pos logit contributions
t+0.001
w+0.001
z+0.001
iky+0.001
ane+0.001
Neg logit contributions
 eyes-0.001
 son-0.001
 amaz-0.001
 hands-0.001
 Mama-0.001
Token.
Token ID13
Feature activation+3.58e-03

Pos logit contributions
t+0.002
w+0.002
working+0.002
ooky+0.002
uch+0.002
Neg logit contributions
 amaz-0.002
 agreement-0.002
 unison-0.002
 son-0.002
 Mum-0.002
Token
Token ID198
Feature activation+8.18e-04

Pos logit contributions
 These+0.004
ane+0.004
t+0.004
 Did+0.004
grand+0.004
Neg logit contributions
 towels-0.007
 towel-0.007
 amaz-0.006
 agreement-0.006
 unison-0.006
Token
Token ID198
Feature activation+1.67e-03

Pos logit contributions
t+0.001
w+0.001
z+0.001
iky+0.001
wl+0.001
Neg logit contributions
 amaz-0.001
 eyes-0.001
 son-0.001
 fins-0.001
 towels-0.001
TokenTo
Token ID2514
Feature activation-1.66e-02

Pos logit contributions
w+0.002
t+0.002
z+0.002
iky+0.002
uch+0.002
Neg logit contributions
 eyes-0.003
 son-0.003
 amaz-0.003
 towels-0.003
 hands-0.003
Token her
Token ID607
Feature activation+1.54e-03

Pos logit contributions
 amaz+0.020
 fins+0.019
 unison+0.019
 towels+0.019
 son+0.018
Neg logit contributions
w-0.028
t-0.027
working-0.026
or-0.026
ane-0.026
Token amaz
Token ID40642
Feature activation-1.06e-01

Pos logit contributions
working+0.002
uggle+0.002
 reader+0.002
 lawyer+0.002
ane+0.002
Neg logit contributions
 theirs-0.002
 agreement-0.002
 towels-0.002
 itself-0.002
 Jill-0.002
Tokenement
Token ID972
Feature activation+2.80e-02

Pos logit contributions
 Mama+0.213
 agreement+0.210
 unison+0.208
 Papa+0.208
 son+0.206
Neg logit contributions
ane-0.120
ooky-0.117
w-0.116
iky-0.116
uch-0.109
Token,
Token ID11
Feature activation+3.58e-03

Pos logit contributions
ane+0.040
t+0.038
working+0.038
z+0.037
bows+0.036
Neg logit contributions
 agreement-0.050
 unison-0.048
 amaz-0.047
 towels-0.045
 shallow-0.043
Token the
Token ID262
Feature activation+5.36e-03

Pos logit contributions
working+0.004
bows+0.004
hy+0.004
w+0.004
t+0.004
Neg logit contributions
 unison-0.006
 towels-0.006
 agreement-0.006
 fins-0.005
 towel-0.005
Token clothes
Token ID8242
Feature activation-5.91e-03

Pos logit contributions
working+0.009
ooky+0.009
w+0.009
t+0.009
iky+0.009
Neg logit contributions
 Sam-0.007
 amaz-0.007
 agreement-0.007
 unison-0.007
 Mum-0.006
Token came
Token ID1625
Feature activation+6.98e-03

Pos logit contributions
 agreement+0.009
 unison+0.009
 Mama+0.009
 son+0.009
 amaz+0.009
Neg logit contributions
t-0.009
w-0.009
ane-0.009
uch-0.008
iky-0.008
Token The
Token ID383
Feature activation+1.98e-03

Pos logit contributions
 towel+0.002
 agreement+0.001
 towels+0.001
 amaz+0.001
 fear+0.001
Neg logit contributions
t-0.001
 These-0.001
ane-0.001
whe-0.001
hy-0.001
Token little
Token ID1310
Feature activation-5.88e-03

Pos logit contributions
working+0.003
ane+0.003
w+0.003
ooky+0.003
iky+0.003
Neg logit contributions
 Mama-0.003
 unison-0.002
 bur-0.002
 Mum-0.002
 amaz-0.002
Token boy
Token ID2933
Feature activation-8.01e-03

Pos logit contributions
 unison+0.009
 Mama+0.009
 amaz+0.009
 agreement+0.008
 Mum+0.008
Neg logit contributions
working-0.009
ane-0.009
w-0.008
bows-0.008
rob-0.008
Token watched
Token ID7342
Feature activation-2.07e-02

Pos logit contributions
 agreement+0.013
 unison+0.013
 Mama+0.013
 amaz+0.012
 Mum+0.012
Neg logit contributions
w-0.011
z-0.011
t-0.011
uch-0.011
iky-0.011
Token in
Token ID287
Feature activation+1.18e-03

Pos logit contributions
 unison+0.026
 amaz+0.025
 bur+0.024
 towels+0.024
 towel+0.023
Neg logit contributions
t-0.034
working-0.034
whe-0.033
z-0.033
ane-0.032
Token amaz
Token ID40642
Feature activation-1.06e-01

Pos logit contributions
hy+0.002
z+0.002
ane+0.002
working+0.002
whe+0.002
Neg logit contributions
 son-0.001
 Mama-0.001
 mum-0.001
 Mum-0.001
 Papa-0.001
Tokenement
Token ID972
Feature activation+2.30e-02

Pos logit contributions
 Mama+0.209
 agreement+0.203
 Mum+0.202
 unison+0.202
 son+0.202
Neg logit contributions
ooky-0.119
w-0.119
iky-0.118
ane-0.116
uch-0.108
Token.
Token ID13
Feature activation-1.69e-03

Pos logit contributions
working+0.040
ane+0.039
z+0.039
t+0.038
iky+0.038
Neg logit contributions
 agreement-0.030
 unison-0.029
 amaz-0.028
 towel-0.027
 towels-0.027
Token
Token ID198
Feature activation-2.60e-03

Pos logit contributions
 towel+0.003
 towels+0.003
 agreement+0.003
 unison+0.003
 amaz+0.003
Neg logit contributions
ane-0.002
t-0.002
 These-0.002
whe-0.002
z-0.002
Token
Token ID198
Feature activation-2.45e-03

Pos logit contributions
 amaz+0.004
 son+0.004
 unison+0.003
 agreement+0.003
 eyes+0.003
Neg logit contributions
t-0.005
iky-0.004
w-0.004
z-0.004
ane-0.004
TokenThe
Token ID464
Feature activation+1.07e-04

Pos logit contributions
 son+0.004
 eyes+0.004
 Mama+0.004
 amaz+0.004
 Mum+0.004
Neg logit contributions
w-0.003
t-0.003
iky-0.003
z-0.003
ane-0.003
Token her
Token ID607
Feature activation-3.61e-03

Pos logit contributions
working+0.001
t+0.001
w+0.001
bows+0.001
ane+0.001
Neg logit contributions
 unison-0.001
 agreement-0.001
 towels-0.001
 fins-0.001
 amaz-0.001
Token eyes
Token ID2951
Feature activation-1.24e-02

Pos logit contributions
 agreement+0.004
 unison+0.003
 towels+0.003
 Molly+0.003
 shallow+0.003
Neg logit contributions
ane-0.007
hy-0.007
working-0.007
w-0.007
iky-0.007
Token grew
Token ID6348
Feature activation-4.37e-03

Pos logit contributions
 Mama+0.017
 son+0.017
 Mum+0.017
 agreement+0.017
 Dad+0.017
Neg logit contributions
w-0.021
iky-0.021
ooky-0.020
uch-0.020
t-0.020
Token wide
Token ID3094
Feature activation+5.63e-03

Pos logit contributions
 unison+0.005
 Mama+0.005
 Mum+0.005
 agreement+0.005
 amaz+0.005
Neg logit contributions
w-0.009
iky-0.009
ooky-0.009
uch-0.009
ane-0.008
Token with
Token ID351
Feature activation-3.46e-03

Pos logit contributions
iky+0.011
ooky+0.010
ane+0.010
bows+0.010
t+0.010
Neg logit contributions
 unison-0.007
 Mama-0.006
 Mum-0.006
 agreement-0.006
 mum-0.006
Token amaz
Token ID40642
Feature activation-1.06e-01

Pos logit contributions
 unison+0.005
 son+0.004
 agreement+0.004
 Deal+0.004
 Carlos+0.004
Neg logit contributions
hy-0.006
iky-0.005
bows-0.005
w-0.005
ooky-0.004
Tokenement
Token ID972
Feature activation+2.57e-02

Pos logit contributions
 Mama+0.186
 Mum+0.184
 unison+0.184
 agreement+0.183
 amaz+0.183
Neg logit contributions
w-0.141
iky-0.136
ooky-0.135
uch-0.134
ane-0.132
Token.
Token ID13
Feature activation+7.45e-05

Pos logit contributions
iky+0.043
ane+0.043
bows+0.043
working+0.042
t+0.042
Neg logit contributions
 agreement-0.039
 unison-0.037
 amaz-0.035
 Mama-0.033
 Papa-0.033
Token She
Token ID1375
Feature activation+3.31e-03

Pos logit contributions
ane+0.000
t+0.000
ugg+0.000
bows+0.000
grand+0.000
Neg logit contributions
 unison-0.000
 agreement-0.000
 towels-0.000
 amaz-0.000
 fins-0.000
Token wanted
Token ID2227
Feature activation-5.75e-03

Pos logit contributions
w+0.004
z+0.004
t+0.004
Bloom+0.004
lder+0.004
Neg logit contributions
 agreement-0.006
 towels-0.005
 unison-0.005
 lunch-0.005
 son-0.005
Token to
Token ID284
Feature activation-1.71e-03

Pos logit contributions
 unison+0.012
 agreement+0.011
 amaz+0.010
 son+0.010
 towel+0.009
Neg logit contributions
t-0.006
w-0.006
working-0.006
ane-0.006
iky-0.006
Token,
Token ID11
Feature activation+1.03e-04

Pos logit contributions
t+0.005
working+0.005
ane+0.005
z+0.005
hy+0.005
Neg logit contributions
 unison-0.004
 amaz-0.004
 agreement-0.004
 towels-0.004
 bur-0.004
Token her
Token ID607
Feature activation-2.60e-03

Pos logit contributions
lder+0.000
t+0.000
ding+0.000
working+0.000
hy+0.000
Neg logit contributions
 unison-0.000
 towels-0.000
 ways-0.000
 amaz-0.000
 agreement-0.000
Token eyes
Token ID2951
Feature activation-1.29e-02

Pos logit contributions
 unison+0.003
 agreement+0.002
 his+0.002
 shallow+0.002
 towels+0.002
Neg logit contributions
working-0.005
ane-0.005
iky-0.005
lder-0.005
ooky-0.005
Token wide
Token ID3094
Feature activation+5.49e-03

Pos logit contributions
 Mama+0.020
 Mum+0.019
 son+0.019
 Dad+0.019
 agreement+0.019
Neg logit contributions
iky-0.020
w-0.020
ooky-0.019
t-0.019
uch-0.019
Token with
Token ID351
Feature activation-4.04e-03

Pos logit contributions
iky+0.010
bows+0.010
z+0.010
ooky+0.010
ane+0.009
Neg logit contributions
 Mama-0.006
 unison-0.006
 agreement-0.006
 Mum-0.006
 Papa-0.006
Token amaz
Token ID40642
Feature activation-1.06e-01

Pos logit contributions
 son+0.005
 buddy+0.004
 agreement+0.004
 unison+0.004
 Deal+0.004
Neg logit contributions
hy-0.007
iky-0.006
bows-0.006
w-0.005
mod-0.005
Tokenement
Token ID972
Feature activation+2.53e-02

Pos logit contributions
 Mama+0.191
 unison+0.188
 Mum+0.188
 agreement+0.188
 amaz+0.186
Neg logit contributions
w-0.136
iky-0.133
ooky-0.131
uch-0.128
ane-0.128
Token.
Token ID13
Feature activation-1.11e-03

Pos logit contributions
iky+0.042
working+0.042
t+0.042
z+0.041
bows+0.041
Neg logit contributions
 agreement-0.037
 unison-0.035
 amaz-0.034
 Mama-0.032
 Papa-0.032
Token
Token ID198
Feature activation-2.64e-03

Pos logit contributions
 unison+0.002
 agreement+0.002
 amaz+0.002
 fear+0.002
 towels+0.002
Neg logit contributions
t-0.002
ane-0.002
 These-0.001
ugg-0.001
lder-0.001
Token
Token ID198
Feature activation-2.35e-03

Pos logit contributions
 amaz+0.004
 unison+0.004
 son+0.004
 fins+0.004
 agreement+0.004
Neg logit contributions
t-0.004
iky-0.004
w-0.004
ooky-0.004
ane-0.004
Token"
Token ID1
Feature activation+2.46e-03

Pos logit contributions
 amaz+0.004
 son+0.004
 Mama+0.004
 eyes+0.004
 unison+0.004
Neg logit contributions
w-0.003
t-0.003
iky-0.003
uch-0.003
ooky-0.003
Token and
Token ID290
Feature activation-1.15e-03

Pos logit contributions
 agreement+0.004
 unison+0.004
 amaz+0.004
 towels+0.004
 Mum+0.004
Neg logit contributions
z-0.004
w-0.004
t-0.004
iky-0.004
uch-0.004
Token M
Token ID337
Feature activation+6.13e-03

Pos logit contributions
 unison+0.002
 towels+0.002
 ditch+0.002
 net+0.002
 spears+0.002
Neg logit contributions
working-0.001
free-0.001
stru-0.001
t-0.001
ugg-0.001
Tokenummy
Token ID13513
Feature activation+6.61e-03

Pos logit contributions
iky+0.006
ooky+0.006
w+0.006
ane+0.006
uch+0.006
Neg logit contributions
 agreement-0.013
 amaz-0.013
 unison-0.013
 Mama-0.012
 Mum-0.012
Token gasped
Token ID45236
Feature activation-2.48e-02

Pos logit contributions
w+0.009
z+0.009
working+0.009
iky+0.009
ane+0.009
Neg logit contributions
 agreement-0.010
 unison-0.009
 Mum-0.009
 Mama-0.009
 Sam-0.009
Token with
Token ID351
Feature activation-3.78e-03

Pos logit contributions
 agreement+0.039
 unison+0.038
 towels+0.037
 towel+0.037
 amaz+0.037
Neg logit contributions
iky-0.031
z-0.031
ooky-0.030
ane-0.030
working-0.030
Token amaz
Token ID40642
Feature activation-1.06e-01

Pos logit contributions
 son+0.004
 towel+0.003
 towels+0.003
 bur+0.003
 Mama+0.003
Neg logit contributions
hy-0.007
iky-0.006
w-0.006
shaped-0.006
mod-0.006
Tokenement
Token ID972
Feature activation+2.45e-02

Pos logit contributions
 Mama+0.208
 agreement+0.203
 unison+0.202
 Mum+0.202
 son+0.201
Neg logit contributions
iky-0.119
w-0.119
ooky-0.118
ane-0.114
uch-0.109
Token.
Token ID13
Feature activation-1.05e-03

Pos logit contributions
working+0.038
bows+0.038
ane+0.038
iky+0.038
z+0.037
Neg logit contributions
 agreement-0.038
 unison-0.034
 amaz-0.033
 Mama-0.032
 Mum-0.031
Token
Token ID198
Feature activation-9.57e-04

Pos logit contributions
 agreement+0.002
 fear+0.002
 towel+0.002
 unison+0.002
 towels+0.002
Neg logit contributions
 These-0.001
 Did-0.001
t-0.001
ane-0.001
 This-0.001
Token
Token ID198
Feature activation-9.17e-04

Pos logit contributions
 amaz+0.002
 son+0.002
 eyes+0.001
 agreement+0.001
 unison+0.001
Neg logit contributions
t-0.001
w-0.001
iky-0.001
z-0.001
ooky-0.001
Token"
Token ID1
Feature activation+4.91e-03

Pos logit contributions
 eyes+0.002
 son+0.002
 amaz+0.002
 agreement+0.002
 Mama+0.002
Neg logit contributions
w-0.001
t-0.001
z-0.001
iky-0.001
ooky-0.001
Token seen
Token ID1775
Feature activation-4.58e-03

Pos logit contributions
w+0.025
working+0.024
uch+0.023
bows+0.023
t+0.023
Neg logit contributions
 Mama-0.027
 mum-0.027
 Mum-0.027
 unison-0.026
 agreement-0.026
Token!
Token ID0
Feature activation-4.08e-03

Pos logit contributions
 unison+0.007
 agreement+0.007
 amaz+0.007
 fins+0.007
 Papa+0.007
Neg logit contributions
bows-0.007
iky-0.007
w-0.007
ane-0.007
t-0.007
Token She
Token ID1375
Feature activation+5.50e-03

Pos logit contributions
 unison+0.007
 amaz+0.007
 towels+0.006
 agreement+0.006
 fins+0.006
Neg logit contributions
t-0.006
ane-0.006
 These-0.006
grand-0.006
ugg-0.006
Token gasped
Token ID45236
Feature activation-2.43e-02

Pos logit contributions
z+0.008
w+0.007
iky+0.007
t+0.007
uch+0.007
Neg logit contributions
 agreement-0.010
 unison-0.009
 Mama-0.009
 Dad-0.009
 amaz-0.009
Token in
Token ID287
Feature activation+3.80e-03

Pos logit contributions
 unison+0.035
 agreement+0.034
 Mama+0.033
 amaz+0.033
 Mum+0.033
Neg logit contributions
w-0.039
iky-0.039
ooky-0.038
ane-0.038
working-0.037
Token amaz
Token ID40642
Feature activation-1.06e-01

Pos logit contributions
hy+0.007
w+0.006
working+0.006
iky+0.006
bows+0.006
Neg logit contributions
 son-0.004
 raft-0.004
 towels-0.003
 Mama-0.003
 mum-0.003
Tokenement
Token ID972
Feature activation+2.61e-02

Pos logit contributions
 Mama+0.192
 unison+0.190
 Mum+0.189
 agreement+0.189
 son+0.188
Neg logit contributions
w-0.135
iky-0.131
ooky-0.130
ane-0.127
uch-0.126
Token,
Token ID11
Feature activation+1.59e-03

Pos logit contributions
working+0.046
z+0.045
ane+0.045
bows+0.045
w+0.045
Neg logit contributions
 agreement-0.034
 unison-0.033
 amaz-0.031
 Mama-0.031
 towels-0.031
Token and
Token ID290
Feature activation-7.43e-04

Pos logit contributions
bows+0.002
hy+0.002
t+0.002
z+0.002
working+0.002
Neg logit contributions
 unison-0.003
 towels-0.003
 son-0.002
 fins-0.002
 agreement-0.002
Token her
Token ID607
Feature activation-2.11e-03

Pos logit contributions
 unison+0.001
 towels+0.001
 agreement+0.001
 towel+0.001
 fins+0.001
Neg logit contributions
w-0.001
working-0.001
z-0.001
whe-0.001
hy-0.001
Token tears
Token ID10953
Feature activation-5.87e-03

Pos logit contributions
 unison+0.003
 agreement+0.002
 towels+0.002
 shallow+0.002
 Mama+0.002
Neg logit contributions
ane-0.003
iky-0.003
working-0.003
z-0.003
hy-0.003
Token of
Token ID286
Feature activation-1.09e-02

Pos logit contributions
t+0.008
iky+0.008
ooky+0.007
ane+0.007
hy+0.007
Neg logit contributions
 Mama-0.010
 amaz-0.010
 Papa-0.010
 mum-0.010
 son-0.010
Token fun
Token ID1257
Feature activation-7.37e-04

Pos logit contributions
 Mama+0.014
 unison+0.013
 Papa+0.013
 Mum+0.013
 mum+0.013
Neg logit contributions
w-0.020
rob-0.020
iky-0.020
ooky-0.019
uch-0.019
Token watching
Token ID4964
Feature activation-9.71e-03

Pos logit contributions
 Mama+0.001
 agreement+0.001
 Mum+0.001
 Papa+0.001
 mum+0.001
Neg logit contributions
w-0.001
iky-0.001
rob-0.001
z-0.001
ooky-0.001
Token the
Token ID262
Feature activation-3.87e-03

Pos logit contributions
 amaz+0.013
 agreement+0.013
 unison+0.013
 bur+0.013
 shovel+0.013
Neg logit contributions
hy-0.015
bows-0.015
iky-0.014
ane-0.014
ooky-0.014
Token ac
Token ID936
Feature activation+7.26e-03

Pos logit contributions
 Mama+0.006
 Mum+0.006
 amaz+0.006
 unison+0.006
 agreement+0.006
Neg logit contributions
w-0.006
iky-0.006
ooky-0.006
uch-0.006
ane-0.006
Tokenrob
Token ID22609
Feature activation+9.77e-02

Pos logit contributions
mod+0.004
iky+0.004
working+0.004
w+0.004
bows+0.004
Neg logit contributions
 water-0.015
 his-0.015
 Sam-0.015
 Mama-0.014
 towels-0.014
Tokenats
Token ID1381
Feature activation+6.52e-03

Pos logit contributions
ane+0.104
w+0.104
iky+0.103
bows+0.101
t+0.098
Neg logit contributions
 son-0.177
 unison-0.174
 dad-0.174
 agreement-0.173
 mum-0.173
Token and
Token ID290
Feature activation-1.69e-03

Pos logit contributions
ane+0.011
t+0.011
ooky+0.011
ding+0.011
whe+0.011
Neg logit contributions
 amaz-0.009
 unison-0.009
 Mama-0.009
 agreement-0.009
 Papa-0.009
Token clown
Token ID25573
Feature activation+5.33e-03

Pos logit contributions
 unison+0.002
 agreement+0.002
 towel+0.002
 Deal+0.002
ulp+0.002
Neg logit contributions
bows-0.002
hy-0.002
whe-0.002
working-0.002
rob-0.002
Tokens
Token ID82
Feature activation-1.77e-04

Pos logit contributions
ane+0.005
z+0.005
t+0.005
w+0.005
iky+0.005
Neg logit contributions
 agreement-0.012
 amaz-0.012
 Mama-0.012
 Mum-0.011
 unison-0.011
Token.
Token ID13
Feature activation-4.60e-04

Pos logit contributions
 Papa+0.000
 Mama+0.000
 agreement+0.000
 bur+0.000
 unison+0.000
Neg logit contributions
bows-0.000
hy-0.000
ane-0.000
whe-0.000
z-0.000
Token.
Token ID13
Feature activation-5.52e-05

Pos logit contributions
 agreement+0.002
 unison+0.002
 amaz+0.002
 son+0.002
 mum+0.002
Neg logit contributions
iky-0.003
w-0.002
ooky-0.002
t-0.002
bows-0.002
Token They
Token ID1119
Feature activation+8.18e-04

Pos logit contributions
 amaz+0.000
 agreement+0.000
 unison+0.000
 son+0.000
 mum+0.000
Neg logit contributions
iky-0.000
t-0.000
ane-0.000
ooky-0.000
bows-0.000
Token were
Token ID547
Feature activation-7.52e-05

Pos logit contributions
w+0.001
iky+0.001
t+0.001
lder+0.001
working+0.001
Neg logit contributions
 agreement-0.001
 unison-0.001
 amaz-0.001
 Mum-0.001
 Mama-0.001
Token blue
Token ID4171
Feature activation+1.06e-02

Pos logit contributions
 unison+0.000
 agreement+0.000
 son+0.000
 amaz+0.000
 towels+0.000
Neg logit contributions
iky-0.000
w-0.000
working-0.000
ooky-0.000
z-0.000
Token and
Token ID290
Feature activation-4.35e-04

Pos logit contributions
w+0.022
iky+0.022
ooky+0.022
uch+0.022
ane+0.022
Neg logit contributions
 agreement-0.010
 Mama-0.010
 unison-0.010
 Mum-0.010
 amaz-0.010
Token spe
Token ID693
Feature activation+7.07e-02

Pos logit contributions
 agreement+0.001
 unison+0.001
 amaz+0.001
 son+0.001
 ways+0.001
Neg logit contributions
working-0.001
w-0.001
iky-0.001
t-0.001
mod-0.001
Tokenck
Token ID694
Feature activation+2.18e-02

Pos logit contributions
ooky+0.145
ane+0.144
t+0.140
bows+0.138
iky+0.137
Neg logit contributions
 agreement-0.073
 hands-0.067
 unison-0.067
 Mama-0.067
 son-0.067
Tokenled
Token ID992
Feature activation+2.42e-02

Pos logit contributions
w+0.015
iky+0.011
ane+0.010
ooky+0.010
z+0.010
Neg logit contributions
 agreement-0.057
 amaz-0.056
 unison-0.055
 Mum-0.055
 mum-0.054
Token.
Token ID13
Feature activation+4.84e-04

Pos logit contributions
bows+0.031
iky+0.029
z+0.028
w+0.027
star+0.026
Neg logit contributions
 agreement-0.040
 unison-0.039
 son-0.037
 Mama-0.036
 Deal-0.036
Token L
Token ID406
Feature activation+1.09e-02

Pos logit contributions
iky+0.001
t+0.001
bows+0.001
ane+0.001
w+0.001
Neg logit contributions
 amaz-0.001
 agreement-0.001
 unison-0.001
 son-0.001
 mum-0.001
Tokenila
Token ID10102
Feature activation+1.10e-02

Pos logit contributions
w+0.017
uch+0.017
working+0.017
ooky+0.017
iky+0.017
Neg logit contributions
 Mum-0.016
 Mama-0.016
 mum-0.015
 Dad-0.015
 son-0.015
Token.
Token ID13
Feature activation+1.00e-02

Pos logit contributions
iky+0.013
ane+0.013
bows+0.012
w+0.012
t+0.012
Neg logit contributions
 agreement-0.010
 son-0.010
 unison-0.010
 amaz-0.010
 Mama-0.010
Token Do
Token ID2141
Feature activation+1.23e-02

Pos logit contributions
ane+0.014
t+0.013
iky+0.012
w+0.012
 These+0.012
Neg logit contributions
 amaz-0.017
 son-0.017
 agreement-0.016
 towel-0.016
 unison-0.016
Token you
Token ID345
Feature activation+8.81e-03

Pos logit contributions
w+0.020
iky+0.019
uch+0.019
ane+0.019
ooky+0.019
Neg logit contributions
 amaz-0.019
 unison-0.019
 agreement-0.019
 Mama-0.019
 son-0.018
Token like
Token ID588
Feature activation+1.05e-02

Pos logit contributions
w+0.012
t+0.011
uch+0.011
working+0.010
iky+0.010
Neg logit contributions
 amaz-0.016
 Mama-0.016
 unison-0.016
 son-0.016
 Mum-0.016
Token rain
Token ID6290
Feature activation-5.21e-03

Pos logit contributions
w+0.020
iky+0.019
ane+0.019
bows+0.018
ooky+0.018
Neg logit contributions
 unison-0.014
 amaz-0.013
 son-0.013
 agreement-0.013
 Mama-0.012
Tokenbows
Token ID25435
Feature activation+6.69e-02

Pos logit contributions
 Mama+0.007
 unison+0.007
 amaz+0.007
 Mum+0.007
 agreement+0.007
Neg logit contributions
w-0.009
iky-0.009
uch-0.009
ooky-0.009
ane-0.009
Token?"
Token ID1701
Feature activation+7.45e-03

Pos logit contributions
w+0.118
iky+0.115
ooky+0.114
uch+0.113
ane+0.112
Neg logit contributions
 Mama-0.091
 amaz-0.090
 unison-0.090
 Mum-0.089
 agreement-0.089
Token Anna
Token ID11735
Feature activation-4.07e-03

Pos logit contributions
w+0.012
t+0.011
ane+0.011
grand+0.011
working+0.011
Neg logit contributions
 unison-0.011
 amaz-0.011
 towels-0.010
 agreement-0.010
 fins-0.010
Token asked
Token ID1965
Feature activation+2.60e-03

Pos logit contributions
 unison+0.005
 amaz+0.005
 agreement+0.005
 son+0.005
 Mum+0.005
Neg logit contributions
w-0.007
iky-0.007
ane-0.007
ooky-0.007
bows-0.007
Token.
Token ID13
Feature activation+5.78e-03

Pos logit contributions
w+0.004
ane+0.004
t+0.004
bows+0.004
working+0.004
Neg logit contributions
 amaz-0.004
 unison-0.004
 bur-0.004
 fins-0.004
 agreement-0.004
Token
Token ID198
Feature activation+8.61e-04

Pos logit contributions
t+0.008
ane+0.008
bows+0.007
iky+0.007
Bloom+0.007
Neg logit contributions
 amaz-0.009
 unison-0.009
 son-0.009
 agreement-0.009
 fins-0.009
Token and
Token ID290
Feature activation-2.43e-03

Pos logit contributions
w+0.010
iky+0.010
ooky+0.010
ane+0.010
uch+0.009
Neg logit contributions
 agreement-0.006
 unison-0.005
 amaz-0.005
 Mum-0.005
 Mama-0.005
Token had
Token ID550
Feature activation-6.57e-03

Pos logit contributions
 agreement+0.005
 unison+0.005
 son+0.005
 towels+0.004
 ways+0.004
Neg logit contributions
w-0.003
iky-0.003
t-0.003
uch-0.003
working-0.003
Token a
Token ID257
Feature activation-1.38e-03

Pos logit contributions
 agreement+0.008
 unison+0.008
 son+0.007
 Mum+0.007
 amaz+0.007
Neg logit contributions
w-0.013
iky-0.013
hy-0.012
t-0.012
uch-0.012
Token big
Token ID1263
Feature activation-2.76e-03

Pos logit contributions
 unison+0.002
 agreement+0.002
 Dad+0.002
 son+0.002
 Mum+0.002
Neg logit contributions
w-0.002
iky-0.002
uch-0.002
ooky-0.002
t-0.002
Token bus
Token ID1323
Feature activation+3.33e-03

Pos logit contributions
 unison+0.003
 agreement+0.003
 son+0.003
 Dad+0.003
 towels+0.003
Neg logit contributions
w-0.005
uch-0.005
iky-0.005
working-0.005
ane-0.005
Tokenhy
Token ID12114
Feature activation+6.77e-02

Pos logit contributions
w+0.002
iky+0.002
uch+0.002
ooky+0.002
ane+0.002
Neg logit contributions
 Mama-0.008
 Mum-0.008
 unison-0.008
 amaz-0.008
 agreement-0.008
Token tail
Token ID7894
Feature activation-1.32e-02

Pos logit contributions
w+0.149
iky+0.146
uch+0.145
ane+0.144
ooky+0.143
Neg logit contributions
 unison-0.060
 Mama-0.060
 Mum-0.060
 agreement-0.060
 amaz-0.059
Token.
Token ID13
Feature activation+6.34e-05

Pos logit contributions
 agreement+0.019
 son+0.019
 amaz+0.019
 Mama+0.018
 Mum+0.018
Neg logit contributions
iky-0.023
w-0.023
ane-0.022
t-0.022
uch-0.022
Token
Token ID198
Feature activation-1.50e-03

Pos logit contributions
t+0.000
 These+0.000
ane+0.000
ding+0.000
working+0.000
Neg logit contributions
 unison-0.000
 agreement-0.000
 towel-0.000
 towels-0.000
 amaz-0.000
Token
Token ID198
Feature activation-1.44e-03

Pos logit contributions
 amaz+0.002
 unison+0.002
 agreement+0.002
 son+0.002
 eyes+0.002
Neg logit contributions
t-0.003
w-0.003
ooky-0.003
iky-0.003
ane-0.003
TokenJack
Token ID14295
Feature activation-1.77e-02

Pos logit contributions
 amaz+0.003
 eyes+0.002
 Mama+0.002
 son+0.002
 agreement+0.002
Neg logit contributions
t-0.002
w-0.002
ooky-0.002
iky-0.002
uch-0.002
Token The
Token ID383
Feature activation+1.31e-03

Pos logit contributions
 unison+0.002
 agreement+0.002
 amaz+0.002
 son+0.002
 mum+0.002
Neg logit contributions
iky-0.003
w-0.003
t-0.003
ane-0.003
ooky-0.003
Token eggs
Token ID9653
Feature activation-4.97e-04

Pos logit contributions
w+0.002
uch+0.002
ane+0.002
iky+0.002
ooky+0.002
Neg logit contributions
 unison-0.002
 amaz-0.002
 Mum-0.002
 agreement-0.002
 Dad-0.002
Token were
Token ID547
Feature activation-4.21e-03

Pos logit contributions
 unison+0.001
 agreement+0.001
 amaz+0.001
 Mama+0.001
 Mum+0.001
Neg logit contributions
w-0.001
uch-0.001
iky-0.001
t-0.001
ane-0.001
Token blue
Token ID4171
Feature activation+7.82e-03

Pos logit contributions
 unison+0.007
 son+0.007
 amaz+0.006
 agreement+0.006
 Mum+0.006
Neg logit contributions
iky-0.006
w-0.006
uch-0.006
z-0.006
working-0.006
Token and
Token ID290
Feature activation-3.30e-03

Pos logit contributions
w+0.017
ooky+0.017
uch+0.017
iky+0.017
ane+0.017
Neg logit contributions
 Mama-0.007
 Mum-0.007
 amaz-0.007
 unison-0.007
 Dad-0.007
Token spe
Token ID693
Feature activation+6.79e-02

Pos logit contributions
 agreement+0.006
 unison+0.006
 son+0.005
 amaz+0.005
 Mum+0.005
Neg logit contributions
w-0.005
iky-0.005
t-0.004
working-0.004
uch-0.004
Tokenck
Token ID694
Feature activation+1.72e-02

Pos logit contributions
ooky+0.139
ane+0.138
t+0.135
iky+0.131
uch+0.130
Neg logit contributions
 agreement-0.069
 hands-0.065
 Mama-0.064
 unison-0.063
 son-0.063
Tokenled
Token ID992
Feature activation+1.95e-02

Pos logit contributions
w+0.013
iky+0.011
ane+0.010
ooky+0.010
t+0.009
Neg logit contributions
 agreement-0.044
 amaz-0.043
 unison-0.043
 Mum-0.042
 mum-0.042
Token.
Token ID13
Feature activation-1.66e-03

Pos logit contributions
iky+0.024
bows+0.024
w+0.023
z+0.023
working+0.022
Neg logit contributions
 unison-0.033
 agreement-0.033
 son-0.032
 Mama-0.031
 mum-0.031
Token L
Token ID406
Feature activation+7.82e-03

Pos logit contributions
 amaz+0.002
 agreement+0.002
 unison+0.002
 son+0.002
 mum+0.002
Neg logit contributions
iky-0.003
t-0.003
ane-0.003
w-0.003
ooky-0.003
Tokenila
Token ID10102
Feature activation+8.04e-03

Pos logit contributions
w+0.013
working+0.012
uch+0.012
ooky+0.012
iky+0.012
Neg logit contributions
 Mum-0.011
 Mama-0.011
 Dad-0.011
 mum-0.011
 son-0.011
Token some
Token ID617
Feature activation+2.60e-03

Pos logit contributions
w+0.009
t+0.008
uch+0.008
working+0.006
ane+0.006
Neg logit contributions
 unison-0.014
 amaz-0.013
 agreement-0.013
 son-0.012
 excitement-0.011
Token leaves
Token ID5667
Feature activation-4.09e-03

Pos logit contributions
w+0.005
iky+0.004
ooky+0.004
uch+0.004
working+0.004
Neg logit contributions
 unison-0.004
 agreement-0.004
 amaz-0.004
 Mum-0.003
 son-0.003
Token and
Token ID290
Feature activation-3.49e-05

Pos logit contributions
 unison+0.009
 amaz+0.008
 agreement+0.008
 son+0.008
 excitement+0.008
Neg logit contributions
t-0.006
w-0.005
ane-0.005
iky-0.005
working-0.005
Token mud
Token ID17492
Feature activation+1.87e-03

Pos logit contributions
 agreement+0.000
 unison+0.000
 amaz+0.000
 son+0.000
 Mum+0.000
Neg logit contributions
w-0.000
working-0.000
uch-0.000
t-0.000
grand-0.000
Token to
Token ID284
Feature activation+1.01e-04

Pos logit contributions
w+0.003
bows+0.003
z+0.003
iky+0.003
ane+0.003
Neg logit contributions
 unison-0.003
 agreement-0.003
 amaz-0.003
 son-0.003
 Mama-0.003
Token make
Token ID787
Feature activation+6.91e-03

Pos logit contributions
uch+0.000
w+0.000
iky+0.000
working+0.000
whe+0.000
Neg logit contributions
 unison-0.000
 son-0.000
 amaz-0.000
 Dad-0.000
 eyes-0.000
Token the
Token ID262
Feature activation-2.73e-05

Pos logit contributions
w+0.011
ooky+0.011
iky+0.011
uch+0.011
z+0.010
Neg logit contributions
 unison-0.011
 agreement-0.010
 amaz-0.009
 son-0.009
 Mum-0.008
Token walls
Token ID7714
Feature activation+8.34e-03

Pos logit contributions
 unison+0.000
 agreement+0.000
 amaz+0.000
 Dad+0.000
 Mum+0.000
Neg logit contributions
w-0.000
uch-0.000
iky-0.000
working-0.000
ooky-0.000
Token."
Token ID526
Feature activation+1.50e-03

Pos logit contributions
iky+0.015
ooky+0.014
bows+0.014
t+0.014
ane+0.014
Neg logit contributions
 unison-0.012
 Mama-0.011
 son-0.011
 agreement-0.011
 Mum-0.011
Token
Token ID198
Feature activation+2.30e-04

Pos logit contributions
t+0.003
ane+0.003
w+0.003
wl+0.003
rob+0.003
Neg logit contributions
 agreement-0.002
 unison-0.002
 amaz-0.001
 towels-0.001
 towel-0.001
Token
Token ID198
Feature activation+2.60e-04

Pos logit contributions
w+0.001
iky+0.001
uch+0.001
ooky+0.001
ane+0.001
Neg logit contributions
 Mama-0.000
 Mum-0.000
 unison-0.000
 agreement-0.000
 amaz-0.000
Token He
Token ID679
Feature activation-2.35e-04

Pos logit contributions
 These+0.005
 Did+0.004
grand+0.004
t+0.004
ane+0.004
Neg logit contributions
 amaz-0.007
 towel-0.007
 agreement-0.007
 towels-0.007
 unison-0.007
Token walked
Token ID6807
Feature activation-1.73e-02

Pos logit contributions
 agreement+0.000
 unison+0.000
 amaz+0.000
 Mama+0.000
 son+0.000
Neg logit contributions
w-0.000
iky-0.000
z-0.000
t-0.000
uch-0.000
Token through
Token ID832
Feature activation-9.19e-04

Pos logit contributions
 unison+0.026
 amaz+0.026
 Mama+0.024
 son+0.024
 Mum+0.024
Neg logit contributions
t-0.029
w-0.028
ooky-0.028
iky-0.027
z-0.027
Token the
Token ID262
Feature activation+3.45e-03

Pos logit contributions
 unison+0.001
 agreement+0.001
 son+0.001
 Deal+0.001
 mum+0.001
Neg logit contributions
z-0.002
iky-0.002
working-0.001
t-0.001
ooky-0.001
Token grass
Token ID8701
Feature activation-1.23e-02

Pos logit contributions
ooky+0.006
iky+0.005
ane+0.005
w+0.005
hy+0.005
Neg logit contributions
 unison-0.005
 son-0.005
 Mum-0.005
 agreement-0.005
 mum-0.005
Token until
Token ID1566
Feature activation+6.62e-03

Pos logit contributions
 unison+0.020
 agreement+0.019
 amaz+0.019
 son+0.018
 approval+0.018
Neg logit contributions
ane-0.020
t-0.019
w-0.018
bows-0.018
working-0.018
Token he
Token ID339
Feature activation+6.23e-04

Pos logit contributions
t+0.010
w+0.010
working+0.010
ane+0.009
uch+0.009
Neg logit contributions
 unison-0.012
 agreement-0.012
 amaz-0.011
 son-0.011
 towels-0.010
Token saw
Token ID2497
Feature activation-5.60e-03

Pos logit contributions
w+0.001
uch+0.001
t+0.001
iky+0.001
ane+0.001
Neg logit contributions
 agreement-0.001
 unison-0.001
 amaz-0.001
 son-0.001
 Mama-0.001
Token a
Token ID257
Feature activation+1.35e-03

Pos logit contributions
 agreement+0.009
 unison+0.008
 Deal+0.008
 amaz+0.007
 approval+0.007
Neg logit contributions
w-0.008
hy-0.008
iky-0.008
rob-0.008
ooky-0.008
Token rough
Token ID5210
Feature activation+9.71e-03

Pos logit contributions
iky+0.002
ooky+0.002
w+0.002
uch+0.002
ane+0.002
Neg logit contributions
 agreement-0.002
 unison-0.002
 Mum-0.002
 amaz-0.002
 hands-0.002
Token stone
Token ID7815
Feature activation-1.23e-02

Pos logit contributions
iky+0.017
w+0.017
working+0.016
ooky+0.016
ane+0.016
Neg logit contributions
 unison-0.013
 amaz-0.013
 Mum-0.012
 Dad-0.012
 agreement-0.012
Token there
Token ID612
Feature activation-1.32e-03

Pos logit contributions
w+0.004
iky+0.003
ooky+0.003
ane+0.003
uch+0.003
Neg logit contributions
 Mama-0.004
 Mum-0.004
 mum-0.004
 agreement-0.004
 unison-0.004
Token,
Token ID11
Feature activation+5.01e-03

Pos logit contributions
 agreement+0.003
 unison+0.003
 amaz+0.003
 fins+0.003
 approval+0.003
Neg logit contributions
w-0.002
ane-0.001
uch-0.001
t-0.001
ooky-0.001
Token but
Token ID475
Feature activation+7.85e-03

Pos logit contributions
w+0.007
t+0.006
working+0.006
ane+0.006
hy+0.006
Neg logit contributions
 unison-0.009
 agreement-0.009
 amaz-0.008
 bur-0.008
 towels-0.008
Token he
Token ID339
Feature activation+5.33e-03

Pos logit contributions
w+0.012
working+0.010
uch+0.010
t+0.010
ho+0.009
Neg logit contributions
 unison-0.014
 agreement-0.013
 towels-0.013
 amaz-0.012
 towel-0.011
Token didn
Token ID1422
Feature activation-3.30e-04

Pos logit contributions
w+0.007
uch+0.007
t+0.006
working+0.006
lder+0.005
Neg logit contributions
 agreement-0.010
 unison-0.010
 towels-0.010
 amaz-0.010
 towel-0.009
Token't
Token ID470
Feature activation+1.27e-02

Pos logit contributions
 Mama+0.001
 Mum+0.001
 unison+0.001
 mum+0.001
 agreement+0.001
Neg logit contributions
w-0.000
iky-0.000
ooky-0.000
uch-0.000
ane-0.000
Token worry
Token ID5490
Feature activation+8.04e-03

Pos logit contributions
t+0.019
working+0.018
iky+0.017
uch+0.017
ooky+0.016
Neg logit contributions
 amaz-0.023
 unison-0.022
 towels-0.021
 eyes-0.020
 Dad-0.020
Token too
Token ID1165
Feature activation+7.49e-03

Pos logit contributions
t+0.012
iky+0.011
ooky+0.011
bows+0.010
ane+0.009
Neg logit contributions
 unison-0.014
 amaz-0.014
 agreement-0.013
 bur-0.012
 towels-0.012
Token much
Token ID881
Feature activation-7.36e-03

Pos logit contributions
ooky+0.013
iky+0.012
t+0.012
working+0.012
ane+0.011
Neg logit contributions
 unison-0.011
 towels-0.011
 amaz-0.011
 agreement-0.011
 towel-0.010
Token.
Token ID13
Feature activation+4.33e-03

Pos logit contributions
 unison+0.010
 amaz+0.010
 agreement+0.010
 Mum+0.010
 mum+0.010
Neg logit contributions
ooky-0.012
iky-0.012
working-0.012
t-0.012
bows-0.012
Token
Token ID198
Feature activation+1.57e-03

Pos logit contributions
t+0.005
 These+0.005
 Did+0.005
ane+0.005
hy+0.004
Neg logit contributions
 towel-0.008
 towels-0.008
 agreement-0.008
 amaz-0.008
 unison-0.008
Token lot
Token ID1256
Feature activation+9.34e-03

Pos logit contributions
uch+0.005
w+0.005
working+0.005
ane+0.005
t+0.005
Neg logit contributions
 unison-0.005
 agreement-0.005
 Mum-0.004
 Mama-0.004
 amaz-0.004
Token of
Token ID286
Feature activation-6.40e-03

Pos logit contributions
t+0.014
w+0.013
iky+0.013
ane+0.013
ooky+0.013
Neg logit contributions
 Mama-0.016
 son-0.016
 unison-0.016
 agreement-0.016
 Papa-0.016
Token work
Token ID670
Feature activation+9.46e-03

Pos logit contributions
 Mama+0.011
 unison+0.011
 Mum+0.011
 Papa+0.010
 mum+0.010
Neg logit contributions
w-0.009
rob-0.008
working-0.008
bows-0.008
mod-0.008
Token to
Token ID284
Feature activation+1.44e-03

Pos logit contributions
t+0.015
w+0.015
ane+0.015
working+0.015
iky+0.015
Neg logit contributions
 unison-0.015
 Mama-0.014
 amaz-0.014
 agreement-0.014
 Mum-0.014
Token do
Token ID466
Feature activation+1.76e-02

Pos logit contributions
uch+0.002
whe+0.002
w+0.002
hy+0.002
working+0.002
Neg logit contributions
 Mama-0.003
 Papa-0.003
 Molly-0.002
 Mum-0.002
 son-0.002
Token all
Token ID477
Feature activation+1.08e-02

Pos logit contributions
ooky+0.030
t+0.030
w+0.030
iky+0.029
ane+0.028
Neg logit contributions
 Mama-0.025
 unison-0.025
 agreement-0.025
 Papa-0.024
 Mum-0.024
Token day
Token ID1110
Feature activation+7.07e-03

Pos logit contributions
working+0.022
w+0.022
rob+0.022
ane+0.021
bows+0.021
Neg logit contributions
 unison-0.012
 Mama-0.011
 mum-0.011
 Mum-0.011
 Dad-0.011
Token long
Token ID890
Feature activation+1.53e-02

Pos logit contributions
t+0.012
working+0.012
w+0.012
iky+0.012
ooky+0.012
Neg logit contributions
 Mama-0.010
 agreement-0.010
 unison-0.010
 Mum-0.010
 son-0.009
Token.
Token ID13
Feature activation+2.35e-03

Pos logit contributions
working+0.025
w+0.024
iky+0.024
bows+0.024
rob+0.023
Neg logit contributions
 Mama-0.023
 agreement-0.023
 Mum-0.023
 unison-0.022
 mum-0.022
Token But
Token ID887
Feature activation-2.52e-03

Pos logit contributions
w+0.004
iky+0.004
ane+0.004
ooky+0.004
t+0.004
Neg logit contributions
 unison-0.003
 agreement-0.003
 amaz-0.003
 mum-0.003
 Mama-0.003
Token,
Token ID11
Feature activation+4.23e-03

Pos logit contributions
 unison+0.004
 towels+0.004
 agreement+0.003
 towel+0.003
 amaz+0.003
Neg logit contributions
w-0.004
t-0.004
working-0.004
uch-0.004
ane-0.004
Token and
Token ID290
Feature activation+1.87e-03

Pos logit contributions
w+0.039
iky+0.039
t+0.038
ane+0.038
ooky+0.038
Neg logit contributions
 unison-0.026
 amaz-0.026
 agreement-0.026
 Mama-0.025
 son-0.025
Token papers
Token ID9473
Feature activation-2.27e-03

Pos logit contributions
w+0.003
working+0.003
iky+0.002
t+0.002
bows+0.002
Neg logit contributions
 agreement-0.003
 unison-0.003
 amaz-0.003
 Mama-0.003
 Mum-0.003
Token and
Token ID290
Feature activation+2.21e-03

Pos logit contributions
 unison+0.003
 amaz+0.003
 Mama+0.003
 agreement+0.003
 son+0.002
Neg logit contributions
w-0.004
iky-0.004
t-0.004
ooky-0.004
ane-0.004
Token stickers
Token ID28568
Feature activation+5.19e-03

Pos logit contributions
working+0.003
w+0.003
bows+0.003
iky+0.002
t+0.002
Neg logit contributions
 agreement-0.004
 unison-0.004
 amaz-0.004
 Mum-0.004
 Mama-0.004
Token.
Token ID13
Feature activation+2.26e-03

Pos logit contributions
w+0.011
iky+0.010
t+0.010
ane+0.010
ooky+0.010
Neg logit contributions
 amaz-0.006
 unison-0.006
 agreement-0.006
 Mama-0.006
 son-0.005
Token She
Token ID1375
Feature activation+7.63e-03

Pos logit contributions
iky+0.004
w+0.004
ooky+0.004
ane+0.004
uch+0.003
Neg logit contributions
 unison-0.003
 amaz-0.003
 son-0.003
 agreement-0.003
 Mama-0.003
Token liked
Token ID8288
Feature activation+5.30e-03

Pos logit contributions
w+0.015
uch+0.015
ooky+0.015
ane+0.015
iky+0.015
Neg logit contributions
 dad-0.009
 Dad-0.008
 Mama-0.008
 mum-0.008
 Mum-0.008
Token to
Token ID284
Feature activation+9.30e-04

Pos logit contributions
w+0.009
iky+0.009
uch+0.009
ooky+0.009
ane+0.009
Neg logit contributions
 Mama-0.007
 Mum-0.007
 amaz-0.007
 unison-0.007
 agreement-0.007
Token make
Token ID787
Feature activation+8.80e-03

Pos logit contributions
w+0.002
iky+0.002
ooky+0.001
uch+0.001
working+0.001
Neg logit contributions
 Mama-0.001
 amaz-0.001
 unison-0.001
 Mum-0.001
 son-0.001
Token pictures
Token ID5986
Feature activation+5.23e-03

Pos logit contributions
iky+0.016
ooky+0.016
w+0.016
bows+0.015
ane+0.015
Neg logit contributions
 unison-0.011
 agreement-0.011
 Mama-0.011
 amaz-0.011
 Mum-0.011
Token of
Token ID286
Feature activation-5.62e-03

Pos logit contributions
iky+0.009
w+0.009
ooky+0.008
ane+0.008
bows+0.008
Neg logit contributions
 unison-0.007
 son-0.007
 agreement-0.007
 amaz-0.007
 Mama-0.007
Token to
Token ID284
Feature activation-1.06e-03

Pos logit contributions
 unison+0.013
 amaz+0.012
 agreement+0.012
 Deal+0.010
 son+0.010
Neg logit contributions
t-0.004
w-0.003
working-0.003
uch-0.003
whe-0.003
Token give
Token ID1577
Feature activation+2.99e-03

Pos logit contributions
 unison+0.002
 amaz+0.002
 agreement+0.002
 son+0.002
 towels+0.002
Neg logit contributions
uch-0.001
t-0.001
w-0.001
working-0.001
lder-0.001
Token the
Token ID262
Feature activation-3.54e-03

Pos logit contributions
w+0.006
uch+0.005
t+0.005
working+0.005
z+0.005
Neg logit contributions
 unison-0.005
 amaz-0.004
 agreement-0.004
 towels-0.004
 towel-0.003
Token rabbit
Token ID22746
Feature activation-8.51e-03

Pos logit contributions
 unison+0.006
 amaz+0.005
 Dad+0.005
 Mum+0.005
 their+0.004
Neg logit contributions
working-0.005
iky-0.005
t-0.005
w-0.005
ooky-0.005
Token a
Token ID257
Feature activation-1.11e-03

Pos logit contributions
 unison+0.013
 amaz+0.012
 agreement+0.011
 Dad+0.010
 Deal+0.010
Neg logit contributions
working-0.013
w-0.013
t-0.013
hy-0.012
uch-0.012
Token hug
Token ID16225
Feature activation-5.95e-03

Pos logit contributions
 unison+0.002
 themselves+0.002
 agreement+0.002
 amaz+0.002
 Dad+0.002
Neg logit contributions
t-0.002
uch-0.002
w-0.002
working-0.002
iky-0.002
Token,
Token ID11
Feature activation-4.94e-04

Pos logit contributions
 amaz+0.010
 unison+0.010
 agreement+0.009
 fins+0.008
 bur+0.008
Neg logit contributions
t-0.010
ane-0.009
w-0.009
working-0.009
hy-0.009
Token so
Token ID523
Feature activation+6.64e-03

Pos logit contributions
 unison+0.001
 amaz+0.001
 agreement+0.001
 mum+0.001
 towels+0.001
Neg logit contributions
t-0.001
hy-0.000
whe-0.000
just-0.000
but-0.000
Token he
Token ID339
Feature activation+2.46e-04

Pos logit contributions
t+0.008
w+0.008
working+0.008
z+0.008
hy+0.007
Neg logit contributions
 unison-0.013
 towels-0.013
 agreement-0.012
 amaz-0.012
 towel-0.011
Token opened
Token ID4721
Feature activation-4.10e-03

Pos logit contributions
w+0.000
t+0.000
working+0.000
uch+0.000
z+0.000
Neg logit contributions
 amaz-0.000
 unison-0.000
 agreement-0.000
 ways-0.000
 towels-0.000
Token his
Token ID465
Feature activation-1.91e-03

Pos logit contributions
 unison+0.007
 son+0.005
 agreement+0.005
 amaz+0.005
 Deal+0.005
Neg logit contributions
z-0.006
w-0.006
working-0.005
ane-0.005
iky-0.005
Token.
Token ID13
Feature activation+1.62e-03

Pos logit contributions
w+0.005
t+0.005
uch+0.005
iky+0.005
ooky+0.005
Neg logit contributions
 amaz-0.007
 unison-0.006
 agreement-0.006
 Mum-0.006
 mum-0.006
Token He
Token ID679
Feature activation-3.06e-03

Pos logit contributions
w+0.002
iky+0.002
uch+0.002
t+0.002
ane+0.002
Neg logit contributions
 amaz-0.003
 agreement-0.003
 unison-0.003
 son-0.003
 mum-0.003
Token made
Token ID925
Feature activation+3.70e-03

Pos logit contributions
 agreement+0.005
 amaz+0.005
 unison+0.005
 Mum+0.005
 Mama+0.005
Neg logit contributions
w-0.004
uch-0.004
iky-0.004
ooky-0.004
t-0.004
Token a
Token ID257
Feature activation+6.94e-04

Pos logit contributions
w+0.007
iky+0.006
ooky+0.006
t+0.006
z+0.006
Neg logit contributions
 unison-0.005
 amaz-0.005
 agreement-0.005
 towel-0.004
 Papa-0.004
Token clay
Token ID21558
Feature activation-1.90e-03

Pos logit contributions
w+0.001
iky+0.001
ooky+0.001
uch+0.001
t+0.001
Neg logit contributions
 unison-0.001
 amaz-0.001
 Mum-0.001
 Dad-0.001
 Papa-0.001
Token mask
Token ID9335
Feature activation-8.95e-03

Pos logit contributions
 unison+0.003
 amaz+0.003
 agreement+0.003
 Mum+0.003
 Mama+0.003
Neg logit contributions
w-0.003
ane-0.003
working-0.003
iky-0.003
bows-0.003
Token and
Token ID290
Feature activation-1.40e-03

Pos logit contributions
 amaz+0.013
 unison+0.013
 agreement+0.012
 Mum+0.012
 Mama+0.012
Neg logit contributions
t-0.015
w-0.015
iky-0.015
ooky-0.015
ane-0.015
Token put
Token ID1234
Feature activation+9.31e-03

Pos logit contributions
 agreement+0.002
 unison+0.002
 amaz+0.002
 Mum+0.002
 mum+0.002
Neg logit contributions
w-0.002
working-0.002
bows-0.002
hy-0.002
rob-0.002
Token it
Token ID340
Feature activation-1.48e-04

Pos logit contributions
w+0.018
uch+0.014
t+0.014
iky+0.014
working+0.014
Neg logit contributions
 unison-0.015
 amaz-0.014
 agreement-0.014
 son-0.013
 mum-0.012
Token on
Token ID319
Feature activation+7.69e-03

Pos logit contributions
 amaz+0.000
 unison+0.000
 son+0.000
 Mama+0.000
 agreement+0.000
Neg logit contributions
w-0.000
uch-0.000
z-0.000
iky-0.000
working-0.000
Token his
Token ID465
Feature activation-1.55e-03

Pos logit contributions
w+0.014
iky+0.013
ooky+0.012
working+0.012
t+0.012
Neg logit contributions
 amaz-0.011
 unison-0.010
 agreement-0.009
 son-0.009
 fins-0.008
Token not
Token ID407
Feature activation-5.31e-03

Pos logit contributions
w+0.001
iky+0.001
t+0.001
working+0.001
ooky+0.001
Neg logit contributions
 amaz-0.001
 unison-0.001
 agreement-0.001
 delight-0.001
 son-0.001
Token fair
Token ID3148
Feature activation+1.90e-02

Pos logit contributions
 amaz+0.010
 unison+0.010
 agreement+0.010
 mum+0.009
 Mum+0.009
Neg logit contributions
working-0.007
w-0.007
ooky-0.007
iky-0.007
t-0.006
Token.
Token ID13
Feature activation+4.08e-03

Pos logit contributions
iky+0.030
ooky+0.030
bows+0.030
working+0.030
t+0.029
Neg logit contributions
 unison-0.031
 agreement-0.031
 amaz-0.030
 Mama-0.028
 Mum-0.027
Token I
Token ID314
Feature activation-9.18e-03

Pos logit contributions
w+0.005
ane+0.005
iky+0.005
uch+0.005
t+0.005
Neg logit contributions
 amaz-0.007
 agreement-0.007
 unison-0.007
 mum-0.007
 son-0.007
Token have
Token ID423
Feature activation+3.69e-04

Pos logit contributions
 Mama+0.014
 unison+0.014
 amaz+0.014
 Mum+0.014
 agreement+0.014
Neg logit contributions
w-0.015
uch-0.014
iky-0.014
ooky-0.014
t-0.014
Token to
Token ID284
Feature activation+1.60e-04

Pos logit contributions
w+0.001
t+0.001
working+0.001
uch+0.001
iky+0.001
Neg logit contributions
 agreement-0.001
 unison-0.001
 amaz-0.001
 Mama-0.000
 Mum-0.000
Token un
Token ID555
Feature activation-2.51e-02

Pos logit contributions
uch+0.000
w+0.000
hy+0.000
working+0.000
grand+0.000
Neg logit contributions
 amaz-0.000
 unison-0.000
 fins-0.000
 Mama-0.000
 bur-0.000
Tokenpack
Token ID8002
Feature activation+4.96e-03

Pos logit contributions
 Dad+0.038
 dad+0.038
 agreement+0.037
 towels+0.037
 hand+0.036
Neg logit contributions
iky-0.041
bows-0.040
w-0.040
ooky-0.038
uch-0.038
Token my
Token ID616
Feature activation-4.30e-03

Pos logit contributions
w+0.008
ooky+0.008
iky+0.008
t+0.007
ane+0.007
Neg logit contributions
 unison-0.007
 amaz-0.007
 bur-0.007
 agreement-0.007
 fins-0.007
Token bag
Token ID6131
Feature activation+1.28e-03

Pos logit contributions
 unison+0.007
 agreement+0.007
 Judy+0.007
 Fred+0.007
 Rachel+0.007
Neg logit contributions
uch-0.006
w-0.006
grand-0.005
ooky-0.005
iky-0.005
Token and
Token ID290
Feature activation+5.96e-04

Pos logit contributions
t+0.002
ooky+0.002
ane+0.002
w+0.002
iky+0.002
Neg logit contributions
 amaz-0.002
 unison-0.002
 agreement-0.002
 fins-0.002
 Mama-0.002
Tokenoppy
Token ID26696
Feature activation+2.66e-03

Pos logit contributions
w+0.028
iky+0.028
ooky+0.027
ane+0.027
working+0.027
Neg logit contributions
 Mama-0.028
 Mum-0.027
 mum-0.027
 son-0.027
 Dad-0.026
Token cries
Token ID24691
Feature activation+9.82e-03

Pos logit contributions
w+0.004
iky+0.004
ooky+0.004
uch+0.004
ane+0.004
Neg logit contributions
 Mama-0.004
 Mum-0.004
 amaz-0.004
 mum-0.004
 agreement-0.004
Token.
Token ID13
Feature activation+2.67e-03

Pos logit contributions
w+0.021
iky+0.020
ooky+0.020
uch+0.020
ane+0.020
Neg logit contributions
 unison-0.009
 amaz-0.009
 Mama-0.009
 Mum-0.009
 agreement-0.009
Token She
Token ID1375
Feature activation+7.83e-03

Pos logit contributions
w+0.005
iky+0.005
ooky+0.005
ane+0.005
uch+0.005
Neg logit contributions
 unison-0.004
 agreement-0.003
 amaz-0.003
 son-0.003
 Mum-0.003
Token has
Token ID468
Feature activation-1.20e-03

Pos logit contributions
w+0.012
iky+0.012
ooky+0.012
ane+0.012
uch+0.011
Neg logit contributions
 Mama-0.012
 Mum-0.012
 amaz-0.012
 unison-0.012
 agreement-0.012
Token no
Token ID645
Feature activation+1.22e-05

Pos logit contributions
 unison+0.002
 agreement+0.002
 son+0.002
 amaz+0.002
 Mama+0.002
Neg logit contributions
w-0.002
uch-0.002
t-0.002
hy-0.002
working-0.002
Token balls
Token ID11333
Feature activation+2.47e-03

Pos logit contributions
working+0.000
hy+0.000
w+0.000
iky+0.000
bows+0.000
Neg logit contributions
 Papa-0.000
 unison-0.000
 amaz-0.000
 Simon-0.000
 Mick-0.000
Token left
Token ID1364
Feature activation+9.08e-03

Pos logit contributions
t+0.004
w+0.004
iky+0.004
ane+0.004
uch+0.004
Neg logit contributions
 amaz-0.004
 unison-0.004
 Mama-0.004
 Mum-0.004
 mum-0.004
Token.
Token ID13
Feature activation+3.28e-03

Pos logit contributions
w+0.013
t+0.013
ane+0.012
hy+0.012
iky+0.012
Neg logit contributions
 amaz-0.017
 unison-0.016
 agreement-0.015
 mum-0.015
 Mum-0.015
Token She
Token ID1375
Feature activation+8.41e-03

Pos logit contributions
w+0.005
iky+0.005
uch+0.005
ooky+0.005
ane+0.005
Neg logit contributions
 Mama-0.005
 Mum-0.005
 unison-0.005
 amaz-0.005
 agreement-0.005
Token sits
Token ID10718
Feature activation-9.31e-03

Pos logit contributions
w+0.012
iky+0.012
ooky+0.012
ane+0.012
uch+0.012
Neg logit contributions
 Mama-0.014
 Mum-0.013
 amaz-0.013
 unison-0.013
 mum-0.013
Token of
Token ID286
Feature activation-1.17e-02

Pos logit contributions
 unison+0.010
 agreement+0.010
 towels+0.010
 amaz+0.010
 towel+0.009
Neg logit contributions
t-0.005
ane-0.004
working-0.004
ending-0.003
iky-0.003
Token what
Token ID644
Feature activation+5.67e-03

Pos logit contributions
 unison+0.015
 towels+0.015
 amaz+0.015
 towel+0.015
 son+0.015
Neg logit contributions
rob-0.019
iky-0.018
w-0.018
working-0.018
makers-0.017
Token he
Token ID339
Feature activation+5.31e-04

Pos logit contributions
working+0.008
t+0.008
w+0.008
ane+0.007
grand+0.007
Neg logit contributions
 unison-0.010
 agreement-0.010
 amaz-0.010
 towels-0.009
 bur-0.008
Token had
Token ID550
Feature activation-6.63e-03

Pos logit contributions
w+0.001
iky+0.001
uch+0.001
ooky+0.001
ane+0.001
Neg logit contributions
 Mama-0.001
 Mum-0.001
 unison-0.001
 amaz-0.001
 agreement-0.001
Token accomplished
Token ID13013
Feature activation-6.97e-03

Pos logit contributions
 Mama+0.011
 unison+0.010
 Mum+0.010
 amaz+0.010
 agreement+0.010
Neg logit contributions
w-0.011
uch-0.010
t-0.010
ooky-0.010
ane-0.010
Token.
Token ID13
Feature activation-1.35e-03

Pos logit contributions
 unison+0.011
 Mama+0.011
 amaz+0.010
 Mum+0.010
 bur+0.010
Neg logit contributions
t-0.011
w-0.011
working-0.011
ooky-0.010
ane-0.010
Token He
Token ID679
Feature activation-4.19e-03

Pos logit contributions
 towels+0.003
 towel+0.003
 unison+0.003
 agreement+0.002
 fins+0.002
Neg logit contributions
ane-0.002
t-0.002
ugg-0.001
grand-0.001
bows-0.001
Token thanked
Token ID26280
Feature activation+2.23e-03

Pos logit contributions
 agreement+0.007
 towels+0.007
 unison+0.007
 Mum+0.007
 Mama+0.007
Neg logit contributions
w-0.006
ane-0.005
t-0.005
z-0.005
working-0.005
Token the
Token ID262
Feature activation-2.75e-03

Pos logit contributions
rob+0.004
iky+0.004
working+0.004
bows+0.004
t+0.004
Neg logit contributions
 unison-0.002
 amaz-0.002
 towels-0.002
 fins-0.002
 towel-0.002
Token little
Token ID1310
Feature activation-4.38e-03

Pos logit contributions
 unison+0.004
 towels+0.004
 Mama+0.004
 amaz+0.004
 towel+0.004
Neg logit contributions
working-0.004
ane-0.004
w-0.004
iky-0.004
ooky-0.004
Token girl
Token ID2576
Feature activation-1.51e-02

Pos logit contributions
 unison+0.006
 Mama+0.005
 amaz+0.005
 Mum+0.005
 agreement+0.005
Neg logit contributions
w-0.008
ane-0.008
iky-0.008
working-0.008
ooky-0.007
Token and
Token ID290
Feature activation-2.61e-03

Pos logit contributions
w+0.027
ooky+0.026
iky+0.026
t+0.026
working+0.026
Neg logit contributions
 unison-0.021
 agreement-0.020
 Mum-0.020
 amaz-0.020
 Mama-0.020
Token dangerous
Token ID4923
Feature activation+6.90e-03

Pos logit contributions
 unison+0.005
 agreement+0.005
 son+0.004
 amaz+0.004
 towels+0.004
Neg logit contributions
working-0.003
t-0.003
w-0.003
hy-0.003
uch-0.003
Token to
Token ID284
Feature activation-2.68e-03

Pos logit contributions
bows+0.012
t+0.012
working+0.012
w+0.012
ooky+0.012
Neg logit contributions
 unison-0.010
 amaz-0.010
 agreement-0.009
 mum-0.009
 Mum-0.009
Token explore
Token ID7301
Feature activation-4.55e-03

Pos logit contributions
 amaz+0.005
 unison+0.004
 son+0.004
 eyes+0.004
 towels+0.004
Neg logit contributions
w-0.004
uch-0.004
working-0.004
t-0.004
iky-0.004
Token the
Token ID262
Feature activation-3.34e-03

Pos logit contributions
 unison+0.007
 son+0.006
 fins+0.006
 agreement+0.006
 mum+0.006
Neg logit contributions
ooky-0.007
iky-0.007
bows-0.007
w-0.006
working-0.006
Tokenwoods
Token ID39493
Feature activation-8.14e-02

Pos logit contributions
 unison+0.006
 Mum+0.006
 agreement+0.006
 Mama+0.006
 amaz+0.005
Neg logit contributions
w-0.005
ooky-0.005
iky-0.005
uch-0.004
ane-0.004
Token alone
Token ID3436
Feature activation-4.47e-03

Pos logit contributions
 Mama+0.130
 Mum+0.129
 Papa+0.126
 son+0.125
 mum+0.125
Neg logit contributions
ooky-0.131
iky-0.128
ane-0.124
t-0.124
w-0.121
Token.
Token ID13
Feature activation-1.02e-03

Pos logit contributions
 amaz+0.007
 agreement+0.007
 unison+0.007
 towel+0.006
 Mama+0.006
Neg logit contributions
t-0.007
w-0.007
working-0.007
ane-0.007
bows-0.007
Token With
Token ID2080
Feature activation-8.74e-04

Pos logit contributions
 unison+0.001
 towels+0.001
 agreement+0.001
 fins+0.001
 towel+0.001
Neg logit contributions
t-0.002
ane-0.002
w-0.002
ooky-0.002
iky-0.002
Token her
Token ID607
Feature activation-4.44e-03

Pos logit contributions
 son+0.001
 agreement+0.001
 unison+0.001
 towel+0.001
 towels+0.001
Neg logit contributions
w-0.001
hy-0.001
working-0.001
iky-0.001
t-0.001
Token pole
Token ID16825
Feature activation-2.18e-02

Pos logit contributions
 unison+0.005
 Pete+0.005
 agreement+0.004
die+0.004
 Freddy+0.004
Neg logit contributions
hy-0.008
w-0.008
t-0.008
ane-0.008
uch-0.007
Token were
Token ID547
Feature activation+7.17e-04

Pos logit contributions
w+0.019
t+0.019
ane+0.018
bows+0.018
uch+0.017
Neg logit contributions
 son-0.024
 unison-0.024
 agreement-0.024
 Mama-0.024
 Mum-0.023
Token c
Token ID269
Feature activation+7.54e-03

Pos logit contributions
iky+0.001
w+0.001
bows+0.001
ooky+0.001
z+0.001
Neg logit contributions
 unison-0.001
 son-0.001
 amaz-0.001
 agreement-0.001
 Dad-0.001
Tokenaw
Token ID707
Feature activation+3.51e-02

Pos logit contributions
w+0.008
rob+0.008
ane+0.008
mod+0.008
iky+0.007
Neg logit contributions
 unison-0.014
 Mama-0.014
 Mum-0.014
 mum-0.014
 son-0.014
Tokening
Token ID278
Feature activation+2.41e-02

Pos logit contributions
w+0.048
uch+0.045
t+0.045
bows+0.045
iky+0.044
Neg logit contributions
 agreement-0.067
 amaz-0.066
 Mama-0.064
 unison-0.064
 Mum-0.063
Token in
Token ID287
Feature activation+9.52e-03

Pos logit contributions
w+0.037
bows+0.037
iky+0.036
z+0.035
ane+0.035
Neg logit contributions
 amaz-0.040
 unison-0.039
 agreement-0.039
 son-0.038
 Mama-0.037
Token unison
Token ID49418
Feature activation-6.96e-02

Pos logit contributions
w+0.015
hy+0.013
z+0.012
iky+0.012
bows+0.011
Neg logit contributions
 son-0.014
 mum-0.012
 dad-0.012
 mother-0.011
 unison-0.011
Token to
Token ID284
Feature activation+6.43e-03

Pos logit contributions
 amaz+0.104
 agreement+0.103
 bur+0.101
 son+0.098
 Mama+0.097
Neg logit contributions
t-0.119
bows-0.106
ane-0.104
hy-0.103
w-0.103
Token make
Token ID787
Feature activation+1.18e-02

Pos logit contributions
w+0.008
hy+0.008
uch+0.008
working+0.007
bows+0.007
Neg logit contributions
 son-0.010
 unison-0.010
 amaz-0.010
 towels-0.009
 fins-0.009
Token it
Token ID340
Feature activation+1.07e-02

Pos logit contributions
w+0.021
bows+0.019
z+0.019
iky+0.019
ic+0.018
Neg logit contributions
 unison-0.016
 agreement-0.015
 amaz-0.015
 shovel-0.014
 Dad-0.013
Token sound
Token ID2128
Feature activation+6.31e-03

Pos logit contributions
bows+0.017
iky+0.016
z+0.016
t+0.016
uch+0.016
Neg logit contributions
 unison-0.017
 Papa-0.016
 agreement-0.016
 amaz-0.016
 Dad-0.015
Token like
Token ID588
Feature activation+1.06e-02

Pos logit contributions
t+0.010
ooky+0.010
iky+0.010
bows+0.010
w+0.009
Neg logit contributions
 amaz-0.010
 unison-0.010
 agreement-0.009
 son-0.009
 mum-0.009
Token any
Token ID597
Feature activation-8.69e-03

Pos logit contributions
w+0.012
uch+0.010
working+0.009
t+0.009
iky+0.009
Neg logit contributions
 amaz-0.011
 unison-0.011
 agreement-0.010
 son-0.010
 towels-0.009
Token more
Token ID517
Feature activation-6.70e-03

Pos logit contributions
 unison+0.013
 amaz+0.012
 Sam+0.011
 Dad+0.011
 agreement+0.011
Neg logit contributions
w-0.016
working-0.014
uch-0.014
t-0.014
iky-0.014
Token without
Token ID1231
Feature activation-2.09e-03

Pos logit contributions
 unison+0.013
 amaz+0.011
 agreement+0.011
 son+0.010
 Mum+0.010
Neg logit contributions
working-0.010
w-0.010
iky-0.010
t-0.009
ooky-0.008
Token asking
Token ID4737
Feature activation+1.25e-02

Pos logit contributions
 amaz+0.004
 unison+0.004
 agreement+0.003
 towels+0.003
 towel+0.003
Neg logit contributions
w-0.003
working-0.003
uch-0.003
lder-0.002
t-0.002
Token.
Token ID13
Feature activation+6.70e-04

Pos logit contributions
w+0.021
uch+0.019
t+0.018
working+0.017
ane+0.017
Neg logit contributions
 amaz-0.022
 unison-0.020
 agreement-0.019
 towel-0.018
 bur-0.017
Token Deal
Token ID15138
Feature activation-6.23e-02

Pos logit contributions
ane+0.001
uch+0.001
w+0.001
t+0.001
grand+0.001
Neg logit contributions
 agreement-0.001
 towel-0.001
 amaz-0.001
 towels-0.001
 unison-0.001
Token?"
Token ID1701
Feature activation+3.98e-03

Pos logit contributions
 amaz+0.116
 unison+0.114
 agreement+0.112
 son+0.107
 towels+0.106
Neg logit contributions
w-0.084
uch-0.083
t-0.079
ane-0.078
ooky-0.076
Token
Token ID198
Feature activation+1.14e-03

Pos logit contributions
w+0.006
t+0.006
working+0.006
mod+0.006
wl+0.006
Neg logit contributions
 unison-0.006
 towels-0.005
 agreement-0.005
 amaz-0.005
 eyes-0.005
Token
Token ID198
Feature activation+1.27e-03

Pos logit contributions
w+0.002
t+0.002
wl+0.002
ane+0.002
ooky+0.002
Neg logit contributions
 amaz-0.002
 agreement-0.002
 unison-0.002
 eyes-0.002
 fins-0.002
TokenL
Token ID43
Feature activation+1.22e-03

Pos logit contributions
w+0.002
wl+0.002
z+0.001
rob+0.001
t+0.001
Neg logit contributions
 eyes-0.002
 agreement-0.002
 amaz-0.002
 unison-0.002
 hands-0.002
Tokenily
Token ID813
Feature activation-4.38e-03

Pos logit contributions
w+0.002
iky+0.002
ooky+0.002
ane+0.002
uch+0.002
Neg logit contributions
 amaz-0.002
 unison-0.002
 agreement-0.002
 Mama-0.002
 mum-0.002
Token.
Token ID13
Feature activation+2.22e-03

Pos logit contributions
iky+0.004
ane+0.003
w+0.003
bows+0.003
ooky+0.003
Neg logit contributions
 agreement-0.002
 unison-0.002
 amaz-0.002
 Mama-0.002
 son-0.002
Token She
Token ID1375
Feature activation+6.63e-03

Pos logit contributions
ane+0.003
bows+0.003
w+0.003
t+0.003
iky+0.003
Neg logit contributions
 unison-0.004
 amaz-0.004
 agreement-0.004
 son-0.004
 towels-0.003
Token was
Token ID373
Feature activation+6.72e-03

Pos logit contributions
w+0.009
iky+0.009
uch+0.009
working+0.009
ane+0.008
Neg logit contributions
 agreement-0.011
 unison-0.011
 amaz-0.011
 Mum-0.010
 Mama-0.010
Token filled
Token ID5901
Feature activation-4.73e-03

Pos logit contributions
working+0.010
bows+0.010
z+0.009
iky+0.009
mod+0.009
Neg logit contributions
 unison-0.010
 towels-0.010
 agreement-0.009
 towel-0.009
 son-0.009
Token with
Token ID351
Feature activation+2.76e-03

Pos logit contributions
 Mum+0.006
 Mama+0.006
 mum+0.006
 agreement+0.006
 amaz+0.006
Neg logit contributions
w-0.008
iky-0.008
ooky-0.008
ane-0.008
uch-0.008
Token amaz
Token ID40642
Feature activation-9.48e-02

Pos logit contributions
w+0.004
bows+0.004
hy+0.004
iky+0.004
mod+0.003
Neg logit contributions
 unison-0.004
 Carlos-0.004
 agreement-0.003
 Sam-0.003
 son-0.003
Tokenement
Token ID972
Feature activation+3.20e-02

Pos logit contributions
 amaz+0.134
 Mum+0.131
 mum+0.129
 eyes+0.128
 Dad+0.127
Neg logit contributions
w-0.167
uch-0.162
working-0.160
rob-0.155
iky-0.151
Token.
Token ID13
Feature activation+2.53e-03

Pos logit contributions
ane+0.057
working+0.057
iky+0.057
w+0.057
bows+0.056
Neg logit contributions
 agreement-0.042
 unison-0.041
 amaz-0.039
 Mama-0.036
 towels-0.036
Token
Token ID198
Feature activation+2.42e-03

Pos logit contributions
w+0.004
ane+0.004
iky+0.004
t+0.004
uch+0.004
Neg logit contributions
 unison-0.003
 agreement-0.003
 amaz-0.003
 son-0.003
 mum-0.003
Token
Token ID198
Feature activation+2.45e-03

Pos logit contributions
w+0.006
iky+0.006
uch+0.006
ooky+0.006
ane+0.006
Neg logit contributions
 Mama-0.002
 Mum-0.002
 agreement-0.002
 mum-0.002
 unison-0.002
TokenAt
Token ID2953
Feature activation-3.65e-03

Pos logit contributions
rob+0.005
ane+0.005
w+0.005
uch+0.005
ooky+0.005
Neg logit contributions
 agreement-0.003
 unison-0.003
 Mama-0.003
 mum-0.002
 Mum-0.002
Token the
Token ID262
Feature activation-4.25e-03

Pos logit contributions
 unison+0.003
 fins+0.003
 amaz+0.003
 agreement+0.003
 son+0.003
Neg logit contributions
working-0.005
w-0.005
rob-0.005
organized-0.005
mod-0.004
Token bush
Token ID24484
Feature activation-7.97e-03

Pos logit contributions
 unison+0.006
 agreement+0.006
 Mum+0.006
 amaz+0.006
 Mama+0.006
Neg logit contributions
w-0.007
working-0.007
ooky-0.007
uch-0.007
iky-0.006
Token and
Token ID290
Feature activation-2.87e-03

Pos logit contributions
 unison+0.013
 amaz+0.012
 agreement+0.012
 fins+0.012
 son+0.012
Neg logit contributions
t-0.013
z-0.011
ooky-0.011
bows-0.011
ane-0.011
Token watched
Token ID7342
Feature activation-1.69e-02

Pos logit contributions
 unison+0.005
 agreement+0.004
 Deal+0.004
 amaz+0.004
 fins+0.004
Neg logit contributions
w-0.004
z-0.004
whe-0.004
working-0.004
bows-0.004
Token in
Token ID287
Feature activation+1.96e-03

Pos logit contributions
 unison+0.023
 amaz+0.022
 fins+0.021
 agreement+0.021
 towels+0.021
Neg logit contributions
t-0.025
z-0.025
free-0.025
working-0.024
w-0.024
Token amaz
Token ID40642
Feature activation-1.01e-01

Pos logit contributions
hy+0.003
z+0.003
he+0.003
free+0.003
w+0.003
Neg logit contributions
 fins-0.002
 son-0.002
 Pete-0.002
 Finn-0.002
 Carlos-0.002
Tokenement
Token ID972
Feature activation+2.80e-02

Pos logit contributions
 Mama+0.187
 agreement+0.184
 unison+0.184
 Mum+0.184
 son+0.183
Neg logit contributions
w-0.125
iky-0.122
ooky-0.122
ane-0.117
uch-0.116
Token as
Token ID355
Feature activation-1.10e-02

Pos logit contributions
z+0.047
working+0.046
bows+0.046
iky+0.045
t+0.045
Neg logit contributions
 agreement-0.040
 amaz-0.039
 unison-0.038
 Mama-0.034
 Papa-0.034
Token it
Token ID340
Feature activation-2.09e-03

Pos logit contributions
 unison+0.012
 agreement+0.012
 amaz+0.011
 fins+0.011
 hand+0.010
Neg logit contributions
working-0.021
iky-0.020
bows-0.020
w-0.020
ooky-0.020
Token flew
Token ID13112
Feature activation+4.28e-03

Pos logit contributions
 agreement+0.004
 Dad+0.004
 Mum+0.004
 son+0.004
 unison+0.004
Neg logit contributions
w-0.003
z-0.003
uch-0.002
t-0.002
ooky-0.002
Token away
Token ID1497
Feature activation+3.88e-03

Pos logit contributions
t+0.006
w+0.006
z+0.005
bows+0.005
iky+0.005
Neg logit contributions
 son-0.008
 amaz-0.008
 unison-0.008
 dad-0.007
 mum-0.007