TOP ACTIVATIONS
MAX = 4.009

ily said , " B run o , you are a
go ! Everyone watched in amaz ement as it took off
before dinner . <|endoftext|> S aman tha was a lazy girl
her mom my watched in amaz ement as the machine flew
friends laughed at their cle vern ess and went off to
YOU ARE E ATING ALL THE S UG AR ! THAT
smile . S aman tha laughed , happy that
. Tim my watched with amaz ement as he saw stars
things can come from bad situations . <|endoftext|> Once upon a
. His mom and dad woke up . They were not
air . He watched with amaz ement as the z ig
egg ! He watched in amaz ement as the egg began
mad . " Who are you ? Why did
. S aman tha was amazed and she
that !" Charl otte thanked her mother for
parents could lead to dangerous situations . From that day on
's mom said , " Don 't cry , boys .
out and said , " B oys , you should not
. She said , " Don 't worry , sweet ie
mad . " Who are you ?" he shouted

MIN ACTIVATIONS
MIN = -3.357

bending her body like a gymn ast . Lily tried her
time there was a little av ac ado . It was
, there was a loud av ac ado . It was
girl who didn 't like av oc ados . She always
has four legs ?' and ' What color is the sky
ongo didn 't like wet av oc ados , so he
take it with you wherever you go ." Billy
tree could see lots of av oc ados that the supplier
can carry your guitar wherever you want . It 's yours
was very blue . Everywhere you looked , it was painted
full of life . Everywhere you looked there were crazy creatures
a hairy palace . Everywhere you looked there were furry walls
. Lily loved to eat av oc ados , they were
remembered his favourite movie , ' T angled '. He got
day on , Lily loved av oc ados and would ask
' birth day ', and ' you '. They decided to
Tim my loved to eat av oc ados , they were
B ongo loved to eat av oc ados . One day
saw a picture of a hamb urger and fries , and
. John felt jealous . ' That 's not fair !'

INTERVAL 2.167 - 4.009
CONTAINS 0.031%

two friends had to take shelter inside . It was storm
shouts . " Who rang the bell ? Who
time to go home . As Tim my said goodbye to
. He called it " stone hop ", and it was
?" he asks . " Who cut the cake ?"

INTERVAL 0.326 - 2.167
CONTAINS 28.995%

story is that it 's important to listen to your parents
That ’ s wonderful ,
it a try . The black smith smiled and said ,
Lily were very happy . They loved the ze br as
The mouse looked up at her and said ,

INTERVAL -1.515 - 0.326
CONTAINS 70.245%

fly very high in the sky !" The little
house . She knocked on the door and said , "
of other rabbits inside , all hopping around and having fun
coon !" said Po ppy with a gasp .
great big Fer ris wheel and all the other fun rides

INTERVAL -3.357 - -1.515
CONTAINS 0.729%

idea . He remembered that his dad had taught him how
the red car . Where is it ?" They
told her that if she ever needed somebody to talk to
red toy car . She decided to push it around the
house , there lived a happy bird . The bird loved
Tokenily
Token ID813
Feature activation+7.41e-01

Pos logit contributions
 snowball+0.010
 wipes+0.010
 mat+0.010
 ramp+0.010
uddle+0.010
Neg logit contributions
year-0.005
ull-0.004
 ruled-0.004
abby-0.004
epad-0.004
Token said
Token ID531
Feature activation+1.15e-01

Pos logit contributions
 mat+0.019
 wipes+0.019
 snowball+0.018
 Velvet+0.018
uddle+0.018
Neg logit contributions
chard-0.021
epad-0.019
year-0.019
 known-0.019
ho-0.017
Token,
Token ID11
Feature activation-7.74e-01

Pos logit contributions
 mat+0.003
 snowball+0.003
 wipes+0.003
 Velvet+0.002
uddle+0.002
Neg logit contributions
chard-0.003
 known-0.003
epad-0.003
thank-0.003
ho-0.003
Token "
Token ID366
Feature activation+1.74e-01

Pos logit contributions
chard+0.024
 known+0.024
epad+0.023
thank+0.022
year+0.022
Neg logit contributions
 mat-0.016
 Velvet-0.016
 snowball-0.015
 wipes-0.015
 ramp-0.014
TokenB
Token ID33
Feature activation+1.72e+00

Pos logit contributions
 mat+0.005
 snowball+0.005
 wipes+0.005
uddle+0.005
 ramp+0.005
Neg logit contributions
chard-0.004
 known-0.004
thank-0.004
epad-0.004
ho-0.004
Tokenrun
Token ID5143
Feature activation+3.41e+00

Pos logit contributions
 mat+0.042
 wipes+0.042
 snowball+0.042
 ramp+0.039
 stret+0.039
Neg logit contributions
ho-0.046
chard-0.043
working-0.042
epad-0.042
year-0.042
Tokeno
Token ID78
Feature activation+1.26e+00

Pos logit contributions
 mat+0.084
 snowball+0.079
 ramp+0.079
 wipes+0.079
uddle+0.078
Neg logit contributions
 known-0.094
chard-0.089
ho-0.086
year-0.085
thank-0.085
Token,
Token ID11
Feature activation+5.14e-03

Pos logit contributions
 mat+0.022
 snowball+0.022
 wipes+0.022
 Velvet+0.021
ig+0.021
Neg logit contributions
chard-0.046
epad-0.044
year-0.043
 known-0.042
nam-0.041
Token you
Token ID345
Feature activation+8.43e-01

Pos logit contributions
 mat+0.000
 snowball+0.000
 wipes+0.000
uddle+0.000
 Velvet+0.000
Neg logit contributions
chard-0.000
 known-0.000
ho-0.000
epad-0.000
thank-0.000
Token are
Token ID389
Feature activation-3.32e-01

Pos logit contributions
 wipes+0.017
 snowball+0.016
 mat+0.016
 turns+0.016
 Velvet+0.015
Neg logit contributions
chard-0.028
ho-0.026
year-0.026
thank-0.026
epad-0.026
Token a
Token ID257
Feature activation+2.70e-01

Pos logit contributions
ho+0.009
chard+0.009
thank+0.008
epad+0.008
year+0.008
Neg logit contributions
 Velvet-0.008
 mat-0.008
 snowball-0.008
 wipes-0.008
Anna-0.008
Token go
Token ID467
Feature activation+5.71e-01

Pos logit contributions
 mat+0.020
 snowball+0.019
 wipes+0.018
 Velvet+0.017
 tries+0.017
Neg logit contributions
year-0.023
chard-0.021
ho-0.021
epad-0.021
 known-0.021
Token!
Token ID0
Feature activation+8.65e-01

Pos logit contributions
 mat+0.016
 snowball+0.016
 wipes+0.015
 Velvet+0.015
uddle+0.015
Neg logit contributions
chard-0.014
 known-0.013
epad-0.013
ho-0.013
year-0.012
Token Everyone
Token ID11075
Feature activation+1.36e-02

Pos logit contributions
 mat+0.019
 snowball+0.019
 Velvet+0.018
 wipes+0.018
Anna+0.017
Neg logit contributions
chard-0.026
 known-0.025
epad-0.025
thank-0.024
ho-0.024
Token watched
Token ID7342
Feature activation+1.63e+00

Pos logit contributions
 tries+0.000
 wipes+0.000
 turns+0.000
 mat+0.000
 snowball+0.000
Neg logit contributions
chard-0.000
epad-0.000
 known-0.000
thank-0.000
ho-0.000
Token in
Token ID287
Feature activation-6.45e-01

Pos logit contributions
 mat+0.037
 snowball+0.035
 wipes+0.035
 mats+0.034
 turns+0.034
Neg logit contributions
chard-0.052
 known-0.050
thank-0.045
epad-0.044
ho-0.044
Token amaz
Token ID40642
Feature activation+3.34e+00

Pos logit contributions
chard+0.020
 known+0.018
ho+0.017
thank+0.017
year+0.017
Neg logit contributions
 mat-0.016
 snowball-0.016
 wipes-0.015
 mats-0.014
 turns-0.014
Tokenement
Token ID972
Feature activation+6.78e-01

Pos logit contributions
 mat+0.074
 snowball+0.073
 wipes+0.071
uddle+0.068
 Velvet+0.068
Neg logit contributions
chard-0.102
 known-0.097
epad-0.092
ho-0.091
thank-0.091
Token as
Token ID355
Feature activation+9.88e-01

Pos logit contributions
 mat+0.015
 snowball+0.013
 wipes+0.013
 mats+0.013
uddle+0.013
Neg logit contributions
chard-0.023
 known-0.022
year-0.020
thank-0.019
epad-0.019
Token it
Token ID340
Feature activation-4.75e-01

Pos logit contributions
 mat+0.020
 snowball+0.019
 turns+0.019
 wipes+0.019
Anna+0.018
Neg logit contributions
chard-0.032
thank-0.030
ho-0.030
 known-0.029
melon-0.029
Token took
Token ID1718
Feature activation+2.48e-01

Pos logit contributions
chard+0.013
epad+0.012
 known+0.012
year+0.011
thank+0.011
Neg logit contributions
 mat-0.012
 wipes-0.012
 snowball-0.012
 turns-0.012
 tries-0.012
Token off
Token ID572
Feature activation+9.07e-01

Pos logit contributions
 snowball+0.006
 mat+0.006
 wipes+0.006
 turns+0.006
 Velvet+0.006
Neg logit contributions
chard-0.007
 known-0.006
ho-0.006
epad-0.006
thank-0.006
Token before
Token ID878
Feature activation+2.44e-01

Pos logit contributions
chard+0.021
 known+0.021
year+0.019
ully+0.019
ho+0.019
Neg logit contributions
 mat-0.016
 wipes-0.014
 mats-0.014
uddle-0.013
 snowball-0.013
Token dinner
Token ID8073
Feature activation-1.09e-01

Pos logit contributions
 mat+0.006
 Velvet+0.005
 gas+0.005
 wipes+0.005
 snowball+0.005
Neg logit contributions
chard-0.008
year-0.007
 known-0.007
ho-0.007
epad-0.007
Token.
Token ID13
Feature activation+5.43e-02

Pos logit contributions
chard+0.003
 known+0.003
year+0.003
epad+0.003
years+0.003
Neg logit contributions
 mat-0.003
 wipes-0.002
uddle-0.002
 snowball-0.002
 mats-0.002
Token<|endoftext|>
Token ID50256
Feature activation+7.28e-01

Pos logit contributions
 mat+0.001
 snowball+0.001
 wipes+0.001
 Velvet+0.001
Anna+0.001
Neg logit contributions
chard-0.002
 known-0.001
epad-0.001
ho-0.001
thank-0.001
TokenS
Token ID50
Feature activation+9.43e-01

Pos logit contributions
 mat+0.017
 wipes+0.016
 snowball+0.016
uddle+0.016
 mats+0.015
Neg logit contributions
chard-0.021
 known-0.021
 full-0.019
 ruled-0.019
thank-0.019
Tokenaman
Token ID10546
Feature activation+3.28e+00

Pos logit contributions
ara+0.022
 mat+0.021
 snowball+0.020
 wipes+0.020
uddle+0.020
Neg logit contributions
chard-0.028
 known-0.027
epad-0.026
year-0.025
thank-0.025
Tokentha
Token ID12898
Feature activation+1.16e+00

Pos logit contributions
 mat+0.088
 snowball+0.087
ig+0.086
ara+0.086
uddle+0.086
Neg logit contributions
chard-0.077
 known-0.076
year-0.075
epad-0.075
thank-0.069
Token was
Token ID373
Feature activation+5.87e-01

Pos logit contributions
 wipes+0.022
ara+0.021
 snowball+0.021
ig+0.020
 tries+0.020
Neg logit contributions
chard-0.041
epad-0.038
 known-0.038
nam-0.037
year-0.036
Token a
Token ID257
Feature activation+7.64e-01

Pos logit contributions
 snowball+0.017
 mat+0.016
 wipes+0.016
 Velvet+0.015
 tries+0.015
Neg logit contributions
chard-0.015
ho-0.014
 known-0.013
epad-0.013
spe-0.013
Token lazy
Token ID16931
Feature activation+2.85e-01

Pos logit contributions
 snowball+0.019
 mat+0.019
 wipes+0.018
uddle+0.017
 Velvet+0.017
Neg logit contributions
chard-0.022
 known-0.021
epad-0.020
thank-0.019
ho-0.019
Token girl
Token ID2576
Feature activation-7.60e-01

Pos logit contributions
 mat+0.005
 snowball+0.005
 wipes+0.005
 Velvet+0.005
uddle+0.005
Neg logit contributions
chard-0.010
 known-0.009
epad-0.009
year-0.009
thank-0.009
Token her
Token ID607
Feature activation+1.21e+00

Pos logit contributions
chard+0.002
ho+0.002
melon+0.002
thank+0.002
 known+0.002
Neg logit contributions
 wipes-0.002
 tries-0.002
 turns-0.002
 mat-0.002
 Velvet-0.002
Token mom
Token ID1995
Feature activation+4.50e-01

Pos logit contributions
 mat+0.030
 snowball+0.028
 wipes+0.028
uddle+0.027
 Velvet+0.027
Neg logit contributions
chard-0.035
 known-0.034
year-0.034
epad-0.032
thank-0.031
Tokenmy
Token ID1820
Feature activation+7.81e-02

Pos logit contributions
 wipes+0.011
ara+0.010
 snowball+0.010
 tries+0.010
 mat+0.009
Neg logit contributions
chard-0.014
year-0.012
 known-0.012
epad-0.012
 messenger-0.012
Token watched
Token ID7342
Feature activation+8.28e-01

Pos logit contributions
 wipes+0.002
 tries+0.002
 snowball+0.002
 mat+0.002
ara+0.002
Neg logit contributions
chard-0.002
 known-0.002
year-0.002
epad-0.002
wered-0.002
Token in
Token ID287
Feature activation-7.10e-01

Pos logit contributions
 mat+0.019
 wipes+0.019
 snowball+0.019
 Velvet+0.018
 mats+0.017
Neg logit contributions
 known-0.026
chard-0.025
ho-0.022
epad-0.022
year-0.021
Token amaz
Token ID40642
Feature activation+3.28e+00

Pos logit contributions
chard+0.020
 known+0.019
ho+0.018
epad+0.017
year+0.017
Neg logit contributions
 mat-0.018
 snowball-0.018
 wipes-0.018
 Velvet-0.016
 mats-0.016
Tokenement
Token ID972
Feature activation+3.58e-01

Pos logit contributions
 mat+0.079
 snowball+0.078
 wipes+0.074
 Velvet+0.074
 ramp+0.073
Neg logit contributions
chard-0.093
 known-0.088
epad-0.084
year-0.084
thank-0.084
Token as
Token ID355
Feature activation+8.97e-01

Pos logit contributions
 mat+0.008
 snowball+0.008
 wipes+0.008
uddle+0.008
 mats+0.008
Neg logit contributions
chard-0.011
 known-0.011
year-0.010
epad-0.010
thank-0.010
Token the
Token ID262
Feature activation-3.73e-01

Pos logit contributions
 mat+0.020
 wipes+0.020
 snowball+0.019
 Velvet+0.019
Anna+0.018
Neg logit contributions
chard-0.027
ho-0.025
thank-0.025
 known-0.024
melon-0.024
Token machine
Token ID4572
Feature activation-1.87e-01

Pos logit contributions
chard+0.012
 known+0.011
epad+0.010
year+0.010
ho+0.010
Neg logit contributions
 mat-0.009
 snowball-0.009
 wipes-0.009
uddle-0.008
 turns-0.008
Token flew
Token ID13112
Feature activation+2.56e-01

Pos logit contributions
chard+0.005
year+0.005
 known+0.005
epad+0.004
years+0.004
Neg logit contributions
 mat-0.005
 wipes-0.005
 turns-0.005
 gas-0.004
 tries-0.004
Token friends
Token ID2460
Feature activation+9.76e-02

Pos logit contributions
 snowball+0.008
 mat+0.008
 wipes+0.008
uddle+0.008
ig+0.008
Neg logit contributions
chard-0.011
epad-0.010
 known-0.010
thank-0.010
year-0.009
Token laughed
Token ID13818
Feature activation-8.95e-01

Pos logit contributions
 wipes+0.003
 turns+0.003
ig+0.003
 tries+0.003
 mat+0.002
Neg logit contributions
chard-0.003
epad-0.002
 known-0.002
working-0.002
thank-0.002
Token at
Token ID379
Feature activation+4.89e-01

Pos logit contributions
chard+0.032
 known+0.031
epad+0.029
ho+0.028
thank+0.028
Neg logit contributions
 snowball-0.017
 wipes-0.017
 mat-0.016
 turns-0.015
 Velvet-0.015
Token their
Token ID511
Feature activation+4.72e-01

Pos logit contributions
 snowball+0.014
 mat+0.013
 wipes+0.013
 Velvet+0.013
 turns+0.012
Neg logit contributions
chard-0.012
epad-0.011
 known-0.011
year-0.011
thank-0.011
Token cle
Token ID1190
Feature activation+3.36e-03

Pos logit contributions
 mat+0.012
 snowball+0.012
 wipes+0.011
uddle+0.011
 mats+0.011
Neg logit contributions
chard-0.014
 known-0.013
year-0.012
epad-0.012
thank-0.012
Tokenvern
Token ID933
Feature activation+3.27e+00

Pos logit contributions
 mat+0.000
 snowball+0.000
 wipes+0.000
 Velvet+0.000
uddle+0.000
Neg logit contributions
chard-0.000
 known-0.000
epad-0.000
ho-0.000
thank-0.000
Tokeness
Token ID408
Feature activation+7.74e-01

Pos logit contributions
 mat+0.096
 snowball+0.094
uddle+0.093
ig+0.090
 ramp+0.090
Neg logit contributions
chard-0.079
 known-0.071
year-0.061
 ruled-0.061
epad-0.060
Token and
Token ID290
Feature activation+7.53e-01

Pos logit contributions
 mat+0.014
 snowball+0.014
ig+0.014
 wipes+0.013
 ramp+0.012
Neg logit contributions
chard-0.028
 known-0.026
year-0.025
thank-0.025
epad-0.024
Token went
Token ID1816
Feature activation+5.09e-01

Pos logit contributions
 turns+0.019
 wipes+0.019
 tries+0.018
 Velvet+0.018
 mat+0.018
Neg logit contributions
chard-0.020
epad-0.020
year-0.018
working-0.017
 image-0.017
Token off
Token ID572
Feature activation+7.71e-01

Pos logit contributions
 mat+0.015
 snowball+0.015
 wipes+0.014
uddle+0.014
 Velvet+0.014
Neg logit contributions
chard-0.012
 known-0.011
epad-0.011
ho-0.011
thank-0.011
Token to
Token ID284
Feature activation+2.95e-01

Pos logit contributions
 mat+0.018
 snowball+0.017
 wipes+0.016
 Velvet+0.016
 tries+0.016
Neg logit contributions
chard-0.023
year-0.021
 known-0.021
epad-0.021
 image-0.021
Token YOU
Token ID7013
Feature activation+3.82e-01

Pos logit contributions
 snowball+0.033
 mat+0.032
 wipes+0.031
 ramp+0.029
uddle+0.029
Neg logit contributions
chard-0.040
 known-0.039
ho-0.037
epad-0.036
 full-0.036
Token ARE
Token ID15986
Feature activation+6.59e-01

Pos logit contributions
 mat+0.008
 snowball+0.008
 wipes+0.007
 Velvet+0.007
uddle+0.007
Neg logit contributions
chard-0.013
 known-0.012
epad-0.011
thank-0.011
ho-0.011
Token E
Token ID412
Feature activation+5.65e-01

Pos logit contributions
 mat+0.014
 snowball+0.014
 wipes+0.014
uddle+0.013
 ramp+0.013
Neg logit contributions
chard-0.021
 known-0.020
epad-0.019
ho-0.019
thank-0.018
TokenATING
Token ID33881
Feature activation-5.27e-01

Pos logit contributions
 snowball+0.014
 mat+0.014
 wipes+0.013
 ramp+0.013
 Velvet+0.013
Neg logit contributions
chard-0.015
 known-0.014
thank-0.014
ho-0.014
epad-0.014
Token ALL
Token ID11096
Feature activation+1.08e+00

Pos logit contributions
chard+0.014
 known+0.014
epad+0.013
ho+0.013
thank+0.013
Neg logit contributions
 mat-0.013
 snowball-0.013
 wipes-0.013
uddle-0.012
 Velvet-0.012
Token THE
Token ID3336
Feature activation+3.25e+00

Pos logit contributions
 mat+0.023
 snowball+0.022
 wipes+0.022
 Velvet+0.022
ig+0.021
Neg logit contributions
chard-0.035
 known-0.032
epad-0.031
thank-0.031
ho-0.030
Token S
Token ID311
Feature activation+5.43e-01

Pos logit contributions
 mat+0.082
 snowball+0.081
 wipes+0.079
 Velvet+0.077
uddle+0.076
Neg logit contributions
chard-0.090
 known-0.084
epad-0.080
year-0.078
thank-0.077
TokenUG
Token ID7340
Feature activation-4.77e-01

Pos logit contributions
 snowball+0.013
 mat+0.012
 wipes+0.012
 ramp+0.012
 Velvet+0.011
Neg logit contributions
chard-0.016
 known-0.015
epad-0.014
ho-0.014
thank-0.014
TokenAR
Token ID1503
Feature activation+6.18e-01

Pos logit contributions
chard+0.016
 known+0.015
nam+0.014
epad+0.014
year+0.014
Neg logit contributions
 mat-0.011
 Velvet-0.010
 snowball-0.010
ara-0.009
Anna-0.009
Token!
Token ID0
Feature activation+2.01e+00

Pos logit contributions
 mat+0.013
 snowball+0.013
uddle+0.012
 wipes+0.012
 Velvet+0.012
Neg logit contributions
chard-0.020
 known-0.019
epad-0.018
thank-0.018
year-0.018
Token THAT
Token ID14603
Feature activation+8.27e-01

Pos logit contributions
 mat+0.049
 snowball+0.049
 wipes+0.047
uddle+0.045
 ramp+0.045
Neg logit contributions
chard-0.056
 known-0.054
epad-0.051
year-0.050
thank-0.050
Token smile
Token ID8212
Feature activation-7.04e-01

Pos logit contributions
 snowball+0.014
 mat+0.014
 wipes+0.013
 ramp+0.013
uddle+0.012
Neg logit contributions
chard-0.018
 known-0.017
year-0.017
epad-0.016
thank-0.016
Token.
Token ID13
Feature activation+2.96e-01

Pos logit contributions
chard+0.024
 known+0.022
nam+0.021
years+0.021
year+0.020
Neg logit contributions
 mat-0.016
 snowball-0.015
ig-0.015
 wipes-0.014
ara-0.013
Token
Token ID198
Feature activation+2.54e-01

Pos logit contributions
 mat+0.007
 snowball+0.007
 wipes+0.007
 Velvet+0.007
 ramp+0.007
Neg logit contributions
chard-0.008
 known-0.008
epad-0.008
thank-0.008
ho-0.008
Token
Token ID198
Feature activation+3.74e-01

Pos logit contributions
 mat+0.006
 wipes+0.006
 snowball+0.006
uddle+0.006
 ramp+0.006
Neg logit contributions
chard-0.007
 known-0.007
year-0.006
ho-0.006
 full-0.006
TokenS
Token ID50
Feature activation+8.11e-01

Pos logit contributions
 mat+0.008
 snowball+0.008
 wipes+0.008
 Velvet+0.008
uddle+0.008
Neg logit contributions
chard-0.011
 known-0.011
epad-0.010
ho-0.010
thank-0.010
Tokenaman
Token ID10546
Feature activation+3.20e+00

Pos logit contributions
 mat+0.013
 snowball+0.013
 wipes+0.013
uddle+0.012
 Velvet+0.012
Neg logit contributions
chard-0.030
 known-0.028
epad-0.027
year-0.027
ho-0.027
Tokentha
Token ID12898
Feature activation+1.32e+00

Pos logit contributions
 mat+0.093
 snowball+0.092
 wipes+0.091
uddle+0.088
ig+0.088
Neg logit contributions
chard-0.073
 known-0.069
epad-0.067
year-0.065
thank-0.063
Token laughed
Token ID13818
Feature activation-1.47e+00

Pos logit contributions
 wipes+0.036
 snowball+0.035
 mat+0.034
 tries+0.034
ara+0.033
Neg logit contributions
chard-0.036
epad-0.032
 known-0.032
year-0.031
nam-0.030
Token,
Token ID11
Feature activation+2.44e-01

Pos logit contributions
 known+0.054
chard+0.052
epad+0.049
ho+0.047
year+0.047
Neg logit contributions
 snowball-0.026
 mat-0.025
 wipes-0.025
 Velvet-0.024
uddle-0.022
Token happy
Token ID3772
Feature activation-7.55e-01

Pos logit contributions
 snowball+0.005
 wipes+0.005
 mat+0.005
 Velvet+0.005
 tries+0.005
Neg logit contributions
epad-0.007
year-0.007
chard-0.007
 known-0.007
working-0.007
Token that
Token ID326
Feature activation-5.55e-01

Pos logit contributions
chard+0.026
 known+0.025
year+0.023
epad+0.022
nam+0.021
Neg logit contributions
 mat-0.017
 wipes-0.015
 snowball-0.015
 turns-0.014
 mats-0.014
Token.
Token ID13
Feature activation+3.66e-01

Pos logit contributions
 snowball+0.032
 mat+0.031
 wipes+0.030
 Velvet+0.030
uddle+0.030
Neg logit contributions
chard-0.024
 known-0.022
ho-0.021
epad-0.021
thank-0.020
Token Tim
Token ID5045
Feature activation+3.25e-01

Pos logit contributions
 snowball+0.008
 mat+0.008
 Velvet+0.008
 wipes+0.007
Anna+0.007
Neg logit contributions
chard-0.011
 known-0.011
epad-0.011
thank-0.011
year-0.010
Tokenmy
Token ID1820
Feature activation-1.60e-01

Pos logit contributions
 snowball+0.010
 wipes+0.010
 mat+0.010
 Velvet+0.009
 tries+0.009
Neg logit contributions
chard-0.007
 known-0.007
epad-0.006
year-0.006
ho-0.006
Token watched
Token ID7342
Feature activation+1.03e+00

Pos logit contributions
chard+0.004
epad+0.004
 known+0.004
year+0.004
ho+0.003
Neg logit contributions
 wipes-0.005
 snowball-0.005
 mat-0.004
 tries-0.004
 turns-0.004
Token with
Token ID351
Feature activation-9.50e-01

Pos logit contributions
 snowball+0.025
 mat+0.024
 wipes+0.023
 Velvet+0.022
 ramp+0.021
Neg logit contributions
 known-0.032
chard-0.032
ho-0.028
year-0.027
epad-0.027
Token amaz
Token ID40642
Feature activation+3.19e+00

Pos logit contributions
chard+0.024
 known+0.022
epad+0.021
ho+0.021
thank+0.020
Neg logit contributions
 mat-0.027
 snowball-0.026
 wipes-0.026
uddle-0.025
 Velvet-0.025
Tokenement
Token ID972
Feature activation+1.98e-01

Pos logit contributions
 mat+0.073
 snowball+0.072
 wipes+0.070
 Velvet+0.067
uddle+0.066
Neg logit contributions
chard-0.095
 known-0.090
epad-0.086
ho-0.085
thank-0.085
Token as
Token ID355
Feature activation+1.01e+00

Pos logit contributions
 mat+0.004
 snowball+0.004
 wipes+0.004
ig+0.004
uddle+0.004
Neg logit contributions
chard-0.006
 known-0.006
thank-0.005
year-0.005
epad-0.005
Token he
Token ID339
Feature activation-3.52e-01

Pos logit contributions
 snowball+0.020
 mat+0.019
 wipes+0.019
 Velvet+0.018
 gas+0.018
Neg logit contributions
chard-0.033
thank-0.031
ho-0.031
 harvested-0.031
 known-0.030
Token saw
Token ID2497
Feature activation+5.47e-01

Pos logit contributions
chard+0.009
epad+0.009
melon+0.008
thank+0.008
 known+0.008
Neg logit contributions
 wipes-0.009
 snowball-0.009
ig-0.009
 tries-0.009
 turns-0.009
Token stars
Token ID5788
Feature activation-5.92e-01

Pos logit contributions
 snowball+0.013
 wipes+0.013
 mat+0.012
 gas+0.011
 turns+0.011
Neg logit contributions
chard-0.016
epad-0.015
ho-0.015
 known-0.015
thank-0.015
Token things
Token ID1243
Feature activation-4.32e-01

Pos logit contributions
chard+0.038
 known+0.036
epad+0.034
ho+0.034
year+0.034
Neg logit contributions
 mat-0.018
 snowball-0.018
 wipes-0.017
uddle-0.016
 Velvet-0.016
Token can
Token ID460
Feature activation-1.36e+00

Pos logit contributions
chard+0.015
 known+0.015
year+0.014
epad+0.014
thank+0.014
Neg logit contributions
 mat-0.008
 wipes-0.007
 snowball-0.007
 Velvet-0.007
uddle-0.007
Token come
Token ID1282
Feature activation-4.64e-01

Pos logit contributions
year+0.045
chard+0.044
epad+0.039
years+0.039
 known+0.038
Neg logit contributions
 wipes-0.028
 mat-0.027
 ramp-0.026
 Velvet-0.025
 snowball-0.025
Token from
Token ID422
Feature activation-6.16e-01

Pos logit contributions
chard+0.013
year+0.012
thank+0.012
ho+0.012
 known+0.012
Neg logit contributions
 turns-0.011
 snowball-0.011
 stitches-0.010
 wipes-0.010
 scar-0.010
Token bad
Token ID2089
Feature activation+4.15e-01

Pos logit contributions
year+0.020
chard+0.018
ho+0.018
spe+0.017
epad+0.017
Neg logit contributions
 tries-0.013
 gas-0.013
 turns-0.012
 mat-0.012
 wipes-0.012
Token situations
Token ID7445
Feature activation+3.15e+00

Pos logit contributions
 mat+0.009
 snowball+0.009
 wipes+0.009
uddle+0.008
 Velvet+0.008
Neg logit contributions
chard-0.013
 known-0.012
epad-0.011
year-0.011
ho-0.011
Token.
Token ID13
Feature activation-2.64e-01

Pos logit contributions
 mat+0.082
 snowball+0.082
 wipes+0.080
uddle+0.076
 Velvet+0.076
Neg logit contributions
chard-0.083
 known-0.079
epad-0.074
ho-0.073
thank-0.073
Token<|endoftext|>
Token ID50256
Feature activation+6.88e-01

Pos logit contributions
chard+0.007
 known+0.007
epad+0.007
thank+0.007
 full+0.007
Neg logit contributions
 mat-0.006
 snowball-0.006
 wipes-0.006
uddle-0.006
 ramp-0.006
TokenOnce
Token ID7454
Feature activation-1.98e-01

Pos logit contributions
 mat+0.019
 wipes+0.019
 snowball+0.018
uddle+0.018
ig+0.018
Neg logit contributions
chard-0.016
 known-0.016
 full-0.016
thank-0.015
 ruled-0.015
Token upon
Token ID2402
Feature activation+7.66e-01

Pos logit contributions
chard+0.004
 known+0.004
 full+0.003
ho+0.003
epad+0.003
Neg logit contributions
 snowball-0.007
 mat-0.006
 wipes-0.006
 ramp-0.006
 mats-0.006
Token a
Token ID257
Feature activation+8.18e-01

Pos logit contributions
 mat+0.024
 wipes+0.024
uddle+0.024
 snowball+0.023
ig+0.023
Neg logit contributions
chard-0.016
 known-0.015
epad-0.014
 full-0.014
thank-0.014
Token.
Token ID13
Feature activation+2.25e-01

Pos logit contributions
chard+0.024
 known+0.022
epad+0.021
ho+0.021
thank+0.020
Neg logit contributions
 mat-0.026
 snowball-0.026
 wipes-0.025
 Velvet-0.024
uddle-0.024
Token His
Token ID2399
Feature activation-1.12e-01

Pos logit contributions
 mat+0.006
 snowball+0.006
 wipes+0.006
 Velvet+0.005
uddle+0.005
Neg logit contributions
chard-0.006
 known-0.006
epad-0.005
ho-0.005
thank-0.005
Token mom
Token ID1995
Feature activation+4.64e-01

Pos logit contributions
chard+0.003
 known+0.003
year+0.003
thank+0.003
epad+0.003
Neg logit contributions
 mat-0.003
 snowball-0.003
 wipes-0.003
uddle-0.003
 Velvet-0.003
Token and
Token ID290
Feature activation+2.87e-01

Pos logit contributions
 snowball+0.014
 mat+0.014
 wipes+0.014
 Velvet+0.013
uddle+0.013
Neg logit contributions
chard-0.011
 known-0.010
epad-0.010
thank-0.009
ho-0.009
Token dad
Token ID9955
Feature activation-8.93e-02

Pos logit contributions
 mat+0.007
 wipes+0.007
 snowball+0.007
 Velvet+0.007
 tries+0.007
Neg logit contributions
chard-0.008
 known-0.007
epad-0.007
year-0.007
thank-0.007
Token woke
Token ID19092
Feature activation+3.12e+00

Pos logit contributions
chard+0.003
 generation+0.002
epad+0.002
thank+0.002
 known+0.002
Neg logit contributions
 wipes-0.002
 tries-0.002
 snowball-0.002
 mat-0.002
 turns-0.002
Token up
Token ID510
Feature activation-3.94e-01

Pos logit contributions
 mat+0.121
 gap+0.118
uddle+0.116
 ramp+0.116
powers+0.116
Neg logit contributions
 going-0.044
 happening-0.034
chard-0.034
 all-0.032
 decided-0.031
Token.
Token ID13
Feature activation+9.99e-01

Pos logit contributions
chard+0.014
 known+0.012
ho+0.012
year+0.012
epad+0.012
Neg logit contributions
 wipes-0.007
 mat-0.007
 snowball-0.007
 Velvet-0.007
 gas-0.007
Token They
Token ID1119
Feature activation+7.38e-01

Pos logit contributions
 mat+0.029
 snowball+0.029
 wipes+0.028
uddle+0.027
 Velvet+0.027
Neg logit contributions
chard-0.024
 known-0.022
epad-0.021
ho-0.021
thank-0.021
Token were
Token ID547
Feature activation+2.02e-01

Pos logit contributions
 wipes+0.019
 snowball+0.018
 pill+0.018
 gas+0.017
 Velvet+0.017
Neg logit contributions
chard-0.020
epad-0.019
working-0.018
thank-0.018
 unlike-0.017
Token not
Token ID407
Feature activation+3.21e-01

Pos logit contributions
 snowball+0.006
 mat+0.006
 gas+0.006
 Velvet+0.006
 tries+0.006
Neg logit contributions
ho-0.004
chard-0.004
thank-0.004
epad-0.004
 known-0.004
Token air
Token ID1633
Feature activation+5.55e-01

Pos logit contributions
 wipes+0.009
 snowball+0.009
 mat+0.009
 Velvet+0.009
 mats+0.008
Neg logit contributions
chard-0.009
 known-0.009
epad-0.009
thank-0.009
ho-0.008
Token.
Token ID13
Feature activation+2.45e-01

Pos logit contributions
 snowball+0.012
 wipes+0.012
 mat+0.012
uddle+0.011
 gas+0.011
Neg logit contributions
chard-0.018
 known-0.017
year-0.017
years-0.017
epad-0.016
Token He
Token ID679
Feature activation+4.07e-01

Pos logit contributions
 mat+0.006
 snowball+0.006
 wipes+0.005
 Velvet+0.005
Anna+0.005
Neg logit contributions
chard-0.007
 known-0.007
epad-0.007
thank-0.007
ho-0.007
Token watched
Token ID7342
Feature activation+9.28e-01

Pos logit contributions
 wipes+0.010
 snowball+0.010
 mat+0.010
 tries+0.010
 turns+0.009
Neg logit contributions
chard-0.012
 known-0.011
epad-0.011
thank-0.010
year-0.010
Token with
Token ID351
Feature activation-8.91e-01

Pos logit contributions
 mat+0.021
 snowball+0.021
 wipes+0.020
 Velvet+0.019
 mats+0.019
Neg logit contributions
 known-0.029
chard-0.029
thank-0.026
ho-0.026
epad-0.026
Token amaz
Token ID40642
Feature activation+3.09e+00

Pos logit contributions
chard+0.023
 known+0.022
ho+0.021
thank+0.021
epad+0.021
Neg logit contributions
 snowball-0.024
 mat-0.024
 wipes-0.023
 Velvet-0.022
 ramp-0.022
Tokenement
Token ID972
Feature activation+1.82e-01

Pos logit contributions
 mat+0.068
 snowball+0.067
 wipes+0.066
uddle+0.062
 Velvet+0.062
Neg logit contributions
chard-0.095
 known-0.090
epad-0.086
ho-0.085
thank-0.085
Token as
Token ID355
Feature activation+1.02e+00

Pos logit contributions
 mat+0.004
 snowball+0.004
 wipes+0.004
ig+0.004
uddle+0.004
Neg logit contributions
chard-0.006
 known-0.006
year-0.005
thank-0.005
epad-0.005
Token the
Token ID262
Feature activation-4.56e-01

Pos logit contributions
 snowball+0.021
 mat+0.021
 wipes+0.020
 turns+0.019
 gas+0.019
Neg logit contributions
chard-0.033
thank-0.031
ho-0.030
 known-0.029
melon-0.029
Token z
Token ID1976
Feature activation-1.20e-01

Pos logit contributions
chard+0.014
 known+0.013
epad+0.012
year+0.012
thank+0.012
Neg logit contributions
 snowball-0.011
 mat-0.011
 wipes-0.011
 ramp-0.010
uddle-0.010
Tokenig
Token ID328
Feature activation+1.12e+00

Pos logit contributions
chard+0.005
 known+0.005
ho+0.005
thank+0.005
year+0.005
Neg logit contributions
 snowball-0.001
 mat-0.001
 wipes-0.001
 Velvet-0.001
Anna-0.001
Token egg
Token ID5935
Feature activation+4.37e-01

Pos logit contributions
chard+0.001
 known+0.001
year+0.001
epad+0.001
nam+0.001
Neg logit contributions
 snowball-0.001
 mat-0.001
 wipes-0.001
ig-0.001
 ramp-0.001
Token!
Token ID0
Feature activation+8.68e-01

Pos logit contributions
 snowball+0.010
 mat+0.010
ig+0.010
 wipes+0.009
uddle+0.009
Neg logit contributions
 known-0.013
chard-0.013
year-0.013
epad-0.012
nam-0.012
Token He
Token ID679
Feature activation+3.24e-01

Pos logit contributions
 snowball+0.018
 mat+0.018
 wipes+0.018
 Velvet+0.018
Anna+0.017
Neg logit contributions
chard-0.027
 known-0.026
epad-0.025
thank-0.025
ho-0.025
Token watched
Token ID7342
Feature activation+7.42e-01

Pos logit contributions
 snowball+0.008
 wipes+0.008
 tries+0.008
 mat+0.008
 turns+0.008
Neg logit contributions
chard-0.009
epad-0.008
 known-0.008
thank-0.008
year-0.008
Token in
Token ID287
Feature activation-1.25e+00

Pos logit contributions
 snowball+0.014
 mat+0.013
 wipes+0.013
 gas+0.012
 pill+0.012
Neg logit contributions
 known-0.026
chard-0.026
thank-0.023
epad-0.023
year-0.023
Token amaz
Token ID40642
Feature activation+3.07e+00

Pos logit contributions
chard+0.036
 known+0.034
thank+0.032
ho+0.031
year+0.031
Neg logit contributions
 snowball-0.032
 mat-0.031
 wipes-0.031
 tries-0.029
 turns-0.028
Tokenement
Token ID972
Feature activation+3.46e-01

Pos logit contributions
 mat+0.066
 snowball+0.066
 wipes+0.064
uddle+0.061
 Velvet+0.060
Neg logit contributions
chard-0.095
 known-0.091
epad-0.086
ho-0.085
thank-0.085
Token as
Token ID355
Feature activation+9.56e-01

Pos logit contributions
 mat+0.007
 snowball+0.007
 wipes+0.007
 mats+0.006
ig+0.006
Neg logit contributions
chard-0.011
 known-0.011
thank-0.010
year-0.010
epad-0.010
Token the
Token ID262
Feature activation-1.04e+00

Pos logit contributions
 snowball+0.019
 mat+0.018
 wipes+0.018
 turns+0.017
 gas+0.016
Neg logit contributions
chard-0.032
thank-0.031
 known-0.030
epad-0.029
ho-0.029
Token egg
Token ID5935
Feature activation-1.54e-02

Pos logit contributions
chard+0.033
 known+0.032
epad+0.031
thank+0.029
year+0.029
Neg logit contributions
 snowball-0.026
 wipes-0.022
 mat-0.022
 pill-0.021
 tent-0.020
Token began
Token ID2540
Feature activation-4.94e-02

Pos logit contributions
chard+0.000
 known+0.000
year+0.000
epad+0.000
years+0.000
Neg logit contributions
 snowball-0.000
 wipes-0.000
ig-0.000
 mat-0.000
 tries-0.000
Token mad
Token ID8805
Feature activation-6.27e-01

Pos logit contributions
 snowball+0.015
 turns+0.014
 gas+0.014
Anna+0.014
 tries+0.014
Neg logit contributions
chard-0.016
ully-0.016
epad-0.016
ho-0.016
enna-0.015
Token.
Token ID13
Feature activation-2.39e-01

Pos logit contributions
chard+0.019
 known+0.018
epad+0.017
year+0.016
thank+0.016
Neg logit contributions
 mat-0.015
 snowball-0.015
 wipes-0.014
uddle-0.014
 Velvet-0.014
Token\n
Token ID198
Feature activation-1.24e-01

Pos logit contributions
chard+0.006
 known+0.006
epad+0.006
ho+0.006
thank+0.005
Neg logit contributions
 mat-0.006
 snowball-0.006
 wipes-0.006
uddle-0.006
 Velvet-0.006
Token\n
Token ID198
Feature activation-1.43e-02

Pos logit contributions
chard+0.003
 known+0.003
year+0.003
 full+0.003
ho+0.003
Neg logit contributions
 mat-0.003
 wipes-0.003
 snowball-0.003
uddle-0.003
ig-0.003
Token"
Token ID1
Feature activation-2.55e-01

Pos logit contributions
chard+0.000
 known+0.000
year+0.000
ho+0.000
epad+0.000
Neg logit contributions
 mat-0.000
 snowball-0.000
 wipes-0.000
uddle-0.000
 Velvet-0.000
TokenWho
Token ID8241
Feature activation+3.06e+00

Pos logit contributions
chard+0.008
 known+0.007
thank+0.007
ho+0.007
epad+0.007
Neg logit contributions
 mat-0.006
 snowball-0.006
 wipes-0.006
uddle-0.006
ig-0.005
Token are
Token ID389
Feature activation+7.10e-02

Pos logit contributions
 mat+0.067
 wipes+0.066
 snowball+0.064
uddle+0.061
 turns+0.060
Neg logit contributions
chard-0.095
 known-0.088
thank-0.086
epad-0.085
ho-0.085
Token you
Token ID345
Feature activation+1.82e-02

Pos logit contributions
 mat+0.001
 Velvet+0.001
 wipes+0.001
 mats+0.001
 snowball+0.001
Neg logit contributions
chard-0.003
thank-0.003
epad-0.003
ho-0.003
year-0.002
Token?
Token ID30
Feature activation+1.14e+00

Pos logit contributions
 ha+0.000
 tries+0.000
 gas+0.000
ig+0.000
 mat+0.000
Neg logit contributions
year-0.001
ho-0.001
chard-0.001
thank-0.001
epad-0.001
Token Why
Token ID4162
Feature activation+5.73e-01

Pos logit contributions
 Velvet+0.020
 wipes+0.020
 mat+0.020
Anna+0.019
ara+0.019
Neg logit contributions
chard-0.038
 known-0.037
thank-0.036
epad-0.035
ho-0.035
Token did
Token ID750
Feature activation-3.16e-01

Pos logit contributions
 mat+0.011
 wipes+0.011
 snowball+0.010
 Velvet+0.010
uddle+0.010
Neg logit contributions
chard-0.019
 known-0.018
year-0.018
thank-0.018
epad-0.018
Token
Token ID26391
Feature activation+1.60e+00

Pos logit contributions
 mat+0.020
 snowball+0.019
 wipes+0.019
 Velvet+0.018
uddle+0.018
Neg logit contributions
chard-0.023
 known-0.021
epad-0.020
ho-0.020
year-0.020
Token.
Token ID13
Feature activation+5.35e-01

Pos logit contributions
 snowball+0.037
 mat+0.037
 wipes+0.037
uddle+0.034
 Velvet+0.034
Neg logit contributions
chard-0.046
 known-0.044
ho-0.041
 full-0.041
epad-0.041
Token\n
Token ID198
Feature activation+5.70e-01

Pos logit contributions
 mat+0.013
 snowball+0.013
 wipes+0.013
uddle+0.012
ig+0.012
Neg logit contributions
chard-0.015
 known-0.014
epad-0.013
year-0.013
ho-0.013
Token\n
Token ID198
Feature activation+7.71e-01

Pos logit contributions
 mat+0.014
 wipes+0.013
 snowball+0.013
uddle+0.013
 ramp+0.012
Neg logit contributions
chard-0.016
 known-0.016
year-0.015
thank-0.014
epad-0.014
TokenS
Token ID50
Feature activation+9.37e-01

Pos logit contributions
 mat+0.018
 snowball+0.018
 wipes+0.017
uddle+0.017
 ramp+0.016
Neg logit contributions
chard-0.023
 known-0.022
year-0.020
thank-0.020
epad-0.020
Tokenaman
Token ID10546
Feature activation+3.05e+00

Pos logit contributions
 mat+0.008
 snowball+0.008
 wipes+0.007
 Velvet+0.006
uddle+0.006
Neg logit contributions
chard-0.041
 known-0.040
epad-0.038
ho-0.038
thank-0.038
Tokentha
Token ID12898
Feature activation+1.43e+00

Pos logit contributions
 mat+0.087
 snowball+0.086
 wipes+0.084
 Velvet+0.082
ig+0.082
Neg logit contributions
chard-0.072
 known-0.068
epad-0.066
year-0.064
thank-0.062
Token was
Token ID373
Feature activation+5.34e-01

Pos logit contributions
 wipes+0.036
 snowball+0.035
 mat+0.034
ara+0.034
 tries+0.033
Neg logit contributions
chard-0.043
epad-0.039
 known-0.038
nam-0.037
year-0.036
Token amazed
Token ID24562
Feature activation+3.58e-01

Pos logit contributions
 snowball+0.015
 mat+0.014
 Velvet+0.014
 tries+0.014
 turns+0.014
Neg logit contributions
chard-0.013
ho-0.013
epad-0.012
thank-0.012
working-0.012
Token and
Token ID290
Feature activation-3.43e-01

Pos logit contributions
 wipes+0.008
 snowball+0.008
 mat+0.008
ara+0.008
uddle+0.008
Neg logit contributions
chard-0.010
 known-0.010
epad-0.010
year-0.009
ho-0.009
Token she
Token ID673
Feature activation+6.04e-02

Pos logit contributions
melon+0.008
ull+0.008
epad+0.008
chard+0.008
ho+0.008
Neg logit contributions
 tries-0.009
 wipes-0.009
 turns-0.008
 minutes-0.008
 Anna-0.008
Token that
Token ID326
Feature activation-7.76e-02

Pos logit contributions
chard+0.032
 known+0.031
year+0.031
epad+0.030
ho+0.030
Neg logit contributions
 mat-0.020
 snowball-0.017
 mats-0.016
 wipes-0.016
 Velvet-0.016
Token!"
Token ID2474
Feature activation-4.35e-01

Pos logit contributions
year+0.003
chard+0.003
years+0.003
 known+0.003
epad+0.003
Neg logit contributions
 mat-0.001
 wipes-0.001
 turns-0.001
 tries-0.001
 snowball-0.001
Token
Token ID220
Feature activation+2.74e-01

Pos logit contributions
chard+0.008
 known+0.008
 full+0.008
 decided+0.007
thank+0.007
Neg logit contributions
 snowball-0.014
 ramp-0.014
 mat-0.013
uddle-0.013
 mats-0.013
Token\n
Token ID198
Feature activation+6.02e-01

Pos logit contributions
 snowball+0.008
 mat+0.008
 mats+0.007
 ramp+0.007
 wipes+0.007
Neg logit contributions
chard-0.007
 known-0.007
 full-0.006
thank-0.006
epad-0.006
Token\n
Token ID198
Feature activation+7.11e-01

Pos logit contributions
 ramp+0.020
 snowball+0.020
ig+0.019
 mats+0.019
 mat+0.019
Neg logit contributions
 known-0.010
chard-0.010
 full-0.010
thank-0.009
 Katy-0.009
TokenCharl
Token ID24453
Feature activation+3.03e+00

Pos logit contributions
 ramp+0.021
 snowball+0.020
uddle+0.019
ig+0.018
 mat+0.018
Neg logit contributions
chard-0.016
 known-0.016
 decided-0.016
Friend-0.015
thank-0.015
Tokenotte
Token ID11404
Feature activation+1.21e+00

Pos logit contributions
 snowball+0.107
 ramp+0.101
 mat+0.100
 wipes+0.100
uddle+0.098
Neg logit contributions
chard-0.062
ho-0.053
 known-0.053
thank-0.052
year-0.050
Token thanked
Token ID26280
Feature activation-9.45e-01

Pos logit contributions
 wipes+0.032
 mat+0.030
 turns+0.029
ara+0.029
 snowball+0.029
Neg logit contributions
chard-0.034
epad-0.032
 known-0.031
year-0.030
nam-0.029
Token her
Token ID607
Feature activation-3.57e-01

Pos logit contributions
chard+0.020
 known+0.018
epad+0.017
ho+0.017
thank+0.017
Neg logit contributions
 mat-0.030
 wipes-0.030
 snowball-0.030
 Velvet-0.029
ig-0.028
Token mother
Token ID2802
Feature activation+3.95e-01

Pos logit contributions
chard+0.009
 known+0.009
thank+0.008
epad+0.008
 ruled+0.008
Neg logit contributions
 wipes-0.010
 mat-0.010
 snowball-0.010
 Velvet-0.009
uddle-0.009
Token for
Token ID329
Feature activation-1.70e+00

Pos logit contributions
 mat+0.008
 wipes+0.007
ig+0.007
 snowball+0.007
 Velvet+0.007
Neg logit contributions
 known-0.013
chard-0.013
thank-0.012
 ruled-0.012
epad-0.012
Token parents
Token ID3397
Feature activation+6.84e-01

Pos logit contributions
chard+0.007
year+0.006
 known+0.006
epad+0.006
ho+0.006
Neg logit contributions
 mat-0.006
 snowball-0.006
 wipes-0.006
uddle-0.005
 Velvet-0.005
Token could
Token ID714
Feature activation+4.65e-01

Pos logit contributions
 mat+0.014
 wipes+0.013
 Velvet+0.013
 tries+0.012
 mats+0.012
Neg logit contributions
chard-0.022
year-0.021
 known-0.021
thank-0.020
ull-0.020
Token lead
Token ID1085
Feature activation+7.95e-01

Pos logit contributions
 snowball+0.011
 mat+0.011
 wipes+0.010
 Velvet+0.010
 ramp+0.010
Neg logit contributions
year-0.015
chard-0.014
epad-0.013
years-0.013
 addressed-0.012
Token to
Token ID284
Feature activation+7.66e-01

Pos logit contributions
 mat+0.021
 Velvet+0.020
 snowball+0.020
 wipes+0.020
 turns+0.019
Neg logit contributions
chard-0.021
year-0.020
 known-0.019
epad-0.019
thank-0.019
Token dangerous
Token ID4923
Feature activation+9.19e-01

Pos logit contributions
 mat+0.015
 snowball+0.015
 wipes+0.014
 Velvet+0.013
 mats+0.013
Neg logit contributions
chard-0.026
year-0.024
ho-0.024
 known-0.023
melon-0.023
Token situations
Token ID7445
Feature activation+3.02e+00

Pos logit contributions
 mat+0.018
 snowball+0.018
 wipes+0.017
uddle+0.016
 Velvet+0.016
Neg logit contributions
chard-0.031
 known-0.029
epad-0.028
ho-0.028
thank-0.028
Token.
Token ID13
Feature activation-4.11e-01

Pos logit contributions
 mat+0.092
 snowball+0.092
 wipes+0.090
uddle+0.088
 Velvet+0.087
Neg logit contributions
chard-0.067
 known-0.062
epad-0.058
ho-0.057
thank-0.057
Token From
Token ID3574
Feature activation-3.76e-01

Pos logit contributions
chard+0.008
 known+0.007
ho+0.007
 full+0.007
year+0.007
Neg logit contributions
 snowball-0.014
 mat-0.014
 ramp-0.013
 wipes-0.013
uddle-0.013
Token that
Token ID326
Feature activation-3.67e-01

Pos logit contributions
chard+0.010
 known+0.010
ho+0.009
thank+0.009
year+0.009
Neg logit contributions
 mat-0.010
 snowball-0.010
 wipes-0.009
uddle-0.009
 ramp-0.009
Token day
Token ID1110
Feature activation+7.32e-01

Pos logit contributions
chard+0.006
 known+0.005
epad+0.005
year+0.005
thank+0.005
Neg logit contributions
 mat-0.014
 snowball-0.014
 wipes-0.013
 Velvet-0.013
ig-0.013
Token on
Token ID319
Feature activation+5.32e-01

Pos logit contributions
 mat+0.024
 snowball+0.024
 wipes+0.023
uddle+0.022
 ramp+0.022
Neg logit contributions
chard-0.015
 known-0.014
epad-0.013
ho-0.013
year-0.012
Token's
Token ID338
Feature activation-2.84e-01

Pos logit contributions
 snowball+0.003
 mat+0.003
 wipes+0.003
 Velvet+0.003
uddle+0.003
Neg logit contributions
chard-0.003
 known-0.003
epad-0.002
year-0.002
thank-0.002
Token mom
Token ID1995
Feature activation+3.49e-01

Pos logit contributions
chard+0.009
 known+0.009
year+0.008
thank+0.008
epad+0.008
Neg logit contributions
 mat-0.007
 snowball-0.007
 wipes-0.006
uddle-0.006
 ramp-0.006
Token said
Token ID531
Feature activation+1.93e-01

Pos logit contributions
 wipes+0.010
 snowball+0.009
 mat+0.009
 Velvet+0.009
uddle+0.009
Neg logit contributions
chard-0.009
 known-0.008
epad-0.008
thank-0.008
ho-0.008
Token,
Token ID11
Feature activation-3.53e-03

Pos logit contributions
 mat+0.005
 snowball+0.005
 wipes+0.005
 Velvet+0.005
uddle+0.004
Neg logit contributions
chard-0.005
 known-0.005
epad-0.005
thank-0.005
year-0.005
Token "
Token ID366
Feature activation+1.32e+00

Pos logit contributions
chard+0.000
 known+0.000
epad+0.000
thank+0.000
year+0.000
Neg logit contributions
 mat-0.000
 snowball-0.000
 wipes-0.000
 Velvet-0.000
uddle-0.000
TokenDon
Token ID3987
Feature activation+2.99e+00

Pos logit contributions
 mat+0.034
 snowball+0.033
 wipes+0.033
uddle+0.032
ig+0.032
Neg logit contributions
chard-0.036
 known-0.036
thank-0.033
ho-0.033
year-0.032
Token't
Token ID470
Feature activation-5.97e-01

Pos logit contributions
 wipes+0.126
 snowball+0.125
 mat+0.124
uddle+0.120
ig+0.118
Neg logit contributions
 known-0.029
chard-0.025
 full-0.025
ho-0.023
 ever-0.023
Token cry
Token ID3960
Feature activation+1.43e+00

Pos logit contributions
year+0.018
chard+0.018
 known+0.016
epad+0.015
thank+0.015
Neg logit contributions
 mat-0.014
 snowball-0.014
 wipes-0.014
 turns-0.014
 Velvet-0.013
Token,
Token ID11
Feature activation+7.24e-01

Pos logit contributions
 mat+0.031
 snowball+0.030
 wipes+0.029
 Velvet+0.029
uddle+0.029
Neg logit contributions
chard-0.046
 known-0.044
epad-0.041
thank-0.041
year-0.041
Token boys
Token ID6510
Feature activation+1.38e+00

Pos logit contributions
ig+0.019
uddle+0.019
 mat+0.019
 snowball+0.018
 wipes+0.018
Neg logit contributions
chard-0.017
 full-0.017
 decided-0.017
ho-0.016
 known-0.016
Token.
Token ID13
Feature activation+8.66e-01

Pos logit contributions
 mat+0.042
 snowball+0.041
 wipes+0.040
uddle+0.039
 Velvet+0.039
Neg logit contributions
chard-0.031
 known-0.029
ho-0.027
epad-0.027
thank-0.027
Token out
Token ID503
Feature activation-6.04e-02

Pos logit contributions
 snowball+0.004
 mat+0.004
 wipes+0.003
 Velvet+0.003
uddle+0.003
Neg logit contributions
chard-0.003
 known-0.002
epad-0.002
thank-0.002
ho-0.002
Token and
Token ID290
Feature activation+6.97e-01

Pos logit contributions
chard+0.002
 known+0.002
epad+0.002
thank+0.002
ho+0.001
Neg logit contributions
 mat-0.002
 snowball-0.002
 wipes-0.001
 Velvet-0.001
 mats-0.001
Token said
Token ID531
Feature activation+5.32e-01

Pos logit contributions
 mat+0.021
 snowball+0.021
 wipes+0.020
uddle+0.020
 Velvet+0.019
Neg logit contributions
chard-0.016
 known-0.015
epad-0.014
ho-0.014
year-0.014
Token,
Token ID11
Feature activation+1.11e-01

Pos logit contributions
 mat+0.013
 snowball+0.013
 wipes+0.013
uddle+0.012
 ramp+0.012
Neg logit contributions
chard-0.015
 known-0.014
ho-0.013
epad-0.013
year-0.013
Token "
Token ID366
Feature activation+1.27e+00

Pos logit contributions
 mat+0.002
 wipes+0.002
 snowball+0.002
 Velvet+0.002
 ramp+0.002
Neg logit contributions
chard-0.004
 known-0.003
thank-0.003
epad-0.003
year-0.003
TokenB
Token ID33
Feature activation+2.99e+00

Pos logit contributions
 mat+0.033
 wipes+0.031
 snowball+0.031
uddle+0.031
 ramp+0.030
Neg logit contributions
chard-0.035
 known-0.035
ho-0.032
year-0.032
 decided-0.032
Tokenoys
Token ID19417
Feature activation+1.55e+00

Pos logit contributions
 snowball+0.093
 wipes+0.093
 mat+0.092
 Velvet+0.088
 wheelchair+0.085
Neg logit contributions
ho-0.060
chard-0.056
epad-0.055
year-0.055
working-0.054
Token,
Token ID11
Feature activation+1.11e+00

Pos logit contributions
 snowball+0.042
 mat+0.042
 wipes+0.040
 Velvet+0.039
 ramp+0.039
Neg logit contributions
chard-0.037
 known-0.036
ho-0.034
epad-0.034
 full-0.034
Token you
Token ID345
Feature activation+9.56e-01

Pos logit contributions
 mat+0.030
 snowball+0.030
uddle+0.029
 wipes+0.029
 Velvet+0.028
Neg logit contributions
chard-0.026
 full-0.025
 decided-0.024
 known-0.024
ho-0.023
Token should
Token ID815
Feature activation+6.59e-01

Pos logit contributions
 mat+0.029
 snowball+0.028
 wipes+0.027
 Velvet+0.027
 mats+0.027
Neg logit contributions
chard-0.021
 known-0.021
 full-0.019
epad-0.019
year-0.019
Token not
Token ID407
Feature activation-5.44e-01

Pos logit contributions
 mat+0.024
 snowball+0.023
 wipes+0.023
uddle+0.022
 Velvet+0.022
Neg logit contributions
chard-0.011
 known-0.010
epad-0.009
thank-0.009
ho-0.009
Token.
Token ID13
Feature activation+8.85e-01

Pos logit contributions
chard+0.011
 known+0.011
 messenger+0.010
abby+0.010
year+0.010
Neg logit contributions
 wipes-0.005
 snowball-0.005
 mat-0.005
 pill-0.005
 gas-0.004
Token She
Token ID1375
Feature activation+7.82e-01

Pos logit contributions
 wipes+0.025
 mat+0.025
 snowball+0.024
 Velvet+0.023
 mats+0.022
Neg logit contributions
chard-0.022
 known-0.022
epad-0.020
thank-0.020
year-0.020
Token said
Token ID531
Feature activation+3.57e-01

Pos logit contributions
 wipes+0.019
 snowball+0.019
 Velvet+0.018
Anna+0.018
ara+0.018
Neg logit contributions
chard-0.023
epad-0.023
working-0.021
thank-0.021
 known-0.020
Token,
Token ID11
Feature activation-2.86e-01

Pos logit contributions
 mat+0.008
 snowball+0.008
 wipes+0.008
 Velvet+0.008
uddle+0.008
Neg logit contributions
chard-0.011
 known-0.010
epad-0.010
thank-0.010
year-0.009
Token "
Token ID366
Feature activation+1.16e+00

Pos logit contributions
chard+0.009
 known+0.008
epad+0.008
thank+0.008
year+0.008
Neg logit contributions
 mat-0.006
 wipes-0.006
 snowball-0.006
 Velvet-0.006
 ramp-0.006
TokenDon
Token ID3987
Feature activation+2.99e+00

Pos logit contributions
 mat+0.027
 snowball+0.026
 wipes+0.026
uddle+0.025
 ramp+0.025
Neg logit contributions
chard-0.035
 known-0.033
thank-0.031
ho-0.031
epad-0.031
Token't
Token ID470
Feature activation-6.01e-01

Pos logit contributions
 wipes+0.119
 snowball+0.118
 mat+0.117
uddle+0.114
ig+0.112
Neg logit contributions
 known-0.034
chard-0.032
 full-0.030
ho-0.029
 decided-0.028
Token worry
Token ID5490
Feature activation+1.79e+00

Pos logit contributions
year+0.021
chard+0.020
 known+0.018
 ruled+0.017
thank+0.017
Neg logit contributions
 wipes-0.012
 mat-0.012
 turns-0.012
 tries-0.012
 snowball-0.012
Token,
Token ID11
Feature activation+7.06e-01

Pos logit contributions
 wipes+0.054
 mat+0.053
 snowball+0.053
 ramp+0.052
 Velvet+0.051
Neg logit contributions
 full-0.034
chard-0.033
ho-0.033
 image-0.031
melon-0.030
Token sweet
Token ID6029
Feature activation+1.43e+00

Pos logit contributions
 mat+0.016
 snowball+0.016
uddle+0.016
 wipes+0.016
ig+0.016
Neg logit contributions
chard-0.020
 known-0.019
 full-0.018
ho-0.018
epad-0.018
Tokenie
Token ID494
Feature activation+1.32e+00

Pos logit contributions
 mat+0.044
 snowball+0.043
 wipes+0.043
uddle+0.041
 Velvet+0.041
Neg logit contributions
chard-0.032
 known-0.030
epad-0.028
thank-0.027
year-0.027
Token mad
Token ID8805
Feature activation-1.01e+00

Pos logit contributions
Anna+0.020
 mat+0.020
 snowball+0.020
 wipes+0.020
 tries+0.020
Neg logit contributions
 generation-0.022
thank-0.021
epad-0.021
chard-0.021
ully-0.021
Token.
Token ID13
Feature activation-6.85e-02

Pos logit contributions
chard+0.023
 known+0.022
epad+0.020
ho+0.020
thank+0.020
Neg logit contributions
 mat-0.030
 snowball-0.030
 wipes-0.029
uddle-0.028
 Velvet-0.028
Token\n
Token ID198
Feature activation-1.93e-01

Pos logit contributions
chard+0.002
 known+0.002
 full+0.002
year+0.002
epad+0.002
Neg logit contributions
 mat-0.002
 snowball-0.002
 wipes-0.002
uddle-0.002
ig-0.002
Token\n
Token ID198
Feature activation+3.85e-01

Pos logit contributions
chard+0.005
 known+0.005
year+0.005
 full+0.004
 ruled+0.004
Neg logit contributions
 mat-0.005
 wipes-0.005
 snowball-0.005
ig-0.005
uddle-0.005
Token"
Token ID1
Feature activation-8.29e-02

Pos logit contributions
 mat+0.010
 wipes+0.010
 snowball+0.010
ig+0.009
uddle+0.009
Neg logit contributions
chard-0.011
 known-0.010
year-0.010
 full-0.010
 decided-0.009
TokenWho
Token ID8241
Feature activation+2.98e+00

Pos logit contributions
chard+0.003
 known+0.002
epad+0.002
thank+0.002
ho+0.002
Neg logit contributions
 mat-0.002
 snowball-0.002
 wipes-0.002
uddle-0.002
 Velvet-0.002
Token are
Token ID389
Feature activation-2.90e-01

Pos logit contributions
 mat+0.069
 wipes+0.067
 snowball+0.067
 Velvet+0.063
uddle+0.063
Neg logit contributions
chard-0.090
 known-0.083
epad-0.080
thank-0.079
ho-0.079
Token you
Token ID345
Feature activation-6.41e-01

Pos logit contributions
chard+0.011
thank+0.011
epad+0.011
 known+0.010
year+0.010
Neg logit contributions
 mat-0.004
 Velvet-0.004
 wipes-0.004
 snowball-0.003
 mats-0.003
Token?"
Token ID1701
Feature activation+6.95e-01

Pos logit contributions
chard+0.022
ho+0.022
spe+0.021
year+0.021
melon+0.021
Neg logit contributions
 mat-0.010
ara-0.010
ig-0.010
Anna-0.010
 snowball-0.010
Token he
Token ID339
Feature activation-1.30e-01

Pos logit contributions
 mat+0.017
 snowball+0.017
 wipes+0.016
 Velvet+0.016
 ramp+0.015
Neg logit contributions
chard-0.020
 known-0.019
epad-0.018
ho-0.018
thank-0.018
Token shouted
Token ID17293
Feature activation-2.15e-01

Pos logit contributions
chard+0.003
 known+0.003
epad+0.002
ho+0.002
thank+0.002
Neg logit contributions
 mat-0.004
 snowball-0.004
 wipes-0.004
 Velvet-0.004
uddle-0.004
Token bending
Token ID30808
Feature activation-2.74e-01

Pos logit contributions
abby+0.011
chard+0.011
ull+0.011
thank+0.010
ho+0.010
Neg logit contributions
 tries-0.012
 turns-0.012
 Velvet-0.012
 mat-0.012
 wipes-0.012
Token her
Token ID607
Feature activation+9.21e-01

Pos logit contributions
 known+0.007
chard+0.007
epad+0.007
year+0.007
thank+0.007
Neg logit contributions
 mat-0.007
 snowball-0.007
 wipes-0.007
 Velvet-0.007
 mats-0.007
Token body
Token ID1767
Feature activation+1.09e-01

Pos logit contributions
 mat+0.020
 snowball+0.018
 wipes+0.018
uddle+0.017
 mats+0.017
Neg logit contributions
chard-0.031
 known-0.030
year-0.028
thank-0.028
ho-0.028
Token like
Token ID588
Feature activation-1.32e-01

Pos logit contributions
 mat+0.003
 mats+0.002
 snowball+0.002
 wipes+0.002
ig+0.002
Neg logit contributions
chard-0.003
 known-0.003
 messenger-0.003
 generation-0.003
thank-0.003
Token a
Token ID257
Feature activation+2.06e-01

Pos logit contributions
chard+0.004
 known+0.004
thank+0.003
ho+0.003
abby+0.003
Neg logit contributions
 mat-0.003
 Velvet-0.003
 snowball-0.003
 wipes-0.003
 mats-0.003
Token gymn
Token ID38581
Feature activation-3.36e+00

Pos logit contributions
 mat+0.005
 snowball+0.005
 wipes+0.004
 ramp+0.004
uddle+0.004
Neg logit contributions
chard-0.006
 known-0.006
year-0.006
thank-0.006
ho-0.006
Tokenast
Token ID459
Feature activation+6.65e-01

Pos logit contributions
chard+0.075
 known+0.072
epad+0.068
thank+0.067
ho+0.065
Neg logit contributions
 snowball-0.099
 wipes-0.098
 mat-0.098
 Velvet-0.093
uddle-0.093
Token.
Token ID13
Feature activation-2.44e-01

Pos logit contributions
 mat+0.016
 wipes+0.016
 snowball+0.015
 Velvet+0.015
ig+0.014
Neg logit contributions
chard-0.020
 known-0.019
year-0.018
ho-0.017
thank-0.017
Token Lily
Token ID20037
Feature activation-1.01e-01

Pos logit contributions
chard+0.008
 known+0.007
epad+0.007
ho+0.007
thank+0.007
Neg logit contributions
 mat-0.005
 snowball-0.005
 wipes-0.005
 Velvet-0.005
uddle-0.005
Token tried
Token ID3088
Feature activation-4.76e-01

Pos logit contributions
chard+0.002
 known+0.002
epad+0.002
ho+0.002
year+0.002
Neg logit contributions
 mat-0.003
 snowball-0.003
 wipes-0.003
uddle-0.003
 Velvet-0.003
Token her
Token ID607
Feature activation+4.22e-01

Pos logit contributions
chard+0.012
 known+0.011
thank+0.011
ho+0.011
epad+0.010
Neg logit contributions
 mat-0.013
 snowball-0.013
 wipes-0.013
 Velvet-0.012
uddle-0.012
Token time
Token ID640
Feature activation-4.80e-01

Pos logit contributions
 wipes+0.031
 mat+0.030
 stitches+0.030
 ramp+0.030
 Velvet+0.030
Neg logit contributions
 full-0.016
 known-0.015
 often-0.015
chard-0.015
thank-0.013
Token there
Token ID612
Feature activation+5.79e-01

Pos logit contributions
chard+0.011
ho+0.010
 known+0.010
 full+0.010
epad+0.009
Neg logit contributions
 mat-0.014
 snowball-0.014
 wipes-0.014
 ramp-0.013
uddle-0.013
Token was
Token ID373
Feature activation+3.08e-01

Pos logit contributions
 snowball+0.014
 mat+0.014
 wipes+0.014
 Velvet+0.013
uddle+0.013
Neg logit contributions
chard-0.016
 known-0.015
epad-0.015
thank-0.014
year-0.014
Token a
Token ID257
Feature activation+1.84e-01

Pos logit contributions
 snowball+0.009
 mat+0.008
 wipes+0.008
 Velvet+0.008
uddle+0.008
Neg logit contributions
chard-0.008
 known-0.007
ho-0.007
epad-0.007
thank-0.007
Token little
Token ID1310
Feature activation+6.19e-01

Pos logit contributions
 mat+0.004
 snowball+0.004
 wipes+0.004
uddle+0.004
 Velvet+0.004
Neg logit contributions
chard-0.005
 known-0.005
epad-0.005
ho-0.005
thank-0.005
Token av
Token ID1196
Feature activation-3.30e+00

Pos logit contributions
 mat+0.013
 snowball+0.012
 wipes+0.012
uddle+0.011
 Velvet+0.011
Neg logit contributions
chard-0.020
 known-0.019
ho-0.018
thank-0.018
epad-0.018
Tokenac
Token ID330
Feature activation-9.03e-01

Pos logit contributions
chard+0.119
 known+0.114
epad+0.110
year+0.107
thank+0.107
Neg logit contributions
 snowball-0.055
 mat-0.054
 wipes-0.051
 Velvet-0.049
uddle-0.048
Tokenado
Token ID4533
Feature activation-3.99e-01

Pos logit contributions
chard+0.023
 known+0.020
epad+0.020
thank+0.019
 full+0.019
Neg logit contributions
 snowball-0.026
 wipes-0.025
 mat-0.025
uddle-0.025
ig-0.024
Token.
Token ID13
Feature activation+2.89e-01

Pos logit contributions
chard+0.012
epad+0.011
 known+0.011
thank+0.010
year+0.010
Neg logit contributions
 snowball-0.010
 mat-0.010
uddle-0.009
ig-0.009
 wipes-0.009
Token It
Token ID632
Feature activation-5.97e-01

Pos logit contributions
 snowball+0.007
 mat+0.007
 Velvet+0.006
 wipes+0.006
 ramp+0.006
Neg logit contributions
chard-0.009
 known-0.008
ho-0.008
epad-0.008
thank-0.008
Token was
Token ID373
Feature activation+3.01e-01

Pos logit contributions
epad+0.020
chard+0.019
 known+0.018
nam+0.017
year+0.017
Neg logit contributions
 snowball-0.013
 mat-0.012
 Velvet-0.012
 wipes-0.012
Anna-0.011
Token,
Token ID11
Feature activation-6.74e-01

Pos logit contributions
chard+0.023
ho+0.021
 known+0.021
 full+0.021
epad+0.019
Neg logit contributions
 snowball-0.029
 mat-0.029
 wipes-0.029
 ramp-0.028
uddle-0.027
Token there
Token ID612
Feature activation+6.93e-01

Pos logit contributions
chard+0.018
 known+0.017
epad+0.016
ho+0.016
thank+0.015
Neg logit contributions
 mat-0.018
 snowball-0.018
 wipes-0.017
uddle-0.017
 Velvet-0.017
Token was
Token ID373
Feature activation+2.92e-01

Pos logit contributions
 mat+0.021
 snowball+0.020
 wipes+0.020
uddle+0.019
 ramp+0.019
Neg logit contributions
chard-0.015
 known-0.015
ho-0.014
 full-0.013
thank-0.013
Token a
Token ID257
Feature activation+4.28e-02

Pos logit contributions
 snowball+0.008
 mat+0.008
 wipes+0.008
 Velvet+0.007
uddle+0.007
Neg logit contributions
chard-0.007
 known-0.007
ho-0.007
epad-0.007
thank-0.006
Token loud
Token ID7812
Feature activation-7.26e-01

Pos logit contributions
 mat+0.001
 snowball+0.001
 wipes+0.001
uddle+0.001
 Velvet+0.001
Neg logit contributions
chard-0.001
 known-0.001
ho-0.001
epad-0.001
thank-0.001
Token av
Token ID1196
Feature activation-3.26e+00

Pos logit contributions
chard+0.022
 known+0.022
epad+0.021
year+0.020
ho+0.020
Neg logit contributions
 mat-0.016
 snowball-0.016
 wipes-0.015
 ramp-0.015
 Velvet-0.015
Tokenac
Token ID330
Feature activation-1.37e+00

Pos logit contributions
chard+0.112
 known+0.107
epad+0.102
ho+0.102
thank+0.101
Neg logit contributions
 mat-0.060
 snowball-0.059
 wipes-0.058
uddle-0.054
 Velvet-0.054
Tokenado
Token ID4533
Feature activation-6.95e-01

Pos logit contributions
chard+0.042
epad+0.037
 known+0.037
 full+0.035
thank+0.035
Neg logit contributions
 snowball-0.033
 mat-0.031
 wipes-0.031
uddle-0.031
ig-0.031
Token.
Token ID13
Feature activation+2.26e-01

Pos logit contributions
chard+0.019
 known+0.018
epad+0.018
thank+0.017
year+0.016
Neg logit contributions
 snowball-0.019
 mat-0.019
uddle-0.017
ig-0.017
 wipes-0.017
Token It
Token ID632
Feature activation-5.25e-01

Pos logit contributions
 mat+0.004
 snowball+0.004
 Anna+0.004
Anna+0.004
 mats+0.004
Neg logit contributions
chard-0.008
ho-0.008
 known-0.007
thank-0.007
 ad-0.007
Token was
Token ID373
Feature activation+3.64e-01

Pos logit contributions
chard+0.018
epad+0.018
 known+0.016
nam+0.016
year+0.016
Neg logit contributions
 snowball-0.011
 mat-0.010
Anna-0.010
 wipes-0.009
 tries-0.009
Token girl
Token ID2576
Feature activation-1.04e+00

Pos logit contributions
 mat+0.011
 snowball+0.010
 wipes+0.010
 Velvet+0.010
uddle+0.010
Neg logit contributions
chard-0.017
 known-0.016
ho-0.016
thank-0.015
epad-0.015
Token who
Token ID508
Feature activation+8.54e-01

Pos logit contributions
chard+0.023
 known+0.022
epad+0.022
thank+0.020
year+0.019
Neg logit contributions
 mat-0.032
 snowball-0.032
 wipes-0.031
 Velvet-0.030
ig-0.030
Token didn
Token ID1422
Feature activation-8.39e-02

Pos logit contributions
 wipes+0.027
 snowball+0.027
Anna+0.026
 mat+0.026
 turns+0.026
Neg logit contributions
chard-0.019
epad-0.017
melon-0.017
ho-0.015
 known-0.015
Token't
Token ID470
Feature activation-8.65e-01

Pos logit contributions
chard+0.002
 known+0.002
thank+0.002
epad+0.002
ho+0.002
Neg logit contributions
 mat-0.002
 snowball-0.002
 Velvet-0.002
 ramp-0.002
Anna-0.002
Token like
Token ID588
Feature activation+3.54e-01

Pos logit contributions
chard+0.024
ho+0.023
melon+0.021
 known+0.021
 addressed+0.020
Neg logit contributions
 mat-0.023
 snowball-0.023
 Velvet-0.020
 pill-0.020
 wipes-0.020
Token av
Token ID1196
Feature activation-3.26e+00

Pos logit contributions
 mat+0.009
 snowball+0.009
 wipes+0.009
 Velvet+0.009
 mats+0.008
Neg logit contributions
chard-0.010
 known-0.009
year-0.009
thank-0.009
epad-0.009
Tokenoc
Token ID420
Feature activation+1.74e+00

Pos logit contributions
chard+0.105
 known+0.101
epad+0.096
thank+0.095
ho+0.095
Neg logit contributions
 mat-0.067
 wipes-0.064
 snowball-0.064
uddle-0.060
 Velvet-0.060
Tokenados
Token ID22484
Feature activation-8.58e-01

Pos logit contributions
 wipes+0.045
 mat+0.044
 snowball+0.044
uddle+0.043
 Velvet+0.042
Neg logit contributions
chard-0.046
 known-0.044
year-0.041
epad-0.041
 full-0.041
Token.
Token ID13
Feature activation-2.68e-01

Pos logit contributions
year+0.025
 known+0.025
chard+0.024
epad+0.023
thank+0.023
Neg logit contributions
 mat-0.020
 Velvet-0.019
 snowball-0.018
 mats-0.018
 pill-0.018
Token She
Token ID1375
Feature activation-8.96e-02

Pos logit contributions
chard+0.006
 known+0.006
epad+0.006
year+0.006
ho+0.006
Neg logit contributions
 mat-0.008
 wipes-0.008
 snowball-0.007
 Velvet-0.007
 mats-0.007
Token always
Token ID1464
Feature activation-8.77e-01

Pos logit contributions
epad+0.002
chard+0.002
 known+0.002
enna+0.002
melon+0.002
Neg logit contributions
 tries-0.002
 wipes-0.002
 snowball-0.002
 turns-0.002
Anna-0.002
Token has
Token ID468
Feature activation-4.27e-01

Pos logit contributions
 mat+0.008
 wipes+0.007
 tries+0.007
 turns+0.007
 gas+0.007
Neg logit contributions
chard-0.013
thank-0.012
 known-0.012
epad-0.012
ho-0.012
Token four
Token ID1440
Feature activation-2.78e-01

Pos logit contributions
chard+0.014
ho+0.012
year+0.012
melon+0.011
nam+0.011
Neg logit contributions
 turns-0.009
 gas-0.009
 mat-0.009
 tries-0.009
 wipes-0.008
Token legs
Token ID7405
Feature activation-3.37e-01

Pos logit contributions
chard+0.009
 known+0.008
ho+0.008
thank+0.008
epad+0.008
Neg logit contributions
 mat-0.006
 snowball-0.006
 wipes-0.006
 mats-0.006
ig-0.005
Token?'
Token ID8348
Feature activation-1.74e+00

Pos logit contributions
chard+0.012
 known+0.012
year+0.012
epad+0.011
years+0.011
Neg logit contributions
 mat-0.005
 wipes-0.005
ig-0.005
ara-0.005
uddle-0.005
Token and
Token ID290
Feature activation-3.24e-01

Pos logit contributions
chard+0.057
ho+0.054
melon+0.054
 known+0.053
epad+0.052
Neg logit contributions
 wipes-0.035
 mat-0.035
 snowball-0.033
 Velvet-0.032
 turns-0.030
Token '
Token ID705
Feature activation-3.23e+00

Pos logit contributions
chard+0.009
 known+0.009
epad+0.009
ull+0.009
melon+0.009
Neg logit contributions
 turns-0.008
 mat-0.008
 tries-0.008
 wipes-0.008
 mats-0.007
TokenWhat
Token ID2061
Feature activation+2.97e-01

Pos logit contributions
chard+0.092
melon+0.091
epad+0.089
 image+0.087
 ruled+0.085
Neg logit contributions
 snowball-0.072
Anna-0.071
 turns-0.066
Whe-0.064
ara-0.064
Token color
Token ID3124
Feature activation-1.16e-01

Pos logit contributions
 wipes+0.005
 mat+0.005
 snowball+0.005
 turns+0.004
Anna+0.004
Neg logit contributions
chard-0.011
thank-0.010
ho-0.010
 known-0.010
melon-0.010
Token is
Token ID318
Feature activation-5.24e-01

Pos logit contributions
chard+0.004
 known+0.004
thank+0.004
epad+0.003
year+0.003
Neg logit contributions
 mat-0.002
 wipes-0.002
ig-0.002
uddle-0.002
 snowball-0.002
Token the
Token ID262
Feature activation+4.60e-01

Pos logit contributions
chard+0.017
ho+0.017
thank+0.017
 known+0.016
year+0.016
Neg logit contributions
 mat-0.010
 gas-0.009
 wipes-0.009
 turns-0.009
 snowball-0.009
Token sky
Token ID6766
Feature activation-7.37e-01

Pos logit contributions
 mat+0.012
 snowball+0.010
 pill+0.010
 ramp+0.010
 pills+0.010
Neg logit contributions
chard-0.014
year-0.013
ho-0.012
thank-0.012
spe-0.012
Tokenongo
Token ID25162
Feature activation+3.22e-01

Pos logit contributions
 mat+0.029
 snowball+0.029
 wipes+0.028
uddle+0.027
 Velvet+0.027
Neg logit contributions
chard-0.021
 known-0.019
epad-0.018
ho-0.018
year-0.018
Token didn
Token ID1422
Feature activation-4.53e-01

Pos logit contributions
 snowball+0.010
 wipes+0.010
 mat+0.009
uddle+0.009
 tries+0.009
Neg logit contributions
chard-0.007
epad-0.007
 known-0.007
year-0.006
thank-0.006
Token't
Token ID470
Feature activation-1.23e+00

Pos logit contributions
chard+0.009
 known+0.008
epad+0.008
ho+0.008
thank+0.008
Neg logit contributions
 mat-0.015
 snowball-0.015
 wipes-0.014
 Velvet-0.014
uddle-0.014
Token like
Token ID588
Feature activation-4.25e-01

Pos logit contributions
chard+0.029
 known+0.026
ho+0.025
epad+0.024
thank+0.024
Neg logit contributions
 mat-0.037
 snowball-0.037
 wipes-0.036
 Velvet-0.034
 ramp-0.034
Token wet
Token ID9583
Feature activation+3.22e-01

Pos logit contributions
year+0.013
 known+0.013
chard+0.012
epad+0.012
thank+0.012
Neg logit contributions
 snowball-0.009
 wipes-0.009
 mat-0.009
 Velvet-0.008
 tries-0.008
Token av
Token ID1196
Feature activation-3.22e+00

Pos logit contributions
 wipes+0.006
 snowball+0.006
 turns+0.006
 mat+0.006
 mats+0.005
Neg logit contributions
 known-0.011
year-0.011
chard-0.011
epad-0.011
ho-0.010
Tokenoc
Token ID420
Feature activation+1.57e+00

Pos logit contributions
chard+0.106
 known+0.102
epad+0.097
ho+0.097
thank+0.097
Neg logit contributions
 mat-0.064
 wipes-0.061
 snowball-0.061
uddle-0.057
 Velvet-0.056
Tokenados
Token ID22484
Feature activation-4.77e-01

Pos logit contributions
 wipes+0.039
 snowball+0.037
 mat+0.037
uddle+0.037
ig+0.036
Neg logit contributions
chard-0.044
 known-0.041
year-0.039
epad-0.039
 full-0.039
Token,
Token ID11
Feature activation-6.40e-01

Pos logit contributions
chard+0.014
 known+0.013
year+0.013
thank+0.012
epad+0.012
Neg logit contributions
 mat-0.012
 snowball-0.011
 Velvet-0.011
 wipes-0.011
uddle-0.010
Token so
Token ID523
Feature activation-1.57e+00

Pos logit contributions
epad+0.018
chard+0.018
 known+0.018
year+0.018
thank+0.018
Neg logit contributions
 snowball-0.016
 mat-0.015
 wipes-0.015
 mats-0.015
 Velvet-0.014
Token he
Token ID339
Feature activation-1.19e+00

Pos logit contributions
epad+0.050
chard+0.050
thank+0.046
year+0.046
work+0.046
Neg logit contributions
 snowball-0.035
 Velvet-0.033
 wipes-0.032
 mat-0.030
 pill-0.030
Token take
Token ID1011
Feature activation+3.35e-01

Pos logit contributions
chard+0.035
year+0.033
 known+0.031
epad+0.031
ho+0.030
Neg logit contributions
 mat-0.029
 wipes-0.028
 ramp-0.028
 snowball-0.028
ig-0.027
Token it
Token ID340
Feature activation+3.23e-01

Pos logit contributions
 mat+0.008
 snowball+0.008
 wipes+0.008
uddle+0.007
 ramp+0.007
Neg logit contributions
chard-0.010
 known-0.009
epad-0.009
thank-0.009
year-0.009
Token with
Token ID351
Feature activation-6.21e-01

Pos logit contributions
 mat+0.008
 snowball+0.008
uddle+0.008
 Velvet+0.008
ig+0.008
Neg logit contributions
 known-0.009
chard-0.009
year-0.008
ho-0.008
epad-0.008
Token you
Token ID345
Feature activation-7.46e-01

Pos logit contributions
chard+0.023
year+0.022
 known+0.022
ho+0.022
epad+0.022
Neg logit contributions
 mat-0.010
 wipes-0.009
 Velvet-0.009
 snowball-0.009
 turns-0.009
Token wherever
Token ID14530
Feature activation-7.67e-01

Pos logit contributions
 known+0.023
chard+0.023
ho+0.022
year+0.022
thank+0.021
Neg logit contributions
 mat-0.016
 wipes-0.015
 snowball-0.014
uddle-0.014
 Velvet-0.014
Token you
Token ID345
Feature activation-3.21e+00

Pos logit contributions
chard+0.027
 known+0.026
epad+0.025
year+0.025
ho+0.025
Neg logit contributions
 mat-0.014
 wipes-0.013
 snowball-0.013
 Velvet-0.012
uddle-0.012
Token go
Token ID467
Feature activation-1.30e+00

Pos logit contributions
year+0.094
 known+0.094
chard+0.094
spe+0.089
ho+0.089
Neg logit contributions
 tries-0.070
 turns-0.070
 wipes-0.069
 mat-0.064
 pulls-0.062
Token."
Token ID526
Feature activation-1.00e-01

Pos logit contributions
 known+0.043
chard+0.042
ho+0.042
epad+0.041
year+0.041
Neg logit contributions
 mat-0.023
 snowball-0.023
 wipes-0.022
uddle-0.022
 Velvet-0.022
Token
Token ID198
Feature activation+4.70e-01

Pos logit contributions
chard+0.003
 known+0.003
epad+0.003
ho+0.003
year+0.003
Neg logit contributions
 snowball-0.002
 mat-0.002
 wipes-0.002
 Velvet-0.002
uddle-0.002
Token
Token ID198
Feature activation+5.48e-01

Pos logit contributions
 mat+0.012
 snowball+0.011
 wipes+0.011
 mats+0.011
uddle+0.011
Neg logit contributions
chard-0.013
 known-0.013
thank-0.012
year-0.012
ho-0.012
TokenBilly
Token ID49640
Feature activation+5.62e-01

Pos logit contributions
 mat+0.017
 snowball+0.017
 wipes+0.017
 Velvet+0.016
uddle+0.016
Neg logit contributions
chard-0.012
 known-0.011
epad-0.010
ho-0.010
year-0.010
Token tree
Token ID5509
Feature activation+3.57e-01

Pos logit contributions
epad+0.010
 known+0.009
year+0.009
chard+0.009
years+0.009
Neg logit contributions
uddle-0.008
ig-0.008
 wipes-0.008
 mat-0.008
 snowball-0.008
Token could
Token ID714
Feature activation-8.67e-01

Pos logit contributions
 wipes+0.008
uddle+0.008
ig+0.007
 tries+0.007
 turns+0.007
Neg logit contributions
epad-0.010
chard-0.010
year-0.010
nam-0.010
 known-0.009
Token see
Token ID766
Feature activation+2.84e-01

Pos logit contributions
year+0.027
epad+0.026
years+0.025
chard+0.025
spe+0.024
Neg logit contributions
 mat-0.018
 wipes-0.018
 nap-0.016
 knot-0.016
 snowball-0.016
Token lots
Token ID6041
Feature activation-3.97e-01

Pos logit contributions
 snowball+0.005
 wipes+0.005
 turns+0.005
 Velvet+0.005
 gas+0.005
Neg logit contributions
epad-0.009
chard-0.009
working-0.009
 known-0.008
thank-0.008
Token of
Token ID286
Feature activation-1.31e+00

Pos logit contributions
chard+0.013
 known+0.012
epad+0.011
ho+0.011
thank+0.011
Neg logit contributions
 snowball-0.008
 mat-0.008
 wipes-0.008
uddle-0.008
 mats-0.007
Token av
Token ID1196
Feature activation-3.19e+00

Pos logit contributions
chard+0.037
ho+0.035
 known+0.035
epad+0.034
enna+0.033
Neg logit contributions
 wipes-0.032
 snowball-0.031
 turns-0.029
 tries-0.028
 stitches-0.027
Tokenoc
Token ID420
Feature activation+1.16e+00

Pos logit contributions
chard+0.119
 known+0.114
epad+0.109
year+0.109
ho+0.108
Neg logit contributions
 snowball-0.049
 mat-0.044
 wipes-0.043
ig-0.043
uddle-0.042
Tokenados
Token ID22484
Feature activation-6.46e-01

Pos logit contributions
 wipes+0.027
 snowball+0.026
uddle+0.025
ig+0.025
 tries+0.025
Neg logit contributions
chard-0.034
 known-0.032
 full-0.030
epad-0.030
year-0.029
Token that
Token ID326
Feature activation-3.26e-01

Pos logit contributions
year+0.019
 known+0.019
chard+0.018
years+0.017
ull+0.017
Neg logit contributions
 mat-0.013
 tries-0.012
uddle-0.012
 ramp-0.012
 snowball-0.012
Token the
Token ID262
Feature activation+1.96e-01

Pos logit contributions
epad+0.010
year+0.009
years+0.009
amer+0.008
abby+0.008
Neg logit contributions
 turns-0.007
 Anna-0.006
 pill-0.006
 gas-0.006
 tries-0.006
Token supplier
Token ID22693
Feature activation-7.99e-01

Pos logit contributions
 mat+0.004
 snowball+0.004
 knot+0.004
 wipes+0.004
 ramp+0.004
Neg logit contributions
year-0.006
epad-0.006
 known-0.006
ina-0.005
enna-0.005
Token can
Token ID460
Feature activation-1.23e+00

Pos logit contributions
 mat+0.006
 snowball+0.006
 wipes+0.006
uddle+0.006
 Velvet+0.006
Neg logit contributions
chard-0.011
 known-0.011
epad-0.011
thank-0.010
ho-0.010
Token carry
Token ID3283
Feature activation+5.87e-01

Pos logit contributions
chard+0.034
 known+0.031
year+0.031
epad+0.030
ho+0.029
Neg logit contributions
 mat-0.032
 snowball-0.032
 wipes-0.031
 Velvet-0.031
 ramp-0.030
Token your
Token ID534
Feature activation-8.41e-04

Pos logit contributions
uddle+0.014
 snowball+0.014
 mat+0.014
 wipes+0.014
 ramp+0.014
Neg logit contributions
chard-0.016
 known-0.015
 full-0.014
epad-0.014
thank-0.014
Token guitar
Token ID10047
Feature activation-2.21e-01

Pos logit contributions
chard+0.000
epad+0.000
 image+0.000
 known+0.000
 full+0.000
Neg logit contributions
 snowball-0.000
 wipes-0.000
uddle-0.000
 mat-0.000
ara-0.000
Token wherever
Token ID14530
Feature activation-2.39e-01

Pos logit contributions
chard+0.006
 known+0.006
epad+0.006
year+0.006
ho+0.005
Neg logit contributions
 mat-0.006
 snowball-0.005
 wipes-0.005
 Velvet-0.005
uddle-0.005
Token you
Token ID345
Feature activation-3.15e+00

Pos logit contributions
chard+0.008
 known+0.008
epad+0.008
year+0.008
ho+0.007
Neg logit contributions
 mat-0.004
 snowball-0.004
 wipes-0.004
 Velvet-0.004
uddle-0.004
Token want
Token ID765
Feature activation-1.97e+00

Pos logit contributions
chard+0.084
 known+0.082
year+0.080
ho+0.076
epad+0.076
Neg logit contributions
 wipes-0.080
 tries-0.079
 turns-0.079
 mat-0.078
 snowball-0.077
Token.
Token ID13
Feature activation+7.58e-02

Pos logit contributions
chard+0.063
 known+0.060
epad+0.057
ho+0.057
year+0.056
Neg logit contributions
 mat-0.041
 snowball-0.040
 wipes-0.040
 Velvet-0.038
uddle-0.037
Token It
Token ID632
Feature activation-4.24e-01

Pos logit contributions
uddle+0.002
 mat+0.002
 snowball+0.002
 wipes+0.002
ig+0.002
Neg logit contributions
chard-0.002
 known-0.002
year-0.002
epad-0.002
 full-0.002
Token's
Token ID338
Feature activation+3.91e-02

Pos logit contributions
chard+0.013
 known+0.013
ho+0.013
thank+0.013
 decided+0.012
Neg logit contributions
 mat-0.008
 snowball-0.008
 wipes-0.008
uddle-0.008
 mats-0.007
Token yours
Token ID12431
Feature activation+2.04e-02

Pos logit contributions
 snowball+0.001
 Velvet+0.001
 mat+0.001
 wipes+0.001
 ramp+0.001
Neg logit contributions
chard-0.001
ho-0.001
year-0.001
epad-0.001
thank-0.001
Token was
Token ID373
Feature activation+4.01e-01

Pos logit contributions
chard+0.021
epad+0.020
nam+0.019
 known+0.018
year+0.018
Neg logit contributions
 turns-0.010
 mat-0.010
Anna-0.010
 ramp-0.010
 gas-0.009
Token very
Token ID845
Feature activation+3.62e-01

Pos logit contributions
 snowball+0.009
 gas+0.009
 minutes+0.008
 Anna+0.007
 turns+0.007
Neg logit contributions
ho-0.014
chard-0.013
ully-0.012
enna-0.012
melon-0.012
Token blue
Token ID4171
Feature activation+6.06e-01

Pos logit contributions
 turns+0.008
 gas+0.007
 minutes+0.007
 snowball+0.007
 mat+0.007
Neg logit contributions
ully-0.011
epad-0.011
 streaming-0.010
chard-0.010
enna-0.010
Token.
Token ID13
Feature activation+3.19e-02

Pos logit contributions
 snowball+0.014
 mat+0.013
uddle+0.012
 mats+0.012
 wipes+0.011
Neg logit contributions
 known-0.021
chard-0.020
epad-0.018
year-0.018
 harvested-0.018
Token Everywhere
Token ID48201
Feature activation-1.34e-01

Pos logit contributions
 snowball+0.001
 mat+0.001
 wipes+0.001
 mats+0.001
 Velvet+0.001
Neg logit contributions
chard-0.001
ho-0.001
 known-0.001
epad-0.001
thank-0.001
Token you
Token ID345
Feature activation-3.14e+00

Pos logit contributions
chard+0.004
 known+0.004
epad+0.003
thank+0.003
year+0.003
Neg logit contributions
 snowball-0.003
 mat-0.003
 wipes-0.003
 Velvet-0.003
 ramp-0.003
Token looked
Token ID3114
Feature activation-4.71e-01

Pos logit contributions
chard+0.071
 known+0.068
epad+0.063
melon+0.061
ho+0.060
Neg logit contributions
 wipes-0.095
 snowball-0.092
 turns-0.092
 mat-0.091
 tries-0.088
Token,
Token ID11
Feature activation-7.90e-01

Pos logit contributions
chard+0.014
 known+0.013
epad+0.013
thank+0.013
ho+0.012
Neg logit contributions
 mat-0.011
 wipes-0.011
 snowball-0.011
uddle-0.010
 Velvet-0.010
Token it
Token ID340
Feature activation-9.42e-01

Pos logit contributions
ho+0.024
 known+0.024
thank+0.024
chard+0.024
melon+0.023
Neg logit contributions
 mat-0.016
 wipes-0.016
 mats-0.016
 snowball-0.016
 turns-0.016
Token was
Token ID373
Feature activation+5.63e-04

Pos logit contributions
chard+0.027
epad+0.024
thank+0.023
ho+0.023
year+0.023
Neg logit contributions
 wipes-0.023
 turns-0.022
 tries-0.022
 mat-0.022
ig-0.021
Token painted
Token ID13055
Feature activation+1.80e-01

Pos logit contributions
 gas+0.000
 Anna+0.000
 snowball+0.000
 turns+0.000
 minutes+0.000
Neg logit contributions
ho-0.000
enna-0.000
ully-0.000
abby-0.000
year-0.000
Token full
Token ID1336
Feature activation+1.11e-01

Pos logit contributions
 gas+0.001
 snowball+0.001
 Anna+0.001
 pill+0.001
 minutes+0.001
Neg logit contributions
epad-0.002
ho-0.002
chard-0.001
abby-0.001
enna-0.001
Token of
Token ID286
Feature activation-1.10e+00

Pos logit contributions
 mat+0.002
 wipes+0.002
 pill+0.002
 snowball+0.002
uddle+0.001
Neg logit contributions
 known-0.004
chard-0.004
epad-0.004
thank-0.004
working-0.004
Token life
Token ID1204
Feature activation+3.23e-01

Pos logit contributions
chard+0.037
ho+0.036
thank+0.034
abby+0.034
epad+0.034
Neg logit contributions
 wipes-0.021
 pill-0.021
 snowball-0.021
 pills-0.020
 gas-0.020
Token.
Token ID13
Feature activation-1.96e-02

Pos logit contributions
 mat+0.008
 snowball+0.007
 mats+0.007
 wipes+0.007
uddle+0.007
Neg logit contributions
 known-0.010
chard-0.010
epad-0.009
ho-0.009
thank-0.009
Token Everywhere
Token ID48201
Feature activation-3.39e-01

Pos logit contributions
chard+0.001
 known+0.001
ho+0.001
epad+0.001
thank+0.001
Neg logit contributions
 mat-0.000
 snowball-0.000
 mats-0.000
 wipes-0.000
 ramp-0.000
Token you
Token ID345
Feature activation-3.02e+00

Pos logit contributions
chard+0.009
epad+0.009
 known+0.009
year+0.009
thank+0.009
Neg logit contributions
 mat-0.008
 wipes-0.008
 snowball-0.008
 Velvet-0.008
 turns-0.008
Token looked
Token ID3114
Feature activation-4.65e-01

Pos logit contributions
chard+0.075
 known+0.073
epad+0.070
ho+0.064
melon+0.064
Neg logit contributions
 wipes-0.085
 snowball-0.081
 mat-0.080
 turns-0.079
ig-0.078
Token there
Token ID612
Feature activation+7.90e-02

Pos logit contributions
chard+0.013
 known+0.013
epad+0.012
thank+0.012
year+0.012
Neg logit contributions
 mat-0.011
 wipes-0.011
 snowball-0.010
uddle-0.010
 mats-0.010
Token were
Token ID547
Feature activation-1.12e+00

Pos logit contributions
 mat+0.002
 wipes+0.002
 turns+0.002
 mats+0.002
 tries+0.002
Neg logit contributions
chard-0.002
epad-0.002
year-0.002
thank-0.002
 known-0.002
Token crazy
Token ID7165
Feature activation+1.41e-01

Pos logit contributions
epad+0.031
chard+0.031
thank+0.031
ully+0.030
ho+0.030
Neg logit contributions
 gas-0.025
 turns-0.023
 pill-0.022
 mats-0.022
 pills-0.022
Token creatures
Token ID8109
Feature activation-7.74e-01

Pos logit contributions
 wipes+0.003
 snowball+0.003
 mat+0.003
 mats+0.003
uddle+0.003
Neg logit contributions
chard-0.004
 known-0.004
thank-0.004
ho-0.004
epad-0.004
Token a
Token ID257
Feature activation+9.44e-02

Pos logit contributions
 snowball+0.008
 mat+0.008
 wipes+0.008
 Velvet+0.008
 mats+0.007
Neg logit contributions
chard-0.008
 known-0.007
ho-0.007
epad-0.007
thank-0.007
Token hairy
Token ID43873
Feature activation+2.88e-01

Pos logit contributions
 mat+0.002
 snowball+0.002
 wipes+0.002
uddle+0.002
 Velvet+0.002
Neg logit contributions
chard-0.003
 known-0.003
epad-0.002
ho-0.002
thank-0.002
Token palace
Token ID20562
Feature activation-5.31e-01

Pos logit contributions
 mat+0.005
 snowball+0.005
 wipes+0.005
uddle+0.005
 Velvet+0.005
Neg logit contributions
chard-0.010
 known-0.010
epad-0.009
year-0.009
thank-0.009
Token.
Token ID13
Feature activation+2.64e-01

Pos logit contributions
epad+0.020
 known+0.019
chard+0.018
 comparison+0.018
years+0.018
Neg logit contributions
 mat-0.009
uddle-0.008
ig-0.007
 tent-0.007
 snowball-0.006
Token Everywhere
Token ID48201
Feature activation-2.83e-02

Pos logit contributions
 snowball+0.006
 mat+0.005
 wipes+0.005
 Velvet+0.005
 mats+0.005
Neg logit contributions
chard-0.008
 known-0.008
ho-0.008
epad-0.008
melon-0.008
Token you
Token ID345
Feature activation-3.01e+00

Pos logit contributions
chard+0.001
 known+0.001
epad+0.001
thank+0.001
year+0.001
Neg logit contributions
 mat-0.001
 snowball-0.001
 wipes-0.001
 Velvet-0.001
uddle-0.001
Token looked
Token ID3114
Feature activation-6.22e-01

Pos logit contributions
chard+0.076
 known+0.075
epad+0.068
melon+0.066
ho+0.066
Neg logit contributions
 wipes-0.084
 snowball-0.081
 mat-0.080
ig-0.078
 turns-0.077
Token there
Token ID612
Feature activation-4.94e-02

Pos logit contributions
chard+0.019
 known+0.019
epad+0.018
thank+0.018
year+0.017
Neg logit contributions
 mat-0.013
 wipes-0.013
 snowball-0.013
uddle-0.012
ig-0.012
Token were
Token ID547
Feature activation-1.56e+00

Pos logit contributions
chard+0.002
epad+0.001
year+0.001
thank+0.001
melon+0.001
Neg logit contributions
 wipes-0.001
 mat-0.001
 turns-0.001
 tries-0.001
 mats-0.001
Token furry
Token ID40857
Feature activation+6.99e-01

Pos logit contributions
thank+0.044
chard+0.044
epad+0.043
ully+0.042
working+0.042
Neg logit contributions
 pill-0.035
 gas-0.033
 pills-0.032
 snowball-0.032
 wipes-0.032
Token walls
Token ID7714
Feature activation-1.23e+00

Pos logit contributions
 snowball+0.017
 mat+0.016
 wipes+0.016
 mats+0.015
uddle+0.015
Neg logit contributions
chard-0.020
 known-0.020
epad-0.019
thank-0.019
ho-0.018
Token.
Token ID13
Feature activation+3.95e-01

Pos logit contributions
chard+0.028
 known+0.026
epad+0.025
ho+0.024
thank+0.024
Neg logit contributions
 mat-0.029
 snowball-0.029
 wipes-0.029
 Velvet-0.027
Anna-0.027
Token Lily
Token ID20037
Feature activation-4.75e-01

Pos logit contributions
 snowball+0.013
 mat+0.013
 wipes+0.012
uddle+0.012
 ramp+0.012
Neg logit contributions
chard-0.008
 known-0.007
epad-0.007
ho-0.007
 full-0.007
Token loved
Token ID6151
Feature activation+3.94e-01

Pos logit contributions
chard+0.008
 known+0.007
epad+0.007
ho+0.006
year+0.006
Neg logit contributions
 mat-0.017
 snowball-0.017
 wipes-0.017
 Velvet-0.016
uddle-0.016
Token to
Token ID284
Feature activation+8.36e-01

Pos logit contributions
 mat+0.010
 snowball+0.010
 wipes+0.010
 Velvet+0.009
 mats+0.009
Neg logit contributions
chard-0.011
 known-0.010
epad-0.010
year-0.010
ho-0.009
Token eat
Token ID4483
Feature activation+2.70e-01

Pos logit contributions
 mat+0.022
 wipes+0.021
 snowball+0.020
 Velvet+0.019
 mats+0.019
Neg logit contributions
chard-0.024
year-0.023
 known-0.023
ho-0.023
spe-0.022
Token av
Token ID1196
Feature activation-3.00e+00

Pos logit contributions
 mat+0.006
 snowball+0.006
 wipes+0.006
 Velvet+0.006
uddle+0.006
Neg logit contributions
chard-0.008
 known-0.008
epad-0.008
year-0.007
ho-0.007
Tokenoc
Token ID420
Feature activation+1.44e+00

Pos logit contributions
chard+0.098
 known+0.094
thank+0.089
epad+0.089
ho+0.089
Neg logit contributions
 mat-0.060
 snowball-0.058
 wipes-0.058
uddle-0.054
 ramp-0.054
Tokenados
Token ID22484
Feature activation-5.34e-01

Pos logit contributions
 wipes+0.037
 mat+0.036
 snowball+0.035
 Velvet+0.035
uddle+0.035
Neg logit contributions
chard-0.038
 known-0.037
epad-0.035
year-0.034
 full-0.034
Token,
Token ID11
Feature activation-5.00e-02

Pos logit contributions
 known+0.015
chard+0.014
year+0.014
epad+0.014
ho+0.013
Neg logit contributions
 mat-0.013
 Velvet-0.012
 mats-0.012
 snowball-0.012
uddle-0.012
Token they
Token ID484
Feature activation-2.70e-01

Pos logit contributions
year+0.001
epad+0.001
spe+0.001
 known+0.001
ho+0.001
Neg logit contributions
 mat-0.001
 snowball-0.001
 Velvet-0.001
 wipes-0.001
 mats-0.001
Token were
Token ID547
Feature activation-1.17e+00

Pos logit contributions
epad+0.008
ull+0.007
enna+0.007
spe+0.007
year+0.006
Neg logit contributions
 turns-0.005
 pill-0.005
 wipes-0.005
 tries-0.005
 Velvet-0.005
Token remembered
Token ID12086
Feature activation-6.12e-01

Pos logit contributions
chard+0.004
epad+0.004
thank+0.004
 known+0.004
year+0.003
Neg logit contributions
 snowball-0.005
 wipes-0.005
 tries-0.005
 turns-0.005
 mat-0.004
Token his
Token ID465
Feature activation-6.68e-01

Pos logit contributions
chard+0.020
thank+0.019
year+0.019
epad+0.019
ho+0.019
Neg logit contributions
 wipes-0.011
 mat-0.010
 snowball-0.010
 tries-0.010
 gas-0.010
Token favourite
Token ID12507
Feature activation-9.24e-01

Pos logit contributions
chard+0.020
 known+0.019
year+0.019
thank+0.018
epad+0.018
Neg logit contributions
 mat-0.017
 snowball-0.016
uddle-0.016
 wipes-0.015
 mats-0.015
Token movie
Token ID3807
Feature activation-1.33e+00

Pos logit contributions
chard+0.030
 known+0.029
year+0.027
ho+0.027
epad+0.027
Neg logit contributions
 mat-0.022
 snowball-0.021
 wipes-0.020
 mats-0.019
uddle-0.019
Token,
Token ID11
Feature activation+1.53e-02

Pos logit contributions
chard+0.043
 known+0.042
year+0.039
nam+0.038
years+0.038
Neg logit contributions
 mat-0.030
 snowball-0.027
 ramp-0.026
 mats-0.026
 Velvet-0.025
Token '
Token ID705
Feature activation-2.97e+00

Pos logit contributions
 snowball+0.000
 mat+0.000
 wipes+0.000
 Velvet+0.000
 mats+0.000
Neg logit contributions
chard-0.000
ho-0.000
 known-0.000
thank-0.000
year-0.000
TokenT
Token ID51
Feature activation-5.35e-02

Pos logit contributions
chard+0.078
epad+0.077
 image+0.076
melon+0.074
 known+0.073
Neg logit contributions
 snowball-0.075
Anna-0.070
 turns-0.068
ara-0.067
Whe-0.065
Tokenangled
Token ID22393
Feature activation+5.80e-01

Pos logit contributions
chard+0.002
 known+0.001
nam+0.001
ho+0.001
 full+0.001
Neg logit contributions
 mat-0.001
ara-0.001
 snowball-0.001
uddle-0.001
 wipes-0.001
Token'.
Token ID4458
Feature activation-5.36e-01

Pos logit contributions
 mat+0.012
 snowball+0.012
 Velvet+0.011
 wipes+0.011
ig+0.010
Neg logit contributions
 known-0.019
chard-0.019
epad-0.018
thank-0.017
year-0.017
Token He
Token ID679
Feature activation-4.13e-02

Pos logit contributions
chard+0.015
 known+0.014
epad+0.013
ho+0.013
thank+0.013
Neg logit contributions
 mat-0.013
 snowball-0.013
 wipes-0.013
 Velvet-0.012
uddle-0.012
Token got
Token ID1392
Feature activation-1.10e+00

Pos logit contributions
chard+0.001
epad+0.001
thank+0.001
 known+0.001
melon+0.001
Neg logit contributions
 snowball-0.001
 wipes-0.001
 mat-0.001
 tries-0.001
 turns-0.001
Token day
Token ID1110
Feature activation+8.66e-01

Pos logit contributions
chard+0.006
 known+0.005
epad+0.005
year+0.005
ho+0.005
Neg logit contributions
 mat-0.013
 snowball-0.013
 wipes-0.013
 Velvet-0.013
ig-0.012
Token on
Token ID319
Feature activation+4.51e-01

Pos logit contributions
 mat+0.028
 snowball+0.028
 wipes+0.028
uddle+0.027
 ramp+0.027
Neg logit contributions
chard-0.017
 known-0.016
epad-0.015
ho-0.015
year-0.015
Token,
Token ID11
Feature activation-9.60e-01

Pos logit contributions
 snowball+0.014
 ramp+0.014
 mat+0.013
 wipes+0.013
uddle+0.013
Neg logit contributions
chard-0.009
 full-0.009
 known-0.008
ho-0.008
 often-0.008
Token Lily
Token ID20037
Feature activation-4.50e-01

Pos logit contributions
chard+0.033
 known+0.031
epad+0.030
ho+0.029
thank+0.029
Neg logit contributions
 mat-0.018
 snowball-0.018
 wipes-0.018
uddle-0.017
 Velvet-0.016
Token loved
Token ID6151
Feature activation+1.80e-01

Pos logit contributions
chard+0.010
epad+0.009
year+0.009
nam+0.008
 known+0.008
Neg logit contributions
 wipes-0.014
 mat-0.014
 snowball-0.014
 Velvet-0.014
 tries-0.013
Token av
Token ID1196
Feature activation-2.97e+00

Pos logit contributions
 mat+0.005
 snowball+0.005
 wipes+0.005
 Velvet+0.005
 mats+0.005
Neg logit contributions
chard-0.005
 known-0.004
epad-0.004
year-0.004
ho-0.004
Tokenoc
Token ID420
Feature activation+8.55e-01

Pos logit contributions
chard+0.098
 known+0.094
thank+0.090
ho+0.090
epad+0.090
Neg logit contributions
 mat-0.058
 snowball-0.056
 wipes-0.056
uddle-0.052
 ramp-0.052
Tokenados
Token ID22484
Feature activation-5.68e-01

Pos logit contributions
 wipes+0.022
 mat+0.021
 snowball+0.021
uddle+0.020
ig+0.020
Neg logit contributions
chard-0.024
 known-0.023
epad-0.021
year-0.021
 full-0.021
Token and
Token ID290
Feature activation-7.64e-01

Pos logit contributions
chard+0.017
 known+0.016
year+0.016
epad+0.016
ho+0.015
Neg logit contributions
 mat-0.013
 snowball-0.012
 Velvet-0.012
ig-0.012
 mats-0.011
Token would
Token ID561
Feature activation-1.41e+00

Pos logit contributions
chard+0.018
year+0.016
epad+0.015
abby+0.015
amer+0.015
Neg logit contributions
 wipes-0.023
 tries-0.022
 turns-0.021
 mat-0.020
 Velvet-0.020
Token ask
Token ID1265
Feature activation+5.50e-01

Pos logit contributions
year+0.028
chard+0.028
epad+0.024
 addressed+0.023
spe+0.023
Neg logit contributions
 mat-0.047
 wipes-0.046
 snowball-0.045
 Velvet-0.044
 tries-0.043
Token '
Token ID705
Feature activation-2.82e+00

Pos logit contributions
chard+0.027
 known+0.025
ho+0.024
epad+0.024
year+0.023
Neg logit contributions
 wipes-0.018
 mat-0.018
 snowball-0.016
 Velvet-0.016
 mats-0.016
Tokenbirth
Token ID24280
Feature activation-2.99e-01

Pos logit contributions
chard+0.070
 image+0.070
melon+0.068
epad+0.068
 unlike+0.066
Neg logit contributions
ara-0.069
Anna-0.068
 snowball-0.065
 turns-0.064
 wipes-0.062
Tokenday
Token ID820
Feature activation+1.80e-01

Pos logit contributions
chard+0.007
epad+0.006
 known+0.006
ho+0.006
 unlike+0.006
Neg logit contributions
 snowball-0.009
 mat-0.009
 wipes-0.009
Anna-0.008
uddle-0.008
Token',
Token ID3256
Feature activation-8.01e-01

Pos logit contributions
 mat+0.004
 wipes+0.004
ara+0.003
 turns+0.003
Anna+0.003
Neg logit contributions
chard-0.006
 known-0.006
year-0.005
ho-0.005
years-0.005
Token and
Token ID290
Feature activation-5.70e-01

Pos logit contributions
chard+0.026
 known+0.025
ho+0.024
epad+0.024
year+0.023
Neg logit contributions
 wipes-0.018
 mat-0.016
ara-0.015
 turns-0.015
 mats-0.014
Token '
Token ID705
Feature activation-2.96e+00

Pos logit contributions
 known+0.018
chard+0.017
ully+0.017
epad+0.017
ho+0.016
Neg logit contributions
 wipes-0.013
 tries-0.012
 mat-0.012
 turns-0.012
Anna-0.010
Tokenyou
Token ID5832
Feature activation+8.26e-01

Pos logit contributions
 image+0.074
chard+0.073
 unlike+0.072
epad+0.071
melon+0.071
Neg logit contributions
Anna-0.071
ara-0.070
 snowball-0.068
 turns-0.068
 wipes-0.065
Token'.
Token ID4458
Feature activation-2.67e-01

Pos logit contributions
 wipes+0.015
 turns+0.015
 tries+0.013
 mat+0.013
Anna+0.012
Neg logit contributions
chard-0.030
ho-0.026
year-0.025
thank-0.025
epad-0.025
Token They
Token ID1119
Feature activation+1.00e+00

Pos logit contributions
chard+0.007
 known+0.007
year+0.006
epad+0.006
thank+0.006
Neg logit contributions
 mat-0.007
 snowball-0.007
uddle-0.007
 ramp-0.007
ig-0.007
Token decided
Token ID3066
Feature activation-8.71e-01

Pos logit contributions
 wipes+0.026
 turns+0.024
 tries+0.023
 wraps+0.022
Anna+0.021
Neg logit contributions
chard-0.030
epad-0.026
year-0.025
 unlike-0.024
melon-0.024
Token to
Token ID284
Feature activation+8.78e-01

Pos logit contributions
chard+0.028
 known+0.026
year+0.026
years+0.025
ho+0.025
Neg logit contributions
 mat-0.018
 wipes-0.018
 turns-0.016
 Velvet-0.016
 mats-0.015
Token Tim
Token ID5045
Feature activation+6.36e-03

Pos logit contributions
 snowball+0.010
 mat+0.009
 wipes+0.009
 ramp+0.009
 Velvet+0.009
Neg logit contributions
chard-0.011
 known-0.011
epad-0.010
year-0.010
thank-0.010
Tokenmy
Token ID1820
Feature activation-5.06e-01

Pos logit contributions
 snowball+0.000
 mat+0.000
 wipes+0.000
 Velvet+0.000
uddle+0.000
Neg logit contributions
chard-0.000
 known-0.000
epad-0.000
ho-0.000
year-0.000
Token loved
Token ID6151
Feature activation+1.99e-01

Pos logit contributions
chard+0.009
 known+0.008
epad+0.008
year+0.008
ho+0.007
Neg logit contributions
 snowball-0.018
 mat-0.018
 wipes-0.017
 Velvet-0.017
Anna-0.017
Token to
Token ID284
Feature activation+5.61e-01

Pos logit contributions
 snowball+0.005
 mat+0.005
 wipes+0.005
 Velvet+0.005
 mats+0.005
Neg logit contributions
chard-0.005
 known-0.005
epad-0.005
year-0.005
ho-0.005
Token eat
Token ID4483
Feature activation+2.93e-01

Pos logit contributions
 mat+0.014
 snowball+0.014
 wipes+0.014
 ramp+0.013
 mats+0.013
Neg logit contributions
chard-0.016
 known-0.016
year-0.016
ho-0.015
spe-0.014
Token av
Token ID1196
Feature activation-2.96e+00

Pos logit contributions
 mat+0.007
 snowball+0.007
 wipes+0.007
uddle+0.006
 Velvet+0.006
Neg logit contributions
chard-0.009
 known-0.008
epad-0.008
ho-0.008
thank-0.008
Tokenoc
Token ID420
Feature activation+1.72e+00

Pos logit contributions
chard+0.099
 known+0.095
epad+0.091
thank+0.090
ho+0.090
Neg logit contributions
 mat-0.057
 snowball-0.055
 wipes-0.054
uddle-0.051
 Velvet-0.050
Tokenados
Token ID22484
Feature activation-4.58e-01

Pos logit contributions
 wipes+0.046
 mat+0.045
 snowball+0.045
uddle+0.044
 Velvet+0.043
Neg logit contributions
chard-0.044
 known-0.042
epad-0.039
year-0.039
 full-0.039
Token,
Token ID11
Feature activation-3.22e-01

Pos logit contributions
 known+0.013
chard+0.013
year+0.013
epad+0.012
ho+0.012
Neg logit contributions
 mat-0.011
 snowball-0.010
 Velvet-0.010
 mats-0.010
uddle-0.010
Token they
Token ID484
Feature activation-6.06e-01

Pos logit contributions
year+0.008
ho+0.007
 known+0.007
epad+0.007
thank+0.007
Neg logit contributions
 snowball-0.009
 mat-0.009
 wipes-0.009
 mats-0.008
 ramp-0.008
Token were
Token ID547
Feature activation-1.30e+00

Pos logit contributions
epad+0.017
enna+0.015
year+0.015
ull+0.015
 generation+0.015
Neg logit contributions
 pill-0.012
 turns-0.012
 tries-0.012
 can-0.011
 snowball-0.011
Token B
Token ID347
Feature activation+1.13e+00

Pos logit contributions
chard+0.005
 known+0.005
epad+0.005
thank+0.005
ho+0.005
Neg logit contributions
 snowball-0.005
 mat-0.005
 wipes-0.005
 Velvet-0.005
uddle-0.005
Tokenongo
Token ID25162
Feature activation+4.29e-01

Pos logit contributions
 mat+0.035
 snowball+0.034
 wipes+0.034
uddle+0.033
 Velvet+0.033
Neg logit contributions
chard-0.025
 known-0.023
epad-0.021
ho-0.021
thank-0.021
Token loved
Token ID6151
Feature activation+1.31e-01

Pos logit contributions
 snowball+0.014
 wipes+0.014
 mat+0.013
 Velvet+0.013
ig+0.013
Neg logit contributions
chard-0.009
epad-0.009
 known-0.008
year-0.008
thank-0.007
Token to
Token ID284
Feature activation+2.92e-01

Pos logit contributions
 snowball+0.003
 mat+0.003
 wipes+0.003
 Velvet+0.003
 mats+0.003
Neg logit contributions
chard-0.004
 known-0.003
year-0.003
epad-0.003
ho-0.003
Token eat
Token ID4483
Feature activation+2.62e-01

Pos logit contributions
 mat+0.007
 snowball+0.007
 wipes+0.007
 ramp+0.006
 mats+0.006
Neg logit contributions
year-0.009
chard-0.009
 known-0.009
spe-0.008
epad-0.008
Token av
Token ID1196
Feature activation-2.95e+00

Pos logit contributions
 mat+0.006
 snowball+0.006
 wipes+0.006
uddle+0.006
 Velvet+0.006
Neg logit contributions
chard-0.008
 known-0.007
epad-0.007
ho-0.007
thank-0.007
Tokenoc
Token ID420
Feature activation+1.95e+00

Pos logit contributions
chard+0.098
 known+0.094
epad+0.089
thank+0.089
ho+0.088
Neg logit contributions
 mat-0.058
 snowball-0.056
 wipes-0.056
uddle-0.052
 Velvet-0.052
Tokenados
Token ID22484
Feature activation-4.80e-01

Pos logit contributions
 wipes+0.051
 snowball+0.050
 mat+0.050
uddle+0.049
 Velvet+0.048
Neg logit contributions
chard-0.052
 known-0.049
epad-0.046
year-0.046
 full-0.045
Token.
Token ID13
Feature activation-9.21e-01

Pos logit contributions
 known+0.013
year+0.013
chard+0.013
epad+0.012
ho+0.011
Neg logit contributions
 mat-0.012
 snowball-0.011
 Velvet-0.011
 mats-0.011
uddle-0.011
Token One
Token ID1881
Feature activation-7.78e-01

Pos logit contributions
chard+0.018
 known+0.017
 full+0.016
ho+0.015
thank+0.015
Neg logit contributions
 mat-0.030
 wipes-0.029
 snowball-0.029
uddle-0.028
 tries-0.028
Token day
Token ID1110
Feature activation+6.35e-01

Pos logit contributions
chard+0.015
epad+0.015
 known+0.014
year+0.013
thank+0.012
Neg logit contributions
 snowball-0.027
 mat-0.026
Anna-0.024
ara-0.024
ig-0.024
Token saw
Token ID2497
Feature activation+4.16e-01

Pos logit contributions
chard+0.001
 known+0.001
epad+0.001
year+0.001
thank+0.001
Neg logit contributions
 wipes-0.001
 mat-0.001
 snowball-0.001
 tries-0.001
uddle-0.001
Token a
Token ID257
Feature activation-2.95e-01

Pos logit contributions
 snowball+0.010
 wipes+0.010
 mat+0.009
 Velvet+0.009
 mats+0.009
Neg logit contributions
chard-0.012
epad-0.011
thank-0.011
 known-0.011
ho-0.011
Token picture
Token ID4286
Feature activation+4.87e-01

Pos logit contributions
 known+0.008
chard+0.008
thank+0.007
year+0.007
epad+0.007
Neg logit contributions
 mat-0.008
 snowball-0.008
uddle-0.007
 wipes-0.007
 ramp-0.007
Token of
Token ID286
Feature activation-5.52e-01

Pos logit contributions
 mat+0.010
 snowball+0.008
 Velvet+0.008
 mats+0.008
 wipes+0.008
Neg logit contributions
 known-0.018
chard-0.018
nam-0.016
thank-0.016
epad-0.016
Token a
Token ID257
Feature activation+5.02e-02

Pos logit contributions
chard+0.016
ho+0.016
year+0.016
 known+0.015
spe+0.015
Neg logit contributions
 mat-0.012
 snowball-0.012
 wipes-0.012
 Velvet-0.012
 mats-0.012
Token hamb
Token ID41478
Feature activation-2.95e+00

Pos logit contributions
 mat+0.001
 snowball+0.001
uddle+0.001
 ramp+0.001
 wipes+0.001
Neg logit contributions
 known-0.001
chard-0.001
year-0.001
thank-0.001
epad-0.001
Tokenurger
Token ID32650
Feature activation-9.41e-01

Pos logit contributions
chard+0.077
 known+0.073
year+0.065
epad+0.063
thank+0.062
Neg logit contributions
 snowball-0.083
 mat-0.082
uddle-0.080
 wipes-0.080
ara-0.076
Token and
Token ID290
Feature activation+3.34e-01

Pos logit contributions
 known+0.035
chard+0.032
year+0.031
years+0.030
nam+0.030
Neg logit contributions
 mat-0.019
 snowball-0.016
 tent-0.015
 pill-0.015
 gas-0.015
Token fries
Token ID31757
Feature activation-1.07e+00

Pos logit contributions
 wipes+0.008
 snowball+0.008
 mat+0.008
 mats+0.008
 turns+0.008
Neg logit contributions
chard-0.009
ho-0.009
 known-0.009
year-0.009
epad-0.009
Token,
Token ID11
Feature activation+5.93e-01

Pos logit contributions
chard+0.038
spe+0.037
 known+0.037
 generation+0.037
year+0.036
Neg logit contributions
 mat-0.017
 wipes-0.016
 Velvet-0.016
 gas-0.015
 pill-0.015
Token and
Token ID290
Feature activation-2.83e-01

Pos logit contributions
 snowball+0.010
 wipes+0.010
 mat+0.010
 mats+0.009
 Velvet+0.009
Neg logit contributions
chard-0.021
 known-0.021
ho-0.021
year-0.020
epad-0.020
Token.
Token ID13
Feature activation+3.63e-01

Pos logit contributions
 known+0.031
chard+0.030
year+0.028
 messenger+0.028
 generation+0.027
Neg logit contributions
 mat-0.012
 wipes-0.011
 snowball-0.011
 Velvet-0.011
ig-0.010
Token John
Token ID1757
Feature activation+1.50e-01

Pos logit contributions
 wipes+0.008
 mat+0.007
 snowball+0.007
 Velvet+0.007
ara+0.007
Neg logit contributions
chard-0.012
 known-0.011
ho-0.011
epad-0.011
thank-0.011
Token felt
Token ID2936
Feature activation-3.61e-01

Pos logit contributions
 wipes+0.004
 snowball+0.004
 tries+0.004
ara+0.004
 turns+0.003
Neg logit contributions
chard-0.004
epad-0.004
year-0.004
 known-0.004
nam-0.004
Token jealous
Token ID19354
Feature activation+1.41e-01

Pos logit contributions
chard+0.009
ho+0.008
epad+0.008
 known+0.008
thank+0.007
Neg logit contributions
 wipes-0.011
 snowball-0.010
 mat-0.010
 Velvet-0.010
 tries-0.009
Token.
Token ID13
Feature activation-7.38e-01

Pos logit contributions
 snowball+0.004
 mat+0.004
 wipes+0.004
 Velvet+0.004
uddle+0.004
Neg logit contributions
chard-0.003
 known-0.003
epad-0.003
 full-0.003
ho-0.003
Token '
Token ID705
Feature activation-2.94e+00

Pos logit contributions
chard+0.021
 known+0.020
epad+0.020
ho+0.020
thank+0.019
Neg logit contributions
 wipes-0.017
 mat-0.017
 snowball-0.017
 Velvet-0.016
Anna-0.016
TokenThat
Token ID2504
Feature activation-6.73e-01

Pos logit contributions
chard+0.085
epad+0.082
melon+0.081
ho+0.079
 known+0.078
Neg logit contributions
 snowball-0.067
Anna-0.066
ara-0.062
 wipes-0.062
 turns-0.061
Token's
Token ID338
Feature activation+5.61e-02

Pos logit contributions
chard+0.025
year+0.023
 known+0.023
epad+0.022
years+0.022
Neg logit contributions
 mat-0.012
 snowball-0.012
 wipes-0.012
uddle-0.011
 turns-0.010
Token not
Token ID407
Feature activation-4.20e-01

Pos logit contributions
 snowball+0.001
 Velvet+0.001
 mat+0.001
 wipes+0.001
 knot+0.001
Neg logit contributions
ho-0.002
chard-0.002
year-0.002
thank-0.001
epad-0.001
Token fair
Token ID3148
Feature activation-1.99e-01

Pos logit contributions
chard+0.013
ho+0.013
year+0.011
 known+0.011
thank+0.011
Neg logit contributions
 snowball-0.009
 wipes-0.009
 mat-0.009
 turns-0.009
 Velvet-0.009
Token!'
Token ID13679
Feature activation-7.67e-01

Pos logit contributions
chard+0.006
 known+0.006
year+0.006
thank+0.006
 ruled+0.005
Neg logit contributions
 mat-0.004
 wipes-0.004
uddle-0.004
 snowball-0.004
ig-0.004
Token two
Token ID734
Feature activation-8.08e-01

Pos logit contributions
year+0.018
chard+0.017
epad+0.017
years+0.016
ho+0.016
Neg logit contributions
 snowball-0.014
 gas-0.013
uddle-0.013
 ramp-0.012
 mat-0.012
Token friends
Token ID2460
Feature activation-2.52e-01

Pos logit contributions
chard+0.023
 known+0.021
epad+0.021
thank+0.020
ho+0.020
Neg logit contributions
 snowball-0.020
 mat-0.020
 wipes-0.019
uddle-0.019
 Velvet-0.018
Token had
Token ID550
Feature activation-3.12e-01

Pos logit contributions
chard+0.007
epad+0.006
thank+0.006
 known+0.006
nam+0.005
Neg logit contributions
 wipes-0.007
 turns-0.007
 tries-0.007
uddle-0.007
ig-0.006
Token to
Token ID284
Feature activation+1.23e+00

Pos logit contributions
chard+0.009
year+0.008
epad+0.008
ho+0.008
 known+0.008
Neg logit contributions
 snowball-0.008
 mat-0.008
 wipes-0.008
 turns-0.007
 mats-0.007
Token take
Token ID1011
Feature activation+6.06e-01

Pos logit contributions
 snowball+0.030
 mat+0.030
 gas+0.029
 ramp+0.028
 wipes+0.028
Neg logit contributions
chard-0.036
year-0.036
years-0.034
ho-0.033
epad-0.033
Token shelter
Token ID11772
Feature activation+2.40e+00

Pos logit contributions
 snowball+0.014
 mat+0.014
 wipes+0.013
 Velvet+0.013
uddle+0.012
Neg logit contributions
chard-0.019
 known-0.017
epad-0.017
ho-0.017
thank-0.016
Token inside
Token ID2641
Feature activation+3.67e-01

Pos logit contributions
 mat+0.066
 snowball+0.065
 wipes+0.063
uddle+0.062
 mats+0.061
Neg logit contributions
chard-0.062
 known-0.059
epad-0.056
year-0.054
ho-0.054
Token.
Token ID13
Feature activation+4.53e-01

Pos logit contributions
 mat+0.009
 snowball+0.008
 wipes+0.008
 mats+0.008
 Velvet+0.007
Neg logit contributions
chard-0.011
 known-0.010
ho-0.010
thank-0.010
epad-0.010
Token It
Token ID632
Feature activation-2.56e-01

Pos logit contributions
 mat+0.012
 wipes+0.011
 snowball+0.011
 mats+0.011
 turns+0.011
Neg logit contributions
chard-0.012
 known-0.012
ho-0.012
epad-0.011
thank-0.011
Token was
Token ID373
Feature activation+3.92e-01

Pos logit contributions
chard+0.007
epad+0.007
 known+0.007
year+0.006
thank+0.006
Neg logit contributions
 snowball-0.007
 mat-0.006
 wipes-0.006
 ramp-0.006
uddle-0.006
Token storm
Token ID6388
Feature activation+1.08e+00

Pos logit contributions
 snowball+0.011
 gas+0.010
 pill+0.009
 mat+0.009
 Velvet+0.009
Neg logit contributions
chard-0.011
ho-0.011
epad-0.010
abby-0.010
year-0.010
Token shouts
Token ID34757
Feature activation-3.76e-02

Pos logit contributions
 snowball+0.001
 wipes+0.001
 mat+0.001
 Velvet+0.001
 tries+0.001
Neg logit contributions
chard-0.000
epad-0.000
 known-0.000
thank-0.000
year-0.000
Token.
Token ID13
Feature activation+8.56e-02

Pos logit contributions
chard+0.001
 known+0.001
ho+0.001
epad+0.001
 full+0.001
Neg logit contributions
 mat-0.001
 snowball-0.001
 wipes-0.001
 ramp-0.001
uddle-0.001
Token
Token ID198
Feature activation+4.11e-01

Pos logit contributions
 mat+0.002
 snowball+0.002
 wipes+0.002
 Velvet+0.002
uddle+0.002
Neg logit contributions
chard-0.002
 known-0.002
epad-0.002
ho-0.002
thank-0.002
Token
Token ID198
Feature activation+7.03e-01

Pos logit contributions
 mat+0.012
 wipes+0.011
ig+0.011
 snowball+0.011
uddle+0.011
Neg logit contributions
 known-0.010
chard-0.009
 full-0.009
ho-0.009
year-0.009
Token"
Token ID1
Feature activation+6.18e-02

Pos logit contributions
 mat+0.018
 wipes+0.017
 snowball+0.017
uddle+0.017
ig+0.016
Neg logit contributions
chard-0.020
 known-0.019
year-0.018
ho-0.018
 full-0.018
TokenWho
Token ID8241
Feature activation+2.83e+00

Pos logit contributions
 mat+0.002
uddle+0.001
 wipes+0.001
ig+0.001
 snowball+0.001
Neg logit contributions
chard-0.002
 known-0.002
thank-0.002
ho-0.002
 decided-0.002
Token rang
Token ID28077
Feature activation+8.78e-01

Pos logit contributions
 mat+0.071
 snowball+0.070
 wipes+0.068
uddle+0.065
 Velvet+0.065
Neg logit contributions
chard-0.078
 known-0.074
epad-0.070
ho-0.069
thank-0.069
Token the
Token ID262
Feature activation+5.31e-01

Pos logit contributions
 mat+0.018
 snowball+0.018
 wipes+0.017
 Velvet+0.017
uddle+0.016
Neg logit contributions
 known-0.028
chard-0.028
epad-0.027
year-0.026
thank-0.026
Token bell
Token ID8966
Feature activation+6.02e-01

Pos logit contributions
 snowball+0.012
 mat+0.012
 wipes+0.012
 ramp+0.011
uddle+0.011
Neg logit contributions
chard-0.017
 known-0.015
year-0.015
epad-0.015
years-0.014
Token?
Token ID30
Feature activation+8.93e-01

Pos logit contributions
 mat+0.011
 snowball+0.011
ig+0.010
uddle+0.010
 ramp+0.010
Neg logit contributions
chard-0.021
 known-0.020
year-0.020
epad-0.019
years-0.019
Token Who
Token ID5338
Feature activation+1.29e+00

Pos logit contributions
 wipes+0.017
 snowball+0.017
 mat+0.017
 Velvet+0.017
 ramp+0.016
Neg logit contributions
chard-0.030
 known-0.028
thank-0.027
epad-0.027
ho-0.027
Token time
Token ID640
Feature activation+9.30e-01

Pos logit contributions
 Velvet+0.002
 Anna+0.002
 snowball+0.002
 gas+0.002
 minutes+0.002
Neg logit contributions
ho-0.005
chard-0.004
year-0.004
spe-0.004
years-0.004
Token to
Token ID284
Feature activation+1.17e+00

Pos logit contributions
 mat+0.018
 wipes+0.017
 Velvet+0.015
 tries+0.015
 snowball+0.015
Neg logit contributions
chard-0.031
 known-0.029
thank-0.029
ho-0.028
 image-0.027
Token go
Token ID467
Feature activation+3.58e-01

Pos logit contributions
 mat+0.029
 snowball+0.028
 wipes+0.028
ara+0.027
 gas+0.027
Neg logit contributions
chard-0.036
year-0.035
years-0.033
 unlike-0.032
epad-0.031
Token home
Token ID1363
Feature activation+1.79e+00

Pos logit contributions
 mat+0.012
 snowball+0.012
 wipes+0.012
 Velvet+0.011
uddle+0.011
Neg logit contributions
chard-0.007
 known-0.006
epad-0.006
thank-0.006
ho-0.006
Token.
Token ID13
Feature activation+9.72e-01

Pos logit contributions
 mat+0.039
 wipes+0.036
uddle+0.035
 snowball+0.035
 Velvet+0.035
Neg logit contributions
chard-0.055
 known-0.051
ho-0.051
year-0.050
melon-0.049
Token As
Token ID1081
Feature activation+2.29e+00

Pos logit contributions
 mat+0.028
 snowball+0.027
 wipes+0.027
 Velvet+0.026
uddle+0.026
Neg logit contributions
chard-0.024
 known-0.022
epad-0.021
ho-0.021
thank-0.020
Token Tim
Token ID5045
Feature activation+6.17e-01

Pos logit contributions
 snowball+0.056
 mat+0.054
 Velvet+0.053
 wipes+0.051
Anna+0.049
Neg logit contributions
chard-0.066
epad-0.061
 known-0.060
ho-0.060
year-0.059
Tokenmy
Token ID1820
Feature activation+1.88e-01

Pos logit contributions
 snowball+0.021
 mat+0.021
 wipes+0.021
 Velvet+0.020
uddle+0.020
Neg logit contributions
chard-0.012
 known-0.010
epad-0.010
year-0.009
ho-0.009
Token said
Token ID531
Feature activation-3.69e-02

Pos logit contributions
 mat+0.006
 snowball+0.006
 wipes+0.006
 ramp+0.006
 mats+0.006
Neg logit contributions
 known-0.003
chard-0.003
 decided-0.003
 full-0.003
 visited-0.003
Token goodbye
Token ID24829
Feature activation-3.03e-01

Pos logit contributions
chard+0.001
 known+0.001
epad+0.001
thank+0.001
 image+0.001
Neg logit contributions
 wipes-0.001
 mat-0.001
 Velvet-0.001
 snowball-0.001
 turns-0.001
Token to
Token ID284
Feature activation-1.45e-01

Pos logit contributions
chard+0.009
 known+0.009
year+0.009
epad+0.009
 image+0.008
Neg logit contributions
 mat-0.007
 wipes-0.006
 snowball-0.006
 Velvet-0.006
uddle-0.006
Token.
Token ID13
Feature activation-2.37e-01

Pos logit contributions
chard+0.017
 known+0.016
ho+0.015
year+0.015
thank+0.015
Neg logit contributions
 mat-0.016
 snowball-0.014
 ramp-0.014
 mats-0.014
 Velvet-0.014
Token He
Token ID679
Feature activation-5.25e-02

Pos logit contributions
chard+0.006
 known+0.006
epad+0.006
ho+0.006
thank+0.005
Neg logit contributions
 snowball-0.006
 mat-0.006
 wipes-0.006
uddle-0.006
 Velvet-0.006
Token called
Token ID1444
Feature activation-4.27e-01

Pos logit contributions
chard+0.001
epad+0.001
 known+0.001
thank+0.001
ho+0.001
Neg logit contributions
 snowball-0.002
 mat-0.002
 wipes-0.002
 tries-0.001
uddle-0.001
Token it
Token ID340
Feature activation-1.17e-01

Pos logit contributions
chard+0.012
 known+0.012
thank+0.012
ho+0.011
 harvested+0.011
Neg logit contributions
 mat-0.011
 snowball-0.010
 wipes-0.010
 ramp-0.010
 Velvet-0.009
Token "
Token ID366
Feature activation-1.67e+00

Pos logit contributions
chard+0.003
 known+0.003
thank+0.003
year+0.003
nam+0.003
Neg logit contributions
 mat-0.003
 snowball-0.003
 Velvet-0.003
 wipes-0.003
 Luna-0.003
Tokenstone
Token ID6440
Feature activation+2.43e+00

Pos logit contributions
chard+0.051
 known+0.048
melon+0.048
 full+0.047
ho+0.046
Neg logit contributions
 snowball-0.038
Anna-0.036
ara-0.034
 mat-0.033
ig-0.032
Token hop
Token ID1725
Feature activation+5.36e-02

Pos logit contributions
 mat+0.053
 snowball+0.052
 wipes+0.051
uddle+0.049
ig+0.048
Neg logit contributions
chard-0.076
 known-0.072
year-0.069
epad-0.069
thank-0.068
Token",
Token ID1600
Feature activation-9.42e-01

Pos logit contributions
 mat+0.001
 snowball+0.001
 wipes+0.001
uddle+0.001
 Velvet+0.001
Neg logit contributions
chard-0.002
 known-0.002
epad-0.002
thank-0.002
year-0.002
Token and
Token ID290
Feature activation-5.40e-01

Pos logit contributions
chard+0.032
 known+0.031
ho+0.030
epad+0.029
year+0.029
Neg logit contributions
 mat-0.018
 wipes-0.018
 snowball-0.018
 Velvet-0.017
 ramp-0.017
Token it
Token ID340
Feature activation-6.68e-01

Pos logit contributions
chard+0.015
 known+0.014
ho+0.014
epad+0.014
 image+0.013
Neg logit contributions
 mat-0.013
 snowball-0.013
 tries-0.013
 wipes-0.013
 turns-0.012
Token was
Token ID373
Feature activation-1.42e-01

Pos logit contributions
chard+0.018
epad+0.016
 known+0.015
year+0.015
ho+0.015
Neg logit contributions
 snowball-0.019
 mat-0.019
 wipes-0.018
 tries-0.018
uddle-0.018
Token?"
Token ID1701
Feature activation-5.50e-03

Pos logit contributions
chard+0.005
years+0.005
 known+0.005
year+0.005
ho+0.005
Neg logit contributions
 snowball-0.003
 mat-0.003
 wipes-0.003
ara-0.003
 ramp-0.003
Token he
Token ID339
Feature activation-5.15e-01

Pos logit contributions
chard+0.000
 known+0.000
epad+0.000
ho+0.000
thank+0.000
Neg logit contributions
 mat-0.000
 snowball-0.000
 wipes-0.000
uddle-0.000
 Velvet-0.000
Token asks
Token ID7893
Feature activation+2.14e-01

Pos logit contributions
 known+0.015
chard+0.015
 decided+0.014
 ruled+0.014
 full+0.013
Neg logit contributions
 mat-0.011
uddle-0.011
 snowball-0.011
ig-0.011
 ramp-0.010
Token.
Token ID13
Feature activation-1.11e+00

Pos logit contributions
 mat+0.006
 snowball+0.006
 wipes+0.006
uddle+0.006
 Velvet+0.006
Neg logit contributions
chard-0.005
 known-0.005
epad-0.004
ho-0.004
thank-0.004
Token "
Token ID366
Feature activation-1.33e+00

Pos logit contributions
chard+0.033
 known+0.031
epad+0.030
thank+0.030
ho+0.030
Neg logit contributions
 mat-0.026
 snowball-0.026
 wipes-0.025
 Velvet-0.024
 ramp-0.023
TokenWho
Token ID8241
Feature activation+2.28e+00

Pos logit contributions
chard+0.038
 known+0.037
epad+0.036
ho+0.035
 full+0.034
Neg logit contributions
 snowball-0.031
Anna-0.030
 mat-0.030
 wipes-0.029
 Velvet-0.028
Token cut
Token ID2005
Feature activation+1.07e+00

Pos logit contributions
 wipes+0.046
 mat+0.046
 snowball+0.044
Anna+0.042
 turns+0.042
Neg logit contributions
chard-0.075
 known-0.070
thank-0.068
ho-0.067
epad-0.067
Token the
Token ID262
Feature activation+1.30e-02

Pos logit contributions
 mat+0.024
 snowball+0.023
 wipes+0.023
 Velvet+0.022
uddle+0.022
Neg logit contributions
chard-0.033
 known-0.032
epad-0.031
thank-0.030
ho-0.030
Token cake
Token ID12187
Feature activation-8.39e-01

Pos logit contributions
 mat+0.000
 snowball+0.000
 wipes+0.000
uddle+0.000
 ramp+0.000
Neg logit contributions
 known-0.000
chard-0.000
year-0.000
epad-0.000
thank-0.000
Token?"
Token ID1701
Feature activation-1.35e+00

Pos logit contributions
 known+0.031
chard+0.029
year+0.028
years+0.028
epad+0.027
Neg logit contributions
 mat-0.016
uddle-0.014
 snowball-0.014
 ramp-0.014
ig-0.013
Token
Token ID198
Feature activation-5.20e-01

Pos logit contributions
chard+0.042
 known+0.040
epad+0.038
thank+0.037
ho+0.037
Neg logit contributions
 mat-0.030
 snowball-0.030
 wipes-0.029
uddle-0.028
 Velvet-0.027
Token story
Token ID1621
Feature activation-4.59e-01

Pos logit contributions
 mat+0.013
 snowball+0.012
 wipes+0.012
 Velvet+0.012
ig+0.012
Neg logit contributions
chard-0.014
 known-0.012
thank-0.012
epad-0.012
ho-0.012
Token is
Token ID318
Feature activation-1.80e-01

Pos logit contributions
chard+0.021
 known+0.019
thank+0.019
epad+0.019
year+0.018
Neg logit contributions
 mat-0.004
 wipes-0.004
uddle-0.004
 Velvet-0.003
 snowball-0.003
Token that
Token ID326
Feature activation+1.56e-01

Pos logit contributions
chard+0.006
epad+0.005
 known+0.005
thank+0.005
year+0.005
Neg logit contributions
 mat-0.004
 wipes-0.004
 Velvet-0.004
uddle-0.004
ig-0.004
Token it
Token ID340
Feature activation-3.77e-01

Pos logit contributions
 mat+0.004
 wipes+0.004
 Velvet+0.003
 snowball+0.003
 tries+0.003
Neg logit contributions
chard-0.005
year-0.004
thank-0.004
 known-0.004
epad-0.004
Token's
Token ID338
Feature activation-8.22e-02

Pos logit contributions
 known+0.012
chard+0.012
epad+0.011
 full+0.011
 decided+0.011
Neg logit contributions
 snowball-0.008
 mat-0.008
 ramp-0.007
uddle-0.007
 wipes-0.007
Token important
Token ID1593
Feature activation+5.01e-01

Pos logit contributions
ho+0.002
chard+0.002
epad+0.002
melon+0.001
thank+0.001
Neg logit contributions
 wipes-0.002
 snowball-0.002
 Velvet-0.002
 mat-0.002
 tries-0.002
Token to
Token ID284
Feature activation+3.65e-01

Pos logit contributions
 snowball+0.015
 mat+0.015
 wipes+0.014
uddle+0.014
 Velvet+0.014
Neg logit contributions
chard-0.011
 known-0.011
epad-0.010
ho-0.010
thank-0.010
Token listen
Token ID6004
Feature activation+2.66e-01

Pos logit contributions
 wipes+0.009
 mat+0.008
 Velvet+0.008
Anna+0.008
 tries+0.008
Neg logit contributions
chard-0.011
year-0.010
ho-0.010
epad-0.009
spe-0.009
Token to
Token ID284
Feature activation+4.64e-01

Pos logit contributions
 mat+0.006
 wipes+0.006
 turns+0.005
 mats+0.005
 Velvet+0.005
Neg logit contributions
chard-0.008
 known-0.008
thank-0.007
ho-0.007
epad-0.007
Token your
Token ID534
Feature activation+1.48e-01

Pos logit contributions
 wipes+0.010
 mat+0.010
 Velvet+0.010
 tries+0.009
 turns+0.009
Neg logit contributions
chard-0.014
ho-0.014
year-0.013
melon-0.013
thank-0.013
Token parents
Token ID3397
Feature activation+6.49e-01

Pos logit contributions
 mat+0.004
 snowball+0.004
 wipes+0.004
uddle+0.004
 Velvet+0.004
Neg logit contributions
chard-0.004
 known-0.004
epad-0.004
year-0.004
thank-0.004
Token
Token ID95
Feature activation+8.39e-01

Pos logit contributions
chard+0.006
thank+0.005
 known+0.005
ho+0.005
year+0.005
Neg logit contributions
 snowball-0.005
 mat-0.005
 wipes-0.005
 Velvet-0.004
uddle-0.004
Token
Token ID26391
Feature activation+1.21e+00

Pos logit contributions
 snowball+0.022
 mat+0.022
 wipes+0.022
uddle+0.021
 Velvet+0.020
Neg logit contributions
chard-0.021
 known-0.021
thank-0.020
epad-0.019
year-0.019
Token
Token ID129
Feature activation-1.00e-01

Pos logit contributions
 mat+0.027
 snowball+0.027
 wipes+0.026
 Velvet+0.024
uddle+0.024
Neg logit contributions
chard-0.036
 known-0.035
epad-0.033
 full-0.033
ho-0.033
Token
Token ID241
Feature activation+1.44e+00

Pos logit contributions
chard+0.003
 known+0.003
epad+0.003
ho+0.003
year+0.003
Neg logit contributions
 mat-0.002
 snowball-0.002
 wipes-0.002
 Velvet-0.002
uddle-0.002
TokenThat
Token ID2504
Feature activation+3.37e-01

Pos logit contributions
 snowball+0.034
 mat+0.033
Anna+0.031
 Velvet+0.031
 wipes+0.031
Neg logit contributions
chard-0.044
ho-0.038
 known-0.038
melon-0.038
epad-0.037
Tokenâ
Token ID22940
Feature activation+1.08e+00

Pos logit contributions
 mat+0.006
 snowball+0.006
 wipes+0.006
uddle+0.005
ig+0.005
Neg logit contributions
chard-0.012
 known-0.011
year-0.011
epad-0.011
thank-0.011
Token
Token ID26391
Feature activation+1.21e+00

Pos logit contributions
 mat+0.026
 snowball+0.026
 wipes+0.025
uddle+0.024
 Velvet+0.024
Neg logit contributions
chard-0.030
 known-0.029
epad-0.027
thank-0.027
ho-0.027
Token
Token ID8151
Feature activation+1.16e+00

Pos logit contributions
 mat+0.027
 snowball+0.027
 wipes+0.027
uddle+0.025
 ramp+0.025
Neg logit contributions
chard-0.036
 known-0.034
epad-0.032
 full-0.032
ho-0.032
Tokens
Token ID82
Feature activation+5.11e-01

Pos logit contributions
 mat+0.024
uddle+0.024
 snowball+0.023
 wipes+0.023
ig+0.023
Neg logit contributions
chard-0.037
 known-0.035
epad-0.034
thank-0.034
 generation-0.033
Token wonderful
Token ID7932
Feature activation+1.05e-01

Pos logit contributions
 wipes+0.010
 mat+0.010
 turns+0.010
 snowball+0.010
 Velvet+0.010
Neg logit contributions
chard-0.017
ho-0.016
epad-0.015
thank-0.015
year-0.015
Token,
Token ID11
Feature activation+9.34e-01

Pos logit contributions
 mat+0.003
 snowball+0.003
 wipes+0.003
uddle+0.003
 Velvet+0.003
Neg logit contributions
chard-0.003
 known-0.003
epad-0.003
ho-0.003
year-0.003
Token it
Token ID340
Feature activation+3.09e-01

Pos logit contributions
chard+0.032
ho+0.031
epad+0.031
 known+0.030
 ruled+0.030
Neg logit contributions
 wipes-0.019
 snowball-0.019
 turns-0.019
 tries-0.017
 mat-0.017
Token a
Token ID257
Feature activation+7.51e-01

Pos logit contributions
 wipes+0.007
 snowball+0.007
 mat+0.007
 turns+0.006
 pill+0.006
Neg logit contributions
chard-0.010
ho-0.009
 known-0.009
epad-0.009
thank-0.008
Token try
Token ID1949
Feature activation-3.53e-01

Pos logit contributions
 mat+0.019
 snowball+0.019
 wipes+0.019
uddle+0.018
 Velvet+0.018
Neg logit contributions
chard-0.020
 known-0.019
epad-0.018
ho-0.018
thank-0.018
Token.
Token ID13
Feature activation-6.28e-01

Pos logit contributions
chard+0.012
years+0.011
year+0.011
nam+0.011
 known+0.011
Neg logit contributions
 mat-0.007
 snowball-0.006
ig-0.006
 wipes-0.006
 ramp-0.006
Token The
Token ID383
Feature activation-9.87e-01

Pos logit contributions
chard+0.020
 known+0.020
epad+0.018
ho+0.018
thank+0.018
Neg logit contributions
 wipes-0.013
 mat-0.013
 snowball-0.013
 ramp-0.012
 turns-0.012
Token black
Token ID2042
Feature activation+5.25e-01

Pos logit contributions
chard+0.027
thank+0.024
melon+0.024
 known+0.023
ho+0.023
Neg logit contributions
 snowball-0.026
 mat-0.025
 wipes-0.025
 tries-0.024
 ramp-0.024
Tokensmith
Token ID21453
Feature activation-1.09e+00

Pos logit contributions
 snowball+0.011
 mat+0.011
 wipes+0.011
 ramp+0.010
ig+0.010
Neg logit contributions
chard-0.017
 known-0.016
epad-0.015
thank-0.015
year-0.015
Token smiled
Token ID13541
Feature activation-1.16e+00

Pos logit contributions
chard+0.033
 known+0.029
epad+0.028
ho+0.028
nam+0.028
Neg logit contributions
 wipes-0.027
 tries-0.026
 snowball-0.025
 turns-0.025
ig-0.024
Token and
Token ID290
Feature activation+9.76e-01

Pos logit contributions
chard+0.044
 known+0.042
epad+0.041
ho+0.040
 generation+0.040
Neg logit contributions
 wipes-0.018
 snowball-0.018
uddle-0.015
 mat-0.015
 Velvet-0.014
Token said
Token ID531
Feature activation-3.78e-01

Pos logit contributions
 wipes+0.023
 snowball+0.022
 mat+0.021
 tries+0.021
 Velvet+0.021
Neg logit contributions
chard-0.029
epad-0.028
 known-0.027
ho-0.026
year-0.026
Token,
Token ID11
Feature activation-6.35e-01

Pos logit contributions
chard+0.012
 known+0.011
epad+0.011
thank+0.011
year+0.010
Neg logit contributions
 mat-0.008
 wipes-0.008
 snowball-0.008
 Velvet-0.007
uddle-0.007
Token Lily
Token ID20037
Feature activation-4.49e-01

Pos logit contributions
 mat+0.003
 snowball+0.003
 wipes+0.003
 Velvet+0.003
 mats+0.003
Neg logit contributions
chard-0.008
 known-0.007
epad-0.007
thank-0.007
ho-0.007
Token were
Token ID547
Feature activation+2.35e-01

Pos logit contributions
chard+0.012
epad+0.011
year+0.011
 known+0.011
 generation+0.010
Neg logit contributions
 wipes-0.012
 mat-0.011
 snowball-0.011
 turns-0.011
ara-0.011
Token very
Token ID845
Feature activation+7.59e-01

Pos logit contributions
 snowball+0.007
 Velvet+0.007
 mat+0.007
 wipes+0.006
 turns+0.006
Neg logit contributions
chard-0.005
ho-0.005
year-0.005
epad-0.005
thank-0.005
Token happy
Token ID3772
Feature activation-6.70e-01

Pos logit contributions
 snowball+0.021
Anna+0.021
 turns+0.020
 mat+0.020
 wipes+0.020
Neg logit contributions
chard-0.018
epad-0.018
thank-0.017
 generation-0.017
ho-0.016
Token.
Token ID13
Feature activation-1.43e-01

Pos logit contributions
chard+0.019
 known+0.018
epad+0.017
year+0.017
thank+0.016
Neg logit contributions
 mat-0.017
 snowball-0.017
 wipes-0.016
 Velvet-0.016
uddle-0.015
Token They
Token ID1119
Feature activation+5.15e-01

Pos logit contributions
chard+0.003
 known+0.003
epad+0.003
ho+0.003
thank+0.003
Neg logit contributions
 mat-0.004
 snowball-0.004
 wipes-0.004
uddle-0.004
 Velvet-0.004
Token loved
Token ID6151
Feature activation+1.09e+00

Pos logit contributions
 wipes+0.013
 Velvet+0.012
 turns+0.012
 mat+0.012
 tries+0.012
Neg logit contributions
chard-0.014
epad-0.014
year-0.014
 image-0.013
 generation-0.013
Token the
Token ID262
Feature activation-2.69e-01

Pos logit contributions
 mat+0.025
 snowball+0.025
 Velvet+0.025
 mats+0.024
 wipes+0.023
Neg logit contributions
chard-0.032
epad-0.030
year-0.030
 known-0.029
thank-0.029
Token ze
Token ID41271
Feature activation-1.06e+00

Pos logit contributions
chard+0.009
year+0.008
epad+0.008
 known+0.008
thank+0.007
Neg logit contributions
 snowball-0.006
 mat-0.006
 Velvet-0.006
 ramp-0.005
 wipes-0.005
Tokenbr
Token ID1671
Feature activation-2.80e-01

Pos logit contributions
chard+0.042
 known+0.040
ho+0.039
epad+0.039
year+0.039
Neg logit contributions
 mat-0.013
 snowball-0.013
 wipes-0.013
uddle-0.011
 ramp-0.011
Tokenas
Token ID292
Feature activation-3.80e-01

Pos logit contributions
chard+0.008
 known+0.008
epad+0.007
ho+0.007
year+0.007
Neg logit contributions
 mat-0.006
 snowball-0.006
 wipes-0.006
 Velvet-0.006
uddle-0.006
Token
Token ID198
Feature activation+2.51e-01

Pos logit contributions
 known+0.004
chard+0.004
 full+0.003
year+0.003
thank+0.003
Neg logit contributions
 mat-0.005
 snowball-0.005
ig-0.005
 wipes-0.005
uddle-0.005
TokenThe
Token ID464
Feature activation-1.10e+00

Pos logit contributions
 mat+0.006
 snowball+0.006
 wipes+0.006
uddle+0.005
 Velvet+0.005
Neg logit contributions
chard-0.007
 known-0.007
epad-0.007
ho-0.007
thank-0.007
Token mouse
Token ID10211
Feature activation+1.02e-01

Pos logit contributions
chard+0.037
 known+0.035
epad+0.034
ho+0.033
thank+0.033
Neg logit contributions
 mat-0.022
 snowball-0.022
 wipes-0.021
 Velvet-0.020
 ramp-0.020
Token looked
Token ID3114
Feature activation-3.11e-01

Pos logit contributions
 wipes+0.002
 snowball+0.002
 tries+0.002
 mat+0.002
ig+0.002
Neg logit contributions
chard-0.003
epad-0.003
 known-0.003
thank-0.003
year-0.003
Token up
Token ID510
Feature activation-4.17e-01

Pos logit contributions
chard+0.007
 known+0.007
thank+0.007
epad+0.007
ho+0.007
Neg logit contributions
 wipes-0.009
 snowball-0.009
 mat-0.009
uddle-0.009
 Velvet-0.008
Token at
Token ID379
Feature activation+3.85e-01

Pos logit contributions
 known+0.014
chard+0.014
epad+0.014
 generation+0.013
spe+0.013
Neg logit contributions
 wipes-0.008
 mat-0.007
 snowball-0.007
ara-0.007
uddle-0.007
Token her
Token ID607
Feature activation-4.58e-01

Pos logit contributions
 Velvet+0.011
 wipes+0.011
 mat+0.011
 snowball+0.011
ara+0.010
Neg logit contributions
chard-0.009
epad-0.009
 known-0.009
year-0.009
thank-0.008
Token and
Token ID290
Feature activation-1.92e-01

Pos logit contributions
chard+0.015
 known+0.014
year+0.014
epad+0.013
thank+0.013
Neg logit contributions
 mat-0.010
 snowball-0.010
 wipes-0.010
uddle-0.009
 Velvet-0.009
Token said
Token ID531
Feature activation-3.89e-01

Pos logit contributions
epad+0.006
chard+0.006
 known+0.006
year+0.005
thank+0.005
Neg logit contributions
 wipes-0.004
 snowball-0.004
 mat-0.004
 Velvet-0.004
 tries-0.004
Token,
Token ID11
Feature activation-7.05e-01

Pos logit contributions
chard+0.012
 known+0.012
epad+0.012
thank+0.012
year+0.011
Neg logit contributions
 wipes-0.008
 mat-0.008
 snowball-0.008
 Velvet-0.007
uddle-0.007
Token
Token ID6184
Feature activation-6.80e-01

Pos logit contributions
 known+0.025
chard+0.024
thank+0.024
epad+0.024
year+0.023
Neg logit contributions
 mat-0.011
 Velvet-0.011
 wipes-0.011
 snowball-0.010
 ramp-0.009
Token fly
Token ID6129
Feature activation+3.53e-01

Pos logit contributions
year+0.021
chard+0.021
epad+0.021
 known+0.020
thank+0.019
Neg logit contributions
 mat-0.017
 snowball-0.017
 ramp-0.016
 wipes-0.016
 Velvet-0.016
Token very
Token ID845
Feature activation+1.75e-02

Pos logit contributions
 snowball+0.010
 mat+0.010
 wipes+0.009
 Velvet+0.009
 ramp+0.009
Neg logit contributions
chard-0.009
 known-0.008
epad-0.008
year-0.008
thank-0.008
Token high
Token ID1029
Feature activation-7.69e-02

Pos logit contributions
 snowball+0.001
 mat+0.001
uddle+0.000
 wipes+0.000
 ramp+0.000
Neg logit contributions
chard-0.000
 known-0.000
epad-0.000
thank-0.000
year-0.000
Token in
Token ID287
Feature activation-1.38e+00

Pos logit contributions
chard+0.002
 known+0.002
epad+0.002
year+0.002
ho+0.002
Neg logit contributions
 snowball-0.002
 mat-0.002
 wipes-0.002
 Velvet-0.002
uddle-0.002
Token the
Token ID262
Feature activation-1.61e-01

Pos logit contributions
chard+0.041
 known+0.039
epad+0.038
year+0.038
ho+0.038
Neg logit contributions
 snowball-0.032
 mat-0.031
 wipes-0.030
 ramp-0.028
 Velvet-0.028
Token sky
Token ID6766
Feature activation+1.03e-01

Pos logit contributions
chard+0.004
 known+0.004
epad+0.004
year+0.004
ho+0.004
Neg logit contributions
 snowball-0.005
 wipes-0.004
 mat-0.004
uddle-0.004
ig-0.004
Token!"
Token ID2474
Feature activation-1.19e+00

Pos logit contributions
 mat+0.002
 snowball+0.002
ig+0.002
 Velvet+0.002
Anna+0.002
Neg logit contributions
chard-0.004
 known-0.004
year-0.003
epad-0.003
ho-0.003
Token
Token ID198
Feature activation+8.56e-02

Pos logit contributions
chard+0.032
 known+0.030
epad+0.028
ho+0.028
thank+0.028
Neg logit contributions
 mat-0.032
 snowball-0.031
 wipes-0.031
uddle-0.029
 Velvet-0.029
Token
Token ID198
Feature activation-1.70e-02

Pos logit contributions
 mat+0.003
 snowball+0.002
 wipes+0.002
 mats+0.002
 ramp+0.002
Neg logit contributions
chard-0.002
 known-0.002
 full-0.002
thank-0.002
ho-0.002
TokenThe
Token ID464
Feature activation-7.10e-01

Pos logit contributions
chard+0.001
 known+0.000
epad+0.000
ho+0.000
year+0.000
Neg logit contributions
 mat-0.000
 snowball-0.000
 wipes-0.000
Anna-0.000
uddle-0.000
Token little
Token ID1310
Feature activation+4.32e-01

Pos logit contributions
chard+0.025
 known+0.024
epad+0.023
 ruled+0.023
ho+0.023
Neg logit contributions
 mat-0.013
 wipes-0.012
ara-0.012
 snowball-0.012
 Velvet-0.011
Token house
Token ID2156
Feature activation+5.72e-01

Pos logit contributions
 snowball+0.011
 mat+0.011
 wipes+0.011
 Velvet+0.010
uddle+0.010
Neg logit contributions
chard-0.011
 known-0.011
epad-0.010
ho-0.010
thank-0.010
Token.
Token ID13
Feature activation-8.73e-02

Pos logit contributions
 mat+0.012
 mats+0.011
 wipes+0.011
 Velvet+0.010
 snowball+0.010
Neg logit contributions
chard-0.018
 known-0.018
ully-0.017
ull-0.017
 image-0.016
Token She
Token ID1375
Feature activation+1.02e+00

Pos logit contributions
chard+0.002
 known+0.002
epad+0.002
ho+0.002
thank+0.002
Neg logit contributions
 mat-0.002
 snowball-0.002
 wipes-0.002
 Velvet-0.002
uddle-0.002
Token knocked
Token ID13642
Feature activation-5.42e-01

Pos logit contributions
 wipes+0.029
 snowball+0.029
 mat+0.029
 Velvet+0.028
uddle+0.027
Neg logit contributions
chard-0.025
epad-0.023
 known-0.023
year-0.022
ho-0.022
Token on
Token ID319
Feature activation+3.31e-01

Pos logit contributions
chard+0.014
 known+0.013
epad+0.013
thank+0.012
 ruled+0.012
Neg logit contributions
 snowball-0.015
 mat-0.015
 wipes-0.015
 Velvet-0.014
Anna-0.014
Token the
Token ID262
Feature activation-3.99e-01

Pos logit contributions
 mat+0.007
 snowball+0.007
 wipes+0.007
 Velvet+0.007
 turns+0.007
Neg logit contributions
chard-0.011
 known-0.010
epad-0.010
thank-0.010
year-0.010
Token door
Token ID3420
Feature activation-3.91e-01

Pos logit contributions
chard+0.011
 known+0.010
epad+0.009
ho+0.009
thank+0.009
Neg logit contributions
 mat-0.010
 snowball-0.010
 wipes-0.010
 Velvet-0.010
uddle-0.010
Token and
Token ID290
Feature activation+6.05e-02

Pos logit contributions
chard+0.013
 known+0.012
thank+0.011
epad+0.011
ho+0.011
Neg logit contributions
 mat-0.009
 wipes-0.008
 snowball-0.008
uddle-0.007
 Velvet-0.007
Token said
Token ID531
Feature activation+1.80e-01

Pos logit contributions
 wipes+0.002
 mat+0.002
 snowball+0.002
 Velvet+0.002
 mats+0.002
Neg logit contributions
chard-0.001
 known-0.001
epad-0.001
ho-0.001
thank-0.001
Token,
Token ID11
Feature activation-5.84e-01

Pos logit contributions
 mat+0.004
 wipes+0.004
 snowball+0.004
 Velvet+0.004
uddle+0.004
Neg logit contributions
chard-0.006
 known-0.005
epad-0.005
thank-0.005
year-0.005
Token "
Token ID366
Feature activation+1.40e-01

Pos logit contributions
chard+0.017
 known+0.017
epad+0.016
thank+0.016
year+0.016
Neg logit contributions
 mat-0.013
 wipes-0.013
 Velvet-0.012
 snowball-0.012
 ramp-0.011
Token of
Token ID286
Feature activation-3.69e-01

Pos logit contributions
 mat+0.016
 snowball+0.015
 wipes+0.015
 Velvet+0.015
 ramp+0.014
Neg logit contributions
chard-0.012
 known-0.011
 full-0.011
year-0.010
epad-0.010
Token other
Token ID584
Feature activation-4.92e-01

Pos logit contributions
chard+0.011
 known+0.011
ho+0.010
epad+0.010
thank+0.010
Neg logit contributions
 snowball-0.009
 mat-0.009
 wipes-0.009
 mats-0.008
uddle-0.008
Token rabbits
Token ID33043
Feature activation-3.47e-01

Pos logit contributions
chard+0.014
 known+0.014
epad+0.013
 full+0.013
thank+0.013
Neg logit contributions
 mat-0.011
 snowball-0.011
 wipes-0.011
 Velvet-0.011
 ramp-0.010
Token inside
Token ID2641
Feature activation+2.07e-01

Pos logit contributions
chard+0.010
 known+0.009
year+0.009
thank+0.009
epad+0.008
Neg logit contributions
 mat-0.009
 snowball-0.009
 wipes-0.008
 ramp-0.008
uddle-0.008
Token,
Token ID11
Feature activation+9.19e-01

Pos logit contributions
 mat+0.004
 snowball+0.003
 wipes+0.003
 gas+0.003
 mats+0.003
Neg logit contributions
chard-0.007
 known-0.007
thank-0.007
 messenger-0.007
working-0.006
Token all
Token ID477
Feature activation+5.29e-04

Pos logit contributions
 snowball+0.022
 mat+0.021
 mats+0.021
 wipes+0.020
 Velvet+0.020
Neg logit contributions
 known-0.027
year-0.027
epad-0.026
chard-0.026
ho-0.026
Token hopping
Token ID47153
Feature activation+4.98e-01

Pos logit contributions
 snowball+0.000
 mat+0.000
 wipes+0.000
 mats+0.000
 turns+0.000
Neg logit contributions
chard-0.000
 known-0.000
thank-0.000
epad-0.000
ho-0.000
Token around
Token ID1088
Feature activation+3.36e-01

Pos logit contributions
 mat+0.014
 snowball+0.014
 mats+0.013
 wipes+0.013
uddle+0.013
Neg logit contributions
 known-0.013
chard-0.013
epad-0.012
thank-0.011
year-0.011
Token and
Token ID290
Feature activation+1.55e-01

Pos logit contributions
 mat+0.007
 snowball+0.007
 mats+0.006
 wipes+0.006
 turns+0.006
Neg logit contributions
chard-0.011
 known-0.010
year-0.010
thank-0.010
epad-0.010
Token having
Token ID1719
Feature activation-8.62e-01

Pos logit contributions
 mat+0.004
 mats+0.004
 snowball+0.004
 Velvet+0.004
 turns+0.004
Neg logit contributions
year-0.004
abby-0.004
 known-0.004
ully-0.004
thank-0.004
Token fun
Token ID1257
Feature activation+1.93e-01

Pos logit contributions
chard+0.023
 known+0.021
ho+0.021
epad+0.020
thank+0.020
Neg logit contributions
 snowball-0.023
 mat-0.023
 wipes-0.022
 Velvet-0.021
 mats-0.021
Tokencoon
Token ID20912
Feature activation+5.92e-01

Pos logit contributions
 mat+0.016
 snowball+0.016
 wipes+0.015
uddle+0.014
 ramp+0.014
Neg logit contributions
chard-0.024
 known-0.023
epad-0.022
ho-0.022
thank-0.021
Token!"
Token ID2474
Feature activation-6.22e-01

Pos logit contributions
 mat+0.015
 snowball+0.015
 wipes+0.014
 Velvet+0.013
 ramp+0.013
Neg logit contributions
chard-0.017
 known-0.016
epad-0.015
year-0.015
thank-0.015
Token said
Token ID531
Feature activation+6.04e-03

Pos logit contributions
chard+0.020
 known+0.019
epad+0.019
year+0.018
ho+0.018
Neg logit contributions
 mat-0.013
 snowball-0.012
 wipes-0.012
 Velvet-0.012
Anna-0.011
Token Po
Token ID7695
Feature activation+1.12e+00

Pos logit contributions
 Velvet+0.000
 snowball+0.000
 mat+0.000
 wipes+0.000
Anna+0.000
Neg logit contributions
chard-0.000
epad-0.000
 known-0.000
thank-0.000
year-0.000
Tokenppy
Token ID14097
Feature activation-7.95e-02

Pos logit contributions
 wipes+0.019
 mat+0.019
 Velvet+0.019
 snowball+0.019
ig+0.018
Neg logit contributions
chard-0.040
 known-0.038
year-0.036
thank-0.035
nam-0.035
Token with
Token ID351
Feature activation-9.86e-01

Pos logit contributions
chard+0.003
 known+0.003
year+0.003
nam+0.003
 generation+0.002
Neg logit contributions
 wipes-0.001
 mat-0.001
 snowball-0.001
 Velvet-0.001
 tries-0.001
Token a
Token ID257
Feature activation+4.63e-01

Pos logit contributions
chard+0.028
epad+0.026
thank+0.026
ho+0.025
year+0.025
Neg logit contributions
 snowball-0.025
 wipes-0.025
 mat-0.024
 Velvet-0.023
 tries-0.023
Token gasp
Token ID47440
Feature activation-9.56e-01

Pos logit contributions
 snowball+0.014
 mat+0.013
 wipes+0.013
 ramp+0.012
 Velvet+0.012
Neg logit contributions
chard-0.012
 known-0.010
year-0.010
thank-0.010
epad-0.010
Token.
Token ID13
Feature activation-3.69e-01

Pos logit contributions
chard+0.035
 known+0.035
nam+0.031
working+0.030
epad+0.030
Neg logit contributions
 snowball-0.018
 wipes-0.017
 mat-0.016
 minutes-0.015
ig-0.015
Token
Token ID198
Feature activation-1.27e-01

Pos logit contributions
chard+0.011
 known+0.011
epad+0.010
thank+0.010
year+0.010
Neg logit contributions
 mat-0.008
 snowball-0.008
 wipes-0.008
 Velvet-0.008
uddle-0.008
Token
Token ID198
Feature activation-1.14e-01

Pos logit contributions
chard+0.003
 known+0.003
year+0.003
ho+0.003
thank+0.003
Neg logit contributions
 mat-0.003
 snowball-0.003
 wipes-0.003
 ramp-0.003
uddle-0.003
Token great
Token ID1049
Feature activation-7.31e-01

Pos logit contributions
year+0.007
 known+0.007
enna+0.006
chard+0.006
epad+0.006
Neg logit contributions
 mat-0.005
 gas-0.005
 snowball-0.005
uddle-0.005
 turns-0.005
Token big
Token ID1263
Feature activation+2.48e-01

Pos logit contributions
chard+0.020
 known+0.020
year+0.018
ho+0.018
epad+0.018
Neg logit contributions
 mat-0.018
 snowball-0.018
 wipes-0.018
uddle-0.017
 ramp-0.017
Token Fer
Token ID12880
Feature activation-1.37e+00

Pos logit contributions
 mat+0.006
 snowball+0.006
 wipes+0.006
uddle+0.006
 ramp+0.005
Neg logit contributions
 known-0.007
chard-0.007
year-0.007
epad-0.007
thank-0.006
Tokenris
Token ID2442
Feature activation+6.66e-01

Pos logit contributions
chard+0.039
 known+0.037
epad+0.035
ho+0.034
thank+0.034
Neg logit contributions
 mat-0.033
 snowball-0.033
 wipes-0.031
uddle-0.031
 ramp-0.031
Token wheel
Token ID7825
Feature activation-3.90e-01

Pos logit contributions
 mat+0.014
 snowball+0.014
 wipes+0.013
 ramp+0.013
uddle+0.013
Neg logit contributions
chard-0.021
 known-0.020
epad-0.019
year-0.019
thank-0.019
Token and
Token ID290
Feature activation-3.37e-02

Pos logit contributions
 known+0.013
chard+0.013
year+0.012
years+0.011
 harvested+0.011
Neg logit contributions
 mat-0.008
 ramp-0.008
ig-0.007
uddle-0.007
 snowball-0.007
Token all
Token ID477
Feature activation-5.50e-01

Pos logit contributions
chard+0.001
 known+0.001
epad+0.001
ho+0.001
year+0.001
Neg logit contributions
 wipes-0.001
 mat-0.001
 turns-0.001
 mats-0.001
 tries-0.001
Token the
Token ID262
Feature activation+3.83e-01

Pos logit contributions
chard+0.015
 known+0.014
epad+0.014
ho+0.014
 generation+0.014
Neg logit contributions
 mat-0.013
 wipes-0.013
 snowball-0.013
 tries-0.013
 turns-0.013
Token other
Token ID584
Feature activation-7.94e-01

Pos logit contributions
 mat+0.009
 snowball+0.009
 wipes+0.009
 mats+0.008
 ramp+0.008
Neg logit contributions
chard-0.011
 known-0.011
ho-0.010
year-0.010
epad-0.010
Token fun
Token ID1257
Feature activation-1.54e-01

Pos logit contributions
chard+0.025
 known+0.024
epad+0.023
ho+0.022
thank+0.022
Neg logit contributions
 mat-0.017
 snowball-0.017
 wipes-0.016
 Velvet-0.016
uddle-0.016
Token rides
Token ID17445
Feature activation-4.97e-02

Pos logit contributions
chard+0.005
 known+0.004
ho+0.004
year+0.004
thank+0.004
Neg logit contributions
 mat-0.004
 wipes-0.003
 snowball-0.003
 mats-0.003
uddle-0.003
Token idea
Token ID2126
Feature activation-1.09e+00

Pos logit contributions
 snowball+0.005
 mat+0.005
 wipes+0.005
uddle+0.004
 ramp+0.004
Neg logit contributions
chard-0.004
 known-0.003
ho-0.003
epad-0.003
thank-0.003
Token.
Token ID13
Feature activation-5.33e-01

Pos logit contributions
chard+0.022
 known+0.020
epad+0.019
ho+0.019
year+0.019
Neg logit contributions
 mat-0.035
 snowball-0.035
 wipes-0.034
uddle-0.034
 Velvet-0.033
Token He
Token ID679
Feature activation-3.51e-01

Pos logit contributions
chard+0.013
 known+0.012
epad+0.011
thank+0.011
ho+0.011
Neg logit contributions
 mat-0.015
 snowball-0.015
 wipes-0.015
 Velvet-0.014
uddle-0.014
Token remembered
Token ID12086
Feature activation-5.91e-01

Pos logit contributions
chard+0.007
 known+0.006
epad+0.006
thank+0.006
ho+0.006
Neg logit contributions
 snowball-0.012
 wipes-0.011
 mat-0.011
 tries-0.011
uddle-0.011
Token that
Token ID326
Feature activation-7.19e-01

Pos logit contributions
chard+0.018
thank+0.017
ho+0.017
epad+0.017
year+0.017
Neg logit contributions
 wipes-0.013
 snowball-0.012
 mat-0.012
 Velvet-0.011
 tries-0.011
Token his
Token ID465
Feature activation-1.60e+00

Pos logit contributions
chard+0.021
ho+0.020
year+0.019
thank+0.019
abby+0.019
Neg logit contributions
 snowball-0.017
 wipes-0.017
 mat-0.016
 Velvet-0.016
 ramp-0.016
Token dad
Token ID9955
Feature activation-2.38e-01

Pos logit contributions
chard+0.047
year+0.046
 known+0.043
epad+0.043
ho+0.041
Neg logit contributions
 mat-0.038
 wipes-0.038
 snowball-0.037
uddle-0.036
ig-0.035
Token had
Token ID550
Feature activation-6.87e-01

Pos logit contributions
chard+0.006
epad+0.006
ho+0.005
year+0.005
 known+0.005
Neg logit contributions
 wipes-0.006
 tries-0.006
 snowball-0.006
 turns-0.006
ig-0.006
Token taught
Token ID7817
Feature activation+2.49e-01

Pos logit contributions
chard+0.019
ho+0.017
year+0.017
epad+0.015
melon+0.015
Neg logit contributions
 wipes-0.018
 tries-0.018
 gas-0.017
 turns-0.017
 stitches-0.017
Token him
Token ID683
Feature activation-1.90e-01

Pos logit contributions
 snowball+0.007
 turns+0.006
 wipes+0.006
 stitches+0.006
 Velvet+0.006
Neg logit contributions
chard-0.007
 known-0.006
epad-0.006
ho-0.006
thank-0.006
Token how
Token ID703
Feature activation-3.37e-02

Pos logit contributions
chard+0.006
ho+0.005
 known+0.005
thank+0.005
epad+0.005
Neg logit contributions
 mat-0.004
 wipes-0.004
 snowball-0.004
 mats-0.004
 Velvet-0.004
Token the
Token ID262
Feature activation-1.51e-01

Pos logit contributions
 mat+0.017
 wipes+0.017
 snowball+0.017
uddle+0.017
ig+0.016
Neg logit contributions
chard-0.021
 known-0.019
 image-0.019
epad-0.019
ho-0.019
Token red
Token ID2266
Feature activation+4.74e-01

Pos logit contributions
chard+0.005
 known+0.005
epad+0.005
year+0.005
thank+0.005
Neg logit contributions
 mat-0.003
 snowball-0.003
 wipes-0.003
uddle-0.003
 ramp-0.003
Token car
Token ID1097
Feature activation-6.33e-02

Pos logit contributions
 mat+0.009
 snowball+0.009
 wipes+0.009
uddle+0.008
 Velvet+0.008
Neg logit contributions
chard-0.016
 known-0.015
epad-0.014
ho-0.014
thank-0.014
Token.
Token ID13
Feature activation+6.57e-01

Pos logit contributions
 known+0.002
chard+0.002
year+0.002
thank+0.002
epad+0.002
Neg logit contributions
 mat-0.001
 snowball-0.001
 ramp-0.001
uddle-0.001
ig-0.001
Token Where
Token ID6350
Feature activation+1.49e+00

Pos logit contributions
 mat+0.016
 snowball+0.016
uddle+0.015
ig+0.015
 wipes+0.015
Neg logit contributions
chard-0.019
 known-0.017
ho-0.017
epad-0.017
 full-0.017
Token is
Token ID318
Feature activation-1.84e+00

Pos logit contributions
 snowball+0.038
 mat+0.038
 wipes+0.036
 Velvet+0.036
 ramp+0.035
Neg logit contributions
chard-0.039
 known-0.038
 full-0.036
ho-0.035
epad-0.035
Token it
Token ID340
Feature activation-6.93e-01

Pos logit contributions
chard+0.059
 known+0.055
epad+0.052
ho+0.052
 full+0.051
Neg logit contributions
 mat-0.039
 snowball-0.039
 wipes-0.038
uddle-0.037
 ramp-0.035
Token?"
Token ID1701
Feature activation-9.79e-01

Pos logit contributions
chard+0.020
year+0.019
 known+0.018
epad+0.018
ho+0.018
Neg logit contributions
 snowball-0.017
 mat-0.017
 wipes-0.016
uddle-0.016
 ramp-0.016
Token
Token ID198
Feature activation-2.02e-01

Pos logit contributions
chard+0.028
 known+0.026
ho+0.024
thank+0.024
 full+0.024
Neg logit contributions
 mat-0.024
 snowball-0.024
 wipes-0.023
ig-0.023
uddle-0.023
Token
Token ID198
Feature activation+8.03e-02

Pos logit contributions
 known+0.004
chard+0.004
ull+0.004
 full+0.004
year+0.004
Neg logit contributions
 mat-0.006
ig-0.006
 snowball-0.006
 wipes-0.006
 mats-0.006
TokenThey
Token ID2990
Feature activation+9.62e-02

Pos logit contributions
 mat+0.002
 wipes+0.002
 snowball+0.002
uddle+0.002
ig+0.002
Neg logit contributions
chard-0.002
 known-0.002
year-0.002
thank-0.002
ho-0.002
Token told
Token ID1297
Feature activation-1.55e-01

Pos logit contributions
 wipes+0.015
 mat+0.014
 snowball+0.013
 Velvet+0.013
 turns+0.013
Neg logit contributions
chard-0.016
epad-0.015
ho-0.015
year-0.014
melon-0.014
Token her
Token ID607
Feature activation+7.04e-01

Pos logit contributions
chard+0.004
 known+0.004
epad+0.003
ho+0.003
thank+0.003
Neg logit contributions
 wipes-0.004
 mat-0.004
 Velvet-0.004
 snowball-0.004
 mats-0.004
Token that
Token ID326
Feature activation-4.99e-01

Pos logit contributions
 mat+0.017
 wipes+0.016
 Velvet+0.015
 snowball+0.015
 mats+0.015
Neg logit contributions
chard-0.021
thank-0.020
 known-0.019
ho-0.019
melon-0.018
Token if
Token ID611
Feature activation-1.69e-01

Pos logit contributions
chard+0.015
abby+0.015
thank+0.014
year+0.014
ully+0.014
Neg logit contributions
 mat-0.010
 wipes-0.010
 Velvet-0.010
 pill-0.010
 stitches-0.010
Token she
Token ID673
Feature activation-7.98e-01

Pos logit contributions
chard+0.005
epad+0.004
year+0.004
thank+0.004
 known+0.004
Neg logit contributions
 wipes-0.004
 mat-0.004
 Velvet-0.004
 snowball-0.004
 mats-0.004
Token ever
Token ID1683
Feature activation-1.58e+00

Pos logit contributions
chard+0.019
 known+0.016
epad+0.016
year+0.015
melon+0.015
Neg logit contributions
 wipes-0.024
 mat-0.023
 snowball-0.023
 tries-0.023
 turns-0.023
Token needed
Token ID2622
Feature activation-2.91e-02

Pos logit contributions
chard+0.039
year+0.034
 known+0.033
epad+0.033
ho+0.032
Neg logit contributions
 wipes-0.046
 tries-0.044
 turns-0.044
 snowball-0.042
 mat-0.042
Token somebody
Token ID8276
Feature activation+3.53e-01

Pos logit contributions
chard+0.001
 known+0.001
ho+0.001
year+0.001
epad+0.001
Neg logit contributions
 mat-0.001
 wipes-0.001
 snowball-0.001
 Velvet-0.001
 mats-0.001
Token to
Token ID284
Feature activation+1.93e-01

Pos logit contributions
 mat+0.009
 wipes+0.009
 snowball+0.008
 Velvet+0.008
uddle+0.008
Neg logit contributions
chard-0.010
 known-0.009
year-0.009
epad-0.009
ho-0.009
Token talk
Token ID1561
Feature activation-8.01e-01

Pos logit contributions
 wipes+0.004
 mat+0.004
 nap+0.004
 gas+0.003
 stitches+0.003
Neg logit contributions
year-0.007
chard-0.006
years-0.006
spe-0.006
ho-0.006
Token to
Token ID284
Feature activation-2.76e-01

Pos logit contributions
chard+0.025
 known+0.024
ho+0.023
year+0.023
melon+0.022
Neg logit contributions
 mat-0.017
 wipes-0.017
 Velvet-0.015
Anna-0.015
 turns-0.015
Token red
Token ID2266
Feature activation+7.97e-01

Pos logit contributions
 snowball+0.004
 mat+0.004
 wipes+0.004
uddle+0.004
 Velvet+0.004
Neg logit contributions
chard-0.004
 known-0.004
epad-0.004
year-0.004
thank-0.004
Token toy
Token ID13373
Feature activation+1.51e-01

Pos logit contributions
 mat+0.015
 snowball+0.015
 wipes+0.014
uddle+0.014
ig+0.013
Neg logit contributions
 known-0.030
year-0.028
chard-0.028
epad-0.027
thank-0.026
Token car
Token ID1097
Feature activation-9.78e-01

Pos logit contributions
 mat+0.003
 snowball+0.003
 Velvet+0.003
 wipes+0.003
uddle+0.003
Neg logit contributions
 known-0.005
chard-0.005
epad-0.005
year-0.005
thank-0.004
Token.
Token ID13
Feature activation-3.29e-01

Pos logit contributions
 known+0.035
chard+0.035
epad+0.034
year+0.033
abby+0.033
Neg logit contributions
 mat-0.016
 Velvet-0.016
 snowball-0.015
 gas-0.015
 ramp-0.014
Token She
Token ID1375
Feature activation-9.88e-02

Pos logit contributions
chard+0.008
 known+0.007
epad+0.007
ho+0.007
thank+0.006
Neg logit contributions
 mat-0.010
 snowball-0.010
 wipes-0.010
uddle-0.009
 Velvet-0.009
Token decided
Token ID3066
Feature activation-1.73e+00

Pos logit contributions
chard+0.001
 known+0.001
epad+0.001
ho+0.001
thank+0.001
Neg logit contributions
 mat-0.004
 snowball-0.004
 wipes-0.004
 Velvet-0.004
uddle-0.004
Token to
Token ID284
Feature activation-1.90e-02

Pos logit contributions
chard+0.047
 known+0.046
year+0.045
epad+0.044
thank+0.043
Neg logit contributions
 mat-0.042
 wipes-0.041
 snowball-0.040
 Velvet-0.040
 mats-0.038
Token push
Token ID4574
Feature activation-3.21e-01

Pos logit contributions
chard+0.001
 known+0.001
year+0.000
ho+0.000
epad+0.000
Neg logit contributions
 mat-0.000
 snowball-0.000
 wipes-0.000
 ramp-0.000
 turns-0.000
Token it
Token ID340
Feature activation-3.23e-01

Pos logit contributions
chard+0.009
 known+0.008
epad+0.008
ho+0.008
year+0.008
Neg logit contributions
 mat-0.008
 snowball-0.008
 wipes-0.008
uddle-0.008
 ramp-0.007
Token around
Token ID1088
Feature activation+3.70e-01

Pos logit contributions
chard+0.008
 known+0.008
year+0.007
thank+0.007
epad+0.007
Neg logit contributions
 mat-0.009
 snowball-0.009
 ramp-0.009
 Velvet-0.008
 mats-0.008
Token the
Token ID262
Feature activation-2.45e-01

Pos logit contributions
 mat+0.009
 snowball+0.008
 Velvet+0.008
 mats+0.008
 wipes+0.008
Neg logit contributions
chard-0.011
 known-0.011
year-0.010
epad-0.010
ho-0.010
Token house
Token ID2156
Feature activation+1.66e-02

Pos logit contributions
 wipes+0.030
 mat+0.029
 snowball+0.029
 Velvet+0.028
 ramp+0.027
Neg logit contributions
chard-0.020
ho-0.017
 full-0.017
 known-0.017
 image-0.016
Token,
Token ID11
Feature activation-3.50e-01

Pos logit contributions
 mat+0.000
 wipes+0.000
ig+0.000
 snowball+0.000
ara+0.000
Neg logit contributions
chard-0.001
epad-0.001
 known-0.001
year-0.001
thank-0.001
Token there
Token ID612
Feature activation+6.38e-01

Pos logit contributions
chard+0.010
 known+0.010
epad+0.010
thank+0.010
ho+0.009
Neg logit contributions
 snowball-0.008
 mat-0.008
 wipes-0.008
 Velvet-0.007
Anna-0.007
Token lived
Token ID5615
Feature activation-1.58e-01

Pos logit contributions
 mat+0.017
 snowball+0.017
 wipes+0.017
 Velvet+0.016
ara+0.016
Neg logit contributions
chard-0.017
epad-0.015
 known-0.015
year-0.015
thank-0.014
Token a
Token ID257
Feature activation+1.54e-01

Pos logit contributions
chard+0.004
 known+0.004
ho+0.004
epad+0.004
thank+0.003
Neg logit contributions
 mat-0.004
 snowball-0.004
 wipes-0.004
 Velvet-0.004
uddle-0.004
Token happy
Token ID3772
Feature activation-1.92e+00

Pos logit contributions
 mat+0.004
 snowball+0.004
 wipes+0.004
uddle+0.004
 Velvet+0.004
Neg logit contributions
chard-0.004
 known-0.004
epad-0.004
ho-0.004
thank-0.004
Token bird
Token ID6512
Feature activation-8.54e-01

Pos logit contributions
chard+0.068
 known+0.066
epad+0.062
year+0.061
years+0.059
Neg logit contributions
 snowball-0.038
 mat-0.038
 wipes-0.034
ara-0.034
uddle-0.033
Token.
Token ID13
Feature activation+1.02e-01

Pos logit contributions
chard+0.023
epad+0.023
 known+0.022
year+0.021
nam+0.020
Neg logit contributions
 snowball-0.022
 mat-0.022
ig-0.021
 Velvet-0.021
 wipes-0.020
Token The
Token ID383
Feature activation-9.73e-01

Pos logit contributions
 snowball+0.002
 mat+0.002
 Velvet+0.002
 wipes+0.002
 turns+0.002
Neg logit contributions
chard-0.003
 known-0.003
ho-0.003
epad-0.003
year-0.003
Token bird
Token ID6512
Feature activation-6.22e-01

Pos logit contributions
chard+0.033
epad+0.031
 known+0.031
year+0.031
ho+0.029
Neg logit contributions
 snowball-0.021
 mat-0.020
 wipes-0.019
uddle-0.019
 ramp-0.017
Token loved
Token ID6151
Feature activation+7.74e-01

Pos logit contributions
chard+0.015
epad+0.014
 known+0.013
year+0.013
thank+0.012
Neg logit contributions
 wipes-0.018
 snowball-0.018
ig-0.018
 mat-0.018
Anna-0.017