TOP ACTIVATIONS
MAX = 0.073

play at the creek . <|endoftext|> L ily loved to read
Look <|endoftext|> One day , John and
have a bad ending . <|endoftext|> Le o was a little
have at the park . <|endoftext|> L ily and Ben were
trouble . The end . <|endoftext|> Ben and Lily were going
friends . The end . <|endoftext|> Tom and Mia like to
They are lucky . <|endoftext|> L ily and Ben were
bee is their friend . <|endoftext|> Tom and Sam are brothers
they made a new friend <|endoftext|> Tim and Jen were playing
part of the nation . <|endoftext|> Ben and Sam are brothers
The end . <|endoftext|> Tom is a cat who
I have caught lots of <|endoftext|> Once upon a time ,
bird to be okay . <|endoftext|> L ily and Ben were
is a useful thing . <|endoftext|> One day , a little
. They are happy . <|endoftext|> S ara and Ben are
. It is art . <|endoftext|> Ben and Mia were playing
twins . The end . <|endoftext|> L ila and Ben were
her about their day . <|endoftext|> S ara liked to play
. They are happy . <|endoftext|> Anna and Ben are friends
was just in the other <|endoftext|> Once upon a time ,

MIN ACTIVATIONS
MIN = -0.075

make neck l aces and brace lets with them .
. They make rings and brace lets and neck l aces
beautiful neck l aces and brace lets . You sorted the
neck l aces , grape brace lets , and grape crown
shapes . They can make towers , bridges , houses and
They can make towers , bridges , houses and more with
blocks . They make their towers very tall and strong .
, neck l aces , brace lets and ear rings .
sizes . They can make towers , houses , trains and
They can make houses , towers , cars and animals with
shapes . They could make towers , bridges , houses ,
kinds of ear rings and brace lets , and he really
him . But to no avail , they were too scared
They could make towers , bridges , houses , and anything
blocks . They could build towers , bridges , and houses
her wand and say " ab rac ad ab ra "
they both had their own brace lets and they were both
They could make houses , towers , cars and animals with
They can make houses , towers , cars , and animals
your neck l aces and brace lets ?" Mom asks .

INTERVAL 0.036 - 0.073
CONTAINS 0.765%

time in the garden . <|endoftext|> Anna and Lily were sisters
alone for too long . <|endoftext|> Once upon a time ,
lived happily ever after . <|endoftext|> Once , there was a
on another adventure soon !" <|endoftext|> M olly was a 3
speak and play together . <|endoftext|> Once upon a time ,

INTERVAL -0.001 - 0.036
CONTAINS 52.596%

stood up for herself . The boy started to
to play a joke on Mum . â Jake liked
for their mistake . From then on , they made sure
around . Paul was so happy that he asked his mom
girl couldn 't get away . In the end , she

INTERVAL -0.038 - -0.001
CONTAINS 46.165%

black spider . It had eight legs and many eyes .
louder and louder , so the little girl decided to turn
say sorry to the ants and leave them alone .
. But then , a tw inkle of light appeared and
and Lily played with the ball . They laughed and had

INTERVAL -0.075 - -0.038
CONTAINS 0.474%

thoughtful . From that day on , Lily became known as
to play in ," thought Spe edy . So , he
, very much . <|endoftext|> Once upon a time , there
winning a race . <|endoftext|> Once upon a time , there
forgot it . <|endoftext|> Once there was a cow . She
Token play
Token ID711
Feature activation-3.31e-04

Pos logit contributions
lets+0.022
rings+0.020
packs+0.019
adays+0.018
angles+0.017
Neg logit contributions
 boiling-0.020
 stew-0.018
 microwave-0.018
 pot-0.018
 bulldo-0.018
Token at
Token ID379
Feature activation-4.04e-03

Pos logit contributions
lets+0.000
 bracelet+0.000
rings+0.000
 wrist+0.000
 brace+0.000
Neg logit contributions
us-0.001
 boiling-0.001
 microwave-0.001
washer-0.001
 bulldo-0.001
Token the
Token ID262
Feature activation+9.83e-03

Pos logit contributions
lets+0.006
rings+0.005
 styles+0.004
 bracelet+0.004
 wrist+0.004
Neg logit contributions
 boiling-0.008
us-0.008
 microwave-0.008
 Farmer-0.007
 bulldo-0.007
Token creek
Token ID39459
Feature activation-4.07e-03

Pos logit contributions
us+0.020
 boiling+0.019
 microwave+0.018
washer+0.017
 grizz+0.017
Neg logit contributions
lets-0.014
 bracelet-0.013
 own-0.012
rings-0.012
 brace-0.012
Token.
Token ID13
Feature activation-1.52e-02

Pos logit contributions
lets+0.006
rings+0.005
 bracelet+0.005
 styles+0.005
 wrist+0.005
Neg logit contributions
 boiling-0.008
us-0.008
 microwave-0.008
washer-0.007
 bulldo-0.007
Token<|endoftext|>
Token ID50256
Feature activation+7.30e-02

Pos logit contributions
lets+0.025
rings+0.022
 bracelet+0.022
 brace+0.021
 styles+0.021
Neg logit contributions
 microwave-0.026
 Farmer-0.026
us-0.026
 boiling-0.026
 bulldo-0.024
TokenL
Token ID43
Feature activation-1.51e-03

Pos logit contributions
 boiling+0.124
 microwave+0.123
 hamb+0.117
us+0.115
 grizz+0.115
Neg logit contributions
lets-0.115
packs-0.106
 tops-0.101
 every-0.100
rings-0.099
Tokenily
Token ID813
Feature activation-7.96e-03

Pos logit contributions
lets+0.003
rings+0.002
 wrist+0.002
 bracelet+0.002
 styles+0.002
Neg logit contributions
us-0.003
 boiling-0.003
 microwave-0.003
washer-0.003
 bulldo-0.002
Token loved
Token ID6151
Feature activation+4.10e-02

Pos logit contributions
lets+0.010
rings+0.009
 wrist+0.008
 bracelet+0.008
 styles+0.008
Neg logit contributions
 boiling-0.017
us-0.017
 microwave-0.017
 bulldo-0.016
washer-0.016
Token to
Token ID284
Feature activation-1.20e-02

Pos logit contributions
us+0.086
 microwave+0.082
 boiling+0.081
 grizz+0.079
 goblin+0.078
Neg logit contributions
lets-0.055
rings-0.049
 bracelet-0.048
 brace-0.047
 styles-0.046
Token read
Token ID1100
Feature activation-1.19e-02

Pos logit contributions
lets+0.020
rings+0.016
bows+0.016
angles+0.016
herry+0.016
Neg logit contributions
 boiling-0.021
 microwave-0.020
 Farmer-0.019
 bulldo-0.019
 stew-0.019
Token
Token ID95
Feature activation-4.08e-03

Pos logit contributions
lets+0.009
rings+0.008
 wrist+0.008
 bracelet+0.008
 styles+0.007
Neg logit contributions
 boiling-0.011
us-0.011
 microwave-0.010
washer-0.010
 Farmer-0.009
Token
Token ID26391
Feature activation-1.37e-02

Pos logit contributions
lets+0.006
 wrist+0.005
rings+0.005
 cherish+0.005
 styles+0.004
Neg logit contributions
 boiling-0.008
us-0.008
 microwave-0.008
washer-0.008
 hamb-0.007
Token
Token ID129
Feature activation-9.67e-03

Pos logit contributions
lets+0.022
rings+0.019
 styles+0.019
 bracelet+0.019
 brace+0.018
Neg logit contributions
us-0.026
 boiling-0.025
 microwave-0.024
washer-0.023
 bulldo-0.023
Token
Token ID241
Feature activation-7.72e-03

Pos logit contributions
lets+0.015
rings+0.013
 bracelet+0.012
packs+0.012
 styles+0.012
Neg logit contributions
 boiling-0.019
us-0.019
 microwave-0.018
 bulldo-0.017
 Farmer-0.017
TokenLook
Token ID8567
Feature activation-3.57e-02

Pos logit contributions
lets+0.013
angles+0.011
rings+0.011
 brace+0.011
 bracelet+0.011
Neg logit contributions
us-0.014
 boiling-0.013
 microwave-0.013
 Farmer-0.012
 hamb-0.012
Token<|endoftext|>
Token ID50256
Feature activation+7.29e-02

Pos logit contributions
lets+0.058
packs+0.054
 languages+0.054
bows+0.053
 styles+0.053
Neg logit contributions
us-0.067
 boiling-0.058
washer-0.055
 Doctor-0.053
 microwave-0.053
TokenOne
Token ID3198
Feature activation+1.06e-03

Pos logit contributions
 boiling+0.140
 microwave+0.134
washer+0.131
 hamb+0.127
 bulldo+0.127
Neg logit contributions
lets-0.094
 every-0.084
packs-0.084
Every-0.083
rings-0.082
Token day
Token ID1110
Feature activation+3.69e-02

Pos logit contributions
 boiling+0.002
us+0.002
 microwave+0.002
washer+0.002
 Farmer+0.002
Neg logit contributions
lets-0.002
rings-0.002
 brace-0.001
 wrist-0.001
 bracelet-0.001
Token,
Token ID11
Feature activation-3.74e-03

Pos logit contributions
 boiling+0.079
us+0.079
 microwave+0.077
washer+0.072
 bulldo+0.072
Neg logit contributions
lets-0.049
rings-0.041
 bracelet-0.038
 wrist-0.037
 styles-0.037
Token John
Token ID1757
Feature activation-1.10e-02

Pos logit contributions
lets+0.006
rings+0.006
 styles+0.005
 wrist+0.005
 brace+0.005
Neg logit contributions
us-0.007
 boiling-0.007
 microwave-0.006
 Farmer-0.006
 bulldo-0.006
Token and
Token ID290
Feature activation-1.06e-02

Pos logit contributions
lets+0.018
rings+0.016
 styles+0.015
 bracelet+0.015
 wrist+0.015
Neg logit contributions
us-0.021
 boiling-0.021
 microwave-0.020
 bulldo-0.019
washer-0.018
Token have
Token ID423
Feature activation-1.59e-03

Pos logit contributions
lets+0.001
rings+0.001
 styles+0.001
 wrist+0.001
packs+0.001
Neg logit contributions
 boiling-0.002
 microwave-0.002
us-0.002
washer-0.002
 bulldo-0.002
Token a
Token ID257
Feature activation-1.02e-02

Pos logit contributions
lets+0.002
rings+0.002
 wrist+0.002
 bracelet+0.002
 styles+0.002
Neg logit contributions
 boiling-0.003
us-0.003
 microwave-0.003
washer-0.003
 bulldo-0.003
Token bad
Token ID2089
Feature activation-2.03e-04

Pos logit contributions
 bracelet+0.014
lets+0.013
 own+0.013
 common+0.012
 goal+0.012
Neg logit contributions
us-0.021
 boiling-0.019
 microwave-0.018
 Farmer-0.018
washer-0.017
Token ending
Token ID7464
Feature activation+2.99e-02

Pos logit contributions
lets+0.000
rings+0.000
 bracelet+0.000
 styles+0.000
 wrist+0.000
Neg logit contributions
us-0.000
 boiling-0.000
 microwave-0.000
washer-0.000
 bulldo-0.000
Token.
Token ID13
Feature activation-8.84e-03

Pos logit contributions
 boiling+0.065
us+0.065
 microwave+0.065
 bulldo+0.061
 grizz+0.061
Neg logit contributions
lets-0.037
rings-0.031
packs-0.029
 styles-0.028
 bracelet-0.028
Token<|endoftext|>
Token ID50256
Feature activation+7.26e-02

Pos logit contributions
lets+0.014
rings+0.012
 bracelet+0.012
 styles+0.011
 brace+0.011
Neg logit contributions
us-0.017
 boiling-0.017
 microwave-0.016
 Farmer-0.015
washer-0.015
TokenLe
Token ID3123
Feature activation-1.53e-02

Pos logit contributions
 boiling+0.134
 hamb+0.128
 microwave+0.128
 grizz+0.127
 bulldo+0.125
Neg logit contributions
lets-0.096
packs-0.095
 tops-0.088
 every-0.086
 color-0.084
Tokeno
Token ID78
Feature activation+2.93e-03

Pos logit contributions
lets+0.035
rings+0.033
 bracelet+0.033
 styles+0.032
 wrist+0.032
Neg logit contributions
us-0.019
 boiling-0.016
 microwave-0.016
 Farmer-0.014
 bulldo-0.014
Token was
Token ID373
Feature activation+1.36e-02

Pos logit contributions
 boiling+0.007
us+0.007
 microwave+0.007
washer+0.006
 bulldo+0.006
Neg logit contributions
lets-0.003
rings-0.003
 bracelet-0.002
 styles-0.002
 wrist-0.002
Token a
Token ID257
Feature activation+2.97e-03

Pos logit contributions
us+0.027
 boiling+0.025
 microwave+0.025
washer+0.024
 bulldo+0.024
Neg logit contributions
lets-0.018
 own-0.016
 every-0.016
 bracelet-0.016
packs-0.016
Token little
Token ID1310
Feature activation+1.29e-02

Pos logit contributions
us+0.006
 boiling+0.006
 microwave+0.006
washer+0.005
auc+0.005
Neg logit contributions
lets-0.004
 own-0.003
 bracelet-0.003
 color-0.003
 wrist-0.003
Token have
Token ID423
Feature activation+1.23e-02

Pos logit contributions
lets+0.012
rings+0.011
packs+0.010
bows+0.009
 styles+0.009
Neg logit contributions
 boiling-0.011
us-0.010
 microwave-0.010
 Farmer-0.010
 polar-0.010
Token at
Token ID379
Feature activation-6.41e-03

Pos logit contributions
 boiling+0.024
us+0.023
 microwave+0.022
 Farmer+0.021
 hamb+0.021
Neg logit contributions
lets-0.019
rings-0.017
 wrist-0.015
 bracelet-0.015
 styles-0.015
Token the
Token ID262
Feature activation+1.60e-05

Pos logit contributions
lets+0.010
rings+0.008
 bracelet+0.008
 styles+0.007
 wrist+0.007
Neg logit contributions
us-0.013
 boiling-0.013
 microwave-0.012
washer-0.011
 bulldo-0.011
Token park
Token ID3952
Feature activation-1.29e-02

Pos logit contributions
 boiling+0.000
 microwave+0.000
us+0.000
 bulldo+0.000
washer+0.000
Neg logit contributions
lets-0.000
rings-0.000
 styles-0.000
packs-0.000
 wrist-0.000
Token.
Token ID13
Feature activation-1.06e-02

Pos logit contributions
lets+0.021
rings+0.018
 bracelet+0.017
 wrist+0.017
 styles+0.017
Neg logit contributions
 boiling-0.023
us-0.023
 microwave-0.022
washer-0.021
 bulldo-0.021
Token<|endoftext|>
Token ID50256
Feature activation+7.25e-02

Pos logit contributions
lets+0.017
rings+0.015
 bracelet+0.014
 styles+0.014
 brace+0.014
Neg logit contributions
us-0.020
 boiling-0.020
 microwave-0.020
 Farmer-0.019
 Doctor-0.018
TokenL
Token ID43
Feature activation-6.54e-04

Pos logit contributions
 boiling+0.127
 microwave+0.123
 hamb+0.119
us+0.118
washer+0.118
Neg logit contributions
lets-0.108
packs-0.102
 tops-0.097
rings-0.096
 every-0.094
Tokenily
Token ID813
Feature activation-9.48e-03

Pos logit contributions
lets+0.001
rings+0.001
 wrist+0.001
bows+0.001
 styles+0.001
Neg logit contributions
us-0.001
 boiling-0.001
 microwave-0.001
washer-0.001
 Farmer-0.001
Token and
Token ID290
Feature activation-2.33e-03

Pos logit contributions
lets+0.013
rings+0.011
 wrist+0.010
 bracelet+0.010
 styles+0.010
Neg logit contributions
 boiling-0.020
us-0.020
 microwave-0.019
 bulldo-0.018
washer-0.018
Token Ben
Token ID3932
Feature activation-1.74e-02

Pos logit contributions
lets+0.004
 styles+0.004
 tops+0.004
bows+0.004
rings+0.004
Neg logit contributions
 Farmer-0.004
 boiling-0.003
 microwave-0.003
 Doctor-0.003
 smoking-0.003
Token were
Token ID547
Feature activation+1.59e-02

Pos logit contributions
lets+0.029
rings+0.026
 wrist+0.026
 styles+0.025
 tops+0.025
Neg logit contributions
 boiling-0.029
us-0.028
 microwave-0.028
 bulldo-0.026
 hamb-0.026
Token trouble
Token ID5876
Feature activation-2.05e-04

Pos logit contributions
lets+0.049
rings+0.044
 bracelet+0.042
 styles+0.042
 wrist+0.042
Neg logit contributions
 boiling-0.035
us-0.035
 microwave-0.033
 bulldo-0.030
washer-0.030
Token.
Token ID13
Feature activation-1.11e-02

Pos logit contributions
lets+0.000
rings+0.000
 styles+0.000
 bracelet+0.000
 wrist+0.000
Neg logit contributions
us-0.000
 boiling-0.000
 microwave-0.000
 bulldo-0.000
washer-0.000
Token The
Token ID383
Feature activation-1.17e-03

Pos logit contributions
lets+0.018
rings+0.015
 bracelet+0.015
 brace+0.015
 styles+0.014
Neg logit contributions
us-0.021
 boiling-0.021
 microwave-0.020
 Farmer-0.020
 bulldo-0.019
Token end
Token ID886
Feature activation+2.95e-02

Pos logit contributions
lets+0.002
rings+0.001
 bracelet+0.001
 styles+0.001
 tops+0.001
Neg logit contributions
us-0.003
 boiling-0.002
 microwave-0.002
washer-0.002
 bulldo-0.002
Token.
Token ID13
Feature activation-7.86e-03

Pos logit contributions
us+0.060
 boiling+0.060
 microwave+0.059
 bulldo+0.055
washer+0.055
Neg logit contributions
lets-0.042
rings-0.035
 bracelet-0.033
 styles-0.033
 wrist-0.032
Token<|endoftext|>
Token ID50256
Feature activation+7.24e-02

Pos logit contributions
lets+0.012
rings+0.010
 bracelet+0.009
 styles+0.009
 wrist+0.009
Neg logit contributions
us-0.016
 boiling-0.016
 microwave-0.015
washer-0.014
 bulldo-0.014
TokenBen
Token ID11696
Feature activation-6.10e-03

Pos logit contributions
 boiling+0.135
 microwave+0.129
 hamb+0.127
washer+0.123
 paramed+0.123
Neg logit contributions
lets-0.097
packs-0.095
 tops-0.090
 every-0.087
 goal-0.086
Token and
Token ID290
Feature activation+1.37e-03

Pos logit contributions
lets+0.007
rings+0.005
 bracelet+0.005
 styles+0.005
 wrist+0.005
Neg logit contributions
us-0.015
 boiling-0.014
 microwave-0.014
washer-0.013
 bulldo-0.013
Token Lily
Token ID20037
Feature activation-2.29e-02

Pos logit contributions
 boiling+0.003
us+0.002
 microwave+0.002
 Farmer+0.002
 hamb+0.002
Neg logit contributions
lets-0.002
rings-0.002
 styles-0.002
 tops-0.002
bows-0.002
Token were
Token ID547
Feature activation+1.09e-02

Pos logit contributions
lets+0.036
rings+0.032
 wrist+0.030
 styles+0.029
 bracelet+0.029
Neg logit contributions
 boiling-0.043
 microwave-0.041
us-0.041
 bulldo-0.038
washer-0.038
Token going
Token ID1016
Feature activation+3.76e-02

Pos logit contributions
us+0.021
 boiling+0.021
 microwave+0.020
 bulldo+0.020
 goblin+0.019
Neg logit contributions
lets-0.015
packs-0.014
 own-0.013
rings-0.013
 bracelet-0.013
Token friends
Token ID2460
Feature activation+2.44e-03

Pos logit contributions
 microwave+0.003
 boiling+0.003
 Farmer+0.003
 bulldo+0.003
us+0.003
Neg logit contributions
lets-0.003
rings-0.002
adays-0.002
Her-0.002
bows-0.002
Token.
Token ID13
Feature activation-1.15e-02

Pos logit contributions
us+0.005
 boiling+0.005
 microwave+0.005
washer+0.004
 bulldo+0.004
Neg logit contributions
lets-0.004
rings-0.003
 styles-0.003
 bracelet-0.003
 wrist-0.003
Token The
Token ID383
Feature activation-1.83e-03

Pos logit contributions
lets+0.022
rings+0.019
 bracelet+0.018
 styles+0.018
 wrist+0.018
Neg logit contributions
us-0.019
 boiling-0.019
 microwave-0.018
 Farmer-0.017
washer-0.017
Token end
Token ID886
Feature activation+2.82e-02

Pos logit contributions
lets+0.002
 bracelet+0.002
rings+0.002
 wrist+0.002
 styles+0.002
Neg logit contributions
us-0.004
 boiling-0.004
 microwave-0.004
washer-0.003
 bulldo-0.003
Token.
Token ID13
Feature activation-3.91e-03

Pos logit contributions
 boiling+0.060
us+0.060
 microwave+0.059
 bulldo+0.055
 Farmer+0.054
Neg logit contributions
lets-0.037
rings-0.031
packs-0.029
 bracelet-0.028
 styles-0.028
Token<|endoftext|>
Token ID50256
Feature activation+7.23e-02

Pos logit contributions
lets+0.006
rings+0.005
 wrist+0.004
 bracelet+0.004
 styles+0.004
Neg logit contributions
 boiling-0.008
us-0.008
 microwave-0.008
washer-0.007
 bulldo-0.007
TokenTom
Token ID13787
Feature activation-1.29e-02

Pos logit contributions
 boiling+0.133
 microwave+0.127
 hamb+0.126
 grizz+0.124
 paramed+0.123
Neg logit contributions
lets-0.096
packs-0.094
 tops-0.090
 every-0.088
 goal-0.086
Token and
Token ID290
Feature activation+9.27e-04

Pos logit contributions
lets+0.015
rings+0.012
 bracelet+0.011
 wrist+0.011
 styles+0.011
Neg logit contributions
us-0.030
 boiling-0.030
 microwave-0.029
washer-0.028
 bulldo-0.027
Token Mia
Token ID32189
Feature activation-3.04e-02

Pos logit contributions
 boiling+0.001
 Farmer+0.001
us+0.001
 microwave+0.001
 polar+0.001
Neg logit contributions
lets-0.002
 styles-0.002
rings-0.002
bows-0.001
 tops-0.001
Token like
Token ID588
Feature activation+2.89e-02

Pos logit contributions
lets+0.037
rings+0.032
 wrist+0.029
 styles+0.028
 bracelet+0.028
Neg logit contributions
 boiling-0.068
us-0.067
 microwave-0.066
 bulldo-0.062
washer-0.062
Token to
Token ID284
Feature activation-8.79e-03

Pos logit contributions
us+0.060
 boiling+0.059
 microwave+0.059
 grizz+0.056
 goblin+0.054
Neg logit contributions
lets-0.040
rings-0.035
 bracelet-0.032
 styles-0.032
 wrist-0.032
Token
Token ID198
Feature activation+2.60e-03

Pos logit contributions
 boiling+0.011
 microwave+0.011
us+0.011
 bulldo+0.010
washer+0.010
Neg logit contributions
lets-0.011
rings-0.009
 bracelet-0.009
Her-0.009
 wrist-0.009
TokenThey
Token ID2990
Feature activation+6.01e-03

Pos logit contributions
us+0.006
 boiling+0.006
 microwave+0.006
washer+0.005
 bulldo+0.005
Neg logit contributions
lets-0.003
rings-0.003
 bracelet-0.002
 styles-0.002
 wrist-0.002
Token are
Token ID389
Feature activation+2.53e-03

Pos logit contributions
 boiling+0.012
 microwave+0.011
us+0.011
washer+0.010
 bulldo+0.010
Neg logit contributions
lets-0.009
rings-0.008
 styles-0.007
packs-0.007
 wrist-0.007
Token lucky
Token ID9670
Feature activation+8.87e-03

Pos logit contributions
 boiling+0.004
us+0.004
 microwave+0.004
 Farmer+0.004
 Doctor+0.004
Neg logit contributions
lets-0.005
rings-0.004
 styles-0.004
Mine-0.004
 languages-0.004
Token.
Token ID13
Feature activation-3.59e-03

Pos logit contributions
us+0.020
 boiling+0.019
 microwave+0.019
 bulldo+0.018
 grizz+0.018
Neg logit contributions
lets-0.011
rings-0.010
 styles-0.009
 bracelet-0.009
 tops-0.009
Token<|endoftext|>
Token ID50256
Feature activation+7.15e-02

Pos logit contributions
lets+0.006
rings+0.005
 bracelet+0.005
 brace+0.005
 styles+0.004
Neg logit contributions
 boiling-0.007
us-0.007
 microwave-0.007
 Farmer-0.006
washer-0.006
TokenL
Token ID43
Feature activation+7.19e-03

Pos logit contributions
 boiling+0.124
 microwave+0.122
us+0.117
 hamb+0.116
washer+0.115
Neg logit contributions
lets-0.109
packs-0.102
rings-0.097
 tops-0.097
bows-0.095
Tokenily
Token ID813
Feature activation-3.69e-03

Pos logit contributions
us+0.014
 boiling+0.014
 microwave+0.013
washer+0.012
 bulldo+0.012
Neg logit contributions
lets-0.011
rings-0.010
 bracelet-0.009
 wrist-0.009
 styles-0.009
Token and
Token ID290
Feature activation+4.60e-03

Pos logit contributions
lets+0.005
rings+0.004
 wrist+0.004
 bracelet+0.004
 tops+0.004
Neg logit contributions
 boiling-0.008
us-0.008
 microwave-0.008
 bulldo-0.007
washer-0.007
Token Ben
Token ID3932
Feature activation-1.25e-02

Pos logit contributions
 boiling+0.008
 Farmer+0.008
 microwave+0.007
 Doctor+0.007
 smoking+0.007
Neg logit contributions
lets-0.008
 styles-0.007
bows-0.006
angles-0.006
rings-0.006
Token were
Token ID547
Feature activation+1.47e-02

Pos logit contributions
lets+0.016
rings+0.014
 wrist+0.013
 tops+0.012
 bracelet+0.012
Neg logit contributions
 boiling-0.027
us-0.025
 microwave-0.025
 hamb-0.024
 bulldo-0.024
Token bee
Token ID20697
Feature activation+1.28e-02

Pos logit contributions
us+0.029
 boiling+0.028
 microwave+0.027
 bulldo+0.026
washer+0.025
Neg logit contributions
lets-0.029
 bracelet-0.027
rings-0.026
 styles-0.026
 own-0.026
Token is
Token ID318
Feature activation+1.36e-02

Pos logit contributions
 boiling+0.028
us+0.028
 microwave+0.027
washer+0.026
 stew+0.026
Neg logit contributions
lets-0.015
rings-0.013
 wrist-0.013
 bracelet-0.012
 styles-0.012
Token their
Token ID511
Feature activation-2.72e-03

Pos logit contributions
 boiling+0.026
us+0.026
 microwave+0.025
 Farmer+0.023
washer+0.023
Neg logit contributions
lets-0.021
rings-0.018
 wrist-0.017
 bracelet-0.017
 styles-0.017
Token friend
Token ID1545
Feature activation+2.95e-03

Pos logit contributions
lets+0.004
rings+0.003
adays+0.003
Her+0.003
 brace+0.003
Neg logit contributions
 boiling-0.006
 microwave-0.005
us-0.005
 Farmer-0.005
 stew-0.005
Token.
Token ID13
Feature activation-1.06e-02

Pos logit contributions
us+0.005
 boiling+0.005
 microwave+0.005
 Farmer+0.005
washer+0.005
Neg logit contributions
lets-0.005
rings-0.004
 styles-0.004
 wrist-0.004
 bracelet-0.004
Token<|endoftext|>
Token ID50256
Feature activation+7.13e-02

Pos logit contributions
lets+0.016
rings+0.014
 bracelet+0.013
 brace+0.013
 styles+0.013
Neg logit contributions
us-0.021
 boiling-0.021
 microwave-0.021
 Farmer-0.020
 Doctor-0.019
TokenTom
Token ID13787
Feature activation-1.30e-02

Pos logit contributions
 boiling+0.124
 microwave+0.120
 hamb+0.117
 grizz+0.116
us+0.115
Neg logit contributions
lets-0.107
packs-0.102
 tops-0.097
 every-0.094
rings-0.094
Token and
Token ID290
Feature activation+1.43e-03

Pos logit contributions
lets+0.018
rings+0.016
 wrist+0.014
 bracelet+0.014
 tops+0.014
Neg logit contributions
us-0.027
 boiling-0.027
 microwave-0.027
washer-0.025
 bulldo-0.024
Token Sam
Token ID3409
Feature activation-2.25e-02

Pos logit contributions
 boiling+0.002
 Farmer+0.002
 microwave+0.002
us+0.002
 hamb+0.002
Neg logit contributions
lets-0.002
bows-0.002
rings-0.002
 styles-0.002
 tops-0.002
Token are
Token ID389
Feature activation+1.28e-02

Pos logit contributions
lets+0.028
rings+0.025
 wrist+0.023
 bracelet+0.021
 styles+0.021
Neg logit contributions
 boiling-0.049
us-0.048
 microwave-0.048
 bulldo-0.045
washer-0.045
Token brothers
Token ID9397
Feature activation+1.42e-02

Pos logit contributions
us+0.024
 boiling+0.024
 microwave+0.023
washer+0.021
 bulldo+0.021
Neg logit contributions
lets-0.021
rings-0.018
 bracelet-0.017
 styles-0.017
 wrist-0.017
Token they
Token ID484
Feature activation-5.11e-04

Pos logit contributions
lets+0.032
rings+0.026
 wrist+0.024
 bracelet+0.024
 styles+0.024
Neg logit contributions
 boiling-0.043
us-0.041
 microwave-0.040
 Farmer-0.038
washer-0.038
Token made
Token ID925
Feature activation-1.04e-02

Pos logit contributions
lets+0.001
rings+0.001
 styles+0.001
 wrist+0.001
 bracelet+0.001
Neg logit contributions
 boiling-0.001
us-0.001
 microwave-0.001
washer-0.001
 hamb-0.001
Token a
Token ID257
Feature activation-1.60e-02

Pos logit contributions
lets+0.014
rings+0.012
 styles+0.011
 bracelet+0.011
 brace+0.011
Neg logit contributions
us-0.022
 boiling-0.022
 microwave-0.021
washer-0.020
 bulldo-0.020
Token new
Token ID649
Feature activation-1.09e-02

Pos logit contributions
lets+0.020
 bracelet+0.019
 own+0.018
 goal+0.017
 brace+0.017
Neg logit contributions
us-0.035
 boiling-0.033
 microwave-0.031
washer-0.031
 Farmer-0.030
Token friend
Token ID1545
Feature activation+1.12e-02

Pos logit contributions
lets+0.016
rings+0.014
 bracelet+0.013
 styles+0.013
 wrist+0.013
Neg logit contributions
 boiling-0.021
us-0.021
 microwave-0.021
washer-0.019
 bulldo-0.019
Token<|endoftext|>
Token ID50256
Feature activation+7.11e-02

Pos logit contributions
us+0.021
 boiling+0.021
 microwave+0.020
washer+0.019
 Farmer+0.018
Neg logit contributions
lets-0.018
rings-0.015
 styles-0.014
 bracelet-0.014
 wrist-0.014
TokenTim
Token ID14967
Feature activation-4.62e-03

Pos logit contributions
 boiling+0.126
 microwave+0.122
 hamb+0.118
us+0.117
washer+0.116
Neg logit contributions
lets-0.104
packs-0.101
 tops-0.093
rings-0.093
bows-0.089
Token and
Token ID290
Feature activation-4.91e-03

Pos logit contributions
lets+0.007
rings+0.006
 bracelet+0.006
 wrist+0.006
 brace+0.006
Neg logit contributions
us-0.009
 boiling-0.009
 microwave-0.008
washer-0.008
 bulldo-0.008
Token Jen
Token ID13364
Feature activation-2.39e-02

Pos logit contributions
lets+0.009
 styles+0.008
rings+0.008
bows+0.008
angles+0.008
Neg logit contributions
 boiling-0.008
 Farmer-0.008
 microwave-0.007
us-0.007
washer-0.007
Token were
Token ID547
Feature activation+1.13e-02

Pos logit contributions
lets+0.041
rings+0.037
 wrist+0.036
 bracelet+0.036
 styles+0.036
Neg logit contributions
 boiling-0.038
us-0.038
 microwave-0.037
 bulldo-0.035
washer-0.035
Token playing
Token ID2712
Feature activation+1.56e-03

Pos logit contributions
us+0.022
 boiling+0.021
 microwave+0.021
 bulldo+0.020
 goblin+0.020
Neg logit contributions
lets-0.017
packs-0.014
rings-0.014
 bracelet-0.014
 own-0.014
Token part
Token ID636
Feature activation+3.30e-03

Pos logit contributions
 boiling+0.002
us+0.002
 microwave+0.002
 Farmer+0.002
washer+0.002
Neg logit contributions
lets-0.002
rings-0.002
 bracelet-0.002
 wrist-0.002
 styles-0.002
Token of
Token ID286
Feature activation+1.23e-02

Pos logit contributions
us+0.007
 boiling+0.007
 microwave+0.006
washer+0.006
 bulldo+0.006
Neg logit contributions
lets-0.005
rings-0.004
 bracelet-0.004
 styles-0.004
 wrist-0.004
Token the
Token ID262
Feature activation+1.53e-02

Pos logit contributions
 boiling+0.027
us+0.027
 microwave+0.026
 bulldo+0.025
washer+0.025
Neg logit contributions
lets-0.016
rings-0.013
 bracelet-0.012
 styles-0.012
 wrist-0.012
Token nation
Token ID3277
Feature activation-1.50e-02

Pos logit contributions
 boiling+0.028
 microwave+0.027
us+0.027
washer+0.025
 bulldo+0.025
Neg logit contributions
lets-0.026
rings-0.023
 wrist-0.020
 styles-0.020
Her-0.020
Token.
Token ID13
Feature activation-1.13e-02

Pos logit contributions
lets+0.025
rings+0.022
 bracelet+0.021
 styles+0.021
 wrist+0.020
Neg logit contributions
 boiling-0.025
us-0.025
 microwave-0.024
washer-0.024
 bulldo-0.022
Token<|endoftext|>
Token ID50256
Feature activation+7.10e-02

Pos logit contributions
lets+0.019
rings+0.016
 bracelet+0.016
 brace+0.015
 styles+0.015
Neg logit contributions
us-0.021
 microwave-0.021
 boiling-0.020
 Farmer-0.020
 Doctor-0.019
TokenBen
Token ID11696
Feature activation-3.63e-03

Pos logit contributions
 boiling+0.122
 microwave+0.116
us+0.112
 hamb+0.111
 grizz+0.109
Neg logit contributions
lets-0.112
packs-0.103
 tops-0.101
rings-0.100
 every-0.097
Token and
Token ID290
Feature activation+9.62e-04

Pos logit contributions
lets+0.004
rings+0.004
 bracelet+0.003
 wrist+0.003
 styles+0.003
Neg logit contributions
us-0.008
 boiling-0.008
 microwave-0.008
 bulldo-0.008
washer-0.008
Token Sam
Token ID3409
Feature activation-2.05e-02

Pos logit contributions
 boiling+0.002
 Farmer+0.002
us+0.002
 microwave+0.002
 bulldo+0.001
Neg logit contributions
lets-0.002
rings-0.001
 styles-0.001
 tops-0.001
bows-0.001
Token are
Token ID389
Feature activation+1.15e-02

Pos logit contributions
lets+0.029
rings+0.026
 wrist+0.025
 bracelet+0.023
 styles+0.023
Neg logit contributions
 boiling-0.041
us-0.040
 microwave-0.040
 bulldo-0.038
washer-0.037
Token brothers
Token ID9397
Feature activation+1.38e-02

Pos logit contributions
us+0.021
 boiling+0.021
 microwave+0.020
washer+0.019
 bulldo+0.018
Neg logit contributions
lets-0.020
rings-0.017
 styles-0.016
 bracelet-0.016
 wrist-0.016
Token
Token ID198
Feature activation+4.21e-03

Pos logit contributions
lets+0.020
rings+0.017
 bracelet+0.017
 brace+0.016
 styles+0.016
Neg logit contributions
us-0.028
 boiling-0.028
 microwave-0.027
 Farmer-0.026
washer-0.025
Token
Token ID198
Feature activation-3.42e-03

Pos logit contributions
 microwave+0.009
 boiling+0.009
us+0.009
 goblin+0.009
 hamb+0.008
Neg logit contributions
lets-0.005
 own-0.004
Her-0.004
rings-0.004
 cherish-0.004
TokenThe
Token ID464
Feature activation-7.71e-03

Pos logit contributions
lets+0.003
rings+0.002
Her+0.002
 bracelet+0.002
 wrist+0.002
Neg logit contributions
 boiling-0.008
us-0.008
 microwave-0.008
 grizz-0.008
 bulldo-0.008
Token end
Token ID886
Feature activation+2.53e-02

Pos logit contributions
lets+0.010
 own+0.009
 bracelet+0.009
 styles+0.009
 tops+0.009
Neg logit contributions
us-0.017
 boiling-0.015
 microwave-0.015
washer-0.014
 bulldo-0.014
Token.
Token ID13
Feature activation-3.44e-03

Pos logit contributions
 boiling+0.060
us+0.059
 microwave+0.058
 hamb+0.054
 bulldo+0.054
Neg logit contributions
lets-0.026
packs-0.021
rings-0.021
 tops-0.019
 styles-0.019
Token<|endoftext|>
Token ID50256
Feature activation+7.10e-02

Pos logit contributions
lets+0.005
rings+0.004
 wrist+0.004
 bracelet+0.004
packs+0.004
Neg logit contributions
us-0.007
 boiling-0.007
 microwave-0.007
washer-0.007
 bulldo-0.007
TokenTom
Token ID13787
Feature activation-9.19e-03

Pos logit contributions
 boiling+0.133
 microwave+0.129
 hamb+0.127
 grizz+0.125
 burgers+0.122
Neg logit contributions
lets-0.093
packs-0.092
 tops-0.084
 color-0.083
bows-0.083
Token is
Token ID318
Feature activation+1.46e-02

Pos logit contributions
lets+0.011
rings+0.009
 bracelet+0.008
 styles+0.008
 wrist+0.008
Neg logit contributions
us-0.021
 boiling-0.021
 microwave-0.020
washer-0.019
 bulldo-0.019
Token a
Token ID257
Feature activation+4.12e-03

Pos logit contributions
us+0.028
 microwave+0.027
 boiling+0.026
 grizz+0.026
 goblin+0.025
Neg logit contributions
lets-0.020
 own-0.019
 every-0.018
rings-0.018
packs-0.018
Token cat
Token ID3797
Feature activation+2.16e-02

Pos logit contributions
us+0.009
 boiling+0.008
 microwave+0.008
cher+0.008
washer+0.008
Neg logit contributions
lets-0.005
 bracelet-0.005
 own-0.005
 color-0.005
 wrist-0.004
Token who
Token ID508
Feature activation+2.13e-02

Pos logit contributions
us+0.047
 boiling+0.047
 microwave+0.046
 bulldo+0.043
 Farmer+0.043
Neg logit contributions
lets-0.027
rings-0.022
 wrist-0.021
 bracelet-0.021
 styles-0.020
Token I
Token ID314
Feature activation+1.14e-02

Pos logit contributions
 boiling+0.028
us+0.028
 microwave+0.027
 Farmer+0.025
washer+0.025
Neg logit contributions
lets-0.022
rings-0.019
 bracelet-0.018
 wrist-0.018
packs-0.018
Token have
Token ID423
Feature activation+7.40e-03

Pos logit contributions
us+0.022
 boiling+0.022
 microwave+0.021
washer+0.020
 bulldo+0.020
Neg logit contributions
lets-0.017
rings-0.015
 bracelet-0.014
 wrist-0.014
 styles-0.014
Token caught
Token ID4978
Feature activation+2.09e-02

Pos logit contributions
 boiling+0.014
us+0.014
 microwave+0.014
washer+0.013
 Farmer+0.013
Neg logit contributions
lets-0.012
rings-0.010
 wrist-0.009
 bracelet-0.009
 styles-0.009
Token lots
Token ID6041
Feature activation+1.85e-02

Pos logit contributions
us+0.041
 boiling+0.041
 microwave+0.039
 Farmer+0.037
 bulldo+0.036
Neg logit contributions
lets-0.031
rings-0.027
 wrist-0.025
 bracelet-0.025
 styles-0.025
Token of
Token ID286
Feature activation+2.73e-02

Pos logit contributions
us+0.033
 boiling+0.033
 microwave+0.031
washer+0.029
 bulldo+0.028
Neg logit contributions
lets-0.031
rings-0.028
 wrist-0.026
 bracelet-0.026
packs-0.026
Token<|endoftext|>
Token ID50256
Feature activation+7.10e-02

Pos logit contributions
us+0.050
 boiling+0.049
 microwave+0.048
washer+0.045
 bulldo+0.044
Neg logit contributions
lets-0.044
rings-0.038
 styles-0.037
 bracelet-0.037
 tops-0.036
TokenOnce
Token ID7454
Feature activation-1.81e-02

Pos logit contributions
 boiling+0.123
 microwave+0.121
us+0.115
washer+0.114
 hamb+0.113
Neg logit contributions
lets-0.110
rings-0.099
packs-0.099
 tops-0.095
 every-0.093
Token upon
Token ID2402
Feature activation-7.36e-04

Pos logit contributions
lets+0.031
 wrist+0.026
 brace+0.026
 styles+0.025
 wrists+0.025
Neg logit contributions
us-0.034
 boiling-0.033
 microwave-0.031
washer-0.031
 Farmer-0.029
Token a
Token ID257
Feature activation+6.54e-03

Pos logit contributions
lets+0.001
rings+0.001
 styles+0.001
 wrist+0.001
angles+0.001
Neg logit contributions
 boiling-0.001
us-0.001
 microwave-0.001
 Farmer-0.001
washer-0.001
Token time
Token ID640
Feature activation+1.67e-02

Pos logit contributions
 boiling+0.013
us+0.013
 microwave+0.013
 Farmer+0.012
 bulldo+0.012
Neg logit contributions
lets-0.009
rings-0.008
 styles-0.007
 bracelet-0.007
 wrist-0.007
Token,
Token ID11
Feature activation-1.41e-02

Pos logit contributions
us+0.030
 boiling+0.029
 Farmer+0.026
 microwave+0.026
washer+0.026
Neg logit contributions
lets-0.030
 styles-0.025
rings-0.025
 wrist-0.024
 wrists-0.024
Token bird
Token ID6512
Feature activation-6.88e-03

Pos logit contributions
us+0.004
 boiling+0.004
 microwave+0.003
washer+0.003
 bulldo+0.003
Neg logit contributions
lets-0.003
rings-0.002
 bracelet-0.002
 styles-0.002
 wrist-0.002
Token to
Token ID284
Feature activation-1.30e-02

Pos logit contributions
lets+0.013
 wrist+0.012
Mine+0.012
 styles+0.012
herry+0.012
Neg logit contributions
 boiling-0.009
us-0.009
 microwave-0.009
washer-0.009
 smoking-0.008
Token be
Token ID307
Feature activation+1.16e-02

Pos logit contributions
lets+0.023
rings+0.018
 styles+0.017
 languages+0.017
angles+0.017
Neg logit contributions
 boiling-0.024
 polar-0.022
 microwave-0.022
 Farmer-0.021
 bulldo-0.021
Token okay
Token ID8788
Feature activation-6.83e-03

Pos logit contributions
 boiling+0.020
us+0.020
 microwave+0.019
 Farmer+0.017
 bulldo+0.017
Neg logit contributions
lets-0.020
rings-0.018
 wrist-0.017
 bracelet-0.017
 styles-0.017
Token.
Token ID13
Feature activation-1.68e-02

Pos logit contributions
lets+0.011
rings+0.009
 wrist+0.009
 bracelet+0.009
 styles+0.009
Neg logit contributions
us-0.013
 boiling-0.013
 microwave-0.012
washer-0.012
 Farmer-0.011
Token<|endoftext|>
Token ID50256
Feature activation+7.09e-02

Pos logit contributions
lets+0.030
rings+0.027
 bracelet+0.026
 brace+0.025
angles+0.025
Neg logit contributions
 microwave-0.028
 boiling-0.027
us-0.027
 Farmer-0.027
 Doctor-0.025
TokenL
Token ID43
Feature activation+1.12e-03

Pos logit contributions
 boiling+0.127
 microwave+0.123
 hamb+0.121
 grizz+0.119
 paramed+0.117
Neg logit contributions
lets-0.100
packs-0.096
 tops-0.091
 every-0.089
 color-0.087
Tokenily
Token ID813
Feature activation-7.78e-03

Pos logit contributions
us+0.002
 boiling+0.002
 microwave+0.002
washer+0.002
 bulldo+0.002
Neg logit contributions
lets-0.002
rings-0.002
 styles-0.001
 wrist-0.001
 bracelet-0.001
Token and
Token ID290
Feature activation-1.29e-04

Pos logit contributions
lets+0.010
rings+0.008
 wrist+0.007
 bracelet+0.007
 styles+0.007
Neg logit contributions
 boiling-0.017
 microwave-0.017
us-0.017
 bulldo-0.016
washer-0.016
Token Ben
Token ID3932
Feature activation-1.52e-02

Pos logit contributions
lets+0.000
 styles+0.000
bows+0.000
 tops+0.000
angles+0.000
Neg logit contributions
 Farmer-0.000
 boiling-0.000
 microwave-0.000
 Doctor-0.000
 smoking-0.000
Token were
Token ID547
Feature activation+1.26e-02

Pos logit contributions
lets+0.020
rings+0.018
 wrist+0.016
 styles+0.015
 tops+0.015
Neg logit contributions
 boiling-0.032
 microwave-0.031
us-0.031
 bulldo-0.029
 hamb-0.029
Token is
Token ID318
Feature activation-3.90e-03

Pos logit contributions
lets+0.002
rings+0.002
 wrist+0.002
 bracelet+0.002
packs+0.002
Neg logit contributions
 boiling-0.005
 microwave-0.005
us-0.005
washer-0.004
 bulldo-0.004
Token a
Token ID257
Feature activation-9.62e-03

Pos logit contributions
lets+0.006
rings+0.005
 bracelet+0.004
 styles+0.004
 brace+0.004
Neg logit contributions
us-0.008
 boiling-0.008
 microwave-0.008
 bulldo-0.007
washer-0.007
Token useful
Token ID4465
Feature activation+9.71e-04

Pos logit contributions
 bracelet+0.012
lets+0.011
 color+0.011
 own+0.011
 wrist+0.010
Neg logit contributions
us-0.021
 boiling-0.019
 microwave-0.019
 grizz-0.018
 Farmer-0.018
Token thing
Token ID1517
Feature activation+3.76e-03

Pos logit contributions
us+0.002
 boiling+0.002
 microwave+0.002
washer+0.002
 bulldo+0.002
Neg logit contributions
lets-0.001
rings-0.001
 bracelet-0.001
 styles-0.001
 wrist-0.001
Token.
Token ID13
Feature activation-7.94e-03

Pos logit contributions
us+0.007
 boiling+0.007
 microwave+0.007
washer+0.007
 bulldo+0.007
Neg logit contributions
lets-0.006
rings-0.005
 bracelet-0.005
 styles-0.005
 wrist-0.004
Token<|endoftext|>
Token ID50256
Feature activation+7.09e-02

Pos logit contributions
lets+0.012
rings+0.010
 bracelet+0.009
 styles+0.009
 brace+0.009
Neg logit contributions
us-0.016
 boiling-0.016
 microwave-0.016
 Farmer-0.015
 bulldo-0.015
TokenOne
Token ID3198
Feature activation+2.39e-03

Pos logit contributions
 boiling+0.128
 microwave+0.122
 hamb+0.120
 grizz+0.119
us+0.118
Neg logit contributions
lets-0.100
packs-0.095
 tops-0.092
 every-0.091
 color-0.088
Token day
Token ID1110
Feature activation+4.57e-02

Pos logit contributions
 boiling+0.004
us+0.004
 microwave+0.004
washer+0.004
 Farmer+0.004
Neg logit contributions
lets-0.004
rings-0.004
 brace-0.004
 wrists-0.004
 wrist-0.004
Token,
Token ID11
Feature activation-6.07e-03

Pos logit contributions
us+0.090
 boiling+0.088
 microwave+0.083
washer+0.079
 bulldo+0.079
Neg logit contributions
lets-0.072
rings-0.061
 styles-0.059
 wrist-0.059
 bracelet-0.057
Token a
Token ID257
Feature activation+4.87e-03

Pos logit contributions
lets+0.013
 styles+0.012
rings+0.012
 languages+0.011
 wrists+0.011
Neg logit contributions
 boiling-0.009
us-0.008
 bulldo-0.007
 cooking-0.007
 Farmer-0.007
Token little
Token ID1310
Feature activation+1.62e-02

Pos logit contributions
us+0.011
 boiling+0.011
 microwave+0.011
 grizz+0.010
washer+0.010
Neg logit contributions
lets-0.005
 bracelet-0.005
 color-0.005
 own-0.005
 likes-0.004
Token.
Token ID13
Feature activation-1.04e-02

Pos logit contributions
lets+0.017
rings+0.014
 styles+0.013
 wrist+0.013
 bracelet+0.013
Neg logit contributions
us-0.028
 boiling-0.028
 microwave-0.027
 bulldo-0.025
washer-0.025
Token They
Token ID1119
Feature activation+5.72e-04

Pos logit contributions
lets+0.014
rings+0.012
 bracelet+0.011
 wrist+0.011
 brace+0.011
Neg logit contributions
 boiling-0.022
us-0.022
 microwave-0.022
 Farmer-0.021
 bulldo-0.020
Token are
Token ID389
Feature activation+5.63e-03

Pos logit contributions
 boiling+0.001
 microwave+0.001
us+0.001
 stew+0.001
 hamb+0.001
Neg logit contributions
lets-0.001
rings-0.001
packs-0.001
 languages-0.001
 styles-0.001
Token happy
Token ID3772
Feature activation-7.50e-03

Pos logit contributions
 boiling+0.011
us+0.011
 microwave+0.010
washer+0.010
 bulldo+0.010
Neg logit contributions
lets-0.009
rings-0.008
 bracelet-0.007
 styles-0.007
 wrist-0.007
Token.
Token ID13
Feature activation-8.64e-03

Pos logit contributions
lets+0.011
rings+0.009
 bracelet+0.008
 styles+0.008
 wrist+0.008
Neg logit contributions
us-0.015
 boiling-0.015
 microwave-0.015
 bulldo-0.014
washer-0.014
Token<|endoftext|>
Token ID50256
Feature activation+7.08e-02

Pos logit contributions
lets+0.013
rings+0.011
 bracelet+0.011
 brace+0.011
 wrist+0.010
Neg logit contributions
us-0.017
 boiling-0.017
 microwave-0.016
 Farmer-0.016
 bulldo-0.015
TokenS
Token ID50
Feature activation-1.33e-02

Pos logit contributions
 boiling+0.128
 microwave+0.126
 hamb+0.122
 grizz+0.121
washer+0.119
Neg logit contributions
lets-0.099
packs-0.094
 tops-0.092
 every-0.089
bows-0.086
Tokenara
Token ID3301
Feature activation-5.20e-03

Pos logit contributions
lets+0.017
rings+0.014
 bracelet+0.013
 styles+0.013
 wrist+0.013
Neg logit contributions
us-0.029
 boiling-0.029
 microwave-0.028
washer-0.026
 bulldo-0.026
Token and
Token ID290
Feature activation+5.04e-03

Pos logit contributions
lets+0.006
rings+0.005
 bracelet+0.005
 wrist+0.005
 styles+0.005
Neg logit contributions
 boiling-0.012
us-0.012
 microwave-0.012
 bulldo-0.011
washer-0.011
Token Ben
Token ID3932
Feature activation-1.55e-02

Pos logit contributions
 boiling+0.009
 Farmer+0.008
 microwave+0.008
us+0.008
 burgers+0.008
Neg logit contributions
lets-0.009
 styles-0.008
bows-0.008
 tops-0.007
rings-0.007
Token are
Token ID389
Feature activation+1.34e-02

Pos logit contributions
lets+0.019
rings+0.017
 wrist+0.015
 styles+0.015
 bracelet+0.015
Neg logit contributions
 boiling-0.035
 microwave-0.033
us-0.033
 bulldo-0.031
 hamb-0.031
Token.
Token ID13
Feature activation-1.22e-02

Pos logit contributions
 boiling+0.001
us+0.001
 microwave+0.001
washer+0.001
 stew+0.001
Neg logit contributions
lets-0.001
rings-0.001
 wrist-0.001
 bracelet-0.001
 styles-0.001
Token It
Token ID632
Feature activation-1.88e-04

Pos logit contributions
lets+0.019
rings+0.017
 bracelet+0.016
 brace+0.016
 styles+0.016
Neg logit contributions
 microwave-0.023
 boiling-0.023
us-0.022
 Farmer-0.022
 Doctor-0.020
Token is
Token ID318
Feature activation+3.75e-03

Pos logit contributions
lets+0.000
rings+0.000
 wrist+0.000
Her+0.000
packs+0.000
Neg logit contributions
 microwave-0.000
 boiling-0.000
us-0.000
 meat-0.000
washer-0.000
Token art
Token ID1242
Feature activation-1.11e-02

Pos logit contributions
 boiling+0.007
us+0.007
 microwave+0.007
 Farmer+0.006
 bulldo+0.006
Neg logit contributions
lets-0.006
rings-0.005
 wrist-0.005
 styles-0.005
 bracelet-0.005
Token.
Token ID13
Feature activation-8.26e-03

Pos logit contributions
lets+0.017
rings+0.015
 bracelet+0.013
 wrist+0.013
packs+0.013
Neg logit contributions
 boiling-0.021
us-0.021
 microwave-0.021
washer-0.020
 bulldo-0.019
Token<|endoftext|>
Token ID50256
Feature activation+7.05e-02

Pos logit contributions
lets+0.012
rings+0.011
 bracelet+0.010
 styles+0.010
 brace+0.010
Neg logit contributions
 boiling-0.016
us-0.016
 microwave-0.016
 Farmer-0.015
 bulldo-0.015
TokenBen
Token ID11696
Feature activation-3.05e-03

Pos logit contributions
 boiling+0.121
 microwave+0.114
us+0.111
 hamb+0.111
 grizz+0.109
Neg logit contributions
lets-0.112
packs-0.104
 tops-0.101
rings-0.099
 every-0.096
Token and
Token ID290
Feature activation+1.50e-03

Pos logit contributions
lets+0.003
rings+0.003
 wrist+0.003
 bracelet+0.003
 styles+0.002
Neg logit contributions
 boiling-0.007
us-0.007
 microwave-0.007
 bulldo-0.007
washer-0.007
Token Mia
Token ID32189
Feature activation-2.80e-02

Pos logit contributions
 boiling+0.003
 Farmer+0.003
 microwave+0.003
us+0.002
 bulldo+0.002
Neg logit contributions
lets-0.002
rings-0.002
 styles-0.002
 tops-0.002
bows-0.002
Token were
Token ID547
Feature activation+1.44e-02

Pos logit contributions
lets+0.037
rings+0.033
 wrist+0.030
Her+0.028
 styles+0.028
Neg logit contributions
 boiling-0.057
 microwave-0.056
us-0.054
 bulldo-0.052
 hamb-0.051
Token playing
Token ID2712
Feature activation+8.85e-04

Pos logit contributions
us+0.028
 boiling+0.028
 microwave+0.027
 bulldo+0.025
washer+0.025
Neg logit contributions
lets-0.022
rings-0.019
 bracelet-0.018
packs-0.017
 wrist-0.017
Token twins
Token ID20345
Feature activation+6.85e-03

Pos logit contributions
lets+0.002
rings+0.001
adays+0.001
bows+0.001
angles+0.001
Neg logit contributions
 boiling-0.002
 microwave-0.002
us-0.002
 bulldo-0.002
washer-0.002
Token.
Token ID13
Feature activation-7.62e-03

Pos logit contributions
us+0.014
 boiling+0.014
 microwave+0.014
 bulldo+0.013
washer+0.013
Neg logit contributions
lets-0.009
rings-0.008
 bracelet-0.007
 wrist-0.007
 styles-0.007
Token The
Token ID383
Feature activation+1.05e-03

Pos logit contributions
lets+0.011
rings+0.009
 bracelet+0.009
 styles+0.009
 wrist+0.008
Neg logit contributions
us-0.016
 boiling-0.015
 microwave-0.015
 Farmer-0.014
 bulldo-0.014
Token end
Token ID886
Feature activation+3.14e-02

Pos logit contributions
us+0.002
 boiling+0.002
washer+0.002
 microwave+0.002
 grizz+0.002
Neg logit contributions
lets-0.001
 bracelet-0.001
 own-0.001
 tops-0.001
 styles-0.001
Token.
Token ID13
Feature activation-1.06e-03

Pos logit contributions
 boiling+0.069
us+0.069
 microwave+0.068
 bulldo+0.063
 hamb+0.063
Neg logit contributions
lets-0.039
rings-0.032
packs-0.030
 bracelet-0.030
 styles-0.030
Token<|endoftext|>
Token ID50256
Feature activation+7.05e-02

Pos logit contributions
lets+0.002
rings+0.001
 bracelet+0.001
 styles+0.001
 wrist+0.001
Neg logit contributions
us-0.002
 boiling-0.002
 microwave-0.002
 Farmer-0.002
 bulldo-0.002
TokenL
Token ID43
Feature activation-2.03e-03

Pos logit contributions
 boiling+0.130
 microwave+0.125
 hamb+0.122
 grizz+0.122
washer+0.119
Neg logit contributions
lets-0.095
packs-0.092
 every-0.087
 tops-0.086
 own-0.083
Tokenila
Token ID10102
Feature activation-9.30e-03

Pos logit contributions
lets+0.003
rings+0.003
 wrist+0.003
 bracelet+0.003
 styles+0.003
Neg logit contributions
us-0.004
 boiling-0.004
 microwave-0.004
washer-0.003
 bulldo-0.003
Token and
Token ID290
Feature activation-3.30e-03

Pos logit contributions
lets+0.013
rings+0.012
 bracelet+0.011
 styles+0.011
 wrist+0.011
Neg logit contributions
 boiling-0.019
us-0.019
 microwave-0.018
washer-0.017
 bulldo-0.017
Token Ben
Token ID3932
Feature activation-2.04e-02

Pos logit contributions
lets+0.006
 styles+0.006
bows+0.005
rings+0.005
 tops+0.005
Neg logit contributions
 Farmer-0.005
 boiling-0.005
 microwave-0.005
 Doctor-0.005
us-0.005
Token were
Token ID547
Feature activation+1.03e-02

Pos logit contributions
lets+0.031
rings+0.028
 styles+0.026
 wrist+0.026
 tops+0.026
Neg logit contributions
 boiling-0.038
 microwave-0.036
us-0.036
 bulldo-0.034
 hamb-0.034
Token her
Token ID607
Feature activation-6.76e-03

Pos logit contributions
lets+0.004
rings+0.003
 bracelet+0.003
 wrist+0.003
 styles+0.003
Neg logit contributions
 boiling-0.006
us-0.006
 microwave-0.006
washer-0.005
 Farmer-0.005
Token about
Token ID546
Feature activation+6.49e-03

Pos logit contributions
lets+0.012
rings+0.010
Mine+0.010
 wrists+0.010
 wrist+0.010
Neg logit contributions
washer-0.011
us-0.011
 boiling-0.011
 microwave-0.011
 Doctor-0.010
Token their
Token ID511
Feature activation-1.77e-05

Pos logit contributions
us+0.015
 boiling+0.015
 microwave+0.014
washer+0.014
 bulldo+0.014
Neg logit contributions
lets-0.008
rings-0.006
 bracelet-0.006
 styles-0.006
 wrist-0.006
Token day
Token ID1110
Feature activation+1.81e-02

Pos logit contributions
lets+0.000
rings+0.000
 bracelet+0.000
 styles+0.000
 wrist+0.000
Neg logit contributions
us-0.000
 boiling-0.000
 microwave-0.000
washer-0.000
 bulldo-0.000
Token.
Token ID13
Feature activation-4.62e-03

Pos logit contributions
us+0.039
 boiling+0.039
 microwave+0.038
 bulldo+0.036
 grizz+0.036
Neg logit contributions
lets-0.023
rings-0.020
 styles-0.018
packs-0.018
 bracelet-0.018
Token<|endoftext|>
Token ID50256
Feature activation+7.05e-02

Pos logit contributions
lets+0.007
rings+0.005
 bracelet+0.005
 styles+0.005
 wrist+0.005
Neg logit contributions
us-0.009
 boiling-0.009
 microwave-0.009
washer-0.009
 bulldo-0.009
TokenS
Token ID50
Feature activation-1.16e-02

Pos logit contributions
 boiling+0.123
 microwave+0.117
us+0.115
 hamb+0.113
 bulldo+0.112
Neg logit contributions
lets-0.108
packs-0.101
rings-0.097
 tops-0.095
 every-0.094
Tokenara
Token ID3301
Feature activation-5.59e-03

Pos logit contributions
lets+0.014
rings+0.011
 styles+0.010
 wrist+0.010
 tops+0.010
Neg logit contributions
us-0.027
 boiling-0.027
 microwave-0.027
washer-0.025
 bulldo-0.024
Token liked
Token ID8288
Feature activation+4.09e-02

Pos logit contributions
lets+0.008
rings+0.007
 bracelet+0.006
 styles+0.006
 wrist+0.006
Neg logit contributions
us-0.011
 boiling-0.011
 microwave-0.011
washer-0.010
 bulldo-0.010
Token to
Token ID284
Feature activation-1.49e-02

Pos logit contributions
us+0.088
 boiling+0.084
 microwave+0.084
 grizz+0.080
 goblin+0.079
Neg logit contributions
lets-0.054
rings-0.048
 bracelet-0.046
 styles-0.045
 tops-0.045
Token play
Token ID711
Feature activation-9.73e-04

Pos logit contributions
lets+0.024
rings+0.020
angles+0.019
bows+0.019
packs+0.019
Neg logit contributions
 boiling-0.027
 microwave-0.026
 Farmer-0.024
washer-0.024
 stew-0.024
Token.
Token ID13
Feature activation-4.38e-03

Pos logit contributions
 microwave+0.015
 boiling+0.015
us+0.015
 bulldo+0.014
washer+0.014
Neg logit contributions
lets-0.008
packs-0.006
rings-0.006
 styles-0.006
 tops-0.006
Token They
Token ID1119
Feature activation-4.23e-04

Pos logit contributions
lets+0.007
rings+0.006
 bracelet+0.006
 styles+0.006
 brace+0.006
Neg logit contributions
us-0.008
 boiling-0.008
 microwave-0.008
 Farmer-0.008
 Doctor-0.007
Token are
Token ID389
Feature activation-4.21e-03

Pos logit contributions
lets+0.001
rings+0.001
packs+0.000
 wrist+0.000
Her+0.000
Neg logit contributions
 boiling-0.001
 microwave-0.001
us-0.001
 bulldo-0.001
 hamb-0.001
Token happy
Token ID3772
Feature activation-5.69e-03

Pos logit contributions
lets+0.006
rings+0.006
 bracelet+0.005
 styles+0.005
 wrist+0.005
Neg logit contributions
us-0.008
 boiling-0.008
 microwave-0.008
washer-0.007
 bulldo-0.007
Token.
Token ID13
Feature activation-4.45e-03

Pos logit contributions
lets+0.007
rings+0.006
 styles+0.005
 bracelet+0.005
packs+0.005
Neg logit contributions
us-0.013
 boiling-0.012
 microwave-0.012
 bulldo-0.012
washer-0.011
Token<|endoftext|>
Token ID50256
Feature activation+7.05e-02

Pos logit contributions
lets+0.008
rings+0.007
 bracelet+0.007
 brace+0.006
 styles+0.006
Neg logit contributions
us-0.008
 boiling-0.008
 microwave-0.007
 Farmer-0.007
 Doctor-0.007
TokenAnna
Token ID31160
Feature activation-1.36e-02

Pos logit contributions
 boiling+0.121
 microwave+0.119
 hamb+0.113
us+0.112
 grizz+0.112
Neg logit contributions
lets-0.107
packs-0.100
 tops-0.097
 every-0.093
rings-0.093
Token and
Token ID290
Feature activation+2.62e-03

Pos logit contributions
lets+0.016
rings+0.013
 bracelet+0.012
 wrist+0.012
 styles+0.012
Neg logit contributions
us-0.031
 boiling-0.031
 microwave-0.030
 bulldo-0.029
washer-0.028
Token Ben
Token ID3932
Feature activation-1.39e-02

Pos logit contributions
 boiling+0.005
 microwave+0.005
 Farmer+0.004
us+0.004
 Doctor+0.004
Neg logit contributions
lets-0.004
 styles-0.004
rings-0.004
bows-0.004
 tops-0.004
Token are
Token ID389
Feature activation+1.09e-02

Pos logit contributions
lets+0.016
rings+0.014
 wrist+0.012
 bracelet+0.012
 styles+0.011
Neg logit contributions
 boiling-0.032
us-0.031
 microwave-0.031
 bulldo-0.029
 hamb-0.029
Token friends
Token ID2460
Feature activation+1.90e-02

Pos logit contributions
us+0.020
 boiling+0.020
 microwave+0.019
 bulldo+0.018
washer+0.018
Neg logit contributions
lets-0.018
rings-0.015
 bracelet-0.015
 styles-0.014
 wrist-0.014
Token was
Token ID373
Feature activation+1.15e-02

Pos logit contributions
 boiling+0.012
us+0.011
 microwave+0.011
 bulldo+0.010
washer+0.010
Neg logit contributions
lets-0.006
rings-0.005
 cherish-0.004
 bracelet-0.004
 styles-0.004
Token just
Token ID655
Feature activation+8.00e-03

Pos logit contributions
us+0.021
 boiling+0.021
 microwave+0.020
 bulldo+0.019
washer+0.018
Neg logit contributions
lets-0.019
rings-0.016
 bracelet-0.016
 own-0.015
 wrist-0.015
Token in
Token ID287
Feature activation-1.26e-02

Pos logit contributions
us+0.015
 microwave+0.014
 boiling+0.014
washer+0.013
 bulldo+0.013
Neg logit contributions
lets-0.013
rings-0.012
 bracelet-0.011
 wrist-0.011
 own-0.011
Token the
Token ID262
Feature activation+5.12e-03

Pos logit contributions
lets+0.019
rings+0.016
 styles+0.015
 bracelet+0.015
 wrist+0.015
Neg logit contributions
 boiling-0.025
us-0.025
 microwave-0.024
 Farmer-0.022
washer-0.022
Token other
Token ID584
Feature activation-1.47e-02

Pos logit contributions
us+0.009
cher+0.008
 Farmer+0.008
 bulldo+0.008
 microwave+0.008
Neg logit contributions
 own-0.008
 bracelet-0.008
 color-0.007
 goal-0.007
 styles-0.007
Token<|endoftext|>
Token ID50256
Feature activation+7.05e-02

Pos logit contributions
lets+0.023
rings+0.019
 styles+0.019
packs+0.019
 wrist+0.019
Neg logit contributions
us-0.028
 boiling-0.027
 microwave-0.027
 bulldo-0.025
 Farmer-0.025
TokenOnce
Token ID7454
Feature activation-2.22e-02

Pos logit contributions
 boiling+0.135
 microwave+0.131
us+0.128
washer+0.126
 hamb+0.125
Neg logit contributions
lets-0.092
packs-0.086
 Her-0.082
 tops-0.081
rings-0.080
Token upon
Token ID2402
Feature activation+1.02e-03

Pos logit contributions
lets+0.042
 wrist+0.036
 wrists+0.036
 brace+0.036
bows+0.035
Neg logit contributions
us-0.038
 boiling-0.037
washer-0.036
 microwave-0.035
 Farmer-0.032
Token a
Token ID257
Feature activation+1.25e-02

Pos logit contributions
 boiling+0.002
 microwave+0.002
us+0.002
washer+0.001
 hamb+0.001
Neg logit contributions
lets-0.002
rings-0.002
 styles-0.002
 wrists-0.002
angles-0.002
Token time
Token ID640
Feature activation+2.02e-02

Pos logit contributions
 boiling+0.025
us+0.025
 microwave+0.025
 bulldo+0.023
 Farmer+0.023
Neg logit contributions
lets-0.018
rings-0.016
 styles-0.014
 bracelet-0.014
 wrist-0.014
Token,
Token ID11
Feature activation-1.03e-02

Pos logit contributions
us+0.034
 boiling+0.033
 microwave+0.031
 Farmer+0.031
washer+0.030
Neg logit contributions
lets-0.037
rings-0.032
 styles-0.031
 bracelet-0.030
 brace-0.030
Token make
Token ID787
Feature activation-1.43e-02

Pos logit contributions
lets+0.035
rings+0.029
 languages+0.029
bows+0.028
angles+0.028
Neg logit contributions
 boiling-0.030
 microwave-0.029
 polar-0.028
us-0.027
 pot-0.027
Token neck
Token ID7393
Feature activation-3.94e-02

Pos logit contributions
lets+0.015
rings+0.013
 styles+0.013
 bracelet+0.012
 brace+0.012
Neg logit contributions
us-0.033
 boiling-0.032
 microwave-0.031
 grizz-0.031
 bulldo-0.031
Tokenl
Token ID75
Feature activation-9.54e-03

Pos logit contributions
lets+0.067
 wrist+0.067
Mine+0.067
 cherish+0.064
packs+0.064
Neg logit contributions
us-0.064
 boiling-0.058
 microwave-0.054
 coal-0.052
washer-0.052
Tokenaces
Token ID2114
Feature activation-4.87e-02

Pos logit contributions
lets+0.009
rings+0.007
 bracelet+0.007
 wrist+0.007
 styles+0.007
Neg logit contributions
us-0.023
 boiling-0.023
 microwave-0.022
washer-0.021
 bulldo-0.021
Token and
Token ID290
Feature activation-1.61e-02

Pos logit contributions
lets+0.082
Mine+0.077
 languages+0.076
 wrists+0.076
 styles+0.075
Neg logit contributions
us-0.078
washer-0.077
 boiling-0.070
 microwave-0.069
cher-0.064
Token brace
Token ID22581
Feature activation-7.06e-02

Pos logit contributions
lets+0.020
ust+0.019
herry+0.019
inated+0.019
angles+0.019
Neg logit contributions
 microwave-0.029
 boiling-0.028
 potatoes-0.028
 meat-0.027
 coal-0.027
Tokenlets
Token ID5289
Feature activation-3.94e-02

Pos logit contributions
adays+0.067
Me+0.062
 particular+0.060
 comparison+0.060
 forget+0.060
Neg logit contributions
us-0.152
lic-0.133
cher-0.128
iers-0.116
 kit-0.115
Token with
Token ID351
Feature activation-8.61e-03

Pos logit contributions
lets+0.071
 wrist+0.065
herry+0.065
adays+0.065
 cherish+0.065
Neg logit contributions
us-0.059
 boiling-0.055
washer-0.053
 microwave-0.053
 pot-0.048
Token them
Token ID606
Feature activation-6.04e-03

Pos logit contributions
ue+0.012
lets+0.012
 wrist+0.011
adays+0.011
inated+0.011
Neg logit contributions
 boiling-0.014
us-0.014
 coal-0.014
 potatoes-0.013
 meat-0.013
Token.
Token ID13
Feature activation-2.31e-02

Pos logit contributions
lets+0.010
 cherish+0.009
herry+0.009
Mine+0.009
 wrist+0.009
Neg logit contributions
us-0.009
 boiling-0.009
 microwave-0.009
washer-0.008
 stew-0.008
Token
Token ID198
Feature activation-7.59e-03

Pos logit contributions
lets+0.034
rings+0.028
 bracelet+0.028
 brace+0.027
 styles+0.027
Neg logit contributions
us-0.046
 boiling-0.046
 microwave-0.046
 Farmer-0.043
 bulldo-0.042
Token.
Token ID13
Feature activation-6.51e-03

Pos logit contributions
lets+0.024
rings+0.021
 bracelet+0.020
 wrist+0.020
 brace+0.019
Neg logit contributions
 boiling-0.029
us-0.029
 microwave-0.028
washer-0.027
 bulldo-0.026
Token They
Token ID1119
Feature activation-1.44e-02

Pos logit contributions
lets+0.009
 bracelet+0.007
rings+0.007
 cherish+0.007
 styles+0.007
Neg logit contributions
 microwave-0.014
us-0.013
 boiling-0.013
 Farmer-0.013
 Doctor-0.013
Token make
Token ID787
Feature activation-1.88e-02

Pos logit contributions
lets+0.020
rings+0.017
adays+0.017
packs+0.017
 styles+0.016
Neg logit contributions
 boiling-0.028
 microwave-0.026
 bulldo-0.026
washer-0.025
us-0.025
Token rings
Token ID13917
Feature activation-5.03e-02

Pos logit contributions
lets+0.027
rings+0.023
 brace+0.022
 styles+0.022
 bracelet+0.022
Neg logit contributions
 boiling-0.038
us-0.038
 microwave-0.037
washer-0.035
 Farmer-0.034
Token and
Token ID290
Feature activation-2.32e-03

Pos logit contributions
lets+0.077
Mine+0.076
 wrist+0.074
 cherish+0.072
 bracelet+0.071
Neg logit contributions
us-0.084
 boiling-0.079
 coal-0.072
washer-0.071
 smoking-0.071
Token brace
Token ID22581
Feature activation-6.97e-02

Pos logit contributions
lets+0.003
rings+0.003
angles+0.003
adays+0.002
herry+0.002
Neg logit contributions
 boiling-0.005
 microwave-0.004
us-0.004
 stew-0.004
 bulldo-0.004
Tokenlets
Token ID5289
Feature activation-4.61e-02

Pos logit contributions
adays+0.052
Your+0.049
Every+0.046
 particular+0.046
Me+0.045
Neg logit contributions
us-0.163
lic-0.143
cher-0.131
 coal-0.129
 loader-0.127
Token and
Token ID290
Feature activation-5.12e-04

Pos logit contributions
lets+0.079
 wrist+0.073
Mine+0.072
adays+0.072
 cherish+0.072
Neg logit contributions
us-0.071
 boiling-0.069
 microwave-0.062
washer-0.062
 stew-0.060
Token neck
Token ID7393
Feature activation-3.29e-02

Pos logit contributions
lets+0.001
rings+0.001
angles+0.000
packs+0.000
adays+0.000
Neg logit contributions
 boiling-0.001
us-0.001
 microwave-0.001
 bulldo-0.001
 hamb-0.001
Tokenl
Token ID75
Feature activation-1.46e-03

Pos logit contributions
lets+0.049
 wrist+0.048
 cherish+0.048
 bracelet+0.046
packs+0.045
Neg logit contributions
us-0.061
 boiling-0.054
 microwave-0.051
 bulldo-0.049
 coal-0.049
Tokenaces
Token ID2114
Feature activation-4.55e-02

Pos logit contributions
lets+0.001
rings+0.001
 bracelet+0.001
 wrist+0.001
 styles+0.001
Neg logit contributions
us-0.003
 boiling-0.003
 microwave-0.003
 bulldo-0.003
washer-0.003
Token beautiful
Token ID4950
Feature activation-2.14e-03

Pos logit contributions
 boiling+0.001
us+0.001
 microwave+0.000
washer+0.000
 bulldo+0.000
Neg logit contributions
lets-0.000
rings-0.000
 wrist-0.000
 bracelet-0.000
 styles-0.000
Token neck
Token ID7393
Feature activation-2.16e-02

Pos logit contributions
lets+0.003
rings+0.003
Her+0.003
adays+0.003
angles+0.003
Neg logit contributions
 boiling-0.004
us-0.004
 microwave-0.004
 stew-0.004
 Farmer-0.003
Tokenl
Token ID75
Feature activation+2.77e-03

Pos logit contributions
lets+0.033
Mine+0.031
packs+0.030
 wrist+0.030
 cherish+0.029
Neg logit contributions
us-0.040
 boiling-0.038
washer-0.036
 microwave-0.034
 stew-0.033
Tokenaces
Token ID2114
Feature activation-4.16e-02

Pos logit contributions
us+0.006
 boiling+0.006
 microwave+0.005
washer+0.005
 bulldo+0.005
Neg logit contributions
lets-0.003
rings-0.003
 wrist-0.003
 bracelet-0.003
Her-0.003
Token and
Token ID290
Feature activation-1.81e-03

Pos logit contributions
lets+0.064
Her+0.058
 styles+0.058
Mine+0.057
 wrist+0.057
Neg logit contributions
us-0.072
washer-0.070
 boiling-0.067
 microwave-0.063
 bulldo-0.059
Token brace
Token ID22581
Feature activation-6.97e-02

Pos logit contributions
lets+0.002
packs+0.002
ue+0.002
Her+0.002
ust+0.002
Neg logit contributions
 potatoes-0.003
us-0.003
 boiling-0.003
 microwave-0.003
 grizz-0.003
Tokenlets
Token ID5289
Feature activation-3.53e-02

Pos logit contributions
adays+0.077
 forget+0.071
 particular+0.070
 comparison+0.068
Me+0.067
Neg logit contributions
us-0.144
lic-0.122
cher-0.121
d-0.105
iers-0.104
Token.
Token ID13
Feature activation-1.25e-02

Pos logit contributions
lets+0.057
adays+0.055
 wrist+0.053
Mine+0.053
 bracelet+0.052
Neg logit contributions
us-0.058
 boiling-0.052
washer-0.050
 microwave-0.048
 stew-0.047
Token You
Token ID921
Feature activation-6.82e-03

Pos logit contributions
lets+0.017
 bracelet+0.016
packs+0.015
rings+0.014
 brace+0.014
Neg logit contributions
us-0.023
 microwave-0.023
 boiling-0.023
 Farmer-0.022
 grizz-0.022
Token sorted
Token ID23243
Feature activation-3.47e-02

Pos logit contributions
lets+0.010
 wrist+0.009
Her+0.009
packs+0.009
 styles+0.009
Neg logit contributions
us-0.012
washer-0.011
 microwave-0.011
 boiling-0.011
 pot-0.011
Token the
Token ID262
Feature activation+1.71e-02

Pos logit contributions
 wrist+0.056
lets+0.056
 cherish+0.055
ue+0.053
Her+0.052
Neg logit contributions
us-0.060
 boiling-0.048
 potatoes-0.047
 coal-0.046
 oil-0.045
Token neck
Token ID7393
Feature activation-1.19e-02

Pos logit contributions
lets+0.002
rings+0.002
 bracelet+0.002
 wrist+0.002
 styles+0.002
Neg logit contributions
us-0.004
 boiling-0.004
 microwave-0.004
 bulldo-0.003
washer-0.003
Tokenl
Token ID75
Feature activation-1.29e-02

Pos logit contributions
lets+0.018
rings+0.015
 styles+0.015
 wrist+0.015
 bracelet+0.014
Neg logit contributions
us-0.023
 boiling-0.023
 microwave-0.022
 bulldo-0.020
washer-0.020
Tokenaces
Token ID2114
Feature activation-4.42e-02

Pos logit contributions
lets+0.019
 styles+0.017
 bracelet+0.017
rings+0.017
 wrist+0.017
Neg logit contributions
us-0.024
 boiling-0.022
 microwave-0.022
 bulldo-0.021
washer-0.020
Token,
Token ID11
Feature activation+6.64e-03

Pos logit contributions
lets+0.067
 styles+0.061
 cherish+0.060
 languages+0.059
 wrist+0.058
Neg logit contributions
us-0.080
 boiling-0.072
washer-0.071
 microwave-0.069
 bulldo-0.068
Token grape
Token ID30777
Feature activation+1.01e-02

Pos logit contributions
us+0.013
 boiling+0.013
 microwave+0.013
washer+0.012
 bulldo+0.012
Neg logit contributions
lets-0.010
rings-0.008
 bracelet-0.008
 styles-0.008
 wrist-0.008
Token brace
Token ID22581
Feature activation-6.87e-02

Pos logit contributions
us+0.019
 boiling+0.019
 microwave+0.018
 bulldo+0.017
washer+0.017
Neg logit contributions
lets-0.016
rings-0.013
 wrist-0.013
 styles-0.013
 bracelet-0.012
Tokenlets
Token ID5289
Feature activation-3.61e-02

Pos logit contributions
adays+0.059
 forget+0.055
 particular+0.052
 cherish+0.050
 common+0.048
Neg logit contributions
us-0.153
lic-0.132
cher-0.131
 grizz-0.126
d-0.119
Token,
Token ID11
Feature activation+8.09e-03

Pos logit contributions
lets+0.057
 styles+0.050
 cherish+0.049
adays+0.049
 wrist+0.049
Neg logit contributions
us-0.063
 boiling-0.061
 microwave-0.058
washer-0.056
 stew-0.054
Token and
Token ID290
Feature activation+8.56e-03

Pos logit contributions
 boiling+0.017
 microwave+0.016
us+0.016
 bulldo+0.015
 hamb+0.015
Neg logit contributions
lets-0.010
packs-0.009
rings-0.009
angles-0.008
adays-0.008
Token grape
Token ID30777
Feature activation+8.13e-03

Pos logit contributions
 boiling+0.016
us+0.016
 microwave+0.015
 Farmer+0.015
 hamb+0.015
Neg logit contributions
lets-0.012
rings-0.011
packs-0.010
angles-0.010
 wrist-0.010
Token crown
Token ID12389
Feature activation-2.53e-02

Pos logit contributions
us+0.015
 boiling+0.014
 microwave+0.014
washer+0.013
 bulldo+0.013
Neg logit contributions
lets-0.012
rings-0.011
 wrist-0.011
 cherish-0.010
packs-0.010
Token shapes
Token ID15268
Feature activation-3.68e-02

Pos logit contributions
lets+0.015
rings+0.013
Her+0.013
ue+0.012
 bracelet+0.012
Neg logit contributions
 boiling-0.028
 microwave-0.026
us-0.026
washer-0.025
 smoking-0.025
Token.
Token ID13
Feature activation-1.56e-02

Pos logit contributions
lets+0.065
 wrist+0.058
rings+0.057
herry+0.056
 cherish+0.055
Neg logit contributions
 boiling-0.061
us-0.058
 microwave-0.055
washer-0.055
 stew-0.050
Token They
Token ID1119
Feature activation-1.51e-02

Pos logit contributions
lets+0.022
rings+0.018
 bracelet+0.017
 styles+0.017
 wrist+0.017
Neg logit contributions
us-0.032
 boiling-0.032
 microwave-0.032
 Farmer-0.030
 bulldo-0.029
Token can
Token ID460
Feature activation-2.46e-02

Pos logit contributions
lets+0.021
rings+0.017
packs+0.017
Her+0.017
 wrist+0.016
Neg logit contributions
 boiling-0.030
 microwave-0.029
washer-0.028
 pot-0.027
 bulldo-0.026
Token make
Token ID787
Feature activation-2.61e-02

Pos logit contributions
lets+0.043
packs+0.040
 mattered+0.035
 languages+0.035
rings+0.034
Neg logit contributions
 boiling-0.040
 microwave-0.039
washer-0.037
 drill-0.036
 smoking-0.035
Token towers
Token ID18028
Feature activation-6.62e-02

Pos logit contributions
lets+0.042
rings+0.036
 wrist+0.035
 cherish+0.034
 bracelet+0.033
Neg logit contributions
 boiling-0.048
 microwave-0.046
us-0.046
washer-0.042
 smoking-0.041
Token,
Token ID11
Feature activation-1.60e-02

Pos logit contributions
lets+0.113
Mine+0.111
 wrist+0.106
 wrists+0.105
inated+0.104
Neg logit contributions
washer-0.095
 boiling-0.095
 microwave-0.093
us-0.090
 working-0.090
Token bridges
Token ID19432
Feature activation-6.60e-02

Pos logit contributions
lets+0.027
 wrist+0.023
rings+0.022
 bracelet+0.022
 cherish+0.022
Neg logit contributions
 boiling-0.028
 microwave-0.027
 smoking-0.025
 drill-0.024
washer-0.024
Token,
Token ID11
Feature activation-1.97e-02

Pos logit contributions
lets+0.117
rings+0.105
 bracelet+0.103
 wrist+0.101
herry+0.100
Neg logit contributions
 boiling-0.097
 microwave-0.096
us-0.095
washer-0.091
 smoking-0.087
Token houses
Token ID7777
Feature activation-4.71e-02

Pos logit contributions
lets+0.034
 wrist+0.029
 bracelet+0.029
rings+0.029
 cherish+0.028
Neg logit contributions
 boiling-0.034
 microwave-0.033
us-0.030
 smoking-0.030
washer-0.029
Token and
Token ID290
Feature activation-2.63e-02

Pos logit contributions
lets+0.080
rings+0.071
 bracelet+0.069
 wrist+0.069
 cherish+0.066
Neg logit contributions
 boiling-0.077
us-0.077
 microwave-0.075
washer-0.069
 smoking-0.068
Token They
Token ID1119
Feature activation-1.51e-02

Pos logit contributions
lets+0.022
rings+0.018
 bracelet+0.017
 styles+0.017
 wrist+0.017
Neg logit contributions
us-0.032
 boiling-0.032
 microwave-0.032
 Farmer-0.030
 bulldo-0.029
Token can
Token ID460
Feature activation-2.46e-02

Pos logit contributions
lets+0.021
rings+0.017
packs+0.017
Her+0.017
 wrist+0.016
Neg logit contributions
 boiling-0.030
 microwave-0.029
washer-0.028
 pot-0.027
 bulldo-0.026
Token make
Token ID787
Feature activation-2.61e-02

Pos logit contributions
lets+0.043
packs+0.040
 mattered+0.035
 languages+0.035
rings+0.034
Neg logit contributions
 boiling-0.040
 microwave-0.039
washer-0.037
 drill-0.036
 smoking-0.035
Token towers
Token ID18028
Feature activation-6.62e-02

Pos logit contributions
lets+0.042
rings+0.036
 wrist+0.035
 cherish+0.034
 bracelet+0.033
Neg logit contributions
 boiling-0.048
 microwave-0.046
us-0.046
washer-0.042
 smoking-0.041
Token,
Token ID11
Feature activation-1.60e-02

Pos logit contributions
lets+0.113
Mine+0.111
 wrist+0.106
 wrists+0.105
inated+0.104
Neg logit contributions
washer-0.095
 boiling-0.095
 microwave-0.093
us-0.090
 working-0.090
Token bridges
Token ID19432
Feature activation-6.60e-02

Pos logit contributions
lets+0.027
 wrist+0.023
rings+0.022
 bracelet+0.022
 cherish+0.022
Neg logit contributions
 boiling-0.028
 microwave-0.027
 smoking-0.025
 drill-0.024
washer-0.024
Token,
Token ID11
Feature activation-1.97e-02

Pos logit contributions
lets+0.117
rings+0.105
 bracelet+0.103
 wrist+0.101
herry+0.100
Neg logit contributions
 boiling-0.097
 microwave-0.096
us-0.095
washer-0.091
 smoking-0.087
Token houses
Token ID7777
Feature activation-4.71e-02

Pos logit contributions
lets+0.034
 wrist+0.029
 bracelet+0.029
rings+0.029
 cherish+0.028
Neg logit contributions
 boiling-0.034
 microwave-0.033
us-0.030
 smoking-0.030
washer-0.029
Token and
Token ID290
Feature activation-2.63e-02

Pos logit contributions
lets+0.080
rings+0.071
 bracelet+0.069
 wrist+0.069
 cherish+0.066
Neg logit contributions
 boiling-0.077
us-0.077
 microwave-0.075
washer-0.069
 smoking-0.068
Token more
Token ID517
Feature activation-1.07e-02

Pos logit contributions
lets+0.044
rings+0.038
 wrist+0.037
 bracelet+0.037
 cherish+0.035
Neg logit contributions
 boiling-0.046
 microwave-0.044
us-0.043
 bulldo-0.041
washer-0.040
Token with
Token ID351
Feature activation-1.56e-02

Pos logit contributions
lets+0.016
rings+0.014
 wrist+0.013
 bracelet+0.013
 cherish+0.013
Neg logit contributions
 boiling-0.020
us-0.020
 microwave-0.020
washer-0.018
 bulldo-0.018
Token blocks
Token ID7021
Feature activation-1.62e-02

Pos logit contributions
lets+0.003
rings+0.003
 wrist+0.002
 bracelet+0.002
packs+0.002
Neg logit contributions
 boiling-0.004
us-0.004
 microwave-0.004
 Farmer-0.004
 bulldo-0.004
Token.
Token ID13
Feature activation-1.25e-02

Pos logit contributions
lets+0.025
rings+0.022
 wrist+0.022
 bracelet+0.021
 cherish+0.021
Neg logit contributions
 boiling-0.029
us-0.029
 microwave-0.028
washer-0.027
 bulldo-0.026
Token They
Token ID1119
Feature activation-1.10e-02

Pos logit contributions
lets+0.017
rings+0.014
 bracelet+0.013
 styles+0.013
 wrist+0.013
Neg logit contributions
us-0.027
 boiling-0.027
 microwave-0.026
 bulldo-0.024
washer-0.024
Token make
Token ID787
Feature activation-2.68e-02

Pos logit contributions
lets+0.015
rings+0.013
 wrist+0.012
Her+0.012
packs+0.012
Neg logit contributions
 boiling-0.022
 microwave-0.022
washer-0.021
us-0.021
 bulldo-0.021
Token their
Token ID511
Feature activation-2.79e-02

Pos logit contributions
lets+0.041
rings+0.035
 wrist+0.033
 bracelet+0.033
 cherish+0.032
Neg logit contributions
 boiling-0.052
us-0.051
 microwave-0.050
washer-0.047
 bulldo-0.046
Token towers
Token ID18028
Feature activation-6.52e-02

Pos logit contributions
lets+0.041
rings+0.038
Her+0.035
 bracelet+0.032
adays+0.032
Neg logit contributions
 microwave-0.053
 boiling-0.053
washer-0.049
 bulldo-0.049
 burgers-0.048
Token very
Token ID845
Feature activation-6.06e-03

Pos logit contributions
lets+0.112
Mine+0.104
rings+0.101
 wrists+0.099
 wrist+0.098
Neg logit contributions
 boiling-0.099
 microwave-0.095
washer-0.095
 working-0.093
 pot-0.088
Token tall
Token ID7331
Feature activation-1.73e-02

Pos logit contributions
lets+0.011
rings+0.010
 wrist+0.009
 bracelet+0.009
packs+0.009
Neg logit contributions
 boiling-0.010
us-0.010
washer-0.010
 microwave-0.010
 bulldo-0.009
Token and
Token ID290
Feature activation+2.04e-03

Pos logit contributions
lets+0.026
rings+0.023
 bracelet+0.021
Her+0.021
 wrist+0.021
Neg logit contributions
 boiling-0.033
 microwave-0.032
us-0.032
 bulldo-0.030
washer-0.030
Token strong
Token ID1913
Feature activation-3.91e-03

Pos logit contributions
 boiling+0.004
us+0.004
 microwave+0.004
washer+0.003
 bulldo+0.003
Neg logit contributions
lets-0.003
rings-0.003
 bracelet-0.003
 wrist-0.003
 styles-0.003
Token.
Token ID13
Feature activation-1.61e-02

Pos logit contributions
lets+0.005
rings+0.005
 styles+0.004
 bracelet+0.004
 wrist+0.004
Neg logit contributions
us-0.008
 boiling-0.008
 microwave-0.008
washer-0.007
 bulldo-0.007
Token,
Token ID11
Feature activation+1.66e-02

Pos logit contributions
lets+0.056
Mine+0.056
 cherish+0.055
 styles+0.054
 own+0.053
Neg logit contributions
 boiling-0.047
us-0.045
 smoking-0.043
 microwave-0.042
 polar-0.040
Token neck
Token ID7393
Feature activation-2.24e-02

Pos logit contributions
 boiling+0.029
 microwave+0.028
us+0.027
 smoking+0.026
 polar+0.025
Neg logit contributions
lets-0.026
angles-0.023
herry-0.022
adays-0.022
 cherish-0.022
Tokenl
Token ID75
Feature activation+4.99e-03

Pos logit contributions
lets+0.035
 cherish+0.033
 wrist+0.032
packs+0.032
Mine+0.031
Neg logit contributions
us-0.040
 boiling-0.037
 microwave-0.035
 bulldo-0.033
washer-0.033
Tokenaces
Token ID2114
Feature activation-4.35e-02

Pos logit contributions
us+0.011
 boiling+0.011
 microwave+0.011
 bulldo+0.010
 grizz+0.010
Neg logit contributions
lets-0.005
 bracelet-0.005
 wrist-0.005
 styles-0.005
rings-0.005
Token,
Token ID11
Feature activation+2.71e-03

Pos logit contributions
lets+0.073
Mine+0.070
 styles+0.070
 languages+0.069
 cherish+0.067
Neg logit contributions
us-0.068
 boiling-0.064
washer-0.060
 microwave-0.057
 bulldo-0.056
Token brace
Token ID22581
Feature activation-6.48e-02

Pos logit contributions
 boiling+0.005
 microwave+0.005
us+0.005
 potatoes+0.005
 coal+0.005
Neg logit contributions
lets-0.003
Mine-0.003
herry-0.003
ust-0.003
 mattered-0.003
Tokenlets
Token ID5289
Feature activation-4.09e-02

Pos logit contributions
adays+0.055
Mine+0.054
Every+0.053
Me+0.050
Your+0.050
Neg logit contributions
us-0.149
lic-0.125
cher-0.121
 grizz-0.115
 polar-0.114
Token and
Token ID290
Feature activation+4.76e-03

Pos logit contributions
lets+0.065
Mine+0.060
 cherish+0.059
 styles+0.059
 wrist+0.058
Neg logit contributions
us-0.070
 boiling-0.067
 microwave-0.061
washer-0.061
 pot-0.058
Token ear
Token ID1027
Feature activation-3.15e-02

Pos logit contributions
us+0.010
 boiling+0.009
 microwave+0.009
 hamb+0.009
 potatoes+0.009
Neg logit contributions
Mine-0.004
lets-0.004
angles-0.004
inated-0.004
adays-0.004
Tokenrings
Token ID33173
Feature activation-3.44e-02

Pos logit contributions
Mine+0.023
 goal+0.022
 wrist+0.021
adays+0.020
 cherish+0.020
Neg logit contributions
us-0.079
 boiling-0.067
 potatoes-0.066
 coal-0.066
 Pig-0.066
Token.
Token ID13
Feature activation-1.97e-02

Pos logit contributions
lets+0.059
Mine+0.054
herry+0.052
 wrist+0.052
 bracelet+0.051
Neg logit contributions
us-0.056
 boiling-0.053
washer-0.050
 microwave-0.050
 stew-0.047
Token sizes
Token ID10620
Feature activation-3.37e-02

Pos logit contributions
lets+0.015
rings+0.013
Her+0.013
packs+0.012
 bracelet+0.012
Neg logit contributions
 boiling-0.031
 microwave-0.030
us-0.030
 hamb-0.029
 burgers-0.028
Token.
Token ID13
Feature activation-1.69e-02

Pos logit contributions
lets+0.061
 wrist+0.055
adays+0.055
herry+0.055
rings+0.055
Neg logit contributions
 boiling-0.051
us-0.050
washer-0.047
 microwave-0.046
 pot-0.041
Token They
Token ID1119
Feature activation-1.56e-02

Pos logit contributions
lets+0.033
 bracelet+0.032
 brace+0.029
rings+0.028
 cherish+0.027
Neg logit contributions
 microwave-0.026
 Farmer-0.025
 boiling-0.024
 Doctor-0.022
us-0.022
Token can
Token ID460
Feature activation-2.26e-02

Pos logit contributions
lets+0.021
rings+0.018
packs+0.017
 wrist+0.017
 bracelet+0.016
Neg logit contributions
 boiling-0.032
 microwave-0.031
washer-0.030
us-0.030
 bulldo-0.029
Token make
Token ID787
Feature activation-2.41e-02

Pos logit contributions
lets+0.042
packs+0.038
rings+0.036
 bracelet+0.035
 languages+0.034
Neg logit contributions
 boiling-0.037
 microwave-0.035
washer-0.034
us-0.033
 bulldo-0.033
Token towers
Token ID18028
Feature activation-6.40e-02

Pos logit contributions
lets+0.036
rings+0.031
 bracelet+0.029
 wrist+0.029
 styles+0.029
Neg logit contributions
 boiling-0.047
us-0.047
 microwave-0.046
washer-0.042
 bulldo-0.042
Token,
Token ID11
Feature activation-1.69e-02

Pos logit contributions
lets+0.105
Mine+0.101
 wrist+0.099
 cherish+0.097
 wrists+0.097
Neg logit contributions
washer-0.097
 boiling-0.096
us-0.095
 microwave-0.094
 working-0.089
Token houses
Token ID7777
Feature activation-4.64e-02

Pos logit contributions
lets+0.028
rings+0.024
 wrist+0.024
 bracelet+0.024
packs+0.024
Neg logit contributions
 boiling-0.029
 microwave-0.029
us-0.027
 bulldo-0.026
washer-0.026
Token,
Token ID11
Feature activation-2.05e-02

Pos logit contributions
lets+0.077
rings+0.068
 wrist+0.067
 bracelet+0.067
 cherish+0.066
Neg logit contributions
 boiling-0.076
us-0.076
 microwave-0.074
washer-0.070
 bulldo-0.069
Token trains
Token ID13404
Feature activation-3.21e-02

Pos logit contributions
lets+0.035
rings+0.030
 wrist+0.030
 bracelet+0.030
 cherish+0.029
Neg logit contributions
 boiling-0.035
 microwave-0.035
us-0.033
 bulldo-0.031
washer-0.031
Token and
Token ID290
Feature activation-3.26e-02

Pos logit contributions
lets+0.045
rings+0.038
 bracelet+0.036
 wrist+0.036
 styles+0.035
Neg logit contributions
us-0.065
 boiling-0.065
 microwave-0.064
washer-0.060
 bulldo-0.059
Token They
Token ID1119
Feature activation-1.37e-02

Pos logit contributions
lets+0.026
 bracelet+0.024
 brace+0.023
rings+0.022
 cherish+0.021
Neg logit contributions
 microwave-0.026
 Farmer-0.025
 boiling-0.024
us-0.022
 Doctor-0.022
Token can
Token ID460
Feature activation-2.36e-02

Pos logit contributions
lets+0.018
rings+0.015
packs+0.015
 wrist+0.014
adays+0.014
Neg logit contributions
 boiling-0.029
 microwave-0.028
washer-0.027
 bulldo-0.026
us-0.026
Token make
Token ID787
Feature activation-2.53e-02

Pos logit contributions
lets+0.044
packs+0.040
adays+0.037
rings+0.036
 languages+0.036
Neg logit contributions
 boiling-0.036
 microwave-0.036
washer-0.034
 drill-0.033
 bulldo-0.032
Token houses
Token ID7777
Feature activation-4.91e-02

Pos logit contributions
lets+0.041
rings+0.036
 cherish+0.034
 wrist+0.034
 bracelet+0.033
Neg logit contributions
 boiling-0.046
 microwave-0.045
us-0.044
washer-0.040
 bulldo-0.040
Token,
Token ID11
Feature activation-2.16e-02

Pos logit contributions
lets+0.081
 wrist+0.073
 cherish+0.072
Mine+0.072
rings+0.072
Neg logit contributions
 boiling-0.075
 microwave-0.074
washer-0.073
us-0.072
 working-0.071
Token towers
Token ID18028
Feature activation-6.39e-02

Pos logit contributions
lets+0.037
rings+0.031
 cherish+0.031
 wrist+0.031
packs+0.031
Neg logit contributions
 microwave-0.036
 boiling-0.036
 drill-0.032
 bulldo-0.032
 smoking-0.032
Token,
Token ID11
Feature activation-2.05e-02

Pos logit contributions
lets+0.120
Mine+0.113
 bracelet+0.112
 cherish+0.111
 mattered+0.111
Neg logit contributions
 boiling-0.078
 microwave-0.078
us-0.078
washer-0.075
 working-0.073
Token cars
Token ID5006
Feature activation-3.22e-02

Pos logit contributions
lets+0.036
 wrist+0.031
 cherish+0.031
rings+0.031
 bracelet+0.031
Neg logit contributions
 microwave-0.033
 boiling-0.033
us-0.030
 bulldo-0.029
 smoking-0.029
Token and
Token ID290
Feature activation-2.92e-02

Pos logit contributions
lets+0.049
rings+0.043
 bracelet+0.041
 wrist+0.041
 cherish+0.040
Neg logit contributions
us-0.060
 boiling-0.059
 microwave-0.059
washer-0.055
 bulldo-0.055
Token animals
Token ID4695
Feature activation-1.79e-02

Pos logit contributions
lets+0.050
rings+0.043
 wrist+0.042
 bracelet+0.041
 cherish+0.041
Neg logit contributions
 boiling-0.049
 microwave-0.048
us-0.046
 bulldo-0.045
washer-0.044
Token with
Token ID351
Feature activation-1.22e-02

Pos logit contributions
lets+0.027
rings+0.023
 bracelet+0.022
 styles+0.021
 wrist+0.021
Neg logit contributions
us-0.035
 boiling-0.035
 microwave-0.034
washer-0.032
 bulldo-0.031
Token shapes
Token ID15268
Feature activation-3.57e-02

Pos logit contributions
lets+0.014
rings+0.012
 bracelet+0.012
packs+0.012
 wrist+0.012
Neg logit contributions
 boiling-0.033
us-0.032
 microwave-0.031
washer-0.031
 hamb-0.030
Token.
Token ID13
Feature activation-1.51e-02

Pos logit contributions
lets+0.062
 wrist+0.055
rings+0.054
 cherish+0.054
herry+0.053
Neg logit contributions
 boiling-0.059
us-0.057
 microwave-0.054
washer-0.054
 stew-0.049
Token They
Token ID1119
Feature activation-1.83e-02

Pos logit contributions
lets+0.021
 bracelet+0.017
rings+0.017
 brace+0.016
 cherish+0.016
Neg logit contributions
 microwave-0.033
 boiling-0.032
 Farmer-0.031
 Doctor-0.029
us-0.029
Token could
Token ID714
Feature activation-2.33e-02

Pos logit contributions
lets+0.027
rings+0.024
adays+0.024
 wrist+0.024
 bracelet+0.023
Neg logit contributions
 boiling-0.034
 microwave-0.032
washer-0.032
us-0.032
 bulldo-0.031
Token make
Token ID787
Feature activation-2.49e-02

Pos logit contributions
lets+0.046
rings+0.039
packs+0.039
adays+0.038
 mattered+0.038
Neg logit contributions
 microwave-0.035
 boiling-0.035
 drill-0.032
washer-0.032
 loader-0.031
Token towers
Token ID18028
Feature activation-6.37e-02

Pos logit contributions
lets+0.039
rings+0.034
 wrist+0.033
 cherish+0.032
 bracelet+0.032
Neg logit contributions
 boiling-0.046
 microwave-0.046
us-0.045
washer-0.041
 bulldo-0.041
Token,
Token ID11
Feature activation-1.59e-02

Pos logit contributions
lets+0.108
Mine+0.103
 wrist+0.101
 cherish+0.100
 wrists+0.099
Neg logit contributions
 boiling-0.093
 microwave-0.092
washer-0.092
us-0.090
 working-0.087
Token bridges
Token ID19432
Feature activation-6.36e-02

Pos logit contributions
lets+0.027
 wrist+0.024
 cherish+0.023
rings+0.023
 bracelet+0.023
Neg logit contributions
 boiling-0.027
 microwave-0.027
us-0.024
 bulldo-0.024
washer-0.024
Token,
Token ID11
Feature activation-1.94e-02

Pos logit contributions
lets+0.111
rings+0.098
 bracelet+0.097
 cherish+0.096
 wrist+0.095
Neg logit contributions
 boiling-0.097
 microwave-0.096
us-0.095
washer-0.091
 smoking-0.089
Token houses
Token ID7777
Feature activation-4.56e-02

Pos logit contributions
lets+0.033
 wrist+0.029
 cherish+0.028
rings+0.028
 bracelet+0.028
Neg logit contributions
 boiling-0.033
 microwave-0.033
us-0.030
 bulldo-0.029
washer-0.029
Token,
Token ID11
Feature activation-2.23e-02

Pos logit contributions
lets+0.072
rings+0.062
 bracelet+0.060
 wrist+0.060
 cherish+0.059
Neg logit contributions
 boiling-0.083
us-0.082
 microwave-0.080
 bulldo-0.075
washer-0.075
Token kinds
Token ID6982
Feature activation+3.16e-03

Pos logit contributions
lets+0.006
adays+0.005
 wrist+0.005
ue+0.005
rings+0.005
Neg logit contributions
us-0.005
 boiling-0.005
 microwave-0.005
washer-0.004
 coal-0.004
Token of
Token ID286
Feature activation+6.33e-03

Pos logit contributions
us+0.006
 boiling+0.006
 microwave+0.005
washer+0.005
 bulldo+0.005
Neg logit contributions
lets-0.005
rings-0.005
 wrist-0.004
 styles-0.004
 bracelet-0.004
Token ear
Token ID1027
Feature activation-3.91e-02

Pos logit contributions
us+0.013
 boiling+0.013
 microwave+0.012
 bulldo+0.011
 hamb+0.011
Neg logit contributions
lets-0.009
rings-0.008
 wrist-0.007
packs-0.007
adays-0.007
Tokenrings
Token ID33173
Feature activation-3.09e-02

Pos logit contributions
Mine+0.037
 common+0.036
 goal+0.031
ue+0.031
 cherish+0.030
Neg logit contributions
us-0.098
washer-0.082
 coal-0.081
 boiling-0.081
 potatoes-0.079
Token and
Token ID290
Feature activation-1.13e-02

Pos logit contributions
lets+0.049
rings+0.041
 styles+0.041
 wrist+0.041
 bracelet+0.040
Neg logit contributions
us-0.058
 boiling-0.057
 microwave-0.054
washer-0.053
 bulldo-0.050
Token brace
Token ID22581
Feature activation-6.37e-02

Pos logit contributions
lets+0.017
ue+0.017
bows+0.016
packs+0.016
adays+0.016
Neg logit contributions
 boiling-0.019
us-0.019
 potatoes-0.019
 microwave-0.018
 coal-0.018
Tokenlets
Token ID5289
Feature activation-4.35e-02

Pos logit contributions
adays+0.047
Me+0.045
Your+0.044
 forget+0.044
Mine+0.044
Neg logit contributions
us-0.156
lic-0.134
 polar-0.128
cher-0.126
 grizz-0.124
Token,
Token ID11
Feature activation-4.76e-03

Pos logit contributions
lets+0.072
 wrist+0.063
 styles+0.061
adays+0.061
 bracelet+0.061
Neg logit contributions
us-0.078
 boiling-0.074
washer-0.070
 microwave-0.069
 stew-0.064
Token and
Token ID290
Feature activation-1.18e-02

Pos logit contributions
lets+0.008
rings+0.007
packs+0.007
 styles+0.006
 wrist+0.006
Neg logit contributions
 boiling-0.009
 microwave-0.008
us-0.008
 hamb-0.008
 bulldo-0.007
Token he
Token ID339
Feature activation+3.59e-03

Pos logit contributions
lets+0.021
ue+0.018
 wrist+0.018
rings+0.018
packs+0.018
Neg logit contributions
 boiling-0.019
 microwave-0.018
us-0.018
 hamb-0.017
washer-0.017
Token really
Token ID1107
Feature activation-1.77e-02

Pos logit contributions
us+0.006
washer+0.006
 boiling+0.006
 microwave+0.005
 bulldo+0.005
Neg logit contributions
lets-0.006
adays-0.006
packs-0.005
ue-0.005
rings-0.005
Token him
Token ID683
Feature activation+2.71e-03

Pos logit contributions
lets+0.003
rings+0.003
 bracelet+0.003
 styles+0.003
 wrist+0.003
Neg logit contributions
us-0.005
 boiling-0.005
 microwave-0.005
washer-0.005
 bulldo-0.005
Token.
Token ID13
Feature activation+8.44e-03

Pos logit contributions
us+0.005
 boiling+0.005
 microwave+0.005
washer+0.005
 bulldo+0.005
Neg logit contributions
lets-0.004
rings-0.003
 bracelet-0.003
 styles-0.003
 wrist-0.003
Token But
Token ID887
Feature activation-1.49e-02

Pos logit contributions
 boiling+0.018
us+0.018
 microwave+0.017
washer+0.016
 bulldo+0.016
Neg logit contributions
lets-0.011
rings-0.009
 wrist-0.009
 styles-0.009
 tops-0.009
Token to
Token ID284
Feature activation-3.46e-02

Pos logit contributions
lets+0.021
rings+0.018
 bracelet+0.017
 styles+0.017
 wrist+0.016
Neg logit contributions
 boiling-0.031
us-0.031
 microwave-0.030
washer-0.028
 bulldo-0.028
Token no
Token ID645
Feature activation-3.30e-03

Pos logit contributions
lets+0.051
rings+0.043
 bracelet+0.039
packs+0.039
 wrist+0.039
Neg logit contributions
 boiling-0.071
us-0.070
 microwave-0.067
 Farmer-0.065
 bulldo-0.063
Token avail
Token ID29107
Feature activation-6.36e-02

Pos logit contributions
lets+0.003
rings+0.003
 wrist+0.002
Her+0.002
 bracelet+0.002
Neg logit contributions
 boiling-0.008
us-0.008
 microwave-0.008
 Farmer-0.008
 goblin-0.007
Token,
Token ID11
Feature activation-4.09e-03

Pos logit contributions
 bracelet+0.111
 brace+0.109
lets+0.108
adays+0.105
 wrists+0.104
Neg logit contributions
us-0.104
washer-0.086
 boiling-0.085
 polar-0.081
 Farmer-0.081
Token they
Token ID484
Feature activation+9.76e-04

Pos logit contributions
lets+0.006
rings+0.005
 bracelet+0.005
 styles+0.005
 wrist+0.004
Neg logit contributions
us-0.008
 boiling-0.008
 microwave-0.008
washer-0.008
 bulldo-0.008
Token were
Token ID547
Feature activation-1.82e-03

Pos logit contributions
us+0.002
 boiling+0.002
washer+0.002
 microwave+0.002
 bulldo+0.002
Neg logit contributions
lets-0.002
 styles-0.001
 wrist-0.001
rings-0.001
 brace-0.001
Token too
Token ID1165
Feature activation+2.09e-02

Pos logit contributions
lets+0.003
rings+0.003
 bracelet+0.003
 styles+0.003
 wrist+0.003
Neg logit contributions
 boiling-0.003
us-0.003
 microwave-0.003
washer-0.003
 bulldo-0.003
Token scared
Token ID12008
Feature activation+5.96e-03

Pos logit contributions
us+0.032
 boiling+0.032
 microwave+0.030
washer+0.028
 bulldo+0.028
Neg logit contributions
lets-0.041
rings-0.036
 bracelet-0.035
 styles-0.035
 wrist-0.034
Token They
Token ID1119
Feature activation-1.83e-02

Pos logit contributions
lets+0.021
 bracelet+0.017
rings+0.017
 brace+0.016
 cherish+0.016
Neg logit contributions
 microwave-0.033
 boiling-0.032
 Farmer-0.031
 Doctor-0.029
us-0.029
Token could
Token ID714
Feature activation-2.33e-02

Pos logit contributions
lets+0.027
rings+0.024
adays+0.024
 wrist+0.024
 bracelet+0.023
Neg logit contributions
 boiling-0.034
 microwave-0.032
washer-0.032
us-0.032
 bulldo-0.031
Token make
Token ID787
Feature activation-2.49e-02

Pos logit contributions
lets+0.046
rings+0.039
packs+0.039
adays+0.038
 mattered+0.038
Neg logit contributions
 microwave-0.035
 boiling-0.035
 drill-0.032
washer-0.032
 loader-0.031
Token towers
Token ID18028
Feature activation-6.37e-02

Pos logit contributions
lets+0.039
rings+0.034
 wrist+0.033
 cherish+0.032
 bracelet+0.032
Neg logit contributions
 boiling-0.046
 microwave-0.046
us-0.045
washer-0.041
 bulldo-0.041
Token,
Token ID11
Feature activation-1.59e-02

Pos logit contributions
lets+0.108
Mine+0.103
 wrist+0.101
 cherish+0.100
 wrists+0.099
Neg logit contributions
 boiling-0.093
 microwave-0.092
washer-0.092
us-0.090
 working-0.087
Token bridges
Token ID19432
Feature activation-6.36e-02

Pos logit contributions
lets+0.027
 wrist+0.024
 cherish+0.023
rings+0.023
 bracelet+0.023
Neg logit contributions
 boiling-0.027
 microwave-0.027
us-0.024
 bulldo-0.024
washer-0.024
Token,
Token ID11
Feature activation-1.94e-02

Pos logit contributions
lets+0.111
rings+0.098
 bracelet+0.097
 cherish+0.096
 wrist+0.095
Neg logit contributions
 boiling-0.097
 microwave-0.096
us-0.095
washer-0.091
 smoking-0.089
Token houses
Token ID7777
Feature activation-4.56e-02

Pos logit contributions
lets+0.033
 wrist+0.029
 cherish+0.028
rings+0.028
 bracelet+0.028
Neg logit contributions
 boiling-0.033
 microwave-0.033
us-0.030
 bulldo-0.029
washer-0.029
Token,
Token ID11
Feature activation-2.23e-02

Pos logit contributions
lets+0.072
rings+0.062
 bracelet+0.060
 wrist+0.060
 cherish+0.059
Neg logit contributions
 boiling-0.083
us-0.082
 microwave-0.080
 bulldo-0.075
washer-0.075
Token and
Token ID290
Feature activation-3.16e-02

Pos logit contributions
lets+0.037
 wrist+0.032
rings+0.032
 cherish+0.031
 bracelet+0.031
Neg logit contributions
 boiling-0.039
 microwave-0.039
us-0.037
 bulldo-0.035
washer-0.035
Token anything
Token ID1997
Feature activation-1.99e-02

Pos logit contributions
lets+0.052
rings+0.044
 wrist+0.043
 bracelet+0.042
packs+0.041
Neg logit contributions
 boiling-0.057
 microwave-0.055
us-0.054
washer-0.051
 bulldo-0.050
Token blocks
Token ID7021
Feature activation-3.32e-02

Pos logit contributions
lets+0.002
rings+0.001
adays+0.001
 wrist+0.001
 cherish+0.001
Neg logit contributions
 boiling-0.002
 microwave-0.002
us-0.002
 Farmer-0.002
 bulldo-0.002
Token.
Token ID13
Feature activation-2.09e-02

Pos logit contributions
lets+0.057
rings+0.050
 cherish+0.048
 wrist+0.048
adays+0.047
Neg logit contributions
 boiling-0.056
us-0.056
 microwave-0.056
washer-0.054
 bulldo-0.048
Token They
Token ID1119
Feature activation-1.79e-02

Pos logit contributions
lets+0.026
rings+0.022
 bracelet+0.021
 wrist+0.020
 cherish+0.020
Neg logit contributions
 microwave-0.046
 boiling-0.045
us-0.043
 Farmer-0.043
 bulldo-0.042
Token could
Token ID714
Feature activation-2.29e-02

Pos logit contributions
lets+0.027
 wrist+0.025
rings+0.025
adays+0.024
 bracelet+0.024
Neg logit contributions
 boiling-0.032
 microwave-0.031
washer-0.031
us-0.031
 bulldo-0.030
Token build
Token ID1382
Feature activation-3.01e-02

Pos logit contributions
lets+0.048
rings+0.042
packs+0.040
adays+0.040
 languages+0.040
Neg logit contributions
 microwave-0.032
 boiling-0.032
washer-0.028
 drill-0.028
 bulldo-0.028
Token towers
Token ID18028
Feature activation-6.34e-02

Pos logit contributions
lets+0.051
rings+0.045
 wrist+0.043
 cherish+0.042
 bracelet+0.042
Neg logit contributions
 boiling-0.053
 microwave-0.052
us-0.050
 bulldo-0.046
washer-0.046
Token,
Token ID11
Feature activation-1.43e-02

Pos logit contributions
lets+0.109
 wrist+0.100
Mine+0.099
 wrists+0.099
 bracelet+0.098
Neg logit contributions
 microwave-0.096
 boiling-0.095
washer-0.094
us-0.094
 drill-0.088
Token bridges
Token ID19432
Feature activation-6.28e-02

Pos logit contributions
lets+0.025
 wrist+0.022
 cherish+0.022
rings+0.022
 bracelet+0.022
Neg logit contributions
 microwave-0.023
 boiling-0.023
us-0.021
 bulldo-0.021
washer-0.021
Token,
Token ID11
Feature activation-1.89e-02

Pos logit contributions
lets+0.107
rings+0.094
 bracelet+0.093
 wrist+0.090
 cherish+0.089
Neg logit contributions
 boiling-0.101
 microwave-0.101
us-0.100
washer-0.095
 bulldo-0.091
Token and
Token ID290
Feature activation-2.99e-02

Pos logit contributions
lets+0.032
rings+0.027
 wrist+0.027
 bracelet+0.027
 cherish+0.026
Neg logit contributions
 boiling-0.033
 microwave-0.033
us-0.031
 bulldo-0.030
washer-0.030
Token houses
Token ID7777
Feature activation-3.89e-02

Pos logit contributions
lets+0.047
rings+0.040
 wrist+0.039
 bracelet+0.038
packs+0.038
Neg logit contributions
 boiling-0.055
 microwave-0.053
us-0.052
washer-0.050
 bulldo-0.050
Token her
Token ID607
Feature activation+1.01e-02

Pos logit contributions
lets+0.024
packs+0.022
herry+0.022
 wrist+0.021
ue+0.021
Neg logit contributions
us-0.024
 boiling-0.023
 scientist-0.022
 microwave-0.022
 goblin-0.022
Token wand
Token ID11569
Feature activation-1.60e-02

Pos logit contributions
 boiling+0.016
us+0.015
 microwave+0.015
 pot+0.014
washer+0.014
Neg logit contributions
lets-0.017
ue-0.016
 styles-0.016
Mine-0.015
adays-0.015
Token and
Token ID290
Feature activation-1.24e-02

Pos logit contributions
lets+0.027
rings+0.023
ue+0.023
 wrist+0.022
herry+0.022
Neg logit contributions
 boiling-0.027
us-0.026
 microwave-0.025
washer-0.025
 goblin-0.023
Token say
Token ID910
Feature activation-1.42e-02

Pos logit contributions
lets+0.018
ue+0.016
herry+0.016
rings+0.015
angles+0.015
Neg logit contributions
 boiling-0.023
 microwave-0.021
 scientist-0.021
 stew-0.021
 goblin-0.021
Token "
Token ID366
Feature activation-4.52e-03

Pos logit contributions
lets+0.022
rings+0.018
packs+0.018
 wrist+0.018
 styles+0.017
Neg logit contributions
us-0.027
 boiling-0.026
 microwave-0.026
 Farmer-0.024
washer-0.024
Tokenab
Token ID397
Feature activation-6.32e-02

Pos logit contributions
lets+0.008
packs+0.007
adays+0.007
rings+0.007
 languages+0.007
Neg logit contributions
us-0.007
 boiling-0.006
 microwave-0.006
 bulldo-0.006
washer-0.006
Tokenrac
Token ID11510
Feature activation-1.18e-02

Pos logit contributions
 styles+0.078
 languages+0.076
bows+0.075
lets+0.074
Every+0.073
Neg logit contributions
us-0.120
washer-0.112
 pot-0.099
cher-0.099
 bulldo-0.099
Tokenad
Token ID324
Feature activation+1.65e-02

Pos logit contributions
lets+0.024
rings+0.021
 wrists+0.020
packs+0.020
bows+0.020
Neg logit contributions
us-0.018
 boiling-0.017
 microwave-0.015
washer-0.014
 Farmer-0.014
Tokenab
Token ID397
Feature activation-4.80e-02

Pos logit contributions
us+0.031
 boiling+0.028
 microwave+0.027
washer+0.027
 bulldo+0.025
Neg logit contributions
lets-0.026
 styles-0.024
 wrists-0.023
 wrist-0.023
 brace-0.023
Tokenra
Token ID430
Feature activation-7.35e-03

Pos logit contributions
Every+0.056
 languages+0.054
 styles+0.053
 tops+0.050
 bracelet+0.050
Neg logit contributions
us-0.099
washer-0.084
 bulldo-0.084
cher-0.084
 drill-0.082
Token"
Token ID1
Feature activation+1.91e-02

Pos logit contributions
lets+0.010
 tops+0.010
packs+0.010
 styles+0.010
rings+0.009
Neg logit contributions
us-0.015
 boiling-0.013
 microwave-0.013
 bulldo-0.012
washer-0.012
Token they
Token ID484
Feature activation-1.84e-03

Pos logit contributions
lets+0.018
packs+0.017
 tops+0.016
 common+0.016
 wrist+0.016
Neg logit contributions
 Doctor-0.016
us-0.015
 Farmer-0.015
 Bear-0.014
 washing-0.014
Token both
Token ID1111
Feature activation+1.14e-03

Pos logit contributions
lets+0.003
 wrist+0.003
 styles+0.003
rings+0.003
 brace+0.003
Neg logit contributions
washer-0.003
 boiling-0.003
us-0.003
 Bear-0.003
 pot-0.003
Token had
Token ID550
Feature activation-2.03e-03

Pos logit contributions
washer+0.002
 boiling+0.002
us+0.002
 microwave+0.001
 bulldo+0.001
Neg logit contributions
lets-0.002
rings-0.002
 wrist-0.002
Her-0.002
 brace-0.002
Token their
Token ID511
Feature activation-9.10e-03

Pos logit contributions
lets+0.004
rings+0.003
 wrist+0.003
 cherish+0.003
Her+0.003
Neg logit contributions
 boiling-0.003
us-0.003
 microwave-0.003
 hamb-0.003
 Farmer-0.003
Token own
Token ID898
Feature activation-1.96e-03

Pos logit contributions
lets+0.013
rings+0.011
Her+0.010
adays+0.009
angles+0.009
Neg logit contributions
 boiling-0.018
 microwave-0.018
us-0.017
 bulldo-0.017
 Farmer-0.017
Token brace
Token ID22581
Feature activation-6.32e-02

Pos logit contributions
lets+0.003
rings+0.003
angles+0.002
Her+0.002
adays+0.002
Neg logit contributions
 boiling-0.004
 microwave-0.004
us-0.003
washer-0.003
 bulldo-0.003
Tokenlets
Token ID5289
Feature activation-3.61e-02

Pos logit contributions
Your+0.055
Me+0.055
adays+0.055
 particular+0.054
Her+0.053
Neg logit contributions
us-0.131
cher-0.120
box-0.116
lic-0.114
 kit-0.112
Token and
Token ID290
Feature activation+6.39e-03

Pos logit contributions
lets+0.060
rings+0.052
 styles+0.052
adays+0.052
Mine+0.052
Neg logit contributions
us-0.059
 boiling-0.057
washer-0.056
 microwave-0.054
 pot-0.050
Token they
Token ID484
Feature activation-3.58e-03

Pos logit contributions
 potatoes+0.012
us+0.012
 boiling+0.012
 cel+0.012
 bulldo+0.011
Neg logit contributions
inated-0.007
ust-0.006
herry-0.006
ue-0.006
lets-0.006
Token were
Token ID547
Feature activation-6.93e-03

Pos logit contributions
ue+0.005
lets+0.005
 wrist+0.005
adays+0.005
 languages+0.005
Neg logit contributions
washer-0.005
us-0.005
 boiling-0.005
 pot-0.005
 microwave-0.005
Token both
Token ID1111
Feature activation+7.75e-03

Pos logit contributions
lets+0.012
rings+0.011
 wrist+0.011
bows+0.011
Mine+0.011
Neg logit contributions
 boiling-0.011
us-0.011
 microwave-0.010
 cooking-0.009
 working-0.009
Token They
Token ID1119
Feature activation-1.56e-02

Pos logit contributions
lets+0.025
rings+0.021
 bracelet+0.020
 styles+0.020
 wrist+0.020
Neg logit contributions
us-0.033
 boiling-0.032
 microwave-0.032
 Farmer-0.030
 bulldo-0.029
Token could
Token ID714
Feature activation-2.20e-02

Pos logit contributions
lets+0.024
rings+0.022
adays+0.021
 wrist+0.021
packs+0.020
Neg logit contributions
 boiling-0.028
 microwave-0.027
washer-0.027
us-0.026
 bulldo-0.025
Token make
Token ID787
Feature activation-2.30e-02

Pos logit contributions
lets+0.041
packs+0.035
rings+0.034
 mattered+0.033
adays+0.033
Neg logit contributions
 boiling-0.036
 microwave-0.036
 drill-0.033
washer-0.031
 smoking-0.031
Token houses
Token ID7777
Feature activation-4.82e-02

Pos logit contributions
lets+0.037
rings+0.031
 wrist+0.030
 cherish+0.030
 bracelet+0.030
Neg logit contributions
 boiling-0.042
 microwave-0.042
us-0.041
washer-0.038
 bulldo-0.037
Token,
Token ID11
Feature activation-1.82e-02

Pos logit contributions
lets+0.079
 wrist+0.070
rings+0.070
 wrists+0.068
 cherish+0.067
Neg logit contributions
 boiling-0.080
 microwave-0.077
us-0.076
washer-0.075
 smoking-0.071
Token towers
Token ID18028
Feature activation-6.31e-02

Pos logit contributions
lets+0.031
rings+0.026
 wrist+0.025
 bracelet+0.025
 cherish+0.025
Neg logit contributions
 boiling-0.032
 microwave-0.031
us-0.029
 bulldo-0.028
washer-0.028
Token,
Token ID11
Feature activation-1.79e-02

Pos logit contributions
lets+0.113
 bracelet+0.103
 wrist+0.103
 cherish+0.101
Mine+0.101
Neg logit contributions
 boiling-0.091
us-0.090
 microwave-0.089
washer-0.084
 smoking-0.081
Token cars
Token ID5006
Feature activation-3.19e-02

Pos logit contributions
lets+0.030
rings+0.026
 wrist+0.026
 cherish+0.025
 bracelet+0.025
Neg logit contributions
 boiling-0.031
 microwave-0.031
us-0.029
 bulldo-0.028
washer-0.027
Token and
Token ID290
Feature activation-2.89e-02

Pos logit contributions
lets+0.045
rings+0.039
 bracelet+0.037
 wrist+0.036
 styles+0.036
Neg logit contributions
us-0.064
 boiling-0.064
 microwave-0.063
 bulldo-0.059
washer-0.059
Token animals
Token ID4695
Feature activation-1.86e-02

Pos logit contributions
lets+0.049
rings+0.042
 wrist+0.041
 bracelet+0.040
 cherish+0.040
Neg logit contributions
 boiling-0.050
 microwave-0.049
us-0.047
 bulldo-0.045
washer-0.044
Token with
Token ID351
Feature activation-1.55e-02

Pos logit contributions
lets+0.027
rings+0.023
 bracelet+0.021
 styles+0.021
 wrist+0.021
Neg logit contributions
us-0.037
 boiling-0.037
 microwave-0.036
washer-0.034
 bulldo-0.034
Token They
Token ID1119
Feature activation-1.44e-02

Pos logit contributions
lets+0.005
rings+0.004
 bracelet+0.004
 brace+0.003
 wrist+0.003
Neg logit contributions
 boiling-0.008
 microwave-0.008
us-0.008
 Farmer-0.007
 bulldo-0.007
Token can
Token ID460
Feature activation-2.18e-02

Pos logit contributions
lets+0.019
rings+0.017
 wrist+0.016
 bracelet+0.015
packs+0.015
Neg logit contributions
 boiling-0.030
 microwave-0.029
us-0.029
washer-0.028
 bulldo-0.028
Token make
Token ID787
Feature activation-2.47e-02

Pos logit contributions
lets+0.042
packs+0.037
rings+0.036
 bracelet+0.035
adays+0.035
Neg logit contributions
 boiling-0.033
 microwave-0.032
washer-0.032
 drill-0.030
us-0.029
Token houses
Token ID7777
Feature activation-4.62e-02

Pos logit contributions
lets+0.038
rings+0.033
 wrist+0.031
 bracelet+0.031
 cherish+0.031
Neg logit contributions
 boiling-0.047
 microwave-0.046
us-0.046
washer-0.042
 bulldo-0.042
Token,
Token ID11
Feature activation-1.73e-02

Pos logit contributions
lets+0.074
 wrist+0.067
rings+0.067
 cherish+0.065
 wrists+0.065
Neg logit contributions
 boiling-0.075
us-0.074
 microwave-0.073
washer-0.072
 drill-0.068
Token towers
Token ID18028
Feature activation-6.31e-02

Pos logit contributions
lets+0.029
rings+0.025
 wrist+0.025
 cherish+0.025
 bracelet+0.025
Neg logit contributions
 boiling-0.029
 microwave-0.029
us-0.027
 bulldo-0.026
washer-0.026
Token,
Token ID11
Feature activation-1.57e-02

Pos logit contributions
lets+0.113
 bracelet+0.106
 wrist+0.105
 cherish+0.104
Mine+0.104
Neg logit contributions
us-0.086
 boiling-0.086
 microwave-0.084
washer-0.082
 drill-0.077
Token cars
Token ID5006
Feature activation-3.09e-02

Pos logit contributions
lets+0.027
 wrist+0.024
rings+0.023
 cherish+0.023
 bracelet+0.023
Neg logit contributions
 boiling-0.026
 microwave-0.026
us-0.024
 bulldo-0.023
washer-0.023
Token,
Token ID11
Feature activation-1.88e-02

Pos logit contributions
lets+0.045
rings+0.039
 bracelet+0.037
 wrist+0.037
 styles+0.036
Neg logit contributions
us-0.060
 boiling-0.060
 microwave-0.059
washer-0.055
 bulldo-0.055
Token and
Token ID290
Feature activation-2.65e-02

Pos logit contributions
lets+0.031
rings+0.026
 wrist+0.026
 cherish+0.026
 bracelet+0.026
Neg logit contributions
 boiling-0.033
 microwave-0.033
us-0.031
 bulldo-0.030
washer-0.029
Token animals
Token ID4695
Feature activation-1.72e-02

Pos logit contributions
lets+0.043
rings+0.037
 wrist+0.036
 bracelet+0.035
packs+0.035
Neg logit contributions
 boiling-0.047
 microwave-0.046
us-0.045
washer-0.042
 bulldo-0.042
Token your
Token ID534
Feature activation-1.64e-02

Pos logit contributions
lets+0.051
rings+0.042
bows+0.040
packs+0.039
 cherish+0.038
Neg logit contributions
 boiling-0.080
us-0.079
 microwave-0.076
 Farmer-0.073
 Doctor-0.070
Token neck
Token ID7393
Feature activation-4.60e-02

Pos logit contributions
lets+0.026
rings+0.022
adays+0.021
angles+0.020
packs+0.020
Neg logit contributions
 boiling-0.030
 microwave-0.030
us-0.028
 bulldo-0.028
washer-0.027
Tokenl
Token ID75
Feature activation-1.79e-02

Pos logit contributions
lets+0.081
rings+0.071
Mine+0.069
 cherish+0.068
 wrists+0.067
Neg logit contributions
 boiling-0.072
us-0.071
washer-0.069
 microwave-0.069
 smoking-0.064
Tokenaces
Token ID2114
Feature activation-5.01e-02

Pos logit contributions
lets+0.021
rings+0.018
 styles+0.017
Her+0.016
 cherish+0.016
Neg logit contributions
us-0.038
 boiling-0.037
 microwave-0.037
washer-0.036
 bulldo-0.035
Token and
Token ID290
Feature activation-1.56e-02

Pos logit contributions
lets+0.085
rings+0.074
 styles+0.072
 wrists+0.070
 tops+0.069
Neg logit contributions
us-0.084
 boiling-0.084
washer-0.081
 microwave-0.081
 bulldo-0.075
Token brace
Token ID22581
Feature activation-6.30e-02

Pos logit contributions
lets+0.020
herry+0.017
ue+0.017
adays+0.017
Her+0.017
Neg logit contributions
 boiling-0.030
 microwave-0.028
 smoking-0.028
us-0.028
 potatoes-0.028
Tokenlets
Token ID5289
Feature activation-4.43e-02

Pos logit contributions
Me+0.052
adays+0.051
Every+0.047
 particular+0.047
 Lib+0.047
Neg logit contributions
us-0.148
cher-0.130
aker-0.128
 kit-0.127
 wrench-0.126
Token?"
Token ID1701
Feature activation-4.38e-03

Pos logit contributions
lets+0.077
rings+0.067
 styles+0.064
 wrists+0.063
 wrist+0.063
Neg logit contributions
 boiling-0.074
us-0.073
 microwave-0.071
washer-0.069
 smoking-0.064
Token Mom
Token ID11254
Feature activation+2.38e-02

Pos logit contributions
lets+0.006
rings+0.005
 bracelet+0.004
 wrist+0.004
 styles+0.004
Neg logit contributions
us-0.009
 boiling-0.009
 microwave-0.009
washer-0.009
 bulldo-0.009
Token asks
Token ID7893
Feature activation+2.34e-02

Pos logit contributions
 boiling+0.054
us+0.054
 microwave+0.051
 bulldo+0.051
 hamb+0.051
Neg logit contributions
lets-0.026
rings-0.022
Her-0.020
 styles-0.020
 Her-0.020
Token.
Token ID13
Feature activation+8.46e-03

Pos logit contributions
 boiling+0.056
 microwave+0.056
us+0.055
 bulldo+0.054
 hamb+0.053
Neg logit contributions
lets-0.024
rings-0.020
packs-0.018
Her-0.018
 wrist-0.018
Token time
Token ID640
Feature activation+2.05e-02

Pos logit contributions
lets+0.002
rings+0.002
 wrist+0.002
 styles+0.002
 bracelet+0.002
Neg logit contributions
 boiling-0.003
us-0.003
 microwave-0.003
 bulldo-0.003
washer-0.003
Token in
Token ID287
Feature activation-1.66e-02

Pos logit contributions
us+0.049
 microwave+0.049
 boiling+0.048
 goblin+0.047
 grizz+0.046
Neg logit contributions
lets-0.021
packs-0.016
rings-0.016
 every-0.016
 tops-0.015
Token the
Token ID262
Feature activation-4.51e-03

Pos logit contributions
lets+0.023
rings+0.019
 bracelet+0.017
 styles+0.017
 wrist+0.017
Neg logit contributions
 boiling-0.036
us-0.035
 microwave-0.034
 Farmer-0.032
 bulldo-0.032
Token garden
Token ID11376
Feature activation-8.42e-03

Pos logit contributions
lets+0.007
 bracelet+0.006
rings+0.006
 own+0.006
 tops+0.006
Neg logit contributions
us-0.009
 boiling-0.008
 microwave-0.008
washer-0.008
 bulldo-0.008
Token.
Token ID13
Feature activation-1.26e-02

Pos logit contributions
lets+0.011
rings+0.009
 styles+0.008
packs+0.008
 wrist+0.008
Neg logit contributions
us-0.018
 boiling-0.018
 microwave-0.018
 bulldo-0.017
washer-0.016
Token<|endoftext|>
Token ID50256
Feature activation+6.53e-02

Pos logit contributions
lets+0.019
rings+0.017
 bracelet+0.016
 styles+0.016
 brace+0.015
Neg logit contributions
 microwave-0.024
 boiling-0.024
us-0.024
 Farmer-0.023
 bulldo-0.022
TokenAnna
Token ID31160
Feature activation-2.24e-02

Pos logit contributions
 boiling+0.113
 microwave+0.108
us+0.104
 hamb+0.104
washer+0.102
Neg logit contributions
lets-0.104
packs-0.096
 tops-0.092
rings-0.091
 every-0.088
Token and
Token ID290
Feature activation-8.05e-03

Pos logit contributions
lets+0.030
rings+0.026
 bracelet+0.023
 wrist+0.023
 styles+0.023
Neg logit contributions
us-0.048
 boiling-0.047
 microwave-0.046
 bulldo-0.043
washer-0.043
Token Lily
Token ID20037
Feature activation-2.42e-02

Pos logit contributions
lets+0.014
 styles+0.012
rings+0.012
bows+0.011
 tops+0.011
Neg logit contributions
 boiling-0.014
 Farmer-0.013
 microwave-0.013
us-0.013
 Doctor-0.013
Token were
Token ID547
Feature activation+7.53e-03

Pos logit contributions
lets+0.041
rings+0.038
 wrist+0.035
 bracelet+0.034
 styles+0.034
Neg logit contributions
 boiling-0.041
 microwave-0.039
us-0.038
 bulldo-0.037
washer-0.036
Token sisters
Token ID15153
Feature activation-3.76e-03

Pos logit contributions
us+0.016
 boiling+0.016
 microwave+0.015
 bulldo+0.015
 goblin+0.015
Neg logit contributions
lets-0.009
packs-0.008
 own-0.008
rings-0.008
 bracelet-0.007
Token alone
Token ID3436
Feature activation+1.31e-02

Pos logit contributions
lets+0.026
 styles+0.023
rings+0.023
angles+0.023
ue+0.022
Neg logit contributions
 boiling-0.023
us-0.023
washer-0.021
 microwave-0.021
 smoking-0.019
Token for
Token ID329
Feature activation+1.12e-02

Pos logit contributions
us+0.025
 boiling+0.025
 microwave+0.024
washer+0.023
 bulldo+0.023
Neg logit contributions
lets-0.020
rings-0.017
 bracelet-0.016
 styles-0.016
 wrist-0.016
Token too
Token ID1165
Feature activation+1.34e-02

Pos logit contributions
 microwave+0.024
us+0.024
 boiling+0.023
washer+0.022
 bulldo+0.022
Neg logit contributions
lets-0.015
rings-0.012
 bracelet-0.012
 styles-0.012
 own-0.012
Token long
Token ID890
Feature activation+1.38e-02

Pos logit contributions
us+0.025
 boiling+0.025
 microwave+0.023
 Farmer+0.022
 bulldo+0.022
Neg logit contributions
lets-0.022
rings-0.019
 styles-0.018
 wrist-0.018
 bracelet-0.018
Token.
Token ID13
Feature activation+8.69e-03

Pos logit contributions
 boiling+0.029
us+0.029
 microwave+0.028
washer+0.026
 bulldo+0.026
Neg logit contributions
lets-0.019
rings-0.016
 bracelet-0.015
 styles-0.015
 wrist-0.014
Token<|endoftext|>
Token ID50256
Feature activation+4.20e-02

Pos logit contributions
us+0.016
 microwave+0.015
 boiling+0.015
 Farmer+0.015
 bulldo+0.014
Neg logit contributions
lets-0.014
rings-0.013
 bracelet-0.012
 styles-0.012
 brace-0.012
TokenOnce
Token ID7454
Feature activation-4.42e-02

Pos logit contributions
 boiling+0.089
us+0.086
 microwave+0.086
washer+0.081
 bulldo+0.080
Neg logit contributions
lets-0.054
rings-0.045
packs-0.043
 bracelet-0.042
 wrist-0.042
Token upon
Token ID2402
Feature activation-1.12e-02

Pos logit contributions
lets+0.084
 styles+0.074
 wrists+0.074
 wrist+0.073
bows+0.072
Neg logit contributions
us-0.077
 boiling-0.069
washer-0.069
 microwave-0.065
 Farmer-0.063
Token a
Token ID257
Feature activation+9.90e-04

Pos logit contributions
lets+0.021
rings+0.018
 styles+0.017
 wrists+0.017
angles+0.017
Neg logit contributions
us-0.018
 boiling-0.018
 microwave-0.018
 Farmer-0.016
washer-0.016
Token time
Token ID640
Feature activation+8.11e-03

Pos logit contributions
 boiling+0.002
us+0.002
 microwave+0.002
 Farmer+0.002
 bulldo+0.002
Neg logit contributions
lets-0.001
rings-0.001
 styles-0.001
 wrist-0.001
 bracelet-0.001
Token,
Token ID11
Feature activation-1.61e-02

Pos logit contributions
us+0.013
 boiling+0.013
 Farmer+0.012
 bulldo+0.012
washer+0.012
Neg logit contributions
lets-0.015
 styles-0.013
rings-0.013
 wrists-0.013
 wrist-0.013
Token lived
Token ID5615
Feature activation+4.79e-03

Pos logit contributions
us+0.007
 boiling+0.007
 microwave+0.007
washer+0.007
 bulldo+0.006
Neg logit contributions
lets-0.005
rings-0.005
 styles-0.004
 wrist-0.004
 bracelet-0.004
Token happily
Token ID18177
Feature activation+7.65e-03

Pos logit contributions
us+0.010
 boiling+0.010
 microwave+0.010
washer+0.009
 Farmer+0.009
Neg logit contributions
lets-0.006
rings-0.006
 styles-0.005
 bracelet-0.005
 wrist-0.005
Token ever
Token ID1683
Feature activation+2.65e-03

Pos logit contributions
 boiling+0.020
 microwave+0.020
us+0.020
 Farmer+0.018
 bulldo+0.018
Neg logit contributions
lets-0.007
rings-0.005
 bracelet-0.004
 wrist-0.004
 styles-0.004
Token after
Token ID706
Feature activation-4.08e-03

Pos logit contributions
 microwave+0.006
 boiling+0.006
us+0.006
washer+0.006
 bulldo+0.006
Neg logit contributions
lets-0.003
rings-0.002
 own-0.002
 every-0.002
 styles-0.002
Token.
Token ID13
Feature activation+1.33e-02

Pos logit contributions
lets+0.007
rings+0.006
 bracelet+0.005
 styles+0.005
 wrist+0.005
Neg logit contributions
us-0.008
 boiling-0.008
 microwave-0.007
washer-0.007
 bulldo-0.007
Token<|endoftext|>
Token ID50256
Feature activation+4.35e-02

Pos logit contributions
us+0.022
 boiling+0.021
 microwave+0.020
 Farmer+0.020
 Doctor+0.019
Neg logit contributions
lets-0.025
rings-0.022
 styles-0.022
 bracelet-0.021
 brace-0.021
TokenOnce
Token ID7454
Feature activation-3.95e-02

Pos logit contributions
 boiling+0.095
us+0.092
 microwave+0.092
washer+0.086
 bulldo+0.086
Neg logit contributions
lets-0.054
rings-0.045
packs-0.042
 wrist-0.042
 bracelet-0.041
Token,
Token ID11
Feature activation-2.51e-02

Pos logit contributions
lets+0.076
 wrists+0.065
 wrist+0.065
 styles+0.065
 brace+0.063
Neg logit contributions
us-0.069
 boiling-0.063
washer-0.063
 microwave-0.059
 Farmer-0.056
Token there
Token ID612
Feature activation-3.18e-02

Pos logit contributions
lets+0.056
 styles+0.052
rings+0.051
bows+0.050
 brace+0.050
Neg logit contributions
 boiling-0.034
us-0.030
 microwave-0.030
washer-0.029
 pot-0.027
Token was
Token ID373
Feature activation+2.64e-03

Pos logit contributions
lets+0.067
rings+0.060
 brace+0.054
 wrists+0.054
 styles+0.054
Neg logit contributions
us-0.047
 boiling-0.046
 microwave-0.043
 Farmer-0.041
 bulldo-0.041
Token a
Token ID257
Feature activation+8.01e-03

Pos logit contributions
 boiling+0.005
 microwave+0.004
us+0.004
 Farmer+0.004
 hamb+0.004
Neg logit contributions
lets-0.005
rings-0.004
 styles-0.004
 wrist-0.004
 wrists-0.004
Token on
Token ID319
Feature activation-1.29e-02

Pos logit contributions
us+0.045
 boiling+0.045
 microwave+0.043
washer+0.040
 bulldo+0.040
Neg logit contributions
lets-0.034
rings-0.029
 bracelet-0.027
 styles-0.027
 wrist-0.027
Token another
Token ID1194
Feature activation-4.05e-03

Pos logit contributions
lets+0.022
rings+0.019
 styles+0.018
 wrist+0.017
 bracelet+0.017
Neg logit contributions
 boiling-0.024
us-0.023
 microwave-0.022
 Farmer-0.022
 bulldo-0.020
Token adventure
Token ID8855
Feature activation+1.13e-02

Pos logit contributions
lets+0.007
rings+0.006
 styles+0.005
 wrist+0.005
packs+0.005
Neg logit contributions
 boiling-0.007
us-0.007
 microwave-0.007
 bulldo-0.006
washer-0.006
Token soon
Token ID2582
Feature activation+1.36e-02

Pos logit contributions
us+0.024
 boiling+0.024
 microwave+0.023
washer+0.022
 bulldo+0.022
Neg logit contributions
lets-0.015
rings-0.012
 bracelet-0.012
 styles-0.011
 wrist-0.011
Token!"
Token ID2474
Feature activation+2.29e-02

Pos logit contributions
us+0.028
 boiling+0.027
 microwave+0.026
 Farmer+0.025
 bulldo+0.025
Neg logit contributions
lets-0.019
rings-0.016
 bracelet-0.016
 wrist-0.015
 styles-0.015
Token<|endoftext|>
Token ID50256
Feature activation+5.11e-02

Pos logit contributions
 boiling+0.046
us+0.046
 microwave+0.045
washer+0.042
 bulldo+0.042
Neg logit contributions
lets-0.033
rings-0.028
 bracelet-0.026
 wrist-0.026
 styles-0.026
TokenM
Token ID44
Feature activation-1.78e-03

Pos logit contributions
 boiling+0.086
 microwave+0.082
us+0.081
washer+0.075
 bulldo+0.075
Neg logit contributions
lets-0.089
rings-0.078
packs-0.077
 tops-0.075
 bracelet-0.074
Tokenolly
Token ID5098
Feature activation-6.17e-03

Pos logit contributions
lets+0.003
rings+0.003
 styles+0.002
 bracelet+0.002
 wrist+0.002
Neg logit contributions
us-0.003
 boiling-0.003
 microwave-0.003
washer-0.003
 bulldo-0.003
Token was
Token ID373
Feature activation+4.42e-03

Pos logit contributions
lets+0.009
rings+0.008
 bracelet+0.007
 styles+0.007
 wrist+0.007
Neg logit contributions
us-0.012
 boiling-0.012
 microwave-0.012
 bulldo-0.011
washer-0.011
Token a
Token ID257
Feature activation+2.00e-03

Pos logit contributions
us+0.008
 boiling+0.007
 microwave+0.007
washer+0.007
 bulldo+0.007
Neg logit contributions
lets-0.007
rings-0.006
 wrist-0.006
 bracelet-0.006
packs-0.006
Token 3
Token ID513
Feature activation+1.83e-02

Pos logit contributions
us+0.004
 boiling+0.004
 microwave+0.003
washer+0.003
 bulldo+0.003
Neg logit contributions
lets-0.003
rings-0.003
 bracelet-0.003
 wrist-0.003
 styles-0.003
Token speak
Token ID2740
Feature activation-2.59e-02

Pos logit contributions
lets+0.018
rings+0.016
 bracelet+0.015
packs+0.015
 styles+0.014
Neg logit contributions
 boiling-0.024
us-0.023
 stew-0.023
 microwave-0.023
 hamb-0.022
Token and
Token ID290
Feature activation-2.72e-03

Pos logit contributions
lets+0.047
rings+0.044
 wrist+0.043
ue+0.042
Mine+0.042
Neg logit contributions
us-0.036
 boiling-0.036
washer-0.034
 stew-0.033
 scientist-0.032
Token play
Token ID711
Feature activation-1.02e-02

Pos logit contributions
lets+0.004
rings+0.003
 bracelet+0.003
packs+0.003
 styles+0.003
Neg logit contributions
 boiling-0.005
us-0.005
 stew-0.005
 Farmer-0.005
 hamb-0.005
Token together
Token ID1978
Feature activation-1.38e-02

Pos logit contributions
lets+0.015
rings+0.013
 bracelet+0.012
 styles+0.012
 wrist+0.012
Neg logit contributions
 boiling-0.020
us-0.020
 microwave-0.019
washer-0.018
 bulldo-0.018
Token.
Token ID13
Feature activation+1.30e-02

Pos logit contributions
lets+0.020
rings+0.018
 bracelet+0.017
adays+0.016
 wrist+0.016
Neg logit contributions
 boiling-0.026
us-0.026
 microwave-0.024
 stew-0.023
washer-0.023
Token<|endoftext|>
Token ID50256
Feature activation+4.33e-02

Pos logit contributions
us+0.024
 microwave+0.023
 boiling+0.023
 Farmer+0.022
 Doctor+0.021
Neg logit contributions
lets-0.021
rings-0.018
 bracelet-0.018
 wrist-0.018
 styles-0.017
TokenOnce
Token ID7454
Feature activation-3.81e-02

Pos logit contributions
 boiling+0.095
us+0.093
 microwave+0.092
 bulldo+0.087
washer+0.086
Neg logit contributions
lets-0.052
rings-0.043
packs-0.041
 wrist-0.040
 bracelet-0.040
Token upon
Token ID2402
Feature activation-8.37e-03

Pos logit contributions
lets+0.071
 wrists+0.062
 wrist+0.061
 styles+0.061
 brace+0.058
Neg logit contributions
us-0.067
 boiling-0.063
washer-0.062
 microwave-0.060
 Farmer-0.056
Token a
Token ID257
Feature activation+4.89e-03

Pos logit contributions
lets+0.016
rings+0.014
 styles+0.013
 wrists+0.013
angles+0.013
Neg logit contributions
 boiling-0.014
us-0.013
 microwave-0.013
 Farmer-0.012
 hamb-0.012
Token time
Token ID640
Feature activation+1.21e-02

Pos logit contributions
 boiling+0.009
us+0.009
 microwave+0.009
 Farmer+0.008
 hamb+0.008
Neg logit contributions
lets-0.008
rings-0.008
 styles-0.007
 wrist-0.007
 bracelet-0.006
Token,
Token ID11
Feature activation-1.29e-02

Pos logit contributions
us+0.018
 boiling+0.016
 Farmer+0.016
washer+0.015
 bulldo+0.015
Neg logit contributions
lets-0.025
 styles-0.022
rings-0.022
 wrists-0.022
 wrist-0.022
Token stood
Token ID6204
Feature activation+8.48e-03

Pos logit contributions
 boiling+0.008
us+0.008
 microwave+0.007
washer+0.007
 bulldo+0.007
Neg logit contributions
lets-0.006
rings-0.005
 bracelet-0.005
 wrist-0.005
 styles-0.005
Token up
Token ID510
Feature activation+2.80e-03

Pos logit contributions
 boiling+0.017
us+0.017
 microwave+0.017
 Farmer+0.016
washer+0.015
Neg logit contributions
lets-0.013
rings-0.010
 styles-0.010
bows-0.009
packs-0.009
Token for
Token ID329
Feature activation+1.17e-02

Pos logit contributions
us+0.006
 boiling+0.006
 microwave+0.005
washer+0.005
 bulldo+0.005
Neg logit contributions
lets-0.004
rings-0.003
 bracelet-0.003
 wrist-0.003
 styles-0.003
Token herself
Token ID5223
Feature activation-7.65e-03

Pos logit contributions
us+0.028
 microwave+0.028
washer+0.027
 boiling+0.027
 bulldo+0.026
Neg logit contributions
lets-0.011
 herself-0.011
 brace-0.010
rings-0.010
 bracelet-0.010
Token.
Token ID13
Feature activation+4.16e-03

Pos logit contributions
lets+0.012
rings+0.011
 styles+0.010
 bracelet+0.010
angles+0.010
Neg logit contributions
us-0.014
 boiling-0.014
 microwave-0.013
washer-0.013
 bulldo-0.012
Token
Token ID198
Feature activation+8.98e-05

Pos logit contributions
us+0.009
 boiling+0.009
 microwave+0.009
 Farmer+0.008
 Doctor+0.008
Neg logit contributions
lets-0.005
rings-0.004
 styles-0.004
 bracelet-0.004
 wrist-0.004
Token
Token ID198
Feature activation-1.22e-02

Pos logit contributions
us+0.000
 boiling+0.000
 microwave+0.000
washer+0.000
 bulldo+0.000
Neg logit contributions
lets-0.000
rings-0.000
 bracelet-0.000
 styles-0.000
 wrist-0.000
TokenThe
Token ID464
Feature activation-1.25e-02

Pos logit contributions
lets+0.019
rings+0.017
packs+0.016
bows+0.015
 wrists+0.015
Neg logit contributions
us-0.024
 boiling-0.023
washer-0.022
 microwave-0.022
 scientist-0.021
Token boy
Token ID2933
Feature activation+6.92e-03

Pos logit contributions
lets+0.021
rings+0.018
 bracelet+0.018
 styles+0.017
 wrist+0.017
Neg logit contributions
us-0.023
 boiling-0.022
 microwave-0.022
 bulldo-0.020
 Farmer-0.020
Token started
Token ID2067
Feature activation+6.26e-03

Pos logit contributions
us+0.014
 boiling+0.014
 microwave+0.014
 bulldo+0.013
washer+0.013
Neg logit contributions
lets-0.010
rings-0.008
 bracelet-0.008
 wrist-0.008
 styles-0.008
Token to
Token ID284
Feature activation-3.91e-03

Pos logit contributions
us+0.012
 boiling+0.012
 microwave+0.012
washer+0.011
 bulldo+0.011
Neg logit contributions
lets-0.010
rings-0.008
 styles-0.008
 wrist-0.008
 brace-0.008
Token to
Token ID284
Feature activation-3.20e-03

Pos logit contributions
us+0.017
 boiling+0.017
 microwave+0.017
washer+0.016
 bulldo+0.015
Neg logit contributions
lets-0.013
rings-0.012
 bracelet-0.011
 wrist-0.011
 styles-0.011
Token play
Token ID711
Feature activation-4.70e-03

Pos logit contributions
lets+0.005
rings+0.004
 bracelet+0.004
 styles+0.004
 wrist+0.004
Neg logit contributions
 boiling-0.007
us-0.007
 microwave-0.006
 bulldo-0.006
washer-0.006
Token a
Token ID257
Feature activation-8.67e-03

Pos logit contributions
lets+0.006
rings+0.005
 wrist+0.005
 bracelet+0.005
 styles+0.004
Neg logit contributions
us-0.011
 boiling-0.010
 microwave-0.010
 bulldo-0.010
washer-0.010
Token joke
Token ID9707
Feature activation-2.39e-03

Pos logit contributions
lets+0.010
 bracelet+0.008
rings+0.008
 wrist+0.008
 styles+0.008
Neg logit contributions
 boiling-0.020
us-0.020
 microwave-0.019
 Farmer-0.018
washer-0.018
Token on
Token ID319
Feature activation-1.56e-02

Pos logit contributions
lets+0.003
rings+0.003
 bracelet+0.002
 styles+0.002
 wrist+0.002
Neg logit contributions
 boiling-0.005
us-0.005
 microwave-0.005
 bulldo-0.005
 Farmer-0.005
Token Mum
Token ID38367
Feature activation+1.98e-02

Pos logit contributions
lets+0.024
rings+0.020
 bracelet+0.018
 styles+0.018
packs+0.018
Neg logit contributions
us-0.031
 boiling-0.031
 microwave-0.030
 Farmer-0.028
washer-0.027
Token.
Token ID13
Feature activation+1.68e-02

Pos logit contributions
us+0.041
 boiling+0.040
 microwave+0.040
washer+0.037
 bulldo+0.037
Neg logit contributions
lets-0.027
rings-0.023
 bracelet-0.022
 wrist-0.021
 styles-0.021
Tokenâ
Token ID22940
Feature activation-1.28e-03

Pos logit contributions
us+0.034
 boiling+0.034
 microwave+0.033
washer+0.030
 bulldo+0.030
Neg logit contributions
lets-0.024
rings-0.021
 bracelet-0.019
 styles-0.019
 wrist-0.019
Token
Token ID26391
Feature activation-9.67e-03

Pos logit contributions
lets+0.002
rings+0.002
 styles+0.002
 wrist+0.002
 bracelet+0.001
Neg logit contributions
 boiling-0.003
us-0.003
 microwave-0.002
 bulldo-0.002
washer-0.002
Token Jake
Token ID14757
Feature activation-1.12e-02

Pos logit contributions
lets+0.015
rings+0.013
 bracelet+0.012
 styles+0.012
 brace+0.012
Neg logit contributions
us-0.019
 microwave-0.018
 boiling-0.018
 bulldo-0.017
washer-0.017
Token liked
Token ID8288
Feature activation+2.10e-02

Pos logit contributions
lets+0.019
rings+0.017
 bracelet+0.016
packs+0.016
 wrist+0.015
Neg logit contributions
us-0.020
 boiling-0.019
 microwave-0.019
washer-0.018
 bulldo-0.017
Token for
Token ID329
Feature activation+5.56e-03

Pos logit contributions
 boiling+0.023
 microwave+0.023
us+0.023
 bulldo+0.021
 grizz+0.021
Neg logit contributions
lets-0.015
rings-0.012
 every-0.012
 styles-0.012
 wrist-0.011
Token their
Token ID511
Feature activation+4.83e-03

Pos logit contributions
 microwave+0.013
us+0.013
washer+0.012
auc+0.012
 boiling+0.012
Neg logit contributions
 every-0.006
 herself-0.006
 cherish-0.006
lets-0.005
 her-0.005
Token mistake
Token ID7457
Feature activation+1.35e-02

Pos logit contributions
us+0.009
 boiling+0.009
 microwave+0.009
washer+0.008
 bulldo+0.008
Neg logit contributions
lets-0.008
 own-0.007
 styles-0.007
 bracelet-0.007
rings-0.007
Token.
Token ID13
Feature activation+1.71e-02

Pos logit contributions
 boiling+0.028
us+0.028
 microwave+0.028
 bulldo+0.026
washer+0.025
Neg logit contributions
lets-0.019
rings-0.016
 styles-0.015
 bracelet-0.014
 wrist-0.014
Token From
Token ID3574
Feature activation-6.83e-03

Pos logit contributions
us+0.035
 boiling+0.035
 microwave+0.034
 Farmer+0.033
 bulldo+0.032
Neg logit contributions
lets-0.024
rings-0.020
 bracelet-0.020
 styles-0.019
 brace-0.019
Token then
Token ID788
Feature activation+1.71e-02

Pos logit contributions
lets+0.011
rings+0.009
 styles+0.008
 bracelet+0.008
 wrist+0.008
Neg logit contributions
us-0.013
 boiling-0.013
 microwave-0.013
 Farmer-0.012
washer-0.012
Token on
Token ID319
Feature activation-3.53e-02

Pos logit contributions
 boiling+0.033
us+0.032
 microwave+0.032
washer+0.029
 bulldo+0.029
Neg logit contributions
lets-0.027
rings-0.023
 bracelet-0.021
 styles-0.021
 wrist-0.021
Token,
Token ID11
Feature activation+1.05e-02

Pos logit contributions
lets+0.055
rings+0.047
 bracelet+0.044
 wrist+0.044
 styles+0.043
Neg logit contributions
 boiling-0.069
us-0.069
 microwave-0.066
 Farmer-0.064
 bulldo-0.062
Token they
Token ID484
Feature activation+7.64e-03

Pos logit contributions
us+0.021
 boiling+0.021
 microwave+0.020
 Farmer+0.020
 Doctor+0.019
Neg logit contributions
lets-0.015
rings-0.013
 bracelet-0.013
 wrist-0.013
 styles-0.012
Token made
Token ID925
Feature activation-8.25e-03

Pos logit contributions
us+0.016
 boiling+0.016
 microwave+0.015
washer+0.014
 bulldo+0.014
Neg logit contributions
lets-0.010
rings-0.009
 wrist-0.008
 styles-0.008
 bracelet-0.008
Token sure
Token ID1654
Feature activation-2.27e-02

Pos logit contributions
lets+0.014
 styles+0.012
 own+0.012
 brace+0.012
 every+0.012
Neg logit contributions
 boiling-0.014
us-0.014
 microwave-0.013
 bulldo-0.013
 grizz-0.013
Token around
Token ID1088
Feature activation-5.08e-03

Pos logit contributions
us+0.009
 boiling+0.009
 microwave+0.008
 bulldo+0.008
 Farmer+0.008
Neg logit contributions
lets-0.009
rings-0.008
 wrist-0.008
 bracelet-0.008
 styles-0.008
Token.
Token ID13
Feature activation+6.75e-03

Pos logit contributions
lets+0.006
rings+0.005
 tops+0.005
 bracelet+0.005
 styles+0.005
Neg logit contributions
us-0.011
 boiling-0.011
 microwave-0.011
washer-0.010
 bulldo-0.010
Token Paul
Token ID3362
Feature activation-1.79e-02

Pos logit contributions
 boiling+0.014
us+0.013
 microwave+0.013
washer+0.012
 bulldo+0.012
Neg logit contributions
lets-0.010
rings-0.008
 wrist-0.008
 styles-0.008
 tops-0.008
Token was
Token ID373
Feature activation-4.71e-03

Pos logit contributions
lets+0.027
rings+0.024
 bracelet+0.023
 wrist+0.022
 styles+0.022
Neg logit contributions
us-0.035
 boiling-0.034
 microwave-0.033
washer-0.031
 bulldo-0.031
Token so
Token ID523
Feature activation-5.61e-04

Pos logit contributions
lets+0.008
rings+0.007
 bracelet+0.007
 styles+0.007
 wrist+0.007
Neg logit contributions
us-0.008
 boiling-0.008
 microwave-0.008
washer-0.007
 bulldo-0.007
Token happy
Token ID3772
Feature activation+2.93e-04

Pos logit contributions
lets+0.001
rings+0.001
 bracelet+0.001
 styles+0.001
 wrist+0.001
Neg logit contributions
 boiling-0.001
us-0.001
 microwave-0.001
washer-0.001
 Farmer-0.001
Token that
Token ID326
Feature activation+5.11e-03

Pos logit contributions
 boiling+0.001
us+0.001
 microwave+0.001
washer+0.001
 bulldo+0.001
Neg logit contributions
lets-0.000
rings-0.000
 bracelet-0.000
 styles-0.000
 wrist-0.000
Token he
Token ID339
Feature activation+2.03e-02

Pos logit contributions
 boiling+0.008
us+0.007
 microwave+0.007
 Farmer+0.007
washer+0.007
Neg logit contributions
lets-0.010
rings-0.009
 wrist-0.009
 bracelet-0.009
 styles-0.009
Token asked
Token ID1965
Feature activation+2.89e-03

Pos logit contributions
us+0.038
 boiling+0.037
 microwave+0.036
washer+0.034
 bulldo+0.034
Neg logit contributions
lets-0.032
rings-0.029
packs-0.027
 styles-0.026
 wrist-0.026
Token his
Token ID465
Feature activation+5.15e-03

Pos logit contributions
 boiling+0.007
 microwave+0.006
us+0.006
 bulldo+0.006
washer+0.006
Neg logit contributions
lets-0.003
rings-0.003
 every-0.003
 styles-0.003
 bracelet-0.003
Token mom
Token ID1995
Feature activation+6.27e-03

Pos logit contributions
 boiling+0.012
us+0.012
 microwave+0.011
washer+0.010
 bulldo+0.010
Neg logit contributions
lets-0.006
rings-0.005
 bracelet-0.005
 styles-0.005
 wrist-0.005
Token girl
Token ID2576
Feature activation+5.80e-03

Pos logit contributions
lets+0.000
rings+0.000
 bracelet+0.000
 styles+0.000
 wrist+0.000
Neg logit contributions
us-0.000
 boiling-0.000
 microwave-0.000
washer-0.000
 bulldo-0.000
Token couldn
Token ID3521
Feature activation+2.11e-02

Pos logit contributions
 boiling+0.012
us+0.012
 microwave+0.012
washer+0.011
 bulldo+0.011
Neg logit contributions
lets-0.008
rings-0.006
 wrist-0.006
 bracelet-0.006
 styles-0.006
Token't
Token ID470
Feature activation-5.50e-03

Pos logit contributions
us+0.046
 boiling+0.046
 microwave+0.044
 bulldo+0.042
 Farmer+0.041
Neg logit contributions
lets-0.029
rings-0.024
 styles-0.023
 bracelet-0.022
 wrist-0.022
Token get
Token ID651
Feature activation+2.67e-02

Pos logit contributions
lets+0.007
rings+0.006
 bracelet+0.006
 wrist+0.006
 styles+0.006
Neg logit contributions
 boiling-0.012
us-0.012
 microwave-0.011
washer-0.011
 bulldo-0.010
Token away
Token ID1497
Feature activation-9.23e-03

Pos logit contributions
 microwave+0.054
 boiling+0.053
us+0.053
washer+0.049
 bulldo+0.049
Neg logit contributions
lets-0.037
rings-0.033
 own-0.030
 bracelet-0.030
packs-0.030
Token.
Token ID13
Feature activation+5.54e-03

Pos logit contributions
lets+0.013
rings+0.011
 bracelet+0.010
 styles+0.010
 wrist+0.010
Neg logit contributions
us-0.019
 boiling-0.019
 microwave-0.019
washer-0.017
 bulldo-0.017
Token In
Token ID554
Feature activation-3.52e-02

Pos logit contributions
us+0.012
 boiling+0.012
 microwave+0.012
washer+0.011
 bulldo+0.011
Neg logit contributions
lets-0.007
rings-0.006
 bracelet-0.006
 styles-0.006
 wrist-0.005
Token the
Token ID262
Feature activation-6.13e-03

Pos logit contributions
lets+0.063
rings+0.052
 styles+0.048
bows+0.048
 brace+0.047
Neg logit contributions
 boiling-0.068
us-0.062
 Farmer-0.062
 microwave-0.061
 grizz-0.058
Token end
Token ID886
Feature activation+2.59e-02

Pos logit contributions
 own+0.004
 goal+0.003
 forget+0.003
 color+0.002
 every+0.002
Neg logit contributions
us-0.018
washer-0.016
cher-0.016
 microwave-0.016
 boiling-0.016
Token,
Token ID11
Feature activation+1.29e-03

Pos logit contributions
 boiling+0.059
us+0.059
 microwave+0.058
washer+0.054
 bulldo+0.054
Neg logit contributions
lets-0.030
rings-0.025
 styles-0.022
 tops-0.022
 bracelet-0.022
Token she
Token ID673
Feature activation+4.73e-03

Pos logit contributions
 boiling+0.003
 Farmer+0.002
 microwave+0.002
us+0.002
 Doctor+0.002
Neg logit contributions
lets-0.002
rings-0.002
 wrist-0.002
 styles-0.002
 brace-0.002
Token black
Token ID2042
Feature activation+7.88e-03

Pos logit contributions
us+0.036
 boiling+0.033
 microwave+0.032
washer+0.031
 Farmer+0.030
Neg logit contributions
lets-0.021
 bracelet-0.018
rings-0.018
 brace-0.017
 wrist-0.017
Token spider
Token ID19230
Feature activation+1.57e-02

Pos logit contributions
us+0.016
 boiling+0.014
 microwave+0.013
 Farmer+0.013
washer+0.013
Neg logit contributions
lets-0.011
 bracelet-0.011
 color-0.010
 wrist-0.010
rings-0.010
Token.
Token ID13
Feature activation-1.34e-02

Pos logit contributions
us+0.031
 boiling+0.030
 microwave+0.030
washer+0.028
 Farmer+0.027
Neg logit contributions
lets-0.024
rings-0.020
 bracelet-0.019
 wrist-0.019
 brace-0.019
Token It
Token ID632
Feature activation+1.88e-03

Pos logit contributions
lets+0.012
rings+0.010
 bracelet+0.009
 wrist+0.009
Her+0.008
Neg logit contributions
us-0.034
 boiling-0.034
 microwave-0.033
washer-0.032
 bulldo-0.031
Token had
Token ID550
Feature activation-6.23e-03

Pos logit contributions
 boiling+0.004
us+0.004
 microwave+0.003
 grizz+0.003
 bulldo+0.003
Neg logit contributions
lets-0.003
rings-0.003
 wrist-0.002
 styles-0.002
packs-0.002
Token eight
Token ID3624
Feature activation-1.67e-03

Pos logit contributions
lets+0.011
rings+0.009
 wrist+0.009
packs+0.009
 styles+0.009
Neg logit contributions
 boiling-0.011
us-0.011
 microwave-0.010
 bulldo-0.010
 Farmer-0.010
Token legs
Token ID7405
Feature activation-1.59e-02

Pos logit contributions
lets+0.002
rings+0.002
 styles+0.002
 wrist+0.002
 brace+0.002
Neg logit contributions
us-0.004
 boiling-0.004
 microwave-0.004
washer-0.003
 bulldo-0.003
Token and
Token ID290
Feature activation+1.70e-03

Pos logit contributions
lets+0.024
rings+0.020
 bracelet+0.020
 styles+0.020
 wrist+0.019
Neg logit contributions
us-0.031
 boiling-0.031
 microwave-0.029
 bulldo-0.028
washer-0.028
Token many
Token ID867
Feature activation+3.21e-04

Pos logit contributions
 boiling+0.003
 microwave+0.003
us+0.003
 bulldo+0.003
 smoking+0.003
Neg logit contributions
lets-0.003
rings-0.003
 styles-0.002
 wrist-0.002
angles-0.002
Token eyes
Token ID2951
Feature activation+1.55e-03

Pos logit contributions
us+0.001
 boiling+0.001
 microwave+0.001
washer+0.001
 bulldo+0.001
Neg logit contributions
lets-0.001
rings-0.000
 bracelet-0.000
 styles-0.000
 wrist-0.000
Token.
Token ID13
Feature activation-1.26e-02

Pos logit contributions
 boiling+0.003
us+0.003
 microwave+0.003
washer+0.003
 bulldo+0.003
Neg logit contributions
lets-0.002
rings-0.002
 bracelet-0.002
 styles-0.002
 wrist-0.002
Token louder
Token ID27089
Feature activation-1.48e-02

Pos logit contributions
us+0.014
 boiling+0.014
 microwave+0.013
 stew+0.012
 bulldo+0.012
Neg logit contributions
lets-0.016
rings-0.015
 bracelet-0.014
 brace-0.014
 tops-0.014
Token and
Token ID290
Feature activation+1.03e-02

Pos logit contributions
lets+0.020
rings+0.017
 bracelet+0.016
 styles+0.016
 wrist+0.016
Neg logit contributions
us-0.031
 boiling-0.031
 microwave-0.030
washer-0.028
 bulldo-0.028
Token louder
Token ID27089
Feature activation-5.34e-03

Pos logit contributions
us+0.016
 boiling+0.015
 microwave+0.015
 bulldo+0.014
washer+0.014
Neg logit contributions
lets-0.019
rings-0.018
 bracelet-0.017
 brace-0.017
 styles-0.017
Token,
Token ID11
Feature activation+1.70e-02

Pos logit contributions
lets+0.007
rings+0.006
 styles+0.005
 bracelet+0.005
 wrist+0.005
Neg logit contributions
 boiling-0.012
us-0.012
 microwave-0.011
washer-0.011
 bulldo-0.011
Token so
Token ID523
Feature activation-1.75e-03

Pos logit contributions
us+0.036
 boiling+0.035
 microwave+0.035
washer+0.033
 bulldo+0.032
Neg logit contributions
lets-0.023
rings-0.020
 bracelet-0.018
 styles-0.018
 wrist-0.018
Token the
Token ID262
Feature activation-7.33e-03

Pos logit contributions
lets+0.003
rings+0.002
 wrist+0.002
 styles+0.002
packs+0.002
Neg logit contributions
 boiling-0.004
us-0.003
 microwave-0.003
 Farmer-0.003
 Doctor-0.003
Token little
Token ID1310
Feature activation-9.00e-03

Pos logit contributions
lets+0.014
rings+0.012
packs+0.011
 wrist+0.011
angles+0.011
Neg logit contributions
 boiling-0.012
 microwave-0.011
us-0.011
washer-0.010
 goblin-0.010
Token girl
Token ID2576
Feature activation+8.40e-04

Pos logit contributions
lets+0.021
angles+0.019
rings+0.019
 cherish+0.018
 wrist+0.018
Neg logit contributions
 goblin-0.009
 boiling-0.009
 microwave-0.008
 scientist-0.008
 pot-0.008
Token decided
Token ID3066
Feature activation-9.92e-03

Pos logit contributions
 boiling+0.002
us+0.002
 microwave+0.002
 Farmer+0.002
 bulldo+0.002
Neg logit contributions
lets-0.001
rings-0.001
 bracelet-0.001
 wrist-0.001
 styles-0.001
Token to
Token ID284
Feature activation-4.55e-03

Pos logit contributions
lets+0.020
packs+0.018
angles+0.018
rings+0.017
 styles+0.017
Neg logit contributions
 boiling-0.013
us-0.013
 microwave-0.013
washer-0.012
 smoking-0.012
Token turn
Token ID1210
Feature activation-5.72e-03

Pos logit contributions
lets+0.007
rings+0.006
 styles+0.005
packs+0.005
 brace+0.005
Neg logit contributions
 boiling-0.009
us-0.009
 microwave-0.009
 Farmer-0.008
 stew-0.008
Token say
Token ID910
Feature activation-1.36e-02

Pos logit contributions
lets+0.037
rings+0.030
packs+0.029
 styles+0.027
 wrist+0.027
Neg logit contributions
 boiling-0.053
us-0.051
 microwave-0.051
 Farmer-0.048
washer-0.048
Token sorry
Token ID7926
Feature activation+2.12e-03

Pos logit contributions
lets+0.020
rings+0.018
 bracelet+0.017
 styles+0.016
 wrist+0.016
Neg logit contributions
us-0.026
 boiling-0.026
 microwave-0.025
 bulldo-0.024
washer-0.024
Token to
Token ID284
Feature activation-1.42e-02

Pos logit contributions
us+0.004
 boiling+0.004
 microwave+0.004
washer+0.004
 bulldo+0.004
Neg logit contributions
lets-0.003
rings-0.003
 bracelet-0.003
 wrist-0.003
 styles-0.003
Token the
Token ID262
Feature activation+1.43e-02

Pos logit contributions
lets+0.021
rings+0.017
packs+0.016
 bracelet+0.016
 wrist+0.016
Neg logit contributions
 boiling-0.029
us-0.029
 microwave-0.028
 Farmer-0.026
 bulldo-0.026
Token ants
Token ID27842
Feature activation+2.55e-03

Pos logit contributions
us+0.028
 boiling+0.026
 microwave+0.025
washer+0.023
 bulldo+0.023
Neg logit contributions
lets-0.022
 bracelet-0.020
rings-0.020
 tops-0.019
 own-0.019
Token and
Token ID290
Feature activation-6.70e-03

Pos logit contributions
us+0.005
 boiling+0.005
 microwave+0.005
washer+0.005
 bulldo+0.005
Neg logit contributions
lets-0.004
rings-0.003
 bracelet-0.003
 styles-0.003
 wrist-0.003
Token leave
Token ID2666
Feature activation+4.33e-03

Pos logit contributions
lets+0.010
rings+0.008
 wrist+0.008
 bracelet+0.008
 styles+0.008
Neg logit contributions
 boiling-0.013
us-0.013
 microwave-0.013
washer-0.012
 Farmer-0.012
Token them
Token ID606
Feature activation-1.61e-02

Pos logit contributions
us+0.009
 boiling+0.009
 microwave+0.009
 bulldo+0.008
washer+0.008
Neg logit contributions
lets-0.006
rings-0.005
 styles-0.005
 bracelet-0.005
packs-0.004
Token alone
Token ID3436
Feature activation+8.77e-03

Pos logit contributions
lets+0.030
rings+0.027
 styles+0.026
angles+0.025
 wrist+0.025
Neg logit contributions
 boiling-0.025
us-0.024
 microwave-0.023
washer-0.022
 bulldo-0.021
Token.
Token ID13
Feature activation-8.01e-03

Pos logit contributions
 boiling+0.018
us+0.018
 microwave+0.018
 bulldo+0.017
washer+0.017
Neg logit contributions
lets-0.012
rings-0.010
 bracelet-0.009
 styles-0.009
 wrist-0.009
Token
Token ID198
Feature activation-2.13e-03

Pos logit contributions
lets+0.011
rings+0.009
 bracelet+0.009
 styles+0.009
 wrist+0.008
Neg logit contributions
us-0.017
 boiling-0.017
 microwave-0.016
 Farmer-0.015
 bulldo-0.015
Token.
Token ID13
Feature activation+9.55e-03

Pos logit contributions
lets+0.003
rings+0.002
 bracelet+0.002
 every+0.002
 styles+0.002
Neg logit contributions
us-0.006
 boiling-0.006
 microwave-0.006
 bulldo-0.006
washer-0.006
Token But
Token ID887
Feature activation-1.15e-02

Pos logit contributions
 boiling+0.020
 microwave+0.020
us+0.020
washer+0.019
 bulldo+0.019
Neg logit contributions
lets-0.012
rings-0.010
packs-0.009
 bracelet-0.009
 brace-0.009
Token then
Token ID788
Feature activation+1.35e-03

Pos logit contributions
lets+0.018
rings+0.015
 styles+0.014
packs+0.014
bows+0.014
Neg logit contributions
 boiling-0.022
us-0.022
 microwave-0.021
washer-0.020
 Farmer-0.020
Token,
Token ID11
Feature activation+2.19e-03

Pos logit contributions
us+0.002
 boiling+0.002
 microwave+0.002
 Farmer+0.002
 goblin+0.002
Neg logit contributions
lets-0.002
rings-0.002
 styles-0.002
 wrist-0.002
angles-0.002
Token a
Token ID257
Feature activation+1.03e-02

Pos logit contributions
 boiling+0.004
us+0.004
 microwave+0.004
washer+0.004
 goblin+0.004
Neg logit contributions
lets-0.004
rings-0.003
 styles-0.003
packs-0.003
 tops-0.003
Token tw
Token ID665
Feature activation-6.35e-03

Pos logit contributions
us+0.029
 boiling+0.026
 microwave+0.025
dogs+0.024
auc+0.024
Neg logit contributions
 bracelet-0.009
 color-0.007
 bow-0.007
 pretty-0.007
 spirit-0.007
Tokeninkle
Token ID19894
Feature activation+7.13e-03

Pos logit contributions
lets+0.010
rings+0.009
 styles+0.008
packs+0.008
bows+0.008
Neg logit contributions
us-0.013
 boiling-0.012
 microwave-0.011
 grizz-0.011
washer-0.011
Token of
Token ID286
Feature activation+9.95e-03

Pos logit contributions
us+0.014
 boiling+0.014
 microwave+0.013
 bulldo+0.012
washer+0.012
Neg logit contributions
lets-0.011
rings-0.009
 bracelet-0.009
 wrist-0.008
 styles-0.008
Token light
Token ID1657
Feature activation+1.98e-02

Pos logit contributions
 boiling+0.020
us+0.019
 microwave+0.019
 Farmer+0.018
 grizz+0.018
Neg logit contributions
lets-0.015
rings-0.013
 wrist-0.012
 styles-0.011
 bracelet-0.011
Token appeared
Token ID4120
Feature activation+2.07e-02

Pos logit contributions
us+0.036
 boiling+0.035
 microwave+0.034
 Farmer+0.031
 bulldo+0.031
Neg logit contributions
lets-0.033
rings-0.029
 bracelet-0.028
 styles-0.027
 wrist-0.027
Token and
Token ID290
Feature activation+1.50e-02

Pos logit contributions
us+0.046
 microwave+0.046
 boiling+0.045
 bulldo+0.042
washer+0.042
Neg logit contributions
lets-0.026
rings-0.021
 bracelet-0.020
packs-0.019
 wrist-0.019
Token and
Token ID290
Feature activation-4.60e-03

Pos logit contributions
lets+0.007
rings+0.006
 brace+0.006
 wrist+0.006
 own+0.006
Neg logit contributions
 boiling-0.009
us-0.009
 microwave-0.009
 stew-0.008
washer-0.008
Token Lily
Token ID20037
Feature activation-1.88e-02

Pos logit contributions
lets+0.008
rings+0.007
 wrist+0.007
angles+0.007
bows+0.007
Neg logit contributions
 boiling-0.007
us-0.007
 microwave-0.007
 Farmer-0.007
 Doctor-0.007
Token played
Token ID2826
Feature activation-2.56e-03

Pos logit contributions
lets+0.029
rings+0.025
 bracelet+0.024
 styles+0.024
 wrist+0.024
Neg logit contributions
us-0.036
 boiling-0.036
 microwave-0.035
washer-0.032
 bulldo-0.032
Token with
Token ID351
Feature activation-3.64e-03

Pos logit contributions
lets+0.003
 wrist+0.003
 bracelet+0.003
 styles+0.003
 tops+0.003
Neg logit contributions
us-0.006
 boiling-0.005
 microwave-0.005
washer-0.005
 grizz-0.005
Token the
Token ID262
Feature activation-1.08e-03

Pos logit contributions
lets+0.006
rings+0.005
 wrist+0.005
 cherish+0.005
bows+0.005
Neg logit contributions
 boiling-0.007
 microwave-0.006
us-0.006
 Farmer-0.006
 meat-0.006
Token ball
Token ID2613
Feature activation-1.03e-02

Pos logit contributions
lets+0.002
rings+0.002
 cherish+0.001
 wrist+0.001
packs+0.001
Neg logit contributions
 boiling-0.002
 microwave-0.002
 bulldo-0.002
us-0.002
 burgers-0.002
Token.
Token ID13
Feature activation-4.41e-03

Pos logit contributions
lets+0.016
rings+0.014
 wrist+0.013
 styles+0.013
 bracelet+0.013
Neg logit contributions
 boiling-0.020
us-0.020
 microwave-0.019
washer-0.018
 bulldo-0.018
Token They
Token ID1119
Feature activation-1.27e-02

Pos logit contributions
lets+0.006
rings+0.005
 bracelet+0.005
 wrist+0.005
 styles+0.005
Neg logit contributions
 boiling-0.009
 microwave-0.009
us-0.009
 bulldo-0.008
 Farmer-0.008
Token laughed
Token ID13818
Feature activation-2.66e-02

Pos logit contributions
lets+0.019
rings+0.017
 wrist+0.016
 bracelet+0.016
 styles+0.016
Neg logit contributions
 boiling-0.025
us-0.024
 microwave-0.024
 bulldo-0.022
washer-0.022
Token and
Token ID290
Feature activation+6.87e-03

Pos logit contributions
lets+0.035
rings+0.029
 bracelet+0.027
 styles+0.027
 wrist+0.027
Neg logit contributions
us-0.057
 boiling-0.057
 microwave-0.056
washer-0.052
 bulldo-0.052
Token had
Token ID550
Feature activation-2.04e-02

Pos logit contributions
 boiling+0.014
us+0.014
 microwave+0.014
washer+0.013
 bulldo+0.013
Neg logit contributions
lets-0.010
rings-0.008
 styles-0.007
 bracelet-0.007
 wrist-0.007
Token thoughtful
Token ID22677
Feature activation+6.93e-03

Pos logit contributions
lets+0.019
rings+0.016
 bracelet+0.015
 styles+0.015
 wrist+0.015
Neg logit contributions
us-0.016
 boiling-0.016
 microwave-0.016
 bulldo-0.014
washer-0.014
Token.
Token ID13
Feature activation+1.14e-02

Pos logit contributions
 boiling+0.015
us+0.015
 microwave+0.014
 bulldo+0.013
washer+0.013
Neg logit contributions
lets-0.009
rings-0.008
 bracelet-0.007
 styles-0.007
 wrist-0.007
Token From
Token ID3574
Feature activation-9.67e-03

Pos logit contributions
us+0.022
 microwave+0.021
 boiling+0.021
 Farmer+0.020
 Doctor+0.020
Neg logit contributions
lets-0.017
rings-0.015
 bracelet-0.014
 cherish-0.014
 wrist-0.014
Token that
Token ID326
Feature activation-2.12e-02

Pos logit contributions
lets+0.016
rings+0.014
packs+0.013
 styles+0.013
 wrist+0.013
Neg logit contributions
us-0.018
 boiling-0.018
 microwave-0.017
 Farmer-0.016
washer-0.016
Token day
Token ID1110
Feature activation+1.57e-02

Pos logit contributions
lets+0.031
rings+0.026
 bracelet+0.024
 styles+0.024
 wrist+0.024
Neg logit contributions
us-0.043
 boiling-0.043
 microwave-0.042
washer-0.039
 bulldo-0.039
Token on
Token ID319
Feature activation-3.93e-02

Pos logit contributions
us+0.028
 boiling+0.028
 microwave+0.027
washer+0.025
 bulldo+0.025
Neg logit contributions
lets-0.026
rings-0.023
 bracelet-0.022
 styles-0.022
 wrist-0.021
Token,
Token ID11
Feature activation+2.27e-03

Pos logit contributions
lets+0.076
rings+0.068
 wrist+0.062
 styles+0.062
 cherish+0.059
Neg logit contributions
 boiling-0.064
us-0.063
 Farmer-0.060
 microwave-0.059
 Doctor-0.057
Token Lily
Token ID20037
Feature activation-1.91e-02

Pos logit contributions
 Doctor+0.003
 Farmer+0.003
 boiling+0.003
 scientist+0.003
us+0.003
Neg logit contributions
lets-0.005
 wrist-0.004
rings-0.004
packs-0.004
inated-0.004
Token became
Token ID2627
Feature activation-3.10e-03

Pos logit contributions
lets+0.028
rings+0.025
 wrist+0.023
 bracelet+0.023
 styles+0.022
Neg logit contributions
us-0.038
 boiling-0.037
 microwave-0.036
washer-0.034
 bulldo-0.034
Token known
Token ID1900
Feature activation-2.01e-02

Pos logit contributions
lets+0.005
rings+0.004
 wrist+0.004
 bracelet+0.004
 styles+0.004
Neg logit contributions
us-0.006
 boiling-0.006
 microwave-0.006
 Farmer-0.005
 bulldo-0.005
Token as
Token ID355
Feature activation+3.99e-02

Pos logit contributions
lets+0.030
rings+0.025
 styles+0.024
 wrist+0.024
packs+0.024
Neg logit contributions
 boiling-0.039
us-0.038
 Farmer-0.036
 microwave-0.036
 Doctor-0.035
Token to
Token ID284
Feature activation-2.77e-03

Pos logit contributions
us+0.002
 boiling+0.002
 microwave+0.002
washer+0.002
 bulldo+0.002
Neg logit contributions
lets-0.001
rings-0.001
 bracelet-0.001
 wrist-0.001
 styles-0.001
Token play
Token ID711
Feature activation-1.06e-02

Pos logit contributions
lets+0.004
rings+0.003
packs+0.003
 bracelet+0.003
 styles+0.003
Neg logit contributions
 boiling-0.006
us-0.006
 microwave-0.006
 bulldo-0.005
washer-0.005
Token in
Token ID287
Feature activation-1.22e-02

Pos logit contributions
lets+0.011
 bracelet+0.008
rings+0.008
 wrist+0.008
 styles+0.008
Neg logit contributions
 boiling-0.026
us-0.025
 microwave-0.025
 Farmer-0.023
washer-0.023
Token,"
Token ID553
Feature activation-2.78e-03

Pos logit contributions
lets+0.020
rings+0.017
bows+0.016
 styles+0.016
 bracelet+0.015
Neg logit contributions
 boiling-0.024
us-0.023
 microwave-0.022
 bulldo-0.021
washer-0.021
Token thought
Token ID1807
Feature activation+6.16e-03

Pos logit contributions
lets+0.006
rings+0.005
packs+0.005
 bracelet+0.005
 wrist+0.005
Neg logit contributions
us-0.004
 boiling-0.004
 microwave-0.004
 bulldo-0.004
 Farmer-0.003
Token Spe
Token ID2531
Feature activation-4.06e-02

Pos logit contributions
us+0.012
 boiling+0.012
 microwave+0.012
 bulldo+0.011
washer+0.011
Neg logit contributions
lets-0.009
rings-0.008
 bracelet-0.008
 wrist-0.008
 styles-0.007
Tokenedy
Token ID4716
Feature activation-2.08e-02

Pos logit contributions
lets+0.061
rings+0.059
packs+0.056
 bracelet+0.054
 tops+0.053
Neg logit contributions
us-0.077
 bulldo-0.069
washer-0.066
 microwave-0.065
 boiling-0.064
Token.
Token ID13
Feature activation+1.27e-02

Pos logit contributions
lets+0.033
rings+0.030
 bracelet+0.029
angles+0.029
adays+0.029
Neg logit contributions
us-0.036
washer-0.034
 microwave-0.033
 boiling-0.033
 bulldo-0.033
Token So
Token ID1406
Feature activation-2.25e-02

Pos logit contributions
 boiling+0.028
us+0.026
 microwave+0.026
 stew+0.024
washer+0.024
Neg logit contributions
lets-0.018
rings-0.016
 wrist-0.014
 styles-0.014
 tops-0.014
Token,
Token ID11
Feature activation+1.05e-02

Pos logit contributions
lets+0.039
rings+0.034
 wrist+0.033
 bracelet+0.033
 styles+0.033
Neg logit contributions
us-0.038
 bulldo-0.036
 boiling-0.035
 Farmer-0.035
 microwave-0.035
Token he
Token ID339
Feature activation+9.72e-03

Pos logit contributions
us+0.017
 boiling+0.016
 microwave+0.016
 bulldo+0.015
 Farmer+0.015
Neg logit contributions
lets-0.020
rings-0.017
 wrist-0.017
 bracelet-0.016
 styles-0.016
Token,
Token ID11
Feature activation+6.50e-03

Pos logit contributions
lets+0.021
rings+0.019
 wrist+0.018
 wrists+0.018
packs+0.017
Neg logit contributions
us-0.018
 boiling-0.017
washer-0.016
 microwave-0.016
 Doctor-0.015
Token very
Token ID845
Feature activation-1.13e-02

Pos logit contributions
 boiling+0.012
us+0.012
 microwave+0.012
washer+0.011
 bulldo+0.011
Neg logit contributions
lets-0.010
rings-0.009
 bracelet-0.008
 wrist-0.008
 styles-0.008
Token much
Token ID881
Feature activation+1.15e-02

Pos logit contributions
lets+0.020
rings+0.017
 wrist+0.016
 bracelet+0.016
 brace+0.016
Neg logit contributions
us-0.020
 boiling-0.020
 microwave-0.019
washer-0.018
 Farmer-0.018
Token.
Token ID13
Feature activation+6.59e-03

Pos logit contributions
us+0.026
 boiling+0.026
 microwave+0.026
 bulldo+0.024
washer+0.024
Neg logit contributions
lets-0.013
 every-0.011
 styles-0.010
rings-0.010
 bracelet-0.010
Token<|endoftext|>
Token ID50256
Feature activation+4.07e-02

Pos logit contributions
us+0.013
 boiling+0.013
 microwave+0.013
washer+0.012
 bulldo+0.012
Neg logit contributions
lets-0.010
rings-0.008
 bracelet-0.008
 styles-0.008
 wrist-0.008
TokenOnce
Token ID7454
Feature activation-4.00e-02

Pos logit contributions
 boiling+0.087
 microwave+0.084
us+0.083
washer+0.079
 bulldo+0.079
Neg logit contributions
lets-0.051
rings-0.043
packs-0.042
 tops-0.040
 bracelet-0.040
Token upon
Token ID2402
Feature activation-1.06e-02

Pos logit contributions
lets+0.072
 wrist+0.061
 styles+0.061
 wrists+0.060
 brace+0.059
Neg logit contributions
us-0.074
 boiling-0.070
washer-0.067
 microwave-0.066
 bulldo-0.061
Token a
Token ID257
Feature activation+6.02e-04

Pos logit contributions
lets+0.020
rings+0.017
 styles+0.016
angles+0.016
 wrists+0.016
Neg logit contributions
 boiling-0.018
us-0.018
 microwave-0.017
 Farmer-0.015
washer-0.015
Token time
Token ID640
Feature activation+1.30e-02

Pos logit contributions
 boiling+0.001
us+0.001
 microwave+0.001
 Farmer+0.001
 bulldo+0.001
Neg logit contributions
lets-0.001
rings-0.001
 styles-0.001
 wrist-0.001
 bracelet-0.001
Token,
Token ID11
Feature activation-1.34e-02

Pos logit contributions
us+0.024
 boiling+0.022
 microwave+0.021
washer+0.021
 bulldo+0.021
Neg logit contributions
lets-0.023
rings-0.020
 styles-0.019
 wrists-0.019
 wrist-0.019
Token there
Token ID612
Feature activation-2.14e-02

Pos logit contributions
lets+0.026
rings+0.023
 styles+0.023
 wrist+0.022
 brace+0.022
Neg logit contributions
 boiling-0.021
us-0.021
 microwave-0.020
washer-0.019
 bulldo-0.018
Token winning
Token ID5442
Feature activation-3.70e-02

Pos logit contributions
lets+0.014
rings+0.012
 bracelet+0.011
 styles+0.011
 wrist+0.011
Neg logit contributions
us-0.015
 boiling-0.015
 microwave-0.014
washer-0.013
 bulldo-0.013
Token a
Token ID257
Feature activation-5.12e-03

Pos logit contributions
lets+0.061
rings+0.055
 wrist+0.051
 bracelet+0.050
Her+0.048
Neg logit contributions
us-0.065
 boiling-0.062
 bulldo-0.059
 Farmer-0.058
washer-0.058
Token race
Token ID3234
Feature activation-2.15e-02

Pos logit contributions
lets+0.007
rings+0.005
 bracelet+0.005
 styles+0.005
 wrist+0.005
Neg logit contributions
 boiling-0.011
us-0.011
 microwave-0.011
washer-0.010
 Farmer-0.010
Token.
Token ID13
Feature activation+1.02e-02

Pos logit contributions
lets+0.034
rings+0.030
 bracelet+0.027
 wrist+0.027
 brace+0.027
Neg logit contributions
us-0.040
 boiling-0.038
 microwave-0.037
washer-0.037
 bulldo-0.036
Token<|endoftext|>
Token ID50256
Feature activation+3.94e-02

Pos logit contributions
 boiling+0.021
us+0.021
 microwave+0.020
washer+0.019
 bulldo+0.019
Neg logit contributions
lets-0.014
rings-0.012
 bracelet-0.011
 wrist-0.011
 styles-0.011
TokenOnce
Token ID7454
Feature activation-4.26e-02

Pos logit contributions
 boiling+0.091
 microwave+0.088
us+0.087
washer+0.083
 bulldo+0.083
Neg logit contributions
lets-0.043
rings-0.034
packs-0.034
 own-0.032
 tops-0.032
Token upon
Token ID2402
Feature activation-1.01e-02

Pos logit contributions
lets+0.073
 wrist+0.062
 styles+0.062
 wrists+0.061
rings+0.061
Neg logit contributions
us-0.081
 boiling-0.077
washer-0.073
 microwave-0.071
 Farmer-0.068
Token a
Token ID257
Feature activation+1.36e-03

Pos logit contributions
lets+0.018
rings+0.015
 styles+0.015
 wrist+0.014
 wrists+0.014
Neg logit contributions
 boiling-0.018
us-0.018
 microwave-0.017
 Farmer-0.015
washer-0.015
Token time
Token ID640
Feature activation+9.73e-03

Pos logit contributions
 boiling+0.003
us+0.003
 microwave+0.003
 Farmer+0.002
 bulldo+0.002
Neg logit contributions
lets-0.002
rings-0.002
 styles-0.002
 wrist-0.002
 bracelet-0.002
Token,
Token ID11
Feature activation-1.48e-02

Pos logit contributions
us+0.016
 boiling+0.014
 bulldo+0.014
 Farmer+0.014
washer+0.013
Neg logit contributions
lets-0.019
 styles-0.017
rings-0.017
 wrists-0.016
 wrist-0.016
Token there
Token ID612
Feature activation-2.42e-02

Pos logit contributions
lets+0.031
 styles+0.029
rings+0.028
 wrists+0.027
 wrist+0.027
Neg logit contributions
 boiling-0.021
us-0.021
 microwave-0.019
washer-0.018
 bulldo-0.018
Token forgot
Token ID16453
Feature activation-1.61e-02

Pos logit contributions
lets+0.024
rings+0.020
 wrist+0.019
 bracelet+0.019
 styles+0.019
Neg logit contributions
us-0.031
 boiling-0.031
 microwave-0.030
 bulldo-0.028
washer-0.028
Token it
Token ID340
Feature activation+6.08e-03

Pos logit contributions
lets+0.023
rings+0.019
 wrist+0.018
 bracelet+0.018
 styles+0.018
Neg logit contributions
 boiling-0.033
us-0.033
 microwave-0.032
washer-0.030
 Farmer-0.029
Token.
Token ID13
Feature activation+2.71e-03

Pos logit contributions
 boiling+0.011
us+0.011
 microwave+0.011
washer+0.010
 Farmer+0.010
Neg logit contributions
lets-0.010
rings-0.009
 wrist-0.008
 styles-0.008
 bracelet-0.008
Token<|endoftext|>
Token ID50256
Feature activation+4.16e-02

Pos logit contributions
us+0.005
 boiling+0.005
 microwave+0.005
 Farmer+0.005
 Doctor+0.004
Neg logit contributions
lets-0.005
rings-0.004
 bracelet-0.004
 brace-0.004
 styles-0.004
TokenOnce
Token ID7454
Feature activation-4.79e-02

Pos logit contributions
 boiling+0.079
 microwave+0.075
us+0.075
 bulldo+0.070
washer+0.070
Neg logit contributions
lets-0.063
rings-0.054
packs-0.053
 tops-0.051
 bracelet-0.051
Token there
Token ID612
Feature activation-4.19e-02

Pos logit contributions
lets+0.087
 styles+0.075
 wrists+0.074
 wrist+0.074
 brace+0.073
Neg logit contributions
us-0.087
 boiling-0.081
washer-0.080
 microwave-0.076
 bulldo-0.072
Token was
Token ID373
Feature activation-1.08e-03

Pos logit contributions
lets+0.089
rings+0.077
 wrists+0.074
 brace+0.073
 styles+0.073
Neg logit contributions
us-0.060
 boiling-0.057
washer-0.055
 microwave-0.054
 Farmer-0.054
Token a
Token ID257
Feature activation+6.89e-03

Pos logit contributions
lets+0.002
rings+0.002
 styles+0.001
 wrist+0.001
 bracelet+0.001
Neg logit contributions
 boiling-0.002
us-0.002
 microwave-0.002
 Farmer-0.002
 bulldo-0.002
Token cow
Token ID9875
Feature activation+1.65e-03

Pos logit contributions
 boiling+0.015
us+0.015
 microwave+0.014
washer+0.014
 grizz+0.013
Neg logit contributions
lets-0.009
 bracelet-0.007
 own-0.007
 wrist-0.007
 color-0.007
Token.
Token ID13
Feature activation-2.05e-02

Pos logit contributions
us+0.003
 boiling+0.003
 microwave+0.002
washer+0.002
 bulldo+0.002
Neg logit contributions
lets-0.003
rings-0.003
 styles-0.003
 bracelet-0.003
packs-0.003
Token She
Token ID1375
Feature activation-7.41e-03

Pos logit contributions
lets+0.030
 bracelet+0.025
rings+0.025
 styles+0.025
angles+0.024
Neg logit contributions
 boiling-0.041
us-0.040
 microwave-0.040
 Farmer-0.039
 grizz-0.037