TOP ACTIVATIONS
MAX = 0.225

my takes Lily to the salon . There is a nice
took him to the hair salon and they waited for their
couldn 't run in the salon . When it
have to go to the salon . They have to get
Lily went to the hair salon with her mom my .
to take her to the salon . Her mom said yes
took her to the hair salon . The ha ird ress
." At the salon , Lily observed the other
Amy rolled out of the salon , her new haircut in
loved going to the hair salon and getting different hairc uts
Sue . They leave the salon with big smiles . Lily
She went to the hair salon and got her new haircut
they would go to the salon and fix her hair .
mom went to the hair salon . The little girl was
He sn uck into a salon and found the bar ber
They are for the hair salon !" He ran
never been to a hair salon before and felt a little
afraid to go to the salon anymore . <|endoftext|> Once upon
day , working at his salon and running err ands .
She said she hoped the salon had some candy . She

MIN ACTIVATIONS
MIN = -0.171

loo and pretended to be Esk im os . After a
fox , put the honey com bs on top of his
was a stack of honey com bs ! The
the puppy and the cheer full er times they had together
his favourite treat - ch oc - chip ice cream !
The puppy was much cheer full er now . He said
day , the rh in oc eros felt much better .
away . The rh in oc eros felt che ery and
they could grow more av oc ados and share them with
new ! She thought av oc ados were the most interesting
unicorn took the rh in oc eros to a nearby pond
to see the rh in oc eros anymore . He is
celebrated . The rh in oc eros learned that humans can
wait until the rh in oc eros is gone ."
The rh in oc eros looked up , hopeful
Finally , the rh in oc eros remembered that animals often
not see the rh in oc eros . They asked a
there is a rh in oc eros in the park .
sailed for days , exploring unch arted waters until he came
, and the rh in oc eros remembered that life could

INTERVAL 0.126 - 0.225
CONTAINS 0.008%

She loved to bake cup cakes with her mom my .
blue wheels and pink handle bars . She liked to go
Sue . They leave the salon with big smiles . Lily
day , working at his salon and running err ands .
I can wear my rain coat and boots . Please mom

INTERVAL 0.027 - 0.126
CONTAINS 9.662%

and her mom my knew something was wrong .
Tim my . Tim my loved to play with blocks .
see what surprise her parents were about to reveal . When
. They like to play with a basketball . They bounce
back my hat !" Lily shouted . " Come

INTERVAL -0.072 - 0.027
CONTAINS 89.942%

sad . His bike was gone ! He knew a thief
best day ever . <|endoftext|> One day , a little girl
his ball . One day , Spot 's ball rolled under
, there was a little girl named Lily . She loved
The judge gave George lots of hugs and said

INTERVAL -0.171 - -0.072
CONTAINS 0.388%

like Jack . They have names like Black beard or Hook
was a dog . His name was Tim . One day
on the door . Mom opens the door and smiles .
ave the shark and they sw am together to find Lucky
day on , Lily made sure to always bring her glove
Tokenmy
Token ID1820
Feature activation-2.28e-02

Pos logit contributions
side+0.017
 salon+0.016
fs+0.016
cakes+0.015
but+0.015
Neg logit contributions
 name-0.008
 wonder-0.006
 know-0.006
 lives-0.006
 can-0.006
Token takes
Token ID2753
Feature activation+1.78e-03

Pos logit contributions
 name+0.026
 can+0.022
 comes+0.022
 wonder+0.022
 know+0.022
Neg logit contributions
side-0.060
fs-0.056
 salon-0.054
cakes-0.052
but-0.052
Token Lily
Token ID20037
Feature activation+6.40e-03

Pos logit contributions
side+0.004
fs+0.004
 salon+0.004
cakes+0.004
but+0.004
Neg logit contributions
 name-0.002
 can-0.002
 wonder-0.002
 wants-0.002
 lives-0.002
Token to
Token ID284
Feature activation+8.65e-03

Pos logit contributions
side+0.018
fs+0.018
ves+0.017
cakes+0.017
ized+0.017
Neg logit contributions
 name-0.006
 can-0.005
 comes-0.005
 know-0.004
 lives-0.004
Token the
Token ID262
Feature activation+1.37e-03

Pos logit contributions
side+0.023
fs+0.021
 salon+0.021
but+0.020
cakes+0.020
Neg logit contributions
 name-0.010
 can-0.008
 comes-0.008
 wonder-0.008
 lives-0.008
Token salon
Token ID46608
Feature activation+2.22e-01

Pos logit contributions
fs+0.003
side+0.003
ves+0.003
ized+0.003
but+0.003
Neg logit contributions
 name-0.002
 can-0.002
 lives-0.002
 wonder-0.002
 names-0.002
Token.
Token ID13
Feature activation-2.08e-03

Pos logit contributions
fs+0.640
side+0.624
cakes+0.587
liv+0.584
bly+0.583
Neg logit contributions
 name-0.183
 know-0.143
 wants-0.140
 can-0.138
 lives-0.136
Token There
Token ID1318
Feature activation-3.16e-02

Pos logit contributions
 name+0.003
 lives+0.003
 comes+0.003
 wants+0.003
 know+0.003
Neg logit contributions
side-0.005
 salon-0.004
fs-0.004
but-0.004
cakes-0.004
Token is
Token ID318
Feature activation-5.28e-02

Pos logit contributions
 name+0.036
 wonder+0.030
 comes+0.030
 can+0.030
 know+0.030
Neg logit contributions
side-0.083
fs-0.078
 salon-0.076
cakes-0.073
but-0.072
Token a
Token ID257
Feature activation-5.43e-03

Pos logit contributions
 can+0.036
 name+0.035
 is+0.026
 names+0.026
 wonder+0.026
Neg logit contributions
ized-0.166
fs-0.163
quet-0.162
side-0.158
but-0.150
Token nice
Token ID3621
Feature activation-4.48e-03

Pos logit contributions
 name+0.007
 can+0.006
 wonder+0.006
 lives+0.006
 comes+0.006
Neg logit contributions
side-0.014
fs-0.014
ized-0.013
but-0.013
cakes-0.012
Token took
Token ID1718
Feature activation+2.76e-02

Pos logit contributions
 name+0.002
 can+0.002
 comes+0.002
 wonder+0.002
 know+0.002
Neg logit contributions
side-0.004
fs-0.004
 salon-0.004
but-0.003
cakes-0.003
Token him
Token ID683
Feature activation-1.22e-02

Pos logit contributions
side+0.062
fs+0.057
cakes+0.053
 salon+0.053
ized+0.053
Neg logit contributions
 name-0.040
 wants-0.037
 lives-0.036
 can-0.036
 comes-0.035
Token to
Token ID284
Feature activation+1.76e-02

Pos logit contributions
 name+0.015
 can+0.013
 know+0.013
 comes+0.013
 lives+0.013
Neg logit contributions
side-0.031
fs-0.029
 salon-0.028
ized-0.027
cakes-0.027
Token the
Token ID262
Feature activation+1.43e-02

Pos logit contributions
side+0.045
 salon+0.041
fs+0.041
but+0.038
ized+0.038
Neg logit contributions
 name-0.021
 comes-0.019
 can-0.019
 wants-0.019
 lives-0.019
Token hair
Token ID4190
Feature activation+4.01e-02

Pos logit contributions
side+0.030
fs+0.029
ized+0.026
ves+0.026
cakes+0.025
Neg logit contributions
 name-0.026
 can-0.023
 lives-0.022
 wonder-0.022
 comes-0.021
Token salon
Token ID46608
Feature activation+2.17e-01

Pos logit contributions
fs+0.078
bly+0.074
ves+0.070
quet+0.069
ized+0.068
Neg logit contributions
 can-0.062
 name-0.062
 is-0.058
 be-0.056
 and-0.056
Token and
Token ID290
Feature activation+1.31e-02

Pos logit contributions
side+0.595
fs+0.589
pan+0.548
cakes+0.543
 therapist+0.543
Neg logit contributions
 name-0.204
 know-0.164
 lives-0.156
 wants-0.151
 can-0.146
Token they
Token ID484
Feature activation+1.47e-02

Pos logit contributions
side+0.035
fs+0.033
ves+0.031
ized+0.030
quet+0.030
Neg logit contributions
 name-0.015
 can-0.014
 know-0.014
 comes-0.013
 wonder-0.013
Token waited
Token ID13488
Feature activation+2.44e-02

Pos logit contributions
side+0.039
fs+0.037
 salon+0.035
but+0.034
cakes+0.034
Neg logit contributions
 name-0.016
 can-0.015
 comes-0.014
 wonder-0.014
 know-0.014
Token for
Token ID329
Feature activation+1.35e-02

Pos logit contributions
side+0.067
fs+0.064
 salon+0.063
liv+0.062
 therapist+0.061
Neg logit contributions
 name-0.025
 can-0.021
 comes-0.019
 wonder-0.019
 names-0.018
Token their
Token ID511
Feature activation+1.42e-02

Pos logit contributions
side+0.031
fs+0.029
ized+0.027
 salon+0.026
cakes+0.026
Neg logit contributions
 name-0.020
 wants-0.018
 comes-0.018
 lives-0.017
 can-0.017
Token couldn
Token ID3521
Feature activation+1.17e-02

Pos logit contributions
side+0.069
fs+0.066
 salon+0.064
ves+0.062
cakes+0.061
Neg logit contributions
 name-0.037
 wonder-0.032
 comes-0.031
 know-0.030
 names-0.030
Token't
Token ID470
Feature activation+5.50e-02

Pos logit contributions
side+0.026
fs+0.024
 salon+0.022
ized+0.022
but+0.022
Neg logit contributions
 name-0.018
 can-0.017
 comes-0.017
 know-0.016
 lives-0.016
Token run
Token ID1057
Feature activation-1.18e-02

Pos logit contributions
side+0.155
fs+0.144
 salon+0.137
cakes+0.135
ized+0.134
Neg logit contributions
 name-0.059
 know-0.052
 can-0.051
 wonder-0.049
 comes-0.048
Token in
Token ID287
Feature activation+2.62e-03

Pos logit contributions
 name+0.017
 can+0.016
 comes+0.015
 know+0.015
 wants+0.014
Neg logit contributions
side-0.028
fs-0.026
 salon-0.025
cakes-0.025
liv-0.024
Token the
Token ID262
Feature activation+1.01e-02

Pos logit contributions
side+0.007
fs+0.007
cakes+0.007
 salon+0.007
but+0.006
Neg logit contributions
 name-0.003
 wonder-0.002
 can-0.002
 know-0.002
 comes-0.002
Token salon
Token ID46608
Feature activation+2.15e-01

Pos logit contributions
side+0.022
fs+0.021
ves+0.019
ized+0.019
but+0.019
Neg logit contributions
 name-0.016
 can-0.015
 comes-0.015
 lives-0.015
 wonder-0.015
Token.
Token ID13
Feature activation+5.09e-03

Pos logit contributions
side+0.567
fs+0.558
liv+0.538
cakes+0.528
ized+0.509
Neg logit contributions
 name-0.222
 can-0.199
 know-0.194
 wants-0.193
 comes-0.182
Token
Token ID198
Feature activation+1.11e-02

Pos logit contributions
side+0.013
fs+0.012
 salon+0.012
cakes+0.012
but+0.012
Neg logit contributions
 name-0.006
 wonder-0.005
 comes-0.005
 lives-0.005
 can-0.005
Token
Token ID198
Feature activation+1.09e-02

Pos logit contributions
side+0.031
fs+0.029
 salon+0.028
but+0.027
ves+0.027
Neg logit contributions
 name-0.011
 comes-0.009
 wants-0.009
 wonder-0.009
 lives-0.009
TokenWhen
Token ID2215
Feature activation-1.17e-02

Pos logit contributions
pan+0.023
 salon+0.022
ier+0.022
side+0.022
fs+0.022
Neg logit contributions
 name-0.017
 They-0.015
 know-0.015
 can-0.015
 wants-0.015
Token it
Token ID340
Feature activation+6.01e-03

Pos logit contributions
 name+0.021
 can+0.019
 comes+0.019
 wonder+0.019
 know+0.019
Neg logit contributions
side-0.023
fs-0.022
 salon-0.021
but-0.020
cakes-0.019
Token have
Token ID423
Feature activation-2.35e-02

Pos logit contributions
 name+0.007
 can+0.006
 comes+0.006
 wonder+0.006
 know+0.006
Neg logit contributions
side-0.027
fs-0.025
 salon-0.024
but-0.024
cakes-0.024
Token to
Token ID284
Feature activation-1.50e-03

Pos logit contributions
 can+0.022
 name+0.021
 names+0.020
 wants+0.019
 comes+0.018
Neg logit contributions
fs-0.069
side-0.066
but-0.063
quet-0.063
liv-0.061
Token go
Token ID467
Feature activation-2.32e-02

Pos logit contributions
 name+0.001
 know+0.001
 wonder+0.001
 be+0.001
 ask+0.001
Neg logit contributions
side-0.005
fs-0.004
cakes-0.004
but-0.004
uddled-0.004
Token to
Token ID284
Feature activation-2.05e-03

Pos logit contributions
 name+0.025
 can+0.022
 comes+0.021
 know+0.021
 wonder+0.020
Neg logit contributions
side-0.062
fs-0.062
liv-0.058
ves-0.058
ized-0.057
Token the
Token ID262
Feature activation-7.68e-03

Pos logit contributions
 name+0.003
 comes+0.002
 can+0.002
 wants+0.002
 lives+0.002
Neg logit contributions
side-0.005
 salon-0.005
fs-0.004
but-0.004
ves-0.004
Token salon
Token ID46608
Feature activation+2.13e-01

Pos logit contributions
 name+0.011
 lives+0.010
 can+0.010
 wonder+0.010
 names+0.009
Neg logit contributions
side-0.018
fs-0.018
ves-0.016
but-0.016
ized-0.016
Token.
Token ID13
Feature activation-7.08e-03

Pos logit contributions
fs+0.574
side+0.561
liv+0.529
cakes+0.525
 therapist+0.516
Neg logit contributions
 name-0.215
 wants-0.185
 know-0.185
 lives-0.181
 can-0.176
Token They
Token ID1119
Feature activation-2.31e-02

Pos logit contributions
 name+0.006
 lives+0.006
 comes+0.006
 wants+0.005
 can+0.005
Neg logit contributions
side-0.020
 salon-0.018
fs-0.017
but-0.017
cakes-0.017
Token have
Token ID423
Feature activation-2.48e-02

Pos logit contributions
 name+0.020
 can+0.017
 wonder+0.017
 comes+0.017
 know+0.016
Neg logit contributions
side-0.067
fs-0.063
 salon-0.061
but-0.059
cakes-0.059
Token to
Token ID284
Feature activation-2.71e-03

Pos logit contributions
 can+0.024
 name+0.024
 names+0.022
 comes+0.022
 wonder+0.022
Neg logit contributions
fs-0.069
side-0.067
ized-0.064
but-0.064
quet-0.063
Token get
Token ID651
Feature activation+1.89e-02

Pos logit contributions
 name+0.003
 know+0.002
 wonder+0.002
 see+0.002
 be+0.002
Neg logit contributions
side-0.008
fs-0.008
cakes-0.007
but-0.007
uddled-0.007
Token Lily
Token ID20037
Feature activation+1.67e-02

Pos logit contributions
 name+0.010
 wonders+0.009
 wonder+0.009
 lives+0.009
 hopes+0.009
Neg logit contributions
side-0.013
 salon-0.012
cakes-0.012
fs-0.011
cake-0.011
Token went
Token ID1816
Feature activation-1.12e-02

Pos logit contributions
side+0.033
brush+0.028
 salon+0.027
ized+0.027
he+0.027
Neg logit contributions
 name-0.026
 names-0.023
 Don-0.022
 Sharks-0.021
roc-0.021
Token to
Token ID284
Feature activation+1.34e-02

Pos logit contributions
 name+0.017
 wonder+0.015
 can+0.015
 know+0.015
 wants+0.015
Neg logit contributions
side-0.025
fs-0.023
 salon-0.022
but-0.021
cakes-0.021
Token the
Token ID262
Feature activation+1.14e-02

Pos logit contributions
 salon+0.023
side+0.022
pron+0.020
 she+0.019
she+0.018
Neg logit contributions
 wants-0.021
 knows-0.021
rr-0.021
 comes-0.020
 name-0.020
Token hair
Token ID4190
Feature activation+3.47e-02

Pos logit contributions
side+0.025
fs+0.024
 salon+0.022
but+0.021
cakes+0.021
Neg logit contributions
 name-0.018
 can-0.016
 wonder-0.016
 comes-0.016
 lives-0.016
Token salon
Token ID46608
Feature activation+2.12e-01

Pos logit contributions
fs+0.035
side+0.032
bly+0.030
ized+0.029
pan+0.025
Neg logit contributions
 name-0.095
 is-0.087
 can-0.087
 lives-0.086
 wants-0.086
Token with
Token ID351
Feature activation+3.59e-02

Pos logit contributions
side+0.533
fs+0.519
pan+0.487
liv+0.478
ized+0.470
Neg logit contributions
 name-0.276
 know-0.226
 lives-0.218
 wants-0.214
 comes-0.209
Token her
Token ID607
Feature activation+6.29e-03

Pos logit contributions
side+0.059
fs+0.054
 salon+0.051
brush+0.050
cakes+0.048
Neg logit contributions
 name-0.069
 lives-0.066
 can-0.066
 wants-0.066
 comes-0.065
Token mom
Token ID1995
Feature activation+1.22e-02

Pos logit contributions
side+0.017
fs+0.016
 salon+0.016
but+0.015
brush+0.015
Neg logit contributions
 name-0.006
 comes-0.006
 lives-0.005
 wants-0.005
 wonder-0.005
Tokenmy
Token ID1820
Feature activation-4.37e-03

Pos logit contributions
side+0.031
fs+0.029
 salon+0.028
cakes+0.027
ves+0.027
Neg logit contributions
 name-0.015
 can-0.013
 comes-0.013
 wonder-0.012
 know-0.012
Token.
Token ID13
Feature activation+2.59e-02

Pos logit contributions
 name+0.005
 can+0.005
 comes+0.004
 know+0.004
 lives+0.004
Neg logit contributions
side-0.011
fs-0.011
 salon-0.010
ves-0.010
liv-0.010
Token to
Token ID284
Feature activation+5.13e-03

Pos logit contributions
side+0.003
fs+0.003
 salon+0.003
ized+0.003
liv+0.003
Neg logit contributions
 name-0.001
 can-0.001
 comes-0.001
 know-0.001
 wonder-0.001
Token take
Token ID1011
Feature activation+4.56e-03

Pos logit contributions
side+0.015
fs+0.014
liv+0.013
ves+0.013
ized+0.013
Neg logit contributions
 name-0.005
 know-0.004
 can-0.004
 wonder-0.004
 ask-0.004
Token her
Token ID607
Feature activation-6.55e-04

Pos logit contributions
side+0.010
fs+0.009
 salon+0.008
cakes+0.008
but+0.008
Neg logit contributions
 name-0.008
 can-0.007
 wants-0.007
 comes-0.007
 wonder-0.007
Token to
Token ID284
Feature activation+6.71e-03

Pos logit contributions
 name+0.001
 can+0.001
 names+0.001
 comes+0.001
 know+0.001
Neg logit contributions
side-0.002
fs-0.002
ized-0.002
cakes-0.002
liv-0.002
Token the
Token ID262
Feature activation+4.60e-03

Pos logit contributions
side+0.017
 salon+0.016
fs+0.016
but+0.015
ized+0.015
Neg logit contributions
 name-0.008
 comes-0.007
 wants-0.007
 can-0.007
 lives-0.007
Token salon
Token ID46608
Feature activation+2.11e-01

Pos logit contributions
side+0.009
fs+0.009
ized+0.008
ves+0.008
but+0.008
Neg logit contributions
 name-0.009
 can-0.008
 lives-0.007
 names-0.007
 wonder-0.007
Token.
Token ID13
Feature activation+3.06e-03

Pos logit contributions
side+0.565
fs+0.561
liv+0.531
ized+0.520
 therapist+0.517
Neg logit contributions
 name-0.228
 know-0.186
 can-0.167
 ask-0.166
 lives-0.165
Token Her
Token ID2332
Feature activation+3.37e-02

Pos logit contributions
side+0.007
 salon+0.006
fs+0.006
cakes+0.006
but+0.006
Neg logit contributions
 name-0.005
 lives-0.004
 comes-0.004
 wants-0.004
 know-0.004
Token mom
Token ID1995
Feature activation+4.44e-03

Pos logit contributions
 salon+0.079
side+0.079
fs+0.073
brush+0.073
cakes+0.070
Neg logit contributions
 comes-0.038
 know-0.036
 wonder-0.035
 lives-0.035
 wants-0.035
Token said
Token ID531
Feature activation+2.54e-02

Pos logit contributions
side+0.011
fs+0.010
 salon+0.010
but+0.010
pan+0.009
Neg logit contributions
 name-0.006
 can-0.005
 comes-0.005
 wants-0.005
 know-0.005
Token yes
Token ID3763
Feature activation+2.25e-02

Pos logit contributions
side+0.067
handed+0.059
ized+0.059
liv+0.058
fs+0.058
Neg logit contributions
 can-0.036
 name-0.033
 comes-0.032
 is-0.029
 wants-0.028
Token took
Token ID1718
Feature activation+1.05e-02

Pos logit contributions
 name+0.039
 wonder+0.034
 lives+0.034
 know+0.033
 comes+0.033
Neg logit contributions
side-0.072
fs-0.066
 salon-0.065
cakes-0.062
but-0.061
Token her
Token ID607
Feature activation-1.03e-02

Pos logit contributions
side+0.023
fs+0.021
 salon+0.020
but+0.019
cakes+0.019
Neg logit contributions
 name-0.017
 comes-0.015
 can-0.015
 wonder-0.015
 know-0.015
Token to
Token ID284
Feature activation+3.68e-03

Pos logit contributions
 name+0.014
 can+0.013
 comes+0.011
 wants+0.011
 wonder+0.011
Neg logit contributions
side-0.026
fs-0.025
liv-0.024
ves-0.024
ized-0.023
Token the
Token ID262
Feature activation-3.83e-03

Pos logit contributions
side+0.009
 salon+0.008
fs+0.008
but+0.008
 therapist+0.008
Neg logit contributions
 name-0.004
 wants-0.004
 can-0.004
 comes-0.004
 lives-0.004
Token hair
Token ID4190
Feature activation+2.87e-02

Pos logit contributions
 name+0.006
 can+0.006
 lives+0.006
 wonder+0.006
 wonders+0.005
Neg logit contributions
fs-0.009
side-0.008
ves-0.008
ized-0.008
but-0.007
Token salon
Token ID46608
Feature activation+2.11e-01

Pos logit contributions
fs+0.075
liv+0.072
ves+0.071
bly+0.071
dd+0.069
Neg logit contributions
 can-0.028
 and-0.026
 name-0.026
 be-0.023
 brother-0.023
Token.
Token ID13
Feature activation-2.06e-03

Pos logit contributions
fs+0.620
side+0.598
liv+0.587
bly+0.568
 therapist+0.561
Neg logit contributions
 name-0.166
 know-0.131
 be-0.115
 lives-0.108
 see-0.106
Token The
Token ID383
Feature activation-1.10e-02

Pos logit contributions
 name+0.002
 comes+0.002
 can+0.002
 wonder+0.002
 lives+0.002
Neg logit contributions
side-0.005
fs-0.005
 salon-0.005
cakes-0.005
but-0.005
Token ha
Token ID387
Feature activation+7.62e-02

Pos logit contributions
 name+0.021
 can+0.019
 wants+0.018
 lives+0.018
 comes+0.018
Neg logit contributions
side-0.022
fs-0.021
ized-0.020
but-0.019
cakes-0.019
Tokenird
Token ID1447
Feature activation+7.61e-02

Pos logit contributions
side+0.097
fs+0.082
ized+0.073
 therapist+0.070
pan+0.070
Neg logit contributions
 comes-0.182
 name-0.181
 can-0.179
 know-0.178
 wants-0.176
Tokenress
Token ID601
Feature activation+6.84e-02

Pos logit contributions
side+0.107
fs+0.096
 salon+0.086
ized+0.082
cakes+0.081
Neg logit contributions
 name-0.179
 can-0.167
 comes-0.167
 know-0.165
 wonder-0.165
Token."
Token ID526
Feature activation-1.69e-02

Pos logit contributions
 name+0.008
 can+0.008
 comes+0.007
 wants+0.007
 lives+0.007
Neg logit contributions
side-0.022
fs-0.021
 salon-0.020
liv-0.020
cakes-0.019
Token
Token ID198
Feature activation-2.20e-04

Pos logit contributions
 name+0.016
 can+0.013
 comes+0.013
 wonder+0.013
 know+0.013
Neg logit contributions
side-0.047
fs-0.045
 salon-0.043
cakes-0.042
but-0.042
Token
Token ID198
Feature activation-8.84e-04

Pos logit contributions
 name+0.000
 know+0.000
 can+0.000
 wonder+0.000
 lives+0.000
Neg logit contributions
side-0.001
fs-0.001
 salon-0.001
pan-0.001
ized-0.001
TokenAt
Token ID2953
Feature activation-2.96e-03

Pos logit contributions
 name+0.001
 know+0.001
 be+0.001
 They+0.001
 wonder+0.001
Neg logit contributions
pan-0.002
ier-0.002
fs-0.002
side-0.002
cakes-0.002
Token the
Token ID262
Feature activation-7.05e-03

Pos logit contributions
 name+0.002
 can+0.002
 know+0.002
 wonder+0.002
 comes+0.002
Neg logit contributions
side-0.009
fs-0.008
ves-0.008
cakes-0.008
brush-0.008
Token salon
Token ID46608
Feature activation+2.10e-01

Pos logit contributions
 name+0.014
 wonder+0.013
 can+0.013
 lives+0.012
 be+0.012
Neg logit contributions
fs-0.014
side-0.013
ized-0.012
ves-0.012
but-0.012
Token,
Token ID11
Feature activation+7.22e-03

Pos logit contributions
side+0.599
fs+0.577
cakes+0.529
but+0.527
 salon+0.527
Neg logit contributions
 name-0.194
 comes-0.168
 can-0.166
 lives-0.162
 know-0.162
Token Lily
Token ID20037
Feature activation+2.48e-03

Pos logit contributions
side+0.023
fs+0.022
ves+0.021
liv+0.021
cake+0.020
Neg logit contributions
 name-0.005
 can-0.004
 Sam-0.003
 brother-0.003
 wonder-0.003
Token observed
Token ID6515
Feature activation+2.16e-02

Pos logit contributions
side+0.006
fs+0.006
 salon+0.006
cakes+0.005
but+0.005
Neg logit contributions
 name-0.003
 can-0.003
 comes-0.003
 wonder-0.003
 know-0.003
Token the
Token ID262
Feature activation-4.86e-03

Pos logit contributions
side+0.066
fs+0.065
cakes+0.061
liv+0.060
ider+0.060
Neg logit contributions
 name-0.016
 wonder-0.011
 can-0.011
 names-0.011
 wonders-0.011
Token other
Token ID584
Feature activation+2.02e-02

Pos logit contributions
 name+0.009
 can+0.008
 wants+0.008
 wonder+0.008
 wonders+0.008
Neg logit contributions
side-0.010
fs-0.010
ized-0.009
bly-0.009
pan-0.009
Token Amy
Token ID14235
Feature activation+1.94e-02

Pos logit contributions
side+0.022
fs+0.021
 salon+0.021
brush+0.020
pan+0.020
Neg logit contributions
 name-0.009
 can-0.008
 wonder-0.008
 comes-0.008
 know-0.008
Token rolled
Token ID11686
Feature activation+4.11e-02

Pos logit contributions
side+0.052
fs+0.050
 salon+0.046
cakes+0.045
but+0.045
Neg logit contributions
 name-0.022
 can-0.020
 comes-0.020
 know-0.019
 wants-0.019
Token out
Token ID503
Feature activation-1.44e-02

Pos logit contributions
side+0.113
fs+0.110
ves+0.105
 salon+0.102
pron+0.102
Neg logit contributions
 name-0.046
 wonder-0.038
 can-0.037
 comes-0.035
 wonders-0.035
Token of
Token ID286
Feature activation+1.26e-02

Pos logit contributions
 name+0.018
 comes+0.017
 wonder+0.017
 can+0.017
 know+0.016
Neg logit contributions
side-0.035
fs-0.035
ves-0.033
liv-0.032
pron-0.032
Token the
Token ID262
Feature activation+1.06e-03

Pos logit contributions
side+0.036
fs+0.035
ves+0.034
but+0.033
cakes+0.033
Neg logit contributions
 name-0.011
 wonder-0.010
 comes-0.009
 can-0.009
 know-0.009
Token salon
Token ID46608
Feature activation+2.07e-01

Pos logit contributions
fs+0.002
side+0.002
ves+0.002
but+0.002
ized+0.002
Neg logit contributions
 name-0.002
 can-0.002
 lives-0.002
 wonders-0.001
 comes-0.001
Token,
Token ID11
Feature activation+9.26e-03

Pos logit contributions
fs+0.623
side+0.605
bly+0.590
liv+0.590
 therapist+0.575
Neg logit contributions
 name-0.129
 be-0.105
 know-0.100
 comes-0.095
 can-0.091
Token her
Token ID607
Feature activation-5.85e-03

Pos logit contributions
side+0.022
fs+0.022
ves+0.021
liv+0.020
cakes+0.020
Neg logit contributions
 name-0.012
 can-0.011
 wonder-0.011
 comes-0.011
 see-0.010
Token new
Token ID649
Feature activation+1.11e-02

Pos logit contributions
 name+0.008
 can+0.006
 wonder+0.006
 lives+0.006
 wants+0.006
Neg logit contributions
side-0.016
fs-0.015
but-0.014
ized-0.014
cakes-0.014
Token haircut
Token ID45548
Feature activation+7.28e-02

Pos logit contributions
side+0.024
fs+0.023
cakes+0.020
ized+0.020
ves+0.020
Neg logit contributions
 name-0.021
 wonder-0.018
 wants-0.018
 lives-0.018
 wonders-0.018
Token in
Token ID287
Feature activation-4.81e-03

Pos logit contributions
side+0.195
fs+0.192
bly+0.180
fters+0.179
ized+0.177
Neg logit contributions
 can-0.067
 is-0.061
 comes-0.059
 name-0.058
 lives-0.054
Token loved
Token ID6151
Feature activation+1.85e-02

Pos logit contributions
side+0.003
fs+0.003
 salon+0.003
but+0.003
brush+0.003
Neg logit contributions
 name-0.002
 wonder-0.001
 know-0.001
 lives-0.001
 comes-0.001
Token going
Token ID1016
Feature activation-4.85e-02

Pos logit contributions
side+0.044
fs+0.041
 salon+0.038
ized+0.037
cakes+0.037
Neg logit contributions
 name-0.027
 comes-0.024
 can-0.023
 wonder-0.023
 lives-0.023
Token to
Token ID284
Feature activation+2.88e-03

Pos logit contributions
 name+0.064
 comes+0.055
 can+0.055
 wonder+0.052
 know+0.052
Neg logit contributions
side-0.121
fs-0.115
 salon-0.107
ves-0.106
liv-0.106
Token the
Token ID262
Feature activation-1.61e-03

Pos logit contributions
side+0.007
fs+0.007
 salon+0.007
but+0.006
ves+0.006
Neg logit contributions
 name-0.004
 can-0.003
 comes-0.003
 lives-0.003
 wonder-0.003
Token hair
Token ID4190
Feature activation+2.70e-02

Pos logit contributions
 name+0.003
 can+0.003
 wonder+0.003
 lives+0.003
 wonders+0.003
Neg logit contributions
side-0.003
fs-0.003
ves-0.003
pan-0.003
cakes-0.003
Token salon
Token ID46608
Feature activation+2.06e-01

Pos logit contributions
fs+0.037
liv+0.035
bly+0.034
side+0.033
ized+0.032
Neg logit contributions
 can-0.059
 name-0.058
 comes-0.055
 is-0.054
 lives-0.053
Token and
Token ID290
Feature activation-1.54e-04

Pos logit contributions
side+0.532
fs+0.526
liv+0.490
pan+0.479
ves+0.478
Neg logit contributions
 name-0.230
 can-0.194
 know-0.191
 lives-0.188
 comes-0.188
Token getting
Token ID1972
Feature activation-2.20e-03

Pos logit contributions
 name+0.000
 can+0.000
 know+0.000
 comes+0.000
 wonder+0.000
Neg logit contributions
side-0.000
fs-0.000
ized-0.000
ves-0.000
cakes-0.000
Token different
Token ID1180
Feature activation+2.37e-02

Pos logit contributions
 name+0.003
 can+0.002
 names+0.002
 wonder+0.002
 know+0.002
Neg logit contributions
side-0.006
fs-0.006
ves-0.005
liv-0.005
pan-0.005
Token hairc
Token ID37363
Feature activation-1.10e-03

Pos logit contributions
side+0.058
fs+0.056
liv+0.053
pron+0.052
pan+0.052
Neg logit contributions
 name-0.032
 names-0.029
 wonders-0.028
 lives-0.028
 can-0.028
Tokenuts
Token ID5500
Feature activation+7.07e-02

Pos logit contributions
 name+0.003
 know+0.003
 names+0.002
 can+0.002
 wants+0.002
Neg logit contributions
side-0.001
fs-0.001
pan-0.001
 therapist-0.001
ized-0.001
Token Sue
Token ID26089
Feature activation+1.69e-02

Pos logit contributions
 name+0.003
 wonder+0.002
 wants+0.002
 comes+0.002
 can+0.002
Neg logit contributions
side-0.004
fs-0.004
 salon-0.004
cakes-0.003
ized-0.003
Token.
Token ID13
Feature activation-9.64e-03

Pos logit contributions
side+0.040
fs+0.037
 salon+0.036
cakes+0.035
but+0.035
Neg logit contributions
 name-0.023
 wonder-0.021
 comes-0.020
 lives-0.020
 can-0.020
Token They
Token ID1119
Feature activation-2.30e-02

Pos logit contributions
 name+0.008
 comes+0.008
 know+0.008
 lives+0.007
 wonder+0.007
Neg logit contributions
side-0.027
fs-0.024
cakes-0.024
but-0.023
 salon-0.023
Token leave
Token ID2666
Feature activation+1.52e-03

Pos logit contributions
 name+0.023
 comes+0.020
 wants+0.020
 lives+0.019
 wonder+0.019
Neg logit contributions
side-0.062
fs-0.057
 salon-0.057
brush-0.054
cakes-0.054
Token the
Token ID262
Feature activation-5.75e-03

Pos logit contributions
side+0.004
fs+0.004
 salon+0.004
but+0.004
cakes+0.004
Neg logit contributions
 name-0.002
 can-0.001
 comes-0.001
 wonder-0.001
 lives-0.001
Token salon
Token ID46608
Feature activation+2.06e-01

Pos logit contributions
 name+0.010
 lives+0.009
 can+0.009
 wonders+0.009
 wonder+0.009
Neg logit contributions
side-0.012
fs-0.012
ves-0.011
cakes-0.011
ized-0.011
Token with
Token ID351
Feature activation+2.30e-02

Pos logit contributions
side+0.495
fs+0.479
liv+0.457
 therapist+0.441
ized+0.436
Neg logit contributions
 name-0.271
 lives-0.239
 know-0.239
 comes-0.237
 can-0.237
Token big
Token ID1263
Feature activation-1.24e-02

Pos logit contributions
side+0.067
fs+0.065
but+0.062
 salon+0.061
ves+0.060
Neg logit contributions
 name-0.021
 wonder-0.018
 can-0.018
 lives-0.017
 know-0.016
Token smiles
Token ID21845
Feature activation-2.51e-02

Pos logit contributions
 name+0.014
 can+0.012
 comes+0.012
 wonder+0.012
 know+0.012
Neg logit contributions
side-0.033
fs-0.031
 salon-0.030
cakes-0.028
but-0.028
Token.
Token ID13
Feature activation-9.95e-03

Pos logit contributions
 name+0.034
 can+0.030
 comes+0.030
 lives+0.027
 wants+0.027
Neg logit contributions
side-0.061
fs-0.060
 salon-0.058
liv-0.058
cakes-0.056
Token Lily
Token ID20037
Feature activation-2.23e-03

Pos logit contributions
 comes+0.010
 know+0.010
 lives+0.010
rr+0.010
 wants+0.009
Neg logit contributions
side-0.025
cakes-0.022
fs-0.021
cake-0.021
but-0.021
TokenShe
Token ID3347
Feature activation+2.55e-02

Pos logit contributions
 know+0.019
 name+0.019
 wonder+0.018
 meet+0.017
 wonders+0.017
Neg logit contributions
pan-0.022
side-0.021
fs-0.021
ier-0.021
 salon-0.021
Token went
Token ID1816
Feature activation-4.80e-02

Pos logit contributions
side+0.067
fs+0.063
 salon+0.059
but+0.058
cakes+0.058
Neg logit contributions
 name-0.031
 can-0.029
 wants-0.027
 comes-0.027
 wonder-0.027
Token to
Token ID284
Feature activation-1.17e-02

Pos logit contributions
 name+0.050
 comes+0.042
 can+0.039
 know+0.037
 meet+0.036
Neg logit contributions
side-0.131
fs-0.130
ized-0.124
ves-0.123
liv-0.121
Token the
Token ID262
Feature activation-1.60e-02

Pos logit contributions
 name+0.015
 wants+0.014
 can+0.014
 lives+0.014
 comes+0.014
Neg logit contributions
side-0.027
 salon-0.025
fs-0.024
but-0.023
cakes-0.023
Token hair
Token ID4190
Feature activation+1.87e-02

Pos logit contributions
 name+0.024
 lives+0.021
 can+0.021
 wonder+0.020
 wonders+0.020
Neg logit contributions
fs-0.039
side-0.037
ves-0.036
ized-0.034
but-0.034
Token salon
Token ID46608
Feature activation+2.05e-01

Pos logit contributions
fs+0.027
bly+0.026
ves+0.024
dd+0.024
liv+0.023
Neg logit contributions
 and-0.040
 can-0.038
 name-0.038
 be-0.037
 is-0.037
Token and
Token ID290
Feature activation-1.04e-02

Pos logit contributions
fs+0.618
side+0.593
bly+0.567
liv+0.563
ves+0.560
Neg logit contributions
 name-0.148
 know-0.105
 wants-0.095
 comes-0.095
 be-0.093
Token got
Token ID1392
Feature activation-6.72e-03

Pos logit contributions
 name+0.012
 can+0.011
 wonder+0.011
 know+0.011
 comes+0.010
Neg logit contributions
side-0.027
fs-0.027
ves-0.025
quet-0.024
ized-0.024
Token her
Token ID607
Feature activation-1.25e-02

Pos logit contributions
 name+0.008
 can+0.006
 names+0.006
 wonder+0.006
 know+0.006
Neg logit contributions
side-0.019
fs-0.019
ves-0.018
ized-0.017
liv-0.017
Token new
Token ID649
Feature activation+1.83e-03

Pos logit contributions
 name+0.020
 can+0.017
 names+0.016
 lives+0.016
 wants+0.016
Neg logit contributions
side-0.031
fs-0.028
ized-0.027
but-0.026
ves-0.026
Token haircut
Token ID45548
Feature activation+6.90e-02

Pos logit contributions
side+0.004
fs+0.004
ized+0.003
but+0.003
cakes+0.003
Neg logit contributions
 name-0.004
 wonder-0.003
 can-0.003
 names-0.003
 wants-0.003
Token they
Token ID484
Feature activation-4.28e-03

Pos logit contributions
side+0.024
fs+0.021
liv+0.021
ized+0.019
handed+0.019
Neg logit contributions
 can-0.025
 name-0.024
 comes-0.024
 Don-0.022
 know-0.022
Token would
Token ID561
Feature activation+9.97e-04

Pos logit contributions
 name+0.005
 can+0.005
 wonder+0.005
 comes+0.005
 know+0.005
Neg logit contributions
side-0.011
fs-0.010
 salon-0.010
but-0.010
cakes-0.009
Token go
Token ID467
Feature activation-2.31e-02

Pos logit contributions
side+0.003
fs+0.003
cakes+0.003
 salon+0.003
ized+0.003
Neg logit contributions
 name-0.001
 can-0.001
 know-0.001
 wonder-0.001
 comes-0.001
Token to
Token ID284
Feature activation-1.17e-03

Pos logit contributions
 name+0.026
 can+0.023
 comes+0.022
 know+0.022
 wonder+0.021
Neg logit contributions
side-0.062
fs-0.060
liv-0.055
 salon-0.055
ves-0.055
Token the
Token ID262
Feature activation-6.70e-03

Pos logit contributions
 name+0.001
 comes+0.001
 can+0.001
 wants+0.001
 lives+0.001
Neg logit contributions
side-0.003
fs-0.003
 salon-0.003
but-0.003
cakes-0.003
Token salon
Token ID46608
Feature activation+2.05e-01

Pos logit contributions
 name+0.011
 lives+0.010
 can+0.010
 wonder+0.010
 wonders+0.009
Neg logit contributions
fs-0.015
side-0.015
ized-0.013
ves-0.013
but-0.013
Token and
Token ID290
Feature activation-5.10e-03

Pos logit contributions
side+0.552
fs+0.540
liv+0.507
ized+0.496
cakes+0.490
Neg logit contributions
 name-0.214
 can-0.189
 know-0.184
 comes-0.181
 wants-0.178
Token fix
Token ID4259
Feature activation+4.09e-02

Pos logit contributions
 name+0.004
 can+0.004
 know+0.003
 wonder+0.003
 comes+0.003
Neg logit contributions
side-0.016
fs-0.015
ized-0.014
liv-0.014
ves-0.014
Token her
Token ID607
Feature activation-9.13e-03

Pos logit contributions
side+0.111
fs+0.100
but+0.098
 salon+0.097
cakes+0.095
Neg logit contributions
 name-0.045
 can-0.042
 comes-0.040
 lives-0.040
 wonder-0.038
Token hair
Token ID4190
Feature activation+2.40e-02

Pos logit contributions
 name+0.015
 can+0.013
 wants+0.013
 lives+0.013
 comes+0.013
Neg logit contributions
side-0.021
fs-0.019
cakes-0.017
 salon-0.017
ized-0.017
Token.
Token ID13
Feature activation-1.06e-02

Pos logit contributions
side+0.065
fs+0.064
liv+0.063
ized+0.061
inge+0.061
Neg logit contributions
 can-0.022
 is-0.019
 name-0.018
 be-0.016
 comes-0.015
Token mom
Token ID1995
Feature activation+3.63e-03

Pos logit contributions
 comes+0.005
 name+0.005
 know+0.005
 lives+0.005
 wants+0.005
Neg logit contributions
side-0.012
 salon-0.012
fs-0.012
cakes-0.011
brush-0.011
Token went
Token ID1816
Feature activation-2.14e-02

Pos logit contributions
side+0.010
fs+0.009
 salon+0.009
cakes+0.008
but+0.008
Neg logit contributions
 name-0.004
 comes-0.004
 can-0.003
 wonder-0.003
 lives-0.003
Token to
Token ID284
Feature activation+6.05e-03

Pos logit contributions
 name+0.030
 can+0.026
 wonder+0.026
 comes+0.026
 lives+0.026
Neg logit contributions
side-0.051
fs-0.047
 salon-0.046
cakes-0.044
but-0.044
Token the
Token ID262
Feature activation+4.18e-03

Pos logit contributions
 salon+0.012
side+0.012
fs+0.010
pron+0.010
ner+0.010
Neg logit contributions
 name-0.008
 wants-0.008
 comes-0.008
 can-0.008
 lives-0.008
Token hair
Token ID4190
Feature activation+2.69e-02

Pos logit contributions
side+0.009
fs+0.008
ves+0.008
ized+0.007
but+0.007
Neg logit contributions
 name-0.007
 can-0.006
 lives-0.006
 wonder-0.006
 comes-0.006
Token salon
Token ID46608
Feature activation+2.04e-01

Pos logit contributions
fs+0.030
bly+0.027
side+0.026
ized+0.026
liv+0.025
Neg logit contributions
 name-0.064
 can-0.064
 is-0.064
 comes-0.062
 be-0.061
Token.
Token ID13
Feature activation+7.11e-03

Pos logit contributions
side+0.542
fs+0.529
liv+0.493
pan+0.484
 therapist+0.484
Neg logit contributions
 name-0.225
 know-0.189
 lives-0.181
 wants-0.180
 comes-0.175
Token The
Token ID383
Feature activation-4.02e-03

Pos logit contributions
side+0.020
cakes+0.018
 salon+0.018
but+0.017
fs+0.017
Neg logit contributions
 name-0.006
 lives-0.006
rr-0.006
 knows-0.006
 wants-0.006
Token little
Token ID1310
Feature activation-4.53e-02

Pos logit contributions
 name+0.005
 can+0.004
 comes+0.004
 wants+0.004
 lives+0.004
Neg logit contributions
side-0.011
fs-0.010
ized-0.009
but-0.009
liv-0.009
Token girl
Token ID2576
Feature activation-3.39e-02

Pos logit contributions
 name+0.040
 can+0.034
 comes+0.033
 wonder+0.033
 wants+0.032
Neg logit contributions
side-0.134
fs-0.127
ves-0.119
but-0.118
 salon-0.118
Token was
Token ID373
Feature activation+2.01e-03

Pos logit contributions
 name+0.040
 can+0.035
 comes+0.035
 wonder+0.034
 lives+0.034
Neg logit contributions
side-0.088
fs-0.082
 salon-0.079
cakes-0.076
but-0.076
Token He
Token ID679
Feature activation+9.76e-03

Pos logit contributions
side+0.029
but+0.026
fs+0.025
 salon+0.025
liv+0.025
Neg logit contributions
 comes-0.014
 name-0.013
 lives-0.013
 can-0.013
 imag-0.012
Token sn
Token ID3013
Feature activation-5.79e-02

Pos logit contributions
side+0.024
fs+0.022
 salon+0.022
ves+0.021
cakes+0.021
Neg logit contributions
 name-0.013
 comes-0.011
 can-0.011
 wonder-0.011
 know-0.011
Tokenuck
Token ID1347
Feature activation+6.15e-03

Pos logit contributions
 name+0.107
 can+0.096
 know+0.095
 wants+0.091
 wonder+0.091
Neg logit contributions
side-0.107
fs-0.104
 salon-0.100
ves-0.094
pan-0.092
Token into
Token ID656
Feature activation+1.98e-02

Pos logit contributions
side+0.014
 salon+0.013
fs+0.013
liv+0.013
cakes+0.013
Neg logit contributions
 name-0.009
 comes-0.008
 can-0.008
 lives-0.008
 wonder-0.008
Token a
Token ID257
Feature activation+5.34e-03

Pos logit contributions
side+0.052
fs+0.048
 salon+0.047
but+0.045
cakes+0.045
Neg logit contributions
 name-0.023
 comes-0.020
 can-0.020
 wonder-0.020
 lives-0.019
Token salon
Token ID46608
Feature activation+2.04e-01

Pos logit contributions
side+0.014
fs+0.013
but+0.013
cakes+0.012
ized+0.012
Neg logit contributions
 name-0.007
 can-0.006
 wonder-0.006
 lives-0.005
 comes-0.005
Token and
Token ID290
Feature activation+7.05e-03

Pos logit contributions
fs+0.634
side+0.613
cakes+0.602
bly+0.589
liv+0.582
Neg logit contributions
 name-0.117
 know-0.086
 wants-0.082
 be-0.079
 comes-0.078
Token found
Token ID1043
Feature activation-1.68e-02

Pos logit contributions
side+0.019
fs+0.018
quet+0.016
cakes+0.016
 salon+0.016
Neg logit contributions
 name-0.008
 can-0.008
 wonder-0.008
 comes-0.007
 know-0.007
Token the
Token ID262
Feature activation+7.81e-03

Pos logit contributions
 name+0.017
 can+0.014
 know+0.014
 wonder+0.014
 comes+0.013
Neg logit contributions
side-0.049
fs-0.045
 salon-0.045
cakes-0.042
ized-0.042
Token bar
Token ID2318
Feature activation-1.36e-02

Pos logit contributions
side+0.021
ized+0.019
fs+0.019
cakes+0.018
ves+0.017
Neg logit contributions
 name-0.012
 can-0.010
 wants-0.009
 wonder-0.009
 wonders-0.009
Tokenber
Token ID527
Feature activation+2.01e-02

Pos logit contributions
 name+0.021
 can+0.020
 wants+0.020
 knows+0.019
 know+0.019
Neg logit contributions
side-0.031
fs-0.028
liv-0.027
ized-0.027
pan-0.025
Token They
Token ID1119
Feature activation-1.68e-02

Feature not active on prev token;
no predictions were affected.
Token are
Token ID389
Feature activation-3.31e-02

Pos logit contributions
 name+0.015
 comes+0.012
 can+0.012
 wonder+0.012
 know+0.012
Neg logit contributions
side-0.048
fs-0.045
 salon-0.044
cakes-0.042
but-0.042
Token for
Token ID329
Feature activation-1.30e-03

Pos logit contributions
 name+0.042
 can+0.037
 comes+0.036
 lives+0.036
 wants+0.035
Neg logit contributions
side-0.087
fs-0.079
 salon-0.077
but-0.074
cakes-0.074
Token the
Token ID262
Feature activation-5.10e-03

Pos logit contributions
 name+0.002
 comes+0.002
 can+0.002
 know+0.002
 lives+0.002
Neg logit contributions
side-0.003
fs-0.003
 salon-0.003
but-0.003
cakes-0.003
Token hair
Token ID4190
Feature activation+2.31e-02

Pos logit contributions
 name+0.007
 can+0.006
 names+0.006
 wants+0.006
 comes+0.006
Neg logit contributions
side-0.013
fs-0.013
liv-0.012
ized-0.012
but-0.012
Token salon
Token ID46608
Feature activation+2.03e-01

Pos logit contributions
side+0.069
fs+0.065
but+0.062
 therapist+0.062
pron+0.060
Neg logit contributions
 can-0.020
 is-0.018
 wants-0.017
 name-0.017
 comes-0.016
Token!"
Token ID2474
Feature activation+1.61e-02

Pos logit contributions
side+0.585
fs+0.546
but+0.511
 therapist+0.507
 salon+0.507
Neg logit contributions
 name-0.197
 wants-0.181
 can-0.176
 comes-0.173
 know-0.171
Token
Token ID198
Feature activation+2.29e-03

Pos logit contributions
side+0.043
fs+0.040
 salon+0.039
pron+0.038
pan+0.037
Neg logit contributions
 name-0.019
 know-0.017
 wonder-0.017
 can-0.017
 comes-0.017
Token
Token ID198
Feature activation+2.30e-03

Pos logit contributions
side+0.006
fs+0.006
ves+0.005
but+0.005
 salon+0.005
Neg logit contributions
 name-0.002
 comes-0.002
 lives-0.002
 wants-0.002
 wonder-0.002
TokenHe
Token ID1544
Feature activation+5.52e-03

Pos logit contributions
side+0.004
fs+0.004
 salon+0.003
cakes+0.003
ized+0.003
Neg logit contributions
 name-0.005
 can-0.004
 know-0.004
 wonder-0.004
 comes-0.004
Token ran
Token ID4966
Feature activation+3.29e-03

Pos logit contributions
side+0.013
fs+0.012
 salon+0.012
cakes+0.011
but+0.011
Neg logit contributions
 name-0.008
 wonder-0.007
 can-0.007
 know-0.007
 comes-0.007
Token never
Token ID1239
Feature activation-1.70e-02

Pos logit contributions
 name+0.012
 know+0.011
 can+0.010
 wonder+0.010
 wants+0.010
Neg logit contributions
side-0.034
fs-0.033
ized-0.030
 salon-0.030
cakes-0.030
Token been
Token ID587
Feature activation+7.25e-03

Pos logit contributions
 name+0.022
 know+0.022
 comes+0.021
 wants+0.020
 can+0.020
Neg logit contributions
side-0.042
fs-0.040
ves-0.037
pan-0.037
cakes-0.036
Token to
Token ID284
Feature activation+4.41e-03

Pos logit contributions
side+0.018
fs+0.017
cakes+0.016
ves+0.015
liv+0.015
Neg logit contributions
 name-0.010
 know-0.009
 can-0.009
 comes-0.009
 wants-0.008
Token a
Token ID257
Feature activation-1.33e-03

Pos logit contributions
side+0.011
fs+0.010
ves+0.009
cakes+0.009
ized+0.009
Neg logit contributions
 name-0.006
 know-0.006
 wonder-0.006
 can-0.006
 comes-0.006
Token hair
Token ID4190
Feature activation+2.62e-02

Pos logit contributions
 name+0.002
 can+0.002
 wonder+0.002
 comes+0.002
 know+0.002
Neg logit contributions
side-0.003
fs-0.003
cakes-0.002
 salon-0.002
ized-0.002
Token salon
Token ID46608
Feature activation+2.02e-01

Pos logit contributions
fs+0.053
side+0.050
liv+0.049
ves+0.048
ized+0.048
Neg logit contributions
 name-0.045
 can-0.044
 be-0.037
 try-0.037
 want-0.037
Token before
Token ID878
Feature activation+2.04e-02

Pos logit contributions
fs+0.507
side+0.480
pan+0.455
ves+0.455
liv+0.450
Neg logit contributions
 name-0.255
 can-0.218
 wants-0.216
 know-0.208
 be-0.207
Token and
Token ID290
Feature activation+1.50e-03

Pos logit contributions
side+0.053
fs+0.050
 salon+0.047
cakes+0.047
brush+0.047
Neg logit contributions
 name-0.024
 comes-0.022
 can-0.022
 wonder-0.021
 lives-0.021
Token felt
Token ID2936
Feature activation+3.33e-02

Pos logit contributions
side+0.004
fs+0.003
ves+0.003
quet+0.003
ized+0.003
Neg logit contributions
 can-0.002
 name-0.002
 wants-0.002
 comes-0.002
 know-0.002
Token a
Token ID257
Feature activation-8.85e-04

Pos logit contributions
side+0.094
fs+0.089
ves+0.084
 salon+0.084
cakes+0.084
Neg logit contributions
 name-0.039
 wonder-0.035
 can-0.034
 know-0.034
 wants-0.031
Token little
Token ID1310
Feature activation-4.60e-02

Pos logit contributions
 name+0.001
 can+0.001
 wonder+0.001
 comes+0.001
 know+0.001
Neg logit contributions
side-0.002
fs-0.002
 salon-0.002
cakes-0.002
but-0.002
Token afraid
Token ID7787
Feature activation-4.09e-02

Pos logit contributions
side+0.057
fs+0.055
cakes+0.051
ves+0.051
 salon+0.050
Neg logit contributions
 name-0.024
 know-0.022
 wonder-0.022
 wants-0.022
 can-0.021
Token to
Token ID284
Feature activation+1.36e-03

Pos logit contributions
 can+0.060
 comes+0.059
 name+0.059
 wants+0.054
 know+0.054
Neg logit contributions
side-0.100
fs-0.094
liv-0.089
 therapist-0.085
 salon-0.085
Token go
Token ID467
Feature activation-2.28e-02

Pos logit contributions
side+0.004
fs+0.004
 salon+0.003
cakes+0.003
but+0.003
Neg logit contributions
 name-0.001
 know-0.001
 can-0.001
 wonder-0.001
 comes-0.001
Token to
Token ID284
Feature activation+1.42e-03

Pos logit contributions
 name+0.027
 comes+0.023
 can+0.023
 know+0.022
 wonder+0.021
Neg logit contributions
side-0.061
fs-0.060
ves-0.056
liv-0.055
cakes-0.054
Token the
Token ID262
Feature activation-5.71e-03

Pos logit contributions
side+0.004
fs+0.003
 salon+0.003
cakes+0.003
but+0.003
Neg logit contributions
 name-0.002
 comes-0.002
 can-0.002
 wonder-0.002
 know-0.002
Token salon
Token ID46608
Feature activation+2.02e-01

Pos logit contributions
 name+0.011
 can+0.010
 lives+0.010
 wonder+0.009
 wonders+0.009
Neg logit contributions
fs-0.012
side-0.011
ves-0.010
ized-0.010
cakes-0.010
Token anymore
Token ID7471
Feature activation+2.70e-02

Pos logit contributions
fs+0.526
side+0.512
liv+0.484
bly+0.471
ves+0.471
Neg logit contributions
 name-0.231
 know-0.207
 can-0.198
 wants-0.191
 lives-0.190
Token.
Token ID13
Feature activation-1.66e-03

Pos logit contributions
side+0.071
fs+0.067
 salon+0.064
liv+0.063
cakes+0.063
Neg logit contributions
 name-0.031
 can-0.030
 know-0.028
 wants-0.027
 comes-0.027
Token<|endoftext|>
Token ID50256
Feature activation+3.76e-02

Pos logit contributions
 name+0.002
 wonder+0.002
rr+0.002
 lives+0.002
 comes+0.002
Neg logit contributions
side-0.004
but-0.003
cakes-0.003
 salon-0.003
cake-0.003
TokenOnce
Token ID7454
Feature activation+2.54e-02

Pos logit contributions
side+0.093
fs+0.082
ves+0.079
but+0.078
cakes+0.077
Neg logit contributions
 name-0.046
 wants-0.044
 wonder-0.043
 lives-0.043
 know-0.042
Token upon
Token ID2402
Feature activation-9.78e-03

Pos logit contributions
side+0.062
fs+0.057
 salon+0.056
but+0.053
ves+0.053
Neg logit contributions
 name-0.033
 wonder-0.029
 comes-0.028
 lives-0.028
 wants-0.028
Token day
Token ID1110
Feature activation+1.75e-02

Pos logit contributions
 name+0.054
 can+0.047
 lives+0.047
 comes+0.047
 wonder+0.046
Neg logit contributions
side-0.120
fs-0.116
 salon-0.111
cakes-0.109
ves-0.108
Token,
Token ID11
Feature activation+2.16e-02

Pos logit contributions
cakes+0.050
liv+0.049
fs+0.049
 salon+0.048
cake+0.047
Neg logit contributions
 name-0.013
 can-0.010
 know-0.010
 wonder-0.009
 wonders-0.009
Token working
Token ID1762
Feature activation-9.71e-03

Pos logit contributions
side+0.052
fs+0.051
cakes+0.049
quet+0.048
brush+0.047
Neg logit contributions
 name-0.029
 can-0.027
 wants-0.027
 wonder-0.027
 comes-0.026
Token at
Token ID379
Feature activation+1.81e-02

Pos logit contributions
 can+0.011
 name+0.011
 comes+0.010
 lives+0.009
 is+0.009
Neg logit contributions
side-0.024
 salon-0.023
fs-0.023
cakes-0.023
liv-0.023
Token his
Token ID465
Feature activation+7.82e-03

Pos logit contributions
side+0.044
 salon+0.040
fs+0.039
but+0.037
liv+0.037
Neg logit contributions
 name-0.025
 comes-0.022
 can-0.022
 wants-0.021
 wonder-0.021
Token salon
Token ID46608
Feature activation+2.02e-01

Pos logit contributions
side+0.018
fs+0.017
cakes+0.015
but+0.015
 salon+0.015
Neg logit contributions
 name-0.012
 lives-0.011
 can-0.010
 wonder-0.010
 wants-0.010
Token and
Token ID290
Feature activation+7.40e-03

Pos logit contributions
side+0.527
fs+0.522
cakes+0.502
cake+0.495
liv+0.493
Neg logit contributions
 name-0.214
 lives-0.191
 can-0.190
 wants-0.184
 comes-0.179
Token running
Token ID2491
Feature activation+1.13e-02

Pos logit contributions
side+0.017
fs+0.017
cakes+0.016
quet+0.016
ves+0.016
Neg logit contributions
 name-0.010
 can-0.010
 wants-0.010
 know-0.009
 lives-0.009
Token err
Token ID11454
Feature activation+6.22e-02

Pos logit contributions
side+0.027
fs+0.026
 salon+0.025
cakes+0.025
liv+0.025
Neg logit contributions
 name-0.015
 can-0.014
 comes-0.013
 know-0.013
 lives-0.013
Tokenands
Token ID1746
Feature activation+7.87e-03

Pos logit contributions
side+0.107
fs+0.099
 salon+0.093
cakes+0.090
ized+0.087
Neg logit contributions
 name-0.120
 know-0.116
 wonder-0.113
 lives-0.112
 comes-0.112
Token.
Token ID13
Feature activation+2.10e-02

Pos logit contributions
cakes+0.020
liv+0.020
fs+0.020
cake+0.020
side+0.020
Neg logit contributions
 can-0.008
 name-0.007
 comes-0.007
 know-0.006
 lives-0.006
Token She
Token ID1375
Feature activation-3.76e-03

Pos logit contributions
 name+0.019
 lives+0.018
 comes+0.018
 wants+0.017
 wonder+0.017
Neg logit contributions
side-0.037
 salon-0.034
fs-0.034
but-0.033
cakes-0.032
Token said
Token ID531
Feature activation+8.67e-03

Pos logit contributions
 name+0.005
 can+0.004
 wants+0.004
 wonder+0.004
 know+0.004
Neg logit contributions
side-0.010
fs-0.009
 salon-0.008
ized-0.008
but-0.008
Token she
Token ID673
Feature activation+2.32e-02

Pos logit contributions
side+0.017
fs+0.014
liv+0.014
ized+0.013
handed+0.013
Neg logit contributions
 can-0.018
 name-0.018
 comes-0.017
 wants-0.016
 know-0.016
Token hoped
Token ID10719
Feature activation-3.65e-02

Pos logit contributions
side+0.058
fs+0.054
 salon+0.053
cakes+0.050
but+0.050
Neg logit contributions
 name-0.029
 comes-0.025
 wonder-0.025
 know-0.024
 lives-0.024
Token the
Token ID262
Feature activation-9.78e-03

Pos logit contributions
 name+0.046
 can+0.044
 comes+0.041
 know+0.041
 be+0.039
Neg logit contributions
side-0.093
fs-0.088
liv-0.085
ized-0.082
inge-0.081
Token salon
Token ID46608
Feature activation+2.00e-01

Pos logit contributions
 name+0.018
 can+0.017
 wants+0.016
 wonder+0.015
 comes+0.015
Neg logit contributions
side-0.021
ized-0.019
fs-0.019
bly-0.018
liv-0.017
Token had
Token ID550
Feature activation-1.89e-02

Pos logit contributions
side+0.562
fs+0.527
ized+0.500
liv+0.491
 salon+0.483
Neg logit contributions
 name-0.214
 can-0.202
 comes-0.183
 wants-0.180
 lives-0.177
Token some
Token ID617
Feature activation-8.54e-03

Pos logit contributions
 name+0.021
 comes+0.020
 can+0.020
 names+0.019
 know+0.019
Neg logit contributions
side-0.053
fs-0.050
but-0.048
 salon-0.046
ized-0.046
Token candy
Token ID18550
Feature activation-1.19e-02

Pos logit contributions
 name+0.012
 can+0.012
 wonder+0.011
 wants+0.010
 know+0.010
Neg logit contributions
side-0.023
liv-0.020
fs-0.020
pan-0.019
ized-0.019
Token.
Token ID13
Feature activation-1.27e-02

Pos logit contributions
 can+0.014
 comes+0.011
 is+0.010
 name+0.010
 lives+0.009
Neg logit contributions
liv-0.031
side-0.030
fs-0.030
inge-0.030
ized-0.029
Token She
Token ID1375
Feature activation-9.99e-04

Pos logit contributions
 name+0.017
 lives+0.016
 comes+0.016
 wants+0.015
 wonder+0.015
Neg logit contributions
side-0.029
 salon-0.026
fs-0.026
but-0.026
cakes-0.025
Tokenloo
Token ID29680
Feature activation+2.49e-03

Pos logit contributions
 name+0.035
 wonder+0.032
 know+0.032
 comes+0.031
 can+0.031
Neg logit contributions
side-0.030
fs-0.027
 salon-0.026
cakes-0.024
but-0.024
Token and
Token ID290
Feature activation-8.62e-03

Pos logit contributions
fs+0.007
liv+0.007
 therapist+0.007
 salon+0.006
side+0.006
Neg logit contributions
 name-0.002
 can-0.002
 be-0.002
 comes-0.002
 wonder-0.002
Token pretended
Token ID36766
Feature activation-7.72e-03

Pos logit contributions
 name+0.011
 comes+0.010
 can+0.010
 wonder+0.010
 lives+0.010
Neg logit contributions
side-0.021
fs-0.020
 salon-0.019
cakes-0.019
but-0.018
Token to
Token ID284
Feature activation+3.79e-03

Pos logit contributions
 comes+0.008
 can+0.008
 name+0.007
 Sam+0.007
 is+0.006
Neg logit contributions
side-0.022
 salon-0.021
fs-0.020
inge-0.020
pron-0.020
Token be
Token ID307
Feature activation+3.93e-02

Pos logit contributions
side+0.011
 salon+0.010
fs+0.010
ves+0.010
brush+0.010
Neg logit contributions
 name-0.003
 comes-0.002
 wants-0.002
 can-0.002
 lives-0.002
Token Esk
Token ID47534
Feature activation-1.71e-01

Pos logit contributions
side+0.098
fs+0.088
 salon+0.086
but+0.083
cakes+0.083
Neg logit contributions
 name-0.050
 wants-0.044
 comes-0.044
 can-0.043
 wonder-0.043
Tokenim
Token ID320
Feature activation-5.25e-02

Pos logit contributions
 name+0.283
 wonder+0.271
 can+0.270
 know+0.268
 wants+0.262
Neg logit contributions
side-0.367
fs-0.315
 salon-0.314
liv-0.306
ves-0.305
Tokenos
Token ID418
Feature activation-5.25e-03

Pos logit contributions
 name+0.093
 can+0.085
 comes+0.085
 lives+0.083
 know+0.083
Neg logit contributions
side-0.104
fs-0.095
 salon-0.092
cakes-0.086
but-0.086
Token.
Token ID13
Feature activation-7.28e-03

Pos logit contributions
 name+0.007
 can+0.006
 comes+0.006
 lives+0.006
 know+0.006
Neg logit contributions
side-0.013
fs-0.012
 salon-0.012
liv-0.012
 therapist-0.011
Token After
Token ID2293
Feature activation+3.69e-03

Pos logit contributions
 name+0.007
 lives+0.006
 comes+0.006
 wants+0.006
 wonder+0.006
Neg logit contributions
side-0.020
fs-0.018
 salon-0.018
cakes-0.018
but-0.018
Token a
Token ID257
Feature activation-7.65e-03

Pos logit contributions
side+0.009
fs+0.009
 salon+0.009
but+0.008
ized+0.008
Neg logit contributions
 name-0.004
 can-0.004
 comes-0.004
 know-0.004
 wonder-0.004
Token fox
Token ID21831
Feature activation-3.72e-02

Pos logit contributions
 name+0.012
 comes+0.011
 lives+0.011
 know+0.010
 wants+0.010
Neg logit contributions
side-0.026
fs-0.024
 salon-0.023
 therapist-0.023
but-0.022
Token,
Token ID11
Feature activation-1.37e-03

Pos logit contributions
 name+0.037
 can+0.031
 know+0.030
 comes+0.030
 wonder+0.029
Neg logit contributions
side-0.104
fs-0.098
 salon-0.095
cakes-0.092
liv-0.091
Token put
Token ID1234
Feature activation+5.79e-03

Pos logit contributions
 name+0.001
 can+0.001
 wonder+0.001
 comes+0.001
 know+0.001
Neg logit contributions
side-0.004
fs-0.004
 salon-0.003
cakes-0.003
liv-0.003
Token the
Token ID262
Feature activation-8.86e-03

Pos logit contributions
side+0.015
fs+0.014
 salon+0.014
cakes+0.013
but+0.013
Neg logit contributions
 name-0.007
 can-0.006
 comes-0.006
 wonder-0.006
 lives-0.006
Token honey
Token ID12498
Feature activation-4.62e-03

Pos logit contributions
 name+0.014
 can+0.013
 comes+0.012
 wonder+0.012
 know+0.012
Neg logit contributions
side-0.020
fs-0.018
 salon-0.017
ized-0.017
but-0.017
Tokencom
Token ID785
Feature activation-1.63e-01

Pos logit contributions
 name+0.005
 can+0.005
 wonder+0.004
 bear+0.004
 comes+0.004
Neg logit contributions
liv-0.012
 salon-0.012
side-0.011
fs-0.011
cakes-0.011
Tokenbs
Token ID1443
Feature activation+4.39e-03

Pos logit contributions
 name+0.290
 can+0.276
 comes+0.272
 lives+0.269
 wants+0.266
Neg logit contributions
side-0.320
 salon-0.293
fs-0.291
liv-0.279
cakes-0.269
Token on
Token ID319
Feature activation+2.00e-02

Pos logit contributions
liv+0.012
side+0.012
 salon+0.011
fs+0.011
ves+0.011
Neg logit contributions
 can-0.005
 name-0.004
 comes-0.004
 wonder-0.004
 know-0.004
Token top
Token ID1353
Feature activation+3.85e-02

Pos logit contributions
side+0.055
fs+0.053
 salon+0.053
cakes+0.051
liv+0.050
Neg logit contributions
 name-0.021
 can-0.017
 wonder-0.017
 comes-0.016
 know-0.015
Token of
Token ID286
Feature activation+5.99e-03

Pos logit contributions
fs+0.095
liv+0.094
side+0.089
uddled+0.088
 therapist+0.087
Neg logit contributions
 can-0.045
 name-0.041
 comes-0.041
 wonder-0.039
 know-0.037
Token his
Token ID465
Feature activation-6.80e-03

Pos logit contributions
side+0.016
fs+0.015
 salon+0.014
but+0.014
ized+0.013
Neg logit contributions
 name-0.007
 comes-0.006
 can-0.006
 wants-0.006
 wonder-0.006
Token was
Token ID373
Feature activation-1.13e-02

Pos logit contributions
 name+0.026
 can+0.022
 comes+0.022
 wonder+0.022
 lives+0.022
Neg logit contributions
side-0.059
fs-0.055
 salon-0.054
cakes-0.052
but-0.052
Token a
Token ID257
Feature activation-1.37e-02

Pos logit contributions
 name+0.013
 wonder+0.011
 can+0.011
 comes+0.011
 know+0.010
Neg logit contributions
side-0.030
fs-0.029
 salon-0.028
cakes-0.027
but-0.027
Token stack
Token ID8931
Feature activation+2.62e-02

Pos logit contributions
 name+0.018
 comes+0.016
 wants+0.016
 know+0.015
 lives+0.015
Neg logit contributions
side-0.033
fs-0.031
 salon-0.030
cakes-0.028
but-0.028
Token of
Token ID286
Feature activation+7.48e-03

Pos logit contributions
side+0.063
fs+0.061
liv+0.060
 salon+0.060
 therapist+0.057
Neg logit contributions
 can-0.032
 comes-0.028
 name-0.028
 is-0.027
 be-0.026
Token honey
Token ID12498
Feature activation-1.92e-03

Pos logit contributions
side+0.019
fs+0.017
but+0.017
 salon+0.017
liv+0.016
Neg logit contributions
 name-0.011
 wonder-0.010
 can-0.010
 lives-0.009
 know-0.009
Tokencom
Token ID785
Feature activation-1.57e-01

Pos logit contributions
 name+0.002
 can+0.002
 be+0.002
 comes+0.002
 bear+0.002
Neg logit contributions
fs-0.005
side-0.005
ized-0.005
 salon-0.005
liv-0.005
Tokenbs
Token ID1443
Feature activation+6.50e-03

Pos logit contributions
 name+0.285
 can+0.267
 comes+0.265
 lives+0.262
 wants+0.262
Neg logit contributions
side-0.307
fs-0.278
 salon-0.277
cakes-0.256
liv-0.256
Token!
Token ID0
Feature activation-1.10e-02

Pos logit contributions
side+0.018
fs+0.018
 salon+0.018
liv+0.017
ves+0.017
Neg logit contributions
 can-0.007
 name-0.006
 comes-0.006
 wonder-0.004
 is-0.004
Token
Token ID198
Feature activation-2.64e-03

Pos logit contributions
 name+0.011
 comes+0.010
 lives+0.010
 wants+0.010
 wonder+0.010
Neg logit contributions
side-0.030
fs-0.028
but-0.027
 salon-0.027
cakes-0.026
Token
Token ID198
Feature activation-2.26e-03

Pos logit contributions
 name+0.003
 comes+0.002
 can+0.002
 wants+0.002
 lives+0.002
Neg logit contributions
side-0.007
fs-0.007
 salon-0.006
but-0.006
ves-0.006
TokenThe
Token ID464
Feature activation+3.77e-02

Pos logit contributions
 name+0.004
 can+0.003
 know+0.003
 meet+0.003
 wonder+0.003
Neg logit contributions
side-0.004
 salon-0.004
fs-0.004
pan-0.004
cakes-0.004
Token the
Token ID262
Feature activation-8.58e-03

Pos logit contributions
side+0.075
fs+0.070
 salon+0.070
cakes+0.067
brush+0.067
Neg logit contributions
 can-0.025
 name-0.025
 lives-0.023
 comes-0.022
 is-0.022
Token puppy
Token ID26188
Feature activation-1.27e-02

Pos logit contributions
 name+0.011
 can+0.010
 wonder+0.010
 lives+0.010
 comes+0.010
Neg logit contributions
side-0.021
fs-0.020
 salon-0.019
but-0.019
cakes-0.018
Token and
Token ID290
Feature activation-9.18e-03

Pos logit contributions
 name+0.015
 can+0.015
 lives+0.014
 comes+0.014
 wants+0.013
Neg logit contributions
side-0.034
fs-0.031
 salon-0.031
liv-0.029
cakes-0.029
Token the
Token ID262
Feature activation-7.54e-03

Pos logit contributions
 name+0.012
 can+0.011
 know+0.011
 wonder+0.011
 lives+0.011
Neg logit contributions
side-0.023
fs-0.022
 salon-0.021
but-0.020
cakes-0.020
Token cheer
Token ID14042
Feature activation+5.19e-03

Pos logit contributions
 name+0.010
 can+0.008
 wonder+0.008
 lives+0.008
 wants+0.008
Neg logit contributions
side-0.020
fs-0.019
 salon-0.018
but-0.018
ves-0.018
Tokenfull
Token ID12853
Feature activation-1.56e-01

Pos logit contributions
side+0.012
 salon+0.011
fs+0.010
ized+0.010
liv+0.010
Neg logit contributions
 can-0.008
 name-0.008
 comes-0.008
 is-0.007
 lives-0.007
Tokener
Token ID263
Feature activation-2.69e-02

Pos logit contributions
 name+0.210
 can+0.185
 lives+0.172
 comes+0.166
 wants+0.163
Neg logit contributions
side-0.407
fs-0.381
 salon-0.376
but-0.364
liv-0.363
Token times
Token ID1661
Feature activation-2.71e-02

Pos logit contributions
 name+0.037
 can+0.034
 lives+0.031
 comes+0.030
 wants+0.030
Neg logit contributions
side-0.066
fs-0.064
 salon-0.060
ized-0.059
cakes-0.058
Token they
Token ID484
Feature activation-5.85e-03

Pos logit contributions
 can+0.045
 comes+0.041
 name+0.041
 lives+0.040
 is+0.039
Neg logit contributions
side-0.058
fs-0.054
 salon-0.053
liv-0.052
pron-0.051
Token had
Token ID550
Feature activation-2.28e-02

Pos logit contributions
 name+0.007
 can+0.006
 know+0.006
 lives+0.006
 wonder+0.006
Neg logit contributions
side-0.016
fs-0.014
 salon-0.014
cakes-0.014
but-0.014
Token together
Token ID1978
Feature activation+2.60e-03

Pos logit contributions
 name+0.029
 can+0.027
 know+0.026
 lives+0.026
 comes+0.026
Neg logit contributions
side-0.057
 salon-0.054
fs-0.054
cakes-0.051
but-0.050
Token his
Token ID465
Feature activation-8.39e-03

Pos logit contributions
side+0.043
fs+0.040
 salon+0.039
ves+0.038
liv+0.037
Neg logit contributions
 name-0.020
 can-0.017
 know-0.017
 wonder-0.017
 comes-0.017
Token favourite
Token ID12507
Feature activation+1.34e-02

Pos logit contributions
 name+0.013
 comes+0.013
 lives+0.013
 wonder+0.013
 know+0.013
Neg logit contributions
side-0.017
fs-0.016
 salon-0.015
cakes-0.015
pron-0.014
Token treat
Token ID2190
Feature activation+9.38e-03

Pos logit contributions
side+0.029
fs+0.026
 salon+0.025
but+0.024
ized+0.024
Neg logit contributions
 name-0.024
 can-0.021
 wonder-0.021
 comes-0.020
 lives-0.020
Token -
Token ID532
Feature activation-4.46e-02

Pos logit contributions
side+0.027
 salon+0.026
fs+0.026
ized+0.025
liv+0.025
Neg logit contributions
 name-0.009
 can-0.008
 Sam-0.006
 comes-0.006
 is-0.005
Token ch
Token ID442
Feature activation-5.03e-02

Pos logit contributions
 name+0.071
 can+0.067
 wonder+0.063
 comes+0.061
 know+0.060
Neg logit contributions
side-0.106
fs-0.097
 salon-0.093
liv-0.091
ized-0.091
Tokenoc
Token ID420
Feature activation-1.55e-01

Pos logit contributions
 name+0.069
 comes+0.061
 can+0.060
 wonder+0.060
 know+0.059
Neg logit contributions
side-0.125
fs-0.114
 salon-0.111
cakes-0.107
but-0.107
Token-
Token ID12
Feature activation-9.29e-02

Pos logit contributions
 name+0.296
 wonder+0.274
 can+0.272
 know+0.271
 comes+0.269
Neg logit contributions
side-0.289
fs-0.262
 salon-0.248
cakes-0.239
but-0.237
Tokenchip
Token ID35902
Feature activation-1.22e-02

Pos logit contributions
 name+0.188
 know+0.172
 can+0.160
 wonder+0.160
 wants+0.159
Neg logit contributions
fs-0.160
side-0.156
 salon-0.155
pan-0.150
ized-0.135
Token ice
Token ID4771
Feature activation-9.78e-03

Pos logit contributions
 can+0.017
 name+0.017
 comes+0.016
 wants+0.015
 names+0.015
Neg logit contributions
side-0.031
liv-0.027
 salon-0.027
fs-0.027
ized-0.027
Token cream
Token ID8566
Feature activation+5.39e-02

Pos logit contributions
 name+0.020
 can+0.020
 comes+0.019
 is+0.019
 looks+0.018
Neg logit contributions
side-0.016
 salon-0.015
fs-0.014
but-0.013
ized-0.013
Token!
Token ID0
Feature activation-1.26e-02

Pos logit contributions
side+0.148
fs+0.141
 salon+0.136
ves+0.134
ized+0.134
Neg logit contributions
 can-0.060
 name-0.055
 comes-0.053
 is-0.047
 wants-0.043
TokenThe
Token ID464
Feature activation+3.11e-02

Pos logit contributions
 name+0.017
 know+0.016
 wonder+0.016
 can+0.016
 meet+0.015
Neg logit contributions
 salon-0.024
fs-0.021
pan-0.021
side-0.021
brush-0.020
Token puppy
Token ID26188
Feature activation-1.74e-02

Pos logit contributions
side+0.078
fs+0.073
 salon+0.071
but+0.069
cakes+0.068
Neg logit contributions
 name-0.041
 can-0.036
 wonder-0.035
 lives-0.035
 wants-0.034
Token was
Token ID373
Feature activation-2.13e-02

Pos logit contributions
 name+0.022
 can+0.020
 lives+0.019
 comes+0.019
 wonder+0.019
Neg logit contributions
side-0.044
fs-0.041
 salon-0.040
cakes-0.038
but-0.038
Token much
Token ID881
Feature activation-2.37e-02

Pos logit contributions
 name+0.025
 know+0.022
 wonder+0.022
 can+0.020
 lives+0.020
Neg logit contributions
fs-0.057
side-0.056
cakes-0.053
 salon-0.052
ves-0.051
Token cheer
Token ID14042
Feature activation+2.18e-03

Pos logit contributions
 wonder+0.033
 name+0.031
 can+0.031
 know+0.030
 comes+0.029
Neg logit contributions
side-0.059
fs-0.058
ves-0.054
liv-0.053
 salon-0.052
Tokenfull
Token ID12853
Feature activation-1.55e-01

Pos logit contributions
side+0.006
 salon+0.006
fs+0.006
cakes+0.005
ves+0.005
Neg logit contributions
 name-0.002
 can-0.002
 comes-0.002
 is-0.002
 lives-0.002
Tokener
Token ID263
Feature activation-2.49e-02

Pos logit contributions
 name+0.217
 can+0.198
 comes+0.187
 lives+0.175
 names+0.169
Neg logit contributions
fs-0.377
side-0.376
 salon-0.359
liv-0.354
ves-0.343
Token now
Token ID783
Feature activation-3.66e-02

Pos logit contributions
 name+0.033
 can+0.028
 comes+0.027
 lives+0.026
 know+0.026
Neg logit contributions
fs-0.062
side-0.059
ves-0.055
 salon-0.055
cakes-0.054
Token.
Token ID13
Feature activation-1.32e-02

Pos logit contributions
 can+0.037
 name+0.037
 comes+0.034
 wonder+0.032
 lives+0.032
Neg logit contributions
side-0.100
 salon-0.098
fs-0.097
cakes-0.094
liv-0.091
Token He
Token ID679
Feature activation-1.52e-02

Pos logit contributions
 name+0.014
 lives+0.013
 comes+0.012
 know+0.012
 wants+0.012
Neg logit contributions
side-0.035
fs-0.032
 salon-0.031
but-0.031
cakes-0.030
Token said
Token ID531
Feature activation+5.49e-03

Pos logit contributions
 name+0.020
 can+0.018
 comes+0.018
 wonder+0.018
 know+0.017
Neg logit contributions
side-0.037
fs-0.035
 salon-0.033
cakes-0.032
but-0.032
Token day
Token ID1110
Feature activation-1.35e-03

Pos logit contributions
 name+0.062
 comes+0.056
 wants+0.056
 know+0.055
 can+0.055
Neg logit contributions
side-0.100
fs-0.090
 salon-0.089
cakes-0.084
but-0.084
Token,
Token ID11
Feature activation-9.11e-03

Pos logit contributions
 name+0.002
 comes+0.001
 can+0.001
 wonder+0.001
 lives+0.001
Neg logit contributions
side-0.004
fs-0.003
 salon-0.003
cakes-0.003
but-0.003
Token the
Token ID262
Feature activation-1.63e-02

Pos logit contributions
 name+0.011
 can+0.009
 wonder+0.009
 know+0.008
 comes+0.008
Neg logit contributions
side-0.025
fs-0.024
ves-0.022
 salon-0.022
liv-0.022
Token rh
Token ID9529
Feature activation-6.67e-02

Pos logit contributions
 comes+0.022
 name+0.022
 know+0.021
 wants+0.020
 wonder+0.019
Neg logit contributions
side-0.037
 salon-0.033
fs-0.032
ized-0.032
 therapist-0.031
Tokenin
Token ID259
Feature activation-2.55e-02

Pos logit contributions
 name+0.135
 can+0.130
 lives+0.130
 wants+0.126
 comes+0.125
Neg logit contributions
side-0.114
fs-0.101
 salon-0.095
cakes-0.093
liv-0.091
Tokenoc
Token ID420
Feature activation-1.54e-01

Pos logit contributions
 name+0.030
 names+0.026
 comes+0.026
 wonder+0.026
 know+0.025
Neg logit contributions
side-0.066
fs-0.059
 salon-0.058
ves-0.057
ized-0.056
Tokeneros
Token ID27498
Feature activation-8.09e-02

Pos logit contributions
 name+0.228
 know+0.209
 wonder+0.208
 can+0.206
 comes+0.201
Neg logit contributions
side-0.356
fs-0.321
 salon-0.318
ized-0.309
but-0.306
Token felt
Token ID2936
Feature activation+1.60e-02

Pos logit contributions
 name+0.086
 can+0.079
 comes+0.078
 lives+0.076
 wants+0.075
Neg logit contributions
side-0.219
fs-0.210
 salon-0.199
but-0.194
cakes-0.193
Token much
Token ID881
Feature activation-2.34e-02

Pos logit contributions
fs+0.045
 salon+0.044
side+0.043
ves+0.043
liv+0.042
Neg logit contributions
 can-0.016
 name-0.016
 welcome-0.016
 wonder-0.016
 know-0.016
Token better
Token ID1365
Feature activation-2.41e-02

Pos logit contributions
 name+0.029
 can+0.028
 wonder+0.028
 comes+0.027
 know+0.026
Neg logit contributions
side-0.062
fs-0.059
 salon-0.057
liv-0.056
ves-0.055
Token.
Token ID13
Feature activation-8.84e-03

Pos logit contributions
 can+0.021
 name+0.021
 wonder+0.020
 wants+0.018
 comes+0.018
Neg logit contributions
fs-0.069
side-0.067
cakes-0.066
 salon-0.064
ves-0.064
Token away
Token ID1497
Feature activation+1.65e-03

Pos logit contributions
 name+0.022
 can+0.020
 comes+0.020
 wonder+0.020
 know+0.019
Neg logit contributions
side-0.031
fs-0.029
 salon-0.028
cakes-0.027
ves-0.026
Token.
Token ID13
Feature activation-7.59e-03

Pos logit contributions
fs+0.004
side+0.004
 salon+0.004
liv+0.004
cakes+0.004
Neg logit contributions
 name-0.002
 can-0.002
 wonder-0.002
 comes-0.002
 know-0.002
Token The
Token ID383
Feature activation-1.62e-02

Pos logit contributions
 name+0.009
 comes+0.009
 wonder+0.008
 lives+0.008
 wants+0.008
Neg logit contributions
side-0.019
but-0.017
cake-0.016
cakes-0.016
 salon-0.015
Token rh
Token ID9529
Feature activation-6.38e-02

Pos logit contributions
 comes+0.021
 name+0.021
 know+0.020
 wants+0.019
 wonder+0.018
Neg logit contributions
side-0.037
fs-0.034
ized-0.033
 salon-0.033
pron-0.031
Tokenin
Token ID259
Feature activation-2.32e-02

Pos logit contributions
 name+0.140
 can+0.133
 lives+0.131
 comes+0.131
 wants+0.130
Neg logit contributions
side-0.100
fs-0.087
 salon-0.083
cakes-0.078
liv-0.077
Tokenoc
Token ID420
Feature activation-1.54e-01

Pos logit contributions
 name+0.031
 names+0.030
 comes+0.028
 wonder+0.027
 know+0.027
Neg logit contributions
side-0.054
fs-0.049
ized-0.048
ves-0.047
but-0.047
Tokeneros
Token ID27498
Feature activation-8.11e-02

Pos logit contributions
 name+0.259
 know+0.254
 wonder+0.248
 can+0.242
 comes+0.228
Neg logit contributions
side-0.316
ized-0.287
fs-0.283
 salon-0.279
but-0.279
Token felt
Token ID2936
Feature activation+2.30e-02

Pos logit contributions
 name+0.102
 comes+0.088
 can+0.088
 wonder+0.087
 know+0.087
Neg logit contributions
side-0.204
fs-0.189
 salon-0.183
but-0.175
cakes-0.175
Token che
Token ID1125
Feature activation-3.28e-02

Pos logit contributions
side+0.058
fs+0.056
 salon+0.055
cakes+0.052
ves+0.051
Neg logit contributions
 name-0.030
 wonder-0.027
 know-0.027
 can-0.027
 comes-0.025
Tokenery
Token ID1924
Feature activation-7.80e-03

Pos logit contributions
 name+0.059
 comes+0.057
 wants+0.056
 know+0.056
 can+0.056
Neg logit contributions
side-0.060
 salon-0.054
fs-0.054
cakes-0.052
but-0.050
Token and
Token ID290
Feature activation-8.59e-03

Pos logit contributions
 can+0.009
 name+0.009
 wonder+0.009
 comes+0.008
 lives+0.008
Neg logit contributions
side-0.020
 salon-0.019
fs-0.019
liv-0.019
cakes-0.018
Token they
Token ID484
Feature activation-1.12e-02

Pos logit contributions
 name+0.085
 can+0.078
 comes+0.077
 wonder+0.077
 lives+0.076
Neg logit contributions
side-0.120
fs-0.111
 salon-0.106
liv-0.101
cakes-0.101
Token could
Token ID714
Feature activation-2.47e-02

Pos logit contributions
 name+0.013
 comes+0.011
 wonder+0.011
 lives+0.011
 know+0.011
Neg logit contributions
side-0.028
fs-0.026
 salon-0.026
ized-0.024
cakes-0.024
Token grow
Token ID1663
Feature activation-6.96e-02

Pos logit contributions
 name+0.024
 know+0.020
 wonder+0.020
 can+0.020
 lives+0.018
Neg logit contributions
side-0.072
fs-0.068
 salon-0.065
cakes-0.063
liv-0.063
Token more
Token ID517
Feature activation+7.88e-03

Pos logit contributions
 name+0.114
 comes+0.102
 wonder+0.102
 lives+0.102
 know+0.102
Neg logit contributions
side-0.147
fs-0.134
 salon-0.127
but-0.123
cakes-0.123
Token av
Token ID1196
Feature activation-1.17e-02

Pos logit contributions
side+0.018
fs+0.016
 salon+0.016
liv+0.015
cakes+0.015
Neg logit contributions
 name-0.013
 can-0.011
 wonder-0.011
 comes-0.011
 lives-0.011
Tokenoc
Token ID420
Feature activation-1.54e-01

Pos logit contributions
 name+0.014
 comes+0.011
 wonder+0.011
 Don+0.011
 names+0.011
Neg logit contributions
side-0.032
fs-0.028
brush-0.028
ized-0.027
but-0.026
Tokenados
Token ID22484
Feature activation+1.03e-02

Pos logit contributions
 name+0.361
 wonder+0.336
 can+0.336
 comes+0.334
 know+0.333
Neg logit contributions
side-0.220
fs-0.193
 salon-0.180
cakes-0.169
but-0.168
Token and
Token ID290
Feature activation-1.00e-02

Pos logit contributions
liv+0.028
side+0.028
fs+0.027
inge+0.026
 salon+0.025
Neg logit contributions
 can-0.011
 name-0.009
 comes-0.009
 lives-0.008
 know-0.008
Token share
Token ID2648
Feature activation-1.15e-02

Pos logit contributions
 name+0.011
 can+0.010
 wonder+0.010
 know+0.010
 lives+0.009
Neg logit contributions
side-0.028
fs-0.027
liv-0.026
 salon-0.025
 therapist-0.024
Token them
Token ID606
Feature activation+4.46e-03

Pos logit contributions
 name+0.015
 can+0.013
 wonder+0.013
 comes+0.012
 know+0.012
Neg logit contributions
side-0.030
fs-0.028
 salon-0.027
liv-0.027
 therapist-0.026
Token with
Token ID351
Feature activation+2.03e-02

Pos logit contributions
side+0.012
liv+0.011
 salon+0.011
pan+0.011
fs+0.011
Neg logit contributions
 name-0.006
 can-0.004
 lives-0.004
 names-0.004
 is-0.004
Token new
Token ID649
Feature activation-5.34e-04

Pos logit contributions
side+0.094
 salon+0.084
fs+0.084
pan+0.082
brush+0.079
Neg logit contributions
 name-0.036
 can-0.033
 comes-0.029
 lives-0.028
 is-0.028
Token!
Token ID0
Feature activation-1.46e-02

Pos logit contributions
 name+0.001
 can+0.001
 wants+0.000
 lives+0.000
 wonder+0.000
Neg logit contributions
side-0.001
 salon-0.001
fs-0.001
pan-0.001
ized-0.001
Token She
Token ID1375
Feature activation-7.82e-05

Pos logit contributions
 name+0.019
 lives+0.018
 wonder+0.018
 wants+0.018
 comes+0.017
Neg logit contributions
side-0.034
fs-0.030
but-0.030
 salon-0.030
 therapist-0.029
Token thought
Token ID1807
Feature activation-1.31e-02

Pos logit contributions
 name+0.000
 can+0.000
 comes+0.000
 wonder+0.000
 know+0.000
Neg logit contributions
side-0.000
fs-0.000
 salon-0.000
but-0.000
cakes-0.000
Token av
Token ID1196
Feature activation-1.27e-02

Pos logit contributions
 name+0.019
 can+0.019
 comes+0.018
 know+0.017
 wonder+0.017
Neg logit contributions
side-0.032
fs-0.027
 salon-0.027
liv-0.027
ized-0.026
Tokenoc
Token ID420
Feature activation-1.53e-01

Pos logit contributions
 name+0.017
 Don+0.015
 comes+0.015
 lives+0.014
 meet+0.013
Neg logit contributions
side-0.031
fs-0.027
brush-0.027
ized-0.027
but-0.025
Tokenados
Token ID22484
Feature activation+7.73e-03

Pos logit contributions
 name+0.377
 wonder+0.365
 know+0.358
 can+0.356
 comes+0.353
Neg logit contributions
side-0.193
fs-0.169
cakes-0.157
but-0.154
ized-0.151
Token were
Token ID547
Feature activation+2.25e-02

Pos logit contributions
side+0.021
 salon+0.019
fs+0.019
liv+0.018
pron+0.018
Neg logit contributions
 can-0.009
 name-0.008
 know-0.008
 comes-0.008
 wonder-0.007
Token the
Token ID262
Feature activation-1.24e-02

Pos logit contributions
side+0.060
fs+0.056
 salon+0.055
but+0.051
liv+0.051
Neg logit contributions
 name-0.028
 can-0.025
 wonder-0.024
 know-0.023
 lives-0.023
Token most
Token ID749
Feature activation+3.84e-02

Pos logit contributions
 name+0.019
 can+0.016
 wonder+0.016
 names+0.015
 wonders+0.015
Neg logit contributions
side-0.030
fs-0.029
ves-0.027
 salon-0.027
liv-0.026
Token interesting
Token ID3499
Feature activation-2.71e-02

Pos logit contributions
side+0.103
fs+0.098
 salon+0.095
pan+0.091
ves+0.090
Neg logit contributions
 name-0.049
 can-0.044
 wonder-0.043
 wonders-0.042
 know-0.040
Token unicorn
Token ID44986
Feature activation-3.59e-02

Pos logit contributions
 comes+0.023
 name+0.022
 know+0.021
 wants+0.020
 wonder+0.020
Neg logit contributions
side-0.032
fs-0.029
 salon-0.028
 therapist-0.027
ized-0.027
Token took
Token ID1718
Feature activation+5.85e-03

Pos logit contributions
 name+0.047
 can+0.042
 comes+0.042
 wonder+0.041
 lives+0.041
Neg logit contributions
side-0.088
fs-0.082
 salon-0.079
cakes-0.075
but-0.075
Token the
Token ID262
Feature activation-8.73e-03

Pos logit contributions
side+0.011
fs+0.009
ves+0.009
brush+0.009
ized+0.009
Neg logit contributions
 wants-0.010
 name-0.009
 Don-0.009
rr-0.009
 thinks-0.009
Token rh
Token ID9529
Feature activation-6.21e-02

Pos logit contributions
 name+0.011
 comes+0.011
 know+0.010
 wonder+0.010
 wants+0.009
Neg logit contributions
side-0.021
fs-0.019
 therapist-0.018
 salon-0.017
cake-0.017
Tokenin
Token ID259
Feature activation-2.09e-02

Pos logit contributions
 name+0.126
 can+0.120
 lives+0.119
 comes+0.117
 wants+0.116
Neg logit contributions
side-0.106
fs-0.093
 salon-0.089
cakes-0.085
liv-0.085
Tokenoc
Token ID420
Feature activation-1.53e-01

Pos logit contributions
 name+0.027
 imag+0.027
 comes+0.026
 wants+0.026
 wonders+0.026
Neg logit contributions
side-0.049
but-0.043
fs-0.043
cake-0.041
brush-0.040
Tokeneros
Token ID27498
Feature activation-7.75e-02

Pos logit contributions
 name+0.240
 know+0.226
 wonder+0.226
 can+0.221
 comes+0.213
Neg logit contributions
side-0.341
fs-0.308
 salon-0.304
ized-0.296
but-0.293
Token to
Token ID284
Feature activation-2.82e-03

Pos logit contributions
 name+0.092
 can+0.079
 comes+0.078
 wonder+0.078
 know+0.078
Neg logit contributions
side-0.201
fs-0.188
 salon-0.183
cakes-0.175
but-0.174
Token a
Token ID257
Feature activation-1.09e-02

Pos logit contributions
 wants+0.004
 name+0.004
 imag+0.004
 can+0.004
 lives+0.004
Neg logit contributions
side-0.006
ves-0.005
handed-0.005
fs-0.005
 salon-0.005
Token nearby
Token ID6716
Feature activation-4.29e-04

Pos logit contributions
 name+0.015
 comes+0.014
 wants+0.014
 lives+0.013
 know+0.013
Neg logit contributions
side-0.026
fs-0.023
 salon-0.023
pron-0.021
ized-0.021
Token pond
Token ID16723
Feature activation+1.64e-02

Pos logit contributions
 name+0.001
 wants+0.001
 comes+0.001
 know+0.001
 can+0.001
Neg logit contributions
side-0.001
 salon-0.001
fs-0.001
ized-0.001
brush-0.001
Token to
Token ID284
Feature activation-4.37e-03

Pos logit contributions
 name+0.051
 can+0.045
 comes+0.045
 wonder+0.044
 know+0.044
Neg logit contributions
side-0.100
fs-0.094
 salon-0.090
but-0.086
cakes-0.086
Token see
Token ID766
Feature activation-4.33e-02

Pos logit contributions
 name+0.004
 comes+0.003
 can+0.003
 wants+0.003
 lives+0.003
Neg logit contributions
side-0.012
 salon-0.012
fs-0.011
ves-0.011
but-0.011
Token the
Token ID262
Feature activation-9.35e-03

Pos logit contributions
 name+0.056
 wonder+0.048
 comes+0.048
 know+0.048
 can+0.047
Neg logit contributions
side-0.105
fs-0.098
 salon-0.096
ized-0.091
but-0.091
Token rh
Token ID9529
Feature activation-6.52e-02

Pos logit contributions
 name+0.012
 know+0.012
 comes+0.011
 wonder+0.011
 wants+0.011
Neg logit contributions
side-0.021
 salon-0.020
fs-0.019
ized-0.018
cakes-0.018
Tokenin
Token ID259
Feature activation-2.08e-02

Pos logit contributions
 name+0.132
 can+0.129
 lives+0.128
 comes+0.127
 wants+0.125
Neg logit contributions
side-0.110
fs-0.095
 salon-0.088
liv-0.087
cakes-0.085
Tokenoc
Token ID420
Feature activation-1.53e-01

Pos logit contributions
 name+0.024
 wonder+0.022
 know+0.020
 names+0.020
 imag+0.020
Neg logit contributions
side-0.053
fs-0.049
 salon-0.048
ves-0.047
but-0.047
Tokeneros
Token ID27498
Feature activation-7.96e-02

Pos logit contributions
 name+0.239
 know+0.225
 wonder+0.224
 can+0.216
 comes+0.208
Neg logit contributions
side-0.345
 salon-0.309
fs-0.306
ized-0.299
cakes-0.297
Token anymore
Token ID7471
Feature activation+2.10e-02

Pos logit contributions
 name+0.113
 can+0.104
 comes+0.103
 lives+0.102
 wants+0.101
Neg logit contributions
side-0.187
fs-0.174
 salon-0.166
liv-0.161
cakes-0.160
Token.
Token ID13
Feature activation-6.06e-03

Pos logit contributions
side+0.053
fs+0.049
 salon+0.047
liv+0.045
ized+0.045
Neg logit contributions
 name-0.027
 can-0.025
 comes-0.024
 wants-0.024
 lives-0.024
Token He
Token ID679
Feature activation-1.32e-02

Pos logit contributions
 finds+0.007
 comes+0.006
 know+0.006
 wonders+0.006
 imag+0.006
Neg logit contributions
side-0.013
but-0.011
cakes-0.011
brush-0.011
 therapist-0.011
Token is
Token ID318
Feature activation-6.31e-02

Pos logit contributions
 name+0.010
 wonder+0.008
 know+0.007
 names+0.007
 Don+0.007
Neg logit contributions
side-0.037
fs-0.035
 salon-0.035
ves-0.034
 therapist-0.034
Token celebrated
Token ID13943
Feature activation+1.19e-02

Pos logit contributions
 name+0.056
 can+0.055
 comes+0.053
 wonder+0.052
 know+0.052
Neg logit contributions
side-0.115
fs-0.107
 salon-0.105
liv-0.102
cakes-0.099
Token.
Token ID13
Feature activation-7.79e-03

Pos logit contributions
side+0.030
fs+0.028
 salon+0.028
cakes+0.026
brush+0.026
Neg logit contributions
 name-0.015
 can-0.013
 comes-0.013
 wonder-0.013
 lives-0.013
Token The
Token ID383
Feature activation-1.73e-02

Pos logit contributions
 name+0.007
 wonder+0.006
 comes+0.006
 can+0.006
 wants+0.006
Neg logit contributions
side-0.022
fs-0.020
 salon-0.020
but-0.019
cakes-0.019
Token rh
Token ID9529
Feature activation-6.41e-02

Pos logit contributions
 name+0.022
 comes+0.021
 know+0.020
 wonder+0.019
 can+0.019
Neg logit contributions
side-0.039
fs-0.035
ized-0.035
 salon-0.034
ves-0.034
Tokenin
Token ID259
Feature activation-2.16e-02

Pos logit contributions
 name+0.142
 can+0.134
 lives+0.133
 comes+0.132
 wants+0.132
Neg logit contributions
side-0.099
fs-0.087
 salon-0.082
cakes-0.077
liv-0.076
Tokenoc
Token ID420
Feature activation-1.52e-01

Pos logit contributions
 name+0.028
 names+0.026
 wonder+0.025
 comes+0.024
 can+0.024
Neg logit contributions
side-0.051
fs-0.047
ves-0.046
ized-0.045
but-0.045
Tokeneros
Token ID27498
Feature activation-7.99e-02

Pos logit contributions
 name+0.255
 know+0.245
 wonder+0.242
 can+0.235
 comes+0.222
Neg logit contributions
side-0.317
ized-0.288
fs-0.285
 salon-0.281
but-0.278
Token learned
Token ID4499
Feature activation-1.91e-03

Pos logit contributions
 name+0.089
 can+0.079
 comes+0.077
 lives+0.077
 wants+0.076
Neg logit contributions
side-0.213
fs-0.200
 salon-0.193
cakes-0.185
but-0.185
Token that
Token ID326
Feature activation+3.85e-03

Pos logit contributions
 name+0.003
 know+0.003
 can+0.003
 wonders+0.002
 wonder+0.002
Neg logit contributions
side-0.005
fs-0.004
 salon-0.004
pron-0.004
pan-0.004
Token humans
Token ID5384
Feature activation-4.08e-03

Pos logit contributions
side+0.009
fs+0.008
 salon+0.008
cakes+0.008
liv+0.008
Neg logit contributions
 name-0.006
 can-0.005
 wonder-0.005
 lives-0.005
 comes-0.005
Token can
Token ID460
Feature activation-6.26e-02

Pos logit contributions
 name+0.003
 can+0.003
 comes+0.002
 lives+0.002
 know+0.002
Neg logit contributions
side-0.013
fs-0.012
 salon-0.012
liv-0.011
cakes-0.011
Token wait
Token ID4043
Feature activation+2.51e-02

Pos logit contributions
side+0.000
fs+0.000
cakes+0.000
 salon+0.000
but+0.000
Neg logit contributions
 name-0.000
 know-0.000
 wonder-0.000
 can-0.000
 lives-0.000
Token until
Token ID1566
Feature activation-9.90e-03

Pos logit contributions
side+0.065
fs+0.061
 salon+0.059
liv+0.057
cakes+0.057
Neg logit contributions
 name-0.031
 can-0.027
 comes-0.026
 wonder-0.026
 wants-0.025
Token the
Token ID262
Feature activation-7.16e-03

Pos logit contributions
 name+0.010
 can+0.009
 comes+0.008
 lives+0.008
 wonder+0.008
Neg logit contributions
side-0.028
fs-0.027
liv-0.025
but-0.025
cakes-0.025
Token rh
Token ID9529
Feature activation-6.35e-02

Pos logit contributions
 name+0.010
 comes+0.009
 know+0.009
 wonder+0.009
 can+0.009
Neg logit contributions
side-0.017
fs-0.016
 salon-0.015
cakes-0.015
but-0.014
Tokenin
Token ID259
Feature activation-1.95e-02

Pos logit contributions
 name+0.127
 lives+0.123
 can+0.122
 comes+0.122
 wants+0.120
Neg logit contributions
side-0.113
fs-0.096
 salon-0.090
liv-0.089
but-0.088
Tokenoc
Token ID420
Feature activation-1.52e-01

Pos logit contributions
 name+0.021
 wonder+0.019
 know+0.018
 wonders+0.018
 names+0.017
Neg logit contributions
side-0.050
 salon-0.046
fs-0.046
cakes-0.045
ves-0.044
Tokeneros
Token ID27498
Feature activation-7.84e-02

Pos logit contributions
 name+0.238
 wonder+0.228
 know+0.225
 can+0.219
 wonders+0.205
Neg logit contributions
side-0.338
 salon-0.305
fs-0.304
cakes-0.301
ized-0.291
Token is
Token ID318
Feature activation-5.98e-02

Pos logit contributions
 name+0.070
 can+0.056
 comes+0.056
 wonder+0.056
 know+0.055
Neg logit contributions
side-0.225
fs-0.211
 salon-0.205
cakes-0.198
but-0.197
Token gone
Token ID3750
Feature activation-7.75e-03

Pos logit contributions
 name+0.083
 comes+0.072
 can+0.071
 know+0.069
 lives+0.069
Neg logit contributions
side-0.146
fs-0.139
 salon-0.131
but-0.129
cakes-0.129
Token."
Token ID526
Feature activation-2.69e-02

Pos logit contributions
 name+0.009
 can+0.008
 comes+0.008
 wonder+0.007
 know+0.007
Neg logit contributions
side-0.020
fs-0.019
 salon-0.019
cakes-0.018
but-0.018
Token
Token ID198
Feature activation-2.88e-03

Pos logit contributions
 name+0.023
 comes+0.018
 can+0.018
 lives+0.018
 wonder+0.018
Neg logit contributions
side-0.079
fs-0.074
 salon-0.072
but-0.070
cakes-0.070
Token
Token ID198
Feature activation+1.33e-03

Pos logit contributions
 name+0.000
 lives+0.000
 comes+0.000
 wonder+0.000
 wants+0.000
Neg logit contributions
side-0.000
fs-0.000
 salon-0.000
but-0.000
cakes-0.000
Token
Token ID198
Feature activation+1.55e-03

Pos logit contributions
side+0.003
fs+0.003
ves+0.003
but+0.003
 salon+0.003
Neg logit contributions
 name-0.002
 comes-0.001
 wonder-0.001
 wants-0.001
 lives-0.001
TokenThe
Token ID464
Feature activation+3.84e-02

Pos logit contributions
side+0.002
fs+0.002
 salon+0.002
but+0.002
ves+0.002
Neg logit contributions
 name-0.004
 comes-0.003
 wonder-0.003
 can-0.003
 lives-0.003
Token rh
Token ID9529
Feature activation-6.25e-02

Pos logit contributions
side+0.073
fs+0.067
 salon+0.067
ized+0.063
 therapist+0.063
Neg logit contributions
 comes-0.060
 name-0.057
 know-0.055
 wonder-0.053
 Don-0.052
Tokenin
Token ID259
Feature activation-2.10e-02

Pos logit contributions
 name+0.137
 can+0.129
 lives+0.127
 comes+0.127
 wants+0.126
Neg logit contributions
side-0.099
fs-0.086
 salon-0.081
cakes-0.077
liv-0.075
Tokenoc
Token ID420
Feature activation-1.52e-01

Pos logit contributions
 comes+0.032
 wonder+0.031
 name+0.031
 names+0.030
 Don+0.030
Neg logit contributions
side-0.042
ves-0.037
fs-0.037
pron-0.037
cut-0.036
Tokeneros
Token ID27498
Feature activation-7.76e-02

Pos logit contributions
 name+0.247
 know+0.237
 wonder+0.236
 can+0.227
 comes+0.220
Neg logit contributions
side-0.325
 salon-0.295
fs-0.295
ized-0.287
but-0.284
Token looked
Token ID3114
Feature activation-3.01e-02

Pos logit contributions
 name+0.105
 comes+0.092
 wonder+0.091
 know+0.090
 can+0.090
Neg logit contributions
side-0.185
fs-0.169
 salon-0.164
ized-0.160
cakes-0.159
Token up
Token ID510
Feature activation+1.49e-02

Pos logit contributions
 name+0.053
 know+0.050
 lives+0.049
rr+0.049
 Don+0.049
Neg logit contributions
side-0.055
ized-0.049
fs-0.048
but-0.046
cakes-0.042
Token,
Token ID11
Feature activation+2.36e-03

Pos logit contributions
side+0.042
fs+0.041
 salon+0.039
cakes+0.038
liv+0.038
Neg logit contributions
 name-0.015
 wonder-0.012
 can-0.012
 comes-0.012
 know-0.011
Token hopeful
Token ID17836
Feature activation-4.37e-02

Pos logit contributions
side+0.006
fs+0.006
 salon+0.005
cakes+0.005
ves+0.005
Neg logit contributions
 name-0.003
 can-0.002
 wonder-0.002
 comes-0.002
 wants-0.002
TokenFinally
Token ID11158
Feature activation+4.06e-02

Pos logit contributions
 name+0.000
 know+0.000
 can+0.000
 wonder+0.000
 wonders+0.000
Neg logit contributions
pan-0.000
fs-0.000
ier-0.000
side-0.000
 salon-0.000
Token,
Token ID11
Feature activation+3.04e-03

Pos logit contributions
side+0.117
fs+0.111
 salon+0.107
but+0.104
cakes+0.103
Neg logit contributions
 name-0.036
 comes-0.033
 can-0.033
 lives-0.030
 wonder-0.030
Token the
Token ID262
Feature activation-7.17e-03

Pos logit contributions
fs+0.009
side+0.009
 salon+0.008
liv+0.008
ves+0.008
Neg logit contributions
 name-0.002
 can-0.002
 wonder-0.002
 comes-0.002
 know-0.002
Token rh
Token ID9529
Feature activation-6.14e-02

Pos logit contributions
 name+0.009
 comes+0.009
 know+0.009
 Don+0.008
 wants+0.008
Neg logit contributions
side-0.016
fs-0.014
 salon-0.014
ized-0.014
brush-0.014
Tokenin
Token ID259
Feature activation-1.95e-02

Pos logit contributions
 name+0.131
 can+0.125
 lives+0.124
 comes+0.121
 wants+0.121
Neg logit contributions
side-0.100
fs-0.088
 salon-0.083
liv-0.079
cakes-0.079
Tokenoc
Token ID420
Feature activation-1.52e-01

Pos logit contributions
 name+0.023
 comes+0.020
 know+0.020
 Don+0.020
 names+0.020
Neg logit contributions
side-0.048
ves-0.043
fs-0.042
brush-0.042
cake-0.041
Tokeneros
Token ID27498
Feature activation-7.75e-02

Pos logit contributions
 name+0.239
 know+0.234
 wonder+0.227
 can+0.223
 comes+0.215
Neg logit contributions
side-0.330
 salon-0.290
fs-0.290
ized-0.286
but-0.285
Token remembered
Token ID12086
Feature activation-5.61e-03

Pos logit contributions
 name+0.088
 can+0.079
 comes+0.078
 lives+0.076
 wants+0.076
Neg logit contributions
side-0.205
fs-0.193
 salon-0.185
but-0.179
cakes-0.178
Token that
Token ID326
Feature activation+6.21e-03

Pos logit contributions
 name+0.005
 comes+0.005
 can+0.005
 know+0.004
 wonder+0.004
Neg logit contributions
fs-0.017
side-0.016
cakes-0.015
liv-0.015
brush-0.015
Token animals
Token ID4695
Feature activation-4.11e-02

Pos logit contributions
side+0.016
fs+0.015
liv+0.014
 salon+0.013
cakes+0.013
Neg logit contributions
 wonder-0.009
 can-0.009
 name-0.009
 lives-0.009
 comes-0.009
Token often
Token ID1690
Feature activation-7.49e-02

Pos logit contributions
 can+0.046
 name+0.046
 comes+0.043
 know+0.042
 wants+0.041
Neg logit contributions
side-0.114
fs-0.104
 salon-0.103
liv-0.101
cakes-0.099
Token not
Token ID407
Feature activation+1.98e-02

Pos logit contributions
 name+0.029
 can+0.026
 comes+0.026
 know+0.026
 wonder+0.025
Neg logit contributions
side-0.042
fs-0.039
 salon-0.037
cakes-0.036
brush-0.036
Token see
Token ID766
Feature activation-4.19e-02

Pos logit contributions
side+0.050
 salon+0.047
fs+0.047
wheel+0.046
but+0.045
Neg logit contributions
 name-0.017
 lives-0.016
 wants-0.015
 can-0.014
 comes-0.014
Token the
Token ID262
Feature activation-7.45e-03

Pos logit contributions
 name+0.064
 know+0.056
 wonder+0.056
 can+0.055
 comes+0.055
Neg logit contributions
side-0.092
fs-0.084
 salon-0.082
ves-0.078
ized-0.078
Token rh
Token ID9529
Feature activation-6.36e-02

Pos logit contributions
 name+0.009
 comes+0.008
 know+0.008
 wonder+0.008
 wants+0.008
Neg logit contributions
side-0.018
fs-0.016
 salon-0.016
ized-0.015
cakes-0.015
Tokenin
Token ID259
Feature activation-2.00e-02

Pos logit contributions
 name+0.132
 can+0.126
 lives+0.125
 comes+0.123
 wants+0.121
Neg logit contributions
side-0.107
fs-0.093
 salon-0.088
liv-0.085
cakes-0.083
Tokenoc
Token ID420
Feature activation-1.52e-01

Pos logit contributions
 name+0.021
 wonder+0.019
 know+0.018
 comes+0.018
 wonders+0.017
Neg logit contributions
side-0.053
fs-0.050
ves-0.047
 salon-0.047
ized-0.047
Tokeneros
Token ID27498
Feature activation-7.85e-02

Pos logit contributions
 name+0.238
 know+0.228
 wonder+0.227
 can+0.219
 comes+0.210
Neg logit contributions
side-0.337
fs-0.300
 salon-0.299
cakes-0.294
ized-0.292
Token.
Token ID13
Feature activation-2.03e-03

Pos logit contributions
 name+0.108
 can+0.104
 comes+0.102
 lives+0.100
 wants+0.097
Neg logit contributions
side-0.193
fs-0.178
liv-0.169
 salon-0.168
cakes-0.168
Token They
Token ID1119
Feature activation-1.66e-02

Pos logit contributions
 name+0.002
 know+0.002
 lives+0.002
 comes+0.002
 wonder+0.002
Neg logit contributions
side-0.006
 salon-0.005
but-0.005
cakes-0.005
fs-0.005
Token asked
Token ID1965
Feature activation-2.51e-02

Pos logit contributions
 name+0.022
 comes+0.020
 lives+0.019
 wants+0.019
 can+0.019
Neg logit contributions
side-0.039
fs-0.036
 salon-0.035
ized-0.034
ves-0.033
Token a
Token ID257
Feature activation-9.25e-03

Pos logit contributions
 name+0.032
 comes+0.028
 wonder+0.028
 know+0.028
 can+0.027
Neg logit contributions
side-0.062
fs-0.057
 salon-0.056
ves-0.053
but-0.053
Token there
Token ID612
Feature activation-4.80e-02

Pos logit contributions
 name+0.018
 comes+0.016
 can+0.016
 know+0.016
 wants+0.016
Neg logit contributions
side-0.021
fs-0.019
 salon-0.018
ized-0.017
liv-0.017
Token is
Token ID318
Feature activation-5.90e-02

Pos logit contributions
 name+0.029
 wonder+0.021
 comes+0.021
 can+0.021
 know+0.020
Neg logit contributions
side-0.150
fs-0.142
 salon-0.139
cakes-0.134
ves-0.134
Token a
Token ID257
Feature activation-8.42e-03

Pos logit contributions
 name+0.075
 can+0.065
 comes+0.065
 wonder+0.064
 know+0.064
Neg logit contributions
side-0.148
fs-0.138
 salon-0.133
cakes-0.128
but-0.127
Token rh
Token ID9529
Feature activation-6.21e-02

Pos logit contributions
 name+0.011
 wants+0.010
 comes+0.010
 know+0.010
 thinks+0.010
Neg logit contributions
side-0.019
 salon-0.018
fs-0.017
pron-0.016
ves-0.016
Tokenin
Token ID259
Feature activation-1.90e-02

Pos logit contributions
 name+0.121
 can+0.120
 comes+0.117
 lives+0.117
 wants+0.112
Neg logit contributions
side-0.111
fs-0.094
 salon-0.091
liv-0.090
cakes-0.087
Tokenoc
Token ID420
Feature activation-1.52e-01

Pos logit contributions
 name+0.019
 wonder+0.016
 comes+0.016
 know+0.016
 wonders+0.015
Neg logit contributions
side-0.051
fs-0.048
 salon-0.047
ves-0.046
but-0.045
Tokeneros
Token ID27498
Feature activation-7.79e-02

Pos logit contributions
 name+0.234
 wonder+0.222
 know+0.221
 can+0.216
 wants+0.208
Neg logit contributions
side-0.332
fs-0.307
 salon-0.296
cakes-0.294
but-0.292
Token in
Token ID287
Feature activation-1.13e-02

Pos logit contributions
 can+0.091
 name+0.091
 comes+0.089
 lives+0.087
 wants+0.083
Neg logit contributions
side-0.207
fs-0.194
liv-0.190
cakes-0.185
 salon-0.183
Token the
Token ID262
Feature activation-4.53e-03

Pos logit contributions
 name+0.014
 wants+0.012
 can+0.012
 comes+0.012
 know+0.012
Neg logit contributions
side-0.028
fs-0.026
 salon-0.025
ves-0.024
ized-0.024
Token park
Token ID3952
Feature activation+2.03e-02

Pos logit contributions
 know+0.007
 name+0.007
 comes+0.006
 wants+0.006
 wonder+0.006
Neg logit contributions
side-0.010
 salon-0.008
fs-0.008
brush-0.007
handed-0.007
Token.
Token ID13
Feature activation-4.58e-03

Pos logit contributions
side+0.048
fs+0.046
 salon+0.043
cakes+0.042
liv+0.042
Neg logit contributions
 name-0.028
 can-0.026
 comes-0.025
 lives-0.025
 wants-0.025
Token sailed
Token ID33835
Feature activation-5.91e-02

Pos logit contributions
side+0.034
fs+0.030
 salon+0.029
ized+0.029
ves+0.028
Neg logit contributions
 name-0.021
 wonder-0.018
 comes-0.018
 know-0.017
 names-0.017
Token for
Token ID329
Feature activation-1.54e-02

Pos logit contributions
 name+0.072
 can+0.060
 wonder+0.058
 comes+0.058
 names+0.056
Neg logit contributions
side-0.151
 salon-0.150
fs-0.147
liv-0.141
cakes-0.140
Token days
Token ID1528
Feature activation-7.30e-02

Pos logit contributions
 name+0.031
 wants+0.029
 comes+0.028
 can+0.028
 lives+0.027
Neg logit contributions
side-0.027
fs-0.024
ves-0.022
 salon-0.021
ized-0.020
Token,
Token ID11
Feature activation-1.14e-02

Pos logit contributions
 can+0.047
 name+0.046
 comes+0.041
 lives+0.041
 know+0.040
Neg logit contributions
fs-0.210
cakes-0.208
liv-0.202
side-0.202
 salon-0.201
Token exploring
Token ID13504
Feature activation-5.58e-02

Pos logit contributions
 name+0.017
 can+0.015
 wonder+0.015
 comes+0.015
 know+0.015
Neg logit contributions
side-0.026
fs-0.025
 salon-0.024
cakes-0.023
ves-0.023
Token unch
Token ID16258
Feature activation-1.52e-01

Pos logit contributions
 name+0.060
 can+0.052
 wonder+0.051
 comes+0.051
 lives+0.051
Neg logit contributions
side-0.150
fs-0.141
 salon-0.138
cakes-0.132
but-0.131
Tokenarted
Token ID19112
Feature activation+6.51e-04

Pos logit contributions
 name+0.249
 can+0.227
 comes+0.224
 wonder+0.222
 know+0.221
Neg logit contributions
side-0.323
fs-0.292
 salon-0.285
but-0.271
ized-0.271
Token waters
Token ID10150
Feature activation-1.86e-02

Pos logit contributions
side+0.002
fs+0.001
 salon+0.001
cakes+0.001
but+0.001
Neg logit contributions
 name-0.001
 comes-0.001
 can-0.001
 lives-0.001
 wonder-0.001
Token until
Token ID1566
Feature activation-2.50e-02

Pos logit contributions
 name+0.017
 can+0.016
 comes+0.014
 lives+0.013
 is+0.012
Neg logit contributions
side-0.053
 salon-0.052
fs-0.052
liv-0.051
cakes-0.050
Token he
Token ID339
Feature activation+7.83e-03

Pos logit contributions
 name+0.030
 can+0.028
 comes+0.028
 wonder+0.028
 know+0.026
Neg logit contributions
side-0.062
fs-0.061
 salon-0.059
liv-0.056
but-0.056
Token came
Token ID1625
Feature activation-4.79e-02

Pos logit contributions
side+0.019
fs+0.017
 salon+0.017
ves+0.016
but+0.016
Neg logit contributions
 name-0.011
 wonder-0.009
 comes-0.009
 can-0.009
 wants-0.009
Token,
Token ID11
Feature activation+1.53e-03

Pos logit contributions
side+0.001
but+0.001
fs+0.001
ized+0.001
 salon+0.001
Neg logit contributions
 name-0.001
 know-0.001
 wants-0.001
 lives-0.001
 names-0.001
Token and
Token ID290
Feature activation-9.48e-03

Pos logit contributions
side+0.004
fs+0.004
 salon+0.003
cakes+0.003
ves+0.003
Neg logit contributions
 name-0.002
 comes-0.002
 can-0.002
 wonder-0.002
 wants-0.002
Token the
Token ID262
Feature activation-5.65e-03

Pos logit contributions
 name+0.013
 can+0.012
 comes+0.012
 wonder+0.011
 lives+0.011
Neg logit contributions
side-0.023
fs-0.022
 salon-0.021
cakes-0.020
ves-0.020
Token rh
Token ID9529
Feature activation-6.08e-02

Pos logit contributions
 name+0.007
 comes+0.006
 know+0.006
 wonder+0.006
 can+0.006
Neg logit contributions
side-0.014
fs-0.013
 salon-0.012
ized-0.012
 therapist-0.012
Tokenin
Token ID259
Feature activation-1.97e-02

Pos logit contributions
 name+0.132
 can+0.126
 lives+0.124
 comes+0.123
 wants+0.123
Neg logit contributions
side-0.097
fs-0.084
 salon-0.081
cakes-0.076
liv-0.074
Tokenoc
Token ID420
Feature activation-1.51e-01

Pos logit contributions
 name+0.020
 comes+0.017
 wonder+0.016
 know+0.016
 can+0.016
Neg logit contributions
side-0.055
fs-0.051
 salon-0.049
ves-0.048
ized-0.048
Tokeneros
Token ID27498
Feature activation-7.82e-02

Pos logit contributions
 name+0.249
 know+0.240
 wonder+0.234
 can+0.230
 comes+0.215
Neg logit contributions
side-0.319
ized-0.285
fs-0.285
 salon-0.282
but-0.277
Token remembered
Token ID12086
Feature activation-8.50e-03

Pos logit contributions
 name+0.089
 can+0.082
 comes+0.080
 lives+0.079
 wonder+0.079
Neg logit contributions
side-0.205
fs-0.194
 salon-0.186
cakes-0.181
but-0.178
Token that
Token ID326
Feature activation+7.62e-03

Pos logit contributions
 name+0.009
 can+0.008
 know+0.008
 wonder+0.008
 comes+0.008
Neg logit contributions
side-0.023
fs-0.023
cakes-0.022
 salon-0.022
liv-0.021
Token life
Token ID1204
Feature activation-1.18e-02

Pos logit contributions
side+0.017
fs+0.015
 salon+0.014
cakes+0.014
liv+0.014
Neg logit contributions
 name-0.013
 can-0.012
 wonder-0.012
 lives-0.012
 comes-0.012
Token could
Token ID714
Feature activation-2.05e-02

Pos logit contributions
 can+0.012
 name+0.011
 comes+0.011
 lives+0.011
 wants+0.010
Neg logit contributions
side-0.034
 salon-0.031
fs-0.031
but-0.030
liv-0.029
Token She
Token ID1375
Feature activation+3.19e-02

Pos logit contributions
side+0.045
 salon+0.038
fs+0.038
cakes+0.037
but+0.037
Neg logit contributions
 name-0.032
 lives-0.030
 know-0.030
 comes-0.030
 knows-0.029
Token loved
Token ID6151
Feature activation+2.44e-02

Pos logit contributions
side+0.067
fs+0.062
ves+0.060
pron+0.059
 salon+0.058
Neg logit contributions
 name-0.048
 comes-0.043
 wonder-0.043
 names-0.043
 know-0.042
Token to
Token ID284
Feature activation+1.20e-02

Pos logit contributions
side+0.054
 salon+0.049
fs+0.049
cakes+0.047
but+0.046
Neg logit contributions
 name-0.037
 can-0.033
 wonder-0.033
 know-0.033
 comes-0.032
Token bake
Token ID28450
Feature activation+4.50e-02

Pos logit contributions
side+0.029
fs+0.027
 salon+0.027
brush+0.026
cakes+0.026
Neg logit contributions
 name-0.013
 lives-0.012
 comes-0.012
 wants-0.012
 wonders-0.012
Token cup
Token ID6508
Feature activation+4.68e-02

Pos logit contributions
side+0.088
fs+0.078
 salon+0.076
 therapist+0.072
pan+0.072
Neg logit contributions
 name-0.085
 comes-0.076
 can-0.076
 lives-0.075
 wonder-0.074
Tokencakes
Token ID37263
Feature activation+1.31e-01

Pos logit contributions
side+0.035
fs+0.022
 salon+0.021
pan+0.018
ized+0.018
Neg logit contributions
 name-0.145
 comes-0.135
 can-0.134
 names-0.132
 lives-0.131
Token with
Token ID351
Feature activation+3.29e-02

Pos logit contributions
side+0.305
fs+0.284
 salon+0.270
liv+0.264
 therapist+0.262
Neg logit contributions
 name-0.193
 can-0.169
 comes-0.167
 know-0.163
 lives-0.160
Token her
Token ID607
Feature activation+4.64e-03

Pos logit contributions
side+0.079
fs+0.074
 salon+0.072
liv+0.070
 therapist+0.068
Neg logit contributions
 name-0.048
 can-0.041
 names-0.041
 wonder-0.040
 know-0.040
Token mom
Token ID1995
Feature activation+1.14e-02

Pos logit contributions
side+0.012
fs+0.012
 salon+0.011
cakes+0.011
pron+0.011
Neg logit contributions
 name-0.005
 comes-0.004
 know-0.004
 wonder-0.004
 wants-0.004
Tokenmy
Token ID1820
Feature activation-7.20e-03

Pos logit contributions
side+0.030
fs+0.029
 salon+0.028
cakes+0.027
but+0.027
Neg logit contributions
 name-0.013
 can-0.011
 comes-0.011
 wonder-0.011
 know-0.010
Token.
Token ID13
Feature activation+1.93e-02

Pos logit contributions
 name+0.009
 can+0.008
 comes+0.007
 lives+0.007
 know+0.007
Neg logit contributions
side-0.018
fs-0.018
 salon-0.017
ves-0.017
liv-0.017
Token blue
Token ID4171
Feature activation-4.19e-03

Pos logit contributions
side+0.077
fs+0.072
pan+0.071
 salon+0.070
pron+0.068
Neg logit contributions
 name-0.038
 names-0.032
 wonder-0.032
 know-0.031
 can-0.031
Token wheels
Token ID13666
Feature activation+3.27e-02

Pos logit contributions
 name+0.006
 can+0.005
 wonder+0.005
 wants+0.005
 comes+0.005
Neg logit contributions
side-0.011
fs-0.010
liv-0.010
bly-0.010
cakes-0.010
Token and
Token ID290
Feature activation+5.86e-03

Pos logit contributions
fs+0.112
liv+0.112
side+0.110
pron+0.107
handed+0.106
Neg logit contributions
 name-0.014
 can-0.012
 is-0.008
 and-0.008
 be-0.005
Token pink
Token ID11398
Feature activation+5.23e-03

Pos logit contributions
side+0.015
fs+0.014
 salon+0.014
liv+0.013
pan+0.013
Neg logit contributions
 name-0.008
 can-0.007
 wants-0.007
 know-0.007
 wonder-0.007
Token handle
Token ID5412
Feature activation+3.42e-02

Pos logit contributions
side+0.014
liv+0.013
fs+0.013
pron+0.012
 salon+0.012
Neg logit contributions
 name-0.007
 can-0.006
 names-0.006
 comes-0.006
 wants-0.006
Tokenbars
Token ID34046
Feature activation+1.68e-01

Pos logit contributions
side+0.081
pron+0.076
fs+0.074
liv+0.071
handed+0.070
Neg logit contributions
 name-0.052
 can-0.052
 is-0.048
 comes-0.047
 names-0.045
Token.
Token ID13
Feature activation+1.94e-02

Pos logit contributions
side+0.514
fs+0.496
liv+0.486
pron+0.478
cakes+0.472
Neg logit contributions
 can-0.158
 name-0.147
 comes-0.127
 is-0.126
 wants-0.125
Token She
Token ID1375
Feature activation+2.53e-02

Pos logit contributions
side+0.050
 salon+0.045
fs+0.044
but+0.043
cakes+0.043
Neg logit contributions
 name-0.022
 comes-0.021
 lives-0.020
 know-0.020
 wants-0.019
Token liked
Token ID8288
Feature activation+3.31e-02

Pos logit contributions
side+0.069
fs+0.065
cakes+0.061
 salon+0.061
but+0.060
Neg logit contributions
 name-0.028
 can-0.026
 wants-0.025
 wonder-0.024
 lives-0.024
Token to
Token ID284
Feature activation+5.58e-03

Pos logit contributions
side+0.091
fs+0.087
liv+0.081
ves+0.080
cakes+0.079
Neg logit contributions
 name-0.035
 can-0.032
 comes-0.031
 wants-0.030
 wonder-0.029
Token go
Token ID467
Feature activation-1.85e-02

Pos logit contributions
side+0.014
fs+0.013
 salon+0.012
cakes+0.012
but+0.012
Neg logit contributions
 name-0.008
 know-0.007
 wonder-0.007
 can-0.007
 wants-0.007
Token Sue
Token ID26089
Feature activation+1.69e-02

Pos logit contributions
 name+0.003
 wonder+0.002
 wants+0.002
 comes+0.002
 can+0.002
Neg logit contributions
side-0.004
fs-0.004
 salon-0.004
cakes-0.003
ized-0.003
Token.
Token ID13
Feature activation-9.64e-03

Pos logit contributions
side+0.040
fs+0.037
 salon+0.036
cakes+0.035
but+0.035
Neg logit contributions
 name-0.023
 wonder-0.021
 comes-0.020
 lives-0.020
 can-0.020
Token They
Token ID1119
Feature activation-2.30e-02

Pos logit contributions
 name+0.008
 comes+0.008
 know+0.008
 lives+0.007
 wonder+0.007
Neg logit contributions
side-0.027
fs-0.024
cakes-0.024
but-0.023
 salon-0.023
Token leave
Token ID2666
Feature activation+1.52e-03

Pos logit contributions
 name+0.023
 comes+0.020
 wants+0.020
 lives+0.019
 wonder+0.019
Neg logit contributions
side-0.062
fs-0.057
 salon-0.057
brush-0.054
cakes-0.054
Token the
Token ID262
Feature activation-5.75e-03

Pos logit contributions
side+0.004
fs+0.004
 salon+0.004
but+0.004
cakes+0.004
Neg logit contributions
 name-0.002
 can-0.001
 comes-0.001
 wonder-0.001
 lives-0.001
Token salon
Token ID46608
Feature activation+2.06e-01

Pos logit contributions
 name+0.010
 lives+0.009
 can+0.009
 wonders+0.009
 wonder+0.009
Neg logit contributions
side-0.012
fs-0.012
ves-0.011
cakes-0.011
ized-0.011
Token with
Token ID351
Feature activation+2.30e-02

Pos logit contributions
side+0.495
fs+0.479
liv+0.457
 therapist+0.441
ized+0.436
Neg logit contributions
 name-0.271
 lives-0.239
 know-0.239
 comes-0.237
 can-0.237
Token big
Token ID1263
Feature activation-1.24e-02

Pos logit contributions
side+0.067
fs+0.065
but+0.062
 salon+0.061
ves+0.060
Neg logit contributions
 name-0.021
 wonder-0.018
 can-0.018
 lives-0.017
 know-0.016
Token smiles
Token ID21845
Feature activation-2.51e-02

Pos logit contributions
 name+0.014
 can+0.012
 comes+0.012
 wonder+0.012
 know+0.012
Neg logit contributions
side-0.033
fs-0.031
 salon-0.030
cakes-0.028
but-0.028
Token.
Token ID13
Feature activation-9.95e-03

Pos logit contributions
 name+0.034
 can+0.030
 comes+0.030
 lives+0.027
 wants+0.027
Neg logit contributions
side-0.061
fs-0.060
 salon-0.058
liv-0.058
cakes-0.056
Token Lily
Token ID20037
Feature activation-2.23e-03

Pos logit contributions
 comes+0.010
 know+0.010
 lives+0.010
rr+0.010
 wants+0.009
Neg logit contributions
side-0.025
cakes-0.022
fs-0.021
cake-0.021
but-0.021
Token day
Token ID1110
Feature activation+1.75e-02

Pos logit contributions
 name+0.054
 can+0.047
 lives+0.047
 comes+0.047
 wonder+0.046
Neg logit contributions
side-0.120
fs-0.116
 salon-0.111
cakes-0.109
ves-0.108
Token,
Token ID11
Feature activation+2.16e-02

Pos logit contributions
cakes+0.050
liv+0.049
fs+0.049
 salon+0.048
cake+0.047
Neg logit contributions
 name-0.013
 can-0.010
 know-0.010
 wonder-0.009
 wonders-0.009
Token working
Token ID1762
Feature activation-9.71e-03

Pos logit contributions
side+0.052
fs+0.051
cakes+0.049
quet+0.048
brush+0.047
Neg logit contributions
 name-0.029
 can-0.027
 wants-0.027
 wonder-0.027
 comes-0.026
Token at
Token ID379
Feature activation+1.81e-02

Pos logit contributions
 can+0.011
 name+0.011
 comes+0.010
 lives+0.009
 is+0.009
Neg logit contributions
side-0.024
 salon-0.023
fs-0.023
cakes-0.023
liv-0.023
Token his
Token ID465
Feature activation+7.82e-03

Pos logit contributions
side+0.044
 salon+0.040
fs+0.039
but+0.037
liv+0.037
Neg logit contributions
 name-0.025
 comes-0.022
 can-0.022
 wants-0.021
 wonder-0.021
Token salon
Token ID46608
Feature activation+2.02e-01

Pos logit contributions
side+0.018
fs+0.017
cakes+0.015
but+0.015
 salon+0.015
Neg logit contributions
 name-0.012
 lives-0.011
 can-0.010
 wonder-0.010
 wants-0.010
Token and
Token ID290
Feature activation+7.40e-03

Pos logit contributions
side+0.527
fs+0.522
cakes+0.502
cake+0.495
liv+0.493
Neg logit contributions
 name-0.214
 lives-0.191
 can-0.190
 wants-0.184
 comes-0.179
Token running
Token ID2491
Feature activation+1.13e-02

Pos logit contributions
side+0.017
fs+0.017
cakes+0.016
quet+0.016
ves+0.016
Neg logit contributions
 name-0.010
 can-0.010
 wants-0.010
 know-0.009
 lives-0.009
Token err
Token ID11454
Feature activation+6.22e-02

Pos logit contributions
side+0.027
fs+0.026
 salon+0.025
cakes+0.025
liv+0.025
Neg logit contributions
 name-0.015
 can-0.014
 comes-0.013
 know-0.013
 lives-0.013
Tokenands
Token ID1746
Feature activation+7.87e-03

Pos logit contributions
side+0.107
fs+0.099
 salon+0.093
cakes+0.090
ized+0.087
Neg logit contributions
 name-0.120
 know-0.116
 wonder-0.113
 lives-0.112
 comes-0.112
Token.
Token ID13
Feature activation+2.10e-02

Pos logit contributions
cakes+0.020
liv+0.020
fs+0.020
cake+0.020
side+0.020
Neg logit contributions
 can-0.008
 name-0.007
 comes-0.007
 know-0.006
 lives-0.006
Token I
Token ID314
Feature activation+1.44e-02

Pos logit contributions
side+0.066
 salon+0.062
fs+0.062
brush+0.059
but+0.058
Neg logit contributions
 name-0.037
 know-0.034
 wonder-0.034
 comes-0.033
 lives-0.033
Token can
Token ID460
Feature activation-5.28e-02

Pos logit contributions
side+0.037
 salon+0.036
fs+0.035
brush+0.034
liv+0.034
Neg logit contributions
 name-0.012
 lives-0.010
 names-0.010
 comes-0.010
 X-0.010
Token wear
Token ID5806
Feature activation+4.19e-03

Pos logit contributions
 name+0.080
 can+0.073
 know+0.071
 wonder+0.068
 comes+0.067
Neg logit contributions
side-0.127
fs-0.116
 salon-0.109
cakes-0.106
ized-0.105
Token my
Token ID616
Feature activation+1.67e-02

Pos logit contributions
side+0.012
fs+0.011
 salon+0.011
but+0.011
cake+0.011
Neg logit contributions
 name-0.004
 can-0.004
 comes-0.003
 lives-0.003
 wonder-0.003
Token rain
Token ID6290
Feature activation+3.03e-02

Pos logit contributions
side+0.038
fs+0.035
 salon+0.033
but+0.032
ized+0.032
Neg logit contributions
 name-0.027
 can-0.024
 wonder-0.023
 lives-0.023
 comes-0.023
Tokencoat
Token ID31434
Feature activation+1.54e-01

Pos logit contributions
side+0.038
fs+0.036
ized+0.032
ves+0.031
 therapist+0.031
Neg logit contributions
 name-0.074
 can-0.073
 comes-0.070
 names-0.068
 wants-0.068
Token and
Token ID290
Feature activation+4.37e-04

Pos logit contributions
fs+0.462
side+0.455
ves+0.450
quet+0.427
liv+0.425
Neg logit contributions
 can-0.119
 name-0.105
 is-0.083
 wants-0.080
 comes-0.076
Token boots
Token ID14412
Feature activation+3.21e-02

Pos logit contributions
side+0.001
fs+0.001
 salon+0.001
ized+0.001
but+0.001
Neg logit contributions
 name-0.001
 can-0.001
 comes-0.001
 wants-0.001
 know-0.001
Token.
Token ID13
Feature activation-9.83e-04

Pos logit contributions
fs+0.089
side+0.088
ves+0.086
liv+0.083
ized+0.082
Neg logit contributions
 can-0.033
 name-0.027
 comes-0.024
 is-0.023
 wants-0.023
Token Please
Token ID4222
Feature activation+1.44e-02

Pos logit contributions
 comes+0.001
 name+0.001
 know+0.001
 lives+0.001
 wonder+0.001
Neg logit contributions
side-0.002
fs-0.002
 salon-0.002
brush-0.002
but-0.002
Token mom
Token ID1995
Feature activation+9.01e-04

Pos logit contributions
side+0.034
fs+0.033
 salon+0.032
brush+0.031
ves+0.030
Neg logit contributions
 name-0.016
 wonder-0.016
 wonders-0.016
 lives-0.016
 wants-0.015
Token and
Token ID290
Feature activation+1.95e-03

Pos logit contributions
 name+0.001
 comes+0.001
 lives+0.001
 wonder+0.001
 wants+0.001
Neg logit contributions
side-0.001
fs-0.001
 salon-0.001
but-0.001
ized-0.001
Token her
Token ID607
Feature activation-7.40e-04

Pos logit contributions
side+0.005
fs+0.004
 salon+0.004
cakes+0.004
but+0.004
Neg logit contributions
 name-0.003
 comes-0.003
 can-0.003
 wonder-0.003
 know-0.003
Token mom
Token ID1995
Feature activation+6.16e-03

Pos logit contributions
 name+0.001
 comes+0.001
 know+0.001
 can+0.001
 wants+0.001
Neg logit contributions
side-0.002
fs-0.002
 salon-0.002
cakes-0.002
brush-0.002
Tokenmy
Token ID1820
Feature activation-1.37e-02

Pos logit contributions
side+0.016
fs+0.015
 salon+0.014
but+0.014
cakes+0.014
Neg logit contributions
 name-0.007
 lives-0.006
 wonder-0.006
 comes-0.006
 wants-0.006
Token knew
Token ID2993
Feature activation-1.07e-02

Pos logit contributions
 name+0.017
 can+0.015
 comes+0.015
 wonder+0.015
 lives+0.015
Neg logit contributions
side-0.034
fs-0.032
 salon-0.031
but-0.030
cakes-0.030
Token something
Token ID1223
Feature activation+4.54e-02

Pos logit contributions
 can+0.015
 comes+0.015
 name+0.014
 lives+0.013
 Sam+0.013
Neg logit contributions
side-0.028
fs-0.023
 salon-0.023
but-0.022
ized-0.022
Token was
Token ID373
Feature activation+4.17e-03

Pos logit contributions
side+0.133
 salon+0.118
fs+0.118
ized+0.111
quet+0.109
Neg logit contributions
 can-0.050
 comes-0.048
 name-0.046
 is-0.044
 wants-0.043
Token wrong
Token ID2642
Feature activation+6.12e-02

Pos logit contributions
side+0.007
fs+0.007
 salon+0.007
cakes+0.006
 therapist+0.006
Neg logit contributions
 name-0.009
 comes-0.008
 can-0.008
 wants-0.008
 lives-0.008
Token.
Token ID13
Feature activation+1.16e-02

Pos logit contributions
side+0.178
fs+0.174
 salon+0.171
cakes+0.170
liv+0.166
Neg logit contributions
 name-0.053
 can-0.049
 comes-0.042
 wonder-0.040
 is-0.040
Token
Token ID220
Feature activation-4.46e-03

Pos logit contributions
side+0.032
fs+0.029
 salon+0.028
but+0.028
cakes+0.028
Neg logit contributions
 name-0.012
 comes-0.010
 lives-0.010
 wonder-0.010
 can-0.010
Token
Token ID198
Feature activation+1.66e-02

Pos logit contributions
 name+0.004
 wonder+0.004
 lives+0.004
 comes+0.004
 wants+0.004
Neg logit contributions
side-0.012
fs-0.012
but-0.011
 salon-0.011
ves-0.011
Token Tim
Token ID5045
Feature activation+1.87e-02

Pos logit contributions
 wonder+0.007
 hopes+0.007
 name+0.007
 wonders+0.006
 lives+0.006
Neg logit contributions
side-0.008
cakes-0.007
fs-0.007
but-0.007
 salon-0.007
Tokenmy
Token ID1820
Feature activation+1.40e-03

Pos logit contributions
side+0.040
 salon+0.035
but+0.034
fs+0.034
brush+0.033
Neg logit contributions
 name-0.028
 comes-0.025
 lives-0.024
 wonder-0.024
 names-0.023
Token.
Token ID13
Feature activation+2.97e-02

Pos logit contributions
side+0.003
fs+0.002
but+0.002
 salon+0.002
ized+0.002
Neg logit contributions
 name-0.003
 comes-0.002
 can-0.002
 wonder-0.002
 names-0.002
Token Tim
Token ID5045
Feature activation+2.13e-02

Pos logit contributions
side+0.060
cakes+0.054
but+0.051
cake+0.050
whe+0.050
Neg logit contributions
 hopes-0.045
 lives-0.045
 knows-0.045
 name-0.044
 remembers-0.044
Tokenmy
Token ID1820
Feature activation+1.08e-03

Pos logit contributions
side+0.041
liv+0.036
fs+0.034
he+0.034
brush+0.034
Neg logit contributions
 name-0.035
 comes-0.031
 Don-0.031
 names-0.031
 wonder-0.030
Token loved
Token ID6151
Feature activation+3.25e-02

Pos logit contributions
side+0.002
fs+0.002
 salon+0.002
ves+0.002
liv+0.002
Neg logit contributions
 name-0.002
 comes-0.001
 wonder-0.001
 names-0.001
 know-0.001
Token to
Token ID284
Feature activation+1.85e-02

Pos logit contributions
side+0.072
fs+0.065
 salon+0.065
cakes+0.062
but+0.061
Neg logit contributions
 name-0.049
 wonder-0.044
 can-0.044
 know-0.044
 comes-0.043
Token play
Token ID711
Feature activation+2.98e-02

Pos logit contributions
side+0.048
fs+0.045
 salon+0.044
brush+0.043
cakes+0.043
Neg logit contributions
 name-0.018
 lives-0.017
 comes-0.017
 wants-0.017
 wonders-0.017
Token with
Token ID351
Feature activation+3.86e-02

Pos logit contributions
side+0.065
fs+0.059
 salon+0.057
liv+0.054
cakes+0.054
Neg logit contributions
 name-0.049
 can-0.044
 comes-0.043
 lives-0.042
 wonder-0.042
Token blocks
Token ID7021
Feature activation+6.90e-02

Pos logit contributions
side+0.105
fs+0.096
 salon+0.094
but+0.091
liv+0.090
Neg logit contributions
 name-0.045
 can-0.038
 comes-0.038
 lives-0.038
 wonder-0.037
Token.
Token ID13
Feature activation+2.66e-02

Pos logit contributions
side+0.205
liv+0.199
fs+0.199
 therapist+0.197
pron+0.190
Neg logit contributions
 can-0.053
 name-0.046
 is-0.040
 comes-0.040
 know-0.035
Token see
Token ID766
Feature activation-3.03e-02

Pos logit contributions
side+0.024
fs+0.023
 salon+0.023
ves+0.022
cakes+0.022
Neg logit contributions
 name-0.008
 wants-0.008
 can-0.008
 lives-0.008
 wonders-0.008
Token what
Token ID644
Feature activation-1.01e-02

Pos logit contributions
 name+0.030
 wonder+0.026
 know+0.026
 can+0.025
 lives+0.023
Neg logit contributions
side-0.090
fs-0.083
 salon-0.079
liv-0.077
ider-0.077
Token surprise
Token ID5975
Feature activation+1.46e-02

Pos logit contributions
 name+0.012
 can+0.012
 lives+0.012
 comes+0.011
 wonder+0.011
Neg logit contributions
side-0.027
fs-0.025
but-0.024
ized-0.024
 salon-0.024
Token her
Token ID607
Feature activation-1.92e-03

Pos logit contributions
side+0.036
fs+0.033
 salon+0.032
but+0.031
ized+0.031
Neg logit contributions
 name-0.020
 can-0.019
 comes-0.019
 lives-0.018
 wonder-0.017
Token parents
Token ID3397
Feature activation-8.80e-03

Pos logit contributions
 name+0.002
 comes+0.002
 can+0.002
 wonder+0.002
 know+0.002
Neg logit contributions
side-0.005
fs-0.005
 salon-0.005
cakes-0.004
ves-0.004
Token were
Token ID547
Feature activation+4.13e-02

Pos logit contributions
 name+0.011
 can+0.010
 comes+0.010
 wonder+0.010
 lives+0.010
Neg logit contributions
side-0.022
fs-0.020
 salon-0.019
but-0.019
cakes-0.019
Token about
Token ID546
Feature activation+6.35e-03

Pos logit contributions
side+0.102
fs+0.097
 salon+0.092
liv+0.092
 therapist+0.091
Neg logit contributions
 name-0.053
 comes-0.047
 know-0.047
 can-0.047
 wants-0.046
Token to
Token ID284
Feature activation+9.79e-03

Pos logit contributions
side+0.017
fs+0.016
 salon+0.016
liv+0.015
cakes+0.015
Neg logit contributions
 name-0.007
 can-0.007
 lives-0.006
 wonder-0.006
 comes-0.006
Token reveal
Token ID7766
Feature activation+1.19e-02

Pos logit contributions
side+0.029
fs+0.027
 salon+0.026
but+0.026
cakes+0.025
Neg logit contributions
 name-0.010
 know-0.008
 wonder-0.008
 can-0.008
 comes-0.008
Token.
Token ID13
Feature activation+3.20e-03

Pos logit contributions
side+0.032
 salon+0.031
fs+0.029
liv+0.029
wheel+0.029
Neg logit contributions
 name-0.013
 can-0.012
 lives-0.011
 wonder-0.011
 names-0.011
Token When
Token ID1649
Feature activation-1.25e-02

Pos logit contributions
side+0.008
fs+0.008
 salon+0.007
but+0.007
cakes+0.007
Neg logit contributions
 name-0.004
 comes-0.003
 can-0.003
 wonder-0.003
 know-0.003
Token.
Token ID13
Feature activation+2.88e-02

Pos logit contributions
side+0.072
fs+0.071
cakes+0.067
 salon+0.065
liv+0.065
Neg logit contributions
 name-0.017
 can-0.013
 wonder-0.013
 The-0.012
 is-0.012
Token They
Token ID1119
Feature activation+3.05e-03

Pos logit contributions
side+0.079
fs+0.070
 salon+0.070
cakes+0.070
liv+0.068
Neg logit contributions
 name-0.025
 lives-0.023
 can-0.023
 comes-0.022
 know-0.022
Token like
Token ID588
Feature activation+9.49e-04

Pos logit contributions
side+0.008
fs+0.007
 salon+0.007
but+0.007
cakes+0.007
Neg logit contributions
 name-0.004
 comes-0.003
 lives-0.003
 wonder-0.003
 can-0.003
Token to
Token ID284
Feature activation+6.96e-03

Pos logit contributions
side+0.002
fs+0.002
 salon+0.002
but+0.002
cakes+0.002
Neg logit contributions
 name-0.001
 comes-0.001
 can-0.001
 wonder-0.001
 know-0.001
Token play
Token ID711
Feature activation+1.62e-02

Pos logit contributions
side+0.021
fs+0.020
 salon+0.019
cakes+0.018
but+0.018
Neg logit contributions
 name-0.005
 comes-0.004
 can-0.004
 wonder-0.004
 lives-0.004
Token with
Token ID351
Feature activation+3.02e-02

Pos logit contributions
side+0.036
 salon+0.033
fs+0.033
cakes+0.031
liv+0.031
Neg logit contributions
 name-0.026
 can-0.024
 lives-0.023
 comes-0.023
 wonder-0.022
Token a
Token ID257
Feature activation+8.31e-03

Pos logit contributions
side+0.070
fs+0.064
 salon+0.062
but+0.059
cakes+0.059
Neg logit contributions
 name-0.045
 can-0.040
 comes-0.040
 wonder-0.040
 lives-0.039
Token basketball
Token ID9669
Feature activation+6.41e-02

Pos logit contributions
side+0.021
fs+0.019
 salon+0.019
but+0.018
cakes+0.018
Neg logit contributions
 name-0.011
 can-0.009
 comes-0.009
 wonder-0.009
 lives-0.009
Token.
Token ID13
Feature activation+2.48e-02

Pos logit contributions
side+0.181
liv+0.179
fs+0.173
inge+0.162
isin+0.162
Neg logit contributions
 can-0.067
 name-0.057
 wants-0.054
 lives-0.052
 comes-0.052
Token They
Token ID1119
Feature activation+1.28e-03

Pos logit contributions
side+0.057
 salon+0.052
cakes+0.051
fs+0.051
ves+0.049
Neg logit contributions
 lives-0.027
 name-0.027
 craw-0.026
 knows-0.026
 comes-0.026
Token bounce
Token ID20110
Feature activation+5.24e-02

Pos logit contributions
side+0.003
fs+0.003
 salon+0.003
cakes+0.003
brush+0.003
Neg logit contributions
 name-0.001
 comes-0.001
 lives-0.001
 wonder-0.001
 know-0.001
Token back
Token ID736
Feature activation+1.03e-02

Pos logit contributions
side+0.149
fs+0.133
 salon+0.130
pan+0.126
liv+0.124
Neg logit contributions
 name-0.060
 know-0.050
 can-0.049
 wants-0.046
 names-0.045
Token my
Token ID616
Feature activation+6.61e-02

Pos logit contributions
side+0.025
fs+0.023
 salon+0.023
but+0.022
cakes+0.022
Neg logit contributions
 name-0.013
 wonder-0.012
 comes-0.012
 can-0.012
 lives-0.011
Token hat
Token ID6877
Feature activation+5.11e-02

Pos logit contributions
side+0.185
fs+0.169
 salon+0.161
but+0.158
liv+0.156
Neg logit contributions
 name-0.076
 can-0.064
 know-0.062
 wants-0.061
 wonder-0.061
Token!"
Token ID2474
Feature activation+2.21e-02

Pos logit contributions
side+0.153
fs+0.142
liv+0.134
ized+0.132
 salon+0.131
Neg logit contributions
 name-0.051
 can-0.046
 comes-0.039
 wants-0.039
 is-0.038
Token Lily
Token ID20037
Feature activation+1.23e-02

Pos logit contributions
side+0.071
fs+0.068
 salon+0.064
ized+0.063
ier+0.062
Neg logit contributions
 name-0.016
 know-0.012
 They-0.011
 can-0.011
 comes-0.011
Token shouted
Token ID17293
Feature activation+4.55e-02

Pos logit contributions
side+0.034
fs+0.032
 salon+0.029
cakes+0.029
but+0.029
Neg logit contributions
 name-0.014
 wants-0.013
 comes-0.013
 can-0.013
 know-0.012
Token.
Token ID13
Feature activation-9.53e-03

Pos logit contributions
side+0.122
fs+0.114
 salon+0.111
liv+0.107
cakes+0.106
Neg logit contributions
 name-0.057
 comes-0.047
 can-0.047
 know-0.045
 wants-0.045
Token
Token ID198
Feature activation-1.17e-02

Pos logit contributions
 name+0.010
 lives+0.009
 wonder+0.009
 know+0.009
 comes+0.008
Neg logit contributions
side-0.025
fs-0.024
 salon-0.023
but-0.023
cakes-0.022
Token
Token ID198
Feature activation-1.28e-02

Pos logit contributions
 name+0.012
 lives+0.011
 comes+0.011
 wants+0.011
 can+0.011
Neg logit contributions
side-0.031
fs-0.028
ves-0.027
but-0.027
 salon-0.027
Token"
Token ID1
Feature activation-3.10e-03

Pos logit contributions
 name+0.020
 know+0.019
 wonder+0.018
 can+0.018
 wants+0.018
Neg logit contributions
side-0.028
fs-0.027
cakes-0.025
 salon-0.025
pan-0.024
TokenCome
Token ID16773
Feature activation+1.52e-02

Pos logit contributions
 name+0.005
 wonder+0.004
 wants+0.004
 can+0.004
 Don+0.004
Neg logit contributions
side-0.007
fs-0.007
ier-0.006
cakes-0.006
pan-0.006
Token sad
Token ID6507
Feature activation+1.92e-02

Pos logit contributions
side+0.066
fs+0.061
 salon+0.060
cakes+0.057
but+0.056
Neg logit contributions
 name-0.030
 know-0.026
 wonder-0.026
 can-0.026
 comes-0.026
Token.
Token ID13
Feature activation+3.33e-03

Pos logit contributions
side+0.056
fs+0.054
 salon+0.052
pron+0.051
liv+0.051
Neg logit contributions
 can-0.018
 name-0.018
 comes-0.015
 is-0.014
 wants-0.014
Token His
Token ID2399
Feature activation+3.64e-02

Pos logit contributions
side+0.008
but+0.007
fs+0.007
 salon+0.007
cakes+0.007
Neg logit contributions
 lives-0.004
 name-0.004
 comes-0.004
 know-0.003
 wonder-0.003
Token bike
Token ID7161
Feature activation+1.73e-02

Pos logit contributions
side+0.099
fs+0.092
 salon+0.089
but+0.086
cakes+0.086
Neg logit contributions
 name-0.040
 can-0.034
 wonder-0.033
 comes-0.033
 lives-0.033
Token was
Token ID373
Feature activation-5.77e-03

Pos logit contributions
side+0.052
fs+0.050
cakes+0.047
pan+0.047
but+0.046
Neg logit contributions
 can-0.014
 comes-0.013
 name-0.013
 wants-0.012
 lives-0.012
Token gone
Token ID3750
Feature activation-4.85e-03

Pos logit contributions
 name+0.007
 can+0.006
 wonder+0.006
 be+0.006
 know+0.006
Neg logit contributions
fs-0.015
side-0.015
cakes-0.015
ves-0.014
 therapist-0.014
Token!
Token ID0
Feature activation-1.87e-03

Pos logit contributions
 can+0.005
 name+0.005
 comes+0.004
 wonder+0.004
 lives+0.004
Neg logit contributions
side-0.013
fs-0.013
cakes-0.012
 salon-0.012
ves-0.012
Token He
Token ID679
Feature activation-3.96e-03

Pos logit contributions
 lives+0.002
 name+0.002
 comes+0.002
 wonder+0.002
 wants+0.002
Neg logit contributions
side-0.005
but-0.004
fs-0.004
cakes-0.004
wheel-0.004
Token knew
Token ID2993
Feature activation-2.05e-02

Pos logit contributions
 name+0.005
 comes+0.005
 names+0.005
 know+0.005
 lives+0.004
Neg logit contributions
side-0.009
fs-0.008
 salon-0.008
ized-0.008
 therapist-0.008
Token a
Token ID257
Feature activation-9.84e-03

Pos logit contributions
 can+0.033
 name+0.032
 Sam+0.031
 know+0.030
 comes+0.029
Neg logit contributions
side-0.048
fs-0.042
cakes-0.042
cake-0.041
 salon-0.040
Token thief
Token ID25906
Feature activation-2.86e-03

Pos logit contributions
 name+0.011
 can+0.010
 wonder+0.010
 comes+0.010
 know+0.010
Neg logit contributions
side-0.026
fs-0.024
 salon-0.023
cakes-0.023
ized-0.023
Token best
Token ID1266
Feature activation+4.54e-03

Pos logit contributions
 name+0.010
 can+0.009
 wonder+0.009
 lives+0.009
 names+0.008
Neg logit contributions
side-0.017
fs-0.016
 salon-0.015
ves-0.015
ized-0.015
Token day
Token ID1110
Feature activation+1.01e-02

Pos logit contributions
side+0.012
fs+0.011
 salon+0.011
but+0.010
liv+0.010
Neg logit contributions
 name-0.006
 lives-0.005
 comes-0.005
 wonder-0.005
 can-0.005
Token ever
Token ID1683
Feature activation-7.87e-02

Pos logit contributions
side+0.026
fs+0.025
 salon+0.025
pan+0.024
liv+0.024
Neg logit contributions
 comes-0.011
 can-0.010
 lives-0.010
 name-0.010
 know-0.009
Token.
Token ID13
Feature activation-3.76e-03

Pos logit contributions
 name+0.097
 can+0.091
 comes+0.090
 lives+0.088
 wants+0.087
Neg logit contributions
side-0.197
fs-0.189
 salon-0.184
brush-0.174
cakes-0.173
Token<|endoftext|>
Token ID50256
Feature activation+2.22e-02

Pos logit contributions
 name+0.004
 comes+0.004
 wonder+0.003
 wants+0.003
 can+0.003
Neg logit contributions
side-0.010
fs-0.009
but-0.009
 salon-0.009
cakes-0.009
TokenOne
Token ID3198
Feature activation-2.51e-02

Pos logit contributions
side+0.058
fs+0.052
but+0.050
cakes+0.049
 salon+0.049
Neg logit contributions
 name-0.024
 know-0.022
 wonder-0.022
 wants-0.022
 lives-0.021
Token day
Token ID1110
Feature activation+1.78e-02

Pos logit contributions
 name+0.031
 know+0.027
 wonder+0.027
 can+0.027
 lives+0.027
Neg logit contributions
side-0.066
fs-0.062
 salon-0.059
but-0.058
pan-0.058
Token,
Token ID11
Feature activation+2.49e-02

Pos logit contributions
side+0.059
 salon+0.057
fs+0.057
brush+0.054
cakes+0.054
Neg logit contributions
 name-0.009
 can-0.008
 comes-0.008
 wants-0.005
 lives-0.005
Token a
Token ID257
Feature activation+8.22e-03

Pos logit contributions
side+0.073
fs+0.071
 salon+0.070
cakes+0.067
brush+0.066
Neg logit contributions
 name-0.023
 can-0.019
 wonder-0.018
 lives-0.017
 know-0.017
Token little
Token ID1310
Feature activation-3.82e-02

Pos logit contributions
side+0.020
pron+0.019
 salon+0.018
fs+0.018
 therapist+0.018
Neg logit contributions
 comes-0.009
 know-0.009
 name-0.009
 wants-0.009
 wonders-0.008
Token girl
Token ID2576
Feature activation-2.65e-02

Pos logit contributions
 name+0.041
 know+0.036
 comes+0.034
 wants+0.034
 wonder+0.034
Neg logit contributions
side-0.100
 salon-0.093
fs-0.092
 therapist-0.088
ized-0.088
Token his
Token ID465
Feature activation+1.71e-02

Pos logit contributions
side+0.092
fs+0.086
 salon+0.082
ves+0.079
cakes+0.079
Neg logit contributions
 name-0.054
 comes-0.048
 wonder-0.048
 can-0.048
 know-0.047
Token ball
Token ID2613
Feature activation+4.79e-02

Pos logit contributions
side+0.035
fs+0.035
ves+0.032
 salon+0.030
ized+0.030
Neg logit contributions
 comes-0.025
 wonder-0.023
 know-0.023
 name-0.023
 Don-0.022
Token.
Token ID13
Feature activation+2.72e-02

Pos logit contributions
side+0.135
liv+0.126
fs+0.126
 salon+0.124
pron+0.120
Neg logit contributions
 can-0.048
 name-0.048
 lives-0.040
 comes-0.039
 wants-0.038
Token One
Token ID1881
Feature activation-3.74e-02

Pos logit contributions
side+0.069
 salon+0.062
fs+0.062
ves+0.059
but+0.059
Neg logit contributions
 name-0.028
 lives-0.028
 wonder-0.028
 know-0.027
 comes-0.027
Token day
Token ID1110
Feature activation+2.54e-02

Pos logit contributions
 name+0.064
 comes+0.059
 imag+0.059
 wonders+0.058
 names+0.058
Neg logit contributions
side-0.074
fs-0.065
ves-0.061
 salon-0.061
liv-0.060
Token,
Token ID11
Feature activation+2.22e-02

Pos logit contributions
side+0.068
fs+0.064
 salon+0.062
cakes+0.060
but+0.060
Neg logit contributions
 name-0.028
 can-0.024
 comes-0.024
 lives-0.023
 wonder-0.023
Token Spot
Token ID15899
Feature activation+1.88e-02

Pos logit contributions
side+0.055
 salon+0.051
fs+0.050
cakes+0.048
ized+0.048
Neg logit contributions
 name-0.030
 can-0.028
 comes-0.026
 wonder-0.025
 wants-0.025
Token's
Token ID338
Feature activation-2.09e-02

Pos logit contributions
side+0.039
ves+0.034
fs+0.033
 salon+0.033
 therapist+0.033
Neg logit contributions
 name-0.029
 wonder-0.026
 names-0.025
 know-0.024
 Don-0.024
Token ball
Token ID2613
Feature activation+4.56e-02

Pos logit contributions
 name+0.028
 comes+0.027
 wonder+0.026
 know+0.026
 lives+0.026
Neg logit contributions
side-0.048
fs-0.044
 salon-0.043
cakes-0.041
but-0.041
Token rolled
Token ID11686
Feature activation+5.05e-02

Pos logit contributions
side+0.115
fs+0.107
 salon+0.104
cakes+0.099
but+0.099
Neg logit contributions
 name-0.057
 can-0.055
 comes-0.054
 wants-0.052
 lives-0.051
Token under
Token ID739
Feature activation-1.27e-02

Pos logit contributions
side+0.115
fs+0.106
 salon+0.102
but+0.098
cakes+0.097
Neg logit contributions
 name-0.076
 comes-0.067
 can-0.067
 wonder-0.067
 know-0.067
Token,
Token ID11
Feature activation+2.74e-02

Pos logit contributions
 name+0.034
 wonder+0.030
 wonders+0.030
 know+0.030
 lives+0.029
Neg logit contributions
side-0.051
fs-0.047
 salon-0.045
but-0.044
ves-0.044
Token there
Token ID612
Feature activation-3.02e-02

Pos logit contributions
side+0.049
fs+0.044
 salon+0.041
but+0.040
ized+0.039
Neg logit contributions
 name-0.053
 comes-0.049
 know-0.048
 wonder-0.048
 lives-0.048
Token was
Token ID373
Feature activation+1.22e-02

Pos logit contributions
 name+0.048
 know+0.043
 names+0.043
rr+0.043
 wonders+0.042
Neg logit contributions
side-0.059
liv-0.053
ves-0.053
fs-0.053
 salon-0.051
Token a
Token ID257
Feature activation+1.23e-02

Pos logit contributions
side+0.031
fs+0.029
 salon+0.028
cakes+0.027
but+0.027
Neg logit contributions
 name-0.015
 comes-0.013
 can-0.013
 know-0.013
 wants-0.013
Token little
Token ID1310
Feature activation-3.73e-02

Pos logit contributions
pron+0.028
 salon+0.026
side+0.025
 therapist+0.024
fs+0.023
Neg logit contributions
 comes-0.016
 know-0.016
 imag-0.015
 names-0.015
 wants-0.015
Token girl
Token ID2576
Feature activation-2.48e-02

Pos logit contributions
 name+0.044
 know+0.040
 comes+0.038
 names+0.038
 wants+0.037
Neg logit contributions
side-0.091
 salon-0.087
fs-0.084
 therapist-0.082
brush-0.081
Token named
Token ID3706
Feature activation-1.17e-02

Pos logit contributions
 name+0.035
 comes+0.030
 can+0.030
 wonder+0.030
 know+0.030
Neg logit contributions
side-0.059
fs-0.054
 salon-0.053
but-0.050
cakes-0.050
Token Lily
Token ID20037
Feature activation+1.51e-02

Pos logit contributions
 name+0.017
 wonder+0.016
 lives+0.016
 comes+0.015
 wonders+0.015
Neg logit contributions
side-0.027
 salon-0.024
fs-0.024
cakes-0.023
cake-0.022
Token.
Token ID13
Feature activation+2.58e-02

Pos logit contributions
side+0.031
fs+0.028
 salon+0.027
but+0.026
ized+0.026
Neg logit contributions
 name-0.026
 comes-0.023
 can-0.023
 wonder-0.022
 lives-0.022
Token She
Token ID1375
Feature activation+2.80e-02

Pos logit contributions
side+0.056
 salon+0.048
fs+0.047
cakes+0.047
brush+0.045
Neg logit contributions
 name-0.040
 lives-0.038
 comes-0.038
 know-0.037
 wants-0.036
Token loved
Token ID6151
Feature activation+2.60e-02

Pos logit contributions
side+0.065
fs+0.061
 salon+0.058
ves+0.058
pron+0.056
Neg logit contributions
 name-0.038
 wonder-0.034
 comes-0.033
 know-0.033
 names-0.033
Token
Token ID198
Feature activation+3.15e-03

Pos logit contributions
 name+0.019
 wonder+0.017
 lives+0.017
 comes+0.017
 wants+0.016
Neg logit contributions
side-0.054
fs-0.050
but-0.048
 salon-0.048
cakes-0.047
Token
Token ID198
Feature activation+3.59e-03

Pos logit contributions
side+0.009
fs+0.009
 salon+0.008
but+0.008
cakes+0.008
Neg logit contributions
 name-0.003
 comes-0.002
 lives-0.002
 can-0.002
 wonder-0.002
TokenThe
Token ID464
Feature activation+4.02e-02

Pos logit contributions
side+0.007
 salon+0.007
fs+0.006
liv+0.006
brush+0.006
Neg logit contributions
 name-0.006
 can-0.006
 wonder-0.006
 know-0.006
 lives-0.006
Token judge
Token ID5052
Feature activation+1.79e-02

Pos logit contributions
side+0.088
fs+0.080
 salon+0.078
cakes+0.075
but+0.074
Neg logit contributions
 name-0.059
 comes-0.056
 can-0.055
 wonder-0.055
 lives-0.054
Token gave
Token ID2921
Feature activation+1.33e-03

Pos logit contributions
side+0.049
fs+0.046
 salon+0.043
but+0.042
liv+0.042
Neg logit contributions
 name-0.020
 can-0.019
 comes-0.018
 lives-0.018
 know-0.017
Token George
Token ID4502
Feature activation-3.16e-02

Pos logit contributions
side+0.003
fs+0.003
 salon+0.002
cakes+0.002
ized+0.002
Neg logit contributions
 name-0.002
 comes-0.002
 wonder-0.002
 wants-0.002
 know-0.002
Token lots
Token ID6041
Feature activation-3.32e-03

Pos logit contributions
 name+0.033
 can+0.027
 know+0.027
 wonder+0.026
 lives+0.026
Neg logit contributions
side-0.087
fs-0.085
 salon-0.082
ves-0.078
cakes-0.078
Token of
Token ID286
Feature activation+4.79e-03

Pos logit contributions
 can+0.005
 name+0.005
 comes+0.004
 know+0.004
 lives+0.004
Neg logit contributions
side-0.008
 salon-0.007
liv-0.007
fs-0.007
pan-0.007
Token hugs
Token ID40657
Feature activation-3.92e-02

Pos logit contributions
side+0.012
fs+0.011
 salon+0.011
but+0.011
cakes+0.010
Neg logit contributions
 name-0.006
 wonder-0.005
 know-0.005
 can-0.005
 comes-0.005
Token and
Token ID290
Feature activation-9.21e-03

Pos logit contributions
 name+0.027
 can+0.025
 comes+0.023
 wonder+0.018
 names+0.018
Neg logit contributions
side-0.119
fs-0.118
 salon-0.117
liv-0.113
ves-0.111
Token said
Token ID531
Feature activation+1.17e-02

Pos logit contributions
 name+0.012
 know+0.012
 comes+0.011
 wonder+0.011
 can+0.011
Neg logit contributions
side-0.024
 salon-0.022
fs-0.022
but-0.021
ized-0.020
Token like
Token ID588
Feature activation-4.26e-03

Pos logit contributions
 can+0.092
 name+0.088
 comes+0.083
 is+0.073
 lives+0.071
Neg logit contributions
side-0.204
liv-0.195
fs-0.188
 salon-0.188
inge-0.184
Token Jack
Token ID3619
Feature activation+1.50e-02

Pos logit contributions
 name+0.005
 can+0.004
 lives+0.004
 comes+0.004
 names+0.004
Neg logit contributions
side-0.012
fs-0.011
but-0.010
 salon-0.010
liv-0.010
Token.
Token ID13
Feature activation+3.76e-04

Pos logit contributions
side+0.041
fs+0.038
 salon+0.036
liv+0.036
but+0.036
Neg logit contributions
 name-0.017
 can-0.016
 wants-0.016
 comes-0.016
 lives-0.015
Token They
Token ID1119
Feature activation-1.33e-02

Pos logit contributions
side+0.001
fs+0.001
but+0.001
cakes+0.001
cake+0.001
Neg logit contributions
 name-0.000
 lives-0.000
 comes-0.000
 wonder-0.000
 craw-0.000
Token have
Token ID423
Feature activation-2.28e-02

Pos logit contributions
 name+0.011
 can+0.009
 comes+0.009
 wonder+0.009
 know+0.009
Neg logit contributions
side-0.039
fs-0.037
 salon-0.036
but-0.035
cakes-0.035
Token names
Token ID3891
Feature activation-7.81e-02

Pos logit contributions
 name+0.028
 can+0.026
 lives+0.025
 comes+0.025
 know+0.025
Neg logit contributions
side-0.060
fs-0.057
but-0.054
 salon-0.054
cakes-0.054
Token like
Token ID588
Feature activation-5.17e-03

Pos logit contributions
 can+0.080
 name+0.073
 comes+0.071
 is+0.061
 lives+0.060
Neg logit contributions
side-0.223
liv-0.213
 salon-0.208
fs-0.205
inge-0.201
Token Black
Token ID2619
Feature activation-6.24e-02

Pos logit contributions
 name+0.006
 can+0.006
 lives+0.005
 comes+0.005
 names+0.005
Neg logit contributions
side-0.014
fs-0.013
 salon-0.012
liv-0.012
but-0.012
Tokenbeard
Token ID39433
Feature activation-4.64e-02

Pos logit contributions
 name+0.095
 wonder+0.088
 know+0.085
 comes+0.082
 wants+0.082
Neg logit contributions
side-0.137
fs-0.129
ves-0.119
 salon-0.119
cakes-0.118
Token or
Token ID393
Feature activation-2.44e-02

Pos logit contributions
 name+0.056
 can+0.051
 comes+0.049
 wants+0.049
 lives+0.049
Neg logit contributions
side-0.120
fs-0.112
 salon-0.108
liv-0.106
but-0.104
Token Hook
Token ID18531
Feature activation-1.84e-02

Pos logit contributions
 name+0.031
 can+0.028
 comes+0.028
 lives+0.027
 know+0.027
Neg logit contributions
side-0.064
fs-0.059
 salon-0.056
liv-0.056
but-0.055
Token was
Token ID373
Feature activation+8.19e-03

Pos logit contributions
 name+0.043
 know+0.037
 comes+0.037
 wants+0.037
 names+0.037
Neg logit contributions
side-0.068
fs-0.061
liv-0.060
 salon-0.060
ves-0.059
Token a
Token ID257
Feature activation+8.73e-03

Pos logit contributions
side+0.023
fs+0.021
 salon+0.021
cakes+0.020
but+0.020
Neg logit contributions
 name-0.009
 wonder-0.007
 can-0.007
 lives-0.007
 know-0.007
Token dog
Token ID3290
Feature activation-1.85e-02

Pos logit contributions
pron+0.019
 salon+0.019
side+0.018
 therapist+0.017
fs+0.017
Neg logit contributions
 comes-0.011
 know-0.011
 names-0.011
 imag-0.010
 wants-0.010
Token.
Token ID13
Feature activation+2.24e-02

Pos logit contributions
 name+0.021
 can+0.019
 lives+0.018
 comes+0.018
 know+0.017
Neg logit contributions
side-0.050
fs-0.047
 salon-0.045
cakes-0.044
liv-0.043
Token His
Token ID2399
Feature activation+5.06e-02

Pos logit contributions
side+0.060
fs+0.053
 salon+0.053
 therapist+0.051
cakes+0.051
Neg logit contributions
 name-0.020
 can-0.020
 comes-0.019
 lives-0.019
 knows-0.019
Token name
Token ID1438
Feature activation-8.01e-02

Pos logit contributions
fs+0.125
side+0.124
liv+0.121
 therapist+0.120
ves+0.119
Neg logit contributions
 comes-0.040
rr-0.031
 wonders-0.030
 imag-0.029
 know-0.029
Token was
Token ID373
Feature activation+7.56e-03

Pos logit contributions
 name+0.092
 wonder+0.078
 know+0.078
 comes+0.078
 can+0.078
Neg logit contributions
side-0.209
fs-0.195
 salon-0.189
cakes-0.182
ves-0.181
Token Tim
Token ID5045
Feature activation+9.56e-03

Pos logit contributions
side+0.017
fs+0.016
liv+0.016
 salon+0.015
ves+0.015
Neg logit contributions
 name-0.010
 wonder-0.010
 comes-0.010
 can-0.009
 wants-0.009
Token.
Token ID13
Feature activation+1.88e-02

Pos logit contributions
side+0.024
fs+0.022
 salon+0.021
but+0.021
ized+0.020
Neg logit contributions
 name-0.012
 comes-0.011
 wonder-0.011
 lives-0.010
 know-0.010
Token One
Token ID1881
Feature activation-4.48e-02

Pos logit contributions
side+0.042
cakes+0.037
fs+0.036
cake+0.036
but+0.035
Neg logit contributions
 name-0.025
 hopes-0.024
 lives-0.024
 knows-0.024
 comes-0.024
Token day
Token ID1110
Feature activation+1.90e-02

Pos logit contributions
 name+0.076
 imag+0.074
 comes+0.073
 hopes+0.073
 wonders+0.072
Neg logit contributions
side-0.082
fs-0.073
ves-0.069
liv-0.068
but-0.067
Token on
Token ID319
Feature activation+2.69e-02

Pos logit contributions
 name+0.029
 can+0.027
 comes+0.027
 wonder+0.026
 know+0.026
Neg logit contributions
side-0.037
fs-0.034
 salon-0.033
ized-0.031
but-0.031
Token the
Token ID262
Feature activation-2.98e-03

Pos logit contributions
side+0.071
fs+0.067
 salon+0.064
cakes+0.062
but+0.062
Neg logit contributions
 name-0.030
 comes-0.026
 can-0.026
 wonder-0.026
 wants-0.025
Token door
Token ID3420
Feature activation+1.98e-02

Pos logit contributions
 name+0.004
 know+0.003
 wonder+0.003
 wants+0.003
 lives+0.003
Neg logit contributions
side-0.007
fs-0.007
 salon-0.007
cakes-0.006
brush-0.006
Token.
Token ID13
Feature activation-1.65e-04

Pos logit contributions
side+0.048
fs+0.045
 salon+0.043
ized+0.042
liv+0.042
Neg logit contributions
 name-0.027
 can-0.024
 comes-0.024
 wonder-0.024
 know-0.024
Token Mom
Token ID11254
Feature activation+1.69e-02

Pos logit contributions
 name+0.000
 comes+0.000
 know+0.000
 lives+0.000
 wants+0.000
Neg logit contributions
side-0.000
 salon-0.000
fs-0.000
but-0.000
cakes-0.000
Token opens
Token ID9808
Feature activation-7.64e-02

Pos logit contributions
side+0.047
 salon+0.046
fs+0.044
cakes+0.044
pan+0.042
Neg logit contributions
 name-0.015
 wonder-0.012
 know-0.012
 names-0.010
 lives-0.010
Token the
Token ID262
Feature activation-3.08e-03

Pos logit contributions
 name+0.075
 wonder+0.061
 comes+0.060
 can+0.060
 lives+0.058
Neg logit contributions
side-0.215
fs-0.206
 salon-0.198
cakes-0.192
but-0.191
Token door
Token ID3420
Feature activation+1.92e-02

Pos logit contributions
 name+0.004
 comes+0.003
 can+0.003
 wonder+0.003
 know+0.003
Neg logit contributions
side-0.008
fs-0.007
 salon-0.007
cakes-0.007
but-0.007
Token and
Token ID290
Feature activation-6.05e-03

Pos logit contributions
side+0.057
fs+0.056
 salon+0.053
liv+0.052
ized+0.051
Neg logit contributions
 name-0.014
 comes-0.013
 can-0.013
 wonder-0.012
 wants-0.011
Token smiles
Token ID21845
Feature activation-2.61e-02

Pos logit contributions
 name+0.006
 know+0.005
 wonder+0.005
rr+0.004
 names+0.004
Neg logit contributions
 salon-0.015
side-0.015
fs-0.015
cakes-0.015
 therapist-0.014
Token.
Token ID13
Feature activation-4.26e-03

Pos logit contributions
 name+0.038
 comes+0.034
 can+0.034
 wonder+0.033
 lives+0.033
Neg logit contributions
side-0.061
fs-0.057
 salon-0.055
cakes-0.052
ves-0.052
Tokenave
Token ID1015
Feature activation+3.48e-02

Pos logit contributions
 name+0.008
 can+0.008
 comes+0.008
 know+0.008
 wants+0.008
Neg logit contributions
side-0.005
fs-0.005
 salon-0.005
but-0.004
cakes-0.004
Token the
Token ID262
Feature activation-3.68e-03

Pos logit contributions
side+0.073
fs+0.069
ized+0.065
 salon+0.064
 therapist+0.063
Neg logit contributions
 name-0.052
 wonder-0.048
 wants-0.047
 lives-0.047
 know-0.046
Token shark
Token ID21027
Feature activation-2.07e-03

Pos logit contributions
 name+0.006
 comes+0.005
 know+0.005
 lives+0.005
 can+0.005
Neg logit contributions
side-0.007
fs-0.007
 therapist-0.007
 salon-0.006
ves-0.006
Token and
Token ID290
Feature activation-1.98e-03

Pos logit contributions
 name+0.002
 comes+0.002
 can+0.002
 wonder+0.002
 lives+0.002
Neg logit contributions
side-0.006
fs-0.005
 salon-0.005
but-0.005
cakes-0.005
Token they
Token ID484
Feature activation-3.25e-03

Pos logit contributions
 name+0.003
 comes+0.003
 can+0.003
 know+0.003
 wonder+0.003
Neg logit contributions
side-0.004
fs-0.004
 salon-0.004
cakes-0.003
but-0.003
Token sw
Token ID1509
Feature activation-7.48e-02

Pos logit contributions
 name+0.005
 comes+0.004
 wants+0.004
 lives+0.004
 wonder+0.004
Neg logit contributions
side-0.008
fs-0.007
 salon-0.007
ized-0.006
cakes-0.006
Tokenam
Token ID321
Feature activation-3.88e-02

Pos logit contributions
 wonder+0.122
 can+0.122
 name+0.119
 know+0.118
 Don+0.116
Neg logit contributions
side-0.142
ves-0.133
ized-0.133
fs-0.132
uddled-0.130
Token together
Token ID1978
Feature activation+6.09e-03

Pos logit contributions
 name+0.074
 comes+0.070
 lives+0.069
 wants+0.069
 can+0.068
Neg logit contributions
side-0.070
fs-0.060
but-0.057
ized-0.056
ves-0.055
Token to
Token ID284
Feature activation+3.46e-03

Pos logit contributions
side+0.015
fs+0.015
 salon+0.015
liv+0.014
cakes+0.014
Neg logit contributions
 name-0.008
 can-0.007
 wonder-0.006
 know-0.006
 comes-0.006
Token find
Token ID1064
Feature activation-3.66e-02

Pos logit contributions
side+0.007
 salon+0.006
ized+0.006
ves+0.006
handed+0.006
Neg logit contributions
 comes-0.004
 wants-0.004
 name-0.004
 can-0.004
rr-0.004
Token Lucky
Token ID27963
Feature activation+2.25e-02

Pos logit contributions
 name+0.056
 wants+0.053
 comes+0.050
 lives+0.049
 know+0.049
Neg logit contributions
side-0.075
fs-0.070
ized-0.066
ves-0.065
 therapist-0.063
Token day
Token ID1110
Feature activation+6.74e-03

Pos logit contributions
side+0.007
fs+0.007
 salon+0.007
cakes+0.006
ized+0.006
Neg logit contributions
 name-0.004
 can-0.003
 comes-0.003
 wonder-0.003
 lives-0.003
Token on
Token ID319
Feature activation+1.62e-02

Pos logit contributions
side+0.012
fs+0.011
 salon+0.010
but+0.010
cakes+0.010
Neg logit contributions
 name-0.013
 wonder-0.012
 comes-0.012
 can-0.012
 lives-0.012
Token,
Token ID11
Feature activation-1.32e-03

Pos logit contributions
side+0.051
fs+0.049
but+0.047
wheel+0.047
 salon+0.047
Neg logit contributions
 can-0.010
 name-0.009
 comes-0.009
 is-0.008
 lives-0.007
Token Lily
Token ID20037
Feature activation-6.17e-03

Pos logit contributions
 name+0.001
 can+0.001
 comes+0.001
 lives+0.001
 wonder+0.001
Neg logit contributions
side-0.004
fs-0.004
ves-0.004
liv-0.004
 salon-0.004
Token made
Token ID925
Feature activation+6.34e-03

Pos logit contributions
 name+0.008
 wonder+0.007
 know+0.007
 lives+0.007
 comes+0.007
Neg logit contributions
side-0.015
fs-0.013
 salon-0.013
but-0.013
 therapist-0.012
Token sure
Token ID1654
Feature activation-7.25e-02

Pos logit contributions
side+0.018
fs+0.017
 salon+0.017
cakes+0.016
but+0.016
Neg logit contributions
 name-0.006
 know-0.005
 comes-0.005
 wonder-0.005
 can-0.005
Token to
Token ID284
Feature activation-2.80e-03

Pos logit contributions
 can+0.040
 comes+0.035
 name+0.034
 know+0.033
 lives+0.029
Neg logit contributions
fs-0.240
side-0.236
pron-0.218
quet-0.216
bridge-0.216
Token always
Token ID1464
Feature activation-2.41e-02

Pos logit contributions
 name+0.003
 know+0.002
 wonder+0.002
 can+0.002
 comes+0.002
Neg logit contributions
side-0.008
fs-0.007
 salon-0.007
cakes-0.007
but-0.007
Token bring
Token ID2222
Feature activation+2.68e-02

Pos logit contributions
 name+0.020
 can+0.016
 comes+0.016
 wonder+0.016
 know+0.016
Neg logit contributions
side-0.072
fs-0.067
 salon-0.065
cakes-0.063
but-0.063
Token her
Token ID607
Feature activation-1.54e-02

Pos logit contributions
side+0.064
fs+0.058
 salon+0.058
cakes+0.054
but+0.054
Neg logit contributions
 name-0.041
 wonder-0.036
 can-0.036
 know-0.036
 comes-0.035
Token glove
Token ID29144
Feature activation+3.28e-02

Pos logit contributions
 name+0.027
 can+0.024
 wonder+0.023
 wants+0.023
 lives+0.023
Neg logit contributions
side-0.034
fs-0.031
 salon-0.030
cakes-0.029
cake-0.028