TOP ACTIVATIONS
MAX = 4.515

" Little bird , why don 't you take a break
small flame . " Why don 't you stay and keep
it teased , " Why don 't you chop the tree
" Why don 't you try that again
the sky , " Why don 't you join us on
's friend laid down on Velvet 's soft , fluffy fur
the zip , " Why don 't you rest on my
a voice , " Why don 't you jump over it
. " Why aren 't you moving like I
and spoke . " Why don 't you strike the butterfly
voice said , " Why don 't you just remain here
was happening . " Why don 't you all share the
voice said , " Why don 't you move around it
Hey Jay , why don ’ t you
at him . ' Why don 't you try to mark
It 's okay . Why don 't you try looking at
mom says . " He doesn 't understand what is happening
. " Why don 't you add some paint
. " Why don 't you push with your
re hungry , but why don ’ t you

MIN ACTIVATIONS
MIN = -4.522

He begged and begged his mom to buy it for him
. K iki begged her mom to let her finish it
shop . She begged her mom to buy it for her
wanted . He begged his mom to buy it for him
that she could make his kingdom happy again . So the
so excited and begged her mom my to let her pet
, so he begged his mom to buy it for him
ever . They begged their mom to change her mind .
He begged and begged his mom to let him in ,
it . She begged her mom to buy it . Mom
the puppy and begged her mom my to take it home
. The bridge made their town even more special . <|endoftext|>
pasta . She begged her mom my , but her mom
like your house , your street , your city , or
are both going to your rooms . No more playing today
. They will make our room more beautiful ," Mom said
the store and begged her mom to buy it for her
badly . She begged her mom over and over again ,
is from a long time ago , from when dinosaurs lived
. Tim my begged his parents to go inside . They

INTERVAL 2.256 - 4.515
CONTAINS 0.688%

wanted to go on a wild adventure . So he asked
. You did the right thing . You showed the owl
She hears a g iggle from above . She looks up
him that even though Tweet wasn 't there , they could
Suddenly , he heard a strange sound coming from the sky

INTERVAL -0.003 - 2.256
CONTAINS 49.565%

and wanted to taste it . But the bird knew it
and slide . They see a big plane in the sky
was a little girl named Lily . She loved to play
need to attach you to my truck and take you away
name is Thunder . I ’ m new in

INTERVAL -2.263 - -0.003
CONTAINS 49.348%

<|endoftext|> Once upon a time there was a tiny pink boat
boy named Tim my . Tim my was very lonely because
to . He asked again and finally , the boss y
powerful the stick had been , and how it had made
This is my family . This is my mom , my

INTERVAL -4.522 - -2.263
CONTAINS 0.398%

. But you need <|endoftext|> " Mom my , what 's
fun . He asked his dad if he could have a
seen her appear in the market . She looked fit and
, Tim my told his mom my what he did and
the little girl asked her mom for a violin , but
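
The four INTERVAL ranges above look consistent with splitting the span between MIN and MAX into four equal-width bins. That is an inference from the printed numbers, not something the dashboard states. The sketch below is only an illustration under that assumption; the function name `equal_width_bins` is hypothetical, and the edges it produces can differ from the listed ones in the last decimal place because MIN and MAX are themselves rounded.

```python
def equal_width_bins(lo, hi, n_bins=4):
    """Split [lo, hi] into n_bins equal-width (start, end) intervals.

    Assumption: the dashboard bins activations this way; the source
    only shows the resulting ranges, not how they were computed.
    """
    width = (hi - lo) / n_bins
    edges = [lo + i * width for i in range(n_bins + 1)]
    return [(edges[i], edges[i + 1]) for i in range(n_bins)]


# MIN and MAX as printed in the dump above.
bins = equal_width_bins(-4.522, 4.515)

# Print from the highest bin down, matching the order used in the dump.
for start, end in reversed(bins):
    print(f"INTERVAL {start:.3f} - {end:.3f}")
```

Under this reading, the two outer bins hold the rare strongly positive and strongly negative activations (0.688% and 0.398% of tokens), while the two inner bins together hold almost all of the rest.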
Token"
Token ID1
Feature activation-2.15e-01

Pos logit contributions
 cries+0.012
cha+0.012
 breaths+0.011
lers+0.010
ishly+0.010
Neg logit contributions
 color-0.003
 few-0.003
 Jem-0.002
 passenger-0.002
 builder-0.002
TokenLittle
Token ID22253
Feature activation+4.33e-01

Pos logit contributions
 few+0.005
 color+0.004
 Jem+0.004
 passenger+0.004
"+0.004
Neg logit contributions
 cries-0.006
 breaths-0.006
cha-0.006
lers-0.006
ishly-0.005
Token bird
Token ID6512
Feature activation+1.01e+00

Pos logit contributions
 cries+0.014
 breaths+0.014
lers+0.013
cha+0.012
ses+0.012
Neg logit contributions
 few-0.009
 color-0.008
"-0.008
Su-0.008
 passenger-0.007
Token,
Token ID11
Feature activation+5.56e-01

Pos logit contributions
 cries+0.037
 breaths+0.036
cha+0.034
lers+0.034
 worries+0.031
Neg logit contributions
 few-0.017
 color-0.015
Su-0.015
J-0.014
 Jem-0.014
Token why
Token ID1521
Feature activation+2.36e-01

Pos logit contributions
 cries+0.016
 breaths+0.015
cha+0.014
lers+0.014
ses+0.013
Neg logit contributions
 few-0.014
 color-0.013
Su-0.012
"-0.012
J-0.012
Token don
Token ID836
Feature activation+4.51e+00

Pos logit contributions
 cries+0.006
 breaths+0.006
cha+0.006
lers+0.005
 worries+0.005
Neg logit contributions
 few-0.006
Su-0.006
 color-0.006
J-0.006
"-0.005
Token't
Token ID470
Feature activation+1.88e+00

Pos logit contributions
 breaths+0.162
 cries+0.155
ses+0.147
lers+0.145
cha+0.139
Neg logit contributions
"-0.085
 few-0.079
The-0.067
Su-0.065
 color-0.064
Token you
Token ID345
Feature activation+1.06e+00

Pos logit contributions
 breaths+0.047
 cries+0.046
lers+0.041
cha+0.041
ses+0.038
Neg logit contributions
 few-0.053
 color-0.050
"-0.049
Su-0.048
 Jem-0.046
Token take
Token ID1011
Feature activation+8.02e-01

Pos logit contributions
 breaths+0.035
 cries+0.035
lers+0.032
cha+0.032
ses+0.030
Neg logit contributions
 few-0.021
 color-0.020
"-0.019
Su-0.018
 passenger-0.017
Token a
Token ID257
Feature activation+1.12e+00

Pos logit contributions
 cries+0.031
 breaths+0.030
cha+0.029
lers+0.028
ses+0.027
Neg logit contributions
 few-0.012
 color-0.010
"-0.009
Su-0.008
 true-0.008
Token break
Token ID2270
Feature activation-1.28e+00

Pos logit contributions
 cries+0.036
 breaths+0.035
lers+0.032
cha+0.031
 worries+0.030
Neg logit contributions
Su-0.022
 few-0.021
J-0.021
 Jem-0.021
Grand-0.021
Token small
Token ID1402
Feature activation-9.83e-01

Pos logit contributions
 few+0.014
 color+0.013
"+0.013
 passenger+0.012
 sunny+0.012
Neg logit contributions
 breaths-0.019
 cries-0.018
cha-0.018
lers-0.017
ishly-0.016
Token flame
Token ID16506
Feature activation+6.03e-01

Pos logit contributions
 few+0.026
 color+0.024
Su+0.024
"+0.024
J+0.024
Neg logit contributions
 cries-0.026
 breaths-0.025
lers-0.022
cha-0.022
ses-0.020
Token.
Token ID13
Feature activation+5.36e-01

Pos logit contributions
 cries+0.018
 breaths+0.018
cha+0.016
lers+0.016
ses+0.015
Neg logit contributions
 few-0.014
 color-0.013
"-0.012
Su-0.012
 passenger-0.012
Token "
Token ID366
Feature activation+3.81e-01

Pos logit contributions
 cries+0.021
 breaths+0.021
cha+0.020
lers+0.020
 worries+0.018
Neg logit contributions
 few-0.007
 color-0.006
Su-0.006
"-0.006
J-0.005
TokenWhy
Token ID5195
Feature activation+3.98e-01

Pos logit contributions
 cries+0.014
 breaths+0.014
lers+0.013
cha+0.013
ses+0.012
Neg logit contributions
 few-0.006
 color-0.005
Su-0.005
"-0.005
J-0.005
Token don
Token ID836
Feature activation+4.40e+00

Pos logit contributions
 cries+0.011
 breaths+0.010
cha+0.009
lers+0.009
 worries+0.008
Neg logit contributions
 few-0.011
Su-0.010
 color-0.010
J-0.010
 passenger-0.009
Token't
Token ID470
Feature activation+2.32e+00

Pos logit contributions
 breaths+0.151
 cries+0.143
lers+0.133
ses+0.132
cha+0.130
Neg logit contributions
"-0.087
 few-0.085
Su-0.074
 color-0.073
The-0.073
Token you
Token ID345
Feature activation+1.18e+00

Pos logit contributions
 breaths+0.067
 cries+0.062
lers+0.059
ses+0.057
cha+0.055
Neg logit contributions
 few-0.058
 color-0.055
"-0.055
Su-0.051
 true-0.050
Token stay
Token ID2652
Feature activation-3.55e-01

Pos logit contributions
 breaths+0.041
 cries+0.041
lers+0.038
cha+0.038
ses+0.035
Neg logit contributions
 few-0.021
 color-0.020
"-0.020
Su-0.018
 passenger-0.017
Token and
Token ID290
Feature activation+3.53e-01

Pos logit contributions
 few+0.007
Su+0.007
J+0.007
 Jem+0.007
Kim+0.007
Neg logit contributions
 breaths-0.011
 cries-0.011
lers-0.010
cha-0.010
 worries-0.009
Token keep
Token ID1394
Feature activation+1.53e+00

Pos logit contributions
 breaths+0.012
 cries+0.012
lers+0.011
cha+0.011
ses+0.010
Neg logit contributions
 few-0.006
 color-0.006
"-0.006
 sunny-0.006
 tidy-0.005
Token it
Token ID340
Feature activation+3.97e-01

Pos logit contributions
 cries+0.038
 breaths+0.038
cha+0.034
lers+0.034
 worries+0.031
Neg logit contributions
 few-0.026
 color-0.023
Su-0.023
 passenger-0.023
"-0.022
Token teased
Token ID35380
Feature activation+3.45e-01

Pos logit contributions
 breaths+0.013
 cries+0.013
cha+0.012
lers+0.012
ses+0.011
Neg logit contributions
 few-0.008
 color-0.007
"-0.007
Su-0.007
J-0.007
Token,
Token ID11
Feature activation+5.76e-01

Pos logit contributions
 breaths+0.009
 cries+0.008
lers+0.007
cha+0.007
ses+0.007
Neg logit contributions
 few-0.010
 color-0.009
"-0.009
Su-0.009
J-0.009
Token "
Token ID366
Feature activation+5.88e-02

Pos logit contributions
 cries+0.020
 breaths+0.019
cha+0.018
lers+0.017
ishly+0.016
Neg logit contributions
 few-0.011
 color-0.010
Su-0.009
J-0.009
 passenger-0.009
TokenWhy
Token ID5195
Feature activation+2.67e-01

Pos logit contributions
 cries+0.002
cha+0.002
 breaths+0.002
lers+0.002
ted+0.001
Neg logit contributions
 few-0.001
 color-0.001
 Jem-0.001
 builder-0.001
 passenger-0.001
Token don
Token ID836
Feature activation+4.38e+00

Pos logit contributions
 cries+0.006
 breaths+0.005
 doesn+0.005
cha+0.005
 worries+0.004
Neg logit contributions
 few-0.008
Su-0.008
Grand-0.008
Sus-0.008
 passenger-0.008
Token't
Token ID470
Feature activation+1.87e+00

Pos logit contributions
 breaths+0.132
 cries+0.130
lers+0.117
cha+0.116
ses+0.110
Neg logit contributions
 few-0.101
"-0.095
 color-0.091
Su-0.090
Grand-0.085
Token you
Token ID345
Feature activation+1.31e+00

Pos logit contributions
 cries+0.043
 breaths+0.042
cha+0.037
lers+0.036
 worries+0.031
Neg logit contributions
 few-0.057
 color-0.053
Su-0.052
"-0.051
 passenger-0.051
Token chop
Token ID30506
Feature activation-2.43e-01

Pos logit contributions
 cries+0.041
 breaths+0.040
cha+0.037
lers+0.036
 worries+0.033
Neg logit contributions
 few-0.029
Su-0.026
 color-0.025
J-0.025
 passenger-0.025
Token the
Token ID262
Feature activation+4.53e-01

Pos logit contributions
 few+0.005
Su+0.005
J+0.005
 color+0.004
 passenger+0.004
Neg logit contributions
 cries-0.008
 breaths-0.008
lers-0.007
cha-0.007
 worries-0.006
Token tree
Token ID5509
Feature activation-1.27e+00

Pos logit contributions
 breaths+0.013
 cries+0.013
cha+0.011
lers+0.011
ishly+0.010
Neg logit contributions
 few-0.011
 color-0.011
"-0.010
Su-0.010
 passenger-0.010
Token
Token ID220
Feature activation+4.23e-01

Pos logit contributions
 cries+0.008
 breaths+0.008
cha+0.007
lers+0.007
ses+0.007
Neg logit contributions
 few-0.006
 color-0.005
Su-0.005
"-0.005
J-0.005
Token
Token ID198
Feature activation+2.09e-01

Pos logit contributions
 breaths+0.016
 cries+0.015
lers+0.015
cha+0.015
 worries+0.014
Neg logit contributions
"-0.007
 few-0.006
Su-0.006
 Jem-0.006
S-0.006
Token
Token ID198
Feature activation+2.70e-01

Pos logit contributions
 cries+0.007
cha+0.007
 breaths+0.006
lers+0.006
 screams+0.006
Neg logit contributions
 few-0.004
 passenger-0.004
 color-0.004
 builder-0.003
 customer-0.003
Token"
Token ID1
Feature activation-2.76e-01

Pos logit contributions
 cries+0.011
cha+0.011
 breaths+0.010
lers+0.010
ishly+0.009
Neg logit contributions
 passenger-0.003
 color-0.003
 few-0.002
 builder-0.002
 ten-0.002
TokenWhy
Token ID5195
Feature activation+3.53e-01

Pos logit contributions
 few+0.007
 color+0.006
 passenger+0.006
 builder+0.006
"+0.006
Neg logit contributions
 cries-0.007
cha-0.007
 breaths-0.007
lers-0.006
ishly-0.006
Token don
Token ID836
Feature activation+4.37e+00

Pos logit contributions
 cries+0.009
 breaths+0.008
cha+0.008
lers+0.008
 worries+0.007
Neg logit contributions
 few-0.010
Su-0.009
 passenger-0.009
Grand-0.009
J-0.009
Token't
Token ID470
Feature activation+1.81e+00

Pos logit contributions
 breaths+0.145
 cries+0.140
lers+0.129
ses+0.126
cha+0.125
Neg logit contributions
"-0.089
 few-0.089
Su-0.078
 color-0.077
Grand-0.074
Token you
Token ID345
Feature activation+9.42e-01

Pos logit contributions
 breaths+0.047
 cries+0.046
lers+0.042
cha+0.040
ses+0.039
Neg logit contributions
 few-0.049
 color-0.047
"-0.047
Su-0.045
 Jem-0.043
Token try
Token ID1949
Feature activation-4.21e-01

Pos logit contributions
 cries+0.030
 breaths+0.030
cha+0.027
lers+0.027
 worries+0.025
Neg logit contributions
 few-0.019
 color-0.018
Su-0.017
"-0.017
J-0.017
Token that
Token ID326
Feature activation+3.98e-01

Pos logit contributions
 few+0.008
 passenger+0.007
Su+0.007
J+0.007
 Jem+0.007
Neg logit contributions
 cries-0.014
 breaths-0.014
lers-0.013
cha-0.013
 worries-0.012
Token again
Token ID757
Feature activation-1.95e+00

Pos logit contributions
 cries+0.012
 breaths+0.011
cha+0.010
lers+0.010
 worries+0.009
Neg logit contributions
J-0.010
Grand-0.009
 few-0.009
Back-0.009
 Jem-0.009
Token the
Token ID262
Feature activation+3.92e-01

Pos logit contributions
 cries+0.096
lers+0.094
 breaths+0.091
cha+0.090
ience+0.087
Neg logit contributions
 few-0.036
 sunny-0.034
 tall-0.032
 color-0.032
 five-0.032
Token sky
Token ID6766
Feature activation+2.77e-01

Pos logit contributions
 cries+0.013
 breaths+0.013
cha+0.012
lers+0.012
ses+0.011
Neg logit contributions
 few-0.008
 color-0.007
Su-0.007
"-0.007
 passenger-0.006
Token,
Token ID11
Feature activation+6.76e-02

Pos logit contributions
 cries+0.008
 breaths+0.008
cha+0.007
lers+0.007
ses+0.007
Neg logit contributions
 few-0.006
 color-0.006
"-0.006
Su-0.006
 passenger-0.006
Token "
Token ID366
Feature activation+1.57e-01

Pos logit contributions
 breaths+0.002
 cries+0.002
cha+0.002
lers+0.002
ses+0.002
Neg logit contributions
 few-0.001
 color-0.001
"-0.001
Su-0.001
 passenger-0.001
TokenWhy
Token ID5195
Feature activation+2.80e-01

Pos logit contributions
 cries+0.005
 breaths+0.004
cha+0.004
lers+0.004
 worries+0.004
Neg logit contributions
 few-0.004
 color-0.003
 Jem-0.003
 passenger-0.003
 builder-0.003
Token don
Token ID836
Feature activation+4.32e+00

Pos logit contributions
 cries+0.007
 breaths+0.006
cha+0.006
lers+0.006
 worries+0.005
Neg logit contributions
 few-0.008
Su-0.008
Grand-0.007
J-0.007
Sus-0.007
Token't
Token ID470
Feature activation+1.71e+00

Pos logit contributions
 breaths+0.144
 cries+0.139
lers+0.129
ses+0.127
cha+0.125
Neg logit contributions
"-0.088
 few-0.087
 color-0.075
Su-0.074
The-0.074
Token you
Token ID345
Feature activation+9.09e-01

Pos logit contributions
 breaths+0.049
 cries+0.047
lers+0.044
cha+0.041
ses+0.041
Neg logit contributions
 few-0.042
 color-0.041
"-0.040
Su-0.038
 Jem-0.037
Token join
Token ID4654
Feature activation+4.67e-01

Pos logit contributions
 cries+0.030
 breaths+0.030
cha+0.028
lers+0.028
ses+0.026
Neg logit contributions
 few-0.017
 color-0.016
"-0.015
Su-0.015
 passenger-0.015
Token us
Token ID514
Feature activation+4.99e-03

Pos logit contributions
 cries+0.015
 breaths+0.015
cha+0.013
lers+0.013
ses+0.012
Neg logit contributions
 few-0.010
 color-0.009
Su-0.009
"-0.009
J-0.009
Token on
Token ID319
Feature activation-8.82e-01

Pos logit contributions
 cries+0.000
 breaths+0.000
lers+0.000
cha+0.000
 worries+0.000
Neg logit contributions
Su-0.000
J-0.000
Kim-0.000
 few-0.000
Jen-0.000
Token's
Token ID338
Feature activation-3.66e-01

Pos logit contributions
 cries+0.034
 breaths+0.034
cha+0.031
lers+0.030
ses+0.028
Neg logit contributions
 few-0.023
 color-0.021
"-0.021
Su-0.021
J-0.020
Token friend
Token ID1545
Feature activation-5.63e-01

Pos logit contributions
 few+0.007
 color+0.007
"+0.006
 passenger+0.006
 sunny+0.006
Neg logit contributions
 breaths-0.012
 cries-0.012
cha-0.011
lers-0.011
ishly-0.010
Token laid
Token ID8104
Feature activation+6.93e-01

Pos logit contributions
Su+0.011
 few+0.011
 color+0.011
Grand+0.010
Back+0.010
Neg logit contributions
 cries-0.019
 breaths-0.018
cha-0.016
lers-0.016
ishly-0.015
Token down
Token ID866
Feature activation-1.15e+00

Pos logit contributions
 cries+0.023
 breaths+0.023
cha+0.021
lers+0.021
ses+0.019
Neg logit contributions
 few-0.014
 color-0.012
Su-0.012
"-0.012
 passenger-0.011
Token on
Token ID319
Feature activation+3.67e-02

Pos logit contributions
Su+0.028
 few+0.027
 color+0.026
 passenger+0.026
 Jem+0.026
Neg logit contributions
 breaths-0.032
 cries-0.031
cha-0.027
lers-0.027
 deeply-0.026
Token Velvet
Token ID36271
Feature activation+4.31e+00

Pos logit contributions
 cries+0.001
 breaths+0.001
cha+0.001
lers+0.001
 worries+0.001
Neg logit contributions
 few-0.001
Su-0.001
 color-0.001
"-0.000
 passenger-0.000
Token's
Token ID338
Feature activation+1.32e+00

Pos logit contributions
 breaths+0.160
cha+0.154
 cries+0.153
ses+0.147
 worries+0.140
Neg logit contributions
 few-0.065
"-0.061
 color-0.059
 sunny-0.045
 tidy-0.044
Token soft
Token ID2705
Feature activation+1.59e+00

Pos logit contributions
 breaths+0.036
 cries+0.036
cha+0.033
lers+0.032
 worries+0.029
Neg logit contributions
 few-0.033
 color-0.032
"-0.030
Su-0.029
 passenger-0.029
Token,
Token ID11
Feature activation-1.11e-01

Pos logit contributions
 cries+0.040
 breaths+0.039
lers+0.034
cha+0.034
ses+0.030
Neg logit contributions
 few-0.045
Su-0.042
 color-0.041
J-0.041
"-0.041
Token fluffy
Token ID39145
Feature activation+2.03e+00

Pos logit contributions
 few+0.002
Su+0.002
J+0.002
"+0.002
 color+0.002
Neg logit contributions
 cries-0.004
 breaths-0.004
lers-0.003
cha-0.003
 worries-0.003
Token fur
Token ID9230
Feature activation+1.17e+00

Pos logit contributions
 cries+0.049
 breaths+0.047
cha+0.044
lers+0.042
ses+0.038
Neg logit contributions
 few-0.059
 color-0.055
"-0.053
Su-0.051
 sunny-0.050
Token the
Token ID262
Feature activation-6.58e-01

Pos logit contributions
 breaths+0.039
 cries+0.039
lers+0.038
cha+0.037
 worries+0.034
Neg logit contributions
 few-0.008
 color-0.007
"-0.006
Su-0.005
 ten-0.005
Token zip
Token ID19974
Feature activation-4.65e-01

Pos logit contributions
 few+0.013
 color+0.013
"+0.011
 sunny+0.011
 passenger+0.011
Neg logit contributions
 breaths-0.022
 cries-0.022
lers-0.020
cha-0.020
ishly-0.018
Token,
Token ID11
Feature activation+8.34e-03

Pos logit contributions
 few+0.010
Grand+0.010
J+0.010
Su+0.009
 Jem+0.009
Neg logit contributions
 cries-0.015
 breaths-0.014
lers-0.013
cha-0.013
ishly-0.012
Token "
Token ID366
Feature activation-1.62e-01

Pos logit contributions
 cries+0.000
 breaths+0.000
cha+0.000
lers+0.000
ishly+0.000
Neg logit contributions
 few-0.000
 color-0.000
Su-0.000
J-0.000
 passenger-0.000
TokenWhy
Token ID5195
Feature activation+1.37e-01

Pos logit contributions
 few+0.003
 color+0.003
 Jem+0.003
 passenger+0.003
Su+0.003
Neg logit contributions
 cries-0.005
 breaths-0.005
cha-0.005
lers-0.004
 worries-0.004
Token don
Token ID836
Feature activation+4.30e+00

Pos logit contributions
 cries+0.003
 breaths+0.003
cha+0.003
lers+0.003
 worries+0.003
Neg logit contributions
 few-0.004
Su-0.004
Grand-0.004
J-0.004
 passenger-0.004
Token't
Token ID470
Feature activation+1.62e+00

Pos logit contributions
 breaths+0.141
 cries+0.136
lers+0.126
ses+0.122
cha+0.122
Neg logit contributions
 few-0.089
"-0.088
Su-0.078
 color-0.078
The-0.075
Token you
Token ID345
Feature activation+1.14e+00

Pos logit contributions
 breaths+0.043
 cries+0.042
lers+0.039
cha+0.037
ses+0.036
Neg logit contributions
 few-0.043
 color-0.041
"-0.040
Su-0.039
 Jem-0.037
Token rest
Token ID1334
Feature activation-2.08e-01

Pos logit contributions
 breaths+0.040
 cries+0.039
lers+0.036
cha+0.036
ses+0.035
Neg logit contributions
 few-0.021
 color-0.020
"-0.019
Su-0.017
 tidy-0.017
Token on
Token ID319
Feature activation-9.78e-01

Pos logit contributions
 few+0.005
Su+0.005
 passenger+0.004
J+0.004
Grand+0.004
Neg logit contributions
 breaths-0.006
 cries-0.006
cha-0.006
lers-0.006
 worries-0.005
Token my
Token ID616
Feature activation+1.75e+00

Pos logit contributions
J+0.017
 few+0.017
 passenger+0.017
Su+0.016
 builder+0.016
Neg logit contributions
 breaths-0.034
 cries-0.033
cha-0.030
lers-0.030
 worries-0.028
Token a
Token ID257
Feature activation+1.11e+00

Pos logit contributions
 few+0.002
Su+0.002
 color+0.002
 passenger+0.002
"+0.002
Neg logit contributions
 cries-0.004
 breaths-0.004
cha-0.004
lers-0.004
 worries-0.003
Token voice
Token ID3809
Feature activation+1.76e+00

Pos logit contributions
 cries+0.034
 breaths+0.032
cha+0.030
lers+0.029
 screams+0.026
Neg logit contributions
 few-0.024
 color-0.023
 Jem-0.022
Su-0.022
J-0.022
Token,
Token ID11
Feature activation-3.97e-01

Pos logit contributions
 breaths+0.059
 cries+0.057
cha+0.054
lers+0.053
ses+0.053
Neg logit contributions
 few-0.032
"-0.030
 color-0.030
Su-0.030
 Su-0.027
Token "
Token ID366
Feature activation-2.06e-01

Pos logit contributions
 few+0.007
"+0.007
Su+0.006
 color+0.006
Grand+0.006
Neg logit contributions
 breaths-0.014
 cries-0.013
lers-0.012
cha-0.012
 worries-0.011
TokenWhy
Token ID5195
Feature activation+9.09e-02

Pos logit contributions
 few+0.005
 color+0.005
 builder+0.004
 Jem+0.004
 passenger+0.004
Neg logit contributions
 cries-0.006
cha-0.005
 breaths-0.005
lers-0.005
ted-0.004
Token don
Token ID836
Feature activation+4.29e+00

Pos logit contributions
 cries+0.002
 breaths+0.002
cha+0.002
lers+0.002
 worries+0.002
Neg logit contributions
 few-0.003
Su-0.003
Sus-0.003
Grand-0.003
Back-0.003
Token't
Token ID470
Feature activation+1.62e+00

Pos logit contributions
 breaths+0.134
 cries+0.131
lers+0.120
cha+0.118
ses+0.114
Neg logit contributions
 few-0.094
"-0.091
 color-0.084
Su-0.083
Grand-0.080
Token you
Token ID345
Feature activation+1.08e+00

Pos logit contributions
 cries+0.040
 breaths+0.039
cha+0.034
lers+0.034
 worries+0.030
Neg logit contributions
 few-0.046
 color-0.043
Su-0.043
"-0.042
 passenger-0.041
Token jump
Token ID4391
Feature activation-1.73e+00

Pos logit contributions
 cries+0.033
 breaths+0.031
cha+0.029
lers+0.028
 worries+0.027
Neg logit contributions
 few-0.024
J-0.022
Su-0.022
Grand-0.021
S-0.021
Token over
Token ID625
Feature activation-2.08e+00

Pos logit contributions
Su+0.041
J+0.040
 passenger+0.040
 few+0.040
 builder+0.039
Neg logit contributions
 cries-0.048
 breaths-0.047
cha-0.044
lers-0.044
 webs-0.039
Token it
Token ID340
Feature activation-1.13e-01

Pos logit contributions
 passenger+0.044
 few+0.043
Pe+0.042
Grand+0.042
 builder+0.042
Neg logit contributions
 breaths-0.065
 cries-0.064
cha-0.060
lers-0.057
 worries-0.053
Token.
Token ID13
Feature activation+7.89e-02

Pos logit contributions
 passenger+0.033
 few+0.032
 builder+0.032
Su+0.032
Grand+0.031
Neg logit contributions
 breaths-0.035
 cries-0.035
lers-0.031
cha-0.031
 webs-0.028
Token
Token ID198
Feature activation-2.12e-02

Pos logit contributions
 breaths+0.003
 cries+0.003
lers+0.002
cha+0.002
 worries+0.002
Neg logit contributions
 few-0.002
 color-0.001
Su-0.001
"-0.001
 Jem-0.001
Token
Token ID198
Feature activation-2.34e-01

Pos logit contributions
 few+0.000
"+0.000
Su+0.000
 color+0.000
J+0.000
Neg logit contributions
 breaths-0.001
 cries-0.001
lers-0.001
cha-0.001
ses-0.001
Token"
Token ID1
Feature activation-4.92e-01

Pos logit contributions
 color+0.003
 few+0.003
 passenger+0.003
 Jem+0.003
 builder+0.003
Neg logit contributions
 cries-0.009
cha-0.009
 breaths-0.008
lers-0.008
<-0.008
TokenWhy
Token ID5195
Feature activation-3.95e-02

Pos logit contributions
 few+0.011
 color+0.010
 passenger+0.010
 builder+0.010
"+0.010
Neg logit contributions
 cries-0.014
 breaths-0.014
cha-0.013
lers-0.012
 worries-0.011
Token aren
Token ID3588
Feature activation+4.27e+00

Pos logit contributions
 few+0.001
Su+0.001
 passenger+0.001
J+0.001
Grand+0.001
Neg logit contributions
 cries-0.001
 breaths-0.001
cha-0.001
lers-0.001
 worries-0.001
Token't
Token ID470
Feature activation+1.48e+00

Pos logit contributions
 breaths+0.169
 cries+0.154
lers+0.153
ses+0.151
edly+0.142
Neg logit contributions
"-0.065
 few-0.059
The-0.058
Su-0.053
 true-0.049
Token you
Token ID345
Feature activation+1.19e+00

Pos logit contributions
 cries+0.040
 breaths+0.038
cha+0.035
lers+0.034
 worries+0.031
Neg logit contributions
 few-0.038
J-0.036
 passenger-0.036
Su-0.036
Back-0.034
Token moving
Token ID3867
Feature activation-1.06e+00

Pos logit contributions
 cries+0.032
 breaths+0.030
cha+0.027
lers+0.026
 worries+0.026
Neg logit contributions
J-0.032
Su-0.030
 few-0.030
Sus-0.029
 passenger-0.029
Token like
Token ID588
Feature activation+3.12e-01

Pos logit contributions
Su+0.023
J+0.022
 few+0.022
 passenger+0.022
Grand+0.022
Neg logit contributions
 breaths-0.033
 cries-0.032
lers-0.029
cha-0.029
 worries-0.026
Token I
Token ID314
Feature activation+3.31e-01

Pos logit contributions
 cries+0.008
 breaths+0.008
cha+0.007
lers+0.007
 worries+0.006
Neg logit contributions
 few-0.008
J-0.008
Su-0.008
 passenger-0.008
 color-0.008
Token and
Token ID290
Feature activation+1.18e+00

Pos logit contributions
 breaths+0.069
 cries+0.068
cha+0.066
lers+0.062
ses+0.061
Neg logit contributions
 few-0.032
"-0.030
 color-0.029
Su-0.024
 passenger-0.024
Token spoke
Token ID5158
Feature activation-5.31e-02

Pos logit contributions
 cries+0.033
 breaths+0.032
cha+0.029
lers+0.029
ishly+0.026
Neg logit contributions
 few-0.030
 color-0.028
Su-0.028
 passenger-0.027
J-0.027
Token.
Token ID13
Feature activation-5.42e-01

Pos logit contributions
 few+0.001
 color+0.001
Su+0.001
"+0.001
 passenger+0.001
Neg logit contributions
 cries-0.002
 breaths-0.002
cha-0.001
lers-0.001
ses-0.001
Token "
Token ID366
Feature activation+7.14e-01

Pos logit contributions
 few+0.015
Su+0.014
 passenger+0.014
 color+0.014
S+0.013
Neg logit contributions
 cries-0.014
 breaths-0.013
cha-0.012
lers-0.011
ishly-0.011
TokenWhy
Token ID5195
Feature activation+5.03e-01

Pos logit contributions
 cries+0.022
 breaths+0.022
cha+0.020
lers+0.019
 worries+0.018
Neg logit contributions
 few-0.016
 color-0.015
Su-0.014
"-0.014
 passenger-0.014
Token don
Token ID836
Feature activation+4.23e+00

Pos logit contributions
 cries+0.013
 breaths+0.012
cha+0.011
lers+0.011
 worries+0.010
Neg logit contributions
 few-0.014
Su-0.013
Grand-0.013
 passenger-0.013
J-0.013
Token't
Token ID470
Feature activation+1.70e+00

Pos logit contributions
 breaths+0.143
 cries+0.137
lers+0.126
ses+0.126
cha+0.123
Neg logit contributions
"-0.085
 few-0.084
 color-0.072
Su-0.071
The-0.071
Token you
Token ID345
Feature activation+1.16e+00

Pos logit contributions
 breaths+0.043
 cries+0.042
lers+0.038
cha+0.037
ses+0.035
Neg logit contributions
 few-0.047
 color-0.045
"-0.044
Su-0.043
J-0.041
Token strike
Token ID5587
Feature activation-7.60e-01

Pos logit contributions
 cries+0.038
 breaths+0.038
cha+0.035
lers+0.035
ses+0.032
Neg logit contributions
 few-0.023
 color-0.021
"-0.020
Su-0.020
 passenger-0.019
Token the
Token ID262
Feature activation-4.96e-01

Pos logit contributions
 few+0.012
 color+0.011
Su+0.010
"+0.010
J+0.010
Neg logit contributions
 cries-0.028
 breaths-0.028
cha-0.026
lers-0.026
ses-0.024
Token butterfly
Token ID35113
Feature activation+8.38e-01

Pos logit contributions
 few+0.009
 color+0.008
"+0.007
 sunny+0.007
 blue+0.007
Neg logit contributions
 breaths-0.018
 cries-0.018
lers-0.017
cha-0.016
ishly-0.016
Token voice
Token ID3809
Feature activation+5.95e-01

Pos logit contributions
 cries+0.030
 breaths+0.029
cha+0.026
lers+0.026
ses+0.024
Neg logit contributions
 few-0.022
 color-0.020
Su-0.020
"-0.019
J-0.019
Token said
Token ID531
Feature activation+2.18e-01

Pos logit contributions
 breaths+0.020
 cries+0.020
cha+0.018
lers+0.018
ses+0.017
Neg logit contributions
 few-0.011
"-0.010
Su-0.010
 color-0.010
J-0.010
Token,
Token ID11
Feature activation-8.55e-01

Pos logit contributions
 cries+0.008
 breaths+0.008
cha+0.008
lers+0.008
ishly+0.007
Neg logit contributions
 few-0.003
 color-0.003
Su-0.003
 passenger-0.003
J-0.003
Token "
Token ID366
Feature activation-4.31e-01

Pos logit contributions
 few+0.009
 color+0.008
Su+0.007
"+0.007
J+0.007
Neg logit contributions
 cries-0.036
 breaths-0.036
cha-0.033
lers-0.033
ses-0.031
TokenWhy
Token ID5195
Feature activation-1.37e-01

Pos logit contributions
 few+0.010
 color+0.010
 Jem+0.009
 passenger+0.009
 builder+0.009
Neg logit contributions
 cries-0.011
cha-0.011
 breaths-0.011
lers-0.010
ted-0.009
Token don
Token ID836
Feature activation+4.23e+00

Pos logit contributions
 few+0.004
Su+0.004
J+0.004
 passenger+0.004
Grand+0.004
Neg logit contributions
 cries-0.003
 breaths-0.003
cha-0.003
lers-0.003
 worries-0.003
Token't
Token ID470
Feature activation+1.65e+00

Pos logit contributions
 breaths+0.139
 cries+0.134
lers+0.125
ses+0.122
cha+0.121
Neg logit contributions
"-0.087
 few-0.086
 color-0.075
Su-0.074
Grand-0.072
Token you
Token ID345
Feature activation+1.23e+00

Pos logit contributions
 breaths+0.041
 cries+0.041
lers+0.036
cha+0.035
ses+0.033
Neg logit contributions
 few-0.047
 color-0.044
"-0.044
Su-0.043
J-0.041
Token just
Token ID655
Feature activation+5.44e-01

Pos logit contributions
 cries+0.041
 breaths+0.041
cha+0.037
lers+0.037
ses+0.035
Neg logit contributions
 few-0.024
 color-0.022
"-0.022
Su-0.021
 passenger-0.020
Token remain
Token ID3520
Feature activation-9.94e-01

Pos logit contributions
 cries+0.017
 breaths+0.017
cha+0.016
lers+0.015
ses+0.014
Neg logit contributions
 few-0.012
 color-0.011
"-0.010
Su-0.010
 passenger-0.010
Token here
Token ID994
Feature activation-1.73e+00

Pos logit contributions
Su+0.022
 few+0.021
J+0.020
Be+0.019
Kim+0.019
Neg logit contributions
 breaths-0.032
 cries-0.030
lers-0.028
cha-0.026
 worries-0.025
Token was
Token ID373
Feature activation+1.59e+00

Pos logit contributions
 cries+0.011
lers+0.009
 breaths+0.009
cha+0.009
 worries+0.008
Neg logit contributions
 few-0.015
 passenger-0.014
Su-0.014
Grand-0.014
S-0.014
Token happening
Token ID5836
Feature activation-2.55e-03

Pos logit contributions
 cries+0.041
 breaths+0.039
cha+0.036
lers+0.036
ses+0.031
Neg logit contributions
 few-0.043
Su-0.041
J-0.040
 color-0.040
S-0.039
Token.
Token ID13
Feature activation+2.78e-02

Pos logit contributions
 few+0.000
 color+0.000
Su+0.000
 passenger+0.000
J+0.000
Neg logit contributions
 cries-0.000
 breaths-0.000
cha-0.000
lers-0.000
ishly-0.000
Token "
Token ID366
Feature activation+9.05e-01

Pos logit contributions
 cries+0.001
 breaths+0.001
cha+0.001
lers+0.001
ishly+0.001
Neg logit contributions
 few-0.001
 color-0.000
 passenger-0.000
J-0.000
Su-0.000
TokenWhy
Token ID5195
Feature activation+1.43e-01

Pos logit contributions
 cries+0.026
 breaths+0.025
cha+0.024
lers+0.023
ses+0.021
Neg logit contributions
 few-0.021
 color-0.020
Su-0.019
"-0.018
 passenger-0.018
Token don
Token ID836
Feature activation+4.21e+00

Pos logit contributions
 cries+0.003
 breaths+0.003
cha+0.003
 doesn+0.003
 worries+0.002
Neg logit contributions
 few-0.004
Su-0.004
Grand-0.004
Sus-0.004
J-0.004
Token't
Token ID470
Feature activation+1.57e+00

Pos logit contributions
 breaths+0.139
 cries+0.133
lers+0.123
cha+0.120
ses+0.120
Neg logit contributions
"-0.086
 few-0.086
 color-0.076
Su-0.075
The-0.072
Token you
Token ID345
Feature activation+4.81e-01

Pos logit contributions
 cries+0.038
 breaths+0.038
cha+0.033
lers+0.033
ses+0.029
Neg logit contributions
 few-0.045
 color-0.042
Su-0.042
"-0.041
J-0.040
Token all
Token ID477
Feature activation-3.00e-01

Pos logit contributions
 cries+0.015
 breaths+0.015
cha+0.013
lers+0.013
 worries+0.012
Neg logit contributions
 few-0.011
Su-0.010
 color-0.009
J-0.009
"-0.009
Token share
Token ID2648
Feature activation-1.80e+00

Pos logit contributions
 few+0.007
Su+0.006
J+0.006
 color+0.006
Grand+0.006
Neg logit contributions
 cries-0.009
 breaths-0.009
lers-0.008
cha-0.008
 worries-0.007
Token the
Token ID262
Feature activation-9.60e-01

Pos logit contributions
Su+0.038
 passenger+0.036
 few+0.036
 builder+0.035
J+0.035
Neg logit contributions
 cries-0.056
 breaths-0.053
lers-0.051
cha-0.049
 doesn-0.046
Token voice
Token ID3809
Feature activation+4.03e-01

Pos logit contributions
 few+0.004
Su+0.004
Back+0.004
 color+0.004
J+0.004
Neg logit contributions
 cries-0.008
 breaths-0.008
cha-0.007
lers-0.007
ses-0.006
Token said
Token ID531
Feature activation-2.64e-01

Pos logit contributions
 cries+0.013
 breaths+0.013
cha+0.011
lers+0.011
ses+0.011
Neg logit contributions
 few-0.008
 color-0.008
Su-0.008
"-0.008
J-0.007
Token,
Token ID11
Feature activation-1.02e+00

Pos logit contributions
 color+0.005
 passenger+0.005
 few+0.005
Su+0.005
While+0.005
Neg logit contributions
 cries-0.009
 breaths-0.009
cha-0.008
ishly-0.008
lers-0.008
Token "
Token ID366
Feature activation-4.82e-01

Pos logit contributions
 few+0.011
 color+0.009
Su+0.009
"+0.009
J+0.008
Neg logit contributions
 cries-0.043
 breaths-0.042
cha-0.039
lers-0.039
ses-0.037
TokenWhy
Token ID5195
Feature activation-2.59e-02

Pos logit contributions
 few+0.011
 builder+0.011
 color+0.011
 passenger+0.011
 Jem+0.010
Neg logit contributions
cha-0.013
 cries-0.012
 breaths-0.012
lers-0.010
ted-0.010
Token don
Token ID836
Feature activation+4.20e+00

Pos logit contributions
 few+0.001
Su+0.001
Grand+0.001
 passenger+0.001
J+0.001
Neg logit contributions
 cries-0.001
 breaths-0.001
cha-0.000
 worries-0.000
 doesn-0.000
Token't
Token ID470
Feature activation+1.43e+00

Pos logit contributions
 breaths+0.132
 cries+0.129
lers+0.118
cha+0.115
ses+0.113
Neg logit contributions
 few-0.091
"-0.089
 color-0.081
Su-0.081
Grand-0.077
Token you
Token ID345
Feature activation+9.97e-01

Pos logit contributions
 cries+0.033
 breaths+0.031
cha+0.028
lers+0.027
 worries+0.024
Neg logit contributions
 few-0.044
 color-0.040
Su-0.040
 passenger-0.040
J-0.039
Token move
Token ID1445
Feature activation-1.26e+00

Pos logit contributions
 cries+0.032
 breaths+0.032
cha+0.029
lers+0.028
 worries+0.026
Neg logit contributions
 few-0.021
Su-0.019
 color-0.019
J-0.018
"-0.018
Token around
Token ID1088
Feature activation-3.62e-01

Pos logit contributions
Su+0.030
J+0.029
 few+0.028
 passenger+0.028
Grand+0.028
Neg logit contributions
 breaths-0.038
 cries-0.038
lers-0.034
cha-0.033
 doesn-0.030
Token it
Token ID340
Feature activation+4.02e-01

Pos logit contributions
J+0.008
 passenger+0.008
 few+0.008
Grand+0.008
Kim+0.008
Neg logit contributions
 cries-0.010
 breaths-0.010
cha-0.010
lers-0.009
 worries-0.009
Token
Token ID241
Feature activation+6.33e-01

Pos logit contributions
 breaths+0.037
 cries+0.036
lers+0.034
cha+0.033
 worries+0.031
Neg logit contributions
"-0.034
 few-0.033
Su-0.031
 color-0.031
 Jem-0.030
TokenHey
Token ID10814
Feature activation+3.37e-02

Pos logit contributions
 cries+0.021
 breaths+0.020
cha+0.019
lers+0.018
ses+0.017
Neg logit contributions
 few-0.012
 color-0.011
 passenger-0.010
 builder-0.010
"-0.010
Token Jay
Token ID9180
Feature activation-2.66e-02

Pos logit contributions
 cries+0.001
 breaths+0.001
cha+0.001
lers+0.001
 worries+0.001
Neg logit contributions
 few-0.001
 color-0.001
Back-0.001
Grand-0.001
 passenger-0.001
Token,
Token ID11
Feature activation+2.12e-01

Pos logit contributions
 few+0.001
 color+0.001
 passenger+0.001
Back+0.001
Grand+0.000
Neg logit contributions
 cries-0.001
 breaths-0.001
cha-0.001
lers-0.001
ishly-0.001
Token why
Token ID1521
Feature activation+8.59e-01

Pos logit contributions
 cries+0.006
 breaths+0.005
cha+0.005
lers+0.005
 worries+0.005
Neg logit contributions
 few-0.005
J-0.005
 color-0.005
Grand-0.005
Su-0.005
Token don
Token ID836
Feature activation+4.20e+00

Pos logit contributions
 cries+0.020
 breaths+0.019
cha+0.017
lers+0.017
 worries+0.015
Neg logit contributions
 few-0.026
 color-0.023
Su-0.023
 passenger-0.023
Grand-0.023
Tokenâ
Token ID22940
Feature activation+2.64e+00

Pos logit contributions
 breaths+0.140
 cries+0.136
lers+0.126
ses+0.125
cha+0.121
Neg logit contributions
"-0.085
 few-0.083
Su-0.075
The-0.071
 color-0.071
Token€
Token ID26391
Feature activation+1.51e+00

Pos logit contributions
 cries+0.076
 breaths+0.075
lers+0.071
ses+0.065
cha+0.063
Neg logit contributions
"-0.062
 color-0.059
 few-0.058
Su-0.058
J-0.056
Token™
Token ID8151
Feature activation+6.69e-01

Pos logit contributions
 cries+0.039
 breaths+0.039
cha+0.034
lers+0.034
ses+0.030
Neg logit contributions
 few-0.040
 color-0.038
Su-0.037
"-0.037
J-0.036
Tokent
Token ID83
Feature activation+2.08e+00

Pos logit contributions
 cries+0.017
cha+0.016
lers+0.015
 breaths+0.015
ishly+0.013
Neg logit contributions
 few-0.018
 color-0.017
 passenger-0.017
 builder-0.016
 groom-0.016
Token you
Token ID345
Feature activation+1.13e+00

Pos logit contributions
 cries+0.046
 breaths+0.044
cha+0.039
lers+0.038
 worries+0.033
Neg logit contributions
 few-0.065
 color-0.061
Su-0.059
 passenger-0.059
"-0.058
Token at
Token ID379
Feature activation+1.21e+00

Pos logit contributions
 few+0.002
 color+0.002
"+0.002
Su+0.002
 passenger+0.002
Neg logit contributions
 breaths-0.003
 cries-0.003
cha-0.003
lers-0.003
ses-0.003
Token him
Token ID683
Feature activation+2.19e+00

Pos logit contributions
 breaths+0.035
 cries+0.035
cha+0.031
lers+0.031
ses+0.029
Neg logit contributions
 few-0.028
 color-0.027
"-0.026
Su-0.025
 passenger-0.024
Token.
Token ID13
Feature activation+1.62e-01

Pos logit contributions
 breaths+0.069
 cries+0.068
cha+0.062
lers+0.061
ses+0.058
Neg logit contributions
 few-0.047
 color-0.044
"-0.043
Su-0.040
 true-0.039
Token '
Token ID705
Feature activation-9.63e-01

Pos logit contributions
 breaths+0.005
 cries+0.005
lers+0.004
cha+0.004
 worries+0.004
Neg logit contributions
 few-0.004
"-0.003
 color-0.003
Su-0.003
 Jem-0.003
TokenWhy
Token ID5195
Feature activation+2.52e-01

Pos logit contributions
 few+0.022
 color+0.021
 builder+0.020
 Jem+0.019
 sunny+0.019
Neg logit contributions
cha-0.026
 cries-0.025
 breaths-0.025
lers-0.024
ted-0.021
Token don
Token ID836
Feature activation+4.19e+00

Pos logit contributions
 cries+0.006
 breaths+0.006
cha+0.005
lers+0.005
 worries+0.005
Neg logit contributions
 few-0.007
Su-0.007
 color-0.006
J-0.006
Grand-0.006
Token't
Token ID470
Feature activation+1.80e+00

Pos logit contributions
 breaths+0.142
 cries+0.135
lers+0.126
ses+0.125
cha+0.121
Neg logit contributions
"-0.086
 few-0.084
Su-0.073
The-0.072
 color-0.071
Token you
Token ID345
Feature activation+1.03e+00

Pos logit contributions
 breaths+0.044
 cries+0.043
lers+0.038
cha+0.038
ses+0.035
Neg logit contributions
 few-0.052
 color-0.049
"-0.048
Su-0.047
 Jem-0.045
Token try
Token ID1949
Feature activation+9.19e-02

Pos logit contributions
 breaths+0.034
 cries+0.034
lers+0.031
cha+0.031
ses+0.028
Neg logit contributions
 few-0.021
 color-0.019
"-0.019
Su-0.018
 passenger-0.017
Token to
Token ID284
Feature activation-1.06e+00

Pos logit contributions
 cries+0.003
 breaths+0.003
cha+0.003
lers+0.003
 worries+0.003
Neg logit contributions
 few-0.001
Su-0.001
 color-0.001
 passenger-0.001
J-0.001
Token mark
Token ID1317
Feature activation-2.97e-01

Pos logit contributions
 few+0.022
 color+0.021
"+0.020
Su+0.020
 passenger+0.019
Neg logit contributions
 cries-0.033
 breaths-0.033
cha-0.030
lers-0.030
ses-0.028
TokenIt
Token ID1026
Feature activation+1.58e+00

Pos logit contributions
 cries+0.014
 breaths+0.014
cha+0.013
lers+0.013
ishly+0.012
Neg logit contributions
 few-0.010
 color-0.009
 passenger-0.008
Su-0.008
 Jem-0.008
Token's
Token ID338
Feature activation+3.84e-01

Pos logit contributions
 breaths+0.053
 cries+0.050
cha+0.048
ses+0.046
lers+0.045
Neg logit contributions
 few-0.029
"-0.027
 color-0.027
 true-0.025
 passenger-0.024
Token okay
Token ID8788
Feature activation-1.85e-01

Pos logit contributions
 cries+0.012
 breaths+0.012
lers+0.010
cha+0.010
 worries+0.010
Neg logit contributions
 few-0.008
J-0.008
Su-0.008
"-0.008
 color-0.008
Token.
Token ID13
Feature activation+6.92e-01

Pos logit contributions
 few+0.004
 passenger+0.003
J+0.003
Back+0.003
Su+0.003
Neg logit contributions
 cries-0.006
 breaths-0.006
cha-0.005
lers-0.005
 worries-0.005
Token Why
Token ID4162
Feature activation+3.57e-01

Pos logit contributions
 breaths+0.020
 cries+0.020
lers+0.018
cha+0.018
ses+0.016
Neg logit contributions
 few-0.016
 color-0.016
"-0.015
Su-0.015
 passenger-0.015
Token don
Token ID836
Feature activation+4.19e+00

Pos logit contributions
 cries+0.009
 breaths+0.009
cha+0.008
lers+0.008
ses+0.007
Neg logit contributions
 few-0.010
 color-0.009
Su-0.009
"-0.009
J-0.009
Token't
Token ID470
Feature activation+2.09e+00

Pos logit contributions
 breaths+0.139
 cries+0.134
lers+0.123
ses+0.122
cha+0.121
Neg logit contributions
"-0.084
 few-0.084
 color-0.073
Su-0.073
The-0.071
Token you
Token ID345
Feature activation+8.21e-01

Pos logit contributions
 breaths+0.059
 cries+0.055
lers+0.051
ses+0.050
cha+0.049
Neg logit contributions
 few-0.053
 color-0.051
"-0.049
Su-0.047
 true-0.046
Token try
Token ID1949
Feature activation+2.26e-01

Pos logit contributions
 cries+0.027
 breaths+0.027
cha+0.025
lers+0.025
ses+0.023
Neg logit contributions
 few-0.016
 color-0.015
"-0.014
Su-0.014
 passenger-0.013
Token looking
Token ID2045
Feature activation-1.06e+00

Pos logit contributions
 cries+0.008
 breaths+0.008
lers+0.007
cha+0.007
 worries+0.007
Neg logit contributions
 few-0.004
Su-0.004
J-0.003
 passenger-0.003
 color-0.003
Token at
Token ID379
Feature activation+1.06e+00

Pos logit contributions
Su+0.024
 few+0.024
J+0.023
 passenger+0.022
Kim+0.022
Neg logit contributions
 cries-0.032
 breaths-0.031
cha-0.028
lers-0.028
 worries-0.026
Token mom
Token ID1995
Feature activation-1.61e+00

Pos logit contributions
 few+0.021
 color+0.019
Su+0.019
"+0.019
 Jem+0.018
Neg logit contributions
 breaths-0.039
 cries-0.038
lers-0.035
cha-0.035
ses-0.032
Token says
Token ID1139
Feature activation+1.61e-01

Pos logit contributions
"+0.047
S+0.039
Grand+0.038
J+0.038
Su+0.038
Neg logit contributions
 breaths-0.039
ses-0.036
 thirst-0.036
edly-0.035
cha-0.034
Token.
Token ID13
Feature activation-1.29e-01

Pos logit contributions
 breaths+0.006
cha+0.006
lers+0.006
ses+0.005
 cries+0.005
Neg logit contributions
"-0.002
 few-0.002
 if-0.002
 to-0.002
,-0.002
Token "
Token ID366
Feature activation+6.10e-01

Pos logit contributions
 few+0.002
"+0.002
Su+0.002
 color+0.001
S+0.001
Neg logit contributions
 breaths-0.005
 cries-0.005
lers-0.005
cha-0.005
 worries-0.004
TokenHe
Token ID1544
Feature activation+6.21e-01

Pos logit contributions
 breaths+0.021
 cries+0.021
lers+0.019
cha+0.019
 worries+0.018
Neg logit contributions
 few-0.012
Su-0.010
 color-0.010
"-0.010
J-0.010
Token doesn
Token ID1595
Feature activation+4.18e+00

Pos logit contributions
 breaths+0.019
edly+0.019
der+0.017
cha+0.017
ement+0.017
Neg logit contributions
"-0.011
a-0.010
 kind-0.010
 color-0.009
 allowed-0.009
Token't
Token ID470
Feature activation+5.69e-01

Pos logit contributions
 breaths+0.143
 cries+0.135
lers+0.129
cha+0.126
ses+0.122
Neg logit contributions
 few-0.081
"-0.081
Su-0.072
 color-0.071
The-0.068
Token understand
Token ID1833
Feature activation-7.00e-02

Pos logit contributions
 breaths+0.019
 cries+0.019
cha+0.018
lers+0.018
ses+0.016
Neg logit contributions
 few-0.011
 color-0.010
"-0.010
Su-0.010
S-0.009
Token what
Token ID644
Feature activation+1.17e+00

Pos logit contributions
 few+0.002
 passenger+0.002
Su+0.002
J+0.002
 color+0.002
Neg logit contributions
 cries-0.002
 breaths-0.002
cha-0.002
lers-0.002
 worries-0.002
Token is
Token ID318
Feature activation+1.15e+00

Pos logit contributions
 cries+0.037
 breaths+0.036
cha+0.033
lers+0.033
 worries+0.030
Neg logit contributions
 few-0.025
Su-0.023
 color-0.023
 passenger-0.022
J-0.022
Token happening
Token ID5836
Feature activation-3.02e-01

Pos logit contributions
 cries+0.034
 breaths+0.033
cha+0.030
lers+0.030
ses+0.027
Neg logit contributions
 few-0.028
 color-0.025
Su-0.025
"-0.025
J-0.024
Token.
Token ID13
Feature activation+1.06e+00

Pos logit contributions
 breaths+0.090
 cries+0.088
cha+0.084
lers+0.081
ses+0.075
Neg logit contributions
 few-0.054
 color-0.051
"-0.048
Su-0.048
 Jem-0.046
Token\n
Token ID198
Feature activation+4.22e-01

Pos logit contributions
 breaths+0.035
 cries+0.034
lers+0.033
cha+0.032
ses+0.031
Neg logit contributions
 few-0.022
"-0.020
 color-0.018
Grand-0.017
Be-0.017
Token\n
Token ID198
Feature activation+4.92e-01

Pos logit contributions
 breaths+0.015
 cries+0.015
lers+0.014
cha+0.013
ses+0.013
Neg logit contributions
 few-0.008
"-0.008
 color-0.007
Su-0.007
Grand-0.007
Token"
Token ID1
Feature activation+3.11e-01

Pos logit contributions
 cries+0.022
 breaths+0.022
cha+0.021
lers+0.020
 wishes+0.019
Neg logit contributions
 few-0.003
 color-0.003
 Jem-0.003
Su-0.003
 passenger-0.002
TokenWhy
Token ID5195
Feature activation+3.80e-01

Pos logit contributions
 cries+0.009
 breaths+0.009
cha+0.009
lers+0.008
ishly+0.008
Neg logit contributions
 few-0.007
 color-0.006
 passenger-0.006
Su-0.006
 Jem-0.006
Token don
Token ID836
Feature activation+4.18e+00

Pos logit contributions
 cries+0.010
 breaths+0.009
cha+0.008
lers+0.008
 worries+0.008
Neg logit contributions
 few-0.011
Su-0.010
 passenger-0.010
Grand-0.010
J-0.010
Token't
Token ID470
Feature activation+1.51e+00

Pos logit contributions
 breaths+0.140
 cries+0.135
lers+0.126
ses+0.123
cha+0.122
Neg logit contributions
"-0.084
 few-0.084
Su-0.073
 color-0.072
Grand-0.070
Token you
Token ID345
Feature activation+9.87e-01

Pos logit contributions
 cries+0.039
 breaths+0.039
cha+0.035
lers+0.034
ses+0.031
Neg logit contributions
 few-0.041
 color-0.038
Su-0.037
"-0.037
J-0.036
Token add
Token ID751
Feature activation-1.32e+00

Pos logit contributions
 cries+0.031
 breaths+0.031
cha+0.028
lers+0.028
 worries+0.025
Neg logit contributions
 few-0.021
Su-0.019
 color-0.018
J-0.018
 passenger-0.018
Token some
Token ID617
Feature activation-4.37e-01

Pos logit contributions
Su+0.027
 passenger+0.026
 few+0.026
J+0.025
Grand+0.025
Neg logit contributions
 breaths-0.042
 cries-0.042
lers-0.038
cha-0.037
 worries-0.035
Token paint
Token ID7521
Feature activation-1.59e+00

Pos logit contributions
J+0.010
 few+0.010
Grand+0.010
Su+0.010
 passenger+0.009
Neg logit contributions
 breaths-0.013
 cries-0.013
lers-0.012
cha-0.011
 worries-0.011
Token.
Token ID13
Feature activation+7.82e-02

Pos logit contributions
 few+0.008
 color+0.008
"+0.008
Su+0.007
J+0.007
Neg logit contributions
 breaths-0.012
 cries-0.011
cha-0.011
lers-0.011
ses-0.010
Token\n
Token ID198
Feature activation-4.99e-01

Pos logit contributions
 cries+0.002
 breaths+0.002
cha+0.002
lers+0.002
ses+0.002
Neg logit contributions
 few-0.002
 color-0.002
Su-0.002
 passenger-0.002
 Jem-0.002
Token\n
Token ID198
Feature activation-4.89e-01

Pos logit contributions
 few+0.011
 color+0.011
Su+0.010
 passenger+0.010
 Jem+0.010
Neg logit contributions
 cries-0.015
 breaths-0.015
cha-0.013
lers-0.013
ses-0.012
Token"
Token ID1
Feature activation-5.34e-01

Pos logit contributions
 color+0.009
 groom+0.009
 Jem+0.009
 type+0.008
 known+0.008
Neg logit contributions
 cries-0.014
cha-0.013
lers-0.013
edly-0.012
led-0.012
TokenWhy
Token ID5195
Feature activation-3.33e-02

Pos logit contributions
 few+0.011
 color+0.010
 Jem+0.010
Su+0.010
 passenger+0.010
Neg logit contributions
 cries-0.016
 breaths-0.015
cha-0.015
lers-0.014
ses-0.013
Token don
Token ID836
Feature activation+4.18e+00

Pos logit contributions
 few+0.001
Su+0.001
 passenger+0.001
 color+0.001
J+0.001
Neg logit contributions
 cries-0.001
 breaths-0.001
cha-0.001
lers-0.001
 worries-0.001
Token't
Token ID470
Feature activation+1.90e+00

Pos logit contributions
 breaths+0.141
 cries+0.134
lers+0.124
cha+0.122
ses+0.121
Neg logit contributions
"-0.085
 few-0.083
 color-0.074
Su-0.071
The-0.070
Token you
Token ID345
Feature activation+7.29e-01

Pos logit contributions
 breaths+0.056
 cries+0.053
lers+0.050
ses+0.049
cha+0.048
Neg logit contributions
 color-0.045
 few-0.045
"-0.045
Su-0.040
 Jem-0.039
Token push
Token ID4574
Feature activation-8.30e-01

Pos logit contributions
 cries+0.022
 breaths+0.021
cha+0.019
lers+0.019
 worries+0.018
Neg logit contributions
 few-0.017
Su-0.015
 passenger-0.015
J-0.015
Grand-0.014
Token with
Token ID351
Feature activation-1.67e-01

Pos logit contributions
 few+0.016
Su+0.015
 passenger+0.014
 color+0.014
 builder+0.014
Neg logit contributions
 cries-0.029
 breaths-0.028
lers-0.026
cha-0.026
 worries-0.023
Token your
Token ID534
Feature activation+4.08e-01

Pos logit contributions
 few+0.004
 passenger+0.004
Su+0.004
J+0.004
 builder+0.004
Neg logit contributions
 cries-0.005
 breaths-0.005
cha-0.004
lers-0.004
 worries-0.004
Tokenre
Token ID260
Feature activation-2.62e-01

Pos logit contributions
 few+0.006
 color+0.006
 passenger+0.005
 groom+0.005
 builder+0.005
Neg logit contributions
 cries-0.006
cha-0.005
lers-0.005
 breaths-0.005
ishly-0.005
Token hungry
Token ID14720
Feature activation-1.24e+00

Pos logit contributions
 few+0.006
Su+0.006
J+0.005
 color+0.005
Grand+0.005
Neg logit contributions
 cries-0.008
 breaths-0.008
cha-0.007
lers-0.007
 worries-0.007
Token,
Token ID11
Feature activation+6.20e-02

Pos logit contributions
 passenger+0.033
Su+0.033
J+0.033
 few+0.032
Pe+0.032
Neg logit contributions
 cries-0.032
 breaths-0.030
cha-0.027
lers-0.026
 doesn-0.024
Token but
Token ID475
Feature activation+1.10e+00

Pos logit contributions
 cries+0.002
 breaths+0.002
cha+0.002
lers+0.002
 worries+0.002
Neg logit contributions
 few-0.001
 passenger-0.001
J-0.001
 color-0.001
Su-0.001
Token why
Token ID1521
Feature activation+2.66e-01

Pos logit contributions
 cries+0.030
 breaths+0.030
cha+0.027
lers+0.027
 worries+0.024
Neg logit contributions
 few-0.027
 color-0.025
Su-0.025
"-0.025
 passenger-0.024
Token don
Token ID836
Feature activation+4.16e+00

Pos logit contributions
 cries+0.006
 breaths+0.006
cha+0.005
lers+0.005
 worries+0.005
Neg logit contributions
 few-0.008
Su-0.007
 passenger-0.007
 color-0.007
J-0.007
Tokenâ
Token ID22940
Feature activation+2.46e+00

Pos logit contributions
 breaths+0.147
 cries+0.138
ses+0.130
lers+0.129
cha+0.125
Neg logit contributions
"-0.081
 few-0.077
The-0.067
Su-0.067
 color-0.066
Token€
Token ID26391
Feature activation+1.45e+00

Pos logit contributions
 breaths+0.074
 cries+0.072
lers+0.068
 webs+0.063
cha+0.062
Neg logit contributions
"-0.057
 few-0.054
 color-0.053
S-0.051
Su-0.050
Token™
Token ID8151
Feature activation+8.01e-01

Pos logit contributions
 cries+0.039
 breaths+0.038
cha+0.035
lers+0.034
ses+0.031
Neg logit contributions
 few-0.038
 color-0.035
Su-0.034
"-0.034
 passenger-0.033
Tokent
Token ID83
Feature activation+1.62e+00

Pos logit contributions
 cries+0.021
cha+0.019
 breaths+0.019
lers+0.018
ishly+0.016
Neg logit contributions
 few-0.021
 color-0.020
 passenger-0.020
 builder-0.019
Su-0.019
Token you
Token ID345
Feature activation+1.14e+00

Pos logit contributions
 cries+0.035
 breaths+0.033
cha+0.030
lers+0.029
 worries+0.025
Neg logit contributions
 few-0.050
 passenger-0.048
Su-0.047
 color-0.047
J-0.046
Token He
Token ID679
Feature activation-6.35e-01

Pos logit contributions
 cries+0.003
 breaths+0.003
cha+0.003
lers+0.003
ses+0.003
Neg logit contributions
 few-0.003
 color-0.003
Su-0.003
"-0.003
 passenger-0.003
Token begged
Token ID32946
Feature activation+8.25e-02

Pos logit contributions
 few+0.013
 color+0.012
Su+0.012
"+0.012
 passenger+0.011
Neg logit contributions
 cries-0.020
 breaths-0.020
cha-0.018
lers-0.018
ses-0.017
Token and
Token ID290
Feature activation+1.34e-01

Pos logit contributions
 cries+0.003
 breaths+0.002
cha+0.002
lers+0.002
 worries+0.002
Neg logit contributions
 few-0.002
 passenger-0.002
Su-0.002
spe-0.002
Back-0.002
Token begged
Token ID32946
Feature activation+2.41e-02

Pos logit contributions
 cries+0.003
 breaths+0.003
cha+0.003
ishly+0.003
lers+0.003
Neg logit contributions
 few-0.003
 color-0.003
 passenger-0.003
 sunny-0.003
Back-0.003
Token his
Token ID465
Feature activation-1.40e+00

Pos logit contributions
 cries+0.001
 breaths+0.001
cha+0.001
lers+0.001
 deeply+0.001
Neg logit contributions
Su-0.000
 few-0.000
 passenger-0.000
Back-0.000
Be-0.000
Token mom
Token ID1995
Feature activation-4.52e+00

Pos logit contributions
 few+0.033
Su+0.032
Sus+0.029
 Jem+0.029
J+0.029
Neg logit contributions
 cries-0.042
 breaths-0.039
cha-0.035
lers-0.034
 worries-0.033
Token to
Token ID284
Feature activation-2.36e+00

Pos logit contributions
Su+0.072
 few+0.067
 passenger+0.066
J+0.064
Grand+0.063
Neg logit contributions
 cries-0.165
 breaths-0.147
lers-0.147
cha-0.142
 worries-0.136
Token buy
Token ID2822
Feature activation+6.54e-01

Pos logit contributions
 few+0.055
Su+0.051
J+0.050
 passenger+0.048
Sus+0.048
Neg logit contributions
 cries-0.069
 breaths-0.061
cha-0.058
ishly-0.056
 worries-0.053
Token it
Token ID340
Feature activation+3.43e-01

Pos logit contributions
 cries+0.019
 breaths+0.019
cha+0.017
lers+0.017
 worries+0.016
Neg logit contributions
 few-0.015
 color-0.014
Su-0.014
"-0.013
 passenger-0.013
Token for
Token ID329
Feature activation+2.52e-01

Pos logit contributions
 cries+0.007
 breaths+0.006
lers+0.006
cha+0.006
ishly+0.005
Neg logit contributions
Su-0.011
Be-0.010
 few-0.010
Grand-0.010
 builder-0.010
Token him
Token ID683
Feature activation+1.21e-01

Pos logit contributions
 cries+0.007
 breaths+0.007
cha+0.006
lers+0.006
 worries+0.005
Neg logit contributions
 few-0.006
 color-0.006
 passenger-0.006
Su-0.006
 builder-0.006
Token.
Token ID13
Feature activation+9.19e-02

Pos logit contributions
 passenger+0.012
Be+0.012
Pe+0.012
 builder+0.012
Su+0.012
Neg logit contributions
 cries-0.009
 breaths-0.009
 deeply-0.007
cha-0.007
lers-0.007
Token K
Token ID509
Feature activation+5.08e-02

Pos logit contributions
 cries+0.003
 breaths+0.003
cha+0.003
lers+0.003
ishly+0.003
Neg logit contributions
 few-0.002
 color-0.002
J-0.002
 builder-0.001
 passenger-0.001
Tokeniki
Token ID5580
Feature activation+3.21e-02

Pos logit contributions
 cries+0.001
 breaths+0.001
cha+0.001
lers+0.001
ses+0.001
Neg logit contributions
 few-0.001
 color-0.001
Su-0.001
"-0.001
J-0.001
Token begged
Token ID32946
Feature activation+1.46e-01

Pos logit contributions
 breaths+0.001
 cries+0.001
cha+0.001
lers+0.001
ses+0.001
Neg logit contributions
 few-0.001
 color-0.001
"-0.001
Su-0.001
J-0.001
Token her
Token ID607
Feature activation-1.76e+00

Pos logit contributions
 cries+0.005
 breaths+0.004
cha+0.004
 deeply+0.004
 worries+0.004
Neg logit contributions
 few-0.003
 passenger-0.003
Su-0.003
Back-0.003
 color-0.003
Token mom
Token ID1995
Feature activation-4.49e+00

Pos logit contributions
Su+0.040
 few+0.040
Pe+0.039
 passenger+0.039
Be+0.038
Neg logit contributions
 cries-0.051
 breaths-0.046
 worries-0.041
cha-0.041
lers-0.040
Token to
Token ID284
Feature activation-2.41e+00

Pos logit contributions
 few+0.064
Su+0.064
 passenger+0.061
 color+0.060
J+0.060
Neg logit contributions
 cries-0.171
 breaths-0.158
lers-0.151
cha-0.150
 worries-0.144
Token let
Token ID1309
Feature activation+2.26e+00

Pos logit contributions
 few+0.056
 passenger+0.053
J+0.051
cest+0.051
Back+0.050
Neg logit contributions
 cries-0.069
 breaths-0.064
cha-0.057
ishly-0.055
 worries-0.055
Token her
Token ID607
Feature activation-9.08e-01

Pos logit contributions
 cries+0.079
 breaths+0.076
cha+0.070
lers+0.068
 worries+0.064
Neg logit contributions
 few-0.042
 color-0.038
J-0.036
 passenger-0.035
Grand-0.035
Token finish
Token ID5461
Feature activation-1.09e+00

Pos logit contributions
Pe+0.018
 few+0.018
Grand+0.017
 passenger+0.017
Su+0.017
Neg logit contributions
 cries-0.029
 breaths-0.027
lers-0.024
 worries-0.024
cha-0.024
Token it
Token ID340
Feature activation-2.64e-01

Pos logit contributions
 passenger+0.024
Pe+0.023
Grand+0.023
 few+0.023
S+0.022
Neg logit contributions
 breaths-0.034
 cries-0.032
lers-0.028
cha-0.027
 worries-0.026
Token shop
Token ID6128
Feature activation-1.55e+00

Pos logit contributions
Su+0.007
J+0.007
Sus+0.007
 Jem+0.007
 few+0.007
Neg logit contributions
 cries-0.009
 breaths-0.008
lers-0.007
cha-0.007
 wishes-0.007
Token.
Token ID13
Feature activation-1.43e-01

Pos logit contributions
 few+0.046
Su+0.046
Be+0.045
Back+0.045
J+0.044
Neg logit contributions
 cries-0.035
 breaths-0.031
lers-0.029
cha-0.028
ishly-0.026
Token She
Token ID1375
Feature activation-9.90e-02

Pos logit contributions
 few+0.003
 color+0.003
Su+0.003
 passenger+0.003
"+0.003
Neg logit contributions
 cries-0.005
 breaths-0.004
cha-0.004
lers-0.004
ses-0.004
Token begged
Token ID32946
Feature activation+2.59e-01

Pos logit contributions
 few+0.002
Su+0.002
 passenger+0.002
 color+0.002
 Jem+0.002
Neg logit contributions
 cries-0.003
 breaths-0.003
cha-0.003
lers-0.003
 worries-0.002
Token her
Token ID607
Feature activation-1.61e+00

Pos logit contributions
 cries+0.009
 breaths+0.008
cha+0.007
lers+0.007
 worries+0.007
Neg logit contributions
 few-0.005
 passenger-0.005
Su-0.005
cest-0.005
spe-0.005
Token mom
Token ID1995
Feature activation-4.42e+00

Pos logit contributions
 few+0.037
Su+0.035
J+0.033
 passenger+0.033
 Jem+0.033
Neg logit contributions
 cries-0.048
 breaths-0.044
cha-0.040
lers-0.040
 worries-0.038
Token to
Token ID284
Feature activation-1.93e+00

Pos logit contributions
 few+0.061
Su+0.059
 passenger+0.055
 color+0.055
J+0.055
Neg logit contributions
 cries-0.172
 breaths-0.159
lers-0.154
cha-0.152
 worries-0.144
Token buy
Token ID2822
Feature activation+7.37e-01

Pos logit contributions
 few+0.045
 passenger+0.041
J+0.041
Su+0.040
Grand+0.040
Neg logit contributions
 cries-0.056
 breaths-0.051
cha-0.047
ishly-0.045
 worries-0.044
Token it
Token ID340
Feature activation+1.83e-01

Pos logit contributions
 cries+0.022
 breaths+0.022
cha+0.020
lers+0.020
 worries+0.018
Neg logit contributions
 few-0.017
 color-0.015
Su-0.015
 passenger-0.015
"-0.015
Token for
Token ID329
Feature activation+5.65e-01

Pos logit contributions
 cries+0.004
 breaths+0.004
lers+0.003
cha+0.003
ishly+0.003
Neg logit contributions
 passenger-0.006
Grand-0.005
spe-0.005
Be-0.005
 builder-0.005
Token her
Token ID607
Feature activation+9.33e-01

Pos logit contributions
 cries+0.019
 breaths+0.019
cha+0.017
lers+0.017
 worries+0.016
Neg logit contributions
 few-0.011
 color-0.010
 passenger-0.009
Su-0.009
"-0.009
Token wanted
Token ID2227
Feature activation-4.88e-02

Pos logit contributions
 few+0.015
Su+0.014
 passenger+0.014
"+0.013
Grand+0.013
Neg logit contributions
 cries-0.023
 breaths-0.022
cha-0.020
lers-0.020
 worries-0.019
Token.
Token ID13
Feature activation-3.06e-01

Pos logit contributions
Su+0.001
 passenger+0.001
 few+0.001
Grand+0.001
Pe+0.001
Neg logit contributions
 cries-0.001
 breaths-0.001
cha-0.001
lers-0.001
 worries-0.001
Token He
Token ID679
Feature activation-6.38e-01

Pos logit contributions
 few+0.009
 passenger+0.008
 color+0.008
J+0.008
Su+0.008
Neg logit contributions
 cries-0.007
 breaths-0.007
cha-0.006
lers-0.006
ses-0.005
Token begged
Token ID32946
Feature activation-1.17e-03

Pos logit contributions
 few+0.014
Su+0.014
 passenger+0.014
 color+0.013
 Jem+0.013
Neg logit contributions
 cries-0.019
 breaths-0.018
cha-0.017
lers-0.016
 worries-0.015
Token his
Token ID465
Feature activation-1.19e+00

Pos logit contributions
spe+0.000
 passenger+0.000
Su+0.000
cest+0.000
 few+0.000
Neg logit contributions
 cries-0.000
 breaths-0.000
 tears-0.000
 deeply-0.000
lers-0.000
Token mom
Token ID1995
Feature activation-4.35e+00

Pos logit contributions
Su+0.030
 few+0.030
Sus+0.028
J+0.027
 Jem+0.027
Neg logit contributions
 cries-0.033
 breaths-0.030
cha-0.027
ses-0.026
 worries-0.026
Token to
Token ID284
Feature activation-2.14e+00

Pos logit contributions
Su+0.080
 passenger+0.075
J+0.072
Pe+0.072
Sus+0.071
Neg logit contributions
 cries-0.150
lers-0.132
 breaths-0.129
cha-0.127
 doesn-0.124
Token buy
Token ID2822
Feature activation+4.57e-01

Pos logit contributions
 few+0.050
cest+0.048
 passenger+0.048
Su+0.047
J+0.046
Neg logit contributions
 cries-0.058
 breaths-0.050
cha-0.048
ishly-0.048
 worries-0.044
Token it
Token ID340
Feature activation+6.59e-02

Pos logit contributions
 cries+0.014
 breaths+0.014
cha+0.012
lers+0.012
 worries+0.011
Neg logit contributions
 few-0.010
Su-0.009
 color-0.009
"-0.009
 passenger-0.009
Token for
Token ID329
Feature activation+1.44e-01

Pos logit contributions
 cries+0.001
 breaths+0.001
lers+0.001
cha+0.001
 doesn+0.001
Neg logit contributions
Su-0.002
 passenger-0.002
Grand-0.002
Pe-0.002
spe-0.002
Token him
Token ID683
Feature activation+1.51e-01

Pos logit contributions
 cries+0.004
 breaths+0.004
cha+0.004
lers+0.003
 worries+0.003
Neg logit contributions
 few-0.004
 passenger-0.003
 color-0.003
Su-0.003
 builder-0.003
Token that
Token ID326
Feature activation+1.12e+00

Pos logit contributions
 few+0.036
Su+0.035
J+0.033
Grand+0.033
 Jem+0.032
Neg logit contributions
 cries-0.048
 breaths-0.046
lers-0.044
cha-0.042
 worries-0.040
Token she
Token ID673
Feature activation-4.32e-01

Pos logit contributions
 cries+0.035
 breaths+0.034
cha+0.031
lers+0.031
 worries+0.029
Neg logit contributions
 few-0.024
Su-0.022
 passenger-0.021
J-0.021
"-0.021
Token could
Token ID714
Feature activation-4.06e-01

Pos logit contributions
 few+0.007
 color+0.007
"+0.007
 true+0.006
 type+0.006
Neg logit contributions
 breaths-0.016
 cries-0.015
cha-0.014
lers-0.014
ses-0.013
Token make
Token ID787
Feature activation+8.30e-01

Pos logit contributions
 few+0.009
 color+0.008
Su+0.008
"+0.008
J+0.008
Neg logit contributions
 cries-0.013
 breaths-0.013
cha-0.011
lers-0.011
ses-0.010
Token his
Token ID465
Feature activation+2.38e-01

Pos logit contributions
 cries+0.029
 breaths+0.029
cha+0.027
lers+0.027
ses+0.025
Neg logit contributions
 few-0.014
 color-0.013
Su-0.012
"-0.012
J-0.012
Token kingdom
Token ID13239
Feature activation-4.28e+00

Pos logit contributions
 breaths+0.007
 cries+0.007
cha+0.006
lers+0.006
ishly+0.006
Neg logit contributions
 few-0.006
 color-0.005
 passenger-0.005
 true-0.005
 sunny-0.005
Token happy
Token ID3772
Feature activation+1.72e-01

Pos logit contributions
Su+0.087
Kim+0.080
 Jem+0.079
Pe+0.079
 Su+0.078
Neg logit contributions
 cries-0.135
 breaths-0.132
lers-0.120
 wishes-0.116
 worries-0.116
Token again
Token ID757
Feature activation-1.40e+00

Pos logit contributions
 cries+0.005
 breaths+0.005
cha+0.005
lers+0.005
ses+0.004
Neg logit contributions
 few-0.004
 color-0.004
Su-0.004
"-0.003
J-0.003
Token.
Token ID13
Feature activation-1.41e-01

Pos logit contributions
Su+0.041
 few+0.040
 passenger+0.040
Pe+0.039
Be+0.039
Neg logit contributions
 cries-0.031
 breaths-0.030
lers-0.026
cha-0.025
 wishes-0.024
Token So
Token ID1406
Feature activation-9.02e-01

Pos logit contributions
 few+0.003
 color+0.003
Su+0.003
 passenger+0.003
J+0.003
Neg logit contributions
 cries-0.004
 breaths-0.004
cha-0.004
lers-0.003
ishly-0.003
Token the
Token ID262
Feature activation-1.39e+00

Pos logit contributions
Su+0.017
 passenger+0.016
Grand+0.016
J+0.016
 color+0.016
Neg logit contributions
 cries-0.029
 breaths-0.029
cha-0.026
lers-0.025
ishly-0.025
Token so
Token ID523
Feature activation-7.19e-01

Pos logit contributions
 cries+0.007
 breaths+0.006
ishly+0.006
 screams+0.006
lers+0.006
Neg logit contributions
 few-0.006
J-0.006
Su-0.006
 builder-0.006
 color-0.006
Token excited
Token ID6568
Feature activation+1.25e+00

Pos logit contributions
 few+0.015
Su+0.013
 passenger+0.013
 color+0.013
 builder+0.013
Neg logit contributions
 cries-0.024
 breaths-0.021
cha-0.020
 wishes-0.019
 screams-0.019
Token and
Token ID290
Feature activation+5.95e-01

Pos logit contributions
 breaths+0.044
 cries+0.044
lers+0.040
cha+0.040
ses+0.038
Neg logit contributions
 few-0.022
 color-0.020
"-0.019
Su-0.019
 Jem-0.018
Token begged
Token ID32946
Feature activation+3.28e-01

Pos logit contributions
 cries+0.017
 breaths+0.015
cha+0.014
lers+0.013
ishly+0.013
Neg logit contributions
 few-0.015
 passenger-0.014
 color-0.014
Su-0.014
 builder-0.014
Token her
Token ID607
Feature activation-1.40e+00

Pos logit contributions
 cries+0.011
 breaths+0.010
cha+0.010
lers+0.009
ishly+0.009
Neg logit contributions
 few-0.006
 passenger-0.006
Su-0.006
 color-0.005
Back-0.005
Token mom
Token ID1995
Feature activation-4.26e+00

Pos logit contributions
 few+0.033
Su+0.031
 color+0.030
 passenger+0.029
J+0.029
Neg logit contributions
 cries-0.041
 breaths-0.038
cha-0.035
lers-0.034
 worries-0.033
Tokenmy
Token ID1820
Feature activation-3.44e+00

Pos logit contributions
 few+0.094
Su+0.093
 passenger+0.091
J+0.089
Pe+0.089
Neg logit contributions
 cries-0.122
lers-0.106
 breaths-0.105
cha-0.098
ishly-0.097
Token to
Token ID284
Feature activation-1.94e+00

Pos logit contributions
Su+0.057
 few+0.055
J+0.053
 passenger+0.051
Grand+0.051
Neg logit contributions
 cries-0.123
 breaths-0.112
lers-0.110
cha-0.107
ishly-0.103
Token let
Token ID1309
Feature activation+2.12e+00

Pos logit contributions
 few+0.046
J+0.042
 passenger+0.041
cest+0.041
Su+0.041
Neg logit contributions
 cries-0.054
 breaths-0.049
cha-0.046
ishly-0.046
 worries-0.042
Token her
Token ID607
Feature activation-6.06e-01

Pos logit contributions
 cries+0.075
 breaths+0.073
cha+0.068
lers+0.066
 worries+0.061
Neg logit contributions
 few-0.038
 color-0.034
Su-0.034
 passenger-0.033
J-0.032
Token pet
Token ID4273
Feature activation+6.59e-01

Pos logit contributions
J+0.014
Su+0.014
 few+0.013
Sus+0.013
Grand+0.013
Neg logit contributions
 cries-0.017
 breaths-0.016
lers-0.015
cha-0.015
 worries-0.014
Token,
Token ID11
Feature activation-1.84e-01

Pos logit contributions
 cries+0.003
 breaths+0.002
cha+0.002
lers+0.002
idents+0.002
Neg logit contributions
Che-0.004
Grand-0.004
Su-0.004
 passenger-0.004
 groom-0.003
Token so
Token ID523
Feature activation-3.48e-01

Pos logit contributions
 few+0.004
 passenger+0.004
J+0.004
Back+0.004
Grand+0.004
Neg logit contributions
 cries-0.006
 breaths-0.005
cha-0.005
lers-0.004
 worries-0.004
Token he
Token ID339
Feature activation-1.67e-01

Pos logit contributions
 few+0.009
 passenger+0.008
Su+0.008
 color+0.008
 Jem+0.008
Neg logit contributions
 cries-0.009
 breaths-0.009
lers-0.008
cha-0.008
 worries-0.007
Token begged
Token ID32946
Feature activation+3.82e-01

Pos logit contributions
 few+0.004
Su+0.003
 color+0.003
 passenger+0.003
 Jem+0.003
Neg logit contributions
 cries-0.005
 breaths-0.005
cha-0.005
lers-0.005
 worries-0.004
Token his
Token ID465
Feature activation-1.03e+00

Pos logit contributions
 cries+0.012
 breaths+0.010
lers+0.009
cha+0.009
 tears+0.009
Neg logit contributions
 passenger-0.009
 few-0.008
Su-0.008
spe-0.008
cest-0.008
Token mom
Token ID1995
Feature activation-4.26e+00

Pos logit contributions
 few+0.026
Su+0.026
Sus+0.024
Be+0.024
J+0.024
Neg logit contributions
 cries-0.028
 breaths-0.025
cha-0.022
 wishes-0.022
 worries-0.022
Token to
Token ID284
Feature activation-2.06e+00

Pos logit contributions
Su+0.070
 few+0.067
 passenger+0.064
J+0.063
Pe+0.063
Neg logit contributions
 cries-0.152
 breaths-0.134
lers-0.134
cha-0.128
ishly-0.124
Token buy
Token ID2822
Feature activation+6.02e-01

Pos logit contributions
 few+0.049
cest+0.046
 passenger+0.045
Su+0.044
J+0.044
Neg logit contributions
 cries-0.057
 breaths-0.050
cha-0.047
ishly-0.046
 worries-0.043
Token it
Token ID340
Feature activation+4.46e-01

Pos logit contributions
 cries+0.017
 breaths+0.016
cha+0.015
lers+0.015
 worries+0.013
Neg logit contributions
 few-0.015
Su-0.014
 color-0.014
 passenger-0.014
J-0.013
Token for
Token ID329
Feature activation+5.55e-01

Pos logit contributions
 cries+0.008
 breaths+0.007
lers+0.006
cha+0.006
ches+0.005
Neg logit contributions
Be-0.015
Su-0.015
 few-0.014
 passenger-0.014
Pe-0.014
Token him
Token ID683
Feature activation+1.34e-01

Pos logit contributions
 cries+0.015
 breaths+0.015
cha+0.013
lers+0.012
 worries+0.011
Neg logit contributions
 few-0.014
 passenger-0.013
 color-0.013
Su-0.013
 builder-0.013
Token ever
Token ID1683
Feature activation-2.48e-01

Pos logit contributions
 few+0.013
 color+0.012
"+0.011
Su+0.011
 passenger+0.011
Neg logit contributions
 cries-0.015
 breaths-0.015
cha-0.013
lers-0.013
ses-0.012
Token.
Token ID13
Feature activation-2.57e-01

Pos logit contributions
 few+0.004
 color+0.004
"+0.004
 available+0.004
 true+0.004
Neg logit contributions
 breaths-0.009
 cries-0.008
cha-0.008
ses-0.008
ishly-0.008
Token They
Token ID1119
Feature activation-4.30e-01

Pos logit contributions
 few+0.006
 color+0.006
"+0.006
Su+0.006
J+0.005
Neg logit contributions
 cries-0.007
 breaths-0.007
cha-0.007
lers-0.007
 worries-0.006
Token begged
Token ID32946
Feature activation+9.06e-02

Pos logit contributions
 few+0.008
 color+0.008
"+0.007
Su+0.007
J+0.007
Neg logit contributions
 breaths-0.014
 cries-0.014
cha-0.013
lers-0.013
ses-0.012
Token their
Token ID511
Feature activation-1.17e+00

Pos logit contributions
 cries+0.003
 breaths+0.003
cha+0.002
lers+0.002
 worries+0.002
Neg logit contributions
 few-0.002
Su-0.002
 passenger-0.002
 color-0.002
 Jem-0.002
Token mom
Token ID1995
Feature activation-4.24e+00

Pos logit contributions
 few+0.024
 color+0.022
Su+0.022
"+0.021
J+0.021
Neg logit contributions
 cries-0.038
 breaths-0.037
cha-0.034
lers-0.034
ses-0.031
Token to
Token ID284
Feature activation-2.36e+00

Pos logit contributions
Su+0.065
 few+0.065
J+0.059
 color+0.058
 passenger+0.058
Neg logit contributions
 cries-0.158
 breaths-0.145
lers-0.139
cha-0.137
 worries-0.131
Token change
Token ID1487
Feature activation-9.14e-01

Pos logit contributions
 few+0.056
J+0.052
Su+0.051
cest+0.051
Sus+0.050
Neg logit contributions
 cries-0.065
 breaths-0.058
ishly-0.053
cha-0.053
 worries-0.050
Token her
Token ID607
Feature activation+1.16e+00

Pos logit contributions
 few+0.015
Su+0.014
Grand+0.013
 builder+0.013
J+0.013
Neg logit contributions
 breaths-0.034
 cries-0.034
lers-0.031
cha-0.030
 worries-0.029
Token mind
Token ID2000
Feature activation-1.67e+00

Pos logit contributions
 cries+0.033
 breaths+0.033
cha+0.030
lers+0.030
ses+0.027
Neg logit contributions
 few-0.028
 color-0.026
"-0.025
Su-0.025
 passenger-0.024
Token.
Token ID13
Feature activation-9.83e-01

Pos logit contributions
 few+0.043
Be+0.043
Pe+0.043
Su+0.042
J+0.041
Neg logit contributions
 cries-0.044
 breaths-0.041
lers-0.038
der-0.035
ishly-0.035
Token He
Token ID679
Feature activation-9.91e-01

Pos logit contributions
 few+0.011
 color+0.010
 passenger+0.010
Su+0.010
Grand+0.010
Neg logit contributions
 cries-0.010
 breaths-0.009
cha-0.008
lers-0.008
ishly-0.007
Token begged
Token ID32946
Feature activation+1.66e-01

Pos logit contributions
 few+0.022
Su+0.021
 color+0.021
S+0.020
 passenger+0.020
Neg logit contributions
 cries-0.030
 breaths-0.029
lers-0.026
cha-0.026
 worries-0.023
Token and
Token ID290
Feature activation-2.96e-01

Pos logit contributions
 cries+0.005
 breaths+0.004
 tears+0.004
cha+0.003
lers+0.003
Neg logit contributions
 passenger-0.004
cest-0.004
 few-0.004
spe-0.004
Su-0.004
Token begged
Token ID32946
Feature activation+3.72e-02

Pos logit contributions
cest+0.009
 passenger+0.009
 color+0.009
 few+0.009
 sunny+0.009
Neg logit contributions
 cries-0.006
 breaths-0.006
ishly-0.005
cha-0.005
 cheeks-0.004
Token his
Token ID465
Feature activation-1.43e+00

Pos logit contributions
 cries+0.001
 breaths+0.001
 deeply+0.001
 tears+0.001
cha+0.001
Neg logit contributions
 passenger-0.001
cest-0.001
 few-0.001
Su-0.001
Be-0.001
Token mom
Token ID1995
Feature activation-4.20e+00

Pos logit contributions
 few+0.033
Su+0.033
Sus+0.030
 color+0.030
J+0.030
Neg logit contributions
 cries-0.043
 breaths-0.039
cha-0.035
lers-0.034
 worries-0.034
Token to
Token ID284
Feature activation-2.09e+00

Pos logit contributions
Su+0.074
J+0.067
Pe+0.066
 passenger+0.066
 few+0.066
Neg logit contributions
 cries-0.145
lers-0.130
 breaths-0.130
cha-0.125
ishly-0.121
Token let
Token ID1309
Feature activation+2.15e+00

Pos logit contributions
 few+0.050
Su+0.047
 passenger+0.046
J+0.045
S+0.045
Neg logit contributions
 cries-0.059
 breaths-0.055
cha-0.050
ishly-0.047
 worries-0.046
Token him
Token ID683
Feature activation-8.47e-01

Pos logit contributions
 cries+0.051
 breaths+0.047
cha+0.042
lers+0.040
 worries+0.036
Neg logit contributions
 few-0.064
 passenger-0.060
 color-0.060
Su-0.059
Grand-0.058
Token in
Token ID287
Feature activation-6.51e-01

Pos logit contributions
Su+0.019
 few+0.019
J+0.018
Pe+0.018
S+0.018
Neg logit contributions
 cries-0.025
 breaths-0.024
lers-0.022
cha-0.021
 worries-0.020
Token,
Token ID11
Feature activation-1.55e+00

Pos logit contributions
Su+0.014
 builder+0.014
 few+0.014
J+0.013
 color+0.013
Neg logit contributions
 cries-0.020
 breaths-0.019
 worries-0.015
 wishes-0.015
lers-0.015
Token it
Token ID340
Feature activation+1.35e+00

Pos logit contributions
 few+0.004
Su+0.004
 passenger+0.004
 builder+0.003
J+0.003
Neg logit contributions
 cries-0.003
 breaths-0.003
cha-0.003
lers-0.003
 webs-0.002
Token.
Token ID13
Feature activation-4.08e-01

Pos logit contributions
 cries+0.034
 breaths+0.031
lers+0.029
cha+0.028
 worries+0.025
Neg logit contributions
 few-0.038
Su-0.036
 passenger-0.036
J-0.036
Sus-0.035
Token She
Token ID1375
Feature activation-2.60e-02

Pos logit contributions
 few+0.010
 color+0.009
Su+0.009
 builder+0.009
 passenger+0.009
Neg logit contributions
 cries-0.012
 breaths-0.011
cha-0.010
lers-0.009
 wishes-0.009
Token begged
Token ID32946
Feature activation+3.37e-01

Pos logit contributions
 few+0.001
Su+0.001
 color+0.001
 passenger+0.001
"+0.001
Neg logit contributions
 cries-0.001
 breaths-0.001
cha-0.001
lers-0.001
 worries-0.001
Token her
Token ID607
Feature activation-1.52e+00

Pos logit contributions
 cries+0.011
 breaths+0.009
cha+0.009
 worries+0.008
 wishes+0.008
Neg logit contributions
 few-0.007
 passenger-0.007
Su-0.007
 builder-0.007
cest-0.007
Token mom
Token ID1995
Feature activation-4.17e+00

Pos logit contributions
Su+0.040
 few+0.039
 passenger+0.038
Sus+0.038
J+0.037
Neg logit contributions
 cries-0.039
 breaths-0.033
cha-0.031
 worries-0.031
 wishes-0.030
Token to
Token ID284
Feature activation-2.01e+00

Pos logit contributions
Su+0.074
J+0.069
 few+0.069
 passenger+0.069
Sus+0.067
Neg logit contributions
 cries-0.147
lers-0.128
 breaths-0.128
cha-0.126
ishly-0.123
Token buy
Token ID2822
Feature activation+6.20e-01

Pos logit contributions
 few+0.046
 passenger+0.042
J+0.041
 builder+0.041
cest+0.041
Neg logit contributions
 cries-0.059
 breaths-0.053
cha-0.050
ishly-0.049
 worries-0.046
Token it
Token ID340
Feature activation-1.45e-01

Pos logit contributions
 cries+0.019
 breaths+0.019
cha+0.017
lers+0.017
 worries+0.015
Neg logit contributions
 few-0.014
 color-0.013
Su-0.013
 passenger-0.012
"-0.012
Token.
Token ID13
Feature activation-7.18e-01

Pos logit contributions
 passenger+0.005
Be+0.005
 builder+0.005
Che+0.005
spe+0.005
Neg logit contributions
 cries-0.003
 breaths-0.003
lers-0.002
cha-0.002
ches-0.002
Token Mom
Token ID11254
Feature activation-2.14e+00

Pos logit contributions
 few+0.019
Su+0.017
 builder+0.017
 color+0.017
 passenger+0.017
Neg logit contributions
 cries-0.020
 breaths-0.018
cha-0.017
lers-0.015
ses-0.015
Token the
Token ID262
Feature activation-9.50e-01

Pos logit contributions
 cries+0.029
 breaths+0.029
cha+0.027
lers+0.027
ses+0.025
Neg logit contributions
 few-0.012
 color-0.010
Su-0.010
"-0.010
J-0.009
Token puppy
Token ID26188
Feature activation+1.67e+00

Pos logit contributions
 few+0.020
 color+0.019
Su+0.017
"+0.017
 passenger+0.017
Neg logit contributions
 breaths-0.031
 cries-0.030
cha-0.028
lers-0.027
ishly-0.025
Token and
Token ID290
Feature activation+9.00e-01

Pos logit contributions
 cries+0.052
 breaths+0.051
cha+0.047
lers+0.046
ses+0.043
Neg logit contributions
 few-0.036
 color-0.033
"-0.032
Su-0.032
 passenger-0.031
Token begged
Token ID32946
Feature activation+5.02e-01

Pos logit contributions
 cries+0.030
 breaths+0.029
lers+0.026
cha+0.026
 worries+0.024
Neg logit contributions
 few-0.018
Su-0.017
 color-0.016
"-0.016
J-0.016
Token her
Token ID607
Feature activation-1.26e+00

Pos logit contributions
 cries+0.017
 breaths+0.017
cha+0.015
lers+0.015
 worries+0.014
Neg logit contributions
 few-0.010
Su-0.009
 color-0.009
 passenger-0.008
J-0.008
Token mom
Token ID1995
Feature activation-4.16e+00

Pos logit contributions
 few+0.029
Su+0.027
J+0.026
 color+0.026
 passenger+0.026
Neg logit contributions
 cries-0.037
 breaths-0.034
cha-0.032
lers-0.031
 worries-0.030
Tokenmy
Token ID1820
Feature activation-3.48e+00

Pos logit contributions
Su+0.089
 few+0.088
J+0.083
 passenger+0.082
 Jem+0.082
Neg logit contributions
 cries-0.124
 breaths-0.111
lers-0.110
cha-0.101
 worries-0.098
Token to
Token ID284
Feature activation-2.27e+00

Pos logit contributions
Su+0.059
 few+0.054
J+0.054
 passenger+0.051
Grand+0.051
Neg logit contributions
 cries-0.124
 breaths-0.113
lers-0.110
cha-0.106
 worries-0.102
Token take
Token ID1011
Feature activation+1.52e+00

Pos logit contributions
 few+0.052
J+0.048
Su+0.047
 passenger+0.046
Back+0.046
Neg logit contributions
 cries-0.065
 breaths-0.059
cha-0.055
ishly-0.055
 worries-0.051
Token it
Token ID340
Feature activation+6.52e-01

Pos logit contributions
 cries+0.055
 breaths+0.054
cha+0.050
lers+0.050
ses+0.050
Neg logit contributions
 few-0.026
"-0.022
 color-0.022
 Jem-0.020
J-0.020
Token home
Token ID1363
Feature activation-1.39e+00

Pos logit contributions
 cries+0.020
 breaths+0.020
lers+0.018
cha+0.017
 worries+0.016
Neg logit contributions
 few-0.014
Su-0.013
 color-0.013
 passenger-0.012
Be-0.012
Token.
Token ID13
Feature activation+7.09e-01

Pos logit contributions
Su+0.012
Pe+0.012
 few+0.012
While+0.012
S+0.012
Neg logit contributions
 cries-0.013
 breaths-0.012
cha-0.011
lers-0.011
 deeply-0.010
Token The
Token ID383
Feature activation-1.31e+00

Pos logit contributions
 breaths+0.024
 cries+0.023
lers+0.022
cha+0.021
 worries+0.019
Neg logit contributions
 few-0.014
"-0.014
 color-0.013
 Jem-0.013
Su-0.013
Token bridge
Token ID7696
Feature activation-1.50e+00

Pos logit contributions
 few+0.023
 color+0.021
"+0.020
 passenger+0.019
 builder+0.018
Neg logit contributions
 cries-0.047
 breaths-0.046
lers-0.043
cha-0.043
ses-0.040
Token made
Token ID925
Feature activation+4.61e-01

Pos logit contributions
 few+0.035
Su+0.033
Grand+0.032
J+0.032
S+0.031
Neg logit contributions
 cries-0.044
 breaths-0.042
lers-0.039
cha-0.038
 worries-0.036
Token their
Token ID511
Feature activation-4.13e-01

Pos logit contributions
 cries+0.015
 breaths+0.015
cha+0.014
lers+0.014
 worries+0.013
Neg logit contributions
 few-0.009
Su-0.008
 color-0.008
"-0.008
 passenger-0.008
Token town
Token ID3240
Feature activation-4.14e+00

Pos logit contributions
 few+0.009
 color+0.008
 passenger+0.008
 sunny+0.007
"+0.007
Neg logit contributions
 cries-0.013
 breaths-0.013
cha-0.012
lers-0.012
ses-0.011
Token even
Token ID772
Feature activation-1.73e+00

Pos logit contributions
S+0.092
Su+0.092
Pe+0.090
J+0.090
Grand+0.087
Neg logit contributions
 cries-0.126
 breaths-0.118
 worries-0.107
lers-0.104
 wishes-0.104
Token more
Token ID517
Feature activation+1.84e-01

Pos logit contributions
Su+0.044
Pe+0.041
"+0.041
Kim+0.040
room+0.040
Neg logit contributions
 breaths-0.046
 cries-0.046
 worries-0.039
lers-0.039
 wishes-0.036
Token special
Token ID2041
Feature activation+1.10e+00

Pos logit contributions
 cries+0.006
 breaths+0.006
cha+0.005
lers+0.005
 worries+0.005
Neg logit contributions
 few-0.004
Su-0.003
 passenger-0.003
 color-0.003
"-0.003
Token.
Token ID13
Feature activation+7.54e-01

Pos logit contributions
 cries+0.034
 breaths+0.034
cha+0.031
lers+0.031
ses+0.028
Neg logit contributions
 few-0.024
 color-0.022
"-0.021
Su-0.021
 passenger-0.020
Token<|endoftext|>
Token ID50256
Feature activation+1.79e-01

Pos logit contributions
 breaths+0.027
 cries+0.025
lers+0.025
cha+0.024
 worries+0.022
Neg logit contributions
 few-0.014
"-0.013
J-0.013
 Jem-0.013
 color-0.013
Token pasta
Token ID26296
Feature activation+9.32e-01

Pos logit contributions
 few+0.010
 color+0.009
Su+0.009
"+0.009
J+0.009
Neg logit contributions
 cries-0.016
 breaths-0.016
cha-0.015
lers-0.015
ses-0.013
Token.
Token ID13
Feature activation+3.66e-01

Pos logit contributions
 cries+0.025
 breaths+0.024
cha+0.022
lers+0.022
 worries+0.020
Neg logit contributions
Su-0.023
 few-0.022
 passenger-0.021
Pe-0.021
Grand-0.021
Token She
Token ID1375
Feature activation-1.96e-01

Pos logit contributions
 breaths+0.013
 cries+0.012
lers+0.012
cha+0.011
ses+0.010
Neg logit contributions
 few-0.007
"-0.007
 Jem-0.006
 color-0.006
Su-0.006
Token begged
Token ID32946
Feature activation+5.18e-01

Pos logit contributions
 few+0.004
 color+0.004
"+0.004
Su+0.004
J+0.003
Neg logit contributions
 cries-0.006
 breaths-0.006
cha-0.006
lers-0.006
ses-0.005
Token her
Token ID607
Feature activation-1.62e+00

Pos logit contributions
 cries+0.019
 breaths+0.019
cha+0.017
lers+0.017
ses+0.016
Neg logit contributions
 few-0.009
 color-0.008
Su-0.008
"-0.007
 passenger-0.007
Token mom
Token ID1995
Feature activation-4.14e+00

Pos logit contributions
 few+0.032
 color+0.029
Su+0.028
"+0.028
J+0.027
Neg logit contributions
 cries-0.055
 breaths-0.054
cha-0.050
lers-0.049
ses-0.045
Tokenmy
Token ID1820
Feature activation-3.24e+00

Pos logit contributions
Su+0.082
 few+0.080
J+0.079
 passenger+0.077
 Jem+0.076
Neg logit contributions
 cries-0.131
 breaths-0.120
lers-0.117
cha-0.109
 worries-0.107
Token,
Token ID11
Feature activation-1.17e+00

Pos logit contributions
Su+0.049
 few+0.047
J+0.046
 color+0.045
S+0.043
Neg logit contributions
 cries-0.121
 breaths-0.114
lers-0.109
cha-0.106
ishly-0.102
Token but
Token ID475
Feature activation-7.98e-01

Pos logit contributions
 few+0.022
Su+0.021
J+0.021
 color+0.021
 passenger+0.021
Neg logit contributions
 cries-0.037
 breaths-0.035
ishly-0.032
cha-0.032
lers-0.030
Token her
Token ID607
Feature activation-6.62e-01

Pos logit contributions
 few+0.018
 passenger+0.017
Su+0.017
 color+0.017
S+0.016
Neg logit contributions
 cries-0.023
 breaths-0.022
ishly-0.020
cha-0.020
 worries-0.019
Token mom
Token ID1995
Feature activation-3.00e+00

Pos logit contributions
 few+0.010
 color+0.009
"+0.007
 tidy+0.007
 cheap+0.007
Neg logit contributions
 breaths-0.026
 cries-0.025
lers-0.024
cha-0.024
ishly-0.022
Token like
Token ID588
Feature activation-1.24e-01

Pos logit contributions
 few+0.015
Su+0.014
J+0.014
Back+0.013
Sus+0.013
Neg logit contributions
 cries-0.020
 breaths-0.019
cha-0.018
 worries-0.017
lers-0.017
Token your
Token ID534
Feature activation+5.12e-01

Pos logit contributions
Su+0.003
 few+0.003
J+0.003
 passenger+0.002
Back+0.002
Neg logit contributions
 cries-0.004
 breaths-0.004
cha-0.003
lers-0.003
 wishes-0.003
Token house
Token ID2156
Feature activation-3.24e+00

Pos logit contributions
 cries+0.015
 breaths+0.015
cha+0.014
lers+0.014
ses+0.012
Neg logit contributions
 few-0.012
 color-0.011
"-0.010
Su-0.010
 passenger-0.010
Token,
Token ID11
Feature activation-1.91e-01

Pos logit contributions
Su+0.091
 few+0.088
J+0.087
Be+0.085
Pe+0.084
Neg logit contributions
 cries-0.081
 breaths-0.077
lers-0.071
 doesn-0.066
 wishes-0.065
Token your
Token ID534
Feature activation+1.70e-01

Pos logit contributions
 few+0.004
J+0.004
Su+0.004
Back+0.004
Sus+0.004
Neg logit contributions
 cries-0.006
 breaths-0.005
cha-0.005
lers-0.005
 worries-0.005
Token street
Token ID4675
Feature activation-4.12e+00

Pos logit contributions
 cries+0.005
 breaths+0.005
cha+0.005
lers+0.005
ses+0.004
Neg logit contributions
 few-0.004
 color-0.004
Su-0.003
"-0.003
J-0.003
Token,
Token ID11
Feature activation-6.72e-03

Pos logit contributions
Su+0.110
J+0.105
 few+0.102
Sus+0.101
Be+0.100
Neg logit contributions
 cries-0.114
 breaths-0.108
lers-0.100
 doesn-0.093
 worries-0.093
Token your
Token ID534
Feature activation+1.10e-01

Pos logit contributions
 few+0.000
J+0.000
Su+0.000
Sus+0.000
Back+0.000
Neg logit contributions
 cries-0.000
 breaths-0.000
lers-0.000
 worries-0.000
cha-0.000
Token city
Token ID1748
Feature activation-3.46e+00

Pos logit contributions
 cries+0.003
 breaths+0.003
cha+0.003
lers+0.003
ses+0.003
Neg logit contributions
 few-0.003
Su-0.002
 color-0.002
"-0.002
J-0.002
Token,
Token ID11
Feature activation-6.86e-02

Pos logit contributions
Su+0.086
 few+0.084
J+0.083
Sus+0.080
 Jem+0.078
Neg logit contributions
 cries-0.098
 breaths-0.094
lers-0.086
cha-0.080
 worries-0.079
Token or
Token ID393
Feature activation-5.97e-01

Pos logit contributions
J+0.002
 few+0.002
Su+0.002
Sus+0.002
Grand+0.001
Neg logit contributions
 cries-0.002
 breaths-0.002
lers-0.002
 worries-0.002
cha-0.002
Token are
Token ID389
Feature activation+3.63e-01

Pos logit contributions
 cries+0.016
 breaths+0.016
cha+0.015
lers+0.015
ses+0.013
Neg logit contributions
 few-0.014
 color-0.013
Su-0.013
"-0.012
 passenger-0.012
Token both
Token ID1111
Feature activation-7.97e-01

Pos logit contributions
 cries+0.012
 breaths+0.012
cha+0.011
lers+0.011
 worries+0.010
Neg logit contributions
 few-0.007
Su-0.006
 color-0.006
"-0.006
J-0.006
Token going
Token ID1016
Feature activation-1.32e+00

Pos logit contributions
Su+0.017
J+0.016
Jen+0.016
Kim+0.016
Back+0.015
Neg logit contributions
 cries-0.027
 breaths-0.025
 worries-0.023
cha-0.022
lers-0.022
Token to
Token ID284
Feature activation-1.08e+00

Pos logit contributions
Su+0.031
Jen+0.027
J+0.027
Sus+0.026
Pe+0.026
Neg logit contributions
 cries-0.041
 breaths-0.041
lers-0.037
cha-0.037
 worries-0.033
Token your
Token ID534
Feature activation+8.44e-01

Pos logit contributions
 few+0.025
Su+0.024
J+0.023
 builder+0.023
 passenger+0.023
Neg logit contributions
 cries-0.032
 breaths-0.028
 wishes-0.026
 worries-0.025
cha-0.024
Token rooms
Token ID9519
Feature activation-4.12e+00

Pos logit contributions
 cries+0.029
 breaths+0.028
cha+0.026
lers+0.026
ses+0.024
Neg logit contributions
 few-0.016
 color-0.014
Su-0.014
"-0.014
J-0.013
Token.
Token ID13
Feature activation+2.29e-01

Pos logit contributions
Su+0.131
Be+0.129
Pe+0.127
 few+0.123
Jen+0.122
Neg logit contributions
 cries-0.083
 breaths-0.076
der-0.065
 doesn-0.064
lers-0.063
Token No
Token ID1400
Feature activation+1.13e-01

Pos logit contributions
 cries+0.005
 breaths+0.005
cha+0.004
lers+0.004
ses+0.004
Neg logit contributions
 few-0.007
 color-0.007
Su-0.007
"-0.006
J-0.006
Token more
Token ID517
Feature activation-2.54e-01

Pos logit contributions
 cries+0.004
 breaths+0.003
lers+0.003
cha+0.003
 worries+0.003
Neg logit contributions
Su-0.003
Back-0.002
Pe-0.002
 Jem-0.002
J-0.002
Token playing
Token ID2712
Feature activation-1.48e+00

Pos logit contributions
Su+0.006
 few+0.005
J+0.005
Be+0.005
Pe+0.005
Neg logit contributions
 cries-0.008
 breaths-0.008
lers-0.007
cha-0.007
 worries-0.007
Token today
Token ID1909
Feature activation-9.59e-01

Pos logit contributions
Su+0.040
 known+0.036
 Su+0.036
Pe+0.036
Ma+0.036
Neg logit contributions
 cries-0.038
 breaths-0.037
cha-0.035
 webs-0.032
lers-0.032
Token.
Token ID13
Feature activation+4.43e-01

Pos logit contributions
 few+0.002
Su+0.002
 passenger+0.001
J+0.001
 color+0.001
Neg logit contributions
 cries-0.002
 breaths-0.002
cha-0.002
lers-0.002
 worries-0.002
Token They
Token ID1119
Feature activation+6.28e-01

Pos logit contributions
 cries+0.012
 breaths+0.012
lers+0.012
cha+0.011
led+0.010
Neg logit contributions
 few-0.011
"-0.011
 color-0.011
 passenger-0.010
Su-0.010
Token will
Token ID481
Feature activation-6.16e-01

Pos logit contributions
 cries+0.019
 breaths+0.019
cha+0.017
lers+0.017
ses+0.016
Neg logit contributions
 few-0.014
 color-0.014
"-0.013
 passenger-0.012
Su-0.012
Token make
Token ID787
Feature activation+7.27e-01

Pos logit contributions
 few+0.013
Su+0.013
J+0.012
Back+0.012
S+0.012
Neg logit contributions
 cries-0.018
 breaths-0.017
 wishes-0.015
cha-0.015
 worries-0.015
Token our
Token ID674
Feature activation-4.18e-01

Pos logit contributions
 breaths+0.024
 cries+0.024
cha+0.022
lers+0.022
ses+0.021
Neg logit contributions
 few-0.014
 color-0.013
"-0.012
Su-0.012
 Jem-0.011
Token room
Token ID2119
Feature activation-4.12e+00

Pos logit contributions
 color+0.009
 few+0.008
 passenger+0.007
 sunny+0.007
"+0.007
Neg logit contributions
 breaths-0.014
 cries-0.013
cha-0.013
lers-0.012
ishly-0.012
Token more
Token ID517
Feature activation-2.42e-01

Pos logit contributions
Su+0.079
Pe+0.076
Be+0.072
Kim+0.072
 few+0.072
Neg logit contributions
 cries-0.132
 breaths-0.130
 wishes-0.116
 worries-0.116
lers-0.116
Token beautiful
Token ID4950
Feature activation-1.24e-01

Pos logit contributions
 few+0.004
 color+0.004
Su+0.004
"+0.004
J+0.004
Neg logit contributions
 cries-0.009
 breaths-0.008
cha-0.008
lers-0.008
ses-0.007
Token,"
Token ID553
Feature activation-1.26e+00

Pos logit contributions
 few+0.003
 color+0.002
"+0.002
Su+0.002
 passenger+0.002
Neg logit contributions
 cries-0.004
 breaths-0.004
cha-0.004
lers-0.004
ses-0.003
Token Mom
Token ID11254
Feature activation-1.54e+00

Pos logit contributions
 few+0.026
 color+0.024
"+0.023
Su+0.023
J+0.022
Neg logit contributions
 breaths-0.041
 cries-0.041
cha-0.037
lers-0.037
ses-0.034
Token said
Token ID531
Feature activation+2.57e-01

Pos logit contributions
 few+0.035
"+0.033
 color+0.033
Su+0.032
J+0.032
Neg logit contributions
 breaths-0.047
 cries-0.046
cha-0.043
lers-0.041
ses-0.040
Token the
Token ID262
Feature activation+1.43e-01

Pos logit contributions
 builder+0.006
 few+0.006
Su+0.005
 passenger+0.005
Pe+0.005
Neg logit contributions
 cries-0.009
 breaths-0.008
 tears-0.007
 wishes-0.007
lers-0.007
Token store
Token ID3650
Feature activation-1.51e+00

Pos logit contributions
 cries+0.004
lers+0.003
 breaths+0.003
cha+0.003
 wishes+0.003
Neg logit contributions
Su-0.003
J-0.003
 few-0.003
Sus-0.003
"-0.003
Token and
Token ID290
Feature activation-1.70e-01

Pos logit contributions
 few+0.048
Su+0.045
Be+0.044
Back+0.043
Pe+0.043
Neg logit contributions
 cries-0.033
 breaths-0.026
lers-0.025
ishly-0.024
cha-0.024
Token begged
Token ID32946
Feature activation+4.66e-01

Pos logit contributions
 few+0.004
Su+0.004
S+0.004
 passenger+0.004
Pe+0.004
Neg logit contributions
 cries-0.005
 breaths-0.004
cha-0.004
lers-0.004
 wishes-0.004
Token her
Token ID607
Feature activation-1.00e+00

Pos logit contributions
 cries+0.015
 breaths+0.013
cha+0.012
lers+0.012
 worries+0.011
Neg logit contributions
 few-0.010
 passenger-0.010
Su-0.010
 color-0.009
Grand-0.009
Token mom
Token ID1995
Feature activation-4.10e+00

Pos logit contributions
Su+0.024
 few+0.024
Be+0.023
Pe+0.023
Back+0.022
Neg logit contributions
 cries-0.027
 breaths-0.023
cha-0.022
lers-0.022
 wishes-0.022
Token to
Token ID284
Feature activation-2.39e+00

Pos logit contributions
Su+0.079
 few+0.073
Pe+0.073
 passenger+0.071
Grand+0.070
Neg logit contributions
 cries-0.135
lers-0.118
 breaths-0.118
cha-0.114
ishly-0.110
Token buy
Token ID2822
Feature activation+4.97e-01

Pos logit contributions
 few+0.058
cest+0.054
Su+0.054
J+0.053
 passenger+0.052
Neg logit contributions
 cries-0.064
 breaths-0.056
cha-0.055
ishly-0.054
 worries-0.049
Token it
Token ID340
Feature activation+2.70e-01

Pos logit contributions
 cries+0.015
 breaths+0.015
cha+0.013
lers+0.013
ses+0.012
Neg logit contributions
 few-0.011
 color-0.011
Su-0.010
"-0.010
J-0.010
Token for
Token ID329
Feature activation+6.16e-01

Pos logit contributions
 cries+0.005
 breaths+0.005
lers+0.004
cha+0.004
ches+0.003
Neg logit contributions
Be-0.009
Su-0.009
Pe-0.009
 few-0.009
Grand-0.008
Token her
Token ID607
Feature activation+1.44e+00

Pos logit contributions
 cries+0.020
 breaths+0.019
cha+0.018
lers+0.017
 worries+0.016
Neg logit contributions
 few-0.012
 passenger-0.012
 color-0.011
 builder-0.011
Su-0.011
Token badly
Token ID11234
Feature activation+3.93e-01

Pos logit contributions
 breaths+0.009
 cries+0.009
cha+0.008
lers+0.008
ses+0.008
Neg logit contributions
 few-0.005
 color-0.004
Su-0.004
"-0.004
J-0.004
Token.
Token ID13
Feature activation+8.34e-02

Pos logit contributions
 cries+0.011
 breaths+0.010
cha+0.009
lers+0.009
der+0.008
Neg logit contributions
 few-0.010
Su-0.010
J-0.010
Grand-0.010
 color-0.009
Token She
Token ID1375
Feature activation-1.50e-01

Pos logit contributions
 breaths+0.002
 cries+0.002
lers+0.002
cha+0.002
ses+0.002
Neg logit contributions
 few-0.002
 color-0.002
"-0.002
Su-0.002
 Jem-0.002
Token begged
Token ID32946
Feature activation+3.45e-01

Pos logit contributions
 few+0.003
 color+0.003
"+0.003
Su+0.003
J+0.003
Neg logit contributions
 breaths-0.005
 cries-0.005
cha-0.004
lers-0.004
ses-0.004
Token her
Token ID607
Feature activation-1.66e+00

Pos logit contributions
 cries+0.012
 breaths+0.011
cha+0.010
lers+0.010
 worries+0.010
Neg logit contributions
 few-0.006
 passenger-0.006
Su-0.006
 color-0.006
"-0.005
Token mom
Token ID1995
Feature activation-4.10e+00

Pos logit contributions
 few+0.038
Su+0.037
 color+0.035
 passenger+0.035
J+0.034
Neg logit contributions
 cries-0.050
 breaths-0.046
cha-0.042
lers-0.041
 worries-0.040
Token over
Token ID625
Feature activation-3.45e-01

Pos logit contributions
 few+0.057
Su+0.057
 color+0.054
J+0.053
 passenger+0.052
Neg logit contributions
 cries-0.157
 breaths-0.147
lers-0.140
cha-0.139
 worries-0.132
Token and
Token ID290
Feature activation-4.47e-01

Pos logit contributions
 passenger+0.008
Su+0.007
 color+0.007
 few+0.007
Kim+0.007
Neg logit contributions
 cries-0.010
 breaths-0.010
cha-0.009
der-0.008
lers-0.008
Token over
Token ID625
Feature activation+1.83e-01

Pos logit contributions
 few+0.012
 color+0.011
 sunny+0.011
cest+0.011
 passenger+0.011
Neg logit contributions
 cries-0.012
 breaths-0.011
ishly-0.010
cha-0.010
lers-0.009
Token again
Token ID757
Feature activation-1.36e+00

Pos logit contributions
 cries+0.005
 breaths+0.005
cha+0.005
lers+0.004
 worries+0.004
Neg logit contributions
 passenger-0.004
 few-0.004
 color-0.004
Su-0.004
 builder-0.004
Token,
Token ID11
Feature activation-1.35e+00

Pos logit contributions
 passenger+0.035
Su+0.034
Be+0.034
 few+0.034
Pe+0.033
Neg logit contributions
 cries-0.034
 breaths-0.032
cha-0.028
 worries-0.026
der-0.026
Token is
Token ID318
Feature activation+4.58e-02

Pos logit contributions
 few+0.055
Su+0.055
Grand+0.053
J+0.053
Pe+0.052
Neg logit contributions
 cries-0.041
 breaths-0.038
lers-0.035
cha-0.034
 worries-0.031
Token from
Token ID422
Feature activation+9.95e-01

Pos logit contributions
 cries+0.001
 breaths+0.001
 worries+0.001
cha+0.001
 doesn+0.001
Neg logit contributions
J-0.001
Su-0.001
S-0.001
Pe-0.001
Sus-0.001
Token a
Token ID257
Feature activation+5.20e-01

Pos logit contributions
 cries+0.033
 breaths+0.032
cha+0.029
lers+0.028
 worries+0.027
Neg logit contributions
 few-0.019
Su-0.019
J-0.019
S-0.018
Back-0.018
Token long
Token ID890
Feature activation-9.70e-01

Pos logit contributions
 cries+0.016
 breaths+0.015
cha+0.014
lers+0.013
 worries+0.013
Neg logit contributions
Su-0.012
J-0.011
S-0.010
 Jem-0.010
Back-0.010
Token time
Token ID640
Feature activation-8.93e-01

Pos logit contributions
Su+0.022
J+0.021
Grand+0.020
 Jem+0.020
Sus+0.020
Neg logit contributions
 cries-0.029
 breaths-0.029
lers-0.026
cha-0.024
 worries-0.023
Token ago
Token ID2084
Feature activation-4.09e+00

Pos logit contributions
Pe+0.023
Su+0.023
J+0.023
 few+0.023
Be+0.022
Neg logit contributions
 cries-0.023
 breaths-0.023
cha-0.019
lers-0.019
 worries-0.018
Token,
Token ID11
Feature activation-5.47e-01

Pos logit contributions
 few+0.098
Su+0.096
J+0.093
Be+0.093
Grand+0.092
Neg logit contributions
 cries-0.117
 breaths-0.111
lers-0.099
cha-0.097
ses-0.092
Token from
Token ID422
Feature activation+9.83e-01

Pos logit contributions
 few+0.013
Su+0.012
J+0.012
Pe+0.012
 passenger+0.012
Neg logit contributions
 cries-0.015
cha-0.014
 breaths-0.014
 doesn-0.013
 worries-0.012
Token when
Token ID618
Feature activation+1.11e-01

Pos logit contributions
 breaths+0.032
 cries+0.032
cha+0.029
lers+0.028
 worries+0.027
Neg logit contributions
 few-0.019
Su-0.019
J-0.017
"-0.017
S-0.017
Token dinosaurs
Token ID27199
Feature activation+5.02e-01

Pos logit contributions
 cries+0.002
 breaths+0.002
cha+0.002
 worries+0.002
lers+0.002
Neg logit contributions
Su-0.003
 few-0.003
 passenger-0.003
J-0.003
S-0.003
Token lived
Token ID5615
Feature activation-6.91e-01

Pos logit contributions
 cries+0.015
 breaths+0.014
cha+0.013
lers+0.013
 worries+0.011
Neg logit contributions
 few-0.012
Su-0.012
J-0.011
Grand-0.011
Sus-0.011
Token.
Token ID13
Feature activation+7.64e-02

Pos logit contributions
 few+0.002
Su+0.002
Pe+0.002
Grand+0.002
S+0.002
Neg logit contributions
 cries-0.002
 breaths-0.002
lers-0.002
cha-0.002
der-0.002
Token Tim
Token ID5045
Feature activation-3.57e-02

Pos logit contributions
 cries+0.002
 breaths+0.002
cha+0.002
lers+0.001
der+0.001
Neg logit contributions
 few-0.002
 color-0.002
J-0.002
 passenger-0.002
"-0.002
Tokenmy
Token ID1820
Feature activation+8.87e-02

Pos logit contributions
 few+0.001
Su+0.001
S+0.001
Pe+0.001
J+0.001
Neg logit contributions
 cries-0.001
 breaths-0.001
lers-0.001
cha-0.001
 worries-0.001
Token begged
Token ID32946
Feature activation+1.15e-01

Pos logit contributions
 cries+0.002
 breaths+0.002
lers+0.002
cha+0.002
 wishes+0.002
Neg logit contributions
Su-0.002
S-0.002
 few-0.002
 sunny-0.002
Jen-0.002
Token his
Token ID465
Feature activation-9.54e-01

Pos logit contributions
 cries+0.003
 breaths+0.003
 deeply+0.003
 worries+0.002
 wishes+0.002
Neg logit contributions
 passenger-0.003
Su-0.003
Pe-0.003
S-0.003
 few-0.003
Token parents
Token ID3397
Feature activation-4.08e+00

Pos logit contributions
 few+0.021
Su+0.021
J+0.020
S+0.020
"+0.019
Neg logit contributions
 cries-0.029
 breaths-0.028
cha-0.025
lers-0.025
 worries-0.024
Token to
Token ID284
Feature activation-2.32e+00

Pos logit contributions
Su+0.074
S+0.070
Pe+0.068
J+0.067
Jen+0.067
Neg logit contributions
 cries-0.138
 breaths-0.124
lers-0.122
cha-0.118
ishly-0.117
Token go
Token ID467
Feature activation-2.94e-01

Pos logit contributions
 few+0.057
cest+0.055
S+0.054
J+0.054
Su+0.052
Neg logit contributions
 cries-0.057
ishly-0.049
 breaths-0.049
cha-0.046
 worries-0.044
Token inside
Token ID2641
Feature activation-1.39e+00

Pos logit contributions
 color+0.007
Su+0.007
Ma+0.006
Be+0.006
 few+0.006
Neg logit contributions
 cries-0.008
 breaths-0.007
cha-0.007
lers-0.006
ishly-0.006
Token.
Token ID13
Feature activation-4.95e-01

Pos logit contributions
Pe+0.039
S+0.038
 passenger+0.038
Jen+0.038
Su+0.037
Neg logit contributions
 cries-0.033
 breaths-0.030
 deeply-0.026
cha-0.024
 tears-0.023
Token They
Token ID1119
Feature activation-9.78e-01

Pos logit contributions
 few+0.015
 passenger+0.014
S+0.014
Su+0.014
Pe+0.014
Neg logit contributions
 cries-0.011
 breaths-0.010
cha-0.009
idents-0.008
der-0.008
Token wanted
Token ID2227
Feature activation-4.78e-01

Pos logit contributions
 few+0.002
 color+0.002
"+0.002
 sunny+0.002
 passenger+0.002
Neg logit contributions
 breaths-0.005
 cries-0.005
cha-0.005
lers-0.004
der-0.004
Token to
Token ID284
Feature activation-1.25e+00

Pos logit contributions
Su+0.009
 passenger+0.009
S+0.009
J+0.009
"+0.009
Neg logit contributions
 cries-0.016
 breaths-0.014
lers-0.013
cha-0.013
 worries-0.013
Token go
Token ID467
Feature activation+5.30e-01

Pos logit contributions
 few+0.030
J+0.029
Su+0.029
S+0.028
 passenger+0.028
Neg logit contributions
 cries-0.035
 breaths-0.031
 wishes-0.028
cha-0.027
lers-0.027
Token on
Token ID319
Feature activation-1.38e+00

Pos logit contributions
 cries+0.018
 breaths+0.018
lers+0.016
cha+0.016
 worries+0.015
Neg logit contributions
 few-0.010
Su-0.009
J-0.009
 color-0.009
 Jem-0.009
Token a
Token ID257
Feature activation+1.27e+00

Pos logit contributions
 few+0.019
 color+0.016
Su+0.015
"+0.015
J+0.014
Neg logit contributions
 cries-0.054
 breaths-0.053
cha-0.049
lers-0.049
ses-0.047
Token wild
Token ID4295
Feature activation+2.36e+00

Pos logit contributions
 cries+0.038
 breaths+0.037
cha+0.034
lers+0.034
 worries+0.030
Neg logit contributions
 few-0.028
 color-0.026
Su-0.026
"-0.026
J-0.025
Token adventure
Token ID8855
Feature activation+2.98e-01

Pos logit contributions
 cries+0.067
 breaths+0.066
cha+0.060
lers+0.059
ses+0.054
Neg logit contributions
 few-0.057
 color-0.053
Su-0.052
"-0.052
J-0.051
Token.
Token ID13
Feature activation-8.86e-01

Pos logit contributions
 cries+0.008
 breaths+0.008
lers+0.007
cha+0.007
 worries+0.006
Neg logit contributions
 few-0.008
Su-0.007
 color-0.007
J-0.007
"-0.007
Token So
Token ID1406
Feature activation-6.95e-01

Pos logit contributions
 few+0.028
Su+0.027
J+0.026
S+0.026
Sus+0.026
Neg logit contributions
 cries-0.018
 breaths-0.016
cha-0.013
lers-0.013
 doesn-0.012
Token he
Token ID339
Feature activation+2.98e-01

Pos logit contributions
Su+0.015
J+0.014
Sus+0.014
 few+0.014
 passenger+0.014
Neg logit contributions
 breaths-0.020
 cries-0.020
cha-0.018
lers-0.018
 worries-0.017
Token asked
Token ID1965
Feature activation-9.95e-02

Pos logit contributions
 breaths+0.009
 cries+0.008
lers+0.008
cha+0.008
led+0.007
Neg logit contributions
 few-0.006
 Jem-0.006
Su-0.006
 color-0.006
J-0.006
Token.
Token ID13
Feature activation+4.58e-01

Pos logit contributions
 cries+0.003
 breaths+0.002
lers+0.002
cha+0.002
 worries+0.002
Neg logit contributions
 few-0.003
Su-0.002
 color-0.002
J-0.002
 Jem-0.002
Token You
Token ID921
Feature activation+1.06e+00

Pos logit contributions
 breaths+0.011
 cries+0.011
lers+0.010
cha+0.009
 worries+0.008
Neg logit contributions
 few-0.013
"-0.013
 color-0.013
Su-0.012
 Jem-0.012
Token did
Token ID750
Feature activation-8.05e-01

Pos logit contributions
 breaths+0.035
 cries+0.034
cha+0.032
lers+0.032
ses+0.029
Neg logit contributions
 few-0.021
 color-0.020
"-0.020
 passenger-0.018
J-0.018
Token the
Token ID262
Feature activation+1.48e-01

Pos logit contributions
 few+0.008
"+0.007
 color+0.007
 true+0.007
 ten+0.006
Neg logit contributions
 breaths-0.035
 cries-0.034
lers-0.033
ishly-0.032
cha-0.032
Token right
Token ID826
Feature activation-9.78e-01

Pos logit contributions
 cries+0.005
 breaths+0.005
cha+0.004
lers+0.004
ses+0.004
Neg logit contributions
 few-0.003
 color-0.003
Su-0.003
"-0.003
J-0.002
Token thing
Token ID1517
Feature activation+2.30e+00

Pos logit contributions
 few+0.016
Su+0.015
J+0.014
 color+0.014
"+0.014
Neg logit contributions
 cries-0.036
 breaths-0.035
lers-0.032
cha-0.032
ses-0.030
Token.
Token ID13
Feature activation+8.36e-01

Pos logit contributions
 cries+0.068
 breaths+0.068
cha+0.061
lers+0.060
ses+0.055
Neg logit contributions
 few-0.053
 color-0.049
Su-0.048
"-0.048
 passenger-0.046
Token You
Token ID921
Feature activation+9.67e-01

Pos logit contributions
 breaths+0.019
 cries+0.019
lers+0.017
cha+0.016
 worries+0.014
Neg logit contributions
 few-0.025
"-0.024
 color-0.024
Su-0.024
 Jem-0.023
Token showed
Token ID3751
Feature activation+3.79e-01

Pos logit contributions
 breaths+0.031
 cries+0.031
cha+0.028
lers+0.028
ses+0.026
Neg logit contributions
 few-0.019
 color-0.019
"-0.018
 passenger-0.017
J-0.017
Token the
Token ID262
Feature activation-5.35e-01

Pos logit contributions
 breaths+0.011
 cries+0.011
cha+0.010
lers+0.010
ses+0.009
Neg logit contributions
 few-0.009
 color-0.008
"-0.008
Su-0.007
J-0.007
Token owl
Token ID39610
Feature activation-7.57e-01

Pos logit contributions
 few+0.012
 color+0.011
"+0.011
 passenger+0.011
Su+0.011
Neg logit contributions
 breaths-0.016
 cries-0.016
cha-0.015
lers-0.014
ishly-0.013
TokenShe
Token ID3347
Feature activation+1.79e-02

Pos logit contributions
 few+0.002
 color+0.002
Su+0.002
 Jem+0.002
 Su+0.002
Neg logit contributions
 cries-0.004
cha-0.004
 breaths-0.004
lers-0.003
 screams-0.003
Token hears
Token ID23180
Feature activation+8.52e-01

Pos logit contributions
 breaths+0.001
edly+0.000
ishly+0.000
cha+0.000
lers+0.000
Neg logit contributions
 nine-0.000
"-0.000
 ten-0.000
 color-0.000
 blue-0.000
Token a
Token ID257
Feature activation+1.42e+00

Pos logit contributions
 breaths+0.033
 cries+0.032
lers+0.031
cha+0.031
ses+0.029
Neg logit contributions
 few-0.011
 color-0.010
J-0.009
 Jem-0.009
"-0.009
Token g
Token ID308
Feature activation+1.03e+00

Pos logit contributions
 cries+0.042
 breaths+0.040
cha+0.036
lers+0.036
ses+0.032
Neg logit contributions
 few-0.033
 color-0.031
Su-0.031
"-0.030
 Jem-0.030
Tokeniggle
Token ID24082
Feature activation+1.12e+00

Pos logit contributions
 breaths+0.019
cha+0.018
 cries+0.017
lers+0.016
ience+0.015
Neg logit contributions
Su-0.033
 color-0.033
 few-0.033
"-0.032
J-0.032
Token from
Token ID422
Feature activation+2.39e+00

Pos logit contributions
 cries+0.036
 breaths+0.035
cha+0.031
lers+0.031
ishly+0.028
Neg logit contributions
 few-0.024
 color-0.021
Su-0.021
 Jem-0.021
Grand-0.020
Token above
Token ID2029
Feature activation+6.57e-02

Pos logit contributions
lers+0.091
 cries+0.090
cha+0.088
 breaths+0.087
ience+0.087
Neg logit contributions
 few-0.029
 color-0.029
 ten-0.029
 sunny-0.027
 five-0.026
Token.
Token ID13
Feature activation+9.60e-01

Pos logit contributions
 cries+0.003
 breaths+0.002
cha+0.002
lers+0.002
ses+0.002
Neg logit contributions
 few-0.001
 color-0.001
"-0.001
Su-0.001
J-0.001
Token She
Token ID1375
Feature activation+3.31e-01

Pos logit contributions
 breaths+0.036
lers+0.035
 cries+0.034
cha+0.033
 worries+0.032
Neg logit contributions
 few-0.014
"-0.013
 Jem-0.012
 color-0.012
J-0.012
Token looks
Token ID3073
Feature activation+5.66e-01

Pos logit contributions
 breaths+0.010
ishly+0.009
edly+0.009
cha+0.009
lers+0.009
Neg logit contributions
"-0.007
 color-0.006
 few-0.006
 blue-0.006
 red-0.006
Token up
Token ID510
Feature activation+4.25e-01

Pos logit contributions
 breaths+0.017
 cries+0.017
lers+0.015
cha+0.015
ses+0.015
Neg logit contributions
 few-0.012
"-0.011
 color-0.011
 different-0.011
 blue-0.011
Token him
Token ID683
Feature activation-8.55e-01

Pos logit contributions
 breaths+0.034
 cries+0.032
lers+0.032
ses+0.032
cha+0.029
Neg logit contributions
 few-0.033
"-0.030
 color-0.027
 true-0.027
 different-0.026
Token that
Token ID326
Feature activation+8.86e-01

Pos logit contributions
 few+0.019
 color+0.017
Su+0.017
 passenger+0.017
J+0.017
Neg logit contributions
 cries-0.026
 breaths-0.026
cha-0.024
lers-0.023
 worries-0.021
Token even
Token ID772
Feature activation-1.07e+00

Pos logit contributions
 cries+0.027
 breaths+0.026
cha+0.024
lers+0.023
 worries+0.022
Neg logit contributions
 few-0.020
Su-0.018
 passenger-0.018
J-0.018
"-0.018
Token though
Token ID996
Feature activation+4.67e-01

Pos logit contributions
Su+0.019
 few+0.018
J+0.018
 passenger+0.018
 Jem+0.018
Neg logit contributions
 cries-0.037
 breaths-0.036
lers-0.033
cha-0.032
 worries-0.031
Token Tweet
Token ID18752
Feature activation+3.65e-01

Pos logit contributions
 cries+0.015
 breaths+0.015
cha+0.013
lers+0.013
 worries+0.012
Neg logit contributions
 few-0.010
 color-0.009
Su-0.009
"-0.009
J-0.008
Token wasn
Token ID2492
Feature activation+2.61e+00

Pos logit contributions
 breaths+0.011
 cries+0.011
cha+0.010
lers+0.010
ses+0.009
Neg logit contributions
 few-0.008
 color-0.008
"-0.007
Su-0.007
J-0.007
Token't
Token ID470
Feature activation+1.50e-01

Pos logit contributions
 cries+0.074
 breaths+0.074
cha+0.066
lers+0.066
ses+0.060
Neg logit contributions
 few-0.064
 color-0.058
"-0.058
Su-0.057
 passenger-0.055
Token there
Token ID612
Feature activation-1.25e+00

Pos logit contributions
 cries+0.004
 breaths+0.004
lers+0.004
 worries+0.004
cha+0.003
Neg logit contributions
J-0.004
Su-0.003
 passenger-0.003
Kim-0.003
Jen-0.003
Token,
Token ID11
Feature activation-1.86e-01

Pos logit contributions
J+0.028
Grand+0.027
 passenger+0.027
Su+0.027
 few+0.027
Neg logit contributions
 cries-0.038
 breaths-0.035
lers-0.033
cha-0.033
 worries-0.032
Token they
Token ID484
Feature activation-1.17e-01

Pos logit contributions
 few+0.004
 color+0.004
Su+0.003
"+0.003
J+0.003
Neg logit contributions
 cries-0.006
 breaths-0.006
cha-0.005
lers-0.005
ses-0.005
Token could
Token ID714
Feature activation-2.37e-01

Pos logit contributions
 few+0.002
 color+0.002
"+0.002
Su+0.002
 passenger+0.002
Neg logit contributions
 breaths-0.004
 cries-0.004
cha-0.003
lers-0.003
ses-0.003
Token Suddenly
Token ID24975
Feature activation-1.15e+00

Pos logit contributions
 few+0.014
 color+0.013
Su+0.013
J+0.013
 passenger+0.013
Neg logit contributions
 cries-0.014
 breaths-0.013
cha-0.012
lers-0.012
 worries-0.011
Token,
Token ID11
Feature activation-9.81e-01

Pos logit contributions
Su+0.027
J+0.026
S+0.025
Pe+0.024
 passenger+0.024
Neg logit contributions
 cries-0.033
 breaths-0.032
 worries-0.029
cha-0.029
lers-0.028
Token he
Token ID339
Feature activation-8.03e-01

Pos logit contributions
Su+0.023
 few+0.023
"+0.022
 passenger+0.022
 color+0.022
Neg logit contributions
 cries-0.026
 breaths-0.025
cha-0.023
lers-0.022
 worries-0.022
Token heard
Token ID2982
Feature activation-2.00e-01

Pos logit contributions
 few+0.016
Su+0.015
 passenger+0.014
 color+0.014
"+0.014
Neg logit contributions
 cries-0.027
 breaths-0.025
lers-0.024
cha-0.024
 worries-0.022
Token a
Token ID257
Feature activation+1.66e+00

Pos logit contributions
 few+0.004
"+0.004
Su+0.004
 passenger+0.004
 builder+0.004
Neg logit contributions
 cries-0.007
 breaths-0.006
cha-0.005
lers-0.005
 worries-0.005
Token strange
Token ID6283
Feature activation+3.00e+00

Pos logit contributions
 cries+0.052
 breaths+0.050
cha+0.046
lers+0.045
 worries+0.041
Neg logit contributions
 few-0.035
 color-0.033
Su-0.032
"-0.031
J-0.031
Token sound
Token ID2128
Feature activation+2.38e+00

Pos logit contributions
 breaths+0.083
 cries+0.081
cha+0.075
lers+0.074
ses+0.067
Neg logit contributions
 few-0.076
 color-0.071
"-0.068
Su-0.067
 sunny-0.064
Token coming
Token ID2406
Feature activation-4.51e-01

Pos logit contributions
 cries+0.076
 breaths+0.074
cha+0.068
lers+0.068
 worries+0.061
Neg logit contributions
 few-0.050
 color-0.045
Su-0.045
"-0.044
 passenger-0.043
Token from
Token ID422
Feature activation+2.71e+00

Pos logit contributions
 few+0.010
 color+0.010
J+0.010
Su+0.009
"+0.009
Neg logit contributions
 cries-0.014
 breaths-0.013
lers-0.011
cha-0.011
 screams-0.010
Token the
Token ID262
Feature activation+1.32e+00

Pos logit contributions
 cries+0.100
 breaths+0.098
cha+0.093
lers+0.092
ishly+0.086
Neg logit contributions
 few-0.043
 color-0.039
"-0.035
Su-0.035
 passenger-0.035
Token sky
Token ID6766
Feature activation-4.40e-01

Pos logit contributions
 cries+0.040
 breaths+0.038
cha+0.035
lers+0.035
ses+0.032
Neg logit contributions
 few-0.029
Su-0.028
"-0.027
J-0.027
S-0.026
Token and
Token ID290
Feature activation+2.60e-02

Pos logit contributions
 cries+0.001
 breaths+0.001
cha+0.001
lers+0.001
ses+0.001
Neg logit contributions
 few-0.001
 color-0.000
Su-0.000
"-0.000
J-0.000
Token wanted
Token ID2227
Feature activation+6.41e-02

Pos logit contributions
 cries+0.001
 breaths+0.001
cha+0.001
lers+0.001
 worries+0.001
Neg logit contributions
 few-0.001
 color-0.000
Su-0.000
"-0.000
 passenger-0.000
Token to
Token ID284
Feature activation-5.49e-01

Pos logit contributions
 cries+0.002
 breaths+0.002
cha+0.002
lers+0.002
 worries+0.002
Neg logit contributions
 few-0.001
 passenger-0.001
Su-0.001
Grand-0.001
 color-0.001
Token taste
Token ID6938
Feature activation-8.37e-02

Pos logit contributions
 few+0.012
 color+0.011
"+0.011
Su+0.011
 passenger+0.010
Neg logit contributions
 cries-0.017
 breaths-0.017
cha-0.016
lers-0.016
ses-0.014
Token it
Token ID340
Feature activation+7.66e-01

Pos logit contributions
 few+0.002
 color+0.001
"+0.001
Su+0.001
S+0.001
Neg logit contributions
 cries-0.003
 breaths-0.003
cha-0.003
lers-0.003
ses-0.003
Token.
Token ID13
Feature activation+2.46e-01

Pos logit contributions
 cries+0.022
 breaths+0.022
cha+0.019
lers+0.019
ses+0.017
Neg logit contributions
 few-0.019
 color-0.017
Su-0.017
"-0.017
 passenger-0.016
Token But
Token ID887
Feature activation+2.13e-01

Pos logit contributions
 breaths+0.009
 cries+0.008
lers+0.008
cha+0.008
 worries+0.008
Neg logit contributions
 few-0.004
"-0.004
The-0.004
S-0.004
Su-0.004
Token the
Token ID262
Feature activation-1.06e+00

Pos logit contributions
 cries+0.008
 breaths+0.008
cha+0.007
lers+0.007
 worries+0.007
Neg logit contributions
 few-0.003
Su-0.003
 color-0.003
J-0.003
 passenger-0.003
Token bird
Token ID6512
Feature activation+1.26e-01

Pos logit contributions
 few+0.020
 color+0.019
"+0.018
Be+0.016
S+0.016
Neg logit contributions
 cries-0.037
 breaths-0.037
lers-0.035
cha-0.034
ishly-0.031
Token knew
Token ID2993
Feature activation+3.11e-01

Pos logit contributions
 cries+0.004
 breaths+0.004
cha+0.003
lers+0.003
ses+0.003
Neg logit contributions
 few-0.003
 color-0.003
Su-0.002
"-0.002
J-0.002
Token it
Token ID340
Feature activation-6.64e-02

Pos logit contributions
 cries+0.009
 breaths+0.009
cha+0.008
lers+0.008
ses+0.008
Neg logit contributions
 few-0.007
 color-0.007
Su-0.006
"-0.006
J-0.006
Token and
Token ID290
Feature activation-1.32e-01

Pos logit contributions
 few+0.006
Su+0.005
 color+0.005
 Jem+0.005
 passenger+0.005
Neg logit contributions
 cries-0.007
 breaths-0.006
cha-0.006
lers-0.006
ishly-0.005
Token slide
Token ID10649
Feature activation-1.27e+00

Pos logit contributions
 few+0.003
 color+0.002
"+0.002
Su+0.002
 passenger+0.002
Neg logit contributions
 breaths-0.004
 cries-0.004
cha-0.004
lers-0.004
ses-0.004
Token.
Token ID13
Feature activation+3.79e-01

Pos logit contributions
 few+0.036
Su+0.036
Grand+0.035
 builder+0.035
 passenger+0.034
Neg logit contributions
 cries-0.029
 breaths-0.029
lers-0.027
cha-0.026
led-0.021
Token They
Token ID1119
Feature activation+4.92e-01

Pos logit contributions
 cries+0.011
 breaths+0.011
cha+0.010
lers+0.010
ses+0.009
Neg logit contributions
 few-0.009
 color-0.008
Su-0.008
"-0.008
J-0.008
Token see
Token ID766
Feature activation+4.55e-01

Pos logit contributions
 cries+0.015
 breaths+0.015
cha+0.014
lers+0.014
ses+0.013
Neg logit contributions
 few-0.011
 color-0.010
Su-0.009
"-0.009
 passenger-0.009
Token a
Token ID257
Feature activation+1.65e-01

Pos logit contributions
 cries+0.016
 breaths+0.016
lers+0.014
cha+0.014
ses+0.013
Neg logit contributions
 few-0.008
Su-0.007
"-0.007
J-0.007
 color-0.007
Token big
Token ID1263
Feature activation+6.99e-01

Pos logit contributions
 cries+0.006
cha+0.005
 breaths+0.005
lers+0.005
ses+0.005
Neg logit contributions
Su-0.003
 Jem-0.003
em-0.003
Sus-0.003
J-0.003
Token plane
Token ID6614
Feature activation+1.03e+00

Pos logit contributions
 cries+0.021
 breaths+0.020
lers+0.018
cha+0.018
ses+0.017
Neg logit contributions
 few-0.016
Su-0.015
J-0.015
Back-0.015
Sus-0.014
Token in
Token ID287
Feature activation-2.97e-01

Pos logit contributions
 cries+0.033
 breaths+0.032
cha+0.029
lers+0.029
ses+0.026
Neg logit contributions
 few-0.022
 color-0.020
Su-0.020
J-0.019
 Jem-0.019
Token the
Token ID262
Feature activation+1.86e-01

Pos logit contributions
 few+0.006
 builder+0.006
Su+0.005
ress+0.005
J+0.005
Neg logit contributions
 cries-0.009
 breaths-0.009
 wishes-0.008
lers-0.008
cha-0.008
Token sky
Token ID6766
Feature activation-8.04e-01

Pos logit contributions
 cries+0.005
 breaths+0.004
lers+0.004
 wishes+0.004
cha+0.004
Neg logit contributions
Su-0.005
 few-0.005
J-0.005
Sus-0.005
 Jem-0.005
Token was
Token ID373
Feature activation-5.76e-01

Pos logit contributions
 few+0.031
Su+0.028
 passenger+0.028
 color+0.027
Grand+0.027
Neg logit contributions
 cries-0.040
 breaths-0.038
cha-0.035
lers-0.035
 worries-0.032
Token a
Token ID257
Feature activation-4.48e-01

Pos logit contributions
 few+0.007
 color+0.006
Su+0.005
"+0.005
J+0.005
Neg logit contributions
 breaths-0.024
 cries-0.024
cha-0.022
lers-0.022
ishly-0.021
Token little
Token ID1310
Feature activation+7.60e-01

Pos logit contributions
Su+0.007
 few+0.007
Back+0.007
 Jem+0.006
em+0.006
Neg logit contributions
 cries-0.015
 breaths-0.014
cha-0.014
lers-0.013
ses-0.013
Token girl
Token ID2576
Feature activation+1.31e+00

Pos logit contributions
 cries+0.026
 breaths+0.026
lers+0.024
cha+0.023
ses+0.022
Neg logit contributions
 few-0.014
Su-0.013
 color-0.012
J-0.012
 Jem-0.012
Token named
Token ID3706
Feature activation-1.41e+00

Pos logit contributions
 cries+0.042
 breaths+0.041
lers+0.038
cha+0.038
ses+0.034
Neg logit contributions
 few-0.026
 color-0.023
Su-0.023
 Jem-0.023
 passenger-0.022
Token Lily
Token ID20037
Feature activation+1.02e+00

Pos logit contributions
 few+0.028
 color+0.026
Su+0.025
 passenger+0.025
"+0.024
Neg logit contributions
 cries-0.046
 breaths-0.045
cha-0.042
lers-0.041
ses-0.038
Token.
Token ID13
Feature activation-1.25e+00

Pos logit contributions
 breaths+0.034
 cries+0.032
cha+0.031
lers+0.029
 worries+0.028
Neg logit contributions
 few-0.021
 color-0.021
"-0.020
Su-0.019
Grand-0.019
Token She
Token ID1375
Feature activation+8.53e-01

Pos logit contributions
 few+0.027
 color+0.024
Su+0.024
"+0.023
 passenger+0.023
Neg logit contributions
 cries-0.039
 breaths-0.039
cha-0.035
lers-0.035
ses-0.032
Token loved
Token ID6151
Feature activation+1.14e+00

Pos logit contributions
 breaths+0.033
 cries+0.031
cha+0.029
lers+0.029
ishly+0.028
Neg logit contributions
 few-0.014
 color-0.014
"-0.013
Grand-0.012
 sunny-0.012
Token to
Token ID284
Feature activation-2.34e-01

Pos logit contributions
 cries+0.045
 breaths+0.044
lers+0.041
cha+0.041
 worries+0.038
Neg logit contributions
 few-0.015
Su-0.013
"-0.013
 passenger-0.013
J-0.013
Token play
Token ID711
Feature activation-1.54e-01

Pos logit contributions
 few+0.006
Su+0.006
J+0.005
 passenger+0.005
 Jem+0.005
Neg logit contributions
 cries-0.006
 breaths-0.006
cha-0.005
lers-0.005
 worries-0.005
Token need
Token ID761
Feature activation-3.35e-01

Pos logit contributions
 few+0.007
Su+0.007
Back+0.007
 color+0.006
 Jem+0.006
Neg logit contributions
 cries-0.006
 breaths-0.006
cha-0.005
lers-0.005
 wishes-0.005
Token to
Token ID284
Feature activation-1.76e+00

Pos logit contributions
Su+0.007
J+0.007
 passenger+0.007
 Jem+0.007
S+0.006
Neg logit contributions
 cries-0.011
 breaths-0.010
cha-0.009
 worries-0.009
lers-0.009
Token attach
Token ID10199
Feature activation-1.90e+00

Pos logit contributions
 few+0.040
J+0.039
Su+0.038
S+0.036
 Jem+0.036
Neg logit contributions
 cries-0.052
 breaths-0.050
cha-0.044
lers-0.043
 wishes-0.042
Token you
Token ID345
Feature activation-8.13e-01

Pos logit contributions
Su+0.036
 few+0.034
J+0.033
 passenger+0.033
Sus+0.032
Neg logit contributions
 cries-0.064
 breaths-0.063
lers-0.057
cha-0.056
 worries-0.053
Token to
Token ID284
Feature activation-1.39e+00

Pos logit contributions
Su+0.018
J+0.017
Sus+0.016
Jen+0.016
Grand+0.016
Neg logit contributions
 breaths-0.025
 cries-0.025
lers-0.024
cha-0.023
 worries-0.022
Token my
Token ID616
Feature activation+5.79e-01

Pos logit contributions
J+0.032
Su+0.031
Sus+0.030
S+0.028
Grand+0.027
Neg logit contributions
 cries-0.043
 breaths-0.038
 wishes-0.037
 worries-0.035
 tears-0.033
Token truck
Token ID7779
Feature activation-1.13e+00

Pos logit contributions
 cries+0.018
 breaths+0.018
lers+0.016
 worries+0.015
ses+0.015
Neg logit contributions
J-0.014
Su-0.014
Sus-0.013
 few-0.013
S-0.012
Token and
Token ID290
Feature activation-5.77e-01

Pos logit contributions
Su+0.031
J+0.031
Sus+0.030
Be+0.029
Pe+0.029
Neg logit contributions
 cries-0.028
 breaths-0.026
lers-0.024
 doesn-0.023
cha-0.022
Token take
Token ID1011
Feature activation+3.95e-01

Pos logit contributions
Su+0.013
 few+0.013
J+0.012
S+0.012
Sus+0.012
Neg logit contributions
 cries-0.018
 breaths-0.017
cha-0.015
lers-0.015
 worries-0.014
Token you
Token ID345
Feature activation-8.58e-01

Pos logit contributions
 cries+0.013
 breaths+0.013
lers+0.011
cha+0.011
 worries+0.010
Neg logit contributions
 few-0.008
Su-0.008
 color-0.008
J-0.007
 passenger-0.007
Token away
Token ID1497
Feature activation-1.95e+00

Pos logit contributions
Su+0.019
J+0.018
Jen+0.017
Sus+0.017
 Jem+0.017
Neg logit contributions
 cries-0.027
 breaths-0.027
lers-0.025
cha-0.023
 worries-0.022
Token name
Token ID1438
Feature activation+4.29e-02

Pos logit contributions
 cries+0.007
 breaths+0.006
cha+0.006
lers+0.006
ses+0.005
Neg logit contributions
 few-0.007
Su-0.007
 color-0.007
J-0.007
"-0.007
Token is
Token ID318
Feature activation+8.89e-01

Pos logit contributions
 cries+0.001
 breaths+0.001
cha+0.001
lers+0.001
 wishes+0.001
Neg logit contributions
 few-0.001
Su-0.001
J-0.001
 builder-0.001
 passenger-0.001
Token Thunder
Token ID9850
Feature activation+9.01e-02

Pos logit contributions
 cries+0.025
 breaths+0.024
cha+0.023
lers+0.022
 worries+0.020
Neg logit contributions
 few-0.022
 color-0.020
Su-0.020
 passenger-0.020
 builder-0.019
Token.
Token ID13
Feature activation+1.14e+00

Pos logit contributions
 cries+0.002
 breaths+0.002
lers+0.002
cha+0.002
ishly+0.002
Neg logit contributions
 few-0.002
J-0.002
Su-0.002
Back-0.002
 passenger-0.002
Token I
Token ID314
Feature activation+1.01e+00

Pos logit contributions
 cries+0.029
 breaths+0.028
cha+0.025
lers+0.024
ses+0.022
Neg logit contributions
 few-0.032
 color-0.030
Su-0.029
J-0.028
"-0.028
Tokenâ
Token ID22940
Feature activation+2.13e+00

Pos logit contributions
 breaths+0.036
 cries+0.035
cha+0.033
lers+0.032
ses+0.030
Neg logit contributions
 few-0.017
 color-0.016
"-0.015
Su-0.015
J-0.014
Token
Token ID26391
Feature activation+6.92e-01

Pos logit contributions
 cries+0.056
 breaths+0.056
lers+0.050
cha+0.049
ses+0.045
Neg logit contributions
 few-0.055
"-0.053
 color-0.052
Su-0.052
 Jem-0.050
Token
Token ID8151
Feature activation-5.09e-01

Pos logit contributions
 cries+0.021
 breaths+0.021
cha+0.019
lers+0.018
ses+0.017
Neg logit contributions
 few-0.016
"-0.015
 color-0.014
Su-0.014
J-0.014
Tokenm
Token ID76
Feature activation+4.48e-01

Pos logit contributions
 color+0.013
 few+0.012
 passenger+0.012
 builder+0.012
 groom+0.012
Neg logit contributions
 cries-0.014
cha-0.012
lers-0.012
ishly-0.011
 breaths-0.011
Token new
Token ID649
Feature activation-3.19e-01

Pos logit contributions
 cries+0.014
 breaths+0.013
lers+0.012
cha+0.012
 worries+0.011
Neg logit contributions
 few-0.010
 color-0.010
J-0.009
Su-0.009
 passenger-0.009
Token in
Token ID287
Feature activation-9.68e-02

Pos logit contributions
 few+0.006
J+0.006
Su+0.006
Back+0.006
 color+0.006
Neg logit contributions
 cries-0.010
 breaths-0.010
lers-0.009
cha-0.009
 worries-0.009
Token<|endoftext|>
Token ID50256
Feature activation+4.83e-01

Pos logit contributions
 breaths+0.044
 cries+0.044
cha+0.040
lers+0.040
ses+0.037
Neg logit contributions
 few-0.027
"-0.024
Su-0.024
 color-0.024
J-0.023
TokenOnce
Token ID7454
Feature activation-7.20e-01

Pos logit contributions
 cries+0.015
 breaths+0.015
cha+0.014
lers+0.013
ses+0.013
Neg logit contributions
 few-0.011
 color-0.010
"-0.010
Su-0.010
J-0.009
Token upon
Token ID2402
Feature activation+2.30e-02

Pos logit contributions
 few+0.009
 color+0.008
Su+0.007
"+0.007
 passenger+0.007
Neg logit contributions
 cries-0.029
 breaths-0.028
cha-0.027
lers-0.026
ses-0.025
Token a
Token ID257
Feature activation-3.94e-02

Pos logit contributions
 breaths+0.001
 cries+0.001
lers+0.001
cha+0.001
ishly+0.001
Neg logit contributions
 few-0.000
 color-0.000
"-0.000
J-0.000
Su-0.000
Token time
Token ID640
Feature activation-5.73e-02

Pos logit contributions
 few+0.001
"+0.000
 sunny+0.000
 color+0.000
S+0.000
Neg logit contributions
 breaths-0.002
 cries-0.002
lers-0.001
cha-0.001
ishly-0.001
Token there
Token ID612
Feature activation-1.31e+00

Pos logit contributions
 few+0.001
 passenger+0.001
 color+0.001
Su+0.001
Grand+0.001
Neg logit contributions
 cries-0.002
 breaths-0.002
cha-0.002
lers-0.002
 worries-0.002
Token was
Token ID373
Feature activation-4.62e-01

Pos logit contributions
 few+0.030
 passenger+0.027
Su+0.027
 color+0.026
Grand+0.026
Neg logit contributions
 cries-0.039
 breaths-0.037
cha-0.035
lers-0.034
 worries-0.031
Token a
Token ID257
Feature activation-2.98e-01

Pos logit contributions
 few+0.006
 color+0.005
Su+0.005
"+0.005
J+0.005
Neg logit contributions
 cries-0.018
 breaths-0.018
cha-0.017
lers-0.017
ses-0.016
Token tiny
Token ID7009
Feature activation+7.01e-01

Pos logit contributions
Su+0.004
 few+0.004
 Jem+0.004
Back+0.004
em+0.004
Neg logit contributions
 cries-0.010
 breaths-0.010
cha-0.009
lers-0.009
ses-0.009
Token pink
Token ID11398
Feature activation+1.75e+00

Pos logit contributions
 cries+0.021
 breaths+0.021
lers+0.019
cha+0.019
ses+0.017
Neg logit contributions
 few-0.015
Su-0.014
 color-0.014
J-0.014
"-0.014
Token boat
Token ID8848
Feature activation+9.72e-01

Pos logit contributions
 cries+0.052
 breaths+0.052
cha+0.047
lers+0.046
ses+0.042
Neg logit contributions
 few-0.040
 color-0.037
Su-0.036
"-0.036
 passenger-0.035
Token boy
Token ID2933
Feature activation+1.49e+00

Pos logit contributions
 cries+0.015
 breaths+0.015
lers+0.014
cha+0.014
ses+0.013
Neg logit contributions
 few-0.008
Su-0.008
Back-0.007
J-0.007
 color-0.007
Token named
Token ID3706
Feature activation-9.95e-01

Pos logit contributions
 cries+0.049
 breaths+0.047
lers+0.044
cha+0.043
ses+0.039
Neg logit contributions
 few-0.030
Su-0.027
 color-0.026
 Jem-0.026
Back-0.025
Token Tim
Token ID5045
Feature activation+5.78e-01

Pos logit contributions
 few+0.020
 color+0.018
"+0.018
Su+0.017
 Jem+0.017
Neg logit contributions
 breaths-0.034
 cries-0.033
lers-0.030
cha-0.030
ses-0.028
Tokenmy
Token ID1820
Feature activation+8.06e-01

Pos logit contributions
 cries+0.020
 breaths+0.020
lers+0.018
cha+0.018
ses+0.017
Neg logit contributions
 few-0.011
Su-0.009
 color-0.009
"-0.009
 Jem-0.009
Token.
Token ID13
Feature activation-8.20e-01

Pos logit contributions
 breaths+0.027
 cries+0.025
cha+0.025
lers+0.023
 worries+0.022
Neg logit contributions
 few-0.017
 color-0.016
"-0.016
 passenger-0.015
Grand-0.015
Token Tim
Token ID5045
Feature activation-5.95e-02

Pos logit contributions
 few+0.019
 color+0.017
"+0.017
Su+0.017
J+0.017
Neg logit contributions
 cries-0.024
 breaths-0.023
cha-0.021
lers-0.021
ses-0.019
Tokenmy
Token ID1820
Feature activation+1.35e-01

Pos logit contributions
 few+0.001
 color+0.001
"+0.001
Su+0.001
J+0.001
Neg logit contributions
 breaths-0.002
 cries-0.002
cha-0.002
lers-0.002
ses-0.002
Token was
Token ID373
Feature activation-2.86e-01

Pos logit contributions
 cries+0.004
 breaths+0.004
cha+0.004
lers+0.004
ses+0.004
Neg logit contributions
 few-0.003
 color-0.003
"-0.002
Su-0.002
 passenger-0.002
Token very
Token ID845
Feature activation-1.19e+00

Pos logit contributions
 few+0.005
 color+0.004
Su+0.004
"+0.004
J+0.004
Neg logit contributions
 cries-0.010
 breaths-0.010
lers-0.009
cha-0.009
ses-0.009
Token lonely
Token ID21757
Feature activation+5.92e-01

Pos logit contributions
 few+0.025
Su+0.024
Back+0.023
J+0.023
 color+0.023
Neg logit contributions
 cries-0.037
 breaths-0.035
cha-0.032
lers-0.032
ses-0.030
Token because
Token ID780
Feature activation+8.67e-02

Pos logit contributions
 cries+0.019
 breaths+0.018
cha+0.017
lers+0.017
ses+0.015
Neg logit contributions
 few-0.013
 color-0.012
"-0.011
Su-0.011
 passenger-0.011
Token to
Token ID284
Feature activation-8.97e-01

Pos logit contributions
 cries+0.035
 breaths+0.034
cha+0.032
lers+0.031
 worries+0.029
Neg logit contributions
 few-0.011
 color-0.009
Su-0.009
"-0.009
 passenger-0.009
Token.
Token ID13
Feature activation-1.60e-02

Pos logit contributions
 few+0.020
Su+0.019
J+0.018
 passenger+0.018
 Jem+0.018
Neg logit contributions
 cries-0.027
 breaths-0.026
cha-0.024
lers-0.023
 worries-0.022
Token He
Token ID679
Feature activation-5.98e-01

Pos logit contributions
 few+0.000
"+0.000
 color+0.000
Su+0.000
 Jem+0.000
Neg logit contributions
 breaths-0.000
 cries-0.000
lers-0.000
cha-0.000
ses-0.000
Token asked
Token ID1965
Feature activation-1.86e-02

Pos logit contributions
 few+0.013
 color+0.012
Su+0.012
"+0.012
 passenger+0.011
Neg logit contributions
 cries-0.018
 breaths-0.018
cha-0.017
lers-0.016
ses-0.015
Token again
Token ID757
Feature activation-2.38e+00

Pos logit contributions
Su+0.000
 few+0.000
 passenger+0.000
 color+0.000
Grand+0.000
Neg logit contributions
 cries-0.001
 breaths-0.001
cha-0.001
lers-0.001
 worries-0.001
Token and
Token ID290
Feature activation-1.51e-01

Pos logit contributions
Su+0.052
Pe+0.050
Kim+0.049
Back+0.048
 few+0.048
Neg logit contributions
 cries-0.067
 breaths-0.063
der-0.058
cha-0.057
 worries-0.055
Token finally
Token ID3443
Feature activation-1.31e+00

Pos logit contributions
 few+0.003
Su+0.003
 color+0.003
Grand+0.003
 passenger+0.003
Neg logit contributions
 cries-0.005
 breaths-0.005
cha-0.004
lers-0.004
ishly-0.004
Token,
Token ID11
Feature activation-7.43e-01

Pos logit contributions
Su+0.030
 few+0.028
Kim+0.028
Grand+0.027
Sus+0.027
Neg logit contributions
 cries-0.040
 breaths-0.038
cha-0.035
lers-0.034
 worries-0.033
Token the
Token ID262
Feature activation-1.11e+00

Pos logit contributions
Su+0.016
 few+0.015
Grand+0.015
 color+0.015
Sus+0.015
Neg logit contributions
 cries-0.022
 breaths-0.021
cha-0.019
ishly-0.019
lers-0.018
Token boss
Token ID6478
Feature activation-1.78e+00

Pos logit contributions
 few+0.021
 color+0.019
 passenger+0.018
"+0.017
Su+0.017
Neg logit contributions
 breaths-0.039
 cries-0.039
cha-0.036
lers-0.036
ishly-0.032
Tokeny
Token ID88
Feature activation-1.17e+00

Pos logit contributions
 few+0.030
Su+0.025
Grand+0.025
 passenger+0.024
Back+0.024
Neg logit contributions
 cries-0.064
 breaths-0.061
lers-0.060
cha-0.057
der-0.054
Token powerful
Token ID3665
Feature activation+3.14e-01

Pos logit contributions
 few+0.018
 color+0.017
Su+0.016
 passenger+0.016
"+0.016
Neg logit contributions
 cries-0.036
 breaths-0.036
cha-0.033
lers-0.033
 worries-0.030
Token the
Token ID262
Feature activation-1.64e+00

Pos logit contributions
 cries+0.010
 breaths+0.010
cha+0.009
lers+0.009
 worries+0.009
Neg logit contributions
 few-0.006
 color-0.006
Su-0.006
J-0.005
 passenger-0.005
Token stick
Token ID4859
Feature activation-1.63e-01

Pos logit contributions
 few+0.034
 color+0.031
Su+0.031
"+0.030
J+0.030
Neg logit contributions
 cries-0.053
 breaths-0.052
cha-0.047
lers-0.047
ses-0.043
Token had
Token ID550
Feature activation-1.07e+00

Pos logit contributions
 few+0.004
 color+0.003
Su+0.003
"+0.003
J+0.003
Neg logit contributions
 cries-0.005
 breaths-0.005
lers-0.004
cha-0.004
 worries-0.004
Token been
Token ID587
Feature activation-7.38e-01

Pos logit contributions
 passenger+0.025
 few+0.025
Grand+0.025
Su+0.024
S+0.024
Neg logit contributions
 cries-0.030
 breaths-0.029
 worries-0.025
lers-0.025
cha-0.024
Token,
Token ID11
Feature activation-4.18e-01

Pos logit contributions
 passenger+0.020
Grand+0.020
Kim+0.020
Pe+0.019
Su+0.019
Neg logit contributions
 cries-0.018
 breaths-0.018
lers-0.016
 worries-0.015
cha-0.014
Token and
Token ID290
Feature activation+1.17e-01

Pos logit contributions
 few+0.009
J+0.009
 passenger+0.009
Su+0.008
 color+0.008
Neg logit contributions
 cries-0.013
 breaths-0.012
cha-0.011
lers-0.011
 worries-0.010
Token how
Token ID703
Feature activation-8.77e-01

Pos logit contributions
 cries+0.003
 breaths+0.003
cha+0.003
lers+0.003
 worries+0.003
Neg logit contributions
 few-0.003
 color-0.003
Su-0.003
 passenger-0.003
J-0.003
Token it
Token ID340
Feature activation+8.75e-01

Pos logit contributions
 few+0.018
 color+0.016
Su+0.016
 passenger+0.016
Grand+0.015
Neg logit contributions
 cries-0.028
 breaths-0.028
cha-0.026
lers-0.025
 worries-0.023
Token had
Token ID550
Feature activation-1.02e+00

Pos logit contributions
 cries+0.031
 breaths+0.030
cha+0.028
lers+0.027
ses+0.027
Neg logit contributions
 few-0.015
 color-0.014
"-0.013
 builder-0.013
Su-0.012
Token made
Token ID925
Feature activation+5.58e-01

Pos logit contributions
 few+0.023
Grand+0.023
 passenger+0.023
Su+0.023
"+0.022
Neg logit contributions
 cries-0.030
 breaths-0.029
 worries-0.026
lers-0.025
cha-0.025
Token This
Token ID770
Feature activation-6.25e-01

Pos logit contributions
 cries+0.012
 breaths+0.011
cha+0.010
lers+0.010
ses+0.009
Neg logit contributions
 few-0.011
 color-0.010
Su-0.010
"-0.010
J-0.010
Token is
Token ID318
Feature activation+4.50e-01

Pos logit contributions
 few+0.017
 color+0.016
Su+0.015
"+0.015
 passenger+0.015
Neg logit contributions
 cries-0.016
 breaths-0.016
cha-0.014
lers-0.014
ses-0.013
Token my
Token ID616
Feature activation+5.20e-01

Pos logit contributions
 cries+0.015
 breaths+0.014
cha+0.013
lers+0.013
 worries+0.012
Neg logit contributions
 few-0.009
Su-0.008
 passenger-0.008
 color-0.007
J-0.007
Token family
Token ID1641
Feature activation-1.59e-01

Pos logit contributions
 cries+0.017
 breaths+0.017
cha+0.016
lers+0.016
ses+0.014
Neg logit contributions
 few-0.010
 color-0.009
Su-0.009
"-0.009
 passenger-0.009
Token.
Token ID13
Feature activation+9.38e-01

Pos logit contributions
 few+0.004
 color+0.004
Su+0.004
J+0.004
"+0.004
Neg logit contributions
 cries-0.004
 breaths-0.004
cha-0.004
lers-0.004
 worries-0.003
Token This
Token ID770
Feature activation-3.89e-01

Pos logit contributions
 cries+0.026
 breaths+0.025
cha+0.023
lers+0.023
ses+0.020
Neg logit contributions
 few-0.024
 color-0.022
Su-0.022
"-0.022
 passenger-0.021
Token is
Token ID318
Feature activation+3.82e-01

Pos logit contributions
 few+0.011
Su+0.010
 color+0.010
J+0.010
"+0.010
Neg logit contributions
 cries-0.010
 breaths-0.010
lers-0.009
cha-0.009
ses-0.008
Token my
Token ID616
Feature activation+5.25e-01

Pos logit contributions
 cries+0.013
 breaths+0.012
cha+0.012
lers+0.011
 worries+0.011
Neg logit contributions
 few-0.007
Su-0.006
J-0.006
 passenger-0.006
 color-0.006
Token mom
Token ID1995
Feature activation-2.54e-01

Pos logit contributions
 cries+0.018
 breaths+0.018
cha+0.016
lers+0.016
ses+0.015
Neg logit contributions
 few-0.010
 color-0.009
Su-0.009
"-0.009
 passenger-0.008
Token,
Token ID11
Feature activation+7.30e-01

Pos logit contributions
 few+0.006
Su+0.006
J+0.006
Back+0.006
 Jem+0.006
Neg logit contributions
 cries-0.007
 breaths-0.007
lers-0.007
cha-0.006
 worries-0.006
Token my
Token ID616
Feature activation+4.16e-02

Pos logit contributions
 cries+0.021
 breaths+0.021
cha+0.019
lers+0.019
ses+0.017
Neg logit contributions
 few-0.017
 color-0.015
Su-0.015
J-0.015
"-0.015
Token.
Token ID13
Feature activation+3.30e-01

Pos logit contributions
 few+0.003
Su+0.003
J+0.003
Back+0.003
Be+0.003
Neg logit contributions
 cries-0.003
 breaths-0.003
lers-0.003
cha-0.003
ishly-0.002
Token But
Token ID887
Feature activation+9.07e-01

Pos logit contributions
 breaths+0.009
 cries+0.009
lers+0.008
cha+0.008
 worries+0.007
Neg logit contributions
 few-0.008
"-0.008
 color-0.008
Su-0.008
 Jem-0.008
Token you
Token ID345
Feature activation+3.61e-01

Pos logit contributions
 cries+0.025
 breaths+0.025
cha+0.022
lers+0.022
ses+0.020
Neg logit contributions
 few-0.023
 color-0.021
Su-0.020
"-0.020
 Jem-0.020
Token need
Token ID761
Feature activation+2.25e-01

Pos logit contributions
 breaths+0.012
 cries+0.012
cha+0.011
lers+0.011
ses+0.010
Neg logit contributions
 few-0.007
 color-0.007
"-0.007
 tidy-0.006
Su-0.006
Token<|endoftext|>
Token ID50256
Feature activation+1.10e+00

Pos logit contributions
 cries+0.009
 breaths+0.008
cha+0.008
lers+0.008
 worries+0.007
Neg logit contributions
Su-0.003
 passenger-0.003
 few-0.003
J-0.003
 Jem-0.003
Token"
Token ID1
Feature activation-2.29e+00

Pos logit contributions
 cries+0.034
 breaths+0.034
cha+0.030
lers+0.030
ses+0.029
Neg logit contributions
 few-0.024
"-0.023
 color-0.022
Su-0.022
J-0.022
TokenMom
Token ID29252
Feature activation-2.15e+00

Pos logit contributions
 few+0.047
cest+0.044
 passenger+0.044
 tow+0.044
Back+0.041
Neg logit contributions
 cries-0.059
cha-0.056
 breaths-0.055
ted-0.052
lers-0.052
Tokenmy
Token ID1820
Feature activation-1.69e-01

Pos logit contributions
Su+0.053
Pe+0.051
J+0.051
 passenger+0.050
 few+0.049
Neg logit contributions
 cries-0.058
 breaths-0.054
lers-0.051
 doesn-0.047
cha-0.046
Token,
Token ID11
Feature activation-1.29e+00

Pos logit contributions
 few+0.003
Su+0.003
 color+0.003
J+0.003
 passenger+0.003
Neg logit contributions
 cries-0.006
 breaths-0.006
cha-0.006
lers-0.005
ishly-0.005
Token what
Token ID644
Feature activation-1.21e+00

Pos logit contributions
 few+0.029
Su+0.028
Back+0.027
S+0.027
Pe+0.027
Neg logit contributions
 cries-0.035
cha-0.032
 breaths-0.030
 doesn-0.030
lers-0.028
Token's
Token ID338
Feature activation+1.07e+00

Pos logit contributions
 few+0.029
 passenger+0.029
Su+0.028
Pe+0.026
Kim+0.026
Neg logit contributions
 cries-0.034
 breaths-0.028
lers-0.028
cha-0.028
 doesn-0.028
Token fun
Token ID1257
Feature activation+8.65e-01

Pos logit contributions
 few+0.015
Su+0.014
 passenger+0.013
J+0.013
ress+0.013
Neg logit contributions
 cries-0.029
 breaths-0.027
cha-0.026
lers-0.025
 worries-0.024
Token.
Token ID13
Feature activation-4.84e-01

Pos logit contributions
 cries+0.024
 breaths+0.024
cha+0.022
lers+0.021
 worries+0.019
Neg logit contributions
 few-0.021
Su-0.020
 color-0.020
 passenger-0.019
J-0.019
Token He
Token ID679
Feature activation-1.60e-01

Pos logit contributions
 few+0.013
 color+0.012
Su+0.012
"+0.012
J+0.012
Neg logit contributions
 cries-0.013
 breaths-0.012
cha-0.011
lers-0.011
ses-0.010
Token asked
Token ID1965
Feature activation-3.53e-01

Pos logit contributions
 few+0.004
Su+0.004
 Jem+0.003
 color+0.003
"+0.003
Neg logit contributions
 cries-0.005
 breaths-0.004
lers-0.004
cha-0.004
 worries-0.004
Token his
Token ID465
Feature activation-5.54e-01

Pos logit contributions
Su+0.010
Che+0.009
Sus+0.009
spe+0.009
Pe+0.009
Neg logit contributions
 cries-0.009
 breaths-0.009
 wishes-0.007
 tears-0.007
 thoughts-0.007
Token dad
Token ID9955
Feature activation-2.43e+00

Pos logit contributions
 few+0.013
Su+0.012
Back+0.012
Sus+0.011
Be+0.011
Neg logit contributions
 cries-0.016
 breaths-0.016
lers-0.014
cha-0.014
 worries-0.013
Token if
Token ID611
Feature activation-3.04e-01

Pos logit contributions
Su+0.060
Pe+0.055
Kim+0.053
Jen+0.053
Be+0.052
Neg logit contributions
 cries-0.068
 breaths-0.059
lers-0.059
cha-0.059
 doesn-0.057
Token he
Token ID339
Feature activation-2.49e-01

Pos logit contributions
 few+0.009
Su+0.009
 color+0.008
 passenger+0.008
Sus+0.008
Neg logit contributions
 cries-0.007
cha-0.006
 breaths-0.006
lers-0.005
 worries-0.005
Token could
Token ID714
Feature activation-1.32e+00

Pos logit contributions
 few+0.006
 color+0.006
Su+0.006
"+0.006
 passenger+0.006
Neg logit contributions
 cries-0.007
 breaths-0.007
cha-0.006
lers-0.006
ses-0.005
Token have
Token ID423
Feature activation-1.05e+00

Pos logit contributions
J+0.037
 few+0.036
Su+0.035
S+0.034
Sus+0.034
Neg logit contributions
 cries-0.032
 breaths-0.026
 wishes-0.024
cha-0.024
 worries-0.023
Token a
Token ID257
Feature activation-1.60e-01

Pos logit contributions
Su+0.022
 builder+0.022
J+0.021
spe+0.021
 Jem+0.021
Neg logit contributions
 cries-0.032
 breaths-0.031
lers-0.028
 wishes-0.027
 worries-0.027
Token seen
Token ID1775
Feature activation+1.09e+00

Pos logit contributions
Su+0.021
 passenger+0.020
 few+0.019
S+0.019
Kim+0.019
Neg logit contributions
 cries-0.030
 breaths-0.027
cha-0.025
 worries-0.025
 wishes-0.025
Token her
Token ID607
Feature activation+8.09e-01

Pos logit contributions
 cries+0.038
 breaths+0.037
cha+0.034
lers+0.034
 worries+0.032
Neg logit contributions
 few-0.020
Su-0.017
 color-0.017
"-0.017
J-0.017
Token appear
Token ID1656
Feature activation+1.92e-02

Pos logit contributions
 cries+0.025
 breaths+0.023
lers+0.021
cha+0.021
 worries+0.020
Neg logit contributions
Su-0.018
 few-0.018
J-0.018
Sus-0.017
Be-0.017
Token in
Token ID287
Feature activation-4.28e-01

Pos logit contributions
 cries+0.000
 breaths+0.000
lers+0.000
cha+0.000
led+0.000
Neg logit contributions
Su-0.001
 few-0.000
Pe-0.000
 passenger-0.000
Kim-0.000
Token the
Token ID262
Feature activation-5.28e-01

Pos logit contributions
Su+0.008
 few+0.007
 builder+0.007
J+0.007
 passenger+0.007
Neg logit contributions
 cries-0.015
 breaths-0.014
cha-0.012
lers-0.012
 wishes-0.012
Token market
Token ID1910
Feature activation-2.79e+00

Pos logit contributions
Su+0.013
J+0.012
Sus+0.011
 few+0.011
Kim+0.011
Neg logit contributions
 cries-0.015
 breaths-0.015
cha-0.013
lers-0.013
 wishes-0.012
Token.
Token ID13
Feature activation+2.28e-01

Pos logit contributions
Su+0.085
 few+0.081
Sus+0.080
Be+0.079
J+0.079
Neg logit contributions
 cries-0.062
 breaths-0.053
lers-0.050
cha-0.049
der-0.045
Token She
Token ID1375
Feature activation-4.13e-01

Pos logit contributions
 cries+0.007
 breaths+0.007
cha+0.006
lers+0.006
ses+0.006
Neg logit contributions
 few-0.005
 color-0.005
Su-0.005
"-0.005
J-0.004
Token looked
Token ID3114
Feature activation-5.48e-01

Pos logit contributions
 few+0.009
 color+0.009
Su+0.008
"+0.008
 passenger+0.008
Neg logit contributions
 cries-0.013
 breaths-0.012
cha-0.011
lers-0.011
ses-0.010
Token fit
Token ID4197
Feature activation+2.15e-01

Pos logit contributions
 few+0.013
 color+0.011
"+0.011
Su+0.011
 Jem+0.011
Neg logit contributions
 cries-0.016
 breaths-0.016
cha-0.015
lers-0.015
ses-0.014
Token and
Token ID290
Feature activation-9.49e-01

Pos logit contributions
 breaths+0.006
 cries+0.006
cha+0.005
lers+0.005
led+0.005
Neg logit contributions
 few-0.005
Su-0.005
 passenger-0.005
Be-0.005
Pe-0.005
Token,
Token ID11
Feature activation-5.97e-01

Pos logit contributions
Su+0.007
 few+0.007
 color+0.006
 passenger+0.006
Kim+0.006
Neg logit contributions
 cries-0.010
cha-0.010
 breaths-0.009
lers-0.008
 screams-0.008
Token Tim
Token ID5045
Feature activation+5.09e-01

Pos logit contributions
 few+0.013
Su+0.012
 color+0.012
 passenger+0.012
Grand+0.012
Neg logit contributions
 cries-0.017
 breaths-0.016
cha-0.015
lers-0.014
 worries-0.014
Tokenmy
Token ID1820
Feature activation+3.09e-01

Pos logit contributions
 cries+0.017
 breaths+0.016
lers+0.015
cha+0.014
 worries+0.013
Neg logit contributions
 few-0.011
Su-0.010
 color-0.009
Sus-0.009
J-0.009
Token told
Token ID1297
Feature activation+6.30e-01

Pos logit contributions
 cries+0.010
 breaths+0.009
lers+0.009
cha+0.008
 worries+0.008
Neg logit contributions
 few-0.007
Su-0.007
 color-0.006
J-0.006
Sus-0.006
Token his
Token ID465
Feature activation-3.05e-01

Pos logit contributions
 cries+0.024
 breaths+0.024
cha+0.023
lers+0.023
ses+0.022
Neg logit contributions
 few-0.009
 color-0.007
Su-0.007
 Jem-0.007
"-0.007
Token mom
Token ID1995
Feature activation-2.93e+00

Pos logit contributions
 few+0.005
 color+0.004
 true+0.004
"+0.004
 builder+0.004
Neg logit contributions
 breaths-0.012
 cries-0.011
cha-0.011
lers-0.011
ishly-0.010
Tokenmy
Token ID1820
Feature activation-2.46e+00

Pos logit contributions
 few+0.072
Su+0.069
 passenger+0.067
 color+0.065
J+0.065
Neg logit contributions
 cries-0.082
 breaths-0.075
lers-0.071
cha-0.069
 worries-0.065
Token what
Token ID644
Feature activation-1.36e-01

Pos logit contributions
 few+0.065
Su+0.061
 color+0.061
 passenger+0.059
J+0.059
Neg logit contributions
 cries-0.065
 breaths-0.062
cha-0.056
lers-0.056
 worries-0.050
Token he
Token ID339
Feature activation+4.55e-03

Pos logit contributions
 few+0.004
 passenger+0.004
Su+0.004
Back+0.004
While+0.004
Neg logit contributions
 cries-0.003
cha-0.003
lers-0.002
 breaths-0.002
 wishes-0.002
Token did
Token ID750
Feature activation-5.77e-01

Pos logit contributions
 breaths+0.000
 cries+0.000
lers+0.000
cha+0.000
ishly+0.000
Neg logit contributions
"-0.000
 color-0.000
 few-0.000
 true-0.000
J-0.000
Token and
Token ID290
Feature activation-3.23e-01

Pos logit contributions
 passenger+0.015
 few+0.014
Pe+0.014
Su+0.014
Grand+0.013
Neg logit contributions
 cries-0.016
 breaths-0.014
cha-0.013
 doesn-0.013
 wishes-0.012
Token the
Token ID262
Feature activation-1.13e+00

Pos logit contributions
 passenger+0.005
Su+0.004
Back+0.004
 builder+0.004
cest+0.004
Neg logit contributions
 cries-0.007
 breaths-0.006
lers-0.006
cha-0.006
 worries-0.005
Token little
Token ID1310
Feature activation-1.00e+00

Pos logit contributions
 few+0.010
 color+0.009
"+0.007
 tidy+0.006
 sunny+0.006
Neg logit contributions
 breaths-0.052
 cries-0.051
lers-0.049
cha-0.048
ses-0.047
Token girl
Token ID2576
Feature activation+1.68e-02

Pos logit contributions
 few+0.015
Su+0.013
 color+0.013
"+0.013
J+0.012
Neg logit contributions
 cries-0.038
 breaths-0.038
cha-0.035
lers-0.034
 worries-0.032
Token asked
Token ID1965
Feature activation+7.79e-02

Pos logit contributions
 cries+0.001
 breaths+0.000
lers+0.000
cha+0.000
 worries+0.000
Neg logit contributions
 few-0.000
Su-0.000
J-0.000
Sus-0.000
Back-0.000
Token her
Token ID607
Feature activation-4.60e-01

Pos logit contributions
 cries+0.002
 breaths+0.002
cha+0.002
 wishes+0.002
ted+0.002
Neg logit contributions
Su-0.002
 passenger-0.002
cest-0.002
Jen-0.002
Kim-0.002
Token mom
Token ID1995
Feature activation-2.69e+00

Pos logit contributions
 few+0.010
Su+0.010
 color+0.009
 passenger+0.009
J+0.009
Neg logit contributions
 cries-0.014
 breaths-0.014
cha-0.012
lers-0.012
 worries-0.012
Token for
Token ID329
Feature activation-5.28e-02

Pos logit contributions
Su+0.065
Pe+0.059
Sus+0.057
Kim+0.057
Jen+0.057
Neg logit contributions
 cries-0.076
 breaths-0.069
lers-0.067
cha-0.063
 doesn-0.063
Token a
Token ID257
Feature activation+2.05e-01

Pos logit contributions
Su+0.001
cest+0.001
S+0.001
 passenger+0.001
Jen+0.001
Neg logit contributions
 cries-0.001
 breaths-0.001
cha-0.001
 worries-0.001
 wishes-0.001
Token violin
Token ID38283
Feature activation+7.33e-01

Pos logit contributions
 cries+0.005
 breaths+0.005
 worries+0.004
lers+0.004
cha+0.004
Neg logit contributions
Su-0.006
Jen-0.006
Kim-0.005
S-0.005
 Jem-0.005
Token,
Token ID11
Feature activation-1.37e+00

Pos logit contributions
 cries+0.022
 breaths+0.021
lers+0.019
cha+0.019
ishly+0.018
Neg logit contributions
Su-0.016
 few-0.016
J-0.015
Back-0.015
Sus-0.015
Token but
Token ID475
Feature activation-6.59e-01

Pos logit contributions
spe+0.034
 passenger+0.033
cest+0.033
J+0.033
 few+0.033
Neg logit contributions
 cries-0.036
 breaths-0.030
cha-0.028
der-0.028
 doesn-0.027