TOP ACTIVATIONS
MAX = 15.149

fun . Lily realized that pic n ics were not so
to be careful when having pic n ics and not to
. And that 's why Tues days are so important .
be there to support her pic n ics -- no matter
. But Sam squee zes the dinosaur too hard
food . The st eg osaurus came out of
the park , and have pic n ics outside .
They missed their walks and pic n ics near the cliff
." L ily squee zes his hand . "
roof . They would have pic n ics and play games
went on adventures , had pic n ics , and even
the corn field . The st alks looked happy to see
soon nothing but a small spe ck in the sky .
their salad and have happy pic n ics together . <|endoftext|>
favourite place to go for pic n ics . <|endoftext|> Once
and nodded . He loved pic n ics and couldn 't
the warm sunshine and having pic n ics with her family
see the bird and have pic n ics . <|endoftext|> Once
liked to play and have pic n ics . One day
stripes . I have a st inger . I can make

MIN ACTIVATIONS
MIN = -9.168

excited and wanted to take photographs of everything around her .
came back home empty - handed and told Lily the news
he was caught red - handed . But he didn 't
and the trees looked gre ener . Finally ,
the park . Dad showed mom and the 3 year old
grown to be even gre ener and brighter than before .
and Ben smiled and hugged mom . " We feel happy
better ! They were gre ener and taller than before .
walked around and gave each person something . He gave Mia
the car Jim started to feel better . His mother was
seem to make her tie feel better . She started to
idea . She hugged each person and animal .
They felt better and hugged mom . " Thank
her in her dance and feel happy and free and alive
, special notes and even photographs . Every time she looked
's tum my started to feel funny . He was thirsty
B reat he deeply to feel calm ", or " Sm
on the bed and gave mom my the picture and the
to pick it and show mom . She pulls and pulls
back home , empty - handed . Nature was

INTERVAL 9.070 - 15.149
CONTAINS 0.104%

leaves . They saw a sk unk come out of the
joined back together and the tw ig was once again a
. It 's mine . Go play with your doll ,"
the handle from the wet p uddle and gave it back
They see a j olly man . He has

INTERVAL 2.991 - 9.070
CONTAINS 8.478%

sad that she had to go away but was happy because
It was perched from a nearby branch , watching her twist
happy too . They like to play with the big ball
it there . From that day onwards , he made sure
having fun . But next time , please tell us where

INTERVAL -3.088 - 2.991
CONTAINS 85.005%

the woman said . Lu cy smiled and decided
was going on pic n ics ! On this
girl said , " I want to find a new way
fall down from the tree ." Lily said , " I
snakes are dangerous , but some are not .

INTERVAL -9.168 - -3.088
CONTAINS 6.413%

She had a favorite belt that was orange . Lily wore
phone options . She was overwhelmed ! Finally
't mean to hurt your feelings . Let 's forgive him
. She remembered that her mummy and daddy had asked her
, and to think of others too . So , he
Token fun
Token ID1257
Feature activation+3.57e+00

Pos logit contributions
 curtains+0.021
 Some+0.020
edd+0.020
 pieces+0.020
 many+0.019
Neg logit contributions
ah-0.021
gh-0.019
 repeat-0.018
Deal-0.018
iding-0.018
Token.
Token ID13
Feature activation+4.55e-01

Pos logit contributions
ah+0.178
gh+0.163
 repeat+0.162
Deal+0.159
iding+0.155
Neg logit contributions
 many-0.069
 curtains-0.068
 pieces-0.068
 of-0.066
 Some-0.065
Token Lily
Token ID20037
Feature activation-6.21e-01

Pos logit contributions
ah+0.022
gh+0.021
 repeat+0.020
Deal+0.020
iding+0.019
Neg logit contributions
 curtains-0.009
 Some-0.008
 pieces-0.008
edd-0.008
 many-0.008
Token realized
Token ID6939
Feature activation-4.59e+00

Pos logit contributions
 curtains+0.012
 many+0.011
 pieces+0.011
 Some+0.011
+0.011
Neg logit contributions
ah-0.030
gh-0.029
iding-0.028
Deal-0.028
 repeat-0.027
Token that
Token ID326
Feature activation-2.16e+00

Pos logit contributions
 curtains+0.045
 Some+0.043
edd+0.037
 pieces+0.037
 many+0.037
Neg logit contributions
ah-0.269
gh-0.255
 repeat-0.249
Deal-0.246
iding-0.243
Token pic
Token ID8301
Feature activation+1.51e+01

Pos logit contributions
 curtains+0.048
 Some+0.047
edd+0.045
 pieces+0.044
 many+0.044
Neg logit contributions
ah-0.098
gh-0.092
 repeat-0.090
Deal-0.088
iding-0.086
Tokenn
Token ID77
Feature activation+6.67e+00

Pos logit contributions
ah+0.349
 repeat+0.284
gh+0.276
iding+0.272
 pose+0.262
Neg logit contributions
edd-0.656
 many-0.653
-0.647
 Some-0.644
 curtains-0.640
Tokenics
Token ID873
Feature activation-1.92e-01

Pos logit contributions
ah+0.287
gh+0.250
iding+0.242
 repeat+0.239
et+0.237
Neg logit contributions
 curtains-0.191
 rooms-0.177
 Some-0.174
 doorway-0.170
 scientists-0.169
Token were
Token ID547
Feature activation-9.73e-01

Pos logit contributions
 pieces+0.002
 many+0.002
 curtains+0.002
 of+0.002
 sky+0.002
Neg logit contributions
ah-0.011
gh-0.010
iding-0.010
 repeat-0.010
Deal-0.010
Token not
Token ID407
Feature activation-2.11e+00

Pos logit contributions
 many+0.023
 curtains+0.023
 Some+0.022
 pieces+0.022
 rooms+0.021
Neg logit contributions
ah-0.045
gh-0.041
iding-0.040
 repeat-0.039
Deal-0.039
Token so
Token ID523
Feature activation+1.05e+00

Pos logit contributions
 many+0.050
 twins+0.046
 pieces+0.045
 curtains+0.045
 Some+0.044
Neg logit contributions
ah-0.100
gh-0.094
Deal-0.089
iding-0.089
 repeat-0.088
Token to
Token ID284
Feature activation+2.52e+00

Pos logit contributions
 curtains+0.092
 doorway+0.082
edd+0.080
 Some+0.078
 rooms+0.077
Neg logit contributions
ah-0.229
gh-0.222
 repeat-0.215
Deal-0.207
 pose-0.202
Token be
Token ID307
Feature activation+1.84e+00

Pos logit contributions
ah+0.104
iding+0.102
 AW+0.100
gh+0.100
 la+0.100
Neg logit contributions
 many-0.064
-0.064
 family-0.063
 that-0.061
edd-0.060
Token careful
Token ID8161
Feature activation+8.71e-01

Pos logit contributions
ah+0.079
gh+0.072
 repeat+0.069
iding+0.068
Deal+0.067
Neg logit contributions
 many-0.049
 Some-0.049
 curtains-0.047
 pieces-0.047
 twins-0.046
Token when
Token ID618
Feature activation-2.29e+00

Pos logit contributions
ah+0.043
gh+0.041
 repeat+0.040
iding+0.039
Deal+0.039
Neg logit contributions
 curtains-0.016
 Some-0.016
edd-0.015
 doorway-0.014
 rooms-0.014
Token having
Token ID1719
Feature activation+3.64e-01

Pos logit contributions
 curtains+0.032
 many+0.032
 Some+0.031
 pieces+0.030
 rooms+0.029
Neg logit contributions
ah-0.128
gh-0.117
Deal-0.116
 repeat-0.115
iding-0.114
Token pic
Token ID8301
Feature activation+1.41e+01

Pos logit contributions
ah+0.018
iding+0.017
gh+0.017
Deal+0.016
 repeat+0.016
Neg logit contributions
 many-0.008
 pieces-0.007
 Some-0.007
 curtains-0.007
 rooms-0.007
Tokenn
Token ID77
Feature activation+5.66e+00

Pos logit contributions
ah+0.389
 forth+0.305
 repeat+0.301
Deal+0.300
 pose+0.299
Neg logit contributions
 many-0.555
-0.551
edd-0.548
 of-0.534
 Some-0.531
Tokenics
Token ID873
Feature activation-1.01e+00

Pos logit contributions
ah+0.245
gh+0.228
 repeat+0.221
iding+0.215
Deal+0.215
Neg logit contributions
 curtains-0.141
 Some-0.138
 pieces-0.131
edd-0.131
 many-0.130
Token and
Token ID290
Feature activation-3.35e-01

Pos logit contributions
 many+0.014
 everywhere+0.013
 of+0.013
 pieces+0.013
+0.013
Neg logit contributions
ah-0.060
 repeat-0.053
gh-0.053
iding-0.052
-0.052
Token not
Token ID407
Feature activation-1.17e+00

Pos logit contributions
 many+0.008
 curtains+0.008
 pieces+0.007
 Some+0.007
 rooms+0.007
Neg logit contributions
ah-0.016
gh-0.015
iding-0.014
Deal-0.014
 repeat-0.013
Token to
Token ID284
Feature activation+3.30e+00

Pos logit contributions
 of+0.030
 many+0.030
 to+0.027
 flowers+0.026
 twins+0.025
Neg logit contributions
ah-0.056
gh-0.053
-0.052
iding-0.051
Deal-0.051
Token.
Token ID13
Feature activation+1.60e-01

Pos logit contributions
 of+0.048
+0.045
 many+0.044
 pieces+0.043
 curtains+0.037
Neg logit contributions
ah-0.182
Deal-0.165
 repeat-0.165
iding-0.160
gh-0.159
Token And
Token ID843
Feature activation+3.03e+00

Pos logit contributions
ah+0.009
gh+0.008
 repeat+0.008
Deal+0.008
iding+0.008
Neg logit contributions
 curtains-0.002
 Some-0.002
 pieces-0.002
edd-0.002
 many-0.002
Token that
Token ID326
Feature activation-1.87e+00

Pos logit contributions
ah+0.167
gh+0.158
iding+0.153
Deal+0.151
 repeat+0.150
Neg logit contributions
 many-0.047
 of-0.044
-0.042
 pieces-0.040
 curtains-0.039
Token's
Token ID338
Feature activation+2.11e+00

Pos logit contributions
 many+0.032
 pieces+0.031
+0.029
 of+0.029
 sky+0.026
Neg logit contributions
ah-0.099
iding-0.095
gh-0.094
Deal-0.094
 repeat-0.091
Token why
Token ID1521
Feature activation-1.42e+00

Pos logit contributions
ah+0.113
gh+0.108
iding+0.101
 AW+0.101
 pose+0.100
Neg logit contributions
 many-0.039
 twins-0.034
 of-0.034
-0.032
 everywhere-0.031
Token Tues
Token ID48496
Feature activation+1.40e+01

Pos logit contributions
 many+0.018
 of+0.014
+0.013
 pieces+0.013
 scientists+0.012
Neg logit contributions
ah-0.085
Deal-0.082
iding-0.079
gh-0.078
et-0.078
Tokendays
Token ID12545
Feature activation+1.59e+00

Pos logit contributions
ah+0.567
gh+0.524
 repeat+0.498
et+0.492
 pose+0.491
Neg logit contributions
-0.370
edd-0.370
 curtains-0.362
 Some-0.361
-0.361
Token are
Token ID389
Feature activation-5.14e-01

Pos logit contributions
ah+0.092
gh+0.087
iding+0.085
Deal+0.083
astics+0.083
Neg logit contributions
 of-0.021
 many-0.017
 pieces-0.017
 curtains-0.016
-0.015
Token so
Token ID523
Feature activation+4.13e-01

Pos logit contributions
 many+0.009
 of+0.009
 pieces+0.008
 curtains+0.008
 Some+0.008
Neg logit contributions
ah-0.027
gh-0.025
iding-0.025
Deal-0.025
 repeat-0.025
Token important
Token ID1593
Feature activation-2.66e-01

Pos logit contributions
ah+0.021
gh+0.019
iding+0.019
 repeat+0.019
Deal+0.018
Neg logit contributions
 many-0.009
 of-0.008
 Some-0.008
 curtains-0.008
 pieces-0.007
Token.
Token ID13
Feature activation+4.77e-01

Pos logit contributions
 of+0.004
 many+0.004
 curtains+0.004
+0.004
 pieces+0.004
Neg logit contributions
ah-0.015
 repeat-0.013
gh-0.013
Deal-0.013
-0.013
Token be
Token ID307
Feature activation+1.94e+00

Pos logit contributions
ah+0.053
iding+0.047
gh+0.046
 AW+0.045
core+0.044
Neg logit contributions
-0.036
 many-0.036
 a-0.034
 family-0.034
 of-0.033
Token there
Token ID612
Feature activation-1.40e+00

Pos logit contributions
ah+0.092
gh+0.086
Deal+0.084
 repeat+0.084
 pose+0.082
Neg logit contributions
 many-0.045
 pieces-0.041
 of-0.040
-0.039
 curtains-0.039
Token to
Token ID284
Feature activation+2.68e+00

Pos logit contributions
 many+0.026
 of+0.024
 curtains+0.023
+0.023
 pieces+0.022
Neg logit contributions
ah-0.074
gh-0.068
 repeat-0.066
Deal-0.066
iding-0.065
Token support
Token ID1104
Feature activation-8.99e-01

Pos logit contributions
ah+0.126
iding+0.112
 AW+0.110
gh+0.108
Deal+0.105
Neg logit contributions
 many-0.068
-0.067
 of-0.066
 a-0.063
 family-0.061
Token her
Token ID607
Feature activation+1.88e-02

Pos logit contributions
 many+0.013
 curtains+0.013
 pieces+0.012
 of+0.012
+0.012
Neg logit contributions
ah-0.051
gh-0.046
Deal-0.046
iding-0.046
 repeat-0.045
Token pic
Token ID8301
Feature activation+1.37e+01

Pos logit contributions
ah+0.001
gh+0.001
Deal+0.001
+0.001
 AW+0.001
Neg logit contributions
 many-0.000
 pieces-0.000
 curtains-0.000
 family-0.000
 of-0.000
Tokenn
Token ID77
Feature activation+5.45e+00

Pos logit contributions
ah+0.390
Deal+0.328
 cheating+0.319
 repeat+0.309
 forth+0.309
Neg logit contributions
edd-0.531
-0.529
 many-0.522
 of-0.511
 curtains-0.509
Tokenics
Token ID873
Feature activation-1.53e+00

Pos logit contributions
ah+0.254
gh+0.234
Deal+0.231
 repeat+0.230
iding+0.219
Neg logit contributions
 curtains-0.117
 Some-0.114
 many-0.112
 pieces-0.111
edd-0.109
Token--
Token ID438
Feature activation+9.58e-02

Pos logit contributions
 of+0.026
 many+0.026
 pieces+0.023
 everywhere+0.022
+0.022
Neg logit contributions
ah-0.084
gh-0.076
Deal-0.075
 repeat-0.074
-0.073
Tokenno
Token ID3919
Feature activation+4.27e+00

Pos logit contributions
ah+0.004
gh+0.004
 repeat+0.004
 appetite+0.004
Deal+0.004
Neg logit contributions
 many-0.002
 of-0.002
 Some-0.002
 pieces-0.002
-0.002
Token matter
Token ID2300
Feature activation+2.85e+00

Pos logit contributions
ah+0.240
Deal+0.232
 repeat+0.226
 insist+0.223
iding+0.222
Neg logit contributions
 of-0.081
 pieces-0.065
 many-0.056
 people-0.051
 twins-0.050
Token.
Token ID13
Feature activation+7.02e-01

Pos logit contributions
ah+0.113
Deal+0.112
gh+0.106
 repeat+0.105
ercise+0.104
Neg logit contributions
 many-0.028
 of-0.028
-0.027
 pieces-0.026
 curtains-0.023
Token
Token ID198
Feature activation+7.88e-02

Pos logit contributions
ah+0.034
 repeat+0.031
gh+0.031
Deal+0.030
 pose+0.030
Neg logit contributions
edd-0.014
 curtains-0.014
rolley-0.013
 doorway-0.013
armac-0.012
Token
Token ID198
Feature activation+4.80e-01

Pos logit contributions
ah+0.003
gh+0.003
 pose+0.003
 repeat+0.003
Deal+0.003
Neg logit contributions
 doorway-0.002
 curtains-0.002
 models-0.002
 decorations-0.002
rolley-0.002
TokenBut
Token ID1537
Feature activation+9.23e-01

Pos logit contributions
ah+0.017
gh+0.016
Deal+0.016
 repeat+0.015
 pose+0.015
Neg logit contributions
 curtains-0.016
edd-0.015
 doorway-0.015
 rooms-0.015
 Some-0.015
Token Sam
Token ID3409
Feature activation+1.30e+00

Pos logit contributions
ah+0.039
gh+0.037
 repeat+0.036
Deal+0.035
iding+0.034
Neg logit contributions
 curtains-0.024
 Some-0.023
edd-0.023
 pieces-0.022
 many-0.022
Token squee
Token ID13170
Feature activation+1.37e+01

Pos logit contributions
iding+0.061
bek+0.060
ah+0.060
gh+0.058
cluded+0.058
Neg logit contributions
-0.027
 trees-0.027
 pieces-0.027
s-0.026
 was-0.026
Tokenzes
Token ID12271
Feature activation+5.75e-01

Pos logit contributions
ah+0.334
ust+0.331
iding+0.325
 repeat+0.314
chery+0.297
Neg logit contributions
 pieces-0.482
-0.465
 many-0.461
edd-0.460
 rooms-0.459
Token the
Token ID262
Feature activation-3.85e-01

Pos logit contributions
ah+0.031
Deal+0.030
iding+0.030
gh+0.030
 repeat+0.029
Neg logit contributions
 many-0.010
 pieces-0.009
 Some-0.009
-0.009
 of-0.008
Token dinosaur
Token ID28257
Feature activation-2.05e-01

Pos logit contributions
 curtains+0.009
 Some+0.009
 pieces+0.009
 many+0.009
edd+0.009
Neg logit contributions
ah-0.017
gh-0.016
 repeat-0.015
Deal-0.015
iding-0.015
Token too
Token ID1165
Feature activation+9.73e-01

Pos logit contributions
 pieces+0.004
 many+0.004
s+0.004
 of+0.004
 big+0.004
Neg logit contributions
-0.011
Deal-0.011
bek-0.011
ah-0.011
core-0.010
Token hard
Token ID1327
Feature activation+2.25e+00

Pos logit contributions
ah+0.046
gh+0.043
 repeat+0.042
Deal+0.042
iding+0.041
Neg logit contributions
 many-0.024
-0.022
 of-0.022
 pieces-0.021
 Some-0.020
Token food
Token ID2057
Feature activation-1.69e+00

Pos logit contributions
 twins+0.029
 curtains+0.029
 many+0.029
 rooms+0.027
 pieces+0.027
Neg logit contributions
ah-0.067
gh-0.060
iding-0.059
Deal-0.058
 repeat-0.058
Token.
Token ID13
Feature activation+5.84e-01

Pos logit contributions
 of+0.027
 twins+0.025
 curtains+0.024
 pieces+0.024
 everywhere+0.023
Neg logit contributions
ah-0.094
gh-0.086
iding-0.086
-0.085
heit-0.085
Token
Token ID198
Feature activation+1.19e-01

Pos logit contributions
ah+0.030
Deal+0.029
gh+0.029
 repeat+0.029
iding+0.027
Neg logit contributions
 curtains-0.008
 doorway-0.008
edd-0.008
 pieces-0.007
rolley-0.007
Token
Token ID198
Feature activation+5.02e-01

Pos logit contributions
gh+0.004
ah+0.004
iding+0.004
h+0.004
 repeat+0.004
Neg logit contributions
 curtains-0.003
 doorway-0.003
 posters-0.003
 scientists-0.003
 decorations-0.003
TokenThe
Token ID464
Feature activation+2.64e-01

Pos logit contributions
ah+0.018
gh+0.017
Deal+0.016
 repeat+0.016
iding+0.016
Neg logit contributions
 curtains-0.017
 Some-0.016
edd-0.016
 doorway-0.015
 pieces-0.015
Token st
Token ID336
Feature activation+1.36e+01

Pos logit contributions
ah+0.012
gh+0.011
 repeat+0.011
Deal+0.011
iding+0.011
Neg logit contributions
 curtains-0.006
 Some-0.006
 pieces-0.005
 many-0.005
edd-0.005
Tokeneg
Token ID1533
Feature activation+5.66e+00

Pos logit contributions
ah+0.393
gh+0.334
iding+0.278
et+0.271
 repeat+0.265
Neg logit contributions
 curtains-0.567
 Some-0.556
 many-0.548
 scientists-0.542
 rooms-0.541
Tokenosaurus
Token ID47650
Feature activation+5.05e-01

Pos logit contributions
ah+0.156
gh+0.132
 repeat+0.119
et+0.112
Deal+0.110
Neg logit contributions
 Some-0.245
 curtains-0.242
 many-0.237
 doorway-0.233
 rooms-0.231
Token came
Token ID1625
Feature activation-5.23e-01

Pos logit contributions
iding+0.024
ah+0.024
gh+0.023
Deal+0.023
+0.023
Neg logit contributions
 pieces-0.010
 curtains-0.010
-0.009
 sky-0.009
 many-0.009
Token out
Token ID503
Feature activation-1.38e+00

Pos logit contributions
 curtains+0.010
 Some+0.009
 many+0.009
 pieces+0.009
edd+0.009
Neg logit contributions
ah-0.026
gh-0.025
Deal-0.024
 repeat-0.024
iding-0.023
Token of
Token ID286
Feature activation-2.02e+00

Pos logit contributions
 curtains+0.013
 Some+0.012
edd+0.011
 pieces+0.010
 rooms+0.010
Neg logit contributions
ah-0.081
gh-0.076
 repeat-0.076
iding-0.074
 pose-0.072
Token the
Token ID262
Feature activation-1.04e+00

Pos logit contributions
ah+0.136
gh+0.128
 repeat+0.126
Deal+0.123
iding+0.122
Neg logit contributions
 curtains-0.031
 Some-0.030
edd-0.028
 pieces-0.026
 many-0.026
Token park
Token ID3952
Feature activation+1.50e+00

Pos logit contributions
 sky+0.023
 curtains+0.022
 many+0.022
 rooms+0.021
 trees+0.021
Neg logit contributions
ah-0.052
gh-0.050
iding-0.049
Deal-0.048
 repeat-0.048
Token,
Token ID11
Feature activation-1.55e+00

Pos logit contributions
ah+0.086
heit+0.084
gh+0.083
ercise+0.081
 AW+0.081
Neg logit contributions
 many-0.023
 of-0.019
-0.018
 pieces-0.017
 sky-0.016
Token and
Token ID290
Feature activation-6.17e-01

Pos logit contributions
 curtains+0.023
 many+0.023
 pieces+0.021
 of+0.020
 flowers+0.020
Neg logit contributions
ah-0.086
gh-0.084
Deal-0.079
iding-0.078
-0.076
Token have
Token ID423
Feature activation-2.23e+00

Pos logit contributions
 many+0.011
 curtains+0.010
 of+0.009
+0.009
 rooms+0.009
Neg logit contributions
ah-0.034
gh-0.032
iding-0.031
Deal-0.031
-0.029
Token pic
Token ID8301
Feature activation+1.34e+01

Pos logit contributions
 curtains+0.042
 Some+0.041
edd+0.039
 pieces+0.038
 many+0.038
Neg logit contributions
ah-0.109
gh-0.103
 repeat-0.100
Deal-0.098
iding-0.097
Tokenn
Token ID77
Feature activation+4.95e+00

Pos logit contributions
ah+0.323
Deal+0.269
 repeat+0.262
 forth+0.257
 cheating+0.248
Neg logit contributions
 many-0.567
-0.557
edd-0.553
 curtains-0.553
 of-0.550
Tokenics
Token ID873
Feature activation-2.05e+00

Pos logit contributions
ah+0.215
gh+0.200
 repeat+0.195
Deal+0.192
iding+0.188
Neg logit contributions
 curtains-0.121
 Some-0.120
 many-0.114
 pieces-0.114
edd-0.113
Token outside
Token ID2354
Feature activation+1.17e+00

Pos logit contributions
 many+0.035
 of+0.033
 everywhere+0.032
 pieces+0.030
+0.029
Neg logit contributions
ah-0.108
 repeat-0.103
heit-0.103
-0.102
gh-0.101
Token.
Token ID13
Feature activation+4.44e-01

Pos logit contributions
ah+0.065
gh+0.062
Deal+0.060
 repeat+0.060
iding+0.059
Neg logit contributions
 many-0.021
 of-0.019
-0.017
 curtains-0.015
 everywhere-0.015
Token
Token ID220
Feature activation+3.23e-01

Pos logit contributions
ah+0.022
 repeat+0.020
gh+0.020
Deal+0.020
iding+0.019
Neg logit contributions
 curtains-0.008
 pieces-0.008
edd-0.007
 doorway-0.007
 rooms-0.007
Token They
Token ID1119
Feature activation+3.68e-01

Pos logit contributions
ah+0.016
Deal+0.015
 repeat+0.015
+0.014
gh+0.014
Neg logit contributions
 curtains-0.006
 pieces-0.006
 chairs-0.005
 posters-0.005
edd-0.005
Token missed
Token ID6825
Feature activation-2.57e+00

Pos logit contributions
ah+0.017
gh+0.017
iding+0.016
 repeat+0.015
Deal+0.015
Neg logit contributions
 many-0.008
 curtains-0.008
 Some-0.008
 pieces-0.007
-0.007
Token their
Token ID511
Feature activation+5.45e-01

Pos logit contributions
 many+0.038
 curtains+0.036
 pieces+0.033
 rooms+0.032
 twins+0.032
Neg logit contributions
ah-0.148
gh-0.137
 repeat-0.130
iding-0.130
Deal-0.130
Token walks
Token ID11114
Feature activation+4.26e+00

Pos logit contributions
ah+0.027
gh+0.025
 repeat+0.024
iding+0.024
Deal+0.024
Neg logit contributions
 curtains-0.011
 many-0.011
 pieces-0.011
 rooms-0.010
 Some-0.010
Token and
Token ID290
Feature activation-1.58e+00

Pos logit contributions
ah+0.216
gh+0.208
 repeat+0.206
Deal+0.199
iding+0.196
Neg logit contributions
 of-0.077
 many-0.076
 pieces-0.073
 everywhere-0.069
 curtains-0.068
Token pic
Token ID8301
Feature activation+1.34e+01

Pos logit contributions
 curtains+0.035
 Some+0.034
 many+0.034
 pieces+0.033
 rooms+0.032
Neg logit contributions
ah-0.074
gh-0.069
 repeat-0.066
iding-0.066
Deal-0.065
Tokenn
Token ID77
Feature activation+4.99e+00

Pos logit contributions
ah+0.345
 repeat+0.278
 pose+0.276
 cheating+0.276
Deal+0.275
Neg logit contributions
edd-0.549
 many-0.542
-0.541
 of-0.529
 Some-0.520
Tokenics
Token ID873
Feature activation-1.88e+00

Pos logit contributions
ah+0.197
gh+0.180
iding+0.173
 repeat+0.164
et+0.161
Neg logit contributions
 curtains-0.156
 Some-0.148
 rooms-0.144
 doorway-0.140
 chairs-0.140
Token near
Token ID1474
Feature activation+2.41e+00

Pos logit contributions
 many+0.039
 of+0.036
 everywhere+0.035
+0.034
 pieces+0.034
Neg logit contributions
ah-0.095
gh-0.088
 repeat-0.087
astics-0.086
-0.086
Token the
Token ID262
Feature activation-2.20e-01

Pos logit contributions
ah+0.142
gh+0.133
Deal+0.130
 repeat+0.129
iding+0.128
Neg logit contributions
 curtains-0.027
 many-0.025
 Some-0.025
 pieces-0.024
 rooms-0.022
Token cliff
Token ID19516
Feature activation+2.31e+00

Pos logit contributions
 Some+0.006
 curtains+0.006
edd+0.006
 pieces+0.006
 Someone+0.005
Neg logit contributions
ah-0.009
gh-0.008
 repeat-0.008
Deal-0.008
 pose-0.008
Token."
Token ID526
Feature activation-6.13e-01

Pos logit contributions
ah+0.290
gh+0.282
Deal+0.272
 repeat+0.263
iding+0.262
Neg logit contributions
 of-0.126
 curtains-0.117
 many-0.117
-0.115
 Some-0.110
Token
Token ID198
Feature activation+4.85e-01

Pos logit contributions
 curtains+0.011
edd+0.011
 doorway+0.011
armac+0.011
rolley+0.010
Neg logit contributions
ah-0.029
 repeat-0.028
gh-0.027
Deal-0.027
 pose-0.026
Token
Token ID198
Feature activation+7.80e-01

Pos logit contributions
ah+0.021
gh+0.020
 repeat+0.019
iding+0.019
Deal+0.019
Neg logit contributions
 curtains-0.010
 doorway-0.010
 scientists-0.010
 sculptures-0.009
 posters-0.009
TokenL
Token ID43
Feature activation+4.17e+00

Pos logit contributions
ah+0.035
gh+0.033
 repeat+0.032
Deal+0.031
iding+0.031
Neg logit contributions
 Some-0.018
 curtains-0.018
 many-0.018
 pieces-0.017
-0.017
Tokenily
Token ID813
Feature activation-6.08e-01

Pos logit contributions
ah+0.178
gh+0.165
 repeat+0.160
Deal+0.158
iding+0.157
Neg logit contributions
 curtains-0.110
 Some-0.106
 doorway-0.101
 many-0.100
edd-0.100
Token squee
Token ID13170
Feature activation+1.33e+01

Pos logit contributions
 curtains+0.016
 sky+0.015
 pieces+0.015
 trees+0.015
+0.015
Neg logit contributions
gh-0.027
ah-0.025
iding-0.025
bek-0.024
Deal-0.024
Tokenzes
Token ID12271
Feature activation+1.47e-01

Pos logit contributions
ust+0.398
iding+0.363
ah+0.353
cluded+0.333
chery+0.309
Neg logit contributions
edd-0.433
-0.416
 grandma-0.413
 pieces-0.412
 posters-0.411
Token his
Token ID465
Feature activation+1.64e+00

Pos logit contributions
ah+0.008
gh+0.007
Deal+0.007
iding+0.007
 repeat+0.007
Neg logit contributions
 Some-0.003
 many-0.002
 curtains-0.002
 pieces-0.002
-0.002
Token hand
Token ID1021
Feature activation-1.89e+00

Pos logit contributions
iding+0.072
Deal+0.072
ah+0.071
bek+0.070
aling+0.069
Neg logit contributions
 pieces-0.042
 big-0.041
 toys-0.040
 mom-0.039
 chairs-0.039
Token.
Token ID13
Feature activation+6.86e-01

Pos logit contributions
 a+0.018
 some+0.015
 and+0.014
 many+0.014
 of+0.013
Neg logit contributions
heit-0.128
core-0.122
 compete-0.120
bek-0.120
 AW-0.119
Token "
Token ID366
Feature activation+9.28e-02

Pos logit contributions
ah+0.039
gh+0.037
Deal+0.036
 repeat+0.035
et+0.035
Neg logit contributions
 Some-0.010
 curtains-0.009
-0.008
 many-0.008
 Someone-0.008
Token roof
Token ID9753
Feature activation-9.47e-01

Pos logit contributions
 Some+0.010
 curtains+0.009
edd+0.009
 Someone+0.009
rolley+0.008
Neg logit contributions
ah-0.019
gh-0.018
Deal-0.017
 repeat-0.017
et-0.016
Token.
Token ID13
Feature activation+3.65e-01

Pos logit contributions
 curtains+0.019
 Some+0.019
edd+0.017
 Someone+0.017
 pieces+0.017
Neg logit contributions
ah-0.046
gh-0.042
 repeat-0.042
Deal-0.041
iding-0.040
Token They
Token ID1119
Feature activation+2.74e-01

Pos logit contributions
ah+0.016
 repeat+0.014
Deal+0.014
gh+0.014
 compete+0.014
Neg logit contributions
 doorway-0.009
 curtains-0.008
 pieces-0.008
edd-0.008
Someone-0.008
Token would
Token ID561
Feature activation+2.26e+00

Pos logit contributions
ah+0.012
gh+0.012
iding+0.011
 repeat+0.011
Deal+0.011
Neg logit contributions
 many-0.006
 curtains-0.006
 Some-0.006
 pieces-0.006
 rooms-0.006
Token have
Token ID423
Feature activation-2.22e+00

Pos logit contributions
ah+0.092
gh+0.092
iding+0.088
+0.084
 AW+0.083
Neg logit contributions
 many-0.065
 curtains-0.063
-0.060
 family-0.059
 of-0.058
Token pic
Token ID8301
Feature activation+1.33e+01

Pos logit contributions
 curtains+0.039
edd+0.038
 doorway+0.038
 Some+0.036
rolley+0.035
Neg logit contributions
ah-0.111
gh-0.105
 repeat-0.101
Deal-0.100
et-0.097
Tokenn
Token ID77
Feature activation+4.66e+00

Pos logit contributions
ah+0.256
gh+0.209
 repeat+0.198
Deal+0.193
iding+0.181
Neg logit contributions
 curtains-0.642
 Some-0.638
 many-0.626
edd-0.624
 pieces-0.623
Tokenics
Token ID873
Feature activation-2.10e+00

Pos logit contributions
ah+0.190
gh+0.181
iding+0.172
 repeat+0.169
Deal+0.159
Neg logit contributions
 curtains-0.134
 Some-0.130
 doorway-0.128
 Someone-0.125
 rooms-0.124
Token and
Token ID290
Feature activation-4.81e-01

Pos logit contributions
 many+0.037
 pieces+0.034
 curtains+0.034
 of+0.033
+0.033
Neg logit contributions
ah-0.107
gh-0.101
 repeat-0.098
iding-0.096
Deal-0.094
Token play
Token ID711
Feature activation+3.11e+00

Pos logit contributions
 curtains+0.014
 Some+0.014
 many+0.014
 pieces+0.014
edd+0.013
Neg logit contributions
ah-0.019
gh-0.017
 repeat-0.016
Deal-0.016
iding-0.016
Token games
Token ID1830
Feature activation+7.02e-01

Pos logit contributions
ah+0.155
gh+0.140
 repeat+0.135
iding+0.134
 insist+0.130
Neg logit contributions
 many-0.068
 curtains-0.063
 pieces-0.061
-0.061
 rooms-0.060
Token went
Token ID1816
Feature activation+3.31e+00

Pos logit contributions
ah+0.010
gh+0.010
 repeat+0.009
Deal+0.009
iding+0.009
Neg logit contributions
 curtains-0.005
 Some-0.005
 many-0.005
 pieces-0.005
edd-0.005
Token on
Token ID319
Feature activation+4.64e+00

Pos logit contributions
ah+0.157
gh+0.147
 repeat+0.143
Deal+0.141
iding+0.139
Neg logit contributions
 curtains-0.070
 Some-0.068
 many-0.065
 pieces-0.064
edd-0.064
Token adventures
Token ID17545
Feature activation+2.16e+00

Pos logit contributions
ah+0.198
gh+0.194
 repeat+0.182
Deal+0.176
 pose+0.176
Neg logit contributions
edd-0.112
rolley-0.109
 Someone-0.108
 Some-0.105
Someone-0.103
Token,
Token ID11
Feature activation-1.69e+00

Pos logit contributions
ah+0.108
gh+0.101
 repeat+0.099
Deal+0.097
iding+0.096
Neg logit contributions
 curtains-0.040
 Some-0.039
 many-0.038
 pieces-0.037
edd-0.036
Token had
Token ID550
Feature activation-1.70e+00

Pos logit contributions
 curtains+0.035
 Some+0.035
edd+0.033
 pieces+0.032
 doorway+0.032
Neg logit contributions
ah-0.080
gh-0.075
 repeat-0.074
Deal-0.072
iding-0.071
Token pic
Token ID8301
Feature activation+1.32e+01

Pos logit contributions
 curtains+0.024
 Some+0.022
edd+0.021
 doorway+0.021
rolley+0.019
Neg logit contributions
ah-0.093
gh-0.088
 repeat-0.085
Deal-0.084
et-0.082
Tokenn
Token ID77
Feature activation+4.44e+00

Pos logit contributions
ah+0.264
gh+0.214
 repeat+0.206
Deal+0.201
iding+0.187
Neg logit contributions
 curtains-0.624
 Some-0.621
 many-0.614
edd-0.609
 pieces-0.608
Tokenics
Token ID873
Feature activation-2.35e+00

Pos logit contributions
ah+0.182
gh+0.169
iding+0.162
 repeat+0.154
et+0.146
Neg logit contributions
 curtains-0.133
 Some-0.128
 doorway-0.125
 rooms-0.125
 Someone-0.124
Token,
Token ID11
Feature activation-1.41e+00

Pos logit contributions
 many+0.036
 pieces+0.033
 of+0.032
+0.032
 everywhere+0.031
Neg logit contributions
ah-0.126
gh-0.120
 repeat-0.117
iding-0.115
-0.114
Token and
Token ID290
Feature activation-3.24e-01

Pos logit contributions
 Some+0.032
rolley+0.031
edd+0.031
 Someone+0.030
 curtains+0.030
Neg logit contributions
ah-0.064
 repeat-0.061
gh-0.058
Deal-0.056
et-0.056
Token even
Token ID772
Feature activation-1.92e+00

Pos logit contributions
 curtains+0.008
 Some+0.008
edd+0.007
 doorway+0.007
 pieces+0.007
Neg logit contributions
ah-0.014
gh-0.013
 repeat-0.013
Deal-0.013
iding-0.012
Token the
Token ID262
Feature activation-9.98e-01

Pos logit contributions
ah+0.098
Deal+0.089
gh+0.088
iding+0.087
+0.082
Neg logit contributions
 many-0.036
-0.034
 curtains-0.033
 rooms-0.030
 twins-0.029
Token corn
Token ID11676
Feature activation+3.94e+00

Pos logit contributions
 curtains+0.019
 many+0.019
 pieces+0.018
 sky+0.018
 doorway+0.018
Neg logit contributions
ah-0.051
gh-0.048
iding-0.047
Deal-0.046
 repeat-0.046
Token field
Token ID2214
Feature activation+8.61e-01

Pos logit contributions
ah+0.191
gh+0.182
 repeat+0.172
Deal+0.171
iding+0.169
Neg logit contributions
 curtains-0.087
 pieces-0.083
 many-0.082
 sky-0.080
 doorway-0.079
Token.
Token ID13
Feature activation+4.08e-01

Pos logit contributions
ah+0.046
Deal+0.043
gh+0.043
 repeat+0.042
+0.041
Neg logit contributions
 many-0.014
 of-0.014
 pieces-0.013
 curtains-0.013
-0.013
Token The
Token ID383
Feature activation-6.11e-01

Pos logit contributions
ah+0.021
gh+0.019
 repeat+0.019
Deal+0.019
iding+0.019
Neg logit contributions
 curtains-0.007
edd-0.006
 pieces-0.006
 Some-0.006
 rooms-0.006
Token st
Token ID336
Feature activation+1.32e+01

Pos logit contributions
 curtains+0.011
 pieces+0.011
 Some+0.010
 sky+0.010
 many+0.010
Neg logit contributions
ah-0.031
gh-0.030
iding-0.029
 repeat-0.028
Deal-0.028
Tokenalks
Token ID23833
Feature activation-1.71e+00

Pos logit contributions
ah+0.414
gh+0.335
iding+0.329
ig+0.307
et+0.302
Neg logit contributions
 curtains-0.515
 scientists-0.510
 rooms-0.503
 chairs-0.494
 Some-0.485
Token looked
Token ID3114
Feature activation-1.08e+00

Pos logit contributions
 curtains+0.038
 Some+0.037
 pieces+0.036
 many+0.036
edd+0.035
Neg logit contributions
ah-0.078
gh-0.074
 repeat-0.071
Deal-0.071
iding-0.070
Token happy
Token ID3772
Feature activation-3.55e+00

Pos logit contributions
 curtains+0.027
 Some+0.027
edd+0.026
 pieces+0.025
 doorway+0.025
Neg logit contributions
ah-0.047
gh-0.043
 repeat-0.042
Deal-0.042
iding-0.041
Token to
Token ID284
Feature activation+1.47e+00

Pos logit contributions
 many+0.044
 curtains+0.042
 pieces+0.041
 of+0.040
+0.040
Neg logit contributions
ah-0.201
gh-0.190
 repeat-0.185
Deal-0.183
iding-0.183
Token see
Token ID766
Feature activation-4.67e+00

Pos logit contributions
ah+0.065
Deal+0.061
iding+0.061
gh+0.060
 pose+0.057
Neg logit contributions
-0.035
 many-0.034
 Some-0.033
 curtains-0.032
 of-0.032
Token soon
Token ID2582
Feature activation-2.30e+00

Pos logit contributions
 curtains+0.022
 Some+0.022
 many+0.021
 pieces+0.020
edd+0.020
Neg logit contributions
ah-0.046
gh-0.043
 repeat-0.042
Deal-0.042
iding-0.041
Token nothing
Token ID2147
Feature activation-4.24e+00

Pos logit contributions
 curtains+0.052
 many+0.050
 pieces+0.050
 sky+0.049
 doorway+0.048
Neg logit contributions
ah-0.108
gh-0.100
Deal-0.100
 repeat-0.095
iding-0.094
Token but
Token ID475
Feature activation+6.89e-02

Pos logit contributions
 of+0.078
 many+0.076
 curtains+0.074
 pieces+0.073
+0.070
Neg logit contributions
ah-0.211
gh-0.198
Deal-0.193
iding-0.192
 repeat-0.192
Token a
Token ID257
Feature activation-3.98e-01

Pos logit contributions
ah+0.004
gh+0.003
 repeat+0.003
Deal+0.003
iding+0.003
Neg logit contributions
 curtains-0.001
 Some-0.001
 many-0.001
 pieces-0.001
edd-0.001
Token small
Token ID1402
Feature activation+1.68e+00

Pos logit contributions
 curtains+0.006
 sky+0.006
 many+0.006
 doorway+0.006
 pieces+0.006
Neg logit contributions
ah-0.021
iding-0.020
gh-0.020
Deal-0.020
et-0.019
Token spe
Token ID693
Feature activation+1.31e+01

Pos logit contributions
gh+0.092
Deal+0.089
iding+0.089
astics+0.088
ah+0.088
Neg logit contributions
 sky-0.030
 curtains-0.027
 of-0.027
 pieces-0.026
 family-0.025
Tokenck
Token ID694
Feature activation+9.89e-01

Pos logit contributions
 repeat+0.255
ah+0.238
gh+0.233
Deal+0.232
iding+0.228
Neg logit contributions
edd-0.561
 many-0.555
 Some-0.553
 of-0.552
 pieces-0.551
Token in
Token ID287
Feature activation+2.91e+00

Pos logit contributions
Deal+0.049
astics+0.048
gh+0.047
ah+0.046
+0.045
Neg logit contributions
 of-0.022
 pieces-0.019
 sky-0.019
 curtains-0.018
 flowers-0.017
Token the
Token ID262
Feature activation-1.32e+00

Pos logit contributions
ah+0.157
Deal+0.152
gh+0.151
iding+0.150
 repeat+0.141
Neg logit contributions
 many-0.045
 curtains-0.045
 pieces-0.043
 sky-0.042
 Some-0.042
Token sky
Token ID6766
Feature activation-2.16e+00

Pos logit contributions
 curtains+0.015
 Some+0.014
 pieces+0.013
 many+0.013
 sky+0.013
Neg logit contributions
ah-0.075
gh-0.071
 repeat-0.069
Deal-0.069
iding-0.069
Token.
Token ID13
Feature activation+3.47e-02

Pos logit contributions
 of+0.033
 sky+0.032
+0.030
 curtains+0.029
 that+0.025
Neg logit contributions
Deal-0.118
ah-0.118
gh-0.116
astics-0.113
iding-0.112
Token their
Token ID511
Feature activation+2.57e-02

Pos logit contributions
 doorway+0.041
rolley+0.040
armac+0.037
 curtains+0.037
edd+0.035
Neg logit contributions
 repeat-0.047
gh-0.047
Deal-0.045
ah-0.044
et-0.043
Token salad
Token ID20698
Feature activation+4.28e-01

Pos logit contributions
ah+0.001
gh+0.001
Deal+0.001
 repeat+0.001
iding+0.001
Neg logit contributions
 curtains-0.001
 Some-0.001
armac-0.001
edd-0.001
 doorway-0.001
Token and
Token ID290
Feature activation-1.75e+00

Pos logit contributions
ah+0.017
gh+0.016
Deal+0.016
 repeat+0.016
iding+0.016
Neg logit contributions
 Some-0.012
 curtains-0.011
 Someone-0.011
 doorway-0.011
edd-0.010
Token have
Token ID423
Feature activation-2.68e+00

Pos logit contributions
 Some+0.064
 curtains+0.064
edd+0.062
 doorway+0.061
 pieces+0.061
Neg logit contributions
ah-0.053
 repeat-0.049
gh-0.048
Deal-0.047
 pose-0.043
Token happy
Token ID3772
Feature activation-2.47e+00

Pos logit contributions
 curtains+0.081
 doorway+0.079
edd+0.077
rolley+0.076
 Someone+0.074
Neg logit contributions
ah-0.101
gh-0.098
Deal-0.092
 repeat-0.092
astics-0.085
Token pic
Token ID8301
Feature activation+1.31e+01

Pos logit contributions
 curtains+0.061
 Some+0.060
 doorway+0.058
edd+0.058
rolley+0.057
Neg logit contributions
ah-0.102
gh-0.100
Deal-0.096
 repeat-0.096
iding-0.089
Tokenn
Token ID77
Feature activation+4.50e+00

Pos logit contributions
ah+0.292
 repeat+0.236
Deal+0.219
gh+0.218
 forth+0.215
Neg logit contributions
edd-0.577
 many-0.571
 curtains-0.566
 Some-0.565
-0.564
Tokenics
Token ID873
Feature activation-2.57e+00

Pos logit contributions
ah+0.179
gh+0.172
iding+0.162
 repeat+0.158
et+0.150
Neg logit contributions
 curtains-0.133
 Some-0.130
 doorway-0.124
 rooms-0.122
 Someone-0.120
Token together
Token ID1978
Feature activation+1.09e+00

Pos logit contributions
 many+0.051
 pieces+0.049
 of+0.047
+0.047
 twins+0.046
Neg logit contributions
ah-0.127
gh-0.118
 repeat-0.117
iding-0.114
 pose-0.110
Token.
Token ID13
Feature activation+1.92e-01

Pos logit contributions
ah+0.065
+0.058
gh+0.058
 AW+0.058
 repeat+0.058
Neg logit contributions
 many-0.016
-0.015
 of-0.014
 everywhere-0.014
 pieces-0.013
Token<|endoftext|>
Token ID50256
Feature activation-2.84e+00

Pos logit contributions
ah+0.009
Deal+0.008
gh+0.008
 repeat+0.008
iding+0.007
Neg logit contributions
 curtains-0.005
 doorway-0.004
edd-0.004
rolley-0.004
Someone-0.004
Token favourite
Token ID12507
Feature activation-1.66e+00

Pos logit contributions
ah+0.008
gh+0.008
iding+0.007
+0.007
et+0.007
Neg logit contributions
 curtains-0.004
 many-0.004
 family-0.003
 rooms-0.003
 sky-0.003
Token place
Token ID1295
Feature activation-1.15e+00

Pos logit contributions
 curtains+0.037
 pieces+0.036
 of+0.036
 rooms+0.035
 many+0.034
Neg logit contributions
ah-0.079
gh-0.075
iding-0.075
Deal-0.074
 repeat-0.073
Token to
Token ID284
Feature activation+2.48e+00

Pos logit contributions
 curtains+0.020
 of+0.019
 many+0.019
 pieces+0.018
+0.017
Neg logit contributions
ah-0.060
gh-0.055
 repeat-0.054
Deal-0.054
iding-0.053
Token go
Token ID467
Feature activation+3.73e+00

Pos logit contributions
ah+0.113
iding+0.100
beh+0.099
Deal+0.099
gh+0.099
Neg logit contributions
 many-0.068
-0.067
 of-0.066
 a-0.064
 curtains-0.061
Token for
Token ID329
Feature activation-1.03e+00

Pos logit contributions
ah+0.210
Deal+0.206
gh+0.197
 repeat+0.192
iding+0.189
Neg logit contributions
 of-0.058
-0.057
 many-0.056
 curtains-0.050
 sky-0.048
Token pic
Token ID8301
Feature activation+1.30e+01

Pos logit contributions
 many+0.013
 curtains+0.011
 pieces+0.010
+0.010
 Some+0.009
Neg logit contributions
ah-0.062
Deal-0.057
iding-0.057
 repeat-0.056
gh-0.056
Tokenn
Token ID77
Feature activation+4.52e+00

Pos logit contributions
ah+0.332
Deal+0.285
 repeat+0.261
 cheating+0.253
 forth+0.249
Neg logit contributions
 many-0.545
-0.544
 curtains-0.537
edd-0.535
 pieces-0.521
Tokenics
Token ID873
Feature activation-2.39e+00

Pos logit contributions
ah+0.199
gh+0.187
 repeat+0.179
iding+0.176
Deal+0.174
Neg logit contributions
 curtains-0.109
 Some-0.108
edd-0.101
 pieces-0.100
 many-0.099
Token.
Token ID13
Feature activation+5.79e-01

Pos logit contributions
 many+0.033
 of+0.033
+0.032
 pieces+0.027
 everywhere+0.027
Neg logit contributions
ah-0.136
 repeat-0.125
Deal-0.125
gh-0.124
heit-0.124
Token<|endoftext|>
Token ID50256
Feature activation-2.38e+00

Pos logit contributions
ah+0.032
gh+0.030
 repeat+0.030
Deal+0.030
iding+0.029
Neg logit contributions
 curtains-0.007
 pieces-0.006
edd-0.006
 rooms-0.005
 doorway-0.005
TokenOnce
Token ID7454
Feature activation+2.58e+00

Pos logit contributions
 curtains+0.018
 Some+0.017
edd+0.015
 rooms+0.013
 pieces+0.013
Neg logit contributions
ah-0.143
gh-0.135
 repeat-0.131
Deal-0.130
iding-0.128
Token and
Token ID290
Feature activation-2.05e+00

Pos logit contributions
 curtains+0.062
 Some+0.060
 many+0.057
 pieces+0.056
edd+0.055
Neg logit contributions
ah-0.237
gh-0.225
Deal-0.218
 repeat-0.218
iding-0.213
Token nodded
Token ID14464
Feature activation-1.54e+00

Pos logit contributions
 many+0.033
+0.033
 amazed+0.029
 they+0.028
 flowers+0.027
Neg logit contributions
ah-0.111
gh-0.109
iding-0.105
Deal-0.101
 pose-0.097
Token.
Token ID13
Feature activation-2.48e-01

Pos logit contributions
+0.018
 a+0.016
 of+0.016
 the+0.013
 many+0.013
Neg logit contributions
ah-0.095
Deal-0.094
astics-0.089
 AW-0.087
 pose-0.087
Token He
Token ID679
Feature activation+9.88e-01

Pos logit contributions
 Some+0.005
 curtains+0.004
+0.004
 Someone+0.004
 many+0.004
Neg logit contributions
ah-0.013
gh-0.012
Deal-0.012
 repeat-0.012
 pose-0.012
Token loved
Token ID6151
Feature activation-2.51e+00

Pos logit contributions
ah+0.045
gh+0.044
iding+0.043
Deal+0.041
 cheating+0.040
Neg logit contributions
 many-0.021
-0.021
 pieces-0.020
 curtains-0.020
 of-0.019
Token pic
Token ID8301
Feature activation+1.30e+01

Pos logit contributions
 many+0.039
 curtains+0.036
 chairs+0.034
 flowers+0.033
 rooms+0.032
Neg logit contributions
ah-0.145
iding-0.137
gh-0.135
Deal-0.134
 repeat-0.130
Tokenn
Token ID77
Feature activation+4.58e+00

Pos logit contributions
ah+0.293
Deal+0.229
 repeat+0.229
gh+0.220
 pose+0.216
Neg logit contributions
edd-0.568
-0.568
 curtains-0.568
 many-0.564
 Some-0.561
Tokenics
Token ID873
Feature activation-2.42e+00

Pos logit contributions
ah+0.212
gh+0.197
Deal+0.194
 repeat+0.193
 pose+0.185
Neg logit contributions
 curtains-0.098
 Some-0.095
 many-0.094
 pieces-0.093
edd-0.092
Token and
Token ID290
Feature activation-1.29e+00

Pos logit contributions
 of+0.032
 many+0.031
 pieces+0.029
+0.026
 everywhere+0.026
Neg logit contributions
ah-0.136
gh-0.129
 repeat-0.128
 AW-0.128
heit-0.128
Token couldn
Token ID3521
Feature activation+2.78e+00

Pos logit contributions
 many+0.033
 flowers+0.031
 curtains+0.031
 chairs+0.030
 trees+0.030
Neg logit contributions
ah-0.060
gh-0.059
iding-0.056
Deal-0.052
et-0.050
Token't
Token ID470
Feature activation-3.48e-01

Pos logit contributions
ah+0.166
Deal+0.161
iding+0.157
gh+0.156
 repeat+0.156
Neg logit contributions
-0.026
 Some-0.022
 of-0.021
 many-0.019
 sky-0.018
Token the
Token ID262
Feature activation-8.58e-01

Pos logit contributions
ah+0.108
gh+0.103
 repeat+0.102
 pose+0.100
et+0.097
Neg logit contributions
edd-0.036
 Some-0.035
 Someone-0.034
rolley-0.033
Someone-0.033
Token warm
Token ID5814
Feature activation+3.68e+00

Pos logit contributions
 curtains+0.013
 rooms+0.012
 sky+0.012
 many+0.012
 pieces+0.011
Neg logit contributions
ah-0.049
gh-0.045
iding-0.045
Deal-0.045
 repeat-0.044
Token sunshine
Token ID34488
Feature activation-4.42e-01

Pos logit contributions
ah+0.150
et+0.135
gh+0.134
 repeat+0.133
 pose+0.127
Neg logit contributions
 Some-0.097
Someone-0.095
 Someone-0.093
edd-0.087
 scientists-0.086
Token and
Token ID290
Feature activation-1.85e+00

Pos logit contributions
 sky+0.005
 many+0.005
 curtains+0.005
 of+0.005
 pieces+0.005
Neg logit contributions
ah-0.026
gh-0.026
heit-0.025
 AW-0.025
iding-0.025
Token having
Token ID1719
Feature activation-1.23e+00

Pos logit contributions
 curtains+0.040
 many+0.038
 Some+0.037
 rooms+0.036
 pieces+0.036
Neg logit contributions
ah-0.090
gh-0.085
iding-0.081
Deal-0.080
 repeat-0.079
Token pic
Token ID8301
Feature activation+1.30e+01

Pos logit contributions
 many+0.033
 curtains+0.033
 Some+0.032
 pieces+0.032
+0.031
Neg logit contributions
ah-0.053
gh-0.048
iding-0.047
Deal-0.047
 repeat-0.047
Tokenn
Token ID77
Feature activation+4.31e+00

Pos logit contributions
ah+0.315
Deal+0.250
 repeat+0.249
 forth+0.239
 cheating+0.233
Neg logit contributions
 many-0.551
 curtains-0.550
-0.548
edd-0.546
 Some-0.541
Tokenics
Token ID873
Feature activation-2.25e+00

Pos logit contributions
ah+0.178
gh+0.170
iding+0.161
 repeat+0.157
et+0.150
Neg logit contributions
 curtains-0.118
 Some-0.117
 doorway-0.109
 Someone-0.109
 rooms-0.108
Token with
Token ID351
Feature activation-1.38e+00

Pos logit contributions
 many+0.037
 everywhere+0.031
 of+0.030
 pieces+0.029
+0.028
Neg logit contributions
ah-0.127
 repeat-0.118
heit-0.116
gh-0.116
iding-0.116
Token her
Token ID607
Feature activation-4.02e-01

Pos logit contributions
 curtains+0.026
 Some+0.026
edd+0.025
 doorway+0.025
rolley+0.025
Neg logit contributions
ah-0.066
gh-0.062
 repeat-0.061
Deal-0.059
 pose-0.058
Token family
Token ID1641
Feature activation-4.29e+00

Pos logit contributions
 curtains+0.005
 many+0.005
 pieces+0.005
 rooms+0.005
 Some+0.005
Neg logit contributions
ah-0.023
gh-0.022
 repeat-0.021
iding-0.021
Deal-0.021
Token see
Token ID766
Feature activation-4.60e+00

Pos logit contributions
ah+0.104
gh+0.099
iding+0.095
Deal+0.088
 cheating+0.086
Neg logit contributions
 many-0.065
-0.064
 flowers-0.062
 curtains-0.062
 trees-0.059
Token the
Token ID262
Feature activation-1.07e+00

Pos logit contributions
 curtains+0.073
 Some+0.072
 doorway+0.068
edd+0.068
 pieces+0.065
Neg logit contributions
ah-0.235
 repeat-0.221
Deal-0.218
gh-0.218
iding-0.210
Token bird
Token ID6512
Feature activation-2.47e+00

Pos logit contributions
 Some+0.022
 curtains+0.021
edd+0.020
 doorway+0.019
 Someone+0.019
Neg logit contributions
ah-0.051
gh-0.047
 repeat-0.047
Deal-0.046
et-0.044
Token and
Token ID290
Feature activation-1.33e+00

Pos logit contributions
 curtains+0.052
 of+0.050
 many+0.050
+0.049
 pieces+0.048
Neg logit contributions
ah-0.118
gh-0.114
Deal-0.110
iding-0.105
 repeat-0.104
Token have
Token ID423
Feature activation-2.54e+00

Pos logit contributions
 Some+0.029
 curtains+0.029
edd+0.028
 pieces+0.028
 doorway+0.027
Neg logit contributions
ah-0.061
 repeat-0.057
gh-0.056
Deal-0.055
 pose-0.053
Token pic
Token ID8301
Feature activation+1.30e+01

Pos logit contributions
 curtains+0.050
 Some+0.049
edd+0.048
 doorway+0.046
 Someone+0.045
Neg logit contributions
ah-0.122
gh-0.115
 repeat-0.112
Deal-0.110
et-0.106
Tokenn
Token ID77
Feature activation+4.71e+00

Pos logit contributions
ah+0.303
Deal+0.250
 repeat+0.234
gh+0.227
 pose+0.226
Neg logit contributions
 curtains-0.558
 many-0.557
edd-0.555
 Some-0.551
-0.549
Tokenics
Token ID873
Feature activation-2.28e+00

Pos logit contributions
ah+0.198
gh+0.188
iding+0.179
 repeat+0.176
Deal+0.169
Neg logit contributions
 curtains-0.127
 Some-0.125
 rooms-0.117
 doorway-0.116
 Someone-0.115
Token.
Token ID13
Feature activation+3.19e-01

Pos logit contributions
 many+0.041
 of+0.038
+0.037
 everywhere+0.035
 pieces+0.033
Neg logit contributions
ah-0.121
heit-0.112
gh-0.112
 repeat-0.111
-0.109
Token<|endoftext|>
Token ID50256
Feature activation-2.43e+00

Pos logit contributions
ah+0.013
Deal+0.012
+0.012
 repeat+0.011
 insist+0.011
Neg logit contributions
 curtains-0.008
 pieces-0.008
 scientists-0.008
 doorway-0.008
edd-0.007
TokenOnce
Token ID7454
Feature activation+2.58e+00

Pos logit contributions
 curtains+0.045
 scientists+0.044
edd+0.043
 Some+0.041
armac+0.041
Neg logit contributions
ah-0.114
gh-0.112
-0.101
 repeat-0.099
Deal-0.098
Token liked
Token ID8288
Feature activation-3.08e+00

Pos logit contributions
 many+0.000
 Some+0.000
 curtains+0.000
+0.000
 twins+0.000
Neg logit contributions
ah-0.000
gh-0.000
iding-0.000
 pose-0.000
 repeat-0.000
Token to
Token ID284
Feature activation+2.26e+00

Pos logit contributions
 curtains+0.035
 Some+0.034
 many+0.031
 pieces+0.030
edd+0.029
Neg logit contributions
ah-0.177
gh-0.167
 repeat-0.163
Deal-0.162
iding-0.160
Token play
Token ID711
Feature activation+2.60e+00

Pos logit contributions
ah+0.107
iding+0.106
bek+0.106
gh+0.104
Deal+0.101
Neg logit contributions
 flowers-0.049
 curtains-0.045
-0.044
 rooms-0.044
 many-0.044
Token and
Token ID290
Feature activation-1.17e+00

Pos logit contributions
ah+0.129
gh+0.118
 repeat+0.114
iding+0.113
Deal+0.111
Neg logit contributions
 many-0.053
 curtains-0.052
 pieces-0.050
 Some-0.050
-0.049
Token have
Token ID423
Feature activation-2.78e+00

Pos logit contributions
 many+0.034
 curtains+0.033
 Some+0.033
 flowers+0.032
+0.032
Neg logit contributions
ah-0.047
iding-0.045
gh-0.045
Deal-0.042
ig-0.040
Token pic
Token ID8301
Feature activation+1.30e+01

Pos logit contributions
 curtains+0.094
edd+0.093
 Some+0.092
 doorway+0.090
rolley+0.089
Neg logit contributions
ah-0.094
gh-0.087
 repeat-0.083
Deal-0.082
iding-0.078
Tokenn
Token ID77
Feature activation+4.45e+00

Pos logit contributions
ah+0.275
Deal+0.215
 repeat+0.212
gh+0.209
iding+0.192
Neg logit contributions
 curtains-0.596
 Some-0.595
 many-0.593
edd-0.586
 pieces-0.583
Tokenics
Token ID873
Feature activation-2.44e+00

Pos logit contributions
ah+0.183
gh+0.175
iding+0.165
 repeat+0.162
Deal+0.155
Neg logit contributions
 curtains-0.127
 Some-0.122
 doorway-0.119
 Someone-0.117
 rooms-0.115
Token.
Token ID13
Feature activation-3.48e-01

Pos logit contributions
 many+0.044
 everywhere+0.036
+0.035
 of+0.035
 pieces+0.035
Neg logit contributions
ah-0.131
gh-0.123
 repeat-0.121
iding-0.120
-0.120
Token One
Token ID1881
Feature activation-1.19e-01

Pos logit contributions
 curtains+0.009
edd+0.009
 doorway+0.009
rolley+0.009
 pieces+0.009
Neg logit contributions
ah-0.014
 repeat-0.013
Deal-0.013
gh-0.013
iding-0.012
Token day
Token ID1110
Feature activation+2.51e+00

Pos logit contributions
 Some+0.003
 many+0.003
 curtains+0.003
 of+0.003
 pieces+0.003
Neg logit contributions
ah-0.005
gh-0.004
 repeat-0.004
Deal-0.004
iding-0.004
Token stripes
Token ID28806
Feature activation+7.36e-01

Pos logit contributions
 many+0.028
 curtains+0.027
 Some+0.027
 pieces+0.026
 rooms+0.024
Neg logit contributions
ah-0.089
gh-0.083
Deal-0.082
iding-0.082
 repeat-0.081
Token.
Token ID13
Feature activation+1.03e+00

Pos logit contributions
ah+0.039
Deal+0.038
gh+0.038
 repeat+0.036
bek+0.036
Neg logit contributions
 of-0.013
 many-0.012
 Some-0.011
-0.011
 curtains-0.011
Token I
Token ID314
Feature activation+2.21e+00

Pos logit contributions
ah+0.052
gh+0.052
 forth+0.048
et+0.047
iding+0.047
Neg logit contributions
 Some-0.026
-0.022
 Someone-0.022
 curtains-0.021
 many-0.019
Token have
Token ID423
Feature activation-1.59e+00

Pos logit contributions
gh+0.097
iding+0.096
 forth+0.092
ah+0.090
Deal+0.088
Neg logit contributions
 of-0.052
-0.052
 many-0.050
 Some-0.049
-0.046
Token a
Token ID257
Feature activation-3.20e-01

Pos logit contributions
 many+0.027
 curtains+0.026
 Some+0.025
 pieces+0.025
 flowers+0.022
Neg logit contributions
ah-0.084
Deal-0.079
gh-0.079
iding-0.078
 repeat-0.077
Token st
Token ID336
Feature activation+1.30e+01

Pos logit contributions
 big+0.007
 many+0.007
 curtains+0.007
 tiny+0.007
 sky+0.007
Neg logit contributions
Deal-0.015
iding-0.015
ah-0.015
-0.014
gh-0.014
Tokeninger
Token ID3889
Feature activation-1.03e-01

Pos logit contributions
ah+0.449
gh+0.416
Deal+0.408
 repeat+0.401
iding+0.379
Neg logit contributions
 curtains-0.425
 Some-0.422
 many-0.409
edd-0.407
 pieces-0.407
Token.
Token ID13
Feature activation+1.12e+00

Pos logit contributions
 of+0.002
 curtains+0.002
 many+0.002
 pieces+0.002
 flowers+0.002
Neg logit contributions
Deal-0.005
ah-0.005
gh-0.005
-0.005
iding-0.005
Token I
Token ID314
Feature activation+2.28e+00

Pos logit contributions
ah+0.058
gh+0.057
 forth+0.053
et+0.052
iding+0.052
Neg logit contributions
 Some-0.027
-0.023
 Someone-0.023
 curtains-0.022
 of-0.020
Token can
Token ID460
Feature activation+2.55e+00

Pos logit contributions
gh+0.099
iding+0.098
 forth+0.094
ah+0.093
+0.090
Neg logit contributions
 of-0.054
-0.053
 many-0.051
 Some-0.051
-0.048
Token make
Token ID787
Feature activation-1.73e+00

Pos logit contributions
ah+0.091
Deal+0.085
gh+0.085
iding+0.082
bek+0.078
Neg logit contributions
 curtains-0.084
 Some-0.080
-0.080
 of-0.079
 many-0.079
Token excited
Token ID6568
Feature activation-1.84e+00

Pos logit contributions
 curtains+0.003
 Some+0.003
 pieces+0.003
edd+0.003
 doorway+0.003
Neg logit contributions
ah-0.005
gh-0.005
 repeat-0.005
Deal-0.005
iding-0.005
Token and
Token ID290
Feature activation-1.45e+00

Pos logit contributions
 Some+0.038
 curtains+0.036
edd+0.035
Someone+0.033
 doorway+0.033
Neg logit contributions
ah-0.088
gh-0.082
 repeat-0.080
Deal-0.078
iding-0.077
Token wanted
Token ID2227
Feature activation-1.19e+00

Pos logit contributions
 many+0.029
 curtains+0.029
 Some+0.027
+0.026
 rooms+0.026
Neg logit contributions
ah-0.073
gh-0.067
iding-0.064
Deal-0.063
 repeat-0.062
Token to
Token ID284
Feature activation+1.87e+00

Pos logit contributions
rolley+0.023
edd+0.022
armac+0.022
 doorway+0.021
 curtains+0.021
Neg logit contributions
ah-0.056
gh-0.055
 repeat-0.052
 pose-0.049
et-0.048
Token take
Token ID1011
Feature activation+1.29e+00

Pos logit contributions
ah+0.071
gh+0.065
Deal+0.064
iding+0.061
et+0.059
Neg logit contributions
 curtains-0.058
 Some-0.056
 many-0.055
-0.055
 of-0.054
Token photographs
Token ID12566
Feature activation-8.73e+00

Pos logit contributions
ah+0.072
gh+0.068
 repeat+0.067
Deal+0.066
iding+0.065
Neg logit contributions
 curtains-0.016
 Some-0.015
 many-0.014
 pieces-0.014
edd-0.014
Token of
Token ID286
Feature activation-2.87e+00

Pos logit contributions
 curtains+0.154
 many+0.144
 Some+0.143
 pieces+0.143
 of+0.140
Neg logit contributions
ah-0.448
gh-0.420
 repeat-0.408
Deal-0.407
iding-0.400
Token everything
Token ID2279
Feature activation-4.42e+00

Pos logit contributions
 Some+0.060
rolley+0.058
edd+0.056
 doorway+0.055
armac+0.054
Neg logit contributions
ah-0.128
 repeat-0.125
gh-0.124
Deal-0.119
 compete-0.114
Token around
Token ID1088
Feature activation-1.74e+00

Pos logit contributions
 curtains+0.051
 many+0.047
 of+0.047
 pieces+0.044
+0.042
Neg logit contributions
ah-0.261
gh-0.244
 repeat-0.237
Deal-0.235
iding-0.234
Token her
Token ID607
Feature activation+1.33e-01

Pos logit contributions
 curtains+0.026
 Some+0.023
 many+0.023
 pieces+0.022
 rooms+0.022
Neg logit contributions
ah-0.096
gh-0.089
Deal-0.088
iding-0.087
 repeat-0.087
Token.
Token ID13
Feature activation-3.75e-01

Pos logit contributions
ah+0.007
gh+0.006
 repeat+0.006
Deal+0.006
iding+0.006
Neg logit contributions
 curtains-0.003
 many-0.002
 rooms-0.002
 pieces-0.002
 Some-0.002
Token came
Token ID1625
Feature activation-9.21e-01

Pos logit contributions
 curtains+0.013
 many+0.013
 Some+0.012
 rooms+0.012
+0.012
Neg logit contributions
ah-0.030
gh-0.029
iding-0.028
Deal-0.027
-0.026
Token back
Token ID736
Feature activation-6.66e-01

Pos logit contributions
 Some+0.017
 curtains+0.017
edd+0.016
 pieces+0.016
 many+0.015
Neg logit contributions
ah-0.045
gh-0.042
 repeat-0.042
iding-0.041
Deal-0.040
Token home
Token ID1363
Feature activation-2.78e+00

Pos logit contributions
 curtains+0.009
 many+0.008
 Some+0.008
+0.008
 pieces+0.008
Neg logit contributions
ah-0.037
gh-0.035
Deal-0.034
 repeat-0.034
iding-0.033
Token empty
Token ID6565
Feature activation-1.12e+00

Pos logit contributions
 Some+0.052
 curtains+0.051
edd+0.047
 pieces+0.046
 many+0.045
Neg logit contributions
ah-0.135
gh-0.128
 repeat-0.125
Deal-0.122
iding-0.122
Token-
Token ID12
Feature activation+3.12e+00

Pos logit contributions
 curtains+0.021
 pieces+0.019
 rooms+0.018
 many+0.018
 chairs+0.018
Neg logit contributions
ah-0.060
gh-0.055
Deal-0.054
 repeat-0.054
iding-0.053
Tokenhanded
Token ID13638
Feature activation-8.56e+00

Pos logit contributions
ah+0.154
gh+0.145
Deal+0.138
et+0.136
 repeat+0.133
Neg logit contributions
 Some-0.059
 curtains-0.058
 Someone-0.055
 sky-0.052
 pieces-0.050
Token and
Token ID290
Feature activation-1.76e+00

Pos logit contributions
 Some+0.186
 curtains+0.182
edd+0.168
 many+0.168
 pieces+0.165
Neg logit contributions
ah-0.395
gh-0.367
 repeat-0.359
Deal-0.353
iding-0.346
Token told
Token ID1297
Feature activation-2.33e+00

Pos logit contributions
 curtains+0.036
 many+0.036
 Some+0.034
 rooms+0.034
 pieces+0.034
Neg logit contributions
ah-0.087
gh-0.080
iding-0.078
Deal-0.076
 repeat-0.076
Token Lily
Token ID20037
Feature activation-1.57e+00

Pos logit contributions
 curtains+0.032
 many+0.031
 Some+0.029
edd+0.027
 rooms+0.026
Neg logit contributions
ah-0.132
gh-0.123
Deal-0.122
iding-0.122
 repeat-0.119
Token the
Token ID262
Feature activation-6.34e-01

Pos logit contributions
 curtains+0.033
edd+0.031
 Some+0.031
 doorway+0.031
Someone+0.030
Neg logit contributions
ah-0.074
gh-0.069
 repeat-0.068
 pose-0.065
iding-0.064
Token news
Token ID1705
Feature activation-1.13e+00

Pos logit contributions
 curtains+0.014
 sky+0.013
 pieces+0.013
 many+0.012
 twins+0.012
Neg logit contributions
ah-0.032
iding-0.030
gh-0.029
Deal-0.028
 repeat-0.028
Token he
Token ID339
Feature activation+1.08e+00

Pos logit contributions
 curtains+0.032
 many+0.031
 Some+0.031
 pieces+0.029
 rooms+0.027
Neg logit contributions
ah-0.123
gh-0.115
Deal-0.112
 repeat-0.112
iding-0.110
Token was
Token ID373
Feature activation-2.22e-01

Pos logit contributions
ah+0.050
gh+0.047
 repeat+0.045
Deal+0.045
iding+0.044
Neg logit contributions
 curtains-0.024
 Some-0.023
 pieces-0.022
 many-0.022
edd-0.022
Token caught
Token ID4978
Feature activation-1.38e+00

Pos logit contributions
 twins+0.004
 many+0.004
+0.004
 of+0.004
 curtains+0.004
Neg logit contributions
ah-0.011
Deal-0.011
iding-0.010
gh-0.010
 pose-0.010
Token red
Token ID2266
Feature activation+6.98e-01

Pos logit contributions
 pieces+0.022
 of+0.020
 many+0.020
 curtains+0.018
+0.018
Neg logit contributions
ah-0.076
Deal-0.072
gh-0.071
 repeat-0.071
iding-0.070
Token-
Token ID12
Feature activation+3.28e+00

Pos logit contributions
ah+0.035
gh+0.035
Deal+0.034
iding+0.033
 repeat+0.033
Neg logit contributions
 curtains-0.013
 pieces-0.012
edd-0.012
 Some-0.011
 of-0.011
Tokenhanded
Token ID13638
Feature activation-8.44e+00

Pos logit contributions
ah+0.127
gh+0.112
 repeat+0.106
Deal+0.104
et+0.103
Neg logit contributions
 curtains-0.104
 rooms-0.100
 Some-0.098
 doorway-0.098
 pieces-0.095
Token.
Token ID13
Feature activation-6.32e-01

Pos logit contributions
 pieces+0.150
 curtains+0.146
 of+0.141
 twins+0.137
 many+0.136
Neg logit contributions
ah-0.441
gh-0.419
Deal-0.406
 repeat-0.401
iding-0.395
Token But
Token ID887
Feature activation+7.62e-01

Pos logit contributions
 curtains+0.013
edd+0.013
 pieces+0.012
 Some+0.012
 rooms+0.012
Neg logit contributions
ah-0.029
gh-0.028
 repeat-0.027
Deal-0.027
iding-0.026
Token he
Token ID339
Feature activation+1.82e+00

Pos logit contributions
ah+0.042
gh+0.039
iding+0.038
Deal+0.038
 pose+0.036
Neg logit contributions
 many-0.012
 of-0.011
-0.011
 Some-0.011
 curtains-0.011
Token didn
Token ID1422
Feature activation+1.91e+00

Pos logit contributions
iding+0.090
ah+0.090
gh+0.089
Deal+0.085
cluded+0.083
Neg logit contributions
-0.028
 of-0.028
 many-0.026
 nowhere-0.026
 was-0.026
Token't
Token ID470
Feature activation-8.75e-02

Pos logit contributions
ah+0.092
gh+0.090
 repeat+0.086
iding+0.082
 pose+0.080
Neg logit contributions
 curtains-0.038
 Some-0.036
 rooms-0.035
edd-0.035
 doorway-0.035
Token and
Token ID290
Feature activation-1.70e+00

Pos logit contributions
 pieces+0.000
 curtains+0.000
 many+0.000
 of+0.000
 sky+0.000
Neg logit contributions
ah-0.000
gh-0.000
Deal-0.000
 repeat-0.000
-0.000
Token the
Token ID262
Feature activation-7.72e-01

Pos logit contributions
 many+0.041
 curtains+0.040
 pieces+0.039
 sky+0.038
 flowers+0.037
Neg logit contributions
ah-0.081
gh-0.077
iding-0.074
Deal-0.071
 repeat-0.068
Token trees
Token ID7150
Feature activation-1.38e+00

Pos logit contributions
 curtains+0.013
 sky+0.013
 pieces+0.012
 flowers+0.012
 trees+0.011
Neg logit contributions
ah-0.042
gh-0.042
iding-0.041
Deal-0.039
astics-0.038
Token looked
Token ID3114
Feature activation-1.84e+00

Pos logit contributions
 of+0.026
 curtains+0.024
 sky+0.024
 many+0.024
 flowers+0.023
Neg logit contributions
ah-0.071
gh-0.069
Deal-0.068
-0.066
astics-0.065
Token gre
Token ID10536
Feature activation+6.24e+00

Pos logit contributions
 many+0.038
 curtains+0.038
 of+0.036
 Some+0.035
 sky+0.035
Neg logit contributions
ah-0.090
gh-0.084
Deal-0.083
 repeat-0.080
iding-0.078
Tokenener
Token ID877
Feature activation-8.34e+00

Pos logit contributions
gh+0.290
ah+0.270
Deal+0.263
 repeat+0.255
astics+0.247
Neg logit contributions
 curtains-0.152
edd-0.148
 pieces-0.140
ed-0.140
 flowers-0.137
Token.
Token ID13
Feature activation+2.63e-01

Pos logit contributions
 of+0.140
 curtains+0.130
 pieces+0.129
 everywhere+0.121
 flowers+0.111
Neg logit contributions
Deal-0.457
ah-0.453
gh-0.432
-0.417
initely-0.416
Token
Token ID198
Feature activation-9.74e-02

Pos logit contributions
ah+0.015
gh+0.014
 repeat+0.014
Deal+0.014
iding+0.014
Neg logit contributions
 Some-0.003
 curtains-0.003
 many-0.003
-0.003
edd-0.003
Token
Token ID198
Feature activation+2.74e-01

Pos logit contributions
 curtains+0.001
 pieces+0.001
edd+0.001
 rooms+0.001
 Some+0.001
Neg logit contributions
ah-0.006
gh-0.006
 repeat-0.005
Deal-0.005
iding-0.005
TokenFinally
Token ID11158
Feature activation+1.32e+00

Pos logit contributions
ah+0.013
gh+0.012
 repeat+0.012
Deal+0.012
et+0.012
Neg logit contributions
 Some-0.006
 many-0.005
-0.005
Someone-0.005
 curtains-0.005
Token,
Token ID11
Feature activation+1.12e+00

Pos logit contributions
ah+0.073
Deal+0.070
gh+0.069
iding+0.067
astics+0.065
Neg logit contributions
 many-0.021
 curtains-0.020
 of-0.019
-0.017
 chairs-0.017
Token the
Token ID262
Feature activation-2.23e+00

Pos logit contributions
ah+0.083
gh+0.079
 repeat+0.077
Deal+0.076
iding+0.075
Neg logit contributions
 curtains-0.016
 Some-0.016
 many-0.014
 pieces-0.014
edd-0.014
Token park
Token ID3952
Feature activation+7.83e-01

Pos logit contributions
 curtains+0.044
 Some+0.044
 pieces+0.041
edd+0.041
 many+0.040
Neg logit contributions
ah-0.108
gh-0.100
 repeat-0.098
Deal-0.097
iding-0.095
Token.
Token ID13
Feature activation-3.37e-01

Pos logit contributions
ah+0.038
gh+0.036
Deal+0.035
 repeat+0.035
iding+0.034
Neg logit contributions
 many-0.015
 Some-0.015
 curtains-0.015
-0.014
 pieces-0.014
Token Dad
Token ID17415
Feature activation-1.91e+00

Pos logit contributions
 pieces+0.009
edd+0.009
armac+0.008
rolley+0.008
iring+0.008
Neg logit contributions
ah-0.013
 repeat-0.013
 compete-0.012
-0.011
 skate-0.011
Token showed
Token ID3751
Feature activation-2.96e+00

Pos logit contributions
 curtains+0.042
 Some+0.042
edd+0.039
 pieces+0.039
 many+0.039
Neg logit contributions
ah-0.088
gh-0.081
 repeat-0.080
Deal-0.078
iding-0.076
Token mom
Token ID1995
Feature activation-8.29e+00

Pos logit contributions
 doorway+0.089
Someone+0.088
 curtains+0.087
 sky+0.081
 Some+0.080
Neg logit contributions
ah-0.108
et-0.095
 repeat-0.095
gh-0.092
ercise-0.084
Token and
Token ID290
Feature activation-1.93e+00

Pos logit contributions
 Someone+0.249
Someone+0.239
 doorway+0.239
 curtains+0.218
 Some+0.218
Neg logit contributions
ah-0.347
 repeat-0.287
et-0.287
iding-0.270
 pose-0.257
Token the
Token ID262
Feature activation-2.39e+00

Pos logit contributions
 curtains+0.048
 doorway+0.047
 Some+0.047
edd+0.045
colored+0.045
Neg logit contributions
ah-0.075
 repeat-0.074
 insist-0.069
 forth-0.068
Deal-0.068
Token 3
Token ID513
Feature activation-1.10e+00

Pos logit contributions
 Some+0.042
 curtains+0.041
edd+0.038
 doorway+0.037
 Someone+0.036
Neg logit contributions
ah-0.115
 repeat-0.107
gh-0.105
Deal-0.102
 insist-0.100
Token year
Token ID614
Feature activation+4.19e-01

Pos logit contributions
 curtains+0.036
 Some+0.034
edd+0.033
 doorway+0.033
 Someone+0.032
Neg logit contributions
ah-0.041
gh-0.037
Deal-0.034
 repeat-0.034
et-0.034
Token old
Token ID1468
Feature activation-2.56e+00

Pos logit contributions
ah+0.028
gh+0.026
 repeat+0.026
Deal+0.026
iding+0.025
Neg logit contributions
 curtains-0.001
 Some-0.001
 many-0.001
 pieces-0.001
edd-0.001
Token grown
Token ID7334
Feature activation+1.28e+00

Pos logit contributions
 Some+0.023
 curtains+0.023
edd+0.023
rolley+0.021
 rooms+0.021
Neg logit contributions
ah-0.043
gh-0.040
 repeat-0.039
Deal-0.037
et-0.037
Token to
Token ID284
Feature activation+1.60e+00

Pos logit contributions
ah+0.054
gh+0.051
et+0.048
iding+0.047
 repeat+0.047
Neg logit contributions
 Some-0.034
 curtains-0.032
 Someone-0.030
Someone-0.029
 doorway-0.029
Token be
Token ID307
Feature activation+1.29e+00

Pos logit contributions
ah+0.085
Deal+0.077
gh+0.073
iding+0.073
ercise+0.070
Neg logit contributions
 many-0.036
 curtains-0.032
 a-0.031
-0.030
 rooms-0.030
Token even
Token ID772
Feature activation-3.43e+00

Pos logit contributions
ah+0.063
Deal+0.057
gh+0.056
 repeat+0.056
iding+0.054
Neg logit contributions
 many-0.028
 pieces-0.028
 curtains-0.027
 rooms-0.026
 Some-0.025
Token gre
Token ID10536
Feature activation+7.10e+00

Pos logit contributions
 many+0.077
 pieces+0.075
 curtains+0.072
 Some+0.069
 of+0.069
Neg logit contributions
ah-0.165
gh-0.157
iding-0.147
Deal-0.144
 repeat-0.143
Tokenener
Token ID877
Feature activation-8.19e+00

Pos logit contributions
gh+0.291
ah+0.287
Deal+0.267
 repeat+0.265
iding+0.252
Neg logit contributions
 curtains-0.192
edd-0.189
 pieces-0.188
rolley-0.177
 flowers-0.174
Token and
Token ID290
Feature activation-1.51e+00

Pos logit contributions
 of+0.119
 pieces+0.114
 curtains+0.113
 twins+0.099
 many+0.099
Neg logit contributions
ah-0.443
Deal-0.436
gh-0.433
initely-0.410
 repeat-0.410
Token brighter
Token ID26083
Feature activation+3.59e-01

Pos logit contributions
 many+0.035
 curtains+0.031
 Some+0.030
 pieces+0.030
 sky+0.029
Neg logit contributions
ah-0.075
gh-0.072
iding-0.070
Deal-0.065
et-0.063
Token than
Token ID621
Feature activation-1.03e-01

Pos logit contributions
ah+0.018
gh+0.016
Deal+0.016
 repeat+0.016
iding+0.015
Neg logit contributions
 many-0.008
 curtains-0.008
 pieces-0.008
 rooms-0.007
 flowers-0.007
Token before
Token ID878
Feature activation-2.51e+00

Pos logit contributions
 many+0.002
 curtains+0.002
 pieces+0.002
 of+0.002
 Some+0.002
Neg logit contributions
ah-0.005
gh-0.005
iding-0.005
Deal-0.005
 repeat-0.005
Token.
Token ID13
Feature activation+7.96e-01

Pos logit contributions
 curtains+0.046
 many+0.046
 pieces+0.043
+0.042
 sky+0.042
Neg logit contributions
ah-0.134
Deal-0.125
 repeat-0.121
gh-0.120
iding-0.119
Token and
Token ID290
Feature activation+8.18e-01

Pos logit contributions
 curtains+0.021
 Some+0.020
 pieces+0.019
 many+0.019
edd+0.018
Neg logit contributions
ah-0.079
gh-0.076
 repeat-0.072
Deal-0.072
iding-0.072
Token Ben
Token ID3932
Feature activation+1.11e+00

Pos logit contributions
ah+0.040
gh+0.038
Deal+0.038
iding+0.036
 repeat+0.035
Neg logit contributions
 many-0.015
 Some-0.015
 curtains-0.015
-0.015
 twins-0.015
Token smiled
Token ID13541
Feature activation-4.42e+00

Pos logit contributions
ah+0.054
gh+0.054
iding+0.050
 pose+0.049
Deal+0.048
Neg logit contributions
 many-0.022
 curtains-0.021
-0.021
 Some-0.020
 sky-0.020
Token and
Token ID290
Feature activation-1.82e+00

Pos logit contributions
 curtains+0.101
 Some+0.100
edd+0.097
 doorway+0.096
 rooms+0.091
Neg logit contributions
ah-0.194
 repeat-0.182
gh-0.178
iding-0.175
Deal-0.174
Token hugged
Token ID41130
Feature activation-3.45e+00

Pos logit contributions
 curtains+0.046
 Some+0.044
 many+0.044
 pieces+0.043
edd+0.042
Neg logit contributions
ah-0.080
gh-0.075
iding-0.071
Deal-0.070
 repeat-0.070
Token mom
Token ID1995
Feature activation-8.12e+00

Pos logit contributions
 curtains+0.049
 many+0.049
 Some+0.048
 pieces+0.044
+0.043
Neg logit contributions
ah-0.191
gh-0.179
 repeat-0.173
Deal-0.173
iding-0.172
Token.
Token ID13
Feature activation-7.05e-01

Pos logit contributions
 curtains+0.166
 Some+0.165
 doorway+0.154
 Someone+0.152
edd+0.152
Neg logit contributions
ah-0.390
gh-0.352
 repeat-0.351
Deal-0.343
iding-0.336
Token "
Token ID366
Feature activation-8.00e-01

Pos logit contributions
 curtains+0.010
 Some+0.010
 pieces+0.009
edd+0.009
 many+0.009
Neg logit contributions
ah-0.037
gh-0.035
 repeat-0.035
Deal-0.034
iding-0.034
TokenWe
Token ID1135
Feature activation+2.84e+00

Pos logit contributions
 curtains+0.021
 Some+0.021
 pieces+0.020
edd+0.020
 many+0.020
Neg logit contributions
ah-0.033
gh-0.031
 repeat-0.030
Deal-0.030
iding-0.029
Token feel
Token ID1254
Feature activation-4.42e+00

Pos logit contributions
ah+0.127
gh+0.126
iding+0.121
Deal+0.112
 pose+0.112
Neg logit contributions
 curtains-0.064
 many-0.062
 Some-0.061
 pieces-0.059
 sky-0.058
Token happy
Token ID3772
Feature activation-2.18e+00

Pos logit contributions
 Some+0.113
 curtains+0.113
edd+0.111
 doorway+0.109
 pieces+0.108
Neg logit contributions
ah-0.178
 repeat-0.167
gh-0.166
Deal-0.162
iding-0.155
Token better
Token ID1365
Feature activation-9.63e-01

Pos logit contributions
 Some+0.077
 curtains+0.077
edd+0.076
armac+0.074
 doorway+0.073
Neg logit contributions
ah-0.080
 repeat-0.073
gh-0.072
Deal-0.070
et-0.067
Token!
Token ID0
Feature activation-1.32e-01

Pos logit contributions
 pieces+0.017
 many+0.017
 curtains+0.016
 of+0.016
+0.015
Neg logit contributions
ah-0.050
gh-0.046
 repeat-0.045
Deal-0.045
iding-0.044
Token They
Token ID1119
Feature activation+1.45e-01

Pos logit contributions
 curtains+0.004
 pieces+0.004
 posters+0.004
edd+0.004
 doorway+0.004
Neg logit contributions
ah-0.005
Deal-0.004
gh-0.004
 repeat-0.004
-0.004
Token were
Token ID547
Feature activation-1.26e+00

Pos logit contributions
ah+0.007
 repeat+0.006
Deal+0.006
gh+0.006
et+0.006
Neg logit contributions
 Some-0.003
 curtains-0.003
edd-0.003
 Someone-0.003
 rooms-0.003
Token gre
Token ID10536
Feature activation+7.24e+00

Pos logit contributions
 Some+0.030
 curtains+0.029
edd+0.028
rolley+0.028
 Someone+0.028
Neg logit contributions
ah-0.053
 repeat-0.051
gh-0.050
et-0.047
iding-0.047
Tokenener
Token ID877
Feature activation-8.07e+00

Pos logit contributions
ah+0.258
gh+0.247
 repeat+0.229
Deal+0.224
iding+0.218
Neg logit contributions
 curtains-0.235
 Some-0.230
edd-0.229
 pieces-0.226
 many-0.219
Token and
Token ID290
Feature activation-1.28e+00

Pos logit contributions
 curtains+0.138
 Some+0.134
 pieces+0.126
edd+0.125
 many+0.125
Neg logit contributions
ah-0.411
gh-0.387
 repeat-0.377
Deal-0.373
iding-0.367
Token taller
Token ID25242
Feature activation+2.13e+00

Pos logit contributions
 curtains+0.025
 Some+0.025
 many+0.024
 pieces+0.023
 rooms+0.023
Neg logit contributions
ah-0.063
gh-0.060
iding-0.057
 repeat-0.056
Deal-0.056
Token than
Token ID621
Feature activation+3.02e-01

Pos logit contributions
ah+0.111
gh+0.107
 repeat+0.105
+0.101
iding+0.101
Neg logit contributions
 pieces-0.038
 many-0.037
 curtains-0.034
 of-0.033
-0.033
Token before
Token ID878
Feature activation-2.42e+00

Pos logit contributions
ah+0.015
gh+0.014
Deal+0.014
 repeat+0.014
iding+0.014
Neg logit contributions
 many-0.005
 curtains-0.005
 Some-0.005
 pieces-0.005
 of-0.005
Token.
Token ID13
Feature activation+6.89e-01

Pos logit contributions
 many+0.043
 curtains+0.042
 pieces+0.042
+0.040
 Some+0.039
Neg logit contributions
ah-0.127
Deal-0.118
 repeat-0.116
gh-0.115
iding-0.114
Token walked
Token ID6807
Feature activation+3.62e+00

Pos logit contributions
ah+0.044
gh+0.043
iding+0.042
Deal+0.042
+0.039
Neg logit contributions
-0.016
 curtains-0.016
 many-0.015
 of-0.015
 pieces-0.015
Token around
Token ID1088
Feature activation-1.41e+00

Pos logit contributions
ah+0.171
gh+0.162
Deal+0.157
 repeat+0.156
iding+0.153
Neg logit contributions
 curtains-0.077
 Some-0.074
 many-0.072
 pieces-0.071
edd-0.069
Token and
Token ID290
Feature activation-1.33e+00

Pos logit contributions
 curtains+0.018
 many+0.016
 pieces+0.015
 Some+0.015
+0.015
Neg logit contributions
ah-0.082
gh-0.077
Deal-0.077
iding-0.075
 repeat-0.074
Token gave
Token ID2921
Feature activation-2.26e+00

Pos logit contributions
 curtains+0.025
 many+0.024
+0.023
 Some+0.022
 doorway+0.022
Neg logit contributions
ah-0.067
gh-0.065
iding-0.063
Deal-0.062
 repeat-0.060
Token each
Token ID1123
Feature activation-2.05e+00

Pos logit contributions
 Some+0.060
 curtains+0.059
 doorway+0.057
 sky+0.057
armac+0.056
Neg logit contributions
ah-0.095
gh-0.085
 repeat-0.082
 pose-0.080
et-0.080
Token person
Token ID1048
Feature activation-7.95e+00

Pos logit contributions
armac+0.039
 rooms+0.038
edd+0.038
 curtains+0.037
tub+0.036
Neg logit contributions
ah-0.104
 repeat-0.089
gh-0.085
 pose-0.085
et-0.082
Token something
Token ID1223
Feature activation-6.82e+00

Pos logit contributions
 curtains+0.115
 Some+0.111
edd+0.100
 pieces+0.100
 doorway+0.100
Neg logit contributions
ah-0.427
gh-0.399
 repeat-0.391
Deal-0.383
iding-0.379
Token.
Token ID13
Feature activation-2.90e-01

Pos logit contributions
 curtains+0.133
 Some+0.127
 many+0.126
 pieces+0.124
edd+0.121
Neg logit contributions
ah-0.338
gh-0.311
 repeat-0.304
Deal-0.304
iding-0.300
Token He
Token ID679
Feature activation+1.13e+00

Pos logit contributions
edd+0.007
armac+0.007
 curtains+0.007
 pieces+0.007
 posters+0.007
Neg logit contributions
ah-0.012
 repeat-0.011
gh-0.011
 pose-0.011
iding-0.011
Token gave
Token ID2921
Feature activation-1.97e+00

Pos logit contributions
ah+0.053
gh+0.050
Deal+0.048
iding+0.048
 repeat+0.047
Neg logit contributions
 curtains-0.025
 pieces-0.024
 many-0.023
 Some-0.023
edd-0.023
Token Mia
Token ID32189
Feature activation-1.76e+00

Pos logit contributions
 Some+0.050
 curtains+0.049
 doorway+0.048
armac+0.048
 rooms+0.048
Neg logit contributions
ah-0.085
gh-0.076
 repeat-0.074
et-0.072
 pose-0.072
Token the
Token ID262
Feature activation-1.66e+00

Pos logit contributions
ah+0.068
gh+0.065
 repeat+0.064
Deal+0.061
 pose+0.061
Neg logit contributions
 Some-0.023
edd-0.023
 curtains-0.022
armac-0.021
rolley-0.020
Token car
Token ID1097
Feature activation+2.89e-01

Pos logit contributions
 curtains+0.028
 sky+0.026
 many+0.026
 pieces+0.026
 doorway+0.026
Neg logit contributions
ah-0.088
gh-0.083
iding-0.082
 repeat-0.081
Deal-0.080
Token Jim
Token ID5395
Feature activation-5.16e-02

Pos logit contributions
ah+0.015
gh+0.015
Deal+0.014
 repeat+0.014
iding+0.014
Neg logit contributions
 of-0.005
 curtains-0.005
 many-0.005
 pieces-0.005
-0.005
Token started
Token ID2067
Feature activation-2.78e+00

Pos logit contributions
+0.001
 curtains+0.001
 sky+0.001
 pieces+0.001
 of+0.001
Neg logit contributions
Deal-0.003
ah-0.003
gh-0.003
iding-0.003
 cheating-0.002
Token to
Token ID284
Feature activation+3.97e-01

Pos logit contributions
 of+0.027
 many+0.026
 curtains+0.024
 pieces+0.024
+0.022
Neg logit contributions
Deal-0.172
ah-0.172
gh-0.156
-0.156
iding-0.155
Token feel
Token ID1254
Feature activation-7.90e+00

Pos logit contributions
ah+0.017
Deal+0.017
gh+0.016
iding+0.015
ig+0.015
Neg logit contributions
 curtains-0.012
 a-0.011
 many-0.011
 rooms-0.011
-0.011
Token better
Token ID1365
Feature activation-1.50e+00

Pos logit contributions
 curtains+0.198
 Some+0.195
edd+0.186
 pieces+0.185
 many+0.185
Neg logit contributions
ah-0.339
gh-0.316
 repeat-0.306
Deal-0.301
iding-0.296
Token.
Token ID13
Feature activation+1.92e-01

Pos logit contributions
 many+0.024
 of+0.022
+0.022
 pieces+0.022
 twins+0.021
Neg logit contributions
ah-0.084
gh-0.078
Deal-0.076
 repeat-0.074
initely-0.073
Token His
Token ID2399
Feature activation+1.19e+00

Pos logit contributions
ah+0.011
gh+0.010
 repeat+0.010
iding+0.010
Deal+0.010
Neg logit contributions
 Some-0.003
-0.003
 Someone-0.002
 many-0.002
 curtains-0.002
Token mother
Token ID2802
Feature activation-4.67e+00

Pos logit contributions
ah+0.075
iding+0.070
Deal+0.070
h+0.068
gh+0.068
Neg logit contributions
 twins-0.010
 family-0.010
 parents-0.010
 curtains-0.010
 grandma-0.010
Token was
Token ID373
Feature activation-1.21e+00

Pos logit contributions
 of+0.085
 curtains+0.081
 pieces+0.079
+0.076
 Some+0.076
Neg logit contributions
gh-0.229
ah-0.228
iding-0.222
Deal-0.218
astics-0.213
Token seem
Token ID1283
Feature activation-2.96e+00

Pos logit contributions
 of+0.040
+0.036
 many+0.035
 family+0.034
 people+0.033
Neg logit contributions
ah-0.063
Deal-0.061
gh-0.059
ig-0.057
bek-0.057
Token to
Token ID284
Feature activation+1.16e+00

Pos logit contributions
 many+0.044
 of+0.037
+0.035
 pieces+0.034
 everywhere+0.033
Neg logit contributions
ah-0.170
gh-0.161
Deal-0.158
 repeat-0.153
iding-0.152
Token make
Token ID787
Feature activation-3.12e+00

Pos logit contributions
ah+0.052
Deal+0.049
iding+0.048
gh+0.047
ig+0.046
Neg logit contributions
-0.031
 many-0.030
 a-0.030
 the-0.029
 family-0.028
Token her
Token ID607
Feature activation-1.34e+00

Pos logit contributions
 many+0.043
 pieces+0.038
 curtains+0.038
 twins+0.034
 flowers+0.033
Neg logit contributions
ah-0.185
Deal-0.173
gh-0.167
iding-0.166
 repeat-0.163
Token tie
Token ID9839
Feature activation-9.08e-01

Pos logit contributions
 curtains+0.035
 pieces+0.034
 many+0.034
 twins+0.032
 rooms+0.032
Neg logit contributions
ah-0.059
Deal-0.054
gh-0.054
iding-0.053
 repeat-0.050
Token feel
Token ID1254
Feature activation-7.85e+00

Pos logit contributions
 pieces+0.016
 curtains+0.016
 many+0.016
 of+0.016
 flowers+0.014
Neg logit contributions
ah-0.046
Deal-0.044
gh-0.043
 repeat-0.042
iding-0.042
Token better
Token ID1365
Feature activation-1.35e+00

Pos logit contributions
 many+0.208
 pieces+0.201
 curtains+0.201
 of+0.199
 Some+0.199
Neg logit contributions
ah-0.350
gh-0.315
Deal-0.303
iding-0.302
 repeat-0.297
Token.
Token ID13
Feature activation+6.89e-02

Pos logit contributions
 many+0.021
 pieces+0.020
 of+0.019
+0.019
 twins+0.018
Neg logit contributions
ah-0.075
gh-0.069
Deal-0.067
 repeat-0.067
iding-0.065
Token She
Token ID1375
Feature activation+4.13e-01

Pos logit contributions
ah+0.004
gh+0.004
iding+0.004
 repeat+0.004
et+0.004
Neg logit contributions
-0.001
 Some-0.001
 Someone-0.001
 many-0.001
 curtains-0.001
Token started
Token ID2067
Feature activation-1.48e+00

Pos logit contributions
gh+0.019
ah+0.019
iding+0.018
Deal+0.017
+0.017
Neg logit contributions
 many-0.009
 pieces-0.008
 curtains-0.008
 Some-0.008
 rooms-0.008
Token to
Token ID284
Feature activation+1.35e+00

Pos logit contributions
 many+0.014
 curtains+0.014
 pieces+0.013
 Some+0.012
+0.011
Neg logit contributions
ah-0.091
Deal-0.089
gh-0.084
iding-0.082
-0.081
Token idea
Token ID2126
Feature activation+7.44e-02

Pos logit contributions
ah+0.043
gh+0.040
iding+0.037
Deal+0.037
 repeat+0.036
Neg logit contributions
 many-0.043
 Some-0.043
 curtains-0.042
 old-0.042
 pieces-0.042
Token.
Token ID13
Feature activation+2.80e-01

Pos logit contributions
ah+0.004
gh+0.004
Deal+0.004
 repeat+0.004
iding+0.004
Neg logit contributions
 of-0.001
-0.001
 curtains-0.001
 pieces-0.001
 many-0.001
Token She
Token ID1375
Feature activation+4.08e-01

Pos logit contributions
ah+0.014
 repeat+0.013
gh+0.013
Deal+0.013
iding+0.013
Neg logit contributions
edd-0.005
 curtains-0.004
 pieces-0.004
 rooms-0.004
 doorway-0.004
Token hugged
Token ID41130
Feature activation-2.56e+00

Pos logit contributions
ah+0.019
gh+0.018
iding+0.018
Deal+0.017
 repeat+0.017
Neg logit contributions
 curtains-0.009
 pieces-0.009
 Some-0.009
 many-0.008
 of-0.008
Token each
Token ID1123
Feature activation-2.14e+00

Pos logit contributions
 curtains+0.033
 Some+0.032
 many+0.029
 pieces+0.028
edd+0.028
Neg logit contributions
ah-0.143
gh-0.136
 repeat-0.132
Deal-0.131
iding-0.130
Token person
Token ID1048
Feature activation-7.84e+00

Pos logit contributions
 curtains+0.027
 Some+0.026
edd+0.024
 rooms+0.023
 pieces+0.023
Neg logit contributions
ah-0.120
gh-0.112
 repeat-0.111
Deal-0.108
iding-0.106
Token and
Token ID290
Feature activation-1.63e+00

Pos logit contributions
 many+0.108
 curtains+0.105
 of+0.103
 pieces+0.101
 Some+0.100
Neg logit contributions
gh-0.422
ah-0.422
Deal-0.403
 repeat-0.393
iding-0.392
Token animal
Token ID5044
Feature activation-3.94e+00

Pos logit contributions
 curtains+0.042
 Some+0.041
 many+0.040
 pieces+0.040
edd+0.039
Neg logit contributions
ah-0.070
gh-0.066
 repeat-0.062
Deal-0.062
iding-0.062
Token.
Token ID13
Feature activation-5.54e-02

Pos logit contributions
 curtains+0.072
 Some+0.071
 many+0.067
 pieces+0.066
edd+0.063
Neg logit contributions
ah-0.200
gh-0.189
 repeat-0.181
Deal-0.181
iding-0.180
Token
Token ID198
Feature activation-3.15e-01

Pos logit contributions
 curtains+0.001
 pieces+0.001
edd+0.001
 rooms+0.001
 doorway+0.001
Neg logit contributions
ah-0.003
 repeat-0.003
Deal-0.003
gh-0.003
iding-0.003
Token
Token ID198
Feature activation+2.96e-02

Pos logit contributions
 curtains+0.006
 doorway+0.006
 pieces+0.006
 rooms+0.006
 posters+0.005
Neg logit contributions
gh-0.015
ah-0.015
 repeat-0.014
h-0.013
iding-0.013
Token They
Token ID1119
Feature activation+4.40e-01

Pos logit contributions
ah+0.029
gh+0.027
 repeat+0.027
Deal+0.027
iding+0.026
Neg logit contributions
 curtains-0.011
edd-0.010
 pieces-0.010
 doorway-0.010
rolley-0.009
Token felt
Token ID2936
Feature activation-4.28e+00

Pos logit contributions
ah+0.020
gh+0.019
 repeat+0.018
Deal+0.018
iding+0.018
Neg logit contributions
 curtains-0.010
 Some-0.010
 many-0.010
 pieces-0.010
edd-0.009
Token better
Token ID1365
Feature activation-1.48e-01

Pos logit contributions
rolley+0.142
armac+0.140
 doorway+0.140
edd+0.140
 curtains+0.135
Neg logit contributions
 repeat-0.137
gh-0.129
 wob-0.122
 insist-0.122
Deal-0.121
Token and
Token ID290
Feature activation-7.53e-01

Pos logit contributions
 curtains+0.002
 Some+0.002
edd+0.002
 pieces+0.002
 many+0.002
Neg logit contributions
ah-0.008
gh-0.007
 repeat-0.007
Deal-0.007
iding-0.007
Token hugged
Token ID41130
Feature activation-2.94e+00

Pos logit contributions
 Some+0.019
 curtains+0.019
edd+0.018
 pieces+0.018
 many+0.018
Neg logit contributions
ah-0.032
gh-0.029
 repeat-0.029
Deal-0.029
iding-0.028
Token mom
Token ID1995
Feature activation-7.84e+00

Pos logit contributions
 many+0.038
+0.035
 Some+0.034
 curtains+0.033
 pieces+0.032
Neg logit contributions
ah-0.170
gh-0.165
iding-0.162
Deal-0.155
 repeat-0.155
Token.
Token ID13
Feature activation+2.56e-01

Pos logit contributions
 curtains+0.114
 many+0.112
 twins+0.111
 pieces+0.109
 of+0.105
Neg logit contributions
gh-0.415
ah-0.413
Deal-0.393
iding-0.390
 repeat-0.388
Token
Token ID198
Feature activation-9.29e-02

Pos logit contributions
 repeat+0.010
Deal+0.010
ah+0.010
gh+0.009
astics+0.009
Neg logit contributions
 pieces-0.007
edd-0.006
 curtains-0.006
rolley-0.006
 Hood-0.006
Token
Token ID198
Feature activation+4.31e-01

Pos logit contributions
 curtains+0.002
 doorway+0.002
 pieces+0.002
 scientists+0.002
 decorations+0.002
Neg logit contributions
gh-0.004
ah-0.004
 repeat-0.003
iding-0.003
h-0.003
Token"
Token ID1
Feature activation-1.50e+00

Pos logit contributions
ah+0.018
gh+0.017
 repeat+0.017
Deal+0.017
iding+0.016
Neg logit contributions
 curtains-0.011
 Some-0.011
 pieces-0.010
edd-0.010
 doorway-0.010
TokenThank
Token ID10449
Feature activation+1.96e+00

Pos logit contributions
 curtains+0.037
 Some+0.036
 many+0.034
 pieces+0.034
edd+0.034
Neg logit contributions
ah-0.066
gh-0.061
 repeat-0.059
Deal-0.059
iding-0.057
Token her
Token ID607
Feature activation-2.26e+00

Pos logit contributions
s+0.026
 pieces+0.025
 of+0.023
 curtains+0.023
 many+0.023
Neg logit contributions
gh-0.056
iding-0.053
ah-0.052
-0.050
 accompl-0.049
Token in
Token ID287
Feature activation+2.06e+00

Pos logit contributions
 curtains+0.042
 many+0.039
 twins+0.039
 rooms+0.038
 pieces+0.037
Neg logit contributions
ah-0.119
gh-0.110
iding-0.108
Deal-0.107
 repeat-0.105
Token her
Token ID607
Feature activation-5.43e-01

Pos logit contributions
ah+0.109
gh+0.102
Deal+0.100
 repeat+0.100
iding+0.098
Neg logit contributions
 curtains-0.033
 Some-0.031
 many-0.030
 pieces-0.030
 rooms-0.028
Token dance
Token ID9280
Feature activation+1.72e+00

Pos logit contributions
 curtains+0.010
 many+0.009
 rooms+0.009
 pieces+0.009
 doorway+0.009
Neg logit contributions
ah-0.028
gh-0.026
Deal-0.025
iding-0.025
 repeat-0.025
Token and
Token ID290
Feature activation-2.88e+00

Pos logit contributions
ah+0.092
Deal+0.088
gh+0.086
 repeat+0.084
iding+0.083
Neg logit contributions
 curtains-0.029
 of-0.027
 many-0.027
 sky-0.027
 pieces-0.026
Token feel
Token ID1254
Feature activation-7.82e+00

Pos logit contributions
 many+0.067
 curtains+0.066
 flowers+0.060
 rooms+0.059
 sky+0.058
Neg logit contributions
ah-0.140
gh-0.133
iding-0.130
Deal-0.127
et-0.116
Token happy
Token ID3772
Feature activation-4.31e+00

Pos logit contributions
 curtains+0.179
 Some+0.176
 many+0.167
 pieces+0.167
edd+0.166
Neg logit contributions
ah-0.355
gh-0.331
 repeat-0.321
Deal-0.317
iding-0.312
Token and
Token ID290
Feature activation-2.30e+00

Pos logit contributions
 curtains+0.081
 many+0.077
 pieces+0.074
 of+0.074
 twins+0.072
Neg logit contributions
ah-0.223
gh-0.207
Deal-0.202
iding-0.198
 repeat-0.198
Token free
Token ID1479
Feature activation+1.30e+00

Pos logit contributions
 curtains+0.051
 many+0.051
 Some+0.047
 pieces+0.047
 rooms+0.046
Neg logit contributions
ah-0.109
gh-0.101
iding-0.095
Deal-0.095
 repeat-0.094
Token and
Token ID290
Feature activation-1.49e+00

Pos logit contributions
ah+0.075
Deal+0.070
gh+0.069
heit+0.068
astics+0.068
Neg logit contributions
 of-0.021
 many-0.018
 curtains-0.018
 sky-0.017
 everywhere-0.017
Token alive
Token ID6776
Feature activation+1.57e-01

Pos logit contributions
 many+0.034
 curtains+0.032
 sky+0.030
 pieces+0.029
 of+0.029
Neg logit contributions
ah-0.073
gh-0.067
iding-0.065
Deal-0.062
et-0.061
Token,
Token ID11
Feature activation-2.13e+00

Pos logit contributions
 curtains+0.058
 many+0.055
 pieces+0.055
 Some+0.053
 of+0.053
Neg logit contributions
ah-0.181
gh-0.168
Deal-0.163
 repeat-0.163
iding-0.159
Token special
Token ID2041
Feature activation-2.65e-01

Pos logit contributions
 curtains+0.041
 Some+0.039
 pieces+0.039
 many+0.039
 of+0.037
Neg logit contributions
ah-0.111
gh-0.105
Deal-0.099
iding-0.098
-0.096
Token notes
Token ID4710
Feature activation-2.77e+00

Pos logit contributions
 curtains+0.005
 pieces+0.005
 many+0.005
 rooms+0.004
 of+0.004
Neg logit contributions
ah-0.014
gh-0.013
Deal-0.013
iding-0.013
-0.012
Token and
Token ID290
Feature activation-1.41e+00

Pos logit contributions
 curtains+0.041
 of+0.038
 many+0.037
 pieces+0.037
 Some+0.036
Neg logit contributions
ah-0.150
gh-0.142
Deal-0.140
 repeat-0.136
iding-0.134
Token even
Token ID772
Feature activation-2.54e+00

Pos logit contributions
 many+0.031
 curtains+0.031
 pieces+0.030
 Some+0.029
 rooms+0.028
Neg logit contributions
ah-0.070
gh-0.065
iding-0.063
Deal-0.061
-0.059
Token photographs
Token ID12566
Feature activation-7.80e+00

Pos logit contributions
 many+0.055
 pieces+0.053
 of+0.051
 curtains+0.050
 Some+0.048
Neg logit contributions
ah-0.127
gh-0.121
iding-0.117
Deal-0.114
-0.110
Token.
Token ID13
Feature activation+5.34e-01

Pos logit contributions
 curtains+0.130
 of+0.126
 pieces+0.123
 many+0.123
 sky+0.112
Neg logit contributions
ah-0.416
gh-0.387
 repeat-0.376
Deal-0.375
iding-0.372
Token Every
Token ID3887
Feature activation+1.33e-01

Pos logit contributions
ah+0.031
gh+0.030
 repeat+0.029
Deal+0.028
iding+0.028
Neg logit contributions
 Some-0.006
 curtains-0.006
 many-0.005
-0.005
edd-0.005
Token time
Token ID640
Feature activation+4.44e+00

Pos logit contributions
iding+0.006
ah+0.006
Deal+0.006
gh+0.005
 repeat+0.005
Neg logit contributions
 many-0.004
 of-0.003
 pieces-0.003
 sky-0.003
 doorway-0.003
Token she
Token ID673
Feature activation+1.13e+00

Pos logit contributions
ah+0.272
gh+0.259
iding+0.255
Deal+0.255
 repeat+0.247
Neg logit contributions
 curtains-0.042
 many-0.038
 Some-0.037
 pieces-0.035
 of-0.034
Token looked
Token ID3114
Feature activation-6.27e-01

Pos logit contributions
iding+0.055
gh+0.053
ah+0.052
+0.050
Deal+0.049
Neg logit contributions
 pieces-0.023
 saw-0.021
 rooms-0.021
 curtains-0.021
 was-0.020
Token's
Token ID338
Feature activation-1.92e+00

Pos logit contributions
ah+0.166
gh+0.155
 repeat+0.148
iding+0.144
Deal+0.143
Neg logit contributions
 curtains-0.110
 Some-0.108
edd-0.105
 pieces-0.104
 doorway-0.103
Token tum
Token ID11814
Feature activation-3.47e+00

Pos logit contributions
 twins+0.040
 curtains+0.040
 many+0.039
 pieces+0.038
 rooms+0.037
Neg logit contributions
ah-0.097
iding-0.089
gh-0.088
Deal-0.086
 repeat-0.082
Tokenmy
Token ID1820
Feature activation-5.01e+00

Pos logit contributions
 curtains+0.080
 Some+0.079
edd+0.075
 pieces+0.075
 many+0.075
Neg logit contributions
ah-0.156
gh-0.145
 repeat-0.141
Deal-0.139
iding-0.137
Token started
Token ID2067
Feature activation-3.27e+00

Pos logit contributions
+0.100
 of+0.098
 pieces+0.094
 many+0.091
s+0.091
Neg logit contributions
Deal-0.241
ah-0.240
iding-0.233
gh-0.232
initely-0.223
Token to
Token ID284
Feature activation+1.44e-01

Pos logit contributions
 many+0.024
 pieces+0.022
 curtains+0.021
 of+0.019
+0.019
Neg logit contributions
ah-0.210
Deal-0.204
iding-0.195
gh-0.191
 repeat-0.189
Token feel
Token ID1254
Feature activation-7.78e+00

Pos logit contributions
ah+0.007
Deal+0.006
gh+0.006
iding+0.006
bek+0.006
Neg logit contributions
-0.004
 curtains-0.004
 many-0.003
 flowers-0.003
 a-0.003
Token funny
Token ID8258
Feature activation-6.35e-01

Pos logit contributions
 curtains+0.209
 Some+0.205
edd+0.198
 pieces+0.195
 many+0.193
Neg logit contributions
ah-0.320
gh-0.297
 repeat-0.289
Deal-0.283
iding-0.276
Token.
Token ID13
Feature activation-9.33e-02

Pos logit contributions
 curtains+0.011
 of+0.010
 pieces+0.010
 many+0.010
 twins+0.010
Neg logit contributions
ah-0.034
Deal-0.032
gh-0.032
 repeat-0.030
iding-0.030
Token He
Token ID679
Feature activation+1.22e+00

Pos logit contributions
 Some+0.001
+0.001
 curtains+0.001
 Someone+0.001
 many+0.001
Neg logit contributions
ah-0.005
gh-0.005
Deal-0.005
 repeat-0.005
iding-0.005
Token was
Token ID373
Feature activation-4.75e-01

Pos logit contributions
iding+0.060
gh+0.060
ah+0.059
Deal+0.057
+0.054
Neg logit contributions
 many-0.022
-0.022
 curtains-0.021
 pieces-0.021
 flowers-0.020
Token thirsty
Token ID47124
Feature activation-1.94e+00

Pos logit contributions
 curtains+0.009
 many+0.009
 Some+0.009
 twins+0.009
 rooms+0.008
Neg logit contributions
ah-0.024
gh-0.022
Deal-0.022
iding-0.021
 repeat-0.021
TokenB
Token ID33
Feature activation+4.51e+00

Pos logit contributions
ah+0.187
Deal+0.174
gh+0.172
 repeat+0.169
iding+0.164
Neg logit contributions
 curtains-0.055
 many-0.050
 pieces-0.048
 Some-0.047
 chairs-0.045
Tokenreat
Token ID630
Feature activation+3.78e+00

Pos logit contributions
ah+0.174
 repeat+0.170
gh+0.169
Deal+0.167
 pose+0.165
Neg logit contributions
edd-0.125
 Some-0.119
-0.118
 of-0.115
 twins-0.113
Tokenhe
Token ID258
Feature activation-9.30e-01

Pos logit contributions
ah+0.177
gh+0.164
Deal+0.164
 repeat+0.163
iding+0.156
Neg logit contributions
 curtains-0.082
 many-0.080
 Some-0.080
 pieces-0.079
edd-0.077
Token deeply
Token ID7744
Feature activation-2.71e+00

Pos logit contributions
 pieces+0.015
 of+0.015
 many+0.015
 curtains+0.014
 chairs+0.014
Neg logit contributions
ah-0.050
Deal-0.047
 repeat-0.046
iding-0.045
gh-0.044
Token to
Token ID284
Feature activation+1.59e+00

Pos logit contributions
 many+0.032
 of+0.030
+0.028
 everywhere+0.025
 sky+0.025
Neg logit contributions
ah-0.163
gh-0.160
Deal-0.158
-0.148
astics-0.147
Token feel
Token ID1254
Feature activation-7.77e+00

Pos logit contributions
ah+0.074
Deal+0.069
iding+0.068
gh+0.067
ig+0.061
Neg logit contributions
 many-0.040
 curtains-0.040
-0.039
 rooms-0.038
 flowers-0.037
Token calm
Token ID9480
Feature activation-1.23e+00

Pos logit contributions
 curtains+0.205
 Some+0.203
 many+0.199
 pieces+0.194
 rooms+0.189
Neg logit contributions
ah-0.330
gh-0.303
 repeat-0.290
Deal-0.289
iding-0.287
Token",
Token ID1600
Feature activation-2.51e+00

Pos logit contributions
 sky+0.016
 curtains+0.016
 of+0.016
+0.016
 pieces+0.015
Neg logit contributions
ah-0.069
gh-0.067
Deal-0.064
iding-0.064
 repeat-0.063
Token or
Token ID393
Feature activation+1.43e+00

Pos logit contributions
+0.046
 many+0.040
 Some+0.039
 curtains+0.038
 of+0.037
Neg logit contributions
ah-0.133
gh-0.130
iding-0.126
Deal-0.125
astics-0.121
Token "
Token ID366
Feature activation-3.58e-01

Pos logit contributions
ah+0.081
gh+0.077
Deal+0.075
iding+0.075
astics+0.070
Neg logit contributions
 many-0.020
 curtains-0.019
 Some-0.018
-0.017
 pieces-0.017
TokenSm
Token ID7556
Feature activation+5.78e+00

Pos logit contributions
+0.007
 Some+0.007
Someone+0.007
 sky+0.006
edd+0.006
Neg logit contributions
gh-0.018
ah-0.018
iding-0.016
et-0.016
ig-0.016
Token on
Token ID319
Feature activation+4.50e+00

Pos logit contributions
ah+0.150
gh+0.144
 repeat+0.139
Deal+0.139
iding+0.137
Neg logit contributions
 curtains-0.060
 many-0.057
 sky-0.055
 pieces-0.055
 Some-0.054
Token the
Token ID262
Feature activation-4.89e-01

Pos logit contributions
ah+0.257
gh+0.243
 repeat+0.238
Deal+0.235
iding+0.233
Neg logit contributions
 curtains-0.051
 Some-0.049
 many-0.044
 pieces-0.044
edd-0.043
Token bed
Token ID3996
Feature activation+1.19e+00

Pos logit contributions
 curtains+0.012
 sky+0.011
 chairs+0.011
 doorway+0.011
 trees+0.011
Neg logit contributions
ah-0.023
iding-0.023
Deal-0.022
gh-0.022
 repeat-0.021
Token and
Token ID290
Feature activation-1.14e+00

Pos logit contributions
ah+0.061
gh+0.057
 repeat+0.056
Deal+0.056
iding+0.054
Neg logit contributions
 curtains-0.021
 Some-0.020
 many-0.019
 pieces-0.019
edd-0.018
Token gave
Token ID2921
Feature activation-2.58e+00

Pos logit contributions
 curtains+0.027
 many+0.026
 chairs+0.025
 pieces+0.025
 doorway+0.024
Neg logit contributions
ah-0.052
gh-0.050
iding-0.049
Deal-0.047
 repeat-0.045
Token mom
Token ID1995
Feature activation-7.77e+00

Pos logit contributions
 curtains+0.042
 Some+0.041
 many+0.039
 pieces+0.038
edd+0.038
Neg logit contributions
ah-0.135
gh-0.127
 repeat-0.124
Deal-0.123
iding-0.122
Tokenmy
Token ID1820
Feature activation-5.59e+00

Pos logit contributions
 doorway+0.168
 Someone+0.166
 curtains+0.162
Someone+0.160
armac+0.153
Neg logit contributions
ah-0.379
et-0.326
 repeat-0.322
gh-0.311
Deal-0.305
Token the
Token ID262
Feature activation-2.26e+00

Pos logit contributions
 curtains+0.070
 Some+0.067
 doorway+0.061
edd+0.060
 Someone+0.060
Neg logit contributions
ah-0.311
gh-0.290
 repeat-0.286
Deal-0.281
iding-0.276
Token picture
Token ID4286
Feature activation-6.23e+00

Pos logit contributions
 curtains+0.053
 pieces+0.049
 many+0.047
 sky+0.047
 Some+0.046
Neg logit contributions
ah-0.109
gh-0.104
iding-0.104
Deal-0.101
astics-0.097
Token and
Token ID290
Feature activation-2.08e+00

Pos logit contributions
 of+0.102
 curtains+0.082
 many+0.081
+0.078
 pieces+0.077
Neg logit contributions
ah-0.341
heit-0.333
iding-0.332
gh-0.330
Deal-0.329
Token the
Token ID262
Feature activation-6.86e-01

Pos logit contributions
 Some+0.049
 curtains+0.048
edd+0.046
 pieces+0.045
 doorway+0.044
Neg logit contributions
ah-0.091
 repeat-0.085
gh-0.084
Deal-0.081
 pose-0.079
Token to
Token ID284
Feature activation+2.49e+00

Pos logit contributions
rolley+0.042
 doorway+0.039
armac+0.039
edd+0.037
 rooms+0.035
Neg logit contributions
ah-0.046
gh-0.044
 repeat-0.044
 pose-0.042
et-0.040
Token pick
Token ID2298
Feature activation-9.41e-01

Pos logit contributions
ah+0.081
gh+0.074
 repeat+0.071
Deal+0.069
iding+0.068
Neg logit contributions
 curtains-0.088
 Some-0.087
edd-0.084
 pieces-0.084
 many-0.084
Token it
Token ID340
Feature activation-3.22e+00

Pos logit contributions
 doorway+0.027
 rooms+0.025
rolley+0.025
armac+0.025
 curtains+0.024
Neg logit contributions
ah-0.041
et-0.037
 repeat-0.035
gh-0.034
ercise-0.032
Token and
Token ID290
Feature activation-1.54e+00

Pos logit contributions
 curtains+0.071
 Some+0.071
 doorway+0.067
 rooms+0.066
edd+0.066
Neg logit contributions
ah-0.150
gh-0.140
 repeat-0.137
iding-0.131
Deal-0.131
Token show
Token ID905
Feature activation-1.45e+00

Pos logit contributions
 curtains+0.053
 Some+0.052
 many+0.050
 pieces+0.050
edd+0.050
Neg logit contributions
ah-0.053
gh-0.048
 repeat-0.045
Deal-0.045
iding-0.045
Token mom
Token ID1995
Feature activation-7.76e+00

Pos logit contributions
 Some+0.029
rolley+0.029
 curtains+0.029
edd+0.029
 doorway+0.029
Neg logit contributions
ah-0.067
 repeat-0.064
gh-0.064
et-0.059
 pose-0.059
Token.
Token ID13
Feature activation-3.02e-01

Pos logit contributions
 doorway+0.207
Someone+0.204
 Someone+0.203
 curtains+0.191
 Some+0.185
Neg logit contributions
ah-0.334
et-0.289
 repeat-0.288
gh-0.282
 pose-0.278
Token She
Token ID1375
Feature activation-1.75e-01

Pos logit contributions
edd+0.006
 curtains+0.006
 pieces+0.006
armac+0.006
 doorway+0.006
Neg logit contributions
ah-0.013
 repeat-0.013
gh-0.013
Deal-0.013
 insist-0.012
Token pulls
Token ID16194
Feature activation+1.29e+00

Pos logit contributions
 curtains+0.006
 Some+0.006
edd+0.006
 doorway+0.006
 pieces+0.006
Neg logit contributions
ah-0.005
 repeat-0.005
gh-0.005
Deal-0.005
 pose-0.004
Token and
Token ID290
Feature activation-1.67e+00

Pos logit contributions
ah+0.069
gh+0.065
 repeat+0.064
Deal+0.062
iding+0.061
Neg logit contributions
 curtains-0.019
 Some-0.018
edd-0.017
 rooms-0.017
 doorway-0.017
Token pulls
Token ID16194
Feature activation+1.01e+00

Pos logit contributions
 pieces+0.056
 curtains+0.055
+0.055
 many+0.054
 chairs+0.054
Neg logit contributions
ah-0.064
gh-0.060
Deal-0.055
iding-0.053
cluded-0.049
Token back
Token ID736
Feature activation-2.33e-01

Pos logit contributions
ah+0.093
gh+0.083
 repeat+0.079
Deal+0.077
iding+0.075
Neg logit contributions
 curtains-0.126
 Some-0.125
edd-0.121
 pieces-0.120
 many-0.119
Token home
Token ID1363
Feature activation-2.65e+00

Pos logit contributions
 Some+0.005
 curtains+0.004
edd+0.004
 Someone+0.004
Someone+0.004
Neg logit contributions
ah-0.011
gh-0.010
 repeat-0.010
 pose-0.010
iding-0.010
Token,
Token ID11
Feature activation-2.38e+00

Pos logit contributions
 Some+0.064
 curtains+0.064
edd+0.061
 pieces+0.059
 many+0.058
Neg logit contributions
ah-0.115
gh-0.108
 repeat-0.105
Deal-0.103
iding-0.102
Token empty
Token ID6565
Feature activation-4.08e-01

Pos logit contributions
 many+0.048
 curtains+0.048
+0.048
 doorway+0.046
 pieces+0.045
Neg logit contributions
ah-0.119
gh-0.111
Deal-0.107
iding-0.104
 repeat-0.101
Token-
Token ID12
Feature activation+4.02e+00

Pos logit contributions
 curtains+0.008
 pieces+0.007
 rooms+0.007
 sky+0.007
 many+0.007
Neg logit contributions
ah-0.022
gh-0.020
Deal-0.020
 repeat-0.020
iding-0.019
Tokenhanded
Token ID13638
Feature activation-7.76e+00

Pos logit contributions
ah+0.195
gh+0.179
Deal+0.173
 repeat+0.169
iding+0.167
Neg logit contributions
 curtains-0.089
 Some-0.083
 rooms-0.080
 pieces-0.078
 twins-0.076
Token.
Token ID13
Feature activation+1.34e-01

Pos logit contributions
 Some+0.188
 curtains+0.186
edd+0.174
 many+0.173
 pieces+0.170
Neg logit contributions
ah-0.340
gh-0.314
 repeat-0.306
Deal-0.300
iding-0.295
Token
Token ID198
Feature activation-2.38e-01

Pos logit contributions
ah+0.005
gh+0.005
 repeat+0.005
Deal+0.005
+0.005
Neg logit contributions
edd-0.003
 curtains-0.003
 pieces-0.003
 posters-0.003
armac-0.003
Token
Token ID198
Feature activation+1.21e-01

Pos logit contributions
 curtains+0.008
edd+0.007
 pieces+0.007
 models+0.007
 scientists+0.007
Neg logit contributions
gh-0.009
ah-0.008
 repeat-0.008
iding-0.007
 skate-0.007
TokenNature
Token ID46934
Feature activation-4.82e-01

Pos logit contributions
ah+0.006
gh+0.006
 repeat+0.005
Deal+0.005
iding+0.005
Neg logit contributions
 curtains-0.003
 Some-0.002
edd-0.002
 pieces-0.002
 rooms-0.002
Token was
Token ID373
Feature activation-3.00e-01

Pos logit contributions
 curtains+0.010
 Some+0.009
 pieces+0.009
 many+0.009
 sky+0.009
Neg logit contributions
ah-0.023
gh-0.022
Deal-0.021
 repeat-0.021
iding-0.021
Token leaves
Token ID5667
Feature activation+5.67e-01

Pos logit contributions
 Some+0.031
 curtains+0.031
edd+0.031
 doorway+0.029
rolley+0.029
Neg logit contributions
ah-0.065
gh-0.060
 repeat-0.060
Deal-0.058
 pose-0.057
Token.
Token ID13
Feature activation+3.60e-01

Pos logit contributions
ah+0.032
Deal+0.031
gh+0.031
+0.030
 repeat+0.030
Neg logit contributions
 of-0.008
 pieces-0.008
 everywhere-0.007
 many-0.007
 curtains-0.007
Token They
Token ID1119
Feature activation+4.94e-01

Pos logit contributions
ah+0.016
gh+0.015
 repeat+0.015
Deal+0.015
iding+0.014
Neg logit contributions
 curtains-0.008
edd-0.007
 pieces-0.007
 doorway-0.007
armac-0.007
Token saw
Token ID2497
Feature activation-4.40e+00

Pos logit contributions
ah+0.022
gh+0.021
iding+0.021
Deal+0.020
 repeat+0.019
Neg logit contributions
 many-0.012
 curtains-0.012
 Some-0.011
 pieces-0.011
 rooms-0.011
Token a
Token ID257
Feature activation-9.23e-01

Pos logit contributions
rolley+0.106
 doorway+0.096
edd+0.094
armac+0.092
eddy+0.088
Neg logit contributions
ah-0.177
 repeat-0.175
 cheating-0.165
et-0.165
 wob-0.163
Token sk
Token ID1341
Feature activation+9.28e+00

Pos logit contributions
 curtains+0.013
 sky+0.012
 pieces+0.011
 many+0.011
 doorway+0.011
Neg logit contributions
ah-0.051
gh-0.049
Deal-0.049
iding-0.048
et-0.047
Tokenunk
Token ID2954
Feature activation-1.80e+00

Pos logit contributions
ah+0.438
gh+0.413
 repeat+0.411
Deal+0.409
iding+0.386
Neg logit contributions
 curtains-0.190
 Some-0.184
 pieces-0.174
edd-0.172
 rooms-0.166
Token come
Token ID1282
Feature activation+3.62e+00

Pos logit contributions
 of+0.035
 curtains+0.032
 living+0.031
 everywhere+0.031
 that+0.030
Neg logit contributions
Deal-0.095
ah-0.094
-0.093
initely-0.090
gh-0.090
Token out
Token ID503
Feature activation-1.35e+00

Pos logit contributions
ah+0.199
Deal+0.189
gh+0.188
 repeat+0.181
 pose+0.176
Neg logit contributions
 curtains-0.051
 many-0.045
 of-0.044
 pieces-0.044
edd-0.044
Token of
Token ID286
Feature activation-1.94e+00

Pos logit contributions
 curtains+0.010
 Some+0.009
 pieces+0.008
 many+0.008
edd+0.007
Neg logit contributions
ah-0.083
gh-0.079
Deal-0.077
 repeat-0.077
iding-0.075
Token the
Token ID262
Feature activation-7.44e-01

Pos logit contributions
 Some+0.037
edd+0.036
rolley+0.034
 curtains+0.033
 Someone+0.033
Neg logit contributions
ah-0.092
gh-0.086
 repeat-0.085
et-0.085
 forth-0.081
Token joined
Token ID5399
Feature activation+1.60e+00

Pos logit contributions
 many+0.007
 everywhere+0.006
 curtains+0.006
+0.006
 of+0.006
Neg logit contributions
ah-0.020
gh-0.020
iding-0.020
Deal-0.019
astics-0.019
Token back
Token ID736
Feature activation+5.35e-01

Pos logit contributions
ah+0.100
Deal+0.096
gh+0.094
 pose+0.088
 repeat+0.087
Neg logit contributions
 many-0.021
 of-0.020
-0.018
 pieces-0.017
 flowers-0.017
Token together
Token ID1978
Feature activation+1.01e+00

Pos logit contributions
ah+0.029
gh+0.028
 repeat+0.027
Deal+0.027
iding+0.026
Neg logit contributions
 curtains-0.007
 Some-0.007
 many-0.007
 pieces-0.007
edd-0.006
Token and
Token ID290
Feature activation-1.03e+00

Pos logit contributions
ah+0.065
Deal+0.061
gh+0.059
+0.059
ercise+0.057
Neg logit contributions
 many-0.012
 of-0.012
-0.011
 pieces-0.011
 everywhere-0.010
Token the
Token ID262
Feature activation+2.91e-02

Pos logit contributions
 many+0.025
 family+0.022
 flowers+0.021
 they+0.021
 rooms+0.021
Neg logit contributions
ah-0.052
iding-0.049
gh-0.049
 AW-0.044
Deal-0.044
Token tw
Token ID665
Feature activation+1.14e+01

Pos logit contributions
ah+0.001
iding+0.001
gh+0.001
Deal+0.001
ercise+0.001
Neg logit contributions
 sky-0.001
 pieces-0.001
 flowers-0.001
 curtains-0.001
 many-0.001
Tokenig
Token ID328
Feature activation+6.82e+00

Pos logit contributions
ah+0.138
gh+0.104
 repeat+0.097
Deal+0.093
iding+0.074
Neg logit contributions
 curtains-0.631
 Some-0.622
 pieces-0.618
 many-0.616
edd-0.613
Token was
Token ID373
Feature activation-8.88e-01

Pos logit contributions
ah+0.310
Deal+0.300
gh+0.291
 repeat+0.285
+0.276
Neg logit contributions
 pieces-0.153
 of-0.152
 curtains-0.152
 many-0.151
-0.141
Token once
Token ID1752
Feature activation+2.75e-01

Pos logit contributions
 many+0.019
 of+0.016
 amazed+0.016
 curtains+0.015
 pieces+0.015
Neg logit contributions
ah-0.045
Deal-0.043
gh-0.041
iding-0.040
 repeat-0.040
Token again
Token ID757
Feature activation-2.71e-01

Pos logit contributions
ah+0.013
gh+0.013
Deal+0.012
 repeat+0.012
iding+0.012
Neg logit contributions
 many-0.006
 curtains-0.006
 pieces-0.006
 of-0.005
 Some-0.005
Token a
Token ID257
Feature activation-7.10e-01

Pos logit contributions
+0.004
 many+0.003
 of+0.003
 family+0.002
 a+0.002
Neg logit contributions
ah-0.017
Deal-0.016
gh-0.016
initely-0.016
-0.015
Token.
Token ID13
Feature activation+2.24e+00

Pos logit contributions
ah+0.047
Deal+0.046
gh+0.044
bek+0.044
iding+0.043
Neg logit contributions
 of-0.016
-0.016
s-0.012
 many-0.012
 pieces-0.012
Token It
Token ID632
Feature activation+6.98e-02

Pos logit contributions
ah+0.128
gh+0.124
iding+0.117
Deal+0.115
 forth+0.112
Neg logit contributions
 Some-0.040
 Someone-0.034
 curtains-0.030
-0.029
 many-0.027
Token's
Token ID338
Feature activation+2.04e+00

Pos logit contributions
iding+0.004
gh+0.004
ah+0.003
Deal+0.003
 repeat+0.003
Neg logit contributions
 pieces-0.001
s-0.001
 curtains-0.001
 Some-0.001
-0.001
Token mine
Token ID6164
Feature activation+4.07e+00

Pos logit contributions
ah+0.100
iding+0.098
Deal+0.096
gh+0.094
 compete+0.087
Neg logit contributions
 many-0.050
 a-0.047
 old-0.046
 everywhere-0.043
 big-0.043
Token.
Token ID13
Feature activation+1.41e+00

Pos logit contributions
ah+0.189
gh+0.175
 repeat+0.171
Deal+0.168
iding+0.164
Neg logit contributions
 curtains-0.090
 many-0.087
 Some-0.086
 pieces-0.086
edd-0.083
Token Go
Token ID1514
Feature activation+9.41e+00

Pos logit contributions
ah+0.067
gh+0.064
iding+0.060
Deal+0.059
 forth+0.058
Neg logit contributions
 Some-0.037
 Someone-0.033
 curtains-0.031
-0.031
 many-0.029
Token play
Token ID711
Feature activation+3.88e+00

Pos logit contributions
ah+0.289
gh+0.268
iding+0.254
 repeat+0.251
Deal+0.250
Neg logit contributions
 curtains-0.340
 Some-0.338
 many-0.336
-0.335
edd-0.328
Token with
Token ID351
Feature activation-4.57e-01

Pos logit contributions
ah+0.230
iding+0.205
Deal+0.201
 repeat+0.200
gh+0.200
Neg logit contributions
 many-0.054
 pieces-0.050
-0.048
 Some-0.044
 curtains-0.044
Token your
Token ID534
Feature activation+2.97e+00

Pos logit contributions
 curtains+0.013
 Some+0.012
 doorway+0.012
edd+0.012
 sky+0.012
Neg logit contributions
ah-0.018
gh-0.017
 repeat-0.016
Deal-0.016
et-0.016
Token doll
Token ID3654
Feature activation-1.19e+00

Pos logit contributions
ah+0.137
gh+0.128
 repeat+0.123
iding+0.122
Deal+0.122
Neg logit contributions
 curtains-0.070
 Some-0.069
 pieces-0.067
 many-0.065
 rooms-0.065
Token,"
Token ID553
Feature activation-2.22e+00

Pos logit contributions
 pieces+0.019
 many+0.018
 of+0.017
 curtains+0.017
+0.017
Neg logit contributions
ah-0.064
gh-0.062
iding-0.060
 repeat-0.059
 som-0.058
Token the
Token ID262
Feature activation-1.50e+00

Pos logit contributions
 curtains+0.055
 Some+0.054
edd+0.052
 pieces+0.050
 rooms+0.049
Neg logit contributions
ah-0.133
gh-0.126
 repeat-0.123
Deal-0.120
iding-0.117
Token handle
Token ID5412
Feature activation-4.09e-01

Pos logit contributions
 curtains+0.045
 pieces+0.045
 Some+0.044
 many+0.043
 doorway+0.043
Neg logit contributions
ah-0.059
gh-0.055
iding-0.052
Deal-0.052
 repeat-0.052
Token from
Token ID422
Feature activation+5.74e-01

Pos logit contributions
 of+0.007
 pieces+0.006
 many+0.006
 curtains+0.005
 sky+0.005
Neg logit contributions
ah-0.023
Deal-0.022
gh-0.022
astics-0.021
-0.021
Token the
Token ID262
Feature activation-3.49e-01

Pos logit contributions
ah+0.033
gh+0.032
iding+0.031
Deal+0.031
 repeat+0.030
Neg logit contributions
 curtains-0.007
 many-0.007
 doorway-0.006
 rooms-0.006
 pieces-0.006
Token wet
Token ID9583
Feature activation+4.30e+00

Pos logit contributions
 curtains+0.008
 sky+0.007
 doorway+0.007
 pieces+0.007
 many+0.007
Neg logit contributions
ah-0.017
gh-0.016
iding-0.016
 repeat-0.015
Deal-0.015
Token p
Token ID279
Feature activation+9.84e+00

Pos logit contributions
ah+0.197
gh+0.187
Deal+0.183
 repeat+0.182
iding+0.178
Neg logit contributions
 curtains-0.099
 pieces-0.096
 sky-0.095
 doorway-0.093
 Some-0.092
Tokenuddle
Token ID24500
Feature activation+1.53e+00

Pos logit contributions
ah+0.416
gh+0.390
 repeat+0.382
Deal+0.372
iding+0.362
Neg logit contributions
 curtains-0.255
 Some-0.252
edd-0.241
 pieces-0.240
 many-0.240
Token and
Token ID290
Feature activation-1.75e+00

Pos logit contributions
ah+0.086
Deal+0.083
 repeat+0.082
astics+0.082
ercise+0.080
Neg logit contributions
 of-0.025
 pieces-0.022
 many-0.019
 curtains-0.019
 everywhere-0.019
Token gave
Token ID2921
Feature activation-2.55e+00

Pos logit contributions
 pieces+0.040
 curtains+0.039
 many+0.038
 chairs+0.037
 doorway+0.037
Neg logit contributions
ah-0.087
gh-0.080
iding-0.078
Deal-0.078
 repeat-0.073
Token it
Token ID340
Feature activation-3.87e+00

Pos logit contributions
armac+0.065
 Some+0.064
 curtains+0.064
 rooms+0.061
Someone+0.060
Neg logit contributions
ah-0.100
gh-0.098
 repeat-0.093
Deal-0.090
et-0.089
Token back
Token ID736
Feature activation-8.30e-01

Pos logit contributions
 many+0.059
 curtains+0.057
 pieces+0.055
 Some+0.054
 of+0.051
Neg logit contributions
ah-0.210
gh-0.197
 repeat-0.192
Deal-0.191
iding-0.188
Token
Token ID198
Feature activation-6.75e-01

Pos logit contributions
 curtains+0.002
 pieces+0.002
 doorway+0.002
edd+0.002
 rooms+0.002
Neg logit contributions
ah-0.011
gh-0.010
 repeat-0.010
iding-0.010
Deal-0.010
Token
Token ID198
Feature activation-2.01e-01

Pos logit contributions
 curtains+0.017
 doorway+0.016
 pieces+0.015
 scientists+0.015
 posters+0.015
Neg logit contributions
ah-0.028
gh-0.028
 repeat-0.025
iding-0.025
h-0.025
TokenThey
Token ID2990
Feature activation+6.46e-01

Pos logit contributions
 curtains+0.005
 Some+0.005
 many+0.004
 pieces+0.004
edd+0.004
Neg logit contributions
ah-0.009
gh-0.008
 repeat-0.008
Deal-0.008
iding-0.008
Token see
Token ID766
Feature activation-4.16e+00

Pos logit contributions
gh+0.027
iding+0.025
astics+0.023
Deal+0.023
ah+0.023
Neg logit contributions
 many-0.019
 were-0.019
-0.018
 chairs-0.018
 big-0.018
Token a
Token ID257
Feature activation-1.29e+00

Pos logit contributions
edd+0.074
 doorway+0.070
rolley+0.069
 curtains+0.066
armac+0.061
Neg logit contributions
ah-0.205
 repeat-0.196
gh-0.188
Deal-0.184
et-0.182
Token j
Token ID474
Feature activation+1.08e+01

Pos logit contributions
 curtains+0.017
 many+0.016
 sky+0.015
 pieces+0.015
 Some+0.015
Neg logit contributions
ah-0.071
gh-0.068
iding-0.066
Deal-0.066
 repeat-0.065
Tokenolly
Token ID5098
Feature activation+2.07e+00

Pos logit contributions
ah+0.450
 repeat+0.431
gh+0.430
Deal+0.423
iding+0.395
Neg logit contributions
 of-0.271
 Some-0.258
 many-0.254
 curtains-0.252
 sky-0.245
Token man
Token ID582
Feature activation-3.03e+00

Pos logit contributions
Deal+0.091
iding+0.090
beh+0.088
astics+0.086
ercise+0.085
Neg logit contributions
 sky-0.056
 pieces-0.053
 flowers-0.053
 big-0.051
 chairs-0.050
Token.
Token ID13
Feature activation-5.65e-02

Pos logit contributions
 of+0.051
+0.044
 living+0.041
 inside+0.039
 twins+0.039
Neg logit contributions
Deal-0.169
ah-0.168
heit-0.167
gh-0.161
 AW-0.161
Token He
Token ID679
Feature activation+1.13e+00

Pos logit contributions
 curtains+0.001
 Some+0.001
 many+0.001
 pieces+0.001
edd+0.001
Neg logit contributions
ah-0.003
gh-0.003
 repeat-0.003
Deal-0.003
iding-0.002
Token has
Token ID468
Feature activation-1.52e+00

Pos logit contributions
gh+0.054
ah+0.054
iding+0.053
Deal+0.051
 repeat+0.049
Neg logit contributions
 curtains-0.022
-0.021
 Some-0.021
-0.020
 many-0.020
Token sad
Token ID6507
Feature activation-3.04e+00

Pos logit contributions
ah+0.010
gh+0.009
Deal+0.009
 repeat+0.009
iding+0.009
Neg logit contributions
 Some-0.004
 many-0.004
 of-0.004
 curtains-0.004
 pieces-0.004
Token that
Token ID326
Feature activation-3.99e+00

Pos logit contributions
 of+0.051
 many+0.048
 curtains+0.048
+0.046
 Some+0.043
Neg logit contributions
ah-0.160
gh-0.152
Deal-0.149
astics-0.145
iding-0.143
Token she
Token ID673
Feature activation-4.01e-01

Pos logit contributions
 many+0.033
 of+0.028
 sky+0.027
 pieces+0.025
 curtains+0.025
Neg logit contributions
ah-0.254
iding-0.243
gh-0.235
Deal-0.231
astics-0.225
Token had
Token ID550
Feature activation-1.14e+00

Pos logit contributions
 pieces+0.006
 curtains+0.006
 of+0.006
 everywhere+0.006
 doorway+0.005
Neg logit contributions
iding-0.021
ah-0.021
gh-0.020
Deal-0.020
 repeat-0.019
Token to
Token ID284
Feature activation+2.64e+00

Pos logit contributions
 many+0.023
 pieces+0.022
+0.021
 of+0.021
 curtains+0.020
Neg logit contributions
ah-0.057
Deal-0.053
iding-0.052
gh-0.051
 repeat-0.050
Token go
Token ID467
Feature activation+4.44e+00

Pos logit contributions
ah+0.124
Deal+0.113
iding+0.109
bek+0.106
ercise+0.105
Neg logit contributions
 a-0.076
 the-0.070
 something-0.067
 family-0.067
 people-0.066
Token away
Token ID1497
Feature activation+8.09e-01

Pos logit contributions
ah+0.227
gh+0.213
Deal+0.209
 repeat+0.207
iding+0.204
Neg logit contributions
 curtains-0.079
 Some-0.074
 many-0.073
 pieces-0.071
 rooms-0.069
Token but
Token ID475
Feature activation-3.30e-01

Pos logit contributions
ah+0.041
gh+0.038
Deal+0.037
 repeat+0.036
iding+0.035
Neg logit contributions
 curtains-0.016
 many-0.016
 pieces-0.015
 of-0.014
 rooms-0.014
Token was
Token ID373
Feature activation-3.07e-01

Pos logit contributions
 curtains+0.004
 many+0.004
 Some+0.004
 pieces+0.004
 doorway+0.004
Neg logit contributions
ah-0.019
gh-0.017
iding-0.017
Deal-0.017
 repeat-0.017
Token happy
Token ID3772
Feature activation-2.67e+00

Pos logit contributions
 many+0.007
 curtains+0.007
 Some+0.007
 of+0.006
 pieces+0.006
Neg logit contributions
ah-0.014
gh-0.013
Deal-0.013
 repeat-0.013
iding-0.013
Token because
Token ID780
Feature activation-2.57e+00

Pos logit contributions
 curtains+0.038
 Some+0.036
 many+0.034
 pieces+0.033
edd+0.032
Neg logit contributions
ah-0.146
gh-0.137
 repeat-0.133
Deal-0.132
iding-0.131
Token It
Token ID632
Feature activation-1.48e+00

Pos logit contributions
edd+0.004
 curtains+0.004
 pieces+0.004
 rooms+0.003
 Some+0.003
Neg logit contributions
ah-0.014
 repeat-0.013
gh-0.013
Deal-0.013
iding-0.012
Token was
Token ID373
Feature activation-5.87e-01

Pos logit contributions
 curtains+0.024
 Some+0.022
 pieces+0.022
 many+0.022
 sky+0.021
Neg logit contributions
ah-0.076
gh-0.073
iding-0.071
Deal-0.071
 repeat-0.069
Token perched
Token ID49264
Feature activation+2.89e+00

Pos logit contributions
 curtains+0.011
 many+0.011
 Some+0.011
 pieces+0.010
 sky+0.010
Neg logit contributions
ah-0.030
Deal-0.028
gh-0.028
iding-0.027
 repeat-0.026
Token from
Token ID422
Feature activation+1.58e+00

Pos logit contributions
ah+0.131
gh+0.123
 repeat+0.123
Deal+0.121
 pose+0.118
Neg logit contributions
 many-0.066
 curtains-0.064
 pieces-0.063
 sky-0.063
-0.060
Token a
Token ID257
Feature activation-6.89e-01

Pos logit contributions
ah+0.085
gh+0.078
 repeat+0.078
Deal+0.076
et+0.075
Neg logit contributions
 Some-0.022
edd-0.022
 curtains-0.022
rolley-0.020
 rooms-0.020
Token nearby
Token ID6716
Feature activation+3.12e+00

Pos logit contributions
 curtains+0.015
 Some+0.014
 pieces+0.014
 many+0.014
 doorway+0.013
Neg logit contributions
ah-0.033
gh-0.031
Deal-0.030
 repeat-0.029
iding-0.029
Token branch
Token ID8478
Feature activation+1.50e+00

Pos logit contributions
gh+0.167
iding+0.165
ah+0.163
Deal+0.163
 repeat+0.151
Neg logit contributions
 sky-0.059
 curtains-0.057
 doorway-0.052
 trees-0.050
 flowers-0.048
Token,
Token ID11
Feature activation-2.11e+00

Pos logit contributions
Deal+0.083
gh+0.082
ah+0.081
heit+0.080
+0.079
Neg logit contributions
 of-0.024
 sky-0.023
 many-0.020
 everywhere-0.018
 doorway-0.018
Token watching
Token ID4964
Feature activation+1.63e+00

Pos logit contributions
 curtains+0.040
 many+0.039
 sky+0.038
 pieces+0.036
 Some+0.036
Neg logit contributions
ah-0.109
gh-0.104
Deal-0.099
iding-0.098
-0.093
Token her
Token ID607
Feature activation+9.52e-02

Pos logit contributions
ah+0.096
Deal+0.091
gh+0.091
iding+0.088
 repeat+0.087
Neg logit contributions
 curtains-0.020
 many-0.018
 Some-0.017
 pieces-0.015
 sky-0.015
Token twist
Token ID14528
Feature activation+2.86e+00

Pos logit contributions
ah+0.005
gh+0.005
Deal+0.005
ercise+0.004
initely+0.004
Neg logit contributions
 curtains-0.002
 flowers-0.002
 many-0.002
-0.002
 of-0.002
Token happy
Token ID3772
Feature activation-2.81e+00

Pos logit contributions
 Some+0.035
 curtains+0.035
edd+0.034
 doorway+0.033
 pieces+0.033
Neg logit contributions
ah-0.055
 repeat-0.052
gh-0.052
Deal-0.050
iding-0.049
Token too
Token ID1165
Feature activation+2.76e-01

Pos logit contributions
 curtains+0.072
 Some+0.071
edd+0.068
 doorway+0.068
 pieces+0.067
Neg logit contributions
ah-0.118
gh-0.109
 repeat-0.108
Deal-0.106
iding-0.103
Token.
Token ID13
Feature activation+4.61e-01

Pos logit contributions
ah+0.010
gh+0.010
 repeat+0.010
iding+0.009
Deal+0.009
Neg logit contributions
 curtains-0.008
 doorway-0.008
 Some-0.008
edd-0.008
rolley-0.008
Token They
Token ID1119
Feature activation+4.62e-01

Pos logit contributions
Deal+0.014
ah+0.014
 repeat+0.014
+0.013
OK+0.012
Neg logit contributions
 doorway-0.016
 pieces-0.016
rolley-0.015
 curtains-0.015
edd-0.014
Token like
Token ID588
Feature activation-1.28e+00

Pos logit contributions
gh+0.022
ah+0.020
iding+0.020
hh+0.018
astics+0.018
Neg logit contributions
-0.011
 many-0.011
 flowers-0.010
 twins-0.010
 of-0.010
Token to
Token ID284
Feature activation+3.42e+00

Pos logit contributions
 curtains+0.014
 Some+0.014
 many+0.013
 pieces+0.012
 rooms+0.012
Neg logit contributions
ah-0.075
gh-0.071
Deal-0.068
 repeat-0.068
iding-0.068
Token play
Token ID711
Feature activation+3.64e+00

Pos logit contributions
bek+0.182
ust+0.176
iding+0.175
vel+0.171
heit+0.170
Neg logit contributions
-0.065
 of-0.055
 the-0.054
 a-0.054
 flowers-0.053
Token with
Token ID351
Feature activation-7.84e-01

Pos logit contributions
ah+0.218
gh+0.193
heit+0.190
bek+0.187
iding+0.187
Neg logit contributions
 many-0.057
-0.053
 of-0.047
 pieces-0.047
 flowers-0.046
Token the
Token ID262
Feature activation-3.62e-01

Pos logit contributions
 curtains+0.013
 doorway+0.012
 Some+0.012
edd+0.012
rolley+0.012
Neg logit contributions
ah-0.040
gh-0.038
 repeat-0.037
Deal-0.036
 pose-0.036
Token big
Token ID1263
Feature activation+2.66e+00

Pos logit contributions
edd+0.012
rolley+0.012
 curtains+0.012
 doorway+0.011
armac+0.011
Neg logit contributions
ah-0.012
 repeat-0.012
gh-0.011
Deal-0.011
 race-0.011
Token ball
Token ID2613
Feature activation+6.12e-01

Pos logit contributions
ah+0.108
gh+0.099
 repeat+0.096
Deal+0.095
iding+0.093
Neg logit contributions
 curtains-0.074
 Some-0.073
edd-0.070
 many-0.069
 pieces-0.069
Token it
Token ID340
Feature activation-2.03e+00

Pos logit contributions
ah+0.061
Deal+0.057
iding+0.056
 repeat+0.055
gh+0.055
Neg logit contributions
 curtains-0.017
 many-0.016
 pieces-0.016
 of-0.016
-0.015
Token there
Token ID612
Feature activation-1.19e+00

Pos logit contributions
 of+0.031
 many+0.030
 everywhere+0.028
 pieces+0.028
+0.027
Neg logit contributions
ah-0.118
Deal-0.108
gh-0.105
 repeat-0.104
-0.103
Token.
Token ID13
Feature activation+6.07e-01

Pos logit contributions
+0.018
 of+0.017
 many+0.017
 everywhere+0.015
 pieces+0.014
Neg logit contributions
ah-0.068
 repeat-0.062
Deal-0.062
iding-0.062
gh-0.062
Token From
Token ID3574
Feature activation+1.63e+00

Pos logit contributions
ah+0.032
gh+0.031
 repeat+0.030
Deal+0.029
iding+0.029
Neg logit contributions
 Some-0.010
 curtains-0.009
 many-0.009
-0.009
 Someone-0.009
Token that
Token ID326
Feature activation-1.43e+00

Pos logit contributions
ah+0.088
gh+0.085
iding+0.082
 repeat+0.081
Deal+0.079
Neg logit contributions
 many-0.024
 curtains-0.023
 sky-0.022
-0.022
 pieces-0.021
Token day
Token ID1110
Feature activation+3.41e+00

Pos logit contributions
 many+0.031
 of+0.030
+0.030
 sky+0.029
 pieces+0.028
Neg logit contributions
iding-0.069
Deal-0.068
ah-0.067
astics-0.066
gh-0.066
Token onwards
Token ID31719
Feature activation+7.48e+00

Pos logit contributions
ah+0.163
Deal+0.160
 pose+0.157
 appetite+0.150
 repeat+0.150
Neg logit contributions
 many-0.080
 of-0.078
-0.072
 everywhere-0.068
-0.061
Token,
Token ID11
Feature activation+1.77e-01

Pos logit contributions
ah+0.405
iding+0.388
Deal+0.388
 repeat+0.378
gh+0.377
Neg logit contributions
 many-0.112
 pieces-0.111
 of-0.104
-0.102
 curtains-0.101
Token he
Token ID339
Feature activation+1.95e+00

Pos logit contributions
ah+0.010
gh+0.010
Deal+0.009
iding+0.009
 repeat+0.009
Neg logit contributions
 many-0.002
 curtains-0.002
 Some-0.002
 pieces-0.002
-0.002
Token made
Token ID925
Feature activation-7.02e-01

Pos logit contributions
ah+0.087
gh+0.085
iding+0.083
Deal+0.080
 repeat+0.078
Neg logit contributions
 many-0.043
 curtains-0.042
 pieces-0.041
-0.041
 of-0.040
Token sure
Token ID1654
Feature activation-7.90e-01

Pos logit contributions
 curtains+0.021
 Some+0.021
edd+0.020
 doorway+0.020
 sky+0.019
Neg logit contributions
ah-0.027
gh-0.026
 repeat-0.024
iding-0.023
Deal-0.023
Token having
Token ID1719
Feature activation-9.63e-01

Pos logit contributions
 many+0.039
 curtains+0.038
 twins+0.037
 pieces+0.037
 Some+0.035
Neg logit contributions
ah-0.073
gh-0.068
iding-0.063
et-0.061
 repeat-0.061
Token fun
Token ID1257
Feature activation+3.95e+00

Pos logit contributions
 many+0.021
 pieces+0.021
 curtains+0.020
 Some+0.020
 twins+0.019
Neg logit contributions
ah-0.047
gh-0.042
iding-0.042
 repeat-0.042
Deal-0.041
Token.
Token ID13
Feature activation+9.38e-01

Pos logit contributions
ah+0.208
gh+0.190
 repeat+0.183
Deal+0.181
+0.177
Neg logit contributions
 many-0.074
 pieces-0.074
 of-0.072
 twins-0.071
 curtains-0.071
Token But
Token ID887
Feature activation+1.77e+00

Pos logit contributions
ah+0.043
gh+0.041
 repeat+0.038
iding+0.038
Deal+0.037
Neg logit contributions
 Some-0.023
 curtains-0.022
-0.021
 many-0.020
 Someone-0.020
Token next
Token ID1306
Feature activation+3.42e+00

Pos logit contributions
ah+0.096
gh+0.089
iding+0.085
cluded+0.085
et+0.082
Neg logit contributions
 many-0.032
-0.032
 grandma-0.030
 of-0.030
 twins-0.030
Token time
Token ID640
Feature activation+4.06e+00

Pos logit contributions
iding+0.138
ah+0.137
beh+0.135
Deal+0.134
pper+0.133
Neg logit contributions
 many-0.097
-0.096
 sky-0.095
 of-0.093
 big-0.083
Token,
Token ID11
Feature activation+1.93e-01

Pos logit contributions
ah+0.236
Deal+0.219
gh+0.219
et+0.214
iding+0.206
Neg logit contributions
-0.063
 of-0.058
 curtains-0.054
 many-0.054
-0.054
Token please
Token ID3387
Feature activation+4.59e+00

Pos logit contributions
ah+0.010
gh+0.009
et+0.009
iding+0.009
cluded+0.008
Neg logit contributions
-0.004
 many-0.004
 grandma-0.004
 of-0.004
 flowers-0.004
Token tell
Token ID1560
Feature activation-9.60e-01

Pos logit contributions
ah+0.240
gh+0.221
et+0.213
initely+0.213
bek+0.212
Neg logit contributions
-0.105
 that-0.081
 of-0.079
 and-0.077
 twins-0.076
Token us
Token ID514
Feature activation+3.05e+00

Pos logit contributions
 many+0.020
 curtains+0.020
 grandma+0.019
 flowers+0.018
+0.018
Neg logit contributions
ah-0.050
iding-0.045
gh-0.044
Deal-0.044
cluded-0.042
Token where
Token ID810
Feature activation-3.28e+00

Pos logit contributions
ah+0.162
gh+0.153
 repeat+0.148
Deal+0.147
iding+0.145
Neg logit contributions
 curtains-0.046
 Some-0.045
 many-0.042
 pieces-0.041
edd-0.041
Token the
Token ID262
Feature activation+7.52e-01

Pos logit contributions
 Some+0.040
 many+0.038
 curtains+0.038
+0.038
 pieces+0.037
Neg logit contributions
ah-0.118
gh-0.112
iding-0.109
 repeat-0.104
et-0.103
Token woman
Token ID2415
Feature activation-3.03e+00

Pos logit contributions
ah+0.042
 repeat+0.039
gh+0.039
Deal+0.038
iding+0.037
Neg logit contributions
 Some-0.009
 curtains-0.009
edd-0.008
 pieces-0.008
 rooms-0.007
Token said
Token ID531
Feature activation+2.59e-01

Pos logit contributions
 of+0.050
+0.045
s+0.037
 surprised+0.034
 pieces+0.033
Neg logit contributions
gh-0.172
Deal-0.168
iding-0.164
astics-0.162
initely-0.161
Token.
Token ID13
Feature activation+7.80e-01

Pos logit contributions
ah+0.014
Deal+0.013
gh+0.013
astics+0.012
ercise+0.012
Neg logit contributions
 of-0.004
-0.004
 many-0.004
 curtains-0.004
 Some-0.004
Token
Token ID198
Feature activation+3.35e-01

Pos logit contributions
ah+0.045
gh+0.043
 repeat+0.043
Deal+0.042
iding+0.041
Neg logit contributions
 curtains-0.007
edd-0.006
 pieces-0.006
 rooms-0.006
 doorway-0.005
Token
Token ID198
Feature activation+5.99e-01

Pos logit contributions
ah+0.018
gh+0.018
Deal+0.018
 repeat+0.017
iding+0.017
Neg logit contributions
 curtains-0.004
 doorway-0.003
 rooms-0.003
 pieces-0.003
 scientists-0.003
TokenLu
Token ID25596
Feature activation+7.01e+00

Pos logit contributions
ah+0.026
gh+0.024
 repeat+0.023
et+0.022
iding+0.022
Neg logit contributions
 Some-0.016
-0.015
 many-0.015
 curtains-0.015
Someone-0.015
Tokency
Token ID948
Feature activation+6.05e-01

Pos logit contributions
gh+0.316
ah+0.306
 repeat+0.291
iding+0.290
 cheating+0.280
Neg logit contributions
 pieces-0.156
 curtains-0.156
edd-0.151
 Some-0.149
rolley-0.147
Token smiled
Token ID13541
Feature activation-4.33e+00

Pos logit contributions
gh+0.029
ah+0.029
Deal+0.028
iding+0.027
 cheating+0.027
Neg logit contributions
 pieces-0.012
-0.011
 curtains-0.011
 sky-0.011
 of-0.011
Token and
Token ID290
Feature activation-1.69e+00

Pos logit contributions
 curtains+0.071
 Some+0.069
 many+0.065
 pieces+0.064
edd+0.064
Neg logit contributions
ah-0.226
gh-0.213
 repeat-0.206
Deal-0.205
iding-0.201
Token decided
Token ID3066
Feature activation-1.05e+00

Pos logit contributions
 many+0.037
+0.037
 amazed+0.033
 of+0.033
 pieces+0.032
Neg logit contributions
ah-0.083
gh-0.081
iding-0.076
Deal-0.072
 cheating-0.070
Token was
Token ID373
Feature activation-9.48e-01

Pos logit contributions
 many+0.018
 pieces+0.018
 curtains+0.016
 sky+0.016
+0.016
Neg logit contributions
ah-0.077
iding-0.076
gh-0.073
Deal-0.071
 repeat-0.069
Token going
Token ID1016
Feature activation+5.92e+00

Pos logit contributions
 many+0.013
 curtains+0.010
+0.010
 twins+0.009
 a+0.009
Neg logit contributions
ah-0.056
Deal-0.053
gh-0.052
iding-0.052
 repeat-0.050
Token on
Token ID319
Feature activation+4.16e+00

Pos logit contributions
ah+0.363
Deal+0.357
 repeat+0.349
iding+0.345
gh+0.340
Neg logit contributions
 many-0.066
 everywhere-0.052
-0.041
 sky-0.038
 trees-0.037
Token pic
Token ID8301
Feature activation+1.28e+01

Pos logit contributions
ah+0.269
Deal+0.253
 repeat+0.252
iding+0.252
gh+0.242
Neg logit contributions
 many-0.035
-0.023
 pieces-0.019
 Some-0.014
 a-0.013
Tokenn
Token ID77
Feature activation+3.89e+00

Pos logit contributions
ah+0.323
Deal+0.261
 repeat+0.251
gh+0.232
 forth+0.230
Neg logit contributions
 many-0.552
-0.542
 curtains-0.535
edd-0.526
 Some-0.526
Tokenics
Token ID873
Feature activation-2.71e+00

Pos logit contributions
ah+0.174
gh+0.165
 repeat+0.156
iding+0.155
Deal+0.150
Neg logit contributions
 curtains-0.093
 Some-0.090
edd-0.086
 pieces-0.085
 doorway-0.085
Token!
Token ID0
Feature activation-2.92e-01

Pos logit contributions
 many+0.045
+0.035
 everywhere+0.033
 of+0.033
 pieces+0.033
Neg logit contributions
ah-0.155
 repeat-0.144
iding-0.140
gh-0.140
-0.138
Token
Token ID198
Feature activation-4.05e-01

Pos logit contributions
 curtains+0.004
edd+0.004
 doorway+0.004
 pieces+0.003
 rooms+0.003
Neg logit contributions
ah-0.016
gh-0.015
 repeat-0.015
Deal-0.014
 pose-0.014
Token
Token ID198
Feature activation-5.62e-02

Pos logit contributions
 curtains+0.010
 doorway+0.009
 posters+0.009
 flames+0.009
 bulbs+0.009
Neg logit contributions
ah-0.019
gh-0.018
Deal-0.016
iding-0.016
 repeat-0.016
TokenOn
Token ID2202
Feature activation+4.21e+00

Pos logit contributions
 Some+0.002
 many+0.002
+0.002
 pieces+0.002
 curtains+0.002
Neg logit contributions
ah-0.002
gh-0.002
 repeat-0.002
iding-0.002
Deal-0.002
Token this
Token ID428
Feature activation+7.83e-01

Pos logit contributions
ah+0.230
gh+0.217
 repeat+0.213
Deal+0.212
iding+0.209
Neg logit contributions
 curtains-0.057
 Some-0.056
 many-0.054
 pieces-0.052
edd-0.049
Token girl
Token ID2576
Feature activation-1.53e+00

Pos logit contributions
ah+0.015
gh+0.015
iding+0.014
 repeat+0.014
Deal+0.014
Neg logit contributions
 Some-0.004
 curtains-0.004
 pieces-0.004
 sky-0.003
 many-0.003
Token said
Token ID531
Feature activation-3.39e-01

Pos logit contributions
+0.025
 saw+0.022
 was+0.021
 amazed+0.021
 curtains+0.020
Neg logit contributions
iding-0.081
gh-0.080
ah-0.079
Deal-0.077
cluded-0.074
Token,
Token ID11
Feature activation-5.16e-01

Pos logit contributions
 many+0.005
 of+0.005
+0.005
 Some+0.005
 curtains+0.004
Neg logit contributions
ah-0.018
Deal-0.017
astics-0.017
iding-0.017
 repeat-0.016
Token "
Token ID366
Feature activation-4.61e-01

Pos logit contributions
+0.003
 Some+0.001
 many+0.000
 curtains+0.000
 "+0.000
Neg logit contributions
ah-0.036
gh-0.034
Deal-0.034
iding-0.034
cluded-0.033
TokenI
Token ID40
Feature activation+3.31e+00

Pos logit contributions
 Some+0.007
 curtains+0.007
edd+0.007
 pieces+0.007
 many+0.007
Neg logit contributions
ah-0.024
gh-0.023
 repeat-0.022
iding-0.022
Deal-0.022
Token want
Token ID765
Feature activation+3.21e-01

Pos logit contributions
ah+0.140
gh+0.131
Deal+0.127
iding+0.126
 repeat+0.125
Neg logit contributions
 Some-0.083
 curtains-0.082
 many-0.080
 pieces-0.079
-0.078
Token to
Token ID284
Feature activation+3.43e+00

Pos logit contributions
ah+0.016
gh+0.015
 repeat+0.015
 pose+0.014
Deal+0.014
Neg logit contributions
 Some-0.005
edd-0.005
 doorway-0.005
rolley-0.005
 curtains-0.005
Token find
Token ID1064
Feature activation-4.05e+00

Pos logit contributions
ah+0.142
Deal+0.130
gh+0.127
iding+0.124
et+0.118
Neg logit contributions
 curtains-0.098
 of-0.094
-0.093
 Some-0.092
 many-0.092
Token a
Token ID257
Feature activation-1.43e+00

Pos logit contributions
tub+0.097
 doorway+0.093
rolley+0.090
 Some+0.089
armac+0.088
Neg logit contributions
ah-0.162
 repeat-0.155
et-0.152
gh-0.148
 mine-0.144
Token new
Token ID649
Feature activation+1.04e+00

Pos logit contributions
 many+0.026
 curtains+0.026
 doorway+0.024
 sky+0.024
 pieces+0.023
Neg logit contributions
ah-0.073
gh-0.070
iding-0.070
Deal-0.068
et-0.067
Token way
Token ID835
Feature activation+6.77e-01

Pos logit contributions
ah+0.050
gh+0.048
Deal+0.047
iding+0.046
 repeat+0.046
Neg logit contributions
 curtains-0.021
 pieces-0.021
 many-0.020
 doorway-0.019
 rooms-0.019
Token fall
Token ID2121
Feature activation+3.42e+00

Pos logit contributions
ah+0.067
gh+0.061
iding+0.058
Deal+0.058
et+0.055
Neg logit contributions
 many-0.057
 Some-0.056
 curtains-0.056
 of-0.055
 pieces-0.055
Token down
Token ID866
Feature activation+2.44e+00

Pos logit contributions
Deal+0.164
ah+0.164
gh+0.159
iding+0.151
 repeat+0.149
Neg logit contributions
 curtains-0.071
 many-0.069
 of-0.065
 flowers-0.065
 everywhere-0.065
Token from
Token ID422
Feature activation+1.56e+00

Pos logit contributions
ah+0.118
gh+0.111
 repeat+0.108
Deal+0.107
iding+0.105
Neg logit contributions
 many-0.053
 curtains-0.052
 Some-0.050
 pieces-0.050
 of-0.048
Token the
Token ID262
Feature activation-9.84e-02

Pos logit contributions
ah+0.080
gh+0.076
 repeat+0.074
Deal+0.072
et+0.071
Neg logit contributions
 curtains-0.025
 Some-0.025
edd-0.023
 doorway-0.022
 pieces-0.022
Token tree
Token ID5509
Feature activation+7.88e-01

Pos logit contributions
 curtains+0.002
 Some+0.002
 pieces+0.002
 many+0.002
edd+0.002
Neg logit contributions
ah-0.005
gh-0.004
 repeat-0.004
Deal-0.004
iding-0.004
Token."
Token ID526
Feature activation-1.42e+00

Pos logit contributions
ah+0.041
gh+0.040
Deal+0.038
iding+0.037
 repeat+0.037
Neg logit contributions
 many-0.015
 of-0.014
 curtains-0.014
 pieces-0.013
-0.013
Token Lily
Token ID20037
Feature activation-9.85e-01

Pos logit contributions
 pieces+0.026
 scientists+0.026
 curtains+0.026
armac+0.025
 doorway+0.024
Neg logit contributions
ah-0.064
 repeat-0.063
gh-0.062
Deal-0.062
-0.061
Token said
Token ID531
Feature activation-6.58e-01

Pos logit contributions
+0.020
 pieces+0.018
 many+0.018
s+0.017
 was+0.016
Neg logit contributions
ah-0.046
gh-0.046
iding-0.045
 cheating-0.042
Deal-0.041
Token,
Token ID11
Feature activation-6.43e-01

Pos logit contributions
 many+0.012
 of+0.011
+0.011
 Some+0.010
 curtains+0.010
Neg logit contributions
ah-0.035
Deal-0.031
et-0.031
astics-0.031
gh-0.031
Token "
Token ID366
Feature activation-9.96e-01

Pos logit contributions
+0.004
 Some+0.003
 many+0.003
 curtains+0.003
 sky+0.002
Neg logit contributions
ah-0.042
gh-0.040
Deal-0.039
iding-0.038
 pose-0.038
TokenI
Token ID40
Feature activation+3.22e+00

Pos logit contributions
 curtains+0.032
 Some+0.030
 rooms+0.030
edd+0.029
 pieces+0.029
Neg logit contributions
ah-0.038
gh-0.034
Deal-0.034
 repeat-0.034
iding-0.031
Token snakes
Token ID26042
Feature activation-1.30e+00

Pos logit contributions
 curtains+0.037
 Some+0.036
edd+0.033
 pieces+0.033
 many+0.033
Neg logit contributions
ah-0.156
gh-0.147
 repeat-0.144
Deal-0.142
iding-0.140
Token are
Token ID389
Feature activation-1.74e+00

Pos logit contributions
 curtains+0.028
 Some+0.027
 pieces+0.027
 many+0.027
 rooms+0.025
Neg logit contributions
ah-0.061
gh-0.057
Deal-0.054
 repeat-0.054
iding-0.054
Token dangerous
Token ID4923
Feature activation+3.91e-01

Pos logit contributions
 curtains+0.040
 many+0.040
 Some+0.039
 pieces+0.038
 rooms+0.037
Neg logit contributions
ah-0.080
gh-0.074
Deal-0.071
 repeat-0.071
iding-0.070
Token,
Token ID11
Feature activation-2.03e+00

Pos logit contributions
ah+0.020
gh+0.019
 repeat+0.018
Deal+0.018
iding+0.018
Neg logit contributions
 curtains-0.007
 Some-0.007
 pieces-0.006
 many-0.006
edd-0.006
Token but
Token ID475
Feature activation+3.98e-02

Pos logit contributions
 Some+0.046
 curtains+0.045
 many+0.044
 pieces+0.044
 rooms+0.041
Neg logit contributions
ah-0.095
gh-0.088
Deal-0.085
 repeat-0.085
iding-0.084
Token some
Token ID617
Feature activation-2.79e+00

Pos logit contributions
ah+0.002
gh+0.002
iding+0.002
 repeat+0.002
Deal+0.002
Neg logit contributions
 curtains-0.001
 Some-0.001
 many-0.001
 pieces-0.001
 rooms-0.001
Token are
Token ID389
Feature activation-7.19e-01

Pos logit contributions
 curtains+0.043
 Some+0.043
 many+0.040
 pieces+0.040
 rooms+0.038
Neg logit contributions
ah-0.149
gh-0.140
 repeat-0.135
Deal-0.135
iding-0.134
Token not
Token ID407
Feature activation-1.97e+00

Pos logit contributions
 many+0.016
 curtains+0.015
 Some+0.015
 pieces+0.015
 of+0.014
Neg logit contributions
ah-0.035
gh-0.033
iding-0.032
Deal-0.031
 repeat-0.031
Token.
Token ID13
Feature activation+3.86e-01

Pos logit contributions
 many+0.042
 pieces+0.037
 of+0.035
 twins+0.034
 flowers+0.034
Neg logit contributions
ah-0.104
gh-0.101
Deal-0.098
-0.094
iding-0.093
Token
Token ID220
Feature activation+3.38e-01

Pos logit contributions
ah+0.019
gh+0.017
 repeat+0.017
Deal+0.017
iding+0.016
Neg logit contributions
 curtains-0.008
 pieces-0.007
 doorway-0.007
edd-0.007
 many-0.007
Token
Token ID198
Feature activation-9.37e-02

Pos logit contributions
ah+0.017
iding+0.016
gh+0.016
 repeat+0.015
Deal+0.015
Neg logit contributions
 doorway-0.006
 curtains-0.005
 posters-0.005
 pieces-0.005
rolley-0.004
Token She
Token ID1375
Feature activation-2.27e-02

Pos logit contributions
ah+0.012
 repeat+0.011
gh+0.011
Deal+0.011
iding+0.011
Neg logit contributions
edd-0.005
 curtains-0.005
armac-0.005
 pieces-0.005
rolley-0.005
Token had
Token ID550
Feature activation-1.63e+00

Pos logit contributions
 curtains+0.001
 Some+0.001
edd+0.001
rolley+0.001
 pieces+0.001
Neg logit contributions
ah-0.001
gh-0.001
 repeat-0.001
Deal-0.001
 pose-0.001
Token a
Token ID257
Feature activation-1.34e+00

Pos logit contributions
rolley+0.044
edd+0.042
armac+0.040
 doorway+0.037
 curtains+0.036
Neg logit contributions
gh-0.068
ah-0.066
 repeat-0.065
Deal-0.060
astics-0.059
Token favorite
Token ID4004
Feature activation-9.35e-01

Pos logit contributions
 curtains+0.021
 Some+0.021
 many+0.020
 pieces+0.019
edd+0.018
Neg logit contributions
ah-0.070
gh-0.066
 repeat-0.064
iding-0.064
Deal-0.063
Token belt
Token ID10999
Feature activation+2.40e+00

Pos logit contributions
 curtains+0.018
 pieces+0.017
 many+0.017
 Some+0.017
 sky+0.017
Neg logit contributions
ah-0.046
iding-0.043
Deal-0.043
gh-0.043
 repeat-0.042
Token that
Token ID326
Feature activation-3.11e+00

Pos logit contributions
ah+0.123
gh+0.117
 repeat+0.113
Deal+0.110
et+0.107
Neg logit contributions
 Some-0.040
 curtains-0.039
edd-0.039
 rooms-0.036
Someone-0.036
Token was
Token ID373
Feature activation-6.01e-01

Pos logit contributions
 Some+0.054
 curtains+0.054
edd+0.053
rolley+0.050
 pieces+0.049
Neg logit contributions
ah-0.157
gh-0.147
Deal-0.145
 repeat-0.143
 pose-0.137
Token orange
Token ID10912
Feature activation+2.16e-01

Pos logit contributions
 curtains+0.011
 many+0.011
 Some+0.011
 pieces+0.010
edd+0.010
Neg logit contributions
ah-0.030
gh-0.028
Deal-0.028
 repeat-0.028
iding-0.027
Token.
Token ID13
Feature activation-5.23e-01

Pos logit contributions
ah+0.011
gh+0.011
Deal+0.011
iding+0.010
 repeat+0.010
Neg logit contributions
 curtains-0.004
 pieces-0.003
 many-0.003
 of-0.003
 Some-0.003
Token Lily
Token ID20037
Feature activation-1.29e+00

Pos logit contributions
edd+0.014
armac+0.013
 pieces+0.013
rolley+0.013
Someone+0.013
Neg logit contributions
ah-0.021
 repeat-0.020
Deal-0.019
gh-0.019
-0.018
Token wore
Token ID12408
Feature activation-7.99e-03

Pos logit contributions
 curtains+0.031
 Some+0.029
edd+0.029
 doorway+0.028
 Someone+0.028
Neg logit contributions
ah-0.059
gh-0.054
 repeat-0.054
Deal-0.054
et-0.050
Token phone
Token ID3072
Feature activation-6.12e-01

Pos logit contributions
 Some+0.017
 curtains+0.016
edd+0.015
 many+0.015
 Someone+0.015
Neg logit contributions
ah-0.057
gh-0.054
 repeat-0.053
Deal-0.052
iding-0.051
Token options
Token ID3689
Feature activation-2.30e+00

Pos logit contributions
 curtains+0.011
 pieces+0.009
 of+0.009
 sky+0.009
 many+0.009
Neg logit contributions
ah-0.034
gh-0.032
Deal-0.031
-0.030
iding-0.030
Token.
Token ID13
Feature activation+3.76e-01

Pos logit contributions
 of+0.037
 pieces+0.034
 many+0.033
 curtains+0.032
 everywhere+0.030
Neg logit contributions
ah-0.131
Deal-0.119
gh-0.117
-0.116
 repeat-0.116
Token She
Token ID1375
Feature activation+5.39e-01

Pos logit contributions
ah+0.016
 repeat+0.016
gh+0.015
Deal+0.015
iding+0.015
Neg logit contributions
edd-0.008
 pieces-0.008
 curtains-0.008
armac-0.007
rolley-0.007
Token was
Token ID373
Feature activation-6.86e-02

Pos logit contributions
ah+0.028
gh+0.027
iding+0.026
Deal+0.026
 repeat+0.026
Neg logit contributions
 curtains-0.008
 many-0.008
 pieces-0.008
 Some-0.008
 rooms-0.008
Token overwhelmed
Token ID20974
Feature activation-3.74e+00

Pos logit contributions
 curtains+0.001
 Some+0.001
 many+0.001
 pieces+0.001
 rooms+0.001
Neg logit contributions
ah-0.003
gh-0.003
Deal-0.003
 repeat-0.003
iding-0.003
Token!
Token ID0
Feature activation+2.03e-01

Pos logit contributions
 curtains+0.073
 Some+0.071
 many+0.068
 pieces+0.067
edd+0.067
Neg logit contributions
ah-0.183
gh-0.172
 repeat-0.167
Deal-0.165
iding-0.162
Token
Token ID220
Feature activation+4.75e-01

Pos logit contributions
ah+0.010
 repeat+0.009
gh+0.009
Deal+0.009
+0.009
Neg logit contributions
edd-0.004
 curtains-0.004
 pieces-0.004
 rooms-0.003
armac-0.003
Token
Token ID198
Feature activation-5.48e-02

Pos logit contributions
iding+0.024
ah+0.024
gh+0.023
 repeat+0.023
Deal+0.022
Neg logit contributions
 curtains-0.006
 doorway-0.006
 pieces-0.006
edd-0.006
 rooms-0.006
Token
Token ID198
Feature activation+3.59e-01

Pos logit contributions
 curtains+0.001
edd+0.001
 pieces+0.001
 rooms+0.001
 flames+0.001
Neg logit contributions
gh-0.003
ah-0.003
 repeat-0.003
iding-0.003
Deal-0.003
TokenFinally
Token ID11158
Feature activation+1.27e+00

Pos logit contributions
ah+0.015
gh+0.014
 repeat+0.014
Deal+0.013
iding+0.013
Neg logit contributions
 curtains-0.010
edd-0.009
 Some-0.009
 pieces-0.009
 rooms-0.009
Token't
Token ID470
Feature activation+2.40e-01

Pos logit contributions
gh+0.100
ah+0.098
 repeat+0.095
 pose+0.087
+0.085
Neg logit contributions
 rooms-0.059
 doorway-0.058
 curtains-0.058
rolley-0.057
 pieces-0.054
Token mean
Token ID1612
Feature activation+2.49e+00

Pos logit contributions
ah+0.010
gh+0.010
Deal+0.010
iding+0.010
et+0.009
Neg logit contributions
 of-0.006
 many-0.006
-0.006
 Some-0.006
 twins-0.006
Token to
Token ID284
Feature activation+3.28e+00

Pos logit contributions
ah+0.134
gh+0.128
 repeat+0.124
Deal+0.122
iding+0.120
Neg logit contributions
 curtains-0.035
 Some-0.034
edd-0.032
 doorway-0.032
 pieces-0.030
Token hurt
Token ID5938
Feature activation+2.50e-01

Pos logit contributions
ah+0.138
gh+0.128
iding+0.127
Deal+0.126
et+0.118
Neg logit contributions
-0.085
 Some-0.084
 many-0.084
 curtains-0.082
edd-0.080
Token your
Token ID534
Feature activation+1.72e+00

Pos logit contributions
ah+0.009
gh+0.008
 repeat+0.008
Deal+0.008
iding+0.007
Neg logit contributions
 curtains-0.008
 Some-0.008
 doorway-0.008
edd-0.008
 pieces-0.008
Token feelings
Token ID7666
Feature activation-3.21e+00

Pos logit contributions
ah+0.065
gh+0.060
 repeat+0.059
Deal+0.057
iding+0.055
Neg logit contributions
 Some-0.052
 curtains-0.052
 many-0.049
edd-0.049
 doorway-0.049
Token.
Token ID13
Feature activation+5.65e-01

Pos logit contributions
 of+0.057
 many+0.056
 pieces+0.054
 curtains+0.054
+0.053
Neg logit contributions
ah-0.173
gh-0.158
Deal-0.155
 repeat-0.150
iding-0.147
Token Let
Token ID3914
Feature activation+3.35e+00

Pos logit contributions
ah+0.025
gh+0.024
 repeat+0.022
iding+0.022
Deal+0.022
Neg logit contributions
 Some-0.015
 curtains-0.014
 Someone-0.013
edd-0.013
-0.013
Token's
Token ID338
Feature activation+2.11e+00

Pos logit contributions
ah+0.120
gh+0.114
 repeat+0.111
 insist+0.103
Deal+0.103
Neg logit contributions
 curtains-0.109
 Some-0.107
 doorway-0.103
 rooms-0.101
edd-0.101
Token forgive
Token ID20927
Feature activation-1.90e+00

Pos logit contributions
ah+0.088
iding+0.078
gh+0.078
Deal+0.076
 AW+0.072
Neg logit contributions
-0.061
 curtains-0.059
 family-0.058
 of-0.057
edd-0.057
Token him
Token ID683
Feature activation+6.36e-01

Pos logit contributions
 curtains+0.056
 Some+0.055
 many+0.054
edd+0.053
 pieces+0.053
Neg logit contributions
ah-0.074
gh-0.068
 repeat-0.065
Deal-0.065
iding-0.064
Token.
Token ID13
Feature activation-1.53e-01

Pos logit contributions
 curtains+0.023
 pieces+0.022
 many+0.021
 of+0.021
 sky+0.020
Neg logit contributions
ah-0.069
gh-0.064
 repeat-0.063
Deal-0.062
iding-0.061
Token She
Token ID1375
Feature activation+1.67e-01

Pos logit contributions
 curtains+0.002
 Some+0.002
edd+0.002
 pieces+0.002
 many+0.002
Neg logit contributions
ah-0.008
gh-0.008
 repeat-0.007
Deal-0.007
iding-0.007
Token remembered
Token ID12086
Feature activation-4.20e+00

Pos logit contributions
ah+0.008
gh+0.008
iding+0.007
Deal+0.007
 repeat+0.007
Neg logit contributions
 curtains-0.003
 pieces-0.003
 many-0.003
 Some-0.003
 rooms-0.003
Token that
Token ID326
Feature activation-2.62e+00

Pos logit contributions
 curtains+0.060
 Some+0.059
 many+0.054
 pieces+0.054
edd+0.053
Neg logit contributions
ah-0.226
gh-0.213
 repeat-0.208
Deal-0.206
iding-0.203
Token her
Token ID607
Feature activation+1.64e+00

Pos logit contributions
 curtains+0.029
 many+0.028
 pieces+0.026
 Some+0.026
 rooms+0.024
Neg logit contributions
ah-0.155
gh-0.146
iding-0.141
Deal-0.141
 repeat-0.140
Token mummy
Token ID38567
Feature activation-4.25e+00

Pos logit contributions
ah+0.097
gh+0.090
Deal+0.089
 repeat+0.088
iding+0.088
Neg logit contributions
 curtains-0.021
 pieces-0.018
 rooms-0.018
 many-0.016
 sky-0.016
Token and
Token ID290
Feature activation+3.11e-01

Pos logit contributions
 of+0.068
 curtains+0.068
 pieces+0.062
+0.062
 twins+0.060
Neg logit contributions
ah-0.219
gh-0.214
Deal-0.208
iding-0.207
astics-0.200
Token daddy
Token ID35822
Feature activation-3.02e+00

Pos logit contributions
ah+0.015
 repeat+0.015
 insist+0.014
 forth+0.014
gh+0.014
Neg logit contributions
 Some-0.005
armac-0.005
edd-0.005
rolley-0.005
 curtains-0.005
Token had
Token ID550
Feature activation-1.49e+00

Pos logit contributions
 of+0.047
 curtains+0.047
+0.044
 everywhere+0.043
 many+0.043
Neg logit contributions
ah-0.163
gh-0.155
Deal-0.151
iding-0.147
astics-0.146
Token asked
Token ID1965
Feature activation-1.32e+00

Pos logit contributions
 curtains+0.023
 many+0.023
 pieces+0.022
 Some+0.022
 rooms+0.020
Neg logit contributions
ah-0.081
gh-0.075
iding-0.074
Deal-0.074
 repeat-0.073
Token her
Token ID607
Feature activation+2.41e-01

Pos logit contributions
 curtains+0.017
 many+0.016
 Some+0.015
 of+0.014
+0.014
Neg logit contributions
ah-0.079
Deal-0.072
gh-0.071
iding-0.069
 repeat-0.069
Token,
Token ID11
Feature activation-1.80e+00

Pos logit contributions
 curtains+0.015
 many+0.015
 pieces+0.015
 Some+0.014
 rooms+0.014
Neg logit contributions
ah-0.048
gh-0.045
 repeat-0.044
Deal-0.043
iding-0.043
Token and
Token ID290
Feature activation-9.84e-01

Pos logit contributions
 many+0.032
 curtains+0.032
 Some+0.032
 of+0.032
 pieces+0.031
Neg logit contributions
ah-0.093
gh-0.088
Deal-0.084
iding-0.084
 repeat-0.081
Token to
Token ID284
Feature activation+2.69e+00

Pos logit contributions
 many+0.017
 curtains+0.016
 of+0.016
 Some+0.015
 pieces+0.015
Neg logit contributions
ah-0.052
gh-0.049
iding-0.048
Deal-0.046
et-0.046
Token think
Token ID892
Feature activation-2.41e+00

Pos logit contributions
ah+0.113
gh+0.104
iding+0.103
 AW+0.098
Deal+0.095
Neg logit contributions
 many-0.081
-0.073
 twins-0.072
 flowers-0.072
 of-0.070
Token of
Token ID286
Feature activation-7.89e-01

Pos logit contributions
 of+0.026
 many+0.022
 Some+0.017
 curtains+0.017
+0.016
Neg logit contributions
ah-0.153
Deal-0.145
gh-0.140
 repeat-0.138
-0.138
Token others
Token ID1854
Feature activation-3.89e+00

Pos logit contributions
 many+0.014
 curtains+0.014
 pieces+0.013
 Some+0.013
 rooms+0.012
Neg logit contributions
ah-0.041
gh-0.038
 repeat-0.037
iding-0.037
Deal-0.036
Token too
Token ID1165
Feature activation+5.74e-01

Pos logit contributions
 of+0.057
 many+0.055
+0.053
 curtains+0.050
 everywhere+0.048
Neg logit contributions
ah-0.223
gh-0.208
-0.195
Deal-0.194
iding-0.194
Token.
Token ID13
Feature activation+7.52e-01

Pos logit contributions
ah+0.031
gh+0.028
 repeat+0.028
Deal+0.028
+0.027
Neg logit contributions
 many-0.011
 of-0.010
 Some-0.010
-0.009
 curtains-0.009
Token So
Token ID1406
Feature activation+4.67e+00

Pos logit contributions
ah+0.039
gh+0.037
 repeat+0.036
Deal+0.036
iding+0.035
Neg logit contributions
 curtains-0.012
 Some-0.011
 pieces-0.011
edd-0.011
 many-0.011
Token,
Token ID11
Feature activation+3.31e-01

Pos logit contributions
ah+0.273
iding+0.266
gh+0.263
Deal+0.255
astics+0.248
Neg logit contributions
 many-0.063
-0.056
 of-0.053
 pieces-0.050
 sky-0.044
Token he
Token ID339
Feature activation+1.92e+00

Pos logit contributions
ah+0.020
gh+0.020
iding+0.019
Deal+0.019
et+0.018
Neg logit contributions
 many-0.003
-0.003
 of-0.003
 pieces-0.003
 Some-0.003