TOP ACTIVATIONS
MAX = 5.557

She said it had a funny smell and could be dangerous
laugh by telling him a funny story but John still wanted
ig ail had learned an important lesson . Next time ,
mailbox . Martha had a funny joke written on a card
the woman had was a funny noise maker . When Jack
of them were having a wonderful time together .
. Jack had learnt an important lesson that day ; that
the leash was having a difficult time because it couldn 't
both of them had a nice time . <|endoftext|> Jimmy was
, laughing and having a great time . When
small bird had learned an important lesson : when things seem
, the boy had a funny thought . He decided to
first ." Tim made a funny noise , " Be ep
mom had taught her an important lesson about trust . From
She said it was a stupid question . But
. He had learned an important lesson that day - that
home , she had a funny hat on her head .
The other friend had a funny hat and the two ran
boats together and had a funny and fun time on that
She soon had a clever plan - she would find

MIN ACTIVATIONS
MIN = -7.658

" What 's inside the boxes ?" The driver answered ,
see what 's in the boxes ?" " Yes
Can you please put the dolls back in the box when
" Can I touch the books he lf too ?" His
, why did you bury carrots under my blanket ? I
I help you balance those newspapers ?" The man was grateful
Can we see the big planes at the airport ?" Her
Why did you take my books ?" he asked .
, can we organize the vegetables ?" Her
the loop and put the trains away ?" she asks .
store and see the other horses ?" Mom smiled
about we play with your dolls or your blocks ? And
Hey , where are the eggs ?" Ben asked . "
I help you print some papers too ?" the prince asked
I help you with the eggs ?" Max asked . He
so I can reach the branches ?" she asked .
you help me crack the eggs and put them in the
Can you please put the dolls back in the box when
I help you load the boxes ?" asked Tim my .
" Can I put the apples on the dough ?" Tom

INTERVAL 2.253 - 5.557
CONTAINS 4.493%

like a frog , fl apping his arms like a butterfly
movie to watch . He wanted to watch them all .
milk , cookies and a new toy . Jon was happy
Jack and his mum had a wonderful time by the
in the ground . They wanted to see what was inside

INTERVAL -1.051 - 2.253
CONTAINS 75.422%

gun . " I got two !" The birds
pigs roll in the mud . But their favorite animals were
she needed to know how to prevent accidents from happening .
universe ?" Her mom chuckled and said ,
her friends did not join her . They watched her roll

INTERVAL -4.354 - -1.051
CONTAINS 19.786%

here , little one ? ” Bob answered without missing
“ John , why didn ’
asked again , " Are you polite ?" The tank still
. She loved to play in her garden every day .
<|endoftext|> Once upon a time , there was a little dog

INTERVAL -7.658 - -4.354
CONTAINS 0.299%

my , can I add clothes ?" Her mother replied
up for me !" <|endoftext|> Once upon a time , there
" Did you take my axe ? Did you hurt yourself
on her face . <|endoftext|> Once upon a time , there
all day long . <|endoftext|> Once upon a time , there
Token She
Token ID1375
Feature activation+1.18e-01

Pos logit contributions
ously+0.029
der+0.027
 words+0.026
aced+0.024
itted+0.024
Neg logit contributions
chard-0.030
nam-0.030
 Merry-0.027
shadow-0.026
 upon-0.026
Token said
Token ID531
Feature activation+1.50e+00

Pos logit contributions
ously+0.003
 words+0.003
der+0.003
itted+0.002
anked+0.002
Neg logit contributions
chard-0.003
nam-0.003
shadow-0.002
 Merry-0.002
anut-0.002
Token it
Token ID340
Feature activation+1.52e+00

Pos logit contributions
ously+0.028
 words+0.026
der+0.026
itted+0.025
eared+0.024
Neg logit contributions
nam-0.039
chard-0.039
shadow-0.036
 Merry-0.035
 thirsty-0.035
Token had
Token ID550
Feature activation+3.15e+00

Pos logit contributions
ously+0.040
der+0.038
 words+0.036
itted+0.035
eared+0.035
Neg logit contributions
chard-0.027
nam-0.027
shadow-0.024
 upon-0.024
 Merry-0.024
Token a
Token ID257
Feature activation+1.97e+00

Pos logit contributions
ously+0.058
der+0.052
 words+0.049
anked+0.047
itted+0.046
Neg logit contributions
nam-0.085
chard-0.084
shadow-0.079
 upon-0.077
 Merry-0.077
Token funny
Token ID8258
Feature activation+5.56e+00

Pos logit contributions
ously+0.044
der+0.042
 words+0.041
itted+0.038
anked+0.037
Neg logit contributions
chard-0.045
nam-0.045
shadow-0.041
 forget-0.039
 thirsty-0.039
Token smell
Token ID8508
Feature activation+1.16e+00

Pos logit contributions
der+0.135
ously+0.133
itted+0.116
anked+0.115
eared+0.115
Neg logit contributions
 thirsty-0.102
 upon-0.100
 ther-0.100
 cozy-0.098
chard-0.093
Token and
Token ID290
Feature activation-1.34e+00

Pos logit contributions
ously+0.025
der+0.022
 words+0.022
itted+0.022
anked+0.021
Neg logit contributions
chard-0.027
nam-0.026
 unatt-0.025
 Merry-0.024
 thirsty-0.024
Token could
Token ID714
Feature activation+1.87e+00

Pos logit contributions
chard+0.030
nam+0.030
shadow+0.029
anut+0.027
|+0.027
Neg logit contributions
 words-0.029
ously-0.029
der-0.028
itted-0.025
anked-0.025
Token be
Token ID307
Feature activation+6.52e-01

Pos logit contributions
 words+0.040
der+0.036
ously+0.035
anked+0.032
ield+0.031
Neg logit contributions
nam-0.045
shadow-0.044
chard-0.042
 Merry-0.039
anut-0.039
Token dangerous
Token ID4923
Feature activation+6.82e-01

Pos logit contributions
 words+0.015
ously+0.014
der+0.013
eared+0.012
itted+0.012
Neg logit contributions
nam-0.014
chard-0.014
shadow-0.014
 Merry-0.013
|-0.013
Token laugh
Token ID6487
Feature activation+1.21e+00

Pos logit contributions
ously+0.029
 words+0.028
der+0.028
itted+0.026
anked+0.025
Neg logit contributions
chard-0.022
nam-0.022
shadow-0.020
 Merry-0.019
anut-0.019
Token by
Token ID416
Feature activation-2.55e-02

Pos logit contributions
ously+0.025
 words+0.023
der+0.022
itted+0.022
anked+0.021
Neg logit contributions
nam-0.028
chard-0.028
shadow-0.027
 upon-0.025
 unatt-0.025
Token telling
Token ID5149
Feature activation+2.23e+00

Pos logit contributions
nam+0.001
chard+0.001
shadow+0.000
 Merry+0.000
 forget+0.000
Neg logit contributions
ously-0.001
 words-0.001
der-0.001
itted-0.001
anked-0.001
Token him
Token ID683
Feature activation+2.19e+00

Pos logit contributions
ously+0.054
itted+0.053
der+0.051
eared+0.046
aced+0.046
Neg logit contributions
nam-0.044
 Merry-0.040
chard-0.038
 forget-0.038
 upon-0.036
Token a
Token ID257
Feature activation+2.70e+00

Pos logit contributions
ously+0.043
der+0.040
itted+0.039
 words+0.038
anked+0.037
Neg logit contributions
nam-0.053
chard-0.052
shadow-0.049
 upon-0.048
 forget-0.048
Token funny
Token ID8258
Feature activation+5.53e+00

Pos logit contributions
ously+0.059
der+0.056
itted+0.054
anked+0.052
eared+0.052
Neg logit contributions
 forget-0.058
shadow-0.057
 Merry-0.056
 upon-0.056
chard-0.055
Token story
Token ID1621
Feature activation+1.34e+00

Pos logit contributions
ously+0.128
der+0.122
itted+0.116
anked+0.109
aced+0.109
Neg logit contributions
 thirsty-0.100
chard-0.099
 ther-0.099
 upon-0.099
 Merry-0.098
Token but
Token ID475
Feature activation-7.74e-01

Pos logit contributions
ously+0.029
itted+0.028
anked+0.026
der+0.026
eared+0.025
Neg logit contributions
chard-0.027
nam-0.026
shadow-0.026
 upon-0.026
 forget-0.025
Token John
Token ID1757
Feature activation+2.31e+00

Pos logit contributions
chard+0.016
nam+0.016
shadow+0.015
anut+0.014
 Merry+0.014
Neg logit contributions
ously-0.018
 words-0.017
der-0.017
itted-0.015
anked-0.015
Token still
Token ID991
Feature activation+1.18e+00

Pos logit contributions
ously+0.059
 words+0.057
der+0.056
itted+0.053
anked+0.052
Neg logit contributions
chard-0.043
nam-0.042
shadow-0.039
 forget-0.037
anut-0.037
Token wanted
Token ID2227
Feature activation+3.74e+00

Pos logit contributions
ously+0.030
 words+0.029
der+0.029
itted+0.026
anked+0.026
Neg logit contributions
chard-0.023
nam-0.023
shadow-0.021
 Merry-0.019
 forget-0.019
Tokenig
Token ID328
Feature activation-6.11e-01

Pos logit contributions
chard+0.068
nam+0.066
 Merry+0.058
|+0.056
 thirsty+0.056
Neg logit contributions
ously-0.050
 words-0.044
itted-0.042
anked-0.042
der-0.041
Tokenail
Token ID603
Feature activation+1.87e+00

Pos logit contributions
nam+0.012
chard+0.012
 Merry+0.010
shadow+0.010
 unatt+0.010
Neg logit contributions
ously-0.016
 words-0.015
der-0.015
itted-0.014
anked-0.014
Token had
Token ID550
Feature activation+3.10e+00

Pos logit contributions
ously+0.042
 words+0.040
der+0.039
itted+0.035
anked+0.035
Neg logit contributions
nam-0.043
chard-0.043
shadow-0.040
 Merry-0.037
anut-0.036
Token learned
Token ID4499
Feature activation+2.25e+00

Pos logit contributions
ously+0.063
der+0.060
 words+0.059
itted+0.054
anked+0.053
Neg logit contributions
nam-0.076
chard-0.076
shadow-0.071
 Merry-0.068
 forget-0.067
Token an
Token ID281
Feature activation+3.09e+00

Pos logit contributions
ously+0.050
itted+0.045
der+0.045
 words+0.044
anked+0.043
Neg logit contributions
nam-0.051
chard-0.049
shadow-0.046
 upon-0.045
 Merry-0.044
Token important
Token ID1593
Feature activation+5.49e+00

Pos logit contributions
ously+0.075
der+0.066
itted+0.065
 words+0.064
anked+0.062
Neg logit contributions
nam-0.064
chard-0.063
shadow-0.060
 forget-0.057
anut-0.057
Token lesson
Token ID11483
Feature activation+2.07e+00

Pos logit contributions
ously+0.124
der+0.117
anked+0.113
eared+0.112
itted+0.109
Neg logit contributions
|-0.108
 forget-0.106
 Merry-0.103
shadow-0.101
 cozy-0.101
Token.
Token ID13
Feature activation+4.66e-01

Pos logit contributions
ously+0.037
itted+0.036
 words+0.034
der+0.034
anked+0.032
Neg logit contributions
chard-0.049
nam-0.049
shadow-0.048
 Merry-0.048
 upon-0.047
Token Next
Token ID7406
Feature activation+1.71e+00

Pos logit contributions
ously+0.010
 words+0.009
der+0.009
itted+0.008
anked+0.008
Neg logit contributions
chard-0.011
nam-0.011
shadow-0.010
 unatt-0.010
anut-0.010
Token time
Token ID640
Feature activation+1.73e-01

Pos logit contributions
ously+0.035
der+0.033
 words+0.031
itted+0.031
anked+0.029
Neg logit contributions
chard-0.040
 forget-0.040
 Merry-0.040
shadow-0.039
nam-0.039
Token,
Token ID11
Feature activation-7.22e-01

Pos logit contributions
ously+0.002
 words+0.002
der+0.002
itted+0.002
anked+0.002
Neg logit contributions
chard-0.005
nam-0.005
anut-0.005
shadow-0.005
|-0.005
Token mailbox
Token ID37282
Feature activation+8.80e-02

Pos logit contributions
nam+0.006
chard+0.006
 thirsty+0.004
 forget+0.004
 upon+0.004
Neg logit contributions
ously-0.010
der-0.009
itted-0.009
 words-0.009
anked-0.009
Token.
Token ID13
Feature activation+1.34e+00

Pos logit contributions
der+0.002
ously+0.002
anked+0.002
itted+0.002
 words+0.002
Neg logit contributions
chard-0.002
nam-0.002
 upon-0.002
 thirsty-0.002
 forth-0.002
Token Martha
Token ID27243
Feature activation+1.24e+00

Pos logit contributions
der+0.029
ously+0.029
 words+0.028
anked+0.025
itted+0.024
Neg logit contributions
chard-0.029
nam-0.029
 Merry-0.028
 upon-0.027
shadow-0.026
Token had
Token ID550
Feature activation+2.67e+00

Pos logit contributions
ously+0.031
 words+0.030
der+0.030
itted+0.028
anked+0.027
Neg logit contributions
chard-0.024
nam-0.024
shadow-0.022
 Merry-0.020
 upon-0.020
Token a
Token ID257
Feature activation+2.46e+00

Pos logit contributions
ously+0.065
der+0.063
 words+0.057
anked+0.057
itted+0.057
Neg logit contributions
nam-0.056
chard-0.055
shadow-0.052
 upon-0.050
 forget-0.050
Token funny
Token ID8258
Feature activation+5.48e+00

Pos logit contributions
ously+0.059
der+0.057
 words+0.053
itted+0.051
anked+0.050
Neg logit contributions
chard-0.054
nam-0.053
shadow-0.048
 forget-0.047
 upon-0.047
Token joke
Token ID9707
Feature activation+1.94e+00

Pos logit contributions
der+0.120
ously+0.105
eared+0.091
anked+0.090
itted+0.086
Neg logit contributions
 thirsty-0.124
 upon-0.123
 cozy-0.122
nam-0.121
chard-0.119
Token written
Token ID3194
Feature activation+1.46e-01

Pos logit contributions
ously+0.040
itted+0.039
anked+0.038
 words+0.037
aced+0.036
Neg logit contributions
chard-0.036
 unatt-0.035
 upon-0.034
 under-0.033
 thirsty-0.032
Token on
Token ID319
Feature activation+2.77e-01

Pos logit contributions
ously+0.004
 words+0.004
der+0.004
itted+0.003
anked+0.003
Neg logit contributions
chard-0.003
nam-0.003
shadow-0.002
 Merry-0.002
 upon-0.002
Token a
Token ID257
Feature activation+4.11e-01

Pos logit contributions
ously+0.007
itted+0.007
der+0.007
eared+0.007
 words+0.006
Neg logit contributions
nam-0.005
chard-0.005
 Merry-0.004
 upon-0.004
anut-0.004
Token card
Token ID2657
Feature activation+2.06e-01

Pos logit contributions
ously+0.008
der+0.008
 words+0.008
itted+0.007
anked+0.007
Neg logit contributions
chard-0.010
nam-0.010
shadow-0.010
 Merry-0.009
 forget-0.009
Token the
Token ID262
Feature activation-4.84e-01

Pos logit contributions
ously+0.006
der+0.006
 words+0.006
itted+0.006
anked+0.005
Neg logit contributions
chard-0.005
nam-0.005
illas-0.005
 forget-0.005
shadow-0.005
Token woman
Token ID2415
Feature activation+9.80e-02

Pos logit contributions
nam+0.010
chard+0.010
 forget+0.009
 Merry+0.009
shadow+0.009
Neg logit contributions
ously-0.012
der-0.011
anked-0.011
itted-0.010
 words-0.010
Token had
Token ID550
Feature activation+3.12e+00

Pos logit contributions
ously+0.002
 words+0.002
der+0.002
itted+0.002
anked+0.002
Neg logit contributions
nam-0.002
chard-0.002
shadow-0.002
 Merry-0.002
anut-0.002
Token was
Token ID373
Feature activation+1.23e+00

Pos logit contributions
ously+0.078
der+0.077
 words+0.072
itted+0.071
anked+0.069
Neg logit contributions
nam-0.057
chard-0.056
shadow-0.055
 forget-0.053
 thirsty-0.052
Token a
Token ID257
Feature activation+2.19e+00

Pos logit contributions
ously+0.030
 words+0.029
der+0.028
itted+0.026
anked+0.025
Neg logit contributions
nam-0.026
chard-0.026
shadow-0.024
 Merry-0.022
|-0.022
Token funny
Token ID8258
Feature activation+5.36e+00

Pos logit contributions
ously+0.053
der+0.051
 words+0.050
itted+0.048
anked+0.047
Neg logit contributions
chard-0.046
nam-0.045
 forget-0.041
shadow-0.041
 thirsty-0.040
Token noise
Token ID7838
Feature activation+1.80e+00

Pos logit contributions
der+0.131
ously+0.129
anked+0.122
itted+0.120
aced+0.117
Neg logit contributions
 thirsty-0.089
 upon-0.086
 forget-0.085
 ther-0.084
 cozy-0.081
Token maker
Token ID16009
Feature activation+2.15e-01

Pos logit contributions
ously+0.040
 words+0.039
itted+0.039
der+0.037
anked+0.036
Neg logit contributions
chard-0.036
 unatt-0.035
nam-0.033
 upon-0.032
shadow-0.032
Token.
Token ID13
Feature activation+9.52e-01

Pos logit contributions
ously+0.004
 words+0.004
der+0.004
itted+0.004
anked+0.004
Neg logit contributions
chard-0.005
nam-0.005
shadow-0.005
 unatt-0.005
 upon-0.004
Token When
Token ID1649
Feature activation-2.37e-01

Pos logit contributions
ously+0.021
 words+0.020
der+0.019
itted+0.018
aced+0.017
Neg logit contributions
chard-0.022
nam-0.021
 Merry-0.019
 upon-0.019
shadow-0.019
Token Jack
Token ID3619
Feature activation+1.47e+00

Pos logit contributions
chard+0.005
nam+0.005
shadow+0.005
anut+0.005
illas+0.005
Neg logit contributions
ously-0.005
 words-0.005
der-0.005
anked-0.004
itted-0.004
Token of
Token ID286
Feature activation-6.04e-01

Pos logit contributions
der+0.045
ously+0.044
itted+0.041
anked+0.041
 words+0.041
Neg logit contributions
nam-0.042
 forget-0.041
chard-0.039
 upon-0.039
 thirsty-0.038
Token them
Token ID606
Feature activation+7.63e-01

Pos logit contributions
nam+0.012
chard+0.011
 Merry+0.010
shadow+0.010
 upon+0.010
Neg logit contributions
ously-0.015
der-0.015
 words-0.014
itted-0.014
anked-0.013
Token were
Token ID547
Feature activation+1.36e+00

Pos logit contributions
ously+0.019
 words+0.018
der+0.018
itted+0.017
anked+0.016
Neg logit contributions
chard-0.015
nam-0.015
shadow-0.014
 Merry-0.013
 forget-0.013
Token having
Token ID1719
Feature activation+4.15e+00

Pos logit contributions
ously+0.033
der+0.032
 words+0.032
itted+0.029
anked+0.028
Neg logit contributions
chard-0.028
nam-0.028
shadow-0.026
 Merry-0.024
 forget-0.024
Token a
Token ID257
Feature activation+3.17e+00

Pos logit contributions
ously+0.101
der+0.098
anked+0.088
itted+0.087
ield+0.084
Neg logit contributions
nam-0.087
chard-0.079
 thirsty-0.079
 Merry-0.076
 upon-0.076
Token wonderful
Token ID7932
Feature activation+5.34e+00

Pos logit contributions
ously+0.087
der+0.084
 words+0.081
anked+0.077
itted+0.077
Neg logit contributions
chard-0.055
nam-0.055
 forget-0.052
shadow-0.050
 thirsty-0.049
Token time
Token ID640
Feature activation+9.13e-01

Pos logit contributions
der+0.150
ously+0.141
anked+0.123
itted+0.120
 words+0.119
Neg logit contributions
 cozy-0.088
|-0.087
 upon-0.087
 forget-0.085
 thirsty-0.081
Token together
Token ID1978
Feature activation+1.25e+00

Pos logit contributions
ously+0.020
 words+0.019
der+0.018
anked+0.017
itted+0.017
Neg logit contributions
nam-0.020
chard-0.020
shadow-0.019
 upon-0.018
 thirsty-0.018
Token.
Token ID13
Feature activation+2.81e-01

Pos logit contributions
ously+0.026
der+0.025
 words+0.025
itted+0.024
anked+0.023
Neg logit contributions
chard-0.028
nam-0.027
 upon-0.025
shadow-0.025
 Merry-0.025
Token
Token ID198
Feature activation-9.47e-01

Pos logit contributions
ously+0.006
der+0.006
 words+0.006
itted+0.005
anked+0.005
Neg logit contributions
nam-0.007
chard-0.007
shadow-0.006
 unatt-0.006
anut-0.006
Token
Token ID198
Feature activation-2.33e-01

Pos logit contributions
nam+0.021
chard+0.021
shadow+0.018
 unatt+0.018
 forget+0.018
Neg logit contributions
ously-0.022
 words-0.021
der-0.020
itted-0.019
anked-0.019
Token.
Token ID13
Feature activation+1.22e+00

Pos logit contributions
ously+0.007
 words+0.007
der+0.007
itted+0.007
anked+0.006
Neg logit contributions
chard-0.008
nam-0.008
 upon-0.007
 Merry-0.007
 forget-0.007
Token Jack
Token ID3619
Feature activation+2.36e+00

Pos logit contributions
ously+0.025
 words+0.023
der+0.023
itted+0.021
anked+0.020
Neg logit contributions
chard-0.029
nam-0.029
shadow-0.028
 Merry-0.027
 upon-0.027
Token had
Token ID550
Feature activation+3.12e+00

Pos logit contributions
ously+0.055
 words+0.053
der+0.052
itted+0.048
anked+0.048
Neg logit contributions
chard-0.050
nam-0.049
shadow-0.046
 forget-0.043
 Merry-0.043
Token learnt
Token ID26338
Feature activation+2.70e+00

Pos logit contributions
ously+0.072
der+0.070
 words+0.067
anked+0.064
itted+0.063
Neg logit contributions
nam-0.067
chard-0.067
shadow-0.062
 thirsty-0.061
 forget-0.061
Token an
Token ID281
Feature activation+2.87e+00

Pos logit contributions
ously+0.063
itted+0.060
der+0.056
anked+0.055
 words+0.055
Neg logit contributions
nam-0.055
chard-0.052
 upon-0.050
shadow-0.048
 Merry-0.046
Token important
Token ID1593
Feature activation+5.34e+00

Pos logit contributions
ously+0.068
der+0.060
itted+0.059
 words+0.059
anked+0.057
Neg logit contributions
nam-0.061
chard-0.060
shadow-0.058
anut-0.053
 forget-0.053
Token lesson
Token ID11483
Feature activation+2.25e+00

Pos logit contributions
ously+0.123
der+0.116
eared+0.113
anked+0.111
itted+0.109
Neg logit contributions
|-0.107
 upon-0.093
shadow-0.092
 forget-0.092
 Merry-0.089
Token that
Token ID326
Feature activation+2.54e-01

Pos logit contributions
itted+0.042
ously+0.041
anked+0.038
 words+0.037
der+0.037
Neg logit contributions
chard-0.049
 upon-0.049
nam-0.048
shadow-0.047
 Merry-0.047
Token day
Token ID1110
Feature activation+1.17e+00

Pos logit contributions
ously+0.005
 words+0.005
der+0.005
itted+0.005
anked+0.004
Neg logit contributions
chard-0.006
nam-0.006
shadow-0.006
 Merry-0.005
 forget-0.005
Token;
Token ID26
Feature activation-5.25e-01

Pos logit contributions
ously+0.021
 words+0.020
der+0.020
itted+0.019
eared+0.018
Neg logit contributions
nam-0.028
chard-0.027
shadow-0.027
 Merry-0.027
 upon-0.026
Token that
Token ID326
Feature activation-5.24e-01

Pos logit contributions
nam+0.010
chard+0.010
shadow+0.010
 forget+0.009
 upon+0.009
Neg logit contributions
ously-0.013
der-0.012
 words-0.012
itted-0.012
anked-0.011
Token the
Token ID262
Feature activation-2.15e-02

Pos logit contributions
chard+0.005
nam+0.005
shadow+0.004
 Merry+0.004
 upon+0.004
Neg logit contributions
ously-0.010
der-0.010
 words-0.009
itted-0.009
eared-0.009
Token leash
Token ID40529
Feature activation+1.35e+00

Pos logit contributions
nam+0.000
chard+0.000
 forget+0.000
shadow+0.000
 Merry+0.000
Neg logit contributions
ously-0.001
der-0.000
itted-0.000
 words-0.000
anked-0.000
Token was
Token ID373
Feature activation+1.01e+00

Pos logit contributions
ously+0.033
 words+0.033
der+0.032
itted+0.031
anked+0.030
Neg logit contributions
chard-0.025
nam-0.024
shadow-0.022
 forget-0.022
 upon-0.022
Token having
Token ID1719
Feature activation+3.98e+00

Pos logit contributions
ously+0.025
 words+0.024
der+0.024
itted+0.022
anked+0.021
Neg logit contributions
chard-0.021
nam-0.020
shadow-0.019
 thirsty-0.018
 forget-0.017
Token a
Token ID257
Feature activation+2.44e+00

Pos logit contributions
ously+0.100
der+0.095
itted+0.087
anked+0.085
 words+0.083
Neg logit contributions
nam-0.076
 thirsty-0.071
 upon-0.069
chard-0.069
 cozy-0.067
Token difficult
Token ID2408
Feature activation+5.34e+00

Pos logit contributions
ously+0.065
der+0.063
 words+0.061
itted+0.060
anked+0.059
Neg logit contributions
 forget-0.044
chard-0.042
shadow-0.042
nam-0.041
 cozy-0.041
Token time
Token ID640
Feature activation+2.00e+00

Pos logit contributions
der+0.132
ously+0.127
itted+0.126
eared+0.125
red+0.112
Neg logit contributions
 forget-0.093
 forth-0.083
|-0.078
 have-0.077
 thirsty-0.076
Token because
Token ID780
Feature activation-1.12e+00

Pos logit contributions
ously+0.040
 words+0.040
der+0.039
anked+0.038
itted+0.038
Neg logit contributions
 upon-0.041
shadow-0.040
chard-0.038
nam-0.038
 forget-0.038
Token it
Token ID340
Feature activation+4.65e-01

Pos logit contributions
chard+0.025
nam+0.025
shadow+0.023
anut+0.022
 Merry+0.022
Neg logit contributions
ously-0.025
 words-0.024
der-0.024
itted-0.021
anked-0.021
Token couldn
Token ID3521
Feature activation+2.98e+00

Pos logit contributions
ously+0.011
 words+0.011
der+0.011
itted+0.010
anked+0.009
Neg logit contributions
nam-0.010
chard-0.010
shadow-0.009
 Merry-0.009
anut-0.008
Token't
Token ID470
Feature activation+2.73e+00

Pos logit contributions
ously+0.071
 words+0.067
der+0.066
itted+0.062
eared+0.059
Neg logit contributions
chard-0.062
nam-0.061
shadow-0.058
 Merry-0.055
|-0.055
Token both
Token ID1111
Feature activation+5.75e-01

Pos logit contributions
chard+0.015
nam+0.015
shadow+0.014
|+0.013
anut+0.013
Neg logit contributions
ously-0.017
 words-0.016
der-0.016
itted-0.015
anked-0.014
Token of
Token ID286
Feature activation-2.18e-01

Pos logit contributions
ously+0.014
 words+0.014
der+0.014
itted+0.013
anked+0.013
Neg logit contributions
nam-0.011
chard-0.011
shadow-0.010
 Merry-0.010
 forget-0.010
Token them
Token ID606
Feature activation+3.86e-01

Pos logit contributions
nam+0.004
chard+0.004
 Merry+0.003
|+0.003
 upon+0.003
Neg logit contributions
der-0.006
ously-0.006
 words-0.006
itted-0.006
ield-0.005
Token had
Token ID550
Feature activation+3.81e+00

Pos logit contributions
ously+0.009
 words+0.009
der+0.009
itted+0.008
anked+0.008
Neg logit contributions
chard-0.008
nam-0.008
shadow-0.007
 Merry-0.007
 forget-0.007
Token a
Token ID257
Feature activation+2.76e+00

Pos logit contributions
der+0.095
ously+0.093
anked+0.085
itted+0.082
 words+0.081
Neg logit contributions
nam-0.071
 cozy-0.069
chard-0.068
 forget-0.067
shadow-0.067
Token nice
Token ID3621
Feature activation+5.34e+00

Pos logit contributions
ously+0.071
der+0.069
 words+0.067
anked+0.063
itted+0.063
Neg logit contributions
chard-0.053
nam-0.052
shadow-0.048
 forget-0.048
 Merry-0.046
Token time
Token ID640
Feature activation+1.17e+00

Pos logit contributions
der+0.169
anked+0.147
ously+0.144
eared+0.142
itted+0.138
Neg logit contributions
 cozy-0.078
 forget-0.075
 upon-0.071
 thirsty-0.067
bie-0.065
Token.
Token ID13
Feature activation+3.93e-01

Pos logit contributions
ously+0.025
 words+0.023
der+0.023
anked+0.022
itted+0.021
Neg logit contributions
nam-0.026
chard-0.026
shadow-0.024
 upon-0.024
 thirsty-0.024
Token<|endoftext|>
Token ID50256
Feature activation-1.78e+00

Pos logit contributions
ously+0.009
 words+0.008
der+0.008
itted+0.008
anked+0.007
Neg logit contributions
chard-0.009
nam-0.009
shadow-0.008
anut-0.008
 unatt-0.008
TokenJimmy
Token ID40335
Feature activation+4.53e-01

Pos logit contributions
nam+0.040
chard+0.039
shadow+0.037
anut+0.034
 forget+0.034
Neg logit contributions
ously-0.041
 words-0.041
der-0.038
itted-0.036
anked-0.035
Token was
Token ID373
Feature activation-1.76e+00

Pos logit contributions
ously+0.009
 words+0.009
der+0.008
itted+0.008
anked+0.008
Neg logit contributions
nam-0.011
chard-0.011
shadow-0.011
 Merry-0.010
 unatt-0.010
Token,
Token ID11
Feature activation-8.20e-01

Pos logit contributions
chard+0.001
 upon+0.001
nam+0.001
shadow+0.001
 forget+0.001
Neg logit contributions
 words-0.002
ously-0.001
der-0.001
eared-0.001
itted-0.001
Token laughing
Token ID14376
Feature activation+2.23e+00

Pos logit contributions
chard+0.018
nam+0.018
shadow+0.017
 Merry+0.016
 forget+0.016
Neg logit contributions
ously-0.019
 words-0.018
der-0.018
itted-0.016
anked-0.016
Token and
Token ID290
Feature activation-7.13e-01

Pos logit contributions
ously+0.044
der+0.040
anked+0.040
itted+0.040
 words+0.039
Neg logit contributions
nam-0.050
chard-0.049
 upon-0.047
 Merry-0.045
 thirsty-0.045
Token having
Token ID1719
Feature activation+3.45e+00

Pos logit contributions
chard+0.013
nam+0.013
shadow+0.012
 Merry+0.011
anut+0.011
Neg logit contributions
ously-0.018
 words-0.018
der-0.017
itted-0.016
anked-0.016
Token a
Token ID257
Feature activation+3.17e+00

Pos logit contributions
ously+0.080
der+0.080
anked+0.072
itted+0.071
 words+0.069
Neg logit contributions
nam-0.075
chard-0.070
 thirsty-0.067
 Merry-0.067
 forget-0.066
Token great
Token ID1049
Feature activation+5.32e+00

Pos logit contributions
ously+0.079
der+0.076
 words+0.075
itted+0.070
anked+0.069
Neg logit contributions
chard-0.063
nam-0.063
shadow-0.058
 forget-0.056
 Merry-0.055
Token time
Token ID640
Feature activation+1.37e+00

Pos logit contributions
der+0.143
ously+0.134
anked+0.122
 words+0.122
itted+0.119
Neg logit contributions
 forget-0.098
|-0.092
 upon-0.092
 thirsty-0.089
 cozy-0.085
Token.
Token ID13
Feature activation+9.87e-01

Pos logit contributions
ously+0.028
 words+0.027
der+0.027
itted+0.025
anked+0.024
Neg logit contributions
nam-0.032
chard-0.031
shadow-0.030
 upon-0.028
 Merry-0.028
Token
Token ID198
Feature activation-8.31e-01

Pos logit contributions
ously+0.021
 words+0.020
der+0.019
itted+0.017
anked+0.017
Neg logit contributions
chard-0.024
nam-0.023
shadow-0.022
 Merry-0.021
|-0.021
Token
Token ID198
Feature activation-1.65e-01

Pos logit contributions
nam+0.022
chard+0.022
 unatt+0.019
shadow+0.019
 forget+0.019
Neg logit contributions
ously-0.016
 words-0.015
der-0.014
itted-0.014
anked-0.013
TokenWhen
Token ID2215
Feature activation+3.45e-01

Pos logit contributions
nam+0.004
chard+0.004
shadow+0.003
 unatt+0.003
 thirsty+0.003
Neg logit contributions
ously-0.004
 words-0.003
der-0.003
itted-0.003
anked-0.003
Token small
Token ID1402
Feature activation+1.66e+00

Pos logit contributions
nam+0.025
chard+0.025
shadow+0.024
 Merry+0.022
|+0.022
Neg logit contributions
 words-0.015
ously-0.015
der-0.014
itted-0.012
anked-0.012
Token bird
Token ID6512
Feature activation+1.15e+00

Pos logit contributions
ously+0.037
der+0.035
 words+0.032
itted+0.032
anked+0.032
Neg logit contributions
chard-0.039
nam-0.038
 upon-0.034
shadow-0.034
 thirsty-0.034
Token had
Token ID550
Feature activation+3.25e+00

Pos logit contributions
ously+0.026
 words+0.025
der+0.024
itted+0.022
anked+0.021
Neg logit contributions
nam-0.027
chard-0.026
shadow-0.024
 Merry-0.023
anut-0.022
Token learned
Token ID4499
Feature activation+2.41e+00

Pos logit contributions
ously+0.077
der+0.075
 words+0.070
anked+0.069
itted+0.067
Neg logit contributions
nam-0.067
chard-0.066
shadow-0.063
 upon-0.061
 forget-0.060
Token an
Token ID281
Feature activation+3.19e+00

Pos logit contributions
ously+0.055
itted+0.051
der+0.050
 words+0.048
anked+0.048
Neg logit contributions
nam-0.052
chard-0.049
shadow-0.048
 upon-0.046
 Merry-0.044
Token important
Token ID1593
Feature activation+5.32e+00

Pos logit contributions
ously+0.080
der+0.071
itted+0.071
anked+0.068
 words+0.066
Neg logit contributions
nam-0.062
shadow-0.060
chard-0.059
 forget-0.055
anut-0.054
Token lesson
Token ID11483
Feature activation+2.38e+00

Pos logit contributions
der+0.116
ously+0.114
eared+0.108
anked+0.106
itted+0.102
Neg logit contributions
 cozy-0.105
 upon-0.101
shadow-0.101
 forget-0.100
chard-0.096
Token:
Token ID25
Feature activation+3.21e-01

Pos logit contributions
ously+0.043
itted+0.043
der+0.039
anked+0.039
 words+0.038
Neg logit contributions
shadow-0.054
 upon-0.054
chard-0.054
nam-0.053
 Merry-0.052
Token when
Token ID618
Feature activation-5.93e-01

Pos logit contributions
ously+0.007
 words+0.007
der+0.007
itted+0.006
anked+0.006
Neg logit contributions
chard-0.007
nam-0.007
shadow-0.006
 Merry-0.006
 upon-0.006
Token things
Token ID1243
Feature activation-2.27e-01

Pos logit contributions
chard+0.011
nam+0.011
shadow+0.010
|+0.009
anut+0.009
Neg logit contributions
 words-0.015
ously-0.015
der-0.015
anked-0.014
itted-0.014
Token seem
Token ID1283
Feature activation+1.10e+00

Pos logit contributions
chard+0.004
nam+0.004
shadow+0.004
 Merry+0.004
anut+0.003
Neg logit contributions
ously-0.006
 words-0.006
der-0.006
itted-0.005
anked-0.005
Token,
Token ID11
Feature activation-6.54e-01

Pos logit contributions
ously+0.015
 words+0.014
der+0.014
itted+0.012
anked+0.012
Neg logit contributions
chard-0.023
nam-0.023
shadow-0.021
anut-0.020
|-0.020
Token the
Token ID262
Feature activation-3.64e-01

Pos logit contributions
chard+0.014
nam+0.014
shadow+0.014
|+0.013
anut+0.013
Neg logit contributions
ously-0.014
 words-0.014
der-0.013
anked-0.012
itted-0.012
Token boy
Token ID2933
Feature activation+1.98e+00

Pos logit contributions
chard+0.008
nam+0.008
shadow+0.007
 forget+0.007
 Merry+0.007
Neg logit contributions
ously-0.009
der-0.008
 words-0.008
itted-0.008
anked-0.008
Token had
Token ID550
Feature activation+2.92e+00

Pos logit contributions
ously+0.048
 words+0.046
der+0.045
itted+0.042
anked+0.041
Neg logit contributions
chard-0.041
nam-0.041
shadow-0.038
 Merry-0.035
anut-0.035
Token a
Token ID257
Feature activation+2.78e+00

Pos logit contributions
ously+0.064
der+0.059
 words+0.057
anked+0.054
itted+0.054
Neg logit contributions
chard-0.068
nam-0.068
 upon-0.063
 thirsty-0.062
shadow-0.062
Token funny
Token ID8258
Feature activation+5.31e+00

Pos logit contributions
ously+0.066
der+0.064
 words+0.061
itted+0.059
anked+0.058
Neg logit contributions
chard-0.061
nam-0.060
 forget-0.055
shadow-0.054
 thirsty-0.054
Token thought
Token ID1807
Feature activation+2.29e+00

Pos logit contributions
der+0.114
ously+0.111
itted+0.096
eared+0.096
anked+0.095
Neg logit contributions
 thirsty-0.112
 forget-0.108
chard-0.105
 forth-0.104
 upon-0.103
Token.
Token ID13
Feature activation+1.87e+00

Pos logit contributions
ously+0.044
der+0.043
itted+0.042
 words+0.041
anked+0.040
Neg logit contributions
chard-0.053
nam-0.051
 upon-0.049
shadow-0.049
 Merry-0.049
Token He
Token ID679
Feature activation+3.23e-01

Pos logit contributions
ously+0.048
 words+0.041
der+0.040
aced+0.040
itted+0.040
Neg logit contributions
chard-0.035
nam-0.033
 Merry-0.033
 upon-0.031
shadow-0.030
Token decided
Token ID3066
Feature activation+2.07e+00

Pos logit contributions
ously+0.007
 words+0.007
der+0.007
itted+0.006
anked+0.006
Neg logit contributions
nam-0.007
chard-0.007
shadow-0.007
 Merry-0.006
anut-0.006
Token to
Token ID284
Feature activation+1.17e+00

Pos logit contributions
ously+0.034
itted+0.032
der+0.031
 words+0.031
anked+0.030
Neg logit contributions
chard-0.059
nam-0.057
shadow-0.055
 upon-0.052
 forget-0.051
Token first
Token ID717
Feature activation+2.69e-03

Pos logit contributions
nam+0.017
chard+0.016
 Merry+0.015
 unatt+0.015
 ther+0.015
Neg logit contributions
 words-0.011
ously-0.009
eared-0.009
der-0.008
 example-0.008
Token."
Token ID526
Feature activation+1.25e+00

Pos logit contributions
ously+0.000
der+0.000
 words+0.000
itted+0.000
anked+0.000
Neg logit contributions
chard-0.000
nam-0.000
shadow-0.000
 upon-0.000
 forget-0.000
Token Tim
Token ID5045
Feature activation+1.97e+00

Pos logit contributions
der+0.039
ously+0.039
 words+0.036
itted+0.035
aced+0.035
Neg logit contributions
chard-0.015
shadow-0.014
 thirsty-0.014
 Merry-0.014
 upon-0.013
Token made
Token ID925
Feature activation+2.05e+00

Pos logit contributions
der+0.046
ously+0.046
 words+0.043
itted+0.042
anked+0.042
Neg logit contributions
chard-0.038
 forgot-0.037
 forget-0.037
 upon-0.037
nam-0.037
Token a
Token ID257
Feature activation+2.65e+00

Pos logit contributions
ously+0.055
der+0.051
itted+0.047
 words+0.045
anked+0.043
Neg logit contributions
nam-0.037
chard-0.036
shadow-0.036
 thirsty-0.034
 upon-0.034
Token funny
Token ID8258
Feature activation+5.31e+00

Pos logit contributions
ously+0.058
der+0.054
 words+0.053
itted+0.050
anked+0.048
Neg logit contributions
nam-0.062
chard-0.062
shadow-0.057
 forget-0.055
 Merry-0.054
Token noise
Token ID7838
Feature activation+2.14e+00

Pos logit contributions
ously+0.132
der+0.122
itted+0.114
anked+0.109
eared+0.108
Neg logit contributions
 forget-0.099
nam-0.095
 thirsty-0.095
 forth-0.094
chard-0.093
Token,
Token ID11
Feature activation-1.35e+00

Pos logit contributions
ously+0.055
itted+0.050
anked+0.048
aced+0.047
der+0.047
Neg logit contributions
chard-0.038
 unatt-0.037
 thirsty-0.036
 forget-0.036
nam-0.035
Token "
Token ID366
Feature activation+6.40e-01

Pos logit contributions
chard+0.028
nam+0.028
 thirsty+0.025
shadow+0.025
 forget+0.025
Neg logit contributions
ously-0.033
der-0.030
 words-0.030
itted-0.029
anked-0.029
TokenBe
Token ID3856
Feature activation+5.65e-01

Pos logit contributions
ously+0.015
 words+0.015
der+0.014
itted+0.013
anked+0.013
Neg logit contributions
chard-0.013
nam-0.013
shadow-0.012
 Merry-0.012
 forget-0.011
Tokenep
Token ID538
Feature activation+2.04e-01

Pos logit contributions
 words+0.013
ously+0.013
der+0.012
itted+0.011
anked+0.011
Neg logit contributions
chard-0.013
nam-0.012
 Merry-0.011
|-0.011
shadow-0.011
Token mom
Token ID1995
Feature activation+1.85e+00

Pos logit contributions
ously+0.005
der+0.005
 words+0.004
anked+0.004
itted+0.004
Neg logit contributions
nam-0.003
chard-0.003
shadow-0.003
 forget-0.003
 thirsty-0.002
Token had
Token ID550
Feature activation+3.45e+00

Pos logit contributions
ously+0.051
der+0.050
 words+0.050
itted+0.048
anked+0.047
Neg logit contributions
chard-0.029
nam-0.028
shadow-0.027
 forget-0.025
anut-0.025
Token taught
Token ID7817
Feature activation+3.28e+00

Pos logit contributions
ously+0.087
der+0.086
anked+0.080
 words+0.077
itted+0.076
Neg logit contributions
 upon-0.058
shadow-0.058
nam-0.058
 forget-0.057
chard-0.057
Token her
Token ID607
Feature activation+2.39e+00

Pos logit contributions
der+0.101
ously+0.099
itted+0.095
aced+0.086
ield+0.085
Neg logit contributions
nam-0.045
 upon-0.044
 Merry-0.040
chard-0.039
 forget-0.037
Token an
Token ID281
Feature activation+3.34e+00

Pos logit contributions
ously+0.044
der+0.041
anked+0.041
itted+0.041
eared+0.040
Neg logit contributions
nam-0.056
chard-0.054
 upon-0.054
shadow-0.052
 forget-0.052
Token important
Token ID1593
Feature activation+5.31e+00

Pos logit contributions
ously+0.084
itted+0.075
der+0.074
anked+0.074
 words+0.068
Neg logit contributions
nam-0.062
chard-0.061
shadow-0.059
anut-0.054
 forget-0.053
Token lesson
Token ID11483
Feature activation+2.33e+00

Pos logit contributions
der+0.116
eared+0.115
anked+0.113
ously+0.113
itted+0.110
Neg logit contributions
shadow-0.092
 upon-0.090
 forget-0.088
 Merry-0.087
chard-0.085
Token about
Token ID546
Feature activation+2.69e-01

Pos logit contributions
ously+0.043
itted+0.041
anked+0.039
der+0.039
eared+0.037
Neg logit contributions
chard-0.054
 upon-0.053
nam-0.053
shadow-0.051
 Merry-0.051
Token trust
Token ID3774
Feature activation+5.92e-01

Pos logit contributions
ously+0.007
der+0.006
 words+0.006
itted+0.006
anked+0.006
Neg logit contributions
nam-0.006
chard-0.006
shadow-0.005
 Merry-0.005
 forget-0.005
Token.
Token ID13
Feature activation+8.81e-01

Pos logit contributions
ously+0.011
der+0.010
 words+0.010
itted+0.009
anked+0.009
Neg logit contributions
chard-0.015
nam-0.015
shadow-0.014
 upon-0.014
 Merry-0.014
Token From
Token ID3574
Feature activation-8.89e-01

Pos logit contributions
ously+0.018
 words+0.017
der+0.017
itted+0.015
anked+0.015
Neg logit contributions
chard-0.022
nam-0.022
shadow-0.020
 Merry-0.019
 upon-0.019
Token She
Token ID1375
Feature activation+8.32e-01

Pos logit contributions
ously+0.016
 words+0.014
der+0.014
itted+0.014
anked+0.013
Neg logit contributions
chard-0.017
nam-0.016
shadow-0.015
 Merry-0.015
 upon-0.015
Token said
Token ID531
Feature activation+1.84e+00

Pos logit contributions
ously+0.020
 words+0.019
der+0.019
itted+0.017
anked+0.017
Neg logit contributions
chard-0.017
nam-0.017
shadow-0.016
 Merry-0.015
 forget-0.015
Token it
Token ID340
Feature activation+1.86e+00

Pos logit contributions
ously+0.035
itted+0.032
 words+0.032
der+0.032
anked+0.031
Neg logit contributions
chard-0.046
nam-0.045
 Merry-0.042
shadow-0.041
 upon-0.040
Token was
Token ID373
Feature activation+1.91e+00

Pos logit contributions
ously+0.047
der+0.043
 words+0.042
itted+0.041
anked+0.041
Neg logit contributions
chard-0.036
nam-0.035
shadow-0.032
 Merry-0.032
 upon-0.031
Token a
Token ID257
Feature activation+1.96e+00

Pos logit contributions
 words+0.037
ously+0.035
der+0.034
itted+0.030
anked+0.029
Neg logit contributions
nam-0.049
chard-0.048
shadow-0.046
|-0.044
illas-0.044
Token stupid
Token ID8531
Feature activation+5.29e+00

Pos logit contributions
ously+0.044
der+0.041
 words+0.041
itted+0.038
anked+0.038
Neg logit contributions
chard-0.044
nam-0.044
shadow-0.041
 Merry-0.039
 forget-0.039
Token question
Token ID1808
Feature activation+2.14e+00

Pos logit contributions
ously+0.105
itted+0.100
anked+0.100
der+0.097
aced+0.093
Neg logit contributions
 forget-0.114
 Merry-0.109
 upon-0.107
 thirsty-0.105
 ther-0.101
Token.
Token ID13
Feature activation+6.88e-01

Pos logit contributions
ously+0.041
der+0.039
itted+0.038
anked+0.038
 words+0.037
Neg logit contributions
chard-0.049
nam-0.048
 upon-0.046
shadow-0.046
 Merry-0.045
Token\n
Token ID198
Feature activation-2.38e-02

Pos logit contributions
ously+0.016
 words+0.015
der+0.015
itted+0.014
anked+0.014
Neg logit contributions
chard-0.015
nam-0.015
shadow-0.014
 Merry-0.013
 upon-0.013
Token\n
Token ID198
Feature activation+5.29e-01

Pos logit contributions
nam+0.001
chard+0.001
shadow+0.000
 Merry+0.000
 forget+0.000
Neg logit contributions
ously-0.001
 words-0.001
der-0.001
itted-0.000
anked-0.000
TokenBut
Token ID1537
Feature activation-9.49e-01

Pos logit contributions
ously+0.013
 words+0.013
der+0.013
itted+0.012
anked+0.012
Neg logit contributions
chard-0.010
nam-0.010
shadow-0.010
 Merry-0.009
 forget-0.009
Token.
Token ID13
Feature activation+7.05e-01

Pos logit contributions
nam+0.002
chard+0.002
shadow+0.001
|+0.001
anut+0.001
Neg logit contributions
ously-0.001
 words-0.001
der-0.001
itted-0.001
anked-0.001
Token He
Token ID679
Feature activation+8.55e-01

Pos logit contributions
ously+0.015
 words+0.015
der+0.015
itted+0.013
anked+0.013
Neg logit contributions
chard-0.016
nam-0.016
shadow-0.015
 Merry-0.014
 upon-0.014
Token had
Token ID550
Feature activation+3.43e+00

Pos logit contributions
ously+0.021
 words+0.020
der+0.020
itted+0.018
anked+0.018
Neg logit contributions
chard-0.017
nam-0.017
shadow-0.016
 Merry-0.015
 forget-0.015
Token learned
Token ID4499
Feature activation+2.27e+00

Pos logit contributions
der+0.080
ously+0.080
 words+0.074
anked+0.074
itted+0.073
Neg logit contributions
chard-0.067
shadow-0.067
nam-0.066
 upon-0.064
 forget-0.064
Token an
Token ID281
Feature activation+3.05e+00

Pos logit contributions
ously+0.051
itted+0.048
der+0.046
 words+0.045
anked+0.045
Neg logit contributions
nam-0.046
chard-0.045
shadow-0.044
 upon-0.042
 Merry-0.041
Token important
Token ID1593
Feature activation+5.29e+00

Pos logit contributions
ously+0.077
itted+0.069
der+0.069
anked+0.067
 words+0.065
Neg logit contributions
shadow-0.056
nam-0.054
chard-0.053
 forget-0.051
 upon-0.049
Token lesson
Token ID11483
Feature activation+1.91e+00

Pos logit contributions
der+0.113
ously+0.112
eared+0.111
anked+0.109
itted+0.107
Neg logit contributions
shadow-0.102
 forget-0.096
|-0.093
 upon-0.092
 Merry-0.090
Token that
Token ID326
Feature activation+2.84e-01

Pos logit contributions
itted+0.034
ously+0.033
 words+0.031
anked+0.031
der+0.030
Neg logit contributions
shadow-0.045
chard-0.044
 upon-0.044
nam-0.043
 Merry-0.042
Token day
Token ID1110
Feature activation+1.06e+00

Pos logit contributions
ously+0.006
der+0.006
 words+0.006
itted+0.005
anked+0.005
Neg logit contributions
chard-0.007
nam-0.007
shadow-0.006
 Merry-0.006
 forget-0.006
Token -
Token ID532
Feature activation-3.33e-01

Pos logit contributions
ously+0.018
 words+0.017
der+0.017
itted+0.017
anked+0.016
Neg logit contributions
chard-0.027
nam-0.026
shadow-0.026
 Merry-0.025
 upon-0.025
Token that
Token ID326
Feature activation-2.87e-01

Pos logit contributions
nam+0.007
chard+0.007
shadow+0.007
 Merry+0.006
 forget+0.006
Neg logit contributions
ously-0.008
der-0.007
 words-0.007
itted-0.007
anked-0.007
Token home
Token ID1363
Feature activation-8.59e-01

Pos logit contributions
nam+0.025
chard+0.025
shadow+0.023
anut+0.023
illas+0.022
Neg logit contributions
ously-0.020
 words-0.019
der-0.017
itted-0.017
eared-0.016
Token,
Token ID11
Feature activation-1.53e+00

Pos logit contributions
nam+0.027
chard+0.026
shadow+0.025
illas+0.024
|+0.024
Neg logit contributions
ously-0.013
 words-0.011
der-0.011
anked-0.010
itted-0.009
Token she
Token ID673
Feature activation+1.60e-01

Pos logit contributions
chard+0.040
illas+0.040
|+0.039
anut+0.038
nam+0.038
Neg logit contributions
ously-0.029
 words-0.026
der-0.024
anked-0.021
 enough-0.021
Token had
Token ID550
Feature activation+2.36e+00

Pos logit contributions
ously+0.003
 words+0.003
der+0.003
itted+0.003
aced+0.003
Neg logit contributions
nam-0.004
chard-0.004
shadow-0.004
riage-0.004
|-0.004
Token a
Token ID257
Feature activation+2.32e+00

Pos logit contributions
ously+0.047
der+0.044
 words+0.044
itted+0.040
anked+0.039
Neg logit contributions
chard-0.059
nam-0.059
shadow-0.055
 Merry-0.052
 upon-0.052
Token funny
Token ID8258
Feature activation+5.28e+00

Pos logit contributions
ously+0.050
 words+0.048
der+0.048
itted+0.043
anked+0.042
Neg logit contributions
chard-0.054
nam-0.053
shadow-0.050
 Merry-0.047
 forget-0.046
Token hat
Token ID6877
Feature activation-8.81e-01

Pos logit contributions
der+0.132
ously+0.130
itted+0.117
anked+0.116
eared+0.112
Neg logit contributions
 thirsty-0.094
chard-0.092
 upon-0.091
 ther-0.091
nam-0.090
Token on
Token ID319
Feature activation-5.93e-01

Pos logit contributions
chard+0.022
nam+0.021
shadow+0.020
 unatt+0.019
 forget+0.019
Neg logit contributions
ously-0.017
 words-0.017
der-0.016
itted-0.015
anked-0.015
Token her
Token ID607
Feature activation-2.38e-01

Pos logit contributions
chard+0.012
nam+0.012
 Merry+0.010
shadow+0.010
 upon+0.010
Neg logit contributions
ously-0.015
der-0.014
 words-0.014
itted-0.014
eared-0.013
Token head
Token ID1182
Feature activation-3.58e-01

Pos logit contributions
chard+0.005
nam+0.005
shadow+0.005
 Merry+0.004
anut+0.004
Neg logit contributions
ously-0.006
 words-0.005
der-0.005
itted-0.005
anked-0.005
Token.
Token ID13
Feature activation+4.92e-01

Pos logit contributions
chard+0.009
nam+0.009
shadow+0.008
 upon+0.008
 unatt+0.008
Neg logit contributions
ously-0.007
 words-0.007
der-0.006
itted-0.006
anked-0.006
TokenThe
Token ID464
Feature activation-5.58e-01

Pos logit contributions
ously+0.005
 words+0.004
der+0.004
itted+0.004
anked+0.004
Neg logit contributions
nam-0.005
chard-0.005
 unatt-0.004
shadow-0.004
 Merry-0.004
Token other
Token ID584
Feature activation+1.57e+00

Pos logit contributions
chard+0.013
nam+0.013
shadow+0.012
 forget+0.012
 Merry+0.012
Neg logit contributions
ously-0.012
der-0.011
 words-0.011
itted-0.010
anked-0.010
Token friend
Token ID1545
Feature activation+2.19e+00

Pos logit contributions
ously+0.033
der+0.031
 words+0.029
itted+0.028
anked+0.028
Neg logit contributions
nam-0.037
chard-0.037
shadow-0.034
 upon-0.034
 forget-0.034
Token had
Token ID550
Feature activation+2.93e+00

Pos logit contributions
ously+0.052
 words+0.050
der+0.049
itted+0.046
anked+0.045
Neg logit contributions
chard-0.046
nam-0.046
shadow-0.042
 Merry-0.040
anut-0.039
Token a
Token ID257
Feature activation+2.73e+00

Pos logit contributions
ously+0.075
der+0.071
itted+0.064
 words+0.064
anked+0.064
Neg logit contributions
chard-0.058
nam-0.057
 thirsty-0.056
 forget-0.054
shadow-0.053
Token funny
Token ID8258
Feature activation+5.28e+00

Pos logit contributions
ously+0.068
der+0.068
 words+0.062
itted+0.061
anked+0.060
Neg logit contributions
chard-0.056
nam-0.055
 forget-0.052
 thirsty-0.051
shadow-0.050
Token hat
Token ID6877
Feature activation-7.45e-02

Pos logit contributions
der+0.116
ously+0.103
anked+0.091
eared+0.089
itted+0.088
Neg logit contributions
 thirsty-0.114
 forget-0.110
 upon-0.109
 ther-0.104
 cozy-0.104
Token and
Token ID290
Feature activation-1.10e+00

Pos logit contributions
chard+0.002
nam+0.002
 forget+0.002
 unatt+0.002
shadow+0.002
Neg logit contributions
ously-0.002
 words-0.002
der-0.001
anked-0.001
itted-0.001
Token the
Token ID262
Feature activation-1.70e-01

Pos logit contributions
nam+0.023
chard+0.023
shadow+0.020
 Merry+0.019
 thirsty+0.019
Neg logit contributions
ously-0.027
der-0.025
 words-0.025
itted-0.024
anked-0.023
Token two
Token ID734
Feature activation+1.75e+00

Pos logit contributions
chard+0.004
nam+0.004
shadow+0.004
 forget+0.003
 Merry+0.003
Neg logit contributions
ously-0.004
der-0.003
 words-0.003
itted-0.003
anked-0.003
Token ran
Token ID4966
Feature activation+1.77e+00

Pos logit contributions
der+0.041
ously+0.040
 words+0.039
itted+0.038
anked+0.037
Neg logit contributions
nam-0.033
chard-0.032
 forget-0.032
 upon-0.031
 thirsty-0.031
Token boats
Token ID16659
Feature activation-1.60e+00

Pos logit contributions
chard+0.009
nam+0.009
shadow+0.008
 Merry+0.008
anut+0.008
Neg logit contributions
ously-0.011
 words-0.011
der-0.010
itted-0.010
anked-0.009
Token together
Token ID1978
Feature activation+5.71e-01

Pos logit contributions
chard+0.030
nam+0.030
shadow+0.027
 upon+0.026
 unatt+0.025
Neg logit contributions
ously-0.042
der-0.040
 words-0.040
anked-0.037
itted-0.037
Token and
Token ID290
Feature activation-1.07e+00

Pos logit contributions
ously+0.014
der+0.013
anked+0.013
ield+0.013
itted+0.012
Neg logit contributions
chard-0.012
 upon-0.011
nam-0.011
 thirsty-0.010
 forget-0.010
Token had
Token ID550
Feature activation+3.35e+00

Pos logit contributions
shadow+0.023
chard+0.023
nam+0.022
 unatt+0.021
anut+0.021
Neg logit contributions
ously-0.024
 words-0.023
der-0.021
itted-0.020
eared-0.020
Token a
Token ID257
Feature activation+2.11e+00

Pos logit contributions
ously+0.089
der+0.085
anked+0.080
itted+0.077
 words+0.074
Neg logit contributions
nam-0.063
chard-0.062
 cozy-0.061
 thirsty-0.060
 Merry-0.060
Token funny
Token ID8258
Feature activation+5.27e+00

Pos logit contributions
ously+0.040
 words+0.039
der+0.036
itted+0.033
 skill+0.031
Neg logit contributions
chard-0.054
nam-0.053
shadow-0.050
 unatt-0.047
 Merry-0.047
Token and
Token ID290
Feature activation-2.24e+00

Pos logit contributions
der+0.130
ously+0.129
anked+0.117
eared+0.116
itted+0.115
Neg logit contributions
 forget-0.094
|-0.093
 upon-0.092
 thirsty-0.091
 cozy-0.091
Token fun
Token ID1257
Feature activation+3.07e+00

Pos logit contributions
chard+0.058
nam+0.057
shadow+0.056
anut+0.054
illas+0.053
Neg logit contributions
 words-0.040
ously-0.038
der-0.036
itted-0.034
 skill-0.032
Token time
Token ID640
Feature activation+3.15e-01

Pos logit contributions
ously+0.073
der+0.069
 words+0.068
itted+0.065
eared+0.063
Neg logit contributions
nam-0.061
chard-0.060
 forget-0.058
shadow-0.057
 forth-0.056
Token on
Token ID319
Feature activation+1.81e-01

Pos logit contributions
ously+0.006
 words+0.006
der+0.006
itted+0.005
anked+0.005
Neg logit contributions
nam-0.008
chard-0.008
shadow-0.008
 Merry-0.007
 upon-0.007
Token that
Token ID326
Feature activation-2.74e-01

Pos logit contributions
ously+0.005
 words+0.005
der+0.005
itted+0.004
anked+0.004
Neg logit contributions
nam-0.003
chard-0.003
shadow-0.003
 Merry-0.003
 forget-0.003
Token\n
Token ID198
Feature activation+2.29e-01

Pos logit contributions
nam+0.016
chard+0.016
shadow+0.015
 unatt+0.014
 forget+0.014
Neg logit contributions
ously-0.015
 words-0.014
der-0.013
itted-0.013
anked-0.012
TokenShe
Token ID3347
Feature activation+2.43e-01

Pos logit contributions
ously+0.005
 words+0.005
der+0.005
itted+0.004
anked+0.004
Neg logit contributions
chard-0.005
nam-0.005
shadow-0.005
 Merry-0.005
anut-0.005
Token soon
Token ID2582
Feature activation+1.88e+00

Pos logit contributions
ously+0.006
 words+0.006
der+0.006
itted+0.005
anked+0.005
Neg logit contributions
chard-0.005
nam-0.005
shadow-0.005
 Merry-0.004
anut-0.004
Token had
Token ID550
Feature activation+3.41e+00

Pos logit contributions
ously+0.051
 words+0.050
der+0.049
itted+0.046
anked+0.045
Neg logit contributions
chard-0.033
nam-0.033
shadow-0.030
 Merry-0.028
 forget-0.027
Token a
Token ID257
Feature activation+2.40e+00

Pos logit contributions
ously+0.061
der+0.059
 words+0.056
anked+0.054
itted+0.053
Neg logit contributions
chard-0.089
nam-0.088
 upon-0.084
shadow-0.083
 thirsty-0.082
Token clever
Token ID14169
Feature activation+5.27e+00

Pos logit contributions
ously+0.052
 words+0.051
der+0.049
itted+0.045
anked+0.044
Neg logit contributions
nam-0.055
chard-0.055
shadow-0.051
 Merry-0.048
anut-0.048
Token plan
Token ID1410
Feature activation+3.02e+00

Pos logit contributions
der+0.114
ously+0.109
itted+0.102
anked+0.101
eared+0.097
Neg logit contributions
 forget-0.112
chard-0.111
 thirsty-0.109
 upon-0.108
nam-0.107
Token -
Token ID532
Feature activation-8.56e-02

Pos logit contributions
der+0.062
itted+0.059
ously+0.057
 words+0.056
anked+0.055
Neg logit contributions
 upon-0.065
chard-0.064
shadow-0.063
nam-0.062
 unatt-0.061
Token she
Token ID673
Feature activation+4.76e-01

Pos logit contributions
chard+0.002
nam+0.002
shadow+0.002
 Merry+0.002
anut+0.002
Neg logit contributions
ously-0.002
 words-0.002
der-0.002
itted-0.002
anked-0.002
Token would
Token ID561
Feature activation+2.40e+00

Pos logit contributions
ously+0.011
 words+0.011
der+0.010
itted+0.010
eared+0.009
Neg logit contributions
nam-0.011
chard-0.010
shadow-0.009
 Merry-0.009
 unatt-0.009
Token find
Token ID1064
Feature activation+1.27e+00

Pos logit contributions
ously+0.065
 words+0.063
der+0.063
itted+0.059
anked+0.057
Neg logit contributions
chard-0.042
nam-0.041
shadow-0.038
 forget-0.036
 Merry-0.035
Token "
Token ID366
Feature activation+3.91e-01

Pos logit contributions
nam+0.099
chard+0.097
|+0.095
shadow+0.092
 forget+0.092
Neg logit contributions
ously-0.046
 words-0.044
der-0.044
itted-0.035
anked-0.033
TokenWhat
Token ID2061
Feature activation+4.17e-01

Pos logit contributions
ously+0.010
 words+0.010
der+0.010
eared+0.009
itted+0.009
Neg logit contributions
nam-0.007
chard-0.007
 thirsty-0.006
shadow-0.006
 forget-0.006
Token's
Token ID338
Feature activation-9.59e-01

Pos logit contributions
ously+0.010
der+0.010
 words+0.009
itted+0.009
anked+0.009
Neg logit contributions
nam-0.009
chard-0.009
shadow-0.008
 Merry-0.008
 upon-0.007
Token inside
Token ID2641
Feature activation-5.76e+00

Pos logit contributions
shadow+0.024
chard+0.024
|+0.024
 forget+0.024
nam+0.023
Neg logit contributions
 words-0.017
der-0.015
ously-0.014
anked-0.014
ield-0.014
Token the
Token ID262
Feature activation-1.04e+00

Pos logit contributions
nam+0.132
|+0.129
chard+0.124
shadow+0.123
anut+0.114
Neg logit contributions
 words-0.126
 enough-0.110
der-0.105
ield-0.105
 to-0.100
Token boxes
Token ID10559
Feature activation-7.66e+00

Pos logit contributions
chard+0.022
nam+0.022
shadow+0.020
 Merry+0.019
 upon+0.019
Neg logit contributions
ously-0.025
der-0.024
 words-0.023
itted-0.022
anked-0.021
Token?"
Token ID1701
Feature activation+8.93e-01

Pos logit contributions
nam+0.117
shadow+0.103
chard+0.103
anut+0.101
 Merry+0.098
Neg logit contributions
 words-0.211
der-0.210
ously-0.209
ield-0.200
anked-0.196
Token The
Token ID383
Feature activation-7.49e-01

Pos logit contributions
ously+0.019
 words+0.018
der+0.018
itted+0.016
anked+0.016
Neg logit contributions
nam-0.021
chard-0.021
shadow-0.020
anut-0.019
 unatt-0.019
Token driver
Token ID4639
Feature activation-1.43e-01

Pos logit contributions
chard+0.014
nam+0.014
shadow+0.012
 upon+0.012
 Merry+0.012
Neg logit contributions
ously-0.020
der-0.019
 words-0.019
itted-0.018
anked-0.018
Token answered
Token ID9373
Feature activation+1.27e+00

Pos logit contributions
chard+0.003
nam+0.003
shadow+0.003
 Merry+0.003
anut+0.003
Neg logit contributions
ously-0.003
 words-0.003
der-0.003
itted-0.003
anked-0.003
Token,
Token ID11
Feature activation-2.74e+00

Pos logit contributions
ously+0.023
 words+0.021
der+0.021
itted+0.021
anked+0.019
Neg logit contributions
chard-0.034
nam-0.034
shadow-0.031
 Merry-0.030
 upon-0.030
Token see
Token ID766
Feature activation-2.65e+00

Pos logit contributions
 words+0.002
ously+0.002
der+0.002
itted+0.002
anked+0.002
Neg logit contributions
nam-0.002
chard-0.002
shadow-0.002
anut-0.002
 unatt-0.002
Token what
Token ID644
Feature activation-1.08e+00

Pos logit contributions
shadow+0.063
|+0.062
chard+0.062
nam+0.060
har+0.059
Neg logit contributions
 words-0.056
der-0.048
 enough-0.046
ously-0.043
 news-0.042
Token's
Token ID338
Feature activation-2.45e+00

Pos logit contributions
chard+0.022
shadow+0.021
nam+0.021
anut+0.020
 unatt+0.018
Neg logit contributions
 words-0.028
ously-0.025
der-0.025
ield-0.022
itted-0.022
Token in
Token ID287
Feature activation-4.75e+00

Pos logit contributions
shadow+0.057
|+0.055
nam+0.054
cel+0.053
chard+0.053
Neg logit contributions
 words-0.052
der-0.045
 enough-0.043
 news-0.042
itted-0.040
Token the
Token ID262
Feature activation-2.05e+00

Pos logit contributions
shadow+0.105
nam+0.103
chard+0.101
anut+0.099
cel+0.096
Neg logit contributions
 words-0.103
 enough-0.089
der-0.087
ously-0.086
 example-0.084
Token boxes
Token ID10559
Feature activation-7.41e+00

Pos logit contributions
shadow+0.054
chard+0.053
nam+0.052
anut+0.048
|+0.047
Neg logit contributions
 words-0.043
ously-0.035
der-0.035
 word-0.032
 truth-0.032
Token?"
Token ID1701
Feature activation-1.10e+00

Pos logit contributions
nam+0.110
shadow+0.102
chard+0.095
anut+0.091
 Merry+0.090
Neg logit contributions
 words-0.215
der-0.212
ously-0.202
ield-0.191
 enough-0.190
Token\n
Token ID198
Feature activation-1.56e+00

Pos logit contributions
nam+0.032
chard+0.031
shadow+0.029
 unatt+0.028
illas+0.028
Neg logit contributions
 words-0.019
ously-0.018
der-0.018
itted-0.015
anked-0.014
Token\n
Token ID198
Feature activation-1.00e+00

Pos logit contributions
nam+0.047
chard+0.046
shadow+0.041
 unatt+0.041
illas+0.040
Neg logit contributions
 words-0.026
ously-0.025
der-0.024
itted-0.020
anked-0.020
Token"
Token ID1
Feature activation+7.95e-01

Pos logit contributions
nam+0.024
chard+0.024
shadow+0.022
|+0.021
 Merry+0.021
Neg logit contributions
ously-0.021
 words-0.021
der-0.020
eared-0.018
itted-0.018
TokenYes
Token ID5297
Feature activation-1.74e+00

Pos logit contributions
ously+0.023
der+0.021
aced+0.021
 words+0.020
anked+0.020
Neg logit contributions
nam-0.015
chard-0.014
 Merry-0.012
shadow-0.012
 forget-0.011
TokenCan
Token ID6090
Feature activation+4.91e-02

Pos logit contributions
 words+0.046
ously+0.046
der+0.044
eared+0.041
itted+0.041
Neg logit contributions
chard-0.037
nam-0.037
shadow-0.035
anut-0.032
 thirsty-0.032
Token you
Token ID345
Feature activation-6.52e-01

Pos logit contributions
 words+0.001
der+0.001
ield+0.001
 advice+0.001
ously+0.001
Neg logit contributions
 afloat-0.001
anut-0.001
shadow-0.001
chard-0.001
 roasted-0.001
Token please
Token ID3387
Feature activation-2.10e+00

Pos logit contributions
chard+0.014
nam+0.014
shadow+0.013
 Merry+0.012
anut+0.012
Neg logit contributions
ously-0.015
 words-0.015
der-0.014
itted-0.013
anked-0.013
Token put
Token ID1234
Feature activation-2.93e+00

Pos logit contributions
nam+0.050
chard+0.049
shadow+0.045
 Merry+0.043
illas+0.042
Neg logit contributions
 words-0.044
ously-0.039
der-0.038
anked-0.037
ield-0.036
Token the
Token ID262
Feature activation-4.89e-01

Pos logit contributions
chard+0.069
shadow+0.067
 forget+0.066
nam+0.066
 unatt+0.061
Neg logit contributions
 words-0.061
der-0.053
ously-0.051
eared-0.049
 enough-0.049
Token dolls
Token ID36062
Feature activation-7.39e+00

Pos logit contributions
chard+0.011
shadow+0.011
nam+0.010
anut+0.010
 unatt+0.009
Neg logit contributions
 words-0.012
der-0.010
ously-0.009
itted-0.009
eared-0.009
Token back
Token ID736
Feature activation-5.05e+00

Pos logit contributions
 Merry+0.179
nam+0.174
chard+0.172
cel+0.170
shadow+0.168
Neg logit contributions
 words-0.125
 enough-0.115
eared-0.110
der-0.108
 to-0.107
Token in
Token ID287
Feature activation-3.70e+00

Pos logit contributions
nam+0.121
chard+0.118
 Merry+0.113
shadow+0.112
 forget+0.112
Neg logit contributions
 words-0.095
der-0.092
ously-0.090
itted-0.083
eared-0.082
Token the
Token ID262
Feature activation-2.10e+00

Pos logit contributions
chard+0.074
nam+0.070
shadow+0.068
 Merry+0.066
anut+0.065
Neg logit contributions
 words-0.090
ously-0.084
der-0.081
anked-0.076
itted-0.076
Token box
Token ID3091
Feature activation-5.17e+00

Pos logit contributions
chard+0.058
nam+0.057
shadow+0.056
anut+0.051
illas+0.051
Neg logit contributions
 words-0.037
ously-0.035
der-0.033
itted-0.030
eared-0.029
Token when
Token ID618
Feature activation-4.20e+00

Pos logit contributions
nam+0.075
chard+0.070
shadow+0.065
 Merry+0.063
anut+0.060
Neg logit contributions
ously-0.153
 words-0.152
der-0.151
itted-0.138
anked-0.137
Token "
Token ID366
Feature activation+4.27e-01

Pos logit contributions
nam+0.094
chard+0.092
|+0.090
shadow+0.088
anut+0.086
Neg logit contributions
ously-0.048
der-0.044
 words-0.044
itted-0.036
anked-0.035
TokenCan
Token ID6090
Feature activation-1.57e-01

Pos logit contributions
ously+0.011
 words+0.011
der+0.010
eared+0.010
itted+0.009
Neg logit contributions
nam-0.008
chard-0.007
shadow-0.007
|-0.007
 thirsty-0.007
Token I
Token ID314
Feature activation-7.25e-01

Pos logit contributions
anut+0.004
nam+0.004
|+0.003
eling+0.003
chard+0.003
Neg logit contributions
 words-0.003
der-0.003
 advice-0.003
ield-0.003
 to-0.003
Token touch
Token ID3638
Feature activation-2.03e+00

Pos logit contributions
nam+0.019
chard+0.018
shadow+0.017
anut+0.016
 unatt+0.016
Neg logit contributions
 words-0.014
ously-0.013
ield-0.012
der-0.012
 word-0.011
Token the
Token ID262
Feature activation-4.03e-01

Pos logit contributions
chard+0.048
shadow+0.046
nam+0.046
 forget+0.045
|+0.045
Neg logit contributions
 words-0.044
der-0.040
ously-0.036
 enough-0.035
anked-0.033
Token books
Token ID3835
Feature activation-7.27e+00

Pos logit contributions
chard+0.010
shadow+0.010
nam+0.010
anut+0.009
|+0.009
Neg logit contributions
 words-0.009
der-0.007
ously-0.007
itted-0.006
ield-0.006
Tokenhe
Token ID258
Feature activation-1.67e+00

Pos logit contributions
nam+0.149
chard+0.138
shadow+0.135
anut+0.135
 Merry+0.127
Neg logit contributions
der-0.168
ously-0.165
 words-0.164
anked-0.148
ield-0.145
Tokenlf
Token ID1652
Feature activation-5.23e+00

Pos logit contributions
nam+0.033
chard+0.033
shadow+0.031
anut+0.028
 thirsty+0.028
Neg logit contributions
 words-0.044
ously-0.041
der-0.039
 skills-0.035
anked-0.035
Token too
Token ID1165
Feature activation-3.17e+00

Pos logit contributions
nam+0.107
chard+0.100
shadow+0.099
 Merry+0.095
anut+0.095
Neg logit contributions
der-0.118
ously-0.117
 words-0.115
 enough-0.105
anked-0.100
Token?"
Token ID1701
Feature activation+3.11e-01

Pos logit contributions
nam+0.027
chard+0.026
shadow+0.021
 Merry+0.018
anut+0.017
Neg logit contributions
ously-0.115
 words-0.113
der-0.112
itted-0.106
anked-0.105
Token His
Token ID2399
Feature activation-1.11e+00

Pos logit contributions
ously+0.005
 words+0.005
der+0.005
itted+0.005
anked+0.004
Neg logit contributions
chard-0.008
nam-0.008
shadow-0.008
 Merry-0.008
 upon-0.008
Token,
Token ID11
Feature activation-2.62e+00

Pos logit contributions
nam+0.014
chard+0.013
shadow+0.012
 Merry+0.012
anut+0.011
Neg logit contributions
ously-0.011
 words-0.010
der-0.010
itted-0.009
anked-0.009
Token why
Token ID1521
Feature activation-1.66e+00

Pos logit contributions
chard+0.058
nam+0.057
shadow+0.056
aj+0.048
anut+0.048
Neg logit contributions
 words-0.061
ously-0.059
der-0.057
ield-0.050
 enough-0.048
Token did
Token ID750
Feature activation+7.27e-01

Pos logit contributions
chard+0.042
shadow+0.041
nam+0.040
anut+0.036
 unatt+0.036
Neg logit contributions
 words-0.035
ously-0.032
der-0.031
ield-0.027
anked-0.026
Token you
Token ID345
Feature activation-7.60e-01

Pos logit contributions
 words+0.018
der+0.016
ously+0.015
ield+0.015
 enough+0.014
Neg logit contributions
chard-0.015
shadow-0.014
nam-0.014
 unatt-0.013
aj-0.013
Token bury
Token ID30033
Feature activation-2.88e+00

Pos logit contributions
nam+0.017
chard+0.016
shadow+0.015
 Merry+0.014
illas+0.014
Neg logit contributions
 words-0.017
der-0.016
ously-0.016
anked-0.015
ield-0.014
Token carrots
Token ID34397
Feature activation-7.21e+00

Pos logit contributions
shadow+0.065
nam+0.065
chard+0.063
 forget+0.060
|+0.058
Neg logit contributions
 words-0.064
der-0.060
 enough-0.056
anked-0.053
ously-0.052
Token under
Token ID739
Feature activation-3.13e+00

Pos logit contributions
nam+0.166
shadow+0.150
chard+0.146
 Merry+0.143
cel+0.137
Neg logit contributions
der-0.161
 words-0.148
ously-0.147
 enough-0.138
anked-0.130
Token my
Token ID616
Feature activation-1.16e+00

Pos logit contributions
chard+0.054
nam+0.054
shadow+0.052
 forget+0.044
anut+0.044
Neg logit contributions
 words-0.085
ously-0.083
der-0.082
anked-0.075
itted-0.074
Token blanket
Token ID18447
Feature activation-4.39e+00

Pos logit contributions
nam+0.024
chard+0.024
shadow+0.024
 Merry+0.021
|+0.021
Neg logit contributions
ously-0.027
 words-0.027
der-0.026
itted-0.023
anked-0.023
Token?
Token ID30
Feature activation-9.24e-01

Pos logit contributions
nam+0.090
chard+0.087
shadow+0.082
 Merry+0.075
anut+0.074
Neg logit contributions
ously-0.108
der-0.105
 words-0.105
itted-0.094
anked-0.093
Token I
Token ID314
Feature activation-9.55e-01

Pos logit contributions
nam+0.022
chard+0.022
shadow+0.021
 unatt+0.021
 forth+0.020
Neg logit contributions
der-0.019
 words-0.019
ously-0.017
itted-0.017
ield-0.016
Token I
Token ID314
Feature activation-6.07e-01

Pos logit contributions
 words+0.001
ield+0.001
 advice+0.001
der+0.001
 to+0.001
Neg logit contributions
anut-0.001
nam-0.001
aj-0.001
chard-0.001
|-0.001
Token help
Token ID1037
Feature activation-1.78e+00

Pos logit contributions
nam+0.017
chard+0.016
shadow+0.015
anut+0.015
 unatt+0.014
Neg logit contributions
 words-0.011
ously-0.010
der-0.009
ield-0.009
anked-0.008
Token you
Token ID345
Feature activation-1.57e+00

Pos logit contributions
nam+0.035
chard+0.035
|+0.033
shadow+0.032
aj+0.030
Neg logit contributions
 words-0.044
der-0.039
ously-0.039
anked-0.038
ield-0.037
Token balance
Token ID5236
Feature activation-3.23e+00

Pos logit contributions
nam+0.031
chard+0.031
shadow+0.027
 Merry+0.027
|+0.026
Neg logit contributions
ously-0.039
 words-0.038
der-0.038
anked-0.034
itted-0.034
Token those
Token ID883
Feature activation-1.58e+00

Pos logit contributions
|+0.078
chard+0.075
nam+0.072
 Merry+0.070
 forget+0.070
Neg logit contributions
 words-0.071
der-0.064
 enough-0.058
anked-0.058
 skills-0.057
Token newspapers
Token ID14741
Feature activation-7.07e+00

Pos logit contributions
chard+0.043
nam+0.043
shadow+0.041
 Merry+0.039
anut+0.038
Neg logit contributions
ously-0.028
 words-0.026
der-0.026
itted-0.023
anked-0.022
Token?"
Token ID1701
Feature activation+3.11e-01

Pos logit contributions
nam+0.112
chard+0.106
shadow+0.099
 Merry+0.098
anut+0.094
Neg logit contributions
der-0.194
 words-0.192
ously-0.188
anked-0.177
itted-0.175
Token The
Token ID383
Feature activation-1.27e+00

Pos logit contributions
 words+0.006
ously+0.006
der+0.005
anked+0.005
itted+0.005
Neg logit contributions
nam-0.008
chard-0.008
shadow-0.007
anut-0.007
 unatt-0.007
Token man
Token ID582
Feature activation+2.43e-01

Pos logit contributions
chard+0.026
nam+0.024
 forget+0.022
shadow+0.022
 thirsty+0.021
Neg logit contributions
ously-0.032
der-0.032
itted-0.029
anked-0.028
aced-0.028
Token was
Token ID373
Feature activation-9.68e-01

Pos logit contributions
ously+0.005
 words+0.005
der+0.005
itted+0.005
anked+0.005
Neg logit contributions
chard-0.005
nam-0.005
shadow-0.005
 Merry-0.005
anut-0.005
Token grateful
Token ID14066
Feature activation+1.12e+00

Pos logit contributions
chard+0.020
nam+0.020
shadow+0.018
 Merry+0.017
 forget+0.017
Neg logit contributions
ously-0.023
 words-0.023
der-0.022
itted-0.021
anked-0.020
TokenCan
Token ID6090
Feature activation+2.12e-01

Pos logit contributions
ously+0.003
 words+0.003
der+0.003
itted+0.003
eared+0.003
Neg logit contributions
nam-0.002
chard-0.002
shadow-0.002
 thirsty-0.002
|-0.002
Token we
Token ID356
Feature activation-6.22e-01

Pos logit contributions
 words+0.005
ield+0.004
der+0.004
 advice+0.004
 to+0.004
Neg logit contributions
anut-0.004
shadow-0.004
 afloat-0.004
chard-0.004
nam-0.004
Token see
Token ID766
Feature activation-2.17e+00

Pos logit contributions
nam+0.015
chard+0.015
shadow+0.014
 unatt+0.013
anut+0.013
Neg logit contributions
 words-0.012
ously-0.011
ield-0.011
der-0.011
anked-0.010
Token the
Token ID262
Feature activation-4.75e-01

Pos logit contributions
|+0.053
chard+0.052
nam+0.048
shadow+0.047
 forget+0.046
Neg logit contributions
 words-0.047
 enough-0.042
der-0.039
ously-0.035
 news-0.035
Token big
Token ID1263
Feature activation+3.01e-01

Pos logit contributions
chard+0.011
nam+0.010
shadow+0.010
 Merry+0.009
anut+0.009
Neg logit contributions
 words-0.011
ously-0.011
der-0.010
itted-0.009
anked-0.009
Token planes
Token ID13016
Feature activation-7.01e+00

Pos logit contributions
 words+0.006
ously+0.006
der+0.006
 enough+0.005
itted+0.005
Neg logit contributions
chard-0.008
nam-0.007
shadow-0.007
anut-0.007
|-0.007
Token at
Token ID379
Feature activation-3.80e+00

Pos logit contributions
nam+0.151
chard+0.136
anut+0.132
 Merry+0.131
shadow+0.124
Neg logit contributions
 words-0.153
der-0.149
 enough-0.141
ously-0.140
 idea-0.135
Token the
Token ID262
Feature activation-1.94e+00

Pos logit contributions
nam+0.074
chard+0.074
shadow+0.068
 Merry+0.063
anut+0.063
Neg logit contributions
ously-0.094
 words-0.093
der-0.090
itted-0.083
anked-0.083
Token airport
Token ID9003
Feature activation-4.68e+00

Pos logit contributions
chard+0.039
nam+0.039
shadow+0.035
 Merry+0.034
 forget+0.033
Neg logit contributions
ously-0.049
 words-0.046
der-0.046
itted-0.042
anked-0.042
Token?"
Token ID1701
Feature activation+1.89e-01

Pos logit contributions
nam+0.084
chard+0.072
anut+0.067
 Merry+0.065
shadow+0.063
Neg logit contributions
 words-0.126
der-0.121
 enough-0.118
ously-0.116
eared-0.111
Token Her
Token ID2332
Feature activation-1.33e+00

Pos logit contributions
ously+0.004
der+0.004
 words+0.004
itted+0.003
eared+0.003
Neg logit contributions
nam-0.005
chard-0.005
 Merry-0.004
shadow-0.004
 upon-0.004
Token Why
Token ID4162
Feature activation-1.09e+00

Pos logit contributions
nam+0.026
chard+0.026
shadow+0.025
anut+0.024
 forget+0.024
Neg logit contributions
 words-0.023
der-0.023
ously-0.023
itted-0.022
ield-0.021
Token did
Token ID750
Feature activation+1.30e-01

Pos logit contributions
chard+0.029
shadow+0.029
nam+0.028
 unatt+0.026
anut+0.025
Neg logit contributions
 words-0.022
ously-0.019
der-0.019
ield-0.017
anked-0.016
Token you
Token ID345
Feature activation-1.46e+00

Pos logit contributions
 words+0.004
der+0.003
ously+0.003
ield+0.003
anked+0.003
Neg logit contributions
chard-0.002
nam-0.002
shadow-0.002
 unatt-0.002
|-0.002
Token take
Token ID1011
Feature activation-1.94e-01

Pos logit contributions
nam+0.028
chard+0.028
shadow+0.026
 Merry+0.025
 unatt+0.024
Neg logit contributions
 words-0.035
ously-0.035
der-0.033
anked-0.031
itted-0.031
Token my
Token ID616
Feature activation-1.51e-03

Pos logit contributions
chard+0.004
nam+0.004
shadow+0.003
 Merry+0.003
 forget+0.003
Neg logit contributions
ously-0.005
 words-0.005
der-0.005
itted-0.004
anked-0.004
Token books
Token ID3835
Feature activation-7.00e+00

Pos logit contributions
shadow+0.000
chard+0.000
|+0.000
nam+0.000
 Merry+0.000
Neg logit contributions
 words-0.000
ously-0.000
itted-0.000
der-0.000
 advice-0.000
Token?"
Token ID1701
Feature activation-2.68e+00

Pos logit contributions
nam+0.138
chard+0.135
shadow+0.125
 Merry+0.123
anut+0.119
Neg logit contributions
ously-0.170
 words-0.167
der-0.166
anked-0.152
itted-0.150
Token he
Token ID339
Feature activation-2.40e+00

Pos logit contributions
shadow+0.059
nam+0.058
chard+0.057
anut+0.057
illas+0.056
Neg logit contributions
 words-0.060
der-0.056
ously-0.055
anked-0.049
itted-0.048
Token asked
Token ID1965
Feature activation-7.20e-01

Pos logit contributions
nam+0.046
chard+0.045
shadow+0.041
 thirsty+0.037
 forget+0.037
Neg logit contributions
ously-0.064
der-0.060
 words-0.059
itted-0.057
anked-0.055
Token.
Token ID13
Feature activation-1.63e+00

Pos logit contributions
chard+0.019
nam+0.019
shadow+0.018
 Merry+0.017
 upon+0.017
Neg logit contributions
ously-0.013
 words-0.012
der-0.012
itted-0.011
anked-0.011
Token\n
Token ID198
Feature activation-1.52e+00

Pos logit contributions
chard+0.033
nam+0.032
shadow+0.029
 Merry+0.029
 upon+0.028
Neg logit contributions
ously-0.040
 words-0.038
der-0.038
itted-0.035
anked-0.035
Token,
Token ID11
Feature activation-2.11e+00

Pos logit contributions
ously+0.011
der+0.011
 words+0.010
itted+0.009
anked+0.009
Neg logit contributions
chard-0.017
nam-0.017
shadow-0.016
 forget-0.016
 Merry-0.015
Token can
Token ID460
Feature activation+6.58e-01

Pos logit contributions
shadow+0.049
|+0.048
chard+0.048
nam+0.046
anut+0.045
Neg logit contributions
 words-0.045
der-0.040
ously-0.039
 enough-0.038
ield-0.035
Token we
Token ID356
Feature activation-1.10e+00

Pos logit contributions
 words+0.013
ield+0.011
 advice+0.010
der+0.010
anked+0.010
Neg logit contributions
shadow-0.015
aj-0.015
anut-0.015
|-0.015
 unatt-0.014
Token organize
Token ID16481
Feature activation-2.70e+00

Pos logit contributions
shadow+0.024
nam+0.024
chard+0.023
aj+0.021
illas+0.021
Neg logit contributions
 words-0.023
ously-0.022
der-0.021
ield-0.020
anked-0.020
Token the
Token ID262
Feature activation-6.79e-01

Pos logit contributions
shadow+0.064
chard+0.062
 forget+0.060
|+0.060
nam+0.059
Neg logit contributions
 words-0.055
der-0.049
 enough-0.045
 ideas-0.043
anked-0.043
Token vegetables
Token ID13701
Feature activation-6.96e+00

Pos logit contributions
chard+0.016
nam+0.016
shadow+0.016
|+0.015
 Merry+0.014
Neg logit contributions
 words-0.014
ously-0.014
der-0.013
itted-0.012
anked-0.012
Token?"
Token ID1701
Feature activation+3.54e-01

Pos logit contributions
nam+0.143
 Merry+0.133
shadow+0.132
cel+0.128
|+0.128
Neg logit contributions
der-0.150
 enough-0.145
 words-0.143
ously-0.137
 to-0.134
Token
Token ID220
Feature activation-1.76e+00

Pos logit contributions
ously+0.008
der+0.007
 words+0.007
itted+0.006
anked+0.006
Neg logit contributions
chard-0.008
nam-0.008
shadow-0.008
 Merry-0.008
 upon-0.007
Token\n
Token ID198
Feature activation-4.64e-01

Pos logit contributions
nam+0.050
chard+0.050
shadow+0.045
 unatt+0.045
 forget+0.044
Neg logit contributions
ously-0.030
 words-0.029
der-0.028
itted-0.026
anked-0.025
Token\n
Token ID198
Feature activation+2.71e-01

Pos logit contributions
nam+0.013
chard+0.013
 unatt+0.012
shadow+0.012
illas+0.012
Neg logit contributions
ously-0.008
 words-0.008
itted-0.007
der-0.007
anked-0.006
TokenHer
Token ID9360
Feature activation-3.17e-01

Pos logit contributions
ously+0.006
 words+0.006
der+0.006
itted+0.005
anked+0.005
Neg logit contributions
chard-0.006
nam-0.006
shadow-0.006
 Merry-0.005
anut-0.005
Token the
Token ID262
Feature activation-1.21e+00

Pos logit contributions
chard+0.041
nam+0.039
shadow+0.037
anut+0.035
|+0.034
Neg logit contributions
 words-0.052
ously-0.052
der-0.050
anked-0.047
 enough-0.045
Token loop
Token ID9052
Feature activation-3.29e+00

Pos logit contributions
nam+0.030
chard+0.030
shadow+0.028
 Merry+0.026
 forget+0.026
Neg logit contributions
ously-0.025
 words-0.023
der-0.023
itted-0.021
anked-0.020
Token and
Token ID290
Feature activation-4.87e+00

Pos logit contributions
nam+0.059
chard+0.059
shadow+0.054
 Merry+0.050
anut+0.049
Neg logit contributions
ously-0.088
 words-0.085
der-0.085
itted-0.078
anked-0.077
Token put
Token ID1234
Feature activation-3.12e+00

Pos logit contributions
anut+0.135
cel+0.134
icer+0.133
illas+0.125
shadow+0.123
Neg logit contributions
 words-0.072
 to-0.068
 advice-0.059
 string-0.057
 each-0.056
Token the
Token ID262
Feature activation-1.12e+00

Pos logit contributions
chard+0.088
nam+0.081
shadow+0.081
 forget+0.079
|+0.078
Neg logit contributions
 words-0.056
der-0.046
 enough-0.045
ously-0.043
anked-0.043
Token trains
Token ID13404
Feature activation-6.95e+00

Pos logit contributions
chard+0.028
nam+0.027
shadow+0.026
 Merry+0.025
anut+0.024
Neg logit contributions
 words-0.023
ously-0.022
der-0.021
itted-0.020
anked-0.019
Token away
Token ID1497
Feature activation-4.14e+00

Pos logit contributions
nam+0.163
chard+0.161
 Merry+0.159
anut+0.154
cel+0.154
Neg logit contributions
 words-0.132
der-0.130
ously-0.130
itted-0.115
 enough-0.114
Token?"
Token ID1701
Feature activation-1.78e+00

Pos logit contributions
nam+0.069
chard+0.066
shadow+0.064
anut+0.063
 forget+0.061
Neg logit contributions
der-0.116
 words-0.112
ously-0.103
fully-0.100
 enough-0.099
Token she
Token ID673
Feature activation-2.86e+00

Pos logit contributions
nam+0.038
chard+0.038
shadow+0.036
illas+0.034
 unatt+0.034
Neg logit contributions
ously-0.042
 words-0.042
der-0.040
itted-0.037
anked-0.037
Token asks
Token ID7893
Feature activation-1.34e+00

Pos logit contributions
chard+0.059
nam+0.059
shadow+0.055
 Merry+0.051
anut+0.050
Neg logit contributions
ously-0.069
 words-0.067
der-0.066
itted-0.061
anked-0.059
Token.
Token ID13
Feature activation-1.60e+00

Pos logit contributions
chard+0.035
nam+0.035
shadow+0.031
 Merry+0.031
 upon+0.031
Neg logit contributions
ously-0.025
 words-0.024
der-0.024
itted-0.022
anked-0.021
Token store
Token ID3650
Feature activation-3.36e+00

Pos logit contributions
nam+0.076
chard+0.075
shadow+0.072
anut+0.066
 Merry+0.066
Neg logit contributions
ously-0.072
 words-0.070
der-0.070
itted-0.061
anked-0.060
Token and
Token ID290
Feature activation-4.41e+00

Pos logit contributions
nam+0.089
chard+0.083
shadow+0.081
anut+0.079
 Merry+0.079
Neg logit contributions
 words-0.061
der-0.057
ously-0.055
 enough-0.051
eared-0.050
Token see
Token ID766
Feature activation-3.05e+00

Pos logit contributions
icer+0.110
anut+0.101
cel+0.101
shadow+0.100
eling+0.100
Neg logit contributions
 words-0.084
 to-0.077
 advice-0.070
 sale-0.066
 that-0.065
Token the
Token ID262
Feature activation-1.70e+00

Pos logit contributions
|+0.076
chard+0.070
shadow+0.068
 forget+0.067
har+0.066
Neg logit contributions
 words-0.060
der-0.058
 enough-0.053
ously-0.046
 each-0.045
Token other
Token ID584
Feature activation-1.78e+00

Pos logit contributions
shadow+0.044
chard+0.043
nam+0.042
|+0.039
anut+0.039
Neg logit contributions
 words-0.034
der-0.031
ously-0.029
itted-0.025
 skill-0.025
Token horses
Token ID14260
Feature activation-6.93e+00

Pos logit contributions
chard+0.055
shadow+0.055
nam+0.052
anut+0.049
har+0.048
Neg logit contributions
 words-0.030
der-0.023
 enough-0.021
 news-0.020
 times-0.020
Token?"
Token ID1701
Feature activation-1.03e+00

Pos logit contributions
nam+0.155
shadow+0.137
 Merry+0.136
chard+0.134
anut+0.134
Neg logit contributions
der-0.141
 words-0.140
 enough-0.127
ously-0.127
 to-0.122
Token\n
Token ID198
Feature activation-2.03e+00

Pos logit contributions
chard+0.023
nam+0.023
shadow+0.022
 Merry+0.020
anut+0.020
Neg logit contributions
ously-0.023
 words-0.022
der-0.022
itted-0.020
anked-0.019
Token\n
Token ID198
Feature activation-1.40e+00

Pos logit contributions
nam+0.053
chard+0.052
shadow+0.048
 unatt+0.046
 forget+0.046
Neg logit contributions
ously-0.039
 words-0.039
der-0.036
itted-0.034
anked-0.033
TokenMom
Token ID29252
Feature activation+9.63e-01

Pos logit contributions
chard+0.034
nam+0.034
shadow+0.031
 unatt+0.030
 upon+0.030
Neg logit contributions
ously-0.028
 words-0.028
der-0.026
itted-0.024
anked-0.024
Token smiled
Token ID13541
Feature activation+1.29e+00

Pos logit contributions
der+0.027
ously+0.027
itted+0.027
 words+0.026
 fault+0.026
Neg logit contributions
 upon-0.012
chard-0.012
nam-0.012
shadow-0.011
 forget-0.011
Token about
Token ID546
Feature activation-1.01e+00

Pos logit contributions
nam+0.038
chard+0.038
 upon+0.035
shadow+0.035
 forget+0.034
Neg logit contributions
ously-0.035
 words-0.031
der-0.031
itted-0.030
anked-0.029
Token we
Token ID356
Feature activation-1.77e+00

Pos logit contributions
chard+0.019
nam+0.018
anut+0.018
shadow+0.017
 unatt+0.017
Neg logit contributions
 words-0.027
der-0.024
 enough-0.022
 news-0.022
ield-0.022
Token play
Token ID711
Feature activation-1.77e+00

Pos logit contributions
nam+0.042
shadow+0.042
chard+0.042
 Merry+0.038
anut+0.037
Neg logit contributions
 words-0.036
der-0.033
ously-0.032
 ideas-0.029
 idea-0.029
Token with
Token ID351
Feature activation-2.03e+00

Pos logit contributions
nam+0.048
chard+0.047
shadow+0.045
anut+0.043
 forget+0.042
Neg logit contributions
 words-0.033
der-0.030
ously-0.028
eared-0.026
 skills-0.025
Token your
Token ID534
Feature activation-1.22e+00

Pos logit contributions
shadow+0.048
chard+0.045
nam+0.043
 forget+0.042
|+0.041
Neg logit contributions
 words-0.045
der-0.041
ously-0.039
anked-0.035
 ideas-0.034
Token dolls
Token ID36062
Feature activation-6.93e+00

Pos logit contributions
shadow+0.041
chard+0.040
nam+0.036
 Merry+0.035
riage+0.035
Neg logit contributions
 words-0.019
 ideas-0.014
 skills-0.013
 idea-0.012
 skill-0.012
Token or
Token ID393
Feature activation-4.12e+00

Pos logit contributions
nam+0.151
shadow+0.144
chard+0.142
 Merry+0.140
|+0.129
Neg logit contributions
der-0.153
 words-0.149
 enough-0.134
ously-0.129
 to-0.128
Token your
Token ID534
Feature activation-1.25e+00

Pos logit contributions
shadow+0.102
chard+0.093
|+0.091
anut+0.090
har+0.089
Neg logit contributions
 words-0.090
der-0.076
ously-0.068
 ideas-0.068
 enough-0.064
Token blocks
Token ID7021
Feature activation-6.13e+00

Pos logit contributions
shadow+0.033
chard+0.033
nam+0.030
 Merry+0.029
anut+0.028
Neg logit contributions
 words-0.026
der-0.022
ously-0.021
 skills-0.020
 ideas-0.020
Token?
Token ID30
Feature activation-1.07e+00

Pos logit contributions
nam+0.099
shadow+0.097
chard+0.096
 Merry+0.091
cel+0.084
Neg logit contributions
der-0.170
 words-0.167
ously-0.155
 enough-0.149
itted-0.145
Token And
Token ID843
Feature activation-3.31e+00

Pos logit contributions
chard+0.026
nam+0.026
anut+0.025
shadow+0.025
 unatt+0.025
Neg logit contributions
 words-0.021
der-0.021
ously-0.018
itted-0.018
ield-0.017
TokenHey
Token ID10814
Feature activation-1.12e+00

Pos logit contributions
ously+0.038
 words+0.037
der+0.036
itted+0.033
anked+0.033
Neg logit contributions
chard-0.033
nam-0.033
shadow-0.032
anut-0.029
 thirsty-0.029
Token,
Token ID11
Feature activation-2.30e+00

Pos logit contributions
nam+0.039
chard+0.039
|+0.037
shadow+0.036
 unatt+0.035
Neg logit contributions
 words-0.010
ously-0.010
der-0.009
 enough-0.008
anked-0.007
Token where
Token ID810
Feature activation-2.21e+00

Pos logit contributions
chard+0.050
nam+0.049
shadow+0.046
|+0.043
illas+0.041
Neg logit contributions
 words-0.053
der-0.052
ously-0.052
anked-0.046
 enough-0.045
Token are
Token ID389
Feature activation-6.04e-01

Pos logit contributions
chard+0.052
nam+0.050
shadow+0.049
anut+0.044
 unatt+0.043
Neg logit contributions
 words-0.049
ously-0.048
der-0.045
ield-0.042
anked-0.041
Token the
Token ID262
Feature activation-8.53e-01

Pos logit contributions
chard+0.011
nam+0.010
shadow+0.010
|+0.009
 unatt+0.009
Neg logit contributions
 words-0.016
der-0.015
ously-0.015
ield-0.014
 enough-0.014
Token eggs
Token ID9653
Feature activation-6.92e+00

Pos logit contributions
nam+0.021
chard+0.021
shadow+0.020
|+0.019
 Merry+0.019
Neg logit contributions
 words-0.017
ously-0.017
der-0.016
itted-0.014
anked-0.014
Token?"
Token ID1701
Feature activation-2.34e+00

Pos logit contributions
nam+0.146
chard+0.128
 Merry+0.115
shadow+0.114
|+0.113
Neg logit contributions
 words-0.159
ously-0.158
der-0.156
 enough-0.154
ield-0.149
Token Ben
Token ID3932
Feature activation-2.38e-01

Pos logit contributions
shadow+0.061
chard+0.057
illas+0.057
nam+0.056
anut+0.056
Neg logit contributions
 words-0.047
der-0.043
ously-0.041
anked-0.036
itted-0.035
Token asked
Token ID1965
Feature activation-6.17e-01

Pos logit contributions
chard+0.004
nam+0.004
shadow+0.004
 thirsty+0.004
 forget+0.004
Neg logit contributions
ously-0.006
der-0.006
 words-0.006
itted-0.006
anked-0.005
Token.
Token ID13
Feature activation-1.09e+00

Pos logit contributions
chard+0.016
nam+0.016
shadow+0.015
 upon+0.015
 Merry+0.015
Neg logit contributions
ously-0.011
der-0.011
 words-0.011
itted-0.010
anked-0.009
Token "
Token ID366
Feature activation+5.33e-02

Pos logit contributions
chard+0.024
nam+0.023
shadow+0.021
 upon+0.021
 Merry+0.021
Neg logit contributions
ously-0.025
 words-0.023
der-0.023
itted-0.022
anked-0.022
Token I
Token ID314
Feature activation-1.17e+00

Pos logit contributions
chard+0.004
anut+0.004
aj+0.004
 thirsty+0.004
nam+0.004
Neg logit contributions
 words-0.004
der-0.004
ield-0.004
 advice-0.003
anked-0.003
Token help
Token ID1037
Feature activation-2.10e+00

Pos logit contributions
nam+0.025
chard+0.025
shadow+0.023
 Merry+0.022
 unatt+0.022
Neg logit contributions
ously-0.026
 words-0.026
der-0.025
itted-0.023
anked-0.023
Token you
Token ID345
Feature activation-1.94e+00

Pos logit contributions
chard+0.040
nam+0.039
|+0.037
shadow+0.036
 thirsty+0.034
Neg logit contributions
 words-0.054
ously-0.050
der-0.049
anked-0.047
ield-0.045
Token print
Token ID3601
Feature activation-2.06e+00

Pos logit contributions
nam+0.029
chard+0.029
shadow+0.025
 Merry+0.024
|+0.023
Neg logit contributions
ously-0.057
 words-0.056
der-0.056
itted-0.052
anked-0.052
Token some
Token ID617
Feature activation-1.75e+00

Pos logit contributions
chard+0.049
nam+0.048
shadow+0.046
|+0.043
anut+0.043
Neg logit contributions
 words-0.043
ously-0.042
der-0.041
anked-0.037
itted-0.036
Token papers
Token ID9473
Feature activation-6.92e+00

Pos logit contributions
chard+0.036
nam+0.035
shadow+0.033
 Merry+0.031
|+0.031
Neg logit contributions
ously-0.042
 words-0.042
der-0.041
itted-0.037
anked-0.037
Token too
Token ID1165
Feature activation-3.74e+00

Pos logit contributions
nam+0.099
chard+0.097
|+0.089
 Merry+0.088
shadow+0.086
Neg logit contributions
 words-0.202
der-0.201
ously-0.197
itted-0.182
anked-0.181
Token?"
Token ID1701
Feature activation-6.67e-01

Pos logit contributions
chard+0.024
nam+0.023
shadow+0.018
 Merry+0.013
 upon+0.012
Neg logit contributions
ously-0.145
 words-0.140
der-0.139
itted-0.133
anked-0.131
Token the
Token ID262
Feature activation-2.38e+00

Pos logit contributions
chard+0.014
nam+0.014
shadow+0.013
anut+0.012
 Merry+0.012
Neg logit contributions
ously-0.016
 words-0.015
der-0.015
itted-0.014
anked-0.014
Token prince
Token ID19716
Feature activation+3.75e-01

Pos logit contributions
chard+0.045
shadow+0.044
nam+0.044
|+0.042
 Merry+0.039
Neg logit contributions
 words-0.061
ously-0.059
der-0.057
itted-0.052
eared-0.050
Token asked
Token ID1965
Feature activation-7.24e-02

Pos logit contributions
ously+0.009
der+0.009
 words+0.009
itted+0.009
anked+0.009
Neg logit contributions
nam-0.007
chard-0.007
shadow-0.007
 forget-0.006
 thirsty-0.006
Token I
Token ID314
Feature activation-1.38e+00

Pos logit contributions
chard+0.002
shadow+0.002
 unatt+0.002
anut+0.002
aj+0.002
Neg logit contributions
 words-0.001
der-0.001
ield-0.001
 that-0.001
 advice-0.001
Token help
Token ID1037
Feature activation-2.15e+00

Pos logit contributions
nam+0.032
chard+0.030
shadow+0.028
 unatt+0.026
 Merry+0.025
Neg logit contributions
 words-0.030
ously-0.029
der-0.028
ield-0.025
itted-0.025
Token you
Token ID345
Feature activation-2.42e+00

Pos logit contributions
nam+0.048
chard+0.047
|+0.044
shadow+0.044
 thirsty+0.041
Neg logit contributions
 words-0.047
der-0.041
anked-0.040
ously-0.040
 advice-0.039
Token with
Token ID351
Feature activation-2.62e+00

Pos logit contributions
nam+0.037
chard+0.034
 Merry+0.031
shadow+0.031
|+0.031
Neg logit contributions
der-0.069
 words-0.069
ously-0.068
anked-0.063
itted-0.061
Token the
Token ID262
Feature activation-5.53e-01

Pos logit contributions
chard+0.055
nam+0.054
shadow+0.053
|+0.053
 forget+0.050
Neg logit contributions
 words-0.059
der-0.056
ously-0.054
anked-0.051
 enough-0.050
Token eggs
Token ID9653
Feature activation-6.91e+00

Pos logit contributions
nam+0.013
chard+0.013
shadow+0.012
 Merry+0.011
 forget+0.011
Neg logit contributions
ously-0.012
der-0.011
 words-0.011
itted-0.011
anked-0.010
Token?"
Token ID1701
Feature activation-2.41e+00

Pos logit contributions
nam+0.117
chard+0.102
 Merry+0.100
|+0.095
shadow+0.094
Neg logit contributions
der-0.185
ously-0.183
 words-0.181
 enough-0.175
anked-0.167
Token Max
Token ID5436
Feature activation+4.08e-01

Pos logit contributions
shadow+0.060
illas+0.059
chard+0.058
anut+0.058
 unatt+0.057
Neg logit contributions
 words-0.049
der-0.044
ously-0.044
anked-0.040
itted-0.039
Token asked
Token ID1965
Feature activation-2.31e-01

Pos logit contributions
ously+0.012
der+0.012
 words+0.011
anked+0.011
itted+0.011
Neg logit contributions
chard-0.006
nam-0.006
 thirsty-0.005
shadow-0.005
 forget-0.005
Token.
Token ID13
Feature activation-3.72e-01

Pos logit contributions
nam+0.006
chard+0.006
shadow+0.006
 upon+0.005
 Merry+0.005
Neg logit contributions
ously-0.004
der-0.004
 words-0.004
itted-0.004
anked-0.003
Token He
Token ID679
Feature activation-1.03e+00

Pos logit contributions
chard+0.008
nam+0.008
shadow+0.008
 Merry+0.007
anut+0.007
Neg logit contributions
ously-0.008
 words-0.008
der-0.008
itted-0.007
anked-0.007
Token so
Token ID523
Feature activation-2.22e+00

Pos logit contributions
nam+0.079
chard+0.079
shadow+0.070
 Merry+0.067
anut+0.063
Neg logit contributions
ously-0.176
 words-0.172
der-0.170
itted-0.161
anked-0.158
Token I
Token ID314
Feature activation-1.66e+00

Pos logit contributions
chard+0.045
|+0.045
nam+0.044
shadow+0.043
illas+0.043
Neg logit contributions
der-0.052
 words-0.051
ously-0.048
anked-0.045
ield-0.042
Token can
Token ID460
Feature activation+9.80e-01

Pos logit contributions
nam+0.038
chard+0.037
shadow+0.033
anut+0.032
 Merry+0.031
Neg logit contributions
 words-0.037
ously-0.037
der-0.036
itted-0.033
ield-0.032
Token reach
Token ID3151
Feature activation-2.50e+00

Pos logit contributions
ously+0.026
 words+0.025
der+0.024
itted+0.023
anked+0.022
Neg logit contributions
chard-0.018
nam-0.018
shadow-0.017
 forget-0.016
 Merry-0.015
Token the
Token ID262
Feature activation-1.05e+00

Pos logit contributions
chard+0.050
nam+0.050
|+0.046
shadow+0.045
 forget+0.045
Neg logit contributions
 words-0.060
der-0.058
ously-0.054
anked-0.052
 enough-0.050
Token branches
Token ID13737
Feature activation-6.89e+00

Pos logit contributions
nam+0.025
chard+0.025
shadow+0.023
 Merry+0.022
 forget+0.022
Neg logit contributions
ously-0.023
 words-0.021
der-0.021
itted-0.019
anked-0.019
Token?"
Token ID1701
Feature activation-5.33e-01

Pos logit contributions
nam+0.113
chard+0.106
 Merry+0.103
cel+0.097
anut+0.096
Neg logit contributions
der-0.185
ously-0.182
 words-0.182
anked-0.168
 enough-0.165
Token she
Token ID673
Feature activation-1.33e+00

Pos logit contributions
chard+0.013
nam+0.013
shadow+0.013
illas+0.012
anut+0.012
Neg logit contributions
ously-0.011
 words-0.010
der-0.009
itted-0.009
anked-0.009
Token asked
Token ID1965
Feature activation-3.81e-01

Pos logit contributions
nam+0.026
chard+0.026
shadow+0.024
 thirsty+0.022
 upon+0.021
Neg logit contributions
ously-0.035
der-0.033
 words-0.032
itted-0.031
anked-0.030
Token.
Token ID13
Feature activation-4.95e-01

Pos logit contributions
nam+0.010
chard+0.010
shadow+0.009
 upon+0.009
 unatt+0.008
Neg logit contributions
ously-0.007
der-0.007
 words-0.007
itted-0.006
anked-0.006
Token\n
Token ID198
Feature activation-8.63e-01

Pos logit contributions
chard+0.011
nam+0.011
 Merry+0.010
shadow+0.010
 upon+0.010
Neg logit contributions
ously-0.011
der-0.011
 words-0.011
aced-0.010
itted-0.010
Token you
Token ID345
Feature activation-8.80e-01

Pos logit contributions
chard+0.009
anut+0.009
shadow+0.009
nam+0.009
 unatt+0.009
Neg logit contributions
 words-0.010
der-0.009
ield-0.008
 advice-0.008
anked-0.008
Token help
Token ID1037
Feature activation-1.68e+00

Pos logit contributions
chard+0.019
nam+0.019
shadow+0.018
 Merry+0.017
 forget+0.017
Neg logit contributions
ously-0.020
 words-0.019
der-0.019
itted-0.018
anked-0.017
Token me
Token ID502
Feature activation-2.99e+00

Pos logit contributions
nam+0.038
chard+0.038
shadow+0.036
|+0.034
 thirsty+0.033
Neg logit contributions
 words-0.037
ously-0.034
der-0.034
anked-0.032
ield-0.030
Token crack
Token ID8469
Feature activation-2.72e+00

Pos logit contributions
nam+0.056
chard+0.052
|+0.050
 Merry+0.050
shadow+0.047
Neg logit contributions
 words-0.075
der-0.073
ously-0.071
anked-0.068
 enough-0.065
Token the
Token ID262
Feature activation-1.57e+00

Pos logit contributions
nam+0.057
chard+0.055
shadow+0.051
 forget+0.051
|+0.049
Neg logit contributions
 words-0.065
der-0.065
ously-0.061
anked-0.058
itted-0.055
Token eggs
Token ID9653
Feature activation-6.88e+00

Pos logit contributions
chard+0.039
nam+0.039
shadow+0.037
|+0.035
 Merry+0.034
Neg logit contributions
 words-0.031
ously-0.031
der-0.030
itted-0.026
anked-0.026
Token and
Token ID290
Feature activation-4.05e+00

Pos logit contributions
nam+0.141
chard+0.128
 Merry+0.124
shadow+0.121
|+0.118
Neg logit contributions
der-0.159
 words-0.157
ously-0.152
anked-0.145
 enough-0.140
Token put
Token ID1234
Feature activation-2.46e+00

Pos logit contributions
|+0.088
shadow+0.087
chard+0.084
nam+0.080
cel+0.080
Neg logit contributions
 words-0.091
der-0.082
ously-0.074
 advice-0.069
 enough-0.069
Token them
Token ID606
Feature activation-5.27e+00

Pos logit contributions
chard+0.066
nam+0.065
shadow+0.064
 forget+0.062
|+0.061
Neg logit contributions
 words-0.043
der-0.039
ously-0.039
anked-0.034
 enough-0.033
Token in
Token ID287
Feature activation-3.14e+00

Pos logit contributions
 Merry+0.130
shadow+0.130
cel+0.129
nam+0.128
chard+0.128
Neg logit contributions
 words-0.088
der-0.084
 enough-0.081
 to-0.075
 skills-0.073
Token the
Token ID262
Feature activation-2.47e+00

Pos logit contributions
chard+0.062
nam+0.061
shadow+0.057
 Merry+0.054
|+0.053
Neg logit contributions
 words-0.076
ously-0.075
der-0.073
anked-0.067
itted-0.066
TokenCan
Token ID6090
Feature activation+7.15e-01

Pos logit contributions
ously+0.068
der+0.064
 words+0.064
itted+0.060
anked+0.059
Neg logit contributions
nam-0.042
chard-0.042
shadow-0.038
 Merry-0.036
|-0.035
Token you
Token ID345
Feature activation-2.46e-01

Pos logit contributions
 words+0.017
der+0.015
ield+0.014
ously+0.014
 advice+0.013
Neg logit contributions
anut-0.014
shadow-0.014
chard-0.014
 afloat-0.014
 roasted-0.014
Token please
Token ID3387
Feature activation-1.48e+00

Pos logit contributions
chard+0.005
nam+0.005
shadow+0.005
 Merry+0.005
anut+0.005
Neg logit contributions
ously-0.006
 words-0.005
der-0.005
itted-0.005
anked-0.005
Token put
Token ID1234
Feature activation-2.74e+00

Pos logit contributions
nam+0.036
chard+0.035
shadow+0.033
anut+0.030
 Merry+0.030
Neg logit contributions
 words-0.031
ously-0.028
der-0.027
anked-0.025
ield-0.025
Token the
Token ID262
Feature activation-5.67e-01

Pos logit contributions
chard+0.059
shadow+0.057
nam+0.056
 forget+0.056
|+0.052
Neg logit contributions
 words-0.064
der-0.057
ously-0.055
 enough-0.053
eared-0.053
Token dolls
Token ID36062
Feature activation-6.87e+00

Pos logit contributions
shadow+0.013
chard+0.013
nam+0.012
anut+0.012
 Merry+0.012
Neg logit contributions
 words-0.013
der-0.011
ously-0.011
itted-0.010
 skills-0.010
Token back
Token ID736
Feature activation-4.50e+00

Pos logit contributions
 Merry+0.164
nam+0.162
cel+0.159
chard+0.157
shadow+0.156
Neg logit contributions
 words-0.117
 enough-0.111
der-0.106
ously-0.105
eared-0.103
Token in
Token ID287
Feature activation-3.25e+00

Pos logit contributions
nam+0.104
chard+0.101
shadow+0.099
 Merry+0.097
cel+0.096
Neg logit contributions
 words-0.087
ously-0.086
der-0.085
 enough-0.076
itted-0.075
Token the
Token ID262
Feature activation-2.10e+00

Pos logit contributions
chard+0.062
nam+0.059
shadow+0.057
 Merry+0.054
anut+0.053
Neg logit contributions
 words-0.082
ously-0.080
der-0.077
anked-0.072
itted-0.071
Token box
Token ID3091
Feature activation-4.04e+00

Pos logit contributions
shadow+0.051
chard+0.051
nam+0.049
anut+0.045
illas+0.044
Neg logit contributions
 words-0.046
ously-0.041
der-0.040
itted-0.037
eared-0.036
Token when
Token ID618
Feature activation-3.73e+00

Pos logit contributions
nam+0.072
chard+0.070
shadow+0.065
 Merry+0.061
anut+0.059
Neg logit contributions
ously-0.109
 words-0.106
der-0.105
itted-0.096
anked-0.095
Token I
Token ID314
Feature activation-5.01e-01

Pos logit contributions
anut+0.003
aj+0.003
chard+0.002
nam+0.002
bie+0.002
Neg logit contributions
 words-0.002
ield-0.002
der-0.002
 advice-0.002
 to-0.002
Token help
Token ID1037
Feature activation-1.68e+00

Pos logit contributions
nam+0.014
chard+0.013
shadow+0.012
 unatt+0.012
anut+0.011
Neg logit contributions
 words-0.009
ield-0.008
ously-0.007
 advice-0.007
 idea-0.007
Token you
Token ID345
Feature activation-1.63e+00

Pos logit contributions
nam+0.035
chard+0.035
|+0.034
aj+0.032
shadow+0.031
Neg logit contributions
 words-0.039
der-0.033
anked-0.033
 advice-0.032
ously-0.032
Token load
Token ID3440
Feature activation-2.58e+00

Pos logit contributions
nam+0.033
chard+0.032
 Merry+0.030
|+0.029
shadow+0.029
Neg logit contributions
 words-0.038
ously-0.038
der-0.038
anked-0.035
ield-0.033
Token the
Token ID262
Feature activation-4.38e-01

Pos logit contributions
chard+0.058
nam+0.056
 forget+0.056
shadow+0.055
|+0.054
Neg logit contributions
 words-0.057
der-0.051
anked-0.048
 enough-0.048
ously-0.047
Token boxes
Token ID10559
Feature activation-6.86e+00

Pos logit contributions
chard+0.011
nam+0.011
shadow+0.010
 Merry+0.010
anut+0.009
Neg logit contributions
ously-0.009
 words-0.009
der-0.008
itted-0.008
anked-0.007
Token?"
Token ID1701
Feature activation-1.35e+00

Pos logit contributions
nam+0.154
 Merry+0.149
chard+0.148
cel+0.145
shadow+0.143
Neg logit contributions
ously-0.136
der-0.135
 words-0.134
anked-0.123
 enough-0.120
Token asked
Token ID1965
Feature activation-1.35e+00

Pos logit contributions
anut+0.033
chard+0.033
nam+0.033
shadow+0.033
illas+0.032
Neg logit contributions
 words-0.025
ously-0.023
der-0.022
anked-0.021
ield-0.020
Token Tim
Token ID5045
Feature activation+1.79e-01

Pos logit contributions
chard+0.031
shadow+0.031
nam+0.030
anut+0.030
aj+0.030
Neg logit contributions
 words-0.027
ously-0.026
der-0.024
anked-0.023
 enough-0.022
Tokenmy
Token ID1820
Feature activation+5.35e-01

Pos logit contributions
ously+0.006
 words+0.006
der+0.006
itted+0.006
anked+0.005
Neg logit contributions
chard-0.002
nam-0.002
shadow-0.002
 Merry-0.001
anut-0.001
Token.
Token ID13
Feature activation-1.78e-01

Pos logit contributions
ously+0.009
 words+0.009
der+0.009
itted+0.008
anked+0.008
Neg logit contributions
chard-0.015
nam-0.015
shadow-0.014
 Merry-0.013
anut-0.013
Token"
Token ID1
Feature activation+1.82e+00

Pos logit contributions
nam+0.022
chard+0.021
shadow+0.019
 unatt+0.019
|+0.019
Neg logit contributions
ously-0.017
 words-0.017
der-0.016
anked-0.015
eared-0.015
TokenCan
Token ID6090
Feature activation-1.80e-01

Pos logit contributions
ously+0.046
 words+0.043
der+0.043
itted+0.040
anked+0.039
Neg logit contributions
nam-0.037
chard-0.037
shadow-0.033
 Merry-0.032
 forget-0.031
Token I
Token ID314
Feature activation-9.37e-01

Pos logit contributions
anut+0.004
shadow+0.004
aj+0.004
nam+0.004
chard+0.004
Neg logit contributions
 words-0.004
der-0.003
 to-0.003
ield-0.003
 that-0.003
Token put
Token ID1234
Feature activation-3.18e+00

Pos logit contributions
nam+0.024
chard+0.023
shadow+0.022
 unatt+0.020
riage+0.020
Neg logit contributions
 words-0.018
der-0.017
ously-0.017
anked-0.015
ield-0.015
Token the
Token ID262
Feature activation-8.57e-01

Pos logit contributions
chard+0.070
nam+0.070
shadow+0.067
|+0.064
 forget+0.063
Neg logit contributions
 words-0.072
der-0.068
ously-0.065
anked-0.062
 enough-0.059
Token apples
Token ID22514
Feature activation-6.86e+00

Pos logit contributions
chard+0.021
nam+0.021
shadow+0.020
 Merry+0.019
anut+0.019
Neg logit contributions
ously-0.017
 words-0.016
der-0.016
itted-0.014
anked-0.014
Token on
Token ID319
Feature activation-3.41e+00

Pos logit contributions
nam+0.173
cel+0.165
 Merry+0.163
|+0.161
shadow+0.159
Neg logit contributions
der-0.114
 enough-0.110
 words-0.109
ously-0.103
anked-0.101
Token the
Token ID262
Feature activation-1.94e+00

Pos logit contributions
nam+0.066
chard+0.064
shadow+0.063
|+0.057
 forget+0.057
Neg logit contributions
 words-0.084
ously-0.082
der-0.082
anked-0.075
eared-0.071
Token dough
Token ID15756
Feature activation-4.36e+00

Pos logit contributions
nam+0.047
shadow+0.046
chard+0.046
|+0.043
 upon+0.041
Neg logit contributions
 words-0.040
der-0.038
ously-0.038
itted-0.032
anked-0.032
Token?"
Token ID1701
Feature activation-2.28e+00

Pos logit contributions
nam+0.069
chard+0.067
shadow+0.063
 Merry+0.057
|+0.055
Neg logit contributions
ously-0.123
 words-0.123
der-0.122
anked-0.111
itted-0.111
Token Tom
Token ID4186
Feature activation-3.73e-01

Pos logit contributions
nam+0.061
chard+0.060
shadow+0.060
anut+0.058
 unatt+0.058
Neg logit contributions
 words-0.041
ously-0.038
der-0.036
anked-0.033
eared-0.031
Token like
Token ID588
Feature activation+1.22e+00

Pos logit contributions
ously+0.001
 words+0.001
der+0.001
itted+0.000
anked+0.000
Neg logit contributions
chard-0.001
nam-0.001
shadow-0.001
 Merry-0.001
 upon-0.001
Token a
Token ID257
Feature activation+5.18e-01

Pos logit contributions
ously+0.029
der+0.027
itted+0.026
anked+0.026
 words+0.025
Neg logit contributions
nam-0.026
chard-0.026
 Merry-0.023
 upon-0.023
 thirsty-0.022
Token frog
Token ID21264
Feature activation+5.98e-01

Pos logit contributions
 words+0.010
der+0.009
ously+0.009
 skill+0.008
 word+0.008
Neg logit contributions
nam-0.013
chard-0.013
shadow-0.013
illas-0.012
anut-0.012
Token,
Token ID11
Feature activation-1.66e+00

Pos logit contributions
ously+0.011
 words+0.011
der+0.010
itted+0.009
anked+0.009
Neg logit contributions
chard-0.016
nam-0.015
shadow-0.014
 Merry-0.014
 upon-0.014
Token fl
Token ID781
Feature activation-5.80e-01

Pos logit contributions
chard+0.042
nam+0.041
shadow+0.040
 Merry+0.038
|+0.038
Neg logit contributions
ously-0.032
 words-0.031
der-0.030
itted-0.025
anked-0.024
Tokenapping
Token ID5912
Feature activation+2.76e+00

Pos logit contributions
chard+0.016
nam+0.016
shadow+0.015
 Merry+0.014
anut+0.014
Neg logit contributions
ously-0.010
 words-0.010
der-0.009
itted-0.008
anked-0.008
Token his
Token ID465
Feature activation+4.91e-01

Pos logit contributions
ously+0.066
der+0.066
 words+0.063
itted+0.059
anked+0.059
Neg logit contributions
chard-0.052
nam-0.051
 upon-0.049
shadow-0.046
 Merry-0.044
Token arms
Token ID5101
Feature activation+4.61e-01

Pos logit contributions
ously+0.009
der+0.008
 words+0.008
anked+0.007
itted+0.007
Neg logit contributions
nam-0.013
chard-0.013
shadow-0.012
 upon-0.012
 forget-0.012
Token like
Token ID588
Feature activation+1.24e+00

Pos logit contributions
 words+0.010
ously+0.010
ield+0.010
der+0.010
 example+0.010
Neg logit contributions
 upon-0.009
chard-0.009
nam-0.009
 thirsty-0.008
 tired-0.008
Token a
Token ID257
Feature activation+1.33e+00

Pos logit contributions
ously+0.031
itted+0.029
anked+0.029
der+0.029
aced+0.028
Neg logit contributions
nam-0.024
chard-0.024
 Merry-0.021
 upon-0.021
 thirsty-0.021
Token butterfly
Token ID35113
Feature activation+1.09e+00

Pos logit contributions
ously+0.030
 words+0.029
der+0.028
itted+0.026
anked+0.025
Neg logit contributions
chard-0.030
nam-0.030
shadow-0.028
 Merry-0.026
 forget-0.026
Token movie
Token ID3807
Feature activation+1.09e+00

Pos logit contributions
ously+0.023
der+0.022
 words+0.021
itted+0.020
anked+0.020
Neg logit contributions
chard-0.032
nam-0.032
shadow-0.029
 Merry-0.029
 forget-0.028
Token to
Token ID284
Feature activation+6.36e-01

Pos logit contributions
ously+0.025
der+0.023
 words+0.022
itted+0.022
anked+0.021
Neg logit contributions
chard-0.024
nam-0.024
 Merry-0.022
shadow-0.022
 forget-0.021
Token watch
Token ID2342
Feature activation-3.23e-01

Pos logit contributions
ously+0.015
 words+0.015
der+0.015
anked+0.013
itted+0.013
Neg logit contributions
chard-0.013
nam-0.013
shadow-0.012
anut-0.011
 thirsty-0.011
Token.
Token ID13
Feature activation+7.22e-01

Pos logit contributions
chard+0.008
nam+0.008
shadow+0.008
anut+0.007
 forget+0.007
Neg logit contributions
 words-0.006
ously-0.006
der-0.006
anked-0.005
itted-0.005
Token He
Token ID679
Feature activation+2.53e-01

Pos logit contributions
ously+0.017
 words+0.017
der+0.016
itted+0.015
anked+0.015
Neg logit contributions
chard-0.015
nam-0.015
shadow-0.014
 Merry-0.013
 upon-0.013
Token wanted
Token ID2227
Feature activation+3.24e+00

Pos logit contributions
ously+0.006
 words+0.006
der+0.006
itted+0.005
anked+0.005
Neg logit contributions
chard-0.005
nam-0.005
shadow-0.005
 Merry-0.005
 forget-0.005
Token to
Token ID284
Feature activation+9.88e-01

Pos logit contributions
ously+0.055
der+0.051
itted+0.047
anked+0.046
 words+0.042
Neg logit contributions
chard-0.092
nam-0.091
 Merry-0.086
shadow-0.084
 upon-0.084
Token watch
Token ID2342
Feature activation-2.62e-01

Pos logit contributions
ously+0.026
 words+0.026
der+0.025
itted+0.023
anked+0.023
Neg logit contributions
chard-0.018
nam-0.018
shadow-0.017
anut-0.015
 Merry-0.015
Token them
Token ID606
Feature activation-9.48e-01

Pos logit contributions
chard+0.006
nam+0.006
shadow+0.006
anut+0.006
|+0.005
Neg logit contributions
 words-0.006
der-0.005
ously-0.005
anked-0.004
itted-0.004
Token all
Token ID477
Feature activation-3.53e-01

Pos logit contributions
chard+0.023
nam+0.022
shadow+0.020
anut+0.019
 unatt+0.019
Neg logit contributions
 words-0.020
ously-0.020
der-0.019
itted-0.017
anked-0.017
Token.
Token ID13
Feature activation+9.22e-01

Pos logit contributions
chard+0.008
nam+0.008
shadow+0.008
 Merry+0.007
anut+0.007
Neg logit contributions
ously-0.008
 words-0.007
der-0.007
itted-0.007
anked-0.006
Token milk
Token ID7545
Feature activation-2.94e+00

Pos logit contributions
ously+0.016
der+0.015
itted+0.014
 words+0.013
anked+0.013
Neg logit contributions
nam-0.018
chard-0.017
 thirsty-0.016
 unatt-0.015
 Merry-0.015
Token,
Token ID11
Feature activation-2.06e+00

Pos logit contributions
chard+0.075
nam+0.073
shadow+0.068
 upon+0.066
 unatt+0.066
Neg logit contributions
ously-0.058
 words-0.055
der-0.053
itted-0.050
anked-0.048
Token cookies
Token ID14746
Feature activation-1.57e+00

Pos logit contributions
chard+0.047
nam+0.047
shadow+0.044
 Merry+0.041
anut+0.041
Neg logit contributions
ously-0.045
 words-0.044
der-0.043
itted-0.039
anked-0.038
Token and
Token ID290
Feature activation-1.05e+00

Pos logit contributions
chard+0.038
nam+0.036
 upon+0.034
 unatt+0.034
 forget+0.034
Neg logit contributions
ously-0.031
 words-0.030
der-0.029
itted-0.028
eared-0.028
Token a
Token ID257
Feature activation+9.90e-01

Pos logit contributions
chard+0.025
nam+0.025
shadow+0.024
 Merry+0.022
anut+0.022
Neg logit contributions
ously-0.022
 words-0.021
der-0.020
itted-0.018
anked-0.018
Token new
Token ID649
Feature activation+2.73e+00

Pos logit contributions
ously+0.023
 words+0.022
der+0.022
itted+0.020
anked+0.019
Neg logit contributions
chard-0.021
nam-0.021
shadow-0.020
 Merry-0.019
anut-0.018
Token toy
Token ID13373
Feature activation+1.16e-01

Pos logit contributions
ously+0.063
der+0.061
itted+0.057
anked+0.057
 words+0.056
Neg logit contributions
chard-0.058
nam-0.057
 thirsty-0.051
shadow-0.051
 upon-0.051
Token.
Token ID13
Feature activation+9.53e-01

Pos logit contributions
ously+0.002
 words+0.002
itted+0.002
der+0.002
anked+0.002
Neg logit contributions
chard-0.003
nam-0.003
 Merry-0.002
 forget-0.002
 unatt-0.002
Token Jon
Token ID5966
Feature activation+1.38e+00

Pos logit contributions
ously+0.021
 words+0.020
der+0.020
itted+0.019
eared+0.018
Neg logit contributions
chard-0.021
nam-0.021
shadow-0.020
 Merry-0.019
 thirsty-0.019
Token was
Token ID373
Feature activation-5.41e-01

Pos logit contributions
ously+0.033
 words+0.032
der+0.032
itted+0.029
anked+0.028
Neg logit contributions
chard-0.028
nam-0.028
shadow-0.027
 forget-0.025
 Merry-0.025
Token happy
Token ID3772
Feature activation+2.31e+00

Pos logit contributions
chard+0.012
nam+0.012
shadow+0.011
 Merry+0.010
anut+0.010
Neg logit contributions
ously-0.012
 words-0.012
der-0.012
itted-0.011
anked-0.010
Token
Token ID198
Feature activation+2.89e-01

Pos logit contributions
nam+0.013
chard+0.013
shadow+0.012
 forget+0.011
 Merry+0.011
Neg logit contributions
ously-0.012
 words-0.012
der-0.011
itted-0.011
anked-0.010
TokenJack
Token ID14295
Feature activation+2.70e+00

Pos logit contributions
ously+0.007
 words+0.007
der+0.007
itted+0.006
anked+0.006
Neg logit contributions
chard-0.006
nam-0.006
shadow-0.005
 Merry-0.005
 upon-0.005
Token and
Token ID290
Feature activation-1.76e+00

Pos logit contributions
ously+0.056
der+0.055
 words+0.054
itted+0.049
anked+0.049
Neg logit contributions
chard-0.061
nam-0.058
shadow-0.057
 forget-0.056
 upon-0.056
Token his
Token ID465
Feature activation+1.29e-01

Pos logit contributions
chard+0.038
nam+0.038
shadow+0.036
 Merry+0.033
 forget+0.033
Neg logit contributions
ously-0.041
 words-0.039
der-0.039
itted-0.036
anked-0.035
Token mum
Token ID25682
Feature activation+2.50e+00

Pos logit contributions
ously+0.004
der+0.004
eared+0.004
itted+0.004
anked+0.004
Neg logit contributions
chard-0.002
nam-0.002
 thirsty-0.002
 forget-0.002
 Merry-0.001
Token had
Token ID550
Feature activation+3.42e+00

Pos logit contributions
ously+0.068
der+0.067
 words+0.067
itted+0.061
anked+0.060
Neg logit contributions
nam-0.042
chard-0.042
 forget-0.037
shadow-0.036
anut-0.035
Token a
Token ID257
Feature activation+2.79e+00

Pos logit contributions
der+0.093
ously+0.092
anked+0.082
itted+0.079
 words+0.076
Neg logit contributions
nam-0.062
chard-0.059
 thirsty-0.058
 cozy-0.056
shadow-0.055
Token wonderful
Token ID7932
Feature activation+5.07e+00

Pos logit contributions
ously+0.065
 words+0.063
der+0.062
itted+0.057
anked+0.056
Neg logit contributions
chard-0.060
nam-0.060
shadow-0.055
 Merry-0.052
 forget-0.051
Token time
Token ID640
Feature activation+1.22e+00

Pos logit contributions
der+0.147
ously+0.137
anked+0.120
itted+0.117
eared+0.117
Neg logit contributions
 forget-0.080
 upon-0.079
|-0.079
 cozy-0.079
 thirsty-0.074
Token by
Token ID416
Feature activation+1.34e-01

Pos logit contributions
ously+0.030
 words+0.028
der+0.027
anked+0.026
itted+0.026
Neg logit contributions
nam-0.024
chard-0.023
shadow-0.022
 upon-0.021
 thirsty-0.021
Token the
Token ID262
Feature activation-1.34e+00

Pos logit contributions
ously+0.004
itted+0.004
der+0.004
 words+0.004
eared+0.003
Neg logit contributions
nam-0.002
chard-0.002
shadow-0.002
 forget-0.002
 upon-0.002
Token in
Token ID287
Feature activation-1.64e+00

Pos logit contributions
chard+0.017
nam+0.016
 upon+0.015
shadow+0.014
 thirsty+0.014
Neg logit contributions
ously-0.032
 words-0.031
der-0.030
itted-0.029
eared-0.028
Token the
Token ID262
Feature activation-1.20e+00

Pos logit contributions
nam+0.024
chard+0.023
shadow+0.020
 upon+0.018
 forget+0.018
Neg logit contributions
ously-0.051
der-0.049
 words-0.048
itted-0.048
anked-0.045
Token ground
Token ID2323
Feature activation-9.64e-01

Pos logit contributions
chard+0.030
nam+0.030
shadow+0.029
 Merry+0.027
anut+0.027
Neg logit contributions
ously-0.024
 words-0.023
der-0.022
itted-0.020
anked-0.019
Token.
Token ID13
Feature activation+1.15e+00

Pos logit contributions
chard+0.020
 upon+0.019
nam+0.018
 thirsty+0.018
 unatt+0.017
Neg logit contributions
ously-0.023
 words-0.021
itted-0.021
der-0.020
eared-0.020
Token They
Token ID1119
Feature activation-3.25e-01

Pos logit contributions
ously+0.022
der+0.020
 words+0.020
itted+0.018
anked+0.017
Neg logit contributions
chard-0.030
nam-0.030
shadow-0.028
 Merry-0.027
 upon-0.027
Token wanted
Token ID2227
Feature activation+3.22e+00

Pos logit contributions
nam+0.006
chard+0.006
 upon+0.005
 forget+0.005
 forgot+0.005
Neg logit contributions
der-0.009
 words-0.008
 fault-0.008
ously-0.008
itted-0.008
Token to
Token ID284
Feature activation+1.31e+00

Pos logit contributions
der+0.070
ously+0.066
itted+0.065
anked+0.059
ic+0.057
Neg logit contributions
chard-0.077
 upon-0.072
nam-0.072
 thirsty-0.071
 cozy-0.068
Token see
Token ID766
Feature activation-6.93e-01

Pos logit contributions
ously+0.039
der+0.038
 words+0.037
itted+0.035
anked+0.034
Neg logit contributions
chard-0.021
nam-0.020
shadow-0.018
 forget-0.017
 Merry-0.016
Token what
Token ID644
Feature activation+8.66e-01

Pos logit contributions
chard+0.012
nam+0.012
|+0.011
shadow+0.011
 Merry+0.011
Neg logit contributions
 words-0.019
ously-0.019
der-0.018
anked-0.016
 enough-0.016
Token was
Token ID373
Feature activation-5.59e-01

Pos logit contributions
ously+0.018
 words+0.018
der+0.017
itted+0.016
anked+0.015
Neg logit contributions
chard-0.020
nam-0.020
shadow-0.019
 Merry-0.018
anut-0.018
Token inside
Token ID2641
Feature activation-2.28e+00

Pos logit contributions
chard+0.010
nam+0.009
shadow+0.009
 Merry+0.008
anut+0.008
Neg logit contributions
ously-0.016
 words-0.015
der-0.015
itted-0.014
anked-0.014
Token gun
Token ID2485
Feature activation+4.15e-01

Pos logit contributions
ously+0.009
itted+0.008
eared+0.008
 words+0.008
aced+0.008
Neg logit contributions
nam-0.006
chard-0.006
 forth-0.006
 forget-0.006
illas-0.005
Token.
Token ID13
Feature activation+1.92e-01

Pos logit contributions
ously+0.012
itted+0.010
 fault+0.010
 example+0.010
 words+0.010
Neg logit contributions
 forth-0.006
 under-0.005
 upon-0.005
chard-0.005
 forget-0.005
Token "
Token ID366
Feature activation+1.06e+00

Pos logit contributions
ously+0.005
 words+0.005
aced+0.004
der+0.004
itted+0.004
Neg logit contributions
chard-0.003
 upon-0.003
nam-0.003
 Merry-0.003
 thirsty-0.003
TokenI
Token ID40
Feature activation-1.20e+00

Pos logit contributions
ously+0.028
 words+0.026
der+0.026
itted+0.024
anked+0.023
Neg logit contributions
nam-0.020
chard-0.020
shadow-0.018
 forget-0.017
|-0.017
Token got
Token ID1392
Feature activation+6.73e-01

Pos logit contributions
nam+0.025
chard+0.025
shadow+0.023
 Merry+0.022
anut+0.021
Neg logit contributions
ously-0.029
 words-0.028
der-0.027
itted-0.025
anked-0.024
Token two
Token ID734
Feature activation+2.46e-01

Pos logit contributions
ously+0.015
 words+0.015
der+0.014
itted+0.013
anked+0.013
Neg logit contributions
chard-0.015
nam-0.015
shadow-0.014
 Merry-0.013
anut-0.013
Token!"
Token ID2474
Feature activation+2.91e-01

Pos logit contributions
ously+0.006
der+0.005
 words+0.005
itted+0.005
anked+0.005
Neg logit contributions
nam-0.005
chard-0.005
shadow-0.005
 forget-0.005
 thirsty-0.005
Token
Token ID198
Feature activation-9.56e-01

Pos logit contributions
ously+0.008
 words+0.007
der+0.007
itted+0.007
aced+0.007
Neg logit contributions
chard-0.005
nam-0.005
shadow-0.005
 upon-0.005
 Merry-0.005
Token
Token ID198
Feature activation-2.03e-01

Pos logit contributions
chard+0.026
nam+0.026
shadow+0.023
 forget+0.022
 unatt+0.022
Neg logit contributions
ously-0.018
 words-0.018
der-0.017
anked-0.015
itted-0.015
TokenThe
Token ID464
Feature activation-5.10e-01

Pos logit contributions
nam+0.005
chard+0.005
shadow+0.004
 Merry+0.004
 unatt+0.004
Neg logit contributions
ously-0.004
 words-0.004
der-0.004
anked-0.004
itted-0.004
Token birds
Token ID10087
Feature activation-1.11e+00

Pos logit contributions
nam+0.011
chard+0.011
 forget+0.010
 thirsty+0.010
 forth+0.009
Neg logit contributions
ously-0.013
der-0.012
anked-0.011
itted-0.011
aced-0.011
Token pigs
Token ID22333
Feature activation-2.09e+00

Pos logit contributions
chard+0.001
nam+0.001
shadow+0.001
|+0.001
anut+0.001
Neg logit contributions
 words-0.001
ously-0.001
der-0.000
itted-0.000
anked-0.000
Token roll
Token ID4836
Feature activation+1.37e-01

Pos logit contributions
nam+0.045
chard+0.044
shadow+0.042
 Merry+0.039
|+0.038
Neg logit contributions
ously-0.048
 words-0.047
der-0.046
itted-0.042
anked-0.041
Token in
Token ID287
Feature activation+2.93e-01

Pos logit contributions
ously+0.004
der+0.003
 words+0.003
itted+0.003
anked+0.003
Neg logit contributions
chard-0.003
nam-0.003
shadow-0.002
 Merry-0.002
 upon-0.002
Token the
Token ID262
Feature activation-4.28e-01

Pos logit contributions
ously+0.007
der+0.007
itted+0.007
 words+0.006
anked+0.006
Neg logit contributions
nam-0.006
chard-0.006
 upon-0.005
 Merry-0.005
 thirsty-0.005
Token mud
Token ID17492
Feature activation+2.34e-01

Pos logit contributions
chard+0.011
nam+0.011
shadow+0.011
anut+0.010
 forget+0.010
Neg logit contributions
 words-0.008
ously-0.008
der-0.008
itted-0.007
anked-0.007
Token.
Token ID13
Feature activation+3.27e-01

Pos logit contributions
ously+0.006
itted+0.005
 words+0.005
eared+0.005
ield+0.005
Neg logit contributions
chard-0.004
illas-0.004
anut-0.004
 thirsty-0.004
 unatt-0.004
Token But
Token ID887
Feature activation-1.03e+00

Pos logit contributions
der+0.005
ously+0.005
 words+0.004
itted+0.004
anked+0.004
Neg logit contributions
nam-0.010
shadow-0.010
chard-0.010
 unatt-0.009
illas-0.009
Token their
Token ID511
Feature activation+6.56e-01

Pos logit contributions
chard+0.031
shadow+0.031
nam+0.030
|+0.030
 unatt+0.029
Neg logit contributions
 words-0.013
ously-0.013
der-0.013
 enough-0.010
anked-0.010
Token favorite
Token ID4004
Feature activation+2.57e+00

Pos logit contributions
ously+0.018
der+0.017
 words+0.016
anked+0.016
itted+0.016
Neg logit contributions
nam-0.012
chard-0.011
 Merry-0.010
 forget-0.010
 thirsty-0.010
Token animals
Token ID4695
Feature activation+1.08e+00

Pos logit contributions
ously+0.055
der+0.053
itted+0.049
 words+0.049
anked+0.048
Neg logit contributions
chard-0.059
nam-0.058
 Merry-0.054
 upon-0.053
 thirsty-0.052
Token were
Token ID547
Feature activation+1.51e+00

Pos logit contributions
ously+0.025
der+0.024
 words+0.024
itted+0.023
anked+0.022
Neg logit contributions
chard-0.023
nam-0.022
 forget-0.020
shadow-0.020
 thirsty-0.020
Token she
Token ID673
Feature activation+8.29e-01

Pos logit contributions
chard+0.009
nam+0.009
shadow+0.009
anut+0.008
|+0.008
Neg logit contributions
ously-0.009
 words-0.009
der-0.009
anked-0.008
itted-0.008
Token needed
Token ID2622
Feature activation+3.05e+00

Pos logit contributions
ously+0.019
 words+0.018
der+0.018
itted+0.016
anked+0.016
Neg logit contributions
nam-0.019
chard-0.018
shadow-0.017
 Merry-0.016
|-0.016
Token to
Token ID284
Feature activation+1.05e+00

Pos logit contributions
der+0.066
itted+0.063
ously+0.062
aced+0.058
anked+0.056
Neg logit contributions
chard-0.069
nam-0.068
shadow-0.064
 Merry-0.062
 cozy-0.061
Token know
Token ID760
Feature activation+2.16e-01

Pos logit contributions
ously+0.027
 words+0.026
der+0.026
itted+0.024
anked+0.023
Neg logit contributions
chard-0.020
nam-0.020
shadow-0.018
 Merry-0.017
anut-0.017
Token how
Token ID703
Feature activation+1.00e+00

Pos logit contributions
ously+0.005
 words+0.005
der+0.005
anked+0.005
itted+0.005
Neg logit contributions
chard-0.004
nam-0.004
shadow-0.004
|-0.004
anut-0.004
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
ously+0.015
 words+0.014
der+0.013
itted+0.011
anked+0.011
Neg logit contributions
chard-0.031
nam-0.030
shadow-0.029
anut-0.028
|-0.027
Token prevent
Token ID2948
Feature activation+9.17e-01

Pos logit contributions
ously+0.025
 words+0.024
der+0.024
anked+0.022
itted+0.022
Neg logit contributions
nam-0.020
chard-0.020
shadow-0.018
 Merry-0.017
 upon-0.017
Token accidents
Token ID17390
Feature activation-3.16e-02

Pos logit contributions
ously+0.024
der+0.022
 words+0.022
itted+0.021
eared+0.020
Neg logit contributions
nam-0.018
chard-0.017
shadow-0.016
 Merry-0.015
 forget-0.015
Token from
Token ID422
Feature activation-1.85e+00

Pos logit contributions
nam+0.001
chard+0.001
shadow+0.001
 forget+0.001
 upon+0.001
Neg logit contributions
ously-0.001
 words-0.001
itted-0.001
der-0.001
eared-0.001
Token happening
Token ID5836
Feature activation-4.70e-02

Pos logit contributions
chard+0.038
nam+0.038
shadow+0.035
 Merry+0.032
anut+0.032
Neg logit contributions
ously-0.046
 words-0.044
der-0.043
itted-0.040
anked-0.039
Token.
Token ID13
Feature activation+9.78e-01

Pos logit contributions
chard+0.001
nam+0.001
shadow+0.001
 Merry+0.001
 upon+0.001
Neg logit contributions
ously-0.001
 words-0.001
der-0.001
itted-0.001
anked-0.001
Token universe
Token ID6881
Feature activation-4.23e+00

Pos logit contributions
chard+0.003
shadow+0.003
nam+0.003
anut+0.002
|+0.002
Neg logit contributions
 words-0.002
ously-0.002
der-0.002
itted-0.002
anked-0.002
Token?"
Token ID1701
Feature activation+1.56e-01

Pos logit contributions
nam+0.098
chard+0.095
anut+0.090
shadow+0.089
 Merry+0.088
Neg logit contributions
 words-0.088
der-0.085
ously-0.082
 enough-0.071
itted-0.070
Token
Token ID220
Feature activation-1.14e+00

Pos logit contributions
ously+0.004
 words+0.004
der+0.004
itted+0.004
anked+0.004
Neg logit contributions
chard-0.003
nam-0.003
shadow-0.002
 Merry-0.002
 upon-0.002
Token
Token ID198
Feature activation-6.89e-01

Pos logit contributions
nam+0.031
chard+0.031
 unatt+0.028
 forget+0.027
shadow+0.027
Neg logit contributions
ously-0.021
 words-0.020
der-0.020
itted-0.018
anked-0.018
Token
Token ID198
Feature activation-2.10e-01

Pos logit contributions
nam+0.016
chard+0.016
shadow+0.015
 forget+0.014
 unatt+0.014
Neg logit contributions
ously-0.015
 words-0.014
der-0.014
itted-0.013
anked-0.013
TokenHer
Token ID9360
Feature activation-7.05e-01

Pos logit contributions
chard+0.005
nam+0.004
shadow+0.004
 Merry+0.004
 upon+0.004
Neg logit contributions
ously-0.005
der-0.005
 words-0.005
itted-0.004
anked-0.004
Token mom
Token ID1995
Feature activation+1.40e+00

Pos logit contributions
nam+0.011
chard+0.010
shadow+0.009
 upon+0.008
 forget+0.008
Neg logit contributions
ously-0.022
der-0.020
 words-0.020
itted-0.019
anked-0.019
Token chuckled
Token ID39889
Feature activation+2.76e+00

Pos logit contributions
der+0.038
ously+0.038
 words+0.037
itted+0.036
 fault+0.035
Neg logit contributions
chard-0.022
nam-0.022
shadow-0.021
 upon-0.020
anut-0.019
Token and
Token ID290
Feature activation-1.05e+00

Pos logit contributions
 fault+0.062
ously+0.058
ield+0.058
itted+0.057
 words+0.057
Neg logit contributions
 upon-0.053
chard-0.052
nam-0.049
 thirsty-0.047
 cozy-0.044
Token said
Token ID531
Feature activation+1.77e+00

Pos logit contributions
chard+0.022
nam+0.022
shadow+0.020
 Merry+0.019
 upon+0.018
Neg logit contributions
ously-0.025
 words-0.024
der-0.024
itted-0.022
anked-0.022
Token,
Token ID11
Feature activation-1.95e+00

Pos logit contributions
ously+0.035
 words+0.035
itted+0.034
der+0.033
eared+0.033
Neg logit contributions
chard-0.039
nam-0.039
 Merry-0.037
shadow-0.036
 upon-0.035
Token her
Token ID607
Feature activation-2.99e-01

Pos logit contributions
nam+0.033
chard+0.033
shadow+0.031
|+0.030
anut+0.029
Neg logit contributions
ously-0.036
der-0.034
 words-0.034
anked-0.030
itted-0.030
Token friends
Token ID2460
Feature activation+1.07e+00

Pos logit contributions
chard+0.006
nam+0.006
shadow+0.005
 forget+0.005
 thirsty+0.005
Neg logit contributions
ously-0.008
der-0.007
 words-0.007
itted-0.007
anked-0.007
Token did
Token ID750
Feature activation+3.66e+00

Pos logit contributions
ously+0.025
 words+0.024
der+0.024
itted+0.022
anked+0.022
Neg logit contributions
chard-0.023
nam-0.022
shadow-0.021
 Merry-0.019
 forget-0.019
Token not
Token ID407
Feature activation+1.95e+00

Pos logit contributions
ously+0.098
 words+0.094
der+0.094
itted+0.088
anked+0.085
Neg logit contributions
nam-0.066
chard-0.065
shadow-0.061
 forget-0.056
 Merry-0.055
Token join
Token ID4654
Feature activation-1.74e-01

Pos logit contributions
ously+0.048
 words+0.047
der+0.045
itted+0.041
eared+0.040
Neg logit contributions
chard-0.041
nam-0.040
shadow-0.036
 Merry-0.034
anut-0.034
Token her
Token ID607
Feature activation+1.02e+00

Pos logit contributions
nam+0.004
chard+0.004
shadow+0.004
 Merry+0.003
 upon+0.003
Neg logit contributions
ously-0.004
der-0.004
 words-0.004
itted-0.003
anked-0.003
Token.
Token ID13
Feature activation+1.26e+00

Pos logit contributions
ously+0.022
der+0.019
itted+0.019
 words+0.019
anked+0.019
Neg logit contributions
nam-0.023
chard-0.023
 forget-0.022
 unatt-0.021
 thirsty-0.021
Token They
Token ID1119
Feature activation-5.08e-01

Pos logit contributions
ously+0.026
 words+0.023
der+0.023
anked+0.021
aced+0.021
Neg logit contributions
chard-0.029
nam-0.028
 Merry-0.028
 upon-0.028
 thirsty-0.027
Token watched
Token ID7342
Feature activation+4.81e-01

Pos logit contributions
chard+0.012
nam+0.012
shadow+0.011
 Merry+0.011
 unatt+0.010
Neg logit contributions
ously-0.011
 words-0.011
der-0.010
itted-0.009
anked-0.009
Token her
Token ID607
Feature activation-1.29e-01

Pos logit contributions
ously+0.011
der+0.010
 words+0.010
itted+0.010
aced+0.009
Neg logit contributions
nam-0.010
chard-0.010
shadow-0.009
 upon-0.009
 Merry-0.009
Token roll
Token ID4836
Feature activation+5.77e-01

Pos logit contributions
nam+0.003
chard+0.003
shadow+0.003
 forget+0.003
 Merry+0.002
Neg logit contributions
ously-0.003
der-0.003
 words-0.003
itted-0.003
anked-0.003
Token here
Token ID994
Feature activation-2.97e+00

Pos logit contributions
nam+0.010
chard+0.010
shadow+0.009
anut+0.008
 Merry+0.008
Neg logit contributions
ously-0.022
 words-0.022
der-0.021
anked-0.020
itted-0.020
Token,
Token ID11
Feature activation-2.99e+00

Pos logit contributions
nam+0.052
chard+0.046
shadow+0.043
anut+0.042
|+0.040
Neg logit contributions
 words-0.081
der-0.077
ously-0.077
ield-0.073
anked-0.072
Token little
Token ID1310
Feature activation-9.70e-01

Pos logit contributions
nam+0.090
chard+0.087
shadow+0.083
 unatt+0.083
|+0.082
Neg logit contributions
 words-0.047
der-0.044
ously-0.042
 enough-0.033
anked-0.033
Token one
Token ID530
Feature activation-1.80e+00

Pos logit contributions
chard+0.024
nam+0.024
shadow+0.023
 Merry+0.021
 upon+0.021
Neg logit contributions
ously-0.020
 words-0.018
der-0.018
itted-0.017
anked-0.016
Token?
Token ID30
Feature activation-7.47e-01

Pos logit contributions
nam+0.030
chard+0.028
shadow+0.027
anut+0.025
cel+0.024
Neg logit contributions
 words-0.052
ously-0.049
der-0.048
ield-0.046
 word-0.045
Tokenâ
Token ID22940
Feature activation-1.18e+00

Pos logit contributions
nam+0.021
anut+0.020
chard+0.020
 unatt+0.020
 forget+0.019
Neg logit contributions
 words-0.011
ously-0.011
der-0.010
ield-0.010
itted-0.009
Token€
Token ID26391
Feature activation-1.48e+00

Pos logit contributions
nam+0.024
chard+0.023
shadow+0.021
 upon+0.020
 Merry+0.020
Neg logit contributions
ously-0.029
 words-0.028
der-0.028
itted-0.026
anked-0.026
Token Bob
Token ID5811
Feature activation-3.52e-01

Pos logit contributions
nam+0.042
chard+0.040
anut+0.035
 unatt+0.035
shadow+0.035
Neg logit contributions
 words-0.026
ously-0.025
ield-0.023
der-0.022
itted-0.021
Token answered
Token ID9373
Feature activation+2.98e-01

Pos logit contributions
nam+0.010
chard+0.009
shadow+0.009
 unatt+0.009
|+0.008
Neg logit contributions
ously-0.006
 words-0.006
der-0.006
ield-0.005
itted-0.005
Token without
Token ID1231
Feature activation-1.34e+00

Pos logit contributions
ously+0.005
 words+0.005
der+0.004
anked+0.004
itted+0.004
Neg logit contributions
nam-0.009
chard-0.009
shadow-0.008
|-0.008
 forget-0.008
Token missing
Token ID4814
Feature activation+1.41e-01

Pos logit contributions
nam+0.029
chard+0.029
shadow+0.027
 Merry+0.025
anut+0.025
Neg logit contributions
ously-0.031
der-0.030
 words-0.030
itted-0.028
anked-0.027
Token�
Token ID95
Feature activation+3.89e-01

Pos logit contributions
ously+0.006
 words+0.006
der+0.005
itted+0.005
aced+0.005
Neg logit contributions
chard-0.005
nam-0.005
shadow-0.005
 Merry-0.004
|-0.004
Token€
Token ID26391
Feature activation+5.69e-01

Pos logit contributions
ously+0.009
 words+0.009
der+0.008
itted+0.008
anked+0.008
Neg logit contributions
chard-0.008
nam-0.008
shadow-0.008
 Merry-0.007
|-0.007
Token�
Token ID129
Feature activation+7.51e-01

Pos logit contributions
ously+0.013
 words+0.012
der+0.012
itted+0.011
ield+0.011
Neg logit contributions
nam-0.013
chard-0.013
shadow-0.011
anut-0.011
 Merry-0.011
Token�
Token ID241
Feature activation+2.62e+00

Pos logit contributions
ously+0.019
 words+0.018
der+0.018
itted+0.016
anked+0.016
Neg logit contributions
chard-0.014
shadow-0.014
nam-0.014
anut-0.013
|-0.013
TokenJohn
Token ID7554
Feature activation+1.58e+00

Pos logit contributions
ously+0.063
 words+0.061
der+0.061
itted+0.056
anked+0.055
Neg logit contributions
chard-0.052
nam-0.052
shadow-0.050
 upon-0.048
 Merry-0.047
Token,
Token ID11
Feature activation-1.07e+00

Pos logit contributions
ously+0.034
 words+0.033
der+0.032
itted+0.029
anked+0.029
Neg logit contributions
chard-0.036
nam-0.036
shadow-0.034
 Merry-0.032
 forget-0.031
Token why
Token ID1521
Feature activation-1.02e+00

Pos logit contributions
chard+0.024
nam+0.022
shadow+0.021
anut+0.021
|+0.020
Neg logit contributions
 words-0.026
ously-0.025
der-0.024
anked-0.021
ield-0.021
Token didn
Token ID1422
Feature activation+2.85e+00

Pos logit contributions
chard+0.020
nam+0.018
shadow+0.017
anut+0.017
 unatt+0.016
Neg logit contributions
 words-0.028
ously-0.027
der-0.026
anked-0.023
ield-0.023
Tokenâ
Token ID22940
Feature activation-9.20e-02

Pos logit contributions
ously+0.070
 words+0.065
der+0.065
itted+0.060
eared+0.058
Neg logit contributions
nam-0.058
chard-0.058
shadow-0.055
 Merry-0.053
|-0.052
Token€
Token ID26391
Feature activation-1.45e-01

Pos logit contributions
chard+0.001
nam+0.001
shadow+0.001
|+0.001
 Merry+0.001
Neg logit contributions
ously-0.003
 words-0.003
der-0.003
itted-0.003
anked-0.002
Token™
Token ID8151
Feature activation-8.25e-01

Pos logit contributions
nam+0.004
chard+0.004
anut+0.003
 forth+0.003
shadow+0.003
Neg logit contributions
ously-0.003
 words-0.003
der-0.003
itted-0.003
ield-0.003
Token asked
Token ID1965
Feature activation-2.51e-01

Pos logit contributions
ously+0.052
 words+0.050
der+0.050
itted+0.048
anked+0.046
Neg logit contributions
chard-0.039
nam-0.037
shadow-0.035
 forget-0.034
 thirsty-0.033
Token again
Token ID757
Feature activation+5.87e-01

Pos logit contributions
chard+0.005
nam+0.005
shadow+0.005
 Merry+0.004
 upon+0.004
Neg logit contributions
ously-0.006
der-0.006
 words-0.006
itted-0.006
anked-0.005
Token,
Token ID11
Feature activation-1.93e+00

Pos logit contributions
ously+0.011
 words+0.010
der+0.010
itted+0.009
anked+0.009
Neg logit contributions
chard-0.016
nam-0.015
shadow-0.015
 upon-0.014
 Merry-0.014
Token "
Token ID366
Feature activation+1.13e+00

Pos logit contributions
chard+0.049
nam+0.049
shadow+0.046
 Merry+0.044
anut+0.043
Neg logit contributions
ously-0.037
 words-0.036
der-0.035
itted-0.031
anked-0.031
TokenAre
Token ID8491
Feature activation-2.51e-01

Pos logit contributions
ously+0.029
 words+0.029
der+0.028
eared+0.026
itted+0.025
Neg logit contributions
nam-0.021
chard-0.019
shadow-0.018
 thirsty-0.018
|-0.018
Token you
Token ID345
Feature activation-1.27e+00

Pos logit contributions
chard+0.005
|+0.005
shadow+0.005
aj+0.005
 unatt+0.005
Neg logit contributions
der-0.005
 words-0.005
 enough-0.005
ield-0.004
ously-0.004
Token polite
Token ID23507
Feature activation-6.00e-01

Pos logit contributions
nam+0.026
chard+0.025
shadow+0.023
|+0.023
 Merry+0.022
Neg logit contributions
ously-0.030
 words-0.030
der-0.029
itted-0.027
eared-0.026
Token?"
Token ID1701
Feature activation+7.74e-01

Pos logit contributions
chard+0.009
nam+0.009
shadow+0.008
 Merry+0.008
 upon+0.007
Neg logit contributions
ously-0.018
 words-0.017
der-0.017
itted-0.016
anked-0.016
Token The
Token ID383
Feature activation-7.84e-01

Pos logit contributions
ously+0.018
 words+0.018
der+0.017
itted+0.016
anked+0.016
Neg logit contributions
chard-0.016
nam-0.016
shadow-0.015
 Merry-0.014
 forget-0.014
Token tank
Token ID6873
Feature activation+8.31e-01

Pos logit contributions
nam+0.018
chard+0.018
shadow+0.018
 Merry+0.016
anut+0.016
Neg logit contributions
 words-0.017
ously-0.016
der-0.016
itted-0.014
anked-0.014
Token still
Token ID991
Feature activation+5.64e-01

Pos logit contributions
ously+0.020
 words+0.019
der+0.018
itted+0.018
anked+0.017
Neg logit contributions
chard-0.017
nam-0.017
shadow-0.015
 forget-0.015
 upon-0.015
Token.
Token ID13
Feature activation+5.69e-01

Pos logit contributions
ously+0.008
 words+0.008
der+0.008
itted+0.007
anked+0.007
Neg logit contributions
nam-0.012
chard-0.012
shadow-0.011
 upon-0.010
 Merry-0.010
Token She
Token ID1375
Feature activation-1.75e-01

Pos logit contributions
ously+0.009
 words+0.009
der+0.009
itted+0.008
anked+0.007
Neg logit contributions
chard-0.016
nam-0.016
shadow-0.016
anut-0.015
illas-0.015
Token loved
Token ID6151
Feature activation+2.11e+00

Pos logit contributions
nam+0.004
chard+0.004
shadow+0.004
illas+0.004
riage+0.004
Neg logit contributions
 words-0.003
ously-0.003
der-0.003
itted-0.003
eared-0.003
Token to
Token ID284
Feature activation+1.27e+00

Pos logit contributions
ously+0.040
der+0.035
aced+0.031
 words+0.031
itted+0.030
Neg logit contributions
nam-0.061
chard-0.059
 Merry-0.054
 thirsty-0.054
 upon-0.052
Token play
Token ID711
Feature activation+1.04e-01

Pos logit contributions
ously+0.038
der+0.036
 words+0.036
itted+0.034
anked+0.033
Neg logit contributions
nam-0.020
chard-0.020
shadow-0.017
 forget-0.017
 Merry-0.016
Token in
Token ID287
Feature activation-1.31e+00

Pos logit contributions
ously+0.003
 words+0.003
der+0.002
itted+0.002
anked+0.002
Neg logit contributions
chard-0.002
nam-0.002
shadow-0.002
 Merry-0.002
 upon-0.002
Token her
Token ID607
Feature activation-2.00e+00

Pos logit contributions
chard+0.025
nam+0.024
shadow+0.023
 Merry+0.021
anut+0.020
Neg logit contributions
ously-0.034
 words-0.033
der-0.032
itted-0.030
anked-0.030
Token garden
Token ID11376
Feature activation-4.30e-01

Pos logit contributions
chard+0.044
nam+0.044
shadow+0.042
 Merry+0.038
|+0.037
Neg logit contributions
ously-0.045
 words-0.045
der-0.043
itted-0.041
anked-0.039
Token every
Token ID790
Feature activation-7.63e-02

Pos logit contributions
chard+0.010
nam+0.010
 upon+0.009
 thirsty+0.008
shadow+0.008
Neg logit contributions
ously-0.010
der-0.009
 words-0.009
itted-0.009
anked-0.008
Token day
Token ID1110
Feature activation-2.63e-01

Pos logit contributions
chard+0.002
nam+0.002
shadow+0.002
 Merry+0.002
 upon+0.002
Neg logit contributions
ously-0.002
der-0.002
 words-0.002
itted-0.001
anked-0.001
Token.
Token ID13
Feature activation+8.61e-01

Pos logit contributions
nam+0.006
chard+0.006
 thirsty+0.005
 unatt+0.005
 Merry+0.005
Neg logit contributions
ously-0.006
der-0.005
 words-0.005
anked-0.005
aced-0.005
Token<|endoftext|>
Token ID50256
Feature activation-2.26e+00

Pos logit contributions
ously+0.001
 words+0.001
der+0.001
itted+0.001
anked+0.001
Neg logit contributions
nam-0.001
chard-0.001
shadow-0.001
 unatt-0.001
anut-0.001
TokenOnce
Token ID7454
Feature activation-4.94e+00

Pos logit contributions
nam+0.059
chard+0.057
shadow+0.055
anut+0.050
 forget+0.050
Neg logit contributions
ously-0.044
 words-0.044
der-0.041
itted-0.037
anked-0.036
Token upon
Token ID2402
Feature activation-4.06e-01

Pos logit contributions
nam+0.068
chard+0.065
shadow+0.059
cel+0.054
 Merry+0.052
Neg logit contributions
ously-0.161
 words-0.150
der-0.146
itted-0.141
anked-0.139
Token a
Token ID257
Feature activation+1.53e-01

Pos logit contributions
chard+0.011
nam+0.011
shadow+0.011
 Merry+0.010
anut+0.010
Neg logit contributions
ously-0.007
 words-0.006
der-0.006
itted-0.006
anked-0.005
Token time
Token ID640
Feature activation-4.45e-01

Pos logit contributions
ously+0.003
der+0.003
 words+0.003
itted+0.003
anked+0.002
Neg logit contributions
chard-0.004
nam-0.004
 upon-0.004
|-0.004
shadow-0.003
Token,
Token ID11
Feature activation-3.20e+00

Pos logit contributions
chard+0.013
nam+0.013
shadow+0.013
 Merry+0.012
anut+0.012
Neg logit contributions
ously-0.007
 words-0.006
der-0.006
itted-0.005
anked-0.005
Token there
Token ID612
Feature activation+6.21e-01

Pos logit contributions
nam+0.067
shadow+0.066
chard+0.064
 unatt+0.062
cel+0.061
Neg logit contributions
ously-0.070
 words-0.068
itted-0.064
der-0.063
anked-0.062
Token was
Token ID373
Feature activation-6.23e-01

Pos logit contributions
ously+0.014
 words+0.013
der+0.013
itted+0.012
anked+0.012
Neg logit contributions
chard-0.014
nam-0.014
shadow-0.013
 Merry-0.012
 upon-0.012
Token a
Token ID257
Feature activation-1.34e+00

Pos logit contributions
chard+0.016
nam+0.016
shadow+0.014
 thirsty+0.014
 Merry+0.014
Neg logit contributions
ously-0.013
der-0.012
 words-0.012
itted-0.010
anked-0.010
Token little
Token ID1310
Feature activation+6.29e-01

Pos logit contributions
shadow+0.043
nam+0.041
|+0.040
illas+0.039
chard+0.039
Neg logit contributions
 words-0.016
der-0.013
ously-0.012
 idea-0.012
 word-0.012
Token dog
Token ID3290
Feature activation-7.46e-01

Pos logit contributions
ously+0.015
 words+0.014
der+0.014
itted+0.013
anked+0.012
Neg logit contributions
chard-0.014
nam-0.014
shadow-0.013
 Merry-0.012
 forget-0.012
Tokenmy
Token ID1820
Feature activation+6.85e-01

Pos logit contributions
chard+0.014
nam+0.014
shadow+0.013
 Merry+0.012
 upon+0.012
Neg logit contributions
ously-0.031
der-0.029
 words-0.029
itted-0.027
anked-0.027
Token,
Token ID11
Feature activation-1.80e+00

Pos logit contributions
ously+0.013
der+0.012
 words+0.012
itted+0.011
anked+0.010
Neg logit contributions
chard-0.018
nam-0.018
shadow-0.017
 forget-0.017
 upon-0.017
Token can
Token ID460
Feature activation+8.85e-01

Pos logit contributions
chard+0.044
shadow+0.042
|+0.042
anut+0.042
nam+0.041
Neg logit contributions
 words-0.038
ously-0.032
der-0.032
 enough-0.031
ield-0.030
Token I
Token ID314
Feature activation+2.68e-01

Pos logit contributions
 words+0.018
ield+0.016
anked+0.014
 advice+0.013
 ideas+0.013
Neg logit contributions
chard-0.021
anut-0.021
shadow-0.021
 unatt-0.020
|-0.019
Token add
Token ID751
Feature activation-8.63e-01

Pos logit contributions
 words+0.007
ously+0.006
itted+0.006
der+0.006
anked+0.006
Neg logit contributions
chard-0.006
nam-0.005
shadow-0.005
anut-0.005
illas-0.005
Token clothes
Token ID8242
Feature activation-5.20e+00

Pos logit contributions
chard+0.022
nam+0.021
shadow+0.020
|+0.019
 forget+0.019
Neg logit contributions
 words-0.017
ously-0.016
der-0.016
itted-0.014
anked-0.014
Token?"
Token ID1701
Feature activation+4.81e-01

Pos logit contributions
nam+0.134
chard+0.131
|+0.130
shadow+0.128
illas+0.122
Neg logit contributions
 words-0.090
der-0.086
ously-0.082
itted-0.079
 enough-0.077
Token
Token ID198
Feature activation-2.95e-01

Pos logit contributions
ously+0.010
der+0.009
 words+0.009
itted+0.008
eared+0.008
Neg logit contributions
chard-0.012
nam-0.012
 Merry-0.011
 upon-0.011
shadow-0.011
TokenHer
Token ID9360
Feature activation-3.88e-01

Pos logit contributions
nam+0.007
chard+0.007
shadow+0.007
 unatt+0.006
anut+0.006
Neg logit contributions
ously-0.006
 words-0.006
der-0.006
itted-0.005
anked-0.005
Token mother
Token ID2802
Feature activation+1.30e+00

Pos logit contributions
nam+0.006
chard+0.005
 upon+0.005
 forget+0.005
 Merry+0.004
Neg logit contributions
ously-0.012
der-0.012
anked-0.011
eared-0.011
itted-0.011
Token replied
Token ID8712
Feature activation+1.99e+00

Pos logit contributions
ously+0.032
der+0.031
 words+0.031
itted+0.029
anked+0.028
Neg logit contributions
chard-0.025
nam-0.025
shadow-0.023
 upon-0.022
 Merry-0.022
Token up
Token ID510
Feature activation-2.49e+00

Pos logit contributions
nam+0.067
chard+0.066
shadow+0.061
 Merry+0.059
 forget+0.058
Neg logit contributions
ously-0.060
 words-0.060
der-0.058
itted-0.053
anked-0.051
Token for
Token ID329
Feature activation-1.68e+00

Pos logit contributions
chard+0.063
nam+0.062
shadow+0.058
illas+0.056
|+0.055
Neg logit contributions
 words-0.048
der-0.046
ously-0.045
 enough-0.039
itted-0.039
Token me
Token ID502
Feature activation-1.62e+00

Pos logit contributions
chard+0.029
nam+0.028
shadow+0.026
 upon+0.024
|+0.024
Neg logit contributions
 words-0.046
ously-0.045
der-0.044
itted-0.041
anked-0.041
Token!"
Token ID2474
Feature activation+5.03e-01

Pos logit contributions
nam+0.039
chard+0.039
shadow+0.035
 Merry+0.034
|+0.033
Neg logit contributions
 words-0.033
ously-0.033
der-0.032
itted-0.030
eared-0.028
Token<|endoftext|>
Token ID50256
Feature activation-1.47e+00

Pos logit contributions
ously+0.012
 words+0.012
der+0.011
itted+0.010
eared+0.010
Neg logit contributions
nam-0.010
chard-0.010
shadow-0.010
 Merry-0.009
|-0.009
TokenOnce
Token ID7454
Feature activation-5.06e+00

Pos logit contributions
nam+0.039
chard+0.038
shadow+0.036
anut+0.033
 forget+0.033
Neg logit contributions
 words-0.028
ously-0.028
der-0.026
itted-0.024
eared-0.023
Token upon
Token ID2402
Feature activation-2.86e-01

Pos logit contributions
nam+0.059
chard+0.059
shadow+0.051
cel+0.048
illas+0.045
Neg logit contributions
ously-0.173
 words-0.161
der-0.159
itted-0.153
anked-0.150
Token a
Token ID257
Feature activation+6.24e-01

Pos logit contributions
chard+0.008
nam+0.008
shadow+0.007
 Merry+0.007
 upon+0.007
Neg logit contributions
ously-0.005
 words-0.005
der-0.005
itted-0.004
anked-0.004
Token time
Token ID640
Feature activation-5.62e-01

Pos logit contributions
ously+0.013
der+0.012
 words+0.012
itted+0.010
anked+0.010
Neg logit contributions
chard-0.015
nam-0.014
|-0.014
 upon-0.014
 forget-0.014
Token,
Token ID11
Feature activation-2.93e+00

Pos logit contributions
chard+0.017
nam+0.017
shadow+0.016
 Merry+0.015
 upon+0.015
Neg logit contributions
ously-0.009
 words-0.008
der-0.008
itted-0.007
anked-0.007
Token there
Token ID612
Feature activation+7.51e-01

Pos logit contributions
shadow+0.059
nam+0.059
chard+0.056
anut+0.056
aj+0.056
Neg logit contributions
ously-0.064
 words-0.062
der-0.058
itted-0.058
 enough-0.056
Token"
Token ID1
Feature activation+1.89e+00

Pos logit contributions
nam+0.031
chard+0.031
 unatt+0.028
 Merry+0.027
|+0.027
Neg logit contributions
ously-0.019
 words-0.018
der-0.016
anked-0.016
eared-0.015
TokenDid
Token ID11633
Feature activation-2.47e-01

Pos logit contributions
ously+0.045
 words+0.044
der+0.042
itted+0.039
anked+0.038
Neg logit contributions
chard-0.040
nam-0.040
shadow-0.037
anut-0.034
 forget-0.034
Token you
Token ID345
Feature activation-1.34e+00

Pos logit contributions
chard+0.005
shadow+0.005
nam+0.005
 unatt+0.004
|+0.004
Neg logit contributions
 words-0.006
der-0.006
ously-0.005
ield-0.005
 enough-0.005
Token take
Token ID1011
Feature activation+2.25e-01

Pos logit contributions
nam+0.030
chard+0.029
 Merry+0.027
shadow+0.027
aj+0.025
Neg logit contributions
 words-0.030
ously-0.028
der-0.026
 idea-0.025
 enough-0.025
Token my
Token ID616
Feature activation-1.88e-01

Pos logit contributions
ously+0.007
der+0.006
 words+0.006
itted+0.006
anked+0.006
Neg logit contributions
nam-0.004
chard-0.004
shadow-0.003
 Merry-0.003
anut-0.003
Token axe
Token ID23272
Feature activation-4.50e+00

Pos logit contributions
chard+0.005
nam+0.005
shadow+0.004
|+0.004
 Merry+0.004
Neg logit contributions
ously-0.004
 words-0.004
der-0.004
itted-0.003
anked-0.003
Token?
Token ID30
Feature activation-7.72e-01

Pos logit contributions
nam+0.107
chard+0.101
 Merry+0.099
shadow+0.095
|+0.093
Neg logit contributions
 words-0.091
ously-0.091
der-0.089
 enough-0.080
anked-0.080
Token Did
Token ID7731
Feature activation-6.09e-01

Pos logit contributions
chard+0.022
nam+0.022
shadow+0.021
anut+0.021
 unatt+0.021
Neg logit contributions
 words-0.012
der-0.012
ously-0.011
itted-0.010
ield-0.010
Token you
Token ID345
Feature activation-1.65e+00

Pos logit contributions
chard+0.011
nam+0.010
shadow+0.010
|+0.009
 unatt+0.009
Neg logit contributions
 words-0.017
der-0.016
ously-0.016
anked-0.014
ield-0.014
Token hurt
Token ID5938
Feature activation-2.08e+00

Pos logit contributions
nam+0.036
chard+0.033
 Merry+0.033
shadow+0.030
aj+0.029
Neg logit contributions
 words-0.037
ously-0.036
der-0.034
ield-0.032
 idea-0.031
Token yourself
Token ID3511
Feature activation-3.64e+00

Pos logit contributions
nam+0.040
chard+0.040
shadow+0.038
|+0.036
 forget+0.036
Neg logit contributions
 words-0.052
ously-0.050
der-0.049
anked-0.045
eared-0.044
Token on
Token ID319
Feature activation+4.96e-01

Pos logit contributions
ously+0.001
 words+0.001
der+0.001
anked+0.001
itted+0.001
Neg logit contributions
chard-0.000
nam-0.000
shadow-0.000
 upon-0.000
 Merry-0.000
Token her
Token ID607
Feature activation-5.54e-01

Pos logit contributions
ously+0.012
der+0.011
 words+0.011
itted+0.011
eared+0.010
Neg logit contributions
nam-0.010
chard-0.010
 Merry-0.009
shadow-0.009
 forget-0.009
Token face
Token ID1986
Feature activation+3.86e-01

Pos logit contributions
chard+0.015
nam+0.015
shadow+0.014
 Merry+0.013
anut+0.013
Neg logit contributions
ously-0.010
 words-0.010
der-0.009
itted-0.008
anked-0.008
Token.
Token ID13
Feature activation-9.10e-02

Pos logit contributions
ously+0.008
 words+0.008
der+0.007
itted+0.007
anked+0.007
Neg logit contributions
chard-0.009
nam-0.009
shadow-0.008
 unatt-0.008
 Merry-0.008
Token<|endoftext|>
Token ID50256
Feature activation-1.61e+00

Pos logit contributions
chard+0.003
nam+0.003
illas+0.003
 unatt+0.002
anut+0.002
Neg logit contributions
der-0.001
ously-0.001
 words-0.001
itted-0.001
anked-0.001
TokenOnce
Token ID7454
Feature activation-5.39e+00

Pos logit contributions
nam+0.043
chard+0.042
shadow+0.040
anut+0.038
 forget+0.037
Neg logit contributions
ously-0.030
 words-0.030
der-0.027
itted-0.026
eared-0.025
Token upon
Token ID2402
Feature activation-2.66e-01

Pos logit contributions
nam+0.063
chard+0.063
cel+0.056
shadow+0.055
illas+0.052
Neg logit contributions
ously-0.183
der-0.169
 words-0.168
itted-0.163
anked-0.156
Token a
Token ID257
Feature activation+7.39e-01

Pos logit contributions
chard+0.007
nam+0.007
shadow+0.007
 Merry+0.006
 upon+0.006
Neg logit contributions
ously-0.005
 words-0.004
der-0.004
itted-0.004
anked-0.004
Token time
Token ID640
Feature activation-7.53e-01

Pos logit contributions
ously+0.014
der+0.014
 words+0.013
itted+0.012
anked+0.012
Neg logit contributions
chard-0.018
nam-0.018
 upon-0.017
|-0.017
 forget-0.017
Token,
Token ID11
Feature activation-3.84e+00

Pos logit contributions
chard+0.023
nam+0.023
shadow+0.022
 Merry+0.021
anut+0.021
Neg logit contributions
ously-0.010
 words-0.010
der-0.009
itted-0.008
anked-0.008
Token there
Token ID612
Feature activation+7.24e-01

Pos logit contributions
nam+0.072
 Merry+0.070
chard+0.070
 forget+0.068
shadow+0.066
Neg logit contributions
ously-0.091
der-0.084
itted-0.084
 words-0.084
aced-0.079
Token all
Token ID477
Feature activation-5.86e-01

Pos logit contributions
ously+0.000
itted+0.000
anked+0.000
ield+0.000
 words+0.000
Neg logit contributions
chard-0.000
nam-0.000
 upon-0.000
 forget-0.000
 forth-0.000
Token day
Token ID1110
Feature activation-6.44e-01

Pos logit contributions
chard+0.014
nam+0.014
shadow+0.013
 Merry+0.012
 upon+0.012
Neg logit contributions
ously-0.012
 words-0.012
der-0.012
itted-0.011
anked-0.010
Token long
Token ID890
Feature activation+8.47e-01

Pos logit contributions
chard+0.013
nam+0.013
 thirsty+0.012
 forget+0.012
 upon+0.011
Neg logit contributions
ously-0.016
 words-0.014
der-0.014
anked-0.014
eared-0.014
Token.
Token ID13
Feature activation+2.30e-01

Pos logit contributions
itted+0.020
ously+0.020
ield+0.019
 words+0.018
der+0.018
Neg logit contributions
chard-0.015
nam-0.015
 unatt-0.014
illas-0.014
 under-0.013
Token<|endoftext|>
Token ID50256
Feature activation-2.29e+00

Pos logit contributions
ously+0.004
 words+0.004
der+0.004
itted+0.004
anked+0.004
Neg logit contributions
nam-0.006
chard-0.006
shadow-0.005
 unatt-0.005
anut-0.005
TokenOnce
Token ID7454
Feature activation-4.88e+00

Pos logit contributions
nam+0.061
chard+0.058
shadow+0.056
anut+0.052
 forget+0.052
Neg logit contributions
 words-0.044
ously-0.043
der-0.040
itted-0.037
eared-0.036
Token upon
Token ID2402
Feature activation-5.58e-01

Pos logit contributions
nam+0.078
chard+0.073
shadow+0.068
cel+0.064
 unatt+0.062
Neg logit contributions
ously-0.149
 words-0.140
der-0.134
itted-0.130
anked-0.127
Token a
Token ID257
Feature activation-1.39e-03

Pos logit contributions
chard+0.016
nam+0.016
shadow+0.015
 Merry+0.014
anut+0.014
Neg logit contributions
ously-0.009
 words-0.009
der-0.008
itted-0.007
anked-0.007
Token time
Token ID640
Feature activation-5.42e-01

Pos logit contributions
chard+0.000
nam+0.000
shadow+0.000
 upon+0.000
|+0.000
Neg logit contributions
ously-0.000
der-0.000
 words-0.000
itted-0.000
anked-0.000
Token,
Token ID11
Feature activation-3.47e+00

Pos logit contributions
chard+0.016
nam+0.016
shadow+0.016
 Merry+0.015
anut+0.015
Neg logit contributions
ously-0.008
 words-0.007
der-0.007
itted-0.006
anked-0.006
Token there
Token ID612
Feature activation+5.70e-01

Pos logit contributions
nam+0.076
shadow+0.074
chard+0.072
cel+0.070
 Merry+0.070
Neg logit contributions
ously-0.072
 words-0.072
itted-0.068
der-0.066
anked-0.065