TOP ACTIVATIONS
MAX = 0.141

stamps and handed one to Alison . Alison was very happy
together . They shared a common boundary and had their own
. And from that day forward , the bunny and the
â So Beat rice tried ! She started
. They all shared a common goal
And so Beat rice had lots of fun
to run around and chase Alison . He also liked to
She was so cute ! Beat rice had five fingers on
From that day forward , Daisy stri ved to
appeared from the ground . Lori was so excited ! She
that loved to play . Alison introduced Brown ie to all
The stranger waved goodbye and Paige went home happy . <|endoftext|>
adventure . <|endoftext|> John and Tara were playing in the park
nap . <|endoftext|> Eric and Alison liked to play together .
separate the stamps and give Alison one . "
to leave the park . Lara 's mom said , "
Today was no different . Lori grabbed her sp ade and
finger in the air . Beat rice did as her mom
out of the house . Carly stepped carefully , because the
They seemed so happy . Carly felt so good running about

MIN ACTIVATIONS
MIN = -0.140

this flower was even l itt ler than the da isy
They get a m itt and a spat ula .
, there was a l itt tle girl . She had
sund ae with lots of topp ings . They
She told him it was seaw eed . Despite the smell
and get it , k itt ies !" she called ,
called Yet i was also living in the forest .
. She used a m itt to take out the pan
sad because he had been living alone in a cabin in
elly anymore and the l itt lest ones can go up
out the mouse had been living in the book case and
sometimes , even the l itt lest things can lead to
loud oct opus named O ll ie who liked to sing
not toys . They are living beings that have feelings .
a witch who had been living there for many years .
three , who had been living in the jail for weeks
sund ae with lots of topp ings . Tim my was
a squirrel . It was living in the tree . It
unk ! He 's been living around here for a while
little pig gy named O ll ie . O ll ie

INTERVAL 0.071 - 0.141
CONTAINS 0.115%

't believe that she had wrapped the blocks so perfectly !
have something much more valuable than that ." Sarah
worry . We believed the old man 's words and we
would never forget it no matter where she went .
freezing everything in sight . Everywhere he went , people were

INTERVAL 0.001 - 0.071
CONTAINS 49.656%

were no longer restless and happy to have gone on their
he wasn 't frightened . He approached the dog slowly and
love and kindness . I am happy to have you ,
on its back . Tom said , "
find the l oll ipop . After a while , the

INTERVAL -0.069 - 0.001
CONTAINS 50.017%

he yelled : " Hey , you ! Get out of
surprise for you , princess !" The princess was curious and
the purple tent again ! <|endoftext|> One day , a little
have a ball and a k ite . They run and
then calmly walked out of the room .

INTERVAL -0.140 - -0.069
CONTAINS 0.212%

, there was a family who lived in a big country
there was a little rabbit who wanted to find something delicious
and said , " Of course , I 'll bring it
a j olly old woman who lived in a small village
and took her to the living room . She found a
Token stamps
Token ID25560
Feature activation+2.18e-02

Pos logit contributions
 like+0.012
 soon+0.012
 forward+0.012
 than+0.011
 surely+0.011
Neg logit contributions
 spoon-0.009
 carrots-0.008
 peas-0.008
berries-0.008
 lettuce-0.008
Token and
Token ID290
Feature activation+4.41e-03

Pos logit contributions
 like+0.018
 soon+0.017
 forward+0.016
te+0.015
 surely+0.015
Neg logit contributions
 spoon-0.053
 carrots-0.051
 peas-0.050
 lettuce-0.050
berries-0.049
Token handed
Token ID10158
Feature activation+6.44e-02

Pos logit contributions
 like+0.005
 soon+0.005
 forward+0.005
 than+0.005
 surely+0.005
Neg logit contributions
 spoon-0.009
 carrots-0.009
 peas-0.009
berries-0.009
 lettuce-0.009
Token one
Token ID530
Feature activation+2.11e-02

Pos logit contributions
 like+0.076
 soon+0.073
 than+0.071
ually+0.071
 surely+0.069
Neg logit contributions
 spoon-0.132
 carrots-0.131
 peas-0.125
 lettuce-0.124
 knives-0.124
Token to
Token ID284
Feature activation+1.45e-02

Pos logit contributions
 like+0.023
 soon+0.022
 forward+0.021
 than+0.021
 surely+0.020
Neg logit contributions
 spoon-0.046
 peas-0.045
 carrots-0.045
 knives-0.044
berries-0.044
Token Alison
Token ID33629
Feature activation+1.29e-01

Pos logit contributions
 like+0.016
 soon+0.015
 than+0.015
ually+0.014
 forward+0.014
Neg logit contributions
 spoon-0.031
 carrots-0.030
 peas-0.029
 knives-0.029
berries-0.029
Token.
Token ID13
Feature activation-3.09e-03

Pos logit contributions
 like+0.143
 soon+0.142
 forward+0.125
 when+0.122
 after+0.122
Neg logit contributions
 spoon-0.284
 carrots-0.279
 peas-0.277
berries-0.274
 lettuce-0.272
Token Alison
Token ID33629
Feature activation+1.09e-01

Pos logit contributions
 spoon+0.005
 carrots+0.005
berries+0.005
 lettuce+0.005
 peas+0.005
Neg logit contributions
 like-0.004
 soon-0.004
 than-0.004
ually-0.004
 surely-0.004
Token was
Token ID373
Feature activation+5.78e-03

Pos logit contributions
 soon+0.134
 like+0.130
te+0.115
 surely+0.114
 than+0.112
Neg logit contributions
 carrots-0.224
 spoon-0.223
 peas-0.218
berries-0.217
 lettuce-0.216
Token very
Token ID845
Feature activation-2.24e-02

Pos logit contributions
away+0.007
ually+0.006
bye+0.006
bek+0.006
 than+0.006
Neg logit contributions
 spoon-0.012
 carrots-0.011
 bowl-0.011
 peas-0.011
 lettuce-0.011
Token happy
Token ID3772
Feature activation-1.06e-02

Pos logit contributions
 spoon+0.047
 carrots+0.043
 peas+0.042
 lettuce+0.042
 bowl+0.041
Neg logit contributions
ually-0.024
 like-0.023
ained-0.023
 soon-0.022
away-0.022
Token together
Token ID1978
Feature activation-4.91e-02

Pos logit contributions
 spoon+0.029
 carrots+0.029
 peas+0.028
 knives+0.027
berries+0.027
Neg logit contributions
 like-0.012
 soon-0.011
 surely-0.011
ually-0.011
 than-0.011
Token.
Token ID13
Feature activation+4.36e-03

Pos logit contributions
 spoon+0.111
 carrots+0.108
 peas+0.107
 lettuce+0.107
berries+0.107
Neg logit contributions
 like-0.058
 soon-0.056
 than-0.050
 when-0.047
 after-0.046
Token They
Token ID1119
Feature activation+5.32e-03

Pos logit contributions
 surely+0.006
vy+0.006
ook+0.006
 soon+0.006
 like+0.006
Neg logit contributions
 carrots-0.007
 knives-0.007
berries-0.007
 spoon-0.007
 couch-0.007
Token shared
Token ID4888
Feature activation+1.14e-02

Pos logit contributions
 like+0.007
 soon+0.007
 forward+0.006
 than+0.006
 surely+0.006
Neg logit contributions
 spoon-0.011
 carrots-0.010
 peas-0.010
berries-0.010
 lettuce-0.010
Token a
Token ID257
Feature activation+4.26e-02

Pos logit contributions
 like+0.014
 soon+0.014
 surely+0.014
 forward+0.013
ually+0.013
Neg logit contributions
 spoon-0.023
 carrots-0.023
 peas-0.022
 lettuce-0.021
berries-0.021
Token common
Token ID2219
Feature activation+1.27e-01

Pos logit contributions
 like+0.069
 soon+0.067
 forward+0.064
 than+0.064
te+0.063
Neg logit contributions
 spoon-0.070
 carrots-0.068
 peas-0.067
berries-0.066
 knives-0.066
Token boundary
Token ID18645
Feature activation+2.09e-02

Pos logit contributions
 like+0.191
 soon+0.182
 than+0.181
te+0.172
 forward+0.167
Neg logit contributions
 spoon-0.229
 carrots-0.223
berries-0.220
 peas-0.220
 knives-0.218
Token and
Token ID290
Feature activation+8.35e-03

Pos logit contributions
 like+0.020
 soon+0.019
 than+0.018
 forward+0.018
 surely+0.017
Neg logit contributions
 spoon-0.049
 carrots-0.047
 peas-0.047
berries-0.046
 lettuce-0.046
Token had
Token ID550
Feature activation+1.45e-02

Pos logit contributions
 forward+0.010
 like+0.009
 surely+0.009
 than+0.009
 soon+0.009
Neg logit contributions
 spoon-0.017
 carrots-0.016
 peas-0.016
 knives-0.016
 lettuce-0.015
Token their
Token ID511
Feature activation-1.28e-02

Pos logit contributions
 like+0.015
 soon+0.014
 than+0.013
 forward+0.013
 surely+0.012
Neg logit contributions
 spoon-0.033
 carrots-0.032
 peas-0.032
berries-0.032
 lettuce-0.031
Token own
Token ID898
Feature activation-8.77e-03

Pos logit contributions
berries+0.024
 spoon+0.024
 carrots+0.024
 peas+0.024
 lettuce+0.024
Neg logit contributions
 like-0.018
 soon-0.017
 than-0.016
 forward-0.016
 surely-0.016
Token.
Token ID13
Feature activation+1.25e-02

Pos logit contributions
 like+0.003
 soon+0.003
 when+0.003
 than+0.003
 after+0.003
Neg logit contributions
 spoon-0.007
 carrots-0.007
 peas-0.007
 lettuce-0.007
 knives-0.007
Token And
Token ID843
Feature activation+1.74e-02

Pos logit contributions
 like+0.013
 soon+0.013
 forward+0.011
 when+0.011
te+0.011
Neg logit contributions
 spoon-0.029
 carrots-0.028
 peas-0.028
 knives-0.028
 lettuce-0.027
Token from
Token ID422
Feature activation+3.85e-02

Pos logit contributions
 like+0.016
 forward+0.016
 soon+0.015
te+0.015
 surely+0.015
Neg logit contributions
 spoon-0.038
 carrots-0.038
 peas-0.037
 lettuce-0.036
 knives-0.036
Token that
Token ID326
Feature activation+1.27e-02

Pos logit contributions
 like+0.037
 soon+0.035
 forward+0.034
 than+0.033
 surely+0.033
Neg logit contributions
 spoon-0.088
 carrots-0.086
 peas-0.084
 lettuce-0.083
 knives-0.082
Token day
Token ID1110
Feature activation+2.14e-02

Pos logit contributions
 like+0.012
 soon+0.012
 forward+0.011
 than+0.010
 surely+0.010
Neg logit contributions
 spoon-0.030
 carrots-0.030
 peas-0.030
berries-0.029
 lettuce-0.029
Token forward
Token ID2651
Feature activation+1.26e-01

Pos logit contributions
 like+0.019
 soon+0.018
 than+0.017
ually+0.017
 surely+0.016
Neg logit contributions
 spoon-0.050
 carrots-0.048
 peas-0.047
 knives-0.047
berries-0.047
Token,
Token ID11
Feature activation+1.73e-02

Pos logit contributions
 like+0.099
 soon+0.093
 than+0.080
 after+0.075
 forward+0.074
Neg logit contributions
 spoon-0.331
 carrots-0.322
 peas-0.319
 lettuce-0.317
 knives-0.311
Token the
Token ID262
Feature activation+1.80e-02

Pos logit contributions
 like+0.018
 soon+0.018
 forward+0.015
 than+0.015
 after+0.015
Neg logit contributions
 spoon-0.041
 carrots-0.039
 peas-0.039
berries-0.039
 lettuce-0.038
Token bunny
Token ID44915
Feature activation+3.85e-03

Pos logit contributions
 like+0.028
 soon+0.028
 true+0.027
 when+0.025
 surely+0.025
Neg logit contributions
berries-0.032
 peas-0.030
 knives-0.030
 spoon-0.029
 sleeves-0.028
Token and
Token ID290
Feature activation+3.72e-03

Pos logit contributions
 like+0.004
 soon+0.004
 than+0.003
 when+0.003
 surely+0.003
Neg logit contributions
 spoon-0.009
 carrots-0.009
 knives-0.009
 peas-0.009
berries-0.009
Token the
Token ID262
Feature activation+1.06e-02

Pos logit contributions
 like+0.003
 soon+0.003
 forward+0.003
 than+0.003
 surely+0.003
Neg logit contributions
 spoon-0.009
 carrots-0.009
 peas-0.009
berries-0.009
 lettuce-0.008
Tokenâ
Token ID22940
Feature activation+2.80e-02

Pos logit contributions
 like+0.016
 soon+0.015
 forward+0.014
 than+0.013
 after+0.013
Neg logit contributions
 spoon-0.030
 carrots-0.029
 peas-0.029
 lettuce-0.028
 knives-0.028
Token
Token ID26391
Feature activation+6.14e-02

Pos logit contributions
 soon+0.037
ook+0.037
bel+0.036
 theirs+0.036
 known+0.034
Neg logit contributions
 spoon-0.044
 sleeves-0.042
berries-0.041
opus-0.040
 carrots-0.040
Token
Token ID198
Feature activation+1.45e-02

Pos logit contributions
 like+0.088
 soon+0.085
 surely+0.083
 forward+0.083
 than+0.080
Neg logit contributions
 spoon-0.109
 carrots-0.104
berries-0.102
 knives-0.100
 peas-0.100
Token
Token ID198
Feature activation+8.74e-03

Pos logit contributions
 like+0.016
 soon+0.016
 forward+0.014
te+0.014
 than+0.013
Neg logit contributions
 spoon-0.033
 carrots-0.032
 peas-0.032
 knives-0.031
 lettuce-0.031
TokenSo
Token ID2396
Feature activation-8.51e-04

Pos logit contributions
 like+0.010
 soon+0.010
 forward+0.009
 than+0.009
 surely+0.009
Neg logit contributions
 spoon-0.019
 carrots-0.018
 peas-0.018
berries-0.018
 lettuce-0.018
Token Beat
Token ID12568
Feature activation+1.25e-01

Pos logit contributions
 spoon+0.002
 carrots+0.002
 berries+0.001
berries+0.001
 peas+0.001
Neg logit contributions
ook-0.001
 like-0.001
 than-0.001
organ-0.001
ually-0.001
Tokenrice
Token ID20970
Feature activation+6.60e-02

Pos logit contributions
ores+0.172
ained+0.172
 like+0.163
 surely+0.163
ook+0.158
Neg logit contributions
 sleeves-0.176
 spoon-0.172
opus-0.172
 cries-0.172
 cheeks-0.171
Token tried
Token ID3088
Feature activation-3.09e-02

Pos logit contributions
 soon+0.074
 like+0.074
 surely+0.062
 than+0.062
 after+0.061
Neg logit contributions
 carrots-0.144
 spoon-0.144
 peas-0.143
berries-0.141
 lettuce-0.140
Token!
Token ID0
Feature activation+1.49e-02

Pos logit contributions
 spoon+0.075
 peas+0.074
 carrots+0.073
berries+0.073
 lettuce+0.072
Neg logit contributions
 like-0.027
 soon-0.027
 forward-0.023
 than-0.023
 over-0.022
Token She
Token ID1375
Feature activation+1.38e-02

Pos logit contributions
 like+0.018
 soon+0.018
 than+0.016
 forward+0.016
 surely+0.016
Neg logit contributions
 spoon-0.031
 carrots-0.030
 peas-0.029
berries-0.029
 lettuce-0.029
Token started
Token ID2067
Feature activation+1.17e-02

Pos logit contributions
 soon+0.017
 like+0.016
 than+0.013
 surely+0.013
 over+0.013
Neg logit contributions
berries-0.031
 peas-0.030
 knives-0.030
 carrots-0.030
 lettuce-0.030
Token.
Token ID13
Feature activation-1.29e-03

Pos logit contributions
 spoon+0.042
 carrots+0.041
 peas+0.041
 lettuce+0.040
 knives+0.040
Neg logit contributions
 like-0.020
 soon-0.019
 than-0.017
 forward-0.017
 after-0.016
Token They
Token ID1119
Feature activation+1.78e-03

Pos logit contributions
 spoon+0.003
 carrots+0.003
 peas+0.003
 knives+0.003
 lettuce+0.003
Neg logit contributions
 like-0.001
 soon-0.001
 forward-0.001
 than-0.001
 surely-0.001
Token all
Token ID477
Feature activation-2.92e-02

Pos logit contributions
 like+0.002
 soon+0.002
 forward+0.002
 than+0.002
 surely+0.002
Neg logit contributions
 spoon-0.004
 carrots-0.003
 peas-0.003
berries-0.003
 lettuce-0.003
Token shared
Token ID4888
Feature activation+1.34e-02

Pos logit contributions
 spoon+0.059
 carrots+0.057
 peas+0.056
berries+0.055
 lettuce+0.055
Neg logit contributions
 like-0.036
 soon-0.035
 forward-0.033
 than-0.033
 surely-0.032
Token a
Token ID257
Feature activation+4.00e-02

Pos logit contributions
 like+0.014
 soon+0.014
 surely+0.013
 forward+0.013
ually+0.013
Neg logit contributions
 spoon-0.029
 carrots-0.029
 peas-0.028
berries-0.027
 lettuce-0.027
Token common
Token ID2219
Feature activation+1.24e-01

Pos logit contributions
 like+0.061
 soon+0.057
 forward+0.056
 than+0.055
te+0.055
Neg logit contributions
 spoon-0.068
 knives-0.068
 peas-0.068
 carrots-0.067
berries-0.066
Token goal
Token ID3061
Feature activation+1.30e-02

Pos logit contributions
 like+0.190
 than+0.182
 soon+0.174
te+0.169
 when+0.159
Neg logit contributions
 spoon-0.224
 knives-0.218
 peas-0.218
 carrots-0.217
berries-0.215
Token
Token ID6184
Feature activation+2.13e-02

Pos logit contributions
 like+0.013
 soon+0.013
 than+0.012
 forward+0.011
te+0.011
Neg logit contributions
 spoon-0.030
 carrots-0.029
 peas-0.029
 lettuce-0.028
 knives-0.028
Token
Token ID95
Feature activation+1.69e-02

Pos logit contributions
 soon+0.030
 forward+0.027
 from+0.027
 known+0.027
 than+0.026
Neg logit contributions
 spoon-0.036
berries-0.032
opus-0.032
 carrots-0.032
 peas-0.032
Token
Token ID26391
Feature activation+2.42e-02

Pos logit contributions
 soon+0.026
bel+0.024
 forward+0.024
 though+0.024
 from+0.024
Neg logit contributions
 spoon-0.023
 couch-0.022
pill-0.022
 bowl-0.022
opus-0.021
Token
Token ID447
Feature activation-1.62e-02

Pos logit contributions
 like+0.039
 surely+0.037
 forward+0.037
 soon+0.036
year+0.036
Neg logit contributions
 spoon-0.037
berries-0.036
 knives-0.034
 carrots-0.034
 bowl-0.033
Token
Token ID26391
Feature activation+5.86e-02

Pos logit contributions
 soon+0.033
 like+0.032
 than+0.031
 forward+0.029
ends+0.029
Neg logit contributions
 spoon-0.056
 carrots-0.051
berries-0.051
 onion-0.050
 peas-0.050
Token
Token ID198
Feature activation+9.90e-03

Pos logit contributions
 like+0.079
 soon+0.078
 forward+0.073
 than+0.072
 surely+0.071
Neg logit contributions
 spoon-0.113
 carrots-0.110
 peas-0.107
berries-0.105
 lettuce-0.105
Token
Token ID198
Feature activation+7.47e-03

Pos logit contributions
 soon+0.009
 like+0.008
te+0.007
 forward+0.007
 over+0.007
Neg logit contributions
 spoon-0.025
 peas-0.024
 knives-0.024
 carrots-0.024
 lettuce-0.023
TokenAnd
Token ID1870
Feature activation+2.76e-02

Pos logit contributions
 soon+0.008
 like+0.008
 forward+0.007
te+0.007
 than+0.007
Neg logit contributions
 spoon-0.017
 carrots-0.017
 peas-0.017
 lettuce-0.016
berries-0.016
Token so
Token ID523
Feature activation+2.95e-02

Pos logit contributions
organ+0.027
 like+0.027
ook+0.027
 than+0.026
amed+0.026
Neg logit contributions
 spoon-0.056
 carrots-0.055
 peas-0.053
 berries-0.052
 knives-0.051
Token Beat
Token ID12568
Feature activation+1.24e-01

Pos logit contributions
 like+0.034
 than+0.031
 soon+0.031
 forward+0.031
 surely+0.031
Neg logit contributions
 spoon-0.061
 carrots-0.059
 peas-0.058
berries-0.057
 knives-0.056
Tokenrice
Token ID20970
Feature activation+6.77e-02

Pos logit contributions
ores+0.157
ained+0.151
 like+0.146
 surely+0.144
organ+0.143
Neg logit contributions
 sleeves-0.185
opus-0.183
 cheeks-0.180
 spoon-0.180
 carrots-0.177
Token had
Token ID550
Feature activation+2.21e-02

Pos logit contributions
 like+0.074
 soon+0.072
 forward+0.068
 than+0.067
 surely+0.065
Neg logit contributions
 spoon-0.146
 carrots-0.142
 peas-0.139
berries-0.137
 lettuce-0.137
Token lots
Token ID6041
Feature activation-2.23e-02

Pos logit contributions
 like+0.020
 soon+0.019
 than+0.015
 forward+0.015
 known+0.015
Neg logit contributions
 spoon-0.054
berries-0.054
 peas-0.053
 carrots-0.052
 knives-0.052
Token of
Token ID286
Feature activation-6.84e-03

Pos logit contributions
 spoon+0.055
 peas+0.054
berries+0.053
 lettuce+0.052
 knives+0.052
Neg logit contributions
 like-0.022
 soon-0.018
 than-0.018
 after-0.017
 from-0.016
Token fun
Token ID1257
Feature activation+2.32e-02

Pos logit contributions
berries+0.015
opus+0.015
 peas+0.014
actus+0.013
 spoon+0.013
Neg logit contributions
 like-0.009
 soon-0.007
 than-0.007
 as-0.007
 known-0.007
Token to
Token ID284
Feature activation+1.08e-02

Pos logit contributions
 spoon+0.029
berries+0.027
 lettuce+0.027
 sleeves+0.027
 peas+0.026
Neg logit contributions
 like-0.011
 soon-0.011
 when-0.010
 than-0.010
 after-0.010
Token run
Token ID1057
Feature activation-5.38e-03

Pos logit contributions
 like+0.013
 soon+0.012
 when+0.010
 over+0.010
 as+0.010
Neg logit contributions
berries-0.023
 spoon-0.023
 knives-0.023
 peas-0.022
 carrots-0.022
Token around
Token ID1088
Feature activation+3.93e-02

Pos logit contributions
 spoon+0.012
 carrots+0.012
 peas+0.012
berries+0.012
 lettuce+0.011
Neg logit contributions
 soon-0.005
 like-0.005
 surely-0.005
ually-0.005
te-0.004
Token and
Token ID290
Feature activation+1.60e-02

Pos logit contributions
 like+0.030
 soon+0.029
 than+0.025
 after+0.024
 forward+0.023
Neg logit contributions
 spoon-0.101
 carrots-0.098
 peas-0.097
 lettuce-0.097
berries-0.097
Token chase
Token ID15505
Feature activation-3.36e-02

Pos logit contributions
 like+0.021
 soon+0.021
 after+0.018
 over+0.018
 when+0.017
Neg logit contributions
 spoon-0.034
berries-0.033
 knives-0.033
 peas-0.033
 carrots-0.032
Token Alison
Token ID33629
Feature activation+1.24e-01

Pos logit contributions
 spoon+0.066
berries+0.060
 peas+0.059
 knives+0.059
 lettuce+0.059
Neg logit contributions
 like-0.050
 soon-0.046
 after-0.046
 than-0.045
 from-0.043
Token.
Token ID13
Feature activation-4.21e-04

Pos logit contributions
 like+0.133
 soon+0.132
 when+0.123
 forward+0.118
 after+0.117
Neg logit contributions
 spoon-0.292
 peas-0.280
berries-0.279
 lettuce-0.277
 knives-0.271
Token He
Token ID679
Feature activation+1.89e-04

Pos logit contributions
 spoon+0.001
 carrots+0.001
 peas+0.001
berries+0.001
 lettuce+0.001
Neg logit contributions
 like-0.001
 soon-0.001
 forward-0.001
 than-0.001
 surely-0.001
Token also
Token ID635
Feature activation+3.50e-02

Pos logit contributions
 soon+0.000
 like+0.000
 than+0.000
 surely+0.000
te+0.000
Neg logit contributions
 spoon-0.000
berries-0.000
 carrots-0.000
 lettuce-0.000
 knives-0.000
Token liked
Token ID8288
Feature activation-4.31e-03

Pos logit contributions
 like+0.038
 soon+0.037
 forward+0.034
 than+0.034
 surely+0.033
Neg logit contributions
 spoon-0.077
 carrots-0.075
 peas-0.073
berries-0.072
 lettuce-0.072
Token to
Token ID284
Feature activation+5.73e-03

Pos logit contributions
 spoon+0.011
berries+0.011
 lettuce+0.011
 peas+0.011
 carrots+0.011
Neg logit contributions
 like-0.003
 soon-0.003
 than-0.003
 when-0.003
 after-0.003
Token She
Token ID1375
Feature activation+5.65e-03

Pos logit contributions
berries+0.002
 sleeves+0.002
 berries+0.002
 carrots+0.002
 rocks+0.002
Neg logit contributions
bek-0.003
ually-0.003
ores-0.003
 surely-0.003
ook-0.003
Token was
Token ID373
Feature activation+1.01e-02

Pos logit contributions
 forward+0.007
 surely+0.007
 soon+0.007
ually+0.007
 like+0.007
Neg logit contributions
 spoon-0.011
 carrots-0.011
 peas-0.010
 lettuce-0.010
 carrot-0.010
Token so
Token ID523
Feature activation-1.41e-02

Pos logit contributions
away+0.015
bek+0.014
ually+0.013
 than+0.013
riage+0.013
Neg logit contributions
 spoon-0.020
 carrots-0.019
 peas-0.018
 soup-0.017
 bowl-0.017
Token cute
Token ID13779
Feature activation-1.88e-03

Pos logit contributions
 spoon+0.028
 carrots+0.027
 peas+0.027
berries+0.026
 lettuce+0.026
Neg logit contributions
 like-0.018
 soon-0.018
 forward-0.017
 than-0.017
 surely-0.016
Token!
Token ID0
Feature activation+9.14e-03

Pos logit contributions
 spoon+0.004
 carrots+0.004
berries+0.003
 peas+0.003
 carrot+0.003
Neg logit contributions
 surely-0.003
ually-0.003
 known-0.002
bek-0.002
bel-0.002
Token Beat
Token ID12568
Feature activation+1.24e-01

Pos logit contributions
ually+0.014
 soon+0.014
 surely+0.014
bek+0.014
 like+0.013
Neg logit contributions
berries-0.014
 carrots-0.014
 spoon-0.014
 berries-0.013
 sleeves-0.013
Tokenrice
Token ID20970
Feature activation+6.50e-02

Pos logit contributions
ores+0.200
 surely+0.193
ained+0.185
 forward+0.184
 known+0.178
Neg logit contributions
 sleeves-0.163
 cries-0.152
 cheeks-0.151
opus-0.147
 couch-0.147
Token had
Token ID550
Feature activation+1.65e-02

Pos logit contributions
 like+0.067
 soon+0.065
 forward+0.063
 surely+0.061
 than+0.061
Neg logit contributions
 spoon-0.143
 carrots-0.140
 peas-0.137
 lettuce-0.134
berries-0.134
Token five
Token ID1936
Feature activation-1.97e-02

Pos logit contributions
 like+0.013
 soon+0.011
 when+0.009
 than+0.008
 after+0.008
Neg logit contributions
 spoon-0.044
berries-0.043
 peas-0.042
 knives-0.042
 carrots-0.042
Token fingers
Token ID9353
Feature activation-5.05e-03

Pos logit contributions
 spoon+0.031
 lettuce+0.031
berries+0.031
 knives+0.029
 onion+0.029
Neg logit contributions
 like-0.034
 than-0.032
 soon-0.029
 when-0.028
 after-0.027
Token on
Token ID319
Feature activation+1.81e-02

Pos logit contributions
 spoon+0.012
 carrots+0.012
 peas+0.012
 lettuce+0.012
 knives+0.012
Neg logit contributions
 like-0.005
 soon-0.004
 than-0.004
 when-0.004
 forward-0.003
Token
Token ID198
Feature activation+8.58e-04

Pos logit contributions
 like+0.023
 soon+0.022
 when+0.017
 known+0.017
 after+0.017
Neg logit contributions
berries-0.065
 spoon-0.064
 carrots-0.062
 peas-0.061
 lettuce-0.061
Token
Token ID198
Feature activation+2.12e-03

Pos logit contributions
 soon+0.001
 like+0.001
 forward+0.001
 than+0.001
te+0.001
Neg logit contributions
 spoon-0.002
 carrots-0.002
 peas-0.002
berries-0.002
 knives-0.002
TokenFrom
Token ID4863
Feature activation+4.20e-02

Pos logit contributions
 like+0.002
 soon+0.002
 forward+0.002
 than+0.002
 surely+0.002
Neg logit contributions
 spoon-0.005
 carrots-0.005
 peas-0.005
berries-0.004
 lettuce-0.004
Token that
Token ID326
Feature activation+2.05e-02

Pos logit contributions
te+0.047
ook+0.047
 forward+0.047
ually+0.047
 like+0.046
Neg logit contributions
 carrots-0.083
 spoon-0.080
 peas-0.078
 berries-0.077
 lettuce-0.077
Token day
Token ID1110
Feature activation+2.37e-02

Pos logit contributions
 like+0.014
 soon+0.014
 forward+0.012
 surely+0.012
 than+0.011
Neg logit contributions
 peas-0.053
 spoon-0.053
 carrots-0.053
berries-0.053
 lettuce-0.052
Token forward
Token ID2651
Feature activation+1.23e-01

Pos logit contributions
 like+0.025
 than+0.024
 soon+0.024
ually+0.023
bel+0.022
Neg logit contributions
 spoon-0.051
 carrots-0.049
 knives-0.048
berries-0.047
 peas-0.046
Token,
Token ID11
Feature activation+2.82e-02

Pos logit contributions
 like+0.102
 soon+0.098
 than+0.085
 forward+0.083
 after+0.080
Neg logit contributions
 spoon-0.308
 carrots-0.301
 peas-0.297
 lettuce-0.293
berries-0.291
Token Daisy
Token ID40355
Feature activation+1.19e-02

Pos logit contributions
 like+0.032
 soon+0.031
 than+0.028
 forward+0.028
 surely+0.027
Neg logit contributions
 spoon-0.061
 carrots-0.059
 peas-0.058
berries-0.058
 lettuce-0.058
Token stri
Token ID2338
Feature activation+4.51e-02

Pos logit contributions
 like+0.013
 soon+0.013
 forward+0.012
 than+0.012
 surely+0.011
Neg logit contributions
 spoon-0.026
 carrots-0.025
 peas-0.025
berries-0.024
 lettuce-0.024
Tokenved
Token ID1079
Feature activation+2.02e-02

Pos logit contributions
 like+0.055
 soon+0.053
 than+0.050
 forward+0.050
 surely+0.049
Neg logit contributions
 spoon-0.092
 carrots-0.089
 peas-0.088
 lettuce-0.087
berries-0.086
Token to
Token ID284
Feature activation+5.94e-03

Pos logit contributions
 like+0.019
 soon+0.019
 than+0.017
 forward+0.017
 after+0.015
Neg logit contributions
 spoon-0.049
 peas-0.047
berries-0.047
 carrots-0.047
 lettuce-0.046
Token appeared
Token ID4120
Feature activation+1.48e-02

Pos logit contributions
 like+0.007
 soon+0.007
 than+0.006
 when+0.006
 after+0.006
Neg logit contributions
 spoon-0.013
 peas-0.013
 carrots-0.013
berries-0.013
 lettuce-0.013
Token from
Token ID422
Feature activation+1.59e-02

Pos logit contributions
 like+0.014
ually+0.014
 soon+0.014
 surely+0.014
 than+0.013
Neg logit contributions
 carrots-0.032
 spoon-0.032
berries-0.031
 lettuce-0.030
 knives-0.030
Token the
Token ID262
Feature activation-5.76e-03

Pos logit contributions
te+0.016
ually+0.015
 forward+0.015
ook+0.015
ends+0.015
Neg logit contributions
 carrots-0.035
 spoon-0.034
 peas-0.033
 berries-0.033
 lettuce-0.032
Token ground
Token ID2323
Feature activation-2.51e-02

Pos logit contributions
berries+0.008
 peas+0.008
pill+0.008
 spoon+0.008
affles+0.008
Neg logit contributions
 like-0.011
 soon-0.010
 when-0.010
 after-0.010
 over-0.010
Token.
Token ID13
Feature activation+3.36e-03

Pos logit contributions
 spoon+0.058
 peas+0.058
 carrots+0.057
 lettuce+0.056
 knives+0.055
Neg logit contributions
 like-0.028
 soon-0.028
 than-0.025
 when-0.024
 over-0.024
Token Lori
Token ID43741
Feature activation+1.21e-01

Pos logit contributions
ually+0.005
 soon+0.005
 like+0.005
te+0.005
 surely+0.005
Neg logit contributions
 spoon-0.006
 carrots-0.006
 lettuce-0.005
 berries-0.005
berries-0.005
Token was
Token ID373
Feature activation+1.21e-02

Pos logit contributions
 soon+0.133
 like+0.127
 over+0.106
 when+0.106
 after+0.104
Neg logit contributions
 spoon-0.275
berries-0.274
 carrots-0.273
 peas-0.269
 lettuce-0.268
Token so
Token ID523
Feature activation-1.09e-02

Pos logit contributions
 soon+0.013
 like+0.013
ually+0.013
 than+0.013
 surely+0.013
Neg logit contributions
 spoon-0.025
 carrots-0.025
 peas-0.024
 lettuce-0.023
 berries-0.023
Token excited
Token ID6568
Feature activation+1.17e-03

Pos logit contributions
 spoon+0.023
 carrots+0.022
 peas+0.021
 lettuce+0.021
berries+0.021
Neg logit contributions
 like-0.012
 soon-0.012
 forward-0.012
 surely-0.011
ually-0.011
Token!
Token ID0
Feature activation+5.54e-03

Pos logit contributions
 like+0.001
 soon+0.001
 forward+0.001
 than+0.001
 surely+0.001
Neg logit contributions
 spoon-0.003
 carrots-0.003
 peas-0.003
berries-0.002
 lettuce-0.002
Token She
Token ID1375
Feature activation+1.60e-02

Pos logit contributions
 like+0.008
 soon+0.008
 than+0.007
 surely+0.007
te+0.007
Neg logit contributions
 spoon-0.010
 carrots-0.010
berries-0.010
 lettuce-0.009
 peas-0.009
Token that
Token ID326
Feature activation-1.78e-02

Pos logit contributions
 like+0.018
 soon+0.017
 than+0.016
 forward+0.016
te+0.015
Neg logit contributions
 spoon-0.032
 carrots-0.031
 peas-0.030
berries-0.030
 lettuce-0.030
Token loved
Token ID6151
Feature activation-2.52e-02

Pos logit contributions
 spoon+0.036
berries+0.035
 carrots+0.035
 peas+0.034
 lettuce+0.034
Neg logit contributions
 like-0.023
 soon-0.022
 than-0.020
te-0.020
 forward-0.019
Token to
Token ID284
Feature activation-1.55e-04

Pos logit contributions
 spoon+0.058
 carrots+0.057
 peas+0.055
berries+0.055
 lettuce+0.054
Neg logit contributions
 like-0.025
 soon-0.024
 forward-0.022
 than-0.022
 surely-0.022
Token play
Token ID711
Feature activation-1.55e-02

Pos logit contributions
 spoon+0.000
berries+0.000
 knives+0.000
 peas+0.000
 lettuce+0.000
Neg logit contributions
 like-0.000
 soon-0.000
 than-0.000
 after-0.000
 from-0.000
Token.
Token ID13
Feature activation+4.14e-03

Pos logit contributions
 carrots+0.034
 spoon+0.034
 peas+0.032
berries+0.032
 knives+0.032
Neg logit contributions
 soon-0.015
 like-0.015
 surely-0.015
ually-0.015
 forward-0.014
Token Alison
Token ID33629
Feature activation+1.21e-01

Pos logit contributions
vy+0.007
 surely+0.007
te+0.007
 like+0.006
 soon+0.006
Neg logit contributions
 carrots-0.006
 spoon-0.006
 knives-0.006
 lettuce-0.006
berries-0.006
Token introduced
Token ID5495
Feature activation+5.80e-02

Pos logit contributions
 like+0.141
 soon+0.140
 than+0.122
te+0.121
 forward+0.120
Neg logit contributions
 spoon-0.257
 carrots-0.252
berries-0.247
 peas-0.246
 lettuce-0.244
Token Brown
Token ID4373
Feature activation+2.39e-02

Pos logit contributions
 like+0.075
 soon+0.071
 surely+0.070
 forward+0.070
 than+0.069
Neg logit contributions
 spoon-0.112
 carrots-0.112
 peas-0.108
berries-0.107
 lettuce-0.106
Tokenie
Token ID494
Feature activation+2.94e-02

Pos logit contributions
 like+0.028
 soon+0.028
 than+0.025
 forward+0.025
te+0.024
Neg logit contributions
 spoon-0.051
 carrots-0.048
 peas-0.048
berries-0.047
 lettuce-0.047
Token to
Token ID284
Feature activation+1.22e-02

Pos logit contributions
 like+0.026
 soon+0.025
 forward+0.022
 than+0.022
 after+0.021
Neg logit contributions
 spoon-0.072
 carrots-0.069
 peas-0.069
berries-0.069
 lettuce-0.068
Token all
Token ID477
Feature activation+7.66e-04

Pos logit contributions
 like+0.013
 soon+0.013
 than+0.011
 forward+0.011
 after+0.011
Neg logit contributions
 spoon-0.027
 carrots-0.026
berries-0.026
 peas-0.026
 lettuce-0.026
TokenThe
Token ID464
Feature activation-1.17e-02

Pos logit contributions
 like+0.009
ually+0.008
 soon+0.008
 than+0.008
 surely+0.008
Neg logit contributions
 carrots-0.012
 lettuce-0.012
 spoon-0.011
berries-0.011
pill-0.011
Token stranger
Token ID16195
Feature activation+2.27e-02

Pos logit contributions
 spoon+0.024
berries+0.023
 knives+0.023
 peas+0.022
 sleeves+0.022
Neg logit contributions
 forward-0.016
 like-0.016
 soon-0.016
 over-0.015
 surely-0.015
Token waved
Token ID26834
Feature activation+3.75e-02

Pos logit contributions
 soon+0.029
 like+0.028
 surely+0.024
te+0.024
 after+0.023
Neg logit contributions
 spoon-0.047
 carrots-0.046
berries-0.046
 knives-0.045
 peas-0.045
Token goodbye
Token ID24829
Feature activation+2.10e-02

Pos logit contributions
ually+0.049
 soon+0.049
 like+0.048
 surely+0.047
te+0.046
Neg logit contributions
 spoon-0.070
 carrots-0.070
 peas-0.066
 lettuce-0.066
berries-0.065
Token and
Token ID290
Feature activation+2.43e-02

Pos logit contributions
 like+0.022
 surely+0.022
ually+0.021
 forward+0.021
 soon+0.021
Neg logit contributions
 spoon-0.043
 carrots-0.043
 peas-0.040
 lettuce-0.040
berries-0.040
Token Paige
Token ID42847
Feature activation+1.20e-01

Pos logit contributions
te+0.039
ually+0.037
organ+0.037
ook+0.037
 like+0.037
Neg logit contributions
 carrots-0.037
 berries-0.037
 spoon-0.036
 lettuce-0.035
 carrot-0.034
Token went
Token ID1816
Feature activation+1.87e-02

Pos logit contributions
 soon+0.148
 like+0.146
 forward+0.133
 than+0.131
 surely+0.130
Neg logit contributions
 spoon-0.246
 carrots-0.238
 peas-0.234
berries-0.233
 knives-0.232
Token home
Token ID1363
Feature activation-1.10e-02

Pos logit contributions
 like+0.021
 soon+0.020
ually+0.019
 than+0.019
 surely+0.019
Neg logit contributions
 spoon-0.039
 carrots-0.039
 peas-0.037
 lettuce-0.037
berries-0.037
Token happy
Token ID3772
Feature activation+1.81e-02

Pos logit contributions
 spoon+0.023
 carrots+0.023
 peas+0.022
berries+0.022
 lettuce+0.022
Neg logit contributions
 like-0.013
 soon-0.012
 forward-0.011
 than-0.011
 surely-0.011
Token.
Token ID13
Feature activation+2.17e-03

Pos logit contributions
 like+0.017
 soon+0.017
 than+0.015
 after+0.015
 from+0.014
Neg logit contributions
 spoon-0.046
 lettuce-0.044
 knives-0.044
 peas-0.044
 carrots-0.044
Token<|endoftext|>
Token ID50256
Feature activation-2.07e-02

Pos logit contributions
 soon+0.003
ually+0.003
te+0.003
organ+0.003
away+0.003
Neg logit contributions
berries-0.003
 lettuce-0.003
 carrots-0.003
 spoon-0.003
 berries-0.003
Token adventure
Token ID8855
Feature activation+1.63e-02

Pos logit contributions
 soon+0.067
 like+0.065
 than+0.060
 after+0.060
away+0.059
Neg logit contributions
 knives-0.073
berries-0.071
 spoon-0.069
 peas-0.068
 lettuce-0.066
Token.
Token ID13
Feature activation+7.33e-03

Pos logit contributions
 like+0.016
 soon+0.016
 forward+0.015
 than+0.014
 surely+0.014
Neg logit contributions
 spoon-0.037
 carrots-0.036
 peas-0.035
berries-0.035
 lettuce-0.035
Token<|endoftext|>
Token ID50256
Feature activation-1.60e-02

Pos logit contributions
 like+0.008
 soon+0.008
 forward+0.007
 than+0.007
 surely+0.007
Neg logit contributions
 spoon-0.016
 carrots-0.015
 peas-0.015
 knives-0.015
berries-0.015
TokenJohn
Token ID7554
Feature activation+1.44e-02

Pos logit contributions
 spoon+0.032
 peas+0.031
 carrots+0.031
 knives+0.030
 lettuce+0.030
Neg logit contributions
 like-0.021
 soon-0.021
 over-0.019
 forward-0.019
te-0.019
Token and
Token ID290
Feature activation+8.69e-03

Pos logit contributions
 like+0.015
 soon+0.015
 than+0.013
 over+0.012
 when+0.012
Neg logit contributions
 spoon-0.033
 carrots-0.032
 peas-0.031
 knives-0.031
 lettuce-0.031
Token Tara
Token ID37723
Feature activation+1.19e-01

Pos logit contributions
 like+0.011
 soon+0.011
 forward+0.010
 than+0.010
 surely+0.010
Neg logit contributions
 spoon-0.017
 carrots-0.017
 peas-0.017
 lettuce-0.016
berries-0.016
Token were
Token ID547
Feature activation-1.87e-02

Pos logit contributions
 like+0.146
 soon+0.142
 forward+0.132
 than+0.131
 surely+0.129
Neg logit contributions
 spoon-0.244
 carrots-0.238
 peas-0.233
berries-0.229
 lettuce-0.229
Token playing
Token ID2712
Feature activation-3.01e-02

Pos logit contributions
 spoon+0.037
 carrots+0.036
 peas+0.035
 knives+0.035
 lettuce+0.035
Neg logit contributions
 like-0.025
 soon-0.025
 forward-0.024
 than-0.022
 surely-0.022
Token in
Token ID287
Feature activation-3.92e-03

Pos logit contributions
 spoon+0.079
 peas+0.077
 carrots+0.077
berries+0.076
 lettuce+0.076
Neg logit contributions
 like-0.021
 soon-0.020
 forward-0.016
 than-0.016
 after-0.016
Token the
Token ID262
Feature activation+9.73e-03

Pos logit contributions
 spoon+0.010
 carrots+0.010
 peas+0.010
 lettuce+0.010
berries+0.010
Neg logit contributions
 like-0.003
 soon-0.003
 than-0.002
 forward-0.002
 surely-0.002
Token park
Token ID3952
Feature activation-4.03e-03

Pos logit contributions
 like+0.018
 soon+0.018
 when+0.016
 over+0.016
 after+0.016
Neg logit contributions
 spoon-0.015
 peas-0.014
 carrots-0.014
berries-0.014
 knives-0.013
Token nap
Token ID25422
Feature activation-3.96e-02

Pos logit contributions
 knives+0.046
berries+0.045
 peas+0.044
 lettuce+0.043
 carrots+0.043
Neg logit contributions
 like-0.031
 soon-0.030
 after-0.027
 forward-0.026
 than-0.025
Token.
Token ID13
Feature activation+2.67e-03

Pos logit contributions
 spoon+0.088
 carrots+0.086
berries+0.085
 peas+0.085
 lettuce+0.084
Neg logit contributions
 like-0.043
 soon-0.042
 than-0.038
 forward-0.038
 surely-0.036
Token<|endoftext|>
Token ID50256
Feature activation-2.45e-02

Pos logit contributions
 like+0.003
 soon+0.003
 forward+0.003
 than+0.003
 surely+0.003
Neg logit contributions
 spoon-0.006
 peas-0.006
 carrots-0.006
 knives-0.006
berries-0.005
TokenEric
Token ID25004
Feature activation+6.07e-02

Pos logit contributions
 spoon+0.049
 peas+0.047
 carrots+0.047
 knives+0.046
 lettuce+0.046
Neg logit contributions
 like-0.033
 soon-0.032
 forward-0.030
 over-0.030
te-0.029
Token and
Token ID290
Feature activation+3.21e-03

Pos logit contributions
 like+0.070
 soon+0.068
 forward+0.064
 than+0.063
 surely+0.062
Neg logit contributions
 spoon-0.128
 carrots-0.125
 peas-0.122
berries-0.120
 lettuce-0.120
Token Alison
Token ID33629
Feature activation+1.19e-01

Pos logit contributions
 like+0.004
 soon+0.004
 surely+0.004
 forward+0.004
 than+0.004
Neg logit contributions
 spoon-0.006
 carrots-0.006
 peas-0.006
 lettuce-0.006
berries-0.006
Token liked
Token ID8288
Feature activation-1.13e-02

Pos logit contributions
 like+0.138
 soon+0.133
 forward+0.121
 than+0.120
 surely+0.118
Neg logit contributions
 spoon-0.255
 carrots-0.249
 peas-0.243
berries-0.240
 lettuce-0.239
Token to
Token ID284
Feature activation+1.91e-03

Pos logit contributions
 spoon+0.032
 sleeves+0.031
berries+0.030
 peas+0.030
 knives+0.030
Neg logit contributions
 like-0.006
 soon-0.005
 when-0.004
 their-0.004
 after-0.004
Token play
Token ID711
Feature activation-1.05e-02

Pos logit contributions
 like+0.002
 soon+0.002
 when+0.002
 over+0.002
 after+0.002
Neg logit contributions
 knives-0.004
 carrots-0.004
 peas-0.004
 spoon-0.004
berries-0.004
Token together
Token ID1978
Feature activation-1.30e-02

Pos logit contributions
 spoon+0.027
 peas+0.027
 carrots+0.026
berries+0.026
 lettuce+0.026
Neg logit contributions
 like-0.008
 soon-0.007
 after-0.006
 when-0.006
 than-0.006
Token.
Token ID13
Feature activation+1.28e-02

Pos logit contributions
 spoon+0.028
 peas+0.027
 carrots+0.027
 lettuce+0.027
 knives+0.027
Neg logit contributions
 like-0.016
 soon-0.015
 than-0.014
 when-0.014
 after-0.013
Token separate
Token ID4553
Feature activation+3.24e-02

Pos logit contributions
 like+0.044
 soon+0.041
 surely+0.036
 over+0.035
 when+0.034
Neg logit contributions
berries-0.078
 peas-0.076
 carrots-0.076
 spoon-0.075
 knives-0.075
Token the
Token ID262
Feature activation+3.14e-02

Pos logit contributions
 like+0.030
 soon+0.029
 forward+0.026
 than+0.026
 surely+0.026
Neg logit contributions
 spoon-0.076
 carrots-0.074
 peas-0.073
berries-0.072
 lettuce-0.072
Token stamps
Token ID25560
Feature activation+3.96e-02

Pos logit contributions
 like+0.053
 than+0.050
 soon+0.050
 forward+0.049
 after+0.047
Neg logit contributions
 spoon-0.050
berries-0.049
 peas-0.048
 carrots-0.047
 knives-0.045
Token and
Token ID290
Feature activation+2.05e-02

Pos logit contributions
 like+0.033
 soon+0.032
 forward+0.029
te+0.028
 than+0.028
Neg logit contributions
 spoon-0.095
 carrots-0.093
 peas-0.090
 lettuce-0.089
berries-0.089
Token give
Token ID1577
Feature activation+9.19e-03

Pos logit contributions
 like+0.026
 soon+0.026
 surely+0.022
 than+0.022
 after+0.022
Neg logit contributions
 spoon-0.042
 carrots-0.041
berries-0.041
 peas-0.040
 lettuce-0.039
Token Alison
Token ID33629
Feature activation+1.18e-01

Pos logit contributions
 like+0.011
 soon+0.011
 than+0.010
 forward+0.010
 surely+0.010
Neg logit contributions
 spoon-0.019
 carrots-0.018
 peas-0.018
 lettuce-0.017
berries-0.017
Token one
Token ID530
Feature activation+1.19e-02

Pos logit contributions
 like+0.112
 soon+0.107
 forward+0.096
 than+0.092
 after+0.091
Neg logit contributions
 spoon-0.280
 carrots-0.268
 peas-0.267
berries-0.267
 knives-0.261
Token.
Token ID13
Feature activation-5.65e-03

Pos logit contributions
 like+0.013
 soon+0.011
 than+0.011
 forward+0.011
 from+0.011
Neg logit contributions
 peas-0.028
 carrots-0.027
berries-0.027
 spoon-0.027
 knives-0.026
Token
Token ID198
Feature activation+1.88e-03

Pos logit contributions
 spoon+0.010
 carrots+0.010
berries+0.009
 lettuce+0.009
 peas+0.009
Neg logit contributions
 like-0.008
 soon-0.008
ually-0.007
 than-0.007
 surely-0.007
Token
Token ID198
Feature activation-1.69e-03

Pos logit contributions
 like+0.003
 soon+0.002
 than+0.002
ually+0.002
 surely+0.002
Neg logit contributions
 spoon-0.003
 carrots-0.003
 peas-0.003
berries-0.003
 lettuce-0.003
Token"
Token ID1
Feature activation+1.21e-02

Pos logit contributions
 spoon+0.003
 carrots+0.003
 knives+0.003
 peas+0.003
 lettuce+0.003
Neg logit contributions
 like-0.002
ually-0.002
 than-0.002
 soon-0.002
 theirs-0.002
Token to
Token ID284
Feature activation-1.98e-02

Pos logit contributions
ually+0.015
 surely+0.015
te+0.015
 like+0.015
organ+0.014
Neg logit contributions
 spoon-0.025
 carrots-0.023
 knives-0.022
berries-0.022
 lettuce-0.022
Token leave
Token ID2666
Feature activation+1.70e-02

Pos logit contributions
 spoon+0.039
 carrots+0.037
 peas+0.036
 lettuce+0.036
berries+0.035
Neg logit contributions
 than-0.026
 like-0.024
 soon-0.024
ually-0.024
 forward-0.024
Token the
Token ID262
Feature activation+1.22e-02

Pos logit contributions
ually+0.025
 surely+0.023
organ+0.023
bel+0.023
ends+0.023
Neg logit contributions
 spoon-0.030
 carrots-0.028
 knives-0.027
 berries-0.027
 peas-0.027
Token park
Token ID3952
Feature activation-8.98e-03

Pos logit contributions
 like+0.021
 soon+0.020
 than+0.019
 after+0.019
 when+0.018
Neg logit contributions
 spoon-0.021
 peas-0.020
berries-0.020
 knives-0.020
 carrots-0.020
Token.
Token ID13
Feature activation+5.99e-03

Pos logit contributions
 spoon+0.016
 carrots+0.016
berries+0.015
 bowl+0.015
 berries+0.015
Neg logit contributions
ually-0.011
 surely-0.011
 like-0.010
organ-0.010
te-0.010
Token Lara
Token ID29849
Feature activation+1.18e-01

Pos logit contributions
ually+0.010
 soon+0.009
bek+0.009
te+0.009
 than+0.008
Neg logit contributions
 spoon-0.010
 carrots-0.009
 wipes-0.009
 lettuce-0.009
berries-0.009
Token's
Token ID338
Feature activation+1.17e-02

Pos logit contributions
 soon+0.145
 like+0.142
 surely+0.126
 forward+0.126
 than+0.125
Neg logit contributions
 spoon-0.240
 carrots-0.237
 peas-0.232
berries-0.232
 lettuce-0.229
Token mom
Token ID1995
Feature activation+1.34e-02

Pos logit contributions
 soon+0.018
 like+0.018
 forward+0.017
 surely+0.016
 when+0.016
Neg logit contributions
 peas-0.020
 spoon-0.019
 carrots-0.019
 knives-0.019
berries-0.019
Token said
Token ID531
Feature activation+1.18e-03

Pos logit contributions
 soon+0.018
 like+0.018
 forward+0.016
 surely+0.016
te+0.016
Neg logit contributions
 spoon-0.026
 carrots-0.026
 peas-0.025
 lettuce-0.025
berries-0.025
Token,
Token ID11
Feature activation-1.07e-02

Pos logit contributions
 like+0.001
 soon+0.001
 than+0.001
 surely+0.001
ually+0.001
Neg logit contributions
 spoon-0.003
 carrots-0.003
 peas-0.002
 knives-0.002
 lettuce-0.002
Token "
Token ID366
Feature activation+7.58e-03

Pos logit contributions
 spoon+0.021
 carrots+0.020
 peas+0.020
berries+0.020
 lettuce+0.019
Neg logit contributions
 like-0.014
 soon-0.014
 forward-0.013
 than-0.013
 surely-0.013
Token Today
Token ID6288
Feature activation+2.58e-02

Pos logit contributions
 spoon+0.007
 carrots+0.007
berries+0.007
 lettuce+0.006
 knives+0.006
Neg logit contributions
 soon-0.006
te-0.006
ually-0.006
 surely-0.006
 like-0.006
Token was
Token ID373
Feature activation+2.42e-02

Pos logit contributions
bek+0.038
ually+0.038
year+0.038
bel+0.037
amed+0.037
Neg logit contributions
 spoon-0.047
berries-0.043
 berries-0.042
 carrots-0.042
 bowl-0.041
Token no
Token ID645
Feature activation-2.47e-02

Pos logit contributions
ually+0.033
away+0.030
te+0.030
bek+0.030
k+0.029
Neg logit contributions
 spoon-0.047
 carrots-0.047
 berries-0.045
 knives-0.044
 lettuce-0.044
Token different
Token ID1180
Feature activation-2.21e-03

Pos logit contributions
 spoon+0.046
 carrots+0.046
 lettuce+0.045
 peas+0.045
berries+0.045
Neg logit contributions
 like-0.036
 soon-0.034
 than-0.032
 after-0.031
 forward-0.031
Token.
Token ID13
Feature activation-3.21e-03

Pos logit contributions
 spoon+0.004
 carrots+0.004
 lettuce+0.004
 peas+0.004
berries+0.004
Neg logit contributions
 like-0.003
 soon-0.003
 than-0.003
 after-0.003
 when-0.003
Token Lori
Token ID43741
Feature activation+1.18e-01

Pos logit contributions
 spoon+0.005
 carrots+0.005
berries+0.005
 lettuce+0.005
 berries+0.005
Neg logit contributions
 soon-0.005
ually-0.005
te-0.005
 surely-0.005
 than-0.005
Token grabbed
Token ID13176
Feature activation-5.65e-03

Pos logit contributions
 soon+0.137
 like+0.132
 after+0.113
 when+0.112
te+0.106
Neg logit contributions
berries-0.255
 carrots-0.255
 spoon-0.251
 lettuce-0.250
 peas-0.248
Token her
Token ID607
Feature activation+8.20e-03

Pos logit contributions
 spoon+0.013
 carrots+0.013
 peas+0.012
 lettuce+0.012
 knives+0.012
Neg logit contributions
 like-0.005
 soon-0.005
 than-0.005
ually-0.005
 forward-0.005
Token sp
Token ID599
Feature activation-6.23e-02

Pos logit contributions
 like+0.018
 soon+0.018
 when+0.017
 after+0.017
 over+0.016
Neg logit contributions
 carrots-0.008
 peas-0.008
berries-0.008
 knives-0.008
 lettuce-0.007
Tokenade
Token ID671
Feature activation-2.09e-02

Pos logit contributions
 carrots+0.098
 peas+0.095
 spoon+0.093
 knives+0.092
 berries+0.090
Neg logit contributions
 like-0.097
 than-0.094
ook-0.094
 soon-0.090
te-0.086
Token and
Token ID290
Feature activation+4.76e-03

Pos logit contributions
 spoon+0.050
 carrots+0.049
 peas+0.049
 lettuce+0.048
berries+0.048
Neg logit contributions
 like-0.018
 soon-0.018
 forward-0.016
 than-0.016
 after-0.015
Token finger
Token ID7660
Feature activation-1.44e-02

Pos logit contributions
 like+0.015
 soon+0.014
 forward+0.013
 than+0.013
 when+0.012
Neg logit contributions
 peas-0.020
berries-0.020
 carrots-0.019
 lettuce-0.019
 knives-0.019
Token in
Token ID287
Feature activation-1.38e-02

Pos logit contributions
 spoon+0.036
 peas+0.036
 carrots+0.036
berries+0.036
 lettuce+0.035
Neg logit contributions
 like-0.012
 soon-0.012
 than-0.010
 when-0.009
 over-0.009
Token the
Token ID262
Feature activation+6.37e-03

Pos logit contributions
 spoon+0.035
 peas+0.034
 carrots+0.034
berries+0.034
 lettuce+0.034
Neg logit contributions
 like-0.011
 soon-0.011
 than-0.009
 after-0.008
 forward-0.008
Token air
Token ID1633
Feature activation+2.19e-02

Pos logit contributions
 like+0.011
 forward+0.011
 soon+0.010
 over+0.010
 surely+0.010
Neg logit contributions
berries-0.009
 peas-0.008
pill-0.008
 carrots-0.008
unn-0.008
Token.
Token ID13
Feature activation+5.50e-03

Pos logit contributions
 like+0.025
 soon+0.023
 than+0.022
 when+0.020
 over+0.020
Neg logit contributions
 spoon-0.049
 carrots-0.049
 peas-0.049
 knives-0.048
 lettuce-0.048
Token Beat
Token ID12568
Feature activation+1.17e-01

Pos logit contributions
 like+0.005
 soon+0.005
 When+0.005
 when+0.004
 forward+0.004
Neg logit contributions
 spoon-0.013
 carrots-0.013
 peas-0.013
 knives-0.013
 lettuce-0.012
Tokenrice
Token ID20970
Feature activation+6.06e-02

Pos logit contributions
ores+0.151
 surely+0.144
ained+0.138
 forward+0.137
 like+0.137
Neg logit contributions
 sleeves-0.187
 spoon-0.182
 cheeks-0.174
 carrots-0.173
 cries-0.173
Token did
Token ID750
Feature activation-4.70e-03

Pos logit contributions
 soon+0.075
 like+0.074
 over+0.061
 when+0.061
 surely+0.061
Neg logit contributions
 carrots-0.126
 spoon-0.124
berries-0.124
 peas-0.123
 lettuce-0.123
Token as
Token ID355
Feature activation+1.57e-02

Pos logit contributions
 spoon+0.011
 carrots+0.010
 peas+0.010
 knives+0.010
 lettuce+0.010
Neg logit contributions
 soon-0.004
 like-0.004
 forward-0.004
ually-0.004
 surely-0.004
Token her
Token ID607
Feature activation+1.28e-03

Pos logit contributions
ually+0.022
 surely+0.021
 like+0.021
 than+0.021
 forward+0.021
Neg logit contributions
 spoon-0.026
 carrots-0.026
 peas-0.024
 knives-0.024
 berries-0.023
Token mom
Token ID1995
Feature activation+2.00e-02

Pos logit contributions
 soon+0.002
 like+0.002
 when+0.002
 as+0.002
 surely+0.002
Neg logit contributions
berries-0.002
 peas-0.002
 lettuce-0.002
pill-0.002
 carrots-0.002
Token out
Token ID503
Feature activation+3.40e-02

Pos logit contributions
 spoon+0.001
 carrots+0.001
 peas+0.001
 lettuce+0.001
berries+0.001
Neg logit contributions
ually-0.001
 known-0.001
bek-0.001
 than-0.001
 surely-0.001
Token of
Token ID286
Feature activation+3.59e-03

Pos logit contributions
 like+0.033
 soon+0.032
 forward+0.029
 than+0.028
 surely+0.027
Neg logit contributions
 spoon-0.079
 carrots-0.078
 peas-0.077
 lettuce-0.076
berries-0.075
Token the
Token ID262
Feature activation-4.48e-03

Pos logit contributions
 like+0.002
 soon+0.002
 forward+0.002
 over+0.002
 after+0.002
Neg logit contributions
 spoon-0.009
 carrots-0.009
 peas-0.009
 lettuce-0.009
berries-0.009
Token house
Token ID2156
Feature activation+7.51e-03

Pos logit contributions
 peas+0.007
 spoon+0.007
 knives+0.007
 carrots+0.007
berries+0.007
Neg logit contributions
 over-0.007
 like-0.007
 soon-0.007
 forward-0.006
 after-0.006
Token.
Token ID13
Feature activation+6.61e-04

Pos logit contributions
 like+0.009
 soon+0.008
 forward+0.008
 than+0.008
 surely+0.008
Neg logit contributions
 spoon-0.016
 carrots-0.016
 peas-0.015
berries-0.015
 lettuce-0.015
Token Carly
Token ID35982
Feature activation+1.17e-01

Pos logit contributions
te+0.001
 soon+0.001
 like+0.001
vy+0.001
away+0.001
Neg logit contributions
 carrots-0.001
 spoon-0.001
berries-0.001
 knives-0.001
 lettuce-0.001
Token stepped
Token ID10764
Feature activation-3.51e-03

Pos logit contributions
 soon+0.143
 like+0.140
 forward+0.121
te+0.121
 after+0.120
Neg logit contributions
 spoon-0.243
 carrots-0.242
berries-0.236
 peas-0.236
 lettuce-0.235
Token carefully
Token ID7773
Feature activation-1.21e-02

Pos logit contributions
 spoon+0.009
 carrots+0.009
 peas+0.009
 lettuce+0.009
berries+0.009
Neg logit contributions
 like-0.003
 soon-0.003
 forward-0.002
 surely-0.002
 than-0.002
Token,
Token ID11
Feature activation+7.06e-03

Pos logit contributions
 spoon+0.029
 carrots+0.029
 peas+0.028
 lettuce+0.028
berries+0.028
Neg logit contributions
 like-0.011
 soon-0.010
 forward-0.009
 than-0.009
 surely-0.009
Token because
Token ID780
Feature activation+2.74e-02

Pos logit contributions
 like+0.008
 soon+0.008
 forward+0.007
 than+0.007
 surely+0.007
Neg logit contributions
 spoon-0.015
 carrots-0.015
 peas-0.014
berries-0.014
 lettuce-0.014
Token the
Token ID262
Feature activation-3.90e-03

Pos logit contributions
ually+0.036
 like+0.036
 than+0.034
te+0.033
vy+0.033
Neg logit contributions
 spoon-0.051
 carrots-0.050
 knives-0.048
 peas-0.048
berries-0.046
Token They
Token ID1119
Feature activation+1.05e-02

Pos logit contributions
 like+0.006
te+0.006
 soon+0.006
away+0.006
 surely+0.006
Neg logit contributions
 carrots-0.009
 spoon-0.009
berries-0.009
 peas-0.009
 berries-0.009
Token seemed
Token ID3947
Feature activation+3.84e-02

Pos logit contributions
 soon+0.012
 like+0.012
 surely+0.009
 after+0.009
 when+0.009
Neg logit contributions
 spoon-0.023
 carrots-0.023
berries-0.023
 knives-0.023
 peas-0.022
Token so
Token ID523
Feature activation+3.71e-03

Pos logit contributions
ually+0.039
bek+0.039
te+0.038
 soon+0.035
 Jenn+0.035
Neg logit contributions
 spoon-0.082
 carrots-0.080
 peas-0.078
 lettuce-0.077
 carrot-0.073
Token happy
Token ID3772
Feature activation+1.63e-02

Pos logit contributions
 soon+0.004
te+0.004
 like+0.004
 surely+0.004
 forward+0.004
Neg logit contributions
 spoon-0.008
 carrots-0.007
 peas-0.007
 lettuce-0.007
 knives-0.007
Token.
Token ID13
Feature activation+9.04e-03

Pos logit contributions
 like+0.015
 soon+0.014
 than+0.013
 after+0.012
 forward+0.012
Neg logit contributions
 spoon-0.040
 carrots-0.039
 peas-0.038
 lettuce-0.038
 knives-0.038
Token Carly
Token ID35982
Feature activation+1.16e-01

Pos logit contributions
 like+0.011
te+0.011
away+0.011
 soon+0.011
 surely+0.010
Neg logit contributions
 carrots-0.016
 spoon-0.016
berries-0.016
 peas-0.015
 sleeves-0.015
Token felt
Token ID2936
Feature activation+9.26e-03

Pos logit contributions
 soon+0.139
 like+0.136
 than+0.120
te+0.116
 forward+0.115
Neg logit contributions
 spoon-0.245
 carrots-0.242
berries-0.237
 knives-0.236
 peas-0.236
Token so
Token ID523
Feature activation-2.30e-03

Pos logit contributions
bek+0.010
ually+0.010
te+0.009
 Jenn+0.009
organ+0.009
Neg logit contributions
 carrots-0.019
 spoon-0.019
 peas-0.019
 lettuce-0.018
 potatoes-0.017
Token good
Token ID922
Feature activation+1.63e-02

Pos logit contributions
 spoon+0.005
 carrots+0.005
 peas+0.005
 lettuce+0.005
 knives+0.005
Neg logit contributions
 like-0.002
 soon-0.002
te-0.002
 forward-0.002
 than-0.002
Token running
Token ID2491
Feature activation-5.02e-03

Pos logit contributions
 soon+0.015
 surely+0.015
ually+0.015
 like+0.015
te+0.015
Neg logit contributions
 spoon-0.036
 carrots-0.035
 peas-0.034
berries-0.034
 knives-0.033
Token about
Token ID546
Feature activation+5.79e-02

Pos logit contributions
 spoon+0.011
 carrots+0.011
berries+0.010
 peas+0.010
 lettuce+0.010
Neg logit contributions
ually-0.004
 soon-0.004
 known-0.004
 surely-0.004
bek-0.004
Token this
Token ID428
Feature activation-1.41e-02

Pos logit contributions
 spoon+0.012
 carrots+0.012
 peas+0.012
berries+0.012
 lettuce+0.012
Neg logit contributions
 like-0.005
 soon-0.005
 forward-0.004
 than-0.004
 surely-0.004
Token flower
Token ID15061
Feature activation-3.81e-02

Pos logit contributions
 spoon+0.022
 carrots+0.022
 peas+0.021
berries+0.021
 lettuce+0.021
Neg logit contributions
 like-0.024
 soon-0.023
 than-0.022
 forward-0.022
 surely-0.021
Token was
Token ID373
Feature activation+3.63e-03

Pos logit contributions
 carrots+0.086
 peas+0.086
 spoon+0.085
 lettuce+0.085
 knives+0.083
Neg logit contributions
 like-0.041
 soon-0.040
 than-0.034
 when-0.033
 surely-0.032
Token even
Token ID772
Feature activation-8.04e-04

Pos logit contributions
 like+0.004
 soon+0.004
 forward+0.004
 than+0.004
 surely+0.004
Neg logit contributions
 spoon-0.008
 carrots-0.008
 peas-0.007
berries-0.007
 lettuce-0.007
Token l
Token ID300
Feature activation-3.77e-02

Pos logit contributions
 spoon+0.002
 carrots+0.002
 peas+0.002
 lettuce+0.002
berries+0.002
Neg logit contributions
 like-0.001
 soon-0.001
 forward-0.001
 than-0.001
te-0.001
Tokenitt
Token ID715
Feature activation-1.30e-01

Pos logit contributions
 spoon+0.072
 carrots+0.070
 peas+0.070
 knives+0.068
 lettuce+0.067
Neg logit contributions
 like-0.056
 forward-0.052
 soon-0.051
 than-0.049
te-0.048
Tokenler
Token ID1754
Feature activation-1.06e-02

Pos logit contributions
 spoon+0.215
 carrots+0.215
 couch+0.212
 peas+0.212
 bowl+0.211
Neg logit contributions
 like-0.197
 than-0.195
 soon-0.190
 as-0.176
 after-0.175
Token than
Token ID621
Feature activation+7.32e-02

Pos logit contributions
 spoon+0.026
 peas+0.025
 carrots+0.025
 lettuce+0.025
berries+0.025
Neg logit contributions
 like-0.011
 than-0.009
 soon-0.009
 when-0.008
 from-0.008
Token the
Token ID262
Feature activation-6.97e-03

Pos logit contributions
 like+0.074
 soon+0.068
 than+0.060
 forward+0.058
 surely+0.057
Neg logit contributions
 spoon-0.172
berries-0.167
 carrots-0.167
 peas-0.165
 lettuce-0.163
Token da
Token ID12379
Feature activation-2.76e-02

Pos logit contributions
berries+0.011
 knives+0.011
 spoon+0.011
 peas+0.011
 carrots+0.011
Neg logit contributions
 like-0.013
 soon-0.012
 than-0.011
 when-0.011
 after-0.011
Tokenisy
Token ID13560
Feature activation-2.22e-02

Pos logit contributions
 forks+0.044
 knives+0.044
 spoon+0.043
 peas+0.043
 carrots+0.042
Neg logit contributions
 like-0.049
 soon-0.046
 than-0.045
te-0.044
 surely-0.042
Token
Token ID198
Feature activation-7.95e-03

Pos logit contributions
 spoon+0.009
 carrots+0.009
 peas+0.009
berries+0.009
 lettuce+0.009
Neg logit contributions
 like-0.005
 soon-0.005
 than-0.004
 forward-0.004
te-0.004
TokenThey
Token ID2990
Feature activation-2.36e-03

Pos logit contributions
 spoon+0.018
 carrots+0.018
 peas+0.017
berries+0.017
 knives+0.017
Neg logit contributions
 like-0.008
 soon-0.007
 forward-0.007
 surely-0.007
 than-0.007
Token get
Token ID651
Feature activation+1.63e-03

Pos logit contributions
berries+0.005
 carrots+0.005
 knives+0.005
 peas+0.005
 lettuce+0.005
Neg logit contributions
 like-0.003
 soon-0.003
 surely-0.002
 after-0.002
 over-0.002
Token a
Token ID257
Feature activation+8.36e-03

Pos logit contributions
 like+0.001
 soon+0.001
 forward+0.001
 after+0.001
 over+0.001
Neg logit contributions
 spoon-0.004
 carrots-0.004
berries-0.004
 peas-0.004
 lettuce-0.004
Token m
Token ID285
Feature activation-3.60e-02

Pos logit contributions
 like+0.015
 soon+0.014
 after+0.014
 surely+0.013
 forward+0.013
Neg logit contributions
berries-0.012
 peas-0.011
 carrots-0.011
 knives-0.011
pill-0.011
Tokenitt
Token ID715
Feature activation-1.30e-01

Pos logit contributions
 carrots+0.067
 peas+0.065
 lettuce+0.064
 spoon+0.062
 knives+0.061
Neg logit contributions
 soon-0.051
 like-0.050
 forward-0.048
ook-0.048
te-0.048
Token and
Token ID290
Feature activation-2.24e-02

Pos logit contributions
berries+0.270
 spoon+0.261
 peas+0.260
 carrots+0.260
 lettuce+0.259
Neg logit contributions
 like-0.157
 soon-0.153
 from-0.143
 than-0.143
 after-0.143
Token a
Token ID257
Feature activation+4.77e-03

Pos logit contributions
berries+0.050
 spoon+0.047
opus+0.047
 peas+0.046
 carrots+0.046
Neg logit contributions
 like-0.026
 soon-0.024
 after-0.021
 as-0.020
 when-0.020
Token spat
Token ID15246
Feature activation-3.29e-02

Pos logit contributions
 like+0.009
 soon+0.009
 surely+0.008
 after+0.008
 from+0.008
Neg logit contributions
berries-0.006
pill-0.006
 carrots-0.005
 knives-0.005
 peas-0.005
Tokenula
Token ID4712
Feature activation+7.13e-04

Pos logit contributions
 spoon+0.076
 carrots+0.074
 peas+0.072
berries+0.071
 lettuce+0.071
Neg logit contributions
 like-0.032
 soon-0.031
 than-0.028
 forward-0.028
 surely-0.028
Token.
Token ID13
Feature activation-7.11e-03

Pos logit contributions
 like+0.001
 after+0.001
 soon+0.001
 from+0.001
 than+0.001
Neg logit contributions
 peas-0.002
 carrots-0.002
 spoon-0.002
berries-0.001
 lettuce-0.001
Token,
Token ID11
Feature activation+1.35e-02

Pos logit contributions
 like+0.038
 soon+0.037
 forward+0.033
 than+0.033
 surely+0.032
Neg logit contributions
 spoon-0.106
 carrots-0.104
 peas-0.102
berries-0.101
 lettuce-0.101
Token there
Token ID612
Feature activation-3.44e-03

Pos logit contributions
 than+0.017
 soon+0.016
 like+0.016
 surely+0.016
 forward+0.016
Neg logit contributions
 spoon-0.026
 carrots-0.025
 peas-0.024
berries-0.024
 lettuce-0.024
Token was
Token ID373
Feature activation+1.45e-02

Pos logit contributions
 spoon+0.007
 carrots+0.006
 peas+0.006
berries+0.006
 knives+0.006
Neg logit contributions
 like-0.005
 soon-0.005
 forward-0.005
 than-0.004
 surely-0.004
Token a
Token ID257
Feature activation-9.94e-03

Pos logit contributions
 like+0.014
 soon+0.013
 forward+0.013
 than+0.013
 surely+0.012
Neg logit contributions
 spoon-0.033
 carrots-0.033
 peas-0.032
 lettuce-0.032
berries-0.031
Token l
Token ID300
Feature activation-3.37e-02

Pos logit contributions
 spoon+0.013
 carrots+0.013
berries+0.013
 peas+0.013
 knives+0.013
Neg logit contributions
 like-0.019
 soon-0.019
 forward-0.017
 than-0.017
te-0.017
Tokenitt
Token ID715
Feature activation-1.26e-01

Pos logit contributions
 spoon+0.064
 carrots+0.062
 peas+0.061
berries+0.060
 lettuce+0.060
Neg logit contributions
 like-0.048
 soon-0.045
 forward-0.044
 than-0.043
te-0.043
Tokentle
Token ID7100
Feature activation-2.59e-02

Pos logit contributions
 forks+0.244
berries+0.244
 knives+0.244
 spoon+0.242
 bowls+0.242
Neg logit contributions
 like-0.162
 than-0.160
 soon-0.152
 when-0.150
 as-0.143
Token girl
Token ID2576
Feature activation+7.16e-03

Pos logit contributions
 spoon+0.041
 knives+0.039
 carrots+0.039
 peas+0.039
 lettuce+0.038
Neg logit contributions
 like-0.046
 soon-0.042
 than-0.042
 forward-0.041
 when-0.040
Token.
Token ID13
Feature activation-2.63e-04

Pos logit contributions
 surely+0.006
 forward+0.006
 like+0.006
 soon+0.006
ually+0.006
Neg logit contributions
 spoon-0.017
 carrots-0.016
berries-0.016
 peas-0.016
 knives-0.016
Token She
Token ID1375
Feature activation+3.14e-03

Pos logit contributions
 spoon+0.000
berries+0.000
 carrots+0.000
 peas+0.000
 lettuce+0.000
Neg logit contributions
 soon-0.000
 forward-0.000
 surely-0.000
 like-0.000
ually-0.000
Token had
Token ID550
Feature activation+1.76e-02

Pos logit contributions
 forward+0.004
 surely+0.004
 soon+0.004
 like+0.004
ually+0.004
Neg logit contributions
 spoon-0.006
 carrots-0.006
 peas-0.006
 lettuce-0.006
 carrot-0.006
Token sund
Token ID37437
Feature activation-2.19e-02

Pos logit contributions
 carrots+0.002
berries+0.002
 knives+0.002
 peas+0.002
 lettuce+0.002
Neg logit contributions
 like-0.002
 soon-0.002
 forward-0.002
 surely-0.002
 after-0.002
Tokenae
Token ID3609
Feature activation+8.97e-03

Pos logit contributions
 spoon+0.044
 carrots+0.042
 peas+0.041
berries+0.041
 lettuce+0.040
Neg logit contributions
 like-0.028
 soon-0.027
 forward-0.026
 than-0.025
 surely-0.025
Token with
Token ID351
Feature activation-4.58e-02

Pos logit contributions
 like+0.010
 soon+0.009
 than+0.008
 after+0.008
 from+0.008
Neg logit contributions
 knives-0.020
 lettuce-0.020
 carrots-0.020
berries-0.020
 peas-0.020
Token lots
Token ID6041
Feature activation-4.74e-02

Pos logit contributions
 spoon+0.116
berries+0.116
 peas+0.115
 knives+0.113
opus+0.113
Neg logit contributions
 like-0.035
 soon-0.030
 their-0.024
 forward-0.023
 from-0.022
Token of
Token ID286
Feature activation-2.16e-02

Pos logit contributions
 spoon+0.122
 peas+0.121
 knives+0.121
 lettuce+0.119
 elbow+0.119
Neg logit contributions
 like-0.036
 than-0.031
 from-0.029
 as-0.028
 after-0.027
Token topp
Token ID23126
Feature activation-1.22e-01

Pos logit contributions
 bur+0.035
opus+0.035
berries+0.034
 lettuce+0.033
 knives+0.032
Neg logit contributions
 like-0.038
 soon-0.031
 their-0.031
 after-0.030
 than-0.030
Tokenings
Token ID654
Feature activation+3.95e-02

Pos logit contributions
 carrots+0.288
 spoon+0.285
 peas+0.282
 lettuce+0.282
 knives+0.279
Neg logit contributions
 like-0.099
 than-0.097
 soon-0.092
te-0.084
ook-0.083
Token.
Token ID13
Feature activation-7.21e-03

Pos logit contributions
 like+0.051
 soon+0.049
 forward+0.046
 than+0.045
 after+0.044
Neg logit contributions
 spoon-0.081
 carrots-0.078
 peas-0.077
 lettuce-0.076
berries-0.075
Token
Token ID198
Feature activation-8.97e-03

Pos logit contributions
 spoon+0.015
 carrots+0.014
 peas+0.014
berries+0.014
 lettuce+0.014
Neg logit contributions
 like-0.009
 soon-0.009
 forward-0.008
 than-0.008
 surely-0.008
Token
Token ID198
Feature activation-1.11e-02

Pos logit contributions
 spoon+0.015
 carrots+0.015
berries+0.014
 lettuce+0.014
 peas+0.014
Neg logit contributions
 like-0.014
 surely-0.013
 than-0.013
 soon-0.013
 forward-0.013
TokenThey
Token ID2990
Feature activation+2.66e-03

Pos logit contributions
 spoon+0.022
 carrots+0.021
berries+0.020
 lettuce+0.020
 knives+0.020
Neg logit contributions
 like-0.013
 than-0.012
 surely-0.012
 forward-0.012
ually-0.011
Token She
Token ID1375
Feature activation+3.72e-03

Pos logit contributions
 spoon+0.009
 carrots+0.009
 peas+0.009
berries+0.008
 knives+0.008
Neg logit contributions
 like-0.007
 soon-0.007
 forward-0.006
 surely-0.006
 after-0.006
Token told
Token ID1297
Feature activation-1.58e-02

Pos logit contributions
 soon+0.005
 like+0.005
 surely+0.004
te+0.004
 than+0.004
Neg logit contributions
 spoon-0.007
 carrots-0.007
berries-0.007
 peas-0.007
 lettuce-0.007
Token him
Token ID683
Feature activation-3.27e-02

Pos logit contributions
 spoon+0.036
berries+0.035
 peas+0.035
 carrots+0.034
 sleeves+0.034
Neg logit contributions
 like-0.016
 soon-0.015
 as-0.014
 over-0.014
 when-0.014
Token it
Token ID340
Feature activation-8.56e-03

Pos logit contributions
 spoon+0.077
berries+0.075
 carrots+0.075
 peas+0.074
 lettuce+0.072
Neg logit contributions
 like-0.032
 soon-0.031
 after-0.025
 when-0.025
 forward-0.025
Token was
Token ID373
Feature activation-5.57e-03

Pos logit contributions
 spoon+0.019
berries+0.019
 peas+0.019
 carrots+0.019
 lettuce+0.018
Neg logit contributions
 like-0.009
 soon-0.009
 than-0.008
 over-0.008
 when-0.008
Token seaw
Token ID32085
Feature activation-1.21e-01

Pos logit contributions
 spoon+0.013
 carrots+0.013
berries+0.013
 peas+0.013
 lettuce+0.012
Neg logit contributions
 like-0.005
 soon-0.005
 than-0.005
 forward-0.005
 surely-0.004
Tokeneed
Token ID2308
Feature activation-2.06e-02

Pos logit contributions
 spoon+0.218
berries+0.217
 peas+0.212
 knives+0.211
 carrots+0.210
Neg logit contributions
 like-0.177
 than-0.173
ook-0.161
 soon-0.149
 forward-0.145
Token.
Token ID13
Feature activation+7.52e-04

Pos logit contributions
 spoon+0.047
berries+0.047
 peas+0.046
 carrots+0.046
 knives+0.046
Neg logit contributions
 like-0.022
 soon-0.019
 than-0.019
 from-0.016
 after-0.016
Token Despite
Token ID7945
Feature activation+2.26e-02

Pos logit contributions
 like+0.001
 soon+0.001
 forward+0.001
 when+0.001
 after+0.001
Neg logit contributions
 carrots-0.002
 spoon-0.002
 peas-0.002
 berries-0.002
 knives-0.002
Token the
Token ID262
Feature activation-2.37e-02

Pos logit contributions
 like+0.021
 soon+0.019
 forward+0.017
 after+0.016
 than+0.016
Neg logit contributions
 spoon-0.056
berries-0.055
 carrots-0.053
 peas-0.053
 lettuce-0.052
Token smell
Token ID8508
Feature activation+1.07e-02

Pos logit contributions
berries+0.044
pill+0.042
 knives+0.036
itt+0.036
ridge+0.035
Neg logit contributions
 like-0.044
 than-0.037
 after-0.036
 forward-0.035
 soon-0.035
Token and
Token ID290
Feature activation+1.03e-03

Pos logit contributions
 than+0.020
ually+0.019
 like+0.018
 known+0.018
 soon+0.018
Neg logit contributions
 spoon-0.042
 carrots-0.040
berries-0.039
 carrot-0.039
 peas-0.038
Token get
Token ID651
Feature activation+6.24e-03

Pos logit contributions
 like+0.001
 soon+0.001
 surely+0.001
 after+0.001
 over+0.001
Neg logit contributions
 carrots-0.002
berries-0.002
 peas-0.002
 spoon-0.002
 knives-0.002
Token it
Token ID340
Feature activation-2.97e-03

Pos logit contributions
 soon+0.006
 like+0.006
 after+0.005
 forward+0.005
 over+0.005
Neg logit contributions
berries-0.015
 spoon-0.015
 peas-0.015
 carrots-0.015
 lettuce-0.015
Token,
Token ID11
Feature activation+5.78e-03

Pos logit contributions
 spoon+0.007
 peas+0.006
 knives+0.006
 carrots+0.006
 lettuce+0.006
Neg logit contributions
 soon-0.003
 like-0.003
 over-0.003
 forward-0.003
 than-0.003
Token k
Token ID479
Feature activation-1.78e-02

Pos logit contributions
 like+0.009
 soon+0.008
 after+0.007
 when+0.007
 forward+0.007
Neg logit contributions
berries-0.011
 spoon-0.011
 carrots-0.011
 peas-0.011
 lettuce-0.011
Tokenitt
Token ID715
Feature activation-1.21e-01

Pos logit contributions
 carrots+0.035
 peas+0.035
 lettuce+0.035
 spoon+0.035
 knives+0.034
Neg logit contributions
 like-0.023
 soon-0.023
 forward-0.022
te-0.021
 than-0.021
Tokenies
Token ID444
Feature activation-8.85e-03

Pos logit contributions
 carrots+0.243
berries+0.240
 lettuce+0.239
 spoon+0.238
 peas+0.237
Neg logit contributions
 soon-0.146
 like-0.142
 than-0.129
 when-0.125
 after-0.123
Token!"
Token ID2474
Feature activation-3.09e-02

Pos logit contributions
 lettuce+0.019
 knives+0.019
 peas+0.019
 carrots+0.019
 spoon+0.019
Neg logit contributions
 like-0.012
 soon-0.011
 after-0.009
 than-0.009
 when-0.009
Token she
Token ID673
Feature activation-1.41e-02

Pos logit contributions
 carrots+0.068
 peas+0.068
 spoon+0.067
 lettuce+0.066
 knives+0.066
Neg logit contributions
 soon-0.035
 like-0.034
 after-0.032
 when-0.031
 from-0.029
Token called
Token ID1444
Feature activation+4.79e-02

Pos logit contributions
 spoon+0.026
 carrots+0.026
berries+0.026
 peas+0.026
 lettuce+0.026
Neg logit contributions
 soon-0.020
 like-0.020
 than-0.017
te-0.017
 after-0.017
Token,
Token ID11
Feature activation-9.64e-03

Pos logit contributions
 like+0.046
 soon+0.044
 forward+0.040
 than+0.039
 surely+0.038
Neg logit contributions
 spoon-0.112
 carrots-0.110
 peas-0.108
berries-0.107
 lettuce-0.107
Token called
Token ID1444
Feature activation+3.85e-02

Pos logit contributions
 spoon+0.038
 lettuce+0.038
 carrots+0.038
 peas+0.037
berries+0.037
Neg logit contributions
 like-0.014
 soon-0.013
 than-0.011
 when-0.011
 after-0.010
Token Yet
Token ID6430
Feature activation-3.75e-02

Pos logit contributions
 like+0.037
 forward+0.031
 from+0.030
 soon+0.029
 over+0.029
Neg logit contributions
 spoon-0.096
berries-0.092
 peas-0.091
 carrots-0.089
 lettuce-0.088
Tokeni
Token ID72
Feature activation-8.48e-03

Pos logit contributions
 spoon+0.080
 lettuce+0.075
berries+0.075
 carrots+0.074
 peas+0.074
Neg logit contributions
 soon-0.046
 like-0.046
 than-0.041
 after-0.041
 forward-0.040
Token was
Token ID373
Feature activation-1.85e-02

Pos logit contributions
 spoon+0.019
 peas+0.018
 lettuce+0.018
 carrots+0.018
berries+0.018
Neg logit contributions
 like-0.010
 soon-0.010
 when-0.009
 forward-0.009
 than-0.008
Token also
Token ID635
Feature activation-1.89e-02

Pos logit contributions
 spoon+0.039
berries+0.038
 carrots+0.037
 peas+0.037
 lettuce+0.037
Neg logit contributions
 like-0.023
 soon-0.022
 forward-0.020
 than-0.020
 surely-0.019
Token living
Token ID2877
Feature activation-1.19e-01

Pos logit contributions
 spoon+0.041
berries+0.040
 peas+0.039
 carrots+0.039
 lettuce+0.039
Neg logit contributions
 like-0.023
 soon-0.022
 than-0.020
 forward-0.020
 when-0.018
Token in
Token ID287
Feature activation-4.86e-02

Pos logit contributions
 spoon+0.299
 lettuce+0.292
berries+0.291
 carrots+0.286
 peas+0.284
Neg logit contributions
 like-0.102
 soon-0.097
 forward-0.088
 than-0.088
 after-0.086
Token the
Token ID262
Feature activation-1.74e-02

Pos logit contributions
 spoon+0.123
berries+0.120
 peas+0.118
 lettuce+0.117
 carrots+0.116
Neg logit contributions
 like-0.041
 soon-0.039
 than-0.034
 after-0.031
 forward-0.031
Token forest
Token ID8222
Feature activation-3.87e-02

Pos logit contributions
 spoon+0.032
berries+0.032
 peas+0.029
 knives+0.029
 forks+0.028
Neg logit contributions
 like-0.029
 soon-0.027
 forward-0.026
 after-0.025
 when-0.024
Token.
Token ID13
Feature activation+1.35e-04

Pos logit contributions
 spoon+0.082
 peas+0.079
 carrots+0.079
 lettuce+0.078
berries+0.078
Neg logit contributions
 like-0.050
 soon-0.049
 than-0.045
 when-0.043
 after-0.043
Token
Token ID220
Feature activation-2.98e-02

Pos logit contributions
 soon+0.000
 like+0.000
 When+0.000
 when+0.000
 over+0.000
Neg logit contributions
 spoon-0.000
 peas-0.000
 knives-0.000
 lettuce-0.000
berries-0.000
Token.
Token ID13
Feature activation+6.48e-04

Pos logit contributions
 like+0.016
ually+0.016
 surely+0.016
 soon+0.016
 known+0.016
Neg logit contributions
 spoon-0.018
 carrots-0.016
berries-0.016
 knives-0.016
 peas-0.016
Token She
Token ID1375
Feature activation+6.38e-03

Pos logit contributions
 like+0.001
 soon+0.001
 forward+0.001
 than+0.001
te+0.001
Neg logit contributions
 spoon-0.001
 carrots-0.001
 peas-0.001
 lettuce-0.001
berries-0.001
Token used
Token ID973
Feature activation+1.87e-02

Pos logit contributions
 soon+0.008
 like+0.008
 than+0.007
te+0.007
 surely+0.007
Neg logit contributions
 carrots-0.013
 spoon-0.013
berries-0.013
 peas-0.013
 lettuce-0.013
Token a
Token ID257
Feature activation+1.17e-02

Pos logit contributions
 like+0.013
 soon+0.013
 over+0.011
 after+0.010
 from+0.009
Neg logit contributions
 peas-0.047
berries-0.047
 spoon-0.047
 carrots-0.046
opus-0.046
Token m
Token ID285
Feature activation-2.46e-02

Pos logit contributions
 like+0.018
 soon+0.016
 forward+0.016
 surely+0.015
 true+0.015
Neg logit contributions
berries-0.021
 peas-0.020
 carrots-0.020
 knives-0.019
 berries-0.019
Tokenitt
Token ID715
Feature activation-1.18e-01

Pos logit contributions
 carrots+0.041
 peas+0.039
 lettuce+0.039
 berries+0.036
 spoon+0.036
Neg logit contributions
 soon-0.041
 like-0.041
 than-0.040
te-0.040
 forward-0.039
Token to
Token ID284
Feature activation-1.23e-02

Pos logit contributions
 peas+0.249
 spoon+0.249
 carrots+0.248
berries+0.244
 lettuce+0.241
Neg logit contributions
 like-0.135
 soon-0.131
 than-0.125
 after-0.120
 from-0.115
Token take
Token ID1011
Feature activation-3.03e-02

Pos logit contributions
 spoon+0.022
 carrots+0.022
berries+0.021
 peas+0.021
 lettuce+0.021
Neg logit contributions
 like-0.018
 soon-0.018
 surely-0.016
 than-0.016
 forward-0.016
Token out
Token ID503
Feature activation+2.53e-02

Pos logit contributions
 spoon+0.069
 carrots+0.068
 peas+0.066
 knives+0.065
 lettuce+0.065
Neg logit contributions
 like-0.029
 soon-0.028
 than-0.027
 forward-0.026
ually-0.026
Token the
Token ID262
Feature activation-3.44e-03

Pos logit contributions
 like+0.024
 soon+0.023
 forward+0.019
 after+0.019
 from+0.019
Neg logit contributions
 spoon-0.061
 peas-0.059
berries-0.059
 carrots-0.059
 lettuce-0.058
Token pan
Token ID3425
Feature activation-9.03e-03

Pos logit contributions
pill+0.005
berries+0.005
 lettuce+0.005
 carrots+0.005
opus+0.005
Neg logit contributions
 like-0.006
 soon-0.005
 forward-0.005
 very-0.005
 after-0.005
Token sad
Token ID6507
Feature activation+6.27e-03

Pos logit contributions
away+0.009
bek+0.009
ually+0.009
 than+0.009
riage+0.009
Neg logit contributions
 carrots-0.012
 spoon-0.012
 peas-0.012
 lettuce-0.012
 soup-0.011
Token because
Token ID780
Feature activation+6.84e-03

Pos logit contributions
 like+0.004
 soon+0.004
 than+0.003
 when+0.003
 after+0.003
Neg logit contributions
 spoon-0.017
 carrots-0.017
 peas-0.017
 lettuce-0.017
berries-0.017
Token he
Token ID339
Feature activation-2.52e-02

Pos logit contributions
 soon+0.010
 like+0.010
 forward+0.009
 than+0.009
 surely+0.009
Neg logit contributions
 spoon-0.013
 carrots-0.012
 peas-0.012
berries-0.012
 lettuce-0.012
Token had
Token ID550
Feature activation+8.12e-03

Pos logit contributions
 spoon+0.055
 carrots+0.054
berries+0.054
 peas+0.053
 lettuce+0.053
Neg logit contributions
 like-0.029
 soon-0.028
 than-0.024
te-0.024
 surely-0.024
Token been
Token ID587
Feature activation-2.73e-02

Pos logit contributions
 like+0.008
 soon+0.008
 known+0.006
 when+0.006
 after+0.006
Neg logit contributions
berries-0.020
 spoon-0.020
 carrots-0.019
 peas-0.019
 lettuce-0.019
Token living
Token ID2877
Feature activation-1.16e-01

Pos logit contributions
 spoon+0.057
berries+0.057
 carrots+0.056
 peas+0.055
 lettuce+0.055
Neg logit contributions
 like-0.033
 soon-0.033
 forward-0.029
 than-0.029
 after-0.028
Token alone
Token ID3436
Feature activation-4.85e-02

Pos logit contributions
 spoon+0.250
 carrots+0.244
 peas+0.237
 knives+0.234
 lettuce+0.232
Neg logit contributions
 like-0.126
 soon-0.124
 surely-0.120
te-0.119
ually-0.117
Token in
Token ID287
Feature activation-3.53e-02

Pos logit contributions
 spoon+0.108
 carrots+0.105
 peas+0.103
berries+0.102
 lettuce+0.102
Neg logit contributions
 like-0.052
 soon-0.051
 forward-0.047
 than-0.046
 surely-0.044
Token a
Token ID257
Feature activation+3.59e-03

Pos logit contributions
 spoon+0.090
 carrots+0.088
berries+0.087
 peas+0.087
 lettuce+0.086
Neg logit contributions
 like-0.026
 soon-0.026
 than-0.021
 forward-0.021
 surely-0.021
Token cabin
Token ID9351
Feature activation-1.07e-02

Pos logit contributions
 soon+0.005
 like+0.005
 forward+0.005
 after+0.004
away+0.004
Neg logit contributions
berries-0.007
 carrots-0.007
 spoon-0.007
 peas-0.007
 knives-0.007
Token in
Token ID287
Feature activation-3.83e-02

Pos logit contributions
 spoon+0.021
 carrots+0.020
 peas+0.020
 lettuce+0.019
 knives+0.019
Neg logit contributions
 like-0.013
 soon-0.013
 forward-0.012
 surely-0.012
te-0.012
Tokenelly
Token ID6148
Feature activation-8.52e-03

Pos logit contributions
 spoon+0.012
 carrots+0.011
berries+0.011
 peas+0.011
 knives+0.011
Neg logit contributions
 soon-0.011
 like-0.011
 surely-0.010
ually-0.010
 forward-0.010
Token anymore
Token ID7471
Feature activation+5.53e-04

Pos logit contributions
 spoon+0.021
 lettuce+0.020
 carrots+0.020
 peas+0.020
berries+0.020
Neg logit contributions
 like-0.009
 soon-0.007
 after-0.007
 than-0.007
 when-0.007
Token and
Token ID290
Feature activation+7.09e-03

Pos logit contributions
 like+0.001
 soon+0.001
 than+0.001
 when+0.001
 after+0.000
Neg logit contributions
 spoon-0.001
 lettuce-0.001
 peas-0.001
 carrots-0.001
berries-0.001
Token the
Token ID262
Feature activation+2.36e-03

Pos logit contributions
 like+0.008
 soon+0.008
 than+0.007
 after+0.007
 when+0.007
Neg logit contributions
 spoon-0.016
berries-0.016
 carrots-0.015
 peas-0.015
 lettuce-0.014
Token l
Token ID300
Feature activation-1.47e-02

Pos logit contributions
 like+0.004
 soon+0.004
 when+0.003
 after+0.003
 very+0.003
Neg logit contributions
berries-0.004
pill-0.004
 lettuce-0.003
lers-0.003
 peas-0.003
Tokenitt
Token ID715
Feature activation-1.15e-01

Pos logit contributions
 carrots+0.030
berries+0.030
 peas+0.029
 spoon+0.029
 lettuce+0.029
Neg logit contributions
 like-0.019
ook-0.018
te-0.017
 forward-0.017
 than-0.016
Tokenlest
Token ID32712
Feature activation-3.59e-02

Pos logit contributions
 carrots+0.202
berries+0.199
 spoon+0.198
 bur+0.197
 peas+0.197
Neg logit contributions
 soon-0.168
 like-0.165
 after-0.153
 when-0.153
 than-0.151
Token ones
Token ID3392
Feature activation-9.10e-03

Pos logit contributions
 lettuce+0.064
berries+0.060
 peas+0.060
 carrots+0.060
 sleeves+0.059
Neg logit contributions
 like-0.058
 soon-0.051
 than-0.050
 from-0.049
 when-0.049
Token can
Token ID460
Feature activation+1.37e-02

Pos logit contributions
 carrots+0.021
 lettuce+0.020
berries+0.020
 peas+0.020
 spoon+0.020
Neg logit contributions
 like-0.011
 soon-0.010
 than-0.009
 when-0.009
 after-0.008
Token go
Token ID467
Feature activation+2.02e-02

Pos logit contributions
 like+0.017
 soon+0.017
 surely+0.013
 over+0.013
 from+0.012
Neg logit contributions
 knives-0.032
 peas-0.032
 forks-0.032
 carrots-0.031
 lettuce-0.031
Token up
Token ID510
Feature activation+1.71e-02

Pos logit contributions
 like+0.019
 soon+0.018
 forward+0.017
 than+0.016
 surely+0.016
Neg logit contributions
 spoon-0.048
 carrots-0.047
 peas-0.046
berries-0.045
 lettuce-0.045
Token out
Token ID503
Feature activation+2.78e-02

Pos logit contributions
 like+0.059
 soon+0.058
ually+0.054
 than+0.054
 surely+0.054
Neg logit contributions
 spoon-0.128
 carrots-0.124
 peas-0.120
 knives-0.119
 lettuce-0.119
Token the
Token ID262
Feature activation-5.46e-03

Pos logit contributions
 like+0.029
 soon+0.029
ually+0.027
 surely+0.027
 than+0.027
Neg logit contributions
 spoon-0.060
 carrots-0.059
 knives-0.057
berries-0.057
 peas-0.057
Token mouse
Token ID10211
Feature activation-3.93e-03

Pos logit contributions
 peas+0.010
berries+0.010
 spoon+0.009
 carrots+0.009
 lettuce+0.009
Neg logit contributions
 like-0.008
 forward-0.008
 soon-0.008
 after-0.008
 from-0.008
Token had
Token ID550
Feature activation+5.69e-03

Pos logit contributions
 peas+0.009
 spoon+0.008
berries+0.008
 lettuce+0.008
 carrots+0.008
Neg logit contributions
 like-0.005
 soon-0.005
 when-0.004
 over-0.004
 from-0.004
Token been
Token ID587
Feature activation-3.11e-02

Pos logit contributions
 like+0.006
 soon+0.006
 known+0.005
 over+0.005
 from+0.005
Neg logit contributions
berries-0.014
 spoon-0.014
 peas-0.013
 carrots-0.013
 lettuce-0.013
Token living
Token ID2877
Feature activation-1.14e-01

Pos logit contributions
 spoon+0.060
berries+0.059
 carrots+0.058
 peas+0.058
 lettuce+0.057
Neg logit contributions
 like-0.044
 soon-0.043
 than-0.039
 forward-0.039
 over-0.037
Token in
Token ID287
Feature activation-4.41e-02

Pos logit contributions
 spoon+0.277
 carrots+0.273
 peas+0.269
berries+0.268
 lettuce+0.268
Neg logit contributions
 like-0.099
 soon-0.096
 forward-0.087
 than-0.086
 after-0.081
Token the
Token ID262
Feature activation-2.19e-02

Pos logit contributions
 spoon+0.104
 carrots+0.102
 peas+0.100
berries+0.099
 lettuce+0.099
Neg logit contributions
 like-0.041
 soon-0.040
 forward-0.036
 than-0.036
 surely-0.035
Token book
Token ID1492
Feature activation-1.63e-02

Pos logit contributions
berries+0.037
 peas+0.036
 spoon+0.036
 carrots+0.034
 knives+0.033
Neg logit contributions
 like-0.036
 forward-0.035
 soon-0.034
 after-0.033
 surely-0.032
Tokencase
Token ID7442
Feature activation-9.85e-03

Pos logit contributions
 spoon+0.034
 carrots+0.033
 peas+0.033
berries+0.032
 lettuce+0.032
Neg logit contributions
 like-0.021
 soon-0.020
 forward-0.018
 than-0.018
 surely-0.017
Token and
Token ID290
Feature activation+3.42e-03

Pos logit contributions
 spoon+0.020
 carrots+0.019
 peas+0.019
berries+0.018
 lettuce+0.018
Neg logit contributions
 like-0.013
 soon-0.012
 forward-0.012
 than-0.011
 surely-0.011
Token sometimes
Token ID3360
Feature activation-1.17e-02

Pos logit contributions
 spoon+0.020
berries+0.019
 carrots+0.019
 peas+0.019
 lettuce+0.019
Neg logit contributions
 soon-0.009
 like-0.008
 after-0.007
 when-0.007
 than-0.007
Token,
Token ID11
Feature activation+1.17e-02

Pos logit contributions
 spoon+0.027
berries+0.026
 peas+0.025
 carrots+0.025
 lettuce+0.025
Neg logit contributions
 soon-0.013
 like-0.013
 than-0.011
 after-0.011
 when-0.011
Token even
Token ID772
Feature activation+2.66e-02

Pos logit contributions
 like+0.013
 soon+0.013
 than+0.011
 after+0.011
 when+0.011
Neg logit contributions
 spoon-0.027
 carrots-0.025
berries-0.025
 peas-0.025
 lettuce-0.025
Token the
Token ID262
Feature activation-1.24e-02

Pos logit contributions
 like+0.018
 soon+0.017
 forward+0.016
 than+0.015
 surely+0.015
Neg logit contributions
 spoon-0.069
 carrots-0.068
 peas-0.066
 lettuce-0.065
berries-0.065
Token l
Token ID300
Feature activation-1.72e-02

Pos logit contributions
berries+0.024
pill+0.023
 carrots+0.022
 knives+0.022
 lettuce+0.021
Neg logit contributions
 like-0.018
 soon-0.017
 true-0.016
 surely-0.015
 when-0.015
Tokenitt
Token ID715
Feature activation-1.14e-01

Pos logit contributions
 carrots+0.030
 spoon+0.030
 peas+0.029
berries+0.028
 lettuce+0.028
Neg logit contributions
 like-0.028
 than-0.026
ook-0.026
 forward-0.026
te-0.025
Tokenlest
Token ID32712
Feature activation-3.57e-02

Pos logit contributions
 spoon+0.194
 carrots+0.194
berries+0.191
 peas+0.190
 couch+0.189
Neg logit contributions
 like-0.172
 soon-0.169
 than-0.163
 from-0.155
 when-0.153
Token things
Token ID1243
Feature activation+5.27e-04

Pos logit contributions
 carrots+0.054
 lettuce+0.054
 spoon+0.053
 peas+0.052
 couch+0.052
Neg logit contributions
 like-0.065
 than-0.062
 soon-0.060
 from-0.059
 after-0.056
Token can
Token ID460
Feature activation+1.66e-02

Pos logit contributions
 like+0.001
 soon+0.001
 than+0.001
 from+0.001
 forward+0.001
Neg logit contributions
 spoon-0.001
berries-0.001
 peas-0.001
 carrots-0.001
 lettuce-0.001
Token lead
Token ID1085
Feature activation+2.11e-02

Pos logit contributions
 like+0.020
 soon+0.019
 surely+0.017
 from+0.016
 true+0.015
Neg logit contributions
 spoon-0.038
berries-0.038
 peas-0.038
 carrots-0.037
 forks-0.037
Token to
Token ID284
Feature activation-1.20e-02

Pos logit contributions
 like+0.021
 soon+0.020
 than+0.018
 forward+0.018
 after+0.017
Neg logit contributions
 spoon-0.050
 carrots-0.048
 peas-0.048
berries-0.048
 lettuce-0.047
Token loud
Token ID7812
Feature activation+6.71e-03

Pos logit contributions
berries+0.008
 knives+0.007
 carrots+0.007
 peas+0.007
 forks+0.007
Neg logit contributions
 like-0.005
 soon-0.005
 surely-0.004
 forward-0.004
 true-0.004
Token oct
Token ID19318
Feature activation-5.78e-02

Pos logit contributions
 like+0.012
 soon+0.011
 than+0.010
 after+0.010
 from+0.010
Neg logit contributions
berries-0.011
 spoon-0.010
 knives-0.010
 carrots-0.010
 lettuce-0.010
Tokenopus
Token ID25790
Feature activation-4.18e-02

Pos logit contributions
 carrots+0.057
 knives+0.054
 berries+0.054
 peas+0.053
 spoon+0.052
Neg logit contributions
 like-0.131
te-0.128
 than-0.127
 soon-0.124
away-0.123
Token named
Token ID3706
Feature activation-6.17e-03

Pos logit contributions
 spoon+0.096
 peas+0.096
berries+0.095
 carrots+0.095
 lettuce+0.095
Neg logit contributions
 like-0.047
 soon-0.044
 when-0.041
 after-0.039
 than-0.038
Token O
Token ID440
Feature activation-8.39e-02

Pos logit contributions
 spoon+0.012
berries+0.011
 knives+0.011
 sleeves+0.011
 carrots+0.011
Neg logit contributions
 like-0.009
 soon-0.008
 when-0.007
 Pal-0.007
 over-0.007
Tokenll
Token ID297
Feature activation-1.14e-01

Pos logit contributions
 spoon+0.128
 carrots+0.128
 peas+0.123
 lettuce+0.121
 knives+0.120
Neg logit contributions
 soon-0.143
 like-0.141
ook-0.131
 surely-0.130
 than-0.130
Tokenie
Token ID494
Feature activation-2.89e-02

Pos logit contributions
 spoon+0.208
 carrots+0.203
 peas+0.198
berries+0.195
 lettuce+0.194
Neg logit contributions
 like-0.163
 soon-0.159
 forward-0.151
 than-0.150
 surely-0.148
Token who
Token ID508
Feature activation-8.36e-02

Pos logit contributions
 spoon+0.064
 carrots+0.062
 peas+0.061
berries+0.061
 lettuce+0.060
Neg logit contributions
 like-0.035
 soon-0.034
 when-0.030
 than-0.029
 after-0.029
Token liked
Token ID8288
Feature activation-2.46e-02

Pos logit contributions
 spoon+0.163
 carrots+0.158
 peas+0.154
 knives+0.151
 lettuce+0.151
Neg logit contributions
 like-0.108
 forward-0.105
 soon-0.104
 than-0.101
 surely-0.099
Token to
Token ID284
Feature activation-7.60e-03

Pos logit contributions
 spoon+0.070
berries+0.067
 sleeves+0.066
 fork+0.065
 bowl+0.064
Neg logit contributions
 like-0.015
 soon-0.014
 when-0.011
 after-0.011
 from-0.011
Token sing
Token ID1702
Feature activation+1.05e-02

Pos logit contributions
berries+0.015
 knives+0.015
 spoon+0.015
 forks+0.014
 lettuce+0.014
Neg logit contributions
 like-0.012
 soon-0.011
 after-0.009
 when-0.009
 surely-0.009
Token not
Token ID407
Feature activation+1.04e-03

Pos logit contributions
 spoon+0.058
berries+0.057
 knives+0.055
 peas+0.054
 carrots+0.054
Neg logit contributions
 like-0.030
 soon-0.028
 than-0.025
 when-0.023
 over-0.023
Token toys
Token ID14958
Feature activation-3.43e-02

Pos logit contributions
 like+0.001
 than+0.001
 soon+0.001
 from+0.001
 when+0.001
Neg logit contributions
 spoon-0.003
berries-0.003
 peas-0.002
 carrots-0.002
 knives-0.002
Token.
Token ID13
Feature activation+8.35e-03

Pos logit contributions
 spoon+0.078
 carrots+0.077
 lettuce+0.076
 peas+0.076
berries+0.075
Neg logit contributions
 like-0.042
 soon-0.037
 than-0.036
 after-0.034
 from-0.033
Token They
Token ID1119
Feature activation+2.77e-03

Pos logit contributions
 like+0.008
 soon+0.008
te+0.007
 than+0.007
 Jenn+0.006
Neg logit contributions
 spoon-0.020
 carrots-0.020
 peas-0.019
 knives-0.019
 lettuce-0.019
Token are
Token ID389
Feature activation-1.02e-02

Pos logit contributions
 like+0.004
 soon+0.003
 after+0.002
 surely+0.002
 than+0.002
Neg logit contributions
berries-0.006
 spoon-0.006
 carrots-0.006
 lettuce-0.006
 peas-0.006
Token living
Token ID2877
Feature activation-1.14e-01

Pos logit contributions
berries+0.023
 spoon+0.023
 carrots+0.022
 peas+0.022
 knives+0.021
Neg logit contributions
 like-0.012
 soon-0.011
 than-0.010
 forward-0.010
 over-0.010
Token beings
Token ID9791
Feature activation-1.57e-02

Pos logit contributions
 spoon+0.226
berries+0.223
 carrots+0.220
 lettuce+0.219
 peas+0.217
Neg logit contributions
 like-0.163
 soon-0.153
 than-0.149
 after-0.145
 forward-0.142
Token that
Token ID326
Feature activation-1.16e-02

Pos logit contributions
 spoon+0.037
berries+0.037
 lettuce+0.036
 carrots+0.036
 peas+0.036
Neg logit contributions
 like-0.018
 soon-0.016
 than-0.016
 when-0.015
 from-0.015
Token have
Token ID423
Feature activation+9.39e-04

Pos logit contributions
 spoon+0.026
berries+0.026
 peas+0.025
 carrots+0.025
 lettuce+0.025
Neg logit contributions
 like-0.014
 soon-0.013
 from-0.012
 than-0.012
 their-0.012
Token feelings
Token ID7666
Feature activation-1.56e-02

Pos logit contributions
 like+0.001
 than+0.001
 soon+0.001
 their+0.001
 when+0.001
Neg logit contributions
berries-0.002
 spoon-0.002
 peas-0.002
opus-0.002
 carrot-0.002
Token.
Token ID13
Feature activation+6.42e-03

Pos logit contributions
 spoon+0.038
 peas+0.037
 carrots+0.037
 lettuce+0.036
berries+0.036
Neg logit contributions
 like-0.017
 than-0.015
 soon-0.014
 when-0.013
 from-0.013
Token a
Token ID257
Feature activation+2.60e-03

Pos logit contributions
 like+0.001
 soon+0.001
 surely+0.001
 than+0.001
ually+0.001
Neg logit contributions
 carrots-0.002
 spoon-0.002
 lettuce-0.002
 peas-0.002
berries-0.002
Token witch
Token ID16365
Feature activation+1.11e-02

Pos logit contributions
 like+0.004
 soon+0.004
 forward+0.004
 than+0.004
 surely+0.004
Neg logit contributions
 spoon-0.004
 carrots-0.004
 peas-0.004
berries-0.004
 lettuce-0.004
Token who
Token ID508
Feature activation-7.53e-02

Pos logit contributions
 like+0.013
 soon+0.013
 than+0.012
 forward+0.011
te+0.011
Neg logit contributions
 spoon-0.024
 carrots-0.023
 peas-0.023
 lettuce-0.022
berries-0.022
Token had
Token ID550
Feature activation+7.50e-03

Pos logit contributions
 spoon+0.164
 carrots+0.163
 peas+0.158
 lettuce+0.155
berries+0.154
Neg logit contributions
 forward-0.074
 like-0.074
 soon-0.072
 surely-0.070
 than-0.068
Token been
Token ID587
Feature activation-2.89e-02

Pos logit contributions
 like+0.006
 soon+0.006
 than+0.005
 known+0.005
 when+0.004
Neg logit contributions
 spoon-0.020
berries-0.019
 peas-0.019
 carrots-0.018
 knives-0.018
Token living
Token ID2877
Feature activation-1.13e-01

Pos logit contributions
 spoon+0.061
 carrots+0.060
 peas+0.058
 lettuce+0.057
 knives+0.057
Neg logit contributions
 like-0.032
 soon-0.031
 surely-0.030
 forward-0.030
 than-0.029
Token there
Token ID612
Feature activation-2.84e-02

Pos logit contributions
 spoon+0.253
 carrots+0.251
 peas+0.242
 knives+0.238
 lettuce+0.233
Neg logit contributions
ually-0.108
 surely-0.108
te-0.105
 like-0.103
 soon-0.102
Token for
Token ID329
Feature activation+2.78e-03

Pos logit contributions
 spoon+0.062
 carrots+0.060
 peas+0.059
berries+0.058
 lettuce+0.058
Neg logit contributions
 like-0.032
 soon-0.031
 forward-0.029
 than-0.029
 surely-0.027
Token many
Token ID867
Feature activation-3.81e-02

Pos logit contributions
 soon+0.003
 like+0.002
 than+0.002
 after+0.002
te+0.002
Neg logit contributions
 spoon-0.007
berries-0.007
 peas-0.007
 carrots-0.006
 lettuce-0.006
Token years
Token ID812
Feature activation+1.99e-02

Pos logit contributions
 spoon+0.077
berries+0.070
 peas+0.068
 lettuce+0.067
 fork+0.067
Neg logit contributions
 soon-0.057
 like-0.057
 than-0.054
 after-0.051
 when-0.050
Token.
Token ID13
Feature activation-5.27e-03

Pos logit contributions
 like+0.024
 soon+0.023
 than+0.022
 after+0.020
 when+0.020
Neg logit contributions
 spoon-0.044
 carrots-0.041
 peas-0.041
berries-0.041
 lettuce-0.041
Token three
Token ID1115
Feature activation-1.48e-02

Pos logit contributions
 like+0.001
 soon+0.001
 after+0.001
 than+0.001
 forward+0.001
Neg logit contributions
 spoon-0.001
berries-0.001
 peas-0.001
 lettuce-0.001
opus-0.001
Token,
Token ID11
Feature activation-1.08e-02

Pos logit contributions
 lettuce+0.030
 spoon+0.029
berries+0.029
 carrot+0.029
 carrots+0.029
Neg logit contributions
 than-0.019
 soon-0.019
 like-0.018
 after-0.017
te-0.016
Token who
Token ID508
Feature activation-7.43e-02

Pos logit contributions
 spoon+0.025
berries+0.025
 carrots+0.024
 peas+0.024
 lettuce+0.024
Neg logit contributions
 like-0.011
 soon-0.011
 than-0.009
 after-0.009
 forward-0.009
Token had
Token ID550
Feature activation+6.87e-03

Pos logit contributions
 spoon+0.160
 carrots+0.159
 peas+0.155
berries+0.155
 lettuce+0.154
Neg logit contributions
 like-0.086
 soon-0.085
 surely-0.075
 forward-0.074
 than-0.074
Token been
Token ID587
Feature activation-2.55e-02

Pos logit contributions
 like+0.007
 soon+0.007
 than+0.006
 known+0.006
 surely+0.006
Neg logit contributions
 spoon-0.016
berries-0.016
 peas-0.015
 carrots-0.015
 lettuce-0.015
Token living
Token ID2877
Feature activation-1.13e-01

Pos logit contributions
 spoon+0.054
 carrots+0.053
 peas+0.052
berries+0.052
 lettuce+0.052
Neg logit contributions
 like-0.030
 soon-0.029
 forward-0.027
 than-0.026
 surely-0.026
Token in
Token ID287
Feature activation-4.74e-02

Pos logit contributions
 spoon+0.270
 carrots+0.263
 peas+0.258
 knives+0.254
berries+0.253
Neg logit contributions
 like-0.100
 soon-0.096
 forward-0.088
 surely-0.087
 than-0.086
Token the
Token ID262
Feature activation-2.14e-02

Pos logit contributions
 spoon+0.112
 carrots+0.110
 peas+0.107
 knives+0.105
 lettuce+0.105
Neg logit contributions
 like-0.041
 soon-0.040
 forward-0.037
ually-0.036
 surely-0.036
Token jail
Token ID7356
Feature activation+2.67e-03

Pos logit contributions
berries+0.043
 spoon+0.042
 peas+0.041
 carrots+0.041
 lettuce+0.039
Neg logit contributions
 like-0.030
 forward-0.029
 soon-0.028
 after-0.027
 surely-0.026
Token for
Token ID329
Feature activation+2.97e-03

Pos logit contributions
 soon+0.003
 like+0.003
 than+0.003
 forward+0.003
 after+0.003
Neg logit contributions
 spoon-0.006
 carrots-0.006
 peas-0.005
 lettuce-0.005
berries-0.005
Token weeks
Token ID2745
Feature activation+3.67e-02

Pos logit contributions
 like+0.003
 soon+0.003
 forward+0.003
 than+0.003
 surely+0.002
Neg logit contributions
 spoon-0.007
 carrots-0.007
 peas-0.007
berries-0.007
 lettuce-0.007
Token sund
Token ID37437
Feature activation-1.58e-02

Pos logit contributions
 spoon+0.064
 carrots+0.063
 peas+0.061
 lettuce+0.060
berries+0.060
Neg logit contributions
 like-0.069
 soon-0.068
 than-0.064
 forward-0.064
 surely-0.063
Tokenae
Token ID3609
Feature activation+1.34e-02

Pos logit contributions
 spoon+0.029
 carrots+0.027
 peas+0.026
berries+0.026
 knives+0.025
Neg logit contributions
 like-0.023
 soon-0.022
 forward-0.021
 surely-0.021
 Pal-0.020
Token with
Token ID351
Feature activation-3.27e-02

Pos logit contributions
 like+0.018
 soon+0.017
 than+0.016
 after+0.015
 when+0.015
Neg logit contributions
berries-0.027
 knives-0.027
 carrots-0.026
 lettuce-0.026
 peas-0.026
Token lots
Token ID6041
Feature activation-4.08e-02

Pos logit contributions
opus+0.063
berries+0.062
 sleeves+0.062
uddle+0.060
 bur+0.060
Neg logit contributions
 like-0.042
 their-0.037
 soon-0.036
 when-0.034
 after-0.033
Token of
Token ID286
Feature activation-1.63e-02

Pos logit contributions
 bur+0.108
 spoon+0.108
 elbow+0.107
 peas+0.107
 knives+0.106
Neg logit contributions
 like-0.026
 than-0.022
 after-0.020
 from-0.018
 as-0.018
Token topp
Token ID23126
Feature activation-1.13e-01

Pos logit contributions
 bur+0.028
berries+0.026
pill+0.026
opus+0.026
 sleeves+0.026
Neg logit contributions
 like-0.029
 after-0.025
 soon-0.024
 than-0.023
 when-0.023
Tokenings
Token ID654
Feature activation+4.59e-02

Pos logit contributions
 carrots+0.256
 spoon+0.254
 peas+0.251
 lettuce+0.250
 knives+0.247
Neg logit contributions
 like-0.103
 than-0.099
 soon-0.096
te-0.087
ook-0.084
Token.
Token ID13
Feature activation-2.64e-03

Pos logit contributions
 like+0.059
 soon+0.054
 forward+0.051
 after+0.051
 than+0.051
Neg logit contributions
 spoon-0.097
 carrots-0.093
 peas-0.093
 lettuce-0.091
berries-0.089
Token Tim
Token ID5045
Feature activation-3.01e-02

Pos logit contributions
 spoon+0.006
 carrots+0.005
 peas+0.005
 lettuce+0.005
 knives+0.005
Neg logit contributions
 like-0.003
 soon-0.003
 than-0.003
 forward-0.003
 surely-0.003
Tokenmy
Token ID1820
Feature activation-1.41e-02

Pos logit contributions
 spoon+0.063
 carrots+0.063
 peas+0.062
 lettuce+0.061
berries+0.060
Neg logit contributions
 like-0.035
 soon-0.035
 than-0.030
 over-0.029
 forward-0.029
Token was
Token ID373
Feature activation+5.49e-03

Pos logit contributions
berries+0.031
 carrots+0.031
 spoon+0.030
 peas+0.030
 knives+0.030
Neg logit contributions
 soon-0.015
 like-0.015
 when-0.012
 after-0.012
 than-0.012
Token a
Token ID257
Feature activation-1.14e-02

Pos logit contributions
ually+0.006
bek+0.005
 surely+0.005
bye+0.005
 forward+0.005
Neg logit contributions
 carrots-0.007
 spoon-0.007
 peas-0.007
 soup-0.007
 carrot-0.007
Token squirrel
Token ID33039
Feature activation-3.88e-02

Pos logit contributions
berries+0.018
 carrots+0.018
 spoon+0.018
 peas+0.018
 knives+0.017
Neg logit contributions
 like-0.019
 soon-0.019
 than-0.017
 forward-0.017
 surely-0.017
Token.
Token ID13
Feature activation-8.12e-03

Pos logit contributions
 peas+0.088
 spoon+0.088
 lettuce+0.087
 carrots+0.086
 forks+0.085
Neg logit contributions
 like-0.045
 soon-0.043
 from-0.039
 when-0.038
 after-0.037
Token It
Token ID632
Feature activation-5.47e-03

Pos logit contributions
 spoon+0.015
 carrots+0.015
berries+0.014
 lettuce+0.014
 peas+0.014
Neg logit contributions
 soon-0.011
ually-0.011
 like-0.011
 surely-0.011
 than-0.011
Token was
Token ID373
Feature activation+5.76e-04

Pos logit contributions
 spoon+0.011
 carrots+0.011
 peas+0.011
berries+0.010
 lettuce+0.010
Neg logit contributions
 like-0.007
 soon-0.007
 forward-0.006
 than-0.006
 surely-0.006
Token living
Token ID2877
Feature activation-1.13e-01

Pos logit contributions
 like+0.001
 soon+0.001
 forward+0.001
 than+0.001
 surely+0.001
Neg logit contributions
 spoon-0.001
 carrots-0.001
 peas-0.001
 lettuce-0.001
berries-0.001
Token in
Token ID287
Feature activation-4.66e-02

Pos logit contributions
 spoon+0.269
 carrots+0.263
 peas+0.258
 lettuce+0.256
berries+0.256
Neg logit contributions
 like-0.104
 soon-0.100
 than-0.091
 forward-0.091
 after-0.085
Token the
Token ID262
Feature activation-1.91e-02

Pos logit contributions
 spoon+0.111
 carrots+0.109
 peas+0.107
berries+0.106
 lettuce+0.105
Neg logit contributions
 like-0.042
 soon-0.040
 forward-0.037
 surely-0.036
 than-0.036
Token tree
Token ID5509
Feature activation-5.79e-02

Pos logit contributions
berries+0.030
 peas+0.029
 spoon+0.028
pill+0.027
 forks+0.027
Neg logit contributions
 like-0.034
 soon-0.031
 after-0.029
 across-0.029
 when-0.029
Token.
Token ID13
Feature activation-5.82e-03

Pos logit contributions
 spoon+0.129
 peas+0.128
 carrots+0.126
 lettuce+0.126
 knives+0.125
Neg logit contributions
 like-0.070
 soon-0.067
 than-0.061
 when-0.059
 after-0.057
Token It
Token ID632
Feature activation-3.38e-03

Pos logit contributions
 spoon+0.011
 carrots+0.011
 lettuce+0.011
 peas+0.011
berries+0.011
Neg logit contributions
 soon-0.008
 like-0.007
ually-0.007
 than-0.007
 surely-0.007
Tokenunk
Token ID2954
Feature activation-3.36e-02

Pos logit contributions
 spoon+0.005
 carrots+0.005
 lettuce+0.005
 peas+0.005
 carrot+0.005
Neg logit contributions
 like-0.006
 soon-0.005
te-0.005
 than-0.005
ained-0.005
Token!
Token ID0
Feature activation+4.89e-03

Pos logit contributions
 peas+0.077
berries+0.075
 lettuce+0.075
 spoon+0.075
 carrots+0.075
Neg logit contributions
 like-0.040
 soon-0.035
 when-0.035
 after-0.031
 than-0.030
Token He
Token ID679
Feature activation-1.24e-03

Pos logit contributions
 like+0.006
 soon+0.006
 forward+0.005
 surely+0.005
 than+0.005
Neg logit contributions
 spoon-0.010
 carrots-0.010
 peas-0.010
berries-0.010
 lettuce-0.010
Token's
Token ID338
Feature activation+1.07e-02

Pos logit contributions
berries+0.003
 spoon+0.002
 carrots+0.002
 peas+0.002
 lettuce+0.002
Neg logit contributions
 like-0.002
 soon-0.002
 when-0.001
 than-0.001
 surely-0.001
Token been
Token ID587
Feature activation-2.64e-02

Pos logit contributions
 like+0.012
 soon+0.011
 than+0.010
 when+0.009
 forward+0.009
Neg logit contributions
 spoon-0.024
berries-0.024
 carrots-0.023
 peas-0.023
 lettuce-0.023
Token living
Token ID2877
Feature activation-1.12e-01

Pos logit contributions
 spoon+0.051
berries+0.050
 carrots+0.049
 peas+0.048
 lettuce+0.048
Neg logit contributions
 like-0.037
 soon-0.036
 forward-0.033
 than-0.033
 after-0.031
Token around
Token ID1088
Feature activation+1.66e-02

Pos logit contributions
 spoon+0.245
 carrots+0.240
berries+0.238
 peas+0.236
 lettuce+0.235
Neg logit contributions
 like-0.126
 soon-0.121
 than-0.113
 forward-0.113
 after-0.107
Token here
Token ID994
Feature activation-2.61e-03

Pos logit contributions
 like+0.016
 soon+0.014
 than+0.013
 forward+0.013
 after+0.013
Neg logit contributions
 spoon-0.042
berries-0.040
 peas-0.040
 carrots-0.040
 lettuce-0.039
Token for
Token ID329
Feature activation+9.34e-03

Pos logit contributions
 spoon+0.007
berries+0.006
 peas+0.006
 carrots+0.006
 lettuce+0.006
Neg logit contributions
 like-0.002
 soon-0.002
 than-0.002
 forward-0.002
 after-0.002
Token a
Token ID257
Feature activation+1.61e-02

Pos logit contributions
 like+0.008
 soon+0.008
 than+0.007
 forward+0.007
 after+0.007
Neg logit contributions
 spoon-0.023
berries-0.022
 carrots-0.022
 peas-0.022
 lettuce-0.021
Token while
Token ID981
Feature activation-2.12e-02

Pos logit contributions
 soon+0.018
 like+0.017
 after+0.016
 surely+0.015
 when+0.014
Neg logit contributions
berries-0.038
 knives-0.036
 carrots-0.036
 peas-0.036
 lettuce-0.035
Token little
Token ID1310
Feature activation-3.01e-02

Pos logit contributions
 spoon+0.015
 carrots+0.014
berries+0.014
 peas+0.014
 lettuce+0.014
Neg logit contributions
 like-0.020
 soon-0.020
 forward-0.019
 than-0.019
 surely-0.018
Token pig
Token ID12967
Feature activation-2.24e-02

Pos logit contributions
 carrots+0.051
 knives+0.050
 spoon+0.050
 lettuce+0.050
 peas+0.049
Neg logit contributions
 like-0.045
 soon-0.042
 than-0.040
 after-0.039
 forward-0.039
Tokengy
Token ID1360
Feature activation-6.25e-03

Pos logit contributions
 spoon+0.053
 carrots+0.052
 peas+0.051
berries+0.050
 lettuce+0.050
Neg logit contributions
 forward-0.019
 surely-0.019
 like-0.018
 soon-0.018
 than-0.016
Token named
Token ID3706
Feature activation-6.68e-03

Pos logit contributions
 spoon+0.016
berries+0.015
 carrots+0.015
 knives+0.015
 lettuce+0.015
Neg logit contributions
 like-0.005
 soon-0.005
 when-0.005
 than-0.004
 after-0.004
Token O
Token ID440
Feature activation-7.96e-02

Pos logit contributions
 spoon+0.013
 carrots+0.013
 peas+0.012
berries+0.012
 knives+0.012
Neg logit contributions
 like-0.009
 soon-0.009
 forward-0.008
 after-0.008
 than-0.008
Tokenll
Token ID297
Feature activation-1.11e-01

Pos logit contributions
 carrots+0.134
 spoon+0.133
 peas+0.130
 lettuce+0.127
 berries+0.125
Neg logit contributions
 soon-0.124
 like-0.119
ook-0.115
ually-0.114
vy-0.113
Tokenie
Token ID494
Feature activation-3.32e-02

Pos logit contributions
 spoon+0.188
 carrots+0.184
 peas+0.180
berries+0.180
 lettuce+0.175
Neg logit contributions
 like-0.172
 soon-0.168
 forward-0.165
 surely-0.164
 than-0.160
Token.
Token ID13
Feature activation-6.87e-03

Pos logit contributions
 spoon+0.067
 carrots+0.066
 peas+0.065
berries+0.064
 lettuce+0.064
Neg logit contributions
 like-0.044
 soon-0.042
 than-0.039
 when-0.037
 forward-0.037
Token O
Token ID440
Feature activation-5.74e-02

Pos logit contributions
 spoon+0.012
 carrots+0.012
 peas+0.011
berries+0.011
 lettuce+0.011
Neg logit contributions
 like-0.010
 soon-0.010
 forward-0.010
 than-0.010
 surely-0.009
Tokenll
Token ID297
Feature activation-8.65e-02

Pos logit contributions
 spoon+0.087
 carrots+0.085
 peas+0.083
 lettuce+0.081
 knives+0.080
Neg logit contributions
 soon-0.099
 like-0.098
ually-0.092
 than-0.092
 surely-0.091
Tokenie
Token ID494
Feature activation-3.83e-03

Pos logit contributions
 spoon+0.150
 carrots+0.147
 peas+0.143
berries+0.142
 lettuce+0.139
Neg logit contributions
 like-0.131
 soon-0.128
 forward-0.125
 surely-0.123
 than-0.121
Token't
Token ID470
Feature activation+3.47e-03

Pos logit contributions
 soon+0.022
 like+0.021
 surely+0.021
te+0.020
 forward+0.020
Neg logit contributions
 carrots-0.033
 spoon-0.033
 peas-0.033
 lettuce-0.032
 knives-0.032
Token believe
Token ID1975
Feature activation+8.12e-03

Pos logit contributions
 like+0.004
 soon+0.004
 surely+0.003
te+0.003
 after+0.003
Neg logit contributions
 carrots-0.008
berries-0.008
 peas-0.008
 lettuce-0.008
 knives-0.008
Token that
Token ID326
Feature activation-1.47e-03

Pos logit contributions
 like+0.009
 soon+0.009
 surely+0.008
ually+0.008
 forward+0.008
Neg logit contributions
 spoon-0.017
 carrots-0.017
 peas-0.017
 knives-0.016
berries-0.016
Token she
Token ID673
Feature activation-2.09e-04

Pos logit contributions
 spoon+0.003
 carrots+0.003
 peas+0.003
berries+0.003
 lettuce+0.003
Neg logit contributions
 like-0.002
 soon-0.002
 than-0.002
 forward-0.002
 after-0.002
Token had
Token ID550
Feature activation+1.49e-02

Pos logit contributions
 carrots+0.000
 spoon+0.000
 peas+0.000
berries+0.000
 lettuce+0.000
Neg logit contributions
 like-0.000
 soon-0.000
 than-0.000
te-0.000
 surely-0.000
Token wrapped
Token ID12908
Feature activation+7.12e-02

Pos logit contributions
 like+0.015
 soon+0.014
 than+0.012
 surely+0.012
 forward+0.012
Neg logit contributions
 spoon-0.035
 carrots-0.034
berries-0.034
 peas-0.034
 lettuce-0.033
Token the
Token ID262
Feature activation+2.18e-02

Pos logit contributions
 like+0.067
 soon+0.065
 forward+0.060
 than+0.059
 surely+0.058
Neg logit contributions
 spoon-0.166
 carrots-0.162
 peas-0.159
berries-0.157
 lettuce-0.156
Token blocks
Token ID7021
Feature activation-1.21e-02

Pos logit contributions
 like+0.041
 over+0.037
 forward+0.036
 very+0.034
 across+0.032
Neg logit contributions
pill-0.033
unn-0.032
berries-0.031
opus-0.031
 bur-0.030
Token so
Token ID523
Feature activation+3.15e-02

Pos logit contributions
 spoon+0.029
 carrots+0.028
 peas+0.028
berries+0.027
 lettuce+0.027
Neg logit contributions
 like-0.010
 soon-0.010
ually-0.009
 than-0.009
te-0.009
Token perfectly
Token ID7138
Feature activation+4.04e-02

Pos logit contributions
 soon+0.021
 like+0.020
 forward+0.017
 than+0.016
 surely+0.016
Neg logit contributions
 spoon-0.085
berries-0.084
 carrots-0.083
 peas-0.082
 knives-0.081
Token!
Token ID0
Feature activation+1.48e-02

Pos logit contributions
 like+0.050
 soon+0.049
 than+0.045
 forward+0.044
 surely+0.043
Neg logit contributions
 spoon-0.084
 carrots-0.082
 peas-0.081
berries-0.080
 lettuce-0.080
Token have
Token ID423
Feature activation-5.31e-03

Pos logit contributions
 carrots+0.092
berries+0.091
 peas+0.090
 spoon+0.089
 lettuce+0.088
Neg logit contributions
 like-0.051
 soon-0.046
 surely-0.041
 after-0.038
 over-0.036
Token something
Token ID1223
Feature activation-4.07e-04

Pos logit contributions
berries+0.014
 spoon+0.014
 peas+0.013
 carrots+0.013
 lettuce+0.013
Neg logit contributions
 soon-0.004
 like-0.004
 after-0.003
 when-0.003
 than-0.003
Token much
Token ID881
Feature activation+1.02e-02

Pos logit contributions
berries+0.001
 spoon+0.001
 carrots+0.001
 peas+0.001
 lettuce+0.001
Neg logit contributions
 like-0.000
 soon-0.000
 than-0.000
 from-0.000
 after-0.000
Token more
Token ID517
Feature activation+5.47e-03

Pos logit contributions
 like+0.013
 soon+0.012
 than+0.011
 after+0.010
 when+0.010
Neg logit contributions
berries-0.023
 spoon-0.023
 peas-0.022
 carrots-0.022
 knives-0.021
Token valuable
Token ID8119
Feature activation+5.38e-02

Pos logit contributions
 like+0.007
 soon+0.006
 than+0.006
 forward+0.006
 after+0.005
Neg logit contributions
 spoon-0.012
berries-0.011
 carrots-0.011
 peas-0.011
 lettuce-0.011
Token than
Token ID621
Feature activation+8.47e-02

Pos logit contributions
 like+0.061
 soon+0.059
 than+0.055
 forward+0.054
te+0.052
Neg logit contributions
 spoon-0.117
 carrots-0.113
 peas-0.111
berries-0.110
 lettuce-0.110
Token that
Token ID326
Feature activation-3.84e-03

Pos logit contributions
 like+0.109
 soon+0.102
 than+0.091
 after+0.090
 when+0.090
Neg logit contributions
 spoon-0.184
berries-0.175
 peas-0.172
 carrots-0.170
 carrot-0.169
Token."
Token ID526
Feature activation-7.81e-05

Pos logit contributions
 spoon+0.007
berries+0.007
 peas+0.007
 carrots+0.007
 lettuce+0.007
Neg logit contributions
 like-0.005
 soon-0.005
 than-0.005
 after-0.005
 when-0.005
Token
Token ID198
Feature activation+5.26e-03

Pos logit contributions
 spoon+0.000
 carrots+0.000
 peas+0.000
 lettuce+0.000
 knives+0.000
Neg logit contributions
 soon-0.000
 like-0.000
 forward-0.000
 than-0.000
te-0.000
Token
Token ID198
Feature activation+2.79e-03

Pos logit contributions
 soon+0.005
 like+0.005
te+0.005
 forward+0.004
 over+0.004
Neg logit contributions
 spoon-0.012
 peas-0.012
 carrots-0.012
 knives-0.012
 lettuce-0.012
TokenSarah
Token ID29284
Feature activation+3.00e-02

Pos logit contributions
 like+0.003
 soon+0.002
 forward+0.002
 than+0.002
 surely+0.002
Neg logit contributions
 spoon-0.007
 carrots-0.006
 peas-0.006
berries-0.006
 lettuce-0.006
Token worry
Token ID5490
Feature activation-9.39e-03

Pos logit contributions
 carrots+0.045
 peas+0.045
 spoon+0.045
 lettuce+0.045
berries+0.044
Neg logit contributions
 like-0.026
 soon-0.024
 when-0.021
 after-0.021
 from-0.020
Token.
Token ID13
Feature activation+1.26e-02

Pos logit contributions
 spoon+0.023
 carrots+0.023
 peas+0.023
berries+0.023
 lettuce+0.022
Neg logit contributions
 like-0.008
 soon-0.007
 than-0.007
 forward-0.006
 after-0.006
Token We
Token ID775
Feature activation+3.96e-03

Pos logit contributions
 like+0.017
 soon+0.016
 surely+0.015
ually+0.015
te+0.015
Neg logit contributions
 carrots-0.025
 spoon-0.025
 peas-0.025
 lettuce-0.024
 knives-0.024
Token believed
Token ID4762
Feature activation+1.17e-02

Pos logit contributions
 like+0.005
 soon+0.005
 surely+0.004
 forward+0.004
 over+0.004
Neg logit contributions
 peas-0.009
 carrots-0.009
 spoon-0.009
 lettuce-0.009
berries-0.009
Token the
Token ID262
Feature activation+1.24e-02

Pos logit contributions
 like+0.019
 soon+0.019
 forward+0.018
 than+0.018
 after+0.017
Neg logit contributions
 spoon-0.020
 carrots-0.019
 peas-0.019
berries-0.019
 lettuce-0.019
Token old
Token ID1468
Feature activation+7.51e-02

Pos logit contributions
 like+0.020
 true+0.018
 soon+0.016
 surely+0.016
 forward+0.016
Neg logit contributions
berries-0.023
 peas-0.020
 knives-0.020
 sleeves-0.020
 spoon-0.020
Token man
Token ID582
Feature activation+4.37e-02

Pos logit contributions
 like+0.110
 than+0.100
 soon+0.097
 when+0.095
bye+0.094
Neg logit contributions
berries-0.142
 spoon-0.138
 peas-0.136
 lettuce-0.132
 carrots-0.131
Token's
Token ID338
Feature activation+2.67e-02

Pos logit contributions
 like+0.058
 soon+0.057
 than+0.052
 forward+0.051
te+0.050
Neg logit contributions
 spoon-0.089
 carrots-0.086
 peas-0.085
 lettuce-0.084
berries-0.084
Token words
Token ID2456
Feature activation+1.13e-02

Pos logit contributions
 like+0.038
 soon+0.035
 than+0.033
 forward+0.032
 true+0.032
Neg logit contributions
berries-0.055
 spoon-0.054
 peas-0.052
 lettuce-0.051
 carrots-0.051
Token and
Token ID290
Feature activation+1.07e-02

Pos logit contributions
 like+0.013
 soon+0.013
 than+0.011
 forward+0.011
te+0.011
Neg logit contributions
 spoon-0.025
 carrots-0.025
 peas-0.024
 lettuce-0.024
berries-0.024
Token we
Token ID356
Feature activation-1.47e-02

Pos logit contributions
 like+0.013
 soon+0.012
 than+0.011
 when+0.010
 after+0.010
Neg logit contributions
 spoon-0.024
berries-0.023
 carrots-0.023
 peas-0.022
 lettuce-0.022
Token would
Token ID561
Feature activation+4.25e-02

Pos logit contributions
 like+0.017
 soon+0.016
 over+0.013
 than+0.013
 when+0.013
Neg logit contributions
 spoon-0.028
berries-0.028
 carrots-0.028
 lettuce-0.027
 knives-0.027
Token never
Token ID1239
Feature activation+4.79e-02

Pos logit contributions
 like+0.055
 soon+0.048
 than+0.041
 surely+0.040
 over+0.039
Neg logit contributions
berries-0.095
 knives-0.093
 spoon-0.093
 carrots-0.093
 peas-0.092
Token forget
Token ID6044
Feature activation+6.35e-02

Pos logit contributions
 like+0.059
 soon+0.053
 than+0.048
 after+0.046
 surely+0.044
Neg logit contributions
 spoon-0.108
 carrots-0.107
berries-0.106
 peas-0.105
 knives-0.103
Token it
Token ID340
Feature activation+1.94e-02

Pos logit contributions
 like+0.048
 soon+0.045
 when+0.038
 after+0.036
 over+0.035
Neg logit contributions
 spoon-0.170
berries-0.164
 peas-0.162
 lettuce-0.162
 carrots-0.162
Token no
Token ID645
Feature activation+1.40e-03

Pos logit contributions
 like+0.026
 soon+0.025
 than+0.023
 forward+0.023
 surely+0.022
Neg logit contributions
 spoon-0.039
 carrots-0.038
 peas-0.037
berries-0.037
 lettuce-0.037
Token matter
Token ID2300
Feature activation+8.82e-02

Pos logit contributions
 like+0.001
 soon+0.001
 than+0.001
 after+0.001
 as+0.001
Neg logit contributions
 spoon-0.004
berries-0.004
 peas-0.004
 lettuce-0.004
 carrots-0.003
Token where
Token ID810
Feature activation-3.18e-02

Pos logit contributions
 like+0.072
 soon+0.069
 than+0.060
 forward+0.059
te+0.056
Neg logit contributions
 spoon-0.225
 carrots-0.219
 peas-0.217
 lettuce-0.215
berries-0.212
Token she
Token ID673
Feature activation+1.61e-03

Pos logit contributions
 spoon+0.077
 peas+0.076
 lettuce+0.076
 carrots+0.076
 carrot+0.075
Neg logit contributions
 soon-0.032
 like-0.029
 when-0.026
 after-0.025
 forward-0.023
Token went
Token ID1816
Feature activation+1.42e-02

Pos logit contributions
 like+0.002
 soon+0.002
 than+0.002
 after+0.002
 surely+0.002
Neg logit contributions
 spoon-0.003
 carrots-0.003
 peas-0.003
berries-0.003
 lettuce-0.003
Token.
Token ID13
Feature activation+1.12e-02

Pos logit contributions
 like+0.016
 soon+0.016
 forward+0.015
 than+0.014
 after+0.014
Neg logit contributions
 spoon-0.031
 carrots-0.030
 peas-0.030
 lettuce-0.029
berries-0.029
Token
Token ID198
Feature activation+5.40e-03

Pos logit contributions
 like+0.012
 soon+0.012
 forward+0.011
te+0.011
 surely+0.010
Neg logit contributions
 spoon-0.025
 carrots-0.024
 peas-0.024
 knives-0.023
 lettuce-0.023
Token freezing
Token ID20884
Feature activation+1.73e-03

Pos logit contributions
 like+0.036
 surely+0.036
te+0.035
 forward+0.035
 soon+0.034
Neg logit contributions
 carrots-0.088
 spoon-0.087
 peas-0.084
 lettuce-0.082
 knives-0.082
Token everything
Token ID2279
Feature activation+2.94e-02

Pos logit contributions
 like+0.003
 soon+0.002
 than+0.002
 forward+0.002
 after+0.002
Neg logit contributions
 spoon-0.003
 carrots-0.003
 peas-0.003
 lettuce-0.003
berries-0.003
Token in
Token ID287
Feature activation+1.18e-03

Pos logit contributions
 like+0.033
 soon+0.032
 forward+0.029
 than+0.029
 surely+0.028
Neg logit contributions
 spoon-0.064
 carrots-0.062
 peas-0.061
berries-0.060
 lettuce-0.060
Token sight
Token ID6504
Feature activation+2.86e-02

Pos logit contributions
 forward+0.001
ually+0.001
ends+0.001
te+0.001
 like+0.001
Neg logit contributions
 spoon-0.003
 carrots-0.003
 peas-0.003
 lettuce-0.002
 knives-0.002
Token.
Token ID13
Feature activation+7.88e-03

Pos logit contributions
 like+0.035
 soon+0.034
 than+0.033
 after+0.030
 when+0.030
Neg logit contributions
 spoon-0.064
 peas-0.062
 carrots-0.061
 lettuce-0.061
berries-0.059
Token Everywhere
Token ID48201
Feature activation+7.61e-02

Pos logit contributions
 soon+0.011
 like+0.010
 surely+0.010
 forward+0.010
away+0.010
Neg logit contributions
 carrots-0.014
 spoon-0.014
 lettuce-0.014
berries-0.014
 peas-0.013
Token he
Token ID339
Feature activation+3.91e-02

Pos logit contributions
ually+0.145
te+0.144
amed+0.144
 like+0.140
 surely+0.140
Neg logit contributions
 spoon-0.095
 carrots-0.094
 knives-0.092
 peas-0.090
 lettuce-0.085
Token went
Token ID1816
Feature activation+4.05e-02

Pos logit contributions
 like+0.058
ends+0.057
 than+0.057
te+0.055
 forward+0.055
Neg logit contributions
 spoon-0.070
 peas-0.066
 carrots-0.066
 carrot-0.062
 lettuce-0.062
Token,
Token ID11
Feature activation+3.18e-02

Pos logit contributions
 like+0.044
 soon+0.043
te+0.043
ually+0.043
 surely+0.042
Neg logit contributions
 carrots-0.083
 spoon-0.081
 knives-0.080
 peas-0.079
berries-0.079
Token people
Token ID661
Feature activation+4.43e-02

Pos logit contributions
 like+0.049
 soon+0.049
 forward+0.046
 than+0.046
 surely+0.045
Neg logit contributions
 spoon-0.055
 carrots-0.053
 peas-0.052
berries-0.051
 lettuce-0.051
Token were
Token ID547
Feature activation-3.03e-03

Pos logit contributions
 like+0.051
 soon+0.050
 forward+0.044
 than+0.044
 surely+0.042
Neg logit contributions
 spoon-0.096
 carrots-0.095
 peas-0.093
 lettuce-0.092
berries-0.092
Token were
Token ID547
Feature activation-8.47e-03

Pos logit contributions
 like+0.006
 soon+0.006
 forward+0.006
 surely+0.006
 than+0.006
Neg logit contributions
 spoon-0.013
 carrots-0.013
 peas-0.013
berries-0.012
 knives-0.012
Token no
Token ID645
Feature activation-2.55e-02

Pos logit contributions
 carrots+0.018
 spoon+0.018
 peas+0.017
 lettuce+0.017
 carrot+0.017
Neg logit contributions
away-0.008
ually-0.008
 like-0.008
te-0.008
 than-0.008
Token longer
Token ID2392
Feature activation-1.91e-03

Pos logit contributions
 spoon+0.061
 carrots+0.059
 peas+0.058
berries+0.058
 lettuce+0.058
Neg logit contributions
 like-0.024
 soon-0.024
 forward-0.021
 than-0.021
 surely-0.020
Token restless
Token ID42537
Feature activation-1.95e-02

Pos logit contributions
 spoon+0.004
 carrots+0.003
 peas+0.003
berries+0.003
 lettuce+0.003
Neg logit contributions
 like-0.003
 soon-0.003
 forward-0.002
 than-0.002
 surely-0.002
Token and
Token ID290
Feature activation-1.26e-03

Pos logit contributions
 spoon+0.049
 carrots+0.047
 lettuce+0.047
 peas+0.047
berries+0.046
Neg logit contributions
 like-0.018
 soon-0.016
 than-0.015
 after-0.014
 forward-0.014
Token happy
Token ID3772
Feature activation+8.10e-03

Pos logit contributions
 spoon+0.003
 carrots+0.003
 peas+0.003
 lettuce+0.003
 knives+0.003
Neg logit contributions
 like-0.001
 soon-0.001
 forward-0.001
 than-0.001
te-0.001
Token to
Token ID284
Feature activation+9.70e-03

Pos logit contributions
 like+0.008
 soon+0.008
 than+0.008
 after+0.007
 when+0.007
Neg logit contributions
 spoon-0.019
 carrots-0.018
 peas-0.018
 lettuce-0.018
 knives-0.018
Token have
Token ID423
Feature activation+1.03e-02

Pos logit contributions
 like+0.011
 soon+0.011
 forward+0.010
 than+0.010
 surely+0.010
Neg logit contributions
 spoon-0.020
 carrots-0.020
 peas-0.019
 lettuce-0.019
berries-0.019
Token gone
Token ID3750
Feature activation+1.67e-02

Pos logit contributions
 like+0.010
 soon+0.010
 than+0.009
 forward+0.009
 surely+0.008
Neg logit contributions
 spoon-0.024
 carrots-0.023
 peas-0.023
berries-0.023
 lettuce-0.023
Token on
Token ID319
Feature activation+1.65e-02

Pos logit contributions
 like+0.015
 soon+0.015
 surely+0.014
te+0.014
 forward+0.014
Neg logit contributions
 spoon-0.038
 carrots-0.038
 peas-0.037
 lettuce-0.036
berries-0.036
Token their
Token ID511
Feature activation+1.66e-02

Pos logit contributions
 like+0.013
 soon+0.013
 than+0.011
 after+0.010
 when+0.009
Neg logit contributions
 spoon-0.043
berries-0.041
 carrots-0.040
 peas-0.040
 lettuce-0.040
Token he
Token ID339
Feature activation-1.90e-02

Pos logit contributions
 like+0.035
 soon+0.034
 forward+0.032
 than+0.032
 surely+0.031
Neg logit contributions
 spoon-0.054
 carrots-0.052
 peas-0.051
berries-0.050
 lettuce-0.050
Token wasn
Token ID2492
Feature activation+5.07e-02

Pos logit contributions
 spoon+0.039
 carrots+0.039
 peas+0.038
berries+0.037
 lettuce+0.037
Neg logit contributions
 like-0.023
 soon-0.023
 forward-0.021
 than-0.021
 surely-0.020
Token't
Token ID470
Feature activation-3.41e-02

Pos logit contributions
 like+0.073
 soon+0.073
 forward+0.068
 surely+0.068
te+0.065
Neg logit contributions
 spoon-0.096
 carrots-0.093
 peas-0.092
 lettuce-0.090
 knives-0.090
Token frightened
Token ID24776
Feature activation-1.92e-02

Pos logit contributions
 spoon+0.066
 carrots+0.065
 peas+0.063
 lettuce+0.062
berries+0.062
Neg logit contributions
 like-0.045
 soon-0.044
 forward-0.042
 surely-0.041
 than-0.041
Token.
Token ID13
Feature activation+8.76e-03

Pos logit contributions
 spoon+0.049
 lettuce+0.046
 carrots+0.046
 peas+0.046
 forks+0.045
Neg logit contributions
 like-0.021
 after-0.017
 soon-0.016
 when-0.016
 from-0.016
Token He
Token ID679
Feature activation+6.05e-03

Pos logit contributions
 like+0.009
 soon+0.008
 When+0.008
 when+0.008
 over+0.007
Neg logit contributions
 spoon-0.020
 peas-0.020
 carrots-0.020
 knives-0.019
berries-0.019
Token approached
Token ID10448
Feature activation-8.20e-03

Pos logit contributions
 like+0.007
 soon+0.007
 than+0.006
 forward+0.006
 surely+0.006
Neg logit contributions
 spoon-0.013
 carrots-0.012
 peas-0.012
berries-0.012
 lettuce-0.012
Token the
Token ID262
Feature activation+1.99e-02

Pos logit contributions
 carrots+0.017
 spoon+0.017
 peas+0.016
 knives+0.016
 lettuce+0.016
Neg logit contributions
ually-0.009
 than-0.008
 surely-0.008
 like-0.008
ends-0.008
Token dog
Token ID3290
Feature activation+3.25e-02

Pos logit contributions
 like+0.034
 soon+0.030
 after+0.030
 forward+0.029
 when+0.029
Neg logit contributions
 peas-0.036
 spoon-0.036
berries-0.034
 lettuce-0.033
 knives-0.032
Token slowly
Token ID6364
Feature activation+2.50e-02

Pos logit contributions
 soon+0.030
 like+0.030
 after+0.025
 over+0.025
 when+0.025
Neg logit contributions
 spoon-0.081
 peas-0.079
 knives-0.077
 lettuce-0.077
 carrots-0.077
Token and
Token ID290
Feature activation+7.60e-03

Pos logit contributions
 like+0.023
 soon+0.022
 forward+0.020
 than+0.019
 surely+0.019
Neg logit contributions
 spoon-0.060
 carrots-0.058
 peas-0.058
 lettuce-0.057
 knives-0.057
Token love
Token ID1842
Feature activation-4.15e-03

Pos logit contributions
 like+0.012
 soon+0.012
 than+0.010
 forward+0.009
te+0.009
Neg logit contributions
 spoon-0.034
berries-0.033
 carrots-0.033
 peas-0.033
 knives-0.032
Token and
Token ID290
Feature activation+9.95e-03

Pos logit contributions
 spoon+0.009
 lettuce+0.009
 peas+0.008
 knives+0.008
 carrots+0.008
Neg logit contributions
 like-0.005
 soon-0.005
 than-0.005
 when-0.004
 from-0.004
Token kindness
Token ID23887
Feature activation+1.39e-02

Pos logit contributions
 like+0.014
 soon+0.013
 than+0.012
 when+0.011
 their+0.011
Neg logit contributions
berries-0.021
 spoon-0.021
 knives-0.020
 peas-0.019
 sleeves-0.019
Token.
Token ID13
Feature activation+2.06e-03

Pos logit contributions
 like+0.017
 than+0.016
 soon+0.015
 from+0.014
 when+0.013
Neg logit contributions
 spoon-0.032
 lettuce-0.031
 peas-0.031
 knives-0.030
berries-0.030
Token I
Token ID314
Feature activation-7.98e-03

Pos logit contributions
 like+0.003
 soon+0.003
 than+0.002
 forward+0.002
te+0.002
Neg logit contributions
 spoon-0.004
 carrots-0.004
 peas-0.004
berries-0.004
 knives-0.004
Token am
Token ID716
Feature activation+1.34e-02

Pos logit contributions
 spoon+0.016
 carrots+0.015
 peas+0.014
 carrot+0.014
 lettuce+0.014
Neg logit contributions
 forward-0.010
ually-0.009
ook-0.009
 beyond-0.009
 soon-0.009
Token happy
Token ID3772
Feature activation-6.77e-03

Pos logit contributions
ually+0.015
away+0.015
bek+0.015
 than+0.014
bye+0.014
Neg logit contributions
 spoon-0.028
 carrots-0.027
 peas-0.026
 lettuce-0.025
 soup-0.025
Token to
Token ID284
Feature activation-5.67e-03

Pos logit contributions
 spoon+0.018
 sleeves+0.017
 lettuce+0.017
 knives+0.017
 peas+0.017
Neg logit contributions
 like-0.006
 soon-0.006
 than-0.006
 when-0.005
 after-0.005
Token have
Token ID423
Feature activation+7.93e-04

Pos logit contributions
 spoon+0.012
 carrots+0.012
 peas+0.011
 lettuce+0.011
berries+0.011
Neg logit contributions
 like-0.006
 soon-0.006
 forward-0.006
 than-0.006
ually-0.006
Token you
Token ID345
Feature activation-2.77e-02

Pos logit contributions
 like+0.001
 soon+0.001
 than+0.001
 after+0.001
 when+0.001
Neg logit contributions
 spoon-0.002
berries-0.002
 peas-0.002
 knives-0.002
 carrots-0.002
Token,
Token ID11
Feature activation-8.35e-03

Pos logit contributions
 spoon+0.061
 carrots+0.059
 peas+0.058
berries+0.057
 lettuce+0.057
Neg logit contributions
 like-0.030
 soon-0.030
 forward-0.027
 than-0.027
 surely-0.026
Token on
Token ID319
Feature activation+8.41e-03

Pos logit contributions
 spoon+0.070
berries+0.069
 carrots+0.069
 peas+0.069
 knives+0.068
Neg logit contributions
 like-0.034
 than-0.028
 soon-0.028
 from-0.027
 when-0.026
Token its
Token ID663
Feature activation-2.41e-03

Pos logit contributions
 like+0.006
 soon+0.005
 over+0.005
 than+0.005
 when+0.004
Neg logit contributions
 spoon-0.022
berries-0.022
 carrots-0.021
 peas-0.021
 knives-0.021
Token back
Token ID736
Feature activation+9.27e-03

Pos logit contributions
berries+0.004
 spoon+0.004
 knives+0.004
pill+0.004
 peas+0.004
Neg logit contributions
 like-0.004
 soon-0.004
 than-0.004
 from-0.004
 when-0.003
Token.
Token ID13
Feature activation-1.01e-03

Pos logit contributions
 like+0.011
 than+0.010
 soon+0.010
 over+0.010
 from+0.010
Neg logit contributions
 spoon-0.021
 knives-0.020
 peas-0.020
 carrots-0.020
 lettuce-0.019
Token
Token ID220
Feature activation-4.61e-02

Pos logit contributions
 spoon+0.002
 carrots+0.002
 peas+0.002
 knives+0.002
 lettuce+0.002
Neg logit contributions
 like-0.001
 soon-0.001
 forward-0.001
te-0.001
 over-0.001
Token
Token ID198
Feature activation+2.22e-03

Pos logit contributions
 spoon+0.107
 carrots+0.104
 peas+0.104
 knives+0.100
 lettuce+0.100
Neg logit contributions
 like-0.046
 soon-0.044
 forward-0.041
 over-0.039
te-0.039
Token
Token ID198
Feature activation+1.74e-03

Pos logit contributions
 soon+0.002
 like+0.002
te+0.002
 forward+0.002
 than+0.002
Neg logit contributions
 spoon-0.005
 carrots-0.005
 peas-0.005
 knives-0.005
 lettuce-0.005
TokenTom
Token ID13787
Feature activation-5.20e-03

Pos logit contributions
 soon+0.002
 like+0.002
 forward+0.001
 than+0.001
te+0.001
Neg logit contributions
 spoon-0.004
 carrots-0.004
 peas-0.004
berries-0.004
 knives-0.004
Token said
Token ID531
Feature activation-2.28e-03

Pos logit contributions
 spoon+0.010
 carrots+0.010
 peas+0.010
berries+0.010
 lettuce+0.010
Neg logit contributions
 like-0.007
 soon-0.007
 forward-0.006
 than-0.006
 surely-0.006
Token,
Token ID11
Feature activation-1.15e-02

Pos logit contributions
 spoon+0.005
 carrots+0.005
 peas+0.005
berries+0.005
 lettuce+0.005
Neg logit contributions
 like-0.002
 soon-0.002
 forward-0.002
 than-0.002
 surely-0.002
Token "
Token ID366
Feature activation+1.19e-02

Pos logit contributions
 spoon+0.025
 carrots+0.025
 peas+0.025
berries+0.025
 lettuce+0.024
Neg logit contributions
 like-0.013
 soon-0.012
 forward-0.011
te-0.011
 over-0.011
Token find
Token ID1064
Feature activation-1.13e-02

Pos logit contributions
 spoon+0.006
 carrots+0.006
 peas+0.006
berries+0.006
 lettuce+0.006
Neg logit contributions
 like-0.004
 soon-0.004
 forward-0.004
 than-0.004
ually-0.004
Token the
Token ID262
Feature activation+5.89e-04

Pos logit contributions
 spoon+0.026
 carrots+0.026
 peas+0.025
berries+0.025
 lettuce+0.025
Neg logit contributions
 like-0.011
 soon-0.011
 forward-0.010
 than-0.010
 surely-0.009
Token l
Token ID300
Feature activation-1.93e-02

Pos logit contributions
 like+0.001
 soon+0.001
 than+0.001
 after+0.001
 when+0.001
Neg logit contributions
berries-0.001
 knives-0.001
 lettuce-0.001
 peas-0.001
 onions-0.001
Tokenoll
Token ID692
Feature activation-1.67e-03

Pos logit contributions
 spoon+0.030
berries+0.030
 carrots+0.030
 peas+0.030
 lettuce+0.029
Neg logit contributions
 like-0.034
 forward-0.031
 soon-0.030
 than-0.030
te-0.030
Tokenipop
Token ID42800
Feature activation-3.50e-02

Pos logit contributions
 carrots+0.002
 spoon+0.002
berries+0.002
 lettuce+0.002
 peas+0.002
Neg logit contributions
 soon-0.003
 like-0.003
 than-0.003
 after-0.003
 forward-0.003
Token.
Token ID13
Feature activation+6.81e-03

Pos logit contributions
 spoon+0.072
 carrots+0.070
 peas+0.070
berries+0.069
 lettuce+0.069
Neg logit contributions
 like-0.045
 soon-0.044
 than-0.040
 forward-0.040
 after-0.039
Token After
Token ID2293
Feature activation+1.71e-03

Pos logit contributions
 like+0.007
 soon+0.007
 forward+0.007
 after+0.006
 surely+0.006
Neg logit contributions
 spoon-0.015
 carrots-0.015
 peas-0.015
 lettuce-0.014
 knives-0.014
Token a
Token ID257
Feature activation+1.15e-03

Pos logit contributions
 soon+0.002
 like+0.001
 after+0.001
 when+0.001
 forward+0.001
Neg logit contributions
 spoon-0.004
 carrots-0.004
 peas-0.004
berries-0.004
 lettuce-0.004
Token while
Token ID981
Feature activation-1.59e-02

Pos logit contributions
 soon+0.001
 like+0.001
 after+0.001
 surely+0.001
 forward+0.001
Neg logit contributions
 carrots-0.003
 spoon-0.003
berries-0.003
 peas-0.003
 knives-0.003
Token,
Token ID11
Feature activation+3.07e-03

Pos logit contributions
 spoon+0.036
 carrots+0.035
 peas+0.034
berries+0.034
 knives+0.034
Neg logit contributions
 like-0.016
 soon-0.015
 than-0.014
ually-0.014
 forward-0.014
Token the
Token ID262
Feature activation-1.88e-02

Pos logit contributions
 like+0.004
te+0.004
 than+0.004
ually+0.004
ook+0.004
Neg logit contributions
 spoon-0.006
 carrots-0.005
 knives-0.005
 peas-0.005
 forks-0.005
Token he
Token ID339
Feature activation-3.09e-02

Pos logit contributions
 spoon+0.021
 carrots+0.020
 peas+0.020
 lettuce+0.020
berries+0.020
Neg logit contributions
 like-0.011
 soon-0.010
 forward-0.010
 than-0.009
te-0.009
Token yelled
Token ID22760
Feature activation-2.48e-02

Pos logit contributions
 spoon+0.067
 carrots+0.065
 peas+0.065
 lettuce+0.065
berries+0.064
Neg logit contributions
 like-0.038
 soon-0.037
 forward-0.033
te-0.032
 than-0.032
Token:
Token ID25
Feature activation-2.67e-02

Pos logit contributions
 spoon+0.061
 carrots+0.060
 peas+0.059
 lettuce+0.059
berries+0.058
Neg logit contributions
 like-0.023
 forward-0.021
 soon-0.021
 after-0.019
 over-0.019
Token "
Token ID366
Feature activation+4.43e-03

Pos logit contributions
 spoon+0.061
 carrots+0.061
 peas+0.060
berries+0.059
 lettuce+0.059
Neg logit contributions
 like-0.028
 soon-0.026
te-0.024
 forward-0.024
 over-0.024
TokenHey
Token ID10814
Feature activation-2.52e-02

Pos logit contributions
 soon+0.006
 like+0.005
 after+0.005
te+0.005
 over+0.005
Neg logit contributions
 spoon-0.010
 lettuce-0.009
 carrots-0.009
 peas-0.009
 knives-0.009
Token,
Token ID11
Feature activation-8.36e-03

Pos logit contributions
 spoon+0.053
 carrots+0.053
 lettuce+0.052
 peas+0.052
berries+0.051
Neg logit contributions
 like-0.030
 soon-0.029
 forward-0.026
 than-0.026
 after-0.026
Token you
Token ID345
Feature activation-2.80e-02

Pos logit contributions
berries+0.019
 carrots+0.018
 lettuce+0.018
 sleeves+0.018
 peas+0.018
Neg logit contributions
 like-0.010
 from-0.009
 soon-0.009
 after-0.008
 when-0.008
Token!
Token ID0
Feature activation+4.62e-03

Pos logit contributions
 peas+0.059
 carrots+0.058
berries+0.058
 lettuce+0.057
 spoon+0.055
Neg logit contributions
 like-0.038
 soon-0.034
 over-0.031
 when-0.031
 forward-0.031
Token Get
Token ID3497
Feature activation-4.61e-03

Pos logit contributions
 like+0.006
te+0.005
 soon+0.005
 forward+0.005
 surely+0.005
Neg logit contributions
 carrots-0.010
 spoon-0.010
 peas-0.009
 lettuce-0.009
berries-0.009
Token out
Token ID503
Feature activation+8.29e-03

Pos logit contributions
 spoon+0.011
 carrots+0.011
 peas+0.011
 knives+0.011
 lettuce+0.011
Neg logit contributions
 like-0.004
 soon-0.003
 than-0.003
 surely-0.003
ually-0.003
Token of
Token ID286
Feature activation-1.22e-02

Pos logit contributions
 like+0.009
 soon+0.009
 forward+0.008
 after+0.008
 than+0.008
Neg logit contributions
 spoon-0.018
 carrots-0.018
 peas-0.018
berries-0.018
 lettuce-0.018
Token surprise
Token ID5975
Feature activation+1.36e-02

Pos logit contributions
 carrots+0.008
berries+0.008
 spoon+0.008
 peas+0.008
 knives+0.008
Neg logit contributions
 like-0.007
 soon-0.006
 forward-0.006
 surely-0.006
te-0.006
Token for
Token ID329
Feature activation+7.87e-03

Pos logit contributions
 like+0.012
 soon+0.011
 forward+0.010
 than+0.010
te+0.009
Neg logit contributions
 spoon-0.033
 carrots-0.033
 peas-0.032
 lettuce-0.032
berries-0.032
Token you
Token ID345
Feature activation-1.32e-02

Pos logit contributions
 soon+0.012
 like+0.011
 after+0.010
 as+0.010
 when+0.010
Neg logit contributions
 spoon-0.015
berries-0.015
 carrot-0.014
 carrots-0.014
 lettuce-0.014
Token,
Token ID11
Feature activation-7.47e-03

Pos logit contributions
 spoon+0.029
 peas+0.028
 carrots+0.028
 lettuce+0.028
 knives+0.028
Neg logit contributions
 like-0.016
 soon-0.015
 forward-0.014
 than-0.014
 after-0.013
Token princess
Token ID21752
Feature activation+6.01e-03

Pos logit contributions
 spoon+0.014
berries+0.013
 carrots+0.013
 knives+0.013
 sleeves+0.013
Neg logit contributions
 like-0.012
 soon-0.011
 from-0.010
 after-0.010
 forward-0.010
Token!"
Token ID2474
Feature activation-1.26e-02

Pos logit contributions
 like+0.008
 soon+0.008
 than+0.007
 forward+0.007
te+0.007
Neg logit contributions
 spoon-0.012
 carrots-0.012
 peas-0.011
 lettuce-0.011
berries-0.011
Token The
Token ID383
Feature activation-2.23e-02

Pos logit contributions
 spoon+0.028
 carrots+0.027
 peas+0.027
berries+0.026
 knives+0.026
Neg logit contributions
 like-0.014
 soon-0.013
 forward-0.012
 than-0.012
 after-0.012
Token princess
Token ID21752
Feature activation+1.94e-02

Pos logit contributions
berries+0.042
 spoon+0.041
 knives+0.040
 carrots+0.039
 peas+0.038
Neg logit contributions
 soon-0.030
 like-0.030
 true-0.029
te-0.027
 forward-0.027
Token was
Token ID373
Feature activation-3.29e-03

Pos logit contributions
 soon+0.025
 like+0.024
te+0.022
 than+0.022
 surely+0.021
Neg logit contributions
 spoon-0.039
 carrots-0.039
 peas-0.038
berries-0.037
 lettuce-0.037
Token curious
Token ID11040
Feature activation-3.86e-02

Pos logit contributions
 spoon+0.006
 carrots+0.005
 bowl+0.005
 peas+0.005
 lettuce+0.005
Neg logit contributions
away-0.005
ually-0.005
bek-0.005
bye-0.005
riage-0.004
Token and
Token ID290
Feature activation-1.05e-02

Pos logit contributions
 spoon+0.096
 carrots+0.094
 peas+0.093
 lettuce+0.092
berries+0.091
Neg logit contributions
 like-0.034
 soon-0.031
 than-0.028
 forward-0.027
 after-0.026
Token the
Token ID262
Feature activation+4.27e-04

Pos logit contributions
 like+0.033
 soon+0.031
 than+0.024
 when+0.022
 after+0.021
Neg logit contributions
 spoon-0.150
berries-0.146
 lettuce-0.143
 carrots-0.143
 peas-0.141
Token purple
Token ID14032
Feature activation+3.59e-02

Pos logit contributions
 like+0.001
 soon+0.001
 forward+0.001
 than+0.001
 after+0.001
Neg logit contributions
 spoon-0.001
berries-0.001
 knives-0.001
 lettuce-0.001
 peas-0.001
Token tent
Token ID11105
Feature activation-1.15e-03

Pos logit contributions
 like+0.067
 soon+0.066
 after+0.062
 when+0.061
 than+0.061
Neg logit contributions
 spoon-0.051
 knives-0.051
 lettuce-0.049
 peas-0.047
 carrots-0.045
Token again
Token ID757
Feature activation-2.56e-02

Pos logit contributions
 spoon+0.003
 carrots+0.003
 peas+0.002
berries+0.002
 lettuce+0.002
Neg logit contributions
 like-0.001
 soon-0.001
 forward-0.001
 than-0.001
 surely-0.001
Token!
Token ID0
Feature activation+1.63e-02

Pos logit contributions
 spoon+0.046
 carrots+0.045
 knives+0.043
 peas+0.043
berries+0.043
Neg logit contributions
 like-0.034
 forward-0.034
 surely-0.034
ually-0.033
te-0.033
Token<|endoftext|>
Token ID50256
Feature activation-1.91e-02

Pos logit contributions
 like+0.021
 soon+0.020
 forward+0.019
 than+0.019
 surely+0.019
Neg logit contributions
 spoon-0.032
 carrots-0.031
 peas-0.031
berries-0.030
 lettuce-0.030
TokenOne
Token ID3198
Feature activation-6.31e-03

Pos logit contributions
 spoon+0.038
 peas+0.037
 carrots+0.037
 knives+0.036
 lettuce+0.036
Neg logit contributions
 like-0.025
 soon-0.024
 forward-0.023
te-0.022
 over-0.022
Token day
Token ID1110
Feature activation-1.79e-04

Pos logit contributions
 spoon+0.015
 carrots+0.015
 peas+0.015
berries+0.015
 knives+0.015
Neg logit contributions
 like-0.006
 soon-0.005
 forward-0.005
 than-0.005
 surely-0.005
Token,
Token ID11
Feature activation+1.90e-02

Pos logit contributions
 carrots+0.000
 spoon+0.000
 peas+0.000
 lettuce+0.000
berries+0.000
Neg logit contributions
 like-0.000
 soon-0.000
 when-0.000
 after-0.000
 forward-0.000
Token a
Token ID257
Feature activation+1.56e-02

Pos logit contributions
 soon+0.020
 forward+0.020
 surely+0.020
 like+0.019
 than+0.019
Neg logit contributions
 spoon-0.039
 carrots-0.039
 lettuce-0.037
 peas-0.037
berries-0.037
Token little
Token ID1310
Feature activation-1.65e-02

Pos logit contributions
 like+0.029
 soon+0.028
 forward+0.027
 than+0.027
 surely+0.027
Neg logit contributions
 spoon-0.022
 carrots-0.021
 peas-0.021
berries-0.021
 lettuce-0.020
Token have
Token ID423
Feature activation+8.17e-03

Pos logit contributions
 like+0.002
 soon+0.002
 forward+0.002
 surely+0.002
 than+0.002
Neg logit contributions
 spoon-0.004
 carrots-0.004
berries-0.004
 knives-0.004
 peas-0.004
Token a
Token ID257
Feature activation+3.76e-04

Pos logit contributions
 like+0.009
 soon+0.009
 than+0.007
 forward+0.007
 after+0.007
Neg logit contributions
 spoon-0.019
berries-0.018
 peas-0.018
 carrots-0.018
 knives-0.017
Token ball
Token ID2613
Feature activation-2.21e-02

Pos logit contributions
 like+0.001
 soon+0.001
 forward+0.000
 surely+0.000
 than+0.000
Neg logit contributions
berries-0.001
 peas-0.001
 spoon-0.001
 knives-0.001
 carrots-0.001
Token and
Token ID290
Feature activation-1.80e-03

Pos logit contributions
 spoon+0.053
 carrots+0.052
 peas+0.051
 knives+0.050
 lettuce+0.050
Neg logit contributions
 like-0.022
 soon-0.020
 than-0.018
 forward-0.017
 after-0.017
Token a
Token ID257
Feature activation+1.03e-02

Pos logit contributions
 spoon+0.004
berries+0.004
 peas+0.004
 carrots+0.004
 knives+0.004
Neg logit contributions
 like-0.002
 soon-0.001
 after-0.001
 when-0.001
 their-0.001
Token k
Token ID479
Feature activation-1.02e-02

Pos logit contributions
 like+0.018
 soon+0.017
 forward+0.016
 their+0.016
 true+0.015
Neg logit contributions
berries-0.016
 knives-0.016
 peas-0.015
pill-0.015
 carrots-0.014
Tokenite
Token ID578
Feature activation+2.64e-02

Pos logit contributions
 spoon+0.020
 carrots+0.020
 peas+0.019
 lettuce+0.019
 knives+0.019
Neg logit contributions
 like-0.013
 soon-0.013
 forward-0.012
 than-0.012
 surely-0.012
Token.
Token ID13
Feature activation+1.51e-03

Pos logit contributions
 like+0.034
 soon+0.032
 than+0.030
 forward+0.028
 over+0.028
Neg logit contributions
 spoon-0.056
 carrots-0.054
 peas-0.054
 knives-0.054
 lettuce-0.053
Token They
Token ID1119
Feature activation-2.50e-03

Pos logit contributions
 like+0.002
 soon+0.002
 forward+0.001
 than+0.001
 surely+0.001
Neg logit contributions
 spoon-0.003
 carrots-0.003
 peas-0.003
berries-0.003
 lettuce-0.003
Token run
Token ID1057
Feature activation-1.33e-02

Pos logit contributions
 spoon+0.005
 carrots+0.005
berries+0.005
 knives+0.005
 peas+0.005
Neg logit contributions
 like-0.003
 soon-0.003
 surely-0.002
 forward-0.002
 than-0.002
Token and
Token ID290
Feature activation+1.21e-03

Pos logit contributions
 carrots+0.019
 rocks+0.018
 berries+0.017
 spoon+0.017
 cones+0.016
Neg logit contributions
bek-0.024
 known-0.024
riage-0.023
ually-0.022
imm-0.022
Token then
Token ID788
Feature activation+7.29e-03

Pos logit contributions
 spoon+0.003
berries+0.003
 peas+0.003
 carrots+0.003
 knives+0.003
Neg logit contributions
 like-0.002
 soon-0.002
 than-0.002
 forward-0.002
 surely-0.002
Token calmly
Token ID30180
Feature activation-6.67e-03

Pos logit contributions
 like+0.009
 soon+0.009
 forward+0.008
 than+0.008
 surely+0.008
Neg logit contributions
 spoon-0.015
 carrots-0.015
 peas-0.014
berries-0.014
 lettuce-0.014
Token walked
Token ID6807
Feature activation-3.05e-02

Pos logit contributions
 spoon+0.014
 carrots+0.013
berries+0.013
 peas+0.013
 lettuce+0.013
Neg logit contributions
 soon-0.009
 like-0.008
 forward-0.008
 after-0.007
 than-0.007
Token out
Token ID503
Feature activation+1.89e-02

Pos logit contributions
 carrots+0.067
 spoon+0.065
 lettuce+0.063
 peas+0.062
berries+0.062
Neg logit contributions
ually-0.029
 like-0.028
 than-0.027
 surely-0.026
organ-0.026
Token of
Token ID286
Feature activation-4.05e-03

Pos logit contributions
 like+0.018
 soon+0.017
 forward+0.016
 over+0.015
 from+0.015
Neg logit contributions
 spoon-0.045
 carrots-0.044
 peas-0.044
berries-0.043
 lettuce-0.043
Token the
Token ID262
Feature activation-5.89e-03

Pos logit contributions
 spoon+0.010
 carrots+0.010
 peas+0.010
berries+0.010
 lettuce+0.010
Neg logit contributions
 like-0.003
 soon-0.003
 forward-0.003
 than-0.003
 surely-0.003
Token room
Token ID2119
Feature activation+2.72e-02

Pos logit contributions
berries+0.012
 peas+0.012
 carrots+0.012
 lettuce+0.012
 berries+0.012
Neg logit contributions
 forward-0.007
 like-0.007
 over-0.006
 soon-0.006
 after-0.006
Token.
Token ID13
Feature activation-1.56e-04

Pos logit contributions
 soon+0.033
 like+0.032
 forward+0.030
 than+0.029
 after+0.028
Neg logit contributions
 spoon-0.058
 carrots-0.058
 peas-0.058
 lettuce-0.057
berries-0.056
Token
Token ID220
Feature activation-3.76e-02

Pos logit contributions
 spoon+0.000
 carrots+0.000
 peas+0.000
berries+0.000
 knives+0.000
Neg logit contributions
 like-0.000
 soon-0.000
 forward-0.000
 than-0.000
 surely-0.000
Token
Token ID198
Feature activation+4.87e-03

Pos logit contributions
 spoon+0.083
 carrots+0.081
 peas+0.080
berries+0.078
 knives+0.078
Neg logit contributions
 like-0.041
 soon-0.040
 forward-0.037
 than-0.036
ually-0.036
Token
Token ID198
Feature activation+1.24e-03

Pos logit contributions
 like+0.006
 soon+0.006
 forward+0.006
 than+0.006
 surely+0.005
Neg logit contributions
 spoon-0.010
 carrots-0.010
 peas-0.009
berries-0.009
 lettuce-0.009
Token,
Token ID11
Feature activation+1.72e-02

Pos logit contributions
 like+0.051
 soon+0.050
 surely+0.046
 forward+0.046
 than+0.046
Neg logit contributions
 spoon-0.120
 carrots-0.116
berries-0.114
 peas-0.113
 knives-0.112
Token there
Token ID612
Feature activation+3.68e-03

Pos logit contributions
 than+0.023
 surely+0.023
bye+0.023
bek+0.023
 soon+0.022
Neg logit contributions
 spoon-0.031
 carrots-0.029
 peas-0.028
berries-0.027
 carrot-0.027
Token was
Token ID373
Feature activation+2.09e-02

Pos logit contributions
 forward+0.006
 like+0.005
 than+0.005
 soon+0.005
 surely+0.005
Neg logit contributions
 spoon-0.006
 carrots-0.006
berries-0.006
 peas-0.006
 knives-0.006
Token a
Token ID257
Feature activation-3.48e-03

Pos logit contributions
 soon+0.021
 like+0.021
 forward+0.020
ually+0.020
 surely+0.020
Neg logit contributions
 carrots-0.046
 spoon-0.046
 lettuce-0.044
 peas-0.044
berries-0.044
Token family
Token ID1641
Feature activation+5.84e-03

Pos logit contributions
 spoon+0.005
 carrots+0.004
 peas+0.004
berries+0.004
 lettuce+0.004
Neg logit contributions
 like-0.007
 soon-0.007
 forward-0.006
 than-0.006
 surely-0.006
Token who
Token ID508
Feature activation-7.22e-02

Pos logit contributions
 like+0.006
 soon+0.006
 forward+0.006
 than+0.006
 surely+0.006
Neg logit contributions
 spoon-0.013
 carrots-0.013
 peas-0.012
berries-0.012
 lettuce-0.012
Token lived
Token ID5615
Feature activation-4.39e-03

Pos logit contributions
 spoon+0.127
 carrots+0.124
 peas+0.121
 lettuce+0.116
 soup+0.114
Neg logit contributions
 surely-0.098
 forward-0.097
bek-0.090
 soon-0.089
away-0.088
Token in
Token ID287
Feature activation-3.56e-02

Pos logit contributions
 spoon+0.011
 carrots+0.011
 peas+0.011
berries+0.011
 lettuce+0.011
Neg logit contributions
 like-0.003
 soon-0.003
 forward-0.003
 than-0.003
 surely-0.003
Token a
Token ID257
Feature activation+1.27e-02

Pos logit contributions
 carrots+0.071
 spoon+0.071
 peas+0.068
 lettuce+0.066
 carrot+0.065
Neg logit contributions
 forward-0.044
ually-0.043
 soon-0.042
 than-0.042
ook-0.041
Token big
Token ID1263
Feature activation+9.24e-03

Pos logit contributions
 like+0.021
 soon+0.021
 forward+0.019
 than+0.019
 surely+0.019
Neg logit contributions
 spoon-0.021
 carrots-0.020
 peas-0.020
berries-0.020
 knives-0.019
Token country
Token ID1499
Feature activation+2.02e-02

Pos logit contributions
 like+0.016
 soon+0.016
 forward+0.015
 than+0.015
 after+0.015
Neg logit contributions
 spoon-0.014
 carrots-0.013
 peas-0.013
 knives-0.013
 lettuce-0.013
Token there
Token ID612
Feature activation+4.71e-03

Pos logit contributions
 like+0.019
 soon+0.019
 than+0.018
 forward+0.018
 surely+0.018
Neg logit contributions
 spoon-0.025
 carrots-0.024
 peas-0.023
berries-0.023
 lettuce-0.023
Token was
Token ID373
Feature activation+1.76e-02

Pos logit contributions
 like+0.006
 soon+0.006
 forward+0.006
 than+0.006
 surely+0.006
Neg logit contributions
 spoon-0.009
 carrots-0.008
berries-0.008
 peas-0.008
 knives-0.008
Token a
Token ID257
Feature activation-6.23e-03

Pos logit contributions
 like+0.015
 soon+0.014
 forward+0.013
 than+0.013
 surely+0.012
Neg logit contributions
 spoon-0.043
 carrots-0.042
 peas-0.041
berries-0.041
 knives-0.041
Token little
Token ID1310
Feature activation-2.73e-02

Pos logit contributions
 carrots+0.008
 spoon+0.008
berries+0.008
 peas+0.008
 knives+0.008
Neg logit contributions
 like-0.012
 soon-0.012
 than-0.011
 forward-0.011
 after-0.011
Token rabbit
Token ID22746
Feature activation-2.98e-02

Pos logit contributions
 carrots+0.045
 knives+0.044
 lettuce+0.044
 spoon+0.044
 peas+0.043
Neg logit contributions
 like-0.043
 soon-0.041
 than-0.038
 after-0.038
 forward-0.037
Token who
Token ID508
Feature activation-7.91e-02

Pos logit contributions
 spoon+0.081
 peas+0.079
 carrots+0.079
 knives+0.078
 lettuce+0.078
Neg logit contributions
 like-0.021
 soon-0.018
 when-0.016
 than-0.016
 after-0.013
Token wanted
Token ID2227
Feature activation-2.42e-02

Pos logit contributions
 spoon+0.177
 carrots+0.174
 peas+0.169
 lettuce+0.167
berries+0.165
Neg logit contributions
 soon-0.078
 like-0.077
 forward-0.074
 surely-0.072
 than-0.070
Token to
Token ID284
Feature activation-7.41e-03

Pos logit contributions
 spoon+0.072
berries+0.071
opus+0.068
 peas+0.068
 sleeves+0.068
Neg logit contributions
 like-0.010
 soon-0.009
 when-0.008
 after-0.007
 from-0.007
Token find
Token ID1064
Feature activation-1.80e-02

Pos logit contributions
berries+0.016
 knives+0.016
 peas+0.016
 spoon+0.016
 carrots+0.016
Neg logit contributions
 like-0.008
 soon-0.007
 from-0.007
 when-0.006
 surely-0.006
Token something
Token ID1223
Feature activation-8.74e-03

Pos logit contributions
 spoon+0.044
berries+0.042
 peas+0.042
 sleeves+0.040
 knives+0.040
Neg logit contributions
 like-0.016
 soon-0.013
 from-0.012
 after-0.012
 when-0.012
Token delicious
Token ID12625
Feature activation-3.17e-02

Pos logit contributions
berries+0.019
 spoon+0.019
 peas+0.018
 sleeves+0.018
 knives+0.018
Neg logit contributions
 like-0.011
 soon-0.009
 from-0.009
 than-0.009
 when-0.009
Token and
Token ID290
Feature activation+4.20e-03

Pos logit contributions
 carrots+0.017
 spoon+0.017
 carrot+0.016
 peas+0.016
berries+0.016
Neg logit contributions
ually-0.012
bek-0.011
 surely-0.011
 Jenn-0.010
ook-0.010
Token said
Token ID531
Feature activation+8.76e-03

Pos logit contributions
 soon+0.006
 like+0.006
 surely+0.005
 than+0.005
 forward+0.005
Neg logit contributions
 spoon-0.008
 carrots-0.007
berries-0.007
 peas-0.007
 knives-0.007
Token,
Token ID11
Feature activation+3.50e-04

Pos logit contributions
 like+0.008
 soon+0.007
 forward+0.006
 than+0.006
te+0.006
Neg logit contributions
 spoon-0.021
 carrots-0.021
 peas-0.020
berries-0.020
 lettuce-0.020
Token "
Token ID366
Feature activation+1.49e-02

Pos logit contributions
 like+0.000
 soon+0.000
 forward+0.000
 than+0.000
 surely+0.000
Neg logit contributions
 spoon-0.001
 carrots-0.001
 peas-0.001
berries-0.001
 knives-0.001
TokenOf
Token ID5189
Feature activation-7.91e-03

Pos logit contributions
 like+0.016
 soon+0.016
 forward+0.015
 than+0.014
 surely+0.014
Neg logit contributions
 spoon-0.033
 carrots-0.032
 peas-0.031
berries-0.031
 knives-0.031
Token course
Token ID1781
Feature activation-7.24e-02

Pos logit contributions
 spoon+0.007
 carrots+0.006
berries+0.006
 knives+0.005
 peas+0.005
Neg logit contributions
 like-0.019
 soon-0.019
 than-0.018
ually-0.018
ends-0.018
Token,
Token ID11
Feature activation+1.28e-02

Pos logit contributions
 spoon+0.166
 carrots+0.163
 peas+0.163
berries+0.161
 lettuce+0.160
Neg logit contributions
 like-0.076
 soon-0.075
 than-0.066
 after-0.065
 forward-0.064
Token I
Token ID314
Feature activation-9.65e-03

Pos logit contributions
 like+0.019
 soon+0.019
 when+0.017
 after+0.017
 from+0.016
Neg logit contributions
 spoon-0.023
berries-0.023
 peas-0.023
 carrots-0.023
 lettuce-0.022
Token'll
Token ID1183
Feature activation+2.03e-02

Pos logit contributions
 spoon+0.014
 carrots+0.012
 bowl+0.012
 carrot+0.012
 lettuce+0.012
Neg logit contributions
 than-0.016
ually-0.016
away-0.016
 Jenn-0.016
 beyond-0.016
Token bring
Token ID2222
Feature activation+2.93e-02

Pos logit contributions
 like+0.029
 soon+0.028
 than+0.026
 forward+0.025
 surely+0.025
Neg logit contributions
 spoon-0.038
 carrots-0.038
 peas-0.037
berries-0.037
 knives-0.036
Token it
Token ID340
Feature activation+9.27e-03

Pos logit contributions
 soon+0.036
 like+0.035
 after+0.033
 forward+0.031
 when+0.030
Neg logit contributions
 spoon-0.062
berries-0.061
 sleeves-0.059
 peas-0.059
 knives-0.057
Token a
Token ID257
Feature activation-2.17e-03

Pos logit contributions
 like+0.018
 soon+0.018
 forward+0.017
 than+0.017
 surely+0.017
Neg logit contributions
 spoon-0.045
 carrots-0.045
 peas-0.043
 lettuce-0.043
berries-0.043
Token j
Token ID474
Feature activation-2.80e-02

Pos logit contributions
 spoon+0.003
 carrots+0.003
berries+0.003
 peas+0.003
 knives+0.003
Neg logit contributions
 like-0.004
 soon-0.004
 forward-0.004
 than-0.004
 surely-0.004
Tokenolly
Token ID5098
Feature activation-6.23e-03

Pos logit contributions
 spoon+0.042
 carrots+0.042
 peas+0.041
 knives+0.041
 lettuce+0.040
Neg logit contributions
 like-0.048
 forward-0.048
 soon-0.046
 than-0.045
away-0.045
Token old
Token ID1468
Feature activation+3.35e-02

Pos logit contributions
berries+0.009
 knives+0.009
 carrots+0.009
 spoon+0.009
 peas+0.008
Neg logit contributions
 like-0.012
 soon-0.011
 than-0.011
te-0.010
 after-0.010
Token woman
Token ID2415
Feature activation+2.29e-02

Pos logit contributions
 like+0.055
 than+0.051
 soon+0.051
 from+0.049
 when+0.048
Neg logit contributions
 knives-0.052
 carrots-0.051
 forks-0.050
 lettuce-0.050
 peas-0.049
Token who
Token ID508
Feature activation-7.65e-02

Pos logit contributions
 like+0.027
 soon+0.026
 forward+0.024
 than+0.024
 surely+0.023
Neg logit contributions
 spoon-0.048
 carrots-0.047
 peas-0.046
berries-0.046
 lettuce-0.045
Token lived
Token ID5615
Feature activation-5.68e-03

Pos logit contributions
 spoon+0.167
 carrots+0.164
 peas+0.159
 lettuce+0.156
berries+0.155
Neg logit contributions
 forward-0.076
 soon-0.076
 like-0.075
 surely-0.073
 than-0.070
Token in
Token ID287
Feature activation-3.83e-02

Pos logit contributions
 spoon+0.015
 carrots+0.015
 peas+0.015
berries+0.014
 knives+0.014
Neg logit contributions
 like-0.004
 soon-0.004
 forward-0.003
 than-0.003
te-0.003
Token a
Token ID257
Feature activation+1.46e-02

Pos logit contributions
 spoon+0.085
 carrots+0.084
 peas+0.080
 lettuce+0.079
 knives+0.079
Neg logit contributions
 soon-0.039
 like-0.038
 forward-0.037
ually-0.037
 than-0.036
Token small
Token ID1402
Feature activation+9.33e-03

Pos logit contributions
 soon+0.022
 like+0.021
 forward+0.020
 after+0.020
 true+0.019
Neg logit contributions
 spoon-0.025
 carrots-0.025
berries-0.025
 peas-0.025
 knives-0.024
Token village
Token ID7404
Feature activation-3.38e-02

Pos logit contributions
 soon+0.016
 like+0.016
 than+0.015
 after+0.015
 forward+0.015
Neg logit contributions
 spoon-0.014
 carrots-0.013
 peas-0.013
 lettuce-0.013
 knives-0.013
Token and
Token ID290
Feature activation-1.10e-02

Pos logit contributions
 spoon+0.051
 carrots+0.049
 peas+0.049
berries+0.048
 lettuce+0.048
Neg logit contributions
 like-0.018
 soon-0.018
 than-0.016
 forward-0.016
 after-0.015
Token took
Token ID1718
Feature activation-2.59e-02

Pos logit contributions
 spoon+0.022
 carrots+0.022
berries+0.021
 peas+0.021
 lettuce+0.021
Neg logit contributions
 soon-0.014
 like-0.014
 forward-0.012
 after-0.012
 surely-0.012
Token her
Token ID607
Feature activation+7.50e-03

Pos logit contributions
 spoon+0.062
 carrots+0.060
 peas+0.059
berries+0.058
 lettuce+0.058
Neg logit contributions
 like-0.023
 soon-0.023
 than-0.021
 forward-0.020
 surely-0.020
Token to
Token ID284
Feature activation-5.78e-03

Pos logit contributions
 like+0.006
 forward+0.006
 soon+0.006
 after+0.005
 over+0.005
Neg logit contributions
 peas-0.019
berries-0.019
 lettuce-0.018
 carrots-0.018
 spoon-0.018
Token the
Token ID262
Feature activation-3.60e-03

Pos logit contributions
 spoon+0.014
 carrots+0.013
 peas+0.013
 knives+0.013
 lettuce+0.013
Neg logit contributions
 like-0.005
 forward-0.005
 soon-0.005
 than-0.005
ually-0.005
Token living
Token ID2877
Feature activation-7.52e-02

Pos logit contributions
berries+0.006
 spoon+0.006
 peas+0.005
 carrots+0.005
 knives+0.005
Neg logit contributions
 like-0.006
 soon-0.006
 forward-0.006
 after-0.006
 surely-0.006
Token room
Token ID2119
Feature activation+3.25e-02

Pos logit contributions
 spoon+0.117
 carrots+0.107
 peas+0.105
 knives+0.102
 bowl+0.100
Neg logit contributions
 like-0.134
 soon-0.129
 surely-0.127
 forward-0.125
te-0.123
Token.
Token ID13
Feature activation-8.29e-04

Pos logit contributions
 like+0.045
 soon+0.044
 forward+0.041
 than+0.041
 surely+0.040
Neg logit contributions
 spoon-0.061
 carrots-0.060
 peas-0.058
berries-0.058
 lettuce-0.057
Token She
Token ID1375
Feature activation+3.98e-03

Pos logit contributions
 spoon+0.002
 carrots+0.002
 peas+0.002
 knives+0.002
 lettuce+0.002
Neg logit contributions
 like-0.001
 soon-0.001
 forward-0.001
 than-0.001
 after-0.001
Token found
Token ID1043
Feature activation+1.04e-02

Pos logit contributions
 soon+0.005
 like+0.005
te+0.004
 surely+0.004
 over+0.004
Neg logit contributions
 carrots-0.008
 spoon-0.008
berries-0.008
 lettuce-0.008
 peas-0.008
Token a
Token ID257
Feature activation+5.16e-03

Pos logit contributions
 like+0.009
 soon+0.009
 forward+0.008
 after+0.007
 over+0.007
Neg logit contributions
 spoon-0.025
 carrots-0.024
 peas-0.024
berries-0.024
 lettuce-0.024