TOP ACTIVATIONS
MAX = 10.324

. The ducks and fish sw am away . The water
of Sam 's leg and sw am away . Tom helped
Tom heard Lily and sw am to her . He
another deep dive and slowly sw am back to shore .
. He jumped in and sw am down and saw a
, he leaned over and sw atted the mosquito with his
and Mia grabbed Leo and sw am to the edge of
buzz ed , as it sw armed out of the log
and he and Be cca sw am together . They had
They sw am and sw am until they were tired
The ducks hurried ly sw am away , but the
and Mia in trouble and sw am to them . It
Jill 's hand . They sw am back to the boat
. He sw am and sw am until he reached the
. They quickly sw am to the edge of
friends and watched as they sw am away .
, she jumped in and sw am to the bag .
o stretched his fins and sw am towards the scary fish
animals , and they quickly sw am away . But D
She cautiously sw am around and saw lots

MIN ACTIVATIONS
MIN = -7.395

chocolate and nuts . Which one do you want , Tom
. " Which one should we get , Ben
his mom , " Which one do I get ?" She
asked Anna , " Which one is your favorite ?"
animal to feed . Which one do you want ?"
He said , " Which one should I take ?"
read your book . Which one do you want ?"
He said , " Which one is it ?" Lily said
one toy each . Which one do you want ?"
, or white . Which one do you want ?" Sara
wanted to show you how much we love you , Mom
. She told Emma how much it meant to her .
Lucy presents to show how much he loved her . Lucy
about her day and how much she loved her . When
and said , " Which one should I pick ?"
his friends back home how much fun he had . <|endoftext|>
wanted to show everyone how much she loved them .
John smiled , knowing how much his sister wanted one .
were just talking about how much fun it is to play
it looks , but how much you love it ." And

INTERVAL 5.894 - 10.324
CONTAINS 0.252%

said , “ It ’
friend !" She ran away from Tim and the
they fought , the fish sw am away . The dog
. L ily ran back to the house .
quickly grabbed his toy and ran outside with mom my .

INTERVAL 1.465 - 5.894
CONTAINS 23.251%

and said that he had been a very good boy .
wanted to play with it . Sarah said , " Wow
about it . She wanted to have fun . She wanted
, please behave ! You 're hurting my nose with your
all aspects of our lives . If we are clear and

INTERVAL -2.965 - 1.465
CONTAINS 69.857%

It was so big ! The elephant was very stubborn and
He saw Lily and her friends playing and said , "
. He was grateful to the bear for guiding him and
, but he kept on feeling scared . When they reached
It looks fun !" " OK !" Sam says

INTERVAL -7.395 - -2.965
CONTAINS 6.640%

bird and asked , " What happened , Lily ?"
adventure together . <|endoftext|> Once upon a time , there was
end . <|endoftext|> Once upon a time , there was a
she could find . One day , she found a really
away . Suddenly , a big , scary dragon
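
NOTE ON THE INTERVALS
The four INTERVAL ranges above appear to be equal-width bins spanning MIN to MAX. This is inferred from the reported numbers; the dump itself does not say how the bounds were chosen. The sketch below recomputes the bounds under that assumption and pairs them with the reported CONTAINS shares. The names interval_edges and shares are hypothetical and used only for illustration.

```python
def interval_edges(lo, hi, n_bins=4):
    """Return n_bins + 1 equally spaced edges from lo to hi (assumed binning)."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]


# MIN and MAX exactly as reported in the dump.
edges = interval_edges(-7.395, 10.324)

# CONTAINS percentages as reported, reordered from lowest to highest interval.
shares = [6.640, 69.857, 23.251, 0.252]

# Print each reconstructed interval next to its reported share.
for (left, right), share in zip(zip(edges, edges[1:]), shares):
    print(f"INTERVAL {left:.3f} - {right:.3f}  CONTAINS {share:.3f}%")
```

Under this assumption each bin is about 4.43 wide, and the recomputed bounds agree with the listed ones to three decimals. The reported shares sum to 100%, so the four intervals account for every activation counted.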
Token.
Token ID13
Feature activation+3.47e+00

Pos logit contributions
arted+0.028
pered+0.028
lee+0.026
anks+0.025
aled+0.024
Neg logit contributions
 else-0.021
str-0.021
 slice-0.021
 mel-0.020
 flavor-0.019
Token The
Token ID383
Feature activation+1.21e-01

Pos logit contributions
arted+0.085
pered+0.082
lee+0.078
 heads+0.073
sey+0.072
Neg logit contributions
 else-0.077
 slice-0.075
 mel-0.072
str-0.070
aster-0.069
Token ducks
Token ID39694
Feature activation+2.36e-01

Pos logit contributions
pered+0.003
arted+0.003
lee+0.003
 heads+0.003
anks+0.003
Neg logit contributions
 else-0.002
str-0.002
 slice-0.002
 mel-0.002
ww-0.002
Token and
Token ID290
Feature activation-7.83e-01

Pos logit contributions
arted+0.006
pered+0.006
lee+0.006
 heads+0.005
aled+0.005
Neg logit contributions
str-0.005
 mel-0.004
 slice-0.004
ww-0.004
 else-0.004
Token fish
Token ID5916
Feature activation+4.46e-01

Pos logit contributions
 else+0.014
str+0.014
arest+0.014
 slice+0.014
 mel+0.013
Neg logit contributions
arted-0.022
pered-0.022
lee-0.021
 heads-0.020
aled-0.020
Token sw
Token ID1509
Feature activation+1.03e+01

Pos logit contributions
pered+0.012
arted+0.012
lee+0.011
 heads+0.010
aled+0.010
Neg logit contributions
str-0.009
ww-0.008
 mel-0.008
 slice-0.008
body-0.008
Tokenam
Token ID321
Feature activation+5.29e+00

Pos logit contributions
arted+0.112
 Ghosts+0.106
pered+0.106
lee+0.106
idget+0.103
Neg logit contributions
w-0.322
 slice-0.316
 mel-0.309
 bread-0.300
 else-0.299
Token away
Token ID1497
Feature activation+5.63e-01

Pos logit contributions
arted+0.142
pered+0.141
lee+0.137
anks+0.129
aled+0.126
Neg logit contributions
 else-0.106
 slice-0.100
str-0.098
 mel-0.097
 flavor-0.088
Token.
Token ID13
Feature activation+3.01e+00

Pos logit contributions
arted+0.016
pered+0.016
lee+0.015
anks+0.014
aled+0.014
Neg logit contributions
 else-0.010
 slice-0.010
str-0.010
 mel-0.010
 flavor-0.009
Token The
Token ID383
Feature activation-9.12e-02

Pos logit contributions
arted+0.077
pered+0.075
lee+0.071
aled+0.066
anks+0.066
Neg logit contributions
 else-0.065
 slice-0.062
 mel-0.060
str-0.059
 jam-0.056
Token water
Token ID1660
Feature activation+7.48e-02

Pos logit contributions
 else+0.002
str+0.002
 slice+0.002
 mel+0.002
ww+0.002
Neg logit contributions
pered-0.002
arted-0.002
lee-0.002
 heads-0.002
anks-0.002
Token of
Token ID286
Feature activation-1.35e+00

Pos logit contributions
arted+0.136
pered+0.136
lee+0.128
aled+0.123
anks+0.122
Neg logit contributions
 else-0.052
str-0.050
 slice-0.050
 mel-0.049
 flavor-0.043
Token Sam
Token ID3409
Feature activation+3.24e+00

Pos logit contributions
 else+0.024
str+0.023
 slice+0.022
 mel+0.022
ww+0.021
Neg logit contributions
arted-0.039
pered-0.039
lee-0.037
anks-0.035
aled-0.035
Token's
Token ID338
Feature activation+4.60e+00

Pos logit contributions
arted+0.086
pered+0.086
lee+0.081
anks+0.076
aled+0.075
Neg logit contributions
 else-0.065
 slice-0.062
str-0.062
 mel-0.062
 flavor-0.056
Token leg
Token ID1232
Feature activation+2.17e+00

Pos logit contributions
arted+0.117
lee+0.109
pered+0.109
aled+0.102
idget+0.099
Neg logit contributions
 else-0.102
 slice-0.101
 mel-0.099
str-0.094
 jam-0.090
Token and
Token ID290
Feature activation+9.23e-01

Pos logit contributions
arted+0.061
pered+0.060
lee+0.059
anks+0.055
aled+0.054
Neg logit contributions
 else-0.041
 slice-0.039
 mel-0.038
str-0.037
 jam-0.034
Token sw
Token ID1509
Feature activation+1.02e+01

Pos logit contributions
arted+0.024
pered+0.024
lee+0.022
 heads+0.022
aled+0.021
Neg logit contributions
str-0.019
arest-0.018
 slice-0.017
 else-0.017
ww-0.017
Tokenam
Token ID321
Feature activation+4.54e+00

Pos logit contributions
arted+0.121
lee+0.113
pered+0.103
angers+0.093
idget+0.092
Neg logit contributions
w-0.327
 slice-0.324
 else-0.311
 mel-0.310
lly-0.302
Token away
Token ID1497
Feature activation-1.26e-01

Pos logit contributions
arted+0.116
pered+0.115
lee+0.113
anks+0.106
aled+0.102
Neg logit contributions
 else-0.098
 slice-0.091
 mel-0.089
str-0.088
 flavor-0.081
Token.
Token ID13
Feature activation+3.56e+00

Pos logit contributions
 else+0.002
 slice+0.002
 mel+0.002
str+0.002
 delicious+0.002
Neg logit contributions
arted-0.004
pered-0.004
lee-0.004
aled-0.003
anks-0.003
Token Tom
Token ID4186
Feature activation+2.05e+00

Pos logit contributions
arted+0.090
pered+0.088
lee+0.085
 heads+0.079
sey+0.078
Neg logit contributions
 else-0.077
 mel-0.074
 slice-0.072
str-0.072
aster-0.067
Token helped
Token ID4193
Feature activation+2.15e-01

Pos logit contributions
arted+0.059
pered+0.059
lee+0.056
aled+0.052
 heads+0.052
Neg logit contributions
 slice-0.035
 else-0.035
str-0.035
 mel-0.035
ww-0.032
Token\n
Token ID198
Feature activation-1.70e+00

Pos logit contributions
 else+0.031
 slice+0.030
str+0.030
 mel+0.030
 flavor+0.026
Neg logit contributions
arted-0.059
pered-0.059
lee-0.056
anks-0.053
aled-0.052
TokenTom
Token ID13787
Feature activation+2.55e+00

Pos logit contributions
 else+0.025
 slice+0.024
str+0.023
 mel+0.023
 flavor+0.020
Neg logit contributions
pered-0.055
arted-0.055
lee-0.052
anks-0.049
aled-0.049
Token heard
Token ID2982
Feature activation-1.04e+00

Pos logit contributions
arted+0.088
anks+0.087
pered+0.087
lee+0.084
angers+0.081
Neg logit contributions
 else-0.033
 slice-0.026
 mel-0.025
str-0.023
 jam-0.023
Token Lily
Token ID20037
Feature activation+3.72e+00

Pos logit contributions
 else+0.023
str+0.023
 slice+0.023
 mel+0.022
 flavor+0.020
Neg logit contributions
arted-0.025
pered-0.025
lee-0.024
anks-0.022
aled-0.022
Token and
Token ID290
Feature activation+9.36e-01

Pos logit contributions
arted+0.121
lee+0.117
pered+0.113
angers+0.112
anced+0.110
Neg logit contributions
 else-0.059
 bite-0.051
 slice-0.049
 that-0.049
 mel-0.042
Token sw
Token ID1509
Feature activation+1.01e+01

Pos logit contributions
arted+0.027
pered+0.027
lee+0.026
anks+0.025
aled+0.024
Neg logit contributions
 else-0.018
 slice-0.016
 mel-0.016
str-0.016
 jam-0.014
Tokenam
Token ID321
Feature activation+4.76e+00

Pos logit contributions
arted+0.120
lee+0.119
pered+0.104
 asleep+0.102
angers+0.100
Neg logit contributions
w-0.323
 slice-0.314
 else-0.303
 mel-0.293
lly-0.292
Token to
Token ID284
Feature activation+3.27e-01

Pos logit contributions
lee+0.160
arted+0.150
anks+0.149
pered+0.147
angers+0.142
Neg logit contributions
 else-0.075
 that-0.057
 slice-0.052
 bite-0.052
 there-0.048
Token her
Token ID607
Feature activation+3.15e+00

Pos logit contributions
arted+0.009
pered+0.009
lee+0.009
aled+0.008
anks+0.008
Neg logit contributions
 else-0.006
str-0.006
 slice-0.006
 mel-0.006
ww-0.005
Token.
Token ID13
Feature activation+3.83e+00

Pos logit contributions
arted+0.095
pered+0.092
lee+0.090
aled+0.085
anks+0.083
Neg logit contributions
 else-0.056
 slice-0.052
 mel-0.051
str-0.049
 jam-0.045
Token He
Token ID679
Feature activation+9.78e-01

Pos logit contributions
arted+0.099
lee+0.095
anks+0.086
aned+0.085
idget+0.085
Neg logit contributions
 else-0.081
 that-0.081
 mel-0.071
 slice-0.070
looking-0.070
Token another
Token ID1194
Feature activation-1.33e-01

Pos logit contributions
arted+0.070
pered+0.070
lee+0.066
anks+0.063
aled+0.062
Neg logit contributions
 else-0.043
 slice-0.041
str-0.040
 mel-0.040
 flavor-0.036
Token deep
Token ID2769
Feature activation+3.06e+00

Pos logit contributions
str+0.003
 else+0.003
 baked+0.003
mmmm+0.003
w+0.003
Neg logit contributions
arted-0.003
pered-0.003
 breaths-0.003
lee-0.003
 heads-0.002
Token dive
Token ID15647
Feature activation+3.12e+00

Pos logit contributions
arted+0.060
pered+0.059
lee+0.055
aled+0.050
anks+0.050
Neg logit contributions
 else-0.084
str-0.082
 slice-0.080
 mel-0.080
 flavor-0.075
Token and
Token ID290
Feature activation+2.97e-01

Pos logit contributions
arted+0.081
pered+0.081
lee+0.076
anks+0.071
aled+0.071
Neg logit contributions
 else-0.065
str-0.063
 slice-0.062
 mel-0.061
 flavor-0.056
Token slowly
Token ID6364
Feature activation+1.51e+00

Pos logit contributions
arted+0.008
pered+0.008
lee+0.007
aled+0.007
anks+0.007
Neg logit contributions
str-0.006
 else-0.006
 slice-0.006
 mel-0.006
 flavor-0.005
Token sw
Token ID1509
Feature activation+9.90e+00

Pos logit contributions
arted+0.039
pered+0.039
lee+0.037
anks+0.034
aled+0.034
Neg logit contributions
 else-0.031
 slice-0.030
str-0.030
 mel-0.029
 flavor-0.027
Tokenam
Token ID321
Feature activation+4.72e+00

Pos logit contributions
arted+0.123
pered+0.118
idget+0.108
angers+0.108
 pupils+0.106
Neg logit contributions
lly-0.302
 else-0.286
w-0.282
 slice-0.278
str-0.278
Token back
Token ID736
Feature activation+2.81e+00

Pos logit contributions
arted+0.128
pered+0.125
lee+0.124
anks+0.121
aled+0.119
Neg logit contributions
 else-0.091
 slice-0.079
 mel-0.077
str-0.077
 that-0.076
Token to
Token ID284
Feature activation+4.29e-01

Pos logit contributions
arted+0.079
pered+0.077
lee+0.076
aled+0.074
anks+0.071
Neg logit contributions
 else-0.049
 mel-0.045
 slice-0.044
 that-0.043
str-0.042
Token shore
Token ID15191
Feature activation+2.39e+00

Pos logit contributions
arted+0.014
pered+0.014
lee+0.014
anks+0.013
aled+0.013
Neg logit contributions
 else-0.006
 mel-0.005
 slice-0.005
str-0.005
 jam-0.005
Token.
Token ID13
Feature activation+2.49e+00

Pos logit contributions
arted+0.073
pered+0.071
lee+0.068
aled+0.068
anks+0.065
Neg logit contributions
 else-0.040
 slice-0.036
 mel-0.036
str-0.035
 that-0.032
Token.
Token ID13
Feature activation+2.46e+00

Pos logit contributions
 else+0.010
str+0.010
 slice+0.010
 mel+0.010
ww+0.009
Neg logit contributions
arted-0.013
pered-0.013
lee-0.013
anks-0.012
aled-0.011
Token He
Token ID679
Feature activation+3.59e-01

Pos logit contributions
arted+0.055
pered+0.052
anks+0.049
lee+0.049
aled+0.046
Neg logit contributions
 else-0.059
 slice-0.058
 mel-0.058
 steak-0.056
 flavor-0.054
Token jumped
Token ID11687
Feature activation+6.15e+00

Pos logit contributions
arted+0.010
pered+0.010
lee+0.010
aled+0.009
anks+0.009
Neg logit contributions
 else-0.007
 slice-0.007
str-0.006
 mel-0.006
 flavor-0.006
Token in
Token ID287
Feature activation-3.05e-01

Pos logit contributions
arted+0.156
pered+0.153
lee+0.149
anks+0.140
aled+0.137
Neg logit contributions
 else-0.131
 slice-0.125
 mel-0.124
str-0.121
 delicious-0.112
Token and
Token ID290
Feature activation+9.90e-01

Pos logit contributions
str+0.006
 else+0.006
 slice+0.006
 mel+0.006
ww+0.005
Neg logit contributions
arted-0.008
pered-0.008
lee-0.008
aled-0.007
 heads-0.007
Token sw
Token ID1509
Feature activation+9.88e+00

Pos logit contributions
arted+0.027
pered+0.026
lee+0.025
aled+0.023
 heads+0.023
Neg logit contributions
 slice-0.019
str-0.019
 mel-0.018
body-0.018
 else-0.018
Tokenam
Token ID321
Feature activation+4.17e+00

Pos logit contributions
pered+0.116
arted+0.115
anks+0.100
 asleep+0.097
 commands+0.097
Neg logit contributions
lly-0.302
 else-0.300
w-0.299
 slice-0.299
 mel-0.292
Token down
Token ID866
Feature activation+2.57e-01

Pos logit contributions
arted+0.117
pered+0.116
lee+0.114
anks+0.110
aled+0.107
Neg logit contributions
 else-0.079
 slice-0.070
 mel-0.070
str-0.068
 delicious-0.064
Token and
Token ID290
Feature activation+1.05e+00

Pos logit contributions
arted+0.008
pered+0.008
lee+0.007
anks+0.007
aled+0.007
Neg logit contributions
 else-0.004
 slice-0.004
 mel-0.004
str-0.004
 jam-0.004
Token saw
Token ID2497
Feature activation-2.04e+00

Pos logit contributions
arted+0.030
pered+0.030
lee+0.028
aled+0.026
anks+0.026
Neg logit contributions
 else-0.019
 slice-0.019
str-0.018
 mel-0.018
 flavor-0.016
Token a
Token ID257
Feature activation-2.13e+00

Pos logit contributions
str+0.029
 slice+0.028
 else+0.026
ww+0.025
 mel+0.025
Neg logit contributions
arted-0.066
pered-0.066
lee-0.064
aled-0.059
anks-0.059
Token,
Token ID11
Feature activation-4.12e+00

Pos logit contributions
 else+0.074
str+0.071
 slice+0.070
arest+0.069
body+0.064
Neg logit contributions
arted-0.088
pered-0.087
lee-0.086
 shoulders-0.079
 heads-0.078
Token he
Token ID339
Feature activation+1.87e-01

Pos logit contributions
arest+0.119
str+0.118
 else+0.112
 slice+0.112
prise+0.111
Neg logit contributions
arted-0.065
lee-0.058
pered-0.057
aled-0.050
anks-0.050
Token leaned
Token ID23831
Feature activation+4.22e+00

Pos logit contributions
arted+0.005
pered+0.005
lee+0.005
anks+0.005
aled+0.005
Neg logit contributions
 else-0.003
 mel-0.003
 slice-0.003
str-0.003
 jam-0.003
Token over
Token ID625
Feature activation-2.20e-02

Pos logit contributions
arted+0.104
pered+0.103
lee+0.100
anks+0.093
aled+0.090
Neg logit contributions
 else-0.092
 slice-0.090
 mel-0.088
str-0.088
 flavor-0.080
Token and
Token ID290
Feature activation+1.73e+00

Pos logit contributions
str+0.000
 else+0.000
 slice+0.000
arest+0.000
ww+0.000
Neg logit contributions
arted-0.001
pered-0.001
lee-0.001
aled-0.001
 heads-0.001
Token sw
Token ID1509
Feature activation+9.85e+00

Pos logit contributions
arted+0.043
pered+0.041
lee+0.040
aled+0.037
 heads+0.037
Neg logit contributions
 slice-0.035
str-0.035
iest-0.034
body-0.034
 else-0.034
Tokenatted
Token ID16898
Feature activation+2.18e+00

Pos logit contributions
arted+0.113
pered+0.110
lee+0.101
anks+0.089
pering+0.086
Neg logit contributions
w-0.318
 else-0.317
 slice-0.315
 mel-0.308
lly-0.307
Token the
Token ID262
Feature activation+1.53e-01

Pos logit contributions
arted+0.069
pered+0.069
lee+0.066
anks+0.062
aled+0.062
Neg logit contributions
 else-0.033
 slice-0.032
str-0.031
 mel-0.031
 flavor-0.027
Token mosquito
Token ID35446
Feature activation+2.12e+00

Pos logit contributions
arted+0.005
pered+0.005
 heads+0.004
lee+0.004
 shoulders+0.004
Neg logit contributions
 else-0.002
str-0.002
 mel-0.002
 slice-0.002
prise-0.002
Token with
Token ID351
Feature activation-1.48e+00

Pos logit contributions
arted+0.059
pered+0.057
lee+0.057
anks+0.053
aled+0.051
Neg logit contributions
 else-0.042
 slice-0.040
 mel-0.037
str-0.036
 jam-0.035
Token his
Token ID465
Feature activation+1.85e+00

Pos logit contributions
 else+0.030
str+0.029
 slice+0.029
 mel+0.028
ww+0.026
Neg logit contributions
arted-0.039
pered-0.039
lee-0.037
aled-0.035
anks-0.034
Token and
Token ID290
Feature activation-1.64e+00

Pos logit contributions
arted+0.061
pered+0.061
lee+0.058
anks+0.056
aled+0.054
Neg logit contributions
 else-0.040
 slice-0.038
str-0.038
 mel-0.037
 flavor-0.033
Token Mia
Token ID32189
Feature activation+1.65e-01

Pos logit contributions
 else+0.036
 slice+0.035
 mel+0.034
str+0.034
 jam+0.031
Neg logit contributions
arted-0.041
pered-0.041
lee-0.039
anks-0.036
aled-0.036
Token grabbed
Token ID13176
Feature activation+1.97e+00

Pos logit contributions
arted+0.005
pered+0.005
lee+0.005
aled+0.005
 heads+0.004
Neg logit contributions
 else-0.003
 slice-0.003
str-0.002
 mel-0.002
ww-0.002
Token Leo
Token ID19632
Feature activation+1.22e+00

Pos logit contributions
arted+0.063
pered+0.062
lee+0.060
anks+0.056
aled+0.056
Neg logit contributions
 else-0.030
str-0.029
 slice-0.028
 mel-0.028
 flavor-0.024
Token and
Token ID290
Feature activation+3.01e-01

Pos logit contributions
pered+0.029
arted+0.028
lee+0.027
 heads+0.025
sey+0.025
Neg logit contributions
str-0.028
 else-0.027
 mel-0.026
 slice-0.026
ww-0.025
Token sw
Token ID1509
Feature activation+9.83e+00

Pos logit contributions
arted+0.008
pered+0.008
lee+0.008
 heads+0.007
aled+0.007
Neg logit contributions
str-0.006
 else-0.006
 slice-0.006
arest-0.006
body-0.006
Tokenam
Token ID321
Feature activation+4.30e+00

Pos logit contributions
arted+0.121
pered+0.112
lee+0.110
angers+0.108
 asleep+0.106
Neg logit contributions
w-0.294
 slice-0.282
 else-0.276
lly-0.275
 mel-0.269
Token to
Token ID284
Feature activation-7.44e-02

Pos logit contributions
arted+0.115
lee+0.114
pered+0.113
anks+0.108
aled+0.103
Neg logit contributions
 else-0.086
 slice-0.078
 mel-0.075
str-0.074
 that-0.070
Token the
Token ID262
Feature activation+1.26e+00

Pos logit contributions
 else+0.001
str+0.001
 slice+0.001
 mel+0.001
ww+0.001
Neg logit contributions
arted-0.002
pered-0.002
lee-0.002
anks-0.002
aled-0.002
Token edge
Token ID5743
Feature activation-1.02e+00

Pos logit contributions
arted+0.031
pered+0.031
lee+0.029
anks+0.027
aled+0.027
Neg logit contributions
 else-0.028
str-0.027
 slice-0.027
 mel-0.026
 flavor-0.024
Token of
Token ID286
Feature activation-1.33e+00

Pos logit contributions
str+0.014
 else+0.013
 slice+0.012
 mel+0.012
ww+0.011
Neg logit contributions
arted-0.035
pered-0.034
lee-0.032
aled-0.031
 heads-0.030
Token buzz
Token ID14713
Feature activation+4.00e+00

Pos logit contributions
arted+0.030
pered+0.030
lee+0.029
aled+0.027
anks+0.026
Neg logit contributions
 slice-0.020
str-0.020
 else-0.020
 mel-0.020
 flavor-0.018
Tokened
Token ID276
Feature activation+3.00e+00

Pos logit contributions
arted+0.075
pered+0.075
lee+0.069
anks+0.062
aled+0.062
Neg logit contributions
 else-0.112
 slice-0.109
str-0.109
 mel-0.108
 flavor-0.101
Token,
Token ID11
Feature activation+1.70e+00

Pos logit contributions
anks+0.089
arted+0.088
lee+0.087
pered+0.086
angers+0.084
Neg logit contributions
 else-0.048
 mel-0.046
 slice-0.046
 delicious-0.041
 that-0.040
Token as
Token ID355
Feature activation-2.40e+00

Pos logit contributions
arted+0.043
pered+0.043
lee+0.041
anks+0.039
aled+0.038
Neg logit contributions
 else-0.036
 slice-0.035
str-0.034
 mel-0.034
 jam-0.031
Token it
Token ID340
Feature activation+2.50e+00

Pos logit contributions
 slice+0.036
str+0.036
 else+0.036
 mel+0.035
ww+0.032
Neg logit contributions
pered-0.074
arted-0.074
lee-0.069
aled-0.065
anks-0.064
Token sw
Token ID1509
Feature activation+9.83e+00

Pos logit contributions
arted+0.064
pered+0.062
lee+0.059
 heads+0.058
aled+0.057
Neg logit contributions
str-0.051
 mel-0.050
 slice-0.049
arest-0.048
body-0.047
Tokenarmed
Token ID12026
Feature activation+8.99e-01

Pos logit contributions
lee+0.114
arted+0.110
pered+0.104
idget+0.090
anks+0.089
Neg logit contributions
w-0.315
 slice-0.307
 mel-0.303
 else-0.301
str-0.301
Token out
Token ID503
Feature activation-1.69e+00

Pos logit contributions
arted+0.025
pered+0.025
lee+0.024
aled+0.022
anks+0.022
Neg logit contributions
 else-0.017
 slice-0.016
str-0.016
 mel-0.016
 flavor-0.015
Token of
Token ID286
Feature activation-2.61e+00

Pos logit contributions
 else+0.028
 slice+0.027
str+0.027
 mel+0.027
 flavor+0.024
Neg logit contributions
arted-0.051
pered-0.051
lee-0.048
anks-0.046
aled-0.045
Token the
Token ID262
Feature activation+1.25e+00

Pos logit contributions
 else+0.052
str+0.050
 slice+0.049
 mel+0.048
ww+0.045
Neg logit contributions
arted-0.071
pered-0.071
lee-0.067
aled-0.062
anks-0.061
Token log
Token ID2604
Feature activation+9.58e-01

Pos logit contributions
arted+0.034
pered+0.034
lee+0.032
anks+0.030
 heads+0.029
Neg logit contributions
 else-0.025
str-0.024
 slice-0.024
 mel-0.023
 flavor-0.021
Token and
Token ID290
Feature activation-7.48e-01

Pos logit contributions
arted+0.038
pered+0.037
lee+0.035
aled+0.033
anks+0.033
Neg logit contributions
str-0.024
 else-0.024
 slice-0.024
 mel-0.024
 flavor-0.021
Token he
Token ID339
Feature activation+7.29e-01

Pos logit contributions
str+0.013
 slice+0.013
 else+0.013
 mel+0.012
arest+0.012
Neg logit contributions
arted-0.021
pered-0.021
lee-0.020
aled-0.019
anks-0.018
Token and
Token ID290
Feature activation-5.01e-01

Pos logit contributions
arted+0.020
pered+0.020
lee+0.019
anks+0.018
aled+0.018
Neg logit contributions
 else-0.014
 slice-0.013
str-0.013
 mel-0.013
 flavor-0.012
Token Be
Token ID1355
Feature activation+2.31e+00

Pos logit contributions
 else+0.012
 slice+0.011
str+0.011
 mel+0.011
 flavor+0.010
Neg logit contributions
arted-0.012
pered-0.012
lee-0.011
anks-0.010
aled-0.010
Tokencca
Token ID13227
Feature activation+1.16e+00

Pos logit contributions
arted+0.058
pered+0.057
lee+0.054
anks+0.051
aled+0.050
Neg logit contributions
 else-0.050
 slice-0.048
str-0.048
 mel-0.048
 flavor-0.044
Token sw
Token ID1509
Feature activation+9.82e+00

Pos logit contributions
arted+0.031
pered+0.031
lee+0.030
anks+0.027
aled+0.027
Neg logit contributions
 else-0.022
str-0.022
 slice-0.022
 mel-0.022
ww-0.020
Tokenam
Token ID321
Feature activation+4.38e+00

Pos logit contributions
pered+0.113
 Ghosts+0.111
pering+0.108
arted+0.107
 commands+0.101
Neg logit contributions
w-0.282
 slice-0.274
 mel-0.269
str-0.266
lly-0.265
Token together
Token ID1978
Feature activation-6.14e-01

Pos logit contributions
arted+0.112
pered+0.111
lee+0.109
anks+0.105
aled+0.102
Neg logit contributions
 else-0.092
 slice-0.082
 mel-0.081
str-0.080
 delicious-0.076
Token.
Token ID13
Feature activation+2.25e+00

Pos logit contributions
 else+0.011
 slice+0.011
 mel+0.010
str+0.010
 flavor+0.009
Neg logit contributions
arted-0.018
pered-0.017
lee-0.017
anks-0.016
aled-0.016
Token They
Token ID1119
Feature activation+6.54e-02

Pos logit contributions
pered+0.065
arted+0.064
lee+0.061
 heads+0.060
anks+0.059
Neg logit contributions
 else-0.037
str-0.037
 mel-0.036
 slice-0.034
 flavor-0.030
Token had
Token ID550
Feature activation+6.87e-01

Pos logit contributions
arted+0.002
pered+0.002
lee+0.002
aled+0.002
anks+0.002
Neg logit contributions
 else-0.001
 slice-0.001
str-0.001
 mel-0.001
 jam-0.001
Token\n
Token ID198
Feature activation-2.12e+00

Pos logit contributions
 slice+0.088
 mel+0.087
 jam+0.080
str+0.077
 else+0.077
Neg logit contributions
arted-0.071
pered-0.069
lee-0.068
sey-0.059
aled-0.059
TokenThey
Token ID2990
Feature activation-8.14e-01

Pos logit contributions
 slice+0.030
 mel+0.029
 else+0.029
str+0.028
 flavor+0.026
Neg logit contributions
arted-0.068
pered-0.067
lee-0.065
 heads-0.060
aled-0.060
Token sw
Token ID1509
Feature activation+9.24e+00

Pos logit contributions
 else+0.016
 slice+0.016
str+0.016
 mel+0.015
 jam+0.014
Neg logit contributions
arted-0.022
pered-0.022
lee-0.021
aled-0.019
anks-0.019
Tokenam
Token ID321
Feature activation+4.83e+00

Pos logit contributions
pered+0.114
arted+0.111
 asleep+0.107
lee+0.105
pering+0.104
Neg logit contributions
w-0.272
 slice-0.260
str-0.259
lly-0.257
ww-0.249
Token and
Token ID290
Feature activation+9.32e-01

Pos logit contributions
arted+0.127
pered+0.126
lee+0.122
anks+0.114
aled+0.112
Neg logit contributions
 else-0.100
 slice-0.094
str-0.093
 mel-0.092
 flavor-0.084
Token sw
Token ID1509
Feature activation+9.79e+00

Pos logit contributions
pered+0.024
arted+0.024
lee+0.023
anks+0.022
aled+0.021
Neg logit contributions
 else-0.019
 mel-0.018
 slice-0.018
str-0.018
 flavor-0.017
Tokenam
Token ID321
Feature activation+4.83e+00

Pos logit contributions
arted+0.113
pered+0.108
lee+0.106
anks+0.093
idget+0.090
Neg logit contributions
w-0.307
 else-0.305
 slice-0.302
lly-0.301
str-0.300
Token until
Token ID1566
Feature activation-2.71e+00

Pos logit contributions
arted+0.143
pered+0.139
lee+0.139
anks+0.131
aled+0.129
Neg logit contributions
 else-0.086
 slice-0.076
str-0.076
 mel-0.074
 delicious-0.068
Token they
Token ID484
Feature activation+2.56e-02

Pos logit contributions
 else+0.037
 slice+0.035
 mel+0.034
str+0.034
 flavor+0.030
Neg logit contributions
arted-0.090
pered-0.089
lee-0.086
aled-0.080
anks-0.079
Token were
Token ID547
Feature activation+3.41e+00

Pos logit contributions
arted+0.001
pered+0.001
lee+0.001
aled+0.001
am+0.001
Neg logit contributions
 slice-0.000
 mel-0.000
ww-0.000
 jam-0.000
body-0.000
Token tired
Token ID10032
Feature activation-4.18e-01

Pos logit contributions
arted+0.087
pered+0.085
lee+0.084
anks+0.080
aled+0.077
Neg logit contributions
 else-0.074
 slice-0.068
str-0.067
 mel-0.066
 delicious-0.063
Token\n
Token ID198
Feature activation-2.43e+00

Pos logit contributions
 slice+0.054
 else+0.053
 mel+0.053
str+0.051
 jam+0.047
Neg logit contributions
pered-0.084
arted-0.084
lee-0.080
anks-0.074
aled-0.074
TokenThe
Token ID464
Feature activation-1.25e+00

Pos logit contributions
 slice+0.045
 mel+0.043
 happened+0.041
 else+0.039
str+0.038
Neg logit contributions
pered-0.066
arted-0.065
lee-0.060
 commands-0.057
aled-0.056
Token ducks
Token ID39694
Feature activation-6.09e-01

Pos logit contributions
 else+0.024
str+0.024
 slice+0.023
 mel+0.022
aster+0.021
Neg logit contributions
arted-0.034
pered-0.034
lee-0.032
 heads-0.030
aled-0.029
Token hurried
Token ID31662
Feature activation+3.93e+00

Pos logit contributions
 else+0.011
 slice+0.011
str+0.010
 mel+0.010
 jam+0.009
Neg logit contributions
arted-0.018
pered-0.017
lee-0.017
aled-0.015
anks-0.015
Tokenly
Token ID306
Feature activation+1.03e+00

Pos logit contributions
pered+0.107
lee+0.107
arted+0.103
anks+0.103
aled+0.094
Neg logit contributions
 else-0.070
 mel-0.066
str-0.066
 that-0.063
 slice-0.062
Token sw
Token ID1509
Feature activation+9.73e+00

Pos logit contributions
arted+0.027
pered+0.027
lee+0.025
aled+0.023
anks+0.023
Neg logit contributions
 else-0.021
 slice-0.021
str-0.021
 mel-0.020
 flavor-0.019
Tokenam
Token ID321
Feature activation+4.63e+00

Pos logit contributions
lee+0.136
idget+0.130
 weeping+0.130
arted+0.128
 Ghosts+0.123
Neg logit contributions
w-0.264
lly-0.256
 cream-0.240
ww-0.239
 bread-0.237
Token away
Token ID1497
Feature activation-1.27e-01

Pos logit contributions
arted+0.120
pered+0.119
lee+0.118
anks+0.111
aled+0.107
Neg logit contributions
 else-0.096
 slice-0.088
str-0.088
 mel-0.087
 delicious-0.080
Token,
Token ID11
Feature activation-5.43e-01

Pos logit contributions
 else+0.002
 slice+0.002
str+0.002
 mel+0.002
 flavor+0.002
Neg logit contributions
arted-0.004
pered-0.004
lee-0.004
anks-0.003
aled-0.003
Token but
Token ID475
Feature activation-2.74e+00

Pos logit contributions
 else+0.011
 slice+0.011
str+0.011
 mel+0.011
 flavor+0.010
Neg logit contributions
arted-0.014
pered-0.014
lee-0.013
anks-0.013
aled-0.012
Token the
Token ID262
Feature activation-7.30e-01

Pos logit contributions
 slice+0.050
 else+0.048
 mel+0.046
str+0.045
arest+0.043
Neg logit contributions
arted-0.078
pered-0.076
lee-0.073
 heads-0.069
sey-0.067
Token and
Token ID290
Feature activation-3.88e-01

Pos logit contributions
arted+0.075
pered+0.074
lee+0.071
anks+0.067
aled+0.066
Neg logit contributions
 else-0.049
 slice-0.046
str-0.045
 mel-0.045
 jam-0.040
Token Mia
Token ID32189
Feature activation+1.88e+00

Pos logit contributions
 else+0.009
 slice+0.008
str+0.008
 mel+0.008
 flavor+0.008
Neg logit contributions
arted-0.009
pered-0.009
lee-0.009
anks-0.008
aled-0.008
Token in
Token ID287
Feature activation-5.40e-01

Pos logit contributions
arted+0.057
pered+0.054
lee+0.054
anks+0.051
aled+0.051
Neg logit contributions
 else-0.033
 slice-0.031
 mel-0.029
str-0.028
 that-0.026
Token trouble
Token ID5876
Feature activation+1.78e+00

Pos logit contributions
 else+0.008
str+0.008
 slice+0.007
 mel+0.007
ww+0.006
Neg logit contributions
pered-0.017
arted-0.017
lee-0.017
aled-0.016
 heads-0.015
Token and
Token ID290
Feature activation-4.48e-01

Pos logit contributions
arted+0.053
pered+0.052
lee+0.050
aled+0.048
anks+0.048
Neg logit contributions
 else-0.032
 slice-0.029
 mel-0.028
str-0.028
 flavor-0.025
Token sw
Token ID1509
Feature activation+9.70e+00

Pos logit contributions
str+0.009
 slice+0.009
 else+0.009
 mel+0.009
ww+0.008
Neg logit contributions
arted-0.012
pered-0.012
lee-0.011
aled-0.010
 heads-0.010
Tokenam
Token ID321
Feature activation+4.25e+00

Pos logit contributions
arted+0.125
lee+0.112
 asleep+0.106
pered+0.106
angers+0.106
Neg logit contributions
w-0.283
lly-0.275
 else-0.271
 slice-0.270
str-0.258
Token to
Token ID284
Feature activation-4.16e-03

Pos logit contributions
lee+0.122
arted+0.120
pered+0.118
anks+0.116
aled+0.110
Neg logit contributions
 else-0.079
 slice-0.067
str-0.065
 mel-0.064
 that-0.062
Token them
Token ID606
Feature activation+1.84e+00

Pos logit contributions
 else+0.000
 slice+0.000
str+0.000
 mel+0.000
 flavor+0.000
Neg logit contributions
arted-0.000
pered-0.000
lee-0.000
anks-0.000
aled-0.000
Token.
Token ID13
Feature activation+3.35e+00

Pos logit contributions
arted+0.064
lee+0.062
pered+0.060
aled+0.058
anks+0.057
Neg logit contributions
 else-0.026
 that-0.022
 slice-0.022
 mel-0.020
str-0.018
Token It
Token ID632
Feature activation-3.95e-01

Pos logit contributions
arted+0.072
pered+0.069
lee+0.068
anks+0.063
aled+0.060
Neg logit contributions
 else-0.087
 mel-0.082
str-0.082
 slice-0.081
 jam-0.075
Token Jill
Token ID22772
Feature activation+1.73e+00

Pos logit contributions
arted+0.090
pered+0.088
lee+0.085
anks+0.081
aled+0.078
Neg logit contributions
 else-0.068
 slice-0.066
 mel-0.065
str-0.061
 jam-0.059
Token's
Token ID338
Feature activation+4.36e+00

Pos logit contributions
arted+0.045
pered+0.044
lee+0.042
anks+0.040
aled+0.039
Neg logit contributions
 else-0.037
 slice-0.035
 mel-0.034
str-0.034
 flavor-0.031
Token hand
Token ID1021
Feature activation+1.76e+00

Pos logit contributions
arted+0.111
pered+0.107
lee+0.103
anks+0.097
aled+0.097
Neg logit contributions
 else-0.096
 slice-0.094
 mel-0.093
str-0.090
 flavor-0.084
Token.
Token ID13
Feature activation+3.69e+00

Pos logit contributions
arted+0.060
lee+0.057
pered+0.057
aled+0.055
anks+0.055
Neg logit contributions
 else-0.025
 slice-0.023
 mel-0.021
 that-0.020
 delicious-0.018
Token They
Token ID1119
Feature activation+1.04e+00

Pos logit contributions
aned+0.084
 breaths+0.083
arted+0.082
sey+0.082
 heads+0.080
Neg logit contributions
 else-0.078
 that-0.076
 mel-0.076
 slice-0.074
 fruit-0.069
Token sw
Token ID1509
Feature activation+9.70e+00

Pos logit contributions
arted+0.028
pered+0.028
lee+0.026
anks+0.025
aled+0.025
Neg logit contributions
 else-0.021
 slice-0.020
str-0.020
 mel-0.020
 flavor-0.018
Tokenam
Token ID321
Feature activation+5.25e+00

Pos logit contributions
arted+0.123
pered+0.117
 asleep+0.114
lee+0.112
 Ghosts+0.112
Neg logit contributions
w-0.298
 slice-0.283
 mel-0.270
lly-0.268
 else-0.264
Token back
Token ID736
Feature activation+2.92e+00

Pos logit contributions
lee+0.143
arted+0.141
pered+0.138
anks+0.132
aled+0.125
Neg logit contributions
 else-0.106
 slice-0.096
 mel-0.091
str-0.090
 that-0.085
Token to
Token ID284
Feature activation+3.50e-02

Pos logit contributions
arted+0.091
pered+0.090
lee+0.089
anks+0.084
aled+0.082
Neg logit contributions
 else-0.046
 mel-0.041
 slice-0.040
str-0.040
 that-0.036
Token the
Token ID262
Feature activation+7.89e-01

Pos logit contributions
arted+0.001
pered+0.001
lee+0.001
anks+0.001
aled+0.001
Neg logit contributions
 else-0.000
 slice-0.000
 mel-0.000
str-0.000
 jam-0.000
Token boat
Token ID8848
Feature activation+2.33e+00

Pos logit contributions
pered+0.022
arted+0.022
lee+0.021
 heads+0.020
anks+0.019
Neg logit contributions
 else-0.014
str-0.014
 slice-0.014
 mel-0.013
ww-0.012
Token.
Token ID13
Feature activation+3.54e+00

Pos logit contributions
arted+0.014
pered+0.014
lee+0.013
anks+0.012
aled+0.012
Neg logit contributions
 else-0.011
str-0.011
 slice-0.011
 mel-0.011
ww-0.010
Token He
Token ID679
Feature activation+6.87e-01

Pos logit contributions
idget+0.083
lee+0.082
 heads+0.081
arted+0.078
sey+0.078
Neg logit contributions
 mel-0.069
 that-0.067
str-0.063
 fruit-0.062
 delicious-0.062
Token sw
Token ID1509
Feature activation+8.98e+00

Pos logit contributions
arted+0.019
pered+0.018
lee+0.018
 heads+0.017
aled+0.016
Neg logit contributions
str-0.013
 slice-0.012
 mel-0.012
 else-0.012
aster-0.012
Tokenam
Token ID321
Feature activation+4.52e+00

Pos logit contributions
 asleep+0.108
arted+0.104
pered+0.104
 Ghosts+0.102
pering+0.096
Neg logit contributions
w-0.266
 slice-0.263
 mel-0.253
lly-0.251
 else-0.242
Token and
Token ID290
Feature activation+1.77e+00

Pos logit contributions
arted+0.118
pered+0.117
lee+0.114
anks+0.108
aled+0.104
Neg logit contributions
 else-0.094
 slice-0.088
str-0.086
 mel-0.086
 delicious-0.079
Token sw
Token ID1509
Feature activation+9.68e+00

Pos logit contributions
arted+0.048
pered+0.047
lee+0.045
aled+0.042
anks+0.042
Neg logit contributions
 else-0.035
 slice-0.034
str-0.034
 mel-0.033
 flavor-0.030
Tokenam
Token ID321
Feature activation+4.47e+00

Pos logit contributions
arted+0.107
pered+0.104
lee+0.100
 asleep+0.095
 Ghosts+0.094
Neg logit contributions
w-0.304
 slice-0.301
 else-0.295
lly-0.293
 mel-0.291
Token until
Token ID1566
Feature activation-2.17e+00

Pos logit contributions
arted+0.132
pered+0.129
lee+0.129
anks+0.122
aled+0.119
Neg logit contributions
 else-0.079
 slice-0.069
 mel-0.068
str-0.068
 delicious-0.063
Token he
Token ID339
Feature activation+1.09e+00

Pos logit contributions
 else+0.029
 slice+0.028
str+0.027
 mel+0.027
 flavor+0.023
Neg logit contributions
arted-0.073
pered-0.072
lee-0.069
aled-0.066
anks-0.066
Token reached
Token ID4251
Feature activation+2.26e+00

Pos logit contributions
arted+0.029
pered+0.028
lee+0.028
am+0.026
aled+0.026
Neg logit contributions
 slice-0.020
 mel-0.020
str-0.019
ww-0.019
arest-0.019
Token the
Token ID262
Feature activation+1.56e+00

Pos logit contributions
arted+0.077
pered+0.077
lee+0.074
anks+0.070
aled+0.070
Neg logit contributions
 else-0.029
 slice-0.027
str-0.027
 mel-0.026
 flavor-0.022
Token.
Token ID13
Feature activation+2.44e+00

Pos logit contributions
arted+0.052
aled+0.050
pered+0.049
angers+0.049
anks+0.049
Neg logit contributions
 else-0.020
 slice-0.017
 that-0.016
 mel-0.016
str-0.014
Token
Token ID198
Feature activation-1.90e+00

Pos logit contributions
arted+0.079
pered+0.078
lee+0.075
aled+0.075
anks+0.074
Neg logit contributions
 else-0.034
str-0.032
 mel-0.031
 slice-0.028
 delicious-0.025
Token
Token ID198
Feature activation-1.58e+00

Pos logit contributions
 slice+0.033
 else+0.033
 mel+0.032
str+0.031
 jam+0.029
Neg logit contributions
pered-0.055
arted-0.055
lee-0.052
anks-0.049
aled-0.048
TokenThey
Token ID2990
Feature activation+3.86e-01

Pos logit contributions
 else+0.026
 slice+0.025
str+0.024
 mel+0.024
 flavor+0.021
Neg logit contributions
arted-0.048
pered-0.048
lee-0.046
anks-0.043
aled-0.043
Token quickly
Token ID2952
Feature activation+1.79e+00

Pos logit contributions
arted+0.010
pered+0.010
lee+0.009
anks+0.009
aled+0.009
Neg logit contributions
 else-0.008
 slice-0.008
str-0.008
 mel-0.008
 flavor-0.007
Token sw
Token ID1509
Feature activation+9.64e+00

Pos logit contributions
arted+0.049
pered+0.048
lee+0.046
aled+0.043
anks+0.042
Neg logit contributions
 else-0.035
 slice-0.034
str-0.034
 mel-0.033
 flavor-0.030
Tokenam
Token ID321
Feature activation+4.81e+00

Pos logit contributions
 Ghosts+0.109
lee+0.108
angers+0.106
 asleep+0.106
arted+0.105
Neg logit contributions
w-0.296
 slice-0.283
 mel-0.276
str-0.273
ww-0.273
Token to
Token ID284
Feature activation+5.53e-01

Pos logit contributions
arted+0.113
lee+0.113
pered+0.112
anks+0.106
aled+0.102
Neg logit contributions
 else-0.109
 slice-0.101
 mel-0.099
str-0.099
 delicious-0.093
Token the
Token ID262
Feature activation+9.88e-01

Pos logit contributions
arted+0.018
pered+0.018
lee+0.017
anks+0.017
aled+0.016
Neg logit contributions
 else-0.008
 slice-0.007
 mel-0.007
str-0.007
 delicious-0.006
Token edge
Token ID5743
Feature activation-5.67e-01

Pos logit contributions
pered+0.025
arted+0.025
lee+0.023
 heads+0.022
anks+0.022
Neg logit contributions
str-0.020
 else-0.020
 slice-0.020
 mel-0.019
ww-0.018
Token of
Token ID286
Feature activation-1.42e+00

Pos logit contributions
str+0.008
 else+0.008
 slice+0.007
 mel+0.007
ww+0.007
Neg logit contributions
arted-0.019
pered-0.019
lee-0.018
aled-0.016
anks-0.016
Token friends
Token ID2460
Feature activation-4.63e-02

Pos logit contributions
arted+0.056
pered+0.055
lee+0.051
aled+0.050
anks+0.048
Neg logit contributions
 else-0.048
 mel-0.047
 slice-0.046
str-0.044
 flavor-0.043
Token and
Token ID290
Feature activation-1.05e-01

Pos logit contributions
 else+0.001
 slice+0.001
str+0.001
 mel+0.001
ww+0.001
Neg logit contributions
arted-0.001
pered-0.001
lee-0.001
anks-0.001
aled-0.001
Token watched
Token ID7342
Feature activation+2.05e+00

Pos logit contributions
 else+0.002
 slice+0.002
str+0.002
 mel+0.002
 flavor+0.002
Neg logit contributions
arted-0.003
pered-0.003
lee-0.003
anks-0.003
aled-0.003
Token as
Token ID355
Feature activation-3.00e+00

Pos logit contributions
arted+0.058
pered+0.056
lee+0.054
anks+0.052
aled+0.051
Neg logit contributions
 else-0.038
 slice-0.036
 mel-0.036
str-0.034
 delicious-0.032
Token they
Token ID484
Feature activation+7.29e-01

Pos logit contributions
 else+0.036
 slice+0.032
 mel+0.032
str+0.032
 flavor+0.026
Neg logit contributions
arted-0.106
pered-0.106
lee-0.101
anks-0.097
aled-0.096
Token sw
Token ID1509
Feature activation+9.63e+00

Pos logit contributions
arted+0.018
pered+0.017
lee+0.017
aled+0.015
anks+0.015
Neg logit contributions
 else-0.016
 slice-0.016
str-0.016
 mel-0.016
 flavor-0.015
Tokenam
Token ID321
Feature activation+4.71e+00

Pos logit contributions
pered+0.102
arted+0.101
lee+0.095
anks+0.083
 Ghosts+0.081
Neg logit contributions
 slice-0.315
 else-0.310
w-0.307
 mel-0.305
lly-0.303
Token away
Token ID1497
Feature activation+1.23e-01

Pos logit contributions
arted+0.121
pered+0.118
lee+0.116
anks+0.110
aled+0.108
Neg logit contributions
 else-0.098
 slice-0.091
 mel-0.089
str-0.088
 delicious-0.082
Token.
Token ID13
Feature activation+2.28e+00

Pos logit contributions
arted+0.004
pered+0.004
lee+0.003
aled+0.003
anks+0.003
Neg logit contributions
 else-0.002
 slice-0.002
 mel-0.002
str-0.002
 delicious-0.002
Token
Token ID198
Feature activation-2.65e+00

Pos logit contributions
pered+0.059
arted+0.059
lee+0.055
 heads+0.054
 breaths+0.052
Neg logit contributions
 else-0.046
str-0.045
 mel-0.044
 slice-0.042
 that-0.041
Token
Token ID198
Feature activation-1.76e+00

Pos logit contributions
 mel+0.051
 slice+0.051
 else+0.047
str+0.047
 flavor+0.046
Neg logit contributions
pered-0.071
arted-0.068
lee-0.067
aled-0.063
anks-0.060
Token,
Token ID11
Feature activation-2.09e+00

Pos logit contributions
 slice+0.034
str+0.034
 else+0.033
 mel+0.033
ww+0.030
Neg logit contributions
arted-0.061
pered-0.061
lee-0.059
anks-0.054
 heads-0.053
Token she
Token ID673
Feature activation+1.16e+00

Pos logit contributions
 else+0.029
 slice+0.027
str+0.027
 mel+0.027
 flavor+0.023
Neg logit contributions
arted-0.069
pered-0.068
lee-0.066
anks-0.063
aled-0.062
Token jumped
Token ID11687
Feature activation+6.50e+00

Pos logit contributions
arted+0.033
pered+0.033
lee+0.031
anks+0.029
aled+0.029
Neg logit contributions
 else-0.022
 slice-0.021
str-0.020
 mel-0.020
 flavor-0.018
Token in
Token ID287
Feature activation+4.76e-01

Pos logit contributions
arted+0.185
lee+0.182
pered+0.173
anks+0.171
aled+0.167
Neg logit contributions
 else-0.110
 that-0.102
 mel-0.101
 slice-0.101
 delicious-0.098
Token and
Token ID290
Feature activation+1.23e+00

Pos logit contributions
arted+0.016
pered+0.015
lee+0.015
anks+0.014
aled+0.014
Neg logit contributions
 else-0.007
 slice-0.007
 mel-0.006
str-0.006
 jam-0.006
Token sw
Token ID1509
Feature activation+9.61e+00

Pos logit contributions
pered+0.034
arted+0.034
lee+0.032
anks+0.031
aled+0.030
Neg logit contributions
 else-0.024
 slice-0.022
str-0.022
 mel-0.022
 flavor-0.020
Tokenam
Token ID321
Feature activation+4.24e+00

Pos logit contributions
arted+0.114
pered+0.108
lee+0.096
anks+0.094
 Ghosts+0.090
Neg logit contributions
 slice-0.297
lly-0.296
 else-0.295
w-0.288
 mel-0.283
Token to
Token ID284
Feature activation+9.19e-01

Pos logit contributions
arted+0.129
lee+0.126
pered+0.123
anks+0.122
aled+0.120
Neg logit contributions
 else-0.071
 that-0.060
 slice-0.059
 mel-0.057
 delicious-0.057
Token the
Token ID262
Feature activation+1.48e+00

Pos logit contributions
arted+0.035
pered+0.034
lee+0.034
anks+0.033
aned+0.032
Neg logit contributions
 else-0.009
 mel-0.008
 slice-0.007
aster-0.007
 jam-0.006
Token bag
Token ID6131
Feature activation+1.42e+00

Pos logit contributions
arted+0.041
pered+0.040
lee+0.038
anks+0.036
aled+0.036
Neg logit contributions
 else-0.029
str-0.028
 slice-0.027
 mel-0.027
 flavor-0.024
Token.
Token ID13
Feature activation+2.78e+00

Pos logit contributions
arted+0.042
pered+0.042
lee+0.040
anks+0.038
aled+0.038
Neg logit contributions
 else-0.024
 slice-0.023
 mel-0.022
str-0.022
 flavor-0.020
Tokeno
Token ID78
Feature activation+2.20e+00

Pos logit contributions
arted+0.088
pered+0.088
lee+0.083
anks+0.078
aled+0.078
Neg logit contributions
 else-0.066
str-0.063
 slice-0.063
 mel-0.063
 flavor-0.056
Token stretched
Token ID19110
Feature activation+3.20e+00

Pos logit contributions
arted+0.063
pered+0.062
lee+0.059
anks+0.057
aled+0.055
Neg logit contributions
 else-0.040
 slice-0.038
str-0.038
 mel-0.038
 flavor-0.033
Token his
Token ID465
Feature activation+4.22e+00

Pos logit contributions
arted+0.091
lee+0.088
pered+0.087
anks+0.084
aled+0.081
Neg logit contributions
 else-0.057
 slice-0.053
 mel-0.052
str-0.051
aster-0.047
Token fins
Token ID42080
Feature activation+2.57e+00

Pos logit contributions
arted+0.082
pered+0.076
lee+0.072
idget+0.070
aled+0.070
Neg logit contributions
 else-0.118
 slice-0.112
 mel-0.112
str-0.108
 delicious-0.105
Token and
Token ID290
Feature activation+1.18e+00

Pos logit contributions
arted+0.073
pered+0.073
lee+0.071
anks+0.067
aled+0.067
Neg logit contributions
 else-0.047
 slice-0.043
 mel-0.042
str-0.041
 delicious-0.037
Token sw
Token ID1509
Feature activation+9.60e+00

Pos logit contributions
arted+0.032
pered+0.032
lee+0.030
aled+0.028
anks+0.028
Neg logit contributions
 else-0.023
 slice-0.022
str-0.022
 mel-0.022
 flavor-0.020
Tokenam
Token ID321
Feature activation+4.52e+00

Pos logit contributions
arted+0.129
idget+0.120
pered+0.118
lee+0.116
 Ghosts+0.114
Neg logit contributions
w-0.269
lly-0.265
 slice-0.259
 else-0.256
 mel-0.251
Token towards
Token ID3371
Feature activation-4.55e-01

Pos logit contributions
arted+0.128
lee+0.125
pered+0.124
anks+0.119
aled+0.116
Neg logit contributions
 else-0.086
 slice-0.074
 mel-0.073
str-0.073
 delicious-0.069
Token the
Token ID262
Feature activation+9.31e-01

Pos logit contributions
 else+0.006
 slice+0.005
str+0.005
 mel+0.005
 flavor+0.004
Neg logit contributions
arted-0.016
pered-0.015
lee-0.015
anks-0.014
aled-0.014
Token scary
Token ID14343
Feature activation+1.14e+00

Pos logit contributions
arted+0.031
pered+0.030
lee+0.029
anks+0.028
aled+0.028
Neg logit contributions
 else-0.013
 slice-0.012
str-0.012
 mel-0.012
 flavor-0.010
Token fish
Token ID5916
Feature activation+4.23e-01

Pos logit contributions
pered+0.033
arted+0.033
lee+0.032
 heads+0.030
anks+0.030
Neg logit contributions
 else-0.019
str-0.019
 slice-0.019
 mel-0.018
ww-0.016
Token animals
Token ID4695
Feature activation-7.63e-01

Pos logit contributions
 else+0.004
 mel+0.004
str+0.004
 slice+0.004
 delicious+0.003
Neg logit contributions
pered-0.006
arted-0.006
lee-0.006
anks-0.005
aled-0.005
Token,
Token ID11
Feature activation-1.24e+00

Pos logit contributions
 else+0.014
 slice+0.014
str+0.013
 mel+0.013
 flavor+0.012
Neg logit contributions
arted-0.022
pered-0.021
lee-0.020
anks-0.019
aled-0.019
Token and
Token ID290
Feature activation-2.01e+00

Pos logit contributions
 else+0.022
 mel+0.021
 slice+0.021
str+0.020
 delicious+0.019
Neg logit contributions
arted-0.036
pered-0.035
lee-0.034
anks-0.033
aled-0.032
Token they
Token ID484
Feature activation+1.40e-01

Pos logit contributions
 else+0.038
 slice+0.036
str+0.036
 mel+0.036
 flavor+0.032
Neg logit contributions
arted-0.056
pered-0.056
lee-0.053
anks-0.050
aled-0.049
Token quickly
Token ID2952
Feature activation+1.73e+00

Pos logit contributions
arted+0.004
pered+0.004
lee+0.004
anks+0.004
aled+0.003
Neg logit contributions
 else-0.003
str-0.002
 slice-0.002
 mel-0.002
 flavor-0.002
Token sw
Token ID1509
Feature activation+9.59e+00

Pos logit contributions
arted+0.047
pered+0.046
lee+0.044
anks+0.041
aled+0.041
Neg logit contributions
 else-0.035
 slice-0.033
str-0.033
 mel-0.033
 flavor-0.029
Tokenam
Token ID321
Feature activation+4.62e+00

Pos logit contributions
 asleep+0.131
angers+0.124
arted+0.123
pered+0.122
lee+0.121
Neg logit contributions
w-0.261
lly-0.260
 else-0.241
str-0.241
 slice-0.236
Token away
Token ID1497
Feature activation+2.32e-01

Pos logit contributions
arted+0.120
lee+0.120
pered+0.119
anks+0.113
aled+0.108
Neg logit contributions
 else-0.095
str-0.084
 slice-0.083
 mel-0.082
 delicious-0.078
Token.
Token ID13
Feature activation+2.45e+00

Pos logit contributions
arted+0.007
pered+0.007
lee+0.007
aled+0.006
anks+0.006
Neg logit contributions
 else-0.004
 slice-0.004
str-0.004
 mel-0.004
 delicious-0.003
Token But
Token ID887
Feature activation-2.51e+00

Pos logit contributions
pered+0.063
arted+0.062
 heads+0.061
aled+0.060
lee+0.059
Neg logit contributions
str-0.045
 else-0.045
 mel-0.043
 that-0.042
 delicious-0.040
Token D
Token ID360
Feature activation+2.33e+00

Pos logit contributions
 slice+0.053
 else+0.053
 mel+0.051
str+0.051
 flavor+0.047
Neg logit contributions
arted-0.065
pered-0.064
lee-0.060
aled-0.055
anks-0.055
Token
Token ID220
Feature activation+3.53e+00

Pos logit contributions
pered+0.052
arted+0.052
lee+0.049
anks+0.045
 heads+0.045
Neg logit contributions
 else-0.051
 mel-0.050
 slice-0.049
 that-0.047
 jam-0.047
Token
Token ID198
Feature activation-2.10e+00

Pos logit contributions
arted+0.107
pered+0.107
lee+0.102
anks+0.097
aled+0.096
Neg logit contributions
 else-0.057
 slice-0.055
str-0.054
 mel-0.054
 flavor-0.047
Token
Token ID198
Feature activation-1.59e+00

Pos logit contributions
 slice+0.038
 mel+0.038
 else+0.037
str+0.037
 jam+0.033
Neg logit contributions
arted-0.058
pered-0.058
lee-0.055
aled-0.051
anks-0.051
TokenShe
Token ID3347
Feature activation-4.98e-02

Pos logit contributions
 else+0.025
 slice+0.024
 mel+0.024
str+0.024
 flavor+0.021
Neg logit contributions
arted-0.049
pered-0.049
lee-0.046
aled-0.044
anks-0.043
Token cautiously
Token ID39640
Feature activation+7.94e-01

Pos logit contributions
 slice+0.001
 else+0.001
str+0.001
 mel+0.001
 jam+0.001
Neg logit contributions
arted-0.001
pered-0.001
lee-0.001
aled-0.001
 heads-0.001
Token sw
Token ID1509
Feature activation+9.59e+00

Pos logit contributions
arted+0.020
pered+0.020
lee+0.019
aled+0.017
 heads+0.017
Neg logit contributions
 else-0.017
str-0.017
 slice-0.017
 mel-0.016
ww-0.015
Tokenam
Token ID321
Feature activation+4.51e+00

Pos logit contributions
pered+0.128
arted+0.126
 weeping+0.113
pering+0.112
lee+0.110
Neg logit contributions
lly-0.277
w-0.271
 else-0.256
 slice-0.253
 mel-0.252
Token around
Token ID1088
Feature activation+8.31e-02

Pos logit contributions
arted+0.120
pered+0.119
lee+0.116
anks+0.111
aled+0.108
Neg logit contributions
 else-0.092
str-0.082
 slice-0.082
 mel-0.082
 delicious-0.076
Token and
Token ID290
Feature activation+6.88e-01

Pos logit contributions
arted+0.003
pered+0.003
lee+0.003
anks+0.003
aled+0.003
Neg logit contributions
 else-0.001
 slice-0.001
 mel-0.001
str-0.001
 jam-0.001
Token saw
Token ID2497
Feature activation-1.98e+00

Pos logit contributions
pered+0.021
arted+0.021
lee+0.020
anks+0.020
aled+0.019
Neg logit contributions
 else-0.011
str-0.010
 mel-0.010
 slice-0.010
 jam-0.009
Token lots
Token ID6041
Feature activation-2.85e+00

Pos logit contributions
 else+0.026
 slice+0.026
str+0.025
 mel+0.025
ww+0.021
Neg logit contributions
arted-0.066
pered-0.065
lee-0.063
anks-0.059
aled-0.059
Token chocolate
Token ID11311
Feature activation-8.13e-01

Pos logit contributions
str+0.043
arest+0.035
 else+0.034
w+0.033
iest+0.032
Neg logit contributions
 heads-0.062
lee-0.061
pered-0.061
 shoulders-0.060
arted-0.059
Token and
Token ID290
Feature activation-2.40e+00

Pos logit contributions
str+0.016
 else+0.015
body+0.013
arest+0.013
 unexpected+0.013
Neg logit contributions
pered-0.022
anks-0.021
 heads-0.021
arted-0.021
 shoulders-0.020
Token nuts
Token ID14380
Feature activation-1.84e+00

Pos logit contributions
str+0.044
 else+0.035
arest+0.034
body+0.033
w+0.032
Neg logit contributions
 heads-0.067
 shoulders-0.066
lee-0.066
pered-0.066
arted-0.063
Token.
Token ID13
Feature activation+2.45e+00

Pos logit contributions
str+0.046
aster+0.041
Little+0.041
body+0.040
 mel+0.040
Neg logit contributions
pered-0.040
arted-0.039
 heads-0.036
anks-0.036
lee-0.035
Token Which
Token ID9022
Feature activation-4.92e+00

Pos logit contributions
arted+0.055
pered+0.053
lee+0.051
anks+0.047
aled+0.047
Neg logit contributions
 else-0.060
 slice-0.059
 mel-0.058
str-0.055
 flavor-0.053
Token one
Token ID530
Feature activation-7.31e+00

Pos logit contributions
arest+0.121
str+0.111
inner+0.107
 else+0.105
body+0.104
Neg logit contributions
 shoulders-0.082
 heads-0.078
 eyes-0.077
 arms-0.073
 ears-0.072
Token do
Token ID466
Feature activation-3.23e+00

Pos logit contributions
str+0.144
arest+0.144
Little+0.132
ha+0.128
ww+0.127
Neg logit contributions
 heads-0.135
arted-0.133
am-0.132
 requests-0.130
anks-0.126
Token you
Token ID345
Feature activation-1.85e-01

Pos logit contributions
str+0.064
 mel+0.056
 slice+0.056
he+0.054
w+0.052
Neg logit contributions
pered-0.071
lee-0.069
anks-0.068
 shoulders-0.067
 heads-0.067
Token want
Token ID765
Feature activation-2.53e+00

Pos logit contributions
str+0.005
ww+0.005
 else+0.005
 mel+0.004
 slice+0.004
Neg logit contributions
pered-0.003
arted-0.003
lee-0.003
anks-0.003
 heads-0.003
Token,
Token ID11
Feature activation-2.63e+00

Pos logit contributions
str+0.056
arest+0.046
Little+0.046
ww+0.044
 else+0.043
Neg logit contributions
anks-0.055
pered-0.052
am-0.050
arted-0.050
stick-0.049
Token Tom
Token ID4186
Feature activation-2.35e+00

Pos logit contributions
str+0.076
w+0.070
arest+0.070
iest+0.069
 else+0.067
Neg logit contributions
arted-0.043
lee-0.042
 shoulders-0.040
pered-0.040
 eyes-0.039
Token.
Token ID13
Feature activation+1.77e+00

Pos logit contributions
 else+0.012
str+0.011
 slice+0.011
 mel+0.011
 flavor+0.010
Neg logit contributions
arted-0.019
pered-0.019
lee-0.018
anks-0.017
aled-0.017
Token
Token ID198
Feature activation-3.01e+00

Pos logit contributions
arted+0.046
pered+0.045
lee+0.043
aled+0.040
anks+0.040
Neg logit contributions
 else-0.037
 slice-0.036
 mel-0.035
str-0.035
 flavor-0.032
Token
Token ID198
Feature activation-2.45e+00

Pos logit contributions
 slice+0.062
 mel+0.061
str+0.059
 else+0.058
ww+0.057
Neg logit contributions
pered-0.074
arted-0.073
lee-0.072
anks-0.064
aled-0.063
Token"
Token ID1
Feature activation-3.27e+00

Pos logit contributions
 slice+0.046
 mel+0.046
str+0.045
 happened+0.044
iest+0.044
Neg logit contributions
pered-0.063
arted-0.062
lee-0.058
 heads-0.053
aled-0.051
TokenWhich
Token ID13828
Feature activation-5.34e+00

Pos logit contributions
str+0.072
 else+0.070
 mel+0.065
 slice+0.064
body+0.061
Neg logit contributions
arted-0.078
pered-0.077
lee-0.069
aled-0.069
 heads-0.068
Token one
Token ID530
Feature activation-7.27e+00

Pos logit contributions
str+0.122
arest+0.117
Little+0.114
prise+0.111
aster+0.110
Neg logit contributions
 shoulders-0.096
 heads-0.095
 eyes-0.091
 arms-0.091
arted-0.090
Token should
Token ID815
Feature activation-2.46e-01

Pos logit contributions
arest+0.144
aster+0.143
str+0.142
Little+0.140
ww+0.135
Neg logit contributions
 heads-0.146
arted-0.137
am-0.137
lee-0.131
 spirits-0.124
Token we
Token ID356
Feature activation-6.84e-01

Pos logit contributions
str+0.005
ww+0.004
w+0.004
 else+0.004
 slice+0.004
Neg logit contributions
pered-0.006
lee-0.006
arted-0.006
 heads-0.006
anks-0.005
Token get
Token ID651
Feature activation+4.36e-01

Pos logit contributions
ww+0.016
str+0.015
iest+0.015
w+0.014
 slice+0.013
Neg logit contributions
 heads-0.014
pered-0.014
lee-0.013
anks-0.013
arted-0.013
Token,
Token ID11
Feature activation-2.87e+00

Pos logit contributions
arted+0.011
lee+0.011
pered+0.011
anks+0.010
aled+0.010
Neg logit contributions
str-0.009
ww-0.007
body-0.007
Little-0.007
lly-0.007
Token Ben
Token ID3932
Feature activation-1.71e+00

Pos logit contributions
str+0.083
 else+0.075
w+0.074
arest+0.073
ww+0.072
Neg logit contributions
lee-0.050
arted-0.049
pered-0.048
 shoulders-0.045
 heads-0.045
Token his
Token ID465
Feature activation+4.18e-01

Pos logit contributions
 else+0.021
str+0.021
 slice+0.021
 mel+0.020
ww+0.019
Neg logit contributions
arted-0.025
pered-0.025
lee-0.023
anks-0.022
aled-0.021
Token mom
Token ID1995
Feature activation-1.36e+00

Pos logit contributions
pered+0.010
arted+0.010
lee+0.009
anks+0.009
 heads+0.008
Neg logit contributions
 else-0.010
str-0.009
 slice-0.009
 mel-0.009
w-0.009
Token,
Token ID11
Feature activation-2.72e+00

Pos logit contributions
str+0.023
 else+0.023
 slice+0.022
 mel+0.021
ww+0.021
Neg logit contributions
arted-0.039
pered-0.039
lee-0.038
anks-0.036
aled-0.034
Token "
Token ID366
Feature activation-1.51e+00

Pos logit contributions
 else+0.080
str+0.080
 slice+0.079
w+0.079
ww+0.078
Neg logit contributions
pered-0.045
arted-0.043
lee-0.040
anks-0.033
 shoulders-0.033
TokenWhich
Token ID13828
Feature activation-4.74e+00

Pos logit contributions
str+0.033
 else+0.031
ww+0.028
arest+0.028
 happened+0.027
Neg logit contributions
arted-0.038
pered-0.036
lee-0.033
 heads-0.032
aled-0.032
Token one
Token ID530
Feature activation-7.19e+00

Pos logit contributions
arest+0.092
str+0.091
Little+0.089
he+0.086
keys+0.083
Neg logit contributions
 shoulders-0.096
 eyes-0.094
am-0.093
 heads-0.088
 arms-0.088
Token do
Token ID466
Feature activation-2.31e+00

Pos logit contributions
arest+0.141
str+0.138
aster+0.137
Little+0.135
ww+0.131
Neg logit contributions
 heads-0.136
am-0.133
arted-0.132
anks-0.128
 spirits-0.125
Token I
Token ID314
Feature activation+1.55e-01

Pos logit contributions
str+0.038
 slice+0.036
 mel+0.035
w+0.033
Little+0.031
Neg logit contributions
anks-0.054
pered-0.053
am-0.053
lee-0.053
 commands-0.051
Token get
Token ID651
Feature activation+6.84e-01

Pos logit contributions
pered+0.003
anks+0.003
lee+0.003
arted+0.003
 heads+0.003
Neg logit contributions
str-0.004
ww-0.004
 slice-0.003
arest-0.003
 else-0.003
Token?"
Token ID1701
Feature activation+4.61e-01

Pos logit contributions
arted+0.017
pered+0.017
lee+0.017
anks+0.016
aled+0.015
Neg logit contributions
str-0.014
 else-0.012
 slice-0.012
ww-0.012
body-0.011
Token She
Token ID1375
Feature activation-7.11e-02

Pos logit contributions
pered+0.010
arted+0.010
lee+0.009
anks+0.008
aled+0.008
Neg logit contributions
 else-0.012
 mel-0.012
 slice-0.012
str-0.011
 flavor-0.011
Token asked
Token ID1965
Feature activation-7.81e-01

Pos logit contributions
arted+0.002
pered+0.002
lee+0.002
anks+0.002
aled+0.002
Neg logit contributions
 else-0.001
 slice-0.001
str-0.001
 mel-0.001
 flavor-0.001
Token Anna
Token ID11735
Feature activation+6.47e-02

Pos logit contributions
 else+0.015
 slice+0.014
 mel+0.014
str+0.014
 jam+0.013
Neg logit contributions
arted-0.022
pered-0.021
lee-0.020
anks-0.019
aled-0.019
Token,
Token ID11
Feature activation-1.72e+00

Pos logit contributions
arted+0.002
pered+0.002
lee+0.002
anks+0.002
aled+0.002
Neg logit contributions
 else-0.001
 slice-0.001
str-0.001
 mel-0.001
 flavor-0.001
Token "
Token ID366
Feature activation-4.80e-01

Pos logit contributions
 else+0.045
str+0.044
 slice+0.044
 mel+0.043
ww+0.041
Neg logit contributions
pered-0.035
arted-0.035
lee-0.032
anks-0.030
aled-0.029
TokenWhich
Token ID13828
Feature activation-4.53e+00

Pos logit contributions
str+0.010
 mel+0.009
 else+0.009
 slice+0.009
 delicious+0.009
Neg logit contributions
pered-0.013
arted-0.012
aled-0.011
 heads-0.011
 commands-0.011
Token one
Token ID530
Feature activation-7.16e+00

Pos logit contributions
str+0.095
 slice+0.089
 mel+0.084
arest+0.083
 else+0.082
Neg logit contributions
pered-0.100
arted-0.097
 shoulders-0.096
 heads-0.093
anks-0.092
Token is
Token ID318
Feature activation-6.07e-01

Pos logit contributions
str+0.140
aster+0.137
Little+0.136
arest+0.136
 slice+0.135
Neg logit contributions
 heads-0.147
arted-0.141
am-0.140
anks-0.133
lee-0.131
Token your
Token ID534
Feature activation+1.37e+00

Pos logit contributions
str+0.011
 slice+0.011
 else+0.011
 mel+0.010
w+0.010
Neg logit contributions
pered-0.017
arted-0.017
lee-0.016
aled-0.015
anks-0.015
Token favorite
Token ID4004
Feature activation-2.41e+00

Pos logit contributions
pered+0.031
arted+0.031
lee+0.030
anks+0.028
 heads+0.028
Neg logit contributions
str-0.031
 else-0.031
 slice-0.030
 mel-0.029
ww-0.028
Token?"
Token ID1701
Feature activation+1.03e+00

Pos logit contributions
str+0.043
 else+0.039
 slice+0.037
arest+0.036
aster+0.035
Neg logit contributions
lee-0.065
pered-0.064
arted-0.064
anks-0.063
 shoulders-0.062
Token
Token ID198
Feature activation-2.20e+00

Pos logit contributions
arted+0.034
pered+0.033
lee+0.032
anks+0.030
aled+0.030
Neg logit contributions
 else-0.015
 slice-0.014
 mel-0.013
str-0.013
 jam-0.012
Token animal
Token ID5044
Feature activation-2.78e+00

Pos logit contributions
str+0.110
 happened+0.100
Little+0.094
aster+0.094
w+0.092
Neg logit contributions
pered-0.092
anks-0.087
arted-0.085
 commands-0.085
 shoulders-0.084
Token to
Token ID284
Feature activation-2.89e-02

Pos logit contributions
str+0.056
 slice+0.052
 mel+0.050
 else+0.049
aster+0.049
Neg logit contributions
pered-0.072
arted-0.069
anks-0.066
lee-0.066
 heads-0.063
Token feed
Token ID3745
Feature activation-2.89e+00

Pos logit contributions
str+0.001
 else+0.001
 happened+0.001
ww+0.001
w+0.001
Neg logit contributions
pered-0.001
 commands-0.001
arted-0.001
aled-0.001
 heads-0.001
Token.
Token ID13
Feature activation+9.53e-01

Pos logit contributions
str+0.059
 slice+0.057
 else+0.056
arest+0.054
 mel+0.053
Neg logit contributions
pered-0.070
arted-0.069
lee-0.065
anks-0.064
 heads-0.063
Token Which
Token ID9022
Feature activation-4.78e+00

Pos logit contributions
arted+0.022
pered+0.021
lee+0.020
anks+0.018
aled+0.018
Neg logit contributions
 else-0.023
 slice-0.023
 mel-0.022
str-0.021
 flavor-0.021
Token one
Token ID530
Feature activation-7.13e+00

Pos logit contributions
str+0.114
 else+0.109
arest+0.109
he+0.108
pron+0.104
Neg logit contributions
 heads-0.084
 commands-0.080
 shoulders-0.080
 ears-0.078
sey-0.077
Token do
Token ID466
Feature activation-2.04e+00

Pos logit contributions
str+0.143
Little+0.142
arest+0.137
ha+0.131
 slice+0.128
Neg logit contributions
 heads-0.129
am-0.127
 requests-0.117
 head-0.117
anks-0.115
Token you
Token ID345
Feature activation+7.59e-02

Pos logit contributions
str+0.038
 slice+0.035
 mel+0.035
he+0.030
ww+0.030
Neg logit contributions
pered-0.050
anks-0.048
 commands-0.047
lee-0.046
arted-0.046
Token want
Token ID765
Feature activation-1.81e+00

Pos logit contributions
pered+0.001
arted+0.001
 heads+0.001
anks+0.001
lee+0.001
Neg logit contributions
str-0.002
ww-0.002
 else-0.002
 mel-0.002
 slice-0.002
Token?"
Token ID1701
Feature activation+2.22e-01

Pos logit contributions
str+0.031
 else+0.027
 slice+0.025
arest+0.025
ww+0.024
Neg logit contributions
anks-0.049
pered-0.049
arted-0.046
lee-0.045
stick-0.044
Token
Token ID198
Feature activation-2.97e+00

Pos logit contributions
arted+0.006
pered+0.006
lee+0.005
anks+0.005
aled+0.005
Neg logit contributions
 else-0.005
str-0.005
 mel-0.005
 slice-0.005
ww-0.004
Token He
Token ID679
Feature activation-3.99e-01

Pos logit contributions
arted+0.020
pered+0.020
lee+0.019
anks+0.018
aled+0.017
Neg logit contributions
 else-0.023
 mel-0.022
 slice-0.022
str-0.022
 flavor-0.020
Token said
Token ID531
Feature activation-2.00e-01

Pos logit contributions
str+0.008
 else+0.007
 slice+0.007
 mel+0.007
ww+0.007
Neg logit contributions
arted-0.011
pered-0.010
lee-0.010
aled-0.009
 heads-0.009
Token,
Token ID11
Feature activation-1.75e+00

Pos logit contributions
str+0.003
 else+0.003
 slice+0.003
 mel+0.003
w+0.003
Neg logit contributions
pered-0.006
arted-0.006
lee-0.006
anks-0.005
aled-0.005
Token "
Token ID366
Feature activation-9.82e-01

Pos logit contributions
 else+0.038
 slice+0.037
 mel+0.037
str+0.036
 delicious+0.033
Neg logit contributions
arted-0.044
pered-0.043
lee-0.040
anks-0.040
aled-0.039
TokenWhich
Token ID13828
Feature activation-4.47e+00

Pos logit contributions
 else+0.018
str+0.017
 slice+0.016
 mel+0.016
 delicious+0.015
Neg logit contributions
arted-0.028
pered-0.028
lee-0.026
aled-0.025
anks-0.025
Token one
Token ID530
Feature activation-7.12e+00

Pos logit contributions
str+0.113
w+0.098
arest+0.098
ww+0.095
 mel+0.094
Neg logit contributions
 shoulders-0.086
 heads-0.081
 eyes-0.081
sey-0.078
 commands-0.078
Token should
Token ID815
Feature activation-4.55e-01

Pos logit contributions
str+0.148
aster+0.145
arest+0.138
 slice+0.137
Little+0.135
Neg logit contributions
 heads-0.137
lee-0.128
arted-0.128
 shoulders-0.126
 spirits-0.126
Token I
Token ID314
Feature activation-4.90e-01

Pos logit contributions
str+0.009
 mel+0.008
 else+0.008
 slice+0.008
w+0.008
Neg logit contributions
pered-0.011
lee-0.011
arted-0.011
 heads-0.010
 shoulders-0.010
Token take
Token ID1011
Feature activation+1.50e+00

Pos logit contributions
str+0.011
ww+0.011
w+0.010
 mel+0.010
 slice+0.010
Neg logit contributions
pered-0.010
lee-0.010
anks-0.010
 heads-0.009
arted-0.009
Token?"
Token ID1701
Feature activation+5.77e-01

Pos logit contributions
pered+0.040
arted+0.040
lee+0.039
anks+0.037
aled+0.037
Neg logit contributions
str-0.029
 slice-0.027
 else-0.026
 mel-0.026
arest-0.023
Token
Token ID198
Feature activation-2.54e+00

Pos logit contributions
arted+0.015
pered+0.015
lee+0.014
anks+0.013
aled+0.013
Neg logit contributions
 else-0.012
 slice-0.011
 mel-0.011
str-0.011
 flavor-0.010
Token read
Token ID1100
Feature activation-1.24e+00

Pos logit contributions
pered+0.001
arted+0.001
lee+0.001
 heads+0.001
anks+0.001
Neg logit contributions
str-0.001
 else-0.001
 mel-0.001
ww-0.001
 slice-0.001
Token your
Token ID534
Feature activation+2.76e+00

Pos logit contributions
str+0.030
body+0.025
arest+0.024
ww+0.023
 mel+0.023
Neg logit contributions
pered-0.027
 shoulders-0.027
lee-0.027
 heads-0.026
arted-0.026
Token book
Token ID1492
Feature activation-5.69e-01

Pos logit contributions
pered+0.074
arted+0.073
lee+0.071
anks+0.068
 heads+0.067
Neg logit contributions
str-0.053
 else-0.052
 mel-0.049
 slice-0.049
ww-0.044
Token.
Token ID13
Feature activation+2.78e+00

Pos logit contributions
str+0.013
 mel+0.012
body+0.012
 slice+0.012
aster+0.011
Neg logit contributions
pered-0.013
arted-0.012
anks-0.012
 shoulders-0.012
lee-0.012
Token Which
Token ID9022
Feature activation-5.17e+00

Pos logit contributions
arted+0.070
pered+0.064
lee+0.063
 heads+0.056
aled+0.055
Neg logit contributions
 slice-0.063
 else-0.062
 mel-0.059
 jam-0.057
aster-0.056
Token one
Token ID530
Feature activation-7.11e+00

Pos logit contributions
str+0.114
arest+0.109
 else+0.103
he+0.101
keys+0.100
Neg logit contributions
 shoulders-0.106
 eyes-0.104
 heads-0.102
 ears-0.100
 arms-0.100
Token do
Token ID466
Feature activation-2.61e+00

Pos logit contributions
str+0.141
arest+0.134
Little+0.128
ha+0.128
ww+0.126
Neg logit contributions
 heads-0.132
 requests-0.128
anks-0.127
am-0.125
arted-0.125
Token you
Token ID345
Feature activation+1.20e-01

Pos logit contributions
str+0.052
 mel+0.047
 slice+0.046
looking+0.043
ww+0.041
Neg logit contributions
pered-0.060
anks-0.059
lee-0.059
 shoulders-0.059
 heads-0.057
Token want
Token ID765
Feature activation-2.05e+00

Pos logit contributions
pered+0.003
lee+0.002
arted+0.002
anks+0.002
 heads+0.002
Neg logit contributions
str-0.003
 else-0.003
ww-0.003
 mel-0.003
 slice-0.003
Token?"
Token ID1701
Feature activation-1.03e+00

Pos logit contributions
str+0.043
ww+0.034
arest+0.033
 slice+0.032
 else+0.032
Neg logit contributions
anks-0.049
pered-0.048
lee-0.046
arted-0.046
 shoulders-0.045
Token
Token ID198
Feature activation-1.89e+00

Pos logit contributions
str+0.021
 slice+0.020
 mel+0.020
ww+0.020
 else+0.020
Neg logit contributions
pered-0.025
arted-0.025
lee-0.025
anks-0.022
aled-0.022
Token He
Token ID679
Feature activation-2.01e-01

Pos logit contributions
arted+0.048
pered+0.046
lee+0.044
anks+0.040
 heads+0.039
Neg logit contributions
 else-0.061
 slice-0.059
 mel-0.058
str-0.057
 flavor-0.054
Token said
Token ID531
Feature activation+2.68e-01

Pos logit contributions
str+0.004
 slice+0.003
 else+0.003
 mel+0.003
ww+0.003
Neg logit contributions
arted-0.006
pered-0.006
lee-0.005
 heads-0.005
aled-0.005
Token,
Token ID11
Feature activation-1.85e+00

Pos logit contributions
arted+0.009
pered+0.009
lee+0.008
anks+0.008
aled+0.008
Neg logit contributions
 else-0.004
 slice-0.004
 mel-0.004
str-0.004
 flavor-0.003
Token "
Token ID366
Feature activation-7.57e-01

Pos logit contributions
 else+0.041
 slice+0.040
 mel+0.040
str+0.039
 jam+0.035
Neg logit contributions
arted-0.046
pered-0.045
lee-0.042
anks-0.041
aled-0.041
TokenWhich
Token ID13828
Feature activation-4.15e+00

Pos logit contributions
 else+0.013
str+0.013
 slice+0.013
 mel+0.012
 delicious+0.011
Neg logit contributions
arted-0.022
pered-0.022
lee-0.020
aled-0.020
anks-0.019
Token one
Token ID530
Feature activation-7.07e+00

Pos logit contributions
str+0.104
Little+0.089
ww+0.089
 mel+0.087
w+0.087
Neg logit contributions
 shoulders-0.077
 eyes-0.075
 heads-0.074
pered-0.073
 commands-0.072
Token is
Token ID318
Feature activation-1.11e+00

Pos logit contributions
str+0.155
Little+0.149
aster+0.141
ww+0.140
arest+0.140
Neg logit contributions
 heads-0.128
am-0.126
anks-0.122
 shoulders-0.114
 spirits-0.112
Token it
Token ID340
Feature activation-2.15e+00

Pos logit contributions
str+0.023
w+0.020
 slice+0.020
 mel+0.020
 else+0.020
Neg logit contributions
pered-0.028
lee-0.027
arted-0.027
anks-0.026
aled-0.025
Token?"
Token ID1701
Feature activation+7.78e-01

Pos logit contributions
str+0.039
 slice+0.033
w+0.032
body+0.032
ww+0.032
Neg logit contributions
anks-0.058
pered-0.057
lee-0.055
arted-0.052
angers-0.051
Token Lily
Token ID20037
Feature activation+1.08e+00

Pos logit contributions
arted+0.023
pered+0.022
lee+0.021
anks+0.020
aled+0.020
Neg logit contributions
 else-0.014
 slice-0.013
 mel-0.013
str-0.013
 flavor-0.012
Token said
Token ID531
Feature activation+1.29e+00

Pos logit contributions
pered+0.038
arted+0.038
lee+0.035
anks+0.034
aled+0.034
Neg logit contributions
 else-0.014
 slice-0.013
 mel-0.012
str-0.012
 flavor-0.010
Token one
Token ID530
Feature activation-5.26e+00

Pos logit contributions
str+0.018
 slice+0.016
 else+0.015
arest+0.015
body+0.014
Neg logit contributions
lee-0.023
arted-0.023
pered-0.023
 shoulders-0.022
anks-0.022
Token toy
Token ID13373
Feature activation-2.76e+00

Pos logit contributions
str+0.119
 happened+0.113
 awaited+0.104
 else+0.100
prise+0.100
Neg logit contributions
 shoulders-0.104
anks-0.101
pered-0.100
 commands-0.097
 heads-0.094
Token each
Token ID1123
Feature activation-2.46e+00

Pos logit contributions
str+0.056
 slice+0.051
 else+0.050
 mel+0.047
aster+0.047
Neg logit contributions
pered-0.072
arted-0.070
anks-0.068
lee-0.066
 heads-0.063
Token.
Token ID13
Feature activation+1.55e+00

Pos logit contributions
str+0.059
prise+0.054
 else+0.054
 slice+0.053
arest+0.051
Neg logit contributions
pered-0.047
arted-0.046
sey-0.045
 heads-0.044
 shoulders-0.042
Token Which
Token ID9022
Feature activation-4.55e+00

Pos logit contributions
arted+0.038
pered+0.036
lee+0.035
anks+0.032
aled+0.031
Neg logit contributions
 else-0.036
 slice-0.035
 mel-0.034
str-0.032
 flavor-0.032
Token one
Token ID530
Feature activation-7.06e+00

Pos logit contributions
str+0.108
arest+0.099
 slice+0.099
prise+0.098
 else+0.097
Neg logit contributions
 shoulders-0.086
 heads-0.084
sey-0.080
 commands-0.078
am-0.078
Token do
Token ID466
Feature activation-2.40e+00

Pos logit contributions
Little+0.139
str+0.136
arest+0.130
aster+0.126
eh+0.124
Neg logit contributions
am-0.137
 heads-0.132
anks-0.124
 requests-0.121
 shoulders-0.116
Token you
Token ID345
Feature activation-2.51e-01

Pos logit contributions
str+0.045
 slice+0.040
 mel+0.038
ww+0.036
 else+0.036
Neg logit contributions
pered-0.060
anks-0.058
lee-0.057
 shoulders-0.057
 heads-0.056
Token want
Token ID765
Feature activation-2.12e+00

Pos logit contributions
str+0.006
ww+0.006
 else+0.005
 slice+0.005
 mel+0.005
Neg logit contributions
pered-0.005
arted-0.005
lee-0.005
anks-0.005
 heads-0.005
Token?"
Token ID1701
Feature activation-5.77e-02

Pos logit contributions
str+0.033
 else+0.028
 slice+0.027
ww+0.025
arest+0.025
Neg logit contributions
pered-0.063
arted-0.062
anks-0.062
lee-0.060
aled-0.057
Token
Token ID198
Feature activation-2.54e+00

Pos logit contributions
 else+0.001
 mel+0.001
 slice+0.001
str+0.001
 flavor+0.001
Neg logit contributions
arted-0.002
pered-0.002
lee-0.002
anks-0.001
aled-0.001
Token,
Token ID11
Feature activation-1.14e+00

Pos logit contributions
 else+0.003
str+0.003
 slice+0.003
 mel+0.003
ww+0.003
Neg logit contributions
pered-0.005
arted-0.005
lee-0.004
anks-0.004
 heads-0.004
Token or
Token ID393
Feature activation-2.55e+00

Pos logit contributions
 else+0.024
str+0.023
 slice+0.022
 mel+0.021
arest+0.020
Neg logit contributions
pered-0.029
arted-0.028
lee-0.028
 heads-0.025
 shoulders-0.025
Token white
Token ID2330
Feature activation-1.88e-01

Pos logit contributions
str+0.050
 else+0.047
 slice+0.045
 mel+0.044
w+0.042
Neg logit contributions
pered-0.068
arted-0.067
lee-0.065
 heads-0.061
 shoulders-0.060
Token.
Token ID13
Feature activation+2.86e+00

Pos logit contributions
 else+0.003
 slice+0.003
 mel+0.003
str+0.003
 flavor+0.003
Neg logit contributions
arted-0.005
pered-0.005
lee-0.005
anks-0.005
aled-0.005
Token Which
Token ID9022
Feature activation-5.02e+00

Pos logit contributions
arted+0.073
lee+0.068
pered+0.067
aned+0.062
 heads+0.060
Neg logit contributions
 else-0.061
 slice-0.060
 mel-0.059
 jam-0.056
ww-0.055
Token one
Token ID530
Feature activation-7.06e+00

Pos logit contributions
str+0.111
arest+0.101
 else+0.098
he+0.095
 slice+0.094
Neg logit contributions
 shoulders-0.107
 heads-0.105
 eyes-0.101
 arms-0.098
 ears-0.097
Token do
Token ID466
Feature activation-2.56e+00

Pos logit contributions
str+0.144
arest+0.129
Little+0.129
ww+0.124
 mel+0.123
Neg logit contributions
 heads-0.137
 requests-0.130
am-0.129
anks-0.129
arted-0.128
Token you
Token ID345
Feature activation+1.51e-01

Pos logit contributions
str+0.047
 slice+0.042
 mel+0.041
looking+0.036
ww+0.036
Neg logit contributions
pered-0.066
lee-0.064
anks-0.063
 shoulders-0.062
arted-0.061
Token want
Token ID765
Feature activation-2.04e+00

Pos logit contributions
pered+0.003
arted+0.003
lee+0.003
anks+0.003
 heads+0.003
Neg logit contributions
str-0.004
ww-0.004
 else-0.003
 mel-0.003
 slice-0.003
Token?"
Token ID1701
Feature activation+1.24e+00

Pos logit contributions
str+0.037
ww+0.029
 slice+0.029
 else+0.028
arest+0.028
Neg logit contributions
anks-0.055
pered-0.054
arted-0.052
lee-0.052
 shoulders-0.050
Token Sara
Token ID24799
Feature activation+1.44e+00

Pos logit contributions
arted+0.032
pered+0.032
lee+0.030
anks+0.028
aled+0.028
Neg logit contributions
 else-0.026
 slice-0.025
str-0.025
 mel-0.024
ww-0.022
Token wanted
Token ID2227
Feature activation-3.78e-01

Pos logit contributions
arted+0.011
pered+0.011
lee+0.011
aled+0.010
anks+0.010
Neg logit contributions
 else-0.006
str-0.006
 slice-0.006
 mel-0.006
ww-0.005
Token to
Token ID284
Feature activation+1.51e+00

Pos logit contributions
str+0.008
 slice+0.007
 else+0.007
arest+0.007
 mel+0.007
Neg logit contributions
arted-0.009
lee-0.009
pered-0.009
 heads-0.008
 shoulders-0.008
Token show
Token ID905
Feature activation+2.93e-01

Pos logit contributions
pered+0.037
lee+0.036
arted+0.036
anks+0.034
stick+0.033
Neg logit contributions
str-0.031
 else-0.029
 slice-0.028
ww-0.028
 mel-0.028
Token you
Token ID345
Feature activation-1.64e+00

Pos logit contributions
arted+0.009
pered+0.008
lee+0.008
aled+0.008
 heads+0.008
Neg logit contributions
str-0.005
 slice-0.005
 else-0.005
 mel-0.005
 flavor-0.004
Token how
Token ID703
Feature activation-2.83e+00

Pos logit contributions
str+0.030
 mel+0.027
 slice+0.026
 else+0.026
w+0.025
Neg logit contributions
lee-0.046
pered-0.045
arted-0.045
 shoulders-0.041
 heads-0.041
Token much
Token ID881
Feature activation-7.03e+00

Pos logit contributions
 slice+0.045
str+0.044
 else+0.044
 mel+0.043
 flavor+0.038
Neg logit contributions
arted-0.085
pered-0.084
lee-0.082
anks-0.075
aled-0.075
Token we
Token ID356
Feature activation+2.70e-01

Pos logit contributions
str+0.158
 slice+0.129
aster+0.126
arest+0.126
w+0.123
Neg logit contributions
 heads-0.148
 shoulders-0.148
arted-0.147
 spirits-0.143
 eyes-0.140
Token love
Token ID1842
Feature activation-1.64e+00

Pos logit contributions
arted+0.007
pered+0.007
lee+0.007
 heads+0.006
aled+0.006
Neg logit contributions
str-0.005
 slice-0.005
ww-0.005
 mel-0.005
 else-0.005
Token you
Token ID345
Feature activation-5.30e-01

Pos logit contributions
str+0.034
 slice+0.029
 else+0.028
arest+0.028
body+0.027
Neg logit contributions
lee-0.040
arted-0.039
pered-0.039
 heads-0.038
 shoulders-0.037
Token,
Token ID11
Feature activation-1.76e+00

Pos logit contributions
str+0.008
 else+0.008
 mel+0.008
 slice+0.008
aster+0.007
Neg logit contributions
pered-0.015
lee-0.015
arted-0.015
anks-0.014
 shoulders-0.014
Token Mom
Token ID11254
Feature activation-2.00e+00

Pos logit contributions
str+0.039
 else+0.037
 slice+0.036
 mel+0.036
w+0.035
Neg logit contributions
arted-0.042
pered-0.042
lee-0.040
 heads-0.037
 shoulders-0.037
Token.
Token ID13
Feature activation+1.15e+00

Pos logit contributions
 slice+0.055
 else+0.055
str+0.054
 mel+0.052
ww+0.049
Neg logit contributions
pered-0.064
arted-0.064
lee-0.063
anks-0.058
aled-0.055
Token She
Token ID1375
Feature activation+2.32e-01

Pos logit contributions
arted+0.026
pered+0.026
lee+0.024
anks+0.023
 heads+0.023
Neg logit contributions
 else-0.027
 mel-0.026
str-0.025
 slice-0.025
ww-0.023
Token told
Token ID1297
Feature activation-1.70e+00

Pos logit contributions
arted+0.007
pered+0.007
lee+0.006
aled+0.006
anks+0.006
Neg logit contributions
 else-0.004
str-0.004
 slice-0.004
 mel-0.004
ww-0.004
Token Emma
Token ID18966
Feature activation-7.15e-01

Pos logit contributions
 else+0.030
 mel+0.028
 slice+0.028
str+0.027
 jam+0.026
Neg logit contributions
pered-0.050
arted-0.049
lee-0.046
anks-0.045
aled-0.044
Token how
Token ID703
Feature activation-3.53e+00

Pos logit contributions
 else+0.010
 slice+0.010
str+0.009
 mel+0.009
 flavor+0.008
Neg logit contributions
arted-0.023
pered-0.023
lee-0.022
anks-0.021
aled-0.021
Token much
Token ID881
Feature activation-7.00e+00

Pos logit contributions
 slice+0.063
 mel+0.059
str+0.059
ww+0.054
 else+0.054
Neg logit contributions
lee-0.098
arted-0.097
pered-0.096
anks-0.084
aled-0.083
Token it
Token ID340
Feature activation-7.51e-01

Pos logit contributions
str+0.149
 slice+0.129
arest+0.127
w+0.123
aster+0.118
Neg logit contributions
arted-0.167
lee-0.163
 shoulders-0.157
pered-0.154
 heads-0.151
Token meant
Token ID4001
Feature activation-1.47e+00

Pos logit contributions
 slice+0.013
 mel+0.012
str+0.012
 else+0.012
aster+0.012
Neg logit contributions
arted-0.022
pered-0.021
lee-0.020
 heads-0.019
aled-0.019
Token to
Token ID284
Feature activation+6.39e-01

Pos logit contributions
 else+0.026
str+0.026
 slice+0.026
 mel+0.025
ww+0.023
Neg logit contributions
pered-0.041
arted-0.041
lee-0.039
anks-0.037
aled-0.036
Token her
Token ID607
Feature activation+7.66e-01

Pos logit contributions
pered+0.015
arted+0.015
lee+0.014
 heads+0.014
anks+0.013
Neg logit contributions
str-0.014
 else-0.014
 slice-0.013
 mel-0.013
ww-0.013
Token.
Token ID13
Feature activation+1.60e+00

Pos logit contributions
pered+0.018
lee+0.017
arted+0.017
anks+0.017
 heads+0.016
Neg logit contributions
str-0.017
ww-0.016
 else-0.016
w-0.016
 slice-0.016
Token Lucy
Token ID22162
Feature activation+9.18e-01

Pos logit contributions
arted+0.017
pered+0.017
lee+0.016
anks+0.015
aled+0.015
Neg logit contributions
str-0.013
 else-0.012
 slice-0.012
 mel-0.012
ww-0.011
Token presents
Token ID10969
Feature activation-1.50e+00

Pos logit contributions
pered+0.025
arted+0.024
lee+0.023
anks+0.022
sey+0.022
Neg logit contributions
str-0.017
 else-0.016
 mel-0.016
 slice-0.016
ww-0.015
Token to
Token ID284
Feature activation+4.57e-02

Pos logit contributions
str+0.036
ww+0.033
body+0.032
 else+0.032
 mel+0.032
Neg logit contributions
pered-0.034
arted-0.033
lee-0.032
anks-0.031
 shoulders-0.029
Token show
Token ID905
Feature activation+1.80e-02

Pos logit contributions
pered+0.001
arted+0.001
lee+0.001
anks+0.001
aled+0.001
Neg logit contributions
str-0.001
 slice-0.001
 mel-0.001
 else-0.001
ww-0.001
Token how
Token ID703
Feature activation-3.41e+00

Pos logit contributions
arted+0.000
pered+0.000
lee+0.000
anks+0.000
aled+0.000
Neg logit contributions
 else-0.000
 slice-0.000
str-0.000
 mel-0.000
 flavor-0.000
Token much
Token ID881
Feature activation-7.00e+00

Pos logit contributions
 else+0.054
 mel+0.049
str+0.049
 slice+0.048
 delicious+0.044
Neg logit contributions
arted-0.108
pered-0.108
lee-0.101
aled-0.098
anks-0.098
Token he
Token ID339
Feature activation-3.58e-01

Pos logit contributions
str+0.141
 slice+0.126
 mel+0.118
w+0.116
arest+0.115
Neg logit contributions
arted-0.182
pered-0.174
lee-0.173
 shoulders-0.164
anks-0.163
Token loved
Token ID6151
Feature activation-6.86e-01

Pos logit contributions
str+0.007
 slice+0.007
 mel+0.006
 else+0.006
ww+0.006
Neg logit contributions
arted-0.010
pered-0.009
lee-0.009
aled-0.009
 heads-0.009
Token her
Token ID607
Feature activation+2.04e-01

Pos logit contributions
 else+0.013
 slice+0.012
 mel+0.012
str+0.012
 jam+0.011
Neg logit contributions
arted-0.019
pered-0.019
lee-0.018
anks-0.017
aled-0.017
Token.
Token ID13
Feature activation+2.75e+00

Pos logit contributions
pered+0.005
arted+0.005
lee+0.005
anks+0.005
aled+0.004
Neg logit contributions
str-0.004
 else-0.004
 slice-0.004
 mel-0.004
ww-0.004
Token Lucy
Token ID22162
Feature activation+7.48e-01

Pos logit contributions
arted+0.066
pered+0.066
lee+0.062
anks+0.059
 heads+0.057
Neg logit contributions
 else-0.063
 slice-0.060
str-0.058
 mel-0.058
 delicious-0.055
Token about
Token ID546
Feature activation-2.81e+00

Pos logit contributions
 else+0.012
str+0.012
 slice+0.012
 mel+0.012
ww+0.011
Neg logit contributions
arted-0.023
pered-0.023
lee-0.022
anks-0.021
aled-0.020
Token her
Token ID607
Feature activation+5.79e-01

Pos logit contributions
 else+0.050
str+0.050
 slice+0.048
 mel+0.047
ww+0.044
Neg logit contributions
arted-0.079
pered-0.078
lee-0.076
anks-0.071
aled-0.069
Token day
Token ID1110
Feature activation-1.13e+00

Pos logit contributions
lee+0.013
pered+0.013
arted+0.013
anks+0.012
 heads+0.012
Neg logit contributions
str-0.013
 else-0.013
ww-0.012
 slice-0.012
w-0.012
Token and
Token ID290
Feature activation-1.11e+00

Pos logit contributions
str+0.023
 else+0.022
 slice+0.022
 mel+0.021
ww+0.020
Neg logit contributions
arted-0.029
pered-0.029
lee-0.029
anks-0.027
aled-0.026
Token how
Token ID703
Feature activation-3.16e+00

Pos logit contributions
str+0.022
 else+0.021
 slice+0.021
 mel+0.020
ww+0.019
Neg logit contributions
arted-0.030
pered-0.029
lee-0.029
anks-0.026
aled-0.026
Token much
Token ID881
Feature activation-6.98e+00

Pos logit contributions
 else+0.044
 slice+0.041
str+0.041
 mel+0.040
 flavor+0.034
Neg logit contributions
arted-0.105
pered-0.105
lee-0.100
anks-0.095
aled-0.095
Token she
Token ID673
Feature activation-2.61e-01

Pos logit contributions
str+0.144
 slice+0.128
arest+0.124
w+0.118
 mel+0.115
Neg logit contributions
arted-0.174
lee-0.169
pered-0.161
anks-0.161
 shoulders-0.157
Token loved
Token ID6151
Feature activation-9.02e-01

Pos logit contributions
str+0.005
 slice+0.005
 mel+0.005
ww+0.004
 else+0.004
Neg logit contributions
lee-0.007
arted-0.007
pered-0.007
aled-0.006
anks-0.006
Token her
Token ID607
Feature activation+1.70e-01

Pos logit contributions
 else+0.016
 mel+0.015
 slice+0.015
 jam+0.014
str+0.014
Neg logit contributions
arted-0.027
pered-0.027
lee-0.024
aled-0.024
anks-0.024
Token.
Token ID13
Feature activation+2.25e+00

Pos logit contributions
pered+0.004
arted+0.004
lee+0.004
anks+0.004
aled+0.004
Neg logit contributions
 else-0.004
str-0.003
 slice-0.003
 mel-0.003
ww-0.003
Token When
Token ID1649
Feature activation-4.87e+00

Pos logit contributions
pered+0.054
arted+0.050
 breaths+0.047
 heads+0.044
aled+0.044
Neg logit contributions
 mel-0.052
 slice-0.051
 else-0.051
 that-0.049
str-0.049
Token and
Token ID290
Feature activation-6.24e-01

Pos logit contributions
arted+0.025
pered+0.025
lee+0.024
anks+0.022
aled+0.021
Neg logit contributions
 else-0.018
str-0.018
 slice-0.018
 mel-0.018
ww-0.016
Token said
Token ID531
Feature activation-2.28e-01

Pos logit contributions
str+0.012
ww+0.012
body+0.011
arest+0.011
 slice+0.011
Neg logit contributions
lee-0.017
arted-0.017
pered-0.016
 heads-0.015
 shoulders-0.015
Token,
Token ID11
Feature activation-1.30e+00

Pos logit contributions
 else+0.004
 slice+0.003
str+0.003
 mel+0.003
 flavor+0.003
Neg logit contributions
arted-0.007
pered-0.007
lee-0.007
anks-0.006
aled-0.006
Token "
Token ID366
Feature activation-7.17e-01

Pos logit contributions
 else+0.027
 mel+0.026
 slice+0.026
str+0.025
 that+0.023
Neg logit contributions
arted-0.034
pered-0.033
anks-0.031
aled-0.031
lee-0.031
TokenWhich
Token ID13828
Feature activation-4.51e+00

Pos logit contributions
 else+0.013
str+0.013
 slice+0.012
 mel+0.012
 delicious+0.011
Neg logit contributions
arted-0.020
pered-0.020
lee-0.019
aled-0.018
anks-0.018
Token one
Token ID530
Feature activation-6.98e+00

Pos logit contributions
str+0.103
 slice+0.097
aster+0.093
 else+0.091
arest+0.090
Neg logit contributions
pered-0.090
lee-0.089
arted-0.089
 shoulders-0.089
stick-0.088
Token should
Token ID815
Feature activation-2.99e-01

Pos logit contributions
aster+0.136
 slice+0.135
str+0.133
 loaf+0.129
arest+0.126
Neg logit contributions
arted-0.140
 heads-0.139
lee-0.136
anks-0.133
 spirits-0.131
Token I
Token ID314
Feature activation-1.95e-01

Pos logit contributions
str+0.005
 else+0.005
 slice+0.005
 mel+0.005
ww+0.004
Neg logit contributions
pered-0.008
arted-0.008
lee-0.008
anks-0.008
 heads-0.007
Token pick
Token ID2298
Feature activation+9.16e-02

Pos logit contributions
str+0.004
ww+0.004
 slice+0.004
 mel+0.004
 jam+0.004
Neg logit contributions
lee-0.004
pered-0.004
anks-0.004
arted-0.004
 heads-0.004
Token?"
Token ID1701
Feature activation+1.09e+00

Pos logit contributions
arted+0.002
pered+0.002
lee+0.002
anks+0.002
aled+0.002
Neg logit contributions
str-0.002
 slice-0.002
 else-0.002
 mel-0.001
aster-0.001
Token
Token ID198
Feature activation-1.63e+00

Pos logit contributions
pered+0.035
arted+0.035
lee+0.032
 heads+0.032
aled+0.032
Neg logit contributions
 else-0.016
 mel-0.015
 slice-0.015
str-0.014
aster-0.013
Token his
Token ID465
Feature activation+5.42e-01

Pos logit contributions
 else+0.041
str+0.041
 slice+0.040
 mel+0.039
ww+0.036
Neg logit contributions
arted-0.051
pered-0.050
lee-0.048
anks-0.045
aled-0.044
Token friends
Token ID2460
Feature activation-1.96e+00

Pos logit contributions
arted+0.014
pered+0.014
aled+0.013
lee+0.012
anks+0.012
Neg logit contributions
 else-0.013
 mel-0.011
 slice-0.011
 delicious-0.011
str-0.011
Token back
Token ID736
Feature activation+7.41e-01

Pos logit contributions
str+0.039
 slice+0.039
 else+0.038
 mel+0.038
ww+0.035
Neg logit contributions
arted-0.050
pered-0.050
lee-0.048
anks-0.044
aled-0.044
Token home
Token ID1363
Feature activation-9.60e-01

Pos logit contributions
arted+0.020
pered+0.020
lee+0.019
anks+0.018
aled+0.018
Neg logit contributions
 else-0.015
 slice-0.014
str-0.014
 mel-0.014
 flavor-0.012
Token how
Token ID703
Feature activation-3.23e+00

Pos logit contributions
 else+0.018
 slice+0.017
 mel+0.016
str+0.016
 flavor+0.015
Neg logit contributions
arted-0.028
pered-0.027
lee-0.026
aled-0.025
anks-0.024
Token much
Token ID881
Feature activation-6.95e+00

Pos logit contributions
 else+0.050
str+0.044
 mel+0.042
 slice+0.041
 delicious+0.039
Neg logit contributions
pered-0.107
arted-0.106
lee-0.099
anks-0.098
aled-0.097
Token fun
Token ID1257
Feature activation-3.66e+00

Pos logit contributions
str+0.135
 slice+0.126
w+0.118
arest+0.115
 mel+0.111
Neg logit contributions
arted-0.183
lee-0.177
pered-0.173
anks-0.162
aled-0.160
Token he
Token ID339
Feature activation-4.52e-01

Pos logit contributions
str+0.058
 slice+0.056
 else+0.053
 mel+0.051
w+0.048
Neg logit contributions
arted-0.111
pered-0.108
lee-0.108
anks-0.100
aled-0.098
Token had
Token ID550
Feature activation+5.08e-01

Pos logit contributions
 slice+0.008
 mel+0.008
str+0.008
aster+0.007
 jam+0.007
Neg logit contributions
arted-0.013
pered-0.012
lee-0.012
aled-0.011
sey-0.011
Token.
Token ID13
Feature activation+2.11e+00

Pos logit contributions
arted+0.012
pered+0.012
lee+0.012
anks+0.011
aled+0.011
Neg logit contributions
str-0.011
 slice-0.011
 mel-0.010
ww-0.010
 else-0.010
Token<|endoftext|>
Token ID50256
Feature activation-1.69e+00

Pos logit contributions
arted+0.068
pered+0.068
lee+0.065
anks+0.061
aled+0.061
Neg logit contributions
 else-0.031
str-0.029
 slice-0.029
 mel-0.028
 flavor-0.024
Token wanted
Token ID2227
Feature activation-8.03e-01

Pos logit contributions
 else+0.004
 mel+0.004
str+0.004
 slice+0.004
 delicious+0.003
Neg logit contributions
arted-0.008
pered-0.008
lee-0.007
anks-0.007
aled-0.007
Token to
Token ID284
Feature activation+9.12e-01

Pos logit contributions
str+0.016
 else+0.016
 slice+0.015
 mel+0.015
ww+0.014
Neg logit contributions
arted-0.021
pered-0.021
lee-0.020
anks-0.018
aled-0.018
Token show
Token ID905
Feature activation+6.84e-01

Pos logit contributions
pered+0.023
arted+0.023
lee+0.023
anks+0.021
aled+0.021
Neg logit contributions
 else-0.018
str-0.018
 slice-0.017
 mel-0.017
w-0.016
Token everyone
Token ID2506
Feature activation-3.60e+00

Pos logit contributions
pered+0.017
arted+0.017
lee+0.016
anks+0.015
aled+0.015
Neg logit contributions
 else-0.015
 slice-0.014
 mel-0.014
str-0.013
 jam-0.013
Token how
Token ID703
Feature activation-3.12e+00

Pos logit contributions
str+0.079
 slice+0.077
 mel+0.074
w+0.071
ww+0.070
Neg logit contributions
arted-0.086
lee-0.084
pered-0.083
anks-0.073
sey-0.073
Token much
Token ID881
Feature activation-6.95e+00

Pos logit contributions
 else+0.047
str+0.040
 mel+0.040
 slice+0.039
 delicious+0.036
Neg logit contributions
pered-0.105
arted-0.104
anks-0.096
lee-0.096
aled-0.095
Token she
Token ID673
Feature activation-3.92e-01

Pos logit contributions
str+0.142
 slice+0.134
 mel+0.124
aster+0.120
w+0.119
Neg logit contributions
arted-0.174
lee-0.166
pered-0.162
 shoulders-0.159
 heads-0.158
Token loved
Token ID6151
Feature activation-3.42e-01

Pos logit contributions
 else+0.006
 slice+0.006
str+0.006
 mel+0.006
ww+0.005
Neg logit contributions
arted-0.012
pered-0.012
lee-0.011
aled-0.011
anks-0.011
Token them
Token ID606
Feature activation-1.37e+00

Pos logit contributions
 else+0.007
 mel+0.006
 slice+0.006
str+0.006
 jam+0.005
Neg logit contributions
arted-0.010
pered-0.010
lee-0.009
aled-0.009
anks-0.009
Token.
Token ID13
Feature activation+1.24e+00

Pos logit contributions
 else+0.027
 slice+0.026
str+0.026
 mel+0.026
 flavor+0.023
Neg logit contributions
arted-0.037
pered-0.036
lee-0.035
anks-0.033
aled-0.032
Token
Token ID198
Feature activation-3.53e+00

Pos logit contributions
pered+0.036
arted+0.036
lee+0.034
anks+0.033
aled+0.032
Neg logit contributions
 else-0.023
 slice-0.021
str-0.021
 mel-0.021
 flavor-0.019
Token John
Token ID1757
Feature activation+1.01e+00

Pos logit contributions
arted+0.015
pered+0.015
lee+0.014
anks+0.013
aled+0.013
Neg logit contributions
 else-0.014
 slice-0.014
str-0.014
 mel-0.013
ww-0.013
Token smiled
Token ID13541
Feature activation+3.55e+00

Pos logit contributions
pered+0.030
arted+0.030
lee+0.028
anks+0.027
aled+0.027
Neg logit contributions
 else-0.017
 slice-0.016
str-0.016
 mel-0.016
 flavor-0.014
Token,
Token ID11
Feature activation-2.55e-01

Pos logit contributions
arted+0.120
pered+0.116
aled+0.115
anks+0.113
lee+0.112
Neg logit contributions
 else-0.044
 slice-0.042
 mel-0.040
 that-0.039
 delicious-0.039
Token knowing
Token ID6970
Feature activation-3.64e+00

Pos logit contributions
 else+0.005
 slice+0.005
 mel+0.005
str+0.005
 flavor+0.004
Neg logit contributions
arted-0.007
pered-0.007
lee-0.006
anks-0.006
aled-0.006
Token how
Token ID703
Feature activation-2.79e+00

Pos logit contributions
w+0.066
str+0.063
ww+0.061
arest+0.060
uch+0.059
Neg logit contributions
lee-0.103
arted-0.097
pered-0.097
 shoulders-0.088
 eyes-0.088
Token much
Token ID881
Feature activation-6.93e+00

Pos logit contributions
 slice+0.046
str+0.046
 mel+0.045
 else+0.044
w+0.041
Neg logit contributions
pered-0.080
arted-0.080
lee-0.079
anks-0.070
aled-0.069
Token his
Token ID465
Feature activation+1.08e+00

Pos logit contributions
str+0.149
arest+0.135
 slice+0.134
w+0.130
prise+0.124
Neg logit contributions
arted-0.155
 shoulders-0.148
lee-0.144
 breaths-0.143
 spirits-0.141
Token sister
Token ID6621
Feature activation-1.09e+00

Pos logit contributions
pered+0.024
arted+0.024
lee+0.023
anks+0.021
aled+0.021
Neg logit contributions
 else-0.026
str-0.025
 slice-0.025
 mel-0.025
ww-0.023
Token wanted
Token ID2227
Feature activation-8.18e-01

Pos logit contributions
 slice+0.018
str+0.018
 mel+0.018
 else+0.018
ww+0.016
Neg logit contributions
arted-0.031
pered-0.031
lee-0.030
anks-0.028
 heads-0.028
Token one
Token ID530
Feature activation-3.94e+00

Pos logit contributions
str+0.017
 else+0.017
 slice+0.016
arest+0.016
ww+0.016
Neg logit contributions
arted-0.020
pered-0.020
lee-0.019
anks-0.017
 commands-0.017
Token.
Token ID13
Feature activation+1.42e+00

Pos logit contributions
str+0.098
w+0.093
prise+0.089
lly+0.089
ww+0.088
Neg logit contributions
pered-0.067
arted-0.065
 shoulders-0.063
 commands-0.062
anks-0.059
Token were
Token ID547
Feature activation+2.92e+00

Pos logit contributions
str+0.009
 else+0.009
 mel+0.009
 slice+0.009
ww+0.008
Neg logit contributions
arted-0.014
pered-0.014
lee-0.014
aled-0.013
anks-0.013
Token just
Token ID655
Feature activation+1.39e+00

Pos logit contributions
arted+0.071
pered+0.071
lee+0.067
anks+0.062
aled+0.062
Neg logit contributions
 else-0.066
 slice-0.063
str-0.063
 mel-0.062
 flavor-0.057
Token talking
Token ID3375
Feature activation+6.53e-01

Pos logit contributions
arted+0.032
pered+0.031
lee+0.030
anks+0.027
aled+0.027
Neg logit contributions
 else-0.033
str-0.032
 slice-0.032
 mel-0.032
 flavor-0.029
Token about
Token ID546
Feature activation-2.70e+00

Pos logit contributions
arted+0.017
pered+0.017
lee+0.016
anks+0.015
aled+0.015
Neg logit contributions
str-0.013
 else-0.013
 slice-0.013
 mel-0.013
aster-0.012
Token how
Token ID703
Feature activation-3.33e+00

Pos logit contributions
str+0.050
 slice+0.046
arest+0.045
 else+0.045
w+0.044
Neg logit contributions
arted-0.071
lee-0.071
pered-0.070
 heads-0.064
 shoulders-0.064
Token much
Token ID881
Feature activation-6.92e+00

Pos logit contributions
 slice+0.049
str+0.047
 mel+0.047
 else+0.046
aster+0.041
Neg logit contributions
arted-0.103
pered-0.102
lee-0.100
anks-0.092
aled-0.091
Token fun
Token ID1257
Feature activation-3.28e+00

Pos logit contributions
str+0.141
 slice+0.122
arest+0.118
aster+0.117
w+0.116
Neg logit contributions
arted-0.160
lee-0.157
 heads-0.156
 shoulders-0.155
 spirits-0.149
Token it
Token ID340
Feature activation-2.09e+00

Pos logit contributions
str+0.058
 slice+0.052
 mel+0.049
 else+0.048
aster+0.046
Neg logit contributions
arted-0.092
lee-0.091
pered-0.091
 heads-0.085
 shoulders-0.084
Token is
Token ID318
Feature activation+1.02e+00

Pos logit contributions
 slice+0.039
 mel+0.038
str+0.038
aster+0.036
 else+0.035
Neg logit contributions
arted-0.056
pered-0.055
lee-0.053
 heads-0.051
anks-0.050
Token to
Token ID284
Feature activation+7.08e-01

Pos logit contributions
pered+0.026
arted+0.025
lee+0.024
anks+0.023
 heads+0.022
Neg logit contributions
str-0.020
 else-0.020
 slice-0.020
 mel-0.020
ww-0.018
Token play
Token ID711
Feature activation+2.41e-01

Pos logit contributions
pered+0.017
arted+0.017
lee+0.016
anks+0.015
 heads+0.015
Neg logit contributions
str-0.015
 else-0.015
 slice-0.014
 mel-0.014
ww-0.013
Token it
Token ID340
Feature activation-5.62e-01

Pos logit contributions
 else+0.030
str+0.027
 mel+0.026
 slice+0.025
ww+0.022
Neg logit contributions
pered-0.078
arted-0.078
lee-0.074
anks-0.072
aled-0.072
Token looks
Token ID3073
Feature activation+2.44e+00

Pos logit contributions
 else+0.011
 slice+0.011
str+0.011
 mel+0.011
 flavor+0.010
Neg logit contributions
arted-0.015
pered-0.015
lee-0.014
anks-0.013
aled-0.013
Token,
Token ID11
Feature activation-1.32e+00

Pos logit contributions
pered+0.070
arted+0.070
lee+0.067
anks+0.063
aled+0.061
Neg logit contributions
str-0.042
 slice-0.042
 else-0.041
 mel-0.040
aster-0.037
Token but
Token ID475
Feature activation-3.32e+00

Pos logit contributions
 else+0.022
 slice+0.020
str+0.020
 mel+0.020
 flavor+0.018
Neg logit contributions
arted-0.040
pered-0.040
lee-0.038
anks-0.036
aled-0.036
Token how
Token ID703
Feature activation-3.11e+00

Pos logit contributions
str+0.056
 slice+0.056
 else+0.052
aster+0.050
 mel+0.050
Neg logit contributions
arted-0.093
pered-0.091
lee-0.091
 heads-0.086
 shoulders-0.085
Token much
Token ID881
Feature activation-6.91e+00

Pos logit contributions
 slice+0.047
 else+0.044
str+0.044
 mel+0.044
aster+0.038
Neg logit contributions
arted-0.096
pered-0.095
lee-0.094
anks-0.087
aled-0.085
Token you
Token ID345
Feature activation+2.38e+00

Pos logit contributions
str+0.136
 slice+0.126
aster+0.113
arest+0.113
 mel+0.110
Neg logit contributions
arted-0.176
lee-0.169
pered-0.166
 heads-0.165
 shoulders-0.164
Token love
Token ID1842
Feature activation-7.83e-01

Pos logit contributions
arted+0.063
pered+0.062
lee+0.060
aled+0.057
anks+0.057
Neg logit contributions
 else-0.045
 slice-0.045
str-0.044
 mel-0.044
ww-0.042
Token it
Token ID340
Feature activation-1.63e+00

Pos logit contributions
 else+0.012
str+0.012
 slice+0.012
 mel+0.011
ww+0.010
Neg logit contributions
arted-0.024
pered-0.024
lee-0.023
anks-0.022
aled-0.022
Token."
Token ID526
Feature activation+9.76e-01

Pos logit contributions
str+0.035
 slice+0.033
 else+0.032
 mel+0.032
aster+0.030
Neg logit contributions
pered-0.040
arted-0.039
lee-0.038
anks-0.037
 shoulders-0.036
Token And
Token ID843
Feature activation-6.38e-01

Pos logit contributions
pered+0.025
arted+0.025
lee+0.023
anks+0.022
aled+0.022
Neg logit contributions
 else-0.021
str-0.020
 mel-0.020
 slice-0.020
 flavor-0.018
Token said
Token ID531
Feature activation-3.01e-01

Pos logit contributions
 slice+0.005
 else+0.005
str+0.005
 mel+0.004
aster+0.004
Neg logit contributions
arted-0.011
pered-0.011
lee-0.011
aled-0.010
 heads-0.010
Token,
Token ID11
Feature activation-1.80e+00

Pos logit contributions
 else+0.005
 slice+0.005
str+0.004
 mel+0.004
 jam+0.004
Neg logit contributions
pered-0.009
arted-0.009
lee-0.009
anks-0.008
aled-0.008
Token �
Token ID6184
Feature activation+4.31e+00

Pos logit contributions
 else+0.038
str+0.037
 mel+0.037
 slice+0.036
 that+0.033
Neg logit contributions
arted-0.046
pered-0.045
lee-0.042
anks-0.042
aled-0.041
Token�
Token ID95
Feature activation+5.11e+00

Pos logit contributions
arted+0.117
pered+0.112
lee+0.108
aled+0.101
 heads+0.099
Neg logit contributions
str-0.086
 else-0.086
 mel-0.083
 slice-0.082
ww-0.076
Token€
Token ID26391
Feature activation+3.45e+00

Pos logit contributions
arted+0.121
pered+0.115
lee+0.112
 heads+0.100
aled+0.099
Neg logit contributions
str-0.118
 slice-0.115
 mel-0.114
 else-0.114
aster-0.108
Token�
Token ID129
Feature activation+6.13e+00

Pos logit contributions
arted+0.092
pered+0.091
lee+0.087
anks+0.082
aled+0.080
Neg logit contributions
 else-0.069
 slice-0.066
 mel-0.065
str-0.065
 flavor-0.059
Token�
Token ID241
Feature activation-1.74e+00

Pos logit contributions
arted+0.137
pered+0.128
lee+0.123
angers+0.123
anks+0.121
Neg logit contributions
str-0.141
 else-0.128
 that-0.124
aster-0.124
 mel-0.124
TokenIt
Token ID1026
Feature activation-1.06e+00

Pos logit contributions
 else+0.031
 slice+0.031
str+0.030
 mel+0.030
 jam+0.027
Neg logit contributions
arted-0.048
pered-0.048
lee-0.046
anks-0.042
aled-0.042
Tokenâ
Token ID22940
Feature activation+3.20e+00

Pos logit contributions
 slice+0.019
 mel+0.019
str+0.019
 else+0.018
aster+0.018
Neg logit contributions
arted-0.029
pered-0.029
lee-0.028
sey-0.026
anks-0.026
Token€
Token ID26391
Feature activation+2.67e+00

Pos logit contributions
arted+0.074
pered+0.074
lee+0.069
aled+0.066
anks+0.065
Neg logit contributions
 else-0.076
 slice-0.072
 mel-0.071
str-0.070
 flavor-0.065
Tokenâ„¢
Token ID8151
Feature activation+2.05e+00

Pos logit contributions
pered+0.060
arted+0.059
lee+0.057
anks+0.053
aled+0.051
Neg logit contributions
 else-0.063
 slice-0.062
 mel-0.061
str-0.061
 flavor-0.056
Token friend
Token ID1545
Feature activation-2.56e+00

Pos logit contributions
arted+0.036
pered+0.036
lee+0.034
aled+0.032
anks+0.032
Neg logit contributions
 else-0.025
 mel-0.025
 slice-0.025
str-0.024
 flavor-0.022
Token!"
Token ID2474
Feature activation+1.08e+00

Pos logit contributions
str+0.053
 mel+0.047
 slice+0.047
 else+0.045
arest+0.045
Neg logit contributions
pered-0.068
arted-0.066
lee-0.061
anks-0.058
 shoulders-0.057
Token\n
Token ID198
Feature activation-2.36e+00

Pos logit contributions
arted+0.034
pered+0.034
lee+0.032
anks+0.031
aled+0.030
Neg logit contributions
 else-0.017
 slice-0.016
str-0.016
 mel-0.016
 jam-0.014
Token\n
Token ID198
Feature activation-2.01e+00

Pos logit contributions
 slice+0.042
 else+0.042
str+0.041
 mel+0.041
 flavor+0.037
Neg logit contributions
pered-0.067
arted-0.066
lee-0.064
anks-0.059
aled-0.058
TokenShe
Token ID3347
Feature activation+9.70e-01

Pos logit contributions
 slice+0.038
str+0.037
 else+0.037
 mel+0.036
 flavor+0.035
Neg logit contributions
pered-0.055
arted-0.054
lee-0.051
 heads-0.047
aled-0.047
Token ran
Token ID4966
Feature activation+6.54e+00

Pos logit contributions
arted+0.027
pered+0.027
lee+0.026
aled+0.024
anks+0.024
Neg logit contributions
 else-0.018
str-0.018
 slice-0.017
 mel-0.017
 flavor-0.016
Token away
Token ID1497
Feature activation+1.05e+00

Pos logit contributions
lee+0.213
arted+0.198
pered+0.197
anks+0.194
sey+0.190
Neg logit contributions
 else-0.103
 that-0.087
 slice-0.086
 mel-0.083
 bite-0.078
Token from
Token ID422
Feature activation-4.62e-01

Pos logit contributions
arted+0.033
pered+0.032
lee+0.031
anks+0.030
aled+0.029
Neg logit contributions
 else-0.017
 slice-0.015
 mel-0.015
str-0.015
 jam-0.013
Token Tim
Token ID5045
Feature activation+3.06e+00

Pos logit contributions
 else+0.008
 slice+0.007
str+0.007
 mel+0.007
 flavor+0.006
Neg logit contributions
arted-0.014
pered-0.014
lee-0.013
anks-0.012
aled-0.012
Token and
Token ID290
Feature activation+5.21e-01

Pos logit contributions
arted+0.086
pered+0.085
lee+0.082
anks+0.076
aled+0.076
Neg logit contributions
 else-0.058
 slice-0.055
 mel-0.054
str-0.054
 jam-0.049
Token the
Token ID262
Feature activation+4.43e-01

Pos logit contributions
arted+0.014
pered+0.013
lee+0.013
aled+0.012
 heads+0.012
Neg logit contributions
str-0.010
 else-0.010
 slice-0.010
 mel-0.010
arest-0.010
Token they
Token ID484
Feature activation+1.03e+00

Pos logit contributions
 else+0.066
 slice+0.065
str+0.064
 mel+0.062
 flavor+0.055
Neg logit contributions
pered-0.143
arted-0.143
lee-0.136
aled-0.126
anks-0.125
Token fought
Token ID8350
Feature activation+2.33e+00

Pos logit contributions
arted+0.028
pered+0.027
lee+0.025
 heads+0.025
aled+0.025
Neg logit contributions
str-0.020
 mel-0.018
 slice-0.018
aster-0.018
 else-0.018
Token,
Token ID11
Feature activation-3.79e+00

Pos logit contributions
arted+0.072
pered+0.072
lee+0.068
anks+0.065
aled+0.065
Neg logit contributions
 else-0.036
str-0.035
 slice-0.035
 mel-0.034
ww-0.030
Token the
Token ID262
Feature activation-7.37e-01

Pos logit contributions
 else+0.057
str+0.057
 slice+0.055
 mel+0.053
ww+0.051
Neg logit contributions
arted-0.116
pered-0.116
lee-0.110
 heads-0.103
aled-0.103
Token fish
Token ID5916
Feature activation-8.71e-01

Pos logit contributions
 else+0.016
str+0.015
 slice+0.014
 mel+0.014
arest+0.013
Neg logit contributions
pered-0.019
arted-0.018
lee-0.017
 heads-0.017
anks-0.016
Token sw
Token ID1509
Feature activation+8.33e+00

Pos logit contributions
 slice+0.016
 else+0.015
str+0.015
 mel+0.015
ww+0.015
Neg logit contributions
arted-0.024
pered-0.024
lee-0.023
aled-0.021
 heads-0.021
Tokenam
Token ID321
Feature activation+4.04e+00

Pos logit contributions
arted+0.111
pered+0.106
idget+0.106
lee+0.102
 Ghosts+0.097
Neg logit contributions
w-0.241
 mel-0.233
 slice-0.233
str-0.230
lly-0.227
Token away
Token ID1497
Feature activation-1.27e-01

Pos logit contributions
lee+0.108
arted+0.107
pered+0.106
anks+0.100
aled+0.096
Neg logit contributions
 else-0.081
 slice-0.073
str-0.073
 mel-0.072
 delicious-0.067
Token.
Token ID13
Feature activation+3.12e+00

Pos logit contributions
 else+0.003
str+0.002
 slice+0.002
 mel+0.002
ww+0.002
Neg logit contributions
arted-0.003
pered-0.003
lee-0.003
anks-0.003
aled-0.003
Token The
Token ID383
Feature activation-8.40e-01

Pos logit contributions
arted+0.084
lee+0.084
pered+0.083
idget+0.078
sey+0.076
Neg logit contributions
 that-0.052
 mel-0.051
 else-0.051
 slice-0.050
�-0.048
Token dog
Token ID3290
Feature activation+1.53e+00

Pos logit contributions
 else+0.018
str+0.017
 slice+0.016
 mel+0.016
ww+0.015
Neg logit contributions
pered-0.022
arted-0.021
lee-0.020
 heads-0.019
anks-0.019
Token.
Token ID13
Feature activation+2.86e+00

Pos logit contributions
pered+0.083
aled+0.083
anks+0.082
arted+0.082
lee+0.079
Neg logit contributions
 else-0.046
 mel-0.044
 jam-0.043
 slice-0.041
 that-0.040
Token\n
Token ID198
Feature activation-2.46e+00

Pos logit contributions
arted+0.073
pered+0.073
anks+0.066
lee+0.066
 heads+0.065
Neg logit contributions
 else-0.061
str-0.056
 mel-0.055
 slice-0.054
 that-0.052
Token\n
Token ID198
Feature activation-2.02e+00

Pos logit contributions
 slice+0.047
 mel+0.046
str+0.045
 else+0.044
 flavor+0.042
Neg logit contributions
pered-0.067
arted-0.066
lee-0.064
aled-0.059
anks-0.058
TokenL
Token ID43
Feature activation+1.03e+00

Pos logit contributions
 mel+0.033
 slice+0.033
str+0.032
 else+0.031
 flavor+0.031
Neg logit contributions
pered-0.060
arted-0.059
lee-0.057
aled-0.053
 heads-0.052
Tokenily
Token ID813
Feature activation+1.89e+00

Pos logit contributions
arted+0.026
pered+0.026
lee+0.024
anks+0.023
aled+0.023
Neg logit contributions
 else-0.022
str-0.022
 slice-0.022
 mel-0.022
 flavor-0.020
Token ran
Token ID4966
Feature activation+5.91e+00

Pos logit contributions
arted+0.057
pered+0.057
lee+0.054
anks+0.052
aled+0.051
Neg logit contributions
 else-0.032
 slice-0.030
 mel-0.029
str-0.029
 jam-0.026
Token back
Token ID736
Feature activation+2.63e+00

Pos logit contributions
lee+0.188
arted+0.176
pered+0.174
anks+0.171
sey+0.167
Neg logit contributions
 else-0.096
 that-0.086
 slice-0.077
 mel-0.077
 bite-0.074
Token to
Token ID284
Feature activation+3.34e-01

Pos logit contributions
arted+0.076
pered+0.076
lee+0.073
anks+0.069
aled+0.068
Neg logit contributions
 else-0.047
 mel-0.043
 slice-0.043
str-0.042
 jam-0.038
Token the
Token ID262
Feature activation+1.78e+00

Pos logit contributions
arted+0.011
pered+0.010
lee+0.010
anks+0.010
aled+0.010
Neg logit contributions
 else-0.005
 slice-0.005
 mel-0.005
str-0.004
 jam-0.004
Token house
Token ID2156
Feature activation+9.81e-01

Pos logit contributions
pered+0.041
arted+0.041
lee+0.038
 heads+0.037
anks+0.035
Neg logit contributions
 else-0.041
str-0.040
 slice-0.039
 mel-0.038
iest-0.037
Token.
Token ID13
Feature activation+4.20e+00

Pos logit contributions
arted+0.032
pered+0.031
lee+0.030
anks+0.029
aled+0.029
Neg logit contributions
 else-0.015
 slice-0.014
 that-0.013
 mel-0.013
str-0.011
Token quickly
Token ID2952
Feature activation+1.42e+00

Pos logit contributions
 else+0.002
 slice+0.002
str+0.002
 mel+0.002
 flavor+0.001
Neg logit contributions
arted-0.003
pered-0.003
lee-0.003
anks-0.003
aled-0.003
Token grabbed
Token ID13176
Feature activation+2.03e+00

Pos logit contributions
arted+0.036
pered+0.036
lee+0.034
anks+0.032
aled+0.031
Neg logit contributions
 else-0.030
 slice-0.029
str-0.029
 mel-0.029
 flavor-0.026
Token his
Token ID465
Feature activation+2.76e+00

Pos logit contributions
arted+0.055
pered+0.055
lee+0.051
aled+0.050
sey+0.048
Neg logit contributions
str-0.042
 else-0.040
 slice-0.038
 mel-0.037
ww-0.036
Token toy
Token ID13373
Feature activation+1.69e+00

Pos logit contributions
pered+0.062
arted+0.057
 heads+0.056
 shoulders+0.056
lee+0.056
Neg logit contributions
str-0.067
 else-0.065
ww-0.062
 slice-0.060
aster-0.060
Token and
Token ID290
Feature activation+8.17e-01

Pos logit contributions
arted+0.043
pered+0.042
lee+0.040
anks+0.038
aled+0.037
Neg logit contributions
 else-0.036
 slice-0.035
 mel-0.034
str-0.034
 flavor-0.031
Token ran
Token ID4966
Feature activation+6.16e+00

Pos logit contributions
arted+0.020
pered+0.020
lee+0.019
aled+0.018
 heads+0.018
Neg logit contributions
str-0.018
 else-0.017
 slice-0.017
 mel-0.017
ww-0.016
Token outside
Token ID2354
Feature activation+8.07e-01

Pos logit contributions
lee+0.177
anks+0.168
arted+0.168
pered+0.165
anced+0.159
Neg logit contributions
 else-0.111
 that-0.102
 slice-0.092
 mel-0.090
 bite-0.087
Token with
Token ID351
Feature activation-1.53e+00

Pos logit contributions
arted+0.026
pered+0.026
lee+0.025
aled+0.025
anks+0.024
Neg logit contributions
 else-0.012
 slice-0.011
 mel-0.010
 that-0.010
str-0.009
Token mom
Token ID1995
Feature activation-8.42e-03

Pos logit contributions
 else+0.031
str+0.030
 slice+0.030
 mel+0.030
 flavor+0.027
Neg logit contributions
arted-0.040
pered-0.040
lee-0.038
anks-0.036
aled-0.035
Tokenmy
Token ID1820
Feature activation+1.35e-01

Pos logit contributions
 else+0.000
 slice+0.000
str+0.000
 mel+0.000
 flavor+0.000
Neg logit contributions
arted-0.000
pered-0.000
lee-0.000
anks-0.000
aled-0.000
Token.
Token ID13
Feature activation+2.53e+00

Pos logit contributions
arted+0.004
pered+0.004
lee+0.004
anks+0.003
aled+0.003
Neg logit contributions
 else-0.003
 slice-0.002
str-0.002
 mel-0.002
 flavor-0.002
Token and
Token ID290
Feature activation-1.23e+00

Pos logit contributions
arted+0.033
pered+0.032
lee+0.031
aled+0.030
anks+0.029
Neg logit contributions
 else-0.016
 slice-0.016
 mel-0.015
str-0.015
 delicious-0.014
Token said
Token ID531
Feature activation-2.03e-01

Pos logit contributions
str+0.022
 slice+0.021
 else+0.020
 mel+0.020
w+0.020
Neg logit contributions
arted-0.035
lee-0.034
pered-0.034
aled-0.030
anks-0.030
Token that
Token ID326
Feature activation-4.14e+00

Pos logit contributions
 else+0.003
 slice+0.002
str+0.002
 mel+0.002
 flavor+0.002
Neg logit contributions
arted-0.007
pered-0.007
lee-0.007
anks-0.006
aled-0.006
Token he
Token ID339
Feature activation+2.47e-02

Pos logit contributions
str+0.066
 slice+0.065
 else+0.064
 mel+0.060
ww+0.058
Neg logit contributions
arted-0.123
pered-0.121
lee-0.119
anks-0.111
aled-0.108
Token had
Token ID550
Feature activation+4.69e-01

Pos logit contributions
arted+0.001
pered+0.001
lee+0.001
anks+0.001
aled+0.001
Neg logit contributions
 else-0.000
 slice-0.000
str-0.000
 mel-0.000
 jam-0.000
Token been
Token ID587
Feature activation+3.91e+00

Pos logit contributions
arted+0.013
pered+0.013
lee+0.013
aled+0.012
anks+0.012
Neg logit contributions
 else-0.008
str-0.008
 slice-0.008
 mel-0.008
ww-0.007
Token a
Token ID257
Feature activation-1.07e+00

Pos logit contributions
arted+0.106
pered+0.106
lee+0.101
anks+0.094
aled+0.093
Neg logit contributions
 else-0.076
 slice-0.074
str-0.073
 mel-0.073
 flavor-0.066
Token very
Token ID845
Feature activation+9.46e-01

Pos logit contributions
w+0.019
str+0.019
 else+0.018
 slice+0.018
 mel+0.018
Neg logit contributions
pered-0.027
arted-0.026
lee-0.026
 heads-0.024
 shoulders-0.024
Token good
Token ID922
Feature activation+2.24e+00

Pos logit contributions
pered+0.026
arted+0.026
lee+0.025
anks+0.023
aled+0.023
Neg logit contributions
 slice-0.017
 else-0.017
str-0.017
 mel-0.017
 flavor-0.015
Token boy
Token ID2933
Feature activation-2.05e-01

Pos logit contributions
arted+0.061
pered+0.060
lee+0.058
aled+0.054
anks+0.054
Neg logit contributions
 else-0.044
 slice-0.043
 mel-0.042
str-0.042
 flavor-0.038
Token.
Token ID13
Feature activation+1.62e+00

Pos logit contributions
str+0.004
 slice+0.004
 mel+0.004
 else+0.004
ww+0.003
Neg logit contributions
pered-0.006
arted-0.005
lee-0.005
anks-0.005
aled-0.005
Token wanted
Token ID2227
Feature activation-3.76e-01

Pos logit contributions
str+0.010
 slice+0.010
 mel+0.010
ww+0.009
 else+0.009
Neg logit contributions
arted-0.014
lee-0.014
pered-0.014
aled-0.013
anks-0.013
Token to
Token ID284
Feature activation+9.53e-01

Pos logit contributions
str+0.006
 else+0.006
 slice+0.006
 mel+0.005
ww+0.005
Neg logit contributions
arted-0.012
pered-0.012
lee-0.011
anks-0.010
aled-0.010
Token play
Token ID711
Feature activation+1.36e-01

Pos logit contributions
arted+0.024
pered+0.024
lee+0.023
anks+0.021
aled+0.021
Neg logit contributions
str-0.020
 else-0.020
 slice-0.019
 mel-0.019
ww-0.018
Token with
Token ID351
Feature activation-1.91e+00

Pos logit contributions
pered+0.004
arted+0.004
lee+0.004
anks+0.004
aled+0.004
Neg logit contributions
str-0.002
 else-0.002
 slice-0.002
 mel-0.002
ww-0.002
Token it
Token ID340
Feature activation-1.82e+00

Pos logit contributions
 else+0.024
 slice+0.022
 mel+0.022
str+0.021
 jam+0.018
Neg logit contributions
arted-0.066
pered-0.066
lee-0.063
anks-0.060
aled-0.060
Token.
Token ID13
Feature activation+1.67e+00

Pos logit contributions
 else+0.037
str+0.036
 slice+0.036
 mel+0.035
ww+0.032
Neg logit contributions
arted-0.047
pered-0.047
lee-0.045
anks-0.042
aled-0.041
Token Sarah
Token ID10490
Feature activation-1.75e-01

Pos logit contributions
pered+0.058
pering+0.053
arted+0.053
attering+0.053
 breaths+0.053
Neg logit contributions
 mel-0.022
 else-0.022
 that-0.021
 happened-0.018
 slice-0.018
Token said
Token ID531
Feature activation-4.34e-01

Pos logit contributions
 else+0.003
 slice+0.002
str+0.002
 mel+0.002
ww+0.002
Neg logit contributions
arted-0.006
pered-0.005
lee-0.005
aled-0.005
anks-0.005
Token,
Token ID11
Feature activation-2.81e+00

Pos logit contributions
str+0.006
 slice+0.006
 else+0.006
 mel+0.006
ww+0.005
Neg logit contributions
arted-0.013
pered-0.013
lee-0.013
anks-0.012
aled-0.012
Token "
Token ID366
Feature activation-1.63e+00

Pos logit contributions
 else+0.064
 mel+0.061
 slice+0.061
str+0.060
 delicious+0.055
Neg logit contributions
arted-0.069
pered-0.068
lee-0.064
anks-0.061
aled-0.061
TokenWow
Token ID22017
Feature activation+1.16e-01

Pos logit contributions
str+0.031
 else+0.030
 slice+0.030
 mel+0.029
 jam+0.027
Neg logit contributions
arted-0.044
pered-0.042
lee-0.040
aled-0.039
anks-0.038
Token about
Token ID546
Feature activation-2.08e+00

Pos logit contributions
arted+0.041
pered+0.041
lee+0.038
anks+0.037
aled+0.037
Neg logit contributions
 else-0.024
 slice-0.021
 mel-0.021
str-0.020
 jam-0.019
Token it
Token ID340
Feature activation+6.61e-01

Pos logit contributions
 else+0.027
 slice+0.026
str+0.026
 mel+0.025
 flavor+0.021
Neg logit contributions
arted-0.069
pered-0.069
lee-0.066
anks-0.063
aled-0.063
Token.
Token ID13
Feature activation+3.94e+00

Pos logit contributions
arted+0.018
pered+0.018
lee+0.017
anks+0.016
aled+0.016
Neg logit contributions
 else-0.013
 slice-0.012
str-0.012
 mel-0.012
 flavor-0.011
Token She
Token ID1375
Feature activation+2.11e-01

Pos logit contributions
arted+0.090
pered+0.086
lee+0.086
aned+0.079
anks+0.079
Neg logit contributions
 else-0.090
 that-0.082
 jam-0.081
�-0.081
 mel-0.081
Token wanted
Token ID2227
Feature activation-6.36e-01

Pos logit contributions
arted+0.006
pered+0.006
lee+0.006
aled+0.005
 heads+0.005
Neg logit contributions
 slice-0.004
str-0.004
 else-0.004
 mel-0.004
ww-0.003
Token to
Token ID284
Feature activation+1.82e+00

Pos logit contributions
 else+0.012
 slice+0.011
str+0.011
 mel+0.011
 flavor+0.010
Neg logit contributions
arted-0.018
pered-0.018
lee-0.017
anks-0.016
aled-0.016
Token have
Token ID423
Feature activation+3.05e-01

Pos logit contributions
arted+0.053
pered+0.052
lee+0.049
anks+0.046
aled+0.046
Neg logit contributions
 else-0.034
 slice-0.032
 mel-0.031
str-0.031
 flavor-0.028
Token fun
Token ID1257
Feature activation-5.97e-01

Pos logit contributions
arted+0.010
pered+0.010
lee+0.010
anks+0.009
aled+0.009
Neg logit contributions
 else-0.004
 slice-0.004
str-0.004
 mel-0.004
 flavor-0.003
Token.
Token ID13
Feature activation+3.23e+00

Pos logit contributions
 else+0.010
 slice+0.008
 mel+0.008
str+0.008
 jam+0.008
Neg logit contributions
arted-0.019
pered-0.019
lee-0.018
aled-0.017
anks-0.017
Token She
Token ID1375
Feature activation+4.50e-01

Pos logit contributions
arted+0.067
pered+0.065
lee+0.064
anks+0.057
aled+0.056
Neg logit contributions
 else-0.088
�-0.077
 slice-0.077
 mel-0.077
 that-0.076
Token wanted
Token ID2227
Feature activation-5.83e-01

Pos logit contributions
pered+0.014
arted+0.014
lee+0.013
anks+0.013
aled+0.012
Neg logit contributions
 else-0.007
 slice-0.007
 mel-0.007
str-0.007
 flavor-0.006
Token,
Token ID11
Feature activation-2.95e+00

Pos logit contributions
 else+0.002
 slice+0.002
str+0.002
 mel+0.002
 flavor+0.001
Neg logit contributions
arted-0.003
pered-0.003
lee-0.003
anks-0.003
aled-0.003
Token please
Token ID3387
Feature activation-1.92e-01

Pos logit contributions
str+0.050
 slice+0.048
arest+0.046
 mel+0.044
 loaf+0.043
Neg logit contributions
arted-0.080
lee-0.078
pered-0.077
 heads-0.073
 shoulders-0.071
Token behave
Token ID17438
Feature activation+9.44e-01

Pos logit contributions
 else+0.004
 slice+0.003
 mel+0.003
str+0.003
 jam+0.003
Neg logit contributions
arted-0.005
pered-0.005
lee-0.005
anks-0.005
aled-0.005
Token!
Token ID0
Feature activation+1.34e+00

Pos logit contributions
pered+0.025
arted+0.025
lee+0.023
anks+0.022
aled+0.022
Neg logit contributions
 slice-0.018
str-0.018
 else-0.018
 mel-0.018
 jam-0.016
Token You
Token ID921
Feature activation-4.29e-02

Pos logit contributions
arted+0.033
pered+0.033
lee+0.032
 heads+0.031
 breaths+0.030
Neg logit contributions
 else-0.028
str-0.026
ww-0.026
 slice-0.025
 mel-0.024
Token're
Token ID821
Feature activation+2.16e+00

Pos logit contributions
 slice+0.001
 mel+0.001
str+0.001
 jam+0.001
aster+0.001
Neg logit contributions
arted-0.001
pered-0.001
lee-0.001
aled-0.001
anks-0.001
Token hurting
Token ID21530
Feature activation-1.46e-01

Pos logit contributions
arted+0.056
pered+0.056
lee+0.052
anks+0.049
aled+0.049
Neg logit contributions
 else-0.045
 slice-0.044
str-0.044
 mel-0.043
 flavor-0.039
Token my
Token ID616
Feature activation+2.37e+00

Pos logit contributions
 else+0.003
 slice+0.003
str+0.003
 mel+0.003
 flavor+0.002
Neg logit contributions
arted-0.004
pered-0.004
lee-0.004
anks-0.004
aled-0.004
Token nose
Token ID9686
Feature activation-4.07e-01

Pos logit contributions
arted+0.045
pered+0.044
lee+0.041
aled+0.037
anks+0.037
Neg logit contributions
 else-0.067
 slice-0.065
 mel-0.064
str-0.064
 flavor-0.060
Token with
Token ID351
Feature activation-1.49e+00

Pos logit contributions
 else+0.008
str+0.008
 slice+0.008
 mel+0.008
 jam+0.007
Neg logit contributions
arted-0.011
pered-0.011
lee-0.010
anks-0.009
aled-0.009
Token your
Token ID534
Feature activation+2.98e+00

Pos logit contributions
str+0.032
 slice+0.030
 else+0.030
 mel+0.029
arest+0.027
Neg logit contributions
arted-0.037
pered-0.036
lee-0.035
 heads-0.033
aled-0.032
Token all
Token ID477
Feature activation-1.70e+00

Pos logit contributions
str+0.015
 else+0.015
 slice+0.014
 mel+0.014
ww+0.013
Neg logit contributions
pered-0.019
arted-0.019
lee-0.018
 heads-0.017
anks-0.017
Token aspects
Token ID7612
Feature activation-3.89e+00

Pos logit contributions
 slice+0.030
str+0.029
 else+0.028
 mel+0.028
 flavor+0.026
Neg logit contributions
arted-0.047
pered-0.046
lee-0.045
 heads-0.042
aled-0.042
Token of
Token ID286
Feature activation-2.24e+00

Pos logit contributions
str+0.066
 slice+0.066
 mel+0.061
 loaf+0.056
 jam+0.055
Neg logit contributions
arted-0.114
pered-0.110
lee-0.105
anks-0.104
aled-0.102
Token our
Token ID674
Feature activation+1.29e+00

Pos logit contributions
str+0.043
 slice+0.041
 else+0.040
 mel+0.038
ww+0.037
Neg logit contributions
pered-0.059
arted-0.059
lee-0.058
 heads-0.054
aled-0.052
Token lives
Token ID3160
Feature activation-6.43e-01

Pos logit contributions
pered+0.030
arted+0.030
lee+0.029
 heads+0.027
anks+0.026
Neg logit contributions
 else-0.028
 slice-0.028
str-0.028
 mel-0.027
ww-0.026
Token.
Token ID13
Feature activation+1.95e+00

Pos logit contributions
str+0.013
 slice+0.013
 mel+0.013
 else+0.013
 flavor+0.012
Neg logit contributions
pered-0.017
arted-0.016
lee-0.016
anks-0.014
aled-0.014
Token If
Token ID1002
Feature activation-3.80e+00

Pos logit contributions
arted+0.046
pered+0.045
lee+0.043
 heads+0.040
anks+0.039
Neg logit contributions
 else-0.043
 slice-0.042
str-0.041
 mel-0.040
ww-0.040
Token we
Token ID356
Feature activation+9.62e-01

Pos logit contributions
str+0.059
 slice+0.057
 mel+0.054
 else+0.052
 loaf+0.045
Neg logit contributions
pered-0.112
arted-0.112
lee-0.109
 heads-0.105
anks-0.104
Token are
Token ID389
Feature activation+1.97e+00

Pos logit contributions
arted+0.025
pered+0.024
lee+0.023
 heads+0.023
anks+0.022
Neg logit contributions
str-0.018
 slice-0.018
 else-0.018
 mel-0.018
ww-0.018
Token clear
Token ID1598
Feature activation+7.34e-02

Pos logit contributions
pered+0.051
arted+0.050
lee+0.048
anks+0.044
aled+0.044
Neg logit contributions
 else-0.040
str-0.040
 slice-0.039
 mel-0.039
ww-0.036
Token and
Token ID290
Feature activation-9.50e-01

Pos logit contributions
arted+0.002
pered+0.002
lee+0.002
anks+0.002
aled+0.002
Neg logit contributions
 else-0.001
 slice-0.001
 mel-0.001
str-0.001
 flavor-0.001
Token It
Token ID632
Feature activation-1.23e+00

Pos logit contributions
arted+0.020
pered+0.020
lee+0.019
anks+0.017
aled+0.017
Neg logit contributions
 mel-0.022
 slice-0.022
 else-0.022
str-0.022
 flavor-0.021
Token was
Token ID373
Feature activation+1.31e+00

Pos logit contributions
 slice+0.019
 else+0.018
 mel+0.018
str+0.017
arest+0.016
Neg logit contributions
arted-0.037
pered-0.037
lee-0.036
sey-0.034
aled-0.034
Token so
Token ID523
Feature activation-1.15e+00

Pos logit contributions
pered+0.035
arted+0.035
lee+0.033
aled+0.031
anks+0.030
Neg logit contributions
 else-0.026
str-0.024
 slice-0.024
arest-0.023
 mel-0.022
Token big
Token ID1263
Feature activation-3.48e+00

Pos logit contributions
 slice+0.016
 else+0.016
str+0.014
 mel+0.014
aster+0.013
Neg logit contributions
arted-0.036
lee-0.036
pered-0.036
aled-0.032
anks-0.032
Token!
Token ID0
Feature activation+8.36e-01

Pos logit contributions
 slice+0.043
str+0.043
 else+0.042
 mel+0.039
 flavor+0.034
Neg logit contributions
arted-0.118
pered-0.116
lee-0.113
anks-0.107
aled-0.106
Token The
Token ID383
Feature activation-7.30e-01

Pos logit contributions
arted+0.019
pered+0.018
lee+0.017
anks+0.016
 heads+0.016
Neg logit contributions
 mel-0.020
 else-0.020
 slice-0.020
str-0.020
 flavor-0.019
Token elephant
Token ID20950
Feature activation-1.61e+00

Pos logit contributions
 else+0.018
 slice+0.017
str+0.017
 mel+0.016
aster+0.016
Neg logit contributions
arted-0.016
pered-0.016
lee-0.015
anks-0.014
 heads-0.014
Token was
Token ID373
Feature activation+2.06e+00

Pos logit contributions
 slice+0.031
aster+0.028
 mel+0.028
arest+0.028
str+0.028
Neg logit contributions
lee-0.042
arted-0.042
pered-0.041
sey-0.039
am-0.038
Token very
Token ID845
Feature activation+3.98e-01

Pos logit contributions
arted+0.057
pered+0.057
lee+0.054
anks+0.051
aled+0.050
Neg logit contributions
 else-0.039
 slice-0.038
str-0.037
 mel-0.037
 flavor-0.033
Token stubborn
Token ID24444
Feature activation-7.57e-01

Pos logit contributions
arted+0.012
pered+0.012
lee+0.012
anks+0.011
aled+0.011
Neg logit contributions
 else-0.006
 slice-0.006
str-0.006
 mel-0.006
 flavor-0.005
Token and
Token ID290
Feature activation-1.46e+00

Pos logit contributions
 else+0.011
 that+0.010
 mel+0.010
 happened+0.009
 flavor+0.009
Neg logit contributions
pered-0.025
arted-0.025
lee-0.024
aled-0.023
anks-0.022
Token He
Token ID679
Feature activation-1.96e-01

Pos logit contributions
arted+0.057
pered+0.056
lee+0.052
anks+0.049
sey+0.048
Neg logit contributions
 else-0.060
 mel-0.060
 slice-0.059
str-0.057
 flavor-0.054
Token saw
Token ID2497
Feature activation-2.32e+00

Pos logit contributions
 slice+0.003
 mel+0.003
str+0.003
 else+0.003
arest+0.003
Neg logit contributions
arted-0.006
pered-0.005
lee-0.005
 heads-0.005
aled-0.005
Token Lily
Token ID20037
Feature activation+2.00e+00

Pos logit contributions
str+0.040
arest+0.039
 slice+0.038
ww+0.037
w+0.036
Neg logit contributions
arted-0.069
pered-0.066
lee-0.065
aled-0.060
 heads-0.057
Token and
Token ID290
Feature activation-1.22e+00

Pos logit contributions
arted+0.054
pered+0.053
lee+0.051
anks+0.048
aled+0.047
Neg logit contributions
 else-0.040
 slice-0.038
str-0.038
 mel-0.038
 flavor-0.034
Token her
Token ID607
Feature activation+1.20e+00

Pos logit contributions
 else+0.023
 slice+0.022
str+0.022
 mel+0.022
ww+0.020
Neg logit contributions
arted-0.034
pered-0.034
lee-0.033
aled-0.030
anks-0.030
Token friends
Token ID2460
Feature activation-4.60e-01

Pos logit contributions
arted+0.027
pered+0.027
lee+0.025
anks+0.023
aled+0.023
Neg logit contributions
 else-0.030
 slice-0.029
str-0.028
 mel-0.028
 flavor-0.026
Token playing
Token ID2712
Feature activation+1.05e+00

Pos logit contributions
 else+0.011
 mel+0.010
 slice+0.010
str+0.010
 flavor+0.009
Neg logit contributions
arted-0.011
pered-0.011
lee-0.010
anks-0.010
aled-0.010
Token and
Token ID290
Feature activation-1.08e+00

Pos logit contributions
pered+0.030
arted+0.030
lee+0.028
anks+0.027
aled+0.027
Neg logit contributions
str-0.019
 else-0.018
 slice-0.018
 mel-0.017
ww-0.016
Token said
Token ID531
Feature activation-7.74e-01

Pos logit contributions
 else+0.021
 mel+0.019
 slice+0.019
str+0.019
 flavor+0.017
Neg logit contributions
pered-0.030
arted-0.030
lee-0.029
anks-0.028
aled-0.027
Token,
Token ID11
Feature activation-1.77e+00

Pos logit contributions
 else+0.012
str+0.011
 mel+0.011
 slice+0.011
 flavor+0.009
Neg logit contributions
arted-0.025
pered-0.025
lee-0.023
anks-0.023
aled-0.023
Token "
Token ID366
Feature activation-8.54e-01

Pos logit contributions
 else+0.038
str+0.037
 mel+0.036
 slice+0.035
 that+0.032
Neg logit contributions
arted-0.045
pered-0.044
lee-0.041
aled-0.041
anks-0.041
Token.
Token ID13
Feature activation+3.21e+00

Pos logit contributions
str+0.003
 else+0.003
 slice+0.003
 mel+0.003
ww+0.003
Neg logit contributions
pered-0.004
arted-0.004
lee-0.004
anks-0.003
aled-0.003
Token He
Token ID679
Feature activation+7.23e-01

Pos logit contributions
pered+0.060
arted+0.059
lee+0.055
 heads+0.054
anks+0.054
Neg logit contributions
 else-0.080
 mel-0.078
 that-0.077
 slice-0.077
str-0.076
Token was
Token ID373
Feature activation+2.49e+00

Pos logit contributions
arted+0.021
pered+0.020
lee+0.020
aled+0.018
anks+0.018
Neg logit contributions
 else-0.013
 slice-0.013
str-0.013
 mel-0.012
 flavor-0.011
Token grateful
Token ID14066
Feature activation-1.57e+00

Pos logit contributions
arted+0.068
pered+0.068
lee+0.064
anks+0.061
aled+0.060
Neg logit contributions
 else-0.048
 slice-0.046
str-0.046
 mel-0.046
 flavor-0.041
Token to
Token ID284
Feature activation+2.96e-01

Pos logit contributions
str+0.033
 slice+0.032
 else+0.031
aster+0.030
arest+0.030
Neg logit contributions
arted-0.039
pered-0.038
lee-0.038
 shoulders-0.034
anks-0.034
Token the
Token ID262
Feature activation-4.11e-01

Pos logit contributions
arted+0.010
pered+0.010
lee+0.009
anks+0.009
 heads+0.009
Neg logit contributions
 else-0.004
 mel-0.003
 slice-0.003
str-0.003
aster-0.003
Token bear
Token ID6842
Feature activation+1.10e+00

Pos logit contributions
 else+0.007
str+0.007
 slice+0.007
 mel+0.007
 delicious+0.006
Neg logit contributions
arted-0.012
pered-0.012
lee-0.011
anks-0.011
aled-0.011
Token for
Token ID329
Feature activation-1.59e+00

Pos logit contributions
arted+0.030
pered+0.030
lee+0.028
anks+0.026
aled+0.026
Neg logit contributions
 else-0.021
str-0.021
 slice-0.020
 mel-0.020
ww-0.018
Token guiding
Token ID26727
Feature activation-3.82e-01

Pos logit contributions
 else+0.035
 slice+0.034
str+0.033
 mel+0.033
 flavor+0.030
Neg logit contributions
arted-0.040
pered-0.039
lee-0.037
anks-0.035
aled-0.035
Token him
Token ID683
Feature activation+2.17e+00

Pos logit contributions
 else+0.008
 mel+0.008
 slice+0.008
str+0.008
aster+0.007
Neg logit contributions
arted-0.010
pered-0.010
lee-0.009
anks-0.009
aled-0.008
Token and
Token ID290
Feature activation-4.96e-01

Pos logit contributions
arted+0.060
pered+0.059
lee+0.057
aled+0.053
anks+0.053
Neg logit contributions
 else-0.042
 slice-0.039
str-0.039
 mel-0.039
 flavor-0.035
Token,
Token ID11
Feature activation-1.24e+00

Pos logit contributions
arted+0.037
pered+0.037
lee+0.035
anks+0.033
aled+0.033
Neg logit contributions
 else-0.024
 slice-0.023
 mel-0.022
str-0.022
 flavor-0.020
Token but
Token ID475
Feature activation-2.99e+00

Pos logit contributions
 else+0.024
 slice+0.023
str+0.023
 mel+0.023
 flavor+0.020
Neg logit contributions
arted-0.034
pered-0.034
lee-0.032
anks-0.030
aled-0.030
Token he
Token ID339
Feature activation-7.07e-01

Pos logit contributions
 slice+0.050
 else+0.046
str+0.045
 mel+0.045
arest+0.041
Neg logit contributions
arted-0.089
pered-0.087
lee-0.085
aled-0.079
 heads-0.078
Token kept
Token ID4030
Feature activation+4.28e+00

Pos logit contributions
 slice+0.013
 else+0.013
 mel+0.013
str+0.012
arest+0.011
Neg logit contributions
arted-0.019
pered-0.019
lee-0.019
 heads-0.017
sey-0.017
Token on
Token ID319
Feature activation+2.71e+00

Pos logit contributions
arted+0.110
pered+0.108
lee+0.104
anks+0.101
aled+0.098
Neg logit contributions
 else-0.090
 mel-0.083
 slice-0.083
str-0.081
 delicious-0.077
Token feeling
Token ID4203
Feature activation+8.56e-01

Pos logit contributions
arted+0.069
pered+0.067
lee+0.062
anks+0.061
aled+0.061
Neg logit contributions
 else-0.058
 mel-0.055
 slice-0.053
 delicious-0.052
 jam-0.051
Token scared
Token ID12008
Feature activation-1.13e+00

Pos logit contributions
arted+0.024
pered+0.024
lee+0.023
anks+0.021
aled+0.021
Neg logit contributions
 else-0.016
str-0.015
 slice-0.015
 mel-0.015
 delicious-0.014
Token.
Token ID13
Feature activation+2.08e+00

Pos logit contributions
 else+0.020
 mel+0.017
 slice+0.016
str+0.016
 that+0.015
Neg logit contributions
pered-0.034
arted-0.034
lee-0.033
anks-0.032
aled-0.031
Token When
Token ID1649
Feature activation-4.50e+00

Pos logit contributions
pered+0.045
arted+0.044
 heads+0.041
anks+0.041
lee+0.040
Neg logit contributions
str-0.048
 mel-0.048
 else-0.047
 delicious-0.045
 flavor-0.043
Token they
Token ID484
Feature activation+2.28e-01

Pos logit contributions
 else+0.054
 mel+0.050
str+0.050
 slice+0.049
 flavor+0.042
Neg logit contributions
arted-0.158
pered-0.157
lee-0.150
anks-0.144
aled-0.143
Token reached
Token ID4251
Feature activation+2.14e+00

Pos logit contributions
arted+0.006
lee+0.006
pered+0.005
aled+0.005
idget+0.005
Neg logit contributions
 slice-0.005
 mel-0.005
ww-0.004
arest-0.004
 loaf-0.004
TokenIt
Token ID1026
Feature activation-1.63e-01

Pos logit contributions
str+0.008
 else+0.008
 mel+0.008
 slice+0.008
 delicious+0.007
Neg logit contributions
pered-0.013
arted-0.013
 heads-0.012
aled-0.012
lee-0.011
Token looks
Token ID3073
Feature activation+2.79e+00

Pos logit contributions
 else+0.002
str+0.002
 slice+0.002
 mel+0.002
 flavor+0.002
Neg logit contributions
arted-0.005
pered-0.005
lee-0.005
anks-0.005
aled-0.005
Token fun
Token ID1257
Feature activation-3.63e-01

Pos logit contributions
pered+0.072
arted+0.070
lee+0.068
 shoulders+0.066
 heads+0.062
Neg logit contributions
str-0.057
 slice-0.052
aster-0.051
w-0.051
 else-0.049
Token!"
Token ID2474
Feature activation+8.84e-01

Pos logit contributions
str+0.005
 else+0.005
 slice+0.005
 mel+0.005
aster+0.004
Neg logit contributions
pered-0.012
arted-0.011
lee-0.011
anks-0.010
 heads-0.010
Token
Token ID198
Feature activation-2.51e+00

Pos logit contributions
arted+0.028
pered+0.028
lee+0.027
aled+0.026
anks+0.026
Neg logit contributions
 else-0.014
 mel-0.013
str-0.012
 slice-0.012
 jam-0.011
Token
Token ID198
Feature activation-2.18e+00

Pos logit contributions
 slice+0.045
 else+0.044
 mel+0.044
str+0.043
 jam+0.039
Neg logit contributions
pered-0.070
arted-0.070
lee-0.067
anks-0.063
aled-0.062
Token"
Token ID1
Feature activation-2.43e+00

Pos logit contributions
 else+0.029
 slice+0.028
str+0.028
 mel+0.027
 flavor+0.023
Neg logit contributions
arted-0.073
pered-0.072
lee-0.069
anks-0.065
aled-0.065
TokenOK
Token ID11380
Feature activation+4.59e-01

Pos logit contributions
 slice+0.043
str+0.042
 mel+0.042
 else+0.042
aster+0.037
Neg logit contributions
arted-0.068
pered-0.067
 heads-0.060
lee-0.060
aled-0.059
Token!"
Token ID2474
Feature activation-5.39e-02

Pos logit contributions
pered+0.013
arted+0.013
lee+0.012
 shoulders+0.012
 heads+0.012
Neg logit contributions
str-0.008
 else-0.008
 slice-0.007
 mel-0.007
aster-0.007
Token Sam
Token ID3409
Feature activation+2.09e+00

Pos logit contributions
 else+0.001
 slice+0.001
str+0.001
 mel+0.001
 flavor+0.001
Neg logit contributions
arted-0.001
pered-0.001
lee-0.001
anks-0.001
aled-0.001
Token says
Token ID1139
Feature activation+2.41e+00

Pos logit contributions
arted+0.059
pered+0.059
lee+0.056
anks+0.053
aled+0.052
Neg logit contributions
 else-0.039
 slice-0.037
 mel-0.037
str-0.037
 jam-0.033
Token bird
Token ID6512
Feature activation+8.92e-01

Pos logit contributions
 heads+0.013
pered+0.013
 shoulders+0.013
arted+0.013
lee+0.013
Neg logit contributions
 else-0.011
str-0.011
ww-0.010
aster-0.010
 slice-0.010
Token and
Token ID290
Feature activation-1.56e+00

Pos logit contributions
pered+0.022
arted+0.022
lee+0.021
anks+0.019
 heads+0.019
Neg logit contributions
 else-0.019
str-0.018
 slice-0.018
 mel-0.018
ww-0.017
Token asked
Token ID1965
Feature activation-9.46e-01

Pos logit contributions
 slice+0.026
str+0.026
arest+0.026
 else+0.024
w+0.024
Neg logit contributions
arted-0.045
lee-0.045
pered-0.043
 heads-0.039
aled-0.038
Token,
Token ID11
Feature activation-2.57e+00

Pos logit contributions
 else+0.014
 slice+0.014
str+0.013
 mel+0.013
 flavor+0.012
Neg logit contributions
arted-0.030
pered-0.030
lee-0.028
anks-0.027
aled-0.027
Token "
Token ID366
Feature activation-1.52e+00

Pos logit contributions
 else+0.062
 slice+0.060
str+0.060
 mel+0.060
 delicious+0.054
Neg logit contributions
arted-0.058
pered-0.057
lee-0.054
anks-0.051
aled-0.051
TokenWhat
Token ID2061
Feature activation-4.98e+00

Pos logit contributions
str+0.035
 else+0.034
arest+0.033
 happened+0.032
 slice+0.032
Neg logit contributions
arted-0.033
pered-0.031
lee-0.027
 heads-0.027
stick-0.027
Token happened
Token ID3022
Feature activation+2.88e+00

Pos logit contributions
str+0.096
 slice+0.094
aster+0.091
arest+0.089
 mel+0.087
Neg logit contributions
lee-0.111
pered-0.110
arted-0.109
anks-0.108
 shoulders-0.103
Token,
Token ID11
Feature activation-2.66e+00

Pos logit contributions
pered+0.075
arted+0.074
lee+0.071
anks+0.066
 shoulders+0.061
Neg logit contributions
 slice-0.058
str-0.056
 else-0.055
arest-0.051
 mel-0.051
Token Lily
Token ID20037
Feature activation-1.68e+00

Pos logit contributions
str+0.069
arest+0.064
w+0.064
 else+0.063
aster+0.063
Neg logit contributions
lee-0.053
arted-0.051
pered-0.049
 eyes-0.045
 heads-0.045
Token?"
Token ID1701
Feature activation+9.76e-01

Pos logit contributions
 else+0.019
 slice+0.018
str+0.017
 mel+0.017
arest+0.017
Neg logit contributions
lee-0.057
arted-0.057
pered-0.056
anks-0.053
sey-0.051
Token
Token ID220
Feature activation+3.90e+00

Pos logit contributions
arted+0.028
pered+0.027
anks+0.025
aled+0.025
lee+0.025
Neg logit contributions
 else-0.019
 slice-0.018
 mel-0.018
str-0.017
 jam-0.017
Token adventure
Token ID8855
Feature activation-4.99e-03

Pos logit contributions
arted+0.044
pered+0.043
lee+0.043
 heads+0.041
 shoulders+0.041
Neg logit contributions
 slice-0.036
 else-0.035
str-0.034
w-0.033
ww-0.032
Token together
Token ID1978
Feature activation-1.93e+00

Pos logit contributions
 else+0.000
 slice+0.000
str+0.000
 mel+0.000
ww+0.000
Neg logit contributions
arted-0.000
pered-0.000
lee-0.000
anks-0.000
aled-0.000
Token.
Token ID13
Feature activation+1.78e+00

Pos logit contributions
str+0.044
 slice+0.043
 else+0.043
 mel+0.042
ww+0.041
Neg logit contributions
pered-0.045
arted-0.045
lee-0.043
anks-0.039
sey-0.038
Token<|endoftext|>
Token ID50256
Feature activation-1.77e+00

Pos logit contributions
pered+0.050
arted+0.049
lee+0.046
anks+0.044
 heads+0.044
Neg logit contributions
 else-0.033
str-0.032
 mel-0.031
 slice-0.031
aster-0.027
TokenOnce
Token ID7454
Feature activation-3.30e+00

Pos logit contributions
 else+0.032
 slice+0.031
str+0.030
 mel+0.030
ww+0.027
Neg logit contributions
pered-0.050
arted-0.050
lee-0.048
anks-0.045
aled-0.045
Token upon
Token ID2402
Feature activation-3.15e+00

Pos logit contributions
 slice+0.070
 mel+0.068
 else+0.065
str+0.064
ww+0.063
Neg logit contributions
arted-0.083
lee-0.082
pered-0.080
sey-0.070
anks-0.069
Token a
Token ID257
Feature activation-3.73e+00

Pos logit contributions
str+0.062
 mel+0.054
 slice+0.053
ww+0.052
 else+0.051
Neg logit contributions
pered-0.085
arted-0.080
lee-0.076
aled-0.074
sey-0.073
Token time
Token ID640
Feature activation-1.90e+00

Pos logit contributions
 mel+0.065
str+0.060
iest+0.060
 else+0.060
 slice+0.057
Neg logit contributions
pered-0.098
arted-0.096
lee-0.093
 shoulders-0.087
 heads-0.086
Token,
Token ID11
Feature activation-4.28e+00

Pos logit contributions
 slice+0.040
 mel+0.038
str+0.036
 else+0.036
 jam+0.036
Neg logit contributions
pered-0.048
arted-0.047
lee-0.045
sey-0.040
anks-0.040
Token there
Token ID612
Feature activation-3.01e+00

Pos logit contributions
 mel+0.080
str+0.075
 slice+0.074
aster+0.072
 jam+0.071
Neg logit contributions
pered-0.124
arted-0.115
sey-0.110
lee-0.102
aled-0.098
Token was
Token ID373
Feature activation-3.65e-01

Pos logit contributions
 slice+0.072
arest+0.070
 mel+0.068
ww+0.067
str+0.066
Neg logit contributions
sey-0.061
arted-0.061
pered-0.059
lee-0.059
am-0.055
Token end
Token ID886
Feature activation-2.05e+00

Pos logit contributions
 else+0.006
str+0.006
 slice+0.005
 mel+0.005
ww+0.005
Neg logit contributions
pered-0.007
arted-0.007
lee-0.006
 heads-0.006
aled-0.006
Token.
Token ID13
Feature activation+1.85e+00

Pos logit contributions
 else+0.043
str+0.042
 mel+0.042
 slice+0.042
ww+0.038
Neg logit contributions
pered-0.051
arted-0.051
lee-0.049
anks-0.046
aled-0.044
Token<|endoftext|>
Token ID50256
Feature activation-1.38e+00

Pos logit contributions
arted+0.049
pered+0.049
lee+0.047
anks+0.044
aled+0.043
Neg logit contributions
 else-0.038
 slice-0.034
str-0.034
 mel-0.034
w-0.031
TokenOnce
Token ID7454
Feature activation-2.51e+00

Pos logit contributions
 else+0.020
 slice+0.019
str+0.019
 mel+0.019
ww+0.016
Neg logit contributions
pered-0.044
arted-0.044
lee-0.042
anks-0.040
aled-0.040
Token upon
Token ID2402
Feature activation-2.95e+00

Pos logit contributions
 slice+0.056
 mel+0.054
 else+0.052
ww+0.052
str+0.051
Neg logit contributions
arted-0.060
lee-0.058
pered-0.056
am-0.049
anks-0.049
Token a
Token ID257
Feature activation-3.98e+00

Pos logit contributions
str+0.057
 mel+0.051
ww+0.049
 slice+0.049
 else+0.048
Neg logit contributions
pered-0.079
arted-0.074
lee-0.072
aled-0.070
sey-0.069
Token time
Token ID640
Feature activation-1.20e+00

Pos logit contributions
 mel+0.066
 else+0.064
str+0.062
 slice+0.059
iest+0.058
Neg logit contributions
pered-0.108
arted-0.106
lee-0.102
 shoulders-0.097
sey-0.095
Token,
Token ID11
Feature activation-4.16e+00

Pos logit contributions
 slice+0.023
 mel+0.022
 else+0.022
str+0.021
 jam+0.021
Neg logit contributions
pered-0.032
arted-0.031
lee-0.030
anks-0.027
sey-0.027
Token there
Token ID612
Feature activation-2.61e+00

Pos logit contributions
 mel+0.081
str+0.076
 slice+0.075
aster+0.073
 jam+0.072
Neg logit contributions
pered-0.114
arted-0.106
sey-0.100
lee-0.094
aled-0.092
Token was
Token ID373
Feature activation-4.90e-01

Pos logit contributions
 slice+0.057
arest+0.056
ww+0.055
 mel+0.054
str+0.053
Neg logit contributions
arted-0.058
pered-0.057
sey-0.057
lee-0.057
am-0.053
Token a
Token ID257
Feature activation-3.79e+00

Pos logit contributions
str+0.010
 else+0.009
 slice+0.008
 mel+0.008
arest+0.008
Neg logit contributions
pered-0.013
arted-0.013
lee-0.012
aled-0.011
sey-0.011
Token she
Token ID673
Feature activation-2.52e-02

Pos logit contributions
str+0.083
ww+0.081
 slice+0.081
aster+0.077
 mel+0.077
Neg logit contributions
arted-0.105
pered-0.104
lee-0.102
anks-0.091
stick-0.088
Token could
Token ID714
Feature activation+2.66e+00

Pos logit contributions
 slice+0.000
str+0.000
ww+0.000
aster+0.000
 mel+0.000
Neg logit contributions
pered-0.001
lee-0.001
arted-0.001
 heads-0.001
anks-0.001
Token find
Token ID1064
Feature activation-1.42e+00

Pos logit contributions
pered+0.063
arted+0.061
lee+0.058
anks+0.056
aled+0.052
Neg logit contributions
str-0.061
 else-0.059
 slice-0.058
 mel-0.057
ww-0.057
Token.
Token ID13
Feature activation+1.63e+00

Pos logit contributions
str+0.036
 slice+0.035
ww+0.034
aster+0.033
arest+0.033
Neg logit contributions
pered-0.029
arted-0.029
lee-0.028
anks-0.025
sey-0.023
Token One
Token ID1881
Feature activation-5.10e+00

Pos logit contributions
pered+0.045
arted+0.044
lee+0.041
 breaths+0.038
aled+0.038
Neg logit contributions
 else-0.033
 mel-0.032
str-0.031
 slice-0.031
 flavor-0.030
Token day
Token ID1110
Feature activation-3.42e+00

Pos logit contributions
arest+0.107
 else+0.102
prise+0.100
keys+0.098
 slice+0.095
Neg logit contributions
lee-0.105
arted-0.105
pered-0.103
sey-0.101
am-0.099
Token,
Token ID11
Feature activation-3.78e+00

Pos logit contributions
 slice+0.075
str+0.071
 else+0.069
arest+0.068
 mel+0.066
Neg logit contributions
lee-0.077
pered-0.077
arted-0.076
 breaths-0.067
 shoulders-0.067
Token she
Token ID673
Feature activation-2.19e-01

Pos logit contributions
arest+0.082
str+0.081
 slice+0.080
ww+0.078
prise+0.075
Neg logit contributions
arted-0.085
pered-0.081
lee-0.078
 heads-0.075
stick-0.072
Token found
Token ID1043
Feature activation-5.12e-01

Pos logit contributions
 else+0.003
str+0.003
 slice+0.003
 mel+0.003
ww+0.003
Neg logit contributions
pered-0.007
arted-0.007
lee-0.007
aled-0.006
 heads-0.006
Token a
Token ID257
Feature activation-2.37e+00

Pos logit contributions
str+0.009
 slice+0.009
ww+0.009
 else+0.009
arest+0.008
Neg logit contributions
arted-0.015
pered-0.014
lee-0.014
aled-0.013
sey-0.013
Token really
Token ID1107
Feature activation+5.64e-02

Pos logit contributions
w+0.046
aster+0.044
arest+0.043
iest+0.041
str+0.041
Neg logit contributions
 shoulders-0.050
 eyes-0.048
 younger-0.048
lee-0.047
pered-0.047
Token away
Token ID1497
Feature activation+2.95e-02

Pos logit contributions
lee+0.096
pered+0.096
arted+0.095
anks+0.092
angers+0.086
Neg logit contributions
 else-0.077
 mel-0.073
 slice-0.073
str-0.071
 delicious-0.069
Token.
Token ID13
Feature activation+1.59e+00

Pos logit contributions
arted+0.001
pered+0.001
lee+0.001
aled+0.001
anks+0.001
Neg logit contributions
 else-0.000
str-0.000
 slice-0.000
 mel-0.000
 delicious-0.000
Token
Token ID198
Feature activation-3.20e+00

Pos logit contributions
pered+0.051
arted+0.051
lee+0.048
anks+0.046
aled+0.046
Neg logit contributions
 else-0.025
 slice-0.022
str-0.022
 mel-0.022
 jam-0.019
Token
Token ID198
Feature activation-1.95e+00

Pos logit contributions
 mel+0.075
 slice+0.074
str+0.071
ww+0.066
 jam+0.066
Neg logit contributions
arted-0.070
pered-0.070
lee-0.067
anks-0.060
aled-0.060
TokenSuddenly
Token ID38582
Feature activation-3.11e+00

Pos logit contributions
 else+0.029
 slice+0.028
 mel+0.028
str+0.028
 flavor+0.024
Neg logit contributions
arted-0.062
pered-0.062
lee-0.059
aled-0.056
anks-0.055
Token,
Token ID11
Feature activation-3.84e+00

Pos logit contributions
 else+0.054
 slice+0.054
str+0.053
 mel+0.051
 flavor+0.047
Neg logit contributions
arted-0.088
pered-0.088
lee-0.085
anks-0.077
aled-0.077
Token a
Token ID257
Feature activation-2.93e+00

Pos logit contributions
str+0.091
 slice+0.088
arest+0.087
 else+0.087
 mel+0.082
Neg logit contributions
arted-0.081
pered-0.079
lee-0.077
 breaths-0.069
aled-0.069
Token big
Token ID1263
Feature activation-4.30e-01

Pos logit contributions
 else+0.050
 slice+0.050
w+0.047
aster+0.046
 mel+0.046
Neg logit contributions
pered-0.076
arted-0.075
lee-0.075
 heads-0.066
 shoulders-0.065
Token,
Token ID11
Feature activation-2.33e+00

Pos logit contributions
 else+0.008
str+0.008
 slice+0.008
 mel+0.007
arest+0.007
Neg logit contributions
pered-0.012
arted-0.012
lee-0.011
 heads-0.010
anks-0.010
Token scary
Token ID14343
Feature activation+6.53e-01

Pos logit contributions
 else+0.047
 slice+0.045
str+0.043
aster+0.041
 mel+0.040
Neg logit contributions
arted-0.061
pered-0.061
lee-0.057
 heads-0.053
aled-0.051
Token dragon
Token ID10441
Feature activation-6.69e-01

Pos logit contributions
arted+0.019
pered+0.019
lee+0.018
anks+0.017
aled+0.017
Neg logit contributions
 else-0.011
str-0.011
 slice-0.011
 mel-0.011
 flavor-0.009