TOP ACTIVATIONS
MAX = 9.667

mad at me ?" he wh ined to himself .
icked Anna and Ben and wh ined . "
I have it ?" Ben wh ined , trying to grab
to their new friends and wh izz ed off back into
of our toys !" Lily wh ined . " We want
. They p outed and wh ined . They did not
her . He smiled and wh inn ied . The little
, Mom my ," Anna wh ined . " I don
theater . They cried and wh ined and stomp ed their
cl apped his hands and wh irled around and around .
Todd smiled and the horse wh inn ied . Todd was
you okay ?" The puppy wh ined loudly in response .
" of course !" He wh ir red around the machine
Suddenly , Brown ie wh inn ied .
car is stupid !" Ben wh ined . "
key in the lock and wh ir rr rr ... it
. Max bark ed and wh ined , trying to speak
that . They cried and wh ined . They did not
S ally wh ined , " But Mom
please , please ." She wh ined and cried and stamped

MIN ACTIVATIONS
MIN = -7.329

said they could exchange the fake horse for a real one
You can pretend that the dull chalk is any color you
We can 't use the fake food . And we can
" the store where the red shoes are ". His mom
. He realized that the dull staff was not so bad
scale . It shows how heavy things are ." Tim my
! I can replace the weak fabric with a new one
" Who has the bigger animal ?" Tom asks .
It is for measuring how heavy you are . You should
" Look , Max ! These are my past els .
un cles can give me fake pills , and fake pills
head . She thought the fake heels were silly , but
give her a special brightly coloured tooth brush or some y
boring . I like things that can destroy things ." Tim
help the animals get the fake tw ig . He would
around and added the biggest , most colourful umbrella they could
what is happening about the shrinking .
decided to buy the sweet smelling perfume . Mia was so
had enough to buy the fake dog . When
. He pretended that the red car was driving the blue

INTERVAL 5.418 - 9.667
CONTAINS 1.362%

ed . <|endoftext|> Once upon a time , there was an
. <|endoftext|> Once upon a time , there was a little
again . <|endoftext|> Once upon a time , there was a
ie . <|endoftext|> Once upon a time , there was a
there was a girl named Lily . Lily found a shiny

INTERVAL 1.169 - 5.418
CONTAINS 27.676%

its feathers . The turkey seemed to like it and didn
After a while , Billy came back with the sp ade
started to weep . She was so sad because she knew
with him anymore . He is mean and rude .
use . Every day , Jimmy went out to try and

INTERVAL -3.080 - 1.169
CONTAINS 63.630%

took the cushion from them and put it on the couch
brightly - colored ball . The children cheered and ran to
bit , but then we have to play tag ."
to ask his mom what he should do . She said
me try !" Anna says . She picks up a bigger

INTERVAL -7.329 - -3.080
CONTAINS 7.332%

. She saw him fix a bike with a broken chain
, it shook and rattled .
the lady says . " These are lovely leaves . What
. So , instead of folding , she just watched all
She had a toy laser that she loved to play with
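
The four INTERVAL ranges above look like equal-width bins spanning MIN to MAX. This is inferred from the printed numbers only; the dump does not say how the boundaries were chosen. A minimal sketch under that assumption (the helper name `interval_edges` is hypothetical and not part of any dashboard tool) recomputes the boundaries and checks that the CONTAINS shares cover all samples:

```python
# Assumption: the dashboard splits [MIN, MAX] into equal-width bins.
# Inferred from the printed boundaries; not confirmed by the source.

MIN_ACT = -7.329   # "MIN" value from the dump
MAX_ACT = 9.667    # "MAX" value from the dump
N_BINS = 4         # number of INTERVAL sections in the dump


def interval_edges(lo, hi, n_bins):
    """Return the n_bins + 1 edges of equal-width bins, rounded to 3 decimals."""
    width = (hi - lo) / n_bins
    return [round(lo + i * width, 3) for i in range(n_bins + 1)]


edges = interval_edges(MIN_ACT, MAX_ACT, N_BINS)

# CONTAINS percentages as printed, from the highest interval to the lowest.
shares = [1.362, 27.676, 63.630, 7.332]
total_share = sum(shares)

print(edges)
print(round(total_share, 3))
```

Under this assumption each bin is about 4.249 wide, and the recomputed edges match the printed boundaries to three decimals.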
Token mad
Token ID8805
Feature activation-9.78e-01

Pos logit contributions
 named+0.036
ind+0.033
o+0.033
e+0.032
ippy+0.031
Neg logit contributions
rier-0.040
der-0.039
 less-0.037
fulness-0.036
 maybe-0.036
Token at
Token ID379
Feature activation-6.49e-01

Pos logit contributions
der+0.024
rier+0.023
umm+0.022
fulness+0.022
 horizon+0.022
Neg logit contributions
 named-0.016
e-0.014
ind-0.014
o-0.014
 walking-0.013
Token me
Token ID502
Feature activation-1.01e+00

Pos logit contributions
der+0.017
rier+0.016
umm+0.016
 horizon+0.016
bread+0.016
Neg logit contributions
 named-0.010
e-0.008
o-0.008
ind-0.008
 walking-0.008
Token?"
Token ID1701
Feature activation-4.40e-01

Pos logit contributions
der+0.024
rier+0.023
 Which+0.021
fulness+0.021
bread+0.021
Neg logit contributions
 named-0.017
e-0.015
ind-0.015
o-0.015
 walking-0.015
Token he
Token ID339
Feature activation+4.11e+00

Pos logit contributions
der+0.009
umm+0.009
rier+0.009
lights+0.009
 horizon+0.009
Neg logit contributions
 named-0.010
 walking-0.009
o-0.008
e-0.008
 playing-0.008
Token wh
Token ID348
Feature activation+9.67e+00

Pos logit contributions
 named+0.074
ind+0.067
e+0.066
o+0.065
 walking+0.062
Neg logit contributions
der-0.092
rier-0.090
 maybe-0.084
bread-0.083
fulness-0.082
Tokenined
Token ID1389
Feature activation+1.23e+00

Pos logit contributions
 named+0.169
 hiding+0.158
 walking+0.154
 playing+0.153
ind+0.153
Neg logit contributions
der-0.190
fulness-0.180
lights-0.180
 Moral-0.166
rier-0.162
Token to
Token ID284
Feature activation+5.07e-01

Pos logit contributions
 named+0.022
e+0.019
o+0.019
ind+0.019
 walking+0.019
Neg logit contributions
der-0.028
rier-0.028
umm-0.026
 horizon-0.026
lights-0.026
Token himself
Token ID2241
Feature activation+2.55e+00

Pos logit contributions
 named+0.009
e+0.008
 walking+0.007
 playing+0.007
 running+0.007
Neg logit contributions
umm-0.014
bread-0.013
der-0.013
lights-0.013
cakes-0.013
Token.
Token ID13
Feature activation-2.77e+00

Pos logit contributions
 named+0.047
ind+0.041
e+0.040
o+0.040
aky+0.039
Neg logit contributions
der-0.059
rier-0.057
 horizon-0.052
bread-0.052
fulness-0.052
Token
Token ID198
Feature activation+1.17e+00

Pos logit contributions
lights+0.069
umm+0.067
bread+0.067
rier+0.065
der+0.064
Neg logit contributions
 named-0.050
o-0.042
 walking-0.041
 playing-0.039
oo-0.036
Tokenicked
Token ID9484
Feature activation-9.24e-02

Pos logit contributions
 named+0.048
ind+0.048
 walking+0.045
 playing+0.044
o+0.044
Neg logit contributions
der-0.036
rier-0.031
fulness-0.029
lights-0.027
 Someone-0.027
Token Anna
Token ID11735
Feature activation-2.77e-01

Pos logit contributions
der+0.002
umm+0.002
rier+0.002
bread+0.002
 horizon+0.002
Neg logit contributions
 named-0.002
e-0.002
o-0.002
a-0.001
 walking-0.001
Token and
Token ID290
Feature activation-2.01e+00

Pos logit contributions
der+0.005
rier+0.005
 less+0.005
bread+0.005
fulness+0.005
Neg logit contributions
 named-0.006
ind-0.005
asc-0.005
o-0.005
 walking-0.005
Token Ben
Token ID3932
Feature activation+1.03e+00

Pos logit contributions
der+0.046
rier+0.046
umm+0.045
fulness+0.044
 horizon+0.043
Neg logit contributions
 named-0.036
e-0.030
o-0.030
 walking-0.030
 playing-0.029
Token and
Token ID290
Feature activation-3.25e+00

Pos logit contributions
 named+0.022
ind+0.021
asc+0.021
o+0.020
P+0.020
Neg logit contributions
der-0.018
rier-0.018
 less-0.018
fulness-0.016
 maybe-0.016
Token wh
Token ID348
Feature activation+9.59e+00

Pos logit contributions
rier+0.081
der+0.079
umm+0.078
 horizon+0.076
bread+0.075
Neg logit contributions
 named-0.057
o-0.046
 walking-0.045
e-0.045
 playing-0.043
Tokenined
Token ID1389
Feature activation-1.50e-02

Pos logit contributions
 named+0.166
 walking+0.151
 hiding+0.150
 playing+0.146
 abandoned+0.145
Neg logit contributions
lights-0.193
fulness-0.188
der-0.183
 Moral-0.167
oral-0.164
Token.
Token ID13
Feature activation-3.45e+00

Pos logit contributions
der+0.000
rier+0.000
umm+0.000
 horizon+0.000
bread+0.000
Neg logit contributions
 named-0.000
e-0.000
o-0.000
ind-0.000
 walking-0.000
Token
Token ID198
Feature activation+2.01e+00

Pos logit contributions
umm+0.066
rier+0.065
der+0.062
bread+0.062
fulness+0.061
Neg logit contributions
 named-0.080
o-0.070
 walking-0.066
 playing-0.064
e-0.062
Token
Token ID198
Feature activation+1.98e+00

Pos logit contributions
 named+0.048
o+0.043
e+0.043
ind+0.042
 walking+0.042
Neg logit contributions
der-0.035
rier-0.034
umm-0.032
bread-0.031
 horizon-0.031
Token"
Token ID1
Feature activation-1.25e-01

Pos logit contributions
 named+0.041
 walking+0.039
 playing+0.038
e+0.038
ow+0.037
Neg logit contributions
der-0.036
rier-0.034
 Which-0.034
 less-0.034
 Moral-0.033
Token I
Token ID314
Feature activation+5.30e-01

Pos logit contributions
 named+0.013
e+0.012
 playing+0.012
 walking+0.011
o+0.011
Neg logit contributions
der-0.011
rier-0.011
umm-0.010
 horizon-0.010
bread-0.010
Token have
Token ID423
Feature activation-3.01e-01

Pos logit contributions
 named+0.012
e+0.010
o+0.010
ind+0.010
 walking+0.010
Neg logit contributions
der-0.011
rier-0.010
umm-0.010
lights-0.010
 horizon-0.009
Token it
Token ID340
Feature activation-1.09e+00

Pos logit contributions
rier+0.006
umm+0.006
der+0.006
lights+0.006
 horizon+0.006
Neg logit contributions
 named-0.007
 playing-0.006
 walking-0.006
e-0.006
o-0.006
Token?"
Token ID1701
Feature activation+1.14e-01

Pos logit contributions
der+0.027
rier+0.027
umm+0.027
 horizon+0.026
lights+0.026
Neg logit contributions
 named-0.018
o-0.016
e-0.015
 playing-0.014
a-0.014
Token Ben
Token ID3932
Feature activation+4.98e+00

Pos logit contributions
 named+0.002
ind+0.002
e+0.002
o+0.002
aky+0.002
Neg logit contributions
der-0.003
rier-0.002
 maybe-0.002
 Which-0.002
 horizon-0.002
Token wh
Token ID348
Feature activation+9.40e+00

Pos logit contributions
 named+0.093
ind+0.093
asc+0.086
e+0.086
ippy+0.083
Neg logit contributions
der-0.104
rier-0.097
 maybe-0.093
 less-0.092
bread-0.092
Tokenined
Token ID1389
Feature activation+1.28e+00

Pos logit contributions
 named+0.158
 walking+0.147
ind+0.145
 abandoned+0.144
 hiding+0.141
Neg logit contributions
der-0.193
fulness-0.182
lights-0.181
 Moral-0.179
oral-0.173
Token,
Token ID11
Feature activation-1.91e-01

Pos logit contributions
 named+0.023
e+0.020
o+0.020
ind+0.020
 walking+0.020
Neg logit contributions
der-0.029
rier-0.029
umm-0.027
lights-0.027
 horizon-0.026
Token trying
Token ID2111
Feature activation+9.95e-01

Pos logit contributions
umm+0.005
fulness+0.004
rier+0.004
der+0.004
bread+0.004
Neg logit contributions
 named-0.004
 walking-0.003
 playing-0.003
e-0.003
 sitting-0.003
Token to
Token ID284
Feature activation+5.63e-01

Pos logit contributions
 named+0.018
o+0.015
e+0.015
 walking+0.015
 playing+0.015
Neg logit contributions
umm-0.023
der-0.023
rier-0.023
 horizon-0.022
bread-0.022
Token grab
Token ID5552
Feature activation-5.45e-01

Pos logit contributions
 named+0.012
e+0.011
 walking+0.010
 playing+0.010
ind+0.010
Neg logit contributions
der-0.011
rier-0.011
umm-0.011
lights-0.010
bread-0.010
Token to
Token ID284
Feature activation-1.11e+00

Pos logit contributions
 named+0.023
o+0.019
e+0.019
 walking+0.019
ind+0.019
Neg logit contributions
der-0.026
rier-0.026
umm-0.025
 Which-0.025
lights-0.025
Token their
Token ID511
Feature activation-7.30e-01

Pos logit contributions
der+0.027
rier+0.027
umm+0.027
bread+0.026
lights+0.025
Neg logit contributions
 named-0.020
 walking-0.017
e-0.017
 playing-0.016
ind-0.016
Token new
Token ID649
Feature activation-2.70e+00

Pos logit contributions
der+0.013
rier+0.013
 horizon+0.013
umm+0.013
lights+0.012
Neg logit contributions
 named-0.016
o-0.015
e-0.015
aky-0.014
ek-0.014
Token friends
Token ID2460
Feature activation+6.43e-01

Pos logit contributions
der+0.053
rier+0.052
 horizon+0.051
umm+0.051
 prayers+0.049
Neg logit contributions
 named-0.051
e-0.049
aky-0.048
o-0.048
ek-0.046
Token and
Token ID290
Feature activation-1.88e+00

Pos logit contributions
 named+0.011
ind+0.010
e+0.010
o+0.010
aky+0.010
Neg logit contributions
der-0.015
rier-0.014
fulness-0.013
 horizon-0.013
bread-0.013
Token wh
Token ID348
Feature activation+9.33e+00

Pos logit contributions
der+0.043
rier+0.042
umm+0.040
 Which+0.039
lights+0.039
Neg logit contributions
 named-0.036
e-0.031
o-0.030
 walking-0.030
ind-0.029
Tokenizz
Token ID6457
Feature activation+1.66e+00

Pos logit contributions
 named+0.165
 hiding+0.158
 abandoned+0.148
 playing+0.148
 walking+0.148
Neg logit contributions
lights-0.177
fulness-0.173
oral-0.162
der-0.160
 Moral-0.156
Tokened
Token ID276
Feature activation+6.34e-01

Pos logit contributions
 named+0.031
ind+0.026
aky+0.026
e+0.025
 walking+0.025
Neg logit contributions
der-0.036
rier-0.036
fulness-0.035
bread-0.034
lights-0.033
Token off
Token ID572
Feature activation+1.21e+00

Pos logit contributions
 named+0.014
e+0.012
o+0.012
ind+0.012
 walking+0.012
Neg logit contributions
der-0.012
rier-0.012
umm-0.011
 horizon-0.011
bread-0.011
Token back
Token ID736
Feature activation+1.54e+00

Pos logit contributions
 named+0.023
e+0.020
o+0.019
 walking+0.019
ind+0.019
Neg logit contributions
der-0.027
rier-0.026
umm-0.025
 Which-0.025
bread-0.025
Token into
Token ID656
Feature activation+8.26e-01

Pos logit contributions
 named+0.026
e+0.023
ind+0.022
o+0.022
aky+0.022
Neg logit contributions
der-0.037
rier-0.035
umm-0.034
 horizon-0.034
lights-0.034
Token of
Token ID286
Feature activation-1.42e+00

Pos logit contributions
der+0.006
rier+0.006
lights+0.005
umm+0.005
bread+0.005
Neg logit contributions
 named-0.005
e-0.004
ind-0.004
o-0.004
ek-0.004
Token our
Token ID674
Feature activation-2.73e+00

Pos logit contributions
der+0.034
rier+0.033
 maybe+0.031
lights+0.031
 horizon+0.031
Neg logit contributions
 named-0.024
e-0.021
o-0.021
aky-0.021
a-0.021
Token toys
Token ID14958
Feature activation-1.14e+00

Pos logit contributions
lights+0.049
der+0.047
 horizon+0.045
 lesson+0.044
 less+0.044
Neg logit contributions
e-0.057
 named-0.057
ek-0.055
aky-0.054
a-0.053
Token!"
Token ID2474
Feature activation-1.98e-01

Pos logit contributions
der+0.027
rier+0.025
lights+0.024
bread+0.024
 horizon+0.024
Neg logit contributions
 named-0.020
e-0.017
ind-0.017
o-0.017
 walking-0.017
Token Lily
Token ID20037
Feature activation+3.80e+00

Pos logit contributions
der+0.005
rier+0.005
umm+0.005
fulness+0.005
 horizon+0.005
Neg logit contributions
 named-0.003
e-0.003
o-0.003
 walking-0.003
 playing-0.003
Token wh
Token ID348
Feature activation+9.30e+00

Pos logit contributions
 named+0.070
ind+0.066
e+0.064
 walking+0.061
o+0.061
Neg logit contributions
der-0.085
rier-0.077
bread-0.075
 maybe-0.074
 less-0.074
Tokenined
Token ID1389
Feature activation+1.22e+00

Pos logit contributions
 named+0.154
 walking+0.144
ind+0.141
 hiding+0.141
 abandoned+0.138
Neg logit contributions
der-0.190
fulness-0.186
lights-0.185
 Moral-0.179
oral-0.173
Token.
Token ID13
Feature activation-1.69e+00

Pos logit contributions
 named+0.022
e+0.019
o+0.019
ind+0.019
 walking+0.018
Neg logit contributions
der-0.028
rier-0.027
umm-0.026
lights-0.026
 horizon-0.026
Token "
Token ID366
Feature activation-4.45e-01

Pos logit contributions
rier+0.041
umm+0.041
bread+0.040
 perfection+0.039
fulness+0.039
Neg logit contributions
 named-0.031
 playing-0.026
 walking-0.026
o-0.025
oo-0.023
TokenWe
Token ID1135
Feature activation+5.13e-01

Pos logit contributions
umm+0.009
rier+0.009
der+0.009
lights+0.009
 horizon+0.009
Neg logit contributions
 named-0.009
e-0.008
a-0.008
P-0.008
aky-0.008
Token want
Token ID765
Feature activation-1.69e+00

Pos logit contributions
ind+0.011
ippy+0.011
 named+0.011
 walking+0.011
oo+0.010
Neg logit contributions
der-0.008
 maybe-0.008
 less-0.008
 becomes-0.007
 Which-0.007
Token.
Token ID13
Feature activation-3.04e+00

Pos logit contributions
 named+0.013
o+0.010
 walking+0.010
e+0.010
 playing+0.010
Neg logit contributions
rier-0.020
der-0.020
umm-0.020
 horizon-0.019
bread-0.019
Token They
Token ID1119
Feature activation+1.88e+00

Pos logit contributions
fulness+0.055
der+0.054
rier+0.054
bread+0.052
umm+0.052
Neg logit contributions
 named-0.072
o-0.061
 playing-0.059
 walking-0.059
ind-0.058
Token p
Token ID279
Feature activation+3.63e+00

Pos logit contributions
ind+0.036
ippy+0.036
oo+0.035
aking+0.035
o+0.035
Neg logit contributions
der-0.040
 maybe-0.035
 less-0.034
 Someone-0.033
 Which-0.032
Tokenouted
Token ID18534
Feature activation+5.96e-01

Pos logit contributions
 named+0.079
e+0.078
iley+0.072
 hiding+0.071
 seeker+0.070
Neg logit contributions
der-0.061
lights-0.060
cakes-0.060
rier-0.055
 Which-0.054
Token and
Token ID290
Feature activation+1.04e+00

Pos logit contributions
 named+0.010
 walking+0.008
e+0.008
o+0.008
 playing+0.008
Neg logit contributions
rier-0.015
der-0.015
umm-0.015
bread-0.014
 horizon-0.014
Token wh
Token ID348
Feature activation+9.25e+00

Pos logit contributions
ind+0.019
ippy+0.018
 named+0.018
o+0.018
e+0.018
Neg logit contributions
der-0.023
 maybe-0.022
 less-0.021
rier-0.020
 horizon-0.020
Tokenined
Token ID1389
Feature activation+2.23e-01

Pos logit contributions
 named+0.164
 hiding+0.150
 walking+0.147
ind+0.143
 playing+0.143
Neg logit contributions
lights-0.180
der-0.178
fulness-0.171
 Moral-0.163
oral-0.159
Token.
Token ID13
Feature activation-3.46e+00

Pos logit contributions
 named+0.003
e+0.003
 walking+0.003
ind+0.003
o+0.003
Neg logit contributions
der-0.006
rier-0.006
umm-0.006
 horizon-0.005
bread-0.005
Token They
Token ID1119
Feature activation+6.07e-01

Pos logit contributions
rier+0.064
fulness+0.064
umm+0.063
der+0.063
bread+0.061
Neg logit contributions
 named-0.082
o-0.068
a-0.067
 walking-0.067
 playing-0.066
Token did
Token ID750
Feature activation+2.94e+00

Pos logit contributions
ind+0.011
ippy+0.011
aking+0.011
oo+0.011
asc+0.011
Neg logit contributions
der-0.013
 maybe-0.012
 less-0.011
rier-0.011
 Which-0.011
Token not
Token ID407
Feature activation+3.43e+00

Pos logit contributions
 named+0.046
 walking+0.038
 playing+0.038
e+0.037
o+0.034
Neg logit contributions
umm-0.080
rier-0.077
der-0.076
 horizon-0.074
bread-0.074
Token her
Token ID607
Feature activation-1.33e+00

Pos logit contributions
der+0.011
rier+0.010
lights+0.010
fulness+0.010
 horizon+0.010
Neg logit contributions
 named-0.010
e-0.009
ind-0.009
o-0.009
aky-0.009
Token.
Token ID13
Feature activation-4.10e+00

Pos logit contributions
der+0.032
rier+0.031
umm+0.029
 horizon+0.029
bread+0.029
Neg logit contributions
 named-0.023
e-0.020
o-0.020
ind-0.020
 walking-0.019
Token He
Token ID679
Feature activation-1.28e-01

Pos logit contributions
rier+0.074
lights+0.071
der+0.069
fulness+0.069
umm+0.067
Neg logit contributions
 named-0.098
o-0.086
 walking-0.086
 playing-0.083
e-0.077
Token smiled
Token ID13541
Feature activation+1.77e+00

Pos logit contributions
der+0.003
rier+0.003
umm+0.003
 horizon+0.003
bread+0.003
Neg logit contributions
 named-0.002
e-0.002
o-0.002
ind-0.002
 walking-0.002
Token and
Token ID290
Feature activation-6.46e-01

Pos logit contributions
 named+0.027
o+0.022
e+0.022
 walking+0.022
ind+0.022
Neg logit contributions
der-0.045
rier-0.044
umm-0.043
 Which-0.043
bread-0.042
Token wh
Token ID348
Feature activation+9.19e+00

Pos logit contributions
rier+0.016
der+0.016
umm+0.016
lights+0.015
 horizon+0.015
Neg logit contributions
 named-0.011
o-0.009
 walking-0.009
e-0.009
 playing-0.008
Tokeninn
Token ID3732
Feature activation+3.94e+00

Pos logit contributions
 named+0.165
 hiding+0.153
 playing+0.149
 walking+0.148
ind+0.146
Neg logit contributions
lights-0.174
fulness-0.171
der-0.169
 Moral-0.149
antly-0.147
Tokenied
Token ID798
Feature activation+5.41e-01

Pos logit contributions
 named+0.078
ind+0.068
P+0.066
 seeker+0.066
asc+0.066
Neg logit contributions
der-0.088
fulness-0.083
rier-0.077
lights-0.076
imet-0.070
Token.
Token ID13
Feature activation-3.76e+00

Pos logit contributions
 named+0.010
ind+0.009
aky+0.009
o+0.009
e+0.009
Neg logit contributions
der-0.012
rier-0.012
 less-0.011
fulness-0.011
bread-0.011
Token The
Token ID383
Feature activation-2.50e+00

Pos logit contributions
rier+0.072
der+0.067
lights+0.067
fulness+0.064
umm+0.064
Neg logit contributions
 named-0.088
o-0.080
 walking-0.077
 playing-0.075
e-0.070
Token little
Token ID1310
Feature activation-3.35e+00

Pos logit contributions
der+0.048
rier+0.047
bread+0.044
umm+0.044
 horizon+0.044
Neg logit contributions
 named-0.054
e-0.049
o-0.049
ind-0.048
 walking-0.047
Token,
Token ID11
Feature activation-4.41e+00

Pos logit contributions
der+0.018
rier+0.018
umm+0.017
lights+0.017
bread+0.017
Neg logit contributions
 named-0.016
e-0.015
ind-0.014
o-0.014
 walking-0.014
Token Mom
Token ID11254
Feature activation+1.21e+00

Pos logit contributions
der+0.093
umm+0.091
rier+0.088
bread+0.087
 horizon+0.086
Neg logit contributions
 named-0.089
 playing-0.076
e-0.074
a-0.074
 walking-0.072
Tokenmy
Token ID1820
Feature activation+1.11e+00

Pos logit contributions
 named+0.023
ind+0.020
e+0.020
aky+0.019
 walking+0.019
Neg logit contributions
der-0.026
rier-0.025
umm-0.024
bread-0.024
 maybe-0.024
Token,"
Token ID553
Feature activation-1.22e+00

Pos logit contributions
 named+0.022
e+0.020
ind+0.020
o+0.019
aky+0.019
Neg logit contributions
der-0.023
rier-0.022
bread-0.021
umm-0.021
fulness-0.021
Token Anna
Token ID11735
Feature activation+3.77e+00

Pos logit contributions
der+0.027
rier+0.027
umm+0.026
 horizon+0.026
bread+0.025
Neg logit contributions
 named-0.023
e-0.020
 walking-0.019
 playing-0.019
a-0.019
Token wh
Token ID348
Feature activation+9.16e+00

Pos logit contributions
 named+0.073
ind+0.073
asc+0.067
e+0.066
 walking+0.066
Neg logit contributions
der-0.078
rier-0.072
 maybe-0.070
bread-0.068
fulness-0.068
Tokenined
Token ID1389
Feature activation+6.51e-01

Pos logit contributions
 named+0.159
 hiding+0.147
 walking+0.145
ind+0.144
 abandoned+0.143
Neg logit contributions
der-0.179
fulness-0.178
lights-0.176
 Moral-0.164
oral-0.160
Token.
Token ID13
Feature activation-1.81e+00

Pos logit contributions
 named+0.012
e+0.011
o+0.010
ind+0.010
 walking+0.010
Neg logit contributions
der-0.015
rier-0.015
lights-0.014
umm-0.013
fulness-0.013
Token "
Token ID366
Feature activation-7.40e-01

Pos logit contributions
rier+0.037
umm+0.036
bread+0.036
lights+0.035
fulness+0.034
Neg logit contributions
 named-0.039
 walking-0.035
 playing-0.035
o-0.033
 running-0.031
TokenI
Token ID40
Feature activation+1.01e+00

Pos logit contributions
lights+0.016
der+0.016
rier+0.016
umm+0.016
 horizon+0.015
Neg logit contributions
a-0.013
e-0.013
aky-0.013
 named-0.013
P-0.013
Token don
Token ID836
Feature activation+2.82e+00

Pos logit contributions
 named+0.021
ind+0.019
 walking+0.018
e+0.018
o+0.018
Neg logit contributions
der-0.020
rier-0.019
bread-0.018
 maybe-0.018
 Which-0.018
Token theater
Token ID13766
Feature activation-1.39e+00

Pos logit contributions
 named+0.004
o+0.003
e+0.003
aky+0.003
ek+0.003
Neg logit contributions
der-0.005
rier-0.005
 horizon-0.005
 less-0.005
 Which-0.005
Token.
Token ID13
Feature activation-2.86e+00

Pos logit contributions
der+0.032
rier+0.031
umm+0.030
 Which+0.030
 horizon+0.029
Neg logit contributions
 named-0.025
e-0.021
o-0.021
ind-0.021
 walking-0.021
Token They
Token ID1119
Feature activation+6.66e-01

Pos logit contributions
rier+0.062
der+0.060
umm+0.059
bread+0.059
fulness+0.058
Neg logit contributions
 named-0.064
o-0.053
 playing-0.049
 walking-0.049
oo-0.048
Token cried
Token ID16896
Feature activation+4.92e-01

Pos logit contributions
ind+0.011
 named+0.011
o+0.011
e+0.011
aking+0.011
Neg logit contributions
der-0.015
 maybe-0.014
rier-0.013
 less-0.013
 Which-0.013
Token and
Token ID290
Feature activation+5.44e-01

Pos logit contributions
 named+0.009
e+0.007
o+0.007
ind+0.007
 walking+0.007
Neg logit contributions
der-0.012
rier-0.011
umm-0.011
 horizon-0.011
bread-0.011
Token wh
Token ID348
Feature activation+9.16e+00

Pos logit contributions
ind+0.009
aking+0.009
 named+0.009
ippy+0.009
e+0.009
Neg logit contributions
der-0.012
rier-0.011
 maybe-0.011
 less-0.011
lights-0.011
Tokenined
Token ID1389
Feature activation+1.13e-01

Pos logit contributions
 named+0.150
 walking+0.140
 hiding+0.139
 playing+0.134
ind+0.131
Neg logit contributions
der-0.189
lights-0.189
fulness-0.185
 Moral-0.174
oral-0.172
Token and
Token ID290
Feature activation-1.76e-01

Pos logit contributions
 named+0.002
e+0.002
o+0.002
ind+0.002
 walking+0.001
Neg logit contributions
der-0.003
rier-0.003
umm-0.003
 horizon-0.003
bread-0.003
Token stomp
Token ID32221
Feature activation+1.86e+00

Pos logit contributions
der+0.004
rier+0.004
 maybe+0.004
 less+0.004
lights+0.004
Neg logit contributions
 named-0.003
ind-0.003
ippy-0.003
o-0.003
e-0.003
Tokened
Token ID276
Feature activation-9.26e-01

Pos logit contributions
 named+0.042
e+0.038
ind+0.037
o+0.037
 walking+0.037
Neg logit contributions
der-0.034
rier-0.033
umm-0.031
fulness-0.030
lights-0.030
Token their
Token ID511
Feature activation-5.40e-01

Pos logit contributions
der+0.020
rier+0.020
lights+0.018
fulness+0.018
umm+0.018
Neg logit contributions
 named-0.018
o-0.016
e-0.015
ind-0.015
aky-0.015
Token cl
Token ID537
Feature activation+3.11e+00

Pos logit contributions
 named+0.010
ind+0.009
e+0.009
o+0.009
aky+0.009
Neg logit contributions
der-0.013
rier-0.012
 maybe-0.012
 Which-0.012
bread-0.012
Tokenapped
Token ID6320
Feature activation+8.45e-01

Pos logit contributions
 named+0.065
 walking+0.060
ind+0.059
aky+0.056
 hiding+0.056
Neg logit contributions
der-0.061
rier-0.057
umm-0.056
lights-0.054
 Which-0.053
Token his
Token ID465
Feature activation-9.85e-02

Pos logit contributions
 named+0.017
e+0.015
o+0.014
ind+0.014
 walking+0.014
Neg logit contributions
der-0.018
rier-0.017
umm-0.017
lights-0.016
 horizon-0.016
Token hands
Token ID2832
Feature activation+1.52e+00

Pos logit contributions
der+0.002
rier+0.002
umm+0.002
lights+0.002
 horizon+0.002
Neg logit contributions
 named-0.002
e-0.002
o-0.002
ind-0.002
 walking-0.002
Token and
Token ID290
Feature activation-8.21e-01

Pos logit contributions
 named+0.029
ind+0.027
aky+0.025
o+0.025
e+0.025
Neg logit contributions
der-0.033
rier-0.031
 horizon-0.030
fulness-0.029
 less-0.029
Token wh
Token ID348
Feature activation+9.13e+00

Pos logit contributions
der+0.020
rier+0.020
umm+0.019
 Which+0.019
 horizon+0.019
Neg logit contributions
 named-0.014
e-0.011
o-0.011
 walking-0.011
 playing-0.011
Tokenirled
Token ID49376
Feature activation+1.24e+00

Pos logit contributions
 named+0.171
 hiding+0.155
 walking+0.149
 abandoned+0.148
 playing+0.144
Neg logit contributions
lights-0.178
der-0.176
fulness-0.170
oral-0.152
 Moral-0.152
Token around
Token ID1088
Feature activation+5.74e-01

Pos logit contributions
 named+0.022
ind+0.019
e+0.019
o+0.019
aky+0.019
Neg logit contributions
der-0.029
rier-0.028
umm-0.026
fulness-0.026
lights-0.026
Token and
Token ID290
Feature activation-9.03e-01

Pos logit contributions
 named+0.009
e+0.008
o+0.008
ind+0.008
 walking+0.008
Neg logit contributions
der-0.014
rier-0.014
umm-0.013
bread-0.013
 Which-0.013
Token around
Token ID1088
Feature activation+1.16e+00

Pos logit contributions
der+0.022
rier+0.021
umm+0.020
bread+0.020
 Which+0.020
Neg logit contributions
 named-0.015
e-0.013
o-0.013
 walking-0.013
ind-0.013
Token.
Token ID13
Feature activation-3.81e+00

Pos logit contributions
 named+0.020
e+0.018
 walking+0.017
o+0.017
ind+0.017
Neg logit contributions
der-0.027
rier-0.027
umm-0.026
bread-0.025
 Which-0.025
Token Todd
Token ID14377
Feature activation+2.43e+00

Pos logit contributions
rier+0.038
der+0.038
umm+0.036
lights+0.036
fulness+0.035
Neg logit contributions
 named-0.047
 walking-0.043
o-0.042
e-0.041
ind-0.041
Token smiled
Token ID13541
Feature activation+2.11e+00

Pos logit contributions
asc+0.051
ind+0.047
 named+0.047
 seeker+0.046
ippy+0.045
Neg logit contributions
der-0.042
bread-0.039
 maybe-0.039
 becomes-0.039
antly-0.037
Token and
Token ID290
Feature activation-1.10e-01

Pos logit contributions
 named+0.035
o+0.029
e+0.029
ind+0.029
 walking+0.029
Neg logit contributions
der-0.052
rier-0.051
umm-0.049
 Which-0.048
fulness-0.048
Token the
Token ID262
Feature activation-8.50e-01

Pos logit contributions
der+0.003
rier+0.003
umm+0.003
fulness+0.003
lights+0.003
Neg logit contributions
 named-0.002
o-0.001
 walking-0.001
e-0.001
 playing-0.001
Token horse
Token ID8223
Feature activation-5.73e-02

Pos logit contributions
der+0.018
rier+0.018
umm+0.017
 horizon+0.016
bread+0.016
Neg logit contributions
 named-0.017
e-0.015
o-0.015
ind-0.015
 walking-0.014
Token wh
Token ID348
Feature activation+9.13e+00

Pos logit contributions
der+0.001
rier+0.001
bread+0.001
 horizon+0.001
umm+0.001
Neg logit contributions
 named-0.001
ind-0.001
e-0.001
o-0.001
aky-0.001
Tokeninn
Token ID3732
Feature activation+2.77e+00

Pos logit contributions
 named+0.171
 hiding+0.157
 walking+0.154
 playing+0.153
 abandoned+0.143
Neg logit contributions
lights-0.170
der-0.168
fulness-0.167
 Moral-0.145
oral-0.144
Tokenied
Token ID798
Feature activation+6.78e-01

Pos logit contributions
 named+0.057
P+0.050
ind+0.049
asc+0.047
e+0.047
Neg logit contributions
der-0.060
fulness-0.058
rier-0.056
lights-0.054
bread-0.050
Token.
Token ID13
Feature activation-2.33e+00

Pos logit contributions
 named+0.013
aky+0.012
o+0.012
ind+0.012
P+0.011
Neg logit contributions
rier-0.015
der-0.015
 less-0.013
fulness-0.013
bread-0.013
Token Todd
Token ID14377
Feature activation+1.95e+00

Pos logit contributions
rier+0.041
der+0.040
umm+0.038
lights+0.038
fulness+0.037
Neg logit contributions
 named-0.055
 walking-0.051
o-0.049
 playing-0.048
e-0.047
Token was
Token ID373
Feature activation+4.14e+00

Pos logit contributions
asc+0.039
 named+0.038
ind+0.038
ippy+0.036
aky+0.035
Neg logit contributions
der-0.034
bread-0.031
 becomes-0.031
 maybe-0.031
antly-0.030
Token you
Token ID345
Feature activation+1.72e+00

Pos logit contributions
 named+0.019
e+0.017
aky+0.017
ind+0.017
o+0.017
Neg logit contributions
der-0.018
rier-0.017
 maybe-0.016
 Which-0.016
fulness-0.016
Token okay
Token ID8788
Feature activation+4.71e-01

Pos logit contributions
 named+0.040
ind+0.039
ippy+0.037
e+0.036
P+0.036
Neg logit contributions
rier-0.026
 less-0.025
der-0.025
 maybe-0.025
fulness-0.024
Token?"
Token ID1701
Feature activation-1.77e+00

Pos logit contributions
 named+0.008
e+0.007
aky+0.007
ind+0.007
P+0.007
Neg logit contributions
der-0.011
rier-0.011
 maybe-0.010
 Which-0.010
lights-0.010
Token The
Token ID383
Feature activation-1.59e+00

Pos logit contributions
umm+0.034
rier+0.033
lights+0.033
der+0.033
bread+0.032
Neg logit contributions
 named-0.041
 walking-0.035
o-0.034
 playing-0.033
e-0.033
Token puppy
Token ID26188
Feature activation+7.85e-01

Pos logit contributions
der+0.035
rier+0.034
 horizon+0.033
 Which+0.032
umm+0.032
Neg logit contributions
 named-0.029
e-0.028
o-0.027
a-0.026
ek-0.026
Token wh
Token ID348
Feature activation+9.05e+00

Pos logit contributions
 named+0.015
ind+0.014
e+0.013
o+0.013
aky+0.013
Neg logit contributions
der-0.017
rier-0.016
 maybe-0.015
 Which-0.015
 horizon-0.015
Tokenined
Token ID1389
Feature activation+7.33e-01

Pos logit contributions
 named+0.162
 walking+0.149
 playing+0.149
 hiding+0.148
ind+0.142
Neg logit contributions
der-0.177
lights-0.176
fulness-0.173
antly-0.155
 Moral-0.155
Token loudly
Token ID23112
Feature activation+8.10e-01

Pos logit contributions
 named+0.014
e+0.013
o+0.012
ind+0.012
 walking+0.012
Neg logit contributions
der-0.016
rier-0.016
umm-0.014
lights-0.014
fulness-0.014
Token in
Token ID287
Feature activation+1.63e+00

Pos logit contributions
 named+0.014
ind+0.012
e+0.012
o+0.012
aky+0.012
Neg logit contributions
der-0.019
rier-0.018
fulness-0.017
umm-0.017
bread-0.017
Token response
Token ID2882
Feature activation-1.91e-01

Pos logit contributions
 named+0.036
o+0.033
ind+0.032
e+0.032
a+0.031
Neg logit contributions
der-0.030
rier-0.029
fulness-0.027
 horizon-0.027
umm-0.027
Token.
Token ID13
Feature activation-3.48e+00

Pos logit contributions
der+0.005
rier+0.005
fulness+0.004
umm+0.004
bread+0.004
Neg logit contributions
 named-0.003
ind-0.003
e-0.003
o-0.003
aky-0.003
Token "
Token ID366
Feature activation-5.02e-01

Pos logit contributions
umm+0.008
lights+0.008
 conclusions+0.007
rier+0.007
 perfection+0.007
Neg logit contributions
 named-0.010
 walking-0.009
 playing-0.008
 sitting-0.008
o-0.008
Tokenof
Token ID1659
Feature activation-2.73e-01

Pos logit contributions
der+0.011
rier+0.011
lights+0.011
 horizon+0.010
 prayers+0.010
Neg logit contributions
 named-0.009
P-0.009
o-0.009
a-0.009
e-0.008
Token course
Token ID1781
Feature activation-1.68e+00

Pos logit contributions
fulness+0.006
der+0.006
rier+0.006
 maybe+0.005
 less+0.005
Neg logit contributions
ind-0.005
 named-0.004
asc-0.004
 walking-0.004
ippy-0.004
Token!"
Token ID2474
Feature activation-2.00e+00

Pos logit contributions
der+0.035
rier+0.034
umm+0.032
lights+0.032
 horizon+0.032
Neg logit contributions
 named-0.035
e-0.031
 walking-0.030
o-0.030
 playing-0.030
Token He
Token ID679
Feature activation+3.72e-01

Pos logit contributions
rier+0.038
umm+0.035
der+0.034
 horizon+0.034
lights+0.034
Neg logit contributions
 named-0.050
o-0.043
e-0.042
 walking-0.041
 playing-0.040
Token wh
Token ID348
Feature activation+9.02e+00

Pos logit contributions
 named+0.007
e+0.006
o+0.006
ind+0.005
 walking+0.005
Neg logit contributions
der-0.009
rier-0.009
umm-0.008
 horizon-0.008
fulness-0.008
Tokenir
Token ID343
Feature activation+7.52e-01

Pos logit contributions
 named+0.159
ind+0.151
 walking+0.150
 hiding+0.150
 playing+0.146
Neg logit contributions
lights-0.169
fulness-0.169
der-0.169
 Moral-0.150
oral-0.145
Tokenred
Token ID445
Feature activation+1.83e+00

Pos logit contributions
 named+0.016
ind+0.014
e+0.014
 walking+0.014
o+0.014
Neg logit contributions
der-0.015
rier-0.014
fulness-0.013
umm-0.013
bread-0.013
Token around
Token ID1088
Feature activation+4.10e-01

Pos logit contributions
 named+0.034
e+0.030
ind+0.029
o+0.029
aky+0.028
Neg logit contributions
der-0.041
rier-0.040
umm-0.038
bread-0.037
fulness-0.037
Token the
Token ID262
Feature activation+9.51e-01

Pos logit contributions
 named+0.008
e+0.007
ind+0.007
o+0.007
aky+0.007
Neg logit contributions
der-0.009
rier-0.009
umm-0.008
 horizon-0.008
fulness-0.008
Token machine
Token ID4572
Feature activation+4.09e-01

Pos logit contributions
 named+0.021
o+0.020
ow+0.019
adow+0.019
ek+0.019
Neg logit contributions
 horizon-0.017
 Which-0.016
der-0.016
rier-0.016
 instructions-0.015
Token\n
Token ID198
Feature activation+1.90e+00

Pos logit contributions
 named+0.049
e+0.044
ind+0.044
 walking+0.043
o+0.043
Neg logit contributions
der-0.039
rier-0.037
 Which-0.036
fulness-0.036
 maybe-0.035
TokenSuddenly
Token ID38582
Feature activation+2.07e-01

Pos logit contributions
 named+0.046
 walking+0.043
ind+0.042
 playing+0.042
o+0.041
Neg logit contributions
der-0.027
 Which-0.026
 Moral-0.025
fulness-0.025
 Anything-0.025
Token,
Token ID11
Feature activation-9.54e-01

Pos logit contributions
 named+0.003
ind+0.003
aky+0.002
o+0.002
e+0.002
Neg logit contributions
der-0.005
rier-0.005
 maybe-0.005
bread-0.005
fulness-0.005
Token Brown
Token ID4373
Feature activation+3.68e+00

Pos logit contributions
der+0.020
rier+0.019
umm+0.018
 horizon+0.018
lights+0.018
Neg logit contributions
 named-0.019
e-0.017
o-0.017
ind-0.017
 walking-0.016
Tokenie
Token ID494
Feature activation+2.24e+00

Pos logit contributions
 named+0.054
asc+0.048
ind+0.046
e+0.045
oo+0.044
Neg logit contributions
der-0.088
antly-0.081
bread-0.081
 horizon-0.081
rays-0.080
Token wh
Token ID348
Feature activation+8.99e+00

Pos logit contributions
 named+0.041
ind+0.038
e+0.037
oo+0.037
ippy+0.036
Neg logit contributions
der-0.048
 horizon-0.043
bread-0.042
 becomes-0.042
rier-0.042
Tokeninn
Token ID3732
Feature activation+4.21e+00

Pos logit contributions
 named+0.157
 walking+0.137
 hiding+0.136
 abandoned+0.133
ind+0.132
Neg logit contributions
der-0.182
lights-0.176
fulness-0.164
 Moral-0.162
 horizon-0.159
Tokenied
Token ID798
Feature activation+9.36e-01

Pos logit contributions
 named+0.086
ind+0.075
e+0.073
 walking+0.069
P+0.069
Neg logit contributions
der-0.090
fulness-0.084
rier-0.081
lights-0.079
imet-0.076
Token.
Token ID13
Feature activation-4.16e+00

Pos logit contributions
 named+0.018
aky+0.016
o+0.016
ind+0.016
e+0.016
Neg logit contributions
der-0.021
rier-0.020
 horizon-0.018
 less-0.018
fulness-0.018
Token
Token ID6184
Feature activation+8.64e-01

Pos logit contributions
rier+0.063
der+0.062
lights+0.060
fulness+0.058
umm+0.058
Neg logit contributions
 named-0.108
 walking-0.096
o-0.095
 playing-0.093
e-0.092
Token
Token ID95
Feature activation+1.03e+00

Pos logit contributions
 named+0.020
o+0.018
aky+0.018
 walking+0.018
ind+0.018
Neg logit contributions
der-0.016
rier-0.014
umm-0.014
bread-0.014
imet-0.014
Token car
Token ID1097
Feature activation-2.97e+00

Pos logit contributions
der+0.063
rier+0.061
umm+0.058
bread+0.057
fulness+0.057
Neg logit contributions
 named-0.064
e-0.056
o-0.055
ind-0.055
 walking-0.055
Token is
Token ID318
Feature activation+1.04e+00

Pos logit contributions
der+0.060
rier+0.058
umm+0.056
 horizon+0.054
bread+0.054
Neg logit contributions
 named-0.062
e-0.055
o-0.055
ind-0.053
 walking-0.053
Token stupid
Token ID8531
Feature activation-1.14e+00

Pos logit contributions
 named+0.019
e+0.018
o+0.017
ind+0.017
ippy+0.017
Neg logit contributions
rier-0.022
der-0.022
 less-0.021
 brighter-0.020
 maybe-0.020
Token!"
Token ID2474
Feature activation-3.11e-01

Pos logit contributions
der+0.026
rier+0.025
umm+0.024
lights+0.024
fulness+0.023
Neg logit contributions
 named-0.021
e-0.018
ind-0.018
o-0.018
 walking-0.017
Token Ben
Token ID3932
Feature activation+4.03e+00

Pos logit contributions
der+0.007
rier+0.006
umm+0.006
 horizon+0.006
bread+0.006
Neg logit contributions
 named-0.006
e-0.005
o-0.005
 walking-0.005
ind-0.005
Token wh
Token ID348
Feature activation+8.95e+00

Pos logit contributions
 named+0.079
ind+0.070
e+0.070
o+0.068
 walking+0.067
Neg logit contributions
der-0.087
rier-0.083
bread-0.077
umm-0.077
 Which-0.077
Tokenined
Token ID1389
Feature activation+1.13e+00

Pos logit contributions
 named+0.153
 walking+0.140
ind+0.138
 hiding+0.134
 abandoned+0.131
Neg logit contributions
der-0.187
fulness-0.177
 Moral-0.176
lights-0.172
rier-0.165
Token.
Token ID13
Feature activation-1.96e+00

Pos logit contributions
 named+0.020
e+0.017
o+0.017
ind+0.017
 walking+0.017
Neg logit contributions
der-0.026
rier-0.026
umm-0.025
bread-0.025
 horizon-0.024
Token\n
Token ID198
Feature activation+2.46e+00

Pos logit contributions
umm+0.044
bread+0.042
rier+0.042
 perfection+0.041
fulness+0.040
Neg logit contributions
 named-0.041
o-0.034
 walking-0.032
 playing-0.032
 called-0.031
Token\n
Token ID198
Feature activation+2.49e+00

Pos logit contributions
 named+0.059
o+0.052
e+0.052
ind+0.051
 walking+0.051
Neg logit contributions
der-0.044
rier-0.043
umm-0.041
bread-0.039
 horizon-0.039
Token"
Token ID1
Feature activation+5.17e-01

Pos logit contributions
 named+0.056
 walking+0.053
 playing+0.051
e+0.051
ow+0.050
Neg logit contributions
der-0.041
 Which-0.039
rier-0.039
 less-0.038
 Moral-0.037
Token key
Token ID1994
Feature activation-2.67e-01

Pos logit contributions
der+0.085
rier+0.080
umm+0.078
lights+0.077
fulness+0.076
Neg logit contributions
 named-0.083
e-0.073
 walking-0.072
ind-0.071
o-0.071
Token in
Token ID287
Feature activation+1.81e-01

Pos logit contributions
der+0.006
rier+0.006
 horizon+0.006
lights+0.006
fulness+0.006
Neg logit contributions
 named-0.005
e-0.004
ind-0.004
o-0.004
aky-0.004
Token the
Token ID262
Feature activation-1.04e+00

Pos logit contributions
 named+0.004
e+0.003
o+0.003
ind+0.003
aky+0.003
Neg logit contributions
der-0.004
rier-0.004
umm-0.003
 horizon-0.003
fulness-0.003
Token lock
Token ID5793
Feature activation-4.30e-01

Pos logit contributions
der+0.024
rier+0.023
umm+0.022
fulness+0.022
bread+0.022
Neg logit contributions
 named-0.019
e-0.016
ind-0.016
o-0.016
 walking-0.016
Token and
Token ID290
Feature activation-2.28e+00

Pos logit contributions
der+0.010
rier+0.010
umm+0.009
 horizon+0.009
lights+0.009
Neg logit contributions
 named-0.007
e-0.006
ind-0.006
o-0.006
 walking-0.006
Token wh
Token ID348
Feature activation+8.95e+00

Pos logit contributions
der+0.056
rier+0.055
umm+0.053
bread+0.052
fulness+0.052
Neg logit contributions
 named-0.040
 walking-0.034
e-0.033
o-0.032
 playing-0.032
Tokenir
Token ID343
Feature activation+5.58e-01

Pos logit contributions
 named+0.164
 hiding+0.150
 walking+0.148
 playing+0.147
 abandoned+0.141
Neg logit contributions
lights-0.170
der-0.170
fulness-0.169
oral-0.153
rier-0.151
Tokenrr
Token ID21062
Feature activation+1.11e+00

Pos logit contributions
 named+0.012
e+0.010
o+0.010
ind+0.010
 walking+0.010
Neg logit contributions
der-0.012
rier-0.011
umm-0.011
fulness-0.010
bread-0.010
Tokenrr
Token ID21062
Feature activation+1.10e+00

Pos logit contributions
 named+0.020
o+0.016
 walking+0.016
ind+0.016
e+0.015
Neg logit contributions
der-0.026
rier-0.026
fulness-0.025
bread-0.024
lights-0.024
Token...
Token ID986
Feature activation-2.82e+00

Pos logit contributions
 named+0.020
 walking+0.016
o+0.016
ind+0.016
e+0.015
Neg logit contributions
der-0.026
rier-0.026
fulness-0.025
lights-0.024
bread-0.024
Tokenit
Token ID270
Feature activation-6.33e-01

Pos logit contributions
der+0.054
lights+0.051
rier+0.050
umm+0.049
 horizon+0.048
Neg logit contributions
 named-0.063
o-0.060
e-0.059
a-0.057
 walking-0.056
Token.
Token ID13
Feature activation-2.88e+00

Pos logit contributions
der+0.014
rier+0.013
fulness+0.013
lights+0.013
bread+0.013
Neg logit contributions
 named-0.012
ind-0.011
o-0.010
aky-0.010
e-0.010
Token Max
Token ID5436
Feature activation+1.02e+00

Pos logit contributions
rier+0.087
umm+0.083
der+0.081
bread+0.081
 perfection+0.078
Neg logit contributions
 named-0.045
o-0.036
 playing-0.034
 walking-0.032
 called-0.030
Token bark
Token ID21405
Feature activation+1.88e+00

Pos logit contributions
 named+0.021
ind+0.019
aky+0.019
e+0.019
o+0.018
Neg logit contributions
der-0.020
rier-0.018
 maybe-0.018
bread-0.018
 horizon-0.018
Tokened
Token ID276
Feature activation+6.61e-01

Pos logit contributions
 named+0.043
e+0.038
ind+0.038
o+0.038
 walking+0.037
Neg logit contributions
der-0.035
rier-0.033
umm-0.031
bread-0.030
fulness-0.030
Token and
Token ID290
Feature activation-9.72e-01

Pos logit contributions
 named+0.013
e+0.011
o+0.011
ind+0.011
aky+0.011
Neg logit contributions
der-0.014
rier-0.014
umm-0.013
lights-0.013
 horizon-0.013
Token wh
Token ID348
Feature activation+8.92e+00

Pos logit contributions
der+0.025
rier+0.024
umm+0.023
 horizon+0.023
fulness+0.023
Neg logit contributions
 named-0.016
e-0.013
o-0.013
 walking-0.013
 playing-0.012
Tokenined
Token ID1389
Feature activation+1.72e-01

Pos logit contributions
 named+0.162
 walking+0.150
 hiding+0.150
 playing+0.145
 abandoned+0.144
Neg logit contributions
lights-0.173
fulness-0.165
der-0.164
 Moral-0.144
oral-0.144
Token,
Token ID11
Feature activation-2.22e+00

Pos logit contributions
 named+0.003
e+0.002
o+0.002
ind+0.002
 walking+0.002
Neg logit contributions
der-0.004
rier-0.004
umm-0.004
 horizon-0.004
bread-0.004
Token trying
Token ID2111
Feature activation-5.35e-01

Pos logit contributions
umm+0.052
rier+0.049
lights+0.048
 Which+0.047
fulness+0.047
Neg logit contributions
 named-0.044
 playing-0.038
 walking-0.038
 sitting-0.034
 looking-0.034
Token to
Token ID284
Feature activation-4.85e-01

Pos logit contributions
umm+0.012
rier+0.012
der+0.012
 horizon+0.011
lights+0.011
Neg logit contributions
 named-0.010
 walking-0.009
 playing-0.009
e-0.009
o-0.008
Token speak
Token ID2740
Feature activation-1.91e+00

Pos logit contributions
der+0.010
rier+0.009
umm+0.009
lights+0.009
 Which+0.009
Neg logit contributions
 named-0.011
e-0.010
ind-0.009
 walking-0.009
o-0.009
Token that
Token ID326
Feature activation-2.70e+00

Pos logit contributions
der+0.032
rier+0.032
umm+0.031
 Which+0.030
 horizon+0.030
Neg logit contributions
 named-0.033
e-0.029
 walking-0.029
 playing-0.029
ind-0.028
Token.
Token ID13
Feature activation-3.83e+00

Pos logit contributions
umm+0.068
der+0.068
rier+0.067
 Which+0.066
 horizon+0.065
Neg logit contributions
 named-0.041
 playing-0.035
 walking-0.034
e-0.033
o-0.031
Token They
Token ID1119
Feature activation-6.19e-02

Pos logit contributions
fulness+0.070
rier+0.069
bread+0.068
der+0.063
rays+0.062
Neg logit contributions
 named-0.081
 playing-0.075
o-0.072
 walking-0.069
e-0.067
Token cried
Token ID16896
Feature activation-2.41e-01

Pos logit contributions
der+0.001
 maybe+0.001
 less+0.001
 becomes+0.001
rier+0.001
Neg logit contributions
ind-0.001
 named-0.001
ippy-0.001
asc-0.001
oo-0.001
Token and
Token ID290
Feature activation-3.00e-01

Pos logit contributions
der+0.006
rier+0.006
umm+0.005
 horizon+0.005
 Which+0.005
Neg logit contributions
 named-0.004
e-0.004
o-0.003
ind-0.003
 walking-0.003
Token wh
Token ID348
Feature activation+8.87e+00

Pos logit contributions
der+0.007
 maybe+0.006
 less+0.006
rier+0.006
lights+0.006
Neg logit contributions
ind-0.006
ippy-0.005
aking-0.005
 named-0.005
o-0.005
Tokenined
Token ID1389
Feature activation-3.79e-01

Pos logit contributions
 named+0.154
 hiding+0.142
 walking+0.141
ind+0.132
 playing+0.132
Neg logit contributions
lights-0.180
der-0.176
fulness-0.174
 Moral-0.158
oral-0.154
Token.
Token ID13
Feature activation-3.25e+00

Pos logit contributions
der+0.010
umm+0.009
rier+0.009
 horizon+0.009
 Which+0.009
Neg logit contributions
 named-0.006
e-0.005
 walking-0.005
ind-0.005
 playing-0.005
Token They
Token ID1119
Feature activation-1.37e-01

Pos logit contributions
rier+0.061
bread+0.058
fulness+0.057
umm+0.054
der+0.053
Neg logit contributions
 named-0.073
 playing-0.066
o-0.062
 walking-0.062
e-0.061
Token did
Token ID750
Feature activation+2.44e+00

Pos logit contributions
der+0.003
 maybe+0.003
 less+0.002
 becomes+0.002
 Which+0.002
Neg logit contributions
ind-0.003
ippy-0.003
aking-0.003
asc-0.003
oo-0.003
Token not
Token ID407
Feature activation+2.97e+00

Pos logit contributions
 named+0.038
 playing+0.032
e+0.032
 walking+0.032
o+0.030
Neg logit contributions
umm-0.065
rier-0.063
der-0.062
 horizon-0.060
bread-0.060
Token
Token ID220
Feature activation-6.28e-01

Pos logit contributions
fulness+0.064
rier+0.063
der+0.063
bread+0.062
lights+0.061
Neg logit contributions
 named-0.054
 playing-0.046
o-0.046
 walking-0.045
oo-0.040
Token\n
Token ID198
Feature activation+1.76e+00

Pos logit contributions
rier+0.015
umm+0.015
der+0.015
bread+0.015
 horizon+0.014
Neg logit contributions
 named-0.011
o-0.011
a-0.009
oo-0.009
 playing-0.009
Token\n
Token ID198
Feature activation+2.00e+00

Pos logit contributions
 named+0.041
o+0.037
e+0.036
ind+0.036
 walking+0.036
Neg logit contributions
der-0.032
rier-0.031
umm-0.029
bread-0.028
 horizon-0.028
TokenS
Token ID50
Feature activation+2.59e+00

Pos logit contributions
 named+0.036
e+0.032
ind+0.031
o+0.031
 walking+0.031
Neg logit contributions
der-0.045
rier-0.043
 Which-0.042
umm-0.041
 horizon-0.041
Tokenally
Token ID453
Feature activation+3.00e+00

Pos logit contributions
 named+0.057
e+0.054
ind+0.050
 walking+0.050
aky+0.050
Neg logit contributions
der-0.048
umm-0.047
 horizon-0.045
 Which-0.044
rier-0.044
Token wh
Token ID348
Feature activation+8.86e+00

Pos logit contributions
asc+0.063
ind+0.063
e+0.060
ippy+0.059
 named+0.058
Neg logit contributions
der-0.053
 maybe-0.045
 becomes-0.045
antly-0.044
bread-0.044
Tokenined
Token ID1389
Feature activation-1.85e-01

Pos logit contributions
 named+0.162
 hiding+0.152
ind+0.148
 walking+0.147
 playing+0.141
Neg logit contributions
der-0.164
lights-0.163
fulness-0.159
 Moral-0.149
oral-0.144
Token,
Token ID11
Feature activation-2.42e+00

Pos logit contributions
der+0.004
rier+0.004
umm+0.004
lights+0.004
 horizon+0.004
Neg logit contributions
 named-0.003
e-0.003
o-0.003
ind-0.003
 walking-0.003
Token "
Token ID366
Feature activation+2.28e-01

Pos logit contributions
rier+0.048
umm+0.047
der+0.047
lights+0.045
bread+0.045
Neg logit contributions
 named-0.053
 walking-0.046
 playing-0.045
o-0.044
e-0.043
TokenBut
Token ID1537
Feature activation-2.19e+00

Pos logit contributions
 named+0.005
e+0.004
o+0.004
a+0.004
ind+0.004
Neg logit contributions
der-0.005
rier-0.005
lights-0.004
bread-0.004
fulness-0.004
Token Mom
Token ID11254
Feature activation+4.42e-01

Pos logit contributions
der+0.039
rier+0.039
bread+0.036
umm+0.036
lights+0.036
Neg logit contributions
 named-0.051
e-0.045
 playing-0.045
 walking-0.045
o-0.045
Token please
Token ID3387
Feature activation+1.03e+00

Pos logit contributions
der+0.033
rier+0.032
umm+0.030
lights+0.030
 horizon+0.030
Neg logit contributions
 named-0.037
e-0.032
o-0.032
 walking-0.032
a-0.031
Token,
Token ID11
Feature activation-1.75e+00

Pos logit contributions
 named+0.018
ind+0.016
e+0.015
o+0.015
aky+0.015
Neg logit contributions
der-0.024
rier-0.023
 Which-0.022
umm-0.022
bread-0.022
Token please
Token ID3387
Feature activation+1.21e+00

Pos logit contributions
der+0.038
rier+0.036
lights+0.036
umm+0.036
 horizon+0.035
Neg logit contributions
 named-0.037
e-0.031
o-0.030
 walking-0.030
 playing-0.030
Token."
Token ID526
Feature activation-6.48e-01

Pos logit contributions
 named+0.023
e+0.020
ind+0.020
o+0.020
 walking+0.020
Neg logit contributions
der-0.026
rier-0.026
umm-0.024
bread-0.024
 Which-0.024
Token She
Token ID1375
Feature activation+2.28e+00

Pos logit contributions
der+0.012
rier+0.012
umm+0.012
bread+0.011
lights+0.011
Neg logit contributions
 named-0.016
o-0.014
e-0.013
 walking-0.013
 playing-0.013
Token wh
Token ID348
Feature activation+8.83e+00

Pos logit contributions
 named+0.053
e+0.048
ind+0.048
o+0.048
aky+0.047
Neg logit contributions
der-0.041
rier-0.038
 Which-0.035
 maybe-0.035
bread-0.035
Tokenined
Token ID1389
Feature activation+5.54e-01

Pos logit contributions
 named+0.175
ind+0.158
 abandoned+0.157
 walking+0.155
 hiding+0.146
Neg logit contributions
der-0.158
fulness-0.152
lights-0.151
 Moral-0.151
rier-0.143
Token and
Token ID290
Feature activation+2.54e-01

Pos logit contributions
 named+0.010
e+0.008
 walking+0.008
o+0.008
ind+0.008
Neg logit contributions
der-0.013
rier-0.013
umm-0.012
 horizon-0.012
 Which-0.012
Token cried
Token ID16896
Feature activation+1.50e+00

Pos logit contributions
 named+0.004
e+0.004
o+0.004
ind+0.004
 walking+0.004
Neg logit contributions
der-0.006
rier-0.006
umm-0.006
 horizon-0.005
bread-0.005
Token and
Token ID290
Feature activation+6.30e-01

Pos logit contributions
 named+0.023
e+0.019
o+0.019
 walking+0.019
ind+0.018
Neg logit contributions
der-0.038
rier-0.037
umm-0.037
 horizon-0.036
bread-0.036
Token stamped
Token ID28305
Feature activation+2.67e+00

Pos logit contributions
 named+0.011
ind+0.010
e+0.010
o+0.010
ippy+0.009
Neg logit contributions
der-0.015
rier-0.014
 maybe-0.013
 less-0.013
fulness-0.013
Token said
Token ID531
Feature activation-2.78e-01

Pos logit contributions
ind+0.013
 named+0.012
aky+0.012
e+0.012
P+0.012
Neg logit contributions
der-0.014
rier-0.012
fulness-0.012
 maybe-0.012
lights-0.012
Token they
Token ID484
Feature activation-1.43e+00

Pos logit contributions
der+0.008
rier+0.008
umm+0.008
 horizon+0.007
lights+0.007
Neg logit contributions
 named-0.004
e-0.003
 walking-0.003
o-0.003
 playing-0.003
Token could
Token ID714
Feature activation+8.42e-01

Pos logit contributions
der+0.028
rier+0.026
 Which+0.025
fulness+0.025
 maybe+0.025
Neg logit contributions
 named-0.030
e-0.027
ind-0.027
o-0.026
 walking-0.026
Token exchange
Token ID5163
Feature activation-2.55e+00

Pos logit contributions
 named+0.018
e+0.016
o+0.016
ind+0.016
 walking+0.016
Neg logit contributions
der-0.017
rier-0.016
umm-0.015
lights-0.015
bread-0.015
Token the
Token ID262
Feature activation-5.61e+00

Pos logit contributions
umm+0.064
der+0.063
 horizon+0.061
lights+0.059
rier+0.059
Neg logit contributions
 named-0.046
e-0.038
a-0.038
 walking-0.037
 playing-0.037
Token fake
Token ID8390
Feature activation-7.33e+00

Pos logit contributions
der+0.132
umm+0.125
rier+0.124
lights+0.124
fulness+0.121
Neg logit contributions
 named-0.099
 walking-0.087
e-0.086
ind-0.085
 playing-0.083
Token horse
Token ID8223
Feature activation-3.29e+00

Pos logit contributions
umm+0.190
lights+0.186
der+0.178
cakes+0.178
fulness+0.177
Neg logit contributions
 named-0.109
ind-0.098
 walking-0.097
 playing-0.092
e-0.090
Token for
Token ID329
Feature activation-4.22e-01

Pos logit contributions
der+0.076
rier+0.074
umm+0.070
bread+0.070
 horizon+0.069
Neg logit contributions
 named-0.058
e-0.050
o-0.050
ind-0.049
 walking-0.049
Token a
Token ID257
Feature activation-1.56e+00

Pos logit contributions
der+0.009
rier+0.008
umm+0.008
fulness+0.008
bread+0.008
Neg logit contributions
 named-0.009
o-0.008
e-0.008
ind-0.007
aky-0.007
Token real
Token ID1103
Feature activation-3.09e+00

Pos logit contributions
der+0.031
rier+0.030
umm+0.029
bread+0.028
 horizon+0.028
Neg logit contributions
 named-0.033
e-0.029
o-0.028
ind-0.028
 walking-0.028
Token one
Token ID530
Feature activation-2.52e+00

Pos logit contributions
der+0.063
rier+0.061
umm+0.058
bread+0.057
 horizon+0.057
Neg logit contributions
 named-0.063
e-0.056
o-0.056
ind-0.055
 walking-0.054
Token You
Token ID921
Feature activation-7.95e-01

Pos logit contributions
rier+0.076
lights+0.074
der+0.070
 horizon+0.068
bread+0.068
Neg logit contributions
 named-0.095
e-0.086
o-0.078
ow-0.074
oo-0.074
Token can
Token ID460
Feature activation+7.61e-01

Pos logit contributions
der+0.015
rier+0.015
umm+0.015
lights+0.014
 horizon+0.014
Neg logit contributions
 named-0.018
e-0.016
o-0.015
a-0.015
aky-0.015
Token pretend
Token ID16614
Feature activation-3.22e+00

Pos logit contributions
 named+0.017
e+0.015
o+0.015
ind+0.015
 walking+0.014
Neg logit contributions
der-0.015
rier-0.014
umm-0.014
lights-0.013
bread-0.013
Token that
Token ID326
Feature activation-5.08e+00

Pos logit contributions
umm+0.078
rier+0.077
der+0.077
bread+0.075
 horizon+0.072
Neg logit contributions
 named-0.055
 playing-0.053
 walking-0.049
e-0.047
 hiding-0.046
Token the
Token ID262
Feature activation-5.04e+00

Pos logit contributions
rier+0.124
umm+0.124
der+0.123
bread+0.119
lights+0.115
Neg logit contributions
 named-0.088
 playing-0.083
e-0.079
 walking-0.074
 hiding-0.071
Token dull
Token ID19222
Feature activation-7.31e+00

Pos logit contributions
der+0.119
rier+0.116
bread+0.110
fulness+0.109
lights+0.108
Neg logit contributions
 named-0.090
 walking-0.077
e-0.076
 playing-0.076
o-0.075
Token chalk
Token ID30860
Feature activation-5.36e+00

Pos logit contributions
rier+0.178
der+0.174
bread+0.168
umm+0.164
fulness+0.158
Neg logit contributions
 named-0.129
 playing-0.110
o-0.107
e-0.106
 walking-0.104
Token is
Token ID318
Feature activation-1.43e+00

Pos logit contributions
rier+0.122
der+0.117
umm+0.116
 perfection+0.113
 horizon+0.112
Neg logit contributions
 named-0.101
e-0.091
o-0.090
 playing-0.087
a-0.085
Token any
Token ID597
Feature activation-4.03e+00

Pos logit contributions
der+0.034
rier+0.033
umm+0.033
bread+0.032
 Which+0.031
Neg logit contributions
 named-0.025
 walking-0.021
 playing-0.021
e-0.021
o-0.021
Token color
Token ID3124
Feature activation-4.37e+00

Pos logit contributions
rier+0.099
der+0.099
bread+0.095
umm+0.094
fulness+0.093
Neg logit contributions
 named-0.069
 playing-0.060
 walking-0.057
 hiding-0.056
e-0.053
Token you
Token ID345
Feature activation-1.98e+00

Pos logit contributions
der+0.116
umm+0.115
rier+0.115
 Which+0.113
 Moral+0.110
Neg logit contributions
 named-0.058
e-0.053
 playing-0.052
o-0.050
a-0.048
Token We
Token ID775
Feature activation-3.21e-01

Pos logit contributions
rier+0.071
lights+0.071
 horizon+0.070
der+0.068
fulness+0.067
Neg logit contributions
 named-0.101
e-0.086
o-0.081
 playing-0.075
 called-0.075
Token can
Token ID460
Feature activation+3.97e-01

Pos logit contributions
der+0.005
rier+0.005
bread+0.005
 Which+0.005
umm+0.005
Neg logit contributions
 named-0.008
ind-0.007
 walking-0.007
o-0.007
e-0.007
Token't
Token ID470
Feature activation+9.55e-01

Pos logit contributions
 named+0.009
e+0.008
ind+0.008
o+0.008
 walking+0.008
Neg logit contributions
der-0.007
rier-0.007
umm-0.007
lights-0.007
fulness-0.007
Token use
Token ID779
Feature activation-1.96e+00

Pos logit contributions
 named+0.021
e+0.019
o+0.018
ind+0.018
 walking+0.018
Neg logit contributions
der-0.019
rier-0.018
umm-0.017
lights-0.017
 Which-0.017
Token the
Token ID262
Feature activation-4.84e+00

Pos logit contributions
umm+0.043
der+0.041
rier+0.041
 horizon+0.039
bread+0.039
Neg logit contributions
 named-0.038
e-0.035
 playing-0.035
 walking-0.033
ind-0.033
Token fake
Token ID8390
Feature activation-7.29e+00

Pos logit contributions
der+0.107
lights+0.103
umm+0.103
fulness+0.101
cakes+0.096
Neg logit contributions
 named-0.079
 hiding-0.076
 playing-0.076
 walking-0.075
aky-0.073
Token food
Token ID2057
Feature activation-3.89e+00

Pos logit contributions
umm+0.154
lights+0.154
fulness+0.151
 Which+0.150
Really+0.148
Neg logit contributions
 named-0.130
 playing-0.118
e-0.117
 hiding-0.114
a-0.111
Token.
Token ID13
Feature activation-4.65e+00

Pos logit contributions
der+0.091
umm+0.091
rier+0.090
 horizon+0.088
 perfection+0.088
Neg logit contributions
 named-0.065
o-0.055
e-0.054
a-0.054
 playing-0.052
Token And
Token ID843
Feature activation-4.65e+00

Pos logit contributions
 horizon+0.086
lights+0.084
rier+0.081
fulness+0.078
der+0.077
Neg logit contributions
 named-0.112
e-0.100
o-0.092
oo-0.084
a-0.084
Token we
Token ID356
Feature activation-2.50e-01

Pos logit contributions
der+0.087
umm+0.085
 horizon+0.084
rier+0.083
lights+0.083
Neg logit contributions
 named-0.105
 playing-0.094
e-0.093
a-0.089
 walking-0.089
Token can
Token ID460
Feature activation+9.59e-01

Pos logit contributions
der+0.004
rier+0.004
umm+0.004
 Which+0.004
bread+0.004
Neg logit contributions
 named-0.006
ind-0.005
e-0.005
o-0.005
 walking-0.005
Token "
Token ID366
Feature activation-1.11e+00

Pos logit contributions
der+0.000
rier+0.000
fulness+0.000
bread+0.000
umm+0.000
Neg logit contributions
 named-0.000
e-0.000
ind-0.000
o-0.000
aky-0.000
Tokenthe
Token ID1169
Feature activation-4.15e+00

Pos logit contributions
der+0.023
rier+0.023
lights+0.021
umm+0.020
bread+0.020
Neg logit contributions
 named-0.023
o-0.022
e-0.021
a-0.021
ind-0.021
Token store
Token ID3650
Feature activation-3.62e+00

Pos logit contributions
der+0.084
rier+0.082
umm+0.079
bread+0.077
 horizon+0.077
Neg logit contributions
 named-0.085
o-0.077
e-0.076
ind-0.074
aky-0.073
Token where
Token ID810
Feature activation-3.02e+00

Pos logit contributions
der+0.072
rier+0.069
umm+0.066
bread+0.065
 horizon+0.065
Neg logit contributions
 named-0.077
o-0.067
e-0.067
 walking-0.066
ind-0.066
Token the
Token ID262
Feature activation-5.51e+00

Pos logit contributions
der+0.058
rier+0.056
umm+0.054
 horizon+0.052
bread+0.052
Neg logit contributions
 named-0.067
e-0.059
 walking-0.059
o-0.058
 playing-0.058
Token red
Token ID2266
Feature activation-7.28e+00

Pos logit contributions
der+0.120
rier+0.117
umm+0.111
lights+0.109
 horizon+0.109
Neg logit contributions
 named-0.106
e-0.092
o-0.091
ind-0.091
 walking-0.090
Token shoes
Token ID10012
Feature activation-2.33e+00

Pos logit contributions
der+0.158
rier+0.152
umm+0.147
lights+0.144
fulness+0.144
Neg logit contributions
 named-0.143
e-0.124
 walking-0.121
o-0.120
ind-0.119
Token are
Token ID389
Feature activation-1.12e-01

Pos logit contributions
der+0.051
rier+0.050
umm+0.048
bread+0.046
lights+0.046
Neg logit contributions
 named-0.046
e-0.040
 walking-0.039
o-0.039
 playing-0.038
Token".
Token ID1911
Feature activation-2.94e+00

Pos logit contributions
der+0.002
rier+0.002
umm+0.002
lights+0.002
bread+0.002
Neg logit contributions
 named-0.002
e-0.002
o-0.002
ind-0.002
 walking-0.002
Token His
Token ID2399
Feature activation-3.62e-02

Pos logit contributions
rier+0.072
der+0.067
bread+0.065
umm+0.064
lights+0.063
Neg logit contributions
 named-0.059
o-0.053
e-0.050
 playing-0.048
 walking-0.047
Token mom
Token ID1995
Feature activation+1.79e+00

Pos logit contributions
der+0.000
rier+0.000
lights+0.000
 maybe+0.000
 Which+0.000
Neg logit contributions
 named-0.001
e-0.001
ind-0.001
aky-0.001
aking-0.001
Token.
Token ID13
Feature activation-2.81e+00

Pos logit contributions
der+0.073
rier+0.073
umm+0.071
lights+0.070
bread+0.069
Neg logit contributions
 named-0.055
e-0.049
 playing-0.048
 walking-0.048
o-0.048
Token He
Token ID679
Feature activation-1.69e+00

Pos logit contributions
rier+0.070
lights+0.064
bread+0.061
der+0.058
 horizon+0.058
Neg logit contributions
 named-0.058
o-0.055
e-0.048
 playing-0.045
oo-0.044
Token realized
Token ID6939
Feature activation-2.15e+00

Pos logit contributions
rier+0.041
der+0.039
umm+0.039
lights+0.038
 Which+0.038
Neg logit contributions
 named-0.031
e-0.026
o-0.026
a-0.025
 walking-0.024
Token that
Token ID326
Feature activation-3.11e+00

Pos logit contributions
rier+0.056
umm+0.056
der+0.054
bread+0.054
 horizon+0.053
Neg logit contributions
 named-0.033
 walking-0.027
 playing-0.026
e-0.025
o-0.025
Token the
Token ID262
Feature activation-3.85e+00

Pos logit contributions
rier+0.068
der+0.065
bread+0.064
lights+0.064
umm+0.063
Neg logit contributions
 named-0.062
 playing-0.060
 walking-0.057
e-0.055
o-0.053
Token dull
Token ID19222
Feature activation-7.28e+00

Pos logit contributions
der+0.078
rier+0.074
lights+0.072
umm+0.070
fulness+0.070
Neg logit contributions
 named-0.082
e-0.072
 walking-0.071
ind-0.071
o-0.070
Token staff
Token ID3085
Feature activation-4.07e+00

Pos logit contributions
der+0.157
rier+0.151
lights+0.146
umm+0.146
fulness+0.143
Neg logit contributions
 named-0.145
e-0.123
 walking-0.122
ind-0.122
 playing-0.120
Token was
Token ID373
Feature activation+1.23e+00

Pos logit contributions
der+0.090
rier+0.089
umm+0.086
bread+0.083
 Which+0.083
Neg logit contributions
 named-0.078
e-0.069
o-0.066
 walking-0.066
a-0.064
Token not
Token ID407
Feature activation+7.91e-01

Pos logit contributions
 named+0.022
 walking+0.019
e+0.019
 playing+0.018
o+0.018
Neg logit contributions
der-0.029
umm-0.028
rier-0.028
lights-0.027
bread-0.027
Token so
Token ID523
Feature activation-2.34e-01

Pos logit contributions
 named+0.014
 walking+0.012
 playing+0.012
e+0.012
aky+0.012
Neg logit contributions
der-0.018
umm-0.017
rier-0.017
 horizon-0.017
lights-0.017
Token bad
Token ID2089
Feature activation-3.26e+00

Pos logit contributions
der+0.005
rier+0.005
umm+0.005
 horizon+0.005
lights+0.005
Neg logit contributions
 named-0.004
e-0.004
o-0.004
ind-0.004
 walking-0.003
Token scale
Token ID5046
Feature activation-2.71e+00

Pos logit contributions
der+0.057
rier+0.055
umm+0.052
lights+0.051
fulness+0.051
Neg logit contributions
 named-0.067
e-0.059
o-0.058
ind-0.058
 walking-0.058
Token.
Token ID13
Feature activation-3.95e+00

Pos logit contributions
der+0.064
rier+0.062
bread+0.059
umm+0.058
fulness+0.058
Neg logit contributions
 named-0.046
e-0.041
ind-0.040
o-0.039
aky-0.039
Token It
Token ID632
Feature activation-4.00e+00

Pos logit contributions
der+0.059
rier+0.058
fulness+0.053
lights+0.052
 horizon+0.052
Neg logit contributions
 named-0.111
o-0.100
e-0.094
 playing-0.090
 walking-0.089
Token shows
Token ID2523
Feature activation-2.75e+00

Pos logit contributions
der+0.095
rier+0.094
fulness+0.094
 horizon+0.092
umm+0.092
Neg logit contributions
 named-0.075
o-0.070
a-0.062
e-0.062
 walking-0.061
Token how
Token ID703
Feature activation-3.17e+00

Pos logit contributions
der+0.062
rier+0.062
umm+0.060
lights+0.059
 horizon+0.058
Neg logit contributions
 named-0.052
o-0.045
e-0.045
ind-0.044
a-0.043
Token heavy
Token ID4334
Feature activation-7.25e+00

Pos logit contributions
der+0.071
rier+0.067
umm+0.067
lights+0.065
 horizon+0.064
Neg logit contributions
 named-0.060
 playing-0.055
 walking-0.054
o-0.054
ind-0.053
Token things
Token ID1243
Feature activation-4.75e+00

Pos logit contributions
der+0.157
bread+0.152
umm+0.152
 Moral+0.151
rier+0.150
Neg logit contributions
 named-0.129
 playing-0.120
 called-0.106
 walking-0.106
o-0.105
Token are
Token ID389
Feature activation-1.45e+00

Pos logit contributions
umm+0.110
der+0.109
rier+0.108
lights+0.101
 horizon+0.100
Neg logit contributions
 named-0.084
 playing-0.078
e-0.075
 walking-0.075
o-0.072
Token."
Token ID526
Feature activation-1.35e+00

Pos logit contributions
der+0.035
umm+0.035
rier+0.034
lights+0.033
fulness+0.033
Neg logit contributions
 named-0.025
 walking-0.022
o-0.022
 playing-0.021
ind-0.020
Token Tim
Token ID5045
Feature activation+2.59e+00

Pos logit contributions
rier+0.031
der+0.030
umm+0.028
bread+0.028
 horizon+0.028
Neg logit contributions
 named-0.028
o-0.025
 playing-0.023
 walking-0.023
e-0.022
Tokenmy
Token ID1820
Feature activation+2.55e+00

Pos logit contributions
 named+0.045
 walking+0.040
ind+0.040
e+0.038
oo+0.037
Neg logit contributions
der-0.056
 Which-0.052
 Moral-0.051
 Maybe-0.050
 less-0.050
Token!
Token ID0
Feature activation-3.69e+00

Pos logit contributions
der+0.016
rier+0.015
umm+0.015
 horizon+0.014
lights+0.014
Neg logit contributions
 named-0.007
e-0.006
o-0.006
ind-0.006
 walking-0.006
Token I
Token ID314
Feature activation+1.77e+00

Pos logit contributions
der+0.064
rier+0.061
lights+0.060
 horizon+0.059
umm+0.057
Neg logit contributions
 named-0.092
o-0.082
 walking-0.077
e-0.077
oo-0.076
Token can
Token ID460
Feature activation+7.27e-01

Pos logit contributions
 named+0.039
ind+0.036
e+0.035
 walking+0.035
o+0.035
Neg logit contributions
der-0.032
rier-0.030
 Which-0.029
fulness-0.029
 maybe-0.029
Token replace
Token ID6330
Feature activation-3.44e+00

Pos logit contributions
 named+0.016
e+0.014
o+0.013
a+0.013
 playing+0.013
Neg logit contributions
der-0.015
rier-0.015
umm-0.014
lights-0.014
 horizon-0.014
Token the
Token ID262
Feature activation-4.92e+00

Pos logit contributions
der+0.072
rier+0.069
umm+0.068
bread+0.066
 horizon+0.066
Neg logit contributions
 named-0.070
e-0.062
ind-0.061
o-0.060
 walking-0.060
Token weak
Token ID4939
Feature activation-7.05e+00

Pos logit contributions
der+0.116
rier+0.113
umm+0.108
 horizon+0.107
 Which+0.107
Neg logit contributions
 named-0.085
e-0.074
o-0.073
ind-0.072
 walking-0.070
Token fabric
Token ID9664
Feature activation-3.73e+00

Pos logit contributions
der+0.167
rier+0.161
umm+0.157
bread+0.154
fulness+0.152
Neg logit contributions
 named-0.123
e-0.103
a-0.102
ind-0.102
 walking-0.099
Token with
Token ID351
Feature activation-2.86e+00

Pos logit contributions
der+0.095
umm+0.092
rier+0.091
 Which+0.089
 horizon+0.089
Neg logit contributions
 named-0.057
e-0.047
a-0.046
o-0.045
 walking-0.044
Token a
Token ID257
Feature activation-3.93e+00

Pos logit contributions
der+0.065
rier+0.064
umm+0.062
bread+0.060
lights+0.060
Neg logit contributions
 named-0.053
e-0.046
ind-0.044
 walking-0.044
o-0.044
Token new
Token ID649
Feature activation-4.43e+00

Pos logit contributions
der+0.090
rier+0.086
umm+0.084
lights+0.082
fulness+0.082
Neg logit contributions
 named-0.074
e-0.062
a-0.061
ind-0.061
 walking-0.061
Token one
Token ID530
Feature activation-3.54e+00

Pos logit contributions
der+0.103
rier+0.100
umm+0.098
bread+0.095
lights+0.094
Neg logit contributions
 named-0.081
e-0.068
a-0.067
 walking-0.066
ind-0.066
Token
Token ID198
Feature activation+2.07e+00

Pos logit contributions
 named+0.051
e+0.046
o+0.046
ind+0.045
 walking+0.045
Neg logit contributions
der-0.036
rier-0.035
umm-0.033
bread-0.032
 horizon-0.031
Token"
Token ID1
Feature activation-4.60e-01

Pos logit contributions
 named+0.037
ind+0.034
e+0.034
 walking+0.034
 playing+0.032
Neg logit contributions
der-0.045
 Which-0.044
rier-0.042
bread-0.042
 less-0.042
TokenWho
Token ID8241
Feature activation-1.52e+00

Pos logit contributions
der+0.010
rier+0.010
lights+0.009
umm+0.009
fulness+0.009
Neg logit contributions
 named-0.010
e-0.009
o-0.008
a-0.008
P-0.008
Token has
Token ID468
Feature activation+1.79e+00

Pos logit contributions
der+0.025
 maybe+0.023
 Which+0.022
bread+0.022
fulness+0.022
Neg logit contributions
 named-0.034
ind-0.033
 walking-0.031
aky-0.031
o-0.031
Token the
Token ID262
Feature activation-5.46e+00

Pos logit contributions
 named+0.040
ind+0.037
aky+0.036
o+0.036
e+0.036
Neg logit contributions
der-0.031
rier-0.029
 Which-0.028
fulness-0.028
bread-0.028
Token bigger
Token ID5749
Feature activation-7.04e+00

Pos logit contributions
der+0.107
rier+0.104
umm+0.098
bread+0.096
 horizon+0.096
Neg logit contributions
 named-0.117
e-0.103
o-0.102
ind-0.102
 walking-0.101
Token animal
Token ID5044
Feature activation-3.99e+00

Pos logit contributions
der+0.158
umm+0.151
rier+0.147
fulness+0.145
lights+0.145
Neg logit contributions
 named-0.147
e-0.128
a-0.121
o-0.115
 playing-0.113
Token?"
Token ID1701
Feature activation-6.24e-01

Pos logit contributions
der+0.099
rier+0.097
umm+0.092
bread+0.091
 horizon+0.091
Neg logit contributions
 named-0.064
e-0.054
ind-0.053
o-0.053
 walking-0.052
Token Tom
Token ID4186
Feature activation+4.34e+00

Pos logit contributions
der+0.017
rier+0.017
umm+0.017
fulness+0.016
 horizon+0.016
Neg logit contributions
 named-0.009
 walking-0.007
o-0.007
 playing-0.007
e-0.007
Token asks
Token ID7893
Feature activation+9.47e-01

Pos logit contributions
 named+0.109
e+0.098
ind+0.098
o+0.097
aky+0.095
Neg logit contributions
der-0.069
rier-0.064
 Which-0.060
 horizon-0.059
umm-0.059
Token.
Token ID13
Feature activation-1.44e+00

Pos logit contributions
 named+0.018
ind+0.016
e+0.016
o+0.015
aky+0.015
Neg logit contributions
der-0.021
rier-0.020
lights-0.019
fulness-0.019
 Which-0.019
Token It
Token ID632
Feature activation-3.77e+00

Pos logit contributions
 horizon+0.072
lights+0.071
rier+0.071
der+0.070
bread+0.069
Neg logit contributions
 named-0.123
e-0.114
o-0.110
oo-0.104
ow-0.101
Token is
Token ID318
Feature activation-4.67e-01

Pos logit contributions
der+0.074
rier+0.073
umm+0.069
 horizon+0.068
fulness+0.067
Neg logit contributions
 named-0.081
e-0.072
o-0.072
ind-0.070
 walking-0.069
Token for
Token ID329
Feature activation-1.78e+00

Pos logit contributions
der+0.011
rier+0.011
umm+0.010
 horizon+0.010
lights+0.010
Neg logit contributions
 named-0.008
e-0.007
o-0.007
ind-0.007
 walking-0.007
Token measuring
Token ID15964
Feature activation-3.19e+00

Pos logit contributions
der+0.040
rier+0.039
umm+0.037
lights+0.037
bread+0.037
Neg logit contributions
 named-0.033
e-0.029
 walking-0.028
ind-0.028
o-0.028
Token how
Token ID703
Feature activation-3.04e+00

Pos logit contributions
umm+0.072
der+0.071
rier+0.070
bread+0.069
 horizon+0.067
Neg logit contributions
 named-0.055
e-0.051
 playing-0.050
ind-0.050
o-0.048
Token heavy
Token ID4334
Feature activation-7.02e+00

Pos logit contributions
der+0.073
umm+0.072
rier+0.071
 horizon+0.069
 Which+0.068
Neg logit contributions
 named-0.048
o-0.043
a-0.042
 walking-0.042
ind-0.042
Token you
Token ID345
Feature activation-3.49e+00

Pos logit contributions
umm+0.144
der+0.140
bread+0.139
rier+0.137
 Moral+0.134
Neg logit contributions
 named-0.133
 playing-0.126
e-0.120
 walking-0.114
ind-0.111
Token are
Token ID389
Feature activation-1.35e+00

Pos logit contributions
der+0.074
rier+0.073
umm+0.071
lights+0.069
 Which+0.067
Neg logit contributions
 named-0.071
e-0.066
o-0.062
aky-0.060
 walking-0.060
Token.
Token ID13
Feature activation-4.36e+00

Pos logit contributions
der+0.032
umm+0.031
rier+0.031
lights+0.030
 Which+0.030
Neg logit contributions
 named-0.023
 walking-0.020
e-0.020
o-0.020
 playing-0.019
Token You
Token ID921
Feature activation-7.22e-01

Pos logit contributions
rier+0.080
lights+0.080
der+0.078
 horizon+0.078
bread+0.076
Neg logit contributions
 named-0.106
e-0.097
o-0.090
oo-0.088
asc-0.082
Token should
Token ID815
Feature activation+3.74e-01

Pos logit contributions
der+0.014
rier+0.014
umm+0.013
 horizon+0.013
lights+0.013
Neg logit contributions
 named-0.016
e-0.014
o-0.014
ind-0.014
 walking-0.014
Token "
Token ID366
Feature activation-1.08e+00

Pos logit contributions
umm+0.019
rier+0.019
der+0.018
lights+0.017
 horizon+0.017
Neg logit contributions
 named-0.031
 walking-0.028
 playing-0.027
o-0.027
 sitting-0.026
TokenLook
Token ID8567
Feature activation+1.15e+00

Pos logit contributions
rier+0.022
der+0.022
lights+0.021
 horizon+0.020
umm+0.020
Neg logit contributions
 named-0.023
e-0.021
o-0.021
P-0.020
ind-0.020
Token,
Token ID11
Feature activation-3.74e+00

Pos logit contributions
 named+0.018
aky+0.017
aking+0.016
ind+0.016
o+0.015
Neg logit contributions
der-0.027
rier-0.027
 maybe-0.027
 Which-0.026
fulness-0.026
Token Max
Token ID5436
Feature activation+1.57e+00

Pos logit contributions
der+0.086
rier+0.083
umm+0.079
 horizon+0.078
bread+0.078
Neg logit contributions
 named-0.068
e-0.059
o-0.058
ind-0.058
 walking-0.057
Token!
Token ID0
Feature activation-2.79e+00

Pos logit contributions
ind+0.031
 named+0.028
aky+0.028
ippy+0.028
years+0.027
Neg logit contributions
der-0.029
 less-0.028
 maybe-0.028
rier-0.027
 becomes-0.027
Token These
Token ID2312
Feature activation-7.01e+00

Pos logit contributions
rier+0.044
der+0.044
 horizon+0.041
lights+0.040
fulness+0.039
Neg logit contributions
 named-0.074
o-0.065
e-0.064
 playing-0.062
 walking-0.062
Token are
Token ID389
Feature activation-1.72e+00

Pos logit contributions
der+0.158
rier+0.154
 horizon+0.147
fulness+0.146
lights+0.144
Neg logit contributions
 named-0.137
e-0.120
 playing-0.115
a-0.113
o-0.113
Token my
Token ID616
Feature activation-4.96e+00

Pos logit contributions
der+0.038
rier+0.037
umm+0.035
 Which+0.035
lights+0.035
Neg logit contributions
 named-0.032
e-0.028
o-0.028
ind-0.028
 walking-0.027
Token past
Token ID1613
Feature activation-3.01e+00

Pos logit contributions
der+0.127
rier+0.122
fulness+0.117
bread+0.115
lights+0.115
Neg logit contributions
 named-0.088
 playing-0.073
 walking-0.070
e-0.068
ind-0.067
Tokenels
Token ID1424
Feature activation-2.89e+00

Pos logit contributions
der+0.071
rier+0.070
bread+0.066
 horizon+0.066
fulness+0.066
Neg logit contributions
 named-0.054
e-0.047
o-0.046
ind-0.046
 walking-0.044
Token.
Token ID13
Feature activation-4.25e+00

Pos logit contributions
der+0.070
rier+0.068
umm+0.065
 horizon+0.064
bread+0.064
Neg logit contributions
 named-0.048
e-0.041
o-0.040
ind-0.040
 walking-0.040
Token un
Token ID555
Feature activation+4.04e+00

Pos logit contributions
rier+0.099
der+0.096
umm+0.095
 horizon+0.094
lights+0.093
Neg logit contributions
 named-0.104
 playing-0.086
 walking-0.086
e-0.085
o-0.082
Tokencles
Token ID5427
Feature activation-1.85e+00

Pos logit contributions
 named+0.078
adow+0.071
 walking+0.068
 playing+0.068
 hiding+0.067
Neg logit contributions
rier-0.081
 instructions-0.076
 Which-0.076
rays-0.074
 extract-0.074
Token can
Token ID460
Feature activation-8.87e-01

Pos logit contributions
der+0.039
rier+0.038
umm+0.036
lights+0.035
 horizon+0.035
Neg logit contributions
 named-0.036
e-0.032
o-0.032
ind-0.031
 walking-0.031
Token give
Token ID1577
Feature activation-1.76e+00

Pos logit contributions
der+0.017
rier+0.017
umm+0.016
lights+0.016
 Which+0.015
Neg logit contributions
 named-0.020
e-0.018
ind-0.017
o-0.017
 walking-0.017
Token me
Token ID502
Feature activation-2.62e+00

Pos logit contributions
der+0.040
rier+0.039
umm+0.037
lights+0.037
 horizon+0.037
Neg logit contributions
 named-0.032
e-0.028
o-0.027
ind-0.027
 walking-0.027
Token fake
Token ID8390
Feature activation-7.00e+00

Pos logit contributions
der+0.051
rier+0.049
umm+0.047
 horizon+0.046
lights+0.046
Neg logit contributions
 named-0.056
e-0.050
o-0.049
ind-0.049
 walking-0.048
Token pills
Token ID19521
Feature activation-2.45e+00

Pos logit contributions
 horizon+0.127
der+0.125
rier+0.125
lights+0.124
umm+0.123
Neg logit contributions
 named-0.157
 playing-0.136
ind-0.135
e-0.135
 walking-0.133
Token,
Token ID11
Feature activation-4.94e+00

Pos logit contributions
der+0.053
rier+0.052
umm+0.051
lights+0.050
 horizon+0.049
Neg logit contributions
 named-0.046
e-0.041
o-0.039
a-0.039
ind-0.039
Token and
Token ID290
Feature activation-5.03e+00

Pos logit contributions
umm+0.099
der+0.098
rier+0.097
lights+0.095
 horizon+0.094
Neg logit contributions
 named-0.104
e-0.087
 playing-0.087
 walking-0.085
a-0.084
Token fake
Token ID8390
Feature activation-6.16e+00

Pos logit contributions
rier+0.104
umm+0.104
der+0.102
 horizon+0.101
lights+0.099
Neg logit contributions
 named-0.109
 playing-0.093
 walking-0.091
e-0.090
o-0.085
Token pills
Token ID19521
Feature activation-1.37e+00

Pos logit contributions
rier+0.111
der+0.111
 horizon+0.110
lights+0.108
umm+0.108
Neg logit contributions
 named-0.138
e-0.119
ind-0.118
 playing-0.118
 walking-0.117
Token head
Token ID1182
Feature activation+2.85e-01

Pos logit contributions
der+0.011
rier+0.011
umm+0.011
 Which+0.011
bread+0.011
Neg logit contributions
 named-0.009
e-0.008
 walking-0.008
ind-0.008
a-0.008
Token.
Token ID13
Feature activation-3.93e+00

Pos logit contributions
 named+0.005
e+0.004
o+0.004
ind+0.004
 walking+0.004
Neg logit contributions
der-0.007
rier-0.007
umm-0.006
 horizon-0.006
lights-0.006
Token She
Token ID1375
Feature activation-1.25e+00

Pos logit contributions
lights+0.073
rier+0.070
der+0.067
fulness+0.066
umm+0.066
Neg logit contributions
 named-0.092
 walking-0.080
o-0.079
 playing-0.077
oo-0.074
Token thought
Token ID1807
Feature activation-1.28e+00

Pos logit contributions
der+0.029
rier+0.028
umm+0.027
lights+0.027
 horizon+0.027
Neg logit contributions
 named-0.022
e-0.019
o-0.019
ind-0.018
 walking-0.018
Token the
Token ID262
Feature activation-3.66e+00

Pos logit contributions
der+0.025
rier+0.025
umm+0.025
 horizon+0.024
bread+0.023
Neg logit contributions
 named-0.027
 walking-0.023
e-0.023
 playing-0.023
a-0.023
Token fake
Token ID8390
Feature activation-7.00e+00

Pos logit contributions
der+0.081
rier+0.078
lights+0.075
umm+0.074
fulness+0.074
Neg logit contributions
 named-0.070
e-0.061
 walking-0.060
ind-0.059
o-0.059
Token heels
Token ID18728
Feature activation-2.10e+00

Pos logit contributions
der+0.164
rier+0.163
umm+0.160
lights+0.158
fulness+0.157
Neg logit contributions
 named-0.119
 walking-0.101
ind-0.099
e-0.098
 playing-0.098
Token were
Token ID547
Feature activation-1.19e-01

Pos logit contributions
der+0.044
rier+0.043
umm+0.041
 Which+0.040
bread+0.040
Neg logit contributions
 named-0.042
e-0.037
o-0.036
 walking-0.036
ind-0.035
Token silly
Token ID14397
Feature activation-1.73e+00

Pos logit contributions
der+0.003
rier+0.002
fulness+0.002
lights+0.002
umm+0.002
Neg logit contributions
 named-0.002
e-0.002
o-0.002
ind-0.002
aky-0.002
Token,
Token ID11
Feature activation-4.54e+00

Pos logit contributions
der+0.042
rier+0.041
umm+0.039
 horizon+0.039
bread+0.039
Neg logit contributions
 named-0.029
e-0.024
o-0.024
 walking-0.024
 playing-0.023
Token but
Token ID475
Feature activation-1.89e+00

Pos logit contributions
umm+0.102
rier+0.098
lights+0.096
der+0.096
bread+0.095
Neg logit contributions
 named-0.087
 playing-0.075
 walking-0.073
e-0.071
o-0.068
Token give
Token ID1577
Feature activation-4.66e-01

Pos logit contributions
der+0.034
rier+0.034
umm+0.033
 Which+0.032
lights+0.032
Neg logit contributions
 named-0.029
e-0.025
 playing-0.025
 walking-0.025
a-0.023
Token her
Token ID607
Feature activation-4.28e+00

Pos logit contributions
der+0.013
umm+0.012
rier+0.012
lights+0.012
 horizon+0.012
Neg logit contributions
 named-0.006
e-0.006
 walking-0.005
a-0.005
 playing-0.005
Token a
Token ID257
Feature activation-2.71e+00

Pos logit contributions
der+0.109
rier+0.107
umm+0.106
 horizon+0.103
lights+0.102
Neg logit contributions
 named-0.068
e-0.061
a-0.058
 walking-0.057
o-0.056
Token special
Token ID2041
Feature activation-4.01e+00

Pos logit contributions
der+0.064
lights+0.062
rier+0.061
umm+0.061
 horizon+0.057
Neg logit contributions
 named-0.050
a-0.047
o-0.043
 walking-0.043
e-0.042
Token brightly
Token ID35254
Feature activation+1.59e-01

Pos logit contributions
der+0.079
rier+0.077
lights+0.075
umm+0.073
 horizon+0.072
Neg logit contributions
 named-0.087
e-0.077
 walking-0.075
a-0.075
o-0.074
Token coloured
Token ID34746
Feature activation-6.98e+00

Pos logit contributions
 named+0.003
e+0.002
o+0.002
 walking+0.002
ind+0.002
Neg logit contributions
der-0.004
rier-0.004
umm-0.004
 horizon-0.004
lights-0.004
Token tooth
Token ID16162
Feature activation-3.82e+00

Pos logit contributions
der+0.160
lights+0.153
umm+0.151
rier+0.150
 horizon+0.148
Neg logit contributions
 named-0.127
e-0.112
a-0.107
 walking-0.106
 playing-0.104
Tokenbrush
Token ID32680
Feature activation-8.74e-01

Pos logit contributions
 horizon+0.062
der+0.062
umm+0.062
 Which+0.061
rier+0.061
Neg logit contributions
 named-0.091
e-0.083
a-0.082
o-0.081
aky-0.079
Token or
Token ID393
Feature activation-4.94e+00

Pos logit contributions
der+0.022
rier+0.021
umm+0.021
 Which+0.020
 horizon+0.020
Neg logit contributions
 named-0.014
e-0.012
o-0.012
 walking-0.011
ind-0.011
Token some
Token ID617
Feature activation-5.27e+00

Pos logit contributions
rier+0.131
der+0.130
umm+0.130
lights+0.127
 horizon+0.126
Neg logit contributions
 named-0.078
 playing-0.064
 walking-0.063
e-0.061
 called-0.057
Token y
Token ID331
Feature activation+8.23e-01

Pos logit contributions
der+0.104
rier+0.104
umm+0.100
lights+0.100
 horizon+0.100
Neg logit contributions
 named-0.109
e-0.095
 walking-0.094
 playing-0.093
aky-0.090
Token boring
Token ID14262
Feature activation-2.62e+00

Pos logit contributions
der+0.026
rier+0.025
lights+0.023
 Which+0.023
umm+0.023
Neg logit contributions
 named-0.020
e-0.018
o-0.017
ind-0.017
aky-0.017
Token.
Token ID13
Feature activation-4.39e+00

Pos logit contributions
der+0.061
rier+0.059
lights+0.056
umm+0.056
fulness+0.055
Neg logit contributions
 named-0.046
e-0.041
ind-0.040
o-0.039
 walking-0.039
Token I
Token ID314
Feature activation+1.33e-01

Pos logit contributions
rier+0.063
der+0.061
bread+0.061
 horizon+0.058
fulness+0.056
Neg logit contributions
 named-0.125
o-0.109
e-0.104
 playing-0.103
ind-0.100
Token like
Token ID588
Feature activation-2.93e+00

Pos logit contributions
 named+0.003
ind+0.003
 walking+0.002
e+0.002
aky+0.002
Neg logit contributions
der-0.003
rier-0.002
 maybe-0.002
 Which-0.002
fulness-0.002
Token things
Token ID1243
Feature activation-2.56e+00

Pos logit contributions
rier+0.066
der+0.066
umm+0.065
bread+0.062
 horizon+0.061
Neg logit contributions
 named-0.057
 playing-0.049
 walking-0.048
ind-0.047
o-0.047
Token that
Token ID326
Feature activation-6.97e+00

Pos logit contributions
der+0.064
rier+0.064
umm+0.064
 horizon+0.062
bread+0.062
Neg logit contributions
 named-0.039
 playing-0.033
o-0.032
e-0.032
 walking-0.032
Token can
Token ID460
Feature activation-2.68e+00

Pos logit contributions
der+0.161
rier+0.160
umm+0.155
 horizon+0.149
bread+0.148
Neg logit contributions
 named-0.134
e-0.119
 playing-0.118
o-0.114
 walking-0.109
Token destroy
Token ID4117
Feature activation-4.01e+00

Pos logit contributions
umm+0.061
rier+0.059
der+0.057
bread+0.055
 Which+0.053
Neg logit contributions
 named-0.054
 playing-0.047
e-0.047
o-0.045
ind-0.045
Token things
Token ID1243
Feature activation-3.59e+00

Pos logit contributions
umm+0.089
der+0.089
rier+0.088
bread+0.086
 horizon+0.084
Neg logit contributions
 named-0.076
e-0.069
ind-0.069
 playing-0.068
o-0.064
Token."
Token ID526
Feature activation-2.01e+00

Pos logit contributions
umm+0.084
rier+0.080
der+0.079
 Which+0.078
bread+0.077
Neg logit contributions
 named-0.061
 playing-0.056
e-0.053
o-0.052
 walking-0.051
Token Tim
Token ID5045
Feature activation+2.44e+00

Pos logit contributions
rier+0.057
der+0.054
bread+0.053
umm+0.052
fulness+0.052
Neg logit contributions
 named-0.034
o-0.027
 playing-0.026
 walking-0.024
e-0.023
Token help
Token ID1037
Feature activation-1.31e-01

Pos logit contributions
 named+0.019
o+0.018
ind+0.018
aky+0.017
 walking+0.017
Neg logit contributions
der-0.013
rier-0.013
 perfection-0.012
bread-0.012
umm-0.012
Token the
Token ID262
Feature activation-2.41e+00

Pos logit contributions
der+0.003
umm+0.003
lights+0.003
rier+0.003
 horizon+0.003
Neg logit contributions
 named-0.002
e-0.002
 playing-0.002
 walking-0.002
o-0.002
Token animals
Token ID4695
Feature activation-8.61e-01

Pos logit contributions
der+0.043
rier+0.042
umm+0.039
 horizon+0.039
bread+0.038
Neg logit contributions
 named-0.056
o-0.050
e-0.050
ind-0.048
aky-0.048
Token get
Token ID651
Feature activation+1.42e+00

Pos logit contributions
der+0.020
rier+0.019
bread+0.018
 horizon+0.018
fulness+0.018
Neg logit contributions
 named-0.015
ind-0.013
e-0.013
o-0.013
aky-0.013
Token the
Token ID262
Feature activation-3.74e+00

Pos logit contributions
 named+0.028
e+0.024
o+0.024
 walking+0.024
ind+0.024
Neg logit contributions
der-0.031
rier-0.029
umm-0.029
 horizon-0.028
lights-0.028
Token fake
Token ID8390
Feature activation-6.97e+00

Pos logit contributions
der+0.073
rier+0.072
umm+0.068
bread+0.067
 horizon+0.067
Neg logit contributions
 named-0.082
o-0.072
e-0.072
ind-0.069
aky-0.068
Token tw
Token ID665
Feature activation-3.94e-01

Pos logit contributions
rier+0.175
der+0.173
lights+0.171
umm+0.170
 Which+0.169
Neg logit contributions
 named-0.105
 walking-0.090
 playing-0.088
e-0.086
ind-0.085
Tokenig
Token ID328
Feature activation-2.31e+00

Pos logit contributions
der+0.010
rier+0.010
umm+0.009
lights+0.009
bread+0.009
Neg logit contributions
 named-0.006
o-0.005
 walking-0.005
e-0.005
ind-0.005
Token.
Token ID13
Feature activation-3.58e+00

Pos logit contributions
der+0.058
rier+0.055
umm+0.055
 Which+0.054
 horizon+0.053
Neg logit contributions
 named-0.038
e-0.033
o-0.033
 walking-0.032
a-0.031
Token He
Token ID679
Feature activation-4.04e-01

Pos logit contributions
lights+0.062
fulness+0.061
rier+0.061
der+0.057
umm+0.056
Neg logit contributions
 named-0.089
o-0.078
 walking-0.074
 playing-0.073
 called-0.071
Token would
Token ID561
Feature activation+4.49e-01

Pos logit contributions
der+0.009
rier+0.009
bread+0.008
 maybe+0.008
 Which+0.008
Neg logit contributions
 named-0.007
e-0.006
ind-0.006
 walking-0.006
aky-0.006
Token around
Token ID1088
Feature activation+7.94e-01

Pos logit contributions
 named+0.029
e+0.025
o+0.025
ind+0.024
 walking+0.023
Neg logit contributions
der-0.052
rier-0.051
umm-0.048
lights-0.048
 horizon-0.048
Token and
Token ID290
Feature activation-3.35e-01

Pos logit contributions
 named+0.014
e+0.012
 walking+0.012
o+0.012
ind+0.012
Neg logit contributions
der-0.019
rier-0.018
umm-0.018
bread-0.017
 Which-0.017
Token added
Token ID2087
Feature activation-2.48e+00

Pos logit contributions
der+0.009
rier+0.009
umm+0.009
bread+0.008
lights+0.008
Neg logit contributions
 named-0.005
e-0.004
o-0.004
ind-0.004
 walking-0.004
Token the
Token ID262
Feature activation-4.10e+00

Pos logit contributions
umm+0.054
der+0.053
rier+0.052
 horizon+0.050
fulness+0.049
Neg logit contributions
 named-0.047
 playing-0.042
e-0.041
 walking-0.041
a-0.041
Token biggest
Token ID4094
Feature activation-5.02e+00

Pos logit contributions
der+0.098
rier+0.095
umm+0.090
bread+0.090
fulness+0.090
Neg logit contributions
 named-0.072
 walking-0.062
 playing-0.059
ind-0.059
e-0.058
Token,
Token ID11
Feature activation-6.96e+00

Pos logit contributions
der+0.131
rier+0.130
umm+0.125
fulness+0.124
bread+0.123
Neg logit contributions
 named-0.080
 walking-0.064
 playing-0.063
oo-0.057
a-0.057
Token most
Token ID749
Feature activation-2.03e+00

Pos logit contributions
der+0.183
umm+0.175
rier+0.174
bread+0.165
fulness+0.164
Neg logit contributions
 named-0.110
 walking-0.092
 playing-0.092
a-0.082
e-0.081
Token colourful
Token ID44799
Feature activation-3.65e+00

Pos logit contributions
der+0.044
rier+0.043
umm+0.041
 Which+0.041
fulness+0.040
Neg logit contributions
 named-0.039
 walking-0.033
e-0.033
ind-0.033
o-0.033
Token umbrella
Token ID25510
Feature activation-2.69e+00

Pos logit contributions
der+0.084
rier+0.082
umm+0.078
bread+0.077
fulness+0.076
Neg logit contributions
 named-0.067
e-0.057
 walking-0.056
ind-0.056
o-0.055
Token they
Token ID484
Feature activation-1.54e+00

Pos logit contributions
der+0.064
rier+0.063
umm+0.060
 Which+0.060
 horizon+0.059
Neg logit contributions
 named-0.046
o-0.039
e-0.039
 walking-0.038
 playing-0.037
Token could
Token ID714
Feature activation+4.50e-01

Pos logit contributions
der+0.032
rier+0.031
umm+0.030
bread+0.029
 horizon+0.029
Neg logit contributions
 named-0.033
e-0.028
o-0.028
 walking-0.027
a-0.027
Token what
Token ID644
Feature activation-2.50e+00

Pos logit contributions
der+0.046
rier+0.045
umm+0.043
 horizon+0.042
lights+0.042
Neg logit contributions
 named-0.036
e-0.031
o-0.031
ind-0.030
 walking-0.030
Token is
Token ID318
Feature activation+1.26e+00

Pos logit contributions
der+0.043
rier+0.041
umm+0.039
 Which+0.038
bread+0.038
Neg logit contributions
 named-0.059
e-0.053
ind-0.053
o-0.052
 walking-0.051
Token happening
Token ID5836
Feature activation+2.83e-01

Pos logit contributions
 named+0.020
o+0.018
ind+0.018
e+0.018
aky+0.018
Neg logit contributions
der-0.030
rier-0.030
 maybe-0.028
bread-0.028
imet-0.028
Token about
Token ID546
Feature activation-1.96e+00

Pos logit contributions
 named+0.006
e+0.005
ind+0.005
o+0.005
aky+0.005
Neg logit contributions
der-0.006
rier-0.006
 horizon-0.005
fulness-0.005
umm-0.005
Token the
Token ID262
Feature activation-3.81e+00

Pos logit contributions
der+0.037
rier+0.035
fulness+0.032
 horizon+0.032
bread+0.032
Neg logit contributions
 named-0.043
e-0.039
o-0.039
aky-0.038
ind-0.038
Token shrinking
Token ID27382
Feature activation-6.96e+00

Pos logit contributions
der+0.080
rier+0.077
 horizon+0.074
umm+0.073
 Which+0.073
Neg logit contributions
 named-0.076
o-0.067
ind-0.066
e-0.066
 walking-0.064
Token.
Token ID13
Feature activation-3.69e+00

Pos logit contributions
der+0.143
rier+0.140
lights+0.137
umm+0.136
 Which+0.134
Neg logit contributions
 named-0.140
e-0.122
a-0.118
 walking-0.117
aky-0.115
Tokenâ
Token ID22940
Feature activation+2.92e+00

Pos logit contributions
der+0.064
rier+0.062
umm+0.057
lights+0.057
bread+0.057
Neg logit contributions
 named-0.092
o-0.081
 walking-0.080
 playing-0.079
e-0.075
Token
Token ID26391
Feature activation+1.40e+00

Pos logit contributions
 named+0.067
ind+0.062
 walking+0.060
a+0.059
 playing+0.059
Neg logit contributions
der-0.049
fulness-0.047
 horizon-0.046
rier-0.046
umm-0.046
Token
Token ID198
Feature activation+2.00e+00

Pos logit contributions
 named+0.034
ind+0.031
aky+0.031
e+0.031
 walking+0.030
Neg logit contributions
der-0.022
rier-0.021
 Which-0.020
 maybe-0.020
umm-0.020
Token
Token ID198
Feature activation+1.64e+00

Pos logit contributions
 named+0.050
e+0.045
ind+0.045
o+0.045
 walking+0.044
Neg logit contributions
der-0.032
rier-0.030
umm-0.028
 horizon-0.028
fulness-0.028
Token decided
Token ID3066
Feature activation-1.43e+00

Pos logit contributions
der+0.007
rier+0.007
bread+0.007
umm+0.007
fulness+0.007
Neg logit contributions
 named-0.006
ind-0.005
e-0.005
o-0.005
 walking-0.005
Token to
Token ID284
Feature activation+1.16e+00

Pos logit contributions
umm+0.036
rier+0.035
der+0.034
lights+0.034
 horizon+0.033
Neg logit contributions
 named-0.024
 walking-0.021
 playing-0.021
o-0.019
 hiding-0.019
Token buy
Token ID2822
Feature activation-1.97e+00

Pos logit contributions
 named+0.027
e+0.025
o+0.024
ind+0.024
 walking+0.024
Neg logit contributions
der-0.020
rier-0.019
umm-0.018
 horizon-0.018
bread-0.018
Token the
Token ID262
Feature activation-5.25e+00

Pos logit contributions
der+0.049
rier+0.048
lights+0.047
umm+0.047
 horizon+0.046
Neg logit contributions
 named-0.034
e-0.030
 playing-0.029
 walking-0.028
a-0.028
Token sweet
Token ID6029
Feature activation-5.66e+00

Pos logit contributions
der+0.115
rier+0.111
umm+0.106
lights+0.104
 horizon+0.104
Neg logit contributions
 named-0.101
e-0.088
o-0.087
ind-0.086
 walking-0.085
Token smelling
Token ID48090
Feature activation-6.96e+00

Pos logit contributions
der+0.123
rier+0.118
umm+0.110
lights+0.110
 horizon+0.109
Neg logit contributions
 named-0.113
e-0.098
 walking-0.095
ind-0.095
o-0.094
Token perfume
Token ID40094
Feature activation-4.30e+00

Pos logit contributions
der+0.170
rier+0.162
umm+0.157
 horizon+0.155
lights+0.154
Neg logit contributions
 named-0.125
e-0.104
a-0.100
 walking-0.098
 playing-0.098
Token.
Token ID13
Feature activation-3.45e+00

Pos logit contributions
der+0.113
rier+0.112
 horizon+0.109
 Which+0.109
umm+0.107
Neg logit contributions
 named-0.061
e-0.049
a-0.049
o-0.049
 playing-0.047
Token Mia
Token ID32189
Feature activation+3.19e-01

Pos logit contributions
rier+0.096
fulness+0.090
lights+0.086
der+0.085
 horizon+0.084
Neg logit contributions
 named-0.055
o-0.047
e-0.042
 playing-0.041
ow-0.040
Token was
Token ID373
Feature activation+4.21e+00

Pos logit contributions
 named+0.006
ind+0.005
e+0.005
o+0.005
 walking+0.005
Neg logit contributions
der-0.007
rier-0.007
umm-0.006
bread-0.006
 horizon-0.006
Token so
Token ID523
Feature activation+2.73e+00

Pos logit contributions
 named+0.076
ind+0.070
o+0.067
e+0.066
aky+0.064
Neg logit contributions
der-0.092
rier-0.091
 less-0.087
 maybe-0.085
bread-0.083
Token had
Token ID550
Feature activation+1.67e+00

Pos logit contributions
 named+0.005
e+0.004
o+0.004
 walking+0.004
ind+0.004
Neg logit contributions
der-0.005
rier-0.005
umm-0.005
lights-0.005
 horizon-0.005
Token enough
Token ID1576
Feature activation-6.26e-01

Pos logit contributions
 named+0.031
 walking+0.027
e+0.026
 playing+0.026
o+0.026
Neg logit contributions
der-0.038
rier-0.038
umm-0.036
lights-0.036
 horizon-0.035
Token to
Token ID284
Feature activation+1.12e+00

Pos logit contributions
der+0.012
rier+0.011
umm+0.011
 horizon+0.011
 Which+0.011
Neg logit contributions
 named-0.014
 walking-0.013
 playing-0.012
e-0.012
o-0.012
Token buy
Token ID2822
Feature activation-1.66e+00

Pos logit contributions
 named+0.027
e+0.024
o+0.024
ind+0.024
 walking+0.024
Neg logit contributions
der-0.019
rier-0.018
umm-0.017
 horizon-0.017
bread-0.017
Token the
Token ID262
Feature activation-4.12e+00

Pos logit contributions
der+0.041
umm+0.040
 horizon+0.040
lights+0.039
bread+0.038
Neg logit contributions
 named-0.027
 playing-0.024
 walking-0.024
e-0.022
ind-0.021
Token fake
Token ID8390
Feature activation-6.95e+00

Pos logit contributions
der+0.080
rier+0.077
umm+0.073
 horizon+0.072
lights+0.072
Neg logit contributions
 named-0.089
e-0.079
o-0.078
ind-0.078
 walking-0.077
Token dog
Token ID3290
Feature activation-2.44e+00

Pos logit contributions
der+0.163
rier+0.158
lights+0.154
umm+0.152
fulness+0.152
Neg logit contributions
 named-0.124
 walking-0.102
e-0.101
ind-0.101
 playing-0.101
Token.
Token ID13
Feature activation-2.12e+00

Pos logit contributions
der+0.055
rier+0.053
bread+0.050
fulness+0.050
umm+0.050
Neg logit contributions
 named-0.044
ind-0.039
e-0.038
o-0.037
aky-0.037
Token
Token ID198
Feature activation+2.49e+00

Pos logit contributions
rier+0.053
bread+0.048
lights+0.047
der+0.047
 horizon+0.046
Neg logit contributions
 named-0.044
o-0.037
 playing-0.034
 walking-0.032
oo-0.031
Token
Token ID198
Feature activation+1.61e+00

Pos logit contributions
 named+0.043
e+0.037
o+0.037
ind+0.036
 walking+0.036
Neg logit contributions
der-0.059
rier-0.057
umm-0.055
 horizon-0.054
lights-0.054
TokenWhen
Token ID2215
Feature activation+5.91e-01

Pos logit contributions
 named+0.038
e+0.034
o+0.034
 walking+0.034
ind+0.034
Neg logit contributions
der-0.028
rier-0.026
 Which-0.025
fulness-0.025
umm-0.025
Token.
Token ID13
Feature activation-3.50e+00

Pos logit contributions
der+0.008
rier+0.008
 horizon+0.007
 less+0.007
lights+0.007
Neg logit contributions
 named-0.007
ind-0.006
e-0.006
aky-0.006
a-0.006
Token He
Token ID679
Feature activation-1.26e-01

Pos logit contributions
rier+0.069
fulness+0.061
bread+0.061
lights+0.059
 brighter+0.059
Neg logit contributions
 named-0.082
 playing-0.075
o-0.073
 walking-0.071
 running-0.064
Token pretended
Token ID36766
Feature activation-1.32e+00

Pos logit contributions
der+0.002
 maybe+0.002
 less+0.002
 becomes+0.002
 Which+0.002
Neg logit contributions
ind-0.002
asc-0.002
 named-0.002
oo-0.002
e-0.002
Token that
Token ID326
Feature activation-4.04e+00

Pos logit contributions
rier+0.029
umm+0.029
der+0.028
bread+0.026
 horizon+0.026
Neg logit contributions
 named-0.027
 walking-0.024
 playing-0.023
o-0.023
e-0.022
Token the
Token ID262
Feature activation-4.09e+00

Pos logit contributions
der+0.079
rier+0.078
umm+0.074
 horizon+0.072
bread+0.071
Neg logit contributions
 named-0.092
 walking-0.080
 playing-0.079
o-0.078
e-0.078
Token red
Token ID2266
Feature activation-6.94e+00

Pos logit contributions
der+0.091
rier+0.087
umm+0.083
bread+0.082
fulness+0.082
Neg logit contributions
 named-0.078
 walking-0.067
e-0.067
o-0.067
ind-0.066
Token car
Token ID1097
Feature activation-3.58e+00

Pos logit contributions
der+0.173
rier+0.168
umm+0.168
bread+0.164
fulness+0.163
Neg logit contributions
 named-0.115
o-0.097
 walking-0.097
 playing-0.094
ind-0.088
Token was
Token ID373
Feature activation+4.16e-01

Pos logit contributions
der+0.077
rier+0.074
umm+0.070
 horizon+0.069
bread+0.069
Neg logit contributions
 named-0.070
e-0.061
o-0.060
ind-0.060
 walking-0.059
Token driving
Token ID5059
Feature activation-1.01e+00

Pos logit contributions
 named+0.007
ind+0.007
e+0.007
ek+0.007
a+0.007
Neg logit contributions
der-0.009
 less-0.009
rier-0.009
 brighter-0.009
lights-0.008
Token the
Token ID262
Feature activation-6.68e-01

Pos logit contributions
der+0.021
rier+0.021
lights+0.020
fulness+0.020
 Which+0.019
Neg logit contributions
 named-0.019
ind-0.017
e-0.017
aky-0.017
o-0.016
Token blue
Token ID4171
Feature activation-4.26e+00

Pos logit contributions
der+0.013
rier+0.013
 horizon+0.012
umm+0.012
lights+0.012
Neg logit contributions
 named-0.014
e-0.013
o-0.012
ind-0.012
aky-0.012
Tokened
Token ID276
Feature activation-2.01e+00

Pos logit contributions
der+0.014
rier+0.013
umm+0.013
fulness+0.012
bread+0.012
Neg logit contributions
 named-0.017
e-0.015
ind-0.015
o-0.015
 walking-0.015
Token.
Token ID13
Feature activation-3.07e+00

Pos logit contributions
der+0.046
rier+0.044
umm+0.043
bread+0.042
lights+0.042
Neg logit contributions
 named-0.037
e-0.033
o-0.031
ind-0.031
 walking-0.031
Token<|endoftext|>
Token ID50256
Feature activation+4.26e+00

Pos logit contributions
rier+0.073
der+0.069
lights+0.068
bread+0.067
umm+0.065
Neg logit contributions
 named-0.060
o-0.054
 walking-0.049
e-0.049
 playing-0.048
TokenOnce
Token ID7454
Feature activation+6.42e+00

Pos logit contributions
 named+0.107
ind+0.102
aky+0.098
e+0.098
ek+0.097
Neg logit contributions
der-0.061
 Which-0.055
rier-0.055
 horizon-0.055
fulness-0.054
Token upon
Token ID2402
Feature activation+4.93e+00

Pos logit contributions
 named+0.109
e+0.101
ind+0.098
aky+0.093
ek+0.091
Neg logit contributions
der-0.147
rier-0.141
lights-0.140
fulness-0.140
 maybe-0.140
Token a
Token ID257
Feature activation+7.17e+00

Pos logit contributions
 named+0.095
ind+0.093
o+0.085
ek+0.084
asc+0.084
Neg logit contributions
der-0.098
 maybe-0.095
 less-0.093
fulness-0.089
 perfection-0.089
Token time
Token ID640
Feature activation+5.13e+00

Pos logit contributions
asc+0.154
 named+0.148
idd+0.143
ind+0.141
tails+0.139
Neg logit contributions
der-0.109
 lesson-0.105
 less-0.103
 maybe-0.102
fulness-0.099
Token,
Token ID11
Feature activation+3.55e+00

Pos logit contributions
ind+0.107
aky+0.099
 named+0.091
e+0.089
asc+0.089
Neg logit contributions
 maybe-0.111
der-0.096
fulness-0.094
 Which-0.094
 Maybe-0.092
Token there
Token ID612
Feature activation+4.93e+00

Pos logit contributions
ind+0.064
ek+0.054
ippy+0.054
akers+0.053
iley+0.052
Neg logit contributions
 maybe-0.085
 Which-0.076
 Maybe-0.073
 Someone-0.073
der-0.071
Token was
Token ID373
Feature activation+3.91e+00

Pos logit contributions
ind+0.120
 named+0.117
aky+0.116
aking+0.112
ek+0.110
Neg logit contributions
 maybe-0.070
der-0.069
 Which-0.065
fulness-0.063
 becomes-0.063
Token an
Token ID281
Feature activation+2.27e+00

Pos logit contributions
ind+0.091
ek+0.087
aking+0.084
o+0.083
aky+0.083
Neg logit contributions
 less-0.057
 maybe-0.057
der-0.056
 Bang-0.053
 perfection-0.053
Token.
Token ID13
Feature activation-2.18e+00

Pos logit contributions
der+0.051
rier+0.048
bread+0.046
fulness+0.046
umm+0.045
Neg logit contributions
 named-0.041
ind-0.037
e-0.036
aky-0.035
o-0.035
Token<|endoftext|>
Token ID50256
Feature activation+5.63e+00

Pos logit contributions
rier+0.065
der+0.056
 horizon+0.055
bread+0.054
lights+0.054
Neg logit contributions
 named-0.042
o-0.039
 playing-0.031
 walking-0.027
oo-0.027
TokenOnce
Token ID7454
Feature activation+6.86e+00

Pos logit contributions
 named+0.150
ind+0.141
aky+0.137
e+0.136
ek+0.135
Neg logit contributions
der-0.073
rier-0.067
 Which-0.065
fulness-0.065
 horizon-0.064
Token upon
Token ID2402
Feature activation+5.77e+00

Pos logit contributions
 named+0.112
e+0.101
ind+0.098
aky+0.093
ek+0.092
Neg logit contributions
der-0.164
rier-0.157
lights-0.154
fulness-0.153
 maybe-0.152
Token a
Token ID257
Feature activation+7.71e+00

Pos logit contributions
 named+0.103
ind+0.096
o+0.089
ek+0.088
aky+0.087
Neg logit contributions
der-0.126
 maybe-0.118
rier-0.117
 less-0.117
bread-0.116
Token time
Token ID640
Feature activation+5.56e+00

Pos logit contributions
asc+0.159
 named+0.157
ind+0.153
tails+0.148
idd+0.148
Neg logit contributions
der-0.120
 lesson-0.115
fulness-0.110
 less-0.110
 maybe-0.110
Token,
Token ID11
Feature activation+3.63e+00

Pos logit contributions
ind+0.116
aky+0.106
 named+0.100
asc+0.098
aking+0.096
Neg logit contributions
 maybe-0.117
der-0.101
 Which-0.100
fulness-0.099
 Maybe-0.098
Token there
Token ID612
Feature activation+5.17e+00

Pos logit contributions
ind+0.065
ek+0.055
ippy+0.054
iley+0.054
asc+0.052
Neg logit contributions
 maybe-0.087
 Which-0.079
 Someone-0.075
 Maybe-0.074
 less-0.073
Token was
Token ID373
Feature activation+4.21e+00

Pos logit contributions
 named+0.121
ind+0.121
aky+0.118
aking+0.113
ek+0.112
Neg logit contributions
der-0.077
 maybe-0.077
 Which-0.073
fulness-0.070
 becomes-0.070
Token a
Token ID257
Feature activation+2.19e+00

Pos logit contributions
ind+0.090
ek+0.087
 named+0.085
aking+0.083
o+0.083
Neg logit contributions
 less-0.068
 maybe-0.067
der-0.067
 perfection-0.063
 Bang-0.063
Token little
Token ID1310
Feature activation+6.10e-01

Pos logit contributions
e+0.050
ow+0.050
ek+0.047
adow+0.047
asc+0.047
Neg logit contributions
rier-0.034
bread-0.033
 less-0.032
 maybe-0.031
 brighter-0.031
Token again
Token ID757
Feature activation-3.04e-01

Pos logit contributions
der+0.114
rier+0.112
umm+0.112
 Which+0.108
 horizon+0.107
Neg logit contributions
 named-0.062
e-0.052
 walking-0.050
 playing-0.049
o-0.049
Token.
Token ID13
Feature activation-2.57e+00

Pos logit contributions
der+0.008
rier+0.008
umm+0.008
bread+0.008
lights+0.008
Neg logit contributions
 named-0.004
e-0.004
o-0.004
 walking-0.004
ind-0.003
Token<|endoftext|>
Token ID50256
Feature activation+5.58e+00

Pos logit contributions
rier+0.073
lights+0.064
der+0.061
 hotter+0.059
 brighter+0.058
Neg logit contributions
 named-0.050
o-0.047
e-0.040
 playing-0.037
 walking-0.036
TokenOnce
Token ID7454
Feature activation+6.21e+00

Pos logit contributions
 named+0.147
ind+0.138
e+0.134
aky+0.134
ek+0.132
Neg logit contributions
der-0.075
rier-0.069
 Which-0.067
fulness-0.066
 horizon-0.066
Token upon
Token ID2402
Feature activation+5.19e+00

Pos logit contributions
 named+0.101
e+0.090
ind+0.087
aky+0.083
o+0.083
Neg logit contributions
der-0.150
rier-0.145
lights-0.141
fulness-0.140
 maybe-0.139
Token a
Token ID257
Feature activation+6.84e+00

Pos logit contributions
 named+0.092
ind+0.084
o+0.079
e+0.077
aky+0.077
Neg logit contributions
der-0.117
rier-0.109
 maybe-0.109
bread-0.107
 less-0.107
Token time
Token ID640
Feature activation+4.59e+00

Pos logit contributions
asc+0.143
 named+0.140
ind+0.136
idd+0.134
years+0.130
Neg logit contributions
der-0.106
 lesson-0.105
fulness-0.102
 maybe-0.101
 Which-0.098
Token,
Token ID11
Feature activation+3.07e+00

Pos logit contributions
ind+0.097
aky+0.087
 named+0.082
asc+0.081
e+0.080
Neg logit contributions
 maybe-0.100
der-0.087
fulness-0.086
 Which-0.085
 Maybe-0.082
Token there
Token ID612
Feature activation+4.36e+00

Pos logit contributions
ind+0.053
ippy+0.047
ek+0.045
asc+0.044
iley+0.043
Neg logit contributions
 maybe-0.076
 Which-0.070
 Someone-0.066
 less-0.066
 Maybe-0.065
Token was
Token ID373
Feature activation+4.12e+00

Pos logit contributions
 named+0.100
ind+0.098
aky+0.095
aking+0.091
ek+0.091
Neg logit contributions
der-0.070
 maybe-0.068
 Which-0.065
fulness-0.064
umm-0.063
Token a
Token ID257
Feature activation+2.18e+00

Pos logit contributions
ind+0.092
ek+0.088
 named+0.086
o+0.085
asc+0.085
Neg logit contributions
 less-0.063
 maybe-0.062
der-0.062
 perfection-0.058
 Bang-0.058
Tokenie
Token ID494
Feature activation+2.66e-01

Pos logit contributions
 named+0.041
asc+0.041
e+0.037
oo+0.036
P+0.036
Neg logit contributions
der-0.060
rier-0.059
 less-0.054
 again-0.054
bread-0.053
Token.
Token ID13
Feature activation-1.78e+00

Pos logit contributions
 named+0.005
oo+0.005
ind+0.005
asc+0.005
e+0.005
Neg logit contributions
der-0.006
rier-0.005
 less-0.005
fulness-0.005
 maybe-0.005
Token<|endoftext|>
Token ID50256
Feature activation+5.53e+00

Pos logit contributions
rier+0.035
der+0.034
umm+0.032
bread+0.031
fulness+0.031
Neg logit contributions
 named-0.042
o-0.037
 walking-0.035
 playing-0.035
e-0.034
TokenOnce
Token ID7454
Feature activation+6.47e+00

Pos logit contributions
 named+0.147
ind+0.139
aky+0.134
e+0.134
ek+0.133
Neg logit contributions
der-0.071
rier-0.065
 Which-0.063
 horizon-0.062
 maybe-0.062
Token upon
Token ID2402
Feature activation+5.28e+00

Pos logit contributions
 named+0.115
e+0.109
ind+0.102
ek+0.098
asc+0.096
Neg logit contributions
der-0.144
rier-0.137
lights-0.135
fulness-0.135
 maybe-0.133
Token a
Token ID257
Feature activation+6.91e+00

Pos logit contributions
 named+0.098
ind+0.093
o+0.086
ek+0.086
ippy+0.084
Neg logit contributions
der-0.112
 maybe-0.104
 less-0.104
rier-0.101
bread-0.101
Token time
Token ID640
Feature activation+6.12e+00

Pos logit contributions
asc+0.141
 named+0.138
ind+0.131
 walking+0.127
idd+0.126
Neg logit contributions
der-0.111
 less-0.106
 lesson-0.104
umm-0.104
imet-0.103
Token,
Token ID11
Feature activation+4.10e+00

Pos logit contributions
ind+0.121
aky+0.111
 named+0.109
e+0.105
asc+0.102
Neg logit contributions
 maybe-0.128
der-0.116
imet-0.111
 Which-0.110
fulness-0.109
Token there
Token ID612
Feature activation+5.29e+00

Pos logit contributions
 named+0.061
ind+0.061
iley+0.061
e+0.060
 years+0.059
Neg logit contributions
 maybe-0.092
rier-0.090
der-0.087
 Which-0.085
 brighter-0.082
Token was
Token ID373
Feature activation+5.17e+00

Pos logit contributions
 named+0.121
ind+0.117
aky+0.112
ek+0.109
aking+0.108
Neg logit contributions
der-0.086
 maybe-0.083
 Which-0.079
fulness-0.078
rier-0.077
Token a
Token ID257
Feature activation+2.43e+00

Pos logit contributions
ind+0.112
ek+0.110
asc+0.108
 named+0.106
aking+0.106
Neg logit contributions
 maybe-0.077
 less-0.076
der-0.076
 Bang-0.073
bread-0.072
Token there
Token ID612
Feature activation+4.96e+00

Pos logit contributions
ind+0.062
iley+0.054
ippy+0.054
ek+0.052
 named+0.051
Neg logit contributions
 maybe-0.087
 Which-0.079
der-0.076
 Maybe-0.075
 Someone-0.075
Token was
Token ID373
Feature activation+4.19e+00

Pos logit contributions
 named+0.114
ind+0.112
aky+0.108
aking+0.104
ek+0.104
Neg logit contributions
der-0.079
 maybe-0.077
 Which-0.073
fulness-0.072
 becomes-0.070
Token a
Token ID257
Feature activation+2.35e+00

Pos logit contributions
ind+0.093
ek+0.087
 named+0.085
o+0.085
aking+0.084
Neg logit contributions
 less-0.068
 maybe-0.067
der-0.067
 Which-0.061
 Bang-0.061
Token girl
Token ID2576
Feature activation+5.72e+00

Pos logit contributions
e+0.053
ow+0.053
ek+0.051
asc+0.050
ippy+0.050
Neg logit contributions
rier-0.036
bread-0.035
 less-0.035
 maybe-0.034
 brighter-0.034
Token named
Token ID3706
Feature activation+6.25e-01

Pos logit contributions
ippy+0.074
ek+0.065
asc+0.064
scar+0.062
uff+0.060
Neg logit contributions
 less-0.127
 maybe-0.127
 But-0.126
 Maybe-0.125
 Let-0.122
Token Lily
Token ID20037
Feature activation+6.07e+00

Pos logit contributions
ind+0.016
 named+0.015
aky+0.015
e+0.015
ek+0.014
Neg logit contributions
der-0.008
 maybe-0.008
 Which-0.008
fulness-0.008
 less-0.008
Token.
Token ID13
Feature activation-1.60e+00

Pos logit contributions
ippy+0.141
asc+0.135
ubby+0.135
ek+0.134
iley+0.134
Neg logit contributions
der-0.081
 Maybe-0.074
 maybe-0.072
 less-0.069
fulness-0.068
Token Lily
Token ID20037
Feature activation+3.67e+00

Pos logit contributions
der+0.026
 Which+0.023
rier+0.023
umm+0.023
lights+0.022
Neg logit contributions
 named-0.039
ind-0.037
e-0.036
ek-0.036
o-0.035
Token found
Token ID1043
Feature activation+1.31e+00

Pos logit contributions
ippy+0.070
ind+0.068
oo+0.067
uff+0.067
ubby+0.065
Neg logit contributions
 maybe-0.062
 Maybe-0.061
 becomes-0.061
 But-0.059
Maybe-0.059
Token a
Token ID257
Feature activation-9.32e-01

Pos logit contributions
 named+0.022
ind+0.020
e+0.019
o+0.019
 walking+0.018
Neg logit contributions
der-0.031
rier-0.030
 Which-0.029
umm-0.028
bread-0.028
Token shiny
Token ID22441
Feature activation-2.15e+00

Pos logit contributions
rier+0.016
der+0.015
 lighter+0.015
 brighter+0.015
 perfection+0.015
Neg logit contributions
e-0.020
 named-0.020
ek-0.019
aking-0.019
o-0.019
Token its
Token ID663
Feature activation-2.47e+00

Pos logit contributions
umm+0.018
der+0.018
rier+0.017
lights+0.017
bread+0.017
Neg logit contributions
 named-0.018
e-0.016
 walking-0.015
o-0.015
 playing-0.015
Token feathers
Token ID27400
Feature activation+3.81e-01

Pos logit contributions
der+0.056
fulness+0.054
 Which+0.053
lights+0.053
umm+0.053
Neg logit contributions
 named-0.045
ind-0.040
e-0.039
 walking-0.039
o-0.038
Token.
Token ID13
Feature activation-3.17e+00

Pos logit contributions
 named+0.006
e+0.005
o+0.005
ind+0.005
 walking+0.005
Neg logit contributions
der-0.009
rier-0.009
umm-0.009
 horizon-0.008
 Which-0.008
Token The
Token ID383
Feature activation-1.76e+00

Pos logit contributions
lights+0.070
rier+0.070
der+0.067
fulness+0.066
umm+0.065
Neg logit contributions
 named-0.069
o-0.059
 walking-0.056
 playing-0.054
e-0.053
Token turkey
Token ID26876
Feature activation-7.87e-01

Pos logit contributions
der+0.041
umm+0.038
 Which+0.038
lights+0.038
imet+0.038
Neg logit contributions
 named-0.031
 walking-0.027
ind-0.026
aky-0.026
a-0.026
Token seemed
Token ID3947
Feature activation+1.24e+00

Pos logit contributions
der+0.017
rier+0.017
umm+0.016
 Which+0.016
lights+0.016
Neg logit contributions
 named-0.015
e-0.013
o-0.013
 walking-0.013
ind-0.013
Token to
Token ID284
Feature activation+2.43e+00

Pos logit contributions
 named+0.022
e+0.019
o+0.019
 walking+0.019
ind+0.019
Neg logit contributions
der-0.028
rier-0.028
umm-0.027
lights-0.026
bread-0.026
Token like
Token ID588
Feature activation-1.84e+00

Pos logit contributions
 named+0.051
e+0.044
 walking+0.043
o+0.043
 playing+0.042
Neg logit contributions
der-0.051
rier-0.050
umm-0.049
lights-0.048
 Which-0.047
Token it
Token ID340
Feature activation-1.41e+00

Pos logit contributions
der+0.037
rier+0.037
umm+0.036
 horizon+0.035
lights+0.035
Neg logit contributions
 named-0.038
 walking-0.033
e-0.033
 playing-0.032
o-0.032
Token and
Token ID290
Feature activation-2.08e+00

Pos logit contributions
der+0.032
rier+0.032
umm+0.030
lights+0.030
bread+0.030
Neg logit contributions
 named-0.026
e-0.022
o-0.022
 walking-0.021
ind-0.021
Token didn
Token ID1422
Feature activation+3.86e+00

Pos logit contributions
rier+0.049
umm+0.048
der+0.047
 horizon+0.046
lights+0.045
Neg logit contributions
 named-0.039
 walking-0.032
 playing-0.032
o-0.031
e-0.031
TokenAfter
Token ID3260
Feature activation+1.62e+00

Pos logit contributions
 named+0.037
e+0.033
o+0.032
ind+0.032
 walking+0.032
Neg logit contributions
der-0.028
rier-0.027
umm-0.026
 horizon-0.025
bread-0.025
Token a
Token ID257
Feature activation+1.88e+00

Pos logit contributions
 named+0.032
ind+0.031
aky+0.031
a+0.030
aking+0.030
Neg logit contributions
der-0.032
 maybe-0.030
 less-0.029
rier-0.029
fulness-0.029
Token while
Token ID981
Feature activation+1.29e+00

Pos logit contributions
 named+0.040
 walking+0.034
o+0.034
e+0.033
 playing+0.033
Neg logit contributions
der-0.039
rier-0.039
umm-0.036
lights-0.035
 horizon-0.034
Token,
Token ID11
Feature activation-1.72e+00

Pos logit contributions
 named+0.016
e+0.013
o+0.012
ind+0.012
 walking+0.012
Neg logit contributions
der-0.037
rier-0.037
umm-0.035
 horizon-0.035
bread-0.035
Token Billy
Token ID15890
Feature activation+1.91e+00

Pos logit contributions
der+0.044
rier+0.044
umm+0.041
bread+0.041
lights+0.040
Neg logit contributions
 named-0.031
 walking-0.026
 playing-0.025
e-0.023
o-0.023
Token came
Token ID1625
Feature activation+3.07e+00

Pos logit contributions
 named+0.038
ind+0.037
e+0.035
asc+0.034
aky+0.034
Neg logit contributions
der-0.037
bread-0.034
 horizon-0.034
rier-0.033
 less-0.033
Token back
Token ID736
Feature activation+2.60e+00

Pos logit contributions
 named+0.077
e+0.069
ind+0.069
aky+0.067
ek+0.066
Neg logit contributions
der-0.047
rier-0.046
 horizon-0.042
 less-0.042
 Which-0.041
Token with
Token ID351
Feature activation+1.89e+00

Pos logit contributions
 named+0.053
aky+0.050
e+0.049
ind+0.049
P+0.048
Neg logit contributions
der-0.048
rier-0.045
 less-0.044
 horizon-0.044
 maybe-0.044
Token the
Token ID262
Feature activation-1.58e+00

Pos logit contributions
 named+0.033
ind+0.028
e+0.028
o+0.028
a+0.027
Neg logit contributions
der-0.044
rier-0.043
umm-0.041
lights-0.040
fulness-0.040
Token sp
Token ID599
Feature activation-4.03e-01

Pos logit contributions
der+0.038
rier+0.037
lights+0.035
fulness+0.035
umm+0.035
Neg logit contributions
 named-0.028
o-0.023
 walking-0.023
e-0.023
ind-0.023
Tokenade
Token ID671
Feature activation+3.05e-01

Pos logit contributions
der+0.008
rier+0.008
umm+0.007
 Which+0.007
bread+0.007
Neg logit contributions
 named-0.008
e-0.007
o-0.007
ind-0.007
 walking-0.007
Token started
Token ID2067
Feature activation+2.63e+00

Pos logit contributions
 named+0.009
asc+0.009
ind+0.009
e+0.008
ippy+0.008
Neg logit contributions
der-0.011
 maybe-0.009
rier-0.009
 horizon-0.009
bread-0.009
Token to
Token ID284
Feature activation+1.54e+00

Pos logit contributions
 named+0.061
aky+0.058
ind+0.058
e+0.057
o+0.055
Neg logit contributions
der-0.044
rier-0.040
 less-0.039
 maybe-0.037
 horizon-0.037
Token weep
Token ID49671
Feature activation+2.58e-01

Pos logit contributions
 named+0.036
ind+0.035
o+0.034
aky+0.034
ippy+0.034
Neg logit contributions
 maybe-0.024
der-0.024
 perfection-0.023
 less-0.022
rier-0.022
Token.
Token ID13
Feature activation-3.14e+00

Pos logit contributions
 named+0.004
e+0.004
o+0.004
ind+0.004
 walking+0.004
Neg logit contributions
der-0.006
rier-0.006
umm-0.006
 horizon-0.006
bread-0.006
Token She
Token ID1375
Feature activation+6.14e-01

Pos logit contributions
rier+0.071
bread+0.066
umm+0.066
der+0.065
lights+0.062
Neg logit contributions
 named-0.066
 walking-0.056
o-0.055
a-0.055
 playing-0.054
Token was
Token ID373
Feature activation+4.47e+00

Pos logit contributions
 named+0.012
o+0.011
ind+0.011
aking+0.011
asc+0.011
Neg logit contributions
der-0.013
 maybe-0.011
rier-0.011
 less-0.011
lights-0.011
Token so
Token ID523
Feature activation+3.04e+00

Pos logit contributions
P+0.087
o+0.084
ind+0.082
ippy+0.082
aking+0.081
Neg logit contributions
der-0.089
 less-0.083
rier-0.081
 maybe-0.080
 brighter-0.077
Token sad
Token ID6507
Feature activation+3.61e-02

Pos logit contributions
 named+0.055
o+0.053
ippy+0.053
ind+0.051
e+0.051
Neg logit contributions
der-0.064
rier-0.061
 maybe-0.059
 less-0.059
 Someone-0.057
Token because
Token ID780
Feature activation-1.52e+00

Pos logit contributions
 named+0.001
 walking+0.000
o+0.000
e+0.000
ind+0.000
Neg logit contributions
der-0.001
rier-0.001
umm-0.001
bread-0.001
 Which-0.001
Token she
Token ID673
Feature activation+4.42e-01

Pos logit contributions
der+0.026
rier+0.026
umm+0.024
bread+0.024
 Which+0.023
Neg logit contributions
 named-0.036
e-0.032
o-0.032
 walking-0.032
ind-0.032
Token knew
Token ID2993
Feature activation-1.63e+00

Pos logit contributions
aking+0.009
asc+0.009
 named+0.009
ippy+0.009
oo+0.009
Neg logit contributions
der-0.008
 maybe-0.007
 becomes-0.006
 less-0.006
 perfection-0.006
Token with
Token ID351
Feature activation-1.32e+00

Pos logit contributions
der+0.008
rier+0.008
 maybe+0.007
 less+0.007
 Which+0.007
Neg logit contributions
 named-0.006
ind-0.006
o-0.006
aky-0.005
a-0.005
Token him
Token ID683
Feature activation-4.60e-01

Pos logit contributions
der+0.026
rier+0.025
 maybe+0.023
lights+0.023
bread+0.023
Neg logit contributions
 named-0.028
o-0.025
aky-0.024
e-0.024
ind-0.024
Token anymore
Token ID7471
Feature activation-7.21e-01

Pos logit contributions
der+0.009
rier+0.008
 less+0.008
 maybe+0.008
fulness+0.007
Neg logit contributions
 named-0.010
ind-0.009
e-0.009
o-0.009
aky-0.009
Token.
Token ID13
Feature activation-3.49e+00

Pos logit contributions
der+0.017
rier+0.016
lights+0.016
umm+0.016
fulness+0.016
Neg logit contributions
 named-0.012
e-0.011
ind-0.011
o-0.011
 walking-0.010
Token He
Token ID679
Feature activation-4.72e-01

Pos logit contributions
rier+0.070
umm+0.066
der+0.066
bread+0.064
fulness+0.063
Neg logit contributions
 named-0.083
o-0.071
e-0.067
 playing-0.066
 walking-0.065
Token is
Token ID318
Feature activation+2.82e+00

Pos logit contributions
der+0.008
rier+0.007
 maybe+0.007
 becomes+0.007
 less+0.007
Neg logit contributions
 named-0.011
o-0.010
ind-0.010
aky-0.010
e-0.010
Token mean
Token ID1612
Feature activation-5.13e-01

Pos logit contributions
ippy+0.051
ind+0.048
ek+0.048
e+0.048
iley+0.048
Neg logit contributions
 less-0.058
rier-0.055
der-0.055
 lighter-0.055
 brighter-0.052
Token and
Token ID290
Feature activation-1.37e+00

Pos logit contributions
der+0.012
rier+0.011
lights+0.010
fulness+0.010
umm+0.010
Neg logit contributions
 named-0.010
e-0.009
ind-0.008
o-0.008
 walking-0.008
Token rude
Token ID22066
Feature activation-1.58e+00

Pos logit contributions
der+0.035
rier+0.034
 less+0.033
 maybe+0.032
 Which+0.032
Neg logit contributions
 named-0.020
e-0.019
ind-0.019
ippy-0.018
o-0.017
Token.
Token ID13
Feature activation-3.73e+00

Pos logit contributions
der+0.039
rier+0.038
umm+0.036
 horizon+0.036
bread+0.036
Neg logit contributions
 named-0.026
e-0.022
o-0.022
ind-0.022
 walking-0.022
Token\n
Token ID198
Feature activation+2.01e+00

Pos logit contributions
rier+0.072
umm+0.071
der+0.070
 horizon+0.068
bread+0.068
Neg logit contributions
 named-0.088
o-0.077
e-0.072
 playing-0.070
 walking-0.069
Token use
Token ID779
Feature activation-1.33e+00

Pos logit contributions
der+0.013
rier+0.012
fulness+0.012
 horizon+0.012
umm+0.012
Neg logit contributions
 named-0.015
o-0.014
ind-0.014
aky-0.014
e-0.013
Token.
Token ID13
Feature activation-2.25e+00

Pos logit contributions
der+0.029
rier+0.028
umm+0.027
bread+0.026
 horizon+0.026
Neg logit contributions
 named-0.026
e-0.023
o-0.022
ind-0.022
 walking-0.022
Token Every
Token ID3887
Feature activation+8.31e-01

Pos logit contributions
rier+0.040
der+0.039
bread+0.037
umm+0.036
lights+0.036
Neg logit contributions
 named-0.053
o-0.048
 playing-0.047
 walking-0.047
e-0.045
Token day
Token ID1110
Feature activation+1.63e+00

Pos logit contributions
 named+0.016
ind+0.016
ek+0.015
aky+0.015
asc+0.015
Neg logit contributions
der-0.016
 horizon-0.015
 lesson-0.015
fulness-0.015
 Which-0.015
Token,
Token ID11
Feature activation-2.24e+00

Pos logit contributions
 named+0.021
ind+0.017
e+0.017
o+0.017
aky+0.017
Neg logit contributions
der-0.046
rier-0.044
umm-0.043
fulness-0.043
 Which-0.043
Token Jimmy
Token ID12963
Feature activation+2.02e+00

Pos logit contributions
der+0.037
rier+0.036
umm+0.035
bread+0.034
 horizon+0.034
Neg logit contributions
 named-0.055
 walking-0.049
 playing-0.049
e-0.049
o-0.048
Token went
Token ID1816
Feature activation+3.95e+00

Pos logit contributions
asc+0.043
ind+0.043
ippy+0.043
oo+0.042
 named+0.042
Neg logit contributions
der-0.031
 becomes-0.030
 maybe-0.029
bread-0.029
 less-0.028
Token out
Token ID503
Feature activation+1.80e+00

Pos logit contributions
 named+0.085
e+0.085
aky+0.083
ek+0.082
a+0.081
Neg logit contributions
 less-0.064
der-0.064
 Which-0.062
 maybe-0.061
 horizon-0.060
Token to
Token ID284
Feature activation+2.82e-01

Pos logit contributions
 named+0.038
ind+0.034
e+0.034
o+0.033
ek+0.033
Neg logit contributions
der-0.035
rier-0.033
lights-0.031
 maybe-0.031
 Which-0.031
Token try
Token ID1949
Feature activation-4.26e-02

Pos logit contributions
 named+0.005
aky+0.005
o+0.005
ind+0.005
a+0.005
Neg logit contributions
der-0.006
 maybe-0.006
 perfection-0.006
 Which-0.006
 horizon-0.005
Token and
Token ID290
Feature activation-9.59e-01

Pos logit contributions
umm+0.001
rier+0.001
der+0.001
bread+0.001
lights+0.001
Neg logit contributions
 named-0.001
 walking-0.001
 playing-0.001
e-0.001
ind-0.001
Token took
Token ID1718
Feature activation-8.84e-02

Pos logit contributions
 named+0.027
e+0.024
ind+0.024
o+0.023
 walking+0.023
Neg logit contributions
der-0.035
rier-0.033
 Which-0.031
lights-0.031
umm-0.031
Token the
Token ID262
Feature activation-3.05e+00

Pos logit contributions
der+0.002
rier+0.002
umm+0.002
 horizon+0.002
fulness+0.002
Neg logit contributions
 named-0.002
e-0.001
o-0.001
 walking-0.001
ind-0.001
Token cushion
Token ID33579
Feature activation-7.10e-01

Pos logit contributions
der+0.068
rier+0.066
umm+0.063
 horizon+0.062
bread+0.062
Neg logit contributions
 named-0.058
e-0.050
o-0.050
ind-0.049
 walking-0.049
Token from
Token ID422
Feature activation+8.06e-01

Pos logit contributions
der+0.016
rier+0.016
umm+0.015
fulness+0.015
 Which+0.015
Neg logit contributions
 named-0.013
e-0.011
o-0.011
 walking-0.011
ind-0.011
Token them
Token ID606
Feature activation-1.56e-01

Pos logit contributions
 named+0.017
e+0.015
o+0.015
ind+0.015
aky+0.015
Neg logit contributions
der-0.016
rier-0.016
umm-0.015
 horizon-0.015
 Which-0.014
Token and
Token ID290
Feature activation-1.83e+00

Pos logit contributions
der+0.003
rier+0.003
 less+0.003
fulness+0.003
 maybe+0.003
Neg logit contributions
 named-0.003
ind-0.003
e-0.003
aky-0.003
o-0.003
Token put
Token ID1234
Feature activation-1.65e-01

Pos logit contributions
der+0.046
rier+0.046
umm+0.044
 horizon+0.043
fulness+0.043
Neg logit contributions
 named-0.031
e-0.025
o-0.025
 walking-0.025
 playing-0.024
Token it
Token ID340
Feature activation+3.82e-02

Pos logit contributions
der+0.003
umm+0.003
bread+0.003
rier+0.003
 horizon+0.003
Neg logit contributions
 named-0.004
e-0.004
 walking-0.003
o-0.003
 playing-0.003
Token on
Token ID319
Feature activation-2.31e-01

Pos logit contributions
 named+0.001
ind+0.001
e+0.001
iley+0.001
o+0.001
Neg logit contributions
der-0.001
rier-0.001
lights-0.001
 less-0.001
fulness-0.001
Token the
Token ID262
Feature activation-8.48e-01

Pos logit contributions
der+0.005
rier+0.005
umm+0.005
bread+0.005
lights+0.005
Neg logit contributions
 named-0.004
e-0.004
o-0.004
ind-0.004
 walking-0.004
Token couch
Token ID18507
Feature activation-4.34e-01

Pos logit contributions
der+0.017
rier+0.016
umm+0.016
bread+0.016
fulness+0.016
Neg logit contributions
 named-0.017
 walking-0.015
e-0.015
 playing-0.015
ind-0.015
Token brightly
Token ID35254
Feature activation+1.96e+00

Pos logit contributions
der+0.086
rier+0.084
umm+0.080
 horizon+0.080
lights+0.080
Neg logit contributions
 named-0.074
e-0.066
o-0.065
ind-0.065
aky-0.063
Token-
Token ID12
Feature activation+6.42e-01

Pos logit contributions
 named+0.034
aky+0.034
ind+0.032
ek+0.031
a+0.031
Neg logit contributions
der-0.041
lights-0.041
 horizon-0.040
fulness-0.039
rier-0.039
Tokencolored
Token ID25717
Feature activation-4.27e+00

Pos logit contributions
 named+0.013
 walking+0.012
 playing+0.012
 hiding+0.012
aky+0.012
Neg logit contributions
bread-0.012
rier-0.012
der-0.011
cakes-0.011
fulness-0.011
Token ball
Token ID2613
Feature activation-2.44e-02

Pos logit contributions
der+0.089
rier+0.086
umm+0.082
 horizon+0.081
bread+0.081
Neg logit contributions
 named-0.085
e-0.076
o-0.075
ind-0.075
aky-0.074
Token.
Token ID13
Feature activation-4.09e+00

Pos logit contributions
der+0.001
 horizon+0.000
rier+0.000
umm+0.000
lights+0.000
Neg logit contributions
ind-0.000
aky-0.000
 named-0.000
e-0.000
ek-0.000
Token The
Token ID383
Feature activation-2.50e+00

Pos logit contributions
rier+0.112
der+0.101
bread+0.100
fulness+0.099
umm+0.098
Neg logit contributions
 named-0.074
o-0.062
 walking-0.056
 playing-0.056
 called-0.053
Token children
Token ID1751
Feature activation+2.13e+00

Pos logit contributions
der+0.051
rier+0.051
 horizon+0.049
umm+0.048
lights+0.047
Neg logit contributions
 named-0.049
e-0.045
o-0.045
ind-0.045
ek-0.043
Token cheered
Token ID34767
Feature activation+1.00e+00

Pos logit contributions
ind+0.045
 named+0.042
P+0.042
aky+0.041
asc+0.041
Neg logit contributions
der-0.039
 maybe-0.036
fulness-0.034
 horizon-0.034
rier-0.033
Token and
Token ID290
Feature activation-2.59e-01

Pos logit contributions
 named+0.019
e+0.017
ind+0.017
o+0.017
aky+0.016
Neg logit contributions
der-0.022
rier-0.021
fulness-0.020
lights-0.020
umm-0.020
Token ran
Token ID4966
Feature activation+3.03e+00

Pos logit contributions
der+0.006
rier+0.006
umm+0.006
 horizon+0.006
bread+0.006
Neg logit contributions
 named-0.004
ind-0.004
e-0.004
o-0.004
aky-0.004
Token to
Token ID284
Feature activation+3.71e-01

Pos logit contributions
 named+0.073
aky+0.072
ind+0.069
e+0.067
adow+0.066
Neg logit contributions
rier-0.045
der-0.044
 less-0.043
fulness-0.042
 horizon-0.042
Token bit
Token ID1643
Feature activation-3.39e-01

Pos logit contributions
der+0.008
rier+0.008
umm+0.007
lights+0.007
 horizon+0.007
Neg logit contributions
 named-0.009
e-0.008
o-0.008
ind-0.008
 walking-0.008
Token,
Token ID11
Feature activation-3.82e+00

Pos logit contributions
der+0.006
rier+0.006
 less+0.006
lights+0.006
umm+0.006
Neg logit contributions
 named-0.008
e-0.007
ind-0.007
 walking-0.007
o-0.007
Token but
Token ID475
Feature activation-2.33e+00

Pos logit contributions
umm+0.084
der+0.081
rier+0.080
 horizon+0.079
rays+0.077
Neg logit contributions
 named-0.080
 playing-0.067
 hiding-0.061
 called-0.061
 running-0.060
Token then
Token ID788
Feature activation-3.19e+00

Pos logit contributions
bread+0.052
rier+0.050
der+0.050
umm+0.049
lights+0.047
Neg logit contributions
 named-0.050
 playing-0.045
 hiding-0.042
 walking-0.042
 called-0.041
Token we
Token ID356
Feature activation-1.53e-01

Pos logit contributions
der+0.049
rier+0.048
umm+0.047
 horizon+0.044
lights+0.043
Neg logit contributions
 named-0.086
 playing-0.077
 walking-0.076
e-0.074
a-0.073
Token have
Token ID423
Feature activation-1.11e-01

Pos logit contributions
der+0.002
rier+0.002
umm+0.002
 horizon+0.002
 Which+0.002
Neg logit contributions
 named-0.004
ind-0.003
e-0.003
o-0.003
 walking-0.003
Token to
Token ID284
Feature activation+1.31e+00

Pos logit contributions
rier+0.003
der+0.003
umm+0.003
fulness+0.002
lights+0.002
Neg logit contributions
 named-0.002
 walking-0.002
 playing-0.002
o-0.002
e-0.002
Token play
Token ID711
Feature activation-8.12e-02

Pos logit contributions
 named+0.030
o+0.028
ind+0.028
aky+0.027
e+0.027
Neg logit contributions
der-0.021
rier-0.021
 less-0.020
 horizon-0.020
 Which-0.019
Token tag
Token ID7621
Feature activation-1.95e+00

Pos logit contributions
der+0.002
rier+0.002
umm+0.002
 horizon+0.002
lights+0.002
Neg logit contributions
 named-0.001
e-0.001
o-0.001
ind-0.001
 walking-0.001
Token."
Token ID526
Feature activation-2.06e+00

Pos logit contributions
der+0.043
rier+0.042
umm+0.041
 horizon+0.040
bread+0.040
Neg logit contributions
 named-0.037
e-0.032
o-0.031
ind-0.031
 walking-0.031
Token
Token ID220
Feature activation+6.98e-01

Pos logit contributions
rier+0.049
der+0.047
umm+0.044
 horizon+0.043
bread+0.042
Neg logit contributions
 named-0.046
o-0.041
 playing-0.036
e-0.036
 walking-0.035
Token to
Token ID284
Feature activation+9.66e-01

Pos logit contributions
 named+0.094
e+0.090
aky+0.087
ind+0.087
o+0.084
Neg logit contributions
der-0.080
rier-0.078
 less-0.076
 horizon-0.076
 Which-0.073
Token ask
Token ID1265
Feature activation-5.22e-01

Pos logit contributions
 named+0.020
e+0.017
o+0.017
ind+0.017
aky+0.017
Neg logit contributions
der-0.020
rier-0.019
umm-0.018
 horizon-0.018
 Which-0.018
Token his
Token ID465
Feature activation-6.56e-01

Pos logit contributions
der+0.012
umm+0.011
rier+0.011
bread+0.011
 horizon+0.011
Neg logit contributions
 named-0.010
 walking-0.009
e-0.008
 playing-0.008
o-0.008
Token mom
Token ID1995
Feature activation+5.75e-01

Pos logit contributions
der+0.012
rier+0.012
umm+0.011
 horizon+0.011
bread+0.011
Neg logit contributions
 named-0.015
e-0.013
o-0.013
ind-0.013
 walking-0.013
Token what
Token ID644
Feature activation-4.38e-01

Pos logit contributions
 named+0.010
o+0.009
e+0.009
 walking+0.009
ind+0.008
Neg logit contributions
der-0.013
rier-0.013
umm-0.013
lights-0.012
 horizon-0.012
Token he
Token ID339
Feature activation+2.86e-01

Pos logit contributions
der+0.005
 maybe+0.005
 Which+0.005
rier+0.004
fulness+0.004
Neg logit contributions
 named-0.012
ind-0.012
aky-0.012
e-0.012
aking-0.011
Token should
Token ID815
Feature activation+1.11e+00

Pos logit contributions
 named+0.006
ind+0.005
e+0.005
aky+0.005
oo+0.005
Neg logit contributions
der-0.006
 maybe-0.005
 Which-0.005
fulness-0.005
bread-0.005
Token do
Token ID466
Feature activation+4.77e-01

Pos logit contributions
 named+0.021
e+0.018
o+0.017
 walking+0.017
ind+0.017
Neg logit contributions
der-0.025
rier-0.025
umm-0.024
bread-0.023
 horizon-0.023
Token.
Token ID13
Feature activation-2.56e+00

Pos logit contributions
 named+0.008
e+0.007
o+0.007
ind+0.007
 walking+0.007
Neg logit contributions
der-0.012
rier-0.011
umm-0.011
fulness-0.011
lights-0.011
Token She
Token ID1375
Feature activation+9.22e-01

Pos logit contributions
rier+0.050
der+0.047
lights+0.047
bread+0.046
fulness+0.046
Neg logit contributions
 named-0.060
o-0.054
 walking-0.049
 playing-0.049
e-0.047
Token said
Token ID531
Feature activation+3.33e-01

Pos logit contributions
 named+0.016
e+0.015
ind+0.014
o+0.014
aky+0.014
Neg logit contributions
der-0.021
rier-0.019
 maybe-0.019
bread-0.019
 Which-0.019
Token me
Token ID502
Feature activation+1.03e+00

Pos logit contributions
 named+0.037
e+0.033
o+0.032
 walking+0.031
ind+0.031
Neg logit contributions
der-0.045
rier-0.043
umm-0.041
lights-0.041
bread-0.040
Token try
Token ID1949
Feature activation+9.83e-01

Pos logit contributions
 named+0.024
ind+0.023
aky+0.022
 walking+0.022
o+0.021
Neg logit contributions
der-0.016
rier-0.016
 Which-0.015
 maybe-0.015
 less-0.015
Token!"
Token ID2474
Feature activation-8.69e-01

Pos logit contributions
 named+0.021
o+0.018
e+0.018
 walking+0.018
 playing+0.018
Neg logit contributions
der-0.020
rier-0.020
umm-0.020
lights-0.018
fulness-0.018
Token Anna
Token ID11735
Feature activation+3.64e+00

Pos logit contributions
der+0.019
rier+0.019
umm+0.018
 horizon+0.018
fulness+0.018
Neg logit contributions
 named-0.017
e-0.014
o-0.014
 walking-0.014
ind-0.014
Token says
Token ID1139
Feature activation+2.76e+00

Pos logit contributions
 named+0.093
ind+0.086
e+0.083
aky+0.082
o+0.082
Neg logit contributions
der-0.057
rier-0.053
 maybe-0.049
bread-0.048
 Which-0.047
Token.
Token ID13
Feature activation-5.50e-01

Pos logit contributions
 named+0.051
ind+0.046
e+0.044
aky+0.043
o+0.043
Neg logit contributions
der-0.063
rier-0.061
fulness-0.057
 maybe-0.056
bread-0.056
Token She
Token ID1375
Feature activation+1.33e+00

Pos logit contributions
umm+0.012
lights+0.012
der+0.012
 perfection+0.011
rier+0.011
Neg logit contributions
 named-0.011
 walking-0.010
o-0.010
 playing-0.009
a-0.009
Token picks
Token ID11103
Feature activation+7.69e-01

Pos logit contributions
 named+0.031
e+0.028
o+0.028
 walking+0.028
ind+0.027
Neg logit contributions
der-0.023
rier-0.022
umm-0.021
fulness-0.021
 horizon-0.021
Token up
Token ID510
Feature activation-1.43e+00

Pos logit contributions
 named+0.015
e+0.013
o+0.013
 walking+0.013
ind+0.012
Neg logit contributions
der-0.017
rier-0.016
umm-0.016
lights-0.016
 horizon-0.016
Token a
Token ID257
Feature activation-1.37e+00

Pos logit contributions
der+0.025
 less+0.023
rier+0.023
 maybe+0.022
lights+0.022
Neg logit contributions
 named-0.033
aky-0.031
ind-0.031
o-0.031
asc-0.030
Token bigger
Token ID5749
Feature activation-2.36e+00

Pos logit contributions
der+0.027
rier+0.026
bread+0.025
 horizon+0.025
 Which+0.024
Neg logit contributions
 named-0.028
e-0.026
ind-0.026
o-0.025
 walking-0.025
Token.
Token ID13
Feature activation-3.70e+00

Pos logit contributions
der+0.044
rier+0.042
bread+0.039
lights+0.039
umm+0.039
Neg logit contributions
 named-0.034
ind-0.031
e-0.030
o-0.030
aky-0.029
Token She
Token ID1375
Feature activation-1.02e+00

Pos logit contributions
rier+0.086
fulness+0.083
umm+0.083
lights+0.081
 horizon+0.081
Neg logit contributions
 named-0.067
 playing-0.054
 walking-0.054
o-0.053
e-0.048
Token saw
Token ID2497
Feature activation-1.59e+00

Pos logit contributions
rier+0.026
umm+0.026
der+0.025
 horizon+0.025
fulness+0.025
Neg logit contributions
 named-0.017
o-0.014
e-0.014
a-0.013
aky-0.013
Token him
Token ID683
Feature activation-7.36e-01

Pos logit contributions
umm+0.037
der+0.037
rier+0.037
 horizon+0.036
fulness+0.035
Neg logit contributions
 named-0.027
 walking-0.023
 playing-0.023
e-0.022
o-0.022
Token fix
Token ID4259
Feature activation-2.04e+00

Pos logit contributions
der+0.017
rier+0.016
 maybe+0.016
 less+0.016
 brighter+0.015
Neg logit contributions
 named-0.012
ind-0.011
P-0.011
aky-0.011
asc-0.011
Token a
Token ID257
Feature activation-3.35e+00

Pos logit contributions
der+0.042
rier+0.041
umm+0.039
 horizon+0.038
bread+0.038
Neg logit contributions
 named-0.042
e-0.037
o-0.036
ind-0.036
 walking-0.036
Token bike
Token ID7161
Feature activation-1.31e+00

Pos logit contributions
der+0.074
rier+0.070
umm+0.069
fulness+0.069
lights+0.068
Neg logit contributions
 named-0.065
o-0.054
 walking-0.054
ind-0.054
a-0.054
Token with
Token ID351
Feature activation-2.38e+00

Pos logit contributions
der+0.030
rier+0.029
 horizon+0.027
umm+0.027
lights+0.027
Neg logit contributions
 named-0.023
ind-0.020
e-0.020
o-0.020
 walking-0.019
Token a
Token ID257
Feature activation-3.26e+00

Pos logit contributions
der+0.057
rier+0.056
umm+0.054
 horizon+0.053
bread+0.052
Neg logit contributions
 named-0.041
e-0.035
 walking-0.034
o-0.034
ind-0.033
Token broken
Token ID5445
Feature activation-4.26e+00

Pos logit contributions
der+0.075
umm+0.072
rier+0.071
fulness+0.071
lights+0.070
Neg logit contributions
 named-0.060
 walking-0.051
a-0.050
 playing-0.050
e-0.049
Token chain
Token ID6333
Feature activation-1.17e+00

Pos logit contributions
der+0.088
rier+0.085
umm+0.083
fulness+0.081
lights+0.081
Neg logit contributions
 named-0.087
e-0.075
o-0.074
 walking-0.074
ind-0.073
Token,
Token ID11
Feature activation-3.92e+00

Pos logit contributions
der+0.009
rier+0.009
umm+0.008
 horizon+0.008
lights+0.008
Neg logit contributions
 named-0.005
e-0.004
o-0.004
 walking-0.004
ind-0.004
Token it
Token ID340
Feature activation-1.41e+00

Pos logit contributions
der+0.068
umm+0.068
rier+0.068
lights+0.063
 horizon+0.063
Neg logit contributions
 named-0.094
 walking-0.083
 playing-0.081
o-0.078
a-0.078
Token shook
Token ID14682
Feature activation-7.27e-01

Pos logit contributions
der+0.033
rier+0.033
umm+0.032
 Which+0.031
fulness+0.031
Neg logit contributions
 named-0.025
e-0.021
o-0.021
 walking-0.020
ind-0.020
Token and
Token ID290
Feature activation-2.12e+00

Pos logit contributions
der+0.016
rier+0.016
umm+0.016
bread+0.015
 horizon+0.015
Neg logit contributions
 named-0.013
e-0.012
o-0.011
ind-0.011
 walking-0.011
Token rattled
Token ID48588
Feature activation-4.46e-01

Pos logit contributions
der+0.051
rier+0.051
umm+0.049
bread+0.048
 Which+0.048
Neg logit contributions
 named-0.037
 walking-0.030
e-0.030
o-0.030
 playing-0.029
Token.
Token ID13
Feature activation-4.19e+00

Pos logit contributions
der+0.011
rier+0.010
umm+0.010
bread+0.010
 horizon+0.010
Neg logit contributions
 named-0.008
e-0.007
o-0.007
ind-0.006
 walking-0.006
Token
Token ID198
Feature activation+1.61e+00

Pos logit contributions
rier+0.069
lights+0.069
umm+0.066
der+0.065
fulness+0.064
Neg logit contributions
 named-0.104
 walking-0.088
 playing-0.088
o-0.086
oo-0.084
Token
Token ID198
Feature activation+1.45e+00

Pos logit contributions
 named+0.040
e+0.036
ind+0.036
o+0.036
 walking+0.035
Neg logit contributions
der-0.026
rier-0.024
umm-0.023
 Which-0.023
 horizon-0.022
Tokenâ
Token ID22940
Feature activation+3.40e+00

Pos logit contributions
 named+0.031
e+0.029
 walking+0.029
o+0.029
 hiding+0.028
Neg logit contributions
der-0.026
 Which-0.025
rier-0.024
bread-0.023
 less-0.023
Token
Token ID26391
Feature activation+1.24e+00

Pos logit contributions
 named+0.079
ind+0.073
 walking+0.071
aky+0.070
o+0.069
Neg logit contributions
der-0.057
fulness-0.055
 horizon-0.054
rier-0.053
umm-0.052
Token
Token ID129
Feature activation+4.54e+00

Pos logit contributions
 named+0.026
ind+0.023
aky+0.023
e+0.023
o+0.022
Neg logit contributions
der-0.024
rier-0.023
bread-0.022
 horizon-0.022
umm-0.022
Token the
Token ID262
Feature activation-1.78e+00

Pos logit contributions
der+0.024
rier+0.024
umm+0.023
 horizon+0.022
fulness+0.022
Neg logit contributions
 named-0.024
 walking-0.020
e-0.020
o-0.020
 playing-0.020
Token lady
Token ID10846
Feature activation+2.18e+00

Pos logit contributions
der+0.028
rier+0.027
bread+0.024
 horizon+0.024
umm+0.024
Neg logit contributions
 named-0.045
e-0.042
o-0.042
ind-0.041
aky-0.041
Token says
Token ID1139
Feature activation+1.51e+00

Pos logit contributions
 named+0.051
o+0.046
e+0.045
 walking+0.044
a+0.044
Neg logit contributions
der-0.039
rier-0.038
umm-0.037
fulness-0.036
 horizon-0.036
Token.
Token ID13
Feature activation-1.34e+00

Pos logit contributions
 named+0.023
e+0.019
 walking+0.019
o+0.019
 playing+0.018
Neg logit contributions
der-0.039
umm-0.037
rier-0.037
 horizon-0.037
 Which-0.036
Token "
Token ID366
Feature activation-5.76e-01

Pos logit contributions
umm+0.026
 perfection+0.025
rier+0.025
lights+0.024
fulness+0.024
Neg logit contributions
 named-0.032
 walking-0.027
o-0.026
 playing-0.026
 called-0.025
TokenThese
Token ID4711
Feature activation-4.96e+00

Pos logit contributions
rier+0.013
der+0.013
lights+0.013
umm+0.013
 horizon+0.013
Neg logit contributions
 named-0.011
e-0.010
o-0.009
aky-0.009
a-0.009
Token are
Token ID389
Feature activation-1.02e+00

Pos logit contributions
der+0.102
rier+0.099
lights+0.097
umm+0.096
fulness+0.094
Neg logit contributions
 named-0.100
e-0.090
 walking-0.087
ind-0.087
 playing-0.086
Token lovely
Token ID14081
Feature activation-2.96e+00

Pos logit contributions
der+0.022
rier+0.022
umm+0.020
 Which+0.020
bread+0.020
Neg logit contributions
 named-0.019
e-0.017
o-0.017
ind-0.017
 walking-0.016
Token leaves
Token ID5667
Feature activation-1.93e+00

Pos logit contributions
der+0.071
rier+0.070
 horizon+0.068
umm+0.067
fulness+0.067
Neg logit contributions
 named-0.049
 playing-0.041
 walking-0.041
e-0.040
ind-0.040
Token.
Token ID13
Feature activation-4.35e+00

Pos logit contributions
der+0.048
rier+0.047
umm+0.046
 Which+0.045
 horizon+0.045
Neg logit contributions
 named-0.031
e-0.026
o-0.026
ind-0.025
 walking-0.025
Token What
Token ID1867
Feature activation-1.15e+00

Pos logit contributions
rier+0.074
der+0.074
 horizon+0.073
lights+0.072
bread+0.069
Neg logit contributions
 named-0.113
e-0.094
o-0.092
oo-0.087
 playing-0.086
Token.
Token ID13
Feature activation-4.00e+00

Pos logit contributions
 named+0.012
e+0.010
o+0.010
ind+0.010
 walking+0.010
Neg logit contributions
der-0.018
rier-0.018
umm-0.017
bread-0.017
 Which-0.017
Token So
Token ID1406
Feature activation-1.60e+00

Pos logit contributions
rier+0.094
lights+0.092
bread+0.091
 hotter+0.087
der+0.085
Neg logit contributions
 named-0.073
 walking-0.061
 playing-0.060
o-0.056
ow-0.056
Token,
Token ID11
Feature activation-2.38e+00

Pos logit contributions
der+0.038
rier+0.037
umm+0.035
bread+0.035
lights+0.035
Neg logit contributions
 named-0.029
 walking-0.024
e-0.024
 playing-0.024
a-0.023
Token instead
Token ID2427
Feature activation+5.04e-01

Pos logit contributions
der+0.054
rier+0.053
umm+0.050
bread+0.050
lights+0.050
Neg logit contributions
 named-0.049
 walking-0.042
 playing-0.041
e-0.040
a-0.037
Token of
Token ID286
Feature activation+1.19e-02

Pos logit contributions
 named+0.010
ind+0.009
e+0.009
aky+0.009
o+0.009
Neg logit contributions
der-0.011
rier-0.010
fulness-0.010
 maybe-0.010
bread-0.010
Token folding
Token ID29909
Feature activation-3.15e+00

Pos logit contributions
 named+0.000
o+0.000
ind+0.000
aky+0.000
a+0.000
Neg logit contributions
der-0.000
 maybe-0.000
fulness-0.000
 less-0.000
lights-0.000
Token,
Token ID11
Feature activation-4.30e+00

Pos logit contributions
der+0.077
rier+0.073
umm+0.073
bread+0.072
 horizon+0.070
Neg logit contributions
 named-0.053
e-0.048
 walking-0.045
 playing-0.045
a-0.044
Token she
Token ID673
Feature activation-8.35e-01

Pos logit contributions
der+0.088
rier+0.086
umm+0.084
bread+0.081
 horizon+0.080
Neg logit contributions
 named-0.091
 walking-0.080
 playing-0.079
e-0.075
a-0.074
Token just
Token ID655
Feature activation-8.17e-01

Pos logit contributions
der+0.019
rier+0.018
umm+0.018
 horizon+0.017
bread+0.017
Neg logit contributions
 named-0.015
e-0.013
o-0.013
ind-0.013
 walking-0.013
Token watched
Token ID7342
Feature activation-5.09e-01

Pos logit contributions
der+0.019
rier+0.018
umm+0.017
 horizon+0.017
fulness+0.017
Neg logit contributions
 named-0.015
ind-0.013
e-0.013
o-0.012
aky-0.012
Token all
Token ID477
Feature activation+4.76e-01

Pos logit contributions
der+0.011
umm+0.011
rier+0.011
 Which+0.011
bread+0.011
Neg logit contributions
 named-0.009
e-0.008
 walking-0.008
 playing-0.008
ind-0.008
Token She
Token ID1375
Feature activation+1.91e+00

Pos logit contributions
der+0.018
 Which+0.016
umm+0.016
rier+0.016
 maybe+0.016
Neg logit contributions
 named-0.030
ind-0.029
e-0.028
ek-0.028
a-0.028
Token had
Token ID550
Feature activation+2.35e+00

Pos logit contributions
ippy+0.039
uff+0.039
ek+0.039
ow+0.039
owing+0.038
Neg logit contributions
 maybe-0.029
 But-0.028
 becomes-0.027
 lesson-0.027
Maybe-0.026
Token a
Token ID257
Feature activation-6.08e-01

Pos logit contributions
 named+0.045
ek+0.044
ind+0.043
o+0.042
aking+0.042
Neg logit contributions
 less-0.044
der-0.044
 Which-0.042
 maybe-0.041
 Someone-0.040
Token toy
Token ID13373
Feature activation-2.35e+00

Pos logit contributions
rier+0.009
der+0.009
 lesson+0.009
 Which+0.009
 less+0.009
Neg logit contributions
e-0.013
aking-0.013
ek-0.013
o-0.013
aky-0.012
Token laser
Token ID12855
Feature activation-7.15e-01

Pos logit contributions
der+0.050
rier+0.047
 Which+0.044
umm+0.043
 horizon+0.043
Neg logit contributions
 named-0.044
e-0.042
aking-0.042
aky-0.042
o-0.042
Token that
Token ID326
Feature activation-3.30e+00

Pos logit contributions
der+0.015
rier+0.013
bread+0.012
lights+0.012
umm+0.012
Neg logit contributions
aking-0.015
ek-0.014
aky-0.014
asc-0.014
e-0.014
Token she
Token ID673
Feature activation-6.68e-01

Pos logit contributions
der+0.064
rier+0.063
bread+0.059
umm+0.059
 horizon+0.058
Neg logit contributions
 named-0.076
o-0.066
e-0.066
 walking-0.065
 playing-0.064
Token loved
Token ID6151
Feature activation-5.89e-02

Pos logit contributions
 maybe+0.014
der+0.014
 becomes+0.013
 Which+0.013
 less+0.013
Neg logit contributions
adow-0.010
ek-0.009
aking-0.009
oo-0.009
ippy-0.009
Token to
Token ID284
Feature activation+1.83e-02

Pos logit contributions
der+0.001
rier+0.001
 Which+0.001
fulness+0.001
 maybe+0.001
Neg logit contributions
 named-0.001
e-0.001
ek-0.001
aky-0.001
ind-0.001
Token play
Token ID711
Feature activation+1.99e+00

Pos logit contributions
 named+0.000
ind+0.000
o+0.000
a+0.000
e+0.000
Neg logit contributions
der-0.000
rier-0.000
lights-0.000
fulness-0.000
 maybe-0.000
Token with
Token ID351
Feature activation+8.52e-01

Pos logit contributions
 named+0.033
o+0.030
ind+0.029
e+0.029
aky+0.028
Neg logit contributions
der-0.047
rier-0.046
 Which-0.043
fulness-0.043
lights-0.043