TOP ACTIVATIONS
MAX = 0.114

made snow men and had snowball fights . They used their
- looking crab said from nearby . " There is another
: " O w , ow , ow ! It hurts
happy to share and showed bear all about the different kinds
a tent . They told each other stories and jokes .
and lunch . They told each other what they did and
was unique . He had black and white spots all over
. They write with cr ay ons on paper . They
" O w , ow !" Ben says . He
'm sorry " and hugged each other . They
, " O w , ow , ow . My arm
her mom . They used red juice and white powder and
I will write with cr ay ons . I will not
. M oral : Hon esty , generosity and
, " Let 's tell each other stories . I have
toys and draw with cr ay ons . One day ,
" O w , ow , ow !" she w
, drew pictures with cr ay ons , and built for
's okay " and hugged each other . Then they decided
had fun . They used red and yellow and blue and

MIN ACTIVATIONS
MIN = -0.132

. <|endoftext|> Peter was a mis chie vous 3 year old
, the squirrel got a mis chie vous look in its
baby . He was a mis chie vous little baby and
The town spe ople had to find a
while . Soon , the tad p oles became frogs and
sleep so that no more mis chie vous mice could come
But then , a mis chie vous rabbit came and
One day , a mis chie vous young boy decided
flower suddenly turned into a fa ery . The fa ery
can spell better than you " or " You spelled that
the mess and the qu arre ling twins . She is
Suddenly , a mis chie vous bird flew by
a sweet tune , a mis chie vous fox appeared .
. One day , a mis sel went into the store
looked at him with a mis chie vous grin . She
the torn book and the unw ra pped box . She
! I 'm the best gol fer in the world !"
! I 'm the best gol fer in the world and
grass around the hill was wit hered and yellow . He
best friend a long time ago . He was my dog

INTERVAL 0.053 - 0.114
CONTAINS 0.551%

toys . <|endoftext|> Once upon a time , there was a
park . <|endoftext|> Once upon a time , there was a
his mouth and showed his teeth . L ila and Ben
. She said , " Oh Jack , these are beautiful
" O w , ow , ow ! It hurts

INTERVAL -0.009 - 0.053
CONTAINS 67.949%

fun dream . She told her mom my all about it
and asked her friends . They said that someone had taken
. The dentist smiled and said , " Hello , Sara
take a shower . I want to play with my toys
the bad man with his lawyer . As the judge read

INTERVAL -0.070 - -0.009
CONTAINS 31.455%

. The boy promised to come back soon ,
He threw it as far as he could . The bird
find him . He was all alone .
, her aunt came to her room . She saw Lily
beautiful and everyone was happy with the fair way things were

INTERVAL -0.132 - -0.070
CONTAINS 0.044%

something new - there were tad p oles swimming around in
. He wondered what was going to happen .
, okay ?" the zo oke eper said .
every day and touch the terr ac otta pot . Maybe
a ship that sank long ago . She said they could
Token made
Token ID925
Feature activation+2.29e-04

Pos logit contributions
ses+0.004
 names+0.004
 trips+0.004
sed+0.004
 imagination+0.003
Neg logit contributions
ge-0.005
ters-0.005
is-0.005
lling-0.005
line-0.004
Token snow
Token ID6729
Feature activation+3.75e-02

Pos logit contributions
ses+0.000
sed+0.000
 dictionary+0.000
 names+0.000
tr+0.000
Neg logit contributions
ge-0.000
ters-0.000
is-0.000
lling-0.000
rier-0.000
Tokenmen
Token ID3653
Feature activation+9.77e-03

Pos logit contributions
ses+0.053
 trips+0.052
 names+0.052
sed+0.051
 imagination+0.050
Neg logit contributions
ge-0.075
ters-0.068
is-0.065
rier-0.064
ising-0.062
Token and
Token ID290
Feature activation+7.75e-03

Pos logit contributions
 names+0.017
 trips+0.017
 spears+0.016
 imagination+0.016
 favorite+0.016
Neg logit contributions
ge-0.017
rier-0.014
ters-0.014
is-0.014
chie-0.014
Token had
Token ID550
Feature activation+3.11e-02

Pos logit contributions
ses+0.011
sed+0.011
 trips+0.010
 names+0.010
 meanings+0.010
Neg logit contributions
ge-0.015
ters-0.014
is-0.013
ter-0.013
line-0.013
Token snowball
Token ID46275
Feature activation+1.14e-01

Pos logit contributions
ses+0.056
sed+0.054
 dictionary+0.050
 meanings+0.050
wl+0.049
Neg logit contributions
ters-0.046
ge-0.045
is-0.043
lling-0.041
Sus-0.040
Token fights
Token ID11418
Feature activation+1.31e-02

Pos logit contributions
 dictionary+0.149
 vitamin+0.148
 meanings+0.145
ses+0.144
 imagination+0.144
Neg logit contributions
ge-0.225
ters-0.201
is-0.201
ising-0.179
 longer-0.172
Token.
Token ID13
Feature activation+1.05e-02

Pos logit contributions
ses+0.029
sed+0.028
 imagination+0.028
 names+0.028
 imag+0.027
Neg logit contributions
ge-0.016
ters-0.015
is-0.013
rier-0.012
lling-0.012
Token They
Token ID1119
Feature activation+6.72e-03

Pos logit contributions
ses+0.020
sed+0.019
 imag+0.019
 spears+0.019
 trips+0.018
Neg logit contributions
ters-0.014
ge-0.013
is-0.012
line-0.012
Sus-0.012
Token used
Token ID973
Feature activation+1.34e-03

Pos logit contributions
ses+0.010
 trips+0.010
 names+0.010
sed+0.010
 imagination+0.009
Neg logit contributions
ge-0.013
ters-0.012
is-0.011
lling-0.011
ter-0.011
Token their
Token ID511
Feature activation+3.38e-02

Pos logit contributions
ses+0.002
sed+0.002
wl+0.002
 meanings+0.002
 dictionary+0.002
Neg logit contributions
ge-0.002
ters-0.002
is-0.002
 longer-0.002
lling-0.002
Token-
Token ID12
Feature activation-4.88e-03

Pos logit contributions
ge+0.026
ters+0.024
lling+0.022
mine+0.021
Sus+0.021
Neg logit contributions
 tum-0.018
 imagination-0.017
 names-0.017
ses-0.017
 trips-0.016
Tokenlooking
Token ID11534
Feature activation+1.24e-03

Pos logit contributions
ge+0.009
ters+0.008
is+0.007
rier+0.007
lling+0.007
Neg logit contributions
ses-0.008
 trips-0.008
 names-0.008
sed-0.008
 imagination-0.007
Token crab
Token ID32202
Feature activation+4.06e-03

Pos logit contributions
 tum+0.002
sed+0.002
 names+0.002
ses+0.002
 barrel+0.002
Neg logit contributions
ge-0.003
ters-0.002
lling-0.002
ner-0.002
Sus-0.002
Token said
Token ID531
Feature activation-2.87e-03

Pos logit contributions
 trips+0.006
sed+0.006
 lived+0.006
ses+0.006
 names+0.006
Neg logit contributions
ge-0.007
rier-0.007
ters-0.007
is-0.006
Sus-0.006
Token from
Token ID422
Feature activation+9.50e-03

Pos logit contributions
ge+0.005
is+0.005
ters+0.005
lling+0.005
ising+0.004
Neg logit contributions
 trips-0.004
 names-0.004
ses-0.004
sed-0.004
 tum-0.004
Token nearby
Token ID6716
Feature activation+9.99e-02

Pos logit contributions
ses+0.015
 trips+0.015
 names+0.015
sed+0.015
 imagination+0.015
Neg logit contributions
ge-0.017
ters-0.015
is-0.015
rier-0.014
lling-0.014
Token.
Token ID13
Feature activation+2.30e-02

Pos logit contributions
 trips+0.158
 names+0.148
 tum+0.147
 barrel+0.141
 spears+0.138
Neg logit contributions
ge-0.193
Sus-0.159
ters-0.156
lling-0.156
chie-0.153
Token "
Token ID366
Feature activation-7.25e-03

Pos logit contributions
ses+0.019
sed+0.019
 names+0.018
 Rex+0.017
 trips+0.017
Neg logit contributions
ge-0.061
lling-0.055
is-0.054
ters-0.054
ising-0.054
TokenThere
Token ID1858
Feature activation-3.01e-02

Pos logit contributions
ge+0.014
rier+0.013
is+0.013
ising+0.012
rily+0.012
Neg logit contributions
 names-0.009
tr-0.009
 trips-0.009
ses-0.008
 Rex-0.008
Token is
Token ID318
Feature activation+1.89e-02

Pos logit contributions
ge+0.066
ters+0.060
is+0.058
rier+0.057
ising+0.056
Neg logit contributions
ses-0.036
 trips-0.035
 names-0.035
sed-0.035
 imagination-0.033
Token another
Token ID1194
Feature activation-3.90e-03

Pos logit contributions
ses+0.024
 names+0.023
sed+0.023
 trips+0.023
 imagination+0.022
Neg logit contributions
ge-0.039
ters-0.037
is-0.034
rier-0.034
ter-0.034
Token:
Token ID25
Feature activation+7.37e-02

Pos logit contributions
ses+0.044
 habitats+0.043
 trips+0.043
 names+0.042
 meanings+0.042
Neg logit contributions
ge-0.028
rier-0.027
ters-0.026
ter-0.024
is-0.023
Token "
Token ID366
Feature activation+9.19e-03

Pos logit contributions
 trips+0.098
 names+0.096
ses+0.095
 spears+0.094
 imag+0.094
Neg logit contributions
ge-0.144
ters-0.131
is-0.126
rier-0.125
line-0.123
TokenO
Token ID46
Feature activation+2.63e-02

Pos logit contributions
 trips+0.012
 Rex+0.011
ses+0.011
 names+0.011
 tum+0.011
Neg logit contributions
ge-0.018
is-0.017
rier-0.017
rying-0.017
ters-0.017
Tokenw
Token ID86
Feature activation+6.09e-02

Pos logit contributions
 trips+0.028
ses+0.028
 names+0.027
sed+0.027
 imagination+0.025
Neg logit contributions
ge-0.062
ters-0.057
is-0.055
rier-0.054
lling-0.053
Token,
Token ID11
Feature activation+3.62e-02

Pos logit contributions
ses+0.101
 names+0.100
 trips+0.099
sed+0.097
 imagination+0.097
Neg logit contributions
ge-0.106
ters-0.097
is-0.092
rier-0.088
ter-0.087
Token ow
Token ID12334
Feature activation+9.56e-02

Pos logit contributions
 names+0.030
 trips+0.030
ses+0.029
sed+0.028
 imagination+0.027
Neg logit contributions
ge-0.091
ters-0.085
is-0.082
rier-0.082
ter-0.080
Token,
Token ID11
Feature activation+2.36e-02

Pos logit contributions
ses+0.143
 names+0.142
 trips+0.141
sed+0.139
 imagination+0.135
Neg logit contributions
ge-0.185
ters-0.165
is-0.160
rier-0.154
ising-0.151
Token ow
Token ID12334
Feature activation+8.41e-02

Pos logit contributions
 trips+0.019
 names+0.019
ses+0.019
sed+0.018
 imagination+0.017
Neg logit contributions
ge-0.062
ters-0.057
is-0.055
rier-0.054
lling-0.053
Token!
Token ID0
Feature activation+2.85e-02

Pos logit contributions
 trips+0.153
ses+0.153
 names+0.152
sed+0.150
 imagination+0.146
Neg logit contributions
ge-0.134
ters-0.117
is-0.111
rier-0.108
lling-0.104
Token It
Token ID632
Feature activation+2.84e-02

Pos logit contributions
 trips+0.042
 names+0.041
ses+0.041
sed+0.040
 spears+0.039
Neg logit contributions
ge-0.054
ters-0.049
rier-0.046
is-0.046
ter-0.045
Token hurts
Token ID20406
Feature activation+4.35e-03

Pos logit contributions
ses+0.029
 names+0.029
 trips+0.029
sed+0.028
 imagination+0.027
Neg logit contributions
ge-0.068
ters-0.062
is-0.060
rier-0.059
lling-0.058
Token happy
Token ID3772
Feature activation-3.61e-02

Pos logit contributions
ge+0.005
ters+0.005
mine+0.004
is+0.004
ter+0.004
Neg logit contributions
 names-0.003
 trips-0.003
 imagination-0.003
sed-0.003
ses-0.003
Token to
Token ID284
Feature activation-1.88e-02

Pos logit contributions
ge+0.068
ters+0.058
is+0.057
rier+0.056
chie+0.056
Neg logit contributions
 trips-0.058
 names-0.058
 imagination-0.055
sed-0.054
ses-0.054
Token share
Token ID2648
Feature activation+1.76e-02

Pos logit contributions
ge+0.042
ters+0.039
rier+0.038
is+0.036
lling+0.036
Neg logit contributions
 names-0.025
 imagination-0.024
 trips-0.023
ses-0.022
icks-0.022
Token and
Token ID290
Feature activation-6.81e-03

Pos logit contributions
 names+0.031
 trips+0.031
ses+0.029
 imagination+0.029
 spears+0.029
Neg logit contributions
ge-0.030
ters-0.026
is-0.025
rier-0.024
lling-0.023
Token showed
Token ID3751
Feature activation-2.08e-03

Pos logit contributions
ge+0.014
ters+0.012
rier+0.012
is+0.012
lling+0.011
Neg logit contributions
 names-0.010
 trips-0.010
 imagination-0.010
 lived-0.009
ses-0.009
Token bear
Token ID6842
Feature activation+9.55e-02

Pos logit contributions
ge+0.004
ters+0.004
is+0.003
rier+0.003
ising+0.003
Neg logit contributions
ses-0.003
sed-0.003
 trips-0.003
 names-0.003
 imagination-0.003
Token all
Token ID477
Feature activation+1.74e-02

Pos logit contributions
ses+0.175
sed+0.170
 imag+0.162
 trips+0.161
 imagination+0.160
Neg logit contributions
ge-0.160
ters-0.148
is-0.137
rier-0.129
ter-0.127
Token about
Token ID546
Feature activation+8.25e-03

Pos logit contributions
 names+0.029
 trips+0.029
ses+0.028
 imagination+0.028
sed+0.028
Neg logit contributions
ge-0.030
ters-0.027
is-0.025
rier-0.025
ising-0.024
Token the
Token ID262
Feature activation-1.92e-02

Pos logit contributions
ses+0.013
 trips+0.013
 names+0.013
sed+0.013
 imagination+0.013
Neg logit contributions
ge-0.015
ters-0.013
is-0.013
rier-0.012
ising-0.012
Token different
Token ID1180
Feature activation-3.55e-02

Pos logit contributions
ge+0.040
ters+0.036
is+0.036
rier+0.034
ising+0.034
Neg logit contributions
ses-0.025
 trips-0.025
sed-0.025
 names-0.025
 imagination-0.023
Token kinds
Token ID6982
Feature activation-3.90e-02

Pos logit contributions
ge+0.073
ters+0.066
is+0.064
rier+0.062
ising+0.061
Neg logit contributions
ses-0.048
 trips-0.048
 names-0.047
sed-0.047
 imagination-0.045
Token a
Token ID257
Feature activation+7.03e-03

Pos logit contributions
ses+0.010
sed+0.010
 names+0.010
 trips+0.010
tr+0.010
Neg logit contributions
ge-0.012
ters-0.010
is-0.010
rier-0.009
ising-0.009
Token tent
Token ID11105
Feature activation+4.02e-03

Pos logit contributions
ses+0.010
sed+0.010
 names+0.010
 meanings+0.009
 trips+0.009
Neg logit contributions
ge-0.014
is-0.013
ters-0.012
rier-0.012
line-0.011
Token.
Token ID13
Feature activation+2.23e-02

Pos logit contributions
 trips+0.007
 names+0.007
ses+0.006
sed+0.006
 imagination+0.006
Neg logit contributions
ge-0.007
ters-0.006
is-0.006
rier-0.006
chie-0.006
Token They
Token ID1119
Feature activation+1.88e-02

Pos logit contributions
ses+0.037
sed+0.037
 trips+0.037
 imag+0.037
ats+0.035
Neg logit contributions
ters-0.034
ge-0.034
line-0.030
Sus-0.030
is-0.030
Token told
Token ID1297
Feature activation+2.03e-02

Pos logit contributions
ses+0.029
 trips+0.029
 names+0.029
sed+0.028
 imagination+0.027
Neg logit contributions
ge-0.035
ters-0.031
is-0.030
lling-0.028
rier-0.028
Token each
Token ID1123
Feature activation+9.32e-02

Pos logit contributions
ses+0.035
tr+0.034
sed+0.034
 trips+0.033
 barrel+0.033
Neg logit contributions
ge-0.030
ters-0.030
is-0.027
ter-0.026
rier-0.026
Token other
Token ID584
Feature activation+4.14e-02

Pos logit contributions
ses+0.153
sed+0.151
 trips+0.145
 names+0.140
 imag+0.140
Neg logit contributions
ge-0.169
ters-0.150
is-0.150
lling-0.138
line-0.135
Token stories
Token ID3923
Feature activation+1.54e-02

Pos logit contributions
ses+0.047
sed+0.046
 imag+0.041
 trips+0.040
tr+0.039
Neg logit contributions
ge-0.094
ters-0.086
is-0.083
line-0.080
lling-0.079
Token and
Token ID290
Feature activation+1.00e-02

Pos logit contributions
ses+0.026
 trips+0.026
sed+0.026
 names+0.026
 imagination+0.025
Neg logit contributions
ge-0.026
ters-0.023
is-0.022
rier-0.021
lling-0.020
Token jokes
Token ID14532
Feature activation+4.13e-02

Pos logit contributions
ses+0.012
 trips+0.012
sed+0.012
 names+0.012
tr+0.011
Neg logit contributions
ge-0.021
ters-0.019
is-0.018
lling-0.018
ising-0.018
Token.
Token ID13
Feature activation+2.79e-02

Pos logit contributions
ses+0.078
 trips+0.078
 names+0.077
sed+0.077
 imagination+0.075
Neg logit contributions
ge-0.062
ters-0.055
is-0.051
rier-0.049
lling-0.048
Token and
Token ID290
Feature activation-3.70e-03

Pos logit contributions
 trips+0.014
 names+0.013
ses+0.013
sed+0.013
 imagination+0.013
Neg logit contributions
ge-0.013
ters-0.011
rier-0.011
is-0.011
chie-0.010
Token lunch
Token ID9965
Feature activation-1.48e-03

Pos logit contributions
ge+0.008
ters+0.007
is+0.007
rier+0.007
ising+0.006
Neg logit contributions
 names-0.005
 trips-0.005
 imagination-0.005
ses-0.005
sed-0.005
Token.
Token ID13
Feature activation+2.00e-02

Pos logit contributions
ge+0.003
ters+0.002
is+0.002
rier+0.002
chie+0.002
Neg logit contributions
 trips-0.003
 names-0.003
ses-0.003
sed-0.002
 imagination-0.002
Token They
Token ID1119
Feature activation+1.71e-02

Pos logit contributions
ses+0.028
sed+0.028
 trips+0.028
 names+0.027
 spears+0.026
Neg logit contributions
ge-0.038
ters-0.036
is-0.033
rier-0.032
line-0.032
Token told
Token ID1297
Feature activation+2.31e-02

Pos logit contributions
 names+0.024
 trips+0.024
ses+0.023
sed+0.023
 imagination+0.023
Neg logit contributions
ge-0.035
ters-0.031
is-0.030
rier-0.030
ising-0.029
Token each
Token ID1123
Feature activation+9.27e-02

Pos logit contributions
ses+0.032
 trips+0.032
sed+0.032
 names+0.031
 imagination+0.030
Neg logit contributions
ge-0.045
ters-0.041
is-0.040
rier-0.039
ising-0.038
Token other
Token ID584
Feature activation+4.00e-02

Pos logit contributions
ses+0.132
 trips+0.131
 names+0.131
sed+0.129
 imagination+0.125
Neg logit contributions
ge-0.184
ters-0.166
is-0.160
rier-0.156
lling-0.151
Token what
Token ID644
Feature activation+1.17e-02

Pos logit contributions
ses+0.042
sed+0.041
 trips+0.037
 imag+0.035
tr+0.035
Neg logit contributions
ge-0.095
ters-0.087
is-0.084
line-0.082
rier-0.080
Token they
Token ID484
Feature activation-4.22e-02

Pos logit contributions
ses+0.023
sed+0.022
tr+0.021
 barrel+0.021
 spears+0.020
Neg logit contributions
ge-0.016
ters-0.015
is-0.014
Sus-0.013
ner-0.013
Token did
Token ID750
Feature activation-3.04e-02

Pos logit contributions
ge+0.092
ters+0.083
is+0.080
rier+0.078
lling+0.077
Neg logit contributions
ses-0.052
 trips-0.052
 names-0.052
sed-0.051
 imagination-0.049
Token and
Token ID290
Feature activation+1.27e-02

Pos logit contributions
ge+0.059
ters+0.054
chie+0.053
rier+0.053
ter+0.051
Neg logit contributions
 names-0.047
 trips-0.046
 stories-0.043
 imagination-0.043
 favorite-0.042
Token was
Token ID373
Feature activation+1.40e-02

Pos logit contributions
ses+0.042
sed+0.040
 spears+0.040
 names+0.039
icks+0.038
Neg logit contributions
ge-0.045
ters-0.038
is-0.038
lling-0.035
ter-0.035
Token unique
Token ID3748
Feature activation+1.32e-02

Pos logit contributions
ses+0.018
 names+0.017
 trips+0.017
sed+0.017
 imagination+0.016
Neg logit contributions
ge-0.029
ters-0.027
is-0.026
rier-0.025
lling-0.025
Token.
Token ID13
Feature activation-2.37e-03

Pos logit contributions
ses+0.026
 names+0.026
 trips+0.026
sed+0.026
 imagination+0.025
Neg logit contributions
ge-0.019
ters-0.016
is-0.015
rier-0.015
ising-0.014
Token He
Token ID679
Feature activation+1.28e-02

Pos logit contributions
ge+0.003
ters+0.003
line+0.003
is+0.002
rier+0.002
Neg logit contributions
 imag-0.004
 trips-0.004
 meanings-0.004
sed-0.004
ses-0.004
Token had
Token ID550
Feature activation+1.18e-02

Pos logit contributions
ses+0.015
 names+0.015
 spears+0.015
 trips+0.015
sed+0.014
Neg logit contributions
ge-0.026
is-0.024
ters-0.024
ter-0.022
Sus-0.022
Token black
Token ID2042
Feature activation+9.27e-02

Pos logit contributions
ses+0.018
sed+0.017
 barrel+0.016
ccoli+0.016
 trips+0.016
Neg logit contributions
ge-0.020
ters-0.019
is-0.019
Sus-0.017
rier-0.017
Token and
Token ID290
Feature activation+7.55e-03

Pos logit contributions
zed+0.158
ccoli+0.157
ses+0.152
sed+0.147
gen+0.146
Neg logit contributions
ters-0.132
is-0.131
ge-0.120
ising-0.112
ter-0.103
Token white
Token ID2330
Feature activation+6.11e-02

Pos logit contributions
ses+0.012
 trips+0.012
sed+0.012
 imag+0.011
icks+0.011
Neg logit contributions
ge-0.012
is-0.011
ters-0.011
ner-0.010
ter-0.010
Token spots
Token ID10222
Feature activation+4.09e-02

Pos logit contributions
zed+0.084
ses+0.081
ccoli+0.081
wl+0.076
sed+0.076
Neg logit contributions
ters-0.109
is-0.103
ge-0.101
ising-0.092
ter-0.091
Token all
Token ID477
Feature activation-4.86e-03

Pos logit contributions
 trips+0.076
 names+0.075
ses+0.075
sed+0.074
 imagination+0.072
Neg logit contributions
ge-0.064
ters-0.055
is-0.052
rier-0.051
lling-0.049
Token over
Token ID625
Feature activation-2.45e-03

Pos logit contributions
ge+0.010
ters+0.009
is+0.008
rier+0.008
lling+0.008
Neg logit contributions
ses-0.007
 trips-0.007
 names-0.007
sed-0.007
 imagination-0.006
Token.
Token ID13
Feature activation+1.17e-02

Pos logit contributions
ses+0.010
sed+0.010
 trips+0.010
 names+0.009
 imag+0.009
Neg logit contributions
ge-0.007
ters-0.007
is-0.006
rier-0.006
ising-0.006
Token They
Token ID1119
Feature activation+1.21e-02

Pos logit contributions
sed+0.018
ses+0.017
 trips+0.017
 imag+0.017
 names+0.016
Neg logit contributions
ge-0.022
ters-0.020
line-0.018
is-0.018
rier-0.018
Token write
Token ID3551
Feature activation+8.57e-03

Pos logit contributions
 names+0.018
 trips+0.018
 imagination+0.017
ses+0.017
 lived+0.017
Neg logit contributions
ge-0.024
ters-0.022
is-0.021
rier-0.021
ising-0.020
Token with
Token ID351
Feature activation+2.38e-02

Pos logit contributions
ses+0.014
sed+0.013
 trips+0.012
 imag+0.012
 barrel+0.012
Neg logit contributions
ters-0.015
ge-0.015
is-0.014
rier-0.013
lling-0.013
Token cr
Token ID1067
Feature activation+4.87e-02

Pos logit contributions
sed+0.036
ses+0.035
zed+0.035
tr+0.034
uddy+0.034
Neg logit contributions
ge-0.037
ters-0.036
is-0.034
ising-0.032
lling-0.031
Tokenay
Token ID323
Feature activation+9.19e-02

Pos logit contributions
 names+0.061
 trips+0.060
ses+0.060
sed+0.058
 imagination+0.057
Neg logit contributions
ge-0.106
ters-0.095
is-0.091
rier-0.090
lling-0.088
Tokenons
Token ID684
Feature activation+1.83e-02

Pos logit contributions
ses+0.178
sed+0.174
 barrel+0.157
tr+0.156
 trips+0.150
Neg logit contributions
is-0.152
ge-0.137
ters-0.125
ising-0.114
mine-0.111
Token on
Token ID319
Feature activation+5.16e-02

Pos logit contributions
 names+0.032
 trips+0.031
ses+0.030
 imagination+0.030
sed+0.030
Neg logit contributions
ge-0.031
ters-0.026
rier-0.026
is-0.026
chie-0.025
Token paper
Token ID3348
Feature activation+4.02e-02

Pos logit contributions
ses+0.090
sed+0.089
 imag+0.083
 trips+0.082
 names+0.081
Neg logit contributions
ge-0.087
ters-0.077
is-0.076
rier-0.070
line-0.068
Token.
Token ID13
Feature activation+1.20e-02

Pos logit contributions
 names+0.071
 trips+0.070
ses+0.069
sed+0.068
 imagination+0.067
Neg logit contributions
ge-0.067
ters-0.058
is-0.056
rier-0.055
lling-0.053
Token They
Token ID1119
Feature activation+1.47e-02

Pos logit contributions
sed+0.016
ses+0.016
 imag+0.015
 spears+0.015
 trips+0.015
Neg logit contributions
ge-0.023
ters-0.022
line-0.021
is-0.021
Sus-0.020
Token
Token ID198
Feature activation-3.96e-03

Pos logit contributions
ses+0.004
sed+0.004
 names+0.003
 imag+0.003
 trips+0.003
Neg logit contributions
ge-0.004
is-0.004
ters-0.003
Sus-0.003
line-0.003
Token"
Token ID1
Feature activation-2.97e-02

Pos logit contributions
ge+0.007
ters+0.006
is+0.006
rier+0.006
lling+0.005
Neg logit contributions
 trips-0.007
 names-0.007
ses-0.007
sed-0.007
 imagination-0.006
TokenO
Token ID46
Feature activation+2.54e-02

Pos logit contributions
ge+0.060
ters+0.052
rier+0.051
is+0.050
line+0.049
Neg logit contributions
ses-0.042
 trips-0.042
 names-0.042
sed-0.041
 imagination-0.040
Tokenw
Token ID86
Feature activation+5.71e-02

Pos logit contributions
 names+0.031
ses+0.031
sed+0.030
 trips+0.030
 imagination+0.029
Neg logit contributions
ge-0.056
ters-0.050
is-0.050
ising-0.048
rier-0.047
Token,
Token ID11
Feature activation+3.25e-02

Pos logit contributions
 names+0.096
ses+0.096
 trips+0.094
sed+0.092
 imagination+0.092
Neg logit contributions
ge-0.098
ters-0.089
is-0.085
rier-0.081
ter-0.080
Token ow
Token ID12334
Feature activation+9.18e-02

Pos logit contributions
 names+0.032
 trips+0.031
 habitats+0.030
ses+0.030
 meanings+0.029
Neg logit contributions
ge-0.075
ters-0.071
is-0.069
rier-0.068
ter-0.068
Token!"
Token ID2474
Feature activation-1.85e-03

Pos logit contributions
ses+0.131
 names+0.131
 trips+0.130
sed+0.128
 imagination+0.124
Neg logit contributions
ge-0.182
ters-0.163
is-0.158
rier-0.153
ising-0.150
Token Ben
Token ID3932
Feature activation-1.63e-02

Pos logit contributions
ge+0.004
ters+0.004
is+0.003
rier+0.003
lling+0.003
Neg logit contributions
 trips-0.002
ses-0.002
 names-0.002
sed-0.002
 imagination-0.002
Token says
Token ID1139
Feature activation-1.38e-02

Pos logit contributions
ge+0.038
ters+0.033
is+0.033
rier+0.032
line+0.031
Neg logit contributions
 trips-0.019
ses-0.019
 names-0.019
sed-0.018
 imagination-0.017
Token.
Token ID13
Feature activation+1.41e-02

Pos logit contributions
ge+0.029
is+0.027
lling+0.026
ters+0.025
Sus+0.025
Neg logit contributions
 names-0.018
sed-0.018
 trips-0.017
 tum-0.017
ses-0.017
Token He
Token ID679
Feature activation+2.11e-02

Pos logit contributions
ses+0.019
 trips+0.019
 names+0.019
sed+0.019
 imagination+0.018
Neg logit contributions
ge-0.029
ters-0.026
is-0.025
rier-0.025
ising-0.024
Token'm
Token ID1101
Feature activation+7.05e-03

Pos logit contributions
ses+0.020
 trips+0.020
sed+0.020
 names+0.020
 spears+0.019
Neg logit contributions
ge-0.029
ters-0.025
is-0.024
rier-0.023
ising-0.023
Token sorry
Token ID7926
Feature activation-2.53e-03

Pos logit contributions
ses+0.007
 trips+0.007
sed+0.007
 names+0.007
 imagination+0.007
Neg logit contributions
ge-0.017
ters-0.016
is-0.015
rier-0.015
ising-0.015
Token"
Token ID1
Feature activation-7.61e-02

Pos logit contributions
ge+0.005
ters+0.004
is+0.004
lling+0.004
rier+0.004
Neg logit contributions
 names-0.004
 trips-0.004
ses-0.004
sed-0.003
 imagination-0.003
Token and
Token ID290
Feature activation+1.72e-02

Pos logit contributions
ge+0.156
chie+0.134
ising+0.128
is+0.127
ivering+0.126
Neg logit contributions
 names-0.116
 Rex-0.109
 trips-0.103
 favorite-0.101
tr-0.100
Token hugged
Token ID41130
Feature activation+1.41e-02

Pos logit contributions
ses+0.017
sed+0.017
 names+0.016
 trips+0.016
 imagination+0.015
Neg logit contributions
ge-0.043
ters-0.039
is-0.038
lling-0.037
ising-0.037
Token each
Token ID1123
Feature activation+9.17e-02

Pos logit contributions
 names+0.022
 trips+0.021
ses+0.021
 spears+0.021
 imagination+0.020
Neg logit contributions
ge-0.027
is-0.022
ters-0.022
lling-0.022
ising-0.022
Token other
Token ID584
Feature activation+2.93e-02

Pos logit contributions
 names+0.120
 trips+0.113
ses+0.109
 imagination+0.106
sed+0.105
Neg logit contributions
ge-0.201
ters-0.177
rier-0.173
is-0.170
ising-0.168
Token.
Token ID13
Feature activation+3.08e-02

Pos logit contributions
 names+0.045
 trips+0.043
 stories+0.040
 favorite+0.039
 imagination+0.039
Neg logit contributions
ge-0.057
is-0.049
Sus-0.048
ters-0.046
chie-0.045
Token
Token ID198
Feature activation+7.38e-03

Pos logit contributions
 trips+0.044
ses+0.044
 names+0.043
sed+0.042
 spears+0.042
Neg logit contributions
ge-0.059
ters-0.055
is-0.051
rier-0.051
line-0.049
Token
Token ID198
Feature activation+2.29e-03

Pos logit contributions
ses+0.011
sed+0.011
 trips+0.011
 names+0.010
 imag+0.010
Neg logit contributions
ge-0.014
is-0.013
ters-0.013
Sus-0.012
ising-0.012
TokenThey
Token ID2990
Feature activation+1.58e-02

Pos logit contributions
 names+0.004
 trips+0.004
ses+0.004
 spears+0.004
sed+0.004
Neg logit contributions
ge-0.004
rier-0.004
ters-0.004
lling-0.003
chie-0.003
Token,
Token ID11
Feature activation+3.31e-02

Pos logit contributions
ses+0.038
 habitats+0.037
 trips+0.037
 names+0.036
 meanings+0.036
Neg logit contributions
rier-0.027
ge-0.027
ters-0.026
is-0.024
ter-0.024
Token "
Token ID366
Feature activation+7.93e-03

Pos logit contributions
 trips+0.045
 names+0.045
ses+0.043
 imag+0.043
 spears+0.042
Neg logit contributions
ge-0.064
ters-0.059
line-0.057
rier-0.057
ter-0.055
TokenO
Token ID46
Feature activation+2.37e-02

Pos logit contributions
 trips+0.010
 tum+0.010
 names+0.010
ses+0.010
 Rex+0.010
Neg logit contributions
ge-0.016
rying-0.014
is-0.014
rier-0.014
ters-0.014
Tokenw
Token ID86
Feature activation+5.92e-02

Pos logit contributions
ses+0.026
 trips+0.026
 names+0.026
sed+0.025
 imagination+0.024
Neg logit contributions
ge-0.055
ters-0.051
is-0.049
rier-0.048
lling-0.047
Token,
Token ID11
Feature activation+3.43e-02

Pos logit contributions
 names+0.097
ses+0.097
 trips+0.094
sed+0.093
 imagination+0.092
Neg logit contributions
ge-0.105
ters-0.095
is-0.091
rier-0.087
ter-0.086
Token ow
Token ID12334
Feature activation+9.16e-02

Pos logit contributions
 names+0.032
 trips+0.030
ses+0.030
sed+0.029
 imag+0.028
Neg logit contributions
ge-0.083
ters-0.078
is-0.075
rier-0.075
ter-0.075
Token,
Token ID11
Feature activation+2.15e-02

Pos logit contributions
 trips+0.123
ses+0.121
 names+0.119
sed+0.118
 imagination+0.115
Neg logit contributions
ge-0.189
ters-0.171
is-0.164
rier-0.163
lling-0.159
Token ow
Token ID12334
Feature activation+8.01e-02

Pos logit contributions
 names+0.017
 trips+0.017
ses+0.017
sed+0.016
 imagination+0.015
Neg logit contributions
ge-0.056
ters-0.052
is-0.050
rier-0.050
ter-0.048
Token.
Token ID13
Feature activation+2.04e-02

Pos logit contributions
 trips+0.136
ses+0.132
sed+0.129
 names+0.129
 spears+0.126
Neg logit contributions
ge-0.138
ters-0.121
rier-0.114
is-0.114
lling-0.112
Token My
Token ID2011
Feature activation+1.13e-02

Pos logit contributions
 trips+0.032
 names+0.031
 spears+0.030
ses+0.030
sed+0.030
Neg logit contributions
ge-0.036
ters-0.034
rier-0.031
ter-0.031
is-0.030
Token arm
Token ID3211
Feature activation+2.77e-02

Pos logit contributions
 names+0.016
 trips+0.016
 lived+0.016
 habitats+0.016
 imag+0.016
Neg logit contributions
ge-0.021
rier-0.018
ters-0.018
is-0.017
ter-0.017
Token her
Token ID607
Feature activation+1.52e-02

Pos logit contributions
ses+0.012
sed+0.012
 trips+0.012
 names+0.011
 imagination+0.011
Neg logit contributions
ge-0.011
ters-0.010
is-0.009
rier-0.009
ising-0.009
Token mom
Token ID1995
Feature activation+3.19e-02

Pos logit contributions
 trips+0.018
ses+0.018
 names+0.018
sed+0.018
 imagination+0.017
Neg logit contributions
ge-0.033
ters-0.030
is-0.029
rier-0.029
ising-0.028
Token.
Token ID13
Feature activation+3.20e-03

Pos logit contributions
 names+0.047
 favorite+0.045
 trips+0.045
 books+0.043
 imagination+0.042
Neg logit contributions
ge-0.065
rier-0.057
ner-0.056
ising-0.055
chie-0.055
Token They
Token ID1119
Feature activation+1.12e-02

Pos logit contributions
 names+0.004
ses+0.004
 trips+0.004
sed+0.004
 imagination+0.004
Neg logit contributions
ge-0.007
ters-0.006
is-0.006
rier-0.006
lling-0.006
Token used
Token ID973
Feature activation-2.05e-03

Pos logit contributions
ses+0.016
 trips+0.016
sed+0.016
 names+0.016
 spears+0.015
Neg logit contributions
ge-0.022
ters-0.020
is-0.019
lling-0.018
rier-0.018
Token red
Token ID2266
Feature activation+9.13e-02

Pos logit contributions
ge+0.004
ters+0.003
is+0.003
rier+0.003
lling+0.003
Neg logit contributions
ses-0.003
sed-0.003
 trips-0.003
 names-0.003
tr-0.003
Token juice
Token ID13135
Feature activation+3.02e-02

Pos logit contributions
ses+0.151
sed+0.147
 trips+0.143
 names+0.140
tr+0.138
Neg logit contributions
ge-0.161
ters-0.145
is-0.143
rier-0.131
ising-0.130
Token and
Token ID290
Feature activation+1.45e-02

Pos logit contributions
ses+0.055
 trips+0.054
sed+0.053
 names+0.053
 imagination+0.052
Neg logit contributions
ge-0.047
ters-0.044
is-0.041
rier-0.039
line-0.038
Token white
Token ID2330
Feature activation+6.57e-02

Pos logit contributions
ses+0.021
 trips+0.021
sed+0.020
 names+0.020
 imagination+0.019
Neg logit contributions
ge-0.028
ters-0.026
is-0.025
rier-0.023
lling-0.023
Token powder
Token ID11913
Feature activation+2.60e-02

Pos logit contributions
ses+0.103
sed+0.100
tr+0.095
 trips+0.094
 dictionary+0.091
Neg logit contributions
ge-0.117
ters-0.113
is-0.113
ising-0.098
rier-0.096
Token and
Token ID290
Feature activation+1.40e-02

Pos logit contributions
ses+0.046
 names+0.046
 trips+0.046
sed+0.045
 imagination+0.044
Neg logit contributions
ge-0.043
ters-0.037
is-0.036
rier-0.035
lling-0.034
Token I
Token ID314
Feature activation-4.37e-03

Pos logit contributions
 trips+0.032
sed+0.031
ses+0.031
 spears+0.030
 names+0.030
Neg logit contributions
ge-0.027
ters-0.027
rier-0.024
ter-0.024
is-0.024
Token will
Token ID481
Feature activation+8.27e-03

Pos logit contributions
ge+0.010
ters+0.009
is+0.009
rier+0.009
lling+0.009
Neg logit contributions
ses-0.005
 names-0.005
 trips-0.005
sed-0.005
 imagination-0.004
Token write
Token ID3551
Feature activation-5.92e-04

Pos logit contributions
 names+0.012
 imagination+0.012
 trips+0.011
 stories+0.010
 lived+0.010
Neg logit contributions
ge-0.018
ters-0.017
lling-0.016
rier-0.016
rying-0.015
Token with
Token ID351
Feature activation+1.76e-02

Pos logit contributions
ge+0.001
ters+0.001
is+0.001
rier+0.001
ising+0.001
Neg logit contributions
ses-0.001
sed-0.001
 trips-0.001
 names-0.001
 imagination-0.001
Token cr
Token ID1067
Feature activation+4.67e-02

Pos logit contributions
ses+0.034
sed+0.033
 trips+0.033
 names+0.032
tr+0.031
Neg logit contributions
ge-0.025
ters-0.023
is-0.022
rier-0.021
ising-0.020
Tokenay
Token ID323
Feature activation+9.13e-02

Pos logit contributions
 names+0.057
 trips+0.056
ses+0.055
sed+0.053
 imagination+0.052
Neg logit contributions
ge-0.103
ters-0.093
rier-0.089
is-0.089
lling-0.087
Tokenons
Token ID684
Feature activation+1.98e-02

Pos logit contributions
ses+0.172
sed+0.168
 barrel+0.152
 tum+0.149
tr+0.148
Neg logit contributions
is-0.155
ge-0.139
ters-0.137
mine-0.119
ising-0.119
Token.
Token ID13
Feature activation+2.44e-02

Pos logit contributions
 names+0.034
 trips+0.034
 imagination+0.032
ses+0.032
 spears+0.032
Neg logit contributions
ge-0.034
ters-0.028
is-0.027
lling-0.027
rier-0.027
Token I
Token ID314
Feature activation+6.79e-04

Pos logit contributions
 trips+0.039
sed+0.038
ses+0.037
 spears+0.036
 names+0.035
Neg logit contributions
ge-0.040
ters-0.040
rier-0.037
is-0.036
line-0.035
Token will
Token ID481
Feature activation+1.09e-02

Pos logit contributions
 names+0.001
 trips+0.001
ses+0.001
 imagination+0.001
sed+0.001
Neg logit contributions
ge-0.002
ters-0.002
rier-0.001
is-0.001
lling-0.001
Token not
Token ID407
Feature activation-8.80e-03

Pos logit contributions
 names+0.016
 imagination+0.016
 trips+0.014
 stories+0.014
 imag+0.014
Neg logit contributions
ge-0.023
ters-0.022
lling-0.021
rier-0.021
rying-0.020
Token.
Token ID13
Feature activation+7.59e-03

Pos logit contributions
ge+0.006
ters+0.005
is+0.004
rier+0.003
Sus+0.003
Neg logit contributions
sed-0.015
ses-0.015
 spears-0.014
 imag-0.014
 imagination-0.014
Token
Token ID198
Feature activation-3.70e-04

Pos logit contributions
sed+0.013
 trips+0.013
 imag+0.013
ses+0.013
 spears+0.012
Neg logit contributions
ge-0.012
ters-0.011
line-0.010
Sus-0.010
ner-0.010
Token
Token ID198
Feature activation-3.77e-03

Pos logit contributions
is+0.001
ge+0.001
Sus+0.001
ters+0.001
line+0.001
Neg logit contributions
sed-0.001
ses-0.001
 imag-0.001
 imagination-0.001
 meanings-0.001
TokenM
Token ID44
Feature activation+1.59e-02

Pos logit contributions
ge+0.005
is+0.004
ters+0.004
Sus+0.004
line+0.004
Neg logit contributions
ses-0.008
sed-0.008
 imagination-0.008
 imag-0.008
 names-0.008
Tokenoral
Token ID6864
Feature activation-1.59e-02

Pos logit contributions
ses+0.020
 names+0.019
 trips+0.019
sed+0.019
 imagination+0.018
Neg logit contributions
ge-0.034
is-0.031
ters-0.031
rier-0.030
ising-0.029
Token:
Token ID25
Feature activation+9.10e-02

Pos logit contributions
ge+0.027
is+0.026
ters+0.026
ising+0.024
rier+0.023
Neg logit contributions
sed-0.025
ses-0.025
 spears-0.024
 imag-0.023
 imagination-0.023
Token Hon
Token ID8835
Feature activation-3.27e-02

Pos logit contributions
ses+0.119
 trips+0.118
sed+0.117
 names+0.116
 imagination+0.110
Neg logit contributions
ge-0.190
ters-0.172
is-0.166
rier-0.161
line-0.157
Tokenesty
Token ID9673
Feature activation+5.17e-02

Pos logit contributions
ge+0.053
ters+0.050
ising+0.045
is+0.044
lling+0.044
Neg logit contributions
sed-0.058
ses-0.058
 marsh-0.056
 tum-0.055
 imagination-0.055
Token,
Token ID11
Feature activation+1.31e-02

Pos logit contributions
ses+0.050
sed+0.049
 trips+0.045
 spears+0.043
 tum+0.043
Neg logit contributions
ge-0.123
ters-0.119
is-0.114
ising-0.109
rier-0.108
Token generosity
Token ID31569
Feature activation+5.38e-04

Pos logit contributions
 names+0.022
 trips+0.022
 imagination+0.022
ses+0.022
 spears+0.021
Neg logit contributions
ge-0.023
ters-0.020
rier-0.020
lling-0.019
is-0.019
Token and
Token ID290
Feature activation+6.43e-03

Pos logit contributions
ses+0.001
 trips+0.001
sed+0.001
 names+0.001
 imagination+0.001
Neg logit contributions
ge-0.001
ters-0.001
is-0.001
rier-0.001
ising-0.001
Token,
Token ID11
Feature activation+2.08e-02

Pos logit contributions
ses+0.022
 trips+0.022
 names+0.021
sed+0.021
 imagination+0.021
Neg logit contributions
ge-0.027
ters-0.024
is-0.023
rier-0.023
line-0.022
Token "
Token ID366
Feature activation+3.00e-03

Pos logit contributions
ses+0.018
 names+0.018
sed+0.018
 trips+0.018
 imagination+0.016
Neg logit contributions
ge-0.053
ters-0.049
is-0.048
rier-0.046
lling-0.046
TokenLet
Token ID5756
Feature activation-5.69e-03

Pos logit contributions
 names+0.005
 trips+0.005
ses+0.005
sed+0.005
 imagination+0.005
Neg logit contributions
ge-0.005
ters-0.005
is-0.005
rier-0.005
lling-0.004
Token's
Token ID338
Feature activation+4.96e-02

Pos logit contributions
ge+0.010
ters+0.009
is+0.009
rier+0.009
ising+0.009
Neg logit contributions
ses-0.009
sed-0.009
 names-0.009
 trips-0.009
 imagination-0.008
Token tell
Token ID1560
Feature activation+2.45e-02

Pos logit contributions
 names+0.071
 trips+0.070
ses+0.067
 imagination+0.067
sed+0.065
Neg logit contributions
ge-0.102
ters-0.091
is-0.089
rier-0.087
lling-0.084
Token each
Token ID1123
Feature activation+9.09e-02

Pos logit contributions
ses+0.043
sed+0.042
 trips+0.040
 barrel+0.040
 names+0.039
Neg logit contributions
ge-0.038
ters-0.037
is-0.034
rier-0.033
ter-0.032
Token other
Token ID584
Feature activation+4.00e-02

Pos logit contributions
ses+0.151
sed+0.149
 trips+0.143
 names+0.138
 imag+0.137
Neg logit contributions
ge-0.162
ters-0.146
is-0.144
lling-0.134
rier-0.130
Token stories
Token ID3923
Feature activation+6.34e-03

Pos logit contributions
ses+0.053
sed+0.053
 trips+0.050
 imag+0.048
 names+0.047
Neg logit contributions
ge-0.083
ters-0.076
is-0.072
rier-0.070
ising-0.069
Token.
Token ID13
Feature activation+2.39e-02

Pos logit contributions
 names+0.011
 trips+0.011
ses+0.011
sed+0.011
 imagination+0.011
Neg logit contributions
ge-0.010
ters-0.009
is-0.009
rier-0.008
ising-0.008
Token I
Token ID314
Feature activation-4.08e-03

Pos logit contributions
 trips+0.037
ses+0.035
sed+0.035
 spears+0.033
 imag+0.033
Neg logit contributions
ters-0.041
ge-0.041
rier-0.038
line-0.037
ter-0.036
Token have
Token ID423
Feature activation+1.25e-02

Pos logit contributions
ge+0.008
ters+0.007
is+0.007
ising+0.006
rier+0.006
Neg logit contributions
ses-0.006
sed-0.006
 trips-0.006
 names-0.006
 spears-0.006
Token toys
Token ID14958
Feature activation+3.11e-03

Pos logit contributions
ses+0.002
sed+0.002
zed+0.002
tr+0.002
icks+0.002
Neg logit contributions
ge-0.002
ters-0.002
is-0.002
ter-0.002
Sus-0.002
Token and
Token ID290
Feature activation+5.13e-03

Pos logit contributions
 trips+0.006
 names+0.006
ses+0.006
sed+0.005
 imagination+0.005
Neg logit contributions
ge-0.005
ters-0.004
is-0.004
rier-0.004
lling-0.004
Token draw
Token ID3197
Feature activation+9.42e-03

Pos logit contributions
ses+0.006
 trips+0.006
 names+0.006
sed+0.006
 imagination+0.006
Neg logit contributions
ge-0.011
ters-0.010
is-0.010
rier-0.009
lling-0.009
Token with
Token ID351
Feature activation+1.76e-02

Pos logit contributions
ses+0.015
sed+0.014
 trips+0.014
 names+0.014
 imagination+0.014
Neg logit contributions
ge-0.017
ters-0.016
is-0.015
rier-0.015
ising-0.014
Token cr
Token ID1067
Feature activation+4.46e-02

Pos logit contributions
ses+0.026
sed+0.025
 trips+0.024
 names+0.023
 imag+0.023
Neg logit contributions
ge-0.035
ters-0.032
is-0.030
rier-0.029
lling-0.028
Tokenay
Token ID323
Feature activation+9.08e-02

Pos logit contributions
 names+0.057
 trips+0.057
ses+0.057
sed+0.056
 imagination+0.054
Neg logit contributions
ge-0.095
ters-0.085
is-0.082
rier-0.081
lling-0.079
Tokenons
Token ID684
Feature activation+2.82e-02

Pos logit contributions
ses+0.181
sed+0.176
 barrel+0.158
tr+0.156
 tum+0.153
Neg logit contributions
is-0.149
ge-0.125
ters-0.123
Sus-0.108
mine-0.106
Token.
Token ID13
Feature activation+3.17e-03

Pos logit contributions
 names+0.050
 trips+0.049
ses+0.047
sed+0.047
 imagination+0.047
Neg logit contributions
ge-0.047
ters-0.040
rier-0.038
is-0.038
lling-0.038
Token One
Token ID1881
Feature activation+3.93e-02

Pos logit contributions
sed+0.004
ses+0.004
 spears+0.004
 nightmare+0.004
 imag+0.004
Neg logit contributions
ge-0.006
ters-0.006
line-0.006
is-0.006
mine-0.005
Token day
Token ID1110
Feature activation+7.52e-03

Pos logit contributions
 spears+0.057
sed+0.056
ses+0.056
 imag+0.054
 trips+0.054
Neg logit contributions
ge-0.072
is-0.069
ters-0.067
line-0.062
ter-0.061
Token,
Token ID11
Feature activation+1.33e-02

Pos logit contributions
ses+0.014
sed+0.013
 spears+0.013
tr+0.013
 meanings+0.013
Neg logit contributions
ge-0.012
ters-0.011
is-0.010
line-0.010
Sus-0.010
Token
Token ID198
Feature activation-1.64e-03

Pos logit contributions
ses+0.010
sed+0.010
 names+0.010
 trips+0.010
 imag+0.010
Neg logit contributions
ge-0.011
is-0.010
ters-0.010
Sus-0.009
line-0.009
Token"
Token ID1
Feature activation-3.00e-02

Pos logit contributions
ge+0.003
ters+0.002
rier+0.002
is+0.002
lling+0.002
Neg logit contributions
 trips-0.003
ses-0.003
 names-0.003
sed-0.003
 imagination-0.003
TokenO
Token ID46
Feature activation+1.93e-02

Pos logit contributions
ge+0.060
rier+0.050
is+0.050
ters+0.050
ising+0.049
Neg logit contributions
ses-0.044
 trips-0.043
sed-0.042
 names-0.042
 Rex-0.040
Tokenw
Token ID86
Feature activation+5.74e-02

Pos logit contributions
ses+0.020
 trips+0.019
sed+0.019
 names+0.018
 imagination+0.017
Neg logit contributions
ge-0.046
ters-0.043
rier-0.041
is-0.041
lling-0.040
Token,
Token ID11
Feature activation+3.46e-02

Pos logit contributions
 names+0.088
ses+0.088
 trips+0.087
sed+0.085
 imagination+0.083
Neg logit contributions
ge-0.108
ters-0.097
is-0.093
rier-0.090
ising-0.088
Token ow
Token ID12334
Feature activation+9.06e-02

Pos logit contributions
 trips+0.023
 names+0.023
ses+0.023
sed+0.022
 imagination+0.020
Neg logit contributions
ge-0.095
ters-0.088
is-0.086
rier-0.084
lling-0.082
Token,
Token ID11
Feature activation+2.56e-02

Pos logit contributions
ses+0.128
 names+0.127
 trips+0.127
sed+0.125
 imagination+0.121
Neg logit contributions
ge-0.181
ters-0.163
is-0.157
rier-0.153
ising-0.149
Token ow
Token ID12334
Feature activation+8.26e-02

Pos logit contributions
ses+0.016
 trips+0.015
 names+0.015
sed+0.015
 imagination+0.014
Neg logit contributions
ge-0.072
ters-0.066
is-0.065
rier-0.064
ising-0.063
Token!"
Token ID2474
Feature activation-3.22e-04

Pos logit contributions
 trips+0.137
ses+0.137
 names+0.135
sed+0.134
 imagination+0.130
Neg logit contributions
ge-0.144
ters-0.127
is-0.121
rier-0.119
lling-0.115
Token she
Token ID673
Feature activation-9.35e-03

Pos logit contributions
ge+0.001
ters+0.001
is+0.001
lling+0.001
rier+0.001
Neg logit contributions
ses-0.000
sed-0.000
 trips-0.000
 names-0.000
 imagination-0.000
Token w
Token ID266
Feature activation-6.18e-03

Pos logit contributions
ge+0.017
ters+0.014
is+0.014
rier+0.014
line+0.014
Neg logit contributions
ses-0.015
sed-0.015
 trips-0.015
 names-0.015
 teeth-0.014
Token,
Token ID11
Feature activation-7.00e-03

Pos logit contributions
 names+0.028
 trips+0.028
 spears+0.026
 imagination+0.026
 favorite+0.025
Neg logit contributions
ge-0.040
rier-0.035
ters-0.035
lling-0.035
chie-0.035
Token drew
Token ID9859
Feature activation+5.27e-03

Pos logit contributions
ge+0.017
rier+0.015
ising+0.015
chie+0.014
rily+0.014
Neg logit contributions
 names-0.008
 trips-0.008
 spears-0.008
 helmets-0.007
 books-0.007
Token pictures
Token ID5986
Feature activation+2.31e-02

Pos logit contributions
ses+0.007
sed+0.007
 trips+0.006
 names+0.006
 imagination+0.006
Neg logit contributions
ge-0.011
ters-0.010
is-0.010
rier-0.010
lling-0.009
Token with
Token ID351
Feature activation+1.42e-02

Pos logit contributions
 names+0.044
 trips+0.044
ses+0.043
sed+0.042
 imagination+0.042
Neg logit contributions
ge-0.036
ters-0.030
rier-0.028
is-0.028
lling-0.027
Token cr
Token ID1067
Feature activation+4.55e-02

Pos logit contributions
ses+0.021
sed+0.021
 trips+0.020
 names+0.020
 imagination+0.019
Neg logit contributions
ge-0.028
ters-0.025
is-0.024
rier-0.023
lling-0.023
Tokenay
Token ID323
Feature activation+9.04e-02

Pos logit contributions
ses+0.061
 trips+0.060
 names+0.060
sed+0.060
 imagination+0.057
Neg logit contributions
ge-0.094
ters-0.085
is-0.082
rier-0.079
ising-0.077
Tokenons
Token ID684
Feature activation+2.01e-02

Pos logit contributions
ses+0.167
sed+0.160
 trips+0.140
 tum+0.140
tr+0.140
Neg logit contributions
is-0.156
ge-0.146
ters-0.140
Sus-0.121
mine-0.120
Token,
Token ID11
Feature activation+1.60e-04

Pos logit contributions
 names+0.032
 trips+0.032
 spears+0.030
 imagination+0.030
ses+0.029
Neg logit contributions
ge-0.037
ters-0.032
rier-0.032
chie-0.032
lling-0.031
Token and
Token ID290
Feature activation+8.95e-03

Pos logit contributions
 names+0.000
 trips+0.000
 imagination+0.000
 spears+0.000
 helmets+0.000
Neg logit contributions
ge-0.000
rier-0.000
ters-0.000
ising-0.000
chie-0.000
Token built
Token ID3170
Feature activation-5.21e-03

Pos logit contributions
ses+0.012
 trips+0.012
 names+0.012
sed+0.012
 imagination+0.012
Neg logit contributions
ge-0.018
ters-0.016
is-0.016
rier-0.015
lling-0.015
Token for
Token ID329
Feature activation+2.29e-02

Pos logit contributions
ge+0.012
ters+0.011
is+0.010
rier+0.010
lling+0.010
Neg logit contributions
 names-0.006
 trips-0.006
ses-0.006
sed-0.006
 imagination-0.006
Token's
Token ID338
Feature activation+2.80e-02

Pos logit contributions
 trips+0.036
ses+0.035
 spears+0.035
sed+0.034
 imagination+0.034
Neg logit contributions
ge-0.037
ters-0.037
is-0.033
lling-0.033
ter-0.032
Token okay
Token ID8788
Feature activation-1.24e-04

Pos logit contributions
 trips+0.043
ses+0.042
 names+0.042
sed+0.041
 imagination+0.039
Neg logit contributions
ge-0.053
ters-0.048
is-0.045
rier-0.044
ter-0.043
Token"
Token ID1
Feature activation-7.83e-02

Pos logit contributions
ge+0.000
ters+0.000
is+0.000
rier+0.000
ising+0.000
Neg logit contributions
ses-0.000
 trips-0.000
sed-0.000
 names-0.000
 imagination-0.000
Token and
Token ID290
Feature activation+1.23e-02

Pos logit contributions
ge+0.157
chie+0.136
is+0.130
ising+0.128
ivering+0.126
Neg logit contributions
 names-0.124
 Rex-0.109
 trips-0.108
tr-0.107
 favorite-0.105
Token hugged
Token ID41130
Feature activation+9.58e-03

Pos logit contributions
ses+0.016
 trips+0.016
 names+0.016
sed+0.015
 imagination+0.015
Neg logit contributions
ge-0.026
ters-0.024
is-0.023
rier-0.022
ising-0.022
Token each
Token ID1123
Feature activation+9.03e-02

Pos logit contributions
 names+0.016
 trips+0.015
 spears+0.014
ses+0.014
 Rex+0.014
Neg logit contributions
ge-0.018
is-0.015
lling-0.015
ising-0.015
ters-0.015
Token other
Token ID584
Feature activation+2.82e-02

Pos logit contributions
 names+0.120
 trips+0.115
ses+0.112
sed+0.109
 imagination+0.108
Neg logit contributions
ge-0.194
ters-0.170
rier-0.166
is-0.164
ising-0.161
Token.
Token ID13
Feature activation+3.09e-02

Pos logit contributions
 names+0.046
 trips+0.045
 spears+0.041
 imagination+0.041
 stories+0.041
Neg logit contributions
ge-0.051
is-0.043
ters-0.042
Sus-0.042
rier-0.041
Token Then
Token ID3244
Feature activation+2.51e-02

Pos logit contributions
 trips+0.044
ses+0.044
 names+0.043
sed+0.042
 spears+0.042
Neg logit contributions
ge-0.059
ters-0.055
is-0.052
rier-0.051
line-0.050
Token they
Token ID484
Feature activation+1.13e-02

Pos logit contributions
ses+0.040
 trips+0.040
 names+0.040
sed+0.039
 imagination+0.038
Neg logit contributions
ge-0.045
ters-0.040
is-0.039
rier-0.038
ising-0.036
Token decided
Token ID3066
Feature activation+1.07e-02

Pos logit contributions
ses+0.018
 trips+0.017
sed+0.017
 names+0.017
 imagination+0.017
Neg logit contributions
ge-0.020
ters-0.018
is-0.017
mine-0.016
lling-0.016
Token had
Token ID550
Feature activation+2.08e-02

Pos logit contributions
 trips+0.002
 names+0.002
 imagination+0.002
 lived+0.002
sed+0.002
Neg logit contributions
ge-0.003
ters-0.003
is-0.002
rier-0.002
chie-0.002
Token fun
Token ID1257
Feature activation+9.80e-03

Pos logit contributions
ses+0.024
sed+0.023
 trips+0.022
 names+0.022
 imagination+0.021
Neg logit contributions
ge-0.047
ters-0.043
is-0.042
rier-0.040
lling-0.040
Token.
Token ID13
Feature activation+8.05e-03

Pos logit contributions
 trips+0.016
 names+0.015
 imagination+0.015
 stories+0.014
 favorite+0.014
Neg logit contributions
ge-0.018
rier-0.017
ters-0.016
ter-0.016
chie-0.016
Token They
Token ID1119
Feature activation+1.47e-02

Pos logit contributions
ses+0.009
 trips+0.009
 names+0.009
sed+0.009
 imagination+0.009
Neg logit contributions
ge-0.018
ters-0.016
is-0.016
rier-0.015
line-0.015
Token used
Token ID973
Feature activation+5.10e-03

Pos logit contributions
 names+0.020
 lived+0.020
 trips+0.020
 imagination+0.019
 imag+0.019
Neg logit contributions
ge-0.031
rier-0.029
ters-0.028
is-0.027
ising-0.026
Token red
Token ID2266
Feature activation+9.02e-02

Pos logit contributions
ses+0.008
sed+0.008
 trips+0.008
 names+0.008
 lived+0.007
Neg logit contributions
ge-0.009
ters-0.008
is-0.008
rier-0.008
lling-0.007
Token and
Token ID290
Feature activation+1.67e-02

Pos logit contributions
ses+0.155
sed+0.152
 trips+0.147
 spears+0.141
 names+0.141
Neg logit contributions
ge-0.149
ters-0.138
is-0.136
ising-0.123
line-0.117
Token yellow
Token ID7872
Feature activation+6.84e-02

Pos logit contributions
ses+0.024
 trips+0.024
sed+0.024
 names+0.024
 spears+0.022
Neg logit contributions
ge-0.032
ters-0.029
is-0.028
rier-0.027
lling-0.026
Token and
Token ID290
Feature activation+1.40e-02

Pos logit contributions
ses+0.113
sed+0.111
 trips+0.111
 names+0.105
 spears+0.105
Neg logit contributions
ge-0.115
ters-0.107
is-0.105
ising-0.096
rier-0.093
Token blue
Token ID4171
Feature activation+5.99e-02

Pos logit contributions
ses+0.020
 trips+0.020
 names+0.020
sed+0.020
 spears+0.019
Neg logit contributions
ge-0.027
ters-0.025
is-0.024
rier-0.023
lling-0.022
Token and
Token ID290
Feature activation+1.27e-02

Pos logit contributions
ses+0.094
 trips+0.093
sed+0.092
 names+0.091
 imagination+0.088
Neg logit contributions
ge-0.109
ters-0.099
is-0.095
rier-0.090
ising-0.088
Token.
Token ID13
Feature activation+1.07e-02

Pos logit contributions
ge+0.058
ters+0.051
is+0.049
rier+0.047
lling+0.045
Neg logit contributions
 names-0.061
 trips-0.061
ses-0.058
 imagination-0.057
sed-0.057
Token<|endoftext|>
Token ID50256
Feature activation+2.24e-02

Pos logit contributions
ses+0.017
sed+0.017
 trips+0.017
 spears+0.016
 names+0.016
Neg logit contributions
ge-0.018
ters-0.017
is-0.016
Sus-0.016
line-0.016
TokenPeter
Token ID19727
Feature activation+2.74e-02

Pos logit contributions
ses+0.026
 names+0.025
sed+0.025
 trips+0.025
 imagination+0.024
Neg logit contributions
ge-0.050
ters-0.046
is-0.044
rier-0.043
lling-0.042
Token was
Token ID373
Feature activation+1.57e-02

Pos logit contributions
ses+0.040
 names+0.039
 trips+0.039
sed+0.039
 imagination+0.037
Neg logit contributions
ge-0.053
ters-0.048
is-0.046
rier-0.044
lling-0.044
Token a
Token ID257
Feature activation-9.34e-03

Pos logit contributions
 trips+0.021
ses+0.021
sed+0.021
 names+0.020
 imagination+0.020
Neg logit contributions
ge-0.033
ters-0.030
is-0.029
rier-0.028
ising-0.027
Token mis
Token ID2984
Feature activation-1.06e-01

Pos logit contributions
ge+0.017
ters+0.015
is+0.014
rier+0.013
lling+0.013
Neg logit contributions
 trips-0.015
ses-0.015
 names-0.015
sed-0.015
 imagination-0.015
Tokenchie
Token ID3043
Feature activation-4.56e-02

Pos logit contributions
ge+0.111
rily+0.108
rier+0.099
ters+0.098
 revealing+0.095
Neg logit contributions
adders-0.170
inea-0.161
ats-0.160
 colony-0.160
uddy-0.157
Tokenvous
Token ID31222
Feature activation-2.40e-03

Pos logit contributions
ge+0.109
ters+0.098
rier+0.095
is+0.092
ising+0.091
Neg logit contributions
 trips-0.047
 names-0.046
ses-0.045
 lived-0.044
sed-0.044
Token 3
Token ID513
Feature activation-3.95e-02

Pos logit contributions
ge+0.004
ters+0.003
is+0.003
lling+0.003
rier+0.003
Neg logit contributions
ses-0.004
 names-0.004
 trips-0.004
sed-0.004
 imagination-0.004
Token year
Token ID614
Feature activation-1.57e-02

Pos logit contributions
ge+0.074
ters+0.066
rier+0.062
is+0.061
line+0.059
Neg logit contributions
 trips-0.062
 names-0.059
 imagination-0.057
ses-0.057
 lived-0.057
Token old
Token ID1468
Feature activation-2.03e-02

Pos logit contributions
ge+0.034
ters+0.032
mine+0.029
lling+0.028
ter+0.028
Neg logit contributions
 trips-0.021
 names-0.019
 stories-0.019
 favorite-0.019
 imagination-0.018
Token,
Token ID11
Feature activation+1.95e-02

Pos logit contributions
ge+0.007
ters+0.007
is+0.006
rier+0.006
ising+0.006
Neg logit contributions
ses-0.006
 names-0.006
 trips-0.006
sed-0.006
 imagination-0.005
Token the
Token ID262
Feature activation-7.85e-03

Pos logit contributions
 names+0.027
 trips+0.026
 imagination+0.026
 Rex+0.025
 lived+0.025
Neg logit contributions
ge-0.041
is-0.036
chie-0.035
lling-0.035
ters-0.035
Token squirrel
Token ID33039
Feature activation+8.63e-03

Pos logit contributions
ge+0.016
ters+0.015
is+0.014
rier+0.014
ising+0.013
Neg logit contributions
ses-0.010
 trips-0.010
 names-0.010
sed-0.010
 imagination-0.010
Token got
Token ID1392
Feature activation-2.23e-02

Pos logit contributions
 trips+0.011
sed+0.011
ses+0.010
 names+0.010
 spears+0.010
Neg logit contributions
ge-0.019
ters-0.017
rier-0.017
is-0.016
line-0.016
Token a
Token ID257
Feature activation-1.06e-02

Pos logit contributions
ge+0.040
ters+0.035
is+0.033
rier+0.033
lling+0.032
Neg logit contributions
ses-0.037
 trips-0.036
 names-0.036
sed-0.036
 imagination-0.035
Token mis
Token ID2984
Feature activation-1.05e-01

Pos logit contributions
ge+0.023
ters+0.021
is+0.020
rier+0.019
lling+0.019
Neg logit contributions
ses-0.014
 trips-0.014
 names-0.013
sed-0.013
 imagination-0.013
Tokenchie
Token ID3043
Feature activation-5.10e-02

Pos logit contributions
ge+0.096
rily+0.091
ner+0.089
line+0.088
 Nina+0.084
Neg logit contributions
ats-0.198
 spears-0.193
haw-0.192
adders-0.191
icks-0.188
Tokenvous
Token ID31222
Feature activation-1.29e-02

Pos logit contributions
ge+0.116
ters+0.105
is+0.101
rier+0.101
ising+0.097
Neg logit contributions
 trips-0.057
 names-0.057
ses-0.057
sed-0.055
 imagination-0.053
Token look
Token ID804
Feature activation-1.33e-02

Pos logit contributions
ge+0.028
ters+0.025
lling+0.024
is+0.024
rier+0.024
Neg logit contributions
ses-0.017
sed-0.017
 trips-0.017
 names-0.016
 tum-0.016
Token in
Token ID287
Feature activation-1.55e-02

Pos logit contributions
ge+0.026
ters+0.023
chie+0.022
rier+0.022
ner+0.022
Neg logit contributions
 trips-0.020
 names-0.018
 spears-0.018
 lived-0.018
 barrel-0.018
Token its
Token ID663
Feature activation+2.61e-03

Pos logit contributions
ge+0.032
ters+0.030
is+0.029
rier+0.028
ner+0.028
Neg logit contributions
 trips-0.022
 names-0.021
 imagination-0.020
ses-0.020
 spears-0.020
Token baby
Token ID5156
Feature activation-9.41e-03

Pos logit contributions
ses+0.013
 trips+0.013
 names+0.013
sed+0.013
 imagination+0.013
Neg logit contributions
ge-0.014
ters-0.012
is-0.011
rier-0.011
lling-0.011
Token.
Token ID13
Feature activation-3.57e-03

Pos logit contributions
ge+0.021
ters+0.018
is+0.018
rier+0.018
lling+0.017
Neg logit contributions
 trips-0.012
ses-0.011
 names-0.011
sed-0.011
 imagination-0.011
Token He
Token ID679
Feature activation+6.72e-03

Pos logit contributions
ge+0.007
ters+0.007
is+0.006
Sus+0.006
mine+0.006
Neg logit contributions
 trips-0.005
ses-0.005
sed-0.005
 names-0.004
 imag-0.004
Token was
Token ID373
Feature activation+1.19e-02

Pos logit contributions
ses+0.009
 names+0.009
sed+0.008
 spears+0.008
icks+0.008
Neg logit contributions
ge-0.013
is-0.012
ters-0.012
ter-0.012
lling-0.011
Token a
Token ID257
Feature activation-1.39e-02

Pos logit contributions
ses+0.014
 trips+0.014
 names+0.014
sed+0.013
 imagination+0.013
Neg logit contributions
ge-0.027
ters-0.024
is-0.024
rier-0.023
lling-0.023
Token mis
Token ID2984
Feature activation-1.04e-01

Pos logit contributions
ge+0.027
ters+0.024
is+0.023
rier+0.023
ising+0.022
Neg logit contributions
 names-0.020
ses-0.020
 trips-0.020
sed-0.020
 imagination-0.019
Tokenchie
Token ID3043
Feature activation-4.93e-02

Pos logit contributions
ge+0.110
lling+0.090
rily+0.089
is+0.087
 revealing+0.087
Neg logit contributions
adders-0.186
inea-0.173
 names-0.168
ses-0.168
ats-0.168
Tokenvous
Token ID31222
Feature activation-4.25e-03

Pos logit contributions
ge+0.108
ters+0.098
is+0.095
rier+0.093
ising+0.091
Neg logit contributions
ses-0.059
 trips-0.059
 names-0.059
sed-0.057
 imagination-0.055
Token little
Token ID1310
Feature activation-3.19e-02

Pos logit contributions
ge+0.007
ters+0.006
is+0.006
rier+0.006
ising+0.006
Neg logit contributions
ses-0.007
 trips-0.007
 names-0.007
sed-0.007
 imagination-0.007
Token baby
Token ID5156
Feature activation-7.93e-03

Pos logit contributions
ge+0.060
is+0.049
lling+0.049
ters+0.049
Sus+0.047
Neg logit contributions
ses-0.053
 barrel-0.051
 trips-0.051
sed-0.051
 tum-0.051
Token and
Token ID290
Feature activation-4.03e-03

Pos logit contributions
ge+0.013
ters+0.011
is+0.011
rier+0.011
lling+0.010
Neg logit contributions
 trips-0.014
ses-0.014
 names-0.014
sed-0.014
 imagination-0.013
Token
Token ID220
Feature activation-2.03e-02

Pos logit contributions
ge+0.037
ters+0.033
rier+0.032
is+0.031
lling+0.031
Neg logit contributions
 trips-0.035
 names-0.032
icks-0.031
ses-0.030
 imagination-0.030
Token
Token ID198
Feature activation-2.96e-02

Pos logit contributions
ge+0.046
ters+0.041
is+0.040
rier+0.039
lling+0.039
Neg logit contributions
 names-0.024
 trips-0.023
ses-0.023
sed-0.023
 imagination-0.022
Token
Token ID198
Feature activation-2.77e-02

Pos logit contributions
ge+0.066
ters+0.060
rier+0.058
is+0.058
ising+0.056
Neg logit contributions
 trips-0.035
 names-0.035
ses-0.034
sed-0.033
 imagination-0.032
TokenThe
Token ID464
Feature activation-1.36e-02

Pos logit contributions
ge+0.049
ters+0.044
rier+0.044
is+0.041
lling+0.040
Neg logit contributions
 names-0.044
 trips-0.044
 lived-0.041
 imagination-0.041
 spears-0.041
Token town
Token ID3240
Feature activation-4.93e-02

Pos logit contributions
ge+0.025
ters+0.022
is+0.021
rier+0.021
lling+0.020
Neg logit contributions
 names-0.021
ses-0.021
 trips-0.021
sed-0.021
 imagination-0.020
Tokenspe
Token ID4125
Feature activation-1.01e-01

Pos logit contributions
ge+0.108
ters+0.094
rier+0.093
is+0.091
rily+0.087
Neg logit contributions
 lived-0.067
 trips-0.062
 names-0.061
sed-0.057
 imagination-0.057
Tokenople
Token ID643
Feature activation-4.57e-02

Pos logit contributions
ge+0.152
rier+0.130
line+0.130
is+0.126
rily+0.122
Neg logit contributions
 trips-0.180
 lived-0.180
 imag-0.175
 meanings-0.172
sed-0.169
Token had
Token ID550
Feature activation-1.38e-03

Pos logit contributions
ge+0.101
rier+0.096
ters+0.092
is+0.090
rily+0.087
Neg logit contributions
 lived-0.057
 trips-0.055
 names-0.054
 imagination-0.051
 imag-0.050
Token to
Token ID284
Feature activation+3.26e-03

Pos logit contributions
ge+0.003
ters+0.003
is+0.003
rier+0.003
ising+0.002
Neg logit contributions
 trips-0.002
 names-0.002
ses-0.002
sed-0.002
 imagination-0.002
Token find
Token ID1064
Feature activation+4.99e-03

Pos logit contributions
 names+0.005
 imagination+0.004
 trips+0.004
 lived+0.004
 imag+0.004
Neg logit contributions
ge-0.007
is-0.006
ters-0.006
rier-0.006
rily-0.006
Token a
Token ID257
Feature activation+9.68e-03

Pos logit contributions
ses+0.008
sed+0.008
 trips+0.007
 names+0.007
tr+0.007
Neg logit contributions
ge-0.009
ters-0.009
is-0.008
rier-0.008
ising-0.008
Token while
Token ID981
Feature activation-1.67e-02

Pos logit contributions
ge+0.021
ters+0.020
lling+0.018
mine+0.018
rier+0.018
Neg logit contributions
 trips-0.022
 tum-0.022
 imagination-0.021
 favorite-0.021
 names-0.021
Token.
Token ID13
Feature activation+1.13e-03

Pos logit contributions
ge+0.034
ters+0.029
lling+0.029
rier+0.028
is+0.028
Neg logit contributions
 trips-0.023
 imagination-0.021
 tum-0.021
sed-0.020
 Rex-0.020
Token Soon
Token ID15894
Feature activation+2.83e-03

Pos logit contributions
ses+0.001
 trips+0.001
sed+0.001
 names+0.001
 spears+0.001
Neg logit contributions
ge-0.002
ters-0.002
is-0.002
rier-0.002
line-0.002
Token,
Token ID11
Feature activation+4.32e-03

Pos logit contributions
 trips+0.004
 names+0.004
ses+0.004
sed+0.004
 imagination+0.004
Neg logit contributions
ge-0.006
ters-0.005
is-0.005
rier-0.005
lling-0.005
Token the
Token ID262
Feature activation-9.46e-03

Pos logit contributions
 trips+0.006
 names+0.006
 tum+0.006
 imagination+0.006
 lived+0.006
Neg logit contributions
ge-0.009
is-0.008
ters-0.008
rier-0.007
chie-0.007
Token tad
Token ID36264
Feature activation-9.88e-02

Pos logit contributions
ge+0.019
ters+0.017
is+0.016
rier+0.016
lling+0.015
Neg logit contributions
ses-0.014
 trips-0.014
 names-0.014
sed-0.013
 imagination-0.013
Tokenp
Token ID79
Feature activation+1.59e-02

Pos logit contributions
ge+0.184
ner+0.139
Sus+0.139
line+0.137
ters+0.137
Neg logit contributions
 names-0.160
sed-0.154
ses-0.154
 spears-0.153
 trips-0.148
Tokenoles
Token ID4316
Feature activation-1.27e-02

Pos logit contributions
 names+0.023
sed+0.023
 trips+0.023
ses+0.022
 lived+0.022
Neg logit contributions
ge-0.031
ters-0.027
rier-0.027
is-0.025
line-0.025
Token became
Token ID2627
Feature activation-2.46e-02

Pos logit contributions
ge+0.026
rier+0.024
ters+0.023
is+0.023
Sus+0.022
Neg logit contributions
 trips-0.018
 names-0.017
 lived-0.017
sed-0.016
ses-0.016
Token frogs
Token ID37475
Feature activation-1.72e-02

Pos logit contributions
ge+0.050
ters+0.044
is+0.044
rier+0.042
line+0.041
Neg logit contributions
 names-0.035
 trips-0.034
ses-0.033
sed-0.033
 imagination-0.032
Token and
Token ID290
Feature activation-6.91e-03

Pos logit contributions
ge+0.031
rier+0.028
ters+0.026
Sus+0.026
is+0.026
Neg logit contributions
 trips-0.028
 names-0.026
 spears-0.025
 lived-0.025
sed-0.025
Token sleep
Token ID3993
Feature activation-9.60e-03

Pos logit contributions
 trips+0.009
 imagination+0.009
 names+0.008
 lived+0.008
 imag+0.008
Neg logit contributions
ge-0.014
ters-0.013
is-0.013
rier-0.013
lling-0.012
Token so
Token ID523
Feature activation+9.79e-03

Pos logit contributions
ge+0.019
ters+0.017
chie+0.016
is+0.016
lling+0.015
Neg logit contributions
 trips-0.015
 lived-0.014
 tum-0.014
 imagination-0.014
zed-0.013
Token that
Token ID326
Feature activation-9.01e-03

Pos logit contributions
 trips+0.014
 imagination+0.013
 lived+0.013
sed+0.013
 names+0.013
Neg logit contributions
ge-0.019
ters-0.018
is-0.017
lling-0.016
rier-0.016
Token no
Token ID645
Feature activation+6.49e-03

Pos logit contributions
ge+0.017
ters+0.015
is+0.015
rier+0.014
ising+0.014
Neg logit contributions
 trips-0.014
 names-0.014
ses-0.014
 imagination-0.013
sed-0.013
Token more
Token ID517
Feature activation-6.22e-03

Pos logit contributions
 trips+0.010
 imagination+0.009
 names+0.009
ses+0.009
 tum+0.009
Neg logit contributions
ge-0.012
ters-0.011
line-0.011
rier-0.011
lling-0.011
Token mis
Token ID2984
Feature activation-9.87e-02

Pos logit contributions
ge+0.013
ters+0.012
is+0.011
rier+0.011
line+0.011
Neg logit contributions
 trips-0.008
 imagination-0.008
 names-0.008
ses-0.008
sed-0.008
Tokenchie
Token ID3043
Feature activation-4.83e-02

Pos logit contributions
rily+0.115
line+0.109
yt+0.098
is+0.097
 revealing+0.096
Neg logit contributions
adders-0.174
 trips-0.163
ats-0.161
 imag-0.157
eware-0.157
Tokenvous
Token ID31222
Feature activation+2.49e-03

Pos logit contributions
ge+0.109
ters+0.099
rier+0.095
is+0.095
ising+0.091
Neg logit contributions
 trips-0.055
 names-0.054
ses-0.053
sed-0.052
 lived-0.051
Token mice
Token ID10693
Feature activation-2.76e-02

Pos logit contributions
 trips+0.003
 names+0.003
ses+0.003
 imagination+0.003
sed+0.003
Neg logit contributions
ge-0.005
ters-0.005
is-0.004
rier-0.004
lling-0.004
Token could
Token ID714
Feature activation-4.03e-02

Pos logit contributions
ge+0.058
ters+0.053
is+0.052
rier+0.052
ising+0.049
Neg logit contributions
 trips-0.036
 names-0.034
ses-0.033
 imagination-0.033
sed-0.033
Token come
Token ID1282
Feature activation-1.69e-02

Pos logit contributions
ge+0.078
ters+0.077
rier+0.075
lling+0.073
is+0.073
Neg logit contributions
 trips-0.055
 names-0.054
 lived-0.053
 imagination-0.050
tr-0.048
Token
Token ID198
Feature activation-2.11e-02

Pos logit contributions
ge+0.032
is+0.029
ters+0.029
Sus+0.027
lling+0.027
Neg logit contributions
ses-0.020
sed-0.019
 names-0.019
 trips-0.019
 imagination-0.018
TokenBut
Token ID1537
Feature activation-8.38e-03

Pos logit contributions
ge+0.038
rier+0.034
ters+0.034
ter+0.031
line+0.030
Neg logit contributions
 trips-0.033
 names-0.033
 lived-0.032
 spears-0.031
 imagination-0.030
Token then
Token ID788
Feature activation-6.94e-03

Pos logit contributions
ge+0.017
is+0.015
ters+0.014
rier+0.014
chie+0.014
Neg logit contributions
 lived-0.012
 trips-0.012
sed-0.011
ses-0.011
 imagination-0.011
Token,
Token ID11
Feature activation+1.14e-02

Pos logit contributions
ge+0.013
ters+0.012
is+0.011
rier+0.011
ising+0.011
Neg logit contributions
ses-0.011
 trips-0.011
 names-0.011
sed-0.010
 imagination-0.010
Token a
Token ID257
Feature activation+3.47e-03

Pos logit contributions
 trips+0.015
 names+0.015
 imagination+0.014
 tum+0.014
 lived+0.014
Neg logit contributions
ge-0.024
is-0.021
ters-0.021
lling-0.020
rier-0.020
Token mis
Token ID2984
Feature activation-9.73e-02

Pos logit contributions
ses+0.005
 trips+0.005
 names+0.005
sed+0.005
 imagination+0.005
Neg logit contributions
ge-0.007
ters-0.006
is-0.006
rier-0.006
ising-0.006
Tokenchie
Token ID3043
Feature activation-4.71e-02

Pos logit contributions
ge+0.111
rily+0.091
lling+0.087
line+0.081
ner+0.078
Neg logit contributions
adders-0.181
 names-0.172
ats-0.169
icks-0.168
inea-0.166
Tokenvous
Token ID31222
Feature activation-7.90e-03

Pos logit contributions
ge+0.110
ters+0.100
rier+0.097
is+0.095
ising+0.092
Neg logit contributions
 trips-0.050
 names-0.049
ses-0.048
sed-0.047
 lived-0.046
Token rabbit
Token ID22746
Feature activation+2.89e-03

Pos logit contributions
ge+0.017
ters+0.015
is+0.014
lling+0.014
rier+0.014
Neg logit contributions
ses-0.011
sed-0.010
 names-0.010
 trips-0.010
 imagination-0.010
Token came
Token ID1625
Feature activation-2.34e-02

Pos logit contributions
 trips+0.004
sed+0.003
 names+0.003
ses+0.003
 lived+0.003
Neg logit contributions
ge-0.006
ters-0.006
rier-0.006
is-0.006
ising-0.005
Token and
Token ID290
Feature activation-2.56e-03

Pos logit contributions
ge+0.046
rier+0.042
is+0.041
line+0.040
ner+0.040
Neg logit contributions
 names-0.033
 trips-0.032
 spears-0.030
sed-0.028
 Rex-0.028
Token
Token ID198
Feature activation-2.64e-02

Pos logit contributions
ge+0.032
is+0.028
Sus+0.027
ters+0.027
ner+0.026
Neg logit contributions
ses-0.034
sed-0.034
 imag-0.031
 names-0.031
 spears-0.030
TokenOne
Token ID3198
Feature activation+2.81e-02

Pos logit contributions
ge+0.053
ters+0.048
is+0.048
Sus+0.044
ising+0.044
Neg logit contributions
ses-0.040
sed-0.039
 names-0.037
 trips-0.036
 imag-0.036
Token day
Token ID1110
Feature activation-1.08e-03

Pos logit contributions
ses+0.044
sed+0.044
 imag+0.042
 trips+0.041
 meanings+0.040
Neg logit contributions
ge-0.051
is-0.047
ters-0.045
ter-0.044
line-0.043
Token,
Token ID11
Feature activation+6.33e-03

Pos logit contributions
ters+0.001
ter+0.001
ner+0.001
ge+0.001
 longer+0.001
Neg logit contributions
ses-0.003
 habitats-0.003
sed-0.002
 meanings-0.002
 imag-0.002
Token a
Token ID257
Feature activation-3.58e-03

Pos logit contributions
ses+0.009
 trips+0.009
 names+0.009
sed+0.009
 imag+0.008
Neg logit contributions
ge-0.012
ters-0.012
ter-0.011
rier-0.011
is-0.011
Token mis
Token ID2984
Feature activation-9.72e-02

Pos logit contributions
ge+0.005
ters+0.005
is+0.005
rier+0.004
ising+0.004
Neg logit contributions
ses-0.007
 trips-0.007
 names-0.007
sed-0.007
 imagination-0.006
Tokenchie
Token ID3043
Feature activation-5.21e-02

Pos logit contributions
ge+0.101
rily+0.076
lling+0.073
rier+0.072
is+0.071
Neg logit contributions
 names-0.200
adders-0.198
tr-0.188
inea-0.187
 imag-0.186
Tokenvous
Token ID31222
Feature activation-4.10e-03

Pos logit contributions
ge+0.115
ters+0.104
is+0.101
rier+0.099
ising+0.096
Neg logit contributions
ses-0.062
 trips-0.062
 names-0.062
sed-0.060
 imagination-0.058
Token young
Token ID1862
Feature activation-6.25e-02

Pos logit contributions
ge+0.009
ters+0.008
is+0.008
rier+0.007
ter+0.007
Neg logit contributions
 trips-0.005
 names-0.005
ses-0.005
sed-0.005
 imagination-0.005
Token boy
Token ID2933
Feature activation-3.31e-02

Pos logit contributions
ge+0.136
lling+0.113
is+0.106
ising+0.105
rying+0.105
Neg logit contributions
 bull-0.092
 barrel-0.091
 rat-0.084
 jungle-0.084
 lived-0.083
Token decided
Token ID3066
Feature activation-2.48e-02

Pos logit contributions
ge+0.071
ters+0.065
is+0.062
rier+0.060
lling+0.059
Neg logit contributions
ses-0.042
 names-0.041
 trips-0.041
sed-0.041
 imagination-0.039
Token flower
Token ID15061
Feature activation+6.45e-03

Pos logit contributions
ge+0.017
ters+0.016
is+0.015
rier+0.015
ising+0.014
Neg logit contributions
ses-0.014
 trips-0.014
 names-0.014
sed-0.014
 imagination-0.013
Token suddenly
Token ID6451
Feature activation-2.43e-02

Pos logit contributions
 trips+0.009
 names+0.009
ses+0.009
sed+0.008
 imagination+0.008
Neg logit contributions
ge-0.013
ters-0.012
is-0.011
rier-0.011
lling-0.011
Token turned
Token ID2900
Feature activation-7.93e-03

Pos logit contributions
ge+0.050
ters+0.045
is+0.043
rier+0.042
ising+0.041
Neg logit contributions
 trips-0.034
 names-0.033
ses-0.033
sed-0.032
 imagination-0.031
Token into
Token ID656
Feature activation-4.45e-03

Pos logit contributions
ge+0.016
ters+0.014
is+0.013
ising+0.013
line+0.012
Neg logit contributions
 trips-0.012
 names-0.012
 tum-0.011
 imagination-0.011
 teeth-0.011
Token a
Token ID257
Feature activation-1.10e-02

Pos logit contributions
ge+0.011
ters+0.010
is+0.010
line+0.010
rier+0.009
Neg logit contributions
 names-0.004
 trips-0.004
 imagination-0.004
ses-0.004
 spears-0.004
Token fa
Token ID24685
Feature activation-9.64e-02

Pos logit contributions
ge+0.024
ters+0.021
rier+0.020
is+0.020
lling+0.020
Neg logit contributions
 trips-0.014
 names-0.014
ses-0.014
sed-0.014
 imagination-0.013
Tokenery
Token ID1924
Feature activation-2.15e-02

Pos logit contributions
ge+0.195
ner+0.154
ters+0.151
line+0.148
Sus+0.137
Neg logit contributions
 names-0.144
 trips-0.143
tr-0.139
 lived-0.137
sed-0.136
Token.
Token ID13
Feature activation-8.75e-03

Pos logit contributions
ge+0.049
Sus+0.041
chie+0.041
lling+0.040
ters+0.040
Neg logit contributions
 barrel-0.026
 trips-0.026
sed-0.025
tr-0.025
ses-0.025
Token The
Token ID383
Feature activation+4.88e-03

Pos logit contributions
ge+0.018
ters+0.017
rier+0.016
is+0.016
ter+0.015
Neg logit contributions
 trips-0.011
ses-0.011
 names-0.011
sed-0.011
 imagination-0.011
Token fa
Token ID24685
Feature activation-9.02e-02

Pos logit contributions
 names+0.007
 tum+0.006
ses+0.006
sed+0.006
 imagination+0.006
Neg logit contributions
ge-0.011
rily-0.009
ters-0.009
rier-0.009
lling-0.009
Tokenery
Token ID1924
Feature activation-2.59e-02

Pos logit contributions
ge+0.170
ters+0.131
line+0.129
ner+0.128
rier+0.124
Neg logit contributions
 names-0.151
 trips-0.144
ses-0.144
sed-0.144
 lived-0.140
Token can
Token ID460
Feature activation-5.00e-02

Pos logit contributions
ge+0.007
ters+0.006
is+0.006
ising+0.006
rier+0.006
Neg logit contributions
ses-0.006
sed-0.005
 trips-0.005
 names-0.005
 spears-0.005
Token spell
Token ID4822
Feature activation+9.12e-03

Pos logit contributions
ge+0.098
ters+0.086
rier+0.083
lling+0.081
rying+0.080
Neg logit contributions
 names-0.079
 trips-0.076
 imagination-0.073
ses-0.069
icks-0.068
Token better
Token ID1365
Feature activation-2.64e-02

Pos logit contributions
ses+0.015
 trips+0.014
 names+0.014
sed+0.014
 imagination+0.014
Neg logit contributions
ge-0.016
ters-0.015
is-0.014
rier-0.014
ising-0.013
Token than
Token ID621
Feature activation-1.17e-02

Pos logit contributions
ge+0.046
ters+0.039
is+0.038
lling+0.037
rier+0.037
Neg logit contributions
 names-0.046
 trips-0.044
 imagination-0.043
ses-0.042
sed-0.041
Token you
Token ID345
Feature activation-1.74e-02

Pos logit contributions
ge+0.022
ters+0.019
rier+0.019
lling+0.019
is+0.019
Neg logit contributions
 trips-0.018
 names-0.018
 imagination-0.018
 spears-0.017
ses-0.016
Token"
Token ID1
Feature activation-9.62e-02

Pos logit contributions
ge+0.027
ters+0.024
is+0.022
rier+0.022
lling+0.021
Neg logit contributions
 names-0.032
 trips-0.032
ses-0.032
sed-0.031
 imagination-0.031
Token or
Token ID393
Feature activation-4.99e-03

Pos logit contributions
ge+0.171
is+0.149
ters+0.143
rier+0.141
ising+0.140
Neg logit contributions
 names-0.162
 trips-0.157
ses-0.149
tr-0.148
sed-0.146
Token "
Token ID366
Feature activation-3.46e-02

Pos logit contributions
ge+0.010
ters+0.009
is+0.009
rier+0.008
ising+0.008
Neg logit contributions
 names-0.007
 trips-0.007
ses-0.007
sed-0.007
 imagination-0.007
TokenYou
Token ID1639
Feature activation+2.07e-02

Pos logit contributions
ge+0.059
rier+0.051
lling+0.050
is+0.049
ters+0.049
Neg logit contributions
 names-0.059
 trips-0.054
tr-0.054
 lived-0.050
 meanings-0.050
Token spelled
Token ID32213
Feature activation-4.34e-03

Pos logit contributions
ses+0.028
 trips+0.028
 names+0.028
sed+0.027
 imagination+0.026
Neg logit contributions
ge-0.042
ters-0.038
is-0.037
rier-0.036
ising-0.035
Token that
Token ID326
Feature activation+2.99e-02

Pos logit contributions
ge+0.008
ters+0.007
is+0.007
rier+0.007
lling+0.006
Neg logit contributions
 names-0.007
 trips-0.007
ses-0.007
sed-0.007
 imagination-0.006
Token the
Token ID262
Feature activation-2.49e-03

Pos logit contributions
 trips+0.051
 names+0.051
ses+0.051
sed+0.050
 imagination+0.049
Neg logit contributions
ge-0.058
ters-0.051
is-0.049
rier-0.048
ising-0.047
Token mess
Token ID2085
Feature activation-3.29e-02

Pos logit contributions
ge+0.005
chie+0.004
Sus+0.004
rying+0.004
shadow+0.004
Neg logit contributions
 names-0.004
 tum-0.004
 trips-0.004
ses-0.004
 teeth-0.004
Token and
Token ID290
Feature activation-2.30e-03

Pos logit contributions
ge+0.074
chie+0.072
shadow+0.065
rier+0.064
Sus+0.063
Neg logit contributions
 trips-0.037
icks-0.036
 books-0.034
 stories-0.033
 is-0.033
Token the
Token ID262
Feature activation-4.60e-03

Pos logit contributions
ge+0.005
ters+0.004
is+0.004
rier+0.004
ising+0.004
Neg logit contributions
 trips-0.003
 names-0.003
ses-0.003
sed-0.003
 imag-0.003
Token qu
Token ID627
Feature activation-1.66e-02

Pos logit contributions
ge+0.009
ters+0.007
rying+0.007
Sus+0.007
line+0.007
Neg logit contributions
 names-0.007
ses-0.007
 tum-0.007
sed-0.007
 trips-0.007
Tokenarre
Token ID9624
Feature activation-9.61e-02

Pos logit contributions
ge+0.033
rier+0.027
ner+0.027
ter+0.027
ters+0.027
Neg logit contributions
 trips-0.024
 names-0.024
ats-0.024
icks-0.023
ses-0.022
Tokenling
Token ID1359
Feature activation+7.16e-04

Pos logit contributions
ge+0.143
Sus+0.130
shadow+0.127
ter+0.123
line+0.121
Neg logit contributions
 stories-0.155
ats-0.139
 trips-0.138
 happens-0.137
 books-0.136
Token twins
Token ID20345
Feature activation-3.49e-02

Pos logit contributions
 names+0.001
 tum+0.001
 barrel+0.001
 stories+0.001
 teeth+0.001
Neg logit contributions
ge-0.002
shadow-0.001
rying-0.001
ner-0.001
is-0.001
Token.
Token ID13
Feature activation+2.68e-03

Pos logit contributions
ge+0.069
Sus+0.064
chie+0.064
rier+0.063
is+0.063
Neg logit contributions
 trips-0.054
 names-0.048
icks-0.048
 imagination-0.044
ses-0.044
Token She
Token ID1375
Feature activation+1.61e-02

Pos logit contributions
ses+0.005
 spears+0.005
sed+0.005
 trips+0.005
 imag+0.005
Neg logit contributions
ge-0.003
ters-0.003
line-0.003
is-0.003
Sus-0.003
Token is
Token ID318
Feature activation+2.53e-02

Pos logit contributions
ses+0.014
 names+0.014
 trips+0.014
sed+0.013
 imagination+0.013
Neg logit contributions
ge-0.041
ters-0.038
is-0.037
rier-0.035
ising-0.035
Token
Token ID198
Feature activation-2.01e-02

Pos logit contributions
ge+0.016
ters+0.015
is+0.014
line+0.014
Sus+0.014
Neg logit contributions
ses-0.014
 spears-0.014
sed-0.014
 trips-0.014
 names-0.014
Token
Token ID198
Feature activation-2.61e-02

Pos logit contributions
ge+0.037
is+0.032
ters+0.031
Sus+0.030
line+0.029
Neg logit contributions
ses-0.034
sed-0.033
 names-0.031
 imag-0.031
 spears-0.031
TokenSuddenly
Token ID38582
Feature activation+1.65e-02

Pos logit contributions
ge+0.050
ters+0.045
rier+0.044
is+0.041
ter+0.041
Neg logit contributions
 trips-0.039
 names-0.038
ses-0.036
 lived-0.036
sed-0.036
Token,
Token ID11
Feature activation+5.52e-03

Pos logit contributions
ses+0.031
 names+0.030
sed+0.029
 habitats+0.029
 spears+0.028
Neg logit contributions
ge-0.025
ters-0.023
is-0.022
ner-0.021
ter-0.021
Token a
Token ID257
Feature activation-5.31e-04

Pos logit contributions
 trips+0.007
 names+0.007
ses+0.007
 imagination+0.007
sed+0.007
Neg logit contributions
ge-0.012
ters-0.010
is-0.010
rier-0.010
lling-0.010
Token mis
Token ID2984
Feature activation-9.58e-02

Pos logit contributions
ge+0.001
ters+0.001
is+0.001
lling+0.001
rier+0.001
Neg logit contributions
 trips-0.001
ses-0.001
 names-0.001
sed-0.001
 imagination-0.001
Tokenchie
Token ID3043
Feature activation-5.04e-02

Pos logit contributions
ge+0.096
rily+0.070
line+0.070
is+0.067
lling+0.066
Neg logit contributions
ses-0.194
 names-0.194
adders-0.193
 imag-0.190
icks-0.188
Tokenvous
Token ID31222
Feature activation-8.24e-03

Pos logit contributions
ge+0.111
ters+0.100
is+0.096
rier+0.095
ising+0.092
Neg logit contributions
 trips-0.060
ses-0.060
 names-0.060
sed-0.059
 imagination-0.056
Token bird
Token ID6512
Feature activation+1.84e-03

Pos logit contributions
ge+0.018
ters+0.016
is+0.015
lling+0.015
rier+0.015
Neg logit contributions
ses-0.010
sed-0.010
 trips-0.010
 names-0.010
 imagination-0.010
Token flew
Token ID13112
Feature activation-2.42e-02

Pos logit contributions
 trips+0.002
ses+0.002
sed+0.002
 names+0.002
 lived+0.002
Neg logit contributions
ge-0.004
ters-0.004
rier-0.003
is-0.003
line-0.003
Token by
Token ID416
Feature activation-1.15e-02

Pos logit contributions
ge+0.050
is+0.043
ner+0.041
ters+0.039
line+0.039
Neg logit contributions
 names-0.036
 spears-0.034
 trips-0.033
sed-0.033
ses-0.033
Token a
Token ID257
Feature activation-1.42e-04

Pos logit contributions
ge+0.012
ters+0.010
is+0.010
rier+0.010
lling+0.010
Neg logit contributions
 trips-0.010
 names-0.010
ses-0.010
sed-0.010
 imagination-0.010
Token sweet
Token ID6029
Feature activation-1.17e-02

Pos logit contributions
ge+0.000
ters+0.000
rier+0.000
lling+0.000
ter+0.000
Neg logit contributions
 trips-0.000
 favorite-0.000
 tum-0.000
 names-0.000
 barrel-0.000
Token tune
Token ID14009
Feature activation+8.85e-03

Pos logit contributions
ge+0.023
ters+0.021
rier+0.020
chie+0.019
lling+0.019
Neg logit contributions
 trips-0.017
 names-0.017
 imagination-0.016
 tum-0.016
ses-0.016
Token,
Token ID11
Feature activation-8.20e-03

Pos logit contributions
 names+0.011
 trips+0.011
ses+0.011
sed+0.011
 imagination+0.011
Neg logit contributions
ge-0.019
ters-0.017
is-0.016
rier-0.016
lling-0.016
Token a
Token ID257
Feature activation+8.04e-03

Pos logit contributions
ge+0.015
ters+0.013
is+0.012
rier+0.012
lling+0.012
Neg logit contributions
 trips-0.014
 names-0.014
ses-0.013
 imagination-0.013
sed-0.013
Token mis
Token ID2984
Feature activation-9.57e-02

Pos logit contributions
sed+0.012
 trips+0.012
ses+0.011
 names+0.011
 tum+0.011
Neg logit contributions
ge-0.016
ters-0.014
lling-0.013
is-0.013
rier-0.013
Tokenchie
Token ID3043
Feature activation-4.51e-02

Pos logit contributions
ge+0.099
rily+0.076
rier+0.075
lling+0.072
is+0.072
Neg logit contributions
 names-0.179
adders-0.179
tr-0.178
ses-0.175
gen-0.173
Tokenvous
Token ID31222
Feature activation-3.20e-03

Pos logit contributions
ge+0.103
ters+0.093
rier+0.090
is+0.089
ising+0.086
Neg logit contributions
 trips-0.050
 names-0.050
ses-0.049
sed-0.048
 imagination-0.046
Token fox
Token ID21831
Feature activation+6.73e-03

Pos logit contributions
ge+0.007
ters+0.006
lling+0.006
is+0.006
rier+0.005
Neg logit contributions
ses-0.004
sed-0.004
 names-0.004
 trips-0.004
 imagination-0.004
Token appeared
Token ID4120
Feature activation-1.57e-02

Pos logit contributions
 trips+0.009
 names+0.008
sed+0.008
ses+0.008
 lived+0.008
Neg logit contributions
ge-0.014
ters-0.013
rier-0.013
is-0.012
ising-0.012
Token.
Token ID13
Feature activation-1.11e-02

Pos logit contributions
ge+0.027
is+0.024
ters+0.023
rier+0.022
ner+0.022
Neg logit contributions
 names-0.026
 trips-0.026
sed-0.024
ses-0.024
 spears-0.024
Token.
Token ID13
Feature activation+2.13e-04

Pos logit contributions
ge+0.048
ters+0.043
is+0.041
rier+0.040
ising+0.039
Neg logit contributions
ses-0.042
 trips-0.042
 names-0.042
sed-0.041
 imagination-0.040
Token One
Token ID1881
Feature activation+3.49e-02

Pos logit contributions
ses+0.000
sed+0.000
 trips+0.000
 names+0.000
 imag+0.000
Neg logit contributions
ge-0.000
ters-0.000
is-0.000
ner-0.000
line-0.000
Token day
Token ID1110
Feature activation+1.61e-04

Pos logit contributions
ses+0.045
sed+0.044
 names+0.043
 trips+0.043
 spears+0.042
Neg logit contributions
ge-0.074
ters-0.066
is-0.066
ter-0.063
ising-0.062
Token,
Token ID11
Feature activation+1.02e-02

Pos logit contributions
ses+0.000
sed+0.000
 habitats+0.000
 names+0.000
tr+0.000
Neg logit contributions
ge-0.000
ters-0.000
ner-0.000
ter-0.000
is-0.000
Token a
Token ID257
Feature activation-1.86e-04

Pos logit contributions
 trips+0.012
 names+0.012
ses+0.012
sed+0.011
 imagination+0.011
Neg logit contributions
ge-0.023
is-0.021
ters-0.021
rier-0.020
lling-0.020
Token mis
Token ID2984
Feature activation-9.52e-02

Pos logit contributions
ge+0.000
ters+0.000
is+0.000
rier+0.000
ising+0.000
Neg logit contributions
ses-0.000
 trips-0.000
 names-0.000
sed-0.000
 imagination-0.000
Tokensel
Token ID741
Feature activation-4.89e-02

Pos logit contributions
ge+0.099
rily+0.079
lling+0.077
is+0.071
izing+0.070
Neg logit contributions
 names-0.189
adders-0.188
tr-0.174
gen-0.174
 imag-0.173
Token went
Token ID1816
Feature activation-3.64e-02

Pos logit contributions
ge+0.103
rier+0.092
ters+0.088
chie+0.085
rying+0.084
Neg logit contributions
 lived-0.070
 trips-0.067
 tum-0.066
 names-0.064
 barrel-0.063
Token into
Token ID656
Feature activation-1.02e-02

Pos logit contributions
ge+0.090
rily+0.079
rier+0.078
ters+0.078
is+0.077
Neg logit contributions
 trips-0.041
 names-0.034
 books-0.033
 dictionary-0.032
 barrel-0.032
Token the
Token ID262
Feature activation-2.20e-02

Pos logit contributions
rier+0.025
ters+0.024
ivering+0.024
is+0.023
ge+0.023
Neg logit contributions
 trips-0.011
 books-0.010
 imagination-0.009
 names-0.009
 teeth-0.009
Token store
Token ID3650
Feature activation-3.93e-02

Pos logit contributions
ge+0.039
ters+0.034
is+0.032
rier+0.031
lling+0.031
Neg logit contributions
 trips-0.038
 names-0.037
ses-0.036
 imagination-0.036
sed-0.036
Token looked
Token ID3114
Feature activation-8.70e-03

Pos logit contributions
ge+0.033
rier+0.029
ters+0.028
is+0.027
line+0.026
Neg logit contributions
 trips-0.026
sed-0.026
 names-0.026
ses-0.026
 lived-0.025
Token at
Token ID379
Feature activation-6.42e-03

Pos logit contributions
ge+0.016
ters+0.014
is+0.014
rier+0.013
lling+0.013
Neg logit contributions
ses-0.014
 names-0.014
 trips-0.014
sed-0.013
 imagination-0.013
Token him
Token ID683
Feature activation+1.09e-02

Pos logit contributions
ge+0.011
ters+0.009
is+0.009
rier+0.009
lling+0.008
Neg logit contributions
ses-0.011
 trips-0.011
sed-0.011
 names-0.011
 imagination-0.011
Token with
Token ID351
Feature activation-1.41e-02

Pos logit contributions
 trips+0.018
 names+0.017
 teeth+0.016
 stories+0.016
ses+0.016
Neg logit contributions
ge-0.019
chie-0.017
ters-0.017
Sus-0.017
is-0.017
Token a
Token ID257
Feature activation-6.87e-04

Pos logit contributions
ge+0.027
ters+0.024
rier+0.023
is+0.023
ising+0.023
Neg logit contributions
ses-0.020
 trips-0.020
sed-0.020
tr-0.019
 names-0.019
Token mis
Token ID2984
Feature activation-9.42e-02

Pos logit contributions
ge+0.001
ters+0.001
is+0.001
rier+0.001
ising+0.001
Neg logit contributions
ses-0.001
 trips-0.001
 names-0.001
sed-0.001
 imagination-0.001
Tokenchie
Token ID3043
Feature activation-3.04e-02

Pos logit contributions
ge+0.087
ner+0.063
ter+0.062
ters+0.061
rier+0.060
Neg logit contributions
 names-0.208
icks-0.203
ats-0.201
ses-0.200
 imag-0.196
Tokenvous
Token ID31222
Feature activation+3.39e-03

Pos logit contributions
ge+0.060
is+0.055
ters+0.055
lling+0.052
rier+0.051
Neg logit contributions
ses-0.043
sed-0.042
 trips-0.042
 names-0.041
 imagination-0.040
Token grin
Token ID16770
Feature activation-3.99e-03

Pos logit contributions
ses+0.006
 names+0.005
sed+0.005
 trips+0.005
 imagination+0.005
Neg logit contributions
ge-0.006
ters-0.005
is-0.005
rier-0.005
lling-0.005
Token.
Token ID13
Feature activation-2.05e-02

Pos logit contributions
ge+0.008
chie+0.007
Sus+0.007
is+0.007
ters+0.007
Neg logit contributions
 trips-0.005
 names-0.005
 tum-0.005
 teeth-0.005
ses-0.005
Token She
Token ID1375
Feature activation+5.47e-04

Pos logit contributions
ge+0.035
ters+0.032
is+0.030
line+0.030
rier+0.029
Neg logit contributions
ses-0.033
 trips-0.033
sed-0.033
 names-0.032
 imag-0.031
Token the
Token ID262
Feature activation-6.58e-03

Pos logit contributions
ses+0.053
 trips+0.053
sed+0.052
 names+0.052
 imag+0.051
Neg logit contributions
ge-0.059
ters-0.056
is-0.051
ter-0.050
rier-0.050
Token torn
Token ID12445
Feature activation-2.21e-02

Pos logit contributions
ge+0.014
ters+0.012
is+0.012
rier+0.012
lling+0.012
Neg logit contributions
 names-0.009
ses-0.009
 trips-0.009
sed-0.008
 imagination-0.008
Token book
Token ID1492
Feature activation+1.40e-02

Pos logit contributions
ge+0.050
rying+0.045
is+0.044
lling+0.043
shadow+0.043
Neg logit contributions
 teeth-0.026
 tum-0.025
ses-0.025
 trips-0.025
sed-0.025
Token and
Token ID290
Feature activation+4.24e-03

Pos logit contributions
 trips+0.023
tr+0.021
 names+0.021
ses+0.021
sed+0.021
Neg logit contributions
ge-0.026
ters-0.023
chie-0.022
is-0.022
rier-0.022
Token the
Token ID262
Feature activation-1.64e-02

Pos logit contributions
 trips+0.006
 names+0.006
ses+0.006
sed+0.006
 imagination+0.006
Neg logit contributions
ge-0.008
ters-0.008
is-0.007
rier-0.007
ising-0.007
Token unw
Token ID7379
Feature activation-9.41e-02

Pos logit contributions
ge+0.035
ters+0.031
is+0.030
rier+0.030
lling+0.029
Neg logit contributions
ses-0.021
 names-0.021
 trips-0.021
sed-0.021
 imagination-0.020
Tokenra
Token ID430
Feature activation-2.40e-02

Pos logit contributions
ge+0.150
ters+0.144
line+0.144
is+0.140
ner+0.137
Neg logit contributions
sed-0.159
ses-0.157
 trips-0.155
 names-0.147
 teeth-0.145
Tokenpped
Token ID1496
Feature activation-1.44e-02

Pos logit contributions
ge+0.042
rier+0.041
ising+0.038
ters+0.038
chie+0.038
Neg logit contributions
 trips-0.036
 lived-0.033
zed-0.033
ats-0.033
 names-0.032
Token box
Token ID3091
Feature activation+1.76e-02

Pos logit contributions
ge+0.028
is+0.025
ters+0.025
ising+0.024
rier+0.023
Neg logit contributions
 trips-0.021
ses-0.021
 names-0.021
sed-0.020
 spears-0.019
Token.
Token ID13
Feature activation+1.01e-02

Pos logit contributions
 tum+0.018
 trips+0.018
 favorite+0.017
 names+0.017
 teeth+0.016
Neg logit contributions
ge-0.037
is-0.037
chie-0.036
hr-0.036
Sus-0.035
Token She
Token ID1375
Feature activation+1.55e-02

Pos logit contributions
ses+0.017
 trips+0.017
 names+0.017
sed+0.017
 imagination+0.016
Neg logit contributions
ge-0.017
ters-0.015
is-0.015
rier-0.014
line-0.014
Token!
Token ID0
Feature activation+2.02e-02

Pos logit contributions
ge+0.059
ters+0.053
rier+0.052
is+0.051
lling+0.049
Neg logit contributions
 trips-0.056
 names-0.056
ses-0.055
sed-0.055
 imagination-0.053
Token I
Token ID314
Feature activation-9.04e-03

Pos logit contributions
 trips+0.033
 names+0.032
ses+0.032
sed+0.031
 imag+0.031
Neg logit contributions
ge-0.034
ters-0.033
rier-0.029
is-0.029
ner-0.029
Token'm
Token ID1101
Feature activation-1.19e-04

Pos logit contributions
ge+0.021
ters+0.019
is+0.019
rier+0.018
lling+0.018
Neg logit contributions
 trips-0.010
 names-0.010
ses-0.010
sed-0.010
 imagination-0.009
Token the
Token ID262
Feature activation-1.97e-02

Pos logit contributions
ge+0.000
ters+0.000
is+0.000
rier+0.000
ising+0.000
Neg logit contributions
 trips-0.000
ses-0.000
sed-0.000
 imagination-0.000
 names-0.000
Token best
Token ID1266
Feature activation-1.81e-02

Pos logit contributions
ge+0.043
ters+0.037
is+0.036
rier+0.035
rily+0.035
Neg logit contributions
 names-0.027
ses-0.026
 trips-0.026
sed-0.025
 imagination-0.025
Token gol
Token ID43791
Feature activation-9.40e-02

Pos logit contributions
ge+0.041
rier+0.035
ters+0.035
rily+0.035
lling+0.035
Neg logit contributions
 trips-0.023
 names-0.022
ses-0.022
 imagination-0.022
sed-0.021
Tokenfer
Token ID2232
Feature activation-3.52e-02

Pos logit contributions
ge+0.122
rier+0.095
is+0.094
rily+0.092
lling+0.089
Neg logit contributions
sed-0.191
 trips-0.188
ses-0.188
 names-0.180
ggy-0.178
Token in
Token ID287
Feature activation-5.95e-03

Pos logit contributions
ge+0.062
ters+0.052
is+0.052
rier+0.051
lling+0.049
Neg logit contributions
 trips-0.058
 names-0.056
ses-0.056
sed-0.056
 imagination-0.056
Token the
Token ID262
Feature activation-1.41e-02

Pos logit contributions
ge+0.011
ters+0.010
is+0.010
rier+0.010
line+0.009
Neg logit contributions
 trips-0.010
 imagination-0.009
 names-0.009
 spears-0.009
ses-0.009
Token world
Token ID995
Feature activation-1.45e-02

Pos logit contributions
ge+0.031
ters+0.028
is+0.027
rier+0.027
lling+0.025
Neg logit contributions
 trips-0.018
 names-0.018
ses-0.018
sed-0.017
 imagination-0.017
Token!"
Token ID2474
Feature activation-1.23e-02

Pos logit contributions
ge+0.025
ters+0.022
is+0.021
rier+0.021
ising+0.020
Neg logit contributions
 trips-0.025
 names-0.024
ses-0.023
sed-0.023
 imagination-0.023
Token!
Token ID0
Feature activation+1.78e-02

Pos logit contributions
rier+0.039
rily+0.038
ge+0.037
ters+0.035
lling+0.034
Neg logit contributions
 trips-0.031
 names-0.028
 bone-0.026
 lived-0.026
 spears-0.026
Token I
Token ID314
Feature activation-9.63e-03

Pos logit contributions
 trips+0.027
ses+0.027
 names+0.026
sed+0.026
 imagination+0.025
Neg logit contributions
ge-0.033
ters-0.030
is-0.029
rier-0.028
ising-0.027
Token'm
Token ID1101
Feature activation-2.37e-03

Pos logit contributions
ge+0.022
ters+0.020
rier+0.020
is+0.020
line+0.019
Neg logit contributions
 trips-0.011
 names-0.011
ses-0.011
sed-0.010
 imagination-0.010
Token the
Token ID262
Feature activation-1.88e-02

Pos logit contributions
ge+0.005
ters+0.005
is+0.004
ising+0.004
rier+0.004
Neg logit contributions
 trips-0.003
ses-0.003
sed-0.003
 imagination-0.003
 names-0.003
Token best
Token ID1266
Feature activation-1.88e-02

Pos logit contributions
ge+0.042
ters+0.035
rily+0.035
rier+0.035
is+0.034
Neg logit contributions
 names-0.025
 trips-0.024
ses-0.024
sed-0.023
 imagination-0.023
Token gol
Token ID43791
Feature activation-9.39e-02

Pos logit contributions
ge+0.038
rily+0.034
lling+0.033
ters+0.033
rier+0.033
Neg logit contributions
 trips-0.027
 imagination-0.025
 names-0.025
ses-0.025
sed-0.024
Tokenfer
Token ID2232
Feature activation-3.75e-02

Pos logit contributions
ge+0.121
is+0.093
rier+0.092
rily+0.091
line+0.087
Neg logit contributions
sed-0.190
 trips-0.189
ses-0.186
ggy-0.179
 names-0.179
Token in
Token ID287
Feature activation-8.73e-03

Pos logit contributions
ge+0.069
ters+0.058
is+0.057
rier+0.056
lling+0.055
Neg logit contributions
 trips-0.059
 lived-0.057
sed-0.056
 imagination-0.056
 names-0.055
Token the
Token ID262
Feature activation-1.48e-02

Pos logit contributions
ge+0.016
ters+0.015
rier+0.014
is+0.014
line+0.014
Neg logit contributions
 trips-0.014
 imagination-0.013
 names-0.013
 spears-0.013
ses-0.013
Token world
Token ID995
Feature activation-1.69e-02

Pos logit contributions
ge+0.032
ters+0.029
rier+0.028
is+0.027
lling+0.026
Neg logit contributions
 trips-0.019
 names-0.019
ses-0.018
 imagination-0.018
sed-0.018
Token and
Token ID290
Feature activation-4.56e-03

Pos logit contributions
ge+0.031
ters+0.026
is+0.025
rier+0.025
Sus+0.025
Neg logit contributions
 trips-0.028
 names-0.026
ses-0.025
sed-0.025
 imagination-0.025
Token grass
Token ID8701
Feature activation-1.05e-02

Pos logit contributions
ge+0.031
ters+0.030
is+0.029
rier+0.028
ising+0.028
Neg logit contributions
ses-0.020
sed-0.019
icks-0.019
 trips-0.019
 names-0.019
Token around
Token ID1088
Feature activation-1.77e-03

Pos logit contributions
ge+0.022
ters+0.020
is+0.019
rier+0.019
ising+0.018
Neg logit contributions
ses-0.013
 trips-0.013
 names-0.013
sed-0.013
 imagination-0.013
Token the
Token ID262
Feature activation-2.84e-02

Pos logit contributions
ge+0.003
ters+0.003
is+0.003
rier+0.003
ising+0.003
Neg logit contributions
 trips-0.003
ses-0.003
 names-0.003
sed-0.003
 imagination-0.003
Token hill
Token ID12788
Feature activation-1.73e-02

Pos logit contributions
ge+0.066
ters+0.060
is+0.058
rier+0.057
ising+0.056
Neg logit contributions
ses-0.031
 names-0.031
 trips-0.031
sed-0.030
 imagination-0.029
Token was
Token ID373
Feature activation-2.30e-02

Pos logit contributions
ge+0.038
rier+0.033
is+0.033
ters+0.033
chie+0.032
Neg logit contributions
 trips-0.022
 lived-0.021
 barrel-0.020
 names-0.019
ses-0.019
Token wit
Token ID20868
Feature activation-9.36e-02

Pos logit contributions
ge+0.044
ters+0.039
is+0.038
rier+0.037
ising+0.036
Neg logit contributions
 trips-0.035
ses-0.034
 names-0.034
sed-0.034
 imagination-0.033
Tokenhered
Token ID6083
Feature activation-3.92e-02

Pos logit contributions
rily+0.166
rier+0.162
Sus+0.154
ge+0.153
iza+0.148
Neg logit contributions
 trips-0.136
tr-0.117
 Up-0.102
 walks-0.099
ats-0.098
Token and
Token ID290
Feature activation-1.26e-02

Pos logit contributions
ge+0.064
is+0.064
ner+0.059
ters+0.058
rier+0.058
Neg logit contributions
 trips-0.068
 names-0.062
 spears-0.062
 favorite-0.060
 barrel-0.060
Token yellow
Token ID7872
Feature activation+2.90e-02

Pos logit contributions
ge+0.025
ters+0.023
is+0.022
rier+0.021
ising+0.021
Neg logit contributions
 trips-0.018
ses-0.018
 names-0.017
sed-0.017
 imagination-0.017
Token.
Token ID13
Feature activation-1.63e-02

Pos logit contributions
ses+0.054
 trips+0.054
 names+0.054
sed+0.053
 imagination+0.052
Neg logit contributions
ge-0.045
ters-0.039
is-0.037
rier-0.036
lling-0.035
Token He
Token ID679
Feature activation-1.21e-04

Pos logit contributions
ge+0.031
ters+0.029
rier+0.027
is+0.027
line+0.026
Neg logit contributions
ses-0.023
 names-0.023
 trips-0.023
sed-0.023
 imag-0.022
Token best
Token ID1266
Feature activation-8.76e-05

Pos logit contributions
ge+0.015
ters+0.013
is+0.013
rier+0.013
ter+0.012
Neg logit contributions
 trips-0.011
ses-0.010
 names-0.010
 spears-0.010
sed-0.010
Token friend
Token ID1545
Feature activation-2.22e-03

Pos logit contributions
ge+0.000
ters+0.000
is+0.000
rier+0.000
ising+0.000
Neg logit contributions
ses-0.000
 trips-0.000
 names-0.000
sed-0.000
 imagination-0.000
Token a
Token ID257
Feature activation-7.46e-03

Pos logit contributions
ge+0.003
ters+0.003
is+0.002
ter+0.002
rier+0.002
Neg logit contributions
ses-0.005
 names-0.005
sed-0.004
 trips-0.004
 imag-0.004
Token long
Token ID890
Feature activation+2.01e-02

Pos logit contributions
ge+0.015
ters+0.014
is+0.013
rier+0.013
ising+0.012
Neg logit contributions
 trips-0.010
ses-0.010
 names-0.010
sed-0.010
 spears-0.009
Token time
Token ID640
Feature activation-4.67e-02

Pos logit contributions
ses+0.021
 trips+0.021
 names+0.021
sed+0.021
 imagination+0.020
Neg logit contributions
ge-0.047
ters-0.043
is-0.041
rier-0.041
lling-0.040
Token ago
Token ID2084
Feature activation-9.31e-02

Pos logit contributions
ge+0.091
rier+0.083
ters+0.083
is+0.082
lling+0.081
Neg logit contributions
 trips-0.068
 names-0.066
 imagination-0.066
ses-0.061
sed-0.059
Token.
Token ID13
Feature activation+1.48e-02

Pos logit contributions
ge+0.169
is+0.147
ters+0.145
lling+0.143
rier+0.141
Neg logit contributions
 trips-0.151
 names-0.150
ses-0.148
sed-0.147
 imagination-0.143
Token He
Token ID679
Feature activation+2.00e-02

Pos logit contributions
 trips+0.026
ses+0.025
sed+0.025
 spears+0.024
 names+0.024
Neg logit contributions
ge-0.024
ters-0.022
rier-0.020
is-0.020
ter-0.019
Token was
Token ID373
Feature activation+1.91e-02

Pos logit contributions
ses+0.018
 names+0.018
 trips+0.018
sed+0.018
 spears+0.017
Neg logit contributions
ge-0.049
ters-0.045
is-0.044
rier-0.043
ising-0.042
Token my
Token ID616
Feature activation-1.99e-02

Pos logit contributions
 trips+0.026
 names+0.025
ses+0.025
sed+0.024
 imagination+0.024
Neg logit contributions
ge-0.041
ters-0.037
ising-0.035
line-0.035
is-0.035
Token dog
Token ID3290
Feature activation+2.38e-02

Pos logit contributions
ge+0.044
ters+0.040
is+0.039
rier+0.038
ising+0.037
Neg logit contributions
ses-0.024
 trips-0.023
 names-0.023
sed-0.023
 imagination-0.022
Token toys
Token ID14958
Feature activation-1.34e-02

Pos logit contributions
ses+0.005
sed+0.005
 habitats+0.004
 trips+0.004
 names+0.004
Neg logit contributions
ge-0.007
ters-0.006
ising-0.005
is-0.005
rier-0.005
Token.
Token ID13
Feature activation+3.91e-03

Pos logit contributions
is+0.026
ge+0.026
ters+0.025
ivering+0.025
chie+0.025
Neg logit contributions
 trips-0.017
 favorite-0.016
 books-0.015
 names-0.015
 favourite-0.014
Token<|endoftext|>
Token ID50256
Feature activation+3.43e-02

Pos logit contributions
 trips+0.006
ses+0.006
sed+0.006
 spears+0.006
 imag+0.006
Neg logit contributions
ge-0.006
ters-0.006
line-0.006
is-0.005
Sus-0.005
TokenOnce
Token ID7454
Feature activation+1.94e-02

Pos logit contributions
ses+0.049
sed+0.048
 names+0.047
 trips+0.046
 imagination+0.045
Neg logit contributions
ge-0.068
ters-0.062
is-0.059
rier-0.056
lling-0.056
Token upon
Token ID2402
Feature activation+2.68e-02

Pos logit contributions
ses+0.043
 names+0.042
 trips+0.042
sed+0.042
 imag+0.040
Neg logit contributions
ge-0.024
ters-0.019
is-0.019
rier-0.017
line-0.016
Token a
Token ID257
Feature activation+6.51e-02

Pos logit contributions
ses+0.036
sed+0.035
 names+0.033
 trips+0.033
 imag+0.032
Neg logit contributions
ge-0.056
ters-0.050
is-0.048
rier-0.046
lling-0.046
Token time
Token ID640
Feature activation-7.21e-03

Pos logit contributions
ses+0.099
 names+0.095
sed+0.095
 spears+0.094
 meanings+0.093
Neg logit contributions
ge-0.116
is-0.114
 longer-0.096
ising-0.094
Sus-0.094
Token,
Token ID11
Feature activation+1.81e-02

Pos logit contributions
ge+0.010
ters+0.008
is+0.008
ner+0.008
ter+0.007
Neg logit contributions
 meanings-0.015
ses-0.015
 names-0.014
 habitats-0.014
sed-0.014
Token there
Token ID612
Feature activation-2.03e-02

Pos logit contributions
ses+0.035
 names+0.035
 trips+0.035
sed+0.035
 imag+0.034
Neg logit contributions
ge-0.026
ters-0.023
is-0.021
rier-0.020
ter-0.020
Token was
Token ID373
Feature activation+2.46e-02

Pos logit contributions
ge+0.027
ters+0.022
mine+0.021
ter+0.021
is+0.021
Neg logit contributions
ses-0.042
 spears-0.041
sed-0.040
 helmets-0.039
 meanings-0.039
Token a
Token ID257
Feature activation+1.77e-03

Pos logit contributions
sed+0.039
ses+0.039
 meanings+0.037
 names+0.037
 trips+0.036
Neg logit contributions
ge-0.040
ters-0.039
is-0.036
ter-0.036
Sus-0.036
Token park
Token ID3952
Feature activation+8.70e-03

Pos logit contributions
 names+0.021
ses+0.020
 trips+0.020
sed+0.020
 imagination+0.019
Neg logit contributions
ge-0.035
ters-0.032
is-0.031
rier-0.030
lling-0.030
Token.
Token ID13
Feature activation+2.12e-03

Pos logit contributions
 trips+0.014
 names+0.013
 tum+0.013
 stories+0.013
 favorite+0.013
Neg logit contributions
ge-0.016
ters-0.014
is-0.014
chie-0.013
rier-0.013
Token<|endoftext|>
Token ID50256
Feature activation+3.37e-02

Pos logit contributions
ses+0.004
 trips+0.004
sed+0.003
 names+0.003
 imag+0.003
Neg logit contributions
ge-0.003
Sus-0.003
ters-0.003
is-0.003
line-0.003
TokenOnce
Token ID7454
Feature activation+1.91e-02

Pos logit contributions
ses+0.048
sed+0.047
 names+0.046
 trips+0.046
 imagination+0.045
Neg logit contributions
ge-0.067
ters-0.060
is-0.058
rier-0.055
lling-0.055
Token upon
Token ID2402
Feature activation+2.42e-02

Pos logit contributions
ses+0.042
 names+0.041
sed+0.041
 trips+0.041
 imag+0.040
Neg logit contributions
ge-0.024
ters-0.019
is-0.018
rier-0.017
line-0.016
Token a
Token ID257
Feature activation+6.66e-02

Pos logit contributions
ses+0.032
sed+0.031
 names+0.030
 trips+0.030
 imag+0.029
Neg logit contributions
ge-0.051
ters-0.046
is-0.044
lling-0.042
rier-0.042
Token time
Token ID640
Feature activation-8.40e-03

Pos logit contributions
ses+0.101
 spears+0.098
 names+0.097
sed+0.097
 meanings+0.095
Neg logit contributions
ge-0.118
is-0.115
ising-0.097
 longer-0.097
Sus-0.096
Token,
Token ID11
Feature activation+1.79e-02

Pos logit contributions
ge+0.012
ters+0.010
is+0.009
Sus+0.009
ner+0.009
Neg logit contributions
 meanings-0.017
ses-0.017
 names-0.016
 habitats-0.016
 spears-0.016
Token there
Token ID612
Feature activation-2.21e-02

Pos logit contributions
ses+0.034
 names+0.034
 trips+0.034
sed+0.033
 imagination+0.032
Neg logit contributions
ge-0.027
ters-0.023
is-0.022
rier-0.021
lling-0.020
Token was
Token ID373
Feature activation+2.52e-02

Pos logit contributions
ge+0.030
mine+0.024
ters+0.023
ter+0.023
is+0.023
Neg logit contributions
ses-0.046
 spears-0.045
sed-0.044
 meanings-0.043
 helmets-0.043
Token a
Token ID257
Feature activation+2.58e-03

Pos logit contributions
sed+0.039
ses+0.039
 names+0.037
 meanings+0.037
 trips+0.036
Neg logit contributions
ge-0.043
ters-0.041
is-0.039
Sus-0.038
ter-0.038
Token his
Token ID465
Feature activation+1.50e-02

Pos logit contributions
ge+0.018
ters+0.016
is+0.015
rier+0.015
lling+0.014
Neg logit contributions
ses-0.017
 trips-0.017
 names-0.017
sed-0.016
 imagination-0.016
Token mouth
Token ID5422
Feature activation+3.87e-02

Pos logit contributions
 meanings+0.020
 lived+0.020
 habitats+0.020
 trips+0.020
ses+0.020
Neg logit contributions
ge-0.026
rier-0.024
ters-0.024
rily-0.023
line-0.023
Token and
Token ID290
Feature activation+1.36e-04

Pos logit contributions
 names+0.072
 trips+0.072
ses+0.071
sed+0.070
 spears+0.067
Neg logit contributions
ge-0.061
ters-0.051
is-0.050
rier-0.048
lling-0.047
Token showed
Token ID3751
Feature activation-4.72e-03

Pos logit contributions
ses+0.000
 names+0.000
 trips+0.000
sed+0.000
 imagination+0.000
Neg logit contributions
ge-0.000
ters-0.000
is-0.000
rier-0.000
lling-0.000
Token his
Token ID465
Feature activation+3.67e-02

Pos logit contributions
ge+0.007
ters+0.007
is+0.006
rier+0.006
lling+0.006
Neg logit contributions
ses-0.009
 trips-0.008
sed-0.008
 names-0.008
 imagination-0.008
Token teeth
Token ID9941
Feature activation+5.35e-02

Pos logit contributions
tr+0.049
ses+0.047
 trips+0.046
 lived+0.045
 meanings+0.045
Neg logit contributions
ge-0.070
ters-0.063
rier-0.063
rily-0.059
is-0.058
Token.
Token ID13
Feature activation-8.22e-04

Pos logit contributions
 names+0.095
 trips+0.094
ses+0.091
sed+0.089
 imagination+0.088
Neg logit contributions
ge-0.086
ters-0.077
is-0.074
rier-0.072
ising-0.070
Token L
Token ID406
Feature activation-1.96e-02

Pos logit contributions
ge+0.001
ters+0.001
is+0.001
rier+0.001
line+0.001
Neg logit contributions
ses-0.001
 trips-0.001
 names-0.001
sed-0.001
 imagination-0.001
Tokenila
Token ID10102
Feature activation-8.11e-03

Pos logit contributions
ge+0.038
rier+0.034
ters+0.033
ising+0.031
chie+0.031
Neg logit contributions
 trips-0.029
 names-0.028
ses-0.027
ats-0.026
sed-0.026
Token and
Token ID290
Feature activation-7.80e-03

Pos logit contributions
ge+0.011
ters+0.010
is+0.009
rier+0.008
lling+0.008
Neg logit contributions
ses-0.016
 names-0.016
 trips-0.016
sed-0.016
 imagination-0.016
Token Ben
Token ID3932
Feature activation-1.40e-02

Pos logit contributions
ge+0.017
ters+0.016
is+0.015
rier+0.015
ising+0.014
Neg logit contributions
ses-0.009
 trips-0.009
 names-0.009
sed-0.009
 imagination-0.009
Token.
Token ID13
Feature activation-3.73e-03

Pos logit contributions
ge+0.039
chie+0.038
ters+0.036
rier+0.036
Sus+0.034
Neg logit contributions
 trips-0.030
 imagination-0.026
 names-0.026
 books-0.026
 lived-0.026
Token She
Token ID1375
Feature activation+1.20e-02

Pos logit contributions
ge+0.006
ters+0.006
rier+0.005
is+0.005
line+0.005
Neg logit contributions
ses-0.006
 trips-0.006
sed-0.006
 names-0.006
 imag-0.006
Token said
Token ID531
Feature activation+8.12e-03

Pos logit contributions
 trips+0.019
 names+0.019
ses+0.019
sed+0.018
 imagination+0.018
Neg logit contributions
ge-0.022
ters-0.020
is-0.019
rier-0.019
ising-0.018
Token,
Token ID11
Feature activation+2.33e-02

Pos logit contributions
ses+0.013
 trips+0.013
 names+0.013
sed+0.012
 imag+0.012
Neg logit contributions
ge-0.014
ters-0.013
rier-0.013
is-0.013
line-0.012
Token "
Token ID366
Feature activation-3.64e-03

Pos logit contributions
 names+0.020
ses+0.019
sed+0.019
 trips+0.019
 imagination+0.018
Neg logit contributions
ge-0.060
ters-0.055
is-0.053
ising-0.052
lling-0.052
TokenOh
Token ID5812
Feature activation+5.69e-02

Pos logit contributions
ge+0.008
rier+0.007
rying+0.007
ising+0.007
ters+0.007
Neg logit contributions
 names-0.005
tr-0.004
 imagination-0.004
 meanings-0.004
 trips-0.004
Token Jack
Token ID3619
Feature activation+1.30e-02

Pos logit contributions
 trips+0.091
ses+0.090
 habitats+0.088
 spears+0.088
 names+0.088
Neg logit contributions
ge-0.094
ters-0.087
ter-0.087
chie-0.084
rier-0.083
Token,
Token ID11
Feature activation+1.97e-02

Pos logit contributions
 names+0.015
 trips+0.015
sed+0.015
ses+0.014
 imagination+0.014
Neg logit contributions
ge-0.029
ters-0.027
rier-0.026
is-0.026
lling-0.025
Token these
Token ID777
Feature activation+6.65e-03

Pos logit contributions
 names+0.031
ses+0.031
 trips+0.031
sed+0.030
 imagination+0.030
Neg logit contributions
ge-0.036
ters-0.032
is-0.031
rier-0.030
ising-0.029
Token are
Token ID389
Feature activation-2.04e-03

Pos logit contributions
 names+0.008
 trips+0.007
 imagination+0.007
ses+0.007
sed+0.007
Neg logit contributions
ge-0.016
ters-0.014
is-0.014
rier-0.013
lling-0.013
Token beautiful
Token ID4950
Feature activation+1.90e-03

Pos logit contributions
ge+0.004
ters+0.003
is+0.003
rier+0.003
ising+0.003
Neg logit contributions
 names-0.003
 trips-0.003
ses-0.003
sed-0.003
 imagination-0.003
Token
Token ID198
Feature activation-1.23e-02

Pos logit contributions
ge+0.008
ters+0.007
is+0.007
rier+0.006
Sus+0.006
Neg logit contributions
ses-0.006
sed-0.006
 names-0.005
 trips-0.005
 imag-0.005
Token"
Token ID1
Feature activation-3.17e-02

Pos logit contributions
ge+0.024
rier+0.020
ters+0.020
is+0.019
lling+0.019
Neg logit contributions
 trips-0.019
 names-0.018
 lived-0.018
sed-0.017
ses-0.017
TokenO
Token ID46
Feature activation+1.80e-02

Pos logit contributions
ge+0.064
rier+0.054
is+0.052
rying+0.052
ters+0.052
Neg logit contributions
 trips-0.044
 names-0.044
ses-0.044
sed-0.043
 marsh-0.042
Tokenw
Token ID86
Feature activation+5.47e-02

Pos logit contributions
ses+0.019
 trips+0.019
sed+0.019
 names+0.019
 imagination+0.017
Neg logit contributions
ge-0.042
ters-0.039
is-0.037
rier-0.037
lling-0.036
Token,
Token ID11
Feature activation+2.86e-02

Pos logit contributions
 names+0.091
ses+0.091
 trips+0.091
sed+0.088
 imagination+0.087
Neg logit contributions
ge-0.095
ters-0.085
is-0.081
rier-0.078
ising-0.076
Token ow
Token ID12334
Feature activation+8.32e-02

Pos logit contributions
 trips+0.023
 names+0.023
ses+0.023
sed+0.022
 imagination+0.021
Neg logit contributions
ge-0.072
ters-0.069
is-0.066
rier-0.065
ter-0.064
Token,
Token ID11
Feature activation+2.08e-02

Pos logit contributions
 trips+0.120
ses+0.120
 names+0.118
sed+0.117
 imagination+0.113
Neg logit contributions
ge-0.163
ters-0.146
is-0.140
rier-0.139
lling-0.135
Token ow
Token ID12334
Feature activation+7.65e-02

Pos logit contributions
ses+0.016
 trips+0.016
 names+0.015
sed+0.015
 imagination+0.014
Neg logit contributions
ge-0.055
ters-0.051
is-0.050
rier-0.049
lling-0.048
Token!
Token ID0
Feature activation+2.12e-02

Pos logit contributions
 trips+0.135
ses+0.132
sed+0.130
 names+0.129
 spears+0.126
Neg logit contributions
ge-0.127
ters-0.109
rier-0.104
is-0.103
line-0.101
Token It
Token ID632
Feature activation+2.85e-02

Pos logit contributions
ses+0.025
 names+0.025
sed+0.024
 trips+0.024
 imagination+0.023
Neg logit contributions
ge-0.048
ters-0.043
is-0.042
rier-0.041
ising-0.041
Token hurts
Token ID20406
Feature activation+1.50e-03

Pos logit contributions
ses+0.031
 trips+0.031
 names+0.031
sed+0.030
 imagination+0.029
Neg logit contributions
ge-0.066
ters-0.060
is-0.059
rier-0.057
ising-0.056
Token fun
Token ID1257
Feature activation-2.59e-02

Pos logit contributions
ses+0.004
sed+0.004
 names+0.004
 spears+0.004
 trips+0.004
Neg logit contributions
ge-0.006
is-0.006
ising-0.005
ters-0.005
rier-0.005
Token dream
Token ID4320
Feature activation-1.88e-02

Pos logit contributions
ge+0.056
is+0.050
ters+0.048
ising+0.046
Sus+0.046
Neg logit contributions
ses-0.033
sed-0.032
 names-0.031
 spears-0.030
 trips-0.030
Token.
Token ID13
Feature activation-1.00e-02

Pos logit contributions
ge+0.035
ters+0.031
rier+0.030
lling+0.029
chie+0.029
Neg logit contributions
 trips-0.030
 names-0.029
 stories-0.027
tr-0.027
 imagination-0.027
Token She
Token ID1375
Feature activation+4.22e-03

Pos logit contributions
ge+0.012
line+0.011
ters+0.010
is+0.010
Sus+0.010
Neg logit contributions
ses-0.020
 spears-0.020
sed-0.020
 trips-0.020
ats-0.020
Token told
Token ID1297
Feature activation+7.75e-03

Pos logit contributions
ses+0.006
 names+0.006
 trips+0.006
sed+0.006
 imagination+0.006
Neg logit contributions
ge-0.008
ters-0.007
is-0.007
rier-0.007
lling-0.007
Token her
Token ID607
Feature activation+4.59e-02

Pos logit contributions
ses+0.017
sed+0.016
 names+0.016
 trips+0.016
tr+0.016
Neg logit contributions
ge-0.009
is-0.008
ters-0.008
rier-0.008
lling-0.007
Token mom
Token ID1995
Feature activation+3.00e-02

Pos logit contributions
ses+0.085
sed+0.080
 imag+0.072
ats+0.072
 habitats+0.071
Neg logit contributions
ge-0.065
is-0.064
Sus-0.062
ter-0.056
hr-0.055
Tokenmy
Token ID1820
Feature activation+3.75e-02

Pos logit contributions
ses+0.058
 trips+0.057
sed+0.057
 names+0.057
 imagination+0.055
Neg logit contributions
ge-0.045
ters-0.039
is-0.037
rier-0.036
lling-0.034
Token all
Token ID477
Feature activation+6.35e-03

Pos logit contributions
ses+0.058
sed+0.057
 trips+0.057
 names+0.056
 imag+0.054
Neg logit contributions
ge-0.070
ters-0.063
is-0.061
rier-0.058
line-0.057
Token about
Token ID546
Feature activation-1.19e-03

Pos logit contributions
ses+0.006
 trips+0.006
 names+0.006
sed+0.006
 imagination+0.005
Neg logit contributions
ge-0.016
ters-0.015
is-0.014
rier-0.014
lling-0.014
Token it
Token ID340
Feature activation-4.68e-04

Pos logit contributions
ge+0.002
is+0.001
ters+0.001
ter+0.001
lling+0.001
Neg logit contributions
ses-0.002
sed-0.002
 names-0.002
 trips-0.002
tr-0.002
Token and
Token ID290
Feature activation-8.42e-03

Pos logit contributions
ge+0.118
ters+0.111
is+0.110
rier+0.106
chie+0.105
Neg logit contributions
 trips-0.085
 names-0.080
 imagination-0.078
 favorite-0.074
 imag-0.073
Token asked
Token ID1965
Feature activation-3.16e-03

Pos logit contributions
ge+0.017
is+0.016
ters+0.016
rier+0.015
ising+0.014
Neg logit contributions
 trips-0.012
 lived-0.012
 names-0.011
 imagination-0.011
sed-0.011
Token her
Token ID607
Feature activation+2.85e-02

Pos logit contributions
ge+0.005
ters+0.004
is+0.004
rier+0.004
line+0.004
Neg logit contributions
ses-0.006
sed-0.006
 trips-0.006
 names-0.006
 spears-0.006
Token friends
Token ID2460
Feature activation+1.74e-02

Pos logit contributions
ses+0.043
 trips+0.041
sed+0.041
 spears+0.040
 imag+0.039
Neg logit contributions
ge-0.053
ters-0.049
is-0.049
rier-0.047
ter-0.046
Token.
Token ID13
Feature activation-4.24e-04

Pos logit contributions
 names+0.024
 trips+0.023
ses+0.022
 imagination+0.022
sed+0.022
Neg logit contributions
ge-0.035
ters-0.032
is-0.031
rier-0.031
ising-0.030
Token They
Token ID1119
Feature activation-1.41e-03

Pos logit contributions
ge+0.001
ters+0.001
is+0.001
rier+0.001
ising+0.001
Neg logit contributions
ses-0.000
 trips-0.000
 names-0.000
sed-0.000
 imagination-0.000
Token said
Token ID531
Feature activation+5.37e-03

Pos logit contributions
ge+0.003
ters+0.002
rier+0.002
is+0.002
ising+0.002
Neg logit contributions
 names-0.002
 trips-0.002
sed-0.002
ses-0.002
 lived-0.002
Token that
Token ID326
Feature activation+1.76e-02

Pos logit contributions
ses+0.008
 trips+0.008
 names+0.008
sed+0.008
 imagination+0.008
Neg logit contributions
ge-0.010
ters-0.009
is-0.009
rier-0.009
ising-0.008
Token someone
Token ID2130
Feature activation-4.50e-03

Pos logit contributions
 names+0.028
 trips+0.028
ses+0.028
sed+0.027
 imagination+0.027
Neg logit contributions
ge-0.032
ters-0.028
is-0.027
rier-0.027
ising-0.026
Token had
Token ID550
Feature activation-1.12e-02

Pos logit contributions
ge+0.009
ters+0.009
is+0.008
lling+0.008
ising+0.008
Neg logit contributions
ses-0.006
 names-0.006
sed-0.006
 trips-0.006
 imagination-0.005
Token taken
Token ID2077
Feature activation-3.30e-02

Pos logit contributions
ge+0.022
ters+0.020
is+0.019
rier+0.019
ising+0.018
Neg logit contributions
 names-0.016
 trips-0.016
ses-0.016
sed-0.016
 imagination-0.016
Token.
Token ID13
Feature activation+7.70e-03

Pos logit contributions
ge+0.049
ters+0.047
chie+0.045
rier+0.044
lling+0.044
Neg logit contributions
 trips-0.037
sed-0.036
 tum-0.035
ses-0.035
 imagination-0.034
Token The
Token ID383
Feature activation+1.70e-02

Pos logit contributions
ses+0.010
 trips+0.010
 names+0.010
sed+0.010
 imagination+0.009
Neg logit contributions
ge-0.016
ters-0.015
is-0.014
rier-0.014
line-0.014
Token dentist
Token ID38408
Feature activation-1.87e-02

Pos logit contributions
ses+0.023
 trips+0.023
 names+0.023
sed+0.022
 imagination+0.021
Neg logit contributions
ge-0.034
ters-0.031
is-0.030
rier-0.029
ising-0.029
Token smiled
Token ID13541
Feature activation-2.18e-02

Pos logit contributions
ge+0.035
ters+0.031
is+0.030
rier+0.029
lling+0.028
Neg logit contributions
ses-0.029
 names-0.028
 trips-0.028
sed-0.028
 imagination-0.027
Token and
Token ID290
Feature activation+2.27e-02

Pos logit contributions
ge+0.041
is+0.035
ters+0.034
chie+0.033
ner+0.033
Neg logit contributions
 names-0.036
 trips-0.035
 favorite-0.035
ses-0.034
 tum-0.034
Token said
Token ID531
Feature activation+1.03e-02

Pos logit contributions
 names+0.042
 trips+0.042
ses+0.042
sed+0.041
 imagination+0.040
Neg logit contributions
ge-0.034
ters-0.030
is-0.028
rier-0.027
lling-0.027
Token,
Token ID11
Feature activation+3.04e-02

Pos logit contributions
ses+0.016
 trips+0.015
 names+0.015
sed+0.015
 imagination+0.014
Neg logit contributions
ge-0.019
ters-0.018
is-0.017
rier-0.017
line-0.016
Token "
Token ID366
Feature activation+1.72e-03

Pos logit contributions
 names+0.021
sed+0.021
ses+0.020
 trips+0.019
 imagination+0.019
Neg logit contributions
ge-0.083
ters-0.076
lling-0.074
ising-0.074
is-0.073
TokenHello
Token ID15496
Feature activation+3.56e-02

Pos logit contributions
 names+0.002
tr+0.002
 lived+0.002
 imagination+0.002
haw+0.002
Neg logit contributions
ge-0.004
rier-0.003
ising-0.003
ters-0.003
rily-0.003
Token,
Token ID11
Feature activation+3.13e-02

Pos logit contributions
ses+0.051
 trips+0.050
 names+0.050
sed+0.049
 imagination+0.048
Neg logit contributions
ge-0.071
ters-0.064
is-0.061
rier-0.060
ising-0.058
Token Sara
Token ID24799
Feature activation+1.76e-02

Pos logit contributions
 names+0.044
 imagination+0.043
 trips+0.042
 books+0.042
 stories+0.041
Neg logit contributions
ge-0.066
ters-0.058
lling-0.056
ising-0.056
mine-0.054
Token take
Token ID1011
Feature activation-1.49e-02

Pos logit contributions
 names+0.011
 trips+0.011
 imagination+0.010
 tum+0.010
ses+0.009
Neg logit contributions
ge-0.016
rier-0.014
ters-0.014
lling-0.014
is-0.013
Token a
Token ID257
Feature activation+2.48e-02

Pos logit contributions
ge+0.023
ters+0.021
is+0.021
ising+0.019
ner+0.019
Neg logit contributions
ses-0.028
sed-0.028
tr-0.026
 barrel-0.025
 spears-0.025
Token shower
Token ID14643
Feature activation-1.64e-04

Pos logit contributions
ses+0.041
sed+0.040
 spears+0.039
 names+0.038
 trips+0.038
Neg logit contributions
ge-0.045
is-0.040
ters-0.039
rier-0.036
ising-0.035
Token.
Token ID13
Feature activation+1.11e-02

Pos logit contributions
ge+0.000
lling+0.000
is+0.000
ters+0.000
chie+0.000
Neg logit contributions
 trips-0.000
 names-0.000
 tum-0.000
 favorite-0.000
tr-0.000
Token I
Token ID314
Feature activation-1.12e-02

Pos logit contributions
 trips+0.018
ses+0.017
sed+0.017
 imag+0.017
 spears+0.017
Neg logit contributions
ters-0.018
ge-0.018
rier-0.016
is-0.016
line-0.016
Token want
Token ID765
Feature activation+8.73e-03

Pos logit contributions
ge+0.022
ters+0.021
rier+0.020
is+0.020
lling+0.019
Neg logit contributions
 names-0.016
 trips-0.016
ses-0.015
 imagination-0.015
sed-0.015
Token to
Token ID284
Feature activation+1.28e-02

Pos logit contributions
ses+0.015
sed+0.015
 names+0.014
 trips+0.013
 spears+0.013
Neg logit contributions
ge-0.015
ters-0.014
is-0.013
Sus-0.012
ner-0.012
Token play
Token ID711
Feature activation+2.54e-02

Pos logit contributions
 names+0.018
 trips+0.017
 imagination+0.017
 tum+0.017
 marsh+0.016
Neg logit contributions
ge-0.028
rier-0.025
ters-0.024
lling-0.024
rily-0.023
Token with
Token ID351
Feature activation-8.23e-06

Pos logit contributions
 trips+0.047
 names+0.046
 imagination+0.044
ses+0.044
sed+0.043
Neg logit contributions
ge-0.042
ters-0.035
is-0.033
rier-0.033
lling-0.032
Token my
Token ID616
Feature activation+1.21e-02

Feature not active on prev token;
no predictions were affected.
Token toys
Token ID14958
Feature activation-1.26e-02

Pos logit contributions
ses+0.018
 spears+0.017
 trips+0.017
sed+0.017
 names+0.017
Neg logit contributions
ge-0.022
ters-0.021
is-0.020
ising-0.019
rier-0.019
Token the
Token ID262
Feature activation-1.74e-02

Pos logit contributions
ses+0.020
sed+0.019
 trips+0.019
 names+0.019
 imag+0.018
Neg logit contributions
ge-0.028
ters-0.026
is-0.025
rier-0.024
lling-0.024
Token bad
Token ID2089
Feature activation-1.95e-02

Pos logit contributions
ge+0.037
ters+0.033
is+0.031
rier+0.030
rily+0.030
Neg logit contributions
 names-0.023
 trips-0.023
ses-0.023
sed-0.022
 imagination-0.022
Token man
Token ID582
Feature activation-3.58e-02

Pos logit contributions
ge+0.038
ters+0.035
is+0.032
rily+0.031
rier+0.031
Neg logit contributions
 trips-0.030
 names-0.029
ses-0.028
sed-0.028
 imagination-0.028
Token with
Token ID351
Feature activation-1.47e-02

Pos logit contributions
ters+0.065
rier+0.064
ge+0.063
is+0.063
chie+0.061
Neg logit contributions
 trips-0.054
 lived-0.049
 names-0.047
 tum-0.046
 stories-0.045
Token his
Token ID465
Feature activation+7.81e-03

Pos logit contributions
ge+0.029
ters+0.026
is+0.025
rier+0.024
lling+0.024
Neg logit contributions
ses-0.020
 names-0.020
sed-0.020
 trips-0.020
 imag-0.019
Token lawyer
Token ID6853
Feature activation+1.76e-02

Pos logit contributions
 trips+0.010
ses+0.010
 names+0.010
 lived+0.010
 spears+0.010
Neg logit contributions
ge-0.016
ters-0.013
is-0.013
rier-0.013
ising-0.013
Token.
Token ID13
Feature activation-1.27e-02

Pos logit contributions
ses+0.029
 trips+0.029
 names+0.029
sed+0.029
 imagination+0.028
Neg logit contributions
ge-0.031
ters-0.027
is-0.026
rier-0.025
lling-0.024
Token As
Token ID1081
Feature activation-9.43e-03

Pos logit contributions
ge+0.026
ters+0.023
is+0.023
rier+0.022
ising+0.022
Neg logit contributions
ses-0.017
 trips-0.017
 names-0.017
sed-0.017
 imagination-0.016
Token the
Token ID262
Feature activation-1.35e-02

Pos logit contributions
ge+0.019
ters+0.018
chie+0.017
rier+0.017
is+0.017
Neg logit contributions
 trips-0.013
 imagination-0.012
 stories-0.012
 marsh-0.011
 names-0.011
Token judge
Token ID5052
Feature activation-1.96e-02

Pos logit contributions
ge+0.027
ters+0.025
is+0.024
rier+0.023
lling+0.023
Neg logit contributions
ses-0.019
 trips-0.019
 names-0.019
sed-0.018
 imagination-0.018
Token read
Token ID1100
Feature activation+1.43e-03

Pos logit contributions
ge+0.041
ters+0.037
rier+0.037
is+0.037
line+0.035
Neg logit contributions
 trips-0.026
 names-0.025
 lived-0.025
sed-0.024
ses-0.023
Token.
Token ID13
Feature activation+9.62e-03

Pos logit contributions
ge+0.027
ters+0.024
is+0.024
Sus+0.023
chie+0.023
Neg logit contributions
 trips-0.021
 names-0.021
 imagination-0.020
 lived-0.020
 tum-0.020
Token
Token ID198
Feature activation-9.45e-03

Pos logit contributions
ses+0.011
 names+0.011
 trips+0.011
sed+0.011
 imagination+0.011
Neg logit contributions
ge-0.021
ters-0.019
is-0.019
rier-0.018
lling-0.018
Token
Token ID198
Feature activation-1.55e-02

Pos logit contributions
ge+0.020
is+0.018
ters+0.017
Sus+0.017
lling+0.016
Neg logit contributions
ses-0.013
sed-0.013
 names-0.012
 trips-0.012
 imag-0.012
TokenThe
Token ID464
Feature activation+6.85e-03

Pos logit contributions
ge+0.026
ters+0.023
rier+0.022
is+0.021
ising+0.020
Neg logit contributions
 trips-0.027
 names-0.027
ses-0.026
sed-0.025
 imagination-0.025
Token boy
Token ID2933
Feature activation-3.38e-02

Pos logit contributions
 names+0.011
 trips+0.011
ses+0.011
sed+0.011
 imagination+0.011
Neg logit contributions
ge-0.012
ters-0.011
is-0.010
rier-0.010
line-0.010
Token promised
Token ID8072
Feature activation-3.34e-02

Pos logit contributions
ge+0.072
ters+0.066
rier+0.064
is+0.062
line+0.060
Neg logit contributions
 trips-0.046
 lived-0.044
sed-0.043
 names-0.043
ses-0.042
Token to
Token ID284
Feature activation+2.42e-02

Pos logit contributions
ge+0.077
ters+0.073
lling+0.070
ivering+0.069
rier+0.068
Neg logit contributions
 trips-0.035
 names-0.034
 Rex-0.034
zed-0.032
 imagination-0.031
Token come
Token ID1282
Feature activation+5.78e-03

Pos logit contributions
 names+0.031
 imagination+0.029
 lived+0.029
 stories+0.028
 trips+0.028
Neg logit contributions
ge-0.055
ters-0.050
rier-0.048
lling-0.047
rily-0.046
Token back
Token ID736
Feature activation+2.45e-02

Pos logit contributions
 names+0.007
 trips+0.006
 favorite+0.006
 imagination+0.006
 spears+0.005
Neg logit contributions
ge-0.013
is-0.012
ters-0.012
lling-0.012
ner-0.012
Token soon
Token ID2582
Feature activation-2.10e-02

Pos logit contributions
 trips+0.044
 names+0.043
 imagination+0.041
ses+0.041
sed+0.040
Neg logit contributions
ge-0.040
ters-0.036
is-0.035
rier-0.034
lling-0.033
Token,
Token ID11
Feature activation-3.92e-03

Pos logit contributions
ge+0.041
ters+0.039
Sus+0.038
is+0.038
chie+0.037
Neg logit contributions
 trips-0.031
 names-0.028
 imagination-0.027
 stories-0.027
 tum-0.027
Token He
Token ID679
Feature activation+4.08e-03

Pos logit contributions
ge+0.026
ters+0.024
rier+0.022
is+0.022
line+0.021
Neg logit contributions
ses-0.021
 trips-0.021
 names-0.020
sed-0.020
 imagination-0.020
Token threw
Token ID9617
Feature activation+1.04e-03

Pos logit contributions
ses+0.007
 names+0.007
 trips+0.007
sed+0.007
 imagination+0.006
Neg logit contributions
ge-0.007
ters-0.006
is-0.006
rier-0.006
lling-0.006
Token it
Token ID340
Feature activation+3.59e-02

Pos logit contributions
ses+0.002
sed+0.002
 names+0.002
 trips+0.002
 imagination+0.002
Neg logit contributions
ge-0.002
ters-0.002
is-0.001
rier-0.001
lling-0.001
Token as
Token ID355
Feature activation-2.91e-02

Pos logit contributions
ses+0.067
 names+0.066
sed+0.065
 trips+0.064
 imagination+0.063
Neg logit contributions
ge-0.055
ters-0.049
is-0.045
rier-0.044
ising-0.043
Token far
Token ID1290
Feature activation+7.60e-03

Pos logit contributions
ge+0.058
ters+0.052
is+0.050
rier+0.050
ter+0.048
Neg logit contributions
 names-0.041
ses-0.041
sed-0.041
 trips-0.040
 imag-0.040
Token as
Token ID355
Feature activation-2.66e-02

Pos logit contributions
 trips+0.011
ses+0.010
 spears+0.010
 Rex+0.010
 imagination+0.010
Neg logit contributions
ge-0.014
Sus-0.014
chie-0.013
ters-0.013
hr-0.013
Token he
Token ID339
Feature activation-2.11e-02

Pos logit contributions
ge+0.045
ters+0.039
is+0.038
rier+0.037
lling+0.035
Neg logit contributions
ses-0.046
 names-0.046
sed-0.045
 trips-0.045
 imagination-0.043
Token could
Token ID714
Feature activation-4.60e-02

Pos logit contributions
ge+0.047
ters+0.043
is+0.042
rier+0.040
ising+0.040
Neg logit contributions
ses-0.025
 names-0.024
 trips-0.024
sed-0.023
 imagination-0.023
Token.
Token ID13
Feature activation-8.23e-03

Pos logit contributions
ge+0.087
ters+0.082
is+0.078
rier+0.077
lling+0.076
Neg logit contributions
sed-0.068
 trips-0.068
ses-0.068
 names-0.065
 tum-0.065
Token The
Token ID383
Feature activation+3.25e-03

Pos logit contributions
ge+0.015
ters+0.014
is+0.013
rier+0.013
ner+0.013
Neg logit contributions
ses-0.012
sed-0.012
 trips-0.012
 names-0.011
 imag-0.011
Token bird
Token ID6512
Feature activation+4.05e-03

Pos logit contributions
ses+0.005
 trips+0.005
 names+0.005
sed+0.005
 imagination+0.005
Neg logit contributions
ge-0.006
ters-0.005
is-0.005
rier-0.005
lling-0.005
Token find
Token ID1064
Feature activation-1.45e-02

Pos logit contributions
ge+0.048
ters+0.045
is+0.045
rier+0.045
lling+0.042
Neg logit contributions
 trips-0.034
 names-0.034
 imagination-0.032
 lived-0.031
 tum-0.030
Token him
Token ID683
Feature activation+1.79e-02

Pos logit contributions
ge+0.021
ters+0.020
ter+0.019
rier+0.018
 longer+0.016
Neg logit contributions
ses-0.027
sed-0.027
tr-0.027
 habitats-0.026
 meanings-0.026
Token.
Token ID13
Feature activation-2.52e-03

Pos logit contributions
 trips+0.031
 names+0.031
ses+0.030
sed+0.030
 imagination+0.029
Neg logit contributions
ge-0.029
ters-0.026
is-0.025
rier-0.024
ising-0.023
Token He
Token ID679
Feature activation+5.52e-03

Pos logit contributions
ge+0.005
ters+0.005
is+0.004
rier+0.004
line+0.004
Neg logit contributions
ses-0.003
 trips-0.003
 names-0.003
sed-0.003
 imag-0.003
Token was
Token ID373
Feature activation+1.05e-02

Pos logit contributions
ses+0.008
 names+0.008
 trips+0.008
sed+0.008
 imagination+0.008
Neg logit contributions
ge-0.011
ters-0.010
is-0.009
rier-0.009
lling-0.009
Token all
Token ID477
Feature activation-1.74e-02

Pos logit contributions
 trips+0.016
ses+0.016
sed+0.016
 names+0.015
 imagination+0.015
Neg logit contributions
ge-0.020
ters-0.018
is-0.018
rier-0.017
ising-0.017
Token alone
Token ID3436
Feature activation-1.89e-02

Pos logit contributions
ge+0.030
ters+0.028
is+0.028
rier+0.026
ising+0.026
Neg logit contributions
 trips-0.029
 imagination-0.028
ses-0.028
 lived-0.027
sed-0.027
Token.
Token ID13
Feature activation-8.46e-04

Pos logit contributions
ge+0.035
ters+0.032
is+0.031
line+0.030
rier+0.030
Neg logit contributions
 trips-0.030
sed-0.027
ses-0.026
 imagination-0.026
 lived-0.026
Token
Token ID220
Feature activation+3.32e-02

Pos logit contributions
ge+0.001
ters+0.001
is+0.001
line+0.001
rier+0.001
Neg logit contributions
ses-0.001
 trips-0.001
sed-0.001
 names-0.001
 spears-0.001
Token
Token ID198
Feature activation-1.93e-02

Pos logit contributions
ses+0.043
 trips+0.043
 names+0.043
sed+0.042
 imagination+0.041
Neg logit contributions
ge-0.070
ters-0.063
is-0.061
rier-0.059
ising-0.058
Token
Token ID198
Feature activation-2.54e-02

Pos logit contributions
ge+0.040
ters+0.036
is+0.035
rier+0.034
lling+0.033
Neg logit contributions
ses-0.026
 names-0.025
 trips-0.025
sed-0.025
 imagination-0.024
Token,
Token ID11
Feature activation+1.53e-02

Pos logit contributions
ses+0.004
sed+0.004
 names+0.004
 trips+0.004
 spears+0.004
Neg logit contributions
ge-0.004
ters-0.004
is-0.004
ner-0.003
ter-0.003
Token her
Token ID607
Feature activation-1.85e-03

Pos logit contributions
 names+0.025
 trips+0.025
ses+0.025
sed+0.025
 imagination+0.024
Neg logit contributions
ge-0.027
ters-0.024
is-0.023
rier-0.022
lling-0.022
Token aunt
Token ID25949
Feature activation+2.59e-02

Pos logit contributions
ge+0.003
is+0.003
ters+0.003
broken+0.003
ter+0.003
Neg logit contributions
 spears-0.002
 trips-0.002
icks-0.002
 barrel-0.002
 imag-0.002
Token came
Token ID1625
Feature activation-1.66e-02

Pos logit contributions
 trips+0.035
 names+0.035
 lived+0.034
sed+0.033
ses+0.033
Neg logit contributions
ge-0.054
rier-0.048
ters-0.048
is-0.046
ising-0.046
Token to
Token ID284
Feature activation-9.76e-03

Pos logit contributions
ge+0.037
rier+0.035
is+0.034
ters+0.034
line+0.034
Neg logit contributions
 names-0.020
 trips-0.019
ses-0.017
 favorite-0.017
sed-0.017
Token her
Token ID607
Feature activation-1.92e-02

Pos logit contributions
ge+0.020
ters+0.019
rier+0.018
lling+0.018
is+0.018
Neg logit contributions
 names-0.014
 trips-0.013
 imagination-0.013
ses-0.013
 lived-0.012
Token room
Token ID2119
Feature activation-9.45e-03

Pos logit contributions
ge+0.036
ters+0.033
is+0.031
rier+0.031
lling+0.030
Neg logit contributions
 trips-0.029
 names-0.029
ses-0.029
sed-0.028
 imagination-0.028
Token.
Token ID13
Feature activation-7.54e-03

Pos logit contributions
ge+0.018
ters+0.017
chie+0.017
lling+0.016
is+0.016
Neg logit contributions
 trips-0.014
 tum-0.014
 names-0.013
 favorite-0.013
 stories-0.013
Token She
Token ID1375
Feature activation+1.21e-02

Pos logit contributions
ge+0.010
ters+0.009
rier+0.009
line+0.009
Sus+0.008
Neg logit contributions
 trips-0.015
 spears-0.014
 imag-0.014
ses-0.014
sed-0.014
Token saw
Token ID2497
Feature activation+2.70e-02

Pos logit contributions
ses+0.019
 names+0.018
 trips+0.018
sed+0.018
 spears+0.018
Neg logit contributions
ge-0.022
ters-0.019
is-0.019
lling-0.018
ising-0.018
Token Lily
Token ID20037
Feature activation-2.40e-02

Pos logit contributions
ses+0.051
 trips+0.051
sed+0.051
 names+0.050
 imag+0.049
Neg logit contributions
ge-0.041
ters-0.036
is-0.034
rier-0.032
ter-0.032
Token beautiful
Token ID4950
Feature activation+1.76e-03

Pos logit contributions
ge+0.023
ters+0.021
is+0.020
rier+0.019
ising+0.019
Neg logit contributions
 trips-0.018
 names-0.018
ses-0.017
sed-0.017
 imagination-0.017
Token and
Token ID290
Feature activation-7.76e-03

Pos logit contributions
 trips+0.003
 names+0.003
 favorite+0.003
 imagination+0.003
 stories+0.003
Neg logit contributions
ge-0.003
is-0.003
chie-0.003
ters-0.003
Sus-0.002
Token everyone
Token ID2506
Feature activation-3.35e-02

Pos logit contributions
ge+0.016
ters+0.015
is+0.014
rier+0.014
ising+0.013
Neg logit contributions
 names-0.010
 trips-0.010
ses-0.010
sed-0.010
 imagination-0.010
Token was
Token ID373
Feature activation-6.43e-05

Pos logit contributions
ge+0.080
ters+0.073
rier+0.072
is+0.071
ising+0.068
Neg logit contributions
 lived-0.036
 trips-0.035
 names-0.033
 imagination-0.032
 imag-0.032
Token happy
Token ID3772
Feature activation-3.94e-02

Pos logit contributions
ge+0.000
ters+0.000
is+0.000
ising+0.000
izing+0.000
Neg logit contributions
 favorite-0.000
 lived-0.000
 tum-0.000
 imagination-0.000
 trips-0.000
Token with
Token ID351
Feature activation-1.08e-02

Pos logit contributions
ge+0.074
ters+0.063
is+0.063
chie+0.060
rier+0.060
Neg logit contributions
 names-0.063
 trips-0.063
 imagination-0.061
ses-0.060
sed-0.059
Token the
Token ID262
Feature activation+1.63e-03

Pos logit contributions
ge+0.018
ters+0.016
is+0.015
rier+0.015
lling+0.015
Neg logit contributions
ses-0.019
sed-0.019
 trips-0.019
tr-0.018
 names-0.018
Token fair
Token ID3148
Feature activation-2.24e-02

Pos logit contributions
 names+0.002
 trips+0.002
 imagination+0.002
ses+0.002
sed+0.002
Neg logit contributions
ge-0.004
ters-0.003
rier-0.003
rily-0.003
lling-0.003
Token way
Token ID835
Feature activation-1.31e-02

Pos logit contributions
ge+0.045
ter+0.041
ters+0.040
lling+0.040
rily+0.040
Neg logit contributions
 names-0.032
 stories-0.031
 trips-0.030
 books-0.030
 imagination-0.029
Token things
Token ID1243
Feature activation-3.20e-02

Pos logit contributions
ge+0.028
is+0.026
ters+0.026
ising+0.025
lling+0.025
Neg logit contributions
 trips-0.015
 favorite-0.015
 names-0.015
 imagination-0.014
 tum-0.014
Token were
Token ID547
Feature activation-3.52e-02

Pos logit contributions
ge+0.058
ters+0.052
is+0.051
rier+0.049
ising+0.047
Neg logit contributions
 trips-0.052
 names-0.051
 imagination-0.049
 lived-0.048
ses-0.048
Token something
Token ID1223
Feature activation-1.47e-02

Pos logit contributions
 trips+0.006
ses+0.006
 names+0.006
sed+0.006
 imagination+0.006
Neg logit contributions
ge-0.009
ters-0.008
is-0.007
rier-0.007
lling-0.007
Token new
Token ID649
Feature activation-2.89e-02

Pos logit contributions
ge+0.029
ters+0.026
is+0.025
rier+0.025
ising+0.024
Neg logit contributions
 trips-0.021
 names-0.021
ses-0.021
sed-0.020
 imagination-0.019
Token-
Token ID12
Feature activation-3.77e-02

Pos logit contributions
ge+0.042
ters+0.036
is+0.034
rier+0.032
ising+0.032
Neg logit contributions
ses-0.057
 names-0.056
sed-0.056
 trips-0.056
 imagination-0.054
Token there
Token ID612
Feature activation-6.55e-02

Pos logit contributions
ge+0.066
ters+0.059
rier+0.056
is+0.054
ising+0.054
Neg logit contributions
 names-0.059
 trips-0.059
ses-0.056
 lived-0.056
 spears-0.055
Token were
Token ID547
Feature activation-3.33e-02

Pos logit contributions
ge+0.123
ters+0.109
is+0.106
rier+0.105
ising+0.100
Neg logit contributions
 trips-0.100
 names-0.100
ses-0.098
sed-0.096
 lived-0.095
Token tad
Token ID36264
Feature activation-8.19e-02

Pos logit contributions
ge+0.060
ters+0.053
is+0.051
rier+0.050
ising+0.048
Neg logit contributions
ses-0.054
 trips-0.054
 names-0.053
sed-0.053
 imagination-0.051
Tokenp
Token ID79
Feature activation+2.24e-02

Pos logit contributions
ge+0.148
Sus+0.120
ner+0.119
mine+0.118
line+0.117
Neg logit contributions
 spears-0.131
 names-0.128
icks-0.123
 trips-0.122
ses-0.120
Tokenoles
Token ID4316
Feature activation-1.15e-02

Pos logit contributions
 names+0.038
 trips+0.038
sed+0.037
ses+0.036
 spears+0.035
Neg logit contributions
ge-0.038
ters-0.033
rier-0.031
is-0.031
ner-0.030
Token swimming
Token ID14899
Feature activation-4.02e-03

Pos logit contributions
ge+0.021
rier+0.018
ters+0.018
Sus+0.018
is+0.018
Neg logit contributions
 trips-0.019
 names-0.017
 spears-0.017
sed-0.017
ses-0.017
Token around
Token ID1088
Feature activation-2.19e-03

Pos logit contributions
ge+0.007
is+0.006
ters+0.006
rier+0.005
ner+0.005
Neg logit contributions
 names-0.007
 trips-0.007
ses-0.007
sed-0.007
 spears-0.007
Token in
Token ID287
Feature activation-6.46e-03

Pos logit contributions
ge+0.004
is+0.004
ters+0.003
rier+0.003
ising+0.003
Neg logit contributions
 trips-0.003
 names-0.003
 lived-0.003
 spears-0.003
 imagination-0.003
Token.
Token ID13
Feature activation-2.25e-04

Pos logit contributions
ge+0.017
ters+0.015
is+0.015
rier+0.014
chie+0.014
Neg logit contributions
 trips-0.015
 names-0.015
 imagination-0.015
sed-0.014
ses-0.014
Token He
Token ID679
Feature activation+4.07e-03

Pos logit contributions
ge+0.000
ters+0.000
is+0.000
line+0.000
rier+0.000
Neg logit contributions
ses-0.000
 trips-0.000
sed-0.000
 names-0.000
 imag-0.000
Token wondered
Token ID14028
Feature activation-1.55e-02

Pos logit contributions
ses+0.006
 names+0.006
 trips+0.006
sed+0.006
 imagination+0.006
Neg logit contributions
ge-0.007
ters-0.007
is-0.006
rier-0.006
lling-0.006
Token what
Token ID644
Feature activation-9.36e-03

Pos logit contributions
ge+0.033
ters+0.029
is+0.028
rier+0.028
ising+0.027
Neg logit contributions
ses-0.021
sed-0.020
 trips-0.020
 names-0.020
 imagination-0.019
Token was
Token ID373
Feature activation-4.71e-02

Pos logit contributions
ge+0.012
is+0.010
ters+0.010
lling+0.009
line+0.009
Neg logit contributions
ses-0.021
sed-0.020
tr-0.019
icks-0.018
 habitats-0.017
Token going
Token ID1016
Feature activation-7.93e-02

Pos logit contributions
ge+0.084
ters+0.074
is+0.071
rier+0.069
ising+0.067
Neg logit contributions
 trips-0.077
ses-0.077
 names-0.077
sed-0.075
 imagination-0.073
Token to
Token ID284
Feature activation-4.32e-02

Pos logit contributions
ge+0.174
is+0.156
ters+0.152
rier+0.149
rily+0.147
Neg logit contributions
 names-0.100
 trips-0.100
 spears-0.092
 imagination-0.090
 barrel-0.089
Token happen
Token ID1645
Feature activation-4.60e-02

Pos logit contributions
ge+0.096
ters+0.090
rier+0.087
is+0.087
chie+0.082
Neg logit contributions
 names-0.055
 trips-0.055
 lived-0.050
 imagination-0.050
 stories-0.048
Token.
Token ID13
Feature activation-8.89e-04

Pos logit contributions
ge+0.084
ters+0.074
is+0.072
rier+0.069
lling+0.068
Neg logit contributions
 trips-0.074
 names-0.073
 imagination-0.070
 spears-0.068
 favorite-0.067
Token
Token ID198
Feature activation-1.89e-02

Pos logit contributions
ge+0.002
ters+0.002
is+0.001
line+0.001
rier+0.001
Neg logit contributions
ses-0.001
sed-0.001
 trips-0.001
 names-0.001
 spears-0.001
Token
Token ID198
Feature activation-2.42e-02

Pos logit contributions
ge+0.039
is+0.035
ters+0.034
Sus+0.032
rier+0.031
Neg logit contributions
ses-0.027
sed-0.027
 names-0.025
 trips-0.025
 imag-0.025
Token,
Token ID11
Feature activation+1.34e-02

Pos logit contributions
 trips+0.010
 names+0.010
 imagination+0.010
 favorite+0.009
 stories+0.009
Neg logit contributions
ge-0.013
ters-0.012
is-0.011
lling-0.011
chie-0.011
Token okay
Token ID8788
Feature activation+1.22e-02

Pos logit contributions
 names+0.021
 trips+0.021
 imagination+0.021
 tum+0.021
ses+0.020
Neg logit contributions
ge-0.026
ters-0.022
is-0.022
lling-0.021
ising-0.021
Token?"
Token ID1701
Feature activation+2.07e-03

Pos logit contributions
ses+0.019
 trips+0.018
sed+0.018
 names+0.018
 imagination+0.017
Neg logit contributions
ge-0.023
ters-0.021
is-0.020
rier-0.020
ter-0.019
Token the
Token ID262
Feature activation-5.83e-03

Pos logit contributions
 trips+0.003
 names+0.003
 imagination+0.003
ses+0.003
 Rex+0.003
Neg logit contributions
ge-0.004
ters-0.004
rier-0.004
is-0.004
lling-0.004
Token zo
Token ID40565
Feature activation-1.57e-02

Pos logit contributions
ge+0.013
ters+0.011
lling+0.011
rier+0.011
rily+0.011
Neg logit contributions
 names-0.007
ses-0.007
 trips-0.007
sed-0.007
 imagination-0.007
Tokenoke
Token ID2088
Feature activation-7.02e-02

Pos logit contributions
ge+0.030
lling+0.026
rily+0.026
ters+0.026
ising+0.025
Neg logit contributions
icks-0.021
 trips-0.021
 names-0.021
ses-0.020
sed-0.020
Tokeneper
Token ID5723
Feature activation-1.05e-02

Pos logit contributions
ge+0.165
line+0.150
rily+0.150
izing+0.149
is+0.148
Neg logit contributions
 lived-0.058
 names-0.057
icks-0.054
 Rex-0.053
 teeth-0.053
Token said
Token ID531
Feature activation-2.20e-02

Pos logit contributions
ge+0.023
rier+0.022
chie+0.021
ising+0.021
izing+0.021
Neg logit contributions
 tum-0.011
 teeth-0.011
 Leo-0.010
 names-0.010
 Rex-0.010
Token.
Token ID13
Feature activation+9.74e-03

Pos logit contributions
ge+0.046
lling+0.042
is+0.042
ters+0.042
chie+0.042
Neg logit contributions
 tum-0.026
icks-0.025
 names-0.024
 teeth-0.024
 favorite-0.023
Token
Token ID198
Feature activation-1.37e-03

Pos logit contributions
 names+0.009
 trips+0.009
ses+0.009
 imagination+0.008
sed+0.008
Neg logit contributions
ge-0.024
ters-0.022
lling-0.022
ising-0.022
is-0.021
Token
Token ID198
Feature activation-6.32e-03

Pos logit contributions
ge+0.003
ters+0.003
rier+0.003
lling+0.003
is+0.003
Neg logit contributions
 trips-0.001
 names-0.001
ses-0.001
 imagination-0.001
sed-0.001
Token every
Token ID790
Feature activation+1.95e-03

Pos logit contributions
ge+0.002
ters+0.002
is+0.001
rier+0.001
chie+0.001
Neg logit contributions
 trips-0.002
 names-0.002
ses-0.002
sed-0.001
 imagination-0.001
Token day
Token ID1110
Feature activation-2.61e-02

Pos logit contributions
 trips+0.002
ses+0.002
 names+0.002
sed+0.002
 imagination+0.002
Neg logit contributions
ge-0.005
ters-0.005
is-0.004
rier-0.004
ising-0.004
Token and
Token ID290
Feature activation-4.60e-04

Pos logit contributions
ge+0.052
ters+0.048
rier+0.046
is+0.046
chie+0.045
Neg logit contributions
 trips-0.040
 imagination-0.035
 names-0.035
 tum-0.034
 favorite-0.034
Token touch
Token ID3638
Feature activation+3.76e-02

Pos logit contributions
ge+0.001
ters+0.001
is+0.001
rier+0.001
lling+0.001
Neg logit contributions
 names-0.001
 trips-0.001
ses-0.001
sed-0.001
 imagination-0.001
Token the
Token ID262
Feature activation+3.81e-03

Pos logit contributions
ses+0.068
sed+0.067
 trips+0.066
 names+0.066
 imagination+0.064
Neg logit contributions
ge-0.060
ters-0.054
is-0.051
rier-0.050
lling-0.048
Token terr
Token ID8812
Feature activation-7.25e-02

Pos logit contributions
 names+0.005
ses+0.005
 trips+0.005
sed+0.005
 imagination+0.005
Neg logit contributions
ge-0.008
ters-0.007
is-0.007
rier-0.007
line-0.006
Tokenac
Token ID330
Feature activation-1.93e-02

Pos logit contributions
ge+0.118
ters+0.103
is+0.098
rier+0.095
lling+0.092
Neg logit contributions
ses-0.130
 trips-0.130
 names-0.129
sed-0.128
 imagination-0.124
Tokenotta
Token ID12375
Feature activation-3.75e-02

Pos logit contributions
ge+0.025
ters+0.021
is+0.021
rier+0.019
lling+0.019
Neg logit contributions
ses-0.041
 trips-0.041
sed-0.041
 names-0.040
 imagination-0.040
Token pot
Token ID1787
Feature activation+1.93e-03

Pos logit contributions
ge+0.076
ters+0.066
is+0.063
rier+0.062
Sus+0.062
Neg logit contributions
 trips-0.054
 names-0.054
ses-0.051
 imagination-0.051
sed-0.050
Token.
Token ID13
Feature activation-7.82e-04

Pos logit contributions
 trips+0.003
 stories+0.003
 tum+0.003
 names+0.003
 lived+0.003
Neg logit contributions
ge-0.004
chie-0.003
Sus-0.003
ters-0.003
ner-0.003
Token Maybe
Token ID6674
Feature activation-7.36e-03

Pos logit contributions
ge+0.002
ters+0.001
is+0.001
line+0.001
Sus+0.001
Neg logit contributions
ses-0.001
sed-0.001
 trips-0.001
 names-0.001
 imag-0.001
Token a
Token ID257
Feature activation+1.02e-02

Pos logit contributions
ge+0.040
ters+0.037
is+0.035
rier+0.035
chie+0.033
Neg logit contributions
 trips-0.033
ses-0.032
 names-0.031
sed-0.031
 imagination-0.031
Token ship
Token ID4074
Feature activation+8.18e-04

Pos logit contributions
 trips+0.013
 names+0.013
 imagination+0.013
 tum+0.013
 barrel+0.012
Neg logit contributions
ge-0.021
ters-0.020
lling-0.019
rier-0.019
ising-0.018
Token that
Token ID326
Feature activation+3.53e-03

Pos logit contributions
 trips+0.001
 names+0.001
 tum+0.001
 spears+0.001
 stories+0.001
Neg logit contributions
ge-0.002
chie-0.001
Sus-0.001
ters-0.001
lling-0.001
Token sank
Token ID30895
Feature activation-4.31e-02

Pos logit contributions
ses+0.005
 names+0.005
sed+0.005
 trips+0.005
 spears+0.005
Neg logit contributions
ge-0.006
ters-0.006
ter-0.005
line-0.005
lling-0.005
Token long
Token ID890
Feature activation+4.29e-02

Pos logit contributions
ge+0.077
is+0.070
ters+0.069
ner+0.067
rier+0.066
Neg logit contributions
 names-0.069
 trips-0.068
ses-0.067
sed-0.066
 imagination-0.064
Token ago
Token ID2084
Feature activation-8.02e-02

Pos logit contributions
 trips+0.075
 names+0.071
ses+0.070
 imagination+0.070
sed+0.068
Neg logit contributions
ge-0.074
ters-0.065
is-0.065
rier-0.063
Sus-0.062
Token.
Token ID13
Feature activation+1.44e-02

Pos logit contributions
ge+0.143
is+0.126
ters+0.126
rier+0.122
chie+0.121
Neg logit contributions
 trips-0.131
 names-0.128
ses-0.127
sed-0.126
 imagination-0.122
Token She
Token ID1375
Feature activation+1.88e-02

Pos logit contributions
 imag+0.025
 trips+0.024
ses+0.024
sed+0.024
 spears+0.024
Neg logit contributions
ge-0.021
ters-0.021
line-0.020
rier-0.020
is-0.019
Token said
Token ID531
Feature activation+1.18e-02

Pos logit contributions
ses+0.030
 trips+0.030
 names+0.030
sed+0.029
 imagination+0.029
Neg logit contributions
ge-0.034
ters-0.030
is-0.029
rier-0.029
ising-0.027
Token they
Token ID484
Feature activation+1.58e-02

Pos logit contributions
ses+0.020
 trips+0.019
sed+0.019
 names+0.019
tr+0.018
Neg logit contributions
ge-0.020
ters-0.019
is-0.018
rier-0.018
line-0.017
Token could
Token ID714
Feature activation-5.40e-04

Pos logit contributions
ses+0.024
sed+0.024
 trips+0.024
 names+0.023
 spears+0.022
Neg logit contributions
ge-0.029
ters-0.026
is-0.025
lling-0.024
mine-0.024