TOP ACTIVATIONS
MAX = 6.448

sun fl owers , and tul ips . They see pine
they do it . He imag ines he is one of
see her flower . She imag ines it will be red
oral of the story : Even if life gets hard ,
some time , the ant shr ank down to the size
she should shrink . She shr ank and became so small
When he does , she spr ays him with water .
enough , some little green spr outs started appearing in the
, the flowers started to spr out . The child was
, da is ies , tul ips and more . She
also likes children . He imag ines them having fun and
pond . The two ze br as stayed together for a
ie . They both had birth days coming up , and
oral of the story : Even the simplest of things can
every year , on their birth days , they would celebrate
She is happy . She spr ays them back . They
oral of the story : Even if someone appears kind and
happened . All of the blo oming flowers turned into monsters
it is funny . He spr ays Lily again with blue
things to explore . She kn elt down by a big

MIN ACTIVATIONS
MIN = -5.222

thank you . From that day on , Lily wore her
of fun . From that day on , Lily always drew
play together . From that day on , Lily and Tim
her toys . From that day on , Lily always carried
still helpful . From that day on , Tim started to
the best !" From that day on , Lily knew that
juice happily . From that day on , Lily loved av
loved Spot . From that day on , Lily and Max
all laughed . From that day on , Lily always shared
very happy . From that day on , Mia loved to
to me ." From that day on , Max and Lily
well done !" From that day on , Lily always remembered
lemon ade . From that day on , Tim my drank
doll tightly . From that day on , Lily made sure
day long . From that day on , Lily never forgot
light bulb . From that day on , Lily knew the
with him . From that day on , Lily knew she
her mom . From that day on , Lily made sure
more fun . From that day on , Lily knew that
big hug . From that day on , Lily and Mitt

INTERVAL 3.530 - 6.448
CONTAINS 0.153%

finished her candy , she sc run ched her eyes tight
, you must answer a r iddle . If you can
them to their room and t ucks them in . She
from down underground . She pe ered down the hole ,
her scarf and asked her parents , " Can I go

INTERVAL 0.613 - 3.530
CONTAINS 28.872%

was not fearful . She liked storms . She said to
to play with you ." The man was so happy to
key as a gift . The boy said that he wanted
o stretched his fins and sw am towards the scary fish
, and yellow . Lily had an idea to make her

INTERVAL -2.304 - 0.613
CONTAINS 68.421%

explained that she should never eat something without asking her first
and it made her sad . Her mom picked up the
and held it in her hand . " Hello
could help you ." After the mosquito enjoyed the
at him . He went to the dragon and fought it

INTERVAL -5.222 - -2.304
CONTAINS 2.554%

He did not want to play with the ball . He
cool . Can we touch it ?" " No
p uddle . He smiled and stepped in it . He
bl ouse belongs to me !" grow led the bear .
dangerous . Next time , play somewhere safe ," mom said
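
Note on the intervals above: the four listed boundaries look consistent with equal-width bins spanning MIN to MAX, and the four CONTAINS percentages add up to 100%. The following is a minimal sketch that checks this, assuming equal-width binning (the source does not state how the intervals were chosen). The function name `equal_width_edges` is hypothetical and is not taken from any tool.

```python
def equal_width_edges(lo, hi, n_bins):
    """Return n_bins + 1 evenly spaced edges from lo to hi (assumed binning scheme)."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]


# Values copied from the MIN / MAX and INTERVAL / CONTAINS lines above.
MIN_ACT, MAX_ACT = -5.222, 6.448
listed_edges = [-5.222, -2.304, 0.613, 3.530, 6.448]
listed_shares = [2.554, 68.421, 28.872, 0.153]  # percent, lowest interval first

edges = equal_width_edges(MIN_ACT, MAX_ACT, 4)

# Largest difference between a computed edge and the edge printed in the dump.
max_gap = max(abs(a - b) for a, b in zip(edges, listed_edges))

# The interval shares should cover every sampled activation.
total_share = sum(listed_shares)

print("computed edges:", [f"{e:.4f}" for e in edges])
print("largest gap to listed edges:", f"{max_gap:.4f}")
print("sum of shares (%):", f"{total_share:.3f}")
```

If the equal-width assumption holds, the gap to each listed edge stays below the three-decimal display precision of the dump.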

Token sun
Token ID4252
Feature activation+1.21e+00

Pos logit contributions
 planned+0.055
 especially+0.052
ust+0.051
 discovering+0.051
ride+0.050
Neg logit contributions
 trigger-0.063
 hurts-0.059
ze-0.057
 lid-0.056
 top-0.056
Tokenfl
Token ID2704
Feature activation+2.86e+00

Pos logit contributions
 especially+0.031
 planned+0.031
 discovering+0.029
 whether+0.028
 spending+0.028
Neg logit contributions
 trigger-0.034
 hurts-0.030
ze-0.028
 lid-0.028
 top-0.027
Tokenowers
Token ID3618
Feature activation+2.43e+00

Pos logit contributions
 planned+0.072
 especially+0.072
 discovering+0.067
 whether+0.066
 spending+0.064
Neg logit contributions
 trigger-0.080
 hurts-0.073
ze-0.068
 lid-0.066
antly-0.066
Token,
Token ID11
Feature activation+2.28e+00

Pos logit contributions
 especially+0.073
 planned+0.072
 whether+0.069
 discovering+0.068
 cultures+0.067
Neg logit contributions
 trigger-0.049
 hurts-0.047
inkle-0.043
 lid-0.042
ze-0.042
Token and
Token ID290
Feature activation+9.70e-01

Pos logit contributions
 planned+0.055
ust+0.052
 especially+0.051
 discovering+0.050
ride+0.050
Neg logit contributions
 trigger-0.061
 hurts-0.057
ze-0.055
 top-0.054
 lid-0.054
Token tul
Token ID48373
Feature activation+6.45e+00

Pos logit contributions
 planned+0.025
 especially+0.024
ride+0.023
ust+0.023
 discovering+0.023
Neg logit contributions
 trigger-0.025
 hurts-0.023
ze-0.022
 tears-0.021
 cries-0.021
Tokenips
Token ID2419
Feature activation+5.20e-01

Pos logit contributions
 especially+0.188
 planned+0.185
 discovering+0.177
 whether+0.174
 explorers+0.173
Neg logit contributions
 trigger-0.143
inkle-0.132
 hurts-0.126
itch-0.120
antly-0.119
Token.
Token ID13
Feature activation+9.96e-01

Pos logit contributions
 planned+0.012
 especially+0.012
 discovering+0.011
 spending+0.011
 whether+0.010
Neg logit contributions
 trigger-0.017
 hurts-0.015
ze-0.014
 lid-0.013
 cries-0.013
Token They
Token ID1119
Feature activation+1.20e+00

Pos logit contributions
 planned+0.023
 especially+0.022
 discovering+0.021
 spending+0.020
 traveling+0.019
Neg logit contributions
 trigger-0.031
 hurts-0.028
ze-0.026
 lid-0.026
 top-0.026
Token see
Token ID766
Feature activation+3.15e-01

Pos logit contributions
 planned+0.027
 especially+0.026
 discovering+0.026
 spending+0.025
 whether+0.025
Neg logit contributions
 trigger-0.035
 hurts-0.033
ze-0.031
 top-0.031
 lid-0.030
Token pine
Token ID20161
Feature activation+8.42e-01

Pos logit contributions
 especially+0.008
 planned+0.008
 discovering+0.007
 spending+0.007
 whether+0.007
Neg logit contributions
 trigger-0.009
 hurts-0.008
ze-0.008
 lid-0.007
antly-0.007
Token they
Token ID484
Feature activation+1.29e+00

Pos logit contributions
 especially+0.051
 planned+0.049
 discovering+0.049
 spending+0.046
 whether+0.046
Neg logit contributions
 trigger-0.046
 hurts-0.043
 top-0.039
ze-0.038
 breaks-0.038
Token do
Token ID466
Feature activation-5.04e-01

Pos logit contributions
 spending+0.031
 planned+0.031
 discovering+0.030
 especially+0.029
and+0.029
Neg logit contributions
 trigger-0.036
 hurts-0.034
 cries-0.031
 top-0.031
inkle-0.030
Token it
Token ID340
Feature activation-1.04e+00

Pos logit contributions
 hurts+0.010
 trigger+0.010
ze+0.010
 top+0.010
 tears+0.009
Neg logit contributions
 especially-0.014
 planned-0.014
ust-0.013
ait-0.013
 explorers-0.013
Token.
Token ID13
Feature activation+4.29e-01

Pos logit contributions
 hurts+0.022
 trigger+0.022
ze+0.020
 top+0.020
antly+0.019
Neg logit contributions
 discovering-0.029
 planned-0.029
 especially-0.029
 spending-0.028
 cultures-0.028
Token He
Token ID679
Feature activation-5.67e-02

Pos logit contributions
 planned+0.012
 especially+0.012
 discovering+0.012
 spending+0.011
 whether+0.011
Neg logit contributions
 trigger-0.011
 hurts-0.009
ze-0.009
 lid-0.009
 top-0.008
Token imag
Token ID3590
Feature activation+6.34e+00

Pos logit contributions
 trigger+0.001
 hurts+0.001
 cries+0.001
ze+0.001
 tears+0.001
Neg logit contributions
 planned-0.002
 especially-0.002
 discovering-0.002
 spending-0.002
 whether-0.002
Tokenines
Token ID1127
Feature activation+5.76e-01

Pos logit contributions
 explorers+0.169
 cultures+0.166
Ice+0.164
 someday+0.162
 planned+0.158
Neg logit contributions
inkle-0.145
 cries-0.118
 stern-0.116
ze-0.116
 hurts-0.115
Token he
Token ID339
Feature activation-3.32e-02

Pos logit contributions
 especially+0.014
 planned+0.014
 discovering+0.013
 spending+0.013
 whether+0.013
Neg logit contributions
 trigger-0.016
 hurts-0.015
ze-0.014
 lid-0.013
 top-0.013
Token is
Token ID318
Feature activation+1.03e-01

Pos logit contributions
 trigger+0.001
 hurts+0.001
ze+0.001
 lid+0.001
 cries+0.001
Neg logit contributions
 planned-0.001
 especially-0.001
 discovering-0.001
 spending-0.001
 whether-0.001
Token one
Token ID530
Feature activation+2.24e+00

Pos logit contributions
 especially+0.002
 planned+0.002
 discovering+0.002
 spending+0.002
 traveling+0.002
Neg logit contributions
 trigger-0.003
 hurts-0.003
 lid-0.003
ze-0.003
antly-0.003
Token of
Token ID286
Feature activation+1.34e+00

Pos logit contributions
 planned+0.043
 especially+0.043
 discovering+0.039
 spending+0.038
 whether+0.037
Neg logit contributions
 trigger-0.078
 hurts-0.071
ze-0.067
 lid-0.066
 top-0.065
Token see
Token ID766
Feature activation+4.20e-01

Pos logit contributions
 trigger+0.010
 hurts+0.009
ze+0.009
 top+0.008
 lid+0.008
Neg logit contributions
 especially-0.008
 planned-0.008
 discovering-0.007
 whether-0.007
 spending-0.007
Token her
Token ID607
Feature activation+5.73e-01

Pos logit contributions
 especially+0.011
 planned+0.011
 discovering+0.010
 spending+0.010
 whether+0.010
Neg logit contributions
 trigger-0.012
 hurts-0.011
ze-0.010
 lid-0.010
antly-0.009
Token flower
Token ID15061
Feature activation+4.75e-01

Pos logit contributions
 planned+0.015
 especially+0.015
 discovering+0.014
 spending+0.014
 whether+0.014
Neg logit contributions
 trigger-0.016
 hurts-0.014
ze-0.013
 lid-0.012
antly-0.012
Token.
Token ID13
Feature activation+1.20e-01

Pos logit contributions
 planned+0.012
 especially+0.011
 discovering+0.011
 spending+0.010
 whether+0.010
Neg logit contributions
 trigger-0.014
 hurts-0.013
ze-0.012
 lid-0.012
 cries-0.011
Token She
Token ID1375
Feature activation+2.25e-01

Pos logit contributions
 planned+0.003
 especially+0.003
 discovering+0.003
 spending+0.003
 cultures+0.003
Neg logit contributions
 trigger-0.003
 hurts-0.003
 lid-0.003
 top-0.003
ze-0.003
Token imag
Token ID3590
Feature activation+6.29e+00

Pos logit contributions
 planned+0.007
 especially+0.006
 discovering+0.006
 spending+0.006
 whether+0.006
Neg logit contributions
 trigger-0.005
 hurts-0.005
 cries-0.004
ze-0.004
 lid-0.004
Tokenines
Token ID1127
Feature activation+1.06e+00

Pos logit contributions
 cultures+0.160
 explorers+0.157
 planned+0.151
 someday+0.149
Ice+0.148
Neg logit contributions
inkle-0.153
ze-0.128
 stern-0.127
 hurts-0.127
 cries-0.126
Token it
Token ID340
Feature activation-7.73e-01

Pos logit contributions
 planned+0.027
 especially+0.027
 discovering+0.025
 cultures+0.024
 whether+0.024
Neg logit contributions
 trigger-0.028
 hurts-0.027
ze-0.025
 lid-0.024
 top-0.024
Token will
Token ID481
Feature activation-6.61e-01

Pos logit contributions
 trigger+0.020
 hurts+0.016
ze+0.015
 lid+0.015
itch+0.015
Neg logit contributions
 planned-0.023
 especially-0.022
 discovering-0.020
 spending-0.020
 whether-0.020
Token be
Token ID307
Feature activation-7.82e-01

Pos logit contributions
 trigger+0.019
 hurts+0.018
ze+0.017
 top+0.016
 lid+0.016
Neg logit contributions
 especially-0.015
 planned-0.015
 discovering-0.015
 spending-0.014
 cultures-0.014
Token red
Token ID2266
Feature activation-3.56e-01

Pos logit contributions
 trigger+0.023
 hurts+0.021
ze+0.019
 lid+0.019
 cries+0.018
Neg logit contributions
 planned-0.020
 especially-0.020
 discovering-0.018
 spending-0.018
 whether-0.018
Tokenoral
Token ID6864
Feature activation+2.87e+00

Pos logit contributions
 especially+0.087
 planned+0.085
 discovering+0.082
 spending+0.080
 whether+0.080
Neg logit contributions
 trigger-0.061
 hurts-0.054
ze-0.049
 lid-0.048
 top-0.048
Token of
Token ID286
Feature activation+9.76e-02

Pos logit contributions
 especially+0.061
 planned+0.059
 discovering+0.056
 whether+0.055
 spending+0.054
Neg logit contributions
 trigger-0.090
 hurts-0.086
 lid-0.078
antly-0.078
 top-0.078
Token the
Token ID262
Feature activation+1.96e-01

Pos logit contributions
 especially+0.003
 planned+0.003
 discovering+0.003
 spending+0.003
 whether+0.002
Neg logit contributions
 trigger-0.003
 hurts-0.002
ze-0.002
 lid-0.002
 cries-0.002
Token story
Token ID1621
Feature activation+7.91e-01

Pos logit contributions
 planned+0.006
 especially+0.006
 discovering+0.006
 spending+0.006
 traveling+0.006
Neg logit contributions
 trigger-0.004
 hurts-0.004
ze-0.003
antly-0.003
 lid-0.003
Token:
Token ID25
Feature activation+1.94e+00

Pos logit contributions
 planned+0.022
 especially+0.021
 discovering+0.020
 spending+0.020
 whether+0.019
Neg logit contributions
 trigger-0.022
 hurts-0.019
ze-0.018
 lid-0.017
 cries-0.016
Token Even
Token ID3412
Feature activation+6.24e+00

Pos logit contributions
 planned+0.049
 especially+0.049
 explorers+0.045
 discovering+0.044
packs+0.044
Neg logit contributions
 trigger-0.050
 hurts-0.047
 top-0.043
ze-0.043
 cries-0.042
Token if
Token ID611
Feature activation+1.08e+00

Pos logit contributions
 especially+0.114
 planned+0.111
ait+0.104
 discovering+0.103
 explorers+0.102
Neg logit contributions
 hurts-0.195
 trigger-0.193
 breaks-0.174
 top-0.172
 cries-0.171
Token life
Token ID1204
Feature activation+9.18e-01

Pos logit contributions
 especially+0.033
 planned+0.032
 discovering+0.031
 whether+0.031
packs+0.030
Neg logit contributions
 trigger-0.022
 hurts-0.021
 top-0.019
 cries-0.018
 breaks-0.018
Token gets
Token ID3011
Feature activation-1.63e-01

Pos logit contributions
 especially+0.026
 planned+0.026
 discovering+0.024
 spending+0.024
 whether+0.024
Neg logit contributions
 trigger-0.023
 hurts-0.021
ze-0.019
 lid-0.019
 cries-0.018
Token hard
Token ID1327
Feature activation-2.79e-01

Pos logit contributions
 trigger+0.004
 hurts+0.004
ze+0.004
 lid+0.004
 top+0.003
Neg logit contributions
 planned-0.004
 especially-0.004
 discovering-0.004
 spending-0.004
 whether-0.004
Token,
Token ID11
Feature activation+1.58e+00

Pos logit contributions
 trigger+0.007
 hurts+0.007
ze+0.006
 cries+0.006
 breaks+0.006
Neg logit contributions
 especially-0.007
 planned-0.007
 discovering-0.007
 whether-0.007
 spending-0.007
Token some
Token ID617
Feature activation+1.81e+00

Pos logit contributions
 especially+0.021
 planned+0.020
 cultures+0.019
 discovering+0.019
 whether+0.019
Neg logit contributions
 trigger-0.019
 hurts-0.016
 top-0.016
ze-0.015
 lid-0.015
Token time
Token ID640
Feature activation-3.36e-02

Pos logit contributions
 especially+0.056
 discovering+0.054
 whether+0.053
 planned+0.053
 cultures+0.053
Neg logit contributions
 trigger-0.037
 hurts-0.035
 breaks-0.032
 tears-0.031
 cries-0.030
Token,
Token ID11
Feature activation+5.55e-01

Pos logit contributions
 trigger+0.001
 hurts+0.000
 breaks+0.000
 top+0.000
ze+0.000
Neg logit contributions
 especially-0.001
 planned-0.001
 cultures-0.001
 discovering-0.001
 whether-0.001
Token the
Token ID262
Feature activation+4.19e-01

Pos logit contributions
 planned+0.015
 especially+0.015
 discovering+0.014
 spending+0.013
 whether+0.013
Neg logit contributions
 trigger-0.016
 hurts-0.014
ze-0.013
 lid-0.013
 cries-0.012
Token ant
Token ID1885
Feature activation+1.04e+00

Pos logit contributions
 especially+0.012
 planned+0.012
 discovering+0.011
 spending+0.011
 whether+0.011
Neg logit contributions
 trigger-0.011
 hurts-0.009
ze-0.009
antly-0.009
 lid-0.008
Token shr
Token ID10157
Feature activation+6.15e+00

Pos logit contributions
 planned+0.023
 especially+0.022
 discovering+0.021
 spending+0.021
 whether+0.020
Neg logit contributions
 trigger-0.033
 hurts-0.030
ze-0.028
 lid-0.028
 top-0.027
Tokenank
Token ID962
Feature activation-8.93e-02

Pos logit contributions
 planned+0.181
 especially+0.174
 cultures+0.174
 discovering+0.171
 spending+0.169
Neg logit contributions
 trigger-0.139
inkle-0.123
 hurts-0.122
ze-0.113
itch-0.113
Token down
Token ID866
Feature activation+2.45e-01

Pos logit contributions
 trigger+0.003
 hurts+0.003
ze+0.002
 lid+0.002
 cries+0.002
Neg logit contributions
 especially-0.002
 planned-0.002
 discovering-0.002
 whether-0.002
 spending-0.002
Token to
Token ID284
Feature activation-3.62e-01

Pos logit contributions
 planned+0.005
 especially+0.005
 discovering+0.005
 spending+0.004
 whether+0.004
Neg logit contributions
 trigger-0.008
 hurts-0.007
ze-0.007
 lid-0.007
 top-0.007
Token the
Token ID262
Feature activation-4.84e-01

Pos logit contributions
 trigger+0.011
 hurts+0.010
ze+0.009
 lid+0.009
 top+0.009
Neg logit contributions
 planned-0.009
 especially-0.009
 discovering-0.008
 spending-0.008
 whether-0.008
Token size
Token ID2546
Feature activation+2.06e+00

Pos logit contributions
 trigger+0.013
 hurts+0.011
inkle+0.010
antly+0.010
ze+0.010
Neg logit contributions
 especially-0.014
 planned-0.013
 discovering-0.013
 spending-0.012
 going-0.012
Token she
Token ID673
Feature activation-2.97e-02

Pos logit contributions
 trigger+0.038
 hurts+0.034
 cries+0.032
inkle+0.031
 lid+0.031
Neg logit contributions
 planned-0.023
 going-0.023
 especially-0.023
 seeing-0.023
 spending-0.022
Token should
Token ID815
Feature activation-5.38e-01

Pos logit contributions
 trigger+0.001
 hurts+0.001
 lid+0.001
ze+0.001
 top+0.001
Neg logit contributions
 planned-0.001
 especially-0.001
 discovering-0.001
 spending-0.001
 whether-0.001
Token shrink
Token ID22085
Feature activation-8.77e-02

Pos logit contributions
 trigger+0.018
 hurts+0.017
ze+0.016
 lid+0.015
 top+0.015
Neg logit contributions
 planned-0.011
 especially-0.011
 discovering-0.010
 spending-0.009
 whether-0.009
Token.
Token ID13
Feature activation-7.01e-01

Pos logit contributions
 trigger+0.003
 hurts+0.002
antly+0.002
 lid+0.002
ze+0.002
Neg logit contributions
 especially-0.002
 planned-0.002
 whether-0.002
 discovering-0.002
 going-0.002
Token She
Token ID1375
Feature activation-2.11e-03

Pos logit contributions
 trigger+0.021
 hurts+0.019
ze+0.017
 lid+0.017
inkle+0.016
Neg logit contributions
 planned-0.017
 especially-0.017
 discovering-0.016
 spending-0.016
 whether-0.015
Token shr
Token ID10157
Feature activation+6.12e+00

Pos logit contributions
 trigger+0.000
 hurts+0.000
ze+0.000
 lid+0.000
antly+0.000
Neg logit contributions
 especially-0.000
 planned-0.000
 discovering-0.000
 spending-0.000
 whether-0.000
Tokenank
Token ID962
Feature activation-3.76e-01

Pos logit contributions
 planned+0.163
 especially+0.156
 cultures+0.153
 discovering+0.151
 spending+0.150
Neg logit contributions
 trigger-0.156
 hurts-0.140
inkle-0.133
ze-0.132
 cries-0.128
Token and
Token ID290
Feature activation-1.53e+00

Pos logit contributions
 trigger+0.012
 hurts+0.011
�+0.010
inkle+0.009
ze+0.009
Neg logit contributions
 especially-0.009
 planned-0.008
 discovering-0.008
 whether-0.008
 often-0.007
Token became
Token ID2627
Feature activation+1.10e+00

Pos logit contributions
 trigger+0.047
 hurts+0.042
ze+0.039
 lid+0.038
 cries+0.037
Neg logit contributions
 planned-0.037
 especially-0.036
 discovering-0.034
 spending-0.032
 whether-0.032
Token so
Token ID523
Feature activation-9.11e-01

Pos logit contributions
 especially+0.029
 planned+0.028
 discovering+0.026
 spending+0.025
 going+0.024
Neg logit contributions
 trigger-0.034
 hurts-0.030
ze-0.027
 lid-0.027
antly-0.027
Token small
Token ID1402
Feature activation+5.41e-01

Pos logit contributions
 trigger+0.022
 hurts+0.020
ze+0.018
 lid+0.017
 top+0.017
Neg logit contributions
 especially-0.027
 planned-0.027
 discovering-0.025
 spending-0.025
 whether-0.025
Token When
Token ID1649
Feature activation-1.94e-01

Pos logit contributions
 trigger+0.007
 hurts+0.006
ze+0.006
 lid+0.006
antly+0.006
Neg logit contributions
 planned-0.007
 especially-0.007
 discovering-0.006
 spending-0.006
 whether-0.006
Token he
Token ID339
Feature activation-2.11e+00

Pos logit contributions
 trigger+0.007
 hurts+0.006
ze+0.006
 lid+0.006
inkle+0.006
Neg logit contributions
 planned-0.004
 especially-0.004
 spending-0.004
 discovering-0.004
 going-0.003
Token does
Token ID857
Feature activation-2.22e+00

Pos logit contributions
 trigger+0.064
 hurts+0.053
ze+0.052
 lid+0.051
antly+0.050
Neg logit contributions
 planned-0.052
 especially-0.052
 spending-0.046
 discovering-0.046
 going-0.045
Token,
Token ID11
Feature activation+4.85e-01

Pos logit contributions
 trigger+0.066
 hurts+0.058
 lid+0.054
antly+0.053
ze+0.051
Neg logit contributions
 planned-0.057
 especially-0.057
 spending-0.054
 discovering-0.053
 going-0.052
Token she
Token ID673
Feature activation-6.41e-01

Pos logit contributions
 planned+0.012
 especially+0.012
 discovering+0.011
 spending+0.011
 whether+0.010
Neg logit contributions
 trigger-0.014
 hurts-0.013
ze-0.012
 lid-0.012
 top-0.012
Token spr
Token ID7500
Feature activation+6.11e+00

Pos logit contributions
 trigger+0.017
 hurts+0.015
ze+0.014
 lid+0.014
antly+0.014
Neg logit contributions
 especially-0.017
 planned-0.017
 discovering-0.016
 spending-0.016
 whether-0.015
Tokenays
Token ID592
Feature activation-1.61e-02

Pos logit contributions
 planned+0.186
 especially+0.185
 discovering+0.176
 spending+0.170
 whether+0.169
Neg logit contributions
 trigger-0.144
 hurts-0.126
ze-0.113
 lid-0.111
antly-0.108
Token him
Token ID683
Feature activation-1.42e+00

Pos logit contributions
 trigger+0.001
ze+0.000
inkle+0.000
 AW+0.000
itch+0.000
Neg logit contributions
 especially-0.000
 planned-0.000
 whether-0.000
 someday-0.000
 going-0.000
Token with
Token ID351
Feature activation-1.80e+00

Pos logit contributions
 trigger+0.045
 hurts+0.035
inkle+0.035
 lid+0.035
itch+0.034
Neg logit contributions
 planned-0.033
 especially-0.033
 going-0.029
 whether-0.029
 spending-0.029
Token water
Token ID1660
Feature activation+2.58e-01

Pos logit contributions
 trigger+0.056
 hurts+0.051
antly+0.049
ze+0.049
inkle+0.049
Neg logit contributions
 especially-0.040
 planned-0.038
 going-0.036
 spending-0.036
 whether-0.035
Token.
Token ID13
Feature activation-6.15e-01

Pos logit contributions
 planned+0.006
 especially+0.006
 spending+0.005
 discovering+0.005
 whether+0.005
Neg logit contributions
 trigger-0.008
 hurts-0.007
ze-0.007
antly-0.006
inkle-0.006
Token enough
Token ID1576
Feature activation+6.77e-01

Pos logit contributions
 trigger+0.027
 hurts+0.024
ze+0.022
 lid+0.021
 top+0.021
Neg logit contributions
 planned-0.029
 especially-0.029
 discovering-0.027
 spending-0.027
 whether-0.026
Token,
Token ID11
Feature activation+1.43e+00

Pos logit contributions
 especially+0.022
 planned+0.022
 discovering+0.020
 spending+0.020
 whether+0.020
Neg logit contributions
 trigger-0.015
 hurts-0.013
 lid-0.011
 top-0.011
ze-0.011
Token some
Token ID617
Feature activation+2.45e+00

Pos logit contributions
 planned+0.037
 especially+0.037
 discovering+0.035
 spending+0.034
 whether+0.033
Neg logit contributions
 trigger-0.040
 hurts-0.035
ze-0.033
 lid-0.032
 top-0.032
Token little
Token ID1310
Feature activation+1.69e+00

Pos logit contributions
 especially+0.056
 planned+0.055
 discovering+0.053
 whether+0.052
 spending+0.052
Neg logit contributions
 trigger-0.074
 hurts-0.067
 lid-0.062
ze-0.062
 tears-0.060
Token green
Token ID4077
Feature activation+1.76e+00

Pos logit contributions
 planned+0.042
 especially+0.041
 discovering+0.039
 spending+0.038
 whether+0.037
Neg logit contributions
 trigger-0.049
 hurts-0.044
ze-0.041
 lid-0.041
 cries-0.040
Token spr
Token ID7500
Feature activation+6.10e+00

Pos logit contributions
 especially+0.044
 planned+0.043
 discovering+0.041
 spending+0.040
 whether+0.040
Neg logit contributions
 trigger-0.050
 hurts-0.045
ze-0.042
 lid-0.042
 top-0.041
Tokenouts
Token ID5269
Feature activation+5.61e-01

Pos logit contributions
 especially+0.173
 planned+0.173
 discovering+0.164
 spending+0.159
 whether+0.157
Neg logit contributions
 trigger-0.158
 hurts-0.137
ze-0.125
 lid-0.124
antly-0.123
Token started
Token ID2067
Feature activation+2.01e+00

Pos logit contributions
 planned+0.013
 especially+0.013
 discovering+0.012
 spending+0.012
 whether+0.011
Neg logit contributions
 trigger-0.017
 hurts-0.016
ze-0.015
 lid-0.014
 top-0.014
Token appearing
Token ID12655
Feature activation+6.64e-01

Pos logit contributions
 especially+0.035
 planned+0.034
 discovering+0.031
 whether+0.029
 cultures+0.029
Neg logit contributions
 trigger-0.072
 hurts-0.067
ze-0.063
 lid-0.062
 top-0.062
Token in
Token ID287
Feature activation+5.01e-01

Pos logit contributions
 planned+0.016
 especially+0.016
 discovering+0.015
 spending+0.015
 whether+0.014
Neg logit contributions
 trigger-0.019
 hurts-0.017
ze-0.016
 lid-0.016
 top-0.016
Token the
Token ID262
Feature activation-2.96e-01

Pos logit contributions
 planned+0.014
 especially+0.014
 discovering+0.013
 spending+0.012
 whether+0.012
Neg logit contributions
 trigger-0.014
 hurts-0.012
ze-0.011
 lid-0.011
 cries-0.011
Token,
Token ID11
Feature activation+1.44e+00

Pos logit contributions
 especially+0.029
 planned+0.028
 discovering+0.027
 whether+0.027
 cultures+0.026
Neg logit contributions
 trigger-0.016
 hurts-0.014
 lid-0.013
 top-0.012
 breaks-0.012
Token the
Token ID262
Feature activation+1.95e+00

Pos logit contributions
 planned+0.041
 especially+0.041
 discovering+0.038
 spending+0.037
 whether+0.037
Neg logit contributions
 trigger-0.037
 hurts-0.032
ze-0.030
 lid-0.029
 top-0.029
Token flowers
Token ID12734
Feature activation+2.77e+00

Pos logit contributions
 especially+0.056
 planned+0.055
 spending+0.054
 discovering+0.054
 whether+0.053
Neg logit contributions
 trigger-0.050
 hurts-0.041
 lid-0.041
 top-0.041
 tears-0.038
Token started
Token ID2067
Feature activation+2.33e+00

Pos logit contributions
 whether+0.065
 cultures+0.064
 especially+0.063
 spending+0.062
 discovering+0.062
Neg logit contributions
 trigger-0.075
 hurts-0.070
 lid-0.068
ze-0.068
 top-0.067
Token to
Token ID284
Feature activation+2.21e-02

Pos logit contributions
 especially+0.049
 cultures+0.047
ust+0.045
 planned+0.044
 appreciate+0.044
Neg logit contributions
 trigger-0.068
 hurts-0.065
 tears-0.064
ze-0.063
 top-0.063
Token spr
Token ID7500
Feature activation+6.06e+00

Pos logit contributions
 cultures+0.001
ooters+0.001
 planned+0.001
packs+0.001
ust+0.001
Neg logit contributions
 trigger-0.001
 top-0.001
 hurts-0.000
ze-0.000
 lid-0.000
Tokenout
Token ID448
Feature activation-9.03e-02

Pos logit contributions
 planned+0.153
 especially+0.152
 discovering+0.143
 spending+0.139
 whether+0.137
Neg logit contributions
 trigger-0.174
 hurts-0.155
ze-0.143
 lid-0.140
antly-0.138
Token.
Token ID13
Feature activation-2.14e-01

Pos logit contributions
 trigger+0.003
 hurts+0.002
ze+0.002
 lid+0.002
 top+0.002
Neg logit contributions
 planned-0.002
 especially-0.002
 discovering-0.002
 spending-0.002
 whether-0.002
Token The
Token ID383
Feature activation+1.63e+00

Pos logit contributions
 trigger+0.005
 hurts+0.004
ze+0.004
 lid+0.004
 top+0.004
Neg logit contributions
 planned-0.007
 especially-0.007
 discovering-0.006
 spending-0.006
 whether-0.006
Token child
Token ID1200
Feature activation+5.79e-01

Pos logit contributions
 planned+0.045
 especially+0.045
 discovering+0.042
 spending+0.041
 whether+0.041
Neg logit contributions
 trigger-0.043
 hurts-0.038
ze-0.035
 lid-0.035
 top-0.034
Token was
Token ID373
Feature activation+5.53e-01

Pos logit contributions
 planned+0.014
 especially+0.013
 discovering+0.013
 spending+0.013
 whether+0.013
Neg logit contributions
 trigger-0.017
 hurts-0.015
ze-0.015
 lid-0.014
 top-0.014
Token,
Token ID11
Feature activation+1.72e+00

Pos logit contributions
 planned+0.079
 especially+0.078
 discovering+0.073
 whether+0.073
 spending+0.071
Neg logit contributions
 trigger-0.067
 hurts-0.061
ze-0.056
 lid-0.055
 top-0.055
Token da
Token ID12379
Feature activation+1.61e+00

Pos logit contributions
 planned+0.044
 especially+0.042
 discovering+0.041
ride+0.040
 spending+0.039
Neg logit contributions
 trigger-0.046
 hurts-0.043
 tears-0.040
 lid-0.040
ze-0.039
Tokenis
Token ID271
Feature activation+4.76e-01

Pos logit contributions
 planned+0.045
 especially+0.045
 discovering+0.042
 spending+0.041
 whether+0.041
Neg logit contributions
 trigger-0.041
 hurts-0.037
ze-0.034
 lid-0.033
 top-0.032
Tokenies
Token ID444
Feature activation+1.89e+00

Pos logit contributions
 planned+0.008
 especially+0.008
 discovering+0.007
 spending+0.007
 whether+0.007
Neg logit contributions
 trigger-0.018
 hurts-0.016
ze-0.015
 lid-0.015
 top-0.015
Token,
Token ID11
Feature activation+2.01e+00

Pos logit contributions
 planned+0.058
 especially+0.058
 discovering+0.054
 whether+0.054
 cultures+0.053
Neg logit contributions
 trigger-0.042
 hurts-0.037
ze-0.034
 lid-0.033
 top-0.032
Token tul
Token ID48373
Feature activation+6.05e+00

Pos logit contributions
 planned+0.051
 especially+0.048
 discovering+0.047
ust+0.047
ride+0.046
Neg logit contributions
 trigger-0.054
 hurts-0.050
 lid-0.047
ze-0.047
 tears-0.047
Tokenips
Token ID2419
Feature activation+1.31e+00

Pos logit contributions
 especially+0.191
 planned+0.190
 discovering+0.179
 whether+0.175
 appreciate+0.173
Neg logit contributions
 trigger-0.131
 hurts-0.112
inkle-0.106
itch-0.102
ze-0.099
Token and
Token ID290
Feature activation-8.62e-01

Pos logit contributions
 planned+0.028
 especially+0.028
 discovering+0.025
 spending+0.024
 whether+0.024
Neg logit contributions
 trigger-0.044
 hurts-0.040
ze-0.037
 lid-0.036
 cries-0.036
Token more
Token ID517
Feature activation+4.60e-01

Pos logit contributions
 trigger+0.023
 hurts+0.021
ze+0.019
 lid+0.019
 cries+0.019
Neg logit contributions
 planned-0.023
 especially-0.022
 discovering-0.021
 whether-0.021
 spending-0.020
Token.
Token ID13
Feature activation-4.02e-01

Pos logit contributions
 planned+0.013
 especially+0.013
 discovering+0.012
 spending+0.012
 whether+0.011
Neg logit contributions
 trigger-0.012
 hurts-0.011
ze-0.010
 lid-0.010
antly-0.009
Token She
Token ID1375
Feature activation-1.64e-01

Pos logit contributions
 trigger+0.015
 hurts+0.014
ze+0.013
inkle+0.013
antly+0.013
Neg logit contributions
 especially-0.007
 planned-0.007
 spending-0.006
 discovering-0.006
 whether-0.006
Token also
Token ID635
Feature activation+8.12e-01

Pos logit contributions
 trigger+0.014
 hurts+0.012
 lid+0.011
ze+0.011
antly+0.011
Neg logit contributions
 especially-0.013
 planned-0.013
 discovering-0.012
 spending-0.012
 whether-0.012
Token likes
Token ID7832
Feature activation+1.05e+00

Pos logit contributions
 especially+0.020
 planned+0.019
 discovering+0.018
 spending+0.017
 whether+0.017
Neg logit contributions
 trigger-0.025
 hurts-0.022
ze-0.020
 lid-0.020
antly-0.020
Token children
Token ID1751
Feature activation-7.81e-01

Pos logit contributions
 planned+0.025
 especially+0.025
 discovering+0.023
 spending+0.022
 whether+0.022
Neg logit contributions
 trigger-0.031
 hurts-0.028
ze-0.026
 lid-0.025
 top-0.025
Token.
Token ID13
Feature activation-2.03e-01

Pos logit contributions
 trigger+0.023
 hurts+0.019
 lid+0.018
antly+0.018
ze+0.018
Neg logit contributions
 especially-0.021
 planned-0.020
 discovering-0.019
 whether-0.018
 spending-0.018
Token He
Token ID679
Feature activation-1.89e-01

Pos logit contributions
 trigger+0.006
 hurts+0.005
ze+0.005
 lid+0.005
 top+0.005
Neg logit contributions
 especially-0.005
 planned-0.005
 discovering-0.005
 spending-0.005
 whether-0.005
Token imag
Token ID3590
Feature activation+6.01e+00

Pos logit contributions
 trigger+0.005
 hurts+0.005
ze+0.004
 lid+0.004
antly+0.004
Neg logit contributions
 especially-0.005
 planned-0.005
 discovering-0.005
 spending-0.005
 whether-0.004
Tokenines
Token ID1127
Feature activation+4.29e-01

Pos logit contributions
 explorers+0.153
 cultures+0.150
 someday+0.148
 planned+0.147
Ice+0.144
Neg logit contributions
inkle-0.140
 hurts-0.130
 trigger-0.123
 cries-0.122
ze-0.121
Token them
Token ID606
Feature activation-4.04e-02

Pos logit contributions
 planned+0.011
 especially+0.011
 discovering+0.010
 spending+0.010
 cultures+0.010
Neg logit contributions
 trigger-0.012
 hurts-0.011
ze-0.010
 top-0.010
 lid-0.010
Token having
Token ID1719
Feature activation-7.77e-01

Pos logit contributions
 trigger+0.001
 hurts+0.001
 lid+0.001
ze+0.001
antly+0.001
Neg logit contributions
 especially-0.001
 planned-0.001
 discovering-0.001
 spending-0.001
 whether-0.001
Token fun
Token ID1257
Feature activation-2.74e-01

Pos logit contributions
 trigger+0.027
 hurts+0.024
 lid+0.022
ze+0.022
antly+0.022
Neg logit contributions
 especially-0.016
 planned-0.016
 discovering-0.014
 spending-0.014
 whether-0.014
Token and
Token ID290
Feature activation-1.17e+00

Pos logit contributions
 trigger+0.007
 hurts+0.006
ze+0.006
 lid+0.005
 top+0.005
Neg logit contributions
 planned-0.008
 especially-0.008
 discovering-0.007
 spending-0.007
 whether-0.007
Token pond
Token ID16723
Feature activation-4.32e-01

Pos logit contributions
 trigger+0.015
 hurts+0.013
ze+0.013
antly+0.013
itch+0.013
Neg logit contributions
 planned-0.010
 especially-0.009
 discovering-0.009
 traveling-0.009
 going-0.008
Token.
Token ID13
Feature activation+3.45e-02

Pos logit contributions
 trigger+0.011
 hurts+0.010
ze+0.009
 lid+0.009
 top+0.009
Neg logit contributions
 planned-0.012
 especially-0.012
 discovering-0.011
 spending-0.011
 whether-0.011
Token The
Token ID383
Feature activation+2.76e-01

Pos logit contributions
 planned+0.001
 especially+0.001
 discovering+0.001
 spending+0.001
 whether+0.001
Neg logit contributions
 trigger-0.001
 hurts-0.001
ze-0.001
inkle-0.001
 lid-0.001
Token two
Token ID734
Feature activation+3.01e+00

Pos logit contributions
 especially+0.005
 planned+0.005
 discovering+0.005
 traveling+0.005
 explorers+0.004
Neg logit contributions
 trigger-0.009
 hurts-0.009
inkle-0.008
antly-0.008
ze-0.008
Token ze
Token ID41271
Feature activation+1.35e+00

Pos logit contributions
 planned+0.062
 especially+0.061
 discovering+0.058
 spending+0.056
 whether+0.056
Neg logit contributions
 trigger-0.099
 hurts-0.090
ze-0.084
 lid-0.083
 cries-0.083
Tokenbr
Token ID1671
Feature activation+5.96e+00

Pos logit contributions
 planned+0.037
 especially+0.037
 discovering+0.035
 spending+0.034
 whether+0.033
Neg logit contributions
 trigger-0.036
 hurts-0.032
ze-0.029
 lid-0.028
 top-0.028
Tokenas
Token ID292
Feature activation+2.09e+00

Pos logit contributions
 especially+0.112
 discovering+0.103
 spending+0.098
 going+0.098
 traveling+0.097
Neg logit contributions
 trigger-0.206
 hurts-0.188
antly-0.184
 lid-0.182
itch-0.180
Token stayed
Token ID9658
Feature activation+5.07e-01

Pos logit contributions
 especially+0.045
 planned+0.045
 discovering+0.042
 spending+0.041
 whether+0.040
Neg logit contributions
 trigger-0.067
 hurts-0.060
ze-0.057
 lid-0.056
antly-0.055
Token together
Token ID1978
Feature activation+1.61e+00

Pos logit contributions
 planned+0.012
 especially+0.012
 discovering+0.011
 spending+0.010
 whether+0.010
Neg logit contributions
 trigger-0.015
 hurts-0.014
ze-0.013
 lid-0.013
 top-0.013
Token for
Token ID329
Feature activation-1.67e+00

Pos logit contributions
 planned+0.039
 especially+0.038
 cultures+0.035
 discovering+0.035
 spending+0.035
Neg logit contributions
 trigger-0.045
 hurts-0.040
ze-0.040
 top-0.039
 lid-0.038
Token a
Token ID257
Feature activation-9.40e-01

Pos logit contributions
 trigger+0.051
 hurts+0.046
 lid+0.041
ze+0.041
antly+0.040
Neg logit contributions
 planned-0.042
 especially-0.042
 discovering-0.040
 spending-0.039
 whether-0.037
Tokenie
Token ID494
Feature activation+3.92e-01

Pos logit contributions
 planned+0.031
 especially+0.031
 discovering+0.028
 spending+0.027
 whether+0.027
Neg logit contributions
 trigger-0.047
 hurts-0.043
ze-0.040
 lid-0.040
 top-0.039
Token.
Token ID13
Feature activation+5.32e-01

Pos logit contributions
 planned+0.010
 especially+0.010
 discovering+0.009
 spending+0.009
 whether+0.009
Neg logit contributions
 trigger-0.011
 hurts-0.010
ze-0.009
 lid-0.009
antly-0.009
Token They
Token ID1119
Feature activation+1.57e+00

Pos logit contributions
 planned+0.010
 especially+0.010
 discovering+0.009
 spending+0.009
 whether+0.009
Neg logit contributions
 trigger-0.018
 hurts-0.017
ze-0.016
 lid-0.016
antly-0.015
Token both
Token ID1111
Feature activation+2.16e+00

Pos logit contributions
 discovering+0.026
 planned+0.026
 especially+0.026
 whether+0.024
 traveling+0.024
Neg logit contributions
 trigger-0.054
 hurts-0.050
ze-0.047
 top-0.046
 cries-0.046
Token had
Token ID550
Feature activation+2.07e+00

Pos logit contributions
 planned+0.031
 especially+0.031
 discovering+0.026
 spending+0.025
 whether+0.024
Neg logit contributions
 trigger-0.088
 hurts-0.080
ze-0.076
 lid-0.076
 top-0.074
Token birth
Token ID4082
Feature activation+5.96e+00

Pos logit contributions
 planned+0.060
 especially+0.060
 discovering+0.056
 spending+0.055
 whether+0.054
Neg logit contributions
 trigger-0.051
 hurts-0.045
ze-0.042
 lid-0.041
 top-0.040
Tokendays
Token ID12545
Feature activation+2.51e+00

Pos logit contributions
 pic+0.109
 planned+0.099
 especially+0.098
 Kinder+0.095
 discovering+0.092
Neg logit contributions
antly-0.168
 hurts-0.161
 trigger-0.160
 away-0.154
ze-0.149
Token coming
Token ID2406
Feature activation+1.27e+00

Pos logit contributions
 discovering+0.056
 especially+0.055
 cultures+0.054
 traveling+0.053
 planned+0.053
Neg logit contributions
 hurts-0.062
 trigger-0.061
ze-0.057
 top-0.056
 breaks-0.056
Token up
Token ID510
Feature activation+2.48e-01

Pos logit contributions
 planned+0.021
 especially+0.021
 discovering+0.019
 spending+0.018
 whether+0.018
Neg logit contributions
 trigger-0.048
 hurts-0.044
ze-0.041
 lid-0.041
 cries-0.040
Token,
Token ID11
Feature activation+1.94e+00

Pos logit contributions
 especially+0.006
 planned+0.006
 discovering+0.006
 cultures+0.006
 whether+0.006
Neg logit contributions
 trigger-0.006
 hurts-0.006
 top-0.005
 breaks-0.005
 cries-0.005
Token and
Token ID290
Feature activation+3.48e-02

Pos logit contributions
 planned+0.042
ust+0.039
 cultures+0.038
 especially+0.037
ait+0.037
Neg logit contributions
 trigger-0.054
 hurts-0.053
 tears-0.050
 cries-0.050
 top-0.050
Tokenoral
Token ID6864
Feature activation+2.19e+00

Pos logit contributions
 especially+0.087
 discovering+0.083
 planned+0.083
 spending+0.081
 whether+0.080
Neg logit contributions
 trigger-0.044
 hurts-0.037
 lid-0.032
 top-0.031
ze-0.031
Token of
Token ID286
Feature activation-1.21e-01

Pos logit contributions
 planned+0.039
 especially+0.039
 discovering+0.035
 spending+0.034
 whether+0.033
Neg logit contributions
 trigger-0.080
 hurts-0.073
ze-0.069
 lid-0.068
 top-0.067
Token the
Token ID262
Feature activation-3.25e-01

Pos logit contributions
 trigger+0.003
 hurts+0.003
ze+0.002
 lid+0.002
 top+0.002
Neg logit contributions
 planned-0.004
 especially-0.004
 discovering-0.003
 spending-0.003
 whether-0.003
Token story
Token ID1621
Feature activation+4.31e-01

Pos logit contributions
 trigger+0.008
 hurts+0.007
ze+0.007
antly+0.006
 lid+0.006
Neg logit contributions
 planned-0.010
 especially-0.010
 discovering-0.009
 spending-0.009
 going-0.009
Token:
Token ID25
Feature activation+1.25e+00

Pos logit contributions
 planned+0.012
 especially+0.012
 discovering+0.011
 spending+0.011
 whether+0.011
Neg logit contributions
 trigger-0.011
 hurts-0.010
ze-0.009
 lid-0.009
 cries-0.009
Token Even
Token ID3412
Feature activation+5.92e+00

Pos logit contributions
 planned+0.033
 especially+0.033
 discovering+0.030
 explorers+0.030
 cultures+0.029
Neg logit contributions
 trigger-0.032
 hurts-0.030
ze-0.028
 top-0.027
 cries-0.027
Token the
Token ID262
Feature activation+2.00e-01

Pos logit contributions
 especially+0.109
 planned+0.109
 discovering+0.102
 cultures+0.098
ride+0.098
Neg logit contributions
 hurts-0.185
 trigger-0.184
 breaks-0.164
 tears-0.163
 cries-0.162
Token simplest
Token ID24043
Feature activation+2.36e+00

Pos logit contributions
 especially+0.005
 planned+0.005
 discovering+0.005
 spending+0.005
 traveling+0.005
Neg logit contributions
 trigger-0.006
 hurts-0.005
ze-0.005
antly-0.005
 cries-0.005
Token of
Token ID286
Feature activation+1.60e+00

Pos logit contributions
 planned+0.060
 especially+0.060
 discovering+0.056
 spending+0.054
 whether+0.054
Neg logit contributions
 trigger-0.067
 hurts-0.060
ze-0.056
 lid-0.055
 top-0.053
Token things
Token ID1243
Feature activation+2.15e+00

Pos logit contributions
 planned+0.045
 especially+0.045
 discovering+0.042
 spending+0.041
 whether+0.041
Neg logit contributions
 trigger-0.041
 hurts-0.036
ze-0.033
 lid-0.032
 cries-0.031
Token can
Token ID460
Feature activation+6.31e-01

Pos logit contributions
 especially+0.055
 planned+0.054
 discovering+0.052
 whether+0.051
 spending+0.051
Neg logit contributions
 trigger-0.055
 hurts-0.054
 breaks-0.047
ze-0.046
 lid-0.046
Token every
Token ID790
Feature activation+6.84e-01

Pos logit contributions
 especially+0.024
 planned+0.024
 discovering+0.022
 whether+0.022
 spending+0.021
Neg logit contributions
 trigger-0.032
 hurts-0.030
ze-0.028
antly-0.028
 top-0.027
Token year
Token ID614
Feature activation+1.91e+00

Pos logit contributions
 planned+0.016
 especially+0.016
 discovering+0.015
 spending+0.015
 whether+0.014
Neg logit contributions
 trigger-0.021
 hurts-0.019
ze-0.017
 lid-0.017
 cries-0.017
Token,
Token ID11
Feature activation+7.53e-01

Pos logit contributions
 especially+0.047
 planned+0.047
 discovering+0.044
 spending+0.043
 whether+0.043
Neg logit contributions
 trigger-0.055
 hurts-0.049
ze-0.045
 lid-0.045
 top-0.044
Token on
Token ID319
Feature activation-7.18e-01

Pos logit contributions
 planned+0.018
 especially+0.018
 discovering+0.017
 spending+0.017
 whether+0.016
Neg logit contributions
 trigger-0.022
 hurts-0.020
ze-0.019
 lid-0.018
 cries-0.018
Token their
Token ID511
Feature activation+4.61e-01

Pos logit contributions
 trigger+0.024
 hurts+0.020
ze+0.019
 lid+0.018
 cries+0.018
Neg logit contributions
 planned-0.017
 especially-0.017
 discovering-0.015
 spending-0.015
 traveling-0.015
Token birth
Token ID4082
Feature activation+5.92e+00

Pos logit contributions
 especially+0.010
 planned+0.010
 discovering+0.009
 spending+0.009
 cultures+0.008
Neg logit contributions
 trigger-0.016
 hurts-0.014
 lid-0.013
ze-0.013
antly-0.013
Tokendays
Token ID12545
Feature activation+1.84e+00

Pos logit contributions
 especially+0.102
 pic+0.101
 discovering+0.096
 planned+0.092
 traveling+0.091
Neg logit contributions
antly-0.183
 trigger-0.172
 hurts-0.172
 away-0.165
 lid-0.163
Token,
Token ID11
Feature activation+1.32e+00

Pos logit contributions
 especially+0.034
 planned+0.034
 discovering+0.033
 whether+0.032
 spending+0.031
Neg logit contributions
 trigger-0.059
 hurts-0.057
ze-0.054
antly-0.052
 top-0.052
Token they
Token ID484
Feature activation+2.05e+00

Pos logit contributions
 especially+0.032
 planned+0.031
 discovering+0.029
 whether+0.028
 spending+0.028
Neg logit contributions
 trigger-0.036
 hurts-0.033
ze-0.032
 top-0.031
antly-0.031
Token would
Token ID561
Feature activation+1.94e+00

Pos logit contributions
 discovering+0.033
 whether+0.033
and+0.032
 planned+0.031
 especially+0.031
Neg logit contributions
 trigger-0.066
 hurts-0.063
ze-0.061
 top-0.061
inkle-0.059
Token celebrate
Token ID10648
Feature activation+1.78e-01

Pos logit contributions
 planned+0.037
 especially+0.033
 whether+0.033
 discovering+0.033
 cultures+0.032
Neg logit contributions
 trigger-0.061
ze-0.058
 top-0.058
 hurts-0.058
inkle-0.054
Token She
Token ID1375
Feature activation-3.82e-01

Pos logit contributions
 trigger+0.014
 hurts+0.013
 lid+0.012
 cries+0.012
 top+0.012
Neg logit contributions
 planned-0.013
 especially-0.013
 discovering-0.012
 spending-0.011
 whether-0.011
Token is
Token ID318
Feature activation-3.25e-01

Pos logit contributions
 trigger+0.008
 hurts+0.007
ze+0.006
 lid+0.006
 cries+0.006
Neg logit contributions
 planned-0.013
 especially-0.013
 discovering-0.012
 spending-0.012
 whether-0.012
Token happy
Token ID3772
Feature activation-3.84e-02

Pos logit contributions
 trigger+0.010
 hurts+0.008
 lid+0.008
ze+0.008
antly+0.007
Neg logit contributions
 especially-0.009
 planned-0.008
 discovering-0.008
 spending-0.008
 whether-0.007
Token.
Token ID13
Feature activation-5.46e-01

Pos logit contributions
 hurts+0.001
 trigger+0.001
 cries+0.001
 tears+0.001
ze+0.001
Neg logit contributions
 planned-0.001
 especially-0.001
 cultures-0.001
 discovering-0.001
ust-0.001
Token She
Token ID1375
Feature activation-1.29e-01

Pos logit contributions
 trigger+0.014
 hurts+0.013
 lid+0.011
 cries+0.011
 top+0.011
Neg logit contributions
 planned-0.015
 especially-0.015
 discovering-0.014
 spending-0.014
 cultures-0.014
Token spr
Token ID7500
Feature activation+5.92e+00

Pos logit contributions
 trigger+0.002
 hurts+0.002
 cries+0.002
ze+0.002
 lid+0.002
Neg logit contributions
 especially-0.004
 planned-0.004
 discovering-0.004
 spending-0.004
 whether-0.004
Tokenays
Token ID592
Feature activation-1.72e-01

Pos logit contributions
 planned+0.178
 especially+0.175
 discovering+0.169
 spending+0.161
 whether+0.160
Neg logit contributions
 trigger-0.145
 hurts-0.126
ze-0.111
antly-0.110
 lid-0.110
Token them
Token ID606
Feature activation-9.44e-01

Pos logit contributions
 trigger+0.005
ze+0.004
 hurts+0.004
inkle+0.004
 AW+0.004
Neg logit contributions
 especially-0.004
 planned-0.004
 spending-0.004
 someday-0.004
 whether-0.004
Token back
Token ID736
Feature activation-1.12e-01

Pos logit contributions
 trigger+0.028
 hurts+0.024
ze+0.022
 lid+0.022
antly+0.022
Neg logit contributions
 planned-0.024
 especially-0.024
 discovering-0.021
 spending-0.021
 whether-0.021
Token.
Token ID13
Feature activation-5.46e-01

Pos logit contributions
 trigger+0.004
 hurts+0.004
ze+0.003
 lid+0.003
 top+0.003
Neg logit contributions
 planned-0.002
 especially-0.002
 discovering-0.002
 spending-0.002
 whether-0.002
Token They
Token ID1119
Feature activation+4.23e-01

Pos logit contributions
 trigger+0.016
 hurts+0.014
ze+0.013
 lid+0.013
 top+0.013
Neg logit contributions
 planned-0.014
 especially-0.013
 discovering-0.013
 spending-0.012
 whether-0.012
Tokenoral
Token ID6864
Feature activation+2.28e+00

Pos logit contributions
 especially+0.071
 planned+0.070
 discovering+0.067
 spending+0.066
 whether+0.065
Neg logit contributions
 trigger-0.048
 hurts-0.042
ze-0.038
 lid-0.037
 top-0.036
Token of
Token ID286
Feature activation-9.28e-02

Pos logit contributions
 planned+0.046
 especially+0.046
 discovering+0.041
 spending+0.040
 whether+0.039
Neg logit contributions
 trigger-0.078
 hurts-0.071
ze-0.067
 lid-0.066
 top-0.064
Token the
Token ID262
Feature activation-4.18e-01

Pos logit contributions
 trigger+0.003
 hurts+0.002
ze+0.002
 lid+0.002
antly+0.002
Neg logit contributions
 especially-0.003
 planned-0.003
 discovering-0.002
 spending-0.002
 whether-0.002
Token story
Token ID1621
Feature activation+1.36e-01

Pos logit contributions
 trigger+0.011
 hurts+0.009
ze+0.009
antly+0.009
inkle+0.008
Neg logit contributions
 planned-0.012
 especially-0.012
 discovering-0.011
 going-0.011
 spending-0.011
Token:
Token ID25
Feature activation+1.19e+00

Pos logit contributions
 planned+0.003
 especially+0.003
 discovering+0.003
 spending+0.003
 whether+0.003
Neg logit contributions
 trigger-0.004
ze-0.003
 hurts-0.003
 lid-0.003
 top-0.003
Token Even
Token ID3412
Feature activation+5.91e+00

Pos logit contributions
 planned+0.029
 especially+0.028
 explorers+0.026
packs+0.026
 discovering+0.025
Neg logit contributions
 trigger-0.032
 hurts-0.032
ze-0.029
 cries-0.028
 top-0.028
Token if
Token ID611
Feature activation+6.89e-01

Pos logit contributions
 planned+0.096
 especially+0.096
 discovering+0.090
ait+0.088
ride+0.087
Neg logit contributions
 hurts-0.199
 trigger-0.194
 breaks-0.176
 cries-0.175
 tears-0.174
Token someone
Token ID2130
Feature activation+1.76e+00

Pos logit contributions
 especially+0.021
 planned+0.020
 discovering+0.020
ride+0.019
 whether+0.019
Neg logit contributions
 trigger-0.014
 hurts-0.014
 top-0.012
 cries-0.012
 breaks-0.012
Token appears
Token ID3568
Feature activation+1.02e+00

Pos logit contributions
 especially+0.045
 planned+0.045
 discovering+0.043
 spending+0.041
 whether+0.041
Neg logit contributions
 trigger-0.047
 hurts-0.045
ze-0.040
 cries-0.039
 lid-0.038
Token kind
Token ID1611
Feature activation+1.87e+00

Pos logit contributions
 planned+0.025
 especially+0.025
 discovering+0.023
 spending+0.022
 whether+0.022
Neg logit contributions
 trigger-0.031
 hurts-0.027
ze-0.025
 lid-0.025
 top-0.024
Token and
Token ID290
Feature activation-8.92e-01

Pos logit contributions
 planned+0.037
 especially+0.037
 discovering+0.034
 spending+0.033
 whether+0.032
Neg logit contributions
 trigger-0.063
 hurts-0.059
ze-0.054
 lid-0.054
 cries-0.053
Token happened
Token ID3022
Feature activation-1.37e+00

Pos logit contributions
 trigger+0.032
 hurts+0.029
ze+0.027
 lid+0.026
 cries+0.025
Neg logit contributions
 especially-0.027
 planned-0.027
 discovering-0.025
 spending-0.025
 whether-0.024
Token.
Token ID13
Feature activation-5.26e-01

Pos logit contributions
 trigger+0.036
 hurts+0.032
ze+0.029
 lid+0.029
 cries+0.028
Neg logit contributions
 planned-0.038
 especially-0.038
 discovering-0.036
 spending-0.035
 whether-0.034
Token All
Token ID1439
Feature activation+2.50e+00

Pos logit contributions
 trigger+0.015
 hurts+0.014
ze+0.012
inkle+0.012
 lid+0.012
Neg logit contributions
 planned-0.014
 especially-0.014
 spending-0.013
 discovering-0.012
 whether-0.012
Token of
Token ID286
Feature activation+1.37e+00

Pos logit contributions
 planned+0.049
 especially+0.049
 discovering+0.044
 spending+0.042
 whether+0.041
Neg logit contributions
 trigger-0.088
 hurts-0.078
ze-0.076
 lid-0.074
 top-0.072
Token the
Token ID262
Feature activation+5.00e-01

Pos logit contributions
 planned+0.037
 especially+0.037
 discovering+0.034
 spending+0.034
 whether+0.033
Neg logit contributions
 trigger-0.038
 hurts-0.034
ze-0.031
 lid-0.030
antly-0.030
Token blo
Token ID24924
Feature activation+5.84e+00

Pos logit contributions
 planned+0.013
 especially+0.012
 discovering+0.011
 traveling+0.011
 explorers+0.011
Neg logit contributions
 trigger-0.015
 hurts-0.013
ze-0.012
antly-0.012
itch-0.012
Tokenoming
Token ID3383
Feature activation+3.23e-01

Pos logit contributions
 especially+0.144
 planned+0.139
 whether+0.131
 discovering+0.129
 spending+0.128
Neg logit contributions
 trigger-0.174
 hurts-0.159
 lid-0.147
inkle-0.144
antly-0.143
Token flowers
Token ID12734
Feature activation+1.41e+00

Pos logit contributions
 planned+0.008
 especially+0.008
 discovering+0.007
 spending+0.007
 explorers+0.007
Neg logit contributions
 trigger-0.010
 hurts-0.008
ze-0.008
itch-0.008
antly-0.008
Token turned
Token ID2900
Feature activation+4.64e-01

Pos logit contributions
 especially+0.033
 planned+0.033
 discovering+0.030
 whether+0.030
 spending+0.030
Neg logit contributions
 trigger-0.041
 hurts-0.038
ze-0.035
 lid-0.035
 top-0.034
Token into
Token ID656
Feature activation-8.83e-01

Pos logit contributions
 planned+0.011
 especially+0.010
 going+0.009
 spending+0.009
 explorers+0.009
Neg logit contributions
 trigger-0.014
 hurts-0.013
itch-0.012
ze-0.012
ggy-0.012
Token monsters
Token ID10963
Feature activation-3.02e-02

Pos logit contributions
 trigger+0.026
 hurts+0.024
ze+0.021
itch+0.021
antly+0.021
Neg logit contributions
 planned-0.022
 especially-0.022
 discovering-0.020
 spending-0.020
 explorers-0.020
Token it
Token ID340
Feature activation-2.10e+00

Pos logit contributions
 especially+0.014
 planned+0.014
 discovering+0.013
 spending+0.013
 whether+0.013
Neg logit contributions
 trigger-0.017
 hurts-0.015
ze-0.015
 lid-0.014
antly-0.014
Token is
Token ID318
Feature activation-2.47e+00

Pos logit contributions
 trigger+0.078
itch+0.065
 lid+0.063
eros+0.062
inkle+0.060
Neg logit contributions
 especially-0.041
 planned-0.038
 would-0.036
 always-0.036
 going-0.034
Token funny
Token ID8258
Feature activation-6.34e-01

Pos logit contributions
 trigger+0.078
antly+0.066
 lid+0.063
 hurts+0.060
ze+0.060
Neg logit contributions
 especially-0.065
 planned-0.060
 going-0.060
 spending-0.057
 whether-0.055
Token.
Token ID13
Feature activation-9.61e-01

Pos logit contributions
 trigger+0.018
 hurts+0.016
ze+0.015
 lid+0.015
 top+0.014
Neg logit contributions
 planned-0.016
 especially-0.016
 discovering-0.015
 spending-0.015
 whether-0.015
Token He
Token ID679
Feature activation-1.11e+00

Pos logit contributions
 trigger+0.031
 hurts+0.027
ze+0.026
inkle+0.025
 lid+0.025
Neg logit contributions
 especially-0.022
 planned-0.022
 whether-0.020
 spending-0.020
 discovering-0.019
Token spr
Token ID7500
Feature activation+5.84e+00

Pos logit contributions
 trigger+0.031
 hurts+0.026
ze+0.025
 lid+0.024
antly+0.024
Neg logit contributions
 especially-0.030
 planned-0.030
 discovering-0.027
 spending-0.027
 whether-0.026
Tokenays
Token ID592
Feature activation-1.64e-01

Pos logit contributions
 planned+0.170
 especially+0.169
 discovering+0.160
 spending+0.156
 whether+0.154
Neg logit contributions
 trigger-0.146
 hurts-0.128
ze-0.116
 lid-0.114
 top-0.111
Token Lily
Token ID20037
Feature activation-1.54e+00

Pos logit contributions
 trigger+0.005
ze+0.004
 hurts+0.004
inkle+0.004
itch+0.004
Neg logit contributions
 especially-0.004
 planned-0.004
 whether-0.003
 someday-0.003
 spending-0.003
Token again
Token ID757
Feature activation+5.81e-01

Pos logit contributions
 trigger+0.053
 hurts+0.042
 lid+0.041
inkle+0.041
antly+0.041
Neg logit contributions
 especially-0.032
 planned-0.031
 going-0.027
 discovering-0.027
 spending-0.026
Token with
Token ID351
Feature activation-1.32e+00

Pos logit contributions
 planned+0.016
 especially+0.016
 discovering+0.015
 spending+0.014
 whether+0.014
Neg logit contributions
 trigger-0.016
 hurts-0.014
ze-0.013
 lid-0.013
 cries-0.012
Token blue
Token ID4171
Feature activation+1.06e+00

Pos logit contributions
 trigger+0.041
 hurts+0.037
 AW+0.037
antly+0.036
�+0.036
Neg logit contributions
 especially-0.031
 planned-0.028
 whether-0.027
 spending-0.026
 going-0.026
Token things
Token ID1243
Feature activation+1.17e+00

Pos logit contributions
 planned+0.043
 especially+0.041
 someday+0.040
ust+0.040
 whether+0.040
Neg logit contributions
 trigger-0.035
 tears-0.033
 hurts-0.033
 top-0.031
 cries-0.031
Token to
Token ID284
Feature activation+4.46e-01

Pos logit contributions
 especially+0.033
 planned+0.032
 cultures+0.031
ust+0.031
 discovering+0.031
Neg logit contributions
 hurts-0.025
 trigger-0.025
 cries-0.022
ze-0.022
 breaks-0.022
Token explore
Token ID7301
Feature activation-1.08e-01

Pos logit contributions
 planned+0.010
ust+0.009
 spending+0.009
 especially+0.009
 discovering+0.009
Neg logit contributions
 trigger-0.013
 top-0.012
 hurts-0.012
ze-0.012
 tears-0.011
Token.
Token ID13
Feature activation-3.97e-02

Pos logit contributions
 trigger+0.002
 hurts+0.002
 top+0.002
ze+0.001
 lid+0.001
Neg logit contributions
 planned-0.004
 especially-0.003
 discovering-0.003
ride-0.003
 spending-0.003
Token She
Token ID1375
Feature activation+5.21e-01

Pos logit contributions
 trigger+0.001
 hurts+0.001
 lid+0.001
 top+0.001
 cries+0.001
Neg logit contributions
 planned-0.001
 especially-0.001
 discovering-0.001
 spending-0.001
 whether-0.001
Token kn
Token ID638
Feature activation+5.82e+00

Pos logit contributions
reating+0.015
 cultures+0.014
 whether+0.013
 spending+0.013
 discovering+0.013
Neg logit contributions
ze-0.011
 tears-0.011
 top-0.010
 cries-0.010
 hurts-0.010
Tokenelt
Token ID2120
Feature activation+1.73e+00

Pos logit contributions
 especially+0.138
 planned+0.131
 someday+0.124
 often+0.123
 spending+0.119
Neg logit contributions
 trigger-0.169
 hurts-0.149
antly-0.149
 lips-0.147
 cries-0.147
Token down
Token ID866
Feature activation+7.53e-01

Pos logit contributions
 planned+0.053
 especially+0.052
 discovering+0.050
 spending+0.049
 whether+0.048
Neg logit contributions
 trigger-0.038
 hurts-0.035
 lid-0.032
ze-0.031
 top-0.031
Token by
Token ID416
Feature activation+7.89e-01

Pos logit contributions
 cultures+0.019
 especially+0.019
ust+0.018
 appreciate+0.018
 planned+0.018
Neg logit contributions
 top-0.018
 trigger-0.018
 hurts-0.018
antly-0.017
 lid-0.016
Token a
Token ID257
Feature activation+5.90e-01

Pos logit contributions
 planned+0.022
 especially+0.022
ust+0.021
 cultures+0.020
 explorers+0.020
Neg logit contributions
 trigger-0.018
 hurts-0.017
 tears-0.017
 top-0.017
ze-0.016
Token big
Token ID1263
Feature activation+1.62e-01

Pos logit contributions
 especially+0.014
 planned+0.014
 discovering+0.013
 traveling+0.012
 seeing+0.012
Neg logit contributions
 trigger-0.018
 hurts-0.016
antly-0.015
ze-0.015
itch-0.014
Token thank
Token ID5875
Feature activation-2.28e-01

Pos logit contributions
 trigger+0.008
 hurts+0.006
 lid+0.006
ze+0.006
antly+0.006
Neg logit contributions
 planned-0.007
 especially-0.007
 spending-0.007
 discovering-0.007
 whether-0.006
Token you
Token ID345
Feature activation-4.22e-01

Pos logit contributions
 trigger+0.007
 hurts+0.006
 lid+0.006
ze+0.005
antly+0.005
Neg logit contributions
 especially-0.006
 planned-0.006
 spending-0.005
 discovering-0.005
 whether-0.005
Token.
Token ID13
Feature activation-8.14e-01

Pos logit contributions
 trigger+0.015
 hurts+0.013
ze+0.012
 lid+0.012
inkle+0.012
Neg logit contributions
 especially-0.008
 planned-0.008
 discovering-0.007
 spending-0.007
 whether-0.007
Token From
Token ID3574
Feature activation+1.49e+00

Pos logit contributions
 trigger+0.023
 hurts+0.019
ze+0.019
 lid+0.017
antly+0.017
Neg logit contributions
 planned-0.022
 especially-0.022
 discovering-0.021
 spending-0.020
 whether-0.020
Token that
Token ID326
Feature activation-2.02e+00

Pos logit contributions
 planned+0.036
 especially+0.036
 discovering+0.033
 spending+0.032
 whether+0.032
Neg logit contributions
 trigger-0.046
 hurts-0.041
ze-0.038
 lid-0.038
 cries-0.036
Token day
Token ID1110
Feature activation-5.11e+00

Pos logit contributions
 trigger+0.080
 hurts+0.070
ze+0.067
 lid+0.066
 cries+0.065
Neg logit contributions
 especially-0.034
 planned-0.033
 discovering-0.029
 spending-0.029
 whether-0.026
Token on
Token ID319
Feature activation-1.84e+00

Pos logit contributions
 trigger+0.148
 hurts+0.127
ze+0.119
 lid+0.116
 cries+0.116
Neg logit contributions
 planned-0.133
 especially-0.131
 spending-0.122
 discovering-0.121
 traveling-0.118
Token,
Token ID11
Feature activation+3.05e-01

Pos logit contributions
 trigger+0.063
ze+0.051
 hurts+0.049
 lid+0.047
antly+0.047
Neg logit contributions
 planned-0.042
 especially-0.040
 spending-0.039
 discovering-0.037
 often-0.037
Token Lily
Token ID20037
Feature activation-1.84e-01

Pos logit contributions
 planned+0.010
 especially+0.009
 discovering+0.009
 spending+0.009
 whether+0.009
Neg logit contributions
 trigger-0.007
 hurts-0.006
ze-0.005
 lid-0.005
 top-0.005
Token wore
Token ID12408
Feature activation+2.62e-01

Pos logit contributions
 trigger+0.006
 hurts+0.005
ze+0.005
antly+0.005
 top+0.005
Neg logit contributions
 planned-0.004
 especially-0.004
 discovering-0.004
 whether-0.004
 cultures-0.003
Token her
Token ID607
Feature activation+1.32e-01

Pos logit contributions
 especially+0.006
 planned+0.006
 discovering+0.005
 spending+0.005
 whether+0.005
Neg logit contributions
 trigger-0.009
 hurts-0.008
ze-0.007
 cries-0.007
 lid-0.007
Token of
Token ID286
Feature activation+6.99e-01

Pos logit contributions
 planned+0.008
 especially+0.008
 discovering+0.008
 whether+0.008
 cultures+0.007
Neg logit contributions
 trigger-0.016
 hurts-0.015
ze-0.014
 cries-0.014
 breaks-0.014
Token fun
Token ID1257
Feature activation-2.87e-02

Pos logit contributions
 planned+0.014
 especially+0.014
 discovering+0.013
 spending+0.012
 whether+0.012
Neg logit contributions
 trigger-0.024
 hurts-0.022
ze-0.020
 lid-0.020
 top-0.020
Token.
Token ID13
Feature activation-4.63e-01

Pos logit contributions
 trigger+0.001
 hurts+0.001
ze+0.001
 top+0.000
 cries+0.000
Neg logit contributions
 especially-0.001
 planned-0.001
 cultures-0.001
 whether-0.001
 discovering-0.001
Token From
Token ID3574
Feature activation+1.29e+00

Pos logit contributions
 trigger+0.014
 hurts+0.012
ze+0.011
 lid+0.011
antly+0.011
Neg logit contributions
 planned-0.012
 especially-0.012
 discovering-0.011
 spending-0.011
 whether-0.011
Token that
Token ID326
Feature activation-2.14e+00

Pos logit contributions
 planned+0.032
 especially+0.032
 discovering+0.030
 spending+0.029
 whether+0.029
Neg logit contributions
 trigger-0.038
 hurts-0.034
ze-0.032
 lid-0.031
antly-0.030
Token day
Token ID1110
Feature activation-5.09e+00

Pos logit contributions
 trigger+0.086
 hurts+0.076
ze+0.073
 lid+0.073
antly+0.071
Neg logit contributions
 especially-0.035
 planned-0.033
 discovering-0.030
 spending-0.029
 whether-0.027
Token on
Token ID319
Feature activation-1.95e+00

Pos logit contributions
 trigger+0.146
 hurts+0.126
ze+0.118
 lid+0.116
 cries+0.114
Neg logit contributions
 planned-0.132
 especially-0.130
 spending-0.121
 discovering-0.121
 traveling-0.118
Token,
Token ID11
Feature activation+3.10e-01

Pos logit contributions
 trigger+0.061
 hurts+0.050
ze+0.050
 lid+0.047
 cries+0.046
Neg logit contributions
 planned-0.050
 especially-0.049
 spending-0.046
 discovering-0.045
 traveling-0.043
Token Lily
Token ID20037
Feature activation-1.92e-01

Pos logit contributions
 planned+0.009
 especially+0.009
 discovering+0.008
 spending+0.008
 whether+0.008
Neg logit contributions
 trigger-0.008
 hurts-0.007
ze-0.006
 lid-0.006
 top-0.006
Token always
Token ID1464
Feature activation+9.97e-01

Pos logit contributions
 trigger+0.006
 hurts+0.006
ze+0.005
 cries+0.005
 top+0.005
Neg logit contributions
 planned-0.004
 especially-0.004
 discovering-0.004
 whether-0.004
 cultures-0.004
Token drew
Token ID9859
Feature activation-1.94e-01

Pos logit contributions
 especially+0.023
ait+0.022
ust+0.022
 whether+0.022
 planned+0.022
Neg logit contributions
 hurts-0.026
 trigger-0.025
 breaks-0.024
 tears-0.023
 cries-0.022
Token play
Token ID711
Feature activation-2.26e+00

Pos logit contributions
 planned+0.003
 especially+0.003
 whether+0.003
ust+0.003
and+0.003
Neg logit contributions
 trigger-0.004
 hurts-0.003
 top-0.003
ze-0.003
 breaks-0.003
Token together
Token ID1978
Feature activation-9.84e-02

Pos logit contributions
 trigger+0.084
 hurts+0.076
 lid+0.070
 cries+0.068
itch+0.068
Neg logit contributions
 especially-0.043
 planned-0.041
 spending-0.038
 discovering-0.038
 whether-0.035
Token.
Token ID13
Feature activation-5.26e-01

Pos logit contributions
 trigger+0.002
 hurts+0.002
 top+0.002
ze+0.002
 lid+0.002
Neg logit contributions
 planned-0.003
 especially-0.003
 discovering-0.003
 whether-0.003
 cultures-0.003
Token From
Token ID3574
Feature activation+1.05e+00

Pos logit contributions
 trigger+0.014
 hurts+0.013
ze+0.012
 lid+0.011
 top+0.011
Neg logit contributions
 planned-0.014
 especially-0.014
 discovering-0.013
 spending-0.013
 whether-0.013
Token that
Token ID326
Feature activation-2.20e+00

Pos logit contributions
 planned+0.026
 especially+0.025
 discovering+0.024
 spending+0.023
 whether+0.023
Neg logit contributions
 trigger-0.032
 hurts-0.028
ze-0.027
 lid-0.026
antly-0.025
Token day
Token ID1110
Feature activation-5.08e+00

Pos logit contributions
 trigger+0.088
 hurts+0.077
ze+0.074
 lid+0.074
antly+0.072
Neg logit contributions
 especially-0.036
 planned-0.035
 discovering-0.032
 spending-0.031
 traveling-0.029
Token on
Token ID319
Feature activation-2.10e+00

Pos logit contributions
 trigger+0.145
 hurts+0.125
ze+0.117
 lid+0.115
 cries+0.113
Neg logit contributions
 planned-0.133
 especially-0.131
 discovering-0.122
 spending-0.122
 traveling-0.119
Token,
Token ID11
Feature activation+4.55e-01

Pos logit contributions
 trigger+0.065
 hurts+0.053
ze+0.053
 lid+0.051
antly+0.048
Neg logit contributions
 planned-0.053
 especially-0.053
 spending-0.049
 discovering-0.048
 traveling-0.047
Token Lily
Token ID20037
Feature activation-4.16e-02

Pos logit contributions
 planned+0.014
 especially+0.013
 discovering+0.013
 cultures+0.013
 whether+0.013
Neg logit contributions
 trigger-0.010
 hurts-0.009
 top-0.008
ze-0.008
 breaks-0.008
Token and
Token ID290
Feature activation+1.05e+00

Pos logit contributions
 trigger+0.001
 hurts+0.001
ze+0.001
 cries+0.001
 breaks+0.001
Neg logit contributions
 planned-0.001
 especially-0.001
 discovering-0.001
 whether-0.001
 cultures-0.001
Token Tim
Token ID5045
Feature activation+1.17e+00

Pos logit contributions
 planned+0.027
 especially+0.027
 discovering+0.025
 whether+0.024
 spending+0.024
Neg logit contributions
 trigger-0.029
 hurts-0.027
ze-0.024
 cries-0.024
 top-0.024
Token her
Token ID607
Feature activation+3.89e-01

Pos logit contributions
 trigger+0.004
 AW+0.003
 hurts+0.003
 lid+0.003
antly+0.003
Neg logit contributions
 especially-0.002
 planned-0.002
 whether-0.001
 spending-0.001
 going-0.001
Token toys
Token ID14958
Feature activation+5.96e-01

Pos logit contributions
 especially+0.009
 planned+0.009
 traveling+0.008
 discovering+0.008
 going+0.008
Neg logit contributions
 trigger-0.013
antly-0.010
 hurts-0.010
 lid-0.010
 AW-0.010
Token.
Token ID13
Feature activation-5.35e-01

Pos logit contributions
 planned+0.018
 especially+0.018
 discovering+0.017
 cultures+0.017
 spending+0.016
Neg logit contributions
 trigger-0.014
 hurts-0.012
ze-0.011
 top-0.011
 lid-0.011
Token From
Token ID3574
Feature activation+1.28e+00

Pos logit contributions
 trigger+0.012
 hurts+0.011
ze+0.010
 lid+0.010
 cries+0.009
Neg logit contributions
 planned-0.016
 especially-0.016
 discovering-0.015
 spending-0.015
 whether-0.015
Token that
Token ID326
Feature activation-2.10e+00

Pos logit contributions
 planned+0.033
 especially+0.033
 discovering+0.030
 spending+0.030
 whether+0.029
Neg logit contributions
 trigger-0.037
 hurts-0.033
ze-0.030
 lid-0.030
 top-0.029
Token day
Token ID1110
Feature activation-5.08e+00

Pos logit contributions
 trigger+0.079
 hurts+0.070
ze+0.067
 lid+0.066
antly+0.065
Neg logit contributions
 especially-0.038
 planned-0.037
 discovering-0.033
 spending-0.032
 whether-0.031
Token on
Token ID319
Feature activation-1.77e+00

Pos logit contributions
 trigger+0.137
 hurts+0.120
ze+0.111
 lid+0.108
 cries+0.106
Neg logit contributions
 planned-0.138
 especially-0.137
 discovering-0.128
 spending-0.126
 whether-0.123
Token,
Token ID11
Feature activation+6.58e-01

Pos logit contributions
 trigger+0.055
 hurts+0.044
ze+0.044
 lid+0.042
 cries+0.040
Neg logit contributions
 planned-0.045
 especially-0.045
 spending-0.041
 discovering-0.041
 often-0.039
Token Lily
Token ID20037
Feature activation-4.77e-02

Pos logit contributions
 planned+0.020
 especially+0.019
 discovering+0.019
 cultures+0.019
 traveling+0.019
Neg logit contributions
 trigger-0.014
 hurts-0.013
 top-0.011
ze-0.011
 tears-0.011
Token always
Token ID1464
Feature activation+1.16e+00

Pos logit contributions
 trigger+0.001
 hurts+0.001
ze+0.001
 cries+0.001
antly+0.001
Neg logit contributions
 planned-0.001
 especially-0.001
 discovering-0.001
 cultures-0.001
 whether-0.001
Token carried
Token ID5281
Feature activation+4.43e-01

Pos logit contributions
 especially+0.028
ust+0.027
 cultures+0.027
 planned+0.027
ait+0.027
Neg logit contributions
 hurts-0.030
 trigger-0.027
ze-0.027
 breaks-0.026
 cries-0.025
Token still
Token ID991
Feature activation-7.03e-01

Pos logit contributions
 planned+0.003
 especially+0.003
 discovering+0.003
 cultures+0.003
 spending+0.003
Neg logit contributions
 trigger-0.004
 hurts-0.003
ze-0.003
 tears-0.003
 top-0.003
Token helpful
Token ID7613
Feature activation+3.04e-01

Pos logit contributions
 trigger+0.018
 hurts+0.017
ze+0.015
 lid+0.015
 top+0.015
Neg logit contributions
 especially-0.019
 planned-0.019
 discovering-0.018
 spending-0.017
 whether-0.017
Token.
Token ID13
Feature activation-8.92e-01

Pos logit contributions
 planned+0.007
 especially+0.007
 discovering+0.006
 spending+0.006
 whether+0.006
Neg logit contributions
 trigger-0.009
 hurts-0.009
ze-0.008
 lid-0.008
 cries-0.008
Token From
Token ID3574
Feature activation+1.60e+00

Pos logit contributions
 trigger+0.024
 hurts+0.022
ze+0.020
 lid+0.019
 top+0.019
Neg logit contributions
 planned-0.024
 especially-0.024
 discovering-0.022
 spending-0.022
 whether-0.022
Token that
Token ID326
Feature activation-1.78e+00

Pos logit contributions
 planned+0.038
 especially+0.037
 discovering+0.035
 spending+0.034
 whether+0.033
Neg logit contributions
 trigger-0.050
 hurts-0.045
ze-0.043
 lid-0.042
 cries-0.040
Token day
Token ID1110
Feature activation-5.08e+00

Pos logit contributions
 trigger+0.066
 hurts+0.059
ze+0.056
 lid+0.056
 cries+0.055
Neg logit contributions
 especially-0.032
 planned-0.032
 discovering-0.028
 spending-0.027
 whether-0.027
Token on
Token ID319
Feature activation-1.65e+00

Pos logit contributions
 trigger+0.143
 hurts+0.124
ze+0.117
 lid+0.113
 cries+0.113
Neg logit contributions
 planned-0.134
 especially-0.131
 discovering-0.122
 spending-0.122
 traveling-0.119
Token,
Token ID11
Feature activation+8.33e-01

Pos logit contributions
 trigger+0.055
ze+0.047
 hurts+0.044
 lid+0.043
 cries+0.042
Neg logit contributions
 planned-0.037
 especially-0.036
 spending-0.034
 discovering-0.033
 often-0.033
Token Tim
Token ID5045
Feature activation-1.12e-01

Pos logit contributions
 planned+0.024
 cultures+0.023
 discovering+0.023
 especially+0.023
ait+0.023
Neg logit contributions
 trigger-0.018
 hurts-0.017
 top-0.017
 breaks-0.016
 tears-0.015
Token started
Token ID2067
Feature activation+2.66e+00

Pos logit contributions
 trigger+0.004
 hurts+0.004
ze+0.003
antly+0.003
 cries+0.003
Neg logit contributions
 planned-0.002
 especially-0.002
 discovering-0.002
 spending-0.002
 whether-0.002
Token to
Token ID284
Feature activation+9.41e-01

Pos logit contributions
 planned+0.053
 especially+0.053
ust+0.051
ait+0.048
 someday+0.046
Neg logit contributions
 hurts-0.080
 trigger-0.080
ze-0.078
 tears-0.075
 cries-0.075
Token the
Token ID262
Feature activation-8.76e-01

Pos logit contributions
 especially+0.003
 planned+0.003
 spending+0.003
 discovering+0.003
 going+0.003
Neg logit contributions
 trigger-0.004
 hurts-0.004
ze-0.004
 lid-0.004
antly-0.003
Token best
Token ID1266
Feature activation+3.81e-01

Pos logit contributions
 trigger+0.024
 hurts+0.022
ze+0.020
 cries+0.020
antly+0.019
Neg logit contributions
 especially-0.024
 planned-0.023
 discovering-0.021
 whether-0.021
 spending-0.021
Token!"
Token ID2474
Feature activation-1.10e+00

Pos logit contributions
 planned+0.011
 especially+0.011
 discovering+0.010
 spending+0.010
 whether+0.010
Neg logit contributions
 trigger-0.010
 hurts-0.008
ze-0.008
 lid-0.008
 cries-0.007
Token From
Token ID3574
Feature activation+1.34e+00

Pos logit contributions
 trigger+0.034
 hurts+0.029
ze+0.028
 lid+0.027
antly+0.026
Neg logit contributions
 planned-0.027
 especially-0.027
 discovering-0.025
 spending-0.025
 whether-0.024
Token that
Token ID326
Feature activation-2.45e+00

Pos logit contributions
 planned+0.031
 especially+0.031
 discovering+0.029
 spending+0.028
 whether+0.027
Neg logit contributions
 trigger-0.042
 hurts-0.038
ze-0.036
 lid-0.035
antly-0.034
Token day
Token ID1110
Feature activation-5.06e+00

Pos logit contributions
 trigger+0.103
 hurts+0.089
ze+0.086
 lid+0.085
antly+0.084
Neg logit contributions
 especially-0.037
 planned-0.035
 discovering-0.031
 spending-0.030
 going-0.028
Token on
Token ID319
Feature activation-2.10e+00

Pos logit contributions
 trigger+0.141
 hurts+0.122
ze+0.114
 lid+0.111
 cries+0.110
Neg logit contributions
 planned-0.135
 especially-0.134
 discovering-0.124
 spending-0.124
 traveling-0.120
Token,
Token ID11
Feature activation+2.39e-01

Pos logit contributions
 trigger+0.075
ze+0.059
 hurts+0.057
 lid+0.056
 cries+0.055
Neg logit contributions
 planned-0.047
 especially-0.045
 spending-0.044
 going-0.041
 often-0.041
Token Lily
Token ID20037
Feature activation-6.88e-02

Pos logit contributions
 planned+0.007
 especially+0.007
 discovering+0.007
 cultures+0.007
 whether+0.007
Neg logit contributions
 trigger-0.005
 hurts-0.005
ze-0.004
 top-0.004
 lid-0.004
Token knew
Token ID2993
Feature activation+2.66e+00

Pos logit contributions
 trigger+0.002
 hurts+0.002
ze+0.002
antly+0.002
 top+0.002
Neg logit contributions
 planned-0.002
 especially-0.001
 discovering-0.001
 whether-0.001
 cultures-0.001
Token that
Token ID326
Feature activation-2.55e-01

Pos logit contributions
 especially+0.084
 cultures+0.084
 planned+0.083
adows+0.083
ashing+0.081
Neg logit contributions
 hurts-0.045
 top-0.043
 tears-0.040
 trigger-0.040
 cries-0.038
Token juice
Token ID13135
Feature activation-1.71e+00

Pos logit contributions
 trigger+0.051
antly+0.039
ze+0.038
 hurts+0.038
Craig+0.038
Neg logit contributions
 especially-0.041
 planned-0.039
 going-0.034
 spending-0.034
 discovering-0.032
Token happily
Token ID18177
Feature activation-7.83e-01

Pos logit contributions
 trigger+0.057
 hurts+0.051
ze+0.047
 cries+0.046
antly+0.046
Neg logit contributions
 especially-0.037
 planned-0.037
 often-0.031
 spending-0.030
 going-0.030
Token.
Token ID13
Feature activation-6.02e-01

Pos logit contributions
 trigger+0.024
 hurts+0.020
 lid+0.019
ze+0.019
antly+0.018
Neg logit contributions
 planned-0.020
 especially-0.020
 discovering-0.018
 spending-0.018
 someday-0.017
Token From
Token ID3574
Feature activation+1.45e+00

Pos logit contributions
 trigger+0.019
 hurts+0.016
ze+0.016
 lid+0.015
inkle+0.015
Neg logit contributions
 planned-0.014
 especially-0.014
 spending-0.013
 discovering-0.013
 whether-0.013
Token that
Token ID326
Feature activation-1.91e+00

Pos logit contributions
 planned+0.034
 especially+0.034
 discovering+0.031
 spending+0.030
 whether+0.030
Neg logit contributions
 trigger-0.046
 hurts-0.040
ze-0.039
 lid-0.038
antly-0.037
Token day
Token ID1110
Feature activation-5.06e+00

Pos logit contributions
 trigger+0.071
 hurts+0.064
ze+0.061
 lid+0.060
antly+0.059
Neg logit contributions
 especially-0.034
 planned-0.034
 discovering-0.030
 spending-0.029
 whether-0.028
Token on
Token ID319
Feature activation-1.93e+00

Pos logit contributions
 trigger+0.152
 hurts+0.130
ze+0.122
 lid+0.120
 cries+0.119
Neg logit contributions
 planned-0.125
 especially-0.123
 spending-0.115
 discovering-0.115
 traveling-0.113
Token,
Token ID11
Feature activation+5.63e-01

Pos logit contributions
 trigger+0.070
ze+0.056
 hurts+0.053
 lid+0.052
antly+0.052
Neg logit contributions
 planned-0.042
 especially-0.041
 spending-0.039
 often-0.037
 going-0.037
Token Lily
Token ID20037
Feature activation+1.56e-01

Pos logit contributions
 planned+0.016
 especially+0.016
 discovering+0.015
 spending+0.015
 whether+0.015
Neg logit contributions
 trigger-0.014
 hurts-0.013
ze-0.012
 lid-0.011
 top-0.011
Token loved
Token ID6151
Feature activation+2.58e+00

Pos logit contributions
 planned+0.003
 especially+0.003
 discovering+0.003
 whether+0.003
 spending+0.003
Neg logit contributions
 trigger-0.005
 hurts-0.005
ze-0.005
 lid-0.005
 cries-0.004
Token av
Token ID1196
Feature activation+9.45e-01

Pos logit contributions
ust+0.055
 planned+0.055
 cultures+0.053
packs+0.052
 especially+0.052
Neg logit contributions
 hurts-0.072
 trigger-0.070
ze-0.069
 tears-0.066
 breaks-0.066
Token loved
Token ID6151
Feature activation+1.33e+00

Pos logit contributions
 discovering+0.013
 planned+0.012
 especially+0.012
 spending+0.012
 whether+0.012
Neg logit contributions
 trigger-0.018
 hurts-0.017
ze-0.015
 tears-0.015
 breaks-0.015
Token Spot
Token ID15899
Feature activation-1.75e+00

Pos logit contributions
 planned+0.028
 especially+0.027
 discovering+0.025
 spending+0.024
 whether+0.024
Neg logit contributions
 trigger-0.044
 hurts-0.040
ze-0.038
 lid-0.037
 top-0.036
Token.
Token ID13
Feature activation-1.05e+00

Pos logit contributions
 trigger+0.056
 hurts+0.046
 lid+0.045
antly+0.043
ze+0.042
Neg logit contributions
 especially-0.045
 planned-0.044
 spending-0.040
 discovering-0.039
 someday-0.039
Token From
Token ID3574
Feature activation+9.88e-01

Pos logit contributions
 trigger+0.035
 hurts+0.030
ze+0.029
 lid+0.028
antly+0.027
Neg logit contributions
 planned-0.023
 especially-0.023
 spending-0.021
 discovering-0.021
 whether-0.020
Token that
Token ID326
Feature activation-2.06e+00

Pos logit contributions
 planned+0.023
 especially+0.022
 discovering+0.021
 spending+0.020
 traveling+0.020
Neg logit contributions
 trigger-0.032
 hurts-0.028
ze-0.027
 lid-0.026
antly-0.025
Token day
Token ID1110
Feature activation-5.05e+00

Pos logit contributions
 trigger+0.081
 hurts+0.072
ze+0.069
 lid+0.068
antly+0.067
Neg logit contributions
 especially-0.034
 planned-0.033
 discovering-0.030
 spending-0.029
 whether-0.027
Token on
Token ID319
Feature activation-1.71e+00

Pos logit contributions
 trigger+0.144
 hurts+0.124
ze+0.116
 lid+0.114
 cries+0.113
Neg logit contributions
 planned-0.132
 especially-0.130
 discovering-0.121
 spending-0.121
 traveling-0.118
Token,
Token ID11
Feature activation+6.56e-01

Pos logit contributions
 trigger+0.056
ze+0.046
 hurts+0.045
 lid+0.043
 cries+0.042
Neg logit contributions
 planned-0.041
 especially-0.040
 spending-0.038
 often-0.036
 discovering-0.036
Token Lily
Token ID20037
Feature activation+2.66e-02

Pos logit contributions
 especially+0.017
 planned+0.017
 discovering+0.015
 spending+0.015
 whether+0.015
Neg logit contributions
 trigger-0.019
 hurts-0.017
ze-0.016
 lid-0.016
antly-0.015
Token and
Token ID290
Feature activation+9.16e-01

Pos logit contributions
 planned+0.001
 especially+0.001
 discovering+0.001
 whether+0.001
 cultures+0.001
Neg logit contributions
 trigger-0.001
 hurts-0.001
ze-0.001
 cries-0.001
 top-0.001
Token Max
Token ID5436
Feature activation+9.29e-01

Pos logit contributions
 especially+0.029
 planned+0.028
 spending+0.027
 discovering+0.027
 going+0.027
Neg logit contributions
 trigger-0.021
 lid-0.017
 hurts-0.017
ze-0.017
antly-0.016
Token all
Token ID477
Feature activation+1.54e+00

Pos logit contributions
 planned+0.015
 especially+0.014
 discovering+0.013
 spending+0.013
 whether+0.012
Neg logit contributions
 trigger-0.025
 hurts-0.023
ze-0.021
 lid-0.021
 top-0.021
Token laughed
Token ID13818
Feature activation-4.18e-01

Pos logit contributions
 planned+0.032
 especially+0.032
 discovering+0.030
 spending+0.029
 whether+0.029
Neg logit contributions
 trigger-0.050
 hurts-0.046
ze-0.043
 lid-0.042
 cries-0.042
Token.
Token ID13
Feature activation-1.41e+00

Pos logit contributions
 trigger+0.015
 hurts+0.014
 lid+0.012
itch+0.012
inkle+0.012
Neg logit contributions
 especially-0.009
 planned-0.008
 spending-0.008
 discovering-0.007
 whether-0.007
Token From
Token ID3574
Feature activation+9.93e-01

Pos logit contributions
 trigger+0.049
ze+0.042
 hurts+0.042
inkle+0.040
 lid+0.039
Neg logit contributions
 planned-0.029
 especially-0.027
 spending-0.026
 discovering-0.025
 whether-0.024
Token that
Token ID326
Feature activation-2.14e+00

Pos logit contributions
 planned+0.021
 especially+0.021
 discovering+0.019
 spending+0.018
 seeing+0.018
Neg logit contributions
 trigger-0.035
 hurts-0.030
ze-0.029
 lid-0.029
antly-0.028
Token day
Token ID1110
Feature activation-5.05e+00

Pos logit contributions
 trigger+0.084
 hurts+0.074
ze+0.071
 lid+0.070
 cries+0.069
Neg logit contributions
 especially-0.036
 planned-0.036
 discovering-0.031
 spending-0.031
 traveling-0.029
Token on
Token ID319
Feature activation-2.20e+00

Pos logit contributions
 trigger+0.139
 hurts+0.120
ze+0.112
 lid+0.109
 cries+0.107
Neg logit contributions
 planned-0.136
 especially-0.134
 discovering-0.126
 spending-0.124
 traveling-0.121
Token,
Token ID11
Feature activation+4.19e-01

Pos logit contributions
 trigger+0.075
ze+0.062
 hurts+0.060
 lid+0.057
 cries+0.055
Neg logit contributions
 planned-0.051
 especially-0.049
 spending-0.047
 often-0.045
 discovering-0.044
Token Lily
Token ID20037
Feature activation-3.47e-01

Pos logit contributions
 planned+0.012
 especially+0.012
 discovering+0.011
 spending+0.011
 whether+0.011
Neg logit contributions
 trigger-0.011
 hurts-0.009
ze-0.008
 lid-0.008
antly-0.008
Token always
Token ID1464
Feature activation+8.95e-01

Pos logit contributions
 trigger+0.012
 hurts+0.011
ze+0.010
 lid+0.010
 cries+0.010
Neg logit contributions
 planned-0.007
 especially-0.007
 discovering-0.006
 whether-0.006
 spending-0.006
Token shared
Token ID4888
Feature activation+4.43e-01

Pos logit contributions
 especially+0.022
 planned+0.020
ust+0.020
 cultures+0.020
 whether+0.020
Neg logit contributions
 hurts-0.024
 trigger-0.023
 breaks-0.021
ze-0.021
 cries-0.021
Token very
Token ID845
Feature activation-2.38e+00

Pos logit contributions
 especially+0.002
 planned+0.002
 spending+0.002
 discovering+0.002
 going+0.002
Neg logit contributions
 trigger-0.003
 hurts-0.003
antly-0.003
 lid-0.003
ze-0.003
Token happy
Token ID3772
Feature activation-7.84e-01

Pos logit contributions
 trigger+0.074
 hurts+0.064
ze+0.060
antly+0.059
 lid+0.059
Neg logit contributions
 planned-0.058
 especially-0.058
 discovering-0.052
 spending-0.051
 whether-0.050
Token.
Token ID13
Feature activation-1.03e+00

Pos logit contributions
 trigger+0.018
 hurts+0.016
ze+0.015
 lid+0.014
 top+0.014
Neg logit contributions
 planned-0.024
 especially-0.024
 discovering-0.022
 whether-0.022
 cultures-0.022
Token From
Token ID3574
Feature activation+1.28e+00

Pos logit contributions
 trigger+0.031
 hurts+0.028
ze+0.026
 lid+0.025
antly+0.025
Neg logit contributions
 planned-0.025
 especially-0.024
 discovering-0.023
 spending-0.022
 whether-0.022
Token that
Token ID326
Feature activation-1.96e+00

Pos logit contributions
 planned+0.031
 especially+0.030
 discovering+0.028
 spending+0.027
 whether+0.027
Neg logit contributions
 trigger-0.040
 hurts-0.035
ze-0.034
 lid-0.032
antly-0.032
Token day
Token ID1110
Feature activation-5.05e+00

Pos logit contributions
 trigger+0.069
 hurts+0.063
ze+0.059
 lid+0.058
 top+0.057
Neg logit contributions
 planned-0.038
 especially-0.037
 discovering-0.034
 spending-0.033
 whether-0.032
Token on
Token ID319
Feature activation-2.03e+00

Pos logit contributions
 trigger+0.138
 hurts+0.120
ze+0.112
 lid+0.109
 cries+0.107
Neg logit contributions
 planned-0.136
 especially-0.135
 discovering-0.126
 spending-0.124
 traveling-0.121
Token,
Token ID11
Feature activation+5.49e-01

Pos logit contributions
 trigger+0.073
ze+0.058
 hurts+0.056
 lid+0.052
 cries+0.052
Neg logit contributions
 planned-0.045
 especially-0.044
 spending-0.041
 often-0.040
 discovering-0.040
Token Mia
Token ID32189
Feature activation+9.60e-02

Pos logit contributions
 planned+0.017
 especially+0.017
 discovering+0.016
 cultures+0.016
 traveling+0.016
Neg logit contributions
 trigger-0.011
 hurts-0.011
 top-0.010
ze-0.009
 lid-0.009
Token loved
Token ID6151
Feature activation+2.47e+00

Pos logit contributions
 planned+0.002
 especially+0.002
 discovering+0.002
 whether+0.002
 spending+0.002
Neg logit contributions
 trigger-0.003
 hurts-0.003
ze-0.003
 cries-0.003
 top-0.003
Token to
Token ID284
Feature activation+4.98e-02

Pos logit contributions
 planned+0.057
ust+0.056
 especially+0.055
 cultures+0.053
packs+0.052
Neg logit contributions
 hurts-0.065
 trigger-0.062
ze-0.060
 top-0.060
 tears-0.060
Token to
Token ID284
Feature activation-1.81e+00

Pos logit contributions
 planned+0.016
 especially+0.016
 discovering+0.015
 spending+0.014
 whether+0.014
Neg logit contributions
 trigger-0.019
 hurts-0.017
ze-0.016
 lid-0.015
 cries-0.015
Token me
Token ID502
Feature activation-7.32e-02

Pos logit contributions
 trigger+0.050
 hurts+0.046
ze+0.042
 top+0.041
 lid+0.041
Neg logit contributions
 planned-0.047
 especially-0.046
 discovering-0.043
 whether-0.042
 spending-0.041
Token."
Token ID526
Feature activation-1.49e+00

Pos logit contributions
 trigger+0.002
 hurts+0.002
 lid+0.002
ze+0.002
 cries+0.002
Neg logit contributions
 especially-0.002
 planned-0.002
 discovering-0.002
 spending-0.002
 whether-0.002
Token From
Token ID3574
Feature activation+1.04e+00

Pos logit contributions
 trigger+0.050
 hurts+0.042
ze+0.042
inkle+0.042
 lid+0.039
Neg logit contributions
 planned-0.035
 especially-0.033
 spending-0.031
 discovering-0.030
 often-0.029
Token that
Token ID326
Feature activation-2.12e+00

Pos logit contributions
 planned+0.023
 especially+0.023
 discovering+0.021
 spending+0.020
 traveling+0.020
Neg logit contributions
 trigger-0.035
 hurts-0.031
ze-0.030
 lid-0.029
antly-0.028
Token day
Token ID1110
Feature activation-5.05e+00

Pos logit contributions
 trigger+0.083
 hurts+0.073
ze+0.070
 lid+0.069
 cries+0.068
Neg logit contributions
 especially-0.035
 planned-0.035
 discovering-0.031
 spending-0.030
 whether-0.028
Token on
Token ID319
Feature activation-2.38e+00

Pos logit contributions
 trigger+0.139
 hurts+0.120
ze+0.113
 lid+0.110
 cries+0.108
Neg logit contributions
 planned-0.135
 especially-0.134
 discovering-0.125
 spending-0.124
 traveling-0.121
Token,
Token ID11
Feature activation-8.32e-02

Pos logit contributions
 trigger+0.089
ze+0.074
 lid+0.070
 hurts+0.070
 cries+0.069
Neg logit contributions
 planned-0.048
 spending-0.045
 especially-0.045
 often-0.043
 going-0.042
Token Max
Token ID5436
Feature activation-8.92e-01

Pos logit contributions
 trigger+0.002
 hurts+0.002
ze+0.002
 lid+0.002
antly+0.002
Neg logit contributions
 especially-0.003
 planned-0.002
 spending-0.002
 discovering-0.002
 whether-0.002
Token and
Token ID290
Feature activation+8.31e-01

Pos logit contributions
 trigger+0.029
 hurts+0.027
ze+0.025
 lid+0.025
 top+0.024
Neg logit contributions
 planned-0.018
 especially-0.018
 discovering-0.017
 spending-0.016
 whether-0.016
Token Lily
Token ID20037
Feature activation+1.11e+00

Pos logit contributions
 planned+0.025
 especially+0.024
 discovering+0.023
 spending+0.023
 whether+0.022
Neg logit contributions
 trigger-0.021
 hurts-0.018
ze-0.017
 lid-0.016
antly-0.016
Token well
Token ID880
Feature activation-1.08e+00

Pos logit contributions
 planned+0.003
 especially+0.003
 discovering+0.003
 spending+0.003
 cultures+0.003
Neg logit contributions
 trigger-0.002
 hurts-0.002
ze-0.002
 lid-0.002
 top-0.002
Token done
Token ID1760
Feature activation-2.35e+00

Pos logit contributions
 trigger+0.031
 hurts+0.024
 lid+0.024
ze+0.023
antly+0.022
Neg logit contributions
 planned-0.030
 especially-0.029
 spending-0.028
 discovering-0.027
 going-0.026
Token!"
Token ID2474
Feature activation-1.97e+00

Pos logit contributions
 trigger+0.087
 lid+0.066
rified+0.062
 sleeve+0.061
antly+0.061
Neg logit contributions
 spending-0.056
 going-0.053
 planned-0.052
 especially-0.049
 discovering-0.048
Token From
Token ID3574
Feature activation+1.11e+00

Pos logit contributions
 trigger+0.060
 hurts+0.050
ze+0.048
 lid+0.047
antly+0.046
Neg logit contributions
 planned-0.051
 especially-0.050
 spending-0.047
 discovering-0.046
 whether-0.044
Token that
Token ID326
Feature activation-2.47e+00

Pos logit contributions
 planned+0.025
 especially+0.024
 discovering+0.022
 spending+0.022
 whether+0.021
Neg logit contributions
 trigger-0.037
 hurts-0.032
ze-0.031
 lid-0.031
antly-0.030
Token day
Token ID1110
Feature activation-5.04e+00

Pos logit contributions
 trigger+0.102
 hurts+0.088
ze+0.086
 lid+0.084
antly+0.083
Neg logit contributions
 especially-0.039
 planned-0.038
 discovering-0.033
 spending-0.033
 going-0.030
Token on
Token ID319
Feature activation-2.41e+00

Pos logit contributions
 trigger+0.143
 hurts+0.123
ze+0.115
 lid+0.113
 cries+0.111
Neg logit contributions
 planned-0.133
 especially-0.131
 discovering-0.122
 spending-0.122
 traveling-0.118
Token,
Token ID11
Feature activation+2.32e-01

Pos logit contributions
 trigger+0.087
ze+0.070
 hurts+0.065
 lid+0.065
antly+0.065
Neg logit contributions
 planned-0.053
 especially-0.050
 spending-0.049
 going-0.047
 discovering-0.046
Token Lily
Token ID20037
Feature activation-2.09e-01

Pos logit contributions
 planned+0.007
 especially+0.007
 discovering+0.007
 cultures+0.007
 whether+0.007
Neg logit contributions
 trigger-0.005
 hurts-0.004
ze-0.004
 top-0.004
 lid-0.004
Token always
Token ID1464
Feature activation+8.43e-01

Pos logit contributions
 trigger+0.006
 hurts+0.006
ze+0.005
 lid+0.005
antly+0.005
Neg logit contributions
 planned-0.005
 especially-0.005
 discovering-0.004
 whether-0.004
 cultures-0.004
Token remembered
Token ID12086
Feature activation+2.69e+00

Pos logit contributions
 especially+0.021
 cultures+0.020
 whether+0.020
ust+0.020
 planned+0.020
Neg logit contributions
 hurts-0.022
 breaks-0.020
 trigger-0.019
 tears-0.018
ze-0.018
Token lemon
Token ID18873
Feature activation-1.14e+00

Pos logit contributions
 trigger+0.032
 hurts+0.027
ze+0.027
antly+0.027
itch+0.026
Neg logit contributions
 planned-0.034
 especially-0.033
 spending-0.030
 discovering-0.029
 going-0.029
Tokenade
Token ID671
Feature activation-1.29e+00

Pos logit contributions
 trigger+0.037
 hurts+0.032
ze+0.030
 lid+0.030
antly+0.030
Neg logit contributions
 especially-0.026
 planned-0.025
 spending-0.022
 discovering-0.022
 whether-0.021
Token.
Token ID13
Feature activation-7.70e-01

Pos logit contributions
 trigger+0.043
 hurts+0.038
ze+0.035
 lid+0.035
antly+0.035
Neg logit contributions
 especially-0.028
 planned-0.027
 discovering-0.024
 spending-0.024
 whether-0.023
Token From
Token ID3574
Feature activation+1.27e+00

Pos logit contributions
 trigger+0.021
 hurts+0.018
ze+0.017
 lid+0.016
antly+0.016
Neg logit contributions
 planned-0.021
 especially-0.021
 discovering-0.020
 spending-0.020
 whether-0.019
Token that
Token ID326
Feature activation-2.06e+00

Pos logit contributions
 planned+0.030
 especially+0.030
 discovering+0.028
 spending+0.027
 whether+0.027
Neg logit contributions
 trigger-0.040
 hurts-0.035
ze-0.033
 lid-0.033
antly-0.031
Token day
Token ID1110
Feature activation-5.04e+00

Pos logit contributions
 trigger+0.074
 hurts+0.067
ze+0.063
 lid+0.062
 cries+0.061
Neg logit contributions
 planned-0.038
 especially-0.038
 discovering-0.035
 spending-0.034
 whether-0.033
Token on
Token ID319
Feature activation-2.08e+00

Pos logit contributions
 trigger+0.138
 hurts+0.120
ze+0.112
 lid+0.109
 cries+0.107
Neg logit contributions
 planned-0.136
 especially-0.134
 discovering-0.126
 spending-0.124
 traveling-0.121
Token,
Token ID11
Feature activation+4.92e-01

Pos logit contributions
 trigger+0.063
ze+0.052
 hurts+0.051
 lid+0.049
 cries+0.047
Neg logit contributions
 planned-0.053
 especially-0.053
 spending-0.049
 discovering-0.049
 traveling-0.047
Token Tim
Token ID5045
Feature activation+3.40e-01

Pos logit contributions
 cultures+0.014
 planned+0.014
 discovering+0.013
rying+0.013
 especially+0.013
Neg logit contributions
 trigger-0.011
 hurts-0.010
 top-0.010
 breaks-0.009
 tears-0.009
Tokenmy
Token ID1820
Feature activation+1.05e-01

Pos logit contributions
 planned+0.007
 especially+0.007
 discovering+0.006
 spending+0.006
 whether+0.006
Neg logit contributions
 trigger-0.011
 hurts-0.011
ze-0.010
 lid-0.010
 cries-0.010
Token drank
Token ID24070
Feature activation+4.69e-01

Pos logit contributions
 planned+0.002
 especially+0.002
 discovering+0.002
 whether+0.002
 cultures+0.002
Neg logit contributions
 trigger-0.003
 hurts-0.003
ze-0.003
 cries-0.003
 top-0.003
Token doll
Token ID3654
Feature activation-9.75e-01

Pos logit contributions
 trigger+0.045
 hurts+0.037
ze+0.037
itch+0.036
antly+0.036
Neg logit contributions
 especially-0.037
 planned-0.036
 going-0.036
 someday-0.035
 traveling-0.034
Token tightly
Token ID17707
Feature activation+5.73e-02

Pos logit contributions
 trigger+0.032
 hurts+0.028
 AW+0.026
inkle+0.025
 breaks+0.024
Neg logit contributions
 especially-0.024
 often-0.021
 planned-0.021
 someday-0.021
 whether-0.020
Token.
Token ID13
Feature activation-9.30e-01

Pos logit contributions
 especially+0.001
 planned+0.001
 spending+0.001
 discovering+0.001
 often+0.001
Neg logit contributions
 trigger-0.002
 hurts-0.002
itch-0.002
 lid-0.002
inkle-0.002
Token From
Token ID3574
Feature activation+1.15e+00

Pos logit contributions
 trigger+0.027
 hurts+0.024
ze+0.022
 lid+0.022
 cries+0.021
Neg logit contributions
 especially-0.023
 planned-0.023
 discovering-0.022
 spending-0.021
 whether-0.021
Token that
Token ID326
Feature activation-2.19e+00

Pos logit contributions
 planned+0.029
 especially+0.028
 discovering+0.026
 spending+0.026
 whether+0.025
Neg logit contributions
 trigger-0.034
 hurts-0.030
ze-0.028
 lid-0.028
antly-0.027
Token day
Token ID1110
Feature activation-5.02e+00

Pos logit contributions
 trigger+0.083
 hurts+0.074
ze+0.071
 lid+0.070
 cries+0.069
Neg logit contributions
 especially-0.038
 planned-0.037
 discovering-0.033
 spending-0.032
 whether-0.031
Token on
Token ID319
Feature activation-1.88e+00

Pos logit contributions
 trigger+0.137
 hurts+0.119
ze+0.111
 lid+0.108
 cries+0.106
Neg logit contributions
 planned-0.135
 especially-0.134
 discovering-0.125
 spending-0.124
 traveling-0.121
Token,
Token ID11
Feature activation+6.06e-01

Pos logit contributions
 trigger+0.062
ze+0.049
 hurts+0.049
 lid+0.046
antly+0.045
Neg logit contributions
 planned-0.046
 especially-0.045
 spending-0.042
 discovering-0.042
 going-0.040
Token Lily
Token ID20037
Feature activation-1.14e-02

Pos logit contributions
 planned+0.020
 especially+0.019
 discovering+0.019
 cultures+0.018
 traveling+0.018
Neg logit contributions
 trigger-0.012
 hurts-0.011
 top-0.010
ze-0.009
 lid-0.009
Token made
Token ID925
Feature activation-1.27e+00

Pos logit contributions
 trigger+0.000
 hurts+0.000
ze+0.000
antly+0.000
 cries+0.000
Neg logit contributions
 planned-0.000
 especially-0.000
 discovering-0.000
 whether-0.000
 cultures-0.000
Token sure
Token ID1654
Feature activation+1.80e+00

Pos logit contributions
 trigger+0.047
 hurts+0.042
 lid+0.040
antly+0.039
inkle+0.038
Neg logit contributions
 especially-0.024
 planned-0.023
 spending-0.022
 discovering-0.021
 whether-0.020
Token day
Token ID1110
Feature activation-2.56e+00

Pos logit contributions
 planned+0.018
 especially+0.018
 discovering+0.016
 whether+0.016
 spending+0.016
Neg logit contributions
 trigger-0.025
 hurts-0.024
ze-0.022
 top-0.022
 cries-0.021
Token long
Token ID890
Feature activation-1.30e+00

Pos logit contributions
 trigger+0.070
 hurts+0.061
ze+0.056
 lid+0.055
antly+0.054
Neg logit contributions
 planned-0.070
 especially-0.070
 discovering-0.065
 spending-0.064
 whether-0.062
Token.
Token ID13
Feature activation-6.18e-01

Pos logit contributions
 trigger+0.036
 hurts+0.032
 lid+0.029
ze+0.029
antly+0.028
Neg logit contributions
 planned-0.035
 especially-0.035
 discovering-0.032
 spending-0.032
 whether-0.031
Token From
Token ID3574
Feature activation+1.34e+00

Pos logit contributions
 trigger+0.017
 hurts+0.015
ze+0.014
 lid+0.014
 top+0.014
Neg logit contributions
 planned-0.016
 especially-0.016
 discovering-0.015
 spending-0.015
 whether-0.014
Token that
Token ID326
Feature activation-2.39e+00

Pos logit contributions
 planned+0.035
 especially+0.035
 discovering+0.032
 spending+0.031
 whether+0.031
Neg logit contributions
 trigger-0.038
 hurts-0.034
ze-0.031
 lid-0.031
 top-0.030
Token day
Token ID1110
Feature activation-5.02e+00

Pos logit contributions
 trigger+0.087
 hurts+0.079
ze+0.075
 lid+0.074
 cries+0.072
Neg logit contributions
 especially-0.044
 planned-0.044
 discovering-0.039
 spending-0.038
 whether-0.037
Token on
Token ID319
Feature activation-2.12e+00

Pos logit contributions
 trigger+0.137
 hurts+0.119
ze+0.110
 lid+0.108
 cries+0.106
Neg logit contributions
 planned-0.136
 especially-0.134
 discovering-0.126
 spending-0.124
 traveling-0.121
Token,
Token ID11
Feature activation+5.16e-01

Pos logit contributions
 trigger+0.067
ze+0.054
 hurts+0.054
 lid+0.050
antly+0.049
Neg logit contributions
 planned-0.054
 especially-0.053
 spending-0.049
 discovering-0.049
 traveling-0.047
Token Lily
Token ID20037
Feature activation-7.02e-02

Pos logit contributions
 planned+0.016
 especially+0.015
 discovering+0.015
 spending+0.014
 cultures+0.014
Neg logit contributions
 trigger-0.012
 hurts-0.011
ze-0.010
 top-0.009
 lid-0.009
Token never
Token ID1239
Feature activation+1.80e+00

Pos logit contributions
 trigger+0.002
 hurts+0.002
ze+0.002
 cries+0.002
 breaks+0.002
Neg logit contributions
 planned-0.001
 especially-0.001
 discovering-0.001
 whether-0.001
 cultures-0.001
Token forgot
Token ID16453
Feature activation+2.30e+00

Pos logit contributions
 especially+0.042
ait+0.041
 cultures+0.041
 discovering+0.040
 whether+0.040
Neg logit contributions
 hurts-0.049
 trigger-0.045
 cries-0.043
 breaks-0.042
 tears-0.040
Token light
Token ID1657
Feature activation+2.60e-01

Pos logit contributions
 trigger+0.015
 hurts+0.013
ze+0.012
itch+0.012
 AW+0.012
Neg logit contributions
 especially-0.014
 planned-0.014
 someday-0.013
 discovering-0.013
 whether-0.013
Token bulb
Token ID28287
Feature activation+3.41e-01

Pos logit contributions
 planned+0.007
 especially+0.007
 discovering+0.006
 spending+0.006
 whether+0.006
Neg logit contributions
 trigger-0.007
 hurts-0.006
ze-0.006
 lid-0.006
 cries-0.005
Token.
Token ID13
Feature activation-5.61e-01

Pos logit contributions
 especially+0.010
 planned+0.010
 discovering+0.009
 spending+0.009
 whether+0.009
Neg logit contributions
 trigger-0.008
 hurts-0.008
ze-0.007
 lid-0.007
 top-0.006
Token From
Token ID3574
Feature activation+1.51e+00

Pos logit contributions
 trigger+0.014
 hurts+0.013
ze+0.012
 lid+0.012
 cries+0.011
Neg logit contributions
 especially-0.016
 planned-0.016
 discovering-0.015
 spending-0.014
 whether-0.014
Token that
Token ID326
Feature activation-2.03e+00

Pos logit contributions
 planned+0.038
 especially+0.038
 discovering+0.035
 spending+0.034
 whether+0.034
Neg logit contributions
 trigger-0.044
 hurts-0.039
ze-0.036
 lid-0.035
 top-0.035
Token day
Token ID1110
Feature activation-5.01e+00

Pos logit contributions
 trigger+0.079
 hurts+0.070
ze+0.067
 lid+0.066
 cries+0.065
Neg logit contributions
 especially-0.034
 planned-0.034
 discovering-0.030
 spending-0.029
 whether-0.027
Token on
Token ID319
Feature activation-1.94e+00

Pos logit contributions
 trigger+0.138
 hurts+0.120
ze+0.111
 lid+0.109
 cries+0.107
Neg logit contributions
 planned-0.135
 especially-0.134
 discovering-0.124
 spending-0.123
 traveling-0.120
Token,
Token ID11
Feature activation+5.29e-01

Pos logit contributions
 trigger+0.061
ze+0.050
 hurts+0.050
 lid+0.046
 cries+0.045
Neg logit contributions
 planned-0.049
 especially-0.047
 spending-0.044
 discovering-0.044
 often-0.042
Token Lily
Token ID20037
Feature activation+1.82e-01

Pos logit contributions
 planned+0.017
 especially+0.017
 discovering+0.016
 cultures+0.016
 whether+0.016
Neg logit contributions
 trigger-0.011
 hurts-0.010
 top-0.009
ze-0.008
 lid-0.008
Token knew
Token ID2993
Feature activation+2.85e+00

Pos logit contributions
 planned+0.004
 especially+0.004
 whether+0.003
 discovering+0.003
 cultures+0.003
Neg logit contributions
 hurts-0.005
 trigger-0.005
ze-0.005
antly-0.005
 breaks-0.005
Token the
Token ID262
Feature activation-2.35e-01

Pos logit contributions
 especially+0.095
 cultures+0.093
packs+0.092
adows+0.092
ashing+0.092
Neg logit contributions
 hurts-0.043
 top-0.040
 tears-0.039
 trigger-0.036
 wrong-0.036
Token with
Token ID351
Feature activation-1.30e+00

Pos logit contributions
 trigger+0.028
 hurts+0.025
ze+0.023
 lid+0.022
antly+0.022
Neg logit contributions
 especially-0.022
 planned-0.022
 discovering-0.020
 spending-0.020
 whether-0.020
Token him
Token ID683
Feature activation-1.43e+00

Pos logit contributions
 trigger+0.053
 AW+0.050
 hurts+0.049
 lid+0.047
inkle+0.045
Neg logit contributions
 especially-0.023
 planned-0.019
 spending-0.018
 seeing-0.017
 whether-0.016
Token.
Token ID13
Feature activation-7.56e-01

Pos logit contributions
 trigger+0.053
 hurts+0.045
 AW+0.043
 lid+0.042
inkle+0.042
Neg logit contributions
 especially-0.029
 planned-0.028
 often-0.026
 spending-0.026
 someday-0.025
Token From
Token ID3574
Feature activation+1.49e+00

Pos logit contributions
 trigger+0.023
 hurts+0.020
ze+0.019
 lid+0.019
antly+0.018
Neg logit contributions
 planned-0.018
 especially-0.018
 discovering-0.017
 spending-0.017
 whether-0.016
Token that
Token ID326
Feature activation-2.26e+00

Pos logit contributions
 planned+0.033
 especially+0.033
 discovering+0.030
 spending+0.029
 whether+0.029
Neg logit contributions
 trigger-0.049
 hurts-0.043
ze-0.042
 lid-0.041
 cries-0.039
Token day
Token ID1110
Feature activation-5.01e+00

Pos logit contributions
 trigger+0.093
 hurts+0.081
ze+0.078
 lid+0.077
 cries+0.076
Neg logit contributions
 especially-0.035
 planned-0.034
 discovering-0.030
 spending-0.029
 going-0.027
Token on
Token ID319
Feature activation-1.78e+00

Pos logit contributions
 trigger+0.148
 hurts+0.127
ze+0.120
 lid+0.117
 cries+0.116
Neg logit contributions
 planned-0.126
 especially-0.125
 spending-0.117
 discovering-0.116
 traveling-0.113
Token,
Token ID11
Feature activation+7.93e-01

Pos logit contributions
 trigger+0.063
ze+0.052
 hurts+0.049
 lid+0.048
 cries+0.046
Neg logit contributions
 planned-0.039
 especially-0.038
 spending-0.036
 often-0.034
 discovering-0.034
Token Lily
Token ID20037
Feature activation-5.91e-02

Pos logit contributions
 planned+0.024
 especially+0.024
 discovering+0.023
 spending+0.022
 whether+0.022
Neg logit contributions
 trigger-0.018
 hurts-0.016
ze-0.015
 lid-0.014
 top-0.014
Token knew
Token ID2993
Feature activation+2.95e+00

Pos logit contributions
 trigger+0.002
 hurts+0.002
ze+0.002
antly+0.002
 lid+0.002
Neg logit contributions
 planned-0.001
 especially-0.001
 discovering-0.001
 whether-0.001
 spending-0.001
Token she
Token ID673
Feature activation+1.19e-01

Pos logit contributions
 especially+0.093
 planned+0.091
 cultures+0.090
packs+0.088
ait+0.087
Neg logit contributions
 hurts-0.054
 trigger-0.050
 top-0.048
 tears-0.046
antly-0.044
Token her
Token ID607
Feature activation-9.54e-01

Pos logit contributions
 especially+0.008
 planned+0.007
 spending+0.007
 whether+0.006
 discovering+0.006
Neg logit contributions
 trigger-0.015
 hurts-0.013
 lid-0.013
ze-0.013
 AW-0.013
Token mom
Token ID1995
Feature activation-2.92e-01

Pos logit contributions
 trigger+0.025
 hurts+0.021
ze+0.020
antly+0.019
 cries+0.018
Neg logit contributions
 especially-0.027
 planned-0.027
 discovering-0.024
 whether-0.024
 traveling-0.024
Token.
Token ID13
Feature activation-9.54e-01

Pos logit contributions
 trigger+0.009
 hurts+0.008
ze+0.007
 AW+0.007
inkle+0.007
Neg logit contributions
 especially-0.007
 planned-0.007
 discovering-0.006
 someday-0.006
 spending-0.006
Token From
Token ID3574
Feature activation+1.10e+00

Pos logit contributions
 trigger+0.031
 hurts+0.027
ze+0.026
 lid+0.025
 top+0.025
Neg logit contributions
 planned-0.021
 especially-0.021
 discovering-0.019
 spending-0.019
 whether-0.018
Token that
Token ID326
Feature activation-1.83e+00

Pos logit contributions
 planned+0.028
 especially+0.027
 discovering+0.026
 spending+0.025
 whether+0.024
Neg logit contributions
 trigger-0.033
 hurts-0.029
ze-0.027
 lid-0.026
antly-0.026
Token day
Token ID1110
Feature activation-5.00e+00

Pos logit contributions
 trigger+0.058
 hurts+0.055
ze+0.051
 lid+0.050
 top+0.050
Neg logit contributions
 planned-0.039
 especially-0.038
 discovering-0.035
 whether-0.035
 cultures-0.034
Token on
Token ID319
Feature activation-1.96e+00

Pos logit contributions
 trigger+0.131
 hurts+0.115
ze+0.106
 lid+0.103
 cries+0.101
Neg logit contributions
 planned-0.139
 especially-0.138
 discovering-0.130
 spending-0.127
 whether-0.125
Token,
Token ID11
Feature activation+5.86e-01

Pos logit contributions
 trigger+0.065
ze+0.053
 hurts+0.052
 lid+0.049
 cries+0.048
Neg logit contributions
 planned-0.046
 especially-0.045
 spending-0.043
 discovering-0.042
 going-0.041
Token Lily
Token ID20037
Feature activation+4.01e-02

Pos logit contributions
 planned+0.016
 especially+0.016
 discovering+0.016
 spending+0.015
 whether+0.015
Neg logit contributions
 trigger-0.015
 hurts-0.013
ze-0.012
 top-0.012
 lid-0.012
Token made
Token ID925
Feature activation-1.05e+00

Pos logit contributions
 planned+0.001
 especially+0.001
 cultures+0.001
 whether+0.001
 discovering+0.001
Neg logit contributions
 hurts-0.001
 trigger-0.001
ze-0.001
antly-0.001
 breaks-0.001
Token sure
Token ID1654
Feature activation+1.84e+00

Pos logit contributions
 trigger+0.037
 hurts+0.034
 lid+0.031
antly+0.031
ze+0.030
Neg logit contributions
 especially-0.021
 planned-0.020
 discovering-0.019
 spending-0.019
 whether-0.018
Token more
Token ID517
Feature activation+9.65e-01

Pos logit contributions
 trigger+0.004
 hurts+0.004
ze+0.004
 lid+0.003
 cries+0.003
Neg logit contributions
 planned-0.005
 especially-0.005
 discovering-0.005
 spending-0.005
 whether-0.005
Token fun
Token ID1257
Feature activation-3.47e-02

Pos logit contributions
 planned+0.026
 especially+0.026
 discovering+0.025
 whether+0.025
 cultures+0.025
Neg logit contributions
 trigger-0.021
 hurts-0.021
ze-0.020
 cries-0.019
 top-0.018
Token.
Token ID13
Feature activation-4.90e-01

Pos logit contributions
 trigger+0.001
 hurts+0.001
ze+0.001
 cries+0.001
 top+0.001
Neg logit contributions
 especially-0.001
 planned-0.001
 whether-0.001
 discovering-0.001
ust-0.001
Token From
Token ID3574
Feature activation+1.16e+00

Pos logit contributions
 trigger+0.011
 hurts+0.011
 cries+0.010
 lid+0.009
ze+0.009
Neg logit contributions
 planned-0.014
 especially-0.014
 discovering-0.013
 cultures-0.013
 explorers-0.013
Token that
Token ID326
Feature activation-2.25e+00

Pos logit contributions
 planned+0.027
 especially+0.027
 discovering+0.025
 spending+0.024
 whether+0.024
Neg logit contributions
 trigger-0.037
 hurts-0.033
ze-0.031
 lid-0.030
antly-0.029
Token day
Token ID1110
Feature activation-5.00e+00

Pos logit contributions
 trigger+0.087
 hurts+0.077
ze+0.074
 lid+0.073
antly+0.072
Neg logit contributions
 especially-0.038
 planned-0.038
 discovering-0.034
 spending-0.033
 whether-0.031
Token on
Token ID319
Feature activation-1.94e+00

Pos logit contributions
 trigger+0.146
 hurts+0.125
ze+0.118
 lid+0.115
 cries+0.114
Neg logit contributions
 planned-0.129
 especially-0.126
 spending-0.117
 discovering-0.117
 traveling-0.114
Token,
Token ID11
Feature activation+3.55e-01

Pos logit contributions
 trigger+0.067
ze+0.053
 hurts+0.052
 lid+0.050
 cries+0.048
Neg logit contributions
 planned-0.046
 especially-0.044
 spending-0.042
 often-0.040
 discovering-0.040
Token Lily
Token ID20037
Feature activation-9.48e-02

Pos logit contributions
 planned+0.011
 especially+0.011
 discovering+0.011
 cultures+0.010
 whether+0.010
Neg logit contributions
 trigger-0.008
 hurts-0.007
ze-0.006
 top-0.006
 lid-0.006
Token knew
Token ID2993
Feature activation+2.76e+00

Pos logit contributions
 trigger+0.003
 hurts+0.003
ze+0.003
antly+0.003
 cries+0.003
Neg logit contributions
 planned-0.002
 especially-0.002
 discovering-0.002
 whether-0.002
 cultures-0.002
Token that
Token ID326
Feature activation+2.74e-01

Pos logit contributions
 especially+0.092
packs+0.091
ashing+0.091
 cultures+0.090
adows+0.089
Neg logit contributions
 hurts-0.042
 top-0.037
 tears-0.035
 wrong-0.034
 cries-0.033
Token big
Token ID1263
Feature activation-1.30e+00

Pos logit contributions
 trigger+0.064
ze+0.053
antly+0.052
 hurts+0.051
inkle+0.050
Neg logit contributions
 planned-0.053
 especially-0.047
ride-0.044
 appreciate-0.043
 going-0.042
Token hug
Token ID16225
Feature activation-7.40e-02

Pos logit contributions
 trigger+0.035
ze+0.031
antly+0.030
 hurts+0.029
inkle+0.027
Neg logit contributions
 planned-0.034
 especially-0.031
ride-0.030
 explorers-0.029
 wander-0.029
Token.
Token ID13
Feature activation-8.67e-01

Pos logit contributions
 trigger+0.002
 hurts+0.002
 lid+0.002
inkle+0.002
ze+0.002
Neg logit contributions
 planned-0.002
 especially-0.002
 discovering-0.002
 spending-0.002
 whether-0.002
Token From
Token ID3574
Feature activation+1.40e+00

Pos logit contributions
 trigger+0.029
 hurts+0.025
ze+0.025
 lid+0.024
antly+0.023
Neg logit contributions
 planned-0.019
 especially-0.018
 spending-0.017
 discovering-0.017
 whether-0.016
Token that
Token ID326
Feature activation-2.06e+00

Pos logit contributions
 planned+0.035
 especially+0.035
 discovering+0.032
 spending+0.031
 whether+0.031
Neg logit contributions
 trigger-0.041
 hurts-0.037
ze-0.034
 lid-0.034
antly-0.033
Token day
Token ID1110
Feature activation-4.99e+00

Pos logit contributions
 trigger+0.071
 hurts+0.065
ze+0.061
 lid+0.060
 top+0.059
Neg logit contributions
 planned-0.040
 especially-0.040
 discovering-0.036
 spending-0.035
 whether-0.035
Token on
Token ID319
Feature activation-1.82e+00

Pos logit contributions
 trigger+0.141
 hurts+0.121
ze+0.114
 lid+0.111
 cries+0.110
Neg logit contributions
 planned-0.131
 especially-0.129
 discovering-0.121
 spending-0.120
 traveling-0.117
Token,
Token ID11
Feature activation+6.03e-01

Pos logit contributions
 trigger+0.061
ze+0.050
 hurts+0.049
 lid+0.047
 cries+0.046
Neg logit contributions
 planned-0.043
 especially-0.042
 spending-0.039
 discovering-0.037
 often-0.037
Token Lily
Token ID20037
Feature activation-2.03e-01

Pos logit contributions
 especially+0.017
 planned+0.017
 spending+0.015
 discovering+0.015
 whether+0.015
Neg logit contributions
 trigger-0.016
 hurts-0.014
ze-0.013
 lid-0.013
antly-0.013
Token and
Token ID290
Feature activation+1.08e+00

Pos logit contributions
 hurts+0.006
 trigger+0.006
ze+0.006
antly+0.005
 breaks+0.005
Neg logit contributions
 planned-0.004
 especially-0.004
 cultures-0.004
 whether-0.004
 discovering-0.004
Token Mitt
Token ID16627
Feature activation+8.32e-01

Pos logit contributions
 especially+0.035
 planned+0.035
 discovering+0.033
 spending+0.033
 whether+0.032
Neg logit contributions
 trigger-0.024
 hurts-0.020
ze-0.019
 lid-0.019
antly-0.018
Token finished
Token ID5201
Feature activation+5.99e-01

Pos logit contributions
 trigger+0.052
 hurts+0.049
ze+0.045
antly+0.044
 lid+0.043
Neg logit contributions
 especially-0.059
 planned-0.058
 discovering-0.057
 cultures-0.056
 whether-0.056
Token her
Token ID607
Feature activation-2.65e-01

Pos logit contributions
 especially+0.014
 planned+0.014
 discovering+0.013
 whether+0.013
 cultures+0.012
Neg logit contributions
 trigger-0.018
 hurts-0.016
ze-0.015
 lid-0.015
 top-0.015
Token candy
Token ID18550
Feature activation-4.02e-01

Pos logit contributions
 trigger+0.008
 hurts+0.006
ze+0.006
antly+0.006
 lid+0.006
Neg logit contributions
 planned-0.007
 especially-0.007
 discovering-0.006
 spending-0.006
 traveling-0.006
Token,
Token ID11
Feature activation+5.99e-01

Pos logit contributions
 trigger+0.011
 hurts+0.010
ze+0.009
 lid+0.009
antly+0.009
Neg logit contributions
 planned-0.011
 especially-0.011
 discovering-0.010
 spending-0.010
 whether-0.009
Token she
Token ID673
Feature activation-3.33e-01

Pos logit contributions
 especially+0.012
 planned+0.012
 discovering+0.011
 spending+0.011
 whether+0.010
Neg logit contributions
 trigger-0.021
 hurts-0.019
ze-0.018
 lid-0.017
 cries-0.017
Token sc
Token ID629
Feature activation+3.88e+00

Pos logit contributions
 trigger+0.010
 hurts+0.009
ze+0.009
 lid+0.008
 cries+0.008
Neg logit contributions
 planned-0.008
 especially-0.008
 discovering-0.007
 spending-0.007
 whether-0.007
Tokenrun
Token ID5143
Feature activation-6.01e-01

Pos logit contributions
 especially+0.079
 planned+0.079
 discovering+0.072
 spending+0.070
 whether+0.068
Neg logit contributions
 trigger-0.131
 hurts-0.119
ze-0.112
 lid-0.110
 cries-0.108
Tokenched
Token ID1740
Feature activation-1.32e+00

Pos logit contributions
 trigger+0.015
 hurts+0.012
 lid+0.011
ze+0.011
 tears+0.011
Neg logit contributions
 planned-0.019
 especially-0.018
 appreciate-0.017
 whether-0.016
 someday-0.016
Token her
Token ID607
Feature activation-3.65e-01

Pos logit contributions
 trigger+0.048
 hurts+0.044
 AW+0.043
ggy+0.040
 explodes+0.040
Neg logit contributions
 especially-0.024
 if-0.022
 going-0.020
 even-0.020
 discovering-0.019
Token eyes
Token ID2951
Feature activation+3.04e-01

Pos logit contributions
 trigger+0.011
 hurts+0.009
 AW+0.009
antly+0.008
ze+0.008
Neg logit contributions
 especially-0.010
 planned-0.009
 going-0.009
 discovering-0.008
 traveling-0.008
Token tight
Token ID5381
Feature activation-1.21e+00

Pos logit contributions
 especially+0.007
 planned+0.007
 discovering+0.006
 spending+0.006
 whether+0.006
Neg logit contributions
 trigger-0.010
 hurts-0.009
ze-0.008
 lid-0.008
 cries-0.008
Token,
Token ID11
Feature activation+1.48e-01

Pos logit contributions
 planned+0.017
 especially+0.017
 discovering+0.016
 spending+0.016
 whether+0.015
Neg logit contributions
 trigger-0.012
 hurts-0.011
ze-0.010
 lid-0.010
 top-0.010
Token you
Token ID345
Feature activation+6.85e-01

Pos logit contributions
 planned+0.004
 especially+0.004
 discovering+0.004
 spending+0.003
 whether+0.003
Neg logit contributions
 trigger-0.004
 hurts-0.004
ze-0.003
 lid-0.003
 top-0.003
Token must
Token ID1276
Feature activation-1.09e-02

Pos logit contributions
 planned+0.016
 especially+0.016
 discovering+0.015
 spending+0.015
 whether+0.014
Neg logit contributions
 trigger-0.021
 hurts-0.019
ze-0.017
 lid-0.017
 top-0.017
Token answer
Token ID3280
Feature activation+1.11e+00

Pos logit contributions
 trigger+0.000
 hurts+0.000
ze+0.000
 top+0.000
 lid+0.000
Neg logit contributions
 planned-0.000
 especially-0.000
 discovering-0.000
 spending-0.000
 whether-0.000
Token a
Token ID257
Feature activation-5.94e-01

Pos logit contributions
 especially+0.026
 planned+0.026
 discovering+0.024
 whether+0.023
 spending+0.023
Neg logit contributions
 trigger-0.035
 hurts-0.031
ze-0.029
 lid-0.028
 cries-0.027
Token r
Token ID374
Feature activation+4.33e+00

Pos logit contributions
 trigger+0.017
 hurts+0.015
ze+0.014
inkle+0.013
antly+0.013
Neg logit contributions
 planned-0.015
 especially-0.015
 discovering-0.014
 whether-0.013
 traveling-0.013
Tokeniddle
Token ID2509
Feature activation+1.33e+00

Pos logit contributions
 planned+0.100
 especially+0.099
 discovering+0.093
 spending+0.091
 whether+0.089
Neg logit contributions
 trigger-0.133
 hurts-0.123
ze-0.115
 lid-0.113
antly-0.111
Token.
Token ID13
Feature activation+1.29e+00

Pos logit contributions
 especially+0.038
 planned+0.038
 discovering+0.036
 spending+0.035
 whether+0.035
Neg logit contributions
 trigger-0.031
 hurts-0.029
ze-0.026
 top-0.025
 cries-0.025
Token If
Token ID1002
Feature activation+2.40e-01

Pos logit contributions
 planned+0.037
 especially+0.036
 discovering+0.034
 spending+0.033
 cultures+0.032
Neg logit contributions
 trigger-0.032
 hurts-0.029
ze-0.026
 lid-0.026
antly-0.026
Token you
Token ID345
Feature activation+7.99e-01

Pos logit contributions
 especially+0.007
 planned+0.007
 discovering+0.006
 spending+0.006
 whether+0.006
Neg logit contributions
 trigger-0.006
 hurts-0.006
ze-0.005
 lid-0.005
 top-0.005
Token can
Token ID460
Feature activation+1.06e+00

Pos logit contributions
 especially+0.017
 planned+0.017
 discovering+0.016
 spending+0.016
 whether+0.015
Neg logit contributions
 trigger-0.025
 hurts-0.024
ze-0.022
 top-0.021
 cries-0.021
Token them
Token ID606
Feature activation+3.69e-01

Pos logit contributions
 trigger+0.049
 hurts+0.048
 AW+0.044
ze+0.044
inkle+0.043
Neg logit contributions
 planned-0.020
 especially-0.019
 going-0.018
 explorers-0.018
 if-0.018
Token to
Token ID284
Feature activation+6.22e-02

Pos logit contributions
 planned+0.007
 especially+0.007
 discovering+0.007
 spending+0.006
 whether+0.006
Neg logit contributions
 trigger-0.013
 hurts-0.012
ze-0.011
 lid-0.011
antly-0.010
Token their
Token ID511
Feature activation+6.75e-01

Pos logit contributions
 planned+0.002
 especially+0.002
 discovering+0.002
 spending+0.002
 whether+0.002
Neg logit contributions
 trigger-0.002
 hurts-0.001
ze-0.001
 lid-0.001
 top-0.001
Token room
Token ID2119
Feature activation+1.87e+00

Pos logit contributions
 especially+0.014
 planned+0.014
 explorers+0.013
 discovering+0.013
 cultures+0.012
Neg logit contributions
 trigger-0.023
 hurts-0.020
ze-0.019
inkle-0.019
antly-0.018
Token and
Token ID290
Feature activation-6.40e-01

Pos logit contributions
 planned+0.047
 especially+0.047
 discovering+0.044
 spending+0.043
 whether+0.043
Neg logit contributions
 trigger-0.051
 hurts-0.047
ze-0.044
 lid-0.044
 top-0.043
Token t
Token ID256
Feature activation+3.58e+00

Pos logit contributions
 trigger+0.019
 hurts+0.018
ze+0.016
 cries+0.016
 lid+0.016
Neg logit contributions
 planned-0.015
 especially-0.015
 discovering-0.014
 spending-0.014
 whether-0.014
Tokenucks
Token ID6238
Feature activation+2.24e+00

Pos logit contributions
 especially+0.077
 planned+0.076
 discovering+0.072
 whether+0.068
 spending+0.068
Neg logit contributions
 trigger-0.115
 hurts-0.106
ze-0.099
antly-0.099
 lid-0.096
Token them
Token ID606
Feature activation-9.71e-01

Pos logit contributions
 especially+0.034
 planned+0.033
 someday+0.031
 spending+0.029
 whether+0.028
Neg logit contributions
 trigger-0.085
 hurts-0.080
ze-0.077
 AW-0.076
inkle-0.073
Token in
Token ID287
Feature activation-6.45e-01

Pos logit contributions
 trigger+0.033
 hurts+0.029
 lid+0.027
ze+0.027
antly+0.026
Neg logit contributions
 planned-0.021
 especially-0.020
 discovering-0.019
 spending-0.018
 traveling-0.018
Token.
Token ID13
Feature activation+3.09e-01

Pos logit contributions
 trigger+0.017
 hurts+0.015
ze+0.014
 lid+0.014
antly+0.013
Neg logit contributions
 planned-0.018
 especially-0.018
 discovering-0.017
 spending-0.016
 whether-0.016
Token She
Token ID1375
Feature activation-1.35e-01

Pos logit contributions
 planned+0.008
 especially+0.008
 discovering+0.008
 cultures+0.007
ust+0.007
Neg logit contributions
 trigger-0.008
 hurts-0.007
 lid-0.007
 cries-0.007
 top-0.007
Token from
Token ID422
Feature activation-1.50e+00

Pos logit contributions
 planned+0.012
 especially+0.012
 discovering+0.011
 spending+0.010
 whether+0.010
Neg logit contributions
 trigger-0.020
 hurts-0.019
ze-0.018
 lid-0.017
 top-0.017
Token down
Token ID866
Feature activation-5.94e-01

Pos logit contributions
 trigger+0.041
 hurts+0.036
ze+0.033
 lid+0.033
 top+0.032
Neg logit contributions
 planned-0.040
 especially-0.040
 discovering-0.037
 spending-0.036
 whether-0.036
Token underground
Token ID11447
Feature activation+8.86e-01

Pos logit contributions
 trigger+0.017
 hurts+0.015
ze+0.014
 lid+0.013
 cries+0.013
Neg logit contributions
 planned-0.016
 especially-0.015
 discovering-0.014
 spending-0.014
 whether-0.014
Token.
Token ID13
Feature activation-2.25e-01

Pos logit contributions
 especially+0.027
 planned+0.026
 discovering+0.025
 spending+0.025
 whether+0.025
Neg logit contributions
 trigger-0.019
 hurts-0.018
 top-0.016
 lid-0.016
ze-0.015
Token She
Token ID1375
Feature activation-3.85e-01

Pos logit contributions
 trigger+0.007
 hurts+0.006
ze+0.005
 lid+0.005
inkle+0.005
Neg logit contributions
 planned-0.006
 especially-0.006
 discovering-0.005
 spending-0.005
 whether-0.005
Token pe
Token ID613
Feature activation+3.54e+00

Pos logit contributions
 trigger+0.012
 hurts+0.011
ze+0.010
 lid+0.010
 cries+0.010
Neg logit contributions
 planned-0.009
 especially-0.009
 discovering-0.008
 spending-0.008
 whether-0.008
Tokenered
Token ID1068
Feature activation+1.83e+00

Pos logit contributions
 especially+0.107
 planned+0.107
 discovering+0.101
 spending+0.099
 whether+0.097
Neg logit contributions
 trigger-0.083
 hurts-0.073
ze-0.067
 lid-0.065
antly-0.064
Token down
Token ID866
Feature activation-1.45e-01

Pos logit contributions
 planned+0.046
 especially+0.046
 discovering+0.043
 spending+0.042
 whether+0.041
Neg logit contributions
 trigger-0.053
 hurts-0.047
ze-0.044
 lid-0.043
 top-0.042
Token the
Token ID262
Feature activation+2.18e-01

Pos logit contributions
 trigger+0.005
 hurts+0.004
ze+0.004
 lid+0.004
 top+0.004
Neg logit contributions
 planned-0.003
 especially-0.003
 discovering-0.003
 spending-0.003
 whether-0.003
Token hole
Token ID7604
Feature activation+1.41e+00

Pos logit contributions
 planned+0.005
 especially+0.005
 discovering+0.005
 explorers+0.005
 traveling+0.005
Neg logit contributions
 trigger-0.006
 hurts-0.006
ze-0.005
itch-0.005
inkle-0.005
Token,
Token ID11
Feature activation+1.32e+00

Pos logit contributions
 planned+0.034
 especially+0.034
 discovering+0.032
 spending+0.031
 whether+0.030
Neg logit contributions
 trigger-0.041
 hurts-0.037
ze-0.034
 lid-0.034
 top-0.034
Token her
Token ID607
Feature activation+1.01e+00

Pos logit contributions
 planned+0.008
 especially+0.007
 discovering+0.007
 cultures+0.007
ust+0.007
Neg logit contributions
 trigger-0.009
 hurts-0.009
ze-0.008
 tears-0.008
 top-0.008
Token scarf
Token ID36953
Feature activation-4.80e-01

Pos logit contributions
 planned+0.025
 especially+0.025
 discovering+0.024
 spending+0.023
 traveling+0.022
Neg logit contributions
 trigger-0.030
 hurts-0.027
antly-0.025
ze-0.024
 lid-0.023
Token and
Token ID290
Feature activation-5.79e-03

Pos logit contributions
 trigger+0.013
 hurts+0.012
ze+0.011
 lid+0.011
 cries+0.010
Neg logit contributions
 planned-0.013
 especially-0.012
 discovering-0.011
 spending-0.011
 whether-0.011
Token asked
Token ID1965
Feature activation+6.20e-01

Pos logit contributions
 trigger+0.000
 hurts+0.000
ze+0.000
 lid+0.000
 cries+0.000
Neg logit contributions
 planned-0.000
 especially-0.000
 discovering-0.000
 whether-0.000
 spending-0.000
Token her
Token ID607
Feature activation+1.41e+00

Pos logit contributions
 planned+0.014
 especially+0.014
 discovering+0.013
 spending+0.013
 whether+0.013
Neg logit contributions
 trigger-0.019
 hurts-0.017
ze-0.016
 lid-0.015
 top-0.015
Token parents
Token ID3397
Feature activation+4.65e+00

Pos logit contributions
 especially+0.045
 planned+0.045
 discovering+0.042
 spending+0.042
 whether+0.041
Neg logit contributions
 trigger-0.029
 hurts-0.026
 top-0.023
ze-0.023
 lid-0.023
Token,
Token ID11
Feature activation+2.98e+00

Pos logit contributions
 planned+0.084
 especially+0.079
 explorers+0.076
 discovering+0.074
asured+0.074
Neg logit contributions
 trigger-0.146
 top-0.133
 hurts-0.131
 lid-0.127
antly-0.125
Token "
Token ID366
Feature activation+2.39e+00

Pos logit contributions
 planned+0.080
 cultures+0.079
 especially+0.077
ust+0.073
packs+0.073
Neg logit contributions
 trigger-0.066
 top-0.059
ze-0.059
 hurts-0.059
 stern-0.057
TokenCan
Token ID6090
Feature activation-1.35e-01

Pos logit contributions
 planned+0.063
 discovering+0.059
 cultures+0.058
 explorers+0.058
 traveling+0.057
Neg logit contributions
 trigger-0.061
 top-0.053
antly-0.053
ze-0.052
 lid-0.050
Token I
Token ID314
Feature activation+4.46e-01

Pos logit contributions
 trigger+0.004
 hurts+0.004
 top+0.003
ze+0.003
 lid+0.003
Neg logit contributions
 planned-0.003
 discovering-0.003
 especially-0.003
 cultures-0.003
 spending-0.003
Token go
Token ID467
Feature activation-1.40e-01

Pos logit contributions
 cultures+0.011
 discovering+0.011
 planned+0.010
 spending+0.010
 explorers+0.010
Neg logit contributions
 hurts-0.011
 trigger-0.011
 top-0.011
ze-0.010
�-0.010
Token was
Token ID373
Feature activation-2.29e-01

Pos logit contributions
 trigger+0.054
 hurts+0.044
 lid+0.043
antly+0.041
ze+0.041
Neg logit contributions
 planned-0.030
 especially-0.030
 often-0.025
 always-0.025
 discovering-0.024
Token not
Token ID407
Feature activation+2.04e-01

Pos logit contributions
 trigger+0.006
 hurts+0.005
ze+0.005
 lid+0.005
 top+0.005
Neg logit contributions
 planned-0.006
 especially-0.006
 discovering-0.006
 spending-0.006
 whether-0.006
Token fearful
Token ID26844
Feature activation-5.96e-01

Pos logit contributions
 especially+0.005
 planned+0.005
 discovering+0.005
 cultures+0.005
 spending+0.005
Neg logit contributions
 trigger-0.005
 hurts-0.005
ze-0.005
 cries-0.004
 tears-0.004
Token.
Token ID13
Feature activation+1.19e-01

Pos logit contributions
 trigger+0.014
 hurts+0.013
ze+0.011
 cries+0.011
 lid+0.011
Neg logit contributions
 planned-0.018
 especially-0.018
 discovering-0.017
 spending-0.017
 whether-0.016
Token She
Token ID1375
Feature activation-4.29e-01

Pos logit contributions
 especially+0.002
 planned+0.002
 discovering+0.002
 spending+0.002
 whether+0.002
Neg logit contributions
 trigger-0.004
 hurts-0.004
ze-0.004
inkle-0.004
antly-0.004
Token liked
Token ID8288
Feature activation+2.00e+00

Pos logit contributions
 trigger+0.014
 hurts+0.013
ze+0.012
 lid+0.011
 cries+0.011
Neg logit contributions
 planned-0.009
 especially-0.009
 discovering-0.009
 spending-0.008
 whether-0.008
Token storms
Token ID20228
Feature activation+1.59e-01

Pos logit contributions
 planned+0.057
 especially+0.056
ust+0.054
packs+0.053
ride+0.052
Neg logit contributions
 hurts-0.042
 trigger-0.042
 tears-0.038
 cries-0.038
ze-0.037
Token.
Token ID13
Feature activation+2.69e-01

Pos logit contributions
 planned+0.005
 especially+0.004
 discovering+0.004
 spending+0.004
 whether+0.004
Neg logit contributions
 trigger-0.004
 hurts-0.004
ze-0.003
 cries-0.003
 lid-0.003
Token She
Token ID1375
Feature activation+2.72e-01

Pos logit contributions
 especially+0.005
 planned+0.005
 discovering+0.005
 spending+0.005
 whether+0.004
Neg logit contributions
 trigger-0.009
 hurts-0.009
ze-0.008
 lid-0.008
antly-0.008
Token said
Token ID531
Feature activation-6.74e-02

Pos logit contributions
 discovering+0.006
 spending+0.006
 planned+0.006
 especially+0.006
 whether+0.006
Neg logit contributions
 trigger-0.008
 hurts-0.008
 cries-0.007
ze-0.007
 tears-0.007
Token to
Token ID284
Feature activation-1.80e+00

Pos logit contributions
 trigger+0.002
 hurts+0.002
 lid+0.002
ze+0.002
antly+0.002
Neg logit contributions
 planned-0.002
 especially-0.002
 spending-0.002
 discovering-0.002
 whether-0.001
Token to
Token ID284
Feature activation-1.00e+00

Pos logit contributions
 trigger+0.006
 hurts+0.006
ze+0.006
 top+0.006
 lid+0.006
Neg logit contributions
 planned-0.003
 especially-0.003
 discovering-0.003
 cultures-0.003
 spending-0.003
Token play
Token ID711
Feature activation-2.94e+00

Pos logit contributions
 trigger+0.028
 hurts+0.026
ze+0.024
 top+0.023
 lid+0.023
Neg logit contributions
 planned-0.026
 especially-0.025
 discovering-0.024
 spending-0.023
 whether-0.023
Token with
Token ID351
Feature activation-1.37e+00

Pos logit contributions
 trigger+0.105
 hurts+0.091
 lid+0.085
itch+0.085
 cries+0.084
Neg logit contributions
 especially-0.065
 planned-0.059
 spending-0.055
 discovering-0.055
 someday-0.055
Token you
Token ID345
Feature activation-6.71e-01

Pos logit contributions
 trigger+0.045
 hurts+0.037
 lid+0.036
inkle+0.036
antly+0.035
Neg logit contributions
 especially-0.033
 planned-0.032
 discovering-0.029
 whether-0.029
 spending-0.029
Token."
Token ID526
Feature activation-1.07e+00

Pos logit contributions
 trigger+0.019
 hurts+0.017
ze+0.015
 lid+0.015
antly+0.015
Neg logit contributions
 especially-0.018
 planned-0.018
 discovering-0.016
 spending-0.016
 whether-0.016
Token The
Token ID383
Feature activation+1.21e+00

Pos logit contributions
 trigger+0.028
 hurts+0.024
ze+0.023
inkle+0.022
 lid+0.021
Neg logit contributions
 planned-0.031
 especially-0.031
 discovering-0.029
 spending-0.028
 whether-0.028
Token man
Token ID582
Feature activation-5.90e-01

Pos logit contributions
 especially+0.032
 planned+0.031
 explorers+0.029
 traveling+0.029
 discovering+0.028
Neg logit contributions
 trigger-0.032
 hurts-0.029
 AW-0.028
antly-0.028
inkle-0.027
Token was
Token ID373
Feature activation+1.18e-01

Pos logit contributions
 trigger+0.019
 hurts+0.016
 lid+0.015
ze+0.014
itch+0.014
Neg logit contributions
 especially-0.015
 planned-0.014
 discovering-0.013
 spending-0.012
 whether-0.012
Token so
Token ID523
Feature activation-1.06e+00

Pos logit contributions
 especially+0.003
 planned+0.003
 discovering+0.003
 spending+0.003
 traveling+0.002
Neg logit contributions
 trigger-0.004
 hurts-0.003
 lid-0.003
antly-0.003
itch-0.003
Token happy
Token ID3772
Feature activation+2.91e-01

Pos logit contributions
 trigger+0.036
 hurts+0.031
 lid+0.030
ze+0.030
antly+0.029
Neg logit contributions
 planned-0.023
 especially-0.023
 discovering-0.021
 spending-0.020
 traveling-0.019
Token to
Token ID284
Feature activation-1.22e+00

Pos logit contributions
 especially+0.007
 planned+0.007
 cultures+0.006
 discovering+0.006
ust+0.006
Neg logit contributions
 trigger-0.008
 hurts-0.008
ze-0.007
 cries-0.007
 top-0.007
Token key
Token ID1994
Feature activation+1.28e+00

Pos logit contributions
 trigger+0.011
 hurts+0.010
ze+0.009
antly+0.009
 lid+0.009
Neg logit contributions
 planned-0.008
 especially-0.008
 discovering-0.007
 whether-0.007
 spending-0.007
Token as
Token ID355
Feature activation+8.71e-01

Pos logit contributions
 planned+0.031
 especially+0.031
 discovering+0.029
 spending+0.028
 whether+0.027
Neg logit contributions
 trigger-0.038
 hurts-0.034
ze-0.031
 lid-0.031
 top-0.030
Token a
Token ID257
Feature activation-3.20e-01

Pos logit contributions
 planned+0.025
 especially+0.024
 discovering+0.022
 spending+0.022
 whether+0.022
Neg logit contributions
 trigger-0.023
 hurts-0.020
ze-0.019
 lid-0.019
antly-0.018
Token gift
Token ID6979
Feature activation+5.02e-01

Pos logit contributions
 trigger+0.008
 hurts+0.007
ze+0.007
inkle+0.007
antly+0.007
Neg logit contributions
 planned-0.009
 especially-0.009
 discovering-0.008
 whether-0.008
 spending-0.008
Token.
Token ID13
Feature activation-2.07e-01

Pos logit contributions
 planned+0.013
 especially+0.013
 discovering+0.012
 spending+0.012
 whether+0.012
Neg logit contributions
 trigger-0.014
 hurts-0.012
ze-0.011
 lid-0.011
 cries-0.011
Token The
Token ID383
Feature activation+1.12e+00

Pos logit contributions
 trigger+0.006
 hurts+0.005
ze+0.005
inkle+0.005
 lid+0.005
Neg logit contributions
 planned-0.005
 especially-0.005
 spending-0.005
 discovering-0.005
 whether-0.005
Token boy
Token ID2933
Feature activation-7.03e-01

Pos logit contributions
 planned+0.028
 especially+0.028
 explorers+0.025
 discovering+0.025
 whether+0.024
Neg logit contributions
 trigger-0.031
 hurts-0.029
ze-0.028
inkle-0.028
antly-0.028
Token said
Token ID531
Feature activation+3.80e-02

Pos logit contributions
 trigger+0.026
 hurts+0.021
 AW+0.021
inkle+0.021
itch+0.020
Neg logit contributions
 especially-0.015
 planned-0.014
 always-0.013
 often-0.012
 visited-0.011
Token that
Token ID326
Feature activation+3.47e-01

Pos logit contributions
 especially+0.001
 planned+0.001
 spending+0.001
 going+0.001
 whether+0.001
Neg logit contributions
 trigger-0.001
 hurts-0.001
inkle-0.001
 lid-0.001
antly-0.001
Token he
Token ID339
Feature activation-2.99e-01

Pos logit contributions
 planned+0.010
 especially+0.009
 discovering+0.009
ust+0.009
 cultures+0.009
Neg logit contributions
 trigger-0.008
 hurts-0.008
 tears-0.007
ze-0.007
 top-0.007
Token wanted
Token ID2227
Feature activation+2.14e-01

Pos logit contributions
 trigger+0.011
 hurts+0.010
 lid+0.010
ze+0.009
 top+0.009
Neg logit contributions
 planned-0.005
 especially-0.005
 discovering-0.004
 spending-0.004
 whether-0.004
Tokeno
Token ID78
Feature activation-2.30e-01

Pos logit contributions
 especially+0.011
 planned+0.011
 discovering+0.010
 spending+0.010
 whether+0.010
Neg logit contributions
 trigger-0.020
 hurts-0.018
ze-0.017
 lid-0.017
antly-0.016
Token stretched
Token ID19110
Feature activation+9.65e-01

Pos logit contributions
 trigger+0.008
 hurts+0.007
ze+0.007
 lid+0.006
 top+0.006
Neg logit contributions
 planned-0.005
 especially-0.005
 discovering-0.004
 spending-0.004
 whether-0.004
Token his
Token ID465
Feature activation-7.69e-01

Pos logit contributions
 especially+0.028
 planned+0.028
 discovering+0.026
 spending+0.026
 whether+0.025
Neg logit contributions
 trigger-0.024
 hurts-0.021
ze-0.019
 lid-0.019
 cries-0.018
Token fins
Token ID42080
Feature activation+1.47e+00

Pos logit contributions
 trigger+0.019
 hurts+0.015
ze+0.015
inkle+0.014
antly+0.014
Neg logit contributions
 especially-0.022
 planned-0.021
 explorers-0.020
 traveling-0.020
 discovering-0.020
Token and
Token ID290
Feature activation-1.05e+00

Pos logit contributions
 planned+0.032
 especially+0.032
 discovering+0.029
 spending+0.028
 whether+0.028
Neg logit contributions
 trigger-0.047
 hurts-0.043
ze-0.040
 lid-0.040
 top-0.039
Token sw
Token ID1509
Feature activation+1.50e+00

Pos logit contributions
 trigger+0.033
 hurts+0.030
ze+0.028
 lid+0.028
 cries+0.027
Neg logit contributions
 planned-0.023
 especially-0.023
 discovering-0.021
 spending-0.021
 whether-0.020
Tokenam
Token ID321
Feature activation+4.88e-01

Pos logit contributions
 planned+0.030
 especially+0.029
 discovering+0.026
 whether+0.025
 spending+0.025
Neg logit contributions
 trigger-0.051
 hurts-0.046
 lid-0.043
 top-0.042
 cries-0.042
Token towards
Token ID3371
Feature activation-7.22e-01

Pos logit contributions
 especially+0.012
 planned+0.011
 discovering+0.010
 whether+0.010
 often+0.010
Neg logit contributions
 trigger-0.015
 hurts-0.014
ze-0.013
 lid-0.013
inkle-0.013
Token the
Token ID262
Feature activation-1.98e-01

Pos logit contributions
 trigger+0.020
 hurts+0.018
ze+0.017
 lid+0.016
antly+0.016
Neg logit contributions
 especially-0.019
 planned-0.019
 discovering-0.018
 spending-0.017
 whether-0.017
Token scary
Token ID14343
Feature activation-9.30e-01

Pos logit contributions
 trigger+0.005
antly+0.005
ze+0.005
 AW+0.005
inkle+0.005
Neg logit contributions
 especially-0.005
 planned-0.005
 explorers-0.005
 discovering-0.004
 going-0.004
Token fish
Token ID5916
Feature activation-4.17e-01

Pos logit contributions
 trigger+0.026
 hurts+0.023
ze+0.022
antly+0.022
itch+0.022
Neg logit contributions
 especially-0.023
 planned-0.023
 explorers-0.022
 discovering-0.021
 seeing-0.020
Token,
Token ID11
Feature activation+8.67e-01

Pos logit contributions
 planned+0.007
 especially+0.007
 discovering+0.007
 whether+0.007
 spending+0.007
Neg logit contributions
 trigger-0.007
 hurts-0.006
ze-0.005
 lid-0.005
 top-0.005
Token and
Token ID290
Feature activation-8.96e-01

Pos logit contributions
 planned+0.021
 especially+0.020
 discovering+0.019
 cultures+0.019
 whether+0.019
Neg logit contributions
 trigger-0.025
 hurts-0.023
ze-0.022
 top-0.022
 lid-0.022
Token yellow
Token ID7872
Feature activation-3.45e-01

Pos logit contributions
 trigger+0.022
 hurts+0.020
 top+0.018
ze+0.018
 lid+0.018
Neg logit contributions
 planned-0.025
 especially-0.025
 discovering-0.024
 whether-0.023
 cultures-0.023
Token.
Token ID13
Feature activation-1.15e+00

Pos logit contributions
 trigger+0.009
 hurts+0.008
ze+0.007
 lid+0.007
antly+0.007
Neg logit contributions
 planned-0.010
 especially-0.010
 discovering-0.009
 spending-0.009
 whether-0.009
Token Lily
Token ID20037
Feature activation-2.43e-01

Pos logit contributions
 trigger+0.036
 hurts+0.032
ze+0.031
inkle+0.031
antly+0.030
Neg logit contributions
 planned-0.027
 especially-0.026
 spending-0.024
 discovering-0.023
 whether-0.023
Token had
Token ID550
Feature activation+8.95e-01

Pos logit contributions
 hurts+0.007
 trigger+0.007
ze+0.007
 cries+0.007
 tears+0.007
Neg logit contributions
 discovering-0.005
 spending-0.005
 planned-0.005
 especially-0.005
 whether-0.005
Token an
Token ID281
Feature activation+1.77e+00

Pos logit contributions
 planned+0.023
rying+0.023
 cultures+0.022
ride+0.022
ait+0.022
Neg logit contributions
 hurts-0.021
 top-0.021
 trigger-0.020
ze-0.020
 tears-0.020
Token idea
Token ID2126
Feature activation+8.57e-01

Pos logit contributions
 planned+0.053
 especially+0.052
 spending+0.051
 discovering+0.051
 whether+0.049
Neg logit contributions
 trigger-0.039
 hurts-0.034
 top-0.032
 tears-0.032
ze-0.031
Token to
Token ID284
Feature activation-1.98e-01

Pos logit contributions
 especially+0.030
 cultures+0.029
 planned+0.028
ust+0.027
 appreciate+0.027
Neg logit contributions
 hurts-0.013
 trigger-0.013
 cries-0.011
ze-0.011
 tears-0.011
Token make
Token ID787
Feature activation-2.34e+00

Pos logit contributions
 trigger+0.006
 hurts+0.006
ze+0.005
 top+0.005
 lid+0.005
Neg logit contributions
 planned-0.004
 especially-0.004
 discovering-0.004
 spending-0.004
 traveling-0.004
Token her
Token ID607
Feature activation-4.22e-01

Pos logit contributions
 trigger+0.075
 hurts+0.064
 lid+0.058
antly+0.058
itch+0.057
Neg logit contributions
 planned-0.057
 especially-0.057
 spending-0.053
 discovering-0.053
 going-0.051
Token explained
Token ID4893
Feature activation+1.28e+00

Pos logit contributions
 hurts+0.011
 trigger+0.011
 tears+0.011
 stern+0.011
 cries+0.010
Neg logit contributions
 cultures-0.012
packs-0.012
 especially-0.012
ride-0.012
adows-0.012
Token that
Token ID326
Feature activation+7.59e-01

Pos logit contributions
 planned+0.026
 especially+0.026
 discovering+0.024
 spending+0.023
 whether+0.023
Neg logit contributions
 trigger-0.044
 hurts-0.040
ze-0.037
 lid-0.037
antly-0.036
Token she
Token ID673
Feature activation+6.30e-01

Pos logit contributions
 planned+0.019
 especially+0.018
ride+0.018
 discovering+0.018
ust+0.018
Neg logit contributions
 trigger-0.019
 hurts-0.018
 top-0.017
ze-0.016
 tears-0.016
Token should
Token ID815
Feature activation+3.53e-01

Pos logit contributions
 planned+0.013
 especially+0.013
 discovering+0.012
 spending+0.011
 whether+0.011
Neg logit contributions
 trigger-0.021
 hurts-0.019
ze-0.018
 lid-0.018
 top-0.017
Token never
Token ID1239
Feature activation+1.79e+00

Pos logit contributions
 planned+0.007
 especially+0.007
 discovering+0.007
 whether+0.006
 spending+0.006
Neg logit contributions
 trigger-0.012
 hurts-0.011
ze-0.010
 top-0.010
 lid-0.010
Token eat
Token ID4483
Feature activation-5.47e-01

Pos logit contributions
 planned+0.043
 discovering+0.043
 especially+0.043
 spending+0.042
 cultures+0.042
Neg logit contributions
 hurts-0.049
 trigger-0.048
 cries-0.042
ze-0.041
 tears-0.040
Token something
Token ID1223
Feature activation-6.59e-01

Pos logit contributions
 trigger+0.015
 hurts+0.013
ze+0.012
 lid+0.012
 top+0.012
Neg logit contributions
 planned-0.015
 especially-0.014
 discovering-0.014
 spending-0.013
 whether-0.013
Token without
Token ID1231
Feature activation-9.05e-02

Pos logit contributions
 trigger+0.019
 hurts+0.017
ze+0.016
 lid+0.016
antly+0.015
Neg logit contributions
 planned-0.016
 especially-0.016
 discovering-0.015
 spending-0.015
 whether-0.015
Token asking
Token ID4737
Feature activation-5.75e-01

Pos logit contributions
 trigger+0.003
 hurts+0.003
ze+0.003
 lid+0.003
 top+0.003
Neg logit contributions
 planned-0.002
 especially-0.002
 discovering-0.002
 spending-0.002
 whether-0.002
Token her
Token ID607
Feature activation+4.25e-01

Pos logit contributions
 trigger+0.019
 hurts+0.017
ze+0.016
 lid+0.015
 AW+0.015
Neg logit contributions
 especially-0.013
 planned-0.013
 discovering-0.012
 whether-0.012
 spending-0.011
Token first
Token ID717
Feature activation-6.88e-01

Pos logit contributions
 planned+0.009
 especially+0.008
 discovering+0.008
 whether+0.007
 spending+0.007
Neg logit contributions
 trigger-0.015
 hurts-0.013
ze-0.012
 lid-0.012
antly-0.012
Token and
Token ID290
Feature activation-7.17e-01

Pos logit contributions
 trigger+0.004
 hurts+0.004
 lid+0.003
ze+0.003
itch+0.003
Neg logit contributions
 especially-0.002
 planned-0.002
 someday-0.002
 whether-0.002
 discovering-0.002
Token it
Token ID340
Feature activation-2.26e+00

Pos logit contributions
 trigger+0.021
 hurts+0.019
ze+0.018
 lid+0.018
 cries+0.017
Neg logit contributions
 especially-0.017
 planned-0.017
 discovering-0.016
 spending-0.016
 whether-0.015
Token made
Token ID925
Feature activation-2.38e+00

Pos logit contributions
 trigger+0.069
itch+0.057
 lid+0.055
inkle+0.054
eros+0.054
Neg logit contributions
 planned-0.056
 especially-0.056
 going-0.048
 always-0.047
 often-0.046
Token her
Token ID607
Feature activation-8.82e-01

Pos logit contributions
 trigger+0.087
 hurts+0.076
 AW+0.073
eros+0.073
�+0.072
Neg logit contributions
 planned-0.046
 especially-0.046
 going-0.040
 spending-0.039
 always-0.036
Token sad
Token ID6507
Feature activation-6.54e-01

Pos logit contributions
 trigger+0.026
antly+0.021
 hurts+0.020
 lid+0.020
 AW+0.020
Neg logit contributions
 especially-0.024
 planned-0.022
 whether-0.020
 discovering-0.020
 spending-0.020
Token.
Token ID13
Feature activation-4.25e-01

Pos logit contributions
 trigger+0.019
 hurts+0.016
 lid+0.015
ze+0.015
antly+0.015
Neg logit contributions
 especially-0.017
 planned-0.017
 spending-0.015
 discovering-0.015
 whether-0.015
Token Her
Token ID2332
Feature activation+7.19e-01

Pos logit contributions
 trigger+0.015
ze+0.013
 hurts+0.013
inkle+0.012
 lid+0.012
Neg logit contributions
 especially-0.009
 planned-0.008
 spending-0.008
 discovering-0.008
 whether-0.008
Token mom
Token ID1995
Feature activation+1.38e+00

Pos logit contributions
 especially+0.018
 planned+0.018
 discovering+0.016
 spending+0.015
 going+0.015
Neg logit contributions
 trigger-0.022
 AW-0.018
antly-0.017
 hurts-0.017
ze-0.017
Token picked
Token ID6497
Feature activation-8.14e-01

Pos logit contributions
 especially+0.034
 planned+0.033
 discovering+0.029
 spending+0.029
 whether+0.028
Neg logit contributions
 trigger-0.044
 hurts-0.038
 lid-0.035
ze-0.035
 top-0.034
Token up
Token ID510
Feature activation-8.83e-01

Pos logit contributions
 trigger+0.027
inkle+0.025
ggy+0.025
 sleeve+0.024
eg+0.024
Neg logit contributions
 if-0.015
 going-0.015
 spending-0.013
 especially-0.013
 planned-0.012
Token the
Token ID262
Feature activation-3.75e-01

Pos logit contributions
 trigger+0.037
 hurts+0.033
inkle+0.033
ze+0.033
 AW+0.032
Neg logit contributions
 especially-0.013
 planned-0.011
 if-0.010
 spending-0.010
 whether-0.010
Token and
Token ID290
Feature activation-9.62e-01

Pos logit contributions
 trigger+0.017
 hurts+0.015
ze+0.015
 lid+0.014
itch+0.014
Neg logit contributions
 planned-0.008
 especially-0.008
 discovering-0.007
 spending-0.007
 whether-0.007
Token held
Token ID2714
Feature activation-5.78e-01

Pos logit contributions
 trigger+0.030
 hurts+0.027
ze+0.026
 lid+0.025
 top+0.025
Neg logit contributions
 planned-0.022
 especially-0.022
 discovering-0.020
 spending-0.019
 whether-0.019
Token it
Token ID340
Feature activation-2.30e+00

Pos logit contributions
 trigger+0.021
 hurts+0.019
ze+0.019
osaurus+0.017
 racket+0.017
Neg logit contributions
 especially-0.009
 if-0.009
 planned-0.009
 going-0.009
 whether-0.008
Token in
Token ID287
Feature activation-7.00e-01

Pos logit contributions
 trigger+0.064
 hurts+0.057
ze+0.052
inkle+0.050
itch+0.050
Neg logit contributions
 especially-0.061
 planned-0.061
 discovering-0.055
 whether-0.054
 spending-0.053
Token her
Token ID607
Feature activation+1.80e-01

Pos logit contributions
 trigger+0.023
 hurts+0.021
ze+0.019
antly+0.018
 lid+0.018
Neg logit contributions
 planned-0.016
 especially-0.015
 discovering-0.014
 spending-0.014
 whether-0.014
Token hand
Token ID1021
Feature activation-2.32e-01

Pos logit contributions
 especially+0.004
 planned+0.004
 discovering+0.004
 whether+0.004
packs+0.004
Neg logit contributions
 trigger-0.005
ze-0.004
 hurts-0.004
inkle-0.004
chio-0.004
Token.
Token ID13
Feature activation-3.94e-01

Pos logit contributions
 trigger+0.007
 hurts+0.007
inkle+0.006
 lid+0.006
 AW+0.006
Neg logit contributions
 especially-0.005
 planned-0.005
 whether-0.005
 discovering-0.005
 going-0.004
Token\n
Token ID198
Feature activation+1.01e+00

Pos logit contributions
 trigger+0.012
 hurts+0.011
ze+0.010
inkle+0.010
 lid+0.010
Neg logit contributions
 especially-0.009
 planned-0.009
 spending-0.008
 discovering-0.008
 whether-0.008
Token\n
Token ID198
Feature activation+5.14e-01

Pos logit contributions
 planned+0.027
 especially+0.027
 discovering+0.025
 spending+0.024
 cultures+0.024
Neg logit contributions
 trigger-0.027
 hurts-0.024
ze-0.022
 lid-0.022
 top-0.022
Token"
Token ID1
Feature activation+6.48e-01

Pos logit contributions
 especially+0.013
 planned+0.013
 discovering+0.012
 spending+0.011
 whether+0.011
Neg logit contributions
 trigger-0.015
 hurts-0.014
ze-0.013
 lid-0.013
 cries-0.012
TokenHello
Token ID15496
Feature activation-1.49e+00

Pos logit contributions
 especially+0.015
 planned+0.015
 discovering+0.013
 spending+0.013
 whether+0.013
Neg logit contributions
 trigger-0.020
 hurts-0.018
 cries-0.017
 lid-0.017
inkle-0.017
Token could
Token ID714
Feature activation-6.30e-01

Pos logit contributions
 especially+0.007
 planned+0.007
 discovering+0.007
 cultures+0.006
 spending+0.006
Neg logit contributions
 trigger-0.011
 hurts-0.010
ze-0.010
 lid-0.009
antly-0.009
Token help
Token ID1037
Feature activation-1.90e+00

Pos logit contributions
 trigger+0.018
 hurts+0.017
ze+0.016
 top+0.016
 lid+0.015
Neg logit contributions
 planned-0.015
 especially-0.015
 discovering-0.014
 cultures-0.014
 spending-0.013
Token you
Token ID345
Feature activation-9.91e-01

Pos logit contributions
 trigger+0.055
 hurts+0.047
 lid+0.044
ze+0.042
antly+0.042
Neg logit contributions
 especially-0.052
 planned-0.049
 spending-0.047
 discovering-0.047
 whether-0.046
Token."
Token ID526
Feature activation-1.67e+00

Pos logit contributions
 trigger+0.022
 hurts+0.020
ze+0.019
 top+0.019
 lid+0.018
Neg logit contributions
 planned-0.030
 especially-0.029
 discovering-0.028
 cultures-0.028
 spending-0.027
Token\n
Token ID198
Feature activation-6.54e-01

Pos logit contributions
 trigger+0.058
 hurts+0.049
ze+0.048
 lid+0.046
inkle+0.046
Neg logit contributions
 planned-0.035
 especially-0.034
 spending-0.034
 discovering-0.032
 someday-0.030
Token\n
Token ID198
Feature activation-3.78e-01

Pos logit contributions
 trigger+0.021
 hurts+0.018
 lid+0.016
ze+0.016
antly+0.016
Neg logit contributions
 especially-0.016
 planned-0.015
 discovering-0.014
 spending-0.014
 whether-0.014
TokenAfter
Token ID3260
Feature activation-2.22e-01

Pos logit contributions
 trigger+0.012
 hurts+0.011
 lid+0.010
ze+0.010
 cries+0.010
Neg logit contributions
 especially-0.009
 planned-0.009
 discovering-0.008
 spending-0.008
 whether-0.008
Token the
Token ID262
Feature activation-5.39e-01

Pos logit contributions
 trigger+0.008
 hurts+0.007
ze+0.006
 lid+0.006
antly+0.006
Neg logit contributions
 planned-0.005
 especially-0.005
 spending-0.005
 discovering-0.005
 traveling-0.004
Token mosquito
Token ID35446
Feature activation-1.18e+00

Pos logit contributions
 trigger+0.015
 hurts+0.013
antly+0.013
ze+0.012
 lid+0.012
Neg logit contributions
 planned-0.014
 especially-0.014
 spending-0.013
 going-0.013
 discovering-0.012
Token enjoyed
Token ID8359
Feature activation-5.59e-01

Pos logit contributions
 trigger+0.037
 hurts+0.032
 lid+0.030
ze+0.029
 top+0.029
Neg logit contributions
 planned-0.029
 especially-0.028
 discovering-0.026
 spending-0.025
 traveling-0.024
Token the
Token ID262
Feature activation-1.90e+00

Pos logit contributions
 trigger+0.016
 hurts+0.014
ze+0.013
 lid+0.013
 top+0.013
Neg logit contributions
 planned-0.014
 especially-0.014
 discovering-0.013
 spending-0.013
 whether-0.013
Token at
Token ID379
Feature activation+2.93e-01

Pos logit contributions
 planned+0.035
 especially+0.035
 discovering+0.032
 spending+0.032
 whether+0.031
Neg logit contributions
 trigger-0.047
 hurts-0.042
ze-0.040
 lid-0.039
 top-0.038
Token him
Token ID683
Feature activation-1.02e+00

Pos logit contributions
 especially+0.007
 planned+0.006
 discovering+0.006
 spending+0.006
 whether+0.006
Neg logit contributions
 trigger-0.009
 hurts-0.008
ze-0.008
 lid-0.008
 AW-0.008
Token.
Token ID13
Feature activation-8.69e-02

Pos logit contributions
 trigger+0.032
 hurts+0.029
 lid+0.027
 AW+0.026
antly+0.026
Neg logit contributions
 especially-0.023
 planned-0.022
 someday-0.021
 discovering-0.021
 spending-0.021
Token He
Token ID679
Feature activation-9.52e-01

Pos logit contributions
 trigger+0.002
 hurts+0.002
 top+0.002
 lid+0.002
 cries+0.002
Neg logit contributions
 planned-0.002
 especially-0.002
 discovering-0.002
 spending-0.002
 cultures-0.002
Token went
Token ID1816
Feature activation+1.06e+00

Pos logit contributions
 trigger+0.025
 hurts+0.024
ze+0.022
 tears+0.021
 cries+0.021
Neg logit contributions
 planned-0.024
 especially-0.024
 discovering-0.024
 spending-0.023
 whether-0.023
Token to
Token ID284
Feature activation+3.46e-01

Pos logit contributions
 planned+0.023
 especially+0.022
 discovering+0.021
 spending+0.020
 whether+0.020
Neg logit contributions
 trigger-0.035
 hurts-0.032
ze-0.030
 lid-0.029
antly-0.029
Token the
Token ID262
Feature activation-5.77e-01

Pos logit contributions
 planned+0.010
 especially+0.010
 discovering+0.009
 spending+0.009
 whether+0.009
Neg logit contributions
 trigger-0.009
 hurts-0.008
ze-0.007
 lid-0.007
 top-0.007
Token dragon
Token ID10441
Feature activation+9.14e-01

Pos logit contributions
 trigger+0.019
 hurts+0.017
inkle+0.017
 AW+0.016
antly+0.016
Neg logit contributions
 especially-0.012
 planned-0.011
 explorers-0.011
 discovering-0.011
 traveling-0.011
Token and
Token ID290
Feature activation-5.45e-01

Pos logit contributions
 planned+0.021
 especially+0.020
 discovering+0.019
 spending+0.018
 whether+0.018
Neg logit contributions
 trigger-0.028
 hurts-0.026
ze-0.025
 lid-0.024
 cries-0.024
Token fought
Token ID8350
Feature activation-1.19e+00

Pos logit contributions
 trigger+0.017
 hurts+0.015
ze+0.014
 lid+0.014
 top+0.014
Neg logit contributions
 planned-0.012
 especially-0.012
 discovering-0.011
 spending-0.011
 whether-0.011
Token it
Token ID340
Feature activation-2.86e+00

Pos logit contributions
 trigger+0.040
 hurts+0.034
inkle+0.033
 lid+0.033
itch+0.033
Neg logit contributions
 especially-0.026
 planned-0.024
 going-0.023
 someday-0.023
 whether-0.022
Token He
Token ID679
Feature activation-1.27e+00

Pos logit contributions
 trigger+0.013
 hurts+0.011
ze+0.011
 lid+0.010
inkle+0.010
Neg logit contributions
 planned-0.011
 especially-0.011
 discovering-0.010
 whether-0.010
 spending-0.010
Token did
Token ID750
Feature activation-1.37e+00

Pos logit contributions
 trigger+0.044
 hurts+0.039
 lid+0.037
ze+0.037
antly+0.036
Neg logit contributions
 especially-0.026
 planned-0.026
 discovering-0.022
 spending-0.022
 whether-0.022
Token not
Token ID407
Feature activation+2.80e-01

Pos logit contributions
 trigger+0.053
 lid+0.046
 hurts+0.045
antly+0.045
rified+0.043
Neg logit contributions
 especially-0.025
 planned-0.023
 spending-0.022
 whether-0.022
 discovering-0.021
Token want
Token ID765
Feature activation-9.71e-01

Pos logit contributions
 planned+0.005
 especially+0.005
 discovering+0.004
 spending+0.004
 whether+0.004
Neg logit contributions
 trigger-0.010
 hurts-0.010
ze-0.009
 lid-0.009
 top-0.009
Token to
Token ID284
Feature activation-1.38e+00

Pos logit contributions
 trigger+0.040
 hurts+0.037
ze+0.036
 lid+0.035
antly+0.035
Neg logit contributions
 planned-0.013
 especially-0.013
 discovering-0.011
 spending-0.010
 whether-0.010
Token play
Token ID711
Feature activation-2.68e+00

Pos logit contributions
 trigger+0.049
 hurts+0.044
 lid+0.042
ze+0.041
antly+0.041
Neg logit contributions
 especially-0.027
 planned-0.026
 discovering-0.023
 spending-0.023
 whether-0.022
Token with
Token ID351
Feature activation-1.18e+00

Pos logit contributions
 trigger+0.096
 hurts+0.084
 lid+0.082
antly+0.082
itch+0.080
Neg logit contributions
 especially-0.057
 planned-0.050
 someday-0.049
 spending-0.048
 explorers-0.046
Token the
Token ID262
Feature activation-9.84e-01

Pos logit contributions
 trigger+0.038
 hurts+0.035
antly+0.034
ze+0.033
 lid+0.033
Neg logit contributions
 especially-0.026
 planned-0.024
 whether-0.024
 explorers-0.022
 someday-0.022
Token ball
Token ID2613
Feature activation-1.65e+00

Pos logit contributions
 trigger+0.024
ze+0.021
antly+0.021
 hurts+0.021
inkle+0.020
Neg logit contributions
 especially-0.028
 planned-0.028
 spending-0.025
 discovering-0.025
 traveling-0.024
Token.
Token ID13
Feature activation-8.33e-01

Pos logit contributions
 trigger+0.047
 hurts+0.041
ze+0.039
 lid+0.039
antly+0.038
Neg logit contributions
 especially-0.044
 planned-0.043
 someday-0.039
 whether-0.038
 spending-0.038
Token He
Token ID679
Feature activation-1.29e+00

Pos logit contributions
 trigger+0.026
 hurts+0.023
ze+0.022
inkle+0.021
 lid+0.021
Neg logit contributions
 especially-0.020
 planned-0.019
 whether-0.018
 discovering-0.017
 spending-0.017
Token cool
Token ID3608
Feature activation-7.15e-01

Pos logit contributions
 trigger+0.050
 hurts+0.044
ze+0.041
 lid+0.040
antly+0.040
Neg logit contributions
 especially-0.041
 planned-0.041
 discovering-0.038
 spending-0.037
 going-0.037
Token.
Token ID13
Feature activation+1.03e-01

Pos logit contributions
 trigger+0.021
 hurts+0.018
ze+0.017
 lid+0.017
antly+0.016
Neg logit contributions
 planned-0.018
 especially-0.018
 discovering-0.017
 spending-0.016
 whether-0.016
Token Can
Token ID1680
Feature activation-1.43e-01

Pos logit contributions
 planned+0.003
 especially+0.003
 discovering+0.003
 spending+0.003
 whether+0.003
Neg logit contributions
 trigger-0.003
 hurts-0.002
ze-0.002
 lid-0.002
antly-0.002
Token we
Token ID356
Feature activation+6.74e-01

Pos logit contributions
 trigger+0.005
 hurts+0.005
ze+0.004
 lid+0.004
antly+0.004
Neg logit contributions
 especially-0.003
 planned-0.003
 discovering-0.002
 spending-0.002
 whether-0.002
Token touch
Token ID3638
Feature activation-2.60e-01

Pos logit contributions
 especially+0.014
 planned+0.014
 discovering+0.012
 whether+0.012
 spending+0.012
Neg logit contributions
 trigger-0.023
 hurts-0.021
ze-0.019
 lid-0.019
antly-0.019
Token it
Token ID340
Feature activation-2.51e+00

Pos logit contributions
 trigger+0.008
 hurts+0.007
 lid+0.007
ze+0.007
antly+0.006
Neg logit contributions
 especially-0.006
 planned-0.006
 whether-0.006
 someday-0.006
 spending-0.005
Token?"
Token ID1701
Feature activation-4.18e-01

Pos logit contributions
 trigger+0.080
 hurts+0.071
ze+0.066
 lid+0.066
 cries+0.065
Neg logit contributions
 especially-0.058
 planned-0.057
 someday-0.050
 spending-0.050
 discovering-0.050
Token\n
Token ID198
Feature activation-5.88e-01

Pos logit contributions
 trigger+0.012
 hurts+0.011
ze+0.010
 lid+0.010
 top+0.010
Neg logit contributions
 planned-0.011
 especially-0.010
 discovering-0.010
 spending-0.009
 whether-0.009
Token\n
Token ID198
Feature activation-3.63e-01

Pos logit contributions
 trigger+0.020
 hurts+0.018
ze+0.017
 lid+0.017
antly+0.016
Neg logit contributions
 especially-0.012
 planned-0.012
 discovering-0.010
 spending-0.010
 whether-0.010
Token"
Token ID1
Feature activation+1.07e+00

Pos logit contributions
 trigger+0.011
 hurts+0.010
ze+0.009
 lid+0.009
 tears+0.009
Neg logit contributions
 planned-0.009
 especially-0.009
 discovering-0.008
 whether-0.008
 someday-0.008
TokenNo
Token ID2949
Feature activation-1.41e+00

Pos logit contributions
 planned+0.026
 especially+0.026
 discovering+0.026
 spending+0.025
 traveling+0.024
Neg logit contributions
 trigger-0.031
 hurts-0.028
ze-0.026
 lid-0.025
 top-0.025
Token p
Token ID279
Feature activation+2.68e+00

Pos logit contributions
 trigger+0.023
 hurts+0.020
antly+0.019
ze+0.019
itch+0.018
Neg logit contributions
 especially-0.022
 planned-0.022
 discovering-0.021
 spending-0.020
 traveling-0.020
Tokenuddle
Token ID24500
Feature activation-1.06e+00

Pos logit contributions
 planned+0.086
 especially+0.083
 discovering+0.079
 whether+0.076
 traveling+0.076
Neg logit contributions
 trigger-0.059
 hurts-0.049
ze-0.047
antly-0.044
 tears-0.041
Token.
Token ID13
Feature activation-8.70e-01

Pos logit contributions
 trigger+0.028
 hurts+0.025
ze+0.023
 lid+0.023
 top+0.022
Neg logit contributions
 planned-0.029
 especially-0.029
 discovering-0.027
 spending-0.026
 whether-0.026
Token He
Token ID679
Feature activation-1.10e+00

Pos logit contributions
 trigger+0.026
 hurts+0.024
ze+0.023
inkle+0.021
antly+0.021
Neg logit contributions
 planned-0.021
 especially-0.021
 discovering-0.019
 spending-0.019
 whether-0.019
Token smiled
Token ID13541
Feature activation+8.65e-01

Pos logit contributions
 trigger+0.033
 hurts+0.031
ze+0.029
 lid+0.028
 cries+0.028
Neg logit contributions
 planned-0.025
 especially-0.025
 discovering-0.024
 spending-0.023
 whether-0.023
Token and
Token ID290
Feature activation-2.43e+00

Pos logit contributions
 planned+0.020
 especially+0.019
 discovering+0.018
 spending+0.017
 cultures+0.017
Neg logit contributions
 trigger-0.027
 hurts-0.024
ze-0.023
 top-0.023
 lid-0.022
Token stepped
Token ID10764
Feature activation+8.73e-01

Pos logit contributions
 trigger+0.070
 hurts+0.064
ze+0.059
 lid+0.058
 cries+0.057
Neg logit contributions
 especially-0.059
 planned-0.059
 discovering-0.055
 spending-0.053
 whether-0.053
Token in
Token ID287
Feature activation-9.58e-01

Pos logit contributions
 planned+0.024
 especially+0.024
 discovering+0.022
 spending+0.022
 whether+0.021
Neg logit contributions
 trigger-0.023
 hurts-0.022
ze-0.019
 cries-0.018
 lid-0.018
Token it
Token ID340
Feature activation-1.67e+00

Pos logit contributions
 trigger+0.026
 hurts+0.023
ze+0.021
 lid+0.020
inkle+0.020
Neg logit contributions
 especially-0.026
 planned-0.026
 discovering-0.025
 spending-0.024
 whether-0.024
Token.
Token ID13
Feature activation-5.96e-01

Pos logit contributions
 trigger+0.049
 hurts+0.043
ze+0.039
itch+0.039
 lid+0.038
Neg logit contributions
 especially-0.043
 planned-0.043
 discovering-0.039
 whether-0.038
 spending-0.037
Token He
Token ID679
Feature activation-9.21e-01

Pos logit contributions
 trigger+0.018
 hurts+0.016
ze+0.016
inkle+0.015
 lid+0.015
Neg logit contributions
 planned-0.014
 especially-0.014
 discovering-0.013
 spending-0.013
 whether-0.013
Token bl
Token ID698
Feature activation+1.34e+00

Pos logit contributions
 trigger+0.016
antly+0.013
inkle+0.012
 sleeve+0.012
 lid+0.012
Neg logit contributions
 especially-0.017
 planned-0.017
 going-0.016
 always-0.015
 spending-0.015
Tokenouse
Token ID1076
Feature activation-2.63e+00

Pos logit contributions
 planned+0.034
 especially+0.031
 haven+0.030
 discovering+0.030
ust+0.029
Neg logit contributions
 trigger-0.039
 top-0.032
 tears-0.031
ze-0.031
antly-0.031
Token belongs
Token ID14448
Feature activation+4.41e-01

Pos logit contributions
 trigger+0.087
antly+0.065
 top+0.063
 tears+0.062
Craig+0.062
Neg logit contributions
 planned-0.063
 especially-0.057
 going-0.056
 always-0.055
 spending-0.051
Token to
Token ID284
Feature activation-2.37e+00

Pos logit contributions
 planned+0.006
 especially+0.006
 discovering+0.005
 spending+0.005
 whether+0.005
Neg logit contributions
 trigger-0.018
 hurts-0.016
ze-0.016
 lid-0.015
 cries-0.015
Token me
Token ID502
Feature activation-4.30e-01

Pos logit contributions
 trigger+0.063
 hurts+0.049
 lid+0.049
antly+0.047
ze+0.046
Neg logit contributions
 planned-0.072
 especially-0.070
 spending-0.066
 discovering-0.065
 seeing-0.064
Token!"
Token ID2474
Feature activation-3.12e+00

Pos logit contributions
 trigger+0.013
 hurts+0.010
ze+0.010
 lid+0.009
antly+0.009
Neg logit contributions
 especially-0.012
 planned-0.012
 discovering-0.011
 whether-0.011
 spending-0.011
Token grow
Token ID1663
Feature activation+8.95e-02

Pos logit contributions
 trigger+0.103
inkle+0.094
ze+0.091
gh+0.088
 hurts+0.088
Neg logit contributions
 planned-0.065
 especially-0.062
 explorers-0.056
 discovering-0.053
 seeing-0.051
Tokenled
Token ID992
Feature activation-1.48e+00

Pos logit contributions
 planned+0.003
 especially+0.003
 explorers+0.002
 often+0.002
 going+0.002
Neg logit contributions
 trigger-0.002
 sleeve-0.002
chief-0.002
ze-0.002
itch-0.002
Token the
Token ID262
Feature activation+3.06e-01

Pos logit contributions
 trigger+0.057
 hurts+0.054
inkle+0.052
 lid+0.051
ze+0.051
Neg logit contributions
 especially-0.023
 planned-0.021
 explorers-0.020
 whether-0.019
 if-0.018
Token bear
Token ID6842
Feature activation-1.32e+00

Pos logit contributions
 especially+0.009
 planned+0.008
 traveling+0.007
 going+0.007
 discovering+0.007
Neg logit contributions
 trigger-0.008
 AW-0.008
antly-0.008
inkle-0.007
 hurts-0.007
Token.
Token ID13
Feature activation-1.38e+00

Pos logit contributions
 trigger+0.042
 hurts+0.036
 AW+0.033
 lid+0.033
antly+0.033
Neg logit contributions
 especially-0.031
 planned-0.030
 discovering-0.028
 whether-0.027
 someday-0.026
Token dangerous
Token ID4923
Feature activation-1.91e+00

Pos logit contributions
 trigger+0.005
 hurts+0.005
ze+0.004
 lid+0.004
 top+0.004
Neg logit contributions
 planned-0.005
 especially-0.005
 discovering-0.005
 spending-0.005
 whether-0.005
Token.
Token ID13
Feature activation+2.83e-01

Pos logit contributions
 trigger+0.043
 hurts+0.039
 cries+0.034
ze+0.034
 lid+0.032
Neg logit contributions
 planned-0.058
 especially-0.058
 discovering-0.056
 spending-0.054
 whether-0.053
Token Next
Token ID7406
Feature activation+1.15e+00

Pos logit contributions
 planned+0.008
 especially+0.008
 discovering+0.007
 spending+0.007
 traveling+0.007
Neg logit contributions
 trigger-0.007
 hurts-0.007
ze-0.006
 cries-0.006
 lid-0.006
Token time
Token ID640
Feature activation-1.36e+00

Pos logit contributions
 especially+0.033
 planned+0.032
 discovering+0.031
 spending+0.030
 whether+0.030
Neg logit contributions
 trigger-0.029
 hurts-0.026
ze-0.024
 lid-0.023
 top-0.023
Token,
Token ID11
Feature activation+1.07e+00

Pos logit contributions
 trigger+0.028
 hurts+0.026
ze+0.022
 lid+0.021
 cries+0.021
Neg logit contributions
 especially-0.044
 planned-0.044
 discovering-0.042
 whether-0.041
 spending-0.041
Token play
Token ID711
Feature activation-2.31e+00

Pos logit contributions
 planned+0.027
 especially+0.027
 discovering+0.026
asured+0.026
ride+0.026
Neg logit contributions
 trigger-0.026
 hurts-0.025
ze-0.023
 top-0.023
 breaks-0.022
Token somewhere
Token ID7382
Feature activation-8.95e-01

Pos logit contributions
 trigger+0.077
 hurts+0.070
 cries+0.064
antly+0.063
ze+0.063
Neg logit contributions
 especially-0.049
 planned-0.049
 discovering-0.044
 explorers-0.043
 spending-0.042
Token safe
Token ID3338
Feature activation-7.23e-01

Pos logit contributions
 trigger+0.023
 hurts+0.023
 cries+0.020
ze+0.020
 breaks+0.019
Neg logit contributions
 especially-0.023
 planned-0.023
 discovering-0.023
 spending-0.022
 cultures-0.021
Token,"
Token ID553
Feature activation-2.65e+00

Pos logit contributions
 trigger+0.019
 hurts+0.017
ze+0.015
 lid+0.015
 top+0.015
Neg logit contributions
 especially-0.019
 planned-0.019
 discovering-0.018
 spending-0.018
 whether-0.018
Token mom
Token ID1995
Feature activation+8.32e-01

Pos logit contributions
 trigger+0.082
 hurts+0.074
ze+0.069
 lid+0.068
antly+0.066
Neg logit contributions
 planned-0.062
 especially-0.062
 discovering-0.056
 spending-0.055
 whether-0.055
Token said
Token ID531
Feature activation+9.13e-02

Pos logit contributions
 planned+0.024
 especially+0.024
 discovering+0.023
 spending+0.022
 whether+0.022
Neg logit contributions
 trigger-0.020
 hurts-0.018
ze-0.016
 lid-0.016
 cries-0.016