TOP ACTIVATIONS
MAX = 4.875

, okay ?" the zo oke eper said .
doing here ?" the zo oke eper asked . " You
the bear and the zo oke ep ers and went back
hurt you ," the zo oke eper said .
and hide from the zo oke ep ers . Do you
would take out a j igsaw puzzle and try to solve
him to a k ara oke party . There was a
ran home and shared his story with the other ants .
Mason spun off into the sunset . <|endoftext|> Once upon a
the zoo ," the zo oke eper says . " Don
walked over to the zo oke eper and asked him if
. He asked the zo oke eper if he could borrow
, kids !" the zo oke eper said . " The
the theater and share her story with others . The brave
. They thanked the zo oke eper and ran to their
, sorry !" the zo oke ep ers said . "
it ran off into the sunset . He knew he had
. Ben asked the zo oke eper if he could touch
. They asked the zo oke ep ers , the other
and flew away into the sunset . She was so thankful

MIN ACTIVATIONS
MIN = -6.780

told her what he had seen , and his mom knew
told mummy what she had seen . M ummy
told her what they had seen and heard . Mom my
, " What have you seen , John ?" He pointed
see that things she had seen backstage were missing ; the
! That 's what grown - ups do when they need
excited about what he had seen . He knew he would
told her what he had seen and how he had escaped
, things he had never seen before . Stanley was so
the wonderful things she had seen in the ocean . It
asked them what they had seen and done . They told
" What are b erry - bus hes ?"
saw things she had never seen before . She felt guilty
the amazing things she had seen . She felt happy and
all about what he had seen and everyone listened .
all the things they had seen on their drive . It
Amazing things he had never seen before . He had gained
the wonderful things they had seen in the tunnel . <|endoftext|>
about the things they had seen and done . Bob was
the amazing things they had seen . The

INTERVAL 1.962 - 4.875
CONTAINS 0.285%

pocket . He took out two pieces of bread and said
pieces back together like a puzzle ." Lily and her mom
wrong . Tim saw the lime and knew it was not
and he heard that a story te ller was telling a
S ara holds Ben 's hand . " Don 't

INTERVAL -0.952 - 1.962
CONTAINS 93.318%

You have to share ." Sam shook his head
. It was a nightmare ! John was frightened , but
?" Her mom said , " Sure , I can carry
It 's dangerous out there ." But Lucy couldn
look at . Tom was so happy to be there !

INTERVAL -3.866 - -0.952
CONTAINS 6.391%

axe and said , " Yes , this axe is perfect
toys . <|endoftext|> Once upon a time , there was a
. She told them what was happening . They got up
No ! It ’ s my doll . ”
. But what was it for ? Sl ither

INTERVAL -6.780 - -3.866
CONTAINS 0.005%

interesting thing she had ever seen . Eventually the
things that Kelly had never seen before , she didn 't
She put on her rain coat and ran outside to dance
little frog what he had seen . The little frog 's
about we make some mac aron i and cheese for dinner
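
Note on the INTERVAL ranges above: the four ranges look like equal-width bins spanning MIN to MAX. That is an inference from the displayed numbers, not something the dump states. The sketch below recomputes the bin edges under that assumption; the function name `equal_width_edges` is hypothetical and is not taken from whatever tool produced this output. Because MIN and MAX are shown rounded to three decimals, the recomputed edges can differ from the displayed ones in the last digit.

```python
def equal_width_edges(lo, hi, n_bins):
    """Return the n_bins + 1 edges of equal-width bins covering [lo, hi]."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]


# Values copied from the dump (already rounded to three decimals there).
MIN_ACT = -6.780
MAX_ACT = 4.875

edges = equal_width_edges(MIN_ACT, MAX_ACT, 4)

# Interval edges as displayed in the dump, lowest to highest.
displayed_edges = [-6.780, -3.866, -0.952, 1.962, 4.875]

# Percentages listed under CONTAINS, in the same low-to-high order.
shares = [0.005, 6.391, 93.318, 0.285]
total_share = sum(shares)

for computed, shown in zip(edges, displayed_edges):
    print(f"computed {computed:+.3f}   displayed {shown:+.3f}")
print(f"sum of CONTAINS percentages: {total_share:.3f}")
```

Under this reading each bin is about 2.914 wide, and the listed percentages sum to just under 100, with the remainder attributable to rounding.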
Token,
Token ID11
Feature activation+2.75e-01

Pos logit contributions
+0.009
 supplies+0.008
ird+0.008
 markers+0.007
 help+0.007
Neg logit contributions
 thirteen-0.023
dy-0.022
die-0.022
 obstacle-0.021
Dean-0.020
Token okay
Token ID8788
Feature activation+8.70e-01

Pos logit contributions
 thirteen+0.014
die+0.014
dy+0.014
 obstacle+0.013
Dean+0.013
Neg logit contributions
 help-0.005
 yes-0.005
-0.004
 supplies-0.004
 decor-0.004
Token?"
Token ID1701
Feature activation-4.60e-02

Pos logit contributions
 thirteen+0.052
 obstacle+0.046
Dean+0.046
dy+0.045
die+0.045
Neg logit contributions
 help-0.011
-0.010
 decor-0.010
 supplies-0.010
 markers-0.009
Token the
Token ID262
Feature activation+8.05e-01

Pos logit contributions
+0.001
 help+0.001
 supplies+0.001
 markers+0.001
ird+0.001
Neg logit contributions
 thirteen-0.002
dy-0.002
die-0.002
 obstacle-0.002
reetings-0.002
Token zo
Token ID40565
Feature activation+9.10e-01

Pos logit contributions
 thirteen+0.032
die+0.031
dy+0.030
 obstacle+0.026
Dean+0.026
Neg logit contributions
 help-0.025
-0.024
 supplies-0.024
 show-0.023
ird-0.023
Tokenoke
Token ID2088
Feature activation+4.88e+00

Pos logit contributions
 thirteen+0.024
die+0.023
dy+0.023
 obstacle+0.018
Dean+0.018
Neg logit contributions
-0.041
ird-0.040
 supplies-0.039
 help-0.038
vell-0.038
Tokeneper
Token ID5723
Feature activation+2.30e-01

Pos logit contributions
die+0.157
 thirteen+0.156
 obstacle+0.143
dy+0.142
Today+0.131
Neg logit contributions
ird-0.173
 supplies-0.169
 help-0.167
 together-0.167
 fixed-0.161
Token said
Token ID531
Feature activation+8.93e-01

Pos logit contributions
 thirteen+0.013
dy+0.012
die+0.012
 obstacle+0.011
Dean+0.011
Neg logit contributions
-0.004
 supplies-0.004
ird-0.003
 markers-0.003
 help-0.003
Token.
Token ID13
Feature activation+9.36e-01

Pos logit contributions
 thirteen+0.046
dy+0.043
die+0.042
 obstacle+0.039
Dean+0.039
Neg logit contributions
-0.020
 supplies-0.018
ird-0.015
 markers-0.015
vell-0.015
Token
Token ID198
Feature activation+3.11e-01

Pos logit contributions
 thirteen+0.046
dy+0.042
die+0.041
 obstacle+0.040
Dean+0.038
Neg logit contributions
-0.024
 supplies-0.021
ird-0.019
 help-0.019
 markers-0.019
Token
Token ID198
Feature activation+2.08e-01

Pos logit contributions
 thirteen+0.015
dy+0.013
die+0.013
 obstacle+0.013
Dean+0.012
Neg logit contributions
-0.008
 supplies-0.007
 help-0.007
ird-0.007
 fixed-0.006
Token doing
Token ID1804
Feature activation+1.58e-01

Pos logit contributions
+0.035
 supplies+0.033
 help+0.032
ird+0.031
 decor+0.031
Neg logit contributions
 thirteen-0.036
die-0.032
dy-0.032
 obstacle-0.031
Dean-0.029
Token here
Token ID994
Feature activation-1.77e+00

Pos logit contributions
 thirteen+0.008
dy+0.008
die+0.008
 obstacle+0.007
Dean+0.007
Neg logit contributions
-0.003
 supplies-0.003
 help-0.003
ird-0.003
 markers-0.003
Token?"
Token ID1701
Feature activation-4.05e-01

Pos logit contributions
+0.040
 supplies+0.037
 help+0.033
ird+0.033
 markers+0.032
Neg logit contributions
 thirteen-0.089
dy-0.083
die-0.082
 obstacle-0.078
Dean-0.076
Token the
Token ID262
Feature activation+6.75e-01

Pos logit contributions
+0.010
 help+0.009
 supplies+0.009
 markers+0.008
ird+0.008
Neg logit contributions
 thirteen-0.020
dy-0.019
die-0.018
 obstacle-0.017
reetings-0.016
Token zo
Token ID40565
Feature activation+7.39e-01

Pos logit contributions
 thirteen+0.026
die+0.024
dy+0.024
 obstacle+0.021
Dean+0.021
Neg logit contributions
-0.022
 supplies-0.022
 help-0.022
ird-0.020
 markers-0.020
Tokenoke
Token ID2088
Feature activation+4.68e+00

Pos logit contributions
 thirteen+0.019
die+0.018
dy+0.018
 obstacle+0.014
Today+0.014
Neg logit contributions
-0.034
ird-0.033
 supplies-0.032
 help-0.031
vell-0.031
Tokeneper
Token ID5723
Feature activation-3.23e-03

Pos logit contributions
 thirteen+0.152
die+0.149
dy+0.136
 obstacle+0.136
Today+0.130
Neg logit contributions
ird-0.167
 help-0.165
 supplies-0.163
 together-0.156
 fixed-0.155
Token asked
Token ID1965
Feature activation+8.12e-01

Pos logit contributions
ird+0.000
 supplies+0.000
+0.000
 help+0.000
 fixed+0.000
Neg logit contributions
 thirteen-0.000
dy-0.000
die-0.000
 obstacle-0.000
Dean-0.000
Token.
Token ID13
Feature activation+7.82e-01

Pos logit contributions
 thirteen+0.047
dy+0.044
die+0.044
 obstacle+0.042
Dean+0.041
Neg logit contributions
-0.012
 supplies-0.011
 help-0.009
ird-0.009
 markers-0.009
Token "
Token ID366
Feature activation-3.72e-02

Pos logit contributions
 thirteen+0.036
dy+0.034
die+0.033
 obstacle+0.032
Dean+0.030
Neg logit contributions
-0.022
 supplies-0.018
 help-0.018
ird-0.017
 markers-0.017
TokenYou
Token ID1639
Feature activation+5.29e-01

Pos logit contributions
+0.001
he+0.001
ird+0.001
 together+0.001
 help+0.001
Neg logit contributions
dy-0.002
die-0.002
 thirteen-0.001
 obstacle-0.001
 instant-0.001
Token the
Token ID262
Feature activation+2.81e-02

Pos logit contributions
 thirteen+0.000
dy+0.000
 moral+0.000
 obstacle+0.000
die+0.000
Neg logit contributions
vell-0.000
-0.000
ird-0.000
 supplies-0.000
anger-0.000
Token bear
Token ID6842
Feature activation-1.59e-01

Pos logit contributions
 thirteen+0.001
dy+0.001
die+0.001
 obstacle+0.001
Dean+0.001
Neg logit contributions
-0.001
 supplies-0.001
 help-0.001
 markers-0.001
vell-0.001
Token and
Token ID290
Feature activation+6.82e-02

Pos logit contributions
+0.004
 supplies+0.004
 help+0.003
ird+0.003
 markers+0.003
Neg logit contributions
 thirteen-0.008
dy-0.007
die-0.007
 obstacle-0.006
Dean-0.006
Token the
Token ID262
Feature activation+1.69e-01

Pos logit contributions
 thirteen+0.002
dy+0.002
die+0.002
Dean+0.002
 obstacle+0.002
Neg logit contributions
-0.003
vell-0.002
 supplies-0.002
anger-0.002
ird-0.002
Token zo
Token ID40565
Feature activation+7.83e-01

Pos logit contributions
 thirteen+0.007
dy+0.006
die+0.006
 obstacle+0.006
Dean+0.005
Neg logit contributions
-0.006
 supplies-0.005
 help-0.005
 markers-0.005
vell-0.005
Tokenoke
Token ID2088
Feature activation+4.61e+00

Pos logit contributions
 thirteen+0.018
dy+0.016
die+0.015
 obstacle+0.013
Dean+0.012
Neg logit contributions
-0.039
 supplies-0.037
ird-0.036
 markers-0.035
 help-0.035
Tokenep
Token ID538
Feature activation-8.63e-01

Pos logit contributions
 thirteen+0.136
die+0.123
dy+0.117
 obstacle+0.114
Today+0.103
Neg logit contributions
 supplies-0.188
 help-0.182
ird-0.182
-0.178
 fixed-0.174
Tokeners
Token ID364
Feature activation-5.91e-01

Pos logit contributions
+0.024
 supplies+0.022
ird+0.020
 markers+0.020
vell+0.019
Neg logit contributions
 thirteen-0.039
dy-0.037
die-0.036
 obstacle-0.034
Dean-0.033
Token and
Token ID290
Feature activation+7.55e-02

Pos logit contributions
+0.017
 supplies+0.015
 markers+0.015
idents+0.014
enge+0.014
Neg logit contributions
 thirteen-0.025
dy-0.023
die-0.021
 moral-0.021
He-0.020
Token went
Token ID1816
Feature activation+8.64e-01

Pos logit contributions
 thirteen+0.003
dy+0.003
die+0.003
 obstacle+0.002
Dean+0.002
Neg logit contributions
-0.003
 supplies-0.002
ird-0.002
vell-0.002
 markers-0.002
Token back
Token ID736
Feature activation+1.34e-01

Pos logit contributions
 thirteen+0.033
dy+0.031
die+0.029
 obstacle+0.027
Dean+0.027
Neg logit contributions
-0.030
ird-0.026
 supplies-0.026
 markers-0.026
vell-0.025
Token hurt
Token ID5938
Feature activation+5.80e-01

Pos logit contributions
dy+0.021
 thirteen+0.021
die+0.021
Today+0.020
Dean+0.019
Neg logit contributions
 help-0.015
 fixed-0.013
 supplies-0.013
-0.012
 show-0.012
Token you
Token ID345
Feature activation+1.92e+00

Pos logit contributions
 thirteen+0.032
dy+0.030
die+0.030
 obstacle+0.028
Dean+0.028
Neg logit contributions
 supplies-0.009
 help-0.009
-0.008
ird-0.007
 together-0.007
Token,"
Token ID553
Feature activation-4.01e-01

Pos logit contributions
 thirteen+0.110
dy+0.103
die+0.102
 obstacle+0.097
Dean+0.096
Neg logit contributions
-0.030
 supplies-0.027
ird-0.025
 help-0.024
 fixed-0.022
Token the
Token ID262
Feature activation+6.56e-01

Pos logit contributions
+0.011
 supplies+0.010
 help+0.010
ird+0.009
 markers+0.009
Neg logit contributions
 thirteen-0.019
dy-0.018
die-0.017
 obstacle-0.016
Dean-0.015
Token zo
Token ID40565
Feature activation+9.64e-01

Pos logit contributions
 thirteen+0.027
die+0.026
dy+0.025
Dean+0.023
 obstacle+0.023
Neg logit contributions
-0.020
 help-0.019
 supplies-0.019
 fixed-0.017
ird-0.017
Tokenoke
Token ID2088
Feature activation+4.52e+00

Pos logit contributions
die+0.027
 thirteen+0.026
dy+0.026
Today+0.022
 obstacle+0.021
Neg logit contributions
-0.041
ird-0.041
 help-0.039
 supplies-0.039
vell-0.038
Tokeneper
Token ID5723
Feature activation+1.96e-02

Pos logit contributions
die+0.139
 thirteen+0.139
dy+0.130
 obstacle+0.125
Today+0.120
Neg logit contributions
ird-0.168
 supplies-0.166
 help-0.164
 fixed-0.158
 together-0.158
Token said
Token ID531
Feature activation+7.26e-01

Pos logit contributions
 thirteen+0.001
die+0.001
dy+0.001
 obstacle+0.001
Today+0.001
Neg logit contributions
 help-0.000
 supplies-0.000
ird-0.000
 fixed-0.000
-0.000
Token.
Token ID13
Feature activation+6.03e-01

Pos logit contributions
 thirteen+0.037
dy+0.035
die+0.034
 obstacle+0.032
Dean+0.031
Neg logit contributions
-0.016
 supplies-0.014
 markers-0.013
ird-0.013
vell-0.012
Token
Token ID198
Feature activation+1.32e-01

Pos logit contributions
 thirteen+0.028
dy+0.026
die+0.026
 obstacle+0.025
Dean+0.023
Neg logit contributions
-0.017
 supplies-0.014
ird-0.014
 help-0.013
 markers-0.013
Token
Token ID198
Feature activation+1.07e-01

Pos logit contributions
 thirteen+0.007
dy+0.006
die+0.006
 obstacle+0.006
Dean+0.005
Neg logit contributions
-0.003
 supplies-0.003
 help-0.003
ird-0.003
 Eve-0.002
Token and
Token ID290
Feature activation+2.05e-01

Pos logit contributions
 thirteen+0.006
die+0.006
dy+0.006
Dean+0.005
Meanwhile+0.005
Neg logit contributions
 together-0.001
 help-0.001
 supplies-0.001
 fixed-0.000
 decor-0.000
Token hide
Token ID7808
Feature activation-2.30e-01

Pos logit contributions
 thirteen+0.009
dy+0.009
die+0.009
Today+0.008
Dean+0.008
Neg logit contributions
 help-0.006
 supplies-0.005
 decor-0.005
 markers-0.005
 show-0.005
Token from
Token ID422
Feature activation+2.06e-01

Pos logit contributions
+0.006
 supplies+0.005
ird+0.005
 markers+0.005
 help+0.005
Neg logit contributions
 thirteen-0.011
dy-0.010
die-0.010
 obstacle-0.009
Dean-0.009
Token the
Token ID262
Feature activation+1.96e-01

Pos logit contributions
 thirteen+0.010
dy+0.009
die+0.009
Dean+0.008
 obstacle+0.008
Neg logit contributions
 supplies-0.005
-0.005
 help-0.004
 markers-0.004
 decor-0.004
Token zo
Token ID40565
Feature activation+9.14e-01

Pos logit contributions
 thirteen+0.008
dy+0.008
die+0.008
reetings+0.007
mo+0.007
Neg logit contributions
 supplies-0.006
 help-0.006
-0.006
 markers-0.005
 cr-0.005
Tokenoke
Token ID2088
Feature activation+4.52e+00

Pos logit contributions
 thirteen+0.024
dy+0.024
die+0.024
 obstacle+0.019
Dean+0.018
Neg logit contributions
-0.041
ird-0.039
 supplies-0.039
 help-0.037
vell-0.037
Tokenep
Token ID538
Feature activation-7.59e-01

Pos logit contributions
 thirteen+0.166
die+0.156
dy+0.147
 obstacle+0.141
Today+0.138
Neg logit contributions
ird-0.148
 supplies-0.147
 help-0.144
vell-0.139
 together-0.139
Tokeners
Token ID364
Feature activation-7.07e-01

Pos logit contributions
 supplies+0.010
+0.009
 help+0.009
ird+0.008
 together+0.008
Neg logit contributions
 thirteen-0.043
die-0.041
dy-0.041
 obstacle-0.040
Dean-0.038
Token.
Token ID13
Feature activation+4.08e-01

Pos logit contributions
+0.011
 supplies+0.010
ird+0.009
 help+0.008
 markers+0.008
Neg logit contributions
 thirteen-0.040
dy-0.038
die-0.038
 obstacle-0.036
Dean-0.035
Token Do
Token ID2141
Feature activation+1.08e+00

Pos logit contributions
 thirteen+0.016
dy+0.016
die+0.015
 obstacle+0.015
Dean+0.014
Neg logit contributions
-0.013
ird-0.012
 supplies-0.011
 help-0.011
 Tim-0.010
Token you
Token ID345
Feature activation+9.77e-01

Pos logit contributions
 thirteen+0.065
dy+0.061
die+0.061
 obstacle+0.059
Today+0.056
Neg logit contributions
-0.014
 help-0.012
 supplies-0.011
ird-0.009
 yes-0.009
Token would
Token ID561
Feature activation-3.61e-01

Pos logit contributions
 thirteen+0.017
dy+0.016
die+0.015
 obstacle+0.015
Dean+0.014
Neg logit contributions
-0.010
 supplies-0.009
ird-0.008
 markers-0.008
 help-0.008
Token take
Token ID1011
Feature activation+8.62e-01

Pos logit contributions
+0.010
 supplies+0.010
ird+0.009
 help+0.009
 markers+0.009
Neg logit contributions
 thirteen-0.016
dy-0.015
die-0.015
 obstacle-0.014
Dean-0.013
Token out
Token ID503
Feature activation-7.19e-01

Pos logit contributions
 thirteen+0.035
dy+0.032
die+0.031
 obstacle+0.030
Dean+0.029
Neg logit contributions
-0.030
ird-0.025
vell-0.024
 supplies-0.023
idents-0.023
Token a
Token ID257
Feature activation-3.08e-01

Pos logit contributions
 supplies+0.018
 markers+0.017
 help+0.016
+0.015
 decor+0.015
Neg logit contributions
 thirteen-0.035
dy-0.034
die-0.033
reetings-0.030
 obstacle-0.030
Token j
Token ID474
Feature activation+9.43e-01

Pos logit contributions
+0.009
 supplies+0.009
 markers+0.008
ird+0.008
 help+0.008
Neg logit contributions
 thirteen-0.013
dy-0.012
die-0.012
 obstacle-0.011
Dean-0.011
Tokenigsaw
Token ID45636
Feature activation+4.41e+00

Pos logit contributions
 thirteen+0.026
dy+0.025
 obstacle+0.024
die+0.022
Dean+0.020
Neg logit contributions
-0.040
 supplies-0.038
 markers-0.037
ird-0.037
vell-0.036
Token puzzle
Token ID15027
Feature activation+2.61e+00

Pos logit contributions
 thirteen+0.144
dy+0.143
die+0.135
 obstacle+0.134
Today+0.122
Neg logit contributions
-0.162
ird-0.146
 Bian-0.137
 yes-0.134
 supplies-0.133
Token and
Token ID290
Feature activation+1.21e-01

Pos logit contributions
 thirteen+0.107
dy+0.102
die+0.097
 obstacle+0.095
Dean+0.091
Neg logit contributions
-0.082
ird-0.068
idents-0.064
 supplies-0.064
vell-0.063
Token try
Token ID1949
Feature activation+1.15e+00

Pos logit contributions
 thirteen+0.005
dy+0.005
die+0.005
 obstacle+0.005
Dean+0.005
Neg logit contributions
-0.003
 supplies-0.003
ird-0.003
 markers-0.003
 help-0.003
Token to
Token ID284
Feature activation+5.81e-01

Pos logit contributions
 thirteen+0.064
dy+0.060
die+0.059
 obstacle+0.056
Dean+0.055
Neg logit contributions
 supplies-0.020
-0.018
 markers-0.017
 help-0.016
 decor-0.016
Token solve
Token ID8494
Feature activation+1.46e+00

Pos logit contributions
 thirteen+0.024
dy+0.022
die+0.021
 obstacle+0.021
Dean+0.020
Neg logit contributions
-0.018
 supplies-0.016
ird-0.015
anger-0.015
vell-0.015
Token him
Token ID683
Feature activation-5.86e-01

Pos logit contributions
 thirteen+0.003
dy+0.002
die+0.002
 obstacle+0.002
Dean+0.002
Neg logit contributions
-0.001
vell-0.001
ird-0.001
 supplies-0.001
 markers-0.001
Token to
Token ID284
Feature activation+9.12e-03

Pos logit contributions
+0.014
 supplies+0.013
ird+0.012
 markers+0.011
vell+0.011
Neg logit contributions
 thirteen-0.029
dy-0.027
die-0.026
 obstacle-0.025
Dean-0.024
Token a
Token ID257
Feature activation-2.48e-01

Pos logit contributions
 thirteen+0.000
dy+0.000
die+0.000
Dean+0.000
 moral+0.000
Neg logit contributions
-0.000
vell-0.000
anger-0.000
ird-0.000
 supplies-0.000
Token k
Token ID479
Feature activation-4.60e-01

Pos logit contributions
+0.008
 supplies+0.007
ird+0.006
 markers+0.006
 help+0.006
Neg logit contributions
 thirteen-0.011
dy-0.010
die-0.010
 obstacle-0.009
Dean-0.009
Tokenara
Token ID3301
Feature activation+1.30e-01

Pos logit contributions
 supplies+0.019
 help+0.017
+0.017
 markers+0.017
vell+0.017
Neg logit contributions
 thirteen-0.012
Today-0.012
dy-0.012
Dean-0.011
die-0.011
Tokenoke
Token ID2088
Feature activation+4.38e+00

Pos logit contributions
 thirteen+0.004
dy+0.004
die+0.004
 obstacle+0.003
Dean+0.003
Neg logit contributions
-0.005
 supplies-0.005
vell-0.005
 markers-0.005
ird-0.005
Token party
Token ID2151
Feature activation-1.14e-01

Pos logit contributions
 thirteen+0.113
dy+0.100
die+0.095
 obstacle+0.084
Dean+0.081
Neg logit contributions
-0.209
 supplies-0.197
ird-0.187
 markers-0.187
 help-0.183
Token.
Token ID13
Feature activation+6.08e-02

Pos logit contributions
+0.004
idents+0.003
vell+0.003
 markers+0.003
 supplies+0.003
Neg logit contributions
 thirteen-0.004
dy-0.004
Today-0.004
 moral-0.004
die-0.003
Token There
Token ID1318
Feature activation+9.24e-02

Pos logit contributions
 thirteen+0.002
dy+0.002
Today+0.002
die+0.002
Those+0.002
Neg logit contributions
vell-0.002
 supplies-0.002
isher-0.002
ird-0.002
 markers-0.002
Token was
Token ID373
Feature activation-5.67e-02

Pos logit contributions
 thirteen+0.003
dy+0.003
Today+0.003
die+0.003
 moral+0.003
Neg logit contributions
-0.003
vell-0.003
 markers-0.003
 supplies-0.003
resses-0.003
Token a
Token ID257
Feature activation-3.83e-01

Pos logit contributions
+0.001
 supplies+0.001
 markers+0.001
ird+0.001
 help+0.001
Neg logit contributions
 thirteen-0.003
dy-0.003
die-0.003
 obstacle-0.003
Dean-0.003
Token ran
Token ID4966
Feature activation+2.22e-01

Pos logit contributions
+0.029
 supplies+0.029
 help+0.026
ird+0.026
 markers+0.026
Neg logit contributions
 thirteen-0.048
dy-0.044
die-0.043
 obstacle-0.041
Dean-0.040
Token home
Token ID1363
Feature activation-1.19e+00

Pos logit contributions
 thirteen+0.009
dy+0.009
die+0.008
 obstacle+0.008
Dean+0.008
Neg logit contributions
-0.007
 supplies-0.006
ird-0.006
 markers-0.006
vell-0.006
Token and
Token ID290
Feature activation+9.47e-02

Pos logit contributions
+0.043
enge+0.038
ird+0.038
 markers+0.038
 supplies+0.037
Neg logit contributions
dy-0.041
 thirteen-0.039
die-0.037
Meanwhile-0.034
He-0.034
Token shared
Token ID4888
Feature activation+1.14e+00

Pos logit contributions
 thirteen+0.004
dy+0.004
die+0.004
Dean+0.003
 obstacle+0.003
Neg logit contributions
-0.003
 supplies-0.003
ird-0.002
vell-0.002
 markers-0.002
Token his
Token ID465
Feature activation-2.91e-01

Pos logit contributions
 thirteen+0.049
dy+0.046
die+0.045
 obstacle+0.041
Dean+0.040
Neg logit contributions
-0.035
 supplies-0.031
ird-0.030
vell-0.029
 markers-0.028
Token story
Token ID1621
Feature activation+4.35e+00

Pos logit contributions
+0.007
 supplies+0.006
ird+0.006
 markers+0.005
vell+0.005
Neg logit contributions
 thirteen-0.014
dy-0.014
die-0.013
 obstacle-0.013
Dean-0.012
Token with
Token ID351
Feature activation-5.23e-01

Pos logit contributions
 thirteen+0.221
dy+0.207
die+0.203
 obstacle+0.192
Dean+0.188
Neg logit contributions
-0.097
 supplies-0.088
ird-0.080
 markers-0.077
 help-0.075
Token the
Token ID262
Feature activation-1.47e-01

Pos logit contributions
+0.024
ird+0.021
idents+0.020
coat+0.020
vell+0.020
Neg logit contributions
 thirteen-0.015
dy-0.015
die-0.014
 obstacle-0.013
He-0.012
Token other
Token ID584
Feature activation+1.62e+00

Pos logit contributions
 supplies+0.005
 help+0.005
+0.005
 markers+0.004
vell+0.004
Neg logit contributions
 thirteen-0.006
die-0.005
dy-0.005
Dean-0.005
Nik-0.005
Token ants
Token ID27842
Feature activation+1.75e+00

Pos logit contributions
 thirteen+0.065
die+0.055
Dean+0.053
dy+0.052
Nik+0.051
Neg logit contributions
 supplies-0.057
-0.056
 help-0.053
 markers-0.049
ird-0.046
Token.
Token ID13
Feature activation-2.01e-01

Pos logit contributions
 thirteen+0.112
die+0.105
dy+0.102
 obstacle+0.101
Dean+0.100
Neg logit contributions
 supplies-0.016
-0.016
 together-0.015
 help-0.014
 decor-0.011
Token Mason
Token ID14737
Feature activation+6.54e-01

Pos logit contributions
 supplies+0.007
+0.007
 help+0.007
 Bian+0.006
 show+0.006
Neg logit contributions
 thirteen-0.008
dy-0.007
die-0.006
 obstacle-0.006
 seventeen-0.006
Token spun
Token ID26843
Feature activation+2.54e-01

Pos logit contributions
 thirteen+0.031
dy+0.029
die+0.029
 obstacle+0.027
reetings+0.025
Neg logit contributions
-0.016
ird-0.016
 supplies-0.016
 help-0.015
 fixed-0.014
Token off
Token ID572
Feature activation+2.82e-01

Pos logit contributions
 thirteen+0.011
dy+0.010
die+0.010
 obstacle+0.009
Dean+0.009
Neg logit contributions
-0.008
 supplies-0.007
ird-0.007
 markers-0.006
vell-0.006
Token into
Token ID656
Feature activation+1.90e-01

Pos logit contributions
 thirteen+0.013
dy+0.012
die+0.012
 obstacle+0.011
Dean+0.011
Neg logit contributions
-0.007
 supplies-0.007
ird-0.006
 markers-0.006
vell-0.006
Token the
Token ID262
Feature activation+9.38e-01

Pos logit contributions
 thirteen+0.010
dy+0.009
die+0.009
 obstacle+0.008
Dean+0.008
Neg logit contributions
-0.004
 supplies-0.004
 markers-0.004
ird-0.003
 help-0.003
Token sunset
Token ID26428
Feature activation+4.29e+00

Pos logit contributions
 thirteen+0.041
dy+0.039
die+0.038
Dean+0.034
reetings+0.034
Neg logit contributions
-0.028
 supplies-0.027
 help-0.024
 markers-0.024
 fixed-0.023
Token.
Token ID13
Feature activation-2.74e-01

Pos logit contributions
 thirteen+0.248
dy+0.233
die+0.232
 obstacle+0.218
Dean+0.216
Neg logit contributions
-0.070
 supplies-0.060
ird-0.050
 markers-0.048
 help-0.047
Token<|endoftext|>
Token ID50256
Feature activation+3.30e-01

Pos logit contributions
+0.009
 supplies+0.008
ird+0.007
 markers+0.007
 help+0.007
Neg logit contributions
 thirteen-0.011
dy-0.011
die-0.010
 obstacle-0.010
Dean-0.009
TokenOnce
Token ID7454
Feature activation+2.45e-01

Pos logit contributions
 thirteen+0.012
dy+0.011
die+0.011
Today+0.010
Dean+0.010
Neg logit contributions
-0.012
 supplies-0.011
ird-0.011
 markers-0.011
vell-0.010
Token upon
Token ID2402
Feature activation-7.88e-01

Pos logit contributions
 thirteen+0.007
dy+0.007
die+0.007
 obstacle+0.006
Dean+0.006
Neg logit contributions
-0.010
 supplies-0.010
ird-0.009
 markers-0.009
vell-0.009
Token a
Token ID257
Feature activation-1.02e+00

Pos logit contributions
+0.022
 supplies+0.020
ird+0.018
vell+0.018
 markers+0.018
Neg logit contributions
 thirteen-0.035
dy-0.034
die-0.033
Today-0.031
He-0.030
Token the
Token ID262
Feature activation-4.38e-01

Pos logit contributions
 thirteen+0.010
dy+0.009
die+0.009
 obstacle+0.008
Dean+0.008
Neg logit contributions
-0.006
 supplies-0.005
 help-0.005
ird-0.005
 decor-0.005
Token zoo
Token ID26626
Feature activation+3.36e-01

Pos logit contributions
 supplies+0.015
 help+0.015
+0.015
 decor+0.013
ird+0.013
Neg logit contributions
 thirteen-0.016
die-0.015
dy-0.015
Dean-0.014
 obstacle-0.013
Token,"
Token ID553
Feature activation-1.15e+00

Pos logit contributions
 thirteen+0.019
dy+0.018
die+0.018
 obstacle+0.017
Dean+0.017
Neg logit contributions
-0.005
 supplies-0.005
ird-0.004
 help-0.004
vell-0.003
Token the
Token ID262
Feature activation+1.94e-01

Pos logit contributions
 help+0.021
 supplies+0.019
 yes+0.017
 Tim+0.017
+0.017
Neg logit contributions
 thirteen-0.061
dy-0.061
die-0.060
reetings-0.056
 obstacle-0.056
Token zo
Token ID40565
Feature activation+4.60e-01

Pos logit contributions
 thirteen+0.007
dy+0.006
die+0.006
 obstacle+0.006
Dean+0.005
Neg logit contributions
-0.007
 supplies-0.007
 help-0.007
ird-0.007
 markers-0.006
Tokenoke
Token ID2088
Feature activation+4.29e+00

Pos logit contributions
 thirteen+0.012
die+0.012
dy+0.011
 obstacle+0.009
Today+0.009
Neg logit contributions
-0.021
ird-0.020
 supplies-0.020
 help-0.020
vell-0.019
Tokeneper
Token ID5723
Feature activation-3.85e-01

Pos logit contributions
 thirteen+0.128
die+0.124
dy+0.115
 obstacle+0.112
Today+0.104
Neg logit contributions
 supplies-0.165
ird-0.164
 help-0.162
-0.158
 fixed-0.156
Token says
Token ID1139
Feature activation+5.33e-01

Pos logit contributions
 supplies+0.009
 help+0.009
ird+0.008
+0.008
 markers+0.008
Neg logit contributions
 thirteen-0.018
die-0.017
dy-0.017
 obstacle-0.016
Today-0.016
Token.
Token ID13
Feature activation+5.43e-01

Pos logit contributions
 thirteen+0.027
dy+0.026
die+0.025
 obstacle+0.023
Dean+0.023
Neg logit contributions
-0.012
 supplies-0.011
ird-0.010
 markers-0.009
vell-0.009
Token "
Token ID366
Feature activation-4.71e-01

Pos logit contributions
 thirteen+0.024
dy+0.023
die+0.022
 obstacle+0.022
Dean+0.020
Neg logit contributions
-0.016
 supplies-0.014
 help-0.013
 markers-0.013
 fixed-0.012
TokenDon
Token ID3987
Feature activation-8.20e-01

Pos logit contributions
+0.015
 together+0.014
 help+0.013
 yes+0.013
 fixed+0.012
Neg logit contributions
dy-0.019
 obstacle-0.018
 thirteen-0.018
die-0.017
mo-0.015
Token walked
Token ID6807
Feature activation+5.71e-02

Pos logit contributions
+0.004
 supplies+0.004
ird+0.004
 markers+0.004
vell+0.004
Neg logit contributions
 thirteen-0.006
dy-0.005
die-0.005
 obstacle-0.005
Dean-0.005
Token over
Token ID625
Feature activation+3.59e-01

Pos logit contributions
 thirteen+0.003
dy+0.003
die+0.002
 obstacle+0.002
Dean+0.002
Neg logit contributions
 supplies-0.001
-0.001
 help-0.001
 together-0.001
 markers-0.001
Token to
Token ID284
Feature activation+2.03e-01

Pos logit contributions
 thirteen+0.016
dy+0.015
die+0.014
 obstacle+0.013
Dean+0.013
Neg logit contributions
-0.010
ird-0.009
 supplies-0.009
vell-0.009
 markers-0.008
Token the
Token ID262
Feature activation+3.12e-01

Pos logit contributions
 thirteen+0.008
dy+0.007
die+0.007
Dean+0.007
 obstacle+0.007
Neg logit contributions
-0.007
ird-0.006
vell-0.006
 supplies-0.006
 markers-0.006
Token zo
Token ID40565
Feature activation+6.55e-01

Pos logit contributions
 thirteen+0.012
die+0.011
dy+0.011
 obstacle+0.010
Dean+0.010
Neg logit contributions
 supplies-0.010
-0.010
 help-0.010
 show-0.009
ird-0.009
Tokenoke
Token ID2088
Feature activation+4.26e+00

Pos logit contributions
 thirteen+0.017
die+0.017
dy+0.017
 obstacle+0.014
Today+0.013
Neg logit contributions
-0.029
ird-0.028
 supplies-0.028
 help-0.027
vell-0.027
Tokeneper
Token ID5723
Feature activation-4.28e-01

Pos logit contributions
die+0.144
 thirteen+0.140
dy+0.133
 obstacle+0.128
Today+0.117
Neg logit contributions
 supplies-0.152
ird-0.148
 help-0.146
 together-0.145
 cr-0.139
Token and
Token ID290
Feature activation+2.73e-01

Pos logit contributions
+0.010
 supplies+0.010
ird+0.009
 help+0.009
 markers+0.009
Neg logit contributions
 thirteen-0.021
die-0.020
dy-0.020
 obstacle-0.019
Dean-0.018
Token asked
Token ID1965
Feature activation+9.19e-01

Pos logit contributions
 thirteen+0.013
dy+0.012
die+0.012
 obstacle+0.011
Dean+0.011
Neg logit contributions
-0.007
 supplies-0.006
ird-0.006
 markers-0.006
 help-0.006
Token him
Token ID683
Feature activation+8.03e-02

Pos logit contributions
 thirteen+0.044
dy+0.041
die+0.041
 obstacle+0.038
Dean+0.037
Neg logit contributions
-0.023
 supplies-0.021
ird-0.019
 markers-0.019
 help-0.019
Token if
Token ID611
Feature activation+7.15e-01

Pos logit contributions
 thirteen+0.004
 obstacle+0.004
die+0.004
dy+0.004
Dean+0.003
Neg logit contributions
 help-0.002
 yes-0.002
 supplies-0.002
-0.002
 together-0.002
Token.
Token ID13
Feature activation+2.50e-01

Pos logit contributions
 thirteen+0.010
dy+0.010
die+0.009
 obstacle+0.009
Today+0.009
Neg logit contributions
-0.006
 supplies-0.005
ird-0.005
 markers-0.005
vell-0.004
Token He
Token ID679
Feature activation+2.67e-02

Pos logit contributions
dy+0.008
 thirteen+0.008
die+0.008
He+0.007
Dean+0.007
Neg logit contributions
-0.009
ird-0.009
 supplies-0.009
vell-0.009
 markers-0.008
Token asked
Token ID1965
Feature activation+4.46e-01

Pos logit contributions
 thirteen+0.001
dy+0.001
die+0.001
 obstacle+0.001
Dean+0.001
Neg logit contributions
-0.001
 supplies-0.001
ird-0.001
 markers-0.001
 help-0.001
Token the
Token ID262
Feature activation+4.63e-01

Pos logit contributions
 thirteen+0.014
dy+0.014
die+0.014
 moral+0.013
 obstacle+0.013
Neg logit contributions
-0.020
ird-0.016
vell-0.016
idents-0.015
 supplies-0.015
Token zo
Token ID40565
Feature activation+1.04e+00

Pos logit contributions
 thirteen+0.015
die+0.014
dy+0.014
 obstacle+0.012
Dean+0.012
Neg logit contributions
-0.018
 supplies-0.018
 help-0.017
ird-0.016
 decor-0.016
Tokenoke
Token ID2088
Feature activation+4.26e+00

Pos logit contributions
 thirteen+0.026
die+0.024
dy+0.024
 obstacle+0.020
Dean+0.019
Neg logit contributions
-0.048
 supplies-0.046
ird-0.046
 help-0.045
vell-0.045
Tokeneper
Token ID5723
Feature activation+2.54e-01

Pos logit contributions
 thirteen+0.125
die+0.114
dy+0.108
 obstacle+0.104
Dean+0.096
Neg logit contributions
 supplies-0.173
 help-0.171
 together-0.163
ird-0.161
-0.161
Token if
Token ID611
Feature activation+5.66e-01

Pos logit contributions
 thirteen+0.014
 obstacle+0.014
die+0.014
dy+0.013
Dean+0.013
Neg logit contributions
 help-0.004
 together-0.003
 supplies-0.003
 yes-0.003
 online-0.003
Token he
Token ID339
Feature activation+8.98e-01

Pos logit contributions
 thirteen+0.016
dy+0.015
die+0.014
Dean+0.013
 obstacle+0.013
Neg logit contributions
-0.025
ird-0.023
 supplies-0.022
 markers-0.021
vell-0.021
Token could
Token ID714
Feature activation+5.88e-01

Pos logit contributions
 thirteen+0.042
dy+0.041
die+0.039
 moral+0.036
 obstacle+0.036
Neg logit contributions
-0.023
 supplies-0.021
 markers-0.018
vell-0.018
ird-0.017
Token borrow
Token ID8804
Feature activation+4.26e-01

Pos logit contributions
 thirteen+0.024
dy+0.021
die+0.020
 moral+0.019
 obstacle+0.019
Neg logit contributions
-0.018
 supplies-0.018
 markers-0.017
vell-0.016
ird-0.016
Token,
Token ID11
Feature activation-2.89e-01

Pos logit contributions
+0.004
 supplies+0.004
ird+0.003
 help+0.003
 markers+0.003
Neg logit contributions
 thirteen-0.014
dy-0.013
die-0.013
 obstacle-0.012
Dean-0.012
Token kids
Token ID3988
Feature activation-2.42e-01

Pos logit contributions
+0.008
 supplies+0.008
ird+0.007
 help+0.007
 markers+0.007
Neg logit contributions
 thirteen-0.013
dy-0.012
die-0.012
 obstacle-0.011
Dean-0.011
Token!"
Token ID2474
Feature activation-7.68e-01

Pos logit contributions
+0.008
 markers+0.007
 supplies+0.007
 brand+0.006
vell+0.006
Neg logit contributions
 thirteen-0.010
dy-0.009
 obstacle-0.008
die-0.008
He-0.008
Token the
Token ID262
Feature activation+3.20e-01

Pos logit contributions
+0.025
 supplies+0.022
ird+0.021
vell+0.021
 markers+0.021
Neg logit contributions
 thirteen-0.032
dy-0.029
die-0.028
 obstacle-0.026
Dean-0.026
Token zo
Token ID40565
Feature activation+5.03e-01

Pos logit contributions
 thirteen+0.012
dy+0.011
die+0.011
 obstacle+0.010
Today+0.010
Neg logit contributions
 help-0.011
 supplies-0.011
-0.010
ird-0.010
 fixed-0.009
Tokenoke
Token ID2088
Feature activation+4.25e+00

Pos logit contributions
 thirteen+0.011
dy+0.010
die+0.009
 obstacle+0.008
Dean+0.007
Neg logit contributions
-0.026
 supplies-0.025
ird-0.024
 markers-0.023
 help-0.023
Tokeneper
Token ID5723
Feature activation-4.36e-01

Pos logit contributions
 thirteen+0.124
die+0.118
dy+0.112
 obstacle+0.104
Dean+0.098
Neg logit contributions
 supplies-0.170
ird-0.166
 help-0.165
-0.164
 fixed-0.159
Token said
Token ID531
Feature activation+3.94e-01

Pos logit contributions
+0.012
 supplies+0.010
 markers+0.009
ird+0.009
vell+0.009
Neg logit contributions
 thirteen-0.021
dy-0.020
die-0.019
Dean-0.017
 obstacle-0.017
Token.
Token ID13
Feature activation+6.68e-01

Pos logit contributions
 thirteen+0.019
dy+0.017
die+0.016
He+0.015
 obstacle+0.015
Neg logit contributions
-0.011
 supplies-0.010
 markers-0.009
ird-0.008
vell-0.008
Token "
Token ID366
Feature activation-2.88e-01

Pos logit contributions
 thirteen+0.031
dy+0.029
die+0.028
 obstacle+0.027
Dean+0.026
Neg logit contributions
-0.018
 supplies-0.016
 help-0.015
ird-0.015
 fixed-0.014
TokenThe
Token ID464
Feature activation-2.56e-01

Pos logit contributions
+0.009
 help+0.008
 supplies+0.008
 together+0.008
 fixed+0.007
Neg logit contributions
dy-0.012
 thirteen-0.011
die-0.011
 obstacle-0.011
 instant-0.010
Token the
Token ID262
Feature activation+5.09e-02

Pos logit contributions
+0.000
 supplies+0.000
ird+0.000
 help+0.000
 markers+0.000
Neg logit contributions
 thirteen-0.000
dy-0.000
die-0.000
 obstacle-0.000
Dean-0.000
Token theater
Token ID13766
Feature activation+3.24e-01

Pos logit contributions
 thirteen+0.002
dy+0.002
die+0.002
 obstacle+0.002
Dean+0.002
Neg logit contributions
-0.001
 supplies-0.001
ird-0.001
 help-0.001
 markers-0.001
Token and
Token ID290
Feature activation-8.37e-02

Pos logit contributions
 thirteen+0.017
dy+0.016
die+0.016
 obstacle+0.015
Dean+0.014
Neg logit contributions
-0.007
 supplies-0.006
ird-0.005
 markers-0.005
 help-0.005
Token share
Token ID2648
Feature activation+9.58e-01

Pos logit contributions
+0.002
 supplies+0.002
 help+0.002
ird+0.002
 markers+0.002
Neg logit contributions
 thirteen-0.004
dy-0.004
die-0.004
 obstacle-0.003
Dean-0.003
Token her
Token ID607
Feature activation-2.64e-01

Pos logit contributions
 thirteen+0.048
dy+0.045
die+0.044
 obstacle+0.042
Dean+0.041
Neg logit contributions
-0.022
 supplies-0.020
ird-0.018
 markers-0.018
 help-0.017
Token story
Token ID1621
Feature activation+4.21e+00

Pos logit contributions
+0.008
 supplies+0.007
ird+0.007
 markers+0.006
 help+0.006
Neg logit contributions
 thirteen-0.012
dy-0.011
die-0.011
 obstacle-0.010
Dean-0.010
Token with
Token ID351
Feature activation-6.31e-01

Pos logit contributions
 thirteen+0.217
dy+0.204
die+0.200
 obstacle+0.190
Dean+0.185
Neg logit contributions
-0.090
 supplies-0.082
ird-0.074
 markers-0.071
 help-0.069
Token others
Token ID1854
Feature activation+1.40e+00

Pos logit contributions
+0.020
 supplies+0.017
vell+0.016
ird+0.016
 markers+0.015
Neg logit contributions
 thirteen-0.026
dy-0.025
die-0.025
Dean-0.023
 moral-0.022
Token.
Token ID13
Feature activation-4.04e-01

Pos logit contributions
 thirteen+0.081
dy+0.077
die+0.075
 obstacle+0.072
Dean+0.071
Neg logit contributions
-0.021
 supplies-0.018
ird-0.016
 help-0.015
 markers-0.014
Token The
Token ID383
Feature activation-5.70e-01

Pos logit contributions
+0.011
 supplies+0.010
ird+0.009
 markers+0.008
 help+0.008
Neg logit contributions
 thirteen-0.019
dy-0.018
die-0.017
 obstacle-0.017
Dean-0.016
Token brave
Token ID14802
Feature activation-6.76e-01

Pos logit contributions
+0.021
 supplies+0.019
ird+0.018
 markers+0.018
vell+0.017
Neg logit contributions
 thirteen-0.021
dy-0.019
die-0.019
 obstacle-0.017
Dean-0.017
Token.
Token ID13
Feature activation+6.07e-01

Pos logit contributions
+0.009
 supplies+0.008
ird+0.007
 markers+0.007
vell+0.007
Neg logit contributions
 thirteen-0.027
dy-0.026
die-0.025
 obstacle-0.024
Dean-0.023
Token They
Token ID1119
Feature activation-4.51e-01

Pos logit contributions
 thirteen+0.028
dy+0.026
 obstacle+0.026
die+0.026
Dean+0.025
Neg logit contributions
-0.016
 supplies-0.014
 Tim-0.013
 together-0.013
 markers-0.013
Token thanked
Token ID26280
Feature activation-1.04e+00

Pos logit contributions
+0.017
anger+0.015
 supplies+0.015
 markers+0.015
resses+0.014
Neg logit contributions
dy-0.015
 thirteen-0.015
He-0.014
die-0.014
 moral-0.013
Token the
Token ID262
Feature activation-1.52e-01

Pos logit contributions
+0.034
ird+0.033
vell+0.032
 supplies+0.031
 markers+0.029
Neg logit contributions
 thirteen-0.042
dy-0.037
die-0.035
Dean-0.035
 moral-0.035
Token zo
Token ID40565
Feature activation+5.31e-01

Pos logit contributions
+0.006
 supplies+0.006
 help+0.005
vell+0.005
ird+0.005
Neg logit contributions
 thirteen-0.005
die-0.005
dy-0.005
 obstacle-0.004
Dean-0.004
Tokenoke
Token ID2088
Feature activation+4.20e+00

Pos logit contributions
 thirteen+0.012
dy+0.011
die+0.011
 obstacle+0.009
Dean+0.008
Neg logit contributions
-0.026
 supplies-0.025
ird-0.024
 help-0.024
 markers-0.024
Tokeneper
Token ID5723
Feature activation-4.67e-01

Pos logit contributions
 thirteen+0.115
die+0.105
dy+0.101
 obstacle+0.092
Dean+0.086
Neg logit contributions
 supplies-0.180
-0.176
 help-0.175
ird-0.171
vell-0.168
Token and
Token ID290
Feature activation-1.06e-01

Pos logit contributions
+0.010
 supplies+0.010
 help+0.009
ird+0.009
 markers+0.008
Neg logit contributions
 thirteen-0.023
dy-0.022
die-0.022
 obstacle-0.021
Dean-0.020
Token ran
Token ID4966
Feature activation+3.17e-01

Pos logit contributions
+0.004
vell+0.003
anger+0.003
 supplies+0.003
ird+0.003
Neg logit contributions
 thirteen-0.004
dy-0.004
die-0.004
 moral-0.004
Dean-0.004
Token to
Token ID284
Feature activation-2.46e-01

Pos logit contributions
 thirteen+0.012
dy+0.011
die+0.011
 obstacle+0.010
Dean+0.010
Neg logit contributions
-0.011
ird-0.010
 supplies-0.010
vell-0.009
 markers-0.009
Token their
Token ID511
Feature activation-4.07e-01

Pos logit contributions
+0.008
ird+0.008
vell+0.008
 supplies+0.007
 markers+0.007
Neg logit contributions
 thirteen-0.009
dy-0.009
die-0.009
Dean-0.008
 obstacle-0.008
Token,
Token ID11
Feature activation+5.84e-02

Pos logit contributions
 thirteen+0.005
die+0.004
Dean+0.004
dy+0.004
Today+0.004
Neg logit contributions
 help-0.001
 supplies-0.001
idents-0.001
 Tim-0.001
-0.001
Token sorry
Token ID7926
Feature activation+4.22e-01

Pos logit contributions
die+0.003
 thirteen+0.003
dy+0.003
Today+0.003
Dean+0.002
Neg logit contributions
 help-0.001
-0.001
 yes-0.001
 supplies-0.001
 Tim-0.001
Token!"
Token ID2474
Feature activation-4.69e-01

Pos logit contributions
 thirteen+0.026
die+0.023
Dean+0.023
dy+0.023
Today+0.022
Neg logit contributions
 help-0.005
 supplies-0.004
-0.004
idents-0.004
 fixed-0.003
Token the
Token ID262
Feature activation+4.12e-01

Pos logit contributions
+0.012
 supplies+0.011
ird+0.010
 markers+0.010
 help+0.010
Neg logit contributions
 thirteen-0.022
dy-0.021
die-0.020
 obstacle-0.019
Dean-0.018
Token zo
Token ID40565
Feature activation+6.85e-01

Pos logit contributions
 thirteen+0.018
die+0.016
dy+0.016
reetings+0.015
Today+0.015
Neg logit contributions
 supplies-0.012
 help-0.011
-0.011
 show-0.011
 wrap-0.010
Tokenoke
Token ID2088
Feature activation+4.20e+00

Pos logit contributions
die+0.021
dy+0.020
 thirteen+0.019
Today+0.017
 obstacle+0.016
Neg logit contributions
-0.028
ird-0.028
 help-0.027
 supplies-0.026
vell-0.026
Tokenep
Token ID538
Feature activation-1.06e+00

Pos logit contributions
die+0.146
 thirteen+0.135
dy+0.130
Today+0.129
 obstacle+0.126
Neg logit contributions
 help-0.144
ird-0.141
 together-0.141
 supplies-0.141
 fixed-0.135
Tokeners
Token ID364
Feature activation-6.28e-01

Pos logit contributions
 supplies+0.025
+0.024
 fixed+0.023
 help+0.023
ird+0.022
Neg logit contributions
 thirteen-0.048
die-0.047
dy-0.047
 obstacle-0.044
Today-0.043
Token said
Token ID531
Feature activation+3.21e-01

Pos logit contributions
+0.014
 help+0.013
 supplies+0.013
 yes+0.013
 fixed+0.012
Neg logit contributions
die-0.031
 thirteen-0.030
dy-0.030
 obstacle-0.028
Today-0.027
Token.
Token ID13
Feature activation+4.62e-01

Pos logit contributions
 thirteen+0.017
dy+0.016
die+0.015
 obstacle+0.014
Dean+0.014
Neg logit contributions
-0.007
 supplies-0.006
ird-0.006
 markers-0.005
vell-0.005
Token "
Token ID366
Feature activation-4.06e-01

Pos logit contributions
 thirteen+0.019
dy+0.018
die+0.018
 obstacle+0.017
Dean+0.016
Neg logit contributions
-0.015
 supplies-0.013
ird-0.012
 markers-0.012
 help-0.012
Token it
Token ID340
Feature activation-3.18e-01

Pos logit contributions
+0.006
 supplies+0.005
ird+0.005
 markers+0.005
vell+0.005
Neg logit contributions
 thirteen-0.006
dy-0.006
die-0.005
 obstacle-0.005
Dean-0.005
Token ran
Token ID4966
Feature activation-1.06e-01

Pos logit contributions
+0.010
 supplies+0.009
ird+0.009
 help+0.008
 markers+0.008
Neg logit contributions
 thirteen-0.013
dy-0.012
die-0.012
 obstacle-0.011
Dean-0.011
Token off
Token ID572
Feature activation+4.97e-01

Pos logit contributions
+0.003
 supplies+0.003
ird+0.003
 markers+0.002
 help+0.002
Neg logit contributions
 thirteen-0.005
dy-0.004
die-0.004
 obstacle-0.004
Dean-0.004
Token into
Token ID656
Feature activation+5.78e-01

Pos logit contributions
 thirteen+0.025
dy+0.024
die+0.023
 obstacle+0.022
Dean+0.022
Neg logit contributions
-0.011
 supplies-0.010
 help-0.009
ird-0.009
 markers-0.009
Token the
Token ID262
Feature activation+8.40e-01

Pos logit contributions
 thirteen+0.032
dy+0.030
die+0.029
reetings+0.027
Dean+0.027
Neg logit contributions
 supplies-0.011
 help-0.010
-0.010
 decor-0.009
 markers-0.008
Token sunset
Token ID26428
Feature activation+4.20e+00

Pos logit contributions
 thirteen+0.045
Those+0.038
reetings+0.037
 seventeen+0.037
dy+0.037
Neg logit contributions
 supplies-0.016
 help-0.015
 show-0.015
 wrap-0.015
 brand-0.014
Token.
Token ID13
Feature activation-2.38e-01

Pos logit contributions
 thirteen+0.248
dy+0.232
die+0.231
 obstacle+0.219
Dean+0.216
Neg logit contributions
-0.065
 supplies-0.056
ird-0.044
 help-0.042
 markers-0.042
Token He
Token ID679
Feature activation-3.92e-01

Pos logit contributions
+0.008
 supplies+0.007
ird+0.006
 help+0.006
 fixed+0.006
Neg logit contributions
 thirteen-0.010
dy-0.010
die-0.009
 obstacle-0.009
Dean-0.008
Token knew
Token ID2993
Feature activation-1.14e+00

Pos logit contributions
+0.011
 supplies+0.010
ird+0.009
 help+0.009
 fixed+0.009
Neg logit contributions
 thirteen-0.018
dy-0.016
die-0.016
 obstacle-0.015
Dean-0.015
Token he
Token ID339
Feature activation+8.21e-03

Pos logit contributions
+0.029
 supplies+0.028
 help+0.026
ird+0.025
 markers+0.025
Neg logit contributions
 thirteen-0.054
dy-0.050
die-0.049
 obstacle-0.046
Dean-0.045
Token had
Token ID550
Feature activation-4.45e-01

Pos logit contributions
 thirteen+0.000
dy+0.000
die+0.000
 obstacle+0.000
Dean+0.000
Neg logit contributions
-0.000
 supplies-0.000
 markers-0.000
ird-0.000
vell-0.000
Token.
Token ID13
Feature activation+1.11e-01

Pos logit contributions
 thirteen+0.028
dy+0.027
die+0.026
 obstacle+0.025
Dean+0.025
Neg logit contributions
-0.008
 supplies-0.007
ird-0.006
 help-0.006
 markers-0.006
Token Ben
Token ID3932
Feature activation-1.88e-01

Pos logit contributions
 thirteen+0.004
dy+0.004
die+0.004
 obstacle+0.004
Dean+0.004
Neg logit contributions
-0.004
 supplies-0.003
ird-0.003
 markers-0.003
 help-0.003
Token asked
Token ID1965
Feature activation+3.42e-01

Pos logit contributions
+0.006
 supplies+0.006
ird+0.005
 markers+0.005
vell+0.005
Neg logit contributions
 thirteen-0.007
dy-0.007
die-0.007
Dean-0.006
 obstacle-0.006
Token the
Token ID262
Feature activation+1.74e-01

Pos logit contributions
 thirteen+0.013
dy+0.012
die+0.011
 moral+0.011
 obstacle+0.010
Neg logit contributions
-0.013
ird-0.011
 supplies-0.011
vell-0.011
 markers-0.010
Token zo
Token ID40565
Feature activation+7.42e-01

Pos logit contributions
 thirteen+0.007
die+0.007
dy+0.006
 obstacle+0.006
Dean+0.006
Neg logit contributions
 help-0.006
 supplies-0.005
-0.005
 show-0.005
 decor-0.005
Tokenoke
Token ID2088
Feature activation+4.19e+00

Pos logit contributions
 thirteen+0.018
die+0.017
dy+0.017
 obstacle+0.014
Dean+0.013
Neg logit contributions
-0.035
 supplies-0.034
ird-0.033
 help-0.032
vell-0.032
Tokeneper
Token ID5723
Feature activation-1.12e-01

Pos logit contributions
 thirteen+0.120
die+0.108
dy+0.103
 obstacle+0.100
Dean+0.091
Neg logit contributions
 supplies-0.175
 help-0.170
ird-0.166
-0.166
 fixed-0.162
Token if
Token ID611
Feature activation+2.88e-01

Pos logit contributions
 help+0.002
 supplies+0.002
 yes+0.002
 together+0.001
 online+0.001
Neg logit contributions
 thirteen-0.006
 obstacle-0.006
die-0.006
dy-0.006
Dean-0.006
Token he
Token ID339
Feature activation+5.80e-01

Pos logit contributions
 thirteen+0.011
dy+0.010
die+0.010
 obstacle+0.009
Dean+0.009
Neg logit contributions
-0.010
 supplies-0.009
ird-0.009
 markers-0.009
vell-0.008
Token could
Token ID714
Feature activation+3.50e-01

Pos logit contributions
 thirteen+0.026
dy+0.026
die+0.024
 moral+0.022
Dean+0.022
Neg logit contributions
-0.016
 supplies-0.015
vell-0.013
 markers-0.013
ird-0.012
Token touch
Token ID3638
Feature activation-2.19e-01

Pos logit contributions
 thirteen+0.016
dy+0.015
die+0.015
 obstacle+0.014
Dean+0.014
Neg logit contributions
-0.010
 help-0.009
 supplies-0.008
ird-0.008
 decor-0.008
Token.
Token ID13
Feature activation+3.27e-01

Pos logit contributions
+0.052
 supplies+0.049
enge+0.048
 markers+0.044
vell+0.044
Neg logit contributions
dy-0.050
 thirteen-0.050
He-0.048
 moral-0.046
Today-0.045
Token They
Token ID1119
Feature activation-6.24e-01

Pos logit contributions
 thirteen+0.010
dy+0.009
die+0.009
Those+0.008
Dean+0.008
Neg logit contributions
vell-0.013
-0.013
 supplies-0.013
ird-0.012
 markers-0.012
Token asked
Token ID1965
Feature activation+1.64e-01

Pos logit contributions
+0.031
uggle+0.030
 supplies+0.028
anger+0.028
idents+0.028
Neg logit contributions
dy-0.014
He-0.013
die-0.012
 thirteen-0.011
 remembers-0.011
Token the
Token ID262
Feature activation+1.94e-01

Pos logit contributions
 thirteen+0.007
dy+0.007
die+0.006
 moral+0.006
 obstacle+0.006
Neg logit contributions
-0.005
vell-0.004
 supplies-0.004
ird-0.004
 markers-0.004
Token zo
Token ID40565
Feature activation+6.41e-01

Pos logit contributions
 thirteen+0.008
die+0.007
dy+0.007
 obstacle+0.006
reetings+0.006
Neg logit contributions
 help-0.007
-0.007
 supplies-0.006
 show-0.006
 cr-0.006
Tokenoke
Token ID2088
Feature activation+4.19e+00

Pos logit contributions
 thirteen+0.016
dy+0.015
die+0.015
 obstacle+0.012
Dean+0.011
Neg logit contributions
-0.030
 supplies-0.029
ird-0.029
 help-0.028
vell-0.028
Tokenep
Token ID538
Feature activation-1.02e+00

Pos logit contributions
 thirteen+0.111
dy+0.095
die+0.095
 obstacle+0.086
Dean+0.079
Neg logit contributions
 supplies-0.187
-0.187
 help-0.179
ird-0.178
 markers-0.173
Tokeners
Token ID364
Feature activation-4.64e-01

Pos logit contributions
+0.023
 supplies+0.020
ird+0.018
 markers+0.018
 help+0.017
Neg logit contributions
 thirteen-0.052
dy-0.049
die-0.048
 obstacle-0.045
Dean-0.045
Token,
Token ID11
Feature activation+3.17e-01

Pos logit contributions
+0.013
 supplies+0.010
 markers+0.010
ird+0.010
vell+0.010
Neg logit contributions
 thirteen-0.021
dy-0.020
die-0.019
 moral-0.018
Dean-0.018
Token the
Token ID262
Feature activation-1.59e-01

Pos logit contributions
 thirteen+0.014
dy+0.013
die+0.012
 obstacle+0.011
Dean+0.011
Neg logit contributions
-0.010
 supplies-0.009
ird-0.008
 markers-0.008
vell-0.008
Token other
Token ID584
Feature activation+1.51e+00

Pos logit contributions
 help+0.005
+0.005
 supplies+0.005
 show+0.005
 cr+0.004
Neg logit contributions
die-0.006
dy-0.006
 thirteen-0.006
iet-0.005
 obstacle-0.005
Token and
Token ID290
Feature activation-2.59e-02

Pos logit contributions
+0.000
 supplies+0.000
 help+0.000
ird+0.000
 markers+0.000
Neg logit contributions
 thirteen-0.000
dy-0.000
die-0.000
 obstacle-0.000
Dean-0.000
Token flew
Token ID13112
Feature activation+4.77e-01

Pos logit contributions
+0.001
 supplies+0.001
ird+0.001
 markers+0.001
 help+0.001
Neg logit contributions
 thirteen-0.001
dy-0.001
die-0.001
 obstacle-0.001
Dean-0.001
Token away
Token ID1497
Feature activation-5.97e-01

Pos logit contributions
 thirteen+0.022
dy+0.021
die+0.020
 obstacle+0.019
Dean+0.019
Neg logit contributions
-0.013
 supplies-0.012
ird-0.011
 markers-0.011
 help-0.010
Token into
Token ID656
Feature activation+3.16e-01

Pos logit contributions
+0.013
 supplies+0.012
ird+0.011
 markers+0.011
vell+0.010
Neg logit contributions
 thirteen-0.030
dy-0.029
die-0.028
 obstacle-0.027
Dean-0.026
Token the
Token ID262
Feature activation+7.95e-01

Pos logit contributions
 thirteen+0.016
dy+0.015
die+0.015
 obstacle+0.014
Dean+0.014
Neg logit contributions
-0.007
 supplies-0.006
 help-0.006
 markers-0.006
ird-0.006
Token sunset
Token ID26428
Feature activation+4.17e+00

Pos logit contributions
 thirteen+0.039
dy+0.033
die+0.032
reetings+0.031
Dean+0.031
Neg logit contributions
 supplies-0.021
-0.021
 fixed-0.019
 brand-0.019
 help-0.018
Token.
Token ID13
Feature activation-2.94e-01

Pos logit contributions
 thirteen+0.236
dy+0.223
die+0.220
 obstacle+0.208
Dean+0.205
Neg logit contributions
-0.071
 supplies-0.062
ird-0.053
 markers-0.051
 help-0.049
Token She
Token ID1375
Feature activation-3.78e-01

Pos logit contributions
+0.009
 supplies+0.007
ird+0.007
 markers+0.007
 Emmy+0.007
Neg logit contributions
 thirteen-0.013
dy-0.012
 obstacle-0.012
die-0.012
Dean-0.011
Token was
Token ID373
Feature activation-2.42e-01

Pos logit contributions
+0.012
 supplies+0.011
ird+0.010
 markers+0.010
vell+0.010
Neg logit contributions
 thirteen-0.016
dy-0.015
die-0.014
 obstacle-0.013
Dean-0.013
Token so
Token ID523
Feature activation-4.37e-01

Pos logit contributions
+0.004
 fixed+0.004
 supplies+0.004
 together+0.004
 help+0.004
Neg logit contributions
 thirteen-0.013
 obstacle-0.012
dy-0.012
die-0.012
Dean-0.011
Token thankful
Token ID25535
Feature activation-5.86e-01

Pos logit contributions
+0.012
 supplies+0.011
 help+0.011
ird+0.011
 fixed+0.010
Neg logit contributions
 thirteen-0.020
dy-0.018
die-0.018
 obstacle-0.017
Dean-0.016
Token told
Token ID1297
Feature activation-4.14e-01

Pos logit contributions
+0.008
 supplies+0.007
ird+0.007
vell+0.006
 markers+0.006
Neg logit contributions
 thirteen-0.010
dy-0.010
die-0.010
Dean-0.009
 obstacle-0.008
Token her
Token ID607
Feature activation-9.90e-01

Pos logit contributions
+0.012
 supplies+0.010
ird+0.010
vell+0.010
 markers+0.009
Neg logit contributions
 thirteen-0.019
dy-0.018
die-0.017
 obstacle-0.016
 moral-0.016
Token what
Token ID644
Feature activation-2.83e-01

Pos logit contributions
 help+0.017
 supplies+0.017
 yes+0.016
 decor+0.015
 fixed+0.015
Neg logit contributions
 thirteen-0.057
die-0.052
dy-0.050
 obstacle-0.049
Dean-0.049
Token he
Token ID339
Feature activation-1.49e+00

Pos logit contributions
+0.008
 supplies+0.007
 help+0.007
ird+0.007
 markers+0.007
Neg logit contributions
 thirteen-0.013
dy-0.012
die-0.012
 obstacle-0.011
Dean-0.011
Token had
Token ID550
Feature activation-2.00e+00

Pos logit contributions
+0.054
 supplies+0.049
ird+0.046
 markers+0.045
vell+0.044
Neg logit contributions
 thirteen-0.055
dy-0.054
die-0.052
 obstacle-0.046
Dean-0.045
Token seen
Token ID1775
Feature activation-4.83e+00

Pos logit contributions
+0.095
 Eve+0.078
uggle+0.077
 Bian+0.076
 Beth+0.074
Neg logit contributions
 thirteen-0.047
dy-0.045
 notices-0.044
 remembers-0.043
 moral-0.043
Token,
Token ID11
Feature activation-1.78e-01

Pos logit contributions
+0.166
 supplies+0.134
ird+0.134
 markers+0.133
 Bian+0.132
Neg logit contributions
 thirteen-0.189
dy-0.171
die-0.161
 obstacle-0.157
Today-0.152
Token and
Token ID290
Feature activation+2.61e-01

Pos logit contributions
+0.004
 supplies+0.004
 markers+0.004
ird+0.004
 help+0.004
Neg logit contributions
 thirteen-0.009
dy-0.008
die-0.008
 obstacle-0.007
Dean-0.007
Token his
Token ID465
Feature activation-6.64e-01

Pos logit contributions
 thirteen+0.012
dy+0.011
die+0.011
 obstacle+0.010
Dean+0.010
Neg logit contributions
-0.007
 supplies-0.007
ird-0.006
 markers-0.006
 help-0.006
Token mom
Token ID1995
Feature activation-7.11e-01

Pos logit contributions
+0.019
 supplies+0.017
ird+0.017
vell+0.016
 markers+0.016
Neg logit contributions
 thirteen-0.029
dy-0.028
die-0.027
 obstacle-0.025
 moral-0.024
Token knew
Token ID2993
Feature activation-6.04e-01

Pos logit contributions
+0.021
 supplies+0.019
vell+0.017
 markers+0.017
resses+0.017
Neg logit contributions
 thirteen-0.030
dy-0.029
die-0.028
 moral-0.025
 obstacle-0.025
Token told
Token ID1297
Feature activation-3.51e-01

Pos logit contributions
+0.007
 supplies+0.006
 markers+0.006
vell+0.006
ird+0.006
Neg logit contributions
 thirteen-0.007
dy-0.006
die-0.006
Dean-0.006
 moral-0.005
Token mummy
Token ID38567
Feature activation-8.24e-02

Pos logit contributions
+0.014
ird+0.013
vell+0.013
 supplies+0.012
anger+0.011
Neg logit contributions
 thirteen-0.012
die-0.011
dy-0.011
 moral-0.011
 obstacle-0.010
Token what
Token ID644
Feature activation-5.02e-02

Pos logit contributions
 supplies+0.002
 help+0.002
 markers+0.002
 together+0.002
 yes+0.001
Neg logit contributions
 thirteen-0.004
 obstacle-0.004
die-0.004
Dean-0.004
dy-0.004
Token she
Token ID673
Feature activation-1.71e+00

Pos logit contributions
+0.002
 supplies+0.001
vell+0.001
ird+0.001
 markers+0.001
Neg logit contributions
 thirteen-0.002
dy-0.002
die-0.002
 obstacle-0.002
 moral-0.002
Token had
Token ID550
Feature activation-1.91e+00

Pos logit contributions
+0.074
 supplies+0.065
ird+0.065
resses+0.062
 markers+0.062
Neg logit contributions
dy-0.052
die-0.050
 thirteen-0.048
 moral-0.044
 remembers-0.044
Token seen
Token ID1775
Feature activation-4.79e+00

Pos logit contributions
+0.089
 Bian+0.073
isher+0.072
 markers+0.072
 Beth+0.071
Neg logit contributions
 thirteen-0.047
dy-0.044
die-0.042
 remembers-0.042
 notices-0.041
Token.
Token ID13
Feature activation-2.03e-01

Pos logit contributions
+0.170
vell+0.138
 Bian+0.138
 markers+0.132
ird+0.132
Neg logit contributions
 thirteen-0.173
dy-0.152
Today-0.146
die-0.144
 moral-0.142
Token
Token ID198
Feature activation-4.86e-02

Pos logit contributions
+0.007
 supplies+0.007
vell+0.007
ird+0.007
 fixed+0.007
Neg logit contributions
 thirteen-0.007
die-0.006
dy-0.006
Dean-0.006
Today-0.005
Token
Token ID198
Feature activation-1.07e-01

Pos logit contributions
+0.002
 supplies+0.002
ird+0.002
vell+0.002
 markers+0.002
Neg logit contributions
 thirteen-0.002
dy-0.002
die-0.002
Dean-0.001
Today-0.001
TokenM
Token ID44
Feature activation-1.95e-01

Pos logit contributions
+0.005
 supplies+0.004
 help+0.004
 markers+0.004
ird+0.004
Neg logit contributions
 thirteen-0.004
dy-0.003
die-0.003
 obstacle-0.003
mo-0.003
Tokenummy
Token ID13513
Feature activation+1.53e+00

Pos logit contributions
+0.007
 supplies+0.007
ird+0.006
 markers+0.006
vell+0.006
Neg logit contributions
 thirteen-0.007
dy-0.007
die-0.006
 obstacle-0.006
 moral-0.006
Token told
Token ID1297
Feature activation+3.12e-01

Pos logit contributions
 thirteen+0.027
dy+0.026
die+0.025
Dean+0.023
Those+0.022
Neg logit contributions
-0.018
 supplies-0.016
vell-0.015
ird-0.014
 markers-0.014
Token her
Token ID607
Feature activation-5.64e-01

Pos logit contributions
 thirteen+0.015
dy+0.014
die+0.014
 obstacle+0.013
 moral+0.013
Neg logit contributions
-0.008
 supplies-0.007
ird-0.007
vell-0.007
 markers-0.006
Token what
Token ID644
Feature activation-2.94e-01

Pos logit contributions
 help+0.012
 supplies+0.012
 yes+0.011
 decor+0.011
 together+0.011
Neg logit contributions
 thirteen-0.029
die-0.026
 obstacle-0.026
dy-0.025
Dean-0.025
Token they
Token ID484
Feature activation-1.81e+00

Pos logit contributions
+0.007
 supplies+0.006
ird+0.006
 help+0.006
 markers+0.006
Neg logit contributions
 thirteen-0.015
dy-0.014
die-0.013
 obstacle-0.013
Dean-0.012
Token had
Token ID550
Feature activation-1.84e+00

Pos logit contributions
+0.080
 supplies+0.070
anger+0.069
ird+0.068
resses+0.068
Neg logit contributions
 thirteen-0.049
dy-0.048
die-0.046
Today-0.045
He-0.045
Token seen
Token ID1775
Feature activation-4.78e+00

Pos logit contributions
+0.089
uggle+0.070
idents+0.069
 Beth+0.069
resses+0.068
Neg logit contributions
 thirteen-0.043
 remembers-0.040
 moral-0.039
Today-0.038
reetings-0.038
Token and
Token ID290
Feature activation-3.24e-01

Pos logit contributions
+0.145
vell+0.119
 supplies+0.118
 markers+0.115
 Bian+0.114
Neg logit contributions
 thirteen-0.200
dy-0.182
die-0.173
Today-0.167
 moral-0.166
Token heard
Token ID2982
Feature activation-5.59e-02

Pos logit contributions
+0.008
 supplies+0.008
 help+0.007
ird+0.007
 markers+0.007
Neg logit contributions
 thirteen-0.016
dy-0.015
die-0.014
 obstacle-0.014
Dean-0.013
Token.
Token ID13
Feature activation+8.24e-02

Pos logit contributions
+0.001
 supplies+0.001
 help+0.001
 markers+0.001
ird+0.001
Neg logit contributions
 thirteen-0.003
dy-0.003
die-0.003
 obstacle-0.003
Dean-0.003
Token Mom
Token ID11254
Feature activation-1.19e+00

Pos logit contributions
 thirteen+0.004
dy+0.003
 obstacle+0.003
die+0.003
Dean+0.003
Neg logit contributions
-0.003
 supplies-0.002
ird-0.002
 markers-0.002
 Emmy-0.002
Tokenmy
Token ID1820
Feature activation-8.68e-01

Pos logit contributions
+0.049
 supplies+0.047
ird+0.045
 markers+0.045
 fixed+0.044
Neg logit contributions
 thirteen-0.038
dy-0.033
 obstacle-0.032
die-0.032
Dean-0.029
Token,
Token ID11
Feature activation+1.56e+00

Pos logit contributions
 thirteen+0.056
dy+0.053
die+0.052
 obstacle+0.049
Dean+0.048
Neg logit contributions
-0.028
 supplies-0.025
ird-0.023
 markers-0.023
 help-0.022
Token "
Token ID366
Feature activation-5.80e-01

Pos logit contributions
 thirteen+0.070
dy+0.067
 obstacle+0.066
die+0.066
Dean+0.061
Neg logit contributions
-0.045
 help-0.037
 yes-0.037
 supplies-0.036
ird-0.036
TokenWhat
Token ID2061
Feature activation-5.63e-01

Pos logit contributions
+0.024
ird+0.023
 yes+0.023
 markers+0.022
 supplies+0.022
Neg logit contributions
dy-0.018
 thirteen-0.017
 obstacle-0.016
die-0.015
 hops-0.014
Token have
Token ID423
Feature activation-1.88e+00

Pos logit contributions
+0.017
 supplies+0.016
ird+0.015
 markers+0.015
 help+0.015
Neg logit contributions
 thirteen-0.024
dy-0.022
die-0.021
 obstacle-0.020
Dean-0.020
Token you
Token ID345
Feature activation-1.22e+00

Pos logit contributions
+0.043
 supplies+0.037
ird+0.034
 markers+0.033
vell+0.032
Neg logit contributions
 thirteen-0.096
dy-0.088
die-0.086
 obstacle-0.083
Dean-0.080
Token seen
Token ID1775
Feature activation-4.77e+00

Pos logit contributions
+0.033
 supplies+0.033
 help+0.031
ird+0.031
 decor+0.030
Neg logit contributions
 thirteen-0.056
dy-0.052
die-0.052
 obstacle-0.049
Dean-0.048
Token,
Token ID11
Feature activation-7.81e-01

Pos logit contributions
+0.146
 supplies+0.128
ird+0.122
 markers+0.118
vell+0.116
Neg logit contributions
 thirteen-0.205
dy-0.190
die-0.185
 obstacle-0.172
Dean-0.168
Token John
Token ID1757
Feature activation-2.63e-01

Pos logit contributions
+0.015
 supplies+0.014
ird+0.013
 markers+0.013
 help+0.012
Neg logit contributions
 thirteen-0.041
dy-0.039
die-0.038
 obstacle-0.037
Dean-0.036
Token?"
Token ID1701
Feature activation-3.30e-01

Pos logit contributions
 fixed+0.003
 Emmy+0.002
 supplies+0.002
ird+0.002
 Tim+0.002
Neg logit contributions
 thirteen-0.017
 obstacle-0.015
Meanwhile-0.014
 seventeen-0.014
Dean-0.014
Token He
Token ID679
Feature activation+9.10e-01

Pos logit contributions
+0.009
 supplies+0.008
ird+0.007
 markers+0.007
 help+0.007
Neg logit contributions
 thirteen-0.016
dy-0.015
die-0.014
 obstacle-0.014
Dean-0.013
Token pointed
Token ID6235
Feature activation+6.64e-01

Pos logit contributions
 thirteen+0.051
 obstacle+0.047
dy+0.047
die+0.045
Dean+0.043
Neg logit contributions
-0.016
 supplies-0.013
 fixed-0.013
 yes-0.012
 markers-0.012
Token see
Token ID766
Feature activation-1.33e-01

Pos logit contributions
 thirteen+0.028
dy+0.026
die+0.026
 obstacle+0.024
Dean+0.023
Neg logit contributions
-0.022
 supplies-0.021
 help-0.019
ird-0.019
 markers-0.019
Token that
Token ID326
Feature activation+4.45e-01

Pos logit contributions
+0.004
 supplies+0.003
ird+0.003
 markers+0.003
vell+0.003
Neg logit contributions
 thirteen-0.006
dy-0.006
die-0.006
 obstacle-0.005
Dean-0.005
Token things
Token ID1243
Feature activation-5.48e-01

Pos logit contributions
 thirteen+0.023
dy+0.022
die+0.022
Dean+0.020
 obstacle+0.020
Neg logit contributions
 supplies-0.009
-0.009
 help-0.008
ird-0.008
 markers-0.008
Token she
Token ID673
Feature activation-1.30e+00

Pos logit contributions
+0.014
 supplies+0.012
ird+0.012
 markers+0.011
vell+0.011
Neg logit contributions
 thirteen-0.026
dy-0.024
die-0.023
 obstacle-0.022
Dean-0.021
Token had
Token ID550
Feature activation-1.53e+00

Pos logit contributions
+0.044
 supplies+0.040
ird+0.037
 markers+0.037
uggle+0.036
Neg logit contributions
 thirteen-0.050
dy-0.048
die-0.046
 obstacle-0.042
Dean-0.041
Token seen
Token ID1775
Feature activation-4.73e+00

Pos logit contributions
+0.054
 supplies+0.045
ird+0.043
 markers+0.043
uggle+0.042
Neg logit contributions
 thirteen-0.060
dy-0.054
die-0.052
 obstacle-0.048
Dean-0.048
Token backstage
Token ID43090
Feature activation-8.11e-01

Pos logit contributions
+0.130
 supplies+0.108
ird+0.103
 markers+0.100
vell+0.100
Neg logit contributions
 thirteen-0.218
dy-0.201
die-0.194
 obstacle-0.185
Dean-0.179
Token were
Token ID547
Feature activation+7.50e-02

Pos logit contributions
+0.014
 supplies+0.014
ird+0.012
 help+0.011
 markers+0.011
Neg logit contributions
 thirteen-0.045
dy-0.042
die-0.042
Dean-0.039
 obstacle-0.039
Token missing
Token ID4814
Feature activation-1.05e-01

Pos logit contributions
 thirteen+0.003
dy+0.003
die+0.003
 obstacle+0.003
Dean+0.003
Neg logit contributions
-0.002
 supplies-0.002
 markers-0.002
ird-0.002
 help-0.002
Token;
Token ID26
Feature activation-2.81e-02

Pos logit contributions
+0.002
 supplies+0.002
ird+0.002
 markers+0.001
 help+0.001
Neg logit contributions
 thirteen-0.006
dy-0.005
die-0.005
 obstacle-0.005
Dean-0.005
Token the
Token ID262
Feature activation-6.46e-01

Pos logit contributions
+0.001
 supplies+0.001
 help+0.000
 markers+0.000
 yes+0.000
Neg logit contributions
dy-0.001
 thirteen-0.001
die-0.001
 obstacle-0.001
Dean-0.001
Token!
Token ID0
Feature activation-2.24e-01

Pos logit contributions
 thirteen+0.034
dy+0.033
die+0.033
Dean+0.031
 obstacle+0.030
Neg logit contributions
 supplies-0.014
-0.013
 markers-0.012
 help-0.011
ird-0.011
Token That
Token ID1320
Feature activation+9.14e-01

Pos logit contributions
+0.007
ird+0.006
 supplies+0.005
 markers+0.005
 help+0.005
Neg logit contributions
 thirteen-0.009
dy-0.009
die-0.009
 obstacle-0.009
 moral-0.008
Token's
Token ID338
Feature activation-5.59e-01

Pos logit contributions
 thirteen+0.036
dy+0.033
die+0.032
 obstacle+0.030
Dean+0.029
Neg logit contributions
 supplies-0.031
-0.029
 markers-0.028
ird-0.028
 help-0.027
Token what
Token ID644
Feature activation+2.13e-01

Pos logit contributions
+0.013
 supplies+0.013
 help+0.011
 markers+0.011
ird+0.011
Neg logit contributions
 thirteen-0.027
dy-0.025
die-0.025
 obstacle-0.024
Dean-0.024
Token grown
Token ID7334
Feature activation-1.36e+00

Pos logit contributions
 thirteen+0.011
dy+0.011
die+0.010
 obstacle+0.010
Meanwhile+0.010
Neg logit contributions
 supplies-0.004
 help-0.004
 markers-0.004
ird-0.004
-0.004
Token-
Token ID12
Feature activation-4.68e+00

Pos logit contributions
+0.032
 supplies+0.030
ird+0.027
 markers+0.026
 help+0.025
Neg logit contributions
 thirteen-0.068
dy-0.063
die-0.062
 obstacle-0.059
Dean-0.058
Tokenups
Token ID4739
Feature activation-2.05e+00

Pos logit contributions
 supplies+0.143
+0.142
ird+0.132
 markers+0.129
 help+0.128
Neg logit contributions
 thirteen-0.191
dy-0.181
die-0.179
 obstacle-0.162
Dean-0.157
Token do
Token ID466
Feature activation-2.73e-01

Pos logit contributions
+0.056
 supplies+0.051
ird+0.047
 markers+0.046
 help+0.045
Neg logit contributions
 thirteen-0.094
dy-0.087
die-0.085
 obstacle-0.080
Dean-0.078
Token when
Token ID618
Feature activation+5.62e-01

Pos logit contributions
+0.007
 supplies+0.006
ird+0.005
 markers+0.005
vell+0.005
Neg logit contributions
 thirteen-0.013
dy-0.013
die-0.012
 obstacle-0.011
Dean-0.011
Token they
Token ID484
Feature activation+3.27e-02

Pos logit contributions
 thirteen+0.033
die+0.032
dy+0.032
Dean+0.029
 obstacle+0.029
Neg logit contributions
 supplies-0.008
 help-0.007
-0.007
 decor-0.006
 together-0.006
Token need
Token ID761
Feature activation-1.04e-01

Pos logit contributions
dy+0.002
 thirteen+0.002
die+0.002
 obstacle+0.001
Dean+0.001
Neg logit contributions
-0.001
 supplies-0.001
 help-0.001
ird-0.001
 decor-0.001
Token excited
Token ID6568
Feature activation-4.87e-02

Pos logit contributions
 thirteen+0.000
dy+0.000
die+0.000
 obstacle+0.000
Dean+0.000
Neg logit contributions
-0.000
 supplies-0.000
ird-0.000
 help-0.000
 fixed-0.000
Token about
Token ID546
Feature activation-2.60e-01

Pos logit contributions
 supplies+0.001
+0.001
 help+0.001
 together+0.000
 markers+0.000
Neg logit contributions
 thirteen-0.003
die-0.003
dy-0.003
Dean-0.003
 obstacle-0.003
Token what
Token ID644
Feature activation+2.91e-01

Pos logit contributions
 supplies+0.005
 help+0.005
 decor+0.005
+0.005
 together+0.005
Neg logit contributions
 thirteen-0.014
dy-0.013
die-0.013
Dean-0.012
umpy-0.012
Token he
Token ID339
Feature activation-1.08e+00

Pos logit contributions
 thirteen+0.013
dy+0.012
die+0.012
 obstacle+0.011
Dean+0.011
Neg logit contributions
 supplies-0.008
-0.008
 help-0.007
 markers-0.007
ird-0.007
Token had
Token ID550
Feature activation-1.75e+00

Pos logit contributions
+0.043
 supplies+0.040
ird+0.038
 markers+0.037
vell+0.036
Neg logit contributions
 thirteen-0.036
dy-0.034
die-0.033
 obstacle-0.030
Dean-0.029
Token seen
Token ID1775
Feature activation-4.66e+00

Pos logit contributions
+0.072
 markers+0.059
idents+0.057
ird+0.057
uggle+0.057
Neg logit contributions
 thirteen-0.056
dy-0.051
die-0.050
Today-0.048
 moral-0.047
Token.
Token ID13
Feature activation-3.56e-01

Pos logit contributions
+0.118
 supplies+0.101
ird+0.095
 markers+0.093
vell+0.090
Neg logit contributions
 thirteen-0.224
dy-0.207
die-0.202
 obstacle-0.192
Dean-0.187
Token He
Token ID679
Feature activation-4.47e-01

Pos logit contributions
+0.015
 supplies+0.014
ird+0.013
 markers+0.013
 help+0.013
Neg logit contributions
 thirteen-0.011
dy-0.010
die-0.010
 obstacle-0.009
Dean-0.008
Token knew
Token ID2993
Feature activation-1.20e+00

Pos logit contributions
 supplies+0.011
+0.011
 fixed+0.010
 help+0.010
ird+0.010
Neg logit contributions
 thirteen-0.022
dy-0.019
 obstacle-0.019
die-0.019
Dean-0.018
Token he
Token ID339
Feature activation-2.04e-01

Pos logit contributions
 supplies+0.029
+0.028
 help+0.026
ird+0.025
 markers+0.024
Neg logit contributions
 thirteen-0.059
dy-0.055
die-0.054
 obstacle-0.051
Dean-0.050
Token would
Token ID561
Feature activation-9.76e-01

Pos logit contributions
+0.007
 supplies+0.006
ird+0.006
 markers+0.006
 help+0.006
Neg logit contributions
 thirteen-0.008
dy-0.007
die-0.007
 obstacle-0.007
Dean-0.006
Token told
Token ID1297
Feature activation-2.16e-01

Pos logit contributions
+0.005
 supplies+0.004
 markers+0.004
ird+0.004
vell+0.004
Neg logit contributions
 thirteen-0.006
dy-0.006
die-0.005
Dean-0.005
 moral-0.005
Token her
Token ID607
Feature activation-8.59e-01

Pos logit contributions
+0.005
 supplies+0.005
ird+0.004
 markers+0.004
 help+0.004
Neg logit contributions
 thirteen-0.011
dy-0.010
die-0.010
 obstacle-0.009
Dean-0.009
Token what
Token ID644
Feature activation-2.06e-01

Pos logit contributions
 help+0.015
 supplies+0.014
 yes+0.013
 decor+0.012
 fixed+0.012
Neg logit contributions
 thirteen-0.050
die-0.045
 obstacle-0.044
dy-0.043
Dean-0.042
Token he
Token ID339
Feature activation-1.36e+00

Pos logit contributions
+0.005
 supplies+0.005
 help+0.005
ird+0.004
 markers+0.004
Neg logit contributions
 thirteen-0.010
dy-0.009
die-0.009
 obstacle-0.009
Dean-0.008
Token had
Token ID550
Feature activation-1.73e+00

Pos logit contributions
+0.052
 supplies+0.047
ird+0.045
 markers+0.045
vell+0.043
Neg logit contributions
 thirteen-0.047
dy-0.047
die-0.045
 moral-0.040
Today-0.039
Token seen
Token ID1775
Feature activation-4.61e+00

Pos logit contributions
+0.078
uggle+0.062
 Bian+0.062
 markers+0.061
resses+0.061
Neg logit contributions
 thirteen-0.046
dy-0.043
 remembers-0.041
 notices-0.040
reetings-0.040
Token and
Token ID290
Feature activation-1.84e-01

Pos logit contributions
+0.149
 supplies+0.126
 markers+0.122
ird+0.122
vell+0.121
Neg logit contributions
 thirteen-0.187
dy-0.172
die-0.165
 obstacle-0.157
Today-0.153
Token how
Token ID703
Feature activation-1.64e-01

Pos logit contributions
+0.005
 supplies+0.005
 help+0.004
ird+0.004
 markers+0.004
Neg logit contributions
 thirteen-0.008
dy-0.008
die-0.008
 obstacle-0.007
Dean-0.007
Token he
Token ID339
Feature activation-2.25e-01

Pos logit contributions
 supplies+0.004
+0.004
 help+0.004
 fixed+0.004
 decor+0.004
Neg logit contributions
 thirteen-0.008
die-0.007
dy-0.007
 obstacle-0.006
Dean-0.006
Token had
Token ID550
Feature activation-6.05e-01

Pos logit contributions
+0.009
 supplies+0.008
ird+0.007
uggle+0.007
vell+0.007
Neg logit contributions
dy-0.008
 thirteen-0.008
die-0.007
 remembers-0.006
 moral-0.006
Token escaped
Token ID13537
Feature activation-1.99e+00

Pos logit contributions
+0.019
 supplies+0.017
ird+0.016
 markers+0.016
 help+0.015
Neg logit contributions
 thirteen-0.026
dy-0.024
die-0.023
 obstacle-0.022
Dean-0.021
Token,
Token ID11
Feature activation-2.06e-01

Pos logit contributions
+0.018
ird+0.015
 supplies+0.015
enge+0.014
 markers+0.014
Neg logit contributions
 thirteen-0.030
dy-0.028
die-0.027
 obstacle-0.026
Today-0.025
Token things
Token ID1243
Feature activation-7.44e-01

Pos logit contributions
+0.006
 supplies+0.005
ird+0.005
 markers+0.005
vell+0.005
Neg logit contributions
 thirteen-0.009
dy-0.008
die-0.008
 obstacle-0.008
Dean-0.007
Token he
Token ID339
Feature activation-1.61e+00

Pos logit contributions
+0.022
ird+0.018
 supplies+0.018
 markers+0.017
vell+0.016
Neg logit contributions
 thirteen-0.033
dy-0.031
die-0.029
 obstacle-0.027
He-0.026
Token had
Token ID550
Feature activation-1.72e+00

Pos logit contributions
+0.064
uggle+0.054
ird+0.053
 supplies+0.052
anger+0.050
Neg logit contributions
dy-0.057
die-0.055
 thirteen-0.054
He-0.047
Dean-0.046
Token never
Token ID1239
Feature activation-2.21e+00

Pos logit contributions
+0.066
ird+0.054
uggle+0.053
 Bian+0.052
 supplies+0.052
Neg logit contributions
 thirteen-0.064
dy-0.056
die-0.054
 moral-0.049
Today-0.048
Token seen
Token ID1775
Feature activation-4.60e+00

Pos logit contributions
+0.082
 supplies+0.067
ird+0.064
uggle+0.064
 markers+0.062
Neg logit contributions
 thirteen-0.085
dy-0.078
die-0.076
 obstacle-0.070
 moral-0.068
Token before
Token ID878
Feature activation+1.57e-01

Pos logit contributions
+0.141
 supplies+0.125
ird+0.115
 markers+0.114
 Bian+0.113
Neg logit contributions
 thirteen-0.201
dy-0.177
die-0.172
 obstacle-0.164
 moral-0.160
Token.
Token ID13
Feature activation+9.35e-03

Pos logit contributions
 thirteen+0.009
dy+0.008
die+0.008
 obstacle+0.008
Dean+0.008
Neg logit contributions
-0.003
 supplies-0.002
ird-0.002
 help-0.002
 markers-0.002
Token Stanley
Token ID14167
Feature activation+5.66e-01

Pos logit contributions
 thirteen+0.000
dy+0.000
 obstacle+0.000
die+0.000
mo+0.000
Neg logit contributions
-0.000
 supplies-0.000
 help-0.000
 Eve-0.000
ird-0.000
Token was
Token ID373
Feature activation-5.45e-02

Pos logit contributions
 thirteen+0.029
 obstacle+0.026
die+0.025
dy+0.025
Today+0.024
Neg logit contributions
 supplies-0.012
-0.012
 fixed-0.011
 help-0.011
 markers-0.010
Token so
Token ID523
Feature activation-1.76e-01

Pos logit contributions
+0.001
 supplies+0.001
ird+0.001
 help+0.001
 fixed+0.001
Neg logit contributions
 thirteen-0.003
dy-0.003
die-0.003
 obstacle-0.002
Dean-0.002
Token the
Token ID262
Feature activation-8.56e-01

Pos logit contributions
 supplies+0.000
 help+0.000
 decor+0.000
 together+0.000
 show+0.000
Neg logit contributions
 thirteen-0.000
dy-0.000
die-0.000
Dean-0.000
 seventeen-0.000
Token wonderful
Token ID7932
Feature activation+3.52e-01

Pos logit contributions
 supplies+0.020
 help+0.017
+0.017
 markers+0.017
 decor+0.015
Neg logit contributions
 thirteen-0.044
dy-0.041
die-0.040
Dean-0.038
Meanwhile-0.037
Token things
Token ID1243
Feature activation-4.59e-01

Pos logit contributions
 thirteen+0.018
dy+0.016
die+0.016
Dean+0.015
Meanwhile+0.015
Neg logit contributions
 supplies-0.009
-0.008
 help-0.007
 markers-0.007
 decor-0.007
Token she
Token ID673
Feature activation-1.69e+00

Pos logit contributions
+0.014
 supplies+0.012
ird+0.012
 markers+0.012
vell+0.011
Neg logit contributions
 thirteen-0.019
dy-0.017
die-0.017
 obstacle-0.016
Today-0.015
Token had
Token ID550
Feature activation-1.62e+00

Pos logit contributions
+0.060
 supplies+0.056
ird+0.053
 markers+0.052
 help+0.051
Neg logit contributions
 thirteen-0.064
dy-0.059
die-0.057
 obstacle-0.053
Dean-0.051
Token seen
Token ID1775
Feature activation-4.59e+00

Pos logit contributions
+0.050
 supplies+0.044
 markers+0.042
ird+0.041
vell+0.040
Neg logit contributions
 thirteen-0.068
dy-0.063
die-0.062
Dean-0.056
Today-0.056
Token in
Token ID287
Feature activation-1.13e+00

Pos logit contributions
+0.114
 supplies+0.098
ird+0.092
 markers+0.090
vell+0.086
Neg logit contributions
 thirteen-0.222
dy-0.207
die-0.202
 obstacle-0.190
Dean-0.186
Token the
Token ID262
Feature activation-2.09e-01

Pos logit contributions
+0.024
 supplies+0.022
 help+0.019
 markers+0.019
ird+0.018
Neg logit contributions
 thirteen-0.059
dy-0.057
die-0.055
 obstacle-0.051
Dean-0.051
Token ocean
Token ID9151
Feature activation+1.77e-01

Pos logit contributions
 supplies+0.007
+0.007
 help+0.006
 markers+0.006
 fixed+0.006
Neg logit contributions
 thirteen-0.008
dy-0.008
die-0.008
Dean-0.007
emy-0.006
Token.
Token ID13
Feature activation-3.30e-01

Pos logit contributions
 thirteen+0.011
die+0.010
dy+0.010
Dean+0.010
 obstacle+0.010
Neg logit contributions
 supplies-0.002
-0.002
 help-0.002
ird-0.001
 decor-0.001
Token It
Token ID632
Feature activation-7.92e-01

Pos logit contributions
+0.009
 supplies+0.007
ird+0.007
 Emmy+0.007
 markers+0.007
Neg logit contributions
 thirteen-0.016
dy-0.015
die-0.014
 obstacle-0.014
Dean-0.013
Token asked
Token ID1965
Feature activation+2.22e-01

Pos logit contributions
+0.008
 supplies+0.007
ird+0.007
 markers+0.007
vell+0.007
Neg logit contributions
 thirteen-0.010
dy-0.010
die-0.009
 obstacle-0.008
 moral-0.008
Token them
Token ID606
Feature activation-5.80e-01

Pos logit contributions
 thirteen+0.010
dy+0.009
die+0.009
 moral+0.009
 obstacle+0.008
Neg logit contributions
-0.006
 supplies-0.006
ird-0.005
vell-0.005
 markers-0.005
Token what
Token ID644
Feature activation+2.19e-01

Pos logit contributions
 help+0.013
 supplies+0.012
 yes+0.011
 markers+0.011
 together+0.011
Neg logit contributions
 thirteen-0.029
die-0.027
dy-0.027
 obstacle-0.027
Dean-0.025
Token they
Token ID484
Feature activation-1.59e+00

Pos logit contributions
 thirteen+0.009
dy+0.009
die+0.008
 obstacle+0.008
 moral+0.008
Neg logit contributions
-0.007
 supplies-0.006
vell-0.006
ird-0.006
 markers-0.005
Token had
Token ID550
Feature activation-1.72e+00

Pos logit contributions
+0.062
anger+0.055
 supplies+0.054
ird+0.053
resses+0.053
Neg logit contributions
 thirteen-0.050
dy-0.049
die-0.047
 remembers-0.047
 moral-0.046
Token seen
Token ID1775
Feature activation-4.58e+00

Pos logit contributions
+0.084
uggle+0.065
resses+0.062
anger+0.062
 brand+0.061
Neg logit contributions
 remembers-0.042
 thirteen-0.041
reetings-0.038
 notices-0.038
 moral-0.037
Token and
Token ID290
Feature activation+2.30e-02

Pos logit contributions
+0.166
vell+0.140
 Bian+0.131
ird+0.131
 supplies+0.130
Neg logit contributions
 thirteen-0.164
dy-0.148
die-0.138
 moral-0.136
Today-0.135
Token done
Token ID1760
Feature activation+4.96e-01

Pos logit contributions
 thirteen+0.001
dy+0.001
die+0.001
 obstacle+0.001
Dean+0.001
Neg logit contributions
-0.001
 supplies-0.001
 help-0.001
ird-0.001
 markers-0.001
Token.
Token ID13
Feature activation+2.30e-01

Pos logit contributions
 thirteen+0.028
dy+0.026
die+0.026
 obstacle+0.024
Dean+0.024
Neg logit contributions
-0.009
 supplies-0.008
ird-0.007
 help-0.007
 markers-0.007
Token They
Token ID1119
Feature activation-7.31e-01

Pos logit contributions
 thirteen+0.008
dy+0.008
die+0.008
Dean+0.007
Today+0.007
Neg logit contributions
 supplies-0.008
-0.008
vell-0.007
ird-0.007
 markers-0.007
Token told
Token ID1297
Feature activation-3.21e-01

Pos logit contributions
+0.025
 supplies+0.021
uggle+0.021
anger+0.020
vell+0.020
Neg logit contributions
 thirteen-0.029
dy-0.027
die-0.026
 moral-0.025
Dean-0.024
Token "
Token ID366
Feature activation-3.17e-01

Pos logit contributions
 thirteen+0.045
dy+0.043
 obstacle+0.042
die+0.042
Dean+0.038
Neg logit contributions
-0.028
 help-0.023
 supplies-0.023
 yes-0.023
ird-0.022
TokenWhat
Token ID2061
Feature activation-4.77e-01

Pos logit contributions
+0.014
ird+0.013
 supplies+0.013
 yes+0.012
 markers+0.012
Neg logit contributions
dy-0.009
 thirteen-0.009
 obstacle-0.008
die-0.008
 ending-0.007
Token are
Token ID389
Feature activation-2.06e+00

Pos logit contributions
+0.016
 supplies+0.014
ird+0.013
 markers+0.013
 help+0.013
Neg logit contributions
 thirteen-0.019
dy-0.018
die-0.018
 obstacle-0.016
Dean-0.016
Token b
Token ID275
Feature activation-2.98e+00

Pos logit contributions
+0.041
 supplies+0.038
ird+0.033
 help+0.033
 markers+0.032
Neg logit contributions
 thirteen-0.109
dy-0.104
die-0.103
 obstacle-0.097
Dean-0.095
Tokenerry
Token ID6996
Feature activation-2.24e+00

Pos logit contributions
+0.112
 supplies+0.099
ird+0.098
 help+0.094
vell+0.089
Neg logit contributions
 thirteen-0.110
die-0.103
dy-0.102
 obstacle-0.091
Dean-0.091
Token-
Token ID12
Feature activation-4.55e+00

Pos logit contributions
+0.053
 supplies+0.048
ird+0.043
 help+0.043
vell+0.039
Neg logit contributions
 thirteen-0.114
dy-0.107
die-0.106
 obstacle-0.099
Dean-0.097
Tokenbus
Token ID10885
Feature activation-1.59e+00

Pos logit contributions
+0.126
 supplies+0.120
 help+0.114
ird+0.106
 markers+0.105
Neg logit contributions
 thirteen-0.197
dy-0.188
die-0.187
 obstacle-0.178
Dean-0.170
Tokenhes
Token ID956
Feature activation-1.55e+00

Pos logit contributions
+0.043
 supplies+0.034
ird+0.033
 Eve+0.031
 fixed+0.030
Neg logit contributions
 thirteen-0.077
die-0.071
dy-0.070
 obstacle-0.070
Dean-0.067
Token?"
Token ID1701
Feature activation-4.34e-01

Pos logit contributions
+0.032
 supplies+0.031
 help+0.028
ird+0.028
 markers+0.025
Neg logit contributions
 thirteen-0.080
dy-0.076
die-0.075
 obstacle-0.070
Dean-0.069
Token
Token ID198
Feature activation+3.55e-01

Pos logit contributions
+0.013
 supplies+0.013
ird+0.012
 markers+0.011
vell+0.011
Neg logit contributions
 thirteen-0.018
dy-0.017
die-0.016
 obstacle-0.015
Dean-0.015
Token
Token ID198
Feature activation+1.89e-01

Pos logit contributions
 thirteen+0.016
dy+0.015
die+0.014
 obstacle+0.014
Dean+0.013
Neg logit contributions
-0.010
 supplies-0.009
ird-0.009
 help-0.009
 markers-0.008
Token saw
Token ID2497
Feature activation-1.09e+00

Pos logit contributions
+0.004
 supplies+0.004
uggle+0.004
vell+0.003
anger+0.003
Neg logit contributions
dy-0.004
 thirteen-0.004
die-0.003
 remembers-0.003
 moral-0.003
Token things
Token ID1243
Feature activation-5.14e-01

Pos logit contributions
+0.026
 supplies+0.024
ird+0.022
 markers+0.022
 help+0.021
Neg logit contributions
 thirteen-0.053
dy-0.050
die-0.049
 obstacle-0.046
Dean-0.045
Token she
Token ID673
Feature activation-1.63e+00

Pos logit contributions
+0.014
 supplies+0.012
ird+0.012
 markers+0.011
vell+0.011
Neg logit contributions
 thirteen-0.024
dy-0.022
die-0.022
 obstacle-0.020
Dean-0.020
Token had
Token ID550
Feature activation-1.69e+00

Pos logit contributions
+0.054
 supplies+0.048
ird+0.046
uggle+0.043
vell+0.043
Neg logit contributions
 thirteen-0.066
dy-0.063
die-0.061
 obstacle-0.055
Dean-0.055
Token never
Token ID1239
Feature activation-1.99e+00

Pos logit contributions
+0.064
ird+0.052
 supplies+0.051
uggle+0.051
idents+0.049
Neg logit contributions
 thirteen-0.063
dy-0.056
die-0.054
 moral-0.049
Dean-0.049
Token seen
Token ID1775
Feature activation-4.55e+00

Pos logit contributions
+0.069
 supplies+0.058
ird+0.055
 markers+0.054
vell+0.054
Neg logit contributions
 thirteen-0.079
dy-0.073
die-0.071
 obstacle-0.066
Dean-0.064
Token before
Token ID878
Feature activation+2.67e-01

Pos logit contributions
+0.144
 supplies+0.127
 markers+0.116
ird+0.116
 Bian+0.115
Neg logit contributions
 thirteen-0.196
dy-0.170
die-0.165
 obstacle-0.158
 moral-0.155
Token.
Token ID13
Feature activation+6.54e-02

Pos logit contributions
 thirteen+0.015
dy+0.014
die+0.014
 obstacle+0.013
Dean+0.013
Neg logit contributions
-0.005
 supplies-0.004
ird-0.004
 markers-0.004
 help-0.004
Token She
Token ID1375
Feature activation+4.26e-02

Pos logit contributions
 thirteen+0.002
dy+0.002
die+0.002
 obstacle+0.002
Dean+0.002
Neg logit contributions
-0.002
 supplies-0.002
ird-0.002
 markers-0.002
vell-0.002
Token felt
Token ID2936
Feature activation-1.77e-01

Pos logit contributions
 thirteen+0.002
dy+0.002
die+0.002
 obstacle+0.002
Dean+0.002
Neg logit contributions
-0.001
 supplies-0.001
 markers-0.001
ird-0.001
 fixed-0.001
Token guilty
Token ID6717
Feature activation-2.07e-01

Pos logit contributions
+0.006
 supplies+0.005
ird+0.005
 markers+0.005
vell+0.005
Neg logit contributions
 thirteen-0.007
dy-0.006
die-0.006
 obstacle-0.006
Dean-0.006
Token the
Token ID262
Feature activation-8.96e-02

Pos logit contributions
+0.001
 supplies+0.001
ird+0.001
 markers+0.001
 help+0.001
Neg logit contributions
 thirteen-0.001
dy-0.001
die-0.001
 obstacle-0.001
Dean-0.001
Token amazing
Token ID4998
Feature activation+1.04e+00

Pos logit contributions
+0.003
 supplies+0.003
 help+0.003
 markers+0.003
ird+0.003
Neg logit contributions
 thirteen-0.003
dy-0.003
die-0.003
Dean-0.003
 obstacle-0.003
Token things
Token ID1243
Feature activation+9.16e-02

Pos logit contributions
 thirteen+0.047
die+0.041
dy+0.041
Dean+0.039
Nik+0.037
Neg logit contributions
-0.031
 supplies-0.031
 help-0.028
 decor-0.026
 show-0.026
Token she
Token ID673
Feature activation-1.46e+00

Pos logit contributions
 thirteen+0.003
dy+0.003
die+0.003
 obstacle+0.003
Today+0.003
Neg logit contributions
-0.003
ird-0.003
 supplies-0.003
 markers-0.003
idents-0.002
Token had
Token ID550
Feature activation-1.47e+00

Pos logit contributions
+0.055
 supplies+0.050
ird+0.048
 markers+0.047
vell+0.045
Neg logit contributions
 thirteen-0.052
dy-0.049
die-0.048
 obstacle-0.042
Dean-0.042
Token seen
Token ID1775
Feature activation-4.52e+00

Pos logit contributions
+0.049
 supplies+0.043
ird+0.041
 markers+0.041
vell+0.039
Neg logit contributions
 thirteen-0.059
dy-0.054
die-0.052
Dean-0.048
 obstacle-0.047
Token.
Token ID13
Feature activation-3.31e-01

Pos logit contributions
+0.114
 supplies+0.094
ird+0.091
 markers+0.089
vell+0.084
Neg logit contributions
 thirteen-0.217
dy-0.201
die-0.195
 obstacle-0.185
Today-0.183
Token She
Token ID1375
Feature activation-3.92e-01

Pos logit contributions
+0.012
 supplies+0.012
ird+0.011
 markers+0.011
vell+0.011
Neg logit contributions
 thirteen-0.012
dy-0.011
die-0.010
 obstacle-0.009
Dean-0.009
Token felt
Token ID2936
Feature activation-3.87e-01

Pos logit contributions
+0.010
 supplies+0.010
 fixed+0.009
ird+0.009
 help+0.009
Neg logit contributions
 thirteen-0.018
dy-0.016
die-0.016
 obstacle-0.016
Dean-0.015
Token happy
Token ID3772
Feature activation-3.38e-01

Pos logit contributions
+0.012
 supplies+0.010
ird+0.010
 markers+0.010
vell+0.009
Neg logit contributions
 thirteen-0.017
dy-0.016
die-0.015
 obstacle-0.014
Dean-0.014
Token and
Token ID290
Feature activation-3.44e-01

Pos logit contributions
 supplies+0.007
+0.007
 help+0.006
 together+0.006
 markers+0.006
Neg logit contributions
 thirteen-0.018
dy-0.016
die-0.016
Dean-0.016
 obstacle-0.015
Token all
Token ID477
Feature activation-7.25e-01

Pos logit contributions
 supplies+0.030
 help+0.030
 yes+0.027
 together+0.026
 markers+0.025
Neg logit contributions
 thirteen-0.087
die-0.081
dy-0.080
Dean-0.076
 obstacle-0.076
Token about
Token ID546
Feature activation-9.54e-01

Pos logit contributions
 supplies+0.015
 help+0.014
 together+0.013
 fixed+0.013
 decor+0.012
Neg logit contributions
 thirteen-0.038
dy-0.035
die-0.035
Dean-0.034
Nik-0.032
Token what
Token ID644
Feature activation+6.60e-02

Pos logit contributions
 supplies+0.022
+0.021
 help+0.020
 markers+0.019
 decor+0.018
Neg logit contributions
 thirteen-0.048
dy-0.044
die-0.043
Dean-0.041
 obstacle-0.041
Token he
Token ID339
Feature activation-1.23e+00

Pos logit contributions
 thirteen+0.003
dy+0.003
die+0.003
 obstacle+0.003
Dean+0.003
Neg logit contributions
 supplies-0.002
-0.002
 help-0.002
 markers-0.002
ird-0.001
Token had
Token ID550
Feature activation-1.76e+00

Pos logit contributions
+0.044
 supplies+0.040
ird+0.038
 markers+0.037
vell+0.036
Neg logit contributions
 thirteen-0.047
dy-0.044
die-0.042
 obstacle-0.039
Dean-0.038
Token seen
Token ID1775
Feature activation-4.50e+00

Pos logit contributions
+0.069
uggle+0.056
 markers+0.056
 Eve+0.055
ird+0.055
Neg logit contributions
 thirteen-0.058
dy-0.053
die-0.051
Today-0.050
 moral-0.050
Token and
Token ID290
Feature activation-2.21e-01

Pos logit contributions
+0.134
 supplies+0.111
ird+0.108
 markers+0.105
vell+0.104
Neg logit contributions
 thirteen-0.197
dy-0.181
die-0.175
 obstacle-0.167
Today-0.162
Token everyone
Token ID2506
Feature activation+1.20e+00

Pos logit contributions
 supplies+0.005
+0.005
 help+0.005
 fixed+0.005
 markers+0.005
Neg logit contributions
 thirteen-0.011
dy-0.010
die-0.010
 obstacle-0.009
Dean-0.009
Token listened
Token ID16399
Feature activation+4.03e-01

Pos logit contributions
 thirteen+0.063
die+0.058
dy+0.058
Dean+0.055
 obstacle+0.054
Neg logit contributions
 supplies-0.024
-0.023
 help-0.023
 fixed-0.023
 together-0.022
Token.
Token ID13
Feature activation-4.89e-01

Pos logit contributions
 thirteen+0.022
dy+0.021
die+0.021
Dean+0.020
 obstacle+0.020
Neg logit contributions
-0.006
 supplies-0.006
 help-0.006
 together-0.005
 markers-0.005
Token
Token ID220
Feature activation+1.36e-01

Pos logit contributions
+0.014
 supplies+0.012
ird+0.011
 markers+0.011
 help+0.011
Neg logit contributions
 thirteen-0.023
dy-0.021
die-0.021
 obstacle-0.020
Dean-0.019
Token all
Token ID477
Feature activation-4.41e-02

Pos logit contributions
+0.014
 supplies+0.014
 help+0.012
ird+0.012
 markers+0.012
Neg logit contributions
 thirteen-0.031
dy-0.029
die-0.028
Dean-0.026
 obstacle-0.026
Token the
Token ID262
Feature activation-9.29e-01

Pos logit contributions
 supplies+0.001
 together+0.001
 help+0.001
 fixed+0.001
 decor+0.000
Neg logit contributions
 thirteen-0.003
die-0.002
dy-0.002
Dean-0.002
reetings-0.002
Token things
Token ID1243
Feature activation+1.54e-01

Pos logit contributions
 supplies+0.028
 help+0.023
 decor+0.021
 fixed+0.021
 Eve+0.021
Neg logit contributions
 thirteen-0.044
die-0.039
dy-0.039
Dean-0.038
reetings-0.035
Token they
Token ID484
Feature activation-1.52e+00

Pos logit contributions
 thirteen+0.006
dy+0.006
die+0.006
 obstacle+0.005
He+0.005
Neg logit contributions
-0.005
 supplies-0.004
ird-0.004
 markers-0.004
vell-0.004
Token had
Token ID550
Feature activation-1.39e+00

Pos logit contributions
+0.068
anger+0.065
uggle+0.062
 supplies+0.062
ird+0.060
Neg logit contributions
 thirteen-0.038
dy-0.036
die-0.035
Today-0.031
He-0.031
Token seen
Token ID1775
Feature activation-4.50e+00

Pos logit contributions
+0.052
uggle+0.043
idents+0.043
 markers+0.042
anger+0.042
Neg logit contributions
 thirteen-0.048
dy-0.045
die-0.042
Today-0.040
reetings-0.038
Token on
Token ID319
Feature activation-5.52e-01

Pos logit contributions
+0.122
 supplies+0.097
ird+0.095
 markers+0.093
vell+0.091
Neg logit contributions
 thirteen-0.207
dy-0.193
die-0.185
 obstacle-0.176
Today-0.175
Token their
Token ID511
Feature activation-1.57e+00

Pos logit contributions
+0.014
 supplies+0.012
ird+0.011
 markers+0.011
 help+0.011
Neg logit contributions
 thirteen-0.027
dy-0.025
die-0.025
 obstacle-0.023
Dean-0.023
Token drive
Token ID3708
Feature activation-6.84e-01

Pos logit contributions
+0.043
 supplies+0.042
 help+0.038
 markers+0.037
ird+0.036
Neg logit contributions
 thirteen-0.072
dy-0.065
die-0.065
Dean-0.060
 obstacle-0.059
Token.
Token ID13
Feature activation-5.37e-01

Pos logit contributions
+0.016
 supplies+0.013
 markers+0.013
ird+0.013
vell+0.012
Neg logit contributions
 thirteen-0.033
dy-0.031
die-0.030
 obstacle-0.030
Today-0.028
Token It
Token ID632
Feature activation-1.25e+00

Pos logit contributions
+0.014
 supplies+0.012
ird+0.011
 markers+0.010
 help+0.010
Neg logit contributions
 thirteen-0.027
dy-0.024
die-0.024
 obstacle-0.024
Dean-0.022
Token Amazing
Token ID23181
Feature activation-1.19e-01

Pos logit contributions
+0.012
 supplies+0.011
ird+0.011
 markers+0.011
 help+0.011
Neg logit contributions
 thirteen-0.008
dy-0.007
die-0.007
 obstacle-0.006
Dean-0.006
Token things
Token ID1243
Feature activation-1.10e+00

Pos logit contributions
 supplies+0.003
+0.003
 markers+0.003
 help+0.003
ird+0.003
Neg logit contributions
 thirteen-0.005
dy-0.005
die-0.005
Dean-0.005
Today-0.005
Token he
Token ID339
Feature activation-1.66e+00

Pos logit contributions
+0.028
 supplies+0.025
ird+0.024
 markers+0.023
vell+0.022
Neg logit contributions
 thirteen-0.051
dy-0.048
die-0.046
 obstacle-0.044
Dean-0.043
Token had
Token ID550
Feature activation-1.78e+00

Pos logit contributions
+0.048
 supplies+0.044
ird+0.041
 markers+0.040
 help+0.039
Neg logit contributions
 thirteen-0.074
dy-0.068
die-0.067
 obstacle-0.063
Dean-0.061
Token never
Token ID1239
Feature activation-1.94e+00

Pos logit contributions
+0.052
 supplies+0.050
ird+0.045
 markers+0.045
 help+0.045
Neg logit contributions
 thirteen-0.077
dy-0.072
die-0.070
 obstacle-0.066
Dean-0.064
Token seen
Token ID1775
Feature activation-4.50e+00

Pos logit contributions
+0.053
 supplies+0.050
ird+0.046
 markers+0.045
 help+0.044
Neg logit contributions
 thirteen-0.088
dy-0.082
die-0.080
 obstacle-0.075
Dean-0.074
Token before
Token ID878
Feature activation+2.68e-01

Pos logit contributions
+0.129
 supplies+0.113
ird+0.107
 markers+0.103
vell+0.100
Neg logit contributions
 thirteen-0.206
dy-0.186
die-0.181
 obstacle-0.171
Dean-0.166
Token.
Token ID13
Feature activation+4.87e-02

Pos logit contributions
 thirteen+0.014
dy+0.013
die+0.013
 obstacle+0.012
Dean+0.012
Neg logit contributions
-0.005
 supplies-0.005
ird-0.004
 markers-0.004
 help-0.004
Token He
Token ID679
Feature activation+1.86e-01

Pos logit contributions
 thirteen+0.001
dy+0.001
die+0.001
 obstacle+0.001
Dean+0.001
Neg logit contributions
-0.002
 supplies-0.002
ird-0.002
vell-0.002
 help-0.002
Token had
Token ID550
Feature activation-1.71e-01

Pos logit contributions
 thirteen+0.009
dy+0.008
die+0.008
 obstacle+0.008
reetings+0.008
Neg logit contributions
 supplies-0.004
-0.004
 fixed-0.004
 markers-0.004
 help-0.004
Token gained
Token ID8618
Feature activation-1.32e-01

Pos logit contributions
 supplies+0.004
 help+0.003
 fixed+0.003
 markers+0.003
+0.003
Neg logit contributions
 thirteen-0.008
dy-0.008
die-0.008
 obstacle-0.008
Dean-0.007
Token the
Token ID262
Feature activation-8.62e-01

Pos logit contributions
 thirteen+0.021
dy+0.020
die+0.019
Dean+0.018
 obstacle+0.018
Neg logit contributions
 supplies-0.009
-0.008
 help-0.008
 markers-0.007
 decor-0.007
Token wonderful
Token ID7932
Feature activation+6.41e-01

Pos logit contributions
 supplies+0.023
+0.022
 help+0.021
 markers+0.019
 fixed+0.019
Neg logit contributions
 thirteen-0.040
die-0.037
dy-0.037
Dean-0.035
 obstacle-0.033
Token things
Token ID1243
Feature activation-1.81e-01

Pos logit contributions
 thirteen+0.032
die+0.028
dy+0.028
Dean+0.027
Nik+0.026
Neg logit contributions
 supplies-0.017
-0.015
 help-0.015
 markers-0.014
 decor-0.014
Token they
Token ID484
Feature activation-1.47e+00

Pos logit contributions
+0.005
ird+0.004
 supplies+0.004
 markers+0.004
vell+0.004
Neg logit contributions
 thirteen-0.008
dy-0.007
die-0.007
 obstacle-0.007
Dean-0.006
Token had
Token ID550
Feature activation-1.53e+00

Pos logit contributions
+0.068
anger+0.063
 supplies+0.060
ird+0.059
uggle+0.059
Neg logit contributions
 thirteen-0.037
dy-0.035
die-0.034
 remembers-0.030
 moral-0.030
Token seen
Token ID1775
Feature activation-4.50e+00

Pos logit contributions
+0.056
idents+0.046
anger+0.046
 markers+0.046
 Bian+0.046
Neg logit contributions
 thirteen-0.052
dy-0.051
die-0.049
Today-0.045
reetings-0.044
Token in
Token ID287
Feature activation-1.12e+00

Pos logit contributions
+0.120
 supplies+0.098
ird+0.096
 markers+0.093
vell+0.089
Neg logit contributions
 thirteen-0.208
dy-0.197
die-0.191
 obstacle-0.178
Today-0.175
Token the
Token ID262
Feature activation-3.19e-01

Pos logit contributions
+0.026
 supplies+0.024
 help+0.021
 markers+0.021
ird+0.021
Neg logit contributions
 thirteen-0.055
dy-0.052
die-0.051
 obstacle-0.047
Dean-0.047
Token tunnel
Token ID13275
Feature activation+1.15e-01

Pos logit contributions
+0.012
 supplies+0.012
ird+0.011
 markers+0.011
 help+0.011
Neg logit contributions
 thirteen-0.011
dy-0.010
die-0.010
 obstacle-0.009
Dean-0.009
Token.
Token ID13
Feature activation-4.92e-01

Pos logit contributions
 thirteen+0.006
dy+0.006
die+0.006
 obstacle+0.006
Dean+0.005
Neg logit contributions
-0.002
 supplies-0.002
ird-0.002
 markers-0.002
 help-0.002
Token<|endoftext|>
Token ID50256
Feature activation+3.25e-01

Pos logit contributions
+0.014
 supplies+0.012
ird+0.011
 markers+0.011
 help+0.011
Neg logit contributions
 thirteen-0.023
dy-0.021
die-0.020
 obstacle-0.020
Dean-0.019
Token about
Token ID546
Feature activation-5.11e-01

Pos logit contributions
+0.005
 supplies+0.005
 help+0.004
 markers+0.004
ird+0.004
Neg logit contributions
 thirteen-0.013
dy-0.012
die-0.012
 obstacle-0.011
Dean-0.011
Token the
Token ID262
Feature activation-3.69e-01

Pos logit contributions
 help+0.009
 supplies+0.009
+0.009
 decor+0.008
 together+0.008
Neg logit contributions
 thirteen-0.028
die-0.026
dy-0.026
Dean-0.025
 obstacle-0.024
Token things
Token ID1243
Feature activation+2.24e-01

Pos logit contributions
 supplies+0.011
+0.011
 help+0.010
 show+0.009
 markers+0.009
Neg logit contributions
 thirteen-0.017
die-0.015
dy-0.015
Dean-0.014
 obstacle-0.013
Token they
Token ID484
Feature activation-1.37e+00

Pos logit contributions
 thirteen+0.010
dy+0.009
die+0.009
 obstacle+0.008
Dean+0.008
Neg logit contributions
-0.007
 supplies-0.006
ird-0.006
 markers-0.006
vell-0.005
Token had
Token ID550
Feature activation-1.33e+00

Pos logit contributions
+0.064
anger+0.062
uggle+0.058
 supplies+0.058
ird+0.056
Neg logit contributions
 thirteen-0.031
dy-0.031
die-0.029
He-0.026
 remembers-0.026
Token seen
Token ID1775
Feature activation-4.47e+00

Pos logit contributions
+0.050
ird+0.040
 markers+0.040
 supplies+0.040
 Bian+0.039
Neg logit contributions
 thirteen-0.048
dy-0.044
die-0.042
 moral-0.038
Today-0.038
Token and
Token ID290
Feature activation-3.85e-01

Pos logit contributions
+0.134
 supplies+0.111
ird+0.109
 markers+0.106
vell+0.105
Neg logit contributions
 thirteen-0.193
dy-0.180
die-0.172
 obstacle-0.162
Today-0.159
Token done
Token ID1760
Feature activation+3.42e-01

Pos logit contributions
+0.010
 supplies+0.009
ird+0.009
 help+0.008
 markers+0.008
Neg logit contributions
 thirteen-0.018
dy-0.017
die-0.016
 obstacle-0.016
Dean-0.015
Token.
Token ID13
Feature activation-3.65e-01

Pos logit contributions
 thirteen+0.018
dy+0.017
die+0.017
Dean+0.016
 obstacle+0.016
Neg logit contributions
-0.007
 supplies-0.006
 help-0.005
 together-0.005
ird-0.005
Token Bob
Token ID5811
Feature activation-5.76e-01

Pos logit contributions
+0.010
 supplies+0.008
 Tim+0.008
 help+0.007
idents+0.007
Neg logit contributions
 thirteen-0.017
dy-0.016
die-0.016
 obstacle-0.015
Dean-0.015
Token was
Token ID373
Feature activation-2.54e-01

Pos logit contributions
+0.012
 fixed+0.012
 supplies+0.012
 helped+0.011
 help+0.011
Neg logit contributions
 thirteen-0.029
 obstacle-0.026
die-0.026
dy-0.025
reetings-0.025
Token the
Token ID262
Feature activation-4.71e-01

Pos logit contributions
+0.007
 supplies+0.006
 help+0.006
 markers+0.005
 decor+0.005
Neg logit contributions
 thirteen-0.016
dy-0.015
die-0.014
Dean-0.014
 obstacle-0.013
Token amazing
Token ID4998
Feature activation+7.95e-01

Pos logit contributions
 supplies+0.013
 help+0.012
 show+0.012
+0.011
 decor+0.010
Neg logit contributions
 thirteen-0.022
die-0.020
dy-0.020
Dean-0.019
Meanwhile-0.018
Token things
Token ID1243
Feature activation+1.89e-01

Pos logit contributions
 thirteen+0.040
die+0.036
Dean+0.035
Nik+0.034
Meanwhile+0.034
Neg logit contributions
 supplies-0.022
 show-0.020
 help-0.019
 decor-0.018
 decorations-0.017
Token they
Token ID484
Feature activation-1.55e+00

Pos logit contributions
 thirteen+0.009
dy+0.008
die+0.008
 obstacle+0.007
Dean+0.007
Neg logit contributions
-0.005
 supplies-0.004
ird-0.004
 markers-0.004
vell-0.004
Token had
Token ID550
Feature activation-1.44e+00

Pos logit contributions
+0.070
anger+0.066
 supplies+0.063
uggle+0.063
ird+0.062
Neg logit contributions
 thirteen-0.038
dy-0.036
die-0.036
Today-0.033
 remembers-0.032
Token seen
Token ID1775
Feature activation-4.47e+00

Pos logit contributions
+0.051
idents+0.042
 markers+0.042
ird+0.041
uggle+0.041
Neg logit contributions
 thirteen-0.052
dy-0.050
die-0.047
Today-0.045
reetings-0.043
Token.
Token ID13
Feature activation-5.55e-01

Pos logit contributions
+0.116
 supplies+0.094
ird+0.092
 markers+0.089
vell+0.085
Neg logit contributions
 thirteen-0.211
dy-0.198
die-0.190
 obstacle-0.181
Today-0.178
Token
Token ID220
Feature activation+2.95e-01

Pos logit contributions
+0.016
 supplies+0.013
ird+0.013
 markers+0.012
 help+0.012
Neg logit contributions
 thirteen-0.026
dy-0.024
die-0.023
 obstacle-0.023
Dean-0.022
Token
Token ID198
Feature activation-4.53e-01

Pos logit contributions
 thirteen+0.013
dy+0.012
die+0.012
 obstacle+0.011
Dean+0.011
Neg logit contributions
-0.009
 supplies-0.008
ird-0.008
 markers-0.007
 help-0.007
Token
Token ID198
Feature activation-5.04e-01

Pos logit contributions
+0.012
 supplies+0.010
ird+0.009
 help+0.009
 together+0.009
Neg logit contributions
 thirteen-0.023
dy-0.020
 obstacle-0.020
die-0.020
Dean-0.018
TokenThe
Token ID464
Feature activation-6.46e-02

Pos logit contributions
+0.017
 together+0.015
ird+0.013
 yes+0.013
 supplies+0.013
Neg logit contributions
 thirteen-0.021
dy-0.020
 obstacle-0.020
die-0.019
 appearance-0.019
Token pocket
Token ID10000
Feature activation-2.57e-01

Pos logit contributions
 thirteen+0.022
dy+0.020
die+0.020
 obstacle+0.018
Dean+0.018
Neg logit contributions
-0.013
 supplies-0.013
 help-0.011
 markers-0.011
ird-0.011
Token.
Token ID13
Feature activation-1.04e-01

Pos logit contributions
+0.008
 supplies+0.007
vell+0.006
ird+0.006
enge+0.006
Neg logit contributions
 thirteen-0.011
dy-0.010
die-0.010
 obstacle-0.009
Dean-0.009
Token He
Token ID679
Feature activation-2.19e-01

Pos logit contributions
+0.003
 supplies+0.003
ird+0.003
 markers+0.003
 help+0.003
Neg logit contributions
 thirteen-0.005
dy-0.004
die-0.004
 obstacle-0.004
Dean-0.004
Token took
Token ID1718
Feature activation-2.82e-01

Pos logit contributions
+0.006
 supplies+0.005
ird+0.005
 markers+0.005
 help+0.005
Neg logit contributions
 thirteen-0.010
dy-0.009
die-0.009
 obstacle-0.009
Dean-0.008
Token out
Token ID503
Feature activation-1.20e+00

Pos logit contributions
+0.008
 supplies+0.007
ird+0.006
vell+0.006
 markers+0.006
Neg logit contributions
 thirteen-0.013
dy-0.012
die-0.012
 obstacle-0.011
Dean-0.011
Token two
Token ID734
Feature activation+1.97e+00

Pos logit contributions
 supplies+0.026
 markers+0.024
 help+0.024
+0.022
 yes+0.021
Neg logit contributions
 thirteen-0.061
dy-0.058
die-0.057
 obstacle-0.054
reetings-0.052
Token pieces
Token ID5207
Feature activation+6.48e-02

Pos logit contributions
 thirteen+0.092
dy+0.087
die+0.086
Dean+0.078
reetings+0.078
Neg logit contributions
 supplies-0.053
 markers-0.048
 help-0.047
-0.047
ird-0.044
Token of
Token ID286
Feature activation-3.80e-01

Pos logit contributions
 thirteen+0.003
dy+0.003
die+0.003
 obstacle+0.003
Dean+0.003
Neg logit contributions
-0.002
 supplies-0.002
 markers-0.001
ird-0.001
 help-0.001
Token bread
Token ID8509
Feature activation-6.60e-01

Pos logit contributions
+0.010
 supplies+0.009
ird+0.009
vell+0.008
 markers+0.008
Neg logit contributions
 thirteen-0.018
dy-0.016
die-0.016
 obstacle-0.015
Dean-0.015
Token and
Token ID290
Feature activation-3.26e-01

Pos logit contributions
+0.017
 supplies+0.016
ird+0.014
 markers+0.014
 help+0.014
Neg logit contributions
 thirteen-0.031
dy-0.029
die-0.028
 obstacle-0.027
Dean-0.026
Token said
Token ID531
Feature activation+2.00e-01

Pos logit contributions
+0.009
 supplies+0.008
 markers+0.007
ird+0.007
 help+0.007
Neg logit contributions
 thirteen-0.015
dy-0.014
die-0.014
 obstacle-0.013
Dean-0.013
Token pieces
Token ID5207
Feature activation+1.91e+00

Pos logit contributions
 thirteen+0.019
dy+0.018
die+0.017
 obstacle+0.016
Dean+0.016
Neg logit contributions
-0.012
 supplies-0.012
ird-0.011
 markers-0.010
 help-0.010
Token back
Token ID736
Feature activation+1.13e-01

Pos logit contributions
 thirteen+0.084
dy+0.078
die+0.075
 obstacle+0.071
Dean+0.068
Neg logit contributions
-0.057
 supplies-0.049
ird-0.047
 markers-0.045
vell-0.043
Token together
Token ID1978
Feature activation+2.36e-01

Pos logit contributions
 thirteen+0.006
dy+0.005
die+0.005
 obstacle+0.005
Dean+0.005
Neg logit contributions
-0.003
 supplies-0.002
ird-0.002
 markers-0.002
 help-0.002
Token like
Token ID588
Feature activation+6.07e-01

Pos logit contributions
 thirteen+0.011
dy+0.010
die+0.010
Today+0.009
 moral+0.009
Neg logit contributions
-0.006
 supplies-0.005
vell-0.005
ird-0.005
 markers-0.005
Token a
Token ID257
Feature activation-8.30e-01

Pos logit contributions
 thirteen+0.027
dy+0.023
 moral+0.022
 obstacle+0.022
die+0.022
Neg logit contributions
-0.018
vell-0.015
 supplies-0.015
enge-0.014
ird-0.014
Token puzzle
Token ID15027
Feature activation+2.30e+00

Pos logit contributions
+0.024
 supplies+0.023
ird+0.022
 help+0.021
 markers+0.021
Neg logit contributions
 thirteen-0.036
dy-0.033
die-0.033
Dean-0.030
 obstacle-0.030
Token."
Token ID526
Feature activation-7.65e-01

Pos logit contributions
 thirteen+0.114
dy+0.106
die+0.102
 obstacle+0.099
 moral+0.095
Neg logit contributions
-0.055
 supplies-0.046
ird-0.043
 markers-0.042
vell-0.041
Token Lily
Token ID20037
Feature activation+5.18e-01

Pos logit contributions
vell+0.029
 supplies+0.028
+0.027
ird+0.026
 markers+0.025
Neg logit contributions
dy-0.025
 thirteen-0.024
die-0.023
 moral-0.020
Those-0.020
Token and
Token ID290
Feature activation+1.36e+00

Pos logit contributions
 thirteen+0.021
dy+0.020
die+0.020
 obstacle+0.018
Dean+0.018
Neg logit contributions
-0.016
 supplies-0.015
ird-0.014
vell-0.014
 markers-0.014
Token her
Token ID607
Feature activation+7.15e-01

Pos logit contributions
dy+0.055
 thirteen+0.053
die+0.052
 moral+0.048
Dean+0.046
Neg logit contributions
-0.045
vell-0.041
 supplies-0.040
ird-0.038
anger-0.037
Token mom
Token ID1995
Feature activation+1.24e-01

Pos logit contributions
dy+0.022
die+0.020
 moral+0.020
 remembers+0.016
 thirteen+0.016
Neg logit contributions
-0.030
ird-0.028
uggle-0.028
 supplies-0.028
vell-0.028
Token wrong
Token ID2642
Feature activation+3.73e-03

Pos logit contributions
+0.070
 supplies+0.064
ird+0.060
 markers+0.059
 help+0.058
Neg logit contributions
 thirteen-0.094
dy-0.087
die-0.085
 obstacle-0.079
Dean-0.077
Token.
Token ID13
Feature activation+1.75e-01

Pos logit contributions
 thirteen+0.000
die+0.000
dy+0.000
Dean+0.000
 obstacle+0.000
Neg logit contributions
-0.000
 supplies-0.000
 help-0.000
 together-0.000
ird-0.000
Token Tim
Token ID5045
Feature activation-6.73e-01

Pos logit contributions
 thirteen+0.007
dy+0.007
die+0.006
 obstacle+0.006
Dean+0.006
Neg logit contributions
-0.006
 supplies-0.005
ird-0.005
 markers-0.005
 help-0.005
Token saw
Token ID2497
Feature activation-9.42e-01

Pos logit contributions
 fixed+0.012
+0.011
 helped+0.011
 supplies+0.011
 help+0.010
Neg logit contributions
 thirteen-0.037
 obstacle-0.034
dy-0.032
die-0.032
reetings-0.031
Token the
Token ID262
Feature activation-1.39e-01

Pos logit contributions
+0.029
ird+0.026
vell+0.026
 supplies+0.025
 markers+0.024
Neg logit contributions
 thirteen-0.039
dy-0.039
die-0.036
 obstacle-0.034
He-0.034
Token lime
Token ID28738
Feature activation+2.22e+00

Pos logit contributions
+0.004
 supplies+0.004
 help+0.004
 markers+0.003
 cr+0.003
Neg logit contributions
 thirteen-0.007
mo-0.006
die-0.006
reetings-0.006
dy-0.006
Token and
Token ID290
Feature activation+3.88e-01

Pos logit contributions
 thirteen+0.112
die+0.109
dy+0.105
Dean+0.101
reetings+0.100
Neg logit contributions
-0.053
 supplies-0.045
 help-0.043
 markers-0.040
 fixed-0.040
Token knew
Token ID2993
Feature activation-1.22e-01

Pos logit contributions
 thirteen+0.021
dy+0.019
die+0.018
 obstacle+0.018
Dean+0.018
Neg logit contributions
-0.008
 supplies-0.007
 help-0.007
 markers-0.006
 fixed-0.006
Token it
Token ID340
Feature activation-4.78e-01

Pos logit contributions
+0.004
 supplies+0.004
ird+0.003
 markers+0.003
vell+0.003
Neg logit contributions
 thirteen-0.005
dy-0.005
die-0.004
 obstacle-0.004
 moral-0.004
Token was
Token ID373
Feature activation-1.85e-01

Pos logit contributions
+0.017
 supplies+0.015
ird+0.015
vell+0.014
 markers+0.014
Neg logit contributions
dy-0.018
 thirteen-0.018
die-0.017
 moral-0.015
 obstacle-0.015
Token not
Token ID407
Feature activation+9.68e-01

Pos logit contributions
+0.006
 supplies+0.006
ird+0.006
vell+0.005
 markers+0.005
Neg logit contributions
 thirteen-0.007
dy-0.007
die-0.007
 moral-0.006
 obstacle-0.006
Token and
Token ID290
Feature activation+2.50e-01

Pos logit contributions
 thirteen+0.015
dy+0.015
die+0.014
 obstacle+0.014
Dean+0.014
Neg logit contributions
 help-0.005
 yes-0.005
 supplies-0.005
 together-0.005
 fixed-0.005
Token he
Token ID339
Feature activation+4.43e-01

Pos logit contributions
 thirteen+0.011
dy+0.010
die+0.010
 obstacle+0.009
Dean+0.009
Neg logit contributions
-0.007
 supplies-0.007
 help-0.006
ird-0.006
 markers-0.006
Token heard
Token ID2982
Feature activation-3.72e-02

Pos logit contributions
 thirteen+0.020
dy+0.018
die+0.017
 obstacle+0.017
Dean+0.016
Neg logit contributions
-0.012
 supplies-0.011
 fixed-0.011
 help-0.011
ird-0.010
Token that
Token ID326
Feature activation+1.95e-02

Pos logit contributions
 supplies+0.001
 help+0.001
+0.001
 markers+0.001
 fixed+0.001
Neg logit contributions
 thirteen-0.002
dy-0.002
die-0.002
Dean-0.002
 obstacle-0.002
Token a
Token ID257
Feature activation-8.16e-01

Pos logit contributions
 thirteen+0.001
 seventeen+0.001
die+0.001
dy+0.001
Dean+0.001
Neg logit contributions
 supplies-0.000
 help-0.000
 together-0.000
 Tim-0.000
 it-0.000
Token story
Token ID1621
Feature activation+3.07e+00

Pos logit contributions
+0.026
 help+0.024
 supplies+0.023
 brand+0.021
 markers+0.021
Neg logit contributions
 thirteen-0.035
dy-0.030
die-0.030
Dean-0.028
mo-0.027
Tokente
Token ID660
Feature activation-2.21e+00

Pos logit contributions
 thirteen+0.127
dy+0.117
die+0.113
 obstacle+0.106
Dean+0.103
Neg logit contributions
-0.098
 supplies-0.091
ird-0.085
 markers-0.084
 help-0.081
Tokenller
Token ID6051
Feature activation-5.46e-02

Pos logit contributions
+0.085
 supplies+0.080
ird+0.075
 markers+0.074
 help+0.073
Neg logit contributions
 thirteen-0.077
dy-0.070
die-0.068
 obstacle-0.062
Dean-0.060
Token was
Token ID373
Feature activation-7.94e-02

Pos logit contributions
+0.001
 fixed+0.001
 help+0.001
 supplies+0.001
ird+0.001
Neg logit contributions
 thirteen-0.003
die-0.003
dy-0.003
Dean-0.003
 obstacle-0.003
Token telling
Token ID5149
Feature activation+6.25e-01

Pos logit contributions
+0.002
 supplies+0.002
 fixed+0.002
 help+0.002
 markers+0.002
Neg logit contributions
 thirteen-0.003
dy-0.003
die-0.003
 obstacle-0.003
Dean-0.003
Token a
Token ID257
Feature activation-1.66e+00

Pos logit contributions
 thirteen+0.033
dy+0.032
die+0.031
 obstacle+0.029
Dean+0.029
Neg logit contributions
 supplies-0.011
-0.011
 help-0.010
ird-0.009
 markers-0.009
Token
Token ID198
Feature activation+5.38e-01

Pos logit contributions
 thirteen+0.029
dy+0.027
die+0.026
 obstacle+0.025
Dean+0.024
Neg logit contributions
-0.020
 supplies-0.018
ird-0.017
 help-0.016
 markers-0.016
TokenS
Token ID50
Feature activation+1.99e-01

Pos logit contributions
 thirteen+0.022
 obstacle+0.022
mo+0.020
die+0.019
dy+0.019
Neg logit contributions
 together-0.014
ird-0.014
 Eve-0.014
-0.014
 supplies-0.013
Tokenara
Token ID3301
Feature activation+8.10e-01

Pos logit contributions
 thirteen+0.005
dy+0.004
die+0.004
 obstacle+0.004
Dean+0.004
Neg logit contributions
-0.010
 supplies-0.009
ird-0.009
 markers-0.009
vell-0.009
Token holds
Token ID6622
Feature activation+3.46e-01

Pos logit contributions
 thirteen+0.031
die+0.030
reetings+0.029
dy+0.028
 obstacle+0.027
Neg logit contributions
-0.024
 supplies-0.024
 help-0.023
 markers-0.022
 fixed-0.022
Token Ben
Token ID3932
Feature activation+1.55e+00

Pos logit contributions
 thirteen+0.013
dy+0.013
die+0.012
Dean+0.012
 obstacle+0.011
Neg logit contributions
-0.012
vell-0.010
 supplies-0.010
ird-0.010
 markers-0.010
Token's
Token ID338
Feature activation+2.43e+00

Pos logit contributions
 thirteen+0.075
 obstacle+0.067
reetings+0.067
die+0.066
Dean+0.064
Neg logit contributions
 supplies-0.036
 help-0.036
 together-0.036
 fixed-0.034
 yes-0.033
Token hand
Token ID1021
Feature activation+6.45e-01

Pos logit contributions
 thirteen+0.095
dy+0.087
die+0.085
 obstacle+0.079
Dean+0.077
Neg logit contributions
-0.084
 supplies-0.078
ird-0.073
 markers-0.072
 help-0.072
Token.
Token ID13
Feature activation+6.56e-01

Pos logit contributions
 thirteen+0.030
dy+0.029
die+0.028
 obstacle+0.026
Today+0.025
Neg logit contributions
-0.017
 supplies-0.015
ird-0.014
vell-0.013
 markers-0.013
Token "
Token ID366
Feature activation-1.57e-01

Pos logit contributions
 thirteen+0.032
 obstacle+0.030
dy+0.029
die+0.029
reetings+0.028
Neg logit contributions
-0.019
 Tim-0.014
 help-0.014
 supplies-0.013
 markers-0.013
TokenDon
Token ID3987
Feature activation-6.57e-01

Pos logit contributions
+0.005
 help+0.005
 supplies+0.004
ird+0.004
 yes+0.004
Neg logit contributions
 thirteen-0.006
dy-0.006
die-0.006
 obstacle-0.005
reetings-0.005
Token't
Token ID470
Feature activation+2.80e-01

Pos logit contributions
+0.015
 supplies+0.015
 markers+0.014
ird+0.014
vell+0.013
Neg logit contributions
 thirteen-0.032
dy-0.030
die-0.029
 obstacle-0.027
 moral-0.027
Token You
Token ID921
Feature activation+5.13e-01

Pos logit contributions
 supplies+0.005
+0.004
vell+0.004
 markers+0.004
ird+0.004
Neg logit contributions
 thirteen-0.004
dy-0.003
die-0.003
Dean-0.003
 moral-0.003
Token have
Token ID423
Feature activation+2.97e-02

Pos logit contributions
 thirteen+0.020
dy+0.017
die+0.016
 obstacle+0.016
Dean+0.016
Neg logit contributions
-0.018
 supplies-0.016
 brand-0.015
anger-0.015
 markers-0.014
Token to
Token ID284
Feature activation-2.28e-01

Pos logit contributions
 thirteen+0.001
dy+0.001
die+0.001
 obstacle+0.001
Dean+0.001
Neg logit contributions
-0.001
 supplies-0.001
ird-0.001
 markers-0.001
vell-0.001
Token share
Token ID2648
Feature activation+6.48e-01

Pos logit contributions
+0.007
 supplies+0.006
ird+0.006
 markers+0.006
 help+0.006
Neg logit contributions
 thirteen-0.010
dy-0.009
die-0.009
 obstacle-0.008
Dean-0.008
Token."
Token ID526
Feature activation-9.80e-01

Pos logit contributions
 thirteen+0.030
dy+0.028
die+0.027
 obstacle+0.025
 moral+0.025
Neg logit contributions
-0.019
 supplies-0.016
ird-0.015
vell-0.015
 Bian-0.014
Token
Token ID198
Feature activation+7.78e-02

Pos logit contributions
+0.035
 supplies+0.035
vell+0.034
ird+0.034
 markers+0.032
Neg logit contributions
 thirteen-0.033
dy-0.032
die-0.031
Dean-0.028
He-0.028
Token
Token ID198
Feature activation-2.13e-02

Pos logit contributions
 thirteen+0.003
dy+0.003
die+0.003
 obstacle+0.003
Dean+0.003
Neg logit contributions
-0.003
 supplies-0.002
ird-0.002
 markers-0.002
vell-0.002
TokenSam
Token ID16305
Feature activation+2.11e-01

Pos logit contributions
+0.001
 supplies+0.001
 together+0.001
 help+0.001
 fixed+0.001
Neg logit contributions
 thirteen-0.001
die-0.001
dy-0.001
mo-0.001
 obstacle-0.001
Token shook
Token ID14682
Feature activation-8.95e-01

Pos logit contributions
 thirteen+0.010
dy+0.009
die+0.009
 obstacle+0.009
Dean+0.008
Neg logit contributions
-0.005
 supplies-0.005
ird-0.005
 markers-0.004
 help-0.004
Token his
Token ID465
Feature activation+3.42e-01

Pos logit contributions
 help+0.019
 Tim+0.018
 supplies+0.018
 together+0.017
+0.017
Neg logit contributions
 thirteen-0.045
die-0.039
dy-0.039
reetings-0.039
Dean-0.039
Token head
Token ID1182
Feature activation+3.18e-01

Pos logit contributions
 thirteen+0.014
die+0.012
reetings+0.012
dy+0.012
Dean+0.011
Neg logit contributions
-0.011
 supplies-0.011
 help-0.010
 markers-0.010
 fixed-0.010
Token.
Token ID13
Feature activation+2.86e-01

Pos logit contributions
+0.007
 help+0.007
 supplies+0.006
 fixed+0.005
vell+0.005
Neg logit contributions
 thirteen-0.042
die-0.039
dy-0.038
Dean-0.037
 obstacle-0.037
Token It
Token ID632
Feature activation-7.26e-01

Pos logit contributions
 thirteen+0.012
dy+0.011
die+0.011
He+0.009
Today+0.009
Neg logit contributions
-0.009
 supplies-0.009
ird-0.008
vell-0.008
 markers-0.008
Token was
Token ID373
Feature activation-5.17e-02

Pos logit contributions
+0.019
 supplies+0.018
ird+0.016
 help+0.016
 markers+0.016
Neg logit contributions
 thirteen-0.034
dy-0.031
die-0.031
 obstacle-0.029
Dean-0.028
Token a
Token ID257
Feature activation-6.08e-01

Pos logit contributions
+0.002
 supplies+0.002
ird+0.001
 markers+0.001
 help+0.001
Neg logit contributions
 thirteen-0.002
dy-0.002
die-0.002
 obstacle-0.002
Dean-0.002
Token nightmare
Token ID17123
Feature activation+5.16e-02

Pos logit contributions
+0.020
 help+0.018
 supplies+0.018
 fixed+0.017
ird+0.017
Neg logit contributions
 thirteen-0.025
dy-0.022
die-0.022
Dean-0.020
 obstacle-0.020
Token!
Token ID0
Feature activation+2.65e-01

Pos logit contributions
 thirteen+0.003
dy+0.003
die+0.003
 obstacle+0.002
Dean+0.002
Neg logit contributions
-0.001
 supplies-0.001
ird-0.001
 help-0.001
 markers-0.001
Token John
Token ID1757
Feature activation+8.89e-02

Pos logit contributions
 thirteen+0.011
dy+0.010
die+0.010
He+0.009
Today+0.009
Neg logit contributions
-0.008
 supplies-0.008
vell-0.007
ird-0.007
 markers-0.007
Token was
Token ID373
Feature activation+3.79e-01

Pos logit contributions
 thirteen+0.005
 obstacle+0.004
dy+0.004
die+0.004
Today+0.004
Neg logit contributions
-0.002
 fixed-0.002
 supplies-0.002
 helped-0.002
ird-0.002
Token frightened
Token ID24776
Feature activation-6.60e-01

Pos logit contributions
 thirteen+0.017
dy+0.016
die+0.015
 obstacle+0.015
Dean+0.014
Neg logit contributions
-0.010
 supplies-0.010
 help-0.009
ird-0.009
 fixed-0.009
Token,
Token ID11
Feature activation+1.10e+00

Pos logit contributions
 help+0.008
 Tim+0.008
 together+0.008
+0.007
 supplies+0.007
Neg logit contributions
 thirteen-0.040
die-0.037
dy-0.036
Dean-0.036
 seventeen-0.035
Token but
Token ID475
Feature activation+5.77e-01

Pos logit contributions
 thirteen+0.065
dy+0.061
die+0.061
Dean+0.059
 obstacle+0.059
Neg logit contributions
 help-0.013
-0.013
 yes-0.012
 Tim-0.012
 supplies-0.011
Token?"
Token ID1701
Feature activation-4.87e-01

Pos logit contributions
 thirteen+0.000
dy+0.000
die+0.000
 obstacle+0.000
Dean+0.000
Neg logit contributions
-0.000
 supplies-0.000
ird-0.000
 help-0.000
 markers-0.000
Token Her
Token ID2332
Feature activation+8.01e-01

Pos logit contributions
+0.014
 supplies+0.014
vell+0.014
ird+0.013
 markers+0.013
Neg logit contributions
 thirteen-0.020
dy-0.019
die-0.018
Dean-0.017
 moral-0.017
Token mom
Token ID1995
Feature activation-6.55e-03

Pos logit contributions
dy+0.032
die+0.031
 thirteen+0.031
 moral+0.028
 obstacle+0.027
Neg logit contributions
-0.026
 supplies-0.024
ird-0.023
vell-0.021
uggle-0.021
Token said
Token ID531
Feature activation+5.27e-01

Pos logit contributions
+0.000
 supplies+0.000
ird+0.000
vell+0.000
 markers+0.000
Neg logit contributions
 thirteen-0.000
dy-0.000
die-0.000
 obstacle-0.000
Dean-0.000
Token,
Token ID11
Feature activation+1.04e+00

Pos logit contributions
 thirteen+0.022
dy+0.020
die+0.018
 moral+0.018
He+0.017
Neg logit contributions
 supplies-0.016
-0.016
vell-0.014
 markers-0.014
enge-0.013
Token "
Token ID366
Feature activation-9.38e-02

Pos logit contributions
 thirteen+0.045
dy+0.042
die+0.041
 obstacle+0.039
Dean+0.037
Neg logit contributions
-0.032
 supplies-0.028
ird-0.027
 help-0.026
 markers-0.026
TokenSure
Token ID19457
Feature activation-1.28e+00

Pos logit contributions
+0.003
 supplies+0.003
ird+0.003
 help+0.003
 markers+0.002
Neg logit contributions
 thirteen-0.004
dy-0.004
die-0.004
 obstacle-0.003
Dean-0.003
Token,
Token ID11
Feature activation-2.70e-01

Pos logit contributions
+0.030
 supplies+0.027
 markers+0.024
vell+0.024
ird+0.024
Neg logit contributions
 thirteen-0.064
dy-0.061
die-0.059
 obstacle-0.056
Dean-0.054
Token I
Token ID314
Feature activation-7.17e-02

Pos logit contributions
+0.008
 supplies+0.007
ird+0.007
 markers+0.007
vell+0.007
Neg logit contributions
 thirteen-0.012
dy-0.011
die-0.011
 obstacle-0.010
Dean-0.010
Token can
Token ID460
Feature activation+1.25e-01

Pos logit contributions
+0.002
 supplies+0.002
 markers+0.001
ird+0.001
vell+0.001
Neg logit contributions
 thirteen-0.003
dy-0.003
die-0.003
 obstacle-0.003
Dean-0.003
Token carry
Token ID3283
Feature activation-3.63e-02

Pos logit contributions
 thirteen+0.005
dy+0.005
die+0.005
He+0.005
Dean+0.005
Neg logit contributions
 supplies-0.003
 markers-0.003
-0.003
enge-0.003
vell-0.003
TokenIt
Token ID1026
Feature activation-6.81e-01

Pos logit contributions
+0.013
ird+0.011
 yes+0.011
 together+0.011
 markers+0.011
Neg logit contributions
 thirteen-0.015
dy-0.015
 obstacle-0.015
die-0.013
 remembers-0.012
Token's
Token ID338
Feature activation-3.20e-01

Pos logit contributions
+0.022
 supplies+0.021
 help+0.020
 fixed+0.020
 markers+0.020
Neg logit contributions
 thirteen-0.027
dy-0.024
die-0.023
 obstacle-0.023
Dean-0.022
Token dangerous
Token ID4923
Feature activation-1.11e+00

Pos logit contributions
+0.009
 supplies+0.008
ird+0.008
 markers+0.007
 help+0.007
Neg logit contributions
 thirteen-0.015
dy-0.014
die-0.013
 obstacle-0.012
Dean-0.012
Token out
Token ID503
Feature activation-1.34e+00

Pos logit contributions
+0.021
 supplies+0.020
 help+0.018
ird+0.017
 markers+0.017
Neg logit contributions
 thirteen-0.059
dy-0.055
die-0.055
Dean-0.052
 obstacle-0.052
Token there
Token ID612
Feature activation+2.48e-01

Pos logit contributions
 help+0.026
 supplies+0.025
ird+0.023
+0.023
 fixed+0.021
Neg logit contributions
 thirteen-0.071
die-0.068
dy-0.067
Dean-0.063
Today-0.060
Token."
Token ID526
Feature activation-8.36e-01

Pos logit contributions
 thirteen+0.013
dy+0.013
die+0.012
 obstacle+0.012
Dean+0.012
Neg logit contributions
-0.005
 supplies-0.004
 help-0.004
ird-0.004
 markers-0.004
Token
Token ID198
Feature activation+1.31e-02

Pos logit contributions
+0.026
 supplies+0.025
vell+0.023
 markers+0.023
ird+0.022
Neg logit contributions
 thirteen-0.033
dy-0.032
die-0.031
Dean-0.028
Today-0.028
Token
Token ID198
Feature activation-5.26e-02

Pos logit contributions
 thirteen+0.001
dy+0.001
die+0.001
 obstacle+0.000
Dean+0.000
Neg logit contributions
-0.000
 supplies-0.000
ird-0.000
 markers-0.000
 help-0.000
TokenBut
Token ID1537
Feature activation-1.24e-01

Pos logit contributions
+0.002
ird+0.001
 Eve+0.001
 Emmy+0.001
 together+0.001
Neg logit contributions
 thirteen-0.002
 obstacle-0.002
dy-0.002
 remembers-0.002
die-0.002
Token Lucy
Token ID22162
Feature activation+1.40e-01

Pos logit contributions
+0.003
 supplies+0.003
ird+0.002
 markers+0.002
 help+0.002
Neg logit contributions
 thirteen-0.006
dy-0.006
die-0.006
 obstacle-0.005
Dean-0.005
Token couldn
Token ID3521
Feature activation-2.93e-01

Pos logit contributions
 thirteen+0.007
dy+0.006
die+0.006
 obstacle+0.006
Dean+0.006
Neg logit contributions
-0.003
ird-0.003
 supplies-0.003
 fixed-0.003
 help-0.003
Token look
Token ID804
Feature activation-6.74e-01

Pos logit contributions
+0.016
 supplies+0.014
ird+0.013
 markers+0.013
vell+0.013
Neg logit contributions
 thirteen-0.023
dy-0.021
die-0.020
 obstacle-0.019
Dean-0.019
Token at
Token ID379
Feature activation-1.46e-01

Pos logit contributions
+0.022
 supplies+0.019
ird+0.018
 markers+0.017
 help+0.017
Neg logit contributions
 thirteen-0.029
dy-0.027
die-0.026
 obstacle-0.025
Dean-0.023
Token.
Token ID13
Feature activation+4.04e-02

Pos logit contributions
+0.004
ird+0.003
 supplies+0.003
vell+0.003
 markers+0.003
Neg logit contributions
 thirteen-0.007
dy-0.007
die-0.007
 obstacle-0.006
Dean-0.006
Token Tom
Token ID4186
Feature activation-2.59e-01

Pos logit contributions
 thirteen+0.002
dy+0.002
die+0.002
 obstacle+0.002
Dean+0.002
Neg logit contributions
-0.001
 supplies-0.001
ird-0.001
 markers-0.001
 fixed-0.001
Token was
Token ID373
Feature activation+8.30e-02

Pos logit contributions
 supplies+0.005
 fixed+0.005
+0.005
ird+0.005
 markers+0.005
Neg logit contributions
 thirteen-0.013
 obstacle-0.012
dy-0.012
die-0.012
Dean-0.011
Token so
Token ID523
Feature activation-1.02e-01

Pos logit contributions
 thirteen+0.004
dy+0.004
die+0.004
 obstacle+0.004
Dean+0.004
Neg logit contributions
-0.002
 supplies-0.002
ird-0.002
 help-0.001
 fixed-0.001
Token happy
Token ID3772
Feature activation-1.93e-01

Pos logit contributions
+0.002
 supplies+0.002
 help+0.002
ird+0.002
 fixed+0.002
Neg logit contributions
 thirteen-0.005
dy-0.005
die-0.005
 obstacle-0.005
Dean-0.005
Token to
Token ID284
Feature activation+4.92e-02

Pos logit contributions
 supplies+0.003
+0.003
 together+0.003
 help+0.002
 markers+0.002
Neg logit contributions
 thirteen-0.012
die-0.011
dy-0.011
 obstacle-0.010
Dean-0.010
Token be
Token ID307
Feature activation-1.78e-01

Pos logit contributions
 thirteen+0.002
dy+0.002
die+0.002
 obstacle+0.002
Dean+0.002
Neg logit contributions
-0.001
 help-0.001
 supplies-0.001
ird-0.001
 decor-0.001
Token there
Token ID612
Feature activation+8.83e-01

Pos logit contributions
+0.005
 supplies+0.004
ird+0.004
 markers+0.004
 help+0.004
Neg logit contributions
 thirteen-0.008
dy-0.008
die-0.007
 obstacle-0.007
Dean-0.007
Token!
Token ID0
Feature activation-1.85e-02

Pos logit contributions
 thirteen+0.048
dy+0.045
die+0.044
 obstacle+0.042
Dean+0.041
Neg logit contributions
-0.017
 supplies-0.016
 help-0.014
ird-0.013
 markers-0.013
Token axe
Token ID23272
Feature activation+5.53e-01

Pos logit contributions
 thirteen+0.003
dy+0.002
die+0.002
Dean+0.002
reetings+0.002
Neg logit contributions
 supplies-0.002
 help-0.002
-0.002
 fixed-0.002
 markers-0.002
Token and
Token ID290
Feature activation+2.26e-01

Pos logit contributions
 thirteen+0.028
die+0.027
dy+0.026
Dean+0.025
reetings+0.025
Neg logit contributions
-0.012
 supplies-0.011
 help-0.010
 markers-0.010
 together-0.010
Token said
Token ID531
Feature activation+5.56e-01

Pos logit contributions
 thirteen+0.012
dy+0.011
die+0.011
 obstacle+0.010
Dean+0.010
Neg logit contributions
-0.005
 supplies-0.004
ird-0.004
 markers-0.004
 help-0.003
Token,
Token ID11
Feature activation+9.68e-01

Pos logit contributions
 thirteen+0.030
dy+0.029
die+0.029
 obstacle+0.027
Dean+0.027
Neg logit contributions
-0.010
 supplies-0.009
ird-0.008
 help-0.008
 markers-0.008
Token "
Token ID366
Feature activation-6.79e-01

Pos logit contributions
 thirteen+0.044
dy+0.041
die+0.041
 obstacle+0.040
Dean+0.038
Neg logit contributions
-0.027
 supplies-0.024
 help-0.023
 yes-0.022
 markers-0.022
TokenYes
Token ID5297
Feature activation-1.41e+00

Pos logit contributions
+0.022
 supplies+0.020
 markers+0.020
enge+0.019
 together+0.019
Neg logit contributions
dy-0.026
 thirteen-0.025
die-0.024
 obstacle-0.024
 remembers-0.022
Token,
Token ID11
Feature activation-7.52e-01

Pos logit contributions
+0.032
 supplies+0.030
ird+0.026
 markers+0.026
vell+0.024
Neg logit contributions
 thirteen-0.071
dy-0.067
die-0.065
 obstacle-0.062
Dean-0.060
Token this
Token ID428
Feature activation-1.25e-01

Pos logit contributions
+0.014
 supplies+0.013
 help+0.012
 markers+0.012
ird+0.012
Neg logit contributions
 thirteen-0.040
dy-0.038
die-0.038
Dean-0.035
 obstacle-0.035
Token axe
Token ID23272
Feature activation-1.13e+00

Pos logit contributions
 supplies+0.004
+0.004
 help+0.004
ird+0.003
 fixed+0.003
Neg logit contributions
 thirteen-0.005
dy-0.005
die-0.005
Dean-0.004
 obstacle-0.004
Token is
Token ID318
Feature activation-1.87e-02

Pos logit contributions
+0.037
 supplies+0.034
ird+0.032
 markers+0.032
vell+0.031
Neg logit contributions
 thirteen-0.045
dy-0.042
die-0.040
 obstacle-0.038
Dean-0.036
Token perfect
Token ID2818
Feature activation-2.62e-01

Pos logit contributions
+0.000
 supplies+0.000
 help+0.000
 markers+0.000
ird+0.000
Neg logit contributions
 thirteen-0.001
dy-0.001
die-0.001
 obstacle-0.001
Dean-0.001
Token toys
Token ID14958
Feature activation+1.39e-01

Pos logit contributions
+0.015
 supplies+0.014
ird+0.012
 markers+0.012
 help+0.012
Neg logit contributions
 thirteen-0.040
dy-0.037
die-0.036
 obstacle-0.035
Dean-0.034
Token.
Token ID13
Feature activation-4.12e-01

Pos logit contributions
dy+0.005
 thirteen+0.005
die+0.004
 moral+0.004
He+0.004
Neg logit contributions
-0.005
 Bian-0.004
ird-0.004
anger-0.004
 supplies-0.004
Token<|endoftext|>
Token ID50256
Feature activation+5.35e-01

Pos logit contributions
+0.010
 supplies+0.008
 help+0.008
 Tim+0.008
 fixed+0.007
Neg logit contributions
 thirteen-0.021
 obstacle-0.018
dy-0.018
die-0.018
Dean-0.017
TokenOnce
Token ID7454
Feature activation+3.03e-01

Pos logit contributions
 thirteen+0.014
dy+0.013
die+0.013
Today+0.012
He+0.012
Neg logit contributions
vell-0.022
 markers-0.022
ird-0.022
 supplies-0.022
-0.022
Token upon
Token ID2402
Feature activation-6.60e-01

Pos logit contributions
 thirteen+0.011
dy+0.010
die+0.009
 obstacle+0.009
Dean+0.008
Neg logit contributions
-0.011
 supplies-0.011
ird-0.010
 markers-0.010
 help-0.010
Token a
Token ID257
Feature activation-1.15e+00

Pos logit contributions
+0.019
 supplies+0.017
ird+0.016
vell+0.015
 markers+0.015
Neg logit contributions
 thirteen-0.029
dy-0.028
die-0.028
He-0.025
Today-0.025
Token time
Token ID640
Feature activation-8.33e-01

Pos logit contributions
+0.041
 supplies+0.038
ird+0.036
 markers+0.035
 help+0.035
Neg logit contributions
 thirteen-0.043
dy-0.039
die-0.038
 obstacle-0.035
Dean-0.034
Token,
Token ID11
Feature activation-3.27e-01

Pos logit contributions
 supplies+0.021
+0.021
 markers+0.019
ird+0.019
vell+0.018
Neg logit contributions
 thirteen-0.038
dy-0.037
die-0.035
Today-0.032
 obstacle-0.032
Token there
Token ID612
Feature activation+2.66e-01

Pos logit contributions
+0.006
 together+0.006
 fixed+0.006
 help+0.006
 decor+0.005
Neg logit contributions
 thirteen-0.017
 obstacle-0.016
die-0.015
dy-0.015
Dean-0.015
Token was
Token ID373
Feature activation-1.48e-02

Pos logit contributions
dy+0.010
 thirteen+0.010
die+0.009
Today+0.008
 moral+0.008
Neg logit contributions
-0.009
 supplies-0.009
vell-0.008
 markers-0.008
ird-0.008
Token a
Token ID257
Feature activation-2.82e-01

Pos logit contributions
+0.000
 supplies+0.000
ird+0.000
vell+0.000
anger+0.000
Neg logit contributions
 thirteen-0.001
dy-0.001
die-0.001
Today-0.001
Dean-0.001
Token.
Token ID13
Feature activation+3.02e-01

Pos logit contributions
+0.026
 supplies+0.024
 help+0.021
ird+0.021
 markers+0.020
Neg logit contributions
 thirteen-0.077
dy-0.073
die-0.072
Dean-0.067
 obstacle-0.067
Token She
Token ID1375
Feature activation-2.13e-01

Pos logit contributions
 thirteen+0.012
dy+0.011
die+0.011
 obstacle+0.010
Dean+0.010
Neg logit contributions
-0.010
 supplies-0.009
ird-0.009
 markers-0.009
 help-0.009
Token told
Token ID1297
Feature activation-3.45e-01

Pos logit contributions
+0.006
 supplies+0.006
ird+0.005
vell+0.005
 markers+0.005
Neg logit contributions
 thirteen-0.009
dy-0.009
die-0.009
Dean-0.008
 obstacle-0.008
Token them
Token ID606
Feature activation-1.63e+00

Pos logit contributions
+0.008
 supplies+0.007
ird+0.006
 markers+0.006
 help+0.006
Neg logit contributions
 thirteen-0.017
dy-0.016
die-0.016
 obstacle-0.015
Dean-0.015
Token what
Token ID644
Feature activation-3.70e-01

Pos logit contributions
 supplies+0.032
 help+0.031
 markers+0.028
+0.028
 yes+0.027
Neg logit contributions
 thirteen-0.084
die-0.079
dy-0.078
 obstacle-0.076
Dean-0.074
Token was
Token ID373
Feature activation-2.69e+00

Pos logit contributions
+0.009
 supplies+0.008
ird+0.007
 markers+0.007
 help+0.007
Neg logit contributions
 thirteen-0.018
dy-0.017
die-0.017
 obstacle-0.016
Dean-0.015
Token happening
Token ID5836
Feature activation-9.71e-01

Pos logit contributions
+0.104
ird+0.089
 supplies+0.086
 markers+0.085
uggle+0.080
Neg logit contributions
 thirteen-0.101
dy-0.094
die-0.088
 obstacle-0.080
Dean-0.078
Token.
Token ID13
Feature activation+1.45e-01

Pos logit contributions
+0.020
 supplies+0.019
 help+0.017
 markers+0.016
ird+0.016
Neg logit contributions
 thirteen-0.051
dy-0.048
die-0.047
 obstacle-0.044
Dean-0.044
Token They
Token ID1119
Feature activation-7.35e-01

Pos logit contributions
 thirteen+0.006
dy+0.006
die+0.005
 obstacle+0.005
Dean+0.005
Neg logit contributions
-0.005
 supplies-0.004
ird-0.004
 markers-0.004
 help-0.004
Token got
Token ID1392
Feature activation+1.29e-02

Pos logit contributions
+0.023
 supplies+0.019
anger+0.018
uggle+0.018
vell+0.018
Neg logit contributions
 thirteen-0.031
dy-0.029
die-0.028
 moral-0.027
Dean-0.026
Token up
Token ID510
Feature activation+5.17e-02

Pos logit contributions
 thirteen+0.001
dy+0.001
die+0.000
 obstacle+0.000
Dean+0.000
Neg logit contributions
-0.000
ird-0.000
 supplies-0.000
vell-0.000
 markers-0.000
TokenNo
Token ID2949
Feature activation+2.04e-02

Pos logit contributions
+0.031
 supplies+0.028
ird+0.027
 help+0.027
 markers+0.026
Neg logit contributions
 thirteen-0.029
dy-0.027
die-0.026
 obstacle-0.024
Dean-0.023
Token!
Token ID0
Feature activation-6.44e-03

Pos logit contributions
 thirteen+0.001
dy+0.001
die+0.001
 obstacle+0.001
Dean+0.001
Neg logit contributions
-0.000
 supplies-0.000
 markers-0.000
ird-0.000
vell-0.000
Token It
Token ID632
Feature activation-1.00e+00

Pos logit contributions
+0.000
 supplies+0.000
ird+0.000
 markers+0.000
 help+0.000
Neg logit contributions
 thirteen-0.000
dy-0.000
die-0.000
 obstacle-0.000
Dean-0.000
Tokenâ
Token ID22940
Feature activation+4.41e-01

Pos logit contributions
+0.040
 supplies+0.037
ird+0.035
 markers+0.035
 help+0.034
Neg logit contributions
 thirteen-0.034
dy-0.031
die-0.030
 obstacle-0.027
Dean-0.026
Token
Token ID26391
Feature activation-4.51e-01

Pos logit contributions
 thirteen+0.016
dy+0.013
die+0.013
reetings+0.012
 obstacle+0.012
Neg logit contributions
-0.016
 supplies-0.016
 markers-0.016
vell-0.015
 brand-0.015
Token
Token ID8151
Feature activation-1.04e+00

Pos logit contributions
 supplies+0.016
+0.016
 markers+0.015
vell+0.015
 help+0.015
Neg logit contributions
 thirteen-0.015
dy-0.014
die-0.014
Dean-0.012
Today-0.012
Tokens
Token ID82
Feature activation+6.58e-03

Pos logit contributions
+0.033
 supplies+0.033
 markers+0.031
 help+0.029
vell+0.029
Neg logit contributions
 thirteen-0.042
dy-0.039
die-0.037
 obstacle-0.034
Today-0.033
Token my
Token ID616
Feature activation-1.79e-01

Pos logit contributions
 thirteen+0.000
dy+0.000
die+0.000
 obstacle+0.000
Dean+0.000
Neg logit contributions
-0.000
 supplies-0.000
ird-0.000
 markers-0.000
vell-0.000
Token doll
Token ID3654
Feature activation-7.99e-01

Pos logit contributions
+0.005
 supplies+0.004
vell+0.004
 markers+0.004
ird+0.004
Neg logit contributions
 thirteen-0.008
dy-0.008
die-0.007
 moral-0.007
 obstacle-0.007
Token.
Token ID13
Feature activation+1.72e-01

Pos logit contributions
+0.020
 supplies+0.018
ird+0.017
 markers+0.016
 help+0.016
Neg logit contributions
 thirteen-0.038
dy-0.036
die-0.035
 obstacle-0.033
Dean-0.032
Tokenâ
Token ID22940
Feature activation+6.68e-01

Pos logit contributions
 thirteen+0.007
dy+0.006
die+0.006
 obstacle+0.006
Dean+0.006
Neg logit contributions
-0.006
 supplies-0.005
ird-0.005
 markers-0.005
 help-0.005
Token.
Token ID13
Feature activation+2.31e-02

Pos logit contributions
 thirteen+0.051
dy+0.048
die+0.047
 obstacle+0.044
Dean+0.044
Neg logit contributions
-0.015
 supplies-0.012
 help-0.010
ird-0.010
 markers-0.010
Token
Token ID198
Feature activation-5.09e-02

Pos logit contributions
 thirteen+0.001
dy+0.001
die+0.001
 obstacle+0.001
Dean+0.001
Neg logit contributions
-0.001
 supplies-0.001
ird-0.001
 markers-0.001
 help-0.001
Token
Token ID198
Feature activation-1.24e-01

Pos logit contributions
+0.001
 supplies+0.001
 help+0.001
ird+0.001
 fixed+0.001
Neg logit contributions
 thirteen-0.002
dy-0.002
die-0.002
 obstacle-0.002
Dean-0.002
TokenBut
Token ID1537
Feature activation-6.46e-01

Pos logit contributions
+0.004
 supplies+0.004
 help+0.004
 Bian+0.003
vell+0.003
Neg logit contributions
 thirteen-0.005
die-0.005
dy-0.005
bi-0.004
mo-0.004
Token what
Token ID644
Feature activation+4.13e-01

Pos logit contributions
+0.017
 supplies+0.016
ird+0.014
 help+0.014
 markers+0.014
Neg logit contributions
 thirteen-0.030
dy-0.028
die-0.028
 obstacle-0.026
Dean-0.025
Token was
Token ID373
Feature activation-2.32e+00

Pos logit contributions
 thirteen+0.019
dy+0.017
die+0.017
 obstacle+0.016
Dean+0.016
Neg logit contributions
-0.011
 supplies-0.011
ird-0.010
 markers-0.010
 help-0.009
Token it
Token ID340
Feature activation-1.36e+00

Pos logit contributions
+0.077
ird+0.066
 supplies+0.066
 markers+0.062
vell+0.058
Neg logit contributions
 thirteen-0.098
dy-0.089
die-0.086
 obstacle-0.083
 moral-0.078
Token for
Token ID329
Feature activation-6.08e-01

Pos logit contributions
 supplies+0.029
 help+0.025
+0.025
 fixed+0.023
vell+0.022
Neg logit contributions
 thirteen-0.074
 obstacle-0.062
 seventeen-0.062
die-0.062
dy-0.062
Token?
Token ID30
Feature activation-2.73e-01

Pos logit contributions
+0.018
vell+0.016
ird+0.016
 supplies+0.016
 markers+0.015
Neg logit contributions
 thirteen-0.024
dy-0.024
die-0.022
 moral-0.022
Dean-0.022
Token Sl
Token ID3454
Feature activation+4.25e-01

Pos logit contributions
+0.009
 supplies+0.008
ird+0.008
 markers+0.007
 help+0.007
Neg logit contributions
 thirteen-0.011
dy-0.010
die-0.010
 obstacle-0.009
Dean-0.009
Tokenither
Token ID1555
Feature activation+6.63e-02

Pos logit contributions
 thirteen+0.014
dy+0.013
die+0.013
 obstacle+0.012
Dean+0.011
Neg logit contributions
-0.017
 supplies-0.016
ird-0.015
 markers-0.015
 help-0.015
Token interesting
Token ID3499
Feature activation-2.51e-01

Pos logit contributions
+0.005
 supplies+0.005
ird+0.005
 markers+0.004
 help+0.004
Neg logit contributions
 thirteen-0.006
dy-0.006
die-0.005
 obstacle-0.005
 moral-0.005
Token thing
Token ID1517
Feature activation-3.24e-01

Pos logit contributions
 supplies+0.007
+0.006
 help+0.006
 markers+0.006
 decor+0.006
Neg logit contributions
 thirteen-0.012
dy-0.010
die-0.010
Dean-0.010
Those-0.009
Token she
Token ID673
Feature activation-1.04e+00

Pos logit contributions
+0.009
 supplies+0.008
ird+0.008
 markers+0.007
vell+0.007
Neg logit contributions
 thirteen-0.015
dy-0.014
die-0.013
 obstacle-0.012
Dean-0.012
Token had
Token ID550
Feature activation-8.18e-01

Pos logit contributions
+0.045
 supplies+0.041
 markers+0.039
ird+0.039
vell+0.037
Neg logit contributions
 thirteen-0.032
dy-0.029
die-0.028
Dean-0.025
 obstacle-0.024
Token ever
Token ID1683
Feature activation-1.04e+00

Pos logit contributions
+0.026
 supplies+0.023
 markers+0.022
ird+0.021
 Bian+0.021
Neg logit contributions
 thirteen-0.033
dy-0.030
die-0.029
Today-0.028
Dean-0.027
Token seen
Token ID1775
Feature activation-3.88e+00

Pos logit contributions
 supplies+0.025
+0.024
ird+0.022
 help+0.022
 markers+0.022
Neg logit contributions
 thirteen-0.050
dy-0.047
die-0.046
 obstacle-0.043
Dean-0.043
Token.
Token ID13
Feature activation-8.80e-02

Pos logit contributions
+0.101
 supplies+0.087
 markers+0.080
ird+0.080
 help+0.076
Neg logit contributions
 thirteen-0.184
dy-0.168
die-0.164
 obstacle-0.155
Today-0.153
Token
Token ID198
Feature activation-6.73e-02

Pos logit contributions
+0.003
 supplies+0.003
vell+0.003
 markers+0.003
ird+0.003
Neg logit contributions
 thirteen-0.003
dy-0.003
die-0.003
Dean-0.003
Today-0.002
Token
Token ID198
Feature activation-1.38e-01

Pos logit contributions
+0.002
 supplies+0.002
 help+0.002
ird+0.001
 fixed+0.001
Neg logit contributions
 thirteen-0.003
dy-0.003
die-0.003
 obstacle-0.003
Dean-0.003
TokenEventually
Token ID28724
Feature activation-1.01e+00

Pos logit contributions
+0.004
 help+0.004
ird+0.004
 supplies+0.004
 fixed+0.004
Neg logit contributions
 thirteen-0.006
dy-0.006
die-0.005
mo-0.005
 obstacle-0.005
Token the
Token ID262
Feature activation-7.89e-01

Pos logit contributions
+0.028
 supplies+0.025
vell+0.025
ird+0.024
 markers+0.024
Neg logit contributions
 thirteen-0.044
dy-0.040
 obstacle-0.039
die-0.038
 moral-0.037
Token things
Token ID1243
Feature activation-4.02e-01

Pos logit contributions
 thirteen+0.029
dy+0.026
die+0.026
Dean+0.024
 obstacle+0.024
Neg logit contributions
-0.015
 supplies-0.015
 markers-0.013
 help-0.013
ird-0.012
Token that
Token ID326
Feature activation-1.46e+00

Pos logit contributions
+0.011
ird+0.009
 supplies+0.009
 markers+0.008
enge+0.008
Neg logit contributions
 thirteen-0.018
dy-0.017
die-0.016
 obstacle-0.015
Today-0.015
Token Kelly
Token ID9077
Feature activation-4.05e-01

Pos logit contributions
+0.052
ird+0.043
 supplies+0.042
resses+0.041
uggle+0.041
Neg logit contributions
 thirteen-0.053
dy-0.050
die-0.049
Dean-0.046
Today-0.046
Token had
Token ID550
Feature activation-1.24e+00

Pos logit contributions
+0.011
 supplies+0.011
ird+0.010
 markers+0.010
 help+0.010
Neg logit contributions
 thirteen-0.018
dy-0.016
die-0.016
 obstacle-0.015
Dean-0.015
Token never
Token ID1239
Feature activation-1.62e+00

Pos logit contributions
+0.043
 supplies+0.038
ird+0.037
 markers+0.036
vell+0.035
Neg logit contributions
 thirteen-0.049
dy-0.045
die-0.044
 obstacle-0.040
Dean-0.039
Token seen
Token ID1775
Feature activation-4.11e+00

Pos logit contributions
 supplies+0.034
+0.033
 help+0.031
ird+0.031
 fixed+0.031
Neg logit contributions
 thirteen-0.083
dy-0.077
die-0.076
 obstacle-0.072
Dean-0.071
Token before
Token ID878
Feature activation+3.59e-01

Pos logit contributions
+0.120
 supplies+0.107
ird+0.099
 markers+0.096
vell+0.095
Neg logit contributions
 thirteen-0.185
dy-0.167
die-0.164
 obstacle-0.153
Dean-0.150
Token,
Token ID11
Feature activation+5.15e-01

Pos logit contributions
 thirteen+0.019
dy+0.018
die+0.018
 obstacle+0.017
Dean+0.017
Neg logit contributions
-0.007
 supplies-0.006
ird-0.005
 markers-0.005
 help-0.005
Token she
Token ID673
Feature activation+3.13e-01

Pos logit contributions
 thirteen+0.026
dy+0.025
die+0.024
 obstacle+0.022
Dean+0.022
Neg logit contributions
-0.012
 supplies-0.011
 markers-0.010
 help-0.010
ird-0.010
Token didn
Token ID1422
Feature activation-5.23e-01

Pos logit contributions
 thirteen+0.015
dy+0.014
die+0.014
 obstacle+0.013
Dean+0.013
Neg logit contributions
-0.007
 supplies-0.007
 fixed-0.007
ird-0.007
 help-0.007
Token't
Token ID470
Feature activation+2.89e-01

Pos logit contributions
+0.010
 supplies+0.008
 help+0.008
ird+0.007
 markers+0.007
Neg logit contributions
 thirteen-0.029
dy-0.028
die-0.027
 obstacle-0.025
Dean-0.025
Token She
Token ID1375
Feature activation+1.64e-01

Pos logit contributions
 thirteen+0.009
dy+0.009
die+0.009
Dean+0.007
 obstacle+0.007
Neg logit contributions
-0.010
 supplies-0.010
vell-0.010
ird-0.009
 markers-0.009
Token put
Token ID1234
Feature activation-1.73e-01

Pos logit contributions
 thirteen+0.008
dy+0.007
die+0.007
 obstacle+0.007
Dean+0.006
Neg logit contributions
-0.004
 supplies-0.004
ird-0.004
 markers-0.004
 help-0.004
Token on
Token ID319
Feature activation+1.28e-01

Pos logit contributions
+0.009
vell+0.009
ird+0.008
idents+0.008
ver+0.008
Neg logit contributions
 thirteen-0.003
dy-0.002
 moral-0.002
Today-0.002
die-0.002
Token her
Token ID607
Feature activation-3.57e-01

Pos logit contributions
 thirteen+0.005
dy+0.005
die+0.004
Today+0.004
 moral+0.004
Neg logit contributions
-0.004
vell-0.004
ird-0.004
enge-0.004
 supplies-0.004
Token rain
Token ID6290
Feature activation-3.41e-01

Pos logit contributions
+0.011
vell+0.011
ird+0.011
enge+0.010
 supplies+0.010
Neg logit contributions
dy-0.013
 moral-0.012
die-0.012
 thirteen-0.011
Today-0.011
Tokencoat
Token ID31434
Feature activation-3.88e+00

Pos logit contributions
+0.009
vell+0.008
ird+0.008
 supplies+0.007
 markers+0.007
Neg logit contributions
dy-0.015
 thirteen-0.015
die-0.013
Today-0.013
iet-0.013
Token and
Token ID290
Feature activation+2.23e-01

Pos logit contributions
+0.169
vell+0.165
 supplies+0.157
enge+0.150
ird+0.150
Neg logit contributions
 thirteen-0.099
dy-0.095
die-0.078
Meanwhile-0.075
Today-0.074
Token ran
Token ID4966
Feature activation+6.68e-01

Pos logit contributions
dy+0.008
 thirteen+0.008
die+0.008
 obstacle+0.007
Dean+0.007
Neg logit contributions
-0.009
vell-0.007
 supplies-0.007
uggle-0.007
ird-0.006
Token outside
Token ID2354
Feature activation-8.17e-01

Pos logit contributions
dy+0.024
 thirteen+0.024
die+0.023
Today+0.021
He+0.021
Neg logit contributions
-0.024
ird-0.021
vell-0.020
 supplies-0.018
 markers-0.018
Token to
Token ID284
Feature activation+3.81e-01

Pos logit contributions
+0.027
vell+0.026
 supplies+0.024
ird+0.024
enge+0.024
Neg logit contributions
dy-0.030
 thirteen-0.029
die-0.027
Today-0.027
 moral-0.026
Token dance
Token ID9280
Feature activation+1.40e+00

Pos logit contributions
dy+0.012
 thirteen+0.011
 obstacle+0.011
die+0.010
 moral+0.010
Neg logit contributions
-0.015
vell-0.014
enge-0.013
 supplies-0.012
idents-0.012
Token little
Token ID1310
Feature activation+3.66e-01

Pos logit contributions
 help+0.009
+0.008
 show+0.008
 supplies+0.008
 hide+0.007
Neg logit contributions
 thirteen-0.013
die-0.011
dy-0.010
Nik-0.010
 seventeen-0.010
Token frog
Token ID21264
Feature activation+8.73e-01

Pos logit contributions
 thirteen+0.015
die+0.012
Back+0.012
reetings+0.012
Dean+0.012
Neg logit contributions
 help-0.012
 supplies-0.011
-0.011
 cr-0.010
 markers-0.010
Token what
Token ID644
Feature activation+3.59e-01

Pos logit contributions
 thirteen+0.050
Today+0.045
die+0.044
 obstacle+0.044
Dean+0.044
Neg logit contributions
 help-0.015
 supplies-0.012
 yes-0.012
 together-0.012
 Tim-0.010
Token he
Token ID339
Feature activation-9.06e-01

Pos logit contributions
 thirteen+0.017
dy+0.016
die+0.016
 obstacle+0.015
Dean+0.014
Neg logit contributions
-0.009
 supplies-0.009
 help-0.008
 markers-0.008
 fixed-0.008
Token had
Token ID550
Feature activation-1.40e+00

Pos logit contributions
+0.029
 supplies+0.027
ird+0.025
 markers+0.025
 help+0.024
Neg logit contributions
 thirteen-0.037
dy-0.035
die-0.034
 obstacle-0.031
Dean-0.031
Token seen
Token ID1775
Feature activation-4.25e+00

Pos logit contributions
+0.052
 supplies+0.042
ird+0.041
 markers+0.041
 Bian+0.040
Neg logit contributions
 thirteen-0.051
dy-0.048
die-0.045
 moral-0.043
Today-0.042
Token.
Token ID13
Feature activation-8.71e-02

Pos logit contributions
+0.114
 supplies+0.095
ird+0.091
 markers+0.089
vell+0.086
Neg logit contributions
 thirteen-0.199
dy-0.183
die-0.177
 obstacle-0.169
Dean-0.164
Token The
Token ID383
Feature activation-1.02e-01

Pos logit contributions
 supplies+0.003
vell+0.003
+0.003
ird+0.003
 markers+0.003
Neg logit contributions
 thirteen-0.003
dy-0.003
die-0.003
He-0.002
 moral-0.002
Token little
Token ID1310
Feature activation-1.28e-01

Pos logit contributions
 help+0.003
+0.003
 show+0.003
 supplies+0.003
 yes+0.003
Neg logit contributions
 thirteen-0.005
die-0.004
dy-0.004
Nik-0.004
reetings-0.004
Token frog
Token ID21264
Feature activation+2.05e-01

Pos logit contributions
 help+0.004
 supplies+0.004
+0.003
 cr+0.003
 hide+0.003
Neg logit contributions
 thirteen-0.006
Nik-0.005
reetings-0.005
 seventeen-0.005
die-0.005
Token's
Token ID338
Feature activation+8.71e-01

Pos logit contributions
 thirteen+0.011
 obstacle+0.010
die+0.010
reetings+0.010
dy+0.010
Neg logit contributions
-0.003
 fixed-0.003
 yes-0.003
 help-0.003
 helped-0.003
Token about
Token ID546
Feature activation-1.63e+00

Pos logit contributions
+0.024
 supplies+0.022
ird+0.020
vell+0.020
 markers+0.020
Neg logit contributions
 thirteen-0.022
dy-0.020
die-0.019
 moral-0.018
Today-0.018
Token we
Token ID356
Feature activation-6.67e-01

Pos logit contributions
+0.039
 supplies+0.035
 help+0.033
 markers+0.032
ird+0.030
Neg logit contributions
 thirteen-0.081
dy-0.077
die-0.076
 obstacle-0.072
Today-0.069
Token make
Token ID787
Feature activation+2.03e-01

Pos logit contributions
+0.020
 supplies+0.018
ird+0.017
 markers+0.016
vell+0.016
Neg logit contributions
 thirteen-0.029
dy-0.027
die-0.026
 obstacle-0.024
Dean-0.024
Token some
Token ID617
Feature activation+8.40e-02

Pos logit contributions
 thirteen+0.009
dy+0.008
die+0.008
 obstacle+0.008
Dean+0.008
Neg logit contributions
-0.006
 supplies-0.005
ird-0.005
vell-0.005
 markers-0.005
Token mac
Token ID8352
Feature activation-1.16e+00

Pos logit contributions
 thirteen+0.004
dy+0.003
die+0.003
 obstacle+0.003
Dean+0.003
Neg logit contributions
-0.003
 supplies-0.002
ird-0.002
 markers-0.002
vell-0.002
Tokenaron
Token ID8045
Feature activation-3.93e+00

Pos logit contributions
+0.037
 supplies+0.035
ird+0.032
 markers+0.032
 help+0.031
Neg logit contributions
 thirteen-0.048
dy-0.044
die-0.043
 obstacle-0.040
Dean-0.039
Tokeni
Token ID72
Feature activation-8.74e-01

Pos logit contributions
+0.140
 supplies+0.133
 markers+0.125
ird+0.124
 help+0.124
Neg logit contributions
 thirteen-0.144
dy-0.129
die-0.128
 obstacle-0.125
Today-0.116
Token and
Token ID290
Feature activation-3.31e-01

Pos logit contributions
+0.019
 supplies+0.019
 help+0.018
ird+0.017
 decor+0.017
Neg logit contributions
 thirteen-0.043
 obstacle-0.040
die-0.039
dy-0.039
Meanwhile-0.037
Token cheese
Token ID9891
Feature activation-2.26e-01

Pos logit contributions
 help+0.008
+0.008
 supplies+0.008
 markers+0.008
 cr+0.008
Neg logit contributions
 thirteen-0.015
dy-0.014
die-0.014
 obstacle-0.014
Today-0.013
Token for
Token ID329
Feature activation+1.01e-01

Pos logit contributions
+0.005
 supplies+0.004
 help+0.004
ird+0.004
 markers+0.004
Neg logit contributions
 thirteen-0.012
dy-0.011
die-0.011
 obstacle-0.011
Dean-0.010
Token dinner
Token ID8073
Feature activation-4.30e-01

Pos logit contributions
 thirteen+0.005
dy+0.004
die+0.004
 obstacle+0.004
Dean+0.004
Neg logit contributions
-0.003
 supplies-0.002
ird-0.002
 markers-0.002
vell-0.002