TOP ACTIVATIONS
MAX = 4.414

together . They played hop sc ot ch and picked more
day , they play hop sc ot ch . They draw
love . They played hop sc ot ch , jumped rope
She played tag and hop sc ot ch and pretended to
and - seek and hop sc ot ch . Then ,
They started to play hop sc ot ch and had lots
also played tag and hop sc ot ch . They laughed
. You can play hop sc ot ch again . But
would play tag , hop sc ot ch , and many
hide and seek , hop sc ot ch and much more
When Daisy put the magn ifying glass down , the cater
. Barry enjoyed playing hop sc ot ch with them so
. Hop sc ot ch hopped to the
group of girls playing hop sc ot ch . They joined
play a game of hop sc ot ch , but it
know , right ? Micro sc opes are amazing . They
. Hop sc ot ch thanked the magical
flashlight , and a magn ifying glass . As
bunny ?" Hop sc ot ch replied , "
knew how to play hop sc ot ch .

MIN ACTIVATIONS
MIN = -3.483

it 's special and one - of - a - kind
to call him the one - foot king .
special and one - of - a - kind . Do
What is it ? Where is it ?" He followed the
" Dear ... An - na ... and ... Ben
to do more than anything - she loved to roll !
singing , so that one - day she could record her
nineteen , twenty , twenty - one , twenty - two
twenty - one , twenty - two , twenty - three
" That hurt . Where is my laser ?"
Where is mom ? Where is Sue ? Where is the
it was now a one - of - a - kind
she looked with the make - up . She realized that
'm here for a check - up on my heart .
proudly call himself the one - foot king . <|endoftext|> Once
the red car . Where is it ?" They
our moms first . Where is your mom ?"
" Oh no ! Where is my purse ?" she asked
" Oh no ! Where is my nose ?" she cried
done by asking for help from others . And sometimes ,

INTERVAL 2.440 - 4.414
CONTAINS 0.012%

It was such a fun day of swimming
The children played hide - and - seek in the barn
his knee . He cries out loud . "
. They played hide - and - seek and jumped on
to taste ." P olly thanked the friendly cat

INTERVAL 0.466 - 2.440
CONTAINS 20.912%

until he realized that he had melted the print . He
They eat the banana and enjoy their snack . They
. Panc akes were his favorite food in the whole wide
said , " Thank you , Spe edy . Let 's
finished his tower and looked for Spot . He wanted to

INTERVAL -1.509 - 0.466
CONTAINS 78.399%

was a happy planet called Earth . The planet had lots
too . <|endoftext|> Once upon a time , there was a
looking for some cheese ," replied S que ak .
fl ute . The end . <|endoftext|> Once upon a time
a happy heart and was excited to discover new things .

INTERVAL -3.483 - -1.509
CONTAINS 0.676%

was a little girl named Lily . She loved playing in
, Lily loved to call her friends on her toy phone
a car with a really large tire . He pretended he
was too expensive , but Car la wouldn 't take no
lived in a big city with many people and buildings .
Token together
Token ID1978
Feature activation-1.16e+00

Pos logit contributions
 quality+0.006
 de+0.006
 candy+0.006
 topp+0.006
 money+0.005
Neg logit contributions
Mom-0.004
 keeper-0.004
line-0.004
ull-0.004
 ranger-0.004
Token.
Token ID13
Feature activation-2.06e-01

Pos logit contributions
Mom+0.029
ull+0.029
 messenger+0.028
aro+0.028
line+0.027
Neg logit contributions
 candy-0.029
 quality-0.029
 money-0.028
 topp-0.027
 favorites-0.027
Token They
Token ID1119
Feature activation-7.05e-01

Pos logit contributions
ull+0.005
 hurried+0.005
aro+0.005
line+0.005
lee+0.005
Neg logit contributions
 Ow-0.005
 de-0.005
 topp-0.005
 tant-0.005
icer-0.005
Token played
Token ID2826
Feature activation+3.75e-01

Pos logit contributions
ull+0.018
aro+0.016
line+0.015
 below+0.015
 keeper+0.015
Neg logit contributions
 candy-0.019
 cris-0.018
 cand-0.018
 pret-0.017
 favorites-0.017
Token hop
Token ID1725
Feature activation+7.56e-01

Pos logit contributions
 Ow+0.011
 de+0.011
 tant+0.011
 topp+0.011
 ABC+0.011
Neg logit contributions
ull-0.007
aro-0.007
 hurried-0.007
 below-0.007
line-0.007
Tokensc
Token ID1416
Feature activation+4.41e+00

Pos logit contributions
 Ow+0.022
 topp+0.021
icer+0.021
 fault+0.021
 ABC+0.020
Neg logit contributions
 below-0.015
 soon-0.015
ull-0.015
 ran-0.015
aro-0.015
Tokenot
Token ID313
Feature activation+3.68e-01

Pos logit contributions
 de+0.196
 topp+0.195
 Ow+0.195
 tant+0.192
 candy+0.192
Neg logit contributions
ull-0.020
aro-0.015
lee-0.014
 hurried-0.013
 unf-0.013
Tokench
Token ID354
Feature activation+1.56e+00

Pos logit contributions
icer+0.011
 Ow+0.011
 topp+0.011
 fault+0.011
 savings+0.010
Neg logit contributions
 ran-0.008
aro-0.007
 hurried-0.007
 rushed-0.007
ull-0.007
Token and
Token ID290
Feature activation+1.21e-01

Pos logit contributions
 Ow+0.042
 de+0.042
 tant+0.042
 topp+0.042
 candy+0.041
Neg logit contributions
ull-0.033
aro-0.032
 hurried-0.031
line-0.031
 below-0.031
Token picked
Token ID6497
Feature activation+3.25e-01

Pos logit contributions
 candy+0.004
 de+0.004
 pret+0.004
 topp+0.004
 cris+0.004
Neg logit contributions
ull-0.002
lee-0.002
line-0.002
Mom-0.002
 unf-0.002
Token more
Token ID517
Feature activation-9.40e-01

Pos logit contributions
 shave+0.009
 license+0.009
icer+0.009
 tant+0.009
 savings+0.009
Neg logit contributions
 ran-0.007
 slowly-0.007
 decided-0.007
 below-0.007
ull-0.007
Token day
Token ID1110
Feature activation-2.69e-01

Pos logit contributions
 Ow+0.001
 de+0.001
 tant+0.001
 topp+0.001
icer+0.001
Neg logit contributions
ull-0.000
aro-0.000
 hurried-0.000
 below-0.000
line-0.000
Token,
Token ID11
Feature activation-7.24e-01

Pos logit contributions
ull+0.004
aro+0.004
 hurried+0.004
 soon+0.004
Mom+0.003
Neg logit contributions
 tant-0.009
 Ow-0.009
 de-0.009
icer-0.009
 topp-0.009
Token they
Token ID484
Feature activation-1.31e+00

Pos logit contributions
 Mom+0.007
 they+0.007
 soon+0.007
 Jas+0.006
 decided+0.006
Neg logit contributions
icer-0.031
pill-0.030
 tant-0.030
 unnecessary-0.030
rob-0.029
Token play
Token ID711
Feature activation-5.06e-01

Pos logit contributions
aro+0.026
ull+0.026
lee+0.023
 hurried+0.023
line+0.023
Neg logit contributions
 Ow-0.038
 de-0.038
 candy-0.038
 tant-0.038
 price-0.037
Token hop
Token ID1725
Feature activation+3.75e-01

Pos logit contributions
ull+0.011
 soon+0.011
 filled+0.011
 unf+0.011
 ran+0.011
Neg logit contributions
 de-0.014
 Ow-0.013
icer-0.013
 topp-0.013
 tant-0.013
Tokensc
Token ID1416
Feature activation+4.32e+00

Pos logit contributions
 Ow+0.013
 topp+0.013
icer+0.013
 ABC+0.013
 fault+0.012
Neg logit contributions
 below-0.006
 soon-0.005
 slowly-0.005
 decided-0.005
ull-0.005
Tokenot
Token ID313
Feature activation+1.56e-01

Pos logit contributions
 de+0.182
 Ow+0.181
 topp+0.179
 tant+0.179
 candy+0.178
Neg logit contributions
ull-0.029
aro-0.025
 hurried-0.024
line-0.023
lee-0.022
Tokench
Token ID354
Feature activation+1.15e+00

Pos logit contributions
icer+0.005
 Ow+0.005
 fault+0.005
 topp+0.005
 savings+0.005
Neg logit contributions
 ran-0.003
 rushed-0.003
 soon-0.003
 went-0.003
ful-0.003
Token.
Token ID13
Feature activation+1.85e-01

Pos logit contributions
 Ow+0.039
 de+0.039
 tant+0.039
 topp+0.038
 candy+0.038
Neg logit contributions
ull-0.017
aro-0.016
 hurried-0.015
line-0.015
 below-0.015
Token They
Token ID1119
Feature activation-8.16e-01

Pos logit contributions
 unnecessary+0.006
 tant+0.006
icer+0.006
rob+0.006
 de+0.006
Neg logit contributions
 Mom-0.003
 hurried-0.003
 ran-0.003
 below-0.003
 Jas-0.003
Token draw
Token ID3197
Feature activation+3.69e-01

Pos logit contributions
ull+0.017
aro+0.016
 hurried+0.016
lee+0.016
line+0.016
Neg logit contributions
 Ow-0.023
 de-0.023
 tant-0.022
 candy-0.022
 topp-0.022
Token love
Token ID1842
Feature activation+2.32e-01

Pos logit contributions
ull+0.016
 decided+0.016
 hurried+0.015
 below+0.015
 cheered+0.015
Neg logit contributions
icer-0.024
 Ow-0.024
 ABC-0.023
 tant-0.022
 license-0.022
Token.
Token ID13
Feature activation+1.85e-01

Pos logit contributions
 Ow+0.008
icer+0.007
 tant+0.007
 ABC+0.007
 unnecessary+0.007
Neg logit contributions
ull-0.004
 below-0.004
 soon-0.004
 hurried-0.004
 ranger-0.004
Token They
Token ID1119
Feature activation-5.77e-01

Pos logit contributions
 tant+0.005
 Ow+0.005
 topp+0.005
 de+0.005
 candy+0.005
Neg logit contributions
ull-0.004
lee-0.003
line-0.003
 hurried-0.003
 below-0.003
Token played
Token ID2826
Feature activation+2.77e-01

Pos logit contributions
ull+0.010
aro+0.009
line+0.009
Mom+0.009
 ranger+0.009
Neg logit contributions
 de-0.018
 candy-0.017
 tant-0.017
 cris-0.017
 ABC-0.017
Token hop
Token ID1725
Feature activation+5.83e-01

Pos logit contributions
 Ow+0.008
 tant+0.008
 de+0.008
icer+0.008
 topp+0.008
Neg logit contributions
ull-0.006
 hurried-0.005
 below-0.005
aro-0.005
line-0.005
Tokensc
Token ID1416
Feature activation+4.29e+00

Pos logit contributions
 Ow+0.018
icer+0.018
 topp+0.017
 fault+0.017
 unnecessary+0.017
Neg logit contributions
 below-0.011
 soon-0.011
 slowly-0.011
 ran-0.011
ull-0.011
Tokenot
Token ID313
Feature activation+5.17e-02

Pos logit contributions
 topp+0.202
 de+0.202
rob+0.201
 candy+0.200
icer+0.199
Neg logit contributions
ull-0.009
ful-0.004
ot-0.004
lee-0.002
aro-0.001
Tokench
Token ID354
Feature activation+1.22e+00

Pos logit contributions
icer+0.002
 Ow+0.002
 fault+0.002
 topp+0.002
 favorites+0.002
Neg logit contributions
 ran-0.001
 soon-0.001
aro-0.001
 rushed-0.001
ful-0.001
Token,
Token ID11
Feature activation-2.99e-01

Pos logit contributions
 Ow+0.038
 topp+0.038
 price+0.038
 de+0.038
icer+0.038
Neg logit contributions
ull-0.021
aro-0.021
line-0.019
 hurried-0.019
 below-0.019
Token jumped
Token ID11687
Feature activation+9.58e-02

Pos logit contributions
 ran+0.005
line+0.005
 hurried+0.005
 soon+0.005
 below+0.005
Neg logit contributions
 Ow-0.010
icer-0.010
 tant-0.009
 unnecessary-0.009
 fault-0.009
Token rope
Token ID17182
Feature activation-1.94e-01

Pos logit contributions
 topp+0.003
icer+0.003
 savings+0.003
 tant+0.003
 Ow+0.003
Neg logit contributions
 below-0.002
 soon-0.002
 slowly-0.002
 ran-0.001
 unf-0.001
Token She
Token ID1375
Feature activation-5.16e-01

Pos logit contributions
 Mom+0.002
 below+0.002
line+0.002
 hurried+0.002
 Dad+0.002
Neg logit contributions
 topp-0.004
 tant-0.004
rob-0.004
 unnecessary-0.004
osaurus-0.004
Token played
Token ID2826
Feature activation+2.34e-01

Pos logit contributions
ull+0.007
 hurried+0.007
aro+0.007
 unf+0.007
lee+0.006
Neg logit contributions
 Ow-0.018
 tant-0.018
 de-0.018
 topp-0.018
 price-0.017
Token tag
Token ID7621
Feature activation-2.13e-02

Pos logit contributions
 Ow+0.007
 tant+0.007
 topp+0.007
 fault+0.007
icer+0.007
Neg logit contributions
ull-0.004
 below-0.004
 hurried-0.004
 soon-0.004
aro-0.004
Token and
Token ID290
Feature activation+4.03e-01

Pos logit contributions
 soon+0.000
 slowly+0.000
 below+0.000
 filled+0.000
 ran+0.000
Neg logit contributions
 Ow-0.001
 fault-0.001
icer-0.001
 unnecessary-0.001
 topp-0.001
Token hop
Token ID1725
Feature activation+3.24e-01

Pos logit contributions
 Ow+0.016
icer+0.016
 tant+0.015
 de+0.015
 fault+0.015
Neg logit contributions
 soon-0.005
 ran-0.005
 hurried-0.005
 decided-0.005
 rushed-0.005
Tokensc
Token ID1416
Feature activation+4.17e+00

Pos logit contributions
 Ow+0.013
icer+0.012
 topp+0.012
 fault+0.012
 ABC+0.012
Neg logit contributions
 below-0.004
 slowly-0.004
 soon-0.004
 ran-0.004
aro-0.004
Tokenot
Token ID313
Feature activation-7.82e-02

Pos logit contributions
 topp+0.202
rob+0.202
 de+0.197
 price+0.196
icer+0.196
Neg logit contributions
ot-0.006
ful-0.004
lee-0.004
ull-0.002
Mom+0.001
Tokench
Token ID354
Feature activation+1.22e+00

Pos logit contributions
 ran+0.002
 soon+0.001
 hurried+0.001
 rushed+0.001
aro+0.001
Neg logit contributions
icer-0.002
 Ow-0.002
 fault-0.002
 topp-0.002
 price-0.002
Token and
Token ID290
Feature activation+1.81e-01

Pos logit contributions
 price+0.040
 Ow+0.039
 topp+0.039
 license+0.039
icer+0.039
Neg logit contributions
aro-0.021
ull-0.020
ful-0.019
 hurried-0.019
 soon-0.019
Token pretended
Token ID36766
Feature activation-4.91e-01

Pos logit contributions
 Ow+0.006
 de+0.006
 tant+0.006
icer+0.006
 topp+0.006
Neg logit contributions
ull-0.003
aro-0.003
 hurried-0.003
line-0.003
 soon-0.003
Token to
Token ID284
Feature activation-4.39e-01

Pos logit contributions
 soon+0.006
 below+0.006
aro+0.006
 ran+0.006
 hurried+0.005
Neg logit contributions
icer-0.019
 Ow-0.019
 tant-0.019
 topp-0.018
 license-0.018
Tokenand
Token ID392
Feature activation+2.99e+00

Pos logit contributions
ull+0.016
aro+0.015
line+0.014
ful+0.014
 soon+0.014
Neg logit contributions
 Ow-0.022
 de-0.021
 fault-0.021
 price-0.021
icer-0.021
Token-
Token ID12
Feature activation-1.16e-01

Pos logit contributions
 Ow+0.083
 de+0.082
 tant+0.081
 topp+0.081
 candy+0.080
Neg logit contributions
ull-0.063
aro-0.060
 hurried-0.059
line-0.059
 below-0.058
Tokenseek
Token ID36163
Feature activation+3.33e-01

Pos logit contributions
aro+0.002
ull+0.002
 soon+0.002
line+0.002
ful+0.002
Neg logit contributions
 Ow-0.003
icer-0.003
 price-0.003
 de-0.003
 tant-0.003
Token and
Token ID290
Feature activation+2.98e-01

Pos logit contributions
 Ow+0.009
 de+0.009
 tant+0.009
 topp+0.009
icer+0.009
Neg logit contributions
ull-0.007
 hurried-0.007
 below-0.007
aro-0.007
 soon-0.007
Token hop
Token ID1725
Feature activation+2.92e-01

Pos logit contributions
 candy+0.010
 de+0.009
 topp+0.009
 Ow+0.009
 ABC+0.009
Neg logit contributions
ull-0.005
line-0.005
 below-0.005
lee-0.005
aro-0.005
Tokensc
Token ID1416
Feature activation+4.15e+00

Pos logit contributions
 de+0.010
 Ow+0.010
 tant+0.009
 topp+0.009
 candy+0.009
Neg logit contributions
ull-0.005
aro-0.004
 hurried-0.004
line-0.004
lee-0.004
Tokenot
Token ID313
Feature activation+3.65e-02

Pos logit contributions
 de+0.177
 topp+0.175
 Ow+0.175
 tant+0.173
 candy+0.173
Neg logit contributions
ull-0.026
lee-0.020
 hurried-0.020
aro-0.020
 unf-0.020
Tokench
Token ID354
Feature activation+1.00e+00

Pos logit contributions
icer+0.001
 Ow+0.001
 fault+0.001
 topp+0.001
 price+0.001
Neg logit contributions
 ran-0.001
 hurried-0.001
 rushed-0.001
aro-0.001
 soon-0.001
Token.
Token ID13
Feature activation-1.83e-01

Pos logit contributions
 de+0.032
 Ow+0.032
 tant+0.032
 topp+0.031
 ABC+0.031
Neg logit contributions
ull-0.016
aro-0.015
 hurried-0.015
line-0.015
 below-0.015
Token Then
Token ID3244
Feature activation+9.22e-01

Pos logit contributions
 hurried+0.004
Mom+0.004
 soon+0.004
aro+0.004
 could+0.004
Neg logit contributions
 topp-0.004
 tant-0.004
 de-0.004
 Ow-0.004
icer-0.004
Token,
Token ID11
Feature activation-1.31e+00

Pos logit contributions
 candy+0.026
 Ow+0.026
 de+0.026
 license+0.026
 ABC+0.025
Neg logit contributions
ull-0.019
line-0.018
aro-0.018
 below-0.018
lee-0.017
Token They
Token ID1119
Feature activation-7.53e-01

Pos logit contributions
lee+0.016
 hurried+0.016
 ran+0.015
 rushed+0.015
line+0.015
Neg logit contributions
 topp-0.023
icer-0.023
 tant-0.023
 license-0.023
 price-0.023
Token started
Token ID2067
Feature activation-7.94e-01

Pos logit contributions
aro+0.021
ull+0.020
 keeper+0.019
line+0.018
 ranger+0.018
Neg logit contributions
 scar-0.018
 candy-0.018
 cris-0.017
 money-0.017
 cand-0.017
Token to
Token ID284
Feature activation+6.64e-02

Pos logit contributions
ull+0.019
aro+0.018
line+0.018
Mom+0.017
 hurried+0.017
Neg logit contributions
 Ow-0.020
 de-0.020
 tant-0.020
 ABC-0.020
 candy-0.020
Token play
Token ID711
Feature activation-5.10e-02

Pos logit contributions
 Ow+0.002
 de+0.002
 tant+0.002
 topp+0.002
 candy+0.002
Neg logit contributions
ull-0.001
aro-0.001
line-0.001
lee-0.001
 hurried-0.001
Token hop
Token ID1725
Feature activation+6.85e-01

Pos logit contributions
ull+0.001
aro+0.001
lee+0.001
line+0.001
 hurried+0.001
Neg logit contributions
 de-0.001
 candy-0.001
 tant-0.001
 Ow-0.001
 license-0.001
Tokensc
Token ID1416
Feature activation+4.11e+00

Pos logit contributions
 Ow+0.018
 de+0.017
 topp+0.017
 tant+0.017
 ABC+0.017
Neg logit contributions
ull-0.016
aro-0.015
 hurried-0.015
 below-0.015
line-0.015
Tokenot
Token ID313
Feature activation+3.58e-01

Pos logit contributions
 Ow+0.171
 de+0.171
 tant+0.169
 topp+0.168
 candy+0.167
Neg logit contributions
ull-0.029
aro-0.025
 hurried-0.024
line-0.023
 below-0.023
Tokench
Token ID354
Feature activation+1.21e+00

Pos logit contributions
icer+0.010
 Ow+0.010
 topp+0.010
 de+0.009
 favorites+0.009
Neg logit contributions
 ran-0.008
ull-0.007
 hurried-0.007
aro-0.007
 rushed-0.007
Token and
Token ID290
Feature activation+3.45e-01

Pos logit contributions
 Ow+0.037
 de+0.037
 tant+0.037
 topp+0.036
 ABC+0.036
Neg logit contributions
ull-0.021
 hurried-0.020
aro-0.020
lee-0.020
line-0.020
Token had
Token ID550
Feature activation+2.05e-01

Pos logit contributions
 candy+0.010
 pret+0.009
 topp+0.009
 tant+0.009
 de+0.009
Neg logit contributions
ull-0.008
lee-0.007
Mom-0.007
 unf-0.007
aro-0.006
Token lots
Token ID6041
Feature activation+3.27e-01

Pos logit contributions
icer+0.006
 tant+0.006
 Ow+0.005
 de+0.005
 ABC+0.005
Neg logit contributions
 decided-0.005
 hurried-0.005
 soon-0.005
 ran-0.005
 unf-0.004
Token also
Token ID635
Feature activation-4.12e-01

Pos logit contributions
 de+0.029
 candy+0.029
 cand+0.029
 topp+0.029
 money+0.029
Neg logit contributions
ull-0.018
line-0.017
aro-0.017
 below-0.016
 ranger-0.016
Token played
Token ID2826
Feature activation+1.17e-01

Pos logit contributions
ull+0.006
aro+0.006
line+0.005
 below+0.005
 ranger+0.005
Neg logit contributions
 de-0.014
 topp-0.014
 Ow-0.014
 candy-0.014
 tant-0.014
Token tag
Token ID7621
Feature activation-1.09e-01

Pos logit contributions
 Ow+0.004
 de+0.004
 tant+0.004
 topp+0.004
 ABC+0.003
Neg logit contributions
ull-0.002
aro-0.002
 hurried-0.002
line-0.002
 below-0.002
Token and
Token ID290
Feature activation+6.28e-02

Pos logit contributions
 soon+0.002
 below+0.002
 slowly+0.002
ull+0.002
 ran+0.002
Neg logit contributions
 Ow-0.004
icer-0.004
 unnecessary-0.003
 license-0.003
 price-0.003
Token hop
Token ID1725
Feature activation+3.08e-01

Pos logit contributions
 Ow+0.002
 de+0.002
 tant+0.002
 topp+0.002
 ABC+0.002
Neg logit contributions
ull-0.001
aro-0.001
 hurried-0.001
line-0.001
lee-0.001
Tokensc
Token ID1416
Feature activation+4.05e+00

Pos logit contributions
 Ow+0.012
 de+0.012
 topp+0.012
icer+0.012
 ABC+0.012
Neg logit contributions
ull-0.003
aro-0.003
 below-0.003
 hurried-0.003
line-0.003
Tokenot
Token ID313
Feature activation-6.78e-02

Pos logit contributions
rob+0.184
 de+0.183
 topp+0.183
 ABC+0.182
 candy+0.180
Neg logit contributions
ull-0.016
ful-0.013
lee-0.013
ot-0.012
aro-0.009
Tokench
Token ID354
Feature activation+9.87e-01

Pos logit contributions
 ran+0.001
aro+0.001
 rushed+0.001
 hurried+0.001
 soon+0.001
Neg logit contributions
icer-0.002
 Ow-0.002
 topp-0.002
 fault-0.002
 ABC-0.002
Token.
Token ID13
Feature activation-3.14e-01

Pos logit contributions
 Ow+0.031
 de+0.031
 tant+0.030
 topp+0.030
 candy+0.030
Neg logit contributions
ull-0.017
aro-0.017
 hurried-0.016
line-0.016
 below-0.016
Token They
Token ID1119
Feature activation-5.75e-01

Pos logit contributions
ull+0.005
lee+0.005
line+0.005
 hurried+0.004
 below+0.004
Neg logit contributions
 tant-0.010
 de-0.010
 topp-0.010
 candy-0.010
 license-0.010
Token laughed
Token ID13818
Feature activation+5.84e-03

Pos logit contributions
ull+0.010
aro+0.009
line+0.009
 ranger+0.009
 below+0.009
Neg logit contributions
 de-0.018
 tant-0.018
 candy-0.018
 cris-0.017
 topp-0.017
Token.
Token ID13
Feature activation+2.29e-01

Pos logit contributions
 soon+0.000
 below+0.000
ull+0.000
 decided+0.000
 unf+0.000
Neg logit contributions
 Ow-0.001
icer-0.001
 tant-0.001
 de-0.001
 topp-0.001
Token You
Token ID921
Feature activation-6.54e-01

Pos logit contributions
 Ow+0.005
 de+0.005
 topp+0.005
 candy+0.005
 ABC+0.005
Neg logit contributions
ull-0.006
aro-0.006
 hurried-0.006
line-0.006
lee-0.006
Token can
Token ID460
Feature activation+7.09e-02

Pos logit contributions
ull+0.018
 unf+0.017
 decided+0.017
 hurried+0.017
 could+0.017
Neg logit contributions
 tant-0.014
 de-0.014
icer-0.014
 Ow-0.014
 topp-0.013
Token play
Token ID711
Feature activation-2.73e-02

Pos logit contributions
 Ow+0.002
 de+0.002
 ABC+0.002
 candy+0.002
 license+0.002
Neg logit contributions
ull-0.001
aro-0.001
 hurried-0.001
line-0.001
lee-0.001
Token hop
Token ID1725
Feature activation+1.93e-01

Pos logit contributions
ull+0.000
aro+0.000
 hurried+0.000
line+0.000
 below+0.000
Neg logit contributions
 Ow-0.001
 de-0.001
 tant-0.001
 topp-0.001
 ABC-0.001
Tokensc
Token ID1416
Feature activation+4.00e+00

Pos logit contributions
 topp+0.006
 Ow+0.006
 de+0.006
 ABC+0.006
 fault+0.006
Neg logit contributions
 soon-0.004
 below-0.003
ull-0.003
 slowly-0.003
 decided-0.003
Tokenot
Token ID313
Feature activation-1.85e-01

Pos logit contributions
 de+0.173
 topp+0.171
 Ow+0.171
 ABC+0.170
 candy+0.169
Neg logit contributions
ull-0.020
ful-0.015
aro-0.015
Mom-0.014
 unf-0.014
Tokench
Token ID354
Feature activation+8.14e-01

Pos logit contributions
ull+0.004
 soon+0.004
aro+0.004
ful+0.004
 ran+0.004
Neg logit contributions
icer-0.005
 Ow-0.005
 de-0.005
 ABC-0.005
 topp-0.005
Token again
Token ID757
Feature activation+7.69e-03

Pos logit contributions
 Ow+0.026
 de+0.026
 tant+0.026
 topp+0.026
 candy+0.026
Neg logit contributions
ull-0.013
aro-0.013
 hurried-0.012
line-0.012
 below-0.012
Token.
Token ID13
Feature activation+1.26e-01

Pos logit contributions
 Ow+0.000
 de+0.000
 tant+0.000
 topp+0.000
icer+0.000
Neg logit contributions
ull-0.000
 soon-0.000
line-0.000
 below-0.000
aro-0.000
Token But
Token ID887
Feature activation+5.55e-01

Pos logit contributions
 Ow+0.003
 de+0.003
 tant+0.003
 topp+0.003
 candy+0.003
Neg logit contributions
ull-0.003
aro-0.003
 hurried-0.003
line-0.003
 below-0.003
Token would
Token ID561
Feature activation+8.80e-01

Pos logit contributions
 Ow+0.008
 de+0.008
 tant+0.008
 topp+0.008
icer+0.008
Neg logit contributions
ull-0.004
 hurried-0.004
aro-0.003
 decided-0.003
 soon-0.003
Token play
Token ID711
Feature activation+2.27e-02

Pos logit contributions
 Ow+0.028
 de+0.027
 tant+0.027
icer+0.026
 topp+0.026
Neg logit contributions
ull-0.015
 hurried-0.015
 unf-0.015
line-0.015
 soon-0.015
Token tag
Token ID7621
Feature activation-1.15e-01

Pos logit contributions
 Ow+0.001
 fault+0.001
 de+0.001
 savings+0.001
 unnecessary+0.001
Neg logit contributions
ull-0.000
 soon-0.000
 filled-0.000
 ran-0.000
 unf-0.000
Token,
Token ID11
Feature activation+2.23e-01

Pos logit contributions
 soon+0.002
 slowly+0.002
 quietly+0.001
 next+0.001
 filled+0.001
Neg logit contributions
 Ow-0.005
 fault-0.005
 unnecessary-0.005
 savings-0.004
icer-0.004
Token hop
Token ID1725
Feature activation+3.26e-01

Pos logit contributions
 Ow+0.009
icer+0.009
 unnecessary+0.009
 fault+0.009
 savings+0.009
Neg logit contributions
 ran-0.003
 let-0.003
 soon-0.003
 cheer-0.003
 decided-0.003
Tokensc
Token ID1416
Feature activation+3.98e+00

Pos logit contributions
 fault+0.011
 Ow+0.011
 topp+0.011
icer+0.011
 shave+0.011
Neg logit contributions
 slowly-0.006
 ran-0.006
 soon-0.006
 quickly-0.005
 could-0.005
Tokenot
Token ID313
Feature activation-2.62e-01

Pos logit contributions
 de+0.177
 topp+0.176
 Ow+0.174
 ABC+0.173
 candy+0.172
Neg logit contributions
ull-0.018
lee-0.014
 unf-0.013
 hurried-0.012
aro-0.012
Tokench
Token ID354
Feature activation+1.13e+00

Pos logit contributions
 ran+0.005
 rushed+0.005
 hurried+0.005
 soon+0.005
 went+0.004
Neg logit contributions
icer-0.008
 fault-0.008
 Ow-0.008
 savings-0.008
 topp-0.008
Token,
Token ID11
Feature activation-1.88e-02

Pos logit contributions
 price+0.035
 topp+0.035
 savings+0.035
 Ow+0.035
 fault+0.035
Neg logit contributions
aro-0.019
ull-0.019
 ran-0.018
ful-0.018
 hurried-0.018
Token and
Token ID290
Feature activation+8.06e-01

Pos logit contributions
 ran+0.000
line+0.000
 soon+0.000
 slowly+0.000
 hurried+0.000
Neg logit contributions
 Ow-0.001
icer-0.001
 unnecessary-0.001
 fault-0.001
 savings-0.001
Token many
Token ID867
Feature activation-4.46e-01

Pos logit contributions
 Ow+0.028
icer+0.028
 de+0.028
 tant+0.026
 fault+0.026
Neg logit contributions
 soon-0.012
 ran-0.012
 hurried-0.012
aro-0.012
ull-0.012
Token hide
Token ID7808
Feature activation-1.25e-01

Pos logit contributions
 Ow+0.019
icer+0.019
 tant+0.018
 de+0.018
 price+0.017
Neg logit contributions
 soon-0.011
 unf-0.010
 hurried-0.010
 ran-0.010
 decided-0.010
Token and
Token ID290
Feature activation-2.95e-01

Pos logit contributions
ull+0.003
aro+0.003
 below+0.003
 soon+0.003
 hurried+0.003
Neg logit contributions
 Ow-0.003
 topp-0.003
 ABC-0.003
 de-0.003
 tant-0.003
Token seek
Token ID5380
Feature activation-3.36e-01

Pos logit contributions
ull+0.003
aro+0.003
 hurried+0.003
line+0.003
 unf+0.003
Neg logit contributions
 Ow-0.011
 de-0.011
 tant-0.011
icer-0.011
 topp-0.011
Token,
Token ID11
Feature activation+3.45e-01

Pos logit contributions
ull+0.005
aro+0.005
 below+0.005
 hurried+0.005
 soon+0.005
Neg logit contributions
 Ow-0.011
 tant-0.011
 de-0.011
 topp-0.011
 ABC-0.011
Token hop
Token ID1725
Feature activation+5.60e-02

Pos logit contributions
 Ow+0.011
 tant+0.011
icer+0.011
 topp+0.010
 ABC+0.010
Neg logit contributions
 soon-0.007
 hurried-0.007
 unf-0.007
 below-0.006
 ran-0.006
Tokensc
Token ID1416
Feature activation+3.84e+00

Pos logit contributions
 Ow+0.002
 topp+0.002
 ABC+0.002
icer+0.002
 tant+0.002
Neg logit contributions
 below-0.001
 soon-0.001
aro-0.001
 hurried-0.001
ull-0.001
Tokenot
Token ID313
Feature activation-6.09e-01

Pos logit contributions
 de+0.162
 Ow+0.162
 tant+0.160
 topp+0.160
 candy+0.159
Neg logit contributions
ull-0.025
aro-0.022
 hurried-0.021
line-0.020
lee-0.020
Tokench
Token ID354
Feature activation+8.50e-01

Pos logit contributions
 hurried+0.013
 ran+0.013
aro+0.013
 rushed+0.013
ull+0.013
Neg logit contributions
icer-0.017
 Ow-0.017
 topp-0.016
 ABC-0.016
 tant-0.016
Token and
Token ID290
Feature activation-2.95e-01

Pos logit contributions
 Ow+0.027
 topp+0.026
 de+0.026
 tant+0.026
 candy+0.026
Neg logit contributions
ull-0.015
aro-0.015
 hurried-0.014
line-0.014
 unf-0.014
Token much
Token ID881
Feature activation-5.14e-01

Pos logit contributions
ull+0.005
aro+0.005
 hurried+0.005
line+0.005
 below+0.005
Neg logit contributions
 Ow-0.009
 de-0.009
 tant-0.009
 topp-0.009
 candy-0.009
Token more
Token ID517
Feature activation-9.90e-01

Pos logit contributions
ull+0.010
 hurried+0.010
aro+0.010
 unf+0.010
 below+0.010
Neg logit contributions
 Ow-0.015
 de-0.014
icer-0.014
 tant-0.014
 topp-0.014
Token When
Token ID1649
Feature activation-1.20e+00

Pos logit contributions
ull+0.009
aro+0.009
 hurried+0.009
line+0.008
 below+0.008
Neg logit contributions
 tant-0.012
 Ow-0.012
 topp-0.012
 de-0.012
icer-0.012
Token Daisy
Token ID40355
Feature activation+6.23e-01

Pos logit contributions
ull+0.015
aro+0.015
 hurried+0.015
line+0.014
 soon+0.014
Neg logit contributions
 Ow-0.043
 de-0.043
 tant-0.043
 topp-0.042
icer-0.042
Token put
Token ID1234
Feature activation+8.16e-01

Pos logit contributions
 tant+0.024
 Ow+0.024
 unnecessary+0.024
 topp+0.024
 fault+0.024
Neg logit contributions
 ran-0.008
 hurried-0.008
 decided-0.008
 could-0.007
 soon-0.007
Token the
Token ID262
Feature activation-7.06e-01

Pos logit contributions
 ABC+0.030
 Ow+0.030
 topp+0.029
 fault+0.029
pill+0.029
Neg logit contributions
 below-0.011
 slowly-0.010
 soon-0.010
aro-0.010
 Mom-0.010
Token magn
Token ID7842
Feature activation-3.72e-01

Pos logit contributions
 next+0.014
 nearby+0.014
 plant+0.014
 below+0.014
 mo+0.013
Neg logit contributions
 Ow-0.021
pill-0.020
rob-0.020
 unnecessary-0.020
ior-0.019
Tokenifying
Token ID4035
Feature activation+3.83e+00

Pos logit contributions
 below+0.011
ull+0.011
ful+0.010
line+0.010
aro+0.010
Neg logit contributions
 de-0.008
 ABC-0.008
 candy-0.007
 topp-0.007
 Ow-0.007
Token glass
Token ID5405
Feature activation+1.73e-01

Pos logit contributions
 unnecessary+0.123
 ABC+0.114
 options+0.106
 savings+0.105
 de+0.104
Neg logit contributions
 filled-0.084
ull-0.080
 Mom-0.080
 below-0.079
Mom-0.076
Token down
Token ID866
Feature activation+3.87e-01

Pos logit contributions
 Ow+0.006
 de+0.006
 ABC+0.006
 topp+0.005
 tant+0.005
Neg logit contributions
ull-0.003
line-0.003
aro-0.003
 below-0.003
 hurried-0.003
Token,
Token ID11
Feature activation-3.66e-01

Pos logit contributions
 Ow+0.013
 topp+0.013
 ABC+0.013
 license+0.013
 tant+0.013
Neg logit contributions
ull-0.005
 below-0.005
aro-0.005
 soon-0.005
 hurried-0.005
Token the
Token ID262
Feature activation+5.89e-01

Pos logit contributions
ull+0.004
aro+0.004
 hurried+0.004
 below+0.004
line+0.004
Neg logit contributions
 Ow-0.014
 topp-0.013
 de-0.013
 tant-0.013
icer-0.013
Token cater
Token ID20825
Feature activation+4.60e-01

Pos logit contributions
 Ow+0.017
icer+0.017
 fault+0.017
 tant+0.016
 unnecessary+0.016
Neg logit contributions
 ranger-0.012
 below-0.011
 keeper-0.011
Mom-0.011
 unf-0.011
Token.
Token ID13
Feature activation-2.70e-01

Pos logit contributions
ull+0.008
aro+0.007
line+0.007
 below+0.007
 hurried+0.007
Neg logit contributions
 Ow-0.017
icer-0.016
 de-0.016
 tant-0.016
 topp-0.016
Token Barry
Token ID14488
Feature activation+4.40e-01

Pos logit contributions
ull+0.006
line+0.006
aro+0.006
 hurried+0.006
 unf+0.006
Neg logit contributions
 Ow-0.007
 de-0.007
 tant-0.007
 topp-0.007
 ABC-0.007
Token enjoyed
Token ID8359
Feature activation+8.93e-01

Pos logit contributions
 Ow+0.016
icer+0.015
 tant+0.015
 unnecessary+0.015
 de+0.015
Neg logit contributions
 decided-0.007
 hurried-0.007
ull-0.007
 soon-0.007
 ran-0.006
Token playing
Token ID2712
Feature activation+1.02e+00

Pos logit contributions
 Ow+0.027
 tant+0.026
icer+0.026
 de+0.026
 topp+0.025
Neg logit contributions
ull-0.018
aro-0.017
 hurried-0.017
 soon-0.017
line-0.017
Token hop
Token ID1725
Feature activation+2.90e-01

Pos logit contributions
 Ow+0.032
 de+0.031
icer+0.031
 fault+0.031
 ABC+0.031
Neg logit contributions
ull-0.019
 below-0.018
aro-0.018
 hurried-0.017
 unf-0.017
Tokensc
Token ID1416
Feature activation+3.79e+00

Pos logit contributions
 Ow+0.009
 fault+0.009
icer+0.009
 topp+0.009
 ABC+0.009
Neg logit contributions
 below-0.005
 soon-0.005
 slowly-0.005
ull-0.005
 ranger-0.005
Tokenot
Token ID313
Feature activation+9.64e-02

Pos logit contributions
 de+0.177
 topp+0.176
rob+0.175
 Ow+0.174
 candy+0.173
Neg logit contributions
ull-0.012
ot-0.010
ful-0.009
lee-0.006
orse-0.004
Tokench
Token ID354
Feature activation+7.74e-01

Pos logit contributions
icer+0.003
 Ow+0.003
 fault+0.003
 topp+0.003
 savings+0.003
Neg logit contributions
 ran-0.002
aro-0.002
 soon-0.002
ful-0.002
 rushed-0.002
Token with
Token ID351
Feature activation-1.16e+00

Pos logit contributions
 Ow+0.024
 price+0.024
 de+0.024
 topp+0.024
 license+0.024
Neg logit contributions
ull-0.014
aro-0.014
line-0.013
ful-0.012
orse-0.012
Token them
Token ID606
Feature activation-1.01e+00

Pos logit contributions
ull+0.022
aro+0.021
 below+0.021
 unf+0.021
 hurried+0.021
Neg logit contributions
 Ow-0.035
 ABC-0.035
 de-0.034
icer-0.034
 license-0.034
Token so
Token ID523
Feature activation-3.67e-01

Pos logit contributions
ull+0.013
 below+0.013
aro+0.013
 hurried+0.013
 soon+0.012
Neg logit contributions
 Ow-0.036
 de-0.036
icer-0.035
 tant-0.035
 topp-0.035
Token.
Token ID13
Feature activation-3.95e-01

Pos logit contributions
 de+0.018
 tant+0.017
 Ow+0.017
 topp+0.017
 candy+0.017
Neg logit contributions
aro-0.006
ull-0.006
 hurried-0.006
lee-0.006
 unf-0.006
Token
Token ID220
Feature activation+4.98e-01

Pos logit contributions
 Mom+0.006
Mom+0.006
 ranger+0.006
 asked+0.006
line+0.006
Neg logit contributions
 unnecessary-0.014
 quality-0.014
 fault-0.014
 topp-0.014
 ABC-0.013
Token
Token ID198
Feature activation+3.17e-01

Pos logit contributions
 Ow+0.015
 de+0.015
 tant+0.014
 topp+0.014
 candy+0.014
Neg logit contributions
ull-0.009
aro-0.009
 hurried-0.009
line-0.009
 below-0.009
Token
Token ID198
Feature activation+7.26e-02

Pos logit contributions
 de+0.011
 fault+0.011
 topp+0.011
 candy+0.011
 tant+0.011
Neg logit contributions
Mom-0.005
ull-0.005
aro-0.004
 soon-0.004
 hurried-0.004
TokenHop
Token ID23483
Feature activation+1.53e-01

Pos logit contributions
 de+0.002
 Ow+0.002
 fault+0.002
 ABC+0.002
 topp+0.002
Neg logit contributions
Mom-0.001
ull-0.001
 hurried-0.001
aro-0.001
 soon-0.001
Tokensc
Token ID1416
Feature activation+3.79e+00

Pos logit contributions
 topp+0.004
 de+0.004
 tant+0.004
 Ow+0.004
icer+0.004
Neg logit contributions
line-0.003
ull-0.003
lee-0.003
aro-0.003
 unf-0.003
Tokenot
Token ID313
Feature activation+4.02e-01

Pos logit contributions
 de+0.111
 Ow+0.111
 tant+0.109
 topp+0.109
 ABC+0.108
Neg logit contributions
ull-0.073
aro-0.070
 hurried-0.069
line-0.068
 unf-0.067
Tokench
Token ID354
Feature activation+6.99e-01

Pos logit contributions
 Ow+0.012
 fault+0.012
icer+0.012
 unnecessary+0.012
 tant+0.012
Neg logit contributions
 hurried-0.008
ull-0.008
 rushed-0.008
aro-0.008
 soon-0.008
Token hopped
Token ID45230
Feature activation+2.50e-01

Pos logit contributions
 Ow+0.023
 de+0.023
 tant+0.023
 topp+0.023
 candy+0.023
Neg logit contributions
ull-0.011
aro-0.010
 hurried-0.010
line-0.010
 below-0.010
Token to
Token ID284
Feature activation+5.07e-01

Pos logit contributions
 topp+0.008
 Ow+0.008
 ABC+0.008
 de+0.008
 tant+0.008
Neg logit contributions
ull-0.004
 below-0.004
 soon-0.004
aro-0.004
line-0.004
Token the
Token ID262
Feature activation+1.24e-01

Pos logit contributions
 de+0.017
 tant+0.017
 topp+0.017
 ABC+0.017
 Ow+0.017
Neg logit contributions
ull-0.007
 soon-0.007
aro-0.007
 unf-0.007
 below-0.007
Token group
Token ID1448
Feature activation-5.83e-01

Pos logit contributions
 ranger+0.001
 keeper+0.001
ull+0.001
 unf+0.001
 filled+0.001
Neg logit contributions
 Ow-0.002
icer-0.002
 de-0.002
pill-0.002
rob-0.002
Token of
Token ID286
Feature activation+8.05e-01

Pos logit contributions
ull+0.011
Mom+0.011
aro+0.011
lee+0.011
 hurried+0.010
Neg logit contributions
 candy-0.017
 de-0.017
 money-0.016
 license-0.016
 tant-0.016
Token girls
Token ID4813
Feature activation+2.82e-01

Pos logit contributions
icer+0.028
 Ow+0.028
 fault+0.027
aman+0.026
 de+0.026
Neg logit contributions
 brave-0.013
 ran-0.013
 cheered-0.013
 cheer-0.013
 decided-0.012
Token playing
Token ID2712
Feature activation+8.28e-01

Pos logit contributions
 de+0.009
 candy+0.009
 license+0.009
 money+0.009
 tant+0.009
Neg logit contributions
ull-0.005
Mom-0.004
aro-0.004
 hurried-0.004
lee-0.004
Token hop
Token ID1725
Feature activation+2.13e-01

Pos logit contributions
 Ow+0.030
icer+0.029
 de+0.028
 ABC+0.028
 fault+0.028
Neg logit contributions
 below-0.011
ull-0.011
 soon-0.011
 hurried-0.011
aro-0.011
Tokensc
Token ID1416
Feature activation+3.78e+00

Pos logit contributions
 Ow+0.005
 de+0.005
 topp+0.004
icer+0.004
 ABC+0.004
Neg logit contributions
ull-0.006
 below-0.006
aro-0.006
 hurried-0.006
 unf-0.006
Tokenot
Token ID313
Feature activation-3.39e-01

Pos logit contributions
 de+0.149
 Ow+0.148
 topp+0.147
 candy+0.146
 tant+0.146
Neg logit contributions
ull-0.035
aro-0.031
lee-0.030
 hurried-0.030
 unf-0.029
Tokench
Token ID354
Feature activation+1.06e+00

Pos logit contributions
 ran+0.007
ull+0.007
aro+0.007
 hurried+0.007
 rushed+0.007
Neg logit contributions
icer-0.010
 Ow-0.009
 de-0.009
 ABC-0.009
 topp-0.009
Token.
Token ID13
Feature activation-3.60e-01

Pos logit contributions
 Ow+0.038
 de+0.038
 topp+0.037
 tant+0.037
 candy+0.037
Neg logit contributions
ull-0.014
aro-0.013
 hurried-0.012
line-0.012
 below-0.012
Token They
Token ID1119
Feature activation-3.89e-01

Pos logit contributions
 hurried+0.007
lee+0.007
line+0.007
ull+0.007
 below+0.007
Neg logit contributions
 Ow-0.010
 topp-0.010
 tant-0.010
icer-0.010
 de-0.010
Token joined
Token ID5399
Feature activation+5.12e-01

Pos logit contributions
ull+0.007
aro+0.007
Mom+0.007
line+0.007
 below+0.006
Neg logit contributions
 candy-0.012
 tant-0.011
 de-0.011
 topp-0.011
 cris-0.011
Token play
Token ID711
Feature activation-1.26e-01

Pos logit contributions
ull+0.007
aro+0.007
line+0.006
 below+0.006
lee+0.006
Neg logit contributions
 Ow-0.009
 candy-0.009
 ABC-0.009
 price-0.009
 de-0.009
Token a
Token ID257
Feature activation+5.47e-01

Pos logit contributions
ull+0.002
aro+0.002
lee+0.002
line+0.002
 hurried+0.002
Neg logit contributions
 Ow-0.004
 de-0.004
 candy-0.004
 tant-0.004
 license-0.004
Token game
Token ID983
Feature activation+1.07e-01

Pos logit contributions
 tant+0.012
 candy+0.012
 license+0.011
 de+0.011
 Ow+0.011
Neg logit contributions
ull-0.015
lee-0.015
 hurried-0.015
aro-0.015
 below-0.014
Token of
Token ID286
Feature activation+8.73e-01

Pos logit contributions
 de+0.003
 candy+0.003
 license+0.003
 quality+0.003
 cris+0.003
Neg logit contributions
lee-0.002
ull-0.002
aro-0.002
 hurried-0.002
line-0.002
Token hop
Token ID1725
Feature activation+2.59e-01

Pos logit contributions
 tant+0.029
icer+0.028
 de+0.028
 Ow+0.028
 savings+0.027
Neg logit contributions
 ran-0.015
aro-0.015
 unf-0.015
 below-0.014
 hurried-0.014
Tokensc
Token ID1416
Feature activation+3.76e+00

Pos logit contributions
 Ow+0.009
 topp+0.009
 ABC+0.009
 de+0.009
icer+0.008
Neg logit contributions
 below-0.004
ull-0.004
aro-0.004
 soon-0.004
 unf-0.004
Tokenot
Token ID313
Feature activation-1.06e-01

Pos logit contributions
 Ow+0.155
 de+0.154
 tant+0.153
 license+0.151
 candy+0.151
Neg logit contributions
ull-0.028
aro-0.025
 below-0.025
 hurried-0.024
line-0.024
Tokench
Token ID354
Feature activation+8.21e-01

Pos logit contributions
ull+0.002
aro+0.002
 hurried+0.002
 ran+0.002
 rushed+0.002
Neg logit contributions
icer-0.003
 Ow-0.003
 topp-0.003
 de-0.003
 ABC-0.003
Token,
Token ID11
Feature activation-3.18e-01

Pos logit contributions
 Ow+0.026
 tant+0.025
 de+0.025
 ABC+0.025
 topp+0.024
Neg logit contributions
ull-0.014
 hurried-0.013
lee-0.013
 below-0.013
line-0.013
Token but
Token ID475
Feature activation+8.47e-01

Pos logit contributions
line+0.006
 ran+0.006
 soon+0.006
 decided+0.006
 hurried+0.006
Neg logit contributions
icer-0.010
 Ow-0.010
 tant-0.010
 topp-0.010
 ABC-0.010
Token it
Token ID340
Feature activation+2.60e-01

Pos logit contributions
 Ow+0.029
 de+0.029
 tant+0.029
 topp+0.029
 ABC+0.028
Neg logit contributions
ull-0.012
aro-0.012
 hurried-0.011
line-0.011
 below-0.011
Token know
Token ID760
Feature activation-1.37e-02

Pos logit contributions
 Ow+0.017
 de+0.017
icer+0.017
 tant+0.017
 ABC+0.017
Neg logit contributions
ull-0.015
 soon-0.015
 decided-0.015
 hurried-0.015
 unf-0.015
Token,
Token ID11
Feature activation+5.44e-01

Pos logit contributions
ull+0.000
aro+0.000
 hurried+0.000
line+0.000
 below+0.000
Neg logit contributions
 Ow-0.000
 de-0.000
 tant-0.000
 topp-0.000
 candy-0.000
Token right
Token ID826
Feature activation-4.03e-01

Pos logit contributions
 tant+0.018
 quality+0.018
 Ow+0.018
 unnecessary+0.017
icer+0.017
Neg logit contributions
line-0.009
ull-0.008
 unf-0.008
 Mom-0.008
aro-0.008
Token?
Token ID30
Feature activation-6.49e-01

Pos logit contributions
aro+0.010
line+0.009
 hurried+0.009
orse+0.009
ull+0.009
Neg logit contributions
 price-0.011
 candy-0.010
 de-0.010
 license-0.010
 chocolate-0.010
Token Micro
Token ID4527
Feature activation-1.02e+00

Pos logit contributions
ull+0.014
aro+0.014
line+0.014
 below+0.014
 hurried+0.013
Neg logit contributions
 tant-0.018
 Ow-0.017
 de-0.017
 topp-0.017
icer-0.017
Tokensc
Token ID1416
Feature activation+3.71e+00

Pos logit contributions
ull+0.024
 below+0.024
 unf+0.024
aro+0.024
 hurried+0.023
Neg logit contributions
 tant-0.026
 Ow-0.025
icer-0.024
 topp-0.024
 ABC-0.024
Tokenopes
Token ID13920
Feature activation+4.99e-01

Pos logit contributions
 Ow+0.148
 de+0.146
 tant+0.146
 topp+0.143
 license+0.143
Neg logit contributions
ull-0.034
aro-0.031
line-0.030
 hurried-0.030
 below-0.030
Token are
Token ID389
Feature activation-4.15e-01

Pos logit contributions
 tant+0.016
 Ow+0.016
 de+0.016
icer+0.016
 topp+0.016
Neg logit contributions
 below-0.008
 soon-0.008
 unf-0.008
line-0.008
ful-0.008
Token amazing
Token ID4998
Feature activation+2.12e-01

Pos logit contributions
 soon+0.008
 unf+0.008
ull+0.007
 below+0.007
 ran+0.007
Neg logit contributions
 Ow-0.013
 tant-0.013
icer-0.012
 ABC-0.012
 de-0.012
Token.
Token ID13
Feature activation-1.84e-02

Pos logit contributions
 Ow+0.007
 de+0.007
 tant+0.007
 topp+0.007
icer+0.007
Neg logit contributions
ull-0.003
aro-0.003
 hurried-0.003
 below-0.003
 unf-0.003
Token They
Token ID1119
Feature activation-1.29e+00

Pos logit contributions
ull+0.000
aro+0.000
 hurried+0.000
line+0.000
lee+0.000
Neg logit contributions
 Ow-0.000
 de-0.000
 topp-0.000
 tant-0.000
 candy-0.000
Token.
Token ID13
Feature activation-2.63e-01

Pos logit contributions
 Ow+0.018
 tant+0.017
icer+0.017
 ABC+0.017
 license+0.016
Neg logit contributions
ull-0.009
 below-0.009
line-0.009
 unf-0.008
 hurried-0.008
Token
Token ID220
Feature activation+2.34e-01

Pos logit contributions
aro+0.004
 below+0.004
 Mom+0.004
Mom+0.004
 could+0.004
Neg logit contributions
 tant-0.009
 topp-0.008
 price-0.008
 ABC-0.008
 fault-0.008
Token
Token ID198
Feature activation+1.51e-01

Pos logit contributions
 tant+0.009
 price+0.009
 topp+0.009
 candy+0.009
 quality+0.009
Neg logit contributions
ull-0.003
 below-0.003
ores-0.003
aro-0.003
 could-0.003
Token
Token ID198
Feature activation+2.22e-02

Pos logit contributions
 tant+0.005
 topp+0.005
 de+0.005
 price+0.005
 fault+0.005
Neg logit contributions
ull-0.002
Mom-0.002
aro-0.002
 hurried-0.002
 below-0.002
TokenHop
Token ID23483
Feature activation+3.80e-01

Pos logit contributions
 de+0.001
 Ow+0.001
 tant+0.001
 topp+0.001
 ABC+0.001
Neg logit contributions
ull-0.000
aro-0.000
Mom-0.000
 hurried-0.000
 unf-0.000
Tokensc
Token ID1416
Feature activation+3.65e+00

Pos logit contributions
 Ow+0.013
 de+0.013
 tant+0.013
 topp+0.013
 ABC+0.013
Neg logit contributions
ull-0.006
aro-0.005
 hurried-0.005
line-0.005
 below-0.005
Tokenot
Token ID313
Feature activation+1.42e-01

Pos logit contributions
 Ow+0.120
 de+0.120
 tant+0.119
 topp+0.118
 candy+0.117
Neg logit contributions
ull-0.057
aro-0.054
 hurried-0.053
line-0.052
 below-0.052
Tokench
Token ID354
Feature activation+4.87e-01

Pos logit contributions
 Ow+0.004
 fault+0.004
 license+0.004
icer+0.004
 topp+0.004
Neg logit contributions
ull-0.003
 hurried-0.003
aro-0.003
 rushed-0.003
 soon-0.003
Token thanked
Token ID26280
Feature activation+5.83e-01

Pos logit contributions
 license+0.017
 price+0.017
 candy+0.017
 Ow+0.017
 topp+0.017
Neg logit contributions
ull-0.007
aro-0.007
ful-0.007
 hurried-0.006
 rushed-0.006
Token the
Token ID262
Feature activation-7.17e-02

Pos logit contributions
 topp+0.020
 Ow+0.020
icer+0.020
 fault+0.020
 tant+0.019
Neg logit contributions
ull-0.009
aro-0.009
 soon-0.009
 slowly-0.009
 unf-0.008
Token magical
Token ID10883
Feature activation+3.15e-01

Pos logit contributions
line+0.001
ull+0.001
 hurried+0.001
lee+0.001
aro+0.001
Neg logit contributions
 candy-0.002
 de-0.002
 Ow-0.002
 price-0.002
 topp-0.002
Token flashlight
Token ID38371
Feature activation-2.25e-01

Pos logit contributions
 Ow+0.028
icer+0.027
rob+0.025
 unnecessary+0.025
pill+0.024
Neg logit contributions
 ranger-0.019
 mo-0.019
 keeper-0.018
 ran-0.018
 filled-0.018
Token,
Token ID11
Feature activation+7.79e-01

Pos logit contributions
ull+0.003
 below+0.003
 hurried+0.003
aro+0.003
line+0.003
Neg logit contributions
 Ow-0.008
 tant-0.008
 de-0.007
icer-0.007
 topp-0.007
Token and
Token ID290
Feature activation+5.18e-01

Pos logit contributions
icer+0.030
 Ow+0.029
 tant+0.029
 unnecessary+0.027
rob+0.027
Neg logit contributions
 decided-0.015
 soon-0.015
 filled-0.015
 slowly-0.014
 ran-0.014
Token a
Token ID257
Feature activation+1.22e+00

Pos logit contributions
icer+0.016
 Ow+0.016
 tant+0.015
 fault+0.015
 ABC+0.015
Neg logit contributions
 decided-0.011
 soon-0.011
 ran-0.011
 hurried-0.011
 could-0.011
Token magn
Token ID7842
Feature activation-6.86e-01

Pos logit contributions
 Ow+0.036
icer+0.035
rob+0.032
pill+0.032
 topp+0.032
Neg logit contributions
 ranger-0.025
 mo-0.025
 unf-0.024
 keeper-0.024
ull-0.024
Tokenifying
Token ID4035
Feature activation+3.62e+00

Pos logit contributions
 below+0.021
ull+0.021
line+0.020
aro+0.020
ful+0.020
Neg logit contributions
 candy-0.013
 ABC-0.013
 de-0.013
 topp-0.013
 Ow-0.012
Token glass
Token ID5405
Feature activation-3.96e-01

Pos logit contributions
 unnecessary+0.109
 ABC+0.102
 de+0.095
 options+0.095
 savings+0.094
Neg logit contributions
 filled-0.083
ull-0.079
 unf-0.077
 below-0.076
 Mom-0.075
Token.
Token ID13
Feature activation-3.44e-01

Pos logit contributions
ull+0.005
aro+0.005
 hurried+0.005
line+0.005
lee+0.005
Neg logit contributions
 de-0.014
 Ow-0.014
 tant-0.014
 topp-0.014
 candy-0.014
Token
Token ID198
Feature activation+2.52e-01

Pos logit contributions
ull+0.006
Mom+0.006
line+0.006
 hurried+0.006
aro+0.006
Neg logit contributions
 topp-0.011
 tant-0.011
icer-0.010
 Ow-0.010
 license-0.010
Token
Token ID198
Feature activation-1.80e-01

Pos logit contributions
 Ow+0.007
 de+0.007
 tant+0.007
 ABC+0.007
icer+0.007
Neg logit contributions
ull-0.005
aro-0.005
 hurried-0.005
line-0.005
 unf-0.005
TokenAs
Token ID1722
Feature activation-7.62e-01

Pos logit contributions
ull+0.003
lee+0.003
aro+0.003
 below+0.003
line+0.003
Neg logit contributions
 Ow-0.006
 de-0.006
 tant-0.006
 license-0.006
 cris-0.006
Token bunny
Token ID44915
Feature activation-2.70e-01

Pos logit contributions
 Ow+0.003
 de+0.003
 topp+0.003
 candy+0.003
 tant+0.003
Neg logit contributions
aro-0.002
 hurried-0.002
ull-0.002
line-0.002
 below-0.002
Token?"
Token ID1701
Feature activation-8.40e-01

Pos logit contributions
ull+0.005
aro+0.005
 hurried+0.005
line+0.005
lee+0.005
Neg logit contributions
 de-0.008
 Ow-0.007
 tant-0.007
 topp-0.007
 candy-0.007
Token
Token ID198
Feature activation-4.14e-01

Pos logit contributions
aro+0.013
 asked+0.012
 below+0.012
 hurried+0.012
 Mom+0.012
Neg logit contributions
 topp-0.030
 fault-0.030
 quality-0.029
 tant-0.029
 price-0.029
Token
Token ID198
Feature activation-5.32e-01

Pos logit contributions
ull+0.008
line+0.008
 hurried+0.008
aro+0.008
lee+0.008
Neg logit contributions
 Ow-0.012
 de-0.012
 tant-0.012
 license-0.011
 ABC-0.011
TokenHop
Token ID23483
Feature activation+6.22e-02

Pos logit contributions
ull+0.012
aro+0.011
 hurried+0.011
line+0.011
 below+0.011
Neg logit contributions
 Ow-0.014
 de-0.014
 tant-0.014
 topp-0.014
 candy-0.014
Tokensc
Token ID1416
Feature activation+3.55e+00

Pos logit contributions
 de+0.002
 Ow+0.002
 topp+0.002
 tant+0.002
 cris+0.002
Neg logit contributions
line-0.001
ull-0.001
lee-0.001
aro-0.001
Mom-0.001
Tokenot
Token ID313
Feature activation+1.86e-01

Pos logit contributions
 Ow+0.112
 tant+0.108
 de+0.107
 license+0.104
 candy+0.104
Neg logit contributions
ull-0.062
line-0.062
 below-0.060
 hurried-0.059
aro-0.058
Tokench
Token ID354
Feature activation+4.38e-01

Pos logit contributions
 Ow+0.006
 tant+0.006
icer+0.006
 de+0.006
 topp+0.006
Neg logit contributions
ull-0.003
 hurried-0.003
aro-0.003
lee-0.003
 rushed-0.003
Token replied
Token ID8712
Feature activation+8.16e-01

Pos logit contributions
 Ow+0.014
 de+0.014
 tant+0.014
 topp+0.014
 candy+0.014
Neg logit contributions
ull-0.007
aro-0.007
 hurried-0.007
line-0.007
 below-0.007
Token,
Token ID11
Feature activation+4.43e-01

Pos logit contributions
 de+0.021
 candy+0.021
 Ow+0.021
 tant+0.021
 ABC+0.020
Neg logit contributions
 hurried-0.017
ull-0.017
line-0.017
Mom-0.017
aro-0.017
Token "
Token ID366
Feature activation+1.08e-01

Pos logit contributions
 de+0.013
 Ow+0.013
 candy+0.013
 ABC+0.013
 topp+0.012
Neg logit contributions
ull-0.008
aro-0.008
 hurried-0.008
lee-0.008
line-0.008
Token knew
Token ID2993
Feature activation-1.36e-01

Pos logit contributions
 de+0.023
 Ow+0.022
 candy+0.022
 ABC+0.022
 topp+0.022
Neg logit contributions
ull-0.008
line-0.008
aro-0.008
lee-0.007
 below-0.007
Token how
Token ID703
Feature activation+1.78e-01

Pos logit contributions
aro+0.002
 soon+0.002
ull+0.002
 below+0.002
ful+0.002
Neg logit contributions
icer-0.005
 tant-0.005
 Ow-0.005
 de-0.004
 topp-0.004
Token to
Token ID284
Feature activation-1.44e+00

Pos logit contributions
 candy+0.005
 de+0.005
 topp+0.005
 Ow+0.005
 money+0.005
Neg logit contributions
ull-0.004
line-0.003
aro-0.003
 hurried-0.003
Mom-0.003
Token play
Token ID711
Feature activation-6.62e-01

Pos logit contributions
ull+0.026
aro+0.025
 hurried+0.024
line+0.024
 below+0.024
Neg logit contributions
 de-0.045
 Ow-0.045
 tant-0.044
 topp-0.044
icer-0.043
Token hop
Token ID1725
Feature activation+6.45e-02

Pos logit contributions
ull+0.011
aro+0.010
 hurried+0.010
lee+0.010
line+0.010
Neg logit contributions
 candy-0.021
 de-0.021
 tant-0.021
 Ow-0.021
 license-0.021
Tokensc
Token ID1416
Feature activation+3.54e+00

Pos logit contributions
 Ow+0.003
 topp+0.003
 ABC+0.003
icer+0.003
 de+0.002
Neg logit contributions
 below-0.001
 soon-0.001
ull-0.001
 decided-0.001
 slowly-0.001
Tokenot
Token ID313
Feature activation-7.13e-03

Pos logit contributions
 de+0.150
 topp+0.150
 Ow+0.149
 ABC+0.146
 candy+0.146
Neg logit contributions
ull-0.023
aro-0.019
 unf-0.018
ful-0.018
lee-0.018
Tokench
Token ID354
Feature activation+6.68e-01

Pos logit contributions
aro+0.000
 ran+0.000
ful+0.000
 soon+0.000
ull+0.000
Neg logit contributions
icer-0.000
 Ow-0.000
 topp-0.000
 de-0.000
 ABC-0.000
Token.
Token ID13
Feature activation-3.35e-01

Pos logit contributions
 Ow+0.025
 de+0.024
 tant+0.024
 topp+0.024
 ABC+0.024
Neg logit contributions
ull-0.008
 hurried-0.007
aro-0.007
lee-0.007
line-0.007
Token
Token ID198
Feature activation+6.01e-01

Pos logit contributions
ull+0.006
 hurried+0.006
lee+0.006
 ranger+0.006
line+0.005
Neg logit contributions
 topp-0.010
 tant-0.010
 ABC-0.010
 license-0.010
 price-0.010
Token
Token ID198
Feature activation+1.51e-01

Pos logit contributions
 topp+0.020
 tant+0.020
 de+0.020
 price+0.019
 Ow+0.019
Neg logit contributions
ull-0.010
Mom-0.009
aro-0.009
 below-0.009
 hurried-0.009
Token it
Token ID340
Feature activation+3.60e-01

Pos logit contributions
icer+0.012
 tant+0.012
 Ow+0.012
rob+0.012
 cris+0.011
Neg logit contributions
 soon-0.005
aro-0.005
 they-0.005
ful-0.005
 could-0.005
Token's
Token ID338
Feature activation-2.67e-01

Pos logit contributions
 tant+0.011
icer+0.011
 Ow+0.011
 cris+0.010
 topp+0.010
Neg logit contributions
 could-0.008
 decided-0.007
 soon-0.007
 ran-0.007
 unf-0.007
Token special
Token ID2041
Feature activation+5.60e-01

Pos logit contributions
 soon+0.005
 filled+0.005
 Mom+0.005
ful+0.005
 slowly+0.005
Neg logit contributions
 tant-0.009
rob-0.009
icer-0.009
 Ow-0.008
pill-0.008
Token and
Token ID290
Feature activation-7.10e-01

Pos logit contributions
icer+0.021
 Ow+0.020
 tant+0.020
rob+0.019
 unnecessary+0.019
Neg logit contributions
ull-0.008
 soon-0.008
 filled-0.008
 could-0.008
ful-0.008
Token one
Token ID530
Feature activation-1.63e-01

Pos logit contributions
 soon+0.013
 hurried+0.013
ull+0.013
 unf+0.013
 decided+0.012
Neg logit contributions
 Ow-0.022
icer-0.022
 tant-0.021
 de-0.021
 ABC-0.021
Token-
Token ID12
Feature activation-3.48e+00

Pos logit contributions
 soon+0.003
ull+0.003
 below+0.003
 could+0.003
aro+0.003
Neg logit contributions
 Ow-0.005
icer-0.005
 tant-0.005
 cris-0.005
 de-0.005
Tokenof
Token ID1659
Feature activation-8.00e-01

Pos logit contributions
ull+0.075
Mom+0.072
aro+0.072
ful+0.072
 decided+0.071
Neg logit contributions
icer-0.096
 de-0.095
 Ow-0.095
 ABC-0.095
 tant-0.094
Token-
Token ID12
Feature activation-3.39e+00

Pos logit contributions
ull+0.014
aro+0.014
 could+0.014
ful+0.014
lee+0.013
Neg logit contributions
icer-0.025
 Ow-0.025
 tant-0.025
 cris-0.024
 de-0.024
Tokena
Token ID64
Feature activation-1.54e+00

Pos logit contributions
ull+0.074
ful+0.072
Mom+0.072
aro+0.072
 decided+0.071
Neg logit contributions
icer-0.093
 Ow-0.091
 ABC-0.091
 de-0.090
 tant-0.089
Token-
Token ID12
Feature activation-3.12e+00

Pos logit contributions
aro+0.029
 could+0.028
ull+0.028
 soon+0.028
 ran+0.027
Neg logit contributions
 Ow-0.048
icer-0.048
 tant-0.046
 cris-0.045
 topp-0.044
Tokenkind
Token ID11031
Feature activation-1.91e+00

Pos logit contributions
ull+0.072
ful+0.070
Mom+0.069
aro+0.069
 decided+0.069
Neg logit contributions
icer-0.083
 ABC-0.080
 de-0.080
 menu-0.080
 Ow-0.079
Token to
Token ID284
Feature activation-1.18e-01

Pos logit contributions
 soon+0.015
 below+0.014
 ran+0.014
 rushed+0.013
 slowly+0.013
Neg logit contributions
icer-0.035
 Ow-0.035
 tant-0.033
 de-0.033
rob-0.032
Token call
Token ID869
Feature activation+2.53e-01

Pos logit contributions
 soon+0.002
 unf+0.002
 cheer+0.002
 decided+0.002
 hurried+0.002
Neg logit contributions
 Ow-0.004
icer-0.004
 de-0.004
 tant-0.004
 topp-0.004
Token him
Token ID683
Feature activation-9.23e-01

Pos logit contributions
icer+0.008
 Ow+0.008
pill+0.008
 topp+0.008
 de+0.008
Neg logit contributions
ull-0.004
 soon-0.004
 below-0.004
ful-0.004
 slowly-0.004
Token the
Token ID262
Feature activation-1.91e+00

Pos logit contributions
 soon+0.013
 below+0.012
 ranger+0.011
 slowly+0.011
ull+0.011
Neg logit contributions
icer-0.034
 Ow-0.034
 tant-0.033
 topp-0.033
 de-0.033
Token one
Token ID530
Feature activation-3.43e-01

Pos logit contributions
 ranger+0.042
 keeper+0.040
 below+0.040
 next+0.039
Mom+0.039
Neg logit contributions
 Ow-0.053
 de-0.051
 topp-0.050
 tant-0.050
icer-0.050
Token-
Token ID12
Feature activation-3.40e+00

Pos logit contributions
ull+0.004
 below+0.004
 soon+0.003
aro+0.003
 ranger+0.003
Neg logit contributions
 Ow-0.013
 tant-0.013
icer-0.013
 de-0.013
 topp-0.013
Tokenfoot
Token ID5898
Feature activation-8.05e-01

Pos logit contributions
ull+0.074
ful+0.070
 decided+0.070
aro+0.069
 soon+0.068
Neg logit contributions
 Ow-0.097
icer-0.096
 de-0.096
 fault-0.093
 ABC-0.093
Token king
Token ID5822
Feature activation-6.76e-01

Pos logit contributions
ull+0.011
aro+0.010
 below+0.010
 hurried+0.010
line+0.010
Neg logit contributions
 Ow-0.028
 de-0.028
 tant-0.028
icer-0.027
 topp-0.027
Token.
Token ID13
Feature activation-2.75e-01

Pos logit contributions
ull+0.007
aro+0.006
 below+0.006
 hurried+0.006
 soon+0.006
Neg logit contributions
 Ow-0.027
 tant-0.026
icer-0.026
 de-0.026
 topp-0.026
Token
Token ID198
Feature activation+2.64e-01

Pos logit contributions
ull+0.006
aro+0.006
 hurried+0.006
 below+0.006
line+0.006
Neg logit contributions
 de-0.007
 Ow-0.007
 tant-0.007
 topp-0.007
 candy-0.007
Token
Token ID198
Feature activation+3.42e-03

Pos logit contributions
 Ow+0.007
 ABC+0.007
 de+0.007
icer+0.007
 topp+0.007
Neg logit contributions
line-0.006
 hurried-0.006
lee-0.006
 unf-0.005
aro-0.005
Token special
Token ID2041
Feature activation+5.60e-01

Pos logit contributions
 soon+0.005
 filled+0.005
 Mom+0.005
ful+0.005
 slowly+0.005
Neg logit contributions
 tant-0.009
rob-0.009
icer-0.009
 Ow-0.008
pill-0.008
Token and
Token ID290
Feature activation-7.10e-01

Pos logit contributions
icer+0.021
 Ow+0.020
 tant+0.020
rob+0.019
 unnecessary+0.019
Neg logit contributions
ull-0.008
 soon-0.008
 filled-0.008
 could-0.008
ful-0.008
Token one
Token ID530
Feature activation-1.63e-01

Pos logit contributions
 soon+0.013
 hurried+0.013
ull+0.013
 unf+0.013
 decided+0.012
Neg logit contributions
 Ow-0.022
icer-0.022
 tant-0.021
 de-0.021
 ABC-0.021
Token-
Token ID12
Feature activation-3.48e+00

Pos logit contributions
 soon+0.003
ull+0.003
 below+0.003
 could+0.003
aro+0.003
Neg logit contributions
 Ow-0.005
icer-0.005
 tant-0.005
 cris-0.005
 de-0.005
Tokenof
Token ID1659
Feature activation-8.00e-01

Pos logit contributions
ull+0.075
Mom+0.072
aro+0.072
ful+0.072
 decided+0.071
Neg logit contributions
icer-0.096
 de-0.095
 Ow-0.095
 ABC-0.095
 tant-0.094
Token-
Token ID12
Feature activation-3.39e+00

Pos logit contributions
ull+0.014
aro+0.014
 could+0.014
ful+0.014
lee+0.013
Neg logit contributions
icer-0.025
 Ow-0.025
 tant-0.025
 cris-0.024
 de-0.024
Tokena
Token ID64
Feature activation-1.54e+00

Pos logit contributions
ull+0.074
ful+0.072
Mom+0.072
aro+0.072
 decided+0.071
Neg logit contributions
icer-0.093
 Ow-0.091
 ABC-0.091
 de-0.090
 tant-0.089
Token-
Token ID12
Feature activation-3.12e+00

Pos logit contributions
aro+0.029
 could+0.028
ull+0.028
 soon+0.028
 ran+0.027
Neg logit contributions
 Ow-0.048
icer-0.048
 tant-0.046
 cris-0.045
 topp-0.044
Tokenkind
Token ID11031
Feature activation-1.91e+00

Pos logit contributions
ull+0.072
ful+0.070
Mom+0.069
aro+0.069
 decided+0.069
Neg logit contributions
icer-0.083
 ABC-0.080
 de-0.080
 menu-0.080
 Ow-0.079
Token.
Token ID13
Feature activation-6.44e-01

Pos logit contributions
ull+0.028
 soon+0.027
 could+0.027
ful+0.026
 hurried+0.026
Neg logit contributions
icer-0.067
 Ow-0.063
 topp-0.063
 tant-0.063
 cris-0.063
Token Do
Token ID2141
Feature activation+9.11e-01

Pos logit contributions
ull+0.016
aro+0.015
 hurried+0.015
line+0.015
 below+0.015
Neg logit contributions
 Ow-0.016
 de-0.016
 ABC-0.015
 topp-0.015
 candy-0.015
TokenWhat
Token ID2061
Feature activation+4.93e-01

Pos logit contributions
ull+0.012
Mom+0.012
 hurried+0.011
aro+0.011
line+0.011
Neg logit contributions
 de-0.023
 Ow-0.022
 tant-0.022
 ABC-0.022
 fault-0.022
Token is
Token ID318
Feature activation-1.12e+00

Pos logit contributions
 tant+0.014
icer+0.014
 de+0.013
 cris+0.013
 retri+0.013
Neg logit contributions
ull-0.012
 could-0.012
 decided-0.011
 ran-0.011
 filled-0.010
Token it
Token ID340
Feature activation+1.28e-01

Pos logit contributions
ull+0.020
aro+0.019
 below+0.019
 unf+0.018
line+0.018
Neg logit contributions
 tant-0.035
 Ow-0.034
 ABC-0.034
icer-0.034
 de-0.034
Token?
Token ID30
Feature activation-8.25e-01

Pos logit contributions
 tant+0.005
 savings+0.005
icer+0.005
 Ow+0.005
 topp+0.005
Neg logit contributions
ull-0.002
 filled-0.002
ful-0.002
line-0.002
 soon-0.002
Token Where
Token ID6350
Feature activation+8.45e-01

Pos logit contributions
ull+0.018
aro+0.018
line+0.017
 below+0.017
Mom+0.017
Neg logit contributions
 tant-0.022
 de-0.022
 topp-0.021
 Ow-0.021
 candy-0.021
Token is
Token ID318
Feature activation-3.38e+00

Pos logit contributions
 tant+0.022
 de+0.021
icer+0.021
 cris+0.021
 Ow+0.021
Neg logit contributions
ull-0.021
 could-0.020
 decided-0.019
aro-0.019
 ran-0.019
Token it
Token ID340
Feature activation-1.96e-01

Pos logit contributions
ull+0.078
aro+0.076
 hurried+0.075
line+0.074
 below+0.073
Neg logit contributions
 Ow-0.086
 de-0.086
 tant-0.084
 topp-0.084
 ABC-0.083
Token?"
Token ID1701
Feature activation-1.13e+00

Pos logit contributions
ull+0.003
 unf+0.003
 soon+0.003
 decided+0.003
 below+0.003
Neg logit contributions
 tant-0.007
icer-0.007
 topp-0.007
 Ow-0.007
 de-0.007
Token He
Token ID679
Feature activation-5.39e-01

Pos logit contributions
ull+0.019
aro+0.018
 hurried+0.018
Mom+0.018
 ranger+0.017
Neg logit contributions
 topp-0.036
 tant-0.036
 de-0.036
 Ow-0.036
icer-0.036
Token followed
Token ID3940
Feature activation+5.10e-02

Pos logit contributions
ull+0.007
aro+0.006
 hurried+0.006
line+0.006
lee+0.006
Neg logit contributions
 Ow-0.020
 de-0.019
 tant-0.019
 topp-0.019
 candy-0.019
Token the
Token ID262
Feature activation-2.52e-01

Pos logit contributions
 Ow+0.001
 de+0.001
 tant+0.001
 topp+0.001
 candy+0.001
Neg logit contributions
ull-0.001
aro-0.001
 hurried-0.001
line-0.001
 below-0.001
Token
Token ID198
Feature activation-6.16e-01

Pos logit contributions
ull+0.005
aro+0.005
 hurried+0.005
line+0.005
 below+0.005
Neg logit contributions
 Ow-0.008
 de-0.008
 tant-0.008
 topp-0.008
 ABC-0.008
Token"
Token ID1
Feature activation-1.06e+00

Pos logit contributions
ull+0.008
lee+0.008
aro+0.008
 hurried+0.007
 below+0.007
Neg logit contributions
 Ow-0.022
 de-0.022
 license-0.022
icer-0.022
 candy-0.022
TokenDear
Token ID20266
Feature activation-8.86e-01

Pos logit contributions
Mom+0.021
ull+0.019
 decided+0.018
 hurried+0.018
 below+0.018
Neg logit contributions
 de-0.033
 tant-0.033
 Ow-0.033
icer-0.032
 fault-0.032
Token...
Token ID986
Feature activation-5.44e-01

Pos logit contributions
ull+0.014
 below+0.014
 Mom+0.014
 ranger+0.013
line+0.013
Neg logit contributions
 tant-0.030
icer-0.029
 topp-0.028
 Ow-0.028
 fault-0.028
TokenAn
Token ID2025
Feature activation+1.05e-01

Pos logit contributions
line+0.011
aro+0.011
ull+0.010
 below+0.010
 could+0.010
Neg logit contributions
 fault-0.016
 tant-0.016
 Ow-0.016
 unnecessary-0.016
icer-0.016
Token-
Token ID12
Feature activation-3.37e+00

Pos logit contributions
 Ow+0.003
 cris+0.003
 tant+0.003
 candy+0.003
icer+0.003
Neg logit contributions
ull-0.002
aro-0.002
 below-0.002
 soon-0.002
lee-0.002
Tokenna
Token ID2616
Feature activation-1.23e+00

Pos logit contributions
ull+0.069
aro+0.066
Mom+0.065
 below+0.064
 hurried+0.063
Neg logit contributions
 Ow-0.096
 de-0.095
 tant-0.094
icer-0.093
 ABC-0.093
Token...
Token ID986
Feature activation-6.46e-01

Pos logit contributions
ull+0.027
aro+0.026
 soon+0.025
 below+0.025
 decided+0.025
Neg logit contributions
 Ow-0.035
 tant-0.035
icer-0.033
 topp-0.033
 cris-0.032
Tokenand
Token ID392
Feature activation+1.83e+00

Pos logit contributions
aro+0.012
 below+0.012
 could+0.012
 soon+0.012
line+0.012
Neg logit contributions
icer-0.020
 tant-0.020
 fault-0.020
 unnecessary-0.020
 Ow-0.020
Token...
Token ID986
Feature activation-5.94e-01

Pos logit contributions
 tant+0.059
 Ow+0.059
icer+0.059
 topp+0.059
 ABC+0.057
Neg logit contributions
aro-0.032
 soon-0.032
ful-0.030
ull-0.030
 hurried-0.030
TokenBen
Token ID11696
Feature activation-7.21e-01

Pos logit contributions
 below+0.012
 soon+0.011
 decided+0.011
line+0.011
aro+0.011
Neg logit contributions
 tant-0.018
icer-0.018
 fault-0.018
 unnecessary-0.018
 Ow-0.018
Token to
Token ID284
Feature activation+1.27e-01

Pos logit contributions
 soon+0.005
 below+0.005
ful+0.005
 could+0.005
 decided+0.005
Neg logit contributions
 Ow-0.015
icer-0.015
 tant-0.015
 topp-0.014
 cris-0.014
Token do
Token ID466
Feature activation-1.85e-01

Pos logit contributions
 de+0.004
icer+0.004
 Ow+0.004
 tant+0.004
 topp+0.004
Neg logit contributions
ull-0.002
 unf-0.002
 hurried-0.002
 below-0.002
aro-0.002
Token more
Token ID517
Feature activation-1.87e-01

Pos logit contributions
 soon+0.002
 decided+0.001
 could+0.001
 ran+0.001
ful+0.001
Neg logit contributions
 Ow-0.008
icer-0.008
 tant-0.008
 topp-0.008
 cris-0.008
Token than
Token ID621
Feature activation-4.11e-01

Pos logit contributions
 soon+0.002
.+0.002
 below+0.002
 next+0.002
 slowly+0.002
Neg logit contributions
icer-0.008
 Ow-0.007
rob-0.007
 cris-0.007
 de-0.006
Token anything
Token ID1997
Feature activation-4.52e-02

Pos logit contributions
 soon+0.007
 they+0.006
 ever+0.006
 below+0.006
ful+0.006
Neg logit contributions
icer-0.016
 Ow-0.015
rob-0.014
OO-0.014
 license-0.013
Token-
Token ID12
Feature activation-3.37e+00

Pos logit contributions
 below+0.000
aro+0.000
ull+0.000
 soon+0.000
 unf+0.000
Neg logit contributions
 Ow-0.002
icer-0.002
 topp-0.002
 tant-0.002
 ABC-0.002
Token she
Token ID673
Feature activation-5.33e-01

Pos logit contributions
 below+0.071
 soon+0.069
 decided+0.069
ull+0.069
ful+0.068
Neg logit contributions
icer-0.101
 de-0.100
 Ow-0.098
 ABC-0.094
 cris-0.093
Token loved
Token ID6151
Feature activation-1.45e+00

Pos logit contributions
 decided+0.005
 hurried+0.005
 could+0.005
 soon+0.005
 ran+0.005
Neg logit contributions
 Ow-0.022
 tant-0.021
icer-0.021
 de-0.021
 ABC-0.021
Token to
Token ID284
Feature activation-4.77e-01

Pos logit contributions
 soon+0.010
 below+0.009
ull+0.008
ful+0.008
 decided+0.008
Neg logit contributions
 Ow-0.063
 topp-0.061
icer-0.061
 tant-0.060
 de-0.060
Token roll
Token ID4836
Feature activation-5.84e-01

Pos logit contributions
ull+0.009
line+0.009
 unf+0.009
 below+0.009
 hurried+0.009
Neg logit contributions
 de-0.014
 Ow-0.014
icer-0.014
 tant-0.014
 topp-0.014
Token!
Token ID0
Feature activation-3.11e-01

Pos logit contributions
 below+0.006
 soon+0.006
ful+0.005
 decided+0.005
 slowly+0.005
Neg logit contributions
 topp-0.024
 ABC-0.023
 Ow-0.023
 de-0.023
icer-0.023
Token singing
Token ID13777
Feature activation+1.11e-01

Pos logit contributions
 next+0.011
 mom+0.011
 soon+0.010
 slowly+0.010
 ran+0.010
Neg logit contributions
icer-0.018
 cris-0.017
 Ow-0.017
 topp-0.016
rob-0.015
Token,
Token ID11
Feature activation-4.68e-01

Pos logit contributions
icer+0.004
 Ow+0.004
 topp+0.004
 cris+0.004
 unnecessary+0.004
Neg logit contributions
 soon-0.002
 slowly-0.002
ull-0.002
 below-0.002
 ran-0.002
Token so
Token ID523
Feature activation+2.39e-01

Pos logit contributions
 soon+0.009
 ran+0.008
 decided+0.008
 rushed+0.008
 hurried+0.008
Neg logit contributions
 Ow-0.017
icer-0.017
 license-0.015
 topp-0.015
 price-0.015
Token that
Token ID326
Feature activation-9.06e-01

Pos logit contributions
icer+0.010
 Ow+0.010
 topp+0.009
 tant+0.009
 license+0.009
Neg logit contributions
 soon-0.003
 slowly-0.002
ful-0.002
aro-0.002
 hurried-0.002
Token one
Token ID530
Feature activation+3.37e-01

Pos logit contributions
ull+0.012
 soon+0.011
 hurried+0.011
aro+0.011
 decided+0.011
Neg logit contributions
 Ow-0.033
 tant-0.032
 de-0.032
icer-0.032
 topp-0.032
Token-
Token ID12
Feature activation-3.32e+00

Pos logit contributions
 Ow+0.012
icer+0.011
 topp+0.011
 de+0.011
 tant+0.011
Neg logit contributions
ull-0.005
 soon-0.005
 below-0.005
 decided-0.005
 hurried-0.005
Tokenday
Token ID820
Feature activation+8.09e-01

Pos logit contributions
 soon+0.073
ful+0.073
ull+0.072
 decided+0.072
Mom+0.068
Neg logit contributions
icer-0.099
 Ow-0.094
 price-0.091
 de-0.090
 license-0.090
Token she
Token ID673
Feature activation-4.93e-01

Pos logit contributions
 Ow+0.030
icer+0.030
 tant+0.029
 topp+0.029
 fault+0.029
Neg logit contributions
 soon-0.011
 hurried-0.010
 ran-0.010
 filled-0.010
 decided-0.010
Token could
Token ID714
Feature activation+3.82e-01

Pos logit contributions
ull+0.008
aro+0.008
 hurried+0.008
line+0.007
 unf+0.007
Neg logit contributions
 Ow-0.016
 de-0.016
 tant-0.016
 topp-0.016
icer-0.015
Token record
Token ID1700
Feature activation-1.72e-01

Pos logit contributions
 Ow+0.011
 de+0.011
 tant+0.011
 ABC+0.010
 topp+0.010
Neg logit contributions
ull-0.008
aro-0.007
line-0.007
 hurried-0.007
lee-0.007
Token her
Token ID607
Feature activation-5.81e-01

Pos logit contributions
 soon+0.003
 slowly+0.003
ful+0.003
 Mom+0.003
 cheer+0.003
Neg logit contributions
icer-0.006
 Ow-0.006
 cris-0.006
 topp-0.006
rob-0.006
Token nineteen
Token ID43063
Feature activation-1.89e+00

Pos logit contributions
 soon+0.002
 ran+0.002
Mom+0.002
line+0.002
 unf+0.002
Neg logit contributions
 unnecessary-0.002
 tant-0.002
 Ow-0.002
 license-0.002
 favorites-0.002
Token,
Token ID11
Feature activation-1.72e+00

Pos logit contributions
 soon+0.020
lee+0.018
 slowly+0.017
 below+0.017
ull+0.017
Neg logit contributions
rob-0.074
 Ow-0.073
 license-0.073
 tant-0.072
icer-0.071
Token twenty
Token ID8208
Feature activation-1.77e+00

Pos logit contributions
 soon+0.042
line+0.042
 below+0.041
Mom+0.040
 slowly+0.040
Neg logit contributions
 tant-0.044
 unnecessary-0.042
 license-0.042
 Ow-0.041
 ABC-0.041
Token,
Token ID11
Feature activation-1.40e+00

Pos logit contributions
ull+0.030
aro+0.028
 hurried+0.027
 below+0.027
lee+0.027
Neg logit contributions
 Ow-0.057
 tant-0.056
 de-0.056
 license-0.055
 candy-0.055
Token twenty
Token ID8208
Feature activation-1.79e+00

Pos logit contributions
 soon+0.033
 below+0.032
line+0.032
Mom+0.032
 slowly+0.032
Neg logit contributions
 tant-0.037
 unnecessary-0.036
 Ow-0.035
 license-0.035
 ABC-0.035
Token-
Token ID12
Feature activation-3.30e+00

Pos logit contributions
ull+0.031
aro+0.029
 hurried+0.028
 below+0.028
line+0.028
Neg logit contributions
 Ow-0.057
 de-0.056
 tant-0.056
 topp-0.055
 license-0.055
Tokenone
Token ID505
Feature activation-1.00e+00

Pos logit contributions
ull+0.093
aro+0.091
 hurried+0.089
 below+0.089
line+0.089
Neg logit contributions
 Ow-0.068
 de-0.066
 tant-0.066
 ABC-0.065
icer-0.064
Token,
Token ID11
Feature activation-1.32e+00

Pos logit contributions
 soon+0.011
ull+0.010
 below+0.010
 keeper+0.009
 ran+0.009
Neg logit contributions
 tant-0.041
 Ow-0.040
icer-0.039
 unnecessary-0.039
 cris-0.038
Token twenty
Token ID8208
Feature activation-1.86e+00

Pos logit contributions
 soon+0.031
 slowly+0.029
 next+0.029
 below+0.029
Mom+0.029
Neg logit contributions
 tant-0.037
 unnecessary-0.036
 Ow-0.035
 ABC-0.035
rob-0.034
Token-
Token ID12
Feature activation-3.27e+00

Pos logit contributions
ull+0.032
aro+0.030
 hurried+0.029
 below+0.029
lee+0.029
Neg logit contributions
 Ow-0.059
 tant-0.058
 de-0.058
 license-0.057
 topp-0.057
Tokentwo
Token ID11545
Feature activation-3.84e-01

Pos logit contributions
ull+0.090
aro+0.090
 below+0.087
Mom+0.086
 soon+0.086
Neg logit contributions
 Ow-0.069
 tant-0.067
 ABC-0.066
icer-0.066
 de-0.066
Token twenty
Token ID8208
Feature activation-1.79e+00

Pos logit contributions
 soon+0.033
 below+0.032
line+0.032
Mom+0.032
 slowly+0.032
Neg logit contributions
 tant-0.037
 unnecessary-0.036
 Ow-0.035
 license-0.035
 ABC-0.035
Token-
Token ID12
Feature activation-3.30e+00

Pos logit contributions
ull+0.031
aro+0.029
 hurried+0.028
 below+0.028
line+0.028
Neg logit contributions
 Ow-0.057
 de-0.056
 tant-0.056
 topp-0.055
 license-0.055
Tokenone
Token ID505
Feature activation-1.00e+00

Pos logit contributions
ull+0.093
aro+0.091
 hurried+0.089
 below+0.089
line+0.089
Neg logit contributions
 Ow-0.068
 de-0.066
 tant-0.066
 ABC-0.065
icer-0.064
Token,
Token ID11
Feature activation-1.32e+00

Pos logit contributions
 soon+0.011
ull+0.010
 below+0.010
 keeper+0.009
 ran+0.009
Neg logit contributions
 tant-0.041
 Ow-0.040
icer-0.039
 unnecessary-0.039
 cris-0.038
Token twenty
Token ID8208
Feature activation-1.86e+00

Pos logit contributions
 soon+0.031
 slowly+0.029
 next+0.029
 below+0.029
Mom+0.029
Neg logit contributions
 tant-0.037
 unnecessary-0.036
 Ow-0.035
 ABC-0.035
rob-0.034
Token-
Token ID12
Feature activation-3.27e+00

Pos logit contributions
ull+0.032
aro+0.030
 hurried+0.029
 below+0.029
lee+0.029
Neg logit contributions
 Ow-0.059
 tant-0.058
 de-0.058
 license-0.057
 topp-0.057
Tokentwo
Token ID11545
Feature activation-3.84e-01

Pos logit contributions
ull+0.090
aro+0.090
 below+0.087
Mom+0.086
 soon+0.086
Neg logit contributions
 Ow-0.069
 tant-0.067
 ABC-0.066
icer-0.066
 de-0.066
Token,
Token ID11
Feature activation-1.19e+00

Pos logit contributions
 soon+0.004
ull+0.004
lee+0.004
 slowly+0.004
 below+0.004
Neg logit contributions
 Ow-0.015
 tant-0.015
 license-0.014
 shave-0.014
 unnecessary-0.014
Token twenty
Token ID8208
Feature activation-1.25e+00

Pos logit contributions
 soon+0.027
 next+0.026
Mom+0.026
 below+0.025
 slowly+0.025
Neg logit contributions
 tant-0.035
 unnecessary-0.034
 Ow-0.032
 fault-0.032
 license-0.031
Token-
Token ID12
Feature activation-2.64e+00

Pos logit contributions
ull+0.021
aro+0.021
lee+0.020
 soon+0.020
 below+0.020
Neg logit contributions
 Ow-0.040
 tant-0.039
 license-0.039
 de-0.038
 cris-0.038
Tokenthree
Token ID15542
Feature activation-1.35e-01

Pos logit contributions
aro+0.074
ull+0.073
 soon+0.071
 below+0.071
Mom+0.070
Neg logit contributions
 Ow-0.056
 tant-0.055
icer-0.053
 de-0.053
 ABC-0.052
Token "
Token ID366
Feature activation+1.26e-01

Pos logit contributions
 Ow+0.022
 tant+0.022
 topp+0.022
icer+0.022
 de+0.022
Neg logit contributions
ull-0.013
aro-0.012
 hurried-0.012
 below-0.012
line-0.012
TokenThat
Token ID2504
Feature activation+1.12e-01

Pos logit contributions
 de+0.004
 topp+0.004
 license+0.004
 ABC+0.004
 fault+0.004
Neg logit contributions
ull-0.002
Mom-0.002
line-0.002
 hurried-0.002
 below-0.002
Token hurt
Token ID5938
Feature activation+4.11e-01

Pos logit contributions
 de+0.002
 favorites+0.002
 cris+0.002
 savings+0.002
icer+0.002
Neg logit contributions
ull-0.003
 decided-0.003
 soon-0.003
 could-0.003
 unf-0.003
Token.
Token ID13
Feature activation+4.03e-01

Pos logit contributions
 de+0.012
 Ow+0.012
 tant+0.011
 topp+0.011
 ABC+0.011
Neg logit contributions
ull-0.008
aro-0.008
 hurried-0.008
 unf-0.008
line-0.008
Token Where
Token ID6350
Feature activation+1.32e+00

Pos logit contributions
 Ow+0.010
 de+0.010
 tant+0.009
 topp+0.009
 ABC+0.009
Neg logit contributions
ull-0.010
aro-0.010
 hurried-0.009
line-0.009
 below-0.009
Token is
Token ID318
Feature activation-3.26e+00

Pos logit contributions
 Ow+0.035
 de+0.034
 tant+0.034
 topp+0.034
 candy+0.033
Neg logit contributions
ull-0.029
aro-0.029
line-0.028
 hurried-0.028
lee-0.028
Token my
Token ID616
Feature activation-3.00e-01

Pos logit contributions
ull+0.086
aro+0.083
 hurried+0.082
line+0.082
 below+0.081
Neg logit contributions
 Ow-0.073
 de-0.073
 tant-0.071
 topp-0.071
 candy-0.071
Token laser
Token ID12855
Feature activation+3.09e-01

Pos logit contributions
 mom+0.005
 Mom+0.005
 mum+0.005
 next+0.005
 ranger+0.004
Neg logit contributions
pill-0.010
 cris-0.010
rob-0.010
icer-0.010
 topp-0.010
Token?"
Token ID1701
Feature activation-3.13e-01

Pos logit contributions
 de+0.009
 Ow+0.009
 tant+0.009
 topp+0.009
icer+0.009
Neg logit contributions
ull-0.006
 unf-0.005
aro-0.005
 hurried-0.005
 below-0.005
Token
Token ID198
Feature activation-3.39e-01

Pos logit contributions
ull+0.007
aro+0.006
line+0.006
 hurried+0.006
lee+0.006
Neg logit contributions
 de-0.009
 Ow-0.009
 tant-0.008
 candy-0.008
 license-0.008
Token
Token ID198
Feature activation-5.50e-01

Pos logit contributions
line+0.010
 hurried+0.010
castle+0.010
ths+0.010
aro+0.010
Neg logit contributions
 Ow-0.007
 de-0.007
 ABC-0.006
icer-0.006
 candy-0.005
Token Where
Token ID6350
Feature activation+1.18e+00

Pos logit contributions
 tant+0.000
 de+0.000
 topp+0.000
 quality+0.000
 ABC+0.000
Neg logit contributions
ull-0.000
Mom-0.000
 hurried-0.000
aro-0.000
 unf-0.000
Token is
Token ID318
Feature activation-3.13e+00

Pos logit contributions
 Ow+0.025
 de+0.024
 topp+0.023
 tant+0.023
 candy+0.023
Neg logit contributions
ull-0.033
aro-0.032
line-0.032
lee-0.032
 below-0.032
Token mom
Token ID1995
Feature activation-8.47e-01

Pos logit contributions
ull+0.083
line+0.082
lee+0.080
 below+0.078
 hurried+0.078
Neg logit contributions
 candy-0.073
 Ow-0.072
 license-0.068
 topp-0.068
 money-0.068
Token?
Token ID30
Feature activation-8.11e-01

Pos logit contributions
ull+0.017
line+0.016
 hurried+0.016
 below+0.016
 unf+0.016
Neg logit contributions
 de-0.025
 tant-0.025
 Ow-0.025
 topp-0.024
 ABC-0.024
Token Where
Token ID6350
Feature activation+6.35e-01

Pos logit contributions
ull+0.014
 hurried+0.014
aro+0.013
Mom+0.013
 below+0.013
Neg logit contributions
 topp-0.025
 tant-0.025
 de-0.025
 Ow-0.025
icer-0.025
Token is
Token ID318
Feature activation-3.26e+00

Pos logit contributions
 Ow+0.013
 de+0.013
 ABC+0.013
 candy+0.013
 tant+0.012
Neg logit contributions
ull-0.018
line-0.017
lee-0.017
aro-0.017
 below-0.017
Token Sue
Token ID26089
Feature activation-7.67e-01

Pos logit contributions
ull+0.089
line+0.085
lee+0.083
aro+0.083
 below+0.083
Neg logit contributions
 candy-0.073
 Ow-0.072
 de-0.070
 license-0.069
 money-0.069
Token?
Token ID30
Feature activation-6.92e-01

Pos logit contributions
ull+0.012
 below+0.012
 soon+0.011
 unf+0.011
 decided+0.011
Neg logit contributions
 de-0.026
 tant-0.026
 topp-0.026
 Ow-0.025
 ABC-0.025
Token Where
Token ID6350
Feature activation+5.69e-01

Pos logit contributions
ull+0.014
aro+0.014
 hurried+0.013
line+0.013
 below+0.013
Neg logit contributions
 Ow-0.019
 de-0.019
 tant-0.019
 topp-0.019
 candy-0.019
Token is
Token ID318
Feature activation-3.14e+00

Pos logit contributions
 Ow+0.012
 de+0.011
 ABC+0.011
 candy+0.011
 tant+0.011
Neg logit contributions
ull-0.016
lee-0.016
line-0.016
aro-0.016
 below-0.015
Token the
Token ID262
Feature activation-1.13e+00

Pos logit contributions
ull+0.091
line+0.087
aro+0.086
 below+0.085
 hurried+0.085
Neg logit contributions
 candy-0.064
 Ow-0.064
 de-0.063
 topp-0.061
 license-0.060
Token it
Token ID340
Feature activation-4.01e-01

Pos logit contributions
 hurried+0.004
 decided+0.004
ull+0.004
 soon+0.004
 rushed+0.004
Neg logit contributions
icer-0.011
 Ow-0.011
 tant-0.011
 de-0.011
 ABC-0.011
Token was
Token ID373
Feature activation+2.14e-01

Pos logit contributions
ull+0.008
aro+0.008
line+0.008
lee+0.007
Mom+0.007
Neg logit contributions
 de-0.012
 candy-0.012
 topp-0.011
 tant-0.011
 price-0.011
Token now
Token ID783
Feature activation-3.53e-02

Pos logit contributions
 Ow+0.006
 tant+0.006
 de+0.006
icer+0.006
 ABC+0.006
Neg logit contributions
ull-0.005
 hurried-0.004
 unf-0.004
aro-0.004
lee-0.004
Token a
Token ID257
Feature activation+1.06e+00

Pos logit contributions
ull+0.001
 hurried+0.001
 unf+0.001
 below+0.001
aro+0.001
Neg logit contributions
 Ow-0.001
 tant-0.001
icer-0.001
 de-0.001
 ABC-0.001
Token one
Token ID530
Feature activation+3.39e-01

Pos logit contributions
 Ow+0.029
icer+0.029
rob+0.027
 topp+0.027
pill+0.027
Neg logit contributions
ull-0.023
 unf-0.023
 ranger-0.023
 soon-0.022
 keeper-0.022
Token-
Token ID12
Feature activation-3.26e+00

Pos logit contributions
 Ow+0.012
 tant+0.012
icer+0.012
 de+0.012
 ABC+0.012
Neg logit contributions
ull-0.005
 below-0.004
aro-0.004
 soon-0.004
 unf-0.004
Tokenof
Token ID1659
Feature activation-3.18e-01

Pos logit contributions
ull+0.073
Mom+0.068
ful+0.068
aro+0.067
 decided+0.067
Neg logit contributions
 Ow-0.090
icer-0.089
 ABC-0.088
 de-0.087
 tant-0.086
Token-
Token ID12
Feature activation-3.12e+00

Pos logit contributions
ull+0.005
aro+0.004
 hurried+0.004
lee+0.004
line+0.004
Neg logit contributions
 Ow-0.011
 tant-0.011
 de-0.011
 topp-0.011
icer-0.011
Tokena
Token ID64
Feature activation-1.05e+00

Pos logit contributions
ull+0.070
ful+0.066
Mom+0.066
aro+0.065
 decided+0.064
Neg logit contributions
 Ow-0.087
icer-0.086
 ABC-0.085
 unnecessary-0.083
 fault-0.083
Token-
Token ID12
Feature activation-2.83e+00

Pos logit contributions
ull+0.016
aro+0.015
 hurried+0.015
line+0.015
 below+0.015
Neg logit contributions
 Ow-0.035
 de-0.034
 tant-0.034
 topp-0.034
icer-0.034
Tokenkind
Token ID11031
Feature activation-1.47e+00

Pos logit contributions
ull+0.063
ful+0.060
Mom+0.060
aro+0.059
 decided+0.059
Neg logit contributions
 Ow-0.079
icer-0.078
 unnecessary-0.077
 ABC-0.077
 fault-0.076
Token she
Token ID673
Feature activation-3.49e-01

Pos logit contributions
ull+0.012
aro+0.012
line+0.012
 ranger+0.011
 hurried+0.011
Neg logit contributions
 de-0.014
 candy-0.013
 Ow-0.013
 topp-0.013
 tant-0.013
Token looked
Token ID3114
Feature activation-8.08e-02

Pos logit contributions
ull+0.005
aro+0.005
 hurried+0.005
line+0.005
 below+0.005
Neg logit contributions
 Ow-0.012
 de-0.012
 tant-0.012
 topp-0.011
 candy-0.011
Token with
Token ID351
Feature activation-9.56e-01

Pos logit contributions
 soon+0.001
 below+0.001
 decided+0.001
 slowly+0.001
 filled+0.001
Neg logit contributions
 Ow-0.003
rob-0.003
icer-0.003
 ABC-0.003
 topp-0.003
Token the
Token ID262
Feature activation-1.44e+00

Pos logit contributions
 below+0.016
 cheer+0.016
lee+0.016
 Mom+0.016
 soon+0.016
Neg logit contributions
icer-0.031
rob-0.031
 Ow-0.031
 ABC-0.030
 retri-0.030
Token make
Token ID787
Feature activation-1.45e+00

Pos logit contributions
 below+0.030
 unf+0.029
 mom+0.029
 next+0.029
 mum+0.028
Neg logit contributions
rob-0.041
 Ow-0.041
ior-0.039
pill-0.039
icer-0.039
Token-
Token ID12
Feature activation-3.26e+00

Pos logit contributions
ull+0.030
 soon+0.029
 filled+0.029
lee+0.029
 unf+0.028
Neg logit contributions
 Ow-0.044
icer-0.044
rob-0.042
 tant-0.041
 retri-0.040
Tokenup
Token ID929
Feature activation-8.32e-02

Pos logit contributions
ull+0.063
ful+0.057
line+0.057
 decided+0.056
 soon+0.056
Neg logit contributions
 Ow-0.105
 de-0.101
icer-0.101
 fault-0.099
 license-0.098
Token.
Token ID13
Feature activation-3.13e-01

Pos logit contributions
line+0.001
ull+0.001
 below+0.001
 decided+0.001
 soon+0.001
Neg logit contributions
 Ow-0.003
icer-0.003
 topp-0.003
 cris-0.003
 de-0.003
Token She
Token ID1375
Feature activation-6.96e-01

Pos logit contributions
ull+0.006
lee+0.006
Mom+0.006
aro+0.006
line+0.006
Neg logit contributions
 tant-0.009
 topp-0.009
 Ow-0.009
 license-0.009
 de-0.009
Token realized
Token ID6939
Feature activation+1.10e-01

Pos logit contributions
ull+0.011
aro+0.011
line+0.011
 below+0.010
 hurried+0.010
Neg logit contributions
 de-0.023
 Ow-0.022
 tant-0.022
 topp-0.022
 ABC-0.022
Token that
Token ID326
Feature activation+1.38e-01

Pos logit contributions
icer+0.005
 Ow+0.005
 topp+0.005
 tant+0.005
 price+0.005
Neg logit contributions
aro-0.001
ful-0.001
 soon-0.001
ull-0.001
lee-0.001
Token'm
Token ID1101
Feature activation-1.38e+00

Pos logit contributions
ull+0.009
 hurried+0.009
 unf+0.009
 decided+0.009
 soon+0.009
Neg logit contributions
 Ow-0.013
 ABC-0.013
icer-0.013
 topp-0.012
 de-0.012
Token here
Token ID994
Feature activation-1.85e-01

Pos logit contributions
ull+0.023
line+0.021
aro+0.021
Mom+0.021
 ranger+0.020
Neg logit contributions
 de-0.046
 candy-0.044
 Ow-0.044
 fault-0.043
 license-0.043
Token for
Token ID329
Feature activation+6.08e-01

Pos logit contributions
ull+0.003
aro+0.003
 hurried+0.003
 below+0.003
 unf+0.003
Neg logit contributions
 Ow-0.006
 tant-0.005
 de-0.005
 topp-0.005
icer-0.005
Token a
Token ID257
Feature activation+3.73e-01

Pos logit contributions
 tant+0.015
 de+0.015
 Ow+0.015
icer+0.015
 topp+0.015
Neg logit contributions
ull-0.014
aro-0.014
 hurried-0.014
 unf-0.014
 soon-0.014
Token check
Token ID2198
Feature activation-8.20e-01

Pos logit contributions
 Ow+0.009
icer+0.009
 topp+0.009
 de+0.009
 tant+0.008
Neg logit contributions
 soon-0.009
ull-0.009
 ranger-0.009
 unf-0.009
Mom-0.009
Token-
Token ID12
Feature activation-3.25e+00

Pos logit contributions
aro+0.012
ull+0.012
line+0.012
 soon+0.012
 below+0.011
Neg logit contributions
 Ow-0.028
 tant-0.028
 topp-0.028
 de-0.027
icer-0.027
Tokenup
Token ID929
Feature activation+4.38e-02

Pos logit contributions
ull+0.074
aro+0.070
Mom+0.070
 below+0.070
line+0.069
Neg logit contributions
 de-0.088
 Ow-0.087
icer-0.084
 topp-0.084
 tant-0.084
Token on
Token ID319
Feature activation-3.20e-01

Pos logit contributions
 Ow+0.002
icer+0.002
 topp+0.001
 de+0.001
 tant+0.001
Neg logit contributions
line-0.001
 soon-0.001
 below-0.001
ull-0.001
 unf-0.001
Token my
Token ID616
Feature activation+8.80e-03

Pos logit contributions
 soon+0.005
 Mom+0.005
 below+0.005
 slowly+0.005
 next+0.005
Neg logit contributions
 topp-0.011
pill-0.011
 tant-0.011
icer-0.011
rob-0.011
Token heart
Token ID2612
Feature activation-5.00e-01

Pos logit contributions
 Ow+0.000
 topp+0.000
pill+0.000
 tant+0.000
icer+0.000
Neg logit contributions
 unf-0.000
 keeper-0.000
 mom-0.000
 ranger-0.000
 Mom-0.000
Token.
Token ID13
Feature activation-2.53e-01

Pos logit contributions
 soon+0.006
line+0.005
 below+0.005
ull+0.005
ful+0.005
Neg logit contributions
 Ow-0.020
icer-0.019
 topp-0.019
 cris-0.019
 de-0.019
Token proudly
Token ID23017
Feature activation+7.34e-01

Pos logit contributions
 Ow+0.010
 ABC+0.010
 license+0.010
 candy+0.010
 de+0.010
Neg logit contributions
ull-0.007
aro-0.007
lee-0.007
line-0.007
 ranger-0.007
Token call
Token ID869
Feature activation+4.87e-01

Pos logit contributions
 Ow+0.025
icer+0.025
 de+0.025
 tant+0.024
 topp+0.023
Neg logit contributions
 soon-0.012
 unf-0.012
 hurried-0.012
 ran-0.012
 below-0.012
Token himself
Token ID2241
Feature activation-3.97e-01

Pos logit contributions
icer+0.015
 Ow+0.015
 de+0.015
 topp+0.014
 tant+0.014
Neg logit contributions
ull-0.009
 below-0.009
aro-0.009
line-0.009
 soon-0.009
Token the
Token ID262
Feature activation-1.94e+00

Pos logit contributions
 soon+0.006
 below+0.006
ull+0.006
line+0.006
ful+0.005
Neg logit contributions
 Ow-0.014
icer-0.014
 tant-0.014
 de-0.014
 topp-0.014
Token one
Token ID530
Feature activation-1.83e-01

Pos logit contributions
 below+0.039
 ranger+0.038
 unf+0.038
 keeper+0.038
ull+0.037
Neg logit contributions
 Ow-0.057
 tant-0.054
 de-0.053
icer-0.053
 topp-0.053
Token-
Token ID12
Feature activation-3.25e+00

Pos logit contributions
 below+0.002
ull+0.002
 soon+0.002
line+0.002
aro+0.002
Neg logit contributions
 Ow-0.007
 tant-0.006
icer-0.006
 de-0.006
 topp-0.006
Tokenfoot
Token ID5898
Feature activation-5.44e-01

Pos logit contributions
ull+0.069
aro+0.064
 decided+0.064
ful+0.064
 soon+0.064
Neg logit contributions
 Ow-0.097
 de-0.094
icer-0.094
 tant-0.092
 fault-0.092
Token king
Token ID5822
Feature activation-4.79e-01

Pos logit contributions
ull+0.008
aro+0.008
 hurried+0.008
 below+0.008
line+0.008
Neg logit contributions
 Ow-0.018
 de-0.018
 tant-0.018
icer-0.018
 topp-0.018
Token.
Token ID13
Feature activation-5.53e-02

Pos logit contributions
ull+0.006
aro+0.006
 below+0.006
line+0.006
 soon+0.006
Neg logit contributions
 Ow-0.018
 tant-0.017
 de-0.017
icer-0.017
 topp-0.017
Token<|endoftext|>
Token ID50256
Feature activation-2.33e-02

Pos logit contributions
ull+0.001
aro+0.001
 below+0.001
 hurried+0.001
 ranger+0.001
Neg logit contributions
 tant-0.002
 de-0.002
 Ow-0.002
 topp-0.002
 candy-0.002
TokenOnce
Token ID7454
Feature activation-4.62e-01

Pos logit contributions
ull+0.001
 below+0.001
aro+0.001
 hurried+0.001
line+0.001
Neg logit contributions
 tant-0.001
 topp-0.001
 de-0.001
 license-0.000
 Ow-0.000
Token the
Token ID262
Feature activation-5.75e-01

Pos logit contributions
 Ow+0.002
 de+0.002
 tant+0.002
 topp+0.002
 ABC+0.002
Neg logit contributions
ull-0.002
aro-0.002
 hurried-0.002
 below-0.002
line-0.002
Token red
Token ID2266
Feature activation-7.96e-01

Pos logit contributions
ull+0.007
 unf+0.007
 ranger+0.007
 below+0.007
aro+0.007
Neg logit contributions
 Ow-0.021
 topp-0.021
 tant-0.021
 de-0.020
 ABC-0.020
Token car
Token ID1097
Feature activation-5.70e-01

Pos logit contributions
ull+0.015
aro+0.014
 below+0.014
 unf+0.014
line+0.013
Neg logit contributions
 de-0.024
 topp-0.024
 unnecessary-0.024
 tant-0.024
 Ow-0.023
Token.
Token ID13
Feature activation+3.78e-01

Pos logit contributions
ull+0.010
aro+0.010
 hurried+0.009
lee+0.009
line+0.009
Neg logit contributions
 de-0.018
 Ow-0.017
 tant-0.017
 topp-0.017
 candy-0.017
Token Where
Token ID6350
Feature activation+8.05e-01

Pos logit contributions
 Ow+0.011
 de+0.010
 license+0.010
 topp+0.010
 price+0.010
Neg logit contributions
ull-0.008
aro-0.008
lee-0.008
line-0.008
 hurried-0.008
Token is
Token ID318
Feature activation-3.24e+00

Pos logit contributions
 Ow+0.017
 de+0.016
 topp+0.016
 ABC+0.016
 candy+0.016
Neg logit contributions
ull-0.022
line-0.022
lee-0.022
aro-0.022
 below-0.022
Token it
Token ID340
Feature activation+2.11e-02

Pos logit contributions
line+0.092
aver+0.092
ull+0.090
lee+0.089
ok+0.089
Neg logit contributions
 candy-0.077
 money-0.075
 license-0.073
 price-0.068
 Ow-0.068
Token?"
Token ID1701
Feature activation-5.21e-01

Pos logit contributions
 tant+0.001
 Ow+0.001
 topp+0.001
 de+0.001
icer+0.001
Neg logit contributions
ull-0.000
 unf-0.000
 soon-0.000
 below-0.000
 decided-0.000
Token
Token ID198
Feature activation-5.33e-01

Pos logit contributions
ull+0.008
 ranger+0.008
aro+0.008
Mom+0.008
 below+0.007
Neg logit contributions
 topp-0.018
icer-0.017
 tant-0.017
 price-0.017
 Ow-0.017
Token
Token ID198
Feature activation-7.30e-01

Pos logit contributions
line+0.014
 hurried+0.014
aro+0.013
ull+0.013
castle+0.013
Neg logit contributions
 Ow-0.013
 de-0.012
 ABC-0.011
 license-0.011
 tant-0.011
TokenThey
Token ID2990
Feature activation-4.21e-01

Pos logit contributions
lee+0.015
line+0.015
ull+0.015
 below+0.015
 hurried+0.014
Neg logit contributions
 Ow-0.020
 license-0.020
 de-0.019
 candy-0.019
 tant-0.019
Token our
Token ID674
Feature activation-1.85e+00

Pos logit contributions
 tant+0.004
 topp+0.004
 de+0.004
icer+0.004
 Ow+0.004
Neg logit contributions
 soon-0.001
Mom-0.001
ull-0.001
aro-0.001
ful-0.001
Token moms
Token ID34015
Feature activation-3.93e-01

Pos logit contributions
ull+0.027
line+0.026
 hurried+0.024
aro+0.023
 below+0.023
Neg logit contributions
 candy-0.065
 de-0.065
 ABC-0.065
 price-0.065
 Ow-0.065
Token first
Token ID717
Feature activation-3.76e-01

Pos logit contributions
 soon+0.006
line+0.005
 could+0.005
ful+0.005
 next+0.005
Neg logit contributions
 tant-0.015
 Ow-0.014
 de-0.014
 unnecessary-0.014
pill-0.014
Token.
Token ID13
Feature activation-1.91e-01

Pos logit contributions
ull+0.003
 soon+0.003
 hurried+0.003
aro+0.003
line+0.003
Neg logit contributions
 Ow-0.015
 de-0.015
 tant-0.015
icer-0.015
 topp-0.015
Token Where
Token ID6350
Feature activation+1.13e+00

Pos logit contributions
ull+0.004
Mom+0.004
aro+0.004
line+0.004
 ranger+0.004
Neg logit contributions
 tant-0.006
 Ow-0.005
 topp-0.005
 de-0.005
icer-0.005
Token is
Token ID318
Feature activation-3.22e+00

Pos logit contributions
 tant+0.027
 Ow+0.027
 de+0.027
icer+0.026
 topp+0.026
Neg logit contributions
ull-0.028
aro-0.027
 hurried-0.027
 unf-0.026
line-0.026
Token your
Token ID534
Feature activation-4.79e-01

Pos logit contributions
ull+0.064
lee+0.060
line+0.060
 hurried+0.058
 unf+0.058
Neg logit contributions
 candy-0.097
 Ow-0.094
 de-0.094
 license-0.093
 money-0.093
Token mom
Token ID1995
Feature activation-7.41e-01

Pos logit contributions
 ranger+0.007
 keeper+0.007
 unf+0.007
ull+0.007
aro+0.007
Neg logit contributions
 Ow-0.016
icer-0.016
 tant-0.016
 topp-0.016
pill-0.016
Token?"
Token ID1701
Feature activation-4.21e-01

Pos logit contributions
 soon+0.011
 decided+0.011
line+0.011
 asked+0.011
 could+0.011
Neg logit contributions
 tant-0.027
 Ow-0.026
 de-0.026
icer-0.025
 topp-0.025
Token
Token ID198
Feature activation-5.65e-01

Pos logit contributions
ull+0.007
aro+0.007
line+0.007
 ranger+0.007
Mom+0.007
Neg logit contributions
 Ow-0.013
 topp-0.013
 de-0.013
 tant-0.013
icer-0.013
Token
Token ID198
Feature activation-7.62e-01

Pos logit contributions
aro+0.013
 hurried+0.013
line+0.013
lee+0.013
ull+0.013
Neg logit contributions
 Ow-0.014
 de-0.014
 tant-0.014
 ABC-0.013
 license-0.013
Token "
Token ID366
Feature activation-3.16e-01

Pos logit contributions
ull+0.001
 hurried+0.001
aro+0.001
line+0.001
lee+0.001
Neg logit contributions
 tant-0.001
 topp-0.001
 Ow-0.001
 de-0.001
 ABC-0.001
TokenOh
Token ID5812
Feature activation+3.39e-01

Pos logit contributions
Mom+0.005
ull+0.004
 hurried+0.004
 decided+0.004
line+0.004
Neg logit contributions
 de-0.011
 fault-0.011
 tant-0.011
 Ow-0.011
 topp-0.011
Token no
Token ID645
Feature activation+2.21e-01

Pos logit contributions
 tant+0.013
icer+0.012
 ABC+0.012
 license+0.012
 Ow+0.012
Neg logit contributions
 soon-0.004
 hurried-0.004
 unf-0.004
 ran-0.004
aro-0.004
Token!
Token ID0
Feature activation+5.05e-01

Pos logit contributions
 Ow+0.009
icer+0.008
 tant+0.008
pill+0.008
 ABC+0.008
Neg logit contributions
 soon-0.003
 ran-0.002
 hurried-0.002
 rushed-0.002
 unf-0.002
Token Where
Token ID6350
Feature activation+1.21e+00

Pos logit contributions
 tant+0.014
 de+0.014
 topp+0.014
 candy+0.014
 quality+0.013
Neg logit contributions
ull-0.010
aro-0.010
 hurried-0.010
Mom-0.010
lee-0.010
Token is
Token ID318
Feature activation-3.22e+00

Pos logit contributions
 de+0.036
 tant+0.036
 Ow+0.036
icer+0.036
 topp+0.035
Neg logit contributions
ull-0.023
 hurried-0.022
aro-0.022
 decided-0.021
 could-0.021
Token my
Token ID616
Feature activation-1.69e-01

Pos logit contributions
aro+0.072
 hurried+0.070
ull+0.070
 soon+0.069
 decided+0.069
Neg logit contributions
 tant-0.085
 Ow-0.083
icer-0.083
 de-0.082
 ABC-0.081
Token purse
Token ID27517
Feature activation-5.77e-01

Pos logit contributions
 mom+0.004
 next+0.004
 keeper+0.004
 Mom+0.004
 ranger+0.004
Neg logit contributions
rob-0.005
pill-0.005
 topp-0.005
icated-0.005
icer-0.005
Token?"
Token ID1701
Feature activation-5.67e-01

Pos logit contributions
line+0.009
ull+0.009
 below+0.009
 soon+0.009
 unf+0.009
Neg logit contributions
 Ow-0.019
 de-0.018
 tant-0.018
 topp-0.018
icer-0.018
Token she
Token ID673
Feature activation+4.56e-01

Pos logit contributions
ull+0.010
aro+0.009
 hurried+0.009
line+0.009
 below+0.009
Neg logit contributions
 de-0.018
 Ow-0.018
 tant-0.017
 topp-0.017
 candy-0.017
Token asked
Token ID1965
Feature activation+3.02e-01

Pos logit contributions
 de+0.018
 Ow+0.018
 tant+0.018
 topp+0.018
 ABC+0.018
Neg logit contributions
ull-0.004
aro-0.004
line-0.004
 hurried-0.003
 below-0.003
Token"
Token ID1
Feature activation-9.51e-01

Pos logit contributions
ull+0.008
aro+0.008
line+0.008
 hurried+0.008
 below+0.008
Neg logit contributions
 Ow-0.012
 de-0.012
 tant-0.012
 license-0.012
icer-0.012
TokenOh
Token ID5812
Feature activation+3.29e-01

Pos logit contributions
Mom+0.017
ull+0.016
 decided+0.016
 ranger+0.015
aro+0.015
Neg logit contributions
 de-0.031
 fault-0.031
 Ow-0.031
 unnecessary-0.030
icer-0.030
Token no
Token ID645
Feature activation+2.52e-01

Pos logit contributions
 tant+0.012
 ABC+0.012
icer+0.012
 price+0.012
 cris+0.012
Neg logit contributions
 soon-0.004
 hurried-0.004
ull-0.003
aro-0.003
 Mom-0.003
Token!
Token ID0
Feature activation+6.08e-02

Pos logit contributions
icer+0.010
 ABC+0.010
 Ow+0.010
pill+0.010
 tant+0.010
Neg logit contributions
 soon-0.003
 ran-0.002
 unf-0.002
.-0.002
 slowly-0.002
Token Where
Token ID6350
Feature activation+1.02e+00

Pos logit contributions
 tant+0.002
 de+0.002
 quality+0.002
 topp+0.002
 candy+0.002
Neg logit contributions
ull-0.001
Mom-0.001
aro-0.001
 hurried-0.001
 Mom-0.001
Token is
Token ID318
Feature activation-3.22e+00

Pos logit contributions
 de+0.026
 Ow+0.026
 tant+0.026
icer+0.026
 topp+0.026
Neg logit contributions
ull-0.023
aro-0.022
 hurried-0.022
 unf-0.022
line-0.022
Token my
Token ID616
Feature activation-1.25e-01

Pos logit contributions
ull+0.072
aro+0.072
 hurried+0.069
 unf+0.068
 below+0.068
Neg logit contributions
 tant-0.084
 de-0.084
 Ow-0.083
icer-0.082
 ABC-0.081
Token nose
Token ID9686
Feature activation-4.47e-01

Pos logit contributions
 mom+0.003
 Mom+0.002
 mum+0.002
 keeper+0.002
 next+0.002
Neg logit contributions
pill-0.004
 cris-0.004
rob-0.004
icer-0.004
 Ow-0.004
Token?"
Token ID1701
Feature activation-7.83e-01

Pos logit contributions
ull+0.008
line+0.008
 unf+0.007
 below+0.007
 soon+0.007
Neg logit contributions
 de-0.014
 topp-0.014
 Ow-0.014
 tant-0.014
 ABC-0.014
Token she
Token ID673
Feature activation+3.23e-01

Pos logit contributions
ull+0.012
aro+0.011
 hurried+0.011
 below+0.011
line+0.011
Neg logit contributions
 Ow-0.026
 topp-0.026
 de-0.026
 tant-0.026
 ABC-0.026
Token cried
Token ID16896
Feature activation-3.17e-01

Pos logit contributions
 de+0.012
 Ow+0.012
 tant+0.012
 topp+0.012
 ABC+0.012
Neg logit contributions
ull-0.004
aro-0.003
line-0.003
 below-0.003
 ranger-0.003
Token done
Token ID1760
Feature activation+5.53e-01

Pos logit contributions
ull+0.010
aro+0.010
 hurried+0.010
line+0.010
 below+0.009
Neg logit contributions
 Ow-0.021
 de-0.021
 tant-0.021
 topp-0.021
 ABC-0.020
Token by
Token ID416
Feature activation+3.53e-01

Pos logit contributions
 Ow+0.022
 tant+0.022
 de+0.022
icer+0.022
 topp+0.021
Neg logit contributions
ull-0.005
 soon-0.005
aro-0.004
line-0.004
 below-0.004
Token asking
Token ID4737
Feature activation+6.41e-01

Pos logit contributions
 tant+0.009
 de+0.009
 Ow+0.009
 license+0.009
 topp+0.009
Neg logit contributions
ull-0.008
aro-0.008
 unf-0.008
ful-0.008
lee-0.008
Token for
Token ID329
Feature activation+1.55e-04

Pos logit contributions
 tant+0.025
rob+0.025
icer+0.024
 Ow+0.024
pill+0.024
Neg logit contributions
 soon-0.008
 slowly-0.007
ful-0.007
 next-0.007
 below-0.007
Token help
Token ID1037
Feature activation-7.04e-01

Pos logit contributions
 de+0.000
 tant+0.000
icer+0.000
 Ow+0.000
 ABC+0.000
Neg logit contributions
ull-0.000
 soon-0.000
 unf-0.000
aro-0.000
 hurried-0.000
Token from
Token ID422
Feature activation-3.17e+00

Pos logit contributions
ull+0.007
 soon+0.006
ful+0.006
 below+0.006
line+0.006
Neg logit contributions
 de-0.028
 Ow-0.028
 tant-0.028
icer-0.028
 topp-0.027
Token others
Token ID1854
Feature activation+6.07e-01

Pos logit contributions
ull+0.063
aro+0.060
 below+0.059
 soon+0.059
 hurried+0.059
Neg logit contributions
 Ow-0.092
 de-0.092
 tant-0.091
 topp-0.090
 ABC-0.089
Token.
Token ID13
Feature activation-1.12e-01

Pos logit contributions
 Ow+0.025
 tant+0.025
icer+0.025
 de+0.025
 topp+0.024
Neg logit contributions
 soon-0.004
ull-0.004
 below-0.004
ful-0.004
line-0.004
Token And
Token ID843
Feature activation+3.78e-01

Pos logit contributions
ull+0.002
aro+0.002
Mom+0.002
 below+0.002
line+0.002
Neg logit contributions
 tant-0.004
 de-0.003
 topp-0.003
 Ow-0.003
 candy-0.003
Token sometimes
Token ID3360
Feature activation+2.44e-01

Pos logit contributions
 Ow+0.014
icer+0.014
 tant+0.014
 de+0.013
 topp+0.013
Neg logit contributions
 soon-0.005
ull-0.005
 below-0.005
aro-0.005
 decided-0.005
Token,
Token ID11
Feature activation+7.23e-02

Pos logit contributions
 Ow+0.008
 de+0.008
 tant+0.008
 topp+0.008
icer+0.008
Neg logit contributions
ull-0.004
aro-0.004
 hurried-0.004
 below-0.004
 unf-0.004
Token
Token ID220
Feature activation+3.85e-01

Pos logit contributions
ull+0.006
 hurried+0.006
aro+0.006
lee+0.005
line+0.005
Neg logit contributions
 tant-0.009
 Ow-0.009
 topp-0.009
 de-0.009
 ABC-0.009
Token
Token ID198
Feature activation+2.60e-01

Pos logit contributions
 tant+0.013
 price+0.013
 topp+0.013
 de+0.013
 candy+0.013
Neg logit contributions
ull-0.006
line-0.005
 below-0.005
Mom-0.005
aro-0.005
Token
Token ID198
Feature activation+2.03e-02

Pos logit contributions
 Ow+0.008
 de+0.008
 tant+0.008
 topp+0.008
 candy+0.007
Neg logit contributions
ull-0.005
aro-0.005
 hurried-0.005
line-0.005
 below-0.005
TokenIt
Token ID1026
Feature activation-1.73e-01

Pos logit contributions
 Ow+0.000
 de+0.000
 tant+0.000
 candy+0.000
 topp+0.000
Neg logit contributions
ull-0.000
lee-0.000
aro-0.000
 below-0.000
line-0.000
Token was
Token ID373
Feature activation+1.25e+00

Pos logit contributions
aro+0.004
line+0.004
ull+0.004
lee+0.004
 below+0.004
Neg logit contributions
 de-0.004
 Ow-0.004
 tant-0.004
 topp-0.004
 price-0.004
Token such
Token ID884
Feature activation+2.52e+00

Pos logit contributions
pill+0.036
 tant+0.036
 ABC+0.035
rob+0.035
 topp+0.034
Neg logit contributions
 soon-0.030
ull-0.029
 filled-0.028
 decided-0.028
 Mom-0.027
Token a
Token ID257
Feature activation+2.41e+00

Pos logit contributions
rob+0.092
icer+0.090
osaurus+0.085
 ABC+0.085
 tant+0.085
Neg logit contributions
 decided-0.051
 brave-0.051
 filled-0.049
 soon-0.046
 never-0.045
Token fun
Token ID1257
Feature activation+1.21e+00

Pos logit contributions
 de+0.050
 Ow+0.049
 tant+0.048
 candy+0.048
 topp+0.048
Neg logit contributions
ull-0.067
aro-0.066
 hurried-0.065
line-0.065
lee-0.065
Token day
Token ID1110
Feature activation+1.30e+00

Pos logit contributions
 Ow+0.042
icer+0.040
 tant+0.040
 de+0.039
 topp+0.039
Neg logit contributions
ull-0.020
 soon-0.018
 hurried-0.018
 ranger-0.017
 unf-0.017
Token of
Token ID286
Feature activation+5.60e-01

Pos logit contributions
 Ow+0.040
 de+0.040
 tant+0.039
 topp+0.039
 ABC+0.039
Neg logit contributions
ull-0.023
aro-0.022
 hurried-0.022
line-0.022
 below-0.022
Token swimming
Token ID14899
Feature activation+8.85e-01

Pos logit contributions
 Ow+0.016
 tant+0.016
 de+0.016
 topp+0.016
icer+0.016
Neg logit contributions
ull-0.011
aro-0.011
 hurried-0.011
 below-0.010
 unf-0.010
TokenThe
Token ID464
Feature activation+2.69e-01

Pos logit contributions
 Ow+0.003
 cris+0.003
 topp+0.003
 quality+0.003
icer+0.003
Neg logit contributions
aro-0.005
line-0.004
lee-0.004
 ranger-0.004
 below-0.004
Token children
Token ID1751
Feature activation+1.40e+00

Pos logit contributions
 de+0.007
 price+0.006
 candy+0.006
 Ow+0.006
 ABC+0.006
Neg logit contributions
ull-0.007
aro-0.007
line-0.007
lee-0.006
 below-0.006
Token played
Token ID2826
Feature activation+4.31e-01

Pos logit contributions
 de+0.043
 Ow+0.042
 topp+0.042
 candy+0.042
 tant+0.042
Neg logit contributions
ull-0.024
aro-0.023
line-0.022
lee-0.022
Mom-0.022
Token hide
Token ID7808
Feature activation+7.29e-01

Pos logit contributions
 Ow+0.014
 tant+0.014
icer+0.014
 de+0.014
 fault+0.014
Neg logit contributions
 soon-0.007
ull-0.007
 below-0.007
 slowly-0.007
 hurried-0.007
Token-
Token ID12
Feature activation-5.97e-01

Pos logit contributions
 Ow+0.020
 topp+0.019
 de+0.019
 ABC+0.019
 license+0.019
Neg logit contributions
ull-0.016
aro-0.016
 hurried-0.016
 below-0.016
 soon-0.016
Tokenand
Token ID392
Feature activation+2.89e+00

Pos logit contributions
ull+0.012
aro+0.012
line+0.012
 soon+0.011
ful+0.011
Neg logit contributions
 Ow-0.018
 de-0.017
 fault-0.017
icer-0.017
 price-0.017
Token-
Token ID12
Feature activation-7.22e-02

Pos logit contributions
 Ow+0.086
 tant+0.083
 de+0.083
icer+0.082
 topp+0.082
Neg logit contributions
ull-0.057
aro-0.057
 hurried-0.054
line-0.054
lee-0.054
Tokenseek
Token ID36163
Feature activation+2.24e-01

Pos logit contributions
aro+0.001
 soon+0.001
ull+0.001
line+0.001
ful+0.001
Neg logit contributions
 Ow-0.002
icer-0.002
 price-0.002
 tant-0.002
 de-0.002
Token in
Token ID287
Feature activation+8.33e-01

Pos logit contributions
 Ow+0.007
icer+0.007
 tant+0.007
 license+0.007
 price+0.007
Neg logit contributions
 soon-0.004
ull-0.004
 below-0.004
 unf-0.004
 hurried-0.004
Token the
Token ID262
Feature activation+5.64e-01

Pos logit contributions
 Ow+0.026
 de+0.026
 tant+0.025
 topp+0.025
 ABC+0.025
Neg logit contributions
ull-0.015
aro-0.014
 hurried-0.014
line-0.014
 below-0.014
Token barn
Token ID25203
Feature activation+6.61e-01

Pos logit contributions
 de+0.012
icer+0.011
 cris+0.011
 candy+0.011
 meat+0.011
Neg logit contributions
ull-0.018
aro-0.017
line-0.016
ful-0.016
orse-0.016
Token his
Token ID465
Feature activation+4.86e-01

Pos logit contributions
 Ow+0.010
 candy+0.009
 tant+0.009
 de+0.009
 meat+0.009
Neg logit contributions
aro-0.016
line-0.016
lee-0.016
 hurried-0.016
 below-0.016
Token knee
Token ID10329
Feature activation+6.19e-01

Pos logit contributions
pill+0.017
 topp+0.017
 cris+0.016
rob+0.016
icer+0.016
Neg logit contributions
ull-0.007
 unf-0.007
 mom-0.007
 Mom-0.006
Mom-0.006
Token.
Token ID13
Feature activation+2.95e-01

Pos logit contributions
 de+0.023
 tant+0.022
 Ow+0.022
 candy+0.022
 price+0.022
Neg logit contributions
ull-0.008
aro-0.008
lee-0.007
 hurried-0.007
line-0.007
Token He
Token ID679
Feature activation+3.78e-01

Pos logit contributions
 unnecessary+0.008
 topp+0.008
 de+0.008
 tant+0.008
 license+0.008
Neg logit contributions
ull-0.006
Mom-0.006
 ranger-0.006
 Mom-0.006
 hurried-0.005
Token cries
Token ID24691
Feature activation+1.19e-01

Pos logit contributions
 license+0.011
land+0.011
aman+0.011
 fault+0.011
 topp+0.011
Neg logit contributions
 ran-0.009
 could-0.009
 fell-0.008
 went-0.008
 decided-0.008
Token out
Token ID503
Feature activation+2.47e+00

Pos logit contributions
 Ow+0.003
 de+0.003
 tant+0.003
 candy+0.003
 ABC+0.003
Neg logit contributions
ull-0.003
aro-0.003
 hurried-0.002
line-0.002
lee-0.002
Token loud
Token ID7812
Feature activation-5.46e-01

Pos logit contributions
 tant+0.056
 de+0.056
 Ow+0.056
 candy+0.054
 price+0.053
Neg logit contributions
line-0.071
aro-0.065
orse-0.064
lee-0.064
ull-0.063
Token.
Token ID13
Feature activation+3.71e-01

Pos logit contributions
ull+0.004
 hurried+0.004
 below+0.004
 unf+0.004
 ran+0.004
Neg logit contributions
icer-0.022
 ABC-0.022
 Ow-0.022
 tant-0.022
 savings-0.022
Token
Token ID198
Feature activation+1.22e-01

Pos logit contributions
 de+0.011
 Ow+0.011
 tant+0.011
 topp+0.010
 candy+0.010
Neg logit contributions
ull-0.007
aro-0.007
 hurried-0.007
line-0.007
 below-0.007
Token
Token ID198
Feature activation-1.88e-01

Pos logit contributions
 Ow+0.003
 de+0.002
 ABC+0.002
 candy+0.002
 money+0.002
Neg logit contributions
line-0.004
 hurried-0.004
castle-0.004
aro-0.003
lee-0.003
Token"
Token ID1
Feature activation-8.07e-01

Pos logit contributions
line+0.004
lee+0.004
 below+0.004
castle+0.004
ull+0.004
Neg logit contributions
 Ow-0.005
 cris-0.005
 meat-0.005
icer-0.005
 chocolate-0.005
Token.
Token ID13
Feature activation+1.00e-02

Pos logit contributions
 candy+0.009
 de+0.009
 quality+0.009
 money+0.009
 fault+0.009
Neg logit contributions
ull-0.008
Mom-0.008
orse-0.007
aro-0.007
Dad-0.007
Token They
Token ID1119
Feature activation-5.32e-01

Pos logit contributions
 topp+0.000
icer+0.000
rob+0.000
 license+0.000
 tant+0.000
Neg logit contributions
lee-0.000
 Mom-0.000
 hurried-0.000
 decided-0.000
 could-0.000
Token played
Token ID2826
Feature activation+4.14e-01

Pos logit contributions
aro+0.010
ull+0.010
line+0.010
 keeper+0.010
 below+0.010
Neg logit contributions
 candy-0.015
 cris-0.015
 meat-0.015
 de-0.015
 topp-0.015
Token hide
Token ID7808
Feature activation+3.61e-01

Pos logit contributions
 de+0.011
 Ow+0.011
 candy+0.011
 topp+0.011
 tant+0.011
Neg logit contributions
ull-0.009
aro-0.009
lee-0.009
line-0.009
 hurried-0.008
Token-
Token ID12
Feature activation-8.57e-01

Pos logit contributions
 de+0.009
 Ow+0.009
 tant+0.008
 topp+0.008
 candy+0.008
Neg logit contributions
ull-0.009
aro-0.009
 hurried-0.009
line-0.009
 below-0.008
Tokenand
Token ID392
Feature activation+2.68e+00

Pos logit contributions
ull+0.020
aro+0.020
line+0.019
 hurried+0.019
 below+0.019
Neg logit contributions
 Ow-0.022
 de-0.022
 tant-0.021
 topp-0.021
icer-0.021
Token-
Token ID12
Feature activation-3.22e-01

Pos logit contributions
 de+0.065
 candy+0.064
 ABC+0.063
 tant+0.061
 topp+0.061
Neg logit contributions
ull-0.066
 hurried-0.064
 below-0.064
Mom-0.063
line-0.062
Tokenseek
Token ID36163
Feature activation+2.90e-01

Pos logit contributions
ull+0.008
aro+0.008
line+0.007
 soon+0.007
 hurried+0.007
Neg logit contributions
 Ow-0.008
 de-0.008
 tant-0.008
icer-0.008
 price-0.008
Token and
Token ID290
Feature activation+2.19e-01

Pos logit contributions
 de+0.007
 Ow+0.007
 tant+0.007
 topp+0.007
 candy+0.007
Neg logit contributions
ull-0.007
aro-0.007
 hurried-0.007
line-0.007
lee-0.007
Token jumped
Token ID11687
Feature activation+2.42e-02

Pos logit contributions
 candy+0.007
 topp+0.007
 de+0.007
 cris+0.007
 pret+0.007
Neg logit contributions
ull-0.004
lee-0.003
line-0.003
 below-0.003
aro-0.003
Token on
Token ID319
Feature activation+2.47e-01

Pos logit contributions
 Ow+0.001
 de+0.001
 tant+0.001
 topp+0.001
 ABC+0.001
Neg logit contributions
ull-0.000
aro-0.000
 hurried-0.000
 below-0.000
line-0.000
Token to
Token ID284
Feature activation+1.80e-01

Pos logit contributions
 de+0.025
 Ow+0.025
 tant+0.025
icer+0.025
 topp+0.024
Neg logit contributions
ull-0.012
 hurried-0.012
aro-0.012
 soon-0.011
 unf-0.011
Token taste
Token ID6938
Feature activation+9.51e-01

Pos logit contributions
 de+0.005
 Ow+0.005
 tant+0.005
 topp+0.005
 ABC+0.005
Neg logit contributions
ull-0.003
aro-0.003
 hurried-0.003
line-0.003
 unf-0.003
Token."
Token ID526
Feature activation-3.61e-01

Pos logit contributions
rob+0.034
 ABC+0.033
icer+0.033
 favorites+0.032
pill+0.032
Neg logit contributions
 soon-0.016
ful-0.015
 slowly-0.015
 decided-0.014
 below-0.014
Token
Token ID198
Feature activation-5.88e-02

Pos logit contributions
ull+0.008
aro+0.007
 hurried+0.007
 unf+0.007
 below+0.007
Neg logit contributions
 de-0.010
 Ow-0.010
 candy-0.010
 tant-0.009
 topp-0.009
Token
Token ID198
Feature activation-4.23e-01

Pos logit contributions
 hurried+0.001
line+0.001
ull+0.001
aro+0.001
 unf+0.001
Neg logit contributions
 Ow-0.002
 de-0.002
 tant-0.001
 ABC-0.001
icer-0.001
TokenP
Token ID47
Feature activation+2.55e+00

Pos logit contributions
ull+0.008
lee+0.008
 below+0.008
line+0.008
 hurried+0.008
Neg logit contributions
 Ow-0.012
 de-0.012
 tant-0.012
 license-0.011
 cris-0.011
Tokenolly
Token ID5098
Feature activation+1.15e+00

Pos logit contributions
 de+0.073
 price+0.072
 tant+0.072
 ABC+0.071
 Ow+0.071
Neg logit contributions
ull-0.050
lee-0.050
 unf-0.048
 hurried-0.048
Mom-0.048
Token thanked
Token ID26280
Feature activation+2.14e-03

Pos logit contributions
 de+0.037
 Ow+0.037
 tant+0.036
 topp+0.036
 candy+0.036
Neg logit contributions
ull-0.019
aro-0.019
line-0.018
 below-0.017
 hurried-0.017
Token the
Token ID262
Feature activation-6.67e-01

Pos logit contributions
 Ow+0.000
 de+0.000
 tant+0.000
 topp+0.000
 ABC+0.000
Neg logit contributions
ull-0.000
aro-0.000
 hurried-0.000
line-0.000
 below-0.000
Token friendly
Token ID8030
Feature activation-1.90e-01

Pos logit contributions
ull+0.016
aro+0.015
line+0.015
 hurried+0.015
lee+0.014
Neg logit contributions
 de-0.018
 Ow-0.017
 candy-0.017
 license-0.017
 tant-0.017
Token cat
Token ID3797
Feature activation-8.14e-01

Pos logit contributions
ull+0.004
aro+0.004
 hurried+0.004
line+0.004
 below+0.004
Neg logit contributions
 de-0.005
 Ow-0.005
 tant-0.005
 topp-0.005
 candy-0.005
Token until
Token ID1566
Feature activation-3.55e-01

Pos logit contributions
 hurried+0.007
 below+0.007
ull+0.007
 decided+0.007
line+0.007
Neg logit contributions
 tant-0.015
 Ow-0.015
icer-0.015
 topp-0.014
 ABC-0.014
Token he
Token ID339
Feature activation+5.24e-01

Pos logit contributions
lee+0.008
line+0.008
ull+0.008
aro+0.008
 ranger+0.007
Neg logit contributions
 candy-0.009
 Ow-0.009
 de-0.009
 beef-0.009
 money-0.009
Token realized
Token ID6939
Feature activation+1.19e-01

Pos logit contributions
 Ow+0.014
 de+0.013
 cris+0.013
 candy+0.013
 tant+0.012
Neg logit contributions
aro-0.013
line-0.011
 below-0.010
 ranger-0.010
jack-0.010
Token that
Token ID326
Feature activation-1.29e-01

Pos logit contributions
 tant+0.004
icer+0.004
 Ow+0.004
 ABC+0.004
 de+0.004
Neg logit contributions
ull-0.002
aro-0.001
 below-0.001
 hurried-0.001
 decided-0.001
Token he
Token ID339
Feature activation+9.06e-01

Pos logit contributions
ull+0.002
 below+0.002
 soon+0.002
aro+0.002
 decided+0.002
Neg logit contributions
 Ow-0.004
 tant-0.004
icer-0.004
 de-0.004
 ABC-0.004
Token had
Token ID550
Feature activation+6.29e-01

Pos logit contributions
 Ow+0.030
 de+0.030
 tant+0.029
 topp+0.029
 candy+0.029
Neg logit contributions
ull-0.014
aro-0.014
line-0.013
 hurried-0.013
 below-0.013
Token melted
Token ID24178
Feature activation+6.36e-02

Pos logit contributions
icer+0.020
 tant+0.020
 de+0.020
 ABC+0.019
 Ow+0.019
Neg logit contributions
ull-0.012
 hurried-0.011
 decided-0.011
 unf-0.011
 filled-0.011
Token the
Token ID262
Feature activation-5.63e-01

Pos logit contributions
pill+0.002
 ABC+0.002
 Ow+0.002
 Whe+0.002
 favorites+0.002
Neg logit contributions
 below-0.001
 soon-0.001
ull-0.001
 slowly-0.001
 unf-0.001
Token print
Token ID3601
Feature activation-8.74e-02

Pos logit contributions
ull+0.010
line+0.009
 unf+0.009
 below+0.009
 next+0.009
Neg logit contributions
 tant-0.018
 Ow-0.018
icer-0.017
pill-0.017
 novels-0.017
Token.
Token ID13
Feature activation-3.49e-01

Pos logit contributions
ull+0.001
 below+0.001
line+0.001
 unf+0.001
 hurried+0.001
Neg logit contributions
 Ow-0.003
 de-0.003
 tant-0.003
icer-0.003
 ABC-0.003
Token He
Token ID679
Feature activation+1.78e-01

Pos logit contributions
 hurried+0.007
 below+0.006
ull+0.006
 rushed+0.006
 decided+0.006
Neg logit contributions
 tant-0.011
icer-0.011
 topp-0.010
 fault-0.010
 unnecessary-0.010
Token
Token ID198
Feature activation-7.18e-01

Pos logit contributions
aro+0.009
 hurried+0.009
ull+0.009
line+0.009
lee+0.009
Neg logit contributions
 Ow-0.010
 de-0.010
 candy-0.009
 ABC-0.009
 license-0.009
TokenThey
Token ID2990
Feature activation-5.17e-01

Pos logit contributions
ull+0.014
aro+0.014
lee+0.013
 hurried+0.013
 rushed+0.012
Neg logit contributions
 de-0.022
 Ow-0.022
 candy-0.022
 cris-0.021
 license-0.021
Token eat
Token ID4483
Feature activation+1.02e+00

Pos logit contributions
ull+0.013
aro+0.013
lee+0.012
line+0.012
 hurried+0.011
Neg logit contributions
 candy-0.013
 de-0.013
 Ow-0.013
 cris-0.012
 topp-0.012
Token the
Token ID262
Feature activation+1.77e-01

Pos logit contributions
 unnecessary+0.038
pill+0.038
 ABC+0.038
 license+0.038
 fault+0.037
Neg logit contributions
 below-0.012
 slowly-0.012
 soon-0.012
 Mom-0.011
 they-0.010
Token banana
Token ID25996
Feature activation+1.14e+00

Pos logit contributions
 unnecessary+0.005
 Wed+0.005
land+0.005
aman+0.005
 fault+0.004
Neg logit contributions
 next-0.005
 tree-0.004
 keeper-0.004
 mom-0.004
 ranger-0.004
Token and
Token ID290
Feature activation+1.03e+00

Pos logit contributions
 Ow+0.035
 de+0.035
 tant+0.034
 topp+0.034
 ABC+0.034
Neg logit contributions
ull-0.021
aro-0.020
 hurried-0.019
line-0.019
 below-0.019
Token enjoy
Token ID2883
Feature activation+5.74e-01

Pos logit contributions
 Ow+0.031
icer+0.030
 tant+0.030
 de+0.030
 fault+0.030
Neg logit contributions
 hurried-0.019
 ran-0.019
 soon-0.019
 below-0.019
 unf-0.019
Token their
Token ID511
Feature activation+1.26e-02

Pos logit contributions
 candy+0.017
 de+0.016
 topp+0.016
 chocolate+0.016
 money+0.016
Neg logit contributions
ull-0.012
aro-0.012
 hurried-0.012
line-0.012
 unf-0.011
Token snack
Token ID26906
Feature activation+1.08e+00

Pos logit contributions
 tant+0.000
 Ow+0.000
 unnecessary+0.000
pill+0.000
land+0.000
Neg logit contributions
 next-0.000
 ranger-0.000
 keeper-0.000
 unf-0.000
 Mom-0.000
Token.
Token ID13
Feature activation+1.54e-01

Pos logit contributions
 candy+0.039
 topp+0.038
 de+0.037
 money+0.037
 cand+0.037
Neg logit contributions
aro-0.016
ull-0.016
lee-0.015
 hurried-0.015
 rushed-0.014
Token They
Token ID1119
Feature activation-8.67e-01

Pos logit contributions
 tant+0.004
icer+0.004
 Ow+0.004
 topp+0.004
 de+0.004
Neg logit contributions
ull-0.003
 below-0.003
 hurried-0.003
Mom-0.003
line-0.003
Token.
Token ID13
Feature activation+7.42e-02

Pos logit contributions
 Ow+0.016
 de+0.016
 tant+0.016
 topp+0.016
icer+0.016
Neg logit contributions
ull-0.004
line-0.003
aro-0.003
 hurried-0.003
 below-0.003
Token Panc
Token ID49957
Feature activation-8.78e-02

Pos logit contributions
 unnecessary+0.002
 tant+0.002
 de+0.002
 savings+0.002
 quality+0.002
Neg logit contributions
 Mom-0.002
Mom-0.002
ull-0.001
line-0.001
lee-0.001
Tokenakes
Token ID1124
Feature activation-1.63e-01

Pos logit contributions
ull+0.002
Mom+0.002
orse+0.002
lee+0.002
ful+0.002
Neg logit contributions
 tant-0.002
 price-0.002
 candy-0.002
 de-0.002
 unnecessary-0.002
Token were
Token ID547
Feature activation+4.68e-02

Pos logit contributions
 soon+0.002
 ran+0.002
aro+0.002
 could+0.002
 hurried+0.002
Neg logit contributions
 Ow-0.006
icer-0.006
 unnecessary-0.006
 tant-0.006
 license-0.006
Token his
Token ID465
Feature activation+7.88e-02

Pos logit contributions
 tant+0.001
 Ow+0.001
 unnecessary+0.001
 fault+0.001
 license+0.001
Neg logit contributions
 soon-0.001
 filled-0.001
ull-0.001
Mom-0.001
 hurried-0.001
Token favorite
Token ID4004
Feature activation+6.72e-01

Pos logit contributions
rob+0.003
 Ow+0.002
pill+0.002
 tant+0.002
 unnecessary+0.002
Neg logit contributions
 mom-0.002
 Mom-0.002
 mum-0.002
 family-0.002
 soon-0.002
Token food
Token ID2057
Feature activation+1.81e+00

Pos logit contributions
 unnecessary+0.021
 Ow+0.019
 fault+0.018
pill+0.018
rob+0.017
Neg logit contributions
 soon-0.016
.-0.016
 mom-0.015
 Mom-0.015
 slowly-0.015
Token in
Token ID287
Feature activation+4.88e-01

Pos logit contributions
 Ow+0.064
 tant+0.063
 unnecessary+0.061
icer+0.060
 ABC+0.060
Neg logit contributions
 soon-0.027
 ranger-0.026
ull-0.026
 filled-0.026
 could-0.025
Token the
Token ID262
Feature activation+3.04e-01

Pos logit contributions
pill+0.021
 fault+0.021
 unnecessary+0.021
 tant+0.020
 de+0.020
Neg logit contributions
 Mom-0.005
Mom-0.005
 soon-0.005
 filled-0.004
 mom-0.004
Token whole
Token ID2187
Feature activation-6.22e-01

Pos logit contributions
 Ow+0.009
 de+0.009
 fault+0.009
 topp+0.009
 tant+0.009
Neg logit contributions
Mom-0.006
line-0.006
 unf-0.006
 below-0.006
 filled-0.006
Token wide
Token ID3094
Feature activation-1.22e+00

Pos logit contributions
aro+0.019
ull+0.017
orse+0.016
 ranger+0.016
bled+0.016
Neg logit contributions
 price-0.014
icer-0.014
 candy-0.014
 chocolate-0.014
 license-0.013
Token said
Token ID531
Feature activation+9.18e-02

Pos logit contributions
 de+0.002
 candy+0.002
 license+0.002
 retri+0.002
 pret+0.002
Neg logit contributions
ull-0.001
Mom-0.001
line-0.001
 below-0.001
aro-0.001
Token,
Token ID11
Feature activation+3.12e-01

Pos logit contributions
 de+0.003
 candy+0.003
 tant+0.002
 ABC+0.002
 Ow+0.002
Neg logit contributions
ull-0.002
Mom-0.002
line-0.002
 hurried-0.002
 below-0.002
Token "
Token ID366
Feature activation+2.82e-02

Pos logit contributions
 de+0.010
 candy+0.009
 ABC+0.009
 Ow+0.009
 license+0.009
Neg logit contributions
ull-0.006
aro-0.006
lee-0.005
 unf-0.005
 hurried-0.005
TokenThank
Token ID10449
Feature activation+3.78e-01

Pos logit contributions
 Ow+0.001
 tant+0.001
 de+0.001
 fault+0.001
 topp+0.001
Neg logit contributions
ull-0.000
Mom-0.000
aro-0.000
 hurried-0.000
 decided-0.000
Token you
Token ID345
Feature activation+1.21e-01

Pos logit contributions
 Ow+0.009
 de+0.009
 candy+0.008
 license+0.008
 ABC+0.008
Neg logit contributions
ull-0.010
line-0.009
 hurried-0.009
lee-0.009
 below-0.009
Token,
Token ID11
Feature activation+1.29e+00

Pos logit contributions
 de+0.004
 Ow+0.003
 candy+0.003
 license+0.003
 ABC+0.003
Neg logit contributions
ull-0.002
aro-0.002
 hurried-0.002
 below-0.002
line-0.002
Token Spe
Token ID2531
Feature activation-1.96e+00

Pos logit contributions
 de+0.030
 Ow+0.029
 license+0.029
 candy+0.029
 price+0.028
Neg logit contributions
ull-0.034
 hurried-0.032
aro-0.032
 below-0.032
line-0.032
Tokenedy
Token ID4716
Feature activation-1.13e+00

Pos logit contributions
ull+0.058
aro+0.056
 hurried+0.056
line+0.056
 below+0.055
Neg logit contributions
 de-0.037
 Ow-0.037
 tant-0.036
 topp-0.036
 license-0.036
Token.
Token ID13
Feature activation+7.81e-01

Pos logit contributions
ull+0.018
aro+0.017
 hurried+0.017
 below+0.016
 unf+0.016
Neg logit contributions
 Ow-0.038
 tant-0.037
 topp-0.037
 de-0.037
icer-0.036
Token Let
Token ID3914
Feature activation+3.88e-01

Pos logit contributions
 Ow+0.018
 de+0.018
 ABC+0.017
 topp+0.017
 license+0.017
Neg logit contributions
ull-0.020
aro-0.019
 hurried-0.019
line-0.019
 below-0.019
Token's
Token ID338
Feature activation-6.07e-01

Pos logit contributions
 de+0.011
 topp+0.011
 tant+0.011
 Ow+0.011
 ABC+0.011
Neg logit contributions
ull-0.008
 hurried-0.008
 decided-0.008
 unf-0.007
lee-0.007
Token finished
Token ID5201
Feature activation+1.51e+00

Pos logit contributions
 Ow+0.005
 de+0.005
 tant+0.005
 candy+0.005
 ABC+0.005
Neg logit contributions
ull-0.003
aro-0.003
line-0.003
 below-0.003
lee-0.003
Token his
Token ID465
Feature activation+6.20e-01

Pos logit contributions
 Ow+0.041
 tant+0.041
 de+0.041
 price+0.040
 ABC+0.040
Neg logit contributions
ull-0.031
aro-0.031
line-0.031
lee-0.031
 hurried-0.031
Token tower
Token ID10580
Feature activation+9.52e-01

Pos logit contributions
 Ow+0.016
 topp+0.016
icer+0.015
 de+0.015
 cris+0.015
Neg logit contributions
 unf-0.014
 next-0.014
 ran-0.014
lee-0.014
 ranger-0.014
Token and
Token ID290
Feature activation+1.21e+00

Pos logit contributions
 de+0.022
 tant+0.021
 Ow+0.021
 candy+0.021
 topp+0.021
Neg logit contributions
ull-0.025
aro-0.024
lee-0.023
 hurried-0.023
line-0.023
Token looked
Token ID3114
Feature activation+7.64e-01

Pos logit contributions
 de+0.043
 Ow+0.043
 candy+0.042
 topp+0.042
 tant+0.042
Neg logit contributions
ull-0.016
aro-0.015
line-0.015
lee-0.014
 below-0.014
Token for
Token ID329
Feature activation+8.52e-01

Pos logit contributions
 de+0.023
 tant+0.022
 Ow+0.022
 candy+0.022
 topp+0.021
Neg logit contributions
ull-0.016
aro-0.015
 hurried-0.014
line-0.014
Mom-0.014
Token Spot
Token ID15899
Feature activation-3.21e-03

Pos logit contributions
 Ow+0.021
 candy+0.021
 de+0.020
 topp+0.020
 license+0.020
Neg logit contributions
ull-0.021
line-0.020
aro-0.020
lee-0.020
 hurried-0.020
Token.
Token ID13
Feature activation+7.70e-01

Pos logit contributions
ull+0.000
aro+0.000
lee+0.000
 hurried+0.000
line+0.000
Neg logit contributions
 candy-0.000
 de-0.000
 Ow-0.000
 license-0.000
 topp-0.000
Token He
Token ID679
Feature activation+2.33e-01

Pos logit contributions
 tant+0.024
icer+0.023
 topp+0.023
 ABC+0.023
rob+0.023
Neg logit contributions
 ranger-0.015
 Mom-0.014
 keeper-0.013
 could-0.013
 below-0.013
Token wanted
Token ID2227
Feature activation-6.08e-01

Pos logit contributions
 de+0.007
 candy+0.007
 Ow+0.007
 cris+0.007
 topp+0.007
Neg logit contributions
ull-0.004
line-0.004
aro-0.004
ful-0.004
 below-0.004
Token to
Token ID284
Feature activation-5.00e-02

Pos logit contributions
 soon+0.009
 ranger+0.009
 slowly+0.009
 Mom+0.009
ull+0.009
Neg logit contributions
 tant-0.021
icer-0.021
 de-0.021
 ABC-0.020
 Ow-0.020
Token was
Token ID373
Feature activation+4.23e-01

Pos logit contributions
ull+0.007
 could+0.007
 soon+0.006
 hurried+0.006
 decided+0.006
Neg logit contributions
 tant-0.014
 topp-0.014
 unnecessary-0.014
 de-0.014
 license-0.014
Token a
Token ID257
Feature activation-1.51e-02

Pos logit contributions
 tant+0.013
 costs+0.012
 ABC+0.012
 price+0.012
icer+0.012
Neg logit contributions
ull-0.009
Mom-0.009
 Mom-0.009
 filled-0.009
 soon-0.009
Token happy
Token ID3772
Feature activation-4.38e-01

Pos logit contributions
aro+0.001
ull+0.000
 hurried+0.000
line+0.000
lee+0.000
Neg logit contributions
 de-0.000
 candy-0.000
 license-0.000
 ABC-0.000
 price-0.000
Token planet
Token ID5440
Feature activation-2.86e-01

Pos logit contributions
ull+0.010
aro+0.009
 hurried+0.009
line+0.009
 below+0.009
Neg logit contributions
 Ow-0.012
 de-0.012
 tant-0.011
 topp-0.011
 candy-0.011
Token called
Token ID1444
Feature activation+3.48e-01

Pos logit contributions
lee+0.006
 hurried+0.006
line+0.006
 Jas+0.006
Mom+0.006
Neg logit contributions
 de-0.008
 candy-0.008
 license-0.007
 retri-0.007
 menu-0.007
Token Earth
Token ID3668
Feature activation-3.24e-01

Pos logit contributions
 de+0.008
 Ow+0.008
 license+0.008
 ABC+0.008
 candy+0.008
Neg logit contributions
ull-0.009
 hurried-0.008
aro-0.008
line-0.008
lee-0.008
Token.
Token ID13
Feature activation+5.51e-01

Pos logit contributions
line+0.005
 hurried+0.005
Mom+0.005
lee+0.005
 rushed+0.005
Neg logit contributions
 de-0.011
 ABC-0.011
 candy-0.010
 license-0.010
 Ow-0.010
Token The
Token ID383
Feature activation+4.13e-01

Pos logit contributions
 Ow+0.012
 de+0.012
 ABC+0.011
 topp+0.011
 candy+0.011
Neg logit contributions
aro-0.015
ull-0.015
line-0.015
lee-0.015
 hurried-0.015
Token planet
Token ID5440
Feature activation-4.83e-02

Pos logit contributions
 de+0.011
 ABC+0.010
 candy+0.010
 Ow+0.010
 license+0.010
Neg logit contributions
lee-0.010
 hurried-0.010
aro-0.010
 rushed-0.009
ful-0.009
Token had
Token ID550
Feature activation+3.81e-01

Pos logit contributions
ores+0.001
line+0.001
 Jas+0.001
ful+0.001
aro+0.001
Neg logit contributions
 de-0.001
 ABC-0.001
 candy-0.001
 menu-0.001
 costs-0.001
Token lots
Token ID6041
Feature activation+4.57e-01

Pos logit contributions
 Ow+0.007
 de+0.007
 candy+0.007
 ABC+0.007
 topp+0.007
Neg logit contributions
ull-0.011
aro-0.011
line-0.011
 hurried-0.011
lee-0.011
Token too
Token ID1165
Feature activation-1.16e+00

Pos logit contributions
 soon+0.012
 below+0.010
line+0.010
ull+0.010
 unf+0.010
Neg logit contributions
 Ow-0.034
icer-0.033
 topp-0.032
 cris-0.032
 ABC-0.032
Token.
Token ID13
Feature activation-4.25e-01

Pos logit contributions
ull+0.010
 below+0.009
 unf+0.009
 soon+0.009
line+0.009
Neg logit contributions
 Ow-0.047
 tant-0.046
icer-0.046
 topp-0.046
 de-0.045
Token<|endoftext|>
Token ID50256
Feature activation-8.94e-02

Pos logit contributions
ull+0.008
aro+0.008
 hurried+0.008
line+0.008
lee+0.008
Neg logit contributions
 tant-0.012
 Ow-0.012
 de-0.012
 topp-0.012
 ABC-0.012
TokenOnce
Token ID7454
Feature activation-3.79e-01

Pos logit contributions
ull+0.002
aro+0.002
 hurried+0.002
line+0.002
 below+0.002
Neg logit contributions
 Ow-0.003
 de-0.003
 tant-0.003
 topp-0.003
 candy-0.003
Token upon
Token ID2402
Feature activation+5.31e-01

Pos logit contributions
 hurried+0.015
aro+0.015
ull+0.015
 unf+0.015
line+0.015
Neg logit contributions
 de-0.003
 Ow-0.003
 tant-0.003
icer-0.003
 ABC-0.003
Token a
Token ID257
Feature activation-5.75e-02

Pos logit contributions
 tant+0.016
icer+0.016
 fault+0.015
 price+0.015
 topp+0.015
Neg logit contributions
ull-0.012
 soon-0.011
 decided-0.011
Mom-0.011
 filled-0.011
Token time
Token ID640
Feature activation+7.02e-01

Pos logit contributions
ull+0.002
 unf+0.002
Mom+0.002
 soon+0.002
 hurried+0.002
Neg logit contributions
 candy-0.001
 ABC-0.001
 Ow-0.001
 topp-0.001
 price-0.001
Token,
Token ID11
Feature activation-3.09e-01

Pos logit contributions
 tant+0.024
 topp+0.024
 Ow+0.024
 de+0.023
 price+0.023
Neg logit contributions
ull-0.010
aro-0.010
 hurried-0.009
line-0.009
 below-0.009
Token there
Token ID612
Feature activation-4.57e-01

Pos logit contributions
ull+0.002
 below+0.001
Mom+0.001
aro+0.001
 soon+0.001
Neg logit contributions
 topp-0.013
 tant-0.013
 ABC-0.013
 de-0.013
 candy-0.013
Token was
Token ID373
Feature activation+4.53e-01

Pos logit contributions
ull+0.008
 hurried+0.007
 could+0.007
 soon+0.007
 below+0.007
Neg logit contributions
 tant-0.014
 de-0.014
 topp-0.014
 license-0.014
 unnecessary-0.014
Token a
Token ID257
Feature activation-7.75e-02

Pos logit contributions
 tant+0.015
 costs+0.014
 ABC+0.014
land+0.014
 price+0.013
Neg logit contributions
ull-0.009
Mom-0.009
 Mom-0.009
 filled-0.009
 decided-0.009
Token looking
Token ID2045
Feature activation+1.14e+00

Pos logit contributions
ull+0.001
aro+0.001
 unf+0.001
lee+0.001
 below+0.001
Neg logit contributions
 tant-0.001
 Ow-0.001
 topp-0.001
icer-0.001
 ABC-0.001
Token for
Token ID329
Feature activation+9.58e-01

Pos logit contributions
rob+0.047
 topp+0.046
 ABC+0.046
icer+0.044
 caramel+0.042
Neg logit contributions
 below-0.015
.-0.014
 alone-0.013
 brave-0.013
 around-0.013
Token some
Token ID617
Feature activation-9.95e-01

Pos logit contributions
 tant+0.025
icer+0.024
 de+0.023
 ABC+0.022
 fault+0.022
Neg logit contributions
aro-0.024
 unf-0.024
 soon-0.023
lee-0.023
ull-0.023
Token cheese
Token ID9891
Feature activation-6.69e-02

Pos logit contributions
 named+0.023
 unf+0.023
 plant+0.023
ull+0.023
 brave+0.023
Neg logit contributions
icer-0.026
 Ow-0.025
 fault-0.025
 tant-0.024
rob-0.024
Token,"
Token ID553
Feature activation+9.76e-02

Pos logit contributions
ull+0.001
 soon+0.001
aro+0.001
 below+0.001
line+0.001
Neg logit contributions
 Ow-0.002
 tant-0.002
 ABC-0.002
 de-0.002
icer-0.002
Token replied
Token ID8712
Feature activation+1.03e-02

Pos logit contributions
 tant+0.004
 Ow+0.003
 topp+0.003
icer+0.003
 de+0.003
Neg logit contributions
ull-0.001
aro-0.001
 decided-0.001
 hurried-0.001
 ranger-0.001
Token S
Token ID311
Feature activation+5.83e-01

Pos logit contributions
 Ow+0.000
 de+0.000
 candy+0.000
 tant+0.000
 topp+0.000
Neg logit contributions
ull-0.000
 hurried-0.000
line-0.000
aro-0.000
 below-0.000
Tokenque
Token ID4188
Feature activation-2.74e-01

Pos logit contributions
 topp+0.015
 tant+0.014
 menu+0.014
 beef+0.014
 meat+0.014
Neg logit contributions
lee-0.014
ull-0.014
 unf-0.014
aro-0.014
 ranger-0.014
Tokenak
Token ID461
Feature activation+5.74e-01

Pos logit contributions
ull+0.005
aro+0.004
 hurried+0.004
line+0.004
 below+0.004
Neg logit contributions
 Ow-0.009
 de-0.009
 tant-0.009
 topp-0.008
 candy-0.008
Token.
Token ID13
Feature activation-6.53e-01

Pos logit contributions
 de+0.019
 candy+0.018
 topp+0.018
 tant+0.018
 Ow+0.018
Neg logit contributions
aro-0.009
ull-0.008
Mom-0.008
 hurried-0.008
line-0.008
Token
Token ID198
Feature activation-1.93e-01

Pos logit contributions
ull+0.012
aro+0.011
 hurried+0.011
line+0.011
 below+0.011
Neg logit contributions
 Ow-0.020
 de-0.020
 tant-0.020
 topp-0.019
 candy-0.019
Token fl
Token ID781
Feature activation+4.53e-01

Pos logit contributions
 Ow+0.009
icer+0.009
 unnecessary+0.009
 cris+0.008
rob+0.008
Neg logit contributions
ull-0.005
 unf-0.005
 ranger-0.005
 below-0.005
 keeper-0.005
Tokenute
Token ID1133
Feature activation+3.72e-01

Pos logit contributions
 Ow+0.010
 de+0.010
 tant+0.010
 topp+0.010
 candy+0.010
Neg logit contributions
ull-0.012
aro-0.011
line-0.011
 hurried-0.011
 below-0.011
Token.
Token ID13
Feature activation-1.37e-02

Pos logit contributions
 Ow+0.013
icer+0.013
 topp+0.013
 de+0.013
 tant+0.012
Neg logit contributions
 soon-0.005
ull-0.005
 below-0.005
 unf-0.005
line-0.005
Token The
Token ID383
Feature activation+4.56e-03

Pos logit contributions
ull+0.000
aro+0.000
 hurried+0.000
line+0.000
 below+0.000
Neg logit contributions
 de-0.000
 Ow-0.000
 tant-0.000
 topp-0.000
 candy-0.000
Token end
Token ID886
Feature activation+3.28e-01

Pos logit contributions
 Ow+0.000
 topp+0.000
 tant+0.000
icer+0.000
 cris+0.000
Neg logit contributions
ull-0.000
Mom-0.000
 unf-0.000
 soon-0.000
 decided-0.000
Token.
Token ID13
Feature activation+4.21e-01

Pos logit contributions
 de+0.011
 candy+0.011
 ABC+0.011
 license+0.011
 Ow+0.011
Neg logit contributions
line-0.005
aro-0.005
 hurried-0.005
ull-0.005
lee-0.005
Token<|endoftext|>
Token ID50256
Feature activation+8.39e-02

Pos logit contributions
 tant+0.013
 Ow+0.013
 topp+0.013
 de+0.013
 ABC+0.013
Neg logit contributions
ull-0.008
aro-0.007
 hurried-0.007
line-0.007
 below-0.007
TokenOnce
Token ID7454
Feature activation-9.40e-02

Pos logit contributions
 Ow+0.002
 de+0.002
 tant+0.002
 topp+0.002
 candy+0.002
Neg logit contributions
ull-0.002
aro-0.002
 hurried-0.002
line-0.002
lee-0.002
Token upon
Token ID2402
Feature activation+7.70e-01

Pos logit contributions
 hurried+0.004
aro+0.004
ull+0.004
 unf+0.004
line+0.004
Neg logit contributions
 de-0.001
 Ow-0.001
 tant-0.001
icer-0.001
 ABC-0.001
Token a
Token ID257
Feature activation+5.67e-02

Pos logit contributions
 tant+0.021
icer+0.021
 fault+0.020
 topp+0.020
 savings+0.020
Neg logit contributions
ull-0.018
 soon-0.017
Mom-0.017
 decided-0.017
 filled-0.017
Token time
Token ID640
Feature activation+7.42e-01

Pos logit contributions
 Ow+0.001
 candy+0.001
 topp+0.001
 tant+0.001
 ABC+0.001
Neg logit contributions
ull-0.002
Mom-0.002
 unf-0.002
 hurried-0.002
 soon-0.002
Token a
Token ID257
Feature activation-2.47e-01

Pos logit contributions
ull+0.004
aro+0.004
 hurried+0.004
line+0.004
 unf+0.004
Neg logit contributions
 Ow-0.003
 de-0.003
 tant-0.003
 topp-0.003
 ABC-0.003
Token happy
Token ID3772
Feature activation-5.00e-01

Pos logit contributions
ull+0.005
 unf+0.004
 ran+0.004
 mo+0.004
 hurried+0.004
Neg logit contributions
 Ow-0.008
icer-0.007
 de-0.007
 topp-0.007
 fault-0.007
Token heart
Token ID2612
Feature activation-9.76e-01

Pos logit contributions
ull+0.009
 hurried+0.009
 unf+0.008
 ran+0.008
 rushed+0.008
Neg logit contributions
icer-0.017
 Ow-0.017
pill-0.016
 topp-0.016
 fault-0.016
Token and
Token ID290
Feature activation-1.86e-01

Pos logit contributions
ull+0.018
aro+0.017
 hurried+0.017
line+0.017
 below+0.017
Neg logit contributions
 Ow-0.030
 de-0.029
 tant-0.029
 topp-0.029
icer-0.028
Token was
Token ID373
Feature activation-8.55e-01

Pos logit contributions
ull+0.003
 hurried+0.003
aro+0.002
line+0.002
 soon+0.002
Neg logit contributions
 Ow-0.007
 tant-0.006
 de-0.006
icer-0.006
 topp-0.006
Token excited
Token ID6568
Feature activation+7.67e-02

Pos logit contributions
aro+0.020
ull+0.019
line+0.019
lee+0.018
Mom+0.018
Neg logit contributions
 de-0.022
 candy-0.021
 ABC-0.021
 topp-0.021
 license-0.021
Token to
Token ID284
Feature activation-2.43e-01

Pos logit contributions
 Ow+0.002
 tant+0.002
 de+0.002
icer+0.002
 topp+0.002
Neg logit contributions
ull-0.001
 hurried-0.001
line-0.001
 below-0.001
lee-0.001
Token discover
Token ID7073
Feature activation+5.48e-01

Pos logit contributions
aro+0.005
ull+0.005
 ranger+0.005
Mom+0.005
ful+0.005
Neg logit contributions
 ABC-0.007
 candy-0.006
 money-0.006
 license-0.006
 price-0.006
Token new
Token ID649
Feature activation+1.77e-01

Pos logit contributions
 Ow+0.013
 tant+0.013
 topp+0.013
 de+0.013
icer+0.013
Neg logit contributions
ull-0.014
 below-0.013
aro-0.013
 unf-0.013
line-0.013
Token things
Token ID1243
Feature activation-3.55e-01

Pos logit contributions
 Ow+0.004
icer+0.004
 topp+0.004
 de+0.004
 tant+0.004
Neg logit contributions
ull-0.005
 below-0.005
lee-0.005
 decided-0.005
 soon-0.005
Token.
Token ID13
Feature activation-2.68e-01

Pos logit contributions
ull+0.004
 hurried+0.004
aro+0.004
lee+0.004
Mom+0.004
Neg logit contributions
 de-0.013
 candy-0.013
 topp-0.013
 license-0.013
 Ow-0.013
Token was
Token ID373
Feature activation+3.90e-01

Pos logit contributions
ull+0.008
 hurried+0.008
 could+0.008
 soon+0.008
 decided+0.008
Neg logit contributions
 tant-0.015
 de-0.015
 topp-0.015
 unnecessary-0.015
 license-0.015
Token a
Token ID257
Feature activation-2.89e-02

Pos logit contributions
 tant+0.014
 unnecessary+0.013
 costs+0.012
 fault+0.012
 topp+0.012
Neg logit contributions
 Mom-0.008
Mom-0.008
 filled-0.007
ull-0.007
 named-0.007
Token little
Token ID1310
Feature activation+3.11e-01

Pos logit contributions
aro+0.001
ull+0.001
 hurried+0.001
lee+0.001
line+0.001
Neg logit contributions
 de-0.001
 candy-0.001
 license-0.000
 price-0.000
 ABC-0.000
Token girl
Token ID2576
Feature activation-9.09e-02

Pos logit contributions
 de+0.008
 candy+0.007
 Ow+0.007
 tant+0.007
 license+0.007
Neg logit contributions
ull-0.008
aro-0.008
 hurried-0.008
line-0.007
lee-0.007
Token named
Token ID3706
Feature activation-2.11e-01

Pos logit contributions
bled+0.003
line+0.003
castle+0.003
ipples+0.003
ths+0.003
Neg logit contributions
 chocolate-0.002
 de-0.002
 cris-0.002
 candy-0.002
 newspaper-0.002
Token Lily
Token ID20037
Feature activation-1.63e+00

Pos logit contributions
ull+0.007
aro+0.006
orse+0.006
castle+0.006
line+0.006
Neg logit contributions
 Ow-0.005
 candy-0.004
 de-0.004
 chocolate-0.004
 cris-0.004
Token.
Token ID13
Feature activation+5.32e-01

Pos logit contributions
aro+0.047
bled+0.044
ths+0.041
castle+0.040
line+0.040
Neg logit contributions
 de-0.042
 chocolate-0.037
 candy-0.036
 newspaper-0.036
 quality-0.036
Token She
Token ID1375
Feature activation-3.87e-01

Pos logit contributions
 Ow+0.012
 de+0.012
 candy+0.012
 ABC+0.011
 tant+0.011
Neg logit contributions
aro-0.015
ull-0.015
orse-0.014
 hurried-0.014
 unf-0.014
Token loved
Token ID6151
Feature activation-4.47e-01

Pos logit contributions
aro+0.010
line+0.010
 below+0.009
ull+0.009
lee+0.009
Neg logit contributions
 de-0.010
 candy-0.010
 ABC-0.009
 Ow-0.009
 cris-0.009
Token playing
Token ID2712
Feature activation+9.71e-01

Pos logit contributions
ull+0.008
 hurried+0.008
lee+0.008
 unf+0.007
aro+0.007
Neg logit contributions
 tant-0.014
 Ow-0.014
icer-0.014
 de-0.014
 license-0.013
Token in
Token ID287
Feature activation+5.19e-01

Pos logit contributions
 Ow+0.029
 candy+0.029
 tant+0.029
 de+0.028
 topp+0.028
Neg logit contributions
ull-0.019
aro-0.018
lee-0.017
 hurried-0.017
line-0.017
Token,
Token ID11
Feature activation-2.51e-02

Pos logit contributions
 topp+0.016
 Ow+0.016
 tant+0.016
icer+0.016
 de+0.016
Neg logit contributions
aro-0.005
 soon-0.005
 hurried-0.005
lee-0.005
 ran-0.005
Token Lily
Token ID20037
Feature activation-1.05e+00

Pos logit contributions
 soon+0.000
 decided+0.000
Mom+0.000
 hurried+0.000
ull+0.000
Neg logit contributions
 Ow-0.001
 topp-0.001
icer-0.001
 license-0.001
 ABC-0.001
Token loved
Token ID6151
Feature activation-4.22e-02

Pos logit contributions
ull+0.018
aro+0.018
line+0.017
 hurried+0.017
 below+0.017
Neg logit contributions
 de-0.032
 Ow-0.032
 tant-0.032
 topp-0.032
 ABC-0.032
Token to
Token ID284
Feature activation+5.36e-01

Pos logit contributions
ull+0.001
 below+0.001
lee+0.001
 hurried+0.001
 soon+0.001
Neg logit contributions
icer-0.001
 Ow-0.001
 de-0.001
 tant-0.001
 cris-0.001
Token call
Token ID869
Feature activation+8.35e-01

Pos logit contributions
 Ow+0.015
 de+0.015
 tant+0.015
 topp+0.015
 candy+0.015
Neg logit contributions
ull-0.011
aro-0.011
 hurried-0.010
line-0.010
 below-0.010
Token her
Token ID607
Feature activation-1.87e+00

Pos logit contributions
icer+0.029
 topp+0.028
pill+0.028
rob+0.028
 Ow+0.028
Neg logit contributions
 Mom-0.013
 mom-0.013
 slowly-0.013
 soon-0.013
 below-0.012
Token friends
Token ID2460
Feature activation+8.43e-01

Pos logit contributions
ull+0.028
aro+0.027
 hurried+0.025
line+0.025
 below+0.024
Neg logit contributions
 candy-0.065
 de-0.065
 license-0.064
 tant-0.064
 Ow-0.064
Token on
Token ID319
Feature activation-5.35e-01

Pos logit contributions
 Ow+0.030
icer+0.029
 topp+0.028
 de+0.028
 tant+0.028
Neg logit contributions
 soon-0.013
 below-0.013
line-0.013
ful-0.013
 ranger-0.012
Token her
Token ID607
Feature activation+1.14e-02

Pos logit contributions
 soon+0.010
 slowly+0.010
 Mom+0.009
 mom+0.009
 Jas+0.009
Neg logit contributions
 Ow-0.018
icer-0.017
 topp-0.017
 cris-0.017
 de-0.017
Token toy
Token ID13373
Feature activation+1.93e-01

Pos logit contributions
 de+0.000
 Ow+0.000
 tant+0.000
 topp+0.000
 candy+0.000
Neg logit contributions
ull-0.000
aro-0.000
 hurried-0.000
line-0.000
 below-0.000
Token phone
Token ID3072
Feature activation+3.34e-01

Pos logit contributions
 candy+0.004
 license+0.004
 tant+0.004
 newspaper+0.004
 money+0.004
Neg logit contributions
ull-0.005
aro-0.005
orse-0.005
 hurried-0.005
lee-0.005
Token a
Token ID257
Feature activation+2.15e-01

Pos logit contributions
 decided+0.009
 soon+0.008
ull+0.008
 ran+0.008
 unf+0.008
Neg logit contributions
icer-0.012
 Ow-0.011
 tant-0.011
 ABC-0.011
 de-0.011
Token car
Token ID1097
Feature activation-8.02e-01

Pos logit contributions
 Ow+0.006
icer+0.006
 de+0.006
 topp+0.005
 cris+0.005
Neg logit contributions
 ranger-0.005
ull-0.005
 unf-0.005
 soon-0.005
 keeper-0.005
Token with
Token ID351
Feature activation-1.57e+00

Pos logit contributions
ull+0.011
line+0.011
 below+0.010
 ranger+0.010
 soon+0.010
Neg logit contributions
 Ow-0.029
 de-0.028
icer-0.028
 tant-0.028
 topp-0.028
Token a
Token ID257
Feature activation-1.24e-01

Pos logit contributions
 soon+0.033
 below+0.031
 slowly+0.031
 decided+0.031
 unf+0.030
Neg logit contributions
icer-0.049
 Ow-0.047
 de-0.044
 ABC-0.044
rob-0.044
Token really
Token ID1107
Feature activation+3.33e-01

Pos logit contributions
ull+0.003
 unf+0.003
 ranger+0.003
 below+0.003
line+0.003
Neg logit contributions
 Ow-0.003
icer-0.003
 de-0.003
 cris-0.003
 topp-0.003
Token large
Token ID1588
Feature activation-1.57e+00

Pos logit contributions
 Ow+0.009
icer+0.008
 de+0.008
 topp+0.008
 tant+0.008
Neg logit contributions
ull-0.008
aro-0.007
 below-0.007
 hurried-0.007
 unf-0.007
Token tire
Token ID15867
Feature activation-8.56e-01

Pos logit contributions
 unf+0.030
 hurried+0.030
 ran+0.030
ull+0.029
 rushed+0.029
Neg logit contributions
 Ow-0.047
icer-0.045
pill-0.044
 cris-0.043
 unnecessary-0.043
Token.
Token ID13
Feature activation-4.68e-01

Pos logit contributions
ull+0.010
aro+0.009
 hurried+0.009
lee+0.009
line+0.009
Neg logit contributions
 de-0.032
 Ow-0.031
 tant-0.031
 topp-0.031
 candy-0.031
Token He
Token ID679
Feature activation-2.66e-02

Pos logit contributions
 Mom+0.010
 hurried+0.010
lee+0.009
Mom+0.009
ull+0.009
Neg logit contributions
 tant-0.013
 unnecessary-0.013
icer-0.013
rob-0.013
 topp-0.013
Token pretended
Token ID36766
Feature activation-3.97e-01

Pos logit contributions
ull+0.000
 hurried+0.000
aro+0.000
 unf+0.000
lee+0.000
Neg logit contributions
 Ow-0.001
 de-0.001
 tant-0.001
 topp-0.001
 candy-0.001
Token he
Token ID339
Feature activation+9.40e-01

Pos logit contributions
ull+0.005
 below+0.005
aro+0.005
 soon+0.005
 could+0.005
Neg logit contributions
icer-0.015
 Ow-0.015
 tant-0.014
 de-0.014
 topp-0.014
Token was
Token ID373
Feature activation+3.40e-01

Pos logit contributions
 could+0.009
 soon+0.009
ull+0.009
 filled+0.009
 decided+0.009
Neg logit contributions
 Ow-0.022
 unnecessary-0.022
icer-0.022
 tant-0.021
 topp-0.020
Token too
Token ID1165
Feature activation-6.66e-01

Pos logit contributions
 Ow+0.011
icer+0.010
 tant+0.010
rob+0.010
 topp+0.010
Neg logit contributions
 soon-0.007
 filled-0.007
 decided-0.007
ull-0.006
 hurried-0.006
Token expensive
Token ID5789
Feature activation+3.12e-01

Pos logit contributions
 soon+0.013
 below+0.013
ull+0.013
 unf+0.012
 decided+0.012
Neg logit contributions
 Ow-0.020
icer-0.020
 topp-0.020
 tant-0.019
rob-0.019
Token,
Token ID11
Feature activation+5.53e-01

Pos logit contributions
 Ow+0.013
 tant+0.013
icer+0.013
 ABC+0.012
 topp+0.012
Neg logit contributions
 soon-0.004
 below-0.003
line-0.003
 ranger-0.003
ful-0.003
Token but
Token ID475
Feature activation+1.51e+00

Pos logit contributions
icer+0.021
 Ow+0.021
 topp+0.019
rob+0.019
 tant+0.019
Neg logit contributions
 soon-0.011
 they-0.010
 decided-0.010
 slowly-0.009
 ran-0.009
Token Car
Token ID1879
Feature activation-1.92e+00

Pos logit contributions
 Ow+0.051
 topp+0.050
icer+0.049
 tant+0.049
 cris+0.048
Neg logit contributions
 soon-0.026
 hurried-0.025
line-0.025
 decided-0.025
 below-0.024
Tokenla
Token ID5031
Feature activation-1.21e+00

Pos logit contributions
ull+0.047
 hurried+0.045
aro+0.045
lee+0.045
line+0.044
Neg logit contributions
 Ow-0.047
 tant-0.047
 topp-0.046
 de-0.046
 candy-0.046
Token wouldn
Token ID3636
Feature activation+9.48e-01

Pos logit contributions
ull+0.018
aro+0.017
 hurried+0.017
line+0.016
 below+0.016
Neg logit contributions
 Ow-0.041
 de-0.041
 tant-0.040
 topp-0.040
 candy-0.040
Token't
Token ID470
Feature activation+2.85e-01

Pos logit contributions
 Ow+0.024
 de+0.023
 ABC+0.023
 tant+0.023
 license+0.023
Neg logit contributions
ull-0.022
aro-0.021
 hurried-0.021
lee-0.021
 ranger-0.021
Token take
Token ID1011
Feature activation+4.20e-01

Pos logit contributions
 candy+0.008
 price+0.007
 money+0.007
 license+0.007
 de+0.007
Neg logit contributions
ull-0.007
aro-0.007
lee-0.006
Mom-0.006
 hurried-0.006
Token no
Token ID645
Feature activation+6.22e-01

Pos logit contributions
rob+0.016
 topp+0.016
icer+0.016
pill+0.016
 Ow+0.016
Neg logit contributions
 soon-0.005
 slowly-0.005
 Mom-0.004
ful-0.004
 ran-0.004
Token lived
Token ID5615
Feature activation-1.29e-01

Pos logit contributions
ull+0.015
aro+0.014
 below+0.013
line+0.013
lee+0.013
Neg logit contributions
 de-0.022
 license-0.020
 candy-0.020
 cris-0.020
 ABC-0.020
Token in
Token ID287
Feature activation+8.74e-01

Pos logit contributions
lee+0.002
aro+0.002
ull+0.002
orse+0.002
line+0.002
Neg logit contributions
 de-0.004
 license-0.004
 candy-0.004
 Ow-0.004
 ABC-0.004
Token a
Token ID257
Feature activation-5.47e-01

Pos logit contributions
 tant+0.027
 Ow+0.027
 topp+0.026
 fault+0.026
pill+0.025
Neg logit contributions
 Mom-0.019
 soon-0.018
Mom-0.017
 filled-0.017
 mom-0.017
Token big
Token ID1263
Feature activation-1.06e+00

Pos logit contributions
ull+0.017
aro+0.017
line+0.015
ful+0.015
lee+0.015
Neg logit contributions
 license-0.011
 de-0.011
 tant-0.011
 candy-0.011
 price-0.011
Token city
Token ID1748
Feature activation-7.97e-02

Pos logit contributions
ull+0.025
aro+0.024
line+0.023
 hurried+0.023
 below+0.023
Neg logit contributions
 de-0.028
 tant-0.027
 candy-0.027
 license-0.027
icer-0.027
Token with
Token ID351
Feature activation-1.56e+00

Pos logit contributions
ull+0.002
aro+0.002
lee+0.002
 hurried+0.002
Mom+0.002
Neg logit contributions
 de-0.002
 ABC-0.002
 license-0.002
 candy-0.002
 topp-0.002
Token many
Token ID867
Feature activation-1.04e+00

Pos logit contributions
ull+0.042
aro+0.040
 hurried+0.040
line+0.040
 below+0.039
Neg logit contributions
 Ow-0.034
 de-0.034
 tant-0.034
 topp-0.033
 candy-0.033
Token people
Token ID661
Feature activation-2.83e-01

Pos logit contributions
ull+0.026
aro+0.025
line+0.025
 hurried+0.025
 unf+0.024
Neg logit contributions
 de-0.025
 tant-0.025
 candy-0.024
 ABC-0.024
 topp-0.024
Token and
Token ID290
Feature activation-2.67e-01

Pos logit contributions
ull+0.005
 hurried+0.005
Mom+0.005
aro+0.005
 unf+0.005
Neg logit contributions
 de-0.009
 candy-0.008
 license-0.008
 ABC-0.008
 money-0.008
Token buildings
Token ID6832
Feature activation-4.56e-01

Pos logit contributions
 decided+0.005
 ran+0.005
 soon+0.005
 hurried+0.005
 rushed+0.005
Neg logit contributions
icer-0.009
 Ow-0.009
 tant-0.008
 retri-0.008
 fault-0.008
Token.
Token ID13
Feature activation+9.51e-02

Pos logit contributions
ull+0.007
aro+0.007
Mom+0.006
lee+0.006
 hurried+0.006
Neg logit contributions
 de-0.015
 candy-0.015
 ABC-0.015
 license-0.015
 money-0.015