TOP ACTIVATIONS
MAX = 0.053

was too high and he wasn 't sure how he could
was too heavy and she wasn 't strong enough .
he felt afraid . He wasn 't sure what to do
Rose started to feel a little afraid . She didn 't
was big and one was little . The little human wanted
the top , but he wasn 't quite tall enough .
brave and daring but he wasn 't sure what it was
eddy bears , but he wasn 't sure what to do
do something fun but he wasn â t sure
she was worried that she wasn 't good enough .
an adventure , but she wasn 't sure which way to
she started to feel a little bit hurt in her tum
and curious , but he wasn 't ready to touch it
, but he felt a little scared . Just then ,
, twelve , thirteen , fourteen , fifteen , sixteen ,
Yes , but only a little . The oats are for
liked to explore but she wasn 't an adventurous swim mer
so loud , but he wasn 't scared . Instead ,
to slow down . He wasn 't as fast as before
wanted to resist . She wasn 't sure if she was

MIN ACTIVATIONS
MIN = -0.053

hugs Ben . She shares the blanket . She forg ives
" Maybe we can share the ball with him . Maybe
to their house and shared the candy . They also shared
Mom my , what is that ?" His mom replied ,
asked , " What is that well mom my ?". His
owl and decided to share the metal toy . They played
the mess and to share the music . She says they
outside to play with her yellow ball . She threw it
my , " What is that ?" "
Sam , " What is that orange ball in the sky
friend , " What 's that big ball up there ?"
to the park with his red ball . At the park
father , " What is that ?" His father
went home and remembered the bright plastic ball . She looked
The boy shared the ball with his friends .
can I play with your yellow ball too ?" Tom asked
" Maybe we can share the clay . We can each
all played together with the shiny trophy . From that day
mom and decided to share the toy . They played together
selfish . You should share the ball with me . We

INTERVAL 0.026 - 0.053
CONTAINS 0.422%

, " Let 's all run back to the house !"
could not stop . They were going too fast . They
living room . Lily tried her best , but she was
Lily noticed a nos y little monkey pe eking through the
, Jack set off to explore the world . He came

INTERVAL -0.000 - 0.026
CONTAINS 50.040%

mom and said , " Look , Mom my ! I
her pictures . Mia showed L ila her pictures . They
As she jumped into the water , she saw a deaf
more ice cream . They made chocolate ice cream . They
when she was outside she saw a big b erry .

INTERVAL -0.026 - -0.000
CONTAINS 49.222%

. He saw Sarah and took all the chalk away .
" Sure , I guess . Where is she ?"
letter came in the mail . As the silly house opened
. They took her home and put a band age on
ladder , the bell would ring . The little

INTERVAL -0.053 - -0.026
CONTAINS 0.316%

" Max passed the test ! He is a good dog
dinosaur . I will share my toys and say nice words
She went to bed that night , full of o at
She sees nothing but a white sheet . She feels sad
. Maybe we can share our dreams later . Let 's
Token was
Token ID373
Feature activation+1.62e-02

Pos logit contributions
 soon+0.008
 then+0.007
 stuck+0.006
 trapped+0.005
 suddenly+0.005
Neg logit contributions
inner-0.013
board-0.013
 similarities-0.012
ag-0.012
oker-0.012
Token too
Token ID1165
Feature activation+1.19e-02

Pos logit contributions
 soon+0.016
 then+0.013
 least+0.011
 Then+0.011
 able+0.011
Neg logit contributions
board-0.036
inner-0.036
ag-0.035
 medal-0.034
ool-0.034
Token high
Token ID1029
Feature activation+4.85e-03

Pos logit contributions
 soon+0.018
 then+0.017
 least+0.016
 able+0.015
 Then+0.015
Neg logit contributions
inner-0.021
board-0.019
ade-0.019
ork-0.019
ool-0.018
Token and
Token ID290
Feature activation+2.02e-02

Pos logit contributions
 soon+0.006
 then+0.006
 least+0.006
 afraid+0.006
 able+0.006
Neg logit contributions
board-0.009
inner-0.009
ag-0.008
anger-0.008
 guys-0.008
Token he
Token ID339
Feature activation+8.34e-03

Pos logit contributions
 soon+0.029
 then+0.025
 able+0.022
 least+0.022
 stuck+0.021
Neg logit contributions
board-0.039
inner-0.039
ag-0.036
ork-0.035
 medal-0.035
Token wasn
Token ID2492
Feature activation+5.27e-02

Pos logit contributions
 soon+0.010
 then+0.009
 stuck+0.007
 able+0.007
 least+0.007
Neg logit contributions
inner-0.019
board-0.018
ag-0.017
oker-0.017
 pleasures-0.017
Token't
Token ID470
Feature activation+1.10e-02

Pos logit contributions
 soon+0.071
 then+0.065
 Then+0.062
 hoping+0.060
 least+0.059
Neg logit contributions
inner-0.094
ag-0.086
ork-0.086
 medal-0.085
board-0.085
Token sure
Token ID1654
Feature activation-1.27e-02

Pos logit contributions
 soon+0.011
 then+0.010
 least+0.008
 able+0.008
 Then+0.007
Neg logit contributions
board-0.025
inner-0.025
ag-0.024
 medal-0.023
ork-0.023
Token how
Token ID703
Feature activation-1.24e-02

Pos logit contributions
board+0.029
inner+0.029
ag+0.028
ork+0.027
ade+0.027
Neg logit contributions
 soon-0.012
 then-0.010
 least-0.009
 able-0.008
 trapped-0.008
Token he
Token ID339
Feature activation-7.49e-03

Pos logit contributions
board+0.026
inner+0.026
ag+0.025
ork+0.024
ade+0.024
Neg logit contributions
 soon-0.014
 then-0.013
 least-0.011
 able-0.011
 hoping-0.010
Token could
Token ID714
Feature activation+8.54e-03

Pos logit contributions
board+0.015
inner+0.015
ag+0.014
ork+0.014
anger+0.014
Neg logit contributions
 soon-0.010
 then-0.009
 able-0.007
 least-0.007
 stuck-0.007
Token was
Token ID373
Feature activation+1.35e-02

Pos logit contributions
 soon+0.009
 then+0.008
 able+0.007
 stuck+0.007
 least+0.007
Neg logit contributions
inner-0.013
board-0.013
ag-0.012
affle-0.011
oker-0.011
Token too
Token ID1165
Feature activation+7.97e-03

Pos logit contributions
 soon+0.013
 then+0.011
 least+0.010
 Then+0.009
 able+0.009
Neg logit contributions
inner-0.029
board-0.029
ag-0.028
ool-0.028
 medal-0.028
Token heavy
Token ID4334
Feature activation+9.20e-03

Pos logit contributions
 soon+0.009
 then+0.008
 least+0.007
 able+0.007
 hoping+0.006
Neg logit contributions
inner-0.018
board-0.017
ork-0.016
ade-0.016
ool-0.016
Token and
Token ID290
Feature activation+1.90e-02

Pos logit contributions
 soon+0.012
 then+0.011
 least+0.010
 Amen+0.009
 able+0.009
Neg logit contributions
board-0.018
inner-0.017
ag-0.016
anger-0.016
ool-0.016
Token she
Token ID673
Feature activation+1.57e-02

Pos logit contributions
 soon+0.023
 then+0.020
 least+0.018
 able+0.018
 hoping+0.016
Neg logit contributions
board-0.039
inner-0.039
ag-0.036
ork-0.036
 medal-0.035
Token wasn
Token ID2492
Feature activation+5.10e-02

Pos logit contributions
 soon+0.020
 then+0.018
 able+0.015
 least+0.015
 stuck+0.014
Neg logit contributions
board-0.033
inner-0.033
ag-0.031
 medal-0.030
anger-0.029
Token't
Token ID470
Feature activation+1.39e-02

Pos logit contributions
 soon+0.073
 then+0.065
 hoping+0.064
 Then+0.063
 least+0.061
Neg logit contributions
inner-0.087
ork-0.081
board-0.079
ag-0.078
 medal-0.077
Token strong
Token ID1913
Feature activation-4.80e-04

Pos logit contributions
 soon+0.010
 then+0.008
 least+0.006
 able+0.005
 Then+0.005
Neg logit contributions
board-0.036
inner-0.036
ag-0.034
 medal-0.033
ork-0.033
Token enough
Token ID1576
Feature activation-1.48e-02

Pos logit contributions
board+0.001
inner+0.001
ag+0.001
 medal+0.001
ork+0.001
Neg logit contributions
 soon-0.001
 then-0.000
 least-0.000
 Then-0.000
 able-0.000
Token.
Token ID13
Feature activation-1.52e-03

Pos logit contributions
board+0.034
inner+0.034
ag+0.031
affle+0.031
 similarities+0.031
Neg logit contributions
 soon-0.017
 then-0.014
 able-0.012
 stuck-0.012
 now-0.011
Token
Token ID220
Feature activation+5.77e-03

Pos logit contributions
board+0.003
inner+0.003
ag+0.003
ork+0.003
anger+0.003
Neg logit contributions
 soon-0.002
 then-0.002
 able-0.002
 least-0.001
 stuck-0.001
Token he
Token ID339
Feature activation+8.09e-03

Pos logit contributions
 soon+0.022
 then+0.020
 able+0.018
 least+0.018
 hoping+0.016
Neg logit contributions
board-0.032
inner-0.031
ag-0.029
 medal-0.029
ork-0.028
Token felt
Token ID2936
Feature activation+3.23e-02

Pos logit contributions
 soon+0.010
 then+0.009
 stuck+0.007
 able+0.007
 least+0.007
Neg logit contributions
inner-0.018
board-0.017
ag-0.016
oker-0.016
 pleasures-0.016
Token afraid
Token ID7787
Feature activation+1.82e-02

Pos logit contributions
 soon+0.031
 then+0.025
 Amen+0.023
 least+0.022
 Then+0.020
Neg logit contributions
ag-0.067
board-0.066
inner-0.066
 medal-0.066
 som-0.065
Token.
Token ID13
Feature activation-9.28e-03

Pos logit contributions
 soon+0.020
 then+0.017
 least+0.017
 able+0.016
 Amen+0.015
Neg logit contributions
board-0.038
inner-0.037
ag-0.036
 medal-0.034
ork-0.034
Token He
Token ID679
Feature activation+8.99e-03

Pos logit contributions
inner+0.020
board+0.020
ag+0.018
oker+0.018
ool+0.018
Neg logit contributions
 soon-0.011
 then-0.010
 able-0.008
 Then-0.008
 least-0.008
Token wasn
Token ID2492
Feature activation+5.09e-02

Pos logit contributions
 soon+0.012
 then+0.010
 able+0.009
 least+0.009
 stuck+0.008
Neg logit contributions
board-0.019
inner-0.019
ag-0.017
ork-0.017
anger-0.017
Token't
Token ID470
Feature activation+1.30e-02

Pos logit contributions
 soon+0.072
 hoping+0.064
 then+0.062
 Then+0.062
 least+0.061
Neg logit contributions
inner-0.082
ag-0.080
ork-0.078
 medal-0.076
board-0.075
Token sure
Token ID1654
Feature activation-7.05e-03

Pos logit contributions
 soon+0.011
 then+0.010
 least+0.009
 Then+0.008
 Amen+0.008
Neg logit contributions
ag-0.029
board-0.029
inner-0.028
 medal-0.028
 guys-0.027
Token what
Token ID644
Feature activation-1.73e-02

Pos logit contributions
board+0.016
inner+0.015
ag+0.015
 medal+0.015
ool+0.014
Neg logit contributions
 soon-0.007
 then-0.006
 least-0.005
 Amen-0.005
 able-0.005
Token to
Token ID284
Feature activation-4.63e-03

Pos logit contributions
inner+0.047
board+0.046
oker+0.043
aff+0.043
affle+0.043
Neg logit contributions
 soon-0.015
 then-0.011
 never-0.009
 stuck-0.008
-0.008
Token do
Token ID466
Feature activation-1.19e-02

Pos logit contributions
board+0.006
inner+0.006
ork+0.005
oker+0.005
ag+0.005
Neg logit contributions
 soon-0.010
 then-0.009
 able-0.008
 least-0.008
 now-0.008
Token Rose
Token ID8049
Feature activation+2.81e-03

Pos logit contributions
board+0.006
inner+0.006
ag+0.005
affle+0.005
ool+0.005
Neg logit contributions
 soon-0.003
 then-0.003
 able-0.002
 suddenly-0.002
 Then-0.002
Token started
Token ID2067
Feature activation+9.97e-03

Pos logit contributions
 soon+0.004
 then+0.003
 able+0.003
 least+0.003
 stuck+0.003
Neg logit contributions
board-0.006
inner-0.006
ag-0.005
anger-0.005
ool-0.005
Token to
Token ID284
Feature activation-2.58e-03

Pos logit contributions
 soon+0.011
 then+0.009
 able+0.008
 least+0.008
 stuck+0.007
Neg logit contributions
board-0.023
inner-0.022
ag-0.021
ork-0.020
anger-0.020
Token feel
Token ID1254
Feature activation+3.94e-02

Pos logit contributions
board+0.005
inner+0.005
ag+0.005
anger+0.005
ork+0.005
Neg logit contributions
 soon-0.004
 then-0.003
 least-0.003
 able-0.003
 stuck-0.003
Token a
Token ID257
Feature activation+1.94e-02

Pos logit contributions
 soon+0.047
 then+0.039
 least+0.036
 able+0.035
 Amen+0.034
Neg logit contributions
ag-0.073
board-0.072
inner-0.072
 medal-0.070
 som-0.069
Token little
Token ID1310
Feature activation+4.98e-02

Pos logit contributions
 soon+0.026
 then+0.023
 able+0.020
 least+0.019
 stuck+0.019
Neg logit contributions
board-0.039
inner-0.039
ag-0.036
ork-0.035
anger-0.035
Token afraid
Token ID7787
Feature activation+1.16e-02

Pos logit contributions
 soon+0.062
 then+0.053
 able+0.047
 least+0.045
 Then+0.044
Neg logit contributions
board-0.100
inner-0.095
 medal-0.092
ag-0.092
 price-0.091
Token.
Token ID13
Feature activation-4.21e-03

Pos logit contributions
 soon+0.012
 then+0.010
 able+0.008
 least+0.008
 stuck+0.007
Neg logit contributions
board-0.027
inner-0.027
ag-0.025
ork-0.025
 medal-0.025
Token She
Token ID1375
Feature activation+1.35e-02

Pos logit contributions
inner+0.010
board+0.010
ag+0.010
oker+0.009
affle+0.009
Neg logit contributions
 soon-0.004
 then-0.004
 Then-0.003
 gently-0.003
-0.003
Token didn
Token ID1422
Feature activation+3.81e-02

Pos logit contributions
 soon+0.018
 then+0.015
 able+0.013
 least+0.013
 stuck+0.012
Neg logit contributions
board-0.028
inner-0.028
ag-0.026
anger-0.026
oker-0.025
Token't
Token ID470
Feature activation+1.80e-02

Pos logit contributions
 soon+0.051
 then+0.045
 able+0.043
 hoping+0.042
 least+0.041
Neg logit contributions
inner-0.074
board-0.071
ork-0.071
anger-0.067
ag-0.066
Token was
Token ID373
Feature activation+1.18e-02

Pos logit contributions
 soon+0.003
 then+0.003
 scared+0.003
 afraid+0.003
 stuck+0.003
Neg logit contributions
inner-0.003
board-0.003
 medal-0.003
 seas-0.003
anger-0.003
Token big
Token ID1263
Feature activation+1.73e-02

Pos logit contributions
 soon+0.016
 then+0.014
 least+0.012
 able+0.012
 hoping+0.011
Neg logit contributions
board-0.023
inner-0.023
ag-0.023
affle-0.021
ool-0.021
Token and
Token ID290
Feature activation+2.03e-02

Pos logit contributions
 soon+0.019
 then+0.016
 able+0.013
 least+0.013
 stuck+0.012
Neg logit contributions
board-0.039
inner-0.039
ag-0.037
 medal-0.036
ork-0.035
Token one
Token ID530
Feature activation+6.35e-03

Pos logit contributions
 soon+0.032
 then+0.028
 least+0.027
 able+0.026
 Then+0.025
Neg logit contributions
board-0.033
inner-0.033
ag-0.031
 medal-0.030
 seas-0.030
Token was
Token ID373
Feature activation+1.06e-03

Pos logit contributions
 soon+0.009
 then+0.008
 able+0.007
 least+0.007
 stuck+0.007
Neg logit contributions
inner-0.012
board-0.012
ag-0.011
anger-0.011
ork-0.011
Token little
Token ID1310
Feature activation+4.97e-02

Pos logit contributions
 soon+0.001
 then+0.001
 least+0.001
 able+0.001
 hoping+0.001
Neg logit contributions
board-0.002
ag-0.002
inner-0.002
affle-0.002
ool-0.002
Token.
Token ID13
Feature activation+8.73e-03

Pos logit contributions
 soon+0.086
 then+0.076
 able+0.076
 least+0.072
 never+0.071
Neg logit contributions
board-0.074
inner-0.068
 guys-0.067
 medal-0.065
 seas-0.064
Token The
Token ID383
Feature activation+6.98e-03

Pos logit contributions
 soon+0.012
 then+0.011
 able+0.009
 least+0.009
 stuck+0.009
Neg logit contributions
board-0.017
inner-0.017
ag-0.016
ork-0.015
anger-0.015
Token little
Token ID1310
Feature activation+2.65e-02

Pos logit contributions
 soon+0.012
 then+0.011
 able+0.010
 least+0.010
 stuck+0.010
Neg logit contributions
board-0.011
inner-0.011
ag-0.010
 guys-0.010
ork-0.010
Token human
Token ID1692
Feature activation+4.43e-03

Pos logit contributions
 soon+0.042
 then+0.038
 able+0.033
 least+0.033
 stuck+0.032
Neg logit contributions
board-0.047
inner-0.047
ag-0.043
ork-0.042
anger-0.041
Token wanted
Token ID2227
Feature activation+1.52e-02

Pos logit contributions
 soon+0.006
 then+0.005
 stuck+0.004
+0.004
 able+0.004
Neg logit contributions
inner-0.010
board-0.009
ag-0.009
anger-0.009
oker-0.009
Token the
Token ID262
Feature activation+7.79e-03

Pos logit contributions
board+0.008
inner+0.008
ag+0.007
 medal+0.007
ork+0.007
Neg logit contributions
 soon-0.006
 then-0.005
 able-0.004
 least-0.004
 afraid-0.004
Token top
Token ID1353
Feature activation-8.92e-04

Pos logit contributions
 soon+0.013
 then+0.012
 least+0.010
 now+0.010
 so+0.010
Neg logit contributions
ag-0.013
inner-0.013
 similarities-0.013
oker-0.013
board-0.013
Token,
Token ID11
Feature activation+4.35e-03

Pos logit contributions
board+0.002
inner+0.002
ag+0.002
affle+0.001
ork+0.001
Neg logit contributions
 soon-0.001
 then-0.001
 able-0.001
 least-0.001
 stuck-0.001
Token but
Token ID475
Feature activation-9.96e-03

Pos logit contributions
 soon+0.006
 then+0.005
 able+0.005
 least+0.004
 Amen+0.004
Neg logit contributions
board-0.008
inner-0.008
 medal-0.008
 som-0.008
 seas-0.007
Token he
Token ID339
Feature activation+9.46e-03

Pos logit contributions
board+0.021
inner+0.021
ag+0.020
ool+0.019
affle+0.019
Neg logit contributions
 soon-0.014
 then-0.012
 able-0.010
 stuck-0.010
 least-0.010
Token wasn
Token ID2492
Feature activation+4.94e-02

Pos logit contributions
 soon+0.013
 then+0.011
 able+0.009
 least+0.009
 stuck+0.009
Neg logit contributions
board-0.019
inner-0.019
ag-0.018
ork-0.017
anger-0.017
Token't
Token ID470
Feature activation+1.94e-02

Pos logit contributions
 soon+0.071
 then+0.063
 hoping+0.061
 Then+0.060
 least+0.059
Neg logit contributions
inner-0.085
ork-0.078
ag-0.077
board-0.077
 medal-0.074
Token quite
Token ID2407
Feature activation+1.48e-02

Pos logit contributions
 soon+0.019
 then+0.017
 least+0.016
 Then+0.015
 Amen+0.014
Neg logit contributions
ag-0.040
inner-0.040
board-0.039
 medal-0.039
ade-0.038
Token tall
Token ID7331
Feature activation-1.11e-02

Pos logit contributions
 soon+0.016
 then+0.013
 least+0.011
 able+0.010
 Then+0.010
Neg logit contributions
board-0.033
inner-0.033
ag-0.032
 medal-0.031
oker-0.030
Token enough
Token ID1576
Feature activation-7.07e-03

Pos logit contributions
board+0.025
inner+0.024
 medal+0.023
ag+0.023
ork+0.022
Neg logit contributions
 soon-0.012
 then-0.010
 least-0.009
 able-0.009
 Then-0.008
Token.
Token ID13
Feature activation-9.02e-05

Pos logit contributions
board+0.016
inner+0.016
ag+0.015
 medal+0.015
ool+0.015
Neg logit contributions
 soon-0.007
 then-0.006
 least-0.005
 able-0.005
 stuck-0.004
Token brave
Token ID14802
Feature activation+4.61e-03

Pos logit contributions
board+0.005
inner+0.005
anger+0.005
ests+0.004
ag+0.004
Neg logit contributions
 soon-0.005
 then-0.005
 able-0.004
 very-0.004
 now-0.004
Token and
Token ID290
Feature activation+1.34e-02

Pos logit contributions
 soon+0.004
 then+0.004
 able+0.003
 least+0.003
 stuck+0.003
Neg logit contributions
board-0.011
inner-0.011
ag-0.010
 medal-0.010
anger-0.010
Token daring
Token ID27939
Feature activation+1.29e-02

Pos logit contributions
 soon+0.020
 then+0.017
 least+0.016
 able+0.016
 stuck+0.015
Neg logit contributions
board-0.025
inner-0.024
ag-0.022
 medal-0.022
ade-0.022
Token but
Token ID475
Feature activation+2.33e-03

Pos logit contributions
 soon+0.011
 then+0.009
 so+0.007
 able+0.006
 now+0.006
Neg logit contributions
inner-0.035
board-0.034
ool-0.032
affle-0.032
anger-0.031
Token he
Token ID339
Feature activation+9.81e-03

Pos logit contributions
 soon+0.003
 then+0.003
 stuck+0.002
 least+0.002
 now+0.002
Neg logit contributions
inner-0.005
board-0.005
ag-0.005
anger-0.005
affle-0.005
Token wasn
Token ID2492
Feature activation+4.91e-02

Pos logit contributions
 soon+0.013
 then+0.011
 stuck+0.009
 least+0.009
 able+0.009
Neg logit contributions
inner-0.021
board-0.021
ag-0.020
anger-0.020
 pleasures-0.019
Token't
Token ID470
Feature activation+2.00e-02

Pos logit contributions
 soon+0.065
 then+0.057
 hoping+0.053
 Then+0.052
 able+0.051
Neg logit contributions
inner-0.093
ag-0.088
board-0.087
ork-0.085
 medal-0.083
Token sure
Token ID1654
Feature activation-8.05e-03

Pos logit contributions
 soon+0.018
 then+0.015
 least+0.012
 Then+0.011
 able+0.011
Neg logit contributions
board-0.047
inner-0.047
ag-0.045
 medal-0.044
ork-0.043
Token what
Token ID644
Feature activation-7.29e-03

Pos logit contributions
board+0.019
inner+0.019
ag+0.018
ork+0.018
 medal+0.018
Neg logit contributions
 soon-0.008
 then-0.006
 least-0.005
 able-0.005
 stuck-0.005
Token it
Token ID340
Feature activation-1.41e-02

Pos logit contributions
inner+0.019
board+0.019
anger+0.018
ag+0.018
affle+0.018
Neg logit contributions
 soon-0.006
 then-0.005
-0.004
 now-0.003
 sometimes-0.003
Token was
Token ID373
Feature activation-6.15e-03

Pos logit contributions
inner+0.030
board+0.030
ag+0.029
ork+0.028
anger+0.028
Neg logit contributions
 soon-0.018
 then-0.015
 stuck-0.013
 least-0.013
 able-0.013
Tokeneddy
Token ID21874
Feature activation+2.25e-02

Pos logit contributions
 soon+0.003
 then+0.003
 able+0.003
 least+0.003
 now+0.003
Neg logit contributions
board-0.003
inner-0.003
 som-0.003
 similarities-0.003
ag-0.003
Token bears
Token ID13062
Feature activation+1.60e-02

Pos logit contributions
 soon+0.032
 then+0.028
 able+0.025
 least+0.024
 stuck+0.023
Neg logit contributions
board-0.044
inner-0.043
ag-0.040
ork-0.039
anger-0.039
Token,
Token ID11
Feature activation+1.11e-02

Pos logit contributions
 soon+0.020
 then+0.017
 least+0.016
 Amen+0.016
 Then+0.015
Neg logit contributions
board-0.032
ag-0.030
inner-0.028
ade-0.028
 medal-0.028
Token but
Token ID475
Feature activation+1.27e-02

Pos logit contributions
 soon+0.014
 then+0.012
 able+0.010
 least+0.010
 stuck+0.009
Neg logit contributions
board-0.023
inner-0.023
ag-0.022
 medal-0.021
ork-0.021
Token he
Token ID339
Feature activation+1.33e-02

Pos logit contributions
 soon+0.017
 then+0.015
 able+0.014
 least+0.014
 trapped+0.013
Neg logit contributions
board-0.024
inner-0.024
ag-0.022
 medal-0.022
 seas-0.022
Token wasn
Token ID2492
Feature activation+4.86e-02

Pos logit contributions
 soon+0.018
 then+0.016
 able+0.013
 least+0.013
 stuck+0.013
Neg logit contributions
board-0.027
inner-0.027
ag-0.025
ork-0.024
anger-0.024
Token't
Token ID470
Feature activation+2.09e-02

Pos logit contributions
 soon+0.064
 then+0.057
 hoping+0.052
 able+0.052
 Then+0.052
Neg logit contributions
inner-0.091
board-0.087
ag-0.086
ork-0.085
 medal-0.082
Token sure
Token ID1654
Feature activation-2.82e-03

Pos logit contributions
 soon+0.020
 then+0.017
 least+0.014
 able+0.013
 Then+0.013
Neg logit contributions
board-0.048
inner-0.048
ag-0.047
 medal-0.045
affle-0.045
Token what
Token ID644
Feature activation-2.02e-04

Pos logit contributions
board+0.007
inner+0.007
ag+0.006
ork+0.006
 medal+0.006
Neg logit contributions
 soon-0.003
 then-0.002
 least-0.002
 able-0.002
 stuck-0.002
Token to
Token ID284
Feature activation-4.58e-03

Pos logit contributions
inner+0.001
board+0.001
anger+0.000
ag+0.000
oker+0.000
Neg logit contributions
 soon-0.000
 then-0.000
 very-0.000
 so-0.000
 never-0.000
Token do
Token ID466
Feature activation-8.30e-03

Pos logit contributions
board+0.008
inner+0.008
anger+0.007
oker+0.007
ork+0.007
Neg logit contributions
 soon-0.008
 then-0.007
 now-0.007
 sometimes-0.006
 so-0.006
Token do
Token ID466
Feature activation+6.92e-03

Pos logit contributions
board+0.028
inner+0.028
anger+0.026
ool+0.026
ag+0.025
Neg logit contributions
 soon-0.025
 then-0.023
-0.019
 able-0.019
 least-0.019
Token something
Token ID1223
Feature activation+8.89e-03

Pos logit contributions
 soon+0.008
 then+0.007
 so+0.006
 able+0.006
 now+0.006
Neg logit contributions
inner-0.016
board-0.016
oker-0.015
aff-0.015
ool-0.015
Token fun
Token ID1257
Feature activation+8.58e-03

Pos logit contributions
 soon+0.015
 then+0.013
 able+0.012
 least+0.011
 so+0.011
Neg logit contributions
board-0.016
inner-0.016
ork-0.014
anger-0.014
ag-0.014
Token but
Token ID475
Feature activation-5.60e-03

Pos logit contributions
 soon+0.008
 then+0.007
 able+0.006
 least+0.005
 stuck+0.005
Neg logit contributions
board-0.021
inner-0.020
ag-0.019
ork-0.019
 medal-0.019
Token he
Token ID339
Feature activation-1.20e-03

Pos logit contributions
inner+0.012
board+0.012
ag+0.011
anger+0.011
aff+0.011
Neg logit contributions
 soon-0.008
 then-0.007
 now-0.006
 so-0.006
 suddenly-0.006
Token wasn
Token ID2492
Feature activation+4.83e-02

Pos logit contributions
inner+0.003
board+0.003
ag+0.003
 pleasures+0.003
anger+0.003
Neg logit contributions
 soon-0.002
 then-0.001
 never-0.001
 stuck-0.001
 now-0.001
Tokenâ
Token ID22940
Feature activation+4.90e-04

Pos logit contributions
 soon+0.046
 then+0.038
 hoping+0.032
 able+0.032
 Then+0.031
Neg logit contributions
inner-0.110
board-0.108
ag-0.105
ork-0.103
 medal-0.102
Token
Token ID26391
Feature activation+1.59e-02

Pos logit contributions
 soon+0.001
 then+0.001
 least+0.001
+0.001
 never+0.001
Neg logit contributions
 seas-0.001
inner-0.001
board-0.001
anger-0.001
 similarities-0.001
Token
Token ID8151
Feature activation+2.14e-02

Pos logit contributions
 soon+0.026
 then+0.024
 least+0.021
 able+0.021
 stuck+0.020
Neg logit contributions
board-0.027
inner-0.026
ag-0.024
ork-0.024
anger-0.024
Tokent
Token ID83
Feature activation+8.66e-03

Pos logit contributions
 soon+0.046
 then+0.043
 least+0.040
 able+0.040
 stuck+0.038
Neg logit contributions
board-0.025
inner-0.025
ag-0.022
ork-0.021
anger-0.021
Token sure
Token ID1654
Feature activation-8.28e-03

Pos logit contributions
 soon+0.008
 then+0.006
 able+0.005
 least+0.005
 stuck+0.004
Neg logit contributions
board-0.022
inner-0.022
ag-0.020
ork-0.020
anger-0.020
Token she
Token ID673
Feature activation+7.92e-03

Pos logit contributions
 soon+0.003
 then+0.002
 least+0.002
 stuck+0.002
 now+0.002
Neg logit contributions
inner-0.006
board-0.006
ag-0.006
affle-0.005
oker-0.005
Token was
Token ID373
Feature activation+2.18e-02

Pos logit contributions
 soon+0.011
 then+0.010
 able+0.008
 least+0.008
 stuck+0.008
Neg logit contributions
board-0.016
inner-0.016
ag-0.015
anger-0.014
ork-0.014
Token worried
Token ID7960
Feature activation+1.03e-02

Pos logit contributions
 soon+0.019
 then+0.016
 least+0.012
 able+0.012
 Then+0.012
Neg logit contributions
board-0.051
inner-0.051
ag-0.050
 medal-0.048
affle-0.047
Token that
Token ID326
Feature activation+1.66e-02

Pos logit contributions
 soon+0.013
 then+0.011
 able+0.009
 least+0.009
 stuck+0.009
Neg logit contributions
board-0.022
inner-0.022
ag-0.021
ork-0.020
anger-0.020
Token she
Token ID673
Feature activation+1.44e-02

Pos logit contributions
 soon+0.019
 then+0.016
 least+0.012
 able+0.012
 stuck+0.012
Neg logit contributions
inner-0.039
board-0.039
ag-0.037
affle-0.037
anger-0.036
Token wasn
Token ID2492
Feature activation+4.82e-02

Pos logit contributions
 soon+0.020
 then+0.018
 stuck+0.015
 least+0.015
 able+0.015
Neg logit contributions
inner-0.029
board-0.029
ag-0.027
oker-0.026
anger-0.026
Token't
Token ID470
Feature activation+1.18e-02

Pos logit contributions
 soon+0.064
 then+0.057
 Then+0.052
 hoping+0.052
 able+0.051
Neg logit contributions
inner-0.093
board-0.088
ag-0.087
ork-0.086
 medal-0.082
Token good
Token ID922
Feature activation+1.37e-02

Pos logit contributions
 soon+0.011
 then+0.009
 able+0.008
 least+0.007
 stuck+0.007
Neg logit contributions
board-0.028
inner-0.028
ag-0.026
ork-0.026
anger-0.026
Token enough
Token ID1576
Feature activation-5.02e-03

Pos logit contributions
 soon+0.019
 then+0.018
 least+0.015
 able+0.015
 trapped+0.015
Neg logit contributions
board-0.026
inner-0.025
ag-0.024
 medal-0.023
 guys-0.023
Token.
Token ID13
Feature activation+1.35e-03

Pos logit contributions
board+0.011
inner+0.011
ag+0.011
ork+0.010
 medal+0.010
Neg logit contributions
 soon-0.006
 then-0.005
 able-0.004
 least-0.004
 stuck-0.004
Token
Token ID220
Feature activation+8.38e-03

Pos logit contributions
 soon+0.001
 then+0.001
 able+0.001
 least+0.001
 trapped+0.001
Neg logit contributions
board-0.003
inner-0.003
ag-0.003
ork-0.003
anger-0.003
Token an
Token ID281
Feature activation-2.46e-04

Pos logit contributions
 soon+0.023
 then+0.019
 able+0.015
 least+0.014
 stuck+0.013
Neg logit contributions
board-0.066
inner-0.066
ag-0.062
ork-0.061
 medal-0.061
Token adventure
Token ID8855
Feature activation+1.43e-02

Pos logit contributions
board+0.000
inner+0.000
ag+0.000
ork+0.000
anger+0.000
Neg logit contributions
 soon-0.001
 then-0.001
 least-0.000
 able-0.000
 stuck-0.000
Token,
Token ID11
Feature activation+1.71e-03

Pos logit contributions
 soon+0.013
 then+0.010
 able+0.007
 stuck+0.007
 so+0.007
Neg logit contributions
inner-0.037
board-0.037
affle-0.035
 similarities-0.035
ork-0.034
Token but
Token ID475
Feature activation-2.18e-03

Pos logit contributions
 soon+0.001
 then+0.001
 able+0.001
 least+0.001
 stuck+0.001
Neg logit contributions
board-0.005
inner-0.005
ag-0.004
affle-0.004
ork-0.004
Token she
Token ID673
Feature activation+9.19e-04

Pos logit contributions
inner+0.005
board+0.005
ag+0.005
affle+0.005
oker+0.005
Neg logit contributions
 soon-0.002
 then-0.002
 least-0.002
 able-0.002
 stuck-0.002
Token wasn
Token ID2492
Feature activation+4.82e-02

Pos logit contributions
 soon+0.001
 then+0.001
 able+0.001
 least+0.001
 stuck+0.001
Neg logit contributions
board-0.002
inner-0.002
ag-0.002
ork-0.002
anger-0.002
Token't
Token ID470
Feature activation+1.87e-02

Pos logit contributions
 soon+0.065
 then+0.058
 least+0.052
 Then+0.052
 hoping+0.052
Neg logit contributions
inner-0.089
ag-0.084
board-0.084
ork-0.084
 medal-0.081
Token sure
Token ID1654
Feature activation-6.85e-03

Pos logit contributions
 soon+0.017
 then+0.015
 least+0.012
 Then+0.011
 able+0.011
Neg logit contributions
board-0.043
inner-0.043
ag-0.042
 medal-0.040
ork-0.040
Token which
Token ID543
Feature activation+4.30e-03

Pos logit contributions
board+0.016
inner+0.016
ag+0.015
ork+0.015
ool+0.015
Neg logit contributions
 soon-0.006
 then-0.005
 least-0.005
 able-0.004
 stuck-0.004
Token way
Token ID835
Feature activation+4.66e-03

Pos logit contributions
 soon+0.006
 then+0.006
 hoping+0.005
 so+0.005
 stuck+0.005
Neg logit contributions
inner-0.008
board-0.008
affle-0.008
ag-0.008
 similarities-0.007
Token to
Token ID284
Feature activation+1.26e-03

Pos logit contributions
 soon+0.005
 then+0.004
 able+0.003
 least+0.003
 stuck+0.003
Neg logit contributions
board-0.011
inner-0.011
affle-0.011
ag-0.010
oker-0.010
Token she
Token ID673
Feature activation+5.34e-04

Pos logit contributions
inner+0.003
board+0.003
ag+0.003
affle+0.003
ool+0.003
Neg logit contributions
 soon-0.002
 then-0.001
 able-0.001
 stuck-0.001
 suddenly-0.001
Token started
Token ID2067
Feature activation+1.45e-02

Pos logit contributions
 soon+0.001
 then+0.001
 able+0.001
 least+0.001
 stuck+0.001
Neg logit contributions
board-0.001
inner-0.001
ag-0.001
ork-0.001
 medal-0.001
Token to
Token ID284
Feature activation+1.45e-03

Pos logit contributions
 soon+0.017
 then+0.015
 least+0.012
 able+0.012
 stuck+0.011
Neg logit contributions
board-0.032
inner-0.031
ag-0.029
ork-0.029
 medal-0.028
Token feel
Token ID1254
Feature activation+4.63e-02

Pos logit contributions
 soon+0.002
 then+0.002
 least+0.002
 able+0.002
 stuck+0.001
Neg logit contributions
board-0.003
inner-0.003
ag-0.003
anger-0.003
ork-0.003
Token a
Token ID257
Feature activation+2.27e-02

Pos logit contributions
 soon+0.059
 then+0.048
 Amen+0.048
 least+0.044
 together+0.042
Neg logit contributions
ag-0.082
board-0.081
ade-0.080
inner-0.079
 som-0.078
Token little
Token ID1310
Feature activation+4.80e-02

Pos logit contributions
 soon+0.033
 then+0.029
 able+0.026
 least+0.025
 stuck+0.025
Neg logit contributions
board-0.043
inner-0.043
ag-0.040
anger-0.038
ork-0.038
Token bit
Token ID1643
Feature activation+2.76e-02

Pos logit contributions
 soon+0.064
 then+0.055
 able+0.049
 Then+0.048
 least+0.047
Neg logit contributions
board-0.092
inner-0.086
 medal-0.084
ag-0.084
 price-0.083
Token hurt
Token ID5938
Feature activation+7.99e-03

Pos logit contributions
 soon+0.037
 Amen+0.033
 least+0.033
 then+0.032
 never+0.031
Neg logit contributions
ade-0.049
ag-0.048
board-0.047
inner-0.045
oker-0.044
Token in
Token ID287
Feature activation+1.19e-02

Pos logit contributions
 soon+0.009
 then+0.008
 least+0.007
 able+0.007
 Amen+0.006
Neg logit contributions
board-0.017
inner-0.016
ag-0.016
ork-0.015
ade-0.015
Token her
Token ID607
Feature activation+1.40e-02

Pos logit contributions
 soon+0.017
 then+0.016
 able+0.014
 least+0.014
ends+0.014
Neg logit contributions
board-0.020
inner-0.019
 price-0.019
ork-0.019
ag-0.019
Token tum
Token ID11814
Feature activation-2.37e-02

Pos logit contributions
 soon+0.027
 then+0.025
 able+0.023
 least+0.022
 stuck+0.022
Neg logit contributions
board-0.020
inner-0.019
ag-0.018
 medal-0.017
ork-0.017
Token and
Token ID290
Feature activation+8.17e-03

Pos logit contributions
 soon+0.017
 Amen+0.015
 least+0.015
 then+0.014
 able+0.013
Neg logit contributions
board-0.032
ag-0.030
inner-0.030
 seas-0.029
 guys-0.029
Token curious
Token ID11040
Feature activation-4.37e-03

Pos logit contributions
 soon+0.012
 then+0.010
 least+0.009
 able+0.009
 stuck+0.008
Neg logit contributions
board-0.016
inner-0.015
ag-0.014
 medal-0.014
ork-0.014
Token,
Token ID11
Feature activation-1.17e-03

Pos logit contributions
inner+0.011
board+0.011
oker+0.010
anger+0.010
ork+0.010
Neg logit contributions
 soon-0.004
 then-0.003
 able-0.002
 so-0.002
 hoping-0.002
Token but
Token ID475
Feature activation-1.15e-03

Pos logit contributions
board+0.003
inner+0.003
ag+0.003
ork+0.003
 medal+0.003
Neg logit contributions
 soon-0.001
 then-0.001
 able-0.001
 least-0.001
 stuck-0.001
Token he
Token ID339
Feature activation+4.36e-03

Pos logit contributions
board+0.003
inner+0.003
 medal+0.002
ag+0.002
 seas+0.002
Neg logit contributions
 soon-0.001
 then-0.001
 able-0.001
 least-0.001
 hoping-0.001
Token wasn
Token ID2492
Feature activation+4.79e-02

Pos logit contributions
 soon+0.006
 then+0.005
 least+0.005
 able+0.005
 hoping+0.004
Neg logit contributions
board-0.009
inner-0.008
ag-0.008
ork-0.008
 medal-0.008
Token't
Token ID470
Feature activation+1.18e-02

Pos logit contributions
 soon+0.071
 Then+0.064
 hoping+0.064
 then+0.063
 least+0.062
Neg logit contributions
inner-0.074
ag-0.072
ork-0.068
ves-0.065
board-0.064
Token ready
Token ID3492
Feature activation-1.57e-02

Pos logit contributions
 then+0.011
 soon+0.011
 least+0.010
 Then+0.010
 Amen+0.009
Neg logit contributions
ag-0.024
 guys-0.023
 medal-0.023
 som-0.022
board-0.022
Token to
Token ID284
Feature activation+7.81e-03

Pos logit contributions
board+0.036
inner+0.036
ag+0.034
affle+0.033
anger+0.033
Neg logit contributions
 soon-0.018
 then-0.015
 able-0.012
 least-0.012
 hoping-0.011
Token touch
Token ID3638
Feature activation+1.11e-02

Pos logit contributions
 soon+0.013
 then+0.012
 able+0.011
 least+0.011
 Then+0.011
Neg logit contributions
board-0.012
inner-0.012
ag-0.011
 medal-0.011
ade-0.011
Token it
Token ID340
Feature activation+1.81e-02

Pos logit contributions
 soon+0.017
 then+0.016
 able+0.015
 least+0.015
 Then+0.014
Neg logit contributions
board-0.018
inner-0.017
ag-0.017
 seas-0.017
 medal-0.017
Token,
Token ID11
Feature activation-6.63e-03

Pos logit contributions
inner+0.024
board+0.024
ag+0.023
 similarities+0.023
affle+0.023
Neg logit contributions
 soon-0.009
 then-0.007
 able-0.005
 now-0.005
 stuck-0.005
Token but
Token ID475
Feature activation-1.63e-02

Pos logit contributions
board+0.015
inner+0.014
 som+0.013
 medal+0.013
ag+0.013
Neg logit contributions
 soon-0.006
 then-0.005
 least-0.005
 able-0.005
 Amen-0.004
Token he
Token ID339
Feature activation+3.70e-03

Pos logit contributions
board+0.035
inner+0.035
ag+0.033
ork+0.032
ool+0.032
Neg logit contributions
 soon-0.020
 then-0.018
 able-0.015
 least-0.015
 stuck-0.014
Token felt
Token ID2936
Feature activation+3.21e-02

Pos logit contributions
 soon+0.005
 then+0.004
 able+0.004
 least+0.004
 hoping+0.004
Neg logit contributions
board-0.007
inner-0.007
ag-0.007
ork-0.006
ade-0.006
Token a
Token ID257
Feature activation+1.57e-02

Pos logit contributions
 soon+0.033
 then+0.026
 Amen+0.024
 least+0.024
 able+0.023
Neg logit contributions
ag-0.066
board-0.065
inner-0.064
 medal-0.063
 seas-0.063
Token little
Token ID1310
Feature activation+4.76e-02

Pos logit contributions
 soon+0.024
 then+0.022
 least+0.019
 able+0.019
 Then+0.018
Neg logit contributions
board-0.028
inner-0.028
ag-0.026
ork-0.025
 medal-0.025
Token scared
Token ID12008
Feature activation+1.12e-02

Pos logit contributions
 soon+0.056
 then+0.046
 able+0.041
 least+0.041
 Then+0.039
Neg logit contributions
board-0.096
 medal-0.092
ag-0.091
inner-0.090
 som-0.089
Token.
Token ID13
Feature activation-8.45e-03

Pos logit contributions
 soon+0.013
 then+0.011
 least+0.011
 able+0.011
 Then+0.010
Neg logit contributions
board-0.023
ag-0.022
inner-0.021
 guys-0.021
 medal-0.021
Token Just
Token ID2329
Feature activation-6.96e-04

Pos logit contributions
inner+0.018
ag+0.017
ool+0.017
oker+0.017
ade+0.017
Neg logit contributions
 soon-0.010
 then-0.009
 Then-0.009
 so-0.008
 over-0.007
Token then
Token ID788
Feature activation+8.97e-03

Pos logit contributions
board+0.002
inner+0.002
ag+0.002
 medal+0.002
 seas+0.002
Neg logit contributions
 soon-0.000
 able-0.000
 then-0.000
 least-0.000
 trapped-0.000
Token,
Token ID11
Feature activation-4.25e-03

Pos logit contributions
 soon+0.008
 then+0.007
 least+0.006
 able+0.006
 hoping+0.005
Neg logit contributions
board-0.021
inner-0.020
ag-0.019
 medal-0.019
ork-0.019
Token,
Token ID11
Feature activation+1.65e-02

Pos logit contributions
 soon+0.015
 then+0.012
 able+0.010
 least+0.010
 stuck+0.009
Neg logit contributions
board-0.033
inner-0.033
ag-0.031
ork-0.030
anger-0.030
Token twelve
Token ID14104
Feature activation+1.39e-02

Pos logit contributions
 soon+0.027
 able+0.024
 least+0.024
 then+0.023
 trapped+0.022
Neg logit contributions
board-0.026
inner-0.025
 seas-0.022
 medal-0.022
 som-0.022
Token,
Token ID11
Feature activation+1.26e-02

Pos logit contributions
 soon+0.014
 then+0.012
 able+0.009
 least+0.009
 stuck+0.009
Neg logit contributions
board-0.033
inner-0.033
ag-0.031
ork-0.030
anger-0.030
Token thirteen
Token ID28306
Feature activation+2.23e-02

Pos logit contributions
 soon+0.021
 able+0.019
 least+0.018
 then+0.018
 afraid+0.017
Neg logit contributions
board-0.020
inner-0.019
 som-0.017
ork-0.017
 seas-0.017
Token,
Token ID11
Feature activation+1.17e-02

Pos logit contributions
 soon+0.021
 then+0.017
 able+0.013
 least+0.013
 stuck+0.012
Neg logit contributions
board-0.054
inner-0.054
ag-0.051
ork-0.050
anger-0.050
Token fourteen
Token ID29167
Feature activation+4.73e-02

Pos logit contributions
 soon+0.017
 able+0.016
 Amen+0.016
 afraid+0.016
 trapped+0.015
Neg logit contributions
board-0.019
inner-0.019
oons-0.017
 seas-0.017
 som-0.017
Token,
Token ID11
Feature activation+2.56e-02

Pos logit contributions
 soon+0.047
 then+0.038
 able+0.033
 least+0.032
 stuck+0.029
Neg logit contributions
board-0.111
inner-0.109
ag-0.102
 medal-0.101
 seas-0.101
Token fifteen
Token ID17280
Feature activation+2.52e-02

Pos logit contributions
 soon+0.042
 able+0.037
 then+0.037
 least+0.036
 trapped+0.035
Neg logit contributions
board-0.040
inner-0.039
 som-0.035
 medal-0.035
 seas-0.034
Token,
Token ID11
Feature activation+1.77e-02

Pos logit contributions
 soon+0.027
 then+0.022
 able+0.018
 least+0.017
 stuck+0.017
Neg logit contributions
board-0.058
inner-0.058
ag-0.055
anger-0.054
ork-0.053
Token sixteen
Token ID27913
Feature activation+2.67e-02

Pos logit contributions
 soon+0.031
 then+0.028
 able+0.027
 least+0.027
 afraid+0.026
Neg logit contributions
board-0.027
inner-0.026
ork-0.023
ag-0.023
 medal-0.023
Token,
Token ID11
Feature activation+1.22e-02

Pos logit contributions
 soon+0.025
 then+0.020
 able+0.016
 least+0.016
 stuck+0.014
Neg logit contributions
board-0.065
inner-0.065
ag-0.061
ork-0.060
anger-0.060
TokenYes
Token ID5297
Feature activation+9.00e-03

Pos logit contributions
 soon+0.018
 then+0.016
 least+0.014
 able+0.014
 gently+0.014
Neg logit contributions
board-0.015
inner-0.015
ag-0.014
ork-0.013
ool-0.013
Token,
Token ID11
Feature activation+1.48e-03

Pos logit contributions
 soon+0.008
 then+0.007
 able+0.006
 least+0.006
 trapped+0.005
Neg logit contributions
board-0.022
inner-0.021
ag-0.020
ork-0.020
anger-0.020
Token but
Token ID475
Feature activation-6.50e-04

Pos logit contributions
 soon+0.002
 then+0.002
 so+0.001
 now+0.001
 able+0.001
Neg logit contributions
board-0.003
inner-0.003
ag-0.003
ork-0.003
affle-0.003
Token only
Token ID691
Feature activation+6.31e-03

Pos logit contributions
board+0.001
inner+0.001
ag+0.001
 medal+0.001
ork+0.001
Neg logit contributions
 soon-0.001
 then-0.001
 able-0.001
 least-0.001
 trapped-0.001
Token a
Token ID257
Feature activation+2.16e-02

Pos logit contributions
 soon+0.008
 then+0.007
 able+0.006
 least+0.006
 trapped+0.006
Neg logit contributions
inner-0.012
board-0.012
 medal-0.011
ag-0.011
 seas-0.011
Token little
Token ID1310
Feature activation+4.72e-02

Pos logit contributions
 soon+0.040
 then+0.036
 able+0.032
 least+0.032
 stuck+0.031
Neg logit contributions
board-0.034
inner-0.033
ag-0.031
anger-0.029
ork-0.029
Token.
Token ID13
Feature activation-7.47e-03

Pos logit contributions
 soon+0.064
 then+0.057
 able+0.054
 least+0.052
 Then+0.051
Neg logit contributions
board-0.088
inner-0.084
 medal-0.079
ool-0.078
 seas-0.078
Token The
Token ID383
Feature activation+1.77e-02

Pos logit contributions
board+0.013
inner+0.013
anger+0.011
ork+0.011
ag+0.011
Neg logit contributions
 soon-0.012
 able-0.010
 then-0.010
 least-0.009
 hoping-0.009
Token oats
Token ID47009
Feature activation-2.39e-05

Pos logit contributions
 soon+0.032
 then+0.029
 hoping+0.025
 least+0.025
 now+0.025
Neg logit contributions
board-0.028
inner-0.028
ag-0.026
oker-0.026
ork-0.025
Token are
Token ID389
Feature activation+2.15e-02

Pos logit contributions
board+0.000
inner+0.000
 seas+0.000
 similarities+0.000
ag+0.000
Neg logit contributions
 soon-0.000
 then-0.000
 stuck-0.000
 now-0.000
 least-0.000
Token for
Token ID329
Feature activation+2.07e-02

Pos logit contributions
 soon+0.025
 then+0.021
 least+0.018
 able+0.018
 stuck+0.017
Neg logit contributions
board-0.046
inner-0.046
ag-0.043
ork-0.042
 medal-0.042
Token liked
Token ID8288
Feature activation+5.17e-04

Pos logit contributions
board+0.015
inner+0.015
ag+0.013
anger+0.013
ork+0.013
Neg logit contributions
 soon-0.011
 then-0.010
 able-0.009
 least-0.009
 stuck-0.008
Token to
Token ID284
Feature activation-1.63e-02

Pos logit contributions
 soon+0.000
 then+0.000
 so+0.000
 hoping+0.000
 now+0.000
Neg logit contributions
inner-0.001
board-0.001
ool-0.001
anger-0.001
oker-0.001
Token explore
Token ID7301
Feature activation+2.66e-02

Pos logit contributions
inner+0.031
board+0.030
anger+0.030
ork+0.029
oker+0.028
Neg logit contributions
 soon-0.024
 then-0.021
 gently-0.018
 now-0.017
 sometimes-0.017
Token but
Token ID475
Feature activation-4.59e-03

Pos logit contributions
 soon+0.030
 then+0.025
 able+0.021
 least+0.020
 stuck+0.019
Neg logit contributions
board-0.060
inner-0.060
ag-0.056
anger-0.055
ork-0.055
Token she
Token ID673
Feature activation+1.23e-02

Pos logit contributions
board+0.010
inner+0.010
ag+0.009
anger+0.009
ork+0.009
Neg logit contributions
 soon-0.006
 then-0.006
 stuck-0.005
 least-0.004
 now-0.004
Token wasn
Token ID2492
Feature activation+4.71e-02

Pos logit contributions
 soon+0.017
 then+0.015
 able+0.013
 least+0.013
 stuck+0.012
Neg logit contributions
board-0.024
inner-0.024
ag-0.022
ork-0.022
 medal-0.021
Token't
Token ID470
Feature activation+2.04e-02

Pos logit contributions
 soon+0.063
 then+0.057
 Then+0.054
 hoping+0.052
 least+0.051
Neg logit contributions
inner-0.087
ag-0.082
ork-0.081
board-0.080
ade-0.077
Token an
Token ID281
Feature activation+5.90e-03

Pos logit contributions
 soon+0.019
 then+0.015
 least+0.012
 able+0.012
 Then+0.011
Neg logit contributions
board-0.049
inner-0.049
ag-0.047
ork-0.045
 medal-0.045
Token adventurous
Token ID39770
Feature activation-3.75e-03

Pos logit contributions
 soon+0.010
 then+0.009
 least+0.008
 able+0.008
 stuck+0.008
Neg logit contributions
board-0.010
inner-0.009
ag-0.009
ork-0.008
anger-0.008
Token swim
Token ID9422
Feature activation+5.96e-03

Pos logit contributions
inner+0.006
ag+0.006
ves+0.006
oons+0.006
 similarities+0.006
Neg logit contributions
 soon-0.007
 then-0.006
 now-0.005
 scared-0.005
 very-0.005
Tokenmer
Token ID647
Feature activation+1.50e-02

Pos logit contributions
 soon+0.006
 then+0.004
 so+0.003
 now+0.003
 scared+0.003
Neg logit contributions
inner-0.015
board-0.014
 similarities-0.014
affle-0.014
ool-0.014
Token so
Token ID523
Feature activation-6.25e-03

Pos logit contributions
ag+0.013
inner+0.013
board+0.013
 som+0.012
 guys+0.012
Neg logit contributions
 soon-0.009
 able-0.008
 least-0.007
 then-0.007
 Amen-0.007
Token loud
Token ID7812
Feature activation-1.17e-02

Pos logit contributions
inner+0.010
board+0.010
ag+0.009
ork+0.009
ade+0.009
Neg logit contributions
 soon-0.010
 then-0.010
 able-0.009
 least-0.009
 hoping-0.009
Token,
Token ID11
Feature activation-1.42e-02

Pos logit contributions
board+0.027
inner+0.026
ag+0.025
 medal+0.024
anger+0.024
Neg logit contributions
 soon-0.012
 then-0.010
 least-0.009
 able-0.009
 stuck-0.008
Token but
Token ID475
Feature activation-4.89e-03

Pos logit contributions
board+0.030
inner+0.029
ag+0.027
ork+0.027
 medal+0.026
Neg logit contributions
 soon-0.018
 then-0.016
 able-0.014
 least-0.014
 stuck-0.013
Token he
Token ID339
Feature activation+3.00e-03

Pos logit contributions
board+0.008
inner+0.008
 medal+0.008
 seas+0.007
 trophy+0.007
Neg logit contributions
 soon-0.007
 able-0.006
 then-0.006
 least-0.006
 trapped-0.006
Token wasn
Token ID2492
Feature activation+4.71e-02

Pos logit contributions
 soon+0.004
 then+0.004
 able+0.003
 least+0.003
 stuck+0.003
Neg logit contributions
inner-0.006
board-0.006
ag-0.006
ool-0.006
anger-0.006
Token't
Token ID470
Feature activation+1.35e-02

Pos logit contributions
 soon+0.065
 then+0.059
 hoping+0.056
 least+0.055
 Then+0.055
Neg logit contributions
inner-0.083
ork-0.076
ag-0.075
board-0.075
 medal-0.073
Token scared
Token ID12008
Feature activation-1.01e-03

Pos logit contributions
 soon+0.012
 then+0.012
 Then+0.010
 least+0.010
 Amen+0.009
Neg logit contributions
ag-0.028
board-0.028
inner-0.028
 guys-0.028
 medal-0.027
Token.
Token ID13
Feature activation-8.88e-03

Pos logit contributions
board+0.002
inner+0.002
ag+0.002
ork+0.002
 guys+0.002
Neg logit contributions
 soon-0.001
 then-0.001
 least-0.001
 able-0.001
 Then-0.001
Token Instead
Token ID5455
Feature activation-9.62e-03

Pos logit contributions
inner+0.017
board+0.017
ag+0.016
ool+0.016
 medal+0.015
Neg logit contributions
 soon-0.013
 then-0.012
 Then-0.010
 able-0.010
 least-0.010
Token,
Token ID11
Feature activation+4.24e-03

Pos logit contributions
inner+0.027
board+0.026
ag+0.026
affle+0.025
 similarities+0.025
Neg logit contributions
 soon-0.007
 then-0.005
 now-0.004
 able-0.003
 suddenly-0.003
Token to
Token ID284
Feature activation+2.12e-03

Pos logit contributions
 soon+0.009
 then+0.008
 able+0.007
 least+0.007
 stuck+0.006
Neg logit contributions
board-0.019
inner-0.018
ag-0.017
 medal-0.017
ork-0.017
Token slow
Token ID3105
Feature activation+1.76e-02

Pos logit contributions
 soon+0.003
 then+0.003
 able+0.003
 least+0.003
 stuck+0.002
Neg logit contributions
board-0.004
inner-0.004
ag-0.004
anger-0.003
ork-0.003
Token down
Token ID866
Feature activation+5.72e-03

Pos logit contributions
 soon+0.017
 then+0.014
 least+0.013
 able+0.013
 trapped+0.011
Neg logit contributions
board-0.040
inner-0.039
ag-0.038
 medal-0.037
anger-0.036
Token.
Token ID13
Feature activation+9.87e-04

Pos logit contributions
 soon+0.005
 then+0.004
 able+0.003
 least+0.003
 stuck+0.003
Neg logit contributions
board-0.014
inner-0.014
ag-0.013
affle-0.013
ool-0.013
Token He
Token ID679
Feature activation+2.48e-03

Pos logit contributions
 soon+0.001
 then+0.001
 Then+0.001
+0.001
 so+0.001
Neg logit contributions
inner-0.002
board-0.002
ade-0.002
ag-0.002
ool-0.002
Token wasn
Token ID2492
Feature activation+4.70e-02

Pos logit contributions
 soon+0.003
 then+0.003
 stuck+0.002
 suddenly+0.002
 able+0.002
Neg logit contributions
inner-0.006
board-0.005
ool-0.005
ag-0.005
oker-0.005
Token't
Token ID470
Feature activation+1.19e-02

Pos logit contributions
 soon+0.063
 then+0.055
 hoping+0.054
 Then+0.054
 least+0.053
Neg logit contributions
inner-0.084
ag-0.077
board-0.076
ork-0.076
 medal-0.075
Token as
Token ID355
Feature activation-1.04e-02

Pos logit contributions
 soon+0.011
 then+0.009
 least+0.007
 Then+0.007
 able+0.007
Neg logit contributions
board-0.028
inner-0.028
ag-0.027
 medal-0.026
oker-0.026
Token fast
Token ID3049
Feature activation+1.50e-02

Pos logit contributions
board+0.023
inner+0.022
ag+0.021
ork+0.020
anger+0.020
Neg logit contributions
 soon-0.013
 then-0.011
 able-0.009
 least-0.009
 stuck-0.009
Token as
Token ID355
Feature activation-5.43e-03

Pos logit contributions
 soon+0.016
 then+0.013
 least+0.012
 able+0.011
 afraid+0.010
Neg logit contributions
board-0.033
inner-0.033
ag-0.031
 medal-0.031
anger-0.030
Token before
Token ID878
Feature activation-3.20e-03

Pos logit contributions
board+0.010
inner+0.010
ag+0.009
ork+0.009
 medal+0.009
Neg logit contributions
 soon-0.008
 then-0.007
 able-0.006
 least-0.006
 stuck-0.006
Token wanted
Token ID2227
Feature activation+1.48e-02

Pos logit contributions
 soon+0.016
 then+0.014
 able+0.012
 least+0.012
 stuck+0.011
Neg logit contributions
board-0.027
inner-0.027
ag-0.025
ork-0.024
anger-0.024
Token to
Token ID284
Feature activation-4.03e-03

Pos logit contributions
 soon+0.018
 then+0.015
 able+0.013
 least+0.013
 Then+0.012
Neg logit contributions
board-0.031
inner-0.030
ag-0.029
ork-0.028
 medal-0.028
Token resist
Token ID4180
Feature activation+1.01e-02

Pos logit contributions
board+0.007
inner+0.007
ag+0.006
ork+0.006
 medal+0.006
Neg logit contributions
 soon-0.007
 then-0.006
 able-0.005
 least-0.005
 stuck-0.005
Token.
Token ID13
Feature activation-9.68e-04

Pos logit contributions
 soon+0.011
 then+0.009
 able+0.008
 least+0.008
 stuck+0.007
Neg logit contributions
board-0.022
inner-0.022
ag-0.021
 medal-0.020
ork-0.020
Token She
Token ID1375
Feature activation+1.10e-02

Pos logit contributions
board+0.002
inner+0.002
ag+0.002
ork+0.002
anger+0.002
Neg logit contributions
 soon-0.001
 able-0.001
 then-0.001
 least-0.001
 trapped-0.001
Token wasn
Token ID2492
Feature activation+4.69e-02

Pos logit contributions
 soon+0.013
 then+0.011
 able+0.009
 least+0.009
 stuck+0.009
Neg logit contributions
inner-0.024
board-0.024
ag-0.023
oker-0.022
anger-0.022
Token't
Token ID470
Feature activation+1.33e-02

Pos logit contributions
 soon+0.065
 then+0.056
 hoping+0.054
 Then+0.053
 able+0.052
Neg logit contributions
inner-0.084
ag-0.080
board-0.080
ork-0.078
 medal-0.076
Token sure
Token ID1654
Feature activation-1.76e-03

Pos logit contributions
 soon+0.012
 then+0.011
 least+0.010
 Then+0.009
 suddenly+0.008
Neg logit contributions
ag-0.029
board-0.027
 guys-0.027
 medal-0.027
inner-0.027
Token if
Token ID611
Feature activation-6.95e-04

Pos logit contributions
board+0.004
ag+0.003
inner+0.003
ool+0.003
ves+0.003
Neg logit contributions
 soon-0.002
 Amen-0.002
 least-0.002
 then-0.002
 trapped-0.001
Token she
Token ID673
Feature activation-6.26e-03

Pos logit contributions
board+0.001
inner+0.001
ag+0.001
 medal+0.001
ork+0.001
Neg logit contributions
 soon-0.001
 then-0.001
 least-0.001
 able-0.001
 Then-0.001
Token was
Token ID373
Feature activation+1.43e-02

Pos logit contributions
board+0.013
inner+0.013
ag+0.012
ork+0.011
anger+0.011
Neg logit contributions
 soon-0.008
 then-0.007
 able-0.006
 least-0.006
 stuck-0.006
Token hugs
Token ID40657
Feature activation-8.92e-03

Pos logit contributions
board+0.002
inner+0.002
ag+0.002
 medal+0.002
ork+0.002
Neg logit contributions
 soon-0.002
 then-0.002
 able-0.002
 least-0.002
 hoping-0.002
Token Ben
Token ID3932
Feature activation-6.21e-03

Pos logit contributions
board+0.017
inner+0.017
ag+0.015
ork+0.015
 medal+0.015
Neg logit contributions
 soon-0.013
 then-0.012
 able-0.010
 least-0.010
 stuck-0.010
Token.
Token ID13
Feature activation+2.39e-03

Pos logit contributions
board+0.013
ag+0.013
inner+0.012
 medal+0.012
ade+0.012
Neg logit contributions
 soon-0.007
 then-0.006
 least-0.006
 able-0.005
 suddenly-0.005
Token She
Token ID1375
Feature activation+3.58e-03

Pos logit contributions
 soon+0.004
 least+0.003
 then+0.003
 able+0.003
 trapped+0.003
Neg logit contributions
board-0.005
inner-0.004
anger-0.004
ag-0.004
oker-0.004
Token shares
Token ID7303
Feature activation-1.39e-02

Pos logit contributions
 soon+0.007
 able+0.006
 then+0.005
 least+0.005
 suddenly+0.005
Neg logit contributions
board-0.005
inner-0.005
 medal-0.005
affle-0.005
ag-0.005
Token the
Token ID262
Feature activation-5.28e-02

Pos logit contributions
board+0.026
inner+0.025
ag+0.024
 medal+0.023
ork+0.023
Neg logit contributions
 soon-0.019
 then-0.017
 able-0.016
 least-0.016
 trapped-0.015
Token blanket
Token ID18447
Feature activation-9.76e-03

Pos logit contributions
board+0.076
inner+0.074
ag+0.074
aff+0.073
affle+0.071
Neg logit contributions
 soon-0.095
 then-0.092
 stuck-0.082
 gently-0.081
 scared-0.080
Token.
Token ID13
Feature activation-5.39e-03

Pos logit contributions
board+0.022
inner+0.021
ag+0.021
ork+0.020
 medal+0.020
Neg logit contributions
 soon-0.010
 then-0.009
 least-0.008
 able-0.008
 hoping-0.007
Token She
Token ID1375
Feature activation-3.01e-03

Pos logit contributions
board+0.010
inner+0.010
ag+0.009
anger+0.009
ork+0.009
Neg logit contributions
 soon-0.008
 least-0.007
 able-0.007
 then-0.007
 trapped-0.006
Token forg
Token ID5784
Feature activation-1.94e-02

Pos logit contributions
board+0.005
inner+0.005
 medal+0.004
ade+0.004
affle+0.004
Neg logit contributions
 soon-0.005
 able-0.004
 then-0.004
 afraid-0.004
 least-0.004
Tokenives
Token ID1083
Feature activation-2.46e-02

Pos logit contributions
inner+0.023
board+0.022
 seas+0.021
affle+0.020
ool+0.020
Neg logit contributions
 soon-0.042
 then-0.039
 able-0.036
 gently-0.035
 Then-0.035
Token "
Token ID366
Feature activation+6.52e-03

Pos logit contributions
 least+0.000
 soon+0.000
 able+0.000
 trapped+0.000
 stuck+0.000
Neg logit contributions
board-0.000
anger-0.000
inner-0.000
ork-0.000
 medal-0.000
TokenMaybe
Token ID13300
Feature activation-6.53e-03

Pos logit contributions
 soon+0.008
 then+0.006
+0.005
 least+0.005
 suddenly+0.005
Neg logit contributions
inner-0.015
board-0.015
ag-0.014
ool-0.014
affle-0.014
Token we
Token ID356
Feature activation+2.92e-03

Pos logit contributions
board+0.012
inner+0.012
 seas+0.011
 medal+0.011
ag+0.011
Neg logit contributions
 soon-0.009
 able-0.008
 then-0.007
 least-0.007
 trapped-0.007
Token can
Token ID460
Feature activation+5.91e-03

Pos logit contributions
 soon+0.004
 then+0.004
 able+0.003
 least+0.003
 stuck+0.003
Neg logit contributions
board-0.006
inner-0.005
ag-0.005
ork-0.005
 medal-0.005
Token share
Token ID2648
Feature activation-1.52e-03

Pos logit contributions
 soon+0.010
 then+0.009
 able+0.009
 afraid+0.009
 trapped+0.009
Neg logit contributions
board-0.009
inner-0.008
ag-0.008
 trophy-0.008
ade-0.008
Token the
Token ID262
Feature activation-5.25e-02

Pos logit contributions
board+0.003
inner+0.003
ag+0.003
 medal+0.003
ork+0.003
Neg logit contributions
 soon-0.002
 then-0.002
 able-0.001
 least-0.001
 stuck-0.001
Token ball
Token ID2613
Feature activation+6.32e-03

Pos logit contributions
inner+0.070
board+0.069
ag+0.063
anger+0.061
oker+0.061
Neg logit contributions
 soon-0.104
 then-0.099
 stuck-0.089
 scared-0.087
 gently-0.087
Token with
Token ID351
Feature activation-1.55e-02

Pos logit contributions
 soon+0.006
 then+0.005
 able+0.004
 least+0.004
 hoping+0.004
Neg logit contributions
board-0.015
inner-0.015
ag-0.014
ork-0.014
 medal-0.014
Token him
Token ID683
Feature activation-1.37e-02

Pos logit contributions
board+0.035
inner+0.034
anger+0.032
ool+0.032
ork+0.031
Neg logit contributions
 soon-0.020
 then-0.018
 least-0.015
 now-0.014
 gently-0.014
Token.
Token ID13
Feature activation-1.43e-02

Pos logit contributions
board+0.033
inner+0.033
ag+0.031
ork+0.030
 medal+0.030
Neg logit contributions
 soon-0.013
 then-0.011
 able-0.009
 least-0.008
 stuck-0.008
Token Maybe
Token ID6674
Feature activation+4.26e-03

Pos logit contributions
board+0.026
inner+0.024
 guys+0.022
 similarities+0.022
anger+0.022
Neg logit contributions
 soon-0.022
 able-0.020
 then-0.017
 least-0.017
 trapped-0.017
Token to
Token ID284
Feature activation-2.07e-02

Pos logit contributions
board+0.031
inner+0.029
ork+0.029
ag+0.029
anger+0.027
Neg logit contributions
 soon-0.024
 then-0.023
 able-0.021
 least-0.020
 Amen-0.020
Token their
Token ID511
Feature activation-1.49e-02

Pos logit contributions
board+0.038
inner+0.037
ag+0.036
 medal+0.035
ork+0.034
Neg logit contributions
 soon-0.029
 then-0.025
 able-0.024
 stuck-0.022
 afraid-0.022
Token house
Token ID2156
Feature activation-2.05e-02

Pos logit contributions
board+0.024
inner+0.024
ag+0.022
ool+0.021
anger+0.021
Neg logit contributions
 soon-0.026
 then-0.024
 least-0.021
 able-0.021
 stuck-0.020
Token and
Token ID290
Feature activation-1.13e-02

Pos logit contributions
board+0.044
ork+0.039
inner+0.039
ag+0.039
 medal+0.037
Neg logit contributions
 soon-0.022
 least-0.019
 then-0.019
-0.018
 able-0.018
Token shared
Token ID4888
Feature activation-8.64e-03

Pos logit contributions
board+0.023
inner+0.022
ag+0.021
 medal+0.020
ork+0.020
Neg logit contributions
 soon-0.015
 then-0.013
 able-0.012
 least-0.011
 afraid-0.011
Token the
Token ID262
Feature activation-5.13e-02

Pos logit contributions
board+0.019
inner+0.019
ag+0.017
anger+0.017
ool+0.017
Neg logit contributions
 soon-0.011
 then-0.009
 able-0.007
 least-0.007
 gently-0.007
Token candy
Token ID18550
Feature activation-1.97e-02

Pos logit contributions
inner+0.065
board+0.064
anger+0.061
ag+0.061
ool+0.059
Neg logit contributions
 soon-0.101
 then-0.095
 least-0.085
 gently-0.085
 hoping-0.085
Token.
Token ID13
Feature activation-8.47e-03

Pos logit contributions
inner+0.049
board+0.049
ag+0.047
anger+0.046
affle+0.046
Neg logit contributions
 soon-0.017
 then-0.014
 able-0.010
 gently-0.010
 least-0.009
Token They
Token ID1119
Feature activation-7.23e-04

Pos logit contributions
board+0.012
inner+0.012
ork+0.012
 similarities+0.011
anger+0.010
Neg logit contributions
 soon-0.015
 able-0.014
 least-0.013
 then-0.013
 stuck-0.013
Token also
Token ID635
Feature activation-7.25e-03

Pos logit contributions
board+0.001
inner+0.001
ag+0.001
 medal+0.001
ade+0.001
Neg logit contributions
 soon-0.001
 then-0.001
 able-0.001
 least-0.001
 afraid-0.001
Token shared
Token ID4888
Feature activation-1.09e-02

Pos logit contributions
board+0.014
inner+0.013
ag+0.012
 medal+0.012
 seas+0.012
Neg logit contributions
 soon-0.010
 then-0.009
 able-0.008
 Then-0.008
 least-0.008
TokenMom
Token ID29252
Feature activation+2.67e-03

Pos logit contributions
 soon+0.009
 then+0.008
 least+0.007
 able+0.007
 stuck+0.007
Neg logit contributions
board-0.011
inner-0.011
ag-0.010
ork-0.010
anger-0.010
Tokenmy
Token ID1820
Feature activation+1.13e-02

Pos logit contributions
 soon+0.003
 then+0.002
 trapped+0.002
 able+0.002
 least+0.002
Neg logit contributions
board-0.006
inner-0.006
ag-0.005
oom-0.005
ade-0.005
Token,
Token ID11
Feature activation-1.28e-02

Pos logit contributions
 soon+0.010
 then+0.008
 able+0.007
 least+0.007
 trapped+0.006
Neg logit contributions
board-0.028
inner-0.027
ag-0.026
 medal-0.025
ork-0.025
Token what
Token ID644
Feature activation-2.53e-02

Pos logit contributions
board+0.027
ag+0.026
inner+0.026
ool+0.025
ves+0.025
Neg logit contributions
 soon-0.017
 then-0.015
 least-0.013
 now-0.013
 able-0.012
Token is
Token ID318
Feature activation-1.97e-02

Pos logit contributions
anger+0.053
inner+0.052
board+0.052
aff+0.051
ork+0.050
Neg logit contributions
 soon-0.030
-0.029
 then-0.025
 scared-0.025
 a-0.025
Token that
Token ID326
Feature activation-5.12e-02

Pos logit contributions
board+0.045
inner+0.044
ag+0.041
ork+0.041
 pleasures+0.041
Neg logit contributions
 soon-0.022
 then-0.020
 able-0.016
 stuck-0.016
 least-0.016
Token?"
Token ID1701
Feature activation-2.53e-03

Pos logit contributions
board+0.105
inner+0.105
ag+0.099
ork+0.099
 similarities+0.098
Neg logit contributions
 soon-0.066
 then-0.059
 able-0.049
 now-0.048
 least-0.048
Token His
Token ID2399
Feature activation+9.01e-03

Pos logit contributions
inner+0.005
board+0.005
ag+0.005
ool+0.005
 medal+0.005
Neg logit contributions
 soon-0.003
 then-0.003
 Then-0.002
 able-0.002
 least-0.002
Token mom
Token ID1995
Feature activation-5.89e-03

Pos logit contributions
 soon+0.014
 then+0.012
 able+0.012
 least+0.011
 hoping+0.011
Neg logit contributions
board-0.016
inner-0.016
 medal-0.015
 trophy-0.015
ag-0.015
Token replied
Token ID8712
Feature activation-1.52e-02

Pos logit contributions
board+0.012
inner+0.011
ag+0.010
affle+0.010
ool+0.010
Neg logit contributions
 soon-0.008
 able-0.007
 then-0.007
 least-0.006
 trapped-0.006
Token,
Token ID11
Feature activation-7.18e-03

Pos logit contributions
board+0.037
inner+0.036
ag+0.034
ork+0.034
anger+0.033
Neg logit contributions
 soon-0.014
 then-0.011
 least-0.010
 able-0.009
 trapped-0.009
Token asked
Token ID1965
Feature activation-4.58e-03

Pos logit contributions
 soon+0.017
 least+0.015
 then+0.014
 able+0.013
 trapped+0.013
Neg logit contributions
board-0.025
inner-0.023
 medal-0.022
 som-0.022
 seas-0.022
Token,
Token ID11
Feature activation-1.32e-04

Pos logit contributions
board+0.011
inner+0.011
ag+0.010
ork+0.010
anger+0.010
Neg logit contributions
 soon-0.004
 then-0.004
 able-0.003
 least-0.003
 stuck-0.003
Token "
Token ID366
Feature activation+1.32e-02

Pos logit contributions
board+0.000
inner+0.000
 medal+0.000
 som+0.000
books+0.000
Neg logit contributions
 soon-0.000
 least-0.000
 able-0.000
 trapped-0.000
 then-0.000
TokenWhat
Token ID2061
Feature activation-5.63e-03

Pos logit contributions
 soon+0.016
 able+0.014
 then+0.014
 trapped+0.013
 afraid+0.013
Neg logit contributions
board-0.028
inner-0.027
ork-0.024
 seas-0.024
 medal-0.024
Token is
Token ID318
Feature activation-2.25e-02

Pos logit contributions
inner+0.011
board+0.011
ag+0.011
anger+0.011
affle+0.010
Neg logit contributions
 soon-0.008
 then-0.007
-0.006
 now-0.006
 so-0.006
Token that
Token ID326
Feature activation-5.12e-02

Pos logit contributions
inner+0.045
board+0.045
 medal+0.042
ag+0.041
 seas+0.041
Neg logit contributions
 soon-0.030
 then-0.025
 able-0.022
 least-0.022
 Then-0.020
Token well
Token ID880
Feature activation-2.55e-02

Pos logit contributions
inner+0.111
board+0.110
 similarities+0.108
ag+0.107
ests+0.106
Neg logit contributions
 soon-0.061
 then-0.055
 so-0.049
 now-0.049
 scared-0.048
Token mom
Token ID1995
Feature activation-2.25e-02

Pos logit contributions
board+0.060
inner+0.059
ag+0.056
 similarities+0.055
ork+0.055
Neg logit contributions
 soon-0.027
 then-0.022
 able-0.017
 least-0.017
 now-0.016
Tokenmy
Token ID1820
Feature activation-2.25e-02

Pos logit contributions
board+0.047
inner+0.045
ag+0.042
 medal+0.041
ade+0.041
Neg logit contributions
 soon-0.027
 then-0.023
 able-0.021
 Then-0.020
 least-0.020
Token?".
Token ID43634
Feature activation+1.15e-02

Pos logit contributions
board+0.047
inner+0.046
ag+0.043
 medal+0.042
 guys+0.041
Neg logit contributions
 soon-0.027
 then-0.023
 able-0.021
 least-0.020
 Then-0.020
Token His
Token ID2399
Feature activation+1.80e-02

Pos logit contributions
 soon+0.017
 then+0.015
 able+0.013
 least+0.013
 Then+0.013
Neg logit contributions
inner-0.022
board-0.022
ag-0.020
ade-0.020
oker-0.020
Token owl
Token ID39610
Feature activation-1.44e-02

Pos logit contributions
inner+0.015
board+0.015
affle+0.013
ade+0.013
oker+0.013
Neg logit contributions
 soon-0.008
 then-0.007
 stuck-0.006
 able-0.006
 afraid-0.006
Token and
Token ID290
Feature activation+5.00e-04

Pos logit contributions
board+0.034
inner+0.032
ag+0.032
 medal+0.031
ork+0.031
Neg logit contributions
 soon-0.014
 then-0.011
 least-0.010
 able-0.009
 stuck-0.008
Token decided
Token ID3066
Feature activation-1.23e-02

Pos logit contributions
 soon+0.001
 then+0.001
 able+0.001
 least+0.001
 stuck+0.000
Neg logit contributions
board-0.001
inner-0.001
ag-0.001
 medal-0.001
ork-0.001
Token to
Token ID284
Feature activation-1.04e-02

Pos logit contributions
board+0.026
inner+0.025
ag+0.024
 medal+0.024
ork+0.024
Neg logit contributions
 soon-0.014
 then-0.012
 least-0.011
 able-0.011
 trapped-0.010
Token share
Token ID2648
Feature activation-1.16e-03

Pos logit contributions
board+0.018
inner+0.017
ag+0.016
anger+0.015
ork+0.015
Neg logit contributions
 soon-0.018
 then-0.016
 able-0.014
 least-0.014
 stuck-0.014
Token the
Token ID262
Feature activation-5.11e-02

Pos logit contributions
board+0.003
inner+0.003
anger+0.002
ag+0.002
ool+0.002
Neg logit contributions
 soon-0.001
 then-0.001
 able-0.001
 least-0.001
 stuck-0.001
Token metal
Token ID6147
Feature activation-2.41e-02

Pos logit contributions
board+0.086
inner+0.086
ag+0.079
oker+0.078
anger+0.076
Neg logit contributions
 soon-0.082
 then-0.077
 stuck-0.071
 now-0.068
 gently-0.068
Token toy
Token ID13373
Feature activation+7.85e-03

Pos logit contributions
inner+0.045
board+0.043
ag+0.041
oker+0.040
affle+0.040
Neg logit contributions
 soon-0.037
 then-0.033
 gently-0.029
 stuck-0.028
 now-0.028
Token.
Token ID13
Feature activation-2.22e-03

Pos logit contributions
 soon+0.008
 then+0.006
 able+0.005
 least+0.005
 afraid+0.005
Neg logit contributions
board-0.018
inner-0.018
ag-0.017
 medal-0.017
ork-0.017
Token They
Token ID1119
Feature activation-2.98e-04

Pos logit contributions
board+0.005
inner+0.005
ag+0.004
ork+0.004
anger+0.004
Neg logit contributions
 soon-0.003
 then-0.002
 able-0.002
 least-0.002
 stuck-0.002
Token played
Token ID2826
Feature activation-1.21e-02

Pos logit contributions
inner+0.001
board+0.001
ag+0.001
anger+0.001
ork+0.001
Neg logit contributions
 soon-0.000
 then-0.000
 able-0.000
 least-0.000
 stuck-0.000
Token the
Token ID262
Feature activation-4.28e-02

Pos logit contributions
board+0.037
inner+0.037
ag+0.034
anger+0.034
ool+0.034
Neg logit contributions
 soon-0.017
 then-0.014
 able-0.011
 least-0.011
 stuck-0.010
Token mess
Token ID2085
Feature activation-1.34e-02

Pos logit contributions
board+0.067
inner+0.066
affe+0.064
 seas+0.064
ool+0.063
Neg logit contributions
 soon-0.071
 then-0.065
 over-0.060
 now-0.058
 scared-0.057
Token and
Token ID290
Feature activation-1.52e-02

Pos logit contributions
board+0.031
inner+0.031
ag+0.029
ork+0.028
ade+0.028
Neg logit contributions
 soon-0.013
 then-0.011
 least-0.010
 able-0.010
 stuck-0.009
Token to
Token ID284
Feature activation-1.20e-02

Pos logit contributions
board+0.031
inner+0.031
ag+0.029
anger+0.029
ool+0.028
Neg logit contributions
 soon-0.021
 then-0.019
 least-0.016
 able-0.016
 stuck-0.015
Token share
Token ID2648
Feature activation-4.69e-03

Pos logit contributions
board+0.022
inner+0.022
ag+0.020
ork+0.019
 medal+0.019
Neg logit contributions
 soon-0.018
 then-0.016
 able-0.014
 least-0.014
 stuck-0.014
Token the
Token ID262
Feature activation-5.08e-02

Pos logit contributions
board+0.011
inner+0.011
ag+0.010
anger+0.010
ool+0.010
Neg logit contributions
 soon-0.005
 then-0.004
 able-0.003
 least-0.003
 stuck-0.003
Token music
Token ID2647
Feature activation+8.40e-03

Pos logit contributions
board+0.075
inner+0.071
ool+0.070
aff+0.070
affe+0.069
Neg logit contributions
 soon-0.091
 then-0.088
 scared-0.078
 very-0.077
 now-0.077
Token.
Token ID13
Feature activation+1.41e-04

Pos logit contributions
 soon+0.008
 then+0.006
 able+0.005
 least+0.005
 stuck+0.004
Neg logit contributions
board-0.021
inner-0.020
ag-0.019
ork-0.019
anger-0.019
Token She
Token ID1375
Feature activation+3.95e-03

Pos logit contributions
 soon+0.000
 able+0.000
 least+0.000
 then+0.000
 trapped+0.000
Neg logit contributions
board-0.000
inner-0.000
ork-0.000
 People-0.000
 similarities-0.000
Token says
Token ID1139
Feature activation+9.39e-03

Pos logit contributions
 soon+0.006
 able+0.005
 then+0.005
 suddenly+0.005
 afraid+0.005
Neg logit contributions
board-0.007
inner-0.007
 medal-0.006
ade-0.006
ag-0.006
Token they
Token ID484
Feature activation+2.76e-03

Pos logit contributions
 soon+0.011
 then+0.010
 able+0.008
 least+0.008
 stuck+0.008
Neg logit contributions
board-0.020
inner-0.020
ag-0.019
ork-0.018
 medal-0.018
Token outside
Token ID2354
Feature activation-2.00e-02

Pos logit contributions
board+0.022
inner+0.022
ag+0.021
ork+0.020
ool+0.020
Neg logit contributions
 soon-0.011
 then-0.009
 able-0.008
 least-0.008
 stuck-0.007
Token to
Token ID284
Feature activation-2.30e-02

Pos logit contributions
board+0.039
inner+0.037
ag+0.036
oker+0.034
ork+0.034
Neg logit contributions
 soon-0.025
 then-0.023
 able-0.021
 least-0.021
 trapped-0.019
Token play
Token ID711
Feature activation+5.14e-04

Pos logit contributions
board+0.038
inner+0.037
ag+0.034
ork+0.033
anger+0.033
Neg logit contributions
 soon-0.040
 then-0.036
 least-0.032
 able-0.032
 stuck-0.031
Token with
Token ID351
Feature activation-8.09e-03

Pos logit contributions
 soon+0.000
 then+0.000
 able+0.000
 stuck+0.000
 gently+0.000
Neg logit contributions
inner-0.001
board-0.001
ool-0.001
anger-0.001
oker-0.001
Token her
Token ID607
Feature activation-2.59e-02

Pos logit contributions
board+0.017
inner+0.017
ag+0.016
ade+0.016
ork+0.016
Neg logit contributions
 soon-0.010
 then-0.008
 able-0.007
 least-0.007
 stuck-0.007
Token yellow
Token ID7872
Feature activation-5.07e-02

Pos logit contributions
board+0.027
inner+0.025
 medal+0.023
ag+0.023
 trophy+0.022
Neg logit contributions
 soon-0.059
 then-0.054
 able-0.053
 least-0.052
 hoping-0.050
Token ball
Token ID2613
Feature activation+2.67e-03

Pos logit contributions
inner+0.074
board+0.067
oker+0.066
ag+0.064
 similarities+0.062
Neg logit contributions
 soon-0.096
 then-0.092
 stuck-0.082
 gently-0.081
 Then-0.079
Token.
Token ID13
Feature activation-2.45e-02

Pos logit contributions
 soon+0.003
 then+0.002
 least+0.002
 able+0.002
 Then+0.002
Neg logit contributions
board-0.006
inner-0.006
ag-0.006
ade-0.006
ool-0.005
Token She
Token ID1375
Feature activation-1.14e-02

Pos logit contributions
inner+0.058
board+0.056
ag+0.055
 medal+0.055
 seas+0.054
Neg logit contributions
 soon-0.024
 then-0.021
 Then-0.017
 able-0.015
 gently-0.015
Token threw
Token ID9617
Feature activation-1.44e-02

Pos logit contributions
board+0.023
inner+0.023
ag+0.021
anger+0.021
ork+0.021
Neg logit contributions
 soon-0.016
 then-0.014
 able-0.012
 least-0.012
 stuck-0.012
Token it
Token ID340
Feature activation-1.68e-02

Pos logit contributions
board+0.031
inner+0.030
ag+0.028
 medal+0.028
ork+0.028
Neg logit contributions
 soon-0.018
 then-0.015
 able-0.013
 least-0.013
 stuck-0.012
Tokenmy
Token ID1820
Feature activation-1.29e-02

Pos logit contributions
board+0.023
inner+0.021
ag+0.020
ade+0.019
 guys+0.019
Neg logit contributions
 soon-0.013
 trapped-0.012
 terrified-0.012
 able-0.011
 stuck-0.011
Token,
Token ID11
Feature activation-5.69e-03

Pos logit contributions
board+0.029
inner+0.028
ag+0.026
 medal+0.026
 trophy+0.026
Neg logit contributions
 soon-0.014
 then-0.011
 able-0.010
 trapped-0.010
 least-0.010
Token "
Token ID366
Feature activation+8.77e-03

Pos logit contributions
board+0.009
inner+0.008
ag+0.007
 trophy+0.007
 medal+0.007
Neg logit contributions
 soon-0.010
 then-0.008
 least-0.008
 trapped-0.008
 able-0.008
TokenWhat
Token ID2061
Feature activation-8.21e-03

Pos logit contributions
 soon+0.012
 able+0.011
 afraid+0.010
 trapped+0.010
 then+0.010
Neg logit contributions
board-0.017
inner-0.017
 seas-0.015
 guys-0.015
 medal-0.014
Token is
Token ID318
Feature activation-2.57e-02

Pos logit contributions
board+0.016
inner+0.016
ag+0.015
affle+0.015
anger+0.015
Neg logit contributions
 soon-0.012
 then-0.010
-0.009
 able-0.009
 stuck-0.009
Token that
Token ID326
Feature activation-5.06e-02

Pos logit contributions
board+0.055
inner+0.054
ag+0.051
 medal+0.050
ork+0.049
Neg logit contributions
 soon-0.032
 then-0.027
 able-0.023
 least-0.023
 stuck-0.021
Token?"
Token ID1701
Feature activation+9.21e-03

Pos logit contributions
inner+0.098
board+0.097
ag+0.091
ork+0.089
 similarities+0.089
Neg logit contributions
 soon-0.072
 then-0.064
 least-0.054
 able-0.054
 now-0.053
Token
Token ID220
Feature activation+6.42e-03

Pos logit contributions
 soon+0.015
 able+0.013
 least+0.013
 trapped+0.013
 then+0.012
Neg logit contributions
board-0.016
inner-0.014
ag-0.014
ork-0.013
anger-0.013
Token
Token ID198
Feature activation-2.87e-03

Pos logit contributions
 soon+0.007
 then+0.006
 least+0.006
 able+0.005
 trapped+0.005
Neg logit contributions
inner-0.014
board-0.014
ork-0.014
ag-0.013
anger-0.013
Token
Token ID198
Feature activation-6.71e-03

Pos logit contributions
board+0.006
inner+0.006
ork+0.006
ag+0.006
ves+0.006
Neg logit contributions
 soon-0.003
 least-0.003
 able-0.002
 then-0.002
 trapped-0.002
Token"
Token ID1
Feature activation-7.54e-03

Pos logit contributions
board+0.014
inner+0.014
ag+0.013
ork+0.013
anger+0.013
Neg logit contributions
 soon-0.008
 then-0.007
 able-0.006
 least-0.006
 stuck-0.006
Token Sam
Token ID3409
Feature activation-1.26e-02

Pos logit contributions
board+0.019
inner+0.018
ag+0.018
 medal+0.017
 trophy+0.017
Neg logit contributions
 soon-0.014
 then-0.012
 least-0.012
 able-0.011
 trapped-0.010
Token,
Token ID11
Feature activation-1.28e-02

Pos logit contributions
board+0.027
ag+0.027
inner+0.026
 medal+0.025
 guys+0.025
Neg logit contributions
 soon-0.013
 then-0.011
 able-0.010
 least-0.010
 Then-0.010
Token "
Token ID366
Feature activation+2.87e-03

Pos logit contributions
board+0.021
inner+0.019
ag+0.018
 guys+0.018
 medal+0.018
Neg logit contributions
 soon-0.019
 least-0.016
 then-0.016
 able-0.015
 Amen-0.014
TokenWhat
Token ID2061
Feature activation-6.81e-03

Pos logit contributions
 soon+0.004
 able+0.004
 then+0.003
 trapped+0.003
 afraid+0.003
Neg logit contributions
board-0.006
inner-0.005
 seas-0.005
ade-0.005
 guys-0.005
Token is
Token ID318
Feature activation-2.13e-02

Pos logit contributions
inner+0.013
board+0.013
anger+0.012
ag+0.012
affle+0.012
Neg logit contributions
 soon-0.010
 then-0.008
-0.007
 able-0.007
 stuck-0.007
Token that
Token ID326
Feature activation-5.04e-02

Pos logit contributions
board+0.042
inner+0.042
 medal+0.039
ag+0.039
 seas+0.039
Neg logit contributions
 soon-0.029
 then-0.024
 able-0.022
 least-0.022
 afraid-0.020
Token orange
Token ID10912
Feature activation-3.56e-02

Pos logit contributions
board+0.104
inner+0.103
ag+0.097
ork+0.094
anger+0.093
Neg logit contributions
 soon-0.065
 then-0.057
 able-0.048
 least-0.048
 stuck-0.047
Token ball
Token ID2613
Feature activation-1.13e-02

Pos logit contributions
inner+0.085
 similarities+0.085
ag+0.084
board+0.082
kids+0.081
Neg logit contributions
 soon-0.033
 then-0.030
 now-0.024
 stuck-0.022
 so-0.022
Token in
Token ID287
Feature activation-4.92e-03

Pos logit contributions
inner+0.026
board+0.026
ag+0.025
 similarities+0.024
ork+0.024
Neg logit contributions
 soon-0.012
 then-0.010
 able-0.008
 least-0.008
 stuck-0.008
Token the
Token ID262
Feature activation-1.28e-02

Pos logit contributions
board+0.010
inner+0.010
ag+0.010
 medal+0.010
ork+0.009
Neg logit contributions
 soon-0.006
 then-0.005
 able-0.005
 least-0.004
 hoping-0.004
Token sky
Token ID6766
Feature activation-1.17e-02

Pos logit contributions
inner+0.020
board+0.019
ag+0.018
oker+0.017
affle+0.017
Neg logit contributions
 soon-0.023
 then-0.021
 least-0.019
 stuck-0.019
 able-0.019
Token friend
Token ID1545
Feature activation-1.17e-02

Pos logit contributions
 soon+0.014
 then+0.013
 least+0.012
 able+0.012
 hoping+0.012
Neg logit contributions
board-0.011
inner-0.010
 medal-0.010
ag-0.010
ade-0.010
Token,
Token ID11
Feature activation-1.11e-02

Pos logit contributions
board+0.027
inner+0.026
ag+0.025
 guys+0.024
ade+0.024
Neg logit contributions
 soon-0.012
 then-0.010
 least-0.009
 able-0.008
 trapped-0.008
Token "
Token ID366
Feature activation+3.52e-03

Pos logit contributions
board+0.019
ag+0.019
inner+0.018
 medal+0.018
 trophy+0.017
Neg logit contributions
 soon-0.016
 then-0.014
 least-0.013
 able-0.012
 trapped-0.012
TokenWhat
Token ID2061
Feature activation-7.16e-03

Pos logit contributions
 soon+0.004
 then+0.004
 able+0.004
 afraid+0.003
 trapped+0.003
Neg logit contributions
board-0.007
inner-0.007
 seas-0.007
 medal-0.006
ag-0.006
Token's
Token ID338
Feature activation-2.65e-02

Pos logit contributions
board+0.014
inner+0.014
ag+0.013
anger+0.012
affle+0.012
Neg logit contributions
 soon-0.011
 then-0.009
 able-0.008
-0.008
 stuck-0.008
Token that
Token ID326
Feature activation-5.03e-02

Pos logit contributions
board+0.061
inner+0.060
ag+0.056
ork+0.055
anger+0.054
Neg logit contributions
 soon-0.029
 then-0.025
 able-0.021
 least-0.020
 stuck-0.020
Token big
Token ID1263
Feature activation-2.95e-02

Pos logit contributions
board+0.096
inner+0.096
ag+0.090
ork+0.087
 similarities+0.087
Neg logit contributions
 soon-0.071
 then-0.064
 least-0.055
 able-0.054
 stuck-0.054
Token ball
Token ID2613
Feature activation-1.44e-02

Pos logit contributions
inner+0.051
board+0.049
ag+0.049
 similarities+0.047
affle+0.046
Neg logit contributions
 soon-0.048
 then-0.045
 stuck-0.040
 now-0.039
 least-0.038
Token up
Token ID510
Feature activation-1.28e-02

Pos logit contributions
inner+0.033
board+0.033
ag+0.031
ork+0.031
 similarities+0.031
Neg logit contributions
 soon-0.015
 then-0.013
 able-0.010
 least-0.010
 stuck-0.010
Token there
Token ID612
Feature activation-8.33e-03

Pos logit contributions
inner+0.024
board+0.024
ag+0.023
 medal+0.023
 seas+0.023
Neg logit contributions
 soon-0.015
 then-0.014
 Then-0.013
 able-0.013
 least-0.013
Token?"
Token ID1701
Feature activation-3.41e-03

Pos logit contributions
board+0.017
inner+0.017
ag+0.016
 medal+0.016
ade+0.015
Neg logit contributions
 soon-0.010
 then-0.009
 able-0.008
 least-0.007
 Then-0.007
Token to
Token ID284
Feature activation-2.00e-02

Pos logit contributions
inner+0.021
board+0.021
ool+0.019
ork+0.019
ag+0.019
Neg logit contributions
 soon-0.009
 then-0.007
 able-0.006
 least-0.006
 stuck-0.006
Token the
Token ID262
Feature activation-1.42e-02

Pos logit contributions
board+0.046
inner+0.046
ag+0.043
ool+0.042
anger+0.042
Neg logit contributions
 soon-0.022
 then-0.019
 least-0.016
 able-0.015
 stuck-0.015
Token park
Token ID3952
Feature activation-1.14e-02

Pos logit contributions
inner+0.022
board+0.022
ag+0.021
ool+0.020
 pleasures+0.020
Neg logit contributions
 soon-0.025
 then-0.023
 least-0.021
 now-0.020
 stuck-0.020
Token with
Token ID351
Feature activation-5.67e-03

Pos logit contributions
board+0.026
inner+0.025
ag+0.024
ork+0.023
 medal+0.023
Neg logit contributions
 soon-0.012
 then-0.010
 able-0.009
 least-0.009
 stuck-0.008
Token his
Token ID465
Feature activation-1.46e-02

Pos logit contributions
board+0.008
inner+0.007
ag+0.007
 seas+0.007
ade+0.007
Neg logit contributions
 soon-0.011
 then-0.010
 least-0.009
 able-0.009
 trapped-0.009
Token red
Token ID2266
Feature activation-4.99e-02

Pos logit contributions
board+0.022
inner+0.021
ag+0.019
 medal+0.019
 trophy+0.019
Neg logit contributions
 soon-0.027
 then-0.024
 able-0.023
 least-0.023
 hoping-0.021
Token ball
Token ID2613
Feature activation+3.59e-03

Pos logit contributions
inner+0.075
oker+0.069
board+0.067
ag+0.065
affle+0.064
Neg logit contributions
 soon-0.091
 then-0.088
 stuck-0.078
 gently-0.077
 over-0.076
Token.
Token ID13
Feature activation-2.67e-02

Pos logit contributions
 soon+0.004
 then+0.003
 able+0.002
 least+0.002
 afraid+0.002
Neg logit contributions
board-0.009
inner-0.008
ag-0.008
 medal-0.008
anger-0.008
Token At
Token ID1629
Feature activation-5.17e-03

Pos logit contributions
inner+0.056
 seas+0.053
ag+0.053
oker+0.053
 medal+0.053
Neg logit contributions
 soon-0.032
 then-0.030
 Then-0.029
 so-0.025
 over-0.024
Token the
Token ID262
Feature activation-1.63e-02

Pos logit contributions
inner+0.009
board+0.009
 medal+0.008
 seas+0.008
 trophy+0.008
Neg logit contributions
 soon-0.007
 able-0.007
 afraid-0.006
-0.006
 Then-0.006
Token park
Token ID3952
Feature activation-1.53e-02

Pos logit contributions
inner+0.026
board+0.025
ool+0.023
anger+0.023
ork+0.023
Neg logit contributions
 soon-0.029
 then-0.027
 least-0.024
 now-0.023
 so-0.023
Token father
Token ID2988
Feature activation-1.21e-02

Pos logit contributions
 soon+0.023
 then+0.020
 able+0.019
 least+0.019
 stuck+0.018
Neg logit contributions
board-0.021
inner-0.020
 medal-0.018
ag-0.018
 trophy-0.018
Token,
Token ID11
Feature activation-7.07e-03

Pos logit contributions
board+0.026
inner+0.025
ag+0.024
 medal+0.023
 trophy+0.023
Neg logit contributions
 soon-0.013
 then-0.011
 trapped-0.010
 least-0.010
 able-0.010
Token "
Token ID366
Feature activation+5.44e-03

Pos logit contributions
board+0.011
inner+0.010
ag+0.010
 medal+0.010
 trophy+0.009
Neg logit contributions
 soon-0.012
 then-0.010
 least-0.010
 able-0.010
 trapped-0.009
TokenWhat
Token ID2061
Feature activation-8.13e-03

Pos logit contributions
 soon+0.007
 then+0.006
 able+0.006
 trapped+0.006
 afraid+0.005
Neg logit contributions
board-0.011
inner-0.011
 seas-0.010
 medal-0.010
 guys-0.010
Token is
Token ID318
Feature activation-2.00e-02

Pos logit contributions
inner+0.016
board+0.016
anger+0.015
ag+0.015
affle+0.015
Neg logit contributions
 soon-0.011
 then-0.010
-0.009
 stuck-0.009
 never-0.009
Token that
Token ID326
Feature activation-4.96e-02

Pos logit contributions
board+0.042
inner+0.042
ag+0.039
 medal+0.038
ork+0.038
Neg logit contributions
 soon-0.025
 then-0.021
 able-0.018
 least-0.018
 stuck-0.017
Token?"
Token ID1701
Feature activation+6.68e-03

Pos logit contributions
board+0.098
inner+0.096
ag+0.090
ork+0.088
affle+0.087
Neg logit contributions
 soon-0.069
 then-0.061
 able-0.053
 least-0.052
 stuck-0.050
Token
Token ID198
Feature activation+7.27e-04

Pos logit contributions
 soon+0.010
 then+0.009
 able+0.008
 least+0.008
 stuck+0.008
Neg logit contributions
board-0.012
inner-0.012
ag-0.011
ork-0.011
 medal-0.011
Token
Token ID198
Feature activation-1.69e-04

Pos logit contributions
 soon+0.001
 then+0.001
 least+0.001
 able+0.001
 trapped+0.001
Neg logit contributions
board-0.002
inner-0.002
ork-0.002
ag-0.002
ves-0.002
TokenHis
Token ID6653
Feature activation+3.10e-02

Pos logit contributions
inner+0.000
board+0.000
ag+0.000
anger+0.000
ool+0.000
Neg logit contributions
 soon-0.000
 then-0.000
 able-0.000
 least-0.000
-0.000
Token father
Token ID2988
Feature activation-1.12e-02

Pos logit contributions
 soon+0.051
 then+0.046
 able+0.045
 trapped+0.043
 least+0.043
Neg logit contributions
board-0.050
 medal-0.049
inner-0.046
 trophy-0.045
 guys-0.045
Token went
Token ID1816
Feature activation-9.27e-03

Pos logit contributions
board+0.027
inner+0.026
ag+0.024
ork+0.024
ade+0.024
Neg logit contributions
 soon-0.015
 then-0.013
 able-0.011
 least-0.011
 afraid-0.010
Token home
Token ID1363
Feature activation-2.60e-02

Pos logit contributions
board+0.019
inner+0.018
ag+0.018
 medal+0.017
ade+0.017
Neg logit contributions
 soon-0.012
 then-0.011
 least-0.010
 able-0.009
-0.009
Token and
Token ID290
Feature activation-1.58e-02

Pos logit contributions
board+0.052
inner+0.048
ork+0.047
ag+0.047
anger+0.045
Neg logit contributions
 soon-0.030
 then-0.028
 least-0.027
-0.025
 Amen-0.024
Token remembered
Token ID12086
Feature activation-1.04e-02

Pos logit contributions
board+0.033
inner+0.032
ag+0.030
ork+0.029
 medal+0.029
Neg logit contributions
 soon-0.020
 then-0.017
 able-0.015
 least-0.015
 stuck-0.014
Token the
Token ID262
Feature activation-2.84e-02

Pos logit contributions
board+0.022
inner+0.022
ag+0.021
 medal+0.020
ork+0.020
Neg logit contributions
 soon-0.012
 then-0.010
 least-0.009
 able-0.009
 Then-0.008
Token bright
Token ID6016
Feature activation-4.95e-02

Pos logit contributions
board+0.044
inner+0.044
ag+0.043
ocol+0.042
oons+0.042
Neg logit contributions
 soon-0.047
 then-0.046
 scared-0.042
 stuck-0.041
 now-0.041
Token plastic
Token ID7309
Feature activation-2.34e-02

Pos logit contributions
inner+0.095
ork+0.095
board+0.092
anger+0.091
affle+0.090
Neg logit contributions
 then-0.067
 soon-0.063
 over-0.057
 now-0.055
 sometimes-0.053
Token ball
Token ID2613
Feature activation-6.72e-03

Pos logit contributions
ag+0.040
ocol+0.039
inner+0.038
 similarities+0.037
oph+0.036
Neg logit contributions
 soon-0.038
 then-0.036
 stuck-0.034
 together-0.033
 now-0.033
Token.
Token ID13
Feature activation-2.24e-02

Pos logit contributions
inner+0.018
board+0.017
affle+0.017
ag+0.017
 similarities+0.016
Neg logit contributions
 soon-0.006
 then-0.004
 able-0.003
 stuck-0.003
 now-0.003
Token She
Token ID1375
Feature activation-5.70e-03

Pos logit contributions
inner+0.052
board+0.051
ag+0.050
 seas+0.049
 medal+0.049
Neg logit contributions
 soon-0.023
 then-0.019
 Then-0.016
 able-0.015
 now-0.014
Token looked
Token ID3114
Feature activation-6.99e-03

Pos logit contributions
board+0.012
inner+0.011
 medal+0.010
ade+0.010
ork+0.010
Neg logit contributions
 soon-0.007
 then-0.006
 able-0.006
 least-0.006
 afraid-0.005
Token
Token ID198
Feature activation-5.21e-04

Pos logit contributions
 soon+0.001
 then+0.000
 able+0.000
 least+0.000
 stuck+0.000
Neg logit contributions
board-0.001
inner-0.001
ag-0.001
ork-0.001
anger-0.001
Token
Token ID198
Feature activation-1.50e-03

Pos logit contributions
board+0.001
inner+0.001
ag+0.001
ork+0.001
ade+0.001
Neg logit contributions
 soon-0.000
 then-0.000
 least-0.000
 able-0.000
 stuck-0.000
TokenThe
Token ID464
Feature activation-1.59e-02

Pos logit contributions
board+0.003
inner+0.003
anger+0.003
ag+0.003
ool+0.003
Neg logit contributions
 soon-0.002
 then-0.001
-0.001
 never-0.001
 suddenly-0.001
Token boy
Token ID2933
Feature activation+2.58e-03

Pos logit contributions
board+0.022
inner+0.022
ag+0.021
anger+0.020
ork+0.020
Neg logit contributions
 soon-0.031
 then-0.029
 least-0.026
 stuck-0.025
 now-0.025
Token shared
Token ID4888
Feature activation-1.36e-02

Pos logit contributions
 soon+0.003
 then+0.003
 able+0.002
 least+0.002
 stuck+0.002
Neg logit contributions
inner-0.006
board-0.006
ag-0.005
anger-0.005
ool-0.005
Token the
Token ID262
Feature activation-4.94e-02

Pos logit contributions
board+0.031
inner+0.031
ag+0.029
anger+0.029
ool+0.028
Neg logit contributions
 soon-0.016
 then-0.013
 able-0.011
 least-0.010
 stuck-0.010
Token ball
Token ID2613
Feature activation+7.25e-03

Pos logit contributions
inner+0.068
board+0.067
ag+0.065
ocol+0.064
anger+0.064
Neg logit contributions
 soon-0.092
 then-0.091
 stuck-0.079
-0.079
 so-0.078
Token with
Token ID351
Feature activation-6.57e-03

Pos logit contributions
 soon+0.007
 then+0.006
 able+0.005
 least+0.005
 stuck+0.005
Neg logit contributions
board-0.017
inner-0.017
ag-0.016
ork-0.016
anger-0.016
Token his
Token ID465
Feature activation-1.13e-02

Pos logit contributions
inner+0.016
board+0.015
ool+0.015
anger+0.014
 pleasures+0.014
Neg logit contributions
 soon-0.008
 then-0.007
 least-0.005
 able-0.005
 Then-0.005
Token friends
Token ID2460
Feature activation-2.60e-03

Pos logit contributions
board+0.018
inner+0.018
ag+0.016
anger+0.015
ool+0.015
Neg logit contributions
 soon-0.021
 then-0.019
 able-0.017
 least-0.017
 stuck-0.017
Token.
Token ID13
Feature activation-6.45e-03

Pos logit contributions
board+0.006
inner+0.006
ag+0.006
ork+0.006
 medal+0.006
Neg logit contributions
 soon-0.002
 then-0.002
 able-0.002
 least-0.002
 stuck-0.001
Token can
Token ID460
Feature activation+7.80e-03

Pos logit contributions
board+0.038
inner+0.037
oons+0.037
ork+0.037
ool+0.036
Neg logit contributions
 soon-0.022
 then-0.021
 now-0.018
 never-0.017
 so-0.017
Token I
Token ID314
Feature activation-1.70e-02

Pos logit contributions
 soon+0.010
 then+0.009
+0.007
 stuck+0.007
 least+0.007
Neg logit contributions
board-0.017
inner-0.017
anger-0.016
ool-0.016
ag-0.016
Token play
Token ID711
Feature activation-1.35e-02

Pos logit contributions
board+0.029
inner+0.028
ag+0.026
anger+0.025
ork+0.025
Neg logit contributions
 soon-0.029
 then-0.026
 least-0.023
 able-0.023
 stuck-0.023
Token with
Token ID351
Feature activation-2.44e-02

Pos logit contributions
inner+0.032
board+0.032
ool+0.030
anger+0.030
ork+0.029
Neg logit contributions
 soon-0.014
 then-0.012
 able-0.009
 now-0.009
 stuck-0.009
Token your
Token ID534
Feature activation-3.40e-02

Pos logit contributions
board+0.053
inner+0.053
anger+0.048
ag+0.048
ool+0.048
Neg logit contributions
 soon-0.031
 then-0.028
 least-0.022
 able-0.022
 stuck-0.022
Token yellow
Token ID7872
Feature activation-4.93e-02

Pos logit contributions
board+0.045
inner+0.044
ag+0.039
ork+0.037
 medal+0.037
Neg logit contributions
 soon-0.070
 then-0.064
 able-0.059
 least-0.059
 stuck-0.057
Token ball
Token ID2613
Feature activation+2.27e-03

Pos logit contributions
inner+0.068
board+0.063
oker+0.059
ag+0.058
affle+0.056
Neg logit contributions
 soon-0.098
 then-0.093
 stuck-0.083
 gently-0.081
 now-0.081
Token too
Token ID1165
Feature activation-4.36e-03

Pos logit contributions
 soon+0.003
 then+0.002
 able+0.002
 least+0.002
 stuck+0.002
Neg logit contributions
board-0.005
inner-0.005
ag-0.005
ork-0.005
anger-0.005
Token?"
Token ID1701
Feature activation+3.98e-03

Pos logit contributions
board+0.008
inner+0.008
ag+0.008
ool+0.007
ade+0.007
Neg logit contributions
 soon-0.006
 then-0.005
 able-0.005
 least-0.005
 trapped-0.005
Token Tom
Token ID4186
Feature activation-8.34e-03

Pos logit contributions
 soon+0.006
 then+0.005
 least+0.005
 able+0.005
 trapped+0.005
Neg logit contributions
board-0.007
inner-0.007
ag-0.006
ade-0.006
oker-0.006
Token asked
Token ID1965
Feature activation-2.26e-02

Pos logit contributions
board+0.018
inner+0.017
ag+0.017
 medal+0.016
ade+0.016
Neg logit contributions
 soon-0.010
 then-0.008
 least-0.008
 able-0.007
 trapped-0.007
Token "
Token ID366
Feature activation+1.39e-02

Pos logit contributions
board+0.007
inner+0.006
 som+0.006
ork+0.006
anger+0.006
Neg logit contributions
 soon-0.010
 least-0.010
 trapped-0.009
 able-0.009
 stuck-0.008
TokenMaybe
Token ID13300
Feature activation-6.89e-03

Pos logit contributions
 soon+0.020
 then+0.018
 able+0.015
 least+0.015
 stuck+0.014
Neg logit contributions
board-0.027
inner-0.027
ag-0.025
anger-0.025
affle-0.025
Token we
Token ID356
Feature activation+4.38e-03

Pos logit contributions
board+0.011
inner+0.011
 seas+0.011
 medal+0.011
 guys+0.011
Neg logit contributions
 soon-0.010
 able-0.009
 least-0.009
 trapped-0.009
 Amen-0.008
Token can
Token ID460
Feature activation+7.31e-03

Pos logit contributions
 soon+0.006
 then+0.006
 able+0.005
 least+0.005
 stuck+0.005
Neg logit contributions
board-0.008
inner-0.008
ag-0.008
ork-0.007
anger-0.007
Token share
Token ID2648
Feature activation+3.71e-03

Pos logit contributions
 soon+0.013
 then+0.012
 able+0.012
 afraid+0.012
 trapped+0.012
Neg logit contributions
board-0.010
inner-0.010
ag-0.009
ade-0.009
 trophy-0.009
Token the
Token ID262
Feature activation-4.88e-02

Pos logit contributions
 soon+0.005
 then+0.005
 least+0.004
 able+0.004
 trapped+0.004
Neg logit contributions
board-0.007
inner-0.006
ag-0.006
ork-0.006
 similarities-0.006
Token clay
Token ID21558
Feature activation-3.61e-03

Pos logit contributions
board+0.078
inner+0.077
ag+0.070
ork+0.068
anger+0.068
Neg logit contributions
 soon-0.086
 then-0.078
 able-0.070
 least-0.070
 stuck-0.068
Token.
Token ID13
Feature activation-1.72e-02

Pos logit contributions
board+0.008
inner+0.008
 trophy+0.007
ade+0.007
ag+0.007
Neg logit contributions
 soon-0.004
 then-0.003
 able-0.003
 least-0.003
 hoping-0.003
Token We
Token ID775
Feature activation+1.29e-02

Pos logit contributions
board+0.026
inner+0.024
 People+0.022
 similarities+0.021
anger+0.021
Neg logit contributions
 soon-0.030
 able-0.028
 trapped-0.026
 stuck-0.025
 hoping-0.025
Token can
Token ID460
Feature activation+7.69e-03

Pos logit contributions
 soon+0.017
 then+0.015
 least+0.013
 stuck+0.012
 able+0.012
Neg logit contributions
board-0.027
inner-0.026
ag-0.025
anger-0.024
ork-0.024
Token each
Token ID1123
Feature activation-7.99e-03

Pos logit contributions
 soon+0.013
 able+0.012
 then+0.012
 afraid+0.012
 trapped+0.012
Neg logit contributions
board-0.011
inner-0.011
ag-0.010
 trophy-0.010
 medal-0.010
Token all
Token ID477
Feature activation-6.62e-03

Pos logit contributions
 soon+0.014
 then+0.012
 able+0.011
 least+0.011
 stuck+0.010
Neg logit contributions
board-0.018
inner-0.018
ag-0.017
ork-0.016
anger-0.016
Token played
Token ID2826
Feature activation-8.46e-03

Pos logit contributions
inner+0.014
board+0.014
anger+0.013
ork+0.013
ool+0.013
Neg logit contributions
 soon-0.010
 then-0.008
 able-0.007
 scared-0.007
 stuck-0.007
Token together
Token ID1978
Feature activation+1.62e-03

Pos logit contributions
inner+0.025
board+0.025
ool+0.024
anger+0.024
affle+0.024
Neg logit contributions
 soon-0.004
 then-0.003
 together-0.001
 able-0.001
 gently-0.001
Token with
Token ID351
Feature activation-1.18e-02

Pos logit contributions
 soon+0.002
 then+0.001
 able+0.001
 least+0.001
 stuck+0.001
Neg logit contributions
board-0.004
inner-0.004
ag-0.004
ork-0.003
 medal-0.003
Token the
Token ID262
Feature activation-2.98e-02

Pos logit contributions
board+0.024
inner+0.024
ag+0.023
ork+0.022
 medal+0.022
Neg logit contributions
 soon-0.015
 then-0.013
 able-0.012
 least-0.011
 stuck-0.011
Token shiny
Token ID22441
Feature activation-4.88e-02

Pos logit contributions
board+0.033
inner+0.032
ool+0.029
anger+0.029
ork+0.029
Neg logit contributions
 soon-0.066
 then-0.064
 scared-0.059
 least-0.059
 stuck-0.058
Token trophy
Token ID16383
Feature activation+5.71e-03

Pos logit contributions
inner+0.045
board+0.041
 similarities+0.039
abs+0.039
ool+0.039
Neg logit contributions
 soon-0.109
 then-0.105
 now-0.101
 closer-0.101
 over-0.099
Token.
Token ID13
Feature activation-3.86e-03

Pos logit contributions
 soon+0.005
 then+0.004
 able+0.003
 least+0.003
 stuck+0.003
Neg logit contributions
board-0.014
inner-0.014
ag-0.013
ork-0.013
affle-0.013
Token From
Token ID3574
Feature activation+6.91e-03

Pos logit contributions
board+0.009
inner+0.009
ag+0.008
ork+0.008
anger+0.008
Neg logit contributions
 soon-0.004
 then-0.004
 able-0.003
 least-0.003
 stuck-0.003
Token that
Token ID326
Feature activation+7.66e-03

Pos logit contributions
 soon+0.008
 able+0.007
+0.007
 least+0.007
 hoping+0.007
Neg logit contributions
board-0.013
inner-0.013
 medal-0.013
ag-0.013
 seas-0.012
Token day
Token ID1110
Feature activation+1.14e-02

Pos logit contributions
 soon+0.017
 then+0.016
 able+0.016
 least+0.015
 trapped+0.015
Neg logit contributions
board-0.008
inner-0.008
ag-0.007
 medal-0.006
ool-0.006
Token mom
Token ID1995
Feature activation-8.87e-03

Pos logit contributions
 soon+0.017
 then+0.016
 able+0.015
 least+0.015
 stuck+0.015
Neg logit contributions
board-0.016
inner-0.015
ag-0.014
 guys-0.014
 similarities-0.013
Token and
Token ID290
Feature activation+2.65e-03

Pos logit contributions
board+0.020
inner+0.019
ag+0.018
ork+0.018
ool+0.018
Neg logit contributions
 soon-0.009
 then-0.007
 least-0.007
 able-0.007
 trapped-0.006
Token decided
Token ID3066
Feature activation-1.31e-02

Pos logit contributions
 soon+0.003
 then+0.003
 able+0.003
 least+0.003
 trapped+0.003
Neg logit contributions
board-0.005
inner-0.005
ag-0.005
 medal-0.005
ork-0.005
Token to
Token ID284
Feature activation-1.48e-02

Pos logit contributions
board+0.029
inner+0.028
ag+0.027
ork+0.026
 medal+0.026
Neg logit contributions
 soon-0.015
 then-0.013
 least-0.011
 able-0.011
 stuck-0.010
Token share
Token ID2648
Feature activation+3.47e-04

Pos logit contributions
board+0.024
inner+0.024
ag+0.022
ork+0.021
anger+0.021
Neg logit contributions
 soon-0.025
 then-0.023
 able-0.021
 least-0.020
 stuck-0.020
Token the
Token ID262
Feature activation-4.86e-02

Pos logit contributions
 soon+0.000
 then+0.000
 able+0.000
 least+0.000
 stuck+0.000
Neg logit contributions
board-0.001
inner-0.001
ag-0.001
anger-0.001
ork-0.001
Token toy
Token ID13373
Feature activation+7.90e-03

Pos logit contributions
inner+0.080
board+0.080
anger+0.073
ag+0.073
ork+0.072
Neg logit contributions
 soon-0.079
 then-0.072
 stuck-0.066
 scared-0.065
 now-0.064
Token.
Token ID13
Feature activation-6.21e-03

Pos logit contributions
 soon+0.008
 then+0.006
 able+0.005
 least+0.005
 stuck+0.005
Neg logit contributions
board-0.019
inner-0.019
ag-0.018
ork-0.017
 medal-0.017
Token They
Token ID1119
Feature activation+4.23e-03

Pos logit contributions
board+0.012
inner+0.011
ork+0.010
ag+0.010
anger+0.010
Neg logit contributions
 soon-0.010
 then-0.008
 least-0.008
 able-0.008
 stuck-0.007
Token played
Token ID2826
Feature activation-7.25e-03

Pos logit contributions
 soon+0.006
 then+0.005
 able+0.004
 least+0.004
 stuck+0.004
Neg logit contributions
board-0.009
inner-0.009
ag-0.008
anger-0.008
ork-0.008
Token together
Token ID1978
Feature activation-5.09e-03

Pos logit contributions
board+0.020
inner+0.020
ool+0.019
anger+0.019
affle+0.019
Neg logit contributions
 soon-0.005
 then-0.004
 able-0.002
 together-0.002
 gently-0.002
Token selfish
Token ID20363
Feature activation-7.92e-03

Pos logit contributions
 soon+0.000
 then+0.000
 able+0.000
 least+0.000
 stuck+0.000
Neg logit contributions
board-0.000
inner-0.000
ag-0.000
ork-0.000
anger-0.000
Token.
Token ID13
Feature activation-9.80e-03

Pos logit contributions
board+0.019
inner+0.018
ag+0.017
 medal+0.017
ade+0.017
Neg logit contributions
 soon-0.008
 then-0.007
 able-0.006
 least-0.006
 stuck-0.005
Token You
Token ID921
Feature activation+5.50e-03

Pos logit contributions
board+0.016
inner+0.016
ag+0.015
oker+0.014
ork+0.014
Neg logit contributions
 soon-0.017
 then-0.015
 able-0.014
 least-0.013
 stuck-0.013
Token should
Token ID815
Feature activation+7.69e-04

Pos logit contributions
 soon+0.009
 then+0.008
 least+0.007
 stuck+0.007
 able+0.007
Neg logit contributions
board-0.010
inner-0.010
ag-0.009
anger-0.009
ork-0.009
Token share
Token ID2648
Feature activation-7.35e-03

Pos logit contributions
 soon+0.001
 then+0.001
 able+0.001
 least+0.001
 stuck+0.001
Neg logit contributions
board-0.002
inner-0.002
ag-0.001
ork-0.001
 medal-0.001
Token the
Token ID262
Feature activation-4.85e-02

Pos logit contributions
board+0.016
inner+0.016
ag+0.015
ork+0.015
 medal+0.014
Neg logit contributions
 soon-0.009
 then-0.007
 able-0.006
 least-0.006
 stuck-0.006
Token ball
Token ID2613
Feature activation+6.42e-03

Pos logit contributions
board+0.070
inner+0.069
ocol+0.067
affle+0.066
anger+0.066
Neg logit contributions
 soon-0.086
 then-0.083
 stuck-0.076
-0.073
 scared-0.073
Token with
Token ID351
Feature activation-2.07e-02

Pos logit contributions
 soon+0.007
 then+0.006
 able+0.005
 least+0.005
 Then+0.005
Neg logit contributions
board-0.014
inner-0.014
ag-0.013
ade-0.013
ork-0.013
Token me
Token ID502
Feature activation-3.99e-03

Pos logit contributions
inner+0.047
board+0.046
anger+0.043
ool+0.042
oker+0.042
Neg logit contributions
 soon-0.027
 then-0.024
 now-0.020
 never-0.019
 gently-0.019
Token.
Token ID13
Feature activation-1.63e-02

Pos logit contributions
board+0.009
inner+0.009
ag+0.009
ork+0.008
 medal+0.008
Neg logit contributions
 soon-0.004
 then-0.003
 able-0.003
 least-0.003
 stuck-0.003
Token We
Token ID775
Feature activation+9.25e-03

Pos logit contributions
board+0.024
inner+0.022
ag+0.020
oker+0.020
 similarities+0.020
Neg logit contributions
 soon-0.032
 able-0.027
 then-0.027
 least-0.025
 closer-0.025
Token,
Token ID11
Feature activation-4.16e-03

Pos logit contributions
board+0.008
inner+0.008
ag+0.007
ork+0.007
anger+0.007
Neg logit contributions
 soon-0.003
 then-0.002
 least-0.002
 able-0.002
 stuck-0.002
Token "
Token ID366
Feature activation+6.35e-03

Pos logit contributions
board+0.005
 som+0.005
inner+0.004
 medal+0.004
ork+0.004
Neg logit contributions
 soon-0.007
 least-0.007
 able-0.006
 trapped-0.006
 then-0.006
TokenLet
Token ID5756
Feature activation+2.21e-02

Pos logit contributions
 soon+0.009
 then+0.008
 least+0.006
+0.006
 suddenly+0.006
Neg logit contributions
board-0.013
inner-0.012
ag-0.012
ool-0.012
 medal-0.012
Token's
Token ID338
Feature activation+1.48e-02

Pos logit contributions
 soon+0.026
 then+0.023
 able+0.019
 least+0.019
 stuck+0.018
Neg logit contributions
board-0.049
inner-0.048
ag-0.045
anger-0.044
ork-0.044
Token all
Token ID477
Feature activation+2.25e-03

Pos logit contributions
 soon+0.026
 then+0.023
 able+0.022
 afraid+0.022
 hoping+0.022
Neg logit contributions
inner-0.021
board-0.021
 guys-0.019
 trophy-0.019
ag-0.019
Token run
Token ID1057
Feature activation+2.90e-02

Pos logit contributions
 soon+0.004
 then+0.003
 least+0.003
 able+0.003
 stuck+0.003
Neg logit contributions
board-0.004
inner-0.004
anger-0.004
ork-0.004
 seas-0.004
Token back
Token ID736
Feature activation+8.04e-03

Pos logit contributions
 soon+0.033
 then+0.028
 able+0.027
 least+0.026
 afraid+0.025
Neg logit contributions
board-0.061
inner-0.061
ag-0.057
ork-0.056
anger-0.056
Token to
Token ID284
Feature activation-4.74e-03

Pos logit contributions
 soon+0.009
 then+0.009
 Amen+0.009
 able+0.009
 least+0.008
Neg logit contributions
inner-0.015
board-0.015
anger-0.014
ade-0.014
ork-0.014
Token the
Token ID262
Feature activation-9.49e-04

Pos logit contributions
board+0.009
inner+0.009
ag+0.009
 medal+0.009
ork+0.009
Neg logit contributions
 soon-0.006
 then-0.005
 able-0.005
 least-0.004
 stuck-0.004
Token house
Token ID2156
Feature activation-1.13e-03

Pos logit contributions
inner+0.002
board+0.002
ag+0.002
ool+0.001
anger+0.001
Neg logit contributions
 soon-0.002
 then-0.001
 least-0.001
 stuck-0.001
 now-0.001
Token!"
Token ID2474
Feature activation+1.04e-02

Pos logit contributions
board+0.003
inner+0.002
ag+0.002
anger+0.002
ork+0.002
Neg logit contributions
 soon-0.001
 then-0.001
 able-0.001
 least-0.001
 Then-0.001
Token could
Token ID714
Feature activation+6.82e-03

Pos logit contributions
 soon+0.025
 then+0.022
 least+0.020
 able+0.020
 Amen+0.019
Neg logit contributions
board-0.032
inner-0.029
ag-0.029
ork-0.028
ade-0.027
Token not
Token ID407
Feature activation+1.78e-02

Pos logit contributions
 soon+0.007
 least+0.007
 Owl+0.007
 Amen+0.007
 Then+0.007
Neg logit contributions
ag-0.012
 medal-0.012
ves-0.012
inner-0.011
ork-0.011
Token stop
Token ID2245
Feature activation+2.09e-02

Pos logit contributions
 soon+0.026
 then+0.023
 able+0.021
 least+0.021
 Then+0.021
Neg logit contributions
board-0.033
inner-0.032
ag-0.031
ork-0.029
ade-0.029
Token.
Token ID13
Feature activation+1.75e-02

Pos logit contributions
 soon+0.024
 then+0.021
 able+0.021
 least+0.021
 Amen+0.019
Neg logit contributions
ag-0.038
board-0.037
 som-0.036
inner-0.035
 medal-0.035
Token They
Token ID1119
Feature activation+1.60e-02

Pos logit contributions
 soon+0.020
 then+0.017
 Then+0.015
 least+0.014
 able+0.013
Neg logit contributions
inner-0.039
board-0.039
ag-0.037
affle-0.036
 medal-0.036
Token were
Token ID547
Feature activation+2.70e-02

Pos logit contributions
 soon+0.021
 then+0.019
 able+0.015
 stuck+0.015
 least+0.015
Neg logit contributions
inner-0.034
board-0.033
ag-0.031
ool-0.031
oker-0.030
Token going
Token ID1016
Feature activation+2.50e-02

Pos logit contributions
 soon+0.024
 then+0.019
 least+0.017
 Then+0.015
 able+0.014
Neg logit contributions
ag-0.062
board-0.062
inner-0.061
 medal-0.059
affle-0.058
Token too
Token ID1165
Feature activation+1.46e-02

Pos logit contributions
 soon+0.030
 then+0.029
 Amen+0.028
 least+0.027
 able+0.026
Neg logit contributions
board-0.044
ag-0.044
inner-0.043
 medal-0.043
ork-0.043
Token fast
Token ID3049
Feature activation+1.26e-02

Pos logit contributions
 least+0.020
 then+0.019
 Amen+0.018
ishly+0.017
 Then+0.017
Neg logit contributions
inner-0.025
ork-0.024
ag-0.023
oker-0.023
ade-0.022
Token.
Token ID13
Feature activation+1.72e-02

Pos logit contributions
 soon+0.014
 then+0.012
 least+0.011
 able+0.010
 Amen+0.010
Neg logit contributions
board-0.028
inner-0.026
ag-0.026
anger-0.025
 medal-0.025
Token They
Token ID1119
Feature activation+1.31e-02

Pos logit contributions
 soon+0.023
 then+0.020
 able+0.017
 least+0.017
 Then+0.016
Neg logit contributions
board-0.036
inner-0.035
ag-0.033
 medal-0.032
ool-0.032
Token living
Token ID2877
Feature activation+5.17e-05

Pos logit contributions
 soon+0.004
 then+0.003
 least+0.003
 stuck+0.003
 hoping+0.003
Neg logit contributions
inner-0.004
board-0.004
ag-0.004
 medal-0.004
oker-0.004
Token room
Token ID2119
Feature activation+4.04e-03

Pos logit contributions
 soon+0.000
 then+0.000
 able+0.000
 least+0.000
 stuck+0.000
Neg logit contributions
board-0.000
inner-0.000
ag-0.000
ork-0.000
anger-0.000
Token.
Token ID13
Feature activation-5.07e-03

Pos logit contributions
 soon+0.004
 then+0.003
 able+0.002
 least+0.002
 stuck+0.002
Neg logit contributions
board-0.010
inner-0.010
ag-0.009
ork-0.009
 medal-0.009
Token Lily
Token ID20037
Feature activation-4.52e-03

Pos logit contributions
inner+0.010
board+0.010
ag+0.010
 medal+0.009
affle+0.009
Neg logit contributions
 soon-0.007
 then-0.006
 Then-0.005
 able-0.005
 least-0.005
Token tried
Token ID3088
Feature activation+5.78e-03

Pos logit contributions
board+0.009
inner+0.009
ag+0.008
ork+0.008
anger+0.008
Neg logit contributions
 soon-0.006
 then-0.005
 able-0.005
 least-0.005
 stuck-0.004
Token her
Token ID607
Feature activation+3.03e-02

Pos logit contributions
 soon+0.007
 then+0.006
 least+0.006
 Amen+0.006
 trapped+0.006
Neg logit contributions
board-0.011
inner-0.010
ork-0.010
ag-0.010
ade-0.010
Token best
Token ID1266
Feature activation+1.58e-02

Pos logit contributions
 soon+0.051
 then+0.046
 able+0.043
 trapped+0.042
 Amen+0.041
Neg logit contributions
board-0.047
inner-0.046
 medal-0.045
 trophy-0.043
ag-0.043
Token,
Token ID11
Feature activation+1.99e-02

Pos logit contributions
 soon+0.016
 then+0.014
 least+0.012
 able+0.011
 trapped+0.011
Neg logit contributions
board-0.036
inner-0.036
ag-0.034
 medal-0.033
ork-0.033
Token but
Token ID475
Feature activation-1.25e-03

Pos logit contributions
 soon+0.024
 then+0.020
 able+0.018
 least+0.018
 trapped+0.016
Neg logit contributions
board-0.042
inner-0.041
ag-0.039
 seas-0.038
 medal-0.038
Token she
Token ID673
Feature activation+8.81e-03

Pos logit contributions
board+0.003
inner+0.003
ag+0.002
ork+0.002
 medal+0.002
Neg logit contributions
 soon-0.002
 then-0.001
 able-0.001
 least-0.001
 stuck-0.001
Token was
Token ID373
Feature activation+1.65e-02

Pos logit contributions
 soon+0.012
 then+0.010
 able+0.009
 least+0.009
 stuck+0.009
Neg logit contributions
board-0.018
inner-0.018
ag-0.017
anger-0.016
 medal-0.016
Token Lily
Token ID20037
Feature activation+4.25e-03

Pos logit contributions
inner+0.014
board+0.014
ag+0.013
ool+0.013
affle+0.013
Neg logit contributions
 soon-0.006
 then-0.006
 able-0.004
 least-0.004
 stuck-0.004
Token noticed
Token ID6810
Feature activation-1.73e-03

Pos logit contributions
 soon+0.004
 then+0.004
 able+0.003
 least+0.003
 Then+0.003
Neg logit contributions
board-0.010
inner-0.009
ag-0.009
ork-0.008
 medal-0.008
Token a
Token ID257
Feature activation+5.76e-03

Pos logit contributions
board+0.004
inner+0.004
ag+0.004
ork+0.004
 medal+0.004
Neg logit contributions
 soon-0.002
 then-0.002
 able-0.001
 least-0.001
 stuck-0.001
Token nos
Token ID43630
Feature activation-9.12e-03

Pos logit contributions
 soon+0.009
 then+0.007
 very+0.007
 scared+0.007
 closer+0.007
Neg logit contributions
inner-0.011
ag-0.011
board-0.011
ests-0.010
oker-0.010
Tokeny
Token ID88
Feature activation+7.28e-03

Pos logit contributions
board+0.016
inner+0.016
ag+0.014
ork+0.014
anger+0.014
Neg logit contributions
 soon-0.015
 then-0.013
 able-0.012
 least-0.012
 stuck-0.011
Token little
Token ID1310
Feature activation+3.18e-02

Pos logit contributions
 soon+0.011
 then+0.010
 very+0.009
 now+0.009
 scared+0.009
Neg logit contributions
inner-0.014
ag-0.013
oons-0.013
ests-0.013
 similarities-0.013
Token monkey
Token ID21657
Feature activation-1.39e-03

Pos logit contributions
 soon+0.056
 then+0.050
 stuck+0.044
 able+0.044
 least+0.044
Neg logit contributions
inner-0.053
board-0.052
ag-0.048
ork-0.046
affle-0.046
Token pe
Token ID613
Feature activation-2.18e-03

Pos logit contributions
board+0.003
inner+0.003
ag+0.003
ork+0.003
 medal+0.003
Neg logit contributions
 soon-0.001
 then-0.001
 able-0.001
 least-0.001
 stuck-0.001
Tokeneking
Token ID18754
Feature activation+7.26e-03

Pos logit contributions
board+0.003
ag+0.003
inner+0.003
affle+0.003
ork+0.003
Neg logit contributions
 soon-0.004
 then-0.003
 least-0.003
 Then-0.003
 able-0.003
Token through
Token ID832
Feature activation+8.95e-03

Pos logit contributions
 soon+0.007
 then+0.005
 able+0.004
 stuck+0.004
 over+0.004
Neg logit contributions
board-0.018
inner-0.018
ag-0.017
oker-0.017
affle-0.017
Token the
Token ID262
Feature activation+1.05e-02

Pos logit contributions
 soon+0.010
 then+0.008
 able+0.007
 least+0.007
 stuck+0.006
Neg logit contributions
board-0.020
inner-0.020
ag-0.019
ork-0.019
 medal-0.019
Token,
Token ID11
Feature activation-1.22e-02

Pos logit contributions
board+0.041
inner+0.040
ag+0.037
 medal+0.037
ade+0.037
Neg logit contributions
 soon-0.015
 then-0.012
 least-0.011
 able-0.010
 trapped-0.009
Token Jack
Token ID3619
Feature activation-1.22e-02

Pos logit contributions
board+0.022
inner+0.021
ag+0.020
 medal+0.019
ork+0.019
Neg logit contributions
 soon-0.019
 then-0.017
 able-0.015
 least-0.015
 stuck-0.015
Token set
Token ID900
Feature activation-8.55e-03

Pos logit contributions
board+0.022
inner+0.020
ade+0.019
 seas+0.018
ork+0.018
Neg logit contributions
 soon-0.018
 then-0.016
 least-0.016
 able-0.016
 Amen-0.015
Token off
Token ID572
Feature activation-1.31e-02

Pos logit contributions
board+0.020
inner+0.020
ag+0.019
ork+0.018
 medal+0.018
Neg logit contributions
 soon-0.009
 then-0.007
 least-0.006
 able-0.006
 stuck-0.005
Token to
Token ID284
Feature activation-7.36e-03

Pos logit contributions
board+0.029
inner+0.029
ag+0.027
ork+0.027
 medal+0.027
Neg logit contributions
 soon-0.014
 then-0.012
 least-0.010
 able-0.010
 stuck-0.009
Token explore
Token ID7301
Feature activation+3.41e-02

Pos logit contributions
board+0.014
inner+0.014
ag+0.013
ork+0.012
anger+0.012
Neg logit contributions
 soon-0.011
 then-0.010
 able-0.009
 least-0.009
 stuck-0.008
Token the
Token ID262
Feature activation-1.03e-02

Pos logit contributions
 soon+0.037
 then+0.031
 able+0.025
 least+0.025
 stuck+0.023
Neg logit contributions
board-0.079
inner-0.078
ag-0.073
ork-0.072
anger-0.072
Token world
Token ID995
Feature activation+2.55e-03

Pos logit contributions
board+0.016
inner+0.016
ag+0.015
affle+0.015
oker+0.015
Neg logit contributions
 soon-0.018
 then-0.016
 least-0.015
 now-0.014
 stuck-0.014
Token.
Token ID13
Feature activation-2.62e-02

Pos logit contributions
 soon+0.002
 then+0.002
 able+0.002
 least+0.002
 stuck+0.001
Neg logit contributions
board-0.006
inner-0.006
ag-0.006
ork-0.006
anger-0.006
Token He
Token ID679
Feature activation-1.54e-02

Pos logit contributions
inner+0.052
ves+0.051
ag+0.051
ork+0.050
 medal+0.050
Neg logit contributions
 soon-0.033
 then-0.031
 Then-0.030
 so-0.025
-0.025
Token came
Token ID1625
Feature activation-1.12e-03

Pos logit contributions
inner+0.033
board+0.033
ag+0.031
oker+0.030
anger+0.030
Neg logit contributions
 soon-0.020
 then-0.017
 stuck-0.014
 able-0.014
 least-0.014
Token mom
Token ID1995
Feature activation-1.01e-02

Pos logit contributions
 soon+0.025
+0.022
 Amen+0.022
 least+0.022
 then+0.022
Neg logit contributions
board-0.013
 medal-0.013
 trophy-0.012
 game-0.012
 prize-0.012
Token and
Token ID290
Feature activation+3.42e-03

Pos logit contributions
board+0.018
ag+0.016
ade+0.015
inner+0.015
ool+0.014
Neg logit contributions
 trapped-0.013
 soon-0.013
 terrified-0.013
 Amen-0.012
 least-0.012
Token said
Token ID531
Feature activation-5.30e-03

Pos logit contributions
 soon+0.005
 least+0.005
 then+0.004
 trapped+0.004
 Then+0.004
Neg logit contributions
board-0.005
 medal-0.005
inner-0.005
 trophy-0.005
ade-0.005
Token,
Token ID11
Feature activation+8.19e-04

Pos logit contributions
board+0.012
inner+0.012
ag+0.011
 medal+0.011
ork+0.011
Neg logit contributions
 soon-0.006
 then-0.004
 least-0.004
 able-0.004
 trapped-0.004
Token "
Token ID366
Feature activation+1.64e-02

Pos logit contributions
 soon+0.001
 least+0.001
 able+0.001
 then+0.001
 trapped+0.001
Neg logit contributions
board-0.001
inner-0.001
 som-0.001
ork-0.001
anger-0.001
TokenLook
Token ID8567
Feature activation+2.23e-02

Pos logit contributions
 soon+0.024
 able+0.021
 then+0.021
 afraid+0.019
 least+0.019
Neg logit contributions
board-0.030
inner-0.029
 seas-0.026
 similarities-0.026
ork-0.026
Token,
Token ID11
Feature activation-8.31e-03

Pos logit contributions
 soon+0.024
 then+0.019
 least+0.017
 able+0.017
 hoping+0.015
Neg logit contributions
board-0.050
inner-0.050
ag-0.047
ork-0.046
 medal-0.046
Token Mom
Token ID11254
Feature activation+3.20e-03

Pos logit contributions
board+0.017
inner+0.016
 medal+0.015
ag+0.015
ork+0.015
Neg logit contributions
 soon-0.011
 then-0.009
 able-0.009
 least-0.008
 hoping-0.008
Tokenmy
Token ID1820
Feature activation+2.85e-03

Pos logit contributions
 soon+0.004
 then+0.004
 trapped+0.004
 able+0.003
 Then+0.003
Neg logit contributions
board-0.006
inner-0.006
ade-0.005
oom-0.005
ool-0.005
Token!
Token ID0
Feature activation-1.66e-02

Pos logit contributions
 soon+0.004
 then+0.003
 least+0.003
 able+0.003
 Then+0.003
Neg logit contributions
board-0.006
inner-0.005
ade-0.005
ork-0.005
ag-0.005
Token I
Token ID314
Feature activation-1.91e-02

Pos logit contributions
board+0.026
inner+0.024
ork+0.021
anger+0.021
 trophy+0.021
Neg logit contributions
 soon-0.030
 able-0.027
 then-0.026
 least-0.025
 hoping-0.025
Token her
Token ID607
Feature activation-6.05e-03

Pos logit contributions
board+0.046
inner+0.045
ag+0.043
ork+0.042
anger+0.042
Neg logit contributions
 soon-0.020
 then-0.017
 able-0.014
 least-0.014
 stuck-0.013
Token pictures
Token ID5986
Feature activation-1.09e-02

Pos logit contributions
inner+0.009
board+0.009
ag+0.008
affle+0.008
anger+0.008
Neg logit contributions
 soon-0.011
 then-0.011
 able-0.010
 least-0.009
 stuck-0.009
Token.
Token ID13
Feature activation+1.36e-03

Pos logit contributions
board+0.027
inner+0.027
ag+0.025
affle+0.025
ork+0.025
Neg logit contributions
 soon-0.010
 then-0.008
 able-0.006
 least-0.006
 stuck-0.006
Token Mia
Token ID32189
Feature activation-2.75e-03

Pos logit contributions
 soon+0.002
 least+0.002
 then+0.002
 able+0.002
 trapped+0.002
Neg logit contributions
board-0.003
inner-0.002
anger-0.002
ork-0.002
ag-0.002
Token showed
Token ID3751
Feature activation+8.72e-04

Pos logit contributions
board+0.006
inner+0.005
ag+0.005
ork+0.005
 medal+0.005
Neg logit contributions
 soon-0.004
 then-0.003
 able-0.003
 least-0.003
 stuck-0.003
Token L
Token ID406
Feature activation+2.23e-03

Pos logit contributions
 soon+0.001
 then+0.001
 least+0.001
 able+0.001
 trapped+0.001
Neg logit contributions
board-0.002
inner-0.002
ag-0.001
ade-0.001
 seas-0.001
Tokenila
Token ID10102
Feature activation-1.61e-02

Pos logit contributions
 soon+0.004
 then+0.004
 able+0.003
 now+0.003
 gently+0.003
Neg logit contributions
board-0.003
inner-0.003
 medal-0.003
 seas-0.003
 pleasures-0.003
Token her
Token ID607
Feature activation-3.23e-04

Pos logit contributions
inner+0.040
board+0.040
ag+0.036
affle+0.036
anger+0.036
Neg logit contributions
 soon-0.015
 then-0.013
 able-0.010
 least-0.009
 stuck-0.009
Token pictures
Token ID5986
Feature activation-7.68e-03

Pos logit contributions
inner+0.000
board+0.000
affle+0.000
ag+0.000
oker+0.000
Neg logit contributions
 soon-0.001
 then-0.001
 able-0.001
 least-0.001
 stuck-0.001
Token.
Token ID13
Feature activation+6.54e-03

Pos logit contributions
board+0.019
inner+0.019
ag+0.018
affle+0.018
ork+0.017
Neg logit contributions
 soon-0.007
 then-0.006
 able-0.004
 least-0.004
 stuck-0.004
Token They
Token ID1119
Feature activation-1.73e-03

Pos logit contributions
 soon+0.010
 then+0.009
 least+0.008
 able+0.008
 trapped+0.008
Neg logit contributions
board-0.012
inner-0.012
anger-0.011
ork-0.011
ag-0.011
Token As
Token ID1081
Feature activation+2.40e-04

Pos logit contributions
board+0.029
inner+0.028
ag+0.026
ork+0.026
anger+0.026
Neg logit contributions
 soon-0.016
 then-0.014
 able-0.012
 least-0.012
 stuck-0.011
Token she
Token ID673
Feature activation-2.12e-02

Pos logit contributions
 soon+0.000
 then+0.000
 able+0.000
 least+0.000
 trapped+0.000
Neg logit contributions
inner-0.000
board-0.000
 seas-0.000
 medal-0.000
ag-0.000
Token jumped
Token ID11687
Feature activation-1.04e-04

Pos logit contributions
board+0.040
inner+0.040
ag+0.037
ork+0.036
anger+0.036
Neg logit contributions
 soon-0.032
 then-0.029
 able-0.024
 least-0.024
 stuck-0.023
Token into
Token ID656
Feature activation-5.66e-03

Pos logit contributions
board+0.000
inner+0.000
ag+0.000
 medal+0.000
anger+0.000
Neg logit contributions
 soon-0.000
 then-0.000
 able-0.000
 least-0.000
 stuck-0.000
Token the
Token ID262
Feature activation-1.08e-03

Pos logit contributions
board+0.013
inner+0.013
ag+0.012
ork+0.012
anger+0.012
Neg logit contributions
 soon-0.006
 then-0.005
 able-0.004
 least-0.004
 stuck-0.004
Token water
Token ID1660
Feature activation+1.42e-03

Pos logit contributions
board+0.002
inner+0.002
ag+0.002
ork+0.002
ool+0.002
Neg logit contributions
 soon-0.002
 then-0.002
 least-0.001
 stuck-0.001
 scared-0.001
Token,
Token ID11
Feature activation+2.39e-03

Pos logit contributions
 soon+0.001
 then+0.001
 able+0.001
 least+0.001
 stuck+0.001
Neg logit contributions
board-0.004
inner-0.004
ag-0.003
ork-0.003
ool-0.003
Token she
Token ID673
Feature activation-9.53e-04

Pos logit contributions
 soon+0.003
 then+0.002
 able+0.002
 suddenly+0.002
 stuck+0.002
Neg logit contributions
board-0.006
inner-0.006
ag-0.006
affle-0.005
ool-0.005
Token saw
Token ID2497
Feature activation+2.54e-03

Pos logit contributions
board+0.002
inner+0.002
ag+0.002
ork+0.002
anger+0.002
Neg logit contributions
 soon-0.001
 then-0.001
 able-0.001
 least-0.001
 stuck-0.001
Token a
Token ID257
Feature activation+8.75e-03

Pos logit contributions
 soon+0.002
 then+0.002
 able+0.002
 least+0.002
 trapped+0.001
Neg logit contributions
board-0.006
inner-0.006
ag-0.006
 seas-0.006
ade-0.006
Token deaf
Token ID21815
Feature activation+4.82e-03

Pos logit contributions
 soon+0.014
 then+0.012
 scared+0.011
 stuck+0.011
 closer+0.011
Neg logit contributions
board-0.016
inner-0.016
ag-0.015
anger-0.014
 similarities-0.014
Token more
Token ID517
Feature activation-1.53e-02

Pos logit contributions
board+0.011
inner+0.011
ork+0.011
anger+0.011
oom+0.011
Neg logit contributions
 soon-0.011
 then-0.010
 gently-0.009
 together-0.009
 so-0.009
Token ice
Token ID4771
Feature activation-1.31e-02

Pos logit contributions
anger+0.024
board+0.024
inner+0.023
ork+0.023
oom+0.022
Neg logit contributions
 soon-0.027
 then-0.024
 together-0.022
 scared-0.022
 so-0.021
Token cream
Token ID8566
Feature activation+2.99e-03

Pos logit contributions
board+0.017
inner+0.015
anger+0.015
ag+0.015
ork+0.015
Neg logit contributions
 soon-0.028
 then-0.024
 together-0.023
 now-0.022
 gently-0.022
Token.
Token ID13
Feature activation-1.19e-03

Pos logit contributions
 soon+0.003
 then+0.003
 able+0.002
 least+0.002
 stuck+0.002
Neg logit contributions
board-0.007
inner-0.007
ag-0.006
ork-0.006
anger-0.006
Token They
Token ID1119
Feature activation+9.61e-03

Pos logit contributions
inner+0.002
board+0.002
ork+0.002
 similarities+0.002
anger+0.002
Neg logit contributions
 soon-0.002
 able-0.002
 then-0.002
 least-0.002
 stuck-0.002
Token made
Token ID925
Feature activation+1.44e-02

Pos logit contributions
 soon+0.013
 then+0.012
 able+0.011
 least+0.010
 stuck+0.010
Neg logit contributions
board-0.019
inner-0.018
ag-0.017
ork-0.017
 medal-0.017
Token chocolate
Token ID11311
Feature activation+4.01e-03

Pos logit contributions
 soon+0.018
 then+0.015
 able+0.013
 least+0.013
 stuck+0.012
Neg logit contributions
board-0.031
inner-0.030
ag-0.028
ork-0.028
anger-0.028
Token ice
Token ID4771
Feature activation-8.14e-03

Pos logit contributions
 soon+0.007
 then+0.006
 able+0.005
 least+0.005
 stuck+0.005
Neg logit contributions
board-0.007
inner-0.007
ag-0.006
ork-0.006
anger-0.006
Token cream
Token ID8566
Feature activation+1.30e-02

Pos logit contributions
board+0.010
inner+0.010
ag+0.009
anger+0.009
oker+0.008
Neg logit contributions
 soon-0.017
 then-0.016
 able-0.014
 least-0.014
 now-0.014
Token.
Token ID13
Feature activation+3.76e-03

Pos logit contributions
 soon+0.013
 then+0.011
 able+0.010
 least+0.010
 afraid+0.009
Neg logit contributions
board-0.030
inner-0.030
ag-0.027
 medal-0.027
ade-0.027
Token They
Token ID1119
Feature activation+1.23e-02

Pos logit contributions
 soon+0.006
 able+0.005
 then+0.005
 least+0.005
 trapped+0.005
Neg logit contributions
inner-0.006
board-0.006
ork-0.006
 similarities-0.006
ag-0.005
Token when
Token ID618
Feature activation+5.30e-03

Pos logit contributions
board+0.008
inner+0.008
ag+0.007
ork+0.007
anger+0.007
Neg logit contributions
 soon-0.003
 then-0.002
 least-0.002
 able-0.002
 stuck-0.002
Token she
Token ID673
Feature activation-2.62e-02

Pos logit contributions
 soon+0.007
 then+0.006
 able+0.006
 least+0.006
 trapped+0.005
Neg logit contributions
board-0.010
inner-0.010
ork-0.009
 medal-0.009
 seas-0.009
Token was
Token ID373
Feature activation-6.39e-03

Pos logit contributions
board+0.051
inner+0.050
ag+0.046
ork+0.045
ade+0.045
Neg logit contributions
 soon-0.038
 then-0.033
 able-0.030
 least-0.029
 hoping-0.028
Token outside
Token ID2354
Feature activation-1.38e-02

Pos logit contributions
board+0.013
inner+0.013
ag+0.012
affle+0.012
ade+0.011
Neg logit contributions
 soon-0.008
 then-0.007
 least-0.006
 able-0.006
 Then-0.006
Token she
Token ID673
Feature activation-4.09e-03

Pos logit contributions
board+0.035
inner+0.035
ag+0.033
ork+0.032
ool+0.032
Neg logit contributions
 soon-0.012
 then-0.010
 able-0.007
 least-0.007
 stuck-0.007
Token saw
Token ID2497
Feature activation+6.51e-03

Pos logit contributions
board+0.007
inner+0.007
ade+0.006
ag+0.006
 seas+0.006
Neg logit contributions
 soon-0.006
 then-0.005
 able-0.005
 least-0.005
 Amen-0.005
Token a
Token ID257
Feature activation+8.45e-03

Pos logit contributions
 soon+0.007
 able+0.005
 then+0.005
 least+0.005
 Amen+0.005
Neg logit contributions
board-0.014
inner-0.013
ade-0.013
 seas-0.013
ag-0.013
Token big
Token ID1263
Feature activation+7.47e-03

Pos logit contributions
 soon+0.012
 then+0.011
 scared+0.010
 very+0.010
 stuck+0.010
Neg logit contributions
ag-0.016
inner-0.016
board-0.016
 pleasures-0.015
ests-0.015
Token b
Token ID275
Feature activation-7.38e-04

Pos logit contributions
 soon+0.010
 then+0.010
 stuck+0.008
 now+0.008
 scared+0.008
Neg logit contributions
inner-0.015
ag-0.015
ves-0.014
board-0.014
ests-0.014
Tokenerry
Token ID6996
Feature activation+5.27e-03

Pos logit contributions
board+0.001
inner+0.001
ag+0.001
ork+0.001
 medal+0.001
Neg logit contributions
 soon-0.002
 then-0.001
 able-0.001
 least-0.001
 stuck-0.001
Token.
Token ID13
Feature activation-5.38e-03

Pos logit contributions
 soon+0.007
 then+0.006
 least+0.005
 able+0.005
 hoping+0.005
Neg logit contributions
board-0.011
inner-0.011
ade-0.010
ag-0.010
ool-0.010
Token.
Token ID13
Feature activation-6.77e-03

Pos logit contributions
board+0.004
inner+0.004
ag+0.004
ork+0.004
anger+0.004
Neg logit contributions
 soon-0.002
 then-0.002
 able-0.001
 least-0.001
 stuck-0.001
Token He
Token ID679
Feature activation+4.39e-03

Pos logit contributions
board+0.012
inner+0.012
ag+0.011
ork+0.011
anger+0.011
Neg logit contributions
 soon-0.011
 then-0.010
 able-0.009
 least-0.009
 stuck-0.008
Token saw
Token ID2497
Feature activation-1.09e-02

Pos logit contributions
 soon+0.006
 then+0.005
 able+0.004
 least+0.004
 stuck+0.004
Neg logit contributions
inner-0.009
board-0.009
ag-0.009
anger-0.009
oker-0.009
Token Sarah
Token ID10490
Feature activation-7.75e-03

Pos logit contributions
board+0.021
inner+0.020
ag+0.019
ork+0.018
ade+0.018
Neg logit contributions
 soon-0.016
 then-0.013
 least-0.012
 able-0.012
 stuck-0.012
Token and
Token ID290
Feature activation+9.42e-03

Pos logit contributions
board+0.018
inner+0.017
ag+0.016
 medal+0.016
ork+0.016
Neg logit contributions
 soon-0.008
 then-0.007
 least-0.006
 able-0.006
 Then-0.005
Token took
Token ID1718
Feature activation-8.97e-03

Pos logit contributions
 soon+0.011
 then+0.010
 able+0.008
 least+0.008
 stuck+0.008
Neg logit contributions
inner-0.021
board-0.021
ag-0.020
anger-0.019
ork-0.019
Token all
Token ID477
Feature activation-1.36e-02

Pos logit contributions
inner+0.023
board+0.023
affle+0.021
ool+0.021
anger+0.021
Neg logit contributions
 soon-0.008
 then-0.007
 able-0.005
 stuck-0.005
 so-0.005
Token the
Token ID262
Feature activation-1.87e-02

Pos logit contributions
board+0.030
inner+0.030
anger+0.029
ork+0.028
 medal+0.028
Neg logit contributions
 soon-0.016
 then-0.014
 over-0.012
 able-0.012
 so-0.011
Token chalk
Token ID30860
Feature activation+7.51e-03

Pos logit contributions
inner+0.028
board+0.025
anger+0.025
ves+0.025
ork+0.025
Neg logit contributions
 soon-0.034
 then-0.032
 scared-0.029
 so-0.029
 afraid-0.029
Token away
Token ID1497
Feature activation+2.18e-03

Pos logit contributions
 soon+0.007
 then+0.006
 least+0.005
 able+0.005
 Then+0.005
Neg logit contributions
board-0.018
inner-0.017
ag-0.016
ork-0.016
 medal-0.016
Token.
Token ID13
Feature activation-5.10e-03

Pos logit contributions
 soon+0.002
 then+0.002
 least+0.002
 able+0.002
 stuck+0.001
Neg logit contributions
board-0.005
inner-0.005
ag-0.005
ork-0.005
 medal-0.005
Token "
Token ID366
Feature activation+1.22e-02

Pos logit contributions
board+0.003
inner+0.003
 som+0.003
anger+0.003
 medal+0.003
Neg logit contributions
 soon-0.005
 least-0.005
 able-0.004
 trapped-0.004
 stuck-0.004
TokenSure
Token ID19457
Feature activation+1.42e-02

Pos logit contributions
 soon+0.022
 then+0.020
 least+0.017
 gently+0.017
+0.017
Neg logit contributions
board-0.019
inner-0.019
ag-0.018
affle-0.017
ool-0.017
Token,
Token ID11
Feature activation-3.70e-03

Pos logit contributions
 soon+0.013
 then+0.011
 able+0.009
 least+0.009
 stuck+0.008
Neg logit contributions
board-0.034
inner-0.034
ag-0.032
 medal-0.031
ork-0.031
Token I
Token ID314
Feature activation-5.21e-03

Pos logit contributions
board+0.007
inner+0.007
ag+0.006
 medal+0.006
ork+0.006
Neg logit contributions
 soon-0.005
 then-0.005
 able-0.004
 least-0.004
 stuck-0.004
Token guess
Token ID4724
Feature activation+1.66e-02

Pos logit contributions
board+0.009
inner+0.009
ag+0.008
anger+0.008
ork+0.008
Neg logit contributions
 soon-0.009
 then-0.008
 able-0.007
 least-0.007
 stuck-0.007
Token.
Token ID13
Feature activation-1.24e-02

Pos logit contributions
 soon+0.020
 then+0.017
 able+0.014
 least+0.014
 stuck+0.013
Neg logit contributions
board-0.036
inner-0.036
ag-0.034
ork-0.033
anger-0.033
Token Where
Token ID6350
Feature activation-2.23e-02

Pos logit contributions
board+0.021
inner+0.020
anger+0.018
 similarities+0.018
ag+0.018
Neg logit contributions
 soon-0.021
 able-0.018
 then-0.018
 least-0.016
 stuck-0.016
Token is
Token ID318
Feature activation+6.48e-04

Pos logit contributions
board+0.041
inner+0.041
 pleasures+0.038
ag+0.038
 similarities+0.037
Neg logit contributions
 soon-0.034
 then-0.030
-0.027
 now-0.026
 stuck-0.026
Token she
Token ID673
Feature activation-3.06e-02

Pos logit contributions
 soon+0.001
 then+0.001
 least+0.001
 able+0.001
 trapped+0.001
Neg logit contributions
board-0.001
inner-0.001
 medal-0.001
 seas-0.001
ag-0.001
Token?"
Token ID1701
Feature activation+6.11e-03

Pos logit contributions
board+0.062
inner+0.061
ag+0.057
 medal+0.056
ork+0.055
Neg logit contributions
 soon-0.039
 then-0.034
 able-0.030
 least-0.029
 Then-0.028
Token
Token ID198
Feature activation+7.53e-03

Pos logit contributions
 able+0.009
 soon+0.008
 least+0.008
 trapped+0.007
 stuck+0.007
Neg logit contributions
board-0.011
ork-0.010
 People-0.010
 Kids-0.009
ag-0.009
Token letter
Token ID3850
Feature activation-9.92e-03

Pos logit contributions
inner+0.006
ag+0.006
board+0.006
 pleasures+0.005
ool+0.005
Neg logit contributions
 soon-0.004
 then-0.004
 scared-0.003
 afraid-0.003
 very-0.003
Token came
Token ID1625
Feature activation+5.31e-04

Pos logit contributions
inner+0.021
ag+0.020
board+0.020
 pleasures+0.020
 similarities+0.019
Neg logit contributions
 soon-0.013
 then-0.012
 stuck-0.010
 able-0.009
 now-0.009
Token in
Token ID287
Feature activation+2.18e-03

Pos logit contributions
 then+0.001
 soon+0.001
 least+0.001
 Amen+0.001
 Then+0.001
Neg logit contributions
board-0.001
inner-0.001
ag-0.001
ork-0.001
anger-0.001
Token the
Token ID262
Feature activation-1.20e-02

Pos logit contributions
 soon+0.002
 then+0.002
 able+0.002
 least+0.002
 stuck+0.002
Neg logit contributions
board-0.005
inner-0.005
ag-0.004
ork-0.004
 medal-0.004
Token mail
Token ID6920
Feature activation+3.87e-03

Pos logit contributions
board+0.019
inner+0.019
ag+0.017
ork+0.017
 medal+0.017
Neg logit contributions
 soon-0.021
 then-0.019
 able-0.017
 least-0.017
 stuck-0.016
Token.
Token ID13
Feature activation-1.52e-02

Pos logit contributions
 soon+0.003
 then+0.003
 able+0.002
 least+0.002
 stuck+0.002
Neg logit contributions
inner-0.010
board-0.009
ag-0.009
affle-0.009
 similarities-0.009
Token As
Token ID1081
Feature activation-7.67e-04

Pos logit contributions
inner+0.036
affle+0.035
ade+0.035
ag+0.035
 seas+0.034
Neg logit contributions
 soon-0.015
 then-0.014
 Then-0.014
 so-0.012
-0.011
Token the
Token ID262
Feature activation-1.71e-02

Pos logit contributions
board+0.002
inner+0.002
 seas+0.001
 medal+0.001
ag+0.001
Neg logit contributions
 soon-0.001
 then-0.001
 able-0.001
 least-0.001
 stuck-0.001
Token silly
Token ID14397
Feature activation-2.67e-02

Pos logit contributions
board+0.024
inner+0.023
ag+0.021
ork+0.020
 medal+0.020
Neg logit contributions
 soon-0.034
 then-0.031
 able-0.029
 least-0.028
 stuck-0.027
Token house
Token ID2156
Feature activation-2.65e-02

Pos logit contributions
ag+0.045
affle+0.044
inner+0.043
ade+0.043
 som+0.043
Neg logit contributions
 soon-0.045
 then-0.042
 scared-0.037
 stuck-0.035
 now-0.035
Token opened
Token ID4721
Feature activation-5.28e-03

Pos logit contributions
affle+0.061
ag+0.061
reath+0.060
inner+0.060
 pleasures+0.059
Neg logit contributions
 soon-0.033
 then-0.028
 stuck-0.023
 suddenly-0.022
 now-0.022
Token.
Token ID13
Feature activation-2.52e-03

Pos logit contributions
board+0.033
ag+0.031
inner+0.031
 guys+0.031
ork+0.031
Neg logit contributions
 least-0.019
 soon-0.019
 then-0.019
 able-0.018
 trapped-0.017
Token They
Token ID1119
Feature activation+8.51e-03

Pos logit contributions
board+0.005
inner+0.005
ork+0.004
ag+0.004
anger+0.004
Neg logit contributions
 soon-0.004
 then-0.003
 able-0.003
 least-0.003
 trapped-0.003
Token took
Token ID1718
Feature activation-1.95e-03

Pos logit contributions
 soon+0.012
 then+0.010
 able+0.009
 least+0.009
 Then+0.009
Neg logit contributions
board-0.016
inner-0.016
ag-0.015
ade-0.014
 medal-0.014
Token her
Token ID607
Feature activation+5.55e-03

Pos logit contributions
board+0.004
inner+0.004
ag+0.004
ork+0.004
ade+0.004
Neg logit contributions
 soon-0.002
 then-0.002
 able-0.002
 least-0.002
 hoping-0.002
Token home
Token ID1363
Feature activation-8.69e-03

Pos logit contributions
 soon+0.007
 then+0.006
 able+0.005
 least+0.005
 Then+0.005
Neg logit contributions
board-0.011
inner-0.011
ag-0.011
 medal-0.010
ork-0.010
Token and
Token ID290
Feature activation-4.73e-03

Pos logit contributions
board+0.020
inner+0.019
ag+0.018
ork+0.018
ade+0.018
Neg logit contributions
 soon-0.008
 then-0.007
 least-0.007
 Amen-0.006
 afraid-0.006
Token put
Token ID1234
Feature activation-4.12e-04

Pos logit contributions
board+0.009
inner+0.009
ag+0.008
 medal+0.008
ade+0.008
Neg logit contributions
 soon-0.006
 then-0.005
 least-0.005
 able-0.005
 afraid-0.004
Token a
Token ID257
Feature activation-1.66e-02

Pos logit contributions
board+0.001
inner+0.001
ag+0.001
ade+0.001
 seas+0.001
Neg logit contributions
 soon-0.000
 then-0.000
 able-0.000
 least-0.000
 hoping-0.000
Token band
Token ID4097
Feature activation-4.01e-03

Pos logit contributions
board+0.026
inner+0.025
ag+0.023
 similarities+0.023
oker+0.023
Neg logit contributions
 soon-0.030
 then-0.027
 gently-0.025
 able-0.025
 stuck-0.024
Tokenage
Token ID496
Feature activation-1.34e-02

Pos logit contributions
board+0.005
inner+0.005
ag+0.005
ork+0.004
ade+0.004
Neg logit contributions
 soon-0.008
 then-0.008
 least-0.007
 able-0.007
 hoping-0.007
Token on
Token ID319
Feature activation-5.48e-03

Pos logit contributions
board+0.029
inner+0.029
ag+0.028
ork+0.027
anger+0.027
Neg logit contributions
 soon-0.015
 then-0.013
 least-0.011
 able-0.011
 afraid-0.010
Token ladder
Token ID18002
Feature activation+1.32e-02

Pos logit contributions
board+0.002
inner+0.002
oker+0.002
ag+0.002
affle+0.002
Neg logit contributions
 soon-0.003
 then-0.003
 least-0.002
 stuck-0.002
 scared-0.002
Token,
Token ID11
Feature activation+3.98e-03

Pos logit contributions
 soon+0.013
 then+0.011
 able+0.009
 least+0.009
 stuck+0.008
Neg logit contributions
board-0.031
inner-0.030
ag-0.029
ork-0.028
anger-0.028
Token the
Token ID262
Feature activation-5.41e-04

Pos logit contributions
 soon+0.004
 then+0.004
 able+0.003
 stuck+0.003
 least+0.003
Neg logit contributions
board-0.009
inner-0.009
ag-0.009
affle-0.009
ool-0.009
Token bell
Token ID8966
Feature activation+1.93e-03

Pos logit contributions
board+0.001
inner+0.001
ag+0.001
oker+0.001
ool+0.001
Neg logit contributions
 soon-0.001
 then-0.001
 stuck-0.001
 least-0.001
 scared-0.001
Token would
Token ID561
Feature activation-8.18e-03

Pos logit contributions
 soon+0.003
 then+0.002
 able+0.002
 stuck+0.002
 suddenly+0.002
Neg logit contributions
inner-0.004
board-0.004
affle-0.004
ag-0.004
oker-0.003
Token ring
Token ID5858
Feature activation-3.99e-03

Pos logit contributions
board+0.015
inner+0.015
ag+0.014
affle+0.013
anger+0.013
Neg logit contributions
 soon-0.013
 then-0.012
 able-0.010
 least-0.010
 stuck-0.010
Token.
Token ID13
Feature activation+6.69e-03

Pos logit contributions
inner+0.010
board+0.010
affle+0.009
ag+0.009
oker+0.009
Neg logit contributions
 soon-0.004
 then-0.003
 able-0.003
 stuck-0.002
 gently-0.002
Token
Token ID198
Feature activation+4.31e-03

Pos logit contributions
 soon+0.007
 then+0.006
 able+0.005
 least+0.005
 stuck+0.004
Neg logit contributions
board-0.016
inner-0.015
ag-0.014
ork-0.014
anger-0.014
Token
Token ID198
Feature activation+5.99e-03

Pos logit contributions
 soon+0.004
 then+0.004
 able+0.003
 least+0.003
 stuck+0.003
Neg logit contributions
board-0.010
inner-0.010
ag-0.009
ork-0.009
ade-0.009
TokenThe
Token ID464
Feature activation-1.37e-03

Pos logit contributions
 soon+0.008
 then+0.006
+0.005
 able+0.005
 least+0.005
Neg logit contributions
board-0.013
inner-0.013
ag-0.012
anger-0.012
 seas-0.012
Token little
Token ID1310
Feature activation+1.52e-02

Pos logit contributions
board+0.002
inner+0.002
ag+0.002
oker+0.002
ool+0.002
Neg logit contributions
 soon-0.002
 then-0.002
 least-0.002
 stuck-0.002
 scared-0.002
Token "
Token ID366
Feature activation+6.16e-03

Pos logit contributions
board+0.005
inner+0.004
ork+0.004
 medal+0.004
ag+0.004
Neg logit contributions
 soon-0.004
 least-0.004
 then-0.004
 able-0.004
 trapped-0.004
TokenMax
Token ID11518
Feature activation+9.44e-03

Pos logit contributions
 soon+0.009
 then+0.008
 able+0.007
 least+0.007
 Then+0.006
Neg logit contributions
board-0.012
inner-0.012
ag-0.011
anger-0.011
ork-0.011
Token passed
Token ID3804
Feature activation+5.37e-03

Pos logit contributions
 soon+0.008
 then+0.006
 able+0.005
 least+0.005
 stuck+0.004
Neg logit contributions
board-0.024
inner-0.023
ag-0.022
ork-0.022
 medal-0.022
Token the
Token ID262
Feature activation-1.38e-02

Pos logit contributions
 soon+0.007
 then+0.006
 able+0.006
 least+0.006
 hoping+0.005
Neg logit contributions
board-0.011
inner-0.010
ag-0.010
 medal-0.009
ork-0.009
Token test
Token ID1332
Feature activation-2.21e-03

Pos logit contributions
ag+0.020
ool+0.020
aff+0.019
oons+0.019
inner+0.019
Neg logit contributions
 soon-0.025
 then-0.022
 least-0.022
 scared-0.022
 very-0.022
Token!
Token ID0
Feature activation-2.77e-02

Pos logit contributions
inner+0.005
board+0.005
ag+0.005
ork+0.005
affle+0.005
Neg logit contributions
 soon-0.002
 then-0.002
 able-0.001
 least-0.001
 now-0.001
Token He
Token ID679
Feature activation-8.09e-03

Pos logit contributions
board+0.046
inner+0.045
oker+0.040
ag+0.039
anger+0.039
Neg logit contributions
 soon-0.047
 then-0.042
 able-0.039
 least-0.038
 trapped-0.038
Token is
Token ID318
Feature activation-1.11e-03

Pos logit contributions
inner+0.015
board+0.015
ag+0.014
ork+0.014
anger+0.014
Neg logit contributions
 soon-0.012
 then-0.011
 least-0.010
 able-0.009
 stuck-0.009
Token a
Token ID257
Feature activation+3.70e-03

Pos logit contributions
inner+0.002
board+0.002
ag+0.002
 medal+0.002
 trophy+0.002
Neg logit contributions
 soon-0.001
 then-0.001
 least-0.001
 Then-0.001
 able-0.001
Token good
Token ID922
Feature activation-5.96e-03

Pos logit contributions
 soon+0.006
 then+0.005
 able+0.005
 least+0.005
 stuck+0.004
Neg logit contributions
board-0.007
inner-0.006
ag-0.006
 medal-0.006
ork-0.006
Token dog
Token ID3290
Feature activation+8.17e-03

Pos logit contributions
inner+0.008
ag+0.008
 similarities+0.008
oons+0.007
board+0.007
Neg logit contributions
 soon-0.012
 then-0.011
 scared-0.010
 now-0.010
 afraid-0.010
Token dinosaur
Token ID28257
Feature activation-8.16e-03

Pos logit contributions
inner+0.014
board+0.013
ag+0.013
ade+0.012
ork+0.012
Neg logit contributions
 soon-0.018
 then-0.016
 least-0.014
 now-0.014
 able-0.014
Token.
Token ID13
Feature activation-1.59e-02

Pos logit contributions
board+0.018
inner+0.017
ag+0.016
 medal+0.016
ork+0.016
Neg logit contributions
 soon-0.010
 then-0.008
 able-0.007
 least-0.007
 stuck-0.007
Token I
Token ID314
Feature activation+4.57e-03

Pos logit contributions
board+0.025
inner+0.024
oker+0.021
 similarities+0.021
 medal+0.021
Neg logit contributions
 soon-0.028
 able-0.025
 then-0.024
 stuck-0.023
 least-0.023
Token will
Token ID481
Feature activation+7.34e-03

Pos logit contributions
 soon+0.006
 then+0.006
 able+0.005
 least+0.005
 stuck+0.005
Neg logit contributions
board-0.009
inner-0.009
ag-0.008
ork-0.008
 medal-0.008
Token share
Token ID2648
Feature activation+3.68e-03

Pos logit contributions
 soon+0.012
 then+0.011
 able+0.011
 trapped+0.010
 afraid+0.010
Neg logit contributions
board-0.010
inner-0.010
 medal-0.010
ag-0.010
 som-0.010
Token my
Token ID616
Feature activation-2.95e-02

Pos logit contributions
 soon+0.005
 then+0.004
 able+0.004
 least+0.004
 stuck+0.004
Neg logit contributions
board-0.007
inner-0.007
ag-0.007
ork-0.007
 medal-0.007
Token toys
Token ID14958
Feature activation-1.92e-03

Pos logit contributions
inner+0.049
board+0.044
anger+0.043
ool+0.042
oker+0.042
Neg logit contributions
 soon-0.051
 then-0.048
 now-0.042
 least-0.042
 very-0.040
Token and
Token ID290
Feature activation-1.90e-02

Pos logit contributions
board+0.005
inner+0.004
ag+0.004
 medal+0.004
ork+0.004
Neg logit contributions
 soon-0.002
 then-0.002
 able-0.001
 least-0.001
 afraid-0.001
Token say
Token ID910
Feature activation+1.25e-02

Pos logit contributions
board+0.034
inner+0.034
ag+0.031
anger+0.030
ork+0.030
Neg logit contributions
 soon-0.030
 then-0.027
 able-0.024
 least-0.024
 stuck-0.023
Token nice
Token ID3621
Feature activation-8.00e-03

Pos logit contributions
 soon+0.019
 then+0.017
 able+0.015
 least+0.015
 stuck+0.014
Neg logit contributions
board-0.023
inner-0.023
ag-0.021
ork-0.020
 medal-0.020
Token words
Token ID2456
Feature activation-1.63e-02

Pos logit contributions
inner+0.014
ool+0.013
ag+0.013
board+0.013
ork+0.013
Neg logit contributions
 soon-0.013
 then-0.011
 now-0.010
 sometimes-0.009
 over-0.009
Token She
Token ID1375
Feature activation+3.06e-03

Pos logit contributions
 soon+0.002
 then+0.002
 able+0.002
 least+0.002
 stuck+0.002
Neg logit contributions
board-0.004
inner-0.004
anger-0.003
ork-0.003
ag-0.003
Token went
Token ID1816
Feature activation-7.62e-03

Pos logit contributions
 soon+0.004
 then+0.003
 able+0.003
 least+0.003
 afraid+0.003
Neg logit contributions
board-0.006
inner-0.006
ag-0.006
 medal-0.005
ork-0.005
Token to
Token ID284
Feature activation-9.13e-03

Pos logit contributions
board+0.016
inner+0.015
ag+0.015
 medal+0.014
anger+0.014
Neg logit contributions
 soon-0.009
 then-0.008
 least-0.008
 able-0.007
 afraid-0.007
Token bed
Token ID3996
Feature activation-3.12e-02

Pos logit contributions
board+0.017
inner+0.016
ag+0.016
 medal+0.015
ork+0.015
Neg logit contributions
 soon-0.013
 then-0.012
 able-0.011
 least-0.010
 stuck-0.010
Token that
Token ID326
Feature activation+1.91e-03

Pos logit contributions
board+0.074
inner+0.073
ag+0.069
ork+0.067
anger+0.067
Neg logit contributions
 soon-0.031
 then-0.026
 able-0.021
 least-0.020
 stuck-0.019
Token night
Token ID1755
Feature activation-2.84e-02

Pos logit contributions
 soon+0.003
 then+0.003
 least+0.002
 able+0.002
 stuck+0.002
Neg logit contributions
board-0.003
inner-0.003
ag-0.003
affle-0.003
anger-0.003
Token,
Token ID11
Feature activation-7.79e-03

Pos logit contributions
board+0.066
inner+0.065
ag+0.061
ork+0.060
anger+0.060
Neg logit contributions
 soon-0.029
 then-0.024
 able-0.020
 least-0.020
 stuck-0.018
Token full
Token ID1336
Feature activation-4.23e-03

Pos logit contributions
board+0.017
inner+0.017
ag+0.016
ork+0.016
 medal+0.016
Neg logit contributions
 soon-0.008
 then-0.007
 able-0.006
 least-0.006
 stuck-0.006
Token of
Token ID286
Feature activation+1.38e-03

Pos logit contributions
inner+0.007
board+0.007
affle+0.007
ag+0.007
ork+0.007
Neg logit contributions
 soon-0.007
 then-0.007
 so-0.006
 now-0.006
 able-0.006
Token o
Token ID267
Feature activation-2.30e-03

Pos logit contributions
 soon+0.002
 then+0.002
 scared+0.002
 so+0.002
 afraid+0.002
Neg logit contributions
board-0.003
inner-0.003
ork-0.002
anger-0.002
ool-0.002
Tokenat
Token ID265
Feature activation-2.09e-04

Pos logit contributions
board+0.003
inner+0.003
ag+0.003
ork+0.002
 medal+0.002
Neg logit contributions
 soon-0.005
 then-0.004
 least-0.004
 able-0.004
 stuck-0.004
Token She
Token ID1375
Feature activation-8.83e-03

Pos logit contributions
board+0.022
inner+0.022
ork+0.020
anger+0.020
ag+0.020
Neg logit contributions
 soon-0.013
 then-0.011
 able-0.010
 least-0.010
 stuck-0.009
Token sees
Token ID7224
Feature activation-5.97e-05

Pos logit contributions
board+0.015
inner+0.014
ag+0.013
 medal+0.013
ork+0.013
Neg logit contributions
 soon-0.015
 then-0.013
 able-0.012
 least-0.012
 stuck-0.011
Token nothing
Token ID2147
Feature activation-9.37e-03

Pos logit contributions
board+0.000
inner+0.000
ag+0.000
 medal+0.000
 seas+0.000
Neg logit contributions
 soon-0.000
 then-0.000
 able-0.000
 least-0.000
 stuck-0.000
Token but
Token ID475
Feature activation-8.37e-03

Pos logit contributions
board+0.023
inner+0.022
ag+0.021
ork+0.021
anger+0.021
Neg logit contributions
 soon-0.009
 then-0.007
 able-0.006
 least-0.006
 stuck-0.005
Token a
Token ID257
Feature activation-1.26e-02

Pos logit contributions
board+0.021
inner+0.021
ag+0.020
ool+0.019
affle+0.019
Neg logit contributions
 soon-0.008
 then-0.007
 able-0.005
 stuck-0.005
 least-0.005
Token white
Token ID2330
Feature activation-2.90e-02

Pos logit contributions
inner+0.021
board+0.020
ag+0.020
anger+0.018
ool+0.018
Neg logit contributions
 soon-0.022
 then-0.020
 stuck-0.018
 scared-0.018
 afraid-0.017
Token sheet
Token ID9629
Feature activation-1.58e-02

Pos logit contributions
inner+0.055
ag+0.053
 similarities+0.051
oker+0.050
oons+0.050
Neg logit contributions
 soon-0.042
 then-0.041
 now-0.035
 stuck-0.034
 gently-0.034
Token.
Token ID13
Feature activation+2.28e-03

Pos logit contributions
inner+0.035
board+0.035
affle+0.033
ag+0.033
 seas+0.033
Neg logit contributions
 soon-0.018
 then-0.015
 stuck-0.012
 able-0.012
 now-0.012
Token She
Token ID1375
Feature activation+3.69e-03

Pos logit contributions
 soon+0.003
 then+0.003
 able+0.003
 least+0.003
 stuck+0.002
Neg logit contributions
board-0.005
inner-0.004
ork-0.004
anger-0.004
ag-0.004
Token feels
Token ID5300
Feature activation+6.13e-03

Pos logit contributions
 soon+0.006
 then+0.005
 able+0.005
 least+0.005
 stuck+0.004
Neg logit contributions
board-0.006
inner-0.006
ag-0.006
 medal-0.006
ork-0.006
Token sad
Token ID6507
Feature activation+9.67e-03

Pos logit contributions
 soon+0.008
 least+0.007
 then+0.006
 Amen+0.006
 Owl+0.006
Neg logit contributions
ag-0.011
 medal-0.010
board-0.010
inner-0.010
 som-0.010
Token.
Token ID13
Feature activation-1.24e-02

Pos logit contributions
 soon+0.009
 then+0.008
 least+0.008
 able+0.007
 suddenly+0.007
Neg logit contributions
board-0.016
inner-0.015
ag-0.014
 medal-0.014
 guys-0.014
Token Maybe
Token ID6674
Feature activation+5.18e-03

Pos logit contributions
board+0.021
inner+0.021
anger+0.018
ag+0.018
oker+0.018
Neg logit contributions
 soon-0.020
 then-0.018
 able-0.017
 stuck-0.016
 least-0.016
Token we
Token ID356
Feature activation+8.35e-03

Pos logit contributions
 soon+0.008
 able+0.007
 afraid+0.007
 least+0.007
 trapped+0.007
Neg logit contributions
board-0.008
 seas-0.008
inner-0.008
 medal-0.008
 guys-0.008
Token can
Token ID460
Feature activation+1.19e-02

Pos logit contributions
 soon+0.013
 then+0.012
 able+0.011
 Then+0.010
 least+0.010
Neg logit contributions
board-0.014
inner-0.014
ag-0.013
 guys-0.013
 medal-0.013
Token share
Token ID2648
Feature activation+5.61e-03

Pos logit contributions
 soon+0.020
 able+0.020
 then+0.019
 afraid+0.019
 trapped+0.018
Neg logit contributions
board-0.016
inner-0.015
ag-0.015
 guys-0.015
 medal-0.015
Token our
Token ID674
Feature activation-2.66e-02

Pos logit contributions
 soon+0.007
 then+0.006
 able+0.006
 least+0.005
 afraid+0.005
Neg logit contributions
board-0.011
inner-0.011
 seas-0.011
ag-0.011
 medal-0.011
Token dreams
Token ID10625
Feature activation-8.36e-03

Pos logit contributions
inner+0.043
board+0.042
anger+0.038
oker+0.037
ag+0.037
Neg logit contributions
 soon-0.048
 then-0.043
 least-0.040
 now-0.039
 trapped-0.039
Token later
Token ID1568
Feature activation-1.15e-02

Pos logit contributions
board+0.020
inner+0.019
ag+0.018
 medal+0.018
ork+0.018
Neg logit contributions
 soon-0.008
 then-0.007
 least-0.006
 able-0.006
 stuck-0.005
Token.
Token ID13
Feature activation-1.74e-02

Pos logit contributions
board+0.025
inner+0.024
ag+0.023
 guys+0.022
 medal+0.022
Neg logit contributions
 soon-0.012
 then-0.010
 able-0.010
 least-0.010
 afraid-0.009
Token Let
Token ID3914
Feature activation+1.71e-02

Pos logit contributions
board+0.026
inner+0.025
 guys+0.024
 similarities+0.024
 People+0.024
Neg logit contributions
 able-0.028
 soon-0.027
 trapped-0.025
 stuck-0.025
 worried-0.024
Token's
Token ID338
Feature activation+8.23e-03

Pos logit contributions
 soon+0.022
 then+0.019
 able+0.019
 trapped+0.017
 least+0.017
Neg logit contributions
board-0.034
inner-0.033
ag-0.031
 guys-0.031
ade-0.031