TOP ACTIVATIONS
MAX = 0.052

demands , crab ," the mer maid said . " I
a potato named Mr . Sp ud . They had lots
ever and he thanked the mer maid for being so kind
so happy and thanked the mer maid many times . With
. The strong mer maid thought about the promise
Hop in ." N ora was so excited she
swimming around her . The mer maid smiled and said ,
the clam stepped a beautiful mer maid . " I heard
Joe waved goodbye to the mer maid , and hopped back
so far ?" N emo replied , " I
Jack asked the mer maid why the waves were
the bird . N ora followed the bird all
y met a fish named Bub bles . Bub bles asked
Nora spin . N ora finally stopped spinning and
. She gave it to N iki without any hesitation .
made . The mer maid learnt that taking risks
so excited and when the mer maid saw him she gave
again . He said the mer maid and a dolphin were
castle so much . The mer maid said , " I
more ?" O ll ie said , " Yes

MIN ACTIVATIONS
MIN = -0.045

flowers . Anna is good at puzzles . She can finish
place ! The chip mun ks didn 't know
Lily and Ben are good at puzzles . They can finish
! You were not good at all . You have to
, fourteen , fifteen , sixteen , seventeen , eighteen ,
. The chip m unk told the boy
L ily is not good at writing and counting . She
!" The chip m unk finished the treat
books . I am better at numbers than you ."
floor . The chip mun ks were very curious
" You 're really good at matching , Lily !"
. He was very good at it . One day ,
Yeah , we are good at fixing things ," Ben said
too . The chip m unk helped the frog
. " You are good at this game ."
. He was very good at it and his towers were
Ben is good at writing and counting . He
! The chip m unk was excited and
flowers . Lily is good at cutting paper . She can
proud . We are good at counting ." They

INTERVAL 0.028 - 0.052
CONTAINS 0.066%

had a bald dog called M abel . M abel was
strong . One day , Frost y met a little girl
chocolate and cookies . How does that sound ?"
and caring , comfort ed Sus ie and said " It
buttons for eyes and a smile . " Hello

INTERVAL 0.003 - 0.028
CONTAINS 30.537%

as the bath tub . It fills up the whole bath
he had still found a hidden secret . <|endoftext|> Once upon
. Let me take a look and make it all better
and Sam , were walking in the city . Sam pointed
Tim my loved to play outside , especially when it r

INTERVAL -0.021 - 0.003
CONTAINS 69.169%

, his room looked so neat and ordinary . Tim my
more comfortable . He remembered his friend the fox had a
." L ily said , " Don 't
Everyone cl apped and laughed . Tim 's mom mentioned that
box wasn 't really empty - it was a home for

INTERVAL -0.045 - -0.021
CONTAINS 0.229%

S ue and Tim both felt sad . Sue
want . But you can also try other colors . You
a ball , a pillow , and a book . They
out why Spot did not like the new path . It
. They did not care about the blue bird anymore .
Token demands
Token ID8665
Feature activation-4.81e-04

Pos logit contributions
rant+0.024
mer+0.020
le+0.019
 Will+0.017
sc+0.017
Neg logit contributions
 aware-0.012
 decisions-0.011
 oak-0.011
 office-0.010
 bravery-0.010
Token,
Token ID11
Feature activation+2.10e-03

Pos logit contributions
 aware+0.001
 oak+0.001
 decisions+0.001
joy+0.001
 bravery+0.001
Neg logit contributions
rant-0.001
mer-0.001
le-0.001
sc-0.000
te-0.000
Token crab
Token ID32202
Feature activation-3.49e-03

Pos logit contributions
rant+0.004
mer+0.003
le+0.003
 Will+0.003
ola+0.002
Neg logit contributions
 aware-0.004
 decisions-0.003
 office-0.003
 property-0.003
unch-0.003
Token,"
Token ID553
Feature activation-1.69e-04

Pos logit contributions
 aware+0.006
 decisions+0.006
joy+0.006
 bravery+0.006
unch+0.006
Neg logit contributions
rant-0.006
mer-0.005
le-0.004
led-0.004
sc-0.004
Token the
Token ID262
Feature activation+9.79e-03

Pos logit contributions
 aware+0.000
 decisions+0.000
 property+0.000
 bravery+0.000
 office+0.000
Neg logit contributions
rant-0.000
mer-0.000
le-0.000
 Will-0.000
sc-0.000
Token mer
Token ID4017
Feature activation+5.19e-02

Pos logit contributions
rant+0.022
mer+0.018
le+0.017
sc+0.016
te+0.015
Neg logit contributions
 aware-0.014
 oak-0.013
 bravery-0.013
 discovery-0.013
 understanding-0.013
Tokenmaid
Token ID23151
Feature activation+7.73e-03

Pos logit contributions
rant+0.084
sc+0.060
ifying+0.057
mer+0.055
le+0.051
Neg logit contributions
 aware-0.097
 discovery-0.095
 Majesty-0.095
 decision-0.092
 decisions-0.092
Token said
Token ID531
Feature activation-6.15e-03

Pos logit contributions
rant+0.017
mer+0.013
le+0.012
sc+0.011
ifying+0.011
Neg logit contributions
 aware-0.012
 understanding-0.011
 bravery-0.010
 oak-0.010
 discovery-0.010
Token.
Token ID13
Feature activation+5.03e-04

Pos logit contributions
 aware+0.012
 decisions+0.011
 bravery+0.011
 understanding+0.011
 oak+0.011
Neg logit contributions
rant-0.010
mer-0.008
le-0.007
sc-0.006
led-0.006
Token "
Token ID366
Feature activation+3.13e-03

Pos logit contributions
rant+0.001
mer+0.001
le+0.001
sc+0.001
 Will+0.001
Neg logit contributions
 aware-0.001
 decisions-0.001
 bravery-0.001
 oak-0.001
 property-0.001
TokenI
Token ID40
Feature activation-3.00e-04

Pos logit contributions
rant+0.006
mer+0.005
le+0.005
led+0.004
sc+0.004
Neg logit contributions
 aware-0.005
 understanding-0.005
 decisions-0.005
 oak-0.004
 bravery-0.004
Token a
Token ID257
Feature activation-2.43e-03

Pos logit contributions
 decisions+0.002
 aware+0.002
 property+0.001
 impact+0.001
 bravery+0.001
Neg logit contributions
rant-0.001
mer-0.001
le-0.001
 Will-0.001
te-0.001
Token potato
Token ID21219
Feature activation-5.00e-03

Pos logit contributions
 aware+0.004
 decisions+0.003
 bravery+0.003
 oak+0.003
 office+0.003
Neg logit contributions
rant-0.005
mer-0.004
le-0.004
sc-0.003
te-0.003
Token named
Token ID3706
Feature activation-2.89e-03

Pos logit contributions
 aware+0.009
 oak+0.008
 office+0.008
 decisions+0.008
 bravery+0.008
Neg logit contributions
rant-0.009
mer-0.007
le-0.007
sc-0.006
led-0.006
Token Mr
Token ID1770
Feature activation+1.73e-02

Pos logit contributions
 aware+0.003
 oak+0.003
 bravery+0.003
 decisions+0.003
unch+0.003
Neg logit contributions
rant-0.007
mer-0.006
le-0.006
led-0.005
sc-0.005
Token.
Token ID13
Feature activation-5.46e-03

Pos logit contributions
rant+0.027
mer+0.021
le+0.018
sc+0.015
te+0.015
Neg logit contributions
 aware-0.035
 decisions-0.034
 bravery-0.033
 oak-0.033
 office-0.033
Token Sp
Token ID1338
Feature activation+4.98e-02

Pos logit contributions
 aware+0.009
 oak+0.009
joy+0.009
 decisions+0.008
unch+0.008
Neg logit contributions
rant-0.010
mer-0.008
le-0.008
led-0.007
sc-0.007
Tokenud
Token ID463
Feature activation+1.32e-02

Pos logit contributions
rant+0.088
mer+0.065
ifying+0.062
sc+0.058
 lit+0.056
Neg logit contributions
unch-0.085
 aware-0.084
 discovery-0.083
 office-0.083
 property-0.083
Token.
Token ID13
Feature activation+3.00e-03

Pos logit contributions
rant+0.026
mer+0.019
sc+0.018
le+0.017
ifying+0.016
Neg logit contributions
 aware-0.021
 oak-0.020
 office-0.020
 bike-0.019
 climbing-0.019
Token They
Token ID1119
Feature activation+7.54e-03

Pos logit contributions
rant+0.007
mer+0.006
le+0.006
led+0.006
vy+0.005
Neg logit contributions
 oak-0.003
unch-0.003
 aware-0.003
 office-0.003
 decisions-0.003
Token had
Token ID550
Feature activation+3.55e-03

Pos logit contributions
rant+0.014
mer+0.012
le+0.010
sc+0.009
auc+0.009
Neg logit contributions
 oak-0.013
 aware-0.013
 bravery-0.011
 climbing-0.011
 nest-0.011
Token lots
Token ID6041
Feature activation+2.53e-03

Pos logit contributions
rant+0.008
mer+0.006
le+0.006
sc+0.005
te+0.005
Neg logit contributions
 aware-0.005
 decisions-0.005
 bravery-0.005
 oak-0.005
 office-0.005
Token ever
Token ID1683
Feature activation+4.63e-03

Pos logit contributions
rant+0.006
mer+0.004
le+0.004
sc+0.003
led+0.003
Neg logit contributions
 aware-0.009
 bravery-0.008
 decisions-0.008
 understanding-0.008
 discovery-0.008
Token and
Token ID290
Feature activation-3.15e-04

Pos logit contributions
rant+0.010
mer+0.008
le+0.007
izz+0.007
sc+0.007
Neg logit contributions
 aware-0.007
 decisions-0.006
 discovery-0.006
 bravery-0.006
 property-0.006
Token he
Token ID339
Feature activation-3.90e-03

Pos logit contributions
 aware+0.001
 decisions+0.000
 bravery+0.000
 oak+0.000
 office+0.000
Neg logit contributions
rant-0.001
mer-0.001
le-0.000
sc-0.000
te-0.000
Token thanked
Token ID26280
Feature activation+1.65e-02

Pos logit contributions
 aware+0.007
 bravery+0.007
 understanding+0.006
 oak+0.006
 decisions+0.006
Neg logit contributions
rant-0.007
mer-0.006
le-0.005
sc-0.005
ifying-0.005
Token the
Token ID262
Feature activation+1.26e-02

Pos logit contributions
rant+0.028
mer+0.022
le+0.020
 Will+0.017
te+0.017
Neg logit contributions
 aware-0.031
 decisions-0.030
 oak-0.029
 bravery-0.029
 office-0.029
Token mer
Token ID4017
Feature activation+4.96e-02

Pos logit contributions
rant+0.018
mer+0.013
le+0.011
sc+0.010
led+0.009
Neg logit contributions
 aware-0.027
 bravery-0.027
 discovery-0.027
 decisions-0.027
 understanding-0.027
Tokenmaid
Token ID23151
Feature activation+9.11e-03

Pos logit contributions
rant+0.089
sc+0.065
ifying+0.064
wy+0.061
ioned+0.061
Neg logit contributions
 Majesty-0.087
 discovery-0.082
 aware-0.078
 decision-0.075
joy-0.074
Token for
Token ID329
Feature activation-2.14e-03

Pos logit contributions
rant+0.017
mer+0.014
le+0.013
sc+0.011
te+0.011
Neg logit contributions
 aware-0.016
 decisions-0.015
 bravery-0.015
 oak-0.015
 office-0.014
Token being
Token ID852
Feature activation+1.31e-03

Pos logit contributions
 aware+0.003
 decisions+0.003
 oak+0.003
 property+0.003
unch+0.003
Neg logit contributions
rant-0.004
mer-0.003
le-0.003
 Will-0.003
te-0.003
Token so
Token ID523
Feature activation-1.67e-03

Pos logit contributions
rant+0.003
mer+0.002
le+0.002
sc+0.002
auc+0.002
Neg logit contributions
 aware-0.002
 understanding-0.002
 bravery-0.002
 decisions-0.002
joy-0.002
Token kind
Token ID1611
Feature activation-2.58e-03

Pos logit contributions
 aware+0.002
 bravery+0.002
 understanding+0.002
 decisions+0.002
 oak+0.002
Neg logit contributions
rant-0.004
mer-0.004
le-0.003
sc-0.003
led-0.003
Token so
Token ID523
Feature activation-3.00e-03

Pos logit contributions
rant+0.003
mer+0.002
le+0.002
sc+0.002
led+0.002
Neg logit contributions
 aware-0.002
 decisions-0.002
 bravery-0.002
 understanding-0.002
 office-0.002
Token happy
Token ID3772
Feature activation-6.87e-03

Pos logit contributions
 aware+0.003
 understanding+0.003
 bravery+0.003
 decisions+0.003
 discovery+0.003
Neg logit contributions
rant-0.008
mer-0.007
le-0.006
sc-0.006
ifying-0.005
Token and
Token ID290
Feature activation+1.04e-02

Pos logit contributions
 aware+0.013
 decisions+0.013
 oak+0.013
 impact+0.012
 property+0.012
Neg logit contributions
rant-0.011
mer-0.009
le-0.008
 Will-0.007
te-0.007
Token thanked
Token ID26280
Feature activation+1.59e-02

Pos logit contributions
rant+0.020
mer+0.016
le+0.015
sc+0.013
te+0.012
Neg logit contributions
 aware-0.017
 decisions-0.017
 bravery-0.016
 oak-0.016
 office-0.016
Token the
Token ID262
Feature activation+1.06e-02

Pos logit contributions
rant+0.029
mer+0.023
le+0.021
sc+0.018
te+0.017
Neg logit contributions
 aware-0.029
 decisions-0.028
 bravery-0.027
 oak-0.027
 office-0.027
Token mer
Token ID4017
Feature activation+4.94e-02

Pos logit contributions
rant+0.020
mer+0.016
le+0.015
sc+0.013
led+0.012
Neg logit contributions
 aware-0.018
 decisions-0.018
 bravery-0.018
 discovery-0.018
 understanding-0.018
Tokenmaid
Token ID23151
Feature activation+4.87e-03

Pos logit contributions
rant+0.083
sc+0.059
ifying+0.055
mer+0.054
ioned+0.052
Neg logit contributions
 discovery-0.089
 aware-0.089
 Majesty-0.089
 decisions-0.087
 decision-0.086
Token many
Token ID867
Feature activation+1.17e-02

Pos logit contributions
rant+0.008
mer+0.006
le+0.006
te+0.005
 Will+0.005
Neg logit contributions
 aware-0.009
 decisions-0.009
 oak-0.009
 property-0.009
 bravery-0.009
Token times
Token ID1661
Feature activation-7.54e-03

Pos logit contributions
rant+0.027
mer+0.022
le+0.020
sc+0.019
izz+0.019
Neg logit contributions
 aware-0.015
 decisions-0.015
 bravery-0.015
 understanding-0.015
 discovery-0.014
Token.
Token ID13
Feature activation-3.34e-03

Pos logit contributions
 aware+0.014
 decisions+0.014
 bravery+0.014
 oak+0.014
 office+0.013
Neg logit contributions
rant-0.013
mer-0.010
le-0.009
sc-0.008
te-0.007
Token With
Token ID2080
Feature activation+6.16e-03

Pos logit contributions
 aware+0.006
 property+0.006
 decisions+0.005
 discovery+0.005
 bravery+0.005
Neg logit contributions
rant-0.006
mer-0.005
 Will-0.004
le-0.004
sc-0.004
Token.
Token ID13
Feature activation+8.48e-04

Pos logit contributions
rant+0.012
mer+0.009
le+0.008
sc+0.008
led+0.008
Neg logit contributions
 aware-0.010
 understanding-0.009
 decisions-0.009
 oak-0.009
 discovery-0.009
Token
Token ID220
Feature activation+2.68e-03

Pos logit contributions
rant+0.002
mer+0.001
le+0.001
 Will+0.001
sc+0.001
Neg logit contributions
 aware-0.001
 decisions-0.001
 bravery-0.001
 property-0.001
 oak-0.001
Token
Token ID198
Feature activation-6.19e-03

Pos logit contributions
rant+0.005
mer+0.004
le+0.004
sc+0.003
 Will+0.003
Neg logit contributions
 aware-0.004
 decisions-0.004
 oak-0.004
 bravery-0.004
 property-0.004
TokenThe
Token ID464
Feature activation+1.00e-02

Pos logit contributions
 aware+0.009
 decisions+0.008
 oak+0.008
 bravery+0.008
 office+0.008
Neg logit contributions
rant-0.013
mer-0.011
le-0.010
led-0.009
sc-0.009
Token strong
Token ID1913
Feature activation+1.67e-02

Pos logit contributions
rant+0.016
mer+0.011
le+0.010
sc+0.009
ifying+0.009
Neg logit contributions
 discovery-0.021
 understanding-0.020
 aware-0.020
 bravery-0.020
 oak-0.020
Token mer
Token ID4017
Feature activation+4.89e-02

Pos logit contributions
rant+0.036
mer+0.028
sc+0.027
ifying+0.027
wy+0.024
Neg logit contributions
 discovery-0.024
 oak-0.023
 aware-0.023
 understanding-0.023
 decision-0.022
Tokenmaid
Token ID23151
Feature activation+5.24e-03

Pos logit contributions
rant+0.086
sc+0.063
ifying+0.062
mer+0.059
wy+0.057
Neg logit contributions
 Majesty-0.086
 discovery-0.083
 aware-0.082
 decision-0.077
 decisions-0.077
Token thought
Token ID1807
Feature activation+3.65e-03

Pos logit contributions
rant+0.011
mer+0.008
le+0.007
ifying+0.007
sc+0.007
Neg logit contributions
 aware-0.009
 oak-0.008
 bravery-0.008
 understanding-0.008
 office-0.008
Token about
Token ID546
Feature activation-7.93e-03

Pos logit contributions
rant+0.008
mer+0.007
le+0.006
sc+0.006
te+0.005
Neg logit contributions
 aware-0.005
 decisions-0.005
 bravery-0.005
 oak-0.005
 office-0.005
Token the
Token ID262
Feature activation+9.94e-04

Pos logit contributions
 aware+0.014
unch+0.013
 decisions+0.013
shadow+0.013
 leash+0.013
Neg logit contributions
rant-0.013
 Will-0.011
le-0.010
 them-0.009
mer-0.009
Token promise
Token ID6991
Feature activation-3.37e-03

Pos logit contributions
rant+0.002
mer+0.002
le+0.001
sc+0.001
led+0.001
Neg logit contributions
 aware-0.002
 decisions-0.002
 bravery-0.002
 oak-0.002
 discovery-0.002
Token Hop
Token ID9996
Feature activation+5.21e-03

Pos logit contributions
 aware+0.007
 decisions+0.007
 discovery+0.007
 oak+0.007
 bravery+0.007
Neg logit contributions
rant-0.008
mer-0.007
le-0.006
 Will-0.006
sc-0.005
Token in
Token ID287
Feature activation+7.83e-03

Pos logit contributions
rant+0.010
mer+0.007
le+0.007
sc+0.006
te+0.006
Neg logit contributions
 aware-0.009
 decisions-0.009
 bravery-0.009
 oak-0.009
 office-0.009
Token."
Token ID526
Feature activation+1.41e-04

Pos logit contributions
rant+0.014
mer+0.011
le+0.011
sc+0.009
 Will+0.009
Neg logit contributions
 aware-0.014
 decisions-0.013
 bravery-0.012
unch-0.012
 property-0.012
Token
Token ID198
Feature activation+1.94e-03

Pos logit contributions
rant+0.000
mer+0.000
le+0.000
sc+0.000
led+0.000
Neg logit contributions
 aware-0.000
 decisions-0.000
 bravery-0.000
 oak-0.000
 office-0.000
Token
Token ID198
Feature activation-9.58e-04

Pos logit contributions
rant+0.005
mer+0.004
le+0.004
ifying+0.003
te+0.003
Neg logit contributions
 aware-0.002
unch-0.002
 understanding-0.002
 discovery-0.002
 decisions-0.002
TokenN
Token ID45
Feature activation+4.84e-02

Pos logit contributions
 aware+0.002
 understanding+0.002
unch+0.002
 decisions+0.002
 discovery+0.002
Neg logit contributions
rant-0.002
mer-0.001
le-0.001
te-0.001
 Will-0.001
Tokenora
Token ID5799
Feature activation-5.35e-03

Pos logit contributions
rant+0.102
mer+0.079
ifying+0.074
sc+0.073
 lit+0.070
Neg logit contributions
 aware-0.074
 decisions-0.069
 discovery-0.064
joy-0.064
 property-0.064
Token was
Token ID373
Feature activation-1.33e-03

Pos logit contributions
 aware+0.010
 decisions+0.010
 bravery+0.010
 oak+0.010
 office+0.010
Neg logit contributions
rant-0.009
mer-0.007
le-0.006
sc-0.005
ifying-0.005
Token so
Token ID523
Feature activation-6.96e-03

Pos logit contributions
 aware+0.002
 decisions+0.002
 oak+0.002
 bravery+0.002
 property+0.002
Neg logit contributions
rant-0.003
mer-0.002
le-0.002
sc-0.002
te-0.002
Token excited
Token ID6568
Feature activation+1.69e-04

Pos logit contributions
 aware+0.008
 understanding+0.007
 bravery+0.007
 decisions+0.007
 oak+0.007
Neg logit contributions
rant-0.018
mer-0.015
le-0.014
sc-0.013
led-0.012
Token she
Token ID673
Feature activation+1.39e-02

Pos logit contributions
rant+0.000
mer+0.000
le+0.000
led+0.000
 Will+0.000
Neg logit contributions
 aware-0.000
 decisions-0.000
 oak-0.000
 bravery-0.000
 impact-0.000
Token swimming
Token ID14899
Feature activation+7.24e-03

Pos logit contributions
rant+0.028
mer+0.023
le+0.021
sc+0.019
te+0.019
Neg logit contributions
 aware-0.020
 oak-0.018
 decisions-0.018
 bravery-0.018
 discovery-0.018
Token around
Token ID1088
Feature activation+6.26e-03

Pos logit contributions
rant+0.013
mer+0.009
le+0.008
acy+0.008
izz+0.008
Neg logit contributions
 aware-0.013
 discovery-0.013
 oak-0.012
 leash-0.012
joy-0.012
Token her
Token ID607
Feature activation+9.71e-04

Pos logit contributions
rant+0.012
mer+0.010
le+0.009
sc+0.008
led+0.008
Neg logit contributions
 aware-0.010
joy-0.010
 discovery-0.010
 understanding-0.010
 oak-0.010
Token.
Token ID13
Feature activation-1.70e-03

Pos logit contributions
rant+0.002
mer+0.001
le+0.001
sc+0.001
te+0.001
Neg logit contributions
 aware-0.002
 decisions-0.002
 bravery-0.002
 oak-0.002
 office-0.002
Token The
Token ID383
Feature activation+7.46e-03

Pos logit contributions
 aware+0.003
 decisions+0.003
 bravery+0.002
 office+0.002
 property+0.002
Neg logit contributions
rant-0.003
mer-0.003
le-0.003
 Will-0.002
sc-0.002
Token mer
Token ID4017
Feature activation+4.79e-02

Pos logit contributions
rant+0.011
mer+0.008
le+0.006
sc+0.006
te+0.006
Neg logit contributions
 discovery-0.016
 bravery-0.016
 understanding-0.016
 aware-0.015
 decisions-0.015
Tokenmaid
Token ID23151
Feature activation+5.99e-03

Pos logit contributions
rant+0.081
sc+0.060
mer+0.056
ifying+0.056
wy+0.054
Neg logit contributions
 discovery-0.083
 Majesty-0.082
 aware-0.082
 discoveries-0.076
 bravery-0.075
Token smiled
Token ID13541
Feature activation+3.71e-03

Pos logit contributions
rant+0.012
mer+0.009
le+0.008
sc+0.008
auc+0.008
Neg logit contributions
 aware-0.010
 understanding-0.009
 discovery-0.009
 bravery-0.009
 oak-0.009
Token and
Token ID290
Feature activation+7.27e-03

Pos logit contributions
rant+0.008
mer+0.006
le+0.006
sc+0.006
izz+0.005
Neg logit contributions
 aware-0.006
 understanding-0.005
joy-0.005
 bravery-0.005
 discovery-0.005
Token said
Token ID531
Feature activation-5.03e-03

Pos logit contributions
rant+0.013
mer+0.011
le+0.010
sc+0.008
led+0.008
Neg logit contributions
 aware-0.013
 decisions-0.013
 oak-0.012
 bravery-0.012
 office-0.012
Token,
Token ID11
Feature activation-7.32e-03

Pos logit contributions
 aware+0.008
 decisions+0.008
 bravery+0.007
 oak+0.007
 understanding+0.007
Neg logit contributions
rant-0.010
mer-0.008
le-0.008
sc-0.007
led-0.007
Token the
Token ID262
Feature activation+4.29e-03

Pos logit contributions
rant+0.004
mer+0.004
le+0.003
sc+0.003
te+0.003
Neg logit contributions
 aware-0.004
 decisions-0.004
 bravery-0.004
 oak-0.004
 office-0.004
Token clam
Token ID30726
Feature activation-5.17e-03

Pos logit contributions
rant+0.009
mer+0.007
le+0.007
sc+0.006
te+0.006
Neg logit contributions
 decisions-0.006
 aware-0.006
 oak-0.006
 discovery-0.006
 bravery-0.006
Token stepped
Token ID10764
Feature activation-7.32e-03

Pos logit contributions
 aware+0.010
 decisions+0.010
 bravery+0.010
 oak+0.010
 impact+0.010
Neg logit contributions
rant-0.008
mer-0.006
le-0.006
sc-0.005
led-0.005
Token a
Token ID257
Feature activation+1.18e-02

Pos logit contributions
joy+0.012
 aware+0.012
 office+0.011
 decisions+0.011
 leash+0.011
Neg logit contributions
rant-0.013
mer-0.011
le-0.010
led-0.008
te-0.008
Token beautiful
Token ID4950
Feature activation+1.42e-02

Pos logit contributions
rant+0.024
mer+0.020
le+0.018
sc+0.016
te+0.016
Neg logit contributions
 aware-0.018
 decisions-0.017
 bravery-0.017
 oak-0.017
 office-0.017
Token mer
Token ID4017
Feature activation+4.79e-02

Pos logit contributions
rant+0.030
mer+0.025
le+0.023
sc+0.020
te+0.020
Neg logit contributions
 aware-0.021
 decisions-0.020
 bravery-0.019
 oak-0.019
 office-0.019
Tokenmaid
Token ID23151
Feature activation+1.11e-02

Pos logit contributions
rant+0.077
sc+0.054
mer+0.052
izz+0.048
 Will+0.048
Neg logit contributions
 discovery-0.088
 aware-0.087
 Majesty-0.086
 decision-0.086
 decisions-0.084
Token.
Token ID13
Feature activation-1.84e-03

Pos logit contributions
rant+0.020
mer+0.016
le+0.015
sc+0.013
te+0.012
Neg logit contributions
 aware-0.019
 decisions-0.019
 bravery-0.018
 oak-0.018
 office-0.018
Token "
Token ID366
Feature activation+8.53e-03

Pos logit contributions
 aware+0.003
 decisions+0.003
 bravery+0.003
 oak+0.003
 office+0.003
Neg logit contributions
rant-0.004
mer-0.003
le-0.003
sc-0.003
te-0.003
TokenI
Token ID40
Feature activation-1.37e-03

Pos logit contributions
rant+0.016
mer+0.013
le+0.011
led+0.010
sc+0.010
Neg logit contributions
 aware-0.015
 understanding-0.014
 oak-0.014
joy-0.013
 decisions-0.013
Token heard
Token ID2982
Feature activation+6.72e-03

Pos logit contributions
 aware+0.002
 discovery+0.002
 bravery+0.002
 understanding+0.002
 impact+0.002
Neg logit contributions
rant-0.003
mer-0.002
le-0.002
bly-0.002
led-0.002
Token Joe
Token ID5689
Feature activation-8.16e-03

Pos logit contributions
 aware+0.004
 leash+0.004
 property+0.004
 decisions+0.004
 bravery+0.004
Neg logit contributions
rant-0.004
mer-0.003
 Will-0.003
le-0.003
sc-0.003
Token waved
Token ID26834
Feature activation-2.94e-03

Pos logit contributions
 aware+0.015
 bravery+0.014
 oak+0.014
 understanding+0.014
 decisions+0.014
Neg logit contributions
rant-0.015
mer-0.011
le-0.010
sc-0.009
ifying-0.009
Token goodbye
Token ID24829
Feature activation-7.33e-06

Pos logit contributions
 aware+0.004
 bravery+0.003
 understanding+0.003
 decisions+0.003
joy+0.003
Neg logit contributions
rant-0.007
mer-0.006
le-0.005
sc-0.005
izz-0.005
Token to
Token ID284
Feature activation+1.24e-02

Feature not active on prev token;
no predictions were affected.
Token the
Token ID262
Feature activation+1.11e-02

Pos logit contributions
rant+0.021
mer+0.016
le+0.015
 Will+0.013
te+0.012
Neg logit contributions
 aware-0.024
 decisions-0.023
 property-0.022
 office-0.022
 oak-0.022
Token mer
Token ID4017
Feature activation+4.75e-02

Pos logit contributions
rant+0.015
mer+0.010
le+0.008
sc+0.007
ifying+0.006
Neg logit contributions
 aware-0.026
 bravery-0.025
 discovery-0.025
 decisions-0.025
 understanding-0.025
Tokenmaid
Token ID23151
Feature activation+7.45e-03

Pos logit contributions
rant+0.087
sc+0.066
ifying+0.065
wy+0.064
ioned+0.060
Neg logit contributions
 Majesty-0.077
 discovery-0.077
 aware-0.073
 decision-0.070
 discoveries-0.068
Token,
Token ID11
Feature activation-1.87e-02

Pos logit contributions
rant+0.012
mer+0.010
le+0.009
 Will+0.007
te+0.007
Neg logit contributions
 aware-0.014
 decisions-0.014
 property-0.014
 impact-0.014
 oak-0.014
Token and
Token ID290
Feature activation-2.02e-03

Pos logit contributions
 decisions+0.038
 property+0.035
 impact+0.034
 decision+0.034
 office+0.034
Neg logit contributions
rant-0.030
mer-0.023
le-0.020
 Will-0.020
 could-0.019
Token hopped
Token ID45230
Feature activation-3.16e-03

Pos logit contributions
 aware+0.004
 decisions+0.004
 property+0.004
unch+0.004
 office+0.004
Neg logit contributions
rant-0.003
mer-0.003
le-0.003
 Will-0.002
led-0.002
Token back
Token ID736
Feature activation+6.34e-03

Pos logit contributions
 aware+0.005
 leash+0.005
 bravery+0.005
 understanding+0.005
 discovery+0.005
Neg logit contributions
rant-0.006
mer-0.005
le-0.004
auc-0.004
izz-0.004
Token so
Token ID523
Feature activation+1.03e-02

Pos logit contributions
rant+0.028
mer+0.021
le+0.020
izz+0.019
led+0.018
Neg logit contributions
 aware-0.022
 discovery-0.021
 leash-0.021
 property-0.021
 office-0.021
Token far
Token ID1290
Feature activation+1.73e-02

Pos logit contributions
rant+0.023
mer+0.018
le+0.017
sc+0.016
te+0.015
Neg logit contributions
 aware-0.015
 decisions-0.014
 bravery-0.014
 oak-0.014
 understanding-0.013
Token?"
Token ID1701
Feature activation+8.22e-03

Pos logit contributions
rant+0.031
mer+0.025
le+0.022
te+0.020
sc+0.020
Neg logit contributions
 aware-0.030
 oak-0.028
 decisions-0.028
 office-0.028
 bravery-0.028
Token
Token ID198
Feature activation+5.85e-03

Pos logit contributions
rant+0.017
mer+0.014
le+0.013
te+0.011
sc+0.011
Neg logit contributions
 aware-0.013
 oak-0.012
 decisions-0.012
unch-0.012
 discovery-0.012
Token
Token ID198
Feature activation+2.78e-03

Pos logit contributions
rant+0.016
mer+0.014
le+0.013
ifying+0.012
led+0.012
Neg logit contributions
unch-0.005
 discovery-0.004
 aware-0.004
 understanding-0.004
 Nut-0.004
TokenN
Token ID45
Feature activation+4.74e-02

Pos logit contributions
rant+0.006
le+0.004
mer+0.004
led+0.003
te+0.003
Neg logit contributions
 aware-0.005
unch-0.005
 oak-0.004
 leash-0.004
 understanding-0.004
Tokenemo
Token ID41903
Feature activation+2.92e-03

Pos logit contributions
rant+0.093
mer+0.071
le+0.066
sc+0.065
ifying+0.062
Neg logit contributions
 aware-0.076
 decisions-0.074
unch-0.072
 discovery-0.072
 property-0.071
Token replied
Token ID8712
Feature activation-3.63e-03

Pos logit contributions
rant+0.006
mer+0.005
le+0.004
sc+0.004
ifying+0.004
Neg logit contributions
 aware-0.005
 bravery-0.004
 decisions-0.004
 discovery-0.004
 oak-0.004
Token,
Token ID11
Feature activation+4.64e-04

Pos logit contributions
 aware+0.005
 bravery+0.005
 understanding+0.005
 decisions+0.005
 discovery+0.005
Neg logit contributions
rant-0.008
mer-0.006
le-0.006
sc-0.005
ifying-0.005
Token "
Token ID366
Feature activation+9.56e-03

Pos logit contributions
rant+0.001
mer+0.001
le+0.001
sc+0.001
te+0.001
Neg logit contributions
 aware-0.001
 decisions-0.000
 bravery-0.000
 oak-0.000
 understanding-0.000
TokenI
Token ID40
Feature activation+1.36e-03

Pos logit contributions
rant+0.020
le+0.015
mer+0.015
te+0.013
led+0.012
Neg logit contributions
 aware-0.015
 oak-0.014
 property-0.014
 decisions-0.014
unch-0.014
Token
Token ID198
Feature activation-3.57e-03

Pos logit contributions
rant+0.001
mer+0.001
le+0.001
sc+0.001
te+0.001
Neg logit contributions
 aware-0.001
 decisions-0.001
 bravery-0.001
 oak-0.001
 office-0.001
Token
Token ID198
Feature activation-8.48e-03

Pos logit contributions
 aware+0.005
 discovery+0.004
 decisions+0.004
 understanding+0.004
 oak+0.004
Neg logit contributions
rant-0.008
mer-0.007
le-0.006
led-0.006
sc-0.006
TokenJack
Token ID14295
Feature activation-2.16e-03

Pos logit contributions
 aware+0.013
 understanding+0.012
 oak+0.012
 decisions+0.012
 discovery+0.012
Neg logit contributions
rant-0.018
mer-0.014
le-0.013
sc-0.012
led-0.011
Token asked
Token ID1965
Feature activation-5.86e-03

Pos logit contributions
 aware+0.004
 oak+0.003
 understanding+0.003
 bravery+0.003
 discovery+0.003
Neg logit contributions
rant-0.004
mer-0.003
sc-0.003
ifying-0.003
le-0.003
Token the
Token ID262
Feature activation+9.81e-03

Pos logit contributions
 aware+0.011
 decisions+0.010
 oak+0.010
 bravery+0.010
 property+0.010
Neg logit contributions
rant-0.010
mer-0.008
le-0.007
sc-0.006
te-0.006
Token mer
Token ID4017
Feature activation+4.71e-02

Pos logit contributions
rant+0.017
mer+0.011
wy+0.010
ifying+0.010
le+0.010
Neg logit contributions
 discovery-0.019
 understanding-0.019
 aware-0.018
 bravery-0.018
 oak-0.018
Tokenmaid
Token ID23151
Feature activation+9.56e-03

Pos logit contributions
rant+0.082
sc+0.062
ifying+0.057
mer+0.057
izz+0.055
Neg logit contributions
 Majesty-0.082
 discovery-0.078
 aware-0.077
 decisions-0.073
 decision-0.072
Token why
Token ID1521
Feature activation-2.84e-03

Pos logit contributions
rant+0.022
mer+0.018
le+0.017
sc+0.015
led+0.015
Neg logit contributions
 aware-0.013
 decisions-0.012
 bravery-0.012
 oak-0.012
 discovery-0.012
Token the
Token ID262
Feature activation+5.23e-03

Pos logit contributions
 aware+0.004
 bravery+0.004
 oak+0.004
 decisions+0.004
 understanding+0.003
Neg logit contributions
rant-0.007
mer-0.005
le-0.005
sc-0.005
ifying-0.004
Token waves
Token ID9813
Feature activation+1.87e-02

Pos logit contributions
rant+0.008
mer+0.005
le+0.005
sc+0.004
izz+0.004
Neg logit contributions
 aware-0.011
 discovery-0.011
 bravery-0.011
 oak-0.011
joy-0.011
Token were
Token ID547
Feature activation-4.15e-03

Pos logit contributions
rant+0.033
mer+0.025
le+0.022
sc+0.020
te+0.019
Neg logit contributions
 aware-0.033
 bravery-0.032
 oak-0.032
 understanding-0.032
 decisions-0.032
Token the
Token ID262
Feature activation+4.91e-03

Pos logit contributions
rant+0.015
mer+0.012
le+0.011
sc+0.009
te+0.009
Neg logit contributions
 aware-0.015
 decisions-0.014
 oak-0.014
 bravery-0.014
 office-0.014
Token bird
Token ID6512
Feature activation+1.90e-03

Pos logit contributions
rant+0.010
mer+0.009
le+0.008
 Will+0.007
sc+0.007
Neg logit contributions
 aware-0.007
unch-0.007
 decisions-0.006
joy-0.006
 oak-0.006
Token.
Token ID13
Feature activation+3.38e-03

Pos logit contributions
rant+0.003
mer+0.003
le+0.002
sc+0.002
 Will+0.002
Neg logit contributions
 aware-0.004
 decisions-0.003
 bravery-0.003
 oak-0.003
unch-0.003
Token
Token ID198
Feature activation-1.14e-03

Pos logit contributions
rant+0.006
mer+0.005
le+0.005
 Will+0.004
sc+0.004
Neg logit contributions
 aware-0.006
 bravery-0.006
 decisions-0.006
 property-0.006
 discovery-0.005
Token
Token ID198
Feature activation-4.59e-03

Pos logit contributions
 aware+0.002
 decisions+0.002
 oak+0.001
 bravery+0.001
 understanding+0.001
Neg logit contributions
rant-0.003
mer-0.002
le-0.002
led-0.002
sc-0.002
TokenN
Token ID45
Feature activation+4.70e-02

Pos logit contributions
 aware+0.007
 understanding+0.006
 decisions+0.006
 oak+0.006
unch+0.006
Neg logit contributions
rant-0.010
mer-0.008
le-0.007
led-0.006
te-0.006
Tokenora
Token ID5799
Feature activation-1.38e-03

Pos logit contributions
rant+0.091
mer+0.070
le+0.063
sc+0.061
 Will+0.058
Neg logit contributions
 aware-0.081
 decisions-0.077
 discovery-0.073
 property-0.072
 oak-0.072
Token followed
Token ID3940
Feature activation+1.16e-02

Pos logit contributions
 aware+0.002
 bravery+0.002
 oak+0.002
 decisions+0.002
 office+0.002
Neg logit contributions
rant-0.003
mer-0.002
le-0.002
sc-0.002
ifying-0.002
Token the
Token ID262
Feature activation+6.16e-03

Pos logit contributions
rant+0.021
mer+0.017
le+0.015
sc+0.013
te+0.013
Neg logit contributions
 aware-0.021
 decisions-0.020
 bravery-0.020
 oak-0.020
 office-0.019
Token bird
Token ID6512
Feature activation-2.41e-03

Pos logit contributions
rant+0.012
mer+0.011
le+0.010
 Will+0.009
te+0.009
Neg logit contributions
 aware-0.009
unch-0.008
 decisions-0.008
joy-0.008
 impact-0.008
Token all
Token ID477
Feature activation+1.58e-02

Pos logit contributions
unch+0.005
 decisions+0.005
joy+0.005
 aware+0.004
 impact+0.004
Neg logit contributions
rant-0.004
mer-0.003
le-0.003
sc-0.002
y-0.002
Tokeny
Token ID88
Feature activation+5.01e-04

Pos logit contributions
rant+0.026
mer+0.015
sc+0.015
le+0.014
bly+0.014
Neg logit contributions
 aware-0.034
ounce-0.034
 oak-0.032
unch-0.032
 discovery-0.032
Token met
Token ID1138
Feature activation+3.23e-03

Pos logit contributions
rant+0.001
mer+0.001
le+0.001
sc+0.001
ifying+0.001
Neg logit contributions
 aware-0.001
 oak-0.001
 office-0.001
 decisions-0.001
 bravery-0.001
Token a
Token ID257
Feature activation+1.52e-02

Pos logit contributions
rant+0.006
mer+0.005
le+0.005
sc+0.004
te+0.004
Neg logit contributions
 aware-0.005
 decisions-0.005
 oak-0.005
 bravery-0.005
 office-0.005
Token fish
Token ID5916
Feature activation+1.01e-02

Pos logit contributions
rant+0.030
mer+0.026
le+0.023
 Will+0.021
sc+0.020
Neg logit contributions
 aware-0.023
 decisions-0.023
 property-0.022
 bravery-0.022
 office-0.022
Token named
Token ID3706
Feature activation+6.39e-04

Pos logit contributions
rant+0.022
mer+0.018
le+0.016
sc+0.015
led+0.014
Neg logit contributions
 aware-0.015
 oak-0.014
 office-0.013
 decisions-0.013
unch-0.013
Token Bub
Token ID19645
Feature activation+4.68e-02

Pos logit contributions
rant+0.001
mer+0.001
le+0.001
sc+0.001
led+0.001
Neg logit contributions
 aware-0.001
 decisions-0.001
 oak-0.001
 bravery-0.001
 discovery-0.001
Tokenbles
Token ID7689
Feature activation+5.44e-04

Pos logit contributions
rant+0.111
mer+0.086
le+0.082
te+0.080
sc+0.079
Neg logit contributions
 aware-0.057
 discovery-0.054
unch-0.052
ounce-0.052
 property-0.051
Token.
Token ID13
Feature activation+1.21e-02

Pos logit contributions
rant+0.001
mer+0.001
le+0.001
sc+0.000
te+0.000
Neg logit contributions
 aware-0.001
 decisions-0.001
 bravery-0.001
 oak-0.001
 office-0.001
Token Bub
Token ID19645
Feature activation+3.26e-02

Pos logit contributions
rant+0.028
mer+0.023
le+0.022
te+0.020
led+0.019
Neg logit contributions
 aware-0.015
 oak-0.015
 discovery-0.014
unch-0.014
 decisions-0.014
Tokenbles
Token ID7689
Feature activation-6.83e-03

Pos logit contributions
rant+0.078
mer+0.061
le+0.058
sc+0.054
te+0.054
Neg logit contributions
 aware-0.040
 discovery-0.039
 decisions-0.037
unch-0.037
 property-0.037
Token asked
Token ID1965
Feature activation+9.03e-04

Pos logit contributions
 aware+0.014
 decisions+0.013
 bravery+0.013
 oak+0.013
 office+0.013
Neg logit contributions
rant-0.011
mer-0.008
le-0.007
sc-0.006
te-0.006
Token Nora
Token ID32561
Feature activation+6.18e-03

Pos logit contributions
rant+0.022
mer+0.018
le+0.015
sc+0.014
led+0.014
Neg logit contributions
 aware-0.017
 bravery-0.017
 decisions-0.017
 understanding-0.017
 oak-0.017
Token spin
Token ID7906
Feature activation+3.74e-03

Pos logit contributions
rant+0.010
mer+0.008
le+0.008
sc+0.006
 Will+0.006
Neg logit contributions
 aware-0.012
 decisions-0.011
 property-0.011
joy-0.011
 oak-0.011
Token.
Token ID13
Feature activation+5.78e-03

Pos logit contributions
rant+0.007
mer+0.005
izz+0.005
le+0.004
ola+0.004
Neg logit contributions
 bike-0.006
 aware-0.006
 oak-0.006
 decisions-0.006
 leash-0.006
Token
Token ID198
Feature activation+1.11e-03

Pos logit contributions
rant+0.011
mer+0.009
le+0.008
 Will+0.007
sc+0.007
Neg logit contributions
 aware-0.010
 decisions-0.009
 bravery-0.009
 property-0.009
 office-0.009
Token
Token ID198
Feature activation-7.77e-04

Pos logit contributions
rant+0.003
mer+0.002
le+0.002
led+0.002
ifying+0.002
Neg logit contributions
 aware-0.001
unch-0.001
 discovery-0.001
 understanding-0.001
 oak-0.001
TokenN
Token ID45
Feature activation+4.68e-02

Pos logit contributions
 aware+0.001
unch+0.001
 understanding+0.001
 decisions+0.001
 oak+0.001
Neg logit contributions
rant-0.002
mer-0.001
le-0.001
te-0.001
 Will-0.001
Tokenora
Token ID5799
Feature activation-6.92e-03

Pos logit contributions
rant+0.100
mer+0.078
sc+0.073
ifying+0.072
 lit+0.068
Neg logit contributions
 aware-0.071
 decisions-0.066
 discovery-0.061
 property-0.061
joy-0.061
Token finally
Token ID3443
Feature activation+3.97e-03

Pos logit contributions
 aware+0.013
 decisions+0.013
 bravery+0.013
 oak+0.013
 office+0.013
Neg logit contributions
rant-0.012
mer-0.009
le-0.008
sc-0.007
ifying-0.006
Token stopped
Token ID5025
Feature activation-3.87e-03

Pos logit contributions
rant+0.007
mer+0.006
le+0.005
sc+0.005
 Will+0.004
Neg logit contributions
 aware-0.007
 oak-0.007
 bravery-0.007
 decisions-0.007
 understanding-0.006
Token spinning
Token ID19493
Feature activation-4.53e-03

Pos logit contributions
 decisions+0.006
 aware+0.006
 property+0.006
 bravery+0.006
unch+0.006
Neg logit contributions
rant-0.007
mer-0.006
le-0.006
 Will-0.005
te-0.005
Token and
Token ID290
Feature activation+2.95e-03

Pos logit contributions
 aware+0.007
 oak+0.007
 bravery+0.007
 decisions+0.007
 office+0.007
Neg logit contributions
rant-0.009
mer-0.007
le-0.006
sc-0.006
izz-0.006
Token.
Token ID13
Feature activation+3.72e-03

Pos logit contributions
 aware+0.006
 decisions+0.006
 bravery+0.006
 impact+0.006
 oak+0.006
Neg logit contributions
rant-0.005
mer-0.004
le-0.004
sc-0.003
led-0.003
Token She
Token ID1375
Feature activation-5.07e-03

Pos logit contributions
rant+0.008
mer+0.007
le+0.006
sc+0.005
led+0.005
Neg logit contributions
 aware-0.005
 decisions-0.005
 oak-0.005
 office-0.005
 bravery-0.005
Token gave
Token ID2921
Feature activation+3.84e-03

Pos logit contributions
 aware+0.009
 office+0.008
 oak+0.008
 bravery+0.008
 understanding+0.008
Neg logit contributions
rant-0.009
mer-0.007
le-0.007
sc-0.006
ifying-0.006
Token it
Token ID340
Feature activation-8.60e-03

Pos logit contributions
rant+0.008
mer+0.007
le+0.005
sc+0.005
ifying+0.005
Neg logit contributions
 discovery-0.006
 aware-0.006
 decisions-0.006
 oak-0.006
joy-0.006
Token to
Token ID284
Feature activation+9.80e-03

Pos logit contributions
 aware+0.016
unch+0.015
 decisions+0.015
 impact+0.015
 bravery+0.015
Neg logit contributions
rant-0.015
mer-0.011
le-0.010
sc-0.009
te-0.009
Token N
Token ID399
Feature activation+4.63e-02

Pos logit contributions
rant+0.019
mer+0.016
le+0.014
sc+0.013
led+0.013
Neg logit contributions
 aware-0.016
 bravery-0.015
 understanding-0.015
 office-0.015
 discovery-0.015
Tokeniki
Token ID5580
Feature activation+4.03e-03

Pos logit contributions
rant+0.105
mer+0.087
le+0.080
sc+0.074
led+0.073
Neg logit contributions
 aware-0.062
 decisions-0.059
 oak-0.057
unch-0.057
 office-0.057
Token without
Token ID1231
Feature activation-1.14e-02

Pos logit contributions
rant+0.006
mer+0.005
le+0.005
sc+0.004
te+0.004
Neg logit contributions
 aware-0.008
 decisions-0.008
 bravery-0.008
unch-0.007
 property-0.007
Token any
Token ID597
Feature activation-5.93e-03

Pos logit contributions
 aware+0.013
 oak+0.012
 decisions+0.012
unch+0.012
 office+0.012
Neg logit contributions
rant-0.028
mer-0.022
le-0.022
 Will-0.020
led-0.019
Token hesitation
Token ID29592
Feature activation+6.74e-03

Pos logit contributions
 aware+0.007
unch+0.006
 oak+0.006
ouncing+0.006
shadow+0.006
Neg logit contributions
rant-0.014
mer-0.012
le-0.012
sc-0.010
led-0.010
Token.
Token ID13
Feature activation+2.99e-03

Pos logit contributions
rant+0.013
mer+0.010
le+0.009
sc+0.008
led+0.008
Neg logit contributions
 aware-0.012
 decisions-0.011
 understanding-0.011
 bravery-0.011
 oak-0.011
Token made
Token ID925
Feature activation+2.23e-03

Pos logit contributions
rant+0.021
mer+0.016
le+0.014
sc+0.012
auc+0.011
Neg logit contributions
 aware-0.027
 bravery-0.026
 oak-0.026
 understanding-0.026
 decisions-0.025
Token.
Token ID13
Feature activation-4.72e-03

Pos logit contributions
rant+0.004
le+0.003
mer+0.003
led+0.002
 Will+0.002
Neg logit contributions
 aware-0.004
 oak-0.004
 property-0.004
 bravery-0.004
unch-0.004
Token
Token ID220
Feature activation-3.19e-03

Pos logit contributions
 aware+0.008
 decisions+0.008
 property+0.008
 bravery+0.008
 office+0.007
Neg logit contributions
rant-0.009
mer-0.007
 Will-0.007
le-0.006
ola-0.006
Token
Token ID198
Feature activation-8.94e-03

Pos logit contributions
 aware+0.006
 decisions+0.006
 bravery+0.006
 oak+0.006
 property+0.006
Neg logit contributions
rant-0.005
mer-0.004
le-0.004
izz-0.003
 Will-0.003
TokenThe
Token ID464
Feature activation+3.45e-03

Pos logit contributions
 aware+0.014
 decisions+0.014
 bravery+0.013
 oak+0.013
 property+0.013
Neg logit contributions
rant-0.018
mer-0.015
le-0.014
sc-0.012
te-0.012
Token mer
Token ID4017
Feature activation+4.63e-02

Pos logit contributions
rant+0.006
mer+0.005
le+0.005
te+0.004
 Will+0.004
Neg logit contributions
 aware-0.006
unch-0.006
 decisions-0.006
 property-0.006
shadow-0.006
Tokenmaid
Token ID23151
Feature activation+4.42e-03

Pos logit contributions
rant+0.077
sc+0.051
mer+0.049
ifying+0.049
izz+0.046
Neg logit contributions
 aware-0.087
 Majesty-0.087
 discovery-0.085
 decisions-0.083
 decision-0.082
Token learnt
Token ID26338
Feature activation-2.76e-03

Pos logit contributions
rant+0.008
mer+0.006
le+0.005
sc+0.005
te+0.004
Neg logit contributions
 aware-0.008
 bravery-0.008
 oak-0.008
 understanding-0.008
 decisions-0.008
Token that
Token ID326
Feature activation-1.40e-02

Pos logit contributions
 aware+0.004
 decisions+0.004
 oak+0.004
 bravery+0.004
 office+0.004
Neg logit contributions
rant-0.006
mer-0.005
le-0.004
sc-0.004
te-0.004
Token taking
Token ID2263
Feature activation-5.73e-03

Pos logit contributions
 aware+0.024
unch+0.024
 Majesty+0.024
 oak+0.024
 tal+0.024
Neg logit contributions
rant-0.022
mer-0.018
le-0.017
 Will-0.016
sc-0.015
Token risks
Token ID7476
Feature activation-5.30e-03

Pos logit contributions
 aware+0.009
 oak+0.008
unch+0.008
 decisions+0.008
 office+0.008
Neg logit contributions
rant-0.012
mer-0.010
le-0.009
sc-0.008
led-0.008
Token so
Token ID523
Feature activation-7.57e-03

Pos logit contributions
rant+0.004
mer+0.003
le+0.003
sc+0.003
led+0.003
Neg logit contributions
 aware-0.003
 understanding-0.003
 decisions-0.003
 bravery-0.003
 office-0.003
Token excited
Token ID6568
Feature activation+2.08e-03

Pos logit contributions
 aware+0.008
 understanding+0.007
 bravery+0.006
 decisions+0.006
 discovery+0.006
Neg logit contributions
rant-0.021
mer-0.018
le-0.017
sc-0.016
ifying-0.015
Token and
Token ID290
Feature activation+8.14e-03

Pos logit contributions
rant+0.004
mer+0.003
le+0.003
sc+0.003
led+0.003
Neg logit contributions
 aware-0.003
 decisions-0.003
 oak-0.003
 bravery-0.003
 office-0.003
Token when
Token ID618
Feature activation+1.05e-02

Pos logit contributions
rant+0.016
mer+0.013
le+0.012
sc+0.010
te+0.010
Neg logit contributions
 aware-0.014
 decisions-0.013
 bravery-0.013
 oak-0.013
 understanding-0.013
Token the
Token ID262
Feature activation+1.65e-02

Pos logit contributions
rant+0.021
mer+0.017
le+0.016
 Will+0.014
sc+0.014
Neg logit contributions
 aware-0.017
 decisions-0.016
 bravery-0.016
 oak-0.016
 office-0.016
Token mer
Token ID4017
Feature activation+4.59e-02

Pos logit contributions
rant+0.037
mer+0.030
le+0.028
sc+0.025
led+0.025
Neg logit contributions
 aware-0.023
 decisions-0.022
 oak-0.022
 bravery-0.022
 discovery-0.021
Tokenmaid
Token ID23151
Feature activation+9.86e-03

Pos logit contributions
rant+0.079
sc+0.058
wy+0.055
ifying+0.055
izz+0.053
Neg logit contributions
 Majesty-0.082
 discovery-0.078
 aware-0.077
 discoveries-0.073
 decisions-0.071
Token saw
Token ID2497
Feature activation+7.01e-03

Pos logit contributions
rant+0.019
mer+0.014
le+0.013
bly+0.012
sc+0.012
Neg logit contributions
 aware-0.016
 oak-0.016
 bravery-0.015
 discovery-0.015
 office-0.015
Token him
Token ID683
Feature activation-1.22e-03

Pos logit contributions
rant+0.014
mer+0.012
le+0.011
sc+0.010
te+0.010
Neg logit contributions
 aware-0.011
 bravery-0.010
 decisions-0.010
 discovery-0.010
 oak-0.010
Token she
Token ID673
Feature activation+5.64e-03

Pos logit contributions
 aware+0.002
 decisions+0.002
 oak+0.002
 property+0.002
 office+0.002
Neg logit contributions
rant-0.002
mer-0.002
le-0.001
 Will-0.001
te-0.001
Token gave
Token ID2921
Feature activation-5.28e-03

Pos logit contributions
rant+0.010
mer+0.007
le+0.006
sc+0.006
auc+0.006
Neg logit contributions
 aware-0.011
 bravery-0.010
 understanding-0.010
 discovery-0.010
 oak-0.010
Token again
Token ID757
Feature activation-3.72e-03

Pos logit contributions
 aware+0.010
 decisions+0.010
 bravery+0.010
 oak+0.010
 office+0.010
Neg logit contributions
rant-0.012
mer-0.010
le-0.009
sc-0.008
led-0.008
Token.
Token ID13
Feature activation-6.17e-03

Pos logit contributions
 aware+0.008
 decisions+0.008
 oak+0.007
 bravery+0.007
 office+0.007
Neg logit contributions
rant-0.005
mer-0.004
le-0.004
te-0.003
 Will-0.003
Token He
Token ID679
Feature activation-5.88e-03

Pos logit contributions
 aware+0.012
 discovery+0.011
 property+0.011
 decisions+0.011
 oak+0.011
Neg logit contributions
rant-0.010
mer-0.008
 Will-0.007
le-0.007
sc-0.007
Token said
Token ID531
Feature activation-3.01e-03

Pos logit contributions
 aware+0.011
 bravery+0.010
 understanding+0.010
 oak+0.010
 decisions+0.010
Neg logit contributions
rant-0.011
mer-0.008
le-0.007
sc-0.007
ifying-0.006
Token the
Token ID262
Feature activation+3.89e-03

Pos logit contributions
 aware+0.005
 decisions+0.004
 oak+0.004
 bravery+0.004
 office+0.004
Neg logit contributions
rant-0.006
mer-0.005
le-0.005
te-0.004
sc-0.004
Token mer
Token ID4017
Feature activation+4.59e-02

Pos logit contributions
rant+0.010
mer+0.008
le+0.008
sc+0.007
te+0.007
Neg logit contributions
 aware-0.004
 decisions-0.004
 bravery-0.004
 oak-0.004
 office-0.004
Tokenmaid
Token ID23151
Feature activation+3.25e-03

Pos logit contributions
rant+0.078
sc+0.056
ifying+0.054
mer+0.053
izz+0.052
Neg logit contributions
 Majesty-0.082
 discovery-0.081
 aware-0.079
 decisions-0.077
 decision-0.077
Token and
Token ID290
Feature activation-7.21e-03

Pos logit contributions
rant+0.005
mer+0.004
le+0.004
sc+0.003
izz+0.003
Neg logit contributions
 aware-0.006
 bravery-0.006
 decisions-0.006
 oak-0.006
 understanding-0.006
Token a
Token ID257
Feature activation+4.04e-03

Pos logit contributions
 aware+0.013
 decisions+0.013
unch+0.013
 property+0.012
 impact+0.012
Neg logit contributions
rant-0.012
mer-0.010
le-0.009
 Will-0.008
te-0.008
Token dolphin
Token ID44136
Feature activation-9.74e-04

Pos logit contributions
rant+0.008
mer+0.007
le+0.006
 Will+0.006
te+0.006
Neg logit contributions
 aware-0.006
 decisions-0.006
 property-0.006
unch-0.006
 oak-0.005
Token were
Token ID547
Feature activation-1.07e-03

Pos logit contributions
 aware+0.002
 decisions+0.002
 bravery+0.002
 oak+0.002
 office+0.002
Neg logit contributions
rant-0.002
mer-0.001
le-0.001
sc-0.001
te-0.001
Token castle
Token ID16669
Feature activation+4.06e-03

Pos logit contributions
rant+0.019
mer+0.016
le+0.015
sc+0.014
te+0.014
Neg logit contributions
 aware-0.010
 decisions-0.010
 bravery-0.010
 oak-0.010
 discovery-0.009
Token so
Token ID523
Feature activation+7.77e-04

Pos logit contributions
rant+0.007
mer+0.006
le+0.005
sc+0.004
 Will+0.004
Neg logit contributions
 aware-0.007
 decisions-0.007
 oak-0.007
 bravery-0.007
 office-0.007
Token much
Token ID881
Feature activation-4.79e-03

Pos logit contributions
rant+0.002
mer+0.002
sc+0.002
le+0.001
can+0.001
Neg logit contributions
 aware-0.001
 understanding-0.001
 bravery-0.001
 oak-0.001
 discovery-0.001
Token.
Token ID13
Feature activation-9.36e-05

Pos logit contributions
 bravery+0.006
 aware+0.006
 discovery+0.006
 decisions+0.006
 oak+0.006
Neg logit contributions
rant-0.011
mer-0.009
le-0.008
te-0.008
izz-0.008
Token The
Token ID383
Feature activation+9.87e-03

Pos logit contributions
 aware+0.000
 oak+0.000
 decisions+0.000
 understanding+0.000
 office+0.000
Neg logit contributions
rant-0.000
mer-0.000
le-0.000
sc-0.000
led-0.000
Token mer
Token ID4017
Feature activation+4.57e-02

Pos logit contributions
rant+0.016
mer+0.011
sc+0.009
le+0.009
ifying+0.009
Neg logit contributions
 discovery-0.020
 understanding-0.020
 bravery-0.020
 aware-0.020
 discoveries-0.019
Tokenmaid
Token ID23151
Feature activation+8.51e-03

Pos logit contributions
rant+0.075
sc+0.057
mer+0.054
ifying+0.052
izz+0.049
Neg logit contributions
 Majesty-0.083
 discovery-0.081
 aware-0.080
 decisions-0.076
 bravery-0.075
Token said
Token ID531
Feature activation-1.97e-03

Pos logit contributions
rant+0.018
mer+0.015
le+0.013
sc+0.013
ifying+0.012
Neg logit contributions
 aware-0.013
 bravery-0.012
 understanding-0.012
 discovery-0.011
 oak-0.011
Token,
Token ID11
Feature activation+1.83e-04

Pos logit contributions
 aware+0.003
 bravery+0.003
 decisions+0.003
 understanding+0.003
 oak+0.002
Neg logit contributions
rant-0.004
mer-0.004
le-0.003
sc-0.003
ifying-0.003
Token "
Token ID366
Feature activation+3.94e-03

Pos logit contributions
rant+0.000
mer+0.000
le+0.000
sc+0.000
 Will+0.000
Neg logit contributions
 aware-0.000
 decisions-0.000
 bravery-0.000
 oak-0.000
 office-0.000
TokenI
Token ID40
Feature activation-1.26e-03

Pos logit contributions
rant+0.008
mer+0.006
le+0.006
led+0.005
sc+0.005
Neg logit contributions
 aware-0.006
 decisions-0.006
 oak-0.006
 understanding-0.006
unch-0.006
Token more
Token ID517
Feature activation+2.80e-02

Pos logit contributions
rant+0.067
mer+0.054
te+0.048
le+0.048
led+0.047
Neg logit contributions
 leash-0.039
 aware-0.038
 decisions-0.038
 bravery-0.038
 oak-0.038
Token?"
Token ID1701
Feature activation-3.10e-03

Pos logit contributions
rant+0.059
mer+0.046
led+0.040
le+0.040
te+0.039
Neg logit contributions
 aware-0.042
 leash-0.041
 discovery-0.040
 decisions-0.040
 office-0.039
Token
Token ID198
Feature activation-4.70e-03

Pos logit contributions
 aware+0.005
 decisions+0.005
 bravery+0.005
 oak+0.004
 office+0.004
Neg logit contributions
rant-0.006
mer-0.005
le-0.005
sc-0.004
 Will-0.004
Token
Token ID198
Feature activation-8.84e-03

Pos logit contributions
 aware+0.007
 decisions+0.007
 oak+0.006
 bravery+0.006
 office+0.006
Neg logit contributions
rant-0.010
mer-0.008
le-0.008
sc-0.007
te-0.007
TokenO
Token ID46
Feature activation+1.57e-02

Pos logit contributions
 aware+0.016
 decisions+0.016
 bravery+0.016
 oak+0.015
 office+0.015
Neg logit contributions
rant-0.015
mer-0.012
le-0.011
sc-0.009
te-0.009
Tokenll
Token ID297
Feature activation+4.55e-02

Pos logit contributions
rant+0.031
mer+0.025
le+0.023
led+0.020
te+0.020
Neg logit contributions
 aware-0.025
 decisions-0.025
 bravery-0.024
 oak-0.024
 office-0.024
Tokenie
Token ID494
Feature activation+7.43e-04

Pos logit contributions
rant+0.071
mer+0.060
sc+0.048
le+0.044
ifying+0.042
Neg logit contributions
 aware-0.090
 oak-0.086
 discovery-0.084
 decisions-0.084
 understanding-0.083
Token said
Token ID531
Feature activation-2.90e-03

Pos logit contributions
rant+0.002
mer+0.001
le+0.001
sc+0.001
ifying+0.001
Neg logit contributions
 aware-0.001
 bravery-0.001
 understanding-0.001
 decisions-0.001
 oak-0.001
Token,
Token ID11
Feature activation+4.12e-03

Pos logit contributions
 aware+0.004
 bravery+0.004
 decisions+0.004
 understanding+0.004
 oak+0.004
Neg logit contributions
rant-0.006
mer-0.005
le-0.005
sc-0.004
led-0.004
Token "
Token ID366
Feature activation+9.94e-03

Pos logit contributions
rant+0.009
mer+0.008
le+0.007
 Will+0.007
te+0.007
Neg logit contributions
 aware-0.005
 decisions-0.005
 property-0.005
 bravery-0.005
 oak-0.005
TokenYes
Token ID5297
Feature activation+5.61e-03

Pos logit contributions
rant+0.021
mer+0.016
le+0.015
led+0.013
sc+0.013
Neg logit contributions
 aware-0.016
 decisions-0.015
 oak-0.015
 bravery-0.014
 understanding-0.014
Token flowers
Token ID12734
Feature activation-6.82e-03

Pos logit contributions
 aware+0.038
 decisions+0.036
 bravery+0.035
 oak+0.035
unch+0.035
Neg logit contributions
rant-0.061
mer-0.050
le-0.047
sc-0.043
 Will-0.042
Token.
Token ID13
Feature activation+5.95e-03

Pos logit contributions
 aware+0.011
 oak+0.011
 office+0.011
 decisions+0.010
 climbing+0.010
Neg logit contributions
rant-0.013
mer-0.010
le-0.010
led-0.008
izz-0.008
Token Anna
Token ID11735
Feature activation-2.70e-03

Pos logit contributions
rant+0.014
mer+0.012
le+0.011
led+0.011
ka+0.010
Neg logit contributions
 aware-0.007
 decisions-0.006
 understanding-0.006
 office-0.006
unch-0.006
Token is
Token ID318
Feature activation-2.08e-03

Pos logit contributions
 aware+0.004
 office+0.004
 oak+0.004
 decisions+0.004
 bravery+0.004
Neg logit contributions
rant-0.005
mer-0.004
le-0.004
ifying-0.004
bly-0.004
Token good
Token ID922
Feature activation-1.03e-02

Pos logit contributions
 aware+0.003
 decisions+0.003
 bravery+0.003
 office+0.003
 oak+0.003
Neg logit contributions
rant-0.004
mer-0.004
le-0.003
sc-0.003
led-0.003
Token at
Token ID379
Feature activation-4.53e-02

Pos logit contributions
 aware+0.016
 decisions+0.015
 bravery+0.015
 office+0.015
 oak+0.015
Neg logit contributions
rant-0.022
mer-0.018
le-0.016
sc-0.014
led-0.014
Token puzzles
Token ID24367
Feature activation-9.12e-03

Pos logit contributions
 aware+0.069
 decisions+0.065
unch+0.063
 property+0.063
 tal+0.062
Neg logit contributions
rant-0.092
mer-0.077
le-0.068
 Will-0.067
sc-0.064
Token.
Token ID13
Feature activation+7.19e-03

Pos logit contributions
 aware+0.016
 office+0.015
 decisions+0.015
 oak+0.015
 bravery+0.015
Neg logit contributions
rant-0.017
mer-0.013
le-0.012
 Will-0.011
izz-0.011
Token She
Token ID1375
Feature activation-1.07e-02

Pos logit contributions
rant+0.016
mer+0.013
le+0.012
sc+0.011
led+0.011
Neg logit contributions
 aware-0.010
 decisions-0.009
 oak-0.009
 office-0.009
 bravery-0.009
Token can
Token ID460
Feature activation+2.55e-03

Pos logit contributions
 aware+0.018
 decisions+0.017
 bravery+0.017
 oak+0.017
 office+0.016
Neg logit contributions
rant-0.021
mer-0.017
le-0.015
sc-0.013
led-0.013
Token finish
Token ID5461
Feature activation-8.16e-03

Pos logit contributions
rant+0.005
mer+0.004
le+0.004
sc+0.003
te+0.003
Neg logit contributions
 aware-0.004
 decisions-0.004
 bravery-0.004
 oak-0.004
 office-0.004
Token place
Token ID1295
Feature activation-1.28e-02

Pos logit contributions
rant+0.006
mer+0.005
le+0.004
sc+0.004
led+0.004
Neg logit contributions
 aware-0.005
 decisions-0.005
 bravery-0.005
 discovery-0.005
 oak-0.005
Token!
Token ID0
Feature activation-3.97e-03

Pos logit contributions
 aware+0.021
 decisions+0.021
 bravery+0.020
 discovery+0.020
 understanding+0.020
Neg logit contributions
rant-0.025
mer-0.020
le-0.018
sc-0.016
led-0.015
Token
Token ID198
Feature activation-6.86e-03

Pos logit contributions
 aware+0.005
 decisions+0.005
 bravery+0.005
 oak+0.005
 office+0.005
Neg logit contributions
rant-0.009
mer-0.007
le-0.007
led-0.006
sc-0.006
Token
Token ID198
Feature activation-1.08e-02

Pos logit contributions
 aware+0.010
 decisions+0.009
 bravery+0.009
 oak+0.009
 understanding+0.009
Neg logit contributions
rant-0.015
mer-0.013
le-0.012
sc-0.010
led-0.010
TokenThe
Token ID464
Feature activation-5.19e-03

Pos logit contributions
 aware+0.017
 decisions+0.016
 understanding+0.015
 oak+0.015
 bravery+0.015
Neg logit contributions
rant-0.022
mer-0.018
le-0.017
led-0.015
sc-0.015
Token chip
Token ID11594
Feature activation-4.47e-02

Pos logit contributions
 aware+0.008
 decisions+0.008
 property+0.008
unch+0.008
 bravery+0.008
Neg logit contributions
rant-0.010
mer-0.009
le-0.008
 Will-0.007
sc-0.007
Tokenmun
Token ID6199
Feature activation+9.32e-03

Pos logit contributions
 aware+0.091
 decisions+0.088
 bravery+0.087
 oak+0.086
 office+0.085
Neg logit contributions
rant-0.069
mer-0.054
le-0.048
sc-0.039
led-0.038
Tokenks
Token ID591
Feature activation+9.92e-03

Pos logit contributions
rant+0.021
mer+0.017
le+0.015
sc+0.015
ifying+0.015
Neg logit contributions
 aware-0.012
 decisions-0.011
 understanding-0.011
 bravery-0.010
unch-0.010
Token didn
Token ID1422
Feature activation+1.47e-02

Pos logit contributions
rant+0.017
mer+0.013
le+0.011
sc+0.010
ola+0.010
Neg logit contributions
 aware-0.018
 bravery-0.017
 understanding-0.017
 decisions-0.016
 discovery-0.016
Token't
Token ID470
Feature activation+1.76e-03

Pos logit contributions
rant+0.031
mer+0.028
le+0.023
led+0.023
ka+0.021
Neg logit contributions
 decisions-0.021
 bravery-0.021
 climbing-0.020
unch-0.020
 aware-0.020
Token know
Token ID760
Feature activation-1.27e-02

Pos logit contributions
rant+0.004
mer+0.003
le+0.003
 Will+0.003
te+0.003
Neg logit contributions
 aware-0.002
 decisions-0.002
 office-0.002
 oak-0.002
 bravery-0.002
Token Lily
Token ID20037
Feature activation-6.52e-03

Pos logit contributions
rant+0.007
mer+0.006
le+0.005
led+0.005
sc+0.004
Neg logit contributions
 aware-0.004
 decisions-0.004
 oak-0.004
 office-0.004
unch-0.004
Token and
Token ID290
Feature activation-7.82e-03

Pos logit contributions
 aware+0.010
 office+0.010
 decisions+0.010
 oak+0.010
 bravery+0.010
Neg logit contributions
rant-0.013
mer-0.011
le-0.010
ifying-0.009
bly-0.009
Token Ben
Token ID3932
Feature activation-4.99e-03

Pos logit contributions
 aware+0.013
 decisions+0.012
 bravery+0.012
 oak+0.012
 office+0.012
Neg logit contributions
rant-0.015
mer-0.012
le-0.011
sc-0.010
 Will-0.010
Token are
Token ID389
Feature activation-2.87e-03

Pos logit contributions
 aware+0.008
 decisions+0.008
 oak+0.007
 bravery+0.007
 office+0.007
Neg logit contributions
rant-0.010
mer-0.008
le-0.007
sc-0.007
led-0.006
Token good
Token ID922
Feature activation-1.12e-02

Pos logit contributions
 aware+0.003
 understanding+0.003
 office+0.003
 bravery+0.003
 decisions+0.003
Neg logit contributions
rant-0.007
mer-0.006
le-0.006
sc-0.005
led-0.005
Token at
Token ID379
Feature activation-4.22e-02

Pos logit contributions
 aware+0.009
 decisions+0.008
 bravery+0.007
 oak+0.007
 office+0.007
Neg logit contributions
rant-0.032
mer-0.028
le-0.026
sc-0.024
te-0.024
Token puzzles
Token ID24367
Feature activation-7.88e-03

Pos logit contributions
 aware+0.065
 decisions+0.062
unch+0.059
 property+0.059
 tal+0.059
Neg logit contributions
rant-0.084
mer-0.070
le-0.062
 Will-0.061
te-0.059
Token.
Token ID13
Feature activation+4.67e-03

Pos logit contributions
 aware+0.013
 office+0.013
 decisions+0.013
 bravery+0.012
 oak+0.012
Neg logit contributions
rant-0.015
mer-0.012
le-0.011
izz-0.010
 Will-0.010
Token They
Token ID1119
Feature activation-7.46e-03

Pos logit contributions
rant+0.011
mer+0.009
le+0.009
led+0.009
bly+0.008
Neg logit contributions
 aware-0.005
 decisions-0.005
 oak-0.005
 Copper-0.005
 office-0.005
Token can
Token ID460
Feature activation+4.69e-04

Pos logit contributions
 aware+0.012
 oak+0.011
 bravery+0.011
 decisions+0.011
 office+0.011
Neg logit contributions
rant-0.015
mer-0.013
le-0.012
led-0.010
sc-0.010
Token finish
Token ID5461
Feature activation-8.13e-03

Pos logit contributions
rant+0.001
mer+0.001
le+0.001
sc+0.001
led+0.001
Neg logit contributions
 aware-0.001
 decisions-0.001
 oak-0.001
 bravery-0.001
 office-0.001
Token!
Token ID0
Feature activation-1.32e-04

Pos logit contributions
 decisions+0.005
 office+0.005
 aware+0.005
 property+0.005
 climbing+0.004
Neg logit contributions
rant-0.008
mer-0.007
le-0.006
 Will-0.005
sc-0.005
Token You
Token ID921
Feature activation-1.23e-03

Pos logit contributions
 aware+0.000
 bravery+0.000
 decisions+0.000
 discovery+0.000
 oak+0.000
Neg logit contributions
rant-0.000
mer-0.000
le-0.000
 Will-0.000
te-0.000
Token were
Token ID547
Feature activation-2.92e-03

Pos logit contributions
 aware+0.002
 oak+0.002
 property+0.002
 decisions+0.002
 bravery+0.002
Neg logit contributions
rant-0.002
mer-0.002
le-0.002
sc-0.002
led-0.001
Token not
Token ID407
Feature activation-3.28e-03

Pos logit contributions
 aware+0.004
 decisions+0.004
 bravery+0.004
 oak+0.004
 office+0.004
Neg logit contributions
rant-0.006
mer-0.005
le-0.005
sc-0.004
te-0.004
Token good
Token ID922
Feature activation-1.30e-02

Pos logit contributions
 oak+0.005
 decisions+0.005
unch+0.005
 impact+0.005
 heights+0.005
Neg logit contributions
rant-0.006
mer-0.005
le-0.004
te-0.004
 Will-0.004
Token at
Token ID379
Feature activation-4.19e-02

Pos logit contributions
 aware+0.023
 oak+0.022
 bravery+0.021
 decisions+0.021
 office+0.021
Neg logit contributions
rant-0.022
mer-0.019
le-0.017
te-0.015
 Will-0.015
Token all
Token ID477
Feature activation-1.35e-02

Pos logit contributions
 aware+0.056
 decisions+0.054
 office+0.053
 oak+0.053
 bravery+0.053
Neg logit contributions
rant-0.095
mer-0.081
le-0.076
sc-0.068
led-0.067
Token.
Token ID13
Feature activation-2.74e-03

Pos logit contributions
 aware+0.023
 decisions+0.023
 oak+0.022
 bravery+0.022
 office+0.022
Neg logit contributions
rant-0.026
mer-0.021
le-0.019
sc-0.016
led-0.016
Token You
Token ID921
Feature activation-3.84e-03

Pos logit contributions
 aware+0.003
 decisions+0.003
 office+0.003
 oak+0.003
 property+0.003
Neg logit contributions
rant-0.006
mer-0.005
le-0.005
led-0.005
sc-0.004
Token have
Token ID423
Feature activation-1.98e-02

Pos logit contributions
 aware+0.006
 oak+0.006
 property+0.006
 bravery+0.006
 decisions+0.006
Neg logit contributions
rant-0.008
mer-0.006
le-0.006
sc-0.005
led-0.005
Token to
Token ID284
Feature activation-1.51e-02

Pos logit contributions
 aware+0.036
 decisions+0.035
 oak+0.035
unch+0.035
 bravery+0.035
Neg logit contributions
rant-0.033
mer-0.026
le-0.023
 Will-0.020
te-0.020
Token,
Token ID11
Feature activation-2.53e-02

Pos logit contributions
 aware+0.011
 decisions+0.010
 bravery+0.010
 oak+0.010
 office+0.010
Neg logit contributions
rant-0.011
mer-0.009
le-0.008
sc-0.007
te-0.007
Token fourteen
Token ID29167
Feature activation-1.35e-02

Pos logit contributions
 aware+0.035
 oak+0.034
 climbing+0.034
 decisions+0.033
 office+0.033
Neg logit contributions
rant-0.052
mer-0.046
le-0.041
sc-0.037
ola-0.036
Token,
Token ID11
Feature activation-2.27e-02

Pos logit contributions
 aware+0.026
 decisions+0.024
 property+0.024
 bravery+0.024
 office+0.024
Neg logit contributions
rant-0.022
mer-0.017
le-0.016
led-0.013
 Will-0.013
Token fifteen
Token ID17280
Feature activation-2.46e-02

Pos logit contributions
 aware+0.031
 decisions+0.030
 bravery+0.029
 oak+0.029
 office+0.029
Neg logit contributions
rant-0.050
mer-0.042
le-0.039
sc-0.035
te-0.034
Token,
Token ID11
Feature activation-2.65e-02

Pos logit contributions
 aware+0.046
 bravery+0.043
 decisions+0.043
 oak+0.043
 property+0.043
Neg logit contributions
rant-0.042
mer-0.033
le-0.031
led-0.025
te-0.025
Token sixteen
Token ID27913
Feature activation-4.18e-02

Pos logit contributions
 aware+0.036
 decisions+0.035
 oak+0.034
 bravery+0.034
 office+0.034
Neg logit contributions
rant-0.057
mer-0.049
le-0.045
sc-0.040
led-0.039
Token,
Token ID11
Feature activation-2.70e-02

Pos logit contributions
 aware+0.079
 bravery+0.074
 decisions+0.073
 oak+0.073
 property+0.073
Neg logit contributions
rant-0.070
mer-0.055
le-0.051
led-0.042
te-0.041
Token seventeen
Token ID38741
Feature activation-2.82e-02

Pos logit contributions
 aware+0.037
 decisions+0.036
 oak+0.036
 bravery+0.035
 office+0.035
Neg logit contributions
rant-0.058
mer-0.049
le-0.045
sc-0.040
led-0.039
Token,
Token ID11
Feature activation-2.48e-02

Pos logit contributions
 aware+0.052
 bravery+0.049
 oak+0.049
 decisions+0.049
 property+0.048
Neg logit contributions
rant-0.049
mer-0.039
le-0.036
led-0.030
te-0.029
Token eighteen
Token ID29095
Feature activation-2.74e-02

Pos logit contributions
 aware+0.034
 decisions+0.032
 oak+0.032
 bravery+0.032
 office+0.031
Neg logit contributions
rant-0.054
mer-0.046
le-0.042
sc-0.038
led-0.037
Token,
Token ID11
Feature activation-2.10e-02

Pos logit contributions
 aware+0.056
 oak+0.053
 leash+0.053
 property+0.053
 bravery+0.053
Neg logit contributions
rant-0.039
mer-0.030
le-0.028
y-0.022
led-0.022
Token.
Token ID13
Feature activation-4.95e-03

Pos logit contributions
 aware+0.022
 decisions+0.021
 oak+0.021
 bravery+0.020
 office+0.020
Neg logit contributions
rant-0.029
mer-0.024
le-0.022
sc-0.020
 Will-0.020
Token
Token ID220
Feature activation-7.10e-03

Pos logit contributions
 aware+0.008
 decisions+0.008
 bravery+0.007
 property+0.007
 discovery+0.007
Neg logit contributions
rant-0.010
mer-0.008
le-0.007
 Will-0.007
sc-0.007
Token
Token ID198
Feature activation-7.35e-03

Pos logit contributions
 aware+0.012
 decisions+0.011
 oak+0.011
 bravery+0.011
 property+0.011
Neg logit contributions
rant-0.013
mer-0.011
le-0.010
sc-0.009
 Will-0.009
Token
Token ID198
Feature activation-1.11e-02

Pos logit contributions
 aware+0.011
 decisions+0.011
 bravery+0.011
 oak+0.010
 office+0.010
Neg logit contributions
rant-0.015
mer-0.013
le-0.011
sc-0.010
te-0.010
TokenThe
Token ID464
Feature activation-3.98e-03

Pos logit contributions
 aware+0.019
 decisions+0.019
 bravery+0.018
 oak+0.018
 discovery+0.018
Neg logit contributions
rant-0.020
mer-0.017
le-0.015
sc-0.013
te-0.013
Token chip
Token ID11594
Feature activation-4.16e-02

Pos logit contributions
 aware+0.006
 bravery+0.005
 tal+0.005
 oak+0.005
 decisions+0.005
Neg logit contributions
rant-0.008
mer-0.007
le-0.006
 Will-0.006
sc-0.005
Tokenm
Token ID76
Feature activation+2.77e-02

Pos logit contributions
 aware+0.088
 decisions+0.086
 bravery+0.085
 oak+0.084
 discovery+0.083
Neg logit contributions
rant-0.059
mer-0.047
le-0.041
sc-0.033
led-0.033
Tokenunk
Token ID2954
Feature activation-1.33e-02

Pos logit contributions
rant+0.053
mer+0.045
le+0.041
led+0.035
te+0.035
Neg logit contributions
 aware-0.045
 oak-0.043
 bravery-0.042
 decisions-0.042
 impact-0.041
Token told
Token ID1297
Feature activation-7.06e-03

Pos logit contributions
 aware+0.025
 decisions+0.024
 bravery+0.024
 oak+0.024
 office+0.023
Neg logit contributions
rant-0.022
mer-0.018
le-0.016
sc-0.013
te-0.013
Token the
Token ID262
Feature activation-6.91e-03

Pos logit contributions
 aware+0.014
 decisions+0.013
 oak+0.013
 property+0.013
unch+0.013
Neg logit contributions
rant-0.011
mer-0.008
le-0.008
 Will-0.007
te-0.006
Token boy
Token ID2933
Feature activation-1.11e-02

Pos logit contributions
 aware+0.010
 bravery+0.009
 oak+0.009
shadow+0.009
 tal+0.009
Neg logit contributions
rant-0.014
mer-0.012
le-0.011
 Will-0.010
sc-0.010
TokenL
Token ID43
Feature activation+2.04e-02

Pos logit contributions
 aware+0.000
 understanding+0.000
 leash+0.000
unch+0.000
shadow+0.000
Neg logit contributions
rant-0.000
mer-0.000
le-0.000
led-0.000
 Will-0.000
Tokenily
Token ID813
Feature activation-1.33e-02

Pos logit contributions
rant+0.039
mer+0.034
ifying+0.031
 Will+0.028
 lit+0.026
Neg logit contributions
 aware-0.032
 decisions-0.032
 office-0.032
unch-0.031
 discovery-0.030
Token is
Token ID318
Feature activation-4.11e-04

Pos logit contributions
 aware+0.022
 decisions+0.021
 bravery+0.020
 oak+0.020
 office+0.020
Neg logit contributions
rant-0.026
mer-0.021
le-0.019
sc-0.017
te-0.016
Token not
Token ID407
Feature activation+5.76e-04

Pos logit contributions
 aware+0.001
 understanding+0.001
 decisions+0.001
 office+0.001
 bravery+0.001
Neg logit contributions
rant-0.001
mer-0.001
le-0.001
sc-0.001
led-0.001
Token good
Token ID922
Feature activation-6.73e-03

Pos logit contributions
rant+0.001
mer+0.001
le+0.001
sc+0.001
te+0.001
Neg logit contributions
 aware-0.001
 decisions-0.001
 bravery-0.001
 oak-0.001
 office-0.001
Token at
Token ID379
Feature activation-4.10e-02

Pos logit contributions
 aware+0.010
 decisions+0.010
 bravery+0.010
 office+0.010
 oak+0.010
Neg logit contributions
rant-0.014
mer-0.011
le-0.010
sc-0.009
te-0.009
Token writing
Token ID3597
Feature activation-7.43e-03

Pos logit contributions
 aware+0.053
 decisions+0.049
 oak+0.048
unch+0.046
 bravery+0.046
Neg logit contributions
rant-0.095
mer-0.079
le-0.074
 Will-0.068
te-0.068
Token and
Token ID290
Feature activation+2.43e-03

Pos logit contributions
 decisions+0.012
 office+0.012
 bravery+0.012
 aware+0.012
 understanding+0.011
Neg logit contributions
rant-0.015
mer-0.012
le-0.010
sc-0.010
izz-0.010
Token counting
Token ID14143
Feature activation-7.65e-03

Pos logit contributions
rant+0.005
mer+0.004
le+0.004
 Will+0.003
sc+0.003
Neg logit contributions
 aware-0.004
 decisions-0.003
 oak-0.003
unch-0.003
 bravery-0.003
Token.
Token ID13
Feature activation+4.45e-03

Pos logit contributions
 decisions+0.013
 aware+0.013
 bravery+0.013
 office+0.012
 understanding+0.012
Neg logit contributions
rant-0.014
mer-0.012
le-0.010
sc-0.009
izz-0.009
Token She
Token ID1375
Feature activation-4.89e-03

Pos logit contributions
rant+0.010
mer+0.008
le+0.008
sc+0.007
te+0.007
Neg logit contributions
 aware-0.006
 decisions-0.006
 bravery-0.006
 oak-0.006
 office-0.006
Token!"
Token ID2474
Feature activation-9.25e-03

Pos logit contributions
 aware+0.015
 decisions+0.015
 oak+0.014
 bravery+0.014
 office+0.014
Neg logit contributions
rant-0.013
mer-0.010
le-0.009
sc-0.007
te-0.007
Token
Token ID220
Feature activation+2.84e-03

Pos logit contributions
 property+0.018
 aware+0.017
 decisions+0.017
 decision+0.017
 bravery+0.016
Neg logit contributions
rant-0.015
mer-0.012
 Will-0.011
le-0.011
sc-0.010
Token
Token ID198
Feature activation-4.37e-03

Pos logit contributions
rant+0.004
mer+0.004
le+0.003
 Will+0.003
ola+0.003
Neg logit contributions
 aware-0.005
 property-0.005
 bravery-0.005
 oak-0.005
 impact-0.005
Token
Token ID198
Feature activation-1.06e-02

Pos logit contributions
 decisions+0.008
 aware+0.008
 property+0.008
 bravery+0.007
 impact+0.007
Neg logit contributions
rant-0.008
mer-0.006
le-0.006
 Will-0.005
sc-0.005
TokenThe
Token ID464
Feature activation+2.01e-03

Pos logit contributions
 decisions+0.021
 bravery+0.020
 aware+0.020
 property+0.020
 impact+0.020
Neg logit contributions
rant-0.015
mer-0.013
le-0.011
te-0.011
sc-0.011
Token chip
Token ID11594
Feature activation-4.07e-02

Pos logit contributions
rant+0.004
mer+0.003
le+0.003
 Will+0.003
te+0.003
Neg logit contributions
 aware-0.003
 decisions-0.003
 property-0.003
shadow-0.003
 impact-0.003
Tokenm
Token ID76
Feature activation+3.22e-02

Pos logit contributions
 aware+0.081
 decisions+0.078
 bravery+0.077
 oak+0.076
 office+0.076
Neg logit contributions
rant-0.065
mer-0.051
le-0.046
sc-0.038
led-0.037
Tokenunk
Token ID2954
Feature activation-9.39e-03

Pos logit contributions
rant+0.066
mer+0.054
le+0.050
sc+0.044
te+0.043
Neg logit contributions
 aware-0.050
 decisions-0.048
 bravery-0.047
 oak-0.047
 office-0.046
Token finished
Token ID5201
Feature activation+3.98e-03

Pos logit contributions
 aware+0.019
 decisions+0.019
 oak+0.018
 bravery+0.018
 office+0.018
Neg logit contributions
rant-0.014
mer-0.011
le-0.010
sc-0.008
te-0.008
Token the
Token ID262
Feature activation-1.85e-03

Pos logit contributions
rant+0.008
mer+0.006
le+0.006
 Will+0.005
te+0.005
Neg logit contributions
 aware-0.006
 decisions-0.006
 property-0.006
 impact-0.006
 oak-0.006
Token treat
Token ID2190
Feature activation-9.19e-03

Pos logit contributions
 aware+0.003
 decisions+0.003
 property+0.003
 impact+0.003
joy+0.003
Neg logit contributions
rant-0.004
mer-0.003
le-0.003
sc-0.003
 Will-0.002
Token books
Token ID3835
Feature activation-4.95e-03

Pos logit contributions
 aware+0.015
 decisions+0.015
 oak+0.014
 bravery+0.014
 office+0.014
Neg logit contributions
rant-0.017
mer-0.013
le-0.012
sc-0.011
te-0.010
Token.
Token ID13
Feature activation-2.33e-03

Pos logit contributions
 office+0.008
 bike+0.007
 aware+0.007
 climbing+0.007
 oak+0.007
Neg logit contributions
rant-0.010
mer-0.008
le-0.007
led-0.006
izz-0.006
Token I
Token ID314
Feature activation-4.90e-03

Pos logit contributions
 aware+0.003
 decisions+0.003
 office+0.003
 oak+0.003
 bravery+0.003
Neg logit contributions
rant-0.005
mer-0.005
le-0.004
led-0.004
vy-0.004
Token am
Token ID716
Feature activation+2.78e-03

Pos logit contributions
 aware+0.006
 impact+0.006
 property+0.006
 discovery+0.006
 bravery+0.006
Neg logit contributions
rant-0.011
mer-0.008
bly-0.007
le-0.007
led-0.007
Token better
Token ID1365
Feature activation-1.41e-02

Pos logit contributions
rant+0.006
mer+0.005
le+0.004
te+0.004
 Will+0.004
Neg logit contributions
 aware-0.004
 decisions-0.004
 oak-0.004
 bravery-0.004
 discovery-0.004
Token at
Token ID379
Feature activation-4.05e-02

Pos logit contributions
 aware+0.024
 decisions+0.024
 oak+0.024
 bravery+0.024
 discovery+0.023
Neg logit contributions
rant-0.025
mer-0.021
le-0.018
te-0.016
sc-0.016
Token numbers
Token ID3146
Feature activation-7.52e-03

Pos logit contributions
 aware+0.054
 decisions+0.051
 bravery+0.048
 oak+0.048
unch+0.047
Neg logit contributions
rant-0.092
mer-0.076
le-0.071
 Will-0.067
te-0.065
Token than
Token ID621
Feature activation-2.50e-02

Pos logit contributions
 aware+0.012
 office+0.012
 decisions+0.012
 bike+0.012
 bravery+0.012
Neg logit contributions
rant-0.014
mer-0.011
le-0.010
izz-0.009
sc-0.009
Token you
Token ID345
Feature activation-7.33e-03

Pos logit contributions
 aware+0.037
 decisions+0.036
 bravery+0.036
 oak+0.036
 office+0.035
Neg logit contributions
rant-0.051
mer-0.043
le-0.039
sc-0.035
led-0.034
Token."
Token ID526
Feature activation-1.11e-02

Pos logit contributions
 aware+0.014
 decisions+0.014
 bravery+0.014
 oak+0.014
 office+0.014
Neg logit contributions
rant-0.012
mer-0.009
le-0.008
sc-0.007
led-0.007
Token
Token ID198
Feature activation-8.99e-03

Pos logit contributions
 aware+0.021
 decisions+0.019
 property+0.019
 bravery+0.019
 oak+0.019
Neg logit contributions
rant-0.019
mer-0.015
le-0.014
 Will-0.013
sc-0.013
Token floor
Token ID4314
Feature activation-3.30e-03

Pos logit contributions
 aware+0.003
 oak+0.003
 decisions+0.003
 bravery+0.002
 office+0.002
Neg logit contributions
rant-0.005
mer-0.004
le-0.004
te-0.004
sc-0.003
Token.
Token ID13
Feature activation+3.46e-03

Pos logit contributions
 aware+0.006
 office+0.006
 decisions+0.006
 oak+0.006
 bravery+0.005
Neg logit contributions
rant-0.006
mer-0.005
le-0.004
 Will-0.004
sc-0.004
Token
Token ID198
Feature activation-2.77e-03

Pos logit contributions
rant+0.007
mer+0.006
le+0.005
sc+0.005
 Will+0.005
Neg logit contributions
 aware-0.005
 decisions-0.005
 bravery-0.005
 oak-0.005
 office-0.005
Token
Token ID198
Feature activation-6.14e-03

Pos logit contributions
 aware+0.004
 decisions+0.004
 oak+0.004
 bravery+0.004
 office+0.004
Neg logit contributions
rant-0.006
mer-0.005
le-0.005
sc-0.004
led-0.004
TokenThe
Token ID464
Feature activation-7.03e-04

Pos logit contributions
 aware+0.010
 decisions+0.009
 oak+0.009
 bravery+0.009
 office+0.009
Neg logit contributions
rant-0.013
mer-0.010
le-0.009
sc-0.008
led-0.008
Token chip
Token ID11594
Feature activation-4.00e-02

Pos logit contributions
 aware+0.001
 decisions+0.001
unch+0.001
shadow+0.001
 impact+0.001
Neg logit contributions
rant-0.001
mer-0.001
le-0.001
 Will-0.001
sc-0.001
Tokenmun
Token ID6199
Feature activation+9.40e-03

Pos logit contributions
 aware+0.081
 decisions+0.078
 bravery+0.077
 oak+0.076
 office+0.075
Neg logit contributions
rant-0.063
mer-0.050
le-0.044
sc-0.036
led-0.035
Tokenks
Token ID591
Feature activation+8.04e-03

Pos logit contributions
rant+0.021
mer+0.017
le+0.015
ifying+0.015
sc+0.015
Neg logit contributions
 aware-0.013
 decisions-0.011
 understanding-0.011
 bravery-0.010
 property-0.010
Token were
Token ID547
Feature activation-1.75e-03

Pos logit contributions
rant+0.014
mer+0.010
le+0.009
sc+0.009
ola+0.008
Neg logit contributions
 aware-0.014
 understanding-0.014
 bravery-0.014
 oak-0.013
 discovery-0.013
Token very
Token ID845
Feature activation+4.93e-03

Pos logit contributions
 aware+0.003
 decisions+0.003
 bravery+0.003
 understanding+0.003
 oak+0.003
Neg logit contributions
rant-0.004
mer-0.003
le-0.003
sc-0.002
led-0.002
Token curious
Token ID11040
Feature activation-3.79e-03

Pos logit contributions
rant+0.013
mer+0.010
le+0.010
sc+0.009
ifying+0.009
Neg logit contributions
 aware-0.006
 understanding-0.006
 decisions-0.005
 bravery-0.005
 discovery-0.005
Token "
Token ID366
Feature activation+5.37e-03

Pos logit contributions
 decisions+0.002
 aware+0.002
 property+0.002
 oak+0.002
 bravery+0.002
Neg logit contributions
rant-0.004
mer-0.003
le-0.003
 Will-0.003
te-0.003
TokenYou
Token ID1639
Feature activation-1.64e-04

Pos logit contributions
rant+0.012
mer+0.010
le+0.009
led+0.008
vy+0.008
Neg logit contributions
 aware-0.007
 decisions-0.007
 understanding-0.007
 oak-0.007
 impact-0.006
Token're
Token ID821
Feature activation-3.04e-03

Pos logit contributions
 aware+0.000
 decisions+0.000
 oak+0.000
 office+0.000
 bravery+0.000
Neg logit contributions
rant-0.000
mer-0.000
le-0.000
te-0.000
 Will-0.000
Token really
Token ID1107
Feature activation-4.55e-03

Pos logit contributions
 aware+0.005
 decisions+0.005
 oak+0.005
 bravery+0.005
 office+0.005
Neg logit contributions
rant-0.006
mer-0.005
le-0.004
sc-0.004
te-0.004
Token good
Token ID922
Feature activation-1.22e-02

Pos logit contributions
 aware+0.006
 decisions+0.006
 bravery+0.006
 understanding+0.006
 oak+0.006
Neg logit contributions
rant-0.010
mer-0.009
le-0.008
sc-0.007
led-0.007
Token at
Token ID379
Feature activation-4.00e-02

Pos logit contributions
 aware+0.019
 decisions+0.018
 bravery+0.017
 oak+0.017
 office+0.017
Neg logit contributions
rant-0.025
mer-0.021
le-0.019
sc-0.017
te-0.017
Token matching
Token ID12336
Feature activation-9.95e-03

Pos logit contributions
 aware+0.048
 decisions+0.045
 bravery+0.044
 oak+0.044
 office+0.044
Neg logit contributions
rant-0.096
mer-0.082
le-0.077
sc-0.070
led-0.068
Token,
Token ID11
Feature activation+4.63e-05

Pos logit contributions
 decisions+0.013
 aware+0.013
 bravery+0.013
 discovery+0.013
 oak+0.013
Neg logit contributions
rant-0.022
mer-0.018
le-0.017
sc-0.015
led-0.015
Token Lily
Token ID20037
Feature activation-8.19e-03

Pos logit contributions
rant+0.000
mer+0.000
le+0.000
 Will+0.000
te+0.000
Neg logit contributions
 aware-0.000
 decisions-0.000
 property-0.000
 bravery-0.000
 oak-0.000
Token!"
Token ID2474
Feature activation-1.03e-02

Pos logit contributions
 aware+0.016
 decisions+0.015
 office+0.015
 bravery+0.015
 oak+0.015
Neg logit contributions
rant-0.014
mer-0.011
le-0.010
sc-0.008
led-0.008
Token
Token ID198
Feature activation-6.59e-03

Pos logit contributions
 aware+0.020
 property+0.020
 decisions+0.019
 bravery+0.019
 oak+0.019
Neg logit contributions
rant-0.017
mer-0.013
le-0.012
 Will-0.012
sc-0.011
Token.
Token ID13
Feature activation+2.19e-03

Pos logit contributions
 aware+0.003
 decisions+0.003
 oak+0.003
 office+0.003
 bravery+0.003
Neg logit contributions
rant-0.003
mer-0.002
le-0.002
sc-0.002
 Will-0.002
Token He
Token ID679
Feature activation-7.22e-03

Pos logit contributions
rant+0.005
mer+0.004
le+0.003
sc+0.003
te+0.003
Neg logit contributions
 aware-0.003
 decisions-0.003
 bravery-0.003
 oak-0.003
 office-0.003
Token was
Token ID373
Feature activation+1.60e-03

Pos logit contributions
 office+0.014
 aware+0.013
 oak+0.013
 climbing+0.013
 bike+0.012
Neg logit contributions
rant-0.013
mer-0.010
le-0.009
ifying-0.009
bly-0.008
Token very
Token ID845
Feature activation-2.92e-03

Pos logit contributions
rant+0.003
mer+0.002
le+0.002
sc+0.002
te+0.002
Neg logit contributions
 aware-0.003
 decisions-0.003
 bravery-0.003
 oak-0.003
 discovery-0.003
Token good
Token ID922
Feature activation-1.32e-02

Pos logit contributions
 aware+0.003
 understanding+0.003
 oak+0.003
 office+0.003
 bravery+0.003
Neg logit contributions
rant-0.007
mer-0.006
le-0.006
sc-0.005
ifying-0.005
Token at
Token ID379
Feature activation-3.98e-02

Pos logit contributions
 aware+0.022
 decisions+0.022
 oak+0.021
 bravery+0.021
 discovery+0.021
Neg logit contributions
rant-0.024
mer-0.020
le-0.018
te-0.016
sc-0.016
Token it
Token ID340
Feature activation-1.02e-02

Pos logit contributions
 aware+0.048
 decisions+0.046
 bravery+0.044
 oak+0.044
 office+0.044
Neg logit contributions
rant-0.095
mer-0.080
le-0.075
sc-0.068
te-0.066
Token.
Token ID13
Feature activation+3.43e-03

Pos logit contributions
 aware+0.019
 decisions+0.018
 bravery+0.018
 oak+0.018
 understanding+0.017
Neg logit contributions
rant-0.018
mer-0.014
le-0.013
sc-0.011
te-0.011
Token One
Token ID1881
Feature activation-9.42e-03

Pos logit contributions
rant+0.007
mer+0.006
le+0.005
sc+0.005
 Will+0.005
Neg logit contributions
 aware-0.005
 decisions-0.005
 bravery-0.005
 oak-0.005
 property-0.005
Token day
Token ID1110
Feature activation+1.50e-02

Pos logit contributions
 aware+0.013
 oak+0.013
 decisions+0.013
 office+0.012
 bravery+0.012
Neg logit contributions
rant-0.021
mer-0.017
le-0.016
sc-0.014
led-0.014
Token,
Token ID11
Feature activation+4.95e-03

Pos logit contributions
rant+0.030
mer+0.024
le+0.022
sc+0.019
te+0.019
Neg logit contributions
 aware-0.024
 decisions-0.023
 oak-0.023
 bravery-0.023
 discovery-0.023
TokenYeah
Token ID10995
Feature activation-2.88e-03

Pos logit contributions
rant+0.006
le+0.005
mer+0.005
led+0.004
ola+0.004
Neg logit contributions
 aware-0.003
 understanding-0.003
 bravery-0.003
 leash-0.003
 oak-0.003
Token,
Token ID11
Feature activation+4.24e-03

Pos logit contributions
 aware+0.005
 decisions+0.005
 oak+0.005
 bravery+0.005
 office+0.005
Neg logit contributions
rant-0.005
mer-0.004
le-0.004
sc-0.003
 Will-0.003
Token we
Token ID356
Feature activation-3.44e-04

Pos logit contributions
rant+0.008
mer+0.006
le+0.006
 Will+0.005
sc+0.005
Neg logit contributions
 aware-0.007
 decisions-0.007
joy-0.007
 discovery-0.007
 oak-0.007
Token are
Token ID389
Feature activation-1.65e-03

Pos logit contributions
 aware+0.001
 decisions+0.001
joy+0.001
 oak+0.001
 bravery+0.001
Neg logit contributions
rant-0.000
mer-0.000
le-0.000
sc-0.000
te-0.000
Token good
Token ID922
Feature activation-1.36e-02

Pos logit contributions
 aware+0.002
 decisions+0.002
 oak+0.002
unch+0.002
 discovery+0.002
Neg logit contributions
rant-0.004
mer-0.003
le-0.003
 Will-0.003
te-0.003
Token at
Token ID379
Feature activation-3.97e-02

Pos logit contributions
 aware+0.020
 oak+0.019
 decisions+0.019
 bravery+0.019
 property+0.018
Neg logit contributions
rant-0.027
mer-0.023
le-0.022
 Will-0.019
te-0.019
Token fixing
Token ID18682
Feature activation-9.82e-03

Pos logit contributions
 aware+0.055
 decisions+0.051
unch+0.050
 decision+0.048
joy+0.048
Neg logit contributions
rant-0.083
mer-0.067
 Will-0.063
le-0.060
te-0.060
Token things
Token ID1243
Feature activation+4.07e-03

Pos logit contributions
 aware+0.014
 decisions+0.013
 bravery+0.013
 oak+0.013
 office+0.013
Neg logit contributions
rant-0.022
mer-0.018
le-0.017
sc-0.015
te-0.015
Token,"
Token ID553
Feature activation-5.87e-03

Pos logit contributions
rant+0.008
mer+0.006
le+0.006
sc+0.005
te+0.005
Neg logit contributions
 aware-0.007
 decisions-0.006
 bravery-0.006
 oak-0.006
 office-0.006
Token Ben
Token ID3932
Feature activation+1.20e-02

Pos logit contributions
 aware+0.010
 decisions+0.010
 property+0.010
 bravery+0.010
 discovery+0.010
Neg logit contributions
rant-0.011
mer-0.009
le-0.008
 Will-0.007
sc-0.007
Token said
Token ID531
Feature activation-7.82e-03

Pos logit contributions
rant+0.023
mer+0.019
le+0.017
sc+0.015
te+0.014
Neg logit contributions
 aware-0.020
 decisions-0.019
 bravery-0.019
 understanding-0.019
 oak-0.019
Token too
Token ID1165
Feature activation-2.67e-03

Pos logit contributions
 aware+0.010
 decisions+0.010
 oak+0.010
 office+0.010
 bravery+0.010
Neg logit contributions
rant-0.008
mer-0.006
le-0.005
sc-0.004
te-0.004
Token.
Token ID13
Feature activation-6.28e-03

Pos logit contributions
 aware+0.005
 decisions+0.005
 oak+0.005
 bravery+0.005
 office+0.005
Neg logit contributions
rant-0.004
mer-0.003
le-0.003
sc-0.002
te-0.002
Token
Token ID198
Feature activation-8.22e-03

Pos logit contributions
 aware+0.012
 property+0.012
 decisions+0.011
 bravery+0.011
 office+0.011
Neg logit contributions
rant-0.011
mer-0.009
 Will-0.007
sc-0.007
le-0.007
Token
Token ID198
Feature activation-1.38e-02

Pos logit contributions
 aware+0.013
 decisions+0.013
 bravery+0.012
 oak+0.012
 office+0.012
Neg logit contributions
rant-0.016
mer-0.013
le-0.012
sc-0.011
te-0.011
TokenThe
Token ID464
Feature activation+1.35e-03

Pos logit contributions
 aware+0.022
 decisions+0.022
 bravery+0.021
 oak+0.021
 office+0.021
Neg logit contributions
rant-0.027
mer-0.022
le-0.020
sc-0.018
te-0.018
Token chip
Token ID11594
Feature activation-3.97e-02

Pos logit contributions
rant+0.002
mer+0.002
le+0.002
 Will+0.002
sc+0.002
Neg logit contributions
 aware-0.002
 property-0.002
 decisions-0.002
unch-0.002
joy-0.002
Tokenm
Token ID76
Feature activation+3.12e-02

Pos logit contributions
 aware+0.084
 decisions+0.082
 bravery+0.081
 oak+0.079
 property+0.079
Neg logit contributions
rant-0.057
mer-0.045
le-0.040
sc-0.031
led-0.031
Tokenunk
Token ID2954
Feature activation-1.35e-02

Pos logit contributions
rant+0.059
mer+0.049
le+0.045
led+0.039
te+0.038
Neg logit contributions
 aware-0.051
 oak-0.050
 decisions-0.049
 bravery-0.049
 impact-0.048
Token helped
Token ID4193
Feature activation+8.76e-03

Pos logit contributions
 aware+0.026
 decisions+0.025
 bravery+0.025
 oak+0.025
 office+0.025
Neg logit contributions
rant-0.022
mer-0.017
le-0.015
sc-0.013
te-0.013
Token the
Token ID262
Feature activation+2.80e-03

Pos logit contributions
rant+0.017
mer+0.014
le+0.012
sc+0.011
led+0.011
Neg logit contributions
 aware-0.015
 decisions-0.014
 oak-0.014
 bravery-0.014
 office-0.014
Token frog
Token ID21264
Feature activation-1.54e-03

Pos logit contributions
rant+0.005
mer+0.004
le+0.004
 Will+0.003
y+0.003
Neg logit contributions
 aware-0.005
joy-0.004
unch-0.004
 bravery-0.004
isdom-0.004
Token.
Token ID13
Feature activation+4.29e-03

Pos logit contributions
 aware+0.018
 understanding+0.017
 bravery+0.017
 decisions+0.017
 office+0.016
Neg logit contributions
rant-0.021
mer-0.016
le-0.014
sc-0.013
ifying-0.013
Token "
Token ID366
Feature activation+7.50e-03

Pos logit contributions
rant+0.010
mer+0.008
le+0.007
sc+0.006
led+0.006
Neg logit contributions
 aware-0.006
 decisions-0.006
 bravery-0.006
 understanding-0.005
 office-0.005
TokenYou
Token ID1639
Feature activation-1.80e-03

Pos logit contributions
rant+0.014
mer+0.012
le+0.011
sc+0.009
te+0.009
Neg logit contributions
 aware-0.013
 decisions-0.012
 bravery-0.012
 oak-0.012
 office-0.012
Token are
Token ID389
Feature activation+2.13e-04

Pos logit contributions
 aware+0.004
 decisions+0.003
 oak+0.003
joy+0.003
 discovery+0.003
Neg logit contributions
rant-0.003
mer-0.002
le-0.002
sc-0.002
 Will-0.002
Token good
Token ID922
Feature activation-1.11e-02

Pos logit contributions
rant+0.000
mer+0.000
le+0.000
 Will+0.000
te+0.000
Neg logit contributions
 aware-0.000
 oak-0.000
unch-0.000
 decisions-0.000
 discovery-0.000
Token at
Token ID379
Feature activation-3.96e-02

Pos logit contributions
 aware+0.017
 oak+0.016
 decisions+0.016
 bravery+0.016
 discovery+0.015
Neg logit contributions
rant-0.023
mer-0.019
le-0.018
te-0.016
 Will-0.016
Token this
Token ID428
Feature activation-9.68e-03

Pos logit contributions
 aware+0.060
 decisions+0.056
 oak+0.054
 property+0.053
 tal+0.053
Neg logit contributions
rant-0.081
mer-0.066
le-0.059
 Will-0.059
te-0.057
Token game
Token ID983
Feature activation-7.62e-03

Pos logit contributions
 aware+0.013
 decisions+0.012
 bravery+0.012
 oak+0.012
 office+0.012
Neg logit contributions
rant-0.022
mer-0.018
le-0.017
sc-0.015
te-0.015
Token."
Token ID526
Feature activation-5.44e-03

Pos logit contributions
 aware+0.013
 decisions+0.013
 office+0.013
 bravery+0.012
 oak+0.012
Neg logit contributions
rant-0.014
mer-0.011
le-0.010
led-0.009
izz-0.008
Token
Token ID198
Feature activation-4.54e-03

Pos logit contributions
 aware+0.009
 decisions+0.009
 oak+0.009
 bravery+0.009
 property+0.009
Neg logit contributions
rant-0.010
mer-0.008
le-0.008
sc-0.007
 Will-0.007
Token
Token ID198
Feature activation-3.86e-03

Pos logit contributions
 aware+0.007
 decisions+0.007
 oak+0.007
 bravery+0.007
 property+0.007
Neg logit contributions
rant-0.009
mer-0.007
le-0.007
sc-0.006
te-0.006
Token.
Token ID13
Feature activation+2.31e-03

Pos logit contributions
rant+0.000
mer+0.000
le+0.000
sc+0.000
 Will+0.000
Neg logit contributions
 aware-0.001
 decisions-0.000
 oak-0.000
 office-0.000
 bravery-0.000
Token He
Token ID679
Feature activation-8.88e-03

Pos logit contributions
rant+0.005
mer+0.004
le+0.003
sc+0.003
 Will+0.003
Neg logit contributions
 aware-0.004
 decisions-0.004
 bravery-0.003
 oak-0.003
 discovery-0.003
Token was
Token ID373
Feature activation+3.94e-05

Pos logit contributions
 office+0.016
 aware+0.015
 oak+0.015
 chain+0.015
 climbing+0.015
Neg logit contributions
rant-0.016
mer-0.013
le-0.012
ifying-0.011
bly-0.010
Token very
Token ID845
Feature activation-4.36e-03

Pos logit contributions
rant+0.000
mer+0.000
le+0.000
sc+0.000
te+0.000
Neg logit contributions
 aware-0.000
 decisions-0.000
 bravery-0.000
 oak-0.000
 discovery-0.000
Token good
Token ID922
Feature activation-1.41e-02

Pos logit contributions
 aware+0.005
 understanding+0.005
 oak+0.004
 bravery+0.004
 office+0.004
Neg logit contributions
rant-0.011
mer-0.009
le-0.009
sc-0.008
ifying-0.008
Token at
Token ID379
Feature activation-3.95e-02

Pos logit contributions
 aware+0.023
 decisions+0.023
 oak+0.022
 bravery+0.022
 discovery+0.022
Neg logit contributions
rant-0.027
mer-0.022
le-0.020
te-0.017
sc-0.017
Token it
Token ID340
Feature activation-1.09e-02

Pos logit contributions
 aware+0.054
 decisions+0.052
 bravery+0.050
 oak+0.050
 property+0.048
Neg logit contributions
rant-0.088
mer-0.073
le-0.068
sc-0.061
te-0.060
Token and
Token ID290
Feature activation-7.92e-03

Pos logit contributions
 aware+0.020
 decisions+0.019
 bravery+0.019
 oak+0.019
 office+0.018
Neg logit contributions
rant-0.020
mer-0.016
le-0.014
sc-0.012
te-0.012
Token his
Token ID465
Feature activation-1.97e-03

Pos logit contributions
 aware+0.014
 oak+0.013
 bravery+0.013
 decisions+0.013
 office+0.013
Neg logit contributions
rant-0.015
mer-0.012
le-0.011
sc-0.009
ifying-0.009
Token towers
Token ID18028
Feature activation-2.38e-03

Pos logit contributions
 aware+0.003
shadow+0.003
unch+0.003
 decisions+0.003
 tal+0.002
Neg logit contributions
rant-0.004
mer-0.003
te-0.003
 Will-0.003
le-0.003
Token were
Token ID547
Feature activation+3.99e-03

Pos logit contributions
 aware+0.005
 decisions+0.005
 bravery+0.005
 oak+0.005
 office+0.005
Neg logit contributions
rant-0.004
mer-0.003
le-0.003
sc-0.002
te-0.002
Token
Token ID198
Feature activation+3.44e-03

Pos logit contributions
rant+0.021
mer+0.018
le+0.017
led+0.015
te+0.015
Neg logit contributions
 aware-0.011
 decisions-0.011
joy-0.010
 office-0.010
 oak-0.010
Token
Token ID198
Feature activation+3.46e-03

Pos logit contributions
rant+0.009
mer+0.008
le+0.007
led+0.007
ifying+0.006
Neg logit contributions
 aware-0.003
unch-0.003
 office-0.003
 discovery-0.003
 understanding-0.003
TokenBen
Token ID11696
Feature activation-2.75e-04

Pos logit contributions
rant+0.009
mer+0.007
le+0.007
led+0.006
 Will+0.006
Neg logit contributions
 aware-0.004
 understanding-0.003
 leash-0.003
unch-0.003
shadow-0.003
Token is
Token ID318
Feature activation-2.42e-03

Pos logit contributions
 aware+0.000
 office+0.000
 decisions+0.000
 oak+0.000
 bravery+0.000
Neg logit contributions
rant-0.001
mer-0.000
le-0.000
sc-0.000
ifying-0.000
Token good
Token ID922
Feature activation-8.47e-03

Pos logit contributions
 aware+0.004
 office+0.003
 decisions+0.003
 understanding+0.003
 bravery+0.003
Neg logit contributions
rant-0.005
mer-0.004
le-0.004
sc-0.004
led-0.004
Token at
Token ID379
Feature activation-3.93e-02

Pos logit contributions
 aware+0.014
 decisions+0.013
 oak+0.013
 bravery+0.013
 office+0.012
Neg logit contributions
rant-0.017
mer-0.014
le-0.013
sc-0.011
te-0.011
Token writing
Token ID3597
Feature activation-5.68e-03

Pos logit contributions
 aware+0.050
unch+0.046
 decisions+0.046
 oak+0.046
 tal+0.045
Neg logit contributions
rant-0.090
mer-0.074
le-0.069
 Will-0.068
te-0.065
Token and
Token ID290
Feature activation-6.27e-04

Pos logit contributions
 office+0.009
 decisions+0.009
 bravery+0.009
 aware+0.009
 understanding+0.008
Neg logit contributions
rant-0.011
mer-0.009
le-0.008
izz-0.008
sc-0.008
Token counting
Token ID14143
Feature activation-3.99e-03

Pos logit contributions
 aware+0.001
 decisions+0.001
 oak+0.001
 bravery+0.001
 office+0.001
Neg logit contributions
rant-0.001
mer-0.001
le-0.001
sc-0.001
te-0.001
Token.
Token ID13
Feature activation+4.57e-03

Pos logit contributions
 decisions+0.006
 aware+0.006
 bravery+0.006
 office+0.006
 discovery+0.006
Neg logit contributions
rant-0.008
mer-0.006
le-0.006
sc-0.005
izz-0.005
Token He
Token ID679
Feature activation-5.41e-03

Pos logit contributions
rant+0.009
mer+0.008
le+0.007
sc+0.006
te+0.006
Neg logit contributions
 aware-0.007
 decisions-0.007
 oak-0.007
 bravery-0.007
 office-0.007
Token!
Token ID0
Feature activation-5.16e-03

Pos logit contributions
rant+0.014
mer+0.011
le+0.010
sc+0.008
led+0.008
Neg logit contributions
 aware-0.015
 decisions-0.015
 oak-0.014
 bravery-0.014
 office-0.014
Token
Token ID220
Feature activation-3.92e-03

Pos logit contributions
 aware+0.010
 property+0.009
 decisions+0.009
 bravery+0.009
 discovery+0.009
Neg logit contributions
rant-0.009
mer-0.007
 Will-0.006
sc-0.006
le-0.006
Token
Token ID198
Feature activation-7.67e-03

Pos logit contributions
 aware+0.008
 oak+0.008
 bravery+0.007
 decisions+0.007
 property+0.007
Neg logit contributions
rant-0.006
mer-0.005
le-0.004
 Will-0.004
ola-0.004
Token
Token ID198
Feature activation-1.39e-02

Pos logit contributions
 aware+0.012
 decisions+0.012
 bravery+0.012
 property+0.012
 office+0.012
Neg logit contributions
rant-0.015
mer-0.012
le-0.011
sc-0.010
te-0.010
TokenThe
Token ID464
Feature activation+2.11e-03

Pos logit contributions
 aware+0.023
 decisions+0.023
 bravery+0.022
 oak+0.022
 office+0.022
Neg logit contributions
rant-0.026
mer-0.022
le-0.020
sc-0.017
te-0.017
Token chip
Token ID11594
Feature activation-3.93e-02

Pos logit contributions
rant+0.003
mer+0.003
le+0.003
me+0.003
 Will+0.002
Neg logit contributions
 aware-0.003
isdom-0.003
ogs-0.003
shadow-0.003
owl-0.003
Tokenm
Token ID76
Feature activation+3.69e-02

Pos logit contributions
 aware+0.085
 decisions+0.084
 bravery+0.082
 discovery+0.081
 property+0.081
Neg logit contributions
rant-0.054
mer-0.043
le-0.037
led-0.029
sc-0.029
Tokenunk
Token ID2954
Feature activation-9.23e-03

Pos logit contributions
rant+0.064
mer+0.054
le+0.049
led+0.042
y+0.042
Neg logit contributions
 aware-0.064
 oak-0.063
 impact-0.063
 decisions-0.062
 bravery-0.062
Token was
Token ID373
Feature activation+5.66e-04

Pos logit contributions
 aware+0.019
 decisions+0.018
 bravery+0.018
 oak+0.018
 office+0.018
Neg logit contributions
rant-0.014
mer-0.011
le-0.010
sc-0.008
te-0.008
Token excited
Token ID6568
Feature activation-2.99e-03

Pos logit contributions
rant+0.001
mer+0.001
le+0.001
sc+0.001
te+0.001
Neg logit contributions
 aware-0.001
 decisions-0.001
 oak-0.001
 bravery-0.001
 office-0.001
Token and
Token ID290
Feature activation+8.77e-03

Pos logit contributions
 aware+0.006
 oak+0.006
 decisions+0.006
 office+0.006
 property+0.006
Neg logit contributions
rant-0.005
mer-0.004
le-0.003
 Will-0.003
led-0.003
Token flowers
Token ID12734
Feature activation+1.94e-03

Pos logit contributions
 aware+0.029
 decisions+0.027
 oak+0.027
 bravery+0.027
 office+0.026
Neg logit contributions
rant-0.034
mer-0.028
le-0.026
sc-0.022
te-0.022
Token.
Token ID13
Feature activation+5.36e-03

Pos logit contributions
rant+0.004
mer+0.003
le+0.003
sc+0.002
led+0.002
Neg logit contributions
 aware-0.003
 oak-0.003
 office-0.003
 decisions-0.003
 bravery-0.003
Token Lily
Token ID20037
Feature activation-5.84e-03

Pos logit contributions
rant+0.013
mer+0.011
le+0.010
led+0.010
te+0.009
Neg logit contributions
 aware-0.006
 oak-0.006
 decisions-0.006
 office-0.006
unch-0.006
Token is
Token ID318
Feature activation-1.51e-07

Pos logit contributions
 aware+0.008
 office+0.008
 oak+0.008
 climbing+0.007
 decisions+0.007
Neg logit contributions
rant-0.013
mer-0.011
ifying-0.010
le-0.009
bly-0.009
Token good
Token ID922
Feature activation-9.69e-03

Feature not active on prev token;
no predictions were affected.
Token at
Token ID379
Feature activation-3.92e-02

Pos logit contributions
 aware+0.015
 decisions+0.014
 bravery+0.014
 office+0.014
 oak+0.014
Neg logit contributions
rant-0.021
mer-0.017
le-0.015
sc-0.014
led-0.013
Token cutting
Token ID7720
Feature activation-1.28e-02

Pos logit contributions
 aware+0.054
 decisions+0.052
 oak+0.050
 bravery+0.049
 property+0.049
Neg logit contributions
rant-0.087
mer-0.071
le-0.066
 Will-0.060
te-0.060
Token paper
Token ID3348
Feature activation-6.06e-03

Pos logit contributions
 decisions+0.017
 office+0.016
 wisdom+0.015
 aware+0.015
 discovery+0.015
Neg logit contributions
rant-0.029
mer-0.025
le-0.021
te-0.020
vy-0.020
Token.
Token ID13
Feature activation+7.32e-03

Pos logit contributions
 decisions+0.010
 office+0.009
 aware+0.009
 oak+0.009
 bravery+0.009
Neg logit contributions
rant-0.012
mer-0.010
le-0.008
izz-0.008
vy-0.008
Token She
Token ID1375
Feature activation-5.96e-03

Pos logit contributions
rant+0.015
mer+0.013
le+0.012
sc+0.010
 Will+0.010
Neg logit contributions
 aware-0.011
 decisions-0.011
 bravery-0.010
 oak-0.010
 discovery-0.010
Token can
Token ID460
Feature activation+8.50e-03

Pos logit contributions
 aware+0.009
 office+0.008
 bravery+0.008
 understanding+0.008
 oak+0.008
Neg logit contributions
rant-0.013
mer-0.011
le-0.009
ifying-0.009
sc-0.009
Token proud
Token ID6613
Feature activation-7.48e-03

Pos logit contributions
rant+0.008
mer+0.007
le+0.007
sc+0.006
led+0.006
Neg logit contributions
 aware-0.005
 decisions-0.005
 bravery-0.005
 office-0.005
 oak-0.005
Token.
Token ID13
Feature activation+2.48e-05

Pos logit contributions
 aware+0.014
 decisions+0.014
 understanding+0.014
 bravery+0.014
 office+0.014
Neg logit contributions
rant-0.013
mer-0.010
le-0.009
sc-0.008
led-0.007
Token We
Token ID775
Feature activation+5.30e-04

Pos logit contributions
rant+0.000
mer+0.000
le+0.000
led+0.000
ler+0.000
Neg logit contributions
 decisions-0.000
 aware-0.000
 office-0.000
 understanding-0.000
 bravery-0.000
Token are
Token ID389
Feature activation+4.07e-03

Pos logit contributions
rant+0.001
mer+0.001
le+0.001
sc+0.001
led+0.001
Neg logit contributions
 aware-0.001
 oak-0.001
 bravery-0.001
 property-0.001
 decisions-0.001
Token good
Token ID922
Feature activation-1.09e-02

Pos logit contributions
rant+0.009
mer+0.008
le+0.007
sc+0.007
led+0.007
Neg logit contributions
 aware-0.005
 decisions-0.005
 bravery-0.005
 office-0.005
 understanding-0.005
Token at
Token ID379
Feature activation-3.87e-02

Pos logit contributions
 aware+0.013
 oak+0.012
 decisions+0.012
 bravery+0.012
 property+0.011
Neg logit contributions
rant-0.026
mer-0.023
le-0.021
 Will-0.019
te-0.019
Token counting
Token ID14143
Feature activation-1.34e-02

Pos logit contributions
 aware+0.056
 decisions+0.053
 oak+0.051
 bravery+0.050
 property+0.050
Neg logit contributions
rant-0.083
mer-0.068
le-0.063
 Will-0.057
te-0.057
Token."
Token ID526
Feature activation-5.23e-03

Pos logit contributions
 decisions+0.021
 aware+0.020
 office+0.020
 bravery+0.020
 heights+0.019
Neg logit contributions
rant-0.027
mer-0.022
le-0.020
sc-0.018
led-0.018
Token
Token ID198
Feature activation-5.95e-03

Pos logit contributions
 aware+0.008
 decisions+0.008
 oak+0.008
 bravery+0.008
 office+0.007
Neg logit contributions
rant-0.011
mer-0.009
le-0.008
sc-0.007
led-0.007
Token
Token ID198
Feature activation-9.00e-03

Pos logit contributions
 aware+0.009
 decisions+0.009
 bravery+0.009
 oak+0.009
 office+0.009
Neg logit contributions
rant-0.012
mer-0.010
le-0.009
sc-0.008
te-0.008
TokenThey
Token ID2990
Feature activation+2.28e-04

Pos logit contributions
 aware+0.017
 decisions+0.017
 bravery+0.017
 oak+0.016
 impact+0.016
Neg logit contributions
rant-0.015
mer-0.012
le-0.010
sc-0.009
te-0.009
Token had
Token ID550
Feature activation-7.71e-04

Pos logit contributions
rant+0.010
mer+0.008
sc+0.007
ifying+0.007
bly+0.007
Neg logit contributions
 aware-0.009
 oak-0.009
 office-0.008
 property-0.008
 discovery-0.008
Token a
Token ID257
Feature activation-3.58e-03

Pos logit contributions
 aware+0.001
 office+0.001
 oak+0.001
 decisions+0.001
joy+0.001
Neg logit contributions
rant-0.002
mer-0.002
le-0.001
sc-0.001
te-0.001
Token bald
Token ID28273
Feature activation-4.48e-03

Pos logit contributions
 aware+0.006
 decisions+0.006
shadow+0.006
unch+0.006
 property+0.006
Neg logit contributions
rant-0.006
mer-0.005
le-0.005
 Will-0.005
te-0.004
Token dog
Token ID3290
Feature activation+3.75e-03

Pos logit contributions
 aware+0.006
 decisions+0.006
shadow+0.006
 property+0.006
unch+0.006
Neg logit contributions
rant-0.009
mer-0.009
le-0.008
te-0.007
 Will-0.007
Token called
Token ID1444
Feature activation-3.00e-03

Pos logit contributions
rant+0.007
mer+0.005
le+0.005
sc+0.004
te+0.004
Neg logit contributions
 aware-0.007
 decisions-0.007
 bravery-0.007
 oak-0.006
 office-0.006
Token M
Token ID337
Feature activation+3.42e-02

Pos logit contributions
 aware+0.004
 decisions+0.004
 bravery+0.004
 oak+0.004
 office+0.004
Neg logit contributions
rant-0.007
mer-0.006
le-0.005
sc-0.005
led-0.005
Tokenabel
Token ID9608
Feature activation+2.35e-02

Pos logit contributions
rant+0.078
mer+0.066
sc+0.059
le+0.058
led+0.058
Neg logit contributions
 office-0.044
unch-0.043
 oak-0.042
 discovery-0.040
 aware-0.040
Token.
Token ID13
Feature activation+2.99e-03

Pos logit contributions
rant+0.046
mer+0.034
izz+0.031
sc+0.031
ifying+0.030
Neg logit contributions
 aware-0.036
 oak-0.035
 office-0.035
 bravery-0.034
 understanding-0.034
Token M
Token ID337
Feature activation+1.97e-02

Pos logit contributions
rant+0.006
mer+0.005
le+0.004
led+0.004
sc+0.004
Neg logit contributions
 aware-0.005
 oak-0.005
 decisions-0.005
 office-0.005
 bravery-0.004
Tokenabel
Token ID9608
Feature activation+1.77e-02

Pos logit contributions
rant+0.037
mer+0.030
sc+0.027
le+0.025
bly+0.024
Neg logit contributions
 office-0.034
 discovery-0.032
 decisions-0.032
unch-0.032
 aware-0.032
Token was
Token ID373
Feature activation+2.21e-03

Pos logit contributions
rant+0.034
mer+0.025
ifying+0.024
sc+0.022
bly+0.021
Neg logit contributions
 aware-0.030
 office-0.028
 oak-0.028
 understanding-0.027
 discovery-0.027
Token strong
Token ID1913
Feature activation-1.56e-03

Pos logit contributions
rant+0.009
mer+0.007
le+0.006
sc+0.006
te+0.005
Neg logit contributions
 aware-0.009
 decisions-0.009
 bravery-0.008
 oak-0.008
 office-0.008
Token.
Token ID13
Feature activation+2.95e-03

Pos logit contributions
 aware+0.003
 decisions+0.003
 bravery+0.003
 oak+0.003
 office+0.003
Neg logit contributions
rant-0.003
mer-0.002
le-0.002
ola-0.002
izz-0.002
Token One
Token ID1881
Feature activation-4.62e-03

Pos logit contributions
rant+0.005
mer+0.004
le+0.004
 Will+0.004
sc+0.003
Neg logit contributions
 aware-0.005
 decisions-0.005
 bravery-0.005
 property-0.005
 understanding-0.005
Token day
Token ID1110
Feature activation+1.75e-02

Pos logit contributions
 aware+0.007
 decisions+0.007
 bravery+0.007
 oak+0.007
 property+0.007
Neg logit contributions
rant-0.009
mer-0.008
le-0.007
sc-0.006
te-0.006
Token,
Token ID11
Feature activation+5.89e-03

Pos logit contributions
rant+0.040
mer+0.033
le+0.030
sc+0.027
auc+0.027
Neg logit contributions
 aware-0.023
 climbing-0.022
 bravery-0.021
 decisions-0.021
unch-0.021
Token Frost
Token ID15122
Feature activation+3.11e-02

Pos logit contributions
rant+0.012
mer+0.010
le+0.009
sc+0.008
auc+0.008
Neg logit contributions
 aware-0.009
 bravery-0.009
 oak-0.008
 discovery-0.008
 decisions-0.008
Tokeny
Token ID88
Feature activation-1.77e-03

Pos logit contributions
rant+0.048
sc+0.035
mer+0.032
 widened+0.029
 spec+0.028
Neg logit contributions
 aware-0.058
 property-0.054
ounce-0.052
 office-0.052
ouncing-0.052
Token met
Token ID1138
Feature activation-4.24e-03

Pos logit contributions
 aware+0.003
 decisions+0.003
 oak+0.003
 office+0.003
 bravery+0.003
Neg logit contributions
rant-0.003
mer-0.003
le-0.002
sc-0.002
te-0.002
Token a
Token ID257
Feature activation+1.98e-03

Pos logit contributions
 aware+0.007
 decisions+0.006
 oak+0.006
 bravery+0.006
 office+0.006
Neg logit contributions
rant-0.008
mer-0.007
le-0.006
sc-0.006
te-0.006
Token little
Token ID1310
Feature activation-9.11e-03

Pos logit contributions
rant+0.003
mer+0.003
 Will+0.002
 mer+0.002
le+0.002
Neg logit contributions
 decisions-0.004
 aware-0.003
 property-0.003
 tal-0.003
unch-0.003
Token girl
Token ID2576
Feature activation-1.00e-03

Pos logit contributions
 aware+0.014
 decisions+0.013
 bravery+0.013
unch+0.013
 property+0.013
Neg logit contributions
rant-0.018
mer-0.016
le-0.015
 Will-0.013
sc-0.013
Token chocolate
Token ID11311
Feature activation+2.50e-03

Pos logit contributions
rant+0.004
mer+0.003
le+0.003
ola+0.003
acy+0.003
Neg logit contributions
 decisions-0.002
 office-0.002
 aware-0.002
 oak-0.002
 leash-0.002
Token and
Token ID290
Feature activation-7.19e-04

Pos logit contributions
rant+0.005
mer+0.004
le+0.004
ola+0.003
acy+0.003
Neg logit contributions
 office-0.004
 bike-0.004
 leash-0.004
 oak-0.004
 decisions-0.004
Token cookies
Token ID14746
Feature activation-8.25e-04

Pos logit contributions
 aware+0.001
 decisions+0.001
 oak+0.001
 bravery+0.001
 office+0.001
Neg logit contributions
rant-0.002
mer-0.001
le-0.001
sc-0.001
led-0.001
Token.
Token ID13
Feature activation+2.76e-03

Pos logit contributions
 office+0.001
 aware+0.001
 leash+0.001
 bike+0.001
 property+0.001
Neg logit contributions
rant-0.002
mer-0.001
le-0.001
ka-0.001
izz-0.001
Token How
Token ID1374
Feature activation-7.06e-03

Pos logit contributions
rant+0.006
mer+0.006
le+0.005
led+0.005
sc+0.005
Neg logit contributions
 aware-0.003
 decisions-0.003
 bravery-0.003
 office-0.003
 oak-0.003
Token does
Token ID857
Feature activation+2.95e-02

Pos logit contributions
 aware+0.011
 decisions+0.011
 bravery+0.010
 office+0.010
 oak+0.010
Neg logit contributions
rant-0.015
mer-0.012
le-0.010
sc-0.010
led-0.009
Token that
Token ID326
Feature activation+1.52e-02

Pos logit contributions
rant+0.061
mer+0.050
le+0.048
 Will+0.041
te+0.041
Neg logit contributions
 aware-0.044
 decisions-0.042
 bravery-0.041
 oak-0.040
 discovery-0.040
Token sound
Token ID2128
Feature activation+2.97e-02

Pos logit contributions
rant+0.039
mer+0.033
ifying+0.029
sc+0.029
le+0.029
Neg logit contributions
 aware-0.016
 decisions-0.016
 office-0.016
 understanding-0.015
 decision-0.015
Token?"
Token ID1701
Feature activation-2.19e-03

Pos logit contributions
rant+0.058
mer+0.050
le+0.039
ola+0.039
sc+0.039
Neg logit contributions
 aware-0.045
 decisions-0.045
 office-0.044
 bravery-0.042
 understanding-0.042
Token
Token ID198
Feature activation+3.41e-04

Pos logit contributions
 aware+0.004
 decisions+0.004
 property+0.004
 bravery+0.004
 discovery+0.004
Neg logit contributions
rant-0.003
mer-0.003
 Will-0.002
le-0.002
ola-0.002
Token
Token ID198
Feature activation-1.38e-03

Pos logit contributions
rant+0.001
mer+0.001
le+0.000
 Will+0.000
sc+0.000
Neg logit contributions
 aware-0.001
 decisions-0.001
 bravery-0.001
 property-0.001
 oak-0.001
Token and
Token ID290
Feature activation+1.30e-03

Pos logit contributions
 aware+0.010
 decisions+0.009
 bravery+0.009
 office+0.009
 understanding+0.009
Neg logit contributions
rant-0.011
mer-0.009
le-0.008
sc-0.007
led-0.007
Token caring
Token ID18088
Feature activation+1.15e-03

Pos logit contributions
rant+0.003
mer+0.002
le+0.002
sc+0.002
te+0.002
Neg logit contributions
 aware-0.002
 decisions-0.002
 bravery-0.002
 oak-0.002
 office-0.002
Token,
Token ID11
Feature activation-6.13e-03

Pos logit contributions
rant+0.002
mer+0.002
le+0.002
sc+0.002
te+0.002
Neg logit contributions
 aware-0.002
 decisions-0.002
 bravery-0.002
 oak-0.002
 office-0.002
Token comfort
Token ID4467
Feature activation+8.75e-03

Pos logit contributions
 aware+0.011
 decisions+0.011
 impact+0.011
unch+0.011
 tal+0.011
Neg logit contributions
rant-0.010
mer-0.007
le-0.007
 Will-0.006
 lit-0.006
Tokened
Token ID276
Feature activation+3.57e-03

Pos logit contributions
rant+0.015
mer+0.012
le+0.010
sc+0.009
ifying+0.009
Neg logit contributions
 aware-0.016
 understanding-0.015
 office-0.015
 decisions-0.015
 oak-0.015
Token Sus
Token ID8932
Feature activation+2.99e-02

Pos logit contributions
rant+0.008
mer+0.006
le+0.006
sc+0.005
ifying+0.005
Neg logit contributions
 aware-0.005
 understanding-0.005
 decisions-0.005
 office-0.005
 oak-0.005
Tokenie
Token ID494
Feature activation-7.49e-03

Pos logit contributions
rant+0.056
mer+0.051
sc+0.045
ifying+0.043
 Will+0.041
Neg logit contributions
 aware-0.047
 office-0.043
 oak-0.043
 discovery-0.042
 decisions-0.042
Token and
Token ID290
Feature activation-4.36e-03

Pos logit contributions
 aware+0.016
unch+0.015
 decisions+0.015
 oak+0.015
 property+0.015
Neg logit contributions
rant-0.011
mer-0.008
le-0.008
te-0.006
led-0.006
Token said
Token ID531
Feature activation-4.68e-03

Pos logit contributions
 aware+0.007
 decisions+0.007
unch+0.007
 oak+0.007
 property+0.007
Neg logit contributions
rant-0.008
mer-0.006
le-0.006
 Will-0.005
led-0.005
Token "
Token ID366
Feature activation+1.45e-03

Pos logit contributions
 aware+0.007
 decisions+0.007
 bravery+0.007
 oak+0.007
 understanding+0.007
Neg logit contributions
rant-0.009
mer-0.008
le-0.007
sc-0.006
led-0.006
TokenIt
Token ID1026
Feature activation+5.26e-04

Pos logit contributions
rant+0.003
mer+0.002
le+0.002
sc+0.002
te+0.002
Neg logit contributions
 aware-0.002
 decisions-0.002
 oak-0.002
 understanding-0.002
 bravery-0.002
Token buttons
Token ID12163
Feature activation-5.13e-03

Pos logit contributions
 aware+0.003
 decisions+0.003
 bravery+0.003
 oak+0.003
 office+0.003
Neg logit contributions
rant-0.005
mer-0.004
le-0.004
sc-0.003
te-0.003
Token for
Token ID329
Feature activation+5.97e-03

Pos logit contributions
 bravery+0.007
 aware+0.007
 bike+0.007
 decisions+0.007
 office+0.007
Neg logit contributions
rant-0.011
mer-0.009
le-0.008
izz-0.008
sc-0.008
Token eyes
Token ID2951
Feature activation+2.48e-03

Pos logit contributions
rant+0.012
mer+0.009
le+0.009
 Will+0.008
te+0.008
Neg logit contributions
 aware-0.010
 decisions-0.009
 oak-0.009
 office-0.009
 bravery-0.009
Token and
Token ID290
Feature activation-6.21e-03

Pos logit contributions
rant+0.005
mer+0.005
ola+0.004
izz+0.004
le+0.004
Neg logit contributions
 aware-0.003
 bike-0.003
 bravery-0.003
 office-0.003
 oak-0.003
Token a
Token ID257
Feature activation-8.20e-03

Pos logit contributions
 aware+0.009
 decisions+0.009
 oak+0.008
unch+0.008
 office+0.008
Neg logit contributions
rant-0.013
mer-0.010
le-0.010
 Will-0.009
sc-0.009
Token smile
Token ID8212
Feature activation+2.80e-02

Pos logit contributions
 aware+0.012
 decisions+0.011
 property+0.011
unch+0.011
 bravery+0.010
Neg logit contributions
rant-0.017
mer-0.014
le-0.013
sc-0.012
 Will-0.012
Token.
Token ID13
Feature activation+8.28e-03

Pos logit contributions
rant+0.053
mer+0.042
le+0.036
sc+0.035
ola+0.034
Neg logit contributions
 aware-0.046
 office-0.045
 decisions-0.044
 oak-0.044
 bike-0.044
Token
Token ID198
Feature activation+1.24e-03

Pos logit contributions
rant+0.021
mer+0.018
le+0.017
led+0.015
te+0.015
Neg logit contributions
joy-0.008
unch-0.008
 aware-0.008
 oak-0.008
 decisions-0.008
Token
Token ID198
Feature activation-8.18e-05

Pos logit contributions
rant+0.004
mer+0.003
le+0.003
led+0.003
ifying+0.003
Neg logit contributions
unch-0.001
 aware-0.001
 understanding-0.001
 discovery-0.001
 office-0.001
Token"
Token ID1
Feature activation+7.51e-03

Pos logit contributions
 aware+0.000
 understanding+0.000
shadow+0.000
unch+0.000
 leash+0.000
Neg logit contributions
rant-0.000
mer-0.000
le-0.000
led-0.000
te-0.000
TokenHello
Token ID15496
Feature activation+1.73e-02

Pos logit contributions
rant+0.017
mer+0.015
le+0.013
led+0.013
ola+0.011
Neg logit contributions
 understanding-0.009
 oak-0.009
 aware-0.009
 bike-0.008
unch-0.008
Token as
Token ID355
Feature activation-2.99e-03

Pos logit contributions
rant+0.010
mer+0.008
le+0.008
sc+0.007
te+0.006
Neg logit contributions
 aware-0.008
 decisions-0.008
 bravery-0.007
 oak-0.007
 office-0.007
Token the
Token ID262
Feature activation-9.22e-04

Pos logit contributions
 aware+0.004
 bravery+0.004
 decisions+0.004
 understanding+0.004
 office+0.004
Neg logit contributions
rant-0.007
mer-0.006
le-0.005
sc-0.005
led-0.004
Token bath
Token ID7837
Feature activation-6.28e-03

Pos logit contributions
 aware+0.001
 decisions+0.001
 office+0.001
shadow+0.001
unch+0.001
Neg logit contributions
rant-0.002
mer-0.002
le-0.001
sc-0.001
 Will-0.001
Tokentub
Token ID37995
Feature activation+4.88e-03

Pos logit contributions
 decisions+0.011
 office+0.011
 bravery+0.011
 oak+0.011
 aware+0.011
Neg logit contributions
rant-0.011
mer-0.008
le-0.008
vy-0.007
 Will-0.006
Token.
Token ID13
Feature activation+1.08e-04

Pos logit contributions
rant+0.009
mer+0.007
le+0.006
te+0.005
 Will+0.005
Neg logit contributions
 aware-0.008
 decisions-0.008
 bravery-0.008
 oak-0.008
 office-0.008
Token It
Token ID632
Feature activation+1.42e-02

Pos logit contributions
rant+0.000
mer+0.000
le+0.000
sc+0.000
 Will+0.000
Neg logit contributions
 aware-0.000
 decisions-0.000
 oak-0.000
 bravery-0.000
 discovery-0.000
Token fills
Token ID23816
Feature activation+1.18e-02

Pos logit contributions
rant+0.029
mer+0.024
le+0.021
izz+0.019
te+0.019
Neg logit contributions
 aware-0.021
 decisions-0.020
 bravery-0.020
 oak-0.020
 discovery-0.019
Token up
Token ID510
Feature activation+2.80e-03

Pos logit contributions
rant+0.026
mer+0.022
le+0.019
auc+0.018
ifying+0.018
Neg logit contributions
 discovery-0.015
 decisions-0.014
 aware-0.014
 office-0.013
 bravery-0.013
Token the
Token ID262
Feature activation-7.74e-03

Pos logit contributions
rant+0.006
mer+0.005
le+0.004
sc+0.004
led+0.004
Neg logit contributions
 aware-0.004
 decisions-0.004
 oak-0.004
 bravery-0.004
 office-0.004
Token whole
Token ID2187
Feature activation-8.05e-03

Pos logit contributions
 decisions+0.011
 aware+0.011
 office+0.011
 oak+0.010
 property+0.010
Neg logit contributions
rant-0.017
mer-0.014
le-0.013
led-0.011
te-0.011
Token bath
Token ID7837
Feature activation-1.18e-02

Pos logit contributions
 decisions+0.010
 aware+0.010
 oak+0.010
 office+0.010
 discovery+0.010
Neg logit contributions
rant-0.019
mer-0.015
le-0.015
led-0.013
te-0.013
Token he
Token ID339
Feature activation+1.31e-02

Pos logit contributions
 aware+0.011
 decisions+0.010
 bravery+0.010
 oak+0.010
 office+0.010
Neg logit contributions
rant-0.013
mer-0.011
le-0.010
sc-0.009
te-0.008
Token had
Token ID550
Feature activation+4.24e-04

Pos logit contributions
rant+0.021
mer+0.016
le+0.015
sc+0.012
te+0.012
Neg logit contributions
 aware-0.026
 decisions-0.025
 bravery-0.025
 oak-0.025
 office-0.024
Token still
Token ID991
Feature activation-1.30e-02

Pos logit contributions
rant+0.001
mer+0.001
le+0.001
te+0.001
 Will+0.000
Neg logit contributions
 aware-0.001
 decisions-0.001
 oak-0.001
unch-0.001
 office-0.001
Token found
Token ID1043
Feature activation-1.01e-02

Pos logit contributions
 aware+0.022
 decisions+0.021
unch+0.021
 oak+0.021
 office+0.021
Neg logit contributions
rant-0.023
mer-0.019
le-0.017
te-0.015
led-0.015
Token a
Token ID257
Feature activation+6.60e-04

Pos logit contributions
 aware+0.015
 decisions+0.014
 oak+0.014
 bravery+0.014
unch+0.014
Neg logit contributions
rant-0.021
mer-0.017
le-0.016
sc-0.014
led-0.014
Token hidden
Token ID7104
Feature activation+5.97e-03

Pos logit contributions
rant+0.001
mer+0.001
le+0.001
 mer+0.001
sc+0.001
Neg logit contributions
unch-0.001
 aware-0.001
 decisions-0.001
ouncing-0.001
isdom-0.001
Token secret
Token ID3200
Feature activation+1.70e-03

Pos logit contributions
rant+0.013
mer+0.011
le+0.010
sc+0.009
te+0.009
Neg logit contributions
 aware-0.009
unch-0.008
 decisions-0.008
shadow-0.008
 bravery-0.008
Token.
Token ID13
Feature activation-4.49e-03

Pos logit contributions
rant+0.003
mer+0.003
le+0.002
sc+0.002
te+0.002
Neg logit contributions
 aware-0.003
 decisions-0.003
 bravery-0.003
 oak-0.003
unch-0.003
Token<|endoftext|>
Token ID50256
Feature activation-3.28e-03

Pos logit contributions
 aware+0.008
 decisions+0.007
 bravery+0.007
 oak+0.007
 property+0.007
Neg logit contributions
rant-0.008
mer-0.007
 Will-0.006
le-0.006
sc-0.005
TokenOnce
Token ID7454
Feature activation+2.00e-04

Pos logit contributions
joy+0.004
 office+0.004
 aware+0.003
 decisions+0.003
 oak+0.003
Neg logit contributions
rant-0.008
mer-0.007
le-0.006
led-0.005
bly-0.005
Token upon
Token ID2402
Feature activation+6.81e-03

Pos logit contributions
rant+0.000
mer+0.000
le+0.000
sc+0.000
te+0.000
Neg logit contributions
 aware-0.000
 decisions-0.000
 oak-0.000
 bravery-0.000
 discovery-0.000
Token.
Token ID13
Feature activation-1.72e-05

Pos logit contributions
 aware+0.006
 decisions+0.005
 office+0.005
 bravery+0.005
 understanding+0.005
Neg logit contributions
rant-0.006
mer-0.005
le-0.004
ola-0.004
sc-0.004
Token Let
Token ID3914
Feature activation+4.91e-03

Pos logit contributions
 aware+0.000
 decisions+0.000
 oak+0.000
 bravery+0.000
 office+0.000
Neg logit contributions
rant-0.000
mer-0.000
le-0.000
led-0.000
sc-0.000
Token me
Token ID502
Feature activation+6.76e-03

Pos logit contributions
rant+0.011
mer+0.009
le+0.008
ifying+0.007
te+0.007
Neg logit contributions
 decisions-0.007
 aware-0.007
 oak-0.007
 office-0.007
 bravery-0.007
Token take
Token ID1011
Feature activation-8.29e-03

Pos logit contributions
rant+0.016
mer+0.014
le+0.012
sc+0.011
led+0.011
Neg logit contributions
 aware-0.009
 bravery-0.008
 office-0.008
 decisions-0.008
 understanding-0.008
Token a
Token ID257
Feature activation+2.68e-03

Pos logit contributions
 aware+0.013
 decisions+0.012
 bravery+0.012
 oak+0.012
 understanding+0.012
Neg logit contributions
rant-0.017
mer-0.015
le-0.013
sc-0.012
te-0.011
Token look
Token ID804
Feature activation+6.65e-03

Pos logit contributions
rant+0.006
mer+0.005
le+0.004
sc+0.004
 Will+0.004
Neg logit contributions
 aware-0.004
 decisions-0.004
 bravery-0.004
 oak-0.004
 office-0.004
Token and
Token ID290
Feature activation+8.20e-03

Pos logit contributions
rant+0.013
mer+0.011
ola+0.010
acy+0.010
le+0.010
Neg logit contributions
 aware-0.010
 decisions-0.009
 understanding-0.009
 bravery-0.009
 office-0.009
Token make
Token ID787
Feature activation+8.55e-03

Pos logit contributions
rant+0.016
mer+0.013
le+0.012
 Will+0.011
sc+0.011
Neg logit contributions
 aware-0.013
 decisions-0.012
unch-0.012
joy-0.012
 office-0.012
Token it
Token ID340
Feature activation+8.11e-03

Pos logit contributions
rant+0.015
le+0.012
mer+0.011
 Will+0.010
led+0.010
Neg logit contributions
 aware-0.015
 oak-0.014
joy-0.014
 property-0.014
 impact-0.014
Token all
Token ID477
Feature activation+1.77e-02

Pos logit contributions
rant+0.016
mer+0.013
le+0.012
sc+0.010
te+0.010
Neg logit contributions
 aware-0.013
 decisions-0.013
 bravery-0.012
 oak-0.012
 office-0.012
Token better
Token ID1365
Feature activation+2.12e-03

Pos logit contributions
rant+0.035
mer+0.030
le+0.026
te+0.023
ifying+0.023
Neg logit contributions
 aware-0.029
 decisions-0.028
 bravery-0.027
 understanding-0.027
 impact-0.027
Token and
Token ID290
Feature activation-4.52e-03

Pos logit contributions
rant+0.027
mer+0.022
le+0.020
sc+0.018
ifying+0.017
Neg logit contributions
 aware-0.025
 decisions-0.023
 bravery-0.023
 office-0.023
 oak-0.023
Token Sam
Token ID3409
Feature activation-7.78e-03

Pos logit contributions
 aware+0.006
 oak+0.006
 bravery+0.006
 decisions+0.006
 understanding+0.006
Neg logit contributions
rant-0.010
mer-0.009
le-0.008
sc-0.007
led-0.007
Token,
Token ID11
Feature activation-3.41e-03

Pos logit contributions
 aware+0.015
 decisions+0.015
 bravery+0.015
 oak+0.014
 office+0.014
Neg logit contributions
rant-0.013
mer-0.010
le-0.009
sc-0.007
te-0.007
Token were
Token ID547
Feature activation-1.07e-02

Pos logit contributions
 aware+0.008
 decisions+0.007
 bravery+0.007
 oak+0.007
 discovery+0.007
Neg logit contributions
rant-0.005
mer-0.003
le-0.003
sc-0.002
 Will-0.002
Token walking
Token ID6155
Feature activation+3.98e-04

Pos logit contributions
 aware+0.012
 decisions+0.011
 bravery+0.011
 oak+0.011
 property+0.010
Neg logit contributions
rant-0.027
mer-0.023
le-0.021
sc-0.020
 Will-0.019
Token in
Token ID287
Feature activation+2.11e-02

Pos logit contributions
rant+0.001
mer+0.001
le+0.000
sc+0.000
led+0.000
Neg logit contributions
 aware-0.001
 decisions-0.001
 oak-0.001
 bravery-0.001
 office-0.001
Token the
Token ID262
Feature activation+5.52e-03

Pos logit contributions
rant+0.028
mer+0.021
le+0.017
 Will+0.017
 them+0.015
Neg logit contributions
 aware-0.046
 decisions-0.045
unch-0.045
 bravery-0.043
 property-0.043
Token city
Token ID1748
Feature activation-2.98e-03

Pos logit contributions
rant+0.014
mer+0.012
le+0.011
sc+0.010
 Will+0.010
Neg logit contributions
 aware-0.006
 decisions-0.006
 bravery-0.006
unch-0.005
 understanding-0.005
Token.
Token ID13
Feature activation+1.05e-02

Pos logit contributions
 aware+0.006
 decisions+0.006
 bravery+0.006
 oak+0.006
 impact+0.006
Neg logit contributions
rant-0.004
mer-0.003
le-0.003
sc-0.002
te-0.002
Token Sam
Token ID3409
Feature activation-5.01e-04

Pos logit contributions
rant+0.021
mer+0.017
le+0.016
sc+0.014
te+0.014
Neg logit contributions
 aware-0.017
 decisions-0.016
 bravery-0.016
 oak-0.016
 office-0.015
Token pointed
Token ID6235
Feature activation+3.35e-03

Pos logit contributions
 aware+0.001
 oak+0.001
 decisions+0.001
 bravery+0.001
 office+0.001
Neg logit contributions
rant-0.001
mer-0.001
le-0.001
sc-0.001
te-0.001
Token Tim
Token ID5045
Feature activation-4.82e-03

Pos logit contributions
 aware+0.012
 decisions+0.012
 bravery+0.012
 oak+0.012
 office+0.011
Neg logit contributions
rant-0.018
mer-0.015
le-0.013
sc-0.012
te-0.012
Tokenmy
Token ID1820
Feature activation-2.15e-03

Pos logit contributions
 aware+0.006
 oak+0.005
 office+0.005
 property+0.005
 decisions+0.005
Neg logit contributions
rant-0.012
mer-0.010
le-0.009
sc-0.009
ifying-0.008
Token loved
Token ID6151
Feature activation-1.75e-03

Pos logit contributions
 aware+0.003
 office+0.003
 oak+0.003
 property+0.003
 understanding+0.003
Neg logit contributions
rant-0.005
mer-0.004
le-0.003
sc-0.003
ifying-0.003
Token to
Token ID284
Feature activation-1.14e-02

Pos logit contributions
 aware+0.003
 decisions+0.003
 bravery+0.003
 impact+0.003
 decision+0.003
Neg logit contributions
rant-0.003
mer-0.002
le-0.002
 Will-0.002
sc-0.002
Token play
Token ID711
Feature activation+1.34e-03

Pos logit contributions
 aware+0.017
 decisions+0.017
 bravery+0.016
 oak+0.016
 office+0.016
Neg logit contributions
rant-0.024
mer-0.019
le-0.018
sc-0.016
te-0.016
Token outside
Token ID2354
Feature activation+7.79e-03

Pos logit contributions
rant+0.003
mer+0.002
le+0.002
ifying+0.002
sc+0.002
Neg logit contributions
 aware-0.002
 office-0.002
 oak-0.002
 discovery-0.002
 bravery-0.002
Token,
Token ID11
Feature activation+2.48e-03

Pos logit contributions
rant+0.017
mer+0.013
le+0.012
izz+0.011
ola+0.011
Neg logit contributions
 climbing-0.013
 office-0.012
 oak-0.012
 swinging-0.012
 aware-0.012
Token especially
Token ID2592
Feature activation-5.55e-03

Pos logit contributions
rant+0.005
mer+0.004
le+0.004
sc+0.003
 Will+0.003
Neg logit contributions
 decisions-0.004
 aware-0.004
 bravery-0.004
 office-0.004
 property-0.004
Token when
Token ID618
Feature activation-4.26e-03

Pos logit contributions
 aware+0.010
 decisions+0.010
 bravery+0.010
 oak+0.010
 office+0.010
Neg logit contributions
rant-0.009
mer-0.007
le-0.007
sc-0.006
led-0.006
Token it
Token ID340
Feature activation+1.83e-02

Pos logit contributions
 aware+0.007
 decisions+0.007
 bravery+0.006
 oak+0.006
 office+0.006
Neg logit contributions
rant-0.009
mer-0.007
le-0.006
sc-0.006
te-0.005
Token r
Token ID374
Feature activation+1.72e-02

Pos logit contributions
rant+0.037
mer+0.029
ifying+0.027
le+0.026
te+0.024
Neg logit contributions
 aware-0.028
 climbing-0.028
 office-0.027
 oak-0.027
 branch-0.026
Token,
Token ID11
Feature activation+3.03e-03

Pos logit contributions
rant+0.001
mer+0.001
le+0.001
sc+0.001
led+0.001
Neg logit contributions
 aware-0.001
 decisions-0.001
 office-0.001
 bravery-0.001
 discovery-0.001
Token his
Token ID465
Feature activation-9.67e-04

Pos logit contributions
rant+0.005
mer+0.004
 Will+0.003
le+0.003
te+0.003
Neg logit contributions
 aware-0.006
 decisions-0.006
 property-0.006
unch-0.006
 impact-0.006
Token room
Token ID2119
Feature activation+1.58e-03

Pos logit contributions
 aware+0.001
 decisions+0.001
 bravery+0.001
 office+0.001
 discovery+0.001
Neg logit contributions
rant-0.002
mer-0.002
le-0.002
sc-0.002
led-0.002
Token looked
Token ID3114
Feature activation-9.14e-04

Pos logit contributions
rant+0.002
mer+0.002
le+0.002
sc+0.001
ifying+0.001
Neg logit contributions
 aware-0.003
 office-0.003
 bravery-0.003
 decisions-0.003
 discovery-0.003
Token so
Token ID523
Feature activation+1.36e-04

Pos logit contributions
 aware+0.001
 understanding+0.001
 bravery+0.001
 decisions+0.001
 discovery+0.001
Neg logit contributions
rant-0.002
mer-0.002
le-0.002
ola-0.002
acy-0.002
Token neat
Token ID15049
Feature activation+1.39e-03

Pos logit contributions
rant+0.000
mer+0.000
le+0.000
ifying+0.000
sc+0.000
Neg logit contributions
 aware-0.000
 understanding-0.000
 bravery-0.000
 decisions-0.000
 determination-0.000
Token and
Token ID290
Feature activation+1.01e-02

Pos logit contributions
rant+0.003
mer+0.003
le+0.002
izz+0.002
ifying+0.002
Neg logit contributions
 office-0.002
 aware-0.002
 bravery-0.002
 discovery-0.002
joy-0.002
Token ordinary
Token ID8850
Feature activation-1.15e-02

Pos logit contributions
rant+0.022
mer+0.018
le+0.017
te+0.015
ifying+0.015
Neg logit contributions
 aware-0.015
 understanding-0.014
 bravery-0.014
 discovery-0.014
 decisions-0.014
Token.
Token ID13
Feature activation-3.86e-03

Pos logit contributions
 aware+0.021
 decisions+0.021
 bravery+0.020
 discovery+0.020
 office+0.020
Neg logit contributions
rant-0.021
mer-0.016
le-0.015
sc-0.013
vy-0.012
Token Tim
Token ID5045
Feature activation-1.00e-02

Pos logit contributions
 aware+0.007
 decisions+0.007
 bravery+0.007
 property+0.007
 discovery+0.006
Neg logit contributions
rant-0.007
mer-0.005
 Will-0.005
le-0.004
sc-0.004
Tokenmy
Token ID1820
Feature activation-1.20e-02

Pos logit contributions
 aware+0.014
 decisions+0.013
 oak+0.013
 bravery+0.013
 office+0.013
Neg logit contributions
rant-0.022
mer-0.019
le-0.017
sc-0.016
te-0.015
Token more
Token ID517
Feature activation+2.24e-04

Pos logit contributions
 aware+0.004
 decisions+0.003
 bravery+0.003
 office+0.003
 oak+0.003
Neg logit contributions
rant-0.007
mer-0.006
le-0.005
sc-0.005
te-0.005
Token comfortable
Token ID6792
Feature activation+4.35e-04

Pos logit contributions
rant+0.001
mer+0.000
le+0.000
sc+0.000
te+0.000
Neg logit contributions
 aware-0.000
 decisions-0.000
 bravery-0.000
 understanding-0.000
 oak-0.000
Token.
Token ID13
Feature activation-5.32e-04

Pos logit contributions
rant+0.001
mer+0.001
le+0.001
sc+0.000
izz+0.000
Neg logit contributions
 aware-0.001
 decisions-0.001
 bravery-0.001
 office-0.001
 property-0.001
Token He
Token ID679
Feature activation-2.10e-03

Pos logit contributions
 property+0.001
 decisions+0.001
 aware+0.001
 bravery+0.001
 impact+0.001
Neg logit contributions
rant-0.001
 Will-0.001
mer-0.001
sc-0.001
 could-0.001
Token remembered
Token ID12086
Feature activation-1.83e-03

Pos logit contributions
 aware+0.004
 oak+0.003
 bravery+0.003
 office+0.003
 discovery+0.003
Neg logit contributions
rant-0.004
mer-0.003
le-0.003
ifying-0.003
sc-0.003
Token his
Token ID465
Feature activation+1.53e-04

Pos logit contributions
 decisions+0.003
 impact+0.003
unch+0.003
 aware+0.003
 decision+0.003
Neg logit contributions
rant-0.003
le-0.002
mer-0.002
 Will-0.002
 mer-0.002
Token friend
Token ID1545
Feature activation+4.53e-03

Pos logit contributions
rant+0.000
mer+0.000
le+0.000
 Will+0.000
te+0.000
Neg logit contributions
 aware-0.000
shadow-0.000
unch-0.000
 decisions-0.000
onest-0.000
Token the
Token ID262
Feature activation-6.99e-03

Pos logit contributions
rant+0.009
mer+0.007
le+0.006
sc+0.006
 Will+0.006
Neg logit contributions
 aware-0.008
 decisions-0.007
 bravery-0.007
 oak-0.007
 discovery-0.007
Token fox
Token ID21831
Feature activation-6.04e-03

Pos logit contributions
shadow+0.011
unch+0.011
 aware+0.010
ouncing+0.010
isdom+0.010
Neg logit contributions
rant-0.012
mer-0.011
le-0.010
 Will-0.010
 mer-0.009
Token had
Token ID550
Feature activation+4.95e-03

Pos logit contributions
 decisions+0.010
 aware+0.010
 oak+0.010
 impact+0.010
 property+0.010
Neg logit contributions
rant-0.011
mer-0.009
le-0.008
 Will-0.007
sc-0.007
Token a
Token ID257
Feature activation-1.34e-03

Pos logit contributions
rant+0.010
mer+0.008
le+0.007
sc+0.007
te+0.006
Neg logit contributions
 aware-0.008
 decisions-0.008
 bravery-0.007
 oak-0.007
 office-0.007
Token."
Token ID526
Feature activation-9.79e-03

Pos logit contributions
 aware+0.013
 office+0.013
 decisions+0.012
 oak+0.012
 property+0.012
Neg logit contributions
rant-0.016
mer-0.013
le-0.012
izz-0.011
vy-0.010
Token
Token ID220
Feature activation+5.77e-03

Pos logit contributions
 decisions+0.018
 property+0.018
 aware+0.018
 bravery+0.017
 impact+0.017
Neg logit contributions
rant-0.015
 Will-0.012
mer-0.012
sc-0.012
 could-0.012
Token
Token ID198
Feature activation-3.65e-03

Pos logit contributions
rant+0.009
mer+0.007
le+0.006
 Will+0.006
sc+0.006
Neg logit contributions
 aware-0.011
 bravery-0.011
 property-0.011
 decisions-0.011
 oak-0.011
Token
Token ID198
Feature activation-1.02e-02

Pos logit contributions
 aware+0.006
 decisions+0.006
 bravery+0.006
 property+0.006
 oak+0.006
Neg logit contributions
rant-0.007
mer-0.006
le-0.005
sc-0.005
 Will-0.004
TokenL
Token ID43
Feature activation+1.68e-02

Pos logit contributions
 decisions+0.021
 bravery+0.020
joy+0.020
 impact+0.020
 office+0.020
Neg logit contributions
rant-0.014
mer-0.013
sc-0.011
le-0.011
te-0.010
Tokenily
Token ID813
Feature activation-1.18e-02

Pos logit contributions
rant+0.029
mer+0.023
le+0.019
ifying+0.018
sc+0.017
Neg logit contributions
 aware-0.032
 decisions-0.030
 oak-0.030
 discovery-0.029
 bravery-0.029
Token said
Token ID531
Feature activation-5.95e-03

Pos logit contributions
 decisions+0.024
 aware+0.024
 impact+0.023
 bravery+0.023
 oak+0.023
Neg logit contributions
rant-0.017
mer-0.014
le-0.013
led-0.010
sc-0.010
Token,
Token ID11
Feature activation+1.18e-04

Pos logit contributions
 aware+0.011
 decisions+0.011
 property+0.011
 office+0.011
 oak+0.011
Neg logit contributions
rant-0.010
mer-0.008
le-0.007
te-0.006
 Will-0.006
Token "
Token ID366
Feature activation-1.56e-03

Pos logit contributions
rant+0.000
mer+0.000
le+0.000
 Will+0.000
te+0.000
Neg logit contributions
 decisions-0.000
 aware-0.000
 property-0.000
 impact-0.000
 discovery-0.000
TokenDon
Token ID3987
Feature activation+1.67e-03

Pos logit contributions
 decisions+0.003
 aware+0.003
 office+0.003
 bravery+0.003
 discovery+0.003
Neg logit contributions
rant-0.003
mer-0.002
le-0.002
sc-0.002
 Will-0.002
Token't
Token ID470
Feature activation+1.05e-02

Pos logit contributions
rant+0.003
mer+0.003
le+0.002
led+0.002
sc+0.002
Neg logit contributions
 aware-0.003
 decisions-0.003
 oak-0.003
 bravery-0.003
 office-0.003
Token Everyone
Token ID11075
Feature activation+2.43e-03

Pos logit contributions
rant+0.002
mer+0.001
le+0.001
sc+0.001
 Will+0.001
Neg logit contributions
 aware-0.001
 decisions-0.001
 bravery-0.001
 discovery-0.001
 oak-0.001
Token cl
Token ID537
Feature activation+1.07e-03

Pos logit contributions
rant+0.004
mer+0.003
le+0.003
ifying+0.003
sc+0.002
Neg logit contributions
 aware-0.004
 understanding-0.004
 bravery-0.004
 office-0.004
 decisions-0.004
Tokenapped
Token ID6320
Feature activation+6.16e-03

Pos logit contributions
rant+0.003
mer+0.002
le+0.002
sc+0.002
vy+0.002
Neg logit contributions
 aware-0.001
 decisions-0.001
unch-0.001
joy-0.001
 understanding-0.001
Token and
Token ID290
Feature activation+5.18e-03

Pos logit contributions
rant+0.012
mer+0.010
le+0.009
sc+0.008
izz+0.008
Neg logit contributions
 aware-0.010
 understanding-0.010
 decisions-0.010
 bravery-0.010
 discovery-0.009
Token laughed
Token ID13818
Feature activation+9.11e-04

Pos logit contributions
rant+0.009
mer+0.007
le+0.006
sc+0.005
ifying+0.005
Neg logit contributions
 aware-0.010
 bravery-0.010
 decisions-0.010
 understanding-0.010
 oak-0.010
Token.
Token ID13
Feature activation-2.24e-03

Pos logit contributions
rant+0.002
mer+0.002
le+0.001
izz+0.001
sc+0.001
Neg logit contributions
 aware-0.001
 understanding-0.001
 bravery-0.001
 decisions-0.001
joy-0.001
Token Tim
Token ID5045
Feature activation-1.18e-02

Pos logit contributions
 aware+0.004
 decisions+0.004
 bravery+0.004
 oak+0.004
 office+0.004
Neg logit contributions
rant-0.004
mer-0.003
le-0.003
sc-0.003
te-0.003
Token's
Token ID338
Feature activation-1.29e-03

Pos logit contributions
 aware+0.021
 bravery+0.020
 office+0.020
 decisions+0.020
 oak+0.020
Neg logit contributions
rant-0.022
mer-0.017
le-0.015
ifying-0.014
sc-0.013
Token mom
Token ID1995
Feature activation-2.31e-03

Pos logit contributions
 aware+0.002
 oak+0.002
shadow+0.002
 decisions+0.002
 property+0.002
Neg logit contributions
rant-0.003
mer-0.002
le-0.002
 Will-0.002
te-0.002
Token mentioned
Token ID4750
Feature activation-1.25e-02

Pos logit contributions
 aware+0.004
 understanding+0.004
 office+0.004
 bravery+0.004
 discovery+0.004
Neg logit contributions
rant-0.004
mer-0.004
le-0.003
ifying-0.003
sc-0.003
Token that
Token ID326
Feature activation-9.60e-03

Pos logit contributions
 aware+0.017
 decisions+0.017
 bravery+0.017
 understanding+0.016
 oak+0.016
Neg logit contributions
rant-0.028
mer-0.023
le-0.021
sc-0.019
led-0.019
Token box
Token ID3091
Feature activation-1.73e-04

Pos logit contributions
 aware+0.022
 decisions+0.021
 bravery+0.021
 oak+0.021
 office+0.021
Neg logit contributions
rant-0.020
mer-0.015
le-0.014
sc-0.012
te-0.011
Token wasn
Token ID2492
Feature activation+6.58e-03

Pos logit contributions
 bravery+0.000
 discovery+0.000
 oak+0.000
 aware+0.000
 office+0.000
Neg logit contributions
rant-0.000
mer-0.000
le-0.000
 Will-0.000
sc-0.000
Token't
Token ID470
Feature activation+6.42e-03

Pos logit contributions
rant+0.012
mer+0.010
le+0.009
led+0.008
sc+0.008
Neg logit contributions
 aware-0.011
 bravery-0.011
 oak-0.011
 climbing-0.011
 decisions-0.011
Token really
Token ID1107
Feature activation+5.98e-03

Pos logit contributions
rant+0.013
mer+0.011
le+0.010
sc+0.009
te+0.009
Neg logit contributions
 aware-0.010
 decisions-0.009
 bravery-0.009
 oak-0.009
 office-0.009
Token empty
Token ID6565
Feature activation-5.85e-03

Pos logit contributions
rant+0.013
mer+0.011
le+0.010
sc+0.009
te+0.009
Neg logit contributions
 aware-0.009
 decisions-0.008
 oak-0.008
 bravery-0.008
 office-0.008
Token -
Token ID532
Feature activation-1.62e-04

Pos logit contributions
 decisions+0.009
 aware+0.009
 bravery+0.009
 discovery+0.009
 office+0.009
Neg logit contributions
rant-0.012
mer-0.009
le-0.009
izz-0.008
sc-0.008
Token it
Token ID340
Feature activation+1.72e-03

Pos logit contributions
 aware+0.000
 decisions+0.000
 oak+0.000
 office+0.000
unch+0.000
Neg logit contributions
rant-0.000
mer-0.000
le-0.000
sc-0.000
 Will-0.000
Token was
Token ID373
Feature activation+5.74e-03

Pos logit contributions
rant+0.003
mer+0.002
le+0.002
te+0.002
ifying+0.002
Neg logit contributions
 aware-0.003
 bravery-0.003
 discovery-0.003
 decisions-0.003
 oak-0.003
Token a
Token ID257
Feature activation-4.80e-03

Pos logit contributions
rant+0.012
mer+0.011
le+0.010
sc+0.009
led+0.008
Neg logit contributions
 aware-0.008
 bravery-0.008
 decisions-0.008
 understanding-0.008
 discovery-0.008
Token home
Token ID1363
Feature activation-1.28e-03

Pos logit contributions
 aware+0.007
 decisions+0.007
 bravery+0.006
 oak+0.006
unch+0.006
Neg logit contributions
rant-0.010
mer-0.009
le-0.008
sc-0.007
te-0.007
Token for
Token ID329
Feature activation-3.48e-03

Pos logit contributions
 aware+0.002
 decisions+0.002
 office+0.002
 bravery+0.002
 oak+0.002
Neg logit contributions
rant-0.002
mer-0.002
le-0.002
sc-0.002
led-0.002
Token
Token ID198
Feature activation+3.49e-03

Pos logit contributions
rant+0.005
mer+0.004
le+0.003
sc+0.003
 Will+0.003
Neg logit contributions
 aware-0.003
 decisions-0.003
 bravery-0.003
 oak-0.003
 office-0.003
Token
Token ID198
Feature activation-7.04e-05

Pos logit contributions
rant+0.007
mer+0.006
le+0.006
sc+0.005
led+0.005
Neg logit contributions
 aware-0.005
 decisions-0.005
 bravery-0.005
 oak-0.005
 office-0.005
TokenS
Token ID50
Feature activation-3.18e-03

Pos logit contributions
 aware+0.000
 decisions+0.000
 oak+0.000
 bravery+0.000
 understanding+0.000
Neg logit contributions
rant-0.000
mer-0.000
le-0.000
sc-0.000
led-0.000
Tokenue
Token ID518
Feature activation-2.14e-02

Pos logit contributions
 aware+0.004
 decisions+0.004
 bravery+0.004
 oak+0.004
 understanding+0.004
Neg logit contributions
rant-0.007
mer-0.006
le-0.006
te-0.005
led-0.005
Token and
Token ID290
Feature activation-1.04e-02

Pos logit contributions
 aware+0.038
 office+0.036
 decisions+0.036
 oak+0.036
 bravery+0.036
Neg logit contributions
rant-0.039
mer-0.031
le-0.027
sc-0.024
ifying-0.023
Token Tim
Token ID5045
Feature activation-2.24e-02

Pos logit contributions
 aware+0.019
 decisions+0.019
unch+0.018
 impact+0.018
 bravery+0.018
Neg logit contributions
rant-0.018
mer-0.014
le-0.013
 Will-0.012
te-0.012
Token both
Token ID1111
Feature activation-1.93e-03

Pos logit contributions
 aware+0.045
 decisions+0.043
 bravery+0.042
 oak+0.042
 discovery+0.042
Neg logit contributions
rant-0.036
mer-0.028
le-0.025
sc-0.021
te-0.020
Token felt
Token ID2936
Feature activation-9.15e-03

Pos logit contributions
 aware+0.003
 oak+0.003
 understanding+0.003
 bravery+0.003
 office+0.003
Neg logit contributions
rant-0.004
mer-0.003
le-0.003
sc-0.002
ifying-0.002
Token sad
Token ID6507
Feature activation+2.09e-03

Pos logit contributions
 aware+0.010
 decisions+0.009
 bravery+0.009
 understanding+0.009
 oak+0.008
Neg logit contributions
rant-0.024
mer-0.021
le-0.019
sc-0.018
te-0.017
Token.
Token ID13
Feature activation-9.83e-04

Pos logit contributions
rant+0.004
mer+0.003
le+0.003
sc+0.003
te+0.003
Neg logit contributions
 aware-0.004
 decisions-0.003
 bravery-0.003
 oak-0.003
 office-0.003
Token Sue
Token ID26089
Feature activation-2.15e-02

Pos logit contributions
 aware+0.002
 decisions+0.002
 bravery+0.002
 oak+0.002
 office+0.002
Neg logit contributions
rant-0.002
mer-0.001
le-0.001
sc-0.001
te-0.001
Token want
Token ID765
Feature activation-1.08e-02

Pos logit contributions
 aware+0.012
 decisions+0.012
 bravery+0.012
 oak+0.011
 property+0.011
Neg logit contributions
rant-0.018
mer-0.015
le-0.013
sc-0.012
led-0.012
Token.
Token ID13
Feature activation-1.15e-03

Pos logit contributions
 decisions+0.017
 aware+0.017
 bravery+0.017
 understanding+0.017
 office+0.016
Neg logit contributions
rant-0.021
mer-0.018
le-0.016
led-0.014
vy-0.014
Token But
Token ID887
Feature activation-1.41e-02

Pos logit contributions
 aware+0.002
 decisions+0.001
 oak+0.001
 bravery+0.001
 office+0.001
Neg logit contributions
rant-0.003
mer-0.002
le-0.002
led-0.002
sc-0.002
Token you
Token ID345
Feature activation-1.50e-03

Pos logit contributions
 aware+0.026
 decisions+0.025
 oak+0.025
joy+0.025
 tal+0.024
Neg logit contributions
rant-0.024
mer-0.018
le-0.017
 Will-0.015
te-0.014
Token can
Token ID460
Feature activation+9.91e-03

Pos logit contributions
 aware+0.002
 decisions+0.002
 bravery+0.002
 oak+0.002
 office+0.002
Neg logit contributions
rant-0.003
mer-0.003
le-0.002
sc-0.002
led-0.002
Token also
Token ID635
Feature activation-2.24e-02

Pos logit contributions
rant+0.020
mer+0.017
le+0.015
led+0.014
sc+0.014
Neg logit contributions
 aware-0.015
 oak-0.014
 decisions-0.014
 understanding-0.014
 office-0.014
Token try
Token ID1949
Feature activation-1.12e-02

Pos logit contributions
 aware+0.034
 decisions+0.033
 oak+0.032
joy+0.032
 bravery+0.032
Neg logit contributions
rant-0.045
mer-0.037
le-0.034
te-0.030
 Will-0.029
Token other
Token ID584
Feature activation-3.22e-03

Pos logit contributions
 aware+0.018
 decisions+0.018
 bravery+0.017
 oak+0.017
 office+0.017
Neg logit contributions
rant-0.022
mer-0.018
le-0.016
sc-0.014
led-0.014
Token colors
Token ID7577
Feature activation+9.62e-03

Pos logit contributions
 aware+0.004
 oak+0.004
 decisions+0.004
 bravery+0.003
unch+0.003
Neg logit contributions
rant-0.008
mer-0.007
le-0.006
sc-0.005
 Will-0.005
Token.
Token ID13
Feature activation+1.97e-03

Pos logit contributions
rant+0.019
mer+0.017
le+0.013
vy+0.013
led+0.013
Neg logit contributions
 decisions-0.014
 aware-0.014
 bike-0.013
 oak-0.013
 bravery-0.013
Token You
Token ID921
Feature activation-5.55e-04

Pos logit contributions
rant+0.004
mer+0.004
le+0.003
led+0.003
sc+0.003
Neg logit contributions
 aware-0.003
 decisions-0.002
 oak-0.002
 bravery-0.002
 office-0.002
Token a
Token ID257
Feature activation+2.87e-03

Pos logit contributions
rant+0.000
mer+0.000
le+0.000
sc+0.000
te+0.000
Neg logit contributions
 aware-0.000
 decisions-0.000
 bravery-0.000
 oak-0.000
 office-0.000
Token ball
Token ID2613
Feature activation+2.83e-03

Pos logit contributions
rant+0.005
mer+0.005
le+0.004
sc+0.004
 Will+0.004
Neg logit contributions
 aware-0.005
 decisions-0.005
 bravery-0.004
unch-0.004
 property-0.004
Token,
Token ID11
Feature activation-1.46e-02

Pos logit contributions
rant+0.005
mer+0.004
le+0.004
te+0.003
led+0.003
Neg logit contributions
 aware-0.005
 oak-0.005
 office-0.005
 decisions-0.005
 bravery-0.005
Token a
Token ID257
Feature activation-7.78e-03

Pos logit contributions
 aware+0.024
 decisions+0.024
 office+0.022
 impact+0.022
 property+0.022
Neg logit contributions
rant-0.028
mer-0.022
le-0.021
sc-0.019
 Will-0.019
Token pillow
Token ID28774
Feature activation+4.60e-03

Pos logit contributions
 aware+0.012
 decisions+0.011
 bravery+0.011
 property+0.010
unch+0.010
Neg logit contributions
rant-0.016
mer-0.014
le-0.012
sc-0.011
ifying-0.011
Token,
Token ID11
Feature activation-2.18e-02

Pos logit contributions
rant+0.009
mer+0.007
le+0.006
sc+0.006
led+0.006
Neg logit contributions
 aware-0.007
 office-0.007
 oak-0.007
 bravery-0.007
 discovery-0.007
Token and
Token ID290
Feature activation-1.26e-02

Pos logit contributions
 aware+0.037
 decisions+0.035
 bravery+0.035
 oak+0.035
 office+0.034
Neg logit contributions
rant-0.042
mer-0.034
le-0.031
sc-0.027
te-0.026
Token a
Token ID257
Feature activation-1.22e-02

Pos logit contributions
 aware+0.019
 decisions+0.019
 bravery+0.018
 oak+0.018
 office+0.018
Neg logit contributions
rant-0.026
mer-0.021
le-0.019
sc-0.017
te-0.017
Token book
Token ID1492
Feature activation-3.63e-03

Pos logit contributions
 aware+0.018
 decisions+0.017
 property+0.016
 bravery+0.016
unch+0.016
Neg logit contributions
rant-0.025
mer-0.021
le-0.019
sc-0.018
 Will-0.018
Token.
Token ID13
Feature activation+4.25e-03

Pos logit contributions
 office+0.006
 aware+0.006
 oak+0.006
 bravery+0.006
 bike+0.006
Neg logit contributions
rant-0.007
mer-0.005
le-0.005
 Will-0.004
led-0.004
Token They
Token ID1119
Feature activation-1.56e-03

Pos logit contributions
rant+0.009
mer+0.007
le+0.007
led+0.006
sc+0.006
Neg logit contributions
 aware-0.006
 decisions-0.006
 oak-0.006
 office-0.006
 bravery-0.006
Token out
Token ID503
Feature activation-5.84e-03

Pos logit contributions
 aware+0.012
 decisions+0.012
 bravery+0.012
 oak+0.012
 office+0.011
Neg logit contributions
rant-0.017
mer-0.014
le-0.012
sc-0.011
te-0.011
Token why
Token ID1521
Feature activation-5.70e-03

Pos logit contributions
 aware+0.009
 decisions+0.009
 bravery+0.009
 oak+0.009
 office+0.009
Neg logit contributions
rant-0.012
mer-0.010
le-0.009
sc-0.008
te-0.007
Token Spot
Token ID15899
Feature activation+7.30e-03

Pos logit contributions
 aware+0.010
 decisions+0.010
 oak+0.010
 office+0.010
 bravery+0.009
Neg logit contributions
rant-0.010
mer-0.008
le-0.007
 Will-0.006
te-0.006
Token did
Token ID750
Feature activation+2.75e-03

Pos logit contributions
rant+0.011
mer+0.008
le+0.007
sc+0.005
te+0.005
Neg logit contributions
 aware-0.016
 bravery-0.015
 oak-0.015
 decisions-0.015
 office-0.015
Token not
Token ID407
Feature activation-6.01e-03

Pos logit contributions
rant+0.005
mer+0.004
le+0.004
te+0.004
 Will+0.004
Neg logit contributions
 aware-0.004
 oak-0.004
 decisions-0.004
unch-0.004
 office-0.004
Token like
Token ID588
Feature activation-2.18e-02

Pos logit contributions
 aware+0.009
 decisions+0.009
 bravery+0.008
 oak+0.008
 office+0.008
Neg logit contributions
rant-0.013
mer-0.010
le-0.010
sc-0.008
te-0.008
Token the
Token ID262
Feature activation-6.22e-04

Pos logit contributions
 aware+0.038
 decisions+0.036
 oak+0.034
 office+0.034
 bravery+0.034
Neg logit contributions
rant-0.040
mer-0.032
le-0.030
 Will-0.026
sc-0.026
Token new
Token ID649
Feature activation+9.22e-03

Pos logit contributions
unch+0.001
shadow+0.001
 aware+0.001
isdom+0.001
onest+0.001
Neg logit contributions
rant-0.001
mer-0.001
le-0.001
sc-0.001
 mer-0.001
Token path
Token ID3108
Feature activation-1.16e-04

Pos logit contributions
mer+0.017
rant+0.017
le+0.015
y+0.014
sc+0.013
Neg logit contributions
unch-0.013
shadow-0.013
onest-0.012
isdom-0.011
 aware-0.011
Token.
Token ID13
Feature activation+2.79e-03

Pos logit contributions
 bravery+0.000
 oak+0.000
 office+0.000
 decisions+0.000
 aware+0.000
Neg logit contributions
rant-0.000
mer-0.000
le-0.000
izz-0.000
 Will-0.000
Token It
Token ID632
Feature activation+1.67e-02

Pos logit contributions
rant+0.005
mer+0.004
le+0.004
sc+0.003
te+0.003
Neg logit contributions
 aware-0.005
 decisions-0.005
 bravery-0.005
 oak-0.005
 office-0.005
Token.
Token ID13
Feature activation-4.09e-03

Pos logit contributions
rant+0.002
mer+0.002
le+0.002
sc+0.001
izz+0.001
Neg logit contributions
 aware-0.003
 bravery-0.002
 oak-0.002
 office-0.002
 decisions-0.002
Token They
Token ID1119
Feature activation-4.53e-03

Pos logit contributions
 aware+0.007
 decisions+0.007
 bravery+0.007
 oak+0.007
 office+0.007
Neg logit contributions
rant-0.007
mer-0.006
le-0.005
sc-0.005
te-0.004
Token did
Token ID750
Feature activation+4.20e-03

Pos logit contributions
 oak+0.007
 aware+0.007
 bravery+0.007
 office+0.007
 understanding+0.007
Neg logit contributions
rant-0.009
mer-0.007
le-0.006
sc-0.006
ifying-0.006
Token not
Token ID407
Feature activation-8.03e-03

Pos logit contributions
rant+0.009
mer+0.007
le+0.007
te+0.006
 Will+0.006
Neg logit contributions
 aware-0.006
 oak-0.006
 decisions-0.006
unch-0.005
 property-0.005
Token care
Token ID1337
Feature activation-5.01e-03

Pos logit contributions
 aware+0.011
 decisions+0.011
 bravery+0.010
 oak+0.010
 office+0.010
Neg logit contributions
rant-0.018
mer-0.015
le-0.014
sc-0.012
te-0.012
Token about
Token ID546
Feature activation-2.15e-02

Pos logit contributions
 aware+0.005
 decisions+0.005
 bravery+0.005
 office+0.005
 understanding+0.005
Neg logit contributions
rant-0.013
mer-0.011
le-0.010
sc-0.009
led-0.009
Token the
Token ID262
Feature activation-9.99e-03

Pos logit contributions
 aware+0.043
 decisions+0.041
 bravery+0.040
 oak+0.040
 office+0.040
Neg logit contributions
rant-0.035
mer-0.027
le-0.024
sc-0.020
 Will-0.019
Token blue
Token ID4171
Feature activation-7.83e-03

Pos logit contributions
 aware+0.018
unch+0.017
shadow+0.017
onest+0.016
 decisions+0.016
Neg logit contributions
rant-0.018
mer-0.015
le-0.013
 mer-0.011
 Will-0.011
Token bird
Token ID6512
Feature activation-5.28e-03

Pos logit contributions
 aware+0.011
 decisions+0.011
 bravery+0.010
 understanding+0.010
 office+0.010
Neg logit contributions
rant-0.017
mer-0.014
le-0.013
sc-0.011
te-0.011
Token anymore
Token ID7471
Feature activation+9.82e-03

Pos logit contributions
 aware+0.008
 oak+0.008
 office+0.008
 decisions+0.008
 bravery+0.008
Neg logit contributions
rant-0.010
mer-0.008
le-0.007
sc-0.007
izz-0.007
Token.
Token ID13
Feature activation-1.61e-03

Pos logit contributions
rant+0.020
mer+0.015
sc+0.013
le+0.013
ka+0.013
Neg logit contributions
 aware-0.015
 bravery-0.015
 office-0.015
 decisions-0.015
 discovery-0.015