TOP ACTIVATIONS
MAX = 3.670

," said the rh in oc eros . The
was a big rh in oc eros . He was very
't touch the rh in oc eros ! It might bite
and all the other av oc ados were very happy too
They find the rh in oc eros . They are gentle
The rh in oc eros was very tired ,
give to the rh in oc eros . Lily walked to
turned out that the av oc ados were bad and made
hurt . The rh in oc eros kept sn orting and
She loved to eat av oc ados because they were y
I will buy more av oc ados tomorrow . You can
couldn 't find any av oc ados . Finally , she
to see the rh in oc eros . Tim my was
playing ! The rh in oc eros was very worried ,
fence where the rh in oc eros was . She saw
She loved to eat av oc ados . One day ,
was a cheerful rh in oc eros . He had a
try to grab the av oc ados . But the rock
Then , the rh in oc eros had one last request
am taking the rh in oc eros to a new home

MIN ACTIVATIONS
MIN = -3.846

them and never threw a tant rum again . <|endoftext|> Once
temper and often threw temper tant rums . One day ,
surprise . He threw a tant rum . He kicked and
sad . She threw a tant rum and cried loudly .
listen . She threw a tant rum in the store .
and so she threw a tant rum when her parents said
to cry and threw a tant rum . She didn 't
. He kept throwing a tant rum . His mom was
kept crying and throwing a tant rum . Finally
stubborn . She throws a tant rum . She kicks and
her feet and threw a tant rum . She grabbed the
and started to throw a tant rum . Suddenly , she
She cried and threw a tant rum . Her mom tried
his feet and threw a tant rum . "
years . The End . <|endoftext|> Once a day
!" The End . <|endoftext|> The puppy was
!" The End . <|endoftext|> Once upon a
. The End ! <|endoftext|> Once upon a
better . The End . <|endoftext|> Once there was
!" The End <|endoftext|> Once upon a time

INTERVAL 1.791 - 3.670
CONTAINS 0.020%

could go out and explore without fear of being seen !
and said the rh in oc eros didn 't need to
right thing to do , without compromising your principles and values
He was excited , and without wasting any time , he
have been able to do without his own equipment . He

INTERVAL -0.088 - 1.791
CONTAINS 65.083%

he saw a ch ubby spider with a big mug .
. One day , her mom my brought home a new
he is lucky . He smiles and writes " I love
She was very upset . Nothing she did helped . Eventually
to the park to play . Lily saw a merry -

INTERVAL -1.967 - -0.088
CONTAINS 34.849%

's okay . You don 't have to ride it again
But then , something goes wrong . The sticks catch fire
down the street in his bath robe when a car came
eat soap . They will listen to Mom and Dad .
smiled and they all went back to the garden to fix

INTERVAL -3.846 - -1.967
CONTAINS 0.048%

the garden and in the end , the garden was even
cool things . In the end , everyone was happy .
again . The end . <|endoftext|> Once upon a
place . The End . <|endoftext|> Once upon a
. In the end , the boy had to
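Note on the INTERVAL blocks above: the four ranges look like equal-width bins spanning MIN to MAX, and the CONTAINS percentages sum to 100%. This is inferred from the printed numbers; the dashboard does not state it. A minimal Python sketch that reproduces the edges under that assumption (the function name `interval_edges` is hypothetical, not part of any dashboard tool):

```python
def interval_edges(lo: float, hi: float, n_bins: int = 4) -> list[float]:
    """Return n_bins + 1 equally spaced bin edges from lo to hi, rounded to 3 decimals."""
    width = (hi - lo) / n_bins
    return [round(lo + i * width, 3) for i in range(n_bins + 1)]


# MIN and MAX as printed in the dump.
edges = interval_edges(-3.846, 3.670)

# CONTAINS percentages as printed, listed from the highest interval to the lowest.
shares = [0.020, 65.083, 34.849, 0.048]
total_share = round(sum(shares), 3)

print(edges)
print(total_share)
```

If the equal-width assumption holds, the printed edges should match the INTERVAL boundaries -3.846, -1.967, -0.088, 1.791 and 3.670.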
Token,"
Token ID553
Feature activation-2.88e-01

Pos logit contributions
 fault+0.006
 tant+0.006
 pad+0.005
 second+0.005
 plate+0.005
Neg logit contributions
apy-0.004
 live-0.004
keys-0.004
 she-0.003
ons-0.003
Token said
Token ID531
Feature activation+1.49e-02

Pos logit contributions
 fault+0.013
 tant+0.013
 pad+0.011
 second+0.011
 plate+0.011
Neg logit contributions
apy-0.008
keys-0.008
 live-0.008
 she-0.007
ons-0.007
Token the
Token ID262
Feature activation+4.15e-01

Pos logit contributions
apy+0.000
keys+0.000
 live+0.000
 she+0.000
ons+0.000
Neg logit contributions
 fault-0.001
 tant-0.001
 pad-0.001
 second-0.000
 plate-0.000
Token rh
Token ID9529
Feature activation-7.59e-01

Pos logit contributions
 she+0.015
keys+0.015
 live+0.015
apy+0.015
 they+0.014
Neg logit contributions
 fault-0.016
 tant-0.015
 pad-0.013
rett-0.011
 puck-0.011
Tokenin
Token ID259
Feature activation-3.64e-01

Pos logit contributions
 fault+0.030
 tant+0.028
 pad+0.025
 asleep+0.022
 second+0.022
Neg logit contributions
keys-0.028
apy-0.027
 live-0.027
ons-0.026
 she-0.026
Tokenoc
Token ID420
Feature activation+3.67e+00

Pos logit contributions
 fault+0.020
 tant+0.018
 asleep+0.016
 pad+0.016
 puck+0.015
Neg logit contributions
keys-0.008
 she-0.008
 live-0.008
apy-0.008
 they-0.007
Tokeneros
Token ID27498
Feature activation-8.17e-01

Pos logit contributions
keys+0.108
ons+0.103
mm+0.100
apy+0.099
 live+0.095
Neg logit contributions
 fault-0.164
 tant-0.159
 pad-0.146
 second-0.134
 asleep-0.133
Token.
Token ID13
Feature activation-5.03e-02

Pos logit contributions
 fault+0.043
 tant+0.042
 pad+0.036
 puck+0.034
 plate+0.034
Neg logit contributions
apy-0.019
 she-0.019
keys-0.018
 live-0.018
 they-0.017
Token\n
Token ID198
Feature activation+7.58e-02

Pos logit contributions
 fault+0.002
 tant+0.002
 pad+0.002
 plate+0.002
 puck+0.002
Neg logit contributions
apy-0.001
 she-0.001
 live-0.001
keys-0.001
 they-0.001
Token\n
Token ID198
Feature activation+1.25e-02

Pos logit contributions
apy+0.002
keys+0.002
 live+0.002
 she+0.002
ons+0.002
Neg logit contributions
 fault-0.003
 tant-0.003
 pad-0.003
 second-0.003
 plate-0.003
TokenThe
Token ID464
Feature activation-1.30e-01

Pos logit contributions
apy+0.000
 live+0.000
keys+0.000
 she+0.000
ons+0.000
Neg logit contributions
 fault-0.001
 tant-0.001
 pad-0.001
 second-0.000
 puck-0.000
Token was
Token ID373
Feature activation+4.00e-01

Pos logit contributions
 fault+0.011
 tant+0.011
 pad+0.009
 puck+0.009
 plate+0.009
Neg logit contributions
 live-0.004
apy-0.004
 she-0.003
 lived-0.003
 they-0.003
Token a
Token ID257
Feature activation+1.79e-01

Pos logit contributions
 she+0.012
apy+0.012
 live+0.011
 they+0.011
 lived+0.010
Neg logit contributions
 tant-0.020
 fault-0.019
 pad-0.016
 puck-0.015
 pace-0.015
Token big
Token ID1263
Feature activation+2.39e-01

Pos logit contributions
apy+0.008
 live+0.008
keys+0.008
 she+0.008
 they+0.008
Neg logit contributions
 fault-0.005
 tant-0.005
 pad-0.004
 second-0.003
 asleep-0.003
Token rh
Token ID9529
Feature activation-7.28e-01

Pos logit contributions
 live+0.008
 she+0.008
apy+0.008
 lived+0.008
 they+0.008
Neg logit contributions
 fault-0.010
 tant-0.009
 pad-0.008
rett-0.007
 second-0.007
Tokenin
Token ID259
Feature activation-3.81e-01

Pos logit contributions
 fault+0.027
 tant+0.026
 pad+0.021
 second+0.019
 plate+0.019
Neg logit contributions
apy-0.028
keys-0.028
 live-0.028
 she-0.027
ons-0.026
Tokenoc
Token ID420
Feature activation+3.61e+00

Pos logit contributions
 fault+0.021
 tant+0.020
 pad+0.018
 asleep+0.017
 second+0.016
Neg logit contributions
 live-0.008
keys-0.008
apy-0.008
 she-0.008
 lived-0.008
Tokeneros
Token ID27498
Feature activation-7.92e-01

Pos logit contributions
keys+0.113
apy+0.107
ons+0.106
 live+0.103
mm+0.102
Neg logit contributions
 fault-0.158
 tant-0.156
 pad-0.136
 second-0.127
 asleep-0.121
Token.
Token ID13
Feature activation+2.48e-01

Pos logit contributions
 fault+0.040
 tant+0.039
 pad+0.035
 plate+0.033
 second+0.032
Neg logit contributions
apy-0.019
keys-0.017
 live-0.016
 she-0.016
mm-0.016
Token He
Token ID679
Feature activation-5.00e-01

Pos logit contributions
apy+0.007
keys+0.007
 live+0.006
 she+0.006
ons+0.006
Neg logit contributions
 fault-0.012
 tant-0.012
 pad-0.010
 second-0.009
 plate-0.009
Token was
Token ID373
Feature activation+3.58e-01

Pos logit contributions
 fault+0.031
 tant+0.030
 pad+0.027
 plate+0.025
 second+0.025
Neg logit contributions
apy-0.007
 live-0.007
keys-0.007
 she-0.006
 lived-0.006
Token very
Token ID845
Feature activation+1.66e-01

Pos logit contributions
 live+0.011
 she+0.011
 lived+0.010
apy+0.010
 they+0.010
Neg logit contributions
 tant-0.017
 fault-0.017
 pad-0.014
 plate-0.013
 puck-0.013
Token't
Token ID470
Feature activation-1.91e-01

Pos logit contributions
apy+0.020
 live+0.019
keys+0.018
 she+0.017
ons+0.016
Neg logit contributions
 fault-0.037
 tant-0.037
 pad-0.032
 second-0.031
 plate-0.030
Token touch
Token ID3638
Feature activation+5.66e-01

Pos logit contributions
 fault+0.010
 tant+0.009
 pad+0.008
 second+0.008
 puck+0.008
Neg logit contributions
apy-0.005
keys-0.005
 live-0.004
ons-0.004
 she-0.004
Token the
Token ID262
Feature activation+2.83e-01

Pos logit contributions
 she+0.012
apy+0.011
ons+0.011
 live+0.011
keys+0.011
Neg logit contributions
 fault-0.031
 tant-0.030
 pad-0.027
 puck-0.026
 second-0.025
Token rh
Token ID9529
Feature activation-8.63e-01

Pos logit contributions
 live+0.010
keys+0.010
 you+0.010
 she+0.010
 we+0.009
Neg logit contributions
 fault-0.010
 tant-0.010
rett-0.009
ingo-0.008
 pad-0.008
Tokenin
Token ID259
Feature activation-4.16e-01

Pos logit contributions
 fault+0.033
 tant+0.031
 pad+0.027
 second+0.024
 plate+0.023
Neg logit contributions
keys-0.032
apy-0.032
 live-0.031
ons-0.031
 she-0.031
Tokenoc
Token ID420
Feature activation+3.60e+00

Pos logit contributions
 fault+0.023
 tant+0.022
 pad+0.019
 asleep+0.019
 second+0.018
Neg logit contributions
keys-0.009
 live-0.008
apy-0.008
ons-0.008
 she-0.008
Tokeneros
Token ID27498
Feature activation-8.22e-01

Pos logit contributions
keys+0.112
ons+0.108
apy+0.107
mm+0.104
 live+0.102
Neg logit contributions
 fault-0.154
 tant-0.151
 pad-0.135
 second-0.124
 asleep-0.122
Token!
Token ID0
Feature activation-2.56e-01

Pos logit contributions
 fault+0.041
 tant+0.040
 pad+0.035
 second+0.033
 puck+0.032
Neg logit contributions
keys-0.020
 live-0.020
apy-0.020
 she-0.020
ons-0.018
Token It
Token ID632
Feature activation+3.82e-01

Pos logit contributions
 fault+0.012
 tant+0.012
 pad+0.011
 second+0.010
 puck+0.010
Neg logit contributions
apy-0.007
keys-0.006
 live-0.006
ons-0.006
 she-0.006
Token might
Token ID1244
Feature activation-2.92e-01

Pos logit contributions
apy+0.012
keys+0.011
 live+0.010
mm+0.010
ons+0.010
Neg logit contributions
 fault-0.018
 tant-0.017
 pad-0.015
 second-0.014
 plate-0.013
Token bite
Token ID13197
Feature activation-6.79e-01

Pos logit contributions
 fault+0.013
 tant+0.013
 pad+0.011
 plate+0.010
 puck+0.010
Neg logit contributions
 live-0.009
apy-0.009
 she-0.008
keys-0.008
 they-0.008
Token and
Token ID290
Feature activation-6.55e-02

Pos logit contributions
 fault+0.004
 tant+0.004
 pad+0.003
 second+0.003
 plate+0.003
Neg logit contributions
 she-0.003
apy-0.002
 live-0.002
 they-0.002
 we-0.002
Token all
Token ID477
Feature activation+5.00e-01

Pos logit contributions
 fault+0.004
 tant+0.004
 pad+0.003
 plate+0.003
 puck+0.003
Neg logit contributions
keys-0.001
apy-0.001
 live-0.001
ons-0.001
mm-0.001
Token the
Token ID262
Feature activation+3.23e-01

Pos logit contributions
keys+0.014
apy+0.014
mm+0.013
 live+0.013
ves+0.012
Neg logit contributions
 fault-0.023
 tant-0.023
 pad-0.020
 asleep-0.018
 plate-0.018
Token other
Token ID584
Feature activation+2.86e-01

Pos logit contributions
 live+0.009
keys+0.009
 she+0.009
ons+0.009
apy+0.009
Neg logit contributions
 fault-0.015
 tant-0.015
 pad-0.013
 asleep-0.012
 puck-0.012
Token av
Token ID1196
Feature activation+9.57e-01

Pos logit contributions
apy+0.010
keys+0.010
 live+0.010
 she+0.009
ves+0.009
Neg logit contributions
 fault-0.011
 tant-0.011
 pad-0.009
 plate-0.009
 second-0.008
Tokenoc
Token ID420
Feature activation+3.60e+00

Pos logit contributions
apy+0.030
 she+0.028
 live+0.028
keys+0.028
 they+0.026
Neg logit contributions
 tant-0.043
 fault-0.042
 pad-0.038
 plate-0.032
 puck-0.032
Tokenados
Token ID22484
Feature activation+7.22e-03

Pos logit contributions
apy+0.132
 live+0.128
 she+0.125
keys+0.121
 they+0.120
Neg logit contributions
 fault-0.144
 tant-0.137
 pad-0.114
 second-0.101
 plate-0.101
Token were
Token ID547
Feature activation+1.73e-01

Pos logit contributions
apy+0.000
keys+0.000
 live+0.000
mm+0.000
 she+0.000
Neg logit contributions
 fault-0.000
 tant-0.000
 pad-0.000
 plate-0.000
 second-0.000
Token very
Token ID845
Feature activation+1.23e-01

Pos logit contributions
apy+0.005
 she+0.005
 live+0.005
keys+0.005
 they+0.005
Neg logit contributions
 tant-0.008
 fault-0.008
 pad-0.006
 puck-0.006
 plate-0.006
Token happy
Token ID3772
Feature activation+1.31e-01

Pos logit contributions
ons+0.005
keys+0.004
apy+0.004
 live+0.004
es+0.004
Neg logit contributions
 tant-0.004
 fault-0.004
 second-0.003
 plate-0.003
 pad-0.003
Token too
Token ID1165
Feature activation+6.20e-02

Pos logit contributions
apy+0.003
 she+0.003
 live+0.003
keys+0.003
 they+0.003
Neg logit contributions
 fault-0.006
 tant-0.006
 pad-0.006
 second-0.005
 puck-0.005
Token They
Token ID1119
Feature activation+3.72e-01

Pos logit contributions
keys+0.001
apy+0.001
 live+0.001
ons+0.001
 she+0.001
Neg logit contributions
 fault-0.002
 tant-0.002
 pad-0.002
 second-0.002
 puck-0.002
Token find
Token ID1064
Feature activation+2.31e-01

Pos logit contributions
apy+0.010
keys+0.010
mm+0.009
 she+0.009
ons+0.009
Neg logit contributions
 fault-0.017
 tant-0.017
 pad-0.015
 second-0.014
 puck-0.014
Token the
Token ID262
Feature activation+3.78e-01

Pos logit contributions
apy+0.008
 live+0.007
 she+0.007
 they+0.007
keys+0.007
Neg logit contributions
 fault-0.010
 tant-0.010
 pad-0.009
 puck-0.008
 plate-0.007
Token rh
Token ID9529
Feature activation-7.14e-01

Pos logit contributions
 live+0.013
 they+0.013
 we+0.012
keys+0.012
 she+0.012
Neg logit contributions
 tant-0.015
rett-0.013
 fault-0.012
ingo-0.011
 pad-0.011
Tokenin
Token ID259
Feature activation-4.05e-01

Pos logit contributions
 fault+0.027
 tant+0.025
 pad+0.021
 second+0.019
 plate+0.019
Neg logit contributions
apy-0.027
keys-0.027
 live-0.027
 she-0.026
ons-0.026
Tokenoc
Token ID420
Feature activation+3.59e+00

Pos logit contributions
 fault+0.021
 tant+0.020
 pad+0.017
 asleep+0.017
 second+0.016
Neg logit contributions
 live-0.010
keys-0.010
apy-0.010
 they-0.010
 she-0.010
Tokeneros
Token ID27498
Feature activation-8.33e-01

Pos logit contributions
keys+0.112
ons+0.107
apy+0.107
mm+0.103
 live+0.102
Neg logit contributions
 fault-0.155
 tant-0.153
 pad-0.135
 second-0.125
 asleep-0.123
Token.
Token ID13
Feature activation-5.23e-02

Pos logit contributions
 fault+0.038
 tant+0.037
 pad+0.032
 second+0.029
 plate+0.029
Neg logit contributions
apy-0.025
keys-0.024
 live-0.024
 she-0.023
ons-0.022
Token They
Token ID1119
Feature activation+4.25e-01

Pos logit contributions
 fault+0.003
 tant+0.003
 pad+0.002
 second+0.002
 plate+0.002
Neg logit contributions
apy-0.001
keys-0.001
 live-0.001
 she-0.001
ons-0.001
Token are
Token ID389
Feature activation+1.55e-01

Pos logit contributions
apy+0.012
keys+0.011
mm+0.010
ons+0.010
 she+0.010
Neg logit contributions
 tant-0.020
 fault-0.020
 pad-0.018
 second-0.016
 puck-0.016
Token gentle
Token ID10296
Feature activation+6.51e-01

Pos logit contributions
 live+0.005
 she+0.004
apy+0.004
 they+0.004
keys+0.004
Neg logit contributions
 tant-0.007
 fault-0.007
 pad-0.006
rett-0.006
 plate-0.005
Token\n
Token ID198
Feature activation-1.59e-01

Pos logit contributions
 fault+0.003
 tant+0.003
 pad+0.003
 second+0.002
 plate+0.002
Neg logit contributions
apy-0.002
keys-0.001
 live-0.001
 she-0.001
ons-0.001
Token\n
Token ID198
Feature activation-1.88e-01

Pos logit contributions
 fault+0.008
 tant+0.008
 pad+0.007
 plate+0.006
 puck+0.006
Neg logit contributions
apy-0.004
 she-0.004
 live-0.004
keys-0.004
ons-0.004
TokenThe
Token ID464
Feature activation-2.87e-01

Pos logit contributions
 fault+0.009
 tant+0.009
 pad+0.008
 second+0.007
 plate+0.007
Neg logit contributions
apy-0.005
keys-0.005
 live-0.005
 she-0.005
ons-0.004
Token rh
Token ID9529
Feature activation-8.85e-01

Pos logit contributions
 fault+0.011
 tant+0.011
 pad+0.008
rett+0.008
 asleep+0.008
Neg logit contributions
 she-0.011
 live-0.010
keys-0.010
apy-0.010
 we-0.010
Tokenin
Token ID259
Feature activation-5.13e-01

Pos logit contributions
 fault+0.031
 tant+0.030
 pad+0.025
 second+0.022
 plate+0.022
Neg logit contributions
apy-0.036
 live-0.034
keys-0.034
 she-0.034
ons-0.033
Tokenoc
Token ID420
Feature activation+3.59e+00

Pos logit contributions
 fault+0.027
 tant+0.026
 pad+0.023
 asleep+0.021
 plate+0.021
Neg logit contributions
apy-0.012
keys-0.012
 live-0.012
 she-0.011
ons-0.011
Tokeneros
Token ID27498
Feature activation-8.06e-01

Pos logit contributions
apy+0.130
keys+0.127
 live+0.126
 she+0.124
ons+0.120
Neg logit contributions
 fault-0.139
 tant-0.135
 pad-0.114
 second-0.103
 plate-0.102
Token was
Token ID373
Feature activation+2.57e-01

Pos logit contributions
 fault+0.040
 tant+0.039
 pad+0.034
 second+0.032
 plate+0.032
Neg logit contributions
apy-0.020
keys-0.019
 live-0.019
 she-0.019
ons-0.018
Token very
Token ID845
Feature activation+3.82e-02

Pos logit contributions
 she+0.009
apy+0.008
 live+0.008
 they+0.008
 lived+0.008
Neg logit contributions
 tant-0.011
 fault-0.011
 pad-0.009
 puck-0.008
ingo-0.008
Token tired
Token ID10032
Feature activation-2.42e-01

Pos logit contributions
apy+0.001
keys+0.001
 live+0.001
 she+0.001
ons+0.001
Neg logit contributions
 fault-0.002
 tant-0.002
 pad-0.001
 second-0.001
 puck-0.001
Token,
Token ID11
Feature activation-1.58e-01

Pos logit contributions
 fault+0.011
 tant+0.011
 pad+0.010
 puck+0.009
 plate+0.009
Neg logit contributions
 she-0.007
 they-0.007
 live-0.006
apy-0.006
 we-0.006
Token give
Token ID1577
Feature activation+1.09e-01

Pos logit contributions
apy+0.010
keys+0.010
 live+0.009
 she+0.009
ons+0.009
Neg logit contributions
 fault-0.017
 tant-0.017
 pad-0.015
 second-0.014
 plate-0.014
Token to
Token ID284
Feature activation+2.92e-01

Pos logit contributions
apy+0.003
keys+0.003
 live+0.003
 she+0.003
ons+0.003
Neg logit contributions
 fault-0.005
 tant-0.005
 pad-0.004
 second-0.004
 plate-0.004
Token the
Token ID262
Feature activation+4.83e-01

Pos logit contributions
apy+0.009
keys+0.008
 live+0.008
 she+0.008
ons+0.008
Neg logit contributions
 fault-0.013
 tant-0.013
 pad-0.011
 second-0.010
 plate-0.010
Token rh
Token ID9529
Feature activation-7.78e-01

Pos logit contributions
 live+0.017
 she+0.017
keys+0.017
apy+0.017
 they+0.016
Neg logit contributions
 fault-0.019
 tant-0.019
 pad-0.015
rett-0.014
 puck-0.014
Tokenin
Token ID259
Feature activation-4.11e-01

Pos logit contributions
 fault+0.033
 tant+0.030
 pad+0.027
 asleep+0.024
 second+0.024
Neg logit contributions
keys-0.028
ons-0.026
 live-0.026
apy-0.026
 she-0.024
Tokenoc
Token ID420
Feature activation+3.59e+00

Pos logit contributions
 fault+0.023
 tant+0.021
 pad+0.019
 asleep+0.018
 puck+0.018
Neg logit contributions
 live-0.009
apy-0.009
 she-0.009
keys-0.009
ons-0.008
Tokeneros
Token ID27498
Feature activation-9.23e-01

Pos logit contributions
keys+0.107
ons+0.104
apy+0.100
mm+0.098
 live+0.096
Neg logit contributions
 fault-0.157
 tant-0.154
 pad-0.139
 second-0.129
 asleep-0.127
Token.
Token ID13
Feature activation-1.25e-01

Pos logit contributions
 fault+0.041
 tant+0.041
 pad+0.035
 second+0.032
 puck+0.032
Neg logit contributions
apy-0.028
 live-0.027
keys-0.027
 she-0.026
ons-0.025
Token Lily
Token ID20037
Feature activation+3.90e-02

Pos logit contributions
 fault+0.006
 tant+0.006
 pad+0.005
 second+0.005
 plate+0.005
Neg logit contributions
apy-0.003
 live-0.003
keys-0.003
 she-0.003
ons-0.003
Token walked
Token ID6807
Feature activation-1.29e-01

Pos logit contributions
apy+0.001
keys+0.001
 live+0.001
 she+0.001
ons+0.001
Neg logit contributions
 fault-0.002
 tant-0.002
 pad-0.002
 plate-0.001
 second-0.001
Token to
Token ID284
Feature activation-2.42e-01

Pos logit contributions
 fault+0.007
 tant+0.006
 puck+0.005
 pad+0.005
 plate+0.005
Neg logit contributions
 she-0.004
 they-0.004
 live-0.004
 we-0.003
apy-0.003
Token turned
Token ID2900
Feature activation+3.35e-02

Pos logit contributions
apy+0.005
 live+0.005
 she+0.005
keys+0.005
 they+0.005
Neg logit contributions
 fault-0.011
 tant-0.011
 pad-0.010
 puck-0.009
 plate-0.009
Token out
Token ID503
Feature activation-9.73e-02

Pos logit contributions
apy+0.001
 she+0.001
 they+0.001
 live+0.001
ons+0.001
Neg logit contributions
 fault-0.001
 tant-0.001
 pad-0.001
 puck-0.001
rett-0.001
Token that
Token ID326
Feature activation+1.00e-01

Pos logit contributions
 fault+0.004
 tant+0.004
 pad+0.004
 puck+0.003
 plate+0.003
Neg logit contributions
apy-0.003
 live-0.003
 she-0.003
keys-0.003
 they-0.003
Token the
Token ID262
Feature activation+2.30e-01

Pos logit contributions
keys+0.003
apy+0.003
 live+0.003
mm+0.003
ons+0.003
Neg logit contributions
 fault-0.004
 tant-0.004
 pad-0.004
 asleep-0.004
 second-0.004
Token av
Token ID1196
Feature activation+8.16e-01

Pos logit contributions
 live+0.008
keys+0.008
 she+0.008
 we+0.007
 you+0.007
Neg logit contributions
 tant-0.009
 fault-0.009
rett-0.008
ingo-0.007
 pad-0.007
Tokenoc
Token ID420
Feature activation+3.58e+00

Pos logit contributions
apy+0.023
keys+0.021
 live+0.021
 she+0.021
 they+0.020
Neg logit contributions
 fault-0.039
 tant-0.038
 pad-0.034
 second-0.031
 plate-0.031
Tokenados
Token ID22484
Feature activation-2.82e-02

Pos logit contributions
apy+0.117
 live+0.113
keys+0.112
 she+0.110
ons+0.105
Neg logit contributions
 fault-0.152
 tant-0.149
 pad-0.127
 second-0.116
 plate-0.115
Token were
Token ID547
Feature activation+1.38e-01

Pos logit contributions
 fault+0.001
 tant+0.001
 pad+0.001
 plate+0.001
 asleep+0.001
Neg logit contributions
apy-0.001
keys-0.001
mm-0.001
 live-0.001
mmm-0.001
Token bad
Token ID2089
Feature activation-3.21e-01

Pos logit contributions
 she+0.004
apy+0.004
 live+0.004
 they+0.004
keys+0.004
Neg logit contributions
 tant-0.006
 fault-0.006
 pad-0.005
rett-0.005
 puck-0.005
Token and
Token ID290
Feature activation-2.22e-01

Pos logit contributions
 fault+0.014
 tant+0.014
 pad+0.012
 second+0.011
 plate+0.011
Neg logit contributions
apy-0.010
keys-0.010
 live-0.010
 she-0.009
ons-0.009
Token made
Token ID925
Feature activation-9.14e-01

Pos logit contributions
 fault+0.010
 tant+0.009
 pad+0.008
 plate+0.008
 second+0.007
Neg logit contributions
keys-0.007
apy-0.007
 live-0.007
mm-0.006
ves-0.006
Token hurt
Token ID5938
Feature activation-3.76e-01

Pos logit contributions
 she+0.006
 live+0.006
 they+0.006
apy+0.006
 we+0.005
Neg logit contributions
 fault-0.008
 tant-0.008
 pad-0.007
 puck-0.006
 plate-0.006
Token.
Token ID13
Feature activation+7.76e-03

Pos logit contributions
 fault+0.021
 tant+0.020
 pad+0.018
 plate+0.017
 puck+0.017
Neg logit contributions
 she-0.008
 live-0.008
 they-0.007
apy-0.007
 you-0.007
Token The
Token ID383
Feature activation-4.49e-02

Pos logit contributions
 she+0.000
apy+0.000
 they+0.000
 live+0.000
 People+0.000
Neg logit contributions
 fault-0.000
 tant-0.000
 pad-0.000
 plate-0.000
 asleep-0.000
Token rh
Token ID9529
Feature activation-8.52e-01

Pos logit contributions
 fault+0.002
 tant+0.002
rett+0.001
 pad+0.001
ingo+0.001
Neg logit contributions
 she-0.002
 live-0.002
apy-0.002
 they-0.002
keys-0.002
Tokenin
Token ID259
Feature activation-4.50e-01

Pos logit contributions
 fault+0.030
 tant+0.029
 pad+0.024
 second+0.022
 plate+0.022
Neg logit contributions
apy-0.034
keys-0.033
 live-0.033
 she-0.032
ons-0.031
Tokenoc
Token ID420
Feature activation+3.58e+00

Pos logit contributions
 fault+0.026
 tant+0.024
 pad+0.021
 asleep+0.020
joy+0.020
Neg logit contributions
 lived-0.009
keys-0.009
 live-0.009
apy-0.008
ons-0.008
Tokeneros
Token ID27498
Feature activation-8.24e-01

Pos logit contributions
apy+0.123
keys+0.122
 live+0.119
ons+0.115
 she+0.115
Neg logit contributions
 fault-0.144
 tant-0.140
 pad-0.121
 second-0.109
 plate-0.107
Token kept
Token ID4030
Feature activation+3.63e-01

Pos logit contributions
 fault+0.042
 tant+0.041
 pad+0.036
 plate+0.033
 second+0.033
Neg logit contributions
apy-0.020
keys-0.019
 live-0.019
 she-0.019
ons-0.017
Token sn
Token ID3013
Feature activation-5.81e-01

Pos logit contributions
 she+0.010
apy+0.010
 live+0.010
 they+0.009
ons+0.009
Neg logit contributions
 fault-0.018
 tant-0.017
 pad-0.015
 puck-0.014
 plate-0.014
Tokenorting
Token ID24707
Feature activation-4.19e-01

Pos logit contributions
 tant+0.022
 fault+0.022
 pad+0.019
 puck+0.017
 asleep+0.017
Neg logit contributions
keys-0.020
apy-0.020
 live-0.020
 she-0.020
 expect-0.020
Token and
Token ID290
Feature activation-1.34e-01

Pos logit contributions
 fault+0.019
 tant+0.019
 pad+0.016
 puck+0.015
 second+0.014
Neg logit contributions
 she-0.012
apy-0.012
keys-0.012
 live-0.012
 they-0.012
Token She
Token ID1375
Feature activation-1.99e-01

Pos logit contributions
 fault+0.001
 tant+0.001
 pad+0.001
 second+0.001
 plate+0.001
Neg logit contributions
apy-0.000
 live-0.000
keys-0.000
 she-0.000
ons-0.000
Token loved
Token ID6151
Feature activation+1.02e-01

Pos logit contributions
 fault+0.007
 pad+0.007
 tant+0.007
 second+0.006
 plate+0.006
Neg logit contributions
keys-0.007
apy-0.007
mm-0.006
vere-0.006
ons-0.006
Token to
Token ID284
Feature activation+3.65e-02

Pos logit contributions
 she+0.004
apy+0.004
 live+0.004
 they+0.004
ons+0.004
Neg logit contributions
 fault-0.004
 tant-0.004
 pad-0.003
 second-0.003
 puck-0.003
Token eat
Token ID4483
Feature activation+2.97e-01

Pos logit contributions
keys+0.001
apy+0.001
vere+0.001
ended+0.001
mm+0.001
Neg logit contributions
 fault-0.001
 tant-0.001
 plate-0.001
 pad-0.001
 puck-0.001
Token av
Token ID1196
Feature activation+9.57e-01

Pos logit contributions
apy+0.012
keys+0.012
 live+0.012
 lived+0.011
ended+0.011
Neg logit contributions
 tant-0.010
 fault-0.010
 pad-0.008
 plate-0.008
 second-0.007
Tokenoc
Token ID420
Feature activation+3.57e+00

Pos logit contributions
apy+0.026
keys+0.025
 live+0.025
 she+0.024
ons+0.023
Neg logit contributions
 fault-0.046
 tant-0.045
 pad-0.039
 second-0.036
 plate-0.036
Tokenados
Token ID22484
Feature activation+4.86e-02

Pos logit contributions
apy+0.125
 live+0.121
 she+0.117
keys+0.116
 they+0.113
Neg logit contributions
 fault-0.147
 tant-0.142
 pad-0.119
 plate-0.108
 second-0.107
Token because
Token ID780
Feature activation-2.68e-01

Pos logit contributions
apy+0.002
keys+0.002
 live+0.002
 she+0.001
ons+0.001
Neg logit contributions
 fault-0.002
 tant-0.002
 pad-0.002
 plate-0.002
 second-0.002
Token they
Token ID484
Feature activation-7.98e-02

Pos logit contributions
 tant+0.014
 fault+0.013
 second+0.012
 pad+0.012
 sausage+0.012
Neg logit contributions
apy-0.005
keys-0.005
 live-0.004
odic-0.004
vere-0.004
Token were
Token ID547
Feature activation+1.72e-01

Pos logit contributions
 tant+0.003
 fault+0.003
 plate+0.003
 pad+0.003
 second+0.003
Neg logit contributions
apy-0.003
ended-0.002
vere-0.002
keys-0.002
ves-0.002
Token y
Token ID331
Feature activation-8.95e-02

Pos logit contributions
apy+0.007
keys+0.006
 live+0.006
mm+0.006
ons+0.006
Neg logit contributions
 fault-0.006
 tant-0.006
 pad-0.005
 second-0.005
 plate-0.005
TokenI
Token ID40
Feature activation+6.75e-02

Pos logit contributions
apy+0.014
keys+0.014
 live+0.014
 she+0.013
ons+0.013
Neg logit contributions
 fault-0.019
 tant-0.018
 pad-0.016
 second-0.015
 puck-0.014
Token will
Token ID481
Feature activation-3.40e-01

Pos logit contributions
apy+0.002
keys+0.002
 live+0.002
 she+0.002
ons+0.002
Neg logit contributions
 fault-0.003
 tant-0.003
 pad-0.003
 second-0.003
 plate-0.003
Token buy
Token ID2822
Feature activation+1.27e-02

Pos logit contributions
 fault+0.016
 tant+0.015
 pad+0.013
 second+0.013
 plate+0.013
Neg logit contributions
apy-0.010
keys-0.009
ons-0.009
 live-0.009
 she-0.008
Token more
Token ID517
Feature activation+4.05e-01

Pos logit contributions
apy+0.000
keys+0.000
 live+0.000
 lived+0.000
 she+0.000
Neg logit contributions
 fault-0.001
 tant-0.001
 pad-0.001
 second-0.001
 plate-0.001
Token av
Token ID1196
Feature activation+8.72e-01

Pos logit contributions
apy+0.014
 live+0.014
keys+0.014
 she+0.014
ons+0.013
Neg logit contributions
 fault-0.016
 tant-0.016
 pad-0.013
 second-0.012
 puck-0.012
Tokenoc
Token ID420
Feature activation+3.57e+00

Pos logit contributions
apy+0.026
 live+0.024
keys+0.024
 she+0.024
 they+0.023
Neg logit contributions
 tant-0.040
 fault-0.039
 pad-0.035
 plate-0.031
 puck-0.031
Tokenados
Token ID22484
Feature activation+7.56e-02

Pos logit contributions
apy+0.134
 live+0.132
 she+0.127
 lived+0.123
 they+0.123
Neg logit contributions
 fault-0.139
 tant-0.133
 pad-0.109
 second-0.097
 plate-0.097
Token tomorrow
Token ID9439
Feature activation+5.42e-01

Pos logit contributions
apy+0.002
keys+0.002
 live+0.002
 she+0.002
ons+0.002
Neg logit contributions
 fault-0.003
 tant-0.003
 pad-0.003
 second-0.003
 plate-0.003
Token.
Token ID13
Feature activation-1.08e-01

Pos logit contributions
apy+0.017
keys+0.015
 live+0.015
 she+0.015
ons+0.014
Neg logit contributions
 fault-0.025
 tant-0.024
 pad-0.021
 plate-0.019
 second-0.019
Token You
Token ID921
Feature activation-3.98e-01

Pos logit contributions
 fault+0.005
 tant+0.004
 pad+0.004
 second+0.004
 puck+0.004
Neg logit contributions
apy-0.003
ons-0.003
keys-0.003
ves-0.003
 live-0.003
Token can
Token ID460
Feature activation-2.15e-01

Pos logit contributions
 fault+0.019
 tant+0.018
 pad+0.016
 second+0.015
 plate+0.015
Neg logit contributions
apy-0.011
keys-0.010
 she-0.010
 live-0.010
ons-0.010
Token couldn
Token ID3521
Feature activation+2.90e-01

Pos logit contributions
apy+0.007
keys+0.007
 live+0.006
ons+0.006
 she+0.006
Neg logit contributions
 fault-0.012
 tant-0.011
 pad-0.010
 second-0.009
 asleep-0.009
Token't
Token ID470
Feature activation-7.96e-02

Pos logit contributions
keys+0.009
apy+0.009
 live+0.009
ating+0.008
ons+0.008
Neg logit contributions
 fault-0.012
 tant-0.011
 second-0.010
 pad-0.009
 asleep-0.009
Token find
Token ID1064
Feature activation+1.21e-01

Pos logit contributions
 fault+0.003
 tant+0.003
 pad+0.003
 second+0.003
 plate+0.002
Neg logit contributions
apy-0.003
keys-0.003
 live-0.002
ons-0.002
 she-0.002
Token any
Token ID597
Feature activation+7.48e-01

Pos logit contributions
 she+0.003
apy+0.003
 live+0.003
keys+0.003
 they+0.003
Neg logit contributions
 fault-0.006
 tant-0.006
 pad-0.005
 puck-0.005
 plate-0.005
Token av
Token ID1196
Feature activation+8.66e-01

Pos logit contributions
 live+0.024
 she+0.024
apy+0.022
 expect+0.022
ons+0.022
Neg logit contributions
 tant-0.031
 fault-0.030
 pad-0.025
rett-0.025
ingo-0.024
Tokenoc
Token ID420
Feature activation+3.57e+00

Pos logit contributions
apy+0.021
keys+0.020
 live+0.020
 she+0.020
ons+0.019
Neg logit contributions
 fault-0.044
 tant-0.043
 pad-0.037
 second-0.035
 plate-0.035
Tokenados
Token ID22484
Feature activation+6.20e-03

Pos logit contributions
apy+0.124
 live+0.120
 she+0.116
keys+0.115
 they+0.111
Neg logit contributions
 fault-0.148
 tant-0.143
 pad-0.121
 second-0.109
 plate-0.108
Token.
Token ID13
Feature activation-1.84e-02

Pos logit contributions
apy+0.000
 live+0.000
keys+0.000
 she+0.000
ons+0.000
Neg logit contributions
 fault-0.000
 tant-0.000
 pad-0.000
 second-0.000
 puck-0.000
Token Finally
Token ID9461
Feature activation+2.35e-01

Pos logit contributions
 fault+0.001
 tant+0.001
 pad+0.001
 plate+0.001
 puck+0.001
Neg logit contributions
apy-0.000
 she-0.000
 live-0.000
keys-0.000
 they-0.000
Token,
Token ID11
Feature activation-9.27e-02

Pos logit contributions
keys+0.008
apy+0.007
 live+0.007
odic+0.006
ons+0.006
Neg logit contributions
 fault-0.010
 tant-0.009
 second-0.008
 pad-0.008
 plate-0.008
Token she
Token ID673
Feature activation-4.83e-02

Pos logit contributions
 fault+0.006
 tant+0.006
 pad+0.005
 second+0.005
 plate+0.005
Neg logit contributions
apy-0.001
keys-0.001
 live-0.001
 she-0.001
ons-0.001
Token to
Token ID284
Feature activation+2.77e-01

Pos logit contributions
apy+0.008
 live+0.008
 she+0.008
keys+0.008
 they+0.008
Neg logit contributions
 fault-0.008
 tant-0.008
 pad-0.006
 plate-0.006
 second-0.006
Token see
Token ID766
Feature activation+4.45e-01

Pos logit contributions
 live+0.007
 she+0.007
apy+0.007
 they+0.006
 we+0.006
Neg logit contributions
 fault-0.014
 tant-0.014
 pad-0.012
 puck-0.011
 plate-0.011
Token the
Token ID262
Feature activation+4.12e-01

Pos logit contributions
apy+0.014
 live+0.014
keys+0.014
 she+0.014
 they+0.013
Neg logit contributions
 fault-0.019
 tant-0.019
 pad-0.016
 puck-0.014
 second-0.014
Token rh
Token ID9529
Feature activation-7.62e-01

Pos logit contributions
 live+0.014
keys+0.013
 she+0.013
 you+0.013
 we+0.012
Neg logit contributions
 fault-0.016
 tant-0.016
 pad-0.014
rett-0.013
ingo-0.012
Tokenin
Token ID259
Feature activation-4.17e-01

Pos logit contributions
 fault+0.028
 tant+0.027
 pad+0.023
 second+0.020
 plate+0.020
Neg logit contributions
apy-0.029
keys-0.029
 live-0.028
 she-0.028
ons-0.027
Tokenoc
Token ID420
Feature activation+3.56e+00

Pos logit contributions
 fault+0.022
 tant+0.021
 pad+0.018
 second+0.017
 puck+0.017
Neg logit contributions
apy-0.010
keys-0.010
 live-0.010
 she-0.009
ons-0.009
Tokeneros
Token ID27498
Feature activation-8.19e-01

Pos logit contributions
keys+0.112
ons+0.107
apy+0.106
mm+0.103
 live+0.103
Neg logit contributions
 fault-0.154
 tant-0.152
 pad-0.134
 second-0.123
 asleep-0.120
Token.
Token ID13
Feature activation+6.79e-02

Pos logit contributions
 fault+0.038
 tant+0.037
 pad+0.033
 second+0.030
 plate+0.030
Neg logit contributions
apy-0.023
keys-0.022
 live-0.022
 she-0.021
ons-0.020
Token Tim
Token ID5045
Feature activation+1.22e-01

Pos logit contributions
apy+0.001
 live+0.001
 she+0.001
keys+0.001
 they+0.001
Neg logit contributions
 fault-0.004
 tant-0.004
 pad-0.003
 second-0.003
 puck-0.003
Tokenmy
Token ID1820
Feature activation+4.22e-01

Pos logit contributions
 she+0.004
 lived+0.004
 live+0.004
ons+0.004
keys+0.004
Neg logit contributions
 fault-0.006
 tant-0.005
 puck-0.004
 second-0.004
 plate-0.004
Token was
Token ID373
Feature activation+3.58e-01

Pos logit contributions
apy+0.011
keys+0.010
 live+0.010
 she+0.010
ons+0.009
Neg logit contributions
 fault-0.021
 tant-0.021
 pad-0.018
 second-0.017
 plate-0.017
Token playing
Token ID2712
Feature activation+4.90e-02

Pos logit contributions
 fault+0.023
 tant+0.022
 pad+0.020
 asleep+0.018
 second+0.018
Neg logit contributions
apy-0.015
keys-0.014
 live-0.013
mm-0.013
 she-0.013
Token!
Token ID0
Feature activation-6.83e-02

Pos logit contributions
apy+0.001
 live+0.001
keys+0.001
 she+0.001
ons+0.001
Neg logit contributions
 fault-0.002
 tant-0.002
 pad-0.002
 second-0.002
 plate-0.002
Token The
Token ID383
Feature activation+2.98e-02

Pos logit contributions
 fault+0.003
 tant+0.003
 pad+0.003
 plate+0.003
 second+0.003
Neg logit contributions
apy-0.002
 live-0.002
keys-0.002
 she-0.002
 they-0.001
Token rh
Token ID9529
Feature activation-8.33e-01

Pos logit contributions
 live+0.001
 she+0.001
keys+0.001
 they+0.001
 we+0.001
Neg logit contributions
 fault-0.001
 tant-0.001
 pad-0.001
rett-0.001
 asleep-0.001
Tokenin
Token ID259
Feature activation-4.57e-01

Pos logit contributions
 fault+0.031
 tant+0.030
 pad+0.025
 second+0.023
 plate+0.022
Neg logit contributions
apy-0.032
keys-0.031
 live-0.031
 she-0.030
ons-0.030
Tokenoc
Token ID420
Feature activation+3.55e+00

Pos logit contributions
 fault+0.027
 tant+0.025
 pad+0.022
 asleep+0.021
 second+0.020
Neg logit contributions
keys-0.009
 lived-0.008
apy-0.008
 live-0.008
ons-0.008
Tokeneros
Token ID27498
Feature activation-7.93e-01

Pos logit contributions
keys+0.116
apy+0.115
 live+0.111
ons+0.111
mm+0.107
Neg logit contributions
 fault-0.147
 tant-0.145
 pad-0.127
 second-0.115
 plate-0.113
Token was
Token ID373
Feature activation+3.15e-01

Pos logit contributions
 fault+0.042
 tant+0.040
 pad+0.035
 plate+0.033
 second+0.033
Neg logit contributions
apy-0.018
keys-0.018
 live-0.017
 she-0.017
 lived-0.016
Token very
Token ID845
Feature activation+5.60e-02

Pos logit contributions
 she+0.010
 live+0.009
apy+0.009
 they+0.009
 lived+0.009
Neg logit contributions
 tant-0.015
 fault-0.014
 pad-0.012
 plate-0.011
 puck-0.011
Token worried
Token ID7960
Feature activation-2.68e-01

Pos logit contributions
apy+0.002
keys+0.002
 live+0.002
 she+0.002
ons+0.002
Neg logit contributions
 fault-0.002
 tant-0.002
 pad-0.002
 second-0.002
 puck-0.002
Token,
Token ID11
Feature activation-1.08e-01

Pos logit contributions
 fault+0.012
 tant+0.012
 pad+0.010
 second+0.010
 plate+0.010
Neg logit contributions
apy-0.008
 live-0.007
keys-0.007
 she-0.007
ons-0.007
Token fence
Token ID13990
Feature activation+2.39e-02

Pos logit contributions
 live+0.010
keys+0.010
 we+0.009
 you+0.009
 lived+0.009
Neg logit contributions
 fault-0.013
 tant-0.012
rett-0.009
 pad-0.009
uster-0.008
Token where
Token ID810
Feature activation-3.13e-01

Pos logit contributions
 she+0.001
 they+0.001
 live+0.001
apy+0.001
 lived+0.001
Neg logit contributions
 fault-0.001
 tant-0.001
 pad-0.001
 puck-0.001
rett-0.001
Token the
Token ID262
Feature activation+2.88e-01

Pos logit contributions
 fault+0.015
 tant+0.014
 pad+0.012
 asleep+0.012
 plate+0.012
Neg logit contributions
keys-0.009
apy-0.008
 live-0.008
mm-0.008
mmm-0.007
Token rh
Token ID9529
Feature activation-8.43e-01

Pos logit contributions
keys+0.010
 live+0.010
 we+0.009
 she+0.009
apy+0.009
Neg logit contributions
 fault-0.012
 tant-0.012
 pad-0.009
rett-0.009
 asleep-0.008
Tokenin
Token ID259
Feature activation-4.90e-01

Pos logit contributions
 fault+0.033
 tant+0.031
 pad+0.027
 second+0.024
 asleep+0.024
Neg logit contributions
keys-0.031
apy-0.031
 live-0.030
ons-0.030
 she-0.029
Tokenoc
Token ID420
Feature activation+3.55e+00

Pos logit contributions
 fault+0.024
 tant+0.023
 pad+0.021
 second+0.019
 plate+0.019
Neg logit contributions
apy-0.013
keys-0.012
 live-0.012
 she-0.012
ons-0.011
Tokeneros
Token ID27498
Feature activation-8.68e-01

Pos logit contributions
keys+0.109
apy+0.105
ons+0.105
mm+0.102
 live+0.101
Neg logit contributions
 fault-0.154
 tant-0.151
 pad-0.135
 second-0.124
 asleep-0.120
Token was
Token ID373
Feature activation+2.83e-01

Pos logit contributions
 fault+0.038
 tant+0.037
 pad+0.036
 plate+0.031
 second+0.031
Neg logit contributions
apy-0.026
keys-0.023
mm-0.023
 live-0.022
mmm-0.021
Token.
Token ID13
Feature activation-8.55e-02

Pos logit contributions
apy+0.008
keys+0.008
 live+0.008
 she+0.008
ons+0.007
Neg logit contributions
 fault-0.013
 tant-0.013
 pad-0.011
 second-0.010
 plate-0.010
Token She
Token ID1375
Feature activation-1.30e-02

Pos logit contributions
 fault+0.005
 tant+0.004
 pad+0.004
 second+0.004
 plate+0.004
Neg logit contributions
apy-0.002
 live-0.002
 she-0.002
keys-0.002
 they-0.002
Token saw
Token ID2497
Feature activation-7.86e-02

Pos logit contributions
 fault+0.001
 tant+0.001
 pad+0.001
 second+0.000
 puck+0.000
Neg logit contributions
apy-0.000
keys-0.000
 live-0.000
 she-0.000
ons-0.000
Token She
Token ID1375
Feature activation-1.93e-01

Pos logit contributions
 fault+0.004
 tant+0.004
 pad+0.003
 second+0.003
 plate+0.003
Neg logit contributions
apy-0.001
 live-0.001
keys-0.001
 she-0.001
ons-0.001
Token loved
Token ID6151
Feature activation+1.09e-01

Pos logit contributions
 fault+0.007
 pad+0.007
 tant+0.006
 second+0.005
die+0.005
Neg logit contributions
keys-0.007
apy-0.007
mm-0.006
vere-0.006
othe-0.006
Token to
Token ID284
Feature activation+3.64e-02

Pos logit contributions
 she+0.004
apy+0.004
 live+0.004
ons+0.004
 they+0.004
Neg logit contributions
 fault-0.004
 tant-0.004
 pad-0.004
 second-0.003
 puck-0.003
Token eat
Token ID4483
Feature activation+2.68e-01

Pos logit contributions
keys+0.001
apy+0.001
ended+0.001
vere+0.001
mm+0.001
Neg logit contributions
 fault-0.001
 tant-0.001
 plate-0.001
 pad-0.001
 puck-0.001
Token av
Token ID1196
Feature activation+9.45e-01

Pos logit contributions
apy+0.011
keys+0.011
 live+0.010
 she+0.010
 lived+0.010
Neg logit contributions
 tant-0.009
 fault-0.009
 pad-0.007
 plate-0.007
 second-0.007
Tokenoc
Token ID420
Feature activation+3.55e+00

Pos logit contributions
apy+0.025
keys+0.024
 live+0.024
 she+0.023
ons+0.022
Neg logit contributions
 fault-0.046
 tant-0.045
 pad-0.039
 second-0.036
 plate-0.036
Tokenados
Token ID22484
Feature activation+9.58e-03

Pos logit contributions
apy+0.120
 live+0.117
 she+0.113
keys+0.113
 they+0.109
Neg logit contributions
 fault-0.148
 tant-0.144
 pad-0.122
 plate-0.110
 second-0.110
Token.
Token ID13
Feature activation-1.69e-02

Pos logit contributions
apy+0.000
keys+0.000
 live+0.000
 she+0.000
ons+0.000
Neg logit contributions
 fault-0.000
 tant-0.000
 pad-0.000
 plate-0.000
 second-0.000
Token One
Token ID1881
Feature activation-3.27e-01

Pos logit contributions
 fault+0.001
 tant+0.001
 pad+0.001
 second+0.001
 puck+0.001
Neg logit contributions
apy-0.001
 live-0.001
 she-0.001
keys-0.001
ons-0.000
Token day
Token ID1110
Feature activation+2.96e-01

Pos logit contributions
 fault+0.011
 tant+0.011
 pad+0.009
 second+0.008
 plate+0.008
Neg logit contributions
apy-0.014
keys-0.013
 live-0.013
 she-0.013
ons-0.013
Token,
Token ID11
Feature activation+8.66e-03

Pos logit contributions
keys+0.011
 live+0.010
mm+0.009
ating+0.009
 contains+0.009
Neg logit contributions
 tant-0.011
 fault-0.010
 pad-0.009
 second-0.009
 asleep-0.009
Token was
Token ID373
Feature activation+3.55e-01

Pos logit contributions
 fault+0.013
 tant+0.013
 pad+0.011
 second+0.011
 puck+0.011
Neg logit contributions
apy-0.005
 live-0.005
 she-0.005
keys-0.005
 lived-0.005
Token a
Token ID257
Feature activation+1.16e-01

Pos logit contributions
 she+0.010
apy+0.010
 live+0.010
 they+0.009
 lived+0.009
Neg logit contributions
 tant-0.018
 fault-0.018
 pad-0.015
 pace-0.014
 puck-0.014
Token cheerful
Token ID37999
Feature activation+3.32e-01

Pos logit contributions
apy+0.005
 live+0.005
keys+0.005
 she+0.005
 they+0.005
Neg logit contributions
 fault-0.003
 tant-0.003
 pad-0.003
 asleep-0.002
 second-0.002
Token rh
Token ID9529
Feature activation-7.39e-01

Pos logit contributions
 live+0.013
 she+0.013
 lived+0.012
 they+0.012
apy+0.012
Neg logit contributions
 fault-0.013
 tant-0.012
 pad-0.009
rett-0.009
 second-0.009
Tokenin
Token ID259
Feature activation-4.92e-01

Pos logit contributions
 tant+0.024
 fault+0.024
 pad+0.019
 second+0.017
 plate+0.017
Neg logit contributions
apy-0.031
 live-0.030
 she-0.029
keys-0.029
 they-0.028
Tokenoc
Token ID420
Feature activation+3.55e+00

Pos logit contributions
 fault+0.027
 tant+0.026
 pad+0.022
 asleep+0.021
 second+0.021
Neg logit contributions
 live-0.011
apy-0.011
keys-0.011
 she-0.010
 lived-0.010
Tokeneros
Token ID27498
Feature activation-7.91e-01

Pos logit contributions
keys+0.119
apy+0.117
 live+0.113
ons+0.112
 she+0.109
Neg logit contributions
 fault-0.146
 tant-0.144
 pad-0.124
 second-0.113
 plate-0.110
Token.
Token ID13
Feature activation+1.99e-01

Pos logit contributions
 fault+0.040
 tant+0.039
 pad+0.035
 plate+0.033
 second+0.033
Neg logit contributions
apy-0.018
keys-0.017
 live-0.016
 she-0.016
mm-0.016
Token He
Token ID679
Feature activation-5.39e-01

Pos logit contributions
apy+0.006
keys+0.006
 live+0.005
 she+0.005
ons+0.005
Neg logit contributions
 fault-0.009
 tant-0.009
 pad-0.008
 plate-0.007
 second-0.007
Token had
Token ID550
Feature activation+1.14e-03

Pos logit contributions
 fault+0.031
 tant+0.031
 pad+0.028
 second+0.026
 puck+0.026
Neg logit contributions
apy-0.009
keys-0.008
 live-0.008
 she-0.008
ons-0.007
Token a
Token ID257
Feature activation+6.37e-02

Pos logit contributions
 she+0.000
 live+0.000
 lived+0.000
apy+0.000
 they+0.000
Neg logit contributions
 tant-0.000
 fault-0.000
 pad-0.000
 plate-0.000
 angle-0.000
Token try
Token ID1949
Feature activation+1.58e-01

Pos logit contributions
keys+0.000
apy+0.000
ons+0.000
ended+0.000
mm+0.000
Neg logit contributions
 tant-0.000
 fault-0.000
 pad-0.000
 puck-0.000
 second-0.000
Token to
Token ID284
Feature activation-1.44e-01

Pos logit contributions
 she+0.006
 they+0.006
apy+0.005
 live+0.005
 we+0.005
Neg logit contributions
 tant-0.006
 fault-0.006
 pad-0.005
 plate-0.005
 puck-0.004
Token grab
Token ID5552
Feature activation+3.64e-01

Pos logit contributions
 fault+0.007
 tant+0.006
 pad+0.006
 second+0.005
 plate+0.005
Neg logit contributions
apy-0.004
 live-0.004
keys-0.004
 she-0.004
ons-0.004
Token the
Token ID262
Feature activation+2.20e-01

Pos logit contributions
 they+0.008
apy+0.007
 she+0.007
 live+0.007
 we+0.006
Neg logit contributions
 fault-0.020
 tant-0.019
 pad-0.018
 asleep-0.017
 lair-0.016
Token av
Token ID1196
Feature activation+9.37e-01

Pos logit contributions
 she+0.007
 we+0.007
 live+0.007
 you+0.007
 they+0.007
Neg logit contributions
 fault-0.008
rett-0.008
 tant-0.008
 pad-0.007
ingo-0.007
Tokenoc
Token ID420
Feature activation+3.54e+00

Pos logit contributions
keys+0.019
ons+0.018
 live+0.018
apy+0.018
 she+0.017
Neg logit contributions
 fault-0.051
 tant-0.048
 asleep-0.042
 pad-0.041
 second-0.040
Tokenados
Token ID22484
Feature activation+1.84e-01

Pos logit contributions
apy+0.104
keys+0.103
 live+0.099
ons+0.098
 she+0.096
Neg logit contributions
 fault-0.159
 tant-0.155
 pad-0.135
 second-0.125
 asleep-0.124
Token.
Token ID13
Feature activation+7.63e-02

Pos logit contributions
 she+0.004
 they+0.004
keys+0.004
 live+0.004
 we+0.004
Neg logit contributions
 tant-0.010
 fault-0.009
 pad-0.008
 puck-0.008
joy-0.007
Token But
Token ID887
Feature activation+1.55e-01

Pos logit contributions
apy+0.002
 she+0.001
 live+0.001
 they+0.001
keys+0.001
Neg logit contributions
 fault-0.004
 tant-0.004
 pad-0.004
 plate-0.004
 second-0.004
Token the
Token ID262
Feature activation+1.63e-01

Pos logit contributions
apy+0.003
keys+0.003
 live+0.003
 she+0.003
ons+0.003
Neg logit contributions
 fault-0.009
 tant-0.008
 pad-0.007
 second-0.007
 plate-0.007
Token rock
Token ID3881
Feature activation-4.54e-01

Pos logit contributions
 she+0.006
 live+0.005
 you+0.005
keys+0.005
 they+0.005
Neg logit contributions
 fault-0.007
 tant-0.006
rett-0.006
 pad-0.005
ingo-0.005
TokenThen
Token ID6423
Feature activation-3.01e-02

Pos logit contributions
 fault+0.001
 tant+0.001
 pad+0.001
 plate+0.001
 asleep+0.001
Neg logit contributions
apy-0.001
keys-0.001
 live-0.001
 she-0.001
ons-0.001
Token,
Token ID11
Feature activation-1.57e-01

Pos logit contributions
 fault+0.001
 tant+0.001
 pad+0.001
 second+0.001
 plate+0.001
Neg logit contributions
keys-0.001
apy-0.001
 live-0.001
ons-0.001
mm-0.001
Token the
Token ID262
Feature activation+2.67e-01

Pos logit contributions
 fault+0.008
 tant+0.008
 pad+0.007
 second+0.007
 plate+0.007
Neg logit contributions
apy-0.004
keys-0.003
 live-0.003
ons-0.003
 she-0.003
Token rh
Token ID9529
Feature activation-8.84e-01

Pos logit contributions
 she+0.010
 live+0.010
apy+0.010
keys+0.009
 they+0.009
Neg logit contributions
 fault-0.011
 tant-0.010
 pad-0.008
 asleep-0.007
rett-0.007
Tokenin
Token ID259
Feature activation-4.67e-01

Pos logit contributions
 fault+0.032
 tant+0.031
 pad+0.026
 second+0.023
 plate+0.023
Neg logit contributions
apy-0.034
keys-0.034
 live-0.034
 she-0.033
ons-0.032
Tokenoc
Token ID420
Feature activation+3.54e+00

Pos logit contributions
 fault+0.026
 tant+0.025
 pad+0.021
 asleep+0.021
 second+0.020
Neg logit contributions
keys-0.010
 live-0.010
apy-0.009
 lived-0.009
 she-0.009
Tokeneros
Token ID27498
Feature activation-8.14e-01

Pos logit contributions
apy+0.123
keys+0.122
 live+0.119
 she+0.116
ons+0.115
Neg logit contributions
 fault-0.141
 tant-0.138
 pad-0.118
 second-0.107
 plate-0.105
Token had
Token ID550
Feature activation-1.90e-01

Pos logit contributions
 fault+0.042
 tant+0.041
 pad+0.035
 plate+0.033
 second+0.033
Neg logit contributions
apy-0.020
keys-0.019
 live-0.019
 she-0.019
ons-0.017
Token one
Token ID530
Feature activation-3.96e-01

Pos logit contributions
 tant+0.008
 fault+0.008
 pad+0.007
 plate+0.006
 puck+0.006
Neg logit contributions
apy-0.006
 she-0.006
 live-0.006
keys-0.006
 they-0.006
Token last
Token ID938
Feature activation+2.12e-01

Pos logit contributions
 tant+0.017
 fault+0.016
 pad+0.013
 asleep+0.013
rett+0.013
Neg logit contributions
 she-0.013
 live-0.013
apy-0.013
keys-0.012
 they-0.012
Token request
Token ID2581
Feature activation-1.30e-01

Pos logit contributions
 live+0.008
 she+0.007
 expect+0.007
 lived+0.007
apy+0.007
Neg logit contributions
 fault-0.008
 tant-0.008
 puck-0.006
ingo-0.006
icer-0.006
Token am
Token ID716
Feature activation+9.73e-01

Pos logit contributions
apy+0.007
keys+0.007
 live+0.007
 she+0.006
ons+0.006
Neg logit contributions
 fault-0.013
 tant-0.013
 pad-0.011
 second-0.011
 plate-0.011
Token taking
Token ID2263
Feature activation-6.97e-01

Pos logit contributions
apy+0.032
keys+0.031
 live+0.030
ons+0.029
 she+0.029
Neg logit contributions
 fault-0.042
 tant-0.039
 pad-0.034
 second-0.032
 asleep-0.031
Token the
Token ID262
Feature activation+4.08e-01

Pos logit contributions
 fault+0.034
 tant+0.033
 pad+0.029
 puck+0.027
 plate+0.027
Neg logit contributions
apy-0.018
 she-0.018
 live-0.017
 you-0.017
keys-0.017
Token rh
Token ID9529
Feature activation-7.48e-01

Pos logit contributions
 you+0.014
 live+0.014
keys+0.014
 we+0.014
 she+0.014
Neg logit contributions
 fault-0.015
 tant-0.014
 pad-0.012
joy-0.012
rett-0.012
Tokenin
Token ID259
Feature activation-3.95e-01

Pos logit contributions
 fault+0.030
 tant+0.028
 pad+0.025
 second+0.022
 asleep+0.022
Neg logit contributions
keys-0.027
apy-0.026
 live-0.026
ons-0.026
 she-0.024
Tokenoc
Token ID420
Feature activation+3.54e+00

Pos logit contributions
 fault+0.024
 tant+0.022
 pad+0.020
 asleep+0.019
joy+0.019
Neg logit contributions
keys-0.007
 live-0.007
 we-0.006
ons-0.006
 you-0.006
Tokeneros
Token ID27498
Feature activation-8.95e-01

Pos logit contributions
keys+0.112
apy+0.108
ons+0.106
 live+0.103
mm+0.102
Neg logit contributions
 fault-0.150
 tant-0.147
 pad-0.130
 second-0.121
 plate-0.118
Token to
Token ID284
Feature activation-2.51e-01

Pos logit contributions
 fault+0.039
 tant+0.038
 pad+0.032
 puck+0.030
 second+0.030
Neg logit contributions
 live-0.028
keys-0.027
 she-0.027
apy-0.027
 they-0.026
Token a
Token ID257
Feature activation+1.21e-02

Pos logit contributions
 fault+0.011
 tant+0.011
 pad+0.010
 second+0.009
 plate+0.009
Neg logit contributions
apy-0.007
 live-0.007
keys-0.007
 she-0.007
ons-0.007
Token new
Token ID649
Feature activation-3.22e-02

Pos logit contributions
 live+0.000
 we+0.000
 you+0.000
keys+0.000
 solve+0.000
Neg logit contributions
 fault-0.001
 tant-0.001
rett-0.000
 pad-0.000
ingo-0.000
Token home
Token ID1363
Feature activation-5.57e-02

Pos logit contributions
 fault+0.001
 tant+0.001
uster+0.001
ingo+0.001
rett+0.001
Neg logit contributions
 live-0.001
 lived-0.001
 we-0.001
 you-0.001
 solve-0.001
Token them
Token ID606
Feature activation-2.25e-01

Pos logit contributions
apy+0.018
keys+0.017
 live+0.016
 she+0.016
ons+0.016
Neg logit contributions
 fault-0.023
 tant-0.022
 pad-0.019
 second-0.018
 plate-0.018
Token and
Token ID290
Feature activation-1.44e-01

Pos logit contributions
 tant+0.011
 fault+0.011
 pad+0.009
 puck+0.009
 second+0.009
Neg logit contributions
 live-0.006
 she-0.006
keys-0.006
apy-0.006
 they-0.006
Token never
Token ID1239
Feature activation+5.63e-01

Pos logit contributions
 fault+0.007
 tant+0.006
 pad+0.005
 plate+0.005
 asleep+0.005
Neg logit contributions
apy-0.004
keys-0.004
ons-0.004
mm-0.004
ves-0.004
Token threw
Token ID9617
Feature activation-2.05e-01

Pos logit contributions
apy+0.017
keys+0.016
 live+0.016
ons+0.015
 she+0.015
Neg logit contributions
 fault-0.026
 tant-0.025
 pad-0.022
 second-0.020
 plate-0.020
Token a
Token ID257
Feature activation+1.84e-01

Pos logit contributions
 fault+0.010
 tant+0.010
 pad+0.009
 puck+0.008
rett+0.008
Neg logit contributions
 she-0.005
 live-0.005
 they-0.005
apy-0.005
ons-0.005
Token tant
Token ID24246
Feature activation-3.67e+00

Pos logit contributions
bs+0.007
apy+0.007
 she+0.007
 live+0.007
ons+0.007
Neg logit contributions
 fault-0.006
rett-0.005
 habitat-0.005
ASH-0.005
ACK-0.005
Tokenrum
Token ID6582
Feature activation+7.48e-01

Pos logit contributions
 fault+0.146
 tant+0.143
 pad+0.118
 second+0.106
 plate+0.105
Neg logit contributions
apy-0.130
 live-0.127
 she-0.125
keys-0.124
 they-0.120
Token again
Token ID757
Feature activation+4.69e-02

Pos logit contributions
apy+0.022
keys+0.022
 live+0.021
ons+0.020
mm+0.019
Neg logit contributions
 fault-0.034
 tant-0.032
 pad-0.028
 asleep-0.026
 second-0.026
Token.
Token ID13
Feature activation-1.15e-01

Pos logit contributions
apy+0.002
keys+0.001
 live+0.001
ons+0.001
mm+0.001
Neg logit contributions
 fault-0.002
 tant-0.002
 pad-0.002
 second-0.002
 asleep-0.002
Token<|endoftext|>
Token ID50256
Feature activation-6.06e-01

Pos logit contributions
 fault+0.006
 tant+0.006
 pad+0.005
 second+0.005
 puck+0.005
Neg logit contributions
apy-0.003
 she-0.003
 live-0.003
keys-0.003
 they-0.003
TokenOnce
Token ID7454
Feature activation+4.25e-01

Pos logit contributions
 fault+0.031
 tant+0.030
 pad+0.026
 puck+0.024
 second+0.024
Neg logit contributions
 live-0.015
apy-0.015
 she-0.014
ons-0.014
 lived-0.014
Token temper
Token ID4124
Feature activation-1.35e-01

Pos logit contributions
 fault+0.058
 tant+0.053
 pad+0.051
 asleep+0.046
 puck+0.046
Neg logit contributions
 live-0.054
apy-0.052
 she-0.052
ves-0.049
 expect-0.049
Token and
Token ID290
Feature activation-3.69e-02

Pos logit contributions
 fault+0.006
 tant+0.006
 pad+0.005
 second+0.005
rett+0.005
Neg logit contributions
 she-0.004
 live-0.004
apy-0.004
ons-0.004
 they-0.004
Token often
Token ID1690
Feature activation+6.61e-01

Pos logit contributions
 fault+0.002
 tant+0.002
 pad+0.002
 second+0.001
 plate+0.001
Neg logit contributions
apy-0.001
keys-0.001
 live-0.001
 she-0.001
ons-0.001
Token threw
Token ID9617
Feature activation-2.13e-01

Pos logit contributions
apy+0.019
keys+0.019
mm+0.017
 live+0.017
ons+0.016
Neg logit contributions
 fault-0.030
 tant-0.030
 pad-0.026
 asleep-0.024
 puck-0.024
Token temper
Token ID4124
Feature activation-1.23e-01

Pos logit contributions
 fault+0.011
 pad+0.009
 tant+0.009
rett+0.009
 plate+0.008
Neg logit contributions
 she-0.005
ons-0.005
 you-0.005
apy-0.005
 live-0.005
Token tant
Token ID24246
Feature activation-3.65e+00

Pos logit contributions
 fault+0.005
 pad+0.004
 tant+0.004
 plate+0.004
rett+0.004
Neg logit contributions
apy-0.004
ons-0.004
 she-0.004
 live-0.004
 they-0.004
Tokenrums
Token ID45241
Feature activation+4.52e-01

Pos logit contributions
 fault+0.152
 tant+0.147
 pad+0.119
rett+0.109
 second+0.107
Neg logit contributions
 live-0.125
apy-0.123
 she-0.122
 they-0.119
 we-0.119
Token.
Token ID13
Feature activation+2.90e-02

Pos logit contributions
apy+0.013
 live+0.013
keys+0.013
 she+0.012
ons+0.012
Neg logit contributions
 fault-0.021
 tant-0.020
 pad-0.018
 second-0.016
 plate-0.016
Token One
Token ID1881
Feature activation-3.32e-01

Pos logit contributions
apy+0.001
 she+0.001
 live+0.001
keys+0.001
 they+0.001
Neg logit contributions
 fault-0.001
 tant-0.001
 pad-0.001
 plate-0.001
 puck-0.001
Token day
Token ID1110
Feature activation+4.31e-01

Pos logit contributions
 fault+0.015
 tant+0.014
 pad+0.012
 asleep+0.011
 puck+0.011
Neg logit contributions
 she-0.011
 live-0.011
apy-0.011
 lived-0.011
 they-0.010
Token,
Token ID11
Feature activation+2.54e-02

Pos logit contributions
keys+0.012
apy+0.012
 live+0.012
ons+0.011
mm+0.011
Neg logit contributions
 tant-0.019
 fault-0.019
 pad-0.016
 second-0.016
 plate-0.015
Token surprise
Token ID5975
Feature activation+4.65e-01

Pos logit contributions
 live+0.003
 she+0.003
 you+0.003
 we+0.003
 they+0.003
Neg logit contributions
 fault-0.003
 tant-0.003
rett-0.003
uster-0.003
 pad-0.003
Token.
Token ID13
Feature activation+1.95e-02

Pos logit contributions
apy+0.012
 live+0.012
keys+0.012
 she+0.012
ons+0.011
Neg logit contributions
 fault-0.023
 tant-0.022
 pad-0.019
 second-0.018
 puck-0.018
Token He
Token ID679
Feature activation-7.05e-02

Pos logit contributions
apy+0.001
 live+0.001
 she+0.001
 they+0.000
keys+0.000
Neg logit contributions
 fault-0.001
 tant-0.001
 pad-0.001
 plate-0.001
 asleep-0.001
Token threw
Token ID9617
Feature activation-2.29e-01

Pos logit contributions
 fault+0.004
 tant+0.003
 pad+0.003
 second+0.003
 plate+0.003
Neg logit contributions
apy-0.002
 live-0.002
keys-0.002
 she-0.002
ons-0.002
Token a
Token ID257
Feature activation+3.56e-02

Pos logit contributions
 fault+0.014
rett+0.013
 pad+0.012
ipper+0.012
 habitat+0.012
Neg logit contributions
 they-0.003
 she-0.003
 you-0.003
ons-0.003
apy-0.002
Token tant
Token ID24246
Feature activation-3.56e+00

Pos logit contributions
apy+0.001
 she+0.001
ons+0.001
bs+0.001
 live+0.001
Neg logit contributions
 fault-0.001
 habitat-0.001
rett-0.001
 lair-0.001
joy-0.001
Tokenrum
Token ID6582
Feature activation+8.00e-01

Pos logit contributions
 fault+0.131
 tant+0.129
 pad+0.106
 second+0.096
 plate+0.095
Neg logit contributions
apy-0.135
 live-0.131
keys-0.130
 she-0.129
 they-0.123
Token.
Token ID13
Feature activation-2.86e-02

Pos logit contributions
apy+0.020
 she+0.019
 live+0.018
 they+0.018
keys+0.018
Neg logit contributions
 tant-0.041
 fault-0.041
 pad-0.035
 plate-0.033
 second-0.032
Token He
Token ID679
Feature activation+8.86e-02

Pos logit contributions
 fault+0.001
 tant+0.001
 pad+0.001
 plate+0.001
 second+0.001
Neg logit contributions
apy-0.001
 she-0.001
 live-0.001
 they-0.001
mm-0.001
Token kicked
Token ID12165
Feature activation-5.07e-01

Pos logit contributions
apy+0.002
 live+0.002
 she+0.002
 lived+0.002
keys+0.002
Neg logit contributions
 fault-0.005
 tant-0.004
 pad-0.004
 asleep-0.004
joy-0.004
Token and
Token ID290
Feature activation-7.08e-02

Pos logit contributions
 fault+0.034
rett+0.030
ipper+0.030
 habitat+0.029
 lair+0.029
Neg logit contributions
 they-0.005
 you-0.005
 she-0.004
 it-0.004
ons-0.003
Token sad
Token ID6507
Feature activation-4.54e-01

Pos logit contributions
keys+0.000
mm+0.000
ons+0.000
apy+0.000
 live+0.000
Neg logit contributions
 tant-0.000
 fault-0.000
 second-0.000
 pad-0.000
 puck-0.000
Token.
Token ID13
Feature activation+1.06e-01

Pos logit contributions
 fault+0.023
 tant+0.022
 pad+0.020
 puck+0.018
 plate+0.018
Neg logit contributions
apy-0.011
 she-0.011
 live-0.011
keys-0.010
 they-0.010
Token She
Token ID1375
Feature activation-1.11e-02

Pos logit contributions
apy+0.003
 she+0.002
 live+0.002
keys+0.002
 they+0.002
Neg logit contributions
 fault-0.005
 tant-0.005
 pad-0.005
 plate-0.004
 puck-0.004
Token threw
Token ID9617
Feature activation-1.62e-01

Pos logit contributions
 fault+0.001
 tant+0.001
 pad+0.000
 second+0.000
 plate+0.000
Neg logit contributions
apy-0.000
keys-0.000
 live-0.000
ons-0.000
 she-0.000
Token a
Token ID257
Feature activation-4.60e-02

Pos logit contributions
 fault+0.010
 habitat+0.009
 pad+0.009
rett+0.008
 lair+0.008
Neg logit contributions
 she-0.003
ons-0.002
 they-0.002
apy-0.002
 live-0.002
Token tant
Token ID24246
Feature activation-3.56e+00

Pos logit contributions
 fault+0.002
 habitat+0.001
 lair+0.001
rett+0.001
 asleep+0.001
Neg logit contributions
apy-0.002
 she-0.002
ons-0.002
bs-0.002
 they-0.002
Tokenrum
Token ID6582
Feature activation+8.37e-01

Pos logit contributions
 fault+0.129
 tant+0.128
 pad+0.104
 second+0.094
 plate+0.092
Neg logit contributions
apy-0.135
 live-0.133
keys-0.132
 she-0.130
 they-0.125
Token and
Token ID290
Feature activation+2.28e-02

Pos logit contributions
apy+0.028
 she+0.027
 live+0.027
keys+0.026
 they+0.026
Neg logit contributions
 fault-0.035
 tant-0.034
 pad-0.029
 plate-0.027
 puck-0.027
Token cried
Token ID16896
Feature activation+1.71e-01

Pos logit contributions
keys+0.001
apy+0.001
 live+0.001
mm+0.001
ons+0.001
Neg logit contributions
 fault-0.001
 tant-0.001
 pad-0.001
 second-0.001
 puck-0.001
Token loudly
Token ID23112
Feature activation+2.11e-01

Pos logit contributions
apy+0.005
keys+0.005
 live+0.005
 she+0.005
ons+0.005
Neg logit contributions
 fault-0.008
 tant-0.007
 pad-0.006
 second-0.006
 plate-0.006
Token.
Token ID13
Feature activation+4.39e-02

Pos logit contributions
 she+0.005
apy+0.005
 they+0.005
 live+0.005
keys+0.004
Neg logit contributions
 fault-0.011
 tant-0.011
 pad-0.009
 puck-0.009
 plate-0.009
Token listen
Token ID6004
Feature activation-9.56e-02

Pos logit contributions
 fault+0.018
 tant+0.018
 pad+0.015
 second+0.014
 plate+0.014
Neg logit contributions
apy-0.010
keys-0.009
 live-0.009
 she-0.009
ons-0.009
Token.
Token ID13
Feature activation-2.53e-02

Pos logit contributions
 fault+0.004
 tant+0.004
 pad+0.003
 puck+0.003
 plate+0.003
Neg logit contributions
 she-0.004
 they-0.003
apy-0.003
 live-0.003
keys-0.003
Token She
Token ID1375
Feature activation+2.40e-01

Pos logit contributions
 fault+0.001
 tant+0.001
 pad+0.001
 second+0.001
 plate+0.001
Neg logit contributions
apy-0.000
keys-0.000
 live-0.000
 she-0.000
ons-0.000
Token threw
Token ID9617
Feature activation-1.99e-01

Pos logit contributions
apy+0.006
 live+0.006
keys+0.006
 she+0.006
ons+0.005
Neg logit contributions
 fault-0.012
 tant-0.012
 pad-0.010
 second-0.009
 plate-0.009
Token a
Token ID257
Feature activation-4.40e-02

Pos logit contributions
 fault+0.012
rett+0.010
 lair+0.010
 pad+0.010
ingo+0.010
Neg logit contributions
 she-0.004
 they-0.003
 you-0.003
 live-0.003
ons-0.002
Token tant
Token ID24246
Feature activation-3.55e+00

Pos logit contributions
 fault+0.002
rett+0.001
 habitat+0.001
 lair+0.001
bur+0.001
Neg logit contributions
apy-0.002
 they-0.002
bs-0.002
 live-0.002
 we-0.002
Tokenrum
Token ID6582
Feature activation+7.65e-01

Pos logit contributions
 fault+0.131
 tant+0.131
 pad+0.105
 second+0.096
 plate+0.094
Neg logit contributions
apy-0.133
 live-0.131
keys-0.129
 she-0.128
 they-0.123
Token in
Token ID287
Feature activation-3.31e-01

Pos logit contributions
apy+0.026
keys+0.025
 live+0.025
 she+0.024
ons+0.023
Neg logit contributions
 fault-0.032
 tant-0.031
 pad-0.027
 second-0.024
 plate-0.024
Token the
Token ID262
Feature activation+2.37e-03

Pos logit contributions
 fault+0.019
 pad+0.017
 tant+0.016
 puck+0.016
rett+0.016
Neg logit contributions
 she-0.007
apy-0.006
ons-0.006
 you-0.006
 they-0.006
Token store
Token ID3650
Feature activation-4.49e-01

Pos logit contributions
 live+0.000
 they+0.000
 you+0.000
 we+0.000
apy+0.000
Neg logit contributions
 fault-0.000
 pad-0.000
 tant-0.000
rett-0.000
 asleep-0.000
Token.
Token ID13
Feature activation-5.41e-02

Pos logit contributions
 fault+0.021
 tant+0.021
 pad+0.018
 puck+0.017
rett+0.017
Neg logit contributions
 she-0.013
 they-0.012
 live-0.012
keys-0.011
apy-0.011
Token and
Token ID290
Feature activation+1.76e-02

Pos logit contributions
 she+0.005
 they+0.005
 we+0.004
 live+0.004
 lived+0.004
Neg logit contributions
 tant-0.010
 fault-0.010
 pad-0.008
 second-0.008
 puck-0.008
Token so
Token ID523
Feature activation-7.90e-02

Pos logit contributions
keys+0.001
apy+0.001
mm+0.001
ons+0.001
 live+0.001
Neg logit contributions
 tant-0.001
 fault-0.001
 pad-0.001
 second-0.001
 plate-0.001
Token she
Token ID673
Feature activation+8.09e-02

Pos logit contributions
 tant+0.004
 fault+0.004
 pad+0.003
 second+0.003
 arm+0.003
Neg logit contributions
keys-0.002
qu-0.002
vals-0.002
ons-0.001
vere-0.001
Token threw
Token ID9617
Feature activation-1.50e-01

Pos logit contributions
apy+0.002
 live+0.002
keys+0.002
 she+0.002
 they+0.002
Neg logit contributions
 fault-0.004
 tant-0.004
 pad-0.004
 plate-0.003
 second-0.003
Token a
Token ID257
Feature activation-1.12e-01

Pos logit contributions
 fault+0.010
 tant+0.008
 lair+0.008
ingo+0.008
 pad+0.008
Neg logit contributions
 she-0.003
 you-0.002
apy-0.002
ons-0.002
 they-0.002
Token tant
Token ID24246
Feature activation-3.55e+00

Pos logit contributions
 fault+0.005
 habitat+0.004
rett+0.004
 lair+0.004
joy+0.004
Neg logit contributions
apy-0.004
bs-0.003
 they-0.003
 she-0.003
 we-0.003
Tokenrum
Token ID6582
Feature activation+8.16e-01

Pos logit contributions
 tant+0.122
 fault+0.121
 pad+0.094
 second+0.085
 plate+0.083
Neg logit contributions
apy-0.141
 live-0.140
 she-0.137
keys-0.137
 they-0.133
Token when
Token ID618
Feature activation-3.79e-01

Pos logit contributions
apy+0.022
 live+0.021
keys+0.021
 she+0.021
ons+0.020
Neg logit contributions
 fault-0.039
 tant-0.038
 pad-0.033
 second-0.031
 plate-0.031
Token her
Token ID607
Feature activation-1.00e-01

Pos logit contributions
 fault+0.021
 tant+0.020
 second+0.019
 pad+0.018
 asleep+0.018
Neg logit contributions
keys-0.007
mm-0.006
 live-0.006
apy-0.005
mmm-0.005
Token parents
Token ID3397
Feature activation-2.09e-01

Pos logit contributions
 tant+0.004
 fault+0.004
 pad+0.004
rett+0.003
die+0.003
Neg logit contributions
apy-0.004
 they-0.003
 she-0.003
 we-0.003
keys-0.003
Token said
Token ID531
Feature activation+3.17e-03

Pos logit contributions
 fault+0.011
 tant+0.011
 pad+0.009
 second+0.009
 plate+0.009
Neg logit contributions
apy-0.005
keys-0.005
 live-0.005
 she-0.004
mm-0.004
Token to
Token ID284
Feature activation-2.07e-01

Pos logit contributions
 fault+0.013
 tant+0.012
 pad+0.011
 puck+0.010
ingo+0.010
Neg logit contributions
 she-0.010
 they-0.009
apy-0.009
 live-0.009
ons-0.008
Token cry
Token ID3960
Feature activation+1.25e-01

Pos logit contributions
 fault+0.009
 tant+0.009
 pad+0.008
 puck+0.007
 second+0.007
Neg logit contributions
 live-0.006
apy-0.006
 she-0.006
keys-0.006
 they-0.006
Token and
Token ID290
Feature activation+1.37e-02

Pos logit contributions
apy+0.004
 she+0.004
 live+0.004
keys+0.004
 they+0.004
Neg logit contributions
 fault-0.005
 tant-0.005
 pad-0.004
 puck-0.004
 plate-0.004
Token threw
Token ID9617
Feature activation-1.69e-01

Pos logit contributions
keys+0.000
apy+0.000
mm+0.000
ons+0.000
ves+0.000
Neg logit contributions
 tant-0.001
 fault-0.001
 pad-0.001
 second-0.001
 plate-0.000
Token a
Token ID257
Feature activation-8.71e-02

Pos logit contributions
 fault+0.011
 pad+0.009
 tant+0.009
ingo+0.009
 habitat+0.009
Neg logit contributions
 she-0.003
apy-0.002
 they-0.002
ons-0.002
 live-0.002
Token tant
Token ID24246
Feature activation-3.54e+00

Pos logit contributions
 fault+0.003
 habitat+0.003
 lair+0.002
ingo+0.002
rett+0.002
Neg logit contributions
apy-0.004
 she-0.004
bs-0.003
 they-0.003
 live-0.003
Tokenrum
Token ID6582
Feature activation+8.22e-01

Pos logit contributions
 fault+0.128
 tant+0.127
 pad+0.103
 second+0.093
 plate+0.091
Neg logit contributions
apy-0.135
 live-0.133
keys-0.131
 she-0.130
 they-0.125
Token.
Token ID13
Feature activation-2.52e-03

Pos logit contributions
apy+0.023
keys+0.023
 live+0.022
ons+0.021
 she+0.021
Neg logit contributions
 fault-0.039
 tant-0.037
 pad-0.033
 second-0.030
 asleep-0.030
Token She
Token ID1375
Feature activation+1.29e-01

Pos logit contributions
 fault+0.000
 tant+0.000
 pad+0.000
 second+0.000
 plate+0.000
Neg logit contributions
apy-0.000
keys-0.000
 live-0.000
 she-0.000
ons-0.000
Token didn
Token ID1422
Feature activation-3.56e-02

Pos logit contributions
apy+0.003
 live+0.003
keys+0.003
 she+0.003
ons+0.003
Neg logit contributions
 fault-0.006
 tant-0.006
 pad-0.006
 second-0.005
 plate-0.005
Token't
Token ID470
Feature activation-8.66e-02

Pos logit contributions
 fault+0.002
 tant+0.002
 pad+0.002
 second+0.001
 plate+0.001
Neg logit contributions
apy-0.001
 live-0.001
keys-0.001
 she-0.001
ons-0.001
Token.
Token ID13
Feature activation-2.78e-02

Pos logit contributions
 fault+0.005
 tant+0.005
 pad+0.005
 puck+0.004
 plate+0.004
Neg logit contributions
 she-0.002
 they-0.002
 we-0.001
 live-0.001
apy-0.001
Token He
Token ID679
Feature activation+1.74e-01

Pos logit contributions
 fault+0.001
 tant+0.001
 pad+0.001
 plate+0.001
 asleep+0.001
Neg logit contributions
apy-0.001
 she-0.001
 live-0.001
 they-0.001
keys-0.001
Token kept
Token ID4030
Feature activation+2.31e-01

Pos logit contributions
apy+0.004
keys+0.004
 live+0.004
 she+0.004
ons+0.004
Neg logit contributions
 fault-0.009
 tant-0.008
 pad-0.007
 second-0.007
 puck-0.007
Token throwing
Token ID9644
Feature activation-4.42e-01

Pos logit contributions
 she+0.005
 they+0.005
 live+0.005
apy+0.005
ons+0.005
Neg logit contributions
 fault-0.012
 tant-0.012
 pad-0.011
 puck-0.009
rett-0.009
Token a
Token ID257
Feature activation-3.31e-02

Pos logit contributions
 fault+0.022
 tant+0.018
 pad+0.018
rett+0.018
 asleep+0.016
Neg logit contributions
 live-0.012
 she-0.012
apy-0.011
ons-0.011
 they-0.011
Token tant
Token ID24246
Feature activation-3.51e+00

Pos logit contributions
 fault+0.001
rett+0.001
 habitat+0.001
 asleep+0.001
 pad+0.001
Neg logit contributions
apy-0.001
 live-0.001
ons-0.001
 she-0.001
keys-0.001
Tokenrum
Token ID6582
Feature activation+7.82e-01

Pos logit contributions
 fault+0.128
 tant+0.126
 pad+0.103
 second+0.093
 plate+0.092
Neg logit contributions
apy-0.134
 live-0.131
keys-0.129
 she-0.129
 they-0.123
Token.
Token ID13
Feature activation-1.06e-02

Pos logit contributions
apy+0.018
 she+0.018
 live+0.018
keys+0.017
 they+0.017
Neg logit contributions
 fault-0.040
 tant-0.040
 pad-0.035
 plate-0.032
 puck-0.032
Token His
Token ID2399
Feature activation+1.56e-01

Pos logit contributions
 fault+0.001
 tant+0.001
 pad+0.000
 plate+0.000
 asleep+0.000
Neg logit contributions
apy-0.000
 she-0.000
 live-0.000
 they-0.000
 People-0.000
Token mom
Token ID1995
Feature activation+2.00e-01

Pos logit contributions
apy+0.004
 she+0.004
 they+0.003
 live+0.003
 we+0.003
Neg logit contributions
 pad-0.007
 tant-0.007
 fault-0.007
rett-0.007
 furthe-0.007
Token was
Token ID373
Feature activation+1.38e-01

Pos logit contributions
apy+0.005
 live+0.005
keys+0.005
 she+0.005
ons+0.005
Neg logit contributions
 fault-0.010
 tant-0.010
 pad-0.008
 second-0.008
 plate-0.008
Token kept
Token ID4030
Feature activation+3.03e-01

Pos logit contributions
 fault+0.006
 tant+0.006
 pad+0.005
 second+0.005
 plate+0.005
Neg logit contributions
apy-0.003
keys-0.003
 live-0.003
 she-0.003
ons-0.003
Token crying
Token ID13774
Feature activation-7.34e-01

Pos logit contributions
 she+0.008
apy+0.007
 they+0.007
ons+0.007
 live+0.007
Neg logit contributions
 fault-0.015
 tant-0.015
 pad-0.013
 puck-0.012
ingo-0.012
Token and
Token ID290
Feature activation+2.02e-02

Pos logit contributions
 fault+0.028
 tant+0.027
 pad+0.023
 second+0.021
 plate+0.020
Neg logit contributions
apy-0.027
keys-0.027
 live-0.027
 she-0.026
ons-0.025
Token throwing
Token ID9644
Feature activation-3.40e-01

Pos logit contributions
keys+0.001
apy+0.001
 live+0.001
ons+0.001
 she+0.001
Neg logit contributions
 fault-0.001
 tant-0.001
 pad-0.001
 second-0.001
 asleep-0.001
Token a
Token ID257
Feature activation-1.44e-02

Pos logit contributions
 fault+0.019
 tant+0.016
 pad+0.016
rett+0.015
 puck+0.015
Neg logit contributions
 she-0.008
apy-0.007
 they-0.007
 live-0.006
ons-0.006
Token tant
Token ID24246
Feature activation-3.51e+00

Pos logit contributions
 fault+0.000
 habitat+0.000
rett+0.000
joy+0.000
 pad+0.000
Neg logit contributions
apy-0.001
 she-0.001
ons-0.001
 they-0.001
bs-0.001
Tokenrum
Token ID6582
Feature activation+8.52e-01

Pos logit contributions
 fault+0.129
 tant+0.128
 pad+0.104
 second+0.094
 plate+0.093
Neg logit contributions
apy-0.132
 live-0.130
keys-0.129
 she-0.127
 they-0.122
Token.
Token ID13
Feature activation+8.07e-02

Pos logit contributions
apy+0.024
keys+0.023
 live+0.023
 she+0.022
ons+0.022
Neg logit contributions
 fault-0.040
 tant-0.039
 pad-0.034
 second-0.031
 asleep-0.031
Token
Token ID198
Feature activation-1.01e-01

Pos logit contributions
apy+0.002
 live+0.002
 she+0.002
keys+0.002
ons+0.002
Neg logit contributions
 fault-0.004
 tant-0.004
 pad-0.003
 puck-0.003
 plate-0.003
Token
Token ID198
Feature activation-1.27e-01

Pos logit contributions
 fault+0.004
 tant+0.004
 pad+0.004
 second+0.003
 plate+0.003
Neg logit contributions
apy-0.003
keys-0.003
 live-0.003
 she-0.003
ons-0.003
TokenFinally
Token ID11158
Feature activation+4.64e-02

Pos logit contributions
 tant+0.005
 fault+0.005
 pad+0.005
 second+0.004
 plate+0.004
Neg logit contributions
apy-0.004
 live-0.004
keys-0.004
ons-0.004
 she-0.004
Token stubborn
Token ID24444
Feature activation+1.36e-01

Pos logit contributions
 fault+0.007
 tant+0.007
 pad+0.006
 puck+0.006
 plate+0.006
Neg logit contributions
apy-0.006
 live-0.006
 she-0.005
keys-0.005
 they-0.005
Token.
Token ID13
Feature activation-5.29e-02

Pos logit contributions
 she+0.003
 they+0.003
 live+0.002
 we+0.002
apy+0.002
Neg logit contributions
 tant-0.007
 fault-0.007
 pad-0.006
 puck-0.006
rett-0.006
Token She
Token ID1375
Feature activation+1.23e-01

Pos logit contributions
 fault+0.003
 tant+0.003
 pad+0.002
 second+0.002
 asleep+0.002
Neg logit contributions
keys-0.001
apy-0.001
 live-0.001
ons-0.001
ves-0.001
Token throws
Token ID12542
Feature activation-3.60e-01

Pos logit contributions
apy+0.004
keys+0.004
ons+0.003
 live+0.003
 awaited+0.003
Neg logit contributions
 fault-0.006
 tant-0.006
 pad-0.005
 puck-0.004
 second-0.004
Token a
Token ID257
Feature activation+8.25e-02

Pos logit contributions
 fault+0.023
 lair+0.020
rett+0.019
 pad+0.019
 habitat+0.019
Neg logit contributions
 she-0.007
 they-0.005
 you-0.005
apy-0.004
ons-0.004
Token tant
Token ID24246
Feature activation-3.51e+00

Pos logit contributions
apy+0.004
 she+0.004
bs+0.004
ons+0.004
 they+0.003
Neg logit contributions
 fault-0.003
 habitat-0.002
 lair-0.002
rett-0.002
ASH-0.002
Tokenrum
Token ID6582
Feature activation+8.04e-01

Pos logit contributions
 fault+0.127
 tant+0.126
 pad+0.101
 second+0.091
 plate+0.090
Neg logit contributions
apy-0.135
 live-0.132
keys-0.130
 she-0.129
 they-0.124
Token.
Token ID13
Feature activation-6.36e-02

Pos logit contributions
apy+0.023
keys+0.023
 live+0.022
ons+0.021
 she+0.021
Neg logit contributions
 fault-0.037
 tant-0.035
 pad-0.031
 second-0.029
 asleep-0.029
Token She
Token ID1375
Feature activation+2.29e-01

Pos logit contributions
 fault+0.004
 tant+0.004
 pad+0.003
 second+0.003
 plate+0.003
Neg logit contributions
apy-0.001
keys-0.001
 live-0.001
ons-0.001
 she-0.001
Token kicks
Token ID17364
Feature activation-4.23e-01

Pos logit contributions
apy+0.006
keys+0.006
 live+0.006
 she+0.006
ons+0.005
Neg logit contributions
 fault-0.011
 tant-0.011
 pad-0.009
 second-0.009
 plate-0.009
Token and
Token ID290
Feature activation-6.07e-02

Pos logit contributions
 fault+0.026
 lair+0.023
 habitat+0.022
 pad+0.022
rett+0.021
Neg logit contributions
 she-0.008
 they-0.007
 you-0.006
ons-0.005
apy-0.005
Token her
Token ID607
Feature activation-1.55e-01

Pos logit contributions
 she+0.004
 they+0.004
apy+0.003
 live+0.003
 you+0.003
Neg logit contributions
 fault-0.015
 pad-0.013
 puck-0.012
 tant-0.012
ingo-0.012
Token feet
Token ID3625
Feature activation-6.00e-02

Pos logit contributions
 fault+0.006
 tant+0.005
umb+0.005
 pad+0.005
die+0.005
Neg logit contributions
 she-0.005
 they-0.005
 we-0.005
apy-0.005
 you-0.004
Token and
Token ID290
Feature activation+5.18e-02

Pos logit contributions
 fault+0.003
 tant+0.003
die+0.002
 puck+0.002
 pad+0.002
Neg logit contributions
 she-0.002
 they-0.002
apy-0.001
 live-0.001
othe-0.001
Token threw
Token ID9617
Feature activation-1.36e-01

Pos logit contributions
keys+0.001
apy+0.001
 live+0.001
mm+0.001
ons+0.001
Neg logit contributions
 tant-0.003
 fault-0.002
 pad-0.002
 second-0.002
 puck-0.002
Token a
Token ID257
Feature activation-2.54e-02

Pos logit contributions
 fault+0.008
 lair+0.007
 pad+0.007
 tant+0.007
rett+0.006
Neg logit contributions
 she-0.003
 they-0.002
apy-0.002
 live-0.002
 you-0.002
Token tant
Token ID24246
Feature activation-3.48e+00

Pos logit contributions
 fault+0.001
 habitat+0.000
rett+0.000
 lair+0.000
ASH+0.000
Neg logit contributions
apy-0.001
bs-0.001
 she-0.001
 they-0.001
ons-0.001
Tokenrum
Token ID6582
Feature activation+8.24e-01

Pos logit contributions
 fault+0.127
 tant+0.125
 pad+0.102
 second+0.092
 plate+0.091
Neg logit contributions
apy-0.133
 live-0.129
keys-0.129
 she-0.127
 they-0.122
Token.
Token ID13
Feature activation+4.47e-02

Pos logit contributions
apy+0.023
keys+0.022
 live+0.022
 she+0.021
ons+0.020
Neg logit contributions
 fault-0.039
 tant-0.038
 pad-0.033
 second-0.031
 asleep-0.030
Token She
Token ID1375
Feature activation+3.21e-01

Pos logit contributions
apy+0.001
 she+0.001
 live+0.001
keys+0.001
 they+0.001
Neg logit contributions
 fault-0.003
 tant-0.003
 pad-0.002
 plate-0.002
 puck-0.002
Token grabbed
Token ID13176
Feature activation+2.06e-01

Pos logit contributions
apy+0.008
keys+0.007
 live+0.007
 she+0.007
ons+0.007
Neg logit contributions
 fault-0.017
 tant-0.016
 pad-0.014
 second-0.013
 plate-0.013
Token the
Token ID262
Feature activation+3.04e-02

Pos logit contributions
 she+0.005
apy+0.004
 they+0.004
ons+0.004
 live+0.003
Neg logit contributions
 fault-0.012
 pad-0.010
 tant-0.010
 lair-0.010
 puck-0.009
Token and
Token ID290
Feature activation-1.51e-03

Pos logit contributions
 she+0.001
apy+0.001
 they+0.001
 live+0.001
keys+0.001
Neg logit contributions
 fault-0.001
 tant-0.001
 pad-0.001
 second-0.001
 puck-0.001
Token started
Token ID2067
Feature activation-2.19e-01

Pos logit contributions
 tant+0.000
 fault+0.000
 second+0.000
 plate+0.000
 pad+0.000
Neg logit contributions
keys-0.000
mm-0.000
 live-0.000
apy-0.000
mmm-0.000
Token to
Token ID284
Feature activation-1.76e-01

Pos logit contributions
 fault+0.009
 tant+0.009
 pad+0.008
 puck+0.007
uster+0.007
Neg logit contributions
 she-0.008
 they-0.007
apy-0.007
 live-0.007
othe-0.006
Token throw
Token ID3714
Feature activation+1.42e-01

Pos logit contributions
 fault+0.008
 tant+0.007
 pad+0.007
 second+0.006
 puck+0.006
Neg logit contributions
apy-0.006
 live-0.005
 she-0.005
keys-0.005
 they-0.005
Token a
Token ID257
Feature activation-3.20e-02

Pos logit contributions
 she+0.003
apy+0.003
 they+0.003
ons+0.003
 live+0.002
Neg logit contributions
 fault-0.008
 pad-0.007
 tant-0.007
rett-0.007
 puck-0.007
Token tant
Token ID24246
Feature activation-3.46e+00

Pos logit contributions
 fault+0.001
 habitat+0.001
joy+0.001
rett+0.001
ACK+0.001
Neg logit contributions
apy-0.001
bs-0.001
 she-0.001
 they-0.001
ons-0.001
Tokenrum
Token ID6582
Feature activation+8.69e-01

Pos logit contributions
 fault+0.126
 tant+0.125
 pad+0.101
 second+0.091
 plate+0.090
Neg logit contributions
apy-0.132
 live-0.130
keys-0.127
 she-0.127
 they-0.122
Token.
Token ID13
Feature activation+6.85e-02

Pos logit contributions
apy+0.025
keys+0.024
 live+0.024
ons+0.022
 she+0.022
Neg logit contributions
 fault-0.041
 tant-0.039
 pad-0.035
 second-0.032
 asleep-0.032
Token Suddenly
Token ID24975
Feature activation+1.93e-02

Pos logit contributions
apy+0.001
keys+0.001
 live+0.001
 she+0.001
ons+0.001
Neg logit contributions
 fault-0.004
 tant-0.004
 pad-0.003
 second-0.003
 plate-0.003
Token,
Token ID11
Feature activation-2.73e-02

Pos logit contributions
keys+0.001
 live+0.001
mm+0.001
 expect+0.001
 contribute+0.001
Neg logit contributions
 fault-0.001
 tant-0.001
 second-0.001
 asleep-0.001
 pad-0.001
Token she
Token ID673
Feature activation+7.13e-02

Pos logit contributions
 tant+0.002
 fault+0.001
 second+0.001
 pad+0.001
 plate+0.001
Neg logit contributions
keys-0.001
 live-0.000
apy-0.000
ons-0.000
mm-0.000
Token She
Token ID1375
Feature activation+2.73e-01

Pos logit contributions
 fault+0.002
 tant+0.002
 pad+0.001
 second+0.001
 plate+0.001
Neg logit contributions
apy-0.000
keys-0.000
 live-0.000
 she-0.000
ons-0.000
Token cried
Token ID16896
Feature activation+1.54e-01

Pos logit contributions
apy+0.006
keys+0.006
 live+0.006
 she+0.006
ons+0.005
Neg logit contributions
 fault-0.014
 tant-0.014
 pad-0.012
 second-0.011
 plate-0.011
Token and
Token ID290
Feature activation-6.49e-03

Pos logit contributions
apy+0.005
 she+0.005
 live+0.005
 they+0.005
keys+0.005
Neg logit contributions
 fault-0.007
 tant-0.006
 pad-0.005
 puck-0.005
 plate-0.005
Token threw
Token ID9617
Feature activation-1.57e-01

Pos logit contributions
 tant+0.000
 fault+0.000
 pad+0.000
 second+0.000
 plate+0.000
Neg logit contributions
keys-0.000
apy-0.000
 live-0.000
ons-0.000
mm-0.000
Token a
Token ID257
Feature activation-8.82e-02

Pos logit contributions
 fault+0.010
rett+0.008
 pad+0.008
ingo+0.008
 puck+0.008
Neg logit contributions
 she-0.003
apy-0.002
 live-0.002
ons-0.002
 they-0.002
Token tant
Token ID24246
Feature activation-3.45e+00

Pos logit contributions
 fault+0.003
 habitat+0.002
rett+0.002
ingo+0.002
 lair+0.002
Neg logit contributions
apy-0.004
 she-0.004
bs-0.004
ons-0.004
 live-0.004
Tokenrum
Token ID6582
Feature activation+8.27e-01

Pos logit contributions
 fault+0.127
 tant+0.126
 pad+0.102
 second+0.092
 plate+0.091
Neg logit contributions
apy-0.130
 live-0.128
keys-0.126
 she-0.125
 they-0.120
Token.
Token ID13
Feature activation-9.79e-03

Pos logit contributions
apy+0.022
keys+0.022
 live+0.021
 she+0.021
ons+0.020
Neg logit contributions
 fault-0.040
 tant-0.039
 pad-0.034
 second-0.032
 plate-0.031
Token Her
Token ID2332
Feature activation+1.96e-01

Pos logit contributions
 fault+0.001
 tant+0.000
 pad+0.000
 second+0.000
 asleep+0.000
Neg logit contributions
keys-0.000
apy-0.000
 live-0.000
ons-0.000
 expect-0.000
Token mom
Token ID1995
Feature activation+1.42e-01

Pos logit contributions
apy+0.006
 she+0.005
 live+0.005
 they+0.005
 we+0.005
Neg logit contributions
 fault-0.009
 tant-0.009
 pad-0.008
rett-0.008
uster-0.007
Token tried
Token ID3088
Feature activation-4.96e-01

Pos logit contributions
apy+0.004
keys+0.003
 live+0.003
 she+0.003
ons+0.003
Neg logit contributions
 fault-0.007
 tant-0.007
 pad-0.006
 second-0.006
 plate-0.006
Token his
Token ID465
Feature activation-2.10e-01

Pos logit contributions
 they+0.011
apy+0.011
 she+0.011
 live+0.010
 lived+0.010
Neg logit contributions
 fault-0.022
 pad-0.018
 tant-0.017
 plate-0.016
 asleep-0.015
Token feet
Token ID3625
Feature activation-5.97e-02

Pos logit contributions
 fault+0.008
 tant+0.007
rett+0.006
icer+0.006
joy+0.006
Neg logit contributions
apy-0.007
 she-0.007
 we-0.006
keys-0.006
 they-0.006
Token and
Token ID290
Feature activation+2.47e-02

Pos logit contributions
 fault+0.003
 tant+0.003
die+0.002
joy+0.002
 pad+0.002
Neg logit contributions
 she-0.002
 they-0.001
apy-0.001
 live-0.001
 we-0.001
Token threw
Token ID9617
Feature activation-6.10e-02

Pos logit contributions
apy+0.001
 live+0.001
keys+0.001
 she+0.001
ons+0.001
Neg logit contributions
 fault-0.001
 tant-0.001
 pad-0.001
 second-0.001
 plate-0.001
Token a
Token ID257
Feature activation+2.33e-01

Pos logit contributions
 fault+0.003
rett+0.003
 pad+0.003
 lair+0.003
 angle+0.003
Neg logit contributions
 they-0.001
 she-0.001
 you-0.001
apy-0.001
 live-0.001
Token tant
Token ID24246
Feature activation-3.30e+00

Pos logit contributions
apy+0.010
 she+0.009
bs+0.009
ons+0.008
 they+0.008
Neg logit contributions
 fault-0.009
 habitat-0.007
rett-0.006
 lair-0.006
joy-0.006
Tokenrum
Token ID6582
Feature activation+9.00e-01

Pos logit contributions
 fault+0.127
 tant+0.123
 pad+0.104
 second+0.094
 plate+0.093
Neg logit contributions
apy-0.120
keys-0.117
 live-0.116
 she-0.114
ons-0.110
Token.
Token ID13
Feature activation-1.17e-02

Pos logit contributions
apy+0.022
 she+0.021
 live+0.020
keys+0.019
 they+0.019
Neg logit contributions
 fault-0.046
 tant-0.046
 pad-0.040
 plate-0.038
 second-0.037
Token
Token ID198
Feature activation-4.91e-02

Pos logit contributions
 fault+0.001
 tant+0.001
 pad+0.001
 plate+0.001
 angle+0.001
Neg logit contributions
apy-0.000
 she-0.000
 they-0.000
 we-0.000
 People-0.000
Token
Token ID198
Feature activation-9.55e-02

Pos logit contributions
 fault+0.002
 tant+0.002
 pad+0.002
 plate+0.002
 asleep+0.002
Neg logit contributions
apy-0.001
 live-0.001
 she-0.001
keys-0.001
 they-0.001
Token"
Token ID1
Feature activation-3.88e-01

Pos logit contributions
 fault+0.005
 tant+0.005
 pad+0.004
 second+0.004
 plate+0.004
Neg logit contributions
apy-0.002
keys-0.002
 live-0.002
 she-0.002
ons-0.002
Token years
Token ID812
Feature activation-9.21e-01

Pos logit contributions
apy+0.015
keys+0.014
 live+0.014
mm+0.013
 she+0.013
Neg logit contributions
 fault-0.018
 tant-0.017
 pad-0.015
 second-0.014
 plate-0.013
Token.
Token ID13
Feature activation-3.29e-02

Pos logit contributions
 fault+0.042
 tant+0.041
 pad+0.036
 second+0.033
 plate+0.032
Neg logit contributions
apy-0.028
keys-0.026
 live-0.026
 she-0.025
ons-0.024
Token
Token ID198
Feature activation+7.13e-02

Pos logit contributions
 fault+0.002
 tant+0.002
 pad+0.001
 second+0.001
 plate+0.001
Neg logit contributions
apy-0.001
 live-0.001
keys-0.001
 she-0.001
ons-0.001
Token
Token ID198
Feature activation-2.44e-02

Pos logit contributions
apy+0.002
 live+0.002
 she+0.002
keys+0.002
 lived+0.002
Neg logit contributions
 fault-0.003
 tant-0.003
 pad-0.003
 plate-0.003
 puck-0.003
TokenThe
Token ID464
Feature activation-2.44e-01

Pos logit contributions
 fault+0.001
 tant+0.001
 pad+0.001
 plate+0.001
 second+0.001
Neg logit contributions
apy-0.001
keys-0.001
 live-0.001
 she-0.001
ons-0.001
Token End
Token ID5268
Feature activation-2.96e+00

Pos logit contributions
 fault+0.010
 tant+0.009
 pad+0.008
rett+0.007
 asleep+0.007
Neg logit contributions
keys-0.009
 she-0.009
 live-0.008
apy-0.008
 we-0.008
Token.
Token ID13
Feature activation-3.81e-02

Pos logit contributions
 tant+0.111
 fault+0.110
 pad+0.091
 second+0.091
 plate+0.089
Neg logit contributions
apy-0.101
keys-0.098
 live-0.097
ons-0.089
mm-0.088
Token<|endoftext|>
Token ID50256
Feature activation-8.03e-01

Pos logit contributions
 fault+0.002
 tant+0.002
 pad+0.002
 puck+0.001
 plate+0.001
Neg logit contributions
apy-0.001
 live-0.001
 she-0.001
keys-0.001
 they-0.001
TokenOnce
Token ID7454
Feature activation+3.63e-01

Pos logit contributions
 fault+0.034
 tant+0.033
 pad+0.029
 second+0.026
 asleep+0.026
Neg logit contributions
apy-0.026
keys-0.026
 live-0.024
 she-0.024
 they-0.023
Token a
Token ID257
Feature activation+1.34e-01

Pos logit contributions
 she+0.016
apy+0.015
 they+0.015
 live+0.015
 lived+0.015
Neg logit contributions
 fault-0.012
 tant-0.012
 pad-0.009
 puck-0.009
 plate-0.009
Token day
Token ID1110
Feature activation+5.65e-01

Pos logit contributions
 live+0.005
 she+0.005
 lived+0.005
 they+0.005
keys+0.005
Neg logit contributions
 fault-0.005
 tant-0.005
 asleep-0.004
 puck-0.004
 pad-0.004
Token!"
Token ID2474
Feature activation+1.94e-02

Pos logit contributions
 fault+0.002
 tant+0.002
 pad+0.002
 plate+0.002
 second+0.002
Neg logit contributions
apy-0.001
keys-0.001
 live-0.001
 she-0.001
ons-0.001
Token
Token ID220
Feature activation-3.83e-01

Pos logit contributions
apy+0.001
keys+0.001
 live+0.001
ons+0.001
 she+0.001
Neg logit contributions
 fault-0.001
 tant-0.001
 pad-0.001
 second-0.001
 plate-0.001
Token
Token ID198
Feature activation-7.89e-02

Pos logit contributions
 fault+0.016
 tant+0.016
 pad+0.014
 second+0.012
 plate+0.012
Neg logit contributions
apy-0.012
keys-0.012
 live-0.012
 she-0.011
ons-0.011
Token
Token ID198
Feature activation-1.35e-01

Pos logit contributions
 fault+0.004
 tant+0.004
 pad+0.003
 plate+0.003
 puck+0.003
Neg logit contributions
apy-0.002
 live-0.002
keys-0.002
 she-0.002
ons-0.002
TokenThe
Token ID464
Feature activation-2.80e-01

Pos logit contributions
 fault+0.006
 tant+0.006
 pad+0.006
 second+0.005
 plate+0.005
Neg logit contributions
apy-0.004
keys-0.004
 live-0.004
 she-0.003
ons-0.003
Token End
Token ID5268
Feature activation-2.96e+00

Pos logit contributions
 fault+0.011
 tant+0.010
 pad+0.008
rett+0.007
 asleep+0.007
Neg logit contributions
 she-0.011
keys-0.011
 live-0.011
apy-0.010
 they-0.010
Token.
Token ID13
Feature activation-1.17e-01

Pos logit contributions
 fault+0.120
 tant+0.120
 pad+0.101
 second+0.098
 plate+0.096
Neg logit contributions
apy-0.095
keys-0.094
 live-0.092
ons-0.085
mm-0.084
Token<|endoftext|>
Token ID50256
Feature activation-7.21e-01

Pos logit contributions
 fault+0.006
 tant+0.006
 pad+0.005
 puck+0.005
 plate+0.005
Neg logit contributions
apy-0.003
 she-0.003
 live-0.003
 they-0.003
keys-0.003
TokenThe
Token ID464
Feature activation-1.64e-01

Pos logit contributions
 fault+0.031
 tant+0.031
 pad+0.027
 second+0.024
 asleep+0.024
Neg logit contributions
apy-0.022
keys-0.022
 live-0.021
 she-0.021
ons-0.020
Token puppy
Token ID26188
Feature activation-1.34e-02

Pos logit contributions
 fault+0.007
 tant+0.006
 pad+0.005
 puck+0.004
 asleep+0.004
Neg logit contributions
 she-0.006
 live-0.006
keys-0.006
 they-0.006
apy-0.006
Token was
Token ID373
Feature activation+2.91e-01

Pos logit contributions
 fault+0.001
 tant+0.001
 pad+0.001
 plate+0.000
 asleep+0.000
Neg logit contributions
apy-0.000
keys-0.000
 live-0.000
mm-0.000
ons-0.000
Token!"
Token ID2474
Feature activation+9.00e-02

Pos logit contributions
 fault+0.004
 tant+0.004
 pad+0.003
 second+0.003
 plate+0.003
Neg logit contributions
apy-0.003
keys-0.002
 live-0.002
 she-0.002
ons-0.002
Token
Token ID220
Feature activation-3.13e-01

Pos logit contributions
apy+0.003
keys+0.003
 live+0.003
 she+0.002
ons+0.002
Neg logit contributions
 fault-0.004
 tant-0.004
 pad-0.003
 second-0.003
 plate-0.003
Token
Token ID198
Feature activation-2.12e-02

Pos logit contributions
 fault+0.014
 tant+0.013
 pad+0.012
 second+0.011
 plate+0.010
Neg logit contributions
apy-0.010
keys-0.009
 live-0.009
 she-0.009
ons-0.009
Token
Token ID198
Feature activation-8.79e-02

Pos logit contributions
 fault+0.001
 tant+0.001
 pad+0.001
 second+0.001
 puck+0.001
Neg logit contributions
apy-0.001
keys-0.001
 live-0.001
 she-0.001
ons-0.001
TokenThe
Token ID464
Feature activation-1.79e-01

Pos logit contributions
 fault+0.004
 tant+0.004
 pad+0.003
 second+0.003
 puck+0.003
Neg logit contributions
apy-0.003
 live-0.002
keys-0.002
 she-0.002
ons-0.002
Token End
Token ID5268
Feature activation-2.94e+00

Pos logit contributions
 fault+0.007
 tant+0.007
 pad+0.006
 puck+0.005
 asleep+0.005
Neg logit contributions
 live-0.007
keys-0.006
 she-0.006
apy-0.006
 they-0.006
Token.
Token ID13
Feature activation-8.38e-02

Pos logit contributions
 fault+0.119
 tant+0.119
 pad+0.100
 second+0.097
 plate+0.093
Neg logit contributions
apy-0.096
keys-0.094
 live-0.092
ons-0.085
mm-0.084
Token<|endoftext|>
Token ID50256
Feature activation-7.02e-01

Pos logit contributions
 fault+0.004
 tant+0.003
 pad+0.003
 second+0.003
 plate+0.003
Neg logit contributions
apy-0.003
keys-0.003
 live-0.003
 she-0.002
ons-0.002
TokenOnce
Token ID7454
Feature activation+4.98e-01

Pos logit contributions
 fault+0.030
 tant+0.029
 pad+0.025
 second+0.023
 asleep+0.023
Neg logit contributions
apy-0.023
keys-0.022
 live-0.021
 she-0.021
 they-0.020
Token upon
Token ID2402
Feature activation-2.62e-02

Pos logit contributions
 she+0.022
apy+0.021
 they+0.021
 live+0.021
 lived+0.020
Neg logit contributions
 fault-0.016
 tant-0.015
 pad-0.012
 puck-0.012
 plate-0.011
Token a
Token ID257
Feature activation+1.64e-01

Pos logit contributions
 fault+0.001
 tant+0.001
 pad+0.001
 plate+0.001
 puck+0.001
Neg logit contributions
 live-0.001
apy-0.001
 she-0.001
keys-0.001
 they-0.001
Token.
Token ID13
Feature activation-1.36e-01

Pos logit contributions
 fault+0.023
 tant+0.023
 pad+0.020
 second+0.018
 plate+0.018
Neg logit contributions
apy-0.013
keys-0.013
 live-0.013
 she-0.012
ons-0.012
Token
Token ID220
Feature activation-3.39e-01

Pos logit contributions
 fault+0.006
 tant+0.006
 pad+0.005
 second+0.005
 plate+0.005
Neg logit contributions
apy-0.004
 live-0.004
keys-0.004
 she-0.004
ons-0.003
Token
Token ID198
Feature activation-1.14e-01

Pos logit contributions
 fault+0.015
 tant+0.014
 pad+0.012
 second+0.011
 puck+0.011
Neg logit contributions
apy-0.011
keys-0.011
 live-0.010
 she-0.010
ons-0.010
Token
Token ID198
Feature activation-1.75e-01

Pos logit contributions
 fault+0.006
 tant+0.006
 pad+0.005
 plate+0.005
 asleep+0.004
Neg logit contributions
apy-0.003
 live-0.003
 she-0.003
keys-0.003
ons-0.003
TokenThe
Token ID464
Feature activation-2.61e-01

Pos logit contributions
 fault+0.009
 tant+0.009
 pad+0.008
 plate+0.007
 asleep+0.007
Neg logit contributions
apy-0.004
 live-0.004
keys-0.004
 she-0.004
ons-0.004
Token End
Token ID5268
Feature activation-2.93e+00

Pos logit contributions
 fault+0.011
 tant+0.010
 pad+0.009
rett+0.009
 asleep+0.008
Neg logit contributions
 live-0.009
keys-0.009
 she-0.009
apy-0.009
 lived-0.008
Token!
Token ID0
Feature activation-2.71e-01

Pos logit contributions
 fault+0.115
 tant+0.114
 pad+0.094
 second+0.092
 plate+0.090
Neg logit contributions
apy-0.098
keys-0.096
 live-0.094
ons-0.087
 she-0.086
Token<|endoftext|>
Token ID50256
Feature activation-8.01e-01

Pos logit contributions
 fault+0.014
 tant+0.014
 pad+0.011
 puck+0.011
 plate+0.011
Neg logit contributions
apy-0.007
 live-0.007
 she-0.007
 they-0.006
keys-0.006
TokenOnce
Token ID7454
Feature activation+3.24e-01

Pos logit contributions
 fault+0.035
 tant+0.035
 pad+0.030
 second+0.028
 plate+0.027
Neg logit contributions
apy-0.024
keys-0.024
 live-0.023
 she-0.023
ons-0.022
Token upon
Token ID2402
Feature activation-1.14e-01

Pos logit contributions
 she+0.014
 they+0.013
apy+0.013
 live+0.013
 lived+0.013
Neg logit contributions
 fault-0.011
 tant-0.011
 pad-0.009
 puck-0.008
 plate-0.008
Token a
Token ID257
Feature activation+1.67e-02

Pos logit contributions
 fault+0.005
 tant+0.005
 pad+0.004
 plate+0.004
 puck+0.004
Neg logit contributions
 live-0.004
 she-0.004
apy-0.004
keys-0.003
 they-0.003
Token better
Token ID1365
Feature activation+1.93e-01

Pos logit contributions
 fault+0.014
 tant+0.013
 pad+0.011
 second+0.010
 asleep+0.010
Neg logit contributions
apy-0.012
keys-0.012
 live-0.012
ons-0.011
 she-0.011
Token.
Token ID13
Feature activation-3.06e-02

Pos logit contributions
 she+0.005
 live+0.005
apy+0.005
keys+0.004
 they+0.004
Neg logit contributions
 fault-0.009
 tant-0.009
 pad-0.008
 puck-0.008
 second-0.008
Token
Token ID198
Feature activation+3.55e-02

Pos logit contributions
 fault+0.002
 tant+0.001
 pad+0.001
 second+0.001
 plate+0.001
Neg logit contributions
apy-0.001
keys-0.001
 live-0.001
 she-0.001
ons-0.001
Token
Token ID198
Feature activation-9.32e-02

Pos logit contributions
apy+0.001
 live+0.001
 she+0.001
keys+0.001
ons+0.001
Neg logit contributions
 fault-0.002
 tant-0.002
 pad-0.001
 plate-0.001
 asleep-0.001
TokenThe
Token ID464
Feature activation-2.20e-01

Pos logit contributions
 fault+0.004
 tant+0.004
 pad+0.004
 second+0.004
 plate+0.004
Neg logit contributions
apy-0.003
keys-0.002
 live-0.002
 she-0.002
ons-0.002
Token End
Token ID5268
Feature activation-2.93e+00

Pos logit contributions
 fault+0.009
 tant+0.008
 pad+0.007
rett+0.006
 asleep+0.006
Neg logit contributions
 live-0.008
 she-0.008
keys-0.008
apy-0.008
 we-0.007
Token.
Token ID13
Feature activation-7.82e-02

Pos logit contributions
 tant+0.110
 fault+0.109
 second+0.088
 pad+0.088
 plate+0.087
Neg logit contributions
apy-0.099
keys-0.098
 live-0.096
ons-0.089
mm-0.087
Token<|endoftext|>
Token ID50256
Feature activation-6.67e-01

Pos logit contributions
 fault+0.004
 tant+0.004
 pad+0.003
 puck+0.003
 plate+0.003
Neg logit contributions
apy-0.002
 live-0.002
 she-0.002
keys-0.002
 they-0.002
TokenOnce
Token ID7454
Feature activation+4.85e-01

Pos logit contributions
 fault+0.028
 tant+0.027
 pad+0.024
 second+0.022
 asleep+0.022
Neg logit contributions
apy-0.022
keys-0.021
 live-0.020
 she-0.020
 they-0.019
Token there
Token ID612
Feature activation-2.01e-01

Pos logit contributions
 she+0.021
 they+0.020
apy+0.020
 live+0.020
 lived+0.019
Neg logit contributions
 fault-0.016
 tant-0.016
 pad-0.013
 puck-0.012
 plate-0.012
Token was
Token ID373
Feature activation+3.31e-01

Pos logit contributions
 fault+0.012
 tant+0.011
 pad+0.010
 puck+0.010
 plate+0.010
Neg logit contributions
 she-0.004
apy-0.004
 live-0.004
 lived-0.004
 they-0.003
Token!"
Token ID2474
Feature activation+2.24e-01

Pos logit contributions
 fault+0.001
 tant+0.001
 pad+0.001
 second+0.001
 plate+0.001
Neg logit contributions
keys-0.000
apy-0.000
 live-0.000
ves-0.000
ons-0.000
Token
Token ID220
Feature activation-3.67e-01

Pos logit contributions
apy+0.006
keys+0.006
 live+0.006
ons+0.006
 she+0.006
Neg logit contributions
 fault-0.010
 tant-0.010
 pad-0.009
 second-0.008
 plate-0.008
Token
Token ID198
Feature activation+3.69e-02

Pos logit contributions
 fault+0.017
 tant+0.017
 pad+0.014
 plate+0.013
 second+0.013
Neg logit contributions
apy-0.011
 live-0.010
keys-0.010
 she-0.010
ons-0.010
Token\n
Token ID198
Feature activation-8.31e-02

Pos logit contributions
apy+0.001
 live+0.001
keys+0.001
 she+0.001
ons+0.001
Neg logit contributions
 fault-0.002
 tant-0.002
 pad-0.001
 second-0.001
 plate-0.001
TokenThe
Token ID464
Feature activation-1.39e-01

Pos logit contributions
 fault+0.004
 tant+0.004
 pad+0.003
 second+0.003
 plate+0.003
Neg logit contributions
apy-0.002
keys-0.002
 live-0.002
 she-0.002
ons-0.002
Token End
Token ID5268
Feature activation-2.93e+00

Pos logit contributions
 fault+0.005
 tant+0.005
 pad+0.004
rett+0.004
 asleep+0.004
Neg logit contributions
keys-0.005
 she-0.005
 live-0.005
apy-0.005
 they-0.005
Token<|endoftext|>
Token ID50256
Feature activation-6.87e-01

Pos logit contributions
 fault+0.119
 tant+0.119
 pad+0.100
 second+0.097
 plate+0.094
Neg logit contributions
apy-0.095
keys-0.094
 live-0.091
ons-0.084
mm-0.082
TokenOnce
Token ID7454
Feature activation+4.67e-01

Pos logit contributions
 fault+0.028
 tant+0.028
 pad+0.024
 second+0.022
 asleep+0.022
Neg logit contributions
apy-0.023
keys-0.022
 live-0.021
 she-0.021
 they-0.020
Token upon
Token ID2402
Feature activation+2.87e-02

Pos logit contributions
 she+0.019
 they+0.018
 lived+0.017
apy+0.017
 live+0.017
Neg logit contributions
 fault-0.017
 tant-0.017
 puck-0.014
 pad-0.014
 plate-0.013
Token a
Token ID257
Feature activation+1.71e-01

Pos logit contributions
 live+0.001
 she+0.001
apy+0.001
keys+0.001
 they+0.001
Neg logit contributions
 tant-0.001
 fault-0.001
 pad-0.001
 plate-0.001
 puck-0.001
Token time
Token ID640
Feature activation-9.93e-02

Pos logit contributions
 she+0.007
 they+0.007
apy+0.006
 live+0.006
keys+0.006
Neg logit contributions
 fault-0.006
 tant-0.006
 pad-0.005
 asleep-0.005
 puck-0.004
Token could
Token ID714
Feature activation+2.05e-01

Pos logit contributions
 fault+0.014
 tant+0.013
 pad+0.012
 second+0.011
 plate+0.011
Neg logit contributions
apy-0.007
keys-0.006
 live-0.006
 she-0.006
ons-0.006
Token go
Token ID467
Feature activation+1.58e-01

Pos logit contributions
apy+0.006
keys+0.005
 live+0.005
 she+0.005
ons+0.005
Neg logit contributions
 fault-0.010
 tant-0.009
 pad-0.008
 second-0.008
 plate-0.008
Token out
Token ID503
Feature activation-2.28e-01

Pos logit contributions
 she+0.006
 live+0.006
apy+0.006
 they+0.006
keys+0.006
Neg logit contributions
 fault-0.006
 tant-0.006
 pad-0.005
 plate-0.004
 second-0.004
Token and
Token ID290
Feature activation-7.83e-02

Pos logit contributions
 fault+0.011
 tant+0.010
 pad+0.009
 puck+0.008
 plate+0.008
Neg logit contributions
 she-0.007
 live-0.006
apy-0.006
 they-0.006
ons-0.006
Token explore
Token ID7301
Feature activation+9.70e-01

Pos logit contributions
 fault+0.003
 tant+0.003
 pad+0.003
 puck+0.003
 second+0.003
Neg logit contributions
keys-0.003
apy-0.002
mm-0.002
ons-0.002
ves-0.002
Token without
Token ID1231
Feature activation+1.80e+00

Pos logit contributions
apy+0.031
keys+0.030
 live+0.029
 she+0.028
ons+0.028
Neg logit contributions
 fault-0.041
 tant-0.041
 pad-0.035
 second-0.033
 asleep-0.032
Token fear
Token ID3252
Feature activation+2.35e-02

Pos logit contributions
 she+0.054
 live+0.052
apy+0.051
 they+0.049
 you+0.048
Neg logit contributions
 fault-0.080
 tant-0.080
 pad-0.070
 plate-0.063
 puck-0.062
Token of
Token ID286
Feature activation+4.54e-01

Pos logit contributions
apy+0.001
 live+0.001
 she+0.001
keys+0.001
 they+0.001
Neg logit contributions
 fault-0.001
 tant-0.001
 pad-0.001
 puck-0.001
 plate-0.001
Token being
Token ID852
Feature activation-2.72e-01

Pos logit contributions
 she+0.012
 live+0.012
apy+0.011
 they+0.010
ons+0.010
Neg logit contributions
 tant-0.022
 fault-0.022
 pad-0.019
 plate-0.018
 puck-0.018
Token seen
Token ID1775
Feature activation+7.35e-01

Pos logit contributions
 fault+0.012
 tant+0.012
 pad+0.010
 plate+0.010
 puck+0.010
Neg logit contributions
apy-0.008
 live-0.008
 she-0.008
keys-0.008
ons-0.007
Token!
Token ID0
Feature activation-9.72e-02

Pos logit contributions
 she+0.016
 live+0.016
apy+0.016
keys+0.015
ons+0.015
Neg logit contributions
 fault-0.038
 tant-0.038
 pad-0.032
 puck-0.031
 second-0.031
Token and
Token ID290
Feature activation-8.96e-02

Pos logit contributions
apy+0.001
 she+0.001
 live+0.001
keys+0.001
 they+0.001
Neg logit contributions
 fault-0.003
 tant-0.003
 pad-0.002
 plate-0.002
 puck-0.002
Token said
Token ID531
Feature activation-2.27e-01

Pos logit contributions
 tant+0.004
 fault+0.004
 pad+0.004
 second+0.003
 puck+0.003
Neg logit contributions
apy-0.002
keys-0.002
mm-0.002
 live-0.002
ons-0.002
Token the
Token ID262
Feature activation+2.37e-02

Pos logit contributions
 fault+0.010
 tant+0.010
 pad+0.009
 second+0.008
 asleep+0.008
Neg logit contributions
apy-0.006
 live-0.006
keys-0.006
ons-0.006
 she-0.006
Token rh
Token ID9529
Feature activation-8.80e-01

Pos logit contributions
 live+0.001
 lived+0.001
keys+0.001
 you+0.001
 we+0.001
Neg logit contributions
 tant-0.001
 fault-0.001
rett-0.001
 pad-0.001
ingo-0.001
Tokenin
Token ID259
Feature activation-7.63e-01

Pos logit contributions
 fault+0.034
 tant+0.032
 pad+0.027
 plate+0.024
 second+0.024
Neg logit contributions
keys-0.033
apy-0.033
 live-0.032
 she-0.031
ons-0.031
Tokenoc
Token ID420
Feature activation+3.11e+00

Pos logit contributions
 fault+0.040
 tant+0.039
 pad+0.034
 asleep+0.032
 puck+0.032
Neg logit contributions
keys-0.017
apy-0.017
 live-0.017
 she-0.016
 lived-0.016
Tokeneros
Token ID27498
Feature activation-7.53e-01

Pos logit contributions
keys+0.105
apy+0.105
 live+0.102
 she+0.098
ons+0.098
Neg logit contributions
 fault-0.126
 tant-0.123
 pad-0.106
 second-0.096
 plate-0.095
Token didn
Token ID1422
Feature activation-1.31e-01

Pos logit contributions
 fault+0.037
 tant+0.036
 pad+0.032
 second+0.030
 plate+0.030
Neg logit contributions
apy-0.019
 live-0.018
keys-0.018
 she-0.017
ons-0.016
Token't
Token ID470
Feature activation-1.86e-01

Pos logit contributions
 fault+0.006
 tant+0.006
 second+0.005
 pad+0.005
 plate+0.005
Neg logit contributions
apy-0.003
keys-0.003
 live-0.003
 lived-0.003
ons-0.003
Token need
Token ID761
Feature activation-1.05e+00

Pos logit contributions
 fault+0.009
 tant+0.009
 pad+0.008
 second+0.007
 plate+0.007
Neg logit contributions
apy-0.005
keys-0.005
 live-0.005
 she-0.005
ons-0.005
Token to
Token ID284
Feature activation-3.31e-01

Pos logit contributions
 tant+0.040
 fault+0.040
 pad+0.034
 puck+0.030
 plate+0.030
Neg logit contributions
 live-0.038
 she-0.037
apy-0.037
keys-0.036
ons-0.036
Token right
Token ID826
Feature activation-3.17e-01

Pos logit contributions
 tant+0.004
rett+0.004
 pad+0.004
ingo+0.003
 puck+0.003
Neg logit contributions
 you-0.003
 she-0.003
 we-0.003
 live-0.003
 they-0.002
Token thing
Token ID1517
Feature activation-4.97e-01

Pos logit contributions
 tant+0.013
 pad+0.011
 fault+0.011
 asleep+0.010
uster+0.010
Neg logit contributions
 live-0.010
 they-0.009
 she-0.009
 we-0.009
 lived-0.009
Token to
Token ID284
Feature activation-3.16e-01

Pos logit contributions
 fault+0.020
 tant+0.019
 pad+0.016
 second+0.014
 plate+0.014
Neg logit contributions
apy-0.018
keys-0.017
 live-0.017
 she-0.016
ons-0.016
Token do
Token ID466
Feature activation-3.98e-01

Pos logit contributions
 fault+0.016
 tant+0.015
 pad+0.014
 second+0.013
 plate+0.013
Neg logit contributions
apy-0.008
keys-0.007
ons-0.007
 she-0.007
mm-0.006
Token,
Token ID11
Feature activation-1.24e-01

Pos logit contributions
 fault+0.018
 tant+0.016
 pad+0.014
 second+0.013
 asleep+0.013
Neg logit contributions
apy-0.013
keys-0.012
 live-0.012
mm-0.011
 she-0.011
Token without
Token ID1231
Feature activation+1.92e+00

Pos logit contributions
 tant+0.005
 fault+0.005
 pad+0.004
 plate+0.004
 puck+0.004
Neg logit contributions
 she-0.004
 live-0.004
apy-0.004
 they-0.004
 we-0.004
Token compromising
Token ID35294
Feature activation-3.25e-01

Pos logit contributions
 she+0.063
 live+0.063
apy+0.061
 they+0.061
keys+0.060
Neg logit contributions
 tant-0.079
 fault-0.078
 pad-0.067
 puck-0.063
 plate-0.061
Token your
Token ID534
Feature activation-3.45e-01

Pos logit contributions
 fault+0.014
 tant+0.014
 pad+0.012
 second+0.011
 plate+0.011
Neg logit contributions
apy-0.010
 live-0.010
keys-0.010
 she-0.009
ons-0.009
Token principles
Token ID7811
Feature activation-2.62e-01

Pos logit contributions
 tant+0.014
 fault+0.013
 pad+0.013
 asleep+0.012
 puck+0.012
Neg logit contributions
 live-0.011
apy-0.010
 you-0.010
ons-0.010
 we-0.010
Token and
Token ID290
Feature activation-7.52e-02

Pos logit contributions
 tant+0.012
 fault+0.012
 pad+0.011
 second+0.010
 puck+0.010
Neg logit contributions
 she-0.007
apy-0.007
 live-0.007
ons-0.006
keys-0.006
Token values
Token ID3815
Feature activation-9.40e-01

Pos logit contributions
 fault+0.003
 tant+0.003
 pad+0.003
 second+0.002
 plate+0.002
Neg logit contributions
apy-0.002
keys-0.002
 live-0.002
 she-0.002
mm-0.002
Token He
Token ID679
Feature activation-1.88e-01

Pos logit contributions
apy+0.001
keys+0.001
 live+0.001
 she+0.001
ons+0.001
Neg logit contributions
 fault-0.001
 tant-0.001
 pad-0.001
 second-0.001
 plate-0.001
Token was
Token ID373
Feature activation+2.30e-01

Pos logit contributions
 fault+0.010
 tant+0.010
 pad+0.008
 plate+0.008
 second+0.008
Neg logit contributions
apy-0.004
 live-0.004
keys-0.004
 she-0.004
 lived-0.004
Token excited
Token ID6568
Feature activation+2.51e-01

Pos logit contributions
 live+0.007
 she+0.007
apy+0.007
 they+0.007
 lived+0.007
Neg logit contributions
 tant-0.010
 fault-0.010
 pad-0.008
 plate-0.008
ingo-0.008
Token,
Token ID11
Feature activation-7.87e-02

Pos logit contributions
 she+0.009
 they+0.008
 live+0.008
 we+0.008
 you+0.008
Neg logit contributions
 tant-0.011
 fault-0.010
 pad-0.008
 puck-0.008
 second-0.008
Token and
Token ID290
Feature activation-1.77e-02

Pos logit contributions
 fault+0.003
 tant+0.003
 pad+0.003
 plate+0.002
 second+0.002
Neg logit contributions
 she-0.003
 they-0.003
apy-0.003
 live-0.003
 we-0.003
Token without
Token ID1231
Feature activation+1.90e+00

Pos logit contributions
 tant+0.001
 fault+0.001
 pad+0.001
 second+0.001
 puck+0.001
Neg logit contributions
keys-0.000
apy-0.000
mm-0.000
 live-0.000
ons-0.000
Token wasting
Token ID24430
Feature activation-2.15e-01

Pos logit contributions
 she+0.064
 live+0.063
 they+0.060
apy+0.060
 expect+0.057
Neg logit contributions
 fault-0.079
 tant-0.078
 pad-0.066
 plate-0.061
 puck-0.060
Token any
Token ID597
Feature activation+5.92e-01

Pos logit contributions
 fault+0.009
 tant+0.009
 pad+0.008
 second+0.007
 puck+0.007
Neg logit contributions
apy-0.007
 live-0.007
 she-0.007
ons-0.006
keys-0.006
Token time
Token ID640
Feature activation+1.34e-02

Pos logit contributions
apy+0.022
keys+0.022
 live+0.021
 she+0.021
mm+0.020
Neg logit contributions
 fault-0.022
 tant-0.022
 pad-0.018
 second-0.016
 plate-0.016
Token,
Token ID11
Feature activation-9.71e-02

Pos logit contributions
apy+0.000
keys+0.000
 live+0.000
mm+0.000
ons+0.000
Neg logit contributions
 fault-0.001
 tant-0.001
 pad-0.001
 second-0.000
 asleep-0.000
Token he
Token ID339
Feature activation-2.54e-01

Pos logit contributions
 fault+0.005
 tant+0.005
 pad+0.004
 puck+0.004
 plate+0.004
Neg logit contributions
 she-0.003
apy-0.002
 live-0.002
 they-0.002
keys-0.002
Token have
Token ID423
Feature activation-4.34e-01

Pos logit contributions
 fault+0.004
 tant+0.004
 pad+0.004
 plate+0.003
 puck+0.003
Neg logit contributions
 live-0.002
 she-0.002
apy-0.002
keys-0.002
 they-0.002
Token been
Token ID587
Feature activation+4.85e-02

Pos logit contributions
 tant+0.020
 fault+0.019
 pad+0.017
 plate+0.016
 puck+0.016
Neg logit contributions
 she-0.013
 live-0.012
 they-0.011
ons-0.011
 lived-0.011
Token able
Token ID1498
Feature activation-1.99e-01

Pos logit contributions
 she+0.002
 live+0.002
 they+0.001
ons+0.001
keys+0.001
Neg logit contributions
 tant-0.002
 fault-0.002
 pad-0.002
 plate-0.002
 puck-0.002
Token to
Token ID284
Feature activation-2.80e-01

Pos logit contributions
 tant+0.008
 fault+0.007
 pad+0.006
 puck+0.006
 plate+0.006
Neg logit contributions
 she-0.007
 live-0.007
 they-0.007
 we-0.007
apy-0.006
Token do
Token ID466
Feature activation-4.13e-01

Pos logit contributions
 fault+0.012
 tant+0.012
 pad+0.010
 second+0.010
 puck+0.010
Neg logit contributions
apy-0.009
keys-0.008
mm-0.008
ons-0.007
 she-0.007
Token without
Token ID1231
Feature activation+1.92e+00

Pos logit contributions
 fault+0.019
 tant+0.019
 pad+0.016
 second+0.015
 plate+0.015
Neg logit contributions
apy-0.012
 live-0.011
keys-0.011
 she-0.011
ons-0.010
Token his
Token ID465
Feature activation-4.90e-01

Pos logit contributions
 she+0.061
 live+0.060
apy+0.060
 they+0.058
keys+0.057
Neg logit contributions
 fault-0.081
 tant-0.080
 pad-0.069
 plate-0.063
 puck-0.062
Token own
Token ID898
Feature activation+5.39e-01

Pos logit contributions
 tant+0.019
 fault+0.019
 pad+0.016
 asleep+0.015
 plate+0.015
Neg logit contributions
apy-0.017
 live-0.016
 she-0.016
 they-0.016
keys-0.015
Token equipment
Token ID5112
Feature activation+1.04e-01

Pos logit contributions
apy+0.018
 live+0.018
 she+0.018
keys+0.017
 they+0.017
Neg logit contributions
 fault-0.022
 tant-0.021
 pad-0.018
 asleep-0.016
 plate-0.016
Token.
Token ID13
Feature activation-8.72e-03

Pos logit contributions
 she+0.002
 live+0.002
keys+0.002
 they+0.002
apy+0.002
Neg logit contributions
 tant-0.006
 fault-0.006
 pad-0.005
 second-0.004
 puck-0.004
Token He
Token ID679
Feature activation-2.49e-01

Pos logit contributions
 tant+0.000
 fault+0.000
 pad+0.000
 plate+0.000
 second+0.000
Neg logit contributions
 People-0.000
 she-0.000
apy-0.000
 live-0.000
 they-0.000
Token he
Token ID339
Feature activation-2.23e-01

Pos logit contributions
apy+0.003
 she+0.002
 live+0.002
keys+0.002
 they+0.002
Neg logit contributions
 fault-0.006
 tant-0.006
 pad-0.005
 plate-0.005
 second-0.005
Token saw
Token ID2497
Feature activation+7.02e-02

Pos logit contributions
 fault+0.011
 tant+0.011
 pad+0.009
 plate+0.009
 puck+0.009
Neg logit contributions
 live-0.006
apy-0.006
 she-0.006
keys-0.006
 lived-0.005
Token a
Token ID257
Feature activation+2.14e-01

Pos logit contributions
 she+0.002
apy+0.002
 they+0.002
 live+0.002
 lived+0.002
Neg logit contributions
 fault-0.003
 tant-0.003
 pad-0.003
 plate-0.003
ingo-0.003
Token ch
Token ID442
Feature activation+3.15e-01

Pos logit contributions
 she+0.007
apy+0.007
 live+0.007
keys+0.007
 they+0.007
Neg logit contributions
 fault-0.009
 tant-0.009
 pad-0.007
ingo-0.006
 asleep-0.006
Tokenubby
Token ID38393
Feature activation+4.16e-01

Pos logit contributions
apy+0.012
 live+0.012
keys+0.011
 she+0.011
ons+0.010
Neg logit contributions
 fault-0.012
 tant-0.011
 pad-0.010
 plate-0.009
 second-0.009
Token spider
Token ID19230
Feature activation-2.24e-02

Pos logit contributions
 she+0.017
 live+0.017
apy+0.016
keys+0.016
 they+0.016
Neg logit contributions
 fault-0.015
 tant-0.014
 pad-0.011
 second-0.010
rett-0.010
Token with
Token ID351
Feature activation+1.77e-01

Pos logit contributions
 fault+0.001
 tant+0.001
 pad+0.001
 plate+0.001
 asleep+0.001
Neg logit contributions
apy-0.001
keys-0.001
 live-0.001
mm-0.001
 she-0.001
Token a
Token ID257
Feature activation+2.03e-01

Pos logit contributions
apy+0.006
 she+0.006
 live+0.005
ons+0.005
 lived+0.005
Neg logit contributions
 fault-0.008
 tant-0.008
 pad-0.006
ingo-0.006
 plate-0.006
Token big
Token ID1263
Feature activation+1.77e-01

Pos logit contributions
apy+0.007
 she+0.007
 live+0.007
keys+0.006
 they+0.006
Neg logit contributions
 fault-0.008
 tant-0.008
ingo-0.006
 pad-0.006
rett-0.006
Token mug
Token ID25152
Feature activation+3.21e-01

Pos logit contributions
apy+0.006
 she+0.006
 live+0.006
keys+0.006
ons+0.005
Neg logit contributions
 fault-0.007
 tant-0.007
 pad-0.006
 second-0.005
 puck-0.005
Token.
Token ID13
Feature activation+1.70e-01

Pos logit contributions
apy+0.009
 live+0.008
keys+0.008
 she+0.008
ons+0.008
Neg logit contributions
 fault-0.015
 tant-0.015
 pad-0.013
 second-0.012
 puck-0.012
Token.
Token ID13
Feature activation+2.30e-01

Pos logit contributions
 fault+0.005
 tant+0.004
 pad+0.004
 plate+0.004
 second+0.004
Neg logit contributions
apy-0.004
keys-0.004
 live-0.004
 expect-0.004
ended-0.003
Token One
Token ID1881
Feature activation-1.66e-01

Pos logit contributions
apy+0.008
keys+0.007
 live+0.007
 she+0.007
ons+0.007
Neg logit contributions
 fault-0.010
 tant-0.009
 pad-0.008
 second-0.007
 plate-0.007
Token day
Token ID1110
Feature activation+6.02e-01

Pos logit contributions
 fault+0.006
 tant+0.006
 pad+0.005
 asleep+0.005
 puck+0.005
Neg logit contributions
apy-0.006
 she-0.006
 live-0.006
keys-0.006
 they-0.006
Token,
Token ID11
Feature activation+1.82e-01

Pos logit contributions
keys+0.022
 live+0.019
 expect+0.018
vere+0.018
 accomplish+0.018
Neg logit contributions
 tant-0.024
 fault-0.023
 pad-0.020
 second-0.019
 plate-0.018
Token her
Token ID607
Feature activation+8.59e-02

Pos logit contributions
keys+0.002
apy+0.002
 live+0.002
ons+0.002
 she+0.002
Neg logit contributions
 tant-0.011
 fault-0.011
 pad-0.010
 second-0.009
 plate-0.009
Token mom
Token ID1995
Feature activation+9.95e-02

Pos logit contributions
 she+0.003
 they+0.002
apy+0.002
keys+0.002
 live+0.002
Neg logit contributions
 tant-0.004
 fault-0.004
 pad-0.003
 puck-0.003
 angle-0.003
Tokenmy
Token ID1820
Feature activation+7.39e-01

Pos logit contributions
apy+0.004
keys+0.004
 live+0.004
 she+0.003
ves+0.003
Neg logit contributions
 fault-0.004
 tant-0.004
 pad-0.003
 plate-0.003
 second-0.003
Token brought
Token ID3181
Feature activation-3.68e-01

Pos logit contributions
apy+0.021
keys+0.020
 live+0.019
mm+0.018
 she+0.017
Neg logit contributions
 fault-0.034
 tant-0.033
 pad-0.030
 plate-0.029
 second-0.027
Token home
Token ID1363
Feature activation+3.52e-01

Pos logit contributions
 fault+0.018
 tant+0.017
 puck+0.015
 pad+0.015
 pace+0.014
Neg logit contributions
 she-0.010
 live-0.010
ons-0.010
 they-0.009
apy-0.009
Token a
Token ID257
Feature activation+1.53e-01

Pos logit contributions
apy+0.012
 live+0.012
 she+0.012
keys+0.012
ons+0.011
Neg logit contributions
 fault-0.014
 tant-0.014
 pad-0.012
 puck-0.011
 second-0.010
Token new
Token ID649
Feature activation+1.68e-03

Pos logit contributions
 she+0.005
apy+0.005
 they+0.005
 live+0.004
keys+0.004
Neg logit contributions
 fault-0.007
 tant-0.006
 pad-0.005
 asleep-0.005
rett-0.005
Token he
Token ID339
Feature activation-1.34e-01

Pos logit contributions
 fault+0.018
 tant+0.018
 pad+0.015
 plate+0.015
 asleep+0.015
Neg logit contributions
keys-0.008
apy-0.008
 live-0.008
 lived-0.007
ons-0.007
Token is
Token ID318
Feature activation-2.10e-01

Pos logit contributions
 fault+0.006
 tant+0.006
 pad+0.005
 plate+0.005
 puck+0.005
Neg logit contributions
apy-0.004
keys-0.004
 she-0.004
 they-0.003
othe-0.003
Token lucky
Token ID9670
Feature activation-8.11e-02

Pos logit contributions
 fault+0.008
 tant+0.008
 pad+0.007
 second+0.006
 plate+0.006
Neg logit contributions
apy-0.008
keys-0.007
 live-0.007
 she-0.007
ons-0.007
Token.
Token ID13
Feature activation+9.10e-02

Pos logit contributions
 tant+0.004
 fault+0.004
 pad+0.003
 second+0.003
 puck+0.003
Neg logit contributions
 she-0.002
 they-0.002
 live-0.002
apy-0.002
 we-0.002
Token He
Token ID679
Feature activation+1.03e-01

Pos logit contributions
apy+0.003
keys+0.003
 live+0.003
 she+0.003
ons+0.003
Neg logit contributions
 fault-0.004
 tant-0.004
 pad-0.003
 second-0.003
 puck-0.003
Token smiles
Token ID21845
Feature activation+5.55e-02

Pos logit contributions
apy+0.003
keys+0.003
 live+0.002
 she+0.002
ons+0.002
Neg logit contributions
 fault-0.005
 tant-0.005
 pad-0.004
 second-0.004
 puck-0.004
Token and
Token ID290
Feature activation-5.03e-02

Pos logit contributions
 she+0.002
 live+0.002
 they+0.002
apy+0.002
 lived+0.002
Neg logit contributions
 fault-0.003
 tant-0.002
 pad-0.002
 puck-0.002
 plate-0.002
Token writes
Token ID6797
Feature activation-5.56e-02

Pos logit contributions
 fault+0.002
 tant+0.002
 pad+0.002
 puck+0.002
 asleep+0.002
Neg logit contributions
keys-0.001
apy-0.001
ons-0.001
ves-0.001
ended-0.001
Token "
Token ID366
Feature activation+6.21e-01

Pos logit contributions
 fault+0.002
 tant+0.002
 pad+0.002
 puck+0.002
 plate+0.002
Neg logit contributions
apy-0.002
 she-0.002
 live-0.002
keys-0.002
 they-0.002
TokenI
Token ID40
Feature activation+1.53e-02

Pos logit contributions
apy+0.018
keys+0.017
 live+0.017
 she+0.016
ons+0.016
Neg logit contributions
 tant-0.028
 fault-0.027
 pad-0.024
 second-0.022
 puck-0.022
Token love
Token ID1842
Feature activation-1.69e-01

Pos logit contributions
apy+0.000
keys+0.000
 she+0.000
ons+0.000
 live+0.000
Neg logit contributions
 fault-0.001
 tant-0.001
 pad-0.001
 second-0.001
 plate-0.001
Token She
Token ID1375
Feature activation-8.22e-02

Pos logit contributions
 fault+0.003
 tant+0.003
 pad+0.002
 puck+0.002
 second+0.002
Neg logit contributions
apy-0.001
 live-0.001
keys-0.001
 she-0.001
ons-0.001
Token was
Token ID373
Feature activation+1.57e-01

Pos logit contributions
 fault+0.004
 tant+0.004
 pad+0.004
 asleep+0.003
 second+0.003
Neg logit contributions
apy-0.002
 she-0.002
 live-0.002
keys-0.002
ons-0.002
Token very
Token ID845
Feature activation+1.37e-02

Pos logit contributions
 she+0.005
apy+0.004
 they+0.004
 live+0.004
keys+0.004
Neg logit contributions
 tant-0.007
 fault-0.007
 pad-0.006
ingo-0.006
 puck-0.006
Token upset
Token ID9247
Feature activation-5.96e-01

Pos logit contributions
apy+0.001
keys+0.001
 live+0.001
 she+0.001
ons+0.001
Neg logit contributions
 fault-0.000
 tant-0.000
 pad-0.000
 second-0.000
 plate-0.000
Token.
Token ID13
Feature activation-3.42e-02

Pos logit contributions
 fault+0.028
 tant+0.028
 pad+0.024
 puck+0.022
 second+0.022
Neg logit contributions
apy-0.017
 she-0.016
 live-0.016
keys-0.016
 they-0.015
Token Nothing
Token ID10528
Feature activation+2.91e-01

Pos logit contributions
 fault+0.002
 tant+0.002
 pad+0.002
 second+0.002
 plate+0.001
Neg logit contributions
apy-0.001
keys-0.001
 live-0.001
 she-0.001
ons-0.001
Token she
Token ID673
Feature activation-1.10e-02

Pos logit contributions
keys+0.008
apy+0.008
mm+0.007
 live+0.007
mmm+0.006
Neg logit contributions
 fault-0.014
 tant-0.014
 pad-0.012
 second-0.011
 asleep-0.011
Token did
Token ID750
Feature activation-1.81e-01

Pos logit contributions
 fault+0.001
 tant+0.001
 pad+0.000
 plate+0.000
 second+0.000
Neg logit contributions
apy-0.000
keys-0.000
 live-0.000
 she-0.000
mm-0.000
Token helped
Token ID4193
Feature activation-2.41e-01

Pos logit contributions
 fault+0.008
 tant+0.008
 pad+0.007
 second+0.007
 plate+0.006
Neg logit contributions
apy-0.005
keys-0.005
 live-0.005
mm-0.005
ons-0.005
Token.
Token ID13
Feature activation-4.68e-02

Pos logit contributions
 fault+0.012
 tant+0.012
 pad+0.010
 second+0.010
 plate+0.010
Neg logit contributions
apy-0.006
keys-0.006
 live-0.006
 she-0.005
ons-0.005
Token Eventually
Token ID16178
Feature activation+6.94e-02

Pos logit contributions
 fault+0.002
 tant+0.002
 pad+0.002
 second+0.002
 plate+0.002
Neg logit contributions
apy-0.001
keys-0.001
 live-0.001
 she-0.001
ons-0.001
Token to
Token ID284
Feature activation-5.01e-02

Pos logit contributions
apy+0.008
keys+0.008
mm+0.008
 live+0.007
ons+0.007
Neg logit contributions
 fault-0.005
 tant-0.005
 pad-0.004
 asleep-0.004
 second-0.004
Token the
Token ID262
Feature activation+2.45e-01

Pos logit contributions
 fault+0.002
 tant+0.002
 pad+0.002
 second+0.002
 plate+0.002
Neg logit contributions
apy-0.001
keys-0.001
 live-0.001
 she-0.001
ons-0.001
Token park
Token ID3952
Feature activation-3.47e-01

Pos logit contributions
 live+0.007
 she+0.007
 they+0.007
 we+0.007
 you+0.006
Neg logit contributions
 tant-0.012
 fault-0.012
 pad-0.010
joy-0.009
 puck-0.009
Token to
Token ID284
Feature activation-6.13e-02

Pos logit contributions
 fault+0.012
 tant+0.011
 pad+0.010
 second+0.009
 plate+0.009
Neg logit contributions
apy-0.014
keys-0.014
 live-0.013
ons-0.013
mm-0.013
Token play
Token ID711
Feature activation+1.43e-01

Pos logit contributions
 tant+0.002
 fault+0.002
 second+0.002
 pad+0.002
 plate+0.002
Neg logit contributions
keys-0.002
apy-0.002
ons-0.002
ves-0.002
mm-0.002
Token.
Token ID13
Feature activation+1.85e-01

Pos logit contributions
keys+0.006
apy+0.005
mm+0.005
mmm+0.005
ended+0.005
Neg logit contributions
 tant-0.005
 fault-0.005
 pad-0.004
 second-0.004
 plate-0.004
Token Lily
Token ID20037
Feature activation+1.15e-01

Pos logit contributions
apy+0.005
 live+0.005
keys+0.005
 she+0.005
ons+0.004
Neg logit contributions
 fault-0.009
 tant-0.009
 pad-0.008
 second-0.007
 plate-0.007
Token saw
Token ID2497
Feature activation+8.88e-02

Pos logit contributions
apy+0.003
keys+0.003
 live+0.003
 she+0.003
mm+0.003
Neg logit contributions
 fault-0.005
 tant-0.005
 pad-0.005
 second-0.004
 plate-0.004
Token a
Token ID257
Feature activation+1.46e-01

Pos logit contributions
 she+0.003
 they+0.002
apy+0.002
 live+0.002
ons+0.002
Neg logit contributions
 fault-0.004
 tant-0.004
 pad-0.004
 plate-0.003
 puck-0.003
Token merry
Token ID44570
Feature activation+5.11e-01

Pos logit contributions
apy+0.005
 she+0.005
 live+0.005
keys+0.005
 they+0.005
Neg logit contributions
 fault-0.006
 tant-0.005
 pad-0.005
 asleep-0.004
rett-0.004
Token-
Token ID12
Feature activation+2.63e-01

Pos logit contributions
apy+0.012
 she+0.012
 live+0.012
keys+0.011
 they+0.011
Neg logit contributions
 fault-0.027
 tant-0.025
 pad-0.023
 asleep-0.021
 second-0.021
Token's
Token ID338
Feature activation+7.52e-02

Pos logit contributions
apy+0.011
keys+0.011
mm+0.010
 live+0.010
mmm+0.010
Neg logit contributions
 fault-0.012
 tant-0.010
 pad-0.009
 second-0.008
 arm-0.008
Token okay
Token ID8788
Feature activation-7.87e-02

Pos logit contributions
apy+0.003
keys+0.002
 live+0.002
 she+0.002
ons+0.002
Neg logit contributions
 fault-0.003
 tant-0.003
 pad-0.003
 second-0.002
 plate-0.002
Token.
Token ID13
Feature activation-1.05e-01

Pos logit contributions
 fault+0.003
 tant+0.003
 second+0.003
 pad+0.003
 plate+0.003
Neg logit contributions
apy-0.003
keys-0.003
ons-0.002
 live-0.002
odic-0.002
Token You
Token ID921
Feature activation-3.53e-01

Pos logit contributions
 fault+0.005
 tant+0.004
 pad+0.004
ro+0.003
 asleep+0.003
Neg logit contributions
keys-0.004
ons-0.003
ves-0.003
apy-0.003
odic-0.003
Token don
Token ID836
Feature activation+2.40e-01

Pos logit contributions
 fault+0.017
 tant+0.016
 pad+0.014
 second+0.013
 asleep+0.013
Neg logit contributions
apy-0.010
keys-0.009
ves-0.009
 she-0.009
mm-0.009
Token't
Token ID470
Feature activation-2.84e-01

Pos logit contributions
apy+0.008
 live+0.008
 lived+0.007
keys+0.007
 she+0.007
Neg logit contributions
 fault-0.009
 tant-0.009
 second-0.008
 plate-0.007
 asleep-0.007
Token have
Token ID423
Feature activation-3.67e-01

Pos logit contributions
 fault+0.014
 tant+0.014
 pad+0.012
 second+0.012
 plate+0.012
Neg logit contributions
apy-0.007
keys-0.007
 live-0.007
 she-0.006
ons-0.006
Token to
Token ID284
Feature activation-3.72e-01

Pos logit contributions
 tant+0.015
 pad+0.014
 fault+0.014
 plate+0.013
pit+0.012
Neg logit contributions
 they-0.012
 you-0.012
 she-0.012
 we-0.011
apy-0.011
Token ride
Token ID6594
Feature activation+1.88e-01

Pos logit contributions
 fault+0.017
 tant+0.017
 pad+0.014
 second+0.014
 puck+0.013
Neg logit contributions
apy-0.011
keys-0.010
 live-0.010
 she-0.010
ons-0.010
Token it
Token ID340
Feature activation+3.74e-01

Pos logit contributions
apy+0.006
keys+0.006
 live+0.005
 she+0.005
ons+0.005
Neg logit contributions
 fault-0.008
 tant-0.008
 pad-0.007
 second-0.007
 puck-0.007
Token again
Token ID757
Feature activation-1.12e-01

Pos logit contributions
 she+0.011
 live+0.010
 they+0.010
 we+0.009
 you+0.009
Neg logit contributions
 fault-0.017
 tant-0.017
 pad-0.015
 plate-0.014
 puck-0.014
TokenBut
Token ID1537
Feature activation-1.21e-01

Pos logit contributions
 expect+0.001
mm+0.001
ons+0.001
keys+0.001
owers+0.001
Neg logit contributions
 second-0.002
 tant-0.002
 pad-0.001
 puck-0.001
 fault-0.001
Token then
Token ID788
Feature activation-1.94e-01

Pos logit contributions
 fault+0.006
 tant+0.005
 pad+0.005
 second+0.005
 puck+0.004
Neg logit contributions
keys-0.003
apy-0.003
 live-0.003
ons-0.003
 she-0.003
Token,
Token ID11
Feature activation-1.40e-01

Pos logit contributions
 fault+0.007
 tant+0.007
 second+0.007
 puck+0.006
 asleep+0.006
Neg logit contributions
keys-0.006
ended-0.006
ons-0.006
odic-0.006
mm-0.005
Token something
Token ID1223
Feature activation+1.17e-01

Pos logit contributions
 fault+0.008
 tant+0.008
 pad+0.007
 second+0.007
 puck+0.007
Neg logit contributions
apy-0.002
keys-0.002
 live-0.002
ons-0.002
 she-0.002
Token goes
Token ID2925
Feature activation-1.73e-01

Pos logit contributions
keys+0.005
mm+0.005
apy+0.005
ons+0.004
 live+0.004
Neg logit contributions
 fault-0.004
 tant-0.004
 second-0.003
 puck-0.003
 pad-0.003
Token wrong
Token ID2642
Feature activation-1.24e+00

Pos logit contributions
 fault+0.006
 tant+0.006
 pad+0.005
 second+0.005
 plate+0.005
Neg logit contributions
apy-0.007
keys-0.006
 live-0.006
 she-0.006
ons-0.006
Token.
Token ID13
Feature activation+6.32e-03

Pos logit contributions
 tant+0.066
 fault+0.066
 pad+0.058
 puck+0.054
 plate+0.053
Neg logit contributions
 live-0.026
apy-0.026
 she-0.026
 they-0.025
keys-0.024
Token The
Token ID383
Feature activation+1.55e-01

Pos logit contributions
apy+0.000
keys+0.000
 live+0.000
 she+0.000
ons+0.000
Neg logit contributions
 fault-0.000
 tant-0.000
 pad-0.000
 second-0.000
 plate-0.000
Token sticks
Token ID16461
Feature activation+5.37e-02

Pos logit contributions
apy+0.005
 live+0.005
keys+0.005
 she+0.005
 they+0.005
Neg logit contributions
 fault-0.006
 tant-0.006
 pad-0.005
 puck-0.004
 plate-0.004
Token catch
Token ID4929
Feature activation+5.34e-01

Pos logit contributions
apy+0.001
keys+0.001
 live+0.001
mm+0.001
ons+0.001
Neg logit contributions
 fault-0.003
 tant-0.002
 pad-0.002
 second-0.002
 puck-0.002
Token fire
Token ID2046
Feature activation+2.19e-01

Pos logit contributions
 they+0.015
 live+0.014
apy+0.014
 she+0.014
ves+0.014
Neg logit contributions
 fault-0.025
 tant-0.025
 pad-0.022
joy-0.020
 plate-0.020
Token down
Token ID866
Feature activation-1.37e-02

Pos logit contributions
 fault+0.000
 tant+0.000
 pad+0.000
rett+0.000
 plate+0.000
Neg logit contributions
 she-0.000
 they-0.000
 live-0.000
apy-0.000
 you-0.000
Token the
Token ID262
Feature activation+9.55e-02

Pos logit contributions
 fault+0.001
 tant+0.001
 pad+0.001
 plate+0.001
 puck+0.001
Neg logit contributions
 she-0.000
 live-0.000
 they-0.000
apy-0.000
ons-0.000
Token street
Token ID4675
Feature activation+2.80e-01

Pos logit contributions
 she+0.003
 live+0.003
 they+0.003
 we+0.003
keys+0.002
Neg logit contributions
 fault-0.005
 tant-0.004
 pad-0.004
joy-0.003
rett-0.003
Token in
Token ID287
Feature activation-2.47e-01

Pos logit contributions
 she+0.008
 they+0.007
 live+0.007
apy+0.006
 we+0.006
Neg logit contributions
 fault-0.014
 tant-0.014
 pad-0.012
rett-0.011
 plate-0.011
Token his
Token ID465
Feature activation-3.99e-01

Pos logit contributions
 fault+0.015
 pad+0.014
 tant+0.013
urger+0.013
ingo+0.012
Neg logit contributions
 she-0.005
 live-0.004
 they-0.004
 you-0.003
 expect-0.003
Token bath
Token ID7837
Feature activation-1.15e+00

Pos logit contributions
 fault+0.015
 tant+0.014
 pad+0.012
rett+0.011
 puck+0.011
Neg logit contributions
 live-0.015
apy-0.015
 she-0.014
 they-0.014
 lived-0.014
Tokenrobe
Token ID25481
Feature activation-4.86e-01

Pos logit contributions
 fault+0.044
 tant+0.044
 pad+0.038
 second+0.034
 puck+0.034
Neg logit contributions
 she-0.040
apy-0.039
 they-0.038
 live-0.038
 lived-0.037
Token when
Token ID618
Feature activation-2.86e-01

Pos logit contributions
 fault+0.024
 tant+0.024
 pad+0.020
 puck+0.019
 second+0.019
Neg logit contributions
 she-0.013
 they-0.012
 live-0.011
apy-0.011
 we-0.011
Token a
Token ID257
Feature activation+2.41e-02

Pos logit contributions
 fault+0.015
 tant+0.014
 pad+0.013
 second+0.012
 puck+0.012
Neg logit contributions
keys-0.007
apy-0.006
 live-0.006
ons-0.006
mm-0.005
Token car
Token ID1097
Feature activation-6.10e-02

Pos logit contributions
apy+0.001
 she+0.001
 live+0.001
keys+0.001
 they+0.001
Neg logit contributions
 fault-0.001
 tant-0.001
 pad-0.001
 asleep-0.001
rett-0.001
Token came
Token ID1625
Feature activation+1.91e-01

Pos logit contributions
 fault+0.003
 tant+0.003
 pad+0.003
 second+0.002
 puck+0.002
Neg logit contributions
apy-0.002
keys-0.001
 live-0.001
 she-0.001
ons-0.001
Token eat
Token ID4483
Feature activation+1.60e-01

Pos logit contributions
 fault+0.022
 tant+0.022
 pad+0.019
 second+0.018
 plate+0.018
Neg logit contributions
apy-0.011
 live-0.011
keys-0.011
 she-0.011
ons-0.010
Token soap
Token ID19533
Feature activation-1.35e-01

Pos logit contributions
apy+0.005
 live+0.005
keys+0.005
 she+0.005
ons+0.005
Neg logit contributions
 fault-0.007
 tant-0.007
 pad-0.006
 second-0.005
 plate-0.005
Token.
Token ID13
Feature activation+2.78e-04

Pos logit contributions
 fault+0.007
 tant+0.006
 pad+0.006
 second+0.005
rett+0.005
Neg logit contributions
 she-0.003
 they-0.003
 live-0.003
 we-0.003
apy-0.003
Token They
Token ID1119
Feature activation+5.21e-01

Pos logit contributions
keys+0.000
apy+0.000
ons+0.000
 she+0.000
 live+0.000
Neg logit contributions
 fault-0.000
 tant-0.000
 pad-0.000
 second-0.000
 plate-0.000
Token will
Token ID481
Feature activation-5.51e-01

Pos logit contributions
apy+0.014
keys+0.013
ons+0.012
 she+0.012
 live+0.011
Neg logit contributions
 fault-0.026
 tant-0.025
 pad-0.022
 second-0.021
 plate-0.020
Token listen
Token ID6004
Feature activation-1.12e-01

Pos logit contributions
 fault+0.027
 tant+0.027
 pad+0.024
 second+0.022
 puck+0.022
Neg logit contributions
 live-0.014
apy-0.014
 she-0.013
keys-0.013
 they-0.013
Token to
Token ID284
Feature activation-3.61e-01

Pos logit contributions
 fault+0.004
 tant+0.004
 pad+0.003
 puck+0.003
 plate+0.003
Neg logit contributions
 she-0.004
 they-0.004
apy-0.004
 live-0.004
 we-0.004
Token Mom
Token ID11254
Feature activation+4.61e-01

Pos logit contributions
 fault+0.015
 tant+0.015
 pad+0.013
 puck+0.012
 second+0.011
Neg logit contributions
apy-0.012
 live-0.012
 she-0.011
keys-0.011
 they-0.011
Token and
Token ID290
Feature activation-8.22e-02

Pos logit contributions
 she+0.015
 live+0.015
keys+0.014
 they+0.014
apy+0.014
Neg logit contributions
 tant-0.020
 fault-0.020
 pad-0.016
 puck-0.015
 second-0.015
Token Dad
Token ID17415
Feature activation+3.10e-01

Pos logit contributions
 tant+0.003
 fault+0.003
 pad+0.003
 puck+0.002
rett+0.002
Neg logit contributions
 she-0.003
 they-0.003
 we-0.003
 live-0.003
apy-0.003
Token.
Token ID13
Feature activation+9.86e-03

Pos logit contributions
 live+0.007
 she+0.007
keys+0.007
apy+0.007
 they+0.007
Neg logit contributions
 fault-0.016
 tant-0.016
 pad-0.014
 puck-0.013
 second-0.013
Token smiled
Token ID13541
Feature activation+2.37e-01

Pos logit contributions
apy+0.003
keys+0.003
 live+0.003
 she+0.003
ons+0.002
Neg logit contributions
 fault-0.006
 tant-0.006
 pad-0.005
 second-0.005
 plate-0.005
Token and
Token ID290
Feature activation-8.63e-02

Pos logit contributions
 she+0.007
 they+0.007
apy+0.006
 live+0.006
 lived+0.006
Neg logit contributions
 fault-0.012
 tant-0.011
 pad-0.009
 puck-0.009
ingo-0.009
Token they
Token ID484
Feature activation-4.12e-02

Pos logit contributions
 fault+0.003
 tant+0.003
 plate+0.003
 pad+0.003
 puck+0.003
Neg logit contributions
keys-0.003
 live-0.002
vere-0.002
ons-0.002
ves-0.002
Token all
Token ID477
Feature activation+2.26e-01

Pos logit contributions
 fault+0.002
 tant+0.002
 pad+0.001
 plate+0.001
 second+0.001
Neg logit contributions
keys-0.001
apy-0.001
 live-0.001
ons-0.001
 she-0.001
Token went
Token ID1816
Feature activation-9.26e-02

Pos logit contributions
keys+0.007
apy+0.006
ons+0.006
mm+0.006
ves+0.006
Neg logit contributions
 tant-0.010
 fault-0.010
 pad-0.009
 plate-0.008
 second-0.008
Token back
Token ID736
Feature activation-4.45e-01

Pos logit contributions
 fault+0.002
 tant+0.002
 pad+0.001
 asleep+0.001
 second+0.001
Neg logit contributions
keys-0.005
apy-0.005
 live-0.005
mm-0.005
 she-0.005
Token to
Token ID284
Feature activation-3.48e-01

Pos logit contributions
 fault+0.013
 tant+0.012
 pad+0.010
 asleep+0.009
 plate+0.008
Neg logit contributions
keys-0.021
apy-0.021
mm-0.020
 live-0.020
ves-0.019
Token the
Token ID262
Feature activation-8.61e-02

Pos logit contributions
 fault+0.015
 tant+0.014
 pad+0.012
 plate+0.012
 asleep+0.012
Neg logit contributions
keys-0.012
apy-0.011
ves-0.010
mm-0.010
ons-0.010
Token garden
Token ID11376
Feature activation+3.22e-01

Pos logit contributions
 tant+0.004
 fault+0.004
 pad+0.003
rett+0.003
uster+0.003
Neg logit contributions
 she-0.003
 live-0.003
 we-0.003
 they-0.003
 you-0.002
Token to
Token ID284
Feature activation-3.45e-01

Pos logit contributions
apy+0.009
 live+0.009
keys+0.009
 she+0.009
ons+0.008
Neg logit contributions
 fault-0.015
 tant-0.015
 pad-0.013
 second-0.012
 plate-0.012
Token fix
Token ID4259
Feature activation+1.45e-01

Pos logit contributions
 fault+0.013
 tant+0.012
 plate+0.012
 second+0.011
 home+0.011
Neg logit contributions
keys-0.013
ves-0.012
apy-0.011
ons-0.011
vere-0.011
Token the
Token ID262
Feature activation-1.95e-02

Pos logit contributions
 fault+0.017
 tant+0.017
 pad+0.015
 puck+0.014
rett+0.013
Neg logit contributions
 live-0.010
 she-0.009
apy-0.009
 they-0.009
ons-0.008
Token garden
Token ID11376
Feature activation+3.07e-01

Pos logit contributions
 fault+0.001
 tant+0.001
 pad+0.001
 second+0.000
 plate+0.000
Neg logit contributions
apy-0.001
keys-0.001
 live-0.001
ons-0.001
 she-0.001
Token and
Token ID290
Feature activation-1.12e-01

Pos logit contributions
apy+0.009
keys+0.009
 live+0.008
 she+0.008
ons+0.008
Neg logit contributions
 fault-0.014
 tant-0.014
 pad-0.012
 second-0.011
 plate-0.011
Token in
Token ID287
Feature activation-3.57e-01

Pos logit contributions
 fault+0.005
 tant+0.005
 pad+0.005
 second+0.004
 plate+0.004
Neg logit contributions
apy-0.003
keys-0.003
 live-0.003
mm-0.003
ons-0.003
Token the
Token ID262
Feature activation-2.48e-02

Pos logit contributions
 fault+0.017
 tant+0.017
 pad+0.016
 puck+0.014
rett+0.014
Neg logit contributions
 she-0.010
 live-0.009
 they-0.009
apy-0.009
 lived-0.008
Token end
Token ID886
Feature activation-1.99e+00

Pos logit contributions
 fault+0.001
 tant+0.001
 pad+0.001
 second+0.001
 plate+0.001
Neg logit contributions
apy-0.001
keys-0.001
 live-0.001
 she-0.001
ons-0.001
Token,
Token ID11
Feature activation-1.63e-01

Pos logit contributions
 fault+0.095
 tant+0.092
 second+0.082
 pad+0.080
 plate+0.079
Neg logit contributions
keys-0.049
mm-0.047
apy-0.046
 live-0.046
 lived-0.044
Token the
Token ID262
Feature activation-2.06e-02

Pos logit contributions
 fault+0.008
 tant+0.008
 pad+0.007
 second+0.006
 plate+0.006
Neg logit contributions
apy-0.004
keys-0.004
 live-0.004
ons-0.004
 she-0.004
Token garden
Token ID11376
Feature activation+3.15e-01

Pos logit contributions
 fault+0.001
 tant+0.001
 pad+0.001
 asleep+0.001
 puck+0.001
Neg logit contributions
 she-0.001
 live-0.001
apy-0.001
keys-0.001
 they-0.001
Token was
Token ID373
Feature activation+1.34e-01

Pos logit contributions
apy+0.008
keys+0.008
 live+0.008
 she+0.007
ons+0.007
Neg logit contributions
 fault-0.015
 tant-0.015
 pad-0.013
 plate-0.012
 second-0.012
Token even
Token ID772
Feature activation+5.53e-01

Pos logit contributions
apy+0.004
 live+0.004
 she+0.004
 they+0.003
keys+0.003
Neg logit contributions
 tant-0.006
 fault-0.006
 pad-0.005
 puck-0.005
ingo-0.005
Token cool
Token ID3608
Feature activation+1.70e-01

Pos logit contributions
apy+0.013
keys+0.013
 live+0.013
 she+0.012
ons+0.012
Neg logit contributions
 fault-0.015
 tant-0.015
 pad-0.012
 second-0.011
 plate-0.011
Token things
Token ID1243
Feature activation-2.06e-01

Pos logit contributions
keys+0.006
apy+0.006
 live+0.006
 lived+0.006
mm+0.006
Neg logit contributions
 tant-0.007
 fault-0.007
 pad-0.005
 asleep-0.005
 plate-0.005
Token.
Token ID13
Feature activation-1.12e-02

Pos logit contributions
 fault+0.010
 tant+0.009
 pad+0.008
 asleep+0.007
 second+0.007
Neg logit contributions
apy-0.006
keys-0.006
 live-0.006
ons-0.006
 lived-0.006
Token In
Token ID554
Feature activation+4.95e-01

Pos logit contributions
 fault+0.001
 tant+0.001
 pad+0.000
 second+0.000
 plate+0.000
Neg logit contributions
apy-0.000
 live-0.000
keys-0.000
 she-0.000
 they-0.000
Token the
Token ID262
Feature activation+4.80e-02

Pos logit contributions
 she+0.009
 live+0.009
 they+0.009
apy+0.008
 lived+0.008
Neg logit contributions
 tant-0.029
 fault-0.028
 pad-0.026
 puck-0.024
rett-0.023
Token end
Token ID886
Feature activation-2.00e+00

Pos logit contributions
 she+0.001
 live+0.001
ons+0.001
 they+0.001
 lived+0.001
Neg logit contributions
 fault-0.002
 tant-0.002
rett-0.002
 pad-0.002
 asleep-0.002
Token,
Token ID11
Feature activation-1.31e-01

Pos logit contributions
 fault+0.068
 tant+0.063
 second+0.055
 home+0.050
 asleep+0.049
Neg logit contributions
keys-0.075
mm-0.068
 live-0.068
odic-0.068
 expect-0.067
Token everyone
Token ID2506
Feature activation-1.32e-01

Pos logit contributions
 fault+0.007
 tant+0.007
 pad+0.006
 second+0.006
 plate+0.006
Neg logit contributions
apy-0.002
keys-0.002
 live-0.002
ons-0.002
mm-0.002
Token was
Token ID373
Feature activation+2.11e-01

Pos logit contributions
 fault+0.006
 tant+0.006
 pad+0.005
 plate+0.005
 asleep+0.005
Neg logit contributions
apy-0.004
keys-0.004
mm-0.003
 live-0.003
mmm-0.003
Token happy
Token ID3772
Feature activation+7.95e-02

Pos logit contributions
apy+0.006
 live+0.006
 she+0.006
keys+0.006
 they+0.006
Neg logit contributions
 tant-0.009
 fault-0.009
 pad-0.008
 puck-0.007
 plate-0.007
Token.
Token ID13
Feature activation-2.11e-02

Pos logit contributions
 she+0.002
apy+0.002
 live+0.002
 they+0.002
keys+0.002
Neg logit contributions
 tant-0.004
 fault-0.004
 pad-0.003
 second-0.003
 puck-0.003
Token again
Token ID757
Feature activation-1.52e-01

Pos logit contributions
 live+0.012
 she+0.012
keys+0.011
 they+0.011
apy+0.011
Neg logit contributions
 fault-0.018
 tant-0.018
 pad-0.014
 puck-0.014
 plate-0.014
Token.
Token ID13
Feature activation-4.68e-02

Pos logit contributions
 fault+0.007
 tant+0.007
 pad+0.006
 second+0.006
 plate+0.006
Neg logit contributions
apy-0.004
keys-0.004
 live-0.004
 she-0.004
ons-0.004
Token\n
Token ID198
Feature activation-2.33e-01

Pos logit contributions
 fault+0.002
 tant+0.002
 pad+0.002
 plate+0.002
 second+0.002
Neg logit contributions
apy-0.001
 live-0.001
 she-0.001
keys-0.001
 they-0.001
Token\n
Token ID198
Feature activation-2.51e-01

Pos logit contributions
 fault+0.010
 tant+0.010
 pad+0.008
 plate+0.007
 puck+0.007
Neg logit contributions
apy-0.008
 live-0.008
 she-0.007
keys-0.007
ons-0.007
TokenThe
Token ID464
Feature activation-3.51e-01

Pos logit contributions
 fault+0.013
 tant+0.012
 pad+0.011
 plate+0.010
 asleep+0.010
Neg logit contributions
apy-0.007
 live-0.006
keys-0.006
 she-0.006
ons-0.006
Token end
Token ID886
Feature activation-2.09e+00

Pos logit contributions
 fault+0.016
 tant+0.015
 pad+0.013
ingo+0.013
rett+0.012
Neg logit contributions
 she-0.012
 live-0.011
keys-0.011
 we-0.010
apy-0.010
Token.
Token ID13
Feature activation-8.54e-02

Pos logit contributions
 fault+0.084
 tant+0.079
 pad+0.070
 second+0.065
 plate+0.063
Neg logit contributions
keys-0.072
apy-0.070
mm-0.068
ons-0.066
 live-0.065
Token<|endoftext|>
Token ID50256
Feature activation-6.22e-01

Pos logit contributions
 fault+0.004
 tant+0.004
 pad+0.004
 plate+0.003
 second+0.003
Neg logit contributions
apy-0.002
 live-0.002
 she-0.002
 they-0.002
keys-0.002
TokenOnce
Token ID7454
Feature activation+8.20e-01

Pos logit contributions
 fault+0.030
 tant+0.029
 pad+0.025
 plate+0.023
 second+0.023
Neg logit contributions
apy-0.017
 live-0.017
 she-0.016
keys-0.016
ons-0.015
Token upon
Token ID2402
Feature activation+7.77e-02

Pos logit contributions
 she+0.036
apy+0.036
 live+0.035
 they+0.035
 lived+0.034
Neg logit contributions
 fault-0.025
 tant-0.025
 pad-0.020
 puck-0.018
 plate-0.018
Token a
Token ID257
Feature activation-7.18e-02

Pos logit contributions
 live+0.003
apy+0.003
 she+0.003
keys+0.002
 they+0.002
Neg logit contributions
 fault-0.003
 tant-0.003
 pad-0.003
 puck-0.002
 plate-0.002
Token place
Token ID1295
Feature activation+5.61e-01

Pos logit contributions
 she+0.004
 live+0.004
 they+0.004
 we+0.004
apy+0.004
Neg logit contributions
 tant-0.005
 fault-0.005
 pad-0.004
ingo-0.004
 puck-0.004
Token.
Token ID13
Feature activation+1.00e-02

Pos logit contributions
apy+0.015
 live+0.014
 she+0.014
keys+0.014
 they+0.013
Neg logit contributions
 fault-0.027
 tant-0.027
 pad-0.023
 second-0.021
 puck-0.021
Token\n
Token ID198
Feature activation-6.14e-02

Pos logit contributions
apy+0.000
 she+0.000
 live+0.000
keys+0.000
 they+0.000
Neg logit contributions
 fault-0.000
 tant-0.000
 pad-0.000
 plate-0.000
 asleep-0.000
Token\n
Token ID198
Feature activation-1.12e-01

Pos logit contributions
 fault+0.003
 tant+0.003
 pad+0.002
 asleep+0.002
 plate+0.002
Neg logit contributions
apy-0.002
 she-0.002
 live-0.002
ons-0.002
keys-0.002
TokenThe
Token ID464
Feature activation-3.04e-01

Pos logit contributions
 fault+0.006
 tant+0.006
 pad+0.005
 asleep+0.005
 plate+0.004
Neg logit contributions
apy-0.003
 she-0.003
keys-0.003
 live-0.003
ons-0.003
Token End
Token ID5268
Feature activation-2.75e+00

Pos logit contributions
 fault+0.014
 tant+0.013
 pad+0.011
 asleep+0.009
ingo+0.009
Neg logit contributions
 live-0.010
 she-0.010
keys-0.010
apy-0.009
 they-0.009
Token.
Token ID13
Feature activation-4.52e-02

Pos logit contributions
 fault+0.110
 tant+0.110
 pad+0.091
 second+0.089
 plate+0.087
Neg logit contributions
apy-0.090
keys-0.087
 live-0.087
mm-0.080
 she-0.079
Token<|endoftext|>
Token ID50256
Feature activation-6.42e-01

Pos logit contributions
 tant+0.002
 fault+0.002
 pad+0.002
 puck+0.002
 plate+0.002
Neg logit contributions
apy-0.001
 she-0.001
 live-0.001
 they-0.001
keys-0.001
TokenOnce
Token ID7454
Feature activation+5.37e-01

Pos logit contributions
 fault+0.028
 tant+0.028
 pad+0.024
 second+0.022
 plate+0.022
Neg logit contributions
apy-0.020
keys-0.019
 live-0.018
 she-0.018
ons-0.017
Token upon
Token ID2402
Feature activation+7.27e-03

Pos logit contributions
 she+0.024
apy+0.023
 they+0.023
 live+0.023
 lived+0.022
Neg logit contributions
 fault-0.017
 tant-0.017
 pad-0.013
 puck-0.012
 plate-0.012
Token a
Token ID257
Feature activation+4.46e-02

Pos logit contributions
 she+0.000
 live+0.000
apy+0.000
 they+0.000
keys+0.000
Neg logit contributions
 fault-0.000
 tant-0.000
 pad-0.000
 puck-0.000
 plate-0.000
Token.
Token ID13
Feature activation-8.60e-02

Pos logit contributions
 she+0.003
apy+0.003
 live+0.002
 they+0.002
keys+0.002
Neg logit contributions
 fault-0.006
 tant-0.006
 pad-0.006
 plate-0.005
 puck-0.005
Token\n
Token ID198
Feature activation-2.10e-01

Pos logit contributions
 fault+0.004
 tant+0.004
 pad+0.003
 plate+0.003
 puck+0.003
Neg logit contributions
apy-0.002
 she-0.002
 live-0.002
keys-0.002
 they-0.002
Token\n
Token ID198
Feature activation-2.35e-01

Pos logit contributions
 fault+0.010
 tant+0.010
 pad+0.009
 plate+0.008
 puck+0.008
Neg logit contributions
apy-0.006
 live-0.006
 she-0.006
keys-0.006
ons-0.005
TokenIn
Token ID818
Feature activation+6.73e-02

Pos logit contributions
 fault+0.011
 tant+0.011
 pad+0.009
 plate+0.009
 asleep+0.009
Neg logit contributions
apy-0.007
keys-0.006
 live-0.006
 she-0.006
ons-0.006
Token the
Token ID262
Feature activation-2.25e-02

Pos logit contributions
 she+0.001
apy+0.001
 they+0.001
 live+0.001
 lived+0.001
Neg logit contributions
 fault-0.004
 tant-0.004
 pad-0.003
 puck-0.003
ingo-0.003
Token end
Token ID886
Feature activation-2.14e+00

Pos logit contributions
 fault+0.001
 tant+0.001
 pad+0.001
rett+0.001
 asleep+0.001
Neg logit contributions
 live-0.001
 she-0.001
apy-0.001
keys-0.001
 lived-0.001
Token,
Token ID11
Feature activation-2.27e-01

Pos logit contributions
 fault+0.081
 tant+0.080
 asleep+0.068
 second+0.068
 pad+0.065
Neg logit contributions
keys-0.070
mm-0.069
apy-0.064
ons-0.063
 live-0.063
Token the
Token ID262
Feature activation-5.17e-03

Pos logit contributions
 fault+0.010
 tant+0.010
 pad+0.009
 second+0.009
 puck+0.008
Neg logit contributions
apy-0.006
keys-0.006
 live-0.006
ons-0.006
 she-0.006
Token boy
Token ID2933
Feature activation-6.90e-01

Pos logit contributions
 fault+0.000
 tant+0.000
 pad+0.000
 asleep+0.000
 puck+0.000
Neg logit contributions
apy-0.000
 live-0.000
 she-0.000
keys-0.000
ons-0.000
Token had
Token ID550
Feature activation-1.15e-01

Pos logit contributions
 fault+0.034
 tant+0.033
 pad+0.029
 second+0.027
 plate+0.027
Neg logit contributions
apy-0.018
keys-0.017
 live-0.017
 she-0.016
ons-0.015
Token to
Token ID284
Feature activation-3.37e-01

Pos logit contributions
 tant+0.005
 fault+0.005
 pad+0.004
 plate+0.004
 angle+0.004
Neg logit contributions
 she-0.004
apy-0.003
 they-0.003
 live-0.003
 lived-0.003