TOP ACTIVATIONS
MAX = 7.313

he 's tired and to rely on his friends when he
friend to sort - of rely on . <|endoftext|> Once upon
Good . I rely on your word . This
She knew she could always rely on her own ideas to
that he shouldn 't always rely on others to supply him
, knowing that she could rely on Jim . And with
He knew that he could rely on the clock to tell
She knew she could always rely on her heart for her
and knew that she could rely on him . <|endoftext|> Once
you . You can always rely on me . â
bear knew she could always rely on her mom my bear
Bear knew he could always rely on himself to do great
farmer knew he could always rely on the little girl to
knew that she could always rely on her voice to sing
. They knew they could rely on each other . <|endoftext|>
as she held the l oll ipop in her hand .
always have her muscles to rely on , whether her friends
good friend . From that day on , Lucy and the
great time . From that day on , Tim my knew
a place he could always rely on . <|endoftext|> Once upon

MIN ACTIVATIONS
MIN = -12.130

his new board he could hardly contain himself . His dad
wall . It said " DO NOT T OU CH ".
red button that said " DO NOT PRESS ." She couldn
st inky that she could hardly stand it !
so much that he could hardly move . Finally he had
the zoo that she could hardly wait . She put on
so excited - she could hardly contain herself . She started
so excited , she could hardly wait to eat the tasty
so excited that she could hardly stand it .
and tomatoes . He could hardly wait to eat it ,
so excited that she could hardly stand it .
wind , the trees to wered high , and the clouds
much fun that they could hardly contain their excitement .
so amazed , it could hardly believe its eyes !
and sizes . He could hardly believe his eyes . He
to go , he could hardly contain his excitement .
ever before . He could hardly believe it
. As the waves crashed upon the shore , Bob could
again , and Lucy could hardly wait to see what surprise
very excited . He could hardly contain his joy as he

INTERVAL 2.452 - 7.313
CONTAINS 4.456%

herself safe . From that day on , Lily 's new
feathers . One day , Wisdom told N appy
doing ?!" her mother said in a stern voice .
. He decided to take a bath and clean himself up
the shop and give the truck back . Maybe we can

INTERVAL -2.409 - 2.452
CONTAINS 91.065%

around . But the mud was very filthy and soon the
Anna feels sad . She really wants to taste
out of her box and puts them on the floor .
and started to feel better . As the days
song that was much better than her own . She felt

INTERVAL -7.269 - -2.409
CONTAINS 4.476%

hair . Suddenly , he accidentally cut his finger and started
exclaimed . Lily was so happy and thanked Tim my .
soup for you . But you have to relax and wait
so big and unknown to her . But then , the
But today , Tuesday wasn 't going to be so fun

INTERVAL -12.130 - -7.269
CONTAINS 0.003%

his new board he could hardly contain himself . His dad
wall . It said " DO NOT T OU CH ".
Token he
Token ID339
Feature activation-1.56e-01

Pos logit contributions
P+0.022
unt+0.020
ww+0.020
 cherry+0.019
 baker+0.019
Neg logit contributions
der-0.025
en-0.019
ilst-0.018
iece-0.018
 before-0.018
Token's
Token ID338
Feature activation-9.10e-01

Pos logit contributions
der+0.005
icles+0.004
iece+0.004
reating+0.004
ities+0.004
Neg logit contributions
P-0.003
 st-0.002
ww-0.002
 good-0.002
 cherry-0.002
Token tired
Token ID10032
Feature activation-8.02e-01

Pos logit contributions
der+0.030
en+0.025
iece+0.024
ilst+0.024
icles+0.024
Neg logit contributions
P-0.015
 cherry-0.014
unt-0.014
 good-0.014
ww-0.014
Token and
Token ID290
Feature activation+1.08e+00

Pos logit contributions
der+0.023
ilst+0.020
inch+0.019
ussed+0.019
iece+0.018
Neg logit contributions
 good-0.015
 long-0.015
ww-0.014
P-0.014
 cherry-0.014
Token to
Token ID284
Feature activation+9.00e-01

Pos logit contributions
P+0.018
 good+0.018
 cherry+0.017
ww+0.017
 baker+0.016
Neg logit contributions
der-0.035
ilst-0.030
en-0.029
ities-0.029
icles-0.029
Token rely
Token ID8814
Feature activation+7.31e+00

Pos logit contributions
P+0.017
 cherry+0.015
unt+0.015
 good+0.015
ww+0.015
Neg logit contributions
der-0.029
ilst-0.023
iece-0.022
ities-0.022
en-0.022
Token on
Token ID319
Feature activation+3.37e+00

Pos logit contributions
P+0.099
unt+0.094
 baker+0.093
 cherry+0.092
ww+0.092
Neg logit contributions
der-0.260
 before-0.221
en-0.213
 himself-0.209
ilst-0.208
Token his
Token ID465
Feature activation+1.84e-01

Pos logit contributions
P+0.068
 good+0.066
unt+0.063
ww+0.062
 long+0.061
Neg logit contributions
der-0.099
ill-0.079
ilst-0.078
ipples-0.077
inch-0.076
Token friends
Token ID2460
Feature activation+3.70e+00

Pos logit contributions
P+0.004
unt+0.004
ww+0.004
Ell+0.004
Elizabeth+0.003
Neg logit contributions
der-0.005
 before-0.004
en-0.004
ities-0.004
 himself-0.004
Token when
Token ID618
Feature activation+1.41e+00

Pos logit contributions
P+0.067
 good+0.063
unt+0.060
 baker+0.059
 pastry+0.059
Neg logit contributions
der-0.116
icles-0.092
iece-0.092
en-0.091
ipples-0.091
Token he
Token ID339
Feature activation+3.19e-01

Pos logit contributions
P+0.035
unt+0.032
ww+0.032
 cherry+0.031
 baker+0.031
Neg logit contributions
der-0.037
en-0.028
ilst-0.027
iece-0.026
inch-0.026
Token friend
Token ID1545
Feature activation+3.55e+00

Pos logit contributions
P+0.096
Ell+0.086
Elizabeth+0.081
ww+0.078
unt+0.078
Neg logit contributions
der-0.122
 before-0.114
inch-0.100
 how-0.098
 since-0.096
Token to
Token ID284
Feature activation+1.69e+00

Pos logit contributions
P+0.092
unt+0.086
ww+0.084
 baker+0.083
 cherry+0.083
Neg logit contributions
der-0.090
en-0.065
ilst-0.063
iece-0.062
ities-0.062
Token sort
Token ID3297
Feature activation+2.56e+00

Pos logit contributions
P+0.033
unt+0.031
ww+0.030
 cherry+0.030
 baker+0.029
Neg logit contributions
der-0.053
en-0.041
ilst-0.040
iece-0.040
 before-0.040
Token-
Token ID12
Feature activation+1.93e+00

Pos logit contributions
P+0.052
unt+0.048
 pastry+0.046
 cherry+0.046
ww+0.046
Neg logit contributions
der-0.079
ilst-0.060
ities-0.060
iece-0.059
en-0.059
Tokenof
Token ID1659
Feature activation+1.53e+00

Pos logit contributions
P+0.055
unt+0.051
ww+0.050
 baker+0.050
 cherry+0.049
Neg logit contributions
der-0.044
en-0.031
 before-0.030
ilst-0.030
ities-0.029
Token rely
Token ID8814
Feature activation+7.07e+00

Pos logit contributions
P+0.033
unt+0.030
ww+0.030
 cherry+0.030
 pastry+0.029
Neg logit contributions
der-0.045
ilst-0.033
iece-0.033
en-0.033
ities-0.033
Token on
Token ID319
Feature activation+3.49e+00

Pos logit contributions
P+0.097
unt+0.095
ww+0.093
 baker+0.091
 cherry+0.090
Neg logit contributions
der-0.245
 before-0.209
en-0.203
ilst-0.199
 himself-0.197
Token.
Token ID13
Feature activation+3.79e-01

Pos logit contributions
P+0.070
unt+0.065
ww+0.062
 cherry+0.061
 baker+0.061
Neg logit contributions
der-0.108
en-0.082
ilst-0.082
iece-0.081
inch-0.080
Token<|endoftext|>
Token ID50256
Feature activation-1.10e+00

Pos logit contributions
P+0.008
unt+0.007
ww+0.007
 baker+0.007
 cherry+0.007
Neg logit contributions
der-0.012
en-0.009
 before-0.009
ilst-0.009
iece-0.009
TokenOnce
Token ID7454
Feature activation-8.20e-01

Pos logit contributions
der+0.038
en+0.030
ities+0.030
ilst+0.029
iece+0.029
Neg logit contributions
P-0.020
unt-0.017
ww-0.016
 cherry-0.016
 baker-0.016
Token upon
Token ID2402
Feature activation-5.06e+00

Pos logit contributions
der+0.017
en+0.012
ities+0.011
iece+0.011
ilst+0.011
Neg logit contributions
P-0.025
unt-0.024
ww-0.023
 cherry-0.023
 baker-0.023
Token
Token ID129
Feature activation-1.45e+00

Pos logit contributions
P+0.042
unt+0.040
ww+0.039
 cherry+0.036
Elizabeth+0.036
Neg logit contributions
der-0.067
en-0.052
 before-0.050
ilst-0.050
iece-0.050
Token
Token ID241
Feature activation-1.66e+00

Pos logit contributions
der+0.039
en+0.028
 before+0.028
ities+0.028
inch+0.028
Neg logit contributions
P-0.034
unt-0.033
ww-0.032
Elizabeth-0.031
 cherry-0.031
TokenGood
Token ID10248
Feature activation+1.15e+00

Pos logit contributions
der+0.054
icking+0.046
en+0.046
iece+0.045
ility+0.045
Neg logit contributions
P-0.029
unt-0.028
ww-0.028
 baker-0.025
www-0.025
Token.
Token ID13
Feature activation+8.63e-01

Pos logit contributions
P+0.025
unt+0.024
ww+0.023
 baker+0.022
 cherry+0.022
Neg logit contributions
der-0.033
en-0.025
ilst-0.025
 before-0.025
iece-0.025
Token I
Token ID314
Feature activation+1.06e+00

Pos logit contributions
unt+0.017
P+0.017
ww+0.016
 baker+0.015
 cherry+0.015
Neg logit contributions
der-0.026
en-0.021
ilst-0.021
iece-0.021
ities-0.020
Token rely
Token ID8814
Feature activation+7.05e+00

Pos logit contributions
P+0.018
unt+0.016
ww+0.016
 good+0.015
 cherry+0.015
Neg logit contributions
der-0.037
iece-0.029
ities-0.029
icles-0.029
icated-0.028
Token on
Token ID319
Feature activation+2.55e+00

Pos logit contributions
P+0.104
 baker+0.104
 cherry+0.103
unt+0.100
ww+0.099
Neg logit contributions
der-0.230
 before-0.201
en-0.192
ilst-0.189
inch-0.186
Token your
Token ID534
Feature activation+4.55e-02

Pos logit contributions
P+0.053
unt+0.049
ww+0.049
 good+0.046
 cherry+0.046
Neg logit contributions
der-0.077
en-0.059
ilst-0.059
ipples-0.059
iece-0.058
Token word
Token ID1573
Feature activation-9.66e-01

Pos logit contributions
P+0.001
unt+0.001
Ell+0.001
ww+0.001
Elizabeth+0.001
Neg logit contributions
der-0.001
 before-0.001
en-0.001
ities-0.001
 himself-0.001
Token.
Token ID13
Feature activation+6.22e-01

Pos logit contributions
der+0.027
en+0.021
ilst+0.021
iece+0.020
inch+0.020
Neg logit contributions
P-0.021
unt-0.020
ww-0.020
 baker-0.019
 cherry-0.019
Token This
Token ID770
Feature activation+1.35e+00

Pos logit contributions
unt+0.013
P+0.013
ww+0.012
 baker+0.011
 cherry+0.011
Neg logit contributions
der-0.019
en-0.015
iece-0.015
ilst-0.015
inch-0.014
Token She
Token ID1375
Feature activation+6.20e-01

Pos logit contributions
P+0.018
unt+0.017
 baker+0.017
 cherry+0.016
ww+0.016
Neg logit contributions
der-0.030
en-0.024
ilst-0.024
iece-0.024
ities-0.023
Token knew
Token ID2993
Feature activation-1.15e+00

Pos logit contributions
P+0.014
unt+0.013
ww+0.012
 baker+0.012
 cherry+0.012
Neg logit contributions
der-0.018
en-0.014
 before-0.013
ilst-0.013
iece-0.013
Token she
Token ID673
Feature activation-1.36e-01

Pos logit contributions
der+0.032
en+0.026
ilst+0.026
ipples+0.025
iece+0.025
Neg logit contributions
P-0.025
unt-0.023
 cherry-0.023
 baker-0.022
ww-0.022
Token could
Token ID714
Feature activation+1.02e+00

Pos logit contributions
der+0.004
icles+0.003
iece+0.003
en+0.003
inch+0.003
Neg logit contributions
P-0.003
unt-0.003
ww-0.002
 lawyer-0.002
 baker-0.002
Token always
Token ID1464
Feature activation-7.18e-01

Pos logit contributions
P+0.017
 cherry+0.015
 good+0.015
 long+0.015
unt+0.015
Neg logit contributions
der-0.035
iece-0.028
icated-0.027
ilst-0.027
ities-0.027
Token rely
Token ID8814
Feature activation+6.85e+00

Pos logit contributions
der+0.023
iece+0.017
ilst+0.017
icles+0.017
en+0.017
Neg logit contributions
P-0.014
unt-0.013
ww-0.012
 cherry-0.012
 good-0.012
Token on
Token ID319
Feature activation+3.02e+00

Pos logit contributions
ww+0.107
 cherry+0.107
unt+0.106
 baker+0.105
P+0.102
Neg logit contributions
der-0.215
 before-0.193
 himself-0.180
en-0.177
ities-0.174
Token her
Token ID607
Feature activation-2.49e-01

Pos logit contributions
P+0.056
unt+0.051
ww+0.049
 cherry+0.048
 pastry+0.048
Neg logit contributions
der-0.097
ilst-0.075
en-0.075
inch-0.074
ipples-0.074
Token own
Token ID898
Feature activation+3.85e-01

Pos logit contributions
der+0.009
en+0.007
 before+0.007
ilst+0.007
iece+0.007
Neg logit contributions
P-0.004
unt-0.004
ww-0.004
 baker-0.003
 cherry-0.003
Token ideas
Token ID4213
Feature activation+4.37e-01

Pos logit contributions
P+0.007
unt+0.006
ww+0.006
 cherry+0.006
 baker+0.006
Neg logit contributions
der-0.013
en-0.010
ilst-0.010
 before-0.010
iece-0.010
Token to
Token ID284
Feature activation+1.49e+00

Pos logit contributions
P+0.008
unt+0.008
 good+0.008
 long+0.008
 cherry+0.008
Neg logit contributions
der-0.013
inch-0.010
ipples-0.010
icated-0.010
ities-0.010
Token that
Token ID326
Feature activation+1.45e+00

Pos logit contributions
der+0.068
ipples+0.064
iece+0.063
ilst+0.063
ill+0.062
Neg logit contributions
P-0.043
unt-0.042
 good-0.041
 lawyer-0.041
ww-0.039
Token he
Token ID339
Feature activation-3.20e-01

Pos logit contributions
P+0.030
 good+0.029
 cherry+0.029
unt+0.029
 lawyer+0.029
Neg logit contributions
der-0.041
ilst-0.033
en-0.031
iece-0.031
ussed-0.031
Token shouldn
Token ID6584
Feature activation-1.87e+00

Pos logit contributions
der+0.010
icles+0.009
iece+0.008
ities+0.008
ipples+0.008
Neg logit contributions
P-0.006
 good-0.005
 lawyer-0.005
unt-0.005
ww-0.005
Token't
Token ID470
Feature activation-3.44e+00

Pos logit contributions
der+0.067
ities+0.055
iece+0.054
ilst+0.054
inch+0.053
Neg logit contributions
P-0.031
unt-0.028
Ell-0.025
 lawyer-0.024
ww-0.024
Token always
Token ID1464
Feature activation-3.62e-01

Pos logit contributions
der+0.113
iece+0.091
ilst+0.090
en+0.088
inch+0.087
Neg logit contributions
P-0.060
 cherry-0.057
unt-0.055
ww-0.054
 lawyer-0.054
Token rely
Token ID8814
Feature activation+6.66e+00

Pos logit contributions
der+0.011
en+0.008
 before+0.008
ilst+0.008
iece+0.008
Neg logit contributions
P-0.008
unt-0.007
ww-0.007
 baker-0.007
 cherry-0.007
Token on
Token ID319
Feature activation+3.12e+00

Pos logit contributions
P+0.093
unt+0.087
ww+0.084
 cherry+0.084
 baker+0.083
Neg logit contributions
der-0.235
 before-0.200
en-0.193
ities-0.190
ilst-0.189
Token others
Token ID1854
Feature activation+1.87e+00

Pos logit contributions
P+0.063
 good+0.061
 long+0.058
unt+0.057
ww+0.057
Neg logit contributions
der-0.092
ilst-0.072
ipples-0.071
ill-0.071
iece-0.069
Token to
Token ID284
Feature activation+1.96e+00

Pos logit contributions
P+0.035
 good+0.034
 lawyer+0.033
unt+0.032
 pastry+0.032
Neg logit contributions
der-0.055
icles-0.045
ipples-0.045
inch-0.045
iece-0.044
Token supply
Token ID5127
Feature activation+9.83e-01

Pos logit contributions
P+0.038
unt+0.035
ww+0.033
 cherry+0.033
 baker+0.033
Neg logit contributions
der-0.062
ilst-0.048
en-0.047
iece-0.047
inch-0.046
Token him
Token ID683
Feature activation+7.37e-01

Pos logit contributions
P+0.022
 good+0.020
unt+0.020
ww+0.020
 baker+0.019
Neg logit contributions
der-0.028
ilst-0.021
iece-0.020
ussed-0.020
ipples-0.020
Token,
Token ID11
Feature activation-1.26e-01

Pos logit contributions
der+0.076
icated+0.070
ilst+0.068
ipples+0.067
inch+0.066
Neg logit contributions
 good-0.046
 baker-0.046
P-0.045
unt-0.045
 long-0.044
Token knowing
Token ID6970
Feature activation-5.43e-01

Pos logit contributions
der+0.003
en+0.002
ilst+0.002
 before+0.002
iece+0.002
Neg logit contributions
P-0.003
unt-0.003
ww-0.003
 baker-0.003
 cherry-0.003
Token that
Token ID326
Feature activation+1.34e+00

Pos logit contributions
der+0.015
en+0.012
ilst+0.012
iece+0.011
ities+0.011
Neg logit contributions
P-0.012
unt-0.011
ww-0.011
 cherry-0.011
 baker-0.011
Token she
Token ID673
Feature activation+2.62e-01

Pos logit contributions
P+0.022
unt+0.021
ww+0.020
 cherry+0.019
 baker+0.019
Neg logit contributions
der-0.046
en-0.037
ilst-0.036
inch-0.035
iece-0.035
Token could
Token ID714
Feature activation+1.11e+00

Pos logit contributions
P+0.004
unt+0.004
ww+0.004
Elizabeth+0.004
 baker+0.004
Neg logit contributions
der-0.009
icles-0.007
inch-0.007
iece-0.007
ities-0.007
Token rely
Token ID8814
Feature activation+6.52e+00

Pos logit contributions
P+0.019
 cherry+0.018
unt+0.017
 baker+0.017
 good+0.017
Neg logit contributions
der-0.037
iece-0.029
icated-0.029
ilst-0.028
inch-0.028
Token on
Token ID319
Feature activation+2.76e+00

Pos logit contributions
ww+0.100
unt+0.099
P+0.099
 cherry+0.099
 baker+0.097
Neg logit contributions
der-0.205
 before-0.185
 himself-0.173
ilst-0.168
icking-0.167
Token Jim
Token ID5395
Feature activation+1.12e+00

Pos logit contributions
P+0.051
unt+0.046
ww+0.045
 baker+0.044
 pastry+0.044
Neg logit contributions
der-0.089
ilst-0.069
en-0.069
inch-0.067
iece-0.067
Token.
Token ID13
Feature activation+1.84e+00

Pos logit contributions
P+0.023
unt+0.021
ww+0.021
 baker+0.021
 cherry+0.021
Neg logit contributions
der-0.033
en-0.025
ilst-0.025
iece-0.025
ities-0.025
Token And
Token ID843
Feature activation-2.70e-01

Pos logit contributions
P+0.033
unt+0.031
ww+0.030
 baker+0.029
 cherry+0.029
Neg logit contributions
der-0.059
en-0.047
ilst-0.046
 before-0.046
ities-0.046
Token with
Token ID351
Feature activation+1.63e+00

Pos logit contributions
der+0.008
en+0.006
ilst+0.006
 before+0.006
iece+0.006
Neg logit contributions
P-0.006
unt-0.005
ww-0.005
 baker-0.005
 cherry-0.005
Token He
Token ID679
Feature activation+3.49e-01

Pos logit contributions
unt+0.025
P+0.024
 lawyer+0.023
 baker+0.023
Elizabeth+0.022
Neg logit contributions
der-0.041
en-0.034
icking-0.034
inch-0.033
ility-0.032
Token knew
Token ID2993
Feature activation-1.12e+00

Pos logit contributions
P+0.008
unt+0.007
ww+0.007
 baker+0.007
 cherry+0.007
Neg logit contributions
der-0.010
en-0.007
ilst-0.007
 before-0.007
iece-0.007
Token that
Token ID326
Feature activation+1.34e+00

Pos logit contributions
der+0.031
ipples+0.026
ilst+0.026
en+0.026
iece+0.026
Neg logit contributions
 good-0.022
unt-0.021
 lawyer-0.021
P-0.021
 long-0.021
Token he
Token ID339
Feature activation-1.84e-01

Pos logit contributions
P+0.029
unt+0.028
 good+0.028
 lawyer+0.028
 baker+0.028
Neg logit contributions
der-0.037
ilst-0.029
en-0.028
inch-0.028
iece-0.028
Token could
Token ID714
Feature activation+7.23e-01

Pos logit contributions
der+0.006
iece+0.005
icles+0.005
inch+0.005
ities+0.004
Neg logit contributions
P-0.003
unt-0.003
 lawyer-0.003
ww-0.003
 good-0.003
Token rely
Token ID8814
Feature activation+6.51e+00

Pos logit contributions
P+0.013
 cherry+0.012
 baker+0.011
unt+0.011
 good+0.011
Neg logit contributions
der-0.024
iece-0.019
ilst-0.019
inch-0.018
icated-0.018
Token on
Token ID319
Feature activation+2.55e+00

Pos logit contributions
P+0.091
unt+0.084
ww+0.083
 cherry+0.081
 baker+0.081
Neg logit contributions
der-0.231
 before-0.197
en-0.191
ilst-0.186
iece-0.186
Token the
Token ID262
Feature activation+1.59e+00

Pos logit contributions
P+0.052
unt+0.050
 good+0.050
ww+0.049
 long+0.048
Neg logit contributions
der-0.074
ilst-0.056
inch-0.055
ipples-0.055
en-0.054
Token clock
Token ID8801
Feature activation+1.08e+00

Pos logit contributions
P+0.029
ww+0.025
unt+0.024
Ell+0.023
Elizabeth+0.023
Neg logit contributions
der-0.053
 before-0.044
iece-0.042
ilst-0.041
en-0.041
Token to
Token ID284
Feature activation+1.21e+00

Pos logit contributions
P+0.025
unt+0.023
ww+0.023
 baker+0.022
 cherry+0.022
Neg logit contributions
der-0.029
en-0.022
ilst-0.022
inch-0.021
ities-0.021
Token tell
Token ID1560
Feature activation-5.47e-01

Pos logit contributions
P+0.023
unt+0.022
 good+0.021
 cherry+0.021
 long+0.021
Neg logit contributions
der-0.037
ilst-0.029
inch-0.028
iece-0.028
en-0.028
Token She
Token ID1375
Feature activation-3.50e-01

Pos logit contributions
P+0.033
unt+0.031
ww+0.030
 baker+0.029
 cherry+0.029
Neg logit contributions
der-0.049
en-0.038
 before-0.038
ilst-0.037
iece-0.037
Token knew
Token ID2993
Feature activation-1.33e+00

Pos logit contributions
der+0.011
icles+0.009
ities+0.008
inch+0.008
ilst+0.008
Neg logit contributions
P-0.007
unt-0.006
ww-0.006
 good-0.006
Elizabeth-0.006
Token she
Token ID673
Feature activation-5.98e-01

Pos logit contributions
der+0.039
ipples+0.035
ilst+0.034
iece+0.033
en+0.033
Neg logit contributions
unt-0.024
 good-0.023
P-0.023
 lawyer-0.022
 cherry-0.022
Token could
Token ID714
Feature activation+7.61e-01

Pos logit contributions
der+0.019
icles+0.016
inch+0.015
iece+0.015
icated+0.015
Neg logit contributions
P-0.010
unt-0.010
 good-0.010
 lawyer-0.009
Elizabeth-0.009
Token always
Token ID1464
Feature activation-1.04e+00

Pos logit contributions
P+0.013
 good+0.012
unt+0.012
 long+0.012
 cherry+0.012
Neg logit contributions
der-0.024
icles-0.020
ilst-0.020
iece-0.020
inch-0.019
Token rely
Token ID8814
Feature activation+6.46e+00

Pos logit contributions
der+0.032
icles+0.026
ilst+0.026
inch+0.025
icking+0.024
Neg logit contributions
P-0.019
 good-0.018
unt-0.018
 long-0.017
Elizabeth-0.017
Token on
Token ID319
Feature activation+2.64e+00

Pos logit contributions
P+0.088
 cherry+0.086
 baker+0.084
ww+0.084
unt+0.084
Neg logit contributions
der-0.228
 before-0.195
en-0.188
iece-0.183
ilst-0.181
Token her
Token ID607
Feature activation+4.12e-02

Pos logit contributions
P+0.040
unt+0.038
 good+0.037
 long+0.035
Elizabeth+0.035
Neg logit contributions
der-0.091
ipples-0.073
ilst-0.073
icles-0.073
inch-0.072
Token heart
Token ID2612
Feature activation-2.91e-01

Pos logit contributions
P+0.001
unt+0.001
ww+0.001
 good+0.001
 pastry+0.001
Neg logit contributions
der-0.001
en-0.001
ilst-0.001
icles-0.001
iece-0.001
Token for
Token ID329
Feature activation+1.69e+00

Pos logit contributions
der+0.008
ilst+0.006
inch+0.006
icles+0.006
en+0.006
Neg logit contributions
P-0.006
unt-0.006
 good-0.006
 long-0.006
ww-0.006
Token her
Token ID607
Feature activation-4.53e-01

Pos logit contributions
P+0.034
unt+0.031
ww+0.030
 baker+0.030
 cherry+0.029
Neg logit contributions
der-0.052
en-0.040
ilst-0.040
inch-0.039
iece-0.039
Token and
Token ID290
Feature activation-5.31e-02

Pos logit contributions
der+0.051
inch+0.047
ilst+0.045
icated+0.045
icles+0.045
Neg logit contributions
 good-0.041
 lawyer-0.040
 baker-0.040
 long-0.040
 cherry-0.040
Token knew
Token ID2993
Feature activation-1.10e+00

Pos logit contributions
der+0.002
ities+0.001
en+0.001
inch+0.001
ilst+0.001
Neg logit contributions
P-0.001
 cherry-0.001
 baker-0.001
 good-0.001
unt-0.001
Token that
Token ID326
Feature activation+1.17e+00

Pos logit contributions
der+0.030
en+0.025
ipples+0.024
ilst+0.024
iece+0.024
Neg logit contributions
P-0.024
unt-0.023
 cherry-0.022
 pastry-0.022
 good-0.022
Token she
Token ID673
Feature activation-4.38e-01

Pos logit contributions
P+0.024
unt+0.022
ww+0.021
 baker+0.021
 cherry+0.021
Neg logit contributions
der-0.036
en-0.028
ilst-0.027
 before-0.027
iece-0.027
Token could
Token ID714
Feature activation+6.30e-01

Pos logit contributions
der+0.013
iece+0.010
inch+0.010
en+0.010
icles+0.010
Neg logit contributions
P-0.009
unt-0.009
ww-0.008
 baker-0.008
 cherry-0.008
Token rely
Token ID8814
Feature activation+6.44e+00

Pos logit contributions
P+0.010
 cherry+0.009
 baker+0.009
unt+0.009
 good+0.009
Neg logit contributions
der-0.022
iece-0.017
ilst-0.017
icated-0.017
inch-0.017
Token on
Token ID319
Feature activation+2.83e+00

Pos logit contributions
P+0.093
unt+0.091
ww+0.090
 baker+0.089
 cherry+0.088
Neg logit contributions
der-0.217
 before-0.189
en-0.178
 himself-0.177
ities-0.173
Token him
Token ID683
Feature activation+1.28e-01

Pos logit contributions
P+0.054
unt+0.049
ww+0.048
 cherry+0.047
 baker+0.047
Neg logit contributions
der-0.090
en-0.070
ilst-0.069
iece-0.068
inch-0.068
Token.
Token ID13
Feature activation+5.45e-01

Pos logit contributions
P+0.003
unt+0.003
ww+0.003
 baker+0.003
 cherry+0.003
Neg logit contributions
der-0.003
en-0.003
ilst-0.003
inch-0.003
iece-0.003
Token<|endoftext|>
Token ID50256
Feature activation-1.40e+00

Pos logit contributions
P+0.011
unt+0.010
ww+0.010
 baker+0.010
 cherry+0.010
Neg logit contributions
der-0.017
en-0.013
 before-0.013
ilst-0.013
iece-0.013
TokenOnce
Token ID7454
Feature activation-8.87e-01

Pos logit contributions
der+0.050
en+0.039
ities+0.039
ilst+0.038
inch+0.038
Neg logit contributions
P-0.024
unt-0.021
 cherry-0.019
ww-0.019
 baker-0.019
Token you
Token ID345
Feature activation-1.40e+00

Pos logit contributions
P+0.019
unt+0.018
ww+0.018
 cherry+0.017
 baker+0.017
Neg logit contributions
der-0.027
en-0.021
iece-0.021
ilst-0.021
inch-0.020
Token.
Token ID13
Feature activation+1.13e-01

Pos logit contributions
der+0.040
iece+0.031
en+0.030
ilst+0.030
inch+0.029
Neg logit contributions
P-0.030
unt-0.029
ww-0.028
 good-0.028
 long-0.028
Token You
Token ID921
Feature activation-1.84e+00

Pos logit contributions
unt+0.002
P+0.002
ww+0.002
 baker+0.002
 cherry+0.002
Neg logit contributions
der-0.003
en-0.003
iece-0.003
ilst-0.003
inch-0.003
Token can
Token ID460
Feature activation+7.43e-01

Pos logit contributions
der+0.061
iece+0.049
ities+0.047
icles+0.046
ilst+0.045
Neg logit contributions
P-0.033
unt-0.032
 good-0.031
ww-0.030
 cherry-0.029
Token always
Token ID1464
Feature activation-1.11e+00

Pos logit contributions
P+0.014
unt+0.013
 cherry+0.012
 taxi+0.012
 baker+0.012
Neg logit contributions
der-0.025
iece-0.020
ities-0.019
ilst-0.019
icated-0.019
Token rely
Token ID8814
Feature activation+6.36e+00

Pos logit contributions
der+0.036
ilst+0.028
iece+0.027
inch+0.027
icles+0.027
Neg logit contributions
P-0.021
unt-0.019
ww-0.019
 cherry-0.018
 baker-0.018
Token on
Token ID319
Feature activation+2.66e+00

Pos logit contributions
P+0.086
 cherry+0.078
unt+0.078
 baker+0.078
ww+0.077
Neg logit contributions
der-0.228
 before-0.193
en-0.188
ilst-0.182
ities-0.182
Token me
Token ID502
Feature activation-8.93e-02

Pos logit contributions
P+0.049
unt+0.046
ww+0.045
 good+0.041
 pastry+0.041
Neg logit contributions
der-0.086
ilst-0.067
iece-0.067
ipples-0.066
en-0.066
Token.
Token ID13
Feature activation+1.78e-01

Pos logit contributions
der+0.002
ilst+0.002
en+0.002
iece+0.002
inch+0.002
Neg logit contributions
P-0.002
unt-0.002
ww-0.002
 baker-0.002
 cherry-0.002
Tokenâ
Token ID22940
Feature activation-3.60e+00

Pos logit contributions
P+0.004
unt+0.004
ww+0.004
 baker+0.004
 cherry+0.004
Neg logit contributions
der-0.005
en-0.004
ilst-0.004
iece-0.004
inch-0.004
Token
Token ID26391
Feature activation+1.68e+00

Pos logit contributions
der+0.104
en+0.075
ities+0.075
inch+0.074
 before+0.074
Neg logit contributions
P-0.084
unt-0.079
ww-0.077
 baker-0.072
www-0.071
Token bear
Token ID6842
Feature activation+2.35e+00

Pos logit contributions
P+0.042
unt+0.039
ww+0.039
 baker+0.038
 cherry+0.038
Neg logit contributions
der-0.045
en-0.034
 before-0.033
ilst-0.032
iece-0.032
Token knew
Token ID2993
Feature activation-1.98e+00

Pos logit contributions
P+0.053
unt+0.049
ww+0.048
 cherry+0.047
 baker+0.047
Neg logit contributions
der-0.065
en-0.049
ilst-0.049
ities-0.048
inch-0.048
Token she
Token ID673
Feature activation-6.40e-01

Pos logit contributions
der+0.055
en+0.048
ities+0.047
ipples+0.047
iece+0.047
Neg logit contributions
P-0.040
unt-0.039
 good-0.037
 cherry-0.036
 lawyer-0.036
Token could
Token ID714
Feature activation+5.16e-01

Pos logit contributions
der+0.021
icles+0.018
ipples+0.017
icated+0.017
inch+0.017
Neg logit contributions
P-0.011
 good-0.010
 lawyer-0.010
unt-0.010
ww-0.009
Token always
Token ID1464
Feature activation-9.20e-01

Pos logit contributions
P+0.009
 cherry+0.008
 good+0.008
unt+0.008
 long+0.008
Neg logit contributions
der-0.017
icated-0.014
iece-0.013
ilst-0.013
ities-0.013
Token rely
Token ID8814
Feature activation+6.34e+00

Pos logit contributions
der+0.029
en+0.022
ilst+0.022
inch+0.022
iece+0.022
Neg logit contributions
P-0.018
unt-0.016
ww-0.016
 cherry-0.016
 baker-0.016
Token on
Token ID319
Feature activation+3.31e+00

Pos logit contributions
P+0.085
 baker+0.084
unt+0.083
 cherry+0.083
ww+0.081
Neg logit contributions
der-0.219
 before-0.188
en-0.183
 himself-0.180
ock-0.177
Token her
Token ID607
Feature activation-1.54e+00

Pos logit contributions
P+0.059
unt+0.054
ww+0.051
 cherry+0.050
 baker+0.050
Neg logit contributions
der-0.109
ilst-0.085
en-0.085
inch-0.084
iece-0.084
Token mom
Token ID1995
Feature activation+1.98e+00

Pos logit contributions
der+0.043
en+0.034
ilst+0.034
ility+0.033
icles+0.033
Neg logit contributions
P-0.032
 good-0.031
unt-0.030
 lawyer-0.030
 baker-0.030
Tokenmy
Token ID1820
Feature activation+1.62e+00

Pos logit contributions
P+0.045
unt+0.043
ww+0.041
 baker+0.041
 lawyer+0.041
Neg logit contributions
der-0.056
ipples-0.039
ilst-0.039
inch-0.039
en-0.039
Token bear
Token ID6842
Feature activation+2.28e+00

Pos logit contributions
P+0.038
unt+0.036
ww+0.034
 baker+0.034
 lawyer+0.034
Neg logit contributions
der-0.044
ilst-0.031
ipples-0.031
inch-0.031
en-0.031
Token Bear
Token ID14732
Feature activation+1.20e+00

Pos logit contributions
P+0.027
unt+0.025
ww+0.025
 baker+0.024
 cherry+0.024
Neg logit contributions
der-0.033
en-0.025
 before-0.025
ilst-0.024
iece-0.024
Token knew
Token ID2993
Feature activation-1.62e+00

Pos logit contributions
P+0.025
unt+0.024
ww+0.023
 cherry+0.023
 baker+0.022
Neg logit contributions
der-0.035
en-0.027
ities-0.027
iece-0.026
ilst-0.026
Token he
Token ID339
Feature activation-4.38e-02

Pos logit contributions
der+0.046
iece+0.043
ipples+0.041
ilst+0.040
en+0.039
Neg logit contributions
 good-0.031
unt-0.031
P-0.030
 cherry-0.029
ww-0.028
Token could
Token ID714
Feature activation+9.05e-01

Pos logit contributions
der+0.001
icles+0.001
iece+0.001
inch+0.001
ipples+0.001
Neg logit contributions
P-0.001
unt-0.001
 good-0.001
 lawyer-0.001
ww-0.001
Token always
Token ID1464
Feature activation-9.22e-01

Pos logit contributions
P+0.015
 good+0.014
 long+0.013
 cherry+0.013
 ha+0.013
Neg logit contributions
der-0.031
iece-0.025
icated-0.024
ilst-0.024
ities-0.024
Token rely
Token ID8814
Feature activation+6.33e+00

Pos logit contributions
der+0.030
icles+0.023
ilst+0.023
inch+0.023
iece+0.023
Neg logit contributions
P-0.017
 good-0.015
unt-0.015
ww-0.015
 lawyer-0.015
Token on
Token ID319
Feature activation+2.75e+00

Pos logit contributions
P+0.080
unt+0.075
ww+0.072
 cherry+0.071
 baker+0.071
Neg logit contributions
der-0.234
 before-0.196
en-0.193
ilst-0.190
iece-0.189
Token himself
Token ID2241
Feature activation+2.29e-01

Pos logit contributions
P+0.062
 good+0.060
 long+0.058
unt+0.057
ww+0.055
Neg logit contributions
der-0.075
ilst-0.057
ipples-0.056
icles-0.055
inch-0.055
Token to
Token ID284
Feature activation+1.61e+00

Pos logit contributions
P+0.005
 good+0.005
 long+0.005
ww+0.005
unt+0.005
Neg logit contributions
der-0.006
en-0.005
icated-0.005
ilst-0.005
inch-0.005
Token do
Token ID466
Feature activation+1.49e+00

Pos logit contributions
P+0.030
 good+0.028
unt+0.028
 cherry+0.027
 long+0.027
Neg logit contributions
der-0.051
ilst-0.039
en-0.038
inch-0.038
ities-0.038
Token great
Token ID1049
Feature activation+1.75e+00

Pos logit contributions
P+0.031
 good+0.031
 long+0.029
ww+0.028
unt+0.028
Neg logit contributions
der-0.042
ilst-0.032
icles-0.032
ipples-0.032
inch-0.032
Token farmer
Token ID18739
Feature activation+1.80e+00

Pos logit contributions
ww+0.028
P+0.027
Elizabeth+0.024
Ell+0.024
 cherry+0.024
Neg logit contributions
der-0.057
 before-0.047
ilst-0.046
en-0.045
icking-0.045
Token knew
Token ID2993
Feature activation-1.69e+00

Pos logit contributions
P+0.044
unt+0.041
ww+0.040
 baker+0.040
 cherry+0.039
Neg logit contributions
der-0.047
en-0.035
ilst-0.034
 before-0.034
iece-0.034
Token he
Token ID339
Feature activation-3.46e-01

Pos logit contributions
der+0.047
en+0.038
ipples+0.038
iece+0.038
ities+0.038
Neg logit contributions
P-0.035
unt-0.033
 cherry-0.032
 good-0.032
 lawyer-0.032
Token could
Token ID714
Feature activation+5.21e-01

Pos logit contributions
der+0.011
icles+0.008
iece+0.008
inch+0.008
ipples+0.008
Neg logit contributions
P-0.007
unt-0.006
 lawyer-0.006
 good-0.006
ww-0.006
Token always
Token ID1464
Feature activation-1.19e+00

Pos logit contributions
P+0.009
 cherry+0.008
unt+0.008
 good+0.008
 lawyer+0.008
Neg logit contributions
der-0.017
iece-0.014
icated-0.013
ities-0.013
ilst-0.013
Token rely
Token ID8814
Feature activation+6.32e+00

Pos logit contributions
der+0.038
en+0.029
ilst+0.028
iece+0.028
inch+0.028
Neg logit contributions
P-0.023
unt-0.021
ww-0.021
 cherry-0.020
 baker-0.020
Token on
Token ID319
Feature activation+2.69e+00

Pos logit contributions
P+0.086
unt+0.082
ww+0.081
 baker+0.081
 cherry+0.080
Neg logit contributions
der-0.222
 before-0.190
en-0.183
ilst-0.180
 himself-0.180
Token the
Token ID262
Feature activation+8.87e-01

Pos logit contributions
P+0.059
unt+0.054
ww+0.053
 baker+0.052
 cherry+0.052
Neg logit contributions
der-0.077
ilst-0.058
en-0.057
inch-0.057
iece-0.056
Token little
Token ID1310
Feature activation+2.25e+00

Pos logit contributions
P+0.015
ww+0.014
Elizabeth+0.013
unt+0.013
Ell+0.012
Neg logit contributions
der-0.029
 before-0.024
ilst-0.023
 himself-0.023
icking-0.022
Token girl
Token ID2576
Feature activation+1.99e+00

Pos logit contributions
P+0.037
Elizabeth+0.036
unt+0.036
ww+0.035
Ell+0.033
Neg logit contributions
der-0.072
 before-0.063
en-0.061
 since-0.060
ilst-0.056
Token to
Token ID284
Feature activation+1.62e+00

Pos logit contributions
P+0.044
unt+0.042
ww+0.041
 baker+0.040
 cherry+0.040
Neg logit contributions
der-0.056
en-0.042
ilst-0.041
inch-0.041
ities-0.040
Token knew
Token ID2993
Feature activation-1.03e+00

Pos logit contributions
P+0.007
unt+0.006
ww+0.006
 cherry+0.006
 baker+0.006
Neg logit contributions
der-0.009
en-0.007
ilst-0.007
iece-0.007
ities-0.007
Token that
Token ID326
Feature activation+1.18e+00

Pos logit contributions
der+0.029
en+0.023
ilst+0.023
iece+0.023
ities+0.023
Neg logit contributions
P-0.022
unt-0.021
 cherry-0.020
ww-0.020
 baker-0.020
Token she
Token ID673
Feature activation-4.65e-01

Pos logit contributions
P+0.026
unt+0.024
ww+0.024
 cherry+0.023
 baker+0.023
Neg logit contributions
der-0.034
en-0.026
ilst-0.026
iece-0.025
 before-0.025
Token could
Token ID714
Feature activation+5.89e-01

Pos logit contributions
der+0.014
iece+0.011
icles+0.011
inch+0.011
ities+0.011
Neg logit contributions
P-0.009
unt-0.009
ww-0.008
 cherry-0.008
 baker-0.008
Token always
Token ID1464
Feature activation-1.02e+00

Pos logit contributions
P+0.011
 cherry+0.010
unt+0.010
 baker+0.010
Elizabeth+0.010
Neg logit contributions
der-0.019
ilst-0.015
iece-0.015
ities-0.014
inch-0.014
Token rely
Token ID8814
Feature activation+6.31e+00

Pos logit contributions
der+0.031
ilst+0.023
en+0.023
inch+0.023
iece+0.023
Neg logit contributions
P-0.021
unt-0.019
ww-0.019
 cherry-0.019
 baker-0.019
Token on
Token ID319
Feature activation+2.62e+00

Pos logit contributions
P+0.095
 baker+0.093
ww+0.093
 cherry+0.092
unt+0.092
Neg logit contributions
der-0.206
 before-0.182
en-0.170
 himself-0.165
ock-0.164
Token her
Token ID607
Feature activation+1.03e-01

Pos logit contributions
P+0.052
unt+0.048
ww+0.046
 cherry+0.045
 baker+0.045
Neg logit contributions
der-0.081
ilst-0.062
en-0.062
inch-0.061
iece-0.061
Token voice
Token ID3809
Feature activation+5.99e-01

Pos logit contributions
P+0.002
ww+0.002
unt+0.002
Ell+0.002
 baker+0.002
Neg logit contributions
der-0.003
 before-0.003
en-0.003
iece-0.002
ities-0.002
Token to
Token ID284
Feature activation+1.85e+00

Pos logit contributions
P+0.012
unt+0.012
ww+0.011
 cherry+0.011
 baker+0.011
Neg logit contributions
der-0.018
en-0.014
ilst-0.014
inch-0.013
iece-0.013
Token sing
Token ID1702
Feature activation+3.47e-01

Pos logit contributions
P+0.038
unt+0.035
ww+0.033
 cherry+0.033
 baker+0.033
Neg logit contributions
der-0.056
ilst-0.043
en-0.042
iece-0.042
inch-0.042
Token.
Token ID13
Feature activation+1.65e+00

Pos logit contributions
 good+0.018
 long+0.017
 high+0.016
 few+0.015
unt+0.015
Neg logit contributions
der-0.022
icles-0.018
ilst-0.018
inch-0.018
ipples-0.018
Token They
Token ID1119
Feature activation+1.01e+00

Pos logit contributions
P+0.031
unt+0.030
ww+0.028
 cherry+0.028
 baker+0.028
Neg logit contributions
der-0.052
en-0.041
iece-0.041
ilst-0.040
ities-0.040
Token knew
Token ID2993
Feature activation-1.20e+00

Pos logit contributions
P+0.022
unt+0.020
ww+0.020
 cherry+0.019
 baker+0.019
Neg logit contributions
der-0.030
en-0.023
ities-0.023
iece-0.023
ilst-0.023
Token they
Token ID484
Feature activation+7.55e-01

Pos logit contributions
der+0.037
ilst+0.033
iece+0.032
en+0.032
ities+0.031
Neg logit contributions
 good-0.020
unt-0.020
 cherry-0.019
P-0.019
 lawyer-0.019
Token could
Token ID714
Feature activation+9.79e-01

Pos logit contributions
 good+0.011
P+0.011
unt+0.010
 lawyer+0.010
 high+0.009
Neg logit contributions
der-0.026
icles-0.022
iece-0.021
en-0.021
ities-0.021
Token rely
Token ID8814
Feature activation+6.26e+00

Pos logit contributions
P+0.017
 good+0.016
 cherry+0.015
unt+0.015
 baker+0.014
Neg logit contributions
der-0.033
iece-0.026
ilst-0.026
icated-0.025
ities-0.025
Token on
Token ID319
Feature activation+2.33e+00

Pos logit contributions
P+0.088
unt+0.084
ww+0.084
 cherry+0.083
 baker+0.083
Neg logit contributions
der-0.218
 before-0.185
en-0.178
 himself-0.174
iece-0.173
Token each
Token ID1123
Feature activation+3.46e+00

Pos logit contributions
P+0.042
unt+0.039
 good+0.038
ww+0.038
 cherry+0.038
Neg logit contributions
der-0.074
ilst-0.059
en-0.057
inch-0.057
iece-0.056
Token other
Token ID584
Feature activation+8.43e-01

Pos logit contributions
P+0.073
unt+0.067
ww+0.066
 baker+0.064
 cherry+0.064
Neg logit contributions
der-0.104
en-0.081
 before-0.080
ilst-0.079
iece-0.078
Token.
Token ID13
Feature activation+1.49e+00

Pos logit contributions
P+0.017
 good+0.017
unt+0.017
 long+0.016
 lawyer+0.016
Neg logit contributions
der-0.023
ilst-0.018
inch-0.018
icles-0.018
en-0.018
Token<|endoftext|>
Token ID50256
Feature activation-1.90e+00

Pos logit contributions
P+0.028
unt+0.027
ww+0.026
 cherry+0.025
 baker+0.025
Neg logit contributions
der-0.047
en-0.038
iece-0.037
ilst-0.037
 before-0.037
Token as
Token ID355
Feature activation+2.82e+00

Pos logit contributions
der+0.010
icated+0.009
ilst+0.009
icles+0.008
ility+0.008
Neg logit contributions
P-0.008
 good-0.008
unt-0.007
 long-0.007
 baker-0.007
Token she
Token ID673
Feature activation+7.14e-01

Pos logit contributions
P+0.041
unt+0.038
ww+0.036
 baker+0.035
 cherry+0.035
Neg logit contributions
der-0.102
en-0.083
ilst-0.083
ities-0.081
iece-0.081
Token held
Token ID2714
Feature activation+2.68e+00

Pos logit contributions
P+0.015
unt+0.014
ww+0.013
 baker+0.013
 cherry+0.013
Neg logit contributions
der-0.022
en-0.017
 before-0.017
ilst-0.016
iece-0.016
Token the
Token ID262
Feature activation+1.99e+00

Pos logit contributions
P+0.066
ww+0.064
unt+0.061
Ell+0.061
 baker+0.058
Neg logit contributions
der-0.066
 before-0.058
en-0.055
 himself-0.054
 earlier-0.050
Token l
Token ID300
Feature activation+6.82e-01

Pos logit contributions
P+0.035
unt+0.031
ww+0.031
Ell+0.030
Michelle+0.028
Neg logit contributions
der-0.066
 before-0.055
ilst-0.052
en-0.051
ities-0.051
Tokenoll
Token ID692
Feature activation+6.23e+00

Pos logit contributions
P+0.019
unt+0.017
 baker+0.017
ww+0.017
 cherry+0.017
Neg logit contributions
der-0.016
en-0.012
ilst-0.011
icated-0.011
 before-0.011
Tokenipop
Token ID42800
Feature activation+2.78e+00

Pos logit contributions
P+0.119
unt+0.111
Ell+0.108
ww+0.107
Elizabeth+0.105
Neg logit contributions
der-0.196
 before-0.158
en-0.152
inch-0.150
ilst-0.149
Token in
Token ID287
Feature activation+2.52e+00

Pos logit contributions
P+0.060
 cherry+0.055
unt+0.055
 pastry+0.054
ww+0.054
Neg logit contributions
der-0.080
en-0.062
icated-0.060
ilst-0.060
ities-0.059
Token her
Token ID607
Feature activation-1.84e-01

Pos logit contributions
P+0.049
unt+0.045
ww+0.044
 baker+0.043
 cherry+0.042
Neg logit contributions
der-0.079
en-0.064
 before-0.063
ilst-0.061
iece-0.060
Token hand
Token ID1021
Feature activation+1.69e+00

Pos logit contributions
der+0.005
 before+0.004
en+0.004
 himself+0.004
ilst+0.004
Neg logit contributions
P-0.004
ww-0.004
unt-0.004
Ell-0.004
 baker-0.004
Token.
Token ID13
Feature activation+1.69e+00

Pos logit contributions
P+0.038
unt+0.038
 cherry+0.038
 long+0.037
 pastry+0.037
Neg logit contributions
der-0.041
icated-0.033
ilst-0.032
ipples-0.032
inch-0.031
Token always
Token ID1464
Feature activation-1.46e+00

Pos logit contributions
P+0.006
 good+0.006
unt+0.006
 long+0.005
 cherry+0.005
Neg logit contributions
der-0.013
icles-0.010
iece-0.010
icated-0.010
ities-0.010
Token have
Token ID423
Feature activation+1.92e+00

Pos logit contributions
der+0.046
ilst+0.036
iece+0.036
icles+0.036
en+0.036
Neg logit contributions
P-0.027
unt-0.025
 good-0.025
ww-0.024
 cherry-0.024
Token her
Token ID607
Feature activation-1.21e-01

Pos logit contributions
P+0.040
unt+0.037
ww+0.036
 cherry+0.035
 baker+0.035
Neg logit contributions
der-0.057
en-0.045
ilst-0.044
iece-0.044
 before-0.043
Token muscles
Token ID12749
Feature activation+1.13e-01

Pos logit contributions
der+0.004
en+0.003
ilst+0.003
iece+0.003
inch+0.003
Neg logit contributions
P-0.002
unt-0.002
ww-0.002
 baker-0.002
 cherry-0.002
Token to
Token ID284
Feature activation+2.77e-01

Pos logit contributions
 good+0.002
P+0.002
 long+0.002
 high+0.002
 cherry+0.002
Neg logit contributions
der-0.003
ities-0.003
en-0.003
inch-0.003
ilst-0.003
Token rely
Token ID8814
Feature activation+6.23e+00

Pos logit contributions
P+0.005
 good+0.004
 cherry+0.004
 long+0.004
 pastry+0.004
Neg logit contributions
der-0.009
ilst-0.007
iece-0.007
inch-0.007
en-0.007
Token on
Token ID319
Feature activation+2.41e+00

Pos logit contributions
P+0.086
unt+0.079
ww+0.077
 cherry+0.075
 baker+0.075
Neg logit contributions
der-0.225
 before-0.186
en-0.184
ilst-0.180
iece-0.178
Token,
Token ID11
Feature activation-1.01e+00

Pos logit contributions
P+0.049
unt+0.045
ww+0.044
 cherry+0.043
 pastry+0.043
Neg logit contributions
der-0.073
ilst-0.057
inch-0.056
iece-0.055
en-0.055
Token whether
Token ID1771
Feature activation-1.72e+00

Pos logit contributions
der+0.025
en+0.019
ilst+0.018
inch+0.018
iece+0.018
Neg logit contributions
P-0.025
unt-0.024
ww-0.024
 cherry-0.023
 baker-0.023
Token her
Token ID607
Feature activation-1.77e+00

Pos logit contributions
der+0.054
ilst+0.043
iece+0.043
en+0.043
inch+0.042
Neg logit contributions
P-0.032
unt-0.030
 cherry-0.029
ww-0.029
 baker-0.028
Token friends
Token ID2460
Feature activation+8.50e-01

Pos logit contributions
der+0.057
en+0.046
ilst+0.045
iece+0.045
ility+0.044
Neg logit contributions
P-0.031
 cherry-0.029
unt-0.029
 baker-0.029
ww-0.028
Token good
Token ID922
Feature activation+3.95e+00

Pos logit contributions
P+0.076
Elizabeth+0.067
unt+0.067
Ell+0.067
Michelle+0.063
Neg logit contributions
der-0.140
 before-0.118
 since-0.110
en-0.109
inch-0.106
Token friend
Token ID1545
Feature activation+4.37e+00

Pos logit contributions
P+0.068
unt+0.062
ww+0.060
Elizabeth+0.057
 cherry+0.056
Neg logit contributions
der-0.134
en-0.108
 before-0.106
ilst-0.104
inch-0.103
Token.
Token ID13
Feature activation+1.36e+00

Pos logit contributions
P+0.104
unt+0.100
 lawyer+0.099
 good+0.097
 baker+0.096
Neg logit contributions
der-0.111
ilst-0.086
icles-0.085
ipples-0.085
inch-0.084
Token From
Token ID3574
Feature activation-1.25e+00

Pos logit contributions
P+0.025
unt+0.023
ww+0.022
 cherry+0.021
 baker+0.021
Neg logit contributions
der-0.044
en-0.036
ities-0.035
iece-0.035
ilst-0.035
Token that
Token ID326
Feature activation+4.62e+00

Pos logit contributions
der+0.029
en+0.020
ilst+0.020
 before+0.020
iece+0.019
Neg logit contributions
P-0.035
unt-0.033
ww-0.032
 baker-0.032
 cherry-0.032
Token day
Token ID1110
Feature activation+6.19e+00

Pos logit contributions
P+0.117
Michelle+0.113
Elizabeth+0.113
 baker+0.111
Ell+0.111
Neg logit contributions
der-0.093
 before-0.093
 since-0.082
icated-0.078
en-0.077
Token on
Token ID319
Feature activation+2.71e+00

Pos logit contributions
 baker+0.125
 pastry+0.123
 bakery+0.123
 cherry+0.121
Ell+0.111
Neg logit contributions
der-0.172
 before-0.155
iece-0.147
icking-0.143
 himself-0.141
Token,
Token ID11
Feature activation+2.11e+00

Pos logit contributions
P+0.056
unt+0.051
ww+0.050
 baker+0.050
 cherry+0.049
Neg logit contributions
der-0.082
en-0.065
 before-0.064
ilst-0.062
iece-0.060
Token Lucy
Token ID22162
Feature activation+1.40e+00

Pos logit contributions
P+0.043
 cherry+0.041
unt+0.041
 baker+0.041
ww+0.039
Neg logit contributions
der-0.063
en-0.049
iece-0.048
ilst-0.048
ities-0.048
Token and
Token ID290
Feature activation+7.13e-01

Pos logit contributions
P+0.032
unt+0.029
ww+0.029
 baker+0.028
 cherry+0.028
Neg logit contributions
der-0.039
en-0.030
ilst-0.029
 before-0.029
iece-0.029
Token the
Token ID262
Feature activation+2.48e-01

Pos logit contributions
P+0.017
unt+0.016
ww+0.016
 baker+0.015
 cherry+0.015
Neg logit contributions
der-0.019
en-0.015
 before-0.015
ilst-0.014
iece-0.014
Token great
Token ID1049
Feature activation+3.52e+00

Pos logit contributions
P+0.073
Elizabeth+0.063
unt+0.062
ww+0.060
www+0.060
Neg logit contributions
der-0.108
 before-0.094
inch-0.090
 since-0.087
ilst-0.085
Token time
Token ID640
Feature activation+1.27e+00

Pos logit contributions
P+0.069
unt+0.063
ww+0.062
 baker+0.060
 cherry+0.060
Neg logit contributions
der-0.110
en-0.087
 before-0.085
ilst-0.084
iece-0.083
Token.
Token ID13
Feature activation+1.89e+00

Pos logit contributions
P+0.029
 cherry+0.027
 long+0.027
 baker+0.026
unt+0.026
Neg logit contributions
der-0.033
ilst-0.027
icles-0.026
en-0.026
iece-0.025
Token From
Token ID3574
Feature activation-8.48e-01

Pos logit contributions
P+0.035
unt+0.033
ww+0.031
 cherry+0.030
Elizabeth+0.029
Neg logit contributions
der-0.062
en-0.049
ities-0.048
iece-0.048
 before-0.047
Token that
Token ID326
Feature activation+4.63e+00

Pos logit contributions
der+0.020
en+0.014
 before+0.013
ilst+0.013
iece+0.013
Neg logit contributions
P-0.024
unt-0.022
ww-0.022
 baker-0.022
 cherry-0.022
Token day
Token ID1110
Feature activation+6.11e+00

Pos logit contributions
P+0.112
 baker+0.106
 Found+0.105
 jury+0.104
Michelle+0.102
Neg logit contributions
der-0.102
 before-0.099
 since-0.090
ilst-0.089
en-0.086
Token on
Token ID319
Feature activation+2.61e+00

Pos logit contributions
 baker+0.134
 bakery+0.128
 pastry+0.125
 cherry+0.123
iw+0.112
Neg logit contributions
der-0.161
 before-0.145
icking-0.144
iece-0.138
aying-0.134
Token,
Token ID11
Feature activation+2.56e+00

Pos logit contributions
P+0.061
 baker+0.056
ww+0.054
unt+0.054
Elizabeth+0.054
Neg logit contributions
der-0.070
 before-0.056
en-0.056
ilst-0.052
 since-0.051
Token Tim
Token ID5045
Feature activation+1.22e+00

Pos logit contributions
P+0.058
unt+0.054
ww+0.053
 baker+0.052
 cherry+0.051
Neg logit contributions
der-0.073
en-0.056
 before-0.054
ilst-0.054
iece-0.053
Tokenmy
Token ID1820
Feature activation+2.66e+00

Pos logit contributions
P+0.031
unt+0.029
ww+0.029
 baker+0.028
 cherry+0.028
Neg logit contributions
der-0.031
en-0.022
 before-0.022
ilst-0.022
iece-0.022
Token knew
Token ID2993
Feature activation-8.91e-01

Pos logit contributions
P+0.059
unt+0.054
ww+0.053
 baker+0.052
 cherry+0.051
Neg logit contributions
der-0.077
en-0.058
ilst-0.057
iece-0.057
 before-0.057
Token a
Token ID257
Feature activation+3.40e+00

Pos logit contributions
der+0.026
iece+0.020
ilst+0.020
en+0.020
ities+0.020
Neg logit contributions
P-0.015
unt-0.014
ww-0.013
 good-0.013
 cherry-0.013
Token place
Token ID1295
Feature activation+1.11e-01

Pos logit contributions
P+0.061
Elizabeth+0.051
unt+0.050
Ell+0.049
Michelle+0.048
Neg logit contributions
der-0.106
 before-0.095
 since-0.088
en-0.086
ilst-0.083
Token he
Token ID339
Feature activation+1.19e-01

Pos logit contributions
P+0.002
unt+0.002
 good+0.001
 cherry+0.001
 high+0.001
Neg logit contributions
der-0.004
iece-0.003
ities-0.003
en-0.003
ilst-0.003
Token could
Token ID714
Feature activation-3.92e-02

Pos logit contributions
P+0.003
unt+0.002
ww+0.002
 cherry+0.002
 baker+0.002
Neg logit contributions
der-0.003
en-0.003
iece-0.003
inch-0.003
ities-0.003
Token always
Token ID1464
Feature activation-1.42e+00

Pos logit contributions
der+0.001
iece+0.001
inch+0.001
ilst+0.001
ities+0.001
Neg logit contributions
P-0.001
 cherry-0.001
 long-0.001
unt-0.001
 good-0.001
Token rely
Token ID8814
Feature activation+6.11e+00

Pos logit contributions
der+0.048
ilst+0.039
icles+0.038
inch+0.038
icated+0.037
Neg logit contributions
P-0.023
 good-0.021
 cherry-0.020
 long-0.020
 merry-0.020
Token on
Token ID319
Feature activation+2.70e+00

Pos logit contributions
P+0.085
unt+0.077
ww+0.075
 baker+0.074
 cherry+0.073
Neg logit contributions
der-0.220
 before-0.184
en-0.181
ilst-0.176
iece-0.176
Token.
Token ID13
Feature activation+8.14e-01

Pos logit contributions
P+0.061
 good+0.058
unt+0.058
 long+0.058
 lawyer+0.056
Neg logit contributions
der-0.073
ilst-0.055
inch-0.054
ipples-0.054
ussed-0.052
Token<|endoftext|>
Token ID50256
Feature activation-1.97e+00

Pos logit contributions
unt+0.013
P+0.013
ww+0.011
Elizabeth+0.011
 lawyer+0.011
Neg logit contributions
der-0.028
en-0.023
ities-0.023
iece-0.022
ipples-0.022
TokenOnce
Token ID7454
Feature activation-1.42e+00

Pos logit contributions
der+0.074
ities+0.058
en+0.058
inch+0.058
ilst+0.058
Neg logit contributions
P-0.031
unt-0.025
ww-0.023
 baker-0.023
 cherry-0.022
Token upon
Token ID2402
Feature activation-4.95e+00

Pos logit contributions
der+0.037
ities+0.028
en+0.026
iece+0.026
ipples+0.025
Neg logit contributions
unt-0.036
P-0.035
 long-0.033
 ladder-0.033
 high-0.033
Token his
Token ID465
Feature activation-2.15e+00

Pos logit contributions
P+0.045
unt+0.041
ww+0.040
 cherry+0.039
 baker+0.039
Neg logit contributions
der-0.067
en-0.052
ilst-0.051
 before-0.050
ities-0.050
Token new
Token ID649
Feature activation+3.11e-01

Pos logit contributions
der+0.073
 before+0.059
en+0.059
ilst+0.057
inch+0.056
Neg logit contributions
P-0.036
unt-0.032
ww-0.032
 baker-0.031
Elizabeth-0.030
Token board
Token ID3096
Feature activation+4.50e-02

Pos logit contributions
P+0.006
unt+0.005
ww+0.005
Elizabeth+0.005
 baker+0.005
Neg logit contributions
der-0.010
en-0.008
 before-0.008
ilst-0.008
iece-0.008
Token he
Token ID339
Feature activation-1.26e+00

Pos logit contributions
P+0.001
unt+0.001
ww+0.001
 long+0.001
 contest+0.001
Neg logit contributions
der-0.001
ities-0.001
en-0.001
ility-0.001
ipples-0.001
Token could
Token ID714
Feature activation-8.52e-01

Pos logit contributions
der+0.036
en+0.028
ilst+0.027
ities+0.027
iece+0.027
Neg logit contributions
P-0.028
unt-0.026
ww-0.025
 baker-0.024
 cherry-0.024
Token hardly
Token ID8941
Feature activation-7.50e+00

Pos logit contributions
der+0.031
ities+0.026
ility+0.025
icated+0.025
rified+0.025
Neg logit contributions
 long-0.012
 high-0.012
P-0.011
 ladder-0.011
 taxi-0.011
Token contain
Token ID3994
Feature activation-3.47e-01

Pos logit contributions
der+0.249
ities+0.197
en+0.196
icking+0.185
inch+0.185
Neg logit contributions
 long-0.128
P-0.123
 cherry-0.122
 high-0.122
 taxi-0.120
Token himself
Token ID2241
Feature activation+6.61e-02

Pos logit contributions
der+0.010
ipples+0.008
rified+0.008
places+0.008
icking+0.008
Neg logit contributions
 good-0.008
P-0.007
 long-0.007
 high-0.007
 big-0.007
Token.
Token ID13
Feature activation-1.09e+00

Pos logit contributions
P+0.002
 long+0.002
 good+0.002
 high+0.001
 taxi+0.001
Neg logit contributions
der-0.002
ilst-0.001
icles-0.001
ility-0.001
en-0.001
Token His
Token ID2399
Feature activation-2.27e+00

Pos logit contributions
der+0.035
en+0.028
ities+0.027
iece+0.027
ilst+0.027
Neg logit contributions
P-0.021
unt-0.019
 baker-0.018
 cherry-0.018
ww-0.018
Token dad
Token ID9955
Feature activation+2.24e+00

Pos logit contributions
der+0.075
en+0.060
 before+0.059
ilst+0.059
iece+0.058
Neg logit contributions
P-0.041
unt-0.037
ww-0.036
 baker-0.035
 cherry-0.034
Token wall
Token ID3355
Feature activation-3.45e+00

Pos logit contributions
der+0.055
 before+0.045
iece+0.041
en+0.040
ilst+0.040
Neg logit contributions
P-0.055
ww-0.051
unt-0.049
Ell-0.047
Elizabeth-0.047
Token.
Token ID13
Feature activation-5.53e-01

Pos logit contributions
der+0.079
inch+0.062
en+0.059
ities+0.058
ilst+0.058
Neg logit contributions
P-0.088
unt-0.087
 cherry-0.084
 pastry-0.083
 baker-0.083
Token It
Token ID632
Feature activation-1.38e+00

Pos logit contributions
der+0.017
en+0.013
 before+0.013
ilst+0.012
iece+0.012
Neg logit contributions
P-0.012
unt-0.011
ww-0.011
 baker-0.010
 cherry-0.010
Token said
Token ID531
Feature activation-1.31e+00

Pos logit contributions
der+0.028
 before+0.020
en+0.020
ilst+0.018
 since+0.017
Neg logit contributions
P-0.042
 baker-0.038
ww-0.038
 cherry-0.037
 pastry-0.037
Token "
Token ID366
Feature activation-7.52e-01

Pos logit contributions
der+0.035
 before+0.027
en+0.024
 himself+0.024
iece+0.024
Neg logit contributions
P-0.035
ww-0.030
Elizabeth-0.029
unt-0.029
www-0.028
TokenDO
Token ID18227
Feature activation-7.50e+00

Pos logit contributions
der+0.022
en+0.018
 before+0.018
ilst+0.018
iece+0.017
Neg logit contributions
P-0.014
unt-0.014
 baker-0.014
 pastry-0.013
 cherry-0.013
Token NOT
Token ID5626
Feature activation-7.25e-01

Pos logit contributions
der+0.216
en+0.167
 before+0.163
ilst+0.161
iece+0.160
Neg logit contributions
P-0.166
unt-0.154
ww-0.151
 cherry-0.146
 baker-0.145
Token T
Token ID309
Feature activation-2.93e+00

Pos logit contributions
der+0.020
en+0.016
 before+0.015
ilst+0.015
iece+0.015
Neg logit contributions
P-0.017
unt-0.015
ww-0.015
 baker-0.015
 cherry-0.014
TokenOU
Token ID2606
Feature activation-2.04e-01

Pos logit contributions
der+0.080
en+0.061
 before+0.061
iece+0.059
ilst+0.059
Neg logit contributions
P-0.067
unt-0.062
ww-0.062
 baker-0.061
 lawyer-0.059
TokenCH
Token ID3398
Feature activation-1.79e+00

Pos logit contributions
der+0.005
 before+0.005
ilst+0.004
en+0.004
ussed+0.004
Neg logit contributions
 baker-0.004
P-0.004
unt-0.004
 cherry-0.004
 pastry-0.004
Token".
Token ID1911
Feature activation-1.16e+00

Pos logit contributions
der+0.045
en+0.033
 before+0.033
ilst+0.032
iece+0.032
Neg logit contributions
P-0.046
unt-0.043
ww-0.042
 baker-0.041
 cherry-0.041
Token red
Token ID2266
Feature activation-1.06e+00

Pos logit contributions
P+0.015
Elizabeth+0.013
ww+0.012
Michelle+0.012
Ell+0.012
Neg logit contributions
der-0.022
en-0.020
 before-0.019
ock-0.017
ilst-0.017
Token button
Token ID4936
Feature activation-1.98e+00

Pos logit contributions
der+0.035
en+0.028
 before+0.027
ilst+0.026
icles+0.026
Neg logit contributions
P-0.020
unt-0.018
ww-0.017
Elizabeth-0.017
Ell-0.016
Token that
Token ID326
Feature activation-8.37e-01

Pos logit contributions
der+0.066
ilst+0.054
inch+0.054
en+0.053
ities+0.053
Neg logit contributions
P-0.031
unt-0.029
 cherry-0.028
 pastry-0.028
ww-0.028
Token said
Token ID531
Feature activation-4.09e-01

Pos logit contributions
der+0.023
 before+0.018
en+0.018
ilst+0.017
iece+0.017
Neg logit contributions
P-0.019
ww-0.018
unt-0.017
 baker-0.017
 cherry-0.017
Token "
Token ID366
Feature activation+4.51e-02

Pos logit contributions
der+0.013
 before+0.010
en+0.009
iece+0.009
 himself+0.009
Neg logit contributions
P-0.009
ww-0.008
unt-0.008
Elizabeth-0.008
 baker-0.007
TokenDO
Token ID18227
Feature activation-7.19e+00

Pos logit contributions
 baker+0.001
unt+0.001
P+0.001
 pastry+0.001
 ladder+0.001
Neg logit contributions
der-0.001
en-0.001
 before-0.001
ock-0.001
ilst-0.001
Token NOT
Token ID5626
Feature activation-5.75e-03

Pos logit contributions
der+0.215
en+0.172
 before+0.166
iece+0.165
ilst+0.165
Neg logit contributions
P-0.150
unt-0.136
ww-0.134
 cherry-0.128
 pastry-0.125
Token PRESS
Token ID44310
Feature activation-6.44e+00

Pos logit contributions
der+0.000
en+0.000
 before+0.000
ilst+0.000
iece+0.000
Neg logit contributions
P-0.000
unt-0.000
ww-0.000
 baker-0.000
 cherry-0.000
Token."
Token ID526
Feature activation+3.83e-01

Pos logit contributions
der+0.145
inch+0.105
ilst+0.105
en+0.104
iece+0.103
Neg logit contributions
P-0.174
unt-0.170
ww-0.163
 baker-0.160
 cherry-0.156
Token She
Token ID1375
Feature activation-2.51e-01

Pos logit contributions
P+0.008
ww+0.007
unt+0.007
 baker+0.007
 cherry+0.007
Neg logit contributions
der-0.012
 before-0.009
en-0.009
ilst-0.009
iece-0.009
Token couldn
Token ID3521
Feature activation-2.07e+00

Pos logit contributions
der+0.007
en+0.005
 before+0.005
ilst+0.005
ities+0.005
Neg logit contributions
P-0.006
unt-0.006
ww-0.006
 baker-0.005
 cherry-0.005
Token st
Token ID336
Feature activation-1.61e+00

Pos logit contributions
P+0.004
unt+0.004
Elizabeth+0.004
 baker+0.004
 lawyer+0.004
Neg logit contributions
der-0.011
 before-0.008
 himself-0.008
en-0.008
inch-0.008
Tokeninky
Token ID29246
Feature activation-1.54e+00

Pos logit contributions
der+0.055
en+0.045
ities+0.044
ock+0.044
iece+0.043
Neg logit contributions
P-0.026
unt-0.023
ww-0.023
 cherry-0.022
Elizabeth-0.021
Token that
Token ID326
Feature activation-1.38e-01

Pos logit contributions
der+0.045
ities+0.037
icles+0.036
iece+0.036
ipples+0.035
Neg logit contributions
 good-0.030
P-0.030
 long-0.029
 high-0.029
ww-0.028
Token she
Token ID673
Feature activation-2.00e+00

Pos logit contributions
der+0.005
en+0.004
ilst+0.004
iece+0.004
inch+0.004
Neg logit contributions
P-0.002
unt-0.002
ww-0.002
 cherry-0.002
 baker-0.002
Token could
Token ID714
Feature activation-1.06e+00

Pos logit contributions
der+0.050
en+0.037
 before+0.036
ilst+0.036
iece+0.035
Neg logit contributions
P-0.052
unt-0.048
ww-0.047
 baker-0.046
 cherry-0.046
Token hardly
Token ID8941
Feature activation-7.14e+00

Pos logit contributions
der+0.033
en+0.026
iece+0.025
ilst+0.025
ities+0.025
Neg logit contributions
P-0.021
unt-0.019
 cherry-0.019
 baker-0.019
 pastry-0.019
Token stand
Token ID1302
Feature activation+1.11e-01

Pos logit contributions
der+0.217
en+0.172
ities+0.164
ilst+0.161
inch+0.160
Neg logit contributions
P-0.141
 cherry-0.137
 baker-0.131
 pastry-0.130
unt-0.127
Token it
Token ID340
Feature activation-5.52e-01

Pos logit contributions
P+0.002
unt+0.002
 cherry+0.002
ww+0.002
 baker+0.002
Neg logit contributions
der-0.004
ilst-0.003
en-0.003
iece-0.003
inch-0.003
Token!
Token ID0
Feature activation-9.41e-01

Pos logit contributions
der+0.016
en+0.013
ities+0.013
ipples+0.013
icated+0.013
Neg logit contributions
P-0.011
 long-0.011
 high-0.010
 good-0.010
 cherry-0.010
Token
Token ID220
Feature activation-2.09e+00

Pos logit contributions
der+0.025
 before+0.020
ilst+0.019
en+0.019
iece+0.018
Neg logit contributions
P-0.022
ww-0.021
unt-0.020
 baker-0.020
 cherry-0.019
Token
Token ID198
Feature activation-1.33e+00

Pos logit contributions
der+0.076
en+0.064
ities+0.061
ility+0.060
icated+0.060
Neg logit contributions
P-0.031
unt-0.028
Elizabeth-0.025
Ell-0.025
 cherry-0.024
Token so
Token ID523
Feature activation+8.47e-01

Pos logit contributions
P+0.003
 cherry+0.003
 pastry+0.003
 baker+0.003
unt+0.003
Neg logit contributions
der-0.004
en-0.003
ilst-0.003
iece-0.003
ities-0.003
Token much
Token ID881
Feature activation+1.20e+00

Pos logit contributions
P+0.014
unt+0.013
ww+0.012
 baker+0.012
Elizabeth+0.011
Neg logit contributions
der-0.030
 before-0.023
en-0.023
ilst-0.023
inch-0.023
Token that
Token ID326
Feature activation+6.17e-01

Pos logit contributions
P+0.022
ww+0.021
 pastry+0.021
 cherry+0.021
 good+0.021
Neg logit contributions
der-0.037
iece-0.031
ilst-0.030
ities-0.030
en-0.030
Token he
Token ID339
Feature activation-1.02e+00

Pos logit contributions
 good+0.013
P+0.012
 long+0.012
 cherry+0.012
ww+0.012
Neg logit contributions
der-0.018
ilst-0.014
ities-0.014
ussed-0.014
en-0.014
Token could
Token ID714
Feature activation-3.77e-01

Pos logit contributions
der+0.028
iece+0.022
ities+0.022
ility+0.021
icles+0.021
Neg logit contributions
P-0.023
ww-0.021
unt-0.021
 good-0.021
 cherry-0.021
Token hardly
Token ID8941
Feature activation-7.14e+00

Pos logit contributions
der+0.012
iece+0.010
en+0.010
ities+0.010
ilst+0.010
Neg logit contributions
 cherry-0.007
P-0.007
 baker-0.006
unt-0.006
 pastry-0.006
Token move
Token ID1445
Feature activation+3.93e-01

Pos logit contributions
der+0.219
en+0.176
ities+0.174
inch+0.167
ilst+0.164
Neg logit contributions
 cherry-0.138
 baker-0.129
 pastry-0.126
 good-0.125
P-0.124
Token.
Token ID13
Feature activation+4.63e-01

Pos logit contributions
P+0.010
 good+0.010
 cherry+0.010
 long+0.010
unt+0.009
Neg logit contributions
der-0.009
ilst-0.007
ities-0.006
iece-0.006
inch-0.006
Token Finally
Token ID9461
Feature activation-4.25e-01

Pos logit contributions
P+0.010
 cherry+0.009
unt+0.009
 baker+0.009
ww+0.009
Neg logit contributions
der-0.013
en-0.010
iece-0.010
ities-0.010
ock-0.010
Token he
Token ID339
Feature activation-9.77e-01

Pos logit contributions
der+0.012
 before+0.010
en+0.010
 himself+0.009
ilst+0.009
Neg logit contributions
P-0.009
unt-0.009
ww-0.008
Elizabeth-0.008
 baker-0.008
Token had
Token ID550
Feature activation+1.49e+00

Pos logit contributions
der+0.032
ities+0.025
iece+0.025
en+0.025
icles+0.024
Neg logit contributions
P-0.019
ww-0.016
unt-0.016
 cherry-0.016
 baker-0.015
Token the
Token ID262
Feature activation-3.10e+00

Pos logit contributions
der+0.038
iece+0.029
ities+0.029
ilst+0.029
inch+0.029
Neg logit contributions
P-0.021
 cherry-0.021
unt-0.020
 pastry-0.020
 baker-0.020
Token zoo
Token ID26626
Feature activation-2.11e+00

Pos logit contributions
der+0.096
 before+0.075
en+0.075
ilst+0.073
iece+0.072
Neg logit contributions
P-0.061
unt-0.056
ww-0.055
Elizabeth-0.052
 baker-0.052
Token that
Token ID326
Feature activation-1.13e-02

Pos logit contributions
der+0.054
en+0.040
ities+0.039
inch+0.039
ility+0.038
Neg logit contributions
P-0.052
unt-0.050
 baker-0.050
 cherry-0.049
 bakery-0.049
Token she
Token ID673
Feature activation-2.08e+00

Pos logit contributions
der+0.000
en+0.000
 before+0.000
ilst+0.000
iece+0.000
Neg logit contributions
P-0.000
unt-0.000
ww-0.000
 baker-0.000
 cherry-0.000
Token could
Token ID714
Feature activation-8.47e-01

Pos logit contributions
der+0.059
en+0.045
 before+0.044
ilst+0.044
iece+0.043
Neg logit contributions
P-0.047
unt-0.043
ww-0.043
 baker-0.041
 cherry-0.041
Token hardly
Token ID8941
Feature activation-7.09e+00

Pos logit contributions
der+0.031
ities+0.024
iece+0.024
en+0.024
inch+0.024
Neg logit contributions
P-0.012
unt-0.012
 cherry-0.012
 baker-0.011
 taxi-0.011
Token wait
Token ID4043
Feature activation-2.93e-01

Pos logit contributions
der+0.224
en+0.177
ities+0.172
inch+0.170
ilst+0.168
Neg logit contributions
P-0.131
 cherry-0.130
 baker-0.125
unt-0.123
 taxi-0.123
Token.
Token ID13
Feature activation-9.40e-01

Pos logit contributions
der+0.009
en+0.007
ilst+0.007
iece+0.007
inch+0.007
Neg logit contributions
P-0.006
unt-0.006
ww-0.005
 baker-0.005
 cherry-0.005
Token She
Token ID1375
Feature activation-9.82e-01

Pos logit contributions
der+0.029
en+0.023
ilst+0.022
 before+0.022
iece+0.022
Neg logit contributions
P-0.019
unt-0.018
ww-0.017
 baker-0.017
 cherry-0.017
Token put
Token ID1234
Feature activation+2.08e+00

Pos logit contributions
der+0.029
en+0.022
 before+0.022
ilst+0.022
iece+0.021
Neg logit contributions
P-0.021
unt-0.019
ww-0.019
 baker-0.019
 cherry-0.019
Token on
Token ID319
Feature activation+1.46e+00

Pos logit contributions
P+0.042
ww+0.041
unt+0.040
 lawyer+0.039
 baker+0.039
Neg logit contributions
der-0.055
 before-0.052
 himself-0.049
icles-0.049
en-0.047
Token so
Token ID523
Feature activation-2.64e-01

Pos logit contributions
der+0.005
en+0.004
 before+0.004
ilst+0.004
iece+0.003
Neg logit contributions
P-0.003
ww-0.003
unt-0.003
 cherry-0.003
 baker-0.003
Token excited
Token ID6568
Feature activation-3.22e+00

Pos logit contributions
der+0.010
en+0.008
ilst+0.008
ities+0.008
iece+0.008
Neg logit contributions
P-0.004
unt-0.003
ww-0.003
 cherry-0.003
 baker-0.003
Token -
Token ID532
Feature activation-8.45e-01

Pos logit contributions
der+0.097
ities+0.082
inch+0.082
ility+0.081
iece+0.081
Neg logit contributions
unt-0.056
 long-0.056
ww-0.054
 baker-0.054
P-0.053
Token she
Token ID673
Feature activation-1.93e+00

Pos logit contributions
der+0.025
en+0.019
ilst+0.019
ities+0.018
iece+0.018
Neg logit contributions
P-0.018
unt-0.017
 cherry-0.017
ww-0.017
 baker-0.017
Token could
Token ID714
Feature activation-9.56e-01

Pos logit contributions
der+0.052
en+0.039
ilst+0.038
 before+0.038
iece+0.037
Neg logit contributions
P-0.047
unt-0.043
ww-0.043
 baker-0.042
 cherry-0.041
Token hardly
Token ID8941
Feature activation-7.08e+00

Pos logit contributions
der+0.033
ities+0.026
iece+0.026
inch+0.026
ilst+0.025
Neg logit contributions
P-0.016
 cherry-0.015
unt-0.014
 baker-0.014
 long-0.014
Token contain
Token ID3994
Feature activation-3.56e-01

Pos logit contributions
der+0.218
en+0.171
ities+0.167
inch+0.165
ilst+0.163
Neg logit contributions
 cherry-0.134
P-0.134
 long-0.130
 baker-0.128
 pastry-0.126
Token herself
Token ID5223
Feature activation-7.56e-01

Pos logit contributions
der+0.014
ipples+0.012
ities+0.012
icles+0.012
icking+0.012
Neg logit contributions
 good-0.004
P-0.004
 high-0.004
 long-0.004
 big-0.003
Token.
Token ID13
Feature activation-1.38e+00

Pos logit contributions
der+0.020
icles+0.015
inch+0.015
ities+0.014
icated+0.014
Neg logit contributions
 long-0.018
P-0.017
 good-0.017
 high-0.017
 pastry-0.017
Token She
Token ID1375
Feature activation-1.36e+00

Pos logit contributions
der+0.043
en+0.034
ilst+0.033
 before+0.033
iece+0.033
Neg logit contributions
P-0.027
unt-0.025
ww-0.024
 baker-0.023
 cherry-0.023
Token started
Token ID2067
Feature activation+5.63e-01

Pos logit contributions
der+0.037
en+0.028
 before+0.028
ilst+0.027
iece+0.027
Neg logit contributions
P-0.032
unt-0.030
ww-0.029
 baker-0.028
 cherry-0.028
Token so
Token ID523
Feature activation+1.63e+00

Pos logit contributions
P+0.001
unt+0.001
ww+0.001
 baker+0.001
 cherry+0.001
Neg logit contributions
der-0.001
en-0.001
ilst-0.001
 before-0.001
iece-0.001
Token excited
Token ID6568
Feature activation-3.46e+00

Pos logit contributions
P+0.030
unt+0.027
ww+0.026
Ell+0.025
Elizabeth+0.025
Neg logit contributions
der-0.053
 before-0.043
en-0.041
 himself-0.041
ilst-0.041
Token,
Token ID11
Feature activation-1.00e+00

Pos logit contributions
der+0.106
ipples+0.092
ities+0.091
inch+0.091
iece+0.090
Neg logit contributions
unt-0.059
 pastry-0.057
 long-0.055
 baker-0.054
ww-0.054
Token she
Token ID673
Feature activation-4.52e-01

Pos logit contributions
der+0.029
 before+0.023
en+0.022
ilst+0.022
iece+0.021
Neg logit contributions
P-0.022
unt-0.020
ww-0.020
 baker-0.019
 cherry-0.019
Token could
Token ID714
Feature activation+1.96e-01

Pos logit contributions
der+0.012
en+0.010
 before+0.009
ilst+0.009
iece+0.009
Neg logit contributions
P-0.011
unt-0.010
ww-0.010
 baker-0.009
 cherry-0.009
Token hardly
Token ID8941
Feature activation-7.06e+00

Pos logit contributions
P+0.004
unt+0.004
 baker+0.004
 cherry+0.004
ww+0.004
Neg logit contributions
der-0.006
en-0.005
ilst-0.005
iece-0.005
inch-0.005
Token wait
Token ID4043
Feature activation-1.30e-01

Pos logit contributions
der+0.193
en+0.148
inch+0.144
ities+0.142
ilst+0.139
Neg logit contributions
 cherry-0.157
P-0.155
 baker-0.153
 pastry-0.153
unt-0.148
Token to
Token ID284
Feature activation+1.22e+00

Pos logit contributions
der+0.004
en+0.003
ilst+0.003
iece+0.003
inch+0.003
Neg logit contributions
P-0.002
unt-0.002
ww-0.002
 baker-0.002
 cherry-0.002
Token eat
Token ID4483
Feature activation+5.63e-01

Pos logit contributions
P+0.026
unt+0.024
ww+0.024
 baker+0.023
 cherry+0.022
Neg logit contributions
der-0.036
en-0.029
 before-0.029
ilst-0.028
iece-0.027
Token the
Token ID262
Feature activation+1.57e-01

Pos logit contributions
P+0.013
unt+0.012
ww+0.012
 cherry+0.012
 baker+0.012
Neg logit contributions
der-0.016
en-0.012
ilst-0.012
iece-0.011
 before-0.011
Token tasty
Token ID25103
Feature activation+1.31e+00

Pos logit contributions
P+0.003
ww+0.002
Ell+0.002
unt+0.002
Elizabeth+0.002
Neg logit contributions
der-0.005
 before-0.004
ilst-0.004
en-0.004
 earlier-0.004
Token so
Token ID523
Feature activation+1.69e+00

Pos logit contributions
der+0.005
 before+0.004
en+0.004
ilst+0.004
iece+0.004
Neg logit contributions
P-0.003
ww-0.003
unt-0.003
 baker-0.003
Elizabeth-0.003
Token excited
Token ID6568
Feature activation-3.51e+00

Pos logit contributions
P+0.034
unt+0.030
ww+0.030
Elizabeth+0.029
Ell+0.029
Neg logit contributions
der-0.051
 before-0.041
en-0.039
ilst-0.039
inch-0.039
Token that
Token ID326
Feature activation+7.84e-01

Pos logit contributions
der+0.113
iece+0.100
ipples+0.097
ility+0.097
ities+0.096
Neg logit contributions
unt-0.060
 long-0.056
ww-0.056
 pastry-0.054
P-0.054
Token she
Token ID673
Feature activation-1.21e+00

Pos logit contributions
P+0.011
unt+0.010
ww+0.010
 cherry+0.009
 baker+0.009
Neg logit contributions
der-0.029
en-0.024
ilst-0.023
iece-0.023
inch-0.023
Token could
Token ID714
Feature activation-6.48e-01

Pos logit contributions
der+0.032
en+0.024
iece+0.024
ilst+0.023
 before+0.023
Neg logit contributions
P-0.029
unt-0.028
ww-0.027
 baker-0.026
 cherry-0.026
Token hardly
Token ID8941
Feature activation-7.06e+00

Pos logit contributions
der+0.022
en+0.017
iece+0.017
ility+0.017
ities+0.017
Neg logit contributions
P-0.011
unt-0.011
 taxi-0.011
 cherry-0.011
 baker-0.010
Token stand
Token ID1302
Feature activation-6.25e-02

Pos logit contributions
der+0.226
en+0.175
ities+0.171
inch+0.165
ilst+0.161
Neg logit contributions
 cherry-0.131
 taxi-0.130
P-0.129
 lawyer-0.126
unt-0.124
Token it
Token ID340
Feature activation-6.72e-01

Pos logit contributions
der+0.002
ilst+0.002
en+0.002
iece+0.001
inch+0.001
Neg logit contributions
P-0.001
unt-0.001
 cherry-0.001
ww-0.001
 pastry-0.001
Token.
Token ID13
Feature activation-4.15e-01

Pos logit contributions
der+0.019
ities+0.015
en+0.014
ipples+0.014
inch+0.014
Neg logit contributions
P-0.014
 long-0.014
 high-0.014
 good-0.013
unt-0.013
Token
Token ID198
Feature activation-1.65e+00

Pos logit contributions
der+0.011
 before+0.009
ilst+0.008
en+0.008
 himself+0.008
Neg logit contributions
P-0.010
ww-0.009
 baker-0.009
unt-0.008
 cherry-0.008
Token
Token ID198
Feature activation-8.82e-01

Pos logit contributions
der+0.041
 before+0.032
ilst+0.032
en+0.031
 himself+0.030
Neg logit contributions
P-0.040
ww-0.037
 baker-0.037
 cherry-0.037
 pastry-0.036
Token and
Token ID290
Feature activation-7.05e-01

Pos logit contributions
P+0.002
ww+0.002
 cherry+0.002
 baker+0.002
unt+0.002
Neg logit contributions
der-0.004
en-0.003
ilst-0.003
iece-0.003
icated-0.003
Token tomatoes
Token ID23972
Feature activation-1.43e-01

Pos logit contributions
der+0.022
ilst+0.018
en+0.018
iece+0.018
inch+0.018
Neg logit contributions
P-0.013
 cherry-0.012
ww-0.012
 baker-0.011
unt-0.011
Token.
Token ID13
Feature activation+1.14e+00

Pos logit contributions
der+0.004
ilst+0.003
icated+0.003
en+0.003
ility+0.003
Neg logit contributions
P-0.003
 cherry-0.003
 long-0.003
 good-0.003
 pastry-0.003
Token He
Token ID679
Feature activation-7.99e-02

Pos logit contributions
unt+0.021
P+0.020
 cherry+0.020
 baker+0.019
ww+0.019
Neg logit contributions
der-0.034
en-0.029
ities-0.028
ility-0.028
iece-0.028
Token could
Token ID714
Feature activation-3.03e-01

Pos logit contributions
der+0.002
en+0.002
iece+0.002
ilst+0.002
 before+0.002
Neg logit contributions
P-0.002
unt-0.002
ww-0.002
 baker-0.002
 cherry-0.002
Token hardly
Token ID8941
Feature activation-7.04e+00

Pos logit contributions
der+0.010
iece+0.008
ilst+0.008
en+0.008
ities+0.008
Neg logit contributions
 cherry-0.005
P-0.005
 baker-0.005
 pastry-0.005
 long-0.005
Token wait
Token ID4043
Feature activation-1.84e-01

Pos logit contributions
der+0.213
en+0.173
ilst+0.165
ities+0.163
iece+0.161
Neg logit contributions
 cherry-0.139
 baker-0.132
 pastry-0.132
P-0.131
 good-0.121
Token to
Token ID284
Feature activation+1.24e+00

Pos logit contributions
der+0.006
en+0.005
ilst+0.005
ities+0.005
iece+0.005
Neg logit contributions
P-0.003
 baker-0.002
 good-0.002
 pastry-0.002
 cherry-0.002
Token eat
Token ID4483
Feature activation+7.49e-01

Pos logit contributions
P+0.025
unt+0.023
ww+0.023
 baker+0.022
 cherry+0.022
Neg logit contributions
der-0.038
en-0.030
 before-0.029
ilst-0.029
iece-0.028
Token it
Token ID340
Feature activation+1.07e-01

Pos logit contributions
P+0.014
 cherry+0.014
 good+0.013
 pastry+0.013
 baker+0.013
Neg logit contributions
der-0.023
ilst-0.019
ussed-0.018
iece-0.018
ities-0.018
Token,
Token ID11
Feature activation-2.05e+00

Pos logit contributions
P+0.003
 cherry+0.003
 good+0.003
 pastry+0.003
 long+0.003
Neg logit contributions
der-0.003
ities-0.002
ussed-0.002
ilst-0.002
en-0.002
Token so
Token ID523
Feature activation+2.22e+00

Pos logit contributions
P+0.018
ww+0.017
unt+0.017
www+0.015
Ell+0.015
Neg logit contributions
der-0.019
 before-0.016
en-0.015
ilst-0.015
 himself-0.015
Token excited
Token ID6568
Feature activation-3.26e+00

Pos logit contributions
P+0.043
unt+0.042
Ell+0.039
ww+0.039
Elizabeth+0.038
Neg logit contributions
der-0.059
 before-0.054
 himself-0.051
ressed-0.048
ipples-0.048
Token that
Token ID326
Feature activation+8.48e-01

Pos logit contributions
der+0.111
ipples+0.097
ities+0.096
inch+0.095
iece+0.095
Neg logit contributions
unt-0.046
 long-0.043
 pastry-0.042
ww-0.042
P-0.042
Token she
Token ID673
Feature activation-7.56e-01

Pos logit contributions
P+0.015
unt+0.014
ww+0.014
 baker+0.013
 cherry+0.013
Neg logit contributions
der-0.028
 before-0.023
en-0.023
iece-0.022
ilst-0.022
Token could
Token ID714
Feature activation-1.55e-01

Pos logit contributions
der+0.021
en+0.016
 before+0.015
ilst+0.015
iece+0.015
Neg logit contributions
P-0.018
unt-0.017
ww-0.016
 baker-0.016
 cherry-0.016
Token hardly
Token ID8941
Feature activation-7.03e+00

Pos logit contributions
der+0.005
en+0.004
ilst+0.004
iece+0.004
 before+0.004
Neg logit contributions
P-0.003
unt-0.003
ww-0.003
 baker-0.003
 cherry-0.003
Token stand
Token ID1302
Feature activation+2.69e-01

Pos logit contributions
der+0.190
en+0.143
ities+0.137
ilst+0.136
inch+0.136
Neg logit contributions
P-0.163
 cherry-0.156
 baker-0.154
unt-0.153
 pastry-0.151
Token it
Token ID340
Feature activation-5.48e-01

Pos logit contributions
P+0.006
unt+0.005
ww+0.005
 cherry+0.005
 baker+0.005
Neg logit contributions
der-0.008
ilst-0.006
en-0.006
inch-0.006
iece-0.006
Token.
Token ID13
Feature activation+4.41e-01

Pos logit contributions
der+0.015
ities+0.012
en+0.011
ipples+0.011
inch+0.011
Neg logit contributions
P-0.012
 long-0.011
 good-0.011
 high-0.011
unt-0.011
Token
Token ID198
Feature activation-1.05e+00

Pos logit contributions
P+0.010
ww+0.009
unt+0.009
 baker+0.009
 cherry+0.009
Neg logit contributions
der-0.013
 before-0.010
en-0.010
ilst-0.010
iece-0.009
Token
Token ID198
Feature activation-2.20e-01

Pos logit contributions
der+0.027
ilst+0.021
 before+0.021
en+0.021
 himself+0.020
Neg logit contributions
P-0.025
ww-0.024
 baker-0.023
unt-0.022
 cherry-0.022
Token wind
Token ID2344
Feature activation-8.14e-01

Pos logit contributions
der+0.020
en+0.016
 before+0.015
iece+0.015
ilst+0.015
Neg logit contributions
P-0.017
unt-0.016
ww-0.016
 baker-0.016
Elizabeth-0.015
Token,
Token ID11
Feature activation-2.06e+00

Pos logit contributions
der+0.023
inch+0.020
ities+0.020
ilst+0.019
icated+0.019
Neg logit contributions
unt-0.014
 long-0.014
ww-0.014
 cherry-0.014
P-0.014
Token the
Token ID262
Feature activation-9.18e-01

Pos logit contributions
der+0.055
en+0.041
ilst+0.040
inch+0.040
ities+0.039
Neg logit contributions
P-0.049
unt-0.046
ww-0.045
 cherry-0.045
 baker-0.044
Token trees
Token ID7150
Feature activation-5.84e-01

Pos logit contributions
der+0.025
en+0.019
 before+0.019
ilst+0.018
iece+0.018
Neg logit contributions
P-0.022
unt-0.020
ww-0.020
 baker-0.019
Elizabeth-0.019
Token to
Token ID284
Feature activation-5.83e-01

Pos logit contributions
der+0.018
ities+0.014
inch+0.014
icated+0.014
ilst+0.014
Neg logit contributions
 cherry-0.011
P-0.010
unt-0.010
 long-0.010
 high-0.010
Tokenwered
Token ID8279
Feature activation-7.01e+00

Pos logit contributions
der+0.017
en+0.013
 before+0.013
ilst+0.013
iece+0.013
Neg logit contributions
P-0.012
unt-0.012
ww-0.011
 baker-0.011
 cherry-0.011
Token high
Token ID1029
Feature activation-2.59e+00

Pos logit contributions
der+0.217
ities+0.191
inch+0.187
ussed+0.174
arded+0.173
Neg logit contributions
 high-0.136
P-0.122
 big-0.122
 red-0.119
 cherry-0.119
Token,
Token ID11
Feature activation-2.23e+00

Pos logit contributions
der+0.068
inch+0.063
ilst+0.059
icated+0.058
ility+0.057
Neg logit contributions
 high-0.051
 long-0.051
P-0.050
 good-0.048
 cherry-0.048
Token and
Token ID290
Feature activation-1.18e+00

Pos logit contributions
der+0.059
inch+0.048
ilst+0.046
ities+0.046
en+0.044
Neg logit contributions
P-0.048
 cherry-0.047
unt-0.047
 high-0.046
 long-0.046
Token the
Token ID262
Feature activation-1.19e+00

Pos logit contributions
der+0.034
en+0.027
inch+0.027
ilst+0.027
ities+0.026
Neg logit contributions
P-0.024
 cherry-0.023
unt-0.023
 baker-0.022
ww-0.022
Token clouds
Token ID15114
Feature activation-1.07e-01

Pos logit contributions
der+0.035
en+0.027
ilst+0.026
 before+0.026
iece+0.026
Neg logit contributions
P-0.025
unt-0.024
ww-0.023
 cherry-0.023
 baker-0.023
Token much
Token ID881
Feature activation-1.74e-01

Pos logit contributions
P+0.010
unt+0.009
ww+0.009
 baker+0.008
 cherry+0.008
Neg logit contributions
der-0.026
en-0.021
 before-0.021
ilst-0.021
iece-0.020
Token fun
Token ID1257
Feature activation-8.74e-01

Pos logit contributions
der+0.007
ities+0.006
en+0.006
iece+0.006
inch+0.006
Neg logit contributions
P-0.002
unt-0.002
ww-0.002
 good-0.002
 cherry-0.002
Token that
Token ID326
Feature activation+1.87e-01

Pos logit contributions
der+0.023
ities+0.019
inch+0.019
iece+0.019
ility+0.018
Neg logit contributions
 long-0.018
P-0.018
ww-0.018
unt-0.018
 good-0.018
Token they
Token ID484
Feature activation-1.27e+00

Pos logit contributions
P+0.004
unt+0.003
ww+0.003
 baker+0.003
 cherry+0.003
Neg logit contributions
der-0.006
en-0.005
 before-0.004
ilst-0.004
iece-0.004
Token could
Token ID714
Feature activation-5.60e-01

Pos logit contributions
der+0.036
ilst+0.028
ities+0.027
iece+0.027
en+0.027
Neg logit contributions
P-0.028
unt-0.025
ww-0.025
 baker-0.025
 cherry-0.024
Token hardly
Token ID8941
Feature activation-7.01e+00

Pos logit contributions
der+0.019
ities+0.016
ilst+0.015
iece+0.015
en+0.015
Neg logit contributions
P-0.009
 cherry-0.008
unt-0.008
 taxi-0.008
 long-0.008
Token contain
Token ID3994
Feature activation-6.49e-01

Pos logit contributions
der+0.218
ities+0.173
en+0.168
inch+0.166
ilst+0.165
Neg logit contributions
 cherry-0.130
P-0.125
 baker-0.123
 taxi-0.121
 lawyer-0.120
Token their
Token ID511
Feature activation-8.62e-02

Pos logit contributions
der+0.024
rified+0.020
icles+0.020
ilst+0.020
iness+0.020
Neg logit contributions
 good-0.010
P-0.009
 high-0.007
 big-0.007
 long-0.007
Token excitement
Token ID14067
Feature activation-1.63e+00

Pos logit contributions
der+0.002
en+0.002
ilst+0.002
 before+0.002
iece+0.002
Neg logit contributions
P-0.002
unt-0.002
ww-0.002
 cherry-0.002
 baker-0.002
Token.
Token ID13
Feature activation+4.61e-01

Pos logit contributions
der+0.044
ilst+0.040
rified+0.039
ichever+0.038
inch+0.038
Neg logit contributions
 long-0.034
 good-0.031
 high-0.030
 few-0.029
 big-0.029
Token
Token ID198
Feature activation-1.26e+00

Pos logit contributions
P+0.011
ww+0.010
unt+0.010
 baker+0.009
 cherry+0.009
Neg logit contributions
der-0.013
 before-0.010
en-0.010
ilst-0.010
iece-0.009
Token so
Token ID523
Feature activation+1.40e+00

Pos logit contributions
der+0.020
en+0.016
ities+0.015
ilst+0.015
iece+0.015
Neg logit contributions
P-0.013
 cherry-0.012
unt-0.012
ww-0.012
 baker-0.012
Token amazed
Token ID24562
Feature activation-3.12e+00

Pos logit contributions
P+0.019
unt+0.017
ww+0.016
 baker+0.015
Elizabeth+0.015
Neg logit contributions
der-0.052
 before-0.043
en-0.043
ilst-0.042
ipples-0.042
Token,
Token ID11
Feature activation-6.49e-01

Pos logit contributions
der+0.092
ities+0.081
iece+0.081
ussed+0.077
ilst+0.076
Neg logit contributions
P-0.058
 high-0.056
ww-0.054
 long-0.053
 good-0.052
Token it
Token ID340
Feature activation-1.64e-01

Pos logit contributions
der+0.019
ilst+0.015
en+0.015
ities+0.015
inch+0.015
Neg logit contributions
P-0.013
unt-0.012
 cherry-0.012
ww-0.012
 baker-0.012
Token could
Token ID714
Feature activation-8.81e-01

Pos logit contributions
der+0.005
ities+0.004
icles+0.004
inch+0.004
iece+0.004
Neg logit contributions
P-0.003
ww-0.003
unt-0.003
 cherry-0.003
 good-0.003
Token hardly
Token ID8941
Feature activation-7.01e+00

Pos logit contributions
der+0.028
ities+0.023
en+0.023
ilst+0.023
iece+0.022
Neg logit contributions
P-0.016
 cherry-0.015
 baker-0.014
unt-0.014
 pastry-0.014
Token believe
Token ID1975
Feature activation-6.42e-01

Pos logit contributions
der+0.205
en+0.163
ities+0.161
ilst+0.158
inch+0.155
Neg logit contributions
 cherry-0.140
P-0.137
 baker-0.132
 long-0.130
 pastry-0.130
Token its
Token ID663
Feature activation-1.41e+00

Pos logit contributions
der+0.019
ilst+0.018
ities+0.017
ock+0.016
iece+0.016
Neg logit contributions
P-0.011
 good-0.010
 high-0.010
 pastry-0.010
 cherry-0.010
Token eyes
Token ID2951
Feature activation-4.58e-01

Pos logit contributions
der+0.044
iece+0.035
en+0.035
ilst+0.035
 before+0.034
Neg logit contributions
P-0.026
unt-0.024
ww-0.024
 cherry-0.024
 baker-0.023
Token!
Token ID0
Feature activation+6.97e-01

Pos logit contributions
der+0.011
inch+0.009
ities+0.009
icated+0.009
ilst+0.009
Neg logit contributions
 long-0.011
 good-0.010
 cherry-0.010
 high-0.010
P-0.010
Token
Token ID198
Feature activation-2.43e-01

Pos logit contributions
P+0.016
unt+0.015
ww+0.014
 baker+0.014
 cherry+0.014
Neg logit contributions
der-0.020
 before-0.015
en-0.015
ilst-0.014
iece-0.014
Token and
Token ID290
Feature activation-4.42e-01

Pos logit contributions
der+0.037
ilst+0.030
ities+0.030
inch+0.029
iece+0.029
Neg logit contributions
P-0.025
 cherry-0.024
 pastry-0.023
 good-0.023
 baker-0.023
Token sizes
Token ID10620
Feature activation-2.45e+00

Pos logit contributions
der+0.012
ilst+0.009
inch+0.009
iece+0.009
icking+0.009
Neg logit contributions
P-0.010
 cherry-0.010
ww-0.010
 baker-0.009
unt-0.009
Token.
Token ID13
Feature activation+3.30e-01

Pos logit contributions
der+0.067
ilst+0.056
ities+0.055
iece+0.055
inch+0.054
Neg logit contributions
 cherry-0.046
 good-0.045
 pastry-0.044
 long-0.044
P-0.044
Token He
Token ID679
Feature activation-4.57e-01

Pos logit contributions
unt+0.007
P+0.007
 baker+0.007
 cherry+0.007
 pastry+0.006
Neg logit contributions
der-0.009
en-0.007
inch-0.007
ility-0.007
ities-0.007
Token could
Token ID714
Feature activation-2.13e-01

Pos logit contributions
der+0.013
en+0.010
 before+0.009
ilst+0.009
iece+0.009
Neg logit contributions
P-0.011
unt-0.010
ww-0.010
 baker-0.009
 cherry-0.009
Token hardly
Token ID8941
Feature activation-7.01e+00

Pos logit contributions
der+0.007
ilst+0.005
iece+0.005
ities+0.005
en+0.005
Neg logit contributions
P-0.004
 cherry-0.004
 baker-0.004
 pastry-0.003
 long-0.003
Token believe
Token ID1975
Feature activation-1.01e-01

Pos logit contributions
der+0.196
en+0.155
ilst+0.151
ities+0.150
inch+0.150
Neg logit contributions
 cherry-0.146
P-0.142
 baker-0.141
 pastry-0.139
 long-0.134
Token his
Token ID465
Feature activation-2.27e-01

Pos logit contributions
der+0.003
ilst+0.003
ipples+0.003
ock+0.002
inch+0.002
Neg logit contributions
 good-0.002
P-0.002
 pastry-0.002
 long-0.002
 high-0.002
Token eyes
Token ID2951
Feature activation-4.14e-01

Pos logit contributions
der+0.005
en+0.004
 before+0.004
ilst+0.004
inch+0.003
Neg logit contributions
P-0.006
unt-0.006
ww-0.006
 baker-0.006
 cherry-0.006
Token.
Token ID13
Feature activation+1.50e-01

Pos logit contributions
der+0.010
inch+0.009
icated+0.008
ilst+0.008
ities+0.008
Neg logit contributions
 long-0.009
 cherry-0.009
 good-0.009
P-0.009
 high-0.009
Token He
Token ID679
Feature activation-4.66e-01

Pos logit contributions
P+0.003
unt+0.003
 baker+0.003
 cherry+0.003
ww+0.003
Neg logit contributions
der-0.004
en-0.003
ilst-0.003
iece-0.003
ities-0.003
Token to
Token ID284
Feature activation-2.71e-01

Pos logit contributions
der+0.100
ility+0.090
iece+0.089
ipples+0.088
ussed+0.088
Neg logit contributions
 long-0.019
unt-0.016
 few-0.014
 lawyer-0.014
 good-0.014
Token go
Token ID467
Feature activation+2.91e+00

Pos logit contributions
der+0.010
ities+0.008
ilst+0.007
inch+0.007
ility+0.007
Neg logit contributions
P-0.004
 taxi-0.004
 long-0.004
 good-0.004
unt-0.004
Token,
Token ID11
Feature activation-2.61e+00

Pos logit contributions
P+0.056
unt+0.051
ww+0.050
 baker+0.048
 cherry+0.048
Neg logit contributions
der-0.093
en-0.072
ilst-0.071
 before-0.070
iece-0.070
Token he
Token ID339
Feature activation-1.77e+00

Pos logit contributions
der+0.070
en+0.052
ility+0.051
iece+0.050
ilst+0.050
Neg logit contributions
P-0.062
unt-0.060
ww-0.058
 cherry-0.058
 pastry-0.057
Token could
Token ID714
Feature activation-7.48e-01

Pos logit contributions
der+0.052
en+0.040
ilst+0.039
iece+0.039
 before+0.039
Neg logit contributions
P-0.038
unt-0.035
ww-0.034
 baker-0.034
 cherry-0.033
Token hardly
Token ID8941
Feature activation-7.00e+00

Pos logit contributions
der+0.026
en+0.021
ility+0.020
ities+0.020
icated+0.020
Neg logit contributions
P-0.013
unt-0.012
 taxi-0.011
 cherry-0.011
 long-0.011
Token contain
Token ID3994
Feature activation-8.73e-01

Pos logit contributions
der+0.215
en+0.162
ities+0.159
inch+0.153
ilst+0.152
Neg logit contributions
P-0.140
 cherry-0.136
 taxi-0.133
 lawyer-0.131
 long-0.131
Token his
Token ID465
Feature activation-1.16e-01

Pos logit contributions
der+0.030
rified+0.025
ility+0.024
ipples+0.024
places+0.024
Neg logit contributions
 good-0.016
P-0.016
 long-0.014
 big-0.014
 high-0.014
Token excitement
Token ID14067
Feature activation-1.57e+00

Pos logit contributions
der+0.003
en+0.003
 before+0.003
ilst+0.003
inch+0.003
Neg logit contributions
P-0.002
unt-0.002
ww-0.002
 baker-0.002
 cherry-0.002
Token.
Token ID13
Feature activation-7.45e-01

Pos logit contributions
der+0.044
rified+0.036
ilst+0.036
ichever+0.034
inch+0.034
Neg logit contributions
 long-0.034
 few-0.031
 good-0.030
 taxi-0.030
 big-0.030
Token
Token ID220
Feature activation-2.28e+00

Pos logit contributions
der+0.025
ility+0.020
en+0.020
ities+0.020
icated+0.020
Neg logit contributions
P-0.013
unt-0.012
 cherry-0.011
 baker-0.011
 pastry-0.011
Token ever
Token ID1683
Feature activation-2.19e+00

Pos logit contributions
der+0.040
ussed+0.033
ities+0.033
en+0.033
icking+0.032
Neg logit contributions
 long-0.021
 good-0.020
 high-0.020
P-0.019
 lawyer-0.019
Token before
Token ID878
Feature activation-2.06e+00

Pos logit contributions
der+0.048
inch+0.040
ussed+0.040
aying+0.039
icles+0.038
Neg logit contributions
 hungry-0.057
 good-0.056
 long-0.055
 busy-0.055
 past-0.055
Token.
Token ID13
Feature activation-3.16e-01

Pos logit contributions
der+0.053
inch+0.044
ility+0.042
ilst+0.042
ities+0.042
Neg logit contributions
 long-0.050
P-0.048
 high-0.046
unt-0.045
 good-0.044
Token He
Token ID679
Feature activation-1.39e+00

Pos logit contributions
der+0.009
en+0.007
ility+0.007
iness+0.007
iece+0.007
Neg logit contributions
P-0.007
unt-0.007
 long-0.006
ww-0.006
Elizabeth-0.006
Token could
Token ID714
Feature activation-5.14e-01

Pos logit contributions
der+0.044
en+0.035
ilst+0.034
 before+0.034
iece+0.034
Neg logit contributions
P-0.027
unt-0.024
ww-0.024
 baker-0.023
 cherry-0.023
Token hardly
Token ID8941
Feature activation-6.96e+00

Pos logit contributions
der+0.019
en+0.015
ities+0.015
icated+0.015
ility+0.015
Neg logit contributions
P-0.007
 cherry-0.007
unt-0.006
Elizabeth-0.006
 taxi-0.006
Token believe
Token ID1975
Feature activation-1.41e-01

Pos logit contributions
der+0.234
ities+0.189
en+0.187
inch+0.181
ills+0.179
Neg logit contributions
 cherry-0.114
 high-0.109
 taxi-0.109
 long-0.108
P-0.106
Token it
Token ID340
Feature activation+4.63e-01

Pos logit contributions
ipples+0.004
der+0.004
ilst+0.004
ill+0.004
ussed+0.004
Neg logit contributions
 good-0.002
 high-0.002
 long-0.002
P-0.002
 on-0.002
Token
Token ID6184
Feature activation-1.42e+00

Pos logit contributions
P+0.009
 long+0.009
 good+0.009
 big+0.008
 high+0.008
Neg logit contributions
der-0.014
ipples-0.012
inch-0.012
reating-0.012
icated-0.012
Token
Token ID95
Feature activation-2.35e+00

Pos logit contributions
der+0.043
en+0.034
 before+0.033
iece+0.033
ilst+0.032
Neg logit contributions
P-0.030
unt-0.027
ww-0.027
 cherry-0.026
 baker-0.025
Token
Token ID26391
Feature activation+1.71e+00

Pos logit contributions
der+0.075
en+0.057
inch+0.056
 before+0.055
icles+0.055
Neg logit contributions
P-0.049
unt-0.044
ww-0.043
Elizabeth-0.041
Michelle-0.040
Token.
Token ID13
Feature activation-1.06e+00

Pos logit contributions
der+0.002
inch+0.002
ilst+0.002
icated+0.002
ities+0.002
Neg logit contributions
 taxi-0.002
unt-0.002
P-0.002
 cherry-0.002
 ladder-0.002
Token As
Token ID1081
Feature activation+5.10e-01

Pos logit contributions
der+0.033
en+0.027
ities+0.026
iece+0.026
ility+0.026
Neg logit contributions
P-0.020
unt-0.020
 cherry-0.019
 baker-0.019
 pastry-0.018
Token the
Token ID262
Feature activation-4.19e-01

Pos logit contributions
P+0.013
unt+0.012
ww+0.012
 baker+0.012
 cherry+0.012
Neg logit contributions
der-0.013
 before-0.010
en-0.010
ilst-0.009
iece-0.009
Token waves
Token ID9813
Feature activation+2.33e+00

Pos logit contributions
der+0.014
ities+0.011
en+0.011
ilst+0.011
ility+0.011
Neg logit contributions
P-0.008
 baker-0.007
unt-0.007
ww-0.007
 taxi-0.007
Token crashed
Token ID14997
Feature activation-2.62e+00

Pos logit contributions
P+0.048
unt+0.044
ww+0.044
 baker+0.043
 cherry+0.042
Neg logit contributions
der-0.070
en-0.054
ilst-0.053
ities-0.053
 before-0.052
Token upon
Token ID2402
Feature activation-6.96e+00

Pos logit contributions
der+0.074
en+0.056
ilst+0.055
 before+0.055
iece+0.054
Neg logit contributions
P-0.059
unt-0.055
ww-0.054
 baker-0.053
 cherry-0.053
Token the
Token ID262
Feature activation-1.67e+00

Pos logit contributions
der+0.220
ities+0.183
ilst+0.178
iece+0.178
inch+0.178
Neg logit contributions
P-0.129
 long-0.120
 cherry-0.119
 good-0.119
 high-0.117
Token shore
Token ID15191
Feature activation+3.55e-01

Pos logit contributions
der+0.047
ilst+0.039
en+0.037
ities+0.037
ility+0.036
Neg logit contributions
P-0.038
 baker-0.034
 ladder-0.034
ww-0.034
 cherry-0.033
Token,
Token ID11
Feature activation-8.08e-01

Pos logit contributions
P+0.005
unt+0.005
 baker+0.005
 cherry+0.005
 ladder+0.005
Neg logit contributions
der-0.012
icated-0.010
inch-0.010
ilst-0.010
ities-0.010
Token Bob
Token ID5811
Feature activation-1.00e+00

Pos logit contributions
der+0.023
en+0.019
ilst+0.018
iece+0.018
ities+0.018
Neg logit contributions
P-0.018
 cherry-0.017
 baker-0.017
unt-0.016
ww-0.016
Token could
Token ID714
Feature activation-7.49e-01

Pos logit contributions
der+0.034
icated+0.027
reating+0.027
ities+0.027
ility+0.027
Neg logit contributions
P-0.017
unt-0.014
 st-0.014
ww-0.014
 long-0.014
Token again
Token ID757
Feature activation+1.42e+00

Pos logit contributions
P+0.022
unt+0.021
 baker+0.021
 cherry+0.021
 taxi+0.021
Neg logit contributions
der-0.025
ities-0.019
en-0.019
ilst-0.018
icated-0.018
Token,
Token ID11
Feature activation-1.55e+00

Pos logit contributions
P+0.041
unt+0.038
ww+0.038
 cherry+0.037
 baker+0.037
Neg logit contributions
der-0.032
en-0.022
ilst-0.021
iece-0.021
ities-0.021
Token and
Token ID290
Feature activation-1.19e+00

Pos logit contributions
der+0.043
en+0.032
ities+0.031
ilst+0.031
iece+0.031
Neg logit contributions
P-0.036
unt-0.034
 cherry-0.033
 baker-0.033
ww-0.033
Token Lucy
Token ID22162
Feature activation+6.33e-01

Pos logit contributions
der+0.038
en+0.031
 before+0.030
ilst+0.030
iece+0.029
Neg logit contributions
P-0.022
unt-0.020
ww-0.020
 baker-0.019
 cherry-0.019
Token could
Token ID714
Feature activation+5.68e-02

Pos logit contributions
P+0.015
unt+0.014
ww+0.013
 baker+0.013
 cherry+0.013
Neg logit contributions
der-0.018
en-0.013
iece-0.013
ilst-0.013
ities-0.013
Token hardly
Token ID8941
Feature activation-6.95e+00

Pos logit contributions
P+0.001
 cherry+0.001
 baker+0.001
 pastry+0.001
 good+0.001
Neg logit contributions
der-0.002
iece-0.002
icated-0.002
en-0.002
ities-0.002
Token wait
Token ID4043
Feature activation+8.06e-02

Pos logit contributions
der+0.204
en+0.156
ities+0.151
icated+0.144
ilst+0.143
Neg logit contributions
 cherry-0.151
P-0.145
 baker-0.142
 pastry-0.141
 taxi-0.136
Token to
Token ID284
Feature activation+1.62e+00

Pos logit contributions
P+0.001
unt+0.001
 cherry+0.001
ww+0.001
 baker+0.001
Neg logit contributions
der-0.003
en-0.002
ilst-0.002
ities-0.002
iece-0.002
Token see
Token ID766
Feature activation+3.08e-01

Pos logit contributions
P+0.034
unt+0.031
 cherry+0.031
ww+0.031
 baker+0.030
Neg logit contributions
der-0.049
en-0.037
ilst-0.036
iece-0.036
 before-0.036
Token what
Token ID644
Feature activation+2.09e+00

Pos logit contributions
P+0.006
 cherry+0.006
 pastry+0.006
unt+0.006
 baker+0.006
Neg logit contributions
der-0.009
ilst-0.007
iece-0.007
ities-0.007
en-0.007
Token surprise
Token ID5975
Feature activation+2.76e-01

Pos logit contributions
P+0.057
unt+0.053
ww+0.052
Elizabeth+0.051
 baker+0.050
Neg logit contributions
der-0.045
 before-0.036
en-0.032
ilst-0.032
 since-0.030
Token very
Token ID845
Feature activation+1.81e+00

Pos logit contributions
der+0.008
 before+0.008
 himself+0.007
 about+0.007
en+0.007
Neg logit contributions
P-0.007
ww-0.007
unt-0.006
www-0.006
Elizabeth-0.006
Token excited
Token ID6568
Feature activation-4.00e+00

Pos logit contributions
P+0.044
Elizabeth+0.042
Ell+0.041
Michelle+0.041
ww+0.039
Neg logit contributions
der-0.042
 himself-0.036
 before-0.036
 about-0.033
ipples-0.033
Token.
Token ID13
Feature activation-8.46e-01

Pos logit contributions
der+0.148
ipples+0.131
ility+0.129
iece+0.129
inch+0.127
Neg logit contributions
 long-0.046
 pastry-0.043
unt-0.042
 lawyer-0.041
 baker-0.040
Token He
Token ID679
Feature activation-6.48e-01

Pos logit contributions
der+0.030
iness+0.024
ities+0.024
ility+0.024
ock+0.024
Neg logit contributions
P-0.012
unt-0.012
 baker-0.012
 cherry-0.012
 bakery-0.012
Token could
Token ID714
Feature activation-5.43e-01

Pos logit contributions
der+0.018
en+0.014
 before+0.014
ilst+0.014
inch+0.013
Neg logit contributions
P-0.015
ww-0.014
unt-0.013
 baker-0.013
 cherry-0.013
Token hardly
Token ID8941
Feature activation-6.95e+00

Pos logit contributions
der+0.020
en+0.016
iece+0.016
ities+0.016
ilst+0.016
Neg logit contributions
P-0.008
unt-0.007
 cherry-0.007
 baker-0.007
 pastry-0.007
Token contain
Token ID3994
Feature activation-1.03e+00

Pos logit contributions
der+0.193
en+0.144
ities+0.138
inch+0.136
ilst+0.135
Neg logit contributions
P-0.159
 cherry-0.153
 baker-0.149
unt-0.149
 pastry-0.147
Token his
Token ID465
Feature activation-3.40e-01

Pos logit contributions
der+0.038
ility+0.031
rified+0.031
ipples+0.030
places+0.030
Neg logit contributions
 good-0.017
P-0.016
 big-0.015
 nice-0.014
 long-0.014
Token joy
Token ID8716
Feature activation-6.72e-01

Pos logit contributions
der+0.010
en+0.007
 before+0.007
ilst+0.007
inch+0.007
Neg logit contributions
P-0.008
unt-0.007
ww-0.007
Ell-0.007
 baker-0.007
Token as
Token ID355
Feature activation+1.84e+00

Pos logit contributions
der+0.019
ilst+0.017
rified+0.016
inch+0.016
ipples+0.016
Neg logit contributions
 long-0.012
 good-0.011
 few-0.011
 on-0.011
 stamp-0.011
Token he
Token ID339
Feature activation-6.35e-01

Pos logit contributions
 good+0.037
 long+0.037
P+0.036
 baker+0.035
 cherry+0.035
Neg logit contributions
der-0.053
ilst-0.044
en-0.042
ities-0.042
iece-0.041
Token herself
Token ID5223
Feature activation-6.39e-01

Pos logit contributions
P+0.028
unt+0.026
ww+0.026
 baker+0.025
 cherry+0.024
Neg logit contributions
der-0.038
en-0.030
 before-0.030
ilst-0.028
iece-0.028
Token safe
Token ID3338
Feature activation-1.60e-01

Pos logit contributions
der+0.023
ilst+0.019
en+0.018
ussed+0.018
icated+0.018
Neg logit contributions
P-0.009
 good-0.009
 cherry-0.009
 long-0.008
 high-0.008
Token.
Token ID13
Feature activation+4.89e-01

Pos logit contributions
der+0.004
ilst+0.004
ussed+0.004
inch+0.004
icated+0.004
Neg logit contributions
 good-0.003
 long-0.003
P-0.003
 cherry-0.003
 high-0.003
Token From
Token ID3574
Feature activation-1.82e+00

Pos logit contributions
P+0.011
unt+0.010
ww+0.010
 baker+0.010
 cherry+0.010
Neg logit contributions
der-0.014
en-0.010
 before-0.010
ilst-0.010
inch-0.010
Token that
Token ID326
Feature activation+3.95e+00

Pos logit contributions
der+0.037
 before+0.025
en+0.024
ilst+0.023
 since+0.022
Neg logit contributions
P-0.056
unt-0.053
ww-0.052
 baker-0.052
 cherry-0.051
Token day
Token ID1110
Feature activation+5.41e+00

Pos logit contributions
P+0.086
 baker+0.079
 jury+0.078
Michelle+0.078
Elizabeth+0.077
Neg logit contributions
der-0.104
 before-0.095
en-0.086
 since-0.086
ilst-0.086
Token on
Token ID319
Feature activation+1.84e+00

Pos logit contributions
 baker+0.122
 bakery+0.118
 pastry+0.115
 cherry+0.112
 jury+0.110
Neg logit contributions
der-0.133
 before-0.126
 when-0.114
iece-0.111
icking-0.111
Token,
Token ID11
Feature activation+1.69e+00

Pos logit contributions
P+0.052
 baker+0.049
ww+0.047
Lady+0.046
 cardinal+0.046
Neg logit contributions
der-0.039
 before-0.034
en-0.031
 since-0.029
 when-0.027
Token Lily
Token ID20037
Feature activation+7.17e-01

Pos logit contributions
P+0.036
unt+0.033
ww+0.033
 baker+0.031
Elizabeth+0.031
Neg logit contributions
der-0.050
 before-0.040
en-0.039
ilst-0.038
iece-0.037
Token's
Token ID338
Feature activation-9.35e-01

Pos logit contributions
P+0.016
unt+0.015
ww+0.015
 baker+0.015
 cherry+0.015
Neg logit contributions
der-0.020
en-0.015
 before-0.015
ilst-0.015
iece-0.015
Token new
Token ID649
Feature activation+1.06e+00

Pos logit contributions
der+0.033
en+0.027
ilst+0.027
 before+0.026
iece+0.026
Neg logit contributions
P-0.014
unt-0.013
ww-0.012
 baker-0.012
 cherry-0.012
Token feathers
Token ID27400
Feature activation-2.33e-01

Pos logit contributions
P+0.002
unt+0.002
ww+0.002
Ell+0.002
Elizabeth+0.002
Neg logit contributions
der-0.002
 before-0.002
en-0.002
ities-0.002
 himself-0.002
Token.
Token ID13
Feature activation+3.41e-01

Pos logit contributions
der+0.006
ussed+0.005
ilst+0.005
ility+0.005
ities+0.005
Neg logit contributions
P-0.005
 high-0.005
 good-0.005
 long-0.005
 cherry-0.005
Token
Token ID198
Feature activation-8.70e-01

Pos logit contributions
P+0.006
unt+0.006
ww+0.005
 baker+0.005
 cherry+0.005
Neg logit contributions
der-0.011
en-0.009
ilst-0.009
iece-0.009
ities-0.009
Token
Token ID198
Feature activation-5.80e-02

Pos logit contributions
der+0.026
en+0.021
 before+0.020
ilst+0.020
iece+0.020
Neg logit contributions
P-0.018
unt-0.017
ww-0.016
 baker-0.016
 cherry-0.016
TokenOne
Token ID3198
Feature activation-1.03e+00

Pos logit contributions
der+0.002
en+0.002
 before+0.001
iece+0.001
ilst+0.001
Neg logit contributions
P-0.001
unt-0.001
ww-0.001
 baker-0.001
 cherry-0.001
Token day
Token ID1110
Feature activation+2.53e+00

Pos logit contributions
der+0.032
 before+0.027
en+0.026
ilst+0.025
icated+0.025
Neg logit contributions
P-0.019
unt-0.017
Elizabeth-0.017
Ell-0.016
Michelle-0.016
Token,
Token ID11
Feature activation+4.47e-01

Pos logit contributions
P+0.050
unt+0.045
ww+0.045
 baker+0.043
 cherry+0.042
Neg logit contributions
der-0.079
en-0.062
ilst-0.061
iece-0.060
 before-0.060
Token Wisdom
Token ID24075
Feature activation-1.46e+00

Pos logit contributions
P+0.011
unt+0.010
ww+0.010
 baker+0.009
 cherry+0.009
Neg logit contributions
der-0.012
ilst-0.009
en-0.009
iece-0.009
ility-0.009
Token told
Token ID1297
Feature activation-1.53e+00

Pos logit contributions
der+0.046
icles+0.038
ility+0.036
iece+0.036
ities+0.036
Neg logit contributions
P-0.029
unt-0.026
ww-0.025
 lawyer-0.024
 baker-0.023
Token N
Token ID399
Feature activation-2.31e+00

Pos logit contributions
der+0.044
ilst+0.034
en+0.034
iece+0.034
ipples+0.033
Neg logit contributions
P-0.033
unt-0.031
ww-0.030
 baker-0.029
 cherry-0.029
Tokenappy
Token ID7774
Feature activation+1.38e+00

Pos logit contributions
der+0.059
iece+0.046
en+0.046
ilst+0.045
 before+0.045
Neg logit contributions
P-0.056
unt-0.052
 baker-0.050
ww-0.050
 lawyer-0.048
Token doing
Token ID1804
Feature activation-7.95e-01

Pos logit contributions
der+0.041
 before+0.033
en+0.032
ilst+0.031
ities+0.031
Neg logit contributions
P-0.032
unt-0.030
ww-0.029
 baker-0.028
 cherry-0.028
Token?!"
Token ID30823
Feature activation+1.59e+00

Pos logit contributions
der+0.021
 before+0.017
en+0.016
ilst+0.015
ities+0.015
Neg logit contributions
P-0.019
ww-0.018
unt-0.018
Elizabeth-0.017
 cherry-0.017
Token her
Token ID607
Feature activation+2.36e-01

Pos logit contributions
P+0.035
ww+0.032
 baker+0.030
unt+0.030
 cherry+0.029
Neg logit contributions
der-0.044
 before-0.037
en-0.034
ilst-0.034
 himself-0.034
Token mother
Token ID2802
Feature activation+3.51e-01

Pos logit contributions
P+0.006
ww+0.006
 baker+0.006
iw+0.006
 bakery+0.006
Neg logit contributions
der-0.005
 before-0.004
en-0.004
ities-0.004
icking-0.004
Token said
Token ID531
Feature activation-1.68e-01

Pos logit contributions
P+0.008
ww+0.008
 baker+0.007
unt+0.007
 cherry+0.007
Neg logit contributions
der-0.010
 before-0.008
en-0.008
ilst-0.007
 since-0.007
Token in
Token ID287
Feature activation+2.61e+00

Pos logit contributions
der+0.004
 before+0.003
en+0.003
iece+0.003
ilst+0.002
Neg logit contributions
P-0.005
ww-0.005
unt-0.004
 baker-0.004
Elizabeth-0.004
Token a
Token ID257
Feature activation+2.92e+00

Pos logit contributions
P+0.063
unt+0.059
ww+0.058
 baker+0.056
 cherry+0.056
Neg logit contributions
der-0.070
en-0.053
 before-0.052
ilst-0.051
ities-0.050
Token stern
Token ID26370
Feature activation-9.74e-01

Pos logit contributions
P+0.066
ww+0.066
opard+0.059
Chuck+0.059
iw+0.058
Neg logit contributions
der-0.064
 before-0.056
 sill-0.053
 weren-0.051
 wasn-0.051
Token voice
Token ID3809
Feature activation-1.13e+00

Pos logit contributions
der+0.030
en+0.026
 before+0.024
iece+0.024
 himself+0.023
Neg logit contributions
P-0.021
ww-0.018
unt-0.018
 cherry-0.017
Elizabeth-0.017
Token.
Token ID13
Feature activation+1.11e-01

Pos logit contributions
der+0.024
en+0.017
ilst+0.016
inch+0.016
 before+0.016
Neg logit contributions
P-0.033
unt-0.031
ww-0.030
 baker-0.030
 cherry-0.030
Token
Token ID220
Feature activation-1.36e+00

Pos logit contributions
P+0.003
ww+0.003
 baker+0.002
www+0.002
 stamp+0.002
Neg logit contributions
der-0.003
 before-0.002
 himself-0.002
ilst-0.002
en-0.002
Token.
Token ID13
Feature activation+1.30e+00

Pos logit contributions
 good+0.005
 cherry+0.005
 pastry+0.005
P+0.005
 long+0.005
Neg logit contributions
der-0.006
inch-0.005
ities-0.005
ussed-0.005
iece-0.005
Token He
Token ID679
Feature activation+2.23e-02

Pos logit contributions
P+0.027
unt+0.027
 baker+0.025
ww+0.025
 cherry+0.025
Neg logit contributions
der-0.035
en-0.030
iece-0.029
ities-0.029
ilst-0.028
Token decided
Token ID3066
Feature activation+5.30e-01

Pos logit contributions
P+0.000
unt+0.000
ww+0.000
 cherry+0.000
 baker+0.000
Neg logit contributions
der-0.001
iece-0.001
ilst-0.001
ities-0.000
en-0.000
Token to
Token ID284
Feature activation-1.05e-01

Pos logit contributions
P+0.008
unt+0.008
ww+0.007
 cherry+0.007
 baker+0.007
Neg logit contributions
der-0.018
en-0.015
ilst-0.015
ities-0.015
inch-0.014
Token take
Token ID1011
Feature activation+2.27e+00

Pos logit contributions
der+0.004
iece+0.003
ilst+0.003
ities+0.003
icated+0.003
Neg logit contributions
P-0.002
 cherry-0.002
unt-0.002
 good-0.002
 baker-0.002
Token a
Token ID257
Feature activation+3.04e+00

Pos logit contributions
P+0.056
unt+0.055
 baker+0.052
ww+0.052
 cherry+0.051
Neg logit contributions
der-0.057
en-0.045
 before-0.044
 himself-0.042
iece-0.041
Token bath
Token ID7837
Feature activation+9.13e-01

Pos logit contributions
P+0.076
 baker+0.071
unt+0.070
Lady+0.068
Elizabeth+0.068
Neg logit contributions
der-0.066
 before-0.062
ock-0.054
inch-0.054
ilst-0.054
Token and
Token ID290
Feature activation-9.83e-01

Pos logit contributions
P+0.021
 baker+0.018
unt+0.017
ww+0.017
 good+0.017
Neg logit contributions
der-0.025
icated-0.021
en-0.020
ility-0.020
ities-0.020
Token clean
Token ID3424
Feature activation+8.22e-01

Pos logit contributions
der+0.033
ities+0.027
ilst+0.026
iece+0.026
en+0.026
Neg logit contributions
P-0.017
 cherry-0.016
 good-0.016
 long-0.015
 ladder-0.015
Token himself
Token ID2241
Feature activation+8.74e-02

Pos logit contributions
 good+0.015
P+0.015
 pastry+0.013
 high+0.013
 basket+0.013
Neg logit contributions
der-0.026
ities-0.022
ilst-0.021
ility-0.021
icated-0.021
Token up
Token ID510
Feature activation+8.35e-01

Pos logit contributions
 good+0.002
P+0.002
 long+0.002
 high+0.002
 cherry+0.002
Neg logit contributions
der-0.003
icated-0.002
ities-0.002
ility-0.002
ilst-0.002
Token the
Token ID262
Feature activation-1.79e+00

Pos logit contributions
P+0.032
unt+0.029
ww+0.029
 baker+0.028
 cherry+0.028
Neg logit contributions
der-0.043
en-0.033
 before-0.033
ilst-0.033
iece-0.032
Token shop
Token ID6128
Feature activation+1.23e+00

Pos logit contributions
der+0.053
 before+0.042
en+0.040
 himself+0.038
inch+0.038
Neg logit contributions
P-0.037
unt-0.034
ww-0.033
Elizabeth-0.032
Ell-0.032
Token and
Token ID290
Feature activation-2.85e-02

Pos logit contributions
 baker+0.026
 taxi+0.026
 lawyer+0.026
 ladder+0.026
unt+0.025
Neg logit contributions
der-0.034
ility-0.027
ilst-0.026
en-0.026
ities-0.026
Token give
Token ID1577
Feature activation+1.96e+00

Pos logit contributions
der+0.001
en+0.001
 before+0.001
ilst+0.001
iece+0.001
Neg logit contributions
P-0.001
unt-0.001
ww-0.001
 baker-0.001
 cherry-0.001
Token the
Token ID262
Feature activation+4.69e-01

Pos logit contributions
P+0.050
unt+0.049
ww+0.047
Ell+0.046
 baker+0.046
Neg logit contributions
der-0.046
 before-0.037
en-0.036
inch-0.034
 himself-0.034
Token truck
Token ID7779
Feature activation+2.72e+00

Pos logit contributions
P+0.008
Ell+0.007
ww+0.007
unt+0.007
Elizabeth+0.007
Neg logit contributions
der-0.015
 before-0.013
icated-0.012
icles-0.012
inch-0.011
Token back
Token ID736
Feature activation+1.67e+00

Pos logit contributions
P+0.048
unt+0.044
 baker+0.044
 cherry+0.044
ww+0.043
Neg logit contributions
der-0.090
en-0.073
ilst-0.070
ities-0.070
ility-0.070
Token.
Token ID13
Feature activation+8.04e-01

Pos logit contributions
P+0.034
unt+0.032
 cherry+0.031
ww+0.031
 taxi+0.031
Neg logit contributions
der-0.051
ities-0.039
en-0.039
iece-0.039
ussed-0.039
Token Maybe
Token ID6674
Feature activation+3.10e-01

Pos logit contributions
P+0.016
unt+0.016
ww+0.015
 cherry+0.014
 baker+0.014
Neg logit contributions
der-0.025
en-0.020
ilst-0.019
iece-0.019
inch-0.019
Token we
Token ID356
Feature activation+2.42e-01

Pos logit contributions
P+0.006
unt+0.006
 cherry+0.006
 baker+0.006
ww+0.006
Neg logit contributions
der-0.010
en-0.008
ilst-0.008
iece-0.007
inch-0.007
Token can
Token ID460
Feature activation+1.21e+00

Pos logit contributions
P+0.005
 cherry+0.004
unt+0.004
ww+0.004
 baker+0.004
Neg logit contributions
der-0.008
en-0.006
iece-0.006
ilst-0.006
ities-0.006
Token around
Token ID1088
Feature activation+7.57e-01

Pos logit contributions
P+0.017
 good+0.015
 high+0.015
 long+0.014
 few+0.014
Neg logit contributions
der-0.027
ities-0.022
icking-0.021
ility-0.021
ilst-0.021
Token.
Token ID13
Feature activation-2.36e-01

Pos logit contributions
P+0.018
 baker+0.016
 cherry+0.016
 pastry+0.016
 long+0.016
Neg logit contributions
der-0.020
inch-0.016
ities-0.015
ilst-0.015
en-0.015
Token But
Token ID887
Feature activation-3.04e+00

Pos logit contributions
der+0.007
en+0.006
iece+0.005
ities+0.005
ilst+0.005
Neg logit contributions
P-0.005
unt-0.005
 cherry-0.004
 baker-0.004
ww-0.004
Token the
Token ID262
Feature activation-1.65e+00

Pos logit contributions
der+0.077
en+0.056
ities+0.055
ilst+0.055
inch+0.054
Neg logit contributions
P-0.076
unt-0.071
 cherry-0.071
ww-0.071
 baker-0.069
Token mud
Token ID17492
Feature activation-1.38e+00

Pos logit contributions
der+0.054
en+0.043
ilst+0.043
iece+0.042
ities+0.042
Neg logit contributions
P-0.030
unt-0.027
ww-0.027
 baker-0.026
 cherry-0.026
Token was
Token ID373
Feature activation-6.88e-01

Pos logit contributions
der+0.039
icated+0.034
ility+0.034
ities+0.032
inch+0.031
Neg logit contributions
P-0.027
 ladder-0.026
 cherry-0.026
 good-0.025
ww-0.025
Token very
Token ID845
Feature activation+1.30e+00

Pos logit contributions
der+0.022
iece+0.018
en+0.017
ities+0.017
inch+0.017
Neg logit contributions
P-0.013
 cherry-0.012
 long-0.012
 good-0.012
unt-0.012
Token filthy
Token ID35677
Feature activation-2.80e+00

Pos logit contributions
P+0.020
unt+0.018
ww+0.018
 baker+0.017
 cherry+0.017
Neg logit contributions
der-0.046
en-0.037
 before-0.036
ilst-0.036
iece-0.036
Token and
Token ID290
Feature activation-3.94e-01

Pos logit contributions
der+0.076
ility+0.074
inch+0.073
ric+0.072
iece+0.070
Neg logit contributions
 good-0.051
 long-0.051
 high-0.048
 round-0.047
 big-0.046
Token soon
Token ID2582
Feature activation-8.19e-01

Pos logit contributions
der+0.011
iece+0.009
en+0.009
ities+0.009
ilst+0.009
Neg logit contributions
P-0.008
ww-0.008
unt-0.008
 cherry-0.008
 good-0.007
Token the
Token ID262
Feature activation-1.49e+00

Pos logit contributions
der+0.024
icated+0.020
ities+0.020
iece+0.019
ussed+0.019
Neg logit contributions
P-0.015
 good-0.015
 cherry-0.015
ww-0.015
 long-0.015
Token
Token ID198
Feature activation-7.10e-01

Pos logit contributions
P+0.006
unt+0.005
ww+0.005
 baker+0.005
 cherry+0.005
Neg logit contributions
der-0.008
en-0.006
 before-0.006
ilst-0.006
iece-0.006
Token
Token ID198
Feature activation+1.38e-01

Pos logit contributions
der+0.023
en+0.019
ilst+0.018
ities+0.018
iece+0.018
Neg logit contributions
P-0.013
unt-0.012
ww-0.011
 cherry-0.011
 baker-0.011
TokenAnna
Token ID31160
Feature activation-2.63e-01

Pos logit contributions
P+0.002
unt+0.002
ww+0.002
 cherry+0.002
 baker+0.002
Neg logit contributions
der-0.005
en-0.004
 before-0.004
ilst-0.004
iece-0.004
Token feels
Token ID5300
Feature activation-7.96e-02

Pos logit contributions
der+0.008
ilst+0.006
 before+0.006
en+0.006
iece+0.006
Neg logit contributions
P-0.005
unt-0.005
 baker-0.005
 cherry-0.005
 pastry-0.005
Token sad
Token ID6507
Feature activation-4.28e+00

Pos logit contributions
der+0.003
ilst+0.003
en+0.003
icles+0.003
inch+0.003
Neg logit contributions
P-0.001
 good-0.001
 cherry-0.001
 baker-0.001
 pastry-0.001
Token.
Token ID13
Feature activation+1.08e-01

Pos logit contributions
ilst+0.111
icles+0.105
inch+0.103
ressed+0.101
ussed+0.101
Neg logit contributions
 good-0.085
 long-0.080
 big-0.073
 lawyer-0.073
unt-0.072
Token She
Token ID1375
Feature activation-1.42e-01

Pos logit contributions
P+0.002
ww+0.002
unt+0.002
 cherry+0.002
 baker+0.002
Neg logit contributions
der-0.003
 before-0.002
en-0.002
iece-0.002
ities-0.002
Token really
Token ID1107
Feature activation+5.13e-01

Pos logit contributions
der+0.005
icles+0.004
ilst+0.004
iece+0.004
en+0.004
Neg logit contributions
P-0.002
 good-0.002
unt-0.002
 pastry-0.002
 cherry-0.002
Token wants
Token ID3382
Feature activation-1.35e-01

Pos logit contributions
P+0.010
unt+0.009
ww+0.009
 baker+0.009
 cherry+0.008
Neg logit contributions
der-0.016
en-0.013
ilst-0.013
 before-0.013
icles-0.012
Token to
Token ID284
Feature activation+1.64e-01

Pos logit contributions
der+0.004
 before+0.003
en+0.003
 himself+0.003
icles+0.003
Neg logit contributions
P-0.003
ww-0.003
unt-0.003
www-0.003
Ell-0.003
Token taste
Token ID6938
Feature activation-3.41e-01

Pos logit contributions
P+0.004
unt+0.003
ww+0.003
 baker+0.003
Elizabeth+0.003
Neg logit contributions
der-0.005
en-0.004
 before-0.004
ilst-0.004
inch-0.003
Token out
Token ID503
Feature activation+6.24e-01

Pos logit contributions
P+0.009
ww+0.009
unt+0.009
 baker+0.008
www+0.008
Neg logit contributions
der-0.010
 before-0.009
en-0.008
ilst-0.008
iece-0.008
Token of
Token ID286
Feature activation+6.59e-01

Pos logit contributions
P+0.019
ww+0.018
unt+0.017
 baker+0.017
Elizabeth+0.016
Neg logit contributions
der-0.014
 before-0.010
en-0.010
ilst-0.009
iece-0.009
Token her
Token ID607
Feature activation-1.74e+00

Pos logit contributions
P+0.016
ww+0.016
unt+0.015
 baker+0.014
www+0.014
Neg logit contributions
der-0.017
 before-0.014
en-0.014
ilst-0.013
 himself-0.013
Token box
Token ID3091
Feature activation+7.23e-01

Pos logit contributions
der+0.042
 before+0.039
ilst+0.035
 as+0.034
 how+0.033
Neg logit contributions
ww-0.040
P-0.040
www-0.037
unt-0.036
Ell-0.036
Token and
Token ID290
Feature activation-1.96e-01

Pos logit contributions
P+0.015
unt+0.015
 cherry+0.015
 long+0.014
 ladder+0.014
Neg logit contributions
der-0.020
icated-0.015
en-0.015
ility-0.015
inch-0.015
Token puts
Token ID7584
Feature activation+1.59e+00

Pos logit contributions
der+0.006
en+0.005
 before+0.004
ilst+0.004
iece+0.004
Neg logit contributions
P-0.004
unt-0.004
ww-0.004
 baker-0.004
 cherry-0.004
Token them
Token ID606
Feature activation+9.32e-01

Pos logit contributions
unt+0.047
ww+0.046
P+0.045
 lawyer+0.043
Elizabeth+0.043
Neg logit contributions
der-0.029
 before-0.025
en-0.024
 himself-0.024
icles-0.023
Token on
Token ID319
Feature activation+1.01e+00

Pos logit contributions
unt+0.019
ww+0.019
P+0.019
www+0.017
 baker+0.017
Neg logit contributions
der-0.027
 before-0.023
iece-0.022
 himself-0.022
en-0.022
Token the
Token ID262
Feature activation-1.27e+00

Pos logit contributions
P+0.024
ww+0.022
unt+0.022
 baker+0.021
 cherry+0.020
Neg logit contributions
der-0.028
en-0.022
 before-0.021
ilst-0.021
iece-0.020
Token floor
Token ID4314
Feature activation-5.01e-01

Pos logit contributions
der+0.030
 before+0.026
iece+0.024
 himself+0.024
icles+0.022
Neg logit contributions
ww-0.033
P-0.033
unt-0.030
www-0.029
Elizabeth-0.029
Token.
Token ID13
Feature activation+7.34e-01

Pos logit contributions
der+0.012
icated+0.009
inch+0.009
en+0.009
ilst+0.009
Neg logit contributions
P-0.012
 baker-0.012
 pastry-0.012
 long-0.012
 cherry-0.012
Token and
Token ID290
Feature activation-1.44e+00

Pos logit contributions
der+0.001
en+0.001
 before+0.001
ilst+0.001
iece+0.001
Neg logit contributions
P-0.001
unt-0.001
ww-0.001
 baker-0.001
 cherry-0.001
Token started
Token ID2067
Feature activation+7.54e-01

Pos logit contributions
der+0.042
en+0.033
ilst+0.033
iece+0.032
inch+0.032
Neg logit contributions
P-0.030
unt-0.028
ww-0.027
 cherry-0.027
 baker-0.027
Token to
Token ID284
Feature activation-1.25e+00

Pos logit contributions
P+0.017
unt+0.016
ww+0.016
www+0.015
Elizabeth+0.015
Neg logit contributions
der-0.022
 before-0.018
en-0.017
 himself-0.017
ilst-0.016
Token feel
Token ID1254
Feature activation-6.59e-01

Pos logit contributions
der+0.041
ilst+0.032
en+0.032
iece+0.031
inch+0.031
Neg logit contributions
P-0.022
 cherry-0.021
unt-0.020
 baker-0.020
 pastry-0.020
Token better
Token ID1365
Feature activation+5.04e-02

Pos logit contributions
der+0.025
ilst+0.021
en+0.021
ities+0.021
icles+0.021
Neg logit contributions
 good-0.008
P-0.008
 cherry-0.008
 high-0.007
 long-0.007
Token.
Token ID13
Feature activation+5.59e-02

Pos logit contributions
 long+0.001
 good+0.001
P+0.001
 high+0.001
 busy+0.001
Neg logit contributions
der-0.001
icles-0.001
ilst-0.001
icated-0.001
ussed-0.001
Token
Token ID198
Feature activation-1.10e+00

Pos logit contributions
P+0.001
ww+0.001
unt+0.001
 baker+0.001
 lawyer+0.001
Neg logit contributions
der-0.002
 before-0.001
ilst-0.001
en-0.001
iece-0.001
Token
Token ID198
Feature activation-2.53e-01

Pos logit contributions
der+0.031
en+0.024
 before+0.024
ilst+0.024
iece+0.023
Neg logit contributions
P-0.024
ww-0.022
unt-0.022
 baker-0.022
 cherry-0.022
TokenAs
Token ID1722
Feature activation+7.18e-01

Pos logit contributions
der+0.007
en+0.006
 before+0.006
ilst+0.006
iece+0.006
Neg logit contributions
P-0.005
unt-0.005
ww-0.005
 baker-0.005
 cherry-0.005
Token the
Token ID262
Feature activation+2.37e-02

Pos logit contributions
P+0.020
ww+0.018
 baker+0.017
unt+0.017
lder+0.017
Neg logit contributions
der-0.016
 before-0.015
 since-0.013
 when-0.012
en-0.012
Token days
Token ID1528
Feature activation-1.46e-01

Pos logit contributions
P+0.000
 jury+0.000
Ell+0.000
Michelle+0.000
Elizabeth+0.000
Neg logit contributions
der-0.001
 before-0.001
en-0.001
icles-0.001
ilst-0.001
Token song
Token ID3496
Feature activation-9.22e-01

Pos logit contributions
P+0.029
Elizabeth+0.025
unt+0.024
Michelle+0.024
 baker+0.024
Neg logit contributions
der-0.050
 before-0.042
en-0.040
icated-0.039
 since-0.038
Token that
Token ID326
Feature activation-4.65e-01

Pos logit contributions
der+0.023
inch+0.018
ities+0.018
ilst+0.018
en+0.018
Neg logit contributions
unt-0.022
P-0.021
ww-0.020
 cherry-0.020
 pastry-0.020
Token was
Token ID373
Feature activation-1.37e+00

Pos logit contributions
der+0.013
en+0.010
 before+0.009
ilst+0.009
iece+0.009
Neg logit contributions
P-0.011
unt-0.010
ww-0.010
 baker-0.010
 cherry-0.010
Token much
Token ID881
Feature activation+1.51e-01

Pos logit contributions
der+0.044
en+0.035
 before+0.035
ilst+0.034
iece+0.033
Neg logit contributions
P-0.026
unt-0.024
ww-0.023
 baker-0.023
 cherry-0.022
Token better
Token ID1365
Feature activation-2.60e-01

Pos logit contributions
P+0.003
unt+0.003
ww+0.003
 baker+0.003
Elizabeth+0.003
Neg logit contributions
der-0.004
en-0.003
 before-0.003
ilst-0.003
iece-0.003
Token than
Token ID621
Feature activation-1.28e+00

Pos logit contributions
der+0.007
iece+0.006
ilst+0.006
icles+0.006
inch+0.006
Neg logit contributions
P-0.006
ww-0.005
unt-0.005
 good-0.005
 long-0.005
Token her
Token ID607
Feature activation-1.89e+00

Pos logit contributions
der+0.037
ilst+0.029
en+0.029
iece+0.029
inch+0.028
Neg logit contributions
P-0.026
unt-0.025
 cherry-0.025
 baker-0.024
ww-0.024
Token own
Token ID898
Feature activation-3.19e-01

Pos logit contributions
der+0.057
ilst+0.045
en+0.045
iece+0.045
ility+0.043
Neg logit contributions
P-0.037
unt-0.035
 cherry-0.034
ww-0.034
 baker-0.034
Token.
Token ID13
Feature activation-1.32e-02

Pos logit contributions
der+0.008
iece+0.006
ilst+0.006
ities+0.006
inch+0.005
Neg logit contributions
P-0.008
unt-0.008
ww-0.008
 cherry-0.008
 baker-0.008
Token She
Token ID1375
Feature activation-7.15e-01

Pos logit contributions
der+0.000
 before+0.000
en+0.000
ilst+0.000
 since+0.000
Neg logit contributions
P-0.000
ww-0.000
unt-0.000
Elizabeth-0.000
 baker-0.000
Token felt
Token ID2936
Feature activation-1.77e-01

Pos logit contributions
der+0.020
en+0.016
 before+0.015
ilst+0.015
iece+0.015
Neg logit contributions
P-0.016
unt-0.015
ww-0.015
 baker-0.014
 cherry-0.014
Token hair
Token ID4190
Feature activation-2.43e+00

Pos logit contributions
P+0.016
ww+0.014
 cherry+0.014
unt+0.014
 good+0.014
Neg logit contributions
der-0.023
en-0.018
iece-0.017
ility-0.017
ities-0.017
Token.
Token ID13
Feature activation-2.35e+00

Pos logit contributions
der+0.060
ilst+0.048
en+0.045
ility+0.045
icated+0.045
Neg logit contributions
P-0.059
 cherry-0.055
unt-0.054
ww-0.054
 baker-0.053
Token Suddenly
Token ID24975
Feature activation-1.59e+00

Pos logit contributions
der+0.067
en+0.052
 before+0.051
ilst+0.050
iece+0.050
Neg logit contributions
P-0.052
unt-0.048
ww-0.047
 baker-0.046
 cherry-0.045
Token,
Token ID11
Feature activation-8.85e-01

Pos logit contributions
der+0.048
 before+0.038
en+0.038
iece+0.036
ilst+0.036
Neg logit contributions
P-0.033
unt-0.030
ww-0.029
 baker-0.028
 cherry-0.028
Token he
Token ID339
Feature activation-2.34e+00

Pos logit contributions
der+0.025
en+0.020
 before+0.019
ilst+0.019
iece+0.019
Neg logit contributions
P-0.019
unt-0.018
ww-0.018
 baker-0.017
 cherry-0.017
Token accidentally
Token ID14716
Feature activation-3.68e+00

Pos logit contributions
der+0.080
en+0.064
ilst+0.064
 before+0.063
ities+0.062
Neg logit contributions
P-0.039
unt-0.034
ww-0.034
 cherry-0.033
 baker-0.033
Token cut
Token ID2005
Feature activation-1.04e+00

Pos logit contributions
der+0.131
en+0.105
ities+0.104
ilst+0.104
iece+0.104
Neg logit contributions
P-0.056
 cherry-0.048
ww-0.048
unt-0.046
 pastry-0.045
Token his
Token ID465
Feature activation-2.02e+00

Pos logit contributions
der+0.032
ility+0.025
ilst+0.025
ities+0.024
ussed+0.024
Neg logit contributions
P-0.020
 good-0.018
 cherry-0.018
 long-0.018
ww-0.018
Token finger
Token ID7660
Feature activation-1.12e+00

Pos logit contributions
der+0.059
 before+0.046
en+0.045
ities+0.044
iece+0.044
Neg logit contributions
P-0.044
unt-0.041
ww-0.040
 baker-0.038
Elizabeth-0.038
Token and
Token ID290
Feature activation-8.03e-01

Pos logit contributions
der+0.031
ilst+0.024
ility+0.024
icated+0.024
inch+0.024
Neg logit contributions
P-0.023
 cherry-0.022
 long-0.022
 pastry-0.022
 high-0.022
Token started
Token ID2067
Feature activation+8.01e-01

Pos logit contributions
der+0.024
en+0.019
ilst+0.019
inch+0.019
iece+0.019
Neg logit contributions
P-0.016
ww-0.014
 cherry-0.014
unt-0.014
 pastry-0.014
Token exclaimed
Token ID33552
Feature activation-7.77e-01

Pos logit contributions
der+0.015
 before+0.012
en+0.012
 himself+0.011
ilst+0.011
Neg logit contributions
P-0.017
 baker-0.016
ww-0.016
 cherry-0.016
unt-0.016
Token.
Token ID13
Feature activation-4.76e-01

Pos logit contributions
der+0.015
en+0.010
ilst+0.010
 before+0.010
iece+0.010
Neg logit contributions
P-0.024
unt-0.023
ww-0.022
 baker-0.022
 cherry-0.022
Token Lily
Token ID20037
Feature activation-5.62e-01

Pos logit contributions
der+0.011
 before+0.009
 himself+0.008
icles+0.008
 since+0.007
Neg logit contributions
P-0.013
ww-0.012
Elizabeth-0.012
Michelle-0.011
 merry-0.011
Token was
Token ID373
Feature activation+1.75e-01

Pos logit contributions
der+0.014
en+0.010
 before+0.010
ilst+0.010
inch+0.009
Neg logit contributions
P-0.015
unt-0.014
ww-0.014
 baker-0.013
 cherry-0.013
Token so
Token ID523
Feature activation+1.91e+00

Pos logit contributions
P+0.004
unt+0.004
ww+0.004
 baker+0.004
 cherry+0.003
Neg logit contributions
der-0.005
 before-0.004
en-0.004
ilst-0.004
iece-0.004
Token happy
Token ID3772
Feature activation-2.71e+00

Pos logit contributions
unt+0.036
P+0.036
ww+0.033
Elizabeth+0.033
 lawyer+0.031
Neg logit contributions
der-0.056
 himself-0.046
inch-0.046
 before-0.045
ipples-0.044
Token and
Token ID290
Feature activation+8.49e-01

Pos logit contributions
der+0.086
ility+0.077
ipples+0.076
iece+0.076
ities+0.076
Neg logit contributions
P-0.040
 pastry-0.040
unt-0.039
 baker-0.038
 lawyer-0.038
Token thanked
Token ID26280
Feature activation+8.17e-01

Pos logit contributions
P+0.019
unt+0.018
ww+0.017
 baker+0.017
 cherry+0.017
Neg logit contributions
der-0.024
 before-0.019
en-0.018
ilst-0.018
 himself-0.018
Token Tim
Token ID5045
Feature activation+3.77e-01

Pos logit contributions
P+0.021
unt+0.020
ww+0.020
 baker+0.019
 cherry+0.019
Neg logit contributions
der-0.020
en-0.015
 before-0.015
ilst-0.015
iece-0.014
Tokenmy
Token ID1820
Feature activation+2.06e+00

Pos logit contributions
P+0.007
unt+0.006
 baker+0.006
 good+0.006
 lawyer+0.006
Neg logit contributions
der-0.012
ipples-0.009
ities-0.009
ussed-0.009
en-0.009
Token.
Token ID13
Feature activation+3.04e-01

Pos logit contributions
P+0.051
unt+0.049
 cherry+0.049
 baker+0.049
 pastry+0.049
Neg logit contributions
der-0.050
en-0.036
ilst-0.036
icles-0.035
ipples-0.034
Token soup
Token ID17141
Feature activation-5.58e-01

Pos logit contributions
der+0.024
ilst+0.019
en+0.018
ussed+0.018
iece+0.018
Neg logit contributions
P-0.016
ww-0.014
 cherry-0.014
 pastry-0.014
 baker-0.014
Token for
Token ID329
Feature activation-2.31e-01

Pos logit contributions
der+0.015
en+0.012
ilst+0.012
iece+0.011
inch+0.011
Neg logit contributions
P-0.013
unt-0.012
ww-0.012
 baker-0.011
 cherry-0.011
Token you
Token ID345
Feature activation-1.82e+00

Pos logit contributions
der+0.006
 before+0.005
en+0.005
ities+0.005
ilst+0.005
Neg logit contributions
P-0.005
unt-0.005
ww-0.005
 baker-0.005
Elizabeth-0.005
Token.
Token ID13
Feature activation-1.05e+00

Pos logit contributions
der+0.052
en+0.040
ilst+0.039
iece+0.039
inch+0.038
Neg logit contributions
P-0.040
unt-0.037
ww-0.036
 baker-0.036
 cherry-0.036
Token But
Token ID887
Feature activation-3.20e+00

Pos logit contributions
der+0.030
en+0.024
ilst+0.024
iece+0.023
inch+0.023
Neg logit contributions
P-0.023
unt-0.021
ww-0.021
 baker-0.021
 cherry-0.021
Token you
Token ID345
Feature activation-3.72e+00

Pos logit contributions
der+0.091
en+0.069
ilst+0.068
iece+0.067
inch+0.066
Neg logit contributions
P-0.071
unt-0.066
ww-0.066
 cherry-0.064
 baker-0.064
Token have
Token ID423
Feature activation+2.24e-01

Pos logit contributions
der+0.130
iece+0.103
en+0.103
ilst+0.101
inch+0.099
Neg logit contributions
P-0.060
 cherry-0.055
unt-0.055
ww-0.053
 baker-0.052
Token to
Token ID284
Feature activation-1.05e+00

Pos logit contributions
P+0.005
unt+0.004
ww+0.004
Ell+0.004
Elizabeth+0.004
Neg logit contributions
der-0.007
 before-0.005
en-0.005
ilst-0.005
ities-0.005
Token relax
Token ID8960
Feature activation-4.81e-01

Pos logit contributions
der+0.034
ilst+0.026
iece+0.026
en+0.025
inch+0.025
Neg logit contributions
P-0.020
 cherry-0.018
 baker-0.018
ww-0.018
unt-0.018
Token and
Token ID290
Feature activation-1.30e+00

Pos logit contributions
der+0.012
en+0.009
 before+0.009
ilst+0.009
iece+0.009
Neg logit contributions
P-0.012
unt-0.011
ww-0.011
 baker-0.011
 cherry-0.011
Token wait
Token ID4043
Feature activation-7.88e-01

Pos logit contributions
der+0.039
ilst+0.031
en+0.031
iece+0.030
inch+0.030
Neg logit contributions
P-0.026
 cherry-0.024
 baker-0.024
ww-0.024
unt-0.024
Token so
Token ID523
Feature activation+9.85e-01

Pos logit contributions
der+0.014
en+0.011
 before+0.011
ilst+0.011
iece+0.011
Neg logit contributions
P-0.008
unt-0.007
ww-0.007
 baker-0.007
 cherry-0.007
Token big
Token ID1263
Feature activation-5.14e-01

Pos logit contributions
P+0.014
unt+0.013
 baker+0.012
Elizabeth+0.012
ww+0.012
Neg logit contributions
der-0.035
 before-0.028
en-0.028
ipples-0.028
 himself-0.028
Token and
Token ID290
Feature activation-9.36e-02

Pos logit contributions
der+0.014
inch+0.013
ussed+0.013
ilst+0.013
ipples+0.013
Neg logit contributions
P-0.009
unt-0.009
 good-0.009
ww-0.009
 long-0.009
Token unknown
Token ID6439
Feature activation-2.89e+00

Pos logit contributions
der+0.003
 before+0.003
en+0.002
 since+0.002
 himself+0.002
Neg logit contributions
P-0.002
unt-0.001
 baker-0.001
ww-0.001
Elizabeth-0.001
Token to
Token ID284
Feature activation-3.02e-01

Pos logit contributions
der+0.070
ilst+0.064
ussed+0.060
inch+0.060
ility+0.058
Neg logit contributions
 pastry-0.062
 good-0.061
 cherry-0.061
 long-0.060
P-0.060
Token her
Token ID607
Feature activation-2.71e+00

Pos logit contributions
der+0.009
en+0.007
 before+0.007
ilst+0.007
iece+0.007
Neg logit contributions
P-0.006
unt-0.006
ww-0.006
 baker-0.005
 cherry-0.005
Token.
Token ID13
Feature activation+3.01e-01

Pos logit contributions
der+0.072
ilst+0.059
ility+0.056
ussed+0.056
ock+0.056
Neg logit contributions
 good-0.060
 cherry-0.059
 long-0.058
 taxi-0.057
 high-0.056
Token But
Token ID887
Feature activation-2.83e+00

Pos logit contributions
P+0.007
ww+0.006
unt+0.006
 baker+0.006
Elizabeth+0.006
Neg logit contributions
der-0.009
 before-0.007
en-0.007
ilst-0.007
iece-0.007
Token then
Token ID788
Feature activation+8.21e-01

Pos logit contributions
der+0.064
 before+0.048
en+0.045
iece+0.044
ilst+0.043
Neg logit contributions
P-0.081
unt-0.076
ww-0.076
 baker-0.073
Elizabeth-0.071
Token,
Token ID11
Feature activation-5.90e-02

Pos logit contributions
P+0.022
 baker+0.021
unt+0.020
 bakery+0.020
ww+0.019
Neg logit contributions
der-0.018
 before-0.016
en-0.014
 since-0.014
 when-0.014
Token the
Token ID262
Feature activation-1.41e+00

Pos logit contributions
der+0.002
 before+0.001
en+0.001
ilst+0.001
iece+0.001
Neg logit contributions
P-0.001
unt-0.001
ww-0.001
 baker-0.001
Elizabeth-0.001
Token But
Token ID887
Feature activation-3.29e+00

Pos logit contributions
der+0.009
en+0.007
 before+0.007
ilst+0.007
iece+0.007
Neg logit contributions
P-0.007
ww-0.006
unt-0.006
 baker-0.006
 cherry-0.006
Token today
Token ID1909
Feature activation-1.10e+00

Pos logit contributions
der+0.089
en+0.067
 before+0.066
ilst+0.065
iece+0.064
Neg logit contributions
P-0.079
unt-0.074
ww-0.072
 baker-0.071
 cherry-0.070
Token,
Token ID11
Feature activation-3.23e-01

Pos logit contributions
der+0.036
en+0.028
ilst+0.028
iece+0.028
 before+0.028
Neg logit contributions
P-0.020
unt-0.019
ww-0.018
 baker-0.018
 cherry-0.018
Token Tuesday
Token ID3431
Feature activation-7.69e-02

Pos logit contributions
der+0.010
en+0.008
ilst+0.008
inch+0.008
 before+0.008
Neg logit contributions
P-0.006
unt-0.006
ww-0.006
 cherry-0.006
 baker-0.006
Token wasn
Token ID2492
Feature activation-4.08e+00

Pos logit contributions
der+0.002
 before+0.002
en+0.002
ilst+0.002
 since+0.002
Neg logit contributions
P-0.002
 baker-0.002
 cherry-0.002
ww-0.002
unt-0.002
Token't
Token ID470
Feature activation-2.70e+00

Pos logit contributions
der+0.150
ities+0.129
iece+0.128
en+0.125
ipples+0.122
Neg logit contributions
unt-0.059
P-0.056
 cherry-0.055
ww-0.054
 long-0.050
Token going
Token ID1016
Feature activation+1.05e+00

Pos logit contributions
der+0.084
iece+0.065
inch+0.064
ities+0.064
en+0.064
Neg logit contributions
P-0.053
unt-0.051
 cherry-0.050
ww-0.050
 good-0.049
Token to
Token ID284
Feature activation-2.05e+00

Pos logit contributions
P+0.023
unt+0.021
ww+0.021
Elizabeth+0.020
 baker+0.020
Neg logit contributions
der-0.031
 before-0.027
en-0.025
 himself-0.023
icles-0.023
Token be
Token ID307
Feature activation-1.42e+00

Pos logit contributions
der+0.070
iece+0.055
rified+0.054
ilst+0.054
ities+0.054
Neg logit contributions
 cherry-0.036
 good-0.034
 pastry-0.031
 taxi-0.031
 long-0.031
Token so
Token ID523
Feature activation+4.50e-01

Pos logit contributions
der+0.042
en+0.033
 before+0.032
ilst+0.032
iece+0.031
Neg logit contributions
P-0.030
unt-0.028
ww-0.028
 baker-0.027
 cherry-0.027
Token fun
Token ID1257
Feature activation-5.26e-01

Pos logit contributions
P+0.008
unt+0.007
Elizabeth+0.006
ww+0.006
Michelle+0.006
Neg logit contributions
der-0.015
 before-0.013
en-0.012
ilst-0.012
inch-0.012
Token his
Token ID465
Feature activation-2.15e+00

Pos logit contributions
P+0.045
unt+0.041
ww+0.040
 cherry+0.039
 baker+0.039
Neg logit contributions
der-0.067
en-0.052
ilst-0.051
 before-0.050
ities-0.050
Token new
Token ID649
Feature activation+3.11e-01

Pos logit contributions
der+0.073
 before+0.059
en+0.059
ilst+0.057
inch+0.056
Neg logit contributions
P-0.036
unt-0.032
ww-0.032
 baker-0.031
Elizabeth-0.030
Token board
Token ID3096
Feature activation+4.50e-02

Pos logit contributions
P+0.006
unt+0.005
ww+0.005
Elizabeth+0.005
 baker+0.005
Neg logit contributions
der-0.010
en-0.008
 before-0.008
ilst-0.008
iece-0.008
Token he
Token ID339
Feature activation-1.26e+00

Pos logit contributions
P+0.001
unt+0.001
ww+0.001
 long+0.001
 contest+0.001
Neg logit contributions
der-0.001
ities-0.001
en-0.001
ility-0.001
ipples-0.001
Token could
Token ID714
Feature activation-8.52e-01

Pos logit contributions
der+0.036
en+0.028
ilst+0.027
ities+0.027
iece+0.027
Neg logit contributions
P-0.028
unt-0.026
ww-0.025
 baker-0.024
 cherry-0.024
Token hardly
Token ID8941
Feature activation-7.50e+00

Pos logit contributions
der+0.031
ities+0.026
ility+0.025
icated+0.025
rified+0.025
Neg logit contributions
 long-0.012
 high-0.012
P-0.011
 ladder-0.011
 taxi-0.011
Token contain
Token ID3994
Feature activation-3.47e-01

Pos logit contributions
der+0.249
ities+0.197
en+0.196
icking+0.185
inch+0.185
Neg logit contributions
 long-0.128
P-0.123
 cherry-0.122
 high-0.122
 taxi-0.120
Token himself
Token ID2241
Feature activation+6.61e-02

Pos logit contributions
der+0.010
ipples+0.008
rified+0.008
places+0.008
icking+0.008
Neg logit contributions
 good-0.008
P-0.007
 long-0.007
 high-0.007
 big-0.007
Token.
Token ID13
Feature activation-1.09e+00

Pos logit contributions
P+0.002
 long+0.002
 good+0.002
 high+0.001
 taxi+0.001
Neg logit contributions
der-0.002
ilst-0.001
icles-0.001
ility-0.001
en-0.001
Token His
Token ID2399
Feature activation-2.27e+00

Pos logit contributions
der+0.035
en+0.028
ities+0.027
iece+0.027
ilst+0.027
Neg logit contributions
P-0.021
unt-0.019
 baker-0.018
 cherry-0.018
ww-0.018
Token dad
Token ID9955
Feature activation+2.24e+00

Pos logit contributions
der+0.075
en+0.060
 before+0.059
ilst+0.059
iece+0.058
Neg logit contributions
P-0.041
unt-0.037
ww-0.036
 baker-0.035
 cherry-0.034
Token wall
Token ID3355
Feature activation-3.45e+00

Pos logit contributions
der+0.055
 before+0.045
iece+0.041
en+0.040
ilst+0.040
Neg logit contributions
P-0.055
ww-0.051
unt-0.049
Ell-0.047
Elizabeth-0.047
Token.
Token ID13
Feature activation-5.53e-01

Pos logit contributions
der+0.079
inch+0.062
en+0.059
ities+0.058
ilst+0.058
Neg logit contributions
P-0.088
unt-0.087
 cherry-0.084
 pastry-0.083
 baker-0.083
Token It
Token ID632
Feature activation-1.38e+00

Pos logit contributions
der+0.017
en+0.013
 before+0.013
ilst+0.012
iece+0.012
Neg logit contributions
P-0.012
unt-0.011
ww-0.011
 baker-0.010
 cherry-0.010
Token said
Token ID531
Feature activation-1.31e+00

Pos logit contributions
der+0.028
 before+0.020
en+0.020
ilst+0.018
 since+0.017
Neg logit contributions
P-0.042
 baker-0.038
ww-0.038
 cherry-0.037
 pastry-0.037
Token "
Token ID366
Feature activation-7.52e-01

Pos logit contributions
der+0.035
 before+0.027
en+0.024
 himself+0.024
iece+0.024
Neg logit contributions
P-0.035
ww-0.030
Elizabeth-0.029
unt-0.029
www-0.028
TokenDO
Token ID18227
Feature activation-7.50e+00

Pos logit contributions
der+0.022
en+0.018
 before+0.018
ilst+0.018
iece+0.017
Neg logit contributions
P-0.014
unt-0.014
 baker-0.014
 pastry-0.013
 cherry-0.013
Token NOT
Token ID5626
Feature activation-7.25e-01

Pos logit contributions
der+0.216
en+0.167
 before+0.163
ilst+0.161
iece+0.160
Neg logit contributions
P-0.166
unt-0.154
ww-0.151
 cherry-0.146
 baker-0.145
Token T
Token ID309
Feature activation-2.93e+00

Pos logit contributions
der+0.020
en+0.016
 before+0.015
ilst+0.015
iece+0.015
Neg logit contributions
P-0.017
unt-0.015
ww-0.015
 baker-0.015
 cherry-0.014
TokenOU
Token ID2606
Feature activation-2.04e-01

Pos logit contributions
der+0.080
en+0.061
 before+0.061
iece+0.059
ilst+0.059
Neg logit contributions
P-0.067
unt-0.062
ww-0.062
 baker-0.061
 lawyer-0.059
TokenCH
Token ID3398
Feature activation-1.79e+00

Pos logit contributions
der+0.005
 before+0.005
ilst+0.004
en+0.004
ussed+0.004
Neg logit contributions
 baker-0.004
P-0.004
unt-0.004
 cherry-0.004
 pastry-0.004
Token".
Token ID1911
Feature activation-1.16e+00

Pos logit contributions
der+0.045
en+0.033
 before+0.033
ilst+0.032
iece+0.032
Neg logit contributions
P-0.046
unt-0.043
ww-0.042
 baker-0.041
 cherry-0.041