TOP ACTIVATIONS
MAX = 0.136

loved to play make - bel ieve . Today she was
liked to play make - bel ieve . One
- up and make - bel ieve . One day ,
red cape and make - bel ieve he had super powers
a world of make - bel ieve . She
He loved playing make - bel ieve with his friends .
and they played make - bel ieve . Finally , they
was ready for make - bel ieve fun . <|endoftext|> Once
he was playing make - bel ieve at home , too
John began to make - bel ieve he was a hero
" Please keep your seat bel ts on . We will
He would race make - bel ieve cars around the track
Finally , her make - bel ieve adventures were recorded .
She told him how she se wed , glued , and
a needle and thread and se wed Teddy 's eye back
a needle and thread and se wed a patch on Patch
of waves and wind and se ag ull s .
the elderly man on a stret cher . She thanked Tim
She cut and glued and se wed different fabrics together .
matter how small or how big , it is important to

MIN ACTIVATIONS
MIN = -0.080

, cats , birds , ham sters , and more .
Patty 's bedroom was decorated perfectly ! <|endoftext|> One day ,
. She looked at the ham ster and saw that he
. Andy stood perfectly still for the entire duration
worked ! The door opened easily . He was free !
It was hiding under a seaw eed . Lily said ,
it come out of the seaw eed and walk around the
pet - a cute little ham ster . Lily was so
tug . It came out easily , and she held it
rod . It came off easily . It fell on the
The dentist put a b ib around her neck . He
. He runs to the candy store . L
was hiding under a seaw eed . Lily said , "
it on . It fit perfectly and he found it so
so with a burst of energy , she ran to the
. He sat up stra ighter and waited for the cat
the fancy suit was wrapped perfectly . Kat
and Sue ran to the candy store and bought some candy
the fridge . It fit perfectly ! The little girl was
attractive sandwich with cheese , ham , and lettuce . Tim

INTERVAL 0.082 - 0.136
CONTAINS 0.001%

a needle and thread and se wed a patch on Patch
liked to play make - bel ieve . One
watched as his mom carefully se wed the stitch in the
She told him how she se wed , glued , and
thread and a needle and se wed up the tear in

INTERVAL 0.028 - 0.082
CONTAINS 3.159%

our snow man is smiling !" Anna said .
couldn 't believe what he was seeing . He wondered if
But only a little bit ." Ben said .
Lily . He realized his car was broken and old too
a lab where Lily 's mom worked . It was a

INTERVAL -0.026 - 0.028
CONTAINS 93.377%

Sandy noticed something delicious in the grass - butter . She
That is a mug . It is disgusting . Someone left
to Mia 's house . It was pink and shiny and
to the fountain . She saw the bee and smiled .
wash their hair . They say , " No , no

INTERVAL -0.080 - -0.026
CONTAINS 3.463%

was a bright , shining necklace . The girl
toys . You have to share and respect them . Now
course you can ! Just use your hands and make a
little girl felt so com fy and happy . The t
some more and saw their own happy faces in the p
Token loved
Token ID6151
Feature activation-1.70e-02

Pos logit contributions
 coc+0.019
T+0.016
una+0.016
R+0.014
 and+0.014
Neg logit contributions
 blame-0.032
date-0.031
ivated-0.031
eks-0.030
pieces-0.030
Token to
Token ID284
Feature activation-2.29e-02

Pos logit contributions
 coc+0.026
T+0.025
una+0.024
R+0.023
Ho+0.023
Neg logit contributions
 blame-0.025
 fight-0.024
date-0.024
pieces-0.023
 measured-0.023
Token play
Token ID711
Feature activation-8.50e-03

Pos logit contributions
 coc+0.041
 and+0.038
T+0.038
una+0.036
 Lizard+0.034
Neg logit contributions
ivated-0.028
 blame-0.028
eks-0.028
cks-0.028
date-0.028
Token make
Token ID787
Feature activation+3.75e-03

Pos logit contributions
 coc+0.012
T+0.011
una+0.010
 and+0.010
R+0.010
Neg logit contributions
 blame-0.014
date-0.013
eks-0.013
ivated-0.013
pieces-0.013
Token-
Token ID12
Feature activation+5.00e-02

Pos logit contributions
 blame+0.007
date+0.006
pieces+0.006
ivated+0.006
eks+0.006
Neg logit contributions
 coc-0.005
T-0.004
una-0.004
R-0.004
 Lizard-0.004
Tokenbel
Token ID6667
Feature activation+1.36e-01

Pos logit contributions
 blame+0.059
cks+0.055
ivated+0.053
 blamed+0.052
 begs+0.052
Neg logit contributions
T-0.095
 coc-0.093
L-0.086
una-0.086
 and-0.085
Tokenieve
Token ID12311
Feature activation-2.21e-03

Pos logit contributions
ishly+0.149
 blame+0.145
ays+0.143
ican+0.142
date+0.140
Neg logit contributions
T-0.247
 and-0.240
 coc-0.237
una-0.232
"-0.224
Token.
Token ID13
Feature activation-3.94e-03

Pos logit contributions
 coc+0.003
T+0.002
una+0.002
R+0.002
die+0.002
Neg logit contributions
 blame-0.004
date-0.004
pieces-0.004
ivated-0.004
eks-0.004
Token Today
Token ID6288
Feature activation-9.39e-03

Pos logit contributions
 coc+0.006
T+0.006
 and+0.005
una+0.005
 Lizard+0.005
Neg logit contributions
 blame-0.006
eks-0.006
date-0.006
ivated-0.006
cks-0.006
Token she
Token ID673
Feature activation+3.35e-03

Pos logit contributions
 coc+0.008
T+0.007
una+0.007
 and+0.007
R+0.006
Neg logit contributions
 blame-0.020
date-0.020
ivated-0.019
eks-0.019
pieces-0.019
Token was
Token ID373
Feature activation+8.21e-03

Pos logit contributions
 blame+0.006
ivated+0.006
cks+0.006
date+0.006
pieces+0.006
Neg logit contributions
 coc-0.004
 and-0.003
T-0.003
una-0.003
die-0.003
Token liked
Token ID8288
Feature activation-4.90e-03

Pos logit contributions
 coc+0.016
una+0.014
T+0.013
die+0.013
R+0.013
Neg logit contributions
 blame-0.016
ivated-0.015
 measured-0.015
date-0.015
eks-0.015
Token to
Token ID284
Feature activation-6.83e-03

Pos logit contributions
 coc+0.007
una+0.007
T+0.007
die+0.007
 Lizard+0.006
Neg logit contributions
 blame-0.007
 fight-0.007
date-0.007
ivated-0.007
pieces-0.007
Token play
Token ID711
Feature activation+3.10e-03

Pos logit contributions
 coc+0.011
 and+0.010
T+0.010
una+0.009
 little+0.009
Neg logit contributions
ays-0.010
cks-0.010
ivated-0.010
cking-0.010
pieces-0.010
Token make
Token ID787
Feature activation+9.39e-03

Pos logit contributions
 blame+0.006
eks+0.005
pieces+0.005
cks+0.005
ivated+0.005
Neg logit contributions
 coc-0.004
 and-0.004
T-0.004
una-0.003
R-0.003
Token-
Token ID12
Feature activation+4.72e-02

Pos logit contributions
 blame+0.017
date+0.017
pieces+0.017
eks+0.017
ivated+0.017
Neg logit contributions
 coc-0.011
T-0.010
una-0.009
die-0.009
R-0.009
Tokenbel
Token ID6667
Feature activation+1.33e-01

Pos logit contributions
cks+0.045
 blame+0.044
ays+0.043
 begs+0.040
 blamed+0.039
Neg logit contributions
T-0.099
 coc-0.096
L-0.091
 and-0.089
una-0.088
Tokenieve
Token ID12311
Feature activation+7.89e-03

Pos logit contributions
 blame+0.125
date+0.124
ays+0.121
ishly+0.117
ican+0.116
Neg logit contributions
T-0.259
 and-0.257
 coc-0.255
una-0.245
"-0.235
Token.
Token ID13
Feature activation+1.57e-02

Pos logit contributions
date+0.013
 blame+0.013
pieces+0.013
eks+0.012
ivated+0.012
Neg logit contributions
 coc-0.011
T-0.010
una-0.009
die-0.009
R-0.009
Token
Token ID198
Feature activation+1.91e-04

Pos logit contributions
 blame+0.016
date+0.016
ivated+0.015
pieces+0.015
eks+0.014
Neg logit contributions
 coc-0.031
T-0.029
una-0.029
R-0.028
die-0.027
Token
Token ID198
Feature activation+1.69e-04

Pos logit contributions
date+0.000
 blame+0.000
ivated+0.000
eks+0.000
pieces+0.000
Neg logit contributions
 coc-0.000
T-0.000
una-0.000
die-0.000
 Lizard-0.000
TokenOne
Token ID3198
Feature activation-3.75e-03

Pos logit contributions
date+0.000
ivated+0.000
ued+0.000
 blame+0.000
roads+0.000
Neg logit contributions
 coc-0.000
una-0.000
 Lizard-0.000
die-0.000
T-0.000
Token-
Token ID12
Feature activation+4.32e-02

Pos logit contributions
date+0.006
 blame+0.006
pieces+0.006
ivated+0.006
eks+0.005
Neg logit contributions
 coc-0.004
T-0.003
die-0.003
una-0.003
 Lizard-0.003
Tokenup
Token ID929
Feature activation+1.27e-02

Pos logit contributions
 blame+0.049
date+0.047
eks+0.046
ivated+0.046
pieces+0.046
Neg logit contributions
 coc-0.082
T-0.076
una-0.075
 and-0.073
R-0.072
Token and
Token ID290
Feature activation-3.82e-03

Pos logit contributions
 blame+0.026
date+0.025
eks+0.025
ivated+0.025
pieces+0.025
Neg logit contributions
 coc-0.013
T-0.011
una-0.010
 and-0.010
R-0.009
Token make
Token ID787
Feature activation+1.14e-02

Pos logit contributions
 coc+0.007
T+0.006
 and+0.006
una+0.006
die+0.006
Neg logit contributions
 blame-0.005
date-0.005
ivated-0.005
eks-0.005
cks-0.004
Token-
Token ID12
Feature activation+5.12e-02

Pos logit contributions
 blame+0.021
date+0.020
ivated+0.020
eks+0.020
pieces+0.020
Neg logit contributions
 coc-0.014
T-0.013
una-0.012
 and-0.012
R-0.011
Tokenbel
Token ID6667
Feature activation+1.32e-01

Pos logit contributions
 blame+0.061
cks+0.058
ivated+0.056
date+0.054
 fight+0.053
Neg logit contributions
T-0.095
 coc-0.093
L-0.086
 and-0.086
una-0.085
Tokenieve
Token ID12311
Feature activation+1.25e-02

Pos logit contributions
 blame+0.126
date+0.124
ishly+0.119
ays+0.119
ican+0.118
Neg logit contributions
T-0.259
 coc-0.255
 and-0.250
una-0.243
"-0.231
Token.
Token ID13
Feature activation+8.18e-03

Pos logit contributions
 blame+0.021
date+0.021
pieces+0.021
eks+0.021
ivated+0.020
Neg logit contributions
 coc-0.016
T-0.015
una-0.014
die-0.014
R-0.014
Token One
Token ID1881
Feature activation-1.37e-03

Pos logit contributions
 blame+0.009
date+0.009
pieces+0.008
ivated+0.008
eks+0.008
Neg logit contributions
 coc-0.016
T-0.014
una-0.014
R-0.014
die-0.014
Token day
Token ID1110
Feature activation+1.26e-02

Pos logit contributions
 coc+0.002
die+0.002
T+0.002
Ho+0.002
R+0.002
Neg logit contributions
date-0.002
 fight-0.002
 measured-0.002
 blame-0.002
ci-0.002
Token,
Token ID11
Feature activation+6.03e-03

Pos logit contributions
date+0.020
 blame+0.019
 fight+0.019
ivated+0.019
 measured+0.018
Neg logit contributions
 coc-0.018
die-0.017
una-0.017
oby-0.015
T-0.015
Token red
Token ID2266
Feature activation+2.37e-02

Pos logit contributions
 blame+0.038
date+0.037
eks+0.037
 fight+0.036
pieces+0.036
Neg logit contributions
 coc-0.033
T-0.031
una-0.029
R-0.028
aste-0.028
Token cape
Token ID36023
Feature activation+1.11e-02

Pos logit contributions
date+0.038
 blame+0.037
eks+0.036
 fight+0.036
pieces+0.035
Neg logit contributions
 coc-0.033
T-0.030
una-0.029
R-0.028
mber-0.028
Token and
Token ID290
Feature activation-3.76e-03

Pos logit contributions
date+0.024
 blame+0.023
ivated+0.023
eks+0.023
pieces+0.023
Neg logit contributions
 coc-0.010
T-0.008
una-0.008
R-0.007
die-0.007
Token make
Token ID787
Feature activation+1.10e-02

Pos logit contributions
 coc+0.006
una+0.005
T+0.005
R+0.005
mber+0.005
Neg logit contributions
 blame-0.006
eks-0.005
 fight-0.005
 measured-0.005
pieces-0.005
Token-
Token ID12
Feature activation+4.61e-02

Pos logit contributions
 blame+0.020
date+0.020
ivated+0.020
eks+0.019
pieces+0.019
Neg logit contributions
 coc-0.013
T-0.012
una-0.012
 and-0.011
R-0.011
Tokenbel
Token ID6667
Feature activation+1.30e-01

Pos logit contributions
 blame+0.031
cks+0.028
ivated+0.027
date+0.027
ays+0.026
Neg logit contributions
T-0.109
 coc-0.107
 and-0.101
una-0.100
R-0.099
Tokenieve
Token ID12311
Feature activation+1.13e-02

Pos logit contributions
 blame+0.102
date+0.102
ays+0.093
ican+0.091
 fight+0.086
Neg logit contributions
 coc-0.280
T-0.279
 and-0.270
una-0.265
R-0.253
Token he
Token ID339
Feature activation+2.47e-02

Pos logit contributions
date+0.018
 blame+0.018
pieces+0.018
cks+0.018
eks+0.018
Neg logit contributions
 coc-0.015
T-0.014
una-0.013
die-0.013
R-0.013
Token had
Token ID550
Feature activation+5.99e-03

Pos logit contributions
 blame+0.046
date+0.046
ivated+0.046
pieces+0.044
eks+0.044
Neg logit contributions
 coc-0.028
 and-0.026
T-0.025
una-0.024
 Lizard-0.023
Token super
Token ID2208
Feature activation+2.10e-02

Pos logit contributions
 blame+0.010
date+0.010
eks+0.010
pieces+0.010
ivated+0.010
Neg logit contributions
 coc-0.008
T-0.007
una-0.007
die-0.006
R-0.006
Tokenpowers
Token ID30132
Feature activation+1.27e-02

Pos logit contributions
 blame+0.042
date+0.041
ivated+0.040
eks+0.040
pieces+0.040
Neg logit contributions
 coc-0.022
T-0.019
una-0.018
 and-0.018
R-0.017
Token a
Token ID257
Feature activation+7.29e-03

Pos logit contributions
 blame+0.026
 fight+0.026
date+0.025
pieces+0.024
eks+0.024
Neg logit contributions
una-0.025
mber-0.025
 coc-0.025
die-0.025
Ho-0.024
Token world
Token ID995
Feature activation-1.52e-02

Pos logit contributions
date+0.013
 fight+0.012
pieces+0.012
 blame+0.012
 measured+0.011
Neg logit contributions
una-0.009
 coc-0.009
T-0.009
aste-0.008
oby-0.008
Token of
Token ID286
Feature activation+1.11e-02

Pos logit contributions
 coc+0.022
T+0.020
una+0.019
die+0.019
R+0.019
Neg logit contributions
date-0.024
 blame-0.023
pieces-0.023
ivated-0.023
roads-0.023
Token make
Token ID787
Feature activation+1.28e-02

Pos logit contributions
 blame+0.018
 fight+0.018
pieces+0.017
eks+0.016
 measured+0.016
Neg logit contributions
T-0.015
die-0.014
mber-0.014
una-0.014
 coc-0.013
Token-
Token ID12
Feature activation+4.45e-02

Pos logit contributions
 blame+0.024
date+0.023
ivated+0.023
eks+0.023
pieces+0.023
Neg logit contributions
 coc-0.015
T-0.013
una-0.013
 and-0.012
R-0.012
Tokenbel
Token ID6667
Feature activation+1.29e-01

Pos logit contributions
 blame+0.041
cks+0.038
date+0.038
ays+0.036
ivated+0.035
Neg logit contributions
 coc-0.093
T-0.091
 and-0.086
una-0.085
R-0.083
Tokenieve
Token ID12311
Feature activation+4.34e-03

Pos logit contributions
ays+0.114
 blame+0.113
date+0.112
ican+0.110
ishly+0.105
Neg logit contributions
T-0.256
 and-0.254
 coc-0.252
una-0.242
"-0.235
Token.
Token ID13
Feature activation+7.93e-03

Pos logit contributions
date+0.007
pieces+0.007
 blame+0.007
 fight+0.007
eks+0.007
Neg logit contributions
 coc-0.006
T-0.005
una-0.005
R-0.005
Ho-0.005
Token
Token ID198
Feature activation+3.83e-04

Pos logit contributions
 blame+0.011
pieces+0.011
date+0.010
ivated+0.010
eks+0.010
Neg logit contributions
 coc-0.013
una-0.012
T-0.012
R-0.011
die-0.011
Token
Token ID198
Feature activation+1.16e-03

Pos logit contributions
 blame+0.001
date+0.001
eks+0.001
ivated+0.001
pieces+0.001
Neg logit contributions
 coc-0.000
T-0.000
una-0.000
die-0.000
aste-0.000
TokenShe
Token ID3347
Feature activation+4.47e-03

Pos logit contributions
ivated+0.002
date+0.002
 blame+0.002
ued+0.002
pieces+0.002
Neg logit contributions
 coc-0.002
die-0.001
una-0.001
 Lizard-0.001
 skull-0.001
Token He
Token ID679
Feature activation+7.67e-04

Pos logit contributions
 coc+0.005
 and+0.004
T+0.004
die+0.004
una+0.004
Neg logit contributions
 blame-0.009
eks-0.009
date-0.009
ivated-0.009
pieces-0.009
Token loved
Token ID6151
Feature activation-1.18e-02

Pos logit contributions
 blame+0.001
date+0.001
ivated+0.001
pieces+0.001
eks+0.001
Neg logit contributions
 coc-0.001
T-0.001
una-0.001
 and-0.001
R-0.001
Token playing
Token ID2712
Feature activation+2.16e-04

Pos logit contributions
 coc+0.017
T+0.015
una+0.015
R+0.014
 Lizard+0.014
Neg logit contributions
 blame-0.019
date-0.018
pieces-0.018
 fight-0.018
eks-0.018
Token make
Token ID787
Feature activation-2.62e-04

Pos logit contributions
 blame+0.000
eks+0.000
date+0.000
ivated+0.000
pieces+0.000
Neg logit contributions
 coc-0.000
T-0.000
una-0.000
 and-0.000
R-0.000
Token-
Token ID12
Feature activation+4.46e-02

Pos logit contributions
 coc+0.000
T+0.000
una+0.000
 and+0.000
R+0.000
Neg logit contributions
 blame-0.000
date-0.000
ivated-0.000
eks-0.000
pieces-0.000
Tokenbel
Token ID6667
Feature activation+1.28e-01

Pos logit contributions
 blame+0.042
cks+0.041
ays+0.038
eks+0.037
ivated+0.037
Neg logit contributions
T-0.093
 coc-0.088
 and-0.084
L-0.084
una-0.083
Tokenieve
Token ID12311
Feature activation+1.51e-04

Pos logit contributions
ays+0.124
 blame+0.122
ican+0.121
date+0.120
ishly+0.118
Neg logit contributions
T-0.245
 and-0.243
 coc-0.237
una-0.227
"-0.225
Token with
Token ID351
Feature activation+6.79e-03

Pos logit contributions
 blame+0.000
date+0.000
pieces+0.000
eks+0.000
ivated+0.000
Neg logit contributions
 coc-0.000
T-0.000
una-0.000
R-0.000
die-0.000
Token his
Token ID465
Feature activation+1.96e-02

Pos logit contributions
 blame+0.012
date+0.012
pieces+0.012
eks+0.012
ivated+0.012
Neg logit contributions
 coc-0.008
T-0.007
una-0.007
R-0.007
die-0.007
Token friends
Token ID2460
Feature activation+2.08e-02

Pos logit contributions
 blame+0.034
date+0.033
pieces+0.032
 fight+0.032
eks+0.031
Neg logit contributions
 coc-0.025
T-0.024
una-0.023
R-0.022
die-0.021
Token.
Token ID13
Feature activation+1.69e-03

Pos logit contributions
 blame+0.038
eks+0.037
date+0.037
ivated+0.037
pieces+0.036
Neg logit contributions
 coc-0.025
T-0.022
 and-0.021
una-0.021
 Lizard-0.020
Token and
Token ID290
Feature activation+4.01e-03

Pos logit contributions
 coc+0.025
T+0.022
una+0.021
R+0.020
die+0.020
Neg logit contributions
 blame-0.040
date-0.039
pieces-0.038
ivated-0.038
eks-0.038
Token they
Token ID484
Feature activation+3.02e-02

Pos logit contributions
 blame+0.006
pieces+0.006
date+0.006
eks+0.006
 fight+0.006
Neg logit contributions
 coc-0.006
T-0.005
una-0.005
R-0.005
 Lizard-0.005
Token played
Token ID2826
Feature activation+3.69e-03

Pos logit contributions
date+0.056
cks+0.056
ivated+0.055
 blame+0.054
pieces+0.054
Neg logit contributions
 and-0.035
 coc-0.035
T-0.032
 since-0.030
una-0.027
Token make
Token ID787
Feature activation+8.15e-03

Pos logit contributions
 blame+0.006
date+0.005
ivated+0.005
eks+0.005
pieces+0.005
Neg logit contributions
 coc-0.006
T-0.005
una-0.005
 and-0.005
R-0.005
Token-
Token ID12
Feature activation+4.31e-02

Pos logit contributions
 blame+0.013
date+0.012
pieces+0.012
eks+0.012
ivated+0.012
Neg logit contributions
 coc-0.012
T-0.011
una-0.010
R-0.010
die-0.010
Tokenbel
Token ID6667
Feature activation+1.26e-01

Pos logit contributions
 blame+0.030
cks+0.028
eks+0.026
ivated+0.025
ays+0.024
Neg logit contributions
T-0.101
 coc-0.096
 and-0.093
una-0.092
L-0.091
Tokenieve
Token ID12311
Feature activation+5.36e-03

Pos logit contributions
 blame+0.117
ays+0.114
ican+0.113
date+0.112
ishly+0.107
Neg logit contributions
T-0.247
 and-0.246
 coc-0.240
una-0.229
"-0.224
Token.
Token ID13
Feature activation+1.03e-02

Pos logit contributions
 blame+0.009
date+0.009
pieces+0.009
eks+0.008
ivated+0.008
Neg logit contributions
 coc-0.007
T-0.007
una-0.006
R-0.006
die-0.006
Token Finally
Token ID9461
Feature activation-2.23e-03

Pos logit contributions
ays+0.016
date+0.015
 blame+0.015
pieces+0.015
ivated+0.015
Neg logit contributions
 coc-0.014
T-0.013
 Lizard-0.012
R-0.012
 since-0.012
Token,
Token ID11
Feature activation+7.08e-04

Pos logit contributions
 coc+0.002
T+0.002
die+0.002
una+0.002
aste+0.002
Neg logit contributions
 blame-0.004
date-0.004
ivated-0.004
 fight-0.004
eks-0.004
Token they
Token ID484
Feature activation+1.46e-02

Pos logit contributions
 blame+0.001
date+0.001
ivated+0.001
pieces+0.001
eks+0.001
Neg logit contributions
 coc-0.001
T-0.001
una-0.001
die-0.001
 Lizard-0.001
Token was
Token ID373
Feature activation-3.51e-03

Pos logit contributions
 coc+0.013
T+0.012
 and+0.011
una+0.011
die+0.011
Neg logit contributions
 blame-0.020
date-0.019
ivated-0.019
eks-0.019
pieces-0.019
Token ready
Token ID3492
Feature activation-3.37e-02

Pos logit contributions
 coc+0.004
 and+0.004
T+0.004
una+0.004
die+0.004
Neg logit contributions
 blame-0.007
eks-0.006
date-0.006
ivated-0.006
pieces-0.006
Token for
Token ID329
Feature activation-1.91e-02

Pos logit contributions
 coc+0.041
T+0.037
una+0.035
R+0.034
 Lizard+0.033
Neg logit contributions
 blame-0.060
date-0.059
pieces-0.058
ivated-0.058
eks-0.058
Token make
Token ID787
Feature activation+2.11e-03

Pos logit contributions
 coc+0.024
T+0.022
una+0.021
die+0.020
R+0.020
Neg logit contributions
 blame-0.033
date-0.032
pieces-0.032
eks-0.032
cks-0.031
Token-
Token ID12
Feature activation+3.86e-02

Pos logit contributions
 blame+0.004
ivated+0.004
date+0.004
eks+0.004
pieces+0.004
Neg logit contributions
 coc-0.002
T-0.002
una-0.002
 and-0.002
die-0.002
Tokenbel
Token ID6667
Feature activation+1.25e-01

Pos logit contributions
 blame+0.052
cks+0.051
ays+0.050
ivated+0.048
eks+0.048
Neg logit contributions
T-0.064
 coc-0.059
una-0.058
 and-0.057
R-0.055
Tokenieve
Token ID12311
Feature activation-6.67e-03

Pos logit contributions
ays+0.124
 blame+0.123
date+0.121
ican+0.120
aved+0.118
Neg logit contributions
T-0.234
 and-0.233
 coc-0.231
una-0.227
"-0.219
Token fun
Token ID1257
Feature activation+3.46e-03

Pos logit contributions
 coc+0.007
T+0.007
una+0.006
 and+0.006
R+0.006
Neg logit contributions
 blame-0.013
date-0.012
ivated-0.012
eks-0.012
pieces-0.012
Token.
Token ID13
Feature activation+1.75e-04

Pos logit contributions
 blame+0.006
date+0.006
pieces+0.006
eks+0.006
ivated+0.006
Neg logit contributions
 coc-0.004
T-0.004
una-0.004
R-0.003
die-0.003
Token<|endoftext|>
Token ID50256
Feature activation-1.53e-02

Pos logit contributions
 blame+0.000
date+0.000
pieces+0.000
ivated+0.000
eks+0.000
Neg logit contributions
 coc-0.000
T-0.000
una-0.000
R-0.000
 Lizard-0.000
TokenOnce
Token ID7454
Feature activation+5.46e-03

Pos logit contributions
 coc+0.026
una+0.023
T+0.022
die+0.021
mber+0.021
Neg logit contributions
date-0.021
eks-0.020
 blame-0.019
cks-0.019
roads-0.019
Token he
Token ID339
Feature activation+1.22e-02

Pos logit contributions
 blame+0.003
ivated+0.003
date+0.003
pieces+0.003
 fight+0.003
Neg logit contributions
 coc-0.003
una-0.003
T-0.003
die-0.002
aste-0.002
Token was
Token ID373
Feature activation+1.96e-02

Pos logit contributions
 blame+0.018
date+0.018
eks+0.017
ivated+0.017
pieces+0.017
Neg logit contributions
 coc-0.019
T-0.017
una-0.017
 and-0.016
die-0.016
Token playing
Token ID2712
Feature activation+2.50e-03

Pos logit contributions
 blame+0.035
eks+0.033
pieces+0.033
date+0.033
cks+0.033
Neg logit contributions
 and-0.024
 coc-0.024
T-0.022
 little-0.021
una-0.021
Token make
Token ID787
Feature activation+6.11e-03

Pos logit contributions
date+0.004
 blame+0.004
ivated+0.004
pieces+0.004
eks+0.004
Neg logit contributions
 coc-0.003
T-0.003
una-0.003
die-0.003
R-0.003
Token-
Token ID12
Feature activation+4.00e-02

Pos logit contributions
 blame+0.011
date+0.011
pieces+0.010
eks+0.010
ivated+0.010
Neg logit contributions
 coc-0.008
T-0.007
una-0.007
R-0.006
 and-0.006
Tokenbel
Token ID6667
Feature activation+1.25e-01

Pos logit contributions
 blame+0.035
cks+0.034
eks+0.031
 blamed+0.030
ivated+0.029
Neg logit contributions
T-0.088
 coc-0.085
 and-0.080
una-0.079
L-0.079
Tokenieve
Token ID12311
Feature activation-1.52e-03

Pos logit contributions
ays+0.115
 blame+0.113
ican+0.110
date+0.109
 argue+0.106
Neg logit contributions
T-0.245
 and-0.244
 coc-0.235
"-0.224
una-0.223
Token at
Token ID379
Feature activation+2.60e-02

Pos logit contributions
 coc+0.002
T+0.002
una+0.002
R+0.002
die+0.002
Neg logit contributions
date-0.003
 blame-0.003
pieces-0.003
 fight-0.002
eks-0.002
Token home
Token ID1363
Feature activation-1.58e-02

Pos logit contributions
 blame+0.045
date+0.044
eks+0.044
pieces+0.044
cks+0.043
Neg logit contributions
 coc-0.033
T-0.030
una-0.029
R-0.027
die-0.027
Token,
Token ID11
Feature activation-6.01e-03

Pos logit contributions
 coc+0.023
una+0.020
T+0.020
die+0.019
R+0.019
Neg logit contributions
date-0.025
pieces-0.025
 blame-0.025
 fight-0.024
ivated-0.024
Token too
Token ID1165
Feature activation-1.97e-02

Pos logit contributions
 coc+0.006
una+0.005
T+0.005
R+0.005
mber+0.004
Neg logit contributions
 blame-0.012
 fight-0.012
pieces-0.012
date-0.012
 measured-0.012
Token John
Token ID1757
Feature activation-1.57e-02

Pos logit contributions
 coc+0.001
una+0.001
T+0.001
R+0.001
die+0.001
Neg logit contributions
 blame-0.001
date-0.001
ivated-0.001
pieces-0.001
 measured-0.001
Token began
Token ID2540
Feature activation-2.62e-02

Pos logit contributions
 coc+0.024
una+0.022
T+0.021
aste+0.020
R+0.019
Neg logit contributions
 measured-0.023
 blame-0.023
eks-0.022
ivated-0.022
date-0.021
Token to
Token ID284
Feature activation+2.84e-03

Pos logit contributions
 coc+0.036
T+0.032
una+0.032
R+0.030
 Lizard+0.029
Neg logit contributions
 blame-0.044
date-0.042
ivated-0.042
eks-0.041
pieces-0.041
Token make
Token ID787
Feature activation+2.97e-03

Pos logit contributions
 blame+0.003
date+0.003
eks+0.003
 fight+0.003
ivated+0.003
Neg logit contributions
 coc-0.005
una-0.005
T-0.005
R-0.005
mber-0.005
Token-
Token ID12
Feature activation+3.64e-02

Pos logit contributions
 blame+0.006
date+0.005
ivated+0.005
eks+0.005
pieces+0.005
Neg logit contributions
 coc-0.004
T-0.003
una-0.003
 and-0.003
R-0.003
Tokenbel
Token ID6667
Feature activation+1.24e-01

Pos logit contributions
cks+0.035
 blame+0.033
ays+0.032
ivated+0.029
 blamed+0.029
Neg logit contributions
T-0.076
 coc-0.074
R-0.070
 and-0.070
 since-0.068
Tokenieve
Token ID12311
Feature activation-5.89e-03

Pos logit contributions
date+0.106
ays+0.103
 blame+0.102
ican+0.095
aved+0.094
Neg logit contributions
 coc-0.253
T-0.251
 and-0.250
una-0.242
R-0.230
Token he
Token ID339
Feature activation+1.04e-02

Pos logit contributions
 coc+0.007
T+0.006
una+0.006
R+0.006
die+0.006
Neg logit contributions
date-0.011
 blame-0.011
pieces-0.010
 fight-0.010
cks-0.010
Token was
Token ID373
Feature activation+1.26e-02

Pos logit contributions
 blame+0.020
ivated+0.020
date+0.020
eks+0.019
cks+0.019
Neg logit contributions
 coc-0.012
 and-0.011
T-0.010
una-0.009
 since-0.009
Token a
Token ID257
Feature activation+3.08e-03

Pos logit contributions
 blame+0.024
ivated+0.023
eks+0.023
date+0.022
cks+0.022
Neg logit contributions
 coc-0.015
 and-0.014
T-0.013
 since-0.012
una-0.012
Token hero
Token ID4293
Feature activation+7.23e-04

Pos logit contributions
 fight+0.005
date+0.005
pieces+0.005
 blame+0.005
 competition+0.005
Neg logit contributions
una-0.004
 coc-0.004
T-0.004
aste-0.004
R-0.004
Token "
Token ID366
Feature activation+4.72e-03

Pos logit contributions
 coc+0.032
una+0.026
T+0.024
die+0.023
aste+0.022
Neg logit contributions
 blame-0.044
ivated-0.043
 measured-0.041
pieces-0.041
 fight-0.041
TokenPlease
Token ID5492
Feature activation-1.69e-02

Pos logit contributions
 blame+0.008
date+0.008
ivated+0.007
eks+0.007
pieces+0.007
Neg logit contributions
 coc-0.007
T-0.006
una-0.006
 and-0.006
die-0.005
Token keep
Token ID1394
Feature activation-9.57e-03

Pos logit contributions
 coc+0.026
una+0.023
T+0.023
die+0.023
mber+0.022
Neg logit contributions
 blame-0.026
date-0.025
eks-0.025
 fight-0.025
ivated-0.023
Token your
Token ID534
Feature activation+1.74e-02

Pos logit contributions
 coc+0.012
T+0.011
 and+0.011
una+0.010
R+0.010
Neg logit contributions
 blame-0.017
date-0.017
pieces-0.016
ivated-0.016
eks-0.016
Token seat
Token ID5852
Feature activation+3.45e-02

Pos logit contributions
 blame+0.028
ivated+0.027
date+0.027
cks+0.027
pieces+0.026
Neg logit contributions
 coc-0.026
 and-0.023
T-0.022
una-0.021
 little-0.020
Tokenbel
Token ID6667
Feature activation+1.21e-01

Pos logit contributions
 blame+0.061
date+0.059
ivated+0.058
eks+0.058
pieces+0.058
Neg logit contributions
 coc-0.044
T-0.040
una-0.038
 and-0.037
R-0.036
Tokents
Token ID912
Feature activation-1.09e-02

Pos logit contributions
 blame+0.107
date+0.103
ays+0.100
 fight+0.093
ican+0.093
Neg logit contributions
 coc-0.244
T-0.242
 and-0.238
una-0.234
R-0.225
Token on
Token ID319
Feature activation-1.87e-03

Pos logit contributions
 coc+0.013
T+0.012
una+0.011
 Lizard+0.011
R+0.010
Neg logit contributions
date-0.020
ivated-0.019
cks-0.019
 blame-0.019
eks-0.019
Token.
Token ID13
Feature activation+9.17e-04

Pos logit contributions
 coc+0.002
T+0.002
una+0.002
die+0.002
mber+0.002
Neg logit contributions
cks-0.003
date-0.003
ivated-0.003
eks-0.003
 blame-0.003
Token We
Token ID775
Feature activation+1.25e-02

Pos logit contributions
 blame+0.001
date+0.001
pieces+0.001
ivated+0.001
eks+0.001
Neg logit contributions
 coc-0.001
T-0.001
una-0.001
die-0.001
R-0.001
Token will
Token ID481
Feature activation-9.76e-03

Pos logit contributions
pieces+0.022
cks+0.022
ivated+0.022
date+0.022
 blame+0.022
Neg logit contributions
 coc-0.015
 and-0.014
T-0.013
una-0.012
 since-0.011
Token He
Token ID679
Feature activation+9.76e-03

Pos logit contributions
 blame+0.008
date+0.007
pieces+0.007
ivated+0.007
eks+0.007
Neg logit contributions
 coc-0.006
T-0.005
una-0.005
R-0.005
 and-0.005
Token would
Token ID561
Feature activation-1.28e-02

Pos logit contributions
 blame+0.017
date+0.017
ivated+0.017
pieces+0.017
cks+0.016
Neg logit contributions
 coc-0.012
 and-0.011
T-0.011
una-0.010
die-0.010
Token race
Token ID3234
Feature activation+1.50e-02

Pos logit contributions
 and+0.020
 coc+0.020
T+0.018
una+0.017
 since+0.017
Neg logit contributions
date-0.018
ays-0.018
 blame-0.017
ivated-0.017
limits-0.017
Token make
Token ID787
Feature activation-9.62e-03

Pos logit contributions
date+0.024
 blame+0.024
eks+0.024
 fight+0.024
pieces+0.024
Neg logit contributions
 coc-0.021
T-0.019
una-0.019
mber-0.018
die-0.017
Token-
Token ID12
Feature activation+2.65e-02

Pos logit contributions
 coc+0.012
T+0.011
una+0.010
 and+0.010
R+0.010
Neg logit contributions
 blame-0.017
date-0.017
ivated-0.016
eks-0.016
pieces-0.016
Tokenbel
Token ID6667
Feature activation+1.17e-01

Pos logit contributions
 blame+0.030
cks+0.027
ivated+0.027
date+0.027
eks+0.026
Neg logit contributions
 coc-0.050
T-0.050
 and-0.046
R-0.046
una-0.046
Tokenieve
Token ID12311
Feature activation-1.94e-02

Pos logit contributions
date+0.125
 blame+0.122
ays+0.120
ci+0.118
ican+0.114
Neg logit contributions
T-0.214
 coc-0.214
 and-0.213
una-0.206
"-0.198
Token cars
Token ID5006
Feature activation+5.46e-03

Pos logit contributions
 coc+0.021
T+0.019
una+0.018
R+0.017
die+0.017
Neg logit contributions
date-0.037
 blame-0.037
pieces-0.036
eks-0.036
 fight-0.036
Token around
Token ID1088
Feature activation-2.01e-03

Pos logit contributions
 blame+0.013
date+0.012
ivated+0.012
eks+0.012
pieces+0.012
Neg logit contributions
 coc-0.004
T-0.003
 and-0.003
una-0.003
R-0.003
Token the
Token ID262
Feature activation-2.31e-02

Pos logit contributions
 coc+0.003
T+0.002
una+0.002
mber+0.002
die+0.002
Neg logit contributions
date-0.003
 blame-0.003
 fight-0.003
pieces-0.003
eks-0.003
Token track
Token ID2610
Feature activation+8.21e-05

Pos logit contributions
 coc+0.034
una+0.034
T+0.034
mber+0.032
R+0.032
Neg logit contributions
 fight-0.033
date-0.033
pieces-0.033
 blame-0.033
 measured-0.031
Token Finally
Token ID9461
Feature activation-1.33e-02

Pos logit contributions
 coc+0.001
una+0.001
T+0.001
R+0.001
die+0.001
Neg logit contributions
 blame-0.001
pieces-0.001
date-0.001
ivated-0.001
eks-0.001
Token,
Token ID11
Feature activation-1.11e-02

Pos logit contributions
 coc+0.011
T+0.010
una+0.009
die+0.008
R+0.008
Neg logit contributions
 blame-0.029
date-0.029
eks-0.028
ivated-0.028
pieces-0.028
Token her
Token ID607
Feature activation-6.71e-03

Pos logit contributions
 coc+0.014
T+0.013
una+0.012
 and+0.012
R+0.011
Neg logit contributions
 blame-0.020
date-0.019
eks-0.019
ivated-0.019
cks-0.019
Token make
Token ID787
Feature activation+2.38e-03

Pos logit contributions
 coc+0.009
T+0.008
una+0.007
 and+0.007
R+0.007
Neg logit contributions
 blame-0.012
date-0.012
ivated-0.011
eks-0.011
pieces-0.011
Token-
Token ID12
Feature activation+2.52e-02

Pos logit contributions
 blame+0.004
date+0.004
eks+0.004
ivated+0.004
pieces+0.004
Neg logit contributions
 coc-0.003
T-0.002
una-0.002
 and-0.002
R-0.002
Tokenbel
Token ID6667
Feature activation+1.17e-01

Pos logit contributions
 blame+0.030
cks+0.028
ivated+0.028
date+0.027
 blamed+0.027
Neg logit contributions
T-0.047
 coc-0.046
una-0.043
 and-0.043
R-0.042
Tokenieve
Token ID12311
Feature activation-1.12e-02

Pos logit contributions
 blame+0.091
date+0.087
ays+0.084
ican+0.078
 fight+0.078
Neg logit contributions
 coc-0.255
T-0.251
 and-0.242
una-0.241
R-0.230
Token adventures
Token ID17545
Feature activation-6.49e-04

Pos logit contributions
 coc+0.013
T+0.012
una+0.011
 and+0.011
R+0.010
Neg logit contributions
 blame-0.021
date-0.020
ivated-0.020
eks-0.020
pieces-0.020
Token were
Token ID547
Feature activation-1.29e-02

Pos logit contributions
 coc+0.001
T+0.001
una+0.001
R+0.001
die+0.001
Neg logit contributions
date-0.001
 blame-0.001
pieces-0.001
ivated-0.001
eks-0.001
Token recorded
Token ID6264
Feature activation-6.91e-03

Pos logit contributions
 coc+0.016
 and+0.014
T+0.014
una+0.014
die+0.013
Neg logit contributions
 blame-0.023
ivated-0.023
eks-0.022
date-0.022
cks-0.022
Token.
Token ID13
Feature activation+1.39e-03

Pos logit contributions
 coc+0.008
T+0.007
una+0.007
R+0.007
die+0.007
Neg logit contributions
pieces-0.012
date-0.012
 blame-0.012
ivated-0.012
eks-0.012
Token She
Token ID1375
Feature activation-2.07e-04

Pos logit contributions
 coc+0.025
una+0.021
mber+0.021
T+0.021
R+0.020
Neg logit contributions
 blame-0.016
date-0.015
 blamed-0.014
ays-0.014
ivated-0.014
Token told
Token ID1297
Feature activation-1.31e-02

Pos logit contributions
 coc+0.000
T+0.000
una+0.000
R+0.000
 Lizard+0.000
Neg logit contributions
 blame-0.000
eks-0.000
date-0.000
ivated-0.000
pieces-0.000
Token him
Token ID683
Feature activation-1.63e-02

Pos logit contributions
 coc+0.020
T+0.018
die+0.017
R+0.017
mber+0.017
Neg logit contributions
 blame-0.021
eks-0.019
 fight-0.019
date-0.019
pieces-0.019
Token how
Token ID703
Feature activation-3.65e-03

Pos logit contributions
 coc+0.023
T+0.021
una+0.020
R+0.020
die+0.019
Neg logit contributions
 blame-0.026
pieces-0.025
eks-0.025
date-0.025
ivated-0.024
Token she
Token ID673
Feature activation+2.23e-02

Pos logit contributions
 coc+0.005
T+0.004
una+0.004
die+0.004
 and+0.004
Neg logit contributions
 blame-0.006
date-0.006
ivated-0.006
pieces-0.006
eks-0.006
Token se
Token ID384
Feature activation+9.62e-02

Pos logit contributions
 blame+0.038
date+0.037
ivated+0.037
pieces+0.036
cks+0.036
Neg logit contributions
 coc-0.030
T-0.027
 and-0.027
una-0.026
 Lizard-0.024
Tokenwed
Token ID19103
Feature activation-1.49e-02

Pos logit contributions
ivated+0.111
 blame+0.108
date+0.100
 fight+0.098
pieces+0.095
Neg logit contributions
una-0.176
 coc-0.170
aste-0.165
T-0.165
 and-0.164
Token,
Token ID11
Feature activation-5.86e-03

Pos logit contributions
 coc+0.024
mber+0.024
die+0.022
una+0.021
T+0.021
Neg logit contributions
pieces-0.020
eks-0.020
cks-0.020
ays-0.019
 measured-0.018
Token glued
Token ID39783
Feature activation-2.41e-02

Pos logit contributions
 coc+0.005
T+0.005
una+0.004
R+0.004
die+0.004
Neg logit contributions
 blame-0.013
pieces-0.012
ivated-0.012
eks-0.012
 measured-0.012
Token,
Token ID11
Feature activation-8.99e-03

Pos logit contributions
mber+0.039
 coc+0.039
una+0.036
die+0.036
T+0.035
Neg logit contributions
pieces-0.032
cks-0.032
eks-0.031
ays-0.030
 measured-0.030
Token and
Token ID290
Feature activation-1.58e-02

Pos logit contributions
 coc+0.009
una+0.008
T+0.008
die+0.008
mber+0.008
Neg logit contributions
 blame-0.018
pieces-0.018
 measured-0.018
eks-0.018
ivated-0.017
Token a
Token ID257
Feature activation-1.41e-03

Pos logit contributions
 blame+0.027
date+0.026
eks+0.026
ivated+0.025
pieces+0.025
Neg logit contributions
 coc-0.027
una-0.024
mber-0.024
die-0.023
T-0.023
Token needle
Token ID17598
Feature activation-2.47e-02

Pos logit contributions
 coc+0.002
una+0.002
T+0.002
die+0.002
R+0.002
Neg logit contributions
 blame-0.002
 fight-0.002
date-0.002
ivated-0.002
eks-0.002
Token and
Token ID290
Feature activation+9.92e-03

Pos logit contributions
 coc+0.015
T+0.011
mber+0.010
una+0.010
R+0.010
Neg logit contributions
date-0.059
 blame-0.058
ivated-0.058
 fight-0.058
eks-0.057
Token thread
Token ID4704
Feature activation-1.11e-02

Pos logit contributions
 blame+0.016
date+0.016
ivated+0.016
pieces+0.016
cks+0.016
Neg logit contributions
 coc-0.014
T-0.013
 and-0.012
una-0.012
R-0.011
Token and
Token ID290
Feature activation+9.06e-03

Pos logit contributions
 coc+0.010
T+0.009
 and+0.009
una+0.008
R+0.008
Neg logit contributions
 blame-0.024
date-0.023
ivated-0.023
pieces-0.023
eks-0.023
Token se
Token ID384
Feature activation+9.58e-02

Pos logit contributions
 blame+0.014
date+0.013
ivated+0.013
pieces+0.013
eks+0.013
Neg logit contributions
 coc-0.014
T-0.013
una-0.012
 and-0.012
R-0.011
Tokenwed
Token ID19103
Feature activation-5.61e-03

Pos logit contributions
ivated+0.110
 blame+0.106
date+0.098
 fight+0.097
pieces+0.097
Neg logit contributions
 coc-0.177
una-0.173
T-0.171
aste-0.163
 and-0.163
Token Teddy
Token ID29345
Feature activation-2.49e-03

Pos logit contributions
mber+0.010
 coc+0.010
die+0.009
una+0.009
 lizard+0.008
Neg logit contributions
ays-0.006
aved-0.006
pieces-0.006
cks-0.006
date-0.006
Token's
Token ID338
Feature activation+1.12e-02

Pos logit contributions
 coc+0.003
T+0.003
 and+0.003
una+0.003
 Lizard+0.003
Neg logit contributions
 blame-0.004
date-0.004
eks-0.004
ivated-0.004
pieces-0.004
Token eye
Token ID4151
Feature activation+1.27e-02

Pos logit contributions
 blame+0.016
date+0.015
 carve+0.015
 fight+0.015
bel+0.015
Neg logit contributions
mber-0.017
 coc-0.016
una-0.015
R-0.015
oby-0.015
Token back
Token ID736
Feature activation+2.78e-02

Pos logit contributions
 blame+0.023
date+0.022
eks+0.022
ivated+0.022
pieces+0.022
Neg logit contributions
 coc-0.016
T-0.014
una-0.013
 and-0.013
R-0.012
Token a
Token ID257
Feature activation-5.87e-03

Pos logit contributions
cks+0.010
date+0.010
pieces+0.010
pter+0.009
ivated+0.009
Neg logit contributions
 and-0.005
 coc-0.004
 tight-0.004
 from-0.004
 little-0.004
Token needle
Token ID17598
Feature activation-2.69e-02

Pos logit contributions
una+0.008
 coc+0.008
T+0.008
R+0.008
oby+0.007
Neg logit contributions
 blame-0.010
 fight-0.009
date-0.009
 measured-0.009
 punish-0.008
Token and
Token ID290
Feature activation+4.41e-03

Pos logit contributions
 coc+0.013
T+0.009
una+0.008
R+0.007
mber+0.007
Neg logit contributions
 blame-0.069
date-0.068
ivated-0.067
eks-0.067
 fight-0.066
Token thread
Token ID4704
Feature activation-1.97e-02

Pos logit contributions
 blame+0.007
date+0.007
ivated+0.007
pieces+0.007
eks+0.007
Neg logit contributions
 coc-0.006
T-0.006
una-0.006
 and-0.006
R-0.005
Token and
Token ID290
Feature activation+2.66e-03

Pos logit contributions
 coc+0.015
T+0.012
una+0.011
R+0.010
die+0.010
Neg logit contributions
 blame-0.045
date-0.044
eks-0.044
ivated-0.044
pieces-0.044
Token se
Token ID384
Feature activation+9.41e-02

Pos logit contributions
 blame+0.004
date+0.004
eks+0.004
pieces+0.004
ivated+0.004
Neg logit contributions
 coc-0.004
T-0.004
una-0.004
R-0.004
die-0.004
Tokenwed
Token ID19103
Feature activation-9.74e-03

Pos logit contributions
ivated+0.107
 blame+0.102
date+0.097
pieces+0.094
 fight+0.093
Neg logit contributions
 coc-0.175
una-0.170
T-0.169
aste-0.162
 and-0.161
Token a
Token ID257
Feature activation+9.33e-03

Pos logit contributions
mber+0.017
 coc+0.016
die+0.014
una+0.014
ww+0.014
Neg logit contributions
eks-0.012
pieces-0.012
cks-0.012
 measured-0.012
ays-0.012
Token patch
Token ID8529
Feature activation-5.64e-04

Pos logit contributions
 blame+0.013
 fight+0.013
 measured+0.013
date+0.012
 rebuild+0.012
Neg logit contributions
mber-0.014
una-0.014
oby-0.014
iley-0.013
die-0.013
Token on
Token ID319
Feature activation+2.09e-04

Pos logit contributions
 coc+0.001
T+0.001
una+0.001
 and+0.000
R+0.000
Neg logit contributions
 blame-0.001
date-0.001
ivated-0.001
eks-0.001
pieces-0.001
Token Patch
Token ID17106
Feature activation-2.16e-02

Pos logit contributions
 fight+0.000
cks+0.000
pieces+0.000
 blame+0.000
eks+0.000
Neg logit contributions
mber-0.000
 coc-0.000
die-0.000
T-0.000
R-0.000
Token of
Token ID286
Feature activation+6.06e-03

Pos logit contributions
 coc+0.008
T+0.007
una+0.007
die+0.007
 Lizard+0.007
Neg logit contributions
pieces-0.016
ivated-0.016
 blame-0.016
 measured-0.016
date-0.016
Token waves
Token ID9813
Feature activation+3.03e-03

Pos logit contributions
 fight+0.009
 blame+0.009
 rebuild+0.008
 argue+0.008
pieces+0.008
Neg logit contributions
mber-0.010
T-0.009
una-0.009
R-0.009
 Lizard-0.009
Token and
Token ID290
Feature activation+7.81e-03

Pos logit contributions
pieces+0.007
 blame+0.007
 fight+0.007
date+0.007
ivated+0.007
Neg logit contributions
 coc-0.002
T-0.001
una-0.001
die-0.001
R-0.001
Token wind
Token ID2344
Feature activation+7.40e-03

Pos logit contributions
 blame+0.011
pieces+0.010
date+0.010
eks+0.010
ivated+0.010
Neg logit contributions
 coc-0.013
T-0.012
una-0.012
R-0.011
 Lizard-0.011
Token and
Token ID290
Feature activation+1.71e-02

Pos logit contributions
 blame+0.014
date+0.014
eks+0.014
pieces+0.014
ivated+0.014
Neg logit contributions
 coc-0.008
T-0.007
 and-0.007
una-0.007
R-0.007
Token se
Token ID384
Feature activation+9.18e-02

Pos logit contributions
 blame+0.026
date+0.025
pieces+0.025
eks+0.024
ivated+0.024
Neg logit contributions
 coc-0.026
T-0.024
una-0.023
R-0.022
 Lizard-0.022
Tokenag
Token ID363
Feature activation+7.25e-03

Pos logit contributions
ivated+0.117
 blame+0.111
date+0.106
roads+0.101
 fight+0.099
Neg logit contributions
una-0.158
 coc-0.154
T-0.152
 and-0.148
aste-0.147
Tokenull
Token ID724
Feature activation+3.91e-02

Pos logit contributions
cks+0.011
 blame+0.011
date+0.011
eks+0.011
pieces+0.011
Neg logit contributions
 coc-0.011
T-0.009
una-0.009
 Lizard-0.009
mber-0.009
Tokens
Token ID82
Feature activation+1.54e-02

Pos logit contributions
 blame+0.079
eks+0.077
date+0.077
ivated+0.076
pieces+0.074
Neg logit contributions
 coc-0.040
 and-0.039
T-0.038
una-0.034
R-0.032
Token.
Token ID13
Feature activation+6.52e-03

Pos logit contributions
 blame+0.032
eks+0.032
date+0.032
pieces+0.031
ivated+0.031
Neg logit contributions
 coc-0.014
 and-0.014
T-0.012
una-0.011
R-0.010
Token
Token ID198
Feature activation+4.35e-03

Pos logit contributions
 blame+0.014
date+0.013
eks+0.013
ivated+0.013
pieces+0.013
Neg logit contributions
 coc-0.006
T-0.005
una-0.005
 and-0.005
R-0.005
Token the
Token ID262
Feature activation+1.06e-02

Pos logit contributions
 coc+0.004
T+0.004
 and+0.004
una+0.004
R+0.003
Neg logit contributions
 blame-0.005
ivated-0.005
date-0.005
pieces-0.005
eks-0.005
Token elderly
Token ID13830
Feature activation-3.62e-04

Pos logit contributions
 blame+0.018
date+0.017
eks+0.017
ivated+0.017
pieces+0.017
Neg logit contributions
 coc-0.014
T-0.013
una-0.013
 and-0.012
R-0.012
Token man
Token ID582
Feature activation+2.87e-02

Pos logit contributions
 and+0.000
 coc+0.000
"+0.000
T+0.000
 little+0.000
Neg logit contributions
cks-0.001
eks-0.001
ivated-0.001
 blame-0.001
date-0.001
Token on
Token ID319
Feature activation+9.15e-04

Pos logit contributions
ivated+0.054
 blame+0.054
date+0.053
 blamed+0.050
pieces+0.050
Neg logit contributions
 and-0.033
 coc-0.029
T-0.027
una-0.025
"-0.024
Token a
Token ID257
Feature activation-3.92e-03

Pos logit contributions
 blame+0.001
date+0.001
 fight+0.001
eks+0.001
pieces+0.001
Neg logit contributions
 coc-0.001
mber-0.001
una-0.001
die-0.001
T-0.001
Token stret
Token ID5759
Feature activation+9.16e-02

Pos logit contributions
 coc+0.009
T+0.009
una+0.008
die+0.008
 Lizard+0.008
Neg logit contributions
 blame-0.003
date-0.003
pieces-0.003
 fight-0.002
ivated-0.002
Tokencher
Token ID2044
Feature activation-3.10e-02

Pos logit contributions
cks+0.140
ivated+0.135
 blame+0.135
ays+0.132
eks+0.130
Neg logit contributions
T-0.129
 and-0.123
 since-0.112
"-0.110
 Lizard-0.110
Token.
Token ID13
Feature activation-9.17e-03

Pos logit contributions
 coc+0.034
 and+0.031
T+0.031
una+0.029
 Lizard+0.027
Neg logit contributions
 blame-0.060
eks-0.058
ivated-0.057
pieces-0.057
date-0.057
Token She
Token ID1375
Feature activation+4.41e-03

Pos logit contributions
 coc+0.013
T+0.012
una+0.012
die+0.011
R+0.011
Neg logit contributions
 blame-0.015
date-0.014
eks-0.014
ivated-0.014
pieces-0.014
Token thanked
Token ID26280
Feature activation+1.05e-02

Pos logit contributions
date+0.006
ivated+0.006
 blame+0.006
pieces+0.006
cks+0.006
Neg logit contributions
 coc-0.007
T-0.006
 and-0.006
una-0.006
R-0.006
Token Tim
Token ID5045
Feature activation-1.88e-02

Pos logit contributions
 blame+0.015
date+0.015
ivated+0.015
eks+0.014
pieces+0.014
Neg logit contributions
 coc-0.016
T-0.015
una-0.015
 and-0.015
R-0.014
Token She
Token ID1375
Feature activation+7.11e-03

Pos logit contributions
 coc+0.010
una+0.009
T+0.009
mber+0.008
die+0.008
Neg logit contributions
 blame-0.012
date-0.012
pieces-0.011
ivated-0.011
eks-0.011
Token cut
Token ID2005
Feature activation-2.22e-02

Pos logit contributions
ivated+0.014
date+0.014
 blame+0.014
pieces+0.014
cks+0.013
Neg logit contributions
 and-0.008
 coc-0.007
T-0.007
una-0.007
 since-0.006
Token and
Token ID290
Feature activation-7.12e-03

Pos logit contributions
 coc+0.040
mber+0.036
die+0.034
T+0.034
 Lizard+0.033
Neg logit contributions
eks-0.028
cks-0.027
pieces-0.027
ays-0.026
 measured-0.026
Token glued
Token ID39783
Feature activation-1.51e-02

Pos logit contributions
 coc+0.011
T+0.010
 and+0.010
una+0.010
R+0.009
Neg logit contributions
 blame-0.011
date-0.011
ivated-0.010
eks-0.010
pieces-0.010
Token and
Token ID290
Feature activation+2.17e-03

Pos logit contributions
mber+0.028
 coc+0.027
die+0.025
R+0.025
T+0.024
Neg logit contributions
cks-0.017
eks-0.017
pieces-0.017
 measured-0.016
ays-0.016
Token se
Token ID384
Feature activation+9.12e-02

Pos logit contributions
 blame+0.003
date+0.003
ivated+0.003
eks+0.003
pieces+0.003
Neg logit contributions
 coc-0.004
T-0.004
una-0.003
 and-0.003
R-0.003
Tokenwed
Token ID19103
Feature activation-2.07e-02

Pos logit contributions
ivated+0.116
 blame+0.112
 fight+0.102
date+0.101
roads+0.097
Neg logit contributions
una-0.160
 coc-0.158
T-0.154
 and-0.149
aste-0.147
Token different
Token ID1180
Feature activation+8.61e-03

Pos logit contributions
 coc+0.030
mber+0.030
die+0.028
una+0.026
 Lizard+0.026
Neg logit contributions
pieces-0.031
cks-0.030
 measured-0.030
eks-0.029
date-0.029
Token fabrics
Token ID39533
Feature activation-3.80e-02

Pos logit contributions
 blame+0.014
pieces+0.014
eks+0.013
cks+0.013
date+0.013
Neg logit contributions
 coc-0.012
T-0.011
una-0.011
 Lizard-0.011
R-0.011
Token together
Token ID1978
Feature activation+2.07e-03

Pos logit contributions
 coc+0.051
T+0.045
una+0.044
R+0.043
die+0.043
Neg logit contributions
 blame-0.062
pieces-0.062
date-0.062
ivated-0.061
cks-0.060
Token.
Token ID13
Feature activation-1.04e-02

Pos logit contributions
pieces+0.003
 blame+0.003
date+0.003
 fight+0.003
cks+0.003
Neg logit contributions
 coc-0.003
T-0.003
una-0.003
mber-0.003
R-0.003
Token matter
Token ID2300
Feature activation-9.70e-03

Pos logit contributions
 blame+0.051
pieces+0.049
date+0.049
eks+0.048
 fight+0.047
Neg logit contributions
 coc-0.047
T-0.044
una-0.042
R-0.040
 Lizard-0.039
Token how
Token ID703
Feature activation+8.03e-03

Pos logit contributions
 coc+0.017
T+0.017
mber+0.016
una+0.016
R+0.016
Neg logit contributions
 blame-0.012
cks-0.011
pieces-0.011
eks-0.011
 fight-0.011
Token small
Token ID1402
Feature activation+7.34e-02

Pos logit contributions
 blame+0.015
ivated+0.015
pieces+0.015
 fight+0.015
eks+0.014
Neg logit contributions
 coc-0.009
una-0.008
T-0.008
 Lizard-0.007
R-0.007
Token or
Token ID393
Feature activation+2.89e-02

Pos logit contributions
 blame+0.122
date+0.117
eks+0.116
pieces+0.116
ivated+0.116
Neg logit contributions
 coc-0.100
T-0.092
una-0.088
R-0.083
die-0.082
Token how
Token ID703
Feature activation+1.38e-02

Pos logit contributions
 blame+0.050
date+0.049
eks+0.048
ivated+0.048
pieces+0.047
Neg logit contributions
 coc-0.038
T-0.034
una-0.033
 and-0.032
R-0.031
Token big
Token ID1263
Feature activation+8.92e-02

Pos logit contributions
 blame+0.025
ivated+0.024
date+0.024
pieces+0.024
eks+0.024
Neg logit contributions
 coc-0.017
T-0.015
una-0.014
R-0.013
 Lizard-0.013
Token,
Token ID11
Feature activation+2.00e-02

Pos logit contributions
 blame+0.160
date+0.155
pieces+0.153
eks+0.153
ivated+0.153
Neg logit contributions
 coc-0.109
T-0.100
una-0.095
R-0.089
 Lizard-0.088
Token it
Token ID340
Feature activation+4.37e-03

Pos logit contributions
 blame+0.029
pieces+0.028
 fight+0.027
 measured+0.027
ivated+0.026
Neg logit contributions
 coc-0.031
T-0.030
una-0.029
 Lizard-0.028
R-0.027
Token is
Token ID318
Feature activation+1.25e-02

Pos logit contributions
 blame+0.008
date+0.008
ivated+0.008
pieces+0.008
eks+0.008
Neg logit contributions
 coc-0.005
T-0.005
 and-0.005
una-0.004
die-0.004
Token important
Token ID1593
Feature activation-1.42e-02

Pos logit contributions
 blame+0.021
date+0.020
ivated+0.020
eks+0.020
 measured+0.020
Neg logit contributions
 coc-0.017
T-0.015
una-0.015
R-0.014
 Lizard-0.014
Token to
Token ID284
Feature activation+2.33e-03

Pos logit contributions
 coc+0.021
T+0.019
una+0.019
Ho+0.018
R+0.018
Neg logit contributions
 fight-0.021
date-0.021
 blame-0.021
pieces-0.021
 measured-0.021
Token,
Token ID11
Feature activation-7.14e-03

Pos logit contributions
 coc+0.013
T+0.011
una+0.010
die+0.009
R+0.009
Neg logit contributions
 blame-0.029
date-0.028
ivated-0.028
 fight-0.028
cks-0.027
Token cats
Token ID11875
Feature activation-2.64e-02

Pos logit contributions
 coc+0.010
T+0.010
una+0.009
mber+0.009
die+0.009
Neg logit contributions
 blame-0.011
 fight-0.011
pieces-0.011
eks-0.011
date-0.011
Token,
Token ID11
Feature activation-9.95e-03

Pos logit contributions
 coc+0.027
T+0.024
una+0.022
die+0.021
mber+0.021
Neg logit contributions
 blame-0.054
 fight-0.053
ivated-0.052
date-0.052
cks-0.051
Token birds
Token ID10087
Feature activation-2.58e-02

Pos logit contributions
 coc+0.009
T+0.009
mber+0.008
una+0.008
oby+0.007
Neg logit contributions
 fight-0.021
 blame-0.021
 measured-0.020
eks-0.020
pieces-0.019
Token,
Token ID11
Feature activation-1.21e-02

Pos logit contributions
 coc+0.024
T+0.021
una+0.019
mber+0.017
die+0.017
Neg logit contributions
 blame-0.054
 fight-0.053
ivated-0.053
date-0.053
cks-0.053
Token ham
Token ID8891
Feature activation-7.95e-02

Pos logit contributions
 coc+0.009
T+0.009
mber+0.008
oby+0.007
una+0.007
Neg logit contributions
 fight-0.027
 blame-0.027
 measured-0.027
eks-0.027
cks-0.026
Tokensters
Token ID5937
Feature activation-3.87e-02

Pos logit contributions
 coc+0.093
T+0.083
mber+0.076
die+0.071
R+0.069
Neg logit contributions
cks-0.148
 fight-0.145
date-0.145
 blame-0.141
eks-0.140
Token,
Token ID11
Feature activation-1.49e-02

Pos logit contributions
 coc+0.039
T+0.035
una+0.031
R+0.030
die+0.029
Neg logit contributions
cks-0.076
 blame-0.076
 fight-0.076
date-0.075
 measured-0.075
Token and
Token ID290
Feature activation-1.40e-02

Pos logit contributions
 coc+0.010
T+0.009
mber+0.008
una+0.007
oby+0.007
Neg logit contributions
 fight-0.035
 blame-0.034
 measured-0.034
eks-0.034
cks-0.034
Token more
Token ID517
Feature activation-1.01e-02

Pos logit contributions
 coc+0.018
T+0.016
una+0.015
R+0.014
 and+0.014
Neg logit contributions
 blame-0.025
date-0.024
pieces-0.024
eks-0.024
ivated-0.024
Token.
Token ID13
Feature activation+2.06e-03

Pos logit contributions
 coc+0.012
T+0.012
una+0.011
R+0.010
 Lizard+0.010
Neg logit contributions
 blame-0.018
pieces-0.018
date-0.018
cks-0.017
 fight-0.017
Token Patty
Token ID42417
Feature activation-3.78e-02

Pos logit contributions
 coc+0.015
una+0.014
T+0.014
die+0.013
R+0.013
Neg logit contributions
 blame-0.017
pieces-0.017
date-0.017
ivated-0.016
 fight-0.016
Token's
Token ID338
Feature activation-2.86e-03

Pos logit contributions
 coc+0.048
T+0.044
 and+0.043
una+0.041
R+0.039
Neg logit contributions
 blame-0.067
date-0.065
ivated-0.064
cks-0.063
eks-0.063
Token bedroom
Token ID14043
Feature activation-4.92e-03

Pos logit contributions
 coc+0.004
T+0.003
una+0.003
 and+0.003
R+0.003
Neg logit contributions
 blame-0.005
date-0.005
ivated-0.005
eks-0.005
pieces-0.005
Token was
Token ID373
Feature activation-1.73e-02

Pos logit contributions
 coc+0.005
T+0.004
una+0.004
 and+0.004
R+0.004
Neg logit contributions
 blame-0.010
date-0.010
ivated-0.009
eks-0.009
pieces-0.009
Token decorated
Token ID24789
Feature activation-1.84e-02

Pos logit contributions
 coc+0.025
T+0.023
una+0.022
 and+0.021
R+0.021
Neg logit contributions
 blame-0.028
date-0.027
ivated-0.026
eks-0.026
pieces-0.026
Token perfectly
Token ID7138
Feature activation-7.94e-02

Pos logit contributions
 coc+0.027
T+0.024
una+0.024
R+0.023
die+0.023
Neg logit contributions
date-0.030
bel-0.027
pieces-0.026
ivated-0.026
ics-0.026
Token!
Token ID0
Feature activation+4.63e-03

Pos logit contributions
 coc+0.102
una+0.098
T+0.092
R+0.091
 Lizard+0.088
Neg logit contributions
ivated-0.132
pieces-0.131
 measured-0.130
date-0.128
cking-0.127
Token<|endoftext|>
Token ID50256
Feature activation-1.58e-02

Pos logit contributions
 blame+0.007
date+0.007
pieces+0.007
 Asked+0.007
ivated+0.007
Neg logit contributions
 coc-0.007
T-0.006
una-0.006
die-0.006
aste-0.005
TokenOne
Token ID3198
Feature activation-3.46e-03

Pos logit contributions
 coc+0.029
una+0.025
T+0.025
die+0.025
aste+0.025
Neg logit contributions
date-0.019
eks-0.018
roads-0.017
cks-0.017
 Asked-0.017
Token day
Token ID1110
Feature activation-5.45e-03

Pos logit contributions
 coc+0.003
T+0.003
una+0.003
 and+0.002
R+0.002
Neg logit contributions
 blame-0.007
date-0.007
ivated-0.007
eks-0.007
pieces-0.007
Token,
Token ID11
Feature activation-1.13e-02

Pos logit contributions
 coc+0.006
T+0.005
una+0.005
die+0.004
 Lizard+0.004
Neg logit contributions
date-0.011
ivated-0.011
 blame-0.011
eks-0.010
ued-0.010
Token.
Token ID13
Feature activation-6.69e-03

Pos logit contributions
 blame+0.030
date+0.030
ivated+0.029
eks+0.029
pieces+0.029
Neg logit contributions
 coc-0.021
T-0.018
una-0.018
 and-0.018
 Lizard-0.016
Token She
Token ID1375
Feature activation+3.40e-03

Pos logit contributions
 coc+0.012
T+0.011
una+0.010
Ho+0.010
R+0.010
Neg logit contributions
 blame-0.009
ivated-0.008
 blamed-0.008
 Asked-0.007
 begs-0.007
Token looked
Token ID3114
Feature activation+7.82e-03

Pos logit contributions
 blame+0.005
eks+0.004
date+0.004
 measured+0.004
ivated+0.004
Neg logit contributions
 coc-0.006
T-0.005
una-0.005
R-0.005
 Lizard-0.005
Token at
Token ID379
Feature activation+1.86e-02

Pos logit contributions
date+0.011
 blame+0.011
pieces+0.011
cks+0.011
ivated+0.011
Neg logit contributions
 coc-0.012
 and-0.012
una-0.011
T-0.011
die-0.011
Token the
Token ID262
Feature activation-2.45e-02

Pos logit contributions
 blame+0.030
eks+0.030
cks+0.029
pieces+0.028
date+0.028
Neg logit contributions
 coc-0.026
T-0.024
mber-0.021
R-0.021
una-0.021
Token ham
Token ID8891
Feature activation-7.87e-02

Pos logit contributions
T+0.030
 coc+0.030
R+0.027
una+0.026
 since+0.026
Neg logit contributions
 blame-0.042
 fight-0.041
date-0.040
pieces-0.039
eks-0.039
Tokenster
Token ID1706
Feature activation-2.96e-02

Pos logit contributions
 coc+0.104
T+0.101
mber+0.093
die+0.092
R+0.090
Neg logit contributions
cks-0.124
date-0.121
 fight-0.118
eks-0.117
pieces-0.117
Token and
Token ID290
Feature activation-3.62e-03

Pos logit contributions
 coc+0.021
T+0.020
R+0.017
una+0.017
mber+0.016
Neg logit contributions
 blame-0.066
pieces-0.066
date-0.065
cks-0.064
 fight-0.064
Token saw
Token ID2497
Feature activation+2.15e-02

Pos logit contributions
 coc+0.005
 and+0.004
T+0.004
una+0.004
die+0.004
Neg logit contributions
date-0.006
 blame-0.006
ivated-0.006
pieces-0.006
eks-0.006
Token that
Token ID326
Feature activation+7.63e-03

Pos logit contributions
 blame+0.037
date+0.037
ivated+0.036
eks+0.036
pieces+0.035
Neg logit contributions
 coc-0.028
T-0.026
una-0.025
 and-0.024
 Lizard-0.023
Token he
Token ID339
Feature activation+8.62e-03

Pos logit contributions
 blame+0.013
date+0.012
pieces+0.012
eks+0.012
cks+0.012
Neg logit contributions
 coc-0.010
T-0.009
una-0.009
R-0.008
mber-0.008
Token.
Token ID13
Feature activation+2.48e-03

Pos logit contributions
 coc+0.000
 and+0.000
T+0.000
una+0.000
 since+0.000
Neg logit contributions
 blame-0.001
eks-0.001
ivated-0.001
date-0.001
cks-0.001
Token
Token ID198
Feature activation-1.96e-03

Pos logit contributions
 blame+0.004
date+0.004
pieces+0.004
ivated+0.004
eks+0.004
Neg logit contributions
 coc-0.003
una-0.003
T-0.003
 and-0.003
die-0.003
Token
Token ID198
Feature activation-4.31e-03

Pos logit contributions
 coc+0.002
una+0.002
T+0.002
 Lizard+0.002
die+0.002
Neg logit contributions
 blame-0.004
date-0.004
pieces-0.004
ivated-0.003
eks-0.003
TokenAndy
Token ID35314
Feature activation-2.14e-03

Pos logit contributions
 coc+0.005
una+0.004
 and+0.004
 Lizard+0.004
die+0.004
Neg logit contributions
date-0.008
 blame-0.008
pieces-0.008
ivated-0.008
eks-0.007
Token stood
Token ID6204
Feature activation-1.82e-02

Pos logit contributions
 coc+0.003
T+0.002
una+0.002
R+0.002
 Lizard+0.002
Neg logit contributions
 blame-0.004
date-0.004
eks-0.004
ivated-0.004
pieces-0.004
Token perfectly
Token ID7138
Feature activation-7.76e-02

Pos logit contributions
 coc+0.030
T+0.027
una+0.027
R+0.025
die+0.025
Neg logit contributions
 blame-0.026
ivated-0.025
date-0.025
eks-0.025
pieces-0.024
Token still
Token ID991
Feature activation-3.94e-02

Pos logit contributions
 coc+0.069
una+0.058
T+0.053
die+0.051
R+0.049
Neg logit contributions
ivated-0.171
 blame-0.163
 fight-0.161
cking-0.160
 measured-0.159
Token for
Token ID329
Feature activation-2.12e-02

Pos logit contributions
 coc+0.033
T+0.028
una+0.027
R+0.024
die+0.024
Neg logit contributions
 blame-0.087
ivated-0.086
date-0.085
pieces-0.084
 fight-0.084
Token the
Token ID262
Feature activation-1.15e-03

Pos logit contributions
 coc+0.030
die+0.029
una+0.027
T+0.027
mber+0.027
Neg logit contributions
 blame-0.033
 fight-0.033
eks-0.032
pieces-0.032
date-0.031
Token entire
Token ID2104
Feature activation+2.47e-03

Pos logit contributions
 coc+0.002
T+0.002
una+0.002
 and+0.002
R+0.002
Neg logit contributions
 blame-0.002
date-0.001
ivated-0.001
eks-0.001
pieces-0.001
Token duration
Token ID9478
Feature activation+6.10e-03

Pos logit contributions
 blame+0.004
date+0.004
pieces+0.004
 fight+0.004
ivated+0.004
Neg logit contributions
 coc-0.003
T-0.003
una-0.003
die-0.003
R-0.003
Token worked
Token ID3111
Feature activation-1.55e-02

Pos logit contributions
 coc+0.006
 and+0.005
T+0.005
una+0.005
 Lizard+0.005
Neg logit contributions
pieces-0.009
ivated-0.009
 blame-0.009
date-0.009
cks-0.009
Token!
Token ID0
Feature activation+6.88e-03

Pos logit contributions
 coc+0.023
una+0.020
T+0.019
mber+0.019
R+0.019
Neg logit contributions
date-0.024
 fight-0.024
 measured-0.024
eks-0.024
ivated-0.024
Token The
Token ID383
Feature activation-9.54e-03

Pos logit contributions
 blame+0.011
date+0.011
pieces+0.010
ivated+0.010
eks+0.010
Neg logit contributions
 coc-0.010
una-0.009
T-0.009
die-0.008
R-0.008
Token door
Token ID3420
Feature activation+7.66e-03

Pos logit contributions
 coc+0.014
T+0.013
una+0.013
die+0.012
R+0.012
Neg logit contributions
 blame-0.016
 fight-0.015
date-0.015
eks-0.015
pieces-0.014
Token opened
Token ID4721
Feature activation-3.48e-02

Pos logit contributions
date+0.014
pieces+0.014
ivated+0.013
cks+0.013
 blame+0.013
Neg logit contributions
 and-0.010
 coc-0.009
T-0.008
 since-0.008
 little-0.008
Token easily
Token ID3538
Feature activation-7.53e-02

Pos logit contributions
 coc+0.040
una+0.036
T+0.034
mber+0.032
R+0.032
Neg logit contributions
 measured-0.064
eks-0.064
cks-0.063
date-0.063
ivated-0.063
Token.
Token ID13
Feature activation-4.97e-03

Pos logit contributions
 coc+0.095
una+0.087
T+0.084
die+0.082
R+0.082
Neg logit contributions
 blame-0.132
ivated-0.130
eks-0.129
 fight-0.129
date-0.128
Token He
Token ID679
Feature activation-2.49e-03

Pos logit contributions
 coc+0.008
una+0.007
T+0.007
die+0.007
aste+0.007
Neg logit contributions
 blame-0.007
date-0.007
pieces-0.006
ivated-0.006
 measured-0.006
Token was
Token ID373
Feature activation-5.06e-03

Pos logit contributions
 coc+0.004
T+0.004
una+0.004
R+0.003
die+0.003
Neg logit contributions
 blame-0.004
eks-0.003
date-0.003
 measured-0.003
ivated-0.003
Token free
Token ID1479
Feature activation-3.79e-02

Pos logit contributions
 coc+0.006
T+0.005
 and+0.005
una+0.005
 Lizard+0.005
Neg logit contributions
 blame-0.009
date-0.009
pieces-0.009
eks-0.009
ivated-0.009
Token!
Token ID0
Feature activation+3.19e-03

Pos logit contributions
 coc+0.049
una+0.046
mber+0.045
R+0.045
T+0.044
Neg logit contributions
date-0.063
 fight-0.062
ivated-0.061
 measured-0.061
 blame-0.060
Token It
Token ID632
Feature activation+2.62e-03

Pos logit contributions
 coc+0.019
T+0.017
una+0.017
die+0.016
aste+0.016
Neg logit contributions
 blame-0.020
date-0.019
ivated-0.019
pieces-0.019
cks-0.019
Token was
Token ID373
Feature activation-2.01e-02

Pos logit contributions
 blame+0.006
date+0.006
pieces+0.006
ivated+0.006
eks+0.006
Neg logit contributions
 coc-0.002
T-0.002
 and-0.002
una-0.002
R-0.002
Token hiding
Token ID11816
Feature activation-2.77e-02

Pos logit contributions
 coc+0.028
T+0.028
mber+0.026
una+0.025
R+0.025
Neg logit contributions
cks-0.031
 blame-0.030
 fight-0.030
 measured-0.029
cking-0.029
Token under
Token ID739
Feature activation-1.28e-02

Pos logit contributions
 coc+0.054
T+0.051
mber+0.050
una+0.049
die+0.047
Neg logit contributions
cks-0.029
eks-0.029
bel-0.027
ivated-0.027
cking-0.026
Token a
Token ID257
Feature activation-2.40e-02

Pos logit contributions
 coc+0.017
T+0.016
mber+0.016
die+0.016
una+0.016
Neg logit contributions
cks-0.019
 fight-0.019
 blame-0.019
eks-0.019
 measured-0.018
Token seaw
Token ID32085
Feature activation-7.49e-02

Pos logit contributions
T+0.034
una+0.032
 since+0.031
die+0.030
oby+0.030
Neg logit contributions
 fight-0.035
cks-0.033
date-0.032
 contest-0.032
pieces-0.031
Tokeneed
Token ID2308
Feature activation-7.09e-02

Pos logit contributions
 since+0.095
T+0.091
 Lizard+0.089
Sleep+0.088
 coc+0.088
Neg logit contributions
 carve-0.107
cks-0.105
okes-0.104
 measured-0.101
date-0.101
Token.
Token ID13
Feature activation-8.49e-03

Pos logit contributions
 coc+0.075
T+0.073
una+0.064
die+0.064
R+0.063
Neg logit contributions
 fight-0.133
cks-0.130
date-0.128
 blame-0.128
ivated-0.127
Token Lily
Token ID20037
Feature activation-1.87e-02

Pos logit contributions
 coc+0.011
T+0.010
una+0.010
die+0.009
aste+0.009
Neg logit contributions
 blame-0.014
date-0.014
ivated-0.014
pieces-0.014
cks-0.013
Token said
Token ID531
Feature activation-1.10e-02

Pos logit contributions
 coc+0.030
una+0.027
T+0.026
Ho+0.025
aste+0.025
Neg logit contributions
 measured-0.027
 blamed-0.027
 blame-0.026
 fight-0.025
 Asked-0.024
Token,
Token ID11
Feature activation-2.80e-02

Pos logit contributions
 coc+0.007
T+0.006
 and+0.006
una+0.006
R+0.005
Neg logit contributions
 blame-0.026
date-0.026
pieces-0.025
ivated-0.025
eks-0.025
Token it
Token ID340
Feature activation-1.84e-02

Pos logit contributions
date+0.037
 blame+0.036
ivated+0.035
eks+0.035
pieces+0.034
Neg logit contributions
 coc-0.020
 and-0.017
T-0.017
una-0.017
 Lizard-0.015
Token come
Token ID1282
Feature activation-1.36e-04

Pos logit contributions
 coc+0.020
 and+0.019
T+0.017
una+0.016
 Lizard+0.016
Neg logit contributions
date-0.037
 blame-0.036
pieces-0.035
ivated-0.035
eks-0.035
Token out
Token ID503
Feature activation-1.93e-03

Pos logit contributions
 coc+0.000
T+0.000
una+0.000
mber+0.000
R+0.000
Neg logit contributions
 blame-0.000
eks-0.000
date-0.000
ivated-0.000
cks-0.000
Token of
Token ID286
Feature activation-4.87e-03

Pos logit contributions
 coc+0.002
T+0.002
una+0.002
mber+0.002
die+0.002
Neg logit contributions
 blame-0.004
ivated-0.004
pieces-0.004
eks-0.004
cks-0.004
Token the
Token ID262
Feature activation-2.30e-02

Pos logit contributions
 coc+0.007
T+0.007
mber+0.006
una+0.006
R+0.006
Neg logit contributions
 blame-0.007
cks-0.007
eks-0.007
 fight-0.007
date-0.007
Token seaw
Token ID32085
Feature activation-7.47e-02

Pos logit contributions
T+0.034
 coc+0.032
una+0.032
R+0.030
 Lizard+0.030
Neg logit contributions
 fight-0.034
 blame-0.034
date-0.032
cks-0.032
eks-0.032
Tokeneed
Token ID2308
Feature activation-6.30e-02

Pos logit contributions
T+0.092
 since+0.091
 Lizard+0.090
 coc+0.090
 Gray+0.086
Neg logit contributions
date-0.106
 carve-0.106
cks-0.104
okes-0.103
our-0.100
Token and
Token ID290
Feature activation-7.57e-03

Pos logit contributions
T+0.062
 coc+0.062
R+0.055
una+0.054
die+0.054
Neg logit contributions
 fight-0.120
date-0.119
pieces-0.116
cks-0.116
 blame-0.115
Token walk
Token ID2513
Feature activation+1.26e-02

Pos logit contributions
 coc+0.011
 and+0.010
T+0.008
die+0.008
 Lizard+0.008
Neg logit contributions
date-0.013
ivated-0.013
pieces-0.012
 blame-0.012
eks-0.012
Token around
Token ID1088
Feature activation-6.39e-03

Pos logit contributions
 blame+0.020
date+0.019
ivated+0.019
eks+0.019
pieces+0.019
Neg logit contributions
 coc-0.018
T-0.017
 and-0.016
una-0.016
R-0.015
Token the
Token ID262
Feature activation-2.40e-02

Pos logit contributions
 coc+0.008
T+0.007
una+0.007
 and+0.007
R+0.006
Neg logit contributions
 blame-0.012
date-0.011
ivated-0.011
eks-0.011
pieces-0.011
Token pet
Token ID4273
Feature activation-8.15e-03

Pos logit contributions
 coc+0.017
 and+0.015
una+0.014
die+0.013
T+0.013
Neg logit contributions
 blame-0.028
ivated-0.028
pieces-0.028
eks-0.028
cks-0.027
Token -
Token ID532
Feature activation-1.97e-02

Pos logit contributions
 coc+0.010
T+0.009
una+0.008
die+0.008
R+0.008
Neg logit contributions
 blame-0.015
date-0.015
pieces-0.014
ivated-0.014
eks-0.014
Token a
Token ID257
Feature activation-9.28e-03

Pos logit contributions
 coc+0.028
T+0.026
una+0.025
R+0.024
die+0.024
Neg logit contributions
 blame-0.031
pieces-0.030
date-0.030
ivated-0.029
 measured-0.029
Token cute
Token ID13779
Feature activation-3.68e-02

Pos logit contributions
 coc+0.012
T+0.011
una+0.011
 since+0.010
R+0.010
Neg logit contributions
 fight-0.015
date-0.015
 blame-0.015
 measured-0.014
pieces-0.014
Token little
Token ID1310
Feature activation-7.67e-03

Pos logit contributions
T+0.047
 coc+0.046
R+0.043
una+0.043
Ho+0.041
Neg logit contributions
 fight-0.062
date-0.061
 tease-0.056
 blame-0.056
 competition-0.053
Token ham
Token ID8891
Feature activation-7.35e-02

Pos logit contributions
 coc+0.010
T+0.009
una+0.009
R+0.008
die+0.008
Neg logit contributions
 blame-0.013
date-0.013
 fight-0.012
pieces-0.012
ivated-0.012
Tokenster
Token ID1706
Feature activation-2.10e-02

Pos logit contributions
 coc+0.097
T+0.090
die+0.082
mber+0.082
R+0.078
Neg logit contributions
date-0.123
 fight-0.113
 measured-0.112
cks-0.111
 blame-0.109
Token.
Token ID13
Feature activation+1.55e-03

Pos logit contributions
 coc+0.023
T+0.020
una+0.019
R+0.018
die+0.018
Neg logit contributions
 blame-0.040
date-0.040
pieces-0.039
eks-0.039
ivated-0.039
Token Lily
Token ID20037
Feature activation-1.82e-02

Pos logit contributions
 blame+0.003
date+0.002
pieces+0.002
ivated+0.002
 blamed+0.002
Neg logit contributions
 coc-0.002
una-0.002
T-0.002
R-0.002
die-0.002
Token was
Token ID373
Feature activation-4.67e-03

Pos logit contributions
 coc+0.026
una+0.023
T+0.023
R+0.022
Ho+0.022
Neg logit contributions
 blame-0.028
 measured-0.027
 fight-0.026
eks-0.026
date-0.026
Token so
Token ID523
Feature activation+3.23e-03

Pos logit contributions
 coc+0.004
T+0.004
 and+0.004
una+0.003
die+0.003
Neg logit contributions
 blame-0.010
date-0.010
eks-0.010
pieces-0.010
ivated-0.009
Token tug
Token ID27762
Feature activation+1.89e-02

Pos logit contributions
 coc+0.011
una+0.011
T+0.011
 since+0.010
R+0.010
Neg logit contributions
 blame-0.013
 fight-0.013
date-0.013
pieces-0.012
 measured-0.012
Token.
Token ID13
Feature activation+2.81e-03

Pos logit contributions
 blame+0.031
date+0.031
eks+0.031
pieces+0.030
cks+0.030
Neg logit contributions
 coc-0.026
T-0.023
una-0.022
die-0.021
R-0.021
Token It
Token ID632
Feature activation+4.39e-03

Pos logit contributions
 blame+0.004
date+0.004
ivated+0.004
pieces+0.004
eks+0.004
Neg logit contributions
 coc-0.004
una-0.004
T-0.004
die-0.003
aste-0.003
Token came
Token ID1625
Feature activation+4.39e-03

Pos logit contributions
 blame+0.010
date+0.010
ivated+0.010
pieces+0.010
cks+0.009
Neg logit contributions
 coc-0.004
T-0.003
 and-0.003
una-0.003
 Lizard-0.003
Token out
Token ID503
Feature activation-3.14e-03

Pos logit contributions
eks+0.004
ivated+0.004
aved+0.004
date+0.004
ered+0.004
Neg logit contributions
 coc-0.009
mber-0.009
una-0.008
T-0.008
R-0.008
Token easily
Token ID3538
Feature activation-7.31e-02

Pos logit contributions
 coc+0.003
una+0.003
T+0.003
R+0.003
mber+0.003
Neg logit contributions
 blame-0.006
ivated-0.006
date-0.006
eks-0.006
pieces-0.006
Token,
Token ID11
Feature activation-8.51e-03

Pos logit contributions
 coc+0.076
una+0.072
T+0.067
R+0.066
mber+0.064
Neg logit contributions
 fight-0.141
 measured-0.136
ivated-0.135
 blame-0.134
 argue-0.133
Token and
Token ID290
Feature activation-2.04e-02

Pos logit contributions
 coc+0.009
T+0.008
una+0.008
R+0.007
mber+0.007
Neg logit contributions
 blame-0.017
ivated-0.016
 fight-0.016
 measured-0.016
pieces-0.016
Token she
Token ID673
Feature activation+5.82e-03

Pos logit contributions
 coc+0.027
T+0.025
una+0.024
 and+0.023
R+0.022
Neg logit contributions
 blame-0.035
date-0.034
ivated-0.033
eks-0.033
pieces-0.033
Token held
Token ID2714
Feature activation+4.44e-03

Pos logit contributions
 blame+0.010
date+0.010
ivated+0.010
pieces+0.010
eks+0.010
Neg logit contributions
 coc-0.008
T-0.007
 and-0.007
una-0.006
 Lizard-0.006
Token it
Token ID340
Feature activation+7.37e-03

Pos logit contributions
 blame+0.008
eks+0.007
pieces+0.007
date+0.007
cks+0.007
Neg logit contributions
 coc-0.006
T-0.005
una-0.005
mber-0.005
die-0.005
Token rod
Token ID15299
Feature activation-5.19e-04

Pos logit contributions
 coc+0.013
T+0.012
una+0.012
 and+0.011
R+0.011
Neg logit contributions
 blame-0.017
date-0.017
ivated-0.016
eks-0.016
pieces-0.016
Token.
Token ID13
Feature activation+7.55e-05

Pos logit contributions
 coc+0.001
T+0.000
una+0.000
 and+0.000
R+0.000
Neg logit contributions
 blame-0.001
date-0.001
ivated-0.001
eks-0.001
pieces-0.001
Token It
Token ID632
Feature activation+2.86e-03

Pos logit contributions
 blame+0.000
date+0.000
ivated+0.000
pieces+0.000
 blamed+0.000
Neg logit contributions
 coc-0.000
una-0.000
T-0.000
die-0.000
R-0.000
Token came
Token ID1625
Feature activation+6.74e-03

Pos logit contributions
ivated+0.007
pieces+0.006
date+0.006
 blame+0.006
cks+0.006
Neg logit contributions
 and-0.002
 coc-0.002
T-0.002
una-0.002
 Lizard-0.002
Token off
Token ID572
Feature activation-3.79e-02

Pos logit contributions
 fight+0.007
eks+0.007
date+0.006
 blame+0.006
ivated+0.006
Neg logit contributions
 coc-0.013
mber-0.013
una-0.012
T-0.012
R-0.012
Token easily
Token ID3538
Feature activation-7.27e-02

Pos logit contributions
 coc+0.041
T+0.037
una+0.036
R+0.033
die+0.033
Neg logit contributions
 blame-0.073
date-0.072
 fight-0.071
ivated-0.071
eks-0.069
Token.
Token ID13
Feature activation-2.06e-03

Pos logit contributions
 coc+0.089
T+0.082
una+0.082
R+0.078
mber+0.076
Neg logit contributions
 fight-0.127
 blame-0.127
date-0.123
ivated-0.123
cks-0.122
Token It
Token ID632
Feature activation-3.01e-04

Pos logit contributions
 coc+0.003
una+0.002
T+0.002
die+0.002
R+0.002
Neg logit contributions
 blame-0.004
date-0.003
ivated-0.003
pieces-0.003
eks-0.003
Token fell
Token ID3214
Feature activation-1.59e-02

Pos logit contributions
 and+0.000
 coc+0.000
T+0.000
una+0.000
 Lizard+0.000
Neg logit contributions
pieces-0.001
ivated-0.001
date-0.001
cks-0.001
 blame-0.001
Token on
Token ID319
Feature activation-1.49e-02

Pos logit contributions
 coc+0.030
mber+0.028
T+0.028
una+0.028
R+0.027
Neg logit contributions
 blame-0.017
eks-0.017
 fight-0.017
date-0.017
ivated-0.016
Token the
Token ID262
Feature activation-1.47e-02

Pos logit contributions
 coc+0.022
mber+0.022
die+0.021
una+0.021
T+0.021
Neg logit contributions
 fight-0.020
 blame-0.020
date-0.020
eks-0.019
pieces-0.019
Token The
Token ID383
Feature activation-5.17e-03

Pos logit contributions
 coc+0.036
una+0.031
die+0.031
T+0.031
aste+0.030
Neg logit contributions
 blame-0.032
ivated-0.029
 blamed-0.029
 Asked-0.029
date-0.028
Token dentist
Token ID38408
Feature activation+5.61e-03

Pos logit contributions
die+0.008
 coc+0.008
mber+0.007
T+0.007
aste+0.007
Neg logit contributions
 blame-0.007
 fight-0.007
 measured-0.007
date-0.007
eks-0.007
Token put
Token ID1234
Feature activation-1.11e-02

Pos logit contributions
 blame+0.009
ivated+0.009
date+0.009
pieces+0.009
eks+0.009
Neg logit contributions
 coc-0.008
T-0.007
 and-0.007
una-0.007
 Lizard-0.006
Token a
Token ID257
Feature activation+1.78e-03

Pos logit contributions
 coc+0.015
T+0.014
una+0.013
die+0.013
mber+0.013
Neg logit contributions
 blame-0.018
eks-0.018
date-0.018
cks-0.017
pieces-0.017
Token b
Token ID275
Feature activation-1.10e-02

Pos logit contributions
 blame+0.003
 fight+0.003
 measured+0.003
date+0.003
bel+0.002
Neg logit contributions
una-0.002
oby-0.002
 since-0.002
T-0.002
die-0.002
Tokenib
Token ID571
Feature activation-7.24e-02

Pos logit contributions
T+0.018
 coc+0.017
die+0.017
mber+0.017
R+0.017
Neg logit contributions
eks-0.014
ow-0.013
 blame-0.013
pieces-0.013
 measured-0.013
Token around
Token ID1088
Feature activation+1.03e-02

Pos logit contributions
 coc+0.099
mber+0.093
T+0.093
R+0.088
 since+0.086
Neg logit contributions
cks-0.109
eks-0.109
cking-0.102
 measured-0.102
bel-0.101
Token her
Token ID607
Feature activation-2.16e-02

Pos logit contributions
 blame+0.015
date+0.014
eks+0.014
pieces+0.014
 fight+0.014
Neg logit contributions
 coc-0.016
die-0.014
mber-0.014
una-0.014
aste-0.014
Token neck
Token ID7393
Feature activation-3.61e-02

Pos logit contributions
 coc+0.029
T+0.029
mber+0.028
R+0.028
una+0.027
Neg logit contributions
 blame-0.034
 fight-0.033
date-0.032
pieces-0.032
 measured-0.032
Token.
Token ID13
Feature activation-1.91e-02

Pos logit contributions
 coc+0.050
die+0.045
mber+0.045
T+0.044
R+0.043
Neg logit contributions
 measured-0.055
cks-0.054
pieces-0.053
date-0.053
roads-0.053
Token He
Token ID679
Feature activation+3.22e-03

Pos logit contributions
 coc+0.030
T+0.027
die+0.026
una+0.026
oby+0.026
Neg logit contributions
 blame-0.027
 blamed-0.027
 Asked-0.025
 measured-0.025
ivated-0.025
Token.
Token ID13
Feature activation-1.39e-02

Pos logit contributions
date+0.030
 blame+0.029
eks+0.029
ivated+0.029
pieces+0.028
Neg logit contributions
 coc-0.015
 and-0.015
T-0.014
una-0.013
 Lizard-0.012
Token He
Token ID679
Feature activation-1.41e-03

Pos logit contributions
 coc+0.018
una+0.014
T+0.014
 since+0.014
aste+0.014
Neg logit contributions
 blame-0.024
date-0.023
cks-0.023
 blamed-0.023
 Asked-0.022
Token runs
Token ID4539
Feature activation+3.03e-03

Pos logit contributions
 coc+0.002
T+0.002
 and+0.002
una+0.002
 Lizard+0.002
Neg logit contributions
 blame-0.002
date-0.002
ivated-0.002
pieces-0.002
eks-0.002
Token to
Token ID284
Feature activation-9.89e-03

Pos logit contributions
date+0.004
 blame+0.004
 fight+0.004
eks+0.004
ivated+0.004
Neg logit contributions
 coc-0.005
T-0.004
una-0.004
die-0.004
mber-0.004
Token the
Token ID262
Feature activation-2.56e-02

Pos logit contributions
 coc+0.010
 and+0.009
T+0.009
una+0.009
 Lizard+0.008
Neg logit contributions
date-0.020
 blame-0.020
pieces-0.019
ivated-0.019
eks-0.019
Token candy
Token ID18550
Feature activation-7.15e-02

Pos logit contributions
 coc+0.039
T+0.037
una+0.036
R+0.034
die+0.033
Neg logit contributions
 blame-0.039
date-0.037
pieces-0.036
ivated-0.036
eks-0.036
Token store
Token ID3650
Feature activation-2.64e-02

Pos logit contributions
 coc+0.112
T+0.107
die+0.105
una+0.101
R+0.101
Neg logit contributions
date-0.102
 fight-0.099
 blame-0.095
roads-0.092
 measured-0.091
Token.
Token ID13
Feature activation-1.32e-02

Pos logit contributions
 coc+0.030
T+0.027
una+0.026
R+0.024
die+0.024
Neg logit contributions
 blame-0.050
date-0.049
ivated-0.048
eks-0.048
pieces-0.048
Token
Token ID198
Feature activation-9.28e-03

Pos logit contributions
 coc+0.017
T+0.014
una+0.013
die+0.013
aste+0.013
Neg logit contributions
 blame-0.023
date-0.022
ivated-0.022
 blamed-0.022
cks-0.022
Token
Token ID198
Feature activation-1.04e-02

Pos logit contributions
 coc+0.012
mber+0.010
T+0.010
 since+0.010
die+0.010
Neg logit contributions
 blame-0.016
cks-0.015
eks-0.015
date-0.015
pieces-0.014
TokenL
Token ID43
Feature activation-2.60e-03

Pos logit contributions
 coc+0.013
 Lizard+0.011
 pad+0.010
una+0.010
 lizard+0.010
Neg logit contributions
date-0.017
bel-0.017
 rebuild-0.017
 measured-0.017
limits-0.016
Token was
Token ID373
Feature activation-2.01e-02

Pos logit contributions
 blame+0.006
date+0.006
pieces+0.006
ivated+0.006
eks+0.006
Neg logit contributions
 coc-0.002
T-0.002
 and-0.002
una-0.002
R-0.002
Token hiding
Token ID11816
Feature activation-2.77e-02

Pos logit contributions
 coc+0.028
T+0.028
mber+0.026
una+0.025
R+0.025
Neg logit contributions
cks-0.031
 blame-0.030
 fight-0.030
 measured-0.029
cking-0.029
Token under
Token ID739
Feature activation-1.28e-02

Pos logit contributions
 coc+0.054
T+0.051
mber+0.050
una+0.049
die+0.047
Neg logit contributions
cks-0.029
eks-0.029
bel-0.027
ivated-0.027
cking-0.026
Token a
Token ID257
Feature activation-2.40e-02

Pos logit contributions
 coc+0.017
T+0.016
mber+0.016
die+0.016
una+0.016
Neg logit contributions
cks-0.019
 fight-0.019
 blame-0.019
eks-0.019
 measured-0.018
Token seaw
Token ID32085
Feature activation-7.49e-02

Pos logit contributions
T+0.034
una+0.032
 since+0.031
die+0.030
oby+0.030
Neg logit contributions
 fight-0.035
cks-0.033
date-0.032
 contest-0.032
pieces-0.031
Tokeneed
Token ID2308
Feature activation-7.09e-02

Pos logit contributions
 since+0.095
T+0.091
 Lizard+0.089
Sleep+0.088
 coc+0.088
Neg logit contributions
 carve-0.107
cks-0.105
okes-0.104
 measured-0.101
date-0.101
Token.
Token ID13
Feature activation-8.49e-03

Pos logit contributions
 coc+0.075
T+0.073
una+0.064
die+0.064
R+0.063
Neg logit contributions
 fight-0.133
cks-0.130
date-0.128
 blame-0.128
ivated-0.127
Token Lily
Token ID20037
Feature activation-1.87e-02

Pos logit contributions
 coc+0.011
T+0.010
una+0.010
die+0.009
aste+0.009
Neg logit contributions
 blame-0.014
date-0.014
ivated-0.014
pieces-0.014
cks-0.013
Token said
Token ID531
Feature activation-1.10e-02

Pos logit contributions
 coc+0.030
una+0.027
T+0.026
Ho+0.025
aste+0.025
Neg logit contributions
 measured-0.027
 blamed-0.027
 blame-0.026
 fight-0.025
 Asked-0.024
Token,
Token ID11
Feature activation-2.80e-02

Pos logit contributions
 coc+0.007
T+0.006
 and+0.006
una+0.006
R+0.005
Neg logit contributions
 blame-0.026
date-0.026
pieces-0.025
ivated-0.025
eks-0.025
Token "
Token ID366
Feature activation+6.92e-03

Pos logit contributions
 coc+0.035
una+0.029
T+0.028
die+0.026
aste+0.025
Neg logit contributions
 blame-0.052
ivated-0.051
 fight-0.049
 measured-0.049
date-0.049
Token it
Token ID340
Feature activation+2.22e-03

Pos logit contributions
 coc+0.051
una+0.049
T+0.047
Ho+0.047
mber+0.046
Neg logit contributions
 fight-0.047
 measured-0.046
 blame-0.046
pieces-0.046
 argue-0.044
Token on
Token ID319
Feature activation-2.35e-03

Pos logit contributions
 blame+0.003
date+0.003
eks+0.003
pieces+0.003
ivated+0.003
Neg logit contributions
 coc-0.004
T-0.003
una-0.003
R-0.003
die-0.003
Token.
Token ID13
Feature activation-2.30e-03

Pos logit contributions
 coc+0.003
una+0.003
mber+0.003
Ho+0.003
T+0.003
Neg logit contributions
 fight-0.004
date-0.004
 blame-0.003
 measured-0.003
 argue-0.003
Token It
Token ID632
Feature activation-5.87e-03

Pos logit contributions
 coc+0.003
T+0.002
una+0.002
 and+0.002
R+0.002
Neg logit contributions
 blame-0.004
date-0.004
ivated-0.004
eks-0.004
pieces-0.004
Token fit
Token ID4197
Feature activation-1.39e-02

Pos logit contributions
 coc+0.006
T+0.005
 and+0.005
una+0.005
die+0.005
Neg logit contributions
 blame-0.012
ivated-0.012
pieces-0.012
date-0.012
eks-0.011
Token perfectly
Token ID7138
Feature activation-7.04e-02

Pos logit contributions
 coc+0.024
mber+0.023
una+0.022
T+0.022
R+0.022
Neg logit contributions
eks-0.017
 measured-0.017
cks-0.016
 fight-0.016
ered-0.016
Token and
Token ID290
Feature activation-6.63e-03

Pos logit contributions
 coc+0.077
una+0.067
T+0.065
R+0.061
die+0.061
Neg logit contributions
ivated-0.135
 measured-0.133
date-0.131
cks-0.131
pieces-0.131
Token he
Token ID339
Feature activation+1.47e-02

Pos logit contributions
 coc+0.010
una+0.009
T+0.009
R+0.008
mber+0.008
Neg logit contributions
 blame-0.010
 measured-0.010
pieces-0.010
date-0.010
eks-0.010
Token found
Token ID1043
Feature activation-1.29e-02

Pos logit contributions
 blame+0.024
date+0.023
ivated+0.023
pieces+0.023
eks+0.023
Neg logit contributions
 coc-0.021
T-0.019
 and-0.018
una-0.018
die-0.017
Token it
Token ID340
Feature activation+5.18e-04

Pos logit contributions
 coc+0.017
T+0.015
una+0.014
mber+0.014
R+0.014
Neg logit contributions
 blame-0.022
pieces-0.022
eks-0.022
date-0.022
 measured-0.021
Token so
Token ID523
Feature activation+1.04e-02

Pos logit contributions
 blame+0.001
date+0.001
ivated+0.001
eks+0.001
pieces+0.001
Neg logit contributions
 coc-0.001
T-0.001
una-0.001
 and-0.001
 Lizard-0.001
Token so
Token ID523
Feature activation+5.63e-04

Pos logit contributions
 coc+0.014
una+0.013
T+0.013
R+0.012
mber+0.011
Neg logit contributions
 blame-0.025
ivated-0.024
pieces-0.024
date-0.024
eks-0.024
Token with
Token ID351
Feature activation-6.12e-03

Pos logit contributions
 blame+0.001
date+0.001
pieces+0.001
ivated+0.001
eks+0.001
Neg logit contributions
 coc-0.001
una-0.001
T-0.001
die-0.001
R-0.001
Token a
Token ID257
Feature activation-9.37e-03

Pos logit contributions
 coc+0.008
T+0.007
una+0.007
R+0.007
die+0.007
Neg logit contributions
 blame-0.011
eks-0.010
date-0.010
pieces-0.010
ivated-0.010
Token burst
Token ID11173
Feature activation-2.80e-02

Pos logit contributions
una+0.013
 coc+0.011
T+0.011
aste+0.011
oby+0.011
Neg logit contributions
 fight-0.016
 blame-0.015
date-0.015
 measured-0.014
 argue-0.014
Token of
Token ID286
Feature activation-2.94e-03

Pos logit contributions
 coc+0.037
T+0.035
una+0.034
R+0.033
die+0.032
Neg logit contributions
date-0.046
ivated-0.045
 blame-0.044
eks-0.043
 measured-0.043
Token energy
Token ID2568
Feature activation-6.99e-02

Pos logit contributions
T+0.004
 coc+0.004
una+0.004
mber+0.004
R+0.004
Neg logit contributions
 blame-0.005
eks-0.004
 fight-0.004
ivated-0.004
date-0.004
Token,
Token ID11
Feature activation-1.11e-02

Pos logit contributions
 coc+0.072
T+0.066
una+0.064
R+0.059
die+0.058
Neg logit contributions
 blame-0.136
date-0.134
ivated-0.133
pieces-0.132
 fight-0.130
Token she
Token ID673
Feature activation-1.29e-03

Pos logit contributions
 coc+0.018
una+0.017
T+0.016
die+0.016
aste+0.016
Neg logit contributions
 blame-0.015
ivated-0.015
pieces-0.015
date-0.014
eks-0.014
Token ran
Token ID4966
Feature activation+3.04e-03

Pos logit contributions
 coc+0.002
T+0.002
una+0.002
 and+0.002
R+0.002
Neg logit contributions
 blame-0.002
date-0.002
ivated-0.002
pieces-0.002
eks-0.002
Token to
Token ID284
Feature activation-1.23e-02

Pos logit contributions
ivated+0.004
eks+0.004
date+0.004
 fight+0.003
 blame+0.003
Neg logit contributions
 coc-0.006
una-0.005
T-0.005
mber-0.005
die-0.005
Token the
Token ID262
Feature activation-2.80e-02

Pos logit contributions
 coc+0.017
T+0.016
una+0.015
R+0.014
die+0.014
Neg logit contributions
 blame-0.020
date-0.020
eks-0.020
ivated-0.019
pieces-0.019
Token.
Token ID13
Feature activation-3.06e-03

Pos logit contributions
 blame+0.029
date+0.028
pieces+0.027
ivated+0.027
 fight+0.027
Neg logit contributions
 coc-0.024
T-0.022
una-0.021
R-0.020
die-0.020
Token He
Token ID679
Feature activation-1.75e-03

Pos logit contributions
 coc+0.004
T+0.004
una+0.004
aste+0.004
R+0.004
Neg logit contributions
 blame-0.005
date-0.005
ivated-0.005
 blamed-0.004
pieces-0.004
Token sat
Token ID3332
Feature activation-1.73e-02

Pos logit contributions
 coc+0.003
T+0.003
una+0.003
R+0.003
 Lizard+0.003
Neg logit contributions
 blame-0.002
eks-0.002
 measured-0.002
date-0.002
ivated-0.002
Token up
Token ID510
Feature activation-1.35e-03

Pos logit contributions
 coc+0.033
T+0.030
una+0.029
R+0.027
die+0.027
Neg logit contributions
eks-0.021
ivated-0.020
 blame-0.020
date-0.020
cks-0.019
Token stra
Token ID3534
Feature activation+3.64e-02

Pos logit contributions
 coc+0.001
T+0.001
una+0.001
R+0.001
die+0.001
Neg logit contributions
 blame-0.003
date-0.003
ivated-0.003
eks-0.003
pieces-0.003
Tokenighter
Token ID4799
Feature activation-6.97e-02

Pos logit contributions
 blame+0.054
date+0.052
ivated+0.051
eks+0.051
pieces+0.051
Neg logit contributions
 coc-0.057
T-0.052
una-0.051
 and-0.049
R-0.048
Token and
Token ID290
Feature activation-1.33e-02

Pos logit contributions
 coc+0.054
T+0.050
R+0.044
Ho+0.042
una+0.041
Neg logit contributions
bel-0.148
 fight-0.147
date-0.147
cks-0.146
pieces-0.144
Token waited
Token ID13488
Feature activation-2.57e-02

Pos logit contributions
 coc+0.021
T+0.020
 and+0.019
una+0.019
die+0.018
Neg logit contributions
date-0.019
 blame-0.019
pieces-0.018
ivated-0.018
cks-0.018
Token for
Token ID329
Feature activation-1.48e-02

Pos logit contributions
 coc+0.034
T+0.030
una+0.029
R+0.028
die+0.028
Neg logit contributions
ivated-0.043
 blame-0.043
date-0.043
pieces-0.042
eks-0.042
Token the
Token ID262
Feature activation-2.92e-03

Pos logit contributions
 coc+0.022
die+0.022
 pad+0.021
mber+0.021
aste+0.020
Neg logit contributions
 blame-0.020
 fight-0.019
eks-0.019
pieces-0.019
 measured-0.019
Token cat
Token ID3797
Feature activation+1.09e-02

Pos logit contributions
 coc+0.004
T+0.003
una+0.003
R+0.003
die+0.003
Neg logit contributions
 blame-0.005
date-0.005
 fight-0.005
pieces-0.005
ivated-0.005
Token the
Token ID262
Feature activation+3.37e-03

Pos logit contributions
 coc+0.005
T+0.005
una+0.005
die+0.004
aste+0.004
Neg logit contributions
 blame-0.005
pieces-0.005
date-0.005
eks-0.005
ivated-0.005
Token fancy
Token ID14996
Feature activation+1.94e-03

Pos logit contributions
 blame+0.006
date+0.006
pieces+0.006
eks+0.006
ivated+0.006
Neg logit contributions
 coc-0.004
T-0.004
una-0.004
R-0.004
 and-0.003
Token suit
Token ID6050
Feature activation+2.35e-02

Pos logit contributions
 blame+0.003
date+0.003
pieces+0.003
ivated+0.003
eks+0.003
Neg logit contributions
 coc-0.003
T-0.002
una-0.002
R-0.002
die-0.002
Token was
Token ID373
Feature activation-1.29e-02

Pos logit contributions
 blame+0.049
ivated+0.048
cks+0.047
pieces+0.047
date+0.046
Neg logit contributions
 and-0.024
 coc-0.019
T-0.019
 little-0.017
 since-0.017
Token wrapped
Token ID12908
Feature activation-2.17e-02

Pos logit contributions
 coc+0.019
T+0.017
 and+0.017
una+0.016
die+0.016
Neg logit contributions
 blame-0.020
eks-0.019
ivated-0.019
date-0.019
pieces-0.019
Token perfectly
Token ID7138
Feature activation-6.97e-02

Pos logit contributions
 coc+0.037
T+0.033
una+0.033
R+0.032
mber+0.032
Neg logit contributions
date-0.029
 blame-0.027
ivated-0.027
eks-0.027
pieces-0.027
Token.
Token ID13
Feature activation+2.83e-03

Pos logit contributions
 coc+0.105
una+0.095
T+0.094
R+0.088
mber+0.087
Neg logit contributions
date-0.103
 blame-0.103
ivated-0.102
pieces-0.101
cks-0.101
Token
Token ID220
Feature activation-9.73e-04

Pos logit contributions
 blame+0.004
date+0.004
pieces+0.004
ivated+0.004
eks+0.004
Neg logit contributions
 coc-0.004
una-0.004
T-0.004
die-0.003
R-0.003
Token
Token ID198
Feature activation-5.05e-03

Pos logit contributions
 coc+0.001
T+0.001
una+0.001
 and+0.001
R+0.001
Neg logit contributions
 blame-0.002
date-0.002
ivated-0.002
eks-0.002
pieces-0.002
Token
Token ID198
Feature activation-6.26e-03

Pos logit contributions
 coc+0.007
una+0.006
aste+0.005
 lizard+0.005
 skull+0.005
Neg logit contributions
date-0.009
 blame-0.009
ivated-0.008
pieces-0.008
 Asked-0.008
TokenKat
Token ID25881
Feature activation-7.07e-03

Pos logit contributions
 coc+0.008
una+0.007
 and+0.007
die+0.007
 Lizard+0.006
Neg logit contributions
date-0.011
 blame-0.011
pieces-0.011
ivated-0.011
roads-0.010
Token and
Token ID290
Feature activation-8.06e-03

Pos logit contributions
 coc+0.010
 silk+0.008
T+0.008
una+0.007
R+0.007
Neg logit contributions
 measured-0.020
 blamed-0.019
 blame-0.018
 greeted-0.018
 fight-0.018
Token Sue
Token ID26089
Feature activation+3.58e-03

Pos logit contributions
 coc+0.011
 and+0.011
una+0.010
T+0.010
 Lizard+0.010
Neg logit contributions
ivated-0.013
date-0.013
cks-0.013
pieces-0.013
 blame-0.013
Token ran
Token ID4966
Feature activation+8.29e-03

Pos logit contributions
date+0.006
cks+0.006
pieces+0.006
ivated+0.006
 blame+0.006
Neg logit contributions
 and-0.005
 coc-0.005
T-0.004
 since-0.004
 Lizard-0.004
Token to
Token ID284
Feature activation-1.21e-02

Pos logit contributions
 blame+0.013
date+0.013
ivated+0.012
 fight+0.012
eks+0.012
Neg logit contributions
 coc-0.013
T-0.012
una-0.011
die-0.011
R-0.011
Token the
Token ID262
Feature activation-3.37e-02

Pos logit contributions
 coc+0.012
 and+0.012
T+0.011
una+0.010
 Lizard+0.010
Neg logit contributions
date-0.024
eks-0.024
ivated-0.024
pieces-0.024
cks-0.024
Token candy
Token ID18550
Feature activation-6.97e-02

Pos logit contributions
 coc+0.043
T+0.039
una+0.037
 and+0.035
R+0.035
Neg logit contributions
 blame-0.060
date-0.058
ivated-0.057
eks-0.057
pieces-0.057
Token store
Token ID3650
Feature activation-2.79e-02

Pos logit contributions
 coc+0.109
T+0.102
die+0.099
una+0.098
R+0.096
Neg logit contributions
date-0.100
 blame-0.096
 fight-0.095
ivated-0.094
pieces-0.093
Token and
Token ID290
Feature activation-1.63e-02

Pos logit contributions
 coc+0.027
 and+0.024
T+0.023
una+0.022
 Lizard+0.020
Neg logit contributions
 blame-0.058
eks-0.056
date-0.056
ivated-0.056
pieces-0.056
Token bought
Token ID5839
Feature activation-1.56e-02

Pos logit contributions
 coc+0.028
 and+0.026
T+0.026
 since+0.024
 Lizard+0.024
Neg logit contributions
date-0.021
 blame-0.021
cks-0.021
ivated-0.021
pieces-0.020
Token some
Token ID617
Feature activation-1.72e-02

Pos logit contributions
 coc+0.020
T+0.018
una+0.017
 and+0.017
R+0.016
Neg logit contributions
 blame-0.028
date-0.027
ivated-0.027
eks-0.027
cks-0.026
Token candy
Token ID18550
Feature activation-3.57e-02

Pos logit contributions
 coc+0.018
T+0.017
una+0.016
die+0.016
R+0.015
Neg logit contributions
 blame-0.033
pieces-0.032
date-0.032
 measured-0.031
ivated-0.031
Token the
Token ID262
Feature activation-9.84e-03

Pos logit contributions
date+0.032
 fight+0.032
 blame+0.031
 argue+0.030
 measured+0.030
Neg logit contributions
 coc-0.031
una-0.031
mber-0.030
T-0.028
Ho-0.028
Token fridge
Token ID25772
Feature activation+2.26e-03

Pos logit contributions
 coc+0.014
T+0.013
una+0.013
R+0.012
 and+0.012
Neg logit contributions
 blame-0.015
date-0.015
pieces-0.015
eks-0.015
ivated-0.015
Token.
Token ID13
Feature activation-1.07e-02

Pos logit contributions
 blame+0.004
eks+0.004
ivated+0.004
pieces+0.004
date+0.004
Neg logit contributions
 coc-0.003
 and-0.002
T-0.002
una-0.002
 since-0.002
Token It
Token ID632
Feature activation-2.14e-04

Pos logit contributions
 coc+0.015
T+0.014
una+0.013
 and+0.013
R+0.012
Neg logit contributions
 blame-0.018
eks-0.017
date-0.017
ivated-0.017
pieces-0.017
Token fit
Token ID4197
Feature activation-1.21e-02

Pos logit contributions
 coc+0.000
 and+0.000
T+0.000
una+0.000
die+0.000
Neg logit contributions
pieces-0.000
ivated-0.000
 blame-0.000
date-0.000
cks-0.000
Token perfectly
Token ID7138
Feature activation-6.96e-02

Pos logit contributions
 coc+0.021
mber+0.020
T+0.019
una+0.019
R+0.019
Neg logit contributions
eks-0.015
 measured-0.014
cks-0.014
pieces-0.014
ered-0.014
Token!
Token ID0
Feature activation+4.77e-03

Pos logit contributions
 coc+0.088
una+0.079
T+0.077
R+0.072
die+0.072
Neg logit contributions
ivated-0.120
 measured-0.118
date-0.118
cks-0.117
 fight-0.117
Token The
Token ID383
Feature activation-9.44e-03

Pos logit contributions
 blame+0.008
date+0.008
eks+0.008
ivated+0.008
pieces+0.008
Neg logit contributions
 coc-0.006
T-0.006
una-0.005
 and-0.005
R-0.005
Token little
Token ID1310
Feature activation-6.44e-03

Pos logit contributions
 coc+0.006
T+0.006
una+0.006
R+0.006
 Lizard+0.005
Neg logit contributions
 blame-0.022
date-0.021
 fight-0.021
pieces-0.021
 measured-0.020
Token girl
Token ID2576
Feature activation-1.12e-02

Pos logit contributions
 coc+0.005
T+0.005
una+0.005
R+0.004
die+0.004
Neg logit contributions
 blame-0.014
date-0.014
pieces-0.014
ivated-0.014
eks-0.013
Token was
Token ID373
Feature activation-9.96e-03

Pos logit contributions
 coc+0.012
T+0.011
una+0.011
R+0.010
die+0.010
Neg logit contributions
 blame-0.021
date-0.021
eks-0.021
ivated-0.020
pieces-0.020
Token attractive
Token ID10966
Feature activation-1.16e-02

Pos logit contributions
 coc+0.009
una+0.008
 and+0.008
T+0.008
aste+0.007
Neg logit contributions
 blame-0.011
date-0.011
eks-0.011
ivated-0.011
pieces-0.011
Token sandwich
Token ID20433
Feature activation-2.14e-02

Pos logit contributions
T+0.018
 coc+0.018
R+0.017
una+0.017
Ho+0.016
Neg logit contributions
date-0.016
 blame-0.016
 fight-0.016
pieces-0.015
 measured-0.015
Token with
Token ID351
Feature activation-7.26e-03

Pos logit contributions
 coc+0.026
T+0.024
R+0.022
die+0.022
una+0.021
Neg logit contributions
date-0.038
 blame-0.037
 fight-0.036
 measured-0.036
pieces-0.036
Token cheese
Token ID9891
Feature activation-3.07e-02

Pos logit contributions
 coc+0.009
T+0.008
una+0.008
mber+0.007
die+0.007
Neg logit contributions
 blame-0.013
date-0.012
pieces-0.012
eks-0.012
cks-0.012
Token,
Token ID11
Feature activation-6.63e-03

Pos logit contributions
 coc+0.035
T+0.035
die+0.031
mber+0.031
R+0.031
Neg logit contributions
date-0.054
 fight-0.053
cks-0.053
 blame-0.053
 measured-0.052
Token ham
Token ID8891
Feature activation-6.95e-02

Pos logit contributions
 coc+0.007
T+0.007
una+0.007
 Lizard+0.006
R+0.006
Neg logit contributions
 blame-0.012
 measured-0.012
 fight-0.012
pieces-0.012
 blamed-0.012
Token,
Token ID11
Feature activation-1.28e-02

Pos logit contributions
 coc+0.074
T+0.068
mber+0.065
R+0.060
die+0.060
Neg logit contributions
cks-0.132
 measured-0.127
 fight-0.126
ays-0.125
date-0.124
Token and
Token ID290
Feature activation-1.35e-02

Pos logit contributions
 coc+0.009
T+0.008
 Lizard+0.007
una+0.007
mber+0.007
Neg logit contributions
 measured-0.029
 blame-0.028
 fight-0.027
cks-0.027
eks-0.027
Token lettuce
Token ID39406
Feature activation-1.45e-02

Pos logit contributions
 coc+0.017
T+0.015
una+0.014
 and+0.014
R+0.014
Neg logit contributions
 blame-0.024
date-0.024
ivated-0.024
eks-0.023
cks-0.023
Token.
Token ID13
Feature activation-6.02e-03

Pos logit contributions
 coc+0.017
T+0.014
una+0.014
 and+0.014
R+0.013
Neg logit contributions
 blame-0.028
date-0.027
ivated-0.027
eks-0.027
pieces-0.027
Token Tim
Token ID5045
Feature activation-6.11e-03

Pos logit contributions
 coc+0.007
T+0.006
 and+0.006
una+0.006
R+0.006
Neg logit contributions
 blame-0.011
eks-0.011
pieces-0.011
date-0.011
ivated-0.011
Token a
Token ID257
Feature activation-5.87e-03

Pos logit contributions
cks+0.010
date+0.010
pieces+0.010
pter+0.009
ivated+0.009
Neg logit contributions
 and-0.005
 coc-0.004
 tight-0.004
 from-0.004
 little-0.004
Token needle
Token ID17598
Feature activation-2.69e-02

Pos logit contributions
una+0.008
 coc+0.008
T+0.008
R+0.008
oby+0.007
Neg logit contributions
 blame-0.010
 fight-0.009
date-0.009
 measured-0.009
 punish-0.008
Token and
Token ID290
Feature activation+4.41e-03

Pos logit contributions
 coc+0.013
T+0.009
una+0.008
R+0.007
mber+0.007
Neg logit contributions
 blame-0.069
date-0.068
ivated-0.067
eks-0.067
 fight-0.066
Token thread
Token ID4704
Feature activation-1.97e-02

Pos logit contributions
 blame+0.007
date+0.007
ivated+0.007
pieces+0.007
eks+0.007
Neg logit contributions
 coc-0.006
T-0.006
una-0.006
 and-0.006
R-0.005
Token and
Token ID290
Feature activation+2.66e-03

Pos logit contributions
 coc+0.015
T+0.012
una+0.011
R+0.010
die+0.010
Neg logit contributions
 blame-0.045
date-0.044
eks-0.044
ivated-0.044
pieces-0.044
Token se
Token ID384
Feature activation+9.41e-02

Pos logit contributions
 blame+0.004
date+0.004
eks+0.004
pieces+0.004
ivated+0.004
Neg logit contributions
 coc-0.004
T-0.004
una-0.004
R-0.004
die-0.004
Tokenwed
Token ID19103
Feature activation-9.74e-03

Pos logit contributions
ivated+0.107
 blame+0.102
date+0.097
pieces+0.094
 fight+0.093
Neg logit contributions
 coc-0.175
una-0.170
T-0.169
aste-0.162
 and-0.161
Token a
Token ID257
Feature activation+9.33e-03

Pos logit contributions
mber+0.017
 coc+0.016
die+0.014
una+0.014
ww+0.014
Neg logit contributions
eks-0.012
pieces-0.012
cks-0.012
 measured-0.012
ays-0.012
Token patch
Token ID8529
Feature activation-5.64e-04

Pos logit contributions
 blame+0.013
 fight+0.013
 measured+0.013
date+0.012
 rebuild+0.012
Neg logit contributions
mber-0.014
una-0.014
oby-0.014
iley-0.013
die-0.013
Token on
Token ID319
Feature activation+2.09e-04

Pos logit contributions
 coc+0.001
T+0.001
una+0.001
 and+0.000
R+0.000
Neg logit contributions
 blame-0.001
date-0.001
ivated-0.001
eks-0.001
pieces-0.001
Token Patch
Token ID17106
Feature activation-2.16e-02

Pos logit contributions
 fight+0.000
cks+0.000
pieces+0.000
 blame+0.000
eks+0.000
Neg logit contributions
mber-0.000
 coc-0.000
die-0.000
T-0.000
R-0.000
Token liked
Token ID8288
Feature activation-4.90e-03

Pos logit contributions
 coc+0.016
una+0.014
T+0.013
die+0.013
R+0.013
Neg logit contributions
 blame-0.016
ivated-0.015
 measured-0.015
date-0.015
eks-0.015
Token to
Token ID284
Feature activation-6.83e-03

Pos logit contributions
 coc+0.007
una+0.007
T+0.007
die+0.007
 Lizard+0.006
Neg logit contributions
 blame-0.007
 fight-0.007
date-0.007
ivated-0.007
pieces-0.007
Token play
Token ID711
Feature activation+3.10e-03

Pos logit contributions
 coc+0.011
 and+0.010
T+0.010
una+0.009
 little+0.009
Neg logit contributions
ays-0.010
cks-0.010
ivated-0.010
cking-0.010
pieces-0.010
Token make
Token ID787
Feature activation+9.39e-03

Pos logit contributions
 blame+0.006
eks+0.005
pieces+0.005
cks+0.005
ivated+0.005
Neg logit contributions
 coc-0.004
 and-0.004
T-0.004
una-0.003
R-0.003
Token-
Token ID12
Feature activation+4.72e-02

Pos logit contributions
 blame+0.017
date+0.017
pieces+0.017
eks+0.017
ivated+0.017
Neg logit contributions
 coc-0.011
T-0.010
una-0.009
die-0.009
R-0.009
Tokenbel
Token ID6667
Feature activation+1.33e-01

Pos logit contributions
cks+0.045
 blame+0.044
ays+0.043
 begs+0.040
 blamed+0.039
Neg logit contributions
T-0.099
 coc-0.096
L-0.091
 and-0.089
una-0.088
Tokenieve
Token ID12311
Feature activation+7.89e-03

Pos logit contributions
 blame+0.125
date+0.124
ays+0.121
ishly+0.117
ican+0.116
Neg logit contributions
T-0.259
 and-0.257
 coc-0.255
una-0.245
"-0.235
Token.
Token ID13
Feature activation+1.57e-02

Pos logit contributions
date+0.013
 blame+0.013
pieces+0.013
eks+0.012
ivated+0.012
Neg logit contributions
 coc-0.011
T-0.010
una-0.009
die-0.009
R-0.009
Token
Token ID198
Feature activation+1.91e-04

Pos logit contributions
 blame+0.016
date+0.016
ivated+0.015
pieces+0.015
eks+0.014
Neg logit contributions
 coc-0.031
T-0.029
una-0.029
R-0.028
die-0.027
Token
Token ID198
Feature activation+1.69e-04

Pos logit contributions
date+0.000
 blame+0.000
ivated+0.000
eks+0.000
pieces+0.000
Neg logit contributions
 coc-0.000
T-0.000
una-0.000
die-0.000
 Lizard-0.000
TokenOne
Token ID3198
Feature activation-3.75e-03

Pos logit contributions
date+0.000
ivated+0.000
ued+0.000
 blame+0.000
roads+0.000
Neg logit contributions
 coc-0.000
una-0.000
 Lizard-0.000
die-0.000
T-0.000
Token watched
Token ID7342
Feature activation-1.12e-03

Pos logit contributions
 coc+0.026
una+0.023
T+0.023
R+0.022
 Lizard+0.022
Neg logit contributions
 blame-0.029
eks-0.028
date-0.028
 measured-0.028
ivated-0.027
Token as
Token ID355
Feature activation-5.25e-03

Pos logit contributions
 coc+0.001
T+0.001
una+0.001
die+0.001
R+0.001
Neg logit contributions
 blame-0.002
date-0.002
ivated-0.002
eks-0.002
pieces-0.002
Token his
Token ID465
Feature activation+1.59e-02

Pos logit contributions
 coc+0.008
una+0.007
T+0.007
mber+0.006
die+0.006
Neg logit contributions
 blame-0.008
eks-0.008
ivated-0.008
 fight-0.008
 measured-0.008
Token mom
Token ID1995
Feature activation+3.53e-02

Pos logit contributions
 blame+0.027
date+0.026
eks+0.025
ivated+0.025
pieces+0.025
Neg logit contributions
 coc-0.022
T-0.020
una-0.019
R-0.018
die-0.018
Token carefully
Token ID7773
Feature activation-1.17e-02

Pos logit contributions
 blame+0.051
date+0.050
ivated+0.050
cks+0.049
pieces+0.049
Neg logit contributions
 coc-0.055
T-0.052
 and-0.052
una-0.048
 since-0.047
Token se
Token ID384
Feature activation+8.27e-02

Pos logit contributions
 coc+0.018
 and+0.017
T+0.017
una+0.016
R+0.015
Neg logit contributions
 blame-0.017
date-0.017
pieces-0.017
ivated-0.017
cks-0.016
Tokenwed
Token ID19103
Feature activation-2.76e-02

Pos logit contributions
ivated+0.100
 blame+0.094
 fight+0.086
date+0.084
pieces+0.084
Neg logit contributions
una-0.151
 coc-0.145
T-0.141
die-0.140
aste-0.139
Token the
Token ID262
Feature activation-7.76e-03

Pos logit contributions
 coc+0.049
mber+0.048
una+0.045
T+0.043
die+0.043
Neg logit contributions
eks-0.033
 measured-0.032
ays-0.031
date-0.031
ered-0.031
Token stitch
Token ID24695
Feature activation-8.56e-03

Pos logit contributions
 coc+0.010
T+0.009
una+0.009
die+0.008
R+0.008
Neg logit contributions
 blame-0.014
date-0.013
eks-0.013
ivated-0.013
pieces-0.013
Token in
Token ID287
Feature activation+1.45e-02

Pos logit contributions
 coc+0.008
T+0.007
una+0.007
R+0.006
die+0.006
Neg logit contributions
 blame-0.018
date-0.018
pieces-0.017
ivated-0.017
eks-0.017
Token the
Token ID262
Feature activation-1.33e-02

Pos logit contributions
 blame+0.022
 fight+0.021
 measured+0.020
 argue+0.020
date+0.020
Neg logit contributions
mber-0.022
una-0.022
 coc-0.021
R-0.020
T-0.019
Token She
Token ID1375
Feature activation-2.07e-04

Pos logit contributions
 coc+0.025
una+0.021
mber+0.021
T+0.021
R+0.020
Neg logit contributions
 blame-0.016
date-0.015
 blamed-0.014
ays-0.014
ivated-0.014
Token told
Token ID1297
Feature activation-1.31e-02

Pos logit contributions
 coc+0.000
T+0.000
una+0.000
R+0.000
 Lizard+0.000
Neg logit contributions
 blame-0.000
eks-0.000
date-0.000
ivated-0.000
pieces-0.000
Token him
Token ID683
Feature activation-1.63e-02

Pos logit contributions
 coc+0.020
T+0.018
die+0.017
R+0.017
mber+0.017
Neg logit contributions
 blame-0.021
eks-0.019
 fight-0.019
date-0.019
pieces-0.019
Token how
Token ID703
Feature activation-3.65e-03

Pos logit contributions
 coc+0.023
T+0.021
una+0.020
R+0.020
die+0.019
Neg logit contributions
 blame-0.026
pieces-0.025
eks-0.025
date-0.025
ivated-0.024
Token she
Token ID673
Feature activation+2.23e-02

Pos logit contributions
 coc+0.005
T+0.004
una+0.004
die+0.004
 and+0.004
Neg logit contributions
 blame-0.006
date-0.006
ivated-0.006
pieces-0.006
eks-0.006
Token se
Token ID384
Feature activation+9.62e-02

Pos logit contributions
 blame+0.038
date+0.037
ivated+0.037
pieces+0.036
cks+0.036
Neg logit contributions
 coc-0.030
T-0.027
 and-0.027
una-0.026
 Lizard-0.024
Tokenwed
Token ID19103
Feature activation-1.49e-02

Pos logit contributions
ivated+0.111
 blame+0.108
date+0.100
 fight+0.098
pieces+0.095
Neg logit contributions
una-0.176
 coc-0.170
aste-0.165
T-0.165
 and-0.164
Token,
Token ID11
Feature activation-5.86e-03

Pos logit contributions
 coc+0.024
mber+0.024
die+0.022
una+0.021
T+0.021
Neg logit contributions
pieces-0.020
eks-0.020
cks-0.020
ays-0.019
 measured-0.018
Token glued
Token ID39783
Feature activation-2.41e-02

Pos logit contributions
 coc+0.005
T+0.005
una+0.004
R+0.004
die+0.004
Neg logit contributions
 blame-0.013
pieces-0.012
ivated-0.012
eks-0.012
 measured-0.012
Token,
Token ID11
Feature activation-8.99e-03

Pos logit contributions
mber+0.039
 coc+0.039
una+0.036
die+0.036
T+0.035
Neg logit contributions
pieces-0.032
cks-0.032
eks-0.031
ays-0.030
 measured-0.030
Token and
Token ID290
Feature activation-1.58e-02

Pos logit contributions
 coc+0.009
una+0.008
T+0.008
die+0.008
mber+0.008
Neg logit contributions
 blame-0.018
pieces-0.018
 measured-0.018
eks-0.018
ivated-0.017
Token thread
Token ID4704
Feature activation-4.03e-02

Pos logit contributions
 coc+0.026
T+0.024
una+0.023
 and+0.023
R+0.022
Neg logit contributions
 blame-0.026
date-0.025
ivated-0.025
eks-0.025
pieces-0.025
Token and
Token ID290
Feature activation-5.57e-03

Pos logit contributions
 coc+0.021
T+0.016
una+0.015
R+0.014
mber+0.013
Neg logit contributions
 blame-0.100
date-0.099
ivated-0.098
eks-0.098
pieces-0.098
Token a
Token ID257
Feature activation+1.26e-02

Pos logit contributions
 coc+0.009
T+0.008
 and+0.008
una+0.008
R+0.007
Neg logit contributions
 blame-0.008
date-0.008
ivated-0.008
eks-0.008
cks-0.008
Token needle
Token ID17598
Feature activation-1.69e-02

Pos logit contributions
 blame+0.021
date+0.020
 fight+0.019
pieces+0.019
ivated+0.019
Neg logit contributions
 coc-0.017
una-0.016
T-0.016
R-0.015
die-0.015
Token and
Token ID290
Feature activation+1.97e-03

Pos logit contributions
 and+0.012
 coc+0.010
T+0.008
una+0.007
 since+0.006
Neg logit contributions
 blame-0.041
ivated-0.041
pieces-0.040
date-0.040
eks-0.040
Token se
Token ID384
Feature activation+8.74e-02

Pos logit contributions
 blame+0.003
date+0.003
ivated+0.003
cks+0.003
eks+0.003
Neg logit contributions
 coc-0.003
T-0.003
 and-0.003
una-0.003
R-0.002
Tokenwed
Token ID19103
Feature activation-1.43e-02

Pos logit contributions
ivated+0.103
 blame+0.097
date+0.091
pieces+0.090
 fight+0.088
Neg logit contributions
 coc-0.159
una-0.156
T-0.154
aste-0.148
 and-0.147
Token up
Token ID510
Feature activation+3.51e-02

Pos logit contributions
 coc+0.025
mber+0.023
una+0.022
die+0.021
 Lizard+0.021
Neg logit contributions
 measured-0.019
eks-0.018
cks-0.018
pieces-0.018
date-0.017
Token the
Token ID262
Feature activation-1.92e-03

Pos logit contributions
 blame+0.062
date+0.060
pieces+0.059
eks+0.059
cks+0.058
Neg logit contributions
 coc-0.044
T-0.040
una-0.039
mber-0.037
R-0.037
Token tear
Token ID11626
Feature activation-1.76e-02

Pos logit contributions
 coc+0.003
T+0.002
una+0.002
R+0.002
mber+0.002
Neg logit contributions
 blame-0.003
date-0.003
eks-0.003
ivated-0.003
pieces-0.003
Token in
Token ID287
Feature activation+1.96e-02

Pos logit contributions
 coc+0.024
T+0.021
una+0.021
R+0.020
mber+0.020
Neg logit contributions
 blame-0.029
date-0.029
pieces-0.028
eks-0.028
ivated-0.028
Token our
Token ID674
Feature activation+7.48e-03

Pos logit contributions
 blame+0.006
date+0.006
eks+0.006
pieces+0.006
 fight+0.006
Neg logit contributions
 coc-0.005
T-0.005
una-0.004
die-0.004
R-0.004
Token snow
Token ID6729
Feature activation+2.68e-04

Pos logit contributions
 blame+0.014
eks+0.013
 measured+0.013
 fight+0.013
cks+0.013
Neg logit contributions
 coc-0.008
T-0.008
mber-0.007
 since-0.007
una-0.007
Tokenman
Token ID805
Feature activation+5.28e-02

Pos logit contributions
 blame+0.000
ivated+0.000
date+0.000
pieces+0.000
eks+0.000
Neg logit contributions
 coc-0.000
T-0.000
una-0.000
 and-0.000
die-0.000
Token is
Token ID318
Feature activation+4.91e-03

Pos logit contributions
ivated+0.120
 blame+0.117
pieces+0.115
date+0.114
limits+0.111
Neg logit contributions
 and-0.042
 coc-0.041
una-0.035
T-0.034
 Lizard-0.032
Token smiling
Token ID16755
Feature activation+1.07e-02

Pos logit contributions
 blame+0.009
date+0.009
cks+0.009
eks+0.008
ivated+0.008
Neg logit contributions
 coc-0.006
T-0.005
una-0.005
R-0.005
 Lizard-0.005
Token!"
Token ID2474
Feature activation+4.21e-02

Pos logit contributions
 blame+0.020
date+0.020
eks+0.020
ivated+0.020
pieces+0.020
Neg logit contributions
 coc-0.012
T-0.011
una-0.010
 and-0.010
die-0.009
Token Anna
Token ID11735
Feature activation+3.97e-03

Pos logit contributions
pieces+0.092
date+0.092
eks+0.092
aved+0.090
cking+0.089
Neg logit contributions
 and-0.037
 coc-0.032
T-0.032
R-0.030
 little-0.029
Token said
Token ID531
Feature activation+2.19e-02

Pos logit contributions
 blame+0.006
date+0.005
eks+0.005
pieces+0.005
ivated+0.005
Neg logit contributions
 coc-0.007
T-0.006
 and-0.006
una-0.006
 Lizard-0.006
Token.
Token ID13
Feature activation-8.98e-03

Pos logit contributions
eks+0.052
cks+0.050
roads+0.049
date+0.049
 blame+0.049
Neg logit contributions
 and-0.019
T-0.014
 coc-0.013
 little-0.012
 tight-0.011
Token
Token ID198
Feature activation+2.94e-04

Pos logit contributions
 coc+0.008
T+0.007
 and+0.007
una+0.007
R+0.006
Neg logit contributions
 blame-0.019
eks-0.019
pieces-0.019
date-0.019
cks-0.018
Token
Token ID198
Feature activation-2.35e-03

Pos logit contributions
 blame+0.001
date+0.001
ivated+0.001
eks+0.001
cks+0.001
Neg logit contributions
 coc-0.000
una-0.000
T-0.000
die-0.000
 Lizard-0.000
Token couldn
Token ID3521
Feature activation-2.37e-02

Pos logit contributions
 coc+0.050
T+0.044
 and+0.042
una+0.042
R+0.039
Neg logit contributions
 blame-0.089
date-0.087
eks-0.086
pieces-0.085
ivated-0.085
Token't
Token ID470
Feature activation-8.76e-03

Pos logit contributions
 coc+0.043
T+0.042
 and+0.040
una+0.040
R+0.039
Neg logit contributions
 blame-0.028
date-0.027
pieces-0.026
eks-0.026
ivated-0.026
Token believe
Token ID1975
Feature activation+1.79e-02

Pos logit contributions
 coc+0.014
 and+0.014
T+0.012
 since+0.012
una+0.012
Neg logit contributions
cks-0.013
ays-0.013
eks-0.012
date-0.012
pieces-0.012
Token what
Token ID644
Feature activation+6.96e-03

Pos logit contributions
 blame+0.029
ivated+0.028
cks+0.028
eks+0.028
date+0.028
Neg logit contributions
 coc-0.026
T-0.022
una-0.022
die-0.021
R-0.020
Token he
Token ID339
Feature activation+3.54e-02

Pos logit contributions
 blame+0.011
 measured+0.010
pieces+0.010
date+0.010
 fight+0.010
Neg logit contributions
 coc-0.010
una-0.009
die-0.009
T-0.009
aste-0.009
Token was
Token ID373
Feature activation+3.80e-02

Pos logit contributions
 blame+0.070
date+0.068
ivated+0.068
pieces+0.067
eks+0.067
Neg logit contributions
 coc-0.038
T-0.033
una-0.032
 and-0.031
R-0.029
Token seeing
Token ID4379
Feature activation+1.65e-02

Pos logit contributions
 blame+0.075
date+0.075
eks+0.074
pieces+0.072
ivated+0.071
Neg logit contributions
 and-0.042
 coc-0.039
 since-0.033
una-0.033
T-0.032
Token.
Token ID13
Feature activation+9.95e-03

Pos logit contributions
 blame+0.025
date+0.025
pieces+0.024
cks+0.024
ivated+0.024
Neg logit contributions
 coc-0.025
die-0.022
una-0.021
T-0.021
mber-0.020
Token He
Token ID679
Feature activation+8.26e-03

Pos logit contributions
 blame+0.014
 Asked+0.014
 blamed+0.014
ivated+0.013
 measured+0.013
Neg logit contributions
 coc-0.016
una-0.014
die-0.013
T-0.013
aste-0.012
Token wondered
Token ID14028
Feature activation+3.86e-03

Pos logit contributions
 blame+0.013
date+0.013
eks+0.012
pieces+0.012
ivated+0.012
Neg logit contributions
 coc-0.012
T-0.011
una-0.011
R-0.010
 Lizard-0.010
Token if
Token ID611
Feature activation+7.56e-03

Pos logit contributions
 blame+0.006
pieces+0.006
ivated+0.006
eks+0.006
 measured+0.006
Neg logit contributions
 coc-0.006
una-0.005
T-0.005
R-0.005
die-0.005
Token But
Token ID887
Feature activation-6.58e-03

Pos logit contributions
 coc+0.010
T+0.009
una+0.009
mber+0.008
R+0.008
Neg logit contributions
 blame-0.009
date-0.008
pieces-0.008
ivated-0.008
eks-0.008
Token only
Token ID691
Feature activation-7.12e-04

Pos logit contributions
 coc+0.010
T+0.010
una+0.009
mber+0.009
R+0.009
Neg logit contributions
 blame-0.010
 fight-0.009
eks-0.009
 measured-0.009
date-0.009
Token a
Token ID257
Feature activation+2.03e-02

Pos logit contributions
 coc+0.001
T+0.001
una+0.001
R+0.001
mber+0.001
Neg logit contributions
 blame-0.001
 fight-0.001
pieces-0.001
 measured-0.001
date-0.001
Token little
Token ID1310
Feature activation+1.21e-02

Pos logit contributions
 fight+0.039
 blame+0.037
date+0.036
 measured+0.036
 contest+0.034
Neg logit contributions
 coc-0.021
T-0.021
una-0.019
R-0.018
Ho-0.018
Token bit
Token ID1643
Feature activation+8.46e-03

Pos logit contributions
 blame+0.022
eks+0.022
ivated+0.022
date+0.022
cks+0.022
Neg logit contributions
 coc-0.015
 and-0.013
T-0.012
una-0.012
die-0.012
Token."
Token ID526
Feature activation+3.50e-02

Pos logit contributions
 blame+0.014
date+0.013
eks+0.013
ivated+0.013
pieces+0.013
Neg logit contributions
 coc-0.012
T-0.011
una-0.010
R-0.010
 Lizard-0.010
Token Ben
Token ID3932
Feature activation-8.14e-03

Pos logit contributions
 blame+0.074
date+0.072
ivated+0.072
eks+0.072
pieces+0.071
Neg logit contributions
 coc-0.033
T-0.029
una-0.027
 and-0.026
R-0.025
Token said
Token ID531
Feature activation+1.14e-02

Pos logit contributions
 coc+0.012
 and+0.012
T+0.012
una+0.011
 Lizard+0.011
Neg logit contributions
date-0.012
pieces-0.012
 blame-0.012
ivated-0.012
cks-0.012
Token.
Token ID13
Feature activation-1.81e-02

Pos logit contributions
eks+0.025
cks+0.025
date+0.025
pieces+0.025
 blame+0.025
Neg logit contributions
 and-0.010
 coc-0.008
T-0.008
una-0.007
 Lizard-0.007
Token
Token ID198
Feature activation-5.41e-03

Pos logit contributions
 coc+0.022
T+0.018
una+0.018
die+0.016
aste+0.016
Neg logit contributions
 blame-0.034
date-0.033
ivated-0.031
 fight-0.031
 measured-0.031
Token
Token ID198
Feature activation-7.35e-03

Pos logit contributions
 coc+0.007
una+0.006
 Lizard+0.006
mber+0.006
T+0.006
Neg logit contributions
 blame-0.009
cks-0.009
date-0.009
eks-0.009
pieces-0.008
Token Lily
Token ID20037
Feature activation-7.31e-04

Pos logit contributions
pieces+0.007
date+0.007
ivated+0.007
ays+0.007
eks+0.007
Neg logit contributions
 coc-0.004
 and-0.004
T-0.004
una-0.003
 little-0.003
Token.
Token ID13
Feature activation-5.31e-03

Pos logit contributions
 coc+0.001
T+0.001
una+0.001
R+0.001
 and+0.001
Neg logit contributions
 blame-0.001
date-0.001
ivated-0.001
pieces-0.001
eks-0.001
Token He
Token ID679
Feature activation+8.84e-03

Pos logit contributions
 coc+0.007
una+0.006
T+0.006
R+0.005
aste+0.005
Neg logit contributions
 blame-0.010
date-0.009
ivated-0.009
 blamed-0.009
eks-0.009
Token realized
Token ID6939
Feature activation-1.19e-02

Pos logit contributions
date+0.015
ivated+0.015
pieces+0.015
cks+0.014
 blame+0.014
Neg logit contributions
 coc-0.012
 and-0.012
T-0.011
una-0.011
die-0.010
Token his
Token ID465
Feature activation+1.22e-02

Pos logit contributions
 coc+0.019
T+0.017
una+0.016
die+0.016
R+0.016
Neg logit contributions
 blame-0.018
date-0.017
pieces-0.017
eks-0.017
ivated-0.016
Token car
Token ID1097
Feature activation+3.84e-02

Pos logit contributions
 blame+0.020
date+0.019
ivated+0.019
eks+0.019
pieces+0.019
Neg logit contributions
 coc-0.017
T-0.016
una-0.015
 and-0.014
R-0.014
Token was
Token ID373
Feature activation+5.06e-04

Pos logit contributions
ivated+0.091
pieces+0.091
 blame+0.089
date+0.089
cks+0.088
Neg logit contributions
 and-0.026
 coc-0.022
T-0.019
una-0.016
 since-0.016
Token broken
Token ID5445
Feature activation-2.30e-02

Pos logit contributions
 blame+0.001
date+0.001
eks+0.001
ivated+0.001
pieces+0.001
Neg logit contributions
 coc-0.001
T-0.001
una-0.001
R-0.001
mber-0.001
Token and
Token ID290
Feature activation-4.85e-03

Pos logit contributions
mber+0.038
 coc+0.038
R+0.034
una+0.034
T+0.032
Neg logit contributions
 fight-0.033
 rebuild-0.030
 blame-0.030
 neither-0.030
cks-0.030
Token old
Token ID1468
Feature activation+1.17e-02

Pos logit contributions
 coc+0.006
 and+0.006
T+0.005
 since+0.005
una+0.005
Neg logit contributions
date-0.009
ivated-0.009
pieces-0.008
 blame-0.008
eks-0.008
Token too
Token ID1165
Feature activation-7.99e-03

Pos logit contributions
 blame+0.021
date+0.021
ivated+0.020
eks+0.020
pieces+0.020
Neg logit contributions
 coc-0.014
T-0.013
una-0.012
 and-0.012
R-0.011
Token a
Token ID257
Feature activation-7.78e-03

Pos logit contributions
 blame+0.001
date+0.001
ivated+0.001
eks+0.001
pieces+0.001
Neg logit contributions
 coc-0.001
T-0.001
 and-0.001
una-0.001
 Lizard-0.001
Token lab
Token ID2248
Feature activation-9.82e-03

Pos logit contributions
 coc+0.010
T+0.009
una+0.009
 and+0.008
 Lizard+0.008
Neg logit contributions
 blame-0.013
date-0.013
pieces-0.013
ivated-0.013
 fight-0.013
Token where
Token ID810
Feature activation+1.25e-02

Pos logit contributions
 and+0.010
 coc+0.008
 since+0.008
T+0.007
una+0.007
Neg logit contributions
eks-0.021
cks-0.021
 blame-0.021
pieces-0.021
cking-0.020
Token Lily
Token ID20037
Feature activation+1.29e-02

Pos logit contributions
 blame+0.020
date+0.019
ivated+0.019
eks+0.019
cks+0.019
Neg logit contributions
 coc-0.019
T-0.017
una-0.016
 and-0.016
R-0.015
Token's
Token ID338
Feature activation+1.86e-02

Pos logit contributions
 blame+0.020
ivated+0.019
date+0.019
eks+0.019
pieces+0.019
Neg logit contributions
 coc-0.019
T-0.018
 and-0.017
una-0.017
die-0.016
Token mom
Token ID1995
Feature activation+4.03e-02

Pos logit contributions
 blame+0.033
ivated+0.033
eks+0.033
pieces+0.032
cks+0.032
Neg logit contributions
 coc-0.024
 and-0.021
T-0.020
una-0.020
 Lizard-0.019
Token worked
Token ID3111
Feature activation+8.39e-04

Pos logit contributions
 blame+0.066
ivated+0.066
date+0.064
cks+0.064
pieces+0.064
Neg logit contributions
 and-0.054
 coc-0.053
T-0.051
una-0.048
 since-0.047
Token.
Token ID13
Feature activation+3.11e-03

Pos logit contributions
 blame+0.001
date+0.001
eks+0.001
ivated+0.001
pieces+0.001
Neg logit contributions
 coc-0.001
T-0.001
una-0.001
R-0.001
 Lizard-0.001
Token It
Token ID632
Feature activation+4.05e-03

Pos logit contributions
 blame+0.006
date+0.005
eks+0.005
ivated+0.005
pieces+0.005
Neg logit contributions
 coc-0.004
T-0.003
una-0.003
 and-0.003
R-0.003
Token was
Token ID373
Feature activation-5.84e-03

Pos logit contributions
 blame+0.010
pieces+0.010
ivated+0.009
date+0.009
cks+0.009
Neg logit contributions
 coc-0.003
 and-0.002
T-0.002
una-0.002
die-0.002
Token a
Token ID257
Feature activation-1.24e-03

Pos logit contributions
 coc+0.009
T+0.009
Ho+0.008
mber+0.008
R+0.008
Neg logit contributions
 fight-0.008
 measured-0.008
date-0.008
 drawn-0.008
 blame-0.007
Token Sandy
Token ID17581
Feature activation-3.08e-02

Pos logit contributions
 coc+0.007
una+0.007
T+0.006
die+0.006
aste+0.006
Neg logit contributions
date-0.009
 blame-0.009
ivated-0.009
pieces-0.009
 fight-0.008
Token noticed
Token ID6810
Feature activation+1.70e-02

Pos logit contributions
 coc+0.051
T+0.046
una+0.045
R+0.043
 Lizard+0.043
Neg logit contributions
 blame-0.042
date-0.042
eks-0.041
ivated-0.040
pieces-0.040
Token something
Token ID1223
Feature activation-8.29e-03

Pos logit contributions
 blame+0.029
date+0.028
eks+0.028
ivated+0.028
pieces+0.028
Neg logit contributions
 coc-0.022
T-0.019
una-0.019
die-0.018
R-0.018
Token delicious
Token ID12625
Feature activation+4.08e-03

Pos logit contributions
 coc+0.010
T+0.008
 and+0.008
una+0.008
 Lizard+0.007
Neg logit contributions
 blame-0.016
date-0.015
eks-0.015
ivated-0.015
pieces-0.015
Token in
Token ID287
Feature activation+1.16e-02

Pos logit contributions
date+0.007
 fight+0.007
 measured+0.007
cks+0.006
 blame+0.006
Neg logit contributions
 coc-0.005
T-0.005
Ho-0.005
R-0.005
die-0.005
Token the
Token ID262
Feature activation-3.24e-03

Pos logit contributions
 blame+0.017
 fight+0.017
cks+0.017
date+0.016
eks+0.016
Neg logit contributions
 coc-0.016
T-0.016
una-0.016
mber-0.015
Ho-0.015
Token grass
Token ID8701
Feature activation-4.15e-02

Pos logit contributions
T+0.005
una+0.005
 coc+0.004
 Lizard+0.004
R+0.004
Neg logit contributions
date-0.005
roads-0.004
 fight-0.004
pieces-0.004
 measured-0.004
Token -
Token ID532
Feature activation-1.37e-02

Pos logit contributions
 coc+0.052
T+0.047
una+0.044
die+0.042
R+0.042
Neg logit contributions
date-0.073
 blame-0.072
ivated-0.071
pieces-0.070
eks-0.070
Token butter
Token ID9215
Feature activation-3.34e-02

Pos logit contributions
 coc+0.019
T+0.018
una+0.018
Ho+0.017
oby+0.017
Neg logit contributions
pieces-0.020
 measured-0.020
 blame-0.020
ivated-0.020
 fight-0.020
Token.
Token ID13
Feature activation-2.17e-03

Pos logit contributions
 coc+0.043
T+0.038
una+0.036
R+0.035
 Lizard+0.035
Neg logit contributions
date-0.057
 blame-0.056
cks-0.056
eks-0.056
ivated-0.056
Token She
Token ID1375
Feature activation+4.65e-04

Pos logit contributions
 coc+0.003
una+0.003
T+0.003
die+0.003
Ho+0.003
Neg logit contributions
 blame-0.003
date-0.003
ivated-0.003
 Asked-0.003
eks-0.003
TokenThat
Token ID2504
Feature activation-5.63e-03

Pos logit contributions
 blame+0.017
date+0.016
ivated+0.016
pieces+0.016
eks+0.016
Neg logit contributions
 coc-0.012
T-0.010
una-0.010
 and-0.010
die-0.010
Token is
Token ID318
Feature activation+1.20e-02

Pos logit contributions
 coc+0.008
T+0.007
Ho+0.007
una+0.007
R+0.007
Neg logit contributions
 blame-0.009
date-0.009
 fight-0.009
pieces-0.008
ivated-0.008
Token a
Token ID257
Feature activation+1.01e-02

Pos logit contributions
 blame+0.019
 fight+0.019
cks+0.018
eks+0.018
 measured+0.018
Neg logit contributions
 coc-0.017
T-0.016
mber-0.015
una-0.015
R-0.015
Token mug
Token ID25152
Feature activation+2.23e-02

Pos logit contributions
 fight+0.016
date+0.016
 blame+0.016
ivated+0.014
pieces+0.014
Neg logit contributions
 coc-0.013
T-0.013
una-0.013
Ho-0.013
R-0.012
Token.
Token ID13
Feature activation-3.75e-03

Pos logit contributions
 blame+0.044
eks+0.043
date+0.043
ivated+0.043
pieces+0.042
Neg logit contributions
 coc-0.023
T-0.020
una-0.020
 and-0.019
 Lizard-0.018
Token It
Token ID632
Feature activation+7.60e-03

Pos logit contributions
 coc+0.006
T+0.005
una+0.005
die+0.005
R+0.005
Neg logit contributions
 blame-0.006
date-0.006
pieces-0.006
ivated-0.006
eks-0.005
Token is
Token ID318
Feature activation-1.01e-05

Pos logit contributions
date+0.016
ivated+0.015
 blame+0.015
pieces+0.015
eks+0.015
Neg logit contributions
 coc-0.007
 and-0.007
T-0.006
una-0.006
 Lizard-0.006
Token disgusting
Token ID23374
Feature activation-2.70e-03

Pos logit contributions
T+0.000
 coc+0.000
mber+0.000
R+0.000
una+0.000
Neg logit contributions
 blame-0.000
cks-0.000
 fight-0.000
 measured-0.000
pieces-0.000
Token.
Token ID13
Feature activation+2.75e-03

Pos logit contributions
 coc+0.004
T+0.004
Ho+0.004
die+0.004
R+0.004
Neg logit contributions
 blame-0.004
date-0.004
 fight-0.003
ays-0.003
pieces-0.003
Token Someone
Token ID17877
Feature activation+1.65e-02

Pos logit contributions
 blame+0.004
date+0.004
pieces+0.004
ivated+0.004
cks+0.004
Neg logit contributions
 coc-0.004
T-0.004
una-0.004
die-0.004
mber-0.003
Token left
Token ID1364
Feature activation+5.60e-03

Pos logit contributions
date+0.031
ivated+0.030
pieces+0.030
roads+0.029
 blame+0.029
Neg logit contributions
 and-0.020
 coc-0.018
una-0.017
 little-0.016
T-0.016
Token to
Token ID284
Feature activation+1.02e-02

Pos logit contributions
 blame+0.043
eks+0.042
ivated+0.041
cks+0.041
date+0.040
Neg logit contributions
 coc-0.013
 and-0.013
T-0.010
una-0.010
 since-0.009
Token Mia
Token ID32189
Feature activation+3.70e-04

Pos logit contributions
ivated+0.017
cking+0.017
limits+0.017
cks+0.017
 blame+0.016
Neg logit contributions
 coc-0.014
 and-0.013
T-0.013
una-0.012
 little-0.011
Token's
Token ID338
Feature activation+2.46e-03

Pos logit contributions
 blame+0.001
eks+0.001
ivated+0.001
cks+0.001
cking+0.001
Neg logit contributions
 and-0.000
 coc-0.000
T-0.000
 since-0.000
una-0.000
Token house
Token ID2156
Feature activation+2.11e-02

Pos logit contributions
ivated+0.005
eks+0.005
cks+0.005
cking+0.004
 blame+0.004
Neg logit contributions
 coc-0.003
 and-0.003
una-0.002
 Lizard-0.002
 little-0.002
Token.
Token ID13
Feature activation-7.23e-03

Pos logit contributions
 blame+0.041
eks+0.040
ivated+0.040
date+0.039
pieces+0.039
Neg logit contributions
 coc-0.023
 and-0.022
T-0.020
una-0.020
 since-0.018
Token It
Token ID632
Feature activation+7.91e-03

Pos logit contributions
 coc+0.010
T+0.009
una+0.009
R+0.008
die+0.008
Neg logit contributions
 blame-0.012
date-0.011
ivated-0.011
eks-0.011
pieces-0.011
Token was
Token ID373
Feature activation-1.05e-02

Pos logit contributions
ivated+0.019
pieces+0.019
 blame+0.019
date+0.019
eks+0.019
Neg logit contributions
 coc-0.005
 and-0.005
T-0.004
 Lizard-0.004
una-0.003
Token pink
Token ID11398
Feature activation+2.03e-03

Pos logit contributions
 coc+0.015
T+0.014
mber+0.014
R+0.014
una+0.013
Neg logit contributions
 blame-0.016
 fight-0.015
cks-0.015
 measured-0.015
 drawn-0.015
Token and
Token ID290
Feature activation-3.87e-03

Pos logit contributions
date+0.004
 fight+0.004
ays+0.004
 measured+0.004
ci+0.004
Neg logit contributions
T-0.002
R-0.002
 coc-0.002
Ho-0.002
mber-0.002
Token shiny
Token ID22441
Feature activation+9.76e-03

Pos logit contributions
 coc+0.005
una+0.005
T+0.005
aste+0.005
R+0.005
Neg logit contributions
 blame-0.006
 fight-0.006
 measured-0.006
cks-0.006
pieces-0.006
Token and
Token ID290
Feature activation+8.95e-03

Pos logit contributions
 blame+0.015
 fight+0.015
 measured+0.015
date+0.015
cks+0.015
Neg logit contributions
 coc-0.013
T-0.013
R-0.012
mber-0.012
una-0.012
Token to
Token ID284
Feature activation-3.51e-03

Pos logit contributions
eks+0.033
 blame+0.033
cks+0.031
 fight+0.030
 argue+0.029
Neg logit contributions
 coc-0.040
T-0.037
mber-0.036
una-0.036
 Lizard-0.034
Token the
Token ID262
Feature activation-2.75e-02

Pos logit contributions
 coc+0.005
T+0.005
 and+0.004
una+0.004
R+0.004
Neg logit contributions
pieces-0.006
 blame-0.006
date-0.006
ivated-0.006
cks-0.005
Token fountain
Token ID29420
Feature activation-1.52e-02

Pos logit contributions
 coc+0.049
T+0.044
 and+0.044
una+0.043
R+0.042
Neg logit contributions
 blame-0.035
date-0.034
pieces-0.034
ivated-0.034
eks-0.033
Token.
Token ID13
Feature activation-1.78e-02

Pos logit contributions
 and+0.015
 coc+0.013
una+0.011
T+0.011
 since+0.010
Neg logit contributions
 blame-0.033
eks-0.032
pieces-0.032
date-0.031
ivated-0.031
Token She
Token ID1375
Feature activation-1.43e-03

Pos logit contributions
 coc+0.030
una+0.027
T+0.027
die+0.026
mber+0.026
Neg logit contributions
 blame-0.027
limits-0.024
 blamed-0.023
ivated-0.023
 fight-0.023
Token saw
Token ID2497
Feature activation+2.25e-02

Pos logit contributions
 coc+0.003
una+0.003
T+0.003
 Lizard+0.002
R+0.002
Neg logit contributions
 blame-0.002
eks-0.001
 measured-0.001
ivated-0.001
date-0.001
Token the
Token ID262
Feature activation-1.12e-02

Pos logit contributions
 blame+0.041
ivated+0.039
date+0.039
eks+0.039
pieces+0.039
Neg logit contributions
 coc-0.028
T-0.025
una-0.024
die-0.023
R-0.023
Token bee
Token ID20697
Feature activation-3.85e-03

Pos logit contributions
 coc+0.014
 and+0.012
una+0.012
 lizard+0.011
T+0.011
Neg logit contributions
date-0.021
 blame-0.021
pieces-0.021
ivated-0.020
eks-0.020
Token and
Token ID290
Feature activation-1.28e-02

Pos logit contributions
 coc+0.003
 and+0.002
una+0.002
T+0.002
R+0.002
Neg logit contributions
 blame-0.009
date-0.009
ivated-0.009
pieces-0.009
eks-0.009
Token smiled
Token ID13541
Feature activation+8.47e-03

Pos logit contributions
 coc+0.019
T+0.018
una+0.017
oby+0.016
aste+0.016
Neg logit contributions
 blame-0.020
 measured-0.019
eks-0.019
cks-0.019
 blamed-0.018
Token.
Token ID13
Feature activation-1.50e-02

Pos logit contributions
 blame+0.011
ivated+0.010
cks+0.010
cking+0.010
 fight+0.009
Neg logit contributions
 coc-0.015
mber-0.014
T-0.013
una-0.013
R-0.013
Token wash
Token ID13502
Feature activation+2.82e-03

Pos logit contributions
pieces+0.021
eks+0.020
ivated+0.020
cks+0.020
date+0.020
Neg logit contributions
 coc-0.017
 and-0.016
T-0.016
una-0.015
R-0.014
Token their
Token ID511
Feature activation+1.68e-02

Pos logit contributions
cks+0.004
 blame+0.004
 fight+0.003
 neither+0.003
ays+0.003
Neg logit contributions
 coc-0.005
mber-0.005
T-0.005
die-0.004
Ho-0.004
Token hair
Token ID4190
Feature activation+5.16e-03

Pos logit contributions
 blame+0.026
date+0.026
ivated+0.025
eks+0.025
pieces+0.025
Neg logit contributions
 coc-0.025
T-0.022
una-0.022
 and-0.021
R-0.021
Token.
Token ID13
Feature activation-1.31e-02

Pos logit contributions
 blame+0.009
date+0.008
ivated+0.008
pieces+0.008
 fight+0.008
Neg logit contributions
 coc-0.007
T-0.006
una-0.006
die-0.006
R-0.006
Token They
Token ID1119
Feature activation+8.80e-03

Pos logit contributions
 coc+0.021
T+0.019
una+0.019
die+0.018
 Lizard+0.018
Neg logit contributions
 blame-0.019
date-0.018
ivated-0.018
 fight-0.017
 blamed-0.017
Token say
Token ID910
Feature activation-6.50e-03

Pos logit contributions
date+0.013
pieces+0.013
cks+0.012
ivated+0.012
 blame+0.012
Neg logit contributions
 coc-0.014
 and-0.013
T-0.012
una-0.012
 Lizard-0.011
Token,
Token ID11
Feature activation-2.05e-02

Pos logit contributions
 coc+0.007
die+0.006
T+0.006
una+0.006
Ho+0.005
Neg logit contributions
 blame-0.013
date-0.012
 fight-0.012
ivated-0.012
pieces-0.012
Token "
Token ID366
Feature activation+9.15e-03

Pos logit contributions
 coc+0.025
una+0.021
die+0.020
T+0.020
 Lizard+0.019
Neg logit contributions
 blame-0.039
ivated-0.037
 fight-0.036
date-0.036
 measured-0.035
TokenNo
Token ID2949
Feature activation+1.58e-02

Pos logit contributions
 blame+0.016
eks+0.016
pieces+0.016
date+0.016
ivated+0.015
Neg logit contributions
 coc-0.012
T-0.011
una-0.010
R-0.010
 and-0.010
Token,
Token ID11
Feature activation+2.76e-03

Pos logit contributions
 fight+0.028
 blame+0.028
 argue+0.027
 neither+0.026
 punish+0.025
Neg logit contributions
 coc-0.019
T-0.019
die-0.018
Ho-0.018
una-0.017
Token no
Token ID645
Feature activation+2.37e-02

Pos logit contributions
 blame+0.004
date+0.004
pieces+0.004
 fight+0.004
eks+0.004
Neg logit contributions
 coc-0.004
T-0.004
una-0.004
die-0.003
R-0.003
Token was
Token ID373
Feature activation-1.60e-02

Pos logit contributions
 coc+0.033
T+0.029
una+0.028
R+0.026
die+0.026
Neg logit contributions
 blame-0.060
date-0.059
eks-0.058
ivated-0.058
pieces-0.058
Token a
Token ID257
Feature activation-8.12e-03

Pos logit contributions
 coc+0.021
T+0.019
una+0.018
R+0.017
mber+0.017
Neg logit contributions
 blame-0.027
date-0.027
pieces-0.026
 measured-0.026
eks-0.026
Token bright
Token ID6016
Feature activation-1.61e-02

Pos logit contributions
una+0.011
T+0.011
R+0.010
 coc+0.010
aste+0.010
Neg logit contributions
 fight-0.013
 blame-0.013
date-0.013
pieces-0.012
 measured-0.012
Token,
Token ID11
Feature activation-7.87e-03

Pos logit contributions
T+0.022
una+0.021
R+0.021
 coc+0.021
Ho+0.019
Neg logit contributions
date-0.026
 blame-0.026
 fight-0.026
 measured-0.025
pieces-0.025
Token shining
Token ID22751
Feature activation-1.54e-02

Pos logit contributions
una+0.011
T+0.011
 coc+0.011
R+0.011
Ho+0.010
Neg logit contributions
 blame-0.012
pieces-0.011
 measured-0.011
 fight-0.011
date-0.011
Token necklace
Token ID38987
Feature activation-2.73e-02

Pos logit contributions
T+0.024
una+0.022
R+0.022
 coc+0.022
die+0.021
Neg logit contributions
date-0.023
 fight-0.021
 measured-0.021
 blame-0.021
pieces-0.020
Token.
Token ID13
Feature activation-2.06e-04

Pos logit contributions
 coc+0.030
T+0.026
una+0.025
R+0.023
die+0.023
Neg logit contributions
 blame-0.052
date-0.052
pieces-0.051
ivated-0.051
eks-0.051
Token
Token ID198
Feature activation-2.91e-03

Pos logit contributions
 coc+0.000
una+0.000
T+0.000
R+0.000
aste+0.000
Neg logit contributions
 blame-0.000
date-0.000
pieces-0.000
ivated-0.000
eks-0.000
Token
Token ID198
Feature activation-3.67e-03

Pos logit contributions
 coc+0.004
una+0.003
aste+0.003
 since+0.003
die+0.003
Neg logit contributions
date-0.005
 blame-0.005
pieces-0.005
ivated-0.005
 Asked-0.005
TokenThe
Token ID464
Feature activation-1.01e-02

Pos logit contributions
 coc+0.005
una+0.004
 and+0.004
 Lizard+0.004
die+0.004
Neg logit contributions
date-0.006
ivated-0.006
pieces-0.006
 blame-0.006
roads-0.006
Token girl
Token ID2576
Feature activation-3.95e-03

Pos logit contributions
T+0.009
 coc+0.009
una+0.009
R+0.009
aste+0.008
Neg logit contributions
 blame-0.021
date-0.020
pieces-0.020
 fight-0.019
eks-0.019
Token toys
Token ID14958
Feature activation-1.93e-02

Pos logit contributions
ivated+0.014
date+0.013
 blame+0.013
pieces+0.013
eks+0.013
Neg logit contributions
 coc-0.007
 and-0.006
T-0.006
una-0.005
 Lizard-0.005
Token.
Token ID13
Feature activation-1.10e-02

Pos logit contributions
 coc+0.029
mber+0.027
T+0.027
R+0.026
die+0.026
Neg logit contributions
 fight-0.029
 blame-0.028
date-0.028
 rebuild-0.027
pieces-0.027
Token You
Token ID921
Feature activation-1.65e-02

Pos logit contributions
 coc+0.020
mber+0.017
una+0.017
die+0.016
T+0.016
Neg logit contributions
 blame-0.016
cks-0.014
 blamed-0.014
date-0.013
limits-0.013
Token have
Token ID423
Feature activation-6.02e-03

Pos logit contributions
 coc+0.021
 and+0.020
T+0.019
una+0.018
 since+0.017
Neg logit contributions
date-0.028
pieces-0.028
ivated-0.028
cks-0.027
 blame-0.027
Token to
Token ID284
Feature activation+2.03e-02

Pos logit contributions
 coc+0.008
T+0.007
una+0.007
R+0.006
 Lizard+0.006
Neg logit contributions
 blame-0.011
date-0.010
eks-0.010
ivated-0.010
pieces-0.010
Token share
Token ID2648
Feature activation-3.01e-02

Pos logit contributions
 blame+0.027
 fight+0.025
date+0.025
eks+0.024
 argue+0.024
Neg logit contributions
 coc-0.035
T-0.033
una-0.032
R-0.031
mber-0.031
Token and
Token ID290
Feature activation+4.00e-03

Pos logit contributions
mber+0.037
 coc+0.034
oby+0.033
una+0.032
T+0.032
Neg logit contributions
 fight-0.053
cks-0.053
 blame-0.053
 rebuild-0.051
 sell-0.051
Token respect
Token ID2461
Feature activation-1.30e-03

Pos logit contributions
 blame+0.006
date+0.006
ivated+0.006
pieces+0.006
eks+0.006
Neg logit contributions
 coc-0.006
T-0.005
una-0.005
 and-0.005
die-0.005
Token them
Token ID606
Feature activation+2.98e-02

Pos logit contributions
mber+0.002
 coc+0.002
T+0.002
Ho+0.002
 Lizard+0.002
Neg logit contributions
cks-0.002
 blame-0.002
 fight-0.002
 rebuild-0.002
 sell-0.002
Token.
Token ID13
Feature activation-7.46e-03

Pos logit contributions
 blame+0.045
 fight+0.043
cks+0.043
pieces+0.042
ivated+0.042
Neg logit contributions
 coc-0.045
T-0.042
mber-0.041
una-0.040
R-0.040
Token Now
Token ID2735
Feature activation-1.68e-02

Pos logit contributions
 coc+0.014
mber+0.012
una+0.011
die+0.011
 lizard+0.011
Neg logit contributions
 blame-0.010
 blamed-0.009
cks-0.009
date-0.009
ays-0.009
Token course
Token ID1781
Feature activation+2.20e-02

Pos logit contributions
 blame+0.018
date+0.017
eks+0.017
ivated+0.017
pieces+0.017
Neg logit contributions
 coc-0.024
T-0.022
una-0.022
 and-0.021
R-0.021
Token you
Token ID345
Feature activation-1.77e-02

Pos logit contributions
 blame+0.041
date+0.040
pieces+0.040
ivated+0.039
eks+0.039
Neg logit contributions
 coc-0.025
T-0.023
una-0.022
die-0.020
R-0.020
Token can
Token ID460
Feature activation-3.34e-03

Pos logit contributions
 coc+0.023
T+0.021
una+0.020
R+0.020
die+0.019
Neg logit contributions
 blame-0.032
 fight-0.030
eks-0.030
date-0.030
pieces-0.029
Token!
Token ID0
Feature activation+5.74e-03

Pos logit contributions
 coc+0.003
 and+0.003
T+0.003
una+0.003
aste+0.003
Neg logit contributions
 blame-0.007
eks-0.007
ivated-0.007
cks-0.007
pieces-0.007
Token Just
Token ID2329
Feature activation-6.61e-03

Pos logit contributions
 blame+0.009
date+0.009
ivated+0.008
pieces+0.008
eks+0.008
Neg logit contributions
 coc-0.009
T-0.008
una-0.008
 and-0.007
R-0.007
Token use
Token ID779
Feature activation-2.65e-02

Pos logit contributions
 coc+0.011
T+0.010
una+0.010
 and+0.009
R+0.009
Neg logit contributions
 blame-0.009
date-0.009
ivated-0.009
eks-0.009
pieces-0.009
Token your
Token ID534
Feature activation+1.67e-02

Pos logit contributions
 coc+0.032
T+0.030
una+0.028
mber+0.027
die+0.027
Neg logit contributions
 blame-0.047
pieces-0.046
eks-0.045
 fight-0.045
date-0.045
Token hands
Token ID2832
Feature activation+1.15e-02

Pos logit contributions
ivated+0.027
 blame+0.027
date+0.027
pieces+0.026
eks+0.026
Neg logit contributions
 coc-0.024
 and-0.021
una-0.020
T-0.020
die-0.019
Token and
Token ID290
Feature activation-2.71e-03

Pos logit contributions
date+0.019
pieces+0.019
 measured+0.019
ivated+0.019
 blame+0.019
Neg logit contributions
 coc-0.015
T-0.013
una-0.013
R-0.012
 Lizard-0.012
Token make
Token ID787
Feature activation+1.32e-02

Pos logit contributions
 coc+0.004
T+0.004
una+0.004
 and+0.003
R+0.003
Neg logit contributions
 blame-0.004
date-0.004
eks-0.004
pieces-0.004
ivated-0.004
Token a
Token ID257
Feature activation+1.66e-02

Pos logit contributions
 blame+0.024
pieces+0.024
date+0.023
eks+0.023
ivated+0.023
Neg logit contributions
 coc-0.016
T-0.014
una-0.013
 and-0.013
R-0.013
Token little
Token ID1310
Feature activation-5.30e-04

Pos logit contributions
 coc+0.008
T+0.007
una+0.007
R+0.006
mber+0.006
Neg logit contributions
 blame-0.020
date-0.019
 fight-0.018
 blamed-0.017
pieces-0.017
Token girl
Token ID2576
Feature activation-5.85e-03

Pos logit contributions
 coc+0.001
una+0.001
T+0.001
R+0.000
mber+0.000
Neg logit contributions
 blame-0.001
date-0.001
 fight-0.001
eks-0.001
 measured-0.001
Token felt
Token ID2936
Feature activation-8.51e-03

Pos logit contributions
 coc+0.008
T+0.007
una+0.007
 and+0.007
R+0.006
Neg logit contributions
 blame-0.010
date-0.010
ivated-0.010
eks-0.009
pieces-0.009
Token so
Token ID523
Feature activation+9.04e-03

Pos logit contributions
 coc+0.011
una+0.010
T+0.010
die+0.009
R+0.009
Neg logit contributions
 blame-0.015
ivated-0.015
date-0.014
eks-0.014
pieces-0.014
Token com
Token ID401
Feature activation+1.08e-02

Pos logit contributions
 blame+0.017
date+0.016
ivated+0.016
eks+0.016
pieces+0.016
Neg logit contributions
 coc-0.011
T-0.009
una-0.009
R-0.008
 and-0.008
Tokenfy
Token ID24928
Feature activation-2.84e-02

Pos logit contributions
 blame+0.019
date+0.018
eks+0.018
pieces+0.018
ivated+0.018
Neg logit contributions
 coc-0.014
T-0.013
una-0.012
 and-0.012
R-0.011
Token and
Token ID290
Feature activation-8.25e-03

Pos logit contributions
 coc+0.034
die+0.030
T+0.029
una+0.029
R+0.027
Neg logit contributions
ivated-0.051
 fight-0.051
date-0.050
 measured-0.050
 blame-0.049
Token happy
Token ID3772
Feature activation+2.06e-02

Pos logit contributions
 coc+0.011
una+0.010
T+0.010
R+0.009
aste+0.009
Neg logit contributions
 blame-0.014
 measured-0.013
eks-0.013
ivated-0.013
date-0.013
Token.
Token ID13
Feature activation-9.33e-04

Pos logit contributions
 blame+0.035
date+0.034
ivated+0.034
eks+0.033
pieces+0.033
Neg logit contributions
 coc-0.027
T-0.025
una-0.024
R-0.023
die-0.023
Token The
Token ID383
Feature activation-7.45e-03

Pos logit contributions
 coc+0.002
una+0.001
T+0.001
R+0.001
aste+0.001
Neg logit contributions
 blame-0.001
ivated-0.001
date-0.001
 Asked-0.001
pieces-0.001
Token t
Token ID256
Feature activation-1.14e-02

Pos logit contributions
 coc+0.008
una+0.007
T+0.007
R+0.007
mber+0.006
Neg logit contributions
 blame-0.016
 fight-0.015
date-0.014
 blamed-0.013
eks-0.013
Token some
Token ID617
Feature activation-2.25e-02

Pos logit contributions
 blame+0.009
date+0.009
pieces+0.009
eks+0.009
ivated+0.009
Neg logit contributions
 coc-0.010
 and-0.010
T-0.009
una-0.009
die-0.009
Token more
Token ID517
Feature activation-2.44e-02

Pos logit contributions
 coc+0.027
T+0.024
una+0.023
 and+0.022
R+0.021
Neg logit contributions
 blame-0.042
date-0.041
ivated-0.040
eks-0.040
pieces-0.040
Token and
Token ID290
Feature activation+1.64e-02

Pos logit contributions
 coc+0.026
T+0.024
una+0.022
R+0.021
die+0.021
Neg logit contributions
 blame-0.048
cks-0.046
ivated-0.046
eks-0.046
pieces-0.046
Token saw
Token ID2497
Feature activation+2.39e-02

Pos logit contributions
 blame+0.027
eks+0.025
ivated+0.025
date+0.025
pieces+0.025
Neg logit contributions
 coc-0.023
T-0.021
una-0.021
R-0.020
 Lizard-0.019
Token their
Token ID511
Feature activation+1.00e-02

Pos logit contributions
 blame+0.043
date+0.042
ivated+0.041
eks+0.041
pieces+0.041
Neg logit contributions
 coc-0.030
T-0.027
una-0.026
 and-0.026
 Lizard-0.024
Token own
Token ID898
Feature activation-2.63e-02

Pos logit contributions
 blame+0.017
date+0.016
eks+0.016
ivated+0.016
pieces+0.016
Neg logit contributions
 coc-0.014
T-0.013
una-0.012
R-0.012
die-0.012
Token happy
Token ID3772
Feature activation+7.67e-03

Pos logit contributions
 coc+0.034
T+0.032
una+0.030
R+0.028
die+0.028
Neg logit contributions
 blame-0.046
date-0.044
eks-0.043
pieces-0.043
ivated-0.043
Token faces
Token ID6698
Feature activation-8.30e-03

Pos logit contributions
 blame+0.011
 fight+0.010
date+0.010
pieces+0.010
eks+0.009
Neg logit contributions
 coc-0.012
T-0.012
una-0.011
R-0.011
die-0.011
Token in
Token ID287
Feature activation+1.01e-02

Pos logit contributions
 coc+0.009
 and+0.008
T+0.008
una+0.007
 Lizard+0.007
Neg logit contributions
 blame-0.017
eks-0.016
date-0.016
ivated-0.016
pieces-0.016
Token the
Token ID262
Feature activation-7.52e-03

Pos logit contributions
 blame+0.015
 fight+0.014
cks+0.014
date+0.014
pieces+0.013
Neg logit contributions
 coc-0.016
mber-0.015
una-0.015
T-0.015
R-0.014
Token p
Token ID279
Feature activation+1.24e-02

Pos logit contributions
 coc+0.010
T+0.009
una+0.009
die+0.008
R+0.008
Neg logit contributions
 blame-0.013
date-0.013
eks-0.012
ivated-0.012
pieces-0.012