TOP ACTIVATIONS
MAX = 0.073

It was her favorite -- ch ocolate chip ! She was
shell was called a con ch shell . Tom
guess it 's a horn et 's nest ! We shouldn
, Lily , a che et ah !" Sam said ,
led and her dress gl imm ered in the sun .
t you start with the roller co aster ? â
She lly a f auc et . It was a shiny
She put on her sw imm ies and went off to
, this is the best summer ever !" Anna says .
to play and explore the freshly organized lawn . <|endoftext|> Once
popular shell - the con ch shell ! <|endoftext|> Once upon
picked and the lucky cl over . Her mom smiled and
the road and it gl imm ered in the bright sunshine
loved wearing her red m itten when it was cold outside
She wore her favorite m itten every day . It was
a big , beautiful con ch shell ! He
Daddy her new f auc et . <|endoftext|> B eth was
was sitting on his per ch in a big , tall
Lily and the f auc et played gently , singing ,
birthday , her dress sh imm ered in the light .

MIN ACTIVATIONS
MIN = -0.123

tree could see lots of av oc ados that the supplier
still couldn 't find any av oc ados . Finally ,
Maybe they could grow more av oc ados and share them
on , whenever they wanted av oc ados , they would
happy that it produced more av oc ados than ever before
" I will buy more av oc ados tomorrow . You
the store to get more av oc ados . Gary looked
girl who didn 't like av oc ados . She always
ongo didn 't like wet av oc ados , so he
away . She learned that av oc ados are good for
store and forgot to buy av oc ados . Lily was
Everyday he would take the av oc ados and scatter them
said , " I love av oc ados ! They are
if she could request some av oc ados from the store
the tree if they were av oc ados , and they
and try to grab the av oc ados . But the
but there were no other av oc ados .
. Did you know that av oc ados are very healthy
something new ! She thought av oc ados were the most
, and all the other av oc ados were very happy

INTERVAL 0.024 - 0.073
CONTAINS 3.919%

grabbed a wand and waved it around . " A br
happy . She realised that doing the right thing can feel
he thanked the boy and said goodbye . The boy
come back again ," said Susan to her mom my .
was thankful for the special week she spent with her family

INTERVAL -0.025 - 0.024
CONTAINS 93.231%

know you saw it first . I was just looking for
very icy outside . Do you want to wear my scarf
" Very good ," said the teacher , looking
eyes . He felt warm and cozy . He heard his
hurry . She grabbed her cushion , hugged it close ,

INTERVAL -0.074 - -0.025
CONTAINS 2.850%

Once upon a time , there was a parade . Everyone
baby had so much to learn , but M ummy and
more . And as they read , they laughed and talked
wonderful adventure . <|endoftext|> Once there was a boy named John
share and take turns . Now you have to clean up

INTERVAL -0.123 - -0.074
CONTAINS 0.001%

can we have some or nam ents for our snow man
to me . The or nam ents were very important to
time there was a little av ac ado . It was
. She loved to eat av oc ados because they were
He quickly hung colorful or nam ents and tw ink ling
Token It
Token ID632
Feature activation-9.48e-03

Pos logit contributions
pron+0.002
light+0.002
ina+0.002
kin+0.002
 particular+0.002
Neg logit contributions
atters-0.001
 apart-0.001
licks-0.001
 blamed-0.001
 wre-0.001
Token was
Token ID373
Feature activation-1.32e-03

Pos logit contributions
 Instead+0.014
 They+0.014
 If+0.013
 apart+0.013
 So+0.013
Neg logit contributions
pron-0.017
kin-0.015
 lifetime-0.015
cle-0.014
orse-0.014
Token her
Token ID607
Feature activation-9.53e-03

Pos logit contributions
 If+0.001
 all+0.001
 Instead+0.001
 apart+0.001
 They+0.001
Neg logit contributions
fish-0.003
pron-0.003
orse-0.003
 lifetime-0.003
cle-0.003
Token favorite
Token ID4004
Feature activation+1.59e-02

Pos logit contributions
 If+0.012
 Instead+0.012
 So+0.011
 They+0.011
 all+0.010
Neg logit contributions
pron-0.019
kin-0.017
orse-0.017
fish-0.017
light-0.017
Token--
Token ID438
Feature activation+8.68e-03

Pos logit contributions
pron+0.031
orse+0.031
 lifetime+0.030
wen+0.029
fish+0.029
Neg logit contributions
 Instead-0.019
 If-0.018
 all-0.018
 They-0.018
 So-0.017
Tokench
Token ID354
Feature activation+7.26e-02

Pos logit contributions
pron+0.015
 lifetime+0.015
kin+0.015
orse+0.015
inkle+0.014
Neg logit contributions
 If-0.013
 Instead-0.011
licks-0.011
 They-0.011
 apart-0.010
Tokenocolate
Token ID9140
Feature activation+8.02e-04

Pos logit contributions
 lifetime+0.127
pron+0.107
 pearl+0.105
cle+0.102
 newfound+0.102
Neg logit contributions
 If-0.116
 all-0.110
 They-0.105
icks-0.104
atters-0.102
Token chip
Token ID11594
Feature activation-1.60e-02

Pos logit contributions
pron+0.001
kin+0.001
 lifetime+0.001
orse+0.001
cle+0.001
Neg logit contributions
 Instead-0.001
 They-0.001
 If-0.001
 all-0.001
 So-0.001
Token!
Token ID0
Feature activation+1.02e-02

Pos logit contributions
 If+0.023
 Instead+0.022
 They+0.022
 apart+0.021
licks+0.021
Neg logit contributions
pron-0.030
kin-0.026
 lifetime-0.025
fish-0.025
cle-0.024
Token She
Token ID1375
Feature activation+2.36e-03

Pos logit contributions
pron+0.020
light+0.018
kin+0.017
 happened+0.017
 lifetime+0.017
Neg logit contributions
 apart-0.012
atters-0.011
licks-0.011
 If-0.011
icks-0.010
Token was
Token ID373
Feature activation-4.06e-03

Pos logit contributions
pron+0.005
kin+0.004
fish+0.004
 lifetime+0.004
iest+0.004
Neg logit contributions
 If-0.003
 Instead-0.003
 They-0.003
 apart-0.003
 So-0.003
Token shell
Token ID7582
Feature activation+2.65e-03

Pos logit contributions
pron+0.045
cle+0.042
wen+0.040
inkle+0.039
light+0.037
Neg logit contributions
 all-0.031
 They-0.027
 many-0.026
 So-0.025
 Instead-0.025
Token was
Token ID373
Feature activation-4.90e-03

Pos logit contributions
pron+0.006
cle+0.005
inkle+0.005
 lifetime+0.005
wen+0.005
Neg logit contributions
 They-0.003
 all-0.003
 apart-0.003
 He-0.003
 Instead-0.003
Token called
Token ID1444
Feature activation-1.24e-02

Pos logit contributions
 all+0.004
 apart+0.004
 They+0.004
 If+0.004
 many+0.003
Neg logit contributions
pron-0.013
cle-0.012
itten-0.011
kin-0.011
light-0.011
Token a
Token ID257
Feature activation-6.06e-03

Pos logit contributions
 If+0.015
 They+0.013
 apart+0.012
 Instead+0.012
 all+0.012
Neg logit contributions
pron-0.028
cle-0.025
 lifetime-0.024
kin-0.024
light-0.024
Token con
Token ID369
Feature activation+6.17e-03

Pos logit contributions
 all+0.007
 If+0.007
 zo+0.007
 Instead+0.007
 many+0.006
Neg logit contributions
pron-0.011
light-0.011
wen-0.011
cle-0.011
inkle-0.010
Tokench
Token ID354
Feature activation+7.25e-02

Pos logit contributions
kin+0.011
pron+0.011
orse+0.010
kins+0.010
light+0.009
Neg logit contributions
 If-0.010
 Instead-0.009
 They-0.009
 So-0.009
licks-0.008
Token shell
Token ID7582
Feature activation-6.75e-03

Pos logit contributions
pron+0.133
 lifetime+0.131
cle+0.120
light+0.114
wen+0.112
Neg logit contributions
 all-0.100
 They-0.097
icks-0.097
 If-0.097
 apart-0.087
Token.
Token ID13
Feature activation-2.75e-03

Pos logit contributions
 They+0.006
 all+0.006
 apart+0.006
 If+0.005
 Instead+0.005
Neg logit contributions
pron-0.017
cle-0.015
wen-0.015
 lifetime-0.014
inkle-0.014
Token
Token ID198
Feature activation+6.95e-03

Pos logit contributions
 If+0.002
 apart+0.002
licks+0.002
icks+0.002
 Instead+0.002
Neg logit contributions
pron-0.006
kin-0.006
orse-0.005
 lifetime-0.005
 happened-0.005
Token
Token ID198
Feature activation+9.92e-03

Pos logit contributions
pron+0.016
kin+0.015
fish+0.014
iest+0.014
 lifetime+0.014
Neg logit contributions
 If-0.007
 Instead-0.007
 They-0.006
 apart-0.006
atters-0.006
TokenTom
Token ID13787
Feature activation+1.56e-02

Pos logit contributions
pron+0.018
kin+0.016
fish+0.016
light+0.015
iest+0.015
Neg logit contributions
 If-0.016
 Instead-0.015
 They-0.014
 So-0.014
 Even-0.013
Token guess
Token ID4724
Feature activation+3.49e-02

Pos logit contributions
 If+0.007
 Instead+0.007
 apart+0.006
 They+0.006
atters+0.006
Neg logit contributions
pron-0.013
kin-0.012
fish-0.011
 lifetime-0.011
cle-0.011
Token it
Token ID340
Feature activation+1.23e-02

Pos logit contributions
pron+0.081
kin+0.070
cle+0.065
 lifetime+0.064
inkle+0.063
Neg logit contributions
 If-0.044
 Instead-0.041
 They-0.038
 apart-0.036
 all-0.036
Token's
Token ID338
Feature activation+2.32e-03

Pos logit contributions
pron+0.026
kin+0.024
 lifetime+0.022
fish+0.022
orse+0.022
Neg logit contributions
 If-0.016
 Instead-0.015
 They-0.014
 apart-0.014
 So-0.013
Token a
Token ID257
Feature activation+9.67e-03

Pos logit contributions
pron+0.006
kin+0.005
fish+0.005
iest+0.005
orse+0.005
Neg logit contributions
 If-0.002
 apart-0.002
 Instead-0.002
 all-0.002
 They-0.002
Token horn
Token ID12718
Feature activation+2.37e-02

Pos logit contributions
pron+0.018
kin+0.017
kins+0.016
light+0.016
iest+0.016
Neg logit contributions
 If-0.012
 Instead-0.012
 apart-0.011
 zo-0.010
 Even-0.010
Tokenet
Token ID316
Feature activation+7.23e-02

Pos logit contributions
pron+0.055
kin+0.049
 lifetime+0.047
orse+0.046
cle+0.046
Neg logit contributions
 If-0.025
 apart-0.024
 Instead-0.023
 They-0.022
icks-0.021
Token's
Token ID338
Feature activation+4.75e-03

Pos logit contributions
pron+0.169
kin+0.154
 lifetime+0.150
 newfound+0.142
cle+0.139
Neg logit contributions
 If-0.076
 apart-0.067
 Instead-0.066
 They-0.065
icks-0.064
Token nest
Token ID16343
Feature activation+2.01e-02

Pos logit contributions
pron+0.011
kin+0.010
fish+0.009
 lifetime+0.009
kins+0.009
Neg logit contributions
 If-0.006
 apart-0.005
 Instead-0.005
 They-0.005
 all-0.004
Token!
Token ID0
Feature activation+1.68e-02

Pos logit contributions
pron+0.045
orse+0.040
kin+0.040
 lifetime+0.040
cle+0.039
Neg logit contributions
 If-0.021
 apart-0.021
 Instead-0.018
 They-0.018
atters-0.017
Token We
Token ID775
Feature activation-9.30e-03

Pos logit contributions
pron+0.032
orse+0.028
light+0.028
kin+0.028
cle+0.027
Neg logit contributions
 apart-0.021
 If-0.018
atters-0.018
licks-0.018
 Instead-0.017
Token shouldn
Token ID6584
Feature activation+1.09e-02

Pos logit contributions
 If+0.011
 Instead+0.010
 apart+0.009
 They+0.009
licks+0.009
Neg logit contributions
pron-0.020
kin-0.019
fish-0.018
 lifetime-0.017
iest-0.017
Token,
Token ID11
Feature activation-1.55e-02

Pos logit contributions
 If+0.001
 Instead+0.001
 apart+0.001
 They+0.001
 So+0.001
Neg logit contributions
pron-0.002
kin-0.002
 lifetime-0.002
fish-0.002
iest-0.002
Token Lily
Token ID20037
Feature activation-3.39e-04

Pos logit contributions
 If+0.020
 Instead+0.018
 apart+0.018
 They+0.017
atters+0.017
Neg logit contributions
pron-0.032
kin-0.029
fish-0.028
orse-0.027
iest-0.027
Token,
Token ID11
Feature activation-3.89e-04

Pos logit contributions
 If+0.000
 Instead+0.000
 They+0.000
 apart+0.000
 So+0.000
Neg logit contributions
pron-0.001
kin-0.001
fish-0.001
 lifetime-0.001
iest-0.001
Token a
Token ID257
Feature activation+6.97e-03

Pos logit contributions
 If+0.001
 Instead+0.000
 They+0.000
 apart+0.000
licks+0.000
Neg logit contributions
pron-0.001
kin-0.001
fish-0.001
 lifetime-0.001
iest-0.001
Token che
Token ID1125
Feature activation+1.38e-02

Pos logit contributions
pron+0.014
kin+0.014
light+0.013
iest+0.013
wen+0.012
Neg logit contributions
 They-0.008
 If-0.007
 zo-0.007
 Instead-0.007
 apart-0.006
Tokenet
Token ID316
Feature activation+7.23e-02

Pos logit contributions
pron+0.020
 lifetime+0.018
light+0.016
kin+0.015
cle+0.015
Neg logit contributions
 If-0.024
atters-0.024
icks-0.024
licks-0.022
 apart-0.022
Tokenah
Token ID993
Feature activation+4.11e-02

Pos logit contributions
pron+0.156
kin+0.145
 lifetime+0.143
 newfound+0.133
fish+0.126
Neg logit contributions
 If-0.077
atters-0.072
 Instead-0.072
icks-0.070
 They-0.069
Token!"
Token ID2474
Feature activation-1.09e-02

Pos logit contributions
pron+0.104
 lifetime+0.092
kin+0.092
 particular+0.087
 newfound+0.087
Neg logit contributions
icks-0.032
atters-0.031
 If-0.031
 Instead-0.027
 They-0.025
Token Sam
Token ID3409
Feature activation+3.14e-02

Pos logit contributions
 They+0.009
 If+0.008
 Instead+0.007
 So+0.007
 He+0.006
Neg logit contributions
pron-0.030
iest-0.027
kin-0.027
 lifetime-0.027
fish-0.026
Token said
Token ID531
Feature activation+3.50e-02

Pos logit contributions
pron+0.041
fish+0.036
kin+0.035
 happened+0.035
orse+0.035
Neg logit contributions
 If-0.059
atters-0.054
 Instead-0.053
 apart-0.053
 Other-0.052
Token,
Token ID11
Feature activation+1.86e-02

Pos logit contributions
pron+0.062
kin+0.056
fish+0.053
orse+0.051
iest+0.051
Neg logit contributions
 If-0.054
 Instead-0.050
licks-0.048
atters-0.047
 They-0.047
Tokenled
Token ID992
Feature activation-7.45e-03

Pos logit contributions
pron+0.030
cle+0.028
 lifetime+0.025
orse+0.025
kin+0.023
Neg logit contributions
 apart-0.019
 all-0.018
 Instead-0.017
 They-0.016
 So-0.016
Token and
Token ID290
Feature activation+1.28e-02

Pos logit contributions
 apart+0.006
 all+0.006
 If+0.005
 Instead+0.005
 They+0.005
Neg logit contributions
pron-0.020
orse-0.018
cle-0.018
kin-0.018
 lifetime-0.017
Token her
Token ID607
Feature activation+1.13e-02

Pos logit contributions
pron+0.024
kin+0.022
cle+0.021
fish+0.021
inkle+0.020
Neg logit contributions
 apart-0.017
 They-0.017
 Instead-0.016
 all-0.016
 If-0.016
Token dress
Token ID6576
Feature activation+4.67e-02

Pos logit contributions
cle+0.025
pron+0.024
orse+0.022
 lifetime+0.022
wen+0.022
Neg logit contributions
 all-0.012
 g-0.010
 apart-0.010
 both-0.010
 They-0.010
Token gl
Token ID1278
Feature activation+1.29e-02

Pos logit contributions
pron+0.093
cle+0.091
orse+0.087
 lifetime+0.082
 pearl+0.080
Neg logit contributions
 all-0.059
 apart-0.057
 Instead-0.056
 They-0.056
 All-0.050
Tokenimm
Token ID8608
Feature activation+7.20e-02

Pos logit contributions
pron+0.019
orse+0.016
kin+0.015
 happened+0.015
 lifetime+0.015
Neg logit contributions
 If-0.023
atters-0.022
 Instead-0.022
 apart-0.022
licks-0.022
Tokenered
Token ID1068
Feature activation+5.23e-03

Pos logit contributions
pron+0.153
 lifetime+0.137
orse+0.132
cle+0.129
kin+0.128
Neg logit contributions
 If-0.090
licks-0.080
icks-0.078
 apart-0.078
atters-0.078
Token in
Token ID287
Feature activation-1.08e-02

Pos logit contributions
pron+0.014
cle+0.012
orse+0.012
kin+0.012
 lifetime+0.011
Neg logit contributions
 apart-0.005
 all-0.004
 Instead-0.004
 They-0.004
 If-0.004
Token the
Token ID262
Feature activation+1.76e-02

Pos logit contributions
 apart+0.009
 all+0.009
 Instead+0.008
 If+0.008
 They+0.008
Neg logit contributions
pron-0.028
orse-0.026
fish-0.025
kin-0.025
ves-0.025
Token sun
Token ID4252
Feature activation+4.62e-02

Pos logit contributions
pron+0.033
orse+0.033
ves+0.032
kin+0.031
fish+0.031
Neg logit contributions
 apart-0.023
 Instead-0.022
 If-0.021
 They-0.020
 all-0.019
Token.
Token ID13
Feature activation+3.85e-05

Pos logit contributions
pron+0.110
kin+0.100
orse+0.097
cle+0.096
fish+0.095
Neg logit contributions
 apart-0.044
 They-0.044
 Instead-0.041
 If-0.038
 all-0.036
Tokent
Token ID83
Feature activation-6.71e-05

Pos logit contributions
 If+0.089
 Instead+0.083
 They+0.079
 So+0.076
 apart+0.076
Neg logit contributions
pron-0.091
kin-0.081
fish-0.076
 lifetime-0.076
light-0.074
Token you
Token ID345
Feature activation-7.32e-03

Pos logit contributions
 If+0.000
 Instead+0.000
 They+0.000
 All+0.000
 So+0.000
Neg logit contributions
pron-0.000
kin-0.000
 lifetime-0.000
inkle-0.000
cle-0.000
Token start
Token ID923
Feature activation+1.37e-02

Pos logit contributions
 If+0.007
 Instead+0.006
 They+0.006
 apart+0.006
 So+0.006
Neg logit contributions
pron-0.018
kin-0.016
fish-0.015
 lifetime-0.015
iest-0.015
Token with
Token ID351
Feature activation-1.35e-03

Pos logit contributions
pron+0.032
kin+0.030
iest+0.028
 lifetime+0.028
cle+0.028
Neg logit contributions
 If-0.015
 apart-0.014
 Instead-0.013
 They-0.013
 all-0.012
Token the
Token ID262
Feature activation+1.45e-02

Pos logit contributions
 If+0.001
 all+0.001
 Instead+0.001
 apart+0.001
 both+0.001
Neg logit contributions
pron-0.003
kin-0.003
fish-0.003
 lifetime-0.003
orse-0.003
Token roller
Token ID24471
Feature activation+7.19e-02

Pos logit contributions
pron+0.028
cle+0.026
orse+0.026
kin+0.025
light+0.025
Neg logit contributions
 If-0.019
 apart-0.018
 They-0.018
 Instead-0.017
 all-0.017
Tokenco
Token ID1073
Feature activation+4.81e-02

Pos logit contributions
 lifetime+0.123
pron+0.120
kin+0.117
 newfound+0.115
 pearl+0.115
Neg logit contributions
 If-0.091
 So-0.089
 They-0.087
 Instead-0.087
 All-0.084
Tokenaster
Token ID1603
Feature activation+5.43e-02

Pos logit contributions
pron+0.076
kin+0.070
 lifetime+0.067
 veil+0.065
 pearl+0.065
Neg logit contributions
 If-0.078
 They-0.072
 So-0.071
 Other-0.071
atters-0.070
Token?
Token ID30
Feature activation+6.75e-03

Pos logit contributions
pron+0.116
kin+0.104
 lifetime+0.104
cle+0.104
orse+0.102
Neg logit contributions
 If-0.061
 apart-0.059
 Instead-0.056
 They-0.056
 So-0.054
Tokenâ
Token ID22940
Feature activation+2.75e-02

Pos logit contributions
pron+0.016
kin+0.014
fish+0.014
 lifetime+0.013
orse+0.013
Neg logit contributions
 If-0.006
 Instead-0.006
 apart-0.006
atters-0.005
licks-0.005
Token
Token ID26391
Feature activation+2.94e-02

Pos logit contributions
pron+0.045
kin+0.040
fish+0.038
 lifetime+0.036
iest+0.036
Neg logit contributions
 If-0.047
 Instead-0.044
 They-0.043
 apart-0.042
atters-0.042
Token She
Token ID1375
Feature activation+9.57e-04

Pos logit contributions
 If+0.040
 Instead+0.036
 They+0.035
 apart+0.035
licks+0.034
Neg logit contributions
pron-0.055
kin-0.049
fish-0.046
 lifetime-0.046
orse-0.045
Tokenlly
Token ID12810
Feature activation+2.07e-02

Pos logit contributions
pron+0.002
kin+0.002
fish+0.002
iest+0.002
 lifetime+0.002
Neg logit contributions
 They-0.001
 Instead-0.001
 If-0.001
 So-0.001
 Even-0.001
Token a
Token ID257
Feature activation+6.09e-03

Pos logit contributions
pron+0.051
kin+0.048
 lifetime+0.045
orse+0.044
inkle+0.044
Neg logit contributions
 If-0.020
 They-0.019
 Instead-0.018
 apart-0.017
licks-0.016
Token f
Token ID277
Feature activation-8.89e-03

Pos logit contributions
kin+0.011
orse+0.011
we+0.010
cle+0.010
iest+0.010
Neg logit contributions
 Instead-0.007
 all-0.007
 If-0.007
 They-0.007
 Enjoy-0.006
Tokenauc
Token ID14272
Feature activation-2.05e-02

Pos logit contributions
 If+0.014
 Instead+0.013
 They+0.012
 apart+0.012
 Even+0.012
Neg logit contributions
pron-0.016
kin-0.014
fish-0.014
iest-0.014
light-0.013
Tokenet
Token ID316
Feature activation+7.19e-02

Pos logit contributions
 If+0.042
 Instead+0.040
 apart+0.039
 They+0.039
licks+0.038
Neg logit contributions
pron-0.027
kin-0.023
fish-0.021
 lifetime-0.020
orse-0.020
Token.
Token ID13
Feature activation-6.88e-03

Pos logit contributions
pron+0.179
 lifetime+0.174
kin+0.165
cle+0.161
orse+0.157
Neg logit contributions
 They-0.058
 all-0.057
 apart-0.052
 If-0.050
 over-0.044
Token It
Token ID632
Feature activation-8.31e-03

Pos logit contributions
 If+0.007
 apart+0.006
atters+0.006
licks+0.006
 Instead+0.006
Neg logit contributions
pron-0.015
kin-0.014
fish-0.013
orse-0.013
 particular-0.013
Token was
Token ID373
Feature activation-1.01e-03

Pos logit contributions
 They+0.012
 Instead+0.012
 If+0.012
 apart+0.011
 Even+0.011
Neg logit contributions
pron-0.016
kin-0.014
cle-0.014
 lifetime-0.014
orse-0.013
Token a
Token ID257
Feature activation-6.24e-05

Pos logit contributions
 If+0.001
 They+0.001
 all+0.001
 apart+0.001
 Instead+0.001
Neg logit contributions
pron-0.003
 lifetime-0.002
orse-0.002
fish-0.002
cle-0.002
Token shiny
Token ID22441
Feature activation+1.32e-02

Pos logit contributions
 They+0.000
 zo+0.000
 Instead+0.000
 all+0.000
 If+0.000
Neg logit contributions
wen-0.000
cle-0.000
asha-0.000
iest-0.000
light-0.000
Token She
Token ID1375
Feature activation+8.26e-03

Pos logit contributions
pron+0.010
 particular+0.008
 moment+0.008
kin+0.008
 happened+0.008
Neg logit contributions
 apart-0.004
icks-0.004
atters-0.004
licks-0.004
 If-0.003
Token put
Token ID1234
Feature activation+1.32e-03

Pos logit contributions
pron+0.018
kin+0.016
 lifetime+0.016
iest+0.016
light+0.016
Neg logit contributions
 They-0.010
 If-0.009
 So-0.009
 Instead-0.008
 all-0.008
Token on
Token ID319
Feature activation+4.29e-03

Pos logit contributions
pron+0.003
 lifetime+0.003
orse+0.003
cle+0.003
fish+0.003
Neg logit contributions
 all-0.001
 If-0.001
 They-0.001
 apart-0.001
icks-0.001
Token her
Token ID607
Feature activation+2.08e-03

Pos logit contributions
pron+0.008
kin+0.007
fish+0.007
 lifetime+0.006
iest+0.006
Neg logit contributions
 If-0.006
 Instead-0.006
 They-0.006
 apart-0.006
licks-0.006
Token sw
Token ID1509
Feature activation+2.78e-03

Pos logit contributions
cle+0.004
light+0.004
pron+0.004
 lifetime+0.004
web+0.004
Neg logit contributions
 all-0.002
 They-0.002
 both-0.002
 p-0.002
 many-0.002
Tokenimm
Token ID8608
Feature activation+7.18e-02

Pos logit contributions
pron+0.005
kin+0.005
 happened+0.004
iest+0.004
fish+0.004
Neg logit contributions
 If-0.004
 Instead-0.004
atters-0.004
 They-0.004
icks-0.004
Tokenies
Token ID444
Feature activation+2.64e-02

Pos logit contributions
pron+0.147
 lifetime+0.142
light+0.129
cle+0.126
orse+0.121
Neg logit contributions
 If-0.089
icks-0.087
oops-0.086
atters-0.082
 all-0.077
Token and
Token ID290
Feature activation+1.75e-02

Pos logit contributions
pron+0.070
cle+0.066
kin+0.065
 lifetime+0.062
inkle+0.061
Neg logit contributions
 all-0.020
 They-0.016
 apart-0.015
 If-0.015
 both-0.015
Token went
Token ID1816
Feature activation-1.59e-03

Pos logit contributions
pron+0.038
iest+0.035
inkle+0.035
kin+0.035
fish+0.034
Neg logit contributions
 They-0.018
 all-0.017
 many-0.015
 both-0.015
 apart-0.014
Token off
Token ID572
Feature activation+1.70e-02

Pos logit contributions
 If+0.002
 Instead+0.002
 They+0.002
 apart+0.002
licks+0.001
Neg logit contributions
pron-0.004
kin-0.003
fish-0.003
 lifetime-0.003
iest-0.003
Token to
Token ID284
Feature activation+2.54e-02

Pos logit contributions
pron+0.044
kin+0.042
iest+0.038
cle+0.038
 lifetime+0.038
Neg logit contributions
 If-0.013
 They-0.013
 apart-0.012
 all-0.011
 Instead-0.011
Token,
Token ID11
Feature activation-1.44e-02

Pos logit contributions
 If+0.011
 Instead+0.010
 They+0.010
 apart+0.010
icks+0.009
Neg logit contributions
pron-0.021
kin-0.018
fish-0.017
 lifetime-0.017
cle-0.017
Token this
Token ID428
Feature activation+2.84e-02

Pos logit contributions
 If+0.019
 Instead+0.017
 apart+0.017
 They+0.016
licks+0.016
Neg logit contributions
pron-0.029
kin-0.026
fish-0.025
iest-0.024
 lifetime-0.024
Token is
Token ID318
Feature activation+9.96e-03

Pos logit contributions
pron+0.062
kin+0.056
cle+0.055
orse+0.055
fish+0.054
Neg logit contributions
 If-0.033
 Instead-0.033
 They-0.032
 So-0.029
 apart-0.029
Token the
Token ID262
Feature activation+1.67e-02

Pos logit contributions
pron+0.022
kin+0.020
fish+0.019
orse+0.019
 lifetime+0.019
Neg logit contributions
 If-0.012
 Instead-0.011
 They-0.011
 apart-0.010
 So-0.009
Token best
Token ID1266
Feature activation+3.05e-02

Pos logit contributions
pron+0.031
orse+0.027
kin+0.027
fish+0.026
light+0.025
Neg logit contributions
 If-0.025
 Instead-0.023
 apart-0.023
 They-0.023
 Other-0.021
Token summer
Token ID3931
Feature activation+7.16e-02

Pos logit contributions
pron+0.050
kin+0.043
fish+0.041
orse+0.039
 lifetime+0.038
Neg logit contributions
 If-0.053
 Instead-0.051
 They-0.050
 apart-0.049
atters-0.047
Token ever
Token ID1683
Feature activation+5.43e-03

Pos logit contributions
pron+0.129
orse+0.115
inkle+0.110
 lifetime+0.107
 newfound+0.107
Neg logit contributions
 They-0.107
 If-0.095
 apart-0.094
 all-0.090
 Instead-0.088
Token!"
Token ID2474
Feature activation-1.96e-02

Pos logit contributions
pron+0.014
kin+0.013
 lifetime+0.012
cle+0.012
orse+0.012
Neg logit contributions
 If-0.004
 apart-0.004
 They-0.004
 Instead-0.004
atters-0.004
Token Anna
Token ID11735
Feature activation+1.53e-02

Pos logit contributions
 They+0.020
 If+0.020
 Instead+0.017
 So+0.016
 She+0.015
Neg logit contributions
pron-0.051
kin-0.045
 lifetime-0.045
iest-0.044
fish-0.043
Token says
Token ID1139
Feature activation+3.45e-02

Pos logit contributions
pron+0.023
fish+0.022
orse+0.021
kin+0.021
 happened+0.021
Neg logit contributions
 If-0.023
 Instead-0.022
atters-0.021
licks-0.021
 apart-0.021
Token.
Token ID13
Feature activation+2.67e-04

Pos logit contributions
pron+0.059
kin+0.054
fish+0.051
orse+0.049
light+0.048
Neg logit contributions
 If-0.054
 Instead-0.051
licks-0.049
atters-0.048
 apart-0.048
Token to
Token ID284
Feature activation+9.38e-03

Pos logit contributions
pron+0.036
kin+0.033
wen+0.032
inkle+0.032
cle+0.032
Neg logit contributions
 all-0.008
 They-0.007
 If-0.006
 apart-0.006
 Instead-0.006
Token play
Token ID711
Feature activation+3.06e-02

Pos logit contributions
pron+0.023
kin+0.021
fish+0.020
iest+0.020
cle+0.020
Neg logit contributions
 If-0.009
 They-0.009
 Instead-0.008
 all-0.008
 apart-0.008
Token and
Token ID290
Feature activation+6.47e-03

Pos logit contributions
pron+0.079
cle+0.077
wen+0.074
 lifetime+0.071
itten+0.070
Neg logit contributions
 all-0.025
 They-0.024
 If-0.020
 apart-0.020
 Instead-0.018
Token explore
Token ID7301
Feature activation+1.88e-02

Pos logit contributions
pron+0.014
kin+0.013
fish+0.013
iest+0.012
 lifetime+0.012
Neg logit contributions
 If-0.007
 Instead-0.007
 They-0.006
 apart-0.006
atters-0.006
Token the
Token ID262
Feature activation+1.13e-02

Pos logit contributions
pron+0.048
kin+0.044
cle+0.043
itten+0.041
orse+0.041
Neg logit contributions
 all-0.017
 They-0.015
 If-0.015
 Instead-0.014
 apart-0.014
Token freshly
Token ID29026
Feature activation+7.14e-02

Pos logit contributions
pron+0.021
kin+0.020
orse+0.019
fish+0.018
light+0.018
Neg logit contributions
 If-0.016
 They-0.016
 apart-0.015
 Instead-0.015
 Other-0.014
Token organized
Token ID8389
Feature activation+1.19e-02

Pos logit contributions
pron+0.129
kin+0.122
orse+0.115
fish+0.115
cle+0.111
Neg logit contributions
 They-0.088
 Instead-0.087
 all-0.086
 All-0.083
atters-0.081
Token lawn
Token ID21133
Feature activation+1.76e-02

Pos logit contributions
pron+0.024
kin+0.022
orse+0.022
cle+0.021
fish+0.021
Neg logit contributions
 They-0.015
 If-0.015
 apart-0.014
 Instead-0.014
 all-0.013
Token.
Token ID13
Feature activation+1.18e-03

Pos logit contributions
pron+0.041
kin+0.036
inkle+0.035
cle+0.035
 happened+0.034
Neg logit contributions
 They-0.019
 If-0.019
 Instead-0.018
 apart-0.018
 So-0.016
Token<|endoftext|>
Token ID50256
Feature activation-6.68e-03

Pos logit contributions
pron+0.003
kin+0.002
fish+0.002
orse+0.002
 lifetime+0.002
Neg logit contributions
 If-0.001
 apart-0.001
 Instead-0.001
atters-0.001
licks-0.001
TokenOnce
Token ID7454
Feature activation-8.49e-03

Pos logit contributions
 If+0.008
 They+0.008
 Instead+0.008
 So+0.007
 apart+0.007
Neg logit contributions
pron-0.015
kin-0.013
fish-0.013
 lifetime-0.012
orse-0.012
Token popular
Token ID2968
Feature activation+9.63e-03

Pos logit contributions
pron+0.012
cle+0.011
kin+0.010
wen+0.009
lf+0.009
Neg logit contributions
 Instead-0.007
 all-0.007
 If-0.007
 apart-0.007
 many-0.007
Token shell
Token ID7582
Feature activation-2.21e-03

Pos logit contributions
pron+0.019
cle+0.017
wen+0.016
inkle+0.015
itten+0.015
Neg logit contributions
 all-0.014
 many-0.011
 They-0.011
 hide-0.011
 look-0.010
Token -
Token ID532
Feature activation+8.21e-03

Pos logit contributions
 all+0.002
 apart+0.002
 They+0.002
 He+0.001
 Instead+0.001
Neg logit contributions
pron-0.005
cle-0.005
inkle-0.005
 lifetime-0.005
kin-0.005
Token the
Token ID262
Feature activation+1.53e-02

Pos logit contributions
pron+0.018
kin+0.017
cle+0.016
inkle+0.016
fish+0.015
Neg logit contributions
 If-0.010
 They-0.009
 Instead-0.009
 all-0.009
 apart-0.008
Token con
Token ID369
Feature activation+8.59e-03

Pos logit contributions
pron+0.031
cle+0.030
light+0.028
ves+0.028
wen+0.027
Neg logit contributions
 all-0.020
 right-0.018
 many-0.018
 both-0.017
 If-0.017
Tokench
Token ID354
Feature activation+7.07e-02

Pos logit contributions
kin+0.016
pron+0.014
kins+0.014
orse+0.014
fish+0.013
Neg logit contributions
 If-0.013
 Instead-0.012
 They-0.012
 All-0.012
 So-0.012
Token shell
Token ID7582
Feature activation-4.18e-03

Pos logit contributions
pron+0.102
cle+0.101
 lifetime+0.100
Kelly+0.090
light+0.081
Neg logit contributions
 all-0.126
 They-0.106
icks-0.104
 off-0.100
 both-0.100
Token!
Token ID0
Feature activation+1.16e-02

Pos logit contributions
 all+0.004
 apart+0.004
 They+0.004
 Instead+0.003
 He+0.003
Neg logit contributions
pron-0.010
cle-0.009
 lifetime-0.008
inkle-0.008
iko-0.008
Token<|endoftext|>
Token ID50256
Feature activation-2.64e-03

Pos logit contributions
pron+0.024
kin+0.021
 lifetime+0.021
orse+0.021
 happened+0.021
Neg logit contributions
 apart-0.012
 If-0.011
licks-0.011
icks-0.011
atters-0.010
TokenOnce
Token ID7454
Feature activation-4.65e-03

Pos logit contributions
 If+0.003
 They+0.003
 Instead+0.003
 So+0.003
 All+0.003
Neg logit contributions
pron-0.006
kin-0.005
fish-0.005
 lifetime-0.005
cle-0.005
Token upon
Token ID2402
Feature activation-1.64e-03

Pos logit contributions
 If+0.009
 They+0.008
 Instead+0.008
 So+0.008
 Even+0.008
Neg logit contributions
pron-0.007
kin-0.006
 lifetime-0.006
light-0.006
cle-0.005
Token picked
Token ID6497
Feature activation-6.33e-03

Pos logit contributions
kin+0.017
iest+0.016
pron+0.016
fish+0.016
light+0.015
Neg logit contributions
 Instead-0.007
 all-0.007
 both-0.007
 They-0.007
 If-0.007
Token and
Token ID290
Feature activation-6.70e-03

Pos logit contributions
 If+0.005
 apart+0.005
 all+0.005
 They+0.005
 Instead+0.004
Neg logit contributions
pron-0.017
orse-0.015
kin-0.015
cle-0.015
 lifetime-0.015
Token the
Token ID262
Feature activation+1.33e-02

Pos logit contributions
 If+0.009
 Instead+0.009
 They+0.009
 apart+0.008
 Even+0.008
Neg logit contributions
pron-0.013
kin-0.012
fish-0.011
iest-0.011
 lifetime-0.011
Token lucky
Token ID9670
Feature activation+7.88e-03

Pos logit contributions
pron+0.026
light+0.025
kin+0.025
cle+0.025
orse+0.025
Neg logit contributions
 If-0.016
 They-0.015
 Instead-0.015
 all-0.015
 apart-0.014
Token cl
Token ID537
Feature activation+2.47e-02

Pos logit contributions
pron+0.014
orse+0.014
cle+0.014
kin+0.013
 lifetime+0.013
Neg logit contributions
 They-0.010
 all-0.009
 Instead-0.009
 If-0.009
 All-0.009
Tokenover
Token ID2502
Feature activation+7.06e-02

Pos logit contributions
pron+0.043
 lifetime+0.036
kin+0.036
cle+0.035
fish+0.035
Neg logit contributions
 If-0.037
 Instead-0.036
atters-0.034
 They-0.034
icks-0.034
Token.
Token ID13
Feature activation-9.10e-04

Pos logit contributions
pron+0.156
cle+0.153
inkle+0.148
 lifetime+0.141
light+0.139
Neg logit contributions
 all-0.080
 They-0.079
 All-0.067
 If-0.065
 She-0.063
Token Her
Token ID2332
Feature activation-2.41e-02

Pos logit contributions
 apart+0.001
licks+0.001
 If+0.001
icks+0.001
atters+0.001
Neg logit contributions
pron-0.002
kin-0.002
 happened-0.002
 lifetime-0.002
light-0.002
Token mom
Token ID1995
Feature activation+4.12e-03

Pos logit contributions
 Instead+0.026
 If+0.025
 They+0.024
 So+0.023
 all+0.022
Neg logit contributions
pron-0.050
iest-0.047
fish-0.047
kin-0.045
orse-0.045
Token smiled
Token ID13541
Feature activation-1.26e-02

Pos logit contributions
pron+0.007
kin+0.007
fish+0.006
iest+0.006
 lifetime+0.006
Neg logit contributions
 If-0.006
 Instead-0.006
 They-0.006
 Even-0.006
 apart-0.006
Token and
Token ID290
Feature activation+2.52e-04

Pos logit contributions
 If+0.014
 Instead+0.013
 They+0.012
 apart+0.012
 So+0.011
Neg logit contributions
pron-0.028
kin-0.026
fish-0.025
orse-0.024
iest-0.024
Token the
Token ID262
Feature activation+7.94e-03

Pos logit contributions
pron+0.020
kin+0.019
fish+0.018
iest+0.018
 lifetime+0.018
Neg logit contributions
 If-0.008
 They-0.007
 Instead-0.007
 apart-0.007
 all-0.006
Token road
Token ID2975
Feature activation+3.67e-03

Pos logit contributions
pron+0.014
kin+0.013
fish+0.012
iest+0.011
orse+0.011
Neg logit contributions
 If-0.013
 Instead-0.012
 They-0.011
 apart-0.011
atters-0.011
Token and
Token ID290
Feature activation+2.99e-03

Pos logit contributions
pron+0.007
kin+0.007
fish+0.006
 lifetime+0.006
iest+0.006
Neg logit contributions
 If-0.005
 Instead-0.005
 They-0.004
 apart-0.004
licks-0.004
Token it
Token ID340
Feature activation+1.84e-02

Pos logit contributions
pron+0.006
kin+0.005
fish+0.005
 lifetime+0.005
iest+0.005
Neg logit contributions
 If-0.004
 apart-0.004
 They-0.004
 Instead-0.004
 all-0.003
Token gl
Token ID1278
Feature activation-1.17e-03

Pos logit contributions
pron+0.033
kin+0.029
 lifetime+0.027
fish+0.027
cle+0.027
Neg logit contributions
 If-0.027
 Instead-0.026
 apart-0.026
 They-0.026
licks-0.024
Tokenimm
Token ID8608
Feature activation+7.04e-02

Pos logit contributions
 If+0.002
 Instead+0.002
atters+0.002
 apart+0.002
licks+0.002
Neg logit contributions
pron-0.002
kin-0.001
fish-0.001
 lifetime-0.001
orse-0.001
Tokenered
Token ID1068
Feature activation+1.10e-02

Pos logit contributions
pron+0.147
kin+0.126
 lifetime+0.126
light+0.121
orse+0.119
Neg logit contributions
 If-0.093
 Instead-0.084
atters-0.083
 apart-0.081
icks-0.081
Token in
Token ID287
Feature activation-7.17e-03

Pos logit contributions
pron+0.026
kin+0.024
cle+0.022
orse+0.022
fish+0.022
Neg logit contributions
 If-0.011
 apart-0.011
 Instead-0.010
 They-0.010
 all-0.009
Token the
Token ID262
Feature activation+2.29e-02

Pos logit contributions
 If+0.007
 Instead+0.007
 They+0.006
 apart+0.006
 all+0.005
Neg logit contributions
pron-0.017
kin-0.016
fish-0.015
orse-0.015
iest-0.015
Token bright
Token ID6016
Feature activation-1.13e-03

Pos logit contributions
pron+0.041
kin+0.037
fish+0.035
iest+0.033
orse+0.033
Neg logit contributions
 If-0.036
 Instead-0.033
 apart-0.032
 They-0.032
atters-0.031
Token sunshine
Token ID34488
Feature activation+3.98e-02

Pos logit contributions
 If+0.002
 Instead+0.002
 They+0.002
 apart+0.001
 So+0.001
Neg logit contributions
pron-0.002
kin-0.002
fish-0.002
orse-0.002
 lifetime-0.002
Token loved
Token ID6151
Feature activation+6.03e-03

Pos logit contributions
pron+0.028
kin+0.027
iest+0.027
 lifetime+0.025
fish+0.025
Neg logit contributions
 They-0.011
 From-0.009
 So-0.009
 Even-0.009
 all-0.008
Token wearing
Token ID5762
Feature activation+1.57e-03

Pos logit contributions
pron+0.013
kin+0.012
fish+0.011
 lifetime+0.011
iest+0.011
Neg logit contributions
 If-0.007
 Instead-0.007
 apart-0.006
 They-0.006
atters-0.006
Token her
Token ID607
Feature activation+1.73e-02

Pos logit contributions
pron+0.003
kin+0.003
fish+0.003
 lifetime+0.003
orse+0.003
Neg logit contributions
 If-0.002
 They-0.002
 Instead-0.002
 apart-0.002
 all-0.002
Token red
Token ID2266
Feature activation+3.99e-02

Pos logit contributions
orse+0.040
cle+0.039
fer+0.039
light+0.038
 lifetime+0.038
Neg logit contributions
 all-0.017
 both-0.012
 many-0.012
 They-0.012
 The-0.011
Token m
Token ID285
Feature activation+4.00e-02

Pos logit contributions
 lifetime+0.075
wen+0.074
orse+0.071
pron+0.071
kin+0.070
Neg logit contributions
 all-0.049
 They-0.048
 So-0.047
 Instead-0.043
 He-0.042
Tokenitten
Token ID2621
Feature activation+7.03e-02

Pos logit contributions
pron+0.043
kin+0.040
fish+0.037
 happened+0.034
 lifetime+0.034
Neg logit contributions
 If-0.084
 Instead-0.082
 They-0.079
atters-0.079
 So-0.079
Token when
Token ID618
Feature activation+1.26e-02

Pos logit contributions
pron+0.154
kin+0.143
 lifetime+0.138
fish+0.137
orse+0.135
Neg logit contributions
 If-0.074
 Instead-0.070
 They-0.068
 apart-0.066
 So-0.065
Token it
Token ID340
Feature activation+5.30e-04

Pos logit contributions
aker+0.029
inkle+0.029
pron+0.029
cle+0.028
immer+0.028
Neg logit contributions
 all-0.015
 many-0.013
 She-0.013
 They-0.013
 All-0.011
Token was
Token ID373
Feature activation-1.99e-04

Pos logit contributions
pron+0.001
cle+0.001
 lifetime+0.001
kin+0.001
 pearl+0.001
Neg logit contributions
 Instead-0.001
 all-0.001
 They-0.001
 All-0.001
 She-0.001
Token cold
Token ID4692
Feature activation+9.96e-03

Pos logit contributions
 all+0.000
 apart+0.000
 If+0.000
 Instead+0.000
 They+0.000
Neg logit contributions
fish-0.000
pron-0.000
orse-0.000
cle-0.000
 lifetime-0.000
Token outside
Token ID2354
Feature activation+6.64e-03

Pos logit contributions
pron+0.017
kin+0.015
iest+0.014
light+0.014
 happened+0.014
Neg logit contributions
 If-0.016
 Instead-0.015
licks-0.014
atters-0.014
icks-0.014
Token She
Token ID1375
Feature activation+1.08e-02

Pos logit contributions
pron+0.000
 particular+0.000
kin+0.000
 moment+0.000
ina+0.000
Neg logit contributions
 apart-0.000
atters-0.000
icks-0.000
 If-0.000
aled-0.000
Token wore
Token ID12408
Feature activation-8.52e-04

Pos logit contributions
pron+0.023
kin+0.021
 lifetime+0.021
iest+0.021
fish+0.020
Neg logit contributions
 They-0.012
 If-0.011
 Instead-0.011
 Even-0.010
 So-0.010
Token her
Token ID607
Feature activation+1.80e-02

Pos logit contributions
 If+0.001
 Instead+0.001
 They+0.001
 apart+0.001
 all+0.001
Neg logit contributions
pron-0.002
kin-0.002
fish-0.002
 lifetime-0.002
orse-0.002
Token favorite
Token ID4004
Feature activation+3.18e-02

Pos logit contributions
cle+0.039
orse+0.039
 lifetime+0.039
light+0.039
wen+0.038
Neg logit contributions
 all-0.017
 many-0.013
 only-0.013
 They-0.012
 The-0.012
Token m
Token ID285
Feature activation+3.76e-02

Pos logit contributions
wen+0.074
cle+0.071
 lifetime+0.070
 possible+0.068
orse+0.067
Neg logit contributions
 all-0.031
 p-0.024
 g-0.022
 hide-0.022
 pet-0.022
Tokenitten
Token ID2621
Feature activation+7.02e-02

Pos logit contributions
pron+0.041
kin+0.037
fish+0.034
 happened+0.032
 lifetime+0.032
Neg logit contributions
 If-0.079
 Instead-0.077
 So-0.075
 They-0.074
 Other-0.073
Token every
Token ID790
Feature activation+4.14e-02

Pos logit contributions
pron+0.158
kin+0.148
 lifetime+0.144
fish+0.141
inkle+0.141
Neg logit contributions
 If-0.066
 They-0.064
 all-0.063
 Instead-0.061
 So-0.060
Token day
Token ID1110
Feature activation+2.92e-02

Pos logit contributions
kins+0.079
inkle+0.077
wen+0.071
pron+0.070
andra+0.070
Neg logit contributions
 all-0.050
 While-0.050
 If-0.048
 They-0.048
 Every-0.048
Token.
Token ID13
Feature activation+4.00e-03

Pos logit contributions
pron+0.075
kin+0.069
fish+0.067
inkle+0.065
 lifetime+0.063
Neg logit contributions
 They-0.025
 If-0.024
 all-0.022
 Instead-0.021
 So-0.020
Token It
Token ID632
Feature activation-5.01e-03

Pos logit contributions
pron+0.008
kin+0.007
 particular+0.007
light+0.007
fish+0.007
Neg logit contributions
 apart-0.004
atters-0.004
 If-0.004
licks-0.003
icks-0.003
Token was
Token ID373
Feature activation+5.58e-03

Pos logit contributions
 They+0.007
 If+0.007
 Instead+0.007
 apart+0.007
 Even+0.007
Neg logit contributions
pron-0.009
kin-0.008
 lifetime-0.008
cle-0.008
orse-0.007
Token a
Token ID257
Feature activation+1.21e-03

Pos logit contributions
pron+0.032
kin+0.030
iest+0.027
fish+0.027
light+0.027
Neg logit contributions
 If-0.016
 They-0.015
 Instead-0.015
 apart-0.013
 Even-0.013
Token big
Token ID1263
Feature activation-2.84e-03

Pos logit contributions
pron+0.002
kin+0.002
light+0.002
inkle+0.002
cle+0.002
Neg logit contributions
 They-0.001
 Instead-0.001
 zo-0.001
 all-0.001
 If-0.001
Token,
Token ID11
Feature activation+1.73e-02

Pos logit contributions
 They+0.003
 apart+0.003
 all+0.003
 Instead+0.003
 If+0.003
Neg logit contributions
pron-0.006
ves-0.005
wen-0.005
kin-0.005
cle-0.005
Token beautiful
Token ID4950
Feature activation+3.83e-03

Pos logit contributions
pron+0.037
cle+0.035
wen+0.035
kin+0.035
light+0.034
Neg logit contributions
 They-0.018
 apart-0.017
 If-0.016
 Instead-0.015
 He-0.015
Token con
Token ID369
Feature activation+3.33e-03

Pos logit contributions
pron+0.007
cle+0.007
wen+0.007
fs+0.007
ves+0.007
Neg logit contributions
 all-0.005
 They-0.005
 Instead-0.004
 look-0.004
 open-0.004
Tokench
Token ID354
Feature activation+7.01e-02

Pos logit contributions
pron+0.006
kin+0.006
orse+0.005
kins+0.005
fish+0.005
Neg logit contributions
 If-0.005
 Instead-0.005
 They-0.005
 So-0.005
licks-0.005
Token shell
Token ID7582
Feature activation-1.57e-02

Pos logit contributions
pron+0.131
 lifetime+0.127
cle+0.121
 newfound+0.111
light+0.110
Neg logit contributions
 all-0.096
 They-0.088
icks-0.087
 If-0.083
 apart-0.078
Token!
Token ID0
Feature activation+9.03e-03

Pos logit contributions
 all+0.014
 They+0.014
 apart+0.013
 Instead+0.012
 If+0.012
Neg logit contributions
pron-0.040
cle-0.036
kin-0.033
 lifetime-0.033
 newfound-0.032
Token
Token ID198
Feature activation+8.83e-03

Pos logit contributions
pron+0.020
 happened+0.017
 lifetime+0.017
kin+0.017
orse+0.017
Neg logit contributions
 apart-0.008
icks-0.008
licks-0.008
 If-0.007
atters-0.007
Token
Token ID198
Feature activation+1.02e-02

Pos logit contributions
pron+0.018
kin+0.017
fish+0.016
orse+0.016
 lifetime+0.015
Neg logit contributions
 If-0.011
 apart-0.010
 Instead-0.010
licks-0.009
 They-0.009
TokenHe
Token ID1544
Feature activation-4.03e-03

Pos logit contributions
pron+0.018
kin+0.016
fish+0.015
iest+0.014
 lifetime+0.014
Neg logit contributions
 If-0.017
 Instead-0.015
 They-0.015
 apart-0.015
atters-0.014
Token Daddy
Token ID26926
Feature activation-8.89e-03

Pos logit contributions
 If+0.004
 Instead+0.004
 They+0.003
 apart+0.003
 So+0.003
Neg logit contributions
pron-0.005
kin-0.005
fish-0.005
orse-0.005
 lifetime-0.005
Token her
Token ID607
Feature activation-7.00e-03

Pos logit contributions
 If+0.009
 They+0.008
 apart+0.008
 Instead+0.008
 all+0.008
Neg logit contributions
pron-0.021
kin-0.019
 lifetime-0.018
orse-0.018
cle-0.018
Token new
Token ID649
Feature activation+9.97e-03

Pos logit contributions
 If+0.009
 all+0.008
 Instead+0.008
 They+0.008
 apart+0.008
Neg logit contributions
pron-0.014
cle-0.013
kin-0.013
fish-0.012
light-0.012
Token f
Token ID277
Feature activation-1.02e-02

Pos logit contributions
pron+0.019
kin+0.019
fish+0.017
orse+0.017
light+0.017
Neg logit contributions
 If-0.012
 They-0.012
 Instead-0.012
 all-0.012
 apart-0.011
Tokenauc
Token ID14272
Feature activation-2.41e-02

Pos logit contributions
 If+0.014
 Instead+0.013
 apart+0.012
 They+0.012
atters+0.012
Neg logit contributions
pron-0.020
kin-0.018
fish-0.017
 lifetime-0.017
orse-0.017
Tokenet
Token ID316
Feature activation+7.00e-02

Pos logit contributions
 If+0.045
 Instead+0.042
 apart+0.042
 They+0.042
atters+0.041
Neg logit contributions
pron-0.035
kin-0.030
fish-0.029
 lifetime-0.028
orse-0.028
Token.
Token ID13
Feature activation+4.09e-03

Pos logit contributions
pron+0.177
kin+0.160
cle+0.158
 lifetime+0.158
inkle+0.152
Neg logit contributions
 all-0.055
 They-0.052
 apart-0.050
 If-0.046
 Instead-0.040
Token<|endoftext|>
Token ID50256
Feature activation-8.55e-03

Pos logit contributions
pron+0.010
kin+0.009
fish+0.009
 lifetime+0.008
iest+0.008
Neg logit contributions
 If-0.004
 Instead-0.004
 apart-0.003
 They-0.003
licks-0.003
TokenB
Token ID33
Feature activation-4.45e-02

Pos logit contributions
 If+0.009
 Instead+0.009
 They+0.009
 apart+0.008
 So+0.007
Neg logit contributions
pron-0.020
kin-0.018
fish-0.017
 lifetime-0.017
orse-0.017
Tokeneth
Token ID2788
Feature activation+9.33e-03

Pos logit contributions
 If+0.061
 Instead+0.057
 They+0.056
 So+0.054
 Even+0.052
Neg logit contributions
pron-0.089
kin-0.082
 veil-0.078
fish-0.078
 happened-0.078
Token was
Token ID373
Feature activation+1.87e-03

Pos logit contributions
pron+0.019
kin+0.017
fish+0.016
 lifetime+0.016
iest+0.016
Neg logit contributions
 If-0.012
 They-0.012
 Instead-0.012
 apart-0.011
 So-0.011
Token was
Token ID373
Feature activation+9.13e-04

Pos logit contributions
 They+0.008
 Instead+0.008
 If+0.008
 So+0.007
 apart+0.007
Neg logit contributions
pron-0.016
kin-0.014
 lifetime-0.014
iest-0.013
fs-0.013
Token sitting
Token ID5586
Feature activation+2.47e-03

Pos logit contributions
pron+0.001
kin+0.001
fish+0.001
orse+0.001
 happened+0.001
Neg logit contributions
 If-0.002
 Instead-0.002
 They-0.001
 apart-0.001
licks-0.001
Token on
Token ID319
Feature activation+1.15e-02

Pos logit contributions
pron+0.005
kin+0.004
fish+0.004
iest+0.004
 lifetime+0.004
Neg logit contributions
 If-0.004
 Instead-0.003
 They-0.003
 apart-0.003
atters-0.003
Token his
Token ID465
Feature activation-4.63e-03

Pos logit contributions
pron+0.026
kin+0.024
fish+0.023
 lifetime+0.022
iest+0.022
Neg logit contributions
 If-0.013
 Instead-0.012
 They-0.011
 apart-0.011
licks-0.010
Token per
Token ID583
Feature activation-1.34e-02

Pos logit contributions
 If+0.007
 Instead+0.007
 apart+0.006
 They+0.006
atters+0.006
Neg logit contributions
pron-0.008
kin-0.007
fish-0.007
iest-0.007
 lifetime-0.007
Tokench
Token ID354
Feature activation+7.00e-02

Pos logit contributions
 If+0.028
 They+0.027
 Instead+0.027
 apart+0.025
 Share+0.025
Neg logit contributions
pron-0.016
kin-0.012
light-0.012
fish-0.012
inkle-0.011
Token in
Token ID287
Feature activation-1.07e-02

Pos logit contributions
pron+0.133
 lifetime+0.124
fish+0.111
inkle+0.111
kin+0.111
Neg logit contributions
 If-0.099
 apart-0.089
 They-0.088
icks-0.084
 Instead-0.084
Token a
Token ID257
Feature activation+9.54e-03

Pos logit contributions
 If+0.014
 Instead+0.013
 apart+0.012
 They+0.012
licks+0.012
Neg logit contributions
pron-0.022
kin-0.020
fish-0.019
iest-0.018
 lifetime-0.018
Token big
Token ID1263
Feature activation+5.05e-03

Pos logit contributions
pron+0.019
kin+0.018
iest+0.017
fish+0.017
 happened+0.017
Neg logit contributions
 If-0.011
 Instead-0.011
 They-0.010
 apart-0.010
atters-0.010
Token,
Token ID11
Feature activation+1.18e-02

Pos logit contributions
pron+0.010
kin+0.010
fish+0.009
iest+0.009
 lifetime+0.009
Neg logit contributions
 If-0.006
 Instead-0.006
 apart-0.006
 They-0.006
 Other-0.005
Token tall
Token ID7331
Feature activation+3.83e-03

Pos logit contributions
pron+0.023
kin+0.022
fish+0.020
iest+0.020
 lifetime+0.020
Neg logit contributions
 If-0.016
 apart-0.015
 Instead-0.014
 They-0.014
licks-0.013
Token Lily
Token ID20037
Feature activation+1.39e-02

Pos logit contributions
pron+0.010
 lifetime+0.009
iest+0.009
fish+0.009
orse+0.009
Neg logit contributions
 They-0.003
 all-0.003
 The-0.002
 If-0.002
 Instead-0.002
Token and
Token ID290
Feature activation-1.44e-03

Pos logit contributions
pron+0.030
kin+0.028
fish+0.026
iest+0.026
 lifetime+0.025
Neg logit contributions
 If-0.017
 Instead-0.016
 They-0.016
 apart-0.016
 So-0.014
Token the
Token ID262
Feature activation+2.28e-02

Pos logit contributions
 If+0.002
 Instead+0.002
 apart+0.002
 They+0.002
licks+0.001
Neg logit contributions
pron-0.003
kin-0.003
fish-0.003
iest-0.003
 lifetime-0.003
Token f
Token ID277
Feature activation-4.42e-03

Pos logit contributions
pron+0.049
light+0.046
kin+0.046
cle+0.044
 lifetime+0.044
Neg logit contributions
 If-0.022
 They-0.021
 apart-0.020
 zo-0.019
 Instead-0.019
Tokenauc
Token ID14272
Feature activation-2.70e-02

Pos logit contributions
 If+0.007
 Instead+0.006
 apart+0.006
 They+0.006
licks+0.006
Neg logit contributions
pron-0.008
kin-0.007
fish-0.007
iest-0.007
orse-0.006
Tokenet
Token ID316
Feature activation+7.00e-02

Pos logit contributions
 If+0.056
 Instead+0.053
 apart+0.052
 They+0.052
licks+0.050
Neg logit contributions
pron-0.035
kin-0.029
fish-0.027
 lifetime-0.026
iest-0.025
Token played
Token ID2826
Feature activation+1.08e-02

Pos logit contributions
pron+0.159
kin+0.134
iflower+0.133
cle+0.132
 avail+0.131
Neg logit contributions
 They-0.081
 all-0.079
 both-0.070
 apart-0.070
 So-0.069
Token gently
Token ID15165
Feature activation+9.00e-03

Pos logit contributions
pron+0.025
kin+0.022
fish+0.022
 lifetime+0.021
orse+0.021
Neg logit contributions
 If-0.012
 Instead-0.011
 They-0.010
 apart-0.010
atters-0.009
Token,
Token ID11
Feature activation+2.12e-03

Pos logit contributions
pron+0.020
kin+0.018
cle+0.017
 lifetime+0.017
iest+0.017
Neg logit contributions
 apart-0.010
 If-0.010
 They-0.010
 Instead-0.009
atters-0.008
Token singing
Token ID13777
Feature activation+9.99e-04

Pos logit contributions
pron+0.004
kin+0.004
iest+0.004
fish+0.004
 lifetime+0.004
Neg logit contributions
 If-0.003
 apart-0.003
 They-0.003
 Instead-0.003
 Even-0.002
Token,
Token ID11
Feature activation+9.65e-03

Pos logit contributions
pron+0.002
kin+0.002
 lifetime+0.002
cle+0.002
fish+0.002
Neg logit contributions
 They-0.001
 If-0.001
 apart-0.001
 Instead-0.001
 all-0.001
Token birthday
Token ID10955
Feature activation+2.24e-02

Pos logit contributions
 Instead+0.036
 If+0.035
 all+0.035
 So+0.034
 They+0.033
Neg logit contributions
pron-0.059
cle-0.054
kin-0.053
fish-0.053
orse-0.052
Token,
Token ID11
Feature activation-1.40e-03

Pos logit contributions
pron+0.052
orse+0.050
fish+0.048
cle+0.047
kin+0.047
Neg logit contributions
 apart-0.021
 They-0.020
 all-0.020
 Instead-0.019
 So-0.017
Token her
Token ID607
Feature activation-6.86e-03

Pos logit contributions
 They+0.002
 If+0.002
 Instead+0.001
 all+0.001
 apart+0.001
Neg logit contributions
pron-0.003
kin-0.003
fish-0.003
cle-0.003
inkle-0.003
Token dress
Token ID6576
Feature activation+3.32e-02

Pos logit contributions
 all+0.007
 Instead+0.006
 both+0.006
 many+0.006
 So+0.006
Neg logit contributions
pron-0.015
cle-0.014
orse-0.014
iest-0.014
fish-0.013
Token sh
Token ID427
Feature activation+7.58e-03

Pos logit contributions
orse+0.067
pron+0.067
cle+0.066
 pearl+0.060
 lifetime+0.059
Neg logit contributions
 all-0.039
 apart-0.036
 Instead-0.035
 They-0.034
 She-0.031
Tokenimm
Token ID8608
Feature activation+6.98e-02

Pos logit contributions
pron+0.009
fish+0.008
kin+0.008
 happened+0.007
orse+0.007
Neg logit contributions
 If-0.015
 Instead-0.015
atters-0.015
 apart-0.014
licks-0.014
Tokenered
Token ID1068
Feature activation+4.82e-03

Pos logit contributions
pron+0.163
 lifetime+0.152
orse+0.148
cle+0.142
light+0.140
Neg logit contributions
 If-0.072
icks-0.062
atters-0.061
licks-0.059
 Instead-0.056
Token in
Token ID287
Feature activation-1.14e-02

Pos logit contributions
pron+0.013
orse+0.012
kin+0.012
cle+0.012
fish+0.011
Neg logit contributions
 apart-0.004
 all-0.004
 Instead-0.003
 If-0.003
 They-0.003
Token the
Token ID262
Feature activation+1.86e-02

Pos logit contributions
 apart+0.009
 all+0.009
 Instead+0.008
 They+0.008
 If+0.008
Neg logit contributions
pron-0.029
orse-0.028
fish-0.028
kin-0.027
ves-0.027
Token light
Token ID1657
Feature activation+2.51e-02

Pos logit contributions
orse+0.037
fish+0.036
ves+0.035
pron+0.035
kin+0.035
Neg logit contributions
 apart-0.022
 Instead-0.022
 If-0.020
 So-0.019
 They-0.019
Token.
Token ID13
Feature activation+7.92e-03

Pos logit contributions
pron+0.059
orse+0.057
cle+0.056
kin+0.056
 happened+0.055
Neg logit contributions
 apart-0.022
 They-0.019
 all-0.019
 Instead-0.018
 So-0.016
Token tree
Token ID5509
Feature activation-5.46e-03

Pos logit contributions
 Instead+0.008
 all+0.008
 They+0.008
 apart+0.008
 So+0.007
Neg logit contributions
pron-0.014
cle-0.012
wen-0.011
iko-0.011
kin-0.011
Token could
Token ID714
Feature activation-1.12e-02

Pos logit contributions
 Instead+0.007
 They+0.007
 apart+0.007
 all+0.007
 If+0.007
Neg logit contributions
pron-0.011
cle-0.010
kin-0.010
wen-0.009
iko-0.009
Token see
Token ID766
Feature activation-3.97e-03

Pos logit contributions
 If+0.013
 Instead+0.012
 apart+0.011
 They+0.011
licks+0.011
Neg logit contributions
pron-0.024
kin-0.022
fish-0.021
orse-0.020
iest-0.020
Token lots
Token ID6041
Feature activation-2.24e-02

Pos logit contributions
 If+0.005
 Instead+0.004
 They+0.004
 apart+0.004
licks+0.004
Neg logit contributions
pron-0.009
kin-0.008
orse-0.008
fish-0.007
 lifetime-0.007
Token of
Token ID286
Feature activation-8.39e-03

Pos logit contributions
 If+0.032
 Instead+0.029
 They+0.028
 apart+0.028
atters+0.027
Neg logit contributions
pron-0.043
kin-0.038
fish-0.037
iest-0.036
 lifetime-0.035
Token av
Token ID1196
Feature activation-1.23e-01

Pos logit contributions
 If+0.013
 Instead+0.012
 apart+0.012
 They+0.012
licks+0.011
Neg logit contributions
pron-0.015
kin-0.013
fish-0.013
 lifetime-0.012
iest-0.012
Tokenoc
Token ID420
Feature activation-6.23e-02

Pos logit contributions
 If+0.238
 They+0.210
 slams+0.206
 blamed+0.205
 Instead+0.202
Neg logit contributions
pron-0.168
 lifetime-0.157
light-0.149
iflower-0.149
inkle-0.148
Tokenados
Token ID22484
Feature activation-4.68e-02

Pos logit contributions
 If+0.103
 They+0.100
 Instead+0.092
 apart+0.092
 While+0.091
Neg logit contributions
pron-0.086
iest-0.083
iflower-0.079
 lifetime-0.079
cle-0.077
Token that
Token ID326
Feature activation-4.88e-04

Pos logit contributions
 If+0.059
 apart+0.057
 Instead+0.056
 They+0.054
licks+0.051
Neg logit contributions
pron-0.097
kin-0.087
cle-0.080
fish-0.080
orse-0.079
Token the
Token ID262
Feature activation+5.78e-03

Pos logit contributions
 They+0.001
 If+0.001
 Instead+0.001
 apart+0.001
 all+0.001
Neg logit contributions
pron-0.001
kin-0.001
 lifetime-0.001
cle-0.001
fish-0.001
Token supplier
Token ID22693
Feature activation+9.39e-03

Pos logit contributions
pron+0.012
kin+0.011
fish+0.010
light+0.010
iest+0.010
Neg logit contributions
 If-0.007
 Instead-0.007
 They-0.007
 apart-0.007
 Other-0.006
Token still
Token ID991
Feature activation+1.87e-02

Pos logit contributions
pron+0.018
 lifetime+0.015
iest+0.015
kin+0.015
fish+0.015
Neg logit contributions
 If-0.013
 all-0.012
 apart-0.012
 Instead-0.012
 They-0.012
Token couldn
Token ID3521
Feature activation+3.85e-03

Pos logit contributions
pron+0.040
kin+0.035
fish+0.034
iest+0.034
ves+0.033
Neg logit contributions
 apart-0.023
 Instead-0.023
 If-0.023
 all-0.022
 both-0.020
Token't
Token ID470
Feature activation+1.07e-02

Pos logit contributions
pron+0.010
kin+0.009
fish+0.009
 lifetime+0.008
iest+0.008
Neg logit contributions
 If-0.003
 Instead-0.003
 They-0.003
 apart-0.003
 Even-0.002
Token find
Token ID1064
Feature activation-7.23e-03

Pos logit contributions
pron+0.025
kin+0.023
fish+0.022
iest+0.021
 lifetime+0.021
Neg logit contributions
 If-0.011
 Instead-0.010
 They-0.010
 apart-0.010
licks-0.009
Token any
Token ID597
Feature activation+7.63e-03

Pos logit contributions
 If+0.010
 Instead+0.009
 apart+0.009
 They+0.009
licks+0.009
Neg logit contributions
pron-0.015
kin-0.013
fish-0.013
 lifetime-0.012
orse-0.012
Token av
Token ID1196
Feature activation-1.23e-01

Pos logit contributions
pron+0.017
kin+0.015
cle+0.014
fish+0.014
orse+0.014
Neg logit contributions
 If-0.010
 Instead-0.009
 apart-0.009
 They-0.008
licks-0.008
Tokenoc
Token ID420
Feature activation-5.61e-02

Pos logit contributions
 If+0.214
 They+0.203
 Instead+0.194
 blamed+0.188
icks+0.187
Neg logit contributions
pron-0.184
 lifetime-0.162
b-0.161
light-0.159
fish-0.158
Tokenados
Token ID22484
Feature activation-3.88e-02

Pos logit contributions
 If+0.080
 They+0.077
 Instead+0.074
 apart+0.071
atters+0.069
Neg logit contributions
pron-0.099
iest-0.089
cle-0.088
 lifetime-0.086
fish-0.085
Token.
Token ID13
Feature activation+2.50e-03

Pos logit contributions
 apart+0.039
 If+0.039
 all+0.035
 Instead+0.034
 They+0.033
Neg logit contributions
pron-0.097
kin-0.083
inkle-0.083
orse-0.080
cle-0.080
Token Finally
Token ID9461
Feature activation+1.39e-02

Pos logit contributions
pron+0.006
kin+0.006
fish+0.006
iest+0.005
 lifetime+0.005
Neg logit contributions
 If-0.002
 Instead-0.002
 They-0.002
 apart-0.002
licks-0.002
Token,
Token ID11
Feature activation-1.12e-02

Pos logit contributions
pron+0.037
fish+0.033
 lifetime+0.032
cle+0.031
kin+0.031
Neg logit contributions
 all-0.011
 They-0.011
 Instead-0.011
 If-0.010
 apart-0.010
Token Maybe
Token ID6674
Feature activation-2.12e-02

Pos logit contributions
 apart+0.007
licks+0.006
 If+0.005
atters+0.005
icks+0.005
Neg logit contributions
pron-0.030
kin-0.028
orse-0.028
light-0.028
 lifetime-0.027
Token they
Token ID484
Feature activation-1.07e-02

Pos logit contributions
 They+0.026
 If+0.025
 Instead+0.023
 all+0.021
 many+0.021
Neg logit contributions
pron-0.049
kin-0.042
inkle-0.040
 lifetime-0.040
fish-0.038
Token could
Token ID714
Feature activation-2.18e-02

Pos logit contributions
 If+0.018
 Instead+0.017
 apart+0.016
 They+0.016
licks+0.016
Neg logit contributions
pron-0.018
kin-0.016
fish-0.015
 lifetime-0.014
iest-0.014
Token grow
Token ID1663
Feature activation-1.04e-02

Pos logit contributions
 If+0.029
licks+0.026
 Instead+0.026
 apart+0.025
 Other+0.024
Neg logit contributions
pron-0.044
kin-0.041
fish-0.040
inkle-0.038
orse-0.038
Token more
Token ID517
Feature activation+8.56e-03

Pos logit contributions
 If+0.012
 Instead+0.011
 apart+0.011
 They+0.011
licks+0.010
Neg logit contributions
pron-0.023
kin-0.020
fish-0.019
 lifetime-0.019
orse-0.019
Token av
Token ID1196
Feature activation-1.20e-01

Pos logit contributions
pron+0.016
kin+0.014
fish+0.013
 lifetime+0.013
orse+0.013
Neg logit contributions
 If-0.013
 Instead-0.012
 They-0.012
 apart-0.012
licks-0.011
Tokenoc
Token ID420
Feature activation-6.52e-02

Pos logit contributions
 If+0.237
 blamed+0.211
 Instead+0.211
 They+0.207
licks+0.204
Neg logit contributions
pron-0.167
light-0.150
orse-0.145
inkle-0.143
fish-0.142
Tokenados
Token ID22484
Feature activation-4.61e-02

Pos logit contributions
 If+0.109
 They+0.106
 Instead+0.104
 While+0.102
 apart+0.101
Neg logit contributions
pron-0.090
iest-0.087
cle-0.087
gh-0.085
kin-0.084
Token and
Token ID290
Feature activation+1.63e-03

Pos logit contributions
 If+0.051
 apart+0.049
 They+0.046
 Instead+0.046
licks+0.042
Neg logit contributions
pron-0.101
kin-0.091
 lifetime-0.086
cle-0.085
orse-0.085
Token share
Token ID2648
Feature activation-9.58e-03

Pos logit contributions
pron+0.004
kin+0.003
fish+0.003
 lifetime+0.003
orse+0.003
Neg logit contributions
 If-0.002
 Instead-0.002
 apart-0.002
 They-0.002
licks-0.002
Token them
Token ID606
Feature activation+2.70e-02

Pos logit contributions
 If+0.015
 Instead+0.014
 They+0.014
 apart+0.013
licks+0.013
Neg logit contributions
pron-0.017
kin-0.015
 lifetime-0.014
fish-0.014
iest-0.014
Token on
Token ID319
Feature activation-3.92e-03

Pos logit contributions
pron+0.018
fish+0.016
kin+0.016
 lifetime+0.015
orse+0.015
Neg logit contributions
 If-0.012
 They-0.011
 Instead-0.011
 apart-0.010
icks-0.010
Token,
Token ID11
Feature activation-9.09e-03

Pos logit contributions
 They+0.003
 If+0.002
 all+0.002
 So+0.002
 Instead+0.002
Neg logit contributions
pron-0.011
 lifetime-0.010
orse-0.010
light-0.009
fish-0.009
Token whenever
Token ID8797
Feature activation+6.40e-03

Pos logit contributions
 If+0.011
 They+0.010
 Instead+0.010
 apart+0.009
 Even+0.009
Neg logit contributions
pron-0.020
kin-0.018
fish-0.017
 lifetime-0.017
orse-0.017
Token they
Token ID484
Feature activation-4.21e-03

Pos logit contributions
pron+0.016
kin+0.014
itten+0.013
fish+0.013
aker+0.013
Neg logit contributions
 They-0.007
 Instead-0.006
 If-0.006
 all-0.006
 apart-0.005
Token wanted
Token ID2227
Feature activation-2.90e-03

Pos logit contributions
 If+0.007
 Instead+0.006
 They+0.006
 apart+0.006
atters+0.006
Neg logit contributions
pron-0.007
kin-0.006
fish-0.006
iest-0.006
 lifetime-0.006
Token av
Token ID1196
Feature activation-1.19e-01

Pos logit contributions
 If+0.003
 Instead+0.003
 They+0.003
 apart+0.003
 Even+0.003
Neg logit contributions
pron-0.006
kin-0.006
fish-0.006
 lifetime-0.005
orse-0.005
Tokenoc
Token ID420
Feature activation-4.85e-02

Pos logit contributions
 If+0.211
 They+0.188
atters+0.184
 Instead+0.184
 apart+0.182
Neg logit contributions
pron-0.179
light-0.156
 lifetime-0.155
iest-0.150
inkle-0.149
Tokenados
Token ID22484
Feature activation-5.36e-02

Pos logit contributions
 If+0.079
 They+0.074
 apart+0.073
 Instead+0.073
 While+0.071
Neg logit contributions
pron-0.067
cle-0.063
iest-0.063
 happened-0.062
gh-0.061
Token,
Token ID11
Feature activation-5.98e-03

Pos logit contributions
 apart+0.049
 They+0.046
 If+0.046
 Instead+0.045
 all+0.043
Neg logit contributions
pron-0.135
kin-0.118
orse-0.116
cle-0.114
wen-0.114
Token they
Token ID484
Feature activation+3.93e-03

Pos logit contributions
 They+0.008
 If+0.007
 Instead+0.007
 all+0.007
 apart+0.007
Neg logit contributions
pron-0.013
kin-0.011
inkle-0.011
cle-0.011
fish-0.011
Token would
Token ID561
Feature activation-6.82e-03

Pos logit contributions
pron+0.006
kin+0.005
fish+0.005
 lifetime+0.005
iest+0.005
Neg logit contributions
 If-0.007
 Instead-0.006
 They-0.006
 apart-0.006
licks-0.006
Token happy
Token ID3772
Feature activation-1.54e-02

Pos logit contributions
pron+0.003
kin+0.002
fish+0.002
 lifetime+0.002
iest+0.002
Neg logit contributions
 If-0.002
 Instead-0.002
 They-0.002
 apart-0.002
licks-0.002
Token that
Token ID326
Feature activation+4.21e-03

Pos logit contributions
 They+0.013
 Instead+0.012
 If+0.012
 all+0.011
 apart+0.010
Neg logit contributions
pron-0.042
kin-0.038
inkle-0.036
cle-0.036
 lifetime-0.034
Token it
Token ID340
Feature activation+3.98e-03

Pos logit contributions
pron+0.010
kin+0.008
fish+0.008
inkle+0.008
orse+0.008
Neg logit contributions
 If-0.005
 They-0.005
 Instead-0.005
 apart-0.005
 all-0.004
Token produced
Token ID4635
Feature activation-3.83e-03

Pos logit contributions
pron+0.007
kin+0.006
cle+0.006
 lifetime+0.006
orse+0.006
Neg logit contributions
 If-0.006
 Instead-0.006
 They-0.006
 apart-0.006
licks-0.005
Token more
Token ID517
Feature activation+9.32e-03

Pos logit contributions
 If+0.004
 Instead+0.004
 apart+0.003
 They+0.003
 So+0.003
Neg logit contributions
pron-0.009
kin-0.008
fish-0.008
orse-0.008
cle-0.008
Token av
Token ID1196
Feature activation-1.18e-01

Pos logit contributions
pron+0.019
kin+0.017
fish+0.016
 lifetime+0.015
orse+0.015
Neg logit contributions
 If-0.013
 Instead-0.012
 apart-0.011
 They-0.011
licks-0.011
Tokenoc
Token ID420
Feature activation-6.30e-02

Pos logit contributions
 If+0.216
 blamed+0.193
 They+0.193
 Instead+0.193
 slams+0.193
Neg logit contributions
pron-0.176
 lifetime-0.157
light-0.151
fish-0.149
orse-0.147
Tokenados
Token ID22484
Feature activation-4.49e-02

Pos logit contributions
 If+0.094
 They+0.090
 Instead+0.087
 apart+0.085
 While+0.084
Neg logit contributions
pron-0.101
iest-0.095
fish-0.092
cle-0.092
kin-0.091
Token than
Token ID621
Feature activation+1.73e-02

Pos logit contributions
 apart+0.048
 If+0.045
 They+0.044
 Instead+0.044
 all+0.041
Neg logit contributions
pron-0.104
kin-0.093
cle-0.088
inkle-0.086
orse-0.086
Token ever
Token ID1683
Feature activation-1.92e-03

Pos logit contributions
pron+0.036
kin+0.033
fish+0.031
light+0.030
 lifetime+0.030
Neg logit contributions
 If-0.022
 apart-0.020
 Instead-0.020
 They-0.019
licks-0.018
Token before
Token ID878
Feature activation-1.31e-02

Pos logit contributions
 apart+0.002
 If+0.002
 Instead+0.002
 They+0.002
 all+0.002
Neg logit contributions
pron-0.004
kin-0.004
fish-0.003
ves-0.003
cle-0.003
Token "
Token ID366
Feature activation-6.65e-04

Pos logit contributions
 If+0.001
 Instead+0.001
 apart+0.001
 They+0.001
licks+0.001
Neg logit contributions
pron-0.002
kin-0.002
fish-0.002
 lifetime-0.002
iest-0.002
TokenI
Token ID40
Feature activation-3.70e-03

Pos logit contributions
 If+0.001
 They+0.001
 Instead+0.001
 So+0.001
 Even+0.001
Neg logit contributions
kin-0.001
pron-0.001
inkle-0.001
fish-0.001
orse-0.001
Token will
Token ID481
Feature activation-1.25e-02

Pos logit contributions
 If+0.004
 Instead+0.004
 They+0.004
 apart+0.003
atters+0.003
Neg logit contributions
pron-0.008
kin-0.008
fish-0.007
 lifetime-0.007
iest-0.007
Token buy
Token ID2822
Feature activation+5.05e-03

Pos logit contributions
 If+0.012
 Instead+0.011
 apart+0.010
 They+0.010
licks+0.010
Neg logit contributions
pron-0.030
kin-0.027
fish-0.026
 lifetime-0.025
iest-0.025
Token more
Token ID517
Feature activation+1.41e-02

Pos logit contributions
pron+0.010
kin+0.009
fish+0.009
 lifetime+0.009
orse+0.008
Neg logit contributions
 If-0.007
 Instead-0.007
 They-0.006
 apart-0.006
 So-0.006
Token av
Token ID1196
Feature activation-1.18e-01

Pos logit contributions
pron+0.029
kin+0.026
fish+0.025
 lifetime+0.025
light+0.024
Neg logit contributions
 If-0.019
 Instead-0.017
 They-0.017
 apart-0.016
 So-0.016
Tokenoc
Token ID420
Feature activation-5.26e-02

Pos logit contributions
 If+0.210
 They+0.197
 Instead+0.195
 apart+0.187
 blamed+0.187
Neg logit contributions
pron-0.179
kin-0.149
iest-0.149
cle-0.148
b-0.148
Tokenados
Token ID22484
Feature activation-3.40e-02

Pos logit contributions
 If+0.078
 They+0.077
 Instead+0.074
 apart+0.073
 While+0.069
Neg logit contributions
pron-0.086
iest-0.080
cle-0.078
fish-0.076
kin-0.075
Token tomorrow
Token ID9439
Feature activation+1.48e-02

Pos logit contributions
 If+0.041
 apart+0.037
 Instead+0.037
 They+0.036
licks+0.033
Neg logit contributions
pron-0.075
kin-0.067
 lifetime-0.063
fish-0.063
orse-0.063
Token.
Token ID13
Feature activation-2.48e-04

Pos logit contributions
pron+0.037
kin+0.033
fish+0.032
orse+0.032
 lifetime+0.032
Neg logit contributions
 If-0.014
 They-0.013
 Instead-0.013
 So-0.012
 apart-0.012
Token You
Token ID921
Feature activation-4.65e-03

Pos logit contributions
 apart+0.000
atters+0.000
icks+0.000
licks+0.000
 wre+0.000
Neg logit contributions
pron-0.000
cle-0.000
light-0.000
orse-0.000
kin-0.000
Token the
Token ID262
Feature activation-7.82e-04

Pos logit contributions
 If+0.007
 Instead+0.007
licks+0.006
 apart+0.006
atters+0.006
Neg logit contributions
pron-0.008
kin-0.007
fish-0.007
orse-0.007
 lifetime-0.007
Token store
Token ID3650
Feature activation-1.76e-02

Pos logit contributions
 If+0.001
 Instead+0.001
 They+0.001
 apart+0.001
 Other+0.001
Neg logit contributions
pron-0.002
kin-0.001
fish-0.001
iest-0.001
orse-0.001
Token to
Token ID284
Feature activation-6.26e-04

Pos logit contributions
 If+0.021
 Instead+0.019
 They+0.019
 apart+0.018
 So+0.017
Neg logit contributions
pron-0.039
kin-0.035
fish-0.034
iest-0.033
orse-0.032
Token get
Token ID651
Feature activation+1.42e-02

Pos logit contributions
 If+0.001
licks+0.001
 Instead+0.001
atters+0.001
icks+0.001
Neg logit contributions
pron-0.001
kin-0.001
 particular-0.001
fish-0.001
orse-0.001
Token more
Token ID517
Feature activation+7.21e-03

Pos logit contributions
pron+0.033
kin+0.031
fish+0.030
cle+0.029
orse+0.029
Neg logit contributions
 If-0.014
 apart-0.013
 Instead-0.013
 They-0.013
 all-0.011
Token av
Token ID1196
Feature activation-1.18e-01

Pos logit contributions
pron+0.014
kin+0.013
fish+0.012
 lifetime+0.012
cle+0.012
Neg logit contributions
 If-0.010
 Instead-0.009
 apart-0.009
 They-0.009
 So-0.008
Tokenoc
Token ID420
Feature activation-5.97e-02

Pos logit contributions
 If+0.235
 blamed+0.210
 slams+0.205
 Instead+0.205
 They+0.205
Neg logit contributions
pron-0.161
light-0.142
 lifetime-0.141
b-0.138
orse-0.138
Tokenados
Token ID22484
Feature activation-4.33e-02

Pos logit contributions
 If+0.098
 They+0.095
 Instead+0.092
 While+0.090
 apart+0.087
Neg logit contributions
pron-0.086
cle-0.083
iest-0.081
gh-0.080
kin-0.080
Token.
Token ID13
Feature activation-4.72e-03

Pos logit contributions
 apart+0.048
 If+0.047
 Instead+0.046
 They+0.045
 So+0.041
Neg logit contributions
pron-0.097
kin-0.086
fish-0.081
orse-0.080
inkle-0.080
Token Gary
Token ID10936
Feature activation-2.49e-03

Pos logit contributions
atters+0.005
 apart+0.005
 If+0.005
licks+0.004
 slams+0.004
Neg logit contributions
pron-0.009
kin-0.008
orse-0.008
fish-0.008
 lifetime-0.008
Token looked
Token ID3114
Feature activation-5.22e-03

Pos logit contributions
 If+0.004
 Instead+0.003
 They+0.003
 So+0.003
 Even+0.003
Neg logit contributions
pron-0.005
kin-0.004
iest-0.004
fish-0.004
 lifetime-0.004
Token girl
Token ID2576
Feature activation-4.65e-03

Pos logit contributions
 If+0.029
 Instead+0.028
 apart+0.027
 They+0.027
licks+0.026
Neg logit contributions
pron-0.026
kin-0.023
fish-0.022
 lifetime-0.020
iest-0.020
Token who
Token ID508
Feature activation-1.10e-02

Pos logit contributions
 If+0.006
 Instead+0.005
 They+0.005
 apart+0.005
atters+0.005
Neg logit contributions
pron-0.010
kin-0.009
fish-0.008
 lifetime-0.008
iest-0.008
Token didn
Token ID1422
Feature activation-4.66e-03

Pos logit contributions
 If+0.014
 They+0.013
 Instead+0.013
atters+0.011
 So+0.011
Neg logit contributions
pron-0.023
kin-0.021
fish-0.021
 lifetime-0.020
orse-0.020
Token't
Token ID470
Feature activation+2.57e-03

Pos logit contributions
 If+0.005
 Instead+0.004
 apart+0.004
 They+0.004
licks+0.004
Neg logit contributions
pron-0.011
kin-0.010
fish-0.010
 lifetime-0.009
iest-0.009
Token like
Token ID588
Feature activation+1.07e-02

Pos logit contributions
pron+0.006
kin+0.006
fish+0.005
 lifetime+0.005
iest+0.005
Neg logit contributions
 If-0.002
 Instead-0.002
 They-0.002
 apart-0.002
licks-0.002
Token av
Token ID1196
Feature activation-1.15e-01

Pos logit contributions
pron+0.023
kin+0.022
fish+0.020
 lifetime+0.020
iest+0.020
Neg logit contributions
 If-0.012
 Instead-0.011
 They-0.011
 apart-0.011
licks-0.010
Tokenoc
Token ID420
Feature activation-4.74e-02

Pos logit contributions
 If+0.218
 They+0.193
 Instead+0.182
 apart+0.181
atters+0.180
Neg logit contributions
pron-0.171
inkle-0.146
light-0.145
b-0.143
iflower-0.141
Tokenados
Token ID22484
Feature activation-4.27e-02

Pos logit contributions
 If+0.084
 They+0.082
 apart+0.075
 Instead+0.073
 While+0.072
Neg logit contributions
pron-0.066
gh-0.061
 happened-0.060
iest-0.060
iflower-0.056
Token.
Token ID13
Feature activation-4.69e-04

Pos logit contributions
 If+0.055
 Instead+0.051
 apart+0.050
 They+0.049
licks+0.046
Neg logit contributions
pron-0.088
kin-0.079
fish-0.074
 lifetime-0.074
orse-0.073
Token She
Token ID1375
Feature activation+5.89e-03

Pos logit contributions
 apart+0.000
icks+0.000
atters+0.000
licks+0.000
 If+0.000
Neg logit contributions
pron-0.001
kin-0.001
 particular-0.001
 moment-0.001
light-0.001
Token always
Token ID1464
Feature activation+1.71e-02

Pos logit contributions
pron+0.012
kin+0.011
 lifetime+0.011
iest+0.011
fish+0.010
Neg logit contributions
 If-0.007
 They-0.007
 Instead-0.007
 So-0.006
 Even-0.006
Tokenongo
Token ID25162
Feature activation+4.92e-03

Pos logit contributions
 If+0.045
 Instead+0.042
 apart+0.041
 They+0.041
licks+0.040
Neg logit contributions
pron-0.044
kin-0.039
fish-0.037
 lifetime-0.035
iest-0.035
Token didn
Token ID1422
Feature activation-1.89e-02

Pos logit contributions
pron+0.007
kin+0.006
fish+0.006
orse+0.006
 happened+0.006
Neg logit contributions
 If-0.009
 apart-0.008
 Instead-0.008
 They-0.008
licks-0.008
Token't
Token ID470
Feature activation-6.95e-03

Pos logit contributions
 If+0.023
 Instead+0.022
 apart+0.021
licks+0.021
 They+0.020
Neg logit contributions
pron-0.038
kin-0.035
fish-0.032
 happened-0.032
inkle-0.032
Token like
Token ID588
Feature activation+2.17e-03

Pos logit contributions
 If+0.007
 Instead+0.007
licks+0.007
 Other+0.007
ocular+0.006
Neg logit contributions
pron-0.014
kin-0.014
orse-0.013
iest-0.013
 moment-0.013
Token wet
Token ID9583
Feature activation-8.13e-03

Pos logit contributions
pron+0.004
kin+0.003
fish+0.003
 happened+0.003
orse+0.003
Neg logit contributions
 If-0.003
 Instead-0.003
atters-0.003
 apart-0.003
 They-0.003
Token av
Token ID1196
Feature activation-1.15e-01

Pos logit contributions
 If+0.014
 Instead+0.012
atters+0.012
licks+0.012
icks+0.012
Neg logit contributions
pron-0.013
kin-0.012
 happened-0.011
iest-0.011
fish-0.011
Tokenoc
Token ID420
Feature activation-5.16e-02

Pos logit contributions
 If+0.235
 They+0.207
 blamed+0.200
 First+0.199
 slams+0.197
Neg logit contributions
pron-0.144
b-0.130
light-0.124
inkle-0.123
iflower-0.122
Tokenados
Token ID22484
Feature activation-4.46e-02

Pos logit contributions
 They+0.094
 If+0.093
 While+0.086
 apart+0.084
 slam+0.082
Neg logit contributions
gh-0.063
iest-0.060
 happened-0.059
pron-0.059
iflower-0.057
Token,
Token ID11
Feature activation+8.82e-03

Pos logit contributions
 If+0.059
 Instead+0.055
 apart+0.054
 They+0.053
licks+0.050
Neg logit contributions
pron-0.090
kin-0.081
fish-0.077
 lifetime-0.075
iest-0.074
Token so
Token ID523
Feature activation+2.67e-02

Pos logit contributions
pron+0.024
kin+0.023
 lifetime+0.021
fish+0.021
inkle+0.021
Neg logit contributions
 If-0.006
 They-0.006
 Instead-0.006
 apart-0.005
 So-0.004
Token he
Token ID339
Feature activation-1.78e-03

Pos logit contributions
pron+0.063
kin+0.056
fish+0.054
inkle+0.054
iest+0.052
Neg logit contributions
 They-0.030
 all-0.030
 Instead-0.029
 He-0.028
 many-0.027
Token away
Token ID1497
Feature activation+1.49e-03

Pos logit contributions
 If+0.023
 Instead+0.022
 apart+0.021
 They+0.020
licks+0.019
Neg logit contributions
pron-0.045
kin-0.042
fish-0.040
iest-0.039
orse-0.039
Token.
Token ID13
Feature activation+2.00e-03

Pos logit contributions
pron+0.003
kin+0.003
fish+0.002
 lifetime+0.002
iest+0.002
Neg logit contributions
 If-0.002
 Instead-0.002
 They-0.002
licks-0.002
 apart-0.002
Token She
Token ID1375
Feature activation+2.23e-04

Pos logit contributions
pron+0.004
 happened+0.003
 lifetime+0.003
 moment+0.003
light+0.003
Neg logit contributions
 apart-0.002
licks-0.002
 wre-0.002
 If-0.002
icks-0.002
Token learned
Token ID4499
Feature activation-2.62e-02

Pos logit contributions
pron+0.000
kin+0.000
iest+0.000
fish+0.000
 lifetime+0.000
Neg logit contributions
 If-0.000
 Instead-0.000
 They-0.000
 apart-0.000
 Even-0.000
Token that
Token ID326
Feature activation-3.25e-03

Pos logit contributions
 If+0.040
 Instead+0.037
 apart+0.037
licks+0.037
atters+0.037
Neg logit contributions
pron-0.046
kin-0.040
fish-0.039
 happened-0.037
iest-0.037
Token av
Token ID1196
Feature activation-1.14e-01

Pos logit contributions
 If+0.005
 Instead+0.004
 They+0.004
 apart+0.004
licks+0.004
Neg logit contributions
pron-0.006
kin-0.006
fish-0.005
 lifetime-0.005
iest-0.005
Tokenoc
Token ID420
Feature activation-4.63e-02

Pos logit contributions
 If+0.188
 Instead+0.173
 They+0.173
 apart+0.170
 blamed+0.167
Neg logit contributions
pron-0.180
kin-0.161
iest-0.154
light-0.153
fish-0.150
Tokenados
Token ID22484
Feature activation-4.15e-02

Pos logit contributions
 If+0.076
 They+0.076
 apart+0.072
 Instead+0.071
 While+0.070
Neg logit contributions
pron-0.069
iest-0.063
gh-0.063
iflower-0.062
kin-0.062
Token are
Token ID389
Feature activation-2.13e-02

Pos logit contributions
 If+0.043
 apart+0.039
 Instead+0.039
 They+0.038
licks+0.036
Neg logit contributions
pron-0.098
kin-0.087
 lifetime-0.084
cle-0.083
orse-0.083
Token good
Token ID922
Feature activation-4.61e-03

Pos logit contributions
 If+0.027
 Instead+0.024
 apart+0.024
 They+0.023
licks+0.022
Neg logit contributions
pron-0.045
kin-0.041
fish-0.039
orse-0.038
 lifetime-0.038
Token for
Token ID329
Feature activation-5.64e-03

Pos logit contributions
 If+0.006
 Instead+0.005
 apart+0.005
 They+0.005
licks+0.005
Neg logit contributions
pron-0.009
kin-0.009
fish-0.008
iest-0.008
 lifetime-0.008
Token store
Token ID3650
Feature activation-1.70e-02

Pos logit contributions
pron+0.022
kin+0.019
fish+0.018
 lifetime+0.018
iest+0.017
Neg logit contributions
 If-0.015
 Instead-0.014
 apart-0.014
 They-0.014
atters-0.013
Token and
Token ID290
Feature activation-7.76e-04

Pos logit contributions
 If+0.020
 Instead+0.019
 apart+0.018
 They+0.018
licks+0.017
Neg logit contributions
pron-0.037
kin-0.033
fish-0.032
 lifetime-0.031
iest-0.031
Token forgot
Token ID16453
Feature activation+1.65e-02

Pos logit contributions
 If+0.001
 Instead+0.001
 They+0.001
 apart+0.001
licks+0.001
Neg logit contributions
pron-0.001
kin-0.001
fish-0.001
iest-0.001
inkle-0.001
Token to
Token ID284
Feature activation+1.07e-02

Pos logit contributions
pron+0.039
kin+0.036
fish+0.035
 lifetime+0.034
iest+0.034
Neg logit contributions
 If-0.016
 Instead-0.015
 They-0.014
 apart-0.014
licks-0.013
Token buy
Token ID2822
Feature activation+1.21e-02

Pos logit contributions
pron+0.024
kin+0.022
fish+0.020
orse+0.020
cle+0.020
Neg logit contributions
 If-0.012
 Instead-0.011
licks-0.011
 apart-0.010
atters-0.010
Token av
Token ID1196
Feature activation-1.13e-01

Pos logit contributions
pron+0.023
kin+0.021
fish+0.020
iest+0.019
 lifetime+0.019
Neg logit contributions
 If-0.017
 Instead-0.015
 apart-0.015
 They-0.015
atters-0.014
Tokenoc
Token ID420
Feature activation-4.24e-02

Pos logit contributions
 If+0.203
 They+0.190
 blamed+0.182
icks+0.181
 apart+0.180
Neg logit contributions
pron-0.169
b-0.143
cle-0.141
 lifetime-0.138
iflower-0.136
Tokenados
Token ID22484
Feature activation-3.80e-02

Pos logit contributions
 If+0.069
 They+0.068
 apart+0.065
 Instead+0.064
 While+0.063
Neg logit contributions
pron-0.061
iest-0.057
cle-0.055
iflower-0.054
 happened-0.053
Token.
Token ID13
Feature activation-2.57e-03

Pos logit contributions
 If+0.039
 apart+0.037
 Instead+0.036
 They+0.035
 So+0.032
Neg logit contributions
pron-0.089
kin-0.081
orse-0.077
 lifetime-0.077
fish-0.076
Token Lily
Token ID20037
Feature activation+8.39e-03

Pos logit contributions
icks+0.003
 If+0.003
licks+0.002
 apart+0.002
atters+0.002
Neg logit contributions
pron-0.006
kin-0.005
 happened-0.005
 moment-0.005
 lifetime-0.005
Token was
Token ID373
Feature activation-3.21e-03

Pos logit contributions
pron+0.013
kin+0.011
fish+0.011
orse+0.010
 lifetime+0.010
Neg logit contributions
 If-0.015
 Instead-0.014
 apart-0.014
icks-0.014
 They-0.014
Token Everyday
Token ID48154
Feature activation+1.25e-02

Pos logit contributions
pron+0.021
kin+0.020
cle+0.019
 lifetime+0.018
light+0.018
Neg logit contributions
 If-0.009
 They-0.009
 apart-0.009
 Instead-0.009
 all-0.008
Token he
Token ID339
Feature activation+1.20e-02

Pos logit contributions
pron+0.030
kin+0.028
inkle+0.026
fish+0.024
aker+0.024
Neg logit contributions
 They-0.015
 all-0.013
 He-0.012
 Instead-0.012
 So-0.012
Token would
Token ID561
Feature activation+2.33e-03

Pos logit contributions
pron+0.024
kin+0.022
iest+0.020
cle+0.020
 lifetime+0.019
Neg logit contributions
 If-0.016
 They-0.015
 Instead-0.015
 So-0.015
 apart-0.014
Token take
Token ID1011
Feature activation-7.56e-03

Pos logit contributions
pron+0.005
kin+0.005
fish+0.005
iest+0.005
 lifetime+0.005
Neg logit contributions
 If-0.002
 Instead-0.002
 They-0.002
 apart-0.002
 Even-0.002
Token the
Token ID262
Feature activation-4.90e-03

Pos logit contributions
 If+0.009
 apart+0.008
 They+0.008
 Instead+0.008
 Other+0.007
Neg logit contributions
pron-0.017
kin-0.016
fish-0.015
orse-0.014
 lifetime-0.014
Token av
Token ID1196
Feature activation-1.13e-01

Pos logit contributions
 If+0.007
 apart+0.006
 all+0.006
 Instead+0.006
 Other+0.006
Neg logit contributions
pron-0.009
kin-0.008
orse-0.008
inkle-0.008
ves-0.008
Tokenoc
Token ID420
Feature activation-4.72e-02

Pos logit contributions
 If+0.186
 They+0.173
 Instead+0.172
 blamed+0.165
atters+0.164
Neg logit contributions
pron-0.191
fish-0.159
 lifetime-0.158
kin-0.158
light-0.157
Tokenados
Token ID22484
Feature activation-3.12e-02

Pos logit contributions
 If+0.064
 They+0.060
 Instead+0.059
 apart+0.057
atters+0.056
Neg logit contributions
pron-0.091
kin-0.079
fish-0.078
cle-0.078
iest-0.078
Token and
Token ID290
Feature activation+6.30e-03

Pos logit contributions
 If+0.029
 apart+0.027
 They+0.025
 Instead+0.024
 all+0.022
Neg logit contributions
pron-0.078
kin-0.071
orse-0.068
inkle-0.068
 lifetime-0.066
Token scatter
Token ID41058
Feature activation+4.16e-03

Pos logit contributions
pron+0.013
kin+0.012
fish+0.011
iest+0.011
orse+0.011
Neg logit contributions
 If-0.008
 Instead-0.007
 They-0.007
 apart-0.007
licks-0.006
Token them
Token ID606
Feature activation+2.48e-02

Pos logit contributions
pron+0.008
kin+0.008
fish+0.007
orse+0.007
 lifetime+0.007
Neg logit contributions
 If-0.006
 Instead-0.005
 apart-0.005
 They-0.005
licks-0.005
Token said
Token ID531
Feature activation+2.52e-02

Pos logit contributions
pron+0.022
kin+0.020
iest+0.018
fish+0.018
 lifetime+0.018
Neg logit contributions
 If-0.019
 Instead-0.018
 They-0.017
 apart-0.017
 So-0.017
Token,
Token ID11
Feature activation-1.40e-04

Pos logit contributions
pron+0.054
kin+0.049
 lifetime+0.047
light+0.046
fish+0.046
Neg logit contributions
 If-0.031
 Instead-0.028
 They-0.027
 apart-0.027
 Even-0.025
Token "
Token ID366
Feature activation-2.39e-03

Pos logit contributions
 If+0.000
 They+0.000
 Instead+0.000
 Even+0.000
 apart+0.000
Neg logit contributions
pron-0.000
kin-0.000
iest-0.000
fish-0.000
inkle-0.000
TokenI
Token ID40
Feature activation-5.35e-03

Pos logit contributions
 If+0.003
 Instead+0.003
 They+0.003
 So+0.003
 Even+0.003
Neg logit contributions
pron-0.005
kin-0.005
inkle-0.004
fish-0.004
iest-0.004
Token love
Token ID1842
Feature activation+8.42e-03

Pos logit contributions
 If+0.006
 Instead+0.005
 They+0.005
 apart+0.005
atters+0.004
Neg logit contributions
pron-0.012
kin-0.011
fish-0.011
 lifetime-0.011
iest-0.011
Token av
Token ID1196
Feature activation-1.12e-01

Pos logit contributions
pron+0.015
kin+0.013
fish+0.013
orse+0.012
 lifetime+0.012
Neg logit contributions
 If-0.013
 Instead-0.012
 apart-0.012
 They-0.012
atters-0.011
Tokenoc
Token ID420
Feature activation-4.27e-02

Pos logit contributions
 If+0.211
 They+0.200
 Instead+0.190
 slams+0.189
 blamed+0.188
Neg logit contributions
pron-0.159
b-0.130
light-0.127
 lifetime-0.127
cle-0.127
Tokenados
Token ID22484
Feature activation-4.18e-02

Pos logit contributions
 They+0.071
 If+0.070
 Instead+0.065
 apart+0.065
 While+0.064
Neg logit contributions
pron-0.060
iest-0.056
cle-0.054
gh-0.053
iflower-0.053
Token!
Token ID0
Feature activation+2.12e-02

Pos logit contributions
 If+0.053
 Instead+0.049
 apart+0.048
 They+0.047
licks+0.044
Neg logit contributions
pron-0.087
kin-0.078
fish-0.074
 lifetime-0.073
orse-0.072
Token They
Token ID1119
Feature activation-9.27e-04

Pos logit contributions
pron+0.037
orse+0.032
kin+0.032
cle+0.030
fish+0.030
Neg logit contributions
 apart-0.030
atters-0.026
licks-0.025
icks-0.025
 slams-0.025
Token are
Token ID389
Feature activation-9.98e-04

Pos logit contributions
 If+0.001
 Instead+0.001
licks+0.001
atters+0.001
 apart+0.001
Neg logit contributions
pron-0.002
kin-0.002
fish-0.002
iest-0.002
 happened-0.002
Token if
Token ID611
Feature activation+3.04e-03

Pos logit contributions
 If+0.024
 Instead+0.022
atters+0.022
licks+0.021
 apart+0.021
Neg logit contributions
pron-0.035
kin-0.031
fish-0.030
iest-0.028
 happened-0.028
Token she
Token ID673
Feature activation-1.01e-02

Pos logit contributions
pron+0.006
inkle+0.005
kin+0.005
fish+0.005
light+0.005
Neg logit contributions
 If-0.004
 They-0.004
 Instead-0.004
 all-0.004
 apart-0.004
Token could
Token ID714
Feature activation-2.20e-02

Pos logit contributions
 If+0.017
 Instead+0.015
 apart+0.015
 They+0.015
licks+0.015
Neg logit contributions
pron-0.017
kin-0.015
fish-0.014
 happened-0.014
orse-0.014
Token request
Token ID2581
Feature activation+9.88e-03

Pos logit contributions
 If+0.028
licks+0.027
 apart+0.025
 Instead+0.025
atters+0.024
Neg logit contributions
pron-0.045
kin-0.040
iest-0.037
fish-0.037
inkle-0.036
Token some
Token ID617
Feature activation+3.55e-04

Pos logit contributions
pron+0.020
kin+0.018
fish+0.017
 lifetime+0.016
orse+0.016
Neg logit contributions
 If-0.014
 Instead-0.013
 They-0.012
 apart-0.012
licks-0.011
Token av
Token ID1196
Feature activation-1.12e-01

Pos logit contributions
pron+0.001
kin+0.001
fish+0.001
light+0.001
orse+0.001
Neg logit contributions
 If-0.000
 Instead-0.000
 apart-0.000
 They-0.000
 So-0.000
Tokenoc
Token ID420
Feature activation-4.60e-02

Pos logit contributions
 If+0.197
 They+0.183
 Instead+0.179
 blamed+0.178
atters+0.177
Neg logit contributions
pron-0.169
cle-0.140
iest-0.140
 lifetime-0.139
b-0.139
Tokenados
Token ID22484
Feature activation-4.49e-02

Pos logit contributions
 If+0.070
 They+0.068
 Instead+0.066
 apart+0.065
atters+0.063
Neg logit contributions
pron-0.076
iest-0.068
cle-0.067
fish-0.065
kin-0.064
Token from
Token ID422
Feature activation-1.96e-02

Pos logit contributions
 If+0.039
 apart+0.038
 They+0.036
 Instead+0.035
 all+0.033
Neg logit contributions
pron-0.115
inkle-0.101
kin-0.100
 lifetime-0.100
orse-0.099
Token the
Token ID262
Feature activation+2.15e-02

Pos logit contributions
 If+0.023
 Instead+0.020
 They+0.020
 apart+0.020
 all+0.018
Neg logit contributions
pron-0.043
kin-0.040
iest-0.039
fish-0.039
 lifetime-0.038
Token store
Token ID3650
Feature activation-1.03e-02

Pos logit contributions
pron+0.044
kin+0.040
fish+0.039
iest+0.037
 lifetime+0.037
Neg logit contributions
 If-0.028
 Instead-0.025
 apart-0.024
 They-0.024
licks-0.023
Token the
Token ID262
Feature activation-1.10e-02

Pos logit contributions
 If+0.018
 Instead+0.017
 apart+0.016
 They+0.016
licks+0.016
Neg logit contributions
pron-0.024
kin-0.021
fish-0.020
 lifetime-0.019
iest-0.019
Token tree
Token ID5509
Feature activation-3.62e-02

Pos logit contributions
 If+0.016
 Instead+0.015
 apart+0.015
 They+0.015
licks+0.014
Neg logit contributions
pron-0.021
kin-0.018
fish-0.018
 lifetime-0.017
iest-0.017
Token if
Token ID611
Feature activation+6.13e-03

Pos logit contributions
 If+0.047
 Instead+0.043
 They+0.041
 apart+0.041
licks+0.039
Neg logit contributions
pron-0.075
kin-0.068
fish-0.064
 lifetime-0.062
iest-0.062
Token they
Token ID484
Feature activation-1.74e-02

Pos logit contributions
pron+0.012
kin+0.011
inkle+0.010
orse+0.010
fish+0.010
Neg logit contributions
 If-0.009
 Instead-0.008
 They-0.008
 all-0.008
 apart-0.008
Token were
Token ID547
Feature activation-1.19e-02

Pos logit contributions
 If+0.027
 Instead+0.025
 apart+0.024
 They+0.024
licks+0.024
Neg logit contributions
pron-0.031
kin-0.028
fish-0.026
 lifetime-0.025
iest-0.025
Token av
Token ID1196
Feature activation-1.11e-01

Pos logit contributions
 If+0.016
 Instead+0.015
 apart+0.015
 They+0.014
licks+0.013
Neg logit contributions
pron-0.024
kin-0.021
fish-0.021
 lifetime-0.020
light-0.020
Tokenoc
Token ID420
Feature activation-4.14e-02

Pos logit contributions
 If+0.214
 They+0.200
 Instead+0.187
 All+0.185
 blamed+0.184
Neg logit contributions
pron-0.146
light-0.136
 lifetime-0.134
b-0.129
cle-0.126
Tokenados
Token ID22484
Feature activation-5.40e-02

Pos logit contributions
 If+0.065
 They+0.063
 Instead+0.060
 apart+0.059
 While+0.059
Neg logit contributions
pron-0.060
iest-0.059
cle-0.058
gh-0.055
fish-0.055
Token,
Token ID11
Feature activation-5.36e-03

Pos logit contributions
 apart+0.054
 If+0.051
 Instead+0.049
 They+0.048
 all+0.046
Neg logit contributions
pron-0.131
kin-0.114
inkle-0.110
orse-0.110
cle-0.109
Token and
Token ID290
Feature activation-1.07e-03

Pos logit contributions
 If+0.005
 They+0.005
 Instead+0.005
 apart+0.004
 So+0.004
Neg logit contributions
pron-0.013
kin-0.012
 lifetime-0.012
inkle-0.012
fish-0.012
Token they
Token ID484
Feature activation+1.01e-02

Pos logit contributions
 If+0.002
 Instead+0.001
 They+0.001
 apart+0.001
 So+0.001
Neg logit contributions
pron-0.002
kin-0.002
inkle-0.002
fish-0.002
iest-0.002
Token and
Token ID290
Feature activation-1.40e-02

Pos logit contributions
 If+0.008
 Instead+0.007
atters+0.007
licks+0.007
 Other+0.007
Neg logit contributions
pron-0.009
kin-0.008
fish-0.008
light-0.008
iest-0.007
Token try
Token ID1949
Feature activation+9.80e-03

Pos logit contributions
 If+0.020
 Instead+0.019
 Other+0.018
licks+0.017
atters+0.017
Neg logit contributions
pron-0.026
kin-0.024
fish-0.023
orse-0.022
light-0.022
Token to
Token ID284
Feature activation-1.68e-02

Pos logit contributions
pron+0.021
kin+0.019
fish+0.018
orse+0.017
 lifetime+0.017
Neg logit contributions
 If-0.012
 Instead-0.011
 apart-0.010
 They-0.010
licks-0.010
Token grab
Token ID5552
Feature activation+1.30e-02

Pos logit contributions
 If+0.026
 Instead+0.025
 They+0.023
 Other+0.023
licks+0.023
Neg logit contributions
pron-0.030
kin-0.029
fish-0.028
orse-0.025
inkle-0.025
Token the
Token ID262
Feature activation-1.07e-02

Pos logit contributions
pron+0.023
kin+0.022
fish+0.021
iest+0.020
light+0.019
Neg logit contributions
 If-0.019
 Instead-0.018
atters-0.017
licks-0.017
 They-0.017
Token av
Token ID1196
Feature activation-1.11e-01

Pos logit contributions
 If+0.014
 apart+0.013
 Instead+0.013
 They+0.013
icks+0.012
Neg logit contributions
pron-0.022
kin-0.019
light-0.019
orse-0.018
 lifetime-0.018
Tokenoc
Token ID420
Feature activation-4.30e-02

Pos logit contributions
 If+0.200
 blamed+0.194
 They+0.186
 Instead+0.182
 slams+0.178
Neg logit contributions
pron-0.166
fish-0.144
kin-0.139
b-0.137
light-0.136
Tokenados
Token ID22484
Feature activation-3.67e-02

Pos logit contributions
 If+0.067
 They+0.063
 Instead+0.061
 apart+0.059
 blamed+0.059
Neg logit contributions
pron-0.072
kin-0.064
fish-0.064
cle-0.064
iest-0.062
Token.
Token ID13
Feature activation-1.40e-02

Pos logit contributions
 If+0.052
 Instead+0.049
atters+0.046
 Other+0.046
 They+0.046
Neg logit contributions
pron-0.067
kin-0.063
fish-0.061
iest-0.056
orse-0.056
Token But
Token ID887
Feature activation+1.04e-02

Pos logit contributions
 apart+0.009
licks+0.008
 If+0.008
atters+0.008
 Instead+0.007
Neg logit contributions
pron-0.035
kin-0.032
orse-0.031
fish-0.030
ves-0.030
Token the
Token ID262
Feature activation+3.45e-03

Pos logit contributions
pron+0.023
 lifetime+0.021
iest+0.020
light+0.019
kin+0.019
Neg logit contributions
 If-0.013
 apart-0.012
 They-0.011
 Instead-0.011
 all-0.011
Token but
Token ID475
Feature activation+8.22e-03

Pos logit contributions
pron+0.018
kin+0.016
fish+0.016
 lifetime+0.015
iest+0.015
Neg logit contributions
 If-0.012
 Instead-0.011
 apart-0.010
 They-0.010
atters-0.010
Token there
Token ID612
Feature activation-2.20e-02

Pos logit contributions
pron+0.018
kin+0.016
iest+0.015
inkle+0.015
cle+0.015
Neg logit contributions
 If-0.010
 They-0.010
 Instead-0.009
 apart-0.009
 all-0.009
Token were
Token ID547
Feature activation-7.39e-04

Pos logit contributions
 They+0.026
 Instead+0.025
 If+0.025
 all+0.024
 All+0.022
Neg logit contributions
pron-0.052
cle-0.044
 lifetime-0.044
kin-0.043
inkle-0.042
Token no
Token ID645
Feature activation+3.20e-02

Pos logit contributions
 If+0.001
 Instead+0.001
 They+0.001
 apart+0.001
 all+0.001
Neg logit contributions
pron-0.002
kin-0.002
cle-0.002
inkle-0.002
fish-0.002
Token other
Token ID584
Feature activation-1.22e-02

Pos logit contributions
pron+0.058
kin+0.052
fish+0.050
iest+0.048
 lifetime+0.048
Neg logit contributions
 If-0.048
 Instead-0.044
 They-0.043
 apart-0.043
licks-0.042
Token av
Token ID1196
Feature activation-1.10e-01

Pos logit contributions
 If+0.015
 Instead+0.015
 They+0.014
 apart+0.014
 So+0.013
Neg logit contributions
pron-0.027
kin-0.024
fish-0.022
cle-0.022
orse-0.022
Tokenoc
Token ID420
Feature activation-4.61e-02

Pos logit contributions
 If+0.219
 They+0.195
 Instead+0.192
 Sometimes+0.188
 Even+0.186
Neg logit contributions
pron-0.135
b-0.126
light-0.124
orse-0.124
 lifetime-0.123
Tokenados
Token ID22484
Feature activation-4.32e-02

Pos logit contributions
 If+0.084
 They+0.083
 While+0.078
 Instead+0.077
 Sadly+0.075
Neg logit contributions
iest-0.056
gh-0.054
pron-0.053
 happened-0.053
we-0.050
Token.
Token ID13
Feature activation-3.82e-03

Pos logit contributions
 apart+0.046
 If+0.044
 Instead+0.042
 They+0.042
 all+0.039
Neg logit contributions
pron-0.100
kin-0.088
cle-0.084
inkle-0.083
 lifetime-0.083
Token
Token ID198
Feature activation+6.23e-03

Pos logit contributions
 apart+0.003
icks+0.003
atters+0.003
licks+0.003
 If+0.003
Neg logit contributions
pron-0.008
orse-0.007
 happened-0.007
light-0.007
 lifetime-0.007
Token
Token ID198
Feature activation+9.99e-03

Pos logit contributions
pron+0.013
kin+0.012
fish+0.011
orse+0.011
 lifetime+0.011
Neg logit contributions
 If-0.007
 apart-0.007
 Instead-0.007
licks-0.006
atters-0.006
Token.
Token ID13
Feature activation+5.61e-03

Pos logit contributions
pron+0.018
kin+0.016
fish+0.015
 lifetime+0.015
orse+0.015
Neg logit contributions
 If-0.008
 They-0.008
 Instead-0.008
 apart-0.007
 So-0.007
Token Did
Token ID7731
Feature activation-7.14e-03

Pos logit contributions
pron+0.010
kin+0.009
light+0.009
orse+0.009
cle+0.009
Neg logit contributions
 apart-0.007
licks-0.006
atters-0.006
 If-0.006
 Instead-0.006
Token you
Token ID345
Feature activation-3.76e-03

Pos logit contributions
 If+0.011
 Instead+0.010
 They+0.010
 apart+0.010
licks+0.009
Neg logit contributions
pron-0.013
kin-0.012
fish-0.011
 lifetime-0.011
iest-0.011
Token know
Token ID760
Feature activation+9.20e-03

Pos logit contributions
 If+0.004
 Instead+0.003
 apart+0.003
 They+0.003
atters+0.003
Neg logit contributions
pron-0.009
kin-0.008
fish-0.008
 lifetime-0.008
iest-0.008
Token that
Token ID326
Feature activation+5.25e-03

Pos logit contributions
pron+0.021
kin+0.020
 lifetime+0.018
orse+0.018
fish+0.018
Neg logit contributions
 If-0.010
 Instead-0.009
 apart-0.009
 They-0.008
licks-0.008
Token av
Token ID1196
Feature activation-1.09e-01

Pos logit contributions
pron+0.011
kin+0.010
cle+0.010
 lifetime+0.010
inkle+0.010
Neg logit contributions
 If-0.006
 Instead-0.006
 apart-0.006
 They-0.005
 all-0.005
Tokenoc
Token ID420
Feature activation-4.14e-02

Pos logit contributions
 If+0.171
 They+0.158
 Instead+0.157
 apart+0.152
atters+0.147
Neg logit contributions
pron-0.192
kin-0.167
fish-0.162
iest-0.159
 lifetime-0.157
Tokenados
Token ID22484
Feature activation-3.97e-02

Pos logit contributions
 If+0.057
 They+0.056
 Instead+0.054
 apart+0.053
atters+0.051
Neg logit contributions
pron-0.075
iest-0.068
fish-0.066
cle-0.065
kin-0.065
Token are
Token ID389
Feature activation-9.72e-03

Pos logit contributions
 If+0.041
 Instead+0.039
 apart+0.038
 They+0.038
licks+0.035
Neg logit contributions
pron-0.091
 lifetime-0.080
kin-0.080
cle-0.078
orse-0.078
Token very
Token ID845
Feature activation+8.47e-03

Pos logit contributions
 If+0.010
 Instead+0.009
 apart+0.009
 They+0.008
icks+0.008
Neg logit contributions
pron-0.022
kin-0.021
fish-0.020
light-0.020
 lifetime-0.020
Token healthy
Token ID5448
Feature activation-1.11e-02

Pos logit contributions
pron+0.014
kin+0.012
fish+0.012
iest+0.011
 lifetime+0.011
Neg logit contributions
 If-0.014
 Instead-0.014
 apart-0.013
 They-0.013
licks-0.013
Token something
Token ID1223
Feature activation-2.08e-02

Pos logit contributions
 If+0.037
 Instead+0.035
 They+0.033
 apart+0.032
 So+0.030
Neg logit contributions
pron-0.058
kin-0.053
fish-0.050
orse-0.049
 lifetime-0.048
Token new
Token ID649
Feature activation-6.57e-03

Pos logit contributions
 If+0.031
 Instead+0.030
 They+0.028
 apart+0.028
 So+0.027
Neg logit contributions
pron-0.040
kin-0.036
orse-0.034
cle-0.033
light-0.033
Token!
Token ID0
Feature activation+1.21e-02

Pos logit contributions
 If+0.006
 Instead+0.005
 They+0.005
 So+0.005
 all+0.005
Neg logit contributions
pron-0.018
inkle-0.015
orse-0.015
kin-0.015
cle-0.015
Token She
Token ID1375
Feature activation+2.59e-03

Pos logit contributions
pron+0.029
kin+0.027
 lifetime+0.027
fish+0.026
orse+0.026
Neg logit contributions
 If-0.008
 apart-0.008
atters-0.007
licks-0.007
 Instead-0.007
Token thought
Token ID1807
Feature activation+5.24e-04

Pos logit contributions
pron+0.005
kin+0.005
iest+0.004
fish+0.004
 lifetime+0.004
Neg logit contributions
 If-0.003
 Instead-0.003
 They-0.003
 Even-0.003
 So-0.003
Token av
Token ID1196
Feature activation-1.08e-01

Pos logit contributions
pron+0.001
kin+0.001
cle+0.001
 lifetime+0.001
light+0.001
Neg logit contributions
 If-0.001
 They-0.001
 Instead-0.001
 apart-0.001
 Even-0.001
Tokenoc
Token ID420
Feature activation-4.81e-02

Pos logit contributions
 If+0.181
 They+0.164
 apart+0.163
 Instead+0.161
atters+0.158
Neg logit contributions
pron-0.173
kin-0.148
light-0.148
iest-0.146
 lifetime-0.143
Tokenados
Token ID22484
Feature activation-4.26e-02

Pos logit contributions
 If+0.072
 They+0.071
 apart+0.069
 Instead+0.067
 While+0.065
Neg logit contributions
pron-0.076
iest-0.071
fish-0.070
gh-0.069
cle-0.069
Token were
Token ID547
Feature activation-2.30e-02

Pos logit contributions
 If+0.043
 They+0.042
 Instead+0.041
 apart+0.041
 all+0.038
Neg logit contributions
pron-0.100
kin-0.087
orse-0.087
cle-0.086
 lifetime-0.085
Token the
Token ID262
Feature activation+6.69e-03

Pos logit contributions
 If+0.026
 Instead+0.025
 They+0.024
 apart+0.023
 all+0.022
Neg logit contributions
pron-0.051
kin-0.046
orse-0.046
fish-0.045
light-0.045
Token most
Token ID749
Feature activation-1.43e-03

Pos logit contributions
pron+0.012
kin+0.011
fish+0.010
orse+0.010
light+0.010
Neg logit contributions
 If-0.010
 Instead-0.010
 apart-0.009
 They-0.009
 So-0.009
Token,
Token ID11
Feature activation-1.01e-02

Pos logit contributions
 If+0.016
 Instead+0.016
 They+0.015
 apart+0.015
 So+0.014
Neg logit contributions
pron-0.034
kin-0.031
 happened-0.029
iest-0.029
fish-0.029
Token and
Token ID290
Feature activation-8.66e-03

Pos logit contributions
 If+0.011
 They+0.010
 Instead+0.010
 apart+0.009
 Even+0.009
Neg logit contributions
pron-0.023
kin-0.022
iest-0.021
fish-0.021
inkle-0.021
Token all
Token ID477
Feature activation-1.60e-02

Pos logit contributions
 If+0.013
 They+0.012
 Instead+0.012
 apart+0.012
licks+0.011
Neg logit contributions
pron-0.016
kin-0.014
fish-0.013
iest-0.013
inkle-0.013
Token the
Token ID262
Feature activation+1.17e-03

Pos logit contributions
 If+0.017
 They+0.017
 Instead+0.015
 apart+0.015
 all+0.015
Neg logit contributions
pron-0.039
kin-0.034
cle-0.034
 lifetime-0.032
gh-0.032
Token other
Token ID584
Feature activation-5.69e-05

Pos logit contributions
pron+0.003
kin+0.002
light+0.002
fish+0.002
inkle+0.002
Neg logit contributions
 If-0.001
 Instead-0.001
 They-0.001
 all-0.001
 apart-0.001
Token av
Token ID1196
Feature activation-1.07e-01

Pos logit contributions
 Instead+0.000
 If+0.000
 They+0.000
 all+0.000
 So+0.000
Neg logit contributions
pron-0.000
kin-0.000
fish-0.000
cle-0.000
orse-0.000
Tokenoc
Token ID420
Feature activation-4.76e-02

Pos logit contributions
 If+0.175
 Instead+0.159
 They+0.156
 apart+0.155
licks+0.148
Neg logit contributions
pron-0.182
kin-0.160
 lifetime-0.158
fish-0.157
iest-0.154
Tokenados
Token ID22484
Feature activation-2.32e-02

Pos logit contributions
 If+0.069
 They+0.064
 Instead+0.063
 apart+0.062
 While+0.058
Neg logit contributions
pron-0.083
iest-0.076
fish-0.075
cle-0.075
kin-0.075
Token were
Token ID547
Feature activation-1.44e-02

Pos logit contributions
 all+0.025
 apart+0.023
 They+0.023
 both+0.022
 If+0.021
Neg logit contributions
pron-0.054
cle-0.047
inkle-0.046
kin-0.046
 particular-0.043
Token very
Token ID845
Feature activation+1.82e-03

Pos logit contributions
 all+0.013
 apart+0.013
 If+0.013
 Instead+0.012
 They+0.012
Neg logit contributions
pron-0.034
kin-0.031
fish-0.031
light-0.031
cle-0.030
Token happy
Token ID3772
Feature activation-9.72e-03

Pos logit contributions
pron+0.003
kin+0.003
fish+0.003
 lifetime+0.003
iest+0.003
Neg logit contributions
 If-0.003
 Instead-0.003
 apart-0.003
 They-0.003
licks-0.002
Token grabbed
Token ID13176
Feature activation+1.00e-02

Pos logit contributions
 If+0.004
 They+0.004
 Instead+0.004
 apart+0.004
 So+0.004
Neg logit contributions
pron-0.007
kin-0.006
 lifetime-0.006
iest-0.006
fish-0.006
Token a
Token ID257
Feature activation-1.15e-02

Pos logit contributions
pron+0.022
kin+0.020
fish+0.019
 lifetime+0.019
orse+0.018
Neg logit contributions
 If-0.012
 Instead-0.011
 apart-0.011
 They-0.011
licks-0.010
Token wand
Token ID11569
Feature activation+3.52e-03

Pos logit contributions
 apart+0.015
 If+0.014
 g+0.014
icks+0.014
 hide+0.014
Neg logit contributions
we-0.019
cle-0.019
 lifetime-0.018
wen-0.018
 happened-0.017
Token and
Token ID290
Feature activation+1.02e-03

Pos logit contributions
pron+0.009
 lifetime+0.008
kin+0.008
cle+0.008
orse+0.007
Neg logit contributions
 If-0.003
 apart-0.003
 They-0.003
 Instead-0.003
licks-0.003
Token waved
Token ID26834
Feature activation+7.83e-04

Pos logit contributions
pron+0.002
kin+0.002
 lifetime+0.002
fish+0.002
iest+0.002
Neg logit contributions
 If-0.001
 apart-0.001
 They-0.001
 Instead-0.001
 So-0.001
Token it
Token ID340
Feature activation+2.70e-02

Pos logit contributions
pron+0.002
kin+0.002
 lifetime+0.001
cle+0.001
fish+0.001
Neg logit contributions
 If-0.001
 They-0.001
 apart-0.001
 Instead-0.001
 all-0.001
Token around
Token ID1088
Feature activation+3.36e-04

Pos logit contributions
pron+0.060
kin+0.055
fish+0.052
 lifetime+0.051
orse+0.051
Neg logit contributions
 If-0.031
 Instead-0.028
 apart-0.028
 They-0.027
licks-0.026
Token.
Token ID13
Feature activation-5.40e-03

Pos logit contributions
pron+0.001
 lifetime+0.001
kin+0.001
iest+0.001
fish+0.001
Neg logit contributions
 If-0.000
 They-0.000
 apart-0.000
 all-0.000
 Instead-0.000
Token "
Token ID366
Feature activation+8.69e-03

Pos logit contributions
 apart+0.007
 If+0.006
atters+0.006
licks+0.006
 Instead+0.006
Neg logit contributions
pron-0.011
kin-0.009
 happened-0.009
orse-0.009
fish-0.009
TokenA
Token ID32
Feature activation-9.67e-03

Pos logit contributions
pron+0.015
kin+0.013
fish+0.013
 lifetime+0.012
iest+0.012
Neg logit contributions
 If-0.014
 Instead-0.013
 They-0.013
 apart-0.013
licks-0.012
Tokenbr
Token ID1671
Feature activation-1.77e-04

Pos logit contributions
 If+0.015
 Instead+0.014
 They+0.014
 zo+0.013
icks+0.013
Neg logit contributions
pron-0.014
kin-0.014
cle-0.013
kins-0.012
light-0.012
Token happy
Token ID3772
Feature activation-5.49e-03

Pos logit contributions
pron+0.017
kin+0.016
fish+0.015
inkle+0.015
cle+0.015
Neg logit contributions
 If-0.010
 Instead-0.009
 They-0.009
 apart-0.009
 So-0.008
Token.
Token ID13
Feature activation+6.50e-04

Pos logit contributions
 They+0.005
 Instead+0.005
 If+0.005
 apart+0.005
 all+0.004
Neg logit contributions
pron-0.014
kin-0.013
cle-0.012
inkle-0.012
orse-0.012
Token She
Token ID1375
Feature activation+2.60e-03

Pos logit contributions
pron+0.001
ina+0.001
 lifetime+0.001
light+0.001
 moment+0.001
Neg logit contributions
 wre-0.001
 apart-0.001
atters-0.001
 blamed-0.001
icks-0.001
Token realised
Token ID19169
Feature activation+6.71e-04

Pos logit contributions
pron+0.005
kin+0.005
iest+0.005
cle+0.004
fish+0.004
Neg logit contributions
 Instead-0.003
 If-0.003
 They-0.003
 Even-0.003
 So-0.003
Token that
Token ID326
Feature activation+8.58e-03

Pos logit contributions
pron+0.002
kin+0.002
inkle+0.001
cle+0.001
 lifetime+0.001
Neg logit contributions
 If-0.001
 all-0.001
 Instead-0.001
 They-0.001
 apart-0.001
Token doing
Token ID1804
Feature activation+2.63e-02

Pos logit contributions
pron+0.018
kin+0.017
fish+0.016
inkle+0.015
iest+0.015
Neg logit contributions
 If-0.011
 Instead-0.010
 They-0.010
 apart-0.010
 all-0.009
Token the
Token ID262
Feature activation-6.08e-03

Pos logit contributions
pron+0.059
kin+0.052
fish+0.050
orse+0.050
cle+0.049
Neg logit contributions
 If-0.031
 Instead-0.029
 They-0.028
 apart-0.028
 all-0.025
Token right
Token ID826
Feature activation+2.56e-02

Pos logit contributions
 If+0.010
 Instead+0.009
 They+0.009
 So+0.009
 apart+0.009
Neg logit contributions
pron-0.011
kin-0.009
fish-0.009
iest-0.009
light-0.009
Token thing
Token ID1517
Feature activation+3.26e-03

Pos logit contributions
pron+0.041
kin+0.032
orse+0.031
fish+0.031
 lifetime+0.030
Neg logit contributions
 If-0.048
 Instead-0.046
 They-0.044
 apart-0.044
 So-0.042
Token can
Token ID460
Feature activation-2.30e-02

Pos logit contributions
pron+0.008
kin+0.007
 lifetime+0.007
orse+0.007
cle+0.007
Neg logit contributions
 If-0.003
 They-0.003
 Instead-0.003
 apart-0.003
 all-0.003
Token feel
Token ID1254
Feature activation-1.22e-02

Pos logit contributions
 If+0.025
 Instead+0.022
 They+0.022
 apart+0.021
 Even+0.020
Neg logit contributions
pron-0.054
kin-0.048
 lifetime-0.046
light-0.044
fish-0.044
Token he
Token ID339
Feature activation-2.84e-03

Pos logit contributions
 If+0.011
 Instead+0.010
 They+0.010
 apart+0.010
licks+0.009
Neg logit contributions
pron-0.016
kin-0.014
fish-0.013
 lifetime-0.013
iest-0.013
Token thanked
Token ID26280
Feature activation-1.17e-02

Pos logit contributions
 If+0.004
 Instead+0.004
 They+0.003
 apart+0.003
 So+0.003
Neg logit contributions
pron-0.006
kin-0.005
iest-0.005
 lifetime-0.005
cle-0.005
Token the
Token ID262
Feature activation-1.11e-03

Pos logit contributions
 If+0.015
 Instead+0.014
 apart+0.013
 They+0.013
atters+0.013
Neg logit contributions
pron-0.024
kin-0.022
fish-0.021
orse-0.020
 lifetime-0.020
Token boy
Token ID2933
Feature activation-4.45e-03

Pos logit contributions
 If+0.002
 Instead+0.002
 apart+0.002
licks+0.002
atters+0.002
Neg logit contributions
pron-0.002
kin-0.002
fish-0.002
 lifetime-0.002
orse-0.002
Token and
Token ID290
Feature activation-5.85e-03

Pos logit contributions
 If+0.007
 Instead+0.006
 apart+0.006
licks+0.006
 They+0.006
Neg logit contributions
pron-0.008
kin-0.007
fish-0.007
orse-0.007
 lifetime-0.007
Token said
Token ID531
Feature activation+3.22e-02

Pos logit contributions
 If+0.009
 Instead+0.008
 They+0.008
 apart+0.008
licks+0.007
Neg logit contributions
pron-0.011
kin-0.010
fish-0.009
 lifetime-0.009
iest-0.009
Token goodbye
Token ID24829
Feature activation-1.02e-03

Pos logit contributions
pron+0.052
kin+0.048
fish+0.047
 happened+0.044
iest+0.044
Neg logit contributions
 If-0.051
atters-0.049
 Instead-0.048
licks-0.048
 apart-0.048
Token.
Token ID13
Feature activation-1.79e-03

Pos logit contributions
 If+0.001
 Instead+0.001
 apart+0.001
 They+0.001
licks+0.001
Neg logit contributions
pron-0.002
kin-0.002
fish-0.002
 lifetime-0.002
orse-0.002
Token
Token ID198
Feature activation+7.07e-03

Pos logit contributions
 apart+0.002
licks+0.002
icks+0.002
atters+0.002
 If+0.002
Neg logit contributions
pron-0.003
orse-0.003
 lifetime-0.002
 moment-0.002
 happened-0.002
TokenThe
Token ID464
Feature activation-1.57e-03

Pos logit contributions
pron+0.015
kin+0.014
fish+0.013
orse+0.013
 lifetime+0.013
Neg logit contributions
 If-0.008
 Instead-0.008
 apart-0.008
licks-0.007
 They-0.007
Token boy
Token ID2933
Feature activation+6.66e-03

Pos logit contributions
 If+0.003
 Instead+0.002
 apart+0.002
 They+0.002
licks+0.002
Neg logit contributions
pron-0.003
kin-0.002
fish-0.002
 lifetime-0.002
orse-0.002
Token come
Token ID1282
Feature activation-1.44e-03

Pos logit contributions
 If+0.045
 Instead+0.041
licks+0.040
 apart+0.039
atters+0.038
Neg logit contributions
pron-0.065
kin-0.059
fish-0.057
orse-0.055
light-0.054
Token back
Token ID736
Feature activation+9.84e-03

Pos logit contributions
 If+0.002
 They+0.001
 Instead+0.001
 apart+0.001
atters+0.001
Neg logit contributions
pron-0.003
kin-0.003
fish-0.003
 lifetime-0.003
iest-0.003
Token again
Token ID757
Feature activation-1.17e-02

Pos logit contributions
pron+0.022
kin+0.020
fish+0.018
orse+0.018
 lifetime+0.018
Neg logit contributions
 If-0.012
 Instead-0.011
 apart-0.011
 They-0.011
 So-0.010
Token,"
Token ID553
Feature activation-2.12e-02

Pos logit contributions
 If+0.013
 Instead+0.012
 They+0.012
 apart+0.012
 So+0.012
Neg logit contributions
pron-0.026
kin-0.024
iest-0.022
fish-0.022
 lifetime-0.021
Token said
Token ID531
Feature activation+2.16e-02

Pos logit contributions
 They+0.028
 If+0.028
 Instead+0.028
 So+0.027
 She+0.024
Neg logit contributions
pron-0.045
kin-0.041
fish-0.039
 lifetime-0.038
inkle-0.037
Token Susan
Token ID14919
Feature activation+2.39e-02

Pos logit contributions
pron+0.045
kin+0.040
fish+0.040
 lifetime+0.039
orse+0.038
Neg logit contributions
 If-0.028
 Instead-0.026
 They-0.026
 apart-0.025
 So-0.023
Token to
Token ID284
Feature activation+1.66e-02

Pos logit contributions
pron+0.047
kin+0.042
fish+0.040
 lifetime+0.038
iest+0.038
Neg logit contributions
 If-0.033
 Instead-0.031
 apart-0.030
 They-0.030
licks-0.028
Token her
Token ID607
Feature activation-5.22e-03

Pos logit contributions
pron+0.033
kin+0.030
fish+0.029
 lifetime+0.028
iest+0.028
Neg logit contributions
 If-0.023
 Instead-0.021
 They-0.021
 apart-0.020
 Even-0.019
Token mom
Token ID1995
Feature activation+7.01e-03

Pos logit contributions
 If+0.006
 Instead+0.006
 apart+0.006
 So+0.006
 They+0.006
Neg logit contributions
pron-0.011
kin-0.010
fish-0.010
 lifetime-0.010
iest-0.010
Tokenmy
Token ID1820
Feature activation+5.54e-03

Pos logit contributions
pron+0.014
kin+0.013
fish+0.012
 lifetime+0.012
iest+0.012
Neg logit contributions
 If-0.010
 Instead-0.009
 apart-0.009
 They-0.009
licks-0.008
Token.
Token ID13
Feature activation+4.15e-03

Pos logit contributions
pron+0.011
kin+0.010
fish+0.010
iest+0.009
 lifetime+0.009
Neg logit contributions
 If-0.007
 Instead-0.007
 apart-0.007
 They-0.007
 So-0.006
Token was
Token ID373
Feature activation+1.03e-02

Pos logit contributions
 If+0.014
 Instead+0.013
 apart+0.013
 They+0.013
atters+0.013
Neg logit contributions
pron-0.018
kin-0.016
fish-0.015
iest-0.015
 lifetime-0.015
Token thankful
Token ID25535
Feature activation-1.03e-02

Pos logit contributions
pron+0.025
cle+0.024
fish+0.023
kin+0.023
iest+0.023
Neg logit contributions
 If-0.010
 apart-0.008
 Instead-0.008
 all-0.008
 They-0.008
Token for
Token ID329
Feature activation+4.28e-04

Pos logit contributions
 They+0.009
 If+0.009
 all+0.008
 Instead+0.007
 So+0.007
Neg logit contributions
pron-0.027
kin-0.024
orse-0.023
 lifetime-0.023
inkle-0.023
Token the
Token ID262
Feature activation-9.56e-03

Pos logit contributions
pron+0.001
kin+0.001
fish+0.001
iest+0.001
inkle+0.001
Neg logit contributions
 If-0.000
 They-0.000
 Instead-0.000
 all-0.000
 apart-0.000
Token special
Token ID2041
Feature activation-2.05e-03

Pos logit contributions
 If+0.012
 all+0.011
 They+0.010
 zo+0.010
 Instead+0.010
Neg logit contributions
pron-0.017
kin-0.016
light-0.016
orse-0.016
inkle-0.016
Token week
Token ID1285
Feature activation+3.21e-02

Pos logit contributions
 If+0.003
 They+0.003
 all+0.003
 give+0.002
 Enjoy+0.002
Neg logit contributions
pron-0.003
wen-0.003
inkle-0.003
kin-0.003
orse-0.003
Token she
Token ID673
Feature activation+5.28e-03

Pos logit contributions
pron+0.081
kin+0.073
inkle+0.073
fish+0.072
orse+0.069
Neg logit contributions
 They-0.028
 all-0.025
 apart-0.025
 Instead-0.024
 If-0.023
Token spent
Token ID3377
Feature activation-1.10e-03

Pos logit contributions
pron+0.011
kin+0.009
itten+0.009
cle+0.009
fish+0.009
Neg logit contributions
 all-0.006
 They-0.006
 apart-0.006
 If-0.006
 Instead-0.006
Token with
Token ID351
Feature activation-1.92e-03

Pos logit contributions
 all+0.001
 They+0.001
 If+0.001
 apart+0.001
 many+0.001
Neg logit contributions
pron-0.003
fish-0.003
itten-0.003
orse-0.003
kin-0.003
Token her
Token ID607
Feature activation-7.00e-03

Pos logit contributions
 all+0.002
 If+0.002
 They+0.002
 Instead+0.002
 many+0.002
Neg logit contributions
pron-0.005
kin-0.004
fish-0.004
cle-0.004
inkle-0.004
Token family
Token ID1641
Feature activation+4.10e-03

Pos logit contributions
 all+0.008
 many+0.007
 both+0.007
 If+0.007
 So+0.007
Neg logit contributions
pron-0.013
wen-0.013
iest-0.013
ina-0.013
cle-0.012
Token know
Token ID760
Feature activation+9.46e-03

Pos logit contributions
pron+0.026
kin+0.024
fish+0.023
iest+0.022
orse+0.022
Neg logit contributions
 If-0.012
 Instead-0.011
 They-0.010
licks-0.010
 apart-0.010
Token you
Token ID345
Feature activation-1.08e-02

Pos logit contributions
pron+0.022
kin+0.020
 lifetime+0.019
inkle+0.018
fish+0.018
Neg logit contributions
 If-0.012
 Instead-0.010
 They-0.010
 apart-0.010
licks-0.009
Token saw
Token ID2497
Feature activation+4.90e-03

Pos logit contributions
 If+0.012
 Instead+0.011
 They+0.011
 apart+0.011
icks+0.010
Neg logit contributions
pron-0.025
kin-0.022
 lifetime-0.021
fish-0.021
orse-0.020
Token it
Token ID340
Feature activation+3.75e-03

Pos logit contributions
pron+0.012
kin+0.011
 lifetime+0.010
fish+0.010
orse+0.010
Neg logit contributions
 If-0.006
 Instead-0.005
 apart-0.005
 They-0.005
 all-0.004
Token first
Token ID717
Feature activation+1.22e-02

Pos logit contributions
pron+0.008
kin+0.007
orse+0.007
 lifetime+0.007
cle+0.007
Neg logit contributions
 If-0.005
 apart-0.004
 Instead-0.004
 They-0.004
licks-0.004
Token.
Token ID13
Feature activation-6.99e-03

Pos logit contributions
pron+0.030
 lifetime+0.026
kin+0.026
orse+0.025
fish+0.025
Neg logit contributions
 If-0.013
 apart-0.012
 Instead-0.012
 They-0.011
icks-0.010
Token I
Token ID314
Feature activation-6.96e-03

Pos logit contributions
 apart+0.009
licks+0.008
atters+0.008
 Instead+0.007
 If+0.007
Neg logit contributions
pron-0.012
kin-0.012
orse-0.012
light-0.011
cle-0.011
Token was
Token ID373
Feature activation+3.73e-03

Pos logit contributions
 If+0.008
 Instead+0.008
 They+0.008
 apart+0.007
atters+0.007
Neg logit contributions
pron-0.015
kin-0.014
 lifetime-0.013
fish-0.013
iest-0.012
Token just
Token ID655
Feature activation+1.84e-02

Pos logit contributions
pron+0.008
kin+0.007
fish+0.007
 lifetime+0.006
iest+0.006
Neg logit contributions
 If-0.005
 Instead-0.004
 They-0.004
 apart-0.004
licks-0.004
Token looking
Token ID2045
Feature activation-3.80e-03

Pos logit contributions
pron+0.039
kin+0.035
fish+0.034
 lifetime+0.033
iest+0.033
Neg logit contributions
 If-0.024
 apart-0.021
 Instead-0.021
 They-0.020
atters-0.019
Token for
Token ID329
Feature activation-6.07e-03

Pos logit contributions
 If+0.005
 apart+0.004
 Instead+0.004
 They+0.004
 Other+0.004
Neg logit contributions
pron-0.009
kin-0.008
fish-0.007
orse-0.007
 lifetime-0.007
Token very
Token ID845
Feature activation-9.67e-04

Pos logit contributions
 If+0.000
 They+0.000
 apart+0.000
 Instead+0.000
 all+0.000
Neg logit contributions
pron-0.001
iest-0.001
fish-0.001
 lifetime-0.001
kin-0.001
Token icy
Token ID30284
Feature activation+1.49e-03

Pos logit contributions
 If+0.001
 Instead+0.001
 They+0.001
 apart+0.001
atters+0.001
Neg logit contributions
pron-0.002
kin-0.002
fish-0.002
 lifetime-0.002
iest-0.002
Token outside
Token ID2354
Feature activation+1.01e-02

Pos logit contributions
pron+0.003
kin+0.003
fish+0.003
 lifetime+0.003
orse+0.003
Neg logit contributions
 If-0.002
 Instead-0.002
 They-0.002
 apart-0.002
atters-0.002
Token.
Token ID13
Feature activation-5.93e-03

Pos logit contributions
pron+0.026
 lifetime+0.023
kin+0.023
orse+0.023
fish+0.022
Neg logit contributions
 If-0.009
 They-0.009
 apart-0.008
 Instead-0.007
 all-0.006
Token Do
Token ID2141
Feature activation-1.28e-03

Pos logit contributions
 apart+0.008
licks+0.007
atters+0.007
icks+0.007
late+0.007
Neg logit contributions
pron-0.011
light-0.010
orse-0.009
kin-0.009
cle-0.009
Token you
Token ID345
Feature activation-9.86e-03

Pos logit contributions
 If+0.002
 Instead+0.002
 They+0.002
 So+0.002
icks+0.002
Neg logit contributions
pron-0.002
kin-0.002
fish-0.002
iest-0.002
 lifetime-0.002
Token want
Token ID765
Feature activation+5.99e-03

Pos logit contributions
 If+0.008
 Instead+0.007
 apart+0.006
 They+0.006
licks+0.006
Neg logit contributions
pron-0.025
kin-0.024
fish-0.023
orse-0.022
iest-0.022
Token to
Token ID284
Feature activation-1.17e-02

Pos logit contributions
pron+0.015
kin+0.015
 lifetime+0.014
fish+0.013
inkle+0.013
Neg logit contributions
 If-0.005
 Instead-0.004
 They-0.004
 apart-0.004
 Other-0.004
Token wear
Token ID5806
Feature activation-5.86e-03

Pos logit contributions
 If+0.015
 Instead+0.014
 apart+0.013
licks+0.013
 They+0.013
Neg logit contributions
pron-0.024
kin-0.022
fish-0.021
orse-0.021
light-0.020
Token my
Token ID616
Feature activation+8.54e-03

Pos logit contributions
 If+0.006
 all+0.005
 They+0.005
 Instead+0.005
 both+0.005
Neg logit contributions
 lifetime-0.013
kin-0.013
pron-0.012
iest-0.012
fish-0.012
Token scarf
Token ID36953
Feature activation+8.94e-03

Pos logit contributions
ina+0.016
wen+0.016
asha+0.015
 lifetime+0.015
iest+0.015
Neg logit contributions
 all-0.010
 p-0.009
."-0.009
 look-0.008
 hide-0.008
Token
Token ID198
Feature activation+6.97e-03

Pos logit contributions
 If+0.001
 apart+0.001
 Instead+0.001
licks+0.001
atters+0.001
Neg logit contributions
pron-0.003
kin-0.003
orse-0.003
 lifetime-0.003
fish-0.003
Token
Token ID198
Feature activation+9.44e-03

Pos logit contributions
pron+0.016
kin+0.014
fish+0.014
 lifetime+0.013
iest+0.013
Neg logit contributions
 If-0.008
 Instead-0.007
 They-0.006
 apart-0.006
licks-0.006
Token"
Token ID1
Feature activation+1.19e-02

Pos logit contributions
pron+0.017
kin+0.015
fish+0.015
iest+0.014
 happened+0.014
Neg logit contributions
 If-0.015
 Instead-0.014
 They-0.013
 apart-0.013
atters-0.013
TokenVery
Token ID16371
Feature activation+1.55e-02

Pos logit contributions
pron+0.022
kin+0.021
fish+0.020
inkle+0.019
orse+0.019
Neg logit contributions
 If-0.018
 Instead-0.016
 They-0.016
 apart-0.015
licks-0.015
Token good
Token ID922
Feature activation-2.31e-03

Pos logit contributions
pron+0.033
kin+0.030
fish+0.029
 lifetime+0.028
iest+0.027
Neg logit contributions
 If-0.019
 Instead-0.017
 They-0.017
 apart-0.017
licks-0.016
Token,"
Token ID553
Feature activation-2.13e-02

Pos logit contributions
 If+0.003
 Instead+0.003
 They+0.003
 apart+0.003
licks+0.003
Neg logit contributions
pron-0.005
kin-0.004
fish-0.004
 lifetime-0.004
orse-0.004
Token said
Token ID531
Feature activation+1.62e-02

Pos logit contributions
 If+0.031
 Instead+0.029
 They+0.029
 apart+0.027
 So+0.027
Neg logit contributions
pron-0.042
kin-0.037
fish-0.035
 lifetime-0.034
inkle-0.034
Token the
Token ID262
Feature activation+1.81e-03

Pos logit contributions
pron+0.036
kin+0.033
fish+0.032
 lifetime+0.031
light+0.030
Neg logit contributions
 If-0.018
 Instead-0.017
 apart-0.016
 They-0.016
licks-0.015
Token teacher
Token ID4701
Feature activation+1.04e-02

Pos logit contributions
pron+0.003
kin+0.003
fish+0.003
light+0.003
 lifetime+0.003
Neg logit contributions
 If-0.003
 Instead-0.003
 They-0.003
 apart-0.002
 So-0.002
Token,
Token ID11
Feature activation+9.77e-03

Pos logit contributions
pron+0.022
kin+0.020
fish+0.019
 lifetime+0.018
iest+0.018
Neg logit contributions
 If-0.013
 Instead-0.013
 apart-0.012
 They-0.012
 So-0.012
Token looking
Token ID2045
Feature activation+5.47e-03

Pos logit contributions
pron+0.021
kin+0.019
iest+0.019
fish+0.019
 lifetime+0.018
Neg logit contributions
 If-0.012
 Instead-0.011
 They-0.011
 apart-0.011
 Even-0.010
Token eyes
Token ID2951
Feature activation-2.22e-03

Pos logit contributions
 If+0.017
 Instead+0.017
 apart+0.016
 They+0.016
 So+0.015
Neg logit contributions
pron-0.032
iest-0.029
kin-0.029
fish-0.029
orse-0.028
Token.
Token ID13
Feature activation-1.13e-02

Pos logit contributions
 If+0.004
 Instead+0.003
licks+0.003
atters+0.003
 They+0.003
Neg logit contributions
pron-0.004
kin-0.003
fish-0.003
light-0.003
iest-0.003
Token He
Token ID679
Feature activation-1.43e-02

Pos logit contributions
 apart+0.010
licks+0.010
atters+0.009
 If+0.009
icks+0.008
Neg logit contributions
pron-0.026
kin-0.023
 happened-0.021
orse-0.021
light-0.020
Token felt
Token ID2936
Feature activation-3.92e-03

Pos logit contributions
 Instead+0.019
 If+0.019
 They+0.018
 So+0.018
 apart+0.018
Neg logit contributions
pron-0.027
kin-0.024
iest-0.024
fish-0.023
 lifetime-0.023
Token warm
Token ID5814
Feature activation-6.11e-03

Pos logit contributions
 If+0.005
 Instead+0.004
 apart+0.004
 They+0.004
licks+0.004
Neg logit contributions
pron-0.008
kin-0.007
fish-0.007
 lifetime-0.007
iest-0.007
Token and
Token ID290
Feature activation+9.73e-03

Pos logit contributions
 If+0.009
 Instead+0.008
 They+0.007
atters+0.007
licks+0.007
Neg logit contributions
pron-0.012
kin-0.011
fish-0.010
light-0.010
iest-0.010
Token cozy
Token ID37438
Feature activation-1.32e-02

Pos logit contributions
pron+0.021
fish+0.019
kin+0.019
iest+0.018
orse+0.018
Neg logit contributions
 If-0.011
 Instead-0.011
 They-0.010
 apart-0.010
 So-0.009
Token.
Token ID13
Feature activation-1.13e-02

Pos logit contributions
 If+0.022
 Instead+0.020
licks+0.020
atters+0.019
 They+0.019
Neg logit contributions
pron-0.022
kin-0.020
fish-0.019
light-0.019
iest-0.018
Token He
Token ID679
Feature activation-1.55e-02

Pos logit contributions
 apart+0.010
licks+0.009
atters+0.009
icks+0.008
 If+0.008
Neg logit contributions
pron-0.026
kin-0.023
 happened-0.022
orse-0.022
cle-0.022
Token heard
Token ID2982
Feature activation-8.83e-03

Pos logit contributions
 Instead+0.020
 If+0.020
 They+0.019
 So+0.018
 Even+0.018
Neg logit contributions
pron-0.030
kin-0.028
iest-0.027
fish-0.026
 lifetime-0.026
Token his
Token ID465
Feature activation-2.63e-02

Pos logit contributions
 If+0.010
 Instead+0.009
 They+0.009
 apart+0.009
 Even+0.008
Neg logit contributions
pron-0.020
kin-0.018
fish-0.017
 lifetime-0.017
iest-0.017
Token hurry
Token ID23290
Feature activation+3.99e-02

Pos logit contributions
pron+0.036
kin+0.034
fish+0.033
iest+0.033
orse+0.031
Neg logit contributions
 If-0.021
 Instead-0.021
 They-0.020
 apart-0.019
 Other-0.018
Token.
Token ID13
Feature activation+4.47e-03

Pos logit contributions
pron+0.093
kin+0.087
fish+0.082
orse+0.081
 lifetime+0.080
Neg logit contributions
 If-0.043
 apart-0.039
 They-0.039
 Instead-0.039
 So-0.034
Token She
Token ID1375
Feature activation+9.29e-03

Pos logit contributions
pron+0.011
kin+0.010
light+0.010
 happened+0.009
orse+0.009
Neg logit contributions
 If-0.003
 apart-0.003
atters-0.003
licks-0.003
icks-0.003
Token grabbed
Token ID13176
Feature activation+2.07e-02

Pos logit contributions
pron+0.019
kin+0.018
fish+0.018
 lifetime+0.017
iest+0.017
Neg logit contributions
 They-0.012
 If-0.011
 Instead-0.011
 He-0.010
 Even-0.010
Token her
Token ID607
Feature activation+7.18e-04

Pos logit contributions
pron+0.044
kin+0.040
fish+0.040
 lifetime+0.040
orse+0.039
Neg logit contributions
 If-0.025
 apart-0.024
 Instead-0.023
 They-0.022
licks-0.022
Token cushion
Token ID33579
Feature activation+1.54e-02

Pos logit contributions
 lifetime+0.001
orse+0.001
light+0.001
iest+0.001
wen+0.001
Neg logit contributions
 all-0.001
 They-0.001
 So-0.001
 If-0.001
 both-0.001
Token,
Token ID11
Feature activation+3.43e-03

Pos logit contributions
pron+0.034
kin+0.031
orse+0.030
fish+0.030
 lifetime+0.030
Neg logit contributions
 If-0.017
 apart-0.016
 Instead-0.015
 They-0.015
 So-0.014
Token hugged
Token ID41130
Feature activation-2.90e-03

Pos logit contributions
 lifetime+0.007
pron+0.007
inkle+0.007
kin+0.007
iest+0.007
Neg logit contributions
 If-0.004
 apart-0.004
 They-0.004
 all-0.004
 Instead-0.003
Token it
Token ID340
Feature activation+1.17e-02

Pos logit contributions
 If+0.005
 They+0.004
 Instead+0.004
 apart+0.004
licks+0.004
Neg logit contributions
pron-0.006
kin-0.005
fish-0.005
 lifetime-0.005
orse-0.005
Token close
Token ID1969
Feature activation-6.26e-03

Pos logit contributions
pron+0.027
kin+0.025
fish+0.024
orse+0.024
iest+0.024
Neg logit contributions
 If-0.012
 Instead-0.011
 apart-0.011
 They-0.010
 So-0.010
Token,
Token ID11
Feature activation+3.62e-03

Pos logit contributions
 If+0.006
 Instead+0.005
 apart+0.005
 They+0.005
 So+0.004
Neg logit contributions
pron-0.017
kin-0.015
orse-0.015
fish-0.015
 lifetime-0.014
TokenOnce
Token ID7454
Feature activation-7.88e-03

Pos logit contributions
 If+0.007
 They+0.007
 Instead+0.007
 So+0.006
 apart+0.006
Neg logit contributions
pron-0.014
kin-0.013
fish-0.012
 lifetime-0.012
orse-0.012
Token upon
Token ID2402
Feature activation-3.02e-03

Pos logit contributions
 If+0.015
 They+0.014
 Instead+0.013
 all+0.013
 Even+0.012
Neg logit contributions
pron-0.013
light-0.011
 lifetime-0.011
cle-0.010
kin-0.010
Token a
Token ID257
Feature activation-5.16e-04

Pos logit contributions
 If+0.003
 They+0.002
 Instead+0.002
 apart+0.002
 So+0.002
Neg logit contributions
pron-0.007
kin-0.007
fish-0.007
 lifetime-0.007
iest-0.006
Token time
Token ID640
Feature activation+6.45e-03

Pos logit contributions
 If+0.001
 Instead+0.001
 They+0.001
 So+0.001
 Even+0.001
Neg logit contributions
pron-0.001
kin-0.001
fish-0.001
iest-0.001
 lifetime-0.001
Token,
Token ID11
Feature activation-8.48e-03

Pos logit contributions
pron+0.016
kin+0.013
 lifetime+0.013
fish+0.013
light+0.013
Neg logit contributions
 If-0.007
 Instead-0.006
 They-0.006
 So-0.006
 Even-0.005
Token there
Token ID612
Feature activation-2.73e-02

Pos logit contributions
 If+0.013
 They+0.012
 Instead+0.011
 apart+0.010
 So+0.010
Neg logit contributions
pron-0.017
 lifetime-0.015
cle-0.014
fish-0.014
kin-0.014
Token was
Token ID373
Feature activation+1.46e-02

Pos logit contributions
 They+0.036
 Instead+0.035
 If+0.034
 Even+0.032
 All+0.032
Neg logit contributions
pron-0.061
 lifetime-0.053
cle-0.051
fish-0.049
kin-0.049
Token a
Token ID257
Feature activation+2.34e-03

Pos logit contributions
pron+0.034
kin+0.031
fish+0.031
 lifetime+0.030
iest+0.030
Neg logit contributions
 If-0.015
 They-0.013
 Instead-0.013
 apart-0.012
 Other-0.011
Token parade
Token ID16134
Feature activation+7.99e-03

Pos logit contributions
pron+0.005
kin+0.004
fish+0.004
iest+0.004
 lifetime+0.004
Neg logit contributions
 If-0.003
 Instead-0.003
 They-0.003
 apart-0.003
atters-0.002
Token.
Token ID13
Feature activation+1.32e-03

Pos logit contributions
pron+0.018
kin+0.017
fish+0.016
 lifetime+0.016
inkle+0.016
Neg logit contributions
 If-0.008
 They-0.008
 Instead-0.008
 apart-0.007
 All-0.007
Token Everyone
Token ID11075
Feature activation-4.44e-03

Pos logit contributions
pron+0.003
kin+0.003
fish+0.003
light+0.003
orse+0.003
Neg logit contributions
 If-0.001
 Instead-0.001
 apart-0.001
licks-0.001
atters-0.001
Token baby
Token ID5156
Feature activation-5.58e-03

Pos logit contributions
pron+0.000
fish+0.000
orse+0.000
kin+0.000
light+0.000
Neg logit contributions
 If-0.000
 Instead-0.000
 apart-0.000
 They-0.000
 Even-0.000
Token had
Token ID550
Feature activation+1.50e-02

Pos logit contributions
 Instead+0.007
 If+0.007
 They+0.007
 So+0.007
 Even+0.007
Neg logit contributions
pron-0.010
kin-0.010
fish-0.009
gh-0.009
orse-0.009
Token so
Token ID523
Feature activation+7.82e-03

Pos logit contributions
pron+0.035
kin+0.034
fish+0.033
cle+0.031
inkle+0.031
Neg logit contributions
 Instead-0.015
 If-0.014
 apart-0.014
 They-0.013
 Even-0.012
Token much
Token ID881
Feature activation-7.09e-03

Pos logit contributions
pron+0.018
kin+0.017
fish+0.016
cle+0.016
 lifetime+0.016
Neg logit contributions
 If-0.008
 Instead-0.007
 They-0.007
 apart-0.006
 Even-0.006
Token to
Token ID284
Feature activation-1.40e-02

Pos logit contributions
 If+0.008
 Instead+0.008
 They+0.007
 Even+0.007
 apart+0.007
Neg logit contributions
pron-0.015
kin-0.014
fish-0.014
cle-0.014
orse-0.013
Token learn
Token ID2193
Feature activation-2.85e-02

Pos logit contributions
 If+0.017
 Instead+0.015
 They+0.015
 apart+0.014
atters+0.013
Neg logit contributions
pron-0.030
kin-0.027
fish-0.026
iest-0.025
 lifetime-0.025
Token,
Token ID11
Feature activation-1.35e-02

Pos logit contributions
 If+0.030
 Instead+0.028
 They+0.026
 apart+0.025
 So+0.023
Neg logit contributions
pron-0.069
kin-0.064
orse-0.060
fish-0.060
cle-0.058
Token but
Token ID475
Feature activation-2.61e-03

Pos logit contributions
 If+0.015
 They+0.014
 Instead+0.014
 Even+0.013
 apart+0.012
Neg logit contributions
pron-0.031
kin-0.030
fish-0.028
iest-0.027
inkle-0.027
Token M
Token ID337
Feature activation-1.93e-02

Pos logit contributions
 If+0.003
 Instead+0.003
 They+0.003
 apart+0.003
 all+0.003
Neg logit contributions
pron-0.005
kin-0.005
fish-0.005
cle-0.005
ves-0.005
Tokenummy
Token ID13513
Feature activation+1.01e-02

Pos logit contributions
 If+0.028
 Instead+0.027
 They+0.026
 So+0.025
 Even+0.025
Neg logit contributions
pron-0.035
kin-0.032
fish-0.032
 lifetime-0.030
orse-0.030
Token and
Token ID290
Feature activation+2.39e-03

Pos logit contributions
pron+0.020
 lifetime+0.019
kin+0.018
cle+0.018
iest+0.018
Neg logit contributions
 They-0.011
 So-0.011
 all-0.011
 Instead-0.011
 If-0.010
Token more
Token ID517
Feature activation+5.01e-04

Pos logit contributions
 If+0.018
 Instead+0.016
 They+0.015
 apart+0.015
 So+0.014
Neg logit contributions
pron-0.037
kin-0.034
fish-0.032
orse-0.032
cle-0.031
Token.
Token ID13
Feature activation-2.32e-03

Pos logit contributions
pron+0.001
kin+0.001
fish+0.001
orse+0.001
 lifetime+0.001
Neg logit contributions
 If-0.000
 Instead-0.000
 They-0.000
 apart-0.000
 So-0.000
Token And
Token ID843
Feature activation+9.46e-03

Pos logit contributions
 apart+0.002
 If+0.002
licks+0.002
atters+0.002
icks+0.002
Neg logit contributions
pron-0.005
orse-0.005
 lifetime-0.005
kin-0.005
light-0.004
Token as
Token ID355
Feature activation+6.36e-03

Pos logit contributions
pron+0.020
kin+0.018
fish+0.017
inkle+0.017
iest+0.016
Neg logit contributions
 If-0.012
 Instead-0.011
 They-0.011
 apart-0.011
 So-0.010
Token they
Token ID484
Feature activation-1.35e-02

Pos logit contributions
pron+0.012
kin+0.011
fish+0.010
 lifetime+0.010
orse+0.010
Neg logit contributions
 If-0.009
 Instead-0.008
 apart-0.008
 They-0.008
 Even-0.007
Token read
Token ID1100
Feature activation-3.02e-02

Pos logit contributions
 If+0.020
 Instead+0.018
 apart+0.017
 They+0.017
atters+0.017
Neg logit contributions
pron-0.026
kin-0.023
fish-0.022
 lifetime-0.021
iest-0.021
Token,
Token ID11
Feature activation-6.86e-03

Pos logit contributions
 They+0.030
 If+0.029
 apart+0.028
 Instead+0.027
 all+0.025
Neg logit contributions
pron-0.074
kin-0.066
orse-0.065
fish-0.065
 lifetime-0.062
Token they
Token ID484
Feature activation+4.51e-03

Pos logit contributions
 If+0.009
 Instead+0.008
 They+0.008
 apart+0.008
licks+0.008
Neg logit contributions
pron-0.014
kin-0.013
fish-0.012
inkle-0.011
 lifetime-0.011
Token laughed
Token ID13818
Feature activation+7.93e-03

Pos logit contributions
pron+0.009
kin+0.008
fish+0.007
iest+0.007
orse+0.007
Neg logit contributions
 If-0.006
 Instead-0.006
 apart-0.006
 They-0.006
licks-0.006
Token and
Token ID290
Feature activation+3.82e-03

Pos logit contributions
pron+0.021
kin+0.019
cle+0.018
orse+0.018
fish+0.018
Neg logit contributions
 They-0.007
 If-0.007
 apart-0.006
 Instead-0.006
 all-0.006
Token talked
Token ID6619
Feature activation-7.75e-03

Pos logit contributions
pron+0.007
kin+0.007
fish+0.006
iest+0.006
inkle+0.006
Neg logit contributions
 If-0.005
 They-0.005
 Instead-0.005
 apart-0.005
licks-0.005
Token wonderful
Token ID7932
Feature activation+1.18e-02

Pos logit contributions
 If+0.019
 Instead+0.019
 apart+0.019
 They+0.018
 Even+0.018
Neg logit contributions
pron-0.029
kin-0.029
 happened-0.027
iest-0.027
kins-0.026
Token adventure
Token ID8855
Feature activation+2.09e-02

Pos logit contributions
pron+0.019
kin+0.019
kins+0.018
wen+0.018
cle+0.017
Neg logit contributions
 all-0.017
 They-0.016
 Even-0.016
 So-0.015
 All-0.015
Token.
Token ID13
Feature activation+2.62e-04

Pos logit contributions
pron+0.053
kin+0.046
inkle+0.045
cle+0.043
orse+0.043
Neg logit contributions
 apart-0.018
 all-0.018
 They-0.017
 If-0.017
 Instead-0.016
Token<|endoftext|>
Token ID50256
Feature activation-7.29e-03

Pos logit contributions
pron+0.001
kin+0.001
 lifetime+0.001
orse+0.001
light+0.001
Neg logit contributions
 If-0.000
 apart-0.000
licks-0.000
 Instead-0.000
atters-0.000
TokenOnce
Token ID7454
Feature activation-3.90e-03

Pos logit contributions
 If+0.007
 Instead+0.007
 They+0.007
 apart+0.006
 So+0.006
Neg logit contributions
pron-0.018
kin-0.016
fish-0.015
 lifetime-0.015
orse-0.015
Token there
Token ID612
Feature activation-2.75e-02

Pos logit contributions
 If+0.007
 They+0.007
 Instead+0.007
 all+0.006
 Sometimes+0.006
Neg logit contributions
pron-0.007
 lifetime-0.005
light-0.005
cle-0.005
kin-0.005
Token was
Token ID373
Feature activation+9.71e-03

Pos logit contributions
 They+0.033
 Instead+0.033
 If+0.031
 So+0.030
 The+0.029
Neg logit contributions
pron-0.066
 lifetime-0.056
kin-0.054
cle-0.054
fish-0.053
Token a
Token ID257
Feature activation+1.96e-03

Pos logit contributions
pron+0.024
kin+0.022
fish+0.021
 lifetime+0.021
cle+0.020
Neg logit contributions
 If-0.009
 They-0.008
 Instead-0.008
 Other-0.007
 So-0.007
Token boy
Token ID2933
Feature activation-1.06e-02

Pos logit contributions
pron+0.004
kin+0.004
fish+0.003
iest+0.003
 lifetime+0.003
Neg logit contributions
 If-0.003
 Instead-0.002
 They-0.002
 apart-0.002
atters-0.002
Token named
Token ID3706
Feature activation+1.12e-02

Pos logit contributions
 If+0.016
 Instead+0.015
 apart+0.015
licks+0.014
 They+0.014
Neg logit contributions
pron-0.019
kin-0.017
fish-0.016
orse-0.016
light-0.016
Token John
Token ID1757
Feature activation+1.04e-02

Pos logit contributions
pron+0.021
orse+0.018
 happened+0.018
kin+0.018
fish+0.018
Neg logit contributions
licks-0.014
 apart-0.014
 If-0.014
atters-0.014
 Instead-0.013
Token share
Token ID2648
Feature activation-2.10e-02

Pos logit contributions
 If+0.008
licks+0.008
 Instead+0.007
 Other+0.007
 apart+0.007
Neg logit contributions
pron-0.013
kin-0.012
inkle-0.011
cle-0.010
fish-0.010
Token and
Token ID290
Feature activation+5.90e-03

Pos logit contributions
 If+0.024
 Instead+0.022
 apart+0.022
 They+0.021
licks+0.019
Neg logit contributions
pron-0.047
kin-0.043
 lifetime-0.041
fish-0.041
iest-0.040
Token take
Token ID1011
Feature activation-2.07e-03

Pos logit contributions
pron+0.013
kin+0.012
fish+0.011
iest+0.011
 lifetime+0.011
Neg logit contributions
 If-0.007
 Instead-0.006
 They-0.006
 apart-0.006
atters-0.006
Token turns
Token ID4962
Feature activation+1.07e-02

Pos logit contributions
 If+0.003
licks+0.003
 Instead+0.002
atters+0.002
 Even+0.002
Neg logit contributions
pron-0.003
kin-0.003
 happened-0.003
iest-0.003
 moment-0.003
Token.
Token ID13
Feature activation-7.76e-03

Pos logit contributions
pron+0.024
kin+0.022
fish+0.021
iest+0.021
 lifetime+0.021
Neg logit contributions
 If-0.012
 apart-0.011
 Instead-0.011
 They-0.011
atters-0.010
Token Now
Token ID2735
Feature activation-3.92e-02

Pos logit contributions
 apart+0.011
 all+0.011
licks+0.010
atters+0.010
late+0.010
Neg logit contributions
pron-0.011
cle-0.011
kin-0.010
orse-0.010
light-0.010
Token you
Token ID345
Feature activation-2.91e-02

Pos logit contributions
 If+0.054
 Instead+0.050
 apart+0.048
 They+0.048
 So+0.045
Neg logit contributions
pron-0.079
kin-0.070
fish-0.067
 lifetime-0.066
iest-0.066
Token have
Token ID423
Feature activation-5.22e-03

Pos logit contributions
 If+0.044
 Other+0.042
 Instead+0.041
licks+0.040
 They+0.038
Neg logit contributions
pron-0.047
kin-0.045
fish-0.042
 happened-0.041
inkle-0.040
Token to
Token ID284
Feature activation-1.47e-02

Pos logit contributions
 If+0.005
 Instead+0.005
 apart+0.004
 They+0.004
icks+0.004
Neg logit contributions
pron-0.013
kin-0.012
fish-0.012
iest-0.011
inkle-0.011
Token clean
Token ID3424
Feature activation-1.80e-02

Pos logit contributions
 If+0.017
 Instead+0.016
licks+0.016
 apart+0.015
 Other+0.014
Neg logit contributions
pron-0.032
kin-0.029
fish-0.027
inkle-0.026
orse-0.026
Token up
Token ID510
Feature activation+9.67e-03

Pos logit contributions
 If+0.019
 Instead+0.017
 They+0.016
atters+0.016
 apart+0.016
Neg logit contributions
pron-0.041
kin-0.037
fish-0.036
iest-0.034
inkle-0.034
Token can
Token ID460
Feature activation-1.61e-02

Pos logit contributions
pron+0.010
kin+0.010
fish+0.009
 lifetime+0.009
iest+0.009
Neg logit contributions
 If-0.006
 Instead-0.005
 They-0.005
 apart-0.005
licks-0.005
Token we
Token ID356
Feature activation-6.43e-03

Pos logit contributions
 If+0.017
 Instead+0.016
 They+0.015
 So+0.014
 apart+0.014
Neg logit contributions
pron-0.037
kin-0.034
 lifetime-0.033
fish-0.033
iest-0.032
Token have
Token ID423
Feature activation+7.88e-03

Pos logit contributions
 If+0.008
 Instead+0.007
licks+0.007
 apart+0.007
 They+0.007
Neg logit contributions
pron-0.014
kin-0.012
fish-0.012
light-0.011
iest-0.011
Token some
Token ID617
Feature activation-6.00e-03

Pos logit contributions
pron+0.018
kin+0.018
fish+0.017
orse+0.016
cle+0.016
Neg logit contributions
 If-0.008
 Instead-0.008
 apart-0.007
 They-0.007
 all-0.007
Token or
Token ID393
Feature activation+1.99e-02

Pos logit contributions
 If+0.007
 Instead+0.007
 all+0.007
 So+0.006
 They+0.006
Neg logit contributions
pron-0.012
kin-0.012
orse-0.011
fish-0.011
cle-0.011
Tokennam
Token ID7402
Feature activation-8.12e-02

Pos logit contributions
pron+0.044
kin+0.041
fish+0.039
iest+0.038
 lifetime+0.038
Neg logit contributions
 If-0.022
 Instead-0.021
 apart-0.020
 They-0.020
 So-0.018
Tokenents
Token ID658
Feature activation-2.68e-02

Pos logit contributions
 If+0.086
 They+0.075
 Instead+0.074
icks+0.069
 Other+0.068
Neg logit contributions
pron-0.184
light-0.162
kin-0.162
fish-0.159
orse-0.157
Token for
Token ID329
Feature activation+1.29e-02

Pos logit contributions
 apart+0.029
 If+0.026
 all+0.026
 They+0.024
 Instead+0.022
Neg logit contributions
pron-0.060
 lifetime-0.057
cle-0.055
orse-0.054
kin-0.053
Token our
Token ID674
Feature activation-1.20e-02

Pos logit contributions
pron+0.027
kin+0.026
fish+0.025
iest+0.024
 lifetime+0.023
Neg logit contributions
 If-0.016
 Instead-0.015
 They-0.014
 apart-0.014
 Other-0.013
Token snow
Token ID6729
Feature activation+3.24e-02

Pos logit contributions
 If+0.016
 Instead+0.015
 apart+0.014
 all+0.014
 They+0.014
Neg logit contributions
pron-0.023
kin-0.021
orse-0.020
iest-0.020
fish-0.020
Tokenman
Token ID805
Feature activation-2.18e-02

Pos logit contributions
pron+0.049
 lifetime+0.044
kin+0.044
orse+0.043
 newfound+0.043
Neg logit contributions
 Instead-0.050
 apart-0.050
 If-0.050
 They-0.049
 So-0.046
Token to
Token ID284
Feature activation-7.79e-03

Pos logit contributions
 If+0.003
 They+0.002
 Instead+0.002
 apart+0.002
 So+0.002
Neg logit contributions
pron-0.008
kin-0.007
 lifetime-0.007
orse-0.007
fish-0.007
Token me
Token ID502
Feature activation-3.86e-03

Pos logit contributions
 If+0.010
 Instead+0.009
 They+0.009
 apart+0.009
licks+0.008
Neg logit contributions
pron-0.016
kin-0.015
fish-0.014
iest-0.014
 lifetime-0.014
Token.
Token ID13
Feature activation-1.16e-02

Pos logit contributions
 If+0.005
 apart+0.004
 Instead+0.004
 They+0.004
 So+0.004
Neg logit contributions
pron-0.008
kin-0.007
fish-0.007
 happened-0.007
 lifetime-0.007
Token The
Token ID383
Feature activation+6.20e-03

Pos logit contributions
 apart+0.015
licks+0.013
 all+0.012
atters+0.012
late+0.011
Neg logit contributions
pron-0.021
orse-0.019
cle-0.019
light-0.019
kin-0.019
Token or
Token ID393
Feature activation+2.60e-02

Pos logit contributions
pron+0.012
orse+0.011
kin+0.011
light+0.010
iest+0.010
Neg logit contributions
 If-0.008
 They-0.008
 Instead-0.007
 zo-0.007
 apart-0.007
Tokennam
Token ID7402
Feature activation-8.47e-02

Pos logit contributions
pron+0.060
kin+0.056
fish+0.053
 lifetime+0.053
orse+0.053
Neg logit contributions
 If-0.028
 apart-0.025
 Instead-0.024
 They-0.024
licks-0.021
Tokenents
Token ID658
Feature activation-2.50e-02

Pos logit contributions
 If+0.090
 They+0.080
 Instead+0.080
 apart+0.073
 Other+0.073
Neg logit contributions
pron-0.188
kin-0.167
light-0.166
fish-0.162
orse-0.161
Token were
Token ID547
Feature activation-1.53e-02

Pos logit contributions
 If+0.021
 apart+0.019
 They+0.018
 Instead+0.016
icks+0.016
Neg logit contributions
pron-0.065
 lifetime-0.058
cle-0.056
kin-0.056
light-0.055
Token very
Token ID845
Feature activation-4.89e-03

Pos logit contributions
 If+0.018
 Instead+0.017
 apart+0.017
 They+0.016
licks+0.015
Neg logit contributions
pron-0.033
kin-0.030
fish-0.029
 lifetime-0.028
iest-0.028
Token important
Token ID1593
Feature activation-1.78e-02

Pos logit contributions
 If+0.007
 Instead+0.006
 They+0.006
 apart+0.006
licks+0.006
Neg logit contributions
pron-0.010
kin-0.009
fish-0.008
 lifetime-0.008
iest-0.008
Token to
Token ID284
Feature activation-4.19e-03

Pos logit contributions
 If+0.023
 They+0.021
 Instead+0.021
 apart+0.020
 So+0.019
Neg logit contributions
pron-0.039
kin-0.034
fish-0.033
orse-0.032
 lifetime-0.032
Token time
Token ID640
Feature activation+7.46e-03

Pos logit contributions
 If+0.003
 Instead+0.003
 They+0.003
 So+0.003
 Even+0.003
Neg logit contributions
pron-0.002
kin-0.001
fish-0.001
iest-0.001
 lifetime-0.001
Token there
Token ID612
Feature activation-3.04e-02

Pos logit contributions
pron+0.019
kin+0.016
 lifetime+0.016
fish+0.015
cle+0.015
Neg logit contributions
 If-0.008
 They-0.007
 Instead-0.007
 So-0.006
 Even-0.006
Token was
Token ID373
Feature activation+1.00e-02

Pos logit contributions
 They+0.037
 Instead+0.036
 All+0.032
 If+0.032
 He+0.032
Neg logit contributions
pron-0.074
 lifetime-0.065
cle-0.061
fish-0.060
kin-0.059
Token a
Token ID257
Feature activation-1.40e-03

Pos logit contributions
pron+0.024
kin+0.022
fish+0.022
 lifetime+0.022
iest+0.021
Neg logit contributions
 If-0.009
 They-0.009
 Instead-0.008
 apart-0.007
 Other-0.007
Token little
Token ID1310
Feature activation-1.84e-02

Pos logit contributions
 If+0.002
 Instead+0.002
 They+0.002
 apart+0.002
atters+0.002
Neg logit contributions
pron-0.003
kin-0.003
fish-0.002
iest-0.002
 lifetime-0.002
Token av
Token ID1196
Feature activation-9.45e-02

Pos logit contributions
 If+0.034
 Instead+0.032
 apart+0.031
 They+0.030
licks+0.030
Neg logit contributions
pron-0.028
kin-0.024
fish-0.023
iest-0.022
 lifetime-0.022
Tokenac
Token ID330
Feature activation-2.27e-02

Pos logit contributions
 If+0.187
 Instead+0.168
 apart+0.166
 They+0.166
licks+0.156
Neg logit contributions
pron-0.137
fish-0.119
orse-0.119
light-0.118
kin-0.116
Tokenado
Token ID4533
Feature activation+3.65e-03

Pos logit contributions
 If+0.051
 Instead+0.049
 They+0.047
 apart+0.047
licks+0.046
Neg logit contributions
pron-0.025
kin-0.021
fish-0.019
 lifetime-0.017
iest-0.017
Token.
Token ID13
Feature activation+5.05e-04

Pos logit contributions
pron+0.007
kin+0.006
fish+0.006
 lifetime+0.006
iest+0.006
Neg logit contributions
 If-0.005
 Instead-0.005
 They-0.005
 apart-0.005
licks-0.004
Token It
Token ID632
Feature activation-2.36e-03

Pos logit contributions
pron+0.001
kin+0.001
light+0.001
 happened+0.001
 particular+0.001
Neg logit contributions
 apart-0.000
 If-0.000
atters-0.000
licks-0.000
icks-0.000
Token was
Token ID373
Feature activation+9.12e-03

Pos logit contributions
 They+0.004
 If+0.004
 Instead+0.004
 Even+0.003
 So+0.003
Neg logit contributions
pron-0.004
kin-0.004
 lifetime-0.003
iest-0.003
cle-0.003
Token.
Token ID13
Feature activation+1.58e-03

Pos logit contributions
pron+0.020
kin+0.018
fish+0.017
light+0.017
orse+0.017
Neg logit contributions
 If-0.017
 Instead-0.016
 apart-0.016
licks-0.015
 They-0.015
Token She
Token ID1375
Feature activation+1.10e-02

Pos logit contributions
pron+0.003
kin+0.003
 moment+0.003
 particular+0.003
light+0.003
Neg logit contributions
 apart-0.001
icks-0.001
 If-0.001
licks-0.001
 wre-0.001
Token loved
Token ID6151
Feature activation+7.79e-03

Pos logit contributions
pron+0.022
kin+0.020
fish+0.019
 lifetime+0.019
iest+0.019
Neg logit contributions
 If-0.014
 They-0.014
 Instead-0.013
 apart-0.012
atters-0.012
Token to
Token ID284
Feature activation+1.24e-02

Pos logit contributions
pron+0.016
kin+0.013
fish+0.013
 happened+0.013
 lifetime+0.013
Neg logit contributions
 If-0.010
 Instead-0.010
 apart-0.009
atters-0.009
licks-0.009
Token eat
Token ID4483
Feature activation-2.30e-03

Pos logit contributions
pron+0.026
kin+0.024
fish+0.023
orse+0.022
iest+0.022
Neg logit contributions
 If-0.016
 Instead-0.014
 apart-0.014
licks-0.014
 They-0.013
Token av
Token ID1196
Feature activation-9.72e-02

Pos logit contributions
 If+0.004
 Instead+0.004
atters+0.003
 apart+0.003
licks+0.003
Neg logit contributions
pron-0.004
kin-0.003
fish-0.003
iest-0.003
 happened-0.003
Tokenoc
Token ID420
Feature activation-3.46e-02

Pos logit contributions
 If+0.186
 They+0.168
 blamed+0.165
 apart+0.164
 Instead+0.161
Neg logit contributions
pron-0.138
light-0.117
b-0.116
iflower-0.115
cle-0.114
Tokenados
Token ID22484
Feature activation-3.60e-02

Pos logit contributions
 If+0.061
 They+0.059
 apart+0.056
atters+0.055
 Instead+0.055
Neg logit contributions
pron-0.048
gh-0.043
iflower-0.043
cle-0.042
 happened-0.042
Token because
Token ID780
Feature activation+2.51e-02

Pos logit contributions
 If+0.053
 Instead+0.048
atters+0.047
 They+0.045
licks+0.045
Neg logit contributions
pron-0.068
kin-0.061
fish-0.060
iest-0.057
light-0.056
Token they
Token ID484
Feature activation+1.22e-02

Pos logit contributions
pron+0.037
kin+0.032
fish+0.030
 lifetime+0.028
iest+0.028
Neg logit contributions
 If-0.047
 Instead-0.044
 apart-0.043
 They-0.043
licks-0.042
Token were
Token ID547
Feature activation-2.61e-03

Pos logit contributions
pron+0.020
kin+0.018
fish+0.017
 happened+0.016
iest+0.016
Neg logit contributions
 If-0.022
 Instead-0.020
atters-0.019
 apart-0.019
icks-0.019
Token He
Token ID679
Feature activation-4.71e-03

Pos logit contributions
pron+0.011
kin+0.010
orse+0.010
 lifetime+0.010
 happened+0.009
Neg logit contributions
 apart-0.004
licks-0.004
 If-0.004
atters-0.004
icks-0.004
Token quickly
Token ID2952
Feature activation+1.25e-02

Pos logit contributions
 If+0.005
 Instead+0.005
 They+0.005
 apart+0.005
 So+0.005
Neg logit contributions
pron-0.010
kin-0.009
iest-0.009
cle-0.008
 lifetime-0.008
Token hung
Token ID9174
Feature activation-2.08e-03

Pos logit contributions
pron+0.025
kin+0.023
fish+0.022
iest+0.021
 lifetime+0.021
Neg logit contributions
 If-0.016
 Instead-0.015
 apart-0.015
 They-0.015
atters-0.014
Token colorful
Token ID20239
Feature activation-1.02e-02

Pos logit contributions
 If+0.002
 apart+0.002
 They+0.002
 Instead+0.002
licks+0.002
Neg logit contributions
pron-0.005
 lifetime-0.004
orse-0.004
cle-0.004
kin-0.004
Token or
Token ID393
Feature activation+1.96e-02

Pos logit contributions
 apart+0.013
 They+0.012
 If+0.012
 Instead+0.012
licks+0.011
Neg logit contributions
pron-0.021
kin-0.019
cle-0.018
orse-0.018
 lifetime-0.017
Tokennam
Token ID7402
Feature activation-7.59e-02

Pos logit contributions
pron+0.045
kin+0.042
fish+0.041
iest+0.040
orse+0.040
Neg logit contributions
 If-0.019
 apart-0.019
 Instead-0.017
 They-0.017
licks-0.015
Tokenents
Token ID658
Feature activation-2.11e-02

Pos logit contributions
 If+0.087
 They+0.073
 Instead+0.073
 Enjoy+0.070
icks+0.069
Neg logit contributions
pron-0.163
light-0.151
kin-0.145
orse-0.141
fish-0.141
Token and
Token ID290
Feature activation+1.58e-02

Pos logit contributions
 apart+0.023
 If+0.019
 all+0.018
 They+0.018
licks+0.017
Neg logit contributions
pron-0.050
cle-0.045
 lifetime-0.045
orse-0.043
kin-0.043
Token tw
Token ID665
Feature activation+1.01e-02

Pos logit contributions
pron+0.030
kin+0.027
fish+0.026
iest+0.025
cle+0.025
Neg logit contributions
 If-0.022
 apart-0.021
 Instead-0.021
 They-0.020
licks-0.019
Tokenink
Token ID676
Feature activation-8.58e-05

Pos logit contributions
pron+0.013
kin+0.011
 lifetime+0.011
fish+0.010
 moment+0.010
Neg logit contributions
 If-0.020
atters-0.019
 Instead-0.019
icks-0.019
licks-0.019
Tokenling
Token ID1359
Feature activation+1.07e-02

Pos logit contributions
 apart+0.000
 all+0.000
 If+0.000
 They+0.000
 Instead+0.000
Neg logit contributions
pron-0.000
 lifetime-0.000
cle-0.000
 particular-0.000
kin-0.000