TOP ACTIVATIONS
MAX = 4.171

old lady with a mis chie vous tw inkle in her
. They put them on opposite sides of the room and
It was a big j igsaw puzzle and she wanted to
went on with his mis chie vous ways , causing problems
. He ran in the opposite direction of the earthquake and
The little boy smiled mis chie v ously and said ,
and replied with a mis chie vous gl int in his
at him with a mis chie vous grin . She said
is trying to share their talent . She says everyone has
pushed the lock in the opposite direction of the way it
them up and bundled the group of rabbits in with them
would take out a j igsaw puzzle and try to solve
Jack went off in the opposite direction . When
was able to share her talent with the world ! <|endoftext|>
The town spe ople had to find a
Maria ran over to the group and admired their fair play
can I sit in the new chair ?" Tim my asked
anymore . They sat on opposite sides of the room and
breeze blew her in the opposite direction of the cat .
The cat grinned mis chie v ously . " Never

MIN ACTIVATIONS
MIN = -3.913

y ummy and made her throat feel better . She hugged
- it was bigger and ju ic ier than any he
smells bad and makes him cough . But he likes the
good . It made her throat feel less sore . She
tongue feel dry and her throat hurt . L
M um and Dad were c udd ling on the arm
air . It made her throat hurt and her eyes sting
It made him spit and cough . It made him cry
and bitter and makes them cough . They spit it out
is salty and make them cough . They are not happy
made the flowers and trees grow big and strong . In
itch . He feels his throat swell . He feels his
covered Tom and made him cough . He tried to run
young girl had caught a severe case of hyp other mia
the plants and watch them grow big and tall .
was bitter and made them cough . " Wow
was the sweet est and ju ic iest strawberry she had
She found the biggest and ju ic iest one , but
cricket woke up with a sore throat and couldn 't sing
ummy . It made her tongue pink . She l icked

INTERVAL 2.150 - 4.171
CONTAINS 0.118%

J ill put on her wide uniform and looked in the
Let 's look in the wild fl owers ."
home and Sam could barely contain his impat ience as he
to be available to help others . From that day on
he remembered the brav est thing to do was to stay

INTERVAL 0.129 - 2.150
CONTAINS 40.312%

." Tom followed Anna to the kitchen . He
of Years . I can give you the wisdom of all
, there was a dog named Max . Max was a
the stone in her pocket and went to play with her
't wait to get there . When she arrived , the

INTERVAL -1.892 - 0.129
CONTAINS 59.318%

find treasure or animals ." Max nodded . He
When they got to the zoo , they saw
and stretch and stretch . The little girl couldn 't believe
. " Good night , Sam ," Tom says
loved the soup and said , " Thank you , Mia

INTERVAL -3.913 - -1.892
CONTAINS 0.252%

good . She had a sore throat and a fever .
, there was a big pig . The big pig had
it ? It was an egg ! They stopped and looked
Pam was lying in her bed . It was a normal
then she saw a spoiled egg on the ground . "
Token old
Token ID1468
Feature activation+1.99e+00

Pos logit contributions
ucker+0.011
please+0.010
eter+0.010
 unnecessary+0.010
ounced+0.010
Neg logit contributions
nam-0.009
 globe-0.009
 tripod-0.009
pe-0.009
book-0.009
Token lady
Token ID10846
Feature activation-1.64e-03

Pos logit contributions
nam+0.047
lf+0.046
kins+0.046
 Marie+0.045
 globe+0.044
Neg logit contributions
 unnecessary-0.052
please-0.050
ucker-0.049
eter-0.049
aa-0.047
Token with
Token ID351
Feature activation+8.90e-01

Pos logit contributions
 unnecessary+0.000
eter+0.000
aa+0.000
please+0.000
ch+0.000
Neg logit contributions
nam-0.000
kins-0.000
lf-0.000
 Marie-0.000
 tripod-0.000
Token a
Token ID257
Feature activation-7.88e-01

Pos logit contributions
nam+0.020
kins+0.020
lf+0.019
 Marie+0.019
 globe+0.018
Neg logit contributions
please-0.024
 unnecessary-0.024
eter-0.023
ucker-0.023
aa-0.023
Token mis
Token ID2984
Feature activation+7.15e-02

Pos logit contributions
eter+0.019
please+0.018
ucker+0.018
 unnecessary+0.018
ead+0.018
Neg logit contributions
nam-0.020
 telescope-0.019
 tripod-0.019
 Marie-0.019
 globe-0.019
Tokenchie
Token ID3043
Feature activation+4.17e+00

Pos logit contributions
nam+0.001
kins+0.001
lf+0.001
 Marie+0.001
book+0.001
Neg logit contributions
 unnecessary-0.002
ucker-0.002
please-0.002
eter-0.002
aa-0.002
Tokenvous
Token ID31222
Feature activation-1.04e+00

Pos logit contributions
nam+0.160
kins+0.151
 globe+0.147
lf+0.146
 tripod+0.144
Neg logit contributions
ch-0.067
aa-0.061
please-0.060
 unnecessary-0.055
eter-0.050
Token tw
Token ID665
Feature activation+2.00e-01

Pos logit contributions
please+0.026
 unnecessary+0.026
ucker+0.026
eter+0.026
ounced+0.024
Neg logit contributions
 Marie-0.024
book-0.023
nam-0.023
 tripod-0.023
kins-0.023
Tokeninkle
Token ID19894
Feature activation-1.13e+00

Pos logit contributions
nam+0.005
kins+0.005
lf+0.005
 Marie+0.005
 globe+0.005
Neg logit contributions
 unnecessary-0.005
please-0.005
eter-0.005
aa-0.004
ucker-0.004
Token in
Token ID287
Feature activation+3.09e-01

Pos logit contributions
 unnecessary+0.029
please+0.028
eter+0.028
aa+0.028
ucker+0.028
Neg logit contributions
nam-0.027
kins-0.026
lf-0.026
 Marie-0.025
 globe-0.025
Token her
Token ID607
Feature activation+4.27e-01

Pos logit contributions
nam+0.007
kins+0.007
lf+0.007
 Marie+0.007
 globe+0.007
Neg logit contributions
 unnecessary-0.008
please-0.008
eter-0.008
aa-0.008
ucker-0.008
Token.
Token ID13
Feature activation-3.18e-01

Pos logit contributions
nam+0.039
lf+0.037
kins+0.037
 Marie+0.036
atively+0.036
Neg logit contributions
 unnecessary-0.031
please-0.030
ch-0.030
eter-0.030
aa-0.030
Token They
Token ID1119
Feature activation-5.89e-02

Pos logit contributions
 unnecessary+0.009
please+0.009
eter+0.009
aa+0.008
ch+0.008
Neg logit contributions
 Marie-0.007
nam-0.007
kins-0.007
lf-0.007
book-0.006
Token put
Token ID1234
Feature activation+1.44e-02

Pos logit contributions
please+0.001
aa+0.001
 unnecessary+0.001
ch+0.001
eter+0.001
Neg logit contributions
kins-0.001
nam-0.001
lf-0.001
 Marie-0.001
 globe-0.001
Token them
Token ID606
Feature activation+2.06e-01

Pos logit contributions
lf+0.000
nam+0.000
kins+0.000
 Marie+0.000
atively+0.000
Neg logit contributions
 unnecessary-0.000
ch-0.000
eter-0.000
aa-0.000
please-0.000
Token on
Token ID319
Feature activation+1.12e+00

Pos logit contributions
 Marie+0.007
lf+0.007
 Laura+0.006
 tripod+0.006
atively+0.006
Neg logit contributions
ch-0.004
 unnecessary-0.003
aa-0.003
 cho-0.003
eter-0.003
Token opposite
Token ID6697
Feature activation+4.15e+00

Pos logit contributions
kins+0.032
nam+0.032
ophone+0.031
igator+0.030
lf+0.030
Neg logit contributions
 trouble-0.023
eter-0.022
 unnecessary-0.021
 fle-0.021
ch-0.021
Token sides
Token ID5389
Feature activation+2.41e+00

Pos logit contributions
nam+0.120
 globe+0.104
 tripod+0.103
 Marie+0.102
kins+0.100
Neg logit contributions
 trouble-0.091
eter-0.091
 unnecessary-0.088
ch-0.087
 tum-0.084
Token of
Token ID286
Feature activation+9.44e-01

Pos logit contributions
nam+0.085
lf+0.077
 tripod+0.076
kins+0.076
 Marie+0.075
Neg logit contributions
eter-0.040
aa-0.040
ch-0.039
ucker-0.038
ead-0.038
Token the
Token ID262
Feature activation+1.55e+00

Pos logit contributions
ophone+0.029
nam+0.028
kins+0.028
atively+0.028
taker+0.027
Neg logit contributions
 trouble-0.017
 unnecessary-0.015
ch-0.015
eter-0.014
 mischief-0.014
Token room
Token ID2119
Feature activation+3.76e-01

Pos logit contributions
nam+0.039
kins+0.035
atively+0.035
 Marie+0.034
lf+0.033
Neg logit contributions
 trouble-0.039
ch-0.039
 unnecessary-0.038
 fever-0.037
 fle-0.036
Token and
Token ID290
Feature activation-8.02e-02

Pos logit contributions
nam+0.012
igator+0.011
 tripod+0.011
 Marie+0.011
lf+0.011
Neg logit contributions
eter-0.007
ead-0.007
 trouble-0.006
 tum-0.006
aa-0.006
Token It
Token ID632
Feature activation+4.39e-01

Pos logit contributions
eter+0.000
please+0.000
ucker+0.000
 unnecessary+0.000
ead+0.000
Neg logit contributions
nam-0.001
kins-0.000
lf-0.000
 globe-0.000
 tripod-0.000
Token was
Token ID373
Feature activation+1.02e-01

Pos logit contributions
nam+0.012
kins+0.011
lf+0.011
 Marie+0.011
 globe+0.011
Neg logit contributions
please-0.010
eter-0.010
 unnecessary-0.010
ch-0.010
aa-0.010
Token a
Token ID257
Feature activation+4.41e-01

Pos logit contributions
nam+0.002
 Marie+0.002
lf+0.002
kins+0.002
book+0.002
Neg logit contributions
 unnecessary-0.003
aa-0.003
please-0.003
eter-0.003
ucker-0.003
Token big
Token ID1263
Feature activation+9.35e-01

Pos logit contributions
nam+0.012
lf+0.011
 globe+0.011
kins+0.011
 Marie+0.011
Neg logit contributions
 unnecessary-0.011
eter-0.010
please-0.010
ucker-0.010
aa-0.010
Token j
Token ID474
Feature activation+1.25e+00

Pos logit contributions
nam+0.022
lf+0.021
 globe+0.021
 Marie+0.021
kins+0.021
Neg logit contributions
 unnecessary-0.026
please-0.025
ucker-0.024
aa-0.024
eter-0.024
Tokenigsaw
Token ID45636
Feature activation+4.15e+00

Pos logit contributions
nam+0.037
kins+0.035
lf+0.034
 Marie+0.033
 globe+0.033
Neg logit contributions
aa-0.027
ch-0.026
eter-0.026
ucker-0.025
 unnecessary-0.025
Token puzzle
Token ID15027
Feature activation+9.52e-01

Pos logit contributions
nam+0.115
ophone+0.100
atively+0.099
 tripod+0.099
iece+0.098
Neg logit contributions
ch-0.092
umm-0.086
 fle-0.083
 problem-0.082
eter-0.081
Token and
Token ID290
Feature activation+1.43e-01

Pos logit contributions
nam+0.030
kins+0.028
 tripod+0.027
 Marie+0.027
atively+0.027
Neg logit contributions
ch-0.017
eter-0.017
please-0.016
 trouble-0.015
ead-0.015
Token she
Token ID673
Feature activation-2.58e-01

Pos logit contributions
nam+0.003
kins+0.003
 globe+0.003
lf+0.003
 tripod+0.003
Neg logit contributions
please-0.003
eter-0.003
 unnecessary-0.003
ch-0.003
ucker-0.003
Token wanted
Token ID2227
Feature activation-2.68e-01

Pos logit contributions
please+0.006
eter+0.006
 unnecessary+0.006
aa+0.006
ucker+0.006
Neg logit contributions
nam-0.007
kins-0.007
 Marie-0.006
lf-0.006
 globe-0.006
Token to
Token ID284
Feature activation-2.76e-01

Pos logit contributions
ucker+0.005
aa+0.005
 unnecessary+0.005
please+0.005
eter+0.005
Neg logit contributions
nam-0.008
 Marie-0.008
lf-0.008
kins-0.007
book-0.007
Token went
Token ID1816
Feature activation+9.04e-01

Pos logit contributions
nam+0.017
lf+0.016
kins+0.016
 Marie+0.016
atively+0.015
Neg logit contributions
 unnecessary-0.020
eter-0.019
ucker-0.019
please-0.019
ead-0.018
Token on
Token ID319
Feature activation+7.85e-01

Pos logit contributions
nam+0.030
kins+0.030
lf+0.029
 Marie+0.029
 globe+0.028
Neg logit contributions
 unnecessary-0.015
please-0.015
aa-0.014
eter-0.014
ch-0.014
Token with
Token ID351
Feature activation+8.07e-01

Pos logit contributions
nam+0.018
kins+0.018
lf+0.017
 Marie+0.017
 globe+0.017
Neg logit contributions
 unnecessary-0.021
please-0.021
eter-0.020
aa-0.020
ch-0.020
Token his
Token ID465
Feature activation-4.71e-01

Pos logit contributions
nam+0.016
kins+0.015
lf+0.015
 Marie+0.014
 globe+0.014
Neg logit contributions
ucker-0.025
please-0.025
eter-0.025
 unnecessary-0.024
aa-0.024
Token mis
Token ID2984
Feature activation+8.13e-01

Pos logit contributions
ucker+0.010
eter+0.010
please+0.010
 unnecessary+0.010
ch+0.009
Neg logit contributions
nam-0.013
kins-0.013
lf-0.013
 globe-0.012
 Marie-0.012
Tokenchie
Token ID3043
Feature activation+4.11e+00

Pos logit contributions
nam+0.017
kins+0.016
lf+0.016
 Marie+0.016
book+0.015
Neg logit contributions
 unnecessary-0.022
ucker-0.022
please-0.022
ch-0.021
 fever-0.021
Tokenvous
Token ID31222
Feature activation-1.49e-01

Pos logit contributions
nam+0.159
kins+0.154
 globe+0.152
 telescope+0.149
lf+0.146
Neg logit contributions
ch-0.062
please-0.059
aa-0.057
 unnecessary-0.054
eter-0.051
Token ways
Token ID2842
Feature activation+1.57e+00

Pos logit contributions
ucker+0.004
please+0.004
eter+0.004
 unnecessary+0.003
ch+0.003
Neg logit contributions
kins-0.004
 Marie-0.004
nam-0.004
lf-0.003
atively-0.003
Token,
Token ID11
Feature activation-2.45e-01

Pos logit contributions
nam+0.048
kins+0.047
lf+0.045
 tripod+0.045
 Marie+0.045
Neg logit contributions
 unnecessary-0.029
eter-0.029
aa-0.029
 trouble-0.028
ch-0.028
Token causing
Token ID6666
Feature activation+9.65e-01

Pos logit contributions
 unnecessary+0.007
ucker+0.007
eter+0.007
please+0.007
aa+0.007
Neg logit contributions
nam-0.005
kins-0.005
lf-0.005
 Marie-0.005
 globe-0.005
Token problems
Token ID2761
Feature activation+6.31e-01

Pos logit contributions
nam+0.025
 Marie+0.022
 Laura+0.021
 Anne+0.020
 James+0.020
Neg logit contributions
ucker-0.027
please-0.024
eter-0.024
ounced-0.023
umm-0.023
Token.
Token ID13
Feature activation+6.44e-01

Pos logit contributions
eter+0.009
 unnecessary+0.009
 trouble+0.008
aa+0.008
please+0.008
Neg logit contributions
nam-0.010
kins-0.010
lf-0.010
 Marie-0.010
 tripod-0.010
Token He
Token ID679
Feature activation+7.71e-01

Pos logit contributions
kins+0.017
nam+0.016
lf+0.016
 Marie+0.016
igator+0.015
Neg logit contributions
 unnecessary-0.015
eter-0.015
please-0.015
aa-0.015
ead-0.015
Token ran
Token ID4966
Feature activation+1.11e+00

Pos logit contributions
nam+0.018
kins+0.017
lf+0.017
 Marie+0.016
atively+0.016
Neg logit contributions
 unnecessary-0.021
please-0.020
eter-0.020
ucker-0.020
aa-0.020
Token in
Token ID287
Feature activation+1.26e+00

Pos logit contributions
nam+0.030
kins+0.030
lf+0.029
 Marie+0.028
 globe+0.028
Neg logit contributions
 unnecessary-0.025
please-0.024
eter-0.024
aa-0.024
ch-0.024
Token the
Token ID262
Feature activation+1.05e+00

Pos logit contributions
nam+0.033
kins+0.033
lf+0.031
 Marie+0.030
 globe+0.030
Neg logit contributions
 unnecessary-0.029
 trouble-0.029
eter-0.029
ch-0.028
aa-0.028
Token opposite
Token ID6697
Feature activation+4.09e+00

Pos logit contributions
nam+0.029
kins+0.028
lf+0.028
 Marie+0.027
 globe+0.027
Neg logit contributions
 unnecessary-0.023
please-0.023
eter-0.023
aa-0.023
ucker-0.022
Token direction
Token ID4571
Feature activation+1.42e+00

Pos logit contributions
nam+0.107
kins+0.102
lf+0.100
 tripod+0.098
 globe+0.097
Neg logit contributions
eter-0.099
 unnecessary-0.098
 trouble-0.095
aa-0.092
ucker-0.092
Token of
Token ID286
Feature activation-3.07e-01

Pos logit contributions
nam+0.040
kins+0.039
lf+0.038
 Marie+0.037
 globe+0.036
Neg logit contributions
 unnecessary-0.031
eter-0.030
please-0.030
aa-0.030
ch-0.029
Token the
Token ID262
Feature activation+2.65e-02

Pos logit contributions
ucker+0.009
please+0.009
eter+0.009
aa+0.009
 unnecessary+0.009
Neg logit contributions
nam-0.006
lf-0.006
kins-0.006
 Marie-0.006
 globe-0.005
Token earthquake
Token ID16295
Feature activation+1.72e+00

Pos logit contributions
nam+0.001
lf+0.001
kins+0.001
 globe+0.001
 Marie+0.001
Neg logit contributions
ucker-0.001
eter-0.001
please-0.001
aa-0.001
 unnecessary-0.001
Token and
Token ID290
Feature activation+9.70e-02

Pos logit contributions
nam+0.040
kins+0.040
lf+0.039
 Marie+0.038
 globe+0.037
Neg logit contributions
 unnecessary-0.045
please-0.044
eter-0.043
aa-0.043
ucker-0.043
Token The
Token ID383
Feature activation-6.99e-02

Pos logit contributions
 unnecessary+0.007
please+0.007
eter+0.007
ucker+0.007
aa+0.007
Neg logit contributions
nam-0.007
kins-0.007
lf-0.007
 Marie-0.007
 globe-0.006
Token little
Token ID1310
Feature activation+1.31e+00

Pos logit contributions
eter+0.002
 unnecessary+0.002
ucker+0.002
please+0.002
aa+0.002
Neg logit contributions
lf-0.002
nam-0.002
kins-0.002
 Marie-0.002
atively-0.002
Token boy
Token ID2933
Feature activation+8.49e-01

Pos logit contributions
nam+0.031
 Marie+0.029
kins+0.029
igator+0.028
lf+0.028
Neg logit contributions
 unnecessary-0.036
please-0.035
ucker-0.034
eter-0.033
aa-0.033
Token smiled
Token ID13541
Feature activation+6.74e-01

Pos logit contributions
nam+0.020
kins+0.020
lf+0.019
 globe+0.019
 Marie+0.019
Neg logit contributions
please-0.022
eter-0.021
 unnecessary-0.021
aa-0.021
ucker-0.021
Token mis
Token ID2984
Feature activation-1.88e-01

Pos logit contributions
nam+0.017
kins+0.017
lf+0.017
 Marie+0.016
 globe+0.016
Neg logit contributions
eter-0.016
please-0.016
 unnecessary-0.016
aa-0.016
ucker-0.016
Tokenchie
Token ID3043
Feature activation+4.07e+00

Pos logit contributions
 unnecessary+0.005
please+0.005
ucker+0.005
eter+0.005
aa+0.005
Neg logit contributions
nam-0.005
kins-0.004
lf-0.004
 Marie-0.004
 globe-0.004
Tokenv
Token ID85
Feature activation+1.32e+00

Pos logit contributions
nam+0.151
kins+0.139
lf+0.135
 telescope+0.135
 globe+0.135
Neg logit contributions
ch-0.072
aa-0.068
please-0.066
eter-0.061
ead-0.055
Tokenously
Token ID3481
Feature activation+9.26e-01

Pos logit contributions
nam+0.044
 tripod+0.038
 telescope+0.036
 Marie+0.035
 globe+0.035
Neg logit contributions
aa-0.028
eter-0.027
ucker-0.027
uck-0.026
 trouble-0.024
Token and
Token ID290
Feature activation+6.43e-01

Pos logit contributions
nam+0.026
kins+0.025
lf+0.025
 globe+0.024
 Marie+0.024
Neg logit contributions
aa-0.020
please-0.020
eter-0.019
 unnecessary-0.019
ead-0.019
Token said
Token ID531
Feature activation-9.01e-02

Pos logit contributions
kins+0.017
nam+0.017
 globe+0.017
lf+0.017
book+0.017
Neg logit contributions
please-0.014
aa-0.014
eter-0.014
ch-0.013
 unnecessary-0.013
Token,
Token ID11
Feature activation+1.86e-01

Pos logit contributions
please+0.002
ch+0.002
aa+0.002
 unnecessary+0.002
eter+0.002
Neg logit contributions
nam-0.003
lf-0.003
 tripod-0.003
kins-0.003
 globe-0.003
Token and
Token ID290
Feature activation+1.03e+00

Pos logit contributions
nam+0.023
kins+0.022
lf+0.022
 Marie+0.021
 globe+0.021
Neg logit contributions
 unnecessary-0.022
eter-0.022
please-0.022
aa-0.022
ucker-0.022
Token replied
Token ID8712
Feature activation+1.73e-02

Pos logit contributions
nam+0.026
kins+0.025
lf+0.025
 globe+0.024
 Marie+0.024
Neg logit contributions
please-0.025
 unnecessary-0.025
aa-0.025
eter-0.024
ch-0.024
Token with
Token ID351
Feature activation+8.18e-01

Pos logit contributions
nam+0.000
kins+0.000
lf+0.000
 Marie+0.000
 globe+0.000
Neg logit contributions
 unnecessary-0.000
please-0.000
aa-0.000
eter-0.000
ch-0.000
Token a
Token ID257
Feature activation-6.15e-01

Pos logit contributions
nam+0.020
kins+0.020
lf+0.019
 Marie+0.018
 globe+0.018
Neg logit contributions
 unnecessary-0.021
please-0.020
eter-0.020
ch-0.020
aa-0.020
Token mis
Token ID2984
Feature activation+1.18e-01

Pos logit contributions
 unnecessary+0.015
eter+0.015
please+0.014
ucker+0.014
aa+0.014
Neg logit contributions
nam-0.016
kins-0.015
 Marie-0.015
lf-0.015
 globe-0.015
Tokenchie
Token ID3043
Feature activation+4.05e+00

Pos logit contributions
nam+0.003
kins+0.003
lf+0.003
 Marie+0.002
 globe+0.002
Neg logit contributions
 unnecessary-0.003
please-0.003
eter-0.003
aa-0.003
ucker-0.003
Tokenvous
Token ID31222
Feature activation-1.09e+00

Pos logit contributions
nam+0.163
kins+0.151
lf+0.148
 globe+0.147
 telescope+0.147
Neg logit contributions
ch-0.061
aa-0.054
please-0.052
 unnecessary-0.045
eter-0.043
Token gl
Token ID1278
Feature activation+2.79e-01

Pos logit contributions
please+0.029
 unnecessary+0.029
eter+0.029
ucker+0.028
ounced+0.027
Neg logit contributions
 Marie-0.024
kins-0.023
nam-0.023
atively-0.023
lf-0.023
Tokenint
Token ID600
Feature activation-1.33e+00

Pos logit contributions
nam+0.008
lf+0.007
 Marie+0.007
 tripod+0.006
kins+0.006
Neg logit contributions
aa-0.007
ch-0.007
 unnecessary-0.007
eter-0.007
ucker-0.007
Token in
Token ID287
Feature activation+2.83e-01

Pos logit contributions
 unnecessary+0.024
please+0.024
eter+0.023
aa+0.023
ucker+0.023
Neg logit contributions
nam-0.042
kins-0.041
lf-0.040
 Marie-0.040
 globe-0.039
Token his
Token ID465
Feature activation+8.51e-01

Pos logit contributions
nam+0.006
kins+0.006
lf+0.006
 Marie+0.006
 globe+0.005
Neg logit contributions
 unnecessary-0.008
please-0.008
eter-0.008
ucker-0.008
aa-0.008
Token at
Token ID379
Feature activation+3.76e-01

Pos logit contributions
nam+0.026
 Marie+0.025
kins+0.025
lf+0.024
atively+0.024
Neg logit contributions
please-0.022
 unnecessary-0.022
eter-0.021
aa-0.021
ucker-0.021
Token him
Token ID683
Feature activation+4.27e-01

Pos logit contributions
kins+0.011
nam+0.010
lf+0.010
 Marie+0.010
book+0.010
Neg logit contributions
 unnecessary-0.008
aa-0.008
eter-0.008
please-0.008
ch-0.008
Token with
Token ID351
Feature activation+5.32e-01

Pos logit contributions
nam+0.011
kins+0.011
lf+0.011
 Marie+0.010
 tripod+0.010
Neg logit contributions
eter-0.010
 unnecessary-0.010
ucker-0.010
please-0.009
ead-0.009
Token a
Token ID257
Feature activation-7.94e-01

Pos logit contributions
nam+0.013
 Marie+0.012
kins+0.012
lf+0.011
book+0.011
Neg logit contributions
please-0.014
 unnecessary-0.014
aa-0.014
eter-0.014
ucker-0.014
Token mis
Token ID2984
Feature activation-2.83e-02

Pos logit contributions
 unnecessary+0.021
eter+0.021
please+0.021
aa+0.020
ead+0.019
Neg logit contributions
nam-0.018
kins-0.018
 Marie-0.018
 globe-0.017
 tripod-0.017
Tokenchie
Token ID3043
Feature activation+3.99e+00

Pos logit contributions
 unnecessary+0.001
please+0.001
eter+0.001
ucker+0.001
aa+0.001
Neg logit contributions
nam-0.001
kins-0.001
lf-0.001
 Marie-0.001
 tripod-0.001
Tokenvous
Token ID31222
Feature activation-1.20e+00

Pos logit contributions
nam+0.157
kins+0.145
 globe+0.143
lf+0.142
 telescope+0.139
Neg logit contributions
ch-0.059
aa-0.056
please-0.056
eter-0.048
 unnecessary-0.046
Token grin
Token ID16770
Feature activation-4.90e-01

Pos logit contributions
 unnecessary+0.034
please+0.034
eter+0.033
ucker+0.032
ch+0.032
Neg logit contributions
 Marie-0.025
kins-0.024
nam-0.024
 tripod-0.023
atively-0.023
Token.
Token ID13
Feature activation+2.90e-01

Pos logit contributions
aa+0.012
please+0.011
ch+0.011
 unnecessary+0.011
eter+0.011
Neg logit contributions
nam-0.013
 Marie-0.012
kins-0.012
lf-0.012
atively-0.011
Token She
Token ID1375
Feature activation+6.89e-01

Pos logit contributions
nam+0.007
kins+0.007
lf+0.007
 Marie+0.006
 globe+0.006
Neg logit contributions
 unnecessary-0.008
please-0.007
eter-0.007
aa-0.007
ucker-0.007
Token said
Token ID531
Feature activation-4.09e-01

Pos logit contributions
nam+0.015
lf+0.015
kins+0.015
 Marie+0.015
atively+0.014
Neg logit contributions
 unnecessary-0.020
eter-0.019
please-0.018
aa-0.018
ucker-0.018
Token is
Token ID318
Feature activation+5.12e-01

Pos logit contributions
nam+0.011
kins+0.011
lf+0.010
 Marie+0.010
 globe+0.010
Neg logit contributions
 unnecessary-0.009
please-0.009
eter-0.009
aa-0.009
ucker-0.009
Token trying
Token ID2111
Feature activation+1.20e-01

Pos logit contributions
nam+0.013
kins+0.012
 Marie+0.012
lf+0.012
 globe+0.011
Neg logit contributions
 unnecessary-0.013
eter-0.012
please-0.012
aa-0.012
ch-0.012
Token to
Token ID284
Feature activation-3.92e-01

Pos logit contributions
nam+0.004
kins+0.004
 Marie+0.004
lf+0.004
 globe+0.004
Neg logit contributions
please-0.002
 unnecessary-0.002
eter-0.002
aa-0.002
ucker-0.002
Token share
Token ID2648
Feature activation-2.26e-01

Pos logit contributions
 unnecessary+0.010
ucker+0.010
ounced+0.010
eter+0.010
aa+0.009
Neg logit contributions
nam-0.009
lf-0.009
kins-0.009
 Marie-0.009
pe-0.009
Token their
Token ID511
Feature activation+3.58e-01

Pos logit contributions
 unnecessary+0.006
eter+0.006
please+0.005
ucker+0.005
aa+0.005
Neg logit contributions
nam-0.006
kins-0.005
lf-0.005
atively-0.005
 Marie-0.005
Token talent
Token ID7401
Feature activation+3.94e+00

Pos logit contributions
nam+0.010
atively+0.010
igator+0.010
lf+0.010
kins+0.010
Neg logit contributions
 unnecessary-0.008
 trouble-0.007
eter-0.007
please-0.007
 fle-0.007
Token.
Token ID13
Feature activation-1.13e-01

Pos logit contributions
nam+0.137
atively+0.125
 tripod+0.125
igator+0.118
 Becky+0.115
Neg logit contributions
eter-0.056
 trouble-0.054
please-0.052
ead-0.052
ch-0.051
Token She
Token ID1375
Feature activation-3.54e-01

Pos logit contributions
 unnecessary+0.003
eter+0.003
please+0.003
ucker+0.003
aa+0.003
Neg logit contributions
nam-0.003
kins-0.003
lf-0.003
 globe-0.003
 Marie-0.003
Token says
Token ID1139
Feature activation-3.86e-01

Pos logit contributions
 unnecessary+0.008
please+0.008
ucker+0.008
eter+0.008
aa+0.008
Neg logit contributions
nam-0.010
kins-0.009
lf-0.009
 Marie-0.009
 globe-0.009
Token everyone
Token ID2506
Feature activation+8.54e-01

Pos logit contributions
 unnecessary+0.009
please+0.009
eter+0.009
aa+0.009
ucker+0.009
Neg logit contributions
nam-0.010
kins-0.010
lf-0.010
 Marie-0.009
 globe-0.009
Token has
Token ID468
Feature activation-7.74e-01

Pos logit contributions
nam+0.025
kins+0.024
lf+0.024
 Marie+0.023
atively+0.023
Neg logit contributions
 unnecessary-0.018
aa-0.017
please-0.017
ucker-0.017
eter-0.017
Token pushed
Token ID7121
Feature activation+1.52e+00

Pos logit contributions
nam+0.011
kins+0.011
lf+0.011
 Marie+0.011
 globe+0.011
Neg logit contributions
 unnecessary-0.011
please-0.011
eter-0.011
aa-0.011
ch-0.011
Token the
Token ID262
Feature activation+3.73e-01

Pos logit contributions
nam+0.038
kins+0.037
lf+0.036
 Marie+0.035
 globe+0.035
Neg logit contributions
 unnecessary-0.038
please-0.037
eter-0.037
aa-0.037
ch-0.036
Token lock
Token ID5793
Feature activation+5.93e-01

Pos logit contributions
nam+0.009
kins+0.008
lf+0.008
 Marie+0.008
 globe+0.008
Neg logit contributions
 unnecessary-0.010
please-0.010
eter-0.010
aa-0.010
ucker-0.010
Token in
Token ID287
Feature activation+1.08e+00

Pos logit contributions
kins+0.018
nam+0.018
 tripod+0.018
lf+0.017
 Marie+0.017
Neg logit contributions
ch-0.011
eter-0.011
 unnecessary-0.010
aa-0.010
ead-0.010
Token the
Token ID262
Feature activation+9.12e-01

Pos logit contributions
kins+0.028
nam+0.028
lf+0.027
ophone+0.027
igator+0.027
Neg logit contributions
 trouble-0.025
 unnecessary-0.025
eter-0.025
ch-0.023
please-0.023
Token opposite
Token ID6697
Feature activation+3.85e+00

Pos logit contributions
nam+0.021
kins+0.020
lf+0.020
 Marie+0.019
 globe+0.019
Neg logit contributions
 unnecessary-0.025
please-0.025
eter-0.024
ucker-0.024
aa-0.024
Token direction
Token ID4571
Feature activation+1.52e+00

Pos logit contributions
nam+0.103
kins+0.097
lf+0.095
 Marie+0.094
 tripod+0.093
Neg logit contributions
eter-0.092
 unnecessary-0.090
aa-0.087
 trouble-0.087
ch-0.086
Token of
Token ID286
Feature activation+3.94e-02

Pos logit contributions
nam+0.042
kins+0.041
lf+0.040
 Marie+0.039
 tripod+0.039
Neg logit contributions
eter-0.034
 unnecessary-0.033
ch-0.032
aa-0.032
ead-0.031
Token the
Token ID262
Feature activation+1.99e-01

Pos logit contributions
nam+0.001
kins+0.001
lf+0.001
 Marie+0.001
 globe+0.001
Neg logit contributions
 unnecessary-0.001
please-0.001
eter-0.001
aa-0.001
ucker-0.001
Token way
Token ID835
Feature activation+1.27e+00

Pos logit contributions
nam+0.005
lf+0.005
kins+0.005
 globe+0.004
 telescope+0.004
Neg logit contributions
ucker-0.006
eter-0.005
please-0.005
 unnecessary-0.005
aa-0.005
Token it
Token ID340
Feature activation+7.96e-01

Pos logit contributions
nam+0.036
kins+0.036
lf+0.035
 Marie+0.034
 tripod+0.034
Neg logit contributions
eter-0.026
 unnecessary-0.026
aa-0.026
ch-0.026
ead-0.024
Token them
Token ID606
Feature activation+1.07e+00

Pos logit contributions
nam+0.024
lf+0.023
kins+0.023
 Marie+0.022
 tripod+0.022
Neg logit contributions
 unnecessary-0.018
ch-0.017
eter-0.017
please-0.017
aa-0.017
Token up
Token ID510
Feature activation+5.60e-01

Pos logit contributions
lf+0.036
 tripod+0.036
nam+0.035
 Marie+0.034
 Laura+0.034
Neg logit contributions
eter-0.017
ch-0.017
 unnecessary-0.016
ead-0.016
ounced-0.015
Token and
Token ID290
Feature activation+1.73e-01

Pos logit contributions
 tripod+0.017
lf+0.017
nam+0.017
ophone+0.016
kins+0.016
Neg logit contributions
eter-0.011
ch-0.010
ucker-0.010
please-0.010
ounced-0.010
Token bundled
Token ID34358
Feature activation+3.86e-01

Pos logit contributions
nam+0.005
kins+0.004
lf+0.004
 Marie+0.004
 globe+0.004
Neg logit contributions
please-0.004
 unnecessary-0.004
eter-0.004
aa-0.004
ch-0.004
Token the
Token ID262
Feature activation-9.38e-01

Pos logit contributions
nam+0.009
lf+0.009
kins+0.009
 Marie+0.009
 tripod+0.009
Neg logit contributions
 unnecessary-0.010
please-0.009
eter-0.009
aa-0.009
ch-0.009
Token group
Token ID1448
Feature activation+3.78e+00

Pos logit contributions
ucker+0.025
please+0.023
eter+0.023
 unnecessary+0.023
aa+0.023
Neg logit contributions
nam-0.023
kins-0.022
 globe-0.021
atively-0.021
igator-0.021
Token of
Token ID286
Feature activation-9.89e-01

Pos logit contributions
nam+0.126
kins+0.118
lf+0.117
 tripod+0.116
 keyboard+0.112
Neg logit contributions
eter-0.062
ch-0.058
less-0.056
ead-0.055
 tum-0.053
Token rabbits
Token ID33043
Feature activation+1.52e+00

Pos logit contributions
ucker+0.028
eter+0.028
please+0.028
ounced+0.026
ead+0.026
Neg logit contributions
nam-0.022
igator-0.021
kins-0.021
 telescope-0.020
 Becky-0.019
Token in
Token ID287
Feature activation+5.75e-01

Pos logit contributions
lf+0.052
nam+0.049
anie+0.049
Lisa+0.048
Anne+0.048
Neg logit contributions
eter-0.021
ch-0.021
ead-0.021
 home-0.020
 ac-0.020
Token with
Token ID351
Feature activation+9.76e-01

Pos logit contributions
nam+0.014
kins+0.014
lf+0.013
 Marie+0.013
atively+0.013
Neg logit contributions
 unnecessary-0.015
eter-0.014
please-0.014
ch-0.014
aa-0.014
Token them
Token ID606
Feature activation+1.04e+00

Pos logit contributions
nam+0.023
kins+0.022
 Marie+0.021
lf+0.021
 globe+0.020
Neg logit contributions
please-0.026
 unnecessary-0.026
ucker-0.026
eter-0.025
aa-0.025
Token would
Token ID561
Feature activation+1.28e-02

Pos logit contributions
please+0.006
ch+0.006
eter+0.006
 unnecessary+0.006
aa+0.005
Neg logit contributions
nam-0.009
kins-0.009
 Marie-0.008
 tripod-0.008
atively-0.008
Token take
Token ID1011
Feature activation-3.16e-01

Pos logit contributions
kins+0.000
nam+0.000
 tripod+0.000
 Marie+0.000
 globe+0.000
Neg logit contributions
please-0.000
 fever-0.000
ead-0.000
eter-0.000
ch-0.000
Token out
Token ID503
Feature activation+3.62e-03

Pos logit contributions
 unnecessary+0.008
please+0.008
eter+0.008
ch+0.008
aa+0.008
Neg logit contributions
nam-0.007
kins-0.007
lf-0.007
 Marie-0.007
 globe-0.007
Token a
Token ID257
Feature activation-3.16e-01

Pos logit contributions
 tripod+0.000
 globe+0.000
kins+0.000
atively+0.000
lf+0.000
Neg logit contributions
 trouble-0.000
 unnecessary-0.000
ch-0.000
eter-0.000
ead-0.000
Token j
Token ID474
Feature activation+6.55e-01

Pos logit contributions
 unnecessary+0.007
ch+0.007
please+0.007
aa+0.007
eter+0.007
Neg logit contributions
kins-0.008
nam-0.008
lf-0.008
atively-0.008
 Marie-0.008
Tokenigsaw
Token ID45636
Feature activation+3.77e+00

Pos logit contributions
nam+0.018
kins+0.018
lf+0.017
 Marie+0.017
 globe+0.017
Neg logit contributions
 unnecessary-0.015
eter-0.014
aa-0.014
please-0.014
ucker-0.014
Token puzzle
Token ID15027
Feature activation+5.52e-01

Pos logit contributions
nam+0.108
 tripod+0.094
lyn+0.093
ophone+0.093
atively+0.092
Neg logit contributions
ch-0.078
 problem-0.076
umm-0.074
ead-0.070
 trouble-0.070
Token and
Token ID290
Feature activation+4.46e-01

Pos logit contributions
nam+0.019
kins+0.018
 Marie+0.018
 tripod+0.018
atively+0.017
Neg logit contributions
ch-0.008
 trouble-0.008
eter-0.008
please-0.008
ead-0.007
Token try
Token ID1949
Feature activation-9.09e-01

Pos logit contributions
kins+0.013
nam+0.013
 globe+0.012
lf+0.012
 Marie+0.012
Neg logit contributions
please-0.009
 unnecessary-0.008
eter-0.008
ead-0.008
ch-0.008
Token to
Token ID284
Feature activation-1.88e-01

Pos logit contributions
 unnecessary+0.017
eter+0.017
please+0.017
aa+0.017
ucker+0.017
Neg logit contributions
nam-0.028
kins-0.027
lf-0.027
 Marie-0.026
 globe-0.026
Token solve
Token ID8494
Feature activation+8.49e-01

Pos logit contributions
please+0.004
 unnecessary+0.004
eter+0.004
ucker+0.004
ch+0.004
Neg logit contributions
nam-0.005
kins-0.005
lf-0.005
 Marie-0.005
 globe-0.005
Token Jack
Token ID3619
Feature activation+9.25e-01

Pos logit contributions
nam+0.012
kins+0.012
lf+0.012
 globe+0.012
 Marie+0.012
Neg logit contributions
 unnecessary-0.011
please-0.011
eter-0.011
aa-0.010
ucker-0.010
Token went
Token ID1816
Feature activation+7.98e-01

Pos logit contributions
nam+0.024
kins+0.023
lf+0.022
 Marie+0.022
 globe+0.022
Neg logit contributions
please-0.022
eter-0.022
 unnecessary-0.022
aa-0.022
ucker-0.021
Token off
Token ID572
Feature activation-5.78e-01

Pos logit contributions
nam+0.027
kins+0.026
lf+0.026
 Marie+0.025
 globe+0.025
Neg logit contributions
 unnecessary-0.013
please-0.013
aa-0.012
ch-0.012
eter-0.012
Token in
Token ID287
Feature activation+9.71e-01

Pos logit contributions
please+0.009
aa+0.008
 unnecessary+0.008
eter+0.008
ead+0.008
Neg logit contributions
nam-0.020
kins-0.020
lf-0.019
 globe-0.019
 Marie-0.019
Token the
Token ID262
Feature activation+1.64e+00

Pos logit contributions
nam+0.024
kins+0.023
lf+0.022
 Marie+0.022
 globe+0.022
Neg logit contributions
 unnecessary-0.024
eter-0.024
please-0.024
 trouble-0.023
aa-0.023
Token opposite
Token ID6697
Feature activation+3.76e+00

Pos logit contributions
nam+0.042
 Marie+0.041
lf+0.041
kins+0.041
 globe+0.040
Neg logit contributions
 unnecessary-0.039
please-0.039
ucker-0.038
eter-0.038
aa-0.038
Token direction
Token ID4571
Feature activation+1.39e+00

Pos logit contributions
nam+0.094
kins+0.090
lf+0.089
 Marie+0.086
 globe+0.086
Neg logit contributions
 unnecessary-0.093
eter-0.093
please-0.090
aa-0.089
ucker-0.089
Token.
Token ID13
Feature activation-2.65e-01

Pos logit contributions
nam+0.034
kins+0.033
lf+0.032
 Marie+0.032
 globe+0.031
Neg logit contributions
 unnecessary-0.035
please-0.035
eter-0.034
aa-0.034
ucker-0.034
Token
Token ID198
Feature activation-5.01e-01

Pos logit contributions
 unnecessary+0.006
eter+0.006
please+0.006
aa+0.006
ucker+0.006
Neg logit contributions
nam-0.007
kins-0.007
lf-0.007
 globe-0.006
 Marie-0.006
Token
Token ID198
Feature activation-2.12e-01

Pos logit contributions
 unnecessary+0.012
eter+0.012
please+0.012
aa+0.012
ucker+0.012
Neg logit contributions
nam-0.012
kins-0.012
lf-0.012
 Marie-0.012
 globe-0.011
TokenWhen
Token ID2215
Feature activation-7.92e-01

Pos logit contributions
please+0.005
 unnecessary+0.005
ch+0.005
aa+0.005
eter+0.005
Neg logit contributions
nam-0.006
kins-0.006
lf-0.005
 globe-0.005
 Marie-0.005
Token was
Token ID373
Feature activation+2.99e-01

Pos logit contributions
please+0.008
eter+0.008
 unnecessary+0.008
aa+0.008
ch+0.008
Neg logit contributions
nam-0.010
kins-0.009
lf-0.009
 Marie-0.009
 globe-0.009
Token able
Token ID1498
Feature activation-6.17e-01

Pos logit contributions
nam+0.008
kins+0.008
lf+0.008
 globe+0.008
 tripod+0.008
Neg logit contributions
please-0.007
 unnecessary-0.006
eter-0.006
ch-0.006
aa-0.006
Token to
Token ID284
Feature activation-4.19e-01

Pos logit contributions
 unnecessary+0.011
please+0.010
ucker+0.010
aa+0.010
ch+0.010
Neg logit contributions
nam-0.020
 Marie-0.019
kins-0.019
lf-0.018
 Laura-0.018
Token share
Token ID2648
Feature activation+1.83e-01

Pos logit contributions
 unnecessary+0.008
please+0.008
eter+0.008
aa+0.008
ucker+0.008
Neg logit contributions
nam-0.013
kins-0.012
lf-0.012
 Marie-0.012
 globe-0.012
Token her
Token ID607
Feature activation+8.85e-02

Pos logit contributions
nam+0.004
kins+0.004
atively+0.004
lf+0.004
 tripod+0.004
Neg logit contributions
eter-0.005
 unnecessary-0.005
 trouble-0.005
please-0.004
ch-0.004
Token talent
Token ID7401
Feature activation+3.72e+00

Pos logit contributions
nam+0.003
 tripod+0.003
atively+0.003
lf+0.003
ophone+0.003
Neg logit contributions
 trouble-0.001
 unnecessary-0.001
 fle-0.001
eter-0.001
 fever-0.001
Token with
Token ID351
Feature activation+1.05e+00

Pos logit contributions
nam+0.131
 tripod+0.121
atively+0.115
igator+0.112
ophone+0.112
Neg logit contributions
eter-0.055
ead-0.051
please-0.049
 tum-0.047
 trouble-0.047
Token the
Token ID262
Feature activation-3.36e-01

Pos logit contributions
nam+0.023
kins+0.023
lf+0.023
 globe+0.022
 tripod+0.022
Neg logit contributions
 unnecessary-0.028
eter-0.028
please-0.028
 trouble-0.026
aa-0.026
Token world
Token ID995
Feature activation+1.52e+00

Pos logit contributions
 unnecessary+0.007
 trouble+0.006
please+0.006
eter+0.006
 fle+0.006
Neg logit contributions
lf-0.010
nam-0.010
kins-0.010
atively-0.010
 tripod-0.010
Token!
Token ID0
Feature activation-1.87e-01

Pos logit contributions
nam+0.046
kins+0.044
 tripod+0.043
lf+0.042
 telescope+0.042
Neg logit contributions
eter-0.030
ch-0.029
ead-0.028
aa-0.027
please-0.027
Token<|endoftext|>
Token ID50256
Feature activation-4.73e-02

Pos logit contributions
eter+0.004
please+0.003
ch+0.003
ead+0.003
 unnecessary+0.003
Neg logit contributions
kins-0.006
nam-0.006
lf-0.006
atively-0.005
 globe-0.005
Token
Token ID220
Feature activation-3.31e-01

Pos logit contributions
nam+0.003
kins+0.003
lf+0.003
 Marie+0.003
 globe+0.003
Neg logit contributions
 unnecessary-0.002
please-0.002
eter-0.002
aa-0.002
ch-0.002
Token
Token ID198
Feature activation-1.21e-01

Pos logit contributions
eter+0.007
please+0.007
 unnecessary+0.007
ucker+0.007
aa+0.007
Neg logit contributions
nam-0.009
kins-0.009
lf-0.009
 globe-0.008
 Marie-0.008
Token
Token ID198
Feature activation+1.43e-01

Pos logit contributions
please+0.002
ch+0.002
aa+0.002
 unnecessary+0.002
eter+0.002
Neg logit contributions
nam-0.004
kins-0.004
 globe-0.003
lf-0.003
atively-0.003
TokenThe
Token ID464
Feature activation-5.19e-02

Pos logit contributions
nam+0.004
kins+0.004
 globe+0.004
atively+0.004
 telescope+0.003
Neg logit contributions
please-0.003
ch-0.003
aa-0.003
eter-0.003
 unnecessary-0.003
Token town
Token ID3240
Feature activation+1.69e+00

Pos logit contributions
 unnecessary+0.001
please+0.001
eter+0.001
ucker+0.001
aa+0.001
Neg logit contributions
nam-0.001
kins-0.001
lf-0.001
 Marie-0.001
 globe-0.001
Tokenspe
Token ID4125
Feature activation+3.70e+00

Pos logit contributions
nam+0.042
kins+0.040
 Marie+0.038
lf+0.038
 tripod+0.038
Neg logit contributions
eter-0.041
please-0.041
aa-0.040
 unnecessary-0.040
ucker-0.040
Tokenople
Token ID643
Feature activation+8.43e-01

Pos logit contributions
 tripod+0.110
kins+0.108
 telescope+0.108
 Marie+0.107
nam+0.107
Neg logit contributions
eter-0.075
please-0.068
ead-0.065
 unnecessary-0.064
ounced-0.063
Token had
Token ID550
Feature activation-5.26e-03

Pos logit contributions
nam+0.024
kins+0.023
 Marie+0.022
 globe+0.022
lf+0.022
Neg logit contributions
please-0.017
aa-0.017
eter-0.016
ch-0.016
ucker-0.015
Token to
Token ID284
Feature activation+4.93e-01

Pos logit contributions
 unnecessary+0.000
please+0.000
eter+0.000
ucker+0.000
aa+0.000
Neg logit contributions
nam-0.000
kins-0.000
lf-0.000
 Marie-0.000
atively-0.000
Token find
Token ID1064
Feature activation+1.03e-01

Pos logit contributions
nam+0.016
kins+0.016
 globe+0.015
lf+0.015
 Marie+0.015
Neg logit contributions
 unnecessary-0.008
please-0.008
 trouble-0.008
eter-0.008
aa-0.007
Token a
Token ID257
Feature activation-9.31e-01

Pos logit contributions
nam+0.003
kins+0.003
lf+0.003
 Marie+0.002
 globe+0.002
Neg logit contributions
 unnecessary-0.003
eter-0.002
please-0.002
aa-0.002
ch-0.002
TokenMaria
Token ID46827
Feature activation+1.06e+00

Pos logit contributions
nam+0.002
ophone+0.002
kin+0.002
kins+0.002
 globe+0.002
Neg logit contributions
please-0.002
ch-0.002
 sudden-0.002
eter-0.002
 unnecessary-0.002
Token ran
Token ID4966
Feature activation+1.08e+00

Pos logit contributions
nam+0.028
kins+0.026
 globe+0.025
lf+0.024
 tripod+0.024
Neg logit contributions
eter-0.026
aa-0.026
please-0.026
ucker-0.025
ch-0.025
Token over
Token ID625
Feature activation+7.76e-01

Pos logit contributions
nam+0.034
kins+0.034
lf+0.032
 globe+0.031
 Marie+0.031
Neg logit contributions
 unnecessary-0.020
aa-0.020
please-0.020
ch-0.020
eter-0.020
Token to
Token ID284
Feature activation+1.74e-01

Pos logit contributions
nam+0.026
kins+0.025
lf+0.025
 Marie+0.024
 globe+0.024
Neg logit contributions
 unnecessary-0.013
please-0.012
eter-0.012
aa-0.012
ch-0.012
Token the
Token ID262
Feature activation+9.68e-01

Pos logit contributions
kins+0.005
lf+0.005
nam+0.005
 globe+0.005
book+0.005
Neg logit contributions
please-0.003
aa-0.003
ead-0.003
 unnecessary-0.003
 trouble-0.003
Token group
Token ID1448
Feature activation+3.70e+00

Pos logit contributions
nam+0.025
kins+0.024
lf+0.024
 Marie+0.023
 globe+0.023
Neg logit contributions
 unnecessary-0.023
please-0.023
eter-0.023
aa-0.023
ucker-0.022
Token and
Token ID290
Feature activation+7.64e-01

Pos logit contributions
nam+0.136
kins+0.128
 globe+0.126
lf+0.120
 tripod+0.119
Neg logit contributions
ead-0.052
aa-0.050
eter-0.048
ch-0.047
 trouble-0.043
Token admired
Token ID29382
Feature activation+5.71e-01

Pos logit contributions
nam+0.019
kins+0.019
lf+0.018
 globe+0.018
 Marie+0.017
Neg logit contributions
please-0.019
 unnecessary-0.019
aa-0.019
eter-0.018
ucker-0.018
Token their
Token ID511
Feature activation-2.85e-01

Pos logit contributions
kins+0.013
lf+0.012
nam+0.012
 tripod+0.012
 globe+0.012
Neg logit contributions
 unnecessary-0.015
eter-0.015
please-0.015
aa-0.014
ucker-0.014
Token fair
Token ID3148
Feature activation+1.24e+00

Pos logit contributions
please+0.007
 unnecessary+0.007
ucker+0.007
aa+0.007
ch+0.007
Neg logit contributions
kins-0.007
nam-0.007
 Marie-0.007
lf-0.007
 globe-0.007
Token play
Token ID711
Feature activation+4.09e-01

Pos logit contributions
nam+0.036
kins+0.036
lf+0.035
 Marie+0.034
 globe+0.034
Neg logit contributions
 unnecessary-0.026
please-0.025
eter-0.025
aa-0.024
ucker-0.024
Token can
Token ID460
Feature activation+1.16e+00

Pos logit contributions
nam+0.004
lf+0.004
kins+0.004
 Marie+0.004
 globe+0.004
Neg logit contributions
 unnecessary-0.003
please-0.003
eter-0.003
ucker-0.003
aa-0.003
Token I
Token ID314
Feature activation-4.49e-01

Pos logit contributions
nam+0.030
kins+0.029
lf+0.028
 Marie+0.028
 globe+0.027
Neg logit contributions
 unnecessary-0.028
please-0.027
eter-0.027
aa-0.027
ucker-0.027
Token sit
Token ID1650
Feature activation-2.17e+00

Pos logit contributions
 unnecessary+0.013
ucker+0.012
eter+0.012
ounced+0.012
please+0.011
Neg logit contributions
igator-0.010
lf-0.010
nam-0.009
nob-0.009
 telescope-0.009
Token in
Token ID287
Feature activation+3.94e-01

Pos logit contributions
 unnecessary+0.064
eter+0.061
aa+0.061
ucker+0.061
please+0.060
Neg logit contributions
nam-0.047
kins-0.045
lf-0.043
 globe-0.041
 Marie-0.040
Token the
Token ID262
Feature activation+1.68e+00

Pos logit contributions
kins+0.012
atively+0.012
nam+0.011
 Marie+0.011
 tripod+0.011
Neg logit contributions
ch-0.007
 trouble-0.007
eter-0.007
please-0.007
 unnecessary-0.007
Token new
Token ID649
Feature activation+3.69e+00

Pos logit contributions
kins+0.040
nam+0.037
 Marie+0.035
lf+0.034
 tripod+0.034
Neg logit contributions
ch-0.048
ead-0.045
 fever-0.045
 tum-0.044
 unnecessary-0.044
Token chair
Token ID5118
Feature activation-3.45e-01

Pos logit contributions
kins+0.075
nam+0.074
 Marie+0.069
 tripod+0.069
atively+0.067
Neg logit contributions
eter-0.107
ch-0.106
please-0.104
 trouble-0.103
ead-0.103
Token?"
Token ID1701
Feature activation-6.90e-01

Pos logit contributions
 unnecessary+0.009
please+0.009
eter+0.008
aa+0.008
ucker+0.008
Neg logit contributions
nam-0.008
lf-0.008
kins-0.008
 Marie-0.008
 globe-0.008
Token Tim
Token ID5045
Feature activation+1.01e+00

Pos logit contributions
 unnecessary+0.013
please+0.013
eter+0.012
aa+0.012
ucker+0.012
Neg logit contributions
nam-0.022
kins-0.021
lf-0.021
 Marie-0.020
 globe-0.020
Tokenmy
Token ID1820
Feature activation+1.51e+00

Pos logit contributions
nam+0.029
kins+0.028
 Marie+0.027
lf+0.027
 globe+0.027
Neg logit contributions
please-0.022
eter-0.021
aa-0.021
ucker-0.021
 unnecessary-0.021
Token asked
Token ID1965
Feature activation+4.94e-01

Pos logit contributions
 globe+0.054
kins+0.053
atively+0.052
nam+0.051
 Marie+0.050
Neg logit contributions
please-0.024
aa-0.021
ucker-0.020
eter-0.020
ead-0.020
Token anymore
Token ID7471
Feature activation+6.69e-01

Pos logit contributions
nam+0.019
kins+0.019
lf+0.018
 Marie+0.018
 tripod+0.018
Neg logit contributions
 unnecessary-0.007
eter-0.007
please-0.007
aa-0.006
ucker-0.006
Token.
Token ID13
Feature activation+2.74e-01

Pos logit contributions
nam+0.017
kins+0.016
 Marie+0.015
lf+0.015
atively+0.015
Neg logit contributions
 unnecessary-0.016
please-0.016
eter-0.016
aa-0.016
ucker-0.016
Token They
Token ID1119
Feature activation+1.17e-01

Pos logit contributions
nam+0.007
kins+0.007
lf+0.007
 Marie+0.007
atively+0.007
Neg logit contributions
 unnecessary-0.006
eter-0.006
please-0.006
ucker-0.006
aa-0.006
Token sat
Token ID3332
Feature activation-1.61e+00

Pos logit contributions
nam+0.003
kins+0.003
lf+0.003
 Marie+0.003
atively+0.002
Neg logit contributions
 unnecessary-0.003
eter-0.003
please-0.003
ucker-0.003
aa-0.003
Token on
Token ID319
Feature activation+1.49e-01

Pos logit contributions
 unnecessary+0.039
please+0.039
eter+0.038
aa+0.038
ucker+0.038
Neg logit contributions
nam-0.041
kins-0.039
lf-0.039
 Marie-0.039
 globe-0.037
Token opposite
Token ID6697
Feature activation+3.68e+00

Pos logit contributions
nam+0.004
kins+0.004
lf+0.004
 Marie+0.004
atively+0.004
Neg logit contributions
 unnecessary-0.004
eter-0.003
please-0.003
ch-0.003
 trouble-0.003
Token sides
Token ID5389
Feature activation+2.02e+00

Pos logit contributions
nam+0.109
 tripod+0.095
 globe+0.094
ophone+0.093
kins+0.093
Neg logit contributions
 trouble-0.080
eter-0.076
 tum-0.074
ch-0.071
 unnecessary-0.071
Token of
Token ID286
Feature activation+3.75e-01

Pos logit contributions
nam+0.078
kins+0.072
lf+0.071
 tripod+0.071
 Marie+0.070
Neg logit contributions
ch-0.025
eter-0.025
aa-0.025
ucker-0.024
 trouble-0.023
Token the
Token ID262
Feature activation+1.10e+00

Pos logit contributions
kins+0.011
ophone+0.011
nam+0.011
atively+0.011
book+0.010
Neg logit contributions
 trouble-0.007
 unnecessary-0.007
 fle-0.007
please-0.007
eter-0.007
Token room
Token ID2119
Feature activation+4.85e-01

Pos logit contributions
nam+0.030
kins+0.030
 Marie+0.029
atively+0.029
fully+0.027
Neg logit contributions
 trouble-0.026
ch-0.025
aa-0.023
 fever-0.023
 tum-0.022
Token and
Token ID290
Feature activation-1.50e-01

Pos logit contributions
nam+0.016
 tripod+0.014
igator+0.014
 telescope+0.014
ophone+0.014
Neg logit contributions
eter-0.009
 trouble-0.009
 tum-0.008
ead-0.008
 injuries-0.008
Token breeze
Token ID28633
Feature activation+4.45e-01

Pos logit contributions
ucker+0.003
eter+0.003
aa+0.003
 unnecessary+0.003
please+0.003
Neg logit contributions
nam-0.004
lf-0.004
kins-0.004
 globe-0.004
 tripod-0.004
Token blew
Token ID17948
Feature activation-2.39e-02

Pos logit contributions
kins+0.012
nam+0.012
lf+0.011
 Marie+0.011
atively+0.011
Neg logit contributions
 unnecessary-0.011
please-0.010
eter-0.010
ucker-0.010
aa-0.010
Token her
Token ID607
Feature activation-7.57e-01

Pos logit contributions
 unnecessary+0.001
please+0.001
eter+0.001
ucker+0.001
aa+0.001
Neg logit contributions
nam-0.001
kins-0.001
 Marie-0.001
lf-0.001
 globe-0.000
Token in
Token ID287
Feature activation+7.24e-01

Pos logit contributions
 unnecessary+0.017
please+0.017
eter+0.017
aa+0.017
ucker+0.017
Neg logit contributions
nam-0.021
kins-0.020
lf-0.020
 Marie-0.019
 globe-0.019
Token the
Token ID262
Feature activation+1.90e-01

Pos logit contributions
kins+0.018
nam+0.018
lf+0.018
ophone+0.017
 tripod+0.017
Neg logit contributions
 trouble-0.018
eter-0.018
 unnecessary-0.018
please-0.017
 fever-0.017
Token opposite
Token ID6697
Feature activation+3.65e+00

Pos logit contributions
nam+0.005
 globe+0.005
 telescope+0.004
 tripod+0.004
 Marie+0.004
Neg logit contributions
ucker-0.005
please-0.005
eter-0.005
aa-0.004
 unnecessary-0.004
Token direction
Token ID4571
Feature activation+1.02e+00

Pos logit contributions
nam+0.098
kins+0.094
lf+0.093
 Marie+0.091
 globe+0.089
Neg logit contributions
 unnecessary-0.085
eter-0.084
please-0.082
aa-0.082
ch-0.081
Token of
Token ID286
Feature activation-1.64e-01

Pos logit contributions
nam+0.026
kins+0.026
lf+0.025
 Marie+0.024
 globe+0.024
Neg logit contributions
 unnecessary-0.024
eter-0.024
please-0.024
aa-0.024
ch-0.023
Token the
Token ID262
Feature activation-2.66e-01

Pos logit contributions
 unnecessary+0.004
please+0.004
eter+0.004
ucker+0.004
aa+0.004
Neg logit contributions
nam-0.004
kins-0.004
lf-0.004
 Marie-0.003
 globe-0.003
Token cat
Token ID3797
Feature activation-2.44e-02

Pos logit contributions
ucker+0.007
eter+0.006
aa+0.006
please+0.006
ounced+0.006
Neg logit contributions
lf-0.007
nam-0.007
 globe-0.007
 tripod-0.007
 keyboard-0.006
Token.
Token ID13
Feature activation-7.88e-01

Pos logit contributions
 unnecessary+0.001
please+0.001
eter+0.001
ch+0.001
aa+0.001
Neg logit contributions
nam-0.001
kins-0.001
lf-0.001
 Marie-0.000
 globe-0.000
Token
Token ID198
Feature activation+3.45e-01

Pos logit contributions
nam+0.003
kins+0.003
lf+0.003
 Marie+0.003
 globe+0.003
Neg logit contributions
 unnecessary-0.003
eter-0.003
please-0.003
aa-0.003
ucker-0.003
TokenThe
Token ID464
Feature activation-2.08e-01

Pos logit contributions
nam+0.008
kins+0.007
lf+0.007
 Marie+0.007
 globe+0.007
Neg logit contributions
 unnecessary-0.010
please-0.010
eter-0.009
aa-0.009
ch-0.009
Token cat
Token ID3797
Feature activation-6.46e-02

Pos logit contributions
ucker+0.005
eter+0.005
aa+0.005
 unnecessary+0.005
please+0.005
Neg logit contributions
nam-0.005
lf-0.005
kins-0.005
 globe-0.005
 Marie-0.005
Token grinned
Token ID36268
Feature activation+7.87e-01

Pos logit contributions
 unnecessary+0.002
please+0.002
eter+0.002
aa+0.002
ucker+0.002
Neg logit contributions
nam-0.002
kins-0.002
lf-0.002
 Marie-0.002
 globe-0.001
Token mis
Token ID2984
Feature activation-1.61e-01

Pos logit contributions
nam+0.018
kins+0.018
lf+0.017
 Marie+0.017
atively+0.017
Neg logit contributions
 unnecessary-0.021
please-0.021
eter-0.020
aa-0.020
ucker-0.020
Tokenchie
Token ID3043
Feature activation+3.65e+00

Pos logit contributions
 unnecessary+0.005
ucker+0.005
please+0.005
 fever+0.004
eter+0.004
Neg logit contributions
nam-0.003
kins-0.003
 Marie-0.003
lf-0.003
igator-0.003
Tokenv
Token ID85
Feature activation+5.75e-01

Pos logit contributions
nam+0.135
kins+0.126
lf+0.122
 telescope+0.121
 globe+0.120
Neg logit contributions
ch-0.063
aa-0.060
please-0.057
eter-0.055
 unnecessary-0.053
Tokenously
Token ID3481
Feature activation+4.65e-01

Pos logit contributions
nam+0.018
 tripod+0.015
 Marie+0.015
 telescope+0.015
lf+0.014
Neg logit contributions
eter-0.013
aa-0.013
ucker-0.013
 trouble-0.012
ch-0.012
Token.
Token ID13
Feature activation+2.18e-01

Pos logit contributions
nam+0.012
kins+0.011
lf+0.011
 Marie+0.011
 globe+0.011
Neg logit contributions
 unnecessary-0.012
please-0.011
eter-0.011
aa-0.011
ucker-0.011
Token "
Token ID366
Feature activation+9.78e-02

Pos logit contributions
nam+0.006
kins+0.006
lf+0.005
 globe+0.005
 Marie+0.005
Neg logit contributions
 unnecessary-0.005
eter-0.005
please-0.005
ucker-0.005
aa-0.005
TokenNever
Token ID12295
Feature activation-1.86e-01

Pos logit contributions
nam+0.003
kins+0.003
lf+0.003
 Marie+0.003
 globe+0.003
Neg logit contributions
please-0.002
 unnecessary-0.002
eter-0.002
ch-0.002
aa-0.002
Token y
Token ID331
Feature activation-1.88e+00

Pos logit contributions
 unnecessary+0.005
eter+0.005
please+0.005
aa+0.005
ch+0.005
Neg logit contributions
nam-0.005
kins-0.005
lf-0.005
 Marie-0.005
 globe-0.005
Tokenummy
Token ID13513
Feature activation-6.65e-01

Pos logit contributions
 unnecessary+0.042
please+0.040
ch+0.037
 disease+0.035
 trouble+0.034
Neg logit contributions
kins-0.053
nam-0.047
 Marie-0.046
atively-0.046
book-0.046
Token and
Token ID290
Feature activation-2.19e-01

Pos logit contributions
 unnecessary+0.020
please+0.020
aa+0.019
ucker+0.019
ch+0.019
Neg logit contributions
kins-0.013
nam-0.013
 Marie-0.012
lf-0.012
 globe-0.012
Token made
Token ID925
Feature activation+8.18e-02

Pos logit contributions
 unnecessary+0.005
eter+0.005
please+0.005
aa+0.005
ucker+0.005
Neg logit contributions
nam-0.006
kins-0.006
lf-0.006
 Marie-0.005
 globe-0.005
Token her
Token ID607
Feature activation-4.54e-01

Pos logit contributions
kins+0.002
 Marie+0.002
nam+0.002
 Laura+0.001
 Becky+0.001
Neg logit contributions
please-0.002
 unnecessary-0.002
aa-0.002
ounced-0.002
eter-0.002
Token throat
Token ID13589
Feature activation-3.91e+00

Pos logit contributions
 unnecessary+0.011
please+0.011
ch+0.011
aa+0.011
ucker+0.010
Neg logit contributions
nam-0.012
kins-0.012
 Marie-0.011
 globe-0.011
lf-0.011
Token feel
Token ID1254
Feature activation-1.62e+00

Pos logit contributions
 unnecessary+0.116
please+0.112
aa+0.110
eter+0.107
ead+0.107
Neg logit contributions
kins-0.084
nam-0.076
atively-0.074
book-0.071
kin-0.071
Token better
Token ID1365
Feature activation-1.14e+00

Pos logit contributions
aa+0.042
eter+0.041
ounced+0.041
please+0.041
ead+0.041
Neg logit contributions
kins-0.039
nam-0.037
 Marie-0.036
kin-0.034
igator-0.034
Token.
Token ID13
Feature activation+9.70e-03

Pos logit contributions
 unnecessary+0.033
please+0.032
eter+0.032
aa+0.032
ucker+0.032
Neg logit contributions
nam-0.024
kins-0.024
lf-0.022
 Marie-0.022
 globe-0.022
Token She
Token ID1375
Feature activation-4.42e-02

Pos logit contributions
nam+0.000
kins+0.000
lf+0.000
 Marie+0.000
 globe+0.000
Neg logit contributions
 unnecessary-0.000
please-0.000
eter-0.000
aa-0.000
ucker-0.000
Token hugged
Token ID41130
Feature activation+5.45e-02

Pos logit contributions
 unnecessary+0.001
please+0.001
eter+0.001
aa+0.001
ucker+0.001
Neg logit contributions
nam-0.001
kins-0.001
lf-0.001
 Marie-0.001
 globe-0.001
Token -
Token ID532
Feature activation-1.81e-01

Pos logit contributions
 unnecessary+0.034
please+0.033
eter+0.033
aa+0.033
ucker+0.033
Neg logit contributions
nam-0.030
kins-0.029
lf-0.029
 Marie-0.028
 globe-0.027
Token it
Token ID340
Feature activation+4.07e-01

Pos logit contributions
 unnecessary+0.005
please+0.005
eter+0.005
aa+0.005
ucker+0.005
Neg logit contributions
nam-0.004
kins-0.004
lf-0.004
 Marie-0.004
 globe-0.004
Token was
Token ID373
Feature activation+2.86e-01

Pos logit contributions
nam+0.011
kins+0.011
lf+0.011
 Marie+0.011
 tripod+0.010
Neg logit contributions
please-0.009
ch-0.009
 unnecessary-0.009
ucker-0.008
eter-0.008
Token bigger
Token ID5749
Feature activation+7.23e-01

Pos logit contributions
nam+0.005
kins+0.005
lf+0.005
 Marie+0.005
book+0.005
Neg logit contributions
 unnecessary-0.010
aa-0.009
eter-0.009
please-0.009
ead-0.009
Token and
Token ID290
Feature activation+5.69e-01

Pos logit contributions
kins+0.015
lf+0.014
 globe+0.014
nam+0.013
 Marie+0.013
Neg logit contributions
 unnecessary-0.022
please-0.021
aa-0.021
ucker-0.020
ch-0.020
Token ju
Token ID7544
Feature activation-3.91e+00

Pos logit contributions
nam+0.016
kins+0.015
lf+0.014
 Marie+0.014
 globe+0.014
Neg logit contributions
 unnecessary-0.013
aa-0.013
eter-0.013
ucker-0.013
please-0.013
Tokenic
Token ID291
Feature activation-4.73e-01

Pos logit contributions
 unnecessary+0.080
please+0.074
ch+0.073
aa+0.072
 trouble+0.070
Neg logit contributions
kins-0.116
lf-0.111
nam-0.108
atively-0.106
 globe-0.103
Tokenier
Token ID959
Feature activation+3.02e-01

Pos logit contributions
aa+0.010
 unnecessary+0.010
please+0.010
ch+0.010
eter+0.009
Neg logit contributions
nam-0.014
 Marie-0.013
lf-0.013
kins-0.013
 tripod-0.013
Token than
Token ID621
Feature activation-1.36e-02

Pos logit contributions
kins+0.007
nam+0.006
lf+0.006
 globe+0.006
 Marie+0.006
Neg logit contributions
 unnecessary-0.009
ch-0.008
aa-0.008
please-0.008
ucker-0.008
Token any
Token ID597
Feature activation+4.48e-01

Pos logit contributions
 unnecessary+0.000
please+0.000
eter+0.000
aa+0.000
ucker+0.000
Neg logit contributions
nam-0.000
kins-0.000
lf-0.000
 Marie-0.000
 globe-0.000
Token he
Token ID339
Feature activation-3.03e-01

Pos logit contributions
igator+0.008
kins+0.008
nam+0.008
 Marie+0.007
lf+0.007
Neg logit contributions
ucker-0.014
aa-0.014
ead-0.014
eter-0.014
ounced-0.013
Token smells
Token ID25760
Feature activation-2.91e+00

Pos logit contributions
kins+0.006
igator+0.006
 globe+0.006
lf+0.006
nam+0.005
Neg logit contributions
 unnecessary-0.007
please-0.006
eter-0.006
aa-0.006
ead-0.006
Token bad
Token ID2089
Feature activation-7.56e-01

Pos logit contributions
eter+0.087
please+0.085
ounced+0.081
ucker+0.080
ead+0.080
Neg logit contributions
kins-0.068
book-0.061
nam-0.059
kin-0.058
igator-0.057
Token and
Token ID290
Feature activation-9.73e-01

Pos logit contributions
 unnecessary+0.022
please+0.022
eter+0.021
ucker+0.021
aa+0.021
Neg logit contributions
kins-0.015
nam-0.015
lf-0.015
 Marie-0.015
 globe-0.014
Token makes
Token ID1838
Feature activation+2.64e-01

Pos logit contributions
please+0.023
eter+0.022
aa+0.022
ucker+0.021
 unnecessary+0.021
Neg logit contributions
nam-0.027
kins-0.025
 Marie-0.024
igator-0.024
lf-0.023
Token him
Token ID683
Feature activation-4.14e-01

Pos logit contributions
 Marie+0.005
lf+0.005
 Laura+0.005
kins+0.005
nam+0.005
Neg logit contributions
please-0.008
ucker-0.008
eter-0.007
ounced-0.007
 unnecessary-0.007
Token cough
Token ID22094
Feature activation-3.74e+00

Pos logit contributions
please+0.008
 unnecessary+0.008
eter+0.008
ucker+0.008
aa+0.008
Neg logit contributions
nam-0.012
kins-0.012
igator-0.011
 Marie-0.011
lf-0.011
Token.
Token ID13
Feature activation-3.28e-01

Pos logit contributions
 unnecessary+0.122
please+0.122
eter+0.121
ucker+0.118
ounced+0.117
Neg logit contributions
kins-0.060
nam-0.053
book-0.053
atively-0.052
 globe-0.051
Token But
Token ID887
Feature activation-1.11e+00

Pos logit contributions
 unnecessary+0.009
please+0.009
eter+0.009
aa+0.009
ucker+0.009
Neg logit contributions
 Marie-0.007
nam-0.007
kins-0.007
lf-0.007
atively-0.007
Token he
Token ID339
Feature activation-4.42e-01

Pos logit contributions
 unnecessary+0.031
please+0.030
eter+0.030
aa+0.030
ucker+0.030
Neg logit contributions
nam-0.025
kins-0.024
lf-0.024
 Marie-0.024
 globe-0.022
Token likes
Token ID7832
Feature activation+2.25e-01

Pos logit contributions
 unnecessary+0.013
please+0.013
eter+0.013
aa+0.012
ead+0.012
Neg logit contributions
kins-0.009
 Marie-0.009
lf-0.009
igator-0.008
 globe-0.008
Token the
Token ID262
Feature activation-7.19e-01

Pos logit contributions
nam+0.004
lf+0.004
 Marie+0.004
kins+0.004
 Laura+0.004
Neg logit contributions
eter-0.007
 unnecessary-0.007
ucker-0.007
aa-0.006
umm-0.006
Token good
Token ID922
Feature activation-2.67e-01

Pos logit contributions
aa+0.010
 unnecessary+0.010
eter+0.009
please+0.009
ead+0.009
Neg logit contributions
nam-0.008
kins-0.008
 Marie-0.008
lf-0.008
book-0.008
Token.
Token ID13
Feature activation+2.05e-01

Pos logit contributions
 unnecessary+0.008
please+0.007
aa+0.007
eter+0.007
ch+0.007
Neg logit contributions
kins-0.006
 Marie-0.006
lf-0.006
book-0.006
fully-0.005
Token It
Token ID632
Feature activation+6.43e-01

Pos logit contributions
nam+0.005
kins+0.005
lf+0.005
 Marie+0.005
 globe+0.005
Neg logit contributions
 unnecessary-0.005
please-0.005
eter-0.005
aa-0.005
ucker-0.005
Token made
Token ID925
Feature activation+7.56e-01

Pos logit contributions
nam+0.016
kins+0.016
lf+0.016
 Marie+0.015
 globe+0.015
Neg logit contributions
 unnecessary-0.015
please-0.015
eter-0.015
aa-0.015
ucker-0.015
Token her
Token ID607
Feature activation-6.18e-01

Pos logit contributions
 Marie+0.011
nam+0.010
kins+0.009
 Becky+0.009
lf+0.009
Neg logit contributions
please-0.028
 unnecessary-0.028
aa-0.027
ounced-0.027
eter-0.027
Token throat
Token ID13589
Feature activation-3.65e+00

Pos logit contributions
 unnecessary+0.015
please+0.015
aa+0.014
ch+0.014
ounced+0.014
Neg logit contributions
nam-0.016
kins-0.016
 Marie-0.015
 globe-0.015
 Laura-0.015
Token feel
Token ID1254
Feature activation-1.11e+00

Pos logit contributions
 unnecessary+0.097
please+0.092
aa+0.089
eter+0.088
ounced+0.087
Neg logit contributions
kins-0.088
nam-0.080
atively-0.079
book-0.079
 Marie-0.079
Token less
Token ID1342
Feature activation-2.17e-02

Pos logit contributions
aa+0.028
eter+0.028
ounced+0.027
please+0.027
 unnecessary+0.027
Neg logit contributions
kins-0.027
nam-0.026
 Marie-0.026
igator-0.024
book-0.024
Token sore
Token ID19597
Feature activation-2.12e+00

Pos logit contributions
 unnecessary+0.000
aa+0.000
ch+0.000
please+0.000
eter+0.000
Neg logit contributions
kins-0.001
nam-0.001
 Marie-0.001
book-0.001
kin-0.001
Token.
Token ID13
Feature activation+1.29e-01

Pos logit contributions
 unnecessary+0.054
please+0.053
eter+0.053
aa+0.052
ch+0.051
Neg logit contributions
nam-0.052
kins-0.050
lf-0.050
 Marie-0.048
 globe-0.047
Token She
Token ID1375
Feature activation+1.70e-01

Pos logit contributions
nam+0.003
kins+0.003
 Marie+0.003
lf+0.003
 globe+0.003
Neg logit contributions
 unnecessary-0.003
please-0.003
eter-0.003
aa-0.003
ucker-0.003
Token tongue
Token ID11880
Feature activation-3.35e+00

Pos logit contributions
please+0.014
 unnecessary+0.014
ch+0.013
aa+0.013
ounced+0.013
Neg logit contributions
nam-0.015
kins-0.015
 Marie-0.014
 globe-0.013
igator-0.013
Token feel
Token ID1254
Feature activation-1.17e+00

Pos logit contributions
 unnecessary+0.082
please+0.082
eter+0.078
aa+0.077
ead+0.074
Neg logit contributions
kins-0.085
nam-0.084
 globe-0.073
lf-0.073
kin-0.072
Token dry
Token ID5894
Feature activation-1.01e+00

Pos logit contributions
eter+0.029
please+0.029
aa+0.029
ounced+0.028
ead+0.028
Neg logit contributions
kins-0.029
nam-0.028
 Marie-0.027
kin-0.026
igator-0.026
Token and
Token ID290
Feature activation-2.06e-01

Pos logit contributions
eter+0.023
please+0.022
 unnecessary+0.021
ch+0.021
ead+0.021
Neg logit contributions
nam-0.028
lf-0.027
kins-0.026
 Marie-0.026
 tripod-0.026
Token her
Token ID607
Feature activation-9.89e-01

Pos logit contributions
 unnecessary+0.005
please+0.005
aa+0.005
eter+0.005
ch+0.005
Neg logit contributions
nam-0.006
kins-0.005
 Marie-0.005
lf-0.005
 globe-0.005
Token throat
Token ID13589
Feature activation-3.62e+00

Pos logit contributions
please+0.024
 unnecessary+0.023
aa+0.023
eter+0.022
ch+0.022
Neg logit contributions
kins-0.026
nam-0.025
 Marie-0.024
 globe-0.024
lf-0.024
Token hurt
Token ID5938
Feature activation-7.58e-01

Pos logit contributions
 unnecessary+0.096
please+0.095
eter+0.092
aa+0.091
ounced+0.090
Neg logit contributions
kins-0.087
kin-0.076
book-0.075
nam-0.074
atively-0.072
Token.
Token ID13
Feature activation-2.42e-01

Pos logit contributions
 unnecessary+0.022
please+0.021
eter+0.021
ucker+0.021
aa+0.021
Neg logit contributions
nam-0.016
kins-0.016
lf-0.015
 Marie-0.015
 globe-0.014
Token
Token ID198
Feature activation-3.10e-01

Pos logit contributions
 unnecessary+0.007
please+0.007
eter+0.007
aa+0.007
ch+0.006
Neg logit contributions
nam-0.005
 Marie-0.005
kins-0.005
lf-0.005
book-0.005
Token
Token ID198
Feature activation+5.39e-01

Pos logit contributions
 unnecessary+0.009
eter+0.009
please+0.009
aa+0.008
ucker+0.008
Neg logit contributions
lf-0.006
kins-0.006
nam-0.006
 Marie-0.006
 globe-0.006
TokenL
Token ID43
Feature activation-1.58e-01

Pos logit contributions
nam+0.015
 globe+0.015
atively+0.015
kins+0.015
 Marie+0.015
Neg logit contributions
ch-0.011
please-0.011
aa-0.011
 unnecessary-0.011
ucker-0.011
TokenM
Token ID44
Feature activation-1.66e+00

Pos logit contributions
 unnecessary+0.004
please+0.004
eter+0.004
aa+0.004
ch+0.004
Neg logit contributions
nam-0.006
kins-0.006
lf-0.006
 Marie-0.006
atively-0.006
Tokenum
Token ID388
Feature activation-1.54e+00

Pos logit contributions
 unnecessary+0.042
eter+0.040
please+0.039
ead+0.038
ch+0.037
Neg logit contributions
lf-0.043
kins-0.043
 Marie-0.042
nam-0.041
 Laura-0.040
Token and
Token ID290
Feature activation-5.72e-01

Pos logit contributions
please+0.037
 unnecessary+0.037
aa+0.037
eter+0.037
ucker+0.036
Neg logit contributions
nam-0.039
kins-0.037
lf-0.037
 Marie-0.036
 globe-0.035
Token Dad
Token ID17415
Feature activation-8.48e-01

Pos logit contributions
 unnecessary+0.014
please+0.014
eter+0.014
aa+0.014
ucker+0.014
Neg logit contributions
nam-0.014
kins-0.014
lf-0.014
 Marie-0.013
 globe-0.013
Token were
Token ID547
Feature activation+4.85e-03

Pos logit contributions
 unnecessary+0.023
eter+0.021
please+0.021
ucker+0.021
ch+0.020
Neg logit contributions
nam-0.020
kins-0.020
lf-0.019
 Marie-0.019
 globe-0.019
Token c
Token ID269
Feature activation-3.62e+00

Pos logit contributions
nam+0.000
kins+0.000
lf+0.000
 Marie+0.000
 globe+0.000
Neg logit contributions
 unnecessary-0.000
please-0.000
eter-0.000
aa-0.000
ucker-0.000
Tokenudd
Token ID4185
Feature activation-6.52e-01

Pos logit contributions
 unnecessary+0.075
 trouble+0.071
 disease+0.062
eter+0.062
ch+0.061
Neg logit contributions
kins-0.107
lf-0.105
nam-0.101
atively-0.098
ophone-0.095
Tokenling
Token ID1359
Feature activation-2.49e-01

Pos logit contributions
 unnecessary+0.015
please+0.015
eter+0.015
ucker+0.014
aa+0.014
Neg logit contributions
nam-0.017
kins-0.017
lf-0.017
 Marie-0.016
 globe-0.016
Token on
Token ID319
Feature activation+4.88e-02

Pos logit contributions
eter+0.006
please+0.006
 unnecessary+0.006
aa+0.006
ch+0.006
Neg logit contributions
nam-0.006
kins-0.006
lf-0.006
 Marie-0.006
 globe-0.006
Token the
Token ID262
Feature activation+4.44e-01

Pos logit contributions
nam+0.001
lf+0.001
kins+0.001
 Marie+0.001
 globe+0.001
Neg logit contributions
 unnecessary-0.001
please-0.001
ucker-0.001
eter-0.001
aa-0.001
Token arm
Token ID3211
Feature activation-2.70e-01

Pos logit contributions
kins+0.011
nam+0.011
 Marie+0.010
atively+0.010
book+0.010
Neg logit contributions
ch-0.011
 unnecessary-0.011
please-0.011
aa-0.011
 fever-0.010
Token air
Token ID1633
Feature activation-5.67e-01

Pos logit contributions
nam+0.035
kins+0.034
lf+0.034
 Marie+0.033
 globe+0.033
Neg logit contributions
 unnecessary-0.029
please-0.028
eter-0.028
aa-0.028
ucker-0.028
Token.
Token ID13
Feature activation+4.01e-01

Pos logit contributions
eter+0.014
please+0.014
 unnecessary+0.014
aa+0.014
ucker+0.014
Neg logit contributions
nam-0.014
kins-0.013
lf-0.013
 Marie-0.013
 globe-0.012
Token It
Token ID632
Feature activation+3.47e-01

Pos logit contributions
kins+0.011
nam+0.010
lf+0.010
 Marie+0.010
 globe+0.010
Neg logit contributions
 unnecessary-0.009
please-0.009
eter-0.009
aa-0.009
ch-0.009
Token made
Token ID925
Feature activation+8.66e-01

Pos logit contributions
nam+0.008
kins+0.008
lf+0.008
 Marie+0.008
 globe+0.008
Neg logit contributions
 unnecessary-0.009
please-0.009
eter-0.009
aa-0.008
ucker-0.008
Token her
Token ID607
Feature activation-5.71e-01

Pos logit contributions
nam+0.015
 Marie+0.015
lf+0.014
 Anne+0.014
kins+0.014
Neg logit contributions
please-0.028
ucker-0.028
eter-0.028
 unnecessary-0.028
aa-0.027
Token throat
Token ID13589
Feature activation-3.61e+00

Pos logit contributions
please+0.013
 unnecessary+0.012
ucker+0.012
ch+0.012
aa+0.012
Neg logit contributions
nam-0.016
kins-0.015
 Marie-0.015
lf-0.014
 globe-0.014
Token hurt
Token ID5938
Feature activation-3.55e-01

Pos logit contributions
please+0.090
 unnecessary+0.087
eter+0.084
aa+0.083
ucker+0.082
Neg logit contributions
kins-0.088
nam-0.086
atively-0.084
 Marie-0.082
lf-0.082
Token and
Token ID290
Feature activation-3.06e-01

Pos logit contributions
 unnecessary+0.009
eter+0.008
please+0.008
ch+0.008
aa+0.008
Neg logit contributions
nam-0.009
kins-0.009
lf-0.009
 tripod-0.009
 Marie-0.008
Token her
Token ID607
Feature activation-8.77e-01

Pos logit contributions
please+0.008
eter+0.008
 unnecessary+0.007
aa+0.007
ucker+0.007
Neg logit contributions
nam-0.008
kins-0.007
lf-0.007
 Marie-0.007
atively-0.007
Token eyes
Token ID2951
Feature activation-1.94e+00

Pos logit contributions
please+0.022
 unnecessary+0.021
ucker+0.021
ch+0.021
eter+0.020
Neg logit contributions
kins-0.022
nam-0.022
lf-0.021
 Marie-0.021
 globe-0.021
Token sting
Token ID24276
Feature activation-1.32e+00

Pos logit contributions
 unnecessary+0.047
please+0.047
ucker+0.046
eter+0.045
aa+0.045
Neg logit contributions
nam-0.048
kins-0.048
lf-0.046
atively-0.046
 Marie-0.045
Token It
Token ID632
Feature activation+5.33e-01

Pos logit contributions
 unnecessary+0.003
please+0.003
eter+0.003
aa+0.003
ucker+0.003
Neg logit contributions
nam-0.003
kins-0.003
lf-0.003
 Marie-0.003
 globe-0.003
Token made
Token ID925
Feature activation+4.19e-01

Pos logit contributions
nam+0.014
kins+0.014
lf+0.014
 Marie+0.014
atively+0.013
Neg logit contributions
 unnecessary-0.012
eter-0.012
please-0.012
aa-0.012
ucker-0.011
Token him
Token ID683
Feature activation-3.50e-01

Pos logit contributions
 Marie+0.006
lf+0.006
nam+0.006
kins+0.006
 Anne+0.006
Neg logit contributions
please-0.014
eter-0.013
ucker-0.013
 unnecessary-0.013
aa-0.013
Token spit
Token ID27591
Feature activation-2.65e+00

Pos logit contributions
please+0.008
 unnecessary+0.008
eter+0.008
ch+0.007
ucker+0.007
Neg logit contributions
nam-0.010
kins-0.009
 Marie-0.009
igator-0.009
lf-0.009
Token and
Token ID290
Feature activation-1.23e+00

Pos logit contributions
please+0.093
eter+0.090
ucker+0.089
ounced+0.088
 unnecessary+0.087
Neg logit contributions
book-0.040
 Marie-0.038
kins-0.037
taker-0.037
fully-0.036
Token cough
Token ID22094
Feature activation-3.58e+00

Pos logit contributions
 unnecessary+0.026
eter+0.026
please+0.026
aa+0.025
ch+0.024
Neg logit contributions
nam-0.036
kins-0.035
 Marie-0.034
lf-0.034
atively-0.032
Token.
Token ID13
Feature activation-1.91e-01

Pos logit contributions
 unnecessary+0.113
eter+0.111
please+0.110
ucker+0.106
ead+0.104
Neg logit contributions
kins-0.064
atively-0.060
book-0.059
nam-0.058
fully-0.056
Token It
Token ID632
Feature activation+2.54e-01

Pos logit contributions
 unnecessary+0.005
please+0.005
eter+0.005
aa+0.005
ucker+0.005
Neg logit contributions
nam-0.005
kins-0.005
 Marie-0.005
lf-0.005
 globe-0.004
Token made
Token ID925
Feature activation+4.21e-01

Pos logit contributions
kins+0.007
nam+0.007
lf+0.006
atively+0.006
 Marie+0.006
Neg logit contributions
 unnecessary-0.006
eter-0.006
please-0.006
aa-0.006
ead-0.006
Token him
Token ID683
Feature activation-4.49e-01

Pos logit contributions
lf+0.007
 Marie+0.007
kins+0.007
nam+0.006
atively+0.006
Neg logit contributions
please-0.014
ucker-0.013
eter-0.013
 unnecessary-0.013
aa-0.013
Token cry
Token ID3960
Feature activation-1.02e+00

Pos logit contributions
please+0.009
 unnecessary+0.009
eter+0.009
ucker+0.009
aa+0.009
Neg logit contributions
nam-0.013
kins-0.012
 Marie-0.012
igator-0.012
lf-0.012
Token and
Token ID290
Feature activation+4.25e-02

Pos logit contributions
 unnecessary+0.040
please+0.039
aa+0.037
eter+0.037
ucker+0.037
Neg logit contributions
kins-0.025
lf-0.024
nam-0.023
 Marie-0.023
 globe-0.023
Token bitter
Token ID12922
Feature activation-1.46e+00

Pos logit contributions
nam+0.001
kins+0.001
 Marie+0.001
lf+0.001
igator+0.001
Neg logit contributions
 unnecessary-0.001
please-0.001
eter-0.001
aa-0.001
ead-0.001
Token and
Token ID290
Feature activation+4.45e-01

Pos logit contributions
 unnecessary+0.048
please+0.047
eter+0.045
aa+0.045
ucker+0.045
Neg logit contributions
kins-0.026
 Marie-0.024
lf-0.023
nam-0.023
book-0.023
Token makes
Token ID1838
Feature activation+4.89e-01

Pos logit contributions
nam+0.013
kins+0.013
 Marie+0.012
lf+0.012
igator+0.012
Neg logit contributions
 unnecessary-0.010
please-0.010
eter-0.009
aa-0.009
ead-0.009
Token them
Token ID606
Feature activation+3.71e-02

Pos logit contributions
kins+0.009
 Marie+0.009
nam+0.009
lf+0.009
 Laura+0.008
Neg logit contributions
please-0.016
 unnecessary-0.014
eter-0.014
ucker-0.014
aa-0.014
Token cough
Token ID22094
Feature activation-3.52e+00

Pos logit contributions
nam+0.001
kins+0.001
 Marie+0.001
igator+0.001
atively+0.001
Neg logit contributions
please-0.001
 unnecessary-0.001
eter-0.001
ch-0.001
aa-0.001
Token.
Token ID13
Feature activation-3.29e-02

Pos logit contributions
 unnecessary+0.117
please+0.116
eter+0.115
ounced+0.112
ucker+0.111
Neg logit contributions
kins-0.056
book-0.051
atively-0.049
nam-0.048
taker-0.048
Token They
Token ID1119
Feature activation-2.46e-02

Pos logit contributions
 unnecessary+0.001
please+0.001
eter+0.001
aa+0.001
ch+0.001
Neg logit contributions
 Marie-0.001
nam-0.001
kins-0.001
 tripod-0.001
 Laura-0.001
Token spit
Token ID27591
Feature activation-1.38e+00

Pos logit contributions
 unnecessary+0.001
eter+0.001
please+0.001
aa+0.001
ch+0.001
Neg logit contributions
nam-0.001
kins-0.001
 Marie-0.001
lf-0.001
 tripod-0.001
Token it
Token ID340
Feature activation-1.24e+00

Pos logit contributions
please+0.036
eter+0.035
 unnecessary+0.035
ucker+0.034
ounced+0.033
Neg logit contributions
kins-0.033
 Marie-0.033
book-0.032
nam-0.031
 globe-0.031
Token out
Token ID503
Feature activation-3.36e-01

Pos logit contributions
 unnecessary+0.040
please+0.040
eter+0.040
ucker+0.039
aa+0.037
Neg logit contributions
kins-0.020
book-0.020
 Marie-0.019
nam-0.019
 globe-0.019
Token is
Token ID318
Feature activation+9.15e-02

Pos logit contributions
kins+0.003
igator+0.003
 globe+0.003
 Marie+0.003
nam+0.003
Neg logit contributions
please-0.004
 unnecessary-0.004
aa-0.004
eter-0.004
ead-0.004
Token salty
Token ID36021
Feature activation-2.05e+00

Pos logit contributions
nam+0.002
kins+0.002
 Marie+0.002
igator+0.002
lf+0.002
Neg logit contributions
please-0.002
 unnecessary-0.002
aa-0.002
eter-0.002
ead-0.002
Token and
Token ID290
Feature activation-2.16e-01

Pos logit contributions
 unnecessary+0.067
please+0.064
eter+0.063
aa+0.062
ucker+0.061
Neg logit contributions
kins-0.038
lf-0.035
 Marie-0.034
 globe-0.033
book-0.033
Token make
Token ID787
Feature activation+2.97e-01

Pos logit contributions
 unnecessary+0.005
please+0.005
eter+0.005
aa+0.004
ead+0.004
Neg logit contributions
nam-0.006
kins-0.006
 Marie-0.006
igator-0.006
lf-0.006
Token them
Token ID606
Feature activation+1.33e-01

Pos logit contributions
kins+0.006
 Marie+0.006
nam+0.005
lf+0.005
 Becky+0.005
Neg logit contributions
please-0.009
aa-0.008
eter-0.008
 unnecessary-0.008
ounced-0.008
Token cough
Token ID22094
Feature activation-3.52e+00

Pos logit contributions
nam+0.004
kins+0.004
igator+0.003
 Marie+0.003
 globe+0.003
Neg logit contributions
please-0.003
 unnecessary-0.003
eter-0.003
ch-0.003
aa-0.003
Token.
Token ID13
Feature activation-6.04e-03

Pos logit contributions
 unnecessary+0.117
please+0.117
eter+0.116
ounced+0.112
ead+0.111
Neg logit contributions
kins-0.056
book-0.050
atively-0.047
fully-0.046
nam-0.046
Token They
Token ID1119
Feature activation-3.08e-01

Pos logit contributions
 unnecessary+0.000
please+0.000
aa+0.000
eter+0.000
ch+0.000
Neg logit contributions
kins-0.000
nam-0.000
 Marie-0.000
book-0.000
 tripod-0.000
Token are
Token ID389
Feature activation-1.56e-01

Pos logit contributions
eter+0.008
 unnecessary+0.008
please+0.008
aa+0.007
ch+0.007
Neg logit contributions
nam-0.008
kins-0.007
 Marie-0.007
igator-0.007
 tripod-0.007
Token not
Token ID407
Feature activation-1.18e-01

Pos logit contributions
eter+0.004
please+0.004
aa+0.004
 unnecessary+0.004
ch+0.004
Neg logit contributions
nam-0.004
kins-0.004
igator-0.003
 Marie-0.003
kin-0.003
Token happy
Token ID3772
Feature activation-4.78e-03

Pos logit contributions
eter+0.003
 unnecessary+0.003
ch+0.003
aa+0.003
please+0.003
Neg logit contributions
nam-0.003
kins-0.003
 Marie-0.003
igator-0.003
lf-0.003
Token made
Token ID925
Feature activation+3.57e-01

Pos logit contributions
kins+0.011
lf+0.011
nam+0.011
 globe+0.011
 Marie+0.010
Neg logit contributions
 unnecessary-0.014
please-0.013
eter-0.013
ucker-0.013
aa-0.012
Token the
Token ID262
Feature activation-7.92e-01

Pos logit contributions
lf+0.007
kins+0.007
nam+0.006
 Marie+0.006
book+0.006
Neg logit contributions
please-0.011
eter-0.011
 unnecessary-0.011
ucker-0.011
ounced-0.010
Token flowers
Token ID12734
Feature activation-9.95e-01

Pos logit contributions
ucker+0.022
eter+0.020
ead+0.019
ounced+0.019
 unnecessary+0.019
Neg logit contributions
nam-0.019
kins-0.019
 telescope-0.018
 globe-0.018
lf-0.018
Token and
Token ID290
Feature activation-4.67e-01

Pos logit contributions
 unnecessary+0.022
please+0.021
eter+0.021
aa+0.021
ucker+0.021
Neg logit contributions
nam-0.028
kins-0.027
lf-0.026
 Marie-0.026
 globe-0.026
Token trees
Token ID7150
Feature activation-1.44e+00

Pos logit contributions
 unnecessary+0.011
please+0.010
eter+0.010
aa+0.010
ucker+0.010
Neg logit contributions
nam-0.013
kins-0.012
lf-0.012
 Marie-0.012
 globe-0.012
Token grow
Token ID1663
Feature activation-3.49e+00

Pos logit contributions
please+0.034
aa+0.034
ch+0.034
 unnecessary+0.033
eter+0.033
Neg logit contributions
nam-0.037
 Marie-0.036
kins-0.036
lf-0.036
 tripod-0.034
Token big
Token ID1263
Feature activation-4.37e-01

Pos logit contributions
 unnecessary+0.094
please+0.094
eter+0.093
aa+0.091
ucker+0.091
Neg logit contributions
nam-0.078
kins-0.078
lf-0.074
 Marie-0.072
 globe-0.071
Token and
Token ID290
Feature activation+2.70e-02

Pos logit contributions
 unnecessary+0.012
please+0.012
ucker+0.012
aa+0.012
eter+0.012
Neg logit contributions
nam-0.009
kins-0.009
lf-0.009
 Marie-0.009
 globe-0.009
Token strong
Token ID1913
Feature activation-7.69e-01

Pos logit contributions
nam+0.001
kins+0.001
lf+0.001
 Marie+0.001
 globe+0.001
Neg logit contributions
 unnecessary-0.001
please-0.001
eter-0.001
aa-0.001
ucker-0.001
Token.
Token ID13
Feature activation-8.68e-02

Pos logit contributions
ucker+0.025
please+0.024
 unnecessary+0.024
aa+0.024
eter+0.023
Neg logit contributions
nam-0.014
kins-0.013
lf-0.012
 Marie-0.012
 globe-0.012
Token In
Token ID554
Feature activation+6.15e-01

Pos logit contributions
ch+0.002
eter+0.001
ead+0.001
please+0.001
aa+0.001
Neg logit contributions
kins-0.003
nam-0.003
lf-0.003
atively-0.003
 globe-0.003
Token itch
Token ID49700
Feature activation-1.65e+00

Pos logit contributions
 unnecessary+0.063
please+0.062
eter+0.060
aa+0.058
ead+0.057
Neg logit contributions
kins-0.069
nam-0.066
lf-0.065
atively-0.064
 globe-0.063
Token.
Token ID13
Feature activation-2.01e-01

Pos logit contributions
ch+0.038
please+0.037
 unnecessary+0.037
eter+0.037
aa+0.036
Neg logit contributions
nam-0.045
 Marie-0.042
kins-0.042
lf-0.041
 tripod-0.040
Token He
Token ID679
Feature activation-5.82e-02

Pos logit contributions
 unnecessary+0.006
please+0.006
aa+0.006
eter+0.006
ch+0.006
Neg logit contributions
nam-0.004
 Marie-0.004
kins-0.004
book-0.004
atively-0.004
Token feels
Token ID5300
Feature activation-3.55e-01

Pos logit contributions
eter+0.002
please+0.002
 unnecessary+0.002
aa+0.001
ead+0.001
Neg logit contributions
kins-0.001
nam-0.001
 Marie-0.001
lf-0.001
atively-0.001
Token his
Token ID465
Feature activation-9.57e-01

Pos logit contributions
eter+0.010
aa+0.010
please+0.010
ucker+0.010
ead+0.009
Neg logit contributions
nam-0.008
kins-0.008
 Marie-0.007
igator-0.007
taker-0.007
Token throat
Token ID13589
Feature activation-3.49e+00

Pos logit contributions
please+0.020
 unnecessary+0.020
eter+0.020
ch+0.019
aa+0.019
Neg logit contributions
kins-0.027
nam-0.027
atively-0.025
 globe-0.025
lf-0.025
Token swell
Token ID36214
Feature activation-1.93e-01

Pos logit contributions
 unnecessary+0.081
please+0.080
eter+0.079
aa+0.074
ucker+0.074
Neg logit contributions
kins-0.094
nam-0.089
lf-0.088
atively-0.086
book-0.085
Token.
Token ID13
Feature activation-1.84e-01

Pos logit contributions
ch+0.004
eter+0.004
 unnecessary+0.004
please+0.004
aa+0.004
Neg logit contributions
nam-0.005
kins-0.005
 tripod-0.005
 Marie-0.005
lf-0.005
Token He
Token ID679
Feature activation-1.50e-01

Pos logit contributions
 unnecessary+0.006
please+0.005
eter+0.005
aa+0.005
ch+0.005
Neg logit contributions
 Marie-0.003
nam-0.003
kins-0.003
book-0.003
atively-0.003
Token feels
Token ID5300
Feature activation-4.16e-01

Pos logit contributions
eter+0.004
please+0.004
 unnecessary+0.004
aa+0.004
ead+0.004
Neg logit contributions
kins-0.003
nam-0.003
 Marie-0.003
lf-0.003
atively-0.003
Token his
Token ID465
Feature activation-9.54e-01

Pos logit contributions
eter+0.012
aa+0.011
please+0.011
ucker+0.011
ead+0.011
Neg logit contributions
nam-0.010
kins-0.009
 Marie-0.009
taker-0.008
igator-0.008
Token covered
Token ID5017
Feature activation+5.41e-01

Pos logit contributions
kins+0.005
nam+0.005
lf+0.005
 Marie+0.005
 globe+0.004
Neg logit contributions
 unnecessary-0.005
eter-0.004
please-0.004
aa-0.004
ucker-0.004
Token Tom
Token ID4186
Feature activation+8.11e-02

Pos logit contributions
nam+0.016
lf+0.015
kins+0.015
 tripod+0.015
 Marie+0.015
Neg logit contributions
ch-0.010
eter-0.010
 trouble-0.010
aa-0.010
 unnecessary-0.010
Token and
Token ID290
Feature activation+2.93e-01

Pos logit contributions
nam+0.002
kins+0.002
 Marie+0.002
atively+0.002
lf+0.002
Neg logit contributions
ch-0.002
eter-0.002
aa-0.002
ucker-0.002
 unnecessary-0.002
Token made
Token ID925
Feature activation+3.09e-01

Pos logit contributions
nam+0.008
kins+0.007
lf+0.007
 Marie+0.007
 globe+0.007
Neg logit contributions
 unnecessary-0.007
please-0.007
eter-0.007
aa-0.007
ucker-0.007
Token him
Token ID683
Feature activation-2.64e-01

Pos logit contributions
 Marie+0.006
lf+0.006
nam+0.006
kins+0.006
atively+0.006
Neg logit contributions
please-0.009
ucker-0.009
 unnecessary-0.009
eter-0.009
aa-0.009
Token cough
Token ID22094
Feature activation-3.49e+00

Pos logit contributions
 unnecessary+0.005
eter+0.005
please+0.005
aa+0.005
ucker+0.005
Neg logit contributions
nam-0.008
kins-0.008
lf-0.008
 Marie-0.007
 globe-0.007
Token.
Token ID13
Feature activation+6.43e-02

Pos logit contributions
 unnecessary+0.111
please+0.110
eter+0.109
ucker+0.107
ead+0.104
Neg logit contributions
kins-0.060
nam-0.056
atively-0.055
book-0.054
 globe-0.054
Token He
Token ID679
Feature activation+1.27e-01

Pos logit contributions
nam+0.002
 Marie+0.001
kins+0.001
lf+0.001
book+0.001
Neg logit contributions
please-0.002
 unnecessary-0.002
eter-0.002
aa-0.002
ucker-0.002
Token tried
Token ID3088
Feature activation+2.47e-01

Pos logit contributions
nam+0.003
kins+0.003
lf+0.003
 Marie+0.003
 globe+0.003
Neg logit contributions
 unnecessary-0.003
please-0.003
eter-0.003
aa-0.003
ucker-0.003
Token to
Token ID284
Feature activation+6.83e-01

Pos logit contributions
nam+0.008
 Marie+0.008
kins+0.008
lf+0.008
 globe+0.007
Neg logit contributions
 unnecessary-0.004
please-0.004
eter-0.004
ucker-0.004
aa-0.004
Token run
Token ID1057
Feature activation+5.42e-02

Pos logit contributions
nam+0.020
kins+0.020
lf+0.020
 Marie+0.019
 globe+0.019
Neg logit contributions
 unnecessary-0.013
please-0.013
eter-0.013
aa-0.013
ch-0.013
Token young
Token ID1862
Feature activation+1.03e+00

Pos logit contributions
eter+0.022
 unnecessary+0.021
ucker+0.020
ead+0.020
please+0.020
Neg logit contributions
lf-0.020
 globe-0.019
 telescope-0.019
nam-0.019
 tripod-0.019
Token girl
Token ID2576
Feature activation-2.06e-01

Pos logit contributions
nam+0.027
kins+0.026
lf+0.026
 Marie+0.026
 globe+0.025
Neg logit contributions
 unnecessary-0.025
please-0.024
eter-0.024
ucker-0.024
ch-0.024
Token had
Token ID550
Feature activation-6.02e-01

Pos logit contributions
 unnecessary+0.006
please+0.006
eter+0.006
aa+0.006
ch+0.006
Neg logit contributions
nam-0.004
lf-0.004
kins-0.004
 Marie-0.004
 globe-0.004
Token caught
Token ID4978
Feature activation-5.07e-01

Pos logit contributions
 unnecessary+0.016
ucker+0.015
eter+0.015
aa+0.015
please+0.015
Neg logit contributions
nam-0.015
 Marie-0.014
lf-0.014
kins-0.014
atively-0.013
Token a
Token ID257
Feature activation-9.87e-01

Pos logit contributions
 unnecessary+0.014
please+0.014
eter+0.013
aa+0.013
ucker+0.013
Neg logit contributions
nam-0.011
kins-0.011
lf-0.011
 Marie-0.011
 globe-0.010
Token severe
Token ID6049
Feature activation-3.48e+00

Pos logit contributions
eter+0.025
 unnecessary+0.024
please+0.024
ucker+0.024
ead+0.023
Neg logit contributions
nam-0.025
 globe-0.024
lf-0.023
kins-0.023
 telescope-0.023
Token case
Token ID1339
Feature activation-4.80e-01

Pos logit contributions
please+0.083
 unnecessary+0.083
ch+0.082
eter+0.082
ucker+0.081
Neg logit contributions
lf-0.092
nam-0.089
kins-0.089
 globe-0.087
igator-0.087
Token of
Token ID286
Feature activation-7.43e-01

Pos logit contributions
 trouble+0.008
ch+0.008
less+0.008
aa+0.008
eter+0.008
Neg logit contributions
nam-0.014
kins-0.014
 Marie-0.014
lf-0.014
 Becky-0.013
Token hyp
Token ID5328
Feature activation+2.75e-01

Pos logit contributions
eter+0.018
ucker+0.017
please+0.017
aa+0.017
ch+0.016
Neg logit contributions
nam-0.021
kins-0.019
lf-0.019
 Marie-0.018
igator-0.018
Tokenother
Token ID847
Feature activation+7.51e-01

Pos logit contributions
nam+0.007
kins+0.007
 Marie+0.007
lf+0.007
book+0.007
Neg logit contributions
 unnecessary-0.006
please-0.006
eter-0.006
ucker-0.006
aa-0.006
Tokenmia
Token ID20730
Feature activation+5.05e-01

Pos logit contributions
nam+0.021
kins+0.020
lf+0.020
 Marie+0.020
atively+0.019
Neg logit contributions
please-0.017
 unnecessary-0.017
eter-0.016
ucker-0.016
aa-0.016
Token the
Token ID262
Feature activation-1.19e+00

Pos logit contributions
 unnecessary+0.039
please+0.037
eter+0.037
ucker+0.035
aa+0.035
Neg logit contributions
kins-0.028
 globe-0.026
lf-0.025
nam-0.025
igator-0.024
Token plants
Token ID6134
Feature activation-3.10e-01

Pos logit contributions
 unnecessary+0.025
eter+0.025
please+0.025
ucker+0.025
aa+0.024
Neg logit contributions
nam-0.034
kins-0.033
lf-0.032
 Marie-0.031
 globe-0.031
Token and
Token ID290
Feature activation-9.87e-01

Pos logit contributions
please+0.007
ch+0.007
 unnecessary+0.007
eter+0.007
ucker+0.007
Neg logit contributions
nam-0.008
lf-0.008
 Marie-0.008
kins-0.008
 tripod-0.008
Token watch
Token ID2342
Feature activation-1.17e+00

Pos logit contributions
 unnecessary+0.023
eter+0.023
ucker+0.022
aa+0.022
please+0.022
Neg logit contributions
nam-0.027
kins-0.026
lf-0.025
 Marie-0.025
 globe-0.024
Token them
Token ID606
Feature activation-5.95e-01

Pos logit contributions
ucker+0.035
 unnecessary+0.035
please+0.035
eter+0.034
ounced+0.034
Neg logit contributions
nam-0.024
lf-0.023
kins-0.023
 Marie-0.023
 globe-0.022
Token grow
Token ID1663
Feature activation-3.48e+00

Pos logit contributions
 unnecessary+0.014
please+0.014
eter+0.014
ucker+0.014
aa+0.014
Neg logit contributions
nam-0.016
kins-0.016
lf-0.015
 Marie-0.015
 globe-0.015
Token big
Token ID1263
Feature activation-1.01e+00

Pos logit contributions
please+0.093
 unnecessary+0.093
eter+0.091
aa+0.091
ucker+0.090
Neg logit contributions
kins-0.080
nam-0.077
 Marie-0.073
lf-0.072
 globe-0.071
Token and
Token ID290
Feature activation-2.13e-01

Pos logit contributions
 unnecessary+0.030
please+0.030
ucker+0.030
aa+0.029
eter+0.029
Neg logit contributions
kins-0.020
nam-0.019
 Marie-0.019
lf-0.018
 globe-0.018
Token tall
Token ID7331
Feature activation-5.42e-01

Pos logit contributions
 unnecessary+0.005
eter+0.005
please+0.005
aa+0.005
ucker+0.005
Neg logit contributions
nam-0.006
kins-0.006
lf-0.006
 Marie-0.005
 globe-0.005
Token.
Token ID13
Feature activation-2.30e-01

Pos logit contributions
 unnecessary+0.015
please+0.015
ucker+0.015
aa+0.015
eter+0.015
Neg logit contributions
kins-0.012
nam-0.011
lf-0.011
 globe-0.011
 Marie-0.011
Token
Token ID220
Feature activation-5.53e-01

Pos logit contributions
 unnecessary+0.006
please+0.006
eter+0.006
aa+0.006
ucker+0.006
Neg logit contributions
nam-0.006
kins-0.005
lf-0.005
 Marie-0.005
 globe-0.005
Token was
Token ID373
Feature activation+3.24e-01

Pos logit contributions
please+0.003
 unnecessary+0.003
eter+0.003
ch+0.003
aa+0.003
Neg logit contributions
nam-0.004
kins-0.004
lf-0.004
 Marie-0.004
 globe-0.004
Token bitter
Token ID12922
Feature activation-1.46e+00

Pos logit contributions
nam+0.008
kins+0.007
lf+0.007
 Marie+0.007
book+0.007
Neg logit contributions
please-0.009
 unnecessary-0.009
eter-0.009
aa-0.009
ucker-0.009
Token and
Token ID290
Feature activation-1.15e-01

Pos logit contributions
please+0.051
 unnecessary+0.051
ucker+0.049
ounced+0.049
eter+0.049
Neg logit contributions
kins-0.023
lf-0.022
 globe-0.021
book-0.020
igator-0.018
Token made
Token ID925
Feature activation+6.34e-01

Pos logit contributions
 unnecessary+0.002
please+0.002
eter+0.002
aa+0.002
ucker+0.002
Neg logit contributions
nam-0.003
kins-0.003
lf-0.003
 Marie-0.003
 globe-0.003
Token them
Token ID606
Feature activation+3.27e-01

Pos logit contributions
kins+0.015
nam+0.015
lf+0.015
 Marie+0.015
book+0.015
Neg logit contributions
please-0.016
 unnecessary-0.016
ucker-0.015
eter-0.015
aa-0.015
Token cough
Token ID22094
Feature activation-3.45e+00

Pos logit contributions
nam+0.009
kins+0.009
lf+0.009
 Marie+0.009
 globe+0.009
Neg logit contributions
please-0.007
 unnecessary-0.007
eter-0.007
aa-0.007
ucker-0.007
Token.
Token ID13
Feature activation+3.18e-01

Pos logit contributions
 unnecessary+0.110
please+0.110
eter+0.108
ucker+0.108
ead+0.104
Neg logit contributions
kins-0.059
nam-0.055
book-0.054
atively-0.054
 globe-0.052
Token
Token ID198
Feature activation+1.85e-01

Pos logit contributions
nam+0.009
kins+0.009
lf+0.008
 Marie+0.008
 globe+0.008
Neg logit contributions
 unnecessary-0.007
please-0.007
eter-0.007
aa-0.007
ucker-0.007
Token
Token ID198
Feature activation+5.57e-01

Pos logit contributions
nam+0.005
kins+0.005
lf+0.005
 Marie+0.004
 globe+0.004
Neg logit contributions
 unnecessary-0.004
please-0.004
eter-0.004
aa-0.004
ucker-0.004
Token"
Token ID1
Feature activation-1.18e-01

Pos logit contributions
nam+0.016
kins+0.015
 Marie+0.015
 globe+0.015
 telescope+0.015
Neg logit contributions
please-0.012
aa-0.012
ch-0.012
ucker-0.011
eter-0.011
TokenWow
Token ID22017
Feature activation+5.43e-01

Pos logit contributions
 unnecessary+0.002
please+0.002
eter+0.002
aa+0.002
ucker+0.002
Neg logit contributions
nam-0.003
kins-0.003
lf-0.003
 Marie-0.003
 globe-0.003
Token was
Token ID373
Feature activation-5.12e-01

Pos logit contributions
kins+0.012
lf+0.012
nam+0.012
 Marie+0.012
atively+0.011
Neg logit contributions
 unnecessary-0.016
aa-0.015
eter-0.015
please-0.015
ucker-0.015
Token the
Token ID262
Feature activation-1.01e+00

Pos logit contributions
aa+0.017
 unnecessary+0.017
please+0.015
eter+0.015
ead+0.015
Neg logit contributions
nam-0.010
 Marie-0.009
kins-0.009
 Laura-0.008
lf-0.008
Token sweet
Token ID6029
Feature activation-8.03e-01

Pos logit contributions
 unnecessary+0.027
ucker+0.027
aa+0.027
ch+0.027
eter+0.026
Neg logit contributions
nam-0.025
 globe-0.022
kins-0.022
 Marie-0.022
igator-0.021
Tokenest
Token ID395
Feature activation+9.53e-01

Pos logit contributions
 unnecessary+0.025
ounced+0.021
please+0.021
ch+0.021
check+0.021
Neg logit contributions
kins-0.018
 Marie-0.017
 globe-0.017
igator-0.016
lf-0.016
Token and
Token ID290
Feature activation+5.21e-01

Pos logit contributions
kins+0.025
 globe+0.025
lf+0.025
 Marie+0.024
atively+0.024
Neg logit contributions
 unnecessary-0.024
ucker-0.020
please-0.020
ch-0.020
aa-0.019
Token ju
Token ID7544
Feature activation-3.45e+00

Pos logit contributions
nam+0.016
kins+0.016
 Marie+0.015
lf+0.015
 globe+0.015
Neg logit contributions
 unnecessary-0.010
aa-0.010
please-0.010
ucker-0.010
eter-0.010
Tokenic
Token ID291
Feature activation+6.18e-01

Pos logit contributions
 unnecessary+0.067
please+0.062
aa+0.061
ch+0.060
 trouble+0.059
Neg logit contributions
kins-0.108
lf-0.100
atively-0.098
 globe-0.096
nam-0.095
Tokeniest
Token ID6386
Feature activation+5.98e-01

Pos logit contributions
nam+0.019
 Marie+0.017
 tripod+0.017
 telescope+0.016
lf+0.016
Neg logit contributions
aa-0.013
ch-0.013
please-0.012
ucker-0.012
eter-0.012
Token strawberry
Token ID41236
Feature activation-3.94e-01

Pos logit contributions
kins+0.016
nam+0.016
lf+0.015
 Marie+0.015
 globe+0.015
Neg logit contributions
 unnecessary-0.015
ch-0.013
ucker-0.013
aa-0.013
please-0.012
Token she
Token ID673
Feature activation-6.17e-01

Pos logit contributions
 unnecessary+0.010
please+0.010
eter+0.010
aa+0.010
ucker+0.010
Neg logit contributions
nam-0.010
kins-0.010
lf-0.009
 Marie-0.009
 globe-0.009
Token had
Token ID550
Feature activation-9.20e-02

Pos logit contributions
 unnecessary+0.018
eter+0.017
please+0.017
aa+0.017
ucker+0.017
Neg logit contributions
nam-0.013
kins-0.013
lf-0.012
 Marie-0.012
 globe-0.012
Token She
Token ID1375
Feature activation+1.70e-01

Pos logit contributions
 unnecessary+0.003
please+0.002
eter+0.002
aa+0.002
ucker+0.002
Neg logit contributions
nam-0.002
 Marie-0.002
kins-0.002
lf-0.002
 globe-0.002
Token found
Token ID1043
Feature activation-3.85e-01

Pos logit contributions
nam+0.005
kins+0.005
lf+0.004
 Marie+0.004
 globe+0.004
Neg logit contributions
 unnecessary-0.004
eter-0.004
please-0.004
aa-0.004
ucker-0.004
Token the
Token ID262
Feature activation-7.26e-01

Pos logit contributions
eter+0.013
 unnecessary+0.013
ucker+0.013
aa+0.013
please+0.012
Neg logit contributions
nam-0.007
 Marie-0.006
kins-0.006
atively-0.005
igator-0.005
Token biggest
Token ID4094
Feature activation+7.14e-01

Pos logit contributions
 unnecessary+0.018
ch+0.017
eter+0.017
aa+0.017
ucker+0.017
Neg logit contributions
nam-0.019
kins-0.019
 Marie-0.017
 globe-0.016
atively-0.016
Token and
Token ID290
Feature activation+3.73e-01

Pos logit contributions
kins+0.019
nam+0.018
 Marie+0.017
 globe+0.016
lf+0.016
Neg logit contributions
 unnecessary-0.019
ch-0.017
aa-0.016
eter-0.016
please-0.016
Token ju
Token ID7544
Feature activation-3.44e+00

Pos logit contributions
nam+0.011
kins+0.011
lf+0.011
 Marie+0.010
 globe+0.010
Neg logit contributions
 unnecessary-0.008
eter-0.007
please-0.007
aa-0.007
ucker-0.007
Tokenic
Token ID291
Feature activation+6.53e-01

Pos logit contributions
 unnecessary+0.071
aa+0.065
ch+0.065
please+0.063
 trouble+0.063
Neg logit contributions
kins-0.105
lf-0.100
nam-0.091
atively-0.091
 globe-0.090
Tokeniest
Token ID6386
Feature activation+7.45e-01

Pos logit contributions
nam+0.020
 Marie+0.018
 tripod+0.018
 telescope+0.018
 globe+0.017
Neg logit contributions
aa-0.013
please-0.013
ch-0.013
ucker-0.012
 unnecessary-0.012
Token one
Token ID530
Feature activation+6.04e-01

Pos logit contributions
kins+0.022
nam+0.022
 Marie+0.021
lf+0.020
atively+0.020
Neg logit contributions
 unnecessary-0.016
ch-0.015
eter-0.014
aa-0.014
ucker-0.014
Token,
Token ID11
Feature activation-8.79e-01

Pos logit contributions
nam+0.017
lf+0.016
kins+0.016
 tripod+0.016
atively+0.016
Neg logit contributions
please-0.013
eter-0.013
 unnecessary-0.012
aa-0.012
ch-0.012
Token but
Token ID475
Feature activation-3.36e-01

Pos logit contributions
 unnecessary+0.020
please+0.020
eter+0.020
aa+0.019
ucker+0.019
Neg logit contributions
nam-0.023
kins-0.023
lf-0.023
 Marie-0.022
 globe-0.022
Token cricket
Token ID18836
Feature activation+6.04e-01

Pos logit contributions
eter+0.005
 unnecessary+0.005
ucker+0.005
aa+0.005
ounced+0.005
Neg logit contributions
lf-0.005
kins-0.005
nam-0.005
 Marie-0.005
 globe-0.004
Token woke
Token ID19092
Feature activation-7.84e-01

Pos logit contributions
nam+0.016
kins+0.016
lf+0.015
 Marie+0.015
 globe+0.015
Neg logit contributions
 unnecessary-0.014
please-0.014
eter-0.013
aa-0.013
ch-0.013
Token up
Token ID510
Feature activation-7.60e-01

Pos logit contributions
eter+0.026
please+0.026
aa+0.025
 unnecessary+0.025
ucker+0.025
Neg logit contributions
lf-0.013
 Marie-0.013
nam-0.012
kins-0.012
atively-0.012
Token with
Token ID351
Feature activation+7.21e-01

Pos logit contributions
 unnecessary+0.019
please+0.018
eter+0.018
aa+0.018
ucker+0.018
Neg logit contributions
nam-0.019
kins-0.019
lf-0.018
 Marie-0.018
atively-0.018
Token a
Token ID257
Feature activation-1.40e+00

Pos logit contributions
nam+0.016
kins+0.015
lf+0.015
 Marie+0.014
 globe+0.014
Neg logit contributions
 unnecessary-0.020
please-0.020
eter-0.020
aa-0.020
ucker-0.020
Token sore
Token ID19597
Feature activation-3.43e+00

Pos logit contributions
eter+0.037
please+0.036
ead+0.035
ucker+0.034
aa+0.034
Neg logit contributions
nam-0.033
 telescope-0.033
 Marie-0.032
 tripod-0.032
kins-0.031
Token throat
Token ID13589
Feature activation-3.00e+00

Pos logit contributions
 unnecessary+0.088
eter+0.084
please+0.082
ead+0.077
ucker+0.077
Neg logit contributions
kin-0.086
kins-0.082
atively-0.080
book-0.077
pe-0.077
Token and
Token ID290
Feature activation-2.06e-01

Pos logit contributions
 unnecessary+0.091
eter+0.088
please+0.087
ucker+0.086
aa+0.085
Neg logit contributions
kins-0.058
nam-0.057
lf-0.056
book-0.054
 Marie-0.054
Token couldn
Token ID3521
Feature activation-2.19e-02

Pos logit contributions
 unnecessary+0.005
eter+0.005
ucker+0.005
please+0.005
aa+0.005
Neg logit contributions
nam-0.005
kins-0.005
lf-0.005
 Marie-0.005
atively-0.004
Token't
Token ID470
Feature activation+5.47e-01

Pos logit contributions
 unnecessary+0.000
please+0.000
eter+0.000
aa+0.000
ucker+0.000
Neg logit contributions
nam-0.001
kins-0.001
lf-0.001
 Marie-0.001
 globe-0.001
Token sing
Token ID1702
Feature activation-1.04e+00

Pos logit contributions
nam+0.016
kins+0.016
lf+0.016
 Marie+0.015
 globe+0.015
Neg logit contributions
 unnecessary-0.011
eter-0.011
please-0.011
ucker-0.011
aa-0.011
Tokenummy
Token ID13513
Feature activation-8.37e-01

Pos logit contributions
 unnecessary+0.030
please+0.030
aa+0.026
ch+0.026
 fever+0.025
Neg logit contributions
kins-0.059
nam-0.058
 Marie-0.058
lf-0.057
book-0.057
Token.
Token ID13
Feature activation+2.77e-02

Pos logit contributions
 unnecessary+0.024
please+0.023
aa+0.023
eter+0.023
ucker+0.023
Neg logit contributions
nam-0.018
kins-0.018
lf-0.017
 Marie-0.017
 globe-0.016
Token It
Token ID632
Feature activation+5.25e-01

Pos logit contributions
nam+0.001
kins+0.001
lf+0.001
 Marie+0.001
 globe+0.001
Neg logit contributions
 unnecessary-0.001
eter-0.001
please-0.001
ucker-0.001
aa-0.001
Token made
Token ID925
Feature activation+7.55e-01

Pos logit contributions
nam+0.014
kins+0.013
lf+0.013
 Marie+0.013
 tripod+0.013
Neg logit contributions
please-0.012
eter-0.012
 unnecessary-0.012
ch-0.012
ucker-0.012
Token her
Token ID607
Feature activation-5.73e-01

Pos logit contributions
nam+0.010
 Marie+0.010
 Kate+0.009
 Anne+0.009
 Becky+0.009
Neg logit contributions
please-0.029
 unnecessary-0.028
eter-0.027
aa-0.027
ounced-0.027
Token tongue
Token ID11880
Feature activation-3.43e+00

Pos logit contributions
please+0.015
 unnecessary+0.015
ch+0.015
aa+0.015
ucker+0.014
Neg logit contributions
nam-0.014
kins-0.013
 Marie-0.013
 globe-0.012
lf-0.012
Token pink
Token ID11398
Feature activation-3.69e-01

Pos logit contributions
 unnecessary+0.090
please+0.086
aa+0.084
eter+0.081
ounced+0.079
Neg logit contributions
kins-0.082
nam-0.082
lf-0.074
atively-0.074
 globe-0.072
Token.
Token ID13
Feature activation-4.56e-02

Pos logit contributions
 unnecessary+0.010
please+0.009
eter+0.009
aa+0.009
ucker+0.009
Neg logit contributions
nam-0.009
kins-0.009
lf-0.008
 Marie-0.008
 globe-0.008
Token She
Token ID1375
Feature activation+1.14e-01

Pos logit contributions
 unnecessary+0.001
please+0.001
eter+0.001
aa+0.001
ucker+0.001
Neg logit contributions
nam-0.001
kins-0.001
lf-0.001
 Marie-0.001
 globe-0.001
Token l
Token ID300
Feature activation-1.46e+00

Pos logit contributions
nam+0.003
kins+0.003
lf+0.003
 Marie+0.003
 globe+0.002
Neg logit contributions
please-0.003
 unnecessary-0.003
eter-0.003
ucker-0.003
aa-0.003
Tokenicked
Token ID9484
Feature activation+4.74e-01

Pos logit contributions
 unnecessary+0.039
please+0.038
eter+0.038
aa+0.038
ucker+0.038
Neg logit contributions
nam-0.034
kins-0.033
lf-0.032
 Marie-0.032
 globe-0.031
TokenJ
Token ID41
Feature activation+6.60e-01

Pos logit contributions
 unnecessary+0.001
please+0.001
eter+0.001
ch+0.001
aa+0.001
Neg logit contributions
nam-0.002
kins-0.002
lf-0.002
 Marie-0.002
atively-0.002
Tokenill
Token ID359
Feature activation+5.55e-01

Pos logit contributions
nam+0.018
lf+0.016
kins+0.016
 globe+0.015
atively+0.015
Neg logit contributions
aa-0.017
ucker-0.017
eter-0.017
ead-0.016
ch-0.016
Token put
Token ID1234
Feature activation+5.87e-01

Pos logit contributions
nam+0.015
kins+0.014
lf+0.014
 Marie+0.013
 globe+0.013
Neg logit contributions
please-0.013
aa-0.013
ucker-0.013
eter-0.013
 unnecessary-0.012
Token on
Token ID319
Feature activation+1.78e+00

Pos logit contributions
nam+0.014
kins+0.014
lf+0.014
 Marie+0.013
atively+0.013
Neg logit contributions
please-0.015
 unnecessary-0.014
ch-0.014
aa-0.014
eter-0.014
Token her
Token ID607
Feature activation+1.14e+00

Pos logit contributions
igator+0.047
 globe+0.046
kins+0.046
taker+0.046
atively+0.045
Neg logit contributions
ead-0.039
eter-0.039
 lemon-0.038
 trouble-0.038
please-0.038
Token wide
Token ID3094
Feature activation+3.10e+00

Pos logit contributions
kins+0.034
nam+0.034
taker+0.034
 Marie+0.034
atively+0.034
Neg logit contributions
 fever-0.020
 fle-0.020
 tum-0.020
ch-0.019
please-0.019
Token uniform
Token ID8187
Feature activation+1.04e+00

Pos logit contributions
nam+0.087
kins+0.086
 Marie+0.080
 globe+0.078
taker+0.078
Neg logit contributions
ch-0.066
aa-0.065
 fever-0.065
eter-0.064
 tum-0.064
Token and
Token ID290
Feature activation+1.18e+00

Pos logit contributions
nam+0.033
kins+0.032
 Marie+0.031
igator+0.031
 tripod+0.030
Neg logit contributions
eter-0.018
ead-0.018
ch-0.017
please-0.017
aa-0.017
Token looked
Token ID3114
Feature activation+3.44e-01

Pos logit contributions
kins+0.034
 globe+0.033
nam+0.032
atively+0.032
pe+0.032
Neg logit contributions
please-0.024
aa-0.023
ch-0.022
 unnecessary-0.022
eter-0.021
Token in
Token ID287
Feature activation+4.27e-01

Pos logit contributions
kins+0.010
nam+0.010
lf+0.010
 Marie+0.009
 globe+0.009
Neg logit contributions
 unnecessary-0.007
please-0.007
aa-0.007
eter-0.007
ch-0.007
Token the
Token ID262
Feature activation+7.09e-01

Pos logit contributions
nam+0.010
lf+0.009
kins+0.009
 Marie+0.009
 globe+0.009
Neg logit contributions
please-0.012
 unnecessary-0.012
aa-0.012
ucker-0.012
eter-0.012
Token Let
Token ID3914
Feature activation-2.85e-01

Pos logit contributions
 unnecessary+0.000
please+0.000
eter+0.000
aa+0.000
ucker+0.000
Neg logit contributions
nam-0.000
kins-0.000
lf-0.000
 Marie-0.000
 globe-0.000
Token's
Token ID338
Feature activation+8.11e-01

Pos logit contributions
eter+0.010
ucker+0.010
 unnecessary+0.010
please+0.010
ead+0.010
Neg logit contributions
 Marie-0.004
kins-0.004
lf-0.004
nam-0.004
 tripod-0.003
Token look
Token ID804
Feature activation-6.83e-01

Pos logit contributions
nam+0.022
kins+0.022
lf+0.022
 Marie+0.022
 globe+0.021
Neg logit contributions
 unnecessary-0.018
ucker-0.017
eter-0.017
please-0.017
aa-0.017
Token in
Token ID287
Feature activation+2.11e-02

Pos logit contributions
please+0.014
 unnecessary+0.014
eter+0.014
ucker+0.014
aa+0.013
Neg logit contributions
nam-0.020
kins-0.019
lf-0.019
 Marie-0.019
 globe-0.018
Token the
Token ID262
Feature activation+3.03e-01

Pos logit contributions
nam+0.001
kins+0.001
lf+0.001
 Marie+0.000
 globe+0.000
Neg logit contributions
 unnecessary-0.001
please-0.001
eter-0.001
aa-0.001
ucker-0.001
Token wild
Token ID4295
Feature activation+2.53e+00

Pos logit contributions
nam+0.006
kins+0.006
 Marie+0.006
 globe+0.006
lf+0.006
Neg logit contributions
please-0.009
eter-0.009
aa-0.009
ucker-0.009
 unnecessary-0.009
Tokenfl
Token ID2704
Feature activation-8.82e-01

Pos logit contributions
nam+0.062
kins+0.061
lf+0.059
 Marie+0.058
 globe+0.057
Neg logit contributions
 unnecessary-0.064
please-0.063
eter-0.062
aa-0.062
ucker-0.061
Tokenowers
Token ID3618
Feature activation+2.39e-01

Pos logit contributions
 unnecessary+0.021
please+0.021
eter+0.021
ucker+0.020
aa+0.020
Neg logit contributions
nam-0.023
kins-0.022
lf-0.022
 Marie-0.022
atively-0.021
Token."
Token ID526
Feature activation+2.35e-02

Pos logit contributions
nam+0.007
lf+0.007
atively+0.006
 Marie+0.006
kins+0.006
Neg logit contributions
eter-0.005
 unnecessary-0.005
please-0.005
ch-0.005
aa-0.005
Token
Token ID198
Feature activation-1.49e-01

Pos logit contributions
nam+0.001
lf+0.001
kins+0.001
igator+0.001
 globe+0.001
Neg logit contributions
ead-0.000
eter-0.000
ucker-0.000
aa-0.000
please-0.000
Token
Token ID198
Feature activation+3.12e-01

Pos logit contributions
please+0.003
 unnecessary+0.003
aa+0.003
ucker+0.003
eter+0.003
Neg logit contributions
nam-0.004
kins-0.004
lf-0.004
 Marie-0.004
 globe-0.004
Token home
Token ID1363
Feature activation-1.16e+00

Pos logit contributions
nam+0.002
lf+0.002
kins+0.002
 globe+0.002
 Marie+0.002
Neg logit contributions
eter-0.001
 unnecessary-0.001
please-0.001
aa-0.001
ucker-0.001
Token and
Token ID290
Feature activation+1.36e-01

Pos logit contributions
eter+0.025
aa+0.025
please+0.025
 unnecessary+0.024
ucker+0.024
Neg logit contributions
nam-0.031
 tripod-0.029
lf-0.029
 globe-0.029
kins-0.029
Token Sam
Token ID3409
Feature activation+2.70e-01

Pos logit contributions
nam+0.003
kins+0.003
lf+0.003
 globe+0.003
 Marie+0.003
Neg logit contributions
please-0.003
 unnecessary-0.003
eter-0.003
aa-0.003
ucker-0.003
Token could
Token ID714
Feature activation-8.88e-03

Pos logit contributions
nam+0.007
kins+0.007
lf+0.007
 globe+0.006
 Marie+0.006
Neg logit contributions
please-0.006
eter-0.006
 unnecessary-0.006
aa-0.006
ucker-0.006
Token barely
Token ID8523
Feature activation-2.58e-01

Pos logit contributions
please+0.000
 unnecessary+0.000
eter+0.000
aa+0.000
ucker+0.000
Neg logit contributions
nam-0.000
kins-0.000
lf-0.000
 Marie-0.000
 globe-0.000
Token contain
Token ID3994
Feature activation+2.31e+00

Pos logit contributions
 unnecessary+0.006
please+0.006
eter+0.006
aa+0.006
ucker+0.006
Neg logit contributions
nam-0.007
kins-0.006
lf-0.006
 Marie-0.006
 globe-0.006
Token his
Token ID465
Feature activation-1.05e-01

Pos logit contributions
nam+0.072
igator+0.069
ophone+0.069
 globe+0.068
book+0.068
Neg logit contributions
 trouble-0.041
 unnecessary-0.039
 mischief-0.037
 sickness-0.032
eter-0.032
Token impat
Token ID28189
Feature activation+3.09e-01

Pos logit contributions
 unnecessary+0.002
aa+0.002
eter+0.002
 trouble+0.002
please+0.002
Neg logit contributions
nam-0.003
lf-0.003
kins-0.003
 globe-0.003
 Marie-0.003
Tokenience
Token ID1240
Feature activation-4.58e-01

Pos logit contributions
nam+0.008
kins+0.008
lf+0.008
 globe+0.008
 Marie+0.008
Neg logit contributions
 unnecessary-0.007
aa-0.007
please-0.007
ch-0.007
ucker-0.007
Token as
Token ID355
Feature activation+4.71e-03

Pos logit contributions
 unnecessary+0.011
eter+0.011
please+0.011
aa+0.011
ch+0.011
Neg logit contributions
nam-0.012
kins-0.011
lf-0.011
 Marie-0.011
 globe-0.010
Token he
Token ID339
Feature activation-1.19e-01

Pos logit contributions
nam+0.000
kins+0.000
lf+0.000
 tripod+0.000
 globe+0.000
Neg logit contributions
 unnecessary-0.000
please-0.000
aa-0.000
eter-0.000
ch-0.000
Token to
Token ID284
Feature activation+5.78e-01

Pos logit contributions
 unnecessary+0.002
please+0.002
eter+0.002
ucker+0.002
aa+0.002
Neg logit contributions
kins-0.004
nam-0.004
lf-0.004
 Marie-0.004
 globe-0.004
Token be
Token ID307
Feature activation+4.81e-01

Pos logit contributions
nam+0.017
kins+0.016
lf+0.016
 Marie+0.016
 globe+0.016
Neg logit contributions
 unnecessary-0.012
eter-0.012
please-0.012
ucker-0.011
aa-0.011
Token available
Token ID1695
Feature activation+5.69e-01

Pos logit contributions
nam+0.012
kins+0.011
 Marie+0.011
lf+0.010
 globe+0.010
Neg logit contributions
aa-0.012
ucker-0.012
 unnecessary-0.012
ch-0.012
eter-0.012
Token to
Token ID284
Feature activation+2.68e-01

Pos logit contributions
nam+0.015
kins+0.015
lf+0.015
 Marie+0.015
 globe+0.014
Neg logit contributions
 unnecessary-0.013
please-0.013
eter-0.012
aa-0.012
ucker-0.012
Token help
Token ID1037
Feature activation-4.32e-02

Pos logit contributions
nam+0.007
kins+0.007
lf+0.007
 Marie+0.007
pe+0.006
Neg logit contributions
eter-0.007
ucker-0.006
aa-0.006
ounced-0.006
ch-0.006
Token others
Token ID1854
Feature activation+2.17e+00

Pos logit contributions
ounced+0.001
ucker+0.001
eter+0.001
aa+0.001
please+0.001
Neg logit contributions
nam-0.001
lf-0.001
kins-0.001
 Marie-0.001
 Becky-0.001
Token.
Token ID13
Feature activation-6.99e-01

Pos logit contributions
nam+0.056
lf+0.055
 Marie+0.054
atively+0.053
 tripod+0.053
Neg logit contributions
 unnecessary-0.052
please-0.050
eter-0.049
ucker-0.048
aa-0.048
Token From
Token ID3574
Feature activation+2.26e-01

Pos logit contributions
 unnecessary+0.019
please+0.018
eter+0.018
aa+0.018
ucker+0.018
Neg logit contributions
nam-0.016
kins-0.016
lf-0.015
 Marie-0.015
 globe-0.015
Token that
Token ID326
Feature activation+8.42e-01

Pos logit contributions
lf+0.006
nam+0.006
kins+0.006
 Marie+0.006
 globe+0.006
Neg logit contributions
eter-0.005
 unnecessary-0.005
please-0.005
ucker-0.005
ead-0.005
Token day
Token ID1110
Feature activation-1.60e-01

Pos logit contributions
nam+0.032
kins+0.030
atively+0.030
 tripod+0.029
 Marie+0.029
Neg logit contributions
ch-0.012
please-0.011
aa-0.011
 trouble-0.010
 unnecessary-0.010
Token on
Token ID319
Feature activation+2.22e-02

Pos logit contributions
ch+0.003
 unnecessary+0.003
aa+0.003
please+0.003
ead+0.003
Neg logit contributions
nam-0.005
 Marie-0.004
kins-0.004
lf-0.004
atively-0.004
Token he
Token ID339
Feature activation-1.47e-02

Pos logit contributions
 trouble+0.017
 unnecessary+0.017
please+0.017
eter+0.016
ch+0.016
Neg logit contributions
kins-0.023
 tripod-0.023
nam-0.023
book-0.023
lf-0.023
Token remembered
Token ID12086
Feature activation-1.62e-01

Pos logit contributions
please+0.000
 unnecessary+0.000
eter+0.000
aa+0.000
ch+0.000
Neg logit contributions
nam-0.000
kins-0.000
lf-0.000
 Marie-0.000
 globe-0.000
Token the
Token ID262
Feature activation-8.16e-01

Pos logit contributions
 unnecessary+0.004
please+0.004
eter+0.004
aa+0.004
ucker+0.004
Neg logit contributions
nam-0.004
kins-0.004
lf-0.004
 Marie-0.004
 globe-0.004
Token brav
Token ID49025
Feature activation+3.78e-01

Pos logit contributions
ucker+0.019
eter+0.019
 unnecessary+0.018
please+0.018
aa+0.018
Neg logit contributions
 globe-0.022
lf-0.021
nam-0.021
kins-0.021
 Marie-0.021
Tokenest
Token ID395
Feature activation+4.63e-01

Pos logit contributions
 globe+0.008
igator+0.008
kins+0.008
lf+0.008
nam+0.008
Neg logit contributions
ucker-0.010
 unnecessary-0.010
please-0.010
ead-0.009
ounced-0.009
Token thing
Token ID1517
Feature activation+2.22e+00

Pos logit contributions
lf+0.014
 globe+0.014
kins+0.014
atively+0.013
 Marie+0.013
Neg logit contributions
 unnecessary-0.009
ucker-0.009
please-0.009
eter-0.009
aa-0.008
Token to
Token ID284
Feature activation-6.08e-01

Pos logit contributions
nam+0.064
kins+0.060
 tripod+0.060
lf+0.059
 Marie+0.057
Neg logit contributions
ch-0.044
aa-0.042
please-0.042
 trouble-0.042
eter-0.042
Token do
Token ID466
Feature activation+2.72e-01

Pos logit contributions
ucker+0.014
 unnecessary+0.014
eter+0.014
aa+0.013
ounced+0.013
Neg logit contributions
lf-0.017
pe-0.016
kins-0.016
 Marie-0.016
 globe-0.015
Token was
Token ID373
Feature activation+3.62e-01

Pos logit contributions
nam+0.009
 tripod+0.008
 Marie+0.008
igator+0.007
 telescope+0.007
Neg logit contributions
 trouble-0.005
 unnecessary-0.005
ch-0.005
please-0.005
eter-0.005
Token to
Token ID284
Feature activation-3.63e-01

Pos logit contributions
nam+0.010
kins+0.010
lf+0.009
 Marie+0.009
 globe+0.009
Neg logit contributions
 unnecessary-0.008
eter-0.008
ucker-0.008
please-0.008
aa-0.008
Token stay
Token ID2652
Feature activation+1.32e-01

Pos logit contributions
ucker+0.009
eter+0.009
 unnecessary+0.008
aa+0.008
ounced+0.008
Neg logit contributions
lf-0.009
pe-0.009
nam-0.009
igator-0.009
 Marie-0.009
Token."
Token ID526
Feature activation-5.70e-02

Pos logit contributions
 unnecessary+0.006
please+0.006
eter+0.006
aa+0.006
ucker+0.006
Neg logit contributions
nam-0.006
kins-0.006
lf-0.006
 Marie-0.006
 globe-0.005
Token
Token ID198
Feature activation-2.82e-01

Pos logit contributions
please+0.001
eter+0.001
ucker+0.001
 unnecessary+0.001
aa+0.001
Neg logit contributions
nam-0.002
kins-0.002
lf-0.002
 globe-0.002
atively-0.002
Token
Token ID198
Feature activation+4.54e-01

Pos logit contributions
 unnecessary+0.008
eter+0.008
please+0.007
aa+0.007
ucker+0.007
Neg logit contributions
lf-0.006
nam-0.006
 Marie-0.006
kins-0.006
book-0.006
TokenTom
Token ID13787
Feature activation+1.17e-01

Pos logit contributions
kins+0.015
nam+0.015
 globe+0.015
taker+0.014
ophone+0.014
Neg logit contributions
please-0.008
eter-0.007
ch-0.007
aa-0.007
ead-0.007
Token followed
Token ID3940
Feature activation+6.23e-01

Pos logit contributions
nam+0.004
kins+0.003
 globe+0.003
lf+0.003
 tripod+0.003
Neg logit contributions
please-0.003
eter-0.002
aa-0.002
ucker-0.002
ch-0.002
Token Anna
Token ID11735
Feature activation+2.99e-01

Pos logit contributions
nam+0.013
kins+0.013
lf+0.012
 Marie+0.012
 globe+0.012
Neg logit contributions
 unnecessary-0.018
please-0.018
eter-0.017
aa-0.017
ucker-0.017
Token to
Token ID284
Feature activation+1.33e-01

Pos logit contributions
nam+0.011
kins+0.011
lf+0.010
 globe+0.010
atively+0.010
Neg logit contributions
ead-0.004
eter-0.004
 unnecessary-0.004
please-0.004
aa-0.004
Token the
Token ID262
Feature activation+1.12e+00

Pos logit contributions
nam+0.003
kins+0.003
lf+0.003
 Marie+0.003
 globe+0.003
Neg logit contributions
 unnecessary-0.003
please-0.003
eter-0.003
aa-0.003
ucker-0.003
Token kitchen
Token ID9592
Feature activation-7.41e-01

Pos logit contributions
nam+0.020
lf+0.018
kins+0.018
 Marie+0.018
 globe+0.017
Neg logit contributions
 unnecessary-0.037
eter-0.037
ucker-0.037
aa-0.036
please-0.036
Token.
Token ID13
Feature activation+1.29e-01

Pos logit contributions
eter+0.017
ead+0.016
please+0.016
ucker+0.016
aa+0.016
Neg logit contributions
nam-0.020
 Marie-0.019
igator-0.019
lf-0.018
kins-0.018
Token He
Token ID679
Feature activation+1.05e+00

Pos logit contributions
nam+0.003
kins+0.003
lf+0.003
 Marie+0.003
 globe+0.003
Neg logit contributions
 unnecessary-0.003
please-0.003
eter-0.003
aa-0.003
ucker-0.003
Token of
Token ID286
Feature activation-7.38e-01

Pos logit contributions
nam+0.026
kins+0.024
lf+0.024
atively+0.023
 Marie+0.023
Neg logit contributions
ch-0.023
eter-0.023
 unnecessary-0.023
aa-0.023
please-0.022
Token Years
Token ID13212
Feature activation+2.40e-01

Pos logit contributions
please+0.019
eter+0.019
ucker+0.019
 unnecessary+0.019
aa+0.019
Neg logit contributions
nam-0.017
kins-0.017
lf-0.016
 Marie-0.016
 globe-0.016
Token.
Token ID13
Feature activation-3.56e-02

Pos logit contributions
nam+0.007
kins+0.006
lf+0.006
 Marie+0.006
 globe+0.006
Neg logit contributions
 unnecessary-0.005
aa-0.005
please-0.005
ch-0.005
eter-0.005
Token I
Token ID314
Feature activation+1.49e-01

Pos logit contributions
 unnecessary+0.001
eter+0.001
please+0.001
aa+0.001
ch+0.001
Neg logit contributions
nam-0.001
kins-0.001
lf-0.001
 Marie-0.001
 globe-0.001
Token can
Token ID460
Feature activation+4.98e-01

Pos logit contributions
kins+0.004
nam+0.004
 Marie+0.004
lf+0.004
atively+0.004
Neg logit contributions
 unnecessary-0.003
please-0.003
eter-0.003
ucker-0.003
aa-0.003
Token give
Token ID1577
Feature activation+1.90e-01

Pos logit contributions
nam+0.014
lf+0.014
kins+0.014
 Marie+0.014
 globe+0.013
Neg logit contributions
 unnecessary-0.011
eter-0.010
please-0.010
ucker-0.010
aa-0.010
Token you
Token ID345
Feature activation+1.67e-01

Pos logit contributions
nam+0.005
 Marie+0.005
kins+0.004
lf+0.004
 globe+0.004
Neg logit contributions
ucker-0.005
please-0.005
eter-0.005
 unnecessary-0.005
 fever-0.004
Token the
Token ID262
Feature activation-1.22e+00

Pos logit contributions
nam+0.004
kins+0.004
lf+0.004
 Marie+0.004
 globe+0.004
Neg logit contributions
 unnecessary-0.004
eter-0.004
please-0.004
aa-0.004
ch-0.004
Token wisdom
Token ID11501
Feature activation+1.25e+00

Pos logit contributions
ucker+0.033
please+0.032
eter+0.032
 unnecessary+0.032
aa+0.032
Neg logit contributions
nam-0.028
 globe-0.027
lf-0.026
kins-0.026
 Marie-0.026
Token of
Token ID286
Feature activation-6.50e-02

Pos logit contributions
nam+0.041
lf+0.040
kins+0.040
 tripod+0.038
atively+0.038
Neg logit contributions
eter-0.021
 unnecessary-0.021
ch-0.021
aa-0.021
please-0.020
Token all
Token ID477
Feature activation+7.09e-01

Pos logit contributions
 unnecessary+0.001
please+0.001
eter+0.001
aa+0.001
ch+0.001
Neg logit contributions
nam-0.002
kins-0.002
lf-0.002
 Marie-0.002
atively-0.002
Token,
Token ID11
Feature activation+8.84e-01

Pos logit contributions
nam+0.041
 tripod+0.039
 keyboard+0.037
atively+0.037
ophone+0.037
Neg logit contributions
ch-0.013
 trouble-0.013
aa-0.013
ucker-0.012
 fle-0.012
Token there
Token ID612
Feature activation-9.27e-01

Pos logit contributions
nam+0.023
lf+0.022
 tripod+0.022
kins+0.022
 Marie+0.021
Neg logit contributions
ch-0.020
 unnecessary-0.020
ucker-0.020
aa-0.019
please-0.019
Token was
Token ID373
Feature activation+6.06e-01

Pos logit contributions
 unnecessary+0.026
please+0.025
eter+0.024
aa+0.024
ucker+0.024
Neg logit contributions
kins-0.021
lf-0.021
nam-0.021
 Marie-0.020
 globe-0.020
Token a
Token ID257
Feature activation-3.61e-01

Pos logit contributions
kins+0.016
nam+0.016
lf+0.016
atively+0.015
 globe+0.015
Neg logit contributions
ch-0.014
please-0.014
 unnecessary-0.014
 trouble-0.013
 fle-0.013
Token dog
Token ID3290
Feature activation-7.66e-01

Pos logit contributions
 unnecessary+0.008
eter+0.008
ucker+0.008
please+0.008
aa+0.008
Neg logit contributions
nam-0.010
lf-0.010
kins-0.010
 globe-0.009
 Marie-0.009
Token named
Token ID3706
Feature activation+8.54e-01

Pos logit contributions
ch+0.016
please+0.015
aa+0.015
ucker+0.015
 fle+0.014
Neg logit contributions
nam-0.022
kins-0.022
 Marie-0.021
atively-0.021
book-0.020
Token Max
Token ID5436
Feature activation-4.75e-01

Pos logit contributions
nam+0.020
 Marie+0.020
 tripod+0.019
lf+0.019
 Anne+0.019
Neg logit contributions
please-0.022
eter-0.022
ucker-0.022
 unnecessary-0.022
ounced-0.021
Token.
Token ID13
Feature activation+1.91e-01

Pos logit contributions
please+0.010
ch+0.009
aa+0.009
 fle+0.009
ucker+0.009
Neg logit contributions
nam-0.014
kins-0.014
 Marie-0.014
atively-0.013
lf-0.013
Token Max
Token ID5436
Feature activation-9.94e-02

Pos logit contributions
nam+0.005
kins+0.004
lf+0.004
 Marie+0.004
 globe+0.004
Neg logit contributions
 unnecessary-0.005
please-0.005
eter-0.005
aa-0.005
ucker-0.005
Token was
Token ID373
Feature activation-1.00e+00

Pos logit contributions
please+0.002
ch+0.002
aa+0.002
 fle+0.002
ucker+0.002
Neg logit contributions
nam-0.003
kins-0.003
atively-0.003
 Marie-0.003
lf-0.003
Token a
Token ID257
Feature activation-2.98e-01

Pos logit contributions
eter+0.032
 unnecessary+0.032
ucker+0.031
please+0.030
aa+0.030
Neg logit contributions
nam-0.019
 Marie-0.017
lf-0.016
kins-0.016
igator-0.016
Token the
Token ID262
Feature activation-1.26e+00

Pos logit contributions
 unnecessary+0.005
please+0.004
eter+0.004
ch+0.004
aa+0.004
Neg logit contributions
nam-0.005
kins-0.005
lf-0.005
 Marie-0.005
atively-0.005
Token stone
Token ID7815
Feature activation+1.01e+00

Pos logit contributions
 unnecessary+0.031
please+0.030
eter+0.030
ch+0.029
aa+0.029
Neg logit contributions
nam-0.031
kins-0.030
lf-0.030
 Marie-0.029
atively-0.028
Token in
Token ID287
Feature activation+1.18e+00

Pos logit contributions
lf+0.030
 tripod+0.030
nam+0.029
 Marie+0.029
kins+0.028
Neg logit contributions
eter-0.018
ch-0.018
aa-0.017
ead-0.016
 unnecessary-0.016
Token her
Token ID607
Feature activation+8.52e-01

Pos logit contributions
kins+0.028
nam+0.028
ophone+0.027
lf+0.026
igator+0.026
Neg logit contributions
 trouble-0.030
eter-0.029
 unnecessary-0.028
please-0.028
ch-0.028
Token pocket
Token ID10000
Feature activation-5.19e-01

Pos logit contributions
nam+0.023
lf+0.022
kins+0.022
 Marie+0.022
igator+0.021
Neg logit contributions
 unnecessary-0.018
eter-0.018
 tum-0.018
 fever-0.018
 fle-0.018
Token and
Token ID290
Feature activation+2.91e-01

Pos logit contributions
eter+0.009
ucker+0.009
aa+0.009
 tum+0.009
ead+0.008
Neg logit contributions
 Marie-0.016
nam-0.016
lf-0.015
 tripod-0.015
 globe-0.015
Token went
Token ID1816
Feature activation+4.17e-01

Pos logit contributions
kins+0.008
nam+0.007
lf+0.007
 globe+0.007
igator+0.007
Neg logit contributions
please-0.006
eter-0.006
 unnecessary-0.006
 trouble-0.006
 fever-0.006
Token to
Token ID284
Feature activation+1.10e-01

Pos logit contributions
nam+0.013
kins+0.013
lf+0.013
 Marie+0.013
 globe+0.012
Neg logit contributions
 unnecessary-0.008
please-0.007
eter-0.007
aa-0.007
ucker-0.007
Token play
Token ID711
Feature activation+1.39e-01

Pos logit contributions
nam+0.003
kins+0.003
lf+0.003
 Marie+0.003
 globe+0.003
Neg logit contributions
 unnecessary-0.003
please-0.002
eter-0.002
aa-0.002
ch-0.002
Token with
Token ID351
Feature activation+7.09e-01

Pos logit contributions
nam+0.004
kins+0.004
lf+0.003
 globe+0.003
 Marie+0.003
Neg logit contributions
please-0.003
 unnecessary-0.003
ch-0.003
eter-0.003
ead-0.003
Token her
Token ID607
Feature activation-4.61e-01

Pos logit contributions
nam+0.019
kins+0.018
lf+0.018
 Marie+0.017
 globe+0.017
Neg logit contributions
 unnecessary-0.017
please-0.017
eter-0.017
aa-0.016
ucker-0.016
Token't
Token ID470
Feature activation+4.23e-01

Pos logit contributions
nam+0.000
kins+0.000
lf+0.000
 globe+0.000
 Marie+0.000
Neg logit contributions
ch-0.000
 unnecessary-0.000
please-0.000
aa-0.000
eter-0.000
Token wait
Token ID4043
Feature activation-5.02e-01

Pos logit contributions
nam+0.013
kins+0.012
lf+0.012
 globe+0.012
 Marie+0.012
Neg logit contributions
please-0.009
 unnecessary-0.008
ch-0.008
aa-0.008
eter-0.008
Token to
Token ID284
Feature activation-3.46e-01

Pos logit contributions
 unnecessary+0.007
please+0.007
eter+0.007
aa+0.007
ch+0.007
Neg logit contributions
nam-0.018
kins-0.017
lf-0.017
 Marie-0.017
 globe-0.017
Token get
Token ID651
Feature activation-5.98e-01

Pos logit contributions
 unnecessary+0.007
eter+0.007
please+0.007
ucker+0.007
aa+0.007
Neg logit contributions
nam-0.010
kins-0.010
lf-0.010
 Marie-0.010
 globe-0.009
Token there
Token ID612
Feature activation-9.06e-01

Pos logit contributions
eter+0.016
ucker+0.016
 unnecessary+0.016
please+0.016
aa+0.016
Neg logit contributions
nam-0.014
kins-0.013
 Marie-0.013
lf-0.013
book-0.012
Token.
Token ID13
Feature activation+5.01e-01

Pos logit contributions
 unnecessary+0.021
eter+0.021
please+0.021
aa+0.020
ch+0.020
Neg logit contributions
nam-0.024
kins-0.023
lf-0.022
 Marie-0.022
 tripod-0.022
Token When
Token ID1649
Feature activation-2.78e-01

Pos logit contributions
kins+0.016
nam+0.016
lf+0.015
 globe+0.015
igator+0.015
Neg logit contributions
eter-0.010
ead-0.009
ch-0.009
please-0.009
 unnecessary-0.009
Token she
Token ID673
Feature activation+5.34e-01

Pos logit contributions
 unnecessary+0.008
eter+0.007
please+0.007
ch+0.007
 trouble+0.007
Neg logit contributions
nam-0.006
kins-0.006
 tripod-0.006
 globe-0.006
igator-0.005
Token arrived
Token ID5284
Feature activation+7.62e-01

Pos logit contributions
nam+0.015
kins+0.015
lf+0.014
 Marie+0.014
 globe+0.014
Neg logit contributions
please-0.012
 unnecessary-0.012
eter-0.011
aa-0.011
ch-0.011
Token,
Token ID11
Feature activation+5.67e-01

Pos logit contributions
nam+0.022
kins+0.021
lf+0.021
 tripod+0.021
 globe+0.021
Neg logit contributions
 unnecessary-0.016
eter-0.015
aa-0.015
please-0.015
ucker-0.015
Token the
Token ID262
Feature activation+4.29e-03

Pos logit contributions
 tripod+0.015
pe+0.015
 globe+0.014
kins+0.014
igator+0.014
Neg logit contributions
 trouble-0.013
 unnecessary-0.012
eter-0.012
ch-0.011
please-0.011
Token find
Token ID1064
Feature activation+1.51e-01

Pos logit contributions
please+0.004
aa+0.004
eter+0.004
ucker+0.004
ch+0.004
Neg logit contributions
nam-0.005
kins-0.005
lf-0.004
 Marie-0.004
atively-0.004
Token treasure
Token ID14068
Feature activation+5.20e-01

Pos logit contributions
nam+0.004
kins+0.004
lf+0.004
 Marie+0.004
 globe+0.004
Neg logit contributions
 unnecessary-0.004
please-0.003
eter-0.003
ch-0.003
aa-0.003
Token or
Token ID393
Feature activation-1.36e-01

Pos logit contributions
nam+0.015
kins+0.014
lf+0.014
 Marie+0.014
 tripod+0.013
Neg logit contributions
 unnecessary-0.011
please-0.011
eter-0.011
aa-0.011
ch-0.011
Token animals
Token ID4695
Feature activation+6.39e-01

Pos logit contributions
 unnecessary+0.003
please+0.003
eter+0.003
aa+0.003
ucker+0.003
Neg logit contributions
nam-0.004
kins-0.003
lf-0.003
 Marie-0.003
 globe-0.003
Token."
Token ID526
Feature activation-2.26e-01

Pos logit contributions
nam+0.016
kins+0.015
lf+0.015
 Marie+0.015
 globe+0.014
Neg logit contributions
 unnecessary-0.016
please-0.016
eter-0.016
aa-0.015
ucker-0.015
Token
Token ID198
Feature activation-5.61e-02

Pos logit contributions
please+0.004
aa+0.004
eter+0.004
ucker+0.004
ch+0.004
Neg logit contributions
nam-0.007
kins-0.007
lf-0.006
 globe-0.006
atively-0.006
Token
Token ID198
Feature activation+3.28e-01

Pos logit contributions
please+0.001
aa+0.001
ch+0.001
 unnecessary+0.001
ucker+0.001
Neg logit contributions
nam-0.002
kins-0.002
lf-0.002
 globe-0.002
atively-0.002
TokenMax
Token ID11518
Feature activation+1.53e-01

Pos logit contributions
nam+0.010
kins+0.010
 globe+0.009
atively+0.009
 telescope+0.009
Neg logit contributions
please-0.007
ch-0.006
aa-0.006
 unnecessary-0.006
eter-0.006
Token nodded
Token ID14464
Feature activation+7.89e-01

Pos logit contributions
nam+0.004
kins+0.004
lf+0.004
 globe+0.004
 tripod+0.004
Neg logit contributions
eter-0.003
please-0.003
aa-0.003
ucker-0.003
ch-0.003
Token.
Token ID13
Feature activation+1.06e+00

Pos logit contributions
nam+0.024
 tripod+0.023
lf+0.022
 globe+0.022
kins+0.022
Neg logit contributions
please-0.016
eter-0.015
aa-0.015
 fever-0.014
ch-0.014
Token He
Token ID679
Feature activation+1.11e+00

Pos logit contributions
kins+0.030
nam+0.030
lf+0.029
 globe+0.028
 Marie+0.028
Neg logit contributions
eter-0.022
 unnecessary-0.022
please-0.022
ch-0.022
aa-0.022
Token
Token ID198
Feature activation-3.80e-01

Pos logit contributions
nam+0.006
 Marie+0.006
kins+0.006
 tripod+0.006
 Laura+0.006
Neg logit contributions
 unnecessary-0.009
please-0.009
eter-0.008
aa-0.008
ch-0.008
Token
Token ID198
Feature activation+4.43e-01

Pos logit contributions
 unnecessary+0.010
eter+0.009
please+0.009
aa+0.009
ucker+0.009
Neg logit contributions
nam-0.009
kins-0.009
lf-0.009
 Marie-0.009
 globe-0.009
TokenWhen
Token ID2215
Feature activation-4.85e-01

Pos logit contributions
nam+0.011
kins+0.011
lf+0.011
 globe+0.011
 Marie+0.011
Neg logit contributions
ch-0.011
aa-0.011
please-0.011
 unnecessary-0.010
ucker-0.010
Token they
Token ID484
Feature activation+8.26e-02

Pos logit contributions
 unnecessary+0.018
eter+0.018
please+0.018
ucker+0.017
aa+0.017
Neg logit contributions
nam-0.007
kins-0.007
 Marie-0.006
lf-0.006
 tripod-0.006
Token got
Token ID1392
Feature activation-3.22e-01

Pos logit contributions
nam+0.003
kins+0.002
lf+0.002
 Marie+0.002
 globe+0.002
Neg logit contributions
please-0.002
aa-0.002
ch-0.002
eter-0.001
 unnecessary-0.001
Token to
Token ID284
Feature activation-7.75e-01

Pos logit contributions
 unnecessary+0.007
eter+0.007
please+0.007
ucker+0.007
ch+0.006
Neg logit contributions
nam-0.010
kins-0.009
 Marie-0.009
 globe-0.008
atively-0.008
Token the
Token ID262
Feature activation-4.26e-01

Pos logit contributions
 unnecessary+0.021
please+0.021
ucker+0.020
eter+0.020
aa+0.020
Neg logit contributions
nam-0.018
kins-0.017
lf-0.017
 Marie-0.017
 globe-0.016
Token zoo
Token ID26626
Feature activation-9.04e-02

Pos logit contributions
ucker+0.013
eter+0.012
 unnecessary+0.012
please+0.012
aa+0.012
Neg logit contributions
 globe-0.010
lf-0.010
 tripod-0.009
atively-0.009
nam-0.009
Token,
Token ID11
Feature activation-1.30e-01

Pos logit contributions
 unnecessary+0.002
please+0.002
eter+0.002
aa+0.002
ucker+0.002
Neg logit contributions
nam-0.002
kins-0.002
lf-0.002
 Marie-0.002
 globe-0.002
Token they
Token ID484
Feature activation-1.93e-01

Pos logit contributions
 unnecessary+0.004
please+0.004
eter+0.004
aa+0.004
ucker+0.004
Neg logit contributions
nam-0.002
kins-0.002
lf-0.002
 Marie-0.002
 globe-0.002
Token saw
Token ID2497
Feature activation-3.28e-01

Pos logit contributions
please+0.003
aa+0.003
 unnecessary+0.003
eter+0.003
ch+0.003
Neg logit contributions
nam-0.007
kins-0.007
lf-0.007
 Marie-0.006
 globe-0.006
Token and
Token ID290
Feature activation+8.48e-01

Pos logit contributions
eter+0.006
 unnecessary+0.006
ch+0.005
aa+0.005
please+0.005
Neg logit contributions
nam-0.007
lf-0.007
 tripod-0.007
kins-0.007
 globe-0.006
Token stretch
Token ID7539
Feature activation-4.78e-01

Pos logit contributions
nam+0.022
kins+0.022
lf+0.021
 Marie+0.021
 globe+0.020
Neg logit contributions
 unnecessary-0.020
please-0.020
eter-0.019
aa-0.019
ucker-0.019
Token and
Token ID290
Feature activation+1.04e+00

Pos logit contributions
eter+0.011
 unnecessary+0.011
ch+0.011
aa+0.011
please+0.011
Neg logit contributions
nam-0.013
lf-0.013
kins-0.012
 tripod-0.012
 globe-0.012
Token stretch
Token ID7539
Feature activation-4.88e-01

Pos logit contributions
nam+0.027
kins+0.027
lf+0.026
 Marie+0.026
 globe+0.025
Neg logit contributions
 unnecessary-0.024
please-0.024
eter-0.024
aa-0.023
ch-0.023
Token.
Token ID13
Feature activation+1.94e-01

Pos logit contributions
eter+0.011
 unnecessary+0.011
ch+0.011
aa+0.011
please+0.011
Neg logit contributions
nam-0.013
lf-0.013
kins-0.012
 tripod-0.012
 globe-0.012
Token The
Token ID383
Feature activation-1.70e-01

Pos logit contributions
nam+0.005
kins+0.005
lf+0.005
 Marie+0.005
 globe+0.005
Neg logit contributions
 unnecessary-0.005
please-0.005
eter-0.004
aa-0.004
ucker-0.004
Token little
Token ID1310
Feature activation+1.09e+00

Pos logit contributions
please+0.004
ucker+0.004
 unnecessary+0.004
eter+0.004
aa+0.004
Neg logit contributions
nam-0.004
kins-0.004
lf-0.004
 Marie-0.004
 globe-0.004
Token girl
Token ID2576
Feature activation+4.07e-01

Pos logit contributions
nam+0.028
kins+0.027
 Marie+0.026
lf+0.026
 globe+0.025
Neg logit contributions
please-0.027
 unnecessary-0.027
ucker-0.026
aa-0.026
eter-0.026
Token couldn
Token ID3521
Feature activation-1.97e-01

Pos logit contributions
nam+0.010
kins+0.009
lf+0.009
 Marie+0.009
 globe+0.009
Neg logit contributions
 unnecessary-0.011
please-0.010
eter-0.010
aa-0.010
ucker-0.010
Token't
Token ID470
Feature activation+1.61e-01

Pos logit contributions
 unnecessary+0.004
ch+0.004
please+0.004
aa+0.004
eter+0.004
Neg logit contributions
nam-0.006
kins-0.006
lf-0.005
 Marie-0.005
 globe-0.005
Token believe
Token ID1975
Feature activation+1.60e+00

Pos logit contributions
nam+0.005
kins+0.004
lf+0.004
 Marie+0.004
 globe+0.004
Neg logit contributions
please-0.003
 unnecessary-0.003
eter-0.003
aa-0.003
ch-0.003
Token.
Token ID13
Feature activation+1.72e+00

Pos logit contributions
nam+0.015
lf+0.015
 tripod+0.015
igator+0.014
 globe+0.014
Neg logit contributions
aa-0.009
ead-0.008
eter-0.008
ucker-0.008
ch-0.008
Token
Token ID198
Feature activation+1.15e-01

Pos logit contributions
nam+0.045
kins+0.043
lf+0.042
 Marie+0.041
 globe+0.040
Neg logit contributions
please-0.041
aa-0.041
ch-0.040
ucker-0.040
eter-0.040
Token
Token ID198
Feature activation+4.07e-01

Pos logit contributions
nam+0.003
kins+0.003
lf+0.003
 Marie+0.003
atively+0.003
Neg logit contributions
please-0.002
aa-0.002
 unnecessary-0.002
ch-0.002
ucker-0.002
Token"
Token ID1
Feature activation+1.69e-02

Pos logit contributions
nam+0.012
 Marie+0.011
kins+0.011
ophone+0.011
atively+0.011
Neg logit contributions
ch-0.009
please-0.008
aa-0.008
eter-0.008
ead-0.008
TokenGood
Token ID10248
Feature activation+4.40e-01

Pos logit contributions
nam+0.001
kins+0.001
lf+0.001
 Marie+0.001
 tripod+0.001
Neg logit contributions
please-0.000
ch-0.000
eter-0.000
aa-0.000
 unnecessary-0.000
Token night
Token ID1755
Feature activation-7.20e-01

Pos logit contributions
kins+0.011
 globe+0.010
 Marie+0.010
lf+0.010
atively+0.010
Neg logit contributions
 unnecessary-0.011
ucker-0.011
please-0.011
ounced-0.010
eter-0.010
Token,
Token ID11
Feature activation+2.32e+00

Pos logit contributions
 unnecessary+0.023
please+0.022
eter+0.022
ucker+0.021
ch+0.021
Neg logit contributions
kins-0.014
lf-0.013
 Marie-0.013
 Laura-0.012
 globe-0.012
Token Sam
Token ID3409
Feature activation+1.93e-02

Pos logit contributions
nam+0.071
kins+0.069
lf+0.067
 Marie+0.066
atively+0.066
Neg logit contributions
eter-0.044
 unnecessary-0.044
ucker-0.043
ch-0.043
aa-0.042
Token,"
Token ID553
Feature activation-7.09e-01

Pos logit contributions
kins+0.000
lf+0.000
nam+0.000
 Marie+0.000
 globe+0.000
Neg logit contributions
 unnecessary-0.001
eter-0.001
please-0.001
ucker-0.000
aa-0.000
Token Tom
Token ID4186
Feature activation-5.75e-02

Pos logit contributions
 unnecessary+0.016
please+0.016
eter+0.015
aa+0.015
ucker+0.015
Neg logit contributions
nam-0.020
kins-0.019
lf-0.019
 Marie-0.018
 globe-0.018
Token says
Token ID1139
Feature activation+4.62e-01

Pos logit contributions
ch+0.001
 trouble+0.001
aa+0.001
ucker+0.001
 fle+0.001
Neg logit contributions
nam-0.002
kins-0.002
lf-0.002
atively-0.002
 globe-0.002
Token loved
Token ID6151
Feature activation-1.94e-01

Pos logit contributions
nam+0.034
kins+0.031
atively+0.031
 tripod+0.031
lf+0.031
Neg logit contributions
please-0.026
aa-0.025
ucker-0.024
ch-0.024
eter-0.024
Token the
Token ID262
Feature activation-8.17e-01

Pos logit contributions
 unnecessary+0.005
eter+0.005
aa+0.005
ucker+0.005
please+0.005
Neg logit contributions
nam-0.004
 Marie-0.004
kins-0.004
lf-0.004
 Laura-0.004
Token soup
Token ID17141
Feature activation-5.84e-01

Pos logit contributions
 unnecessary+0.021
eter+0.020
aa+0.020
please+0.020
ucker+0.019
Neg logit contributions
kins-0.021
 Marie-0.020
lf-0.020
nam-0.020
 globe-0.019
Token and
Token ID290
Feature activation-1.39e+00

Pos logit contributions
eter+0.014
please+0.013
ucker+0.013
 unnecessary+0.013
aa+0.013
Neg logit contributions
nam-0.016
lf-0.015
kins-0.014
 tripod-0.014
 Marie-0.014
Token said
Token ID531
Feature activation-7.75e-01

Pos logit contributions
 unnecessary+0.041
aa+0.039
eter+0.039
please+0.039
ucker+0.039
Neg logit contributions
nam-0.030
kins-0.029
 Marie-0.029
lf-0.028
 globe-0.026
Token,
Token ID11
Feature activation-2.71e-01

Pos logit contributions
 unnecessary+0.018
please+0.018
eter+0.018
aa+0.017
ucker+0.017
Neg logit contributions
nam-0.020
kins-0.020
lf-0.020
 Marie-0.019
 globe-0.019
Token "
Token ID366
Feature activation-1.35e-01

Pos logit contributions
please+0.007
ch+0.006
aa+0.006
ead+0.006
ucker+0.006
Neg logit contributions
nam-0.007
kins-0.007
lf-0.007
igator-0.006
 Marie-0.006
TokenThank
Token ID10449
Feature activation+5.24e-01

Pos logit contributions
 unnecessary+0.003
ucker+0.003
aa+0.003
eter+0.003
 fever+0.003
Neg logit contributions
nam-0.003
kins-0.003
 Marie-0.003
lf-0.003
 globe-0.003
Token you
Token ID345
Feature activation-7.36e-02

Pos logit contributions
nam+0.016
lf+0.015
kins+0.015
atively+0.015
 tripod+0.015
Neg logit contributions
 unnecessary-0.010
ch-0.010
please-0.010
aa-0.010
eter-0.010
Token,
Token ID11
Feature activation+1.16e+00

Pos logit contributions
 unnecessary+0.001
eter+0.001
please+0.001
aa+0.001
ch+0.001
Neg logit contributions
nam-0.002
kins-0.002
lf-0.002
 Marie-0.002
 globe-0.002
Token Mia
Token ID32189
Feature activation-1.15e+00

Pos logit contributions
nam+0.026
kins+0.025
lf+0.025
 Marie+0.024
 globe+0.024
Neg logit contributions
 unnecessary-0.031
eter-0.031
please-0.031
aa-0.030
ucker-0.030
Token good
Token ID922
Feature activation-8.84e-01

Pos logit contributions
nam+0.008
kins+0.008
lf+0.008
 Marie+0.007
 globe+0.007
Neg logit contributions
 unnecessary-0.005
please-0.004
aa-0.004
eter-0.004
ucker-0.004
Token.
Token ID13
Feature activation+1.73e-01

Pos logit contributions
eter+0.022
 unnecessary+0.022
please+0.022
ucker+0.022
ch+0.022
Neg logit contributions
nam-0.021
kins-0.021
lf-0.020
 Marie-0.020
book-0.019
Token She
Token ID1375
Feature activation-1.75e-02

Pos logit contributions
kins+0.005
nam+0.004
lf+0.004
 globe+0.004
 Marie+0.004
Neg logit contributions
eter-0.004
please-0.004
 unnecessary-0.004
aa-0.004
ead-0.004
Token had
Token ID550
Feature activation+2.14e-01

Pos logit contributions
 unnecessary+0.000
eter+0.000
ucker+0.000
please+0.000
ch+0.000
Neg logit contributions
nam-0.000
lf-0.000
kins-0.000
 Marie-0.000
 tripod-0.000
Token a
Token ID257
Feature activation-2.59e-01

Pos logit contributions
nam+0.005
 Marie+0.004
lf+0.004
kins+0.004
 tripod+0.004
Neg logit contributions
ucker-0.006
 unnecessary-0.006
eter-0.006
aa-0.006
please-0.006
Token sore
Token ID19597
Feature activation-2.50e+00

Pos logit contributions
eter+0.006
 unnecessary+0.006
please+0.006
ucker+0.006
ead+0.006
Neg logit contributions
nam-0.007
 telescope-0.006
 Marie-0.006
lf-0.006
 globe-0.006
Token throat
Token ID13589
Feature activation-2.53e+00

Pos logit contributions
 unnecessary+0.070
eter+0.068
please+0.068
ucker+0.065
ead+0.065
Neg logit contributions
kins-0.055
kin-0.054
lf-0.054
nam-0.053
 globe-0.051
Token and
Token ID290
Feature activation+2.04e-02

Pos logit contributions
 unnecessary+0.081
eter+0.077
please+0.075
ucker+0.074
ead+0.072
Neg logit contributions
lf-0.045
kins-0.045
book-0.044
nam-0.042
atively-0.041
Token a
Token ID257
Feature activation-3.88e-01

Pos logit contributions
nam+0.000
kins+0.000
lf+0.000
 Marie+0.000
atively+0.000
Neg logit contributions
 unnecessary-0.001
eter-0.001
please-0.001
ucker-0.001
aa-0.001
Token fever
Token ID17372
Feature activation-2.10e+00

Pos logit contributions
eter+0.010
 unnecessary+0.010
ead+0.009
please+0.009
ucker+0.009
Neg logit contributions
nam-0.010
kins-0.009
lf-0.009
 telescope-0.009
 Marie-0.009
Token.
Token ID13
Feature activation+3.99e-01

Pos logit contributions
 unnecessary+0.064
eter+0.061
please+0.060
ucker+0.060
ead+0.058
Neg logit contributions
lf-0.042
kins-0.042
nam-0.040
book-0.040
atively-0.039
Token,
Token ID11
Feature activation+8.81e-01

Pos logit contributions
nam+0.045
 tripod+0.043
 keyboard+0.040
kins+0.040
atively+0.040
Neg logit contributions
ch-0.015
aa-0.015
 trouble-0.014
ucker-0.014
 tum-0.013
Token there
Token ID612
Feature activation-9.78e-01

Pos logit contributions
nam+0.023
lf+0.022
kins+0.022
 tripod+0.022
 Marie+0.021
Neg logit contributions
ch-0.021
 unnecessary-0.020
ucker-0.020
aa-0.020
please-0.019
Token was
Token ID373
Feature activation+4.78e-01

Pos logit contributions
 unnecessary+0.028
please+0.027
eter+0.027
aa+0.026
ucker+0.026
Neg logit contributions
kins-0.021
lf-0.021
nam-0.021
 globe-0.020
 Marie-0.020
Token a
Token ID257
Feature activation-5.15e-01

Pos logit contributions
kins+0.012
nam+0.012
lf+0.012
atively+0.012
 globe+0.011
Neg logit contributions
please-0.011
 unnecessary-0.011
ch-0.011
 trouble-0.011
eter-0.011
Token big
Token ID1263
Feature activation-1.54e-01

Pos logit contributions
 unnecessary+0.013
eter+0.013
ucker+0.012
please+0.012
ead+0.012
Neg logit contributions
nam-0.013
lf-0.013
kins-0.013
 globe-0.013
 Marie-0.012
Token pig
Token ID12967
Feature activation-2.12e+00

Pos logit contributions
 unnecessary+0.004
eter+0.004
please+0.004
ucker+0.004
aa+0.003
Neg logit contributions
lf-0.004
nam-0.004
 globe-0.004
kins-0.004
 Marie-0.004
Token.
Token ID13
Feature activation-9.95e-02

Pos logit contributions
please+0.048
aa+0.047
eter+0.046
ucker+0.046
ch+0.045
Neg logit contributions
nam-0.057
 Marie-0.055
kins-0.055
atively-0.053
lf-0.053
Token The
Token ID383
Feature activation-6.23e-01

Pos logit contributions
 unnecessary+0.002
please+0.002
eter+0.002
aa+0.002
ucker+0.002
Neg logit contributions
nam-0.003
kins-0.003
lf-0.003
 Marie-0.003
 globe-0.003
Token big
Token ID1263
Feature activation+8.03e-01

Pos logit contributions
eter+0.017
 unnecessary+0.015
ucker+0.015
ead+0.015
ounce+0.014
Neg logit contributions
kins-0.017
 globe-0.016
pe-0.016
nam-0.015
lf-0.015
Token pig
Token ID12967
Feature activation-1.42e+00

Pos logit contributions
kins+0.023
 globe+0.022
nam+0.022
pe+0.022
lf+0.022
Neg logit contributions
eter-0.020
 unnecessary-0.019
ucker-0.018
ead-0.018
please-0.017
Token had
Token ID550
Feature activation-3.78e-01

Pos logit contributions
please+0.033
aa+0.033
ucker+0.032
eter+0.031
ch+0.031
Neg logit contributions
nam-0.039
 Marie-0.036
atively-0.035
kins-0.034
lf-0.034
Token it
Token ID340
Feature activation-2.66e-01

Pos logit contributions
kins+0.024
 Marie+0.024
lf+0.024
atively+0.023
nam+0.023
Neg logit contributions
 unnecessary-0.015
ch-0.015
 trouble-0.014
please-0.014
 fle-0.014
Token?
Token ID30
Feature activation+2.07e-01

Pos logit contributions
 unnecessary+0.006
ch+0.006
 trouble+0.006
please+0.006
ucker+0.006
Neg logit contributions
nam-0.008
lf-0.007
 tripod-0.007
kins-0.007
atively-0.007
Token It
Token ID632
Feature activation+4.14e-01

Pos logit contributions
kins+0.007
lf+0.007
nam+0.007
 globe+0.006
atively+0.006
Neg logit contributions
ch-0.003
eter-0.003
ead-0.003
 unnecessary-0.003
aa-0.003
Token was
Token ID373
Feature activation-1.30e-01

Pos logit contributions
nam+0.011
kins+0.010
lf+0.010
 Marie+0.010
 globe+0.010
Neg logit contributions
 unnecessary-0.010
please-0.010
eter-0.010
aa-0.010
ch-0.010
Token an
Token ID281
Feature activation+1.44e-02

Pos logit contributions
please+0.004
aa+0.004
 unnecessary+0.004
ucker+0.004
eter+0.004
Neg logit contributions
nam-0.002
kins-0.002
lf-0.002
 Marie-0.002
 globe-0.002
Token egg
Token ID5935
Feature activation-2.07e+00

Pos logit contributions
nam+0.000
kins+0.000
 Marie+0.000
lf+0.000
 globe+0.000
Neg logit contributions
 unnecessary-0.000
please-0.000
eter-0.000
aa-0.000
ch-0.000
Token!
Token ID0
Feature activation-4.21e-04

Pos logit contributions
 unnecessary+0.050
please+0.048
eter+0.047
aa+0.046
ucker+0.046
Neg logit contributions
kins-0.055
lf-0.055
nam-0.054
 globe-0.052
 Marie-0.051
Token They
Token ID1119
Feature activation+4.13e-01

Pos logit contributions
ch+0.000
eter+0.000
aa+0.000
ead+0.000
ucker+0.000
Neg logit contributions
nam-0.000
kins-0.000
lf-0.000
 globe-0.000
atively-0.000
Token stopped
Token ID5025
Feature activation+5.34e-01

Pos logit contributions
kins+0.011
 Marie+0.011
lf+0.011
book+0.011
 globe+0.011
Neg logit contributions
please-0.008
aa-0.008
ounced-0.008
eter-0.007
ucker-0.007
Token and
Token ID290
Feature activation+2.26e-01

Pos logit contributions
nam+0.012
kins+0.012
 telescope+0.011
 Marie+0.011
 globe+0.011
Neg logit contributions
eter-0.014
 unnecessary-0.014
ucker-0.014
please-0.014
aa-0.013
Token looked
Token ID3114
Feature activation+7.87e-01

Pos logit contributions
kins+0.006
lf+0.006
book+0.006
nam+0.006
 globe+0.006
Neg logit contributions
please-0.005
eter-0.005
aa-0.005
ch-0.005
 fle-0.005
Token Pam
Token ID24497
Feature activation-6.64e-01

Pos logit contributions
kins+0.028
nam+0.027
lf+0.026
atively+0.026
 globe+0.025
Neg logit contributions
ch-0.012
aa-0.012
please-0.012
 unnecessary-0.011
eter-0.011
Token was
Token ID373
Feature activation-1.25e-01

Pos logit contributions
please+0.013
aa+0.012
ch+0.012
 trouble+0.011
 unnecessary+0.011
Neg logit contributions
nam-0.022
atively-0.020
 Marie-0.020
kins-0.019
 tripod-0.019
Token lying
Token ID9105
Feature activation-1.54e+00

Pos logit contributions
 unnecessary+0.003
eter+0.003
please+0.003
aa+0.003
ucker+0.003
Neg logit contributions
nam-0.003
kins-0.003
lf-0.003
 Marie-0.003
 globe-0.003
Token in
Token ID287
Feature activation+2.44e-02

Pos logit contributions
please+0.040
ch+0.039
 unnecessary+0.039
aa+0.039
eter+0.038
Neg logit contributions
nam-0.037
kins-0.036
lf-0.035
 Marie-0.035
 tripod-0.034
Token her
Token ID607
Feature activation+5.08e-01

Pos logit contributions
nam+0.001
kins+0.001
lf+0.001
 Marie+0.001
atively+0.001
Neg logit contributions
 unnecessary-0.001
please-0.001
eter-0.001
aa-0.001
ch-0.001
Token bed
Token ID3996
Feature activation-2.12e+00

Pos logit contributions
nam+0.010
lf+0.010
kins+0.010
 Marie+0.010
 globe+0.010
Neg logit contributions
 unnecessary-0.015
please-0.015
eter-0.014
aa-0.014
ucker-0.014
Token.
Token ID13
Feature activation-8.15e-01

Pos logit contributions
please+0.049
aa+0.049
ch+0.048
eter+0.048
ucker+0.047
Neg logit contributions
nam-0.056
kins-0.053
 Marie-0.052
 tripod-0.051
 globe-0.050
Token It
Token ID632
Feature activation+4.65e-01

Pos logit contributions
 unnecessary+0.022
please+0.021
eter+0.021
ucker+0.021
aa+0.021
Neg logit contributions
nam-0.019
kins-0.018
 Marie-0.018
lf-0.017
atively-0.017
Token was
Token ID373
Feature activation-3.14e-01

Pos logit contributions
nam+0.012
kins+0.012
lf+0.011
 Marie+0.011
 globe+0.011
Neg logit contributions
 unnecessary-0.011
please-0.011
eter-0.011
aa-0.011
ucker-0.011
Token a
Token ID257
Feature activation-2.36e-01

Pos logit contributions
 unnecessary+0.009
eter+0.009
aa+0.009
please+0.009
ucker+0.009
Neg logit contributions
nam-0.007
lf-0.007
kins-0.006
 Marie-0.006
book-0.006
Token normal
Token ID3487
Feature activation+4.16e-01

Pos logit contributions
eter+0.006
 unnecessary+0.006
ucker+0.006
ead+0.006
please+0.006
Neg logit contributions
nam-0.006
lf-0.006
 globe-0.006
 telescope-0.005
 tripod-0.005
Token then
Token ID788
Feature activation+2.17e-01

Pos logit contributions
eter+0.015
please+0.015
 unnecessary+0.014
aa+0.014
 trouble+0.014
Neg logit contributions
kins-0.017
 globe-0.016
lf-0.016
 tripod-0.016
ophone-0.016
Token she
Token ID673
Feature activation-7.04e-01

Pos logit contributions
nam+0.004
kins+0.004
lf+0.004
 Marie+0.004
atively+0.004
Neg logit contributions
 unnecessary-0.007
please-0.007
ucker-0.007
eter-0.007
aa-0.007
Token saw
Token ID2497
Feature activation+3.80e-01

Pos logit contributions
 unnecessary+0.018
eter+0.017
ead+0.017
ucker+0.017
aa+0.016
Neg logit contributions
lf-0.017
kins-0.017
nam-0.017
 Marie-0.017
atively-0.016
Token a
Token ID257
Feature activation-1.00e+00

Pos logit contributions
nam+0.009
kins+0.009
lf+0.009
 Marie+0.009
 globe+0.009
Neg logit contributions
 unnecessary-0.010
please-0.009
eter-0.009
aa-0.009
ucker-0.009
Token spoiled
Token ID34781
Feature activation-6.78e-01

Pos logit contributions
eter+0.027
 unnecessary+0.026
ucker+0.026
ead+0.026
please+0.025
Neg logit contributions
nam-0.023
 globe-0.023
 tripod-0.023
lf-0.023
 telescope-0.022
Token egg
Token ID5935
Feature activation-2.08e+00

Pos logit contributions
 unnecessary+0.016
eter+0.016
ucker+0.016
please+0.016
ounced+0.015
Neg logit contributions
igator-0.017
lf-0.017
kins-0.017
 Marie-0.017
 globe-0.017
Token on
Token ID319
Feature activation-1.38e-01

Pos logit contributions
 unnecessary+0.059
please+0.057
eter+0.057
aa+0.055
ucker+0.055
Neg logit contributions
kins-0.046
nam-0.046
lf-0.045
 globe-0.043
 Marie-0.042
Token the
Token ID262
Feature activation+1.22e+00

Pos logit contributions
 unnecessary+0.003
please+0.003
eter+0.003
ch+0.003
aa+0.003
Neg logit contributions
nam-0.003
kins-0.003
lf-0.003
 Marie-0.003
atively-0.003
Token ground
Token ID2323
Feature activation-9.60e-03

Pos logit contributions
nam+0.033
kins+0.033
lf+0.031
 Marie+0.031
book+0.030
Neg logit contributions
please-0.027
 unnecessary-0.027
ch-0.026
aa-0.026
eter-0.026
Token.
Token ID13
Feature activation-4.68e-03

Pos logit contributions
eter+0.000
please+0.000
 unnecessary+0.000
aa+0.000
ucker+0.000
Neg logit contributions
nam-0.000
kins-0.000
lf-0.000
 Marie-0.000
 tripod-0.000
Token "
Token ID366
Feature activation-1.81e-01

Pos logit contributions
eter+0.000
please+0.000
ead+0.000
ucker+0.000
aa+0.000
Neg logit contributions
kins-0.000
nam-0.000
lf-0.000
 globe-0.000
igator-0.000