TOP ACTIVATIONS
MAX = 0.082

have ever seen ! Your heart must be so calm to
Lily says . " Your heart is working too hard .
necklace is broken ! The heart is gone !" Mia sob
learn something new , your heart can heal and grow stronger
the girl would put her ear against the fence .
is a shape . A heart means love ," Anna says
heart with red . A heart is a shape . A
but when we show our heart we can do amazing things
ucky said , " F ro ggy , you are such
something strange happens . The heart does not stay on the
look , Amy , your heart is made up of lots
forward to it . " Will it be hard , mom
in his eye . " Will my wish come true ?"
children put on their bath ro bes and ran around in
tired and sore . My heart hurts ." L
lovely if you put your heart into it . Not only
you . It has a heart and it says I love
hug and said , " Will you play with me ?".
you . It has a heart and it says I love
not your heart . Your heart is in your chest .

MIN ACTIVATIONS
MIN = -0.098

playing and never touch anything they were not allowed to .
to make them do what we want . <|endoftext|> Once upon
They did not know what they would see . <|endoftext|> Tom
inside was nothing like what they had expected , instead ,
what other y ummy things they could find . But they
we never know what opportunities we might miss out on .
us happy , not what we can sell or trade them
if it is not what we want to do .
They did not know what they were playing with . They
Fire is not a toy " on the bottom . They
tell all their friends what they had seen and done .
learned what " ra re " meant . It meant something
wondered what other brave things they could do . They felt
asked if there was anything they could do to help the
and asked her teacher what they meant . Her teacher said
understand what " for give " meant , but the dolphin
to try and get what we want . We should always
They felt sorry for what they did . They remembered how
Sally didn 't know what they were . So
take it with them wherever they went . And so they

INTERVAL 0.037 - 0.082
CONTAINS 0.677%

away . S ara is very sad and angry
thanked her and put the glass over his head . The
, a little girl named Lucy had a question . She
and ugly . Put the tire back on the fire truck
mum said , “ No , Jill

INTERVAL -0.008 - 0.037
CONTAINS 71.537%

. They make a lot of noise and a lot of
need to clean up first . Otherwise , we won 't
She was scared and didn 't know what to do .
" Maybe we should go out ." But before
At the end of the day , Sam was very tired

INTERVAL -0.053 - -0.008
CONTAINS 27.585%

they came back , they all worked together to make the
. She says she will not do it again . She
fix it together ." They looked for some tools and started
Her mom asked , " What do you mean , Lily
tall metal rail near the park . It was perfect for

INTERVAL -0.098 - -0.053
CONTAINS 0.201%

they had no idea why they were there . They looked
should forgive and apologize when we make mistakes . We should
" No , we can 't do that ,"
around . So they went on a journey together
spider , but no matter how hard she tried , it
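
NOTE ON THE INTERVALS
The four interval boundaries listed above are consistent with splitting the range from MIN (-0.098) to MAX (0.082) into four equal-width bins of 0.045 each, and the four CONTAINS percentages sum to 100%. The sketch below reproduces those boundaries and shows one way such per-interval shares could be computed. It is an illustration only: the function names, the choice of four bins, and the sample activations are assumptions, not the tool's actual code. The bins are returned from lowest to highest, which is the reverse of the order shown above.

```python
def equal_width_intervals(lo, hi, n_bins=4):
    """Split [lo, hi] into n_bins equal-width intervals (edges rounded to 3 decimals)."""
    width = (hi - lo) / n_bins
    edges = [round(lo + i * width, 3) for i in range(n_bins + 1)]
    return list(zip(edges[:-1], edges[1:]))


def interval_shares(activations, lo, hi, n_bins=4):
    """Return the percentage of activations that fall in each equal-width interval."""
    width = (hi - lo) / n_bins
    counts = [0] * n_bins
    for a in activations:
        # Clamp so that values exactly at hi (or slightly outside) land in an end bin.
        idx = max(0, min(int((a - lo) / width), n_bins - 1))
        counts[idx] += 1
    total = len(activations)
    return [100.0 * c / total for c in counts]


if __name__ == "__main__":
    # MIN and MAX are taken from the dump above.
    for low, high in equal_width_intervals(-0.098, 0.082):
        print(f"INTERVAL {low} - {high}")

    # Hypothetical sample activations, for illustration only.
    sample = [-0.098, -0.06, -0.01, 0.0, 0.01, 0.05, 0.082]
    print(interval_shares(sample, -0.098, 0.082))
```

With real data, the activations passed to interval_shares would be the feature's values over every token in the dataset, which is presumably why the two central intervals hold almost all of the mass here.
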
Token have
Token ID423
Feature activation-1.62e-02

Pos logit contributions
ait+0.093
ount+0.089
ada+0.084
enced+0.082
 pigs+0.080
Neg logit contributions
 now-0.060
 wishes-0.055
 hurts-0.054
 be-0.054
 myself-0.054
Token ever
Token ID1683
Feature activation+1.94e-02

Pos logit contributions
ait+0.035
ount+0.033
ada+0.032
enced+0.031
 pigs+0.031
Neg logit contributions
 now-0.013
 hurts-0.012
 wishes-0.012
 myself-0.012
 be-0.011
Token seen
Token ID1775
Feature activation+1.51e-03

Pos logit contributions
 hurts+0.031
 now+0.030
ggy+0.030
 myself+0.030
 wishes+0.029
Neg logit contributions
ount-0.023
ait-0.022
ada-0.020
enced-0.020
line-0.020
Token!
Token ID0
Feature activation+1.04e-02

Pos logit contributions
 now+0.001
 myself+0.001
 hurts+0.001
 wishes+0.001
 my+0.001
Neg logit contributions
ait-0.003
ount-0.003
ada-0.003
enced-0.003
 pigs-0.003
Token Your
Token ID3406
Feature activation+3.35e-02

Pos logit contributions
 now+0.010
 hurts+0.010
 wishes+0.009
 myself+0.009
 be+0.009
Neg logit contributions
ait-0.021
ount-0.020
ada-0.019
enced-0.018
 pigs-0.018
Token heart
Token ID2612
Feature activation+8.15e-02

Pos logit contributions
 now+0.034
 hurts+0.032
 myself+0.031
 wishes+0.031
 be+0.030
Neg logit contributions
ait-0.064
ount-0.061
ada-0.060
 pigs-0.057
enced-0.057
Token must
Token ID1276
Feature activation+5.63e-02

Pos logit contributions
 wishes+0.052
 now+0.050
ggy+0.048
 myself+0.047
 eyel+0.046
Neg logit contributions
ait-0.168
ount-0.166
ada-0.157
enced-0.154
arre-0.153
Token be
Token ID307
Feature activation+1.03e-02

Pos logit contributions
 now+0.037
 be+0.032
 myself+0.032
 wishes+0.032
 hurts+0.031
Neg logit contributions
ait-0.133
ount-0.128
ada-0.123
enced-0.121
 pigs-0.120
Token so
Token ID523
Feature activation+2.18e-02

Pos logit contributions
 now+0.010
 myself+0.009
 hurts+0.009
 wishes+0.009
 be+0.009
Neg logit contributions
ait-0.021
ount-0.020
ada-0.019
enced-0.019
 pigs-0.018
Token calm
Token ID9480
Feature activation+1.30e-02

Pos logit contributions
 now+0.021
 hurts+0.019
 wishes+0.019
 myself+0.019
 be+0.018
Neg logit contributions
ait-0.044
ount-0.042
ada-0.040
enced-0.039
 pigs-0.038
Token to
Token ID284
Feature activation+2.41e-02

Pos logit contributions
 now+0.012
 myself+0.010
 be+0.009
 hurts+0.009
 my+0.009
Neg logit contributions
ait-0.030
ount-0.029
enced-0.027
ada-0.027
umbers-0.027
Token Lily
Token ID20037
Feature activation+4.45e-03

Pos logit contributions
ait+0.018
ount+0.017
ada+0.016
 911+0.015
enced+0.015
Neg logit contributions
 myself-0.018
 now-0.018
 hurts-0.018
 wishes-0.017
ggy-0.017
Token says
Token ID1139
Feature activation-2.90e-02

Pos logit contributions
 now+0.005
 myself+0.004
 hurts+0.004
 wishes+0.004
 be+0.004
Neg logit contributions
ait-0.008
ount-0.008
ada-0.008
enced-0.008
 pigs-0.007
Token.
Token ID13
Feature activation-2.19e-02

Pos logit contributions
ait+0.052
ount+0.049
ada+0.047
enced+0.045
 pigs+0.044
Neg logit contributions
 now-0.035
 hurts-0.033
 wishes-0.033
 myself-0.032
 be-0.032
Token "
Token ID366
Feature activation-3.18e-03

Pos logit contributions
ait+0.040
ount+0.038
ada+0.036
enced+0.035
 pigs+0.034
Neg logit contributions
 now-0.027
 hurts-0.025
 be-0.025
 myself-0.024
 wishes-0.024
TokenYour
Token ID7120
Feature activation+3.01e-02

Pos logit contributions
ait+0.005
ount+0.005
ada+0.005
enced+0.005
 pigs+0.004
Neg logit contributions
 now-0.004
 hurts-0.004
 wishes-0.004
 myself-0.004
 be-0.004
Token heart
Token ID2612
Feature activation+7.51e-02

Pos logit contributions
 now+0.031
 myself+0.029
 hurts+0.027
 wishes+0.026
ggy+0.026
Neg logit contributions
ait-0.057
ount-0.053
ada-0.052
 pigs-0.051
 911-0.050
Token is
Token ID318
Feature activation+3.04e-02

Pos logit contributions
ggy+0.066
 eyel+0.064
aura+0.061
 myself+0.060
Just+0.060
Neg logit contributions
ait-0.126
ount-0.125
arre-0.120
ains-0.117
line-0.116
Token working
Token ID1762
Feature activation+7.00e-03

Pos logit contributions
 now+0.026
 hurts+0.025
 wishes+0.025
 myself+0.025
 be+0.023
Neg logit contributions
ait-0.062
ount-0.060
ada-0.057
 pigs-0.055
enced-0.055
Token too
Token ID1165
Feature activation+3.63e-03

Pos logit contributions
 now+0.007
 wishes+0.007
 hurts+0.007
 myself+0.007
 be+0.007
Neg logit contributions
ait-0.013
ount-0.013
ada-0.012
enced-0.012
 pigs-0.012
Token hard
Token ID1327
Feature activation+2.96e-03

Pos logit contributions
 now+0.003
 hurts+0.002
 wishes+0.002
 myself+0.002
 be+0.002
Neg logit contributions
ait-0.008
ount-0.008
ada-0.008
enced-0.007
 pigs-0.007
Token.
Token ID13
Feature activation+1.15e-02

Pos logit contributions
 now+0.003
 hurts+0.003
 wishes+0.003
 myself+0.003
 be+0.003
Neg logit contributions
ait-0.006
ount-0.006
ada-0.005
enced-0.005
 pigs-0.005
Token necklace
Token ID38987
Feature activation+4.61e-02

Pos logit contributions
 myself+0.036
Just+0.035
 now+0.035
p+0.033
aura+0.033
Neg logit contributions
ait-0.040
ada-0.039
 stret-0.038
 pigs-0.036
line-0.035
Token is
Token ID318
Feature activation+1.90e-02

Pos logit contributions
aura+0.034
 Listen+0.034
 Raven+0.033
 wishes+0.033
Just+0.033
Neg logit contributions
ait-0.081
ount-0.077
ada-0.077
auc-0.074
pan-0.074
Token broken
Token ID5445
Feature activation+1.08e-02

Pos logit contributions
 wishes+0.023
bye+0.022
ggy+0.021
Just+0.021
 wish+0.020
Neg logit contributions
ait-0.027
ount-0.026
 pigs-0.025
 911-0.025
 onions-0.025
Token!
Token ID0
Feature activation-1.10e-02

Pos logit contributions
 wishes+0.011
 now+0.011
 hurts+0.011
 myself+0.010
ggy+0.010
Neg logit contributions
ait-0.020
ount-0.019
ada-0.018
enced-0.017
 pigs-0.017
Token The
Token ID383
Feature activation+1.04e-02

Pos logit contributions
ait+0.017
ada+0.016
ount+0.016
hot+0.015
 911+0.014
Neg logit contributions
 now-0.015
 wishes-0.014
 lands-0.014
 spell-0.014
 be-0.014
Token heart
Token ID2612
Feature activation+7.50e-02

Pos logit contributions
 now+0.010
 myself+0.010
 hurts+0.009
ggy+0.009
 wishes+0.009
Neg logit contributions
ait-0.019
ount-0.018
ada-0.018
 pigs-0.018
line-0.017
Token is
Token ID318
Feature activation+2.70e-02

Pos logit contributions
Just+0.069
edly+0.067
aura+0.066
 Listen+0.065
 myself+0.064
Neg logit contributions
ait-0.119
ount-0.114
auc-0.111
ada-0.108
pan-0.107
Token gone
Token ID3750
Feature activation+2.04e-02

Pos logit contributions
 wishes+0.030
ggy+0.029
bye+0.029
 myself+0.029
 Raven+0.028
Neg logit contributions
ait-0.041
ount-0.039
 911-0.038
 pigs-0.038
 onions-0.037
Token!"
Token ID2474
Feature activation-2.44e-02

Pos logit contributions
 now+0.018
 wishes+0.018
 hurts+0.017
 myself+0.017
 be+0.017
Neg logit contributions
ait-0.041
ount-0.039
ada-0.038
enced-0.036
 pigs-0.036
Token Mia
Token ID32189
Feature activation-3.53e-03

Pos logit contributions
ait+0.035
ount+0.034
ada+0.032
enced+0.031
 pigs+0.030
Neg logit contributions
 now-0.035
 myself-0.034
 hurts-0.032
 wishes-0.032
 be-0.031
Token sob
Token ID27355
Feature activation-3.84e-03

Pos logit contributions
ait+0.006
ount+0.006
ada+0.006
enced+0.005
 pigs+0.005
Neg logit contributions
 now-0.004
 myself-0.004
 wishes-0.004
 hurts-0.004
 be-0.004
Token learn
Token ID2193
Feature activation+2.64e-02

Pos logit contributions
ait+0.063
ount+0.060
ada+0.057
enced+0.056
 pigs+0.055
Neg logit contributions
 now-0.038
 hurts-0.035
 wishes-0.035
 myself-0.035
 be-0.035
Token something
Token ID1223
Feature activation-1.44e-03

Pos logit contributions
 now+0.026
 hurts+0.024
 wishes+0.024
 myself+0.023
 be+0.023
Neg logit contributions
ait-0.051
ount-0.049
ada-0.047
enced-0.046
 pigs-0.045
Token new
Token ID649
Feature activation+1.28e-02

Pos logit contributions
ait+0.003
ount+0.003
ada+0.003
enced+0.003
 pigs+0.003
Neg logit contributions
 now-0.001
 myself-0.001
 wishes-0.001
 hurts-0.001
 be-0.001
Token,
Token ID11
Feature activation+2.59e-03

Pos logit contributions
 now+0.011
 hurts+0.011
 myself+0.011
 wishes+0.010
ggy+0.010
Neg logit contributions
ait-0.024
ount-0.024
ada-0.022
 pigs-0.022
iers-0.022
Token your
Token ID534
Feature activation+2.32e-02

Pos logit contributions
 now+0.002
 be+0.002
 myself+0.002
 wishes+0.002
 hurts+0.002
Neg logit contributions
ait-0.006
ount-0.006
ada-0.006
enced-0.006
 pigs-0.005
Token heart
Token ID2612
Feature activation+7.49e-02

Pos logit contributions
 now+0.021
 myself+0.020
 hurts+0.020
 be+0.018
 wishes+0.018
Neg logit contributions
ait-0.046
ount-0.043
ada-0.042
 pigs-0.040
enced-0.040
Token can
Token ID460
Feature activation+2.14e-02

Pos logit contributions
ggy+0.063
 myself+0.063
 now+0.062
 eyel+0.058
 wishes+0.058
Neg logit contributions
ait-0.137
ount-0.135
arre-0.126
ada-0.125
 pigs-0.124
Token heal
Token ID12035
Feature activation+1.48e-02

Pos logit contributions
 now+0.018
 wishes+0.017
 hurts+0.017
 myself+0.017
ggy+0.016
Neg logit contributions
ait-0.043
ount-0.041
ada-0.040
enced-0.039
 pigs-0.038
Token and
Token ID290
Feature activation+3.30e-02

Pos logit contributions
 now+0.014
 hurts+0.014
 wishes+0.013
 myself+0.013
 be+0.013
Neg logit contributions
ait-0.029
ount-0.028
ada-0.027
 pigs-0.026
enced-0.026
Token grow
Token ID1663
Feature activation+1.67e-02

Pos logit contributions
ggy+0.032
 now+0.031
 myself+0.030
 hurts+0.030
 wishes+0.029
Neg logit contributions
ait-0.056
ount-0.053
 pigs-0.052
 onions-0.051
line-0.051
Token stronger
Token ID7387
Feature activation+9.28e-03

Pos logit contributions
 now+0.016
 hurts+0.014
 wishes+0.014
 myself+0.014
 be+0.014
Neg logit contributions
ait-0.034
ount-0.033
ada-0.031
enced-0.031
 pigs-0.030
Token the
Token ID262
Feature activation+1.94e-02

Pos logit contributions
 myself+0.031
 now+0.031
 wishes+0.030
 hurts+0.030
ggy+0.027
Neg logit contributions
ait-0.071
ount-0.067
ada-0.066
 pigs-0.063
enced-0.062
Token girl
Token ID2576
Feature activation+2.86e-02

Pos logit contributions
 now+0.017
 hurts+0.015
 wishes+0.015
 be+0.015
 myself+0.014
Neg logit contributions
ait-0.042
ount-0.040
ada-0.038
enced-0.038
iers-0.037
Token would
Token ID561
Feature activation+1.40e-02

Pos logit contributions
 now+0.033
 be+0.030
 wishes+0.029
 hurts+0.029
 dream+0.027
Neg logit contributions
ait-0.054
ount-0.053
ada-0.051
enced-0.048
iers-0.047
Token put
Token ID1234
Feature activation-3.69e-03

Pos logit contributions
 be+0.013
 now+0.012
 dream+0.010
 wish+0.010
 spell+0.009
Neg logit contributions
ount-0.032
ait-0.032
ada-0.031
iers-0.031
umbers-0.031
Token her
Token ID607
Feature activation+3.91e-02

Pos logit contributions
ait+0.008
ount+0.008
ada+0.007
enced+0.007
ules+0.007
Neg logit contributions
 now-0.004
 myself-0.003
 be-0.003
 wishes-0.003
 my-0.003
Token ear
Token ID1027
Feature activation+7.43e-02

Pos logit contributions
 now+0.041
 hurts+0.040
 wishes+0.039
 be+0.039
 my+0.036
Neg logit contributions
ait-0.078
ount-0.077
enced-0.071
ada-0.071
iers-0.069
Token against
Token ID1028
Feature activation+5.15e-03

Pos logit contributions
 wishes+0.123
ggy+0.119
mm+0.118
ww+0.116
aura+0.113
Neg logit contributions
ait-0.071
line-0.069
 911-0.067
pins-0.066
hot-0.063
Token the
Token ID262
Feature activation+2.56e-03

Pos logit contributions
 now+0.006
 hurts+0.006
 wishes+0.006
 myself+0.006
 be+0.006
Neg logit contributions
ait-0.009
ount-0.008
ada-0.008
enced-0.008
 pigs-0.008
Token fence
Token ID13990
Feature activation-8.08e-03

Pos logit contributions
 now+0.003
 be+0.003
 hurts+0.003
 wishes+0.003
 myself+0.003
Neg logit contributions
ait-0.005
ount-0.005
iers-0.004
ada-0.004
umbers-0.004
Token.
Token ID13
Feature activation+8.86e-03

Pos logit contributions
ait+0.020
ount+0.019
ada+0.019
enced+0.018
umbers+0.017
Neg logit contributions
 now-0.006
 be-0.005
 myself-0.004
 hurts-0.004
 wishes-0.004
Token
Token ID198
Feature activation+3.51e-03

Pos logit contributions
 now+0.009
 be+0.009
 wishes+0.009
 hurts+0.009
 myself+0.009
Neg logit contributions
ait-0.017
ount-0.016
ada-0.016
enced-0.015
iers-0.015
Token is
Token ID318
Feature activation+2.54e-02

Pos logit contributions
 myself+0.057
 wishes+0.057
ggy+0.056
 eyel+0.056
Just+0.054
Neg logit contributions
ount-0.138
ait-0.136
ada-0.127
line-0.126
enced-0.123
Token a
Token ID257
Feature activation+1.30e-03

Pos logit contributions
 hurts+0.023
 now+0.023
 wishes+0.023
 myself+0.022
ggy+0.021
Neg logit contributions
ait-0.048
ount-0.047
ada-0.044
 pigs-0.044
enced-0.042
Token shape
Token ID5485
Feature activation+1.93e-02

Pos logit contributions
 myself+0.001
 now+0.001
 hurts+0.001
 wishes+0.001
ggy+0.001
Neg logit contributions
ait-0.002
 pigs-0.002
ada-0.002
ount-0.002
 onions-0.002
Token.
Token ID13
Feature activation+2.01e-02

Pos logit contributions
 now+0.022
 hurts+0.021
 wishes+0.021
 myself+0.021
 be+0.020
Neg logit contributions
ait-0.033
ount-0.032
ada-0.031
enced-0.030
 pigs-0.030
Token A
Token ID317
Feature activation+1.52e-02

Pos logit contributions
 now+0.027
 hurts+0.026
 wishes+0.025
ggy+0.025
 lands+0.024
Neg logit contributions
ount-0.032
ait-0.031
ada-0.029
hot-0.028
iers-0.028
Token heart
Token ID2612
Feature activation+7.31e-02

Pos logit contributions
 now+0.015
 myself+0.014
 wishes+0.013
 hurts+0.013
ggy+0.013
Neg logit contributions
ait-0.028
ount-0.026
ada-0.026
 pigs-0.026
line-0.025
Token means
Token ID1724
Feature activation+1.13e-02

Pos logit contributions
 myself+0.058
ggy+0.056
 eyel+0.056
 wishes+0.055
Just+0.053
Neg logit contributions
ount-0.139
ait-0.137
line-0.127
ada-0.127
enced-0.124
Token love
Token ID1842
Feature activation+3.37e-02

Pos logit contributions
 now+0.010
 hurts+0.010
 wishes+0.010
 myself+0.010
ggy+0.009
Neg logit contributions
ait-0.021
ount-0.020
ada-0.019
 pigs-0.019
iers-0.018
Token,"
Token ID553
Feature activation-1.40e-02

Pos logit contributions
 now+0.037
 hurts+0.037
 wishes+0.037
 myself+0.035
ggy+0.035
Neg logit contributions
ait-0.059
ount-0.056
ada-0.054
 pigs-0.052
enced-0.051
Token Anna
Token ID11735
Feature activation+1.63e-02

Pos logit contributions
ait+0.024
ount+0.023
enced+0.021
ada+0.021
umbers+0.020
Neg logit contributions
 now-0.020
 be-0.019
 wishes-0.019
 myself-0.018
 hurts-0.018
Token says
Token ID1139
Feature activation-1.67e-02

Pos logit contributions
 now+0.017
 hurts+0.015
 wishes+0.015
 myself+0.015
 be+0.015
Neg logit contributions
ait-0.032
ount-0.031
ada-0.029
enced-0.029
 pigs-0.028
Token heart
Token ID2612
Feature activation+5.32e-02

Pos logit contributions
 now+0.020
 myself+0.019
 hurts+0.019
 wishes+0.018
ggy+0.017
Neg logit contributions
ait-0.039
ount-0.038
ada-0.037
 pigs-0.037
enced-0.035
Token with
Token ID351
Feature activation-4.01e-03

Pos logit contributions
 wishes+0.060
 now+0.059
 hurts+0.058
 myself+0.056
ggy+0.055
Neg logit contributions
ait-0.091
ount-0.089
ada-0.083
line-0.081
 pigs-0.081
Token red
Token ID2266
Feature activation+2.14e-02

Pos logit contributions
ait+0.008
ount+0.008
ada+0.008
 pigs+0.007
enced+0.007
Neg logit contributions
 now-0.003
 hurts-0.003
 wishes-0.003
 myself-0.003
 be-0.003
Token.
Token ID13
Feature activation+1.90e-02

Pos logit contributions
 now+0.026
ggy+0.023
 wishes+0.023
 myself+0.023
 hurts+0.023
Neg logit contributions
ait-0.036
ount-0.035
ada-0.034
 pigs-0.033
enced-0.031
Token A
Token ID317
Feature activation+1.84e-02

Pos logit contributions
 now+0.025
 hurts+0.025
 wishes+0.024
ggy+0.024
 lands+0.023
Neg logit contributions
ount-0.030
ait-0.030
ada-0.028
hot-0.027
 pigs-0.026
Token heart
Token ID2612
Feature activation+7.24e-02

Pos logit contributions
 now+0.019
 myself+0.018
 wishes+0.017
 hurts+0.017
ggy+0.016
Neg logit contributions
ait-0.033
 pigs-0.031
ada-0.031
ount-0.031
line-0.029
Token is
Token ID318
Feature activation+2.54e-02

Pos logit contributions
 myself+0.057
 wishes+0.057
ggy+0.056
 eyel+0.056
Just+0.054
Neg logit contributions
ount-0.138
ait-0.136
ada-0.127
line-0.126
enced-0.123
Token a
Token ID257
Feature activation+1.30e-03

Pos logit contributions
 hurts+0.023
 now+0.023
 wishes+0.023
 myself+0.022
ggy+0.021
Neg logit contributions
ait-0.048
ount-0.047
ada-0.044
 pigs-0.044
enced-0.042
Token shape
Token ID5485
Feature activation+1.93e-02

Pos logit contributions
 myself+0.001
 now+0.001
 hurts+0.001
 wishes+0.001
ggy+0.001
Neg logit contributions
ait-0.002
 pigs-0.002
ada-0.002
ount-0.002
 onions-0.002
Token.
Token ID13
Feature activation+2.01e-02

Pos logit contributions
 now+0.022
 hurts+0.021
 wishes+0.021
 myself+0.021
 be+0.020
Neg logit contributions
ait-0.033
ount-0.032
ada-0.031
enced-0.030
 pigs-0.030
Token A
Token ID317
Feature activation+1.52e-02

Pos logit contributions
 now+0.027
 hurts+0.026
 wishes+0.025
ggy+0.025
 lands+0.024
Neg logit contributions
ount-0.032
ait-0.031
ada-0.029
hot-0.028
iers-0.028
Token but
Token ID475
Feature activation+2.99e-02

Pos logit contributions
ait+0.014
ount+0.013
enced+0.013
ada+0.013
umbers+0.012
Neg logit contributions
 now-0.007
 be-0.006
 my-0.006
 myself-0.006
 wishes-0.005
Token when
Token ID618
Feature activation+2.17e-02

Pos logit contributions
 now+0.025
 hurts+0.025
 wishes+0.024
 myself+0.023
ggy+0.023
Neg logit contributions
ait-0.060
ount-0.058
ada-0.056
 pigs-0.054
enced-0.053
Token we
Token ID356
Feature activation-5.75e-02

Pos logit contributions
 hurts+0.019
 now+0.019
 wishes+0.018
ggy+0.018
 myself+0.018
Neg logit contributions
ait-0.041
ount-0.039
 pigs-0.039
ada-0.037
 onions-0.037
Token show
Token ID905
Feature activation+5.43e-03

Pos logit contributions
ait+0.123
ount+0.117
enced+0.110
iers+0.109
 pigs+0.108
Neg logit contributions
 be-0.060
 now-0.058
 wish-0.053
 hurts-0.051
 wishes-0.051
Token our
Token ID674
Feature activation+2.55e-04

Pos logit contributions
 now+0.006
 wishes+0.006
 be+0.006
 hurts+0.006
 myself+0.006
Neg logit contributions
ait-0.011
ount-0.010
enced-0.009
ada-0.009
iers-0.009
Token heart
Token ID2612
Feature activation+7.20e-02

Pos logit contributions
 now+0.000
 wishes+0.000
 hurts+0.000
 be+0.000
 myself+0.000
Neg logit contributions
ait-0.001
ount-0.001
ada-0.000
enced-0.000
iers-0.000
Token we
Token ID356
Feature activation-4.84e-02

Pos logit contributions
 now+0.062
 wishes+0.056
 hurts+0.056
 myself+0.055
 be+0.053
Neg logit contributions
ait-0.151
ount-0.145
ada-0.139
enced-0.135
 pigs-0.133
Token can
Token ID460
Feature activation+8.21e-03

Pos logit contributions
ait+0.098
ount+0.093
enced+0.089
ada+0.087
auc+0.085
Neg logit contributions
 now-0.058
 be-0.057
 wishes-0.053
 wish-0.052
 hurts-0.049
Token do
Token ID466
Feature activation+8.22e-03

Pos logit contributions
 now+0.007
 be+0.007
 hurts+0.007
 myself+0.007
 wishes+0.007
Neg logit contributions
ait-0.017
ount-0.017
ada-0.016
enced-0.016
 pigs-0.015
Token amazing
Token ID4998
Feature activation+7.76e-03

Pos logit contributions
 now+0.008
 wishes+0.007
 hurts+0.007
 myself+0.007
 be+0.007
Neg logit contributions
ait-0.017
ount-0.016
ada-0.016
enced-0.015
 pigs-0.015
Token things
Token ID1243
Feature activation-2.58e-02

Pos logit contributions
 now+0.008
 hurts+0.008
 myself+0.007
 be+0.007
ggy+0.007
Neg logit contributions
ait-0.013
ount-0.013
 pigs-0.013
ada-0.012
 onions-0.012
Tokenucky
Token ID5309
Feature activation+1.67e-02

Pos logit contributions
ait+0.013
ada+0.013
ount+0.012
arre+0.011
enced+0.011
Neg logit contributions
 now-0.025
 wishes-0.025
 myself-0.024
 spell-0.024
 wish-0.023
Token said
Token ID531
Feature activation+4.73e-03

Pos logit contributions
 now+0.017
 myself+0.017
 hurts+0.016
 wishes+0.016
 be+0.015
Neg logit contributions
ait-0.031
ount-0.030
ada-0.029
enced-0.028
 pigs-0.028
Token,
Token ID11
Feature activation-1.85e-02

Pos logit contributions
 myself+0.006
 hurts+0.006
 wishes+0.006
 now+0.006
 spell+0.005
Neg logit contributions
ait-0.007
ount-0.007
ada-0.007
hot-0.007
 pigs-0.007
Token "
Token ID366
Feature activation-4.48e-03

Pos logit contributions
ait+0.028
ount+0.027
enced+0.025
ada+0.024
umbers+0.023
Neg logit contributions
 now-0.029
 be-0.028
 hurts-0.026
 myself-0.026
 my-0.025
TokenF
Token ID37
Feature activation-2.55e-03

Pos logit contributions
ait+0.007
ount+0.007
ada+0.006
enced+0.006
 pigs+0.006
Neg logit contributions
 now-0.006
 hurts-0.006
 wishes-0.006
 myself-0.006
 be-0.006
Tokenro
Token ID305
Feature activation+7.17e-02

Pos logit contributions
ait+0.003
ount+0.003
ada+0.003
enced+0.003
hot+0.003
Neg logit contributions
 wishes-0.004
ller-0.004
 hurts-0.004
 unicorn-0.004
 Raven-0.004
Tokenggy
Token ID19970
Feature activation+1.96e-02

Pos logit contributions
 wishes+0.043
 now+0.040
 hurts+0.037
 wish+0.036
 myself+0.036
Neg logit contributions
ait-0.160
ount-0.156
ada-0.153
enced-0.148
 pigs-0.145
Token,
Token ID11
Feature activation+3.76e-03

Pos logit contributions
 now+0.014
 hurts+0.012
 be+0.012
 wishes+0.011
 myself+0.011
Neg logit contributions
ait-0.046
ount-0.045
ada-0.042
enced-0.042
iers-0.041
Token you
Token ID345
Feature activation-2.46e-02

Pos logit contributions
 now+0.004
 hurts+0.003
 wishes+0.003
 myself+0.003
ggy+0.003
Neg logit contributions
ait-0.007
ount-0.007
ada-0.007
enced-0.006
 pigs-0.006
Token are
Token ID389
Feature activation+1.03e-02

Pos logit contributions
ait+0.034
ount+0.032
ada+0.029
enced+0.028
 pigs+0.027
Neg logit contributions
 now-0.041
 hurts-0.038
 wishes-0.038
 myself-0.038
 be-0.038
Token such
Token ID884
Feature activation+1.68e-02

Pos logit contributions
 now+0.010
 hurts+0.010
 wishes+0.009
 myself+0.009
 be+0.009
Neg logit contributions
ait-0.021
ount-0.020
ada-0.019
enced-0.018
 pigs-0.018
Token something
Token ID1223
Feature activation-2.25e-02

Pos logit contributions
 now+0.007
 hurts+0.007
 myself+0.007
 wishes+0.006
 be+0.006
Neg logit contributions
ait-0.011
ount-0.010
ada-0.010
 pigs-0.010
 911-0.010
Token strange
Token ID6283
Feature activation-3.87e-03

Pos logit contributions
ount+0.031
enced+0.029
auc+0.028
 911+0.028
 onions+0.027
Neg logit contributions
 myself-0.033
 now-0.029
Just-0.027
zag-0.026
othy-0.026
Token happens
Token ID4325
Feature activation-6.79e-04

Pos logit contributions
ount+0.006
ait+0.006
auc+0.006
enced+0.006
hot+0.006
Neg logit contributions
 myself-0.005
 now-0.004
Just-0.004
 my-0.004
 Raven-0.004
Token.
Token ID13
Feature activation+4.69e-03

Pos logit contributions
ait+0.001
ount+0.001
ada+0.001
enced+0.001
line+0.001
Neg logit contributions
 now-0.001
 hurts-0.001
 wishes-0.001
 myself-0.001
 be-0.001
Token The
Token ID383
Feature activation+1.19e-02

Pos logit contributions
 now+0.005
 wishes+0.004
 hurts+0.004
 myself+0.004
 be+0.004
Neg logit contributions
ait-0.009
ount-0.009
ada-0.008
enced-0.008
 pigs-0.008
Token heart
Token ID2612
Feature activation+7.16e-02

Pos logit contributions
 now+0.014
ggy+0.013
 myself+0.013
 hurts+0.012
aura+0.012
Neg logit contributions
ait-0.017
 newspapers-0.017
 pigs-0.017
 onions-0.017
 911-0.017
Token does
Token ID857
Feature activation+4.47e-02

Pos logit contributions
 myself+0.083
oof+0.074
ggy+0.072
 hind+0.069
othy+0.068
Neg logit contributions
ount-0.108
auc-0.100
arre-0.098
ains-0.097
ait-0.097
Token not
Token ID407
Feature activation-1.36e-02

Pos logit contributions
ww+0.040
bye+0.040
 Raven+0.038
 hurts+0.038
 myself+0.037
Neg logit contributions
ount-0.075
 911-0.072
ait-0.072
line-0.071
pan-0.067
Token stay
Token ID2652
Feature activation-9.35e-03

Pos logit contributions
ait+0.026
ount+0.025
ada+0.024
enced+0.023
 pigs+0.022
Neg logit contributions
 now-0.014
 hurts-0.013
 wishes-0.013
 myself-0.013
ggy-0.012
Token on
Token ID319
Feature activation-3.17e-03

Pos logit contributions
ait+0.017
ount+0.017
ada+0.016
enced+0.015
 pigs+0.015
Neg logit contributions
 now-0.010
 hurts-0.010
 wishes-0.010
 myself-0.010
 be-0.009
Token the
Token ID262
Feature activation-2.02e-03

Pos logit contributions
ait+0.006
ount+0.006
ada+0.005
enced+0.005
 pigs+0.005
Neg logit contributions
 now-0.004
 wishes-0.003
 hurts-0.003
 myself-0.003
 be-0.003
Token look
Token ID804
Feature activation-5.44e-03

Pos logit contributions
 hurts+0.018
 now+0.018
 wishes+0.018
 myself+0.018
 lands+0.016
Neg logit contributions
ait-0.036
ount-0.033
ada-0.033
enced-0.031
 911-0.031
Token,
Token ID11
Feature activation+2.52e-03

Pos logit contributions
ait+0.012
ount+0.011
ada+0.011
enced+0.011
 pigs+0.010
Neg logit contributions
 now-0.005
 wishes-0.004
 hurts-0.004
 myself-0.004
 be-0.004
Token Amy
Token ID14235
Feature activation+3.60e-02

Pos logit contributions
 now+0.003
 be+0.003
 myself+0.003
 hurts+0.003
 wishes+0.003
Neg logit contributions
ait-0.005
ount-0.005
ada-0.004
enced-0.004
iers-0.004
Token,
Token ID11
Feature activation+8.11e-03

Pos logit contributions
 now+0.035
 hurts+0.030
 wishes+0.030
 myself+0.030
 be+0.030
Neg logit contributions
ait-0.075
ount-0.073
ada-0.069
enced-0.068
 pigs-0.065
Token your
Token ID534
Feature activation+2.94e-02

Pos logit contributions
 now+0.008
 be+0.007
 myself+0.007
 wishes+0.007
 hurts+0.007
Neg logit contributions
ait-0.017
ount-0.016
ada-0.016
enced-0.016
 pigs-0.015
Token heart
Token ID2612
Feature activation+7.11e-02

Pos logit contributions
 now+0.028
 myself+0.026
 hurts+0.025
 wishes+0.024
ggy+0.024
Neg logit contributions
ait-0.056
ount-0.051
ada-0.051
 pigs-0.050
 911-0.048
Token is
Token ID318
Feature activation+2.69e-02

Pos logit contributions
 wishes+0.054
ggy+0.051
 myself+0.050
 now+0.050
 eyel+0.050
Neg logit contributions
ait-0.132
ount-0.130
ada-0.123
line-0.122
arre-0.121
Token made
Token ID925
Feature activation-1.97e-03

Pos logit contributions
 now+0.025
 hurts+0.024
 wishes+0.023
 myself+0.023
 be+0.022
Neg logit contributions
ait-0.054
ount-0.052
ada-0.050
enced-0.048
 pigs-0.048
Token up
Token ID510
Feature activation+2.03e-03

Pos logit contributions
ait+0.004
ount+0.004
enced+0.004
ada+0.004
 pigs+0.004
Neg logit contributions
 now-0.002
 myself-0.002
 be-0.002
 hurts-0.002
 my-0.002
Token of
Token ID286
Feature activation+5.26e-03

Pos logit contributions
 now+0.002
 myself+0.002
 be+0.002
 hurts+0.001
 wishes+0.001
Neg logit contributions
ait-0.005
ount-0.004
ada-0.004
enced-0.004
umbers-0.004
Token lots
Token ID6041
Feature activation+7.39e-03

Pos logit contributions
 now+0.005
 myself+0.005
 wishes+0.005
 be+0.005
 hurts+0.005
Neg logit contributions
ait-0.011
ount-0.011
enced-0.010
ada-0.010
iers-0.009
Token forward
Token ID2651
Feature activation-9.77e-03

Pos logit contributions
ait+0.054
ount+0.048
umbers+0.047
pan+0.047
iers+0.046
Neg logit contributions
 now-0.023
--0.020
 wishes-0.020
 my-0.020
 very-0.018
Token to
Token ID284
Feature activation+5.13e-03

Pos logit contributions
ait+0.022
ount+0.021
ada+0.020
enced+0.020
umbers+0.020
Neg logit contributions
 now-0.009
 be-0.007
 myself-0.007
 hurts-0.006
 wishes-0.006
Token it
Token ID340
Feature activation-1.53e-02

Pos logit contributions
 now+0.006
 be+0.006
 wishes+0.006
 myself+0.006
 my+0.005
Neg logit contributions
ait-0.010
ount-0.009
ada-0.009
enced-0.008
umbers-0.008
Token.
Token ID13
Feature activation-6.29e-04

Pos logit contributions
ait+0.034
ount+0.034
ada+0.033
enced+0.031
iers+0.030
Neg logit contributions
 now-0.016
 be-0.013
 myself-0.012
 next-0.011
 my-0.010
Token "
Token ID366
Feature activation-2.11e-03

Pos logit contributions
ait+0.001
umbers+0.001
iers+0.001
ount+0.001
 pigs+0.001
Neg logit contributions
 now-0.001
 be-0.001
 my-0.001
 was-0.000
 hurts-0.000
TokenWill
Token ID8743
Feature activation+7.10e-02

Pos logit contributions
ait+0.004
ount+0.003
ada+0.003
enced+0.003
 pigs+0.003
Neg logit contributions
 now-0.003
 myself-0.003
 be-0.003
 hurts-0.002
 wishes-0.002
Token it
Token ID340
Feature activation+1.06e-03

Pos logit contributions
 wishes+0.070
 eyel+0.062
 now+0.061
 hurts+0.061
 spell+0.059
Neg logit contributions
ait-0.131
ount-0.123
ada-0.123
line-0.120
folk-0.117
Token be
Token ID307
Feature activation+1.26e-02

Pos logit contributions
 now+0.001
 hurts+0.001
 be+0.001
 myself+0.001
 wishes+0.001
Neg logit contributions
ait-0.002
ount-0.002
ada-0.002
enced-0.002
 pigs-0.002
Token hard
Token ID1327
Feature activation+8.40e-03

Pos logit contributions
 now+0.011
 myself+0.009
 my+0.009
 be+0.009
 hurts+0.008
Neg logit contributions
ait-0.029
ount-0.028
ada-0.027
enced-0.026
line-0.026
Token,
Token ID11
Feature activation-7.12e-03

Pos logit contributions
 wishes+0.007
 now+0.007
 hurts+0.007
 myself+0.007
ggy+0.007
Neg logit contributions
ait-0.017
ount-0.016
ada-0.016
enced-0.015
 pigs-0.015
Token mom
Token ID1995
Feature activation+2.59e-02

Pos logit contributions
ount+0.013
ait+0.013
enced+0.013
iers+0.012
ada+0.012
Neg logit contributions
 now-0.010
 be-0.009
 my-0.009
 myself-0.008
 hurts-0.007
Token in
Token ID287
Feature activation-1.26e-02

Pos logit contributions
 now+0.008
 be+0.007
 wishes+0.006
 hurts+0.006
-+0.006
Neg logit contributions
ait-0.021
ount-0.020
ada-0.019
enced-0.019
 911-0.018
Token his
Token ID465
Feature activation-2.76e-03

Pos logit contributions
ait+0.027
ount+0.026
ada+0.025
enced+0.024
umbers+0.024
Neg logit contributions
 now-0.013
 be-0.011
 wishes-0.011
 myself-0.010
 my-0.010
Token eye
Token ID4151
Feature activation+6.68e-03

Pos logit contributions
ait+0.004
ada+0.004
line+0.004
ount+0.004
 911+0.004
Neg logit contributions
 now-0.003
 myself-0.003
ggy-0.003
 hurts-0.003
 Raven-0.003
Token.
Token ID13
Feature activation-2.78e-03

Pos logit contributions
 now+0.006
 be+0.006
 wishes+0.005
 hurts+0.005
 myself+0.005
Neg logit contributions
ait-0.015
ount-0.014
ada-0.013
enced-0.013
 911-0.013
Token "
Token ID366
Feature activation+2.81e-03

Pos logit contributions
ait+0.005
ount+0.005
ada+0.004
enced+0.004
 pigs+0.004
Neg logit contributions
 now-0.003
 hurts-0.003
 wishes-0.003
 myself-0.003
 be-0.003
TokenWill
Token ID8743
Feature activation+7.10e-02

Pos logit contributions
 hurts+0.005
 wishes+0.004
 now+0.004
 lands+0.004
ggy+0.004
Neg logit contributions
ait-0.004
ount-0.004
ada-0.003
hot-0.003
enced-0.003
Token my
Token ID616
Feature activation+2.25e-02

Pos logit contributions
 wishes+0.077
 eyel+0.075
 hurts+0.067
Just+0.067
ggy+0.066
Neg logit contributions
ait-0.110
ada-0.101
folk-0.099
line-0.098
ount-0.096
Token wish
Token ID4601
Feature activation+5.67e-02

Pos logit contributions
 hurts+0.025
 myself+0.025
 now+0.024
ggy+0.024
p+0.023
Neg logit contributions
ait-0.037
ada-0.034
ount-0.033
 pigs-0.032
 911-0.032
Token come
Token ID1282
Feature activation+4.77e-03

Pos logit contributions
 wishes+0.047
 now+0.046
 myself+0.046
ggy+0.045
 hurts+0.044
Neg logit contributions
ait-0.110
ount-0.105
ada-0.101
line-0.099
 pigs-0.098
Token true
Token ID2081
Feature activation+8.88e-03

Pos logit contributions
 wishes+0.004
 hurts+0.004
ggy+0.004
 eyel+0.004
 now+0.004
Neg logit contributions
ait-0.009
ount-0.009
ada-0.008
 911-0.008
 pigs-0.008
Token?"
Token ID1701
Feature activation-1.70e-02

Pos logit contributions
 wishes+0.009
 hurts+0.009
 myself+0.009
 eyel+0.008
ggy+0.008
Neg logit contributions
ait-0.016
ount-0.015
ada-0.015
 pigs-0.014
 911-0.014
Token children
Token ID1751
Feature activation-2.11e-02

Pos logit contributions
 now+0.006
 myself+0.006
 hurts+0.006
 wishes+0.006
 be+0.005
Neg logit contributions
ait-0.011
ount-0.011
ada-0.011
enced-0.010
 pigs-0.010
Token put
Token ID1234
Feature activation-5.28e-03

Pos logit contributions
ait+0.045
ount+0.042
ada+0.040
umbers+0.039
enced+0.039
Neg logit contributions
 now-0.023
 be-0.021
 wishes-0.018
 hurts-0.017
 then-0.017
Token on
Token ID319
Feature activation+2.40e-02

Pos logit contributions
ait+0.011
ount+0.011
enced+0.010
ada+0.010
arre+0.010
Neg logit contributions
 now-0.006
 my-0.005
 be-0.005
 myself-0.005
 hurts-0.005
Token their
Token ID511
Feature activation-2.09e-03

Pos logit contributions
 now+0.025
 my+0.025
 be+0.024
 myself+0.023
 wishes+0.023
Neg logit contributions
ount-0.048
ait-0.048
enced-0.046
line-0.045
ada-0.045
Token bath
Token ID7837
Feature activation+3.11e-02

Pos logit contributions
ait+0.004
ount+0.003
ada+0.003
enced+0.003
 pigs+0.003
Neg logit contributions
 now-0.003
 hurts-0.002
 wishes-0.002
 myself-0.002
 be-0.002
Tokenro
Token ID305
Feature activation+7.09e-02

Pos logit contributions
 now+0.042
 hurts+0.038
 be+0.038
 myself+0.037
 wishes+0.037
Neg logit contributions
ait-0.053
ount-0.051
ada-0.048
enced-0.046
 pigs-0.044
Tokenbes
Token ID12636
Feature activation-2.22e-02

Pos logit contributions
 now+0.067
 wishes+0.066
 hurts+0.064
 myself+0.061
 wish+0.060
Neg logit contributions
ait-0.138
ount-0.132
ada-0.125
enced-0.123
line-0.121
Token and
Token ID290
Feature activation+2.53e-03

Pos logit contributions
ait+0.055
ount+0.054
ada+0.051
enced+0.049
 911+0.049
Neg logit contributions
 now-0.017
 be-0.011
 hurts-0.010
 next-0.010
 is-0.009
Token ran
Token ID4966
Feature activation-7.59e-03

Pos logit contributions
 now+0.003
 be+0.002
 hurts+0.002
 wishes+0.002
 myself+0.002
Neg logit contributions
ait-0.005
ount-0.005
ada-0.005
enced-0.005
iers-0.004
Token around
Token ID1088
Feature activation-2.12e-03

Pos logit contributions
ait+0.016
ount+0.015
ada+0.014
enced+0.014
umbers+0.014
Neg logit contributions
 now-0.008
 be-0.007
 hurts-0.006
 myself-0.006
 wishes-0.006
Token in
Token ID287
Feature activation+3.93e-03

Pos logit contributions
ait+0.005
ount+0.005
ada+0.005
enced+0.004
umbers+0.004
Neg logit contributions
 now-0.002
 be-0.002
 wishes-0.001
 next-0.001
 my-0.001
Token tired
Token ID10032
Feature activation+1.37e-02

Pos logit contributions
 now+0.014
 wishes+0.013
 hurts+0.013
 myself+0.012
 be+0.012
Neg logit contributions
ait-0.029
ount-0.028
ada-0.027
enced-0.026
line-0.025
Token and
Token ID290
Feature activation+2.11e-02

Pos logit contributions
 wishes+0.016
 hurts+0.015
 now+0.015
ggy+0.015
 myself+0.015
Neg logit contributions
ait-0.024
ount-0.022
ada-0.021
 pigs-0.021
line-0.021
Token sore
Token ID19597
Feature activation+7.12e-03

Pos logit contributions
 now+0.026
 myself+0.026
ggy+0.026
 wishes+0.025
 hurts+0.024
Neg logit contributions
ait-0.033
ount-0.032
enced-0.029
ada-0.029
 pigs-0.029
Token.
Token ID13
Feature activation+1.02e-02

Pos logit contributions
 wishes+0.008
 hurts+0.008
 now+0.008
ggy+0.008
 myself+0.008
Neg logit contributions
ait-0.012
ount-0.011
ada-0.011
 pigs-0.011
iers-0.011
Token My
Token ID2011
Feature activation+2.72e-02

Pos logit contributions
 now+0.011
 hurts+0.010
 wishes+0.010
ggy+0.010
 lands+0.010
Neg logit contributions
ait-0.019
ount-0.018
 pigs-0.017
ada-0.017
 911-0.017
Token heart
Token ID2612
Feature activation+7.07e-02

Pos logit contributions
 myself+0.032
 now+0.030
Just+0.028
ggy+0.028
 hurts+0.028
Neg logit contributions
ait-0.046
 pigs-0.042
 911-0.039
ount-0.039
 stret-0.039
Token hurts
Token ID20406
Feature activation+1.80e-02

Pos logit contributions
ggy+0.075
 myself+0.073
aura+0.069
Just+0.069
 eyel+0.067
Neg logit contributions
ait-0.112
ount-0.106
 pigs-0.100
line-0.100
ains-0.099
Token."
Token ID526
Feature activation-1.15e-03

Pos logit contributions
 now+0.018
 hurts+0.016
 myself+0.015
 be+0.015
 wishes+0.015
Neg logit contributions
ait-0.038
ount-0.037
ada-0.035
enced-0.034
iers-0.033
Token\n
Token ID198
Feature activation+4.95e-04

Pos logit contributions
ait+0.002
ount+0.002
ada+0.002
enced+0.002
 pigs+0.002
Neg logit contributions
 now-0.001
 hurts-0.001
 myself-0.001
 wishes-0.001
 be-0.001
Token\n
Token ID198
Feature activation+8.39e-03

Pos logit contributions
 now+0.001
 myself+0.001
 hurts+0.001
 wishes+0.001
 spell+0.001
Neg logit contributions
ait-0.001
ount-0.001
ada-0.001
enced-0.001
 pigs-0.001
TokenL
Token ID43
Feature activation-9.22e-04

Pos logit contributions
 now+0.011
 wishes+0.011
 hurts+0.011
 myself+0.010
 be+0.010
Neg logit contributions
ait-0.014
ount-0.013
ada-0.012
enced-0.012
 pigs-0.012
Token lovely
Token ID14081
Feature activation+4.25e-03

Pos logit contributions
 now+0.006
 be+0.005
 wishes+0.005
 hurts+0.005
 myself+0.005
Neg logit contributions
ait-0.015
ount-0.014
ada-0.014
enced-0.013
line-0.013
Token if
Token ID611
Feature activation+9.47e-03

Pos logit contributions
 now+0.004
 wishes+0.003
 be+0.003
 myself+0.003
 hurts+0.003
Neg logit contributions
ait-0.010
ount-0.009
ada-0.009
enced-0.009
umbers-0.008
Token you
Token ID345
Feature activation-3.55e-02

Pos logit contributions
 now+0.009
 hurts+0.008
 wishes+0.008
 myself+0.008
 be+0.008
Neg logit contributions
ait-0.019
ount-0.018
ada-0.018
enced-0.017
 pigs-0.017
Token put
Token ID1234
Feature activation+4.07e-04

Pos logit contributions
ait+0.070
ount+0.067
enced+0.064
 pigs+0.063
ada+0.063
Neg logit contributions
 now-0.041
 be-0.040
 wish-0.036
 wishes-0.035
 hurts-0.034
Token your
Token ID534
Feature activation+3.07e-02

Pos logit contributions
 now+0.000
 myself+0.000
 my+0.000
 wishes+0.000
 be+0.000
Neg logit contributions
ait-0.001
ount-0.001
enced-0.001
ada-0.001
ules-0.001
Token heart
Token ID2612
Feature activation+6.98e-02

Pos logit contributions
 now+0.032
 hurts+0.031
 wishes+0.031
 be+0.030
 myself+0.030
Neg logit contributions
ait-0.061
ount-0.060
ada-0.056
enced-0.055
line-0.052
Token into
Token ID656
Feature activation+1.06e-02

Pos logit contributions
 wishes+0.076
 now+0.075
 hurts+0.072
 myself+0.070
ggy+0.069
Neg logit contributions
ait-0.123
ount-0.118
ada-0.112
enced-0.109
line-0.108
Token it
Token ID340
Feature activation+1.09e-02

Pos logit contributions
 now+0.010
 wishes+0.009
 hurts+0.009
 myself+0.009
 be+0.009
Neg logit contributions
ait-0.022
ount-0.021
ada-0.020
enced-0.020
 pigs-0.019
Token.
Token ID13
Feature activation+8.57e-03

Pos logit contributions
 now+0.011
 hurts+0.010
 myself+0.010
 wishes+0.010
 be+0.010
Neg logit contributions
ait-0.022
ount-0.021
ada-0.020
enced-0.020
 pigs-0.019
Token Not
Token ID1892
Feature activation+7.07e-03

Pos logit contributions
 now+0.011
 hurts+0.011
 wishes+0.010
 myself+0.010
 be+0.010
Neg logit contributions
ait-0.014
ount-0.014
ada-0.013
enced-0.012
 pigs-0.012
Token only
Token ID691
Feature activation+2.04e-02

Pos logit contributions
 wishes+0.008
 hurts+0.008
 myself+0.008
 now+0.008
ggy+0.008
Neg logit contributions
ait-0.012
ount-0.011
ada-0.011
 pigs-0.011
 onions-0.010
Token you
Token ID345
Feature activation-1.17e-02

Pos logit contributions
ait+0.012
ount+0.011
ada+0.011
enced+0.010
 pigs+0.010
Neg logit contributions
 now-0.004
 myself-0.004
 wishes-0.004
 hurts-0.004
 be-0.003
Token.
Token ID13
Feature activation+1.59e-02

Pos logit contributions
ait+0.024
ount+0.023
ada+0.023
enced+0.022
 pigs+0.021
Neg logit contributions
 now-0.012
 myself-0.010
 hurts-0.010
 be-0.010
 wishes-0.010
Token It
Token ID632
Feature activation+2.40e-02

Pos logit contributions
 now+0.020
 hurts+0.019
 wishes+0.019
ggy+0.018
 myself+0.018
Neg logit contributions
ait-0.027
ount-0.026
ada-0.025
enced-0.023
hot-0.023
Token has
Token ID468
Feature activation+2.38e-02

Pos logit contributions
 now+0.022
 wishes+0.020
 myself+0.020
 hurts+0.019
ggy+0.019
Neg logit contributions
ait-0.047
ount-0.047
ada-0.044
enced-0.043
 pigs-0.043
Token a
Token ID257
Feature activation+1.86e-02

Pos logit contributions
 now+0.023
 hurts+0.023
ggy+0.022
 wishes+0.022
 myself+0.021
Neg logit contributions
ait-0.042
ount-0.042
ada-0.040
 pigs-0.039
enced-0.038
Token heart
Token ID2612
Feature activation+6.97e-02

Pos logit contributions
 myself+0.017
 now+0.017
 hurts+0.016
 wishes+0.016
ggy+0.015
Neg logit contributions
ait-0.034
ount-0.033
ada-0.033
 pigs-0.032
enced-0.031
Token and
Token ID290
Feature activation+3.24e-02

Pos logit contributions
 wishes+0.077
 hurts+0.070
 myself+0.069
ggy+0.068
 now+0.067
Neg logit contributions
ount-0.121
ait-0.120
line-0.113
ada-0.110
enced-0.108
Token it
Token ID340
Feature activation+1.99e-02

Pos logit contributions
ggy+0.032
aura+0.031
zag+0.031
Just+0.029
 myself+0.029
Neg logit contributions
 pigs-0.049
 onions-0.048
ait-0.048
 newspapers-0.047
ount-0.047
Token says
Token ID1139
Feature activation+2.02e-02

Pos logit contributions
 now+0.017
 myself+0.016
 wishes+0.016
ggy+0.016
 be+0.014
Neg logit contributions
ount-0.039
ait-0.038
ada-0.036
enced-0.035
 pigs-0.035
Token I
Token ID314
Feature activation-2.70e-02

Pos logit contributions
 now+0.024
 hurts+0.022
 wishes+0.022
 myself+0.022
 be+0.021
Neg logit contributions
ait-0.036
ount-0.035
ada-0.033
enced-0.032
 pigs-0.031
Token love
Token ID1842
Feature activation+3.26e-02

Pos logit contributions
ait+0.047
ount+0.046
ada+0.043
enced+0.042
 pigs+0.041
Neg logit contributions
 now-0.031
 myself-0.030
 hurts-0.030
 wishes-0.029
 be-0.028
Token hug
Token ID16225
Feature activation+1.49e-02

Pos logit contributions
 now+0.001
 hurts+0.001
 be+0.001
 wishes+0.001
 myself+0.001
Neg logit contributions
ait-0.002
ount-0.002
ada-0.002
enced-0.002
iers-0.002
Token and
Token ID290
Feature activation+9.22e-03

Pos logit contributions
 now+0.013
 be+0.011
 wishes+0.009
 hurts+0.009
-+0.008
Neg logit contributions
ait-0.034
ount-0.034
enced-0.031
ada-0.031
umbers-0.031
Token said
Token ID531
Feature activation+2.83e-03

Pos logit contributions
 myself+0.012
 hurts+0.011
 wishes+0.011
 now+0.011
ggy+0.010
Neg logit contributions
ait-0.014
ount-0.013
ada-0.013
 pigs-0.013
enced-0.012
Token,
Token ID11
Feature activation-2.16e-02

Pos logit contributions
 now+0.002
 hurts+0.002
 wishes+0.002
 myself+0.002
 be+0.002
Neg logit contributions
ait-0.007
ount-0.006
ada-0.006
enced-0.006
 pigs-0.006
Token "
Token ID366
Feature activation-8.59e-03

Pos logit contributions
ait+0.038
ount+0.035
enced+0.033
ada+0.032
umbers+0.031
Neg logit contributions
 now-0.032
 be-0.030
 myself-0.028
 next-0.027
Just-0.027
TokenWill
Token ID8743
Feature activation+6.97e-02

Pos logit contributions
ait+0.016
ount+0.015
ada+0.014
enced+0.013
 911+0.013
Neg logit contributions
 now-0.010
 myself-0.010
 be-0.010
 hurts-0.010
 wishes-0.009
Token you
Token ID345
Feature activation-2.63e-02

Pos logit contributions
 wishes+0.074
 eyel+0.067
 spell+0.064
Just+0.062
 wish+0.062
Neg logit contributions
ait-0.111
ada-0.109
ount-0.105
line-0.104
folk-0.104
Token play
Token ID711
Feature activation+1.09e-02

Pos logit contributions
ait+0.054
ount+0.052
ada+0.050
enced+0.049
 pigs+0.048
Neg logit contributions
 now-0.025
 hurts-0.022
 wishes-0.022
 myself-0.022
 be-0.022
Token with
Token ID351
Feature activation+5.01e-03

Pos logit contributions
 hurts+0.009
 wishes+0.009
 now+0.008
 myself+0.008
 be+0.008
Neg logit contributions
ait-0.022
ount-0.021
ada-0.020
enced-0.020
 pigs-0.020
Token me
Token ID502
Feature activation+4.40e-03

Pos logit contributions
 now+0.003
 myself+0.003
 hurts+0.002
 wishes+0.002
 be+0.002
Neg logit contributions
ait-0.012
ount-0.012
ada-0.011
enced-0.011
iers-0.011
Token?".
Token ID43634
Feature activation-4.95e-03

Pos logit contributions
 wishes+0.004
 hurts+0.004
 myself+0.004
 now+0.004
ggy+0.004
Neg logit contributions
ait-0.008
ount-0.008
enced-0.008
ada-0.008
 pigs-0.007
Token you
Token ID345
Feature activation-1.17e-02

Pos logit contributions
ait+0.011
ount+0.011
ada+0.010
enced+0.010
iers+0.010
Neg logit contributions
 now-0.004
 myself-0.003
 wishes-0.003
 be-0.003
 hurts-0.003
Token.
Token ID13
Feature activation+1.71e-02

Pos logit contributions
ait+0.025
ount+0.024
ada+0.023
enced+0.022
iers+0.021
Neg logit contributions
 now-0.011
 myself-0.010
 be-0.010
 hurts-0.010
 wishes-0.009
Token It
Token ID632
Feature activation+2.37e-02

Pos logit contributions
 now+0.021
 hurts+0.020
 wishes+0.020
ggy+0.019
 myself+0.019
Neg logit contributions
ait-0.029
ount-0.028
ada-0.027
enced-0.025
hot-0.025
Token has
Token ID468
Feature activation+2.16e-02

Pos logit contributions
 now+0.021
 wishes+0.019
 myself+0.019
 hurts+0.018
ggy+0.018
Neg logit contributions
ait-0.047
ount-0.046
ada-0.044
enced-0.043
 pigs-0.043
Token a
Token ID257
Feature activation+1.93e-02

Pos logit contributions
 now+0.021
 hurts+0.020
 wishes+0.019
ggy+0.019
 myself+0.019
Neg logit contributions
ait-0.040
ount-0.039
ada-0.037
 pigs-0.036
enced-0.036
Token heart
Token ID2612
Feature activation+6.92e-02

Pos logit contributions
 now+0.018
 myself+0.018
 hurts+0.016
 wishes+0.016
ggy+0.015
Neg logit contributions
ait-0.037
ount-0.035
ada-0.035
 pigs-0.034
enced-0.033
Token and
Token ID290
Feature activation+3.26e-02

Pos logit contributions
 wishes+0.073
 hurts+0.067
 now+0.067
 myself+0.066
ggy+0.065
Neg logit contributions
ait-0.123
ount-0.122
line-0.114
ada-0.113
 pigs-0.111
Token it
Token ID340
Feature activation+2.10e-02

Pos logit contributions
ggy+0.031
zag+0.029
 now+0.029
aura+0.029
 myself+0.028
Neg logit contributions
 pigs-0.052
ait-0.051
 onions-0.050
ount-0.050
ada-0.050
Token says
Token ID1139
Feature activation+1.99e-02

Pos logit contributions
 now+0.018
 myself+0.017
 wishes+0.017
ggy+0.017
 be+0.015
Neg logit contributions
ount-0.040
ait-0.040
ada-0.037
enced-0.037
 pigs-0.037
Token I
Token ID314
Feature activation-2.78e-02

Pos logit contributions
 now+0.021
 hurts+0.019
 wishes+0.019
 myself+0.019
 be+0.019
Neg logit contributions
ait-0.039
ount-0.037
ada-0.036
enced-0.035
 pigs-0.034
Token love
Token ID1842
Feature activation+3.11e-02

Pos logit contributions
ait+0.048
ount+0.046
ada+0.044
enced+0.042
 pigs+0.041
Neg logit contributions
 now-0.033
 myself-0.032
 hurts-0.032
 wishes-0.031
 be-0.030
Token not
Token ID407
Feature activation-7.04e-04

Pos logit contributions
 now+0.013
 hurts+0.012
 myself+0.012
 wishes+0.011
 be+0.011
Neg logit contributions
ait-0.033
ount-0.032
ada-0.031
enced-0.030
 pigs-0.029
Token your
Token ID534
Feature activation+7.50e-04

Pos logit contributions
ait+0.001
ount+0.001
enced+0.001
ada+0.001
umbers+0.001
Neg logit contributions
 now-0.001
 be-0.001
 myself-0.001
 my-0.001
 hurts-0.001
Token heart
Token ID2612
Feature activation+4.07e-02

Pos logit contributions
 now+0.001
 wishes+0.001
 hurts+0.001
 be+0.001
 myself+0.001
Neg logit contributions
ait-0.001
ount-0.001
ada-0.001
enced-0.001
hot-0.001
Token.
Token ID13
Feature activation-1.56e-03

Pos logit contributions
 wishes+0.048
 now+0.048
 myself+0.045
 hurts+0.044
ggy+0.044
Neg logit contributions
ait-0.068
ount-0.065
ada-0.062
enced-0.059
line-0.059
Token Your
Token ID3406
Feature activation+3.22e-02

Pos logit contributions
ait+0.003
ount+0.003
ada+0.003
enced+0.003
 pigs+0.002
Neg logit contributions
 now-0.002
 myself-0.002
 wishes-0.002
 hurts-0.002
 be-0.002
Token heart
Token ID2612
Feature activation+6.92e-02

Pos logit contributions
 now+0.036
 wishes+0.034
 be+0.034
 hurts+0.033
 myself+0.033
Neg logit contributions
ait-0.062
ount-0.061
enced-0.057
ada-0.057
hot-0.054
Token is
Token ID318
Feature activation+2.86e-02

Pos logit contributions
 myself+0.048
 wishes+0.046
 now+0.046
ggy+0.046
Just+0.044
Neg logit contributions
ait-0.135
ount-0.132
ada-0.124
line-0.123
arre-0.123
Token in
Token ID287
Feature activation+4.85e-03

Pos logit contributions
 now+0.025
 hurts+0.024
 wishes+0.023
 myself+0.023
 be+0.022
Neg logit contributions
ait-0.058
ount-0.056
ada-0.054
enced-0.052
 pigs-0.052
Token your
Token ID534
Feature activation-1.58e-04

Pos logit contributions
 now+0.004
 myself+0.004
 be+0.004
 wishes+0.004
 hurts+0.004
Neg logit contributions
ait-0.010
ount-0.010
ada-0.010
enced-0.010
umbers-0.009
Token chest
Token ID7721
Feature activation+2.72e-02

Pos logit contributions
ait+0.000
ount+0.000
ada+0.000
enced+0.000
 pigs+0.000
Neg logit contributions
 now-0.000
 myself-0.000
 hurts-0.000
 wishes-0.000
 be-0.000
Token.
Token ID13
Feature activation+9.61e-03

Pos logit contributions
 now+0.031
 hurts+0.029
 wishes+0.029
 myself+0.028
 be+0.028
Neg logit contributions
ait-0.050
ount-0.048
ada-0.046
enced-0.045
 pigs-0.044
Token playing
Token ID2712
Feature activation-2.61e-02

Pos logit contributions
ait+0.034
ount+0.033
ada+0.031
enced+0.031
hot+0.029
Neg logit contributions
 now-0.025
 be-0.023
 wishes-0.022
 myself-0.022
 hurts-0.022
Token and
Token ID290
Feature activation+1.83e-02

Pos logit contributions
ait+0.067
ada+0.061
ount+0.059
ules+0.059
ylan+0.058
Neg logit contributions
 now-0.022
 be-0.018
 myself-0.016
--0.014
 my-0.013
Token never
Token ID1239
Feature activation+1.17e-02

Pos logit contributions
 now+0.022
 be+0.020
 hurts+0.019
 myself+0.018
 wishes+0.017
Neg logit contributions
ait-0.038
ount-0.035
ada-0.034
enced-0.033
umbers-0.032
Token touch
Token ID3638
Feature activation-1.31e-02

Pos logit contributions
 now+0.013
 wishes+0.012
 myself+0.012
 hurts+0.012
 be+0.011
Neg logit contributions
ait-0.022
ount-0.021
ada-0.020
enced-0.019
 pigs-0.019
Token anything
Token ID1997
Feature activation-4.77e-02

Pos logit contributions
ait+0.026
ount+0.024
enced+0.023
ada+0.023
ules+0.022
Neg logit contributions
 now-0.016
 be-0.014
 myself-0.014
 my-0.014
 hurts-0.014
Token they
Token ID484
Feature activation-9.79e-02

Pos logit contributions
ait+0.105
ount+0.096
ada+0.096
enced+0.093
umbers+0.089
Neg logit contributions
 now-0.051
 hurts-0.044
 be-0.042
 myself-0.041
 wishes-0.040
Token were
Token ID547
Feature activation-3.35e-02

Pos logit contributions
ait+0.215
ount+0.202
ada+0.196
 911+0.195
iers+0.194
Neg logit contributions
 now-0.100
 be-0.092
 hurts-0.082
 wishes-0.079
 wish-0.076
Token not
Token ID407
Feature activation-1.79e-02

Pos logit contributions
ait+0.075
ount+0.071
ada+0.068
enced+0.068
auc+0.065
Neg logit contributions
 now-0.037
 be-0.029
 myself-0.027
 hurts-0.027
 wishes-0.025
Token allowed
Token ID3142
Feature activation-5.10e-02

Pos logit contributions
ait+0.038
ount+0.036
ada+0.035
enced+0.035
auc+0.034
Neg logit contributions
 now-0.019
 be-0.016
 my-0.016
 myself-0.016
 hurts-0.015
Token to
Token ID284
Feature activation-2.70e-02

Pos logit contributions
ait+0.125
ount+0.118
enced+0.117
umbers+0.113
ada+0.113
Neg logit contributions
 now-0.041
 myself-0.032
 be-0.031
 next-0.028
 my-0.027
Token.
Token ID13
Feature activation-4.52e-03

Pos logit contributions
ait+0.055
ount+0.052
ada+0.050
enced+0.049
iers+0.048
Neg logit contributions
 now-0.029
 be-0.027
 hurts-0.025
 wishes-0.025
 myself-0.025
Token to
Token ID284
Feature activation+3.86e-04

Pos logit contributions
 now+0.002
 hurts+0.001
 myself+0.001
 be+0.001
 wishes+0.001
Neg logit contributions
ait-0.003
ount-0.003
ada-0.003
enced-0.003
 pigs-0.003
Token make
Token ID787
Feature activation-1.33e-02

Pos logit contributions
 now+0.000
 hurts+0.000
 wishes+0.000
 myself+0.000
 be+0.000
Neg logit contributions
ait-0.001
ount-0.001
ada-0.001
enced-0.001
 pigs-0.001
Token them
Token ID606
Feature activation-4.01e-02

Pos logit contributions
ait+0.027
ount+0.026
enced+0.024
ada+0.024
line+0.023
Neg logit contributions
 now-0.014
 wishes-0.013
 hurts-0.013
 myself-0.013
 be-0.013
Token do
Token ID466
Feature activation-3.47e-03

Pos logit contributions
ait+0.084
ount+0.081
ada+0.077
enced+0.076
 pigs+0.072
Neg logit contributions
 now-0.040
 hurts-0.037
 be-0.037
 wishes-0.036
 myself-0.036
Token what
Token ID644
Feature activation-1.13e-02

Pos logit contributions
ait+0.007
ount+0.007
ada+0.006
enced+0.006
 pigs+0.006
Neg logit contributions
 now-0.003
 hurts-0.003
 wishes-0.003
 myself-0.003
 be-0.003
Token we
Token ID356
Feature activation-9.78e-02

Pos logit contributions
ait+0.024
ount+0.023
ada+0.022
enced+0.022
line+0.021
Neg logit contributions
 now-0.010
 hurts-0.010
 be-0.010
 wishes-0.009
 myself-0.009
Token want
Token ID765
Feature activation-4.76e-02

Pos logit contributions
ait+0.208
ount+0.197
enced+0.184
ada+0.183
 911+0.181
Neg logit contributions
 be-0.108
 now-0.105
 hurts-0.095
 wishes-0.095
 wish-0.094
Token.
Token ID13
Feature activation+2.27e-03

Pos logit contributions
ait+0.116
ount+0.111
enced+0.108
ada+0.105
 paramed+0.102
Neg logit contributions
 now-0.044
 be-0.037
 myself-0.033
 wishes-0.031
 hurts-0.031
Token<|endoftext|>
Token ID50256
Feature activation-1.26e-03

Pos logit contributions
 now+0.003
 wishes+0.003
 hurts+0.003
 myself+0.003
 lands+0.003
Neg logit contributions
ait-0.003
ount-0.003
hot-0.003
ada-0.003
 911-0.003
TokenOnce
Token ID7454
Feature activation+9.43e-03

Pos logit contributions
ait+0.002
ount+0.002
ada+0.002
enced+0.002
 pigs+0.002
Neg logit contributions
 now-0.001
 hurts-0.001
 wishes-0.001
 myself-0.001
 be-0.001
Token upon
Token ID2402
Feature activation+1.09e-02

Pos logit contributions
 now+0.014
 hurts+0.013
 be+0.013
 wishes+0.013
 myself+0.013
Neg logit contributions
ait-0.014
ount-0.014
ada-0.012
enced-0.012
 pigs-0.012
Token They
Token ID1119
Feature activation-3.72e-02

Pos logit contributions
ait+0.017
ount+0.016
ada+0.015
enced+0.015
 pigs+0.014
Neg logit contributions
 now-0.013
 wishes-0.012
 be-0.012
 hurts-0.012
 myself-0.012
Token did
Token ID750
Feature activation+2.21e-02

Pos logit contributions
ait+0.077
ount+0.077
ada+0.074
 911+0.070
arre+0.069
Neg logit contributions
 now-0.037
 be-0.034
 wishes-0.032
 wish-0.030
 dream-0.028
Token not
Token ID407
Feature activation-2.40e-02

Pos logit contributions
 hurts+0.019
ww+0.018
 myself+0.018
bye+0.016
uch+0.016
Neg logit contributions
ait-0.040
ount-0.040
 911-0.038
 pigs-0.037
line-0.037
Token know
Token ID760
Feature activation-4.52e-03

Pos logit contributions
ait+0.044
ount+0.042
ada+0.040
enced+0.039
 pigs+0.038
Neg logit contributions
 now-0.028
 hurts-0.026
 wishes-0.026
 myself-0.026
 be-0.025
Token what
Token ID644
Feature activation-3.46e-02

Pos logit contributions
ait+0.009
ount+0.008
ada+0.008
enced+0.008
 pigs+0.007
Neg logit contributions
 now-0.005
 wishes-0.004
 hurts-0.004
 myself-0.004
 be-0.004
Token they
Token ID484
Feature activation-9.67e-02

Pos logit contributions
ait+0.071
ount+0.068
ada+0.065
enced+0.063
 pigs+0.061
Neg logit contributions
 now-0.034
 wishes-0.031
 hurts-0.031
 be-0.030
 myself-0.029
Token would
Token ID561
Feature activation-3.97e-02

Pos logit contributions
ait+0.174
ount+0.167
ada+0.158
enced+0.154
 pigs+0.149
Neg logit contributions
 now-0.114
 hurts-0.107
 myself-0.106
 wishes-0.106
 be-0.102
Token see
Token ID766
Feature activation-4.64e-02

Pos logit contributions
ait+0.066
ount+0.063
ada+0.059
enced+0.058
line+0.057
Neg logit contributions
 hurts-0.047
 now-0.046
 myself-0.045
 wishes-0.043
ggy-0.043
Token.
Token ID13
Feature activation-6.32e-03

Pos logit contributions
ait+0.097
ount+0.093
ada+0.091
enced+0.087
arre+0.086
Neg logit contributions
 now-0.050
 wishes-0.040
 myself-0.039
 hurts-0.039
 be-0.039
Token<|endoftext|>
Token ID50256
Feature activation-2.64e-03

Pos logit contributions
ait+0.012
ount+0.012
ada+0.011
enced+0.011
 pigs+0.010
Neg logit contributions
 now-0.007
 wishes-0.006
 be-0.006
 hurts-0.006
 myself-0.006
TokenTom
Token ID13787
Feature activation+1.79e-02

Pos logit contributions
ait+0.004
ount+0.004
ada+0.004
enced+0.003
 pigs+0.003
Neg logit contributions
 now-0.004
 hurts-0.004
 wishes-0.004
 myself-0.004
 be-0.004
Token inside
Token ID2641
Feature activation-2.52e-03

Pos logit contributions
ait+0.076
ount+0.072
enced+0.069
 pigs+0.068
ada+0.067
Neg logit contributions
 now-0.034
 be-0.028
 is-0.025
 was-0.024
 wishes-0.024
Token was
Token ID373
Feature activation-1.71e-02

Pos logit contributions
ait+0.006
ount+0.006
umbers+0.006
iers+0.005
enced+0.005
Neg logit contributions
 now-0.003
 be-0.002
 my-0.002
 is-0.002
 was-0.002
Token nothing
Token ID2147
Feature activation-4.20e-02

Pos logit contributions
ait+0.040
ount+0.037
ada+0.036
umbers+0.035
 pigs+0.035
Neg logit contributions
 now-0.017
 be-0.013
 wishes-0.012
 hurts-0.011
 my-0.011
Token like
Token ID588
Feature activation-2.15e-02

Pos logit contributions
ait+0.092
ount+0.087
ada+0.084
enced+0.083
umbers+0.082
Neg logit contributions
 now-0.044
 be-0.036
 hurts-0.034
 wishes-0.032
 my-0.031
Token what
Token ID644
Feature activation-3.01e-02

Pos logit contributions
ait+0.052
ount+0.051
ada+0.049
enced+0.048
umbers+0.047
Neg logit contributions
 now-0.019
 my-0.013
 wishes-0.013
 be-0.012
 myself-0.011
Token they
Token ID484
Feature activation-9.66e-02

Pos logit contributions
ait+0.073
ount+0.070
ada+0.067
enced+0.067
umbers+0.064
Neg logit contributions
 now-0.029
 be-0.029
 was-0.025
 is-0.024
 not-0.022
Token had
Token ID550
Feature activation-5.24e-02

Pos logit contributions
ait+0.211
ount+0.200
ada+0.191
iers+0.188
 pigs+0.188
Neg logit contributions
 now-0.099
 be-0.092
 wishes-0.083
 wish-0.081
 hurts-0.079
Token expected
Token ID2938
Feature activation-3.96e-02

Pos logit contributions
ait+0.108
ount+0.102
ada+0.099
 pigs+0.098
enced+0.096
Neg logit contributions
 now-0.056
 be-0.048
 wishes-0.047
 hurts-0.046
 myself-0.046
Token,
Token ID11
Feature activation-1.43e-02

Pos logit contributions
ait+0.094
ount+0.090
enced+0.086
 pigs+0.086
ada+0.085
Neg logit contributions
 now-0.036
 be-0.031
 myself-0.026
 is-0.024
 my-0.024
Token instead
Token ID2427
Feature activation+1.63e-02

Pos logit contributions
ait+0.034
ount+0.033
enced+0.032
umbers+0.031
ada+0.031
Neg logit contributions
 now-0.014
 be-0.012
 my-0.010
 wishes-0.010
 then-0.010
Token,
Token ID11
Feature activation-1.67e-02

Pos logit contributions
 now+0.016
 be+0.014
 hurts+0.012
 wishes+0.012
ggy+0.012
Neg logit contributions
ait-0.036
ount-0.034
ada-0.033
umbers-0.032
 pigs-0.032
Token what
Token ID644
Feature activation-1.80e-02

Pos logit contributions
ait+0.017
ount+0.016
enced+0.016
ada+0.016
line+0.015
Neg logit contributions
 now-0.008
 my-0.007
 myself-0.006
 wishes-0.006
 be-0.006
Token other
Token ID584
Feature activation-7.66e-03

Pos logit contributions
ait+0.044
ount+0.043
ada+0.042
enced+0.040
line+0.039
Neg logit contributions
 now-0.014
 be-0.013
 hurts-0.012
 wishes-0.011
ggy-0.011
Token y
Token ID331
Feature activation-4.13e-02

Pos logit contributions
ait+0.016
ount+0.015
ada+0.015
enced+0.014
line+0.014
Neg logit contributions
 now-0.009
 be-0.008
 was-0.008
 is-0.008
 wishes-0.008
Tokenummy
Token ID13513
Feature activation+5.01e-05

Pos logit contributions
ait+0.083
ount+0.080
ada+0.075
enced+0.075
iers+0.072
Neg logit contributions
 now-0.042
 hurts-0.038
 myself-0.038
 wishes-0.037
 be-0.037
Token things
Token ID1243
Feature activation-5.33e-02

Pos logit contributions
 now+0.000
 be+0.000
 wishes+0.000
 is+0.000
 hurts+0.000
Neg logit contributions
ait-0.000
ount-0.000
ada-0.000
enced-0.000
umbers-0.000
Token they
Token ID484
Feature activation-9.66e-02

Pos logit contributions
ait+0.120
ada+0.108
ount+0.105
enced+0.104
 paramed+0.103
Neg logit contributions
 now-0.060
 be-0.054
 is-0.048
 hurts-0.048
 was-0.047
Token could
Token ID714
Feature activation-2.71e-02

Pos logit contributions
ait+0.223
ount+0.207
 911+0.205
ada+0.203
auc+0.196
Neg logit contributions
 now-0.095
 be-0.088
 wishes-0.072
 wish-0.070
 dream-0.068
Token find
Token ID1064
Feature activation-4.86e-02

Pos logit contributions
ait+0.054
ount+0.052
ada+0.050
enced+0.048
 pigs+0.047
Neg logit contributions
 now-0.027
 hurts-0.025
 wishes-0.025
 myself-0.025
 be-0.025
Token.
Token ID13
Feature activation-5.59e-03

Pos logit contributions
ait+0.116
ada+0.108
ount+0.107
 paramed+0.104
enced+0.103
Neg logit contributions
 now-0.046
 myself-0.034
 be-0.034
 wishes-0.033
 is-0.032
Token But
Token ID887
Feature activation+2.80e-02

Pos logit contributions
ait+0.010
ount+0.010
ada+0.009
enced+0.009
line+0.009
Neg logit contributions
 now-0.007
 wishes-0.006
 hurts-0.006
 be-0.006
 myself-0.006
Token they
Token ID484
Feature activation-3.59e-02

Pos logit contributions
 myself+0.031
 hurts+0.031
 wishes+0.031
ggy+0.029
 now+0.028
Neg logit contributions
ait-0.047
ount-0.045
 pigs-0.044
ada-0.043
 911-0.042
Token we
Token ID356
Feature activation-5.70e-02

Pos logit contributions
 now+0.014
 hurts+0.013
 wishes+0.013
 myself+0.013
 be+0.013
Neg logit contributions
ait-0.024
ount-0.023
ada-0.022
enced-0.021
 pigs-0.021
Token never
Token ID1239
Feature activation+1.35e-02

Pos logit contributions
ait+0.134
ount+0.128
enced+0.122
ada+0.118
 stret+0.117
Neg logit contributions
 now-0.056
 be-0.054
 wishes-0.049
 wish-0.047
 hurts-0.044
Token know
Token ID760
Feature activation+1.25e-02

Pos logit contributions
 now+0.015
 myself+0.015
 wishes+0.014
ggy+0.014
 hurts+0.014
Neg logit contributions
ait-0.023
ount-0.023
ada-0.021
enced-0.021
 pigs-0.020
Token what
Token ID644
Feature activation-1.85e-02

Pos logit contributions
 now+0.013
 wishes+0.011
 my+0.011
 be+0.011
 myself+0.011
Neg logit contributions
ait-0.027
ount-0.026
enced-0.025
ada-0.025
umbers-0.023
Token opportunities
Token ID6443
Feature activation-3.90e-02

Pos logit contributions
ait+0.042
ount+0.041
ada+0.039
enced+0.038
 pigs+0.037
Neg logit contributions
 now-0.015
 hurts-0.014
 wishes-0.013
 be-0.013
 myself-0.013
Token we
Token ID356
Feature activation-9.59e-02

Pos logit contributions
ait+0.085
ount+0.080
ada+0.078
enced+0.077
 pigs+0.075
Neg logit contributions
 now-0.040
 be-0.036
 hurts-0.034
 wishes-0.033
 myself-0.032
Token might
Token ID1244
Feature activation-1.59e-02

Pos logit contributions
ait+0.197
ount+0.178
enced+0.168
ada+0.168
 stret+0.165
Neg logit contributions
 now-0.116
 be-0.115
 wishes-0.106
 hurts-0.102
 wish-0.101
Token miss
Token ID2051
Feature activation-1.99e-02

Pos logit contributions
ait+0.034
ount+0.033
ada+0.031
enced+0.031
 pigs+0.030
Neg logit contributions
 now-0.014
 be-0.013
 wishes-0.013
 hurts-0.013
 myself-0.013
Token out
Token ID503
Feature activation-3.55e-02

Pos logit contributions
ait+0.043
ount+0.041
enced+0.039
ada+0.039
 paramed+0.037
Neg logit contributions
 now-0.021
 be-0.018
 myself-0.018
 wishes-0.018
 hurts-0.017
Token on
Token ID319
Feature activation-1.46e-02

Pos logit contributions
ait+0.073
ount+0.071
ada+0.068
enced+0.066
umbers+0.064
Neg logit contributions
 now-0.039
 wishes-0.033
 myself-0.032
 be-0.032
 hurts-0.031
Token.
Token ID13
Feature activation+7.95e-03

Pos logit contributions
ait+0.030
ount+0.029
ada+0.027
enced+0.027
iers+0.025
Neg logit contributions
 now-0.016
 wishes-0.015
 myself-0.014
 hurts-0.014
 be-0.014
Token us
Token ID514
Feature activation-4.02e-02

Pos logit contributions
ait+0.054
ount+0.052
enced+0.051
ada+0.047
ules+0.045
Neg logit contributions
 now-0.040
 be-0.039
 wishes-0.037
 my-0.036
 myself-0.036
Token happy
Token ID3772
Feature activation+7.94e-04

Pos logit contributions
ait+0.088
ount+0.082
ada+0.078
enced+0.078
 pigs+0.074
Neg logit contributions
 now-0.040
 be-0.037
 wishes-0.036
 myself-0.035
 hurts-0.034
Token,
Token ID11
Feature activation-1.40e-02

Pos logit contributions
 now+0.001
 wishes+0.001
 be+0.001
 hurts+0.001
 myself+0.001
Neg logit contributions
ait-0.002
ount-0.002
ada-0.001
enced-0.001
umbers-0.001
Token not
Token ID407
Feature activation+4.67e-03

Pos logit contributions
ait+0.031
ount+0.030
enced+0.029
ada+0.029
umbers+0.028
Neg logit contributions
 now-0.014
 be-0.012
 wishes-0.011
 myself-0.011
 my-0.011
Token what
Token ID644
Feature activation-2.13e-02

Pos logit contributions
 now+0.005
 be+0.005
 wishes+0.005
 hurts+0.005
 myself+0.005
Neg logit contributions
ait-0.009
ount-0.009
enced-0.009
ada-0.009
umbers-0.008
Token we
Token ID356
Feature activation-9.58e-02

Pos logit contributions
ait+0.048
ount+0.046
enced+0.044
ada+0.043
rol+0.040
Neg logit contributions
 now-0.022
 hurts-0.020
 be-0.020
 wishes-0.019
 is-0.019
Token can
Token ID460
Feature activation-2.28e-02

Pos logit contributions
ait+0.213
ount+0.197
 911+0.193
 stret+0.192
enced+0.191
Neg logit contributions
 be-0.109
 now-0.105
 wish-0.097
 wishes-0.091
 hurts-0.079
Token sell
Token ID3677
Feature activation-4.58e-02

Pos logit contributions
ait+0.046
ount+0.044
ada+0.042
enced+0.042
 pigs+0.040
Neg logit contributions
 now-0.024
 be-0.021
 myself-0.021
 wishes-0.021
 hurts-0.021
Token or
Token ID393
Feature activation-4.44e-03

Pos logit contributions
ait+0.100
ount+0.093
enced+0.092
ada+0.088
ules+0.086
Neg logit contributions
 now-0.052
 be-0.044
 myself-0.043
 my-0.042
 wishes-0.041
Token trade
Token ID3292
Feature activation-1.88e-02

Pos logit contributions
ait+0.008
ount+0.008
ada+0.007
enced+0.007
 pigs+0.007
Neg logit contributions
 now-0.005
 hurts-0.005
 wishes-0.005
 myself-0.005
 be-0.005
Token them
Token ID606
Feature activation-3.09e-02

Pos logit contributions
ait+0.043
ount+0.039
enced+0.038
ada+0.038
ules+0.036
Neg logit contributions
 now-0.019
 wishes-0.016
 myself-0.016
 be-0.016
 my-0.015
Token if
Token ID611
Feature activation+8.25e-03

Pos logit contributions
 now+0.005
 hurts+0.005
 wishes+0.005
 myself+0.005
 be+0.005
Neg logit contributions
ait-0.005
ount-0.005
ada-0.004
enced-0.004
 pigs-0.004
Token it
Token ID340
Feature activation-1.21e-02

Pos logit contributions
 now+0.008
 wishes+0.007
 hurts+0.007
 be+0.007
 myself+0.007
Neg logit contributions
ait-0.017
ount-0.017
ada-0.016
enced-0.016
line-0.015
Token is
Token ID318
Feature activation+6.49e-04

Pos logit contributions
ait+0.028
ount+0.027
ada+0.026
enced+0.025
line+0.025
Neg logit contributions
 now-0.009
 hurts-0.009
 wishes-0.008
 be-0.008
 myself-0.007
Token not
Token ID407
Feature activation-1.04e-02

Pos logit contributions
 now+0.001
 be+0.001
 hurts+0.000
 myself+0.000
 wishes+0.000
Neg logit contributions
ait-0.001
ount-0.001
ada-0.001
enced-0.001
line-0.001
Token what
Token ID644
Feature activation-1.92e-02

Pos logit contributions
ait+0.024
ount+0.024
enced+0.023
ada+0.022
auc+0.022
Neg logit contributions
 now-0.010
 be-0.009
 my-0.008
 hurts-0.007
 myself-0.007
Token we
Token ID356
Feature activation-9.57e-02

Pos logit contributions
ait+0.045
ount+0.045
enced+0.042
ada+0.042
line+0.039
Neg logit contributions
 now-0.016
 hurts-0.015
 be-0.015
 wishes-0.014
 my-0.013
Token want
Token ID765
Feature activation-4.17e-02

Pos logit contributions
ait+0.204
ount+0.199
auc+0.190
enced+0.189
 911+0.186
Neg logit contributions
 be-0.108
 now-0.102
 wish-0.096
 wishes-0.090
 spell-0.078
Token to
Token ID284
Feature activation-1.86e-02

Pos logit contributions
ait+0.099
ount+0.095
enced+0.093
ada+0.089
 paramed+0.086
Neg logit contributions
 now-0.039
 be-0.033
 wishes-0.030
 myself-0.030
 my-0.030
Token do
Token ID466
Feature activation-1.55e-02

Pos logit contributions
ait+0.036
ount+0.035
enced+0.033
ada+0.032
iers+0.031
Neg logit contributions
 now-0.022
 be-0.022
 wishes-0.020
 myself-0.020
 hurts-0.019
Token.
Token ID13
Feature activation-2.04e-03

Pos logit contributions
ait+0.036
ount+0.033
ada+0.032
enced+0.032
 pigs+0.031
Neg logit contributions
 now-0.016
 myself-0.013
 be-0.013
 wishes-0.012
 hurts-0.012
Token\n
Token ID198
Feature activation-3.55e-03

Pos logit contributions
ait+0.004
ount+0.004
ada+0.004
enced+0.004
 pigs+0.004
Neg logit contributions
 now-0.002
 wishes-0.002
 hurts-0.002
 be-0.002
 myself-0.002
Token They
Token ID1119
Feature activation-3.00e-02

Pos logit contributions
ait+0.024
ount+0.022
ada+0.021
enced+0.020
 pigs+0.019
Neg logit contributions
 now-0.028
 wishes-0.027
 hurts-0.027
 myself-0.026
 be-0.026
Token did
Token ID750
Feature activation+1.32e-02

Pos logit contributions
ait+0.057
ount+0.055
ada+0.052
enced+0.050
 911+0.050
Neg logit contributions
 now-0.038
 be-0.035
 wishes-0.034
 hurts-0.033
 wish-0.032
Token not
Token ID407
Feature activation-2.06e-02

Pos logit contributions
 myself+0.010
 hurts+0.009
 now+0.009
 wishes+0.009
ww+0.009
Neg logit contributions
ount-0.025
ait-0.025
 911-0.024
 pigs-0.023
 onions-0.022
Token know
Token ID760
Feature activation-3.81e-03

Pos logit contributions
ait+0.040
ount+0.037
ada+0.036
enced+0.034
umbers+0.034
Neg logit contributions
 now-0.024
 be-0.023
 hurts-0.023
 wishes-0.022
 wish-0.021
Token what
Token ID644
Feature activation-3.11e-02

Pos logit contributions
ait+0.008
ount+0.007
ada+0.007
enced+0.007
arre+0.007
Neg logit contributions
 now-0.004
 wishes-0.004
 hurts-0.003
 be-0.003
 myself-0.003
Token they
Token ID484
Feature activation-9.55e-02

Pos logit contributions
ait+0.066
ount+0.063
ada+0.061
enced+0.059
line+0.057
Neg logit contributions
 now-0.030
 hurts-0.027
 wishes-0.027
 be-0.027
 myself-0.025
Token were
Token ID547
Feature activation-6.27e-02

Pos logit contributions
ait+0.182
ount+0.174
ada+0.166
enced+0.160
 pigs+0.157
Neg logit contributions
 now-0.107
 hurts-0.099
 wishes-0.098
 be-0.097
 myself-0.094
Token playing
Token ID2712
Feature activation-2.89e-02

Pos logit contributions
ait+0.135
ount+0.128
ada+0.122
enced+0.120
 pigs+0.115
Neg logit contributions
 now-0.066
 wishes-0.056
 be-0.056
 hurts-0.054
 myself-0.053
Token with
Token ID351
Feature activation-3.67e-02

Pos logit contributions
ait+0.076
ount+0.070
ada+0.070
ules+0.069
enced+0.065
Neg logit contributions
 now-0.023
 be-0.017
 wishes-0.016
 myself-0.016
 next-0.014
Token.
Token ID13
Feature activation-1.03e-02

Pos logit contributions
ait+0.079
ount+0.077
ada+0.074
enced+0.073
arre+0.071
Neg logit contributions
 now-0.039
 wishes-0.035
 hurts-0.031
 be-0.031
 my-0.029
Token They
Token ID1119
Feature activation-3.31e-02

Pos logit contributions
ait+0.017
ount+0.016
ada+0.015
enced+0.015
 pigs+0.014
Neg logit contributions
 now-0.014
 wishes-0.014
 hurts-0.013
 myself-0.013
 be-0.013
TokenFire
Token ID13543
Feature activation-9.02e-03

Pos logit contributions
ait+0.047
ount+0.044
ada+0.042
enced+0.040
 pigs+0.039
Neg logit contributions
 now-0.043
 hurts-0.041
 wishes-0.041
 myself-0.041
 be-0.040
Token is
Token ID318
Feature activation+9.82e-03

Pos logit contributions
ait+0.015
ount+0.014
enced+0.014
ada+0.013
 bases+0.012
Neg logit contributions
 now-0.013
 be-0.012
 myself-0.011
 is-0.011
--0.011
Token not
Token ID407
Feature activation-7.44e-03

Pos logit contributions
 now+0.010
 be+0.008
 wishes+0.008
 my+0.007
 hurts+0.007
Neg logit contributions
ait-0.022
ount-0.021
ada-0.021
enced-0.020
umbers-0.019
Token a
Token ID257
Feature activation-2.35e-02

Pos logit contributions
ait+0.017
ount+0.017
ada+0.016
enced+0.016
umbers+0.015
Neg logit contributions
 now-0.007
 be-0.006
 my-0.005
 wishes-0.005
 myself-0.005
Token toy
Token ID13373
Feature activation-1.72e-02

Pos logit contributions
ait+0.048
ount+0.046
ada+0.044
enced+0.043
umbers+0.041
Neg logit contributions
 now-0.025
 wishes-0.023
 be-0.023
 hurts-0.022
 wish-0.021
Token"
Token ID1
Feature activation-9.55e-02

Pos logit contributions
ait+0.039
ount+0.038
enced+0.036
ada+0.035
arre+0.034
Neg logit contributions
 now-0.017
 be-0.014
 wishes-0.013
 hurts-0.012
 myself-0.012
Token on
Token ID319
Feature activation-2.10e-02

Pos logit contributions
ait+0.191
ount+0.183
ada+0.176
enced+0.172
 pigs+0.168
Neg logit contributions
 now-0.100
 be-0.088
 hurts-0.087
 myself-0.084
 wishes-0.079
Token the
Token ID262
Feature activation-2.19e-02

Pos logit contributions
ount+0.049
ait+0.048
ada+0.046
enced+0.045
ules+0.043
Neg logit contributions
 now-0.019
 be-0.017
 wishes-0.015
 my-0.015
 is-0.013
Token bottom
Token ID4220
Feature activation-1.49e-03

Pos logit contributions
ait+0.044
ount+0.043
ada+0.040
enced+0.040
ules+0.039
Neg logit contributions
 now-0.025
 be-0.024
 wishes-0.022
 myself-0.022
 hurts-0.021
Token.
Token ID13
Feature activation-1.49e-02

Pos logit contributions
ait+0.003
ount+0.003
ada+0.003
enced+0.003
arre+0.003
Neg logit contributions
 now-0.001
 be-0.001
--0.001
 next-0.001
 hurts-0.001
Token They
Token ID1119
Feature activation-2.00e-02

Pos logit contributions
ait+0.025
ada+0.024
ount+0.023
umbers+0.022
enced+0.022
Neg logit contributions
 now-0.020
 wishes-0.018
 be-0.018
 hurts-0.017
 myself-0.017
Token tell
Token ID1560
Feature activation-1.86e-02

Pos logit contributions
 hurts+0.018
 now+0.018
 wishes+0.017
 myself+0.017
ggy+0.016
Neg logit contributions
ait-0.028
ount-0.026
ada-0.025
 pigs-0.024
enced-0.024
Token all
Token ID477
Feature activation-1.14e-02

Pos logit contributions
ait+0.033
ount+0.031
umbers+0.030
enced+0.030
ada+0.029
Neg logit contributions
 now-0.027
 wishes-0.025
 be-0.025
 wish-0.023
 my-0.022
Token their
Token ID511
Feature activation-1.55e-02

Pos logit contributions
ait+0.021
umbers+0.021
ount+0.021
iers+0.021
ada+0.020
Neg logit contributions
 now-0.014
 my-0.013
 be-0.013
 wishes-0.012
 myself-0.011
Token friends
Token ID2460
Feature activation-2.91e-02

Pos logit contributions
ait+0.031
ount+0.030
ada+0.028
enced+0.028
 pigs+0.027
Neg logit contributions
 now-0.016
 wishes-0.015
 hurts-0.014
 be-0.014
 myself-0.014
Token what
Token ID644
Feature activation-3.02e-02

Pos logit contributions
ait+0.068
umbers+0.066
iers+0.065
ount+0.064
ada+0.063
Neg logit contributions
 now-0.028
 be-0.022
 is-0.018
 was-0.018
 wishes-0.017
Token they
Token ID484
Feature activation-9.53e-02

Pos logit contributions
ait+0.062
ount+0.061
ada+0.058
umbers+0.055
enced+0.055
Neg logit contributions
 now-0.037
 be-0.034
ggy-0.034
 wishes-0.032
 was-0.032
Token had
Token ID550
Feature activation-4.68e-02

Pos logit contributions
ait+0.219
ount+0.208
 911+0.204
ada+0.201
iers+0.197
Neg logit contributions
 now-0.085
 be-0.074
 wishes-0.065
 wish-0.065
 dream-0.062
Token seen
Token ID1775
Feature activation-3.00e-02

Pos logit contributions
ait+0.087
ount+0.083
ada+0.080
enced+0.077
 pigs+0.076
Neg logit contributions
 now-0.055
 wishes-0.050
 be-0.049
 hurts-0.049
 myself-0.049
Token and
Token ID290
Feature activation+1.57e-03

Pos logit contributions
ait+0.070
ount+0.067
ada+0.066
enced+0.064
 paramed+0.063
Neg logit contributions
 now-0.029
 be-0.021
 myself-0.020
 next-0.018
 my-0.018
Token done
Token ID1760
Feature activation+1.85e-02

Pos logit contributions
 now+0.002
 be+0.002
 wishes+0.002
 hurts+0.002
 myself+0.002
Neg logit contributions
ait-0.003
ount-0.003
ada-0.003
enced-0.003
umbers-0.003
Token.
Token ID13
Feature activation-1.01e-02

Pos logit contributions
 now+0.021
 wishes+0.019
 be+0.018
 myself+0.017
ggy+0.017
Neg logit contributions
ait-0.038
ount-0.035
ada-0.034
enced-0.033
iers-0.032
Token learned
Token ID4499
Feature activation+2.43e-02

Pos logit contributions
 now+0.005
 be+0.005
 wishes+0.004
 hurts+0.004
 myself+0.004
Neg logit contributions
ait-0.013
ount-0.013
ada-0.012
enced-0.012
 911-0.012
Token what
Token ID644
Feature activation-7.91e-03

Pos logit contributions
 now+0.022
 hurts+0.020
 wishes+0.020
 myself+0.020
 be+0.019
Neg logit contributions
ait-0.051
ount-0.049
ada-0.047
enced-0.046
 pigs-0.045
Token "
Token ID366
Feature activation-4.60e-02

Pos logit contributions
ait+0.013
ount+0.012
ada+0.012
enced+0.011
 pigs+0.011
Neg logit contributions
 now-0.011
 hurts-0.010
 wishes-0.010
 be-0.010
 myself-0.010
Tokenra
Token ID430
Feature activation+3.69e-02

Pos logit contributions
ait+0.071
ount+0.068
ada+0.064
enced+0.061
 pigs+0.060
Neg logit contributions
 now-0.066
 wishes-0.062
 hurts-0.062
 myself-0.061
 be-0.060
Tokenre
Token ID260
Feature activation-1.37e-02

Pos logit contributions
 now+0.046
 be+0.042
 hurts+0.041
 myself+0.041
 wishes+0.040
Neg logit contributions
ait-0.067
ount-0.066
enced-0.061
 pigs-0.059
ada-0.057
Token"
Token ID1
Feature activation-9.51e-02

Pos logit contributions
ait+0.028
ount+0.026
enced+0.026
 pigs+0.025
ada+0.025
Neg logit contributions
 now-0.015
 myself-0.013
 be-0.013
 wishes-0.012
 my-0.012
Token meant
Token ID4001
Feature activation-2.96e-02

Pos logit contributions
ait+0.190
ount+0.180
ada+0.172
enced+0.169
 911+0.164
Neg logit contributions
 now-0.102
 be-0.089
 hurts-0.088
 myself-0.084
 wishes-0.081
Token.
Token ID13
Feature activation-1.10e-02

Pos logit contributions
ait+0.072
ount+0.070
enced+0.068
arre+0.064
ada+0.064
Neg logit contributions
 now-0.023
 be-0.019
ggy-0.018
 myself-0.017
--0.015
Token It
Token ID632
Feature activation-8.06e-04

Pos logit contributions
ait+0.020
ount+0.018
ada+0.018
enced+0.018
 pigs+0.017
Neg logit contributions
 now-0.013
 wishes-0.012
 be-0.012
 myself-0.011
 my-0.011
Token meant
Token ID4001
Feature activation+1.50e-03

Pos logit contributions
ait+0.002
ount+0.002
ada+0.002
enced+0.002
 pigs+0.002
Neg logit contributions
 now-0.000
 hurts-0.000
 wishes-0.000
 myself-0.000
 be-0.000
Token something
Token ID1223
Feature activation-3.48e-02

Pos logit contributions
 now+0.002
 be+0.002
 wishes+0.001
 hurts+0.001
 myself+0.001
Neg logit contributions
ait-0.003
ount-0.003
ada-0.003
enced-0.003
umbers-0.003
Token wondered
Token ID14028
Feature activation+3.14e-03

Pos logit contributions
ount+0.074
ait+0.073
 911+0.069
iers+0.069
ada+0.068
Neg logit contributions
 now-0.036
 be-0.032
 wishes-0.029
 wish-0.029
 dream-0.028
Token what
Token ID644
Feature activation-1.91e-02

Pos logit contributions
 now+0.004
 be+0.003
 then+0.003
 wishes+0.003
 my+0.003
Neg logit contributions
ait-0.006
ount-0.006
enced-0.006
ada-0.006
arre-0.006
Token other
Token ID584
Feature activation-1.09e-02

Pos logit contributions
ait+0.036
ount+0.035
ada+0.033
enced+0.032
 pigs+0.031
Neg logit contributions
 now-0.022
 wishes-0.020
 hurts-0.020
 be-0.020
 myself-0.019
Token brave
Token ID14802
Feature activation-2.97e-02

Pos logit contributions
ait+0.021
ount+0.020
ada+0.019
enced+0.018
arre+0.018
Neg logit contributions
 now-0.013
 wishes-0.013
 be-0.012
 hurts-0.011
 wish-0.011
Token things
Token ID1243
Feature activation-5.62e-02

Pos logit contributions
ait+0.062
ount+0.061
ada+0.058
enced+0.057
umbers+0.055
Neg logit contributions
 now-0.029
 wishes-0.026
 hurts-0.025
 be-0.024
 myself-0.022
Token they
Token ID484
Feature activation-9.47e-02

Pos logit contributions
ait+0.126
ada+0.118
ount+0.115
enced+0.112
umbers+0.111
Neg logit contributions
 now-0.058
 be-0.049
 is-0.045
 hurts-0.043
 was-0.042
Token could
Token ID714
Feature activation-2.47e-02

Pos logit contributions
ait+0.196
ount+0.186
ada+0.181
 911+0.175
enced+0.172
Neg logit contributions
 now-0.105
 be-0.096
 wishes-0.087
 wish-0.083
 hurts-0.082
Token do
Token ID466
Feature activation-2.52e-02

Pos logit contributions
ait+0.045
ount+0.043
ada+0.042
line+0.040
enced+0.040
Neg logit contributions
 now-0.025
 hurts-0.025
 wishes-0.024
 myself-0.024
ggy-0.022
Token.
Token ID13
Feature activation-1.58e-03

Pos logit contributions
ait+0.052
ount+0.050
ada+0.048
enced+0.046
 pigs+0.045
Neg logit contributions
 now-0.024
 myself-0.022
 wishes-0.022
 hurts-0.022
 be-0.021
Token They
Token ID1119
Feature activation-3.20e-02

Pos logit contributions
ait+0.002
ount+0.002
ada+0.002
enced+0.002
 pigs+0.002
Neg logit contributions
 now-0.002
 hurts-0.002
 wishes-0.002
 myself-0.002
 be-0.002
Token felt
Token ID2936
Feature activation+5.73e-03

Pos logit contributions
ait+0.070
ount+0.070
ada+0.067
 911+0.065
iers+0.064
Neg logit contributions
 now-0.031
 be-0.027
 wishes-0.026
 wish-0.025
 dream-0.023
Token asked
Token ID1965
Feature activation-1.39e-02

Pos logit contributions
ait+0.005
ount+0.005
enced+0.004
ada+0.004
umbers+0.004
Neg logit contributions
 now-0.002
 be-0.002
 then-0.002
 my-0.002
 was-0.002
Token if
Token ID611
Feature activation-9.94e-03

Pos logit contributions
ount+0.033
ait+0.032
arre+0.031
enced+0.031
ada+0.030
Neg logit contributions
 now-0.015
,-0.011
 be-0.010
 then-0.010
--0.010
Token there
Token ID612
Feature activation-1.72e-02

Pos logit contributions
ount+0.021
ait+0.021
enced+0.020
arre+0.020
ada+0.020
Neg logit contributions
 now-0.011
 be-0.009
 my-0.009
 wishes-0.008
 next-0.008
Token was
Token ID373
Feature activation-2.00e-02

Pos logit contributions
ait+0.040
ount+0.038
ada+0.037
enced+0.036
arre+0.036
Neg logit contributions
 now-0.015
 be-0.015
 wishes-0.014
 is-0.014
 was-0.014
Token anything
Token ID1997
Feature activation-4.94e-02

Pos logit contributions
ait+0.044
ount+0.042
ada+0.040
enced+0.039
line+0.038
Neg logit contributions
 now-0.017
 wishes-0.015
 hurts-0.015
 be-0.014
 myself-0.014
Token they
Token ID484
Feature activation-9.44e-02

Pos logit contributions
ait+0.114
ada+0.107
ount+0.106
enced+0.102
umbers+0.099
Neg logit contributions
 now-0.049
 be-0.038
 wishes-0.037
 hurts-0.036
 my-0.033
Token could
Token ID714
Feature activation-2.92e-02

Pos logit contributions
ait+0.208
ount+0.195
ada+0.189
umbers+0.182
iers+0.182
Neg logit contributions
 now-0.097
 be-0.087
 wishes-0.081
 hurts-0.077
 wish-0.076
Token do
Token ID466
Feature activation-2.63e-02

Pos logit contributions
ait+0.057
ount+0.054
ada+0.052
enced+0.050
 pigs+0.049
Neg logit contributions
 now-0.032
 wishes-0.029
 hurts-0.029
 be-0.029
 myself-0.028
Token to
Token ID284
Feature activation-2.31e-02

Pos logit contributions
ait+0.051
ount+0.048
ada+0.047
enced+0.045
 pigs+0.044
Neg logit contributions
 now-0.030
 wishes-0.026
 myself-0.026
 be-0.026
 hurts-0.026
Token help
Token ID1037
Feature activation-3.90e-02

Pos logit contributions
ait+0.044
ount+0.043
ada+0.041
enced+0.039
 pigs+0.038
Neg logit contributions
 now-0.025
 wishes-0.023
 hurts-0.023
 myself-0.023
 be-0.022
Token the
Token ID262
Feature activation-8.40e-03

Pos logit contributions
ait+0.089
ount+0.087
enced+0.083
arre+0.080
rol+0.080
Neg logit contributions
 be-0.042
 now-0.038
 spell-0.030
 wishes-0.030
--0.029
Token and
Token ID290
Feature activation+1.41e-02

Pos logit contributions
ait+0.028
ount+0.026
ada+0.025
enced+0.025
arre+0.024
Neg logit contributions
 now-0.015
 wishes-0.013
 myself-0.012
 be-0.012
 hurts-0.012
Token asked
Token ID1965
Feature activation-1.70e-03

Pos logit contributions
 now+0.015
 be+0.013
 wishes+0.013
 hurts+0.012
 myself+0.012
Neg logit contributions
ait-0.030
ount-0.029
ada-0.027
enced-0.027
 pigs-0.026
Token her
Token ID607
Feature activation+3.71e-03

Pos logit contributions
ait+0.004
ount+0.004
enced+0.003
ada+0.003
arre+0.003
Neg logit contributions
 now-0.002
 be-0.002
 wishes-0.001
 myself-0.001
 my-0.001
Token teacher
Token ID4701
Feature activation-1.70e-02

Pos logit contributions
 now+0.003
 wishes+0.003
 be+0.003
 spell+0.003
 hurts+0.003
Neg logit contributions
ait-0.008
ount-0.008
enced-0.008
ada-0.008
rol-0.007
Token what
Token ID644
Feature activation-3.34e-02

Pos logit contributions
ait+0.037
ount+0.034
ada+0.033
enced+0.033
 stret+0.033
Neg logit contributions
 now-0.018
 be-0.014
 is-0.013
 my-0.013
 next-0.013
Token they
Token ID484
Feature activation-9.44e-02

Pos logit contributions
ait+0.079
ount+0.077
ada+0.073
enced+0.071
 pigs+0.068
Neg logit contributions
 now-0.027
 be-0.023
 wishes-0.023
 hurts-0.023
ggy-0.022
Token meant
Token ID4001
Feature activation-4.37e-02

Pos logit contributions
ait+0.183
ount+0.169
ada+0.157
 pigs+0.156
 stret+0.156
Neg logit contributions
 now-0.119
 be-0.111
 wishes-0.109
 hurts-0.106
 wish-0.103
Token.
Token ID13
Feature activation-1.85e-02

Pos logit contributions
ait+0.106
ount+0.102
enced+0.098
ada+0.096
 stret+0.095
Neg logit contributions
 now-0.041
 be-0.031
 myself-0.028
 next-0.027
 wishes-0.025
Token Her
Token ID2332
Feature activation+1.91e-02

Pos logit contributions
ait+0.035
ount+0.034
ada+0.032
 pigs+0.031
enced+0.031
Neg logit contributions
 now-0.021
 be-0.020
 hurts-0.020
 wishes-0.019
 myself-0.019
Token teacher
Token ID4701
Feature activation+1.64e-03

Pos logit contributions
 now+0.024
 myself+0.023
 hurts+0.023
 wishes+0.022
 be+0.022
Neg logit contributions
ait-0.033
ount-0.031
ada-0.030
enced-0.029
 pigs-0.028
Token said
Token ID531
Feature activation-4.95e-03

Pos logit contributions
 now+0.002
 be+0.002
 wishes+0.001
 was+0.001
ggy+0.001
Neg logit contributions
ait-0.003
ount-0.003
enced-0.003
ada-0.003
line-0.003
Token understand
Token ID1833
Feature activation-1.50e-02

Pos logit contributions
ait+0.032
ount+0.030
ada+0.028
enced+0.028
 pigs+0.027
Neg logit contributions
 now-0.021
 be-0.020
 hurts-0.020
 wishes-0.020
 myself-0.019
Token what
Token ID644
Feature activation-2.06e-02

Pos logit contributions
ait+0.030
ount+0.029
enced+0.027
ada+0.027
umbers+0.026
Neg logit contributions
 now-0.019
 be-0.017
 wishes-0.016
 hurts-0.015
 my-0.015
Token "
Token ID366
Feature activation-5.16e-02

Pos logit contributions
ait+0.046
ount+0.044
ada+0.042
enced+0.041
line+0.039
Neg logit contributions
 now-0.020
 be-0.017
 hurts-0.017
 wishes-0.017
ggy-0.016
Tokenfor
Token ID1640
Feature activation+2.76e-03

Pos logit contributions
ait+0.078
ount+0.076
ada+0.072
enced+0.068
hot+0.067
Neg logit contributions
 now-0.073
 wishes-0.071
 hurts-0.068
 myself-0.068
 be-0.067
Tokengive
Token ID26535
Feature activation-5.53e-03

Pos logit contributions
 now+0.003
 myself+0.003
 be+0.003
 wishes+0.002
 hurts+0.002
Neg logit contributions
ait-0.006
ount-0.005
enced-0.005
ada-0.005
 pigs-0.005
Token"
Token ID1
Feature activation-9.42e-02

Pos logit contributions
ait+0.012
ount+0.011
enced+0.011
auc+0.010
iers+0.010
Neg logit contributions
 now-0.006
 hurts-0.006
 myself-0.005
 be-0.005
 my-0.005
Token meant
Token ID4001
Feature activation-3.75e-02

Pos logit contributions
ait+0.188
ount+0.179
ada+0.169
enced+0.166
 pigs+0.162
Neg logit contributions
 now-0.100
 hurts-0.089
 be-0.088
 myself-0.085
 wishes-0.084
Token,
Token ID11
Feature activation-3.06e-02

Pos logit contributions
ait+0.101
ount+0.099
enced+0.092
rol+0.089
umbers+0.089
Neg logit contributions
 now-0.026
 be-0.021
 hurts-0.017
ggy-0.017
 myself-0.015
Token but
Token ID475
Feature activation+6.51e-03

Pos logit contributions
ait+0.066
ount+0.064
enced+0.060
ada+0.059
umbers+0.058
Neg logit contributions
 now-0.033
 be-0.029
 myself-0.025
 wishes-0.025
 then-0.025
Token the
Token ID262
Feature activation+4.11e-04

Pos logit contributions
 hurts+0.008
 myself+0.008
 wishes+0.008
 now+0.008
ggy+0.007
Neg logit contributions
ait-0.011
ount-0.010
ada-0.010
 pigs-0.010
enced-0.010
Token dolphin
Token ID44136
Feature activation-8.70e-03

Pos logit contributions
 myself+0.001
 now+0.001
 hurts+0.001
ggy+0.001
 wishes+0.001
Neg logit contributions
ait-0.000
ada-0.000
 pigs-0.000
line-0.000
ount-0.000
Token to
Token ID284
Feature activation-1.62e-02

Pos logit contributions
ait+0.024
ount+0.023
ada+0.022
enced+0.022
umbers+0.021
Neg logit contributions
 now-0.009
 be-0.008
 hurts-0.008
 wishes-0.008
 myself-0.008
Token try
Token ID1949
Feature activation+1.43e-02

Pos logit contributions
ait+0.033
ount+0.031
ada+0.029
enced+0.029
iers+0.028
Neg logit contributions
 now-0.017
 be-0.016
 wishes-0.016
 hurts-0.016
 myself-0.015
Token and
Token ID290
Feature activation+1.91e-02

Pos logit contributions
 now+0.013
 hurts+0.012
 be+0.011
 wishes+0.011
 myself+0.011
Neg logit contributions
ait-0.032
ount-0.030
ada-0.029
enced-0.029
iers-0.027
Token get
Token ID651
Feature activation-6.12e-03

Pos logit contributions
 myself+0.019
 now+0.019
 wishes+0.018
 hurts+0.018
ggy+0.018
Neg logit contributions
ait-0.032
ount-0.032
 pigs-0.030
ada-0.030
line-0.030
Token what
Token ID644
Feature activation-1.72e-02

Pos logit contributions
ait+0.012
ount+0.012
ada+0.011
enced+0.011
 pigs+0.010
Neg logit contributions
 now-0.006
 hurts-0.006
 wishes-0.006
 be-0.006
 myself-0.006
Token we
Token ID356
Feature activation-9.41e-02

Pos logit contributions
ait+0.036
ount+0.035
ada+0.033
enced+0.032
 pigs+0.032
Neg logit contributions
 now-0.015
 wishes-0.014
 hurts-0.014
 myself-0.013
 be-0.013
Token want
Token ID765
Feature activation-4.27e-02

Pos logit contributions
ait+0.237
ount+0.214
 stret+0.206
 911+0.205
enced+0.205
Neg logit contributions
 be-0.086
 now-0.078
 wish-0.072
 wishes-0.068
 hurts-0.063
Token.
Token ID13
Feature activation+5.42e-03

Pos logit contributions
ait+0.105
ount+0.099
enced+0.096
ada+0.094
 paramed+0.092
Neg logit contributions
 now-0.040
 be-0.034
 myself-0.030
 my-0.028
 hurts-0.028
Token We
Token ID775
Feature activation-4.77e-02

Pos logit contributions
 now+0.007
 hurts+0.006
 wishes+0.006
 myself+0.006
 be+0.006
Neg logit contributions
ait-0.009
ount-0.009
ada-0.008
enced-0.008
 pigs-0.008
Token should
Token ID815
Feature activation+3.10e-02

Pos logit contributions
ait+0.109
ount+0.104
enced+0.099
iers+0.097
ules+0.097
Neg logit contributions
 now-0.046
 wishes-0.044
 be-0.043
 wish-0.042
 hurts-0.037
Token always
Token ID1464
Feature activation+1.96e-02

Pos logit contributions
 now+0.022
 be+0.021
 wishes+0.019
 myself+0.018
 hurts+0.017
Neg logit contributions
ait-0.075
ount-0.073
enced-0.069
ada-0.068
iers-0.068
Token They
Token ID1119
Feature activation-3.97e-02

Pos logit contributions
ait+0.019
ount+0.017
ada+0.016
enced+0.016
umbers+0.016
Neg logit contributions
 now-0.018
 wishes-0.017
 be-0.017
 hurts-0.016
 myself-0.016
Token felt
Token ID2936
Feature activation+3.48e-03

Pos logit contributions
ait+0.088
ount+0.085
 911+0.080
enced+0.078
ada+0.078
Neg logit contributions
 now-0.039
 be-0.036
 wishes-0.033
 wish-0.031
 hurts-0.029
Token sorry
Token ID7926
Feature activation-7.24e-03

Pos logit contributions
 now+0.004
 hurts+0.004
 myself+0.004
 wishes+0.004
 be+0.004
Neg logit contributions
ait-0.006
ount-0.006
ada-0.006
enced-0.006
 pigs-0.005
Token for
Token ID329
Feature activation-1.92e-02

Pos logit contributions
ait+0.010
ount+0.009
 911+0.009
ada+0.009
 pigs+0.009
Neg logit contributions
 myself-0.011
ggy-0.010
 now-0.010
 hurts-0.010
 wishes-0.010
Token what
Token ID644
Feature activation-2.98e-02

Pos logit contributions
ount+0.037
ait+0.036
enced+0.035
ada+0.033
arre+0.033
Neg logit contributions
 now-0.026
 be-0.024
 myself-0.021
 my-0.021
 wishes-0.021
Token they
Token ID484
Feature activation-9.41e-02

Pos logit contributions
ait+0.055
ount+0.053
enced+0.050
ada+0.050
umbers+0.048
Neg logit contributions
 now-0.039
 be-0.037
 wishes-0.035
ggy-0.035
 hurts-0.034
Token did
Token ID750
Feature activation-2.07e-02

Pos logit contributions
ait+0.195
ount+0.187
ada+0.177
enced+0.173
 pigs+0.170
Neg logit contributions
 now-0.093
 be-0.084
 hurts-0.083
 wishes-0.083
 myself-0.077
Token.
Token ID13
Feature activation-1.53e-02

Pos logit contributions
ait+0.045
ount+0.042
ada+0.040
enced+0.040
 pigs+0.040
Neg logit contributions
 now-0.022
 be-0.019
 wishes-0.018
 myself-0.018
 hurts-0.017
Token They
Token ID1119
Feature activation-4.57e-02

Pos logit contributions
ait+0.021
ount+0.019
ada+0.018
enced+0.018
umbers+0.016
Neg logit contributions
 now-0.026
 wishes-0.025
 be-0.024
 hurts-0.024
 myself-0.024
Token remembered
Token ID12086
Feature activation-2.86e-04

Pos logit contributions
ait+0.100
ount+0.097
iers+0.089
enced+0.088
 911+0.088
Neg logit contributions
 now-0.045
 be-0.041
 wishes-0.039
 wish-0.037
 hurts-0.036
Token how
Token ID703
Feature activation-4.10e-02

Pos logit contributions
ait+0.001
ount+0.001
enced+0.001
iers+0.001
umbers+0.001
Neg logit contributions
 now-0.000
 be-0.000
 wishes-0.000
 my-0.000
 then-0.000
Token Sally
Token ID25737
Feature activation+1.28e-03

Pos logit contributions
 now+0.019
 be+0.017
 myself+0.017
 hurts+0.017
 wishes+0.017
Neg logit contributions
ait-0.034
ount-0.033
ada-0.031
enced-0.031
 pigs-0.030
Token didn
Token ID1422
Feature activation-1.36e-02

Pos logit contributions
 now+0.001
 be+0.001
 hurts+0.001
ggy+0.001
 wishes+0.001
Neg logit contributions
ount-0.003
ait-0.003
ada-0.003
 911-0.003
enced-0.003
Token't
Token ID470
Feature activation-1.79e-02

Pos logit contributions
ait+0.027
ount+0.026
ada+0.025
hot+0.024
enced+0.024
Neg logit contributions
 now-0.012
 wishes-0.012
 hurts-0.012
 myself-0.011
 be-0.011
Token know
Token ID760
Feature activation-1.04e-02

Pos logit contributions
ait+0.041
ount+0.039
umbers+0.038
enced+0.037
 pigs+0.037
Neg logit contributions
 be-0.016
 now-0.016
 wish-0.013
 spell-0.012
 hurts-0.012
Token what
Token ID644
Feature activation-3.07e-02

Pos logit contributions
ait+0.021
ount+0.021
enced+0.020
ada+0.020
arre+0.019
Neg logit contributions
 now-0.012
 wishes-0.009
 be-0.009
 myself-0.009
 hurts-0.009
Token they
Token ID484
Feature activation-9.38e-02

Pos logit contributions
ait+0.073
ount+0.072
ada+0.068
enced+0.066
arre+0.065
Neg logit contributions
 now-0.025
 be-0.022
 wishes-0.020
ggy-0.020
 spell-0.019
Token were
Token ID547
Feature activation-5.74e-02

Pos logit contributions
ait+0.177
ount+0.167
ada+0.156
 pigs+0.155
 911+0.154
Neg logit contributions
 now-0.122
 be-0.115
 wishes-0.108
 wish-0.107
 hurts-0.102
Token.
Token ID13
Feature activation-1.14e-02

Pos logit contributions
ait+0.135
ount+0.128
enced+0.123
folk+0.120
ada+0.120
Neg logit contributions
 now-0.056
 be-0.045
 wishes-0.038
 myself-0.038
 next-0.035
Token\n
Token ID198
Feature activation-5.64e-03

Pos logit contributions
ait+0.025
ount+0.023
enced+0.022
 pigs+0.022
 stret+0.022
Neg logit contributions
 now-0.010
 be-0.009
 myself-0.008
 wishes-0.008
Just-0.007
Token\n
Token ID198
Feature activation-1.17e-03

Pos logit contributions
ait+0.011
ount+0.011
ada+0.010
enced+0.010
 pigs+0.010
Neg logit contributions
 now-0.006
 myself-0.005
 be-0.005
 wishes-0.005
 hurts-0.005
TokenSo
Token ID2396
Feature activation+2.97e-03

Pos logit contributions
ait+0.003
ount+0.002
 pigs+0.002
 911+0.002
iers+0.002
Neg logit contributions
 now-0.001
 be-0.001
 myself-0.001
Just-0.001
ggy-0.001
Token take
Token ID1011
Feature activation-2.04e-02

Pos logit contributions
 hurts+0.002
 wishes+0.002
 now+0.002
 myself+0.002
 eyel+0.002
Neg logit contributions
ait-0.003
ada-0.003
ount-0.003
line-0.003
hot-0.002
Token it
Token ID340
Feature activation+1.71e-03

Pos logit contributions
ait+0.041
ount+0.039
ada+0.037
enced+0.037
 pigs+0.035
Neg logit contributions
 now-0.021
 wishes-0.020
 hurts-0.019
 be-0.019
 myself-0.019
Token with
Token ID351
Feature activation-2.69e-02

Pos logit contributions
 now+0.002
 myself+0.002
 be+0.002
 hurts+0.002
 wishes+0.002
Neg logit contributions
ait-0.003
ount-0.003
ada-0.003
enced-0.003
 pigs-0.003
Token them
Token ID606
Feature activation-3.67e-02

Pos logit contributions
ount+0.049
ait+0.047
enced+0.045
ada+0.043
auc+0.043
Neg logit contributions
 now-0.040
 wishes-0.037
 be-0.037
 my-0.033
 myself-0.033
Token wherever
Token ID14530
Feature activation-1.78e-02

Pos logit contributions
ount+0.085
ait+0.085
enced+0.080
ada+0.080
auc+0.077
Neg logit contributions
 now-0.035
 be-0.028
 myself-0.024
 wishes-0.023
 hurts-0.023
Token they
Token ID484
Feature activation-9.37e-02

Pos logit contributions
ount+0.036
ait+0.036
ada+0.034
enced+0.033
arre+0.032
Neg logit contributions
 now-0.020
 be-0.019
 wishes-0.018
 hurts-0.018
ggy-0.017
Token went
Token ID1816
Feature activation-4.38e-02

Pos logit contributions
ait+0.172
ount+0.164
ada+0.159
enced+0.152
hot+0.149
Neg logit contributions
 now-0.103
 myself-0.099
 hurts-0.095
 wishes-0.093
ggy-0.091
Token.
Token ID13
Feature activation-8.92e-03

Pos logit contributions
ait+0.087
ount+0.082
ada+0.078
enced+0.076
 pigs+0.074
Neg logit contributions
 now-0.048
 hurts-0.044
 be-0.044
 wishes-0.044
 myself-0.043
Token And
Token ID843
Feature activation+8.98e-03

Pos logit contributions
ait+0.018
ount+0.017
ada+0.016
enced+0.016
iers+0.016
Neg logit contributions
 now-0.009
 wishes-0.008
 be-0.008
 hurts-0.008
 myself-0.008
Token so
Token ID523
Feature activation+1.10e-02

Pos logit contributions
 hurts+0.013
 myself+0.013
 wishes+0.012
 eyel+0.012
ggy+0.012
Neg logit contributions
ait-0.013
ount-0.012
ada-0.011
line-0.011
 pigs-0.011
Token they
Token ID484
Feature activation-4.36e-02

Pos logit contributions
 myself+0.011
 now+0.010
 hurts+0.010
 wishes+0.010
 eyel+0.009
Neg logit contributions
ait-0.021
ount-0.020
ada-0.019
enced-0.019
 pigs-0.019
Token away
Token ID1497
Feature activation+3.56e-02

Pos logit contributions
ait+0.013
ount+0.012
ada+0.011
enced+0.011
 pigs+0.011
Neg logit contributions
 now-0.008
 hurts-0.007
 wishes-0.007
 be-0.007
 myself-0.007
Token.
Token ID13
Feature activation+3.60e-03

Pos logit contributions
 now+0.037
 be+0.031
 myself+0.031
 hurts+0.030
 wishes+0.030
Neg logit contributions
ait-0.075
ount-0.073
ada-0.070
enced-0.069
umbers-0.068
Token
Token ID198
Feature activation-7.64e-03

Pos logit contributions
 now+0.004
 hurts+0.004
 wishes+0.004
 myself+0.004
 be+0.004
Neg logit contributions
ait-0.007
ount-0.006
ada-0.006
enced-0.006
 pigs-0.006
Token
Token ID198
Feature activation+1.64e-03

Pos logit contributions
ait+0.012
ount+0.012
ada+0.011
enced+0.011
 pigs+0.010
Neg logit contributions
 now-0.010
 myself-0.010
 hurts-0.010
 wishes-0.010
 spell-0.009
TokenS
Token ID50
Feature activation-9.61e-03

Pos logit contributions
 now+0.002
 myself+0.002
 hurts+0.002
 wishes+0.002
 be+0.002
Neg logit contributions
ait-0.003
ount-0.003
ada-0.003
enced-0.002
 pigs-0.002
Tokenara
Token ID3301
Feature activation+4.14e-02

Pos logit contributions
ait+0.019
ount+0.018
ada+0.017
enced+0.017
 pigs+0.016
Neg logit contributions
 now-0.010
 myself-0.009
 wishes-0.009
 hurts-0.009
 be-0.009
Token is
Token ID318
Feature activation+7.03e-04

Pos logit contributions
 myself+0.036
 now+0.035
 wishes+0.032
 hurts+0.031
ggy+0.031
Neg logit contributions
ait-0.083
ount-0.080
ada-0.076
enced-0.076
 pigs-0.074
Token very
Token ID845
Feature activation-1.21e-02

Pos logit contributions
 myself+0.001
 wishes+0.001
ww+0.001
 vantage+0.001
mm+0.001
Neg logit contributions
ait-0.001
ount-0.001
enced-0.001
 pigs-0.001
 911-0.001
Token sad
Token ID6507
Feature activation+2.02e-03

Pos logit contributions
ait+0.015
ount+0.015
enced+0.015
ada+0.014
line+0.014
Neg logit contributions
 myself-0.018
 now-0.017
 wishes-0.017
 spell-0.017
 unicorn-0.016
Token and
Token ID290
Feature activation+2.70e-02

Pos logit contributions
 myself+0.003
 wishes+0.003
 hurts+0.002
 now+0.002
ggy+0.002
Neg logit contributions
ait-0.003
ount-0.003
ada-0.003
enced-0.003
 pigs-0.003
Token angry
Token ID7954
Feature activation-1.52e-02

Pos logit contributions
 myself+0.037
ggy+0.037
 now+0.036
 spell+0.035
 wishes+0.034
Neg logit contributions
ait-0.036
ount-0.036
enced-0.034
ada-0.034
 911-0.033
Token thanked
Token ID26280
Feature activation+6.66e-03

Pos logit contributions
ait+0.004
ount+0.004
ada+0.004
enced+0.004
 pigs+0.004
Neg logit contributions
 now-0.002
 hurts-0.002
 wishes-0.002
 myself-0.002
 be-0.002
Token her
Token ID607
Feature activation+2.97e-04

Pos logit contributions
 now+0.008
 be+0.007
 myself+0.007
 my+0.007
 wishes+0.007
Neg logit contributions
ount-0.013
ait-0.013
enced-0.012
arre-0.012
ada-0.012
Token and
Token ID290
Feature activation+3.36e-03

Pos logit contributions
 now+0.000
 be+0.000
 myself+0.000
 wishes+0.000
 hurts+0.000
Neg logit contributions
ait-0.001
ount-0.001
enced-0.001
ada-0.001
umbers-0.001
Token put
Token ID1234
Feature activation-9.41e-03

Pos logit contributions
 now+0.004
 hurts+0.004
 myself+0.004
 wishes+0.004
 be+0.004
Neg logit contributions
ait-0.006
ount-0.006
ada-0.005
enced-0.005
 pigs-0.005
Token the
Token ID262
Feature activation+2.37e-02

Pos logit contributions
ait+0.019
ount+0.019
enced+0.018
ada+0.017
arre+0.017
Neg logit contributions
 now-0.010
 myself-0.008
 be-0.008
 my-0.008
 hurts-0.008
Token glass
Token ID5405
Feature activation+3.84e-02

Pos logit contributions
 now+0.023
 hurts+0.021
 myself+0.021
 wishes+0.021
 be+0.020
Neg logit contributions
ait-0.048
ount-0.046
ada-0.044
enced-0.043
 pigs-0.042
Token over
Token ID625
Feature activation-5.11e-03

Pos logit contributions
 now+0.039
 be+0.033
 hurts+0.033
 myself+0.032
 wishes+0.031
Neg logit contributions
ait-0.081
ount-0.079
ada-0.075
enced-0.072
ules-0.072
Token his
Token ID465
Feature activation+8.26e-03

Pos logit contributions
ount+0.011
ait+0.011
arre+0.011
enced+0.010
ada+0.010
Neg logit contributions
 now-0.005
 my-0.004
 myself-0.004
 be-0.004
 next-0.004
Token head
Token ID1182
Feature activation+1.26e-02

Pos logit contributions
 now+0.008
 hurts+0.008
 myself+0.008
 wishes+0.008
 be+0.007
Neg logit contributions
ait-0.016
ount-0.015
ada-0.014
 pigs-0.014
enced-0.014
Token.
Token ID13
Feature activation+7.07e-04

Pos logit contributions
 now+0.012
 be+0.010
 myself+0.009
 hurts+0.008
 next+0.008
Neg logit contributions
ait-0.028
ount-0.027
ada-0.026
enced-0.025
 pigs-0.025
Token The
Token ID383
Feature activation+5.48e-03

Pos logit contributions
 now+0.001
 myself+0.001
 wishes+0.001
 be+0.001
 hurts+0.001
Neg logit contributions
ait-0.001
ount-0.001
ada-0.001
enced-0.001
umbers-0.001
Token,
Token ID11
Feature activation-5.81e-03

Pos logit contributions
ait+0.013
ount+0.012
ada+0.012
enced+0.011
 pigs+0.011
Neg logit contributions
 now-0.005
 wishes-0.004
 myself-0.004
 hurts-0.004
 be-0.004
Token a
Token ID257
Feature activation-9.12e-03

Pos logit contributions
ait+0.011
ount+0.011
ada+0.011
enced+0.010
hot+0.010
Neg logit contributions
 wishes-0.005
 hurts-0.005
 myself-0.005
 now-0.005
 spell-0.005
Token little
Token ID1310
Feature activation+3.75e-04

Pos logit contributions
ait+0.018
ount+0.017
ada+0.017
 pigs+0.016
enced+0.016
Neg logit contributions
 now-0.009
 myself-0.009
 wishes-0.008
 hurts-0.008
 be-0.008
Token girl
Token ID2576
Feature activation+3.22e-02

Pos logit contributions
 now+0.000
 hurts+0.000
 be+0.000
 wishes+0.000
 myself+0.000
Neg logit contributions
ait-0.001
ount-0.001
ada-0.001
enced-0.001
iers-0.001
Token named
Token ID3706
Feature activation-7.40e-03

Pos logit contributions
 myself+0.060
ller+0.053
 hurts+0.052
 wishes+0.051
iously+0.051
Neg logit contributions
ait-0.035
ada-0.032
enced-0.031
ount-0.031
 pigs-0.031
Token Lucy
Token ID22162
Feature activation+3.84e-02

Pos logit contributions
ait+0.010
ount+0.010
ada+0.009
enced+0.009
 911+0.008
Neg logit contributions
 wishes-0.011
 now-0.011
 hurts-0.011
 myself-0.011
ggy-0.010
Token had
Token ID550
Feature activation-9.75e-03

Pos logit contributions
 myself+0.053
 now+0.046
 hurts+0.046
 wishes+0.044
ller+0.044
Neg logit contributions
ait-0.058
ount-0.055
ada-0.054
enced-0.053
 pigs-0.051
Token a
Token ID257
Feature activation+1.64e-02

Pos logit contributions
ait+0.020
ount+0.019
ada+0.018
enced+0.018
 pigs+0.018
Neg logit contributions
 now-0.008
 myself-0.008
 hurts-0.008
 wishes-0.008
 spell-0.007
Token question
Token ID1808
Feature activation+1.08e-02

Pos logit contributions
 myself+0.021
 now+0.019
 wishes+0.017
 hurts+0.017
p+0.017
Neg logit contributions
ait-0.028
ada-0.026
 pigs-0.026
ount-0.025
enced-0.024
Token.
Token ID13
Feature activation-1.71e-03

Pos logit contributions
 now+0.011
 hurts+0.010
 be+0.010
 wishes+0.010
 myself+0.009
Neg logit contributions
ait-0.022
ount-0.021
ada-0.020
enced-0.019
 pigs-0.019
Token She
Token ID1375
Feature activation+2.31e-02

Pos logit contributions
ait+0.003
ount+0.002
ada+0.002
enced+0.002
 pigs+0.002
Neg logit contributions
 now-0.003
 hurts-0.002
 wishes-0.002
 myself-0.002
 be-0.002
Token and
Token ID290
Feature activation+3.64e-02

Pos logit contributions
ggy+0.014
 wishes+0.014
 hurts+0.014
 myself+0.014
 now+0.014
Neg logit contributions
ait-0.024
ount-0.023
 pigs-0.022
ada-0.022
 911-0.022
Token ugly
Token ID13400
Feature activation+1.27e-02

Pos logit contributions
 now+0.041
 hurts+0.039
 wishes+0.038
 myself+0.037
 be+0.037
Neg logit contributions
ait-0.069
ount-0.066
ada-0.063
enced-0.061
 pigs-0.059
Token.
Token ID13
Feature activation+1.18e-02

Pos logit contributions
 now+0.013
 hurts+0.012
 wishes+0.012
 myself+0.012
 be+0.012
Neg logit contributions
ait-0.025
ount-0.024
ada-0.023
enced-0.022
 pigs-0.022
Token Put
Token ID5930
Feature activation-2.02e-03

Pos logit contributions
 now+0.016
ggy+0.016
 hurts+0.015
 wishes+0.015
 lands+0.015
Neg logit contributions
ait-0.017
ount-0.017
ada-0.016
hot-0.016
 911-0.015
Token the
Token ID262
Feature activation+2.19e-02

Pos logit contributions
ait+0.004
ount+0.004
ada+0.004
enced+0.004
 pigs+0.004
Neg logit contributions
 now-0.002
 hurts-0.002
 myself-0.002
 wishes-0.002
 be-0.002
Token tire
Token ID15867
Feature activation+4.38e-02

Pos logit contributions
 be+0.027
 hurts+0.025
 now+0.024
 my+0.023
 is+0.022
Neg logit contributions
ait-0.045
ount-0.044
ules-0.042
ada-0.042
enced-0.041
Token back
Token ID736
Feature activation+4.72e-02

Pos logit contributions
 now+0.049
 wishes+0.047
 hurts+0.046
 myself+0.046
 be+0.045
Neg logit contributions
ait-0.079
ount-0.076
ada-0.072
enced-0.070
 pigs-0.069
Token on
Token ID319
Feature activation+1.20e-02

Pos logit contributions
 now+0.052
 wishes+0.051
 hurts+0.050
 myself+0.048
 be+0.048
Neg logit contributions
ait-0.086
ount-0.083
ada-0.078
enced-0.076
 pigs-0.075
Token the
Token ID262
Feature activation-5.22e-03

Pos logit contributions
 now+0.013
 myself+0.011
 my+0.011
 be+0.010
 hurts+0.010
Neg logit contributions
ount-0.025
ait-0.025
enced-0.024
ada-0.024
ules-0.022
Token fire
Token ID2046
Feature activation+1.40e-02

Pos logit contributions
ait+0.011
ount+0.011
ada+0.011
ules+0.010
enced+0.010
Neg logit contributions
 be-0.005
 now-0.005
 hurts-0.005
 my-0.004
 myself-0.004
Token truck
Token ID7779
Feature activation+1.90e-02

Pos logit contributions
 now+0.018
 be+0.016
 hurts+0.016
 myself+0.015
 wishes+0.015
Neg logit contributions
ait-0.025
ount-0.024
ada-0.023
enced-0.023
ules-0.022
Token mum
Token ID25682
Feature activation+1.65e-02

Pos logit contributions
 myself+0.042
 now+0.037
p+0.036
ulously+0.036
Just+0.035
Neg logit contributions
ait-0.033
 pigs-0.031
ada-0.027
 apr-0.027
 chickens-0.026
Token said
Token ID531
Feature activation-4.90e-03

Pos logit contributions
 now+0.018
 be+0.015
 hurts+0.013
 was+0.013
 wishes+0.013
Neg logit contributions
ount-0.035
ait-0.034
ada-0.031
enced-0.031
line-0.031
Token,
Token ID11
Feature activation-1.92e-02

Pos logit contributions
ount+0.012
ait+0.012
ada+0.011
enced+0.011
arre+0.010
Neg logit contributions
 now-0.004
 be-0.003
 hurts-0.003
 wishes-0.003
ggy-0.002
Token �
Token ID6184
Feature activation-2.03e-02

Pos logit contributions
ount+0.039
ait+0.038
enced+0.034
ada+0.033
arre+0.033
Neg logit contributions
 now-0.025
 be-0.023
 next-0.021
 then-0.020
 my-0.020
Token�
Token ID95
Feature activation-2.94e-03

Pos logit contributions
ait+0.029
ount+0.027
ada+0.026
 pigs+0.025
enced+0.025
Neg logit contributions
 now-0.032
 myself-0.030
 hurts-0.029
 wishes-0.029
 be-0.029
Token€
Token ID26391
Feature activation+4.01e-02

Pos logit contributions
ait+0.005
ount+0.005
ada+0.004
 pigs+0.004
enced+0.004
Neg logit contributions
 now-0.004
 myself-0.004
 hurts-0.004
 wishes-0.004
 be-0.004
Token�
Token ID129
Feature activation+1.07e-02

Pos logit contributions
 now+0.050
 hurts+0.046
 wishes+0.046
 be+0.045
 myself+0.045
Neg logit contributions
ait-0.070
ount-0.068
ada-0.065
enced-0.062
 pigs-0.061
Token�
Token ID241
Feature activation+9.04e-03

Pos logit contributions
 now+0.017
 hurts+0.016
 wishes+0.016
 myself+0.016
 be+0.016
Neg logit contributions
ait-0.015
ount-0.014
ada-0.013
enced-0.012
 pigs-0.012
TokenNo
Token ID2949
Feature activation+5.86e-03

Pos logit contributions
 now+0.013
 hurts+0.013
 wishes+0.013
 myself+0.013
 be+0.012
Neg logit contributions
ait-0.013
ount-0.012
ada-0.012
enced-0.011
line-0.011
Token,
Token ID11
Feature activation+2.82e-03

Pos logit contributions
 now+0.005
 hurts+0.004
 wishes+0.004
 myself+0.004
 be+0.004
Neg logit contributions
ait-0.013
ount-0.012
ada-0.012
enced-0.012
 pigs-0.011
Token Jill
Token ID22772
Feature activation+2.65e-02

Pos logit contributions
 now+0.003
 wishes+0.003
 myself+0.003
 be+0.003
 hurts+0.003
Neg logit contributions
ait-0.006
ount-0.006
ada-0.005
enced-0.005
 pigs-0.005
Token.
Token ID13
Feature activation+4.55e-03

Pos logit contributions
 now+0.004
 wishes+0.004
 be+0.004
 hurts+0.004
 myself+0.004
Neg logit contributions
ait-0.010
ada-0.009
ount-0.009
enced-0.009
umbers-0.009
Token They
Token ID1119
Feature activation-1.10e-02

Pos logit contributions
 now+0.005
 be+0.005
 wishes+0.005
 myself+0.004
 my+0.004
Neg logit contributions
ait-0.009
ada-0.008
ount-0.008
iers-0.008
folk-0.008
Token make
Token ID787
Feature activation+1.40e-02

Pos logit contributions
ait+0.027
ount+0.026
ada+0.026
ules+0.026
arre+0.025
Neg logit contributions
 be-0.009
 now-0.008
 dream-0.007
 wish-0.007
 wishes-0.006
Token a
Token ID257
Feature activation+2.55e-02

Pos logit contributions
 now+0.016
 wishes+0.015
 hurts+0.014
 myself+0.014
 be+0.014
Neg logit contributions
ait-0.026
ount-0.025
ada-0.024
enced-0.023
 pigs-0.022
Token lot
Token ID1256
Feature activation+3.25e-02

Pos logit contributions
 now+0.027
 wishes+0.025
 hurts+0.025
 be+0.025
 myself+0.024
Neg logit contributions
ait-0.049
ount-0.048
ada-0.045
enced-0.044
iers-0.043
Token of
Token ID286
Feature activation-5.99e-03

Pos logit contributions
 now+0.032
 be+0.029
 hurts+0.028
 myself+0.028
 wishes+0.028
Neg logit contributions
ait-0.068
ada-0.062
ount-0.061
 pigs-0.060
enced-0.058
Token noise
Token ID7838
Feature activation-3.21e-03

Pos logit contributions
ait+0.015
ount+0.014
ada+0.014
line+0.013
enced+0.013
Neg logit contributions
 wishes-0.005
 hurts-0.004
 be-0.004
 wish-0.004
 now-0.004
Token and
Token ID290
Feature activation+2.06e-02

Pos logit contributions
ait+0.007
ount+0.007
ada+0.007
enced+0.007
umbers+0.006
Neg logit contributions
 now-0.003
 be-0.002
 hurts-0.002
 wishes-0.002
 myself-0.002
Token a
Token ID257
Feature activation+2.24e-03

Pos logit contributions
 now+0.022
 hurts+0.020
 wishes+0.020
 myself+0.020
 be+0.020
Neg logit contributions
ait-0.039
ount-0.038
ada-0.036
enced-0.035
 pigs-0.034
Token lot
Token ID1256
Feature activation+9.89e-03

Pos logit contributions
 now+0.002
 be+0.002
 wishes+0.002
 hurts+0.002
 wish+0.002
Neg logit contributions
ait-0.005
ount-0.005
ada-0.004
enced-0.004
umbers-0.004
Token of
Token ID286
Feature activation-1.30e-02

Pos logit contributions
 now+0.008
 be+0.007
 hurts+0.006
 wishes+0.006
 spell+0.006
Neg logit contributions
ait-0.023
ada-0.022
ount-0.021
umbers-0.021
 pigs-0.021
Token need
Token ID761
Feature activation+9.19e-03

Pos logit contributions
ount+0.086
ait+0.086
enced+0.083
iers+0.081
ules+0.081
Neg logit contributions
 now-0.037
 be-0.035
 wish-0.033
 hurts-0.028
 wishes-0.028
Token to
Token ID284
Feature activation+3.58e-02

Pos logit contributions
 now+0.009
 hurts+0.008
 wishes+0.008
 myself+0.008
 be+0.008
Neg logit contributions
ait-0.018
ount-0.018
ada-0.017
enced-0.017
 pigs-0.016
Token clean
Token ID3424
Feature activation+1.05e-02

Pos logit contributions
 now+0.031
 hurts+0.028
 wishes+0.028
 myself+0.027
 be+0.027
Neg logit contributions
ait-0.076
ount-0.074
ada-0.070
enced-0.069
 pigs-0.067
Token up
Token ID510
Feature activation+7.09e-03

Pos logit contributions
 now+0.009
 hurts+0.008
 wishes+0.008
 myself+0.008
 be+0.007
Neg logit contributions
ait-0.023
ount-0.022
ada-0.021
enced-0.021
 pigs-0.020
Token first
Token ID717
Feature activation+2.90e-02

Pos logit contributions
 now+0.006
 myself+0.005
 my+0.005
 be+0.005
 hurts+0.005
Neg logit contributions
ait-0.016
ount-0.016
ada-0.015
enced-0.015
 pigs-0.014
Token.
Token ID13
Feature activation+1.91e-02

Pos logit contributions
 now+0.028
 myself+0.024
 hurts+0.024
 be+0.024
 wishes+0.024
Neg logit contributions
ait-0.061
ount-0.060
ada-0.057
enced-0.055
 pigs-0.054
Token Otherwise
Token ID15323
Feature activation+2.33e-02

Pos logit contributions
 now+0.021
 myself+0.020
 hurts+0.020
 wishes+0.019
 be+0.019
Neg logit contributions
ait-0.036
ount-0.034
ada-0.033
enced-0.032
 pigs-0.030
Token,
Token ID11
Feature activation+1.28e-02

Pos logit contributions
 now+0.016
 be+0.013
 myself+0.013
 hurts+0.013
 wishes+0.012
Neg logit contributions
ait-0.056
ount-0.055
ada-0.052
enced-0.052
umbers-0.049
Token we
Token ID356
Feature activation-4.87e-02

Pos logit contributions
 now+0.011
 be+0.010
 myself+0.009
 hurts+0.009
 my+0.009
Neg logit contributions
ait-0.029
ount-0.028
enced-0.027
ada-0.027
umbers-0.025
Token won
Token ID1839
Feature activation+1.57e-02

Pos logit contributions
ount+0.096
ait+0.095
enced+0.092
ada+0.089
ules+0.087
Neg logit contributions
 now-0.056
 be-0.053
 hurts-0.051
 wish-0.046
 wishes-0.046
Token't
Token ID470
Feature activation+3.71e-03

Pos logit contributions
 now+0.013
 hurts+0.011
 wishes+0.011
 myself+0.011
 be+0.011
Neg logit contributions
ait-0.034
ount-0.033
ada-0.032
enced-0.031
 pigs-0.030
Token She
Token ID1375
Feature activation+1.68e-02

Pos logit contributions
 now+0.015
 wishes+0.015
 myself+0.014
 hurts+0.014
 spell+0.014
Neg logit contributions
ait-0.014
ount-0.013
ada-0.013
enced-0.012
line-0.012
Token was
Token ID373
Feature activation+1.71e-02

Pos logit contributions
 myself+0.026
mm+0.024
 hurts+0.022
ww+0.022
 spell+0.022
Neg logit contributions
ait-0.021
 officers-0.021
 pigs-0.020
enced-0.020
ada-0.019
Token scared
Token ID12008
Feature activation-9.29e-03

Pos logit contributions
 myself+0.021
 hurts+0.021
 wishes+0.020
mm+0.020
 spell+0.020
Neg logit contributions
ait-0.026
ount-0.025
enced-0.025
 pigs-0.024
ada-0.024
Token and
Token ID290
Feature activation+1.61e-02

Pos logit contributions
ait+0.019
ount+0.019
ada+0.018
enced+0.017
 pigs+0.017
Neg logit contributions
 now-0.009
 hurts-0.008
 wishes-0.008
 be-0.008
 myself-0.008
Token didn
Token ID1422
Feature activation+5.06e-04

Pos logit contributions
 now+0.017
 myself+0.017
 hurts+0.016
 wishes+0.016
 be+0.015
Neg logit contributions
ait-0.029
ount-0.028
ada-0.027
enced-0.026
 pigs-0.026
Token't
Token ID470
Feature activation-4.49e-03

Pos logit contributions
 wishes+0.001
 now+0.001
 wish+0.001
 myself+0.001
 spell+0.001
Neg logit contributions
ait-0.001
ount-0.001
ada-0.001
hot-0.001
 911-0.001
Token know
Token ID760
Feature activation+3.39e-03

Pos logit contributions
ait+0.008
ount+0.007
ada+0.007
enced+0.007
 pigs+0.007
Neg logit contributions
 now-0.005
 myself-0.005
 hurts-0.005
 wishes-0.005
ggy-0.005
Token what
Token ID644
Feature activation-2.86e-02

Pos logit contributions
 now+0.004
 wishes+0.004
 myself+0.004
 be+0.004
 hurts+0.004
Neg logit contributions
ait-0.006
ount-0.006
ada-0.005
enced-0.005
arre-0.005
Token to
Token ID284
Feature activation-4.01e-02

Pos logit contributions
ait+0.057
ount+0.055
ada+0.052
enced+0.051
 pigs+0.050
Neg logit contributions
 now-0.029
 hurts-0.027
 wishes-0.026
 myself-0.026
 be-0.026
Token do
Token ID466
Feature activation-3.65e-02

Pos logit contributions
ait+0.076
ount+0.072
ada+0.070
enced+0.066
hot+0.065
Neg logit contributions
 now-0.041
 hurts-0.039
 myself-0.039
 wishes-0.038
 eyel-0.035
Token.
Token ID13
Feature activation-1.87e-03

Pos logit contributions
ait+0.073
ount+0.070
ada+0.067
enced+0.065
 pigs+0.064
Neg logit contributions
 now-0.041
 myself-0.035
 hurts-0.035
 wishes-0.035
 be-0.035
Token "
Token ID366
Feature activation-1.34e-02

Pos logit contributions
ait+0.026
ount+0.024
ada+0.024
enced+0.022
umbers+0.022
Neg logit contributions
 now-0.015
 be-0.014
 hurts-0.013
 myself-0.013
 wishes-0.013
TokenMaybe
Token ID13300
Feature activation+3.03e-03

Pos logit contributions
ait+0.023
ount+0.022
ada+0.021
enced+0.020
 pigs+0.020
Neg logit contributions
 now-0.017
 myself-0.016
 hurts-0.016
 wishes-0.016
 be-0.015
Token we
Token ID356
Feature activation-6.41e-02

Pos logit contributions
 hurts+0.004
 eyel+0.004
zag+0.004
 myself+0.004
 wishes+0.004
Neg logit contributions
ait-0.004
 pigs-0.004
 911-0.004
ount-0.004
 onions-0.004
Token should
Token ID815
Feature activation+2.78e-02

Pos logit contributions
ait+0.140
ount+0.133
ada+0.130
line+0.128
ules+0.126
Neg logit contributions
 now-0.064
 wishes-0.060
 be-0.058
 wish-0.057
 hurts-0.055
Token go
Token ID467
Feature activation-6.22e-03

Pos logit contributions
 now+0.024
 be+0.022
 wishes+0.021
 hurts+0.021
 myself+0.021
Neg logit contributions
ait-0.061
ount-0.059
ada-0.057
enced-0.056
iers-0.054
Token out
Token ID503
Feature activation-5.46e-03

Pos logit contributions
ait+0.009
ount+0.009
line+0.008
enced+0.008
ada+0.008
Neg logit contributions
 wishes-0.009
 hurts-0.009
ggy-0.008
 eyel-0.008
 myself-0.008
Token."
Token ID526
Feature activation-1.77e-02

Pos logit contributions
ait+0.011
ount+0.011
ada+0.010
enced+0.010
 pigs+0.010
Neg logit contributions
 now-0.005
 wishes-0.005
 myself-0.005
 hurts-0.005
 be-0.005
Token
Token ID198
Feature activation-1.21e-02

Pos logit contributions
ait+0.030
ount+0.029
ada+0.028
enced+0.027
 pigs+0.026
Neg logit contributions
 now-0.022
 hurts-0.021
 myself-0.021
 wishes-0.021
 be-0.019
Token
Token ID198
Feature activation-4.33e-03

Pos logit contributions
ait+0.018
ount+0.018
ada+0.017
enced+0.016
 911+0.016
Neg logit contributions
 now-0.016
 myself-0.015
 wishes-0.015
 hurts-0.015
 spell-0.015
TokenBut
Token ID1537
Feature activation+1.44e-02

Pos logit contributions
ait+0.006
ount+0.006
ada+0.006
enced+0.006
 pigs+0.005
Neg logit contributions
 now-0.006
 wishes-0.006
 hurts-0.006
 myself-0.006
 be-0.006
Token before
Token ID878
Feature activation-1.89e-04

Pos logit contributions
 hurts+0.025
 wishes+0.024
 myself+0.023
 spell+0.022
 wish+0.021
Neg logit contributions
ait-0.017
ount-0.016
 911-0.015
 pigs-0.015
ada-0.014
TokenAt
Token ID2953
Feature activation+1.36e-02

Pos logit contributions
 now+0.011
 myself+0.010
 be+0.009
ggy+0.009
Just+0.009
Neg logit contributions
ait-0.026
ount-0.025
ada-0.025
 pigs-0.024
enced-0.023
Token the
Token ID262
Feature activation+5.64e-04

Pos logit contributions
 hurts+0.021
 wishes+0.021
ggy+0.020
 myself+0.019
 eyel+0.019
Neg logit contributions
ait-0.019
line-0.016
ount-0.015
ada-0.015
 911-0.015
Token end
Token ID886
Feature activation+3.55e-02

Pos logit contributions
 now+0.001
 hurts+0.001
 wishes+0.001
 myself+0.001
 be+0.001
Neg logit contributions
ait-0.001
ount-0.001
ada-0.001
enced-0.001
 pigs-0.001
Token of
Token ID286
Feature activation+1.60e-02

Pos logit contributions
 now+0.042
 hurts+0.038
 wishes+0.038
 be+0.038
 myself+0.037
Neg logit contributions
ait-0.065
ount-0.063
ada-0.060
enced-0.058
 pigs-0.056
Token the
Token ID262
Feature activation+7.85e-03

Pos logit contributions
 hurts+0.025
 wishes+0.025
 myself+0.024
 wish+0.023
 spell+0.023
Neg logit contributions
ait-0.021
ount-0.018
 pigs-0.018
ada-0.017
 Jon-0.017
Token day
Token ID1110
Feature activation+1.60e-02

Pos logit contributions
 now+0.007
 myself+0.006
 hurts+0.006
 wishes+0.006
 be+0.005
Neg logit contributions
ait-0.016
ount-0.015
ada-0.015
 pigs-0.015
enced-0.014
Token,
Token ID11
Feature activation-1.47e-02

Pos logit contributions
 now+0.011
 be+0.009
 wishes+0.007
 myself+0.007
 hurts+0.007
Neg logit contributions
ait-0.039
ount-0.038
ada-0.036
enced-0.036
 pigs-0.035
Token Sam
Token ID3409
Feature activation+8.72e-03

Pos logit contributions
ait+0.028
ount+0.027
enced+0.027
auc+0.025
ada+0.025
Neg logit contributions
 now-0.020
 my-0.018
 be-0.017
 myself-0.016
 then-0.015
Token was
Token ID373
Feature activation+1.09e-02

Pos logit contributions
 now+0.008
 be+0.006
 hurts+0.006
 wishes+0.006
 myself+0.006
Neg logit contributions
ait-0.019
ount-0.019
ada-0.018
enced-0.017
iers-0.017
Token very
Token ID845
Feature activation-5.73e-03

Pos logit contributions
 now+0.011
 hurts+0.011
 wishes+0.010
 myself+0.010
 be+0.010
Neg logit contributions
ait-0.021
ount-0.020
ada-0.019
enced-0.019
 pigs-0.018
Token tired
Token ID10032
Feature activation+5.17e-04

Pos logit contributions
ait+0.010
ount+0.009
ada+0.009
enced+0.009
line+0.008
Neg logit contributions
 now-0.007
 myself-0.007
 hurts-0.007
 wishes-0.007
 be-0.006
Token they
Token ID484
Feature activation-5.45e-02

Pos logit contributions
 now+0.024
 be+0.022
 wishes+0.022
 hurts+0.022
 myself+0.021
Neg logit contributions
ait-0.032
ount-0.031
ada-0.029
enced-0.028
iers-0.027
Token came
Token ID1625
Feature activation-2.23e-02

Pos logit contributions
ount+0.125
ait+0.125
 pigs+0.122
 911+0.118
iers+0.117
Neg logit contributions
 now-0.045
 be-0.039
 was-0.031
 wish-0.029
 wishes-0.028
Token back
Token ID736
Feature activation+1.54e-02

Pos logit contributions
ait+0.037
ount+0.036
ada+0.034
enced+0.033
 911+0.032
Neg logit contributions
 now-0.027
 hurts-0.027
 wishes-0.027
 myself-0.026
 be-0.025
Token,
Token ID11
Feature activation-1.00e-02

Pos logit contributions
 now+0.014
 hurts+0.013
 wishes+0.012
 myself+0.012
 be+0.012
Neg logit contributions
ait-0.032
ount-0.031
ada-0.030
enced-0.029
 pigs-0.029
Token they
Token ID484
Feature activation-4.16e-02

Pos logit contributions
ait+0.023
ount+0.023
enced+0.022
ales+0.021
ules+0.021
Neg logit contributions
 now-0.010
 be-0.009
 my-0.007
 was-0.007
 then-0.007
Token all
Token ID477
Feature activation-2.16e-02

Pos logit contributions
ount+0.100
ait+0.098
 911+0.094
iers+0.093
auc+0.092
Neg logit contributions
 now-0.044
 was-0.036
 be-0.032
 never-0.030
 flew-0.030
Token worked
Token ID3111
Feature activation-1.59e-03

Pos logit contributions
ait+0.049
ount+0.048
iers+0.046
umbers+0.045
 911+0.045
Neg logit contributions
 now-0.019
 be-0.017
 was-0.015
 flew-0.014
 wishes-0.014
Token together
Token ID1978
Feature activation+1.40e-03

Pos logit contributions
ait+0.003
ount+0.003
ada+0.003
enced+0.003
 pigs+0.003
Neg logit contributions
 now-0.002
 be-0.002
 wishes-0.002
 hurts-0.002
 myself-0.002
Token to
Token ID284
Feature activation+2.00e-02

Pos logit contributions
 now+0.001
 be+0.001
 next+0.001
 with+0.001
-+0.001
Neg logit contributions
ount-0.004
ait-0.004
ada-0.004
enced-0.003
 pigs-0.003
Token make
Token ID787
Feature activation-1.56e-03

Pos logit contributions
 now+0.024
 be+0.023
 myself+0.021
 wishes+0.021
 wish+0.019
Neg logit contributions
ait-0.039
ount-0.038
ada-0.035
enced-0.035
 pigs-0.034
Token the
Token ID262
Feature activation+5.07e-03

Pos logit contributions
ount+0.004
ait+0.004
enced+0.003
folk+0.003
ada+0.003
Neg logit contributions
 now-0.001
 wishes-0.001
 be-0.001
 myself-0.001
 my-0.001
Token.
Token ID13
Feature activation-6.41e-04

Pos logit contributions
ait+0.023
ount+0.022
ada+0.021
enced+0.021
 pigs+0.020
Neg logit contributions
 now-0.015
 hurts-0.014
 wishes-0.013
 myself-0.013
 be-0.013
Token She
Token ID1375
Feature activation+4.06e-03

Pos logit contributions
ait+0.001
ount+0.001
enced+0.001
ada+0.001
umbers+0.001
Neg logit contributions
 now-0.001
 be-0.001
 myself-0.001
 hurts-0.001
 wishes-0.001
Token says
Token ID1139
Feature activation-5.66e-03

Pos logit contributions
 myself+0.006
ggy+0.005
 now+0.005
zag+0.005
p+0.005
Neg logit contributions
ait-0.005
ount-0.005
ada-0.005
 pigs-0.005
auc-0.005
Token she
Token ID673
Feature activation-1.82e-02

Pos logit contributions
ait+0.013
ount+0.012
enced+0.012
ada+0.011
ales+0.011
Neg logit contributions
 now-0.006
 be-0.005
 wishes-0.005
 hurts-0.005
 my-0.004
Token will
Token ID481
Feature activation+3.46e-03

Pos logit contributions
ait+0.036
ount+0.034
ada+0.033
enced+0.032
 pigs+0.031
Neg logit contributions
 myself-0.018
 now-0.017
 hurts-0.015
 be-0.015
 my-0.015
Token not
Token ID407
Feature activation-1.29e-02

Pos logit contributions
 now+0.003
 hurts+0.003
 wishes+0.003
 myself+0.003
 be+0.002
Neg logit contributions
ait-0.007
ount-0.007
ada-0.007
enced-0.007
 pigs-0.007
Token do
Token ID466
Feature activation+1.89e-03

Pos logit contributions
ait+0.023
ount+0.023
ada+0.022
enced+0.021
 pigs+0.021
Neg logit contributions
 now-0.013
 wishes-0.013
 myself-0.013
 hurts-0.012
ggy-0.011
Token it
Token ID340
Feature activation-2.90e-02

Pos logit contributions
 now+0.002
 hurts+0.002
 myself+0.002
 wishes+0.002
 be+0.002
Neg logit contributions
ait-0.004
ount-0.004
ada-0.003
enced-0.003
 pigs-0.003
Token again
Token ID757
Feature activation+1.04e-02

Pos logit contributions
ait+0.053
ount+0.052
ada+0.049
enced+0.048
 pigs+0.046
Neg logit contributions
 wishes-0.030
 now-0.029
 hurts-0.029
 myself-0.028
 be-0.027
Token.
Token ID13
Feature activation-3.01e-03

Pos logit contributions
 now+0.012
 hurts+0.012
 wishes+0.012
 myself+0.012
 be+0.012
Neg logit contributions
ait-0.017
ount-0.016
ada-0.016
 pigs-0.015
enced-0.015
Token She
Token ID1375
Feature activation+6.91e-03

Pos logit contributions
ait+0.005
ount+0.005
ada+0.005
enced+0.005
umbers+0.005
Neg logit contributions
 now-0.004
 hurts-0.003
 myself-0.003
 be-0.003
 wishes-0.003
Token fix
Token ID4259
Feature activation+8.12e-03

Pos logit contributions
 myself+0.013
 hurts+0.012
 wishes+0.012
 now+0.011
 lands+0.011
Neg logit contributions
ait-0.012
 pigs-0.010
ada-0.010
 911-0.010
ount-0.010
Token it
Token ID340
Feature activation-1.12e-02

Pos logit contributions
 now+0.008
 hurts+0.007
 myself+0.007
 be+0.007
 wishes+0.007
Neg logit contributions
ait-0.017
ount-0.017
enced-0.016
ada-0.016
ules-0.015
Token together
Token ID1978
Feature activation-4.59e-03

Pos logit contributions
ait+0.021
ount+0.021
ada+0.020
enced+0.019
 pigs+0.019
Neg logit contributions
 now-0.012
 hurts-0.011
 wishes-0.011
 myself-0.011
 be-0.011
Token."
Token ID526
Feature activation-9.86e-03

Pos logit contributions
ait+0.010
ount+0.010
ada+0.009
enced+0.009
ules+0.009
Neg logit contributions
 now-0.005
 be-0.004
 myself-0.004
 hurts-0.004
 wishes-0.004
Token They
Token ID1119
Feature activation-3.34e-02

Pos logit contributions
ait+0.021
ount+0.021
enced+0.020
iers+0.019
ada+0.019
Neg logit contributions
 now-0.009
 be-0.009
 myself-0.008
 my-0.007
 wishes-0.007
Token looked
Token ID3114
Feature activation-2.79e-02

Pos logit contributions
ount+0.077
ait+0.077
ules+0.072
 pigs+0.070
ada+0.069
Neg logit contributions
 now-0.031
 be-0.031
 wish-0.027
 wishes-0.026
 dream-0.024
Token for
Token ID329
Feature activation-5.17e-03

Pos logit contributions
ait+0.062
ount+0.057
iers+0.055
ada+0.055
ules+0.055
Neg logit contributions
 now-0.031
 wishes-0.026
 wish-0.024
 then-0.024
 my-0.022
Token some
Token ID617
Feature activation-2.14e-02

Pos logit contributions
ount+0.013
ait+0.012
ada+0.012
enced+0.011
arre+0.011
Neg logit contributions
 now-0.004
 wishes-0.004
 be-0.004
 my-0.003
 wish-0.003
Token tools
Token ID4899
Feature activation-2.76e-02

Pos logit contributions
ount+0.043
ait+0.042
ada+0.040
enced+0.038
ules+0.037
Neg logit contributions
 now-0.025
 wishes-0.023
 be-0.022
 hurts-0.021
 spell-0.020
Token and
Token ID290
Feature activation-3.79e-03

Pos logit contributions
ount+0.074
ait+0.071
enced+0.069
ada+0.068
arre+0.068
Neg logit contributions
 now-0.021
 be-0.009
 next-0.009
 then-0.009
--0.008
Token started
Token ID2067
Feature activation+1.52e-02

Pos logit contributions
ait+0.007
ount+0.007
ada+0.006
enced+0.006
 pigs+0.006
Neg logit contributions
 now-0.004
 wishes-0.004
 be-0.004
 hurts-0.004
 myself-0.004
TokenHer
Token ID9360
Feature activation+1.46e-02

Pos logit contributions
 now+0.006
 hurts+0.006
 wishes+0.006
 myself+0.006
 be+0.006
Neg logit contributions
ait-0.008
ount-0.007
ada-0.007
enced-0.007
 pigs-0.007
Token mom
Token ID1995
Feature activation+8.03e-03

Pos logit contributions
 myself+0.024
 now+0.021
 hurts+0.021
Just+0.020
 my+0.019
Neg logit contributions
ait-0.020
ada-0.019
 pigs-0.018
ount-0.018
 carts-0.017
Token asked
Token ID1965
Feature activation-1.03e-02

Pos logit contributions
 now+0.008
 be+0.007
 hurts+0.006
 wishes+0.006
 myself+0.006
Neg logit contributions
ait-0.017
ount-0.017
ada-0.015
enced-0.015
iers-0.015
Token,
Token ID11
Feature activation-3.52e-02

Pos logit contributions
ait+0.027
ount+0.027
enced+0.025
ada+0.025
line+0.024
Neg logit contributions
 now-0.006
 be-0.004
 wishes-0.004
 my-0.003
 myself-0.003
Token "
Token ID366
Feature activation-1.58e-02

Pos logit contributions
ait+0.069
ount+0.066
enced+0.062
ales+0.060
ada+0.059
Neg logit contributions
 now-0.048
 be-0.044
 next-0.040
 then-0.038
 my-0.038
TokenWhat
Token ID2061
Feature activation-1.17e-02

Pos logit contributions
ait+0.031
ount+0.030
ada+0.028
enced+0.028
 911+0.026
Neg logit contributions
 now-0.019
 myself-0.017
 be-0.016
Just-0.016
 next-0.016
Token do
Token ID466
Feature activation-1.36e-03

Pos logit contributions
ait+0.024
ount+0.023
ada+0.022
enced+0.021
 pigs+0.021
Neg logit contributions
 now-0.010
 myself-0.010
 hurts-0.009
 wishes-0.009
 be-0.009
Token you
Token ID345
Feature activation-5.18e-02

Pos logit contributions
ait+0.002
ount+0.002
 pigs+0.002
ada+0.002
 onions+0.002
Neg logit contributions
 now-0.001
 wishes-0.001
 hurts-0.001
 myself-0.001
 eyel-0.001
Token mean
Token ID1612
Feature activation-2.59e-02

Pos logit contributions
ait+0.091
ount+0.087
ada+0.082
enced+0.080
 pigs+0.078
Neg logit contributions
 now-0.064
 hurts-0.060
 wishes-0.060
 myself-0.059
 be-0.059
Token,
Token ID11
Feature activation-3.17e-02

Pos logit contributions
ait+0.064
ount+0.062
enced+0.058
ada+0.057
umbers+0.056
Neg logit contributions
 now-0.018
 hurts-0.015
 be-0.014
 my-0.014
 myself-0.013
Token Lily
Token ID20037
Feature activation+1.49e-02

Pos logit contributions
ait+0.068
ount+0.065
enced+0.061
ada+0.061
iers+0.060
Neg logit contributions
 now-0.030
 be-0.027
 wishes-0.026
 myself-0.026
 hurts-0.025
Token tall
Token ID7331
Feature activation-1.12e-02

Pos logit contributions
ait+0.009
ount+0.008
ada+0.008
enced+0.008
 pigs+0.008
Neg logit contributions
 now-0.004
 hurts-0.004
 myself-0.004
 wishes-0.004
 be-0.004
Token metal
Token ID6147
Feature activation-6.05e-03

Pos logit contributions
ait+0.021
ount+0.021
ada+0.019
enced+0.019
iers+0.018
Neg logit contributions
 now-0.013
 be-0.012
 wishes-0.012
 hurts-0.012
 myself-0.011
Token rail
Token ID6787
Feature activation-2.42e-02

Pos logit contributions
ait+0.012
ount+0.011
ada+0.011
enced+0.010
ules+0.010
Neg logit contributions
 now-0.007
 be-0.006
 hurts-0.006
 wishes-0.006
 spell-0.006
Token near
Token ID1474
Feature activation-5.02e-02

Pos logit contributions
ait+0.061
ada+0.056
ount+0.056
umbers+0.053
 pigs+0.053
Neg logit contributions
 now-0.019
 be-0.014
 next-0.013
 then-0.012
--0.011
Token the
Token ID262
Feature activation-4.41e-02

Pos logit contributions
ount+0.109
ait+0.109
enced+0.101
ada+0.100
arre+0.100
Neg logit contributions
 now-0.050
 be-0.045
 myself-0.040
 wishes-0.039
 my-0.036
Token park
Token ID3952
Feature activation-1.09e-02

Pos logit contributions
ait+0.083
ount+0.079
ada+0.074
enced+0.072
iers+0.070
Neg logit contributions
 now-0.056
 be-0.052
 wishes-0.049
 hurts-0.048
 myself-0.048
Token.
Token ID13
Feature activation-8.72e-03

Pos logit contributions
ait+0.026
ada+0.024
ount+0.024
umbers+0.024
 onions+0.023
Neg logit contributions
 now-0.010
 be-0.009
 next-0.006
 was-0.006
 myself-0.006
Token It
Token ID632
Feature activation-3.91e-04

Pos logit contributions
ait+0.016
ount+0.015
ada+0.014
enced+0.014
 pigs+0.014
Neg logit contributions
 now-0.011
 be-0.010
 myself-0.010
 wishes-0.010
 hurts-0.009
Token was
Token ID373
Feature activation+2.34e-02

Pos logit contributions
ait+0.001
ount+0.001
ada+0.001
enced+0.001
 pigs+0.001
Neg logit contributions
 now-0.000
 hurts-0.000
 wishes-0.000
 myself-0.000
 be-0.000
Token perfect
Token ID2818
Feature activation+1.75e-03

Pos logit contributions
 now+0.019
 be+0.016
 myself+0.014
 wishes+0.014
 hurts+0.013
Neg logit contributions
ait-0.055
ount-0.053
ada-0.051
enced-0.049
umbers-0.049
Token for
Token ID329
Feature activation-1.03e-02

Pos logit contributions
 now+0.002
 be+0.001
 myself+0.001
 wishes+0.001
 my+0.001
Neg logit contributions
ait-0.004
ount-0.004
ada-0.004
enced-0.004
umbers-0.003
Token they
Token ID484
Feature activation-2.65e-02

Pos logit contributions
 now+0.031
 hurts+0.030
 wishes+0.030
 myself+0.029
 be+0.028
Neg logit contributions
ait-0.048
ount-0.046
ada-0.043
enced-0.043
 pigs-0.042
Token had
Token ID550
Feature activation-1.26e-04

Pos logit contributions
ait+0.059
ount+0.059
ada+0.056
 911+0.055
 stret+0.055
Neg logit contributions
 now-0.031
 be-0.026
 was-0.025
 is-0.021
 wish-0.020
Token no
Token ID645
Feature activation-2.04e-03

Pos logit contributions
ada+0.000
ait+0.000
ount+0.000
hot+0.000
umbers+0.000
Neg logit contributions
 now-0.000
 be-0.000
 wishes-0.000
 hurts-0.000
 my-0.000
Token idea
Token ID2126
Feature activation-2.04e-02

Pos logit contributions
ait+0.003
ount+0.003
ada+0.003
enced+0.003
 pigs+0.003
Neg logit contributions
 now-0.003
 wishes-0.002
 hurts-0.002
 be-0.002
 myself-0.002
Token why
Token ID1521
Feature activation-4.98e-03

Pos logit contributions
ait+0.044
ada+0.042
ount+0.042
enced+0.040
arre+0.040
Neg logit contributions
 now-0.022
 be-0.018
 myself-0.016
ggy-0.016
 wishes-0.016
Token they
Token ID484
Feature activation-6.24e-02

Pos logit contributions
ount+0.010
ada+0.010
arre+0.010
ait+0.010
umbers+0.010
Neg logit contributions
 now-0.005
 be-0.005
ggy-0.004
 wishes-0.004
 my-0.004
Token were
Token ID547
Feature activation-2.89e-02

Pos logit contributions
ait+0.125
ount+0.122
ada+0.119
iers+0.118
umbers+0.116
Neg logit contributions
 now-0.078
 be-0.068
 was-0.064
 hurts-0.059
 dream-0.057
Token there
Token ID612
Feature activation-3.01e-02

Pos logit contributions
ada+0.062
ait+0.061
ount+0.060
hot+0.059
folk+0.057
Neg logit contributions
 now-0.033
 be-0.023
 next-0.022
 not-0.021
 my-0.021
Token.
Token ID13
Feature activation+5.52e-03

Pos logit contributions
ait+0.071
ada+0.070
ount+0.070
arre+0.066
enced+0.066
Neg logit contributions
 now-0.029
 be-0.021
 next-0.019
 myself-0.017
 my-0.015
Token They
Token ID1119
Feature activation-2.68e-02

Pos logit contributions
 now+0.006
 be+0.006
 wishes+0.005
 myself+0.005
 hurts+0.005
Neg logit contributions
ait-0.010
ount-0.010
ada-0.010
folk-0.009
enced-0.009
Token looked
Token ID3114
Feature activation-2.01e-02

Pos logit contributions
ount+0.064
ait+0.061
ada+0.059
umbers+0.059
 stret+0.058
Neg logit contributions
 now-0.025
 be-0.022
 was-0.020
 then-0.019
 dream-0.018
Token should
Token ID815
Feature activation+3.07e-02

Pos logit contributions
ait+0.092
ount+0.089
enced+0.084
ules+0.083
line+0.083
Neg logit contributions
 now-0.041
 wishes-0.039
 wish-0.038
 be-0.037
 hurts-0.033
Token forgive
Token ID20927
Feature activation-7.48e-04

Pos logit contributions
 now+0.024
 wishes+0.021
 hurts+0.021
 be+0.021
 myself+0.021
Neg logit contributions
ait-0.069
ount-0.067
ada-0.064
enced-0.062
 pigs-0.061
Token and
Token ID290
Feature activation+1.59e-02

Pos logit contributions
ait+0.001
ount+0.001
ada+0.001
enced+0.001
 pigs+0.001
Neg logit contributions
 now-0.001
 hurts-0.001
 wishes-0.001
 myself-0.001
 be-0.001
Token apologize
Token ID16521
Feature activation-7.61e-03

Pos logit contributions
 hurts+0.016
 myself+0.015
ggy+0.015
 lands+0.015
 now+0.014
Neg logit contributions
ait-0.026
ount-0.024
ada-0.024
line-0.024
hot-0.023
Token when
Token ID618
Feature activation-1.58e-02

Pos logit contributions
ait+0.015
ount+0.014
ada+0.014
enced+0.013
 pigs+0.013
Neg logit contributions
 now-0.008
 hurts-0.007
 wishes-0.007
 myself-0.007
 be-0.007
Token we
Token ID356
Feature activation-7.28e-02

Pos logit contributions
ait+0.030
ount+0.029
ada+0.027
enced+0.026
 pigs+0.026
Neg logit contributions
 now-0.017
 hurts-0.016
 wishes-0.015
 myself-0.015
 be-0.015
Token make
Token ID787
Feature activation+1.81e-03

Pos logit contributions
ait+0.163
ount+0.147
ada+0.140
enced+0.139
ules+0.138
Neg logit contributions
 be-0.077
 now-0.072
 hurts-0.071
 wishes-0.070
 wish-0.070
Token mistakes
Token ID10135
Feature activation+4.11e-04

Pos logit contributions
 now+0.002
 hurts+0.002
 wishes+0.002
 myself+0.002
 be+0.001
Neg logit contributions
ait-0.004
ount-0.004
ada-0.003
enced-0.003
 pigs-0.003
Token.
Token ID13
Feature activation+6.79e-03

Pos logit contributions
 now+0.000
 hurts+0.000
 be+0.000
 myself+0.000
 wishes+0.000
Neg logit contributions
ait-0.001
ount-0.001
ada-0.001
enced-0.001
ules-0.001
Token We
Token ID775
Feature activation-3.77e-02

Pos logit contributions
 now+0.010
 hurts+0.009
ggy+0.009
 wishes+0.009
 lands+0.009
Neg logit contributions
ait-0.010
ount-0.010
ada-0.009
 pigs-0.009
hot-0.009
Token should
Token ID815
Feature activation+3.15e-02

Pos logit contributions
ait+0.087
ount+0.083
iers+0.079
enced+0.079
ules+0.078
Neg logit contributions
 now-0.037
 wish-0.036
 wishes-0.035
 be-0.035
 hurts-0.029
Token\n
Token ID198
Feature activation-4.48e-03

Pos logit contributions
ait+0.009
ount+0.009
ada+0.009
enced+0.008
 pigs+0.008
Neg logit contributions
 now-0.006
 be-0.005
 hurts-0.005
 wishes-0.005
 myself-0.005
Token\n
Token ID198
Feature activation+3.82e-03

Pos logit contributions
ait+0.009
ount+0.009
ada+0.008
enced+0.008
 pigs+0.008
Neg logit contributions
 now-0.004
 be-0.004
 hurts-0.004
 wishes-0.004
 myself-0.004
Token"
Token ID1
Feature activation-8.53e-03

Pos logit contributions
 now+0.004
 be+0.004
 myself+0.003
 hurts+0.003
 wishes+0.003
Neg logit contributions
ait-0.008
ount-0.007
ada-0.007
 911-0.007
enced-0.007
TokenNo
Token ID2949
Feature activation-1.57e-03

Pos logit contributions
ait+0.015
ount+0.015
ada+0.014
enced+0.013
 911+0.013
Neg logit contributions
 now-0.011
 be-0.010
 myself-0.010
 hurts-0.009
 wishes-0.009
Token,
Token ID11
Feature activation-6.55e-03

Pos logit contributions
ait+0.003
ount+0.003
ada+0.003
 pigs+0.003
enced+0.003
Neg logit contributions
 now-0.001
 wishes-0.001
 myself-0.001
 hurts-0.001
 be-0.001
Token we
Token ID356
Feature activation-5.85e-02

Pos logit contributions
ait+0.013
ount+0.013
enced+0.012
ada+0.012
iers+0.012
Neg logit contributions
 now-0.007
 be-0.006
 wishes-0.006
 myself-0.006
 hurts-0.006
Token can
Token ID460
Feature activation+5.55e-04

Pos logit contributions
ait+0.128
ount+0.127
 stret+0.121
enced+0.121
 pigs+0.121
Neg logit contributions
 now-0.058
 be-0.052
 wish-0.049
 hurts-0.046
 wishes-0.044
Token't
Token ID470
Feature activation+7.36e-04

Pos logit contributions
 now+0.000
 hurts+0.000
 wishes+0.000
 myself+0.000
 be+0.000
Neg logit contributions
ait-0.001
ount-0.001
ada-0.001
enced-0.001
 pigs-0.001
Token do
Token ID466
Feature activation+2.33e-03

Pos logit contributions
 now+0.001
 hurts+0.001
 wishes+0.001
 myself+0.001
 be+0.001
Neg logit contributions
ait-0.002
ount-0.001
ada-0.001
enced-0.001
 pigs-0.001
Token that
Token ID326
Feature activation-6.12e-03

Pos logit contributions
 now+0.002
 hurts+0.002
 wishes+0.002
 myself+0.002
 be+0.002
Neg logit contributions
ait-0.005
ount-0.005
ada-0.005
enced-0.004
 pigs-0.004
Token,"
Token ID553
Feature activation-1.43e-02

Pos logit contributions
ait+0.012
ount+0.012
ada+0.011
enced+0.011
 pigs+0.011
Neg logit contributions
 now-0.006
 hurts-0.005
 wishes-0.005
 myself-0.005
 be-0.005
Token around
Token ID1088
Feature activation-1.38e-02

Pos logit contributions
 now+0.004
 myself+0.004
 wishes+0.004
 be+0.003
 hurts+0.003
Neg logit contributions
ait-0.009
ount-0.009
ada-0.009
 pigs-0.008
enced-0.008
Token.
Token ID13
Feature activation-1.29e-02

Pos logit contributions
ait+0.033
ount+0.032
ada+0.031
enced+0.030
arre+0.029
Neg logit contributions
 now-0.014
 wishes-0.011
 myself-0.010
 be-0.010
 my-0.010
Token\n
Token ID198
Feature activation-9.12e-03

Pos logit contributions
ait+0.022
ount+0.021
ada+0.019
umbers+0.019
enced+0.019
Neg logit contributions
 now-0.019
 be-0.018
 myself-0.017
 hurts-0.016
 wishes-0.016
Token\n
Token ID198
Feature activation-2.12e-03

Pos logit contributions
ait+0.019
ount+0.019
ada+0.018
 pigs+0.017
enced+0.017
Neg logit contributions
 now-0.009
 be-0.008
 wishes-0.007
 hurts-0.007
 myself-0.007
TokenSo
Token ID2396
Feature activation-1.24e-03

Pos logit contributions
ait+0.004
ount+0.004
ada+0.003
 pigs+0.003
enced+0.003
Neg logit contributions
 now-0.003
 be-0.003
 myself-0.003
Just-0.003
 wishes-0.003
Token they
Token ID484
Feature activation-5.54e-02

Pos logit contributions
ait+0.002
ount+0.002
ada+0.001
 911+0.001
enced+0.001
Neg logit contributions
 hurts-0.002
 myself-0.002
 now-0.002
 wishes-0.002
ggy-0.002
Token went
Token ID1816
Feature activation-1.14e-02

Pos logit contributions
ount+0.123
ait+0.122
enced+0.113
 911+0.112
auc+0.111
Neg logit contributions
 now-0.049
 be-0.045
 flew-0.038
 closed-0.037
 then-0.036
Token on
Token ID319
Feature activation-1.04e-03

Pos logit contributions
ait+0.025
ount+0.024
ada+0.023
enced+0.022
folk+0.022
Neg logit contributions
 now-0.011
 be-0.009
 myself-0.009
 hurts-0.008
 my-0.008
Token a
Token ID257
Feature activation+4.86e-03

Pos logit contributions
ait+0.002
ount+0.002
ada+0.002
enced+0.002
 pigs+0.002
Neg logit contributions
 now-0.001
 wishes-0.001
 be-0.001
 hurts-0.001
 myself-0.001
Token journey
Token ID7002
Feature activation+9.52e-03

Pos logit contributions
 now+0.005
 hurts+0.005
 myself+0.005
 wishes+0.005
 be+0.005
Neg logit contributions
ait-0.009
ount-0.008
ada-0.008
enced-0.008
 pigs-0.008
Token together
Token ID1978
Feature activation-5.52e-03

Pos logit contributions
 now+0.007
 be+0.006
-+0.005
 next+0.004
 myself+0.004
Neg logit contributions
ait-0.024
ount-0.023
ada-0.023
 pigs-0.022
umbers-0.022
Token spider
Token ID19230
Feature activation+2.51e-03

Pos logit contributions
 now+0.011
 myself+0.010
 wishes+0.010
 hurts+0.010
 be+0.010
Neg logit contributions
ait-0.016
ount-0.015
ada-0.014
 pigs-0.014
 911-0.014
Token,
Token ID11
Feature activation-1.94e-02

Pos logit contributions
 now+0.002
 hurts+0.002
 wishes+0.002
 myself+0.002
 be+0.002
Neg logit contributions
ait-0.005
ount-0.005
ada-0.005
enced-0.004
 pigs-0.004
Token but
Token ID475
Feature activation+3.40e-03

Pos logit contributions
ait+0.037
ount+0.036
ada+0.034
enced+0.033
umbers+0.032
Neg logit contributions
 now-0.026
 be-0.025
 hurts-0.022
 myself-0.021
 my-0.021
Token no
Token ID645
Feature activation+8.58e-04

Pos logit contributions
 now+0.003
 be+0.003
 hurts+0.003
 myself+0.003
 wishes+0.003
Neg logit contributions
ait-0.007
ount-0.007
ada-0.007
enced-0.007
 pigs-0.007
Token matter
Token ID2300
Feature activation+1.65e-02

Pos logit contributions
 wishes+0.001
 myself+0.001
 now+0.001
 hurts+0.001
ggy+0.001
Neg logit contributions
ait-0.001
 911-0.001
 pigs-0.001
ada-0.001
 onions-0.001
Token how
Token ID703
Feature activation-5.65e-02

Pos logit contributions
 now+0.019
 myself+0.017
 hurts+0.017
 be+0.017
 wishes+0.017
Neg logit contributions
ait-0.031
ount-0.030
ada-0.029
enced-0.028
 pigs-0.027
Token hard
Token ID1327
Feature activation-1.01e-02

Pos logit contributions
ount+0.148
ait+0.148
ada+0.141
enced+0.138
 pigs+0.135
Neg logit contributions
 now-0.027
 hurts-0.024
 wishes-0.021
 myself-0.020
 be-0.019
Token she
Token ID673
Feature activation-3.51e-02

Pos logit contributions
ait+0.019
ount+0.018
ada+0.018
enced+0.017
 pigs+0.017
Neg logit contributions
 now-0.010
 hurts-0.010
 myself-0.010
 wishes-0.010
 be-0.009
Token tried
Token ID3088
Feature activation+4.40e-03

Pos logit contributions
ait+0.053
ount+0.048
ada+0.047
enced+0.046
hot+0.046
Neg logit contributions
 myself-0.049
 my-0.043
 wishes-0.042
 hurts-0.041
 now-0.040
Token,
Token ID11
Feature activation-5.15e-03

Pos logit contributions
 now+0.004
 be+0.003
 hurts+0.003
 myself+0.003
 wishes+0.003
Neg logit contributions
ait-0.010
ount-0.010
enced-0.009
ada-0.009
umbers-0.009
Token it
Token ID340
Feature activation-2.03e-03

Pos logit contributions
ait+0.011
ount+0.011
enced+0.010
ada+0.010
iers+0.010
Neg logit contributions
 now-0.005
 be-0.004
 myself-0.004
 wishes-0.004
 hurts-0.004