TOP ACTIVATIONS
MAX = 7.275

hands and laugh . The tou can makes a loud sound
the game - until the le opard stepped in .
. They play with the tou can until it is time
Everyday he would take the av oc ados and scatter them
it and picking up the le mons when they fell .
, and all the other av oc ados were very happy
the sun rose , the le opard said goodbye to her
friend ?" The tou can nods its head .
saw a picture of a le opard . They said it
very excited to see the le opard . At
. He picked all the le mons , one by one
sisters . They are all le op ards , too .
neck l aces , grape brace lets , and grape crown
N ina was panic - str icken . She screamed for
the rice back in its rightful place . John was proud
the ancient knob found its rightful place in their house .
waved the stick at the le opard and made loud noises
in the water . O str ich practiced and practiced until
even louder . Ste lla ran away with the
It turned out that the av oc ados were bad and

MIN ACTIVATIONS
MIN = -5.252

He said , " Rob ot , are you okay ?"
and I will help you .
and you will be happy .
Tim said , " Rob ot , you must yield when
My name is Hop sc ot ch , and I 'm
, which was an err and he didn 't understand or
and said , " Rob ot , remind me to clean
her back her te ap ot . She hoped Mom and
?" F ay replied , " It 's
dull â and he had to choose which
small and they made his toes hurt . He wanted something
her elephant without her cr ay ons . But
, this is my cr ay on . I am using
song ?" J os ie nodded eagerly . The
for ?" F ay said , " It is
and he could feel his toes getting crushed .
then he remembered his cr ay on . He quickly got
... are ... well ... and ... happy . I ...
she couldn 't feel her toes . She hopped and skipped
?" Hop sc ot ch replied , " My

INTERVAL 4.143 - 7.275
CONTAINS 0.018%

actus was starting to grow taller and wider . He couldn
's plant grew taller and taller . It was so big
had found the perfect fancy l oll ipop . The End
quickly sw am to the tort oise , and asked if
chasing each other around the golf course . The

INTERVAL 1.012 - 4.143
CONTAINS 10.676%

price . It is not too much . Mom says yes
ara and Ben hugged the lady and said they were sorry
do you value more than jelly ?" The bear was confused
in the sky . The bird felt proud of its own
fun and make friends . Anna 's mom and dad watch

INTERVAL -2.120 - 1.012
CONTAINS 88.066%

He says yes , but they have to be careful .
mean to hurt you or your son ." The
glad that you 're here . I have a special surprise
Jack . Jack felt happy too because he helped
So he asked Jim , " Can you help me get

INTERVAL -5.252 - -2.120
CONTAINS 1.240%

a little boy named Tim my . Tim my loved playing
M agg ie 's mom my was very angry with Maggie
asked Tom to give her lamp back . Tom
on their toys , and make them move . The whale
to go home !" Tim my didn 't want to leave
Token hands
Token ID2832
Feature activation-3.01e+00

Pos logit contributions
amas+0.017
 dishes+0.014
 cloth+0.013
bars+0.011
 hurts+0.011
Neg logit contributions
 known-0.058
 knowing-0.055
uth-0.055
rou-0.054
Fre-0.053
Token and
Token ID290
Feature activation-5.90e-01

Pos logit contributions
amas+0.114
 dishes+0.113
 cloth+0.111
 hurts+0.110
 cheeks+0.103
Neg logit contributions
 known-0.135
 knowing-0.131
rou-0.121
abella-0.121
Fre-0.118
Token laugh
Token ID6487
Feature activation-5.70e-01

Pos logit contributions
amas+0.019
 dishes+0.019
 cloth+0.018
 hurts+0.018
 hands+0.017
Neg logit contributions
 known-0.031
 knowing-0.028
Fre-0.027
abella-0.027
rou-0.027
Token.
Token ID13
Feature activation-1.53e-01

Pos logit contributions
 dishes+0.018
 cloth+0.017
amas+0.017
 hurts+0.017
 hands+0.017
Neg logit contributions
 known-0.030
abella-0.028
Fre-0.028
 knowing-0.027
rou-0.027
Token The
Token ID383
Feature activation-4.65e-01

Pos logit contributions
 hands+0.004
 cloth+0.004
amas+0.004
 dishes+0.004
 hurts+0.004
Neg logit contributions
 known-0.009
abella-0.008
rou-0.008
Fre-0.008
 knowing-0.008
Token tou
Token ID2819
Feature activation+6.17e+00

Pos logit contributions
 cloth+0.017
 dishes+0.017
 hurts+0.017
 voice+0.016
 cheeks+0.016
Neg logit contributions
abella-0.022
rou-0.020
 known-0.019
Fre-0.019
 knowing-0.019
Tokencan
Token ID5171
Feature activation+8.41e-01

Pos logit contributions
 known+0.300
 knowing+0.291
abella+0.291
Fre+0.281
rou+0.263
Neg logit contributions
amas-0.196
 hurts-0.192
 cloth-0.186
 hands-0.181
 dishes-0.181
Token makes
Token ID1838
Feature activation-1.69e+00

Pos logit contributions
 known+0.036
abella+0.033
 knowing+0.032
Fre+0.031
rou+0.031
Neg logit contributions
 dishes-0.035
amas-0.035
 cloth-0.034
 cheeks-0.033
 hands-0.032
Token a
Token ID257
Feature activation-3.08e-01

Pos logit contributions
 dishes+0.063
 cloth+0.062
 hands+0.061
 hurts+0.061
amas+0.060
Neg logit contributions
 known-0.079
abella-0.072
Fre-0.072
 knowing-0.071
rou-0.069
Token loud
Token ID7812
Feature activation-2.26e+00

Pos logit contributions
 cloth+0.010
 hurts+0.009
 hands+0.009
 dishes+0.009
 button+0.009
Neg logit contributions
abella-0.017
 known-0.016
Fre-0.015
 discovering-0.014
 knowing-0.014
Token sound
Token ID2128
Feature activation-1.21e+00

Pos logit contributions
 dishes+0.047
amas+0.047
 cloth+0.046
 hands+0.042
 hurts+0.041
Neg logit contributions
 known-0.139
 knowing-0.133
abella-0.129
Fre-0.129
rou-0.127
Token the
Token ID262
Feature activation+2.51e+00

Pos logit contributions
 known+0.187
abella+0.170
 knowing+0.169
Fre+0.168
rou+0.163
Neg logit contributions
amas-0.152
 dishes-0.152
 cloth-0.151
 hurts-0.147
 hands-0.143
Token game
Token ID983
Feature activation+1.54e+00

Pos logit contributions
 known+0.122
 knowing+0.112
Fre+0.110
abella+0.109
rou+0.108
Neg logit contributions
amas-0.086
 dishes-0.083
 cloth-0.081
 hands-0.079
 hurts-0.079
Token-
Token ID12
Feature activation+2.62e+00

Pos logit contributions
 known+0.062
 knowing+0.061
abella+0.057
rou+0.055
Fre+0.055
Neg logit contributions
amas-0.064
 dishes-0.063
 cloth-0.062
 hurts-0.061
 cheeks-0.060
Token until
Token ID1566
Feature activation-9.08e-01

Pos logit contributions
 known+0.123
abella+0.113
 knowing+0.112
Fre+0.111
rou+0.108
Neg logit contributions
 dishes-0.096
 cloth-0.095
amas-0.095
 hurts-0.092
 hands-0.091
Token the
Token ID262
Feature activation+1.57e+00

Pos logit contributions
amas+0.039
 dishes+0.038
 cloth+0.037
 hurts+0.037
 cheeks+0.036
Neg logit contributions
 known-0.038
abella-0.034
 knowing-0.034
Fre-0.032
rou-0.031
Token le
Token ID443
Feature activation+6.07e+00

Pos logit contributions
 known+0.079
abella+0.072
 knowing+0.072
Fre+0.071
rou+0.070
Neg logit contributions
amas-0.052
 dishes-0.052
 cloth-0.051
 hurts-0.050
 hands-0.049
Tokenopard
Token ID15478
Feature activation+3.15e+00

Pos logit contributions
 known+0.224
abella+0.205
rou+0.199
Fre+0.199
 knowing+0.195
Neg logit contributions
 dishes-0.281
 cloth-0.281
amas-0.274
 hurts-0.270
 hands-0.269
Token stepped
Token ID10764
Feature activation+8.23e-01

Pos logit contributions
 known+0.131
abella+0.117
 knowing+0.116
Fre+0.113
rou+0.111
Neg logit contributions
amas-0.135
 dishes-0.132
 cloth-0.132
 hands-0.124
 diaper-0.124
Token in
Token ID287
Feature activation+7.35e-02

Pos logit contributions
 known+0.042
abella+0.039
 knowing+0.038
Fre+0.038
rou+0.037
Neg logit contributions
 dishes-0.027
 cloth-0.027
amas-0.027
 hurts-0.026
 hands-0.025
Token.
Token ID13
Feature activation+4.65e-01

Pos logit contributions
 known+0.003
 knowing+0.003
abella+0.003
Fre+0.003
rou+0.003
Neg logit contributions
amas-0.003
 dishes-0.003
 cloth-0.003
 hurts-0.003
 cheeks-0.003
Token
Token ID198
Feature activation+3.05e-01

Pos logit contributions
 known+0.023
abella+0.021
 knowing+0.021
Fre+0.021
rou+0.020
Neg logit contributions
amas-0.016
 dishes-0.016
 cloth-0.016
 hurts-0.015
 hands-0.015
Token.
Token ID13
Feature activation+1.95e-03

Pos logit contributions
 known+0.151
abella+0.142
Fre+0.140
rou+0.135
 knowing+0.134
Neg logit contributions
 dishes-0.063
 hands-0.063
 cloth-0.061
 hurts-0.061
amas-0.059
Token They
Token ID1119
Feature activation+2.92e-02

Pos logit contributions
 known+0.000
abella+0.000
rou+0.000
 knowing+0.000
Fre+0.000
Neg logit contributions
 hands-0.000
 cloth-0.000
amas-0.000
 hurts-0.000
 dishes-0.000
Token play
Token ID711
Feature activation-1.20e+00

Pos logit contributions
 known+0.002
abella+0.001
 knowing+0.001
Fre+0.001
rou+0.001
Neg logit contributions
 cloth-0.001
 dishes-0.001
 hurts-0.001
 hands-0.001
 cheeks-0.001
Token with
Token ID351
Feature activation-1.78e-01

Pos logit contributions
 cloth+0.033
 dishes+0.033
 hands+0.032
 hurts+0.031
amas+0.030
Neg logit contributions
 known-0.067
abella-0.064
Fre-0.064
rou-0.062
uth-0.059
Token the
Token ID262
Feature activation+3.24e-02

Pos logit contributions
amas+0.008
 dishes+0.007
 hurts+0.007
 cloth+0.007
 cheeks+0.007
Neg logit contributions
 known-0.007
 knowing-0.007
abella-0.006
Fre-0.006
rou-0.006
Token tou
Token ID2819
Feature activation+6.00e+00

Pos logit contributions
abella+0.002
 known+0.002
Fre+0.002
rou+0.002
 knowing+0.001
Neg logit contributions
 cloth-0.001
 dishes-0.001
 hurts-0.001
 hands-0.001
 cheeks-0.001
Tokencan
Token ID5171
Feature activation+1.01e+00

Pos logit contributions
 known+0.288
abella+0.276
 knowing+0.271
Fre+0.266
rou+0.253
Neg logit contributions
amas-0.201
 cloth-0.194
 hurts-0.194
 dishes-0.191
 hands-0.188
Token until
Token ID1566
Feature activation-1.19e+00

Pos logit contributions
 known+0.043
 knowing+0.039
abella+0.038
rou+0.037
Fre+0.037
Neg logit contributions
 dishes-0.042
amas-0.042
 cloth-0.041
 cheeks-0.040
 hurts-0.038
Token it
Token ID340
Feature activation-6.57e-02

Pos logit contributions
 cloth+0.042
 dishes+0.041
 hands+0.040
 hurts+0.039
 cheeks+0.038
Neg logit contributions
 known-0.058
Fre-0.055
rou-0.054
abella-0.054
 knowing-0.052
Token is
Token ID318
Feature activation+3.80e-01

Pos logit contributions
 dishes+0.002
 cloth+0.002
amas+0.002
 hurts+0.002
 hands+0.002
Neg logit contributions
 known-0.004
abella-0.003
Fre-0.003
 knowing-0.003
rou-0.003
Token time
Token ID640
Feature activation-4.11e-02

Pos logit contributions
 known+0.017
Fre+0.017
abella+0.017
rou+0.016
 knowing+0.016
Neg logit contributions
 cloth-0.013
 hands-0.013
 dishes-0.013
 hurts-0.013
amas-0.013
Token Everyday
Token ID48154
Feature activation+6.53e-01

Pos logit contributions
 known+0.021
Fre+0.020
abella+0.020
 knowing+0.019
rou+0.019
Neg logit contributions
 cloth-0.019
 dishes-0.018
 hands-0.018
 hurts-0.018
amas-0.017
Token he
Token ID339
Feature activation+6.26e-01

Pos logit contributions
 known+0.028
abella+0.026
 knowing+0.026
Fre+0.025
 discovering+0.023
Neg logit contributions
amas-0.027
 dishes-0.026
 cloth-0.026
 hurts-0.025
 hands-0.025
Token would
Token ID561
Feature activation+1.44e+00

Pos logit contributions
 known+0.024
abella+0.021
 knowing+0.021
Fre+0.021
rou+0.020
Neg logit contributions
amas-0.029
 dishes-0.029
 cloth-0.028
 hurts-0.027
 hands-0.027
Token take
Token ID1011
Feature activation-3.46e-02

Pos logit contributions
 known+0.065
 knowing+0.057
Fre+0.054
 seeing+0.054
abella+0.054
Neg logit contributions
amas-0.060
 dishes-0.056
 hurts-0.054
 cloth-0.053
 diaper-0.052
Token the
Token ID262
Feature activation+2.49e-01

Pos logit contributions
amas+0.001
 dishes+0.001
 hurts+0.001
 cloth+0.001
 diaper+0.001
Neg logit contributions
 known-0.002
abella-0.001
 knowing-0.001
Fre-0.001
 discovering-0.001
Token av
Token ID1196
Feature activation+5.84e+00

Pos logit contributions
 known+0.012
 knowing+0.011
rou+0.011
Fre+0.011
abella+0.011
Neg logit contributions
amas-0.008
 hurts-0.008
 cheeks-0.007
 dishes-0.007
 hands-0.007
Tokenoc
Token ID420
Feature activation+3.32e+00

Pos logit contributions
 known+0.217
abella+0.202
 knowing+0.193
Fre+0.190
rou+0.183
Neg logit contributions
amas-0.268
 dishes-0.268
 cloth-0.267
 hands-0.261
 hurts-0.258
Tokenados
Token ID22484
Feature activation+1.25e+00

Pos logit contributions
 known+0.185
 knowing+0.171
Fre+0.170
abella+0.169
rou+0.168
Neg logit contributions
 cloth-0.092
 dishes-0.088
amas-0.086
 hurts-0.085
 hands-0.081
Token and
Token ID290
Feature activation+4.84e-02

Pos logit contributions
 known+0.048
 knowing+0.045
Fre+0.042
abella+0.041
 possible+0.040
Neg logit contributions
amas-0.056
 cheeks-0.056
 hurts-0.056
 cloth-0.054
 dishes-0.053
Token scatter
Token ID41058
Feature activation+2.46e-01

Pos logit contributions
 known+0.002
 knowing+0.002
abella+0.002
Fre+0.002
 discovering+0.002
Neg logit contributions
amas-0.002
 dishes-0.002
 hurts-0.002
 cloth-0.002
 diaper-0.002
Token them
Token ID606
Feature activation-1.21e+00

Pos logit contributions
 known+0.009
 knowing+0.008
Fre+0.008
abella+0.008
rou+0.008
Neg logit contributions
amas-0.011
 hurts-0.011
 cheeks-0.011
 dishes-0.011
 cloth-0.011
Token it
Token ID340
Feature activation+1.26e+00

Pos logit contributions
 known+0.057
 knowing+0.052
Fre+0.051
rou+0.050
 exactly+0.050
Neg logit contributions
amas-0.074
 cheeks-0.068
 lap-0.067
 knee-0.066
 elbows-0.065
Token and
Token ID290
Feature activation+8.21e-01

Pos logit contributions
 knowing+0.046
 known+0.045
rou+0.041
 discovering+0.041
 exactly+0.041
Neg logit contributions
amas-0.057
 dishes-0.051
 cheeks-0.050
 cloth-0.050
 elbows-0.050
Token picking
Token ID10868
Feature activation+1.72e+00

Pos logit contributions
 known+0.036
 knowing+0.033
abella+0.032
Fre+0.031
rou+0.031
Neg logit contributions
amas-0.034
 dishes-0.032
 cloth-0.032
 hurts-0.031
 cheeks-0.030
Token up
Token ID510
Feature activation+1.38e+00

Pos logit contributions
 known+0.080
abella+0.073
 knowing+0.073
Fre+0.073
rou+0.070
Neg logit contributions
amas-0.063
 dishes-0.062
 cloth-0.062
 hurts-0.061
 hands-0.059
Token the
Token ID262
Feature activation+7.73e-01

Pos logit contributions
 known+0.061
 knowing+0.056
abella+0.055
Fre+0.055
rou+0.053
Neg logit contributions
amas-0.055
 dishes-0.052
 cloth-0.052
 hurts-0.052
 cheeks-0.050
Token le
Token ID443
Feature activation+5.78e+00

Pos logit contributions
 known+0.039
 knowing+0.036
Fre+0.035
abella+0.035
rou+0.035
Neg logit contributions
amas-0.025
 hurts-0.024
 dishes-0.024
 cloth-0.023
 cheeks-0.023
Tokenmons
Token ID11567
Feature activation+3.59e+00

Pos logit contributions
 known+0.269
abella+0.248
Fre+0.245
 knowing+0.244
rou+0.240
Neg logit contributions
 dishes-0.211
 cloth-0.211
amas-0.209
 hurts-0.204
 hands-0.201
Token when
Token ID618
Feature activation-8.66e-01

Pos logit contributions
 known+0.144
 knowing+0.143
abella+0.132
rou+0.130
Fre+0.130
Neg logit contributions
amas-0.144
 hurts-0.139
 dishes-0.138
 cheeks-0.138
 cloth-0.138
Token they
Token ID484
Feature activation-7.57e-01

Pos logit contributions
amas+0.035
 dishes+0.035
 cloth+0.035
 hurts+0.034
 hands+0.034
Neg logit contributions
 known-0.037
abella-0.034
 knowing-0.033
Fre-0.033
rou-0.032
Token fell
Token ID3214
Feature activation-5.11e-01

Pos logit contributions
amas+0.031
 dishes+0.030
 cloth+0.029
 hurts+0.029
 hands+0.028
Neg logit contributions
 known-0.034
abella-0.030
Fre-0.030
 knowing-0.029
rou-0.028
Token.
Token ID13
Feature activation-1.35e-01

Pos logit contributions
amas+0.019
 dishes+0.018
 cheeks+0.017
 hurts+0.017
 lap+0.017
Neg logit contributions
 knowing-0.022
 known-0.022
rou-0.021
Fre-0.020
abella-0.019
Token,
Token ID11
Feature activation+7.47e-01

Pos logit contributions
 known+0.096
 knowing+0.089
abella+0.087
Fre+0.086
rou+0.085
Neg logit contributions
 dishes-0.074
amas-0.074
 cloth-0.074
 hurts-0.072
 hands-0.069
Token and
Token ID290
Feature activation+5.16e-01

Pos logit contributions
 known+0.034
abella+0.032
Fre+0.031
rou+0.030
 knowing+0.029
Neg logit contributions
 cloth-0.028
 dishes-0.028
 hands-0.027
amas-0.027
 hurts-0.027
Token all
Token ID477
Feature activation-2.97e-01

Pos logit contributions
 known+0.023
Fre+0.021
abella+0.021
rou+0.021
 knowing+0.020
Neg logit contributions
 cloth-0.020
 dishes-0.020
 hands-0.019
 hurts-0.019
 cheeks-0.019
Token the
Token ID262
Feature activation+1.37e+00

Pos logit contributions
amas+0.013
 dishes+0.012
 cloth+0.012
 cheeks+0.011
 hurts+0.011
Neg logit contributions
 known-0.013
 knowing-0.012
abella-0.011
rou-0.011
Fre-0.011
Token other
Token ID584
Feature activation+8.43e-01

Pos logit contributions
 known+0.068
abella+0.065
Fre+0.063
 knowing+0.063
rou+0.061
Neg logit contributions
 dishes-0.048
 cloth-0.047
 hands-0.045
 hurts-0.044
 cheeks-0.043
Token av
Token ID1196
Feature activation+5.70e+00

Pos logit contributions
 known+0.039
abella+0.035
 knowing+0.035
Fre+0.035
rou+0.034
Neg logit contributions
amas-0.032
 dishes-0.032
 cloth-0.032
 hurts-0.031
 hands-0.030
Tokenoc
Token ID420
Feature activation+2.95e+00

Pos logit contributions
abella+0.244
 known+0.237
Fre+0.223
 knowing+0.215
rou+0.206
Neg logit contributions
 hands-0.230
 cloth-0.227
 dishes-0.225
amas-0.220
 hurts-0.220
Tokenados
Token ID22484
Feature activation+1.33e+00

Pos logit contributions
 known+0.178
abella+0.166
 knowing+0.165
Fre+0.165
rou+0.162
Neg logit contributions
 cloth-0.068
 dishes-0.068
amas-0.067
 hurts-0.064
 hands-0.062
Token were
Token ID547
Feature activation-8.55e-01

Pos logit contributions
 known+0.056
 knowing+0.051
abella+0.048
Fre+0.047
 discovering+0.046
Neg logit contributions
amas-0.059
 dishes-0.056
 hurts-0.055
 cloth-0.055
 diaper-0.054
Token very
Token ID845
Feature activation-1.13e-01

Pos logit contributions
 dishes+0.035
 cloth+0.034
 hands+0.034
amas+0.032
 hurts+0.032
Neg logit contributions
abella-0.035
 known-0.034
Fre-0.034
rou-0.032
 knowing-0.031
Token happy
Token ID3772
Feature activation-2.45e-01

Pos logit contributions
 dishes+0.005
 cloth+0.005
 hands+0.005
 hurts+0.004
 cheeks+0.004
Neg logit contributions
 known-0.005
abella-0.004
Fre-0.004
 knowing-0.004
rou-0.004
Token the
Token ID262
Feature activation+1.37e+00

Pos logit contributions
 known+0.004
abella+0.004
 knowing+0.004
Fre+0.004
rou+0.003
Neg logit contributions
amas-0.004
 dishes-0.004
 cloth-0.003
 hurts-0.003
 hands-0.003
Token sun
Token ID4252
Feature activation+2.01e-01

Pos logit contributions
 known+0.066
abella+0.061
 knowing+0.061
Fre+0.060
rou+0.059
Neg logit contributions
amas-0.048
 dishes-0.047
 cloth-0.046
 hurts-0.046
 hands-0.044
Token rose
Token ID8278
Feature activation+2.24e+00

Pos logit contributions
 known+0.008
 knowing+0.007
abella+0.007
rou+0.007
 seeing+0.007
Neg logit contributions
 dishes-0.009
amas-0.008
 cloth-0.008
 diaper-0.008
 lap-0.008
Token,
Token ID11
Feature activation+4.84e-01

Pos logit contributions
 known+0.095
 knowing+0.087
abella+0.084
rou+0.084
Fre+0.082
Neg logit contributions
 dishes-0.090
amas-0.088
 cloth-0.086
 hands-0.084
 hurts-0.083
Token the
Token ID262
Feature activation+1.20e+00

Pos logit contributions
 known+0.022
abella+0.020
Fre+0.020
 knowing+0.019
rou+0.019
Neg logit contributions
 dishes-0.019
 cloth-0.019
amas-0.018
 hurts-0.018
 hands-0.018
Token le
Token ID443
Feature activation+5.63e+00

Pos logit contributions
 known+0.061
abella+0.057
 knowing+0.056
Fre+0.056
rou+0.054
Neg logit contributions
 dishes-0.039
 cloth-0.039
amas-0.039
 hurts-0.037
 hands-0.037
Tokenopard
Token ID15478
Feature activation+2.93e+00

Pos logit contributions
 known+0.196
abella+0.174
Fre+0.171
 knowing+0.171
rou+0.166
Neg logit contributions
 dishes-0.275
 cloth-0.273
amas-0.272
 hurts-0.267
 hands-0.264
Token said
Token ID531
Feature activation+1.03e+00

Pos logit contributions
 known+0.122
 knowing+0.110
abella+0.110
Fre+0.108
rou+0.106
Neg logit contributions
amas-0.124
 dishes-0.123
 cloth-0.120
 hurts-0.115
 hands-0.114
Token goodbye
Token ID24829
Feature activation+3.25e-02

Pos logit contributions
 known+0.044
 knowing+0.040
abella+0.040
Fre+0.039
rou+0.037
Neg logit contributions
amas-0.043
 dishes-0.043
 cloth-0.042
 hurts-0.041
 hands-0.040
Token to
Token ID284
Feature activation+1.17e+00

Pos logit contributions
 known+0.001
 knowing+0.001
abella+0.001
Fre+0.001
rou+0.001
Neg logit contributions
 dishes-0.001
 cloth-0.001
amas-0.001
 hurts-0.001
 hands-0.001
Token her
Token ID607
Feature activation+6.00e-02

Pos logit contributions
 known+0.058
abella+0.053
Fre+0.053
rou+0.052
 knowing+0.052
Neg logit contributions
 cloth-0.039
 dishes-0.039
 hands-0.038
amas-0.038
 hurts-0.038
Token friend
Token ID1545
Feature activation-1.98e-02

Pos logit contributions
 dishes+0.014
 cloth+0.014
 hands+0.014
amas+0.014
 hurts+0.014
Neg logit contributions
 known-0.014
abella-0.013
 knowing-0.013
Fre-0.013
rou-0.012
Token?"
Token ID1701
Feature activation+1.52e-01

Pos logit contributions
amas+0.001
 dishes+0.001
 cloth+0.001
 cheeks+0.001
 hands+0.001
Neg logit contributions
 known-0.001
 knowing-0.001
abella-0.001
Fre-0.001
rou-0.001
Token
Token ID198
Feature activation-4.73e-01

Pos logit contributions
 known+0.007
abella+0.006
 knowing+0.006
Fre+0.006
rou+0.006
Neg logit contributions
 cloth-0.006
 dishes-0.006
amas-0.005
 hands-0.005
 hurts-0.005
Token
Token ID198
Feature activation-6.70e-01

Pos logit contributions
 cloth+0.016
 dishes+0.015
amas+0.015
 hands+0.015
 hurts+0.015
Neg logit contributions
 known-0.024
 knowing-0.022
abella-0.022
Fre-0.021
rou-0.021
TokenThe
Token ID464
Feature activation-4.21e-01

Pos logit contributions
 cloth+0.028
 dishes+0.028
amas+0.028
 hurts+0.027
 hands+0.027
Neg logit contributions
 known-0.028
 knowing-0.025
abella-0.025
Fre-0.024
rou-0.024
Token tou
Token ID2819
Feature activation+5.63e+00

Pos logit contributions
 dishes+0.014
 cloth+0.014
amas+0.014
 hurts+0.014
 hands+0.013
Neg logit contributions
 known-0.021
abella-0.020
 knowing-0.019
Fre-0.019
rou-0.019
Tokencan
Token ID5171
Feature activation+5.74e-01

Pos logit contributions
 known+0.296
 knowing+0.296
abella+0.292
Fre+0.285
uth+0.271
Neg logit contributions
amas-0.141
 hurts-0.140
 cloth-0.130
 hands-0.127
el-0.126
Token nods
Token ID34833
Feature activation+1.18e+00

Pos logit contributions
 known+0.024
abella+0.022
 knowing+0.022
Fre+0.022
rou+0.021
Neg logit contributions
 dishes-0.024
amas-0.024
 cloth-0.023
 hurts-0.023
 hands-0.022
Token its
Token ID663
Feature activation-8.83e-01

Pos logit contributions
 known+0.054
 knowing+0.050
abella+0.049
Fre+0.048
rou+0.047
Neg logit contributions
amas-0.046
 dishes-0.046
 cloth-0.045
 hurts-0.043
 cheeks-0.042
Token head
Token ID1182
Feature activation-6.66e-01

Pos logit contributions
 cloth+0.029
 dishes+0.029
 hands+0.029
 hurts+0.029
amas+0.029
Neg logit contributions
 known-0.044
abella-0.042
Fre-0.041
 knowing-0.040
rou-0.039
Token.
Token ID13
Feature activation-6.19e-01

Pos logit contributions
 dishes+0.028
amas+0.028
 cloth+0.027
 hurts+0.025
 cheeks+0.025
Neg logit contributions
 known-0.029
 knowing-0.027
rou-0.026
abella-0.025
Fre-0.024
Token saw
Token ID2497
Feature activation-7.36e-02

Pos logit contributions
 dishes+0.009
amas+0.009
 cloth+0.009
 hurts+0.009
 hands+0.009
Neg logit contributions
 known-0.010
abella-0.009
 knowing-0.009
Fre-0.009
rou-0.009
Token a
Token ID257
Feature activation+9.58e-01

Pos logit contributions
amas+0.003
 dishes+0.003
 cloth+0.003
 hurts+0.003
 hands+0.003
Neg logit contributions
 known-0.003
abella-0.003
 knowing-0.003
Fre-0.003
rou-0.003
Token picture
Token ID4286
Feature activation+4.40e-01

Pos logit contributions
 known+0.053
abella+0.050
Fre+0.048
 knowing+0.048
rou+0.046
Neg logit contributions
 cloth-0.030
 dishes-0.028
 hurts-0.027
amas-0.026
 hands-0.026
Token of
Token ID286
Feature activation-4.22e-01

Pos logit contributions
 knowing+0.017
 known+0.016
abella+0.015
 indeed+0.014
 seeing+0.014
Neg logit contributions
amas-0.019
 dishes-0.019
 cloth-0.018
 elbows-0.018
 cheeks-0.018
Token a
Token ID257
Feature activation+1.22e+00

Pos logit contributions
amas+0.016
 dishes+0.016
 hurts+0.016
 cloth+0.016
 hands+0.015
Neg logit contributions
 known-0.019
 knowing-0.017
abella-0.017
Fre-0.017
rou-0.016
Token le
Token ID443
Feature activation+5.60e+00

Pos logit contributions
 known+0.067
abella+0.063
Fre+0.062
 knowing+0.060
rou+0.059
Neg logit contributions
 cloth-0.037
 dishes-0.036
amas-0.034
 hurts-0.034
 hands-0.033
Tokenopard
Token ID15478
Feature activation+2.50e+00

Pos logit contributions
 known+0.244
abella+0.221
 knowing+0.218
Fre+0.218
rou+0.208
Neg logit contributions
 dishes-0.227
amas-0.226
 cloth-0.225
 hurts-0.219
 hands-0.216
Token.
Token ID13
Feature activation+2.66e-02

Pos logit contributions
 known+0.120
abella+0.110
Fre+0.109
 knowing+0.109
rou+0.105
Neg logit contributions
 dishes-0.089
 cloth-0.088
amas-0.088
 hurts-0.085
 hands-0.084
Token They
Token ID1119
Feature activation+1.84e-01

Pos logit contributions
 known+0.001
abella+0.001
 knowing+0.001
Fre+0.001
rou+0.001
Neg logit contributions
 dishes-0.001
 cloth-0.001
amas-0.001
 hurts-0.001
 hands-0.001
Token said
Token ID531
Feature activation+1.12e+00

Pos logit contributions
 known+0.009
abella+0.008
Fre+0.008
 knowing+0.008
rou+0.008
Neg logit contributions
amas-0.007
 dishes-0.007
 cloth-0.007
 hurts-0.006
 hands-0.006
Token it
Token ID340
Feature activation-7.09e-01

Pos logit contributions
 known+0.053
abella+0.049
Fre+0.049
rou+0.048
 knowing+0.048
Neg logit contributions
 cloth-0.040
 dishes-0.040
 hands-0.039
 hurts-0.039
 cheeks-0.038
Token very
Token ID845
Feature activation+5.23e-01

Pos logit contributions
 hands+0.047
 dishes+0.047
 cloth+0.044
 hurts+0.043
 cheeks+0.042
Neg logit contributions
 known-0.054
Fre-0.053
abella-0.052
oph-0.050
rou-0.048
Token excited
Token ID6568
Feature activation-7.40e-01

Pos logit contributions
 known+0.023
abella+0.021
Fre+0.021
 knowing+0.021
rou+0.020
Neg logit contributions
 dishes-0.021
 cloth-0.020
 hurts-0.020
 hands-0.020
amas-0.020
Token to
Token ID284
Feature activation+1.57e+00

Pos logit contributions
 cloth+0.030
amas+0.030
 dishes+0.029
 hurts+0.029
 hands+0.028
Neg logit contributions
 known-0.031
 knowing-0.031
abella-0.028
rou-0.028
Fre-0.027
Token see
Token ID766
Feature activation+6.22e-01

Pos logit contributions
 known+0.073
 knowing+0.063
abella+0.063
Fre+0.062
 discovering+0.060
Neg logit contributions
amas-0.063
 dishes-0.060
 hurts-0.059
 cloth-0.057
 cheeks-0.056
Token the
Token ID262
Feature activation+8.46e-01

Pos logit contributions
 known+0.025
abella+0.022
 knowing+0.022
Fre+0.022
rou+0.021
Neg logit contributions
amas-0.028
 dishes-0.027
 cloth-0.027
 hurts-0.027
 cheeks-0.026
Token le
Token ID443
Feature activation+5.59e+00

Pos logit contributions
 known+0.044
abella+0.042
Fre+0.041
 knowing+0.041
rou+0.040
Neg logit contributions
 dishes-0.027
 cloth-0.026
amas-0.026
 hurts-0.025
 hands-0.025
Tokenopard
Token ID15478
Feature activation+3.10e+00

Pos logit contributions
 known+0.176
abella+0.154
Fre+0.152
 knowing+0.152
rou+0.149
Neg logit contributions
 cloth-0.290
 dishes-0.289
amas-0.287
 hurts-0.281
 hands-0.281
Token.
Token ID13
Feature activation+6.79e-01

Pos logit contributions
 known+0.142
abella+0.130
 knowing+0.130
Fre+0.128
rou+0.125
Neg logit contributions
amas-0.115
 dishes-0.115
 cloth-0.115
 hurts-0.111
 hands-0.110
Token
Token ID198
Feature activation+4.08e-01

Pos logit contributions
 known+0.035
abella+0.032
 knowing+0.032
Fre+0.031
rou+0.031
Neg logit contributions
 dishes-0.022
 cloth-0.022
amas-0.022
 hurts-0.021
 hands-0.021
Token
Token ID198
Feature activation-8.07e-02

Pos logit contributions
 known+0.022
 knowing+0.021
rou+0.020
Fre+0.020
abella+0.020
Neg logit contributions
 cloth-0.012
 hands-0.011
amas-0.011
 dishes-0.010
 hurts-0.010
TokenAt
Token ID2953
Feature activation-1.52e+00

Pos logit contributions
 cloth+0.003
 dishes+0.003
amas+0.003
 hands+0.003
 hurts+0.003
Neg logit contributions
 known-0.004
 knowing-0.003
abella-0.003
rou-0.003
joy-0.003
Token.
Token ID13
Feature activation+3.93e-01

Pos logit contributions
 known+0.012
 knowing+0.011
abella+0.011
rou+0.011
Fre+0.011
Neg logit contributions
amas-0.011
 cloth-0.011
 dishes-0.011
 hurts-0.011
 cheeks-0.011
Token He
Token ID679
Feature activation+4.58e-01

Pos logit contributions
 known+0.020
 knowing+0.018
abella+0.018
Fre+0.018
rou+0.017
Neg logit contributions
amas-0.013
 dishes-0.013
 cloth-0.013
 hands-0.013
 hurts-0.012
Token picked
Token ID6497
Feature activation+1.40e+00

Pos logit contributions
 known+0.019
abella+0.018
Fre+0.017
 knowing+0.017
rou+0.017
Neg logit contributions
 dishes-0.019
 cloth-0.019
amas-0.018
 hurts-0.018
 hands-0.018
Token all
Token ID477
Feature activation-6.34e-01

Pos logit contributions
 known+0.066
abella+0.061
 knowing+0.060
Fre+0.059
rou+0.058
Neg logit contributions
amas-0.052
 dishes-0.050
 cloth-0.050
 hurts-0.049
 cheeks-0.048
Token the
Token ID262
Feature activation+5.91e-01

Pos logit contributions
amas+0.034
 cheeks+0.033
 elbows+0.030
 diaper+0.030
 hurts+0.030
Neg logit contributions
 known-0.021
abella-0.017
 knowing-0.017
rou-0.017
Fre-0.017
Token le
Token ID443
Feature activation+5.53e+00

Pos logit contributions
 known+0.027
 knowing+0.024
rou+0.024
Fre+0.024
 discovering+0.024
Neg logit contributions
amas-0.021
 hurts-0.019
 cheeks-0.019
 elbows-0.018
 diaper-0.018
Tokenmons
Token ID11567
Feature activation+3.43e+00

Pos logit contributions
 known+0.262
rou+0.243
abella+0.243
Fre+0.241
 knowing+0.236
Neg logit contributions
 cloth-0.197
 dishes-0.195
amas-0.190
 hurts-0.188
 hands-0.183
Token,
Token ID11
Feature activation+6.05e-01

Pos logit contributions
 knowing+0.124
 known+0.123
rou+0.109
 discovering+0.108
abella+0.108
Neg logit contributions
amas-0.155
 hurts-0.151
 diaper-0.147
 cheeks-0.146
 dishes-0.145
Token one
Token ID530
Feature activation+5.01e-01

Pos logit contributions
 known+0.026
 knowing+0.024
abella+0.023
Fre+0.023
rou+0.023
Neg logit contributions
amas-0.025
 dishes-0.024
 hurts-0.024
 cloth-0.024
 hands-0.023
Token by
Token ID416
Feature activation+5.77e-01

Pos logit contributions
 known+0.022
 knowing+0.021
rou+0.020
abella+0.019
Fre+0.019
Neg logit contributions
amas-0.021
 dishes-0.020
 hurts-0.019
 cloth-0.018
 cheeks-0.018
Token one
Token ID530
Feature activation+6.77e-01

Pos logit contributions
 known+0.026
abella+0.024
Fre+0.024
rou+0.023
 knowing+0.023
Neg logit contributions
 cloth-0.023
 dishes-0.022
 hands-0.022
 hurts-0.021
 cheeks-0.021
Token sisters
Token ID15153
Feature activation-9.14e-01

Pos logit contributions
amas+0.020
 dishes+0.019
 cloth+0.019
 hurts+0.018
 cheeks+0.018
Neg logit contributions
 known-0.021
 knowing-0.019
Fre-0.019
abella-0.018
rou-0.018
Token.
Token ID13
Feature activation-4.73e-02

Pos logit contributions
amas+0.033
 dishes+0.033
 cloth+0.033
 hurts+0.032
 hands+0.031
Neg logit contributions
 known-0.043
 knowing-0.040
abella-0.039
Fre-0.039
rou-0.038
Token They
Token ID1119
Feature activation+1.80e-01

Pos logit contributions
 dishes+0.001
amas+0.001
 cloth+0.001
 hurts+0.001
 hands+0.001
Neg logit contributions
 known-0.002
abella-0.002
Fre-0.002
rou-0.002
 knowing-0.002
Token are
Token ID389
Feature activation-1.02e+00

Pos logit contributions
 known+0.009
 knowing+0.008
Fre+0.008
abella+0.008
rou+0.008
Neg logit contributions
amas-0.007
 dishes-0.006
 cloth-0.006
 hurts-0.006
 cheeks-0.006
Token all
Token ID477
Feature activation-1.03e+00

Pos logit contributions
 dishes+0.034
 hands+0.033
 cloth+0.031
 clothes+0.029
 hurts+0.029
Neg logit contributions
 known-0.047
abella-0.047
Fre-0.046
rou-0.046
joy-0.045
Token le
Token ID443
Feature activation+5.48e+00

Pos logit contributions
amas+0.045
 cloth+0.045
 dishes+0.044
 hurts+0.043
 cheeks+0.043
Neg logit contributions
 known-0.042
 knowing-0.037
abella-0.037
Fre-0.037
rou-0.036
Tokenop
Token ID404
Feature activation+4.63e-02

Pos logit contributions
 known+0.234
abella+0.213
Fre+0.213
 knowing+0.210
rou+0.208
Neg logit contributions
 cloth-0.224
 dishes-0.223
amas-0.221
 hurts-0.217
 hands-0.212
Tokenards
Token ID1371
Feature activation-3.52e-01

Pos logit contributions
rou+0.002
 knowing+0.002
 known+0.002
Fre+0.002
abella+0.002
Neg logit contributions
amas-0.002
 cloth-0.002
 lap-0.002
 hurts-0.002
 sponge-0.002
Token,
Token ID11
Feature activation+4.79e-01

Pos logit contributions
amas+0.015
 cloth+0.015
 cheeks+0.015
 dishes+0.015
 hands+0.015
Neg logit contributions
 known-0.013
 knowing-0.013
Fre-0.011
 discovering-0.011
 seeing-0.011
Token too
Token ID1165
Feature activation+2.18e+00

Pos logit contributions
 known+0.019
 knowing+0.018
Fre+0.018
abella+0.017
rou+0.017
Neg logit contributions
amas-0.021
 dishes-0.019
 hurts-0.019
 cloth-0.019
 hands-0.018
Token.
Token ID13
Feature activation-2.38e-01

Pos logit contributions
 known+0.104
 knowing+0.095
abella+0.095
Fre+0.095
rou+0.092
Neg logit contributions
amas-0.078
 dishes-0.078
 cloth-0.077
 hurts-0.074
 hands-0.073
Token neck
Token ID7393
Feature activation+1.93e+00

Pos logit contributions
 known+0.092
 knowing+0.085
abella+0.082
Fre+0.081
rou+0.080
Neg logit contributions
amas-0.066
 dishes-0.063
 cloth-0.063
 hurts-0.060
 hands-0.060
Tokenl
Token ID75
Feature activation+3.48e+00

Pos logit contributions
 known+0.093
abella+0.085
 knowing+0.084
Fre+0.084
rou+0.081
Neg logit contributions
 dishes-0.068
amas-0.068
 cloth-0.068
 hurts-0.066
 hands-0.065
Tokenaces
Token ID2114
Feature activation+1.07e+00

Pos logit contributions
 known+0.172
abella+0.158
 knowing+0.156
Fre+0.156
rou+0.152
Neg logit contributions
 dishes-0.119
 cloth-0.118
amas-0.117
 hurts-0.113
 hands-0.112
Token,
Token ID11
Feature activation+1.36e+00

Pos logit contributions
 known+0.049
 knowing+0.045
abella+0.043
rou+0.043
Fre+0.042
Neg logit contributions
 dishes-0.042
amas-0.041
 cloth-0.040
 hurts-0.039
 hands-0.038
Token grape
Token ID30777
Feature activation+4.14e+00

Pos logit contributions
 known+0.055
 knowing+0.053
 discovering+0.048
Fre+0.047
 seeing+0.047
Neg logit contributions
amas-0.060
 dishes-0.057
 hurts-0.056
 cloth-0.054
 diaper-0.053
Token brace
Token ID22581
Feature activation+5.48e+00

Pos logit contributions
 known+0.171
 knowing+0.157
abella+0.153
Fre+0.153
rou+0.145
Neg logit contributions
amas-0.176
 dishes-0.168
 cloth-0.165
 hurts-0.158
 hands-0.158
Tokenlets
Token ID5289
Feature activation+2.72e+00

Pos logit contributions
 known+0.246
abella+0.230
Fre+0.222
 knowing+0.216
 discovering+0.205
Neg logit contributions
 hands-0.214
 cheeks-0.205
 cloth-0.203
 hurts-0.203
amas-0.199
Token,
Token ID11
Feature activation+4.99e-01

Pos logit contributions
 known+0.123
 knowing+0.114
abella+0.112
Fre+0.110
rou+0.108
Neg logit contributions
amas-0.104
 dishes-0.103
 cloth-0.101
 hurts-0.100
 hands-0.096
Token and
Token ID290
Feature activation-5.85e-01

Pos logit contributions
 known+0.025
abella+0.023
 knowing+0.023
Fre+0.023
rou+0.022
Neg logit contributions
 dishes-0.016
amas-0.016
 cloth-0.016
 hurts-0.016
 hands-0.015
Token grape
Token ID30777
Feature activation+3.97e+00

Pos logit contributions
 cloth+0.021
 dishes+0.021
amas+0.020
 hurts+0.020
 hands+0.020
Neg logit contributions
 known-0.027
abella-0.026
Fre-0.025
 knowing-0.025
rou-0.024
Token crown
Token ID12389
Feature activation+1.82e+00

Pos logit contributions
 known+0.179
 knowing+0.163
abella+0.162
Fre+0.161
rou+0.154
Neg logit contributions
amas-0.155
 dishes-0.150
 cloth-0.147
 hurts-0.143
 hands-0.141
TokenN
Token ID45
Feature activation+7.72e-01

Pos logit contributions
 known+0.009
 knowing+0.008
abella+0.008
rou+0.008
Fre+0.007
Neg logit contributions
 cloth-0.008
amas-0.008
 dishes-0.008
 hands-0.008
 hurts-0.007
Tokenina
Token ID1437
Feature activation-2.97e-01

Pos logit contributions
 known+0.034
abella+0.030
Fre+0.030
 knowing+0.030
rou+0.029
Neg logit contributions
amas-0.031
 dishes-0.031
 cloth-0.030
 hurts-0.030
 hands-0.030
Token was
Token ID373
Feature activation-3.89e-02

Pos logit contributions
 dishes+0.012
amas+0.012
 cloth+0.012
 hurts+0.011
 hands+0.011
Neg logit contributions
 known-0.013
abella-0.012
 knowing-0.011
Fre-0.011
rou-0.011
Token panic
Token ID13619
Feature activation-1.90e+00

Pos logit contributions
 hands+0.001
 dishes+0.001
 cloth+0.001
amas+0.001
 hurts+0.001
Neg logit contributions
 known-0.002
Fre-0.002
abella-0.002
 knowing-0.002
rou-0.001
Token-
Token ID12
Feature activation+5.13e-01

Pos logit contributions
amas+0.085
 dishes+0.085
 cloth+0.081
 hurts+0.081
 hands+0.079
Neg logit contributions
 knowing-0.075
 known-0.070
rou-0.064
abella-0.062
 realizing-0.060
Tokenstr
Token ID2536
Feature activation+5.47e+00

Pos logit contributions
 known+0.025
abella+0.023
Fre+0.022
 knowing+0.022
rou+0.022
Neg logit contributions
 dishes-0.019
 cloth-0.019
amas-0.019
 hands-0.018
 cheeks-0.018
Tokenicken
Token ID5973
Feature activation-1.97e+00

Pos logit contributions
 known+0.213
abella+0.195
 knowing+0.189
Fre+0.186
 discovering+0.173
Neg logit contributions
amas-0.255
 dishes-0.237
 hands-0.234
 cloth-0.231
 hurts-0.230
Token.
Token ID13
Feature activation-4.45e-01

Pos logit contributions
 cloth+0.079
amas+0.078
 dishes+0.074
 sponge+0.073
 hurts+0.072
Neg logit contributions
 knowing-0.079
 known-0.078
rou-0.068
abella-0.067
Fre-0.064
Token She
Token ID1375
Feature activation+4.47e-01

Pos logit contributions
amas+0.012
 dishes+0.012
 cloth+0.012
 hands+0.012
 hurts+0.012
Neg logit contributions
 known-0.025
abella-0.023
 knowing-0.022
Fre-0.022
rou-0.022
Token screamed
Token ID25421
Feature activation-5.11e-01

Pos logit contributions
 known+0.020
abella+0.018
 knowing+0.018
Fre+0.018
rou+0.017
Neg logit contributions
 dishes-0.018
 cloth-0.018
amas-0.018
 hurts-0.017
 hands-0.017
Token for
Token ID329
Feature activation-9.62e-01

Pos logit contributions
 dishes+0.022
amas+0.021
 hurts+0.021
 cloth+0.021
 hands+0.020
Neg logit contributions
 knowing-0.020
 known-0.020
abella-0.018
rou-0.018
Fre-0.017
Token the
Token ID262
Feature activation-1.48e-01

Pos logit contributions
 known+0.021
abella+0.018
 knowing+0.018
 exactly+0.017
 discovering+0.017
Neg logit contributions
amas-0.033
 hurts-0.030
 cheeks-0.030
 diaper-0.029
bars-0.028
Token rice
Token ID11464
Feature activation+9.99e-01

Pos logit contributions
amas+0.005
 hurts+0.004
bars+0.004
 cheeks+0.004
iren+0.004
Neg logit contributions
 known-0.007
 knowing-0.006
rou-0.006
joy-0.006
Fre-0.006
Token back
Token ID736
Feature activation+7.29e-02

Pos logit contributions
 knowing+0.035
 known+0.033
 possible+0.030
 exactly+0.030
rou+0.029
Neg logit contributions
amas-0.047
 hurts-0.045
 cheeks-0.042
iren-0.041
 diaper-0.041
Token in
Token ID287
Feature activation+1.34e-01

Pos logit contributions
 known+0.003
 knowing+0.003
rou+0.003
 exactly+0.002
abella+0.002
Neg logit contributions
amas-0.003
 dishes-0.003
 cloth-0.003
 hurts-0.003
 cheeks-0.003
Token its
Token ID663
Feature activation-3.20e-01

Pos logit contributions
 known+0.006
 knowing+0.005
abella+0.005
Fre+0.005
rou+0.005
Neg logit contributions
amas-0.006
 hurts-0.005
 dishes-0.005
 cheeks-0.005
 cloth-0.005
Token rightful
Token ID43133
Feature activation+5.44e+00

Pos logit contributions
amas+0.011
 dishes+0.010
 hurts+0.010
 cloth+0.010
 cheeks+0.010
Neg logit contributions
 known-0.015
 knowing-0.014
rou-0.014
Fre-0.014
abella-0.013
Token place
Token ID1295
Feature activation-3.57e-01

Pos logit contributions
 known+0.267
rou+0.260
abella+0.258
Fre+0.258
 knowing+0.248
Neg logit contributions
 hands-0.189
 cloth-0.177
 dishes-0.177
 hurts-0.166
 lap-0.165
Token.
Token ID13
Feature activation+3.65e-02

Pos logit contributions
amas+0.014
 hurts+0.014
 dishes+0.014
 cloth+0.014
 cheeks+0.013
Neg logit contributions
 known-0.015
 knowing-0.015
rou-0.014
abella-0.014
 discovering-0.013
Token John
Token ID1757
Feature activation-1.36e-01

Pos logit contributions
 known+0.002
abella+0.002
Fre+0.002
 knowing+0.002
rou+0.002
Neg logit contributions
amas-0.001
 dishes-0.001
 cloth-0.001
 hands-0.001
 hurts-0.001
Token was
Token ID373
Feature activation-6.25e-03

Pos logit contributions
 dishes+0.006
amas+0.006
 cloth+0.006
 hurts+0.005
 hands+0.005
Neg logit contributions
 known-0.006
abella-0.005
 knowing-0.005
Fre-0.005
rou-0.005
Token proud
Token ID6613
Feature activation+6.19e-01

Pos logit contributions
 dishes+0.000
 cloth+0.000
amas+0.000
 hands+0.000
 hurts+0.000
Neg logit contributions
 known-0.000
abella-0.000
Fre-0.000
 knowing-0.000
rou-0.000
Token the
Token ID262
Feature activation+1.09e+00

Pos logit contributions
 known+0.024
abella+0.022
rou+0.022
Fre+0.022
 knowing+0.021
Neg logit contributions
 cloth-0.019
 hands-0.019
 dishes-0.019
amas-0.018
 cheeks-0.018
Token ancient
Token ID6156
Feature activation+1.75e+00

Pos logit contributions
 known+0.053
abella+0.049
Fre+0.048
 knowing+0.048
rou+0.046
Neg logit contributions
 cloth-0.038
 dishes-0.038
amas-0.037
 hands-0.036
 hurts-0.036
Token knob
Token ID36842
Feature activation-2.17e-02

Pos logit contributions
 known+0.083
 knowing+0.076
abella+0.076
Fre+0.075
rou+0.074
Neg logit contributions
amas-0.062
 dishes-0.062
 cloth-0.060
 hurts-0.059
 hands-0.058
Token found
Token ID1043
Feature activation-3.72e-02

Pos logit contributions
amas+0.001
 dishes+0.001
 diaper+0.001
 cloth+0.001
 elbows+0.001
Neg logit contributions
 known-0.001
 knowing-0.001
abella-0.001
rou-0.001
 exactly-0.001
Token its
Token ID663
Feature activation-7.51e-01

Pos logit contributions
amas+0.001
 dishes+0.001
 cloth+0.001
 hurts+0.001
 hands+0.001
Neg logit contributions
 known-0.002
abella-0.002
 knowing-0.001
Fre-0.001
rou-0.001
Token rightful
Token ID43133
Feature activation+5.41e+00

Pos logit contributions
amas+0.031
 dishes+0.030
 cloth+0.029
 hurts+0.029
 hands+0.028
Neg logit contributions
 known-0.033
 knowing-0.030
rou-0.028
Fre-0.028
abella-0.028
Token place
Token ID1295
Feature activation-7.74e-01

Pos logit contributions
 known+0.250
abella+0.243
Fre+0.240
rou+0.238
 knowing+0.230
Neg logit contributions
 hands-0.198
 cloth-0.192
 dishes-0.184
 hurts-0.180
 lap-0.174
Token in
Token ID287
Feature activation-3.89e-01

Pos logit contributions
amas+0.031
 dishes+0.030
 cloth+0.030
 hurts+0.030
 cheeks+0.028
Neg logit contributions
 known-0.033
 knowing-0.032
rou-0.030
abella-0.030
Fre-0.029
Token their
Token ID511
Feature activation-4.50e-01

Pos logit contributions
amas+0.017
 hurts+0.016
 dishes+0.016
 cheeks+0.016
 cloth+0.016
Neg logit contributions
 known-0.017
 knowing-0.015
Fre-0.014
abella-0.014
 discovering-0.014
Token house
Token ID2156
Feature activation-2.07e+00

Pos logit contributions
amas+0.019
 cloth+0.017
 dishes+0.017
 hurts+0.017
 sponge+0.016
Neg logit contributions
 known-0.019
 knowing-0.017
 discovering-0.017
Fre-0.016
joy-0.016
Token.
Token ID13
Feature activation-5.33e-02

Pos logit contributions
amas+0.081
 cloth+0.080
 hurts+0.080
 dishes+0.079
 cheeks+0.074
Neg logit contributions
 known-0.090
 knowing-0.088
rou-0.080
abella-0.079
 discovering-0.078
Token waved
Token ID26834
Feature activation+1.10e+00

Pos logit contributions
 known+0.010
Fre+0.010
abella+0.009
 knowing+0.009
 possible+0.009
Neg logit contributions
 cloth-0.007
 hurts-0.007
 hands-0.006
 dishes-0.006
 cheeks-0.006
Token the
Token ID262
Feature activation+5.29e-01

Pos logit contributions
 known+0.048
 knowing+0.046
abella+0.044
Fre+0.044
rou+0.043
Neg logit contributions
amas-0.045
 dishes-0.044
 cloth-0.042
 hurts-0.041
 cheeks-0.040
Token stick
Token ID4859
Feature activation+2.05e+00

Pos logit contributions
 known+0.027
abella+0.026
Fre+0.025
 knowing+0.025
rou+0.024
Neg logit contributions
 cloth-0.018
 dishes-0.017
 hurts-0.017
 hands-0.017
amas-0.016
Token at
Token ID379
Feature activation-5.38e-01

Pos logit contributions
 knowing+0.077
 known+0.077
rou+0.069
 discovering+0.065
abella+0.064
Neg logit contributions
amas-0.096
 dishes-0.095
 cloth-0.089
 cheeks-0.085
 diaper-0.085
Token the
Token ID262
Feature activation+2.76e-01

Pos logit contributions
 cloth+0.021
 dishes+0.021
 hands+0.020
amas+0.020
 hurts+0.019
Neg logit contributions
 known-0.025
abella-0.023
Fre-0.023
rou-0.022
 knowing-0.022
Token le
Token ID443
Feature activation+5.39e+00

Pos logit contributions
 known+0.012
abella+0.012
Fre+0.011
 knowing+0.011
Kat+0.011
Neg logit contributions
 hurts-0.011
 dishes-0.011
 cloth-0.011
 hands-0.011
 button-0.011
Tokenopard
Token ID15478
Feature activation+3.34e+00

Pos logit contributions
 known+0.199
abella+0.178
 knowing+0.174
Fre+0.171
rou+0.159
Neg logit contributions
amas-0.254
 dishes-0.253
 hurts-0.252
 cloth-0.251
 hands-0.246
Token and
Token ID290
Feature activation-4.45e-01

Pos logit contributions
 known+0.142
 knowing+0.139
rou+0.129
abella+0.128
Fre+0.126
Neg logit contributions
amas-0.132
 dishes-0.129
 cloth-0.127
 hurts-0.121
 cheeks-0.118
Token made
Token ID925
Feature activation-1.95e+00

Pos logit contributions
 cloth+0.015
 hurts+0.015
 hands+0.015
 dishes+0.014
 cheeks+0.014
Neg logit contributions
 known-0.022
Fre-0.021
abella-0.020
rou-0.020
 knowing-0.020
Token loud
Token ID7812
Feature activation-1.87e+00

Pos logit contributions
amas+0.084
 cloth+0.080
 dishes+0.080
 hurts+0.078
 cheeks+0.077
Neg logit contributions
 known-0.079
abella-0.074
 knowing-0.072
rou-0.069
Fre-0.069
Token noises
Token ID26782
Feature activation-1.49e+00

Pos logit contributions
 dishes+0.036
amas+0.035
 cloth+0.035
 hurts+0.033
 hands+0.032
Neg logit contributions
 known-0.120
 knowing-0.113
abella-0.112
Fre-0.112
rou-0.110
Token in
Token ID287
Feature activation-4.05e-01

Pos logit contributions
amas+0.067
 dishes+0.065
 hurts+0.063
 cloth+0.061
 hands+0.061
Neg logit contributions
 known-0.067
 knowing-0.064
rou-0.058
abella-0.056
Fre-0.055
Token the
Token ID262
Feature activation+2.96e-01

Pos logit contributions
amas+0.017
 dishes+0.016
 hurts+0.016
 cheeks+0.016
 cloth+0.015
Neg logit contributions
 known-0.018
Fre-0.016
 knowing-0.016
abella-0.015
rou-0.015
Token water
Token ID1660
Feature activation+7.15e-02

Pos logit contributions
 known+0.015
 knowing+0.014
Fre+0.013
abella+0.013
rou+0.013
Neg logit contributions
amas-0.010
 dishes-0.009
 cloth-0.009
 hurts-0.009
 cheeks-0.009
Token.
Token ID13
Feature activation-2.32e-01

Pos logit contributions
 known+0.003
 knowing+0.003
rou+0.003
abella+0.002
Fre+0.002
Neg logit contributions
amas-0.003
 hurts-0.003
 dishes-0.003
 cloth-0.003
 cheeks-0.003
Token O
Token ID440
Feature activation+6.40e-01

Pos logit contributions
 dishes+0.008
 cloth+0.008
amas+0.008
 hands+0.007
 hurts+0.007
Neg logit contributions
 known-0.012
 knowing-0.011
abella-0.011
Fre-0.011
rou-0.010
Tokenstr
Token ID2536
Feature activation+5.38e+00

Pos logit contributions
 known+0.019
 knowing+0.017
abella+0.016
Fre+0.015
joy+0.014
Neg logit contributions
amas-0.034
 dishes-0.034
 cloth-0.033
 hurts-0.033
 hands-0.032
Tokenich
Token ID488
Feature activation+9.46e-01

Pos logit contributions
 known+0.210
abella+0.208
 knowing+0.194
Fre+0.189
 discovering+0.176
Neg logit contributions
amas-0.237
 hands-0.217
 hurts-0.215
 dishes-0.211
el-0.210
Token practiced
Token ID19893
Feature activation+1.14e+00

Pos logit contributions
 known+0.041
abella+0.037
 knowing+0.036
Fre+0.036
rou+0.035
Neg logit contributions
 dishes-0.039
amas-0.038
 cloth-0.038
 hurts-0.037
 hands-0.036
Token and
Token ID290
Feature activation-2.64e-01

Pos logit contributions
 known+0.050
abella+0.046
Fre+0.046
 knowing+0.044
rou+0.044
Neg logit contributions
 cloth-0.044
 dishes-0.044
amas-0.043
 hurts-0.042
 hands-0.042
Token practiced
Token ID19893
Feature activation+1.21e+00

Pos logit contributions
 cloth+0.011
 dishes+0.011
 hands+0.011
 hurts+0.011
 cheeks+0.010
Neg logit contributions
 known-0.010
Fre-0.010
abella-0.010
rou-0.009
 knowing-0.009
Token until
Token ID1566
Feature activation-1.26e+00

Pos logit contributions
 known+0.053
abella+0.050
Fre+0.049
rou+0.047
 knowing+0.045
Neg logit contributions
 cloth-0.048
 dishes-0.047
amas-0.045
 hands-0.045
 hurts-0.044
Token even
Token ID772
Feature activation+7.71e-02

Pos logit contributions
 dishes+0.008
amas+0.008
 cloth+0.008
 hurts+0.008
 hands+0.007
Neg logit contributions
 knowing-0.008
 known-0.008
abella-0.007
rou-0.007
Fre-0.007
Token louder
Token ID27089
Feature activation+1.33e+00

Pos logit contributions
 known+0.003
Fre+0.003
abella+0.003
 knowing+0.003
uth+0.003
Neg logit contributions
 hurts-0.003
 cloth-0.003
 hands-0.003
 dishes-0.003
 cheeks-0.003
Token.
Token ID13
Feature activation+1.09e-01

Pos logit contributions
 known+0.059
 knowing+0.058
abella+0.054
Fre+0.053
rou+0.052
Neg logit contributions
amas-0.052
 cloth-0.051
 dishes-0.051
 hurts-0.049
 hands-0.047
Token
Token ID198
Feature activation+9.70e-02

Pos logit contributions
 known+0.005
abella+0.005
 knowing+0.005
Fre+0.005
rou+0.005
Neg logit contributions
 dishes-0.004
 cloth-0.004
amas-0.004
 hurts-0.004
 hands-0.003
Token
Token ID198
Feature activation-2.66e-01

Pos logit contributions
 known+0.005
 knowing+0.005
rou+0.004
abella+0.004
Fre+0.004
Neg logit contributions
 cloth-0.003
amas-0.003
 hurts-0.003
 hands-0.003
 dishes-0.003
TokenSte
Token ID7447
Feature activation+5.38e+00

Pos logit contributions
 cloth+0.010
amas+0.010
 dishes+0.010
 hurts+0.010
 hands+0.010
Neg logit contributions
 known-0.011
 knowing-0.011
abella-0.010
rou-0.010
Fre-0.010
Tokenlla
Token ID8466
Feature activation+1.99e+00

Pos logit contributions
joy+0.223
 knowing+0.211
 known+0.209
abella+0.204
rou+0.199
Neg logit contributions
 hurts-0.188
 dishes-0.183
 cloth-0.180
 sleeves-0.175
amas-0.173
Token ran
Token ID4966
Feature activation+6.90e-02

Pos logit contributions
 known+0.094
Fre+0.086
abella+0.086
 knowing+0.085
rou+0.083
Neg logit contributions
 dishes-0.071
 cloth-0.071
amas-0.070
 hands-0.070
 hurts-0.070
Token away
Token ID1497
Feature activation+4.28e-01

Pos logit contributions
 known+0.004
Fre+0.004
rou+0.004
abella+0.004
uth+0.003
Neg logit contributions
 hands-0.002
 cloth-0.002
 dishes-0.002
 hurts-0.002
amas-0.002
Token with
Token ID351
Feature activation-5.05e-01

Pos logit contributions
 known+0.021
abella+0.020
Fre+0.020
rou+0.019
 knowing+0.018
Neg logit contributions
 cloth-0.014
 dishes-0.014
 hands-0.014
 hurts-0.014
amas-0.014
Token the
Token ID262
Feature activation+8.81e-02

Pos logit contributions
amas+0.021
 dishes+0.019
 hurts+0.019
 cloth+0.019
 cheeks+0.018
Neg logit contributions
 known-0.023
 knowing-0.020
abella-0.020
Fre-0.020
rou-0.019
Token It
Token ID632
Feature activation+2.61e-02

Pos logit contributions
 dishes+0.003
amas+0.003
 cloth+0.003
 hurts+0.003
 hands+0.003
Neg logit contributions
 known-0.004
abella-0.004
 knowing-0.004
Fre-0.004
rou-0.004
Token turned
Token ID2900
Feature activation+9.83e-01

Pos logit contributions
 known+0.001
 knowing+0.001
abella+0.001
Fre+0.001
rou+0.001
Neg logit contributions
amas-0.001
 dishes-0.001
 cloth-0.001
 hands-0.001
 cheeks-0.001
Token out
Token ID503
Feature activation-1.15e+00

Pos logit contributions
 knowing+0.041
 known+0.040
rou+0.038
abella+0.037
 exactly+0.035
Neg logit contributions
amas-0.038
 dishes-0.036
 lap-0.036
 laptop-0.036
 cheeks-0.035
Token that
Token ID326
Feature activation+6.30e-02

Pos logit contributions
amas+0.050
 dishes+0.049
 cloth+0.048
 hurts+0.047
 cheeks+0.046
Neg logit contributions
 known-0.047
 knowing-0.044
abella-0.042
Fre-0.042
rou-0.041
Token the
Token ID262
Feature activation+4.78e-01

Pos logit contributions
 known+0.003
 knowing+0.002
abella+0.002
Fre+0.002
 discovering+0.002
Neg logit contributions
amas-0.003
 dishes-0.003
 cheeks-0.003
 lap-0.003
 hands-0.003
Token av
Token ID1196
Feature activation+5.35e+00

Pos logit contributions
 known+0.024
abella+0.022
 knowing+0.022
Fre+0.022
rou+0.021
Neg logit contributions
 dishes-0.016
amas-0.016
 cloth-0.016
 hurts-0.015
 hands-0.015
Tokenoc
Token ID420
Feature activation+2.88e+00

Pos logit contributions
 known+0.209
abella+0.203
Fre+0.190
 knowing+0.187
rou+0.177
Neg logit contributions
 cloth-0.237
amas-0.236
 hurts-0.234
 dishes-0.233
 hands-0.231
Tokenados
Token ID22484
Feature activation+6.59e-01

Pos logit contributions
 known+0.165
 knowing+0.154
Fre+0.153
abella+0.153
rou+0.151
Neg logit contributions
 cloth-0.074
 dishes-0.072
amas-0.071
 hurts-0.068
 hands-0.066
Token were
Token ID547
Feature activation-1.77e+00

Pos logit contributions
 known+0.023
 knowing+0.020
 seeing+0.019
 solved+0.018
 actually+0.018
Neg logit contributions
amas-0.032
 diaper-0.030
 lap-0.030
 dishes-0.030
 hands-0.029
Token bad
Token ID2089
Feature activation+8.72e-02

Pos logit contributions
amas+0.077
 dishes+0.074
 cloth+0.074
 hurts+0.072
 cheeks+0.071
Neg logit contributions
 known-0.074
 knowing-0.068
abella-0.064
rou-0.063
Fre-0.062
Token and
Token ID290
Feature activation-7.67e-01

Pos logit contributions
 known+0.003
 knowing+0.003
abella+0.003
rou+0.003
 discovering+0.003
Neg logit contributions
amas-0.004
 cloth-0.004
 dishes-0.004
 hurts-0.004
 lap-0.004
Token He
Token ID679
Feature activation-1.38e-03

Pos logit contributions
 dishes+0.017
 cloth+0.016
 sheets+0.015
amas+0.015
 hurts+0.015
Neg logit contributions
 known-0.020
 knowing-0.019
abella-0.019
Fre-0.018
 discovering-0.017
Token said
Token ID531
Feature activation+1.05e+00

Pos logit contributions
 cloth+0.000
 dishes+0.000
 hurts+0.000
amas+0.000
 hands+0.000
Neg logit contributions
 known-0.000
Fre-0.000
abella-0.000
 knowing-0.000
rou-0.000
Token,
Token ID11
Feature activation+5.31e-01

Pos logit contributions
 known+0.048
 knowing+0.044
abella+0.043
Fre+0.043
rou+0.042
Neg logit contributions
amas-0.040
 dishes-0.040
 cloth-0.039
 hurts-0.038
 hands-0.038
Token "
Token ID366
Feature activation-1.89e-01

Pos logit contributions
 known+0.020
abella+0.018
 knowing+0.018
Fre+0.017
rou+0.017
Neg logit contributions
 dishes-0.025
 cloth-0.024
amas-0.024
 hurts-0.024
 hands-0.024
TokenRob
Token ID14350
Feature activation+1.99e-01

Pos logit contributions
 dishes+0.009
 hands+0.009
amas+0.009
 cheeks+0.009
 cloth+0.009
Neg logit contributions
 known-0.007
 knowing-0.006
abella-0.006
Fre-0.006
rou-0.006
Tokenot
Token ID313
Feature activation-5.25e+00

Pos logit contributions
 known+0.009
 knowing+0.008
rou+0.008
abella+0.007
Fre+0.007
Neg logit contributions
amas-0.008
 dishes-0.008
 cloth-0.008
 hurts-0.008
 cheeks-0.007
Token,
Token ID11
Feature activation-1.18e+00

Pos logit contributions
amas+0.230
 dishes+0.227
 cloth+0.225
 hands+0.223
 lap+0.218
Neg logit contributions
 known-0.207
rou-0.183
 knowing-0.183
abella-0.181
Fre-0.169
Token are
Token ID389
Feature activation-1.93e+00

Pos logit contributions
amas+0.060
 dishes+0.057
 hands+0.056
 cheeks+0.056
 hurts+0.055
Neg logit contributions
 known-0.038
 knowing-0.036
abella-0.034
rou-0.034
 seeing-0.032
Token you
Token ID345
Feature activation-3.45e-01

Pos logit contributions
 dishes+0.058
 cloth+0.058
amas+0.057
 hurts+0.056
 hands+0.055
Neg logit contributions
 known-0.104
abella-0.096
Fre-0.095
 knowing-0.095
rou-0.093
Token okay
Token ID8788
Feature activation-1.77e-01

Pos logit contributions
amas+0.017
 dishes+0.017
 cloth+0.017
 cheeks+0.017
 backs+0.016
Neg logit contributions
 known-0.012
 knowing-0.011
rou-0.010
abella-0.010
 impressed-0.009
Token?"
Token ID1701
Feature activation-2.99e-02

Pos logit contributions
 dishes+0.007
amas+0.007
 cloth+0.007
 cheeks+0.006
 hands+0.006
Neg logit contributions
 known-0.008
 knowing-0.007
abella-0.007
Fre-0.007
rou-0.007
Token
Token ID6184
Feature activation+3.03e-02

Pos logit contributions
 known+0.000
abella+0.000
 knowing+0.000
Fre+0.000
rou+0.000
Neg logit contributions
 dishes-0.001
amas-0.001
 cloth-0.001
 hurts-0.001
 hands-0.001
Token
Token ID95
Feature activation+5.59e-01

Pos logit contributions
 known+0.001
abella+0.001
Fre+0.001
 knowing+0.001
rou+0.001
Neg logit contributions
 dishes-0.001
 cloth-0.001
amas-0.001
 hurts-0.001
 hands-0.001
Token
Token ID26391
Feature activation+2.80e+00

Pos logit contributions
 known+0.026
abella+0.024
 knowing+0.023
Fre+0.023
rou+0.023
Neg logit contributions
 dishes-0.021
 cloth-0.021
amas-0.021
 hurts-0.020
 hands-0.020
Token
Token ID129
Feature activation+8.42e-02

Pos logit contributions
 known+0.120
abella+0.109
 knowing+0.108
Fre+0.107
rou+0.105
Neg logit contributions
 dishes-0.115
 cloth-0.113
amas-0.110
 hurts-0.108
 hands-0.107
Token
Token ID241
Feature activation+2.74e-01

Pos logit contributions
 known+0.004
 knowing+0.003
Fre+0.003
abella+0.003
rou+0.003
Neg logit contributions
amas-0.004
 hurts-0.003
 dishes-0.003
 cloth-0.003
 hands-0.003
Tokenand
Token ID392
Feature activation-5.25e+00

Pos logit contributions
 known+0.011
abella+0.010
 knowing+0.010
Fre+0.010
rou+0.010
Neg logit contributions
 cloth-0.011
 dishes-0.011
amas-0.011
 hurts-0.011
 hands-0.011
Token I
Token ID314
Feature activation+1.20e+00

Pos logit contributions
amas+0.213
 dishes+0.213
 cloth+0.211
 hurts+0.204
 hands+0.202
Neg logit contributions
 known-0.225
abella-0.207
 knowing-0.202
Fre-0.201
rou-0.197
Token will
Token ID481
Feature activation-2.07e-02

Pos logit contributions
 known+0.052
abella+0.046
Fre+0.045
 knowing+0.044
rou+0.044
Neg logit contributions
amas-0.051
 dishes-0.050
 cloth-0.049
 cheeks-0.048
 hands-0.047
Token help
Token ID1037
Feature activation-2.07e+00

Pos logit contributions
amas+0.001
 dishes+0.001
 cloth+0.001
 hands+0.001
 hurts+0.001
Neg logit contributions
 known-0.001
 knowing-0.001
abella-0.001
Fre-0.001
rou-0.001
Token you
Token ID345
Feature activation+2.47e-01

Pos logit contributions
amas+0.077
 cloth+0.077
 dishes+0.076
 hurts+0.074
 cheeks+0.073
Neg logit contributions
 known-0.092
abella-0.088
 knowing-0.086
Fre-0.084
rou-0.081
Token.
Token ID13
Feature activation-2.40e-01

Pos logit contributions
 known+0.008
 knowing+0.008
rou+0.007
Fre+0.007
abella+0.007
Neg logit contributions
amas-0.012
 dishes-0.012
 cheeks-0.012
 cloth-0.011
 sheets-0.011
Token
Token ID6184
Feature activation+9.55e-02

Pos logit contributions
 known+0.001
abella+0.001
Fre+0.001
 knowing+0.001
rou+0.001
Neg logit contributions
 dishes-0.001
 cloth-0.001
amas-0.001
 hurts-0.001
 hands-0.001
Token
Token ID95
Feature activation+4.65e-01

Pos logit contributions
 known+0.004
 knowing+0.004
abella+0.003
Fre+0.003
rou+0.003
Neg logit contributions
amas-0.004
 cloth-0.004
 dishes-0.004
 hurts-0.004
 cheeks-0.004
Token
Token ID26391
Feature activation+2.70e+00

Pos logit contributions
 known+0.021
 knowing+0.019
Fre+0.019
abella+0.019
rou+0.019
Neg logit contributions
amas-0.018
 dishes-0.017
 hurts-0.017
 cloth-0.017
 hands-0.016
Token
Token ID129
Feature activation+1.53e-01

Pos logit contributions
 known+0.120
abella+0.110
Fre+0.108
 knowing+0.108
rou+0.105
Neg logit contributions
 dishes-0.106
 cloth-0.105
amas-0.104
 hurts-0.102
 hands-0.100
Token
Token ID241
Feature activation+3.91e-02

Pos logit contributions
 known+0.007
 knowing+0.006
Fre+0.006
abella+0.006
rou+0.006
Neg logit contributions
amas-0.006
 hurts-0.006
 dishes-0.006
 hands-0.006
 cheeks-0.006
Tokenand
Token ID392
Feature activation-5.23e+00

Pos logit contributions
 known+0.002
abella+0.002
 knowing+0.002
Fre+0.002
rou+0.002
Neg logit contributions
 cloth-0.001
 dishes-0.001
amas-0.001
 hurts-0.001
 hands-0.001
Token you
Token ID345
Feature activation+7.02e-01

Pos logit contributions
 dishes+0.208
 cloth+0.208
amas+0.205
 hurts+0.203
 hands+0.201
Neg logit contributions
 known-0.227
abella-0.205
 knowing-0.204
Fre-0.203
rou-0.196
Token will
Token ID481
Feature activation+1.39e-01

Pos logit contributions
 known+0.031
abella+0.027
Fre+0.027
rou+0.026
 knowing+0.026
Neg logit contributions
amas-0.029
 dishes-0.029
 cloth-0.027
 cheeks-0.027
 hurts-0.027
Token be
Token ID307
Feature activation+2.34e-01

Pos logit contributions
 known+0.006
abella+0.006
 knowing+0.006
Fre+0.006
rou+0.006
Neg logit contributions
 dishes-0.005
amas-0.005
 cloth-0.005
 hurts-0.005
 hands-0.005
Token happy
Token ID3772
Feature activation-1.22e+00

Pos logit contributions
 known+0.010
abella+0.010
Fre+0.010
rou+0.009
 knowing+0.009
Neg logit contributions
 hands-0.009
 dishes-0.009
 cloth-0.009
 hurts-0.009
amas-0.009
Token.
Token ID13
Feature activation-4.06e-01

Pos logit contributions
 dishes+0.049
amas+0.049
 cloth+0.048
 hurts+0.047
 hands+0.046
Neg logit contributions
 known-0.053
abella-0.049
 knowing-0.048
Fre-0.048
rou-0.046
Token Tim
Token ID5045
Feature activation-2.68e-01

Pos logit contributions
 dishes+0.009
 cloth+0.009
amas+0.009
 diaper+0.008
 hurts+0.008
Neg logit contributions
 known-0.009
abella-0.008
 knowing-0.008
Fre-0.008
rou-0.008
Token said
Token ID531
Feature activation+1.06e+00

Pos logit contributions
 dishes+0.011
amas+0.011
 cloth+0.011
 hurts+0.011
 hands+0.011
Neg logit contributions
 known-0.011
abella-0.010
 knowing-0.010
Fre-0.010
rou-0.010
Token,
Token ID11
Feature activation+3.99e-01

Pos logit contributions
 known+0.047
abella+0.042
 knowing+0.042
Fre+0.041
rou+0.040
Neg logit contributions
amas-0.042
 dishes-0.042
 cloth-0.042
 hurts-0.040
 hands-0.040
Token "
Token ID366
Feature activation-1.28e-01

Pos logit contributions
 known+0.015
abella+0.013
 knowing+0.013
Fre+0.013
rou+0.012
Neg logit contributions
 dishes-0.019
amas-0.018
 cloth-0.018
 hurts-0.018
 hands-0.018
TokenRob
Token ID14350
Feature activation+3.00e-01

Pos logit contributions
 dishes+0.006
amas+0.006
 cheeks+0.006
 hands+0.006
 cloth+0.006
Neg logit contributions
 known-0.005
Fre-0.005
abella-0.005
 knowing-0.004
rou-0.004
Tokenot
Token ID313
Feature activation-5.10e+00

Pos logit contributions
 known+0.012
 knowing+0.011
rou+0.011
abella+0.011
Fre+0.010
Neg logit contributions
amas-0.013
 dishes-0.013
 cloth-0.013
 hurts-0.012
 cheeks-0.012
Token,
Token ID11
Feature activation-9.39e-01

Pos logit contributions
amas+0.205
 cloth+0.204
 dishes+0.203
 hands+0.196
 cheeks+0.192
Neg logit contributions
 known-0.222
abella-0.201
 knowing-0.199
Fre-0.195
rou-0.192
Token you
Token ID345
Feature activation+3.97e-01

Pos logit contributions
amas+0.039
 dishes+0.037
 cloth+0.036
 hurts+0.036
 cheeks+0.035
Neg logit contributions
 known-0.040
 knowing-0.037
abella-0.036
rou-0.035
Fre-0.035
Token must
Token ID1276
Feature activation+8.70e-01

Pos logit contributions
 known+0.018
abella+0.016
rou+0.016
 knowing+0.016
Fre+0.016
Neg logit contributions
amas-0.016
 dishes-0.015
 cloth-0.015
 cheeks-0.014
 diaper-0.014
Token yield
Token ID7800
Feature activation+1.52e+00

Pos logit contributions
 known+0.043
abella+0.041
Fre+0.041
 knowing+0.041
rou+0.040
Neg logit contributions
 cloth-0.029
 dishes-0.029
 hurts-0.028
amas-0.027
 hands-0.027
Token when
Token ID618
Feature activation-8.55e-01

Pos logit contributions
 known+0.062
 knowing+0.058
abella+0.056
rou+0.055
Fre+0.054
Neg logit contributions
amas-0.065
 cloth-0.063
 dishes-0.062
 cheeks-0.061
 diaper-0.059
TokenMy
Token ID3666
Feature activation-6.22e-02

Pos logit contributions
 dishes+0.033
amas+0.033
 cloth+0.032
 hands+0.031
 hurts+0.031
Neg logit contributions
 known-0.033
Fre-0.030
abella-0.030
 knowing-0.029
rou-0.029
Token name
Token ID1438
Feature activation+8.67e-01

Pos logit contributions
amas+0.002
 dishes+0.002
 hurts+0.002
 cloth+0.002
bars+0.002
Neg logit contributions
 known-0.003
 knowing-0.003
rou-0.003
Fre-0.003
abella-0.003
Token is
Token ID318
Feature activation-2.18e-01

Pos logit contributions
 known+0.029
 indeed+0.025
 knowing+0.025
rou+0.024
Fre+0.024
Neg logit contributions
 elbows-0.041
amas-0.041
 sponge-0.040
 sleeves-0.040
 cloth-0.040
Token Hop
Token ID9996
Feature activation-5.38e-01

Pos logit contributions
 dishes+0.009
 cloth+0.009
amas+0.009
 hurts+0.009
 hands+0.009
Neg logit contributions
 known-0.009
abella-0.008
 knowing-0.008
Fre-0.008
rou-0.008
Tokensc
Token ID1416
Feature activation-1.08e+00

Pos logit contributions
 dishes+0.021
amas+0.021
 cloth+0.021
 hurts+0.020
 hands+0.020
Neg logit contributions
 known-0.024
abella-0.022
 knowing-0.022
Fre-0.022
rou-0.021
Tokenot
Token ID313
Feature activation-5.06e+00

Pos logit contributions
amas+0.038
 cloth+0.038
 dishes+0.037
chair+0.035
bars+0.035
Neg logit contributions
rou-0.046
Fre-0.044
Ag-0.044
 known-0.044
abella-0.043
Tokench
Token ID354
Feature activation-2.31e+00

Pos logit contributions
amas+0.230
 cloth+0.219
 cheeks+0.219
 dishes+0.215
 lap+0.210
Neg logit contributions
 known-0.188
 knowing-0.175
rou-0.169
Fre-0.163
 ze-0.160
Token,
Token ID11
Feature activation-5.00e-01

Pos logit contributions
 cloth+0.100
amas+0.099
 dishes+0.098
bars+0.097
 cheeks+0.094
Neg logit contributions
 known-0.083
 knowing-0.079
rou-0.079
Fre-0.076
abella-0.074
Token and
Token ID290
Feature activation-2.79e-01

Pos logit contributions
 dishes+0.020
amas+0.020
 cloth+0.020
 hurts+0.019
 hands+0.019
Neg logit contributions
 known-0.022
abella-0.020
 knowing-0.020
Fre-0.020
rou-0.019
Token I
Token ID314
Feature activation+1.08e+00

Pos logit contributions
 dishes+0.009
amas+0.009
 cloth+0.009
 hurts+0.009
 hands+0.009
Neg logit contributions
 known-0.014
abella-0.013
 knowing-0.013
Fre-0.013
rou-0.012
Token'm
Token ID1101
Feature activation+1.14e-01

Pos logit contributions
 known+0.049
abella+0.045
 knowing+0.044
Fre+0.044
rou+0.043
Neg logit contributions
 dishes-0.041
amas-0.041
 cloth-0.041
 hurts-0.039
 hands-0.039
Token,
Token ID11
Feature activation+3.82e-01

Pos logit contributions
 known+0.060
 knowing+0.056
abella+0.054
rou+0.051
 discovering+0.051
Neg logit contributions
 cloth-0.054
amas-0.053
 hurts-0.053
 dishes-0.052
 cheeks-0.050
Token which
Token ID543
Feature activation-1.01e-01

Pos logit contributions
 known+0.014
 knowing+0.013
abella+0.013
Fre+0.012
rou+0.012
Neg logit contributions
amas-0.018
 dishes-0.018
 cloth-0.017
 hurts-0.017
 hands-0.017
Token was
Token ID373
Feature activation-2.42e-01

Pos logit contributions
amas+0.005
 cheeks+0.005
 hands+0.005
 elbows+0.005
 lap+0.005
Neg logit contributions
 known-0.004
abella-0.003
 knowing-0.003
 possible-0.003
 seeing-0.003
Token an
Token ID281
Feature activation+2.26e-01

Pos logit contributions
 dishes+0.010
amas+0.010
 cloth+0.010
 hurts+0.009
 hands+0.009
Neg logit contributions
 known-0.011
abella-0.010
 knowing-0.009
Fre-0.009
rou-0.009
Token err
Token ID11454
Feature activation-8.14e-01

Pos logit contributions
 known+0.010
 knowing+0.010
abella+0.009
rou+0.009
Fre+0.009
Neg logit contributions
amas-0.009
 cloth-0.009
 hurts-0.008
 dishes-0.008
 hands-0.008
Tokenand
Token ID392
Feature activation-4.96e+00

Pos logit contributions
 cloth+0.032
amas+0.032
 dishes+0.031
 hurts+0.031
 hands+0.030
Neg logit contributions
 known-0.036
 knowing-0.033
abella-0.033
Fre-0.033
rou-0.032
Token he
Token ID339
Feature activation-8.35e-01

Pos logit contributions
amas+0.204
 cloth+0.200
 dishes+0.195
 hurts+0.194
 hands+0.188
Neg logit contributions
 known-0.208
 knowing-0.201
abella-0.199
rou-0.193
Fre-0.187
Token didn
Token ID1422
Feature activation-3.55e-01

Pos logit contributions
amas+0.039
 cloth+0.039
 dishes+0.039
 hands+0.038
 cheeks+0.037
Neg logit contributions
 known-0.032
abella-0.028
 knowing-0.027
Fre-0.026
rou-0.026
Token't
Token ID470
Feature activation+2.83e-01

Pos logit contributions
 hands+0.015
 cheeks+0.015
 cloth+0.015
 elbows+0.015
 dishes+0.015
Neg logit contributions
 known-0.014
abella-0.014
 possible-0.013
 knowing-0.012
Fre-0.012
Token understand
Token ID1833
Feature activation-1.25e+00

Pos logit contributions
 known+0.012
 knowing+0.011
abella+0.010
Fre+0.010
 possible+0.010
Neg logit contributions
amas-0.013
 hands-0.012
 cheeks-0.012
 dishes-0.012
 cloth-0.012
Token or
Token ID393
Feature activation+2.63e-01

Pos logit contributions
amas+0.052
 cloth+0.050
 dishes+0.049
 hurts+0.048
 cheeks+0.047
Neg logit contributions
 known-0.053
 knowing-0.050
abella-0.049
rou-0.046
Fre-0.046
Token and
Token ID290
Feature activation-2.81e-01

Pos logit contributions
 knowing+0.015
 discovering+0.013
 exactly+0.013
rou+0.012
 seeing+0.012
Neg logit contributions
amas-0.023
 hurts-0.022
akes-0.022
iren-0.022
 elbows-0.021
Token said
Token ID531
Feature activation+1.35e+00

Pos logit contributions
 cloth+0.010
 hands+0.010
 dishes+0.010
 cheeks+0.009
 hurts+0.009
Neg logit contributions
 known-0.013
Fre-0.013
abella-0.012
rou-0.012
uth-0.012
Token,
Token ID11
Feature activation+5.84e-01

Pos logit contributions
 known+0.059
 knowing+0.054
abella+0.054
Fre+0.052
rou+0.051
Neg logit contributions
amas-0.054
 dishes-0.053
 cloth-0.053
 hurts-0.051
 hands-0.051
Token "
Token ID366
Feature activation-3.77e-01

Pos logit contributions
 known+0.023
abella+0.021
Fre+0.020
 knowing+0.020
rou+0.020
Neg logit contributions
 dishes-0.026
amas-0.026
 cloth-0.026
 hurts-0.025
 hands-0.025
TokenRob
Token ID14350
Feature activation+4.33e-01

Pos logit contributions
 dishes+0.016
amas+0.015
 hands+0.015
 cloth+0.015
 hurts+0.015
Neg logit contributions
 known-0.016
 knowing-0.015
abella-0.015
Fre-0.014
rou-0.014
Tokenot
Token ID313
Feature activation-4.94e+00

Pos logit contributions
 known+0.018
rou+0.017
 knowing+0.017
abella+0.015
Fre+0.015
Neg logit contributions
amas-0.018
 dishes-0.017
 cloth-0.017
 hurts-0.016
 lap-0.016
Token,
Token ID11
Feature activation-1.09e+00

Pos logit contributions
amas+0.217
 cloth+0.211
 hands+0.211
 lap+0.209
 dishes+0.208
Neg logit contributions
 known-0.188
 knowing-0.170
abella-0.169
rou-0.168
Fre-0.148
Token remind
Token ID7101
Feature activation-6.97e-01

Pos logit contributions
amas+0.050
 dishes+0.046
 elbows+0.045
 diaper+0.044
 sleeves+0.044
Neg logit contributions
 known-0.039
 knowing-0.039
abella-0.037
rou-0.037
 exactly-0.036
Token me
Token ID502
Feature activation-8.10e-01

Pos logit contributions
amas+0.028
 cloth+0.028
 dishes+0.028
 hurts+0.027
 cheeks+0.027
Neg logit contributions
 known-0.030
abella-0.027
Fre-0.026
 knowing-0.026
rou-0.025
Token to
Token ID284
Feature activation-6.91e-01

Pos logit contributions
amas+0.034
 dishes+0.031
 cheeks+0.031
 elbows+0.031
 backs+0.030
Neg logit contributions
 knowing-0.033
 known-0.033
rou-0.033
abella-0.030
 exactly-0.029
Token clean
Token ID3424
Feature activation-5.90e-01

Pos logit contributions
amas+0.022
 dishes+0.020
 hurts+0.020
 hands+0.019
 cheeks+0.019
Neg logit contributions
 known-0.037
abella-0.033
 knowing-0.033
rou-0.032
Fre-0.032
Token her
Token ID607
Feature activation-2.03e+00

Pos logit contributions
 known+0.035
 knowing+0.033
abella+0.032
rou+0.030
Fre+0.029
Neg logit contributions
amas-0.030
 dishes-0.028
 cloth-0.028
 cheeks-0.028
 sponge-0.027
Token back
Token ID736
Feature activation-1.65e+00

Pos logit contributions
amas+0.103
iren+0.084
 cheeks+0.081
akes+0.079
op+0.079
Neg logit contributions
 known-0.077
 knowing-0.073
rou-0.068
joy-0.063
 possible-0.063
Token her
Token ID607
Feature activation-2.02e+00

Pos logit contributions
amas+0.065
 cloth+0.060
 dishes+0.060
 cheeks+0.058
 hurts+0.058
Neg logit contributions
 known-0.076
 knowing-0.070
abella-0.069
rou-0.067
Fre-0.067
Token te
Token ID573
Feature activation-1.36e-01

Pos logit contributions
amas+0.079
iren+0.071
akes+0.062
room+0.062
bars+0.060
Neg logit contributions
 knowing-0.095
 known-0.086
rou-0.084
joy-0.082
 reality-0.081
Tokenap
Token ID499
Feature activation-1.97e+00

Pos logit contributions
 cloth+0.006
amas+0.006
 hurts+0.006
 dishes+0.006
 cheeks+0.006
Neg logit contributions
 known-0.005
rou-0.004
Fre-0.004
abella-0.004
uth-0.004
Tokenot
Token ID313
Feature activation-4.85e+00

Pos logit contributions
amas+0.090
 dishes+0.084
 cloth+0.082
 hurts+0.079
 cheeks+0.077
Neg logit contributions
 knowing-0.077
 known-0.075
rou-0.067
abella-0.067
 exactly-0.062
Token.
Token ID13
Feature activation-3.26e-01

Pos logit contributions
amas+0.206
 cloth+0.198
 dishes+0.190
 cheeks+0.187
 sponge+0.186
Neg logit contributions
 knowing-0.190
 known-0.189
rou-0.175
abella-0.174
 ze-0.160
Token She
Token ID1375
Feature activation+5.17e-02

Pos logit contributions
 dishes+0.012
 cloth+0.011
amas+0.011
 hurts+0.011
 hands+0.011
Neg logit contributions
 known-0.016
 knowing-0.015
abella-0.015
Fre-0.014
rou-0.013
Token hoped
Token ID10719
Feature activation-6.04e-01

Pos logit contributions
 known+0.002
Fre+0.002
 knowing+0.002
abella+0.002
rou+0.002
Neg logit contributions
 cloth-0.002
 dishes-0.002
 hurts-0.002
amas-0.002
 hands-0.002
Token Mom
Token ID11254
Feature activation-4.19e-01

Pos logit contributions
amas+0.025
 dishes+0.024
 cloth+0.024
 hurts+0.024
 hands+0.023
Neg logit contributions
 known-0.026
abella-0.024
 knowing-0.024
Fre-0.022
rou-0.022
Token and
Token ID290
Feature activation-1.26e+00

Pos logit contributions
amas+0.025
 dishes+0.025
 cheeks+0.024
 cloth+0.024
 hands+0.024
Neg logit contributions
 known-0.011
 knowing-0.009
rou-0.008
abella-0.008
 seeing-0.007
Token?"
Token ID1701
Feature activation-2.18e-01

Pos logit contributions
amas+0.041
 cheeks+0.041
 hands+0.040
 sleeves+0.040
 dishes+0.039
Neg logit contributions
 known-0.038
 knowing-0.034
Fre-0.033
 seeing-0.032
abella-0.032
Token
Token ID220
Feature activation-1.17e+00

Pos logit contributions
 dishes+0.009
amas+0.009
 cloth+0.009
 hurts+0.009
 hands+0.008
Neg logit contributions
 known-0.009
abella-0.009
Fre-0.008
 knowing-0.008
rou-0.008
Token
Token ID198
Feature activation-5.20e-01

Pos logit contributions
 dishes+0.045
amas+0.045
 cloth+0.044
 hurts+0.043
 hands+0.043
Neg logit contributions
 known-0.053
abella-0.048
 knowing-0.048
Fre-0.048
rou-0.046
Token
Token ID198
Feature activation-6.68e-01

Pos logit contributions
 cloth+0.020
amas+0.020
 dishes+0.020
 hurts+0.019
 hands+0.019
Neg logit contributions
 known-0.023
 knowing-0.021
abella-0.021
Fre-0.021
rou-0.020
TokenF
Token ID37
Feature activation-6.11e-01

Pos logit contributions
 dishes+0.029
 cloth+0.029
amas+0.029
 hurts+0.028
 hands+0.028
Neg logit contributions
 known-0.027
abella-0.024
 knowing-0.024
Fre-0.024
rou-0.023
Tokenay
Token ID323
Feature activation-4.78e+00

Pos logit contributions
 cloth+0.025
 dishes+0.025
amas+0.025
 hurts+0.024
 hands+0.024
Neg logit contributions
 known-0.026
 knowing-0.023
abella-0.023
Fre-0.023
rou-0.022
Token replied
Token ID8712
Feature activation+2.37e-01

Pos logit contributions
 cloth+0.190
 hands+0.186
amas+0.185
 hurts+0.184
 dishes+0.184
Neg logit contributions
 known-0.213
 knowing-0.200
Fre-0.190
rou-0.182
abella-0.180
Token,
Token ID11
Feature activation-4.84e-01

Pos logit contributions
 known+0.011
abella+0.010
 knowing+0.010
Fre+0.010
rou+0.009
Neg logit contributions
 dishes-0.009
amas-0.009
 cloth-0.009
 hurts-0.009
 hands-0.008
Token "
Token ID366
Feature activation-9.95e-01

Pos logit contributions
 cloth+0.021
 dishes+0.021
amas+0.021
 hands+0.020
 hurts+0.020
Neg logit contributions
 known-0.020
abella-0.018
Fre-0.017
 knowing-0.017
rou-0.017
TokenIt
Token ID1026
Feature activation-3.52e-01

Pos logit contributions
 dishes+0.042
amas+0.041
 cloth+0.040
 hands+0.040
 cheeks+0.039
Neg logit contributions
 known-0.043
Fre-0.040
abella-0.039
 knowing-0.038
rou-0.037
Token's
Token ID338
Feature activation-8.34e-02

Pos logit contributions
amas+0.013
 dishes+0.012
 cloth+0.012
 cheeks+0.012
 elbows+0.012
Neg logit contributions
 known-0.016
abella-0.015
rou-0.015
 knowing-0.014
Fre-0.014
Token dull
Token ID19222
Feature activation-6.78e-01

Pos logit contributions
 known+0.009
 knowing+0.008
abella+0.008
rou+0.008
Fre+0.007
Neg logit contributions
amas-0.010
 dishes-0.009
 hurts-0.009
 elbows-0.008
 cloth-0.008
Tokenâ
Token ID22940
Feature activation+3.22e-01

Pos logit contributions
amas+0.025
 hurts+0.021
 dishes+0.020
 bleed+0.020
 elbows+0.019
Neg logit contributions
 known-0.032
 knowing-0.032
abella-0.029
rou-0.028
joy-0.028
Token
Token ID26391
Feature activation+2.94e+00

Pos logit contributions
 known+0.012
abella+0.011
 knowing+0.011
Fre+0.011
rou+0.010
Neg logit contributions
 dishes-0.015
 cloth-0.014
amas-0.014
 hands-0.014
 hurts-0.014
Token
Token ID447
Feature activation+1.74e+00

Pos logit contributions
 known+0.133
abella+0.120
 knowing+0.119
Fre+0.119
rou+0.116
Neg logit contributions
amas-0.113
 dishes-0.111
 cloth-0.111
 hurts-0.108
 hands-0.108
Token
Token ID251
Feature activation-6.98e-01

Pos logit contributions
 known+0.098
abella+0.090
Fre+0.089
 knowing+0.088
rou+0.088
Neg logit contributions
 dishes-0.049
 hurts-0.047
 cloth-0.046
 hands-0.046
amas-0.045
Tokenand
Token ID392
Feature activation-4.76e+00

Pos logit contributions
amas+0.029
 hurts+0.029
 dishes+0.029
 cloth+0.028
 diaper+0.028
Neg logit contributions
 known-0.029
abella-0.027
 knowing-0.027
Fre-0.026
rou-0.024
Token he
Token ID339
Feature activation-7.09e-01

Pos logit contributions
 cloth+0.186
 dishes+0.184
 hands+0.181
 hurts+0.180
amas+0.179
Neg logit contributions
 known-0.204
Fre-0.186
 knowing-0.183
rou-0.179
abella-0.176
Token had
Token ID550
Feature activation+8.62e-01

Pos logit contributions
 dishes+0.030
amas+0.030
 cloth+0.029
 hurts+0.029
 hands+0.028
Neg logit contributions
 known-0.030
abella-0.027
 knowing-0.026
Fre-0.026
rou-0.026
Token to
Token ID284
Feature activation-9.03e-01

Pos logit contributions
 known+0.035
abella+0.032
Fre+0.032
 knowing+0.032
rou+0.031
Neg logit contributions
 dishes-0.036
 cloth-0.036
amas-0.036
 hurts-0.035
 hands-0.035
Token choose
Token ID3853
Feature activation+1.69e+00

Pos logit contributions
amas+0.034
 hurts+0.031
 dishes+0.029
chair+0.029
 cheeks+0.029
Neg logit contributions
 known-0.044
abella-0.037
 knowing-0.037
Fre-0.037
 seeing-0.035
Token which
Token ID543
Feature activation+2.56e-01

Pos logit contributions
 known+0.075
abella+0.068
Fre+0.067
 knowing+0.067
rou+0.065
Neg logit contributions
 dishes-0.067
 cloth-0.066
amas-0.066
 hurts-0.064
 hands-0.064
Token small
Token ID1402
Feature activation-1.42e+00

Pos logit contributions
rou+0.054
 known+0.053
 knowing+0.052
Fre+0.050
abella+0.049
Neg logit contributions
amas-0.054
 dishes-0.050
 cloth-0.049
 hurts-0.048
 cheeks-0.048
Token and
Token ID290
Feature activation-4.43e-01

Pos logit contributions
amas+0.058
 cloth+0.058
 dishes+0.057
 hurts+0.056
 cheeks+0.055
Neg logit contributions
 known-0.060
 knowing-0.056
abella-0.054
rou-0.053
Fre-0.053
Token they
Token ID484
Feature activation-9.55e-01

Pos logit contributions
amas+0.020
 dishes+0.018
 cloth+0.018
 lap+0.018
 hurts+0.018
Neg logit contributions
 known-0.018
 knowing-0.016
abella-0.016
rou-0.015
 impressed-0.015
Token made
Token ID925
Feature activation-2.22e+00

Pos logit contributions
amas+0.055
 lap+0.050
 dishes+0.049
iren+0.048
 cloth+0.047
Neg logit contributions
 known-0.030
 knowing-0.024
abella-0.023
 truly-0.023
 exactly-0.021
Token his
Token ID465
Feature activation-1.85e+00

Pos logit contributions
amas+0.092
 hurts+0.090
 cloth+0.089
 cheeks+0.088
 sleeves+0.087
Neg logit contributions
 known-0.093
 knowing-0.087
abella-0.086
rou-0.080
 discovering-0.077
Token toes
Token ID23932
Feature activation-4.75e+00

Pos logit contributions
amas+0.074
iren+0.072
hing+0.065
 cloth+0.064
 dishes+0.064
Neg logit contributions
 known-0.079
 knowing-0.075
joy-0.069
abella-0.069
 discovering-0.067
Token hurt
Token ID5938
Feature activation-2.44e-01

Pos logit contributions
amas+0.193
 dishes+0.167
 cheeks+0.166
 hands+0.164
 hurts+0.161
Neg logit contributions
rou-0.197
 known-0.197
 knowing-0.196
 exactly-0.182
 impressed-0.182
Token.
Token ID13
Feature activation+1.47e-01

Pos logit contributions
amas+0.011
 hurts+0.010
 cloth+0.010
 dishes+0.010
 lap+0.010
Neg logit contributions
 knowing-0.010
 known-0.009
rou-0.008
abella-0.008
 exactly-0.008
Token He
Token ID679
Feature activation-1.24e-01

Pos logit contributions
 known+0.006
abella+0.005
 knowing+0.005
Fre+0.005
 discovering+0.005
Neg logit contributions
 dishes-0.006
 hurts-0.006
 cloth-0.006
amas-0.006
 cheeks-0.006
Token wanted
Token ID2227
Feature activation-2.77e-01

Pos logit contributions
amas+0.006
 dishes+0.006
 cloth+0.005
 hurts+0.005
 hands+0.005
Neg logit contributions
 known-0.005
abella-0.004
 knowing-0.004
Fre-0.004
rou-0.004
Token something
Token ID1223
Feature activation+8.41e-01

Pos logit contributions
amas+0.014
 cheeks+0.012
 hurts+0.012
akes+0.012
 dishes+0.012
Neg logit contributions
 known-0.011
 knowing-0.010
abella-0.009
 discovering-0.009
 seeing-0.009
Token her
Token ID607
Feature activation-1.57e+00

Pos logit contributions
amas+0.048
 cloth+0.048
 dishes+0.047
 hands+0.047
 hurts+0.046
Neg logit contributions
 known-0.049
 knowing-0.044
abella-0.044
Fre-0.042
rou-0.042
Token elephant
Token ID20950
Feature activation-3.26e-01

Pos logit contributions
amas+0.052
 dishes+0.051
 cloth+0.051
 hurts+0.050
 hands+0.049
Neg logit contributions
 known-0.079
 knowing-0.073
abella-0.072
Fre-0.071
rou-0.070
Token without
Token ID1231
Feature activation-3.13e-01

Pos logit contributions
amas+0.013
 cloth+0.012
 dishes+0.012
 hands+0.012
 hurts+0.012
Neg logit contributions
 known-0.015
 knowing-0.013
abella-0.013
rou-0.013
Fre-0.013
Token her
Token ID607
Feature activation-2.10e+00

Pos logit contributions
 dishes+0.012
 cloth+0.012
 hands+0.012
amas+0.012
 hurts+0.012
Neg logit contributions
 known-0.014
Fre-0.013
abella-0.013
rou-0.012
 knowing-0.012
Token cr
Token ID1067
Feature activation-1.95e+00

Pos logit contributions
amas+0.073
 dishes+0.071
 cloth+0.070
 hurts+0.068
 hands+0.066
Neg logit contributions
 known-0.103
 knowing-0.096
abella-0.092
rou-0.092
Fre-0.091
Tokenay
Token ID323
Feature activation-4.74e+00

Pos logit contributions
 dishes+0.049
 cloth+0.048
 hands+0.045
 cheeks+0.044
 elbows+0.042
Neg logit contributions
rou-0.111
 known-0.110
 knowing-0.103
abella-0.102
ris-0.101
Tokenons
Token ID684
Feature activation-1.42e+00

Pos logit contributions
 cloth+0.121
 dishes+0.120
amas+0.116
 cheeks+0.115
 lap+0.113
Neg logit contributions
 known-0.266
 knowing-0.243
rou-0.241
abella-0.232
joy-0.226
Token.
Token ID13
Feature activation-1.99e-01

Pos logit contributions
amas+0.064
 dishes+0.059
 cheeks+0.058
 hurts+0.057
 lap+0.057
Neg logit contributions
 knowing-0.054
 known-0.051
rou-0.046
 possible-0.045
 realizing-0.043
Token
Token ID198
Feature activation+1.77e-03

Pos logit contributions
amas+0.007
 cloth+0.007
 dishes+0.007
 hurts+0.007
 hands+0.007
Neg logit contributions
 known-0.010
abella-0.009
Fre-0.009
 knowing-0.009
rou-0.008
Token
Token ID198
Feature activation-4.31e-01

Pos logit contributions
 known+0.000
Fre+0.000
 knowing+0.000
abella+0.000
rou+0.000
Neg logit contributions
 cloth-0.000
amas-0.000
 hurts-0.000
 dishes-0.000
 hands-0.000
TokenBut
Token ID1537
Feature activation-5.79e-01

Pos logit contributions
 dishes+0.019
 cloth+0.019
amas+0.019
 hurts+0.018
 hands+0.018
Neg logit contributions
 known-0.017
abella-0.016
Fre-0.015
 knowing-0.015
rou-0.015
Token,
Token ID11
Feature activation-1.23e+00

Pos logit contributions
 dishes+0.058
 cloth+0.057
 hurts+0.056
 hands+0.056
amas+0.055
Neg logit contributions
 known-0.086
Fre-0.078
abella-0.078
 knowing-0.077
rou-0.075
Token this
Token ID428
Feature activation-1.11e+00

Pos logit contributions
amas+0.052
 dishes+0.051
 cloth+0.050
 hurts+0.049
 hands+0.048
Neg logit contributions
 known-0.051
abella-0.047
 knowing-0.047
Fre-0.046
rou-0.045
Token is
Token ID318
Feature activation-7.40e-01

Pos logit contributions
amas+0.041
 dishes+0.039
 cheeks+0.039
 hands+0.037
 elbows+0.037
Neg logit contributions
 known-0.050
abella-0.047
rou-0.045
 knowing-0.045
Fre-0.043
Token my
Token ID616
Feature activation-3.32e-01

Pos logit contributions
 hurts+0.026
 cloth+0.026
 dishes+0.026
 hands+0.025
amas+0.024
Neg logit contributions
 known-0.037
Fre-0.035
abella-0.033
 knowing-0.033
rou-0.033
Token cr
Token ID1067
Feature activation-2.23e+00

Pos logit contributions
amas+0.009
 dishes+0.009
 cloth+0.008
 hurts+0.008
 hands+0.008
Neg logit contributions
 known-0.019
 knowing-0.018
abella-0.017
rou-0.017
Fre-0.017
Tokenay
Token ID323
Feature activation-4.71e+00

Pos logit contributions
 dishes+0.063
 cheeks+0.060
 elbows+0.060
 cloth+0.058
 hands+0.057
Neg logit contributions
rou-0.119
 known-0.113
ris-0.111
abella-0.110
 knowing-0.107
Tokenon
Token ID261
Feature activation-1.88e+00

Pos logit contributions
 cheeks+0.145
amas+0.143
 dishes+0.143
 backs+0.141
 elbows+0.141
Neg logit contributions
 known-0.226
rou-0.216
 knowing-0.210
abella-0.209
ris-0.198
Token.
Token ID13
Feature activation-1.14e+00

Pos logit contributions
 elbows+0.088
amas+0.088
 cloth+0.084
 cheeks+0.084
 sleeves+0.083
Neg logit contributions
 knowing-0.059
rou-0.057
 known-0.054
 indeed-0.054
 possible-0.052
Token I
Token ID314
Feature activation+1.56e+00

Pos logit contributions
 dishes+0.048
amas+0.047
 cloth+0.047
 sheets+0.044
 sleeves+0.044
Neg logit contributions
 known-0.042
abella-0.042
Fre-0.040
 Sof-0.039
 knowing-0.039
Token am
Token ID716
Feature activation+1.69e-01

Pos logit contributions
 known+0.068
abella+0.061
 knowing+0.060
Fre+0.060
rou+0.058
Neg logit contributions
amas-0.063
 dishes-0.063
 cloth-0.062
 hurts-0.060
 hands-0.060
Token using
Token ID1262
Feature activation+6.34e-01

Pos logit contributions
Fre+0.008
 known+0.008
abella+0.008
oph+0.008
Ag+0.007
Neg logit contributions
 hands-0.006
 dishes-0.006
 hurts-0.006
 cloth-0.006
 diaper-0.005
Token song
Token ID3496
Feature activation-1.17e+00

Pos logit contributions
 known+0.050
 knowing+0.046
Fre+0.046
abella+0.046
rou+0.045
Neg logit contributions
amas-0.029
 dishes-0.029
 cloth-0.029
 hurts-0.028
 hands-0.027
Token?"
Token ID1701
Feature activation+2.18e-01

Pos logit contributions
 dishes+0.056
amas+0.056
 cloth+0.055
 cheeks+0.054
 hands+0.053
Neg logit contributions
 knowing-0.041
 known-0.040
abella-0.036
 seeing-0.035
rou-0.034
Token
Token ID198
Feature activation-2.68e-01

Pos logit contributions
 known+0.009
abella+0.009
 knowing+0.009
Fre+0.008
rou+0.008
Neg logit contributions
 dishes-0.009
amas-0.009
 cloth-0.009
 hurts-0.009
 hands-0.008
Token
Token ID198
Feature activation-4.91e-01

Pos logit contributions
 cloth+0.009
amas+0.009
 hands+0.008
 hurts+0.008
 dishes+0.008
Neg logit contributions
 known-0.013
 knowing-0.012
abella-0.012
rou-0.012
Fre-0.012
TokenJ
Token ID41
Feature activation+2.59e-01

Pos logit contributions
 cloth+0.021
 dishes+0.021
amas+0.021
 hurts+0.020
 hands+0.020
Neg logit contributions
 known-0.020
 knowing-0.018
abella-0.018
Fre-0.017
rou-0.017
Tokenos
Token ID418
Feature activation-4.71e+00

Pos logit contributions
 known+0.010
 knowing+0.009
abella+0.009
Fre+0.009
rou+0.008
Neg logit contributions
amas-0.011
 dishes-0.011
 cloth-0.011
 hurts-0.011
 hands-0.011
Tokenie
Token ID494
Feature activation-1.99e+00

Pos logit contributions
amas+0.209
 dishes+0.206
 cloth+0.198
 hands+0.195
 diaper+0.190
Neg logit contributions
 known-0.187
abella-0.170
Fre-0.169
 knowing-0.162
rou-0.160
Token nodded
Token ID14464
Feature activation+1.64e+00

Pos logit contributions
 dishes+0.078
amas+0.077
 cloth+0.077
 hurts+0.074
 hands+0.074
Neg logit contributions
 known-0.088
abella-0.081
Fre-0.080
 knowing-0.079
rou-0.077
Token eagerly
Token ID30130
Feature activation+4.62e-01

Pos logit contributions
 known+0.071
 knowing+0.069
abella+0.062
Fre+0.062
rou+0.060
Neg logit contributions
amas-0.067
 dishes-0.067
 hurts-0.066
 cloth-0.064
 sleeves-0.060
Token.
Token ID13
Feature activation-2.90e-02

Pos logit contributions
 known+0.022
 knowing+0.020
abella+0.020
Fre+0.019
rou+0.019
Neg logit contributions
 dishes-0.017
amas-0.017
 cloth-0.017
 hurts-0.016
 hands-0.015
Token The
Token ID383
Feature activation-2.18e-01

Pos logit contributions
amas+0.001
 cloth+0.001
 hands+0.001
 dishes+0.001
 hurts+0.001
Neg logit contributions
 known-0.001
abella-0.001
 knowing-0.001
Fre-0.001
rou-0.001
Token for
Token ID329
Feature activation-1.15e+00

Pos logit contributions
 cloth+0.068
amas+0.068
 hands+0.068
 dishes+0.068
 cheeks+0.067
Neg logit contributions
 known-0.069
 knowing-0.063
abella-0.062
Fre-0.061
rou-0.059
Token?"
Token ID1701
Feature activation-3.28e-01

Pos logit contributions
 cloth+0.054
bars+0.053
amas+0.053
 cheeks+0.053
 hurts+0.051
Neg logit contributions
 known-0.039
 knowing-0.035
 seeing-0.035
abella-0.035
 discovering-0.035
Token
Token ID198
Feature activation-5.40e-01

Pos logit contributions
 dishes+0.014
amas+0.014
 cloth+0.013
 hurts+0.013
 cheeks+0.013
Neg logit contributions
 known-0.014
abella-0.013
Fre-0.012
 knowing-0.012
rou-0.012
Token
Token ID198
Feature activation-6.90e-01

Pos logit contributions
 cloth+0.020
 dishes+0.020
amas+0.020
 hurts+0.020
 hands+0.020
Neg logit contributions
 known-0.025
 knowing-0.022
abella-0.022
Fre-0.022
rou-0.021
TokenF
Token ID37
Feature activation-6.57e-01

Pos logit contributions
 dishes+0.031
 cloth+0.031
amas+0.030
 hurts+0.030
 hands+0.030
Neg logit contributions
 known-0.027
 knowing-0.024
abella-0.024
Fre-0.024
rou-0.023
Tokenay
Token ID323
Feature activation-4.70e+00

Pos logit contributions
 cloth+0.027
 dishes+0.027
amas+0.027
 hurts+0.026
 hands+0.026
Neg logit contributions
 known-0.028
 knowing-0.025
abella-0.025
Fre-0.025
rou-0.024
Token said
Token ID531
Feature activation+2.97e-01

Pos logit contributions
 cloth+0.173
 hands+0.171
amas+0.170
 hurts+0.167
 cheeks+0.167
Neg logit contributions
 known-0.223
 knowing-0.207
Fre-0.199
rou-0.191
 discovering-0.190
Token,
Token ID11
Feature activation-2.30e-01

Pos logit contributions
 known+0.014
abella+0.013
Fre+0.013
 knowing+0.013
rou+0.013
Neg logit contributions
 dishes-0.011
 cloth-0.011
amas-0.010
 hurts-0.010
 hands-0.010
Token "
Token ID366
Feature activation-9.01e-01

Pos logit contributions
amas+0.010
 cloth+0.010
 dishes+0.010
 hands+0.010
 hurts+0.009
Neg logit contributions
 known-0.009
abella-0.008
Fre-0.008
 knowing-0.008
rou-0.008
TokenIt
Token ID1026
Feature activation-1.82e-01

Pos logit contributions
 dishes+0.040
amas+0.039
 cloth+0.038
 hurts+0.038
 hands+0.038
Neg logit contributions
 known-0.037
Fre-0.034
abella-0.033
 knowing-0.032
rou-0.032
Token is
Token ID318
Feature activation-8.01e-01

Pos logit contributions
amas+0.006
 dishes+0.006
 cloth+0.006
 cheeks+0.006
 diaper+0.006
Neg logit contributions
 known-0.009
abella-0.008
rou-0.008
Fre-0.008
 knowing-0.008
Token and
Token ID290
Feature activation-1.38e-01

Pos logit contributions
amas+0.007
 dishes+0.007
 cloth+0.007
 hurts+0.006
 hands+0.006
Neg logit contributions
 known-0.006
 knowing-0.006
abella-0.006
Fre-0.006
rou-0.005
Token he
Token ID339
Feature activation-8.41e-01

Pos logit contributions
amas+0.006
 dishes+0.006
 cloth+0.006
 hurts+0.006
 diaper+0.006
Neg logit contributions
 known-0.005
 knowing-0.005
abella-0.005
Fre-0.004
rou-0.004
Token could
Token ID714
Feature activation-6.05e-01

Pos logit contributions
 dishes+0.040
amas+0.040
 cloth+0.039
 hurts+0.037
 diaper+0.037
Neg logit contributions
 known-0.032
abella-0.028
 knowing-0.027
Fre-0.026
rou-0.026
Token feel
Token ID1254
Feature activation-1.53e-01

Pos logit contributions
 dishes+0.030
amas+0.030
 cloth+0.028
 hands+0.028
 hurts+0.027
Neg logit contributions
 known-0.022
abella-0.019
 knowing-0.019
Fre-0.019
rou-0.017
Token his
Token ID465
Feature activation-1.94e+00

Pos logit contributions
amas+0.007
 dishes+0.007
 cheeks+0.006
 cloth+0.006
 hurts+0.006
Neg logit contributions
 known-0.006
 knowing-0.006
abella-0.005
rou-0.005
Fre-0.005
Token toes
Token ID23932
Feature activation-4.68e+00

Pos logit contributions
amas+0.057
iren+0.053
 hurts+0.051
 dishes+0.050
chair+0.047
Neg logit contributions
 known-0.094
 knowing-0.093
joy-0.091
rou-0.088
 discovering-0.084
Token getting
Token ID1972
Feature activation-1.83e-01

Pos logit contributions
amas+0.196
 dishes+0.183
 hands+0.176
 cheeks+0.176
 diaper+0.174
Neg logit contributions
 knowing-0.183
rou-0.181
 known-0.178
 possible-0.163
 exactly-0.162
Token crushed
Token ID18577
Feature activation+1.45e+00

Pos logit contributions
amas+0.008
 dishes+0.007
 hurts+0.007
 cloth+0.007
 cheeks+0.007
Neg logit contributions
 known-0.008
 knowing-0.007
abella-0.007
rou-0.007
Fre-0.007
Token.
Token ID13
Feature activation-2.20e-01

Pos logit contributions
 known+0.063
 knowing+0.061
abella+0.058
rou+0.058
Fre+0.057
Neg logit contributions
amas-0.056
 hurts-0.054
 cloth-0.054
 dishes-0.054
 cheeks-0.052
Token
Token ID198
Feature activation+2.77e-01

Pos logit contributions
 dishes+0.009
 cloth+0.009
amas+0.009
 hurts+0.009
 cheeks+0.008
Neg logit contributions
 known-0.010
abella-0.009
 knowing-0.009
Fre-0.008
rou-0.008
Token
Token ID198
Feature activation-1.55e-01

Pos logit contributions
 known+0.013
 knowing+0.012
Fre+0.012
rou+0.012
abella+0.012
Neg logit contributions
amas-0.009
 cloth-0.009
 dishes-0.009
 hurts-0.009
 hands-0.009
Token then
Token ID788
Feature activation-3.65e-01

Pos logit contributions
 hands+0.012
 cloth+0.012
 hurts+0.011
 cheeks+0.011
 dishes+0.011
Neg logit contributions
 known-0.020
Fre-0.019
rou-0.019
abella-0.018
Ag-0.018
Token he
Token ID339
Feature activation-1.23e+00

Pos logit contributions
 cloth+0.011
 hands+0.010
 dishes+0.010
 hurts+0.010
 cheeks+0.010
Neg logit contributions
 known-0.019
Fre-0.019
rou-0.018
abella-0.018
 knowing-0.018
Token remembered
Token ID12086
Feature activation+7.51e-01

Pos logit contributions
 cloth+0.048
 hands+0.048
 hurts+0.046
 thumb+0.046
 cheeks+0.045
Neg logit contributions
Fre-0.050
 known-0.049
rou-0.047
 knowing-0.046
abella-0.046
Token his
Token ID465
Feature activation-1.82e+00

Pos logit contributions
 known+0.035
Fre+0.034
abella+0.033
rou+0.032
Ag+0.031
Neg logit contributions
 cloth-0.027
 dishes-0.026
 hands-0.026
 hurts-0.025
 cheeks-0.024
Token cr
Token ID1067
Feature activation-2.16e+00

Pos logit contributions
amas+0.068
 dishes+0.063
 hurts+0.061
bars+0.060
 cloth+0.060
Neg logit contributions
 known-0.082
 knowing-0.077
rou-0.073
abella-0.071
joy-0.071
Tokenay
Token ID323
Feature activation-4.66e+00

Pos logit contributions
 dishes+0.064
 cloth+0.059
 cheeks+0.058
 hands+0.056
 coats+0.055
Neg logit contributions
rou-0.119
 known-0.114
ris-0.107
 knowing-0.107
abella-0.105
Tokenon
Token ID261
Feature activation-1.89e+00

Pos logit contributions
 cheeks+0.159
 backs+0.155
 dishes+0.155
 diaper+0.152
amas+0.151
Neg logit contributions
 known-0.227
rou-0.210
 knowing-0.209
abella-0.195
 discovering-0.192
Token.
Token ID13
Feature activation+6.51e-02

Pos logit contributions
amas+0.089
 dishes+0.083
 elbows+0.081
 cheeks+0.081
 diaper+0.081
Neg logit contributions
 knowing-0.072
 known-0.068
rou-0.063
 exactly-0.063
 possible-0.057
Token He
Token ID679
Feature activation-5.90e-02

Pos logit contributions
 known+0.003
abella+0.003
 knowing+0.003
Fre+0.003
rou+0.003
Neg logit contributions
 dishes-0.002
 cloth-0.002
amas-0.002
 hurts-0.002
 cheeks-0.002
Token quickly
Token ID2952
Feature activation-1.03e+00

Pos logit contributions
 cloth+0.002
 hands+0.002
 hurts+0.002
 cheeks+0.002
 dishes+0.002
Neg logit contributions
 known-0.003
Fre-0.003
 knowing-0.003
rou-0.003
abella-0.003
Token got
Token ID1392
Feature activation+9.07e-02

Pos logit contributions
amas+0.041
 dishes+0.041
 cloth+0.040
 hurts+0.039
 hands+0.038
Neg logit contributions
 known-0.046
abella-0.041
 knowing-0.041
Fre-0.040
rou-0.039
Token...
Token ID986
Feature activation+9.10e-02

Pos logit contributions
 known+0.022
abella+0.020
Fre+0.019
 knowing+0.019
rou+0.019
Neg logit contributions
amas-0.021
 dishes-0.020
 cloth-0.020
 cheeks-0.019
 lap-0.019
Tokenare
Token ID533
Feature activation+6.78e-02

Pos logit contributions
 known+0.004
abella+0.003
Fre+0.003
 knowing+0.003
rou+0.003
Neg logit contributions
 cloth-0.004
 dishes-0.004
amas-0.004
 hurts-0.004
 hands-0.004
Token...
Token ID986
Feature activation+5.29e-02

Pos logit contributions
 known+0.003
abella+0.003
 knowing+0.003
Fre+0.003
rou+0.002
Neg logit contributions
 dishes-0.003
 cloth-0.003
amas-0.003
 hurts-0.003
 hands-0.003
Tokenwell
Token ID4053
Feature activation-1.83e+00

Pos logit contributions
 known+0.002
abella+0.002
Fre+0.002
 knowing+0.002
rou+0.002
Neg logit contributions
 cloth-0.002
 dishes-0.002
amas-0.002
 hurts-0.002
 cheeks-0.002
Token...
Token ID986
Feature activation+8.38e-02

Pos logit contributions
amas+0.079
 cloth+0.079
 dishes+0.078
 hurts+0.076
 hands+0.075
Neg logit contributions
 known-0.074
abella-0.067
 knowing-0.066
Fre-0.065
rou-0.063
Tokenand
Token ID392
Feature activation-4.66e+00

Pos logit contributions
 known+0.003
abella+0.003
Fre+0.003
 knowing+0.003
rou+0.003
Neg logit contributions
 dishes-0.004
 cloth-0.004
amas-0.004
 hurts-0.004
 cheeks-0.004
Token...
Token ID986
Feature activation+2.86e-01

Pos logit contributions
amas+0.190
 dishes+0.189
 cloth+0.187
 hurts+0.182
 hands+0.178
Neg logit contributions
 known-0.201
abella-0.184
 knowing-0.180
Fre-0.180
rou-0.173
Tokenhappy
Token ID34191
Feature activation-8.51e-01

Pos logit contributions
 known+0.012
abella+0.010
Fre+0.010
 knowing+0.010
rou+0.010
Neg logit contributions
 dishes-0.013
 cloth-0.013
amas-0.012
 hurts-0.012
 cheeks-0.012
Token.
Token ID13
Feature activation-5.13e-01

Pos logit contributions
 cloth+0.035
 dishes+0.035
 hurts+0.034
amas+0.034
 cheeks+0.033
Neg logit contributions
 known-0.034
 knowing-0.032
Fre-0.032
abella-0.031
rou-0.029
Token I
Token ID314
Feature activation+1.47e+00

Pos logit contributions
amas+0.018
 dishes+0.018
 cloth+0.018
 hurts+0.017
 hands+0.017
Neg logit contributions
 known-0.025
abella-0.023
 knowing-0.023
Fre-0.023
rou-0.022
Token...
Token ID986
Feature activation+1.31e-01

Pos logit contributions
 known+0.065
abella+0.057
Fre+0.056
 knowing+0.056
rou+0.054
Neg logit contributions
amas-0.061
 dishes-0.061
 cloth-0.059
 hands-0.058
 cheeks-0.057
Token she
Token ID673
Feature activation-4.87e-02

Pos logit contributions
 known+0.009
abella+0.009
 knowing+0.008
Fre+0.008
rou+0.008
Neg logit contributions
amas-0.010
 dishes-0.010
 hurts-0.009
 cheeks-0.009
 cloth-0.009
Token couldn
Token ID3521
Feature activation-3.96e-01

Pos logit contributions
amas+0.002
 dishes+0.002
 cloth+0.002
 cheeks+0.002
 hands+0.002
Neg logit contributions
 known-0.002
abella-0.002
rou-0.002
 knowing-0.001
Fre-0.001
Token't
Token ID470
Feature activation+9.19e-01

Pos logit contributions
 dishes+0.015
 cheeks+0.015
 hands+0.015
 cloth+0.015
amas+0.014
Neg logit contributions
 known-0.017
abella-0.017
Fre-0.016
Ag-0.015
rou-0.015
Token feel
Token ID1254
Feature activation-3.50e-01

Pos logit contributions
 known+0.041
 knowing+0.038
Fre+0.037
abella+0.037
 seeing+0.035
Neg logit contributions
amas-0.037
 dishes-0.036
 hurts-0.034
 cloth-0.033
 hands-0.033
Token her
Token ID607
Feature activation-1.76e+00

Pos logit contributions
amas+0.016
 dishes+0.015
 cheeks+0.015
 cloth+0.015
 hurts+0.015
Neg logit contributions
 known-0.014
 knowing-0.013
abella-0.012
rou-0.012
Fre-0.012
Token toes
Token ID23932
Feature activation-4.65e+00

Pos logit contributions
iren+0.060
amas+0.052
hing+0.051
ons+0.049
bars+0.048
Neg logit contributions
 knowing-0.082
rou-0.075
joy-0.073
 known-0.072
 possible-0.070
Token.
Token ID13
Feature activation-4.33e-01

Pos logit contributions
amas+0.206
 lap+0.190
 cheeks+0.187
 hands+0.184
 dishes+0.184
Neg logit contributions
 knowing-0.173
rou-0.169
 known-0.156
 possible-0.155
 exactly-0.145
Token She
Token ID1375
Feature activation+1.62e-01

Pos logit contributions
 dishes+0.015
 cloth+0.014
 hurts+0.014
amas+0.013
 hands+0.013
Neg logit contributions
 known-0.021
abella-0.020
 knowing-0.020
Fre-0.019
rou-0.019
Token hopped
Token ID45230
Feature activation+4.92e-01

Pos logit contributions
 known+0.008
abella+0.007
Fre+0.007
 knowing+0.007
rou+0.007
Neg logit contributions
 dishes-0.006
 cloth-0.006
amas-0.006
 hurts-0.006
 hands-0.006
Token and
Token ID290
Feature activation-3.71e-01

Pos logit contributions
 known+0.027
abella+0.025
Fre+0.025
rou+0.024
 knowing+0.024
Neg logit contributions
 cloth-0.015
 dishes-0.014
amas-0.014
 hands-0.014
 hurts-0.014
Token skipped
Token ID26684
Feature activation+7.26e-01

Pos logit contributions
amas+0.018
 dishes+0.017
 hurts+0.016
 diaper+0.016
 cloth+0.015
Neg logit contributions
 known-0.014
abella-0.013
 knowing-0.013
Fre-0.012
Ag-0.012
Token?"
Token ID1701
Feature activation-2.49e-01

Pos logit contributions
 known+0.029
abella+0.027
Fre+0.027
 knowing+0.026
rou+0.026
Neg logit contributions
 dishes-0.019
 cloth-0.019
 hurts-0.019
amas-0.019
 hands-0.018
Token
Token ID198
Feature activation-4.43e-01

Pos logit contributions
 dishes+0.010
amas+0.010
 cloth+0.010
 hurts+0.010
 hands+0.009
Neg logit contributions
 known-0.011
abella-0.010
Fre-0.010
 knowing-0.010
joy-0.009
Token
Token ID198
Feature activation-6.02e-01

Pos logit contributions
 dishes+0.018
 cloth+0.017
amas+0.017
 hurts+0.017
 hands+0.017
Neg logit contributions
 known-0.020
abella-0.018
Fre-0.018
 knowing-0.018
rou-0.017
TokenHop
Token ID23483
Feature activation+4.61e-01

Pos logit contributions
 dishes+0.031
amas+0.030
 cloth+0.030
 hurts+0.029
 hands+0.029
Neg logit contributions
 known-0.020
Fre-0.018
abella-0.018
 knowing-0.017
rou-0.016
Tokensc
Token ID1416
Feature activation-1.30e+00

Pos logit contributions
 known+0.021
abella+0.019
 knowing+0.019
Fre+0.019
rou+0.019
Neg logit contributions
 cloth-0.017
 dishes-0.017
 hurts-0.017
amas-0.017
 hands-0.016
Tokenot
Token ID313
Feature activation-4.63e+00

Pos logit contributions
 dishes+0.049
amas+0.049
 cloth+0.048
 hurts+0.045
 sheets+0.044
Neg logit contributions
 known-0.056
abella-0.055
Fre-0.054
rou-0.052
 knowing-0.052
Tokench
Token ID354
Feature activation-2.33e+00

Pos logit contributions
 dishes+0.187
 cloth+0.185
amas+0.185
 hurts+0.179
 hands+0.178
Neg logit contributions
 known-0.200
abella-0.184
 knowing-0.181
Fre-0.180
rou-0.174
Token replied
Token ID8712
Feature activation+6.31e-01

Pos logit contributions
 dishes+0.090
amas+0.089
 cloth+0.088
 hurts+0.087
 hands+0.082
Neg logit contributions
 known-0.106
abella-0.097
 knowing-0.096
Fre-0.094
rou-0.094
Token,
Token ID11
Feature activation-3.32e-02

Pos logit contributions
 known+0.030
abella+0.028
Fre+0.028
 knowing+0.027
rou+0.027
Neg logit contributions
 dishes-0.022
 cloth-0.022
amas-0.022
 hurts-0.022
 hands-0.021
Token "
Token ID366
Feature activation-7.74e-01

Pos logit contributions
 cloth+0.001
amas+0.001
 dishes+0.001
 hands+0.001
 hurts+0.001
Neg logit contributions
 known-0.001
abella-0.001
Fre-0.001
rou-0.001
 knowing-0.001
TokenMy
Token ID3666
Feature activation-6.22e-02

Pos logit contributions
 dishes+0.033
amas+0.033
 cloth+0.032
 hands+0.031
 hurts+0.031
Neg logit contributions
 known-0.033
Fre-0.030
abella-0.030
 knowing-0.029
rou-0.029
Tokenactus
Token ID34144
Feature activation+1.48e+00

Pos logit contributions
 known+0.085
abella+0.077
Fre+0.076
 knowing+0.076
rou+0.073
Neg logit contributions
amas-0.068
 dishes-0.068
 cloth-0.068
 hurts-0.066
 hands-0.064
Token was
Token ID373
Feature activation-5.82e-02

Pos logit contributions
 known+0.058
 knowing+0.055
rou+0.053
abella+0.050
 discovering+0.049
Neg logit contributions
amas-0.064
 dishes-0.063
 cloth-0.062
 hands-0.060
 diaper-0.060
Token starting
Token ID3599
Feature activation+1.35e+00

Pos logit contributions
 dishes+0.003
 cloth+0.003
amas+0.002
 hurts+0.002
 hands+0.002
Neg logit contributions
 known-0.002
abella-0.002
Fre-0.002
 knowing-0.002
rou-0.002
Token to
Token ID284
Feature activation-9.31e-02

Pos logit contributions
 known+0.065
abella+0.059
 knowing+0.059
Fre+0.058
rou+0.057
Neg logit contributions
 dishes-0.048
amas-0.048
 cloth-0.048
 hurts-0.047
 hands-0.046
Token grow
Token ID1663
Feature activation+1.22e+00

Pos logit contributions
 dishes+0.003
 cloth+0.003
amas+0.003
 hurts+0.003
 hands+0.003
Neg logit contributions
 known-0.005
abella-0.005
Fre-0.005
 knowing-0.005
rou-0.005
Token taller
Token ID25242
Feature activation+4.21e+00

Pos logit contributions
 known+0.058
abella+0.054
Fre+0.053
 knowing+0.052
rou+0.050
Neg logit contributions
 dishes-0.044
 cloth-0.044
amas-0.043
 hurts-0.042
 hands-0.041
Token and
Token ID290
Feature activation+3.14e-01

Pos logit contributions
 known+0.226
abella+0.212
Fre+0.204
Ag+0.190
rou+0.190
Neg logit contributions
 dishes-0.134
 cloth-0.131
 hands-0.129
 hurts-0.125
 cheeks-0.122
Token wider
Token ID10595
Feature activation+3.24e+00

Pos logit contributions
 known+0.012
abella+0.012
Fre+0.012
 knowing+0.011
uth+0.011
Neg logit contributions
 cloth-0.014
 hurts-0.013
 dishes-0.013
 hands-0.013
amas-0.012
Token.
Token ID13
Feature activation+2.49e-02

Pos logit contributions
 known+0.166
abella+0.158
Fre+0.153
rou+0.145
 knowing+0.145
Neg logit contributions
 cloth-0.105
 dishes-0.104
 hands-0.103
amas-0.099
 hurts-0.099
Token He
Token ID679
Feature activation+5.13e-01

Pos logit contributions
 known+0.001
abella+0.001
 knowing+0.001
Fre+0.001
rou+0.001
Neg logit contributions
 dishes-0.001
 cloth-0.001
amas-0.001
 hurts-0.001
 hands-0.001
Token couldn
Token ID3521
Feature activation-2.63e-01

Pos logit contributions
 known+0.022
Fre+0.020
abella+0.020
 knowing+0.020
rou+0.019
Neg logit contributions
 cloth-0.021
 hands-0.020
 hurts-0.020
 dishes-0.020
amas-0.019
Token's
Token ID338
Feature activation+1.60e+00

Pos logit contributions
 known+0.076
Fre+0.071
abella+0.071
 knowing+0.068
rou+0.067
Neg logit contributions
 cloth-0.055
 dishes-0.055
 hurts-0.054
amas-0.054
 hands-0.053
Token plant
Token ID4618
Feature activation+1.95e+00

Pos logit contributions
 known+0.076
 knowing+0.072
rou+0.071
abella+0.068
Fre+0.067
Neg logit contributions
amas-0.056
 hurts-0.052
 dishes-0.052
 cloth-0.051
 cheeks-0.048
Token grew
Token ID6348
Feature activation+4.73e-01

Pos logit contributions
 known+0.068
 knowing+0.063
rou+0.061
 exactly+0.054
abella+0.054
Neg logit contributions
amas-0.090
 dishes-0.082
 cheeks-0.081
 lap-0.080
 elbows-0.079
Token taller
Token ID25242
Feature activation+4.66e+00

Pos logit contributions
 known+0.022
abella+0.020
Fre+0.020
 knowing+0.020
rou+0.019
Neg logit contributions
 dishes-0.018
 cloth-0.018
amas-0.017
 hurts-0.017
 hands-0.017
Token and
Token ID290
Feature activation+2.43e-01

Pos logit contributions
 known+0.231
abella+0.213
Fre+0.207
 knowing+0.204
rou+0.200
Neg logit contributions
 dishes-0.163
 cloth-0.161
amas-0.158
 hurts-0.157
 hands-0.156
Token taller
Token ID25242
Feature activation+4.25e+00

Pos logit contributions
 known+0.010
Fre+0.010
abella+0.010
 knowing+0.010
uth+0.009
Neg logit contributions
 hurts-0.010
 cloth-0.010
 dishes-0.009
 hands-0.009
 blades-0.009
Token.
Token ID13
Feature activation+1.73e-01

Pos logit contributions
 known+0.209
abella+0.192
 knowing+0.191
Fre+0.190
rou+0.185
Neg logit contributions
 dishes-0.146
amas-0.145
 cloth-0.145
 hurts-0.140
 hands-0.138
Token It
Token ID632
Feature activation-2.40e-03

Pos logit contributions
 known+0.010
abella+0.009
Fre+0.009
 knowing+0.009
rou+0.009
Neg logit contributions
amas-0.005
 cloth-0.005
 dishes-0.005
 hurts-0.005
 hands-0.005
Token was
Token ID373
Feature activation-4.68e-01

Pos logit contributions
amas+0.000
 dishes+0.000
 cloth+0.000
 hands+0.000
 hurts+0.000
Neg logit contributions
 known-0.000
abella-0.000
Fre-0.000
 knowing-0.000
rou-0.000
Token so
Token ID523
Feature activation-7.07e-01

Pos logit contributions
 cloth+0.015
 dishes+0.014
 hands+0.014
 hurts+0.014
 milk+0.013
Neg logit contributions
abella-0.023
Fre-0.023
 known-0.022
joy-0.022
Ag-0.022
Token big
Token ID1263
Feature activation-5.08e-01

Pos logit contributions
 cloth+0.025
 dishes+0.024
 hurts+0.024
 hands+0.024
amas+0.023
Neg logit contributions
 known-0.034
abella-0.033
 knowing-0.031
Fre-0.031
joy-0.030
Token had
Token ID550
Feature activation+2.18e+00

Pos logit contributions
 known+0.036
Fre+0.033
abella+0.033
 knowing+0.032
rou+0.031
Neg logit contributions
 cloth-0.036
amas-0.036
 dishes-0.036
 hurts-0.036
 hands-0.035
Token found
Token ID1043
Feature activation+4.21e-01

Pos logit contributions
 known+0.083
Fre+0.077
abella+0.076
 knowing+0.074
rou+0.073
Neg logit contributions
 cloth-0.097
 dishes-0.096
amas-0.095
 hands-0.094
 hurts-0.094
Token the
Token ID262
Feature activation+3.24e-01

Pos logit contributions
 known+0.018
abella+0.016
 knowing+0.016
Fre+0.016
rou+0.016
Neg logit contributions
amas-0.017
 dishes-0.017
 cloth-0.017
 hurts-0.017
 cheeks-0.016
Token perfect
Token ID2818
Feature activation+5.25e-01

Pos logit contributions
 known+0.015
 knowing+0.013
abella+0.013
Fre+0.013
rou+0.013
Neg logit contributions
amas-0.012
 dishes-0.012
 cloth-0.012
 hurts-0.012
 hands-0.012
Token fancy
Token ID14996
Feature activation+5.39e-01

Pos logit contributions
 known+0.027
 knowing+0.025
abella+0.024
Fre+0.024
rou+0.023
Neg logit contributions
amas-0.017
 dishes-0.016
 hurts-0.016
 cloth-0.016
 hands-0.015
Token l
Token ID300
Feature activation+4.53e+00

Pos logit contributions
 known+0.027
 knowing+0.025
abella+0.024
Fre+0.024
rou+0.024
Neg logit contributions
amas-0.018
 hurts-0.017
 dishes-0.017
 cloth-0.016
 hands-0.016
Tokenoll
Token ID692
Feature activation+2.25e+00

Pos logit contributions
 known+0.216
 knowing+0.188
Fre+0.185
abella+0.183
 discovering+0.174
Neg logit contributions
amas-0.160
 cloth-0.158
 hands-0.157
 dishes-0.155
 hurts-0.152
Tokenipop
Token ID42800
Feature activation+1.75e+00

Pos logit contributions
 known+0.107
abella+0.101
 knowing+0.100
Fre+0.098
rou+0.097
Neg logit contributions
amas-0.078
 dishes-0.078
 cloth-0.077
 hurts-0.075
 hands-0.074
Token.
Token ID13
Feature activation+5.98e-01

Pos logit contributions
 known+0.072
 knowing+0.068
abella+0.066
Fre+0.064
rou+0.063
Neg logit contributions
 dishes-0.072
amas-0.072
 cloth-0.071
 hurts-0.070
 hands-0.068
Token The
Token ID383
Feature activation-9.53e-02

Pos logit contributions
 known+0.033
rou+0.031
abella+0.030
 knowing+0.030
Fre+0.030
Neg logit contributions
 hands-0.017
amas-0.016
 cloth-0.016
 dishes-0.015
 hurts-0.015
Token End
Token ID5268
Feature activation+1.18e+00

Pos logit contributions
 cloth+0.004
 dishes+0.004
 hands+0.004
 hurts+0.004
 cheeks+0.004
Neg logit contributions
 known-0.004
abella-0.004
Fre-0.004
 knowing-0.004
rou-0.004
Token quickly
Token ID2952
Feature activation-6.36e-01

Pos logit contributions
 known+0.034
 knowing+0.031
Fre+0.031
abella+0.031
rou+0.031
Neg logit contributions
 cloth-0.024
 hands-0.023
 hurts-0.022
 dishes-0.022
 cheeks-0.021
Token sw
Token ID1509
Feature activation+9.66e-01

Pos logit contributions
 cloth+0.023
 dishes+0.022
 hands+0.022
 hurts+0.022
 cheeks+0.022
Neg logit contributions
 known-0.030
abella-0.028
Fre-0.028
rou-0.027
 knowing-0.027
Tokenam
Token ID321
Feature activation+4.95e-01

Pos logit contributions
 known+0.053
abella+0.049
 knowing+0.049
Fre+0.049
rou+0.047
Neg logit contributions
 dishes-0.028
amas-0.028
 cloth-0.027
 hands-0.027
 hurts-0.027
Token to
Token ID284
Feature activation+6.26e-01

Pos logit contributions
 known+0.027
abella+0.026
Fre+0.025
rou+0.025
 knowing+0.025
Neg logit contributions
 hands-0.014
 cloth-0.013
 dishes-0.013
 hurts-0.013
amas-0.012
Token the
Token ID262
Feature activation+4.67e-01

Pos logit contributions
 known+0.032
abella+0.032
rou+0.031
 knowing+0.030
Fre+0.030
Neg logit contributions
 cloth-0.019
 dishes-0.019
 hands-0.019
 hurts-0.017
amas-0.017
Token tort
Token ID7619
Feature activation+4.37e+00

Pos logit contributions
 known+0.021
abella+0.020
 knowing+0.019
rou+0.019
Fre+0.018
Neg logit contributions
 dishes-0.019
 cloth-0.018
 hurts-0.018
amas-0.018
 hands-0.017
Tokenoise
Token ID25678
Feature activation+1.17e+00

Pos logit contributions
abella+0.173
 known+0.173
 knowing+0.172
rou+0.156
Fre+0.151
Neg logit contributions
 dishes-0.173
amas-0.171
 hands-0.167
 cloth-0.163
el-0.162
Token,
Token ID11
Feature activation+7.72e-01

Pos logit contributions
 known+0.052
 knowing+0.048
abella+0.047
Fre+0.046
rou+0.046
Neg logit contributions
amas-0.046
 dishes-0.045
 hurts-0.045
 cloth-0.044
 cheeks-0.042
Token and
Token ID290
Feature activation+2.04e-01

Pos logit contributions
 known+0.034
abella+0.032
Fre+0.031
 knowing+0.030
rou+0.030
Neg logit contributions
 cloth-0.030
 dishes-0.030
 hands-0.029
amas-0.029
 hurts-0.029
Token asked
Token ID1965
Feature activation+9.63e-01

Pos logit contributions
 known+0.009
Fre+0.009
abella+0.009
rou+0.009
 knowing+0.008
Neg logit contributions
 cloth-0.007
 hands-0.007
 dishes-0.007
 hurts-0.007
amas-0.007
Token if
Token ID611
Feature activation-9.14e-01

Pos logit contributions
 known+0.047
rou+0.047
Fre+0.045
abella+0.044
Ag+0.044
Neg logit contributions
 dishes-0.032
 hands-0.032
 cloth-0.030
 cheeks-0.027
 hurts-0.027
Token chasing
Token ID20023
Feature activation+6.56e-01

Pos logit contributions
Fre+0.024
abella+0.023
 known+0.023
uth+0.022
Wal+0.021
Neg logit contributions
 hands-0.024
 dishes-0.023
 cloth-0.023
 cheeks-0.023
 hurts-0.022
Token each
Token ID1123
Feature activation+6.21e-01

Pos logit contributions
 known+0.032
abella+0.029
Fre+0.029
 knowing+0.029
rou+0.028
Neg logit contributions
 dishes-0.023
 cloth-0.023
 hands-0.023
amas-0.022
 hurts-0.022
Token other
Token ID584
Feature activation+6.48e-01

Pos logit contributions
 known+0.031
abella+0.030
Fre+0.029
 knowing+0.029
rou+0.029
Neg logit contributions
 dishes-0.021
 cloth-0.021
 hands-0.021
 hurts-0.020
amas-0.019
Token around
Token ID1088
Feature activation-4.39e-01

Pos logit contributions
 known+0.030
 knowing+0.028
abella+0.028
Fre+0.027
rou+0.027
Neg logit contributions
amas-0.024
 dishes-0.023
 cloth-0.023
 hurts-0.023
 cheeks-0.022
Token the
Token ID262
Feature activation+1.54e+00

Pos logit contributions
 dishes+0.016
 cloth+0.016
amas+0.015
 hurts+0.015
 hands+0.015
Neg logit contributions
 known-0.021
abella-0.019
Fre-0.019
 knowing-0.019
rou-0.019
Token golf
Token ID13126
Feature activation+4.46e+00

Pos logit contributions
abella+0.074
 known+0.074
Fre+0.070
 knowing+0.065
uth+0.063
Neg logit contributions
 cloth-0.060
 dishes-0.059
 hands-0.058
amas-0.056
 hurts-0.056
Token course
Token ID1781
Feature activation+1.66e+00

Pos logit contributions
 known+0.225
abella+0.212
Fre+0.210
 knowing+0.206
rou+0.200
Neg logit contributions
 dishes-0.148
 cloth-0.148
amas-0.147
 hurts-0.141
 hands-0.140
Token.
Token ID13
Feature activation+3.11e-01

Pos logit contributions
 known+0.084
abella+0.078
Fre+0.077
 knowing+0.077
rou+0.075
Neg logit contributions
 dishes-0.054
 cloth-0.054
amas-0.053
 hurts-0.052
 hands-0.051
Token
Token ID198
Feature activation+7.69e-01

Pos logit contributions
 known+0.017
rou+0.015
Fre+0.015
 knowing+0.015
abella+0.015
Neg logit contributions
 hands-0.009
amas-0.009
 lap-0.008
 dishes-0.008
 cloth-0.008
Token
Token ID198
Feature activation+2.52e-01

Pos logit contributions
 known+0.039
 knowing+0.035
rou+0.035
Fre+0.035
joy+0.034
Neg logit contributions
 cloth-0.024
amas-0.024
 dishes-0.023
 hands-0.023
 hurts-0.023
TokenThe
Token ID464
Feature activation+1.03e-01

Pos logit contributions
 known+0.011
joy+0.011
 knowing+0.011
rou+0.011
abella+0.010
Neg logit contributions
 cloth-0.008
amas-0.008
 dishes-0.008
 hurts-0.008
apy-0.007
Token price
Token ID2756
Feature activation+2.30e+00

Pos logit contributions
 known+0.028
 knowing+0.026
rou+0.025
abella+0.025
joy+0.025
Neg logit contributions
amas-0.013
 cheeks-0.012
 dishes-0.011
 hands-0.011
 cloth-0.011
Token.
Token ID13
Feature activation-2.70e-01

Pos logit contributions
 known+0.105
 knowing+0.098
abella+0.098
rou+0.092
Fre+0.090
Neg logit contributions
amas-0.086
 cloth-0.083
 dishes-0.083
 hurts-0.080
 cheeks-0.079
Token It
Token ID632
Feature activation+2.81e-01

Pos logit contributions
 dishes+0.011
 sleeves+0.011
amas+0.010
 cheeks+0.010
 sheets+0.010
Neg logit contributions
 known-0.011
abella-0.010
 knowing-0.010
 Sof-0.010
joy-0.009
Token is
Token ID318
Feature activation+1.81e-02

Pos logit contributions
 known+0.014
abella+0.013
Fre+0.012
 knowing+0.012
rou+0.012
Neg logit contributions
amas-0.010
 dishes-0.010
 cloth-0.010
 hands-0.009
 cheeks-0.009
Token not
Token ID407
Feature activation+1.08e+00

Pos logit contributions
 known+0.001
Fre+0.001
abella+0.001
 knowing+0.001
rou+0.001
Neg logit contributions
 cloth-0.001
 dishes-0.001
 hands-0.001
 hurts-0.001
amas-0.001
Token too
Token ID1165
Feature activation+1.10e+00

Pos logit contributions
 known+0.047
 knowing+0.043
abella+0.042
Fre+0.040
rou+0.040
Neg logit contributions
amas-0.044
 dishes-0.043
 cloth-0.043
 hurts-0.042
 hands-0.041
Token much
Token ID881
Feature activation-7.85e-01

Pos logit contributions
 known+0.049
 knowing+0.045
abella+0.044
rou+0.044
Fre+0.043
Neg logit contributions
amas-0.044
 dishes-0.043
 cloth-0.042
 cheeks-0.042
 hands-0.041
Token.
Token ID13
Feature activation-5.37e-01

Pos logit contributions
amas+0.030
 dishes+0.028
 cloth+0.028
 hurts+0.027
 hands+0.027
Neg logit contributions
 known-0.035
 knowing-0.034
abella-0.032
rou-0.031
Fre-0.030
Token Mom
Token ID11254
Feature activation-8.58e-01

Pos logit contributions
 dishes+0.020
 sleeves+0.019
 cheeks+0.019
 cloth+0.019
amas+0.019
Neg logit contributions
 known-0.024
abella-0.023
 knowing-0.022
Fre-0.021
joy-0.021
Token says
Token ID1139
Feature activation+1.09e+00

Pos logit contributions
amas+0.032
 dishes+0.031
 cloth+0.031
 hands+0.030
 cheeks+0.030
Neg logit contributions
 known-0.040
abella-0.037
 knowing-0.036
rou-0.036
Fre-0.036
Token yes
Token ID3763
Feature activation-1.57e-01

Pos logit contributions
 known+0.047
abella+0.043
 knowing+0.043
Fre+0.042
rou+0.040
Neg logit contributions
amas-0.045
 dishes-0.044
 cloth-0.043
 hurts-0.042
 hands-0.041
Tokenara
Token ID3301
Feature activation+1.62e+00

Pos logit contributions
 known+0.046
 knowing+0.043
abella+0.039
Fre+0.039
 discovering+0.038
Neg logit contributions
amas-0.053
 dishes-0.053
 cloth-0.052
 hurts-0.052
 hands-0.050
Token and
Token ID290
Feature activation-1.10e+00

Pos logit contributions
 known+0.066
 knowing+0.061
abella+0.060
Fre+0.059
rou+0.059
Neg logit contributions
 dishes-0.069
amas-0.067
 cloth-0.066
 hurts-0.064
 diaper-0.063
Token Ben
Token ID3932
Feature activation-4.76e-01

Pos logit contributions
 dishes+0.042
 cloth+0.042
 hands+0.041
amas+0.040
 hurts+0.040
Neg logit contributions
 known-0.050
abella-0.046
 knowing-0.046
Fre-0.045
rou-0.045
Token hugged
Token ID41130
Feature activation-2.28e-01

Pos logit contributions
amas+0.021
 dishes+0.020
 hurts+0.019
 diaper+0.019
 cloth+0.019
Neg logit contributions
 known-0.020
 knowing-0.018
abella-0.017
Fre-0.017
rou-0.016
Token the
Token ID262
Feature activation-3.64e-01

Pos logit contributions
 dishes+0.008
 cloth+0.008
amas+0.008
 hurts+0.008
 hands+0.008
Neg logit contributions
 known-0.011
abella-0.010
 knowing-0.010
Fre-0.010
rou-0.010
Token lady
Token ID10846
Feature activation+2.31e+00

Pos logit contributions
amas+0.015
 dishes+0.015
 hurts+0.015
 cloth+0.015
 hands+0.015
Neg logit contributions
 known-0.015
 knowing-0.014
abella-0.013
Fre-0.013
rou-0.013
Token and
Token ID290
Feature activation-8.22e-01

Pos logit contributions
 known+0.112
abella+0.103
 knowing+0.103
Fre+0.102
rou+0.099
Neg logit contributions
 dishes-0.080
amas-0.080
 cloth-0.079
 hurts-0.077
 hands-0.075
Token said
Token ID531
Feature activation+1.33e+00

Pos logit contributions
 cloth+0.030
 hands+0.030
 dishes+0.029
 hurts+0.028
 cheeks+0.028
Neg logit contributions
 known-0.039
Fre-0.037
abella-0.037
rou-0.036
 knowing-0.035
Token they
Token ID484
Feature activation-6.23e-01

Pos logit contributions
 known+0.060
abella+0.055
 knowing+0.054
Fre+0.054
rou+0.052
Neg logit contributions
 dishes-0.052
amas-0.051
 cloth-0.051
 hurts-0.050
 hands-0.049
Token were
Token ID547
Feature activation-1.44e+00

Pos logit contributions
 cloth+0.021
 dishes+0.021
amas+0.021
 hurts+0.021
 hands+0.020
Neg logit contributions
 known-0.031
abella-0.028
Fre-0.028
 knowing-0.028
rou-0.027
Token sorry
Token ID7926
Feature activation-5.93e-01

Pos logit contributions
 hands+0.041
 dishes+0.040
 cloth+0.037
 hurts+0.036
 knee+0.034
Neg logit contributions
Fre-0.078
abella-0.076
 known-0.076
oph-0.074
rou-0.073
Token do
Token ID466
Feature activation-1.21e+00

Pos logit contributions
 known+0.002
abella+0.002
 knowing+0.002
Fre+0.002
rou+0.002
Neg logit contributions
amas-0.001
 cloth-0.001
 dishes-0.001
 cheeks-0.001
 hurts-0.001
Token you
Token ID345
Feature activation-8.89e-01

Pos logit contributions
amas+0.042
 cloth+0.042
 dishes+0.041
 hurts+0.039
 cheeks+0.039
Neg logit contributions
 known-0.059
abella-0.055
Fre-0.053
 knowing-0.052
rou-0.051
Token value
Token ID1988
Feature activation+3.68e-01

Pos logit contributions
amas+0.035
 dishes+0.034
 cloth+0.033
 hands+0.032
 hurts+0.032
Neg logit contributions
 known-0.041
 knowing-0.036
Fre-0.036
abella-0.035
rou-0.034
Token more
Token ID517
Feature activation+1.18e+00

Pos logit contributions
 known+0.014
abella+0.013
 knowing+0.013
rou+0.013
Fre+0.012
Neg logit contributions
amas-0.016
 cheeks-0.015
 elbows-0.015
 dishes-0.015
 cloth-0.015
Token than
Token ID621
Feature activation+1.24e-01

Pos logit contributions
 known+0.045
 knowing+0.041
abella+0.040
Fre+0.039
rou+0.039
Neg logit contributions
amas-0.053
 dishes-0.053
 cloth-0.052
 hurts-0.051
 hands-0.050
Token jelly
Token ID27644
Feature activation+1.22e+00

Pos logit contributions
 known+0.005
 knowing+0.005
abella+0.005
Fre+0.005
rou+0.005
Neg logit contributions
amas-0.005
 dishes-0.005
 cloth-0.005
 hurts-0.005
 hands-0.005
Token?"
Token ID1701
Feature activation-1.14e-01

Pos logit contributions
 known+0.050
 knowing+0.047
abella+0.047
Fre+0.046
rou+0.045
Neg logit contributions
amas-0.049
 cloth-0.049
 hands-0.048
 dishes-0.047
 cheeks-0.047
Token The
Token ID383
Feature activation-1.14e+00

Pos logit contributions
 dishes+0.005
amas+0.005
 cloth+0.005
 hurts+0.005
 cheeks+0.004
Neg logit contributions
 known-0.005
 knowing-0.004
abella-0.004
Fre-0.004
Wal-0.004
Token bear
Token ID6842
Feature activation-2.90e-01

Pos logit contributions
amas+0.038
 dishes+0.037
 cloth+0.036
 hurts+0.035
 hands+0.035
Neg logit contributions
 known-0.058
abella-0.053
Fre-0.053
 knowing-0.053
rou-0.051
Token was
Token ID373
Feature activation-5.24e-01

Pos logit contributions
 dishes+0.012
amas+0.012
 cloth+0.012
 hurts+0.012
 hands+0.011
Neg logit contributions
 known-0.012
abella-0.011
 knowing-0.011
Fre-0.011
rou-0.010
Token confused
Token ID10416
Feature activation-7.09e-01

Pos logit contributions
 dishes+0.022
 cloth+0.022
amas+0.022
 hurts+0.022
 hands+0.021
Neg logit contributions
 known-0.021
abella-0.019
Fre-0.019
 knowing-0.019
rou-0.018
Token in
Token ID287
Feature activation+1.15e+00

Pos logit contributions
 known+0.066
abella+0.063
Fre+0.061
 knowing+0.060
rou+0.059
Neg logit contributions
 cloth-0.033
 hurts-0.032
 dishes-0.032
amas-0.032
 hands-0.032
Token the
Token ID262
Feature activation+1.41e+00

Pos logit contributions
 known+0.050
 knowing+0.045
Fre+0.044
abella+0.044
rou+0.042
Neg logit contributions
amas-0.047
 dishes-0.046
 hurts-0.045
 cloth-0.045
 cheeks-0.044
Token sky
Token ID6766
Feature activation+1.79e-01

Pos logit contributions
 known+0.057
 knowing+0.052
 ze+0.051
Fre+0.050
abella+0.049
Neg logit contributions
 cheeks-0.053
 dishes-0.052
amas-0.052
 sponge-0.049
 diaper-0.048
Token.
Token ID13
Feature activation+3.91e-01

Pos logit contributions
 known+0.009
abella+0.009
Fre+0.009
 knowing+0.009
rou+0.008
Neg logit contributions
 dishes-0.006
 cloth-0.006
amas-0.006
 hurts-0.005
 hands-0.005
Token The
Token ID383
Feature activation-2.54e-01

Pos logit contributions
 known+0.019
abella+0.017
 knowing+0.017
Fre+0.017
rou+0.016
Neg logit contributions
 dishes-0.014
 cloth-0.014
amas-0.014
 hurts-0.014
 hands-0.014
Token bird
Token ID6512
Feature activation+1.23e+00

Pos logit contributions
 dishes+0.008
 cloth+0.008
amas+0.008
 hurts+0.007
 hands+0.007
Neg logit contributions
 known-0.014
abella-0.013
Fre-0.013
 knowing-0.012
rou-0.012
Token felt
Token ID2936
Feature activation-3.65e-01

Pos logit contributions
 known+0.055
abella+0.051
Fre+0.050
 knowing+0.049
rou+0.048
Neg logit contributions
 cloth-0.047
 dishes-0.047
 hurts-0.046
amas-0.046
 hands-0.045
Token proud
Token ID6613
Feature activation+5.28e-01

Pos logit contributions
 dishes+0.016
 cloth+0.015
amas+0.015
 hurts+0.015
 hands+0.015
Neg logit contributions
 known-0.015
abella-0.014
Fre-0.013
 knowing-0.013
rou-0.013
Token of
Token ID286
Feature activation-8.02e-01

Pos logit contributions
 known+0.030
abella+0.027
Fre+0.027
rou+0.026
uth+0.025
Neg logit contributions
 cloth-0.015
 hands-0.015
 dishes-0.015
 hurts-0.014
amas-0.014
Token its
Token ID663
Feature activation-3.91e-01

Pos logit contributions
 cloth+0.019
 hands+0.019
 hurts+0.018
 dishes+0.018
 cheeks+0.016
Neg logit contributions
abella-0.046
oph-0.046
Fre-0.046
 known-0.045
rou-0.044
Token own
Token ID898
Feature activation+6.92e-02

Pos logit contributions
amas+0.017
bars+0.016
 dishes+0.016
 cloth+0.016
 sleeves+0.015
Neg logit contributions
 known-0.015
 knowing-0.015
Fre-0.013
 discovering-0.013
 seeing-0.013
Token fun
Token ID1257
Feature activation+6.05e-01

Pos logit contributions
 known+0.025
abella+0.023
Fre+0.023
 knowing+0.023
rou+0.023
Neg logit contributions
 cloth-0.029
 dishes-0.029
 hands-0.028
 hurts-0.027
amas-0.027
Token and
Token ID290
Feature activation+3.23e-01

Pos logit contributions
 known+0.035
abella+0.032
Fre+0.032
Wal+0.030
Phill+0.030
Neg logit contributions
 hands-0.018
 cloth-0.018
 dishes-0.017
 hurts-0.016
 cheeks-0.016
Token make
Token ID787
Feature activation-1.79e+00

Pos logit contributions
 known+0.016
 knowing+0.015
abella+0.015
Fre+0.014
rou+0.014
Neg logit contributions
amas-0.011
 dishes-0.011
 cloth-0.011
 hurts-0.010
 hands-0.010
Token friends
Token ID2460
Feature activation+2.23e-01

Pos logit contributions
 dishes+0.063
 cloth+0.063
amas+0.062
 hands+0.060
 hurts+0.060
Neg logit contributions
 known-0.087
abella-0.080
Fre-0.080
 knowing-0.079
rou-0.076
Token.
Token ID13
Feature activation+2.32e-01

Pos logit contributions
 known+0.011
 knowing+0.010
abella+0.010
Fre+0.010
rou+0.010
Neg logit contributions
amas-0.008
 dishes-0.008
 cloth-0.008
 hurts-0.007
 hands-0.007
Token Anna
Token ID11735
Feature activation+1.07e+00

Pos logit contributions
 known+0.011
abella+0.010
 knowing+0.010
Fre+0.010
rou+0.010
Neg logit contributions
 dishes-0.008
 cloth-0.008
amas-0.008
 hurts-0.008
 hands-0.008
Token's
Token ID338
Feature activation+6.81e-01

Pos logit contributions
 known+0.054
abella+0.050
Fre+0.050
 knowing+0.049
rou+0.048
Neg logit contributions
 cloth-0.035
 dishes-0.035
 hurts-0.035
 hands-0.034
amas-0.034
Token mom
Token ID1995
Feature activation-4.33e-01

Pos logit contributions
 known+0.033
 knowing+0.033
abella+0.031
rou+0.030
joy+0.030
Neg logit contributions
amas-0.024
 dishes-0.022
 hurts-0.022
 cloth-0.022
 diaper-0.020
Token and
Token ID290
Feature activation-1.25e+00

Pos logit contributions
amas+0.016
 dishes+0.016
 cloth+0.016
 hurts+0.015
 hands+0.015
Neg logit contributions
 known-0.020
abella-0.018
 knowing-0.018
Fre-0.018
rou-0.018
Token dad
Token ID9955
Feature activation-6.16e-01

Pos logit contributions
amas+0.049
 dishes+0.046
 cloth+0.046
 hurts+0.045
 cheeks+0.044
Neg logit contributions
 known-0.060
 knowing-0.055
abella-0.053
Fre-0.052
rou-0.050
Token watch
Token ID2342
Feature activation-1.23e+00

Pos logit contributions
amas+0.021
 dishes+0.021
 cloth+0.020
 hurts+0.020
 hands+0.019
Neg logit contributions
 known-0.031
 knowing-0.028
abella-0.028
Fre-0.028
rou-0.027
Token He
Token ID679
Feature activation+2.40e-01

Pos logit contributions
 dishes+0.008
 cloth+0.008
amas+0.008
 hurts+0.008
 cheeks+0.007
Neg logit contributions
 known-0.009
abella-0.009
 knowing-0.008
Fre-0.008
rou-0.008
Token says
Token ID1139
Feature activation+1.45e+00

Pos logit contributions
 known+0.013
 knowing+0.012
Fre+0.012
abella+0.012
rou+0.011
Neg logit contributions
 dishes-0.007
 cloth-0.007
 hurts-0.007
 hands-0.007
amas-0.007
Token yes
Token ID3763
Feature activation+2.56e-01

Pos logit contributions
 known+0.060
abella+0.054
Fre+0.054
 knowing+0.053
rou+0.052
Neg logit contributions
 dishes-0.061
 cloth-0.061
amas-0.060
 hurts-0.059
 hands-0.059
Token,
Token ID11
Feature activation+7.26e-01

Pos logit contributions
 known+0.012
 knowing+0.011
abella+0.011
Fre+0.010
rou+0.010
Neg logit contributions
 dishes-0.010
amas-0.010
 cloth-0.010
 hands-0.009
 hurts-0.009
Token but
Token ID475
Feature activation+4.33e-01

Pos logit contributions
 known+0.031
abella+0.028
 knowing+0.028
Fre+0.027
rou+0.026
Neg logit contributions
 dishes-0.030
amas-0.030
 cloth-0.030
 hurts-0.029
 hands-0.029
Token they
Token ID484
Feature activation-3.60e-01

Pos logit contributions
 known+0.019
abella+0.018
Fre+0.018
rou+0.017
 knowing+0.017
Neg logit contributions
 dishes-0.017
 cloth-0.017
 hurts-0.017
 hands-0.017
amas-0.016
Token have
Token ID423
Feature activation+6.88e-01

Pos logit contributions
 dishes+0.013
 cloth+0.013
amas+0.013
 hurts+0.012
 hands+0.012
Neg logit contributions
 known-0.017
abella-0.016
 knowing-0.016
Fre-0.016
rou-0.015
Token to
Token ID284
Feature activation-7.72e-01

Pos logit contributions
 known+0.033
abella+0.030
 knowing+0.029
Fre+0.029
rou+0.028
Neg logit contributions
amas-0.025
 dishes-0.025
 cloth-0.025
 hurts-0.024
 hands-0.024
Token be
Token ID307
Feature activation+1.44e-01

Pos logit contributions
amas+0.028
 dishes+0.026
 hurts+0.026
 cloth+0.026
 hands+0.025
Neg logit contributions
 known-0.038
 knowing-0.034
Fre-0.034
abella-0.033
rou-0.032
Token careful
Token ID8161
Feature activation+4.97e-01

Pos logit contributions
 known+0.006
Fre+0.005
abella+0.005
 knowing+0.005
rou+0.005
Neg logit contributions
 dishes-0.006
 hands-0.006
 cloth-0.006
amas-0.006
 hurts-0.006
Token.
Token ID13
Feature activation+2.06e-02

Pos logit contributions
 knowing+0.019
 known+0.018
 discovering+0.016
rou+0.016
 seeing+0.016
Neg logit contributions
 cloth-0.023
amas-0.022
 dishes-0.021
 hurts-0.021
 sponge-0.021
Token mean
Token ID1612
Feature activation+8.86e-01

Pos logit contributions
 known+0.063
abella+0.058
Fre+0.057
 knowing+0.057
rou+0.055
Neg logit contributions
 dishes-0.054
 cloth-0.054
amas-0.053
 hurts-0.052
 hands-0.052
Token to
Token ID284
Feature activation-2.44e-01

Pos logit contributions
 known+0.038
 knowing+0.036
abella+0.035
Fre+0.034
rou+0.034
Neg logit contributions
amas-0.036
 cloth-0.035
 dishes-0.035
 cheeks-0.034
 hands-0.033
Token hurt
Token ID5938
Feature activation+6.05e-01

Pos logit contributions
amas+0.009
 dishes+0.008
 cloth+0.008
 hands+0.008
 cheeks+0.008
Neg logit contributions
 known-0.012
 knowing-0.011
abella-0.011
Fre-0.011
rou-0.010
Token you
Token ID345
Feature activation+1.19e-01

Pos logit contributions
 known+0.025
 knowing+0.023
Fre+0.023
abella+0.022
rou+0.021
Neg logit contributions
amas-0.026
 cloth-0.024
 cheeks-0.024
 dishes-0.024
 lap-0.023
Token or
Token ID393
Feature activation+5.87e-01

Pos logit contributions
 known+0.005
 knowing+0.005
rou+0.004
abella+0.004
Fre+0.004
Neg logit contributions
 cloth-0.005
amas-0.005
 dishes-0.005
 hands-0.005
 cheeks-0.005
Token your
Token ID534
Feature activation-6.63e-01

Pos logit contributions
 known+0.026
 knowing+0.024
abella+0.022
 seeing+0.022
rou+0.021
Neg logit contributions
amas-0.026
 dishes-0.023
 cloth-0.022
 cheeks-0.021
akes-0.021
Token son
Token ID3367
Feature activation-9.44e-01

Pos logit contributions
amas+0.022
 dishes+0.019
 cloth+0.019
 hurts+0.019
 cheeks+0.018
Neg logit contributions
 known-0.033
 knowing-0.031
rou-0.031
Fre-0.029
joy-0.029
Token."
Token ID526
Feature activation+2.01e-02

Pos logit contributions
 cloth+0.037
amas+0.037
 dishes+0.035
 hands+0.033
 sponge+0.033
Neg logit contributions
 known-0.042
 knowing-0.039
abella-0.037
rou-0.036
Fre-0.035
Token
Token ID198
Feature activation-3.71e-01

Pos logit contributions
 known+0.001
abella+0.001
 knowing+0.001
Fre+0.001
rou+0.001
Neg logit contributions
 dishes-0.001
amas-0.001
 cloth-0.001
 hurts-0.001
 hands-0.001
Token
Token ID198
Feature activation-6.24e-01

Pos logit contributions
 cloth+0.012
 hurts+0.012
amas+0.012
 hands+0.011
 dishes+0.011
Neg logit contributions
 known-0.019
 knowing-0.017
abella-0.017
Fre-0.017
rou-0.017
TokenThe
Token ID464
Feature activation-3.12e-01

Pos logit contributions
 cloth+0.025
 dishes+0.025
amas+0.025
 hurts+0.024
 hands+0.024
Neg logit contributions
 known-0.027
 knowing-0.024
abella-0.024
rou-0.023
Fre-0.023
Token glad
Token ID9675
Feature activation+2.78e-01

Pos logit contributions
 dishes+0.044
 cloth+0.044
 hurts+0.044
amas+0.043
 hands+0.043
Neg logit contributions
 known-0.038
abella-0.035
Fre-0.034
 knowing-0.033
rou-0.032
Token that
Token ID326
Feature activation+7.30e-01

Pos logit contributions
 known+0.015
abella+0.014
Fre+0.014
 knowing+0.013
rou+0.013
Neg logit contributions
 cloth-0.008
 dishes-0.008
 hands-0.008
 hurts-0.008
amas-0.008
Token you
Token ID345
Feature activation+1.47e+00

Pos logit contributions
 known+0.035
abella+0.032
 knowing+0.032
Fre+0.031
rou+0.031
Neg logit contributions
amas-0.026
 dishes-0.025
 cloth-0.025
 cheeks-0.024
 hands-0.024
Token're
Token ID821
Feature activation-4.41e-01

Pos logit contributions
 known+0.065
abella+0.058
 knowing+0.058
Fre+0.058
rou+0.056
Neg logit contributions
amas-0.059
 dishes-0.058
 cloth-0.057
 hands-0.055
 hurts-0.055
Token here
Token ID994
Feature activation-4.21e-01

Pos logit contributions
 dishes+0.018
 cloth+0.018
 hurts+0.018
amas+0.018
 hands+0.017
Neg logit contributions
 known-0.018
abella-0.017
Fre-0.017
rou-0.016
 knowing-0.016
Token.
Token ID13
Feature activation+1.19e-01

Pos logit contributions
 dishes+0.016
 cloth+0.016
amas+0.016
 hurts+0.016
 hands+0.016
Neg logit contributions
 known-0.019
abella-0.017
Fre-0.017
 knowing-0.017
rou-0.016
Token I
Token ID314
Feature activation+1.62e+00

Pos logit contributions
 known+0.006
abella+0.005
 knowing+0.005
Fre+0.005
rou+0.005
Neg logit contributions
amas-0.004
 hands-0.004
 cloth-0.004
 dishes-0.004
 hurts-0.004
Token have
Token ID423
Feature activation+4.22e-01

Pos logit contributions
 known+0.071
abella+0.065
 knowing+0.064
Fre+0.064
rou+0.062
Neg logit contributions
 cloth-0.063
 dishes-0.063
amas-0.062
 hurts-0.062
 hands-0.061
Token a
Token ID257
Feature activation-4.85e-01

Pos logit contributions
 known+0.018
abella+0.016
 knowing+0.016
Fre+0.016
rou+0.015
Neg logit contributions
amas-0.018
 dishes-0.018
 cloth-0.018
 hurts-0.017
 cheeks-0.017
Token special
Token ID2041
Feature activation-6.58e-01

Pos logit contributions
amas+0.018
 dishes+0.018
 cloth+0.017
 hurts+0.017
 hands+0.017
Neg logit contributions
 known-0.023
 knowing-0.021
abella-0.020
Fre-0.020
rou-0.020
Token surprise
Token ID5975
Feature activation-4.54e-01

Pos logit contributions
amas+0.024
 dishes+0.023
 hurts+0.022
 cheeks+0.021
 hands+0.021
Neg logit contributions
 known-0.031
 knowing-0.029
Fre-0.027
rou-0.027
abella-0.026
Token Jack
Token ID3619
Feature activation-6.29e-01

Pos logit contributions
 known+0.005
Fre+0.005
abella+0.005
rou+0.004
 knowing+0.004
Neg logit contributions
 dishes-0.004
 cloth-0.004
 hands-0.004
amas-0.003
 cheeks-0.003
Token.
Token ID13
Feature activation+5.78e-01

Pos logit contributions
 dishes+0.028
 hands+0.027
 cloth+0.027
amas+0.026
 hurts+0.026
Neg logit contributions
 known-0.026
Fre-0.024
abella-0.023
Ag-0.021
 knowing-0.021
Token
Token ID198
Feature activation+5.84e-01

Pos logit contributions
 known+0.029
Fre+0.027
abella+0.027
 knowing+0.026
rou+0.026
Neg logit contributions
amas-0.020
 dishes-0.019
 cloth-0.019
 hands-0.019
 hurts-0.018
Token
Token ID198
Feature activation+9.33e-02

Pos logit contributions
 known+0.031
 knowing+0.029
rou+0.028
Fre+0.028
abella+0.028
Neg logit contributions
 cloth-0.017
amas-0.017
 dishes-0.016
 hands-0.016
 hurts-0.016
TokenJack
Token ID14295
Feature activation-6.47e-01

Pos logit contributions
 known+0.004
abella+0.004
 knowing+0.004
Fre+0.004
rou+0.003
Neg logit contributions
 cloth-0.004
 dishes-0.004
amas-0.004
 hurts-0.003
 hands-0.003
Token felt
Token ID2936
Feature activation-4.37e-01

Pos logit contributions
 dishes+0.026
 cloth+0.026
amas+0.025
 hurts+0.025
 hands+0.025
Neg logit contributions
 known-0.028
abella-0.026
 knowing-0.025
Fre-0.025
rou-0.025
Token happy
Token ID3772
Feature activation-9.57e-01

Pos logit contributions
 dishes+0.020
 cloth+0.020
amas+0.020
 hurts+0.020
 hands+0.020
Neg logit contributions
 known-0.016
abella-0.015
Fre-0.014
 knowing-0.014
rou-0.014
Token too
Token ID1165
Feature activation+2.15e+00

Pos logit contributions
 cloth+0.036
 dishes+0.035
 hands+0.035
amas+0.034
 hurts+0.034
Neg logit contributions
 known-0.045
Fre-0.042
abella-0.042
rou-0.039
 knowing-0.038
Token because
Token ID780
Feature activation+3.92e-01

Pos logit contributions
 known+0.114
abella+0.106
Fre+0.106
rou+0.102
 knowing+0.101
Neg logit contributions
 dishes-0.067
amas-0.066
 cloth-0.066
 hurts-0.064
 hands-0.064
Token he
Token ID339
Feature activation-1.66e-01

Pos logit contributions
 known+0.021
Fre+0.021
abella+0.020
rou+0.020
uth+0.019
Neg logit contributions
 dishes-0.011
 cloth-0.011
 hands-0.010
 hurts-0.010
amas-0.010
Token helped
Token ID4193
Feature activation-1.43e+00

Pos logit contributions
 cloth+0.007
 hurts+0.007
 dishes+0.007
amas+0.007
 hands+0.006
Neg logit contributions
 known-0.007
Fre-0.006
abella-0.006
 knowing-0.006
rou-0.006
Token So
Token ID1406
Feature activation-1.79e-01

Pos logit contributions
 hands+0.000
amas+0.000
 dishes+0.000
 cloth+0.000
 hurts+0.000
Neg logit contributions
 known-0.001
rou-0.001
 knowing-0.001
abella-0.001
Fre-0.001
Token he
Token ID339
Feature activation-2.03e-01

Pos logit contributions
 hands+0.005
 cloth+0.004
 hurts+0.004
 cheeks+0.004
 dishes+0.004
Neg logit contributions
rou-0.011
Fre-0.010
 known-0.010
abella-0.010
 knowing-0.009
Token asked
Token ID1965
Feature activation+8.46e-01

Pos logit contributions
 cloth+0.007
 dishes+0.007
 hands+0.007
 hurts+0.007
amas+0.006
Neg logit contributions
 known-0.010
rou-0.009
Fre-0.009
 knowing-0.009
joy-0.009
Token Jim
Token ID5395
Feature activation-7.66e-01

Pos logit contributions
 known+0.040
rou+0.040
Fre+0.038
 knowing+0.037
Ag+0.037
Neg logit contributions
 hands-0.031
 dishes-0.030
 cloth-0.029
 hurts-0.027
 cheeks-0.026
Token,
Token ID11
Feature activation+1.64e-01

Pos logit contributions
 hands+0.026
 dishes+0.026
 cloth+0.025
 hurts+0.024
 cheeks+0.022
Neg logit contributions
 known-0.038
abella-0.035
Fre-0.035
Ag-0.034
rou-0.034
Token "
Token ID366
Feature activation-3.23e-01

Pos logit contributions
 known+0.008
abella+0.008
Fre+0.008
rou+0.008
uth+0.008
Neg logit contributions
 cloth-0.005
 hands-0.005
 dishes-0.005
 hand-0.004
amas-0.004
TokenCan
Token ID6090
Feature activation+6.16e-01

Pos logit contributions
 cloth+0.014
 dishes+0.013
amas+0.013
 hands+0.012
 hurts+0.012
Neg logit contributions
 known-0.013
abella-0.013
rou-0.012
 knowing-0.012
Fre-0.012
Token you
Token ID345
Feature activation-3.20e-01

Pos logit contributions
 known+0.029
abella+0.026
Fre+0.025
 knowing+0.025
rou+0.024
Neg logit contributions
amas-0.023
 dishes-0.022
 cloth-0.022
 hands-0.021
 hurts-0.021
Token help
Token ID1037
Feature activation-2.48e+00

Pos logit contributions
amas+0.010
 dishes+0.009
 cloth+0.009
 hurts+0.009
 hands+0.009
Neg logit contributions
 known-0.017
 knowing-0.016
abella-0.016
Fre-0.016
rou-0.015
Token me
Token ID502
Feature activation-1.34e+00

Pos logit contributions
amas+0.113
 cheeks+0.110
bars+0.109
 hurts+0.108
 sleeves+0.107
Neg logit contributions
 known-0.088
abella-0.080
Fre-0.079
 knowing-0.077
 seeing-0.077
Token get
Token ID651
Feature activation-2.96e-01

Pos logit contributions
amas+0.051
 cheeks+0.051
 cloth+0.050
 hurts+0.049
 sleeves+0.049
Neg logit contributions
 known-0.059
 knowing-0.054
Fre-0.051
 discovering-0.050
 seeing-0.050
Token a
Token ID257
Feature activation+1.75e-01

Pos logit contributions
 known+0.020
abella+0.019
Fre+0.019
rou+0.019
 knowing+0.018
Neg logit contributions
 cloth-0.012
 dishes-0.012
 hands-0.011
amas-0.011
 hurts-0.010
Token little
Token ID1310
Feature activation-7.05e-01

Pos logit contributions
 known+0.011
abella+0.010
Fre+0.010
 discovering+0.010
 knowing+0.010
Neg logit contributions
 cloth-0.005
 dishes-0.004
 thumb-0.004
 chair-0.004
 sponge-0.004
Token boy
Token ID2933
Feature activation-2.26e-01

Pos logit contributions
amas+0.028
 dishes+0.027
 hurts+0.026
 cheeks+0.026
 cloth+0.025
Neg logit contributions
 known-0.030
 knowing-0.028
Fre-0.028
rou-0.027
abella-0.027
Token named
Token ID3706
Feature activation+2.20e+00

Pos logit contributions
 dishes+0.014
 cloth+0.014
amas+0.014
 hurts+0.013
 hands+0.013
Neg logit contributions
 known-0.005
abella-0.004
Fre-0.004
 knowing-0.004
rou-0.004
Token Tim
Token ID5045
Feature activation-1.20e+00

Pos logit contributions
 known+0.082
abella+0.077
 knowing+0.073
Fre+0.071
rou+0.070
Neg logit contributions
amas-0.100
 cloth-0.099
 dishes-0.098
 hurts-0.097
 cheeks-0.095
Tokenmy
Token ID1820
Feature activation-2.37e+00

Pos logit contributions
 dishes+0.044
amas+0.044
 cloth+0.043
 hurts+0.042
 hands+0.041
Neg logit contributions
 known-0.057
abella-0.052
 knowing-0.051
Fre-0.051
rou-0.050
Token.
Token ID13
Feature activation-4.11e-02

Pos logit contributions
 cloth+0.071
 dishes+0.070
amas+0.068
 hurts+0.068
 hands+0.068
Neg logit contributions
 known-0.128
abella-0.120
Fre-0.119
 knowing-0.115
rou-0.113
Token Tim
Token ID5045
Feature activation-4.05e-01

Pos logit contributions
 dishes+0.002
amas+0.002
 cloth+0.002
 hurts+0.002
 cheeks+0.002
Neg logit contributions
 known-0.001
 knowing-0.001
rou-0.001
abella-0.001
 discovering-0.001
Tokenmy
Token ID1820
Feature activation-1.70e+00

Pos logit contributions
amas+0.018
 dishes+0.017
 cheeks+0.017
 cloth+0.017
 hands+0.017
Neg logit contributions
 known-0.016
 knowing-0.014
rou-0.014
abella-0.014
Fre-0.013
Token loved
Token ID6151
Feature activation-1.05e+00

Pos logit contributions
amas+0.086
 dishes+0.085
 cloth+0.083
 cheeks+0.081
 hands+0.081
Neg logit contributions
 known-0.057
 knowing-0.051
abella-0.049
rou-0.046
Fre-0.045
Token playing
Token ID2712
Feature activation-1.46e+00

Pos logit contributions
amas+0.049
 hurts+0.046
 cheeks+0.046
 sleeve+0.045
 cloth+0.045
Neg logit contributions
 known-0.039
 knowing-0.038
 seeing-0.036
 discovering-0.035
 spending-0.034
TokenM
Token ID44
Feature activation-9.40e-01

Pos logit contributions
 known+0.001
 knowing+0.001
abella+0.001
rou+0.001
Fre+0.001
Neg logit contributions
 cloth-0.002
 dishes-0.002
amas-0.002
 hands-0.001
 hurts-0.001
Tokenagg
Token ID9460
Feature activation+7.36e-01

Pos logit contributions
 dishes+0.046
 cloth+0.045
amas+0.045
 hurts+0.044
 hands+0.044
Neg logit contributions
 known-0.033
abella-0.029
 knowing-0.029
Fre-0.028
rou-0.027
Tokenie
Token ID494
Feature activation-1.74e+00

Pos logit contributions
 known+0.032
abella+0.030
Fre+0.029
 knowing+0.029
rou+0.028
Neg logit contributions
amas-0.030
 dishes-0.029
 cloth-0.029
 hurts-0.028
 hands-0.028
Token's
Token ID338
Feature activation+9.09e-01

Pos logit contributions
 dishes+0.065
amas+0.064
 cloth+0.064
 hurts+0.062
 hands+0.061
Neg logit contributions
 known-0.081
abella-0.074
 knowing-0.073
Fre-0.072
rou-0.070
Token mom
Token ID1995
Feature activation-1.15e+00

Pos logit contributions
 known+0.044
 knowing+0.042
rou+0.041
abella+0.040
joy+0.039
Neg logit contributions
amas-0.031
 hurts-0.030
 dishes-0.028
 cloth-0.028
 diaper-0.026
Tokenmy
Token ID1820
Feature activation-2.45e+00

Pos logit contributions
 dishes+0.043
 cloth+0.042
amas+0.042
 hurts+0.041
 hands+0.041
Neg logit contributions
 known-0.053
abella-0.049
 knowing-0.048
Fre-0.048
rou-0.047
Token was
Token ID373
Feature activation-4.09e-01

Pos logit contributions
 cloth+0.099
 dishes+0.099
amas+0.098
 hurts+0.096
 hands+0.095
Neg logit contributions
 known-0.104
abella-0.096
Fre-0.096
 knowing-0.094
rou-0.091
Token very
Token ID845
Feature activation-6.45e-01

Pos logit contributions
 dishes+0.017
amas+0.017
 cloth+0.017
 hands+0.016
 hurts+0.016
Neg logit contributions
 known-0.016
Fre-0.015
abella-0.015
 knowing-0.015
rou-0.015
Token angry
Token ID7954
Feature activation+1.99e-01

Pos logit contributions
 dishes+0.027
 cloth+0.027
amas+0.026
 hurts+0.026
 hands+0.026
Neg logit contributions
 known-0.026
abella-0.025
Fre-0.024
 knowing-0.024
rou-0.023
Token with
Token ID351
Feature activation-1.92e-01

Pos logit contributions
 known+0.010
abella+0.009
Fre+0.009
 knowing+0.009
rou+0.009
Neg logit contributions
 dishes-0.007
 cloth-0.007
amas-0.007
 hurts-0.007
 hands-0.007
Token Maggie
Token ID33014
Feature activation-1.14e+00

Pos logit contributions
 dishes+0.007
 cloth+0.007
amas+0.006
 hands+0.006
 hurts+0.006
Neg logit contributions
 known-0.009
abella-0.009
 knowing-0.009
Fre-0.009
rou-0.008
Token asked
Token ID1965
Feature activation+7.69e-01

Pos logit contributions
amas+0.006
 dishes+0.006
 cloth+0.006
 hurts+0.005
 hands+0.005
Neg logit contributions
 known-0.007
abella-0.006
 knowing-0.006
Fre-0.006
rou-0.006
Token Tom
Token ID4186
Feature activation-1.42e-01

Pos logit contributions
 known+0.040
rou+0.039
Fre+0.038
Ag+0.037
Wal+0.037
Neg logit contributions
 dishes-0.024
 hands-0.024
 cloth-0.023
 hurts-0.023
 cheeks-0.022
Token to
Token ID284
Feature activation-9.28e-01

Pos logit contributions
 dishes+0.005
 hurts+0.005
 cloth+0.005
 hands+0.005
amas+0.005
Neg logit contributions
 known-0.007
Fre-0.006
abella-0.006
rou-0.006
 knowing-0.006
Token give
Token ID1577
Feature activation+6.04e-01

Pos logit contributions
amas+0.036
 dishes+0.032
 sleeves+0.031
 sheets+0.030
 hands+0.030
Neg logit contributions
 known-0.046
 knowing-0.040
abella-0.038
 seeing-0.038
Fre-0.038
Token her
Token ID607
Feature activation-2.06e+00

Pos logit contributions
 known+0.031
Fre+0.029
 knowing+0.029
abella+0.029
rou+0.029
Neg logit contributions
 dishes-0.020
 cloth-0.019
 hands-0.019
 hurts-0.019
amas-0.018
Token lamp
Token ID20450
Feature activation-2.45e+00

Pos logit contributions
amas+0.115
 cheeks+0.108
 elbows+0.106
 sleeves+0.101
akes+0.098
Neg logit contributions
 known-0.060
 exactly-0.050
 knowing-0.050
rou-0.048
abella-0.046
Token back
Token ID736
Feature activation-4.32e-01

Pos logit contributions
amas+0.125
 cheeks+0.108
 dishes+0.108
 sleeves+0.105
 elbows+0.104
Neg logit contributions
 known-0.087
 knowing-0.081
rou-0.076
 exactly-0.074
abella-0.073
Token.
Token ID13
Feature activation+9.02e-02

Pos logit contributions
amas+0.015
 dishes+0.015
 cloth+0.015
 hurts+0.014
 cheeks+0.014
Neg logit contributions
 known-0.021
 knowing-0.019
abella-0.019
Fre-0.019
rou-0.019
Token
Token ID198
Feature activation+2.62e-01

Pos logit contributions
 known+0.003
abella+0.003
Fre+0.003
 knowing+0.003
rou+0.003
Neg logit contributions
 dishes-0.004
 cloth-0.004
amas-0.004
 hurts-0.004
 diaper-0.004
Token
Token ID198
Feature activation-1.69e-01

Pos logit contributions
 known+0.012
 knowing+0.011
abella+0.011
Fre+0.011
rou+0.011
Neg logit contributions
 cloth-0.009
amas-0.009
 dishes-0.009
 hurts-0.009
 hands-0.009
TokenTom
Token ID13787
Feature activation+4.00e-01

Pos logit contributions
 dishes+0.008
 cloth+0.008
amas+0.008
 hurts+0.008
 hands+0.008
Neg logit contributions
 known-0.006
abella-0.005
Fre-0.005
 knowing-0.005
rou-0.005
Token on
Token ID319
Feature activation-1.37e-01

Pos logit contributions
amas+0.000
 hurts+0.000
 cheeks+0.000
 dishes+0.000
 thumb+0.000
Neg logit contributions
 known-0.000
 knowing-0.000
abella-0.000
Fre-0.000
rou-0.000
Token their
Token ID511
Feature activation-4.55e-01

Pos logit contributions
amas+0.006
 hurts+0.006
 dishes+0.006
 cloth+0.005
 cheeks+0.005
Neg logit contributions
 known-0.006
abella-0.005
Fre-0.005
 knowing-0.005
rou-0.005
Token toys
Token ID14958
Feature activation-8.57e-01

Pos logit contributions
amas+0.016
akes+0.015
iren+0.015
rays+0.014
 hurts+0.014
Neg logit contributions
 ge-0.016
 ze-0.016
 discovering-0.016
 gor-0.016
ya-0.016
Token,
Token ID11
Feature activation+7.05e-01

Pos logit contributions
 hurts+0.043
amas+0.043
 diaper+0.041
 cheeks+0.040
 dishes+0.040
Neg logit contributions
 knowing-0.025
 known-0.023
 seeing-0.022
 discovering-0.021
abella-0.021
Token and
Token ID290
Feature activation+1.51e-01

Pos logit contributions
 known+0.029
 knowing+0.028
abella+0.025
 discovering+0.025
 seeing+0.025
Neg logit contributions
amas-0.031
 hurts-0.028
 dishes-0.028
 cloth-0.026
 diaper-0.026
Token make
Token ID787
Feature activation-2.35e+00

Pos logit contributions
 known+0.007
 knowing+0.007
abella+0.006
 discovering+0.006
Fre+0.006
Neg logit contributions
amas-0.006
 dishes-0.005
 hurts-0.005
 cloth-0.005
 diaper-0.005
Token them
Token ID606
Feature activation-2.03e+00

Pos logit contributions
 dishes+0.084
amas+0.083
 cloth+0.083
 hurts+0.081
 hands+0.079
Neg logit contributions
 known-0.113
abella-0.103
 knowing-0.102
Fre-0.102
rou-0.099
Token move
Token ID1445
Feature activation-5.91e-01

Pos logit contributions
amas+0.082
 hurts+0.072
 dishes+0.070
 cheeks+0.070
 sleeves+0.070
Neg logit contributions
 known-0.091
 knowing-0.084
 impressed-0.080
rou-0.079
Fre-0.077
Token.
Token ID13
Feature activation-2.06e-01

Pos logit contributions
amas+0.024
 hurts+0.023
 dishes+0.022
 cloth+0.022
 cheeks+0.022
Neg logit contributions
 known-0.025
 knowing-0.024
abella-0.023
rou-0.023
Fre-0.023
Token The
Token ID383
Feature activation-4.42e-01

Pos logit contributions
 dishes+0.007
 cloth+0.007
amas+0.007
 hurts+0.007
 hands+0.006
Neg logit contributions
 known-0.010
abella-0.010
Fre-0.010
 knowing-0.010
rou-0.009
Token whale
Token ID22206
Feature activation+5.28e-02

Pos logit contributions
amas+0.016
 hurts+0.014
chair+0.013
 cheeks+0.012
 diaper+0.012
Neg logit contributions
 known-0.021
Fre-0.019
 knowing-0.019
abella-0.018
joy-0.018
Token to
Token ID284
Feature activation+2.81e-01

Pos logit contributions
 dishes+0.036
amas+0.036
 cloth+0.036
 hurts+0.035
 hands+0.034
Neg logit contributions
 known-0.034
 knowing-0.031
abella-0.031
Fre-0.030
rou-0.029
Token go
Token ID467
Feature activation-1.35e-01

Pos logit contributions
 known+0.016
abella+0.015
Fre+0.014
 knowing+0.014
rou+0.014
Neg logit contributions
amas-0.008
 dishes-0.007
 hurts-0.007
 cloth-0.007
 hands-0.007
Token home
Token ID1363
Feature activation+4.95e-01

Pos logit contributions
 dishes+0.005
 cloth+0.005
amas+0.005
 hurts+0.005
 hands+0.005
Neg logit contributions
 known-0.006
abella-0.005
Fre-0.005
 knowing-0.005
rou-0.005
Token!"
Token ID2474
Feature activation-4.82e-01

Pos logit contributions
 known+0.023
abella+0.021
Fre+0.021
 knowing+0.020
rou+0.020
Neg logit contributions
 dishes-0.018
amas-0.018
 cloth-0.018
 hurts-0.018
 hands-0.018
Token Tim
Token ID5045
Feature activation-9.82e-01

Pos logit contributions
 dishes+0.023
 hurts+0.023
 diaper+0.023
 cloth+0.022
 sheets+0.022
Neg logit contributions
 knowing-0.016
 known-0.014
 seeing-0.013
 discovering-0.013
 Gig-0.013
Tokenmy
Token ID1820
Feature activation-2.38e+00

Pos logit contributions
 dishes+0.044
 cloth+0.043
amas+0.042
 cheeks+0.040
 hurts+0.040
Neg logit contributions
 known-0.039
 knowing-0.035
rou-0.035
abella-0.035
Fre-0.032
Token didn
Token ID1422
Feature activation-2.73e-01

Pos logit contributions
 dishes+0.097
 cloth+0.094
amas+0.094
 hurts+0.089
 diaper+0.087
Neg logit contributions
 known-0.102
abella-0.097
 knowing-0.094
Fre-0.089
rou-0.089
Token't
Token ID470
Feature activation+4.83e-01

Pos logit contributions
 hands+0.010
 cloth+0.010
 cheeks+0.010
 dishes+0.010
 sleeves+0.009
Neg logit contributions
abella-0.012
 known-0.012
Fre-0.011
 knowing-0.011
rou-0.011
Token want
Token ID765
Feature activation-9.88e-01

Pos logit contributions
 known+0.021
abella+0.019
Fre+0.019
 knowing+0.019
rou+0.019
Neg logit contributions
 dishes-0.019
 cloth-0.019
amas-0.019
 hurts-0.019
 hands-0.018
Token to
Token ID284
Feature activation-5.30e-01

Pos logit contributions
amas+0.044
 dishes+0.041
 cloth+0.041
 cheeks+0.040
 hands+0.039
Neg logit contributions
 known-0.041
 knowing-0.037
abella-0.037
Fre-0.034
 discovering-0.034
Token leave
Token ID2666
Feature activation-5.30e-01

Pos logit contributions
amas+0.018
 dishes+0.017
 cloth+0.017
 hurts+0.016
 hands+0.016
Neg logit contributions
 known-0.027
 knowing-0.024
abella-0.024
Fre-0.024
rou-0.023