TOP ACTIVATIONS
MAX = 9.248

old man who had a kay ak . He decided to
, he found a big seas hell and decided to design
They are very happy . Mom and Dad have
They are very happy . M ama says ,
park with their mom . They see a big
. He is happy . One day , he
day , Lily found a seas hell and showed it to
is a smart person . One day , Tim
little bird named Billy . At the park ,
and many other friends . Anna 's mom makes
girl was very excited . They got out of
the parade a lot . Tim saw a man
They are very happy . Mom and Dad have
celebrate with their friends . They help their mom
to show his friends . Tom met his friend
They are very excited . They go to the
was a good reader . One day , she
ways to write it . Tom was a farmer
loves Spike very much . One day , Ben
fun at the gym . One day , Tom

MIN ACTIVATIONS
MIN = -10.951

amazed at how much the little boy had learned in the
the story of how the little girl made her own green
thanked the elderly rh in oc eros for his wonderful surprise
no matter how hard the little boy worked , he never
have stopped the rh in oc eros . He is sleeping
the mystery of where the little girl had gone remained unsolved
. The av oc ados were shared with everybody
sky and said , " Little girl , why are you
then on , whenever the young girl came to visit ,
the story of how the little girl got her magical jewel
new home . Everywhere the little girl looked she saw something
wait until the rh in oc eros is gone ."
name and said , " Little girl , give me that
had never seen the av oc ados . They wished they
Mom my hugged Little Jimmy and said , "
And that was how the little girl learned her lesson -
They forget about the av oc ados . They just want
appeared and said , " Little girl , pick the right
know Tw inkle Tw inkle Little Star ?" L
sings Tw inkle Tw inkle Little Star . She falls asleep

INTERVAL 4.198 - 9.248
CONTAINS 9.326%

advice next time . The moral of this
after himself . <|endoftext|> Once upon a time , there was
One day , she saw a carpet in the hallway and
upon a time there was a orange cat who lived in
they joined her , they could help even more .

INTERVAL -0.852 - 4.198
CONTAINS 47.324%

turn into a bird so I can fly too ?"
there with her and that she had listened to her mom
j olly little town , there lived a girl named Lucy
when the two of them went to the store , there
lion was too fast . L ily saw

INTERVAL -5.901 - -0.852
CONTAINS 42.890%

excited , because it was Sunday . All week he had
. She tries to catch him . But she slips too
one day you can design your own elevator !"
it . One day Jack got the idea to
" Yes , we can play with the dog

INTERVAL -10.951 - -5.901
CONTAINS 0.460%

Do you remember where the other star is ?" Ben looked
my was tall and the little girl was small .
to stroke it . The little bird calmed down .
went to their mom and told her what happened .
it ?" " Sure , you can play with
Token old
Token ID1468
Feature activation+3.71e-01

Pos logit contributions
One+0.059
ggy+0.058
gh+0.053
 badly+0.052
 read+0.052
Neg logit contributions
elling-0.079
aughter-0.074
ener-0.073
mith-0.073
 Me-0.070
Token man
Token ID582
Feature activation+1.50e+00

Pos logit contributions
One+0.008
ggy+0.008
gh+0.007
 badly+0.007
ggie+0.007
Neg logit contributions
elling-0.006
aughter-0.006
ener-0.006
mith-0.006
ada-0.005
Token who
Token ID508
Feature activation+2.81e-01

Pos logit contributions
One+0.026
gh+0.025
ggy+0.024
 villains+0.024
 seek+0.024
Neg logit contributions
elling-0.027
ener-0.025
aughter-0.025
 Acc-0.025
mith-0.025
Token had
Token ID550
Feature activation+4.89e+00

Pos logit contributions
One+0.005
gh+0.005
ggy+0.005
 nap+0.005
vel+0.005
Neg logit contributions
 Acc-0.005
elling-0.004
 Me-0.004
aughter-0.004
cy-0.004
Token a
Token ID257
Feature activation+6.44e+00

Pos logit contributions
vel+0.096
irl+0.094
gh+0.094
 seek+0.093
ggy+0.092
Neg logit contributions
 Daddy-0.071
 Little-0.071
 The-0.069
 Dad-0.069
 Mom-0.065
Token kay
Token ID34681
Feature activation+9.25e+00

Pos logit contributions
One+0.085
ggy+0.085
gh+0.078
aches+0.078
vel+0.076
Neg logit contributions
aughter-0.139
 Girl-0.136
 Daughter-0.133
 Acc-0.133
 Daddy-0.133
Tokenak
Token ID461
Feature activation+4.86e+00

Pos logit contributions
ggy+0.101
 read+0.100
 nap+0.099
One+0.095
 prow+0.095
Neg logit contributions
elling-0.234
aughter-0.229
cy-0.225
isher-0.219
mith-0.215
Token.
Token ID13
Feature activation-1.61e+00

Pos logit contributions
One+0.084
ggy+0.082
gh+0.081
 seek+0.079
 villains+0.076
Neg logit contributions
elling-0.094
ener-0.086
mith-0.086
aughter-0.085
ishly-0.085
Token He
Token ID679
Feature activation+2.57e+00

Pos logit contributions
elling+0.026
aughter+0.024
ener+0.024
mith+0.023
ishly+0.023
Neg logit contributions
One-0.036
ggy-0.034
 read-0.033
 che-0.032
On-0.032
Token decided
Token ID3066
Feature activation-4.17e-01

Pos logit contributions
vel+0.046
gh+0.045
One+0.045
ggy+0.044
 nap+0.044
Neg logit contributions
elling-0.041
 Me-0.040
mith-0.040
ener-0.039
ted-0.039
Token to
Token ID284
Feature activation+3.08e+00

Pos logit contributions
elling+0.007
ener+0.007
aughter+0.007
mith+0.007
 Me+0.006
Neg logit contributions
One-0.008
ggy-0.008
gh-0.008
ggie-0.007
 nap-0.007
Token,
Token ID11
Feature activation-8.98e-01

Pos logit contributions
elling+0.023
aughter+0.021
ener+0.021
mith+0.021
ada+0.020
Neg logit contributions
One-0.024
ggy-0.023
gh-0.022
 badly-0.022
 nap-0.021
Token he
Token ID339
Feature activation+1.95e+00

Pos logit contributions
elling+0.017
aughter+0.016
ener+0.015
mith+0.014
ada+0.014
Neg logit contributions
One-0.019
ggy-0.018
 long-0.018
 not-0.018
 badly-0.018
Token found
Token ID1043
Feature activation+8.93e-01

Pos logit contributions
ggy+0.034
One+0.034
ggie+0.032
gh+0.032
 badly+0.032
Neg logit contributions
elling-0.036
ener-0.035
aughter-0.034
cy-0.034
ada-0.033
Token a
Token ID257
Feature activation+5.92e+00

Pos logit contributions
One+0.009
ggy+0.009
gh+0.008
 badly+0.008
ggie+0.008
Neg logit contributions
elling-0.025
aughter-0.023
ener-0.023
mith-0.023
ada-0.022
Token big
Token ID1263
Feature activation+2.00e+00

Pos logit contributions
One+0.083
ggy+0.075
gh+0.068
ggie+0.066
aches+0.065
Neg logit contributions
aughter-0.132
ener-0.129
elling-0.129
 Girl-0.128
ada-0.125
Token seas
Token ID21547
Feature activation+9.02e+00

Pos logit contributions
One+0.033
ggy+0.032
gh+0.030
ggie+0.029
 badly+0.028
Neg logit contributions
elling-0.043
aughter-0.040
ener-0.040
mith-0.039
ada-0.038
Tokenhell
Token ID12758
Feature activation+3.97e+00

Pos logit contributions
One+0.142
 villains+0.139
 prow+0.133
 seek+0.130
P+0.128
Neg logit contributions
elling-0.174
aughter-0.172
ada-0.163
ener-0.161
ion-0.157
Token and
Token ID290
Feature activation-8.97e-01

Pos logit contributions
ggy+0.080
One+0.077
 seek+0.077
gh+0.076
 villains+0.074
Neg logit contributions
elling-0.073
ted-0.069
ting-0.068
als-0.067
aughter-0.065
Token decided
Token ID3066
Feature activation-2.98e-01

Pos logit contributions
elling+0.018
ener+0.017
aughter+0.017
mith+0.016
ada+0.016
Neg logit contributions
One-0.016
ggy-0.016
gh-0.015
ggie-0.014
 badly-0.014
Token to
Token ID284
Feature activation+2.78e+00

Pos logit contributions
elling+0.004
ener+0.004
 Acc+0.004
mith+0.004
aughter+0.004
Neg logit contributions
One-0.006
ggy-0.006
 nap-0.006
ggie-0.006
gh-0.006
Token design
Token ID1486
Feature activation+1.93e+00

Pos logit contributions
ggy+0.048
One+0.047
gh+0.046
 badly+0.046
ggie+0.043
Neg logit contributions
elling-0.054
mith-0.050
ada-0.050
 Suz-0.050
 Joe-0.050
Token They
Token ID1119
Feature activation+1.77e+00

Pos logit contributions
ener+0.015
elling+0.015
aughter+0.014
mith+0.014
ishly+0.013
Neg logit contributions
One-0.022
ggy-0.021
On-0.020
 nap-0.020
 read-0.020
Token are
Token ID389
Feature activation+6.05e+00

Pos logit contributions
ggy+0.025
One+0.025
gh+0.023
 badly+0.022
ggie+0.022
Neg logit contributions
elling-0.042
aughter-0.040
ener-0.039
mith-0.039
ada-0.038
Token very
Token ID845
Feature activation+4.70e+00

Pos logit contributions
ggy+0.082
One+0.081
gh+0.074
ggie+0.070
 badly+0.069
Neg logit contributions
elling-0.146
aughter-0.140
ener-0.136
 Me-0.135
mith-0.135
Token happy
Token ID3772
Feature activation-9.86e-01

Pos logit contributions
One+0.062
ggy+0.060
gh+0.055
ggie+0.053
 prow+0.052
Neg logit contributions
elling-0.115
aughter-0.112
 Me-0.107
ener-0.106
 Acc-0.106
Token.
Token ID13
Feature activation-2.32e-01

Pos logit contributions
elling+0.020
aughter+0.019
ener+0.019
mith+0.018
 Me+0.018
Neg logit contributions
One-0.017
ggy-0.017
gh-0.016
ggie-0.015
 prow-0.014
Token
Token ID198
Feature activation+8.91e+00

Pos logit contributions
ener+0.005
elling+0.005
aughter+0.005
mith+0.005
ishly+0.005
Neg logit contributions
One-0.004
On-0.004
ggy-0.003
 read-0.003
 nap-0.003
Token
Token ID198
Feature activation+7.01e+00

Pos logit contributions
 che+0.207
ggy+0.202
gh+0.202
 prow+0.201
 badly+0.201
Neg logit contributions
 Me-0.099
aughter-0.095
elling-0.094
mith-0.091
 Little-0.089
TokenMom
Token ID29252
Feature activation-3.61e-01

Pos logit contributions
 prow+0.099
 lair+0.098
ggy+0.097
 villains+0.097
 nap+0.095
Neg logit contributions
elling-0.145
aughter-0.139
 Me-0.137
mith-0.136
ada-0.135
Token and
Token ID290
Feature activation-3.43e+00

Pos logit contributions
elling+0.006
mith+0.005
ener+0.005
aughter+0.005
ishly+0.005
Neg logit contributions
One-0.008
 read-0.008
ggy-0.008
 nap-0.007
 che-0.007
Token Dad
Token ID17415
Feature activation+1.27e+00

Pos logit contributions
elling+0.053
mith+0.045
ener+0.044
aughter+0.041
ishly+0.039
Neg logit contributions
One-0.085
 read-0.082
 not-0.079
On-0.079
 nap-0.078
Token have
Token ID423
Feature activation+4.11e+00

Pos logit contributions
ggy+0.021
One+0.020
gh+0.019
 badly+0.019
ggie+0.019
Neg logit contributions
elling-0.027
aughter-0.026
ener-0.025
mith-0.025
ada-0.024
Token They
Token ID1119
Feature activation+1.59e+00

Pos logit contributions
elling+0.018
ener+0.017
mith+0.017
aughter+0.016
ada+0.015
Neg logit contributions
One-0.031
ggy-0.029
 nap-0.029
On-0.028
 read-0.028
Token are
Token ID389
Feature activation+5.97e+00

Pos logit contributions
ggy+0.023
One+0.023
gh+0.021
 badly+0.020
ggie+0.020
Neg logit contributions
elling-0.037
aughter-0.035
ener-0.035
mith-0.035
ada-0.034
Token very
Token ID845
Feature activation+4.66e+00

Pos logit contributions
One+0.079
ggy+0.078
gh+0.071
ggie+0.067
 badly+0.067
Neg logit contributions
elling-0.147
aughter-0.140
ener-0.136
mith-0.136
 Me-0.135
Token happy
Token ID3772
Feature activation-1.15e+00

Pos logit contributions
One+0.060
ggy+0.059
gh+0.054
ggie+0.051
 badly+0.051
Neg logit contributions
elling-0.116
aughter-0.112
ener-0.107
 Me-0.106
mith-0.106
Token.
Token ID13
Feature activation-4.87e-01

Pos logit contributions
elling+0.025
aughter+0.023
ener+0.023
mith+0.023
 Me+0.022
Neg logit contributions
One-0.019
ggy-0.018
gh-0.017
ggie-0.016
 badly-0.016
Token
Token ID198
Feature activation+8.90e+00

Pos logit contributions
elling+0.010
ener+0.009
mith+0.009
aughter+0.009
ada+0.008
Neg logit contributions
One-0.010
 nap-0.009
ggy-0.009
On-0.009
 read-0.008
Token
Token ID198
Feature activation+7.00e+00

Pos logit contributions
 che+0.207
 badly+0.207
ggy+0.207
gh+0.206
aches+0.204
Neg logit contributions
 Me-0.101
aughter-0.096
elling-0.093
 Little-0.089
mith-0.088
TokenM
Token ID44
Feature activation-4.99e+00

Pos logit contributions
ggy+0.110
 prow+0.110
gh+0.108
 badly+0.107
 che+0.107
Neg logit contributions
aughter-0.130
 Me-0.129
elling-0.129
mith-0.122
 Our-0.121
Tokenama
Token ID1689
Feature activation-1.64e+00

Pos logit contributions
elling+0.064
aughter+0.057
ener+0.056
mith+0.055
ada+0.052
Neg logit contributions
One-0.125
ggy-0.124
gh-0.118
 badly-0.116
 nap-0.116
Token says
Token ID1139
Feature activation-3.67e+00

Pos logit contributions
elling+0.028
mith+0.027
 Acc+0.026
aughter+0.026
ener+0.026
Neg logit contributions
 read-0.033
One-0.033
ggy-0.032
 che-0.032
 nap-0.031
Token,
Token ID11
Feature activation-2.10e+00

Pos logit contributions
elling+0.076
ener+0.070
aughter+0.069
mith+0.068
ada+0.067
Neg logit contributions
ggy-0.067
One-0.066
gh-0.062
 nap-0.061
 read-0.061
Token park
Token ID3952
Feature activation+1.87e+00

Pos logit contributions
elling+0.018
aughter+0.017
 Me+0.017
mith+0.016
 Girl+0.016
Neg logit contributions
ggy-0.011
gh-0.011
One-0.010
 badly-0.010
ggie-0.010
Token with
Token ID351
Feature activation-1.83e+00

Pos logit contributions
One+0.030
ggy+0.030
gh+0.027
 badly+0.026
 nap+0.026
Neg logit contributions
elling-0.041
aughter-0.039
ener-0.038
mith-0.038
ada-0.036
Token their
Token ID511
Feature activation-7.34e-01

Pos logit contributions
elling+0.025
aughter+0.023
 Me+0.023
mith+0.021
ada+0.021
Neg logit contributions
ggy-0.042
One-0.042
gh-0.041
 badly-0.040
 prow-0.039
Token mom
Token ID1995
Feature activation-1.41e+00

Pos logit contributions
elling+0.014
ener+0.012
aughter+0.012
mith+0.012
ada+0.012
Neg logit contributions
ggy-0.015
One-0.015
 nap-0.014
 read-0.013
gh-0.013
Token.
Token ID13
Feature activation-1.02e+00

Pos logit contributions
elling+0.041
mith+0.038
 Acc+0.038
aughter+0.037
ener+0.037
Neg logit contributions
One-0.016
 read-0.014
 nap-0.013
On-0.012
 hidden-0.012
Token
Token ID198
Feature activation+8.85e+00

Pos logit contributions
elling+0.023
ener+0.023
mith+0.021
aughter+0.020
ishly+0.020
Neg logit contributions
One-0.019
 nap-0.017
On-0.017
 read-0.016
 long-0.016
Token
Token ID198
Feature activation+6.96e+00

Pos logit contributions
 badly+0.199
 che+0.197
ggy+0.197
gh+0.194
 prow+0.190
Neg logit contributions
aughter-0.110
 Me-0.107
elling-0.103
mith-0.101
 Let-0.095
TokenThey
Token ID2990
Feature activation+2.00e+00

Pos logit contributions
 badly+0.115
ggy+0.114
gh+0.112
 prow+0.111
 che+0.111
Neg logit contributions
aughter-0.125
 Me-0.124
elling-0.122
mith-0.115
cy-0.115
Token see
Token ID766
Feature activation-1.19e+00

Pos logit contributions
One+0.025
ggy+0.025
gh+0.022
 nap+0.021
 read+0.021
Neg logit contributions
elling-0.051
aughter-0.048
ener-0.047
mith-0.047
ada-0.045
Token a
Token ID257
Feature activation+5.67e+00

Pos logit contributions
elling+0.033
aughter+0.031
ener+0.031
mith+0.031
ada+0.030
Neg logit contributions
One-0.012
ggy-0.012
gh-0.011
 badly-0.010
ggie-0.010
Token big
Token ID1263
Feature activation+1.42e+00

Pos logit contributions
One+0.069
gh+0.067
ggy+0.066
 prow+0.059
ggie+0.058
Neg logit contributions
elling-0.132
aughter-0.131
 Girl-0.129
 Me-0.128
ada-0.124
Token.
Token ID13
Feature activation-6.32e-01

Pos logit contributions
One+0.024
ggy+0.023
gh+0.021
 read+0.021
 nap+0.020
Neg logit contributions
elling-0.043
aughter-0.040
ener-0.040
mith-0.039
ada-0.038
Token He
Token ID679
Feature activation+2.95e+00

Pos logit contributions
elling+0.013
ener+0.013
aughter+0.012
mith+0.012
ishly+0.012
Neg logit contributions
One-0.012
ggy-0.011
On-0.010
 nap-0.010
 read-0.010
Token is
Token ID318
Feature activation+6.62e+00

Pos logit contributions
One+0.057
ggy+0.055
 read+0.052
 nap+0.051
gh+0.051
Neg logit contributions
elling-0.056
aughter-0.052
ener-0.051
mith-0.051
ada-0.049
Token happy
Token ID3772
Feature activation-3.71e-01

Pos logit contributions
One+0.084
ggy+0.083
gh+0.076
 badly+0.072
 nap+0.071
Neg logit contributions
elling-0.163
aughter-0.155
ener-0.155
mith-0.152
 Me-0.151
Token.
Token ID13
Feature activation-2.18e-01

Pos logit contributions
elling+0.008
aughter+0.007
ener+0.007
mith+0.007
ada+0.007
Neg logit contributions
One-0.006
ggy-0.006
gh-0.006
 badly-0.005
ggie-0.005
Token
Token ID198
Feature activation+8.83e+00

Pos logit contributions
elling+0.005
ener+0.005
aughter+0.005
mith+0.004
ishly+0.004
Neg logit contributions
One-0.004
ggy-0.003
On-0.003
 read-0.003
 nap-0.003
Token
Token ID198
Feature activation+6.54e+00

Pos logit contributions
ggy+0.192
 che+0.190
gh+0.190
 badly+0.189
 seek+0.186
Neg logit contributions
 Me-0.113
aughter-0.112
elling-0.110
mith-0.108
 Little-0.102
TokenOne
Token ID3198
Feature activation+5.23e-01

Pos logit contributions
ggy+0.039
 prow+0.037
 nap+0.035
 hunt+0.032
 hiding+0.031
Neg logit contributions
elling-0.191
 Me-0.184
mith-0.181
aughter-0.181
 Let-0.176
Token day
Token ID1110
Feature activation-2.01e+00

Pos logit contributions
One+0.009
ggy+0.009
gh+0.009
 nap+0.008
 read+0.008
Neg logit contributions
elling-0.010
aughter-0.010
ener-0.010
mith-0.010
ishly-0.009
Token,
Token ID11
Feature activation-9.10e-01

Pos logit contributions
elling+0.041
aughter+0.038
ener+0.037
mith+0.037
ishly+0.036
Neg logit contributions
One-0.037
ggy-0.036
gh-0.034
 badly-0.033
 nap-0.033
Token he
Token ID339
Feature activation+1.97e+00

Pos logit contributions
elling+0.016
ener+0.014
aughter+0.014
ishly+0.014
mith+0.014
Neg logit contributions
One-0.020
ggy-0.019
 badly-0.018
On-0.018
 read-0.018
Token day
Token ID1110
Feature activation-1.10e+00

Pos logit contributions
elling+0.001
aughter+0.001
ener+0.001
mith+0.001
 Me+0.001
Neg logit contributions
One-0.001
ggy-0.001
gh-0.001
 badly-0.001
ggie-0.001
Token,
Token ID11
Feature activation-1.10e+00

Pos logit contributions
elling+0.022
ener+0.021
aughter+0.021
mith+0.021
ishly+0.019
Neg logit contributions
ggy-0.020
One-0.020
gh-0.019
 nap-0.018
 badly-0.018
Token Lily
Token ID20037
Feature activation-2.41e-01

Pos logit contributions
elling+0.024
ener+0.021
aughter+0.021
mith+0.020
 Acc+0.020
Neg logit contributions
 long-0.021
One-0.021
ggy-0.021
 nap-0.020
 not-0.020
Token found
Token ID1043
Feature activation+8.40e-01

Pos logit contributions
elling+0.005
aughter+0.004
ener+0.004
mith+0.004
ada+0.004
Neg logit contributions
One-0.004
ggy-0.004
gh-0.004
 badly-0.004
ggie-0.004
Token a
Token ID257
Feature activation+5.47e+00

Pos logit contributions
One+0.008
ggy+0.008
gh+0.007
 badly+0.006
 nap+0.006
Neg logit contributions
elling-0.024
aughter-0.023
ener-0.023
mith-0.022
ada-0.022
Token seas
Token ID21547
Feature activation+8.79e+00

Pos logit contributions
One+0.077
ggy+0.068
gh+0.061
vel+0.061
aches+0.060
Neg logit contributions
aughter-0.124
 Girl-0.119
elling-0.119
ener-0.118
mith-0.115
Tokenhell
Token ID12758
Feature activation+3.70e+00

Pos logit contributions
One+0.133
 villains+0.133
 seek+0.129
 prow+0.127
P+0.122
Neg logit contributions
elling-0.178
aughter-0.174
ada-0.163
ener-0.160
mith-0.156
Token and
Token ID290
Feature activation-1.72e+00

Pos logit contributions
ggy+0.079
 seek+0.077
One+0.077
gh+0.077
 villains+0.076
Neg logit contributions
elling-0.063
ted-0.058
als-0.057
aughter-0.056
ting-0.056
Token showed
Token ID3751
Feature activation-3.36e+00

Pos logit contributions
elling+0.033
aughter+0.031
ener+0.031
mith+0.030
ada+0.029
Neg logit contributions
One-0.032
ggy-0.032
gh-0.030
 badly-0.029
 nap-0.029
Token it
Token ID340
Feature activation+1.94e+00

Pos logit contributions
elling+0.066
aughter+0.062
ener+0.062
mith+0.062
 Acc+0.060
Neg logit contributions
ggy-0.061
One-0.060
 read-0.055
gh-0.055
 nap-0.055
Token to
Token ID284
Feature activation-1.86e+00

Pos logit contributions
One+0.035
ggy+0.035
gh+0.033
 badly+0.032
ggie+0.032
Neg logit contributions
elling-0.038
aughter-0.035
ener-0.035
mith-0.034
ada-0.033
Token is
Token ID318
Feature activation+6.46e+00

Pos logit contributions
ggy+0.056
One+0.055
gh+0.054
 badly+0.053
ggie+0.051
Neg logit contributions
elling-0.042
ener-0.039
aughter-0.038
 Me-0.037
mith-0.037
Token a
Token ID257
Feature activation+6.27e+00

Pos logit contributions
One+0.092
ggy+0.090
gh+0.089
 nap+0.081
 badly+0.079
Neg logit contributions
elling-0.143
ener-0.136
 Me-0.136
aughter-0.136
 Acc-0.134
Token smart
Token ID4451
Feature activation-8.82e-02

Pos logit contributions
One+0.080
ggy+0.078
gh+0.078
aches+0.073
 badly+0.070
Neg logit contributions
 Me-0.140
aughter-0.140
 Girl-0.139
elling-0.139
 Acc-0.138
Token person
Token ID1048
Feature activation+8.93e-01

Pos logit contributions
elling+0.002
aughter+0.002
mith+0.002
ener+0.002
ishly+0.002
Neg logit contributions
ggy-0.002
One-0.002
 nap-0.002
 read-0.002
 che-0.002
Token.
Token ID13
Feature activation-4.30e-01

Pos logit contributions
One+0.012
ggy+0.012
gh+0.011
 badly+0.011
 read+0.011
Neg logit contributions
elling-0.022
aughter-0.021
ener-0.020
mith-0.020
ada-0.020
Token
Token ID198
Feature activation+8.78e+00

Pos logit contributions
aughter+0.010
ener+0.010
elling+0.010
ishly+0.009
mith+0.009
Neg logit contributions
One-0.007
ggy-0.006
On-0.006
 che-0.006
 much-0.006
Token
Token ID198
Feature activation+6.44e+00

Pos logit contributions
ggy+0.194
gh+0.194
 seek+0.189
 che+0.189
 badly+0.189
Neg logit contributions
elling-0.106
 Me-0.105
mith-0.100
aughter-0.099
 Little-0.097
TokenOne
Token ID3198
Feature activation+5.56e-01

Pos logit contributions
ggy+0.032
 nap+0.024
ggie+0.024
gh+0.023
 villains+0.018
Neg logit contributions
elling-0.209
mith-0.196
 Acc-0.192
 Me-0.191
ada-0.190
Token day
Token ID1110
Feature activation-2.09e+00

Pos logit contributions
One+0.010
ggy+0.010
gh+0.009
 badly+0.009
 nap+0.009
Neg logit contributions
elling-0.011
aughter-0.010
ener-0.010
mith-0.010
ada-0.010
Token,
Token ID11
Feature activation-1.03e+00

Pos logit contributions
elling+0.043
aughter+0.041
ener+0.040
mith+0.039
ishly+0.038
Neg logit contributions
One-0.038
ggy-0.037
gh-0.034
 badly-0.034
 che-0.034
Token Tim
Token ID5045
Feature activation-2.45e+00

Pos logit contributions
elling+0.019
aughter+0.017
ener+0.017
ishly+0.016
 Acc+0.015
Neg logit contributions
One-0.022
 badly-0.021
 long-0.021
 not-0.021
ggy-0.021
Token little
Token ID1310
Feature activation-3.40e+00

Pos logit contributions
elling+0.002
aughter+0.002
ener+0.002
mith+0.002
ada+0.002
Neg logit contributions
One-0.002
ggy-0.002
gh-0.002
 badly-0.002
ggie-0.002
Token bird
Token ID6512
Feature activation+3.29e+00

Pos logit contributions
elling+0.074
mith+0.071
aughter+0.069
 Acc+0.066
ener+0.065
Neg logit contributions
One-0.056
 nap-0.055
ggy-0.053
 hidden-0.053
 seek-0.053
Token named
Token ID3706
Feature activation-1.76e+00

Pos logit contributions
One+0.052
ggy+0.052
gh+0.050
ggie+0.048
 badly+0.047
Neg logit contributions
elling-0.069
ener-0.065
aughter-0.064
mith-0.063
ada-0.062
Token Billy
Token ID15890
Feature activation+2.89e-01

Pos logit contributions
elling+0.027
ener+0.025
mith+0.024
aughter+0.024
 Acc+0.021
Neg logit contributions
One-0.041
ggy-0.041
 nap-0.039
 read-0.039
gh-0.038
Token.
Token ID13
Feature activation-6.07e-01

Pos logit contributions
One+0.004
ggy+0.004
 read+0.003
gh+0.003
 nap+0.003
Neg logit contributions
elling-0.007
aughter-0.007
mith-0.007
ener-0.007
ada-0.006
Token
Token ID198
Feature activation+8.76e+00

Pos logit contributions
ener+0.017
elling+0.017
isher+0.016
aughter+0.015
mith+0.015
Neg logit contributions
One-0.008
On-0.008
 read-0.007
 nap-0.007
 long-0.007
Token
Token ID198
Feature activation+6.77e+00

Pos logit contributions
 che+0.183
ggy+0.183
gh+0.182
 badly+0.179
 seek+0.176
Neg logit contributions
elling-0.121
aughter-0.120
mith-0.117
 Me-0.114
 Little-0.109
TokenAt
Token ID2953
Feature activation+2.12e+00

Pos logit contributions
gh+0.106
ggy+0.104
 che+0.100
 badly+0.097
 nap+0.097
Neg logit contributions
elling-0.132
mith-0.131
 Me-0.129
aughter-0.128
 Girl-0.124
Token the
Token ID262
Feature activation+3.05e-01

Pos logit contributions
ggy+0.047
One+0.046
gh+0.044
 badly+0.043
ggie+0.043
Neg logit contributions
elling-0.033
aughter-0.031
mith-0.030
ener-0.029
 Me-0.028
Token park
Token ID3952
Feature activation+1.16e+00

Pos logit contributions
ggy+0.005
gh+0.005
 badly+0.005
vel+0.004
 villains+0.004
Neg logit contributions
elling-0.007
aughter-0.006
 Me-0.006
ada-0.006
mith-0.006
Token,
Token ID11
Feature activation-1.56e+00

Pos logit contributions
 not+0.014
ggy+0.014
One+0.014
 hidden+0.013
 hiding+0.013
Neg logit contributions
elling-0.033
ener-0.032
mith-0.031
aughter-0.031
ada-0.030
Token and
Token ID290
Feature activation-2.50e+00

Pos logit contributions
One+0.005
ggy+0.005
 read+0.005
 hidden+0.004
 nap+0.004
Neg logit contributions
elling-0.007
mith-0.007
ener-0.006
aughter-0.006
 Acc-0.006
Token many
Token ID867
Feature activation-8.27e-01

Pos logit contributions
elling+0.044
mith+0.042
ener+0.039
aughter+0.037
 Acc+0.036
Neg logit contributions
 nap-0.051
One-0.051
 read-0.051
ggy-0.050
 che-0.049
Token other
Token ID584
Feature activation-4.53e+00

Pos logit contributions
elling+0.019
aughter+0.018
ener+0.018
mith+0.018
ishly+0.017
Neg logit contributions
One-0.012
ggy-0.012
gh-0.011
 nap-0.011
 read-0.011
Token friends
Token ID2460
Feature activation+7.32e-01

Pos logit contributions
elling+0.097
mith+0.092
aughter+0.090
ener+0.089
ishly+0.088
Neg logit contributions
One-0.075
ggy-0.073
 nap-0.069
 read-0.068
 hidden-0.067
Token.
Token ID13
Feature activation-7.74e-01

Pos logit contributions
One+0.010
ggy+0.009
 read+0.009
 nap+0.008
On+0.008
Neg logit contributions
elling-0.019
aughter-0.017
mith-0.017
ener-0.017
 Acc-0.017
Token
Token ID198
Feature activation+8.76e+00

Pos logit contributions
elling+0.017
ener+0.017
mith+0.016
ishly+0.015
aughter+0.015
Neg logit contributions
One-0.014
 nap-0.013
On-0.013
 read-0.012
ggy-0.011
Token
Token ID198
Feature activation+6.57e+00

Pos logit contributions
ggy+0.181
 badly+0.180
gh+0.176
 che+0.174
 prow+0.173
Neg logit contributions
aughter-0.130
elling-0.130
 Me-0.125
mith-0.121
ada-0.114
TokenAnna
Token ID31160
Feature activation+2.55e-01

Pos logit contributions
ggy+0.103
 badly+0.101
 prow+0.099
gh+0.096
 che+0.093
Neg logit contributions
aughter-0.130
elling-0.129
 Me-0.128
mith-0.120
ada-0.119
Token's
Token ID338
Feature activation+6.60e-01

Pos logit contributions
 read+0.005
One+0.004
 nap+0.004
 che+0.004
 hidden+0.004
Neg logit contributions
elling-0.005
mith-0.005
aughter-0.005
ener-0.005
ada-0.005
Token mom
Token ID1995
Feature activation+4.05e-02

Pos logit contributions
ggy+0.012
 nap+0.011
One+0.011
 read+0.011
 hidden+0.011
Neg logit contributions
elling-0.014
mith-0.013
ener-0.012
aughter-0.012
ada-0.012
Token makes
Token ID1838
Feature activation+6.82e-01

Pos logit contributions
 read+0.001
One+0.001
ggy+0.001
 hidden+0.001
 nap+0.001
Neg logit contributions
elling-0.001
mith-0.001
ener-0.001
aughter-0.001
 Acc-0.001
Token girl
Token ID2576
Feature activation+4.60e-01

Pos logit contributions
elling+0.111
 Acc+0.096
aughter+0.095
mith+0.093
ishly+0.091
Neg logit contributions
One-0.180
 nap-0.178
 hidden-0.173
 long-0.170
 read-0.170
Token was
Token ID373
Feature activation+6.00e+00

Pos logit contributions
ggy+0.007
One+0.007
 read+0.006
 che+0.006
gh+0.006
Neg logit contributions
elling-0.011
aughter-0.010
mith-0.010
ener-0.010
ada-0.010
Token very
Token ID845
Feature activation+4.33e+00

Pos logit contributions
One+0.095
ggy+0.090
gh+0.088
P+0.080
 nap+0.080
Neg logit contributions
elling-0.128
 Me-0.122
aughter-0.121
ener-0.120
mith-0.119
Token excited
Token ID6568
Feature activation+1.94e-02

Pos logit contributions
One+0.063
ggy+0.061
gh+0.058
ggie+0.054
 nap+0.053
Neg logit contributions
elling-0.101
aughter-0.096
ener-0.094
mith-0.093
 Me-0.092
Token.
Token ID13
Feature activation-4.89e-02

Pos logit contributions
One+0.000
ggy+0.000
gh+0.000
 badly+0.000
 nap+0.000
Neg logit contributions
elling-0.000
aughter-0.000
ener-0.000
mith-0.000
ada-0.000
Token
Token ID198
Feature activation+8.75e+00

Pos logit contributions
ener+0.001
elling+0.001
mith+0.001
ishly+0.001
isher+0.001
Neg logit contributions
One-0.001
On-0.001
 read-0.001
 long-0.001
 nap-0.001
Token
Token ID198
Feature activation+6.84e+00

Pos logit contributions
 che+0.183
gh+0.182
ggy+0.181
 seek+0.180
 badly+0.178
Neg logit contributions
elling-0.122
aughter-0.121
mith-0.116
 Me-0.114
 Little-0.110
TokenThey
Token ID2990
Feature activation+1.95e+00

Pos logit contributions
gh+0.100
 badly+0.099
 che+0.096
 nut+0.095
 nap+0.092
Neg logit contributions
 Me-0.133
 Little-0.130
mith-0.128
elling-0.126
aughter-0.125
Token got
Token ID1392
Feature activation+4.79e+00

Pos logit contributions
ggy+0.037
gh+0.035
One+0.034
 badly+0.033
ggie+0.033
Neg logit contributions
elling-0.035
mith-0.035
ener-0.034
aughter-0.034
ted-0.034
Token out
Token ID503
Feature activation+1.62e-01

Pos logit contributions
One+0.073
ggy+0.072
gh+0.067
ggie+0.064
 badly+0.063
Neg logit contributions
elling-0.108
aughter-0.102
ener-0.101
mith-0.100
ada-0.097
Token of
Token ID286
Feature activation-1.06e+00

Pos logit contributions
One+0.003
ggy+0.003
gh+0.003
 badly+0.003
 nap+0.003
Neg logit contributions
elling-0.003
aughter-0.003
ener-0.003
mith-0.003
ada-0.003
Token the
Token ID262
Feature activation+8.66e-01

Pos logit contributions
One+0.001
ggy+0.001
gh+0.001
 read+0.001
 nap+0.001
Neg logit contributions
elling-0.001
aughter-0.001
ener-0.001
mith-0.001
ada-0.001
Token parade
Token ID16134
Feature activation+1.43e+00

Pos logit contributions
One+0.015
ggy+0.015
gh+0.014
 badly+0.014
 che+0.013
Neg logit contributions
elling-0.018
aughter-0.017
mith-0.016
 Me-0.016
ener-0.016
Token a
Token ID257
Feature activation+5.67e+00

Pos logit contributions
ggy+0.019
One+0.019
 read+0.018
gh+0.017
ggie+0.017
Neg logit contributions
elling-0.036
aughter-0.034
ener-0.034
mith-0.033
 Acc-0.033
Token lot
Token ID1256
Feature activation+1.22e+00

Pos logit contributions
One+0.076
gh+0.075
aches+0.075
vel+0.072
 prow+0.071
Neg logit contributions
 Me-0.116
aughter-0.113
 Girl-0.111
mith-0.110
ada-0.107
Token.
Token ID13
Feature activation-7.18e-01

Pos logit contributions
ggy+0.013
One+0.013
 badly+0.011
 hidden+0.011
 long+0.011
Neg logit contributions
elling-0.033
ener-0.033
 Acc-0.032
ishly-0.031
mith-0.031
Token
Token ID198
Feature activation+8.72e+00

Pos logit contributions
ener+0.019
elling+0.019
aughter+0.018
ishly+0.018
mith+0.018
Neg logit contributions
One-0.010
On-0.009
ggy-0.008
 read-0.008
 nap-0.008
Token
Token ID198
Feature activation+6.84e+00

Pos logit contributions
 che+0.187
ggy+0.187
gh+0.185
 badly+0.182
 seek+0.182
Neg logit contributions
aughter-0.113
elling-0.110
mith-0.108
 Me-0.108
 Little-0.101
TokenTim
Token ID14967
Feature activation+1.43e-01

Pos logit contributions
vel+0.108
 badly+0.104
 lair+0.103
gh+0.101
 nap+0.100
Neg logit contributions
 Me-0.124
 Let-0.122
 Little-0.117
 Suz-0.116
elling-0.116
Token saw
Token ID2497
Feature activation-6.74e-01

Pos logit contributions
One+0.002
ggy+0.002
 read+0.002
 che+0.002
gh+0.002
Neg logit contributions
elling-0.004
aughter-0.003
ener-0.003
mith-0.003
ada-0.003
Token a
Token ID257
Feature activation+6.12e+00

Pos logit contributions
elling+0.019
aughter+0.018
ener+0.018
mith+0.017
ada+0.017
Neg logit contributions
One-0.007
ggy-0.007
gh-0.006
 badly-0.006
 nap-0.006
Token man
Token ID582
Feature activation+2.60e+00

Pos logit contributions
One+0.085
ggy+0.081
gh+0.079
 badly+0.072
ggie+0.071
Neg logit contributions
elling-0.140
aughter-0.138
mith-0.133
ener-0.132
 Girl-0.132
Token They
Token ID1119
Feature activation+1.70e+00

Pos logit contributions
ener+0.013
elling+0.013
mith+0.012
aughter+0.012
ishly+0.012
Neg logit contributions
One-0.015
On-0.014
 nap-0.014
ggy-0.014
 read-0.013
Token are
Token ID389
Feature activation+5.98e+00

Pos logit contributions
One+0.024
ggy+0.023
gh+0.021
 badly+0.021
ggie+0.020
Neg logit contributions
elling-0.041
aughter-0.038
ener-0.038
mith-0.038
ada-0.037
Token very
Token ID845
Feature activation+4.69e+00

Pos logit contributions
One+0.074
ggy+0.073
gh+0.066
ggie+0.062
 badly+0.062
Neg logit contributions
elling-0.153
aughter-0.145
ener-0.143
mith-0.142
 Me-0.139
Token happy
Token ID3772
Feature activation-1.09e+00

Pos logit contributions
One+0.063
ggy+0.062
gh+0.057
ggie+0.054
 badly+0.053
Neg logit contributions
elling-0.114
aughter-0.110
ener-0.106
mith-0.105
 Me-0.105
Token.
Token ID13
Feature activation-8.00e-02

Pos logit contributions
elling+0.024
aughter+0.022
ener+0.022
mith+0.022
 Me+0.021
Neg logit contributions
One-0.018
ggy-0.017
gh-0.016
 badly-0.016
ggie-0.016
Token
Token ID198
Feature activation+8.71e+00

Pos logit contributions
ener+0.002
elling+0.002
ishly+0.002
aughter+0.002
mith+0.002
Neg logit contributions
One-0.001
On-0.001
 nap-0.001
 read-0.001
ggy-0.001
Token
Token ID198
Feature activation+6.75e+00

Pos logit contributions
ggy+0.183
 che+0.179
gh+0.178
 badly+0.176
 prow+0.175
Neg logit contributions
elling-0.125
aughter-0.122
 Me-0.120
mith-0.117
ada-0.109
TokenMom
Token ID29252
Feature activation-5.06e-01

Pos logit contributions
ggy+0.101
 prow+0.101
gh+0.096
 nap+0.096
 che+0.096
Neg logit contributions
elling-0.138
aughter-0.133
 Me-0.128
ada-0.128
mith-0.128
Token and
Token ID290
Feature activation-3.29e+00

Pos logit contributions
elling+0.007
mith+0.007
ener+0.007
aughter+0.007
ishly+0.007
Neg logit contributions
One-0.011
 read-0.011
 che-0.011
ggy-0.011
 nap-0.011
Token Dad
Token ID17415
Feature activation+1.14e+00

Pos logit contributions
elling+0.052
mith+0.045
ener+0.043
aughter+0.042
ishly+0.041
Neg logit contributions
One-0.079
 read-0.078
 nap-0.076
On-0.075
 long-0.074
Token have
Token ID423
Feature activation+3.95e+00

Pos logit contributions
One+0.018
ggy+0.018
gh+0.017
 badly+0.016
ggie+0.016
Neg logit contributions
elling-0.025
aughter-0.023
ener-0.023
mith-0.023
ada-0.022
Token celebrate
Token ID10648
Feature activation-1.51e+00

Pos logit contributions
ggy+0.066
 badly+0.066
 prow+0.064
gh+0.063
P+0.063
Neg logit contributions
 Me-0.065
elling-0.063
 Daddy-0.063
 Daughter-0.062
aughter-0.060
Token with
Token ID351
Feature activation-2.83e+00

Pos logit contributions
elling+0.031
aughter+0.030
ener+0.029
mith+0.029
 Me+0.028
Neg logit contributions
One-0.025
ggy-0.025
gh-0.023
 badly-0.023
 prow-0.022
Token their
Token ID511
Feature activation-1.07e+00

Pos logit contributions
elling+0.043
aughter+0.040
ener+0.039
mith+0.039
ada+0.037
Neg logit contributions
One-0.064
ggy-0.064
gh-0.060
 nap-0.059
 badly-0.059
Token friends
Token ID2460
Feature activation+2.78e-01

Pos logit contributions
elling+0.025
ener+0.023
mith+0.023
aughter+0.023
ada+0.022
Neg logit contributions
One-0.016
ggy-0.016
 nap-0.015
 read-0.015
 hidden-0.014
Token.
Token ID13
Feature activation-8.10e-02

Pos logit contributions
One+0.004
ggy+0.004
gh+0.004
 badly+0.003
 read+0.003
Neg logit contributions
elling-0.007
aughter-0.006
ener-0.006
mith-0.006
ada-0.006
Token
Token ID198
Feature activation+8.71e+00

Pos logit contributions
elling+0.002
ener+0.002
aughter+0.002
mith+0.002
ishly+0.002
Neg logit contributions
One-0.001
On-0.001
 nap-0.001
ggy-0.001
 read-0.001
Token
Token ID198
Feature activation+6.56e+00

Pos logit contributions
 badly+0.209
 che+0.208
 prow+0.202
gh+0.200
aches+0.199
Neg logit contributions
 Me-0.096
aughter-0.091
mith-0.082
elling-0.082
 Little-0.080
TokenThey
Token ID2990
Feature activation+1.89e+00

Pos logit contributions
ggy+0.097
 prow+0.096
gh+0.095
 badly+0.093
 che+0.092
Neg logit contributions
aughter-0.131
elling-0.130
 Me-0.129
mith-0.125
 Girl-0.121
Token help
Token ID1037
Feature activation-2.48e+00

Pos logit contributions
ggy+0.030
One+0.028
gh+0.028
 badly+0.027
ggie+0.026
Neg logit contributions
elling-0.041
aughter-0.041
ener-0.040
mith-0.039
ada-0.038
Token their
Token ID511
Feature activation-1.33e+00

Pos logit contributions
elling+0.042
ener+0.038
mith+0.038
aughter+0.036
ishly+0.036
Neg logit contributions
ggy-0.054
One-0.054
 read-0.052
 nap-0.051
ggie-0.050
Token mom
Token ID1995
Feature activation-7.10e-01

Pos logit contributions
elling+0.023
aughter+0.021
ener+0.021
mith+0.021
ada+0.020
Neg logit contributions
One-0.028
ggy-0.028
gh-0.026
 nap-0.025
 badly-0.025
Token to
Token ID284
Feature activation-4.98e-01

Pos logit contributions
One+0.018
ggy+0.017
 read+0.017
 nap+0.016
On+0.016
Neg logit contributions
elling-0.024
aughter-0.023
ener-0.022
mith-0.022
ada-0.022
Token show
Token ID905
Feature activation-3.06e+00

Pos logit contributions
elling+0.010
aughter+0.010
 Me+0.009
ener+0.009
mith+0.009
Neg logit contributions
ggy-0.008
One-0.008
gh-0.008
 badly-0.008
ggie-0.008
Token his
Token ID465
Feature activation-1.02e+00

Pos logit contributions
elling+0.072
aughter+0.068
ener+0.067
mith+0.066
ada+0.064
Neg logit contributions
One-0.045
ggy-0.044
gh-0.040
 badly-0.039
On-0.038
Token friends
Token ID2460
Feature activation+1.25e+00

Pos logit contributions
elling+0.018
aughter+0.017
ener+0.016
mith+0.016
 Me+0.016
Neg logit contributions
One-0.021
ggy-0.020
gh-0.019
 badly-0.019
On-0.019
Token.
Token ID13
Feature activation-2.86e-01

Pos logit contributions
One+0.020
ggy+0.020
gh+0.018
 badly+0.018
 read+0.018
Neg logit contributions
elling-0.028
aughter-0.026
ener-0.026
mith-0.026
ada-0.024
Token
Token ID198
Feature activation+8.68e+00

Pos logit contributions
elling+0.007
ener+0.007
aughter+0.006
mith+0.006
ada+0.006
Neg logit contributions
One-0.005
ggy-0.004
On-0.004
 read-0.004
 nap-0.004
Token
Token ID198
Feature activation+6.71e+00

Pos logit contributions
 che+0.187
gh+0.185
ggy+0.184
 badly+0.183
 seek+0.183
Neg logit contributions
aughter-0.113
elling-0.112
 Me-0.106
mith-0.106
 Little-0.100
TokenTom
Token ID13787
Feature activation-9.33e-02

Pos logit contributions
 badly+0.113
vel+0.111
 lair+0.107
 treasure+0.103
aches+0.101
Neg logit contributions
 Me-0.115
 Now-0.113
 Let-0.111
ted-0.110
 The-0.106
Token met
Token ID1138
Feature activation-1.02e-01

Pos logit contributions
elling+0.002
aughter+0.002
mith+0.002
ener+0.002
ada+0.002
Neg logit contributions
One-0.001
ggy-0.001
 read-0.001
 che-0.001
 nap-0.001
Token his
Token ID465
Feature activation-3.55e-01

Pos logit contributions
elling+0.003
aughter+0.003
ener+0.003
mith+0.003
 Me+0.002
Neg logit contributions
One-0.001
ggy-0.001
gh-0.001
 badly-0.001
 nap-0.001
Token friend
Token ID1545
Feature activation-8.23e-01

Pos logit contributions
elling+0.008
aughter+0.008
ener+0.008
ada+0.007
 Me+0.007
Neg logit contributions
One-0.005
ggy-0.005
gh-0.005
 badly-0.005
On-0.005
Token They
Token ID1119
Feature activation+1.95e+00

Pos logit contributions
elling+0.011
ener+0.010
aughter+0.010
mith+0.009
ada+0.009
Neg logit contributions
One-0.015
ggy-0.015
 nap-0.014
On-0.014
 read-0.014
Token are
Token ID389
Feature activation+6.12e+00

Pos logit contributions
ggy+0.027
One+0.026
gh+0.024
 badly+0.023
ggie+0.023
Neg logit contributions
elling-0.047
aughter-0.045
ener-0.044
mith-0.044
ada-0.043
Token very
Token ID845
Feature activation+4.93e+00

Pos logit contributions
One+0.085
ggy+0.082
gh+0.080
 badly+0.073
ggie+0.071
Neg logit contributions
elling-0.143
aughter-0.137
 Me-0.135
ener-0.134
mith-0.133
Token excited
Token ID6568
Feature activation-7.39e-01

Pos logit contributions
One+0.069
ggy+0.067
gh+0.064
ggie+0.059
 badly+0.059
Neg logit contributions
elling-0.117
aughter-0.113
ener-0.108
mith-0.107
 Me-0.107
Token.
Token ID13
Feature activation-1.35e-01

Pos logit contributions
elling+0.017
aughter+0.016
ener+0.015
mith+0.015
ada+0.015
Neg logit contributions
One-0.012
ggy-0.011
gh-0.011
 badly-0.010
 nap-0.010
Token
Token ID198
Feature activation+8.65e+00

Pos logit contributions
elling+0.003
ener+0.003
aughter+0.003
mith+0.003
ada+0.003
Neg logit contributions
One-0.002
On-0.002
ggy-0.002
 nap-0.002
 read-0.002
Token
Token ID198
Feature activation+6.63e+00

Pos logit contributions
 che+0.187
ggy+0.187
gh+0.186
 badly+0.184
 seek+0.182
Neg logit contributions
aughter-0.111
 Me-0.110
elling-0.109
mith-0.107
 Little-0.100
TokenThey
Token ID2990
Feature activation+1.86e+00

Pos logit contributions
 che+0.097
 badly+0.096
 prow+0.096
gh+0.095
 lair+0.095
Neg logit contributions
 Me-0.130
aughter-0.128
elling-0.125
mith-0.124
 Our-0.120
Token go
Token ID467
Feature activation+2.54e+00

Pos logit contributions
ggy+0.026
One+0.025
gh+0.024
 badly+0.023
ggie+0.023
Neg logit contributions
elling-0.043
aughter-0.042
ener-0.041
mith-0.041
ada-0.040
Token to
Token ID284
Feature activation-5.56e-02

Pos logit contributions
One+0.044
ggy+0.043
gh+0.040
 badly+0.039
 nap+0.039
Neg logit contributions
elling-0.052
aughter-0.049
ener-0.048
mith-0.048
ada-0.046
Token the
Token ID262
Feature activation-5.87e-01

Pos logit contributions
elling+0.001
aughter+0.001
 Me+0.001
mith+0.001
ener+0.001
Neg logit contributions
ggy-0.001
gh-0.001
One-0.001
 badly-0.001
ggie-0.001
Token was
Token ID373
Feature activation+5.98e+00

Pos logit contributions
One+0.033
ggy+0.032
gh+0.032
 badly+0.031
 prow+0.029
Neg logit contributions
elling-0.041
ener-0.040
 Me-0.040
aughter-0.040
 Girl-0.039
Token a
Token ID257
Feature activation+5.96e+00

Pos logit contributions
One+0.087
ggy+0.084
gh+0.084
 nap+0.078
 hunt+0.077
Neg logit contributions
elling-0.132
aughter-0.126
 Me-0.125
 Acc-0.122
ener-0.122
Token good
Token ID922
Feature activation+8.30e-01

Pos logit contributions
One+0.072
ggy+0.071
gh+0.068
 badly+0.062
ggie+0.061
Neg logit contributions
elling-0.145
aughter-0.142
ener-0.140
 Me-0.138
 Girl-0.138
Token reader
Token ID9173
Feature activation+1.77e-01

Pos logit contributions
ggy+0.018
One+0.017
 read+0.016
 nap+0.016
gh+0.016
Neg logit contributions
elling-0.014
mith-0.013
aughter-0.013
ener-0.012
ishly-0.012
Token.
Token ID13
Feature activation-6.04e-01

Pos logit contributions
One+0.003
ggy+0.003
gh+0.002
 badly+0.002
ggie+0.002
Neg logit contributions
elling-0.004
aughter-0.004
ener-0.004
mith-0.004
ada-0.004
Token
Token ID198
Feature activation+8.65e+00

Pos logit contributions
elling+0.013
ener+0.012
aughter+0.012
mith+0.012
ishly+0.012
Neg logit contributions
One-0.011
ggy-0.010
 nap-0.009
On-0.009
 read-0.009
Token
Token ID198
Feature activation+6.50e+00

Pos logit contributions
 badly+0.196
 che+0.193
 seek+0.192
gh+0.191
ggy+0.190
Neg logit contributions
 Me-0.101
aughter-0.099
elling-0.095
mith-0.093
 Little-0.090
TokenOne
Token ID3198
Feature activation+5.33e-01

Pos logit contributions
ggy+0.052
 nut+0.048
 badly+0.047
 prow+0.045
gh+0.044
Neg logit contributions
 Me-0.173
elling-0.171
aughter-0.169
mith-0.162
ion-0.161
Token day
Token ID1110
Feature activation-2.15e+00

Pos logit contributions
One+0.010
ggy+0.010
gh+0.009
 badly+0.009
 nap+0.009
Neg logit contributions
elling-0.010
aughter-0.010
ener-0.009
mith-0.009
ada-0.009
Token,
Token ID11
Feature activation-1.09e+00

Pos logit contributions
elling+0.044
mith+0.040
aughter+0.040
ener+0.040
ada+0.038
Neg logit contributions
One-0.039
ggy-0.039
 nap-0.037
gh-0.036
 read-0.035
Token she
Token ID673
Feature activation+1.62e+00

Pos logit contributions
elling+0.020
ener+0.018
aughter+0.018
mith+0.018
ada+0.017
Neg logit contributions
One-0.022
ggy-0.022
 nap-0.021
 read-0.021
On-0.020
Token ways
Token ID2842
Feature activation+7.84e-01

Pos logit contributions
elling+0.082
aughter+0.078
mith+0.078
ener+0.075
ishly+0.075
Neg logit contributions
One-0.070
ggy-0.069
 nap-0.066
 hidden-0.064
 read-0.064
Token to
Token ID284
Feature activation+2.52e+00

Pos logit contributions
One+0.012
ggy+0.012
 badly+0.011
gh+0.011
 nap+0.010
Neg logit contributions
elling-0.018
aughter-0.017
ener-0.017
mith-0.017
ada-0.016
Token write
Token ID3551
Feature activation+5.21e-01

Pos logit contributions
One+0.037
 badly+0.035
gh+0.035
ggy+0.034
On+0.034
Neg logit contributions
elling-0.049
 Me-0.048
mith-0.048
aughter-0.047
 Acc-0.046
Token it
Token ID340
Feature activation+9.42e-01

Pos logit contributions
ggy+0.007
One+0.007
gh+0.006
ggie+0.006
 read+0.006
Neg logit contributions
elling-0.013
aughter-0.012
ener-0.012
mith-0.012
 Acc-0.012
Token.
Token ID13
Feature activation-3.74e-01

Pos logit contributions
One+0.014
ggy+0.014
gh+0.013
 read+0.012
 nap+0.012
Neg logit contributions
elling-0.022
aughter-0.020
ener-0.020
mith-0.020
ada-0.020
Token
Token ID198
Feature activation+8.63e+00

Pos logit contributions
ener+0.006
elling+0.006
aughter+0.005
mith+0.005
ishly+0.005
Neg logit contributions
One-0.009
ggy-0.008
 nap-0.008
 read-0.008
On-0.008
Token
Token ID198
Feature activation+7.03e+00

Pos logit contributions
 che+0.190
gh+0.190
ggy+0.190
aches+0.187
P+0.187
Neg logit contributions
 Me-0.103
aughter-0.099
elling-0.096
mith-0.095
 Little-0.091
TokenTom
Token ID13787
Feature activation-1.24e+00

Pos logit contributions
 lair+0.104
 badly+0.097
aches+0.096
 prow+0.095
 quest+0.093
Neg logit contributions
 Me-0.138
mith-0.124
aughter-0.122
cy-0.120
 Let-0.120
Token was
Token ID373
Feature activation+5.85e+00

Pos logit contributions
elling+0.023
aughter+0.022
mith+0.021
ener+0.021
ada+0.020
Neg logit contributions
One-0.024
ggy-0.023
gh-0.022
 read-0.022
 che-0.021
Token a
Token ID257
Feature activation+5.92e+00

Pos logit contributions
One+0.110
ggy+0.100
gh+0.098
On+0.094
 seek+0.093
Neg logit contributions
elling-0.108
ener-0.103
aughter-0.102
 Me-0.101
 Acc-0.097
Token farmer
Token ID18739
Feature activation+1.11e+00

Pos logit contributions
One+0.084
ggy+0.082
gh+0.078
aches+0.072
On+0.072
Neg logit contributions
elling-0.131
aughter-0.129
ener-0.129
 Girl-0.127
 Me-0.125
Token loves
Token ID10408
Feature activation-6.48e-01

Pos logit contributions
One+0.020
ggy+0.019
 che+0.018
 read+0.018
 nap+0.018
Neg logit contributions
elling-0.019
aughter-0.018
mith-0.018
ener-0.017
ada-0.017
Token Spike
Token ID26309
Feature activation+7.50e-01

Pos logit contributions
elling+0.014
aughter+0.013
ener+0.013
mith+0.013
ada+0.012
Neg logit contributions
One-0.011
ggy-0.011
gh-0.010
 nap-0.010
 che-0.010
Token very
Token ID845
Feature activation+2.59e+00

Pos logit contributions
One+0.013
ggy+0.013
gh+0.012
 seek+0.012
On+0.012
Neg logit contributions
elling-0.015
ener-0.014
aughter-0.014
mith-0.013
ishly-0.013
Token much
Token ID881
Feature activation-2.23e-01

Pos logit contributions
 seek+0.021
gh+0.021
ggy+0.020
 hunt+0.020
One+0.020
Neg logit contributions
elling-0.070
aughter-0.065
 Me-0.065
 Acc-0.063
mith-0.062
Token.
Token ID13
Feature activation-2.46e-01

Pos logit contributions
elling+0.005
aughter+0.005
ener+0.005
mith+0.004
 Me+0.004
Neg logit contributions
One-0.004
ggy-0.003
gh-0.003
ggie-0.003
 seek-0.003
Token
Token ID198
Feature activation+8.63e+00

Pos logit contributions
elling+0.006
ener+0.006
aughter+0.006
mith+0.005
ada+0.005
Neg logit contributions
One-0.004
ggy-0.004
 nap-0.004
On-0.003
 che-0.003
Token
Token ID198
Feature activation+6.35e+00

Pos logit contributions
ggy+0.189
 badly+0.184
gh+0.184
 che+0.179
 seek+0.179
Neg logit contributions
 Me-0.112
elling-0.112
aughter-0.111
mith-0.107
 Sure-0.098
TokenOne
Token ID3198
Feature activation+5.08e-02

Pos logit contributions
ggy+0.042
 badly+0.040
gh+0.033
 prow+0.032
aches+0.030
Neg logit contributions
 Me-0.180
elling-0.179
cy-0.172
aughter-0.172
mith-0.172
Token day
Token ID1110
Feature activation-2.62e+00

Pos logit contributions
One+0.001
ggy+0.001
gh+0.001
 badly+0.001
ggie+0.001
Neg logit contributions
elling-0.001
aughter-0.001
ener-0.001
mith-0.001
ada-0.001
Token,
Token ID11
Feature activation-1.50e+00

Pos logit contributions
elling+0.050
aughter+0.046
ener+0.045
mith+0.045
 Acc+0.043
Neg logit contributions
One-0.051
ggy-0.050
gh-0.047
 nap-0.046
 che-0.046
Token Ben
Token ID3932
Feature activation-1.76e-02

Pos logit contributions
elling+0.025
aughter+0.022
ener+0.022
mith+0.021
 Acc+0.021
Neg logit contributions
One-0.034
ggy-0.033
 nap-0.032
 che-0.031
On-0.031
Token fun
Token ID1257
Feature activation+2.50e+00

Pos logit contributions
ggy+0.051
gh+0.048
One+0.047
 badly+0.045
ggie+0.044
Neg logit contributions
elling-0.109
 Me-0.104
aughter-0.102
ener-0.102
als-0.102
Token at
Token ID379
Feature activation-3.70e-01

Pos logit contributions
ggy+0.046
One+0.046
gh+0.044
ggie+0.043
 nut+0.043
Neg logit contributions
 Me-0.040
elling-0.039
ener-0.038
 Joe-0.037
mith-0.036
Token the
Token ID262
Feature activation-5.63e-01

Pos logit contributions
elling+0.005
aughter+0.005
mith+0.005
 Me+0.005
ener+0.005
Neg logit contributions
ggy-0.008
One-0.008
gh-0.008
 badly-0.008
ggie-0.008
Token gym
Token ID11550
Feature activation+1.94e+00

Pos logit contributions
elling+0.012
aughter+0.011
 Me+0.011
mith+0.011
ener+0.011
Neg logit contributions
ggy-0.009
One-0.009
gh-0.009
 badly-0.008
ggie-0.008
Token.
Token ID13
Feature activation-8.74e-01

Pos logit contributions
One+0.028
ggy+0.028
gh+0.025
 badly+0.025
 nap+0.024
Neg logit contributions
elling-0.045
aughter-0.043
ener-0.042
mith-0.042
ada-0.041
Token
Token ID198
Feature activation+8.61e+00

Pos logit contributions
ener+0.021
elling+0.020
ishly+0.019
mith+0.019
aughter+0.019
Neg logit contributions
One-0.016
On-0.014
 not-0.013
 read-0.012
ggy-0.012
Token
Token ID198
Feature activation+6.50e+00

Pos logit contributions
ggy+0.176
 che+0.174
gh+0.174
 badly+0.173
 seek+0.170
Neg logit contributions
elling-0.125
aughter-0.122
 Me-0.119
mith-0.119
ada-0.107
TokenOne
Token ID3198
Feature activation+1.82e-01

Pos logit contributions
ggy+0.039
 nap+0.037
 prow+0.036
 badly+0.035
 nut+0.034
Neg logit contributions
elling-0.187
 Me-0.184
aughter-0.181
mith-0.180
ada-0.174
Token day
Token ID1110
Feature activation-2.36e+00

Pos logit contributions
One+0.003
ggy+0.003
 nap+0.003
On+0.003
 read+0.003
Neg logit contributions
elling-0.004
ener-0.004
mith-0.004
ishly-0.004
aughter-0.004
Token,
Token ID11
Feature activation-1.09e+00

Pos logit contributions
elling+0.055
ener+0.051
mith+0.050
ishly+0.049
aughter+0.049
Neg logit contributions
ggy-0.039
One-0.038
 long-0.035
gh-0.035
 nap-0.034
Token Tom
Token ID4186
Feature activation-6.00e-01

Pos logit contributions
elling+0.022
ener+0.020
ishly+0.019
 Acc+0.019
mith+0.018
Neg logit contributions
One-0.022
ggy-0.022
 not-0.021
 long-0.021
On-0.021
Token amazed
Token ID24562
Feature activation-1.37e+00

Pos logit contributions
One+0.062
ggy+0.062
gh+0.056
 badly+0.054
ggie+0.053
Neg logit contributions
elling-0.116
aughter-0.109
ener-0.108
mith-0.107
ada-0.104
Token at
Token ID379
Feature activation-3.11e+00

Pos logit contributions
elling+0.032
aughter+0.032
mith+0.031
 Acc+0.030
ener+0.030
Neg logit contributions
One-0.019
 badly-0.018
 hidden-0.018
 long-0.017
On-0.017
Token how
Token ID703
Feature activation-2.56e-01

Pos logit contributions
elling+0.064
aughter+0.060
ener+0.060
mith+0.058
ada+0.057
Neg logit contributions
ggy-0.054
One-0.054
 badly-0.050
gh-0.050
ggie-0.049
Token much
Token ID881
Feature activation-6.07e+00

Pos logit contributions
elling+0.007
aughter+0.006
ener+0.006
mith+0.006
ada+0.006
Neg logit contributions
One-0.003
ggy-0.003
gh-0.003
 badly-0.002
ggie-0.002
Token the
Token ID262
Feature activation-4.51e+00

Pos logit contributions
elling+0.164
aughter+0.157
ener+0.155
mith+0.153
ada+0.150
Neg logit contributions
One-0.066
ggy-0.065
gh-0.057
 nap-0.055
 badly-0.055
Token little
Token ID1310
Feature activation-1.10e+01

Pos logit contributions
elling+0.105
aughter+0.098
ener+0.097
mith+0.096
ada+0.094
Neg logit contributions
One-0.067
ggy-0.066
 nap-0.060
gh-0.060
 badly-0.059
Token boy
Token ID2933
Feature activation-2.73e+00

Pos logit contributions
elling+0.190
aughter+0.172
 Acc+0.162
mith+0.160
ishly+0.158
Neg logit contributions
One-0.226
 nap-0.223
 che-0.223
 hidden-0.220
ggy-0.220
Token had
Token ID550
Feature activation+1.70e+00

Pos logit contributions
elling+0.067
aughter+0.064
mith+0.062
ener+0.061
ada+0.061
Neg logit contributions
ggy-0.037
One-0.036
 read-0.035
 che-0.034
 hidden-0.033
Token learned
Token ID4499
Feature activation-6.35e-02

Pos logit contributions
One+0.023
ggy+0.022
 badly+0.020
gh+0.020
 nap+0.019
Neg logit contributions
elling-0.040
aughter-0.038
ener-0.038
mith-0.037
 Me-0.037
Token in
Token ID287
Feature activation-5.39e-01

Pos logit contributions
elling+0.001
aughter+0.001
ener+0.001
mith+0.001
ada+0.001
Neg logit contributions
One-0.001
ggy-0.001
gh-0.001
 badly-0.001
ggie-0.001
Token the
Token ID262
Feature activation-2.10e+00

Pos logit contributions
elling+0.010
aughter+0.010
mith+0.010
ener+0.010
 Me+0.009
Neg logit contributions
One-0.010
ggy-0.010
gh-0.009
 badly-0.009
ggie-0.009
Token the
Token ID262
Feature activation-1.71e+00

Pos logit contributions
One+0.058
ggy+0.057
 not+0.053
 much+0.053
 badly+0.052
Neg logit contributions
elling-0.089
ener-0.086
aughter-0.084
mith-0.082
ada-0.079
Token story
Token ID1621
Feature activation+2.45e-02

Pos logit contributions
elling+0.043
aughter+0.040
mith+0.040
ener+0.039
ada+0.038
Neg logit contributions
One-0.024
ggy-0.022
 nap-0.021
 read-0.020
 hidden-0.020
Token of
Token ID286
Feature activation-4.14e+00

Pos logit contributions
ggy+0.000
 read+0.000
One+0.000
ggie+0.000
 nap+0.000
Neg logit contributions
elling-0.001
ener-0.001
aughter-0.001
 Acc-0.001
ada-0.001
Token how
Token ID703
Feature activation-4.79e-01

Pos logit contributions
elling+0.087
ener+0.081
aughter+0.080
mith+0.077
ada+0.075
Neg logit contributions
One-0.073
ggy-0.072
 badly-0.067
 read-0.066
 nap-0.066
Token the
Token ID262
Feature activation-4.97e+00

Pos logit contributions
elling+0.008
ener+0.008
mith+0.008
aughter+0.007
ishly+0.007
Neg logit contributions
ggy-0.010
One-0.010
 badly-0.009
 long-0.009
 che-0.009
Token little
Token ID1310
Feature activation-1.09e+01

Pos logit contributions
elling+0.114
mith+0.106
aughter+0.106
ener+0.104
ishly+0.103
Neg logit contributions
One-0.076
ggy-0.074
 nap-0.070
 read-0.069
 hidden-0.068
Token girl
Token ID2576
Feature activation-2.71e+00

Pos logit contributions
elling+0.149
mith+0.130
aughter+0.127
 Acc+0.126
ishly+0.123
Neg logit contributions
One-0.264
 nap-0.264
ggy-0.257
 read-0.256
 seek-0.256
Token made
Token ID925
Feature activation+6.86e-01

Pos logit contributions
elling+0.070
aughter+0.067
mith+0.066
ener+0.063
ada+0.062
Neg logit contributions
 read-0.034
ggy-0.034
 che-0.032
One-0.032
 hidden-0.029
Token her
Token ID607
Feature activation-1.28e+00

Pos logit contributions
ggy+0.011
One+0.010
gh+0.010
ggie+0.010
 che+0.010
Neg logit contributions
elling-0.016
ener-0.015
mith-0.015
aughter-0.014
ishly-0.014
Token own
Token ID898
Feature activation-1.66e+00

Pos logit contributions
elling+0.034
mith+0.031
ted+0.031
aughter+0.030
ener+0.030
Neg logit contributions
ggy-0.016
 read-0.016
 nap-0.016
 not-0.016
 long-0.016
Token green
Token ID4077
Feature activation-1.91e+00

Pos logit contributions
elling+0.041
mith+0.039
ted+0.039
ener+0.038
aughter+0.038
Neg logit contributions
 long-0.023
ggy-0.022
 read-0.022
One-0.022
 big-0.021
Token thanked
Token ID26280
Feature activation-5.03e+00

Pos logit contributions
elling+0.037
aughter+0.036
ener+0.035
mith+0.035
 Me+0.033
Neg logit contributions
ggy-0.022
One-0.021
 read-0.020
 che-0.020
 nap-0.020
Token the
Token ID262
Feature activation-3.96e+00

Pos logit contributions
elling+0.122
ener+0.112
aughter+0.109
 Acc+0.106
ays+0.102
Neg logit contributions
One-0.077
ggy-0.076
 read-0.075
 che-0.074
 long-0.071
Token elderly
Token ID13830
Feature activation-7.80e+00

Pos logit contributions
elling+0.088
aughter+0.084
mith+0.083
ener+0.082
ada+0.079
Neg logit contributions
One-0.063
ggy-0.061
 che-0.056
gh-0.055
 read-0.055
Token rh
Token ID9529
Feature activation-2.90e+00

Pos logit contributions
elling+0.182
aughter+0.172
 Acc+0.165
mith+0.160
ada+0.160
Neg logit contributions
ggy-0.123
One-0.117
 che-0.110
 read-0.106
 hidden-0.106
Tokenin
Token ID259
Feature activation-3.87e+00

Pos logit contributions
elling+0.064
aughter+0.060
ener+0.059
mith+0.059
ada+0.057
Neg logit contributions
One-0.046
ggy-0.045
gh-0.042
 badly-0.041
 nap-0.040
Tokenoc
Token ID420
Feature activation-1.09e+01

Pos logit contributions
elling+0.105
 Acc+0.102
ays+0.099
ada+0.098
aughter+0.097
Neg logit contributions
ggy-0.044
One-0.044
 read-0.042
 che-0.039
 hunt-0.037
Tokeneros
Token ID27498
Feature activation-3.08e+00

Pos logit contributions
 Linda+0.168
aughter+0.161
 Jos+0.160
 Little+0.160
elling+0.159
Neg logit contributions
ggy-0.237
ggie-0.219
gh-0.218
 badly-0.208
 che-0.207
Token for
Token ID329
Feature activation-2.97e+00

Pos logit contributions
elling+0.072
mith+0.066
 Acc+0.065
ada+0.064
aughter+0.064
Neg logit contributions
ggy-0.047
 read-0.045
 nap-0.044
 che-0.044
 hidden-0.043
Token his
Token ID465
Feature activation-1.74e+00

Pos logit contributions
elling+0.055
aughter+0.053
mith+0.052
ener+0.051
 Me+0.050
Neg logit contributions
One-0.055
ggy-0.054
gh-0.050
 badly-0.049
 nap-0.049
Token wonderful
Token ID7932
Feature activation-2.58e+00

Pos logit contributions
elling+0.044
aughter+0.041
ener+0.041
mith+0.041
ada+0.040
Neg logit contributions
ggy-0.022
One-0.022
gh-0.020
 nap-0.020
 read-0.019
Token surprise
Token ID5975
Feature activation-2.02e+00

Pos logit contributions
elling+0.067
aughter+0.064
ener+0.063
mith+0.062
 Me+0.061
Neg logit contributions
One-0.030
ggy-0.028
 badly-0.025
gh-0.025
On-0.024
Token no
Token ID645
Feature activation-2.71e+00

Pos logit contributions
elling+0.041
aughter+0.037
ada+0.035
ener+0.035
 Acc+0.035
Neg logit contributions
One-0.031
ggy-0.031
 badly-0.029
 che-0.029
 read-0.029
Token matter
Token ID2300
Feature activation-1.20e-01

Pos logit contributions
elling+0.061
aughter+0.057
ener+0.057
mith+0.056
ada+0.054
Neg logit contributions
One-0.042
ggy-0.042
gh-0.038
 badly-0.037
 nap-0.037
Token how
Token ID703
Feature activation-1.01e+00

Pos logit contributions
elling+0.002
aughter+0.002
ener+0.002
mith+0.002
ada+0.002
Neg logit contributions
One-0.002
ggy-0.002
 badly-0.002
gh-0.002
 read-0.002
Token hard
Token ID1327
Feature activation-6.54e+00

Pos logit contributions
elling+0.027
aughter+0.026
mith+0.025
ener+0.025
 Acc+0.024
Neg logit contributions
ggy-0.011
One-0.011
gh-0.010
ggie-0.010
 read-0.010
Token the
Token ID262
Feature activation-5.38e+00

Pos logit contributions
elling+0.158
aughter+0.150
ada+0.143
ener+0.143
 Acc+0.141
Neg logit contributions
ggy-0.094
One-0.091
 che-0.087
gh-0.087
 nap-0.086
Token little
Token ID1310
Feature activation-1.09e+01

Pos logit contributions
elling+0.111
ener+0.105
aughter+0.104
mith+0.103
 Girl+0.099
Neg logit contributions
ggy-0.093
One-0.093
gh-0.088
ggie-0.084
On-0.082
Token boy
Token ID2933
Feature activation-4.08e+00

Pos logit contributions
elling+0.178
aughter+0.162
 Acc+0.154
ishly+0.154
ada+0.152
Neg logit contributions
One-0.231
 che-0.229
ggy-0.228
 nap-0.226
 hidden-0.224
Token worked
Token ID3111
Feature activation+2.55e-01

Pos logit contributions
elling+0.098
aughter+0.093
mith+0.090
ener+0.089
ada+0.088
Neg logit contributions
ggy-0.059
One-0.056
 che-0.054
 read-0.053
gh-0.052
Token,
Token ID11
Feature activation-3.18e+00

Pos logit contributions
One+0.004
ggy+0.004
gh+0.004
 badly+0.004
 che+0.004
Neg logit contributions
elling-0.005
aughter-0.005
ener-0.005
mith-0.005
ada-0.005
Token he
Token ID339
Feature activation+7.64e-01

Pos logit contributions
elling+0.078
aughter+0.073
ener+0.070
ada+0.069
 Acc+0.067
Neg logit contributions
One-0.046
ggy-0.046
 che-0.044
 badly-0.044
 not-0.043
Token never
Token ID1239
Feature activation-5.75e-01

Pos logit contributions
One+0.013
ggy+0.013
gh+0.012
 nap+0.011
 badly+0.011
Neg logit contributions
elling-0.016
ener-0.015
aughter-0.015
mith-0.015
ada-0.014
Token have
Token ID423
Feature activation+3.00e+00

Pos logit contributions
One+0.007
ggy+0.007
gh+0.006
 read+0.006
 nap+0.006
Neg logit contributions
elling-0.009
aughter-0.009
ener-0.008
mith-0.008
ada-0.008
Token stopped
Token ID5025
Feature activation-2.28e+00

Pos logit contributions
One+0.042
ggy+0.040
gh+0.037
 badly+0.036
On+0.036
Neg logit contributions
elling-0.071
ener-0.066
aughter-0.066
mith-0.065
 Me-0.063
Token the
Token ID262
Feature activation-1.41e+00

Pos logit contributions
elling+0.050
aughter+0.050
mith+0.049
ener+0.047
ishly+0.046
Neg logit contributions
One-0.036
ggy-0.034
 badly-0.032
 not-0.032
 hidden-0.032
Token rh
Token ID9529
Feature activation-2.21e+00

Pos logit contributions
elling+0.031
aughter+0.029
ener+0.029
mith+0.028
ada+0.028
Neg logit contributions
One-0.022
ggy-0.022
gh-0.020
 nap-0.020
 che-0.020
Tokenin
Token ID259
Feature activation-3.24e+00

Pos logit contributions
elling+0.042
mith+0.039
ener+0.039
aughter+0.039
ada+0.036
Neg logit contributions
One-0.041
ggy-0.040
gh-0.040
 seek-0.039
 nap-0.039
Tokenoc
Token ID420
Feature activation-1.08e+01

Pos logit contributions
elling+0.072
aughter+0.069
 Acc+0.066
ener+0.065
ada+0.065
Neg logit contributions
One-0.052
ggy-0.052
 read-0.047
 che-0.046
 nap-0.046
Tokeneros
Token ID27498
Feature activation-1.92e+00

Pos logit contributions
 Linda+0.161
aughter+0.157
elling+0.154
 Jenna+0.154
 Suz+0.153
Neg logit contributions
ggy-0.234
gh-0.221
 che-0.219
ggie-0.214
 badly-0.211
Token.
Token ID13
Feature activation-2.94e+00

Pos logit contributions
elling+0.043
aughter+0.041
ener+0.040
mith+0.040
ada+0.039
Neg logit contributions
One-0.029
ggy-0.029
gh-0.026
 badly-0.025
ggie-0.025
Token He
Token ID679
Feature activation+1.69e+00

Pos logit contributions
aughter+0.057
ener+0.056
elling+0.053
isher+0.051
atted+0.051
Neg logit contributions
One-0.055
 not-0.048
 much-0.047
On-0.047
 read-0.047
Token is
Token ID318
Feature activation+6.37e+00

Pos logit contributions
One+0.032
ggy+0.032
gh+0.030
 read+0.029
 che+0.029
Neg logit contributions
elling-0.032
aughter-0.030
ener-0.029
mith-0.029
ada-0.028
Token sleeping
Token ID11029
Feature activation-6.56e-01

Pos logit contributions
One+0.085
ggy+0.078
gh+0.074
On+0.070
 badly+0.068
Neg logit contributions
elling-0.155
aughter-0.147
ener-0.146
mith-0.144
 Me-0.142
Token the
Token ID262
Feature activation-4.30e+00

Pos logit contributions
elling+0.053
aughter+0.049
ener+0.047
ada+0.046
mith+0.046
Neg logit contributions
ggy-0.031
 hidden-0.031
 read-0.031
 not-0.031
 badly-0.030
Token mystery
Token ID10715
Feature activation-2.07e+00

Pos logit contributions
elling+0.104
aughter+0.097
mith+0.096
ener+0.094
ada+0.092
Neg logit contributions
One-0.059
ggy-0.059
 nap-0.055
 hidden-0.053
 seek-0.053
Token of
Token ID286
Feature activation-5.17e+00

Pos logit contributions
elling+0.049
 Acc+0.045
aughter+0.045
ener+0.044
ada+0.043
Neg logit contributions
ggy-0.031
 hidden-0.029
One-0.028
gh-0.028
 badly-0.027
Token where
Token ID810
Feature activation-3.90e+00

Pos logit contributions
elling+0.108
ener+0.102
aughter+0.099
mith+0.096
ada+0.093
Neg logit contributions
One-0.088
ggy-0.087
 read-0.083
 nap-0.083
 hidden-0.083
Token the
Token ID262
Feature activation-5.04e+00

Pos logit contributions
elling+0.081
ener+0.075
aughter+0.073
ada+0.071
ishly+0.071
Neg logit contributions
ggy-0.069
One-0.067
 hidden-0.065
 not-0.065
 badly-0.064
Token little
Token ID1310
Feature activation-1.08e+01

Pos logit contributions
elling+0.105
aughter+0.097
mith+0.096
ener+0.095
ishly+0.092
Neg logit contributions
One-0.086
ggy-0.085
 nap-0.079
gh-0.078
 seek-0.078
Token girl
Token ID2576
Feature activation-2.22e+00

Pos logit contributions
elling+0.160
mith+0.139
aughter+0.137
ishly+0.133
 Acc+0.132
Neg logit contributions
 nap-0.252
One-0.246
 seek-0.245
 hidden-0.244
 che-0.242
Token had
Token ID550
Feature activation+2.04e+00

Pos logit contributions
elling+0.052
aughter+0.049
mith+0.048
ener+0.047
ada+0.046
Neg logit contributions
ggy-0.033
One-0.032
gh-0.030
 read-0.029
 che-0.029
Token gone
Token ID3750
Feature activation-4.01e-01

Pos logit contributions
One+0.026
ggy+0.025
gh+0.022
ggie+0.022
On+0.022
Neg logit contributions
elling-0.049
aughter-0.047
ener-0.047
mith-0.047
ada-0.046
Token remained
Token ID6150
Feature activation+1.77e+00

Pos logit contributions
elling+0.009
aughter+0.008
ener+0.008
mith+0.008
ada+0.008
Neg logit contributions
One-0.007
ggy-0.006
gh-0.006
 badly-0.006
 nap-0.006
Token unsolved
Token ID48180
Feature activation-3.11e+00

Pos logit contributions
One+0.029
ggy+0.028
gh+0.026
ggie+0.026
On+0.025
Neg logit contributions
elling-0.037
ener-0.036
aughter-0.035
mith-0.035
ishly-0.033
Token.
Token ID13
Feature activation-1.84e+00

Pos logit contributions
elling+0.059
aughter+0.056
ener+0.055
mith+0.055
ada+0.053
Neg logit contributions
ggy-0.041
One-0.041
gh-0.037
ggie-0.036
 nap-0.036
Token
Token ID198
Feature activation+5.97e+00

Pos logit contributions
ener+0.044
elling+0.042
aughter+0.039
isher+0.038
 roles+0.037
Neg logit contributions
One-0.032
 much-0.027
On-0.027
 che-0.025
ggy-0.025
Token
Token ID198
Feature activation+4.78e+00

Pos logit contributions
One+0.071
ggy+0.071
gh+0.066
 badly+0.063
 seek+0.062
Neg logit contributions
elling-0.150
aughter-0.144
mith-0.141
ener-0.139
 Me-0.137
TokenThe
Token ID464
Feature activation-3.11e+00

Pos logit contributions
ggy+0.076
gh+0.073
 nap+0.072
 badly+0.071
One+0.070
Neg logit contributions
elling-0.097
mith-0.092
aughter-0.091
 Me-0.089
ada-0.087
Token av
Token ID1196
Feature activation+3.49e+00

Pos logit contributions
elling+0.063
ener+0.060
aughter+0.059
mith+0.058
 Girl+0.055
Neg logit contributions
ggy-0.055
One-0.052
gh-0.052
ggie-0.050
 badly-0.048
Tokenoc
Token ID420
Feature activation-1.07e+01

Pos logit contributions
One+0.065
ggy+0.065
gh+0.061
 badly+0.059
 nap+0.059
Neg logit contributions
elling-0.067
aughter-0.062
ener-0.061
mith-0.061
ada-0.058
Tokenados
Token ID22484
Feature activation+1.28e+00

Pos logit contributions
elling+0.138
mith+0.138
aughter+0.138
 Linda+0.136
 Jos+0.134
Neg logit contributions
ggy-0.238
 nut-0.232
 che-0.231
One-0.227
gh-0.227
Token were
Token ID547
Feature activation+5.02e+00

Pos logit contributions
One+0.021
ggy+0.021
gh+0.020
 badly+0.019
ggie+0.019
Neg logit contributions
elling-0.027
aughter-0.025
ener-0.025
mith-0.025
ishly-0.024
Token shared
Token ID4888
Feature activation-1.00e-01

Pos logit contributions
One+0.063
ggy+0.062
gh+0.056
 badly+0.054
 nap+0.053
Neg logit contributions
elling-0.128
aughter-0.121
ener-0.120
mith-0.119
ada-0.115
Token with
Token ID351
Feature activation-3.44e+00

Pos logit contributions
elling+0.002
aughter+0.002
ener+0.002
mith+0.002
ada+0.002
Neg logit contributions
One-0.002
ggy-0.002
gh-0.001
 badly-0.001
 read-0.001
Token everybody
Token ID7288
Feature activation-5.62e-01

Pos logit contributions
elling+0.076
ener+0.075
aughter+0.074
mith+0.069
 Girl+0.068
Neg logit contributions
ggy-0.054
One-0.054
 badly-0.052
 much-0.050
 che-0.050
Token sky
Token ID6766
Feature activation+1.19e-01

Pos logit contributions
elling+0.051
aughter+0.047
ener+0.046
mith+0.046
ada+0.044
Neg logit contributions
One-0.060
ggy-0.058
gh-0.055
 nap-0.055
 read-0.054
Token and
Token ID290
Feature activation-2.50e+00

Pos logit contributions
One+0.002
ggy+0.002
 read+0.002
 nap+0.002
 che+0.002
Neg logit contributions
elling-0.003
aughter-0.003
mith-0.003
ener-0.003
ada-0.002
Token said
Token ID531
Feature activation-3.74e+00

Pos logit contributions
elling+0.059
aughter+0.056
mith+0.053
ener+0.052
 Acc+0.052
Neg logit contributions
One-0.036
ggy-0.036
 read-0.036
 che-0.035
 hidden-0.034
Token,
Token ID11
Feature activation-2.32e+00

Pos logit contributions
elling+0.082
aughter+0.075
mith+0.074
ener+0.072
ada+0.070
Neg logit contributions
ggy-0.066
One-0.062
 badly-0.062
gh-0.061
 long-0.059
Token "
Token ID366
Feature activation-1.33e+00

Pos logit contributions
elling+0.043
aughter+0.039
ener+0.036
isher+0.036
mith+0.036
Neg logit contributions
One-0.050
 not-0.047
ggy-0.046
 long-0.046
On-0.046
TokenLittle
Token ID22253
Feature activation-1.07e+01

Pos logit contributions
elling+0.029
aughter+0.028
ener+0.027
mith+0.027
ishly+0.027
Neg logit contributions
One-0.024
ggy-0.020
On-0.020
 read-0.019
P-0.019
Token girl
Token ID2576
Feature activation-2.87e+00

Pos logit contributions
elling+0.174
mith+0.161
 Acc+0.159
aughter+0.152
ada+0.147
Neg logit contributions
ggy-0.227
 seek-0.222
One-0.221
 nap-0.219
 hidden-0.216
Token,
Token ID11
Feature activation-2.69e+00

Pos logit contributions
elling+0.062
mith+0.058
aughter+0.058
ener+0.055
ada+0.054
Neg logit contributions
ggy-0.050
One-0.046
gh-0.045
 badly-0.044
 che-0.044
Token why
Token ID1521
Feature activation-3.58e+00

Pos logit contributions
elling+0.057
aughter+0.054
ener+0.053
mith+0.053
ada+0.051
Neg logit contributions
One-0.044
ggy-0.044
 nap-0.041
 read-0.041
 badly-0.040
Token are
Token ID389
Feature activation+1.69e+00

Pos logit contributions
elling+0.070
aughter+0.066
ener+0.065
mith+0.064
 Me+0.062
Neg logit contributions
One-0.065
ggy-0.063
gh-0.060
On-0.057
ggie-0.057
Token you
Token ID345
Feature activation+1.92e+00

Pos logit contributions
One+0.030
ggy+0.030
gh+0.028
 badly+0.027
ggie+0.027
Neg logit contributions
elling-0.034
aughter-0.032
ener-0.032
mith-0.031
ada-0.030
Token then
Token ID788
Feature activation-1.64e+00

Pos logit contributions
elling+0.041
aughter+0.039
ener+0.039
mith+0.038
ada+0.037
Neg logit contributions
ggy-0.034
One-0.034
 badly-0.031
 read-0.030
 hidden-0.030
Token on
Token ID319
Feature activation-3.27e+00

Pos logit contributions
elling+0.044
aughter+0.042
ener+0.041
mith+0.041
 Me+0.040
Neg logit contributions
One-0.018
ggy-0.018
gh-0.016
 badly-0.015
ggie-0.015
Token,
Token ID11
Feature activation-2.26e+00

Pos logit contributions
elling+0.089
ada+0.078
ener+0.078
aughter+0.077
isher+0.076
Neg logit contributions
ggy-0.046
 long-0.044
gh-0.044
 not-0.042
One-0.042
Token whenever
Token ID8797
Feature activation-3.81e+00

Pos logit contributions
elling+0.057
ener+0.052
aughter+0.052
mith+0.050
ishly+0.050
Neg logit contributions
 well-0.033
 read-0.033
 long-0.033
 che-0.032
 not-0.032
Token the
Token ID262
Feature activation-5.18e+00

Pos logit contributions
elling+0.072
aughter+0.066
ener+0.066
mith+0.064
ada+0.063
Neg logit contributions
One-0.075
ggy-0.074
 read-0.070
 nap-0.069
 seek-0.069
Token young
Token ID1862
Feature activation-1.07e+01

Pos logit contributions
elling+0.114
aughter+0.109
ener+0.107
mith+0.106
 Girl+0.104
Neg logit contributions
ggy-0.082
One-0.080
gh-0.079
 badly-0.073
ggie-0.073
Token girl
Token ID2576
Feature activation-4.55e+00

Pos logit contributions
elling+0.160
 Acc+0.146
aughter+0.137
mith+0.135
ishly+0.131
Neg logit contributions
One-0.243
 seek-0.239
 nap-0.239
 hidden-0.237
 read-0.236
Token came
Token ID1625
Feature activation-6.12e-01

Pos logit contributions
elling+0.107
aughter+0.102
mith+0.100
ener+0.097
ada+0.096
Neg logit contributions
One-0.065
ggy-0.065
 read-0.062
 che-0.061
 nap-0.059
Token to
Token ID284
Feature activation-5.18e-02

Pos logit contributions
elling+0.012
aughter+0.012
mith+0.012
ener+0.011
ada+0.011
Neg logit contributions
One-0.011
ggy-0.011
 badly-0.010
 hidden-0.010
gh-0.010
Token visit
Token ID3187
Feature activation-2.37e+00

Pos logit contributions
elling+0.001
aughter+0.001
ener+0.001
mith+0.001
 Acc+0.001
Neg logit contributions
ggy-0.001
One-0.001
gh-0.001
 badly-0.001
ggie-0.001
Token,
Token ID11
Feature activation-2.22e+00

Pos logit contributions
elling+0.054
aughter+0.051
ener+0.049
ada+0.049
mith+0.049
Neg logit contributions
One-0.038
 read-0.037
 hidden-0.036
ggy-0.035
 well-0.035
Token the
Token ID262
Feature activation-1.56e+00

Pos logit contributions
One+0.063
ggy+0.062
 not+0.059
 badly+0.058
 much+0.056
Neg logit contributions
elling-0.082
ener-0.077
aughter-0.076
mith-0.072
ada-0.071
Token story
Token ID1621
Feature activation+2.60e-01

Pos logit contributions
elling+0.034
aughter+0.033
ener+0.033
mith+0.032
 Me+0.031
Neg logit contributions
ggy-0.024
One-0.024
gh-0.023
 badly-0.021
ggie-0.021
Token of
Token ID286
Feature activation-4.31e+00

Pos logit contributions
ggy+0.004
 read+0.004
One+0.004
ggie+0.004
 not+0.003
Neg logit contributions
elling-0.006
ener-0.006
aughter-0.006
ada-0.006
 Acc-0.006
Token how
Token ID703
Feature activation-3.12e-01

Pos logit contributions
elling+0.094
ener+0.087
aughter+0.087
mith+0.082
ada+0.082
Neg logit contributions
ggy-0.073
One-0.071
 badly-0.068
 not-0.066
 hidden-0.066
Token the
Token ID262
Feature activation-5.01e+00

Pos logit contributions
elling+0.007
ener+0.006
mith+0.006
aughter+0.006
ishly+0.006
Neg logit contributions
ggy-0.005
One-0.005
 badly-0.005
gh-0.005
ggie-0.005
Token little
Token ID1310
Feature activation-1.07e+01

Pos logit contributions
elling+0.098
aughter+0.093
ener+0.093
mith+0.090
 Girl+0.088
Neg logit contributions
ggy-0.090
One-0.090
gh-0.087
ggie-0.081
On-0.080
Token girl
Token ID2576
Feature activation-2.94e+00

Pos logit contributions
elling+0.151
mith+0.129
aughter+0.128
 Acc+0.127
ishly+0.124
Neg logit contributions
One-0.252
 nap-0.251
 hidden-0.246
 seek-0.246
 che-0.245
Token got
Token ID1392
Feature activation+2.20e+00

Pos logit contributions
elling+0.072
aughter+0.069
mith+0.067
ener+0.066
ada+0.065
Neg logit contributions
ggy-0.039
One-0.038
 read-0.038
 che-0.037
 hidden-0.035
Token her
Token ID607
Feature activation-1.97e+00

Pos logit contributions
ggy+0.033
 badly+0.033
 hidden+0.032
One+0.032
 che+0.032
Neg logit contributions
elling-0.049
aughter-0.048
mith-0.047
 Acc-0.045
 Me-0.045
Token magical
Token ID10883
Feature activation-2.97e+00

Pos logit contributions
elling+0.051
aughter+0.049
ener+0.049
mith+0.048
ada+0.046
Neg logit contributions
One-0.023
ggy-0.023
gh-0.021
ggie-0.019
 badly-0.019
Token jewel
Token ID16927
Feature activation+1.94e+00

Pos logit contributions
elling+0.073
ener+0.072
aughter+0.069
mith+0.069
 Girl+0.066
Neg logit contributions
One-0.038
ggy-0.038
gh-0.033
On-0.033
ggie-0.032
Token new
Token ID649
Feature activation-1.68e+00

Pos logit contributions
elling+0.040
aughter+0.038
ener+0.037
mith+0.037
ada+0.036
Neg logit contributions
One-0.025
ggy-0.025
gh-0.023
 nap-0.022
 badly-0.022
Token home
Token ID1363
Feature activation-1.63e-02

Pos logit contributions
elling+0.033
ener+0.032
aughter+0.032
mith+0.031
 Me+0.030
Neg logit contributions
One-0.029
ggy-0.029
gh-0.027
ggie-0.026
 che-0.026
Token.
Token ID13
Feature activation-1.38e+00

Pos logit contributions
elling+0.000
aughter+0.000
ener+0.000
mith+0.000
 Acc+0.000
Neg logit contributions
ggy-0.000
One-0.000
 badly-0.000
gh-0.000
 hidden-0.000
Token Everywhere
Token ID48201
Feature activation-3.57e+00

Pos logit contributions
ener+0.043
isher+0.040
elling+0.039
ishly+0.035
mith+0.035
Neg logit contributions
On-0.016
One-0.016
 much-0.013
 long-0.013
 read-0.012
Token the
Token ID262
Feature activation-6.10e+00

Pos logit contributions
elling+0.089
ada+0.081
ener+0.081
 Acc+0.081
aughter+0.080
Neg logit contributions
 long-0.053
 hidden-0.053
 far-0.051
ggy-0.050
 well-0.050
Token little
Token ID1310
Feature activation-1.07e+01

Pos logit contributions
elling+0.103
aughter+0.095
ener+0.093
mith+0.093
ada+0.088
Neg logit contributions
One-0.129
ggy-0.127
gh-0.119
 badly-0.117
 nap-0.117
Token girl
Token ID2576
Feature activation-2.94e+00

Pos logit contributions
elling+0.195
 Acc+0.192
ishly+0.174
aughter+0.165
idents+0.163
Neg logit contributions
 hidden-0.208
 long-0.205
 nap-0.204
 big-0.204
 read-0.198
Token looked
Token ID3114
Feature activation-2.69e-01

Pos logit contributions
elling+0.072
aughter+0.068
mith+0.066
ener+0.065
ada+0.064
Neg logit contributions
ggy-0.040
One-0.040
 read-0.039
 che-0.037
 hidden-0.036
Token she
Token ID673
Feature activation-7.70e-01

Pos logit contributions
elling+0.006
aughter+0.006
mith+0.006
ada+0.006
 Acc+0.005
Neg logit contributions
 hidden-0.004
One-0.004
ggy-0.004
 long-0.004
 badly-0.004
Token saw
Token ID2497
Feature activation-2.66e+00

Pos logit contributions
elling+0.019
aughter+0.018
mith+0.018
ener+0.018
ada+0.018
Neg logit contributions
One-0.010
ggy-0.010
 read-0.009
 che-0.009
 hidden-0.009
Token something
Token ID1223
Feature activation+2.01e+00

Pos logit contributions
elling+0.064
aughter+0.059
ener+0.058
ada+0.058
 Acc+0.057
Neg logit contributions
 hidden-0.040
ggy-0.036
 read-0.035
 long-0.035
One-0.035
Token wait
Token ID4043
Feature activation-1.74e-01

Pos logit contributions
ggy+0.049
gh+0.049
cro+0.049
One+0.048
 villains+0.048
Neg logit contributions
ada-0.039
 Daughter-0.038
elling-0.038
 Acc-0.038
ener-0.037
Token until
Token ID1566
Feature activation-3.93e+00

Pos logit contributions
elling+0.003
aughter+0.003
ener+0.003
mith+0.003
ada+0.003
Neg logit contributions
One-0.003
ggy-0.003
gh-0.003
 nap-0.003
 badly-0.003
Token the
Token ID262
Feature activation-2.40e+00

Pos logit contributions
elling+0.064
aughter+0.060
ener+0.059
mith+0.058
 Me+0.057
Neg logit contributions
ggy-0.082
One-0.082
gh-0.077
ggie-0.075
 badly-0.074
Token rh
Token ID9529
Feature activation-2.23e+00

Pos logit contributions
elling+0.049
aughter+0.046
ener+0.046
mith+0.045
 Me+0.044
Neg logit contributions
ggy-0.041
One-0.041
gh-0.038
 badly-0.038
ggie-0.037
Tokenin
Token ID259
Feature activation-3.18e+00

Pos logit contributions
elling+0.042
aughter+0.039
mith+0.039
ener+0.039
ada+0.036
Neg logit contributions
One-0.043
gh-0.041
ggy-0.040
 nap-0.040
 seek-0.039
Tokenoc
Token ID420
Feature activation-1.07e+01

Pos logit contributions
elling+0.068
aughter+0.064
mith+0.062
ada+0.062
ener+0.061
Neg logit contributions
ggy-0.055
One-0.054
 read-0.050
 che-0.049
gh-0.048
Tokeneros
Token ID27498
Feature activation-1.86e+00

Pos logit contributions
elling+0.148
aughter+0.145
 Linda+0.143
mith+0.141
ada+0.140
Neg logit contributions
ggy-0.243
gh-0.225
ggie-0.221
 che-0.218
One-0.213
Token is
Token ID318
Feature activation+5.34e+00

Pos logit contributions
elling+0.039
aughter+0.036
mith+0.036
ener+0.035
ada+0.035
Neg logit contributions
ggy-0.032
One-0.032
gh-0.029
 read-0.029
 nap-0.029
Token gone
Token ID3750
Feature activation+7.25e-01

Pos logit contributions
One+0.082
ggy+0.080
 villains+0.074
ggie+0.071
gh+0.071
Neg logit contributions
elling-0.111
ener-0.108
aughter-0.098
 Daddy-0.098
 Jill-0.095
Token."
Token ID526
Feature activation-1.65e+00

Pos logit contributions
ggy+0.014
One+0.013
gh+0.012
ggie+0.012
 che+0.012
Neg logit contributions
elling-0.014
ener-0.013
aughter-0.013
mith-0.013
ada-0.012
Token
Token ID198
Feature activation+5.71e+00

Pos logit contributions
elling+0.050
ener+0.049
ishly+0.047
isher+0.046
mith+0.045
Neg logit contributions
One-0.019
 nap-0.016
On-0.016
 long-0.015
 read-0.014
Token name
Token ID1438
Feature activation-1.92e+00

Pos logit contributions
elling+0.030
aughter+0.028
mith+0.028
ener+0.027
ada+0.026
Neg logit contributions
ggy-0.022
One-0.021
 nap-0.020
gh-0.020
 badly-0.020
Token and
Token ID290
Feature activation-2.26e+00

Pos logit contributions
elling+0.044
aughter+0.041
 Acc+0.041
ener+0.040
mith+0.039
Neg logit contributions
ggy-0.031
One-0.029
gh-0.028
ggie-0.027
 badly-0.027
Token said
Token ID531
Feature activation-3.59e+00

Pos logit contributions
elling+0.049
aughter+0.046
ener+0.044
mith+0.044
ada+0.043
Neg logit contributions
ggy-0.038
One-0.036
 read-0.035
 che-0.034
 seek-0.034
Token,
Token ID11
Feature activation-1.97e+00

Pos logit contributions
elling+0.078
aughter+0.072
mith+0.069
ener+0.069
ada+0.067
Neg logit contributions
ggy-0.065
gh-0.061
One-0.060
 badly-0.060
 not-0.058
Token "
Token ID366
Feature activation-9.95e-01

Pos logit contributions
elling+0.038
aughter+0.034
ener+0.031
mith+0.031
cy+0.031
Neg logit contributions
One-0.042
 not-0.041
ggy-0.040
 long-0.039
 read-0.039
TokenLittle
Token ID22253
Feature activation-1.07e+01

Pos logit contributions
elling+0.020
aughter+0.019
ener+0.019
mith+0.019
ada+0.018
Neg logit contributions
One-0.017
ggy-0.017
gh-0.016
 badly-0.016
 nap-0.016
Token girl
Token ID2576
Feature activation-2.70e+00

Pos logit contributions
elling+0.176
mith+0.161
aughter+0.157
 Acc+0.154
ada+0.152
Neg logit contributions
ggy-0.234
One-0.223
 seek-0.221
 nap-0.219
 che-0.216
Token,
Token ID11
Feature activation-2.45e+00

Pos logit contributions
elling+0.058
mith+0.054
aughter+0.053
ener+0.051
ada+0.050
Neg logit contributions
ggy-0.048
One-0.044
gh-0.043
ggie-0.043
 che-0.042
Token give
Token ID1577
Feature activation-1.71e+00

Pos logit contributions
elling+0.054
aughter+0.051
ener+0.050
mith+0.050
ada+0.048
Neg logit contributions
One-0.039
ggy-0.038
gh-0.035
 badly-0.034
 nap-0.034
Token me
Token ID502
Feature activation-1.43e+00

Pos logit contributions
mith+0.041
elling+0.041
 Acc+0.039
ener+0.039
aughter+0.039
Neg logit contributions
ggy-0.024
One-0.023
 hidden-0.021
ggie-0.021
 read-0.021
Token that
Token ID326
Feature activation-4.27e+00

Pos logit contributions
elling+0.031
aughter+0.030
mith+0.030
ener+0.029
 Acc+0.029
Neg logit contributions
ggy-0.023
One-0.022
 badly-0.020
 nap-0.020
gh-0.020
Token had
Token ID550
Feature activation+2.47e+00

Pos logit contributions
mith+0.004
ishly+0.004
ener+0.004
elling+0.004
ted+0.004
Neg logit contributions
One-0.004
ggy-0.004
gh-0.004
P-0.004
irl-0.004
Token never
Token ID1239
Feature activation-9.69e-01

Pos logit contributions
One+0.033
On+0.030
vel+0.030
 prow+0.028
irl+0.028
Neg logit contributions
 Dad-0.050
 Joe-0.049
 Daddy-0.049
elling-0.048
 Acc-0.048
Token seen
Token ID1775
Feature activation-1.93e+00

Pos logit contributions
elling+0.018
ener+0.017
ted+0.016
 Let+0.016
 means+0.016
Neg logit contributions
ggy-0.016
irl-0.016
One-0.016
 badly-0.015
ggie-0.015
Token the
Token ID262
Feature activation-2.74e+00

Pos logit contributions
elling+0.042
aughter+0.041
ener+0.040
mith+0.039
 Acc+0.038
Neg logit contributions
One-0.031
ggy-0.029
 badly-0.027
 che-0.027
gh-0.027
Token av
Token ID1196
Feature activation+3.79e+00

Pos logit contributions
elling+0.056
ener+0.054
aughter+0.054
mith+0.053
 Girl+0.051
Neg logit contributions
ggy-0.047
One-0.046
gh-0.043
ggie-0.042
On-0.042
Tokenoc
Token ID420
Feature activation-1.06e+01

Pos logit contributions
ggy+0.088
gh+0.081
ggie+0.081
One+0.081
 nap+0.078
Neg logit contributions
elling-0.058
ener-0.056
aughter-0.049
mith-0.048
 Girl-0.046
Tokenados
Token ID22484
Feature activation+8.93e-01

Pos logit contributions
elling+0.139
 Linda+0.138
aughter+0.138
 Jos+0.134
 Suz+0.134
Neg logit contributions
ggy-0.236
gh-0.232
 che-0.231
One-0.231
 nut-0.229
Token.
Token ID13
Feature activation-2.56e+00

Pos logit contributions
ggy+0.018
One+0.018
gh+0.017
ggie+0.016
 prow+0.016
Neg logit contributions
elling-0.015
ener-0.014
mith-0.014
aughter-0.014
 Me-0.013
Token They
Token ID1119
Feature activation+8.41e-01

Pos logit contributions
elling+0.052
ener+0.050
aughter+0.046
isher+0.044
ada+0.044
Neg logit contributions
One-0.053
On-0.050
 not-0.050
 long-0.049
 badly-0.047
Token wished
Token ID16555
Feature activation-3.87e+00

Pos logit contributions
gh+0.016
ggy+0.015
One+0.015
ggie+0.014
 seek+0.014
Neg logit contributions
ishly-0.015
mith-0.015
ener-0.014
ted-0.014
 Girl-0.014
Token they
Token ID484
Feature activation-3.02e-01

Pos logit contributions
elling+0.089
aughter+0.081
ener+0.078
ada+0.076
 Girl+0.073
Neg logit contributions
 not-0.073
ggy-0.068
 long-0.067
 badly-0.065
One-0.064
Token
Token ID198
Feature activation+6.38e+00

Pos logit contributions
One+0.026
ggy+0.026
On+0.024
gh+0.023
 badly+0.023
Neg logit contributions
elling-0.042
ener-0.041
mith-0.038
aughter-0.038
ishly-0.038
Token
Token ID198
Feature activation+5.15e+00

Pos logit contributions
ggy+0.103
One+0.103
gh+0.098
 badly+0.096
 che+0.094
Neg logit contributions
elling-0.131
aughter-0.125
mith-0.121
 Me-0.120
 Acc-0.119
TokenMom
Token ID29252
Feature activation-2.01e+00

Pos logit contributions
ggy+0.088
gh+0.084
 nap+0.083
One+0.083
 badly+0.082
Neg logit contributions
elling-0.097
aughter-0.093
mith-0.091
 Me-0.089
 Acc-0.088
Tokenmy
Token ID1820
Feature activation-1.01e+00

Pos logit contributions
elling+0.038
mith+0.037
aughter+0.036
ener+0.035
 Me+0.034
Neg logit contributions
One-0.037
 read-0.037
 che-0.035
ggy-0.035
 hidden-0.033
Token hugged
Token ID41130
Feature activation-4.31e+00

Pos logit contributions
elling+0.023
aughter+0.021
mith+0.020
ener+0.020
 Me+0.019
Neg logit contributions
ggy-0.017
 read-0.016
 che-0.016
One-0.016
 not-0.015
Token Little
Token ID7703
Feature activation-1.06e+01

Pos logit contributions
elling+0.106
ener+0.095
mith+0.090
als+0.090
 Acc+0.089
Neg logit contributions
 nap-0.062
One-0.060
 badly-0.058
 read-0.057
 long-0.057
Token Jimmy
Token ID12963
Feature activation-2.66e+00

Pos logit contributions
elling+0.208
mith+0.193
 Acc+0.190
aughter+0.182
ada+0.179
Neg logit contributions
 nap-0.186
 che-0.183
One-0.174
 read-0.172
 seek-0.166
Token and
Token ID290
Feature activation-3.02e+00

Pos logit contributions
elling+0.063
mith+0.058
aughter+0.056
ada+0.056
ener+0.055
Neg logit contributions
 badly-0.039
 che-0.038
 nap-0.038
ggy-0.038
 read-0.037
Token said
Token ID531
Feature activation-3.24e+00

Pos logit contributions
elling+0.067
ener+0.062
aughter+0.059
mith+0.059
ada+0.057
Neg logit contributions
 read-0.051
 che-0.050
ggy-0.050
 nap-0.047
One-0.047
Token,
Token ID11
Feature activation-2.06e+00

Pos logit contributions
elling+0.069
ener+0.064
mith+0.062
aughter+0.062
 Girl+0.059
Neg logit contributions
ggy-0.057
gh-0.055
 badly-0.055
One-0.054
 long-0.052
Token "
Token ID366
Feature activation-1.46e+00

Pos logit contributions
elling+0.038
ener+0.034
aughter+0.033
mith+0.032
cy+0.031
Neg logit contributions
One-0.044
 not-0.043
ggy-0.042
On-0.041
 read-0.041
TokenAnd
Token ID1870
Feature activation-1.97e+00

Pos logit contributions
ggy+0.070
One+0.070
gh+0.067
 nap+0.065
 badly+0.064
Neg logit contributions
elling-0.098
aughter-0.093
mith-0.091
ener-0.089
 Acc-0.089
Token that
Token ID326
Feature activation-3.75e+00

Pos logit contributions
elling+0.051
ener+0.047
aughter+0.047
mith+0.045
 Acc+0.045
Neg logit contributions
ggy-0.028
 not-0.026
 badly-0.026
 read-0.025
ggie-0.024
Token was
Token ID373
Feature activation+3.72e+00

Pos logit contributions
elling+0.080
mith+0.075
ener+0.075
aughter+0.075
ada+0.072
Neg logit contributions
One-0.064
ggy-0.063
gh-0.058
 che-0.057
 read-0.057
Token how
Token ID703
Feature activation+1.17e-02

Pos logit contributions
ggy+0.056
One+0.056
 badly+0.052
gh+0.050
 not+0.050
Neg logit contributions
elling-0.088
aughter-0.084
ener-0.084
mith-0.082
ada-0.079
Token the
Token ID262
Feature activation-4.10e+00

Pos logit contributions
ggy+0.000
One+0.000
 badly+0.000
 long+0.000
 che+0.000
Neg logit contributions
elling-0.000
ener-0.000
mith-0.000
aughter-0.000
ishly-0.000
Token little
Token ID1310
Feature activation-1.05e+01

Pos logit contributions
elling+0.098
aughter+0.092
mith+0.092
ener+0.090
ada+0.089
Neg logit contributions
One-0.059
ggy-0.056
 nap-0.052
 che-0.052
 read-0.051
Token girl
Token ID2576
Feature activation-2.84e+00

Pos logit contributions
elling+0.181
mith+0.170
ishly+0.166
aughter+0.165
 Acc+0.163
Neg logit contributions
 che-0.215
 hidden-0.213
 seek-0.211
One-0.208
 nut-0.207
Token learned
Token ID4499
Feature activation+1.44e-01

Pos logit contributions
elling+0.069
aughter+0.066
mith+0.065
ener+0.063
ada+0.062
Neg logit contributions
ggy-0.040
One-0.038
 che-0.038
 read-0.037
 nap-0.035
Token her
Token ID607
Feature activation-2.76e+00

Pos logit contributions
One+0.002
ggy+0.002
 badly+0.002
gh+0.002
 not+0.002
Neg logit contributions
elling-0.003
ener-0.003
aughter-0.003
mith-0.003
ada-0.003
Token lesson
Token ID11483
Feature activation-1.71e+00

Pos logit contributions
elling+0.062
mith+0.059
aughter+0.058
ener+0.058
ada+0.055
Neg logit contributions
ggy-0.046
One-0.043
 badly-0.041
gh-0.041
ggie-0.040
Token -
Token ID532
Feature activation-1.80e+00

Pos logit contributions
elling+0.043
ener+0.041
 Acc+0.041
ada+0.040
aughter+0.040
Neg logit contributions
ggy-0.024
 badly-0.022
 not-0.021
One-0.021
ggie-0.021
Token They
Token ID1119
Feature activation+1.17e+00

Pos logit contributions
ener+0.053
elling+0.052
aughter+0.049
mith+0.047
 Girl+0.044
Neg logit contributions
One-0.059
 long-0.057
On-0.057
 nap-0.056
ggy-0.055
Token forget
Token ID6044
Feature activation-7.97e-01

Pos logit contributions
ggy+0.017
One+0.017
gh+0.016
 badly+0.016
ggie+0.016
Neg logit contributions
elling-0.026
ener-0.025
aughter-0.025
mith-0.025
 Me-0.024
Token about
Token ID546
Feature activation-2.52e+00

Pos logit contributions
elling+0.021
ener+0.020
aughter+0.019
mith+0.019
ada+0.019
Neg logit contributions
ggy-0.010
One-0.010
gh-0.009
 badly-0.009
 nap-0.009
Token the
Token ID262
Feature activation-1.82e+00

Pos logit contributions
elling+0.041
aughter+0.038
ener+0.037
mith+0.037
ada+0.035
Neg logit contributions
One-0.055
ggy-0.054
gh-0.051
 badly-0.050
 nap-0.050
Token av
Token ID1196
Feature activation+3.87e+00

Pos logit contributions
elling+0.041
ener+0.040
mith+0.037
aughter+0.037
 Girl+0.036
Neg logit contributions
ggy-0.029
One-0.027
ggie-0.026
gh-0.025
On-0.025
Tokenoc
Token ID420
Feature activation-1.05e+01

Pos logit contributions
ggy+0.090
ggie+0.086
gh+0.086
One+0.085
 badly+0.085
Neg logit contributions
elling-0.054
ener-0.050
aughter-0.043
als-0.043
cy-0.042
Tokenados
Token ID22484
Feature activation+1.13e+00

Pos logit contributions
aughter+0.132
elling+0.131
mith+0.128
 Me+0.127
 Jos+0.121
Neg logit contributions
ggy-0.242
gh-0.238
 che-0.236
 nut-0.235
One-0.234
Token.
Token ID13
Feature activation-2.31e+00

Pos logit contributions
One+0.020
ggy+0.019
gh+0.018
ggie+0.017
 badly+0.017
Neg logit contributions
elling-0.022
ener-0.021
aughter-0.021
mith-0.021
ishly-0.020
Token They
Token ID1119
Feature activation+7.98e-01

Pos logit contributions
ener+0.041
elling+0.041
aughter+0.038
mith+0.037
ishly+0.034
Neg logit contributions
One-0.053
ggy-0.050
On-0.049
 long-0.048
 nap-0.047
Token just
Token ID655
Feature activation-1.69e+00

Pos logit contributions
ggy+0.014
gh+0.014
One+0.014
 badly+0.013
ggie+0.013
Neg logit contributions
elling-0.015
ener-0.015
 Me-0.014
mith-0.014
aughter-0.014
Token want
Token ID765
Feature activation-1.20e+00

Pos logit contributions
elling+0.035
ener+0.033
 Acc+0.031
 Me+0.031
aughter+0.031
Neg logit contributions
ggy-0.029
One-0.029
gh-0.028
ggie-0.027
 badly-0.026
Token appeared
Token ID4120
Feature activation-5.87e-01

Pos logit contributions
One+0.027
ggy+0.026
 seek+0.024
gh+0.024
 nap+0.023
Neg logit contributions
elling-0.026
ener-0.026
mith-0.025
ishly-0.024
ted-0.024
Token and
Token ID290
Feature activation-2.39e-01

Pos logit contributions
elling+0.014
aughter+0.013
mith+0.013
ener+0.013
 Acc+0.013
Neg logit contributions
One-0.008
ggy-0.008
 badly-0.007
gh-0.007
 hidden-0.007
Token said
Token ID531
Feature activation-3.27e+00

Pos logit contributions
elling+0.005
aughter+0.005
ener+0.005
mith+0.005
ada+0.004
Neg logit contributions
One-0.004
ggy-0.004
gh-0.004
 badly-0.004
 nap-0.004
Token,
Token ID11
Feature activation-2.04e+00

Pos logit contributions
elling+0.072
aughter+0.066
mith+0.064
ener+0.062
ada+0.061
Neg logit contributions
ggy-0.057
gh-0.055
One-0.054
 badly-0.053
 long-0.051
Token "
Token ID366
Feature activation-6.72e-01

Pos logit contributions
elling+0.038
aughter+0.034
cy+0.031
ener+0.031
mith+0.031
Neg logit contributions
One-0.044
 not-0.042
 long-0.041
ggy-0.041
On-0.041
TokenLittle
Token ID22253
Feature activation-1.05e+01

Pos logit contributions
elling+0.015
aughter+0.014
ener+0.014
mith+0.014
ishly+0.014
Neg logit contributions
One-0.012
On-0.010
P-0.010
ggy-0.010
 read-0.010
Token girl
Token ID2576
Feature activation-2.11e+00

Pos logit contributions
elling+0.177
 Acc+0.163
mith+0.160
aughter+0.157
ada+0.152
Neg logit contributions
 che-0.212
ggy-0.212
One-0.212
 seek-0.208
 nap-0.205
Token,
Token ID11
Feature activation-2.32e+00

Pos logit contributions
elling+0.046
aughter+0.043
mith+0.043
ener+0.040
ada+0.040
Neg logit contributions
ggy-0.037
One-0.033
 che-0.033
gh-0.032
ggie-0.032
Token pick
Token ID2298
Feature activation-1.74e+00

Pos logit contributions
elling+0.050
aughter+0.047
mith+0.046
ener+0.046
ada+0.044
Neg logit contributions
One-0.038
ggy-0.037
 read-0.035
 nap-0.035
 badly-0.034
Token the
Token ID262
Feature activation-2.97e+00

Pos logit contributions
 Acc+0.038
mith+0.038
ener+0.036
elling+0.035
anged+0.035
Neg logit contributions
 hidden-0.026
One-0.025
ggy-0.025
P-0.024
 well-0.024
Token right
Token ID826
Feature activation-3.74e+00

Pos logit contributions
elling+0.066
mith+0.062
aughter+0.061
 Acc+0.061
ishly+0.059
Neg logit contributions
One-0.045
 hidden-0.044
ggy-0.043
 nap-0.043
 long-0.042
Token know
Token ID760
Feature activation-1.31e+00

Pos logit contributions
ggy+0.057
ggie+0.052
gh+0.052
 badly+0.051
One+0.051
Neg logit contributions
elling-0.042
ener-0.041
ada-0.038
 Acc-0.038
aughter-0.037
Token Tw
Token ID1815
Feature activation-3.83e+00

Pos logit contributions
elling+0.025
aughter+0.024
ener+0.024
mith+0.023
ada+0.023
Neg logit contributions
One-0.024
ggy-0.024
gh-0.022
 badly-0.022
 read-0.022
Tokeninkle
Token ID19894
Feature activation-2.78e+00

Pos logit contributions
elling+0.070
aughter+0.066
ener+0.063
mith+0.062
ada+0.059
Neg logit contributions
One-0.073
 badly-0.070
 seek-0.070
ggy-0.070
On-0.069
Token Tw
Token ID1815
Feature activation-4.17e+00

Pos logit contributions
elling+0.066
aughter+0.063
mith+0.062
ener+0.061
 Acc+0.060
Neg logit contributions
ggy-0.040
One-0.039
gh-0.035
ggie-0.035
 read-0.035
Tokeninkle
Token ID19894
Feature activation-3.08e+00

Pos logit contributions
elling+0.077
aughter+0.071
ener+0.070
mith+0.068
ada+0.065
Neg logit contributions
One-0.081
ggy-0.078
 badly-0.076
 seek-0.076
On-0.075
Token Little
Token ID7703
Feature activation-1.05e+01

Pos logit contributions
elling+0.069
aughter+0.066
mith+0.065
ener+0.064
 Acc+0.063
Neg logit contributions
ggy-0.048
One-0.047
gh-0.043
ggie-0.043
 read-0.042
Token Star
Token ID2907
Feature activation-3.16e+00

Pos logit contributions
elling+0.190
mith+0.178
 Acc+0.176
aughter+0.174
ada+0.165
Neg logit contributions
ggy-0.206
One-0.200
 read-0.198
 nap-0.197
 che-0.196
Token?"
Token ID1701
Feature activation-1.93e+00

Pos logit contributions
elling+0.075
mith+0.071
aughter+0.070
 Acc+0.069
ener+0.069
Neg logit contributions
ggy-0.046
One-0.045
 read-0.042
ggie-0.041
gh-0.040
Token
Token ID198
Feature activation+5.44e+00

Pos logit contributions
elling+0.052
ener+0.048
aughter+0.048
ishly+0.048
mith+0.047
Neg logit contributions
One-0.027
 nap-0.024
ggy-0.023
On-0.023
 read-0.022
Token
Token ID198
Feature activation+4.54e+00

Pos logit contributions
One+0.093
ggy+0.090
gh+0.082
 nap+0.081
On+0.081
Neg logit contributions
elling-0.116
aughter-0.109
ener-0.109
mith-0.107
ishly-0.103
TokenL
Token ID43
Feature activation-4.59e+00

Pos logit contributions
One+0.067
ggy+0.063
gh+0.057
On+0.057
 seek+0.056
Neg logit contributions
elling-0.110
aughter-0.104
ener-0.103
mith-0.102
ishly-0.099
Token sings
Token ID33041
Feature activation+1.77e+00

Pos logit contributions
One+0.021
ggy+0.021
gh+0.021
 badly+0.020
ggie+0.020
Neg logit contributions
elling-0.015
aughter-0.013
ener-0.013
mith-0.013
 Me-0.012
Token Tw
Token ID1815
Feature activation-3.83e+00

Pos logit contributions
One+0.026
ggy+0.025
gh+0.023
 read+0.022
 badly+0.022
Neg logit contributions
elling-0.042
aughter-0.040
mith-0.040
ener-0.039
ada-0.038
Tokeninkle
Token ID19894
Feature activation-2.84e+00

Pos logit contributions
elling+0.073
aughter+0.068
ener+0.067
mith+0.065
ada+0.062
Neg logit contributions
One-0.072
ggy-0.071
 badly-0.066
 seek-0.066
On-0.066
Token Tw
Token ID1815
Feature activation-3.76e+00

Pos logit contributions
elling+0.069
aughter+0.066
mith+0.066
 Acc+0.064
ener+0.064
Neg logit contributions
ggy-0.039
One-0.038
 badly-0.034
gh-0.034
ggie-0.034
Tokeninkle
Token ID19894
Feature activation-2.96e+00

Pos logit contributions
elling+0.074
aughter+0.069
ener+0.068
mith+0.067
ada+0.064
Neg logit contributions
One-0.069
ggy-0.068
gh-0.063
 badly-0.063
 seek-0.062
Token Little
Token ID7703
Feature activation-1.05e+01

Pos logit contributions
elling+0.070
aughter+0.067
mith+0.067
ener+0.065
 Acc+0.064
Neg logit contributions
ggy-0.042
One-0.041
gh-0.037
 nap-0.037
 badly-0.037
Token Star
Token ID2907
Feature activation-2.77e+00

Pos logit contributions
elling+0.198
mith+0.194
aughter+0.190
ada+0.181
 Acc+0.180
Neg logit contributions
 nap-0.199
 che-0.198
ggy-0.198
One-0.190
 read-0.188
Token.
Token ID13
Feature activation-2.38e+00

Pos logit contributions
elling+0.072
mith+0.069
aughter+0.068
 Acc+0.066
ada+0.066
Neg logit contributions
ggy-0.034
One-0.033
 read-0.030
 nap-0.030
gh-0.030
Token She
Token ID1375
Feature activation+9.85e-01

Pos logit contributions
elling+0.043
aughter+0.041
ener+0.041
mith+0.040
ishly+0.038
Neg logit contributions
One-0.050
ggy-0.046
 nap-0.045
On-0.045
 read-0.045
Token falls
Token ID8953
Feature activation-1.49e+00

Pos logit contributions
One+0.020
ggy+0.020
gh+0.019
 read+0.019
 nap+0.018
Neg logit contributions
elling-0.017
aughter-0.016
ener-0.016
mith-0.015
ada-0.015
Token asleep
Token ID16039
Feature activation+1.03e+00

Pos logit contributions
elling+0.031
aughter+0.030
ener+0.029
mith+0.029
ada+0.028
Neg logit contributions
One-0.025
ggy-0.024
 nap-0.023
 badly-0.023
gh-0.023
Token advice
Token ID5608
Feature activation-7.77e-01

Pos logit contributions
elling+0.006
aughter+0.006
mith+0.006
ener+0.006
ada+0.005
Neg logit contributions
One-0.006
ggy-0.006
gh-0.006
 badly-0.006
 nap-0.006
Token next
Token ID1306
Feature activation-2.25e+00

Pos logit contributions
elling+0.018
aughter+0.017
ener+0.016
ada+0.016
 Me+0.016
Neg logit contributions
One-0.012
ggy-0.012
 badly-0.012
gh-0.011
 not-0.011
Token time
Token ID640
Feature activation-2.31e+00

Pos logit contributions
elling+0.058
ener+0.055
aughter+0.055
mith+0.054
 Girl+0.053
Neg logit contributions
ggy-0.025
 seek-0.022
One-0.022
 villains-0.021
 badly-0.020
Token.
Token ID13
Feature activation-2.02e-01

Pos logit contributions
elling+0.055
aughter+0.052
ener+0.050
mith+0.049
ada+0.048
Neg logit contributions
One-0.035
ggy-0.034
 badly-0.032
gh-0.032
 read-0.031
Token
Token ID220
Feature activation+2.62e+00

Pos logit contributions
elling+0.005
ener+0.004
aughter+0.004
mith+0.004
isher+0.004
Neg logit contributions
One-0.004
On-0.003
 read-0.003
ggy-0.003
 che-0.003
Token
Token ID198
Feature activation+6.84e+00

Pos logit contributions
One+0.036
ggy+0.033
On+0.030
gh+0.028
 badly+0.028
Neg logit contributions
elling-0.068
ener-0.066
ishly-0.062
aughter-0.062
 Girl-0.061
Token
Token ID198
Feature activation+5.76e+00

Pos logit contributions
One+0.116
ggy+0.116
gh+0.109
 badly+0.106
 nap+0.105
Neg logit contributions
elling-0.139
aughter-0.131
mith-0.128
ener-0.128
ada-0.124
TokenThe
Token ID464
Feature activation-2.03e+00

Pos logit contributions
ggy+0.080
gh+0.076
One+0.076
 nap+0.075
 badly+0.075
Neg logit contributions
elling-0.127
aughter-0.122
mith-0.120
ada-0.118
 Me-0.117
Token moral
Token ID6573
Feature activation+1.59e+00

Pos logit contributions
elling+0.045
aughter+0.042
mith+0.041
ener+0.041
 Me+0.040
Neg logit contributions
One-0.034
ggy-0.032
gh-0.029
 che-0.029
 read-0.029
Token of
Token ID286
Feature activation-5.16e+00

Pos logit contributions
One+0.029
ggy+0.029
gh+0.027
 badly+0.026
 nap+0.026
Neg logit contributions
elling-0.031
aughter-0.029
ener-0.028
mith-0.028
ada-0.027
Token this
Token ID428
Feature activation-2.78e+00

Pos logit contributions
elling+0.106
ener+0.106
aughter+0.098
mith+0.097
ada+0.097
Neg logit contributions
ggy-0.093
One-0.091
 not-0.084
 read-0.084
 nap-0.084
Token after
Token ID706
Feature activation-5.30e+00

Pos logit contributions
elling+0.036
aughter+0.033
ener+0.032
mith+0.032
ada+0.031
Neg logit contributions
ggy-0.024
One-0.024
 badly-0.022
gh-0.022
 read-0.021
Token himself
Token ID2241
Feature activation-1.97e+00

Pos logit contributions
elling+0.109
aughter+0.102
ener+0.101
mith+0.100
ada+0.096
Neg logit contributions
One-0.092
ggy-0.090
gh-0.084
 badly-0.081
 nap-0.081
Token.
Token ID13
Feature activation-1.42e+00

Pos logit contributions
elling+0.043
aughter+0.041
ener+0.041
mith+0.040
ada+0.038
Neg logit contributions
One-0.031
ggy-0.030
gh-0.028
ggie-0.027
 seek-0.027
Token<|endoftext|>
Token ID50256
Feature activation+3.08e+00

Pos logit contributions
ener+0.037
elling+0.036
mith+0.034
aughter+0.034
ishly+0.033
Neg logit contributions
One-0.021
On-0.018
 much-0.017
 che-0.016
 not-0.016
TokenOnce
Token ID7454
Feature activation-5.87e-01

Pos logit contributions
ggy+0.061
One+0.060
gh+0.057
 badly+0.056
ggie+0.055
Neg logit contributions
elling-0.054
aughter-0.051
mith-0.050
ener-0.049
 Me-0.048
Token upon
Token ID2402
Feature activation+4.22e+00

Pos logit contributions
elling+0.014
ener+0.013
ishly+0.012
aughter+0.012
mith+0.012
Neg logit contributions
One-0.009
ggy-0.009
 nap-0.008
 badly-0.008
gh-0.008
Token a
Token ID257
Feature activation+3.39e+00

Pos logit contributions
gh+0.093
aches+0.091
vel+0.087
 quest+0.087
 tresp+0.084
Neg logit contributions
 The-0.052
 Me-0.046
 Little-0.045
 Daddy-0.043
 Mom-0.043
Token time
Token ID640
Feature activation-1.73e-01

Pos logit contributions
 villains+0.050
aches+0.048
cro+0.045
 seek+0.045
 prow+0.045
Neg logit contributions
aughter-0.064
cy-0.063
 Little-0.063
elling-0.060
 Acc-0.060
Token,
Token ID11
Feature activation+4.14e-01

Pos logit contributions
elling+0.003
aughter+0.003
ener+0.003
mith+0.003
ada+0.003
Neg logit contributions
One-0.003
ggy-0.003
gh-0.003
 badly-0.003
 nap-0.003
Token there
Token ID612
Feature activation+6.82e-02

Pos logit contributions
ggy+0.005
One+0.004
gh+0.004
ggie+0.004
 read+0.004
Neg logit contributions
elling-0.010
mith-0.010
aughter-0.010
 Me-0.010
ener-0.010
Token was
Token ID373
Feature activation+3.29e+00

Pos logit contributions
ggy+0.001
One+0.001
gh+0.001
 nap+0.001
 badly+0.001
Neg logit contributions
elling-0.002
ener-0.001
aughter-0.001
mith-0.001
ada-0.001
Token One
Token ID1881
Feature activation-3.25e-01

Pos logit contributions
ener+0.017
elling+0.016
mith+0.016
aughter+0.015
ishly+0.015
Neg logit contributions
One-0.010
 read-0.008
 nap-0.008
On-0.008
 long-0.007
Token day
Token ID1110
Feature activation-1.23e+00

Pos logit contributions
elling+0.006
aughter+0.006
ener+0.006
mith+0.006
 Me+0.005
Neg logit contributions
One-0.006
ggy-0.006
gh-0.006
 badly-0.006
ggie-0.005
Token,
Token ID11
Feature activation-9.91e-01

Pos logit contributions
elling+0.029
aughter+0.027
mith+0.027
ener+0.026
ishly+0.025
Neg logit contributions
ggy-0.020
One-0.020
 nap-0.019
 long-0.018
 badly-0.018
Token she
Token ID673
Feature activation+1.58e+00

Pos logit contributions
elling+0.025
aughter+0.022
ener+0.022
mith+0.021
 Acc+0.021
Neg logit contributions
 long-0.016
One-0.016
 read-0.016
ggy-0.015
 nap-0.015
Token saw
Token ID2497
Feature activation-8.12e-01

Pos logit contributions
One+0.027
ggy+0.027
gh+0.026
 badly+0.025
ggie+0.025
Neg logit contributions
elling-0.031
aughter-0.029
ener-0.029
 Girl-0.029
ada-0.028
Token a
Token ID257
Feature activation+5.38e+00

Pos logit contributions
elling+0.022
aughter+0.020
ener+0.020
mith+0.020
ada+0.020
Neg logit contributions
One-0.009
ggy-0.009
gh-0.008
 badly-0.008
 seek-0.008
Token carpet
Token ID20710
Feature activation+3.38e+00

Pos logit contributions
One+0.081
gh+0.074
ggy+0.073
ggie+0.067
aches+0.067
Neg logit contributions
 Girl-0.115
aughter-0.113
elling-0.111
ener-0.110
 Me-0.108
Token in
Token ID287
Feature activation-1.75e-01

Pos logit contributions
One+0.056
ggy+0.054
gh+0.052
ggie+0.048
 villains+0.048
Neg logit contributions
elling-0.069
mith-0.066
ener-0.065
ishly-0.064
aughter-0.064
Token the
Token ID262
Feature activation-1.38e+00

Pos logit contributions
 Me+0.002
elling+0.002
mith+0.002
 Little+0.002
aughter+0.002
Neg logit contributions
gh-0.005
ggy-0.005
ggie-0.004
One-0.004
vel-0.004
Token hallway
Token ID23959
Feature activation+1.32e+00

Pos logit contributions
elling+0.031
aughter+0.029
ener+0.029
mith+0.028
ada+0.027
Neg logit contributions
ggy-0.022
One-0.022
gh-0.020
 badly-0.019
ggie-0.019
Token and
Token ID290
Feature activation-9.06e-01

Pos logit contributions
One+0.020
ggy+0.020
gh+0.019
 nap+0.018
 badly+0.018
Neg logit contributions
elling-0.029
aughter-0.027
ener-0.027
mith-0.027
ada-0.026
Token upon
Token ID2402
Feature activation+4.22e+00

Pos logit contributions
One+0.003
ggy+0.002
 nap+0.002
 badly+0.002
 long+0.002
Neg logit contributions
elling-0.004
ener-0.004
ishly-0.004
 Acc-0.004
aughter-0.004
Token a
Token ID257
Feature activation+3.85e+00

Pos logit contributions
gh+0.091
aches+0.087
 plank+0.084
 quest+0.084
 lair+0.083
Neg logit contributions
 The-0.054
 Little-0.053
 Me-0.052
 Daddy-0.047
 Joe-0.046
Token time
Token ID640
Feature activation+2.29e-01

Pos logit contributions
 villains+0.056
aches+0.055
 seek+0.052
cro+0.050
 prow+0.049
Neg logit contributions
aughter-0.075
 Little-0.074
cy-0.073
 Me-0.072
elling-0.071
Token there
Token ID612
Feature activation+6.48e-01

Pos logit contributions
One+0.004
ggy+0.004
gh+0.004
 badly+0.004
 nap+0.004
Neg logit contributions
elling-0.005
ener-0.004
aughter-0.004
mith-0.004
ada-0.004
Token was
Token ID373
Feature activation+3.22e+00

Pos logit contributions
ggy+0.010
One+0.010
gh+0.009
 nap+0.009
ggie+0.009
Neg logit contributions
elling-0.015
ener-0.014
aughter-0.014
mith-0.013
ada-0.013
Token a
Token ID257
Feature activation+6.10e+00

Pos logit contributions
ggy+0.042
gh+0.041
One+0.040
aches+0.038
 badly+0.037
Neg logit contributions
elling-0.074
aughter-0.071
 Me-0.070
 Little-0.069
ada-0.067
Token orange
Token ID10912
Feature activation+1.79e+00

Pos logit contributions
gh+0.103
One+0.102
ggy+0.100
aches+0.093
 badly+0.093
Neg logit contributions
aughter-0.115
elling-0.113
 Girl-0.112
 Me-0.111
ener-0.110
Token cat
Token ID3797
Feature activation+1.43e+00

Pos logit contributions
One+0.032
ggy+0.032
gh+0.030
 badly+0.029
 nap+0.029
Neg logit contributions
elling-0.035
aughter-0.033
ener-0.032
mith-0.032
ada-0.031
Token who
Token ID508
Feature activation+4.08e-01

Pos logit contributions
One+0.024
gh+0.024
ggy+0.023
 nap+0.023
 seek+0.023
Neg logit contributions
elling-0.026
ishly-0.025
ener-0.024
mith-0.024
aughter-0.024
Token lived
Token ID5615
Feature activation+5.50e+00

Pos logit contributions
One+0.007
ggy+0.007
gh+0.007
 badly+0.006
 nap+0.006
Neg logit contributions
elling-0.008
 Me-0.007
 Acc-0.007
aughter-0.007
ener-0.007
Token in
Token ID287
Feature activation+1.44e+00

Pos logit contributions
 villains+0.116
gh+0.116
ggy+0.114
 quest+0.111
 hunt+0.108
Neg logit contributions
cy-0.072
aughter-0.071
ting-0.070
elling-0.068
apped-0.067
Token they
Token ID484
Feature activation-2.41e-01

Pos logit contributions
elling+0.089
aughter+0.081
ener+0.080
mith+0.078
ada+0.075
Neg logit contributions
One-0.105
ggy-0.105
gh-0.098
 badly-0.096
 read-0.096
Token joined
Token ID5399
Feature activation-1.30e+00

Pos logit contributions
elling+0.005
aughter+0.005
ener+0.005
mith+0.005
ada+0.005
Neg logit contributions
One-0.004
ggy-0.004
gh-0.004
 badly-0.003
ggie-0.003
Token her
Token ID607
Feature activation-2.13e+00

Pos logit contributions
elling+0.030
ener+0.029
mith+0.028
aughter+0.028
ishly+0.027
Neg logit contributions
One-0.019
ggy-0.018
 read-0.018
 nap-0.018
 badly-0.017
Token,
Token ID11
Feature activation-1.31e+00

Pos logit contributions
elling+0.052
aughter+0.049
mith+0.048
ener+0.047
ada+0.047
Neg logit contributions
ggy-0.029
One-0.029
 nap-0.028
 read-0.027
 seek-0.027
Token they
Token ID484
Feature activation+2.21e-01

Pos logit contributions
elling+0.029
aughter+0.027
ener+0.026
mith+0.025
ada+0.025
Neg logit contributions
One-0.022
 read-0.022
 nap-0.021
 not-0.021
ggy-0.021
Token could
Token ID714
Feature activation+5.16e+00

Pos logit contributions
One+0.003
ggy+0.003
 read+0.003
 nap+0.003
 che+0.003
Neg logit contributions
elling-0.005
aughter-0.005
ener-0.005
mith-0.005
 Acc-0.005
Token help
Token ID1037
Feature activation-2.09e+00

Pos logit contributions
ggy+0.072
One+0.071
gh+0.068
 badly+0.065
ggie+0.064
Neg logit contributions
elling-0.113
mith-0.108
ener-0.107
ishly-0.107
aughter-0.107
Token even
Token ID772
Feature activation-9.54e-01

Pos logit contributions
elling+0.048
mith+0.044
aughter+0.044
ener+0.043
 Acc+0.042
Neg logit contributions
ggy-0.034
One-0.033
 read-0.031
gh-0.031
 nap-0.030
Token more
Token ID517
Feature activation-1.38e+00

Pos logit contributions
elling+0.019
ener+0.018
mith+0.017
aughter+0.017
 Girl+0.017
Neg logit contributions
ggy-0.017
gh-0.016
One-0.015
ggie-0.015
 badly-0.014
Token.
Token ID13
Feature activation-8.25e-01

Pos logit contributions
elling+0.033
aughter+0.031
ener+0.031
mith+0.030
ada+0.029
Neg logit contributions
One-0.020
ggy-0.020
gh-0.018
 nap-0.017
 badly-0.017
Token
Token ID198
Feature activation+6.84e+00

Pos logit contributions
ener+0.024
elling+0.024
isher+0.022
mith+0.021
ening+0.020
Neg logit contributions
One-0.011
On-0.008
Once-0.008
ggy-0.008
 read-0.007
Token turn
Token ID1210
Feature activation-1.55e-01

Pos logit contributions
elling+0.003
ener+0.003
mith+0.003
ada+0.003
ishly+0.003
Neg logit contributions
ggy-0.003
gh-0.003
 villains-0.003
One-0.003
ggie-0.003
Token into
Token ID656
Feature activation-7.47e-01

Pos logit contributions
aughter+0.003
elling+0.003
mith+0.003
ener+0.003
 Me+0.003
Neg logit contributions
One-0.003
 badly-0.002
ggy-0.002
 nap-0.002
 long-0.002
Token a
Token ID257
Feature activation+2.73e+00

Pos logit contributions
elling+0.012
mith+0.011
ener+0.010
aughter+0.010
 Acc+0.010
Neg logit contributions
ggy-0.016
gh-0.016
One-0.016
ggie-0.015
On-0.014
Token bird
Token ID6512
Feature activation+1.94e+00

Pos logit contributions
ggy+0.045
One+0.044
gh+0.043
ggie+0.041
On+0.040
Neg logit contributions
elling-0.053
ener-0.050
aughter-0.050
 Girl-0.049
mith-0.049
Token so
Token ID523
Feature activation+5.98e-01

Pos logit contributions
ggy+0.031
gh+0.030
One+0.030
ggie+0.028
 che+0.027
Neg logit contributions
elling-0.041
ener-0.038
mith-0.036
aughter-0.036
ishly-0.035
Token I
Token ID314
Feature activation+1.79e+00

Pos logit contributions
ggy+0.011
One+0.011
gh+0.011
ggie+0.010
 che+0.010
Neg logit contributions
elling-0.011
aughter-0.010
ener-0.010
mith-0.010
 Acc-0.010
Token can
Token ID460
Feature activation+4.16e+00

Pos logit contributions
ggy+0.043
One+0.041
gh+0.041
ggie+0.040
 badly+0.039
Neg logit contributions
elling-0.025
ener-0.023
mith-0.022
aughter-0.022
ada-0.021
Token fly
Token ID6129
Feature activation+2.76e+00

Pos logit contributions
gh+0.063
ggy+0.062
ggie+0.059
 badly+0.057
One+0.057
Neg logit contributions
elling-0.084
ishly-0.081
ener-0.078
als-0.078
ada-0.078
Token too
Token ID1165
Feature activation+1.90e+00

Pos logit contributions
ggy+0.044
One+0.043
gh+0.041
ggie+0.039
 che+0.038
Neg logit contributions
elling-0.060
ener-0.056
aughter-0.055
mith-0.055
ada-0.053
Token?"
Token ID1701
Feature activation-1.34e+00

Pos logit contributions
ggy+0.036
gh+0.034
ggie+0.033
 che+0.032
 villains+0.030
Neg logit contributions
elling-0.039
mith-0.034
aughter-0.032
ener-0.032
als-0.031
Token
Token ID198
Feature activation+6.00e+00

Pos logit contributions
elling+0.033
aughter+0.031
ener+0.031
mith+0.029
ada+0.029
Neg logit contributions
One-0.021
ggy-0.020
 read-0.018
 che-0.018
On-0.017
Token there
Token ID612
Feature activation+1.27e-01

Pos logit contributions
One+0.055
ggy+0.052
 nap+0.049
 villains+0.049
gh+0.049
Neg logit contributions
elling-0.085
mith-0.082
ener-0.082
aughter-0.081
ishly-0.079
Token with
Token ID351
Feature activation-1.43e+00

Pos logit contributions
One+0.002
ggy+0.002
 badly+0.002
 hidden+0.002
gh+0.002
Neg logit contributions
elling-0.003
aughter-0.003
ener-0.003
mith-0.003
ada-0.003
Token her
Token ID607
Feature activation-2.03e+00

Pos logit contributions
elling+0.032
aughter+0.030
ener+0.030
mith+0.029
ada+0.028
Neg logit contributions
One-0.023
ggy-0.022
gh-0.021
 nap-0.020
 read-0.020
Token and
Token ID290
Feature activation-1.77e+00

Pos logit contributions
elling+0.049
aughter+0.045
ada+0.044
mith+0.043
 Acc+0.042
Neg logit contributions
One-0.030
ggy-0.029
 nap-0.029
 badly-0.028
 hidden-0.028
Token that
Token ID326
Feature activation-4.48e+00

Pos logit contributions
elling+0.041
aughter+0.039
ener+0.037
ada+0.037
mith+0.037
Neg logit contributions
ggy-0.027
 read-0.027
 not-0.027
 hidden-0.027
 badly-0.026
Token she
Token ID673
Feature activation+5.56e-01

Pos logit contributions
elling+0.095
aughter+0.086
ener+0.085
ada+0.083
mith+0.083
Neg logit contributions
One-0.080
ggy-0.079
 read-0.076
 hidden-0.075
 nap-0.074
Token had
Token ID550
Feature activation+2.72e+00

Pos logit contributions
ggy+0.008
One+0.008
 read+0.008
 che+0.007
 badly+0.007
Neg logit contributions
elling-0.013
aughter-0.012
ener-0.012
mith-0.012
ada-0.012
Token listened
Token ID16399
Feature activation-8.36e-01

Pos logit contributions
One+0.037
ggy+0.036
gh+0.032
 badly+0.032
ggie+0.031
Neg logit contributions
elling-0.062
aughter-0.059
ener-0.059
mith-0.058
 Girl-0.057
Token to
Token ID284
Feature activation-1.88e+00

Pos logit contributions
elling+0.018
aughter+0.017
ener+0.017
mith+0.017
ada+0.016
Neg logit contributions
One-0.014
ggy-0.013
gh-0.013
 badly-0.012
On-0.012
Token her
Token ID607
Feature activation-2.80e+00

Pos logit contributions
elling+0.040
aughter+0.037
ener+0.037
mith+0.036
ada+0.035
Neg logit contributions
One-0.033
ggy-0.033
 read-0.031
 not-0.030
gh-0.030
Token mom
Token ID1995
Feature activation-2.43e+00

Pos logit contributions
elling+0.055
aughter+0.050
mith+0.049
ada+0.048
ener+0.047
Neg logit contributions
One-0.053
ggy-0.053
 long-0.050
 badly-0.050
 read-0.050
Token j
Token ID474
Feature activation+7.98e-01

Pos logit contributions
ggy+0.063
gh+0.063
aches+0.062
 badly+0.059
ggie+0.055
Neg logit contributions
cy-0.092
elling-0.091
aughter-0.090
 Me-0.089
 Little-0.089
Tokenolly
Token ID5098
Feature activation-4.20e-01

Pos logit contributions
One+0.017
ggy+0.017
gh+0.016
 badly+0.015
 nap+0.015
Neg logit contributions
elling-0.013
ener-0.012
aughter-0.012
mith-0.012
 Me-0.011
Token little
Token ID1310
Feature activation-1.15e+00

Pos logit contributions
elling+0.010
mith+0.010
aughter+0.010
ener+0.009
ishly+0.009
Neg logit contributions
ggy-0.006
One-0.006
 nap-0.005
 read-0.005
 che-0.005
Token town
Token ID3240
Feature activation+1.13e+00

Pos logit contributions
elling+0.023
aughter+0.022
ener+0.022
mith+0.021
ada+0.021
Neg logit contributions
ggy-0.020
One-0.020
gh-0.019
ggie-0.018
 badly-0.018
Token,
Token ID11
Feature activation+2.10e-01

Pos logit contributions
ggy+0.021
One+0.021
 badly+0.019
 read+0.019
gh+0.019
Neg logit contributions
elling-0.023
aughter-0.021
ener-0.021
mith-0.021
ishly-0.020
Token there
Token ID612
Feature activation+8.84e-01

Pos logit contributions
ggy+0.002
One+0.002
gh+0.002
ggie+0.002
 nap+0.002
Neg logit contributions
elling-0.006
aughter-0.006
mith-0.005
ener-0.005
 Me-0.005
Token lived
Token ID5615
Feature activation+4.47e+00

Pos logit contributions
One+0.017
ggy+0.017
gh+0.016
 badly+0.015
 seek+0.015
Neg logit contributions
elling-0.016
 Me-0.015
aughter-0.015
mith-0.014
 Acc-0.014
Token a
Token ID257
Feature activation+6.75e+00

Pos logit contributions
gh+0.076
ggy+0.073
aches+0.069
 quest+0.069
 hunt+0.068
Neg logit contributions
aughter-0.086
 Me-0.085
elling-0.082
 Little-0.079
ada-0.078
Token girl
Token ID2576
Feature activation+2.79e-01

Pos logit contributions
One+0.116
gh+0.114
ggy+0.114
ggie+0.104
 badly+0.104
Neg logit contributions
elling-0.128
aughter-0.126
 Me-0.122
ener-0.122
 Girl-0.122
Token named
Token ID3706
Feature activation-1.51e+00

Pos logit contributions
One+0.004
gh+0.004
ggy+0.004
 badly+0.004
ggie+0.004
Neg logit contributions
elling-0.006
aughter-0.005
ener-0.005
mith-0.005
 Me-0.005
Token Lucy
Token ID22162
Feature activation-3.59e-01

Pos logit contributions
elling+0.024
ener+0.023
mith+0.023
ishly+0.021
aughter+0.020
Neg logit contributions
One-0.036
ggy-0.033
 read-0.033
 nap-0.033
 long-0.032
Token when
Token ID618
Feature activation-3.12e+00

Pos logit contributions
elling+0.048
aughter+0.043
mith+0.041
ener+0.041
ada+0.039
Neg logit contributions
One-0.037
ggy-0.037
 not-0.036
 long-0.036
 read-0.036
Token the
Token ID262
Feature activation-2.28e+00

Pos logit contributions
elling+0.066
aughter+0.062
ener+0.059
mith+0.058
ada+0.057
Neg logit contributions
One-0.055
ggy-0.055
 nap-0.052
 read-0.051
 seek-0.051
Token two
Token ID734
Feature activation-4.38e+00

Pos logit contributions
elling+0.047
aughter+0.044
mith+0.043
ener+0.043
ada+0.042
Neg logit contributions
ggy-0.039
One-0.039
 nap-0.035
gh-0.035
 che-0.035
Token of
Token ID286
Feature activation-1.72e+00

Pos logit contributions
elling+0.106
aughter+0.102
mith+0.099
 Acc+0.098
ener+0.097
Neg logit contributions
ggy-0.062
One-0.060
 che-0.055
 nap-0.055
gh-0.054
Token them
Token ID606
Feature activation-1.32e+00

Pos logit contributions
elling+0.029
aughter+0.026
ener+0.026
mith+0.026
ada+0.024
Neg logit contributions
One-0.037
ggy-0.036
gh-0.034
 badly-0.034
 nap-0.033
Token went
Token ID1816
Feature activation+9.23e-01

Pos logit contributions
elling+0.033
aughter+0.031
mith+0.030
ener+0.030
ada+0.030
Neg logit contributions
One-0.017
ggy-0.017
 read-0.016
 che-0.016
 nap-0.016
Token to
Token ID284
Feature activation+7.59e-01

Pos logit contributions
One+0.012
ggy+0.012
 hidden+0.012
 seek+0.011
 nap+0.011
Neg logit contributions
elling-0.024
aughter-0.022
ener-0.022
 Acc-0.021
mith-0.021
Token the
Token ID262
Feature activation-1.33e+00

Pos logit contributions
One+0.012
ggy+0.012
gh+0.011
 badly+0.010
ggie+0.010
Neg logit contributions
elling-0.017
aughter-0.016
mith-0.016
ener-0.016
 Me-0.015
Token store
Token ID3650
Feature activation-3.06e-01

Pos logit contributions
elling+0.031
aughter+0.029
ener+0.029
mith+0.028
ada+0.028
Neg logit contributions
One-0.020
ggy-0.019
gh-0.018
 badly-0.017
ggie-0.017
Token,
Token ID11
Feature activation-2.04e+00

Pos logit contributions
elling+0.008
 Acc+0.007
ener+0.007
aughter+0.007
ted+0.007
Neg logit contributions
 not-0.004
 hidden-0.004
 badly-0.004
One-0.004
ggy-0.004
Token there
Token ID612
Feature activation-4.33e-01

Pos logit contributions
elling+0.048
aughter+0.040
ener+0.040
 Acc+0.039
mith+0.037
Neg logit contributions
 not-0.036
 long-0.036
ggy-0.036
 read-0.035
 seek-0.035
Token lion
Token ID18744
Feature activation-1.24e-01

Pos logit contributions
elling+0.042
 Me+0.042
ener+0.041
 Girl+0.038
aughter+0.037
Neg logit contributions
gh-0.056
 badly-0.054
aches-0.054
One-0.053
On-0.051
Token was
Token ID373
Feature activation+5.62e+00

Pos logit contributions
elling+0.003
ener+0.003
aughter+0.003
mith+0.003
ishly+0.002
Neg logit contributions
One-0.002
ggy-0.002
gh-0.002
 nap-0.002
On-0.002
Token too
Token ID1165
Feature activation+4.79e+00

Pos logit contributions
P+0.122
gh+0.121
One+0.117
 nap+0.114
irl+0.110
Neg logit contributions
ting-0.068
arted-0.066
ener-0.065
ishly-0.064
 Joe-0.059
Token fast
Token ID3049
Feature activation+9.95e-02

Pos logit contributions
gh+0.079
ggy+0.071
 nap+0.069
aches+0.069
ggie+0.068
Neg logit contributions
elling-0.093
ener-0.090
 Girl-0.088
 Little-0.087
aughter-0.087
Token.
Token ID13
Feature activation-2.36e+00

Pos logit contributions
One+0.002
ggy+0.002
gh+0.002
ggie+0.001
 seek+0.001
Neg logit contributions
elling-0.002
ener-0.002
aughter-0.002
mith-0.002
ada-0.002
Token
Token ID220
Feature activation+1.53e+00

Pos logit contributions
elling+0.044
mith+0.041
aughter+0.041
ener+0.041
ada+0.038
Neg logit contributions
One-0.048
 che-0.045
 long-0.045
On-0.045
 not-0.045
Token
Token ID198
Feature activation+6.23e+00

Pos logit contributions
One+0.022
ggy+0.021
On+0.019
 badly+0.019
 not+0.019
Neg logit contributions
elling-0.038
ener-0.037
aughter-0.035
mith-0.035
ishly-0.034
Token
Token ID198
Feature activation+4.84e+00

Pos logit contributions
One+0.104
ggy+0.100
gh+0.092
On+0.091
 badly+0.089
Neg logit contributions
elling-0.137
ener-0.129
aughter-0.127
mith-0.126
ishly-0.122
TokenL
Token ID43
Feature activation-4.83e+00

Pos logit contributions
ggy+0.082
One+0.080
gh+0.078
 nap+0.076
ggie+0.075
Neg logit contributions
elling-0.099
aughter-0.092
ener-0.090
mith-0.088
ada-0.086
Tokenily
Token ID813
Feature activation-7.88e-01

Pos logit contributions
elling+0.085
aughter+0.079
ener+0.075
mith+0.073
ada+0.072
Neg logit contributions
One-0.101
ggy-0.099
gh-0.095
 che-0.092
 prow-0.091
Token saw
Token ID2497
Feature activation-2.49e+00

Pos logit contributions
elling+0.018
aughter+0.017
mith+0.016
ener+0.016
ada+0.016
Neg logit contributions
ggy-0.013
One-0.013
gh-0.011
 read-0.011
 che-0.011
Token excited
Token ID6568
Feature activation-5.58e-01

Pos logit contributions
ggy+0.052
One+0.052
gh+0.049
ggie+0.047
 nap+0.046
Neg logit contributions
elling-0.057
aughter-0.053
ener-0.051
mith-0.051
 Acc-0.051
Token,
Token ID11
Feature activation+2.18e-01

Pos logit contributions
elling+0.013
mith+0.013
aughter+0.013
ener+0.013
 Acc+0.012
Neg logit contributions
ggy-0.008
One-0.008
gh-0.007
 badly-0.007
ggie-0.007
Token because
Token ID780
Feature activation-2.20e+00

Pos logit contributions
One+0.005
ggy+0.004
On+0.004
 read+0.004
 seek+0.004
Neg logit contributions
elling-0.004
aughter-0.004
ener-0.004
mith-0.003
 Acc-0.003
Token it
Token ID340
Feature activation-9.59e-01

Pos logit contributions
elling+0.048
aughter+0.046
ener+0.045
mith+0.044
ada+0.043
Neg logit contributions
One-0.036
ggy-0.035
gh-0.032
 badly-0.031
 nap-0.031
Token was
Token ID373
Feature activation+5.41e+00

Pos logit contributions
elling+0.019
aughter+0.018
ener+0.018
mith+0.018
 Me+0.017
Neg logit contributions
One-0.017
ggy-0.017
gh-0.016
 badly-0.015
ggie-0.015
Token Sunday
Token ID3502
Feature activation-1.93e+00

Pos logit contributions
One+0.074
ggy+0.073
gh+0.067
 badly+0.064
 nap+0.064
Neg logit contributions
elling-0.131
aughter-0.124
ener-0.123
mith-0.122
ada-0.118
Token.
Token ID13
Feature activation-7.47e-01

Pos logit contributions
elling+0.052
ener+0.050
mith+0.049
 Acc+0.049
aughter+0.048
Neg logit contributions
ggy-0.022
One-0.022
 long-0.021
 nap-0.021
 read-0.021
Token All
Token ID1439
Feature activation-1.28e+00

Pos logit contributions
elling+0.012
ener+0.011
mith+0.011
aughter+0.011
 Acc+0.010
Neg logit contributions
One-0.017
ggy-0.016
On-0.016
 nap-0.016
 che-0.015
Token week
Token ID1285
Feature activation-2.74e+00

Pos logit contributions
ener+0.029
ishly+0.029
aughter+0.028
mith+0.028
elling+0.028
Neg logit contributions
One-0.018
 nap-0.018
 much-0.017
 che-0.017
ggy-0.017
Token he
Token ID339
Feature activation+1.30e+00

Pos logit contributions
elling+0.053
ener+0.049
mith+0.049
aughter+0.048
 Acc+0.046
Neg logit contributions
ggy-0.052
One-0.052
 nap-0.049
gh-0.048
 read-0.047
Token had
Token ID550
Feature activation+4.09e+00

Pos logit contributions
One+0.020
ggy+0.019
gh+0.018
 nap+0.017
 read+0.017
Neg logit contributions
elling-0.030
aughter-0.028
ener-0.027
mith-0.027
ada-0.026
Token.
Token ID13
Feature activation-2.50e+00

Pos logit contributions
elling+0.059
aughter+0.056
ener+0.055
mith+0.055
ada+0.053
Neg logit contributions
One-0.035
ggy-0.034
 badly-0.031
gh-0.031
 nap-0.030
Token She
Token ID1375
Feature activation+1.41e+00

Pos logit contributions
elling+0.056
mith+0.054
ener+0.053
isher+0.053
aughter+0.050
Neg logit contributions
One-0.043
 not-0.040
 long-0.040
ggy-0.039
 nap-0.038
Token tries
Token ID8404
Feature activation+6.56e-01

Pos logit contributions
One+0.034
gh+0.033
ggy+0.033
 seek+0.032
ggie+0.032
Neg logit contributions
elling-0.020
ener-0.017
 Me-0.016
ted-0.016
aughter-0.016
Token to
Token ID284
Feature activation+1.64e+00

Pos logit contributions
One+0.011
ggy+0.011
gh+0.010
 badly+0.010
 nap+0.010
Neg logit contributions
elling-0.014
aughter-0.013
ener-0.013
mith-0.013
ada-0.012
Token catch
Token ID4929
Feature activation-3.05e+00

Pos logit contributions
One+0.025
gh+0.024
ggy+0.024
 badly+0.022
On+0.022
Neg logit contributions
elling-0.034
ener-0.034
 Me-0.033
mith-0.032
aughter-0.032
Token him
Token ID683
Feature activation-1.70e+00

Pos logit contributions
elling+0.069
 Acc+0.066
mith+0.066
aughter+0.065
ener+0.062
Neg logit contributions
 nap-0.049
One-0.049
ggy-0.047
 che-0.046
 well-0.044
Token.
Token ID13
Feature activation-2.38e+00

Pos logit contributions
elling+0.039
aughter+0.037
ener+0.036
mith+0.036
ada+0.035
Neg logit contributions
One-0.026
ggy-0.025
gh-0.023
 badly-0.023
 nap-0.022
Token But
Token ID887
Feature activation-3.34e+00

Pos logit contributions
elling+0.049
ener+0.045
mith+0.044
aughter+0.043
isher+0.042
Neg logit contributions
One-0.049
 not-0.047
 long-0.046
On-0.045
 nap-0.044
Token she
Token ID673
Feature activation+1.43e+00

Pos logit contributions
elling+0.084
aughter+0.077
ener+0.073
ada+0.072
mith+0.071
Neg logit contributions
 not-0.053
ggy-0.051
One-0.050
 nap-0.049
 che-0.048
Token slips
Token ID33841
Feature activation-4.12e-01

Pos logit contributions
One+0.031
ggy+0.030
gh+0.029
 seek+0.029
On+0.028
Neg logit contributions
elling-0.023
ener-0.021
aughter-0.021
 Me-0.020
mith-0.020
Token too
Token ID1165
Feature activation+2.53e+00

Pos logit contributions
elling+0.009
aughter+0.008
ener+0.008
mith+0.008
 Me+0.008
Neg logit contributions
One-0.007
ggy-0.007
gh-0.006
ggie-0.006
 badly-0.006
Token one
Token ID530
Feature activation-1.18e+00

Pos logit contributions
elling+0.049
aughter+0.048
ener+0.046
ishly+0.045
mith+0.045
Neg logit contributions
One-0.047
ggy-0.044
 read-0.042
 hidden-0.042
 long-0.042
Token day
Token ID1110
Feature activation-2.53e+00

Pos logit contributions
elling+0.027
aughter+0.026
ishly+0.026
ener+0.025
mith+0.025
Neg logit contributions
One-0.019
ggy-0.018
 read-0.017
 hidden-0.017
 nap-0.017
Token you
Token ID345
Feature activation+6.18e-01

Pos logit contributions
elling+0.055
aughter+0.053
mith+0.052
ener+0.051
ishly+0.049
Neg logit contributions
One-0.042
ggy-0.041
 long-0.040
 not-0.040
 nap-0.040
Token can
Token ID460
Feature activation+3.56e+00

Pos logit contributions
ggy+0.013
One+0.013
gh+0.013
ggie+0.012
 badly+0.012
Neg logit contributions
elling-0.010
ener-0.009
aughter-0.009
mith-0.009
ada-0.009
Token design
Token ID1486
Feature activation+9.99e-01

Pos logit contributions
ggy+0.063
gh+0.057
 badly+0.057
ggie+0.057
One+0.057
Neg logit contributions
elling-0.067
mith-0.063
ener-0.063
ada-0.063
aughter-0.061
Token your
Token ID534
Feature activation-2.11e+00

Pos logit contributions
One+0.013
ggy+0.013
gh+0.012
 read+0.011
 badly+0.011
Neg logit contributions
elling-0.025
aughter-0.024
ener-0.024
mith-0.023
ishly-0.023
Token own
Token ID898
Feature activation-2.64e+00

Pos logit contributions
elling+0.053
aughter+0.049
ada+0.048
mith+0.048
ener+0.048
Neg logit contributions
One-0.028
ggy-0.027
 nap-0.025
 hidden-0.025
 long-0.025
Token elevator
Token ID20932
Feature activation+2.17e+00

Pos logit contributions
elling+0.059
aughter+0.056
ener+0.054
ada+0.053
ishly+0.053
Neg logit contributions
One-0.041
ggy-0.041
 hidden-0.038
 read-0.038
gh-0.038
Token!"
Token ID2474
Feature activation-1.65e+00

Pos logit contributions
One+0.032
ggy+0.031
 read+0.030
 not+0.028
 hidden+0.028
Neg logit contributions
elling-0.050
aughter-0.049
ada-0.046
mith-0.046
ener-0.045
Token
Token ID220
Feature activation+1.23e+00

Pos logit contributions
elling+0.046
ener+0.046
aughter+0.044
mith+0.042
isher+0.041
Neg logit contributions
One-0.022
On-0.019
 much-0.019
 che-0.018
Once-0.018
Token
Token ID198
Feature activation+5.88e+00

Pos logit contributions
One+0.016
ggy+0.016
On+0.014
 not+0.014
 badly+0.014
Neg logit contributions
elling-0.032
ener-0.031
 Girl-0.029
mith-0.029
aughter-0.029
Token it
Token ID340
Feature activation+9.02e-02

Pos logit contributions
elling+0.091
aughter+0.087
ener+0.084
ada+0.082
 Acc+0.082
Neg logit contributions
One-0.061
ggy-0.059
 badly-0.058
 hidden-0.056
 che-0.055
Token.
Token ID13
Feature activation-1.59e+00

Pos logit contributions
One+0.001
ggy+0.001
gh+0.001
ggie+0.001
 badly+0.001
Neg logit contributions
elling-0.002
ener-0.002
aughter-0.002
mith-0.002
 Me-0.002
Token
Token ID198
Feature activation+6.46e+00

Pos logit contributions
ener+0.039
elling+0.038
mith+0.035
aughter+0.034
ishly+0.033
Neg logit contributions
One-0.026
On-0.023
 much-0.022
 nap-0.022
 read-0.021
Token
Token ID198
Feature activation+5.19e+00

Pos logit contributions
ggy+0.092
One+0.090
gh+0.087
 badly+0.086
 che+0.085
Neg logit contributions
elling-0.142
aughter-0.138
mith-0.134
 Me-0.131
 Acc-0.130
TokenOne
Token ID3198
Feature activation-8.20e-01

Pos logit contributions
ggy+0.082
 badly+0.077
 nap+0.076
 che+0.075
 prow+0.075
Neg logit contributions
elling-0.102
aughter-0.099
mith-0.096
 Acc-0.094
 Me-0.093
Token day
Token ID1110
Feature activation-2.29e+00

Pos logit contributions
elling+0.016
aughter+0.015
ener+0.015
mith+0.015
ada+0.014
Neg logit contributions
One-0.015
ggy-0.015
gh-0.014
 badly-0.014
 nap-0.014
Token Jack
Token ID3619
Feature activation-1.95e+00

Pos logit contributions
elling+0.054
aughter+0.048
ener+0.047
 Acc+0.047
mith+0.047
Neg logit contributions
ggy-0.037
One-0.036
 nap-0.035
gh-0.035
 che-0.035
Token got
Token ID1392
Feature activation+3.96e+00

Pos logit contributions
elling+0.043
aughter+0.041
ener+0.040
mith+0.040
ada+0.038
Neg logit contributions
One-0.031
ggy-0.030
gh-0.028
 badly-0.027
 nap-0.027
Token the
Token ID262
Feature activation-7.66e-01

Pos logit contributions
One+0.055
ggy+0.055
gh+0.050
 badly+0.049
 nap+0.048
Neg logit contributions
elling-0.095
aughter-0.090
ener-0.089
mith-0.089
ada-0.085
Token idea
Token ID2126
Feature activation-1.03e+00

Pos logit contributions
elling+0.018
aughter+0.017
ener+0.017
mith+0.016
ada+0.016
Neg logit contributions
One-0.011
ggy-0.011
gh-0.010
ggie-0.010
On-0.010
Token to
Token ID284
Feature activation+1.04e+00

Pos logit contributions
elling+0.023
aughter+0.022
ener+0.021
mith+0.021
ada+0.020
Neg logit contributions
ggy-0.016
One-0.016
gh-0.015
 read-0.015
 nap-0.015
Token
Token ID198
Feature activation+6.72e+00

Pos logit contributions
elling+0.036
ener+0.035
aughter+0.033
ishly+0.031
mith+0.031
Neg logit contributions
One-0.036
ggy-0.034
 nap-0.032
 long-0.032
 che-0.032
Token
Token ID198
Feature activation+5.07e+00

Pos logit contributions
ggy+0.121
One+0.120
gh+0.115
 badly+0.111
ggie+0.110
Neg logit contributions
elling-0.129
aughter-0.121
mith-0.119
ener-0.117
 Me-0.114
Token"
Token ID1
Feature activation-1.35e+00

Pos logit contributions
ggy+0.073
gh+0.069
One+0.069
 nap+0.066
 badly+0.066
Neg logit contributions
elling-0.114
aughter-0.109
mith-0.107
ener-0.105
 Me-0.104
TokenYes
Token ID5297
Feature activation-5.14e+00

Pos logit contributions
elling+0.024
ener+0.022
aughter+0.022
mith+0.022
 Me+0.021
Neg logit contributions
ggy-0.027
One-0.027
gh-0.026
 badly-0.025
ggie-0.025
Token,
Token ID11
Feature activation-4.63e+00

Pos logit contributions
elling+0.117
aughter+0.107
 Acc+0.105
mith+0.105
ener+0.102
Neg logit contributions
ggy-0.087
One-0.082
 badly-0.080
gh-0.079
 nap-0.077
Token we
Token ID356
Feature activation-8.73e-01

Pos logit contributions
elling+0.089
aughter+0.083
ener+0.081
mith+0.080
 Acc+0.077
Neg logit contributions
ggy-0.089
One-0.088
gh-0.081
 nap-0.081
 badly-0.081
Token can
Token ID460
Feature activation+2.74e+00

Pos logit contributions
ada+0.012
elling+0.012
ener+0.012
aughter+0.012
als+0.012
Neg logit contributions
One-0.017
ggy-0.017
 badly-0.016
On-0.016
gh-0.016
Token play
Token ID711
Feature activation+1.24e+00

Pos logit contributions
P+0.049
ggy+0.047
One+0.047
On+0.047
 villains+0.047
Neg logit contributions
ishly-0.046
elling-0.045
ener-0.044
 Me-0.044
als-0.042
Token with
Token ID351
Feature activation-3.31e+00

Pos logit contributions
One+0.020
On+0.019
gh+0.019
 prow+0.018
ggy+0.018
Neg logit contributions
elling-0.023
 Me-0.022
ishly-0.022
ener-0.022
mith-0.022
Token the
Token ID262
Feature activation-1.34e+00

Pos logit contributions
elling+0.070
aughter+0.069
ener+0.068
mith+0.066
ada+0.066
Neg logit contributions
ggy-0.058
One-0.056
 badly-0.053
 nap-0.053
 long-0.051
Token dog
Token ID3290
Feature activation-8.09e-01

Pos logit contributions
elling+0.026
aughter+0.024
ener+0.023
mith+0.023
ada+0.022
Neg logit contributions
One-0.026
ggy-0.025
 nap-0.023
gh-0.023
 badly-0.023
Token Do
Token ID2141
Feature activation-1.69e+00

Pos logit contributions
elling+0.025
ener+0.024
aughter+0.024
ishly+0.023
mith+0.022
Neg logit contributions
One-0.034
ggy-0.030
 read-0.030
On-0.029
 nap-0.028
Token you
Token ID345
Feature activation+3.29e+00

Pos logit contributions
elling+0.028
aughter+0.026
 Me+0.025
ener+0.025
mith+0.025
Neg logit contributions
One-0.034
ggy-0.033
gh-0.032
On-0.032
 badly-0.032
Token remember
Token ID3505
Feature activation-9.63e-01

Pos logit contributions
ggy+0.068
gh+0.062
One+0.062
 badly+0.061
ggie+0.061
Neg logit contributions
elling-0.054
ener-0.054
ada-0.050
aughter-0.049
 Acc-0.046
Token where
Token ID810
Feature activation-3.46e+00

Pos logit contributions
elling+0.020
aughter+0.018
ener+0.018
mith+0.018
 Me+0.017
Neg logit contributions
One-0.016
ggy-0.016
gh-0.015
 badly-0.015
ggie-0.015
Token the
Token ID262
Feature activation-2.93e+00

Pos logit contributions
elling+0.071
aughter+0.068
ener+0.066
mith+0.065
ada+0.064
Neg logit contributions
One-0.061
ggy-0.060
 nap-0.056
gh-0.056
 read-0.056
Token other
Token ID584
Feature activation-6.22e+00

Pos logit contributions
elling+0.066
aughter+0.062
ener+0.061
mith+0.061
ada+0.059
Neg logit contributions
One-0.045
ggy-0.044
gh-0.041
 badly-0.039
 nap-0.039
Token star
Token ID3491
Feature activation-3.94e-01

Pos logit contributions
elling+0.139
aughter+0.131
ener+0.129
mith+0.128
ada+0.126
Neg logit contributions
ggy-0.097
One-0.097
 nap-0.088
 read-0.088
gh-0.087
Token is
Token ID318
Feature activation+3.68e+00

Pos logit contributions
elling+0.008
aughter+0.008
mith+0.008
ener+0.008
ada+0.008
Neg logit contributions
ggy-0.006
One-0.006
gh-0.006
 read-0.006
 nap-0.006
Token?"
Token ID1701
Feature activation-1.31e+00

Pos logit contributions
One+0.058
ggy+0.058
gh+0.053
 badly+0.052
ggie+0.052
Neg logit contributions
elling-0.080
ener-0.074
aughter-0.074
mith-0.073
 Acc-0.069
Token Ben
Token ID3932
Feature activation+4.06e-02

Pos logit contributions
elling+0.038
ener+0.034
isher+0.032
aughter+0.032
ishly+0.032
Neg logit contributions
One-0.018
 read-0.016
ggy-0.016
 not-0.015
 che-0.015
Token looked
Token ID3114
Feature activation+2.19e+00

Pos logit contributions
One+0.001
 read+0.001
ggy+0.001
 che+0.001
 nap+0.001
Neg logit contributions
elling-0.001
mith-0.001
aughter-0.001
ada-0.001
ener-0.001
Tokenmy
Token ID1820
Feature activation-2.89e-01

Pos logit contributions
elling+0.020
mith+0.020
aughter+0.020
ener+0.019
ishly+0.019
Neg logit contributions
One-0.020
 read-0.020
ggy-0.019
 hidden-0.018
 che-0.018
Token was
Token ID373
Feature activation+6.66e+00

Pos logit contributions
elling+0.007
mith+0.007
aughter+0.007
ener+0.007
 Me+0.006
Neg logit contributions
ggy-0.004
 read-0.004
One-0.004
 che-0.004
 hidden-0.003
Token tall
Token ID7331
Feature activation+9.01e-01

Pos logit contributions
One+0.102
gh+0.101
ggy+0.099
 villains+0.094
 hunt+0.091
Neg logit contributions
elling-0.138
aughter-0.137
 Me-0.135
 Daughter-0.131
 Acc-0.131
Token and
Token ID290
Feature activation+5.70e-01

Pos logit contributions
One+0.018
ggy+0.018
gh+0.017
 badly+0.017
 nap+0.017
Neg logit contributions
elling-0.016
aughter-0.015
ener-0.014
mith-0.014
ada-0.013
Token the
Token ID262
Feature activation-2.49e+00

Pos logit contributions
One+0.006
ggy+0.006
 read+0.006
 hidden+0.005
 seek+0.005
Neg logit contributions
elling-0.016
ener-0.015
aughter-0.015
mith-0.015
ada-0.015
Token little
Token ID1310
Feature activation-6.42e+00

Pos logit contributions
elling+0.047
aughter+0.044
ener+0.044
mith+0.043
 Me+0.042
Neg logit contributions
ggy-0.047
One-0.047
gh-0.045
ggie-0.043
 badly-0.042
Token girl
Token ID2576
Feature activation+3.23e-01

Pos logit contributions
elling+0.109
 Acc+0.095
aughter+0.095
mith+0.092
ishly+0.088
Neg logit contributions
 nap-0.137
One-0.136
 hidden-0.135
 long-0.135
 badly-0.129
Token was
Token ID373
Feature activation+6.93e+00

Pos logit contributions
ggy+0.004
 read+0.004
One+0.004
 che+0.004
ggie+0.003
Neg logit contributions
elling-0.009
mith-0.008
aughter-0.008
ener-0.008
ada-0.008
Token small
Token ID1402
Feature activation-2.18e-01

Pos logit contributions
One+0.103
gh+0.100
ggy+0.098
 nap+0.093
 villains+0.093
Neg logit contributions
elling-0.147
aughter-0.143
 Me-0.142
 Acc-0.140
ada-0.138
Token.
Token ID13
Feature activation-5.85e-01

Pos logit contributions
elling+0.004
ener+0.004
aughter+0.004
mith+0.004
 Me+0.004
Neg logit contributions
ggy-0.004
One-0.004
gh-0.004
ggie-0.004
On-0.004
Token
Token ID220
Feature activation+3.04e+00

Pos logit contributions
ener+0.017
elling+0.016
isher+0.015
mith+0.015
aughter+0.015
Neg logit contributions
One-0.009
On-0.007
Once-0.006
 read-0.006
 nap-0.006
Token to
Token ID284
Feature activation+2.53e+00

Pos logit contributions
One+0.035
ggy+0.034
gh+0.032
ggie+0.031
 seek+0.031
Neg logit contributions
elling-0.036
ener-0.034
 Acc-0.033
ishly-0.033
aughter-0.033
Token stroke
Token ID14000
Feature activation-2.82e+00

Pos logit contributions
One+0.041
gh+0.039
ggy+0.039
ggie+0.038
P+0.037
Neg logit contributions
ishly-0.052
 Acc-0.051
ener-0.049
elling-0.048
 Suz-0.048
Token it
Token ID340
Feature activation-3.93e-01

Pos logit contributions
aughter+0.076
elling+0.075
 Girl+0.070
ener+0.069
ada+0.068
Neg logit contributions
 badly-0.036
ggy-0.035
 long-0.033
 not-0.033
 much-0.032
Token.
Token ID13
Feature activation-1.17e+00

Pos logit contributions
elling+0.009
aughter+0.009
mith+0.008
ener+0.008
ada+0.008
Neg logit contributions
One-0.006
ggy-0.006
gh-0.005
 badly-0.005
 nap-0.005
Token The
Token ID383
Feature activation-2.47e+00

Pos logit contributions
elling+0.023
ener+0.023
aughter+0.021
mith+0.020
als+0.019
Neg logit contributions
One-0.023
On-0.022
ggy-0.022
 che-0.022
 badly-0.021
Token little
Token ID1310
Feature activation-6.91e+00

Pos logit contributions
elling+0.048
aughter+0.045
ener+0.044
mith+0.044
ada+0.042
Neg logit contributions
One-0.046
ggy-0.045
gh-0.042
 badly-0.041
 nap-0.041
Token bird
Token ID6512
Feature activation+1.17e+00

Pos logit contributions
elling+0.160
aughter+0.158
mith+0.147
ada+0.147
 Acc+0.146
Neg logit contributions
One-0.104
ggy-0.099
 che-0.097
 hidden-0.097
 nap-0.096
Token calmed
Token ID49566
Feature activation-4.16e+00

Pos logit contributions
One+0.016
ggy+0.016
gh+0.015
 badly+0.014
 nap+0.014
Neg logit contributions
elling-0.028
aughter-0.027
ener-0.026
mith-0.026
ada-0.025
Token down
Token ID866
Feature activation-2.12e+00

Pos logit contributions
elling+0.084
aughter+0.080
 Acc+0.079
ion+0.073
mith+0.073
Neg logit contributions
 badly-0.079
 che-0.075
 lost-0.073
 not-0.073
 long-0.072
Token.
Token ID13
Feature activation-1.76e+00

Pos logit contributions
elling+0.056
aughter+0.055
 Acc+0.051
ener+0.051
ada+0.050
Neg logit contributions
One-0.029
 long-0.028
 badly-0.028
 far-0.027
 not-0.027
Token
Token ID220
Feature activation+1.73e+00

Pos logit contributions
elling+0.032
ener+0.030
aughter+0.028
mith+0.027
als+0.026
Neg logit contributions
One-0.038
On-0.036
ggy-0.035
 che-0.035
 badly-0.035
Token went
Token ID1816
Feature activation+9.69e-01

Pos logit contributions
ggy+0.021
gh+0.021
One+0.021
 seek+0.020
ggie+0.019
Neg logit contributions
ishly-0.021
ener-0.021
elling-0.021
ada-0.020
ted-0.020
Token to
Token ID284
Feature activation-1.56e+00

Pos logit contributions
One+0.017
ggy+0.017
gh+0.016
 badly+0.016
 nap+0.016
Neg logit contributions
elling-0.020
aughter-0.018
ener-0.018
mith-0.018
 Me-0.017
Token their
Token ID511
Feature activation-2.30e+00

Pos logit contributions
elling+0.032
ener+0.029
aughter+0.028
mith+0.027
als+0.027
Neg logit contributions
One-0.030
ggy-0.029
 nap-0.029
 read-0.028
 not-0.028
Token mom
Token ID1995
Feature activation-3.06e+00

Pos logit contributions
elling+0.045
aughter+0.042
ener+0.042
mith+0.041
ada+0.040
Neg logit contributions
One-0.042
ggy-0.041
gh-0.039
 badly-0.037
ggie-0.037
Token and
Token ID290
Feature activation-1.73e+00

Pos logit contributions
elling+0.065
aughter+0.060
mith+0.059
ener+0.058
 Acc+0.057
Neg logit contributions
One-0.053
 read-0.051
ggy-0.050
 nap-0.049
 hidden-0.048
Token told
Token ID1297
Feature activation-5.97e+00

Pos logit contributions
elling+0.039
aughter+0.035
ener+0.035
mith+0.035
 Me+0.034
Neg logit contributions
One-0.027
ggy-0.027
 read-0.026
 nap-0.025
 che-0.025
Token her
Token ID607
Feature activation-1.21e+00

Pos logit contributions
elling+0.123
ener+0.111
aughter+0.109
mith+0.109
 Girl+0.106
Neg logit contributions
ggy-0.111
One-0.110
gh-0.102
 not-0.102
 read-0.100
Token what
Token ID644
Feature activation-3.15e+00

Pos logit contributions
elling+0.023
aughter+0.022
ener+0.021
mith+0.021
 Acc+0.020
Neg logit contributions
One-0.022
ggy-0.022
gh-0.021
 seek-0.020
 che-0.020
Token happened
Token ID3022
Feature activation-8.91e-01

Pos logit contributions
elling+0.064
aughter+0.060
mith+0.059
ener+0.058
 Acc+0.058
Neg logit contributions
One-0.053
ggy-0.051
gh-0.048
 badly-0.047
 nap-0.047
Token.
Token ID13
Feature activation-2.48e+00

Pos logit contributions
elling+0.022
aughter+0.021
mith+0.021
ener+0.021
ada+0.020
Neg logit contributions
One-0.012
ggy-0.012
gh-0.011
 badly-0.011
 read-0.010
Token
Token ID198
Feature activation+6.07e+00

Pos logit contributions
elling+0.055
ener+0.053
isher+0.051
mith+0.047
ting+0.046
Neg logit contributions
One-0.046
 not-0.046
On-0.043
ggy-0.041
 long-0.041
Token it
Token ID340
Feature activation-6.09e-01

Pos logit contributions
elling+0.097
aughter+0.097
ener+0.094
 Girl+0.092
 Acc+0.091
Neg logit contributions
ggy-0.062
One-0.060
 badly-0.058
 not-0.056
 hidden-0.056
Token?"
Token ID1701
Feature activation-2.79e+00

Pos logit contributions
elling+0.014
aughter+0.014
ener+0.014
mith+0.013
ada+0.013
Neg logit contributions
One-0.009
ggy-0.009
gh-0.008
 badly-0.008
 read-0.008
Token
Token ID198
Feature activation+5.25e+00

Pos logit contributions
elling+0.086
ener+0.083
aughter+0.081
isher+0.080
mith+0.076
Neg logit contributions
One-0.036
On-0.028
 not-0.026
 long-0.026
 read-0.025
Token
Token ID198
Feature activation+4.48e+00

Pos logit contributions
One+0.089
ggy+0.087
gh+0.079
 badly+0.077
On+0.077
Neg logit contributions
elling-0.114
ener-0.107
aughter-0.106
mith-0.104
ishly-0.101
Token"
Token ID1
Feature activation-1.03e+00

Pos logit contributions
One+0.070
ggy+0.069
gh+0.065
 badly+0.062
 nap+0.062
Neg logit contributions
elling-0.099
aughter-0.093
ener-0.092
mith-0.092
ada-0.088
TokenSure
Token ID19457
Feature activation-6.18e+00

Pos logit contributions
elling+0.017
aughter+0.015
ener+0.015
mith+0.015
ada+0.014
Neg logit contributions
One-0.024
ggy-0.022
On-0.021
gh-0.021
 seek-0.021
Token,
Token ID11
Feature activation-4.54e+00

Pos logit contributions
elling+0.132
aughter+0.122
mith+0.119
ener+0.117
ada+0.116
Neg logit contributions
ggy-0.111
One-0.108
gh-0.102
 badly-0.100
ggie-0.098
Token you
Token ID345
Feature activation+3.84e-01

Pos logit contributions
elling+0.104
aughter+0.100
mith+0.096
ener+0.095
ada+0.093
Neg logit contributions
ggy-0.070
One-0.068
 nap-0.065
 read-0.064
 not-0.063
Token can
Token ID460
Feature activation+2.61e+00

Pos logit contributions
ggy+0.009
One+0.009
gh+0.008
ggie+0.008
 badly+0.008
Neg logit contributions
elling-0.006
aughter-0.005
ener-0.005
mith-0.005
ada-0.005
Token play
Token ID711
Feature activation+9.76e-01

Pos logit contributions
ggy+0.041
One+0.040
gh+0.038
On+0.038
 prow+0.037
Neg logit contributions
elling-0.054
ener-0.051
aughter-0.050
 Me-0.050
ishly-0.050
Token with
Token ID351
Feature activation-4.00e+00

Pos logit contributions
One+0.016
ggy+0.015
gh+0.015
On+0.014
 prow+0.014
Neg logit contributions
elling-0.019
mith-0.018
ener-0.018
 Me-0.018
aughter-0.018