TOP ACTIVATIONS
MAX = 10.081

you count how many ost ric hes there are ?"
wanted to touch the ost ric hes . They reached out
and looked at the ost ric hes . They saw one
ie . Unlike other ost ric hes , his feathers gl
There were also many cob we bs . Cob we bs
laugh and shout , " Whe e !" They
't like spiders or cob we bs . But I like
. One of the ost ric hes looked different from the
and run to the ost ric hes . "
. It sees other ost ric hes . It makes new
it was filled with cob we bs . She picked up
laugh and shout , " Whe e ! That was fun
They do not pack ost ric hes anymore . They pack
look at all the cob we bs ," Lily said .
did not know what ost ric hes were . They looked
. They were called ost ric hes . Lily and Ben
how much brighter and sun n ier the other side was
the shed and the cob we bs and the spider .
It was covered with cob we bs and the paint was
smiles and shouts , " Whe e ! That was fun

MIN ACTIVATIONS
MIN = -8.562

says : " Dear Lily and Ben , Happy
The thief replied " Dear little girl , I need
it . " Dear Lily and Ben , we
prayed . " Dear God , please help us
slowly . " Dear ... An - na ...
. She said , " Dear Tom , I made this
said : " Dear Sara , Thank
Tom . " Dear Sara and Tom , I
Santa . " Dear Santa , thank you for
mom . " Dear Sara , I miss you
together . " Dear God , bless us and
. It says : " Dear Tim , we love you
her and said , " Dear daughter , I knew that
prayed and said , " Dear God , please forgive us
said to Frank : " Dear all igator , if you
. It said , " Dear Anna , I have a
. It says : " Dear Sam , I like you
. She said : " Dear Bob , we love you
The letter said : " Dear Sara and Ben , I
. He writes : " Dear toys , I love you

INTERVAL 5.420 - 10.081
CONTAINS 0.657%

the bird some seeds . Suddenly , the
don 't walk near the all igator . He is not
other wonderful things . â € Bear listened
Lily print her drawings . L ily was
fountain and feel safe . <|endoftext|> There was a parade coming

INTERVAL 0.760 - 5.420
CONTAINS 38.902%

you are not nice . You are not sharing . You
shield and scarf . They were torn and dirty and had
was a success ! <|endoftext|> One day , a little bird
L ily and Ben were not okay . They had
, " You are mean . You are not a good

INTERVAL -3.901 - 0.760
CONTAINS 57.192%

to play with . One day , she saw
asked Luna , " What are you playing with ?" Luna
drink . However , the suit was so special that it
throw them away . " Look , Ben and
grandma and gave her one last hug . As she walked

INTERVAL -8.562 - -3.901
CONTAINS 3.249%

a bench . They ran to see what was inside .
toy store . Jack said , " L ily , let
, Tim my ?" Sally asked . Tim
to the park . She asked her mom if she could
it was a good chance to get some help . So
Token you
Token ID345
Feature activation+2.13e+00

Pos logit contributions
keeper+0.041
 lady+0.040
 keeper+0.040
lf+0.040
 vet+0.039
Neg logit contributions
|-0.045
're-0.040
text-0.040
 Ever-0.039
Any-0.039
Token count
Token ID954
Feature activation-1.30e+00

Pos logit contributions
're+0.046
|+0.043
 imag+0.035
 You+0.035
Any+0.035
Neg logit contributions
keeper-0.055
lf-0.053
 detector-0.051
 instincts-0.050
 keeper-0.050
Token how
Token ID703
Feature activation+3.24e+00

Pos logit contributions
keeper+0.034
lf+0.033
 keeper+0.032
 buyer+0.032
 detector+0.032
Neg logit contributions
|-0.023
're-0.023
 You-0.022
 Ever-0.020
Any-0.020
Token many
Token ID867
Feature activation-2.87e-01

Pos logit contributions
|+0.075
're+0.070
 Ever+0.066
text+0.066
Any+0.066
Neg logit contributions
lf-0.069
keeper-0.068
 detector-0.067
 keeper-0.066
 lady-0.065
Token ost
Token ID23619
Feature activation+5.30e+00

Pos logit contributions
keeper+0.007
lf+0.006
 keeper+0.006
 detector+0.006
 vet+0.006
Neg logit contributions
|-0.006
're-0.006
text-0.006
 imag-0.006
 You-0.005
Tokenric
Token ID1173
Feature activation+1.01e+01

Pos logit contributions
|+0.078
text+0.068
Whe+0.063
Any+0.063
 Ever+0.062
Neg logit contributions
 keeper-0.153
keeper-0.152
 detector-0.151
man-0.149
lf-0.148
Tokenhes
Token ID956
Feature activation+3.21e+00

Pos logit contributions
|+0.219
text+0.191
're+0.190
 eyel+0.189
 imag+0.186
Neg logit contributions
lf-0.233
 lady-0.231
keeper-0.230
 keeper-0.227
 worker-0.224
Token there
Token ID612
Feature activation+1.06e+00

Pos logit contributions
|+0.065
're+0.057
text+0.055
 Ever+0.054
Any+0.054
Neg logit contributions
keeper-0.079
lf-0.078
 lady-0.077
 keeper-0.077
 detector-0.076
Token are
Token ID389
Feature activation+1.60e+00

Pos logit contributions
|+0.028
're+0.027
 You+0.025
 Ever+0.025
text+0.025
Neg logit contributions
keeper-0.020
lf-0.019
 detector-0.019
 keeper-0.019
 lady-0.018
Token?"
Token ID1701
Feature activation+3.69e-01

Pos logit contributions
|+0.032
're+0.032
 imag+0.030
 You+0.030
 Ever+0.029
Neg logit contributions
keeper-0.038
 detector-0.036
lf-0.036
 keeper-0.034
 buyer-0.034
Token
Token ID198
Feature activation-2.12e+00

Pos logit contributions
|+0.008
're+0.007
text+0.007
Any+0.007
 Ever+0.006
Neg logit contributions
keeper-0.009
 lady-0.009
lf-0.009
 keeper-0.009
 vet-0.009
Token wanted
Token ID2227
Feature activation-1.41e+00

Pos logit contributions
 lady+0.022
keeper+0.022
 keeper+0.022
 detector+0.022
lf+0.022
Neg logit contributions
|-0.025
text-0.022
 Ever-0.021
Any-0.021
 You-0.021
Token to
Token ID284
Feature activation-4.08e+00

Pos logit contributions
lf+0.025
keeper+0.024
 detector+0.023
 keeper+0.022
 lady+0.022
Neg logit contributions
|-0.038
're-0.035
 imag-0.034
 eyel-0.034
text-0.034
Token touch
Token ID3638
Feature activation+1.80e+00

Pos logit contributions
keeper+0.111
lf+0.110
 detector+0.103
 worker+0.099
 buyer+0.099
Neg logit contributions
|-0.081
're-0.076
 imag-0.064
 eyel-0.063
 You-0.061
Token the
Token ID262
Feature activation+2.33e+00

Pos logit contributions
|+0.041
're+0.035
text+0.035
 Ever+0.035
Any+0.035
Neg logit contributions
 lady-0.041
keeper-0.041
 keeper-0.040
 detector-0.040
 man-0.039
Token ost
Token ID23619
Feature activation+4.64e+00

Pos logit contributions
|+0.051
're+0.049
 You+0.045
 imag+0.044
Any+0.044
Neg logit contributions
lf-0.053
keeper-0.052
 detector-0.049
 keeper-0.049
 buyer-0.048
Tokenric
Token ID1173
Feature activation+1.00e+01

Pos logit contributions
|+0.050
're+0.038
text+0.037
 Ever+0.034
Any+0.034
Neg logit contributions
keeper-0.159
lf-0.157
 keeper-0.156
 lady-0.156
 detector-0.155
Tokenhes
Token ID956
Feature activation+2.27e+00

Pos logit contributions
|+0.220
're+0.197
 imag+0.193
text+0.191
 eyel+0.188
Neg logit contributions
lf-0.231
 lady-0.229
keeper-0.228
 keeper-0.224
 detector-0.222
Token.
Token ID13
Feature activation+1.56e+00

Pos logit contributions
|+0.055
're+0.050
text+0.049
 Ever+0.048
Any+0.048
Neg logit contributions
keeper-0.046
lf-0.046
 lady-0.045
 keeper-0.045
 detector-0.044
Token They
Token ID1119
Feature activation-8.01e-01

Pos logit contributions
|+0.019
 You+0.017
 Ever+0.016
're+0.015
text+0.015
Neg logit contributions
lf-0.054
 detector-0.051
keeper-0.050
 recipe-0.049
ache-0.049
Token reached
Token ID4251
Feature activation+2.90e-01

Pos logit contributions
keeper+0.018
 lady+0.018
lf+0.018
 keeper+0.018
 detector+0.017
Neg logit contributions
|-0.018
're-0.016
text-0.016
 Ever-0.015
Any-0.015
Token out
Token ID503
Feature activation+1.51e-02

Pos logit contributions
|+0.008
pill+0.007
Any+0.007
text+0.007
're+0.007
Neg logit contributions
 lady-0.006
 vet-0.006
 keeper-0.006
 door-0.006
 worker-0.006
Token and
Token ID290
Feature activation+3.24e+00

Pos logit contributions
|+0.013
text+0.012
're+0.012
 Ever+0.011
Any+0.011
Neg logit contributions
 lady-0.011
 keeper-0.011
keeper-0.011
 vet-0.011
 detector-0.011
Token looked
Token ID3114
Feature activation+1.04e+00

Pos logit contributions
|+0.076
text+0.067
avy+0.066
Any+0.066
 Ever+0.065
Neg logit contributions
keeper-0.072
 lady-0.072
 keeper-0.070
 man-0.070
 detector-0.069
Token at
Token ID379
Feature activation-1.45e+00

Pos logit contributions
|+0.023
text+0.020
're+0.020
Any+0.019
 Ever+0.018
Neg logit contributions
keeper-0.025
 lady-0.024
 keeper-0.024
 man-0.024
 detector-0.024
Token the
Token ID262
Feature activation+1.51e+00

Pos logit contributions
 lady+0.031
keeper+0.030
 keeper+0.030
 vet+0.030
 man+0.030
Neg logit contributions
|-0.035
're-0.033
Any-0.031
pill-0.031
text-0.031
Token ost
Token ID23619
Feature activation+3.93e+00

Pos logit contributions
|+0.033
're+0.032
avy+0.029
 You+0.029
 eyel+0.029
Neg logit contributions
lf-0.034
keeper-0.032
 detector-0.031
 buyer-0.030
 vet-0.029
Tokenric
Token ID1173
Feature activation+9.92e+00

Pos logit contributions
|+0.043
're+0.033
text+0.032
 Ever+0.030
Any+0.030
Neg logit contributions
keeper-0.133
lf-0.132
 keeper-0.132
 lady-0.131
 detector-0.130
Tokenhes
Token ID956
Feature activation+2.06e+00

Pos logit contributions
|+0.229
're+0.205
 imag+0.202
text+0.199
 eyel+0.196
Neg logit contributions
lf-0.219
 lady-0.217
keeper-0.216
 keeper-0.213
 man-0.211
Token.
Token ID13
Feature activation+8.67e-01

Pos logit contributions
|+0.053
're+0.047
text+0.047
Any+0.046
 Ever+0.046
Neg logit contributions
keeper-0.040
 lady-0.039
lf-0.039
 keeper-0.039
 detector-0.038
Token They
Token ID1119
Feature activation-6.53e-01

Pos logit contributions
|+0.014
're+0.012
text+0.012
Any+0.011
 imag+0.011
Neg logit contributions
keeper-0.026
 lady-0.025
 keeper-0.025
 vet-0.025
 worker-0.024
Token saw
Token ID2497
Feature activation-3.00e+00

Pos logit contributions
keeper+0.014
lf+0.014
 keeper+0.013
 detector+0.013
 lady+0.013
Neg logit contributions
|-0.015
're-0.014
text-0.013
 Ever-0.013
 imag-0.013
Token one
Token ID530
Feature activation-1.38e+00

Pos logit contributions
lf+0.080
keeper+0.079
 detector+0.077
 keeper+0.077
 lady+0.077
Neg logit contributions
|-0.054
're-0.048
text-0.046
 Ever-0.045
Any-0.045
Tokenie
Token ID494
Feature activation+8.74e-01

Pos logit contributions
|+0.073
're+0.065
text+0.064
 Ever+0.063
Any+0.062
Neg logit contributions
keeper-0.061
lf-0.061
 lady-0.060
 keeper-0.060
 detector-0.059
Token.
Token ID13
Feature activation+1.34e+00

Pos logit contributions
|+0.021
're+0.019
text+0.018
 Ever+0.018
Any+0.018
Neg logit contributions
keeper-0.018
lf-0.018
 detector-0.018
 lady-0.018
 keeper-0.018
Token Unlike
Token ID12101
Feature activation-4.49e-02

Pos logit contributions
 imag+0.025
|+0.024
text+0.023
ez+0.022
hed+0.022
Neg logit contributions
 lady-0.031
 vet-0.031
keeper-0.031
 keeper-0.031
 man-0.030
Token other
Token ID584
Feature activation-2.66e-01

Pos logit contributions
lf+0.001
 man+0.001
 lady+0.001
 captain+0.001
 detector+0.001
Neg logit contributions
|-0.001
text-0.001
pill-0.001
 imag-0.001
 Ever-0.001
Token ost
Token ID23619
Feature activation+4.81e+00

Pos logit contributions
lf+0.006
keeper+0.006
 detector+0.005
 keeper+0.005
 lady+0.005
Neg logit contributions
|-0.006
're-0.006
text-0.006
 imag-0.005
 Ever-0.005
Tokenric
Token ID1173
Feature activation+9.66e+00

Pos logit contributions
|+0.059
text+0.047
're+0.045
 Ever+0.043
Any+0.043
Neg logit contributions
lf-0.156
keeper-0.155
 keeper-0.155
 detector-0.153
 lady-0.152
Tokenhes
Token ID956
Feature activation+3.28e+00

Pos logit contributions
|+0.231
 imag+0.208
text+0.208
 eyel+0.205
're+0.204
Neg logit contributions
lf-0.206
 lady-0.198
keeper-0.197
 keeper-0.195
 man-0.194
Token,
Token ID11
Feature activation-2.35e+00

Pos logit contributions
|+0.078
 imag+0.068
 Ever+0.068
Whe+0.067
Any+0.067
Neg logit contributions
 lady-0.070
lf-0.070
 man-0.070
 detector-0.069
keeper-0.069
Token his
Token ID465
Feature activation-1.75e+00

Pos logit contributions
 lady+0.048
 man+0.047
keeper+0.046
lf+0.046
 keeper+0.046
Neg logit contributions
|-0.062
text-0.055
 imag-0.054
 eyel-0.052
 Ever-0.052
Token feathers
Token ID27400
Feature activation+8.21e-01

Pos logit contributions
lf+0.036
keeper+0.035
 detector+0.035
 lady+0.034
 keeper+0.034
Neg logit contributions
|-0.042
're-0.039
text-0.038
 imag-0.038
 Ever-0.037
Token gl
Token ID1278
Feature activation+1.73e+00

Pos logit contributions
|+0.017
text+0.015
're+0.015
 Ever+0.014
 imag+0.014
Neg logit contributions
keeper-0.020
lf-0.019
 lady-0.019
 keeper-0.019
 detector-0.019
Token There
Token ID1318
Feature activation-1.68e+00

Pos logit contributions
|+0.030
're+0.026
text+0.025
 imag+0.024
Any+0.024
Neg logit contributions
keeper-0.044
 lady-0.044
 keeper-0.043
lf-0.043
 vet-0.043
Token were
Token ID547
Feature activation+5.92e-01

Pos logit contributions
keeper+0.049
lf+0.048
 lady+0.048
 keeper+0.048
 detector+0.047
Neg logit contributions
|-0.028
're-0.022
text-0.022
Any-0.022
 Ever-0.022
Token also
Token ID635
Feature activation+9.79e-01

Pos logit contributions
|+0.013
're+0.013
 imag+0.013
 eyel+0.012
text+0.012
Neg logit contributions
lf-0.012
keeper-0.012
 lady-0.012
 keeper-0.012
 detector-0.011
Token many
Token ID867
Feature activation-6.92e-01

Pos logit contributions
|+0.020
're+0.019
 imag+0.018
 Ever+0.018
 eyel+0.017
Neg logit contributions
lf-0.023
keeper-0.022
 lady-0.022
 detector-0.022
 keeper-0.021
Token cob
Token ID22843
Feature activation-3.51e-01

Pos logit contributions
keeper+0.015
 keeper+0.014
 vet+0.014
man+0.014
lf+0.014
Neg logit contributions
 imag-0.015
're-0.015
|-0.014
avy-0.014
 eyel-0.013
Tokenwe
Token ID732
Feature activation+9.59e+00

Pos logit contributions
keeper+0.010
 lady+0.009
 keeper+0.009
lf+0.009
 vet+0.009
Neg logit contributions
|-0.006
're-0.005
text-0.005
 imag-0.005
Any-0.005
Tokenbs
Token ID1443
Feature activation+2.75e+00

Pos logit contributions
|+0.222
Any+0.206
 Ever+0.197
 eyel+0.194
Whe+0.194
Neg logit contributions
 detector-0.197
keeper-0.192
lf-0.190
 lady-0.187
 man-0.186
Token.
Token ID13
Feature activation+2.26e+00

Pos logit contributions
|+0.064
're+0.060
text+0.056
 Ever+0.055
 You+0.055
Neg logit contributions
keeper-0.058
lf-0.057
 lady-0.057
 keeper-0.056
 vet-0.056
Token Cob
Token ID14828
Feature activation+8.63e-01

Pos logit contributions
|+0.044
're+0.039
text+0.038
 imag+0.037
Any+0.037
Neg logit contributions
keeper-0.058
 lady-0.057
 vet-0.056
 keeper-0.056
lf-0.055
Tokenwe
Token ID732
Feature activation+8.25e+00

Pos logit contributions
|+0.019
're+0.016
Any+0.016
avy+0.016
text+0.015
Neg logit contributions
 vet-0.020
 lady-0.020
keeper-0.020
 buyer-0.019
 keeper-0.019
Tokenbs
Token ID1443
Feature activation+2.17e+00

Pos logit contributions
|+0.179
Any+0.162
 Ever+0.159
 You+0.157
 eyel+0.156
Neg logit contributions
 detector-0.186
lf-0.183
keeper-0.178
 lady-0.175
 man-0.172
Token laugh
Token ID6487
Feature activation+1.84e+00

Pos logit contributions
 man+0.001
 lady+0.001
 keeper+0.001
 detector+0.001
keeper+0.001
Neg logit contributions
|-0.001
Any-0.001
text-0.001
pill-0.001
 Ever-0.001
Token and
Token ID290
Feature activation+3.43e+00

Pos logit contributions
|+0.053
're+0.047
text+0.046
Any+0.045
 eyel+0.045
Neg logit contributions
keeper-0.030
 lady-0.030
lf-0.030
 keeper-0.030
 man-0.029
Token shout
Token ID9191
Feature activation+1.12e+00

Pos logit contributions
pill+0.091
avy+0.090
|+0.090
ash+0.089
text+0.089
Neg logit contributions
 lady-0.065
 man-0.064
 keeper-0.057
 click-0.057
 ring-0.055
Token,
Token ID11
Feature activation-2.59e+00

Pos logit contributions
|+0.032
're+0.029
text+0.028
Any+0.028
 Ever+0.028
Neg logit contributions
keeper-0.019
 lady-0.019
lf-0.018
 keeper-0.018
 detector-0.018
Token "
Token ID366
Feature activation-4.07e-01

Pos logit contributions
keeper+0.035
 lady+0.035
 keeper+0.034
lf+0.033
 man+0.032
Neg logit contributions
|-0.083
text-0.075
're-0.075
 Ever-0.073
 imag-0.072
TokenWhe
Token ID10842
Feature activation+9.48e+00

Pos logit contributions
 lady+0.011
 keeper+0.010
 vet+0.010
 man+0.010
lf+0.010
Neg logit contributions
|-0.007
avy-0.006
 imag-0.006
 eyel-0.006
're-0.006
Tokene
Token ID68
Feature activation+4.01e+00

Pos logit contributions
pill+0.121
|+0.119
 eyel+0.117
 You+0.113
 Ever+0.112
Neg logit contributions
lf-0.291
 lady-0.265
 detector-0.257
"-0.256
man-0.254
Token!"
Token ID2474
Feature activation+1.29e+00

Pos logit contributions
|+0.129
're+0.117
Any+0.116
 imag+0.115
 Ever+0.115
Neg logit contributions
lf-0.052
 lady-0.051
keeper-0.049
 keeper-0.049
 man-0.048
Token
Token ID198
Feature activation-2.01e+00

Pos logit contributions
|+0.038
 imag+0.035
're+0.034
ez+0.034
Any+0.033
Neg logit contributions
 lady-0.022
 man-0.020
 vet-0.020
keeper-0.019
 keeper-0.018
Token
Token ID198
Feature activation-2.52e+00

Pos logit contributions
keeper+0.052
 detector+0.052
lf+0.052
 buyer+0.051
 keeper+0.051
Neg logit contributions
|-0.039
text-0.034
 Ever-0.032
're-0.032
 You-0.031
TokenThey
Token ID2990
Feature activation-3.88e-01

Pos logit contributions
keeper+0.043
 detector+0.043
lf+0.041
 keeper+0.041
 worker+0.039
Neg logit contributions
|-0.071
text-0.065
Any-0.064
're-0.063
Whe-0.063
Token't
Token ID470
Feature activation-1.35e-01

Pos logit contributions
|+0.075
're+0.068
 imag+0.067
text+0.066
 eyel+0.066
Neg logit contributions
lf-0.051
keeper-0.050
 lady-0.049
 keeper-0.049
 detector-0.048
Token like
Token ID588
Feature activation-6.02e-01

Pos logit contributions
 lady+0.003
 vet+0.003
 man+0.003
 keeper+0.003
 detector+0.003
Neg logit contributions
|-0.003
text-0.003
pill-0.003
Any-0.003
anced-0.003
Token spiders
Token ID26120
Feature activation+1.80e+00

Pos logit contributions
keeper+0.014
lf+0.014
 lady+0.014
 keeper+0.013
 detector+0.013
Neg logit contributions
|-0.013
're-0.012
text-0.012
 Ever-0.011
Any-0.011
Token or
Token ID393
Feature activation+1.60e+00

Pos logit contributions
|+0.041
're+0.039
 You+0.036
text+0.036
 imag+0.035
Neg logit contributions
keeper-0.040
lf-0.040
 buyer-0.039
 keeper-0.038
 lady-0.038
Token cob
Token ID22843
Feature activation+2.29e-01

Pos logit contributions
|+0.037
're+0.032
text+0.032
 Ever+0.032
Any+0.031
Neg logit contributions
keeper-0.035
lf-0.035
 lady-0.035
 keeper-0.034
 detector-0.034
Tokenwe
Token ID732
Feature activation+9.46e+00

Pos logit contributions
|+0.004
're+0.004
text+0.004
Any+0.004
 Ever+0.004
Neg logit contributions
keeper-0.006
lf-0.006
 lady-0.006
 keeper-0.006
 detector-0.006
Tokenbs
Token ID1443
Feature activation+3.59e+00

Pos logit contributions
|+0.225
 Ever+0.196
Any+0.195
 eyel+0.189
text+0.186
Neg logit contributions
ache-0.191
 detector-0.188
 lady-0.186
 man-0.183
 vet-0.181
Token.
Token ID13
Feature activation+2.08e+00

Pos logit contributions
|+0.084
're+0.078
text+0.074
 You+0.073
 Ever+0.071
Neg logit contributions
keeper-0.077
lf-0.076
 keeper-0.075
 buyer-0.074
 lady-0.074
Token But
Token ID887
Feature activation+3.05e-01

Pos logit contributions
|+0.044
text+0.036
're+0.036
hed+0.036
avy+0.034
Neg logit contributions
 vet-0.051
keeper-0.050
 lady-0.050
 keeper-0.049
 man-0.048
Token I
Token ID314
Feature activation+1.95e+00

Pos logit contributions
|+0.007
text+0.006
're+0.006
 Ever+0.005
Any+0.005
Neg logit contributions
lf-0.007
keeper-0.007
 lady-0.007
 keeper-0.007
 detector-0.007
Token like
Token ID588
Feature activation+1.06e-01

Pos logit contributions
|+0.040
text+0.039
 eyel+0.037
avy+0.036
 Ever+0.036
Neg logit contributions
 lady-0.045
keeper-0.045
 keeper-0.044
 vet-0.044
 man-0.044
Token.
Token ID13
Feature activation+1.13e+00

Pos logit contributions
keeper+0.050
lf+0.050
 lady+0.049
 keeper+0.049
 detector+0.048
Neg logit contributions
|-0.066
're-0.059
text-0.058
 Ever-0.057
Any-0.057
Token One
Token ID1881
Feature activation-1.27e+00

Pos logit contributions
|+0.023
text+0.020
're+0.020
Any+0.020
 imag+0.019
Neg logit contributions
 lady-0.029
keeper-0.029
 vet-0.028
 keeper-0.028
 worker-0.027
Token of
Token ID286
Feature activation+5.85e-02

Pos logit contributions
keeper+0.024
lf+0.024
 lady+0.023
 detector+0.023
 keeper+0.023
Neg logit contributions
|-0.033
're-0.030
text-0.029
 Ever-0.029
Any-0.029
Token the
Token ID262
Feature activation+1.09e+00

Pos logit contributions
|+0.001
're+0.001
text+0.001
 Ever+0.001
Any+0.001
Neg logit contributions
lf-0.001
 lady-0.001
keeper-0.001
 keeper-0.001
 detector-0.001
Token ost
Token ID23619
Feature activation+3.96e+00

Pos logit contributions
|+0.023
're+0.022
 imag+0.020
 eyel+0.020
 You+0.019
Neg logit contributions
lf-0.026
keeper-0.025
 detector-0.024
 keeper-0.023
 vet-0.023
Tokenric
Token ID1173
Feature activation+9.45e+00

Pos logit contributions
|+0.049
text+0.040
're+0.038
Any+0.038
 Ever+0.037
Neg logit contributions
keeper-0.126
 keeper-0.126
 detector-0.125
lf-0.125
 lady-0.124
Tokenhes
Token ID956
Feature activation+1.87e+00

Pos logit contributions
|+0.197
're+0.173
text+0.169
 imag+0.164
Any+0.164
Neg logit contributions
keeper-0.229
lf-0.228
 lady-0.225
 keeper-0.224
 detector-0.221
Token looked
Token ID3114
Feature activation+2.34e-01

Pos logit contributions
|+0.040
 Ever+0.034
Any+0.034
text+0.034
 imag+0.034
Neg logit contributions
keeper-0.045
 lady-0.045
lf-0.044
 keeper-0.044
 detector-0.043
Token different
Token ID1180
Feature activation-9.65e-01

Pos logit contributions
|+0.006
text+0.005
're+0.005
Whe+0.005
 Ever+0.005
Neg logit contributions
keeper-0.005
 lady-0.005
 keeper-0.005
 man-0.005
 vet-0.005
Token from
Token ID422
Feature activation+9.95e-01

Pos logit contributions
keeper+0.021
lf+0.021
 lady+0.020
 keeper+0.020
 detector+0.020
Neg logit contributions
|-0.022
're-0.020
text-0.020
 Ever-0.019
 imag-0.019
Token the
Token ID262
Feature activation+2.34e+00

Pos logit contributions
|+0.022
text+0.020
're+0.020
Any+0.019
 Ever+0.018
Neg logit contributions
 lady-0.023
keeper-0.023
 keeper-0.023
 vet-0.023
 worker-0.022
Token and
Token ID290
Feature activation+3.11e+00

Pos logit contributions
lf+0.008
 buyer+0.008
 instincts+0.008
ache+0.008
 vet+0.008
Neg logit contributions
|-0.006
 Ever-0.006
 You-0.006
're-0.006
hed-0.005
Token run
Token ID1057
Feature activation+8.18e-01

Pos logit contributions
|+0.080
're+0.071
text+0.071
Any+0.069
 Ever+0.069
Neg logit contributions
keeper-0.061
 lady-0.060
lf-0.059
 keeper-0.059
 detector-0.058
Token to
Token ID284
Feature activation-3.29e+00

Pos logit contributions
|+0.021
're+0.018
text+0.018
 Ever+0.018
 You+0.017
Neg logit contributions
lf-0.018
keeper-0.016
 detector-0.016
 vet-0.016
 buyer-0.015
Token the
Token ID262
Feature activation+1.36e+00

Pos logit contributions
lf+0.094
keeper+0.093
 detector+0.090
ache+0.088
 keeper+0.087
Neg logit contributions
|-0.056
're-0.049
 You-0.046
 Ever-0.045
 eyel-0.044
Token ost
Token ID23619
Feature activation+3.82e+00

Pos logit contributions
|+0.034
're+0.032
 You+0.032
 eyel+0.032
 imag+0.031
Neg logit contributions
lf-0.024
 detector-0.023
keeper-0.022
 buyer-0.021
 instincts-0.020
Tokenric
Token ID1173
Feature activation+9.41e+00

Pos logit contributions
|+0.035
're+0.025
text+0.022
 Ever+0.021
Any+0.020
Neg logit contributions
keeper-0.138
lf-0.137
 lady-0.137
 keeper-0.135
 detector-0.135
Tokenhes
Token ID956
Feature activation+2.05e+00

Pos logit contributions
|+0.207
're+0.186
 imag+0.181
text+0.179
 eyel+0.177
Neg logit contributions
lf-0.215
 lady-0.214
keeper-0.213
 keeper-0.210
 detector-0.207
Token.
Token ID13
Feature activation+2.03e+00

Pos logit contributions
|+0.048
're+0.043
text+0.042
 Ever+0.041
 You+0.041
Neg logit contributions
lf-0.044
keeper-0.044
 keeper-0.042
 detector-0.042
 lady-0.042
Token
Token ID198
Feature activation-1.08e+00

Pos logit contributions
|+0.028
're+0.023
 You+0.023
 Ever+0.023
text+0.022
Neg logit contributions
lf-0.065
keeper-0.063
 detector-0.062
 lady-0.061
 keeper-0.061
Token
Token ID198
Feature activation-1.83e+00

Pos logit contributions
 detector+0.032
keeper+0.032
 recipe+0.032
lf+0.032
ache+0.032
Neg logit contributions
|-0.017
text-0.014
 Ever-0.013
 You-0.013
're-0.012
Token"
Token ID1
Feature activation-2.74e+00

Pos logit contributions
 recipe+0.033
 detector+0.033
 instincts+0.031
keeper+0.031
ache+0.031
Neg logit contributions
|-0.051
You-0.048
 You-0.045
�-0.044
text-0.044
Token.
Token ID13
Feature activation+9.14e-01

Pos logit contributions
|+0.007
're+0.007
 Ever+0.006
 eyel+0.006
Any+0.006
Neg logit contributions
lf-0.009
keeper-0.009
 detector-0.009
 buyer-0.009
 vet-0.009
Token It
Token ID632
Feature activation+4.87e-01

Pos logit contributions
|+0.018
're+0.016
text+0.015
 imag+0.015
Any+0.015
Neg logit contributions
keeper-0.024
 lady-0.023
 keeper-0.023
 vet-0.023
 man-0.022
Token sees
Token ID7224
Feature activation-3.16e+00

Pos logit contributions
|+0.010
're+0.009
text+0.009
avy+0.008
Any+0.008
Neg logit contributions
keeper-0.012
 keeper-0.012
 lady-0.012
 man-0.012
 vet-0.012
Token other
Token ID584
Feature activation-8.05e-01

Pos logit contributions
lf+0.082
keeper+0.082
 keeper+0.080
 lady+0.080
 detector+0.080
Neg logit contributions
|-0.059
're-0.052
text-0.051
 Ever-0.050
Any-0.049
Token ost
Token ID23619
Feature activation+4.48e+00

Pos logit contributions
lf+0.016
keeper+0.016
 detector+0.016
 keeper+0.015
 lady+0.015
Neg logit contributions
|-0.020
're-0.018
text-0.017
 Ever-0.017
 You-0.017
Tokenric
Token ID1173
Feature activation+9.40e+00

Pos logit contributions
|+0.056
text+0.046
're+0.044
 Ever+0.043
Any+0.043
Neg logit contributions
keeper-0.143
 keeper-0.142
lf-0.141
 detector-0.141
 lady-0.139
Tokenhes
Token ID956
Feature activation+2.01e+00

Pos logit contributions
|+0.213
're+0.190
text+0.186
 imag+0.185
 eyel+0.183
Neg logit contributions
lf-0.212
keeper-0.210
 lady-0.208
 keeper-0.206
 man-0.203
Token.
Token ID13
Feature activation+1.26e+00

Pos logit contributions
|+0.047
're+0.041
text+0.040
Any+0.040
 imag+0.040
Neg logit contributions
keeper-0.045
 lady-0.045
 keeper-0.044
 man-0.043
 detector-0.043
Token It
Token ID632
Feature activation+1.03e-02

Pos logit contributions
|+0.021
're+0.017
text+0.017
 imag+0.016
 eyel+0.016
Neg logit contributions
keeper-0.037
 lady-0.037
 vet-0.036
 keeper-0.036
 man-0.035
Token makes
Token ID1838
Feature activation+1.72e+00

Pos logit contributions
|+0.000
're+0.000
text+0.000
Any+0.000
avy+0.000
Neg logit contributions
 keeper-0.000
 lady-0.000
keeper-0.000
 man-0.000
 vet-0.000
Token new
Token ID649
Feature activation-1.43e+00

Pos logit contributions
|+0.039
're+0.034
text+0.033
Any+0.033
 Ever+0.032
Neg logit contributions
 lady-0.039
keeper-0.039
 keeper-0.038
 man-0.038
 vet-0.038
Token it
Token ID340
Feature activation+2.73e+00

Pos logit contributions
|+0.049
're+0.046
 eyel+0.043
 imag+0.043
Any+0.042
Neg logit contributions
keeper-0.059
lf-0.058
 detector-0.058
 keeper-0.058
 lady-0.057
Token was
Token ID373
Feature activation+1.50e+00

Pos logit contributions
|+0.060
're+0.052
text+0.052
 Ever+0.051
Any+0.050
Neg logit contributions
keeper-0.064
lf-0.063
 lady-0.063
 keeper-0.063
 detector-0.062
Token filled
Token ID5901
Feature activation-1.12e+00

Pos logit contributions
|+0.029
're+0.026
 eyel+0.025
Any+0.025
text+0.024
Neg logit contributions
keeper-0.039
lf-0.037
 detector-0.037
 keeper-0.037
 lady-0.037
Token with
Token ID351
Feature activation+3.05e+00

Pos logit contributions
keeper+0.016
lf+0.016
 lady+0.015
 keeper+0.015
 worker+0.014
Neg logit contributions
|-0.034
're-0.033
text-0.032
 imag-0.031
 Ever-0.031
Token cob
Token ID22843
Feature activation-4.26e-02

Pos logit contributions
|+0.076
text+0.065
're+0.065
 Ever+0.064
Any+0.064
Neg logit contributions
 lady-0.063
keeper-0.062
 keeper-0.061
 man-0.061
 detector-0.061
Tokenwe
Token ID732
Feature activation+9.40e+00

Pos logit contributions
keeper+0.001
lf+0.001
 lady+0.001
 keeper+0.001
 detector+0.001
Neg logit contributions
|-0.001
're-0.001
text-0.001
Any-0.001
 Ever-0.001
Tokenbs
Token ID1443
Feature activation+2.71e+00

Pos logit contributions
|+0.240
 Ever+0.218
Any+0.217
 You+0.211
Whe+0.210
Neg logit contributions
 detector-0.163
ache-0.161
 lady-0.161
keeper-0.159
 man-0.157
Token.
Token ID13
Feature activation+1.65e+00

Pos logit contributions
|+0.066
're+0.060
text+0.058
 Ever+0.057
Any+0.057
Neg logit contributions
keeper-0.056
lf-0.055
 lady-0.054
 keeper-0.054
 detector-0.053
Token She
Token ID1375
Feature activation-1.79e+00

Pos logit contributions
|+0.030
're+0.026
 Ever+0.026
 You+0.026
Any+0.025
Neg logit contributions
lf-0.044
 detector-0.044
keeper-0.043
 keeper-0.043
 lady-0.042
Token picked
Token ID6497
Feature activation-7.66e-01

Pos logit contributions
keeper+0.041
 detector+0.040
lf+0.040
 keeper+0.040
 lady+0.040
Neg logit contributions
|-0.038
're-0.034
text-0.033
Any-0.033
 Ever-0.033
Token up
Token ID510
Feature activation-9.12e-01

Pos logit contributions
 lady+0.014
 man+0.014
keeper+0.013
 vet+0.012
 worker+0.012
Neg logit contributions
|-0.022
text-0.019
're-0.019
 Ever-0.019
Any-0.018
Token laugh
Token ID6487
Feature activation+1.90e+00

Pos logit contributions
|+0.007
text+0.007
Any+0.007
ained+0.006
 Ever+0.006
Neg logit contributions
 lady-0.008
 man-0.008
 keeper-0.008
 vet-0.008
keeper-0.008
Token and
Token ID290
Feature activation+3.60e+00

Pos logit contributions
|+0.055
're+0.049
text+0.048
Any+0.047
 eyel+0.047
Neg logit contributions
 lady-0.032
keeper-0.032
 keeper-0.031
 man-0.030
lf-0.030
Token shout
Token ID9191
Feature activation+1.28e+00

Pos logit contributions
avy+0.091
ash+0.089
text+0.089
pill+0.088
|+0.088
Neg logit contributions
 lady-0.074
 man-0.072
 keeper-0.065
 click-0.063
 sign-0.062
Token,
Token ID11
Feature activation-2.59e+00

Pos logit contributions
|+0.035
're+0.031
text+0.030
Any+0.030
 Ever+0.029
Neg logit contributions
keeper-0.024
 lady-0.023
 keeper-0.023
lf-0.023
 man-0.022
Token "
Token ID366
Feature activation-2.86e-01

Pos logit contributions
 lady+0.033
 keeper+0.031
keeper+0.031
 man+0.030
lf+0.029
Neg logit contributions
|-0.086
text-0.080
're-0.078
avy-0.077
 imag-0.075
TokenWhe
Token ID10842
Feature activation+9.38e+00

Pos logit contributions
 lady+0.007
 keeper+0.007
 vet+0.007
lf+0.007
 man+0.007
Neg logit contributions
|-0.005
avy-0.004
're-0.004
 imag-0.004
 eyel-0.004
Tokene
Token ID68
Feature activation+3.93e+00

Pos logit contributions
pill+0.129
|+0.125
 eyel+0.122
're+0.121
 imag+0.118
Neg logit contributions
lf-0.278
 lady-0.258
 detector-0.246
man-0.246
 keeper-0.245
Token!
Token ID0
Feature activation+3.71e+00

Pos logit contributions
|+0.126
're+0.113
 imag+0.111
Any+0.111
 Ever+0.110
Neg logit contributions
 lady-0.053
lf-0.053
 keeper-0.050
 man-0.050
keeper-0.050
Token That
Token ID1320
Feature activation+4.24e+00

Pos logit contributions
|+0.069
 immensely+0.069
avy+0.065
led+0.064
 eyel+0.064
Neg logit contributions
"-0.078
 click-0.078
 vet-0.077
 lady-0.076
keeper-0.075
Token was
Token ID373
Feature activation+1.05e+00

Pos logit contributions
|+0.102
text+0.086
 Ever+0.084
 eyel+0.084
avy+0.083
Neg logit contributions
 lady-0.094
 man-0.094
 keeper-0.094
keeper-0.092
lf-0.089
Token fun
Token ID1257
Feature activation-4.22e-01

Pos logit contributions
|+0.026
text+0.022
're+0.022
avy+0.022
pill+0.022
Neg logit contributions
 keeper-0.022
 lady-0.022
lf-0.022
keeper-0.021
 man-0.021
Token They
Token ID1119
Feature activation-8.15e-01

Pos logit contributions
|+0.020
're+0.016
text+0.016
Any+0.015
 eyel+0.015
Neg logit contributions
keeper-0.038
 keeper-0.037
 lady-0.037
 detector-0.037
 vet-0.037
Token do
Token ID466
Feature activation-1.66e+00

Pos logit contributions
keeper+0.017
 lady+0.017
 keeper+0.017
lf+0.016
 vet+0.016
Neg logit contributions
|-0.020
text-0.017
're-0.017
Any-0.017
 Ever-0.017
Token not
Token ID407
Feature activation-3.61e-01

Pos logit contributions
 lady+0.019
lf+0.019
keeper+0.019
 keeper+0.018
 vet+0.018
Neg logit contributions
|-0.056
text-0.052
're-0.052
 imag-0.051
pill-0.051
Token pack
Token ID2353
Feature activation-1.40e+00

Pos logit contributions
keeper+0.008
 lady+0.008
lf+0.008
 keeper+0.008
 detector+0.008
Neg logit contributions
|-0.008
text-0.007
're-0.007
 Ever-0.007
Any-0.007
Token ost
Token ID23619
Feature activation+4.38e+00

Pos logit contributions
keeper+0.033
 lady+0.033
 keeper+0.033
lf+0.032
 detector+0.032
Neg logit contributions
|-0.031
're-0.027
text-0.026
Any-0.026
 Ever-0.025
Tokenric
Token ID1173
Feature activation+9.35e+00

Pos logit contributions
|+0.062
text+0.049
're+0.048
 Ever+0.048
Any+0.047
Neg logit contributions
keeper-0.135
 keeper-0.133
 detector-0.133
lf-0.132
 lady-0.131
Tokenhes
Token ID956
Feature activation+2.22e+00

Pos logit contributions
|+0.231
 eyel+0.204
text+0.202
're+0.201
 imag+0.200
Neg logit contributions
 lady-0.190
lf-0.189
keeper-0.188
 man-0.186
 worker-0.186
Token anymore
Token ID7471
Feature activation+2.69e-01

Pos logit contributions
|+0.051
're+0.045
text+0.044
Any+0.044
 Ever+0.043
Neg logit contributions
keeper-0.050
 lady-0.049
 keeper-0.049
 detector-0.047
lf-0.047
Token.
Token ID13
Feature activation+1.99e+00

Pos logit contributions
|+0.006
're+0.005
text+0.005
 Ever+0.005
Any+0.005
Neg logit contributions
keeper-0.006
 lady-0.006
 detector-0.006
 keeper-0.006
lf-0.006
Token They
Token ID1119
Feature activation-1.89e-01

Pos logit contributions
|+0.034
're+0.030
text+0.028
 eyel+0.028
Any+0.028
Neg logit contributions
keeper-0.056
 keeper-0.055
 lady-0.055
 vet-0.054
 man-0.054
Token pack
Token ID2353
Feature activation-2.16e+00

Pos logit contributions
 lady+0.004
keeper+0.004
 vet+0.004
 keeper+0.004
 man+0.004
Neg logit contributions
|-0.004
text-0.004
Any-0.004
 Ever-0.004
 eyel-0.004
Token look
Token ID804
Feature activation+1.95e+00

Pos logit contributions
 lady+0.055
keeper+0.055
lf+0.055
 keeper+0.054
 man+0.053
Neg logit contributions
|-0.073
text-0.064
're-0.064
 Ever-0.064
Any-0.064
Token at
Token ID379
Feature activation-4.83e-01

Pos logit contributions
|+0.042
're+0.037
text+0.036
 Ever+0.035
Any+0.035
Neg logit contributions
keeper-0.046
lf-0.046
 lady-0.045
 keeper-0.045
 detector-0.045
Token all
Token ID477
Feature activation+5.26e+00

Pos logit contributions
keeper+0.011
lf+0.010
 lady+0.010
 keeper+0.010
 detector+0.010
Neg logit contributions
|-0.011
're-0.010
text-0.010
 Ever-0.010
Any-0.010
Token the
Token ID262
Feature activation+2.83e+00

Pos logit contributions
|+0.102
're+0.090
text+0.086
 Ever+0.084
 You+0.083
Neg logit contributions
keeper-0.136
lf-0.132
 lady-0.132
 keeper-0.132
 vet-0.130
Token cob
Token ID22843
Feature activation-3.25e-01

Pos logit contributions
|+0.069
're+0.064
 imag+0.061
text+0.060
 eyel+0.060
Neg logit contributions
keeper-0.058
lf-0.057
 keeper-0.055
 lady-0.055
 detector-0.054
Tokenwe
Token ID732
Feature activation+9.33e+00

Pos logit contributions
keeper+0.008
lf+0.008
 lady+0.008
 keeper+0.008
 detector+0.008
Neg logit contributions
|-0.007
're-0.006
text-0.006
 Ever-0.006
Any-0.006
Tokenbs
Token ID1443
Feature activation+3.27e+00

Pos logit contributions
|+0.208
Any+0.185
 eyel+0.184
 Ever+0.179
text+0.178
Neg logit contributions
 detector-0.203
 lady-0.198
 man-0.197
 post-0.192
lf-0.191
Token,"
Token ID553
Feature activation-4.40e-01

Pos logit contributions
|+0.089
're+0.084
text+0.080
 You+0.079
 Ever+0.079
Neg logit contributions
keeper-0.057
lf-0.055
 keeper-0.054
 lady-0.054
 vet-0.054
Token Lily
Token ID20037
Feature activation-3.84e-01

Pos logit contributions
 lady+0.010
 man+0.010
keeper+0.009
 vet+0.009
 keeper+0.009
Neg logit contributions
|-0.011
text-0.010
're-0.010
ained-0.009
Any-0.009
Token said
Token ID531
Feature activation-3.53e+00

Pos logit contributions
 detector+0.009
 buyer+0.009
 instincts+0.009
 vet+0.009
keeper+0.009
Neg logit contributions
|-0.007
're-0.007
 Ever-0.007
 You-0.007
hed-0.007
Token.
Token ID13
Feature activation-6.73e-03

Pos logit contributions
keeper+0.069
 lady+0.067
 keeper+0.066
lf+0.066
 detector+0.065
Neg logit contributions
|-0.093
're-0.083
text-0.082
Any-0.080
 Ever-0.079
Token did
Token ID750
Feature activation-1.23e+00

Pos logit contributions
|+0.008
text+0.007
Any+0.007
 Ever+0.006
 You+0.006
Neg logit contributions
keeper-0.009
 keeper-0.009
 lady-0.009
lf-0.009
 man-0.009
Token not
Token ID407
Feature activation-6.82e-01

Pos logit contributions
keeper+0.021
lf+0.020
 detector+0.020
 lady+0.019
 keeper+0.019
Neg logit contributions
|-0.035
're-0.032
 Ever-0.031
text-0.031
Any-0.031
Token know
Token ID760
Feature activation-2.93e+00

Pos logit contributions
keeper+0.016
lf+0.016
 lady+0.016
 keeper+0.016
 detector+0.016
Neg logit contributions
|-0.015
're-0.013
text-0.013
 Ever-0.012
Any-0.012
Token what
Token ID644
Feature activation+1.17e+00

Pos logit contributions
keeper+0.060
 buyer+0.057
lf+0.057
 keeper+0.057
 lady+0.056
Neg logit contributions
|-0.067
're-0.066
 You-0.063
text-0.063
Any-0.061
Token ost
Token ID23619
Feature activation+4.85e+00

Pos logit contributions
|+0.025
're+0.023
 You+0.023
 Ever+0.022
text+0.022
Neg logit contributions
keeper-0.026
lf-0.026
 detector-0.025
 keeper-0.025
 buyer-0.024
Tokenric
Token ID1173
Feature activation+9.28e+00

Pos logit contributions
|+0.068
text+0.060
Any+0.055
Whe+0.055
 Ever+0.054
Neg logit contributions
 keeper-0.145
keeper-0.144
 detector-0.143
lf-0.141
 lady-0.140
Tokenhes
Token ID956
Feature activation+2.88e+00

Pos logit contributions
|+0.206
text+0.178
 eyel+0.177
 imag+0.177
're+0.175
Neg logit contributions
lf-0.212
keeper-0.210
 lady-0.209
 keeper-0.206
 detector-0.205
Token were
Token ID547
Feature activation+8.93e-01

Pos logit contributions
|+0.067
're+0.058
text+0.057
 Ever+0.056
Any+0.056
Neg logit contributions
keeper-0.063
 lady-0.063
lf-0.063
 keeper-0.062
 detector-0.061
Token.
Token ID13
Feature activation+8.80e-01

Pos logit contributions
're+0.020
|+0.019
 You+0.017
 Ever+0.017
Any+0.017
Neg logit contributions
keeper-0.020
lf-0.019
 vet-0.019
 detector-0.019
 keeper-0.019
Token They
Token ID1119
Feature activation-7.65e-01

Pos logit contributions
|+0.013
're+0.011
text+0.011
Any+0.010
 Ever+0.010
Neg logit contributions
keeper-0.027
 lady-0.026
 keeper-0.026
lf-0.026
 detector-0.026
Token looked
Token ID3114
Feature activation+8.89e-01

Pos logit contributions
keeper+0.018
lf+0.018
 lady+0.018
 keeper+0.018
 detector+0.018
Neg logit contributions
|-0.016
're-0.015
text-0.014
 Ever-0.014
Any-0.013
Token.
Token ID13
Feature activation+6.39e-01

Pos logit contributions
|+0.050
're+0.046
text+0.043
 Ever+0.043
Any+0.043
Neg logit contributions
lf-0.048
keeper-0.048
 lady-0.047
 keeper-0.046
 vet-0.046
Token They
Token ID1119
Feature activation-9.54e-01

Pos logit contributions
|+0.011
're+0.010
text+0.009
Any+0.009
 imag+0.009
Neg logit contributions
keeper-0.018
 lady-0.018
 keeper-0.017
 vet-0.017
lf-0.017
Token were
Token ID547
Feature activation+9.23e-01

Pos logit contributions
keeper+0.022
 lady+0.022
lf+0.022
 keeper+0.021
 detector+0.021
Neg logit contributions
|-0.021
text-0.018
're-0.018
 Ever-0.018
Any-0.018
Token called
Token ID1444
Feature activation+1.84e+00

Pos logit contributions
|+0.018
're+0.016
text+0.016
 Ever+0.015
Any+0.015
Neg logit contributions
keeper-0.023
lf-0.023
 lady-0.023
 keeper-0.023
 detector-0.022
Token ost
Token ID23619
Feature activation+3.95e+00

Pos logit contributions
|+0.049
text+0.043
're+0.043
Any+0.042
 Ever+0.042
Neg logit contributions
 lady-0.035
keeper-0.034
 keeper-0.034
 man-0.033
 worker-0.033
Tokenric
Token ID1173
Feature activation+9.28e+00

Pos logit contributions
|+0.051
text+0.041
're+0.040
Any+0.038
 Ever+0.038
Neg logit contributions
keeper-0.126
 keeper-0.125
lf-0.124
 detector-0.123
 lady-0.123
Tokenhes
Token ID956
Feature activation+2.36e+00

Pos logit contributions
|+0.217
text+0.192
're+0.191
 imag+0.189
 eyel+0.188
Neg logit contributions
lf-0.199
keeper-0.196
 lady-0.195
 keeper-0.195
 detector-0.193
Token.
Token ID13
Feature activation+9.43e-01

Pos logit contributions
|+0.059
're+0.053
text+0.052
 Ever+0.051
Any+0.051
Neg logit contributions
keeper-0.047
 lady-0.046
lf-0.046
 keeper-0.046
 detector-0.045
Token Lily
Token ID20037
Feature activation+8.06e-01

Pos logit contributions
|+0.016
're+0.014
text+0.013
 Ever+0.013
Any+0.013
Neg logit contributions
keeper-0.026
lf-0.026
 lady-0.026
 keeper-0.026
 detector-0.026
Token and
Token ID290
Feature activation+1.94e+00

Pos logit contributions
|+0.018
're+0.016
text+0.016
 Ever+0.016
Any+0.016
Neg logit contributions
keeper-0.018
lf-0.018
 lady-0.017
 detector-0.017
 keeper-0.017
Token Ben
Token ID3932
Feature activation+1.28e+00

Pos logit contributions
|+0.049
text+0.045
're+0.043
Any+0.042
 Ever+0.042
Neg logit contributions
 lady-0.043
 man-0.041
 keeper-0.041
 worker-0.040
 vet-0.040
Token how
Token ID703
Feature activation+3.98e+00

Pos logit contributions
lf+0.033
keeper+0.032
 keeper+0.032
 lady+0.032
 detector+0.032
Neg logit contributions
|-0.033
're-0.030
text-0.030
 Ever-0.028
 imag-0.028
Token much
Token ID881
Feature activation+5.47e-01

Pos logit contributions
|+0.119
text+0.105
're+0.105
 eyel+0.105
ez+0.102
Neg logit contributions
keeper-0.068
 keeper-0.065
 man-0.065
 lady-0.063
 worker-0.058
Token brighter
Token ID26083
Feature activation+3.43e-01

Pos logit contributions
|+0.015
're+0.013
text+0.013
 Ever+0.013
avy+0.013
Neg logit contributions
lf-0.010
 lady-0.010
 man-0.010
keeper-0.010
 keeper-0.010
Token and
Token ID290
Feature activation+2.14e+00

Pos logit contributions
|+0.008
're+0.007
 You+0.007
Any+0.007
text+0.007
Neg logit contributions
keeper-0.007
 keeper-0.007
lf-0.007
 lady-0.007
 vet-0.007
Token sun
Token ID4252
Feature activation+3.85e+00

Pos logit contributions
|+0.051
're+0.046
Any+0.045
 Ever+0.045
 imag+0.044
Neg logit contributions
lf-0.044
 detector-0.044
keeper-0.043
 lady-0.043
 buyer-0.043
Tokenn
Token ID77
Feature activation+9.28e+00

Pos logit contributions
|+0.083
're+0.073
text+0.071
 Ever+0.070
Any+0.069
Neg logit contributions
keeper-0.091
lf-0.090
 lady-0.089
 keeper-0.089
 detector-0.088
Tokenier
Token ID959
Feature activation+4.53e-02

Pos logit contributions
|+0.172
 imag+0.152
Any+0.140
 eyel+0.137
 You+0.137
Neg logit contributions
lf-0.247
keeper-0.244
 detector-0.239
man-0.239
 lady-0.238
Token the
Token ID262
Feature activation+2.33e+00

Pos logit contributions
|+0.001
're+0.001
text+0.001
 Ever+0.001
Any+0.001
Neg logit contributions
keeper-0.001
lf-0.001
 lady-0.001
 keeper-0.001
 detector-0.001
Token other
Token ID584
Feature activation+2.96e-01

Pos logit contributions
|+0.055
text+0.049
're+0.048
Any+0.047
 Ever+0.046
Neg logit contributions
 lady-0.052
keeper-0.051
 keeper-0.050
 man-0.050
 vet-0.050
Token side
Token ID1735
Feature activation+1.21e+00

Pos logit contributions
|+0.009
ez+0.008
endo+0.008
text+0.008
Any+0.008
Neg logit contributions
 lady-0.005
 man-0.005
 door-0.004
 keeper-0.004
 vet-0.004
Token was
Token ID373
Feature activation+2.29e+00

Pos logit contributions
|+0.030
're+0.027
text+0.026
Any+0.026
 Ever+0.026
Neg logit contributions
keeper-0.025
lf-0.024
 keeper-0.024
 lady-0.024
 detector-0.024
Token the
Token ID262
Feature activation+1.77e+00

Pos logit contributions
|+0.006
text+0.005
're+0.005
 Ever+0.005
avy+0.005
Neg logit contributions
 lady-0.005
 keeper-0.005
lf-0.005
 man-0.005
 vet-0.005
Token shed
Token ID14999
Feature activation-2.33e+00

Pos logit contributions
|+0.050
text+0.046
're+0.046
 Ever+0.044
Any+0.044
Neg logit contributions
 lady-0.031
lf-0.030
keeper-0.030
 keeper-0.029
 man-0.029
Token and
Token ID290
Feature activation+2.56e+00

Pos logit contributions
lf+0.055
 buyer+0.055
 detector+0.054
 vet+0.053
Offic+0.052
Neg logit contributions
|-0.043
're-0.041
 Ever-0.038
 eyel-0.038
 You-0.038
Token the
Token ID262
Feature activation+7.13e-01

Pos logit contributions
|+0.052
're+0.045
text+0.044
 Ever+0.043
Any+0.043
Neg logit contributions
keeper-0.064
lf-0.063
 lady-0.063
 keeper-0.063
 detector-0.062
Token cob
Token ID22843
Feature activation-1.30e+00

Pos logit contributions
|+0.018
're+0.016
text+0.016
 Ever+0.016
Any+0.016
Neg logit contributions
keeper-0.014
lf-0.014
 lady-0.013
 keeper-0.013
 detector-0.013
Tokenwe
Token ID732
Feature activation+9.19e+00

Pos logit contributions
keeper+0.028
 buyer+0.028
 vet+0.026
 keeper+0.026
 mechanic+0.026
Neg logit contributions
|-0.030
Any-0.026
 eyel-0.026
 You-0.026
ez-0.026
Tokenbs
Token ID1443
Feature activation+2.58e+00

Pos logit contributions
|+0.195
Any+0.175
 eyel+0.172
 Ever+0.172
 You+0.169
Neg logit contributions
 lady-0.206
 detector-0.206
keeper-0.203
lf-0.200
 man-0.199
Token and
Token ID290
Feature activation+2.11e+00

Pos logit contributions
|+0.061
're+0.056
text+0.053
 Ever+0.052
Any+0.052
Neg logit contributions
keeper-0.055
lf-0.054
 lady-0.053
 keeper-0.053
 detector-0.052
Token the
Token ID262
Feature activation+1.06e+00

Pos logit contributions
|+0.042
're+0.037
text+0.035
 Ever+0.035
Any+0.034
Neg logit contributions
keeper-0.054
lf-0.053
 lady-0.053
 keeper-0.052
 detector-0.052
Token spider
Token ID19230
Feature activation+4.52e-01

Pos logit contributions
|+0.027
're+0.024
text+0.024
 Ever+0.023
Any+0.023
Neg logit contributions
keeper-0.021
lf-0.020
 lady-0.020
 keeper-0.020
 detector-0.020
Token.
Token ID13
Feature activation+2.26e+00

Pos logit contributions
|+0.011
're+0.009
text+0.009
 eyel+0.009
 Ever+0.009
Neg logit contributions
keeper-0.010
lf-0.009
 lady-0.009
 keeper-0.009
 detector-0.009
Token It
Token ID632
Feature activation+6.35e-01

Pos logit contributions
|+0.030
text+0.028
Any+0.027
 imag+0.027
're+0.027
Neg logit contributions
keeper-0.037
 keeper-0.035
 lady-0.035
 man-0.035
lf-0.034
Token was
Token ID373
Feature activation+1.19e+00

Pos logit contributions
|+0.014
're+0.012
text+0.012
Any+0.012
 Ever+0.012
Neg logit contributions
lf-0.015
keeper-0.015
 keeper-0.015
 lady-0.014
 detector-0.014
Token covered
Token ID5017
Feature activation-5.25e-01

Pos logit contributions
|+0.020
're+0.018
 eyel+0.018
 Ever+0.017
Any+0.017
Neg logit contributions
keeper-0.032
 detector-0.031
lf-0.031
 buyer-0.031
 lady-0.031
Token with
Token ID351
Feature activation+2.91e+00

Pos logit contributions
lf+0.011
keeper+0.011
 lady+0.011
 keeper+0.011
 detector+0.011
Neg logit contributions
|-0.013
're-0.012
text-0.011
 imag-0.011
Any-0.011
Token cob
Token ID22843
Feature activation-3.81e-01

Pos logit contributions
|+0.064
're+0.059
text+0.056
 imag+0.056
 eyel+0.055
Neg logit contributions
keeper-0.066
lf-0.064
 keeper-0.064
 lady-0.063
 detector-0.063
Tokenwe
Token ID732
Feature activation+9.11e+00

Pos logit contributions
keeper+0.009
 lady+0.009
 keeper+0.009
 vet+0.009
 buyer+0.009
Neg logit contributions
|-0.008
're-0.007
text-0.007
 imag-0.007
 Ever-0.007
Tokenbs
Token ID1443
Feature activation+2.42e+00

Pos logit contributions
|+0.213
Any+0.197
 Ever+0.193
Whe+0.188
 eyel+0.186
Neg logit contributions
 detector-0.179
lf-0.179
 lady-0.177
ache-0.175
keeper-0.175
Token and
Token ID290
Feature activation+2.54e+00

Pos logit contributions
|+0.061
're+0.055
text+0.054
 Ever+0.053
Any+0.053
Neg logit contributions
keeper-0.048
lf-0.047
 lady-0.046
 keeper-0.046
 detector-0.046
Token the
Token ID262
Feature activation+8.82e-01

Pos logit contributions
|+0.051
're+0.047
 imag+0.044
text+0.044
 Ever+0.044
Neg logit contributions
keeper-0.061
 keeper-0.059
 lady-0.059
 detector-0.059
 vet-0.059
Token paint
Token ID7521
Feature activation+9.80e-01

Pos logit contributions
|+0.023
're+0.021
 eyel+0.021
 You+0.021
 Ever+0.020
Neg logit contributions
keeper-0.016
 detector-0.016
 keeper-0.015
 vet-0.015
 buyer-0.015
Token was
Token ID373
Feature activation+2.00e+00

Pos logit contributions
|+0.021
're+0.019
text+0.018
 eyel+0.017
 Ever+0.017
Neg logit contributions
keeper-0.023
 vet-0.022
 buyer-0.022
 keeper-0.022
 lady-0.022
Token smiles
Token ID21845
Feature activation+1.43e+00

Pos logit contributions
 man+0.028
 lady+0.028
 vet+0.026
 detector+0.026
 keeper+0.025
Neg logit contributions
|-0.026
avy-0.020
Any-0.020
ained-0.020
text-0.020
Token and
Token ID290
Feature activation+2.78e+00

Pos logit contributions
|+0.036
're+0.032
text+0.032
 Ever+0.031
Any+0.031
Neg logit contributions
keeper-0.028
 lady-0.028
lf-0.028
 keeper-0.028
 detector-0.027
Token shouts
Token ID34757
Feature activation+2.11e-01

Pos logit contributions
|+0.078
akes+0.062
avy+0.062
text+0.062
're+0.061
Neg logit contributions
 lady-0.061
 man-0.059
 keeper-0.052
 vet-0.052
keeper-0.050
Token,
Token ID11
Feature activation-3.27e+00

Pos logit contributions
|+0.007
're+0.006
Any+0.006
text+0.006
pill+0.006
Neg logit contributions
 lady-0.003
keeper-0.003
 man-0.003
 keeper-0.003
 worker-0.003
Token "
Token ID366
Feature activation-1.01e+00

Pos logit contributions
keeper+0.052
lf+0.051
 detector+0.050
 keeper+0.049
 lady+0.048
Neg logit contributions
|-0.095
're-0.088
Any-0.085
 Ever-0.085
 You-0.085
TokenWhe
Token ID10842
Feature activation+9.11e+00

Pos logit contributions
 lady+0.029
 vet+0.028
 man+0.028
 keeper+0.028
keeper+0.027
Neg logit contributions
|-0.016
avy-0.014
're-0.014
 imag-0.013
text-0.013
Tokene
Token ID68
Feature activation+3.75e+00

Pos logit contributions
pill+0.123
|+0.115
 eyel+0.115
 imag+0.113
 Ever+0.112
Neg logit contributions
lf-0.278
 lady-0.253
 detector-0.244
"-0.242
man-0.240
Token!
Token ID0
Feature activation+3.07e+00

Pos logit contributions
|+0.121
're+0.110
 imag+0.109
Any+0.108
pill+0.107
Neg logit contributions
 lady-0.050
lf-0.049
 man-0.046
 keeper-0.046
keeper-0.045
Token That
Token ID1320
Feature activation+3.88e+00

Pos logit contributions
|+0.064
 immensely+0.056
endo+0.053
 imag+0.053
avy+0.053
Neg logit contributions
 lady-0.070
 vet-0.069
 click-0.067
"-0.065
keeper-0.064
Token was
Token ID373
Feature activation+1.12e+00

Pos logit contributions
|+0.091
 Ever+0.076
text+0.076
 eyel+0.074
avy+0.074
Neg logit contributions
 lady-0.088
 man-0.087
 keeper-0.087
keeper-0.086
lf-0.084
Token fun
Token ID1257
Feature activation-6.19e-01

Pos logit contributions
|+0.028
ez+0.024
pill+0.024
avy+0.024
text+0.024
Neg logit contributions
 keeper-0.022
 lady-0.022
lf-0.022
 man-0.021
keeper-0.021
Token says
Token ID1139
Feature activation-2.31e+00

Pos logit contributions
|+0.013
're+0.011
text+0.011
Any+0.011
avy+0.011
Neg logit contributions
keeper-0.016
 lady-0.016
 keeper-0.016
 man-0.016
 detector-0.015
Token:
Token ID25
Feature activation-1.16e+00

Pos logit contributions
 detector+0.044
keeper+0.043
lf+0.043
 buyer+0.043
 keeper+0.040
Neg logit contributions
|-0.056
 You-0.054
're-0.054
 Ever-0.053
text-0.053
Token
Token ID198
Feature activation-1.49e+00

Pos logit contributions
keeper+0.019
lf+0.019
 lady+0.018
 keeper+0.018
 detector+0.018
Neg logit contributions
|-0.033
're-0.030
text-0.030
 Ever-0.029
Any-0.029
Token
Token ID198
Feature activation-2.25e+00

Pos logit contributions
keeper+0.040
 detector+0.040
lf+0.039
 buyer+0.039
 keeper+0.039
Neg logit contributions
|-0.028
text-0.024
 Ever-0.023
 You-0.023
're-0.022
Token"
Token ID1
Feature activation-2.97e+00

Pos logit contributions
 detector+0.036
keeper+0.033
lf+0.032
 recipe+0.032
 keeper+0.031
Neg logit contributions
|-0.068
text-0.063
 You-0.062
 Ever-0.061
You-0.061
TokenDear
Token ID20266
Feature activation-8.56e+00

Pos logit contributions
lf+0.073
keeper+0.073
 detector+0.071
 keeper+0.071
 worker+0.069
Neg logit contributions
|-0.061
Any-0.053
text-0.052
're-0.052
 Ever-0.052
Token Lily
Token ID20037
Feature activation+3.98e-01

Pos logit contributions
keeper+0.215
lf+0.212
 keeper+0.208
 lady+0.208
 detector+0.208
Neg logit contributions
|-0.172
're-0.150
text-0.147
 Ever-0.143
Any-0.142
Token and
Token ID290
Feature activation+5.76e-01

Pos logit contributions
|+0.008
're+0.008
 You+0.008
text+0.008
 Ever+0.008
Neg logit contributions
 buyer-0.009
 detector-0.009
keeper-0.009
lf-0.009
 vet-0.009
Token Ben
Token ID3932
Feature activation+1.41e+00

Pos logit contributions
|+0.014
text+0.011
're+0.011
avy+0.011
ained+0.011
Neg logit contributions
 lady-0.014
 man-0.013
 vet-0.013
keeper-0.013
lf-0.013
Token,
Token ID11
Feature activation-4.79e+00

Pos logit contributions
|+0.040
're+0.037
text+0.036
 Ever+0.036
Any+0.035
Neg logit contributions
keeper-0.023
lf-0.023
 lady-0.022
 keeper-0.022
 detector-0.022
Token Happy
Token ID14628
Feature activation-3.14e+00

Pos logit contributions
 lady+0.102
lf+0.099
keeper+0.098
 man+0.098
 vet+0.097
Neg logit contributions
|-0.115
avy-0.100
 eyel-0.099
text-0.099
're-0.099
Token
Token ID198
Feature activation-3.27e+00

Pos logit contributions
 detector+0.076
keeper+0.074
lf+0.074
 buyer+0.073
 keeper+0.072
Neg logit contributions
|-0.052
 Ever-0.042
text-0.041
 You-0.041
Any-0.040
TokenThe
Token ID464
Feature activation-5.95e-01

Pos logit contributions
 detector+0.072
keeper+0.067
 recipe+0.066
 buyer+0.065
 keeper+0.064
Neg logit contributions
|-0.079
Any-0.069
Whe-0.067
 Ever-0.067
You-0.067
Token thief
Token ID25906
Feature activation-2.98e+00

Pos logit contributions
 detector+0.008
lf+0.008
keeper+0.007
 recipe+0.007
 keeper+0.007
Neg logit contributions
|-0.018
're-0.017
 Ever-0.017
 You-0.017
 imag-0.016
Token replied
Token ID8712
Feature activation-3.78e+00

Pos logit contributions
keeper+0.072
 detector+0.072
 buyer+0.072
 vet+0.071
 lady+0.070
Neg logit contributions
|-0.057
're-0.051
 eyel-0.048
hed-0.047
 Ever-0.047
Token "
Token ID366
Feature activation-1.83e+00

Pos logit contributions
lf+0.065
keeper+0.063
 detector+0.063
 keeper+0.062
 lady+0.061
Neg logit contributions
|-0.102
're-0.095
 Ever-0.092
Any-0.092
text-0.092
TokenDear
Token ID20266
Feature activation-8.43e+00

Pos logit contributions
keeper+0.039
 lady+0.038
 keeper+0.037
lf+0.036
 vet+0.036
Neg logit contributions
|-0.044
're-0.040
text-0.040
 imag-0.039
 eyel-0.039
Token little
Token ID1310
Feature activation-2.05e+00

Pos logit contributions
 detector+0.198
keeper+0.195
lf+0.195
 buyer+0.187
 keeper+0.186
Neg logit contributions
|-0.176
're-0.152
 You-0.152
 Ever-0.148
text-0.146
Token girl
Token ID2576
Feature activation-1.64e+00

Pos logit contributions
lf+0.039
keeper+0.038
 detector+0.037
 keeper+0.037
 lady+0.037
Neg logit contributions
|-0.053
're-0.048
text-0.046
 Ever-0.046
 You-0.046
Token,
Token ID11
Feature activation-4.16e+00

Pos logit contributions
 detector+0.029
keeper+0.029
lf+0.029
 keeper+0.028
 lady+0.028
Neg logit contributions
|-0.043
're-0.040
text-0.039
 You-0.039
 Ever-0.038
Token I
Token ID314
Feature activation+1.35e+00

Pos logit contributions
lf+0.081
keeper+0.080
 detector+0.079
 keeper+0.078
 lady+0.076
Neg logit contributions
|-0.103
're-0.098
 You-0.096
Any-0.093
 Ever-0.091
Token need
Token ID761
Feature activation+1.73e-01

Pos logit contributions
|+0.026
text+0.022
're+0.022
 Ever+0.022
 eyel+0.022
Neg logit contributions
keeper-0.034
 lady-0.034
 keeper-0.034
lf-0.033
 detector-0.033
Token it
Token ID340
Feature activation+1.60e+00

Pos logit contributions
lf+0.005
keeper+0.005
 detector+0.005
 buyer+0.005
 keeper+0.005
Neg logit contributions
 You-0.002
're-0.002
 Ever-0.002
|-0.002
 imag-0.002
Token.
Token ID13
Feature activation+1.65e+00

Pos logit contributions
|+0.041
 Ever+0.038
're+0.038
 You+0.037
 eyel+0.037
Neg logit contributions
lf-0.029
 detector-0.029
keeper-0.028
 buyer-0.028
 vet-0.028
Token
Token ID198
Feature activation-9.65e-01

Pos logit contributions
|+0.028
're+0.025
 You+0.024
 Ever+0.024
Any+0.023
Neg logit contributions
lf-0.047
keeper-0.046
 detector-0.046
 keeper-0.045
 worker-0.044
Token
Token ID198
Feature activation-1.80e+00

Pos logit contributions
keeper+0.027
 detector+0.027
 buyer+0.027
lf+0.027
 worker+0.027
Neg logit contributions
|-0.016
text-0.013
 Ever-0.013
 You-0.013
're-0.013
Token"
Token ID1
Feature activation-2.71e+00

Pos logit contributions
 detector+0.028
keeper+0.027
 keeper+0.026
 instincts+0.026
 recipe+0.025
Neg logit contributions
|-0.052
You-0.050
 You-0.050
text-0.048
Any-0.048
TokenDear
Token ID20266
Feature activation-8.37e+00

Pos logit contributions
keeper+0.066
lf+0.066
 keeper+0.065
 lady+0.065
 detector+0.064
Neg logit contributions
|-0.056
're-0.049
text-0.048
Any-0.047
 Ever-0.047
Token Lily
Token ID20037
Feature activation+3.78e-01

Pos logit contributions
keeper+0.237
lf+0.234
 detector+0.232
 keeper+0.230
 lady+0.227
Neg logit contributions
|-0.137
're-0.120
text-0.114
 You-0.114
 Ever-0.111
Token and
Token ID290
Feature activation+8.30e-01

Pos logit contributions
|+0.008
're+0.008
 You+0.008
 Ever+0.007
text+0.007
Neg logit contributions
 detector-0.009
keeper-0.008
 buyer-0.008
 vet-0.008
lf-0.008
Token Ben
Token ID3932
Feature activation+1.55e+00

Pos logit contributions
|+0.019
text+0.015
're+0.015
ained+0.015
akes+0.014
Neg logit contributions
 lady-0.021
 man-0.020
keeper-0.020
 vet-0.020
 keeper-0.019
Token,
Token ID11
Feature activation-4.63e+00

Pos logit contributions
|+0.040
're+0.038
 You+0.036
text+0.036
 Ever+0.035
Neg logit contributions
keeper-0.029
 detector-0.029
lf-0.028
 vet-0.028
 buyer-0.027
Token we
Token ID356
Feature activation+2.22e+00

Pos logit contributions
 lady+0.102
lf+0.101
keeper+0.101
 keeper+0.099
 detector+0.098
Neg logit contributions
|-0.107
're-0.093
text-0.092
 Ever-0.091
 eyel-0.091
Token prayed
Token ID32168
Feature activation-2.21e+00

Pos logit contributions
keeper+0.044
lf+0.043
 lady+0.043
 keeper+0.043
 detector+0.042
Neg logit contributions
|-0.044
're-0.039
text-0.038
 Ever-0.037
Any-0.037
Token.
Token ID13
Feature activation+8.32e-01

Pos logit contributions
lf+0.058
 buyer+0.057
 detector+0.056
 vet+0.056
keeper+0.055
Neg logit contributions
|-0.035
're-0.034
 Ever-0.033
 eyel-0.031
Any-0.031
Token
Token ID198
Feature activation-1.34e+00

Pos logit contributions
|+0.014
 Ever+0.013
 You+0.012
're+0.012
text+0.012
Neg logit contributions
lf-0.023
 detector-0.023
keeper-0.023
 buyer-0.022
 keeper-0.022
Token
Token ID198
Feature activation-1.98e+00

Pos logit contributions
 detector+0.037
keeper+0.037
 buyer+0.036
 worker+0.036
lf+0.036
Neg logit contributions
|-0.024
 Ever-0.020
text-0.020
 You-0.019
Any-0.018
Token"
Token ID1
Feature activation-2.87e+00

Pos logit contributions
 detector+0.034
keeper+0.032
 recipe+0.031
 buyer+0.031
 instincts+0.031
Neg logit contributions
|-0.057
 Ever-0.050
 You-0.050
text-0.050
Any-0.049
TokenDear
Token ID20266
Feature activation-8.34e+00

Pos logit contributions
lf+0.061
keeper+0.061
 keeper+0.058
 detector+0.058
 worker+0.058
Neg logit contributions
|-0.068
Any-0.060
Whe-0.059
 Ever-0.059
're-0.058
Token God
Token ID1793
Feature activation-1.52e-01

Pos logit contributions
keeper+0.202
lf+0.199
 detector+0.198
 keeper+0.194
 lady+0.192
Neg logit contributions
|-0.172
're-0.150
text-0.147
 You-0.146
 Ever-0.144
Token,
Token ID11
Feature activation-4.86e+00

Pos logit contributions
lf+0.003
keeper+0.003
 detector+0.003
 vet+0.003
 keeper+0.003
Neg logit contributions
|-0.004
're-0.004
 You-0.003
 Ever-0.003
text-0.003
Token please
Token ID3387
Feature activation-3.70e+00

Pos logit contributions
keeper+0.099
lf+0.098
 detector+0.096
 keeper+0.095
 buyer+0.093
Neg logit contributions
|-0.117
're-0.110
 You-0.109
 Ever-0.102
text-0.101
Token help
Token ID1037
Feature activation-3.28e+00

Pos logit contributions
keeper+0.086
 detector+0.085
lf+0.084
 buyer+0.084
 worker+0.082
Neg logit contributions
|-0.077
're-0.077
 You-0.072
 eyel-0.068
 Ever-0.067
Token us
Token ID514
Feature activation-1.31e+00

Pos logit contributions
keeper+0.096
 buyer+0.095
 detector+0.094
 worker+0.094
 keeper+0.093
Neg logit contributions
|-0.050
text-0.043
're-0.043
 You-0.040
ee-0.038
Token slowly
Token ID6364
Feature activation-7.67e-01

Pos logit contributions
|+0.030
're+0.029
text+0.026
 Ever+0.026
 You+0.026
Neg logit contributions
keeper-0.031
lf-0.031
 buyer-0.030
 vet-0.030
 keeper-0.030
Token.
Token ID13
Feature activation+1.57e+00

Pos logit contributions
lf+0.017
keeper+0.017
 buyer+0.017
 keeper+0.016
 vet+0.016
Neg logit contributions
|-0.017
're-0.016
 eyel-0.015
 imag-0.015
 Ever-0.015
Token
Token ID198
Feature activation-9.82e-01

Pos logit contributions
|+0.029
're+0.025
text+0.025
 Ever+0.024
Any+0.024
Neg logit contributions
keeper-0.041
lf-0.041
 lady-0.041
 keeper-0.041
 detector-0.040
Token
Token ID198
Feature activation-1.84e+00

Pos logit contributions
keeper+0.027
 detector+0.027
lf+0.027
 buyer+0.026
 keeper+0.026
Neg logit contributions
|-0.018
 Ever-0.015
text-0.015
're-0.015
 You-0.014
Token"
Token ID1
Feature activation-2.82e+00

Pos logit contributions
 detector+0.028
keeper+0.027
 keeper+0.025
 instincts+0.025
 buyer+0.025
Neg logit contributions
|-0.055
 You-0.051
You-0.051
Any-0.050
text-0.050
TokenDear
Token ID20266
Feature activation-8.30e+00

Pos logit contributions
lf+0.063
keeper+0.062
 detector+0.060
 keeper+0.060
 worker+0.058
Neg logit contributions
|-0.065
Any-0.057
're-0.057
Whe-0.056
text-0.056
Token...
Token ID986
Feature activation-1.24e+00

Pos logit contributions
keeper+0.204
lf+0.201
 detector+0.199
 keeper+0.197
 lady+0.195
Neg logit contributions
|-0.168
're-0.148
text-0.144
 You-0.142
 Ever-0.141
TokenAn
Token ID2025
Feature activation+1.65e+00

Pos logit contributions
keeper+0.029
lf+0.028
 lady+0.028
 keeper+0.028
 detector+0.028
Neg logit contributions
|-0.027
're-0.024
text-0.023
 imag-0.023
 Ever-0.023
Token-
Token ID12
Feature activation-1.51e-01

Pos logit contributions
|+0.042
 Ever+0.037
're+0.037
text+0.037
 You+0.037
Neg logit contributions
keeper-0.031
lf-0.031
 keeper-0.030
 detector-0.030
 lady-0.030
Tokenna
Token ID2616
Feature activation+1.88e+00

Pos logit contributions
lf+0.003
 instincts+0.003
 detector+0.003
 keeper+0.003
 buyer+0.003
Neg logit contributions
|-0.003
 You-0.003
Any-0.003
 Ever-0.003
ee-0.003
Token...
Token ID986
Feature activation-5.24e-01

Pos logit contributions
|+0.041
're+0.036
text+0.036
Any+0.035
 eyel+0.034
Neg logit contributions
keeper-0.044
 lady-0.043
 keeper-0.043
lf-0.043
 detector-0.042
Token.
Token ID13
Feature activation+1.14e+00

Pos logit contributions
|+0.023
're+0.021
text+0.021
 Ever+0.020
Any+0.020
Neg logit contributions
keeper-0.019
lf-0.019
 lady-0.019
 keeper-0.019
 detector-0.018
Token She
Token ID1375
Feature activation-1.82e+00

Pos logit contributions
|+0.017
 You+0.016
 Ever+0.016
're+0.015
Any+0.014
Neg logit contributions
lf-0.034
 detector-0.033
keeper-0.033
 buyer-0.032
 keeper-0.032
Token said
Token ID531
Feature activation-2.37e+00

Pos logit contributions
keeper+0.041
 keeper+0.040
 detector+0.040
lf+0.040
 lady+0.040
Neg logit contributions
|-0.039
're-0.035
text-0.035
 Ever-0.034
 imag-0.034
Token,
Token ID11
Feature activation-4.48e+00

Pos logit contributions
 detector+0.049
lf+0.048
keeper+0.048
 keeper+0.047
 buyer+0.046
Neg logit contributions
|-0.053
're-0.052
 You-0.051
text-0.050
 Ever-0.049
Token "
Token ID366
Feature activation-1.48e+00

Pos logit contributions
 detector+0.092
keeper+0.090
lf+0.089
 buyer+0.088
 keeper+0.086
Neg logit contributions
 You-0.102
|-0.100
 Ever-0.098
're-0.097
Any-0.096
TokenDear
Token ID20266
Feature activation-8.28e+00

Pos logit contributions
 lady+0.034
keeper+0.033
 keeper+0.033
 vet+0.032
lf+0.032
Neg logit contributions
|-0.033
're-0.030
text-0.029
 eyel-0.029
 imag-0.029
Token Tom
Token ID4186
Feature activation-2.08e-01

Pos logit contributions
keeper+0.225
 detector+0.224
lf+0.220
 buyer+0.216
 keeper+0.212
Neg logit contributions
|-0.140
 You-0.126
're-0.125
text-0.120
 Ever-0.117
Token,
Token ID11
Feature activation-5.52e+00

Pos logit contributions
keeper+0.004
 detector+0.004
lf+0.004
 buyer+0.004
 vet+0.004
Neg logit contributions
|-0.005
're-0.005
 You-0.005
text-0.005
 Ever-0.005
Token I
Token ID314
Feature activation+1.81e+00

Pos logit contributions
keeper+0.130
 detector+0.128
lf+0.126
 keeper+0.125
 buyer+0.123
Neg logit contributions
|-0.115
 You-0.107
're-0.107
 Ever-0.101
Any-0.100
Token made
Token ID925
Feature activation-2.64e-01

Pos logit contributions
|+0.035
text+0.030
're+0.029
 Ever+0.029
 eyel+0.028
Neg logit contributions
keeper-0.046
 lady-0.046
lf-0.046
 keeper-0.045
 detector-0.045
Token this
Token ID428
Feature activation-1.93e+00

Pos logit contributions
lf+0.006
keeper+0.006
 lady+0.006
 keeper+0.006
 man+0.006
Neg logit contributions
|-0.006
're-0.005
text-0.005
Any-0.005
 Ever-0.005
Token said
Token ID531
Feature activation-2.63e+00

Pos logit contributions
 instincts+0.037
 buyer+0.036
 vet+0.036
 detector+0.035
Offic+0.034
Neg logit contributions
|-0.021
're-0.021
 eyel-0.020
 Ever-0.020
 You-0.019
Token:
Token ID25
Feature activation-8.54e-01

Pos logit contributions
 detector+0.062
 buyer+0.057
lf+0.056
 vet+0.055
 instincts+0.054
Neg logit contributions
 You-0.053
|-0.051
're-0.051
 Ever-0.051
text-0.050
Token
Token ID198
Feature activation-1.75e+00

Pos logit contributions
keeper+0.014
lf+0.013
 lady+0.013
 keeper+0.013
 detector+0.013
Neg logit contributions
|-0.025
're-0.023
text-0.022
 Ever-0.022
Any-0.022
Token
Token ID198
Feature activation-2.53e+00

Pos logit contributions
 detector+0.047
lf+0.047
keeper+0.046
 keeper+0.045
 buyer+0.045
Neg logit contributions
|-0.034
text-0.028
 Ever-0.027
 You-0.027
're-0.027
Token"
Token ID1
Feature activation-2.61e+00

Pos logit contributions
 detector+0.039
lf+0.039
keeper+0.037
 keeper+0.036
 worker+0.034
Neg logit contributions
|-0.077
text-0.070
're-0.069
Any-0.068
 You-0.068
TokenDear
Token ID20266
Feature activation-8.21e+00

Pos logit contributions
lf+0.055
keeper+0.054
 detector+0.053
 keeper+0.053
 worker+0.052
Neg logit contributions
|-0.063
're-0.056
text-0.055
Any-0.055
 Ever-0.055
Token Sara
Token ID24799
Feature activation+1.08e+00

Pos logit contributions
keeper+0.212
lf+0.210
 detector+0.210
 keeper+0.203
 buyer+0.201
Neg logit contributions
|-0.153
're-0.135
 You-0.132
text-0.131
 Ever-0.129
Token,
Token ID11
Feature activation-5.01e+00

Pos logit contributions
|+0.023
're+0.023
 You+0.022
text+0.022
 Ever+0.021
Neg logit contributions
 detector-0.024
keeper-0.024
 buyer-0.023
lf-0.022
 keeper-0.022
Token
Token ID198
Feature activation-2.44e+00

Pos logit contributions
keeper+0.106
 keeper+0.103
lf+0.103
 detector+0.103
 buyer+0.100
Neg logit contributions
|-0.119
're-0.110
 You-0.107
 Ever-0.105
text-0.104
Token
Token ID198
Feature activation-2.78e+00

Pos logit contributions
 detector+0.053
keeper+0.051
 buyer+0.050
 instincts+0.049
 recipe+0.049
Neg logit contributions
|-0.060
text-0.052
 You-0.051
 Ever-0.051
You-0.050
TokenThank
Token ID10449
Feature activation-3.43e+00

Pos logit contributions
 detector+0.058
keeper+0.053
 recipe+0.052
lf+0.051
 instincts+0.051
Neg logit contributions
|-0.070
You-0.066
text-0.062
 You-0.062
 Ever-0.059
Token Tom
Token ID4186
Feature activation+1.37e+00

Pos logit contributions
keeper+0.104
 detector+0.101
 keeper+0.100
 buyer+0.098
 worker+0.098
Neg logit contributions
|-0.072
're-0.069
 You-0.065
 Ever-0.063
 eyel-0.062
Token.
Token ID13
Feature activation+1.72e+00

Pos logit contributions
|+0.032
're+0.029
 You+0.027
text+0.027
 Ever+0.027
Neg logit contributions
keeper-0.030
lf-0.030
 detector-0.029
 keeper-0.029
 lady-0.029
Token
Token ID198
Feature activation-1.07e+00

Pos logit contributions
 You+0.023
 Ever+0.020
're+0.019
|+0.019
 That+0.018
Neg logit contributions
lf-0.056
 detector-0.054
keeper-0.053
 instincts-0.053
 keeper-0.053
Token
Token ID198
Feature activation-1.93e+00

Pos logit contributions
keeper+0.034
 buyer+0.034
 detector+0.034
 instincts+0.033
 worker+0.033
Neg logit contributions
|-0.015
text-0.012
 Ever-0.012
 You-0.011
're-0.010
Token"
Token ID1
Feature activation-2.55e+00

Pos logit contributions
 detector+0.038
 instincts+0.036
keeper+0.035
 lair+0.035
 recipe+0.034
Neg logit contributions
You-0.049
 You-0.047
|-0.046
�-0.046
text-0.044
TokenDear
Token ID20266
Feature activation-8.20e+00

Pos logit contributions
keeper+0.058
lf+0.057
 detector+0.056
 keeper+0.056
 instincts+0.056
Neg logit contributions
|-0.057
're-0.051
Whe-0.050
Any-0.049
 Ever-0.049
Token Sara
Token ID24799
Feature activation+1.14e+00

Pos logit contributions
keeper+0.202
lf+0.199
 detector+0.198
 keeper+0.194
 buyer+0.191
Neg logit contributions
|-0.164
're-0.148
 You-0.142
text-0.142
 Ever-0.138
Token and
Token ID290
Feature activation+9.71e-01

Pos logit contributions
're+0.021
 You+0.021
|+0.021
 Ever+0.020
text+0.019
Neg logit contributions
 detector-0.028
keeper-0.027
 buyer-0.026
 vet-0.026
lf-0.025
Token Tom
Token ID4186
Feature activation+1.13e+00

Pos logit contributions
|+0.024
text+0.020
're+0.020
avy+0.020
Any+0.020
Neg logit contributions
 lady-0.022
keeper-0.021
 man-0.021
lf-0.020
 vet-0.020
Token,
Token ID11
Feature activation-4.31e+00

Pos logit contributions
|+0.032
're+0.030
text+0.029
 Ever+0.028
 You+0.028
Neg logit contributions
keeper-0.019
lf-0.019
 detector-0.018
 lady-0.018
 keeper-0.018
Token I
Token ID314
Feature activation+2.28e+00

Pos logit contributions
 lady+0.090
keeper+0.090
lf+0.089
 keeper+0.088
 detector+0.086
Neg logit contributions
|-0.104
text-0.091
're-0.090
 eyel-0.089
 Ever-0.089
Token Santa
Token ID8909
Feature activation+4.26e-01

Pos logit contributions
|+0.008
're+0.007
text+0.007
 Ever+0.006
Any+0.006
Neg logit contributions
keeper-0.008
lf-0.008
 keeper-0.008
 lady-0.008
 detector-0.008
Token.
Token ID13
Feature activation+1.98e+00

Pos logit contributions
|+0.010
're+0.010
text+0.009
 You+0.009
 Ever+0.009
Neg logit contributions
lf-0.008
 detector-0.008
keeper-0.008
 vet-0.008
 lady-0.008
Token
Token ID198
Feature activation-8.81e-01

Pos logit contributions
|+0.028
 You+0.025
 Ever+0.025
text+0.023
're+0.023
Neg logit contributions
lf-0.061
 detector-0.059
keeper-0.059
 worker-0.059
 keeper-0.059
Token
Token ID198
Feature activation-1.76e+00

Pos logit contributions
 detector+0.027
 buyer+0.027
 instincts+0.027
keeper+0.027
 worker+0.027
Neg logit contributions
|-0.013
text-0.010
 Ever-0.010
 You-0.009
Any-0.008
Token"
Token ID1
Feature activation-2.90e+00

Pos logit contributions
 instincts+0.029
 detector+0.029
keeper+0.028
 keeper+0.027
ache+0.027
Neg logit contributions
|-0.050
You-0.047
 You-0.046
text-0.045
 Ever-0.044
TokenDear
Token ID20266
Feature activation-8.18e+00

Pos logit contributions
keeper+0.062
lf+0.061
 keeper+0.061
 detector+0.060
 lady+0.059
Neg logit contributions
|-0.069
're-0.061
Any-0.060
text-0.060
 Ever-0.059
Token Santa
Token ID8909
Feature activation-6.33e-01

Pos logit contributions
keeper+0.169
lf+0.167
 detector+0.165
 keeper+0.165
 lady+0.161
Neg logit contributions
|-0.195
're-0.176
text-0.173
 You-0.172
 Ever-0.170
Token,
Token ID11
Feature activation-4.72e+00

Pos logit contributions
lf+0.012
 detector+0.012
keeper+0.012
 vet+0.011
 lady+0.011
Neg logit contributions
|-0.016
're-0.015
 You-0.014
 Ever-0.014
text-0.014
Token thank
Token ID5875
Feature activation-2.55e+00

Pos logit contributions
keeper+0.099
lf+0.097
 keeper+0.096
 detector+0.095
 lady+0.095
Neg logit contributions
|-0.114
're-0.104
text-0.100
 You-0.099
Any-0.099
Token you
Token ID345
Feature activation+1.04e+00

Pos logit contributions
keeper+0.074
lf+0.073
 lady+0.073
 keeper+0.073
 detector+0.073
Neg logit contributions
|-0.040
're-0.034
text-0.033
 Ever-0.032
Any-0.031
Token for
Token ID329
Feature activation+3.31e-01

Pos logit contributions
|+0.022
're+0.021
text+0.019
 Ever+0.019
 You+0.019
Neg logit contributions
 detector-0.025
lf-0.025
keeper-0.024
 buyer-0.023
 vet-0.023
Token mom
Token ID1995
Feature activation-4.03e-01

Pos logit contributions
 detector+0.013
lf+0.013
keeper+0.013
 buyer+0.013
 keeper+0.012
Neg logit contributions
|-0.021
're-0.020
 imag-0.019
 Ever-0.019
 eyel-0.019
Token.
Token ID13
Feature activation+1.06e+00

Pos logit contributions
lf+0.008
keeper+0.008
 detector+0.008
 lady+0.008
 keeper+0.008
Neg logit contributions
|-0.010
're-0.009
text-0.008
 Ever-0.008
 You-0.008
Token
Token ID198
Feature activation-1.10e+00

Pos logit contributions
|+0.019
're+0.016
text+0.015
Any+0.015
 Ever+0.015
Neg logit contributions
keeper-0.029
lf-0.029
 lady-0.029
 keeper-0.029
 detector-0.028
Token
Token ID198
Feature activation-1.67e+00

Pos logit contributions
keeper+0.029
 detector+0.029
 keeper+0.028
 buyer+0.028
 worker+0.028
Neg logit contributions
|-0.022
text-0.018
 Ever-0.018
 You-0.017
Any-0.017
Token"
Token ID1
Feature activation-2.98e+00

Pos logit contributions
keeper+0.026
 detector+0.026
 keeper+0.024
 buyer+0.023
 worker+0.023
Neg logit contributions
|-0.051
Any-0.045
text-0.045
 You-0.045
 Ever-0.044
TokenDear
Token ID20266
Feature activation-8.12e+00

Pos logit contributions
keeper+0.063
lf+0.063
 detector+0.062
 keeper+0.061
 instincts+0.060
Neg logit contributions
|-0.071
Any-0.063
Whe-0.063
're-0.062
 You-0.062
Token Sara
Token ID24799
Feature activation+1.48e+00

Pos logit contributions
keeper+0.195
 detector+0.193
lf+0.193
 keeper+0.188
 buyer+0.186
Neg logit contributions
|-0.168
're-0.146
 You-0.143
text-0.143
 Ever-0.141
Token,
Token ID11
Feature activation-5.45e+00

Pos logit contributions
're+0.029
|+0.028
 You+0.027
 Ever+0.026
text+0.025
Neg logit contributions
 detector-0.036
 buyer-0.035
keeper-0.035
lf-0.033
 vet-0.033
Token I
Token ID314
Feature activation+1.94e+00

Pos logit contributions
keeper+0.120
 detector+0.116
 keeper+0.116
 buyer+0.115
lf+0.114
Neg logit contributions
|-0.122
 You-0.119
're-0.117
 Ever-0.110
text-0.108
Token miss
Token ID2051
Feature activation+2.35e-02

Pos logit contributions
|+0.038
avy+0.034
text+0.034
 Ever+0.033
pill+0.033
Neg logit contributions
 lady-0.048
keeper-0.047
 keeper-0.047
 man-0.047
 vet-0.046
Token you
Token ID345
Feature activation+2.78e+00

Pos logit contributions
|+0.001
're+0.000
text+0.000
Any+0.000
 Ever+0.000
Neg logit contributions
keeper-0.001
 lady-0.001
lf-0.001
 keeper-0.001
 detector-0.001
Token together
Token ID1978
Feature activation-5.12e-01

Pos logit contributions
|+0.078
're+0.070
text+0.069
 Ever+0.067
Any+0.067
Neg logit contributions
keeper-0.063
lf-0.062
 lady-0.062
 keeper-0.061
 detector-0.061
Token.
Token ID13
Feature activation+1.64e+00

Pos logit contributions
keeper+0.011
lf+0.011
 vet+0.011
 detector+0.011
 lady+0.011
Neg logit contributions
|-0.011
're-0.010
 Ever-0.010
text-0.010
Any-0.010
Token
Token ID198
Feature activation-5.62e-01

Pos logit contributions
|+0.031
're+0.029
 imag+0.027
text+0.027
avy+0.027
Neg logit contributions
keeper-0.042
 lady-0.041
 vet-0.040
 keeper-0.040
lf-0.040
Token
Token ID198
Feature activation-1.05e+00

Pos logit contributions
keeper+0.013
lf+0.013
 lady+0.012
 keeper+0.012
 detector+0.012
Neg logit contributions
|-0.013
're-0.011
text-0.011
Any-0.011
 Ever-0.011
Token"
Token ID1
Feature activation-2.58e+00

Pos logit contributions
keeper+0.015
lf+0.015
 detector+0.015
 keeper+0.015
 lady+0.015
Neg logit contributions
|-0.032
're-0.029
text-0.029
Any-0.029
 Ever-0.029
TokenDear
Token ID20266
Feature activation-8.12e+00

Pos logit contributions
keeper+0.060
lf+0.059
 lady+0.058
 keeper+0.058
 detector+0.057
Neg logit contributions
|-0.057
're-0.051
text-0.049
 Ever-0.048
Any-0.048
Token God
Token ID1793
Feature activation-1.94e-01

Pos logit contributions
keeper+0.193
lf+0.191
 detector+0.189
 keeper+0.187
 lady+0.186
Neg logit contributions
|-0.170
're-0.150
text-0.146
 Ever-0.143
 You-0.143
Token,
Token ID11
Feature activation-4.93e+00

Pos logit contributions
lf+0.004
keeper+0.004
 detector+0.004
 vet+0.004
 keeper+0.004
Neg logit contributions
|-0.005
're-0.005
 Ever-0.004
 You-0.004
text-0.004
Token bless
Token ID12012
Feature activation-2.77e+00

Pos logit contributions
lf+0.097
keeper+0.097
 detector+0.095
 keeper+0.093
 lady+0.091
Neg logit contributions
|-0.121
're-0.114
 You-0.111
 Ever-0.107
text-0.106
Token us
Token ID514
Feature activation+2.75e-01

Pos logit contributions
keeper+0.079
 detector+0.078
lf+0.077
 keeper+0.076
 lady+0.075
Neg logit contributions
|-0.045
 You-0.038
're-0.038
 Ever-0.037
text-0.037
Token and
Token ID290
Feature activation+9.66e-01

Pos logit contributions
|+0.007
're+0.006
 Ever+0.006
text+0.006
 You+0.006
Neg logit contributions
keeper-0.005
 detector-0.005
lf-0.005
 lady-0.005
 keeper-0.005
Token.
Token ID13
Feature activation+1.70e+00

Pos logit contributions
keeper+0.001
lf+0.001
 keeper+0.001
 buyer+0.001
 vet+0.001
Neg logit contributions
|-0.001
're-0.001
 You-0.001
text-0.001
 Ever-0.001
Token It
Token ID632
Feature activation+5.86e-01

Pos logit contributions
|+0.027
're+0.023
 Ever+0.023
 You+0.023
text+0.022
Neg logit contributions
lf-0.049
keeper-0.049
 detector-0.048
 lady-0.048
 keeper-0.048
Token says
Token ID1139
Feature activation-2.09e+00

Pos logit contributions
|+0.010
're+0.009
text+0.009
Any+0.009
 Ever+0.008
Neg logit contributions
keeper-0.016
 lady-0.016
lf-0.016
 keeper-0.016
 detector-0.015
Token:
Token ID25
Feature activation-8.80e-01

Pos logit contributions
 detector+0.046
 buyer+0.044
keeper+0.043
 vet+0.043
lf+0.042
Neg logit contributions
 You-0.045
 Ever-0.042
text-0.042
're-0.041
|-0.041
Token "
Token ID366
Feature activation-1.16e+00

Pos logit contributions
keeper+0.014
lf+0.014
 detector+0.014
 keeper+0.014
 lady+0.014
Neg logit contributions
|-0.025
're-0.023
 Ever-0.022
 You-0.022
Any-0.022
TokenDear
Token ID20266
Feature activation-8.11e+00

Pos logit contributions
keeper+0.026
 lady+0.026
 keeper+0.025
lf+0.025
 vet+0.025
Neg logit contributions
|-0.026
're-0.024
text-0.023
 eyel-0.023
 imag-0.023
Token Tim
Token ID5045
Feature activation+2.03e-01

Pos logit contributions
keeper+0.213
lf+0.209
 detector+0.209
 keeper+0.205
 lady+0.205
Neg logit contributions
|-0.150
're-0.131
text-0.127
 You-0.126
 Ever-0.124
Token,
Token ID11
Feature activation-5.04e+00

Pos logit contributions
|+0.005
're+0.005
 You+0.005
text+0.005
 Ever+0.005
Neg logit contributions
keeper-0.004
 detector-0.004
lf-0.004
 buyer-0.004
 keeper-0.004
Token we
Token ID356
Feature activation+1.87e+00

Pos logit contributions
keeper+0.114
lf+0.112
 keeper+0.111
 lady+0.110
 detector+0.110
Neg logit contributions
|-0.114
're-0.102
text-0.098
 Ever-0.097
Any-0.097
Token love
Token ID1842
Feature activation-1.62e+00

Pos logit contributions
|+0.031
 Ever+0.026
Any+0.026
 eyel+0.026
pill+0.025
Neg logit contributions
 detector-0.051
lf-0.051
 lady-0.051
keeper-0.051
 man-0.051
Token you
Token ID345
Feature activation+2.91e+00

Pos logit contributions
keeper+0.041
lf+0.040
 lady+0.040
 keeper+0.040
 detector+0.040
Neg logit contributions
|-0.032
're-0.028
text-0.027
 Ever-0.027
Any-0.027
Token her
Token ID607
Feature activation-1.78e+00

Pos logit contributions
keeper+0.071
 lady+0.071
lf+0.070
 keeper+0.070
 detector+0.069
Neg logit contributions
|-0.051
're-0.045
text-0.043
Any-0.042
 Ever-0.041
Token and
Token ID290
Feature activation+1.68e+00

Pos logit contributions
lf+0.036
keeper+0.035
 detector+0.035
 lady+0.035
 vet+0.035
Neg logit contributions
|-0.043
're-0.039
 Ever-0.038
 eyel-0.038
 You-0.037
Token said
Token ID531
Feature activation-3.41e+00

Pos logit contributions
|+0.040
're+0.037
 Ever+0.036
Any+0.035
text+0.035
Neg logit contributions
lf-0.034
 detector-0.034
keeper-0.033
 keeper-0.032
 lady-0.032
Token,
Token ID11
Feature activation-4.85e+00

Pos logit contributions
lf+0.061
 detector+0.061
keeper+0.060
 keeper+0.059
 lady+0.059
Neg logit contributions
|-0.088
're-0.083
 You-0.080
 Ever-0.080
Any-0.080
Token "
Token ID366
Feature activation-1.60e+00

Pos logit contributions
lf+0.093
 detector+0.093
keeper+0.089
 buyer+0.088
 keeper+0.088
Neg logit contributions
|-0.117
 You-0.110
're-0.110
 Ever-0.110
Any-0.108
TokenDear
Token ID20266
Feature activation-8.10e+00

Pos logit contributions
keeper+0.038
 lady+0.037
 keeper+0.037
 vet+0.036
lf+0.036
Neg logit contributions
|-0.035
're-0.031
text-0.031
 imag-0.030
 eyel-0.030
Token daughter
Token ID4957
Feature activation-1.31e+00

Pos logit contributions
keeper+0.196
lf+0.194
 detector+0.192
 keeper+0.190
 lady+0.189
Neg logit contributions
|-0.167
're-0.147
text-0.143
 Ever-0.140
 You-0.140
Token,
Token ID11
Feature activation-4.16e+00

Pos logit contributions
lf+0.023
keeper+0.023
 detector+0.023
 keeper+0.022
 lady+0.022
Neg logit contributions
|-0.035
're-0.033
text-0.032
 You-0.031
 Ever-0.031
Token I
Token ID314
Feature activation+1.40e+00

Pos logit contributions
lf+0.084
keeper+0.084
 detector+0.082
 keeper+0.082
 lady+0.081
Neg logit contributions
|-0.101
're-0.094
 You-0.090
Any-0.089
text-0.088
Token knew
Token ID2993
Feature activation-3.03e+00

Pos logit contributions
|+0.030
text+0.027
Whe+0.026
 eyel+0.025
pill+0.025
Neg logit contributions
keeper-0.033
 lady-0.032
 keeper-0.032
 man-0.031
 detector-0.031
Token that
Token ID326
Feature activation-7.48e-01

Pos logit contributions
keeper+0.069
lf+0.068
 lady+0.068
 keeper+0.067
 detector+0.066
Neg logit contributions
|-0.069
're-0.060
text-0.059
 Ever-0.057
Any-0.057
Token prayed
Token ID32168
Feature activation-1.74e+00

Pos logit contributions
keeper+0.002
lf+0.002
 lady+0.002
 keeper+0.002
 detector+0.002
Neg logit contributions
|-0.002
're-0.001
text-0.001
Any-0.001
 Ever-0.001
Token and
Token ID290
Feature activation+2.74e+00

Pos logit contributions
lf+0.046
 detector+0.045
 buyer+0.044
keeper+0.042
 vet+0.042
Neg logit contributions
|-0.029
 Ever-0.028
're-0.028
Any-0.027
 eyel-0.026
Token said
Token ID531
Feature activation-2.16e+00

Pos logit contributions
|+0.059
're+0.053
 imag+0.051
 eyel+0.051
 Ever+0.051
Neg logit contributions
keeper-0.064
lf-0.063
 detector-0.062
 keeper-0.061
 worker-0.061
Token,
Token ID11
Feature activation-4.19e+00

Pos logit contributions
keeper+0.041
lf+0.040
 keeper+0.040
 detector+0.039
 lady+0.039
Neg logit contributions
|-0.056
're-0.051
text-0.050
 Ever-0.049
Any-0.049
Token "
Token ID366
Feature activation-1.50e+00

Pos logit contributions
 detector+0.077
keeper+0.076
lf+0.075
 keeper+0.074
 buyer+0.074
Neg logit contributions
|-0.106
're-0.102
 You-0.101
Any-0.101
 Ever-0.100
TokenDear
Token ID20266
Feature activation-8.08e+00

Pos logit contributions
keeper+0.033
 lady+0.033
 keeper+0.032
lf+0.032
 vet+0.032
Neg logit contributions
|-0.035
're-0.031
text-0.031
 imag-0.030
 eyel-0.029
Token God
Token ID1793
Feature activation+1.66e-01

Pos logit contributions
keeper+0.186
lf+0.184
 detector+0.182
 keeper+0.179
 lady+0.176
Neg logit contributions
|-0.175
're-0.157
text-0.151
 You-0.151
 Ever-0.150
Token,
Token ID11
Feature activation-4.69e+00

Pos logit contributions
|+0.004
're+0.004
text+0.004
 Ever+0.004
Any+0.004
Neg logit contributions
lf-0.003
keeper-0.003
 detector-0.003
 keeper-0.003
 lady-0.003
Token please
Token ID3387
Feature activation-3.77e+00

Pos logit contributions
keeper+0.108
lf+0.106
 lady+0.105
 keeper+0.105
 detector+0.104
Neg logit contributions
|-0.104
're-0.093
text-0.090
 Ever-0.088
Any-0.088
Token forgive
Token ID20927
Feature activation-1.09e+00

Pos logit contributions
keeper+0.089
lf+0.087
 detector+0.087
 buyer+0.085
 keeper+0.084
Neg logit contributions
're-0.079
|-0.076
 You-0.076
Any-0.071
 eyel-0.070
Token us
Token ID514
Feature activation-4.40e-01

Pos logit contributions
keeper+0.030
 detector+0.029
 worker+0.029
lf+0.029
 keeper+0.029
Neg logit contributions
|-0.019
're-0.017
 You-0.017
text-0.017
Any-0.016
Token said
Token ID531
Feature activation-3.54e+00

Pos logit contributions
|+0.052
're+0.052
 Ever+0.050
 You+0.048
Any+0.048
Neg logit contributions
 detector-0.060
lf-0.058
 instincts-0.056
 worker-0.056
 recipe-0.056
Token to
Token ID284
Feature activation-5.17e+00

Pos logit contributions
lf+0.075
 detector+0.073
 vet+0.067
 buyer+0.067
 keeper+0.066
Neg logit contributions
|-0.077
're-0.077
 You-0.076
 Ever-0.074
Any-0.073
Token Frank
Token ID5278
Feature activation-5.42e-01

Pos logit contributions
keeper+0.125
lf+0.124
 lady+0.123
 keeper+0.123
 detector+0.120
Neg logit contributions
|-0.108
're-0.094
text-0.094
Any-0.090
 eyel-0.090
Token:
Token ID25
Feature activation-1.13e+00

Pos logit contributions
lf+0.011
 detector+0.010
 vet+0.010
 lady+0.010
keeper+0.010
Neg logit contributions
|-0.013
're-0.013
 You-0.012
 Ever-0.012
Any-0.012
Token "
Token ID366
Feature activation-1.03e+00

Pos logit contributions
lf+0.021
keeper+0.021
 detector+0.020
 lady+0.020
 keeper+0.020
Neg logit contributions
|-0.030
're-0.027
 Ever-0.026
Any-0.026
 You-0.026
TokenDear
Token ID20266
Feature activation-8.06e+00

Pos logit contributions
keeper+0.023
 lady+0.022
 keeper+0.022
lf+0.022
 detector+0.022
Neg logit contributions
|-0.024
're-0.021
text-0.021
 eyel-0.020
 Ever-0.020
Token all
Token ID477
Feature activation+3.30e+00

Pos logit contributions
lf+0.194
 detector+0.192
keeper+0.189
 vet+0.181
 worker+0.180
Neg logit contributions
|-0.160
're-0.150
 You-0.147
 Ever-0.142
Any-0.137
Tokenigator
Token ID23823
Feature activation+4.61e-02

Pos logit contributions
|+0.084
text+0.074
're+0.073
 imag+0.073
 eyel+0.072
Neg logit contributions
lf-0.066
 lady-0.066
keeper-0.064
 keeper-0.064
 man-0.063
Token,
Token ID11
Feature activation-3.76e+00

Pos logit contributions
|+0.001
text+0.001
're+0.001
Whe+0.001
 eyel+0.001
Neg logit contributions
keeper-0.001
 keeper-0.001
 lady-0.001
lf-0.001
 man-0.001
Token if
Token ID611
Feature activation+5.62e-01

Pos logit contributions
lf+0.084
keeper+0.083
 detector+0.081
 keeper+0.081
 lady+0.080
Neg logit contributions
|-0.084
're-0.078
 You-0.074
Any-0.073
 Ever-0.073
Token you
Token ID345
Feature activation+2.90e+00

Pos logit contributions
|+0.009
're+0.008
text+0.007
Any+0.007
 Ever+0.007
Neg logit contributions
keeper-0.016
lf-0.016
 lady-0.016
 keeper-0.016
 detector-0.016
Token.
Token ID13
Feature activation+8.25e-01

Pos logit contributions
keeper+0.000
lf+0.000
 lady+0.000
 keeper+0.000
 detector+0.000
Neg logit contributions
|-0.000
text-0.000
're-0.000
Any-0.000
 Ever-0.000
Token It
Token ID632
Feature activation+8.86e-01

Pos logit contributions
|+0.015
're+0.013
text+0.013
 imag+0.013
avy+0.013
Neg logit contributions
keeper-0.022
 lady-0.021
lf-0.021
 keeper-0.021
 vet-0.021
Token said
Token ID531
Feature activation-2.32e+00

Pos logit contributions
|+0.023
avy+0.022
text+0.021
Whe+0.020
 Ever+0.020
Neg logit contributions
keeper-0.017
 keeper-0.016
lf-0.016
 man-0.016
 lady-0.016
Token,
Token ID11
Feature activation-5.03e+00

Pos logit contributions
keeper+0.044
lf+0.044
 lady+0.043
 keeper+0.043
 detector+0.042
Neg logit contributions
|-0.060
're-0.054
text-0.053
 Ever-0.052
Any-0.052
Token "
Token ID366
Feature activation-1.21e+00

Pos logit contributions
 detector+0.088
lf+0.088
keeper+0.088
 keeper+0.085
 buyer+0.085
Neg logit contributions
|-0.130
 You-0.125
're-0.125
 Ever-0.121
Any-0.120
TokenDear
Token ID20266
Feature activation-8.03e+00

Pos logit contributions
 lady+0.024
keeper+0.024
 vet+0.023
 keeper+0.023
lf+0.022
Neg logit contributions
|-0.030
're-0.028
text-0.028
 imag-0.028
avy-0.028
Token Anna
Token ID11735
Feature activation+4.68e-01

Pos logit contributions
keeper+0.202
lf+0.199
 detector+0.199
 keeper+0.196
 lady+0.195
Neg logit contributions
|-0.157
're-0.138
text-0.134
 Ever-0.131
 You-0.131
Token,
Token ID11
Feature activation-5.37e+00

Pos logit contributions
|+0.012
're+0.011
text+0.011
 You+0.011
 Ever+0.011
Neg logit contributions
keeper-0.009
 detector-0.009
lf-0.008
 keeper-0.008
 buyer-0.008
Token I
Token ID314
Feature activation+1.94e+00

Pos logit contributions
keeper+0.110
lf+0.107
 detector+0.106
 keeper+0.106
 lady+0.103
Neg logit contributions
|-0.131
're-0.122
 You-0.118
text-0.115
Any-0.115
Token have
Token ID423
Feature activation+2.28e+00

Pos logit contributions
|+0.040
text+0.036
pill+0.035
avy+0.035
 Ever+0.035
Neg logit contributions
 lady-0.046
keeper-0.045
 man-0.045
 keeper-0.045
 vet-0.044
Token a
Token ID257
Feature activation-3.49e+00

Pos logit contributions
|+0.045
're+0.038
text+0.037
 Ever+0.036
Any+0.036
Neg logit contributions
keeper-0.058
lf-0.058
 lady-0.058
 keeper-0.057
 detector-0.056
Token.
Token ID13
Feature activation+9.50e-01

Pos logit contributions
lf+0.040
 detector+0.037
 buyer+0.037
Offic+0.036
 vet+0.036
Neg logit contributions
 Ever-0.031
 eyel-0.030
 You-0.030
|-0.028
're-0.028
Token It
Token ID632
Feature activation+6.88e-01

Pos logit contributions
|+0.014
're+0.012
 Ever+0.011
 You+0.011
text+0.011
Neg logit contributions
keeper-0.029
lf-0.029
 detector-0.028
 keeper-0.028
 lady-0.028
Token says
Token ID1139
Feature activation-1.93e+00

Pos logit contributions
|+0.014
're+0.013
text+0.012
Any+0.012
 Ever+0.012
Neg logit contributions
keeper-0.017
 lady-0.017
 keeper-0.016
lf-0.016
 detector-0.016
Token:
Token ID25
Feature activation-9.29e-01

Pos logit contributions
lf+0.037
 detector+0.037
keeper+0.036
 buyer+0.036
 vet+0.035
Neg logit contributions
|-0.045
 You-0.044
're-0.044
 Ever-0.043
Any-0.043
Token "
Token ID366
Feature activation-9.42e-01

Pos logit contributions
keeper+0.015
lf+0.014
 lady+0.014
 keeper+0.014
 detector+0.014
Neg logit contributions
|-0.027
're-0.025
text-0.024
 Ever-0.024
Any-0.024
TokenDear
Token ID20266
Feature activation-8.02e+00

Pos logit contributions
keeper+0.022
 lady+0.021
 keeper+0.021
lf+0.021
 vet+0.021
Neg logit contributions
|-0.021
're-0.019
text-0.018
 eyel-0.018
 imag-0.018
Token Sam
Token ID3409
Feature activation+1.80e-01

Pos logit contributions
keeper+0.226
lf+0.222
 detector+0.221
 keeper+0.217
 buyer+0.217
Neg logit contributions
|-0.133
're-0.116
 You-0.111
text-0.110
 Ever-0.108
Token,
Token ID11
Feature activation-5.40e+00

Pos logit contributions
|+0.005
're+0.005
text+0.004
 You+0.004
 Ever+0.004
Neg logit contributions
keeper-0.003
lf-0.003
 detector-0.003
 keeper-0.003
 lady-0.003
Token I
Token ID314
Feature activation+1.83e+00

Pos logit contributions
keeper+0.126
lf+0.123
 detector+0.122
 keeper+0.122
 lady+0.120
Neg logit contributions
|-0.117
're-0.106
 You-0.103
 Ever-0.100
text-0.100
Token like
Token ID588
Feature activation-2.46e-01

Pos logit contributions
|+0.033
're+0.029
text+0.028
 Ever+0.027
Any+0.027
Neg logit contributions
keeper-0.049
lf-0.049
 lady-0.048
 keeper-0.048
 detector-0.048
Token you
Token ID345
Feature activation+3.36e+00

Pos logit contributions
keeper+0.006
lf+0.006
 keeper+0.006
 lady+0.006
 detector+0.006
Neg logit contributions
|-0.005
're-0.005
text-0.005
 Ever-0.004
Any-0.004
Token.
Token ID13
Feature activation+1.19e+00

Pos logit contributions
|+0.032
're+0.029
text+0.029
 Ever+0.028
Any+0.028
Neg logit contributions
keeper-0.026
lf-0.026
 lady-0.026
 keeper-0.025
 detector-0.025
Token She
Token ID1375
Feature activation-1.77e+00

Pos logit contributions
|+0.017
 You+0.016
 Ever+0.016
're+0.015
Any+0.014
Neg logit contributions
lf-0.036
keeper-0.035
 detector-0.035
 buyer-0.034
 keeper-0.034
Token said
Token ID531
Feature activation-2.21e+00

Pos logit contributions
keeper+0.039
 detector+0.039
lf+0.039
 buyer+0.039
 keeper+0.038
Neg logit contributions
|-0.037
're-0.034
 imag-0.033
 Ever-0.033
text-0.033
Token:
Token ID25
Feature activation-1.66e+00

Pos logit contributions
 detector+0.044
lf+0.044
keeper+0.043
 buyer+0.042
 keeper+0.042
Neg logit contributions
|-0.050
're-0.049
 You-0.049
text-0.048
 Ever-0.047
Token "
Token ID366
Feature activation-1.44e+00

Pos logit contributions
keeper+0.029
lf+0.028
 lady+0.028
 detector+0.028
 keeper+0.028
Neg logit contributions
|-0.046
're-0.041
text-0.040
 Ever-0.040
Any-0.040
TokenDear
Token ID20266
Feature activation-7.97e+00

Pos logit contributions
keeper+0.036
lf+0.035
 lady+0.035
 keeper+0.035
 detector+0.034
Neg logit contributions
|-0.030
're-0.026
text-0.025
 Ever-0.025
Any-0.025
Token Bob
Token ID5811
Feature activation+4.01e-01

Pos logit contributions
keeper+0.204
lf+0.202
 detector+0.202
 buyer+0.195
 keeper+0.193
Neg logit contributions
|-0.149
're-0.130
 You-0.128
text-0.126
 Ever-0.124
Token,
Token ID11
Feature activation-4.97e+00

Pos logit contributions
|+0.011
're+0.010
text+0.010
 Ever+0.010
Any+0.010
Neg logit contributions
keeper-0.007
lf-0.007
 detector-0.007
 lady-0.007
 keeper-0.007
Token we
Token ID356
Feature activation+1.78e+00

Pos logit contributions
keeper+0.109
lf+0.107
 lady+0.106
 keeper+0.106
 detector+0.104
Neg logit contributions
|-0.116
're-0.104
text-0.101
 Ever-0.099
Any-0.099
Token love
Token ID1842
Feature activation-1.61e+00

Pos logit contributions
|+0.031
 eyel+0.027
 Ever+0.026
text+0.026
Whe+0.026
Neg logit contributions
 lady-0.048
 detector-0.048
keeper-0.047
 man-0.047
 vet-0.047
Token you
Token ID345
Feature activation+3.12e+00

Pos logit contributions
keeper+0.043
lf+0.043
 lady+0.043
 keeper+0.042
 man+0.041
Neg logit contributions
|-0.030
're-0.027
text-0.026
Any-0.025
 Ever-0.024
Token The
Token ID383
Feature activation-1.41e-01

Pos logit contributions
|+0.032
're+0.028
text+0.027
 imag+0.027
Any+0.026
Neg logit contributions
keeper-0.035
 lady-0.035
 keeper-0.034
lf-0.034
 vet-0.034
Token letter
Token ID3850
Feature activation-1.92e+00

Pos logit contributions
 detector+0.002
lf+0.002
 instincts+0.002
ress+0.002
keeper+0.002
Neg logit contributions
 You-0.004
 de-0.004
're-0.004
 eyel-0.004
 Ever-0.004
Token said
Token ID531
Feature activation-2.89e+00

Pos logit contributions
 instincts+0.051
 buyer+0.050
 vet+0.050
 detector+0.048
Offic+0.048
Neg logit contributions
're-0.029
|-0.029
 Ever-0.027
 You-0.026
 imag-0.026
Token:
Token ID25
Feature activation-1.01e+00

Pos logit contributions
 detector+0.067
 buyer+0.065
 vet+0.062
lf+0.061
keeper+0.061
Neg logit contributions
 You-0.057
|-0.056
're-0.056
text-0.054
Any-0.053
Token "
Token ID366
Feature activation-9.18e-01

Pos logit contributions
keeper+0.016
lf+0.016
 lady+0.016
 keeper+0.016
 detector+0.016
Neg logit contributions
|-0.029
're-0.027
text-0.026
 Ever-0.026
Any-0.026
TokenDear
Token ID20266
Feature activation-7.95e+00

Pos logit contributions
keeper+0.020
 lady+0.020
 keeper+0.020
 vet+0.019
lf+0.019
Neg logit contributions
|-0.021
're-0.019
text-0.019
 imag-0.019
 eyel-0.019
Token Sara
Token ID24799
Feature activation+1.30e+00

Pos logit contributions
keeper+0.185
lf+0.183
 detector+0.181
 keeper+0.179
 lady+0.177
Neg logit contributions
|-0.170
're-0.152
text-0.148
 You-0.146
 Ever-0.145
Token and
Token ID290
Feature activation+1.03e+00

Pos logit contributions
|+0.026
're+0.024
text+0.024
 You+0.024
 Ever+0.023
Neg logit contributions
 detector-0.031
keeper-0.031
 buyer-0.030
lf-0.030
 keeper-0.029
Token Ben
Token ID3932
Feature activation+1.58e+00

Pos logit contributions
|+0.025
text+0.021
ained+0.020
akes+0.020
're+0.020
Neg logit contributions
 lady-0.025
 man-0.024
 vet-0.023
keeper-0.023
lf-0.023
Token,
Token ID11
Feature activation-4.42e+00

Pos logit contributions
|+0.044
're+0.040
text+0.039
 Ever+0.039
 eyel+0.039
Neg logit contributions
keeper-0.027
 lady-0.027
lf-0.026
 keeper-0.026
 man-0.025
Token I
Token ID314
Feature activation+2.17e+00

Pos logit contributions
 lady+0.095
keeper+0.095
lf+0.094
 keeper+0.093
 man+0.091
Neg logit contributions
|-0.104
text-0.090
're-0.089
 eyel-0.089
 Ever-0.088
Token.
Token ID13
Feature activation+1.13e+00

Pos logit contributions
keeper+0.023
 instincts+0.023
lf+0.023
 vet+0.023
Offic+0.023
Neg logit contributions
're-0.019
|-0.019
 You-0.018
 Ever-0.017
hed-0.016
Token He
Token ID679
Feature activation-2.24e+00

Pos logit contributions
 You+0.013
 Ever+0.012
|+0.012
 That+0.010
Any+0.010
Neg logit contributions
lf-0.038
 detector-0.036
ache-0.036
keeper-0.036
 recipe-0.035
Token writes
Token ID6797
Feature activation-1.77e+00

Pos logit contributions
keeper+0.049
 lady+0.048
lf+0.048
 detector+0.047
 keeper+0.047
Neg logit contributions
|-0.053
're-0.047
text-0.046
Any-0.045
 Ever-0.044
Token:
Token ID25
Feature activation+1.35e-01

Pos logit contributions
 vet+0.046
 detector+0.046
 buyer+0.045
keeper+0.045
lf+0.044
Neg logit contributions
 You-0.028
 Ever-0.025
're-0.024
 de-0.023
Any-0.022
Token "
Token ID366
Feature activation-1.69e-01

Pos logit contributions
|+0.004
're+0.003
text+0.003
 Ever+0.003
Any+0.003
Neg logit contributions
keeper-0.002
lf-0.002
 lady-0.002
 keeper-0.002
 detector-0.002
TokenDear
Token ID20266
Feature activation-7.92e+00

Pos logit contributions
keeper+0.004
 lady+0.004
lf+0.004
 keeper+0.004
 detector+0.004
Neg logit contributions
|-0.004
're-0.004
text-0.003
 eyel-0.003
 imag-0.003
Token toys
Token ID14958
Feature activation-9.33e-01

Pos logit contributions
keeper+0.191
 detector+0.187
lf+0.183
 keeper+0.180
 buyer+0.178
Neg logit contributions
|-0.157
 You-0.144
're-0.141
text-0.136
 Ever-0.135
Token,
Token ID11
Feature activation-3.72e+00

Pos logit contributions
keeper+0.020
lf+0.020
 keeper+0.020
 detector+0.019
 vet+0.019
Neg logit contributions
|-0.020
're-0.020
 You-0.019
text-0.018
 Ever-0.018
Token I
Token ID314
Feature activation+2.16e+00

Pos logit contributions
keeper+0.087
 keeper+0.084
 detector+0.083
lf+0.083
 lady+0.082
Neg logit contributions
|-0.079
're-0.074
 You-0.073
 Ever-0.069
Any-0.069
Token love
Token ID1842
Feature activation-1.89e+00

Pos logit contributions
|+0.044
're+0.040
text+0.037
Any+0.037
 Ever+0.037
Neg logit contributions
keeper-0.053
lf-0.053
 keeper-0.052
 lady-0.052
 detector-0.051
Token you
Token ID345
Feature activation+3.10e+00

Pos logit contributions
keeper+0.049
 keeper+0.047
lf+0.047
 detector+0.047
 lady+0.047
Neg logit contributions
|-0.036
're-0.031
text-0.030
 You-0.030
 Ever-0.030
Token the
Token ID262
Feature activation-1.29e-02

Pos logit contributions
keeper+0.011
 lady+0.011
lf+0.011
 keeper+0.011
 man+0.011
Neg logit contributions
|-0.010
're-0.009
text-0.009
Any-0.008
 Ever-0.008
Token bird
Token ID6512
Feature activation-1.78e+00

Pos logit contributions
lf+0.000
 detector+0.000
keeper+0.000
Offic+0.000
 site+0.000
Neg logit contributions
're-0.000
|-0.000
 imag-0.000
 You-0.000
 Ever-0.000
Token some
Token ID617
Feature activation-5.58e-01

Pos logit contributions
keeper+0.042
 lady+0.041
 keeper+0.041
 vet+0.039
 man+0.039
Neg logit contributions
|-0.040
 imag-0.035
text-0.035
're-0.035
Whe-0.035
Token seeds
Token ID11904
Feature activation-1.18e+00

Pos logit contributions
lf+0.011
keeper+0.011
 keeper+0.011
 lady+0.011
 detector+0.011
Neg logit contributions
|-0.014
're-0.013
text-0.012
 Ever-0.012
Any-0.012
Token.
Token ID13
Feature activation+2.19e+00

Pos logit contributions
lf+0.023
keeper+0.022
 lady+0.022
 keeper+0.022
 detector+0.022
Neg logit contributions
|-0.030
're-0.028
text-0.027
 Ever-0.026
Any-0.026
Token
Token ID220
Feature activation+5.46e+00

Pos logit contributions
|+0.040
're+0.035
text+0.034
 imag+0.034
 eyel+0.033
Neg logit contributions
keeper-0.060
 lady-0.059
 keeper-0.057
lf-0.057
 vet-0.057
Token
Token ID198
Feature activation-1.78e+00

Pos logit contributions
|+0.134
're+0.120
 imag+0.119
text+0.118
 eyel+0.117
Neg logit contributions
keeper-0.112
 lady-0.108
 keeper-0.107
lf-0.106
 vet-0.104
Token
Token ID198
Feature activation-2.39e+00

Pos logit contributions
keeper+0.041
lf+0.041
 detector+0.040
 keeper+0.040
 lady+0.040
Neg logit contributions
|-0.039
're-0.034
text-0.034
 Ever-0.033
Any-0.033
TokenSuddenly
Token ID38582
Feature activation+2.63e-02

Pos logit contributions
 detector+0.045
keeper+0.044
 keeper+0.043
lf+0.043
 lady+0.042
Neg logit contributions
|-0.064
Any-0.057
're-0.056
text-0.056
 Ever-0.055
Token,
Token ID11
Feature activation-2.65e+00

Pos logit contributions
|+0.001
're+0.001
text+0.001
Any+0.001
 Ever+0.001
Neg logit contributions
keeper-0.000
 lady-0.000
lf-0.000
 keeper-0.000
 man-0.000
Token the
Token ID262
Feature activation-4.27e-01

Pos logit contributions
keeper+0.063
 lady+0.063
lf+0.062
 keeper+0.061
 man+0.061
Neg logit contributions
|-0.058
're-0.050
text-0.050
 imag-0.048
 Ever-0.048
Token don
Token ID836
Feature activation+2.40e+00

Pos logit contributions
lf+0.069
keeper+0.068
 detector+0.066
 keeper+0.064
 buyer+0.064
Neg logit contributions
're-0.063
 You-0.060
|-0.059
 eyel-0.056
Any-0.053
Token't
Token ID470
Feature activation-5.81e-01

Pos logit contributions
|+0.057
're+0.052
text+0.051
 Ever+0.049
Any+0.049
Neg logit contributions
keeper-0.051
lf-0.050
 lady-0.049
 keeper-0.049
 detector-0.049
Token walk
Token ID2513
Feature activation+1.92e+00

Pos logit contributions
keeper+0.014
lf+0.014
 detector+0.013
 worker+0.013
 buyer+0.013
Neg logit contributions
|-0.012
're-0.012
 You-0.010
 eyel-0.010
text-0.010
Token near
Token ID1474
Feature activation+2.30e+00

Pos logit contributions
|+0.045
're+0.040
text+0.039
 You+0.038
 Ever+0.038
Neg logit contributions
lf-0.043
keeper-0.041
 detector-0.041
 lady-0.040
 buyer-0.039
Token the
Token ID262
Feature activation+2.63e+00

Pos logit contributions
|+0.049
're+0.043
text+0.042
 Ever+0.041
Any+0.041
Neg logit contributions
keeper-0.055
lf-0.054
 lady-0.054
 keeper-0.053
 detector-0.053
Token all
Token ID477
Feature activation+5.74e+00

Pos logit contributions
|+0.067
're+0.060
text+0.059
 Ever+0.058
Any+0.058
Neg logit contributions
keeper-0.052
lf-0.051
 lady-0.050
 keeper-0.050
 detector-0.049
Tokenigator
Token ID23823
Feature activation+3.94e-01

Pos logit contributions
|+0.140
're+0.126
text+0.125
 eyel+0.124
 imag+0.123
Neg logit contributions
keeper-0.118
 lady-0.117
 keeper-0.117
lf-0.116
 detector-0.113
Token.
Token ID13
Feature activation+2.04e+00

Pos logit contributions
|+0.010
're+0.009
text+0.008
 Ever+0.008
 You+0.008
Neg logit contributions
lf-0.008
keeper-0.008
 lady-0.008
 detector-0.008
 keeper-0.008
Token He
Token ID679
Feature activation-1.23e+00

Pos logit contributions
|+0.037
're+0.031
text+0.031
 imag+0.031
avy+0.030
Neg logit contributions
keeper-0.054
 vet-0.054
 lady-0.053
 keeper-0.053
 detector-0.052
Token is
Token ID318
Feature activation+3.21e-01

Pos logit contributions
 man+0.023
keeper+0.023
 vet+0.022
 keeper+0.022
 lady+0.022
Neg logit contributions
|-0.033
Whe-0.030
 eyel-0.029
text-0.029
avy-0.029
Token not
Token ID407
Feature activation+9.65e-01

Pos logit contributions
|+0.008
're+0.007
text+0.007
 Ever+0.007
 imag+0.007
Neg logit contributions
 lady-0.007
keeper-0.007
 keeper-0.007
 vet-0.007
lf-0.007
Token other
Token ID584
Feature activation+2.67e-01

Pos logit contributions
|+0.029
're+0.026
text+0.026
 Ever+0.025
Any+0.025
Neg logit contributions
keeper-0.027
lf-0.027
 lady-0.027
 keeper-0.027
 detector-0.026
Token wonderful
Token ID7932
Feature activation-1.04e+00

Pos logit contributions
|+0.006
're+0.006
text+0.006
 imag+0.006
 Ever+0.005
Neg logit contributions
keeper-0.006
lf-0.006
 keeper-0.005
 lady-0.005
 detector-0.005
Token things
Token ID1243
Feature activation+2.14e+00

Pos logit contributions
keeper+0.022
lf+0.022
 keeper+0.021
 lady+0.021
 detector+0.021
Neg logit contributions
|-0.024
're-0.022
text-0.022
 imag-0.022
 Ever-0.021
Token.
Token ID13
Feature activation+2.94e+00

Pos logit contributions
|+0.054
're+0.050
text+0.048
 Ever+0.047
Any+0.047
Neg logit contributions
lf-0.042
keeper-0.041
 lady-0.041
 keeper-0.040
 detector-0.040
Tokenâ
Token ID22940
Feature activation+4.92e+00

Pos logit contributions
avy+0.082
 imag+0.081
 eyel+0.079
|+0.079
ained+0.076
Neg logit contributions
keeper-0.046
 vet-0.044
 lady-0.041
"-0.040
 keeper-0.039
Token€
Token ID26391
Feature activation+5.65e+00

Pos logit contributions
pill+0.096
text+0.095
 imag+0.093
 eyel+0.090
're+0.087
Neg logit contributions
lf-0.117
 keeper-0.117
man-0.117
 man-0.116
keeper-0.115
Token
Token ID220
Feature activation+5.39e+00

Pos logit contributions
|+0.133
're+0.117
 eyel+0.116
text+0.115
 imag+0.114
Neg logit contributions
keeper-0.121
lf-0.120
 keeper-0.117
 lady-0.116
 man-0.114
Token
Token ID198
Feature activation-1.55e+00

Pos logit contributions
|+0.144
 imag+0.134
 eyel+0.130
text+0.127
're+0.126
Neg logit contributions
keeper-0.101
 keeper-0.094
 lady-0.093
 vet-0.090
 man-0.089
Token
Token ID198
Feature activation-2.31e+00

Pos logit contributions
lf+0.028
 lady+0.028
keeper+0.028
 keeper+0.028
 man+0.027
Neg logit contributions
|-0.039
're-0.037
 imag-0.037
 eyel-0.036
text-0.035
TokenBear
Token ID36352
Feature activation-5.37e-01

Pos logit contributions
keeper+0.039
lf+0.039
 lady+0.038
 keeper+0.038
 detector+0.037
Neg logit contributions
|-0.065
're-0.060
text-0.058
 imag-0.057
 Ever-0.057
Token listened
Token ID16399
Feature activation-1.30e+00

Pos logit contributions
keeper+0.011
 keeper+0.011
 lady+0.011
lf+0.011
 man+0.011
Neg logit contributions
|-0.013
text-0.012
're-0.012
 eyel-0.011
Any-0.011
Token Lily
Token ID20037
Feature activation+2.44e-01

Pos logit contributions
 detector+0.157
keeper+0.156
 buyer+0.154
 keeper+0.153
lf+0.152
Neg logit contributions
|-0.058
 You-0.043
 eyel-0.042
're-0.042
text-0.041
Token print
Token ID3601
Feature activation+9.62e-02

Pos logit contributions
|+0.005
're+0.005
 eyel+0.004
 Ever+0.004
 imag+0.004
Neg logit contributions
lf-0.006
keeper-0.006
 keeper-0.005
 lady-0.005
 buyer-0.005
Token her
Token ID607
Feature activation-1.26e+00

Pos logit contributions
|+0.002
're+0.002
text+0.001
 Ever+0.001
Any+0.001
Neg logit contributions
keeper-0.003
lf-0.003
 keeper-0.003
 lady-0.002
 detector-0.002
Token drawings
Token ID23388
Feature activation+5.10e-01

Pos logit contributions
keeper+0.025
 lady+0.025
lf+0.025
 keeper+0.025
 man+0.024
Neg logit contributions
|-0.032
're-0.028
text-0.028
Any-0.028
 Ever-0.027
Token.
Token ID13
Feature activation+1.60e+00

Pos logit contributions
|+0.010
're+0.010
 Ever+0.009
Any+0.009
text+0.009
Neg logit contributions
lf-0.012
keeper-0.012
 vet-0.012
 buyer-0.012
 detector-0.012
Token
Token ID220
Feature activation+5.42e+00

Pos logit contributions
|+0.028
 Ever+0.024
 You+0.023
're+0.023
text+0.023
Neg logit contributions
 detector-0.044
lf-0.044
keeper-0.044
 keeper-0.043
 buyer-0.043
Token
Token ID198
Feature activation-1.92e+00

Pos logit contributions
|+0.127
're+0.114
text+0.112
 imag+0.110
Any+0.109
Neg logit contributions
keeper-0.116
lf-0.113
 lady-0.113
 keeper-0.112
 detector-0.109
Token
Token ID198
Feature activation-2.57e+00

Pos logit contributions
 detector+0.050
keeper+0.050
 keeper+0.049
lf+0.048
 worker+0.048
Neg logit contributions
|-0.039
text-0.032
 Ever-0.032
're-0.032
Any-0.032
TokenL
Token ID43
Feature activation+1.82e+00

Pos logit contributions
 detector+0.056
keeper+0.055
 keeper+0.053
 buyer+0.053
 worker+0.051
Neg logit contributions
|-0.064
Any-0.057
text-0.055
 You-0.054
 Ever-0.054
Tokenily
Token ID813
Feature activation+5.90e-01

Pos logit contributions
|+0.033
 imag+0.030
're+0.030
 eyel+0.028
 Ever+0.027
Neg logit contributions
lf-0.049
 lady-0.048
keeper-0.046
 worker-0.046
 vet-0.046
Token was
Token ID373
Feature activation+9.95e-01

Pos logit contributions
|+0.012
're+0.010
text+0.010
 Ever+0.010
Any+0.010
Neg logit contributions
keeper-0.015
lf-0.015
 keeper-0.015
 lady-0.015
 detector-0.015
Token fountain
Token ID29420
Feature activation+3.50e-01

Pos logit contributions
|+0.065
text+0.058
're+0.058
 Ever+0.057
Any+0.056
Neg logit contributions
 lady-0.043
keeper-0.042
 keeper-0.041
 detector-0.041
 man-0.041
Token and
Token ID290
Feature activation+2.59e+00

Pos logit contributions
|+0.008
're+0.008
 Ever+0.007
Any+0.007
 eyel+0.007
Neg logit contributions
lf-0.007
keeper-0.007
 detector-0.007
 vet-0.007
 lady-0.007
Token feel
Token ID1254
Feature activation+3.51e+00

Pos logit contributions
|+0.057
're+0.052
 Ever+0.049
Any+0.049
text+0.049
Neg logit contributions
keeper-0.058
lf-0.058
 detector-0.057
 lady-0.057
 keeper-0.056
Token safe
Token ID3338
Feature activation-4.92e-01

Pos logit contributions
|+0.099
text+0.086
pill+0.085
 imag+0.085
 eyel+0.084
Neg logit contributions
 lady-0.062
keeper-0.062
 man-0.062
 vet-0.061
 keeper-0.061
Token.
Token ID13
Feature activation+2.32e+00

Pos logit contributions
keeper+0.010
 lady+0.010
lf+0.009
 keeper+0.009
 detector+0.009
Neg logit contributions
|-0.013
're-0.011
text-0.011
 Ever-0.011
Any-0.011
Token<|endoftext|>
Token ID50256
Feature activation+6.05e+00

Pos logit contributions
|+0.065
 imag+0.058
're+0.058
 eyel+0.058
avy+0.055
Neg logit contributions
keeper-0.041
 lady-0.040
 man-0.039
 keeper-0.039
 vet-0.039
TokenThere
Token ID1858
Feature activation-7.10e-01

Pos logit contributions
 imag+0.149
 eyel+0.147
're+0.146
|+0.144
avy+0.139
Neg logit contributions
 lady-0.112
 man-0.110
keeper-0.109
 keeper-0.108
lf-0.105
Token was
Token ID373
Feature activation+2.11e+00

Pos logit contributions
lf+0.014
 man+0.014
 lady+0.014
keeper+0.013
 keeper+0.013
Neg logit contributions
|-0.018
 Ever-0.016
avy-0.016
 imag-0.016
 eyel-0.016
Token a
Token ID257
Feature activation-1.66e+00

Pos logit contributions
|+0.056
anced+0.051
pill+0.051
ksh+0.050
're+0.049
Neg logit contributions
lf-0.034
 lady-0.034
 vet-0.033
 man-0.032
 door-0.032
Token parade
Token ID16134
Feature activation-6.91e-01

Pos logit contributions
keeper+0.028
lf+0.028
 detector+0.028
 keeper+0.027
 lady+0.027
Neg logit contributions
|-0.046
're-0.042
text-0.041
 Ever-0.040
Any-0.040
Token coming
Token ID2406
Feature activation+1.37e+00

Pos logit contributions
keeper+0.013
lf+0.012
 lady+0.012
 keeper+0.012
 detector+0.012
Neg logit contributions
|-0.019
're-0.017
text-0.017
 imag-0.016
 Ever-0.016
Token you
Token ID345
Feature activation+2.92e+00

Pos logit contributions
 lady+0.117
lf+0.115
keeper+0.113
 keeper+0.112
 man+0.112
Neg logit contributions
|-0.130
 Ever-0.113
 eyel-0.112
text-0.112
avy-0.111
Token are
Token ID389
Feature activation+2.34e-01

Pos logit contributions
|+0.043
text+0.034
 eyel+0.034
 Ever+0.034
Any+0.033
Neg logit contributions
keeper-0.087
 lady-0.087
lf-0.086
 keeper-0.086
 detector-0.086
Token not
Token ID407
Feature activation+6.36e-01

Pos logit contributions
|+0.006
text+0.005
pill+0.005
anced+0.005
're+0.005
Neg logit contributions
 lady-0.005
 keeper-0.005
 worker-0.005
lf-0.005
 vet-0.005
Token nice
Token ID3621
Feature activation-2.36e-01

Pos logit contributions
|+0.015
're+0.013
text+0.013
 Ever+0.013
Any+0.013
Neg logit contributions
keeper-0.014
 lady-0.014
lf-0.014
 keeper-0.013
 detector-0.013
Token.
Token ID13
Feature activation+1.37e+00

Pos logit contributions
lf+0.006
keeper+0.006
 detector+0.006
 buyer+0.006
 vet+0.006
Neg logit contributions
 You-0.004
|-0.004
're-0.004
 Ever-0.004
text-0.004
Token You
Token ID921
Feature activation+4.51e+00

Pos logit contributions
 immensely+0.021
|+0.020
 imag+0.018
're+0.017
 eyel+0.017
Neg logit contributions
 vet-0.038
 lady-0.038
 mechanic-0.038
keeper-0.037
 doctor-0.037
Token are
Token ID389
Feature activation+1.59e-02

Pos logit contributions
|+0.077
 eyel+0.068
 Ever+0.066
text+0.065
Any+0.065
Neg logit contributions
 lady-0.121
keeper-0.121
 detector-0.121
lf-0.119
 man-0.119
Token not
Token ID407
Feature activation+7.53e-01

Pos logit contributions
|+0.000
text+0.000
're+0.000
anced+0.000
akes+0.000
Neg logit contributions
 lady-0.000
 worker-0.000
 keeper-0.000
 vet-0.000
lf-0.000
Token sharing
Token ID7373
Feature activation+1.47e-01

Pos logit contributions
|+0.019
're+0.017
text+0.016
 Ever+0.016
Any+0.016
Neg logit contributions
keeper-0.015
 lady-0.015
lf-0.015
 keeper-0.015
 worker-0.015
Token.
Token ID13
Feature activation+1.94e+00

Pos logit contributions
|+0.003
 You+0.003
're+0.003
 Ever+0.003
text+0.003
Neg logit contributions
lf-0.003
keeper-0.003
 detector-0.003
 vet-0.003
 buyer-0.003
Token You
Token ID921
Feature activation+4.60e+00

Pos logit contributions
 immensely+0.028
|+0.028
 imag+0.025
're+0.024
hed+0.023
Neg logit contributions
 vet-0.055
 lady-0.052
 mechanic-0.052
"-0.051
 doctor-0.050
Token shield
Token ID7614
Feature activation-1.78e+00

Pos logit contributions
keeper+0.028
lf+0.027
 detector+0.027
 worker+0.026
 buyer+0.026
Neg logit contributions
 imag-0.032
|-0.030
 eyel-0.028
're-0.028
text-0.028
Token and
Token ID290
Feature activation+1.37e+00

Pos logit contributions
keeper+0.040
 buyer+0.040
lf+0.039
 lady+0.039
 detector+0.038
Neg logit contributions
|-0.038
're-0.035
 Ever-0.032
text-0.032
 eyel-0.032
Token scarf
Token ID36953
Feature activation+5.91e-01

Pos logit contributions
|+0.025
're+0.024
 imag+0.023
text+0.022
 eyel+0.022
Neg logit contributions
keeper-0.034
lf-0.033
 keeper-0.032
 buyer-0.032
 worker-0.032
Token.
Token ID13
Feature activation+1.52e+00

Pos logit contributions
|+0.013
're+0.012
text+0.011
 Ever+0.011
Any+0.011
Neg logit contributions
lf-0.013
keeper-0.013
 buyer-0.012
 vet-0.012
 detector-0.012
Token They
Token ID1119
Feature activation-6.82e-01

Pos logit contributions
|+0.026
're+0.022
text+0.021
Any+0.020
 imag+0.020
Neg logit contributions
keeper-0.044
 vet-0.043
 lady-0.043
 keeper-0.042
 man-0.041
Token were
Token ID547
Feature activation+1.08e+00

Pos logit contributions
keeper+0.017
lf+0.017
 keeper+0.017
 lady+0.017
 detector+0.016
Neg logit contributions
|-0.013
're-0.012
text-0.011
 imag-0.011
 eyel-0.011
Token torn
Token ID12445
Feature activation+3.93e-01

Pos logit contributions
|+0.023
're+0.021
text+0.020
 Ever+0.020
Any+0.020
Neg logit contributions
keeper-0.025
lf-0.025
 lady-0.025
 keeper-0.025
 detector-0.024
Token and
Token ID290
Feature activation+3.89e+00

Pos logit contributions
|+0.009
're+0.008
text+0.008
 Ever+0.008
 eyel+0.008
Neg logit contributions
lf-0.008
keeper-0.008
 vet-0.008
 lady-0.008
 keeper-0.008
Token dirty
Token ID11841
Feature activation-3.38e-01

Pos logit contributions
|+0.094
're+0.083
text+0.082
 Ever+0.080
Any+0.080
Neg logit contributions
lf-0.083
keeper-0.083
 lady-0.082
 keeper-0.081
 detector-0.080
Token and
Token ID290
Feature activation+3.46e+00

Pos logit contributions
keeper+0.007
lf+0.006
 lady+0.006
 keeper+0.006
 detector+0.006
Neg logit contributions
|-0.009
're-0.008
text-0.008
 Ever-0.007
 eyel-0.007
Token had
Token ID550
Feature activation+3.48e+00

Pos logit contributions
|+0.083
text+0.073
're+0.073
 Ever+0.072
Any+0.071
Neg logit contributions
lf-0.074
keeper-0.074
 lady-0.073
 keeper-0.073
 man-0.071
Token was
Token ID373
Feature activation+3.26e-01

Pos logit contributions
keeper+0.079
 lady+0.078
lf+0.077
 keeper+0.077
 detector+0.076
Neg logit contributions
|-0.071
're-0.062
text-0.060
Any-0.059
 Ever-0.059
Token a
Token ID257
Feature activation-3.70e+00

Pos logit contributions
|+0.007
're+0.006
text+0.006
Any+0.006
 Ever+0.006
Neg logit contributions
keeper-0.008
lf-0.008
 detector-0.008
 lady-0.008
 keeper-0.008
Token success
Token ID1943
Feature activation-2.77e+00

Pos logit contributions
lf+0.083
 detector+0.082
keeper+0.081
 keeper+0.077
 lady+0.076
Neg logit contributions
|-0.080
're-0.073
 imag-0.070
Any-0.070
 eyel-0.069
Token!
Token ID0
Feature activation+1.99e+00

Pos logit contributions
keeper+0.056
lf+0.055
 lady+0.055
 keeper+0.054
 detector+0.054
Neg logit contributions
|-0.068
're-0.061
text-0.060
 Ever-0.059
Any-0.059
Token<|endoftext|>
Token ID50256
Feature activation+5.98e+00

Pos logit contributions
 imag+0.050
|+0.048
avy+0.048
 eyel+0.048
're+0.045
Neg logit contributions
keeper-0.040
 vet-0.039
 lady-0.038
 keeper-0.036
 man-0.035
TokenOne
Token ID3198
Feature activation+1.38e+00

Pos logit contributions
 imag+0.140
 eyel+0.140
're+0.134
|+0.131
avy+0.129
Neg logit contributions
 lady-0.117
 man-0.116
 keeper-0.114
keeper-0.112
lf-0.110
Token day
Token ID1110
Feature activation+1.98e+00

Pos logit contributions
pill+0.034
|+0.032
anced+0.032
 imag+0.031
hed+0.031
Neg logit contributions
 lady-0.031
 man-0.030
 keeper-0.028
lf-0.028
 worker-0.026
Token,
Token ID11
Feature activation-1.21e+00

Pos logit contributions
|+0.074
pill+0.069
Whe+0.067
 Ever+0.066
ez+0.066
Neg logit contributions
lf-0.017
 lady-0.017
 man-0.016
"-0.015
 worker-0.014
Token a
Token ID257
Feature activation-2.11e+00

Pos logit contributions
 lady+0.023
"+0.023
 man+0.023
 worker+0.022
keeper+0.022
Neg logit contributions
pill-0.030
|-0.030
text-0.029
 imag-0.029
're-0.028
Token little
Token ID1310
Feature activation-1.50e+00

Pos logit contributions
keeper+0.028
 lady+0.028
lf+0.028
 keeper+0.027
 detector+0.026
Neg logit contributions
|-0.067
're-0.061
text-0.060
 Ever-0.059
Any-0.059
Token bird
Token ID6512
Feature activation-3.46e-01

Pos logit contributions
 lady+0.021
 man+0.020
 worker+0.018
keeper+0.018
 vet+0.018
Neg logit contributions
|-0.050
text-0.047
're-0.047
 imag-0.046
 Ever-0.045
Token
Token ID198
Feature activation-2.07e+00

Pos logit contributions
 detector+0.031
lf+0.031
 buyer+0.031
keeper+0.031
 worker+0.030
Neg logit contributions
|-0.021
text-0.017
 Ever-0.017
 You-0.016
're-0.016
TokenL
Token ID43
Feature activation+2.14e+00

Pos logit contributions
 detector+0.045
 recipe+0.044
 instincts+0.042
lf+0.042
 buyer+0.041
Neg logit contributions
|-0.048
You-0.044
 You-0.043
text-0.043
 Ever-0.040
Tokenily
Token ID813
Feature activation+8.36e-01

Pos logit contributions
|+0.039
 imag+0.033
're+0.033
 eyel+0.032
Any+0.031
Neg logit contributions
 lady-0.059
 detector-0.058
lf-0.058
 vet-0.057
 worker-0.057
Token and
Token ID290
Feature activation+2.38e+00

Pos logit contributions
|+0.016
're+0.015
text+0.015
hed+0.015
 Ever+0.014
Neg logit contributions
 buyer-0.020
lf-0.020
 recipe-0.020
keeper-0.019
 worker-0.019
Token Ben
Token ID3932
Feature activation+1.93e+00

Pos logit contributions
|+0.059
text+0.049
 Ever+0.048
Any+0.048
pill+0.047
Neg logit contributions
 lady-0.057
 man-0.056
 vet-0.054
 doctor-0.051
lf-0.051
Token were
Token ID547
Feature activation+1.20e+00

Pos logit contributions
|+0.031
're+0.028
text+0.026
 Ever+0.025
 imag+0.024
Neg logit contributions
lf-0.055
keeper-0.055
 keeper-0.054
 lady-0.054
 detector-0.054
Token not
Token ID407
Feature activation+7.62e-01

Pos logit contributions
|+0.022
're+0.020
text+0.019
 Ever+0.018
Any+0.018
Neg logit contributions
keeper-0.032
lf-0.032
 lady-0.031
 keeper-0.031
 detector-0.031
Token okay
Token ID8788
Feature activation+2.60e-01

Pos logit contributions
|+0.016
're+0.015
 Ever+0.014
 You+0.014
text+0.014
Neg logit contributions
keeper-0.018
lf-0.018
 keeper-0.017
 detector-0.017
 lady-0.017
Token.
Token ID13
Feature activation+1.13e+00

Pos logit contributions
|+0.006
're+0.005
 You+0.005
 Ever+0.005
 eyel+0.005
Neg logit contributions
lf-0.006
keeper-0.006
 detector-0.006
 lady-0.006
 buyer-0.006
Token They
Token ID1119
Feature activation-3.73e-01

Pos logit contributions
|+0.015
're+0.012
text+0.012
 Ever+0.011
Any+0.011
Neg logit contributions
keeper-0.036
lf-0.036
 lady-0.035
 keeper-0.035
 detector-0.035
Token had
Token ID550
Feature activation+3.01e+00

Pos logit contributions
 lady+0.010
keeper+0.010
 detector+0.010
 man+0.010
 keeper+0.010
Neg logit contributions
|-0.007
Any-0.006
text-0.006
 Ever-0.006
avy-0.005
Token,
Token ID11
Feature activation-4.36e+00

Pos logit contributions
lf+0.048
 detector+0.047
keeper+0.046
 keeper+0.045
 buyer+0.045
Neg logit contributions
|-0.058
're-0.056
 You-0.054
text-0.054
 Ever-0.052
Token "
Token ID366
Feature activation-2.20e+00

Pos logit contributions
 detector+0.091
lf+0.091
keeper+0.091
 buyer+0.089
 keeper+0.086
Neg logit contributions
 You-0.096
|-0.092
're-0.091
 Ever-0.090
Any-0.089
TokenYou
Token ID1639
Feature activation+4.45e+00

Pos logit contributions
 lady+0.045
keeper+0.044
 man+0.044
 keeper+0.043
lf+0.043
Neg logit contributions
|-0.054
're-0.051
text-0.049
 imag-0.049
 eyel-0.049
Token are
Token ID389
Feature activation-6.07e-02

Pos logit contributions
|+0.064
text+0.049
 Ever+0.049
 eyel+0.049
Any+0.048
Neg logit contributions
keeper-0.137
 lady-0.136
lf-0.136
 detector-0.135
 keeper-0.134
Token mean
Token ID1612
Feature activation-5.61e-01

Pos logit contributions
 lady+0.001
 keeper+0.001
 worker+0.001
lf+0.001
 man+0.001
Neg logit contributions
|-0.002
text-0.002
pill-0.002
akes-0.002
anced-0.002
Token.
Token ID13
Feature activation+8.42e-01

Pos logit contributions
lf+0.012
keeper+0.012
 buyer+0.012
 detector+0.011
 vet+0.011
Neg logit contributions
|-0.012
're-0.012
 You-0.012
text-0.012
 Ever-0.011
Token You
Token ID921
Feature activation+4.39e+00

Pos logit contributions
|+0.014
're+0.013
 imag+0.012
 immensely+0.012
ained+0.011
Neg logit contributions
 vet-0.023
 lady-0.023
 mechanic-0.023
keeper-0.023
 worker-0.022
Token are
Token ID389
Feature activation-1.44e-02

Pos logit contributions
|+0.075
 eyel+0.063
 Ever+0.062
text+0.062
Any+0.062
Neg logit contributions
keeper-0.120
 lady-0.120
 detector-0.119
lf-0.118
 keeper-0.118
Token not
Token ID407
Feature activation+7.98e-01

Pos logit contributions
 lady+0.000
 keeper+0.000
 worker+0.000
lf+0.000
 vet+0.000
Neg logit contributions
|-0.000
text-0.000
're-0.000
akes-0.000
hed-0.000
Token a
Token ID257
Feature activation-3.04e+00

Pos logit contributions
|+0.021
text+0.019
're+0.019
 Ever+0.018
Any+0.018
Neg logit contributions
 lady-0.015
keeper-0.015
lf-0.015
 keeper-0.015
 worker-0.014
Token good
Token ID922
Feature activation-6.56e-01

Pos logit contributions
lf+0.071
keeper+0.070
 detector+0.065
 keeper+0.065
 site+0.060
Neg logit contributions
|-0.061
're-0.059
 imag-0.059
 You-0.059
Any-0.056
Token to
Token ID284
Feature activation-4.14e+00

Pos logit contributions
lf+0.073
 detector+0.071
keeper+0.069
 keeper+0.065
 buyer+0.065
Neg logit contributions
|-0.066
're-0.064
 You-0.061
Any-0.058
 Ever-0.058
Token play
Token ID711
Feature activation-5.95e-01

Pos logit contributions
keeper+0.110
 detector+0.104
lf+0.103
 keeper+0.102
 worker+0.100
Neg logit contributions
|-0.082
're-0.074
 eyel-0.065
 imag-0.065
text-0.065
Token with
Token ID351
Feature activation+2.36e+00

Pos logit contributions
keeper+0.014
 detector+0.013
ache+0.013
 instincts+0.013
 keeper+0.013
Neg logit contributions
|-0.013
 You-0.012
Any-0.011
 Ever-0.011
're-0.011
Token.
Token ID13
Feature activation+9.71e-01

Pos logit contributions
|+0.054
're+0.047
text+0.046
 Ever+0.046
Any+0.045
Neg logit contributions
keeper-0.053
 detector-0.052
lf-0.051
 keeper-0.051
 lady-0.051
Token
Token ID198
Feature activation-2.11e+00

Pos logit contributions
|+0.023
're+0.020
text+0.020
 imag+0.020
Any+0.020
Neg logit contributions
keeper-0.021
 lady-0.021
 keeper-0.020
 vet-0.020
lf-0.020
Token
Token ID198
Feature activation-2.77e+00

Pos logit contributions
keeper+0.055
 detector+0.055
lf+0.054
 buyer+0.053
 keeper+0.053
Neg logit contributions
|-0.043
text-0.034
're-0.034
 Ever-0.034
Any-0.033
TokenOne
Token ID3198
Feature activation-1.13e+00

Pos logit contributions
keeper+0.054
 detector+0.052
 keeper+0.051
lf+0.050
 lady+0.050
Neg logit contributions
|-0.074
're-0.065
Any-0.064
text-0.063
 You-0.062
Token day
Token ID1110
Feature activation-2.04e-01

Pos logit contributions
 lady+0.032
 man+0.030
lf+0.029
 worker+0.029
 keeper+0.029
Neg logit contributions
|-0.022
 imag-0.020
're-0.020
pill-0.020
anced-0.019
Token,
Token ID11
Feature activation-3.43e+00

Pos logit contributions
 lady+0.003
lf+0.003
keeper+0.003
 keeper+0.003
 man+0.003
Neg logit contributions
|-0.007
're-0.006
text-0.006
 Ever-0.006
Any-0.006
Token she
Token ID673
Feature activation-9.27e-01

Pos logit contributions
keeper+0.088
lf+0.087
 keeper+0.086
 lady+0.086
 detector+0.086
Neg logit contributions
|-0.066
're-0.058
text-0.056
 Ever-0.055
Any-0.055
Token saw
Token ID2497
Feature activation-3.04e+00

Pos logit contributions
keeper+0.020
lf+0.019
 lady+0.019
 keeper+0.019
 detector+0.019
Neg logit contributions
|-0.022
're-0.020
text-0.019
 Ever-0.019
Any-0.019
Token asked
Token ID1965
Feature activation-4.33e+00

Pos logit contributions
|+0.001
're+0.001
text+0.001
 Ever+0.001
Any+0.001
Neg logit contributions
keeper-0.001
lf-0.001
 detector-0.001
 lady-0.001
 keeper-0.001
Token Luna
Token ID23694
Feature activation-2.82e-01

Pos logit contributions
 detector+0.121
keeper+0.121
lf+0.116
 buyer+0.114
 keeper+0.114
Neg logit contributions
|-0.077
text-0.067
 You-0.065
 Ever-0.064
're-0.064
Token,
Token ID11
Feature activation-4.22e+00

Pos logit contributions
 detector+0.006
lf+0.006
keeper+0.006
 buyer+0.006
 lady+0.006
Neg logit contributions
|-0.006
're-0.006
 You-0.005
Any-0.005
 Ever-0.005
Token "
Token ID366
Feature activation-1.96e+00

Pos logit contributions
lf+0.071
keeper+0.070
 detector+0.070
 keeper+0.067
 lady+0.066
Neg logit contributions
|-0.117
're-0.110
Any-0.108
 You-0.107
 Ever-0.107
TokenWhat
Token ID2061
Feature activation+2.63e+00

Pos logit contributions
keeper+0.043
 detector+0.042
lf+0.041
 instincts+0.041
 worker+0.040
Neg logit contributions
|-0.049
Any-0.042
You-0.041
 You-0.041
Whe-0.040
Token are
Token ID389
Feature activation-7.15e-01

Pos logit contributions
|+0.054
 eyel+0.044
 Ever+0.044
text+0.043
Any+0.043
Neg logit contributions
lf-0.064
 lady-0.064
 man-0.063
keeper-0.062
 detector-0.062
Token you
Token ID345
Feature activation+4.15e+00

Pos logit contributions
lf+0.017
 lady+0.015
 man+0.015
 worker+0.015
 vet+0.015
Neg logit contributions
|-0.017
're-0.015
 Ever-0.015
hed-0.014
ez-0.014
Token playing
Token ID2712
Feature activation+5.42e-01

Pos logit contributions
|+0.084
text+0.075
're+0.070
 Ever+0.069
 eyel+0.067
Neg logit contributions
lf-0.104
 lady-0.103
 keeper-0.100
 man-0.100
 worker-0.099
Token with
Token ID351
Feature activation+2.45e+00

Pos logit contributions
|+0.012
 You+0.012
Any+0.011
 Ever+0.011
're+0.011
Neg logit contributions
keeper-0.013
 detector-0.012
 keeper-0.011
ache-0.011
 buyer-0.011
Token?"
Token ID1701
Feature activation-4.31e-01

Pos logit contributions
|+0.049
 You+0.045
're+0.044
 imag+0.042
text+0.041
Neg logit contributions
keeper-0.063
 keeper-0.059
 buyer-0.059
 detector-0.056
ache-0.056
Token Luna
Token ID23694
Feature activation-5.27e-01

Pos logit contributions
 detector+0.011
keeper+0.011
 buyer+0.011
lf+0.011
 keeper+0.010
Neg logit contributions
|-0.009
 You-0.007
 Ever-0.007
text-0.007
're-0.007
Token drink
Token ID4144
Feature activation+4.88e-01

Pos logit contributions
lf+0.055
keeper+0.053
 keeper+0.053
 detector+0.052
 vet+0.052
Neg logit contributions
|-0.057
're-0.052
text-0.048
 eyel-0.047
Any-0.046
Token.
Token ID13
Feature activation+1.56e+00

Pos logit contributions
|+0.013
're+0.012
text+0.012
 Ever+0.011
Any+0.011
Neg logit contributions
keeper-0.009
 lady-0.009
lf-0.009
 keeper-0.009
 detector-0.009
Token However
Token ID2102
Feature activation-5.37e-01

Pos logit contributions
|+0.034
text+0.029
ez+0.029
 imag+0.028
're+0.028
Neg logit contributions
keeper-0.038
 lady-0.037
 vet-0.037
 keeper-0.035
 worker-0.034
Token,
Token ID11
Feature activation-2.92e+00

Pos logit contributions
keeper+0.009
lf+0.009
 detector+0.009
 keeper+0.009
 lady+0.009
Neg logit contributions
|-0.015
're-0.014
text-0.014
 Ever-0.013
Any-0.013
Token the
Token ID262
Feature activation-3.63e-01

Pos logit contributions
keeper+0.065
 detector+0.065
lf+0.064
 keeper+0.064
 lady+0.063
Neg logit contributions
|-0.064
're-0.059
Any-0.056
 Ever-0.056
text-0.055
Token suit
Token ID6050
Feature activation-5.16e-02

Pos logit contributions
keeper+0.007
lf+0.006
 lady+0.006
 keeper+0.006
 detector+0.006
Neg logit contributions
|-0.010
're-0.009
text-0.009
 Ever-0.009
Any-0.009
Token was
Token ID373
Feature activation+6.87e-01

Pos logit contributions
lf+0.001
 buyer+0.001
 vet+0.001
keeper+0.001
 detector+0.001
Neg logit contributions
|-0.001
're-0.001
text-0.001
Any-0.001
 eyel-0.001
Token so
Token ID523
Feature activation+2.09e+00

Pos logit contributions
|+0.015
're+0.013
text+0.013
 Ever+0.012
Any+0.012
Neg logit contributions
keeper-0.016
lf-0.016
 lady-0.016
 keeper-0.016
 detector-0.016
Token special
Token ID2041
Feature activation-1.02e+00

Pos logit contributions
|+0.053
text+0.047
Whe+0.046
're+0.046
 imag+0.045
Neg logit contributions
 lady-0.044
keeper-0.044
 keeper-0.043
 man-0.042
 vet-0.041
Token that
Token ID326
Feature activation-6.55e-01

Pos logit contributions
keeper+0.019
lf+0.019
 lady+0.018
 keeper+0.018
 detector+0.018
Neg logit contributions
|-0.027
're-0.024
text-0.024
 Ever-0.024
Any-0.023
Token it
Token ID340
Feature activation+2.55e+00

Pos logit contributions
 lady+0.013
keeper+0.013
lf+0.013
 keeper+0.013
 man+0.012
Neg logit contributions
|-0.017
text-0.015
're-0.015
 Ever-0.014
avy-0.014
Token throw
Token ID3714
Feature activation+1.60e+00

Pos logit contributions
keeper+0.006
 lady+0.006
 keeper+0.006
lf+0.006
 detector+0.006
Neg logit contributions
|-0.005
text-0.005
're-0.005
Any-0.004
 Ever-0.004
Token them
Token ID606
Feature activation+9.57e-01

Pos logit contributions
|+0.040
're+0.035
text+0.035
Any+0.034
 Ever+0.034
Neg logit contributions
keeper-0.033
 lady-0.032
 keeper-0.032
lf-0.032
 detector-0.031
Token away
Token ID1497
Feature activation-6.42e-01

Pos logit contributions
|+0.025
're+0.022
text+0.022
Any+0.021
 Ever+0.021
Neg logit contributions
keeper-0.019
 lady-0.018
 keeper-0.018
lf-0.018
 detector-0.018
Token.
Token ID13
Feature activation+3.02e+00

Pos logit contributions
lf+0.016
 vet+0.015
keeper+0.015
 buyer+0.015
 mechanic+0.015
Neg logit contributions
|-0.012
're-0.011
 Ever-0.011
text-0.011
 You-0.010
Token
Token ID198
Feature activation-1.97e-01

Pos logit contributions
|+0.057
're+0.050
text+0.048
Any+0.047
 Ever+0.047
Neg logit contributions
keeper-0.079
 lady-0.078
lf-0.077
 keeper-0.077
 detector-0.076
Token
Token ID198
Feature activation-1.23e+00

Pos logit contributions
lf+0.005
keeper+0.005
 detector+0.005
 buyer+0.005
 lady+0.005
Neg logit contributions
|-0.004
text-0.003
're-0.003
 Ever-0.003
 You-0.003
Token"
Token ID1
Feature activation-2.53e+00

Pos logit contributions
 detector+0.019
keeper+0.019
lf+0.018
 keeper+0.018
 instincts+0.018
Neg logit contributions
|-0.036
 You-0.033
You-0.033
Any-0.032
text-0.032
TokenLook
Token ID8567
Feature activation+2.15e+00

Pos logit contributions
keeper+0.061
lf+0.061
 keeper+0.060
 lady+0.060
 detector+0.059
Neg logit contributions
|-0.053
're-0.047
text-0.046
Any-0.045
 Ever-0.045
Token,
Token ID11
Feature activation-3.52e+00

Pos logit contributions
|+0.055
're+0.053
text+0.050
 You+0.050
 Ever+0.049
Neg logit contributions
keeper-0.040
 detector-0.040
 vet-0.040
 keeper-0.040
lf-0.039
Token Ben
Token ID3932
Feature activation+2.00e+00

Pos logit contributions
keeper+0.091
lf+0.090
 keeper+0.089
 detector+0.088
 lady+0.088
Neg logit contributions
|-0.067
're-0.061
text-0.057
 You-0.057
Any-0.056
Token and
Token ID290
Feature activation+1.16e+00

Pos logit contributions
 You+0.035
're+0.034
 Ever+0.032
|+0.032
 de+0.030
Neg logit contributions
lf-0.050
 detector-0.049
 buyer-0.049
keeper-0.049
bel-0.047
Token grandma
Token ID49890
Feature activation+1.39e-01

Pos logit contributions
 detector+0.025
keeper+0.024
 buyer+0.023
 vet+0.023
 keeper+0.022
Neg logit contributions
|-0.044
 Ever-0.041
're-0.041
 eyel-0.040
 You-0.040
Token and
Token ID290
Feature activation+2.02e+00

Pos logit contributions
|+0.004
're+0.003
text+0.003
 Ever+0.003
Any+0.003
Neg logit contributions
keeper-0.003
lf-0.003
 lady-0.002
 keeper-0.002
 detector-0.002
Token gave
Token ID2921
Feature activation-1.26e-01

Pos logit contributions
|+0.038
're+0.033
text+0.032
 Ever+0.032
 imag+0.031
Neg logit contributions
keeper-0.052
 detector-0.052
lf-0.052
 keeper-0.051
 lady-0.051
Token her
Token ID607
Feature activation-2.29e+00

Pos logit contributions
keeper+0.003
 lady+0.003
lf+0.003
 keeper+0.003
 vet+0.003
Neg logit contributions
|-0.002
're-0.002
text-0.002
Any-0.002
 Ever-0.002
Token one
Token ID530
Feature activation-1.43e+00

Pos logit contributions
keeper+0.058
 lady+0.058
 keeper+0.056
lf+0.055
 man+0.055
Neg logit contributions
|-0.047
're-0.040
text-0.039
pill-0.039
avy-0.038
Token last
Token ID938
Feature activation-1.37e+00

Pos logit contributions
lf+0.033
 detector+0.031
keeper+0.031
 keeper+0.030
 buyer+0.030
Neg logit contributions
|-0.031
're-0.029
 Ever-0.028
text-0.028
Any-0.027
Token hug
Token ID16225
Feature activation-6.41e-01

Pos logit contributions
keeper+0.028
lf+0.028
 keeper+0.027
 lady+0.027
 detector+0.027
Neg logit contributions
|-0.033
're-0.030
text-0.029
 Ever-0.029
Any-0.029
Token.
Token ID13
Feature activation+1.97e+00

Pos logit contributions
lf+0.015
 detector+0.015
 buyer+0.015
keeper+0.015
 vet+0.015
Neg logit contributions
|-0.013
're-0.012
 You-0.012
 Ever-0.011
text-0.011
Token As
Token ID1081
Feature activation+1.33e+00

Pos logit contributions
|+0.037
're+0.032
text+0.031
 Ever+0.030
Any+0.030
Neg logit contributions
keeper-0.052
lf-0.052
 keeper-0.051
 lady-0.051
 detector-0.051
Token she
Token ID673
Feature activation+2.81e+00

Pos logit contributions
|+0.029
're+0.026
text+0.025
 Ever+0.025
Any+0.025
Neg logit contributions
keeper-0.031
 detector-0.031
 keeper-0.030
lf-0.030
 buyer-0.029
Token walked
Token ID6807
Feature activation+2.18e+00

Pos logit contributions
|+0.053
're+0.046
text+0.045
 Ever+0.043
Any+0.043
Neg logit contributions
keeper-0.074
lf-0.073
 lady-0.072
 keeper-0.072
 detector-0.072
Token a
Token ID257
Feature activation-2.47e+00

Pos logit contributions
|+0.045
're+0.038
text+0.037
 Ever+0.036
 eyel+0.035
Neg logit contributions
keeper-0.068
lf-0.066
 keeper-0.065
 detector-0.065
 lady-0.065
Token bench
Token ID7624
Feature activation+3.60e-01

Pos logit contributions
keeper+0.045
 instincts+0.044
 department+0.042
bel+0.042
keepers+0.041
Neg logit contributions
|-0.057
're-0.053
 eyel-0.052
ee-0.050
�-0.050
Token.
Token ID13
Feature activation+9.43e-01

Pos logit contributions
|+0.007
 You+0.006
're+0.006
 Ever+0.006
ained+0.006
Neg logit contributions
 instincts-0.009
keeper-0.009
 buyer-0.008
 vet-0.008
lf-0.008
Token They
Token ID1119
Feature activation-1.10e+00

Pos logit contributions
|+0.019
ez+0.017
Any+0.016
're+0.016
text+0.016
Neg logit contributions
 lady-0.024
keeper-0.023
 keeper-0.023
 man-0.023
 vet-0.022
Token ran
Token ID4966
Feature activation-4.50e-04

Pos logit contributions
keeper+0.024
 lady+0.024
lf+0.024
 keeper+0.024
 man+0.023
Neg logit contributions
|-0.026
text-0.022
Any-0.022
 Ever-0.022
 You-0.022
Token to
Token ID284
Feature activation-3.93e+00

Pos logit contributions
lf+0.000
keeper+0.000
 detector+0.000
 lady+0.000
 keeper+0.000
Neg logit contributions
|-0.000
're-0.000
text-0.000
 Ever-0.000
Any-0.000
Token see
Token ID766
Feature activation-1.62e+00

Pos logit contributions
keeper+0.085
lf+0.084
 lady+0.082
 keeper+0.082
 detector+0.082
Neg logit contributions
|-0.093
're-0.083
text-0.081
 Ever-0.080
Any-0.079
Token what
Token ID644
Feature activation+1.82e+00

Pos logit contributions
lf+0.040
keeper+0.039
 keeper+0.038
 detector+0.037
 worker+0.037
Neg logit contributions
're-0.031
|-0.031
 You-0.029
 Ever-0.029
Any-0.028
Token was
Token ID373
Feature activation-7.26e-01

Pos logit contributions
|+0.043
're+0.038
text+0.036
 Ever+0.036
Any+0.036
Neg logit contributions
keeper-0.040
 lady-0.040
lf-0.040
 keeper-0.039
 man-0.038
Token inside
Token ID2641
Feature activation-9.32e-01

Pos logit contributions
keeper+0.016
 keeper+0.015
lf+0.015
 vet+0.015
 lady+0.015
Neg logit contributions
|-0.017
're-0.015
 You-0.015
 Ever-0.015
text-0.014
Token.
Token ID13
Feature activation+1.25e+00

Pos logit contributions
keeper+0.019
lf+0.018
 lady+0.018
 keeper+0.018
 detector+0.018
Neg logit contributions
|-0.024
're-0.021
text-0.021
 Ever-0.020
Any-0.020
Token toy
Token ID13373
Feature activation-2.00e+00

Pos logit contributions
 detector+0.076
 instincts+0.069
lf+0.069
driver+0.068
 carriers+0.068
Neg logit contributions
 de-0.064
 night-0.057
 spl-0.057
 eyel-0.057
 freshly-0.057
Token store
Token ID3650
Feature activation-2.77e+00

Pos logit contributions
keeper+0.031
lf+0.030
 keeper+0.029
 detector+0.029
 instincts+0.029
Neg logit contributions
|-0.055
're-0.052
 You-0.051
 Ever-0.050
 eyel-0.050
Token.
Token ID13
Feature activation+1.06e+00

Pos logit contributions
lf+0.062
 detector+0.059
 instincts+0.058
 vet+0.056
ache+0.055
Neg logit contributions
|-0.054
 You-0.053
're-0.052
 Ever-0.052
 de-0.048
Token Jack
Token ID3619
Feature activation+2.04e-01

Pos logit contributions
|+0.019
text+0.016
're+0.016
ez+0.015
Any+0.015
Neg logit contributions
keeper-0.030
 lady-0.029
 keeper-0.029
lf-0.028
 vet-0.028
Token said
Token ID531
Feature activation-2.86e+00

Pos logit contributions
|+0.004
're+0.004
 Ever+0.003
Any+0.003
 imag+0.003
Neg logit contributions
 detector-0.005
lf-0.005
keeper-0.005
 vet-0.005
 lady-0.005
Token,
Token ID11
Feature activation-4.47e+00

Pos logit contributions
lf+0.057
 detector+0.057
keeper+0.054
 vet+0.054
 keeper+0.053
Neg logit contributions
|-0.068
're-0.066
 You-0.064
text-0.063
 Ever-0.063
Token "
Token ID366
Feature activation-1.81e+00

Pos logit contributions
 detector+0.087
lf+0.084
keeper+0.083
 buyer+0.081
 vet+0.081
Neg logit contributions
|-0.107
 You-0.104
're-0.102
 Ever-0.102
Any-0.102
TokenL
Token ID43
Feature activation+2.33e+00

Pos logit contributions
keeper+0.043
 lady+0.042
lf+0.042
 keeper+0.042
 detector+0.041
Neg logit contributions
|-0.039
're-0.035
text-0.034
 imag-0.033
 Ever-0.033
Tokenily
Token ID813
Feature activation+1.01e+00

Pos logit contributions
|+0.048
 imag+0.044
're+0.043
 eyel+0.041
text+0.041
Neg logit contributions
lf-0.057
 lady-0.056
 worker-0.056
 recipe-0.054
keeper-0.053
Token,
Token ID11
Feature activation-4.81e+00

Pos logit contributions
|+0.029
're+0.027
text+0.026
 Ever+0.026
Any+0.025
Neg logit contributions
keeper-0.017
lf-0.016
 lady-0.016
 keeper-0.016
 detector-0.016
Token let
Token ID1309
Feature activation+2.52e+00

Pos logit contributions
keeper+0.101
lf+0.099
 lady+0.098
 keeper+0.098
 detector+0.096
Neg logit contributions
|-0.116
're-0.105
text-0.102
 Ever-0.100
Any-0.100
Token,
Token ID11
Feature activation-4.60e+00

Pos logit contributions
 You+0.001
|+0.001
're+0.001
 D+0.001
 Ever+0.001
Neg logit contributions
 detector-0.001
keeper-0.001
 buyer-0.001
Offic-0.001
 vet-0.001
Token Tim
Token ID5045
Feature activation+2.42e-01

Pos logit contributions
keeper+0.138
lf+0.136
 keeper+0.134
 detector+0.134
 lady+0.133
Neg logit contributions
|-0.069
're-0.061
text-0.056
Any-0.055
 Ever-0.055
Tokenmy
Token ID1820
Feature activation-8.42e-01

Pos logit contributions
|+0.002
're+0.002
 You+0.002
Any+0.002
 Ever+0.001
Neg logit contributions
keeper-0.008
 buyer-0.008
 vet-0.008
 detector-0.008
 instincts-0.008
Token?"
Token ID1701
Feature activation+1.54e-01

Pos logit contributions
keeper+0.022
 buyer+0.022
 detector+0.021
 stall+0.021
 instincts+0.021
Neg logit contributions
 You-0.014
|-0.013
ez-0.012
're-0.011
ee-0.011
Token Sally
Token ID25737
Feature activation+3.50e-01

Pos logit contributions
|+0.003
 You+0.003
're+0.002
 Ever+0.002
Any+0.002
Neg logit contributions
 detector-0.004
lf-0.004
keeper-0.004
 buyer-0.004
 instincts-0.004
Token asked
Token ID1965
Feature activation-4.66e+00

Pos logit contributions
|+0.008
 You+0.007
're+0.007
 eyel+0.007
 Ever+0.007
Neg logit contributions
 detector-0.007
 buyer-0.007
 recipe-0.007
 carriers-0.007
keeper-0.007
Token.
Token ID13
Feature activation+1.84e-01

Pos logit contributions
keeper+0.083
 lady+0.081
lf+0.081
 keeper+0.080
 man+0.077
Neg logit contributions
|-0.130
text-0.117
're-0.116
Any-0.113
 imag-0.113
Token
Token ID220
Feature activation+4.08e+00

Pos logit contributions
|+0.004
're+0.004
text+0.004
 Ever+0.004
Any+0.004
Neg logit contributions
keeper-0.004
lf-0.004
 lady-0.004
 keeper-0.004
 detector-0.004
Token
Token ID198
Feature activation-2.85e+00

Pos logit contributions
|+0.085
're+0.076
 Ever+0.073
text+0.072
 You+0.072
Neg logit contributions
lf-0.098
keeper-0.098
 detector-0.096
 lady-0.096
 keeper-0.096
Token
Token ID198
Feature activation-3.27e+00

Pos logit contributions
 detector+0.086
ache+0.083
keeper+0.083
 instincts+0.082
lf+0.082
Neg logit contributions
|-0.045
 You-0.037
 Ever-0.036
're-0.036
Any-0.033
TokenTim
Token ID14967
Feature activation-3.84e-01

Pos logit contributions
 detector+0.057
 code+0.049
keeper+0.049
ache+0.048
 instincts+0.047
Neg logit contributions
|-0.091
 You-0.089
You-0.086
Any-0.086
're-0.083
Token to
Token ID284
Feature activation-2.30e+00

Pos logit contributions
|+0.041
text+0.037
're+0.037
Any+0.036
 Ever+0.036
Neg logit contributions
 lady-0.033
keeper-0.033
lf-0.033
 keeper-0.032
 detector-0.032
Token the
Token ID262
Feature activation+2.04e+00

Pos logit contributions
 lady+0.053
lf+0.053
keeper+0.052
 vet+0.052
 keeper+0.051
Neg logit contributions
|-0.050
're-0.045
text-0.044
Any-0.043
 Ever-0.042
Token park
Token ID3952
Feature activation-1.07e+00

Pos logit contributions
|+0.046
Any+0.041
're+0.040
 You+0.040
Whe+0.039
Neg logit contributions
 detector-0.043
lf-0.043
keeper-0.042
 instincts-0.042
 keeper-0.039
Token.
Token ID13
Feature activation+5.76e-01

Pos logit contributions
keeper+0.021
lf+0.020
 detector+0.020
 lady+0.020
 keeper+0.020
Neg logit contributions
|-0.027
're-0.025
text-0.024
 Ever-0.024
Any-0.024
Token She
Token ID1375
Feature activation-2.13e+00

Pos logit contributions
|+0.009
're+0.007
 Ever+0.007
 You+0.007
text+0.007
Neg logit contributions
lf-0.018
keeper-0.017
 detector-0.017
 keeper-0.017
 lady-0.017
Token asked
Token ID1965
Feature activation-4.23e+00

Pos logit contributions
lf+0.046
keeper+0.046
 lady+0.046
 keeper+0.045
 man+0.044
Neg logit contributions
|-0.050
're-0.045
text-0.043
Any-0.043
 Ever-0.043
Token her
Token ID607
Feature activation-2.79e+00

Pos logit contributions
keeper+0.128
 detector+0.126
lf+0.125
 keeper+0.125
 buyer+0.122
Neg logit contributions
|-0.062
're-0.055
text-0.053
 Ever-0.051
 eyel-0.049
Token mom
Token ID1995
Feature activation-2.77e+00

Pos logit contributions
 detector+0.064
lf+0.057
 site+0.056
keeper+0.055
heads+0.054
Neg logit contributions
 imag-0.056
text-0.056
 Ever-0.056
're-0.055
 eyel-0.053
Token if
Token ID611
Feature activation+3.01e-02

Pos logit contributions
lf+0.079
 detector+0.078
 vet+0.075
Offic+0.074
uers+0.073
Neg logit contributions
're-0.048
 You-0.041
|-0.038
 Ever-0.036
Any-0.035
Token she
Token ID673
Feature activation-3.44e-01

Pos logit contributions
|+0.001
're+0.001
Any+0.001
text+0.001
 You+0.001
Neg logit contributions
lf-0.001
 detector-0.001
keeper-0.001
 keeper-0.001
 lady-0.001
Token could
Token ID714
Feature activation-1.88e+00

Pos logit contributions
keeper+0.007
 lady+0.007
 keeper+0.007
lf+0.007
 man+0.007
Neg logit contributions
|-0.008
're-0.007
text-0.007
 Ever-0.007
Any-0.007
Token it
Token ID340
Feature activation+3.13e+00

Pos logit contributions
lf+0.024
keeper+0.023
 lady+0.023
 keeper+0.023
 detector+0.022
Neg logit contributions
|-0.034
're-0.030
text-0.030
 imag-0.029
 Ever-0.029
Token was
Token ID373
Feature activation+1.16e+00

Pos logit contributions
|+0.076
're+0.068
text+0.066
 Ever+0.065
Any+0.065
Neg logit contributions
keeper-0.066
lf-0.064
 lady-0.064
 keeper-0.064
 detector-0.063
Token a
Token ID257
Feature activation-3.67e+00

Pos logit contributions
|+0.029
text+0.026
're+0.024
Whe+0.024
avy+0.023
Neg logit contributions
 keeper-0.025
 lady-0.025
keeper-0.025
 man-0.024
lf-0.024
Token good
Token ID922
Feature activation-1.63e+00

Pos logit contributions
lf+0.091
keeper+0.088
 detector+0.087
 keeper+0.083
 lady+0.081
Neg logit contributions
|-0.069
 imag-0.065
're-0.064
Any-0.063
 eyel-0.063
Token chance
Token ID2863
Feature activation-2.59e+00

Pos logit contributions
keeper+0.033
lf+0.032
 keeper+0.032
 lady+0.032
 detector+0.031
Neg logit contributions
|-0.041
're-0.037
text-0.036
 Ever-0.035
Any-0.035
Token to
Token ID284
Feature activation-4.02e+00

Pos logit contributions
lf+0.051
keeper+0.050
 detector+0.049
 lady+0.048
 keeper+0.048
Neg logit contributions
|-0.063
're-0.060
 You-0.057
Any-0.057
 Ever-0.057
Token get
Token ID651
Feature activation+1.84e+00

Pos logit contributions
keeper+0.104
lf+0.099
 detector+0.096
 lady+0.096
 keeper+0.095
Neg logit contributions
|-0.079
're-0.078
 You-0.064
Any-0.064
 imag-0.062
Token some
Token ID617
Feature activation+8.66e-02

Pos logit contributions
|+0.047
text+0.042
're+0.041
Whe+0.040
 Ever+0.040
Neg logit contributions
keeper-0.038
 lady-0.037
 keeper-0.036
 man-0.035
 vet-0.035
Token help
Token ID1037
Feature activation-2.32e+00

Pos logit contributions
|+0.002
're+0.002
text+0.002
 Ever+0.002
Any+0.002
Neg logit contributions
keeper-0.002
lf-0.002
 lady-0.002
 keeper-0.002
 detector-0.002
Token.
Token ID13
Feature activation+1.40e+00

Pos logit contributions
keeper+0.045
 lady+0.044
 keeper+0.043
lf+0.043
 detector+0.042
Neg logit contributions
|-0.060
're-0.054
text-0.053
Any-0.052
 Ever-0.052
Token So
Token ID1406
Feature activation-2.67e+00

Pos logit contributions
|+0.030
text+0.027
 imag+0.026
avy+0.026
're+0.026
Neg logit contributions
keeper-0.035
 lady-0.033
 keeper-0.033
 vet-0.032
 man-0.032