TOP ACTIVATIONS
MAX = 5.370

, Lilly put on her nic est dress and said goodbye
down and landed beside the 3 year old . He was
whistle . It made the nic est sound , like a
the hive !" exclaimed the 3 - year - old excited
They got dressed in their nic est clothes and headed out
, he put on his brav est clothes and started his
I lost you !" the 3 year old said , with
a dinosaur !" shouted the 3 year old child . The
but he put on his brav est face . In an
, disgusting place for a 3 - year - old to
a loud shout . The 3 year old was scared ,
looked around and saw a 3 year old child .
it was shouting . The 3 year old was so scared
cube flew high above a 3 year old child who was
ball . They put the 3 balls on top of each
around and saw a small 3 year old child standing there
?" asked Bobby , a 3 year old boy .
is it ?" asked the 3 - year old .
. He saw a small 3 year old boy who was
The blue hook and the 3 - year - old were

MIN ACTIVATIONS
MIN = -3.969

is safe and what is not . OK ?" she said
a greater reward than any fl uttering wings or magical taps
parents - they know what is best for us ! <|endoftext|>
is safe and what is not . Your lungs help you
himself and the great bird house he made with Grand ma
hungry and want something really badly . And sorting things can
so happy that the tree house was released . They had
and know when something is too dangerous to do . <|endoftext|>
what to do and what not to do . We have
building such a great tree house . <|endoftext|> Once upon a
parents because they know what is best for you . <|endoftext|>
lesson - when something seems too good to be true ,
stay informed and know what is going on in the world
what is safe and what is not . OK ?" she
together in their old tree house . <|endoftext|> Once upon a
us . We know what is good for you . We
us . We know what is good for you . Now
parents because they know what is best for us . <|endoftext|>
as the best joke - te ller around . <|endoftext|> Once
, because they know what is best for you . <|endoftext|>

INTERVAL 3.035 - 5.370
CONTAINS 0.012%

young girl who owned the nic est jazz in the whole
. One day , a 3 year old child visited him
of smoke !" said the 3 year old . The family
Pat ty was a creative 3 year old girl who loved
. It landed on the other side of the field .

INTERVAL 0.701 - 3.035
CONTAINS 14.667%

so happy and she ate all of it . After dinner
in his rocking chair , but his chair was too heavy
the police officer says . " What are you doing with
. <|endoftext|> L ily and Ben were playing in the park
sounds . " Look , Ben , that bird

INTERVAL -1.634 - 0.701
CONTAINS 84.345%

of a plan and offered the cater pillar a leaf .
to act and perform , tw irling and dancing around the
, Tim my knew it was important to take care of
my and daddy at the park . <|endoftext|> Once upon a
dream . She was happy to hear about it . Tim

INTERVAL -3.969 - -1.634
CONTAINS 0.976%

make with his magical pocket toy . <|endoftext|> Once upon a
then said , " Ready or not , here I come
her mom , " What is this tool ?"
the woods when Molly suddenly stopped . Molly asked Max ,
Tim wished he had one too . So Bob decided to
Token,
Token ID11
Feature activation-9.86e-02

Pos logit contributions
 tant+0.099
ard+0.089
 Are+0.078
ly+0.077
 floors+0.074
Neg logit contributions
 theirs-0.047
vern-0.046
 Betty-0.041
 buddies-0.041
 moms-0.040
Token Lilly
Token ID35134
Feature activation+2.22e-01

Pos logit contributions
 theirs+0.002
 Sam+0.002
vern+0.002
 Betty+0.002
 buddies+0.002
Neg logit contributions
 tant-0.005
ard-0.004
 Are-0.004
ly-0.004
 Those-0.004
Token put
Token ID1234
Feature activation+8.91e-01

Pos logit contributions
 tant+0.009
ard+0.008
 Are+0.007
ly+0.006
 floors+0.006
Neg logit contributions
 theirs-0.007
vern-0.007
 Betty-0.006
 Sam-0.006
 buddies-0.006
Token on
Token ID319
Feature activation-4.17e-01

Pos logit contributions
 tant+0.039
ard+0.034
 Are+0.029
ly+0.028
 floors+0.028
Neg logit contributions
 theirs-0.025
vern-0.025
 Betty-0.023
 buddies-0.023
 Sam-0.023
Token her
Token ID607
Feature activation+5.95e-01

Pos logit contributions
vern+0.011
 theirs+0.011
girls+0.010
 buddies+0.010
 Betty+0.010
Neg logit contributions
 tant-0.019
ard-0.016
 floors-0.014
ly-0.014
 Are-0.014
Token nic
Token ID9200
Feature activation+5.37e+00

Pos logit contributions
 tant+0.026
ard+0.022
 Are+0.019
 floors+0.019
ly+0.018
Neg logit contributions
 theirs-0.017
vern-0.017
 Betty-0.016
 Sam-0.016
 Andy-0.015
Tokenest
Token ID395
Feature activation+3.01e+00

Pos logit contributions
 tant+0.124
ard+0.090
 Are+0.072
 million+0.060
 Those+0.060
Neg logit contributions
 theirs-0.264
vern-0.259
 Sam-0.249
 Betty-0.247
 buddies-0.245
Token dress
Token ID6576
Feature activation+1.03e-01

Pos logit contributions
 tant+0.127
ard+0.109
 Are+0.098
 Those+0.096
 million+0.090
Neg logit contributions
 theirs-0.088
vern-0.085
 buddies-0.082
 moms-0.081
 mom-0.079
Token and
Token ID290
Feature activation-5.85e-01

Pos logit contributions
 tant+0.005
ard+0.004
ly+0.004
est+0.004
 floors+0.003
Neg logit contributions
vern-0.003
 theirs-0.003
 Betty-0.002
 Sam-0.002
girls-0.002
Token said
Token ID531
Feature activation-1.73e-01

Pos logit contributions
 theirs+0.015
vern+0.014
 Sam+0.014
 Betty+0.013
 buddies+0.013
Neg logit contributions
 tant-0.028
ard-0.024
 Are-0.021
 Those-0.021
ly-0.020
Token goodbye
Token ID24829
Feature activation-5.80e-01

Pos logit contributions
 theirs+0.004
vern+0.004
 Betty+0.004
 Sam+0.004
 buddies+0.004
Neg logit contributions
 tant-0.008
ard-0.007
 Are-0.006
ly-0.006
 Those-0.006
Token down
Token ID866
Feature activation-4.70e-01

Pos logit contributions
 tant+0.014
ard+0.012
 Are+0.010
ly+0.010
est+0.009
Neg logit contributions
vern-0.012
 theirs-0.012
 buddies-0.011
 Betty-0.011
 moms-0.011
Token and
Token ID290
Feature activation-4.22e-01

Pos logit contributions
vern+0.013
 theirs+0.013
girls+0.012
 buddies+0.012
 moms+0.012
Neg logit contributions
 tant-0.021
ard-0.018
ly-0.016
 Are-0.015
est-0.015
Token landed
Token ID11406
Feature activation-4.89e-01

Pos logit contributions
 theirs+0.011
 Sam+0.011
 Betty+0.011
vern+0.010
 Andy+0.010
Neg logit contributions
 tant-0.019
ard-0.017
 Are-0.014
 floors-0.014
est-0.014
Token beside
Token ID13970
Feature activation+5.70e-01

Pos logit contributions
 theirs+0.014
vern+0.013
 Betty+0.012
 moms+0.012
 buddies+0.012
Neg logit contributions
 tant-0.021
ard-0.019
 Are-0.016
ly-0.016
 Those-0.015
Token the
Token ID262
Feature activation+8.34e-01

Pos logit contributions
 tant+0.026
ard+0.022
ly+0.020
 Are+0.020
est+0.019
Neg logit contributions
vern-0.016
 theirs-0.014
 buddies-0.013
girls-0.013
 moms-0.013
Token 3
Token ID513
Feature activation+4.91e+00

Pos logit contributions
 tant+0.036
ard+0.030
 Are+0.027
 Those+0.026
ly+0.025
Neg logit contributions
 theirs-0.024
vern-0.022
 moms-0.022
 Sam-0.022
 buddies-0.021
Token year
Token ID614
Feature activation+6.80e-01

Pos logit contributions
 tant+0.174
 Are+0.142
ard+0.136
 Those+0.122
 floors+0.116
Neg logit contributions
 theirs-0.160
vern-0.159
 feat-0.159
 moms-0.158
 buddies-0.158
Token old
Token ID1468
Feature activation+4.71e-02

Pos logit contributions
 tant+0.028
ard+0.024
 Are+0.021
ly+0.020
 floors+0.020
Neg logit contributions
 theirs-0.021
vern-0.020
 Betty-0.019
 Sam-0.019
 buddies-0.019
Token.
Token ID13
Feature activation-6.97e-02

Pos logit contributions
 tant+0.002
ard+0.002
 Are+0.002
ly+0.002
 floors+0.002
Neg logit contributions
vern-0.001
 theirs-0.001
 Betty-0.001
 buddies-0.001
 Sam-0.001
Token He
Token ID679
Feature activation+7.66e-02

Pos logit contributions
 theirs+0.002
 Sam+0.002
 Betty+0.002
 Andy+0.002
vern+0.002
Neg logit contributions
 tant-0.003
ard-0.002
ly-0.002
 floors-0.002
est-0.002
Token was
Token ID373
Feature activation+4.62e-01

Pos logit contributions
 tant+0.004
ard+0.003
 Are+0.003
 Those+0.003
 floors+0.003
Neg logit contributions
 theirs-0.002
 Sam-0.002
vern-0.002
 Betty-0.002
 buddies-0.002
Token whistle
Token ID16121
Feature activation+5.40e-01

Pos logit contributions
 theirs+0.014
vern+0.014
 Betty+0.013
 buddies+0.012
 Sam+0.012
Neg logit contributions
 tant-0.022
ard-0.019
 Are-0.017
ly-0.016
est-0.016
Token.
Token ID13
Feature activation+7.18e-02

Pos logit contributions
 tant+0.029
ard+0.026
ly+0.023
 Are+0.023
est+0.023
Neg logit contributions
vern-0.010
 theirs-0.009
 Betty-0.008
 moms-0.008
 buddies-0.008
Token It
Token ID632
Feature activation-5.85e-01

Pos logit contributions
 tant+0.003
ard+0.002
 Are+0.002
ly+0.002
 floors+0.002
Neg logit contributions
 theirs-0.002
vern-0.002
 Sam-0.002
 Betty-0.002
 buddies-0.002
Token made
Token ID925
Feature activation+1.17e+00

Pos logit contributions
 theirs+0.015
 Sam+0.014
 Betty+0.014
vern+0.014
 buddies+0.014
Neg logit contributions
 tant-0.027
ard-0.023
 Are-0.020
 Those-0.020
ly-0.019
Token the
Token ID262
Feature activation+7.38e-01

Pos logit contributions
 tant+0.061
ard+0.054
 Are+0.048
ly+0.046
 Those+0.046
Neg logit contributions
 theirs-0.023
vern-0.022
 Betty-0.021
 Sam-0.021
 buddies-0.020
Token nic
Token ID9200
Feature activation+4.82e+00

Pos logit contributions
 tant+0.031
ard+0.026
 Are+0.023
ly+0.022
 Those+0.022
Neg logit contributions
 theirs-0.022
vern-0.021
 Sam-0.020
 Betty-0.020
 buddies-0.020
Tokenest
Token ID395
Feature activation+2.56e+00

Pos logit contributions
 tant+0.111
 Are+0.077
ard+0.074
 Those+0.057
 million+0.057
Neg logit contributions
 theirs-0.227
 buddies-0.223
 each-0.218
vern-0.217
 themselves-0.216
Token sound
Token ID2128
Feature activation+2.03e-01

Pos logit contributions
 tant+0.096
ard+0.080
 Are+0.069
 Those+0.064
ly+0.064
Neg logit contributions
 theirs-0.088
vern-0.085
 Betty-0.082
 Sam-0.082
 buddies-0.082
Token,
Token ID11
Feature activation-6.78e-01

Pos logit contributions
 tant+0.010
ard+0.009
 Are+0.008
 Those+0.008
 floors+0.008
Neg logit contributions
 theirs-0.004
vern-0.004
 Sam-0.004
 Betty-0.004
 buddies-0.004
Token like
Token ID588
Feature activation-8.31e-02

Pos logit contributions
 theirs+0.018
 Sam+0.017
 Betty+0.016
 Andy+0.015
 buddies+0.015
Neg logit contributions
 tant-0.032
ard-0.028
 Are-0.024
 Those-0.024
ly-0.023
Token a
Token ID257
Feature activation+3.78e-01

Pos logit contributions
vern+0.002
 theirs+0.002
girls+0.002
 moms+0.002
 Betty+0.002
Neg logit contributions
 tant-0.004
ard-0.003
 Are-0.003
 floors-0.003
ly-0.003
Token the
Token ID262
Feature activation-4.17e-01

Pos logit contributions
 tant+0.023
ard+0.020
 Are+0.018
 Those+0.017
ly+0.017
Neg logit contributions
 theirs-0.012
 Betty-0.011
 Sam-0.011
vern-0.011
 Andy-0.011
Token hive
Token ID35881
Feature activation-6.33e-02

Pos logit contributions
 Betty+0.009
vern+0.009
 theirs+0.009
 buddies+0.009
 mom+0.009
Neg logit contributions
 tant-0.020
ard-0.018
 Are-0.016
 Those-0.015
ly-0.015
Token!"
Token ID2474
Feature activation-1.01e+00

Pos logit contributions
 theirs+0.001
vern+0.001
 Betty+0.001
 Sam+0.001
 buddies+0.001
Neg logit contributions
 tant-0.003
ard-0.003
 Are-0.003
 Those-0.002
ly-0.002
Token exclaimed
Token ID33552
Feature activation-1.11e+00

Pos logit contributions
 theirs+0.019
 Sam+0.019
 Betty+0.019
vern+0.018
 Andy+0.018
Neg logit contributions
 tant-0.052
ard-0.046
 Are-0.041
 floors-0.040
est-0.040
Token the
Token ID262
Feature activation+2.71e-01

Pos logit contributions
vern+0.030
 theirs+0.027
 buddies+0.025
 moms+0.025
girls+0.024
Neg logit contributions
 tant-0.052
ard-0.045
 Are-0.040
ly-0.040
 Those-0.038
Token 3
Token ID513
Feature activation+4.71e+00

Pos logit contributions
 tant+0.013
ard+0.012
 Are+0.010
ly+0.010
 Those+0.010
Neg logit contributions
 theirs-0.006
vern-0.006
 Betty-0.006
 Sam-0.006
 buddies-0.006
Token-
Token ID12
Feature activation-1.31e+00

Pos logit contributions
 tant+0.180
ard+0.150
 Are+0.146
 Those+0.131
 fart+0.130
Neg logit contributions
 Betty-0.143
 mom-0.142
 buddies-0.140
 Andy-0.139
 feat-0.138
Tokenyear
Token ID1941
Feature activation+2.69e+00

Pos logit contributions
 theirs+0.047
vern+0.045
girls+0.043
 buddies+0.043
 Sam+0.043
Neg logit contributions
 tant-0.048
ard-0.040
ly-0.033
 Are-0.033
 Those-0.031
Token-
Token ID12
Feature activation-1.20e+00

Pos logit contributions
 tant+0.110
ard+0.093
ly+0.078
 Are+0.077
 floors+0.075
Neg logit contributions
 theirs-0.085
vern-0.082
 Sam-0.078
 Betty-0.078
 buddies-0.077
Tokenold
Token ID727
Feature activation-3.64e-01

Pos logit contributions
 theirs+0.039
girls+0.036
vern+0.036
 buddies+0.035
 Sam+0.035
Neg logit contributions
 tant-0.047
ard-0.041
ly-0.035
 Are-0.033
 Those-0.032
Token excited
Token ID6568
Feature activation+7.06e-01

Pos logit contributions
 theirs+0.010
vern+0.009
 Betty+0.009
 Sam+0.009
 buddies+0.009
Neg logit contributions
 tant-0.017
ard-0.014
 Are-0.013
ly-0.012
 floors-0.012
Token They
Token ID1119
Feature activation-1.48e+00

Pos logit contributions
 Sam+0.009
 theirs+0.009
 Betty+0.008
 mom+0.008
 moms+0.008
Neg logit contributions
 tant-0.021
ard-0.019
 ounces-0.016
est-0.016
ly-0.016
Token got
Token ID1392
Feature activation-2.16e-01

Pos logit contributions
vern+0.056
 theirs+0.048
girls+0.048
 Betty+0.047
 buddies+0.046
Neg logit contributions
 tant-0.053
ard-0.045
 Are-0.039
ly-0.039
 million-0.038
Token dressed
Token ID12049
Feature activation+1.44e+00

Pos logit contributions
 theirs+0.006
 buddies+0.005
vern+0.005
 Sam+0.005
 moms+0.005
Neg logit contributions
 tant-0.010
ard-0.009
 Are-0.008
est-0.008
 Those-0.007
Token in
Token ID287
Feature activation-1.09e-01

Pos logit contributions
 tant+0.069
ard+0.060
 Are+0.054
 Those+0.051
 million+0.050
Neg logit contributions
 theirs-0.036
vern-0.033
 buddies-0.032
 Sam-0.031
 Betty-0.031
Token their
Token ID511
Feature activation-2.74e-01

Pos logit contributions
 theirs+0.002
vern+0.002
 Betty+0.002
 buddies+0.002
 Sam+0.002
Neg logit contributions
 tant-0.006
ard-0.005
 Are-0.005
ly-0.005
 floors-0.005
Token nic
Token ID9200
Feature activation+4.68e+00

Pos logit contributions
 mom+0.006
 theirs+0.006
 moms+0.006
 buddies+0.005
 dad+0.005
Neg logit contributions
 tant-0.014
ard-0.012
 Are-0.011
 Those-0.011
 ounces-0.010
Tokenest
Token ID395
Feature activation+2.86e+00

Pos logit contributions
 tant+0.128
ard+0.096
 Are+0.094
 million+0.078
 Those+0.076
Neg logit contributions
 theirs-0.207
vern-0.197
 buddies-0.194
 moms-0.193
 Sam-0.192
Token clothes
Token ID8242
Feature activation+6.00e-01

Pos logit contributions
 tant+0.123
ard+0.107
 Are+0.102
 Those+0.098
rified+0.097
Neg logit contributions
 mom-0.077
 theirs-0.074
 moms-0.073
 buddies-0.073
 friend-0.073
Token and
Token ID290
Feature activation-3.95e-01

Pos logit contributions
 tant+0.028
ard+0.025
 Are+0.021
ly+0.021
 floors+0.021
Neg logit contributions
 theirs-0.014
vern-0.014
 Betty-0.014
 Sam-0.013
 buddies-0.013
Token headed
Token ID9153
Feature activation+9.19e-01

Pos logit contributions
 theirs+0.009
 Sam+0.008
 mom+0.008
 they+0.008
 moms+0.008
Neg logit contributions
 tant-0.021
ard-0.018
 Are-0.016
 Those-0.016
icer-0.015
Token out
Token ID503
Feature activation-4.48e-01

Pos logit contributions
 tant+0.040
ard+0.035
ly+0.031
 floors+0.030
 Are+0.029
Neg logit contributions
vern-0.026
 Betty-0.024
girls-0.023
 theirs-0.023
 moms-0.023
Token,
Token ID11
Feature activation-2.41e-01

Pos logit contributions
 tant+0.038
ard+0.033
 Are+0.028
 floors+0.028
 Those+0.028
Neg logit contributions
 Sam-0.014
 theirs-0.013
 Betty-0.013
 Andy-0.012
 mom-0.012
Token he
Token ID339
Feature activation+1.20e-01

Pos logit contributions
 Sam+0.005
 theirs+0.005
 Betty+0.005
 they+0.005
 Andy+0.005
Neg logit contributions
 tant-0.013
ard-0.011
ly-0.010
 floors-0.010
 Are-0.010
Token put
Token ID1234
Feature activation+9.68e-01

Pos logit contributions
 tant+0.005
ard+0.004
 Are+0.004
ly+0.004
 floors+0.003
Neg logit contributions
 theirs-0.004
vern-0.004
 Betty-0.003
 buddies-0.003
 Sam-0.003
Token on
Token ID319
Feature activation-3.18e-01

Pos logit contributions
 tant+0.046
ard+0.041
 Are+0.037
 Those+0.035
ly+0.035
Neg logit contributions
 theirs-0.024
 Sam-0.022
 Betty-0.021
vern-0.021
 buddies-0.020
Token his
Token ID465
Feature activation+7.70e-01

Pos logit contributions
 theirs+0.008
 Sam+0.007
vern+0.007
 Betty+0.007
 Andy+0.007
Neg logit contributions
 tant-0.015
ard-0.014
 Are-0.012
ly-0.012
 Those-0.011
Token brav
Token ID49025
Feature activation+4.67e+00

Pos logit contributions
 tant+0.034
ard+0.029
 Are+0.026
 Those+0.024
ly+0.024
Neg logit contributions
 theirs-0.022
vern-0.021
 Betty-0.020
 Sam-0.020
 buddies-0.020
Tokenest
Token ID395
Feature activation+3.08e+00

Pos logit contributions
 tant+0.123
ard+0.092
 Are+0.079
 million+0.075
 Those+0.074
Neg logit contributions
 theirs-0.212
vern-0.204
 Sam-0.202
 Betty-0.201
 feat-0.198
Token clothes
Token ID8242
Feature activation+8.95e-01

Pos logit contributions
 tant+0.138
ard+0.121
 Those+0.114
 Are+0.114
 million+0.105
Neg logit contributions
 theirs-0.076
vern-0.073
 mom-0.071
 buddies-0.070
 feat-0.069
Token and
Token ID290
Feature activation-5.75e-01

Pos logit contributions
 tant+0.041
ard+0.036
 Are+0.032
ly+0.031
 floors+0.030
Neg logit contributions
 theirs-0.023
vern-0.022
 Betty-0.021
 Sam-0.021
 buddies-0.020
Token started
Token ID2067
Feature activation-1.20e+00

Pos logit contributions
 theirs+0.014
 Sam+0.013
 Betty+0.013
vern+0.013
 buddies+0.012
Neg logit contributions
 tant-0.028
ard-0.024
 Are-0.021
 Those-0.021
ly-0.020
Token his
Token ID465
Feature activation+1.78e-01

Pos logit contributions
 theirs+0.022
vern+0.021
 Betty+0.019
 Sam+0.019
 buddies+0.019
Neg logit contributions
 tant-0.065
ard-0.057
 Are-0.051
ly-0.050
 Those-0.049
Token I
Token ID314
Feature activation+2.43e-01

Pos logit contributions
 theirs+0.017
vern+0.016
 Betty+0.015
 Sam+0.015
 buddies+0.015
Neg logit contributions
 tant-0.037
ard-0.033
 Are-0.029
ly-0.028
 floors-0.028
Token lost
Token ID2626
Feature activation-7.55e-01

Pos logit contributions
 tant+0.010
ard+0.009
 Are+0.008
ly+0.007
est+0.007
Neg logit contributions
 theirs-0.007
vern-0.007
 buddies-0.007
 Betty-0.007
 moms-0.007
Token you
Token ID345
Feature activation+1.94e-01

Pos logit contributions
 theirs+0.023
vern+0.022
 Sam+0.021
 Betty+0.021
 Andy+0.020
Neg logit contributions
 tant-0.032
ard-0.028
 Are-0.024
 Those-0.023
ly-0.022
Token!"
Token ID2474
Feature activation-4.88e-01

Pos logit contributions
 tant+0.010
ard+0.009
 Are+0.008
ly+0.007
 floors+0.007
Neg logit contributions
 theirs-0.004
vern-0.004
 Betty-0.004
 Sam-0.004
 buddies-0.004
Token the
Token ID262
Feature activation+2.39e-01

Pos logit contributions
 Sam+0.009
 theirs+0.009
 Betty+0.009
 Andy+0.008
vern+0.008
Neg logit contributions
 tant-0.026
ard-0.023
 Are-0.020
 floors-0.020
est-0.020
Token 3
Token ID513
Feature activation+4.66e+00

Pos logit contributions
 tant+0.010
ard+0.009
 Are+0.007
ly+0.007
est+0.007
Neg logit contributions
 theirs-0.008
vern-0.008
 buddies-0.007
girls-0.007
 moms-0.006
Token year
Token ID614
Feature activation+7.44e-01

Pos logit contributions
 tant+0.186
ard+0.153
 Are+0.146
 Those+0.133
 floors+0.128
Neg logit contributions
 Betty-0.138
 buddies-0.136
 Andy-0.135
 theirs-0.135
 Sam-0.135
Token old
Token ID1468
Feature activation-1.72e-01

Pos logit contributions
 tant+0.026
ard+0.024
ly+0.022
 floors+0.018
 Are+0.018
Neg logit contributions
 theirs-0.027
vern-0.027
girls-0.025
 buddies-0.024
 Betty-0.024
Token said
Token ID531
Feature activation-8.12e-01

Pos logit contributions
vern+0.006
 theirs+0.006
 buddies+0.005
girls+0.005
ores+0.005
Neg logit contributions
 tant-0.006
ard-0.006
ly-0.005
 Are-0.005
est-0.004
Token,
Token ID11
Feature activation-4.91e-01

Pos logit contributions
vern+0.019
 theirs+0.018
 moms+0.018
girls+0.017
 buddies+0.017
Neg logit contributions
 tant-0.039
ard-0.035
ly-0.032
 Are-0.030
est-0.030
Token with
Token ID351
Feature activation-8.90e-01

Pos logit contributions
 theirs+0.013
vern+0.013
 Betty+0.012
 Sam+0.012
 buddies+0.012
Neg logit contributions
 tant-0.022
ard-0.019
 Are-0.017
ly-0.016
 floors-0.016
Token a
Token ID257
Feature activation-5.73e-01

Pos logit contributions
vern+0.009
 theirs+0.009
 buddies+0.008
girls+0.008
 Betty+0.008
Neg logit contributions
 tant-0.017
ard-0.015
 Are-0.014
 Those-0.013
ly-0.013
Token dinosaur
Token ID28257
Feature activation-5.55e-01

Pos logit contributions
 theirs+0.018
 Sam+0.018
vern+0.018
 moms+0.017
 buddies+0.017
Neg logit contributions
 tant-0.023
ard-0.019
 Are-0.016
 floors-0.016
ly-0.016
Token!"
Token ID2474
Feature activation-8.40e-01

Pos logit contributions
 theirs+0.015
vern+0.015
 Betty+0.013
 moms+0.013
girls+0.013
Neg logit contributions
 tant-0.024
ard-0.023
 Are-0.020
ly-0.019
 million-0.019
Token shouted
Token ID17293
Feature activation-1.15e+00

Pos logit contributions
 theirs+0.016
 Sam+0.015
 Betty+0.015
vern+0.014
 Andy+0.014
Neg logit contributions
 tant-0.045
ard-0.039
 Are-0.035
ly-0.034
 floors-0.034
Token the
Token ID262
Feature activation+2.16e-01

Pos logit contributions
vern+0.029
 theirs+0.024
 buddies+0.023
girls+0.022
 moms+0.021
Neg logit contributions
 tant-0.057
ard-0.049
 Are-0.045
ly-0.044
 Those-0.042
Token 3
Token ID513
Feature activation+4.63e+00

Pos logit contributions
 tant+0.009
ard+0.007
 Are+0.006
ly+0.006
 floors+0.006
Neg logit contributions
 theirs-0.007
 moms-0.007
 Sam-0.007
 mom-0.007
vern-0.007
Token year
Token ID614
Feature activation+9.07e-01

Pos logit contributions
 tant+0.173
ard+0.142
 Are+0.142
 Those+0.125
 fart+0.124
Neg logit contributions
 mom-0.143
 moms-0.143
 theirs-0.142
 buddies-0.142
 Andy-0.139
Token old
Token ID1468
Feature activation-3.36e-01

Pos logit contributions
 tant+0.031
ard+0.028
ly+0.025
 floors+0.022
 Are+0.020
Neg logit contributions
 theirs-0.033
vern-0.033
girls-0.032
 Betty-0.031
 buddies-0.031
Token child
Token ID1200
Feature activation+1.11e+00

Pos logit contributions
vern+0.013
 theirs+0.012
 accomplishment+0.011
 possible+0.011
ores+0.011
Neg logit contributions
ard-0.011
 tant-0.011
ly-0.010
 Are-0.009
est-0.008
Token.
Token ID13
Feature activation-6.66e-01

Pos logit contributions
 tant+0.054
ard+0.051
ly+0.045
 Are+0.044
est+0.042
Neg logit contributions
 theirs-0.025
vern-0.025
 buddies-0.021
girls-0.021
 moms-0.020
Token The
Token ID383
Feature activation-5.69e-01

Pos logit contributions
 theirs+0.019
vern+0.018
 buddies+0.017
 Betty+0.017
 moms+0.016
Neg logit contributions
 tant-0.029
ard-0.025
 Are-0.023
 Those-0.021
ly-0.021
Token but
Token ID475
Feature activation+1.45e+00

Pos logit contributions
 theirs+0.023
 Sam+0.023
 Betty+0.022
 Andy+0.021
vern+0.021
Neg logit contributions
 tant-0.058
ard-0.049
 Are-0.044
 floors-0.043
 Those-0.043
Token he
Token ID339
Feature activation+1.14e+00

Pos logit contributions
 tant+0.064
ard+0.055
 Are+0.049
ly+0.047
 floors+0.046
Neg logit contributions
 theirs-0.040
vern-0.040
 Betty-0.037
 buddies-0.036
 Sam-0.036
Token put
Token ID1234
Feature activation+1.16e+00

Pos logit contributions
 tant+0.046
ard+0.040
 Are+0.035
ly+0.034
est+0.032
Neg logit contributions
 theirs-0.036
vern-0.035
 buddies-0.032
 Betty-0.032
 moms-0.031
Token on
Token ID319
Feature activation-2.57e-02

Pos logit contributions
 tant+0.052
ard+0.045
 Are+0.039
ly+0.038
 floors+0.037
Neg logit contributions
 theirs-0.032
vern-0.031
 Betty-0.029
 buddies-0.029
 Sam-0.029
Token his
Token ID465
Feature activation+9.65e-01

Pos logit contributions
 theirs+0.001
vern+0.001
 Sam+0.001
 Betty+0.001
 buddies+0.000
Neg logit contributions
 tant-0.001
ard-0.001
 Are-0.001
ly-0.001
 Those-0.001
Token brav
Token ID49025
Feature activation+4.62e+00

Pos logit contributions
 tant+0.041
ard+0.035
 Are+0.030
ly+0.029
 floors+0.028
Neg logit contributions
 theirs-0.029
vern-0.028
 Betty-0.027
 Sam-0.026
 buddies-0.026
Tokenest
Token ID395
Feature activation+3.22e+00

Pos logit contributions
 tant+0.129
ard+0.098
 Are+0.083
 million+0.078
 Those+0.076
Neg logit contributions
 theirs-0.203
 Sam-0.196
vern-0.196
 Betty-0.195
 buddies-0.190
Token face
Token ID1986
Feature activation-1.94e-01

Pos logit contributions
 tant+0.136
ard+0.119
 Are+0.108
 Those+0.107
 million+0.100
Neg logit contributions
 theirs-0.088
vern-0.087
 buddies-0.085
 Sam-0.084
 mom-0.082
Token.
Token ID13
Feature activation-1.65e-03

Pos logit contributions
 theirs+0.003
vern+0.003
 Sam+0.003
 Betty+0.003
 buddies+0.003
Neg logit contributions
 tant-0.011
ard-0.009
 Are-0.008
 Those-0.008
 floors-0.008
Token In
Token ID554
Feature activation-2.48e-01

Pos logit contributions
 Sam+0.000
 Betty+0.000
 Andy+0.000
 theirs+0.000
 Bill+0.000
Neg logit contributions
 tant-0.000
ard-0.000
ly-0.000
 floors-0.000
est-0.000
Token an
Token ID281
Feature activation-6.43e-01

Pos logit contributions
vern+0.006
 theirs+0.006
 buddies+0.005
 Betty+0.005
girls+0.005
Neg logit contributions
 tant-0.012
ard-0.011
 Are-0.010
 Those-0.009
 million-0.009
Token,
Token ID11
Feature activation-2.50e-01

Pos logit contributions
 theirs+0.017
vern+0.016
 Betty+0.015
 Sam+0.015
 buddies+0.015
Neg logit contributions
 tant-0.022
ard-0.019
 Are-0.016
ly-0.016
 floors-0.015
Token disgusting
Token ID23374
Feature activation-5.78e-01

Pos logit contributions
 theirs+0.008
 Sam+0.008
 Betty+0.007
 Andy+0.007
vern+0.007
Neg logit contributions
 tant-0.010
ard-0.009
 Those-0.008
 Are-0.008
ly-0.007
Token place
Token ID1295
Feature activation+2.76e-02

Pos logit contributions
 theirs+0.016
 Sam+0.015
 Betty+0.015
vern+0.015
 Andy+0.014
Neg logit contributions
 tant-0.026
ard-0.022
 Are-0.019
 Those-0.019
 floors-0.018
Token for
Token ID329
Feature activation+1.96e-01

Pos logit contributions
 tant+0.001
ard+0.001
ly+0.001
 Are+0.001
 floors+0.001
Neg logit contributions
vern-0.001
 theirs-0.001
 moms-0.001
 buddies-0.000
girls-0.000
Token a
Token ID257
Feature activation+6.19e-01

Pos logit contributions
 tant+0.009
ard+0.008
 Are+0.007
 floors+0.007
ly+0.006
Neg logit contributions
vern-0.005
 theirs-0.005
 buddies-0.004
girls-0.004
 Betty-0.004
Token 3
Token ID513
Feature activation+4.62e+00

Pos logit contributions
 tant+0.025
ard+0.021
 Are+0.018
ly+0.017
 floors+0.017
Neg logit contributions
 theirs-0.020
vern-0.019
 Betty-0.018
 Sam-0.018
 buddies-0.018
Token-
Token ID12
Feature activation-1.18e+00

Pos logit contributions
 tant+0.158
ard+0.135
 Are+0.127
 Those+0.117
 million+0.108
Neg logit contributions
vern-0.154
 Betty-0.152
 theirs-0.151
 Sam-0.150
 buddies-0.149
Tokenyear
Token ID1941
Feature activation+2.37e+00

Pos logit contributions
 theirs+0.037
vern+0.036
 Betty+0.034
 Sam+0.034
 buddies+0.033
Neg logit contributions
 tant-0.049
ard-0.041
 Are-0.036
ly-0.034
 floors-0.034
Token-
Token ID12
Feature activation-1.18e+00

Pos logit contributions
 tant+0.099
ard+0.085
 Are+0.074
 Those+0.069
ly+0.069
Neg logit contributions
 theirs-0.071
vern-0.070
 Betty-0.066
 Sam-0.065
 buddies-0.065
Tokenold
Token ID727
Feature activation-8.71e-01

Pos logit contributions
 theirs+0.033
vern+0.032
 Betty+0.030
 Sam+0.030
 buddies+0.030
Neg logit contributions
 tant-0.052
ard-0.045
 Are-0.039
ly-0.038
 floors-0.037
Token to
Token ID284
Feature activation-7.19e-02

Pos logit contributions
 theirs+0.022
vern+0.021
 Betty+0.020
 Sam+0.020
 buddies+0.020
Neg logit contributions
 tant-0.041
ard-0.035
 Are-0.031
ly-0.030
 floors-0.029
Token a
Token ID257
Feature activation-2.39e-01

Pos logit contributions
 theirs+0.010
 Sam+0.009
 Betty+0.009
 Andy+0.008
vern+0.008
Neg logit contributions
 tant-0.034
ard-0.030
 Are-0.027
 Those-0.026
icer-0.025
Token loud
Token ID7812
Feature activation-5.93e-01

Pos logit contributions
vern+0.006
 mom+0.006
 Betty+0.006
 Sam+0.006
 Andy+0.006
Neg logit contributions
 tant-0.010
ard-0.009
est-0.008
vious-0.008
 Those-0.008
Token shout
Token ID9191
Feature activation-6.91e-01

Pos logit contributions
 Andy+0.022
 Sam+0.022
 Betty+0.022
 Buddy+0.021
 Dog+0.021
Neg logit contributions
 tant-0.021
ard-0.016
 floors-0.013
 ounces-0.013
 Those-0.013
Token.
Token ID13
Feature activation-4.65e-01

Pos logit contributions
vern+0.014
 theirs+0.014
 buddies+0.012
 Betty+0.012
 moms+0.012
Neg logit contributions
 tant-0.035
ard-0.032
 Are-0.028
ly-0.028
est-0.027
Token The
Token ID383
Feature activation-2.57e-01

Pos logit contributions
 Sam+0.011
 theirs+0.011
 Betty+0.010
 Andy+0.010
 Buddy+0.009
Neg logit contributions
 tant-0.023
ard-0.020
ly-0.017
est-0.017
 floors-0.017
Token 3
Token ID513
Feature activation+4.57e+00

Pos logit contributions
 mom+0.008
 Sam+0.008
 moms+0.008
 buddies+0.008
 Andy+0.008
Neg logit contributions
 tant-0.010
ard-0.008
 Those-0.007
 Are-0.007
 million-0.007
Token year
Token ID614
Feature activation+7.61e-01

Pos logit contributions
 tant+0.188
 Are+0.154
ard+0.151
 Those+0.142
 fart+0.142
Neg logit contributions
 Andy-0.127
 mom-0.126
ia-0.126
 Sam-0.125
 Betty-0.124
Token old
Token ID1468
Feature activation-2.84e-01

Pos logit contributions
 tant+0.031
ard+0.027
 Are+0.023
ly+0.023
 floors+0.022
Neg logit contributions
 theirs-0.023
vern-0.023
 Betty-0.021
 buddies-0.021
 Sam-0.021
Token was
Token ID373
Feature activation+2.40e-01

Pos logit contributions
 theirs+0.011
vern+0.011
 buddies+0.010
 Betty+0.010
girls+0.010
Neg logit contributions
 tant-0.009
ard-0.008
 Are-0.006
ly-0.006
 floors-0.006
Token scared
Token ID12008
Feature activation-9.98e-01

Pos logit contributions
 tant+0.010
ard+0.009
 Are+0.007
ly+0.007
 Those+0.007
Neg logit contributions
 theirs-0.007
vern-0.007
 Sam-0.007
 Betty-0.007
 buddies-0.007
Token,
Token ID11
Feature activation-1.34e-01

Pos logit contributions
 theirs+0.020
 Sam+0.020
 Betty+0.019
 Andy+0.018
vern+0.018
Neg logit contributions
 tant-0.052
ard-0.045
 Are-0.041
 floors-0.039
 Those-0.039
Token looked
Token ID3114
Feature activation+9.70e-01

Pos logit contributions
 theirs+0.040
vern+0.039
 buddies+0.036
girls+0.035
 moms+0.035
Neg logit contributions
 tant-0.055
ard-0.048
 Are-0.042
ly-0.040
 million-0.038
Token around
Token ID1088
Feature activation-1.47e-01

Pos logit contributions
 tant+0.041
ard+0.036
 Are+0.031
 Those+0.029
 floors+0.029
Neg logit contributions
 theirs-0.029
vern-0.027
 Betty-0.027
 Sam-0.027
 buddies-0.026
Token and
Token ID290
Feature activation-3.61e-01

Pos logit contributions
vern+0.004
 theirs+0.004
 moms+0.004
girls+0.004
 buddies+0.004
Neg logit contributions
 tant-0.006
ard-0.005
 Are-0.005
ly-0.005
est-0.004
Token saw
Token ID2497
Feature activation-7.36e-01

Pos logit contributions
 theirs+0.011
vern+0.011
 Betty+0.010
 Sam+0.010
 buddies+0.010
Neg logit contributions
 tant-0.015
ard-0.013
 Are-0.011
ly-0.011
 floors-0.011
Token a
Token ID257
Feature activation-3.43e-01

Pos logit contributions
vern+0.016
 theirs+0.015
 buddies+0.014
girls+0.013
 moms+0.013
Neg logit contributions
 tant-0.037
ard-0.032
 Are-0.029
ly-0.028
 Those-0.028
Token 3
Token ID513
Feature activation+4.57e+00

Pos logit contributions
vern+0.010
 theirs+0.009
 Betty+0.009
 Sam+0.009
 mom+0.009
Neg logit contributions
 tant-0.015
ard-0.012
 Those-0.011
ly-0.011
 Are-0.010
Token year
Token ID614
Feature activation+1.29e-01

Pos logit contributions
 tant+0.176
ard+0.140
 Are+0.136
 Those+0.132
 fart+0.126
Neg logit contributions
 Betty-0.140
 feat-0.138
vern-0.135
 mom-0.134
 Andy-0.134
Token old
Token ID1468
Feature activation-4.59e-01

Pos logit contributions
 tant+0.005
ard+0.005
 Are+0.004
ly+0.004
 floors+0.004
Neg logit contributions
 theirs-0.004
vern-0.004
 Betty-0.004
 buddies-0.004
 Sam-0.004
Token child
Token ID1200
Feature activation+7.90e-01

Pos logit contributions
 theirs+0.016
vern+0.016
 buddies+0.015
girls+0.015
 Betty+0.015
Neg logit contributions
 tant-0.017
ard-0.015
 Are-0.012
ly-0.012
 floors-0.011
Token.
Token ID13
Feature activation-3.11e-01

Pos logit contributions
 tant+0.036
ard+0.033
 Are+0.029
ly+0.028
 Those+0.027
Neg logit contributions
 theirs-0.020
vern-0.020
 moms-0.018
 buddies-0.018
 Betty-0.017
Token
Token ID220
Feature activation-9.26e-02

Pos logit contributions
 theirs+0.009
 Sam+0.009
 Betty+0.009
vern+0.009
 Andy+0.009
Neg logit contributions
 tant-0.013
ard-0.011
 Are-0.009
 floors-0.009
ly-0.009
Token it
Token ID340
Feature activation+3.49e-01

Pos logit contributions
 theirs+0.003
 Sam+0.003
 Betty+0.003
vern+0.003
 Andy+0.003
Neg logit contributions
 tant-0.008
ard-0.007
 Are-0.006
ly-0.006
 Those-0.006
Token was
Token ID373
Feature activation+5.35e-01

Pos logit contributions
 tant+0.016
ard+0.013
 Are+0.012
 Those+0.011
 floors+0.011
Neg logit contributions
 Sam-0.009
 theirs-0.009
 Betty-0.009
vern-0.009
 Andy-0.009
Token shouting
Token ID19642
Feature activation+6.15e-01

Pos logit contributions
 tant+0.022
ard+0.019
 Are+0.017
 floors+0.016
 million+0.016
Neg logit contributions
vern-0.016
 theirs-0.015
 moms-0.014
 buddies-0.014
 Betty-0.014
Token.
Token ID13
Feature activation-1.92e-01

Pos logit contributions
 tant+0.030
ard+0.027
 Are+0.024
 floors+0.023
 Those+0.023
Neg logit contributions
 theirs-0.014
vern-0.013
 Sam-0.013
 Betty-0.013
 buddies-0.012
Token The
Token ID383
Feature activation-1.05e-01

Pos logit contributions
 theirs+0.006
vern+0.006
 buddies+0.006
 Betty+0.006
 moms+0.005
Neg logit contributions
 tant-0.008
ard-0.007
 Are-0.006
 Those-0.005
ly-0.005
Token 3
Token ID513
Feature activation+4.57e+00

Pos logit contributions
 mom+0.003
 Sam+0.003
 friend+0.003
 moms+0.003
 dad+0.003
Neg logit contributions
 tant-0.004
ard-0.004
 Those-0.003
 Are-0.003
arre-0.003
Token year
Token ID614
Feature activation+9.03e-01

Pos logit contributions
 tant+0.184
 Are+0.160
ard+0.152
 Those+0.147
 fart+0.144
Neg logit contributions
ia-0.129
 mom-0.126
 friend-0.123
 Andy-0.122
 buddies-0.121
Token old
Token ID1468
Feature activation+2.16e-02

Pos logit contributions
 tant+0.037
ard+0.032
 Are+0.027
ly+0.027
 floors+0.026
Neg logit contributions
 theirs-0.028
vern-0.028
 Betty-0.026
 buddies-0.025
 Sam-0.025
Token was
Token ID373
Feature activation+3.88e-01

Pos logit contributions
 tant+0.001
ard+0.001
 Are+0.001
ly+0.001
 Those+0.001
Neg logit contributions
 theirs-0.001
vern-0.001
 Betty-0.001
 Sam-0.001
 buddies-0.001
Token so
Token ID523
Feature activation+9.58e-01

Pos logit contributions
 tant+0.014
ard+0.012
 Are+0.010
ly+0.010
 floors+0.010
Neg logit contributions
 theirs-0.013
vern-0.013
 Betty-0.012
 buddies-0.012
 moms-0.012
Token scared
Token ID12008
Feature activation-1.40e+00

Pos logit contributions
 tant+0.041
ard+0.036
ly+0.030
 Are+0.029
 floors+0.029
Neg logit contributions
 theirs-0.028
vern-0.028
 Sam-0.027
 Betty-0.027
 Andy-0.026
Token cube
Token ID23441
Feature activation+8.54e-01

Pos logit contributions
 tant+0.000
ard+0.000
 Are+0.000
ly+0.000
 floors+0.000
Neg logit contributions
 theirs-0.000
vern-0.000
 Betty-0.000
 Sam-0.000
 buddies-0.000
Token flew
Token ID13112
Feature activation-2.12e-01

Pos logit contributions
 tant+0.037
ard+0.031
 Are+0.028
 Those+0.027
ly+0.026
Neg logit contributions
 theirs-0.023
 Sam-0.023
 Betty-0.023
vern-0.022
 buddies-0.022
Token high
Token ID1029
Feature activation-1.16e+00

Pos logit contributions
vern+0.007
 theirs+0.007
 moms+0.007
 Betty+0.007
 buddies+0.006
Neg logit contributions
 tant-0.008
ard-0.007
ly-0.006
 Are-0.006
est-0.006
Token above
Token ID2029
Feature activation-2.77e-01

Pos logit contributions
 theirs+0.029
vern+0.028
 Betty+0.027
 buddies+0.027
 Sam+0.026
Neg logit contributions
 tant-0.054
ard-0.046
 Are-0.042
 Those-0.040
 floors-0.038
Token a
Token ID257
Feature activation+1.38e-01

Pos logit contributions
vern+0.007
 theirs+0.006
 moms+0.006
 buddies+0.006
girls+0.006
Neg logit contributions
 tant-0.013
ard-0.012
ly-0.011
est-0.010
 Are-0.010
Token 3
Token ID513
Feature activation+4.56e+00

Pos logit contributions
 tant+0.006
ard+0.005
 Are+0.005
 Those+0.005
ly+0.005
Neg logit contributions
vern-0.003
 Sam-0.003
 theirs-0.003
 mom-0.003
 Betty-0.003
Token year
Token ID614
Feature activation+4.54e-01

Pos logit contributions
 tant+0.165
 Are+0.133
ard+0.127
 Those+0.119
arre+0.112
Neg logit contributions
 feat-0.146
 Betty-0.144
 Andy-0.142
 Sam-0.141
vern-0.139
Token old
Token ID1468
Feature activation-8.00e-01

Pos logit contributions
 tant+0.020
ard+0.017
 Are+0.015
ly+0.014
 Those+0.014
Neg logit contributions
 theirs-0.013
vern-0.013
 Betty-0.012
 Sam-0.012
 buddies-0.012
Token child
Token ID1200
Feature activation+5.33e-01

Pos logit contributions
vern+0.026
 theirs+0.026
 buddies+0.023
 Betty+0.023
 moms+0.023
Neg logit contributions
 tant-0.031
ard-0.028
 Are-0.024
ly-0.023
 floors-0.022
Token who
Token ID508
Feature activation-1.16e+00

Pos logit contributions
 tant+0.022
ard+0.021
ly+0.018
 Are+0.018
est+0.017
Neg logit contributions
 theirs-0.016
vern-0.015
 moms-0.014
 Betty-0.014
 buddies-0.014
Token was
Token ID373
Feature activation-7.25e-01

Pos logit contributions
 theirs+0.033
vern+0.033
 Betty+0.031
 Sam+0.031
 buddies+0.030
Neg logit contributions
 tant-0.050
ard-0.043
 Are-0.037
ly-0.036
 floors-0.035
Token ball
Token ID2613
Feature activation-1.42e+00

Pos logit contributions
vern+0.012
 Sam+0.012
 theirs+0.012
 mom+0.012
 buddies+0.011
Neg logit contributions
 tant-0.022
ard-0.019
 Are-0.017
ly-0.017
 Those-0.017
Token.
Token ID13
Feature activation-2.33e-01

Pos logit contributions
vern+0.030
 theirs+0.028
 Betty+0.025
 moms+0.025
 buddies+0.024
Neg logit contributions
 tant-0.072
ard-0.068
ly-0.059
est-0.056
 Are-0.056
Token They
Token ID1119
Feature activation-1.02e+00

Pos logit contributions
 Sam+0.006
 Andy+0.006
 mom+0.006
 Betty+0.006
 theirs+0.006
Neg logit contributions
 tant-0.011
ard-0.009
vious-0.008
ly-0.008
est-0.007
Token put
Token ID1234
Feature activation+1.18e+00

Pos logit contributions
vern+0.035
 theirs+0.033
 buddies+0.030
 Betty+0.030
 moms+0.029
Neg logit contributions
 tant-0.041
ard-0.036
 floors-0.029
ly-0.029
 Are-0.028
Token the
Token ID262
Feature activation+3.74e-02

Pos logit contributions
 tant+0.057
ard+0.049
 Are+0.043
ly+0.042
 floors+0.042
Neg logit contributions
vern-0.029
 theirs-0.027
 Betty-0.025
 buddies-0.025
 moms-0.024
Token 3
Token ID513
Feature activation+4.55e+00

Pos logit contributions
 tant+0.002
ard+0.002
 Are+0.002
 Those+0.002
ly+0.001
Neg logit contributions
 theirs-0.001
vern-0.001
 Sam-0.001
 moms-0.001
 buddies-0.001
Token balls
Token ID11333
Feature activation-1.02e+00

Pos logit contributions
 tant+0.205
 Are+0.187
 Those+0.175
ard+0.172
rified+0.170
Neg logit contributions
 mom-0.094
 feat-0.094
 buddies-0.091
 adventure-0.091
 friend-0.090
Token on
Token ID319
Feature activation+5.22e-01

Pos logit contributions
vern+0.034
girls+0.034
 moms+0.032
 Betty+0.032
 theirs+0.032
Neg logit contributions
 tant-0.037
ard-0.032
ly-0.028
est-0.026
 floors-0.025
Token top
Token ID1353
Feature activation-3.08e-01

Pos logit contributions
 tant+0.027
ard+0.024
 Are+0.021
ly+0.021
 floors+0.021
Neg logit contributions
vern-0.010
 theirs-0.010
 buddies-0.009
 Betty-0.009
 moms-0.009
Token of
Token ID286
Feature activation+2.92e-01

Pos logit contributions
 theirs+0.007
 Sam+0.006
vern+0.006
 Betty+0.006
 buddies+0.006
Neg logit contributions
 tant-0.015
ard-0.013
 Are-0.012
 Those-0.011
 floors-0.011
Token each
Token ID1123
Feature activation-8.89e-03

Pos logit contributions
 tant+0.015
ard+0.013
 Are+0.012
 floors+0.011
ly+0.011
Neg logit contributions
vern-0.006
 theirs-0.006
 buddies-0.005
 Betty-0.005
 moms-0.005
Token around
Token ID1088
Feature activation+4.18e-01

Pos logit contributions
 tant+0.054
ard+0.048
 Are+0.041
 Those+0.039
 floors+0.039
Neg logit contributions
 theirs-0.037
vern-0.035
 Sam-0.035
 Betty-0.035
 buddies-0.033
Token and
Token ID290
Feature activation+1.89e-02

Pos logit contributions
 tant+0.018
ard+0.015
 Are+0.014
ly+0.013
est+0.013
Neg logit contributions
vern-0.012
 theirs-0.012
 moms-0.011
 buddies-0.011
girls-0.011
Token saw
Token ID2497
Feature activation-2.12e-01

Pos logit contributions
 tant+0.001
ard+0.001
 Are+0.001
ly+0.001
 floors+0.001
Neg logit contributions
 theirs-0.001
vern-0.001
 buddies-0.001
 Betty-0.001
 moms-0.001
Token a
Token ID257
Feature activation-2.46e-01

Pos logit contributions
 theirs+0.004
 Sam+0.003
 Betty+0.003
vern+0.003
 Andy+0.003
Neg logit contributions
 tant-0.012
ard-0.011
 Are-0.009
 floors-0.009
ly-0.009
Token small
Token ID1402
Feature activation+1.99e-01

Pos logit contributions
vern+0.006
 mom+0.006
 Sam+0.006
 Betty+0.006
 theirs+0.006
Neg logit contributions
 tant-0.011
ard-0.009
 Those-0.008
ly-0.008
est-0.008
Token 3
Token ID513
Feature activation+4.55e+00

Pos logit contributions
 tant+0.009
ard+0.008
 Those+0.007
 Are+0.007
 floors+0.007
Neg logit contributions
 Sam-0.005
 Betty-0.005
 mom-0.005
 theirs-0.004
 feat-0.004
Token year
Token ID614
Feature activation+2.23e-01

Pos logit contributions
 tant+0.187
 Are+0.150
ard+0.149
 Those+0.146
 fart+0.139
Neg logit contributions
 feat-0.125
 Betty-0.122
vern-0.121
 mom-0.121
 Sam-0.118
Token old
Token ID1468
Feature activation-1.80e-01

Pos logit contributions
 tant+0.009
ard+0.008
ly+0.006
 Are+0.006
 floors+0.006
Neg logit contributions
 theirs-0.007
vern-0.007
 Betty-0.007
 buddies-0.007
girls-0.007
Token child
Token ID1200
Feature activation+9.83e-01

Pos logit contributions
 theirs+0.006
vern+0.006
 Betty+0.005
 Sam+0.005
 buddies+0.005
Neg logit contributions
 tant-0.007
ard-0.006
 Are-0.005
ly-0.005
 floors-0.005
Token standing
Token ID5055
Feature activation+6.85e-01

Pos logit contributions
 tant+0.046
ard+0.043
 Are+0.038
ly+0.036
 Those+0.035
Neg logit contributions
 theirs-0.023
vern-0.023
 moms-0.021
 Betty-0.021
 buddies-0.020
Token there
Token ID612
Feature activation+4.82e-01

Pos logit contributions
 tant+0.029
ard+0.026
 Are+0.022
 floors+0.022
ly+0.021
Neg logit contributions
 theirs-0.020
vern-0.019
 Betty-0.018
girls-0.018
 buddies-0.018
Token?"
Token ID1701
Feature activation-8.09e-01

Pos logit contributions
 theirs+0.005
vern+0.005
 Betty+0.005
 Sam+0.005
 buddies+0.005
Neg logit contributions
 tant-0.011
ard-0.010
 Are-0.009
ly-0.008
 floors-0.008
Token asked
Token ID1965
Feature activation-5.23e-01

Pos logit contributions
 Betty+0.016
 Sam+0.016
 mom+0.015
 Andy+0.015
 Dog+0.014
Neg logit contributions
 tant-0.044
ard-0.038
ly-0.033
arre-0.033
 fart-0.033
Token Bobby
Token ID17155
Feature activation+4.37e-01

Pos logit contributions
vern+0.011
 theirs+0.010
 buddies+0.009
girls+0.009
 moms+0.009
Neg logit contributions
 tant-0.027
ard-0.024
 Are-0.021
ly-0.021
 floors-0.020
Token,
Token ID11
Feature activation-1.13e+00

Pos logit contributions
 tant+0.024
ard+0.021
 Are+0.019
ly+0.018
 floors+0.018
Neg logit contributions
 theirs-0.007
vern-0.007
 Betty-0.007
 Sam-0.007
 buddies-0.006
Token a
Token ID257
Feature activation-2.59e-01

Pos logit contributions
 theirs+0.031
vern+0.029
 Sam+0.029
 Betty+0.028
 buddies+0.027
Neg logit contributions
 tant-0.051
ard-0.044
 Are-0.038
ly-0.037
 Those-0.036
Token 3
Token ID513
Feature activation+4.53e+00

Pos logit contributions
vern+0.008
 Sam+0.008
 mom+0.008
 Betty+0.008
 theirs+0.008
Neg logit contributions
 tant-0.010
ard-0.009
ly-0.007
 Those-0.007
 floors-0.007
Token year
Token ID614
Feature activation+4.51e-01

Pos logit contributions
 tant+0.180
ard+0.152
 Are+0.143
 Those+0.136
 fart+0.132
Neg logit contributions
 Betty-0.129
 Sam-0.127
 mom-0.127
 theirs-0.127
vern-0.125
Token old
Token ID1468
Feature activation-3.25e-01

Pos logit contributions
 tant+0.017
ard+0.015
ly+0.013
 Are+0.012
 floors+0.012
Neg logit contributions
 theirs-0.015
vern-0.015
girls-0.015
 Betty-0.015
 buddies-0.014
Token boy
Token ID2933
Feature activation+4.75e-01

Pos logit contributions
vern+0.012
 theirs+0.012
girls+0.011
 buddies+0.011
 Betty+0.011
Neg logit contributions
 tant-0.011
ard-0.010
 Are-0.008
ly-0.008
est-0.007
Token.
Token ID13
Feature activation-5.08e-01

Pos logit contributions
 tant+0.025
ard+0.022
 Are+0.020
ly+0.019
 Those+0.019
Neg logit contributions
vern-0.009
 theirs-0.009
 buddies-0.008
 Betty-0.008
 moms-0.008
Token
Token ID198
Feature activation+3.02e-01

Pos logit contributions
 theirs+0.012
vern+0.012
 Betty+0.011
 buddies+0.011
 Sam+0.011
Neg logit contributions
 tant-0.024
ard-0.021
 Are-0.019
ly-0.018
 Those-0.018
Token is
Token ID318
Feature activation-2.67e+00

Pos logit contributions
 tant+0.010
ard+0.008
 Are+0.006
ly+0.006
 floors+0.006
Neg logit contributions
 theirs-0.011
vern-0.011
 Betty-0.010
 Sam-0.010
 buddies-0.010
Token it
Token ID340
Feature activation-9.48e-01

Pos logit contributions
vern+0.081
girls+0.075
 buddies+0.074
 theirs+0.073
 moms+0.071
Neg logit contributions
 tant-0.108
ard-0.096
 Are-0.087
ly-0.081
 floors-0.081
Token?"
Token ID1701
Feature activation-3.84e-01

Pos logit contributions
 theirs+0.017
 Betty+0.016
 Sam+0.016
vern+0.015
 Andy+0.015
Neg logit contributions
 tant-0.053
ard-0.046
 Are-0.041
 floors-0.040
 Those-0.040
Token asked
Token ID1965
Feature activation+3.80e-01

Pos logit contributions
 Sam+0.009
 theirs+0.009
 Betty+0.009
 Andy+0.008
vern+0.008
Neg logit contributions
 tant-0.019
ard-0.016
 Are-0.014
ly-0.014
est-0.013
Token the
Token ID262
Feature activation-5.19e-03

Pos logit contributions
 tant+0.018
ard+0.016
 Are+0.014
ly+0.014
 floors+0.014
Neg logit contributions
vern-0.009
 theirs-0.009
 buddies-0.008
girls-0.008
 moms-0.008
Token 3
Token ID513
Feature activation+4.53e+00

Pos logit contributions
 mom+0.000
 moms+0.000
 theirs+0.000
 buddies+0.000
 dad+0.000
Neg logit contributions
 tant-0.000
ard-0.000
 Are-0.000
 Those-0.000
ly-0.000
Token-
Token ID12
Feature activation-1.29e+00

Pos logit contributions
 tant+0.170
ard+0.138
 Are+0.132
 Those+0.119
 fart+0.118
Neg logit contributions
 Betty-0.144
 mom-0.143
 Andy-0.141
 moms-0.139
 buddies-0.139
Tokenyear
Token ID1941
Feature activation+2.46e+00

Pos logit contributions
 theirs+0.042
vern+0.041
 Betty+0.039
 Sam+0.039
 buddies+0.039
Neg logit contributions
 tant-0.051
ard-0.043
 Are-0.037
ly-0.035
 floors-0.034
Token old
Token ID1468
Feature activation+1.72e-01

Pos logit contributions
 tant+0.101
ard+0.086
 Are+0.074
ly+0.071
 floors+0.070
Neg logit contributions
 theirs-0.076
vern-0.074
 Betty-0.070
 Sam-0.070
 buddies-0.069
Token.
Token ID13
Feature activation-8.09e-01

Pos logit contributions
 tant+0.007
ard+0.006
 Are+0.005
 Those+0.005
ly+0.005
Neg logit contributions
 theirs-0.005
vern-0.005
 Betty-0.005
 Sam-0.005
 buddies-0.005
Token
Token ID220
Feature activation+3.21e-01

Pos logit contributions
 theirs+0.017
vern+0.017
 Sam+0.016
 Betty+0.016
 Andy+0.016
Neg logit contributions
 tant-0.041
ard-0.035
ly-0.031
 Are-0.031
 floors-0.030
Token.
Token ID13
Feature activation+7.26e-01

Pos logit contributions
 tant+0.008
ard+0.008
ly+0.007
est+0.007
 Are+0.007
Neg logit contributions
vern-0.003
 theirs-0.003
 Betty-0.002
 buddies-0.002
 moms-0.002
Token He
Token ID679
Feature activation+4.13e-01

Pos logit contributions
 tant+0.033
ard+0.027
ly+0.024
ited+0.024
 situations+0.023
Neg logit contributions
 Sam-0.024
 Andy-0.021
 Betty-0.021
 Sue-0.021
 mom-0.021
Token saw
Token ID2497
Feature activation+1.84e-01

Pos logit contributions
 tant+0.019
ard+0.016
 Those+0.015
 Are+0.015
est+0.014
Neg logit contributions
 theirs-0.010
 Sam-0.010
 moms-0.009
 mom-0.009
 Andy-0.009
Token a
Token ID257
Feature activation-5.99e-02

Pos logit contributions
 tant+0.011
ard+0.010
 Are+0.009
 Those+0.008
 million+0.008
Neg logit contributions
 theirs-0.003
 Sam-0.003
 mom-0.002
 moms-0.002
 Betty-0.002
Token small
Token ID1402
Feature activation+3.15e-01

Pos logit contributions
vern+0.001
 mom+0.001
 theirs+0.001
 Sam+0.001
 moms+0.001
Neg logit contributions
 tant-0.003
ard-0.002
 Those-0.002
ly-0.002
vious-0.002
Token 3
Token ID513
Feature activation+4.51e+00

Pos logit contributions
 tant+0.014
ard+0.012
 Those+0.011
 Are+0.011
 ounces+0.010
Neg logit contributions
 Sam-0.008
 mom-0.008
 friend-0.007
 theirs-0.007
 Betty-0.007
Token year
Token ID614
Feature activation+3.82e-01

Pos logit contributions
 tant+0.186
 Are+0.163
ard+0.154
 Those+0.154
 fart+0.150
Neg logit contributions
 mom-0.112
 feat-0.111
 friend-0.109
 Sam-0.108
 adventure-0.107
Token old
Token ID1468
Feature activation-4.21e-01

Pos logit contributions
 tant+0.016
ard+0.014
 Are+0.011
ly+0.011
 floors+0.011
Neg logit contributions
 theirs-0.012
vern-0.011
 Betty-0.011
 buddies-0.011
 Sam-0.011
Token boy
Token ID2933
Feature activation+7.18e-01

Pos logit contributions
 theirs+0.017
vern+0.016
 Betty+0.016
 Sam+0.016
 buddies+0.015
Neg logit contributions
 tant-0.014
ard-0.011
 Are-0.009
ly-0.009
 floors-0.008
Token who
Token ID508
Feature activation-4.52e-01

Pos logit contributions
 tant+0.034
ard+0.032
 Are+0.028
ly+0.027
 Those+0.026
Neg logit contributions
vern-0.017
 theirs-0.017
 moms-0.015
 buddies-0.015
 Betty-0.015
Token was
Token ID373
Feature activation-2.84e-01

Pos logit contributions
 theirs+0.013
vern+0.012
 Sam+0.012
 Betty+0.012
 buddies+0.011
Neg logit contributions
 tant-0.020
ard-0.017
 Are-0.015
ly-0.014
 floors-0.014
TokenThe
Token ID464
Feature activation+2.77e-01

Pos logit contributions
 theirs+0.009
vern+0.008
 Sam+0.008
 Betty+0.008
girls+0.008
Neg logit contributions
 tant-0.008
ard-0.007
 Are-0.006
 Those-0.005
 million-0.005
Token blue
Token ID4171
Feature activation-3.70e-01

Pos logit contributions
 tant+0.011
ard+0.009
 Are+0.008
 Those+0.007
ly+0.007
Neg logit contributions
vern-0.009
 theirs-0.009
 moms-0.009
 mom-0.009
 Sam-0.009
Token hook
Token ID8011
Feature activation+5.59e-01

Pos logit contributions
 Sam+0.009
 mom+0.008
 theirs+0.008
 moms+0.008
 friend+0.008
Neg logit contributions
 tant-0.017
ard-0.014
 Are-0.014
 million-0.013
 Those-0.013
Token and
Token ID290
Feature activation+1.92e-02

Pos logit contributions
 tant+0.022
ard+0.019
ly+0.016
 Are+0.016
est+0.016
Neg logit contributions
 theirs-0.018
vern-0.018
 Betty-0.016
 Sam-0.016
 buddies-0.016
Token the
Token ID262
Feature activation+1.41e+00

Pos logit contributions
 tant+0.001
ard+0.001
 Are+0.001
ly+0.001
 Those+0.001
Neg logit contributions
 theirs-0.000
vern-0.000
 Betty-0.000
 Sam-0.000
 buddies-0.000
Token 3
Token ID513
Feature activation+4.51e+00

Pos logit contributions
 tant+0.059
ard+0.045
 Are+0.042
 Those+0.042
arre+0.042
Neg logit contributions
 mom-0.042
 Sam-0.041
 feat-0.041
 moms-0.040
 dad-0.040
Token-
Token ID12
Feature activation-9.83e-01

Pos logit contributions
 tant+0.176
 Are+0.155
ard+0.139
 Those+0.135
 fart+0.134
Neg logit contributions
 buddies-0.129
 moms-0.128
 mom-0.126
 feat-0.126
 theirs-0.125
Tokenyear
Token ID1941
Feature activation+2.43e+00

Pos logit contributions
 theirs+0.034
vern+0.033
 Sam+0.031
 Betty+0.031
 buddies+0.031
Neg logit contributions
 tant-0.037
ard-0.031
ly-0.026
 Are-0.025
est-0.024
Token-
Token ID12
Feature activation-8.33e-01

Pos logit contributions
 tant+0.098
ard+0.084
ly+0.070
 Are+0.069
 floors+0.068
Neg logit contributions
 theirs-0.077
vern-0.076
 Betty-0.072
 Sam-0.072
 buddies-0.070
Tokenold
Token ID727
Feature activation-2.36e-01

Pos logit contributions
 theirs+0.025
vern+0.024
girls+0.023
 Sam+0.023
 Betty+0.023
Neg logit contributions
 tant-0.034
ard-0.030
ly-0.026
 Those-0.024
est-0.024
Token were
Token ID547
Feature activation+5.28e-01

Pos logit contributions
 theirs+0.007
vern+0.007
 Betty+0.007
 Sam+0.006
 buddies+0.006
Neg logit contributions
 tant-0.010
ard-0.008
 Are-0.007
ly-0.007
est-0.007
Token is
Token ID318
Feature activation-3.38e+00

Pos logit contributions
 theirs+0.029
vern+0.028
 Betty+0.027
 buddies+0.026
girls+0.026
Neg logit contributions
 tant-0.045
ard-0.039
 Are-0.036
ly-0.033
 floors-0.033
Token safe
Token ID3338
Feature activation-1.44e+00

Pos logit contributions
vern+0.103
 buddies+0.101
girls+0.099
 Betty+0.099
 theirs+0.099
Neg logit contributions
 tant-0.125
ard-0.108
 Are-0.098
 floors-0.095
ly-0.093
Token and
Token ID290
Feature activation-6.31e-01

Pos logit contributions
vern+0.031
 theirs+0.031
 Betty+0.028
 buddies+0.028
 Sam+0.027
Neg logit contributions
 tant-0.071
ard-0.063
 Are-0.056
ly-0.055
est-0.054
Token what
Token ID644
Feature activation-1.00e+00

Pos logit contributions
 theirs+0.014
vern+0.013
 Sam+0.012
 Betty+0.012
 buddies+0.012
Neg logit contributions
 tant-0.032
ard-0.029
 Those-0.024
 Are-0.024
est-0.024
Token is
Token ID318
Feature activation-3.53e+00

Pos logit contributions
 theirs+0.033
vern+0.033
 Betty+0.031
 buddies+0.030
girls+0.030
Neg logit contributions
 tant-0.038
ard-0.032
 Are-0.029
 Those-0.026
ly-0.026
Token not
Token ID407
Feature activation-3.97e+00

Pos logit contributions
vern+0.097
 buddies+0.093
 Betty+0.093
girls+0.092
 theirs+0.091
Neg logit contributions
 tant-0.145
ard-0.126
 Are-0.114
 floors-0.113
ly-0.108
Token.
Token ID13
Feature activation+6.31e-02

Pos logit contributions
vern+0.086
 theirs+0.079
 buddies+0.079
girls+0.078
 moms+0.078
Neg logit contributions
 tant-0.197
ard-0.171
 Are-0.154
ly-0.151
 floors-0.149
Token OK
Token ID7477
Feature activation-5.42e-01

Pos logit contributions
 tant+0.003
ard+0.002
ly+0.002
 floors+0.002
est+0.002
Neg logit contributions
 Sam-0.002
 Betty-0.002
 theirs-0.002
 Dog-0.002
 Andy-0.002
Token?"
Token ID1701
Feature activation-1.13e+00

Pos logit contributions
vern+0.017
 theirs+0.016
girls+0.015
 buddies+0.014
 accomplishment+0.014
Neg logit contributions
 tant-0.022
ard-0.019
 Are-0.019
est-0.017
ly-0.016
Token she
Token ID673
Feature activation-3.91e-01

Pos logit contributions
 theirs+0.032
vern+0.032
 buddies+0.030
girls+0.030
 Betty+0.029
Neg logit contributions
 tant-0.049
ard-0.041
 Are-0.039
 Those-0.035
ly-0.035
Token said
Token ID531
Feature activation-1.32e+00

Pos logit contributions
 theirs+0.008
vern+0.007
 Sam+0.007
 Betty+0.007
 buddies+0.007
Neg logit contributions
 tant-0.020
ard-0.018
 Are-0.016
ly-0.016
 Those-0.016
Token a
Token ID257
Feature activation-6.08e-01

Pos logit contributions
 theirs+0.018
vern+0.018
 Betty+0.017
 Sam+0.017
 buddies+0.016
Neg logit contributions
 tant-0.036
ard-0.031
 Are-0.027
ly-0.027
 Those-0.026
Token greater
Token ID3744
Feature activation+6.16e-01

Pos logit contributions
vern+0.017
 theirs+0.016
 Betty+0.016
 Sam+0.016
 buddies+0.016
Neg logit contributions
 tant-0.026
ard-0.022
 Are-0.019
 Those-0.019
ly-0.019
Token reward
Token ID6721
Feature activation-1.28e+00

Pos logit contributions
 tant+0.027
ard+0.023
 Are+0.020
 Those+0.019
ly+0.019
Neg logit contributions
 theirs-0.017
vern-0.017
 Betty-0.016
 Sam-0.016
 buddies-0.016
Token than
Token ID621
Feature activation-1.61e+00

Pos logit contributions
vern+0.027
 theirs+0.025
 moms+0.024
 Betty+0.024
girls+0.023
Neg logit contributions
 tant-0.060
ard-0.057
ly-0.050
est-0.049
 Are-0.048
Token any
Token ID597
Feature activation-6.88e-01

Pos logit contributions
 theirs+0.036
vern+0.036
 Betty+0.032
 buddies+0.032
 moms+0.032
Neg logit contributions
 tant-0.079
ard-0.069
 Are-0.061
ly-0.060
 floors-0.059
Token fl
Token ID781
Feature activation-3.82e+00

Pos logit contributions
 theirs+0.020
vern+0.019
 Betty+0.018
 Sam+0.018
 buddies+0.018
Neg logit contributions
 tant-0.029
ard-0.025
 Are-0.022
 Those-0.021
ly-0.021
Tokenuttering
Token ID33598
Feature activation-1.60e+00

Pos logit contributions
 theirs+0.104
vern+0.098
 Betty+0.096
 Sam+0.094
 Andy+0.093
Neg logit contributions
 tant-0.172
ard-0.152
 Are-0.127
ly-0.127
 Those-0.123
Token wings
Token ID12098
Feature activation-1.87e+00

Pos logit contributions
 theirs+0.044
vern+0.043
 Betty+0.040
 Sam+0.040
 buddies+0.039
Neg logit contributions
 tant-0.071
ard-0.062
 Are-0.054
ly-0.052
 floors-0.051
Token or
Token ID393
Feature activation-3.04e+00

Pos logit contributions
 theirs+0.030
vern+0.029
 Betty+0.026
 Sam+0.026
 buddies+0.025
Neg logit contributions
 tant-0.103
ard-0.092
 Are-0.083
ly-0.081
 floors-0.080
Token magical
Token ID10883
Feature activation-1.54e+00

Pos logit contributions
 theirs+0.079
 Sam+0.074
vern+0.072
 Betty+0.072
 moms+0.069
Neg logit contributions
 tant-0.142
ard-0.123
est-0.105
 Those-0.105
ly-0.104
Token taps
Token ID34531
Feature activation-1.41e+00

Pos logit contributions
 theirs+0.039
vern+0.038
 Betty+0.036
 Sam+0.035
 buddies+0.035
Neg logit contributions
 tant-0.072
ard-0.062
 Are-0.055
ly-0.052
 Those-0.052
Token parents
Token ID3397
Feature activation+1.49e+00

Pos logit contributions
 theirs+0.018
vern+0.017
 Betty+0.017
 Sam+0.017
 Andy+0.016
Neg logit contributions
 tant-0.026
ard-0.022
 Are-0.019
ly-0.019
 floors-0.019
Token -
Token ID532
Feature activation-7.46e-01

Pos logit contributions
 tant+0.074
ard+0.060
 Are+0.057
 floors+0.054
 million+0.053
Neg logit contributions
 theirs-0.035
vern-0.034
 Sam-0.032
 buddies-0.031
 Betty-0.031
Token they
Token ID484
Feature activation-8.72e-01

Pos logit contributions
 theirs+0.014
vern+0.014
 Betty+0.013
 Sam+0.012
 buddies+0.012
Neg logit contributions
 tant-0.039
ard-0.034
 Are-0.031
ly-0.030
 floors-0.030
Token know
Token ID760
Feature activation-4.83e-01

Pos logit contributions
 theirs+0.025
vern+0.024
 Betty+0.023
 Sam+0.023
 moms+0.022
Neg logit contributions
 tant-0.037
ard-0.033
 Are-0.028
ly-0.027
 Those-0.027
Token what
Token ID644
Feature activation-1.32e+00

Pos logit contributions
 theirs+0.011
vern+0.011
 Betty+0.010
girls+0.010
 moms+0.010
Neg logit contributions
 tant-0.022
ard-0.020
 Are-0.018
ly-0.018
 floors-0.017
Token is
Token ID318
Feature activation-3.78e+00

Pos logit contributions
 theirs+0.036
vern+0.035
 Betty+0.033
girls+0.033
 buddies+0.032
Neg logit contributions
 tant-0.056
ard-0.049
 Are-0.045
 Those-0.042
ly-0.041
Token best
Token ID1266
Feature activation-4.83e-01

Pos logit contributions
girls+0.105
 moms+0.100
 buddies+0.099
 Betty+0.099
vern+0.099
Neg logit contributions
 tant-0.146
ard-0.131
 Are-0.118
est-0.115
ly-0.115
Token for
Token ID329
Feature activation-1.38e+00

Pos logit contributions
girls+0.010
 theirs+0.010
 moms+0.010
vern+0.009
 Betty+0.009
Neg logit contributions
 tant-0.022
ard-0.021
ly-0.019
est-0.019
 Are-0.019
Token us
Token ID514
Feature activation-1.82e+00

Pos logit contributions
vern+0.039
 theirs+0.038
 Betty+0.037
 moms+0.037
girls+0.036
Neg logit contributions
 tant-0.054
ard-0.049
 Are-0.042
ly-0.041
 floors-0.040
Token!
Token ID0
Feature activation-1.10e+00

Pos logit contributions
 theirs+0.039
vern+0.038
 Betty+0.036
 moms+0.035
 Sam+0.035
Neg logit contributions
 tant-0.089
ard-0.080
 Are-0.071
ly-0.069
 Those-0.068
Token<|endoftext|>
Token ID50256
Feature activation-4.68e-01

Pos logit contributions
 theirs+0.030
vern+0.029
 buddies+0.027
 moms+0.026
girls+0.026
Neg logit contributions
 tant-0.047
ard-0.042
 Are-0.039
 Those-0.037
 million-0.035
Token is
Token ID318
Feature activation-3.08e+00

Pos logit contributions
 theirs+0.024
vern+0.023
 Betty+0.022
 buddies+0.021
girls+0.021
Neg logit contributions
 tant-0.037
ard-0.032
 Are-0.029
ly-0.027
 Those-0.027
Token safe
Token ID3338
Feature activation-1.27e+00

Pos logit contributions
girls+0.078
vern+0.078
 Betty+0.078
 buddies+0.076
 moms+0.076
Neg logit contributions
 tant-0.126
ard-0.112
 Are-0.099
 floors-0.098
ly-0.095
Token and
Token ID290
Feature activation-4.14e-01

Pos logit contributions
 theirs+0.033
vern+0.032
 Betty+0.030
 Sam+0.030
 buddies+0.030
Neg logit contributions
 tant-0.058
ard-0.050
 Are-0.044
ly-0.042
 floors-0.042
Token what
Token ID644
Feature activation-9.14e-01

Pos logit contributions
 theirs+0.010
vern+0.009
 Sam+0.009
 Betty+0.008
 buddies+0.008
Neg logit contributions
 tant-0.021
ard-0.018
 Are-0.016
 Those-0.015
ly-0.015
Token is
Token ID318
Feature activation-3.33e+00

Pos logit contributions
 theirs+0.027
vern+0.027
 Betty+0.025
 buddies+0.025
girls+0.024
Neg logit contributions
 tant-0.038
ard-0.032
 Are-0.029
 Those-0.027
ly-0.027
Token not
Token ID407
Feature activation-3.76e+00

Pos logit contributions
vern+0.083
 Betty+0.082
girls+0.081
 Andy+0.080
 buddies+0.079
Neg logit contributions
 tant-0.142
ard-0.125
 floors-0.112
 Are-0.111
ly-0.106
Token.
Token ID13
Feature activation-1.28e-01

Pos logit contributions
vern+0.075
 theirs+0.069
 Betty+0.068
 buddies+0.068
 moms+0.068
Neg logit contributions
 tant-0.192
ard-0.169
 Are-0.151
ly-0.148
 floors-0.146
Token Your
Token ID3406
Feature activation+1.78e-01

Pos logit contributions
 Sam+0.004
 theirs+0.004
 Betty+0.004
 Dog+0.004
vern+0.004
Neg logit contributions
 tant-0.005
ard-0.004
ly-0.004
 floors-0.004
est-0.004
Token lungs
Token ID21726
Feature activation+1.14e-02

Pos logit contributions
 tant+0.007
ard+0.006
 Are+0.005
 Those+0.005
ly+0.005
Neg logit contributions
 theirs-0.006
vern-0.005
 Sam-0.005
 buddies-0.005
 moms-0.005
Token help
Token ID1037
Feature activation-1.85e-01

Pos logit contributions
 tant+0.001
ard+0.000
 Are+0.000
 floors+0.000
ly+0.000
Neg logit contributions
 theirs-0.000
vern-0.000
 Sam-0.000
 together-0.000
 themselves-0.000
Token you
Token ID345
Feature activation+6.33e-01

Pos logit contributions
 theirs+0.004
 Sam+0.004
vern+0.004
 Betty+0.004
 Andy+0.004
Neg logit contributions
 tant-0.009
ard-0.008
 Are-0.007
 Those-0.007
 floors-0.007
Token himself
Token ID2241
Feature activation-2.40e-01

Pos logit contributions
vern+0.015
 theirs+0.015
 Betty+0.014
 buddies+0.014
 moms+0.014
Neg logit contributions
 tant-0.022
ard-0.019
 Are-0.017
 floors-0.016
ly-0.016
Token and
Token ID290
Feature activation-8.14e-02

Pos logit contributions
vern+0.005
 theirs+0.005
 moms+0.005
 Betty+0.004
girls+0.004
Neg logit contributions
 tant-0.012
ard-0.011
ly-0.010
est-0.009
 Those-0.009
Token the
Token ID262
Feature activation-1.89e-01

Pos logit contributions
 theirs+0.002
vern+0.002
 Betty+0.002
 Sam+0.002
 buddies+0.002
Neg logit contributions
 tant-0.004
ard-0.003
 Are-0.003
ly-0.003
 floors-0.003
Token great
Token ID1049
Feature activation+4.86e-02

Pos logit contributions
 theirs+0.007
vern+0.006
girls+0.006
ores+0.006
 buddies+0.006
Neg logit contributions
 tant-0.007
ard-0.006
 floors-0.005
est-0.005
 Are-0.005
Token bird
Token ID6512
Feature activation-1.83e+00

Pos logit contributions
 tant+0.002
ard+0.002
est+0.001
 floors+0.001
ly+0.001
Neg logit contributions
vern-0.002
girls-0.002
 theirs-0.002
 themselves-0.001
 buddies-0.001
Tokenhouse
Token ID4803
Feature activation-3.76e+00

Pos logit contributions
vern+0.042
 theirs+0.041
 moms+0.037
 Betty+0.035
 buddies+0.034
Neg logit contributions
 tant-0.087
ard-0.081
 Are-0.070
ly-0.070
est-0.069
Token he
Token ID339
Feature activation-8.12e-01

Pos logit contributions
vern+0.103
 theirs+0.099
 Betty+0.092
girls+0.090
 moms+0.087
Neg logit contributions
 tant-0.155
ard-0.144
est-0.127
ly-0.125
 million-0.122
Token made
Token ID925
Feature activation+7.05e-01

Pos logit contributions
 theirs+0.023
vern+0.022
 Betty+0.021
 buddies+0.020
 moms+0.020
Neg logit contributions
 tant-0.036
ard-0.032
 Are-0.028
ly-0.026
est-0.026
Token with
Token ID351
Feature activation-9.54e-01

Pos logit contributions
 tant+0.034
ard+0.030
 Are+0.027
ly+0.026
est+0.026
Neg logit contributions
 theirs-0.016
vern-0.015
 Betty-0.014
 moms-0.014
girls-0.014
Token Grand
Token ID5675
Feature activation+4.75e-01

Pos logit contributions
 theirs+0.022
vern+0.022
 Betty+0.020
 Sam+0.020
 buddies+0.020
Neg logit contributions
 tant-0.047
ard-0.041
 Are-0.036
ly-0.035
 floors-0.034
Tokenma
Token ID2611
Feature activation-3.64e-01

Pos logit contributions
 tant+0.020
ard+0.019
est+0.017
ly+0.016
 Those+0.015
Neg logit contributions
 theirs-0.014
 Betty-0.012
 Sam-0.012
vern-0.011
 Buddy-0.011
Token hungry
Token ID14720
Feature activation+2.64e-01

Pos logit contributions
 theirs+0.005
vern+0.005
 Betty+0.005
 Sam+0.005
 moms+0.005
Neg logit contributions
 tant-0.009
ard-0.008
 Are-0.007
 floors-0.007
ly-0.007
Token and
Token ID290
Feature activation-7.81e-01

Pos logit contributions
 tant+0.016
ard+0.014
 Are+0.013
 Those+0.012
icer+0.012
Neg logit contributions
 theirs-0.003
 Sam-0.003
vern-0.003
 buddies-0.003
 Betty-0.003
Token want
Token ID765
Feature activation-1.13e+00

Pos logit contributions
 theirs+0.018
vern+0.016
 Sam+0.016
 Betty+0.016
 buddies+0.015
Neg logit contributions
 tant-0.039
ard-0.035
est-0.030
 Those-0.029
 Are-0.029
Token something
Token ID1223
Feature activation-1.04e+00

Pos logit contributions
 theirs+0.022
vern+0.022
 Betty+0.019
 buddies+0.019
girls+0.019
Neg logit contributions
 tant-0.058
ard-0.052
 Are-0.046
ly-0.045
 floors-0.045
Token really
Token ID1107
Feature activation-1.28e+00

Pos logit contributions
 theirs+0.025
vern+0.024
 Betty+0.023
 Sam+0.022
 buddies+0.022
Neg logit contributions
 tant-0.049
ard-0.043
 Are-0.038
ly-0.037
 floors-0.037
Token badly
Token ID11234
Feature activation-3.73e+00

Pos logit contributions
 theirs+0.033
vern+0.032
 Sam+0.030
 Betty+0.030
 buddies+0.029
Neg logit contributions
 tant-0.060
ard-0.052
 Are-0.045
ly-0.043
 Those-0.043
Token.
Token ID13
Feature activation-5.83e-01

Pos logit contributions
 theirs+0.052
vern+0.049
 Betty+0.043
 Sam+0.042
 buddies+0.041
Neg logit contributions
 tant-0.217
ard-0.194
 Are-0.176
ly-0.170
 floors-0.170
Token And
Token ID843
Feature activation-4.82e-01

Pos logit contributions
 theirs+0.017
vern+0.016
 Sam+0.016
 Betty+0.016
 buddies+0.015
Neg logit contributions
 tant-0.026
ard-0.022
ly-0.019
 floors-0.018
 Are-0.018
Token sorting
Token ID29407
Feature activation+4.54e-01

Pos logit contributions
 theirs+0.009
 Sam+0.008
vern+0.008
 Betty+0.008
 buddies+0.008
Neg logit contributions
 tant-0.026
ard-0.023
 Are-0.020
ly-0.020
 floors-0.020
Token things
Token ID1243
Feature activation-2.44e-02

Pos logit contributions
 tant+0.021
ard+0.019
 Are+0.017
 Those+0.016
 floors+0.016
Neg logit contributions
 theirs-0.011
 Sam-0.010
vern-0.010
 together-0.010
 buddies-0.010
Token can
Token ID460
Feature activation+1.12e-01

Pos logit contributions
 theirs+0.001
vern+0.001
 Betty+0.000
 Sam+0.000
 buddies+0.000
Neg logit contributions
 tant-0.001
ard-0.001
 Are-0.001
ly-0.001
 floors-0.001
Token so
Token ID523
Feature activation+3.80e-01

Pos logit contributions
 tant+0.004
 million+0.003
 floors+0.002
ard+0.002
est+0.002
Neg logit contributions
vern-0.008
iki-0.008
girls-0.008
uddy-0.007
 moms-0.007
Token happy
Token ID3772
Feature activation-7.15e-02

Pos logit contributions
 tant+0.017
ard+0.014
 Are+0.013
 million+0.012
 floors+0.012
Neg logit contributions
 theirs-0.010
vern-0.010
 Betty-0.010
girls-0.009
 Sam-0.009
Token that
Token ID326
Feature activation-5.45e-01

Pos logit contributions
vern+0.002
 theirs+0.002
girls+0.002
 moms+0.002
 Betty+0.002
Neg logit contributions
ard-0.002
 tant-0.002
ly-0.002
 Are-0.002
 Those-0.002
Token the
Token ID262
Feature activation-5.91e-01

Pos logit contributions
vern+0.013
iki+0.010
girls+0.010
 buddies+0.009
 theirs+0.009
Neg logit contributions
 tant-0.026
ard-0.023
 Are-0.023
 floors-0.021
 Those-0.021
Token tree
Token ID5509
Feature activation-1.81e+00

Pos logit contributions
 theirs+0.021
vern+0.019
uzz+0.018
 themselves+0.017
girls+0.017
Neg logit contributions
 tant-0.022
ard-0.021
 floors-0.019
est-0.017
 million-0.016
Tokenhouse
Token ID4803
Feature activation-3.72e+00

Pos logit contributions
vern+0.066
 theirs+0.065
 Betty+0.059
 moms+0.058
 buddies+0.057
Neg logit contributions
 tant-0.062
ard-0.058
ly-0.046
est-0.045
 Are-0.045
Token was
Token ID373
Feature activation-7.10e-01

Pos logit contributions
vern+0.146
 theirs+0.140
girls+0.133
 Betty+0.131
 buddies+0.128
Neg logit contributions
 tant-0.117
ard-0.104
est-0.085
ly-0.084
 million-0.081
Token released
Token ID2716
Feature activation+2.92e-01

Pos logit contributions
girls+0.026
 moms+0.026
iki+0.025
uddy+0.023
ses+0.023
Neg logit contributions
 tant-0.020
 million-0.018
ard-0.017
est-0.015
 floors-0.015
Token.
Token ID13
Feature activation-4.64e-01

Pos logit contributions
 tant+0.014
ard+0.013
ly+0.011
est+0.011
 Are+0.010
Neg logit contributions
vern-0.007
 moms-0.007
 theirs-0.006
girls-0.006
 Betty-0.006
Token They
Token ID1119
Feature activation-1.50e+00

Pos logit contributions
vern+0.017
 theirs+0.016
 buddies+0.015
girls+0.014
 Betty+0.014
Neg logit contributions
 tant-0.017
ard-0.015
 Are-0.013
 Those-0.013
 million-0.011
Token had
Token ID550
Feature activation-6.93e-01

Pos logit contributions
vern+0.061
 theirs+0.056
 possible+0.052
ores+0.052
 moms+0.051
Neg logit contributions
 tant-0.042
ard-0.036
 million-0.032
 woke-0.030
ly-0.027
Token and
Token ID290
Feature activation-4.31e-01

Pos logit contributions
 theirs+0.015
vern+0.014
 Betty+0.013
 Sam+0.013
 buddies+0.013
Neg logit contributions
 tant-0.033
ard-0.029
 Are-0.026
ly-0.025
 floors-0.025
Token know
Token ID760
Feature activation-5.09e-01

Pos logit contributions
 theirs+0.009
vern+0.009
 Betty+0.008
 Sam+0.008
 buddies+0.008
Neg logit contributions
 tant-0.022
ard-0.020
 Are-0.017
ly-0.017
 Those-0.017
Token when
Token ID618
Feature activation-1.14e+00

Pos logit contributions
 theirs+0.011
vern+0.011
 Betty+0.010
 Sam+0.010
 buddies+0.010
Neg logit contributions
 tant-0.025
ard-0.022
 Are-0.020
ly-0.019
 floors-0.019
Token something
Token ID1223
Feature activation-1.42e+00

Pos logit contributions
 theirs+0.029
vern+0.028
 Betty+0.027
 Sam+0.026
 buddies+0.026
Neg logit contributions
 tant-0.054
ard-0.046
 Are-0.040
ly-0.039
 floors-0.038
Token is
Token ID318
Feature activation-2.36e+00

Pos logit contributions
 theirs+0.038
vern+0.038
 Sam+0.035
 Betty+0.035
 buddies+0.035
Neg logit contributions
 tant-0.065
ard-0.055
 Are-0.048
 floors-0.046
ly-0.046
Token too
Token ID1165
Feature activation-3.71e+00

Pos logit contributions
 theirs+0.061
vern+0.061
 Betty+0.056
 buddies+0.056
 Sam+0.056
Neg logit contributions
 tant-0.106
ard-0.092
 Are-0.081
ly-0.078
 floors-0.077
Token dangerous
Token ID4923
Feature activation-2.53e+00

Pos logit contributions
 theirs+0.101
vern+0.098
 Betty+0.092
 Sam+0.092
 buddies+0.088
Neg logit contributions
 tant-0.169
ard-0.143
 Are-0.127
 Those-0.121
 floors-0.121
Token to
Token ID284
Feature activation-7.11e-02

Pos logit contributions
 theirs+0.033
vern+0.031
 Betty+0.027
 Sam+0.027
 buddies+0.026
Neg logit contributions
 tant-0.149
ard-0.133
 Are-0.121
ly-0.117
 floors-0.117
Token do
Token ID466
Feature activation-5.73e-01

Pos logit contributions
 theirs+0.002
vern+0.002
 buddies+0.002
 Betty+0.002
 moms+0.002
Neg logit contributions
 tant-0.003
ard-0.003
 Are-0.002
ly-0.002
 Those-0.002
Token.
Token ID13
Feature activation-8.39e-01

Pos logit contributions
 theirs+0.009
vern+0.008
 Betty+0.007
 Sam+0.007
 buddies+0.007
Neg logit contributions
 tant-0.033
ard-0.029
 Are-0.026
ly-0.026
 floors-0.025
Token<|endoftext|>
Token ID50256
Feature activation-5.97e-01

Pos logit contributions
 theirs+0.017
vern+0.017
 Betty+0.017
 Sam+0.017
 Andy+0.016
Neg logit contributions
 tant-0.044
ard-0.038
ly-0.034
 floors-0.033
est-0.032
Token what
Token ID644
Feature activation-4.76e-01

Pos logit contributions
vern+0.006
 theirs+0.006
 Betty+0.005
girls+0.005
 buddies+0.005
Neg logit contributions
 tant-0.013
ard-0.012
ly-0.011
 Are-0.010
 Those-0.010
Token to
Token ID284
Feature activation-5.93e-01

Pos logit contributions
 theirs+0.012
vern+0.012
 Betty+0.011
 buddies+0.011
girls+0.011
Neg logit contributions
 tant-0.022
ard-0.019
 Are-0.017
 Those-0.016
ly-0.016
Token do
Token ID466
Feature activation-8.70e-01

Pos logit contributions
vern+0.017
 theirs+0.017
 buddies+0.017
 Betty+0.016
girls+0.016
Neg logit contributions
 tant-0.024
ard-0.021
 Are-0.018
ly-0.018
 Those-0.017
Token and
Token ID290
Feature activation+1.43e-01

Pos logit contributions
 theirs+0.021
vern+0.021
 Betty+0.020
 moms+0.019
 Sam+0.019
Neg logit contributions
 tant-0.039
ard-0.035
 Are-0.031
ly-0.030
 Those-0.030
Token what
Token ID644
Feature activation-5.15e-01

Pos logit contributions
 tant+0.008
ard+0.007
 Are+0.006
ly+0.006
est+0.006
Neg logit contributions
 theirs-0.003
vern-0.003
 Sam-0.003
 Betty-0.003
 buddies-0.002
Token not
Token ID407
Feature activation-3.69e+00

Pos logit contributions
 theirs+0.015
vern+0.015
 Betty+0.014
 buddies+0.013
 Sam+0.013
Neg logit contributions
 tant-0.022
ard-0.019
 Are-0.017
 Those-0.016
 floors-0.016
Token to
Token ID284
Feature activation-2.01e-01

Pos logit contributions
 theirs+0.088
vern+0.079
 Sam+0.078
 Betty+0.077
 Andy+0.073
Neg logit contributions
 tant-0.185
ard-0.160
 Are-0.142
ly-0.137
 Those-0.136
Token do
Token ID466
Feature activation-6.86e-01

Pos logit contributions
 theirs+0.005
vern+0.005
 Betty+0.005
 Sam+0.005
 buddies+0.005
Neg logit contributions
 tant-0.009
ard-0.008
 Are-0.007
ly-0.007
 floors-0.007
Token.
Token ID13
Feature activation-1.11e+00

Pos logit contributions
vern+0.016
 theirs+0.016
 Betty+0.016
 Sam+0.015
 Andy+0.015
Neg logit contributions
 tant-0.032
ard-0.028
 Are-0.025
ly-0.024
 floors-0.024
Token We
Token ID775
Feature activation-1.24e-01

Pos logit contributions
 theirs+0.036
 Betty+0.035
 Sam+0.035
vern+0.034
 Andy+0.034
Neg logit contributions
 tant-0.045
ard-0.037
ly-0.031
 Are-0.030
 floors-0.030
Token have
Token ID423
Feature activation-1.23e+00

Pos logit contributions
 theirs+0.004
vern+0.004
 buddies+0.004
 Betty+0.003
 Sam+0.003
Neg logit contributions
 tant-0.005
ard-0.004
 Are-0.004
ly-0.004
vious-0.003
Token building
Token ID2615
Feature activation-9.79e-01

Pos logit contributions
 tant+0.006
ard+0.005
 Are+0.005
 floors+0.005
 million+0.004
Neg logit contributions
vern-0.004
 moms-0.004
 theirs-0.004
girls-0.004
 Betty-0.004
Token such
Token ID884
Feature activation-5.02e-01

Pos logit contributions
vern+0.025
 theirs+0.024
 Betty+0.023
 moms+0.023
girls+0.023
Neg logit contributions
 tant-0.045
ard-0.038
 floors-0.033
 Are-0.032
est-0.032
Token a
Token ID257
Feature activation-7.14e-01

Pos logit contributions
ll+0.014
girls+0.014
vern+0.014
 moms+0.014
iki+0.013
Neg logit contributions
 tant-0.021
ard-0.015
 Are-0.015
 floors-0.014
est-0.014
Token great
Token ID1049
Feature activation-4.00e-01

Pos logit contributions
 theirs+0.022
girls+0.019
vern+0.019
 Betty+0.019
 Sam+0.019
Neg logit contributions
 tant-0.031
ard-0.026
 Are-0.023
 million-0.023
 floors-0.022
Token tree
Token ID5509
Feature activation-1.74e+00

Pos logit contributions
 theirs+0.012
girls+0.012
vern+0.012
 Betty+0.011
 themselves+0.011
Neg logit contributions
 tant-0.017
ard-0.014
est-0.012
 Are-0.012
 floors-0.012
Tokenhouse
Token ID4803
Feature activation-3.67e+00

Pos logit contributions
vern+0.040
 moms+0.039
 theirs+0.038
 teamwork+0.034
 buddies+0.034
Neg logit contributions
 tant-0.080
ard-0.076
ly-0.063
est-0.062
 million-0.061
Token.
Token ID13
Feature activation-4.17e-01

Pos logit contributions
vern+0.095
 moms+0.087
 Betty+0.086
utter+0.083
iki+0.083
Neg logit contributions
 tant-0.153
ard-0.140
est-0.123
ly-0.121
 Are-0.112
Token<|endoftext|>
Token ID50256
Feature activation-1.50e-01

Pos logit contributions
vern+0.012
 theirs+0.012
 buddies+0.011
girls+0.011
 moms+0.011
Neg logit contributions
 tant-0.017
ard-0.015
 Are-0.014
 Those-0.013
 million-0.012
TokenOnce
Token ID7454
Feature activation+8.95e-02

Pos logit contributions
 theirs+0.005
vern+0.005
 buddies+0.005
 teamwork+0.005
 moms+0.005
Neg logit contributions
 tant-0.005
ard-0.005
 Are-0.004
 Those-0.004
 million-0.004
Token upon
Token ID2402
Feature activation+1.54e-01

Pos logit contributions
 tant+0.003
ard+0.002
 Are+0.002
ly+0.002
 million+0.002
Neg logit contributions
girls-0.004
boys-0.004
vern-0.004
 carriers-0.004
 theirs-0.003
Token a
Token ID257
Feature activation-7.37e-01

Pos logit contributions
 tant+0.008
ard+0.007
 Are+0.006
ly+0.006
est+0.006
Neg logit contributions
 theirs-0.003
vern-0.003
 Betty-0.003
 buddies-0.003
 moms-0.003
Token parents
Token ID3397
Feature activation+1.34e+00

Pos logit contributions
 theirs+0.003
 Betty+0.003
vern+0.003
 Andy+0.003
 Fred+0.003
Neg logit contributions
 tant-0.009
ard-0.008
 floors-0.007
ly-0.007
 Are-0.007
Token because
Token ID780
Feature activation-1.42e+00

Pos logit contributions
 tant+0.065
ard+0.054
 Are+0.049
 floors+0.047
 million+0.047
Neg logit contributions
 theirs-0.032
vern-0.031
 Sam-0.030
 buddies-0.029
 Betty-0.029
Token they
Token ID484
Feature activation-1.12e+00

Pos logit contributions
vern+0.026
 theirs+0.025
girls+0.023
 Betty+0.023
 buddies+0.022
Neg logit contributions
 tant-0.074
ard-0.065
 Are-0.060
 floors-0.058
ly-0.058
Token know
Token ID760
Feature activation-6.24e-01

Pos logit contributions
 theirs+0.033
vern+0.031
 Betty+0.030
 Sam+0.030
 buddies+0.029
Neg logit contributions
 tant-0.047
ard-0.041
 Are-0.036
ly-0.034
 Those-0.034
Token what
Token ID644
Feature activation-1.40e+00

Pos logit contributions
vern+0.014
 theirs+0.014
 Betty+0.013
 buddies+0.013
girls+0.013
Neg logit contributions
 tant-0.029
ard-0.025
 Are-0.023
 floors-0.022
ly-0.022
Token is
Token ID318
Feature activation-3.61e+00

Pos logit contributions
 theirs+0.039
vern+0.037
 Betty+0.035
 buddies+0.035
girls+0.035
Neg logit contributions
 tant-0.060
ard-0.052
 Are-0.048
 Those-0.044
ly-0.044
Token best
Token ID1266
Feature activation-4.32e-01

Pos logit contributions
girls+0.100
vern+0.098
 Andy+0.097
 buddies+0.096
 Betty+0.096
Neg logit contributions
 tant-0.134
ard-0.119
 Are-0.108
est-0.105
ly-0.104
Token for
Token ID329
Feature activation-1.37e+00

Pos logit contributions
girls+0.009
 theirs+0.009
vern+0.009
 Betty+0.009
 moms+0.009
Neg logit contributions
 tant-0.019
ard-0.018
ly-0.017
 Are-0.016
est-0.016
Token you
Token ID345
Feature activation-1.18e+00

Pos logit contributions
vern+0.044
 Betty+0.042
 moms+0.041
girls+0.040
 theirs+0.040
Neg logit contributions
 tant-0.047
ard-0.044
ly-0.036
 floors-0.036
 Are-0.036
Token.
Token ID13
Feature activation-8.62e-01

Pos logit contributions
vern+0.027
 theirs+0.026
 Betty+0.026
 moms+0.025
 buddies+0.025
Neg logit contributions
 tant-0.052
ard-0.050
 Are-0.043
ly-0.043
 Those-0.042
Token<|endoftext|>
Token ID50256
Feature activation-3.84e-01

Pos logit contributions
 theirs+0.023
vern+0.023
 buddies+0.021
 moms+0.021
girls+0.020
Neg logit contributions
 tant-0.036
ard-0.033
 Are-0.031
 Those-0.029
 million-0.027
Token lesson
Token ID11483
Feature activation+7.27e-01

Pos logit contributions
 tant+0.047
ard+0.041
 Are+0.036
ly+0.036
est+0.035
Neg logit contributions
 theirs-0.019
vern-0.019
 Betty-0.017
 Sam-0.017
 buddies-0.017
Token -
Token ID532
Feature activation+3.47e-01

Pos logit contributions
 tant+0.035
ard+0.032
ly+0.029
 Are+0.028
est+0.027
Neg logit contributions
vern-0.015
 theirs-0.015
 moms-0.015
girls-0.014
 buddies-0.014
Token when
Token ID618
Feature activation-1.99e-01

Pos logit contributions
 tant+0.018
ard+0.016
 Are+0.014
ly+0.014
 floors+0.013
Neg logit contributions
 theirs-0.007
vern-0.007
 Betty-0.006
 Sam-0.006
 buddies-0.006
Token something
Token ID1223
Feature activation-5.31e-01

Pos logit contributions
 theirs+0.005
vern+0.005
 Betty+0.005
 Sam+0.005
 buddies+0.005
Neg logit contributions
 tant-0.009
ard-0.008
 Are-0.007
ly-0.007
 Those-0.007
Token seems
Token ID2331
Feature activation-1.79e+00

Pos logit contributions
 theirs+0.017
vern+0.017
 Betty+0.016
 Sam+0.016
 buddies+0.015
Neg logit contributions
 tant-0.021
ard-0.018
 Are-0.015
 floors-0.014
ly-0.014
Token too
Token ID1165
Feature activation-3.60e+00

Pos logit contributions
 theirs+0.049
vern+0.047
 Sam+0.045
 Betty+0.045
 buddies+0.044
Neg logit contributions
 tant-0.081
ard-0.070
 Are-0.060
 floors-0.058
ly-0.058
Token good
Token ID922
Feature activation-4.70e-01

Pos logit contributions
 theirs+0.103
vern+0.101
 Betty+0.096
 Sam+0.095
 buddies+0.092
Neg logit contributions
 tant-0.157
ard-0.134
 Are-0.117
 floors-0.111
 Those-0.111
Token to
Token ID284
Feature activation-1.42e-01

Pos logit contributions
vern+0.010
 theirs+0.010
 moms+0.009
girls+0.009
 buddies+0.009
Neg logit contributions
 tant-0.023
ard-0.021
ly-0.019
 Are-0.018
est-0.018
Token be
Token ID307
Feature activation-1.52e+00

Pos logit contributions
vern+0.004
 theirs+0.004
 buddies+0.004
girls+0.004
 moms+0.003
Neg logit contributions
 tant-0.006
ard-0.005
 Are-0.005
ly-0.005
est-0.005
Token true
Token ID2081
Feature activation+5.01e-01

Pos logit contributions
 theirs+0.038
vern+0.037
 Betty+0.034
 Sam+0.034
 moms+0.034
Neg logit contributions
 tant-0.071
ard-0.062
 Are-0.055
ly-0.053
 floors-0.052
Token,
Token ID11
Feature activation+1.15e-01

Pos logit contributions
 tant+0.026
ard+0.023
ly+0.021
 Are+0.021
est+0.020
Neg logit contributions
 theirs-0.010
vern-0.009
 Betty-0.009
 moms-0.008
girls-0.008
Token stay
Token ID2652
Feature activation-2.61e-01

Pos logit contributions
 theirs+0.002
vern+0.002
 Sam+0.002
 Betty+0.002
 moms+0.002
Neg logit contributions
 tant-0.005
ard-0.004
 Are-0.004
ly-0.004
 Those-0.004
Token informed
Token ID7981
Feature activation+5.40e-01

Pos logit contributions
 theirs+0.007
vern+0.007
 Betty+0.006
 Sam+0.006
 buddies+0.006
Neg logit contributions
 tant-0.012
ard-0.010
 Are-0.009
ly-0.009
 Those-0.009
Token and
Token ID290
Feature activation-2.97e-01

Pos logit contributions
 tant+0.028
ard+0.024
 Are+0.022
 floors+0.020
 Those+0.020
Neg logit contributions
 theirs-0.011
vern-0.011
 Sam-0.010
 Betty-0.010
 buddies-0.010
Token know
Token ID760
Feature activation+2.25e-01

Pos logit contributions
 theirs+0.006
 Sam+0.005
vern+0.005
 Betty+0.005
 moms+0.005
Neg logit contributions
 tant-0.016
ard-0.014
est-0.012
 Those-0.012
 Are-0.012
Token what
Token ID644
Feature activation-9.84e-01

Pos logit contributions
 tant+0.011
ard+0.010
 Are+0.009
ly+0.009
 floors+0.008
Neg logit contributions
 theirs-0.005
vern-0.005
 Betty-0.005
 buddies-0.004
 Sam-0.004
Token is
Token ID318
Feature activation-3.55e+00

Pos logit contributions
 theirs+0.029
vern+0.029
 Betty+0.027
 buddies+0.027
 Sam+0.027
Neg logit contributions
 tant-0.041
ard-0.035
 Are-0.031
ly-0.029
 floors-0.029
Token going
Token ID1016
Feature activation-4.31e-01

Pos logit contributions
vern+0.079
 theirs+0.079
 Betty+0.073
 buddies+0.072
 Sam+0.071
Neg logit contributions
 tant-0.173
ard-0.153
 Are-0.136
ly-0.132
 floors-0.131
Token on
Token ID319
Feature activation-1.66e+00

Pos logit contributions
 theirs+0.008
vern+0.008
 Sam+0.007
 Betty+0.007
 buddies+0.007
Neg logit contributions
 tant-0.023
ard-0.020
 Are-0.018
 floors-0.017
 Those-0.017
Token in
Token ID287
Feature activation-1.33e+00

Pos logit contributions
 theirs+0.037
vern+0.036
 Betty+0.033
 Sam+0.033
 buddies+0.033
Neg logit contributions
 tant-0.083
ard-0.073
 Are-0.065
ly-0.062
 floors-0.062
Token the
Token ID262
Feature activation-1.50e-02

Pos logit contributions
 theirs+0.024
 Sam+0.022
vern+0.021
 Betty+0.019
 mom+0.019
Neg logit contributions
 tant-0.074
ard-0.065
ly-0.059
 Are-0.059
est-0.057
Token world
Token ID995
Feature activation+2.19e-01

Pos logit contributions
 Sam+0.000
 adventure+0.000
 mom+0.000
 moms+0.000
vern+0.000
Neg logit contributions
 tant-0.001
ard-0.001
 Are-0.001
ly-0.001
 ounces-0.001
Token what
Token ID644
Feature activation-1.05e+00

Pos logit contributions
 tant+0.002
ard+0.002
 Are+0.001
ly+0.001
 floors+0.001
Neg logit contributions
 theirs-0.001
vern-0.001
 Betty-0.001
 buddies-0.001
 Sam-0.001
Token is
Token ID318
Feature activation-3.38e+00

Pos logit contributions
 theirs+0.029
vern+0.028
 Betty+0.027
 buddies+0.026
girls+0.026
Neg logit contributions
 tant-0.045
ard-0.039
 Are-0.036
ly-0.033
 floors-0.033
Token safe
Token ID3338
Feature activation-1.44e+00

Pos logit contributions
vern+0.103
 buddies+0.101
girls+0.099
 Betty+0.099
 theirs+0.099
Neg logit contributions
 tant-0.125
ard-0.108
 Are-0.098
 floors-0.095
ly-0.093
Token and
Token ID290
Feature activation-6.31e-01

Pos logit contributions
vern+0.031
 theirs+0.031
 Betty+0.028
 buddies+0.028
 Sam+0.027
Neg logit contributions
 tant-0.071
ard-0.063
 Are-0.056
ly-0.055
est-0.054
Token what
Token ID644
Feature activation-1.00e+00

Pos logit contributions
 theirs+0.014
vern+0.013
 Sam+0.012
 Betty+0.012
 buddies+0.012
Neg logit contributions
 tant-0.032
ard-0.029
 Those-0.024
 Are-0.024
est-0.024
Token is
Token ID318
Feature activation-3.53e+00

Pos logit contributions
 theirs+0.033
vern+0.033
 Betty+0.031
 buddies+0.030
girls+0.030
Neg logit contributions
 tant-0.038
ard-0.032
 Are-0.029
 Those-0.026
ly-0.026
Token not
Token ID407
Feature activation-3.97e+00

Pos logit contributions
vern+0.097
 buddies+0.093
 Betty+0.093
girls+0.092
 theirs+0.091
Neg logit contributions
 tant-0.145
ard-0.126
 Are-0.114
 floors-0.113
ly-0.108
Token.
Token ID13
Feature activation+6.31e-02

Pos logit contributions
vern+0.086
 theirs+0.079
 buddies+0.079
girls+0.078
 moms+0.078
Neg logit contributions
 tant-0.197
ard-0.171
 Are-0.154
ly-0.151
 floors-0.149
Token OK
Token ID7477
Feature activation-5.42e-01

Pos logit contributions
 tant+0.003
ard+0.002
ly+0.002
 floors+0.002
est+0.002
Neg logit contributions
 Sam-0.002
 Betty-0.002
 theirs-0.002
 Dog-0.002
 Andy-0.002
Token?"
Token ID1701
Feature activation-1.13e+00

Pos logit contributions
vern+0.017
 theirs+0.016
girls+0.015
 buddies+0.014
 accomplishment+0.014
Neg logit contributions
 tant-0.022
ard-0.019
 Are-0.019
est-0.017
ly-0.016
Token she
Token ID673
Feature activation-3.91e-01

Pos logit contributions
 theirs+0.032
vern+0.032
 buddies+0.030
girls+0.030
 Betty+0.029
Neg logit contributions
 tant-0.049
ard-0.041
 Are-0.039
 Those-0.035
ly-0.035
Token together
Token ID1978
Feature activation-1.59e-01

Pos logit contributions
 theirs+0.005
vern+0.005
 Betty+0.005
 Sam+0.005
 moms+0.005
Neg logit contributions
 tant-0.011
ard-0.010
 Are-0.009
ly-0.009
est-0.008
Token in
Token ID287
Feature activation-4.61e-01

Pos logit contributions
vern+0.005
 theirs+0.005
 Betty+0.004
 moms+0.004
 Andy+0.004
Neg logit contributions
 tant-0.007
ard-0.006
ly-0.005
 floors-0.005
est-0.005
Token their
Token ID511
Feature activation-5.41e-01

Pos logit contributions
vern+0.015
 theirs+0.013
 buddies+0.012
oto+0.012
ll+0.012
Neg logit contributions
 tant-0.018
ard-0.014
 million-0.014
 floors-0.014
 Are-0.013
Token old
Token ID1468
Feature activation+2.93e-01

Pos logit contributions
 theirs+0.017
vern+0.017
 Betty+0.016
girls+0.015
 Sam+0.015
Neg logit contributions
 tant-0.022
ard-0.018
 floors-0.016
 Are-0.015
 million-0.015
Token tree
Token ID5509
Feature activation-1.76e+00

Pos logit contributions
 tant+0.011
ard+0.009
 floors+0.008
est+0.008
 Are+0.007
Neg logit contributions
 theirs-0.010
vern-0.010
ores-0.010
 Betty-0.009
 Sam-0.009
Tokenhouse
Token ID4803
Feature activation-3.52e+00

Pos logit contributions
 theirs+0.049
vern+0.049
 Betty+0.042
 moms+0.041
 Sam+0.041
Neg logit contributions
 tant-0.076
ard-0.066
est-0.056
 floors-0.055
oc-0.054
Token.
Token ID13
Feature activation-5.55e-01

Pos logit contributions
vern+0.099
 theirs+0.094
 Betty+0.083
ethyst+0.080
girls+0.079
Neg logit contributions
 tant-0.156
ard-0.133
est-0.117
 floors-0.116
ly-0.109
Token<|endoftext|>
Token ID50256
Feature activation-2.32e-02

Pos logit contributions
vern+0.019
 theirs+0.018
 buddies+0.016
 feat+0.016
 moms+0.016
Neg logit contributions
 tant-0.020
ard-0.017
 Those-0.017
 Are-0.017
 million-0.015
TokenOnce
Token ID7454
Feature activation+3.59e-01

Pos logit contributions
 theirs+0.001
vern+0.001
 buddies+0.001
 Betty+0.001
 moms+0.001
Neg logit contributions
 tant-0.001
ard-0.001
 Are-0.001
 Those-0.001
 million-0.001
Token upon
Token ID2402
Feature activation+2.79e-01

Pos logit contributions
 tant+0.012
ard+0.007
ly+0.006
 million+0.006
 Are+0.006
Neg logit contributions
girls-0.016
boys-0.015
vern-0.014
 theirs-0.014
 carriers-0.014
Token a
Token ID257
Feature activation-4.33e-01

Pos logit contributions
 tant+0.014
ard+0.012
 Are+0.011
ly+0.010
est+0.010
Neg logit contributions
vern-0.006
 theirs-0.006
 Betty-0.006
 buddies-0.006
 moms-0.006
Token us
Token ID514
Feature activation-5.68e-01

Pos logit contributions
 theirs+0.016
vern+0.016
 Betty+0.015
 buddies+0.015
 Sam+0.015
Neg logit contributions
 tant-0.018
ard-0.015
 Are-0.013
ly-0.012
 floors-0.012
Token.
Token ID13
Feature activation+1.51e-01

Pos logit contributions
 theirs+0.011
vern+0.011
 Betty+0.010
 Sam+0.010
 buddies+0.009
Neg logit contributions
 tant-0.030
ard-0.026
 Are-0.024
ly-0.023
 floors-0.023
Token We
Token ID775
Feature activation+3.84e-02

Pos logit contributions
 tant+0.006
ard+0.005
ly+0.005
 floors+0.004
est+0.004
Neg logit contributions
 Sam-0.005
 theirs-0.005
 Betty-0.005
vern-0.004
 Andy-0.004
Token know
Token ID760
Feature activation-5.81e-01

Pos logit contributions
 tant+0.002
ard+0.001
 Are+0.001
vious+0.001
 Those+0.001
Neg logit contributions
 theirs-0.001
vern-0.001
 Betty-0.001
 buddies-0.001
 Andy-0.001
Token what
Token ID644
Feature activation-1.21e+00

Pos logit contributions
vern+0.018
 theirs+0.017
 Betty+0.017
girls+0.016
 buddies+0.016
Neg logit contributions
 tant-0.021
ard-0.019
ly-0.018
 Are-0.018
 floors-0.017
Token is
Token ID318
Feature activation-3.52e+00

Pos logit contributions
 theirs+0.034
vern+0.033
 Betty+0.032
girls+0.031
 buddies+0.031
Neg logit contributions
 tant-0.050
ard-0.043
 Are-0.041
 Those-0.037
ly-0.036
Token good
Token ID922
Feature activation-6.56e-01

Pos logit contributions
 Betty+0.125
vern+0.124
girls+0.123
 buddies+0.123
 Andy+0.121
Neg logit contributions
 tant-0.099
ard-0.085
 floors-0.082
 Are-0.079
ly-0.075
Token for
Token ID329
Feature activation-8.70e-01

Pos logit contributions
vern+0.013
 theirs+0.012
 Betty+0.012
 Andy+0.011
girls+0.011
Neg logit contributions
 tant-0.034
ard-0.030
ly-0.028
 Are-0.028
est-0.027
Token you
Token ID345
Feature activation-5.37e-01

Pos logit contributions
vern+0.033
 Betty+0.033
 theirs+0.031
 buddies+0.030
 feat+0.030
Neg logit contributions
ard-0.024
 tant-0.024
ly-0.019
 floors-0.019
 Are-0.018
Token.
Token ID13
Feature activation+2.07e-01

Pos logit contributions
 theirs+0.009
vern+0.008
 Betty+0.008
 buddies+0.007
 Sam+0.007
Neg logit contributions
 tant-0.030
ard-0.027
 Are-0.024
ly-0.023
 Those-0.023
Token We
Token ID775
Feature activation-3.03e-01

Pos logit contributions
 tant+0.008
ard+0.007
ly+0.006
 floors+0.005
 Are+0.005
Neg logit contributions
 theirs-0.007
 Sam-0.007
vern-0.007
 Betty-0.006
 moms-0.006
Token us
Token ID514
Feature activation-4.99e-01

Pos logit contributions
vern+0.021
 theirs+0.020
 Betty+0.019
 buddies+0.019
 Sam+0.018
Neg logit contributions
 tant-0.026
ard-0.022
 Are-0.019
ly-0.018
 floors-0.018
Token.
Token ID13
Feature activation+3.23e-01

Pos logit contributions
 theirs+0.009
vern+0.009
 Betty+0.009
 buddies+0.008
 Sam+0.008
Neg logit contributions
 tant-0.026
ard-0.023
 Are-0.021
ly-0.021
 floors-0.020
Token We
Token ID775
Feature activation+2.15e-02

Pos logit contributions
 tant+0.015
ard+0.012
ly+0.011
 floors+0.010
est+0.010
Neg logit contributions
 Sam-0.009
 theirs-0.009
 Betty-0.009
vern-0.008
 moms-0.008
Token know
Token ID760
Feature activation-6.56e-01

Pos logit contributions
 tant+0.001
ard+0.001
 Are+0.001
 Those+0.001
vious+0.001
Neg logit contributions
 theirs-0.001
vern-0.001
 Betty-0.001
 buddies-0.001
 Andy-0.001
Token what
Token ID644
Feature activation-1.27e+00

Pos logit contributions
vern+0.020
 Betty+0.018
 theirs+0.018
girls+0.018
 buddies+0.018
Neg logit contributions
 tant-0.024
ard-0.022
 Are-0.021
ly-0.020
 floors-0.020
Token is
Token ID318
Feature activation-3.52e+00

Pos logit contributions
 theirs+0.033
vern+0.032
 Betty+0.031
girls+0.030
 buddies+0.030
Neg logit contributions
 tant-0.054
ard-0.047
 Are-0.046
 Those-0.041
 floors-0.040
Token good
Token ID922
Feature activation-7.30e-01

Pos logit contributions
vern+0.117
 buddies+0.116
 Betty+0.116
girls+0.116
 Andy+0.115
Neg logit contributions
 tant-0.111
ard-0.094
 floors-0.091
 Are-0.087
ly-0.080
Token for
Token ID329
Feature activation-8.38e-01

Pos logit contributions
vern+0.015
 theirs+0.013
 Betty+0.013
 Andy+0.012
girls+0.012
Neg logit contributions
 tant-0.038
ard-0.034
ly-0.031
 Are-0.031
est-0.030
Token you
Token ID345
Feature activation-5.37e-01

Pos logit contributions
vern+0.030
 Betty+0.030
 theirs+0.028
 buddies+0.027
 feat+0.027
Neg logit contributions
 tant-0.025
ard-0.025
 floors-0.021
ly-0.020
 Are-0.020
Token.
Token ID13
Feature activation+3.06e-01

Pos logit contributions
 theirs+0.009
vern+0.009
 Betty+0.008
 buddies+0.008
 Sam+0.008
Neg logit contributions
 tant-0.029
ard-0.026
 Are-0.024
ly-0.023
 Those-0.023
Token Now
Token ID2735
Feature activation+6.85e-01

Pos logit contributions
 tant+0.013
ard+0.011
ly+0.009
 floors+0.009
est+0.009
Neg logit contributions
 theirs-0.010
 Sam-0.009
vern-0.009
 Betty-0.009
 moms-0.009
Token parents
Token ID3397
Feature activation+1.71e+00

Pos logit contributions
 theirs+0.017
vern+0.016
 Betty+0.016
 Andy+0.016
 Sam+0.015
Neg logit contributions
 tant-0.021
ard-0.017
 floors-0.014
ly-0.014
 Are-0.014
Token because
Token ID780
Feature activation-8.54e-01

Pos logit contributions
 tant+0.076
ard+0.066
 Are+0.057
ly+0.055
 Those+0.054
Neg logit contributions
 theirs-0.047
vern-0.046
 Betty-0.043
 Sam-0.043
 buddies-0.042
Token they
Token ID484
Feature activation-6.36e-01

Pos logit contributions
vern+0.013
 theirs+0.012
girls+0.011
 Betty+0.011
 buddies+0.010
Neg logit contributions
 tant-0.046
ard-0.042
 Are-0.039
 floors-0.038
ly-0.037
Token know
Token ID760
Feature activation-2.30e-01

Pos logit contributions
 theirs+0.018
vern+0.018
 Betty+0.017
 Sam+0.016
 buddies+0.016
Neg logit contributions
 tant-0.027
ard-0.024
 Are-0.021
ly-0.020
 Those-0.020
Token what
Token ID644
Feature activation-1.16e+00

Pos logit contributions
vern+0.005
 theirs+0.005
 Betty+0.005
 buddies+0.005
girls+0.005
Neg logit contributions
 tant-0.010
ard-0.009
 Are-0.008
 floors-0.008
ly-0.008
Token is
Token ID318
Feature activation-3.49e+00

Pos logit contributions
 theirs+0.031
vern+0.030
 Betty+0.028
girls+0.027
 buddies+0.027
Neg logit contributions
 tant-0.050
ard-0.044
 Are-0.041
 Those-0.038
ly-0.037
Token best
Token ID1266
Feature activation-1.14e-03

Pos logit contributions
girls+0.102
utter+0.102
vern+0.100
 Andy+0.100
 buddies+0.099
Neg logit contributions
 tant-0.118
ard-0.105
 Are-0.097
ly-0.093
 floors-0.093
Token for
Token ID329
Feature activation-1.23e+00

Pos logit contributions
girls+0.000
vern+0.000
 theirs+0.000
 Betty+0.000
 moms+0.000
Neg logit contributions
 tant-0.000
ard-0.000
ly-0.000
est-0.000
 Those-0.000
Token us
Token ID514
Feature activation-1.47e+00

Pos logit contributions
vern+0.042
 Betty+0.040
 theirs+0.039
 moms+0.038
 feat+0.038
Neg logit contributions
 tant-0.038
ard-0.036
ly-0.030
 Are-0.029
 floors-0.029
Token.
Token ID13
Feature activation-4.73e-01

Pos logit contributions
vern+0.029
 theirs+0.029
 Betty+0.028
 moms+0.026
 Andy+0.026
Neg logit contributions
 tant-0.072
ard-0.066
 Are-0.058
ly-0.057
 Those-0.056
Token<|endoftext|>
Token ID50256
Feature activation-3.87e-01

Pos logit contributions
 theirs+0.012
vern+0.011
 buddies+0.010
 Betty+0.010
 moms+0.010
Neg logit contributions
 tant-0.022
ard-0.019
 Are-0.018
 Those-0.017
 million-0.016
Token as
Token ID355
Feature activation-1.76e+00

Pos logit contributions
vern+0.003
 moms+0.003
girls+0.003
 theirs+0.003
 Betty+0.003
Neg logit contributions
 tant-0.005
ard-0.004
ly-0.004
 Are-0.003
est-0.003
Token the
Token ID262
Feature activation-3.57e-01

Pos logit contributions
 theirs+0.037
vern+0.036
 Betty+0.033
 buddies+0.033
 Sam+0.033
Neg logit contributions
 tant-0.089
ard-0.079
 Are-0.070
ly-0.068
 floors-0.067
Token best
Token ID1266
Feature activation+5.31e-01

Pos logit contributions
 theirs+0.011
vern+0.011
 Betty+0.010
 Sam+0.010
 buddies+0.010
Neg logit contributions
 tant-0.015
ard-0.013
 Are-0.011
ly-0.010
 floors-0.010
Token joke
Token ID9707
Feature activation-1.18e+00

Pos logit contributions
 tant+0.024
ard+0.020
 Are+0.018
ly+0.017
 Those+0.017
Neg logit contributions
 theirs-0.015
vern-0.014
 Sam-0.014
 Betty-0.013
 buddies-0.013
Token-
Token ID12
Feature activation-1.98e+00

Pos logit contributions
vern+0.028
 theirs+0.028
 Betty+0.025
 moms+0.024
 buddies+0.024
Neg logit contributions
 tant-0.056
ard-0.050
 Are-0.044
ly-0.043
est-0.042
Tokente
Token ID660
Feature activation-3.48e+00

Pos logit contributions
 theirs+0.063
vern+0.061
 Betty+0.059
 Sam+0.059
 buddies+0.058
Neg logit contributions
 tant-0.081
ard-0.067
 Are-0.059
 floors-0.056
 million-0.054
Tokenller
Token ID6051
Feature activation-2.11e+00

Pos logit contributions
vern+0.101
 theirs+0.101
 Betty+0.092
 Sam+0.091
 buddies+0.090
Neg logit contributions
 tant-0.148
ard-0.131
 Are-0.111
ly-0.111
est-0.106
Token around
Token ID1088
Feature activation-1.92e-01

Pos logit contributions
vern+0.046
 theirs+0.045
 Betty+0.041
 moms+0.040
 buddies+0.039
Neg logit contributions
 tant-0.104
ard-0.094
 Are-0.082
ly-0.081
est-0.079
Token.
Token ID13
Feature activation-6.91e-01

Pos logit contributions
 theirs+0.004
vern+0.004
 Betty+0.004
 Sam+0.004
 buddies+0.004
Neg logit contributions
 tant-0.010
ard-0.009
 Are-0.008
ly-0.007
 floors-0.007
Token<|endoftext|>
Token ID50256
Feature activation-3.82e-01

Pos logit contributions
 theirs+0.026
vern+0.026
 Betty+0.025
 Sam+0.025
 buddies+0.024
Neg logit contributions
 tant-0.024
ard-0.019
 Are-0.016
ly-0.015
 floors-0.015
TokenOnce
Token ID7454
Feature activation-2.85e-01

Pos logit contributions
 theirs+0.013
vern+0.013
 Betty+0.012
 buddies+0.012
 moms+0.012
Neg logit contributions
 tant-0.014
ard-0.012
 Are-0.011
 Those-0.010
est-0.009
Token,
Token ID11
Feature activation-5.00e-01

Pos logit contributions
 tant+0.075
ard+0.064
 Are+0.058
 floors+0.056
ly+0.055
Neg logit contributions
 theirs-0.034
vern-0.032
 Sam-0.030
 Betty-0.030
 buddies-0.030
Token because
Token ID780
Feature activation-6.41e-01

Pos logit contributions
vern+0.013
 theirs+0.013
 Betty+0.012
 buddies+0.012
girls+0.012
Neg logit contributions
 tant-0.022
ard-0.019
 Are-0.017
 floors-0.016
ly-0.016
Token they
Token ID484
Feature activation-9.26e-01

Pos logit contributions
vern+0.015
 theirs+0.014
girls+0.013
 buddies+0.013
 Betty+0.013
Neg logit contributions
 tant-0.030
ard-0.026
 Are-0.025
 floors-0.023
ly-0.023
Token know
Token ID760
Feature activation-4.12e-01

Pos logit contributions
 theirs+0.026
vern+0.026
 Betty+0.024
 Sam+0.024
 buddies+0.024
Neg logit contributions
 tant-0.040
ard-0.035
 Are-0.030
ly-0.029
 Those-0.028
Token what
Token ID644
Feature activation-1.09e+00

Pos logit contributions
vern+0.010
 theirs+0.010
 Betty+0.009
 buddies+0.009
girls+0.009
Neg logit contributions
 tant-0.018
ard-0.016
 Are-0.015
 floors-0.014
ly-0.014
Token is
Token ID318
Feature activation-3.48e+00

Pos logit contributions
 theirs+0.030
vern+0.029
 Betty+0.027
 buddies+0.027
girls+0.027
Neg logit contributions
 tant-0.046
ard-0.041
 Are-0.037
 Those-0.035
ly-0.034
Token best
Token ID1266
Feature activation-3.42e-01

Pos logit contributions
girls+0.103
 buddies+0.101
vern+0.100
 moms+0.099
utter+0.098
Neg logit contributions
 tant-0.121
ard-0.106
 Are-0.097
ly-0.094
est-0.094
Token for
Token ID329
Feature activation-1.19e+00

Pos logit contributions
girls+0.008
vern+0.008
 moms+0.008
 theirs+0.007
 Betty+0.007
Neg logit contributions
 tant-0.014
ard-0.013
ly-0.013
est-0.012
 Are-0.012
Token you
Token ID345
Feature activation-9.97e-01

Pos logit contributions
vern+0.038
 Betty+0.036
 moms+0.035
girls+0.034
 feat+0.034
Neg logit contributions
 tant-0.039
ard-0.037
ly-0.031
 situations-0.031
 floors-0.030
Token.
Token ID13
Feature activation-2.36e-01

Pos logit contributions
vern+0.024
 theirs+0.022
 moms+0.022
 buddies+0.022
 Betty+0.022
Neg logit contributions
 tant-0.043
ard-0.041
 Are-0.035
ly-0.035
 Those-0.034
Token<|endoftext|>
Token ID50256
Feature activation-1.24e-02

Pos logit contributions
 theirs+0.006
 Sam+0.006
vern+0.006
 Betty+0.006
 Andy+0.006
Neg logit contributions
 tant-0.011
ard-0.009
ly-0.008
 floors-0.008
est-0.008
Token young
Token ID1862
Feature activation+2.62e-01

Pos logit contributions
 theirs+0.037
vern+0.036
 Betty+0.034
girls+0.034
 buddies+0.033
Neg logit contributions
 tant-0.024
ard-0.019
 Are-0.016
ly-0.014
 million-0.014
Token girl
Token ID2576
Feature activation+5.23e-01

Pos logit contributions
 tant+0.008
ard+0.007
 Are+0.006
ly+0.005
est+0.005
Neg logit contributions
 theirs-0.011
vern-0.011
 Betty-0.010
 buddies-0.010
girls-0.010
Token who
Token ID508
Feature activation-5.38e-01

Pos logit contributions
 tant+0.024
ard+0.023
 Are+0.020
ly+0.019
est+0.018
Neg logit contributions
vern-0.014
 theirs-0.013
 buddies-0.011
girls-0.011
 accomplishment-0.011
Token owned
Token ID6898
Feature activation-1.04e+00

Pos logit contributions
 theirs+0.014
vern+0.013
 Betty+0.012
 Sam+0.012
 buddies+0.012
Neg logit contributions
 tant-0.025
ard-0.022
 Are-0.019
ly-0.019
 floors-0.018
Token the
Token ID262
Feature activation-5.16e-01

Pos logit contributions
vern+0.030
 theirs+0.027
girls+0.025
 buddies+0.025
 moms+0.025
Neg logit contributions
 tant-0.045
ard-0.040
 floors-0.034
 Are-0.034
ly-0.034
Token nic
Token ID9200
Feature activation+3.67e+00

Pos logit contributions
 theirs+0.016
vern+0.015
 Betty+0.015
 moms+0.015
 Sam+0.015
Neg logit contributions
 tant-0.021
ard-0.018
 Are-0.016
 Those-0.015
ly-0.015
Tokenest
Token ID395
Feature activation+2.01e+00

Pos logit contributions
 tant+0.069
ard+0.045
 Are+0.041
 Those+0.029
 million+0.029
Neg logit contributions
 theirs-0.188
vern-0.183
 buddies-0.180
 moms-0.177
 friend-0.176
Token jazz
Token ID21274
Feature activation-1.19e+00

Pos logit contributions
 tant+0.084
ard+0.072
 Are+0.065
 Those+0.064
 floors+0.059
Neg logit contributions
 theirs-0.056
 buddies-0.054
vern-0.053
 moms-0.053
 mom-0.053
Token in
Token ID287
Feature activation-1.24e-01

Pos logit contributions
 theirs+0.034
vern+0.034
 Betty+0.032
 Sam+0.031
 buddies+0.031
Neg logit contributions
 tant-0.052
ard-0.045
 Are-0.039
ly-0.037
 floors-0.037
Token the
Token ID262
Feature activation-4.24e-01

Pos logit contributions
 theirs+0.003
vern+0.003
 Betty+0.002
 buddies+0.002
 Sam+0.002
Neg logit contributions
 tant-0.006
ard-0.005
 Are-0.005
ly-0.005
 floors-0.005
Token whole
Token ID2187
Feature activation-1.78e-01

Pos logit contributions
 mom+0.011
 friend+0.010
 Sam+0.010
 their+0.010
 dad+0.009
Neg logit contributions
 tant-0.020
ard-0.016
 ounces-0.016
ly-0.016
 Those-0.014
Token.
Token ID13
Feature activation+2.08e-01

Pos logit contributions
 theirs+0.010
vern+0.010
 Betty+0.009
 buddies+0.008
 Sam+0.008
Neg logit contributions
 tant-0.041
ard-0.037
 Are-0.033
ly-0.032
 floors-0.032
Token One
Token ID1881
Feature activation-7.09e-01

Pos logit contributions
 tant+0.010
ard+0.009
ly+0.008
 Are+0.007
 floors+0.007
Neg logit contributions
 theirs-0.005
 Sam-0.005
 Betty-0.005
 Andy-0.005
vern-0.004
Token day
Token ID1110
Feature activation+5.56e-01

Pos logit contributions
 theirs+0.016
vern+0.015
 Sam+0.015
 Betty+0.014
 buddies+0.014
Neg logit contributions
 tant-0.036
ard-0.031
 Are-0.028
ly-0.027
 floors-0.027
Token,
Token ID11
Feature activation-7.25e-01

Pos logit contributions
 tant+0.032
ard+0.028
 floors+0.025
 Are+0.024
ly+0.024
Neg logit contributions
 Sam-0.008
 theirs-0.008
 Betty-0.008
 Andy-0.008
vern-0.008
Token a
Token ID257
Feature activation-5.46e-01

Pos logit contributions
 theirs+0.015
 Sam+0.014
 Betty+0.014
 Andy+0.013
 mom+0.013
Neg logit contributions
 tant-0.038
ard-0.034
ly-0.029
 Those-0.029
 Are-0.029
Token 3
Token ID513
Feature activation+3.77e+00

Pos logit contributions
 theirs+0.018
vern+0.018
 Betty+0.017
 Sam+0.017
 buddies+0.016
Neg logit contributions
 tant-0.021
ard-0.018
 Are-0.015
ly-0.015
 Those-0.014
Token year
Token ID614
Feature activation+1.15e-01

Pos logit contributions
 tant+0.159
ard+0.135
 Are+0.126
 fart+0.122
 Those+0.121
Neg logit contributions
 Betty-0.099
 Sam-0.099
 Andy-0.097
 mom-0.096
 theirs-0.096
Token old
Token ID1468
Feature activation-7.59e-01

Pos logit contributions
 tant+0.005
ard+0.004
ly+0.003
 Are+0.003
 floors+0.003
Neg logit contributions
 theirs-0.004
vern-0.004
 Betty-0.003
girls-0.003
 buddies-0.003
Token child
Token ID1200
Feature activation+6.90e-01

Pos logit contributions
vern+0.034
 theirs+0.033
girls+0.031
 buddies+0.031
 Betty+0.030
Neg logit contributions
 tant-0.022
ard-0.018
 Are-0.015
ly-0.014
est-0.013
Token visited
Token ID8672
Feature activation+3.53e-01

Pos logit contributions
 tant+0.026
ard+0.025
 Are+0.021
ly+0.020
 Those+0.019
Neg logit contributions
 theirs-0.024
vern-0.024
 buddies-0.021
 moms-0.021
 accomplishment-0.020
Token him
Token ID683
Feature activation-7.00e-01

Pos logit contributions
 tant+0.018
ard+0.016
 Are+0.014
ly+0.014
 floors+0.013
Neg logit contributions
vern-0.008
 theirs-0.007
 buddies-0.007
 moms-0.007
 Betty-0.006
Token of
Token ID286
Feature activation-2.59e-01

Pos logit contributions
 tant+0.045
ard+0.039
 Are+0.036
 Those+0.034
est+0.033
Neg logit contributions
 theirs-0.019
vern-0.018
 Sam-0.017
 buddies-0.017
 Betty-0.017
Token smoke
Token ID7523
Feature activation-4.54e-01

Pos logit contributions
 theirs+0.007
 Sam+0.007
 buddies+0.007
 Andy+0.006
 Betty+0.006
Neg logit contributions
 tant-0.011
ard-0.010
 Are-0.009
 Those-0.009
rified-0.008
Token!"
Token ID2474
Feature activation-7.97e-01

Pos logit contributions
 theirs+0.008
vern+0.007
 Sam+0.007
 Betty+0.007
 Andy+0.007
Neg logit contributions
 tant-0.025
ard-0.021
 Are-0.019
 Those-0.019
 floors-0.018
Token said
Token ID531
Feature activation-1.34e+00

Pos logit contributions
 Sam+0.014
 theirs+0.014
 Betty+0.014
 Andy+0.013
 mom+0.013
Neg logit contributions
 tant-0.043
ard-0.038
 Are-0.033
est-0.033
ly-0.033
Token the
Token ID262
Feature activation-1.30e-01

Pos logit contributions
vern+0.037
 buddies+0.029
girls+0.029
ll+0.028
 theirs+0.028
Neg logit contributions
 tant-0.065
ard-0.054
 Are-0.051
ly-0.049
est-0.048
Token 3
Token ID513
Feature activation+4.36e+00

Pos logit contributions
 theirs+0.002
vern+0.002
 moms+0.002
 Betty+0.002
 Sam+0.002
Neg logit contributions
 tant-0.007
ard-0.006
 Are-0.006
ly-0.006
 Those-0.006
Token year
Token ID614
Feature activation+5.42e-01

Pos logit contributions
 tant+0.159
ard+0.130
 Are+0.128
 Those+0.111
 fart+0.110
Neg logit contributions
 theirs-0.141
 Betty-0.140
 mom-0.140
 moms-0.139
 buddies-0.138
Token old
Token ID1468
Feature activation-4.18e-01

Pos logit contributions
 tant+0.021
ard+0.018
ly+0.015
 floors+0.014
 Are+0.013
Neg logit contributions
 theirs-0.019
vern-0.018
girls-0.017
 Betty-0.017
 buddies-0.017
Token.
Token ID13
Feature activation-1.22e+00

Pos logit contributions
vern+0.015
 theirs+0.014
girls+0.012
 possible+0.012
 buddies+0.012
Neg logit contributions
 tant-0.016
ard-0.015
ly-0.014
 Are-0.012
est-0.012
Token The
Token ID383
Feature activation-8.33e-01

Pos logit contributions
 theirs+0.035
vern+0.035
 Betty+0.032
 buddies+0.032
 Sam+0.032
Neg logit contributions
 tant-0.053
ard-0.045
 Are-0.039
ly-0.037
 Those-0.037
Token family
Token ID1641
Feature activation+2.07e+00

Pos logit contributions
 theirs+0.013
vern+0.012
 Betty+0.011
 Sam+0.011
 buddies+0.011
Neg logit contributions
 tant-0.047
ard-0.042
 Are-0.038
ly-0.037
 Those-0.037
TokenPat
Token ID12130
Feature activation-1.19e+00

Pos logit contributions
 theirs+0.018
vern+0.018
 Betty+0.017
 Sam+0.016
 buddies+0.016
Neg logit contributions
 tant-0.021
ard-0.018
 Are-0.015
ly-0.014
 Those-0.014
Tokenty
Token ID774
Feature activation-1.16e+00

Pos logit contributions
 theirs+0.038
vern+0.037
 buddies+0.035
 Betty+0.034
 moms+0.034
Neg logit contributions
 tant-0.048
ard-0.043
ly-0.036
 Are-0.034
est-0.034
Token was
Token ID373
Feature activation+2.52e-01

Pos logit contributions
 theirs+0.034
vern+0.034
 Betty+0.032
 buddies+0.031
 Sam+0.031
Neg logit contributions
 tant-0.049
ard-0.042
 Are-0.037
ly-0.035
 floors-0.034
Token a
Token ID257
Feature activation-5.55e-01

Pos logit contributions
 tant+0.012
ard+0.011
 Are+0.009
ly+0.009
 Those+0.009
Neg logit contributions
 theirs-0.006
vern-0.006
 Betty-0.005
 Sam-0.005
 buddies-0.005
Token creative
Token ID7325
Feature activation+6.25e-01

Pos logit contributions
 theirs+0.021
vern+0.020
 Betty+0.019
 Sam+0.019
 buddies+0.019
Neg logit contributions
 tant-0.019
ard-0.016
 Are-0.013
ly-0.013
 floors-0.012
Token 3
Token ID513
Feature activation+3.25e+00

Pos logit contributions
 tant+0.021
ard+0.017
 Are+0.014
ly+0.013
 floors+0.013
Neg logit contributions
 theirs-0.024
vern-0.024
 Betty-0.023
 Sam-0.023
 buddies-0.022
Token year
Token ID614
Feature activation-3.02e-01

Pos logit contributions
 tant+0.118
ard+0.095
 Are+0.092
 fart+0.084
 Those+0.084
Neg logit contributions
 buddies-0.104
 Sam-0.103
 mom-0.102
 theirs-0.102
 Betty-0.102
Token old
Token ID1468
Feature activation-1.23e+00

Pos logit contributions
 theirs+0.009
vern+0.008
 Betty+0.008
 Sam+0.008
 buddies+0.008
Neg logit contributions
 tant-0.013
ard-0.011
 Are-0.010
ly-0.009
 Those-0.009
Token girl
Token ID2576
Feature activation-8.58e-02

Pos logit contributions
 theirs+0.044
vern+0.044
 Betty+0.041
 Sam+0.041
 buddies+0.040
Neg logit contributions
 tant-0.044
ard-0.037
 Are-0.031
ly-0.030
 floors-0.029
Token who
Token ID508
Feature activation-9.54e-01

Pos logit contributions
vern+0.002
 theirs+0.002
 Betty+0.002
 moms+0.002
 buddies+0.002
Neg logit contributions
 tant-0.004
ard-0.004
 Are-0.003
ly-0.003
 Those-0.003
Token loved
Token ID6151
Feature activation-1.43e+00

Pos logit contributions
 theirs+0.023
vern+0.023
 Sam+0.021
 Betty+0.021
 buddies+0.021
Neg logit contributions
 tant-0.045
ard-0.039
 Are-0.034
ly-0.033
 floors-0.033
Token.
Token ID13
Feature activation+1.15e+00

Pos logit contributions
 tant+0.059
ard+0.052
 Are+0.046
ly+0.046
 floors+0.045
Neg logit contributions
vern-0.017
 theirs-0.017
 Betty-0.016
 buddies-0.015
 Andy-0.014
Token It
Token ID632
Feature activation-6.37e-02

Pos logit contributions
 tant+0.049
ard+0.042
 Are+0.037
ly+0.035
 Those+0.034
Neg logit contributions
 theirs-0.034
vern-0.034
 Betty-0.031
 buddies-0.031
 Sam-0.031
Token landed
Token ID11406
Feature activation+2.52e-01

Pos logit contributions
 theirs+0.002
 Sam+0.002
 moms+0.001
vern+0.001
 Betty+0.001
Neg logit contributions
 tant-0.003
ard-0.003
 Are-0.002
 Those-0.002
ly-0.002
Token on
Token ID319
Feature activation+1.18e+00

Pos logit contributions
 tant+0.011
ard+0.009
 Are+0.008
ly+0.008
 floors+0.008
Neg logit contributions
 theirs-0.007
vern-0.007
 Betty-0.007
 Sam-0.007
 Andy-0.006
Token the
Token ID262
Feature activation+1.76e+00

Pos logit contributions
 tant+0.055
ard+0.047
 Are+0.041
 floors+0.041
ly+0.040
Neg logit contributions
vern-0.032
 theirs-0.028
 buddies-0.027
girls-0.026
 Betty-0.026
Token other
Token ID584
Feature activation+3.61e+00

Pos logit contributions
 tant+0.065
ard+0.054
 Are+0.049
 Those+0.046
arre+0.043
Neg logit contributions
 theirs-0.060
vern-0.059
 mom-0.057
 moms-0.057
 Sam-0.056
Token side
Token ID1735
Feature activation+1.26e+00

Pos logit contributions
 tant+0.162
ard+0.130
 Are+0.123
arre+0.115
 million+0.113
Neg logit contributions
 theirs-0.100
 Sam-0.096
 mom-0.094
 moms-0.094
vern-0.093
Token of
Token ID286
Feature activation+9.97e-01

Pos logit contributions
 tant+0.051
ard+0.045
ly+0.036
est+0.036
 Are+0.036
Neg logit contributions
vern-0.039
 theirs-0.038
 Betty-0.036
girls-0.036
 buddies-0.035
Token the
Token ID262
Feature activation+1.05e+00

Pos logit contributions
 tant+0.049
ard+0.041
 Are+0.036
 floors+0.036
 million+0.035
Neg logit contributions
vern-0.025
 theirs-0.022
 buddies-0.022
girls-0.021
 Betty-0.020
Token field
Token ID2214
Feature activation+1.31e+00

Pos logit contributions
 tant+0.044
ard+0.037
 Are+0.033
 Those+0.031
ly+0.031
Neg logit contributions
 theirs-0.031
vern-0.030
 Sam-0.029
 moms-0.029
 Betty-0.028
Token.
Token ID13
Feature activation+9.81e-01

Pos logit contributions
 tant+0.069
ard+0.063
 Are+0.055
ly+0.054
est+0.054
Neg logit contributions
vern-0.024
 theirs-0.024
 Betty-0.022
 buddies-0.021
 moms-0.020
Token so
Token ID523
Feature activation+8.32e-01

Pos logit contributions
 tant+0.022
ard+0.019
ly+0.015
 Are+0.015
 Those+0.015
Neg logit contributions
 theirs-0.017
vern-0.016
 Sam-0.016
 Betty-0.016
 buddies-0.015
Token happy
Token ID3772
Feature activation+2.19e-01

Pos logit contributions
 tant+0.042
ard+0.037
 Are+0.032
ly+0.032
 Those+0.031
Neg logit contributions
 theirs-0.019
vern-0.018
 moms-0.016
 Sam-0.016
 buddies-0.016
Token and
Token ID290
Feature activation-2.04e-01

Pos logit contributions
 tant+0.010
ard+0.009
ly+0.008
 Are+0.008
 floors+0.007
Neg logit contributions
vern-0.006
 theirs-0.006
girls-0.005
 moms-0.005
 buddies-0.005
Token she
Token ID673
Feature activation-7.67e-01

Pos logit contributions
 theirs+0.006
vern+0.006
 buddies+0.005
 Betty+0.005
girls+0.005
Neg logit contributions
 tant-0.009
ard-0.008
 Are-0.007
ly-0.007
 floors-0.006
Token ate
Token ID15063
Feature activation-3.12e-02

Pos logit contributions
 theirs+0.025
vern+0.024
 Betty+0.022
 buddies+0.022
 Sam+0.022
Neg logit contributions
 tant-0.031
ard-0.027
 Are-0.023
ly-0.022
 floors-0.022
Token all
Token ID477
Feature activation+1.03e+00

Pos logit contributions
 theirs+0.001
vern+0.001
 Sam+0.001
 Betty+0.000
 buddies+0.000
Neg logit contributions
 tant-0.002
ard-0.002
 Are-0.001
 Those-0.001
ly-0.001
Token of
Token ID286
Feature activation-9.24e-02

Pos logit contributions
 tant+0.054
ard+0.048
 Are+0.043
ly+0.042
 floors+0.041
Neg logit contributions
 theirs-0.020
vern-0.019
 Betty-0.017
 Sam-0.017
 buddies-0.017
Token it
Token ID340
Feature activation-3.45e-01

Pos logit contributions
vern+0.002
 theirs+0.002
 buddies+0.002
 Betty+0.002
girls+0.002
Neg logit contributions
 tant-0.004
ard-0.004
 Are-0.003
 floors-0.003
ly-0.003
Token.
Token ID13
Feature activation-1.24e-01

Pos logit contributions
 theirs+0.006
vern+0.006
 Betty+0.006
 Sam+0.005
 buddies+0.005
Neg logit contributions
 tant-0.019
ard-0.016
 Are-0.015
ly-0.014
 floors-0.014
Token After
Token ID2293
Feature activation+1.69e-01

Pos logit contributions
 theirs+0.003
 Sam+0.003
 Betty+0.003
vern+0.003
 moms+0.003
Neg logit contributions
 tant-0.006
ard-0.005
 Are-0.005
ly-0.005
 floors-0.005
Token dinner
Token ID8073
Feature activation+1.77e+00

Pos logit contributions
 tant+0.007
ard+0.006
 Are+0.006
 Those+0.005
 million+0.005
Neg logit contributions
vern-0.005
 theirs-0.005
girls-0.004
 buddies-0.004
 Betty-0.004
Token in
Token ID287
Feature activation+9.29e-02

Pos logit contributions
 tant+0.022
ard+0.020
 Are+0.020
 Those+0.019
Uh+0.018
Neg logit contributions
 theirs-0.011
 Sam-0.010
vern-0.009
 Betty-0.009
 buddies-0.009
Token his
Token ID465
Feature activation-5.29e-01

Pos logit contributions
 tant+0.005
ard+0.005
ly+0.004
 Are+0.004
rified+0.004
Neg logit contributions
 theirs-0.002
 Sam-0.001
vern-0.001
 mom-0.001
 Betty-0.001
Token rocking
Token ID38059
Feature activation+3.47e-01

Pos logit contributions
 mom+0.014
 dad+0.013
 moms+0.013
 friend+0.013
 Sam+0.012
Neg logit contributions
 tant-0.025
ard-0.022
rified-0.020
ly-0.020
 Are-0.020
Token chair
Token ID5118
Feature activation+7.88e-02

Pos logit contributions
 tant+0.014
ard+0.012
 Are+0.011
 Those+0.011
 million+0.010
Neg logit contributions
 theirs-0.010
 Sam-0.010
 mom-0.010
 moms-0.009
 buddies-0.009
Token,
Token ID11
Feature activation-9.22e-02

Pos logit contributions
 tant+0.004
ard+0.004
 Are+0.004
 Those+0.004
ly+0.003
Neg logit contributions
 theirs-0.001
vern-0.001
 Sam-0.001
 buddies-0.001
 Betty-0.001
Token but
Token ID475
Feature activation+1.85e+00

Pos logit contributions
 Sam+0.002
 theirs+0.002
 not+0.002
 they+0.002
 Andy+0.002
Neg logit contributions
 tant-0.004
ard-0.004
ly-0.004
 Those-0.004
est-0.004
Token his
Token ID465
Feature activation+7.27e-02

Pos logit contributions
 tant+0.087
ard+0.077
 Are+0.065
ly+0.065
 Those+0.063
Neg logit contributions
 theirs-0.048
 Sam-0.045
vern-0.043
 Betty-0.043
 Andy-0.043
Token chair
Token ID5118
Feature activation+9.10e-01

Pos logit contributions
 tant+0.003
ard+0.003
 Are+0.002
ly+0.002
 Those+0.002
Neg logit contributions
 theirs-0.002
vern-0.002
 Sam-0.002
 buddies-0.002
 Betty-0.002
Token was
Token ID373
Feature activation+6.49e-01

Pos logit contributions
 tant+0.040
ard+0.034
 Are+0.030
 Those+0.028
ly+0.028
Neg logit contributions
 theirs-0.025
vern-0.025
 Sam-0.024
 Betty-0.023
 buddies-0.023
Token too
Token ID1165
Feature activation-1.43e+00

Pos logit contributions
 tant+0.029
ard+0.025
 Are+0.022
ly+0.021
 floors+0.021
Neg logit contributions
 theirs-0.018
vern-0.017
 Betty-0.016
 Sam-0.016
 buddies-0.016
Token heavy
Token ID4334
Feature activation-1.91e-01

Pos logit contributions
vern+0.037
 theirs+0.037
 Sam+0.034
 Betty+0.034
 buddies+0.033
Neg logit contributions
 tant-0.065
ard-0.057
 Are-0.051
 Those-0.049
ly-0.048
Token the
Token ID262
Feature activation+6.66e-01

Pos logit contributions
 Sam+0.003
 theirs+0.003
 moms+0.003
 mom+0.002
 they+0.002
Neg logit contributions
 tant-0.016
ard-0.015
est-0.013
arre-0.013
ly-0.013
Token police
Token ID1644
Feature activation+4.23e-01

Pos logit contributions
 tant+0.025
ard+0.020
 Are+0.018
 floors+0.017
ly+0.017
Neg logit contributions
 theirs-0.023
vern-0.023
 Betty-0.022
 Sam-0.021
 Andy-0.021
Token officer
Token ID3818
Feature activation+4.75e-01

Pos logit contributions
 tant+0.018
ard+0.015
 Are+0.013
ly+0.013
 floors+0.012
Neg logit contributions
 theirs-0.013
vern-0.013
 Betty-0.012
 buddies-0.012
 Sam-0.012
Token says
Token ID1139
Feature activation-7.47e-01

Pos logit contributions
 tant+0.022
ard+0.019
 Are+0.016
ly+0.016
 floors+0.015
Neg logit contributions
 theirs-0.013
vern-0.012
 Sam-0.012
 Betty-0.012
 buddies-0.011
Token.
Token ID13
Feature activation+3.64e-01

Pos logit contributions
 Betty+0.019
vern+0.018
 theirs+0.017
 Laser+0.017
 buddies+0.017
Neg logit contributions
 tant-0.034
ard-0.030
ly-0.030
 Are-0.026
 Those-0.025
Token "
Token ID366
Feature activation+9.11e-01

Pos logit contributions
 tant+0.019
ard+0.017
 floors+0.015
est+0.015
ly+0.015
Neg logit contributions
 Sam-0.007
 theirs-0.007
 moms-0.006
 mom-0.006
 Betty-0.006
TokenWhat
Token ID2061
Feature activation+3.18e-02

Pos logit contributions
 tant+0.040
ard+0.034
est+0.031
ly+0.030
 floors+0.029
Neg logit contributions
 theirs-0.025
 themselves-0.024
 Sam-0.023
 moms-0.023
 they-0.022
Token are
Token ID389
Feature activation-1.81e+00

Pos logit contributions
 tant+0.001
ard+0.001
 Are+0.001
 Those+0.001
est+0.001
Neg logit contributions
 theirs-0.001
vern-0.001
 Betty-0.001
 buddies-0.001
girls-0.001
Token you
Token ID345
Feature activation-2.78e-01

Pos logit contributions
vern+0.065
 Betty+0.063
 theirs+0.063
 feat+0.061
girls+0.061
Neg logit contributions
 tant-0.055
ard-0.050
 Are-0.043
 floors-0.041
ly-0.041
Token doing
Token ID1804
Feature activation+8.89e-01

Pos logit contributions
 Sam+0.006
 theirs+0.006
vern+0.006
 Betty+0.006
 buddies+0.006
Neg logit contributions
 tant-0.014
ard-0.012
ly-0.011
est-0.010
 Are-0.010
Token with
Token ID351
Feature activation-6.85e-01

Pos logit contributions
 tant+0.040
ard+0.035
 Are+0.032
ly+0.030
 floors+0.030
Neg logit contributions
vern-0.023
 theirs-0.023
 Betty-0.021
 buddies-0.021
girls-0.020
Token.
Token ID13
Feature activation+1.53e-02

Pos logit contributions
 theirs+0.001
vern+0.001
 Betty+0.001
 Andy+0.001
 Sam+0.001
Neg logit contributions
 tant-0.004
ard-0.004
 Are-0.003
ly-0.003
 floors-0.003
Token<|endoftext|>
Token ID50256
Feature activation-4.15e-02

Pos logit contributions
 tant+0.001
ard+0.001
 Are+0.001
 Those+0.001
 million+0.000
Neg logit contributions
vern-0.000
 theirs-0.000
 buddies-0.000
 Betty-0.000
girls-0.000
TokenL
Token ID43
Feature activation+2.34e-01

Pos logit contributions
 theirs+0.001
vern+0.001
 Betty+0.001
 Sam+0.001
 buddies+0.001
Neg logit contributions
 tant-0.002
ard-0.001
 Are-0.001
ly-0.001
 Those-0.001
Tokenily
Token ID813
Feature activation-9.00e-01

Pos logit contributions
 tant+0.011
ard+0.010
ly+0.009
 floors+0.008
est+0.008
Neg logit contributions
vern-0.006
 Betty-0.005
girls-0.005
 theirs-0.005
 Andy-0.005
Token and
Token ID290
Feature activation+4.82e-01

Pos logit contributions
 theirs+0.027
vern+0.026
 Betty+0.025
 buddies+0.025
 Sam+0.024
Neg logit contributions
 tant-0.038
ard-0.032
 Are-0.028
ly-0.027
 floors-0.026
Token Ben
Token ID3932
Feature activation+9.29e-01

Pos logit contributions
 tant+0.022
 Are+0.020
ard+0.019
ly+0.017
 Those+0.017
Neg logit contributions
vern-0.012
 buddies-0.011
girls-0.010
 theirs-0.010
 accomplishment-0.009
Token were
Token ID547
Feature activation+1.12e+00

Pos logit contributions
 tant+0.039
ard+0.034
 Are+0.029
ly+0.028
est+0.027
Neg logit contributions
 theirs-0.028
vern-0.028
 buddies-0.026
 Betty-0.026
 Sam-0.025
Token playing
Token ID2712
Feature activation+3.14e-01

Pos logit contributions
 tant+0.048
ard+0.041
 Are+0.036
 floors+0.035
arre+0.035
Neg logit contributions
vern-0.031
 theirs-0.031
 Betty-0.030
girls-0.028
 Andy-0.028
Token in
Token ID287
Feature activation+4.29e-01

Pos logit contributions
 tant+0.013
ard+0.011
 Are+0.010
ly+0.010
 floors+0.010
Neg logit contributions
 theirs-0.009
vern-0.009
 Betty-0.009
 moms-0.008
girls-0.008
Token the
Token ID262
Feature activation+4.20e-01

Pos logit contributions
 tant+0.020
ard+0.017
 Are+0.016
 floors+0.015
 Those+0.015
Neg logit contributions
vern-0.012
 theirs-0.010
 buddies-0.010
girls-0.010
 resolved-0.010
Token park
Token ID3952
Feature activation+4.07e-01

Pos logit contributions
 tant+0.020
ard+0.017
ly+0.015
 Are+0.015
 Those+0.014
Neg logit contributions
 Sam-0.011
 theirs-0.011
 Betty-0.010
 moms-0.010
vern-0.010
Token sounds
Token ID5238
Feature activation+8.10e-01

Pos logit contributions
 buddies+0.006
 theirs+0.006
 Sam+0.005
 Betty+0.005
 moms+0.005
Neg logit contributions
 tant-0.007
ard-0.006
 Are-0.005
rified-0.005
 Those-0.005
Token.
Token ID13
Feature activation+4.50e-01

Pos logit contributions
 tant+0.038
ard+0.034
ly+0.031
est+0.030
 Are+0.030
Neg logit contributions
vern-0.019
 theirs-0.017
 Betty-0.017
girls-0.016
 Andy-0.016
Token
Token ID198
Feature activation+9.22e-01

Pos logit contributions
 tant+0.024
ard+0.021
 Are+0.019
ly+0.019
 floors+0.018
Neg logit contributions
 theirs-0.008
 Sam-0.008
vern-0.008
 Betty-0.008
 moms-0.007
Token
Token ID198
Feature activation+4.45e-01

Pos logit contributions
 tant+0.040
ard+0.035
 Are+0.031
 Those+0.030
 million+0.029
Neg logit contributions
 theirs-0.025
vern-0.025
 Betty-0.024
 buddies-0.023
girls-0.022
Token"
Token ID1
Feature activation-1.59e-01

Pos logit contributions
 tant+0.021
ard+0.018
 Are+0.016
ly+0.016
est+0.016
Neg logit contributions
 theirs-0.011
vern-0.010
 moms-0.010
 Sam-0.010
 buddies-0.010
TokenLook
Token ID8567
Feature activation+1.64e+00

Pos logit contributions
 moms+0.005
 theirs+0.005
 Sam+0.005
 Betty+0.005
 mom+0.005
Neg logit contributions
 tant-0.007
ard-0.005
ly-0.005
 situations-0.004
 floors-0.004
Token,
Token ID11
Feature activation+5.05e-01

Pos logit contributions
 tant+0.067
ard+0.057
ly+0.052
 Are+0.049
est+0.047
Neg logit contributions
vern-0.053
 theirs-0.049
girls-0.046
 Betty-0.045
 buddies-0.044
Token Ben
Token ID3932
Feature activation+2.02e-01

Pos logit contributions
 tant+0.028
ard+0.025
 Are+0.022
ly+0.022
 floors+0.022
Neg logit contributions
 theirs-0.009
vern-0.008
 Sam-0.008
 Betty-0.007
 moms-0.007
Token,
Token ID11
Feature activation+1.19e+00

Pos logit contributions
ard+0.006
est+0.005
 Are+0.005
 tant+0.005
ly+0.005
Neg logit contributions
vern-0.008
 theirs-0.008
 buddies-0.007
 accomplishment-0.007
girls-0.007
Token that
Token ID326
Feature activation+8.43e-01

Pos logit contributions
 tant+0.062
ard+0.055
ly+0.047
 Are+0.047
 floors+0.046
Neg logit contributions
 theirs-0.025
 Sam-0.024
 Betty-0.022
vern-0.022
 moms-0.022
Token bird
Token ID6512
Feature activation-5.99e-01

Pos logit contributions
 tant+0.036
ard+0.029
ly+0.025
 Are+0.024
 floors+0.024
Neg logit contributions
 theirs-0.026
 Sam-0.025
 Betty-0.025
 moms-0.024
vern-0.024
Token of
Token ID286
Feature activation-4.40e-01

Pos logit contributions
 theirs+0.010
vern+0.010
girls+0.010
 moms+0.010
 buddies+0.009
Neg logit contributions
 tant-0.020
ard-0.018
ly-0.016
 Are-0.015
 floors-0.015
Token a
Token ID257
Feature activation-9.12e-02

Pos logit contributions
 theirs+0.010
vern+0.010
 moms+0.010
 buddies+0.010
girls+0.009
Neg logit contributions
 tant-0.020
ard-0.018
 Are-0.016
 floors-0.015
est-0.015
Token plan
Token ID1410
Feature activation+1.01e+00

Pos logit contributions
 theirs+0.002
vern+0.002
 Betty+0.002
 Sam+0.002
 buddies+0.002
Neg logit contributions
 tant-0.004
ard-0.004
 Are-0.003
ly-0.003
 floors-0.003
Token and
Token ID290
Feature activation-1.38e-02

Pos logit contributions
 tant+0.040
ard+0.034
ly+0.033
est+0.030
 Are+0.028
Neg logit contributions
 moms-0.030
girls-0.029
vern-0.028
 theirs-0.028
 buddies-0.027
Token offered
Token ID4438
Feature activation+8.27e-01

Pos logit contributions
vern+0.000
 theirs+0.000
girls+0.000
 buddies+0.000
 moms+0.000
Neg logit contributions
 tant-0.001
ard-0.000
 Are-0.000
ly-0.000
 floors-0.000
Token the
Token ID262
Feature activation-5.24e-01

Pos logit contributions
 tant+0.037
ard+0.031
ly+0.030
 floors+0.028
 Are+0.027
Neg logit contributions
vern-0.022
girls-0.020
 theirs-0.019
 buddies-0.019
 moms-0.018
Token cater
Token ID20825
Feature activation+3.72e-02

Pos logit contributions
 theirs+0.017
vern+0.016
 Betty+0.015
 Sam+0.015
 buddies+0.015
Neg logit contributions
 tant-0.021
ard-0.018
 Are-0.016
ly-0.015
 floors-0.015
Tokenpillar
Token ID41643
Feature activation+4.62e-01

Pos logit contributions
 tant+0.001
ard+0.001
 Are+0.000
 floors+0.000
ly+0.000
Neg logit contributions
 theirs-0.002
vern-0.002
 Betty-0.002
 Sam-0.002
 buddies-0.002
Token a
Token ID257
Feature activation-1.65e-01

Pos logit contributions
 tant+0.021
ard+0.018
ly+0.016
est+0.016
 Are+0.015
Neg logit contributions
vern-0.011
 theirs-0.011
girls-0.011
 moms-0.011
 Betty-0.010
Token leaf
Token ID12835
Feature activation-1.38e-01

Pos logit contributions
 theirs+0.005
vern+0.004
girls+0.004
 Sam+0.004
 buddies+0.004
Neg logit contributions
 tant-0.007
ard-0.006
 Are-0.005
ly-0.005
 million-0.005
Token.
Token ID13
Feature activation-5.50e-01

Pos logit contributions
 moms+0.003
vern+0.003
 theirs+0.003
girls+0.003
 Bill+0.003
Neg logit contributions
 tant-0.006
ard-0.005
ly-0.005
est-0.005
shirt-0.004
Token to
Token ID284
Feature activation+4.82e-01

Pos logit contributions
 theirs+0.013
vern+0.012
 Betty+0.011
 Sam+0.011
 buddies+0.011
Neg logit contributions
 tant-0.044
ard-0.039
 Are-0.035
ly-0.034
 floors-0.034
Token act
Token ID719
Feature activation+7.85e-01

Pos logit contributions
 tant+0.023
ard+0.020
ly+0.017
 Are+0.017
 Those+0.017
Neg logit contributions
 theirs-0.012
 Sam-0.012
vern-0.012
 Betty-0.011
 Andy-0.011
Token and
Token ID290
Feature activation-2.69e-01

Pos logit contributions
 tant+0.038
ard+0.033
 Are+0.030
 Those+0.029
 floors+0.028
Neg logit contributions
 theirs-0.018
vern-0.018
 Sam-0.017
 buddies-0.016
 Betty-0.016
Token perform
Token ID1620
Feature activation+6.25e-01

Pos logit contributions
 theirs+0.007
 Sam+0.006
vern+0.006
 Betty+0.006
 buddies+0.006
Neg logit contributions
 tant-0.013
ard-0.011
 Are-0.010
ly-0.010
 Those-0.009
Token,
Token ID11
Feature activation-3.30e-01

Pos logit contributions
 tant+0.031
ard+0.027
 Are+0.024
ly+0.024
 floors+0.023
Neg logit contributions
 theirs-0.014
vern-0.013
 Betty-0.012
 Sam-0.012
 buddies-0.012
Token tw
Token ID665
Feature activation+2.84e-01

Pos logit contributions
 theirs+0.006
 Sam+0.006
 Betty+0.006
 Andy+0.006
 mom+0.005
Neg logit contributions
 tant-0.018
ard-0.016
 Are-0.014
ly-0.014
 Those-0.013
Tokenirling
Token ID24297
Feature activation-2.29e-01

Pos logit contributions
 tant+0.014
ard+0.012
 Are+0.010
ly+0.010
 floors+0.010
Neg logit contributions
 theirs-0.007
vern-0.007
 Betty-0.006
 Sam-0.006
 buddies-0.006
Token and
Token ID290
Feature activation-6.15e-01

Pos logit contributions
 theirs+0.006
 Sam+0.006
vern+0.006
 Betty+0.005
 buddies+0.005
Neg logit contributions
 tant-0.010
ard-0.009
 Are-0.008
 Those-0.008
 million-0.008
Token dancing
Token ID15360
Feature activation+2.74e-01

Pos logit contributions
 theirs+0.014
 Sam+0.014
 Betty+0.013
vern+0.013
 buddies+0.013
Neg logit contributions
 tant-0.031
ard-0.026
ly-0.023
 Are-0.023
 Those-0.022
Token around
Token ID1088
Feature activation+3.25e-01

Pos logit contributions
 tant+0.013
ard+0.011
ly+0.010
 Are+0.010
est+0.009
Neg logit contributions
 theirs-0.007
vern-0.007
 Betty-0.006
 moms-0.006
 Sam-0.006
Token the
Token ID262
Feature activation+6.83e-01

Pos logit contributions
 tant+0.017
ard+0.015
 Are+0.014
ly+0.013
 floors+0.013
Neg logit contributions
vern-0.006
 theirs-0.006
 Betty-0.005
 moms-0.005
 buddies-0.005
Token,
Token ID11
Feature activation-5.92e-01

Pos logit contributions
 theirs+0.014
vern+0.014
 buddies+0.013
girls+0.013
 Betty+0.012
Neg logit contributions
 tant-0.028
ard-0.025
 Are-0.022
ly-0.021
 Those-0.021
Token Tim
Token ID5045
Feature activation-9.45e-01

Pos logit contributions
vern+0.009
 theirs+0.008
 buddies+0.007
girls+0.007
 Betty+0.007
Neg logit contributions
 tant-0.034
ard-0.030
 Are-0.028
 Those-0.027
est-0.027
Tokenmy
Token ID1820
Feature activation-1.31e-02

Pos logit contributions
 theirs+0.029
vern+0.028
 Sam+0.028
 Betty+0.028
 buddies+0.027
Neg logit contributions
 tant-0.039
ard-0.032
 Are-0.028
 floors-0.027
 Those-0.027
Token knew
Token ID2993
Feature activation+5.37e-01

Pos logit contributions
 theirs+0.000
vern+0.000
 Betty+0.000
 Sam+0.000
 buddies+0.000
Neg logit contributions
 tant-0.001
ard-0.001
 Are-0.000
ly-0.000
 Those-0.000
Token it
Token ID340
Feature activation+1.07e-01

Pos logit contributions
 tant+0.022
ard+0.019
 Are+0.018
ly+0.017
 floors+0.016
Neg logit contributions
girls-0.015
vern-0.014
 theirs-0.014
 moms-0.013
 buddies-0.012
Token was
Token ID373
Feature activation+4.13e-01

Pos logit contributions
 tant+0.005
ard+0.004
 Are+0.003
 floors+0.003
ly+0.003
Neg logit contributions
 theirs-0.003
vern-0.003
 Sam-0.003
 Betty-0.003
 buddies-0.003
Token important
Token ID1593
Feature activation+1.49e+00

Pos logit contributions
 tant+0.012
est+0.010
 Are+0.009
ard+0.009
vious+0.008
Neg logit contributions
girls-0.017
 moms-0.015
uddy-0.015
sed-0.015
iki-0.014
Token to
Token ID284
Feature activation+4.58e-01

Pos logit contributions
 tant+0.073
ard+0.064
ly+0.058
 Are+0.058
 floors+0.056
Neg logit contributions
 theirs-0.032
vern-0.031
girls-0.029
 Betty-0.029
 moms-0.028
Token take
Token ID1011
Feature activation+4.97e-01

Pos logit contributions
 tant+0.023
ard+0.020
 Are+0.018
ly+0.017
 floors+0.017
Neg logit contributions
 theirs-0.010
vern-0.010
 Betty-0.009
 buddies-0.009
 Sam-0.009
Token care
Token ID1337
Feature activation-1.18e-01

Pos logit contributions
 tant+0.025
ard+0.023
 Are+0.020
Uh+0.020
 Those+0.019
Neg logit contributions
 Sam-0.011
 together-0.010
 theirs-0.010
 teamwork-0.010
 buddies-0.010
Token of
Token ID286
Feature activation+1.23e-01

Pos logit contributions
vern+0.002
 theirs+0.002
 buddies+0.002
 together+0.002
 Sam+0.002
Neg logit contributions
 tant-0.006
ard-0.005
 Are-0.005
 floors-0.005
 Those-0.005
Tokenmy
Token ID1820
Feature activation-4.51e-01

Pos logit contributions
vern+0.023
 theirs+0.023
 Betty+0.022
 moms+0.021
 buddies+0.021
Neg logit contributions
 tant-0.021
ard-0.018
 Are-0.014
ly-0.014
est-0.014
Token and
Token ID290
Feature activation-5.30e-01

Pos logit contributions
vern+0.014
 theirs+0.014
 Betty+0.013
 moms+0.013
 buddies+0.012
Neg logit contributions
 tant-0.018
ard-0.016
ly-0.014
 Are-0.013
est-0.013
Token daddy
Token ID35822
Feature activation+2.00e+00

Pos logit contributions
vern+0.016
 theirs+0.014
girls+0.014
 Zip+0.014
 Betty+0.014
Neg logit contributions
 tant-0.021
ard-0.018
 Are-0.017
 floors-0.016
ly-0.015
Token at
Token ID379
Feature activation-1.60e+00

Pos logit contributions
 tant+0.090
ard+0.082
ly+0.070
 Are+0.070
est+0.068
Neg logit contributions
vern-0.050
 theirs-0.050
 Betty-0.049
 moms-0.046
 Sam-0.045
Token the
Token ID262
Feature activation-2.82e-01

Pos logit contributions
vern+0.044
 theirs+0.039
girls+0.038
 Betty+0.038
 buddies+0.037
Neg logit contributions
 tant-0.072
ard-0.057
 Are-0.053
 floors-0.051
 situations-0.050
Token park
Token ID3952
Feature activation-5.29e-01

Pos logit contributions
 theirs+0.008
vern+0.008
 Sam+0.007
 Betty+0.007
 moms+0.007
Neg logit contributions
 tant-0.013
ard-0.011
 Are-0.009
ly-0.009
 Those-0.009
Token.
Token ID13
Feature activation-6.72e-01

Pos logit contributions
vern+0.010
 theirs+0.010
 Betty+0.009
 moms+0.008
 buddies+0.008
Neg logit contributions
 tant-0.027
ard-0.025
 Are-0.022
ly-0.021
est-0.021
Token<|endoftext|>
Token ID50256
Feature activation-2.20e-01

Pos logit contributions
vern+0.021
 theirs+0.020
 buddies+0.019
girls+0.019
 feat+0.018
Neg logit contributions
 tant-0.027
ard-0.023
 Are-0.022
 Those-0.021
 million-0.020
TokenOnce
Token ID7454
Feature activation+6.32e-02

Pos logit contributions
 theirs+0.008
vern+0.008
 buddies+0.007
 teamwork+0.007
 Betty+0.007
Neg logit contributions
 tant-0.008
ard-0.007
 Are-0.006
 Those-0.006
 million-0.006
Token upon
Token ID2402
Feature activation+1.25e-01

Pos logit contributions
 tant+0.002
ard+0.001
 Are+0.001
ly+0.001
 million+0.001
Neg logit contributions
girls-0.003
boys-0.002
vern-0.002
 theirs-0.002
 carriers-0.002
Token a
Token ID257
Feature activation-6.92e-01

Pos logit contributions
 tant+0.006
ard+0.006
 Are+0.005
ly+0.005
 floors+0.005
Neg logit contributions
 theirs-0.003
vern-0.002
 Betty-0.002
 buddies-0.002
 Sam-0.002
Token dream
Token ID4320
Feature activation-7.73e-02

Pos logit contributions
 tant+0.036
ard+0.032
 Are+0.029
 Those+0.028
 million+0.028
Neg logit contributions
 theirs-0.019
 Sam-0.018
vern-0.018
 buddies-0.018
 mom-0.018
Token.
Token ID13
Feature activation-4.77e-01

Pos logit contributions
 theirs+0.001
vern+0.001
 Sam+0.001
 Betty+0.001
 buddies+0.001
Neg logit contributions
 tant-0.004
ard-0.004
 Are-0.003
 Those-0.003
 floors-0.003
Token She
Token ID1375
Feature activation-1.26e+00

Pos logit contributions
vern+0.017
 theirs+0.017
 buddies+0.016
 Betty+0.016
girls+0.016
Neg logit contributions
 tant-0.017
ard-0.014
 Are-0.012
 Those-0.011
ly-0.011
Token was
Token ID373
Feature activation-1.43e-01

Pos logit contributions
 theirs+0.044
vern+0.042
 Betty+0.039
girls+0.038
 buddies+0.038
Neg logit contributions
 tant-0.046
ard-0.041
 Are-0.035
ly-0.034
 million-0.033
Token happy
Token ID3772
Feature activation-5.34e-02

Pos logit contributions
vern+0.005
 theirs+0.005
girls+0.005
 Betty+0.005
 moms+0.005
Neg logit contributions
 tant-0.005
ard-0.004
 Are-0.004
 million-0.003
 floors-0.003
Token to
Token ID284
Feature activation-6.22e-01

Pos logit contributions
vern+0.002
 theirs+0.002
girls+0.001
 moms+0.001
 Betty+0.001
Neg logit contributions
 tant-0.002
ard-0.002
 Are-0.002
ly-0.002
est-0.002
Token hear
Token ID3285
Feature activation-1.07e+00

Pos logit contributions
 theirs+0.017
vern+0.016
 Betty+0.015
 buddies+0.015
 Sam+0.015
Neg logit contributions
 tant-0.028
ard-0.024
 Are-0.021
ly-0.020
 Those-0.020
Token about
Token ID546
Feature activation-1.01e+00

Pos logit contributions
 theirs+0.019
vern+0.017
 Sam+0.017
 Betty+0.016
 buddies+0.015
Neg logit contributions
 tant-0.059
ard-0.053
 Are-0.047
 Those-0.045
 floors-0.045
Token it
Token ID340
Feature activation-5.00e-01

Pos logit contributions
 theirs+0.023
vern+0.022
 Betty+0.020
 Sam+0.020
 buddies+0.020
Neg logit contributions
 tant-0.050
ard-0.044
 Are-0.039
ly-0.038
 Those-0.037
Token.
Token ID13
Feature activation-1.27e+00

Pos logit contributions
 theirs+0.009
vern+0.009
 Sam+0.008
 Betty+0.008
 buddies+0.008
Neg logit contributions
 tant-0.027
ard-0.024
 Are-0.021
 floors-0.020
 Those-0.020
Token Tim
Token ID5045
Feature activation-4.49e-01

Pos logit contributions
vern+0.038
 theirs+0.037
 buddies+0.034
girls+0.032
 moms+0.032
Neg logit contributions
 tant-0.052
ard-0.047
 Are-0.043
 Those-0.041
 million-0.038
Token make
Token ID787
Feature activation+1.33e-01

Pos logit contributions
 theirs+0.004
vern+0.003
 buddies+0.003
 Betty+0.003
girls+0.003
Neg logit contributions
 tant-0.005
ard-0.004
 Are-0.004
ly-0.004
est-0.004
Token with
Token ID351
Feature activation-1.13e+00

Pos logit contributions
 tant+0.007
ard+0.006
 Are+0.005
ly+0.005
 Those+0.005
Neg logit contributions
 theirs-0.003
vern-0.003
 Betty-0.003
 Sam-0.003
 buddies-0.003
Token his
Token ID465
Feature activation+5.35e-02

Pos logit contributions
 theirs+0.029
vern+0.028
 Betty+0.027
 Sam+0.026
 buddies+0.026
Neg logit contributions
 tant-0.053
ard-0.045
 Are-0.040
ly-0.039
 floors-0.038
Token magical
Token ID10883
Feature activation+1.16e-01

Pos logit contributions
 tant+0.002
ard+0.002
 Are+0.002
 floors+0.002
ly+0.002
Neg logit contributions
 theirs-0.001
vern-0.001
 Betty-0.001
 Sam-0.001
girls-0.001
Token pocket
Token ID10000
Feature activation-8.92e-01

Pos logit contributions
 tant+0.005
ard+0.004
 Are+0.004
ly+0.004
 floors+0.003
Neg logit contributions
 theirs-0.003
vern-0.003
 Betty-0.003
 Sam-0.003
 buddies-0.003
Token toy
Token ID13373
Feature activation-2.16e+00

Pos logit contributions
 theirs+0.013
vern+0.013
 Sam+0.011
 buddies+0.011
 Betty+0.011
Neg logit contributions
 tant-0.051
ard-0.045
 Are-0.041
 Those-0.040
ly-0.039
Token.
Token ID13
Feature activation-6.54e-01

Pos logit contributions
vern+0.045
 theirs+0.044
 Betty+0.041
 moms+0.038
 Sam+0.038
Neg logit contributions
 tant-0.109
ard-0.099
ly-0.088
est-0.084
 floors-0.083
Token<|endoftext|>
Token ID50256
Feature activation-1.66e-01

Pos logit contributions
 theirs+0.017
vern+0.017
 buddies+0.015
 Betty+0.015
 moms+0.015
Neg logit contributions
 tant-0.030
ard-0.026
 Are-0.023
 Those-0.022
ly-0.021
TokenOnce
Token ID7454
Feature activation+1.16e-01

Pos logit contributions
 theirs+0.006
vern+0.006
 buddies+0.005
 moms+0.005
 Betty+0.005
Neg logit contributions
 tant-0.006
ard-0.005
 Are-0.005
 Those-0.004
 million-0.004
Token upon
Token ID2402
Feature activation+5.11e-01

Pos logit contributions
 tant+0.004
ard+0.003
 Are+0.002
ly+0.002
 million+0.002
Neg logit contributions
girls-0.005
vern-0.005
boys-0.004
 theirs-0.004
 carriers-0.004
Token a
Token ID257
Feature activation-5.45e-01

Pos logit contributions
 tant+0.026
ard+0.023
 Are+0.021
ly+0.020
 floors+0.020
Neg logit contributions
 theirs-0.010
vern-0.010
 Betty-0.009
 buddies-0.009
 moms-0.009
Token then
Token ID788
Feature activation+5.59e-01

Pos logit contributions
 Sam+0.002
 theirs+0.002
 moms+0.002
 Betty+0.002
 mom+0.002
Neg logit contributions
 tant-0.011
ard-0.010
est-0.009
 Are-0.009
 million-0.009
Token said
Token ID531
Feature activation-5.01e-01

Pos logit contributions
 tant+0.026
ard+0.023
 Are+0.020
ly+0.019
 floors+0.019
Neg logit contributions
 theirs-0.014
vern-0.013
 Sam-0.013
 Betty-0.013
 buddies-0.012
Token,
Token ID11
Feature activation-7.05e-03

Pos logit contributions
 theirs+0.011
vern+0.011
 Betty+0.010
 buddies+0.010
girls+0.010
Neg logit contributions
 tant-0.025
ard-0.021
 Are-0.019
ly-0.019
 Those-0.018
Token "
Token ID366
Feature activation+9.74e-01

Pos logit contributions
vern+0.000
 theirs+0.000
 buddies+0.000
 moms+0.000
 Betty+0.000
Neg logit contributions
 tant-0.000
ard-0.000
 Are-0.000
 Those-0.000
ly-0.000
TokenReady
Token ID35474
Feature activation+5.15e-01

Pos logit contributions
 tant+0.041
ard+0.032
ly+0.027
 floors+0.027
 Are+0.026
Neg logit contributions
 Sam-0.030
 moms-0.030
 Betty-0.029
 theirs-0.028
girls-0.028
Token or
Token ID393
Feature activation-2.10e+00

Pos logit contributions
 tant+0.025
ard+0.022
 Are+0.019
ly+0.019
 Those+0.019
Neg logit contributions
 theirs-0.012
vern-0.012
 Betty-0.011
 buddies-0.011
 Sam-0.010
Token not
Token ID407
Feature activation-2.54e+00

Pos logit contributions
 theirs+0.056
vern+0.055
 Betty+0.051
 Sam+0.051
 buddies+0.050
Neg logit contributions
 tant-0.095
ard-0.082
 Are-0.072
ly-0.070
 floors-0.069
Token,
Token ID11
Feature activation+8.79e-01

Pos logit contributions
vern+0.068
 theirs+0.067
 buddies+0.060
girls+0.059
 moms+0.057
Neg logit contributions
 tant-0.107
ard-0.103
 Are-0.092
ly-0.087
est-0.087
Token here
Token ID994
Feature activation-8.21e-02

Pos logit contributions
 tant+0.044
ard+0.039
 Are+0.036
ly+0.034
 floors+0.034
Neg logit contributions
vern-0.019
 theirs-0.017
 buddies-0.016
 Betty-0.015
girls-0.015
Token I
Token ID314
Feature activation+5.87e-01

Pos logit contributions
vern+0.003
 theirs+0.003
girls+0.002
 buddies+0.002
 Betty+0.002
Neg logit contributions
 tant-0.003
ard-0.003
 Are-0.002
ly-0.002
 Those-0.002
Token come
Token ID1282
Feature activation-2.60e-01

Pos logit contributions
 tant+0.026
ard+0.023
 Are+0.021
est+0.019
 Those+0.019
Neg logit contributions
 theirs-0.016
vern-0.015
 Betty-0.014
 buddies-0.014
girls-0.013
Token her
Token ID607
Feature activation-6.80e-01

Pos logit contributions
 tant+0.016
 Are+0.012
ard+0.012
ly+0.012
est+0.011
Neg logit contributions
vern-0.015
girls-0.015
 theirs-0.014
 buddies-0.014
 moms-0.013
Token mom
Token ID1995
Feature activation-4.67e-02

Pos logit contributions
 theirs+0.019
vern+0.019
girls+0.018
oor+0.018
oof+0.018
Neg logit contributions
 tant-0.025
 little-0.020
's-0.020
 Are-0.019
ard-0.019
Token,
Token ID11
Feature activation-7.97e-01

Pos logit contributions
 theirs+0.001
vern+0.001
 Betty+0.001
 Sam+0.001
 buddies+0.001
Neg logit contributions
 tant-0.002
ard-0.002
 Are-0.002
ly-0.002
 floors-0.002
Token "
Token ID366
Feature activation+5.64e-01

Pos logit contributions
vern+0.028
girls+0.028
 theirs+0.027
 moms+0.027
 buddies+0.027
Neg logit contributions
 tant-0.026
 Are-0.024
ard-0.023
 Those-0.019
ly-0.018
TokenWhat
Token ID2061
Feature activation+1.92e-01

Pos logit contributions
 tant+0.025
ard+0.020
ly+0.018
est+0.017
 situations+0.017
Neg logit contributions
 theirs-0.015
 Sam-0.015
 Betty-0.015
 mom-0.015
 moms-0.014
Token is
Token ID318
Feature activation-2.67e+00

Pos logit contributions
 tant+0.006
ard+0.005
 Are+0.004
 Those+0.004
ly+0.004
Neg logit contributions
 theirs-0.008
vern-0.008
 Betty-0.007
 buddies-0.007
girls-0.007
Token this
Token ID428
Feature activation-6.87e-01

Pos logit contributions
vern+0.070
 buddies+0.067
 theirs+0.066
girls+0.066
 moms+0.064
Neg logit contributions
 tant-0.117
ard-0.106
 Are-0.096
 floors-0.092
ly-0.090
Token tool
Token ID2891
Feature activation-8.74e-01

Pos logit contributions
 theirs+0.017
vern+0.017
 Sam+0.016
 Betty+0.016
 Andy+0.015
Neg logit contributions
 tant-0.033
ard-0.028
 Are-0.025
ly-0.024
 Those-0.024
Token?"
Token ID1701
Feature activation-7.86e-01

Pos logit contributions
 Sam+0.013
 theirs+0.013
 Betty+0.012
 mom+0.012
vern+0.012
Neg logit contributions
 tant-0.050
ard-0.042
 Are-0.038
 floors-0.037
 Those-0.037
Token
Token ID220
Feature activation-1.73e-01

Pos logit contributions
 theirs+0.027
vern+0.027
 buddies+0.025
 Betty+0.024
girls+0.024
Neg logit contributions
 tant-0.030
ard-0.025
 Are-0.022
ly-0.020
 Those-0.020
Token
Token ID198
Feature activation-3.15e-01

Pos logit contributions
 theirs+0.006
girls+0.005
vern+0.005
 buddies+0.005
 teamwork+0.005
Neg logit contributions
 tant-0.006
ard-0.005
 Are-0.005
 Those-0.005
arre-0.004
Token the
Token ID262
Feature activation+1.50e-01

Pos logit contributions
 tant+0.017
ard+0.015
 Are+0.014
 floors+0.013
 Those+0.013
Neg logit contributions
vern-0.010
 theirs-0.009
 moms-0.008
 buddies-0.008
 Betty-0.008
Token woods
Token ID16479
Feature activation-1.13e-01

Pos logit contributions
 tant+0.007
ard+0.006
 Are+0.005
 ounces+0.005
ly+0.005
Neg logit contributions
 Sam-0.004
 Betty-0.004
 theirs-0.004
 Dog-0.004
 moms-0.004
Token when
Token ID618
Feature activation+1.64e-02

Pos logit contributions
vern+0.003
 theirs+0.003
 moms+0.003
 Betty+0.003
girls+0.003
Neg logit contributions
 tant-0.005
ard-0.005
 Are-0.004
ly-0.004
 floors-0.004
Token Molly
Token ID30236
Feature activation+4.54e-01

Pos logit contributions
 tant+0.001
 Are+0.001
ard+0.001
 million+0.001
 Those+0.001
Neg logit contributions
vern-0.000
 theirs-0.000
girls-0.000
 buddies-0.000
 feat-0.000
Token suddenly
Token ID6451
Feature activation-3.73e-01

Pos logit contributions
 tant+0.020
ard+0.016
 Are+0.015
 Those+0.014
est+0.014
Neg logit contributions
 Sam-0.013
 theirs-0.012
 Betty-0.012
vern-0.012
 moms-0.012
Token stopped
Token ID5025
Feature activation-2.31e+00

Pos logit contributions
 theirs+0.012
vern+0.012
 Betty+0.011
 Sam+0.011
 buddies+0.011
Neg logit contributions
 tant-0.015
ard-0.012
 Are-0.011
 Those-0.010
ly-0.010
Token.
Token ID13
Feature activation-2.19e-01

Pos logit contributions
vern+0.061
 theirs+0.057
 moms+0.055
 resolved+0.052
 buddies+0.052
Neg logit contributions
 tant-0.101
ard-0.090
 Are-0.082
 situations-0.077
ly-0.075
Token Molly
Token ID30236
Feature activation+5.45e-01

Pos logit contributions
 Sam+0.006
 Betty+0.006
 theirs+0.005
 Andy+0.005
 Fred+0.005
Neg logit contributions
 tant-0.010
ard-0.009
est-0.008
ly-0.008
 floors-0.007
Token asked
Token ID1965
Feature activation+4.57e-01

Pos logit contributions
 tant+0.024
ard+0.020
 Are+0.017
 Those+0.017
ly+0.017
Neg logit contributions
 theirs-0.015
vern-0.015
 Sam-0.015
 Betty-0.015
 buddies-0.014
Token Max
Token ID5436
Feature activation-5.94e-01

Pos logit contributions
 tant+0.022
ard+0.019
 Are+0.017
ly+0.017
 Those+0.016
Neg logit contributions
vern-0.011
 theirs-0.011
 buddies-0.009
 Betty-0.009
 moms-0.009
Token,
Token ID11
Feature activation-4.93e-01

Pos logit contributions
vern+0.015
 theirs+0.014
 moms+0.013
 buddies+0.013
girls+0.013
Neg logit contributions
 tant-0.027
ard-0.023
 Are-0.021
 Those-0.021
ly-0.020
Token Tim
Token ID5045
Feature activation-4.00e-01

Pos logit contributions
 theirs+0.000
 Sam+0.000
 Betty+0.000
vern+0.000
 moms+0.000
Neg logit contributions
 tant-0.000
ard-0.000
 Are-0.000
ly-0.000
 Those-0.000
Token wished
Token ID16555
Feature activation-2.43e-01

Pos logit contributions
vern+0.014
 theirs+0.013
 accomplishment+0.013
 moms+0.013
 Betty+0.013
Neg logit contributions
 tant-0.014
ard-0.014
ly-0.011
est-0.011
 Are-0.011
Token he
Token ID339
Feature activation+1.08e+00

Pos logit contributions
vern+0.007
 theirs+0.007
girls+0.006
 Betty+0.006
 buddies+0.006
Neg logit contributions
 tant-0.010
ard-0.009
 Are-0.008
ly-0.007
 floors-0.007
Token had
Token ID550
Feature activation-5.12e-01

Pos logit contributions
 tant+0.049
ard+0.043
 Are+0.038
ly+0.036
 floors+0.036
Neg logit contributions
 theirs-0.029
vern-0.028
 Betty-0.026
 buddies-0.026
 Sam-0.026
Token one
Token ID530
Feature activation-1.99e-01

Pos logit contributions
vern+0.012
 Betty+0.012
 theirs+0.011
 Andy+0.011
girls+0.011
Neg logit contributions
 tant-0.023
ard-0.020
 Are-0.018
ly-0.018
 floors-0.018
Token too
Token ID1165
Feature activation-2.04e+00

Pos logit contributions
vern+0.005
 Betty+0.004
 theirs+0.004
girls+0.004
 Jas+0.004
Neg logit contributions
 tant-0.010
ard-0.008
ly-0.007
est-0.007
 Are-0.007
Token.
Token ID13
Feature activation+2.37e-01

Pos logit contributions
girls+0.051
 Betty+0.050
 moms+0.050
 theirs+0.048
boys+0.047
Neg logit contributions
 tant-0.089
ard-0.078
ly-0.068
 ounces-0.066
 million-0.064
Token So
Token ID1406
Feature activation+9.26e-01

Pos logit contributions
 tant+0.009
ard+0.008
 Are+0.007
 Those+0.007
est+0.006
Neg logit contributions
vern-0.008
 theirs-0.007
 buddies-0.007
 accomplishment-0.007
 resolved-0.007
Token Bob
Token ID5811
Feature activation-6.33e-01

Pos logit contributions
 tant+0.045
ard+0.039
 Are+0.035
ly+0.034
est+0.034
Neg logit contributions
vern-0.022
 theirs-0.021
 buddies-0.019
girls-0.019
 Betty-0.019
Token decided
Token ID3066
Feature activation+1.56e+00

Pos logit contributions
vern+0.024
 theirs+0.022
 accomplishment+0.020
 buddies+0.020
 Betty+0.019
Neg logit contributions
 tant-0.021
ard-0.021
ly-0.016
 Are-0.016
est-0.015
Token to
Token ID284
Feature activation+2.41e-02

Pos logit contributions
 tant+0.072
ard+0.064
ly+0.059
 Are+0.059
 floors+0.055
Neg logit contributions
vern-0.037
 theirs-0.035
girls-0.035
 buddies-0.034
 Betty-0.031