TOP ACTIVATIONS
MAX = 3.889

shoot it . Jackson got her toy gun
loved to dance . Every day , he danced on the
, you can give me a massage . That would be
a little girl named Jackson . One day , Jackson went
she saw a large man . He yelled out to her
my and daddy . One day , when Pen elope was
ily , can you answer a question for me ?" Lily
world . One day , Ashley was at the
stop ! Finally , she reached the top of
she decided to shoot it . Jackson got
around the park . One day , as she was dancing
if he could give her a massage . Jane said "
named Pen elope . One day , while picking fresh flowers
mom if she could get a new haircut . Her mom
way home . So Pen elope and her mom
. The next day , Ashley brought the flag
my , we don 't have any popcorn left . We
a little car named Gray . Gray was very fast and
Tom nods . He takes a corn from the plant .
The end . <|endoftext|> One day , a

MIN ACTIVATIONS
MIN = -4.163

, it went on the w ett est of adventures .
M andy and Sally . M andy was a very kind
there were two friends , an elephant and a dinosaur .
there were two friends , an ostr ich and a goat
. Sally said , " M mm , this cake is
became best friends . They played together in the park every
. After the boy finished his swing , he shared the
Bob was so happy with his cool golf shot . He
One day , an especially brave and curious squirrel
One day , an unhappy thing happened . The
for his help . They played golf together and had a
. Sally said , " M mm , this cake is
wanted to make his mum an honest offer .
in the mud . What an adventure ! <|endoftext|> Once upon
selfish . One day , M andy got an invitation to
. It was an important day for the man
a very special friend , an old turtle . She liked
One day , there was an earthquake and it made them
come and play . What an incredible key ! <|endoftext|> Once
said Sally . M um smiled and said ,

INTERVAL 1.876 - 3.889
CONTAINS 0.610%

it to mom my and daddy !" They both agreed ,
to her bed . She turned on her lamp and opened
. Sam and Mia do not understand . They decide to
sun . But Ben felt a little dizz y , so
When they got to the bar ber , Tim my saw

INTERVAL -0.137 - 1.876
CONTAINS 56.953%

book to each other . They were both happy that they
I am proud of you . â € The
She said , " Always mind your surroundings and be prepared
's have some ." " No , we can
Tim would often stand at his window and make

INTERVAL -2.150 - -0.137
CONTAINS 42.172%

to get out of the tree . Twe ety
my dear ." <|endoftext|> Once upon a time , there was
asked , " Did you enjoy your large meal ?"
to buy the lamp . The ancient friend was thrilled and
was very sad . He wished he could make his player

INTERVAL -4.163 - -2.150
CONTAINS 0.265%

was a bouncing ball . His name was Bill and he
. She recognized it as an eagle !
find her . L ily was all alone and
They touch the bubbles and g iggle . They see the
" What a pretty yellow bou quet ," she said proudly
Token shoot
Token ID2686
Feature activation+2.50e+00

Pos logit contributions
ifying+0.004
 Is+0.003
 Ashley+0.003
 awoke+0.003
 bit+0.003
Neg logit contributions
bled-0.005
uddled-0.005
iggle-0.005
 olds-0.005
ails-0.005
Token it
Token ID340
Feature activation+3.56e-01

Pos logit contributions
ifying+0.114
 total+0.103
 distance+0.100
use+0.095
bie+0.094
Neg logit contributions
iggle-0.079
bled-0.078
 olds-0.077
 golf-0.075
ires-0.073
Token.
Token ID13
Feature activation+3.57e+00

Pos logit contributions
ifying+0.012
 Ashley+0.011
 Is+0.011
 awoke+0.011
 Debbie+0.011
Neg logit contributions
bled-0.016
uddled-0.016
 olds-0.015
iggle-0.015
 golf-0.015
Token
Token ID220
Feature activation+1.69e-02

Pos logit contributions
ifying+0.196
 bit+0.168
 tant+0.163
 trace+0.163
 single+0.162
Neg logit contributions
ires-0.090
yard-0.089
azes-0.081
ate-0.081
 olds-0.078
Token
Token ID198
Feature activation+2.63e+00

Pos logit contributions
ifying+0.000
 Is+0.000
 bit+0.000
 Ashley+0.000
 awoke+0.000
Neg logit contributions
bled-0.001
uddled-0.001
 olds-0.001
iggle-0.001
ails-0.001
Token
Token ID198
Feature activation+3.89e+00

Pos logit contributions
ifying+0.142
 bit+0.125
 trace+0.111
 awoke+0.109
 single+0.109
Neg logit contributions
iggle-0.077
ails-0.075
bled-0.073
ires-0.073
uddled-0.072
TokenJackson
Token ID31270
Feature activation-9.80e-01

Pos logit contributions
ifying+0.167
bie+0.151
 single+0.143
inea+0.139
burn+0.137
Neg logit contributions
ate-0.133
ires-0.132
uring-0.117
lf-0.116
bles-0.115
Token got
Token ID1392
Feature activation+1.09e+00

Pos logit contributions
bled+0.053
uddled+0.052
 olds+0.051
ires+0.050
iggle+0.049
Neg logit contributions
 bit-0.026
 awoke-0.026
 Is-0.023
 care-0.023
 heard-0.023
Token her
Token ID607
Feature activation-1.82e-01

Pos logit contributions
ifying+0.045
 awoke+0.041
 Ashley+0.039
 Debbie+0.039
Jonathan+0.038
Neg logit contributions
bled-0.045
 golf-0.042
uddled-0.041
 olds-0.040
add-0.040
Token toy
Token ID13373
Feature activation+3.58e-01

Pos logit contributions
bled+0.008
uddled+0.008
 olds+0.007
iggle+0.007
ails+0.007
Neg logit contributions
ifying-0.007
 Is-0.006
 bit-0.006
 Ashley-0.006
 awoke-0.006
Token gun
Token ID2485
Feature activation-8.65e-02

Pos logit contributions
ifying+0.015
 Is+0.014
 Ashley+0.013
 awoke+0.013
 Debbie+0.013
Neg logit contributions
bled-0.014
uddled-0.014
 olds-0.013
iggle-0.013
uddle-0.013
Token loved
Token ID6151
Feature activation+1.31e+00

Pos logit contributions
ifying+0.045
 Is+0.041
 Ashley+0.040
 distance+0.040
 single+0.039
Neg logit contributions
uddled-0.028
bled-0.027
ails-0.025
iggle-0.025
ans-0.025
Token to
Token ID284
Feature activation-7.29e-01

Pos logit contributions
ifying+0.041
 awoke+0.035
 Is+0.033
Jonathan+0.030
 distance+0.029
Neg logit contributions
 golf-0.067
uddled-0.066
ans-0.065
ails-0.065
bled-0.065
Token dance
Token ID9280
Feature activation-2.55e-02

Pos logit contributions
bled+0.041
uddled+0.041
 olds+0.038
uddle+0.037
ails+0.037
Neg logit contributions
 care-0.018
 bit-0.018
ifying-0.018
 Is-0.017
 Ashley-0.017
Token.
Token ID13
Feature activation+2.52e+00

Pos logit contributions
bled+0.001
uddled+0.001
 olds+0.001
iggle+0.001
ails+0.001
Neg logit contributions
ifying-0.001
 Is-0.001
 Ashley-0.001
 bit-0.001
 awoke-0.001
Token Every
Token ID3887
Feature activation+9.26e-01

Pos logit contributions
ifying+0.111
 awoke+0.096
 distance+0.095
 bit+0.095
 trace+0.094
Neg logit contributions
mm-0.089
ails-0.085
ires-0.082
agons-0.079
yard-0.079
Token day
Token ID1110
Feature activation+3.79e+00

Pos logit contributions
ifying+0.038
 Is+0.034
Am+0.032
 Ashley+0.031
 Debbie+0.031
Neg logit contributions
uddled-0.038
iggle-0.037
bled-0.037
uddle-0.035
ails-0.035
Token,
Token ID11
Feature activation+2.84e+00

Pos logit contributions
ifying+0.236
 trace+0.232
Laura+0.224
Lou+0.222
den+0.222
Neg logit contributions
mm-0.062
bled-0.054
uddled-0.051
ails-0.044
 golf-0.031
Token he
Token ID339
Feature activation+5.00e-01

Pos logit contributions
ifying+0.113
 awoke+0.103
ff+0.099
talk+0.099
 distance+0.099
Neg logit contributions
mm-0.113
bled-0.111
ires-0.101
 golf-0.100
ble-0.099
Token danced
Token ID39480
Feature activation+9.21e-01

Pos logit contributions
ifying+0.014
 Is+0.012
 Ashley+0.012
 Debbie+0.011
 total+0.011
Neg logit contributions
uddled-0.026
bled-0.026
iggle-0.025
irled-0.025
 olds-0.024
Token on
Token ID319
Feature activation+1.27e+00

Pos logit contributions
ifying+0.042
 Is+0.038
 Ashley+0.037
Am+0.036
 Debbie+0.036
Neg logit contributions
bled-0.035
uddled-0.034
iggle-0.033
 golf-0.031
bles-0.031
Token the
Token ID262
Feature activation-1.01e+00

Pos logit contributions
ifying+0.048
 Is+0.041
 awoke+0.041
 bit+0.039
 single+0.038
Neg logit contributions
 golf-0.056
 olds-0.053
ans-0.053
iggle-0.053
uddled-0.052
Token,
Token ID11
Feature activation+3.49e-01

Pos logit contributions
bled+0.003
uddled+0.003
 olds+0.003
iggle+0.003
ails+0.003
Neg logit contributions
ifying-0.002
 Is-0.002
 Ashley-0.002
 bit-0.002
 awoke-0.002
Token you
Token ID345
Feature activation+5.10e-01

Pos logit contributions
ifying+0.013
 awoke+0.012
 bit+0.012
 Is+0.012
 single+0.012
Neg logit contributions
bled-0.015
uddled-0.015
 olds-0.014
iggle-0.014
 golf-0.014
Token can
Token ID460
Feature activation+3.29e-01

Pos logit contributions
ifying+0.013
 Ashley+0.012
 Debbie+0.011
 distance+0.011
 Is+0.011
Neg logit contributions
bled-0.028
uddled-0.027
iggle-0.027
bles-0.026
 olds-0.026
Token give
Token ID1577
Feature activation+1.09e+00

Pos logit contributions
ifying+0.011
 Is+0.010
 awoke+0.010
 Ashley+0.010
 bit+0.010
Neg logit contributions
bled-0.015
uddled-0.015
iggle-0.015
 olds-0.014
uddle-0.014
Token me
Token ID502
Feature activation+2.00e+00

Pos logit contributions
ifying+0.045
 awoke+0.040
 Is+0.039
 distance+0.039
 total+0.038
Neg logit contributions
bled-0.044
uddled-0.042
 olds-0.042
iggle-0.040
ans-0.040
Token a
Token ID257
Feature activation+3.73e+00

Pos logit contributions
 Is+0.077
ifying+0.077
 awoke+0.075
 distance+0.073
Lou+0.072
Neg logit contributions
bled-0.078
uddled-0.075
 golf-0.073
 olds-0.070
ans-0.070
Token massage
Token ID26900
Feature activation+2.25e+00

Pos logit contributions
ifying+0.145
ges+0.137
pter+0.134
Jonathan+0.130
Lou+0.129
Neg logit contributions
uddle-0.148
uddled-0.131
 new-0.126
 golf-0.124
iggle-0.121
Token.
Token ID13
Feature activation+3.52e-01

Pos logit contributions
ifying+0.092
Lou+0.091
Jonathan+0.088
 total+0.086
 awoke+0.085
Neg logit contributions
uddled-0.081
bled-0.078
uddle-0.077
 olds-0.075
 golf-0.074
Token That
Token ID1320
Feature activation+1.87e+00

Pos logit contributions
ifying+0.012
 awoke+0.011
 bit+0.011
 Is+0.011
 Ashley+0.011
Neg logit contributions
bled-0.016
uddled-0.015
 olds-0.015
ails-0.014
iggle-0.014
Token would
Token ID561
Feature activation+1.31e-02

Pos logit contributions
ifying+0.079
ges+0.063
 total+0.062
Am+0.061
 Is+0.060
Neg logit contributions
uddled-0.077
uddle-0.074
iggle-0.072
bled-0.068
 olds-0.068
Token be
Token ID307
Feature activation-1.37e-02

Pos logit contributions
ifying+0.000
 Is+0.000
 Ashley+0.000
 awoke+0.000
 bit+0.000
Neg logit contributions
bled-0.001
uddled-0.001
 olds-0.001
iggle-0.001
ails-0.001
Token a
Token ID257
Feature activation-8.70e-01

Pos logit contributions
uddled+0.070
bled+0.069
 olds+0.068
uddle+0.067
castle+0.067
Neg logit contributions
ifying-0.023
 Ashley-0.022
 Will-0.022
 awoke-0.020
 single-0.020
Token little
Token ID1310
Feature activation-2.56e-01

Pos logit contributions
bled+0.041
uddled+0.040
 olds+0.037
ails+0.037
iggle+0.037
Neg logit contributions
 bit-0.030
ifying-0.030
 Is-0.029
 Ashley-0.028
 single-0.028
Token girl
Token ID2576
Feature activation+2.06e-01

Pos logit contributions
bled+0.013
uddled+0.013
 olds+0.012
iggle+0.012
ails+0.012
Neg logit contributions
ifying-0.007
 bit-0.007
 Is-0.007
 Ashley-0.007
 awoke-0.006
Token named
Token ID3706
Feature activation-3.88e-01

Pos logit contributions
ifying+0.010
 Is+0.010
 awoke+0.010
 Ashley+0.010
 Debbie+0.009
Neg logit contributions
bled-0.006
uddled-0.006
iggle-0.006
 olds-0.006
ails-0.006
Token Jackson
Token ID6612
Feature activation+1.87e+00

Pos logit contributions
bled+0.017
uddled+0.016
castle+0.015
agons+0.015
 olds+0.015
Neg logit contributions
ifying-0.015
 Is-0.014
 Ashley-0.014
 bit-0.014
 awoke-0.014
Token.
Token ID13
Feature activation+3.72e+00

Pos logit contributions
ifying+0.097
 Is+0.089
Oh+0.089
 distance+0.089
 Ashley+0.087
Neg logit contributions
uddled-0.051
iggle-0.051
ails-0.049
bles-0.049
ans-0.049
Token One
Token ID1881
Feature activation+1.92e-01

Pos logit contributions
ifying+0.198
 bit+0.179
 bumped+0.168
 fright+0.168
 distance+0.166
Neg logit contributions
ires-0.104
mm-0.096
azes-0.087
ate-0.086
iggle-0.084
Token day
Token ID1110
Feature activation+3.41e+00

Pos logit contributions
ifying+0.007
 Is+0.006
 Ashley+0.006
 awoke+0.006
 bit+0.006
Neg logit contributions
bled-0.009
uddled-0.009
iggle-0.008
 olds-0.008
ails-0.008
Token,
Token ID11
Feature activation+3.49e+00

Pos logit contributions
 distance+0.232
ifying+0.228
Oh+0.216
 trace+0.209
talk+0.209
Neg logit contributions
bled-0.043
kins-0.041
uddled-0.041
iggle-0.036
mm-0.030
Token Jackson
Token ID6612
Feature activation-1.04e-01

Pos logit contributions
ifying+0.201
ez+0.171
se+0.168
 distance+0.168
use+0.168
Neg logit contributions
kins-0.082
bled-0.076
mm-0.071
ires-0.071
 burying-0.061
Token went
Token ID1816
Feature activation-1.47e-01

Pos logit contributions
bled+0.005
uddled+0.005
 olds+0.005
iggle+0.005
ails+0.005
Neg logit contributions
ifying-0.003
 Is-0.003
 bit-0.003
 awoke-0.003
 Ashley-0.003
Token she
Token ID673
Feature activation+3.91e-01

Pos logit contributions
ifying+0.040
 bit+0.036
 Is+0.035
 awoke+0.035
 care+0.034
Neg logit contributions
bled-0.042
uddled-0.042
 olds-0.038
iggle-0.038
 golf-0.037
Token saw
Token ID2497
Feature activation-5.55e-01

Pos logit contributions
ifying+0.010
 Ashley+0.009
 Is+0.009
 Debbie+0.008
Jonathan+0.008
Neg logit contributions
uddled-0.021
bled-0.021
iggle-0.020
ails-0.020
uddle-0.020
Token a
Token ID257
Feature activation-2.74e-01

Pos logit contributions
bled+0.030
uddled+0.030
 olds+0.029
iggle+0.029
ails+0.028
Neg logit contributions
ifying-0.014
 Is-0.013
 Ashley-0.013
 awoke-0.013
 Debbie-0.013
Token large
Token ID1588
Feature activation-8.58e-01

Pos logit contributions
bled+0.012
uddled+0.012
 olds+0.011
iggle+0.011
ails+0.011
Neg logit contributions
ifying-0.010
 Is-0.009
 bit-0.009
 Ashley-0.009
 awoke-0.009
Token man
Token ID582
Feature activation-3.28e-01

Pos logit contributions
bled+0.041
uddled+0.040
iggle+0.037
irled+0.036
ails+0.036
Neg logit contributions
ifying-0.029
 bit-0.028
 Is-0.028
 single-0.027
 awoke-0.027
Token.
Token ID13
Feature activation+3.68e+00

Pos logit contributions
bled+0.014
uddled+0.014
 olds+0.013
iggle+0.013
ails+0.013
Neg logit contributions
ifying-0.012
 Is-0.011
 bit-0.011
 awoke-0.011
 Ashley-0.011
Token He
Token ID679
Feature activation-5.33e-02

Pos logit contributions
ifying+0.205
inea+0.184
w+0.183
 trace+0.179
Oh+0.178
Neg logit contributions
 olds-0.088
yard-0.068
 old-0.068
uddled-0.059
lee-0.056
Token yelled
Token ID22760
Feature activation+4.27e-01

Pos logit contributions
bled+0.003
uddled+0.003
 olds+0.002
iggle+0.002
ails+0.002
Neg logit contributions
ifying-0.002
 Is-0.002
 Ashley-0.002
 bit-0.002
 awoke-0.002
Token out
Token ID503
Feature activation+1.12e+00

Pos logit contributions
ifying+0.015
 awoke+0.013
 bit+0.013
 Ashley+0.013
 Is+0.013
Neg logit contributions
bled-0.020
uddled-0.020
 olds-0.019
iggle-0.019
 golf-0.018
Token to
Token ID284
Feature activation+2.13e+00

Pos logit contributions
ifying+0.034
 distance+0.030
 awoke+0.030
 Debbie+0.029
 Ashley+0.029
Neg logit contributions
bled-0.056
uddled-0.055
iggle-0.053
 olds-0.053
 golf-0.053
Token her
Token ID607
Feature activation+3.02e-01

Pos logit contributions
ifying+0.112
inea+0.087
 single+0.086
Oh+0.078
 awoke+0.078
Neg logit contributions
iggle-0.070
 olds-0.065
 golf-0.058
bled-0.058
lee-0.057
Tokenmy
Token ID1820
Feature activation+1.15e+00

Pos logit contributions
ifying+0.047
 Is+0.044
Jonathan+0.043
 Debbie+0.042
Lou+0.042
Neg logit contributions
ails-0.048
uddled-0.048
bled-0.048
 golf-0.047
ini-0.046
Token and
Token ID290
Feature activation+5.62e-01

Pos logit contributions
ifying+0.050
 Is+0.048
 Debbie+0.046
Lou+0.046
Jonathan+0.046
Neg logit contributions
uddled-0.041
bled-0.040
ails-0.039
 golf-0.039
 olds-0.038
Token daddy
Token ID35822
Feature activation+1.57e+00

Pos logit contributions
ifying+0.023
 single+0.021
 Is+0.020
 awoke+0.020
 Ashley+0.020
Neg logit contributions
bled-0.023
uddled-0.023
iggle-0.021
 olds-0.021
castle-0.021
Token.
Token ID13
Feature activation+2.14e+00

Pos logit contributions
ifying+0.074
Lou+0.071
Jonathan+0.071
 Debbie+0.070
Georgia+0.070
Neg logit contributions
bled-0.049
uddled-0.048
 golf-0.047
ails-0.047
iggle-0.045
Token One
Token ID1881
Feature activation+1.27e-01

Pos logit contributions
ifying+0.100
 bit+0.091
 distance+0.087
 trace+0.087
Oh+0.083
Neg logit contributions
ires-0.066
 olds-0.065
uddled-0.065
iggle-0.064
mm-0.063
Token day
Token ID1110
Feature activation+3.66e+00

Pos logit contributions
ifying+0.004
 Is+0.004
 Ashley+0.004
 awoke+0.004
 bit+0.004
Neg logit contributions
bled-0.006
uddled-0.006
iggle-0.006
 olds-0.006
ails-0.006
Token,
Token ID11
Feature activation+3.33e+00

Pos logit contributions
ifying+0.226
 distance+0.215
 trace+0.213
 total+0.198
Oh+0.197
Neg logit contributions
uddled-0.081
bled-0.072
iggle-0.060
mm-0.058
irled-0.057
Token when
Token ID618
Feature activation+3.22e+00

Pos logit contributions
ifying+0.176
ez+0.164
use+0.159
 trace+0.159
 cavity+0.155
Neg logit contributions
uddled-0.093
bled-0.092
mm-0.077
iggle-0.073
ires-0.073
Token Pen
Token ID7507
Feature activation+5.03e-01

Pos logit contributions
ifying+0.178
 tant+0.163
ez+0.161
ff+0.161
talk+0.160
Neg logit contributions
bled-0.080
uddled-0.076
mm-0.076
irled-0.064
iggle-0.063
Tokenelope
Token ID47329
Feature activation-2.64e-01

Pos logit contributions
ifying+0.018
 Is+0.016
 total+0.015
 Ashley+0.015
 single+0.015
Neg logit contributions
uddled-0.023
bled-0.022
iggle-0.021
ails-0.021
irled-0.021
Token was
Token ID373
Feature activation-7.48e-01

Pos logit contributions
bled+0.013
uddled+0.013
 olds+0.012
iggle+0.012
ails+0.012
Neg logit contributions
ifying-0.008
 Is-0.007
 awoke-0.007
 bit-0.007
 Ashley-0.007
Tokenily
Token ID813
Feature activation-1.53e-01

Pos logit contributions
bled+0.099
 olds+0.094
castle+0.091
yard+0.090
 raft+0.089
Neg logit contributions
of-0.083
 care-0.082
 minute-0.081
 nine-0.081
ges-0.081
Token,
Token ID11
Feature activation+3.10e-02

Pos logit contributions
bled+0.007
uddled+0.007
 olds+0.006
iggle+0.006
ails+0.006
Neg logit contributions
ifying-0.005
 Is-0.005
 Ashley-0.005
 bit-0.005
 awoke-0.005
Token can
Token ID460
Feature activation-5.71e-01

Pos logit contributions
ifying+0.001
 Is+0.001
 Ashley+0.001
 awoke+0.001
 bit+0.001
Neg logit contributions
bled-0.002
uddled-0.002
 olds-0.002
iggle-0.002
ails-0.001
Token you
Token ID345
Feature activation-8.37e-02

Pos logit contributions
bled+0.028
uddled+0.027
 olds+0.026
ails+0.026
iggle+0.026
Neg logit contributions
ifying-0.018
 Is-0.018
 Ashley-0.017
 bit-0.017
 Will-0.017
Token answer
Token ID3280
Feature activation+1.20e+00

Pos logit contributions
bled+0.004
uddled+0.004
 olds+0.004
iggle+0.004
ails+0.004
Neg logit contributions
ifying-0.003
 Is-0.002
 bit-0.002
 Ashley-0.002
 awoke-0.002
Token a
Token ID257
Feature activation+3.64e+00

Pos logit contributions
ifying+0.043
 awoke+0.040
 Is+0.039
 distance+0.039
 single+0.039
Neg logit contributions
bled-0.052
uddled-0.050
 olds-0.049
 golf-0.048
iggle-0.048
Token question
Token ID1808
Feature activation+1.39e+00

Pos logit contributions
ifying+0.177
ges+0.150
ez+0.147
R+0.137
talk+0.137
Neg logit contributions
 new-0.105
uddle-0.105
 old-0.100
 golf-0.096
 idea-0.096
Token for
Token ID329
Feature activation+8.78e-01

Pos logit contributions
ifying+0.059
 distance+0.057
Lou+0.057
 awoke+0.056
Jonathan+0.055
Neg logit contributions
bled-0.050
uddled-0.048
ails-0.047
 olds-0.046
ires-0.045
Token me
Token ID502
Feature activation+4.61e-01

Pos logit contributions
ifying+0.034
 awoke+0.031
 Is+0.030
 distance+0.029
 single+0.029
Neg logit contributions
bled-0.037
 olds-0.035
uddled-0.034
iggle-0.034
 golf-0.034
Token?"
Token ID1701
Feature activation-7.01e-01

Pos logit contributions
ifying+0.019
 Is+0.017
 Ashley+0.017
 awoke+0.017
 bit+0.017
Neg logit contributions
bled-0.018
uddled-0.018
 olds-0.017
iggle-0.017
ails-0.017
Token Lily
Token ID20037
Feature activation-4.90e-01

Pos logit contributions
bled+0.034
uddled+0.032
castle+0.031
bles+0.031
agons+0.031
Neg logit contributions
 Is-0.023
 bit-0.023
 Ashley-0.023
 Will-0.023
 care-0.023
Token world
Token ID995
Feature activation-6.92e-01

Pos logit contributions
bled+0.004
uddled+0.004
 olds+0.003
iggle+0.003
ails+0.003
Neg logit contributions
ifying-0.004
 Is-0.003
 Ashley-0.003
 bit-0.003
 awoke-0.003
Token.
Token ID13
Feature activation+2.94e+00

Pos logit contributions
bled+0.038
uddled+0.037
iggle+0.034
irled+0.034
uddle+0.034
Neg logit contributions
ifying-0.018
 Is-0.017
 bit-0.017
 single-0.017
 distance-0.017
Token
Token ID198
Feature activation+2.02e+00

Pos logit contributions
ifying+0.150
 bit+0.139
 away+0.127
 trace+0.126
 distance+0.123
Neg logit contributions
ires-0.078
iggle-0.077
mm-0.076
 olds-0.075
uddled-0.072
Token
Token ID198
Feature activation+3.19e+00

Pos logit contributions
ifying+0.094
 bit+0.085
 awoke+0.075
 trace+0.075
 Is+0.075
Neg logit contributions
uddled-0.072
iggle-0.070
ails-0.070
bled-0.066
uddle-0.065
TokenOne
Token ID3198
Feature activation+5.53e-01

Pos logit contributions
ifying+0.152
 bit+0.127
 trace+0.123
inea+0.123
 tant+0.122
Neg logit contributions
uddled-0.102
agons-0.094
irling-0.094
mm-0.092
ires-0.090
Token day
Token ID1110
Feature activation+3.59e+00

Pos logit contributions
ifying+0.022
 Is+0.020
 Ashley+0.019
 awoke+0.019
 Debbie+0.019
Neg logit contributions
uddled-0.022
bled-0.022
iggle-0.022
ails-0.021
 olds-0.021
Token,
Token ID11
Feature activation+3.34e+00

Pos logit contributions
ifying+0.228
 distance+0.222
 trace+0.214
 fright+0.205
Oh+0.203
Neg logit contributions
uddled-0.066
bled-0.052
mm-0.046
iggle-0.045
irled-0.041
Token Ashley
Token ID20899
Feature activation-6.50e-01

Pos logit contributions
ifying+0.125
ez+0.112
talk+0.109
 distance+0.109
ff+0.109
Neg logit contributions
uddled-0.138
bled-0.130
mm-0.124
ires-0.117
kins-0.117
Token was
Token ID373
Feature activation-4.66e-01

Pos logit contributions
bled+0.036
uddled+0.034
 olds+0.033
iggle+0.033
agons+0.033
Neg logit contributions
 awoke-0.017
 bit-0.016
ifying-0.016
 Is-0.016
 Ashley-0.016
Token at
Token ID379
Feature activation+7.21e-01

Pos logit contributions
bled+0.019
uddled+0.018
 olds+0.018
castle+0.017
uddle+0.017
Neg logit contributions
ifying-0.018
 Is-0.018
 bit-0.018
 Ashley-0.018
 single-0.017
Token the
Token ID262
Feature activation+4.34e-01

Pos logit contributions
ifying+0.025
 awoke+0.021
 bit+0.021
 Is+0.020
Jonathan+0.020
Neg logit contributions
bled-0.034
 golf-0.033
uddled-0.033
iggle-0.032
 olds-0.032
Token stop
Token ID2245
Feature activation+7.38e-01

Pos logit contributions
ifying+0.057
 Ashley+0.045
 awoke+0.045
Jonathan+0.045
Am+0.044
Neg logit contributions
iggle-0.058
 golf-0.055
bles-0.055
 olds-0.054
ails-0.054
Token!
Token ID0
Feature activation+2.42e+00

Pos logit contributions
ifying+0.030
 awoke+0.027
 Is+0.027
 Ashley+0.027
 total+0.026
Neg logit contributions
bled-0.030
iggle-0.029
uddled-0.028
 olds-0.028
 golf-0.028
Token
Token ID198
Feature activation-2.58e-01

Pos logit contributions
ifying+0.105
 trace+0.101
 bit+0.099
 distance+0.093
use+0.092
Neg logit contributions
uddled-0.079
 olds-0.079
bled-0.078
ails-0.076
add-0.075
Token
Token ID198
Feature activation+1.65e+00

Pos logit contributions
bled+0.014
uddled+0.014
 olds+0.013
iggle+0.013
castle+0.013
Neg logit contributions
ifying-0.007
 Is-0.006
 Ashley-0.006
 awoke-0.006
 Debbie-0.006
TokenFinally
Token ID11158
Feature activation+1.64e+00

Pos logit contributions
ifying+0.076
 distance+0.069
 trace+0.069
 bit+0.068
 care+0.067
Neg logit contributions
uddled-0.059
bled-0.058
irled-0.052
add-0.052
irling-0.050
Token,
Token ID11
Feature activation+3.58e+00

Pos logit contributions
ifying+0.090
 distance+0.080
Oh+0.077
 single+0.077
 volume+0.077
Neg logit contributions
bled-0.046
uddled-0.043
iggle-0.041
mm-0.040
irled-0.038
Token she
Token ID673
Feature activation-1.57e-01

Pos logit contributions
ifying+0.145
use+0.143
ff+0.137
ez+0.135
 trace+0.134
Neg logit contributions
bled-0.132
uddled-0.116
mm-0.113
iggle-0.107
overed-0.106
Token reached
Token ID4251
Feature activation+6.65e-01

Pos logit contributions
bled+0.009
uddled+0.008
 olds+0.008
iggle+0.008
ails+0.008
Neg logit contributions
ifying-0.004
 Is-0.004
 bit-0.004
 awoke-0.004
 Ashley-0.004
Token the
Token ID262
Feature activation+1.80e+00

Pos logit contributions
ifying+0.021
 Is+0.018
 bit+0.018
 awoke+0.018
Jonathan+0.018
Neg logit contributions
bled-0.033
uddled-0.031
iggle-0.031
 olds-0.030
 golf-0.030
Token top
Token ID1353
Feature activation+6.99e-01

Pos logit contributions
 Ashley+0.080
ifying+0.080
ges+0.079
ez+0.079
 Debbie+0.079
Neg logit contributions
castle-0.053
uddle-0.052
bled-0.052
 golf-0.051
 clubhouse-0.049
Token of
Token ID286
Feature activation+8.53e-01

Pos logit contributions
ifying+0.027
 Is+0.025
 Ashley+0.024
 awoke+0.024
 Debbie+0.024
Neg logit contributions
bled-0.028
uddled-0.028
 olds-0.027
iggle-0.027
castle-0.027
Token she
Token ID673
Feature activation-7.47e-01

Pos logit contributions
ifying+0.129
use+0.121
 distance+0.116
se+0.111
 volume+0.109
Neg logit contributions
ires-0.081
bled-0.079
uddled-0.071
mm-0.070
kins-0.069
Token decided
Token ID3066
Feature activation+2.82e-02

Pos logit contributions
bled+0.038
uddled+0.038
 olds+0.038
iggle+0.038
ails+0.037
Neg logit contributions
 bit-0.021
 awoke-0.019
ifying-0.019
 Is-0.019
 Will-0.019
Token to
Token ID284
Feature activation+1.11e-01

Pos logit contributions
ifying+0.000
 Is+0.000
 Ashley+0.000
 awoke+0.000
 bit+0.000
Neg logit contributions
bled-0.002
uddled-0.002
 olds-0.002
iggle-0.002
ails-0.002
Token shoot
Token ID2686
Feature activation+2.50e+00

Pos logit contributions
ifying+0.004
 Is+0.003
 Ashley+0.003
 awoke+0.003
 bit+0.003
Neg logit contributions
bled-0.005
uddled-0.005
iggle-0.005
 olds-0.005
ails-0.005
Token it
Token ID340
Feature activation+3.56e-01

Pos logit contributions
ifying+0.114
 total+0.103
 distance+0.100
use+0.095
bie+0.094
Neg logit contributions
iggle-0.079
bled-0.078
 olds-0.077
 golf-0.075
ires-0.073
Token.
Token ID13
Feature activation+3.57e+00

Pos logit contributions
ifying+0.012
 Ashley+0.011
 Is+0.011
 awoke+0.011
 Debbie+0.011
Neg logit contributions
bled-0.016
uddled-0.016
 olds-0.015
iggle-0.015
 golf-0.015
Token
Token ID220
Feature activation+1.69e-02

Pos logit contributions
ifying+0.196
 bit+0.168
 tant+0.163
 trace+0.163
 single+0.162
Neg logit contributions
ires-0.090
yard-0.089
azes-0.081
ate-0.081
 olds-0.078
Token
Token ID198
Feature activation+2.63e+00

Pos logit contributions
ifying+0.000
 Is+0.000
 bit+0.000
 Ashley+0.000
 awoke+0.000
Neg logit contributions
bled-0.001
uddled-0.001
 olds-0.001
iggle-0.001
ails-0.001
Token
Token ID198
Feature activation+3.89e+00

Pos logit contributions
ifying+0.142
 bit+0.125
 trace+0.111
 awoke+0.109
 single+0.109
Neg logit contributions
iggle-0.077
ails-0.075
bled-0.073
ires-0.073
uddled-0.072
TokenJackson
Token ID31270
Feature activation-9.80e-01

Pos logit contributions
ifying+0.167
bie+0.151
 single+0.143
inea+0.139
burn+0.137
Neg logit contributions
ate-0.133
ires-0.132
uring-0.117
lf-0.116
bles-0.115
Token got
Token ID1392
Feature activation+1.09e+00

Pos logit contributions
bled+0.053
uddled+0.052
 olds+0.051
ires+0.050
iggle+0.049
Neg logit contributions
 bit-0.026
 awoke-0.026
 Is-0.023
 care-0.023
 heard-0.023
Token around
Token ID1088
Feature activation+1.04e+00

Pos logit contributions
ifying+0.019
 Is+0.018
 Ashley+0.017
 awoke+0.017
 Debbie+0.017
Neg logit contributions
bled-0.020
uddled-0.020
iggle-0.019
 olds-0.019
 golf-0.019
Token the
Token ID262
Feature activation+4.45e-01

Pos logit contributions
ifying+0.042
 awoke+0.038
 Is+0.037
 Debbie+0.037
 bit+0.036
Neg logit contributions
iggle-0.042
bled-0.041
 golf-0.040
 olds-0.040
uddled-0.038
Token park
Token ID3952
Feature activation-5.42e-01

Pos logit contributions
ifying+0.019
 Is+0.019
 Ashley+0.018
 awoke+0.018
 bit+0.018
Neg logit contributions
bled-0.016
uddled-0.016
 olds-0.015
iggle-0.015
uddle-0.015
Token.
Token ID13
Feature activation+2.87e+00

Pos logit contributions
bled+0.028
uddled+0.027
 olds+0.025
uddle+0.025
iggle+0.025
Neg logit contributions
ifying-0.016
 bit-0.015
 Is-0.015
 Ashley-0.015
 single-0.014
Token One
Token ID1881
Feature activation+3.10e-01

Pos logit contributions
ifying+0.147
 bit+0.138
 away+0.122
 trace+0.121
 total+0.118
Neg logit contributions
ires-0.087
mm-0.087
iggle-0.079
ails-0.077
uddled-0.074
Token day
Token ID1110
Feature activation+3.56e+00

Pos logit contributions
ifying+0.011
 Is+0.010
 Ashley+0.010
 awoke+0.010
 Debbie+0.010
Neg logit contributions
bled-0.013
uddled-0.013
iggle-0.013
 olds-0.013
ails-0.012
Token,
Token ID11
Feature activation+3.45e+00

Pos logit contributions
ifying+0.229
 trace+0.214
 distance+0.210
 fright+0.208
Lou+0.205
Neg logit contributions
uddled-0.064
mm-0.055
bled-0.054
iggle-0.042
irled-0.041
Token as
Token ID355
Feature activation+3.32e+00

Pos logit contributions
ifying+0.158
ez+0.150
use+0.135
 trace+0.131
inea+0.130
Neg logit contributions
bled-0.120
uddled-0.117
mm-0.116
ires-0.114
iring-0.104
Token she
Token ID673
Feature activation-2.80e-01

Pos logit contributions
ifying+0.149
ez+0.139
 trace+0.136
 tant+0.135
se+0.133
Neg logit contributions
mm-0.113
bled-0.108
ires-0.107
iring-0.105
uddled-0.102
Token was
Token ID373
Feature activation-7.19e-01

Pos logit contributions
bled+0.013
uddled+0.013
 olds+0.012
ails+0.012
iggle+0.012
Neg logit contributions
ifying-0.009
 Is-0.009
 bit-0.009
 awoke-0.009
 Ashley-0.009
Token dancing
Token ID15360
Feature activation+5.94e-01

Pos logit contributions
bled+0.027
uddled+0.026
ini+0.025
 olds+0.025
castle+0.025
Neg logit contributions
ifying-0.029
 Ashley-0.029
 Is-0.029
 bit-0.029
 awoke-0.028
Token if
Token ID611
Feature activation-5.31e-01

Pos logit contributions
ifying+0.061
Oh+0.057
 distance+0.055
 awoke+0.053
Lou+0.053
Neg logit contributions
ails-0.063
bled-0.062
bles-0.062
 olds-0.062
uddled-0.061
Token he
Token ID339
Feature activation+4.25e-01

Pos logit contributions
uddled+0.028
bled+0.028
 olds+0.026
iggle+0.026
uddle+0.026
Neg logit contributions
ifying-0.014
 Is-0.014
 Ashley-0.013
 awoke-0.013
 bit-0.013
Token could
Token ID714
Feature activation+2.91e-01

Pos logit contributions
ifying+0.013
 Ashley+0.011
 distance+0.011
 Is+0.011
 Debbie+0.011
Neg logit contributions
bled-0.021
uddled-0.021
 olds-0.020
iggle-0.020
ails-0.020
Token give
Token ID1577
Feature activation-1.63e-01

Pos logit contributions
ifying+0.009
 Ashley+0.008
 Is+0.008
 awoke+0.008
Lou+0.008
Neg logit contributions
bled-0.014
uddled-0.014
iggle-0.013
 olds-0.013
ails-0.013
Token her
Token ID607
Feature activation+2.20e+00

Pos logit contributions
bled+0.008
uddled+0.008
 olds+0.008
iggle+0.008
ails+0.008
Neg logit contributions
ifying-0.005
 Is-0.004
 Ashley-0.004
 bit-0.004
 awoke-0.004
Token a
Token ID257
Feature activation+3.54e+00

Pos logit contributions
ifying+0.089
Lou+0.089
 Is+0.086
Jonathan+0.085
 distance+0.084
Neg logit contributions
uddled-0.081
bled-0.081
uddle-0.073
iggle-0.072
 golf-0.070
Token massage
Token ID26900
Feature activation+2.00e+00

Pos logit contributions
ifying+0.136
Jonathan+0.129
ges+0.129
Lou+0.128
Am+0.127
Neg logit contributions
uddle-0.142
uddled-0.126
 golf-0.122
 new-0.121
iggle-0.120
Token.
Token ID13
Feature activation-4.43e-01

Pos logit contributions
ifying+0.085
Lou+0.083
Jonathan+0.081
 total+0.080
Am+0.080
Neg logit contributions
uddled-0.072
bled-0.071
uddle-0.067
iggle-0.065
 golf-0.065
Token Jane
Token ID12091
Feature activation+9.34e-01

Pos logit contributions
bled+0.020
uddled+0.019
iggle+0.017
 olds+0.017
agons+0.017
Neg logit contributions
ifying-0.016
 Is-0.016
 Ashley-0.015
 awoke-0.015
 Debbie-0.015
Token said
Token ID531
Feature activation-2.24e-01

Pos logit contributions
ifying+0.042
 distance+0.036
 single+0.036
 Is+0.036
 total+0.035
Neg logit contributions
uddled-0.037
bled-0.035
irled-0.032
iggle-0.032
 olds-0.032
Token "
Token ID366
Feature activation+1.88e+00

Pos logit contributions
bled+0.009
uddled+0.009
 olds+0.008
iggle+0.008
ails+0.008
Neg logit contributions
ifying-0.009
 Is-0.009
 Ashley-0.009
 awoke-0.008
 bit-0.008
Token named
Token ID3706
Feature activation-3.90e-01

Pos logit contributions
ifying+0.027
 Is+0.025
 Ashley+0.025
 awoke+0.025
 Debbie+0.025
Neg logit contributions
uddled-0.014
bled-0.013
ails-0.013
iggle-0.013
 olds-0.012
Token Pen
Token ID7507
Feature activation+1.08e+00

Pos logit contributions
bled+0.017
uddled+0.016
castle+0.015
 olds+0.015
agons+0.015
Neg logit contributions
ifying-0.015
 Ashley-0.014
 Is-0.014
 bit-0.014
 care-0.014
Tokenelope
Token ID47329
Feature activation+5.18e-01

Pos logit contributions
ifying+0.042
 Is+0.038
 total+0.037
 bit+0.036
 single+0.035
Neg logit contributions
uddled-0.044
bled-0.043
iggle-0.042
ails-0.041
ini-0.040
Token.
Token ID13
Feature activation+3.16e+00

Pos logit contributions
ifying+0.020
 Is+0.019
 Ashley+0.018
 awoke+0.018
Jonathan+0.018
Neg logit contributions
uddled-0.021
bled-0.021
ails-0.020
iggle-0.020
 olds-0.020
Token One
Token ID1881
Feature activation+1.43e-01

Pos logit contributions
ifying+0.169
 trace+0.151
 bit+0.143
 tant+0.143
 distance+0.142
Neg logit contributions
ires-0.077
mm-0.076
 olds-0.076
uddled-0.072
iggle-0.072
Token day
Token ID1110
Feature activation+3.54e+00

Pos logit contributions
ifying+0.005
 Is+0.004
 Ashley+0.004
 awoke+0.004
 bit+0.004
Neg logit contributions
bled-0.007
uddled-0.007
iggle-0.006
 olds-0.006
ails-0.006
Token,
Token ID11
Feature activation+3.06e+00

Pos logit contributions
ifying+0.219
 trace+0.208
 distance+0.204
 tant+0.193
talk+0.193
Neg logit contributions
uddled-0.072
bled-0.065
mm-0.062
irled-0.052
iggle-0.050
Token while
Token ID981
Feature activation+1.69e+00

Pos logit contributions
ifying+0.172
ez+0.154
 trace+0.149
talk+0.146
use+0.144
Neg logit contributions
uddled-0.074
bled-0.071
mm-0.069
ires-0.059
iggle-0.058
Token picking
Token ID10868
Feature activation-2.81e-01

Pos logit contributions
ifying+0.082
 tant+0.073
 distance+0.072
 bit+0.072
se+0.071
Neg logit contributions
bled-0.057
uddled-0.053
mm-0.049
 golf-0.047
irled-0.047
Token fresh
Token ID4713
Feature activation-5.79e-01

Pos logit contributions
bled+0.012
uddled+0.012
 olds+0.011
iggle+0.011
ails+0.011
Neg logit contributions
ifying-0.011
 Is-0.010
 Ashley-0.010
 bit-0.010
 awoke-0.010
Token flowers
Token ID12734
Feature activation-9.10e-01

Pos logit contributions
bled+0.024
uddled+0.024
iggle+0.022
castle+0.022
 olds+0.022
Neg logit contributions
ifying-0.023
 bit-0.021
 care-0.021
 Is-0.021
 awoke-0.020
Token mom
Token ID1995
Feature activation+4.15e-01

Pos logit contributions
ifying+0.010
 Is+0.009
 awoke+0.009
 Ashley+0.009
 distance+0.009
Neg logit contributions
bled-0.008
uddled-0.008
 olds-0.008
iggle-0.008
uddle-0.008
Token if
Token ID611
Feature activation-4.17e-01

Pos logit contributions
ifying+0.014
 Is+0.012
 bit+0.012
 Ashley+0.012
 awoke+0.012
Neg logit contributions
bled-0.020
uddled-0.019
ails-0.018
 olds-0.018
iggle-0.018
Token she
Token ID673
Feature activation+4.72e-01

Pos logit contributions
bled+0.022
uddled+0.021
 olds+0.021
iggle+0.020
ails+0.020
Neg logit contributions
ifying-0.011
 Is-0.011
 Ashley-0.011
 bit-0.011
 awoke-0.011
Token could
Token ID714
Feature activation+1.09e+00

Pos logit contributions
ifying+0.014
 Ashley+0.012
 distance+0.011
 Is+0.011
 awoke+0.011
Neg logit contributions
uddled-0.025
bled-0.025
 olds-0.023
ails-0.023
iggle-0.023
Token get
Token ID651
Feature activation+1.87e+00

Pos logit contributions
ifying+0.041
Lou+0.039
Jonathan+0.037
 Ashley+0.037
Oh+0.036
Neg logit contributions
bled-0.045
uddled-0.045
iggle-0.044
ails-0.042
castle-0.042
Token a
Token ID257
Feature activation+3.53e+00

Pos logit contributions
ifying+0.079
 awoke+0.077
Lou+0.075
 distance+0.075
Jonathan+0.072
Neg logit contributions
bled-0.076
 golf-0.066
uddled-0.066
bles-0.064
ires-0.062
Token new
Token ID649
Feature activation+2.39e+00

Pos logit contributions
ifying+0.147
Lou+0.130
ges+0.126
Jonathan+0.125
 awoke+0.124
Neg logit contributions
uddle-0.143
uddled-0.125
 new-0.124
 golf-0.123
castle-0.112
Token haircut
Token ID45548
Feature activation-2.38e-02

Pos logit contributions
ifying+0.099
Lou+0.078
 awoke+0.075
Jonathan+0.074
Jackson+0.069
Neg logit contributions
uddle-0.100
uddled-0.100
castle-0.099
 golf-0.099
iggle-0.097
Token.
Token ID13
Feature activation+2.53e-02

Pos logit contributions
bled+0.001
uddled+0.001
 olds+0.001
iggle+0.001
ails+0.001
Neg logit contributions
ifying-0.001
 Is-0.001
 Ashley-0.001
 bit-0.001
 awoke-0.001
Token Her
Token ID2332
Feature activation-4.80e-01

Pos logit contributions
ifying+0.001
 Is+0.001
 Ashley+0.001
 awoke+0.001
 bit+0.001
Neg logit contributions
bled-0.001
uddled-0.001
 olds-0.001
iggle-0.001
ails-0.001
Token mom
Token ID1995
Feature activation-4.84e-01

Pos logit contributions
bled+0.023
uddled+0.022
 olds+0.022
ails+0.021
ans+0.021
Neg logit contributions
ifying-0.016
 bit-0.015
 Is-0.015
 Ashley-0.015
 care-0.014
Token way
Token ID835
Feature activation+4.01e-01

Pos logit contributions
ifying+0.013
 Is+0.012
 Ashley+0.012
 awoke+0.012
 Debbie+0.012
Neg logit contributions
bled-0.015
uddled-0.015
 olds-0.014
iggle-0.014
ails-0.014
Token home
Token ID1363
Feature activation-6.21e-01

Pos logit contributions
ifying+0.012
 Is+0.012
 Ashley+0.011
 awoke+0.011
 Debbie+0.011
Neg logit contributions
bled-0.020
uddled-0.019
 olds-0.019
iggle-0.019
ails-0.018
Token.
Token ID13
Feature activation+9.05e-01

Pos logit contributions
bled+0.034
uddled+0.034
iggle+0.032
 olds+0.032
irled+0.031
Neg logit contributions
ifying-0.016
 bit-0.014
 Is-0.014
 Ashley-0.014
 awoke-0.014
Token
Token ID198
Feature activation+4.17e-01

Pos logit contributions
ifying+0.030
 bit+0.028
 single+0.028
 awoke+0.028
 distance+0.028
Neg logit contributions
bled-0.039
uddled-0.039
 olds-0.038
ails-0.037
 golf-0.036
Token
Token ID198
Feature activation+2.40e+00

Pos logit contributions
ifying+0.015
 bit+0.014
 Is+0.014
 awoke+0.013
 total+0.013
Neg logit contributions
bled-0.019
uddled-0.019
iggle-0.018
 olds-0.017
ails-0.017
TokenSo
Token ID2396
Feature activation+3.52e+00

Pos logit contributions
ifying+0.105
 trace+0.095
 single+0.091
 distance+0.090
 care+0.090
Neg logit contributions
uddled-0.087
bled-0.084
mm-0.080
add-0.079
irling-0.076
Token Pen
Token ID7507
Feature activation+1.71e+00

Pos logit contributions
ifying+0.153
 trace+0.133
 cavity+0.133
ez+0.131
 distance+0.131
Neg logit contributions
mm-0.130
bled-0.122
iggle-0.114
uddled-0.112
 olds-0.111
Tokenelope
Token ID47329
Feature activation-5.22e-01

Pos logit contributions
ifying+0.081
 total+0.068
 Is+0.067
 single+0.064
 distance+0.064
Neg logit contributions
uddled-0.059
bled-0.058
irled-0.054
iggle-0.054
mm-0.052
Token and
Token ID290
Feature activation+1.09e+00

Pos logit contributions
bled+0.025
uddled+0.024
 olds+0.023
iggle+0.023
ails+0.022
Neg logit contributions
ifying-0.017
 bit-0.016
 awoke-0.016
 Is-0.016
 Ashley-0.016
Token her
Token ID607
Feature activation-4.82e-01

Pos logit contributions
ifying+0.046
 single+0.038
 total+0.037
 bit+0.035
Lou+0.035
Neg logit contributions
bled-0.046
iggle-0.045
uddled-0.045
irled-0.042
mm-0.041
Token mom
Token ID1995
Feature activation-3.13e-01

Pos logit contributions
bled+0.023
uddled+0.022
ails+0.021
 olds+0.021
iggle+0.021
Neg logit contributions
ifying-0.016
 bit-0.015
 Is-0.015
 Ashley-0.015
 care-0.014
Token.
Token ID13
Feature activation+2.65e+00

Pos logit contributions
bled+0.019
uddled+0.019
 olds+0.018
iggle+0.018
uddle+0.018
Neg logit contributions
ifying-0.007
 Is-0.006
 Ashley-0.006
 bit-0.006
 care-0.006
Token
Token ID198
Feature activation+1.32e+00

Pos logit contributions
ifying+0.146
 trace+0.130
 bit+0.128
 total+0.123
 distance+0.121
Neg logit contributions
uddled-0.071
 olds-0.066
ails-0.064
ires-0.064
iggle-0.063
Token
Token ID198
Feature activation+2.90e+00

Pos logit contributions
ifying+0.055
 bit+0.049
 total+0.045
 Is+0.045
 awoke+0.044
Neg logit contributions
uddled-0.054
iggle-0.053
bled-0.053
ails-0.052
 olds-0.049
TokenThe
Token ID464
Feature activation-1.41e+00

Pos logit contributions
ifying+0.129
inea+0.110
bie+0.109
 bit+0.106
 trace+0.106
Neg logit contributions
uddled-0.103
irling-0.099
ggies-0.091
ires-0.091
ves-0.091
Token next
Token ID1306
Feature activation+7.44e-01

Pos logit contributions
ails+0.063
uddled+0.063
iggle+0.062
bled+0.062
 olds+0.060
Neg logit contributions
 distance-0.050
 night-0.050
 bit-0.049
 care-0.047
 single-0.047
Token day
Token ID1110
Feature activation+3.52e+00

Pos logit contributions
ifying+0.033
 Is+0.029
 Ashley+0.028
 awoke+0.028
 Debbie+0.028
Neg logit contributions
 golf-0.027
ails-0.027
bled-0.027
uddled-0.027
castle-0.027
Token,
Token ID11
Feature activation+3.20e+00

Pos logit contributions
ifying+0.234
 trace+0.210
 tant+0.210
use+0.208
 distance+0.207
Neg logit contributions
uddled-0.054
iggle-0.035
bled-0.034
mm-0.033
uddle-0.028
Token Ashley
Token ID20899
Feature activation-4.30e-01

Pos logit contributions
ifying+0.108
use+0.094
talk+0.093
 distance+0.090
ff+0.090
Neg logit contributions
uddled-0.142
bled-0.130
mm-0.128
iggle-0.128
ires-0.125
Token brought
Token ID3181
Feature activation+2.35e-01

Pos logit contributions
bled+0.023
uddled+0.023
 olds+0.022
iggle+0.022
ails+0.021
Neg logit contributions
ifying-0.011
 awoke-0.011
 bit-0.011
 Is-0.011
 Ashley-0.010
Token the
Token ID262
Feature activation-8.41e-01

Pos logit contributions
ifying+0.007
 Is+0.006
 bit+0.006
 awoke+0.006
 Ashley+0.006
Neg logit contributions
bled-0.012
uddled-0.012
iggle-0.011
 olds-0.011
ails-0.011
Token flag
Token ID6056
Feature activation+3.64e-01

Pos logit contributions
uddled+0.031
bled+0.031
 olds+0.029
iggle+0.029
ails+0.029
Neg logit contributions
ifying-0.035
 bit-0.035
 distance-0.034
 single-0.034
 total-0.033
Tokenmy
Token ID1820
Feature activation+9.22e-01

Pos logit contributions
bled+0.003
uddled+0.003
 olds+0.003
iggle+0.003
ails+0.003
Neg logit contributions
ifying-0.001
 Is-0.001
 Ashley-0.001
 bit-0.001
 awoke-0.001
Token,
Token ID11
Feature activation+8.51e-01

Pos logit contributions
ifying+0.044
Am+0.040
 Is+0.040
Pen+0.039
 distance+0.039
Neg logit contributions
uddled-0.031
bled-0.030
iggle-0.030
ails-0.029
 golf-0.029
Token we
Token ID356
Feature activation+1.53e+00

Pos logit contributions
ifying+0.033
 awoke+0.029
 distance+0.029
 Ashley+0.029
 single+0.029
Neg logit contributions
bled-0.036
uddled-0.035
 golf-0.035
iggle-0.033
castle-0.033
Token don
Token ID836
Feature activation-6.64e-01

Pos logit contributions
 Debbie+0.046
 Ashley+0.046
Jonathan+0.044
Lou+0.043
Pen+0.043
Neg logit contributions
bled-0.074
iggle-0.073
uddled-0.070
 golf-0.070
add-0.070
Token't
Token ID470
Feature activation+1.50e+00

Pos logit contributions
bled+0.040
uddled+0.039
 olds+0.038
ails+0.037
castle+0.037
Neg logit contributions
ifying-0.014
 Is-0.012
 Ashley-0.012
 awoke-0.011
 bit-0.011
Token have
Token ID423
Feature activation+3.52e+00

Pos logit contributions
ifying+0.050
Jonathan+0.047
 Ashley+0.046
 Debbie+0.046
Lou+0.045
Neg logit contributions
iggle-0.070
 golf-0.067
ails-0.067
ini-0.064
ans-0.063
Token any
Token ID597
Feature activation+1.42e+00

Pos logit contributions
 distance+0.152
ifying+0.151
 Is+0.151
Lou+0.149
Jonathan+0.146
Neg logit contributions
 golf-0.122
bled-0.106
add-0.102
 sandwiches-0.101
uddled-0.099
Token popcorn
Token ID38914
Feature activation+6.71e-01

Pos logit contributions
ifying+0.057
 Is+0.055
Lou+0.055
 awoke+0.053
 Ashley+0.051
Neg logit contributions
uddled-0.054
uddle-0.054
 olds-0.054
 golf-0.053
bled-0.053
Token left
Token ID1364
Feature activation+8.78e-01

Pos logit contributions
ifying+0.023
 Is+0.021
Lou+0.021
 Ashley+0.021
 awoke+0.021
Neg logit contributions
bled-0.031
uddled-0.031
castle-0.029
 olds-0.029
ails-0.029
Token.
Token ID13
Feature activation-4.80e-01

Pos logit contributions
ifying+0.031
 Is+0.031
Lou+0.030
 awoke+0.030
 Ashley+0.029
Neg logit contributions
bled-0.038
uddled-0.036
 olds-0.036
ails-0.035
iggle-0.035
Token We
Token ID775
Feature activation+1.83e+00

Pos logit contributions
bled+0.024
uddled+0.024
iggle+0.022
 olds+0.022
castle+0.022
Neg logit contributions
ifying-0.016
 Is-0.014
 Ashley-0.014
 bit-0.013
 Will-0.013
Token a
Token ID257
Feature activation-5.17e-01

Pos logit contributions
uddled+0.051
bled+0.051
 olds+0.049
uddle+0.049
ails+0.048
Neg logit contributions
ifying-0.021
 Ashley-0.019
 Will-0.018
 awoke-0.018
 Is-0.018
Token little
Token ID1310
Feature activation+1.61e-01

Pos logit contributions
bled+0.023
uddled+0.022
 olds+0.021
ails+0.021
iggle+0.021
Neg logit contributions
ifying-0.019
 bit-0.018
 Is-0.018
 Ashley-0.018
 single-0.017
Token car
Token ID1097
Feature activation+2.19e-01

Pos logit contributions
ifying+0.005
 Is+0.005
 awoke+0.005
 Ashley+0.005
 bit+0.004
Neg logit contributions
bled-0.008
uddled-0.008
 olds-0.007
iggle-0.007
ails-0.007
Token named
Token ID3706
Feature activation+1.89e-01

Pos logit contributions
ifying+0.009
 Is+0.009
 Ashley+0.009
 awoke+0.008
 Debbie+0.008
Neg logit contributions
bled-0.008
uddled-0.008
 olds-0.008
iggle-0.008
ails-0.008
Token Gray
Token ID12723
Feature activation+1.20e+00

Pos logit contributions
ifying+0.008
 Is+0.008
 bit+0.008
 awoke+0.008
 Ashley+0.008
Neg logit contributions
bled-0.007
uddled-0.007
 olds-0.006
iggle-0.006
 golf-0.006
Token.
Token ID13
Feature activation+3.52e+00

Pos logit contributions
ifying+0.053
 Is+0.048
 distance+0.047
Oh+0.047
 single+0.047
Neg logit contributions
uddled-0.043
bled-0.042
ails-0.041
bles-0.040
 olds-0.040
Token Gray
Token ID12723
Feature activation-1.88e-01

Pos logit contributions
ifying+0.122
 bit+0.106
 distance+0.105
bie+0.104
den+0.099
Neg logit contributions
agons-0.147
mm-0.146
 golf-0.136
ires-0.134
 olds-0.134
Token was
Token ID373
Feature activation+1.55e-03

Pos logit contributions
bled+0.008
uddled+0.008
 olds+0.008
iggle+0.008
ails+0.008
Neg logit contributions
ifying-0.007
 Is-0.006
 Ashley-0.006
 bit-0.006
 awoke-0.006
Token very
Token ID845
Feature activation-8.41e-01

Pos logit contributions
ifying+0.000
 Is+0.000
 Ashley+0.000
 awoke+0.000
 bit+0.000
Neg logit contributions
bled-0.000
uddled-0.000
 olds-0.000
iggle-0.000
ails-0.000
Token fast
Token ID3049
Feature activation+5.34e-01

Pos logit contributions
uddled+0.037
bled+0.036
 olds+0.033
ails+0.033
castle+0.033
Neg logit contributions
ifying-0.030
 bit-0.030
 single-0.030
 Is-0.030
 care-0.029
Token and
Token ID290
Feature activation+4.15e-02

Pos logit contributions
ifying+0.023
 Is+0.021
 awoke+0.021
 Ashley+0.021
 Debbie+0.021
Neg logit contributions
uddled-0.020
bled-0.020
 olds-0.019
iggle-0.018
 golf-0.018
TokenTom
Token ID13787
Feature activation+1.46e-01

Pos logit contributions
bled+0.003
uddled+0.003
 olds+0.003
iggle+0.003
ails+0.003
Neg logit contributions
ifying-0.002
 Is-0.002
 Ashley-0.002
 bit-0.002
 awoke-0.002
Token nods
Token ID34833
Feature activation+2.86e-01

Pos logit contributions
ifying+0.005
 Is+0.005
 Ashley+0.005
 awoke+0.005
 bit+0.004
Neg logit contributions
bled-0.007
uddled-0.007
 olds-0.006
iggle-0.006
ails-0.006
Token.
Token ID13
Feature activation-5.97e-01

Pos logit contributions
ifying+0.011
 Is+0.010
 Ashley+0.010
 bit+0.010
 awoke+0.009
Neg logit contributions
bled-0.013
uddled-0.012
iggle-0.012
 olds-0.011
ails-0.011
Token He
Token ID679
Feature activation-7.28e-01

Pos logit contributions
bled+0.032
uddled+0.032
iggle+0.030
ini+0.030
ans+0.030
Neg logit contributions
 Is-0.016
ifying-0.016
 Ashley-0.015
 bit-0.014
 Will-0.014
Token takes
Token ID2753
Feature activation+1.33e+00

Pos logit contributions
bled+0.038
uddled+0.036
 olds+0.035
ini+0.034
iggle+0.034
Neg logit contributions
ifying-0.020
 bit-0.020
 Is-0.020
 Ashley-0.019
 awoke-0.019
Token a
Token ID257
Feature activation+3.51e+00

Pos logit contributions
ifying+0.055
 Is+0.049
 distance+0.049
 awoke+0.049
Jonathan+0.047
Neg logit contributions
bled-0.054
add-0.048
 golf-0.048
iggle-0.047
ails-0.047
Token corn
Token ID11676
Feature activation+1.98e+00

Pos logit contributions
ifying+0.163
 Is+0.151
Oh+0.144
ez+0.140
 fail+0.139
Neg logit contributions
uddle-0.110
iggle-0.105
 raft-0.098
 new-0.094
bles-0.094
Token from
Token ID422
Feature activation+8.71e-03

Pos logit contributions
ifying+0.088
 awoke+0.086
Jonathan+0.086
 Is+0.085
Lou+0.085
Neg logit contributions
bled-0.069
castle-0.066
bles-0.065
ails-0.064
ble-0.064
Token the
Token ID262
Feature activation+2.13e+00

Pos logit contributions
ifying+0.000
 Is+0.000
 Ashley+0.000
 awoke+0.000
 bit+0.000
Neg logit contributions
bled-0.000
uddled-0.000
 olds-0.000
iggle-0.000
ails-0.000
Token plant
Token ID4618
Feature activation+6.50e-01

Pos logit contributions
ifying+0.100
ez+0.100
 awoke+0.099
 Is+0.099
 Will+0.099
Neg logit contributions
 golf-0.059
yard-0.058
ans-0.055
 raft-0.054
agons-0.054
Token.
Token ID13
Feature activation-6.57e-01

Pos logit contributions
ifying+0.025
 Is+0.023
 Ashley+0.023
Jonathan+0.022
 awoke+0.022
Neg logit contributions
bled-0.027
uddled-0.027
ails-0.026
iggle-0.026
 olds-0.025
Token
Token ID220
Feature activation-2.52e-01

Pos logit contributions
 bit+0.127
ifying+0.126
 distance+0.122
 trace+0.118
 away+0.118
Neg logit contributions
ires-0.080
azes-0.077
ails-0.076
 olds-0.075
mm-0.074
Token
Token ID198
Feature activation+9.54e-01

Pos logit contributions
bled+0.014
uddled+0.014
 olds+0.013
iggle+0.013
castle+0.013
Neg logit contributions
ifying-0.007
 Is-0.006
 Ashley-0.006
 awoke-0.006
 Debbie-0.006
Token
Token ID198
Feature activation+2.14e+00

Pos logit contributions
ifying+0.038
 bit+0.035
 awoke+0.031
 total+0.031
 distance+0.031
Neg logit contributions
bled-0.042
uddled-0.041
iggle-0.039
irled-0.038
ails-0.038
TokenThe
Token ID464
Feature activation+4.00e-02

Pos logit contributions
ifying+0.094
 bit+0.089
use+0.082
 distance+0.082
 awoke+0.078
Neg logit contributions
bled-0.083
uddled-0.079
irled-0.079
irling-0.077
iggle-0.074
Token end
Token ID886
Feature activation+1.19e+00

Pos logit contributions
ifying+0.002
 Is+0.002
 Ashley+0.001
 awoke+0.001
 bit+0.001
Neg logit contributions
bled-0.002
uddled-0.002
 olds-0.001
iggle-0.001
ails-0.001
Token.
Token ID13
Feature activation+3.50e+00

Pos logit contributions
ifying+0.053
 awoke+0.048
 Debbie+0.048
 distance+0.047
 bit+0.047
Neg logit contributions
 olds-0.041
uddled-0.039
iggle-0.039
ails-0.038
uddle-0.037
Token<|endoftext|>
Token ID50256
Feature activation+4.63e-01

Pos logit contributions
 bit+0.189
use+0.180
 trace+0.179
ifying+0.177
 away+0.176
Neg logit contributions
mm-0.078
ires-0.074
azes-0.065
iggle-0.063
ails-0.062
TokenOne
Token ID3198
Feature activation+5.19e-01

Pos logit contributions
ifying+0.013
 bit+0.011
 Is+0.011
 Ashley+0.011
 awoke+0.011
Neg logit contributions
bled-0.024
uddled-0.024
ails-0.023
 olds-0.023
iggle-0.023
Token day
Token ID1110
Feature activation+6.13e-01

Pos logit contributions
ifying+0.019
 Is+0.017
 Ashley+0.017
Am+0.016
 bit+0.016
Neg logit contributions
bled-0.023
uddled-0.022
iggle-0.022
 olds-0.021
 golf-0.021
Token,
Token ID11
Feature activation-3.67e-01

Pos logit contributions
ifying+0.026
 Is+0.023
 distance+0.023
 bit+0.023
 awoke+0.023
Neg logit contributions
bled-0.024
uddled-0.023
iggle-0.022
 olds-0.021
irled-0.021
Token a
Token ID257
Feature activation-8.06e-01

Pos logit contributions
bled+0.018
uddled+0.018
 olds+0.017
iggle+0.017
ails+0.017
Neg logit contributions
ifying-0.012
 Is-0.011
 bit-0.011
 Ashley-0.010
 awoke-0.010
Token,
Token ID11
Feature activation+1.02e+00

Pos logit contributions
ifying+0.085
Am+0.079
 distance+0.079
Georgia+0.078
 Debbie+0.078
Neg logit contributions
bled-0.041
 golf-0.039
uddled-0.038
ails-0.033
irled-0.033
Token it
Token ID340
Feature activation-1.45e-01

Pos logit contributions
ifying+0.039
 awoke+0.037
Am+0.035
 Debbie+0.035
use+0.034
Neg logit contributions
bled-0.044
 golf-0.042
uddled-0.041
 olds-0.038
bles-0.037
Token went
Token ID1816
Feature activation-5.46e-02

Pos logit contributions
bled+0.007
uddled+0.007
 olds+0.007
iggle+0.007
ails+0.007
Neg logit contributions
ifying-0.004
 Is-0.004
 awoke-0.004
 bit-0.004
 Ashley-0.004
Token on
Token ID319
Feature activation-5.73e-02

Pos logit contributions
bled+0.003
uddled+0.003
 olds+0.003
iggle+0.003
ails+0.003
Neg logit contributions
ifying-0.001
 Is-0.001
 Ashley-0.001
 bit-0.001
 awoke-0.001
Token the
Token ID262
Feature activation-1.10e+00

Pos logit contributions
bled+0.003
uddled+0.003
 olds+0.003
iggle+0.003
ails+0.003
Neg logit contributions
ifying-0.002
 Is-0.002
 Ashley-0.002
 bit-0.002
 awoke-0.002
Token w
Token ID266
Feature activation-4.16e+00

Pos logit contributions
bled+0.051
uddled+0.050
irled+0.048
ails+0.048
iggle+0.048
Neg logit contributions
 distance-0.042
 bit-0.039
ifying-0.037
 awoke-0.036
 single-0.036
Tokenett
Token ID3087
Feature activation-1.16e+00

Pos logit contributions
bled+0.120
kins+0.093
uddled+0.092
mm+0.089
 olds+0.089
Neg logit contributions
 care-0.183
 single-0.177
ff-0.170
 month-0.166
 wake-0.164
Tokenest
Token ID395
Feature activation-3.50e-01

Pos logit contributions
uddled+0.054
bled+0.053
uddle+0.050
 olds+0.049
mm+0.048
Neg logit contributions
 awoke-0.039
 distance-0.039
 bit-0.038
ifying-0.038
 single-0.038
Token of
Token ID286
Feature activation-1.30e+00

Pos logit contributions
bled+0.016
uddled+0.016
 olds+0.015
iggle+0.015
ails+0.014
Neg logit contributions
ifying-0.012
 bit-0.011
 awoke-0.011
 Is-0.011
 distance-0.011
Token adventures
Token ID17545
Feature activation-9.56e-01

Pos logit contributions
uddled+0.071
bled+0.066
uddle+0.066
iggle+0.064
ini+0.063
Neg logit contributions
 care-0.035
 Will-0.034
 eight-0.033
 Is-0.032
ifying-0.032
Token.
Token ID13
Feature activation-8.80e-02

Pos logit contributions
bled+0.051
uddled+0.050
uddle+0.048
iggle+0.047
add+0.047
Neg logit contributions
ifying-0.026
 bit-0.024
 single-0.023
 care-0.023
 Is-0.023
Token M
Token ID337
Feature activation-2.34e+00

Pos logit contributions
bled+0.013
uddled+0.013
iggle+0.012
 olds+0.012
uddle+0.012
Neg logit contributions
ifying-0.013
 Is-0.012
 Ashley-0.012
 bit-0.012
 awoke-0.012
Tokenandy
Token ID10757
Feature activation-1.45e+00

Pos logit contributions
bled+0.089
 golf+0.080
oving+0.074
 olds+0.072
ized+0.072
Neg logit contributions
bie-0.100
 Will-0.097
 Mae-0.097
Am-0.096
 Ashley-0.096
Token and
Token ID290
Feature activation+5.82e-01

Pos logit contributions
bled+0.081
uddled+0.076
ires+0.072
uddle+0.072
mm+0.072
Neg logit contributions
 bit-0.038
 Ashley-0.035
 care-0.034
 awoke-0.033
 Will-0.033
Token Sally
Token ID25737
Feature activation+2.49e-01

Pos logit contributions
ifying+0.027
 awoke+0.025
 Is+0.024
 distance+0.024
 single+0.024
Neg logit contributions
bled-0.020
uddled-0.018
 golf-0.018
iggle-0.018
 olds-0.018
Token.
Token ID13
Feature activation-5.25e-01

Pos logit contributions
ifying+0.009
 Is+0.009
 awoke+0.008
 Ashley+0.008
 bit+0.008
Neg logit contributions
bled-0.011
uddled-0.011
iggle-0.010
 olds-0.010
ails-0.010
Token M
Token ID337
Feature activation-3.96e+00

Pos logit contributions
bled+0.024
uddled+0.024
 olds+0.021
iggle+0.021
irled+0.021
Neg logit contributions
ifying-0.019
 Is-0.018
 Ashley-0.017
 bit-0.017
 care-0.017
Tokenandy
Token ID10757
Feature activation-2.13e+00

Pos logit contributions
bled+0.200
oving+0.173
 golf+0.170
renches+0.162
agons+0.159
Neg logit contributions
bie-0.117
y-0.109
abel-0.101
 Mae-0.101
 night-0.099
Token was
Token ID373
Feature activation-1.10e+00

Pos logit contributions
bled+0.128
uddled+0.115
ires+0.115
agons+0.115
bles+0.113
Neg logit contributions
 bit-0.050
 care-0.046
 heard-0.046
 woke-0.045
 awoke-0.044
Token a
Token ID257
Feature activation+3.12e-01

Pos logit contributions
uddled+0.066
bled+0.063
uddle+0.062
 olds+0.060
castle+0.059
Neg logit contributions
 Ashley-0.025
 bit-0.025
 Will-0.025
 Is-0.024
ifying-0.024
Token very
Token ID845
Feature activation-2.03e+00

Pos logit contributions
ifying+0.013
 Is+0.012
 awoke+0.012
 Ashley+0.012
 Debbie+0.012
Neg logit contributions
bled-0.012
uddled-0.012
 golf-0.011
 olds-0.011
iggle-0.011
Token kind
Token ID1611
Feature activation-6.63e-01

Pos logit contributions
bled+0.098
uddled+0.098
ized+0.084
mm+0.082
ans+0.081
Neg logit contributions
 bit-0.075
 single-0.071
 care-0.071
 Will-0.067
 total-0.065
Token there
Token ID612
Feature activation+1.22e-01

Pos logit contributions
bled+0.028
uddled+0.027
 olds+0.026
ans+0.025
agons+0.025
Neg logit contributions
ifying-0.018
 Is-0.017
 Ashley-0.017
 bit-0.016
 awoke-0.016
Token were
Token ID547
Feature activation-9.78e-01

Pos logit contributions
ifying+0.005
 Is+0.004
 Ashley+0.004
 bit+0.004
 awoke+0.004
Neg logit contributions
bled-0.005
uddled-0.005
iggle-0.005
 olds-0.005
ails-0.005
Token two
Token ID734
Feature activation-1.83e+00

Pos logit contributions
uddled+0.049
bled+0.048
 olds+0.045
ails+0.045
iggle+0.045
Neg logit contributions
ifying-0.030
 Will-0.028
Am-0.028
 Ashley-0.028
Lou-0.027
Token friends
Token ID2460
Feature activation-3.97e-01

Pos logit contributions
bled+0.069
uddled+0.061
irled+0.056
ails+0.055
iggle+0.054
Neg logit contributions
 bit-0.084
ifying-0.078
 Ashley-0.078
 eight-0.077
 total-0.077
Token,
Token ID11
Feature activation-4.07e-01

Pos logit contributions
bled+0.018
uddled+0.018
 olds+0.016
uddle+0.016
iggle+0.016
Neg logit contributions
ifying-0.014
 bit-0.013
 Is-0.013
 Ashley-0.013
 care-0.013
Token an
Token ID281
Feature activation-3.91e+00

Pos logit contributions
bled+0.018
uddled+0.018
iggle+0.016
 olds+0.016
uddle+0.016
Neg logit contributions
ifying-0.015
 Ashley-0.014
 Is-0.014
 bit-0.014
 Will-0.014
Token elephant
Token ID20950
Feature activation+3.94e-02

Pos logit contributions
uddled+0.216
bled+0.212
uddle+0.196
irled+0.179
ades+0.179
Neg logit contributions
 baby-0.114
 night-0.113
 nine-0.110
 bit-0.107
 eight-0.107
Token and
Token ID290
Feature activation-4.65e-02

Pos logit contributions
ifying+0.002
 Is+0.002
 Ashley+0.002
 awoke+0.002
 bit+0.001
Neg logit contributions
bled-0.002
uddled-0.002
 olds-0.001
iggle-0.001
ails-0.001
Token a
Token ID257
Feature activation-3.67e-01

Pos logit contributions
bled+0.002
uddled+0.002
 olds+0.002
iggle+0.002
ails+0.002
Neg logit contributions
ifying-0.002
 Is-0.001
 Ashley-0.001
 bit-0.001
 awoke-0.001
Token dinosaur
Token ID28257
Feature activation-7.39e-02

Pos logit contributions
bled+0.017
uddled+0.017
 olds+0.016
iggle+0.016
ails+0.016
Neg logit contributions
ifying-0.013
 bit-0.012
 Is-0.012
 Ashley-0.012
 awoke-0.011
Token.
Token ID13
Feature activation+2.77e-01

Pos logit contributions
bled+0.003
uddled+0.003
 olds+0.003
iggle+0.003
ails+0.003
Neg logit contributions
ifying-0.003
 Is-0.002
 Ashley-0.002
 bit-0.002
 awoke-0.002
Token there
Token ID612
Feature activation-1.19e-01

Pos logit contributions
bled+0.039
uddled+0.038
 olds+0.036
ans+0.036
agons+0.035
Neg logit contributions
ifying-0.023
 Is-0.023
 Ashley-0.021
 awoke-0.021
 bit-0.021
Token were
Token ID547
Feature activation-1.25e+00

Pos logit contributions
bled+0.005
uddled+0.005
 olds+0.005
iggle+0.005
ails+0.005
Neg logit contributions
ifying-0.004
 Is-0.004
 Ashley-0.004
 awoke-0.004
 bit-0.004
Token two
Token ID734
Feature activation-2.01e+00

Pos logit contributions
uddled+0.065
bled+0.064
 olds+0.061
ails+0.060
uddle+0.060
Neg logit contributions
ifying-0.037
 Will-0.034
Am-0.034
 Ashley-0.033
Lou-0.033
Token friends
Token ID2460
Feature activation-4.89e-01

Pos logit contributions
bled+0.077
uddled+0.067
irled+0.064
ails+0.062
castle+0.061
Neg logit contributions
 bit-0.091
ifying-0.083
 Ashley-0.083
 eight-0.082
 total-0.082
Token,
Token ID11
Feature activation-4.41e-01

Pos logit contributions
bled+0.022
uddled+0.022
 olds+0.020
uddle+0.020
iggle+0.020
Neg logit contributions
ifying-0.017
 bit-0.016
 Is-0.016
 Ashley-0.016
 care-0.016
Token an
Token ID281
Feature activation-3.88e+00

Pos logit contributions
bled+0.020
uddled+0.020
iggle+0.018
 olds+0.018
uddle+0.018
Neg logit contributions
ifying-0.016
 Ashley-0.015
 Is-0.015
 bit-0.015
 Will-0.014
Token ostr
Token ID42387
Feature activation-1.40e+00

Pos logit contributions
uddled+0.212
bled+0.209
uddle+0.194
ails+0.180
ades+0.177
Neg logit contributions
 baby-0.116
 night-0.114
 nine-0.109
 bit-0.108
 eight-0.105
Tokenich
Token ID488
Feature activation-2.02e-02

Pos logit contributions
bled+0.048
 olds+0.047
iggle+0.045
 golf+0.044
uddled+0.044
Neg logit contributions
ifying-0.063
Am-0.060
 bit-0.057
Lou-0.056
 Debbie-0.056
Token and
Token ID290
Feature activation-1.27e-01

Pos logit contributions
bled+0.001
uddled+0.001
 olds+0.001
iggle+0.001
ails+0.001
Neg logit contributions
ifying-0.001
 Is-0.001
 Ashley-0.001
 bit-0.001
 awoke-0.001
Token a
Token ID257
Feature activation-1.59e-01

Pos logit contributions
bled+0.006
uddled+0.006
 olds+0.006
iggle+0.006
ails+0.006
Neg logit contributions
ifying-0.004
 Is-0.004
 Ashley-0.004
 bit-0.004
 awoke-0.004
Token goat
Token ID26917
Feature activation-1.33e-01

Pos logit contributions
bled+0.008
uddled+0.007
 olds+0.007
iggle+0.007
ails+0.007
Neg logit contributions
ifying-0.005
 Is-0.005
 bit-0.005
 Ashley-0.005
 awoke-0.005
Token.
Token ID13
Feature activation-2.48e+00

Pos logit contributions
bled+0.003
uddled+0.003
 olds+0.003
iggle+0.003
ails+0.003
Neg logit contributions
ifying-0.002
 Is-0.002
 Ashley-0.002
 bit-0.002
 awoke-0.002
Token Sally
Token ID25737
Feature activation-5.03e-01

Pos logit contributions
bled+0.139
bles+0.138
ini+0.136
agons+0.136
kins+0.135
Neg logit contributions
 Is-0.060
 bit-0.056
 Will-0.055
 While-0.053
ifying-0.053
Token said
Token ID531
Feature activation-7.54e-02

Pos logit contributions
bled+0.025
uddled+0.024
 olds+0.023
iggle+0.022
ails+0.022
Neg logit contributions
 bit-0.016
ifying-0.015
 Is-0.015
 awoke-0.015
 Ashley-0.015
Token,
Token ID11
Feature activation-5.40e-01

Pos logit contributions
bled+0.003
uddled+0.003
 olds+0.003
iggle+0.003
ails+0.003
Neg logit contributions
ifying-0.003
 Is-0.003
 Ashley-0.002
 bit-0.002
 awoke-0.002
Token "
Token ID366
Feature activation+4.96e-01

Pos logit contributions
bled+0.024
uddled+0.024
iggle+0.023
 olds+0.023
uddle+0.023
Neg logit contributions
ifying-0.018
 Is-0.018
 Ashley-0.017
 bit-0.017
 Will-0.017
TokenM
Token ID44
Feature activation-3.84e+00

Pos logit contributions
ifying+0.016
 awoke+0.014
 Ashley+0.014
 Is+0.014
 Debbie+0.013
Neg logit contributions
bled-0.024
uddled-0.024
 golf-0.023
ini-0.023
ails-0.023
Tokenmm
Token ID3020
Feature activation-1.85e+00

Pos logit contributions
bled+0.146
bes+0.117
uring+0.115
 burying+0.113
uddled+0.112
Neg logit contributions
ah-0.137
 single-0.137
of-0.135
Not-0.133
ifying-0.133
Token,
Token ID11
Feature activation+4.10e-02

Pos logit contributions
bled+0.109
uddled+0.105
ini+0.100
 olds+0.100
 clubhouse+0.099
Neg logit contributions
 bit-0.041
 care-0.034
ifying-0.032
 will-0.032
 Ashley-0.031
Token this
Token ID428
Feature activation+6.20e-01

Pos logit contributions
ifying+0.001
 Is+0.001
 Ashley+0.001
 awoke+0.001
 bit+0.001
Neg logit contributions
bled-0.002
uddled-0.002
 olds-0.002
iggle-0.002
ails-0.002
Token cake
Token ID12187
Feature activation+6.57e-02

Pos logit contributions
ifying+0.026
 Is+0.024
 Ashley+0.023
Lou+0.023
 distance+0.023
Neg logit contributions
bled-0.024
uddled-0.024
castle-0.023
iggle-0.022
uddle-0.022
Token is
Token ID318
Feature activation+4.06e-02

Pos logit contributions
ifying+0.002
 Is+0.002
 Ashley+0.002
 awoke+0.002
 bit+0.002
Neg logit contributions
bled-0.003
uddled-0.003
 olds-0.003
iggle-0.003
ails-0.003
Token became
Token ID2627
Feature activation-1.11e+00

Pos logit contributions
bled+0.043
uddled+0.041
 olds+0.040
ires+0.039
ails+0.038
Neg logit contributions
 bit-0.027
ifying-0.026
 awoke-0.026
 care-0.026
 stroke-0.025
Token best
Token ID1266
Feature activation-1.52e+00

Pos logit contributions
uddled+0.044
bled+0.040
 olds+0.038
uddle+0.037
iggle+0.036
Neg logit contributions
 bit-0.047
 care-0.047
 total-0.045
ifying-0.045
 single-0.045
Token friends
Token ID2460
Feature activation+5.02e-01

Pos logit contributions
bled+0.061
uddled+0.053
mm+0.050
 olds+0.050
castle+0.047
Neg logit contributions
 care-0.066
 bit-0.063
 single-0.061
 stroke-0.060
ifying-0.060
Token.
Token ID13
Feature activation-9.93e-01

Pos logit contributions
ifying+0.019
 Is+0.018
 Ashley+0.018
 Debbie+0.018
 awoke+0.018
Neg logit contributions
uddled-0.021
bled-0.020
 olds-0.019
iggle-0.019
ails-0.019
Token They
Token ID1119
Feature activation-2.05e-01

Pos logit contributions
bled+0.052
uddled+0.050
ini+0.046
castle+0.046
bles+0.045
Neg logit contributions
ifying-0.031
 Is-0.030
 Will-0.029
 Ashley-0.029
 care-0.027
Token played
Token ID2826
Feature activation-3.81e+00

Pos logit contributions
bled+0.009
uddled+0.009
 olds+0.009
iggle+0.008
ails+0.008
Neg logit contributions
ifying-0.007
 bit-0.007
 Is-0.007
 awoke-0.007
 Ashley-0.007
Token together
Token ID1978
Feature activation+1.65e-01

Pos logit contributions
ini+0.236
 olds+0.230
ower+0.222
uddled+0.214
iqu+0.213
Neg logit contributions
 away-0.089
 to-0.079
 on-0.076
 after-0.072
 heard-0.070
Token in
Token ID287
Feature activation-8.52e-01

Pos logit contributions
ifying+0.006
 Is+0.006
 Ashley+0.006
 awoke+0.006
 bit+0.005
Neg logit contributions
bled-0.007
uddled-0.007
iggle-0.007
 olds-0.007
ails-0.007
Token the
Token ID262
Feature activation-1.21e+00

Pos logit contributions
uddled+0.060
bled+0.059
uddle+0.055
ails+0.054
 olds+0.054
Neg logit contributions
ifying-0.011
 bit-0.011
 total-0.011
 distance-0.011
 care-0.010
Token park
Token ID3952
Feature activation-1.07e+00

Pos logit contributions
bled+0.061
uddled+0.056
 olds+0.053
bles+0.053
iggle+0.052
Neg logit contributions
 distance-0.045
 single-0.041
 bit-0.040
 care-0.039
 night-0.039
Token every
Token ID790
Feature activation+1.41e-01

Pos logit contributions
bled+0.057
uddled+0.055
 olds+0.053
agons+0.053
ini+0.052
Neg logit contributions
 bit-0.029
ifying-0.027
 care-0.027
 single-0.027
 Is-0.027
Token.
Token ID13
Feature activation+1.12e-01

Pos logit contributions
ifying+0.012
 Is+0.011
 Ashley+0.011
 awoke+0.011
 bit+0.011
Neg logit contributions
bled-0.014
uddled-0.014
 olds-0.014
iggle-0.013
ails-0.013
Token After
Token ID2293
Feature activation-1.34e+00

Pos logit contributions
ifying+0.004
 Is+0.003
 bit+0.003
 awoke+0.003
 Ashley+0.003
Neg logit contributions
bled-0.005
uddled-0.005
 olds-0.005
iggle-0.005
ails-0.005
Token the
Token ID262
Feature activation-2.70e+00

Pos logit contributions
uddled+0.072
uddle+0.071
ini+0.071
ans+0.070
bled+0.070
Neg logit contributions
 Is-0.035
ifying-0.035
 care-0.034
 Will-0.034
 bit-0.033
Token boy
Token ID2933
Feature activation-6.75e-01

Pos logit contributions
bled+0.133
uddled+0.131
irled+0.129
urg+0.122
irling+0.122
Neg logit contributions
 bit-0.102
 distance-0.100
 care-0.097
 night-0.089
 bite-0.088
Token finished
Token ID5201
Feature activation-1.65e+00

Pos logit contributions
bled+0.036
uddled+0.035
 olds+0.033
iggle+0.033
agons+0.033
Neg logit contributions
ifying-0.018
 bit-0.018
 Is-0.018
 awoke-0.017
 care-0.017
Token his
Token ID465
Feature activation-3.77e+00

Pos logit contributions
uddled+0.082
bled+0.078
ini+0.075
 olds+0.073
add+0.073
Neg logit contributions
 Is-0.049
 away-0.048
 care-0.048
 single-0.047
ifying-0.047
Token swing
Token ID9628
Feature activation-1.62e+00

Pos logit contributions
bled+0.134
uddled+0.124
iders+0.120
iring+0.117
add+0.117
Neg logit contributions
 care-0.177
 bit-0.173
 bite-0.171
 to-0.164
 single-0.163
Token,
Token ID11
Feature activation-2.73e-01

Pos logit contributions
bled+0.085
uddled+0.082
 olds+0.080
ires+0.079
add+0.079
Neg logit contributions
ifying-0.044
 bit-0.043
 Is-0.041
 distance-0.041
 away-0.041
Token he
Token ID339
Feature activation-4.44e-01

Pos logit contributions
bled+0.014
uddled+0.014
iggle+0.013
 olds+0.013
ails+0.013
Neg logit contributions
ifying-0.008
 Is-0.007
 Ashley-0.007
 bit-0.007
 awoke-0.007
Token shared
Token ID4888
Feature activation-2.38e-01

Pos logit contributions
bled+0.022
uddled+0.021
 olds+0.021
iggle+0.020
ails+0.020
Neg logit contributions
ifying-0.014
 bit-0.013
 Is-0.013
 awoke-0.013
 Ashley-0.012
Token the
Token ID262
Feature activation-3.21e+00

Pos logit contributions
bled+0.013
uddled+0.013
 olds+0.013
iggle+0.013
ails+0.013
Neg logit contributions
ifying-0.006
 Is-0.005
 bit-0.005
 Ashley-0.005
 awoke-0.005
TokenBob
Token ID18861
Feature activation-9.65e-02

Pos logit contributions
ifying+0.010
 Is+0.009
 bit+0.009
 Ashley+0.009
 awoke+0.009
Neg logit contributions
bled-0.014
uddled-0.014
 olds-0.013
iggle-0.013
ails-0.013
Token was
Token ID373
Feature activation+4.23e-02

Pos logit contributions
bled+0.004
uddled+0.004
 olds+0.004
iggle+0.004
ails+0.004
Neg logit contributions
ifying-0.003
 Is-0.003
 bit-0.003
 Ashley-0.003
 awoke-0.003
Token so
Token ID523
Feature activation-1.59e-01

Pos logit contributions
ifying+0.002
 Is+0.001
 Ashley+0.001
 bit+0.001
 awoke+0.001
Neg logit contributions
bled-0.002
uddled-0.002
 olds-0.002
iggle-0.002
ails-0.002
Token happy
Token ID3772
Feature activation-1.77e-02

Pos logit contributions
bled+0.005
uddled+0.005
 olds+0.005
ails+0.005
iggle+0.005
Neg logit contributions
ifying-0.008
 Is-0.007
 bit-0.007
 Ashley-0.007
 awoke-0.007
Token with
Token ID351
Feature activation-8.90e-01

Pos logit contributions
bled+0.001
uddled+0.001
 olds+0.001
iggle+0.001
ails+0.001
Neg logit contributions
ifying-0.001
 Is-0.001
 Ashley-0.001
 bit-0.001
 awoke-0.001
Token his
Token ID465
Feature activation-3.72e+00

Pos logit contributions
uddled+0.057
uddle+0.054
bled+0.054
ini+0.053
ails+0.052
Neg logit contributions
 care-0.016
 Is-0.015
ifying-0.015
 Ashley-0.015
 Debbie-0.014
Token cool
Token ID3608
Feature activation-3.14e+00

Pos logit contributions
uddled+0.127
bled+0.126
mm+0.119
rim+0.119
ator+0.115
Neg logit contributions
 care-0.174
 bite-0.158
 single-0.158
 try-0.158
 bit-0.158
Token golf
Token ID13126
Feature activation-3.16e+00

Pos logit contributions
bled+0.106
uddled+0.096
mm+0.094
ires+0.087
add+0.087
Neg logit contributions
 care-0.158
 single-0.147
 bit-0.146
 away-0.142
 to-0.139
Token shot
Token ID2823
Feature activation-5.52e-01

Pos logit contributions
add+0.143
bled+0.140
urg+0.137
ires+0.137
 olds+0.136
Neg logit contributions
 bit-0.110
 away-0.104
 will-0.104
 single-0.099
 distance-0.098
Token.
Token ID13
Feature activation-3.32e-01

Pos logit contributions
bled+0.027
uddled+0.027
 olds+0.025
iggle+0.025
ails+0.025
Neg logit contributions
ifying-0.018
 Is-0.016
 bit-0.016
 awoke-0.016
 Ashley-0.015
Token He
Token ID679
Feature activation-6.43e-01

Pos logit contributions
bled+0.017
uddled+0.016
iggle+0.015
 olds+0.015
uddle+0.015
Neg logit contributions
ifying-0.010
 Is-0.010
 Ashley-0.009
 awoke-0.009
 bit-0.009
Token
Token ID198
Feature activation+2.74e-01

Pos logit contributions
bled+0.017
uddled+0.016
iggle+0.015
 olds+0.015
castle+0.015
Neg logit contributions
ifying-0.007
 Is-0.007
 Ashley-0.006
 awoke-0.006
 bit-0.006
Token
Token ID198
Feature activation+4.35e-01

Pos logit contributions
ifying+0.008
 Is+0.007
 bit+0.007
 awoke+0.007
 Ashley+0.007
Neg logit contributions
bled-0.014
uddled-0.014
iggle-0.013
 olds-0.013
ails-0.013
TokenOne
Token ID3198
Feature activation+5.01e-01

Pos logit contributions
ifying+0.011
 Is+0.010
 Ashley+0.010
 awoke+0.010
 bit+0.010
Neg logit contributions
bled-0.023
uddled-0.023
 golf-0.023
 olds-0.022
ails-0.022
Token day
Token ID1110
Feature activation+1.23e+00

Pos logit contributions
ifying+0.019
 Is+0.018
 Ashley+0.017
 Debbie+0.017
 awoke+0.017
Neg logit contributions
uddled-0.020
bled-0.020
iggle-0.020
ails-0.019
 golf-0.019
Token,
Token ID11
Feature activation+4.13e-01

Pos logit contributions
ifying+0.065
Am+0.059
 Debbie+0.059
 distance+0.059
Jonathan+0.059
Neg logit contributions
bled-0.035
uddled-0.034
 golf-0.033
ails-0.031
iggle-0.031
Token an
Token ID281
Feature activation-3.71e+00

Pos logit contributions
ifying+0.014
 awoke+0.012
 Ashley+0.012
 Is+0.012
 Debbie+0.012
Neg logit contributions
bled-0.020
uddled-0.019
 golf-0.019
 olds-0.018
iggle-0.018
Token especially
Token ID2592
Feature activation-1.59e+00

Pos logit contributions
uddled+0.203
bled+0.192
uddle+0.179
ades+0.172
irled+0.167
Neg logit contributions
 night-0.122
 bit-0.117
 awoke-0.113
 single-0.112
 month-0.111
Token brave
Token ID14802
Feature activation-1.11e+00

Pos logit contributions
bled+0.072
uddled+0.070
castle+0.067
mm+0.066
agons+0.065
Neg logit contributions
 bit-0.057
 single-0.055
ifying-0.054
 care-0.053
 heard-0.052
Token and
Token ID290
Feature activation-9.22e-01

Pos logit contributions
bled+0.063
uddled+0.061
irled+0.056
mm+0.054
castle+0.054
Neg logit contributions
 bit-0.031
ifying-0.029
 single-0.029
 awoke-0.028
 Is-0.028
Token curious
Token ID11040
Feature activation-5.99e-02

Pos logit contributions
uddled+0.041
bled+0.040
iggle+0.038
ails+0.037
uddle+0.037
Neg logit contributions
 bit-0.035
ifying-0.034
 Is-0.034
 care-0.033
 Ashley-0.033
Token squirrel
Token ID33039
Feature activation-1.43e-01

Pos logit contributions
bled+0.003
uddled+0.003
 olds+0.003
iggle+0.003
ails+0.003
Neg logit contributions
ifying-0.002
 Is-0.002
 Ashley-0.002
 bit-0.002
 awoke-0.002
Token
Token ID198
Feature activation+7.24e-01

Pos logit contributions
bled+0.063
uddled+0.062
 olds+0.059
castle+0.059
iggle+0.058
Neg logit contributions
ifying-0.021
 Is-0.020
 Ashley-0.019
 Debbie-0.018
Am-0.017
Token
Token ID198
Feature activation+7.76e-01

Pos logit contributions
ifying+0.026
 bit+0.023
 awoke+0.022
 Is+0.022
 Ashley+0.021
Neg logit contributions
uddled-0.033
bled-0.033
ails-0.031
iggle-0.031
 olds-0.031
TokenOne
Token ID3198
Feature activation-1.01e-01

Pos logit contributions
ifying+0.025
 awoke+0.021
 Is+0.021
 bit+0.021
 Debbie+0.020
Neg logit contributions
uddled-0.037
 golf-0.037
bled-0.036
 olds-0.035
ails-0.035
Token day
Token ID1110
Feature activation+1.26e+00

Pos logit contributions
bled+0.005
uddled+0.005
 olds+0.005
iggle+0.005
ails+0.005
Neg logit contributions
ifying-0.003
 Is-0.003
 bit-0.003
 Ashley-0.003
 awoke-0.003
Token,
Token ID11
Feature activation+9.12e-01

Pos logit contributions
ifying+0.065
 distance+0.060
Jonathan+0.059
 awoke+0.058
 total+0.057
Neg logit contributions
uddled-0.036
 golf-0.035
bled-0.035
ails-0.032
 olds-0.031
Token an
Token ID281
Feature activation-3.70e+00

Pos logit contributions
ifying+0.032
 awoke+0.030
Jonathan+0.028
 Is+0.027
 single+0.027
Neg logit contributions
bled-0.041
uddled-0.040
 golf-0.040
 olds-0.038
iggle-0.037
Token unhappy
Token ID19283
Feature activation-8.89e-01

Pos logit contributions
uddled+0.184
bled+0.172
ades+0.166
uddle+0.163
kins+0.155
Neg logit contributions
 night-0.131
 bit-0.123
 single-0.122
 baby-0.121
 awoke-0.118
Token thing
Token ID1517
Feature activation+6.65e-01

Pos logit contributions
bled+0.047
uddled+0.045
 olds+0.041
ails+0.041
iggle+0.041
Neg logit contributions
 bit-0.026
 Is-0.025
 awoke-0.025
ifying-0.025
 single-0.025
Token happened
Token ID3022
Feature activation-7.70e-01

Pos logit contributions
ifying+0.024
 Ashley+0.022
 Is+0.022
 Debbie+0.021
Jonathan+0.021
Neg logit contributions
uddled-0.030
bled-0.030
iggle-0.027
 olds-0.027
ails-0.027
Token.
Token ID13
Feature activation+1.39e-01

Pos logit contributions
bled+0.040
uddled+0.039
 olds+0.038
castle+0.038
uddle+0.038
Neg logit contributions
ifying-0.021
 Is-0.019
 bit-0.019
 single-0.019
 awoke-0.018
Token The
Token ID383
Feature activation-1.99e+00

Pos logit contributions
ifying+0.004
 Is+0.004
 awoke+0.004
 bit+0.004
 Ashley+0.004
Neg logit contributions
bled-0.007
uddled-0.007
 olds-0.006
iggle-0.006
ails-0.006
Token for
Token ID329
Feature activation-8.75e-01

Pos logit contributions
ifying+0.006
 Is+0.005
 Ashley+0.005
 awoke+0.005
 bit+0.005
Neg logit contributions
bled-0.007
uddled-0.007
 olds-0.006
iggle-0.006
ails-0.006
Token his
Token ID465
Feature activation-2.62e+00

Pos logit contributions
uddled+0.040
uddle+0.040
 olds+0.038
ails+0.038
ans+0.037
Neg logit contributions
 care-0.029
 Is-0.029
ifying-0.029
 bit-0.028
 Will-0.028
Token help
Token ID1037
Feature activation+4.15e-02

Pos logit contributions
bled+0.147
uddled+0.142
ires+0.138
iring+0.133
ails+0.132
Neg logit contributions
 care-0.081
 bit-0.077
 bite-0.072
 single-0.069
 to-0.067
Token.
Token ID13
Feature activation-6.97e-01

Pos logit contributions
ifying+0.001
 Is+0.001
 Ashley+0.001
 awoke+0.001
 bit+0.001
Neg logit contributions
bled-0.002
uddled-0.002
 olds-0.002
iggle-0.002
ails-0.002
Token They
Token ID1119
Feature activation-6.91e-01

Pos logit contributions
bled+0.039
uddled+0.037
iggle+0.036
castle+0.036
bles+0.035
Neg logit contributions
ifying-0.017
 Is-0.017
 Ashley-0.017
 Will-0.016
 Debbie-0.015
Token played
Token ID2826
Feature activation-3.70e+00

Pos logit contributions
bled+0.033
uddled+0.032
 olds+0.032
ires+0.031
ails+0.031
Neg logit contributions
ifying-0.022
 bit-0.022
 awoke-0.021
 care-0.020
 Is-0.020
Token golf
Token ID13126
Feature activation-2.95e+00

Pos logit contributions
ini+0.202
ower+0.184
mm+0.182
rim+0.178
urg+0.178
Neg logit contributions
 away-0.131
 after-0.115
 to-0.106
 the-0.105
 care-0.104
Token together
Token ID1978
Feature activation-1.29e-01

Pos logit contributions
bled+0.136
ini+0.134
ires+0.133
uddled+0.132
add+0.132
Neg logit contributions
 away-0.098
 bit-0.093
 will-0.087
 heard-0.084
 care-0.083
Token and
Token ID290
Feature activation-1.08e+00

Pos logit contributions
bled+0.006
uddled+0.006
 olds+0.005
iggle+0.005
ails+0.005
Neg logit contributions
ifying-0.005
 Is-0.004
 bit-0.004
 Ashley-0.004
 awoke-0.004
Token had
Token ID550
Feature activation-8.01e-01

Pos logit contributions
uddled+0.063
bled+0.062
ires+0.062
agons+0.061
uddle+0.061
Neg logit contributions
 bit-0.024
 care-0.023
 Is-0.022
 Will-0.022
ifying-0.022
Token a
Token ID257
Feature activation-1.33e-01

Pos logit contributions
bled+0.040
uddled+0.040
ails+0.038
ires+0.037
iggle+0.037
Neg logit contributions
ifying-0.024
 bit-0.023
 care-0.023
 total-0.022
 awoke-0.022
Token.
Token ID13
Feature activation-2.32e+00

Pos logit contributions
bled+0.003
uddled+0.003
 olds+0.003
iggle+0.003
ails+0.003
Neg logit contributions
ifying-0.002
 Is-0.002
 bit-0.002
 Ashley-0.002
 awoke-0.002
Token Sally
Token ID25737
Feature activation-4.72e-01

Pos logit contributions
bled+0.129
bles+0.128
agons+0.126
ini+0.126
kins+0.125
Neg logit contributions
 Is-0.058
 bit-0.054
 Will-0.053
ifying-0.052
 While-0.049
Token said
Token ID531
Feature activation-2.51e-02

Pos logit contributions
bled+0.023
uddled+0.022
 olds+0.021
iggle+0.021
ails+0.021
Neg logit contributions
 bit-0.015
ifying-0.014
 Is-0.014
 awoke-0.014
 Ashley-0.014
Token,
Token ID11
Feature activation-4.57e-01

Pos logit contributions
bled+0.001
uddled+0.001
 olds+0.001
iggle+0.001
ails+0.001
Neg logit contributions
ifying-0.001
 Is-0.001
 Ashley-0.001
 bit-0.001
 awoke-0.001
Token "
Token ID366
Feature activation+6.27e-01

Pos logit contributions
bled+0.020
uddled+0.020
iggle+0.019
 olds+0.019
uddle+0.019
Neg logit contributions
ifying-0.016
 Is-0.016
 Ashley-0.015
 bit-0.015
 awoke-0.015
TokenM
Token ID44
Feature activation-3.69e+00

Pos logit contributions
ifying+0.020
 awoke+0.019
 Ashley+0.019
 Is+0.018
 Debbie+0.018
Neg logit contributions
bled-0.030
uddled-0.029
ini-0.028
 golf-0.028
ails-0.027
Tokenmm
Token ID3020
Feature activation-1.84e+00

Pos logit contributions
bled+0.138
bes+0.108
uring+0.107
 burying+0.107
uddled+0.105
Neg logit contributions
ah-0.135
 single-0.133
of-0.132
ifying-0.132
 total-0.130
Token,
Token ID11
Feature activation-2.20e-04

Pos logit contributions
bled+0.108
uddled+0.104
ini+0.100
 olds+0.100
 clubhouse+0.098
Neg logit contributions
 bit-0.040
 care-0.033
 will-0.031
ifying-0.031
 Ashley-0.031
Token this
Token ID428
Feature activation+5.51e-01

Pos logit contributions
bled+0.000
uddled+0.000
 olds+0.000
iggle+0.000
ails+0.000
Neg logit contributions
ifying-0.000
 Is-0.000
 Ashley-0.000
 awoke-0.000
 bit-0.000
Token cake
Token ID12187
Feature activation+2.70e-03

Pos logit contributions
ifying+0.023
 Is+0.021
 Ashley+0.020
Lou+0.020
Jonathan+0.020
Neg logit contributions
bled-0.022
uddled-0.022
castle-0.020
iggle-0.020
uddle-0.020
Token is
Token ID318
Feature activation+3.66e-02

Pos logit contributions
ifying+0.000
 Is+0.000
 Ashley+0.000
 awoke+0.000
 bit+0.000
Neg logit contributions
bled-0.000
uddled-0.000
 olds-0.000
iggle-0.000
ails-0.000
Token wanted
Token ID2227
Feature activation+4.89e-01

Pos logit contributions
bled+0.011
uddled+0.010
 olds+0.010
iggle+0.010
ails+0.010
Neg logit contributions
ifying-0.004
 Is-0.004
 bit-0.004
 awoke-0.004
 Ashley-0.004
Token to
Token ID284
Feature activation-3.14e-01

Pos logit contributions
ifying+0.011
 Is+0.009
 Ashley+0.008
 awoke+0.008
Jonathan+0.008
Neg logit contributions
bled-0.029
uddled-0.028
 olds-0.028
 golf-0.028
ails-0.027
Token make
Token ID787
Feature activation-1.50e+00

Pos logit contributions
bled+0.016
uddled+0.016
 olds+0.015
ails+0.015
iggle+0.015
Neg logit contributions
ifying-0.009
 Is-0.009
 bit-0.008
 Ashley-0.008
 care-0.008
Token his
Token ID465
Feature activation-1.08e+00

Pos logit contributions
uddled+0.093
bled+0.090
ails+0.085
agons+0.085
 olds+0.085
Neg logit contributions
 care-0.029
 Ashley-0.027
 Is-0.026
 Will-0.026
 away-0.025
Token mum
Token ID25682
Feature activation+3.53e-02

Pos logit contributions
bled+0.053
uddled+0.053
ails+0.048
 olds+0.047
bles+0.047
Neg logit contributions
 bit-0.035
 distance-0.035
 single-0.034
 care-0.034
ifying-0.034
Token an
Token ID281
Feature activation-3.68e+00

Pos logit contributions
ifying+0.001
 Is+0.001
 Ashley+0.001
 awoke+0.001
 bit+0.001
Neg logit contributions
bled-0.002
uddled-0.001
 olds-0.001
iggle-0.001
ails-0.001
Token honest
Token ID5508
Feature activation-8.98e-01

Pos logit contributions
uddled+0.171
bled+0.168
bles+0.157
ails+0.155
kins+0.154
Neg logit contributions
 bit-0.128
 care-0.121
 night-0.121
 away-0.120
 bite-0.117
Token offer
Token ID2897
Feature activation+3.90e-01

Pos logit contributions
bled+0.034
uddled+0.033
ires+0.029
 olds+0.029
ails+0.029
Neg logit contributions
 bit-0.039
 care-0.037
ifying-0.037
 Is-0.037
 single-0.037
Token.
Token ID13
Feature activation+5.22e-01

Pos logit contributions
ifying+0.014
 Is+0.013
 Ashley+0.013
 awoke+0.013
 bit+0.012
Neg logit contributions
bled-0.017
uddled-0.017
 olds-0.016
iggle-0.016
ails-0.016
Token
Token ID220
Feature activation-1.47e+00

Pos logit contributions
ifying+0.021
 bit+0.019
 awoke+0.019
 Is+0.019
 Ashley+0.019
Neg logit contributions
bled-0.021
uddled-0.021
 olds-0.020
ails-0.019
iggle-0.019
Token
Token ID198
Feature activation+5.98e-02

Pos logit contributions
bled+0.090
uddled+0.089
castle+0.087
 olds+0.086
iggle+0.085
Neg logit contributions
 Is-0.025
 Ashley-0.024
ifying-0.023
 Debbie-0.022
 Will-0.021
Token in
Token ID287
Feature activation+1.47e-01

Pos logit contributions
uddled+0.109
bled+0.107
ini+0.106
mm+0.106
 olds+0.106
Neg logit contributions
 away-0.050
 bit-0.048
 care-0.046
 Is-0.044
 Will-0.043
Token the
Token ID262
Feature activation-1.85e+00

Pos logit contributions
ifying+0.004
 Is+0.003
 Ashley+0.003
 awoke+0.003
 bit+0.003
Neg logit contributions
bled-0.008
uddled-0.008
 olds-0.008
iggle-0.008
 golf-0.008
Token mud
Token ID17492
Feature activation-1.30e+00

Pos logit contributions
bled+0.092
uddled+0.089
bles+0.084
ails+0.083
iggle+0.083
Neg logit contributions
 distance-0.072
 night-0.063
 single-0.062
ifying-0.061
 bit-0.061
Token.
Token ID13
Feature activation+4.11e-01

Pos logit contributions
bled+0.065
uddled+0.063
 olds+0.062
agons+0.060
ires+0.059
Neg logit contributions
 bit-0.038
ifying-0.037
 distance-0.036
 awoke-0.034
 single-0.034
Token What
Token ID1867
Feature activation-1.86e-01

Pos logit contributions
ifying+0.015
 bit+0.014
 awoke+0.013
 Ashley+0.013
 Is+0.013
Neg logit contributions
bled-0.018
uddled-0.018
 olds-0.017
ails-0.017
iggle-0.017
Token an
Token ID281
Feature activation-3.68e+00

Pos logit contributions
bled+0.009
uddled+0.009
 olds+0.009
iggle+0.009
ails+0.009
Neg logit contributions
ifying-0.006
 Is-0.005
 Ashley-0.005
 awoke-0.005
 bit-0.005
Token adventure
Token ID8855
Feature activation-1.12e+00

Pos logit contributions
uddled+0.188
bled+0.177
kins+0.169
mons+0.168
bles+0.164
Neg logit contributions
 night-0.126
 care-0.119
 single-0.119
 need-0.113
 distance-0.110
Token!
Token ID0
Feature activation+6.80e-01

Pos logit contributions
bled+0.058
uddled+0.057
 olds+0.053
iggle+0.053
add+0.053
Neg logit contributions
ifying-0.032
 Is-0.030
 awoke-0.030
 bit-0.030
 Ashley-0.029
Token<|endoftext|>
Token ID50256
Feature activation+6.70e-01

Pos logit contributions
ifying+0.023
 bit+0.022
 awoke+0.021
 total+0.020
 Ashley+0.020
Neg logit contributions
bled-0.031
uddled-0.030
ails-0.029
 olds-0.029
 golf-0.029
TokenOnce
Token ID7454
Feature activation+6.13e-01

Pos logit contributions
ifying+0.019
 bit+0.018
 Is+0.017
 Ashley+0.017
 care+0.017
Neg logit contributions
bled-0.034
uddled-0.033
ails-0.032
 olds-0.032
iggle-0.031
Token upon
Token ID2402
Feature activation+2.70e-02

Pos logit contributions
ifying+0.016
 Is+0.014
 bit+0.014
 Ashley+0.014
 awoke+0.013
Neg logit contributions
bled-0.034
uddled-0.033
iggle-0.032
ails-0.032
 olds-0.031
Token selfish
Token ID20363
Feature activation+1.30e-01

Pos logit contributions
bled+0.013
uddled+0.012
iggle+0.011
 olds+0.011
ails+0.011
Neg logit contributions
ifying-0.013
 bit-0.013
 Is-0.013
 Ashley-0.013
 care-0.013
Token.
Token ID13
Feature activation-1.31e+00

Pos logit contributions
ifying+0.005
 Is+0.004
 Ashley+0.004
 awoke+0.004
 bit+0.004
Neg logit contributions
bled-0.006
uddled-0.006
 olds-0.006
iggle-0.005
ails-0.005
Token One
Token ID1881
Feature activation-3.06e-01

Pos logit contributions
bled+0.082
uddled+0.078
irled+0.072
bles+0.072
agons+0.072
Neg logit contributions
 Is-0.027
 Will-0.026
 Ashley-0.026
ifying-0.025
 Mae-0.024
Token day
Token ID1110
Feature activation+5.68e-01

Pos logit contributions
bled+0.017
uddled+0.016
 olds+0.016
bles+0.015
iggle+0.015
Neg logit contributions
ifying-0.008
 bit-0.008
 Is-0.008
 single-0.008
 awoke-0.008
Token,
Token ID11
Feature activation+1.93e-02

Pos logit contributions
ifying+0.025
 awoke+0.023
 distance+0.022
Jonathan+0.022
 Ashley+0.022
Neg logit contributions
bled-0.022
uddled-0.021
ails-0.020
iggle-0.020
 golf-0.020
Token M
Token ID337
Feature activation-3.65e+00

Pos logit contributions
ifying+0.001
 Is+0.001
 Ashley+0.001
 awoke+0.001
 bit+0.001
Neg logit contributions
bled-0.001
uddled-0.001
 olds-0.001
iggle-0.001
ails-0.001
Tokenandy
Token ID10757
Feature activation-1.83e+00

Pos logit contributions
bled+0.180
oving+0.160
 golf+0.152
yard+0.149
renches+0.147
Neg logit contributions
bie-0.114
 night-0.101
y-0.101
 Mae-0.099
 Will-0.095
Token got
Token ID1392
Feature activation-2.00e-03

Pos logit contributions
bled+0.109
ires+0.100
agons+0.099
uddled+0.099
iggle+0.097
Neg logit contributions
 bit-0.046
 awoke-0.042
 heard-0.041
 woke-0.039
 Ashley-0.038
Token an
Token ID281
Feature activation-2.50e+00

Pos logit contributions
bled+0.000
uddled+0.000
 olds+0.000
iggle+0.000
ails+0.000
Neg logit contributions
ifying-0.000
 Is-0.000
 Ashley-0.000
 awoke-0.000
 bit-0.000
Token invitation
Token ID17023
Feature activation+3.43e-01

Pos logit contributions
uddled+0.102
bled+0.101
bles+0.088
ails+0.088
uddle+0.088
Neg logit contributions
 bit-0.104
 care-0.098
 single-0.097
 night-0.095
 away-0.093
Token to
Token ID284
Feature activation-1.93e+00

Pos logit contributions
ifying+0.009
 Is+0.008
 Ashley+0.007
 awoke+0.007
 Debbie+0.007
Neg logit contributions
uddled-0.019
bled-0.019
iggle-0.018
 olds-0.018
ails-0.018
Token.
Token ID13
Feature activation-1.54e-01

Pos logit contributions
bled+0.014
uddled+0.014
 olds+0.013
ails+0.013
uddle+0.013
Neg logit contributions
ifying-0.009
 Is-0.009
 Ashley-0.008
 bit-0.008
 awoke-0.008
Token
Token ID220
Feature activation-1.30e+00

Pos logit contributions
bled+0.007
uddled+0.007
iggle+0.006
 olds+0.006
ails+0.006
Neg logit contributions
ifying-0.006
 Is-0.005
 Ashley-0.005
 awoke-0.005
 bit-0.005
Token
Token ID198
Feature activation+4.84e-01

Pos logit contributions
bled+0.079
uddled+0.079
castle+0.076
 olds+0.076
ini+0.075
Neg logit contributions
 Is-0.022
ifying-0.022
 Ashley-0.020
 Will-0.019
Jonathan-0.018
TokenIt
Token ID1026
Feature activation-4.21e-01

Pos logit contributions
ifying+0.016
 bit+0.015
 Is+0.014
 awoke+0.014
 Ashley+0.014
Neg logit contributions
bled-0.023
uddled-0.023
 olds-0.021
iggle-0.021
ails-0.021
Token was
Token ID373
Feature activation-4.65e-01

Pos logit contributions
bled+0.021
uddled+0.020
 olds+0.019
ails+0.019
iggle+0.019
Neg logit contributions
ifying-0.013
 bit-0.013
 Is-0.013
 awoke-0.012
 Ashley-0.012
Token an
Token ID281
Feature activation-3.64e+00

Pos logit contributions
uddled+0.024
bled+0.024
ails+0.022
 olds+0.022
uddle+0.022
Neg logit contributions
ifying-0.014
 Is-0.013
 bit-0.012
 awoke-0.012
 Ashley-0.012
Token important
Token ID1593
Feature activation-2.29e+00

Pos logit contributions
uddled+0.181
bled+0.168
uddle+0.156
kins+0.149
ails+0.149
Neg logit contributions
 night-0.133
 bit-0.132
 care-0.126
 single-0.122
 distance-0.121
Token day
Token ID1110
Feature activation-1.30e-01

Pos logit contributions
bled+0.128
uddled+0.117
ires+0.109
 olds+0.108
irled+0.107
Neg logit contributions
 single-0.076
 care-0.074
 bit-0.071
 night-0.066
 month-0.065
Token for
Token ID329
Feature activation-1.36e+00

Pos logit contributions
bled+0.006
uddled+0.006
 olds+0.006
iggle+0.006
ails+0.006
Neg logit contributions
ifying-0.004
 Is-0.004
 bit-0.004
 Ashley-0.004
 awoke-0.004
Token the
Token ID262
Feature activation-2.10e+00

Pos logit contributions
uddled+0.078
uddle+0.074
bled+0.074
ini+0.071
irled+0.070
Neg logit contributions
 Will-0.033
 care-0.033
ifying-0.032
 Is-0.031
 bit-0.031
Token man
Token ID582
Feature activation+2.61e-01

Pos logit contributions
irled+0.103
bled+0.102
uddled+0.102
iggle+0.094
add+0.092
Neg logit contributions
 distance-0.076
 care-0.075
 bit-0.073
 night-0.071
 single-0.066
Token a
Token ID257
Feature activation-7.90e-01

Pos logit contributions
ifying+0.004
 Is+0.004
 awoke+0.004
 Ashley+0.004
 bit+0.004
Neg logit contributions
bled-0.006
uddled-0.006
 olds-0.005
iggle-0.005
 golf-0.005
Token very
Token ID845
Feature activation-2.04e+00

Pos logit contributions
bled+0.039
uddled+0.037
 olds+0.035
ails+0.035
iggle+0.034
Neg logit contributions
 bit-0.026
ifying-0.026
 single-0.025
 Ashley-0.024
 total-0.024
Token special
Token ID2041
Feature activation-1.07e+00

Pos logit contributions
bled+0.107
uddled+0.103
 olds+0.091
urg+0.090
icking+0.090
Neg logit contributions
 single-0.067
 bit-0.067
 care-0.064
 month-0.060
 minute-0.058
Token friend
Token ID1545
Feature activation+2.62e-01

Pos logit contributions
bled+0.053
uddled+0.050
 olds+0.048
ires+0.046
iggle+0.046
Neg logit contributions
 bit-0.034
 single-0.033
ifying-0.033
 care-0.032
 awoke-0.031
Token,
Token ID11
Feature activation-3.22e-01

Pos logit contributions
ifying+0.011
 Is+0.010
 awoke+0.010
 Ashley+0.009
 bit+0.009
Neg logit contributions
bled-0.010
uddled-0.010
 olds-0.010
ails-0.010
iggle-0.010
Token an
Token ID281
Feature activation-3.63e+00

Pos logit contributions
bled+0.016
uddled+0.016
 olds+0.015
iggle+0.015
ails+0.015
Neg logit contributions
ifying-0.010
 Is-0.009
 Ashley-0.009
 bit-0.009
 awoke-0.009
Token old
Token ID1468
Feature activation-8.62e-01

Pos logit contributions
uddled+0.186
bled+0.183
uddle+0.170
ades+0.165
kins+0.160
Neg logit contributions
 night-0.119
 baby-0.112
 awoke-0.108
 bit-0.107
Am-0.106
Token turtle
Token ID28699
Feature activation+7.12e-01

Pos logit contributions
bled+0.045
uddled+0.043
iggle+0.041
 olds+0.040
irled+0.040
Neg logit contributions
 bit-0.025
ifying-0.024
 Ashley-0.024
 Is-0.023
 awoke-0.023
Token.
Token ID13
Feature activation+5.09e-01

Pos logit contributions
ifying+0.033
 Is+0.031
 Ashley+0.030
 awoke+0.030
 Debbie+0.030
Neg logit contributions
uddled-0.024
bled-0.023
ails-0.023
 olds-0.022
 golf-0.022
Token She
Token ID1375
Feature activation-3.19e-01

Pos logit contributions
ifying+0.020
 bit+0.018
 awoke+0.018
 single+0.017
 Is+0.017
Neg logit contributions
uddled-0.020
 olds-0.020
ails-0.020
bled-0.020
 golf-0.019
Token liked
Token ID8288
Feature activation+8.57e-01

Pos logit contributions
bled+0.015
uddled+0.015
 olds+0.014
iggle+0.014
ails+0.014
Neg logit contributions
ifying-0.011
 bit-0.010
 Is-0.010
 awoke-0.010
 Ashley-0.010
Token One
Token ID1881
Feature activation-2.78e-01

Pos logit contributions
bled+0.027
uddled+0.026
iggle+0.025
castle+0.024
 olds+0.024
Neg logit contributions
ifying-0.013
 Is-0.012
 Ashley-0.011
 Will-0.011
 awoke-0.011
Token day
Token ID1110
Feature activation+6.47e-01

Pos logit contributions
bled+0.015
uddled+0.014
 olds+0.014
iggle+0.013
ails+0.013
Neg logit contributions
ifying-0.008
 bit-0.007
 Is-0.007
 awoke-0.007
 single-0.007
Token,
Token ID11
Feature activation+4.27e-01

Pos logit contributions
ifying+0.029
 distance+0.027
 awoke+0.026
 Is+0.026
 Ashley+0.026
Neg logit contributions
bled-0.024
uddled-0.024
ails-0.022
iggle-0.022
 golf-0.022
Token there
Token ID612
Feature activation-1.40e-01

Pos logit contributions
ifying+0.019
 awoke+0.018
 Is+0.018
 Ashley+0.018
 Debbie+0.017
Neg logit contributions
bled-0.015
uddled-0.015
 golf-0.014
 olds-0.013
ails-0.013
Token was
Token ID373
Feature activation+1.30e-01

Pos logit contributions
bled+0.006
uddled+0.006
 olds+0.006
iggle+0.006
ails+0.006
Neg logit contributions
ifying-0.005
 Is-0.005
 awoke-0.004
 bit-0.004
 Ashley-0.004
Token an
Token ID281
Feature activation-3.61e+00

Pos logit contributions
ifying+0.004
 Is+0.004
 awoke+0.004
 Ashley+0.004
 bit+0.004
Neg logit contributions
bled-0.006
uddled-0.006
 olds-0.006
iggle-0.006
ails-0.006
Token earthquake
Token ID16295
Feature activation+3.18e-02

Pos logit contributions
uddled+0.187
bled+0.175
uddle+0.164
ades+0.159
irled+0.157
Neg logit contributions
 night-0.128
 bit-0.119
 month-0.112
 baby-0.112
 distance-0.110
Token and
Token ID290
Feature activation+4.81e-01

Pos logit contributions
ifying+0.001
 Is+0.001
 Ashley+0.001
 awoke+0.001
 bit+0.001
Neg logit contributions
bled-0.001
uddled-0.001
 olds-0.001
iggle-0.001
ails-0.001
Token it
Token ID340
Feature activation+6.71e-01

Pos logit contributions
ifying+0.014
 Is+0.012
 awoke+0.012
 Ashley+0.012
Am+0.012
Neg logit contributions
bled-0.025
uddled-0.024
 olds-0.023
iggle-0.023
 golf-0.023
Token made
Token ID925
Feature activation-3.60e-01

Pos logit contributions
ifying+0.024
 Debbie+0.022
 Ashley+0.022
 Is+0.022
Jonathan+0.021
Neg logit contributions
bled-0.031
uddled-0.030
iggle-0.028
irled-0.028
 olds-0.028
Token them
Token ID606
Feature activation+1.20e+00

Pos logit contributions
uddled+0.019
bled+0.019
 olds+0.018
castle+0.018
uddle+0.018
Neg logit contributions
ifying-0.010
 Is-0.009
 Ashley-0.009
 bit-0.009
 care-0.009
Token come
Token ID1282
Feature activation-1.67e+00

Pos logit contributions
bled+0.062
uddled+0.061
 olds+0.059
ails+0.057
ires+0.056
Neg logit contributions
 Is-0.030
 care-0.029
ifying-0.029
 Ashley-0.029
 bit-0.028
Token and
Token ID290
Feature activation-2.10e-01

Pos logit contributions
bled+0.101
uddled+0.100
mm+0.094
 olds+0.093
agons+0.092
Neg logit contributions
ifying-0.034
 Is-0.032
 away-0.032
 care-0.031
 total-0.030
Token play
Token ID711
Feature activation-2.23e+00

Pos logit contributions
bled+0.011
uddled+0.010
 olds+0.010
iggle+0.010
ails+0.010
Neg logit contributions
ifying-0.006
 Is-0.006
 bit-0.006
 Ashley-0.006
 awoke-0.006
Token.
Token ID13
Feature activation+1.38e+00

Pos logit contributions
bled+0.105
uddled+0.101
ini+0.100
 olds+0.098
agons+0.098
Neg logit contributions
 away-0.078
 bit-0.073
 care-0.071
 heard-0.070
 Is-0.069
Token What
Token ID1867
Feature activation+6.46e-02

Pos logit contributions
ifying+0.053
 bit+0.053
 awoke+0.050
 trace+0.049
 total+0.047
Neg logit contributions
ails-0.052
 olds-0.051
uddled-0.051
 golf-0.050
bled-0.049
Token an
Token ID281
Feature activation-3.60e+00

Pos logit contributions
ifying+0.002
 Is+0.002
 Ashley+0.002
 awoke+0.002
 bit+0.002
Neg logit contributions
bled-0.003
uddled-0.003
 olds-0.003
iggle-0.003
ails-0.003
Token incredible
Token ID8082
Feature activation-8.44e-01

Pos logit contributions
bled+0.172
uddled+0.169
bles+0.162
kins+0.161
mons+0.153
Neg logit contributions
 night-0.119
 month-0.116
 care-0.115
 single-0.111
 fright-0.109
Token key
Token ID1994
Feature activation-1.16e+00

Pos logit contributions
bled+0.044
uddled+0.042
 olds+0.039
bles+0.039
ails+0.039
Neg logit contributions
 bit-0.026
 single-0.025
 care-0.024
 Is-0.024
ifying-0.024
Token!
Token ID0
Feature activation+1.41e+00

Pos logit contributions
bled+0.060
uddled+0.058
 olds+0.055
iggle+0.055
add+0.055
Neg logit contributions
ifying-0.034
 Is-0.032
 bit-0.031
 awoke-0.031
 Ashley-0.029
Token<|endoftext|>
Token ID50256
Feature activation+5.21e-01

Pos logit contributions
 bit+0.054
ifying+0.054
 trace+0.051
 awoke+0.050
 total+0.048
Neg logit contributions
ails-0.053
uddled-0.053
bled-0.052
 olds-0.051
mm-0.051
TokenOnce
Token ID7454
Feature activation+7.79e-01

Pos logit contributions
ifying+0.015
 bit+0.014
 Is+0.013
 Ashley+0.013
 care+0.013
Neg logit contributions
bled-0.027
uddled-0.026
ails-0.025
 olds-0.025
iggle-0.025
Token said
Token ID531
Feature activation-1.18e+00

Pos logit contributions
uddled+0.072
bled+0.071
bles+0.068
agons+0.067
 olds+0.067
Neg logit contributions
 care-0.050
 bit-0.049
 Is-0.048
 stroke-0.047
Oh-0.046
Token Sally
Token ID25737
Feature activation+9.17e-01

Pos logit contributions
uddled+0.047
bled+0.044
uddle+0.043
agons+0.043
 olds+0.043
Neg logit contributions
 Is-0.047
 care-0.047
 Ashley-0.047
 bit-0.046
ifying-0.046
Token.
Token ID13
Feature activation-1.13e+00

Pos logit contributions
ifying+0.039
 awoke+0.034
 Is+0.034
 total+0.033
Oh+0.033
Neg logit contributions
bled-0.037
uddled-0.035
 olds-0.035
iggle-0.035
ails-0.033
Token
Token ID198
Feature activation-1.41e+00

Pos logit contributions
bled+0.070
uddled+0.068
agons+0.067
castle+0.066
bles+0.065
Neg logit contributions
 Is-0.022
 Ashley-0.021
 Will-0.020
 Debbie-0.020
ifying-0.018
Token
Token ID198
Feature activation-1.65e+00

Pos logit contributions
bled+0.094
castle+0.090
 olds+0.089
uddled+0.089
agons+0.087
Neg logit contributions
 Ashley-0.018
Lou-0.017
Am-0.017
 Is-0.016
 Debbie-0.016
TokenM
Token ID44
Feature activation-3.58e+00

Pos logit contributions
bled+0.104
 olds+0.099
iggle+0.097
uddled+0.097
agons+0.097
Neg logit contributions
R-0.026
Am-0.025
Lou-0.025
Oh-0.024
where-0.023
Tokenum
Token ID388
Feature activation-8.57e-01

Pos logit contributions
bled+0.160
bes+0.156
ts+0.153
 structures+0.150
 olds+0.148
Neg logit contributions
 single-0.098
by-0.090
of-0.089
Not-0.088
 care-0.086
Token smiled
Token ID13541
Feature activation-2.32e-01

Pos logit contributions
bled+0.041
uddled+0.040
 olds+0.040
agons+0.040
ails+0.040
Neg logit contributions
 bit-0.026
 awoke-0.025
ifying-0.025
 Is-0.024
 Ashley-0.024
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
bled+0.010
uddled+0.010
 olds+0.010
ails+0.010
iggle+0.010
Neg logit contributions
ifying-0.008
 Is-0.008
 bit-0.008
 awoke-0.008
 Ashley-0.007
Token said
Token ID531
Feature activation-6.59e-01

Pos logit contributions
agons+0.049
bled+0.049
uddled+0.047
ails+0.047
ini+0.047
Neg logit contributions
 bit-0.031
 Is-0.030
 Ashley-0.030
 awoke-0.029
 Debbie-0.029
Token,
Token ID11
Feature activation-7.03e-01

Pos logit contributions
bled+0.032
uddled+0.031
agons+0.031
ini+0.031
 olds+0.031
Neg logit contributions
ifying-0.020
 Is-0.019
 bit-0.019
 Ashley-0.018
 awoke-0.018
Token it
Token ID340
Feature activation+8.37e-01

Pos logit contributions
ifying+0.012
 awoke+0.010
 distance+0.010
 Is+0.010
 bit+0.010
Neg logit contributions
bled-0.021
uddled-0.020
iggle-0.020
 olds-0.020
ails-0.019
Token to
Token ID284
Feature activation-7.74e-01

Pos logit contributions
ifying+0.019
 Is+0.017
 Ashley+0.017
Jonathan+0.017
 awoke+0.017
Neg logit contributions
bled-0.048
uddled-0.047
 olds-0.046
iggle-0.045
bles-0.045
Token mom
Token ID1995
Feature activation+4.84e-01

Pos logit contributions
uddled+0.040
bled+0.039
uddle+0.037
ails+0.037
 olds+0.036
Neg logit contributions
ifying-0.023
 Ashley-0.022
 Is-0.022
 Will-0.022
 care-0.021
Tokenmy
Token ID1820
Feature activation+9.01e-01

Pos logit contributions
ifying+0.015
 Is+0.013
 Ashley+0.013
 bit+0.013
 awoke+0.013
Neg logit contributions
bled-0.024
uddled-0.023
 olds-0.023
iggle-0.023
ails-0.022
Token and
Token ID290
Feature activation-2.76e-03

Pos logit contributions
ifying+0.033
 Is+0.030
Lou+0.029
 distance+0.029
Oh+0.029
Neg logit contributions
bled-0.039
uddled-0.038
 olds-0.037
iggle-0.037
ails-0.037
Token daddy
Token ID35822
Feature activation+2.16e+00

Pos logit contributions
bled+0.000
uddled+0.000
 olds+0.000
iggle+0.000
ails+0.000
Neg logit contributions
ifying-0.000
 Is-0.000
 Ashley-0.000
 awoke-0.000
 bit-0.000
Token!"
Token ID2474
Feature activation-1.59e-01

Pos logit contributions
ifying+0.093
Oh+0.091
Lou+0.090
 distance+0.087
 single+0.085
Neg logit contributions
bles-0.075
iggle-0.074
bled-0.073
 olds-0.071
castle-0.071
Token They
Token ID1119
Feature activation-4.11e-01

Pos logit contributions
bled+0.007
uddled+0.007
 olds+0.006
iggle+0.006
ails+0.006
Neg logit contributions
ifying-0.006
 Is-0.006
 Ashley-0.006
 bit-0.006
 awoke-0.005
Token both
Token ID1111
Feature activation-4.92e-01

Pos logit contributions
bled+0.019
uddled+0.019
 olds+0.019
ails+0.018
iggle+0.018
Neg logit contributions
ifying-0.013
 bit-0.012
 Is-0.012
 awoke-0.012
 Ashley-0.012
Token agreed
Token ID4987
Feature activation-2.06e-01

Pos logit contributions
bled+0.024
uddled+0.024
 olds+0.023
ails+0.023
castle+0.022
Neg logit contributions
ifying-0.015
 bit-0.015
 awoke-0.014
 Is-0.014
 Ashley-0.014
Token,
Token ID11
Feature activation-1.91e-01

Pos logit contributions
bled+0.009
uddled+0.009
 olds+0.009
iggle+0.009
ails+0.009
Neg logit contributions
ifying-0.007
 Is-0.007
 bit-0.007
 Ashley-0.006
 awoke-0.006
Token to
Token ID284
Feature activation+1.62e+00

Pos logit contributions
ifying+0.035
 Is+0.029
Lou+0.029
 awoke+0.029
Oh+0.029
Neg logit contributions
bled-0.059
uddled-0.059
 olds-0.057
 golf-0.057
ails-0.057
Token her
Token ID607
Feature activation+6.14e-01

Pos logit contributions
ifying+0.078
ah+0.061
inea+0.060
 awoke+0.060
 bit+0.060
Neg logit contributions
iggle-0.060
 golf-0.058
 olds-0.054
ires-0.052
 clubhouse-0.049
Token bed
Token ID3996
Feature activation+8.14e-02

Pos logit contributions
ifying+0.029
 awoke+0.027
 Is+0.026
 care+0.026
 Debbie+0.025
Neg logit contributions
uddled-0.020
bled-0.020
 olds-0.019
uddle-0.019
iggle-0.019
Token.
Token ID13
Feature activation-1.65e-01

Pos logit contributions
ifying+0.003
 Is+0.003
 Ashley+0.003
 awoke+0.003
 bit+0.003
Neg logit contributions
bled-0.004
uddled-0.004
 olds-0.003
iggle-0.003
ails-0.003
Token She
Token ID1375
Feature activation+1.27e+00

Pos logit contributions
bled+0.009
uddled+0.008
 olds+0.008
iggle+0.008
ails+0.008
Neg logit contributions
ifying-0.005
 Is-0.004
 Ashley-0.004
 bit-0.004
 awoke-0.004
Token turned
Token ID2900
Feature activation+2.72e+00

Pos logit contributions
ifying+0.050
 single+0.042
 distance+0.040
 total+0.039
Oh+0.039
Neg logit contributions
uddled-0.056
bled-0.053
irled-0.051
iggle-0.049
uddle-0.049
Token on
Token ID319
Feature activation+1.69e+00

Pos logit contributions
ifying+0.133
Pen+0.123
Jonathan+0.123
Albert+0.122
R+0.121
Neg logit contributions
 olds-0.078
bled-0.077
iggle-0.076
 old-0.075
uddled-0.073
Token her
Token ID607
Feature activation+5.70e-01

Pos logit contributions
ifying+0.072
inea+0.059
 awoke+0.058
Oh+0.058
use+0.058
Neg logit contributions
 golf-0.066
 olds-0.065
iggle-0.062
uddled-0.061
bled-0.060
Token lamp
Token ID20450
Feature activation+8.76e-01

Pos logit contributions
ifying+0.018
 Is+0.016
 awoke+0.016
 Ashley+0.015
 Debbie+0.015
Neg logit contributions
uddled-0.028
bled-0.028
 olds-0.026
iggle-0.026
uddle-0.026
Token and
Token ID290
Feature activation+1.46e+00

Pos logit contributions
ifying+0.038
 Is+0.035
 Debbie+0.034
Lou+0.034
Oh+0.034
Neg logit contributions
uddled-0.032
bled-0.032
iggle-0.030
 olds-0.030
castle-0.030
Token opened
Token ID4721
Feature activation+2.26e+00

Pos logit contributions
ifying+0.055
 single+0.049
 total+0.046
Pen+0.046
Am+0.045
Neg logit contributions
bled-0.066
uddled-0.063
irled-0.058
 golf-0.058
iggle-0.057
Token.
Token ID13
Feature activation-1.30e+00

Pos logit contributions
ifying+0.002
 Is+0.002
 Ashley+0.002
 awoke+0.002
 bit+0.002
Neg logit contributions
bled-0.002
uddled-0.002
 olds-0.002
iggle-0.002
ails-0.002
Token Sam
Token ID3409
Feature activation+4.00e-01

Pos logit contributions
bled+0.069
uddled+0.068
castle+0.065
ini+0.064
agons+0.063
Neg logit contributions
 Is-0.038
ifying-0.036
 Will-0.034
Lou-0.033
 Ashley-0.033
Token and
Token ID290
Feature activation+5.47e-01

Pos logit contributions
ifying+0.015
 Is+0.014
 single+0.013
 Ashley+0.013
 awoke+0.013
Neg logit contributions
uddled-0.017
bled-0.017
 olds-0.016
iggle-0.016
ails-0.016
Token Mia
Token ID32189
Feature activation+1.77e+00

Pos logit contributions
ifying+0.024
 awoke+0.021
 distance+0.020
 Is+0.020
Am+0.020
Neg logit contributions
bled-0.021
 olds-0.021
uddled-0.020
iggle-0.020
ails-0.020
Token do
Token ID466
Feature activation-4.41e-01

Pos logit contributions
ifying+0.067
 single+0.062
 Ashley+0.061
 Debbie+0.061
Oh+0.060
Neg logit contributions
iggle-0.072
bled-0.067
 olds-0.067
bles-0.065
 golf-0.065
Token not
Token ID407
Feature activation+1.98e+00

Pos logit contributions
uddled+0.016
bled+0.016
 olds+0.015
uddle+0.015
ini+0.015
Neg logit contributions
ifying-0.019
 Is-0.019
 bit-0.018
 care-0.018
 total-0.018
Token understand
Token ID1833
Feature activation+3.95e-01

Pos logit contributions
ifying+0.089
Pen+0.078
Oh+0.078
 Ashley+0.078
Lou+0.078
Neg logit contributions
iggle-0.074
 golf-0.072
bles-0.071
 olds-0.069
ails-0.068
Token.
Token ID13
Feature activation-4.27e-01

Pos logit contributions
ifying+0.014
 awoke+0.013
 Ashley+0.012
 Is+0.012
 bit+0.012
Neg logit contributions
bled-0.018
uddled-0.018
 olds-0.017
iggle-0.017
ails-0.017
Token They
Token ID1119
Feature activation+1.41e+00

Pos logit contributions
bled+0.019
uddled+0.019
iggle+0.017
 olds+0.017
castle+0.017
Neg logit contributions
ifying-0.015
 Is-0.015
 Ashley-0.014
 bit-0.014
 single-0.014
Token decide
Token ID5409
Feature activation+3.16e-01

Pos logit contributions
ifying+0.052
 single+0.048
 Ashley+0.047
 awoke+0.046
Am+0.046
Neg logit contributions
iggle-0.058
bled-0.056
 olds-0.055
 golf-0.054
uddled-0.054
Token to
Token ID284
Feature activation-6.18e-02

Pos logit contributions
ifying+0.006
 awoke+0.005
 Is+0.005
 Ashley+0.005
 single+0.005
Neg logit contributions
bled-0.019
uddled-0.019
 olds-0.019
 golf-0.018
ails-0.018
Token sun
Token ID4252
Feature activation+9.56e-02

Pos logit contributions
ifying+0.028
 Ashley+0.026
 awoke+0.026
 Debbie+0.026
 Is+0.026
Neg logit contributions
bled-0.027
 golf-0.027
 olds-0.027
castle-0.027
uddled-0.027
Token.
Token ID13
Feature activation+9.01e-01

Pos logit contributions
ifying+0.003
 Is+0.003
 Ashley+0.003
 awoke+0.003
 bit+0.003
Neg logit contributions
bled-0.004
uddled-0.004
 olds-0.004
iggle-0.004
ails-0.004
Token But
Token ID887
Feature activation+1.05e+00

Pos logit contributions
ifying+0.034
 distance+0.030
 bit+0.030
 single+0.029
 awoke+0.029
Neg logit contributions
uddled-0.040
bled-0.039
 olds-0.038
ails-0.037
iggle-0.037
Token Ben
Token ID3932
Feature activation-4.58e-01

Pos logit contributions
ifying+0.046
 distance+0.040
 single+0.039
 awoke+0.039
Am+0.039
Neg logit contributions
bled-0.043
uddled-0.040
iggle-0.038
 golf-0.037
irled-0.037
Token felt
Token ID2936
Feature activation+3.06e-01

Pos logit contributions
bled+0.022
uddled+0.021
 olds+0.021
iggle+0.020
ails+0.020
Neg logit contributions
ifying-0.014
 bit-0.014
 Is-0.014
 awoke-0.013
 Ashley-0.013
Token a
Token ID257
Feature activation+1.94e+00

Pos logit contributions
ifying+0.013
 Ashley+0.012
 Is+0.012
 awoke+0.012
 Debbie+0.012
Neg logit contributions
bled-0.012
uddled-0.011
iggle-0.011
 olds-0.011
 golf-0.011
Token little
Token ID1310
Feature activation+1.14e+00

Pos logit contributions
 Debbie+0.078
 Ashley+0.078
ifying+0.075
 Is+0.073
Jonathan+0.073
Neg logit contributions
iggle-0.075
bled-0.074
uddled-0.072
uddle-0.070
 golf-0.067
Token dizz
Token ID29527
Feature activation+2.33e-01

Pos logit contributions
ifying+0.051
Jonathan+0.050
 Debbie+0.050
 Ashley+0.049
Am+0.049
Neg logit contributions
iggle-0.038
 golf-0.037
uddle-0.037
uddled-0.037
bled-0.036
Tokeny
Token ID88
Feature activation+5.04e-01

Pos logit contributions
ifying+0.004
 Ashley+0.003
 Is+0.003
 awoke+0.003
 bit+0.003
Neg logit contributions
bled-0.015
uddled-0.015
iggle-0.014
 olds-0.014
ails-0.014
Token,
Token ID11
Feature activation-4.30e-01

Pos logit contributions
ifying+0.020
 Ashley+0.018
 Is+0.018
 Debbie+0.018
 awoke+0.018
Neg logit contributions
bled-0.020
uddled-0.020
iggle-0.019
 golf-0.019
 olds-0.019
Token so
Token ID523
Feature activation-2.33e-02

Pos logit contributions
bled+0.021
uddled+0.020
 olds+0.019
ails+0.019
iggle+0.019
Neg logit contributions
ifying-0.014
 Is-0.013
 bit-0.013
 Ashley-0.013
 awoke-0.013
TokenWhen
Token ID2215
Feature activation+3.21e-01

Pos logit contributions
bled+0.007
uddled+0.007
 olds+0.007
iggle+0.007
ails+0.007
Neg logit contributions
ifying-0.005
 Is-0.005
 bit-0.005
 Ashley-0.005
 awoke-0.005
Token they
Token ID484
Feature activation+2.04e+00

Pos logit contributions
ifying+0.016
 Ashley+0.015
 Is+0.015
 awoke+0.015
Lou+0.015
Neg logit contributions
bled-0.010
uddled-0.009
 golf-0.009
 olds-0.008
iggle-0.008
Token got
Token ID1392
Feature activation+1.07e+00

Pos logit contributions
Lou+0.073
Jonathan+0.070
ifying+0.070
 Ashley+0.069
Pen+0.067
Neg logit contributions
add-0.085
uddled-0.083
bled-0.081
iggle-0.078
 golf-0.078
Token to
Token ID284
Feature activation+1.21e+00

Pos logit contributions
ifying+0.035
 Is+0.032
Lou+0.032
Jonathan+0.031
 Ashley+0.030
Neg logit contributions
bled-0.053
uddled-0.052
 golf-0.049
uddle-0.047
ails-0.047
Token the
Token ID262
Feature activation+2.40e+00

Pos logit contributions
ifying+0.046
 awoke+0.038
Lou+0.038
 single+0.037
 bit+0.037
Neg logit contributions
 golf-0.053
iggle-0.050
 olds-0.049
bled-0.049
castle-0.047
Token bar
Token ID2318
Feature activation+2.17e+00

Pos logit contributions
ifying+0.084
Lou+0.083
ges+0.074
ff+0.072
 awoke+0.072
Neg logit contributions
uddled-0.102
uddle-0.101
 golf-0.100
ires-0.099
 swings-0.095
Tokenber
Token ID527
Feature activation+1.46e+00

Pos logit contributions
ifying+0.090
Lou+0.081
 awoke+0.079
 total+0.076
 stroke+0.076
Neg logit contributions
uddled-0.089
ails-0.087
ires-0.085
uddle-0.083
castle-0.082
Token,
Token ID11
Feature activation-1.56e-01

Pos logit contributions
ifying+0.059
Lou+0.053
Oh+0.053
Am+0.051
 single+0.051
Neg logit contributions
uddled-0.057
uddle-0.055
ails-0.054
bled-0.053
 golf-0.053
Token Tim
Token ID5045
Feature activation+9.71e-01

Pos logit contributions
bled+0.005
uddled+0.005
 olds+0.005
iggle+0.005
ails+0.005
Neg logit contributions
ifying-0.007
 Is-0.007
 bit-0.007
 Ashley-0.007
 awoke-0.007
Tokenmy
Token ID1820
Feature activation+2.28e-01

Pos logit contributions
ifying+0.037
 Is+0.031
 single+0.030
 Debbie+0.030
 Ashley+0.030
Neg logit contributions
uddled-0.043
bled-0.042
iggle-0.041
ini-0.040
uddle-0.040
Token saw
Token ID2497
Feature activation+3.49e-02

Pos logit contributions
ifying+0.007
 Is+0.006
 Ashley+0.006
 single+0.006
 bit+0.006
Neg logit contributions
bled-0.012
uddled-0.012
 olds-0.011
iggle-0.011
ails-0.011
Token book
Token ID1492
Feature activation-3.12e-01

Pos logit contributions
ifying+0.038
 Is+0.032
 Ashley+0.032
 Debbie+0.032
Am+0.032
Neg logit contributions
uddled-0.056
 golf-0.055
 olds-0.055
iggle-0.054
uddle-0.054
Token to
Token ID284
Feature activation-4.95e-01

Pos logit contributions
bled+0.015
uddled+0.014
 olds+0.013
iggle+0.013
ails+0.013
Neg logit contributions
ifying-0.011
 Is-0.010
 bit-0.010
 Ashley-0.010
 awoke-0.009
Token each
Token ID1123
Feature activation+1.07e+00

Pos logit contributions
uddled+0.027
bled+0.026
 olds+0.024
uddle+0.024
ails+0.024
Neg logit contributions
ifying-0.014
 Is-0.013
 bit-0.013
 Ashley-0.012
 care-0.012
Token other
Token ID584
Feature activation+6.55e-01

Pos logit contributions
ifying+0.048
 Is+0.043
 Debbie+0.042
Lou+0.042
 Ashley+0.041
Neg logit contributions
iggle-0.037
 golf-0.035
bled-0.035
ires-0.035
ails-0.034
Token.
Token ID13
Feature activation+5.82e-01

Pos logit contributions
ifying+0.025
 Is+0.023
 Ashley+0.022
 awoke+0.022
 Debbie+0.022
Neg logit contributions
bled-0.027
uddled-0.027
 olds-0.026
iggle-0.026
castle-0.025
Token They
Token ID1119
Feature activation+1.62e+00

Pos logit contributions
ifying+0.024
 bit+0.023
 awoke+0.022
 Is+0.022
 distance+0.021
Neg logit contributions
uddled-0.022
bled-0.022
ails-0.021
 olds-0.021
iggle-0.020
Token were
Token ID547
Feature activation+1.28e+00

Pos logit contributions
ifying+0.064
Oh+0.058
 Is+0.057
 single+0.055
 Ashley+0.054
Neg logit contributions
bled-0.065
uddled-0.063
iggle-0.062
castle-0.061
irled-0.059
Token both
Token ID1111
Feature activation+1.66e+00

Pos logit contributions
ifying+0.054
use+0.049
 awoke+0.048
 Is+0.048
 bit+0.047
Neg logit contributions
bled-0.050
iggle-0.045
uddled-0.045
 olds-0.044
castle-0.044
Token happy
Token ID3772
Feature activation+4.87e-01

Pos logit contributions
ifying+0.072
use+0.069
 awoke+0.069
Lou+0.067
Jonathan+0.067
Neg logit contributions
bled-0.054
 olds-0.052
uddled-0.051
iggle-0.051
castle-0.050
Token that
Token ID326
Feature activation-1.68e-01

Pos logit contributions
ifying+0.021
 Is+0.019
 Ashley+0.018
 awoke+0.018
 bit+0.018
Neg logit contributions
uddled-0.019
bled-0.019
 olds-0.018
iggle-0.018
castle-0.017
Token they
Token ID484
Feature activation+1.19e+00

Pos logit contributions
bled+0.007
uddled+0.007
 olds+0.007
iggle+0.007
ails+0.007
Neg logit contributions
ifying-0.006
 Is-0.006
 Ashley-0.006
 bit-0.006
 awoke-0.006
Token I
Token ID314
Feature activation+3.34e-01

Pos logit contributions
bled+0.006
uddled+0.006
 olds+0.006
iggle+0.006
ails+0.006
Neg logit contributions
ifying-0.004
 Is-0.004
 Ashley-0.004
 awoke-0.004
 bit-0.004
Token am
Token ID716
Feature activation+1.96e-02

Pos logit contributions
ifying+0.009
 Ashley+0.008
 Is+0.008
 Debbie+0.007
 awoke+0.007
Neg logit contributions
bled-0.018
uddled-0.018
iggle-0.017
 olds-0.017
ails-0.017
Token proud
Token ID6613
Feature activation+5.56e-01

Pos logit contributions
ifying+0.001
 Is+0.001
 Ashley+0.001
 awoke+0.001
 bit+0.001
Neg logit contributions
bled-0.001
uddled-0.001
 olds-0.001
iggle-0.001
ails-0.001
Token of
Token ID286
Feature activation-7.30e-01

Pos logit contributions
ifying+0.022
 awoke+0.020
 Ashley+0.019
 Is+0.019
 Debbie+0.019
Neg logit contributions
uddled-0.023
bled-0.023
 olds-0.022
ails-0.021
iggle-0.021
Token you
Token ID345
Feature activation+5.65e-01

Pos logit contributions
uddled+0.036
bled+0.035
uddle+0.034
 olds+0.033
ails+0.033
Neg logit contributions
ifying-0.023
 Is-0.022
 Ashley-0.022
 care-0.021
 Debbie-0.021
Token.
Token ID13
Feature activation+1.35e-01

Pos logit contributions
ifying+0.020
 Is+0.019
 awoke+0.018
 bit+0.018
 Ashley+0.018
Neg logit contributions
bled-0.025
uddled-0.024
 olds-0.023
iggle-0.023
ails-0.022
Tokenâ
Token ID22940
Feature activation-1.20e+00

Pos logit contributions
ifying+0.004
 Is+0.004
 bit+0.004
 awoke+0.004
 Ashley+0.004
Neg logit contributions
bled-0.006
uddled-0.006
 olds-0.006
iggle-0.006
ails-0.006
Token€
Token ID26391
Feature activation+3.73e-01

Pos logit contributions
uddled+0.058
bled+0.058
bles+0.054
uddle+0.054
castle+0.053
Neg logit contributions
 care-0.038
ifying-0.037
 awoke-0.037
 bit-0.036
 Is-0.036
Token
Token ID198
Feature activation-1.21e+00

Pos logit contributions
ifying+0.012
 Is+0.010
 bit+0.010
 awoke+0.010
 Ashley+0.010
Neg logit contributions
bled-0.018
uddled-0.018
 olds-0.017
ails-0.017
iggle-0.017
Token
Token ID198
Feature activation-2.63e-02

Pos logit contributions
bled+0.074
uddled+0.073
castle+0.071
 olds+0.069
bles+0.068
Neg logit contributions
 Is-0.022
 Ashley-0.022
 Debbie-0.020
ifying-0.020
Am-0.020
TokenThe
Token ID464
Feature activation-5.77e-02

Pos logit contributions
bled+0.001
uddled+0.001
 olds+0.001
iggle+0.001
ails+0.001
Neg logit contributions
ifying-0.001
 Is-0.001
 Ashley-0.001
 bit-0.001
 awoke-0.001
Token She
Token ID1375
Feature activation-5.33e-01

Pos logit contributions
ifying+0.012
 bit+0.011
 awoke+0.011
 Is+0.011
 single+0.011
Neg logit contributions
bled-0.016
uddled-0.016
 olds-0.016
ails-0.015
iggle-0.015
Token said
Token ID531
Feature activation+9.43e-02

Pos logit contributions
bled+0.026
uddled+0.026
 olds+0.024
iggle+0.024
ails+0.024
Neg logit contributions
ifying-0.016
 bit-0.016
 Is-0.016
 awoke-0.016
 Ashley-0.015
Token,
Token ID11
Feature activation+5.29e-01

Pos logit contributions
ifying+0.003
 Is+0.003
 Ashley+0.003
 bit+0.003
 awoke+0.003
Neg logit contributions
bled-0.005
uddled-0.004
 olds-0.004
iggle-0.004
ails-0.004
Token "
Token ID366
Feature activation+1.32e+00

Pos logit contributions
ifying+0.023
 awoke+0.021
 bit+0.021
 distance+0.021
 Ashley+0.020
Neg logit contributions
bled-0.020
uddled-0.019
 olds-0.018
ails-0.018
iggle-0.018
TokenAlways
Token ID30374
Feature activation-9.45e-02

Pos logit contributions
ifying+0.045
 bit+0.041
 awoke+0.041
 Ashley+0.039
 single+0.039
Neg logit contributions
bled-0.058
uddled-0.057
ini-0.057
 golf-0.053
ized-0.053
Token mind
Token ID2000
Feature activation+1.25e+00

Pos logit contributions
bled+0.005
uddled+0.004
 olds+0.004
iggle+0.004
ails+0.004
Neg logit contributions
ifying-0.003
 Is-0.003
 Ashley-0.003
 bit-0.003
 awoke-0.003
Token your
Token ID534
Feature activation+1.05e+00

Pos logit contributions
ifying+0.045
 awoke+0.044
 Is+0.042
 bit+0.042
 single+0.041
Neg logit contributions
bled-0.053
 olds-0.051
uddled-0.051
ails-0.049
bles-0.049
Token surroundings
Token ID21334
Feature activation+3.69e-01

Pos logit contributions
ifying+0.049
 Is+0.046
 awoke+0.046
 Ashley+0.045
 Debbie+0.044
Neg logit contributions
uddled-0.034
 olds-0.034
bled-0.033
uddle-0.033
iggle-0.033
Token and
Token ID290
Feature activation-5.73e-02

Pos logit contributions
ifying+0.014
 Is+0.013
 awoke+0.012
 Ashley+0.012
 bit+0.012
Neg logit contributions
bled-0.016
uddled-0.016
 olds-0.015
iggle-0.015
ails-0.015
Token be
Token ID307
Feature activation-4.84e-02

Pos logit contributions
bled+0.003
uddled+0.003
 olds+0.003
iggle+0.003
ails+0.003
Neg logit contributions
ifying-0.002
 Is-0.002
 Ashley-0.002
 bit-0.002
 awoke-0.002
Token prepared
Token ID5597
Feature activation+2.84e-01

Pos logit contributions
bled+0.002
uddled+0.002
 olds+0.002
iggle+0.002
ails+0.002
Neg logit contributions
ifying-0.002
 Is-0.002
 Ashley-0.002
 bit-0.002
 awoke-0.002
Token's
Token ID338
Feature activation+2.94e-01

Pos logit contributions
ifying+0.008
 Is+0.007
 Ashley+0.006
 single+0.006
 awoke+0.006
Neg logit contributions
bled-0.013
uddled-0.013
iggle-0.012
 olds-0.012
uddle-0.012
Token have
Token ID423
Feature activation+1.43e+00

Pos logit contributions
ifying+0.010
 Is+0.009
 Ashley+0.009
 awoke+0.009
 single+0.009
Neg logit contributions
bled-0.014
uddled-0.013
iggle-0.013
 olds-0.013
castle-0.012
Token some
Token ID617
Feature activation+6.33e-01

Pos logit contributions
ifying+0.068
 Is+0.063
 Ashley+0.061
Jonathan+0.061
Lou+0.060
Neg logit contributions
bled-0.046
 golf-0.046
uddled-0.046
 olds-0.042
mm-0.042
Token."
Token ID526
Feature activation-3.53e-01

Pos logit contributions
ifying+0.027
 Is+0.026
 Ashley+0.026
Lou+0.025
Jonathan+0.025
Neg logit contributions
uddled-0.024
bled-0.023
 olds-0.022
iggle-0.022
ails-0.022
Token
Token ID198
Feature activation-6.95e-01

Pos logit contributions
bled+0.020
uddled+0.020
castle+0.019
 olds+0.019
iggle+0.019
Neg logit contributions
ifying-0.008
 Is-0.008
 Ashley-0.008
 Debbie-0.007
 bit-0.007
Token
Token ID198
Feature activation+3.16e-01

Pos logit contributions
bled+0.041
uddled+0.040
castle+0.039
 olds+0.039
iggle+0.038
Neg logit contributions
ifying-0.014
 Is-0.013
 Ashley-0.013
 Debbie-0.012
Am-0.012
Token"
Token ID1
Feature activation+6.60e-01

Pos logit contributions
ifying+0.013
 awoke+0.012
 Is+0.011
 single+0.011
 bit+0.011
Neg logit contributions
bled-0.013
uddled-0.013
 olds-0.012
iggle-0.012
ails-0.012
TokenNo
Token ID2949
Feature activation+3.18e-01

Pos logit contributions
ifying+0.023
 awoke+0.022
 Ashley+0.022
 Debbie+0.020
 Is+0.020
Neg logit contributions
bled-0.030
uddled-0.030
 golf-0.027
 olds-0.027
bles-0.027
Token,
Token ID11
Feature activation+1.02e+00

Pos logit contributions
ifying+0.014
 Is+0.012
 awoke+0.012
 distance+0.012
 Ashley+0.012
Neg logit contributions
bled-0.012
uddled-0.012
 olds-0.012
iggle-0.012
uddle-0.011
Token we
Token ID356
Feature activation+1.57e+00

Pos logit contributions
ifying+0.040
 distance+0.037
 single+0.037
 awoke+0.037
Lou+0.035
Neg logit contributions
bled-0.041
mm-0.039
uddled-0.038
 golf-0.038
 olds-0.038
Token can
Token ID460
Feature activation+4.04e-01

Pos logit contributions
 Debbie+0.055
 Ashley+0.055
Jonathan+0.053
Am+0.053
Pen+0.052
Neg logit contributions
uddle-0.068
 golf-0.067
iggle-0.066
bled-0.066
bles-0.066
Token
Token ID198
Feature activation+5.76e-01

Pos logit contributions
bled+0.050
uddled+0.050
 olds+0.048
castle+0.047
iggle+0.046
Neg logit contributions
ifying-0.018
 Is-0.017
 Ashley-0.016
 awoke-0.015
 Debbie-0.015
Token
Token ID198
Feature activation+3.78e-01

Pos logit contributions
ifying+0.020
 bit+0.017
 Is+0.017
 awoke+0.017
 Ashley+0.017
Neg logit contributions
bled-0.027
uddled-0.027
ails-0.026
iggle-0.026
 olds-0.025
TokenTim
Token ID14967
Feature activation+1.31e+00

Pos logit contributions
ifying+0.010
 Ashley+0.008
 Is+0.008
 bit+0.008
 awoke+0.008
Neg logit contributions
bled-0.021
uddled-0.021
 golf-0.020
ails-0.020
 olds-0.019
Token would
Token ID561
Feature activation+8.19e-01

Pos logit contributions
ifying+0.051
 Is+0.042
Oh+0.039
 single+0.039
 total+0.038
Neg logit contributions
uddled-0.059
bled-0.057
iggle-0.056
irled-0.056
ans-0.055
Token often
Token ID1690
Feature activation+1.21e+00

Pos logit contributions
ifying+0.028
 Is+0.024
Oh+0.024
 single+0.023
 Ashley+0.023
Neg logit contributions
iggle-0.038
bled-0.037
uddled-0.037
 olds-0.036
castle-0.035
Token stand
Token ID1302
Feature activation-1.37e-01

Pos logit contributions
ifying+0.041
 Is+0.036
Oh+0.036
 Ashley+0.035
Am+0.035
Neg logit contributions
iggle-0.055
bled-0.053
 olds-0.052
uddled-0.051
 golf-0.051
Token at
Token ID379
Feature activation+3.58e-01

Pos logit contributions
bled+0.006
uddled+0.005
 olds+0.005
iggle+0.005
ails+0.005
Neg logit contributions
ifying-0.005
 Is-0.005
 Ashley-0.005
 bit-0.005
 awoke-0.005
Token his
Token ID465
Feature activation+1.76e-01

Pos logit contributions
ifying+0.010
 Is+0.009
 awoke+0.008
 Ashley+0.008
 bit+0.008
Neg logit contributions
bled-0.019
uddled-0.019
iggle-0.018
 olds-0.018
ails-0.018
Token window
Token ID4324
Feature activation+9.82e-01

Pos logit contributions
ifying+0.008
 Is+0.007
 awoke+0.007
 Ashley+0.007
 bit+0.007
Neg logit contributions
bled-0.007
uddled-0.007
 olds-0.006
iggle-0.006
uddle-0.006
Token and
Token ID290
Feature activation+1.66e+00

Pos logit contributions
ifying+0.036
 Is+0.033
 Ashley+0.033
Am+0.032
 Debbie+0.032
Neg logit contributions
bled-0.042
uddled-0.042
castle-0.040
iggle-0.040
ails-0.039
Token make
Token ID787
Feature activation-2.18e-01

Pos logit contributions
ifying+0.060
Am+0.052
Oh+0.052
 single+0.051
 total+0.050
Neg logit contributions
bled-0.069
iggle-0.069
castle-0.067
 golf-0.066
uddled-0.065
Token to
Token ID284
Feature activation-1.38e+00

Pos logit contributions
ifying+0.020
 Ashley+0.019
 Is+0.019
 awoke+0.019
Jonathan+0.019
Neg logit contributions
bled-0.052
uddled-0.050
 olds-0.049
iggle-0.049
 golf-0.049
Token get
Token ID651
Feature activation+7.15e-01

Pos logit contributions
uddled+0.084
bled+0.079
uddle+0.077
 olds+0.076
ails+0.075
Neg logit contributions
 care-0.035
 bit-0.034
 Is-0.032
 stroke-0.032
 distance-0.032
Token out
Token ID503
Feature activation+3.54e-01

Pos logit contributions
ifying+0.020
 Ashley+0.019
 awoke+0.019
Jonathan+0.019
 Is+0.018
Neg logit contributions
bled-0.038
uddled-0.035
 golf-0.035
ails-0.034
 olds-0.034
Token of
Token ID286
Feature activation+2.52e-01

Pos logit contributions
ifying+0.012
 Is+0.012
 Ashley+0.011
 awoke+0.011
 Debbie+0.011
Neg logit contributions
bled-0.016
uddled-0.016
 olds-0.015
iggle-0.015
ails-0.015
Token the
Token ID262
Feature activation+1.09e-01

Pos logit contributions
ifying+0.007
 Is+0.006
 awoke+0.006
 Ashley+0.006
 bit+0.006
Neg logit contributions
bled-0.014
uddled-0.013
 olds-0.013
iggle-0.013
 golf-0.013
Token tree
Token ID5509
Feature activation-5.53e-01

Pos logit contributions
ifying+0.004
 Is+0.004
 Ashley+0.004
 awoke+0.004
 Debbie+0.004
Neg logit contributions
bled-0.004
uddled-0.004
 olds-0.004
iggle-0.004
 golf-0.004
Token.
Token ID13
Feature activation-1.27e+00

Pos logit contributions
bled+0.027
uddled+0.026
 olds+0.025
iggle+0.024
irled+0.024
Neg logit contributions
ifying-0.018
 bit-0.017
 Is-0.016
 Ashley-0.016
 awoke-0.016
Token
Token ID198
Feature activation-1.04e+00

Pos logit contributions
bled+0.077
iggle+0.076
uddled+0.076
castle+0.074
agons+0.073
Neg logit contributions
 Is-0.027
ifying-0.025
 Will-0.023
Oh-0.022
 single-0.021
Token
Token ID198
Feature activation-1.02e+00

Pos logit contributions
bled+0.065
uddled+0.064
castle+0.063
 olds+0.063
iggle+0.061
Neg logit contributions
 Is-0.017
ifying-0.016
Am-0.016
 single-0.016
 Ashley-0.015
TokenTwe
Token ID32665
Feature activation-4.19e-01

Pos logit contributions
iggle+0.055
bled+0.054
uddled+0.054
 olds+0.054
castle+0.052
Neg logit contributions
ifying-0.027
 Is-0.026
Am-0.025
Oh-0.025
 bit-0.025
Tokenety
Token ID2963
Feature activation-3.58e-01

Pos logit contributions
bled+0.021
uddled+0.021
 olds+0.020
iggle+0.020
castle+0.019
Neg logit contributions
ifying-0.013
 Is-0.011
 bit-0.011
 awoke-0.011
 Ashley-0.011
Token my
Token ID616
Feature activation+9.36e-02

Pos logit contributions
bled+0.023
uddled+0.022
ails+0.021
iggle+0.020
 olds+0.020
Neg logit contributions
ifying-0.018
 Is-0.017
 Ashley-0.017
 bit-0.017
 awoke-0.016
Token dear
Token ID13674
Feature activation+3.57e-01

Pos logit contributions
ifying+0.004
 Is+0.004
 Ashley+0.004
 awoke+0.004
 bit+0.004
Neg logit contributions
bled-0.004
uddled-0.004
 olds-0.003
iggle-0.003
ails-0.003
Token."
Token ID526
Feature activation-1.78e-01

Pos logit contributions
ifying+0.012
 Is+0.011
 Ashley+0.011
 awoke+0.011
 bit+0.011
Neg logit contributions
bled-0.016
uddled-0.016
 olds-0.015
iggle-0.015
castle-0.015
Token<|endoftext|>
Token ID50256
Feature activation+5.98e-01

Pos logit contributions
bled+0.009
uddled+0.009
 olds+0.009
iggle+0.009
ails+0.008
Neg logit contributions
ifying-0.005
 Is-0.005
 Ashley-0.005
 bit-0.005
 awoke-0.005
TokenOnce
Token ID7454
Feature activation+8.98e-01

Pos logit contributions
ifying+0.019
 bit+0.018
 Is+0.017
 Ashley+0.017
 awoke+0.016
Neg logit contributions
bled-0.028
uddled-0.028
ails-0.027
iggle-0.027
 olds-0.027
Token upon
Token ID2402
Feature activation-3.66e-01

Pos logit contributions
ifying+0.026
 Is+0.022
 bit+0.022
 awoke+0.022
 Ashley+0.022
Neg logit contributions
bled-0.047
uddled-0.047
iggle-0.046
ails-0.045
bles-0.044
Token a
Token ID257
Feature activation-1.29e-01

Pos logit contributions
bled+0.019
uddled+0.019
 olds+0.018
ails+0.018
iggle+0.018
Neg logit contributions
ifying-0.010
 Is-0.009
 Ashley-0.009
 bit-0.009
 awoke-0.009
Token time
Token ID640
Feature activation-7.10e-01

Pos logit contributions
bled+0.007
uddled+0.007
 olds+0.007
iggle+0.007
ails+0.007
Neg logit contributions
ifying-0.003
 Is-0.003
 bit-0.003
 Ashley-0.003
 awoke-0.003
Token,
Token ID11
Feature activation-9.48e-02

Pos logit contributions
bled+0.036
uddled+0.035
 olds+0.034
ans+0.033
ini+0.033
Neg logit contributions
ifying-0.021
 Is-0.020
 Ashley-0.019
 bit-0.019
 awoke-0.019
Token there
Token ID612
Feature activation+8.95e-02

Pos logit contributions
bled+0.005
uddled+0.005
 olds+0.004
iggle+0.004
ails+0.004
Neg logit contributions
ifying-0.003
 Is-0.003
 Ashley-0.003
 bit-0.003
 awoke-0.003
Token was
Token ID373
Feature activation-1.01e+00

Pos logit contributions
ifying+0.003
 Is+0.003
 Ashley+0.003
 bit+0.003
 awoke+0.003
Neg logit contributions
bled-0.004
uddled-0.004
iggle-0.004
 olds-0.004
ails-0.003
Token asked
Token ID1965
Feature activation+2.83e-01

Pos logit contributions
ifying+0.005
 Is+0.004
 Ashley+0.004
 awoke+0.004
 bit+0.004
Neg logit contributions
bled-0.006
uddled-0.006
 olds-0.006
iggle-0.005
ails-0.005
Token,
Token ID11
Feature activation+5.76e-01

Pos logit contributions
ifying+0.010
 Is+0.009
 awoke+0.009
 bit+0.009
 Ashley+0.008
Neg logit contributions
bled-0.013
uddled-0.013
 olds-0.012
iggle-0.012
ails-0.012
Token "
Token ID366
Feature activation+1.48e+00

Pos logit contributions
ifying+0.025
 awoke+0.023
 Ashley+0.023
 bit+0.022
 Is+0.022
Neg logit contributions
bled-0.022
uddled-0.021
 olds-0.019
ails-0.019
 golf-0.019
TokenDid
Token ID11633
Feature activation-8.38e-01

Pos logit contributions
ifying+0.041
 Ashley+0.038
 care+0.037
 single+0.037
 Debbie+0.036
Neg logit contributions
bled-0.074
uddled-0.072
 golf-0.069
ini-0.067
castle-0.067
Token you
Token ID345
Feature activation-6.89e-03

Pos logit contributions
uddled+0.042
bled+0.041
ails+0.038
ini+0.038
iggle+0.038
Neg logit contributions
ifying-0.026
 Is-0.025
 bit-0.025
 Ashley-0.025
 care-0.024
Token enjoy
Token ID2883
Feature activation-1.59e-01

Pos logit contributions
bled+0.000
uddled+0.000
 olds+0.000
iggle+0.000
ails+0.000
Neg logit contributions
ifying-0.000
 Is-0.000
 Ashley-0.000
 bit-0.000
 awoke-0.000
Token your
Token ID534
Feature activation-9.13e-01

Pos logit contributions
bled+0.008
uddled+0.008
 olds+0.008
iggle+0.008
ails+0.008
Neg logit contributions
ifying-0.004
 Is-0.004
 Ashley-0.004
 bit-0.004
 awoke-0.004
Token large
Token ID1588
Feature activation-1.24e-02

Pos logit contributions
bled+0.041
uddled+0.039
 olds+0.036
ails+0.036
bles+0.036
Neg logit contributions
ifying-0.034
 bit-0.033
 care-0.032
 single-0.032
 awoke-0.031
Token meal
Token ID9799
Feature activation-3.36e-01

Pos logit contributions
bled+0.001
uddled+0.001
 olds+0.000
iggle+0.000
ails+0.000
Neg logit contributions
ifying-0.000
 Is-0.000
 Ashley-0.000
 bit-0.000
 awoke-0.000
Token?"
Token ID1701
Feature activation+1.83e-01

Pos logit contributions
bled+0.015
uddled+0.015
 olds+0.014
iggle+0.014
ails+0.014
Neg logit contributions
ifying-0.012
 Is-0.011
 bit-0.011
 Ashley-0.011
 awoke-0.010
Token
Token ID198
Feature activation+2.06e-01

Pos logit contributions
ifying+0.006
 Is+0.005
 bit+0.005
 awoke+0.005
 Ashley+0.005
Neg logit contributions
bled-0.009
uddled-0.009
 olds-0.009
iggle-0.009
ails-0.008
Token to
Token ID284
Feature activation+4.61e-01

Pos logit contributions
ifying+0.072
Pen+0.072
Jonathan+0.067
Lou+0.067
 Ashley+0.066
Neg logit contributions
 golf-0.104
 olds-0.100
bled-0.100
ails-0.099
uddled-0.098
Token buy
Token ID2822
Feature activation-5.10e-02

Pos logit contributions
ifying+0.016
 Ashley+0.015
 Is+0.014
 awoke+0.014
 Debbie+0.014
Neg logit contributions
bled-0.021
uddled-0.020
iggle-0.020
 olds-0.019
castle-0.019
Token the
Token ID262
Feature activation+1.04e+00

Pos logit contributions
bled+0.003
uddled+0.003
 olds+0.002
iggle+0.002
ails+0.002
Neg logit contributions
ifying-0.001
 Is-0.001
 Ashley-0.001
 bit-0.001
 awoke-0.001
Token lamp
Token ID20450
Feature activation+2.27e-01

Pos logit contributions
ifying+0.050
 Debbie+0.045
 Is+0.045
 awoke+0.045
 Ashley+0.045
Neg logit contributions
bled-0.034
uddled-0.032
 olds-0.032
 golf-0.031
ini-0.031
Token.
Token ID13
Feature activation+2.60e-01

Pos logit contributions
ifying+0.008
 Is+0.007
 Ashley+0.007
 awoke+0.007
 bit+0.007
Neg logit contributions
bled-0.010
uddled-0.010
 olds-0.010
iggle-0.010
ails-0.009
Token The
Token ID383
Feature activation-8.46e-01

Pos logit contributions
ifying+0.009
 awoke+0.008
 Is+0.008
 bit+0.008
 Ashley+0.008
Neg logit contributions
bled-0.012
uddled-0.012
 olds-0.011
ails-0.011
iggle-0.011
Token ancient
Token ID6156
Feature activation+6.73e-01

Pos logit contributions
uddled+0.042
bled+0.042
iggle+0.041
ails+0.041
irled+0.040
Neg logit contributions
 bit-0.027
 single-0.026
ifying-0.025
 total-0.025
 distance-0.025
Token friend
Token ID1545
Feature activation+5.13e-01

Pos logit contributions
ifying+0.029
 Is+0.025
 Ashley+0.025
 Debbie+0.024
 awoke+0.024
Neg logit contributions
bled-0.027
uddled-0.026
castle-0.025
 olds-0.025
ini-0.025
Token was
Token ID373
Feature activation+5.28e-01

Pos logit contributions
ifying+0.021
 Is+0.018
 Ashley+0.018
 distance+0.018
 single+0.018
Neg logit contributions
bled-0.021
uddled-0.021
iggle-0.019
 olds-0.019
irled-0.019
Token thrilled
Token ID20536
Feature activation-3.34e-01

Pos logit contributions
ifying+0.022
 Is+0.020
 Ashley+0.020
 awoke+0.020
Lou+0.019
Neg logit contributions
bled-0.021
uddled-0.020
 olds-0.019
iggle-0.019
castle-0.018
Token and
Token ID290
Feature activation-6.73e-01

Pos logit contributions
bled+0.016
uddled+0.016
 olds+0.015
iggle+0.015
uddle+0.015
Neg logit contributions
ifying-0.010
 Is-0.010
 bit-0.010
 Ashley-0.010
 awoke-0.009
Token was
Token ID373
Feature activation+2.24e-01

Pos logit contributions
ifying+0.016
 Is+0.015
 Ashley+0.014
 single+0.014
 total+0.014
Neg logit contributions
bled-0.021
uddled-0.021
iggle-0.019
 olds-0.019
irled-0.019
Token very
Token ID845
Feature activation-2.91e-01

Pos logit contributions
ifying+0.009
 Is+0.008
 Ashley+0.008
 awoke+0.008
 bit+0.008
Neg logit contributions
bled-0.010
uddled-0.009
 olds-0.009
iggle-0.009
ails-0.008
Token sad
Token ID6507
Feature activation+4.66e-01

Pos logit contributions
uddled+0.012
bled+0.012
 olds+0.011
ails+0.011
iggle+0.011
Neg logit contributions
ifying-0.011
 bit-0.011
 Is-0.011
 care-0.010
 single-0.010
Token.
Token ID13
Feature activation-7.24e-01

Pos logit contributions
ifying+0.019
 Ashley+0.018
 awoke+0.018
 Is+0.017
 Debbie+0.017
Neg logit contributions
uddled-0.018
bled-0.018
 olds-0.017
iggle-0.017
ails-0.017
Token He
Token ID679
Feature activation+5.76e-01

Pos logit contributions
bled+0.041
uddled+0.041
iggle+0.038
agons+0.038
ans+0.038
Neg logit contributions
 Is-0.018
ifying-0.018
 Will-0.016
 Ashley-0.016
 bit-0.016
Token wished
Token ID16555
Feature activation-1.36e+00

Pos logit contributions
ifying+0.021
 Is+0.019
 Ashley+0.019
 Debbie+0.018
 single+0.018
Neg logit contributions
bled-0.025
uddled-0.025
 olds-0.024
iggle-0.024
irled-0.023
Token he
Token ID339
Feature activation+5.05e-01

Pos logit contributions
uddled+0.082
ans+0.081
agons+0.081
bled+0.080
uddle+0.080
Neg logit contributions
 Is-0.024
 Will-0.023
ifying-0.022
 Ashley-0.022
 care-0.022
Token could
Token ID714
Feature activation-1.77e-01

Pos logit contributions
ifying+0.016
 Ashley+0.014
 Is+0.014
 Debbie+0.014
 single+0.014
Neg logit contributions
bled-0.025
uddled-0.025
 olds-0.023
bles-0.022
irled-0.022
Token make
Token ID787
Feature activation-8.61e-01

Pos logit contributions
bled+0.008
uddled+0.008
 olds+0.008
ails+0.008
iggle+0.008
Neg logit contributions
ifying-0.006
 Is-0.005
 bit-0.005
 Ashley-0.005
 awoke-0.005
Token his
Token ID465
Feature activation-3.43e-01

Pos logit contributions
uddled+0.053
bled+0.052
ails+0.049
 olds+0.049
agons+0.049
Neg logit contributions
 care-0.016
 Is-0.016
 Ashley-0.016
 Will-0.016
ifying-0.015
Token player
Token ID2137
Feature activation+5.94e-01

Pos logit contributions
bled+0.015
uddled+0.015
 olds+0.014
ails+0.014
iggle+0.014
Neg logit contributions
ifying-0.012
 Is-0.012
 bit-0.012
 Ashley-0.011
 single-0.011
Token was
Token ID373
Feature activation-1.04e+00

Pos logit contributions
ifying+0.012
 Ashley+0.011
 Is+0.011
 bit+0.011
 awoke+0.010
Neg logit contributions
bled-0.012
uddled-0.012
iggle-0.011
 olds-0.011
ails-0.011
Token a
Token ID257
Feature activation-6.99e-01

Pos logit contributions
uddled+0.061
bled+0.061
 olds+0.060
uddle+0.059
ails+0.058
Neg logit contributions
ifying-0.022
 Ashley-0.021
 Will-0.020
 single-0.020
 awoke-0.020
Token bouncing
Token ID30713
Feature activation-1.53e+00

Pos logit contributions
bled+0.032
uddled+0.031
 olds+0.029
ails+0.029
iggle+0.029
Neg logit contributions
 bit-0.025
ifying-0.025
 Is-0.023
 Ashley-0.023
 single-0.023
Token ball
Token ID2613
Feature activation+7.40e-02

Pos logit contributions
bled+0.075
uddled+0.072
 olds+0.070
oving+0.067
irled+0.066
Neg logit contributions
 bit-0.050
ifying-0.048
 single-0.047
 distance-0.044
 awoke-0.043
Token.
Token ID13
Feature activation+1.24e+00

Pos logit contributions
ifying+0.003
 Is+0.002
 Ashley+0.002
 awoke+0.002
 bit+0.002
Neg logit contributions
bled-0.003
uddled-0.003
 olds-0.003
iggle-0.003
ails-0.003
Token His
Token ID2399
Feature activation-2.55e+00

Pos logit contributions
ifying+0.052
 awoke+0.047
 bit+0.047
Am+0.046
Lou+0.045
Neg logit contributions
 golf-0.046
 olds-0.044
bled-0.042
ails-0.042
iggle-0.041
Token name
Token ID1438
Feature activation+4.32e-02

Pos logit contributions
uddled+0.151
ator+0.150
 olds+0.150
bled+0.148
rim+0.148
Neg logit contributions
 care-0.050
 try-0.048
 bit-0.048
 single-0.046
 stroke-0.044
Token was
Token ID373
Feature activation-5.94e-01

Pos logit contributions
ifying+0.002
 Is+0.001
 Ashley+0.001
 bit+0.001
 awoke+0.001
Neg logit contributions
bled-0.002
uddled-0.002
 olds-0.002
iggle-0.002
ails-0.002
Token Bill
Token ID3941
Feature activation+7.24e-01

Pos logit contributions
uddled+0.024
bled+0.023
 olds+0.022
uddle+0.022
iggle+0.022
Neg logit contributions
ifying-0.024
 Is-0.023
 Ashley-0.023
 Will-0.023
 Debbie-0.022
Token and
Token ID290
Feature activation-4.76e-01

Pos logit contributions
ifying+0.030
 Is+0.028
 awoke+0.027
 Ashley+0.026
 total+0.026
Neg logit contributions
bled-0.029
uddled-0.028
bles-0.027
iggle-0.027
ails-0.027
Token he
Token ID339
Feature activation-3.36e-01

Pos logit contributions
uddled+0.027
bled+0.026
 olds+0.025
iggle+0.025
uddle+0.025
Neg logit contributions
ifying-0.011
 Is-0.011
 Ashley-0.011
 bit-0.011
 care-0.010
Token.
Token ID13
Feature activation+4.12e-01

Pos logit contributions
ifying+0.027
 Ashley+0.026
Jonathan+0.025
 Is+0.025
 Debbie+0.025
Neg logit contributions
bled-0.026
uddled-0.026
iggle-0.025
ails-0.025
castle-0.024
Token She
Token ID1375
Feature activation-1.35e-01

Pos logit contributions
ifying+0.014
 awoke+0.012
 bit+0.012
 total+0.012
Am+0.012
Neg logit contributions
uddled-0.019
bled-0.019
 olds-0.018
ails-0.018
 golf-0.018
Token recognized
Token ID8018
Feature activation+8.18e-01

Pos logit contributions
bled+0.007
uddled+0.006
 olds+0.006
iggle+0.006
ails+0.006
Neg logit contributions
ifying-0.004
 Is-0.004
 bit-0.004
 Ashley-0.004
 awoke-0.004
Token it
Token ID340
Feature activation-2.12e-01

Pos logit contributions
ifying+0.028
 awoke+0.025
 distance+0.024
 total+0.024
 bit+0.023
Neg logit contributions
bled-0.039
uddled-0.036
 golf-0.036
 olds-0.036
iggle-0.035
Token as
Token ID355
Feature activation-9.28e-01

Pos logit contributions
bled+0.010
uddled+0.010
 olds+0.009
ails+0.009
iggle+0.009
Neg logit contributions
ifying-0.007
 Is-0.007
 bit-0.007
 Ashley-0.007
 awoke-0.007
Token an
Token ID281
Feature activation-2.66e+00

Pos logit contributions
uddled+0.054
bled+0.053
uddle+0.052
 olds+0.052
ails+0.051
Neg logit contributions
ifying-0.019
 Is-0.019
 Ashley-0.018
 single-0.018
 care-0.017
Token eagle
Token ID31176
Feature activation+3.68e-02

Pos logit contributions
uddled+0.131
bled+0.128
uddle+0.122
ails+0.116
kins+0.114
Neg logit contributions
 bit-0.082
Am-0.082
 awoke-0.081
 single-0.081
 stroke-0.080
Token!
Token ID0
Feature activation+1.86e-01

Pos logit contributions
ifying+0.001
 Is+0.001
 Ashley+0.001
 awoke+0.001
 bit+0.001
Neg logit contributions
bled-0.002
uddled-0.002
 olds-0.002
iggle-0.002
ails-0.002
Token
Token ID220
Feature activation-8.10e-01

Pos logit contributions
ifying+0.006
 awoke+0.005
 bit+0.005
 Is+0.005
 Ashley+0.005
Neg logit contributions
bled-0.009
uddled-0.009
 olds-0.009
ails-0.009
iggle-0.009
Token
Token ID198
Feature activation+5.07e-01

Pos logit contributions
bled+0.047
uddled+0.047
 olds+0.045
castle+0.045
 golf+0.044
Neg logit contributions
ifying-0.017
 Is-0.016
 Ashley-0.016
 awoke-0.014
 Debbie-0.014
Token
Token ID198
Feature activation-6.30e-02

Pos logit contributions
ifying+0.017
 bit+0.015
 Is+0.015
 awoke+0.014
 Ashley+0.014
Neg logit contributions
uddled-0.024
bled-0.024
iggle-0.023
 olds-0.023
ails-0.023
Token find
Token ID1064
Feature activation+3.01e-01

Pos logit contributions
ifying+0.006
 Is+0.005
 Ashley+0.005
 awoke+0.005
 bit+0.005
Neg logit contributions
bled-0.008
uddled-0.008
iggle-0.008
 olds-0.008
ails-0.007
Token her
Token ID607
Feature activation-2.48e-02

Pos logit contributions
ifying+0.010
 awoke+0.009
 Is+0.009
 bit+0.009
 Ashley+0.009
Neg logit contributions
bled-0.014
uddled-0.014
 olds-0.013
iggle-0.013
ails-0.013
Token.
Token ID13
Feature activation-4.10e-01

Pos logit contributions
bled+0.001
uddled+0.001
 olds+0.001
iggle+0.001
ails+0.001
Neg logit contributions
ifying-0.001
 Is-0.001
 Ashley-0.001
 bit-0.001
 awoke-0.001
Token
Token ID198
Feature activation+2.39e-02

Pos logit contributions
bled+0.022
uddled+0.021
iggle+0.020
 olds+0.020
castle+0.020
Neg logit contributions
ifying-0.011
 Is-0.011
 Ashley-0.010
 bit-0.010
 awoke-0.010
Token
Token ID198
Feature activation-2.41e-01

Pos logit contributions
ifying+0.001
 Is+0.001
 Ashley+0.001
 bit+0.001
 awoke+0.001
Neg logit contributions
bled-0.001
uddled-0.001
 olds-0.001
iggle-0.001
ails-0.001
TokenL
Token ID43
Feature activation-2.20e+00

Pos logit contributions
bled+0.011
uddled+0.011
 olds+0.011
iggle+0.010
ails+0.010
Neg logit contributions
ifying-0.008
 Is-0.008
 bit-0.008
 Ashley-0.007
 awoke-0.007
Tokenily
Token ID813
Feature activation-3.47e-02

Pos logit contributions
bled+0.089
 olds+0.087
 golf+0.084
yard+0.082
 raft+0.081
Neg logit contributions
 awoke-0.078
 Is-0.075
 heard-0.074
 woke-0.073
y-0.073
Token was
Token ID373
Feature activation-1.02e-01

Pos logit contributions
bled+0.002
uddled+0.002
 olds+0.002
iggle+0.002
ails+0.002
Neg logit contributions
ifying-0.001
 Is-0.001
 Ashley-0.001
 bit-0.001
 awoke-0.001
Token all
Token ID477
Feature activation+8.47e-01

Pos logit contributions
bled+0.005
uddled+0.005
 olds+0.004
iggle+0.004
ails+0.004
Neg logit contributions
ifying-0.003
 Is-0.003
 Ashley-0.003
 bit-0.003
 awoke-0.003
Token alone
Token ID3436
Feature activation+1.88e-01

Pos logit contributions
ifying+0.042
Lou+0.036
Jonathan+0.036
 Is+0.036
 Debbie+0.036
Neg logit contributions
bled-0.028
 olds-0.027
uddled-0.026
iggle-0.026
 golf-0.026
Token and
Token ID290
Feature activation+5.38e-01

Pos logit contributions
ifying+0.007
 Is+0.007
 Ashley+0.007
 awoke+0.007
 Debbie+0.006
Neg logit contributions
bled-0.008
uddled-0.008
 olds-0.007
iggle-0.007
ails-0.007
Token They
Token ID1119
Feature activation+6.39e-01

Pos logit contributions
bled+0.020
uddled+0.020
iggle+0.018
bles+0.018
irled+0.018
Neg logit contributions
ifying-0.016
 Is-0.016
 Ashley-0.015
 bit-0.015
 care-0.014
Token touch
Token ID3638
Feature activation+4.03e-01

Pos logit contributions
ifying+0.021
 Is+0.019
 single+0.019
 Ashley+0.018
 awoke+0.018
Neg logit contributions
bled-0.030
iggle-0.029
uddled-0.029
 olds-0.028
 golf-0.028
Token the
Token ID262
Feature activation-3.11e-01

Pos logit contributions
ifying+0.012
 awoke+0.011
 Is+0.011
 Ashley+0.011
 Debbie+0.011
Neg logit contributions
bled-0.020
uddled-0.020
 olds-0.019
iggle-0.019
ails-0.019
Token bubbles
Token ID25037
Feature activation-1.85e-01

Pos logit contributions
bled+0.013
uddled+0.012
 olds+0.012
iggle+0.012
ails+0.012
Neg logit contributions
ifying-0.013
 bit-0.012
 Is-0.011
 single-0.011
 awoke-0.011
Token and
Token ID290
Feature activation+9.83e-01

Pos logit contributions
bled+0.008
uddled+0.008
 olds+0.008
iggle+0.008
ails+0.008
Neg logit contributions
ifying-0.006
 Is-0.006
 bit-0.006
 Ashley-0.006
 awoke-0.006
Token g
Token ID308
Feature activation-2.22e+00

Pos logit contributions
ifying+0.034
Am+0.031
 single+0.031
 awoke+0.030
 total+0.029
Neg logit contributions
bled-0.043
 olds-0.042
 golf-0.042
iggle-0.042
uddled-0.041
Tokeniggle
Token ID24082
Feature activation+5.51e-01

Pos logit contributions
 golf+0.046
bled+0.042
 clubhouse+0.042
uddled+0.039
ini+0.038
Neg logit contributions
 care-0.118
ifying-0.116
ges-0.112
ff-0.110
 Will-0.110
Token.
Token ID13
Feature activation-9.83e-01

Pos logit contributions
ifying+0.021
 Ashley+0.019
 Is+0.019
 awoke+0.019
 Debbie+0.019
Neg logit contributions
bled-0.024
uddled-0.023
iggle-0.022
 olds-0.022
 golf-0.021
Token They
Token ID1119
Feature activation+1.45e+00

Pos logit contributions
bled+0.051
uddled+0.049
iggle+0.048
ini+0.047
bles+0.047
Neg logit contributions
 Is-0.030
ifying-0.029
 Will-0.027
 care-0.026
 bit-0.026
Token see
Token ID766
Feature activation-3.36e-01

Pos logit contributions
ifying+0.052
 single+0.049
Am+0.049
 awoke+0.048
Oh+0.048
Neg logit contributions
iggle-0.061
bled-0.059
 golf-0.057
 olds-0.057
bles-0.057
Token the
Token ID262
Feature activation-4.03e-01

Pos logit contributions
bled+0.017
uddled+0.017
 olds+0.016
iggle+0.016
ails+0.016
Neg logit contributions
ifying-0.010
 Is-0.009
 bit-0.009
 Ashley-0.009
 awoke-0.009
Token"
Token ID1
Feature activation+4.58e-01

Pos logit contributions
ifying+0.013
 bit+0.012
 Ashley+0.012
 awoke+0.012
 Is+0.012
Neg logit contributions
bled-0.021
uddled-0.020
 olds-0.019
iggle-0.019
ails-0.019
TokenWhat
Token ID2061
Feature activation+1.94e-01

Pos logit contributions
ifying+0.014
 Ashley+0.013
 awoke+0.013
 bit+0.012
 Is+0.012
Neg logit contributions
bled-0.023
uddled-0.022
 golf-0.021
 olds-0.021
ini-0.021
Token a
Token ID257
Feature activation-1.06e+00

Pos logit contributions
ifying+0.007
 Is+0.006
 Ashley+0.006
 awoke+0.006
 bit+0.006
Neg logit contributions
bled-0.009
uddled-0.009
iggle-0.008
 olds-0.008
ails-0.008
Token pretty
Token ID2495
Feature activation-7.43e-01

Pos logit contributions
bled+0.048
uddled+0.047
ails+0.042
bles+0.041
iggle+0.040
Neg logit contributions
 bit-0.042
 single-0.041
 care-0.040
ifying-0.039
 Is-0.039
Token yellow
Token ID7872
Feature activation-3.86e-01

Pos logit contributions
bled+0.036
uddled+0.034
iggle+0.031
 olds+0.031
bles+0.031
Neg logit contributions
ifying-0.025
 bit-0.025
 single-0.024
 Is-0.024
 awoke-0.023
Token bou
Token ID35833
Feature activation-2.39e+00

Pos logit contributions
bled+0.018
uddled+0.018
 olds+0.017
iggle+0.017
ails+0.016
Neg logit contributions
ifying-0.013
 bit-0.012
 Is-0.012
 awoke-0.012
 single-0.012
Tokenquet
Token ID21108
Feature activation-3.92e-01

Pos logit contributions
bled+0.095
uddled+0.085
ans+0.084
irled+0.084
oving+0.082
Neg logit contributions
ifying-0.095
of-0.091
Am-0.091
 Ashley-0.089
Lou-0.086
Token,"
Token ID553
Feature activation+9.31e-01

Pos logit contributions
bled+0.019
uddled+0.019
 olds+0.018
iggle+0.018
ails+0.018
Neg logit contributions
ifying-0.012
 bit-0.011
 Is-0.011
 awoke-0.011
 Ashley-0.011
Token she
Token ID673
Feature activation-1.33e-01

Pos logit contributions
ifying+0.043
 total+0.038
 distance+0.038
 awoke+0.038
 single+0.038
Neg logit contributions
bled-0.034
uddled-0.032
 olds-0.031
iggle-0.030
irled-0.030
Token said
Token ID531
Feature activation+9.29e-02

Pos logit contributions
bled+0.006
uddled+0.006
 olds+0.005
iggle+0.005
ails+0.005
Neg logit contributions
ifying-0.005
 Is-0.005
 Ashley-0.005
 bit-0.005
 awoke-0.005
Token proudly
Token ID23017
Feature activation+1.38e-01

Pos logit contributions
ifying+0.003
 Is+0.003
 awoke+0.003
 Ashley+0.003
 bit+0.003
Neg logit contributions
bled-0.004
uddled-0.004
 olds-0.004
iggle-0.004
ails-0.004