TOP ACTIVATIONS
MAX = 0.102

, it went on the w ett est of adventures .
M andy and Sally . M andy was a very kind
there were two friends , an elephant and a dinosaur .
there were two friends , an ostr ich and a goat
. Sally said , " M mm , this cake is
became best friends . They played together in the park every
. After the boy finished his swing , he shared the
Bob was so happy with his cool golf shot . He
for his help . They played golf together and had a
One day , an especially brave and curious squirrel
One day , an unhappy thing happened . The
. Sally said , " M mm , this cake is
wanted to make his mum an honest offer .
in the mud . What an adventure ! <|endoftext|> Once upon
selfish . One day , M andy got an invitation to
. It was an important day for the man
a very special friend , an old turtle . She liked
One day , there was an earthquake and it made them
come and play . What an incredible key ! <|endoftext|> Once
said Sally . M um smiled and said ,

MIN ACTIVATIONS
MIN = -0.096

shoot it . Jackson got her toy gun
loved to dance . Every day , he danced on the
, you can give me a massage . That would be
a little girl named Jackson . One day , Jackson went
she saw a large man . He yelled out to her
my and daddy . One day , when Pen elope was
ily , can you answer a question for me ?" Lily
world . One day , Ashley was at the
stop ! Finally , she reached the top of
she decided to shoot it . Jackson got
around the park . One day , as she was dancing
if he could give her a massage . Jane said "
named Pen elope . One day , while picking fresh flowers
mom if she could get a new haircut . Her mom
. The next day , Ashley brought the flag
my , we don 't have any popcorn left . We
way home . So Pen elope and her mom
a little car named Gray . Gray was very fast and
Tom nods . He takes a corn from the plant .
The end . <|endoftext|> One day , a

INTERVAL 0.053 - 0.102
CONTAINS 0.265%

. Jack said , " L ily , let 's buy
upon a time there lived an olive who wanted to do
miserable ! M ama saw what had happened
You can go outside and play with the rain . But
. They liked to play dress - up with their dolls

INTERVAL 0.003 - 0.053
CONTAINS 42.131%

threw the stick far away . Max saw the stick and
me how to be patient , loyal , and grateful .
then , Anna sees that the window is fog gy .
happy to be outside with his basketball . He threw it
new . M illy was forever thankful to

INTERVAL -0.046 - 0.003
CONTAINS 56.989%

alone and scared . It is getting dark and cold .
She was in shock . There was a giant ,
It is very good . You did a good job cleaning
went . One day , Tim my was playing
theater with her mom . When they arrived , Lucy was

INTERVAL -0.096 - -0.046
CONTAINS 0.614%

blue eyes . She wished Anna was nicer to her .
They all gave each other a big hug and the little
mysterious voice . It said " Tell me ! Tell me
saw a beautiful golden crown in the bottom of the pond
want to pray . We want to play ," Ben says
Token,
Token ID11
Feature activation-2.52e-02

Pos logit contributions
ifying+0.085
Am+0.080
 distance+0.079
Georgia+0.078
 Debbie+0.078
Neg logit contributions
bled-0.041
 golf-0.039
uddled-0.038
ails-0.033
irled-0.033
Token it
Token ID340
Feature activation+3.63e-03

Pos logit contributions
ifying+0.039
 awoke+0.037
Am+0.035
 Debbie+0.035
use+0.034
Neg logit contributions
bled-0.044
 golf-0.042
uddled-0.041
 olds-0.038
bles-0.037
Token went
Token ID1816
Feature activation+1.37e-03

Pos logit contributions
bled+0.008
uddled+0.007
 olds+0.007
iggle+0.007
ails+0.007
Neg logit contributions
ifying-0.004
 Is-0.004
 awoke-0.004
 bit-0.004
 Ashley-0.004
Token on
Token ID319
Feature activation+1.33e-03

Pos logit contributions
bled+0.003
uddled+0.003
 olds+0.003
iggle+0.003
ails+0.003
Neg logit contributions
ifying-0.001
 Is-0.001
 Ashley-0.001
 bit-0.001
 awoke-0.001
Token the
Token ID262
Feature activation+2.71e-02

Pos logit contributions
bled+0.003
uddled+0.003
 olds+0.003
iggle+0.002
ails+0.002
Neg logit contributions
ifying-0.002
 Is-0.002
 Ashley-0.002
 bit-0.002
 awoke-0.001
Token w
Token ID266
Feature activation+1.02e-01

Pos logit contributions
bled+0.051
uddled+0.050
irled+0.048
ails+0.048
iggle+0.048
Neg logit contributions
 distance-0.042
 bit-0.038
ifying-0.037
 awoke-0.036
 single-0.036
Tokenett
Token ID3087
Feature activation+2.87e-02

Pos logit contributions
bled+0.120
kins+0.093
uddled+0.092
mm+0.089
 olds+0.089
Neg logit contributions
 care-0.183
 single-0.177
ff-0.170
 month-0.166
 wake-0.164
Tokenest
Token ID395
Feature activation+8.58e-03

Pos logit contributions
uddled+0.054
bled+0.053
uddle+0.051
 olds+0.049
mm+0.048
Neg logit contributions
 distance-0.039
 awoke-0.039
 bit-0.038
ifying-0.038
 single-0.038
Token of
Token ID286
Feature activation+3.19e-02

Pos logit contributions
bled+0.016
uddled+0.016
 olds+0.015
iggle+0.015
ails+0.014
Neg logit contributions
ifying-0.012
 bit-0.011
 awoke-0.011
 Is-0.011
 distance-0.011
Token adventures
Token ID17545
Feature activation+2.35e-02

Pos logit contributions
uddled+0.071
bled+0.066
uddle+0.066
iggle+0.064
ini+0.063
Neg logit contributions
 care-0.035
 Will-0.034
 eight-0.033
 Is-0.032
ifying-0.032
Token.
Token ID13
Feature activation+2.19e-03

Pos logit contributions
bled+0.051
uddled+0.050
uddle+0.048
iggle+0.047
add+0.047
Neg logit contributions
ifying-0.026
 bit-0.024
 single-0.023
 care-0.023
 Is-0.023
Token M
Token ID337
Feature activation+5.76e-02

Pos logit contributions
bled+0.013
uddled+0.013
iggle+0.012
 olds+0.012
uddle+0.012
Neg logit contributions
ifying-0.013
 Is-0.012
 bit-0.012
 Ashley-0.012
 awoke-0.012
Tokenandy
Token ID10757
Feature activation+3.55e-02

Pos logit contributions
bled+0.089
 golf+0.081
oving+0.075
 olds+0.072
ized+0.072
Neg logit contributions
bie-0.100
 Will-0.097
 Mae-0.097
Am-0.097
 Ashley-0.096
Token and
Token ID290
Feature activation-1.42e-02

Pos logit contributions
bled+0.081
uddled+0.076
ires+0.072
uddle+0.072
mm+0.071
Neg logit contributions
 bit-0.038
 Ashley-0.035
 care-0.034
 awoke-0.033
 Will-0.032
Token Sally
Token ID25737
Feature activation-6.10e-03

Pos logit contributions
ifying+0.027
 awoke+0.025
 single+0.024
 Is+0.024
 distance+0.024
Neg logit contributions
bled-0.020
uddled-0.018
 golf-0.018
iggle-0.018
 olds-0.018
Token.
Token ID13
Feature activation+1.31e-02

Pos logit contributions
ifying+0.009
 Is+0.009
 awoke+0.008
 Ashley+0.008
 bit+0.008
Neg logit contributions
bled-0.011
uddled-0.011
iggle-0.010
 olds-0.010
ails-0.010
Token M
Token ID337
Feature activation+9.73e-02

Pos logit contributions
bled+0.025
uddled+0.024
 olds+0.022
iggle+0.021
irled+0.021
Neg logit contributions
ifying-0.019
 Is-0.018
 Ashley-0.017
 bit-0.017
 care-0.017
Tokenandy
Token ID10757
Feature activation+5.22e-02

Pos logit contributions
bled+0.200
oving+0.173
 golf+0.169
renches+0.162
agons+0.159
Neg logit contributions
bie-0.117
y-0.109
abel-0.101
 Mae-0.101
 night-0.099
Token was
Token ID373
Feature activation+2.70e-02

Pos logit contributions
bled+0.127
uddled+0.115
agons+0.114
ires+0.114
bles+0.113
Neg logit contributions
 bit-0.050
 care-0.046
 heard-0.045
 woke-0.044
 awoke-0.044
Token a
Token ID257
Feature activation-7.66e-03

Pos logit contributions
uddled+0.066
bled+0.063
uddle+0.062
 olds+0.060
castle+0.059
Neg logit contributions
 Ashley-0.025
 bit-0.025
 Will-0.025
 Is-0.024
ifying-0.024
Token very
Token ID845
Feature activation+5.01e-02

Pos logit contributions
ifying+0.013
 Is+0.012
 awoke+0.012
 Ashley+0.012
 Debbie+0.012
Neg logit contributions
bled-0.012
uddled-0.012
 golf-0.011
 olds-0.011
iggle-0.011
Token kind
Token ID1611
Feature activation+1.63e-02

Pos logit contributions
bled+0.098
uddled+0.098
ized+0.084
mm+0.082
 olds+0.082
Neg logit contributions
 bit-0.075
 single-0.072
 care-0.071
 Will-0.067
 total-0.065
Token there
Token ID612
Feature activation-2.94e-03

Pos logit contributions
bled+0.028
uddled+0.027
 olds+0.026
agons+0.025
ans+0.025
Neg logit contributions
ifying-0.018
 Is-0.017
 Ashley-0.017
 bit-0.016
 awoke-0.016
Token were
Token ID547
Feature activation+2.40e-02

Pos logit contributions
ifying+0.005
 Is+0.004
 Ashley+0.004
 bit+0.004
 awoke+0.004
Neg logit contributions
bled-0.005
uddled-0.005
iggle-0.005
 olds-0.005
ails-0.005
Token two
Token ID734
Feature activation+4.51e-02

Pos logit contributions
uddled+0.048
bled+0.048
 olds+0.045
ails+0.045
iggle+0.045
Neg logit contributions
ifying-0.030
 Will-0.028
Am-0.028
 Ashley-0.028
Lou-0.027
Token friends
Token ID2460
Feature activation+9.75e-03

Pos logit contributions
bled+0.069
uddled+0.061
irled+0.057
ails+0.055
iggle+0.054
Neg logit contributions
 bit-0.084
ifying-0.078
 Ashley-0.077
 eight-0.077
 total-0.077
Token,
Token ID11
Feature activation+1.00e-02

Pos logit contributions
bled+0.018
uddled+0.018
 olds+0.016
uddle+0.016
iggle+0.016
Neg logit contributions
ifying-0.014
 bit-0.013
 Is-0.013
 Ashley-0.013
 care-0.013
Token an
Token ID281
Feature activation+9.61e-02

Pos logit contributions
bled+0.018
uddled+0.018
iggle+0.016
 olds+0.016
uddle+0.016
Neg logit contributions
ifying-0.015
 Ashley-0.014
 Is-0.014
 bit-0.014
 Will-0.014
Token elephant
Token ID20950
Feature activation-1.14e-03

Pos logit contributions
uddled+0.216
bled+0.212
uddle+0.196
irled+0.179
ades+0.179
Neg logit contributions
 baby-0.114
 night-0.113
 nine-0.110
 bit-0.107
 eight-0.107
Token and
Token ID290
Feature activation+1.15e-03

Pos logit contributions
ifying+0.002
 Is+0.002
 Ashley+0.002
 awoke+0.002
 bit+0.002
Neg logit contributions
bled-0.002
uddled-0.002
 olds-0.002
iggle-0.002
ails-0.002
Token a
Token ID257
Feature activation+9.04e-03

Pos logit contributions
bled+0.002
uddled+0.002
 olds+0.002
iggle+0.002
ails+0.002
Neg logit contributions
ifying-0.002
 Is-0.001
 Ashley-0.001
 bit-0.001
 awoke-0.001
Token dinosaur
Token ID28257
Feature activation+1.74e-03

Pos logit contributions
bled+0.017
uddled+0.017
 olds+0.016
iggle+0.016
ails+0.016
Neg logit contributions
ifying-0.013
 bit-0.012
 Is-0.012
 Ashley-0.011
 awoke-0.011
Token.
Token ID13
Feature activation-6.86e-03

Pos logit contributions
bled+0.003
uddled+0.003
 olds+0.003
iggle+0.003
ails+0.003
Neg logit contributions
ifying-0.002
 Is-0.002
 bit-0.002
 Ashley-0.002
 awoke-0.002
Token there
Token ID612
Feature activation+2.96e-03

Pos logit contributions
bled+0.038
uddled+0.038
 olds+0.036
ans+0.035
agons+0.035
Neg logit contributions
ifying-0.023
 Is-0.022
 Ashley-0.021
 awoke-0.021
 bit-0.021
Token were
Token ID547
Feature activation+3.07e-02

Pos logit contributions
bled+0.005
uddled+0.005
 olds+0.005
iggle+0.005
ails+0.005
Neg logit contributions
ifying-0.004
 Is-0.004
 Ashley-0.004
 awoke-0.004
 bit-0.004
Token two
Token ID734
Feature activation+4.94e-02

Pos logit contributions
uddled+0.065
bled+0.064
 olds+0.061
ails+0.060
uddle+0.060
Neg logit contributions
ifying-0.037
 Will-0.034
Am-0.034
 Ashley-0.033
Lou-0.033
Token friends
Token ID2460
Feature activation+1.20e-02

Pos logit contributions
bled+0.077
uddled+0.067
irled+0.064
ails+0.062
castle+0.061
Neg logit contributions
 bit-0.091
ifying-0.083
 Ashley-0.083
 eight-0.082
 total-0.082
Token,
Token ID11
Feature activation+1.08e-02

Pos logit contributions
bled+0.022
uddled+0.022
 olds+0.020
uddle+0.020
iggle+0.020
Neg logit contributions
ifying-0.017
 bit-0.016
 Is-0.016
 Ashley-0.016
 care-0.016
Token an
Token ID281
Feature activation+9.54e-02

Pos logit contributions
bled+0.020
uddled+0.020
iggle+0.018
 olds+0.018
uddle+0.018
Neg logit contributions
ifying-0.016
 Ashley-0.015
 bit-0.015
 Is-0.015
 Will-0.014
Token ostr
Token ID42387
Feature activation+3.42e-02

Pos logit contributions
uddled+0.212
bled+0.209
uddle+0.194
ails+0.179
ades+0.177
Neg logit contributions
 baby-0.115
 night-0.114
 nine-0.109
 bit-0.108
 eight-0.105
Tokenich
Token ID488
Feature activation+3.24e-04

Pos logit contributions
bled+0.048
 olds+0.047
iggle+0.045
uddled+0.043
 golf+0.043
Neg logit contributions
ifying-0.062
Am-0.059
 bit-0.056
Lou-0.056
 Debbie-0.056
Token and
Token ID290
Feature activation+3.11e-03

Pos logit contributions
bled+0.001
uddled+0.000
 olds+0.000
iggle+0.000
ails+0.000
Neg logit contributions
ifying-0.001
 Is-0.001
 Ashley-0.001
 bit-0.001
 awoke-0.001
Token a
Token ID257
Feature activation+3.90e-03

Pos logit contributions
bled+0.006
uddled+0.006
 olds+0.006
iggle+0.006
ails+0.006
Neg logit contributions
ifying-0.004
 Is-0.004
 Ashley-0.004
 bit-0.004
 awoke-0.004
Token goat
Token ID26917
Feature activation+3.22e-03

Pos logit contributions
bled+0.008
uddled+0.007
 olds+0.007
iggle+0.007
ails+0.007
Neg logit contributions
ifying-0.005
 Is-0.005
 bit-0.005
 Ashley-0.005
 awoke-0.005
Token.
Token ID13
Feature activation+6.11e-02

Pos logit contributions
bled+0.003
uddled+0.003
 olds+0.003
iggle+0.003
ails+0.002
Neg logit contributions
ifying-0.002
 Is-0.002
 bit-0.002
 Ashley-0.002
 awoke-0.002
Token Sally
Token ID25737
Feature activation+1.23e-02

Pos logit contributions
bled+0.139
bles+0.138
ini+0.136
agons+0.136
kins+0.135
Neg logit contributions
 Is-0.060
 bit-0.056
 Will-0.054
 While-0.053
ifying-0.053
Token said
Token ID531
Feature activation+1.77e-03

Pos logit contributions
bled+0.024
uddled+0.023
 olds+0.023
iggle+0.022
agons+0.022
Neg logit contributions
 bit-0.015
ifying-0.015
 Is-0.015
 awoke-0.015
 Ashley-0.015
Token,
Token ID11
Feature activation+1.33e-02

Pos logit contributions
bled+0.003
uddled+0.003
 olds+0.003
iggle+0.003
ails+0.003
Neg logit contributions
ifying-0.003
 Is-0.002
 bit-0.002
 Ashley-0.002
 awoke-0.002
Token "
Token ID366
Feature activation-1.23e-02

Pos logit contributions
bled+0.024
uddled+0.024
iggle+0.023
 olds+0.023
uddle+0.023
Neg logit contributions
ifying-0.018
 Is-0.018
 bit-0.017
 Ashley-0.017
 Will-0.017
TokenM
Token ID44
Feature activation+9.42e-02

Pos logit contributions
ifying+0.016
 awoke+0.014
 Ashley+0.014
 Is+0.014
 Debbie+0.014
Neg logit contributions
bled-0.025
uddled-0.024
 golf-0.023
ini-0.023
ails-0.023
Tokenmm
Token ID3020
Feature activation+4.55e-02

Pos logit contributions
bled+0.145
bes+0.117
uring+0.115
 burying+0.113
uddled+0.112
Neg logit contributions
ah-0.137
 single-0.137
of-0.135
Not-0.133
ifying-0.133
Token,
Token ID11
Feature activation-1.12e-03

Pos logit contributions
bled+0.109
uddled+0.105
 olds+0.100
ini+0.100
 clubhouse+0.099
Neg logit contributions
 bit-0.041
 care-0.034
ifying-0.032
 will-0.032
 Ashley-0.031
Token this
Token ID428
Feature activation-1.54e-02

Pos logit contributions
ifying+0.001
 Is+0.001
 Ashley+0.001
 awoke+0.001
 bit+0.001
Neg logit contributions
bled-0.002
uddled-0.002
 olds-0.002
iggle-0.002
ails-0.002
Token cake
Token ID12187
Feature activation-1.61e-03

Pos logit contributions
ifying+0.026
 Is+0.024
 Ashley+0.023
Lou+0.023
 distance+0.023
Neg logit contributions
bled-0.024
uddled-0.024
castle-0.023
iggle-0.022
uddle-0.022
Token is
Token ID318
Feature activation-9.88e-04

Pos logit contributions
ifying+0.002
 Is+0.002
 Ashley+0.002
 awoke+0.002
 bit+0.002
Neg logit contributions
bled-0.003
uddled-0.003
 olds-0.003
iggle-0.003
ails-0.003
Token became
Token ID2627
Feature activation+2.73e-02

Pos logit contributions
bled+0.043
uddled+0.041
 olds+0.040
ires+0.039
agons+0.038
Neg logit contributions
 bit-0.027
ifying-0.026
 awoke-0.026
 care-0.026
 stroke-0.025
Token best
Token ID1266
Feature activation+3.74e-02

Pos logit contributions
uddled+0.044
bled+0.040
 olds+0.038
uddle+0.037
iggle+0.036
Neg logit contributions
 bit-0.047
 care-0.047
 total-0.045
ifying-0.045
 single-0.045
Token friends
Token ID2460
Feature activation-1.25e-02

Pos logit contributions
bled+0.061
uddled+0.053
mm+0.050
 olds+0.050
castle+0.047
Neg logit contributions
 care-0.066
 bit-0.063
 single-0.061
 stroke-0.060
ifying-0.060
Token.
Token ID13
Feature activation+2.45e-02

Pos logit contributions
ifying+0.020
 Is+0.018
 Ashley+0.018
 Debbie+0.018
 awoke+0.018
Neg logit contributions
uddled-0.021
bled-0.021
 olds-0.019
iggle-0.019
ails-0.019
Token They
Token ID1119
Feature activation+5.10e-03

Pos logit contributions
bled+0.052
uddled+0.050
ini+0.047
castle+0.047
bles+0.046
Neg logit contributions
ifying-0.031
 Is-0.030
 Will-0.029
 Ashley-0.029
 care-0.027
Token played
Token ID2826
Feature activation+9.39e-02

Pos logit contributions
bled+0.009
uddled+0.009
 olds+0.009
iggle+0.008
ails+0.008
Neg logit contributions
ifying-0.007
 bit-0.007
 Is-0.007
 awoke-0.007
 Ashley-0.007
Token together
Token ID1978
Feature activation-4.11e-03

Pos logit contributions
ini+0.236
 olds+0.230
ower+0.222
uddled+0.215
iqu+0.214
Neg logit contributions
 away-0.089
 to-0.079
 on-0.076
 after-0.072
 heard-0.070
Token in
Token ID287
Feature activation+2.09e-02

Pos logit contributions
ifying+0.006
 Is+0.006
 Ashley+0.006
 awoke+0.006
 bit+0.005
Neg logit contributions
bled-0.007
uddled-0.007
iggle-0.007
 olds-0.007
ails-0.007
Token the
Token ID262
Feature activation+2.96e-02

Pos logit contributions
uddled+0.060
bled+0.059
uddle+0.055
ails+0.054
 olds+0.054
Neg logit contributions
ifying-0.011
 bit-0.011
 distance-0.011
 total-0.011
 care-0.010
Token park
Token ID3952
Feature activation+2.63e-02

Pos logit contributions
bled+0.060
uddled+0.056
 olds+0.053
bles+0.053
iggle+0.052
Neg logit contributions
 distance-0.045
 single-0.041
 bit-0.040
 care-0.039
ifying-0.039
Token every
Token ID790
Feature activation-3.47e-03

Pos logit contributions
bled+0.057
uddled+0.055
 olds+0.053
agons+0.053
ini+0.052
Neg logit contributions
 bit-0.029
ifying-0.027
 care-0.027
 single-0.027
 Is-0.027
Token.
Token ID13
Feature activation-2.79e-03

Pos logit contributions
ifying+0.012
 Is+0.011
 Ashley+0.011
 awoke+0.011
 bit+0.011
Neg logit contributions
bled-0.014
uddled-0.014
 olds-0.013
iggle-0.013
ails-0.013
Token After
Token ID2293
Feature activation+3.31e-02

Pos logit contributions
ifying+0.004
 Is+0.003
 bit+0.003
 Ashley+0.003
 awoke+0.003
Neg logit contributions
bled-0.005
uddled-0.005
 olds-0.005
iggle-0.005
ails-0.005
Token the
Token ID262
Feature activation+6.63e-02

Pos logit contributions
uddled+0.072
uddle+0.072
ini+0.071
ans+0.070
bled+0.070
Neg logit contributions
 Is-0.035
ifying-0.035
 care-0.035
 Will-0.034
 bit-0.033
Token boy
Token ID2933
Feature activation+1.66e-02

Pos logit contributions
bled+0.133
uddled+0.131
irled+0.128
urg+0.122
iggle+0.122
Neg logit contributions
 bit-0.101
 distance-0.100
 care-0.097
 night-0.089
 bite-0.088
Token finished
Token ID5201
Feature activation+4.04e-02

Pos logit contributions
bled+0.036
uddled+0.035
 olds+0.033
agons+0.033
iggle+0.033
Neg logit contributions
ifying-0.018
 bit-0.018
 Is-0.018
 awoke-0.017
 care-0.017
Token his
Token ID465
Feature activation+9.25e-02

Pos logit contributions
uddled+0.082
bled+0.078
ini+0.074
 olds+0.073
add+0.073
Neg logit contributions
 Is-0.048
 away-0.048
 care-0.048
 single-0.047
ifying-0.047
Token swing
Token ID9628
Feature activation+3.99e-02

Pos logit contributions
bled+0.134
uddled+0.123
iders+0.119
iring+0.117
add+0.117
Neg logit contributions
 care-0.176
 bit-0.173
 bite-0.171
 to-0.163
 single-0.162
Token,
Token ID11
Feature activation+6.73e-03

Pos logit contributions
bled+0.085
uddled+0.082
 olds+0.079
ires+0.079
add+0.079
Neg logit contributions
ifying-0.044
 bit-0.043
 distance-0.041
 Is-0.041
 away-0.041
Token he
Token ID339
Feature activation+1.09e-02

Pos logit contributions
bled+0.014
uddled+0.014
iggle+0.013
 olds+0.013
ails+0.013
Neg logit contributions
ifying-0.008
 Is-0.007
 Ashley-0.007
 bit-0.007
 awoke-0.007
Token shared
Token ID4888
Feature activation+5.82e-03

Pos logit contributions
bled+0.022
uddled+0.021
 olds+0.021
iggle+0.020
ails+0.020
Neg logit contributions
ifying-0.013
 bit-0.013
 Is-0.013
 awoke-0.013
 Ashley-0.012
Token the
Token ID262
Feature activation+7.89e-02

Pos logit contributions
bled+0.013
uddled+0.013
 olds+0.013
iggle+0.013
ails+0.012
Neg logit contributions
ifying-0.006
 Is-0.005
 bit-0.005
 Ashley-0.005
 awoke-0.005
TokenBob
Token ID18861
Feature activation+2.28e-03

Pos logit contributions
ifying+0.010
 Is+0.009
 bit+0.009
 Ashley+0.009
 awoke+0.009
Neg logit contributions
bled-0.014
uddled-0.014
 olds-0.013
iggle-0.013
ails-0.013
Token was
Token ID373
Feature activation-1.14e-03

Pos logit contributions
bled+0.004
uddled+0.004
 olds+0.004
iggle+0.004
ails+0.004
Neg logit contributions
ifying-0.003
 Is-0.003
 bit-0.003
 Ashley-0.003
 awoke-0.003
Token so
Token ID523
Feature activation+3.77e-03

Pos logit contributions
ifying+0.002
 Is+0.002
 Ashley+0.002
 bit+0.002
 awoke+0.002
Neg logit contributions
bled-0.002
uddled-0.002
 olds-0.002
iggle-0.002
ails-0.002
Token happy
Token ID3772
Feature activation+3.86e-04

Pos logit contributions
bled+0.005
uddled+0.005
 olds+0.005
iggle+0.005
ails+0.005
Neg logit contributions
ifying-0.007
 Is-0.007
 bit-0.007
 Ashley-0.007
 awoke-0.007
Token with
Token ID351
Feature activation+2.19e-02

Pos logit contributions
bled+0.001
uddled+0.001
 olds+0.001
iggle+0.001
ails+0.001
Neg logit contributions
ifying-0.001
 Is-0.001
 Ashley-0.000
 bit-0.000
 awoke-0.000
Token his
Token ID465
Feature activation+9.13e-02

Pos logit contributions
uddled+0.057
uddle+0.054
bled+0.054
ini+0.053
ails+0.052
Neg logit contributions
 care-0.016
 Is-0.015
ifying-0.015
 Ashley-0.015
 Debbie-0.014
Token cool
Token ID3608
Feature activation+7.71e-02

Pos logit contributions
uddled+0.127
bled+0.126
rim+0.119
mm+0.119
ator+0.115
Neg logit contributions
 care-0.174
 bite-0.158
 single-0.158
 try-0.158
 bit-0.157
Token golf
Token ID13126
Feature activation+7.75e-02

Pos logit contributions
bled+0.106
uddled+0.096
mm+0.093
add+0.087
ires+0.087
Neg logit contributions
 care-0.158
 single-0.147
 bit-0.145
 away-0.142
 to-0.138
Token shot
Token ID2823
Feature activation+1.36e-02

Pos logit contributions
add+0.143
bled+0.140
urg+0.137
ires+0.137
 olds+0.136
Neg logit contributions
 bit-0.110
 away-0.104
 will-0.103
 single-0.098
 distance-0.098
Token.
Token ID13
Feature activation+8.13e-03

Pos logit contributions
bled+0.027
uddled+0.027
 olds+0.025
iggle+0.025
ails+0.025
Neg logit contributions
ifying-0.018
 Is-0.016
 bit-0.016
 awoke-0.016
 Ashley-0.015
Token He
Token ID679
Feature activation+1.58e-02

Pos logit contributions
bled+0.017
uddled+0.016
iggle+0.015
 olds+0.015
uddle+0.015
Neg logit contributions
ifying-0.010
 Is-0.010
 Ashley-0.009
 awoke-0.009
 bit-0.009
Token for
Token ID329
Feature activation+2.16e-02

Pos logit contributions
ifying+0.005
 Is+0.005
 Ashley+0.005
 awoke+0.005
 bit+0.005
Neg logit contributions
bled-0.007
uddled-0.007
 olds-0.006
iggle-0.006
ails-0.006
Token his
Token ID465
Feature activation+6.44e-02

Pos logit contributions
uddled+0.041
uddle+0.040
 olds+0.039
ails+0.038
ans+0.038
Neg logit contributions
 care-0.029
 Is-0.029
ifying-0.029
 bit-0.028
 Will-0.028
Token help
Token ID1037
Feature activation-9.51e-04

Pos logit contributions
bled+0.147
uddled+0.142
ires+0.138
iring+0.133
ails+0.132
Neg logit contributions
 care-0.081
 bit-0.077
 bite-0.073
 single-0.069
 to-0.067
Token.
Token ID13
Feature activation+1.72e-02

Pos logit contributions
ifying+0.001
 Is+0.001
 Ashley+0.001
 awoke+0.001
 bit+0.001
Neg logit contributions
bled-0.002
uddled-0.002
 olds-0.002
iggle-0.002
ails-0.002
Token They
Token ID1119
Feature activation+1.71e-02

Pos logit contributions
bled+0.039
uddled+0.037
iggle+0.036
castle+0.036
bles+0.036
Neg logit contributions
ifying-0.017
 Is-0.017
 Ashley-0.017
 Will-0.016
 Debbie-0.016
Token played
Token ID2826
Feature activation+9.11e-02

Pos logit contributions
bled+0.033
uddled+0.032
 olds+0.032
ires+0.031
ails+0.031
Neg logit contributions
ifying-0.022
 bit-0.022
 awoke-0.021
 care-0.020
 Is-0.020
Token golf
Token ID13126
Feature activation+7.26e-02

Pos logit contributions
ini+0.202
ower+0.184
mm+0.182
rim+0.179
urg+0.178
Neg logit contributions
 away-0.131
 after-0.115
 to-0.106
 the-0.105
 care-0.104
Token together
Token ID1978
Feature activation+3.17e-03

Pos logit contributions
bled+0.136
ini+0.134
ires+0.133
uddled+0.132
add+0.132
Neg logit contributions
 away-0.098
 bit-0.093
 will-0.086
 heard-0.084
 care-0.083
Token and
Token ID290
Feature activation+2.66e-02

Pos logit contributions
bled+0.006
uddled+0.006
 olds+0.005
iggle+0.005
ails+0.005
Neg logit contributions
ifying-0.005
 Is-0.004
 bit-0.004
 Ashley-0.004
 awoke-0.004
Token had
Token ID550
Feature activation+1.98e-02

Pos logit contributions
uddled+0.063
bled+0.062
ires+0.062
agons+0.061
uddle+0.061
Neg logit contributions
 bit-0.024
 care-0.023
 Is-0.022
 Will-0.022
ifying-0.022
Token a
Token ID257
Feature activation+3.29e-03

Pos logit contributions
bled+0.040
uddled+0.040
ails+0.038
ires+0.038
iggle+0.038
Neg logit contributions
ifying-0.024
 bit-0.023
 care-0.023
 total-0.022
 awoke-0.022
Token
Token ID198
Feature activation-6.72e-03

Pos logit contributions
bled+0.017
uddled+0.016
iggle+0.016
 olds+0.015
castle+0.015
Neg logit contributions
ifying-0.007
 Is-0.007
 Ashley-0.006
 awoke-0.006
 bit-0.006
Token
Token ID198
Feature activation-1.07e-02

Pos logit contributions
ifying+0.008
 Is+0.007
 bit+0.007
 Ashley+0.007
 awoke+0.007
Neg logit contributions
bled-0.014
uddled-0.014
iggle-0.013
 olds-0.013
ails-0.013
TokenOne
Token ID3198
Feature activation-1.23e-02

Pos logit contributions
ifying+0.011
 Is+0.010
 Ashley+0.010
 awoke+0.010
 bit+0.010
Neg logit contributions
bled-0.023
uddled-0.023
 golf-0.023
 olds-0.022
ails-0.022
Token day
Token ID1110
Feature activation-3.04e-02

Pos logit contributions
ifying+0.019
 Is+0.018
 Ashley+0.017
 Debbie+0.017
 awoke+0.017
Neg logit contributions
uddled-0.020
bled-0.020
iggle-0.020
ails-0.019
 golf-0.019
Token,
Token ID11
Feature activation-1.02e-02

Pos logit contributions
ifying+0.066
Am+0.060
 Debbie+0.059
 distance+0.059
Jonathan+0.059
Neg logit contributions
bled-0.035
uddled-0.034
 golf-0.033
ails-0.031
iggle-0.031
Token an
Token ID281
Feature activation+9.11e-02

Pos logit contributions
ifying+0.014
 awoke+0.012
 Is+0.012
 Ashley+0.012
 Debbie+0.012
Neg logit contributions
bled-0.020
uddled-0.019
 golf-0.019
 olds-0.018
iggle-0.018
Token especially
Token ID2592
Feature activation+3.92e-02

Pos logit contributions
uddled+0.203
bled+0.192
uddle+0.179
ades+0.172
irled+0.167
Neg logit contributions
 night-0.122
 bit-0.117
 awoke-0.113
 single-0.112
 month-0.111
Token brave
Token ID14802
Feature activation+2.73e-02

Pos logit contributions
bled+0.072
uddled+0.070
castle+0.067
mm+0.066
agons+0.065
Neg logit contributions
 bit-0.057
 single-0.055
ifying-0.054
 care-0.053
 heard-0.052
Token and
Token ID290
Feature activation+2.27e-02

Pos logit contributions
bled+0.063
uddled+0.061
irled+0.056
mm+0.054
castle+0.054
Neg logit contributions
 bit-0.031
 single-0.029
ifying-0.029
 awoke-0.028
 Is-0.028
Token curious
Token ID11040
Feature activation+1.48e-03

Pos logit contributions
uddled+0.041
bled+0.040
iggle+0.038
ails+0.037
uddle+0.037
Neg logit contributions
 bit-0.035
ifying-0.034
 Is-0.034
 care-0.033
 Ashley-0.033
Token squirrel
Token ID33039
Feature activation+3.63e-03

Pos logit contributions
bled+0.003
uddled+0.003
 olds+0.003
iggle+0.003
ails+0.003
Neg logit contributions
ifying-0.002
 Is-0.002
 Ashley-0.002
 bit-0.002
 awoke-0.002
Token
Token ID198
Feature activation-1.78e-02

Pos logit contributions
bled+0.063
uddled+0.062
 olds+0.059
castle+0.059
iggle+0.058
Neg logit contributions
ifying-0.020
 Is-0.020
 Ashley-0.019
 Debbie-0.018
Am-0.017
Token
Token ID198
Feature activation-1.91e-02

Pos logit contributions
ifying+0.026
 bit+0.023
 awoke+0.022
 Is+0.022
 Ashley+0.021
Neg logit contributions
uddled-0.033
bled-0.033
ails-0.031
iggle-0.031
 olds-0.031
TokenOne
Token ID3198
Feature activation+2.43e-03

Pos logit contributions
ifying+0.025
 awoke+0.021
 Is+0.021
 bit+0.021
 Debbie+0.020
Neg logit contributions
uddled-0.037
 golf-0.037
bled-0.036
 olds-0.035
ails-0.035
Token day
Token ID1110
Feature activation-3.11e-02

Pos logit contributions
bled+0.005
uddled+0.005
 olds+0.005
iggle+0.005
ails+0.005
Neg logit contributions
ifying-0.003
 Is-0.003
 bit-0.003
 Ashley-0.003
 awoke-0.003
Token,
Token ID11
Feature activation-2.25e-02

Pos logit contributions
ifying+0.065
 distance+0.060
Jonathan+0.059
 awoke+0.058
 total+0.058
Neg logit contributions
uddled-0.036
 golf-0.035
bled-0.035
ails-0.032
 olds-0.031
Token an
Token ID281
Feature activation+9.10e-02

Pos logit contributions
ifying+0.032
 awoke+0.030
Jonathan+0.028
 Is+0.027
 single+0.027
Neg logit contributions
bled-0.041
uddled-0.040
 golf-0.040
 olds-0.038
iggle-0.037
Token unhappy
Token ID19283
Feature activation+2.18e-02

Pos logit contributions
uddled+0.184
bled+0.172
ades+0.166
uddle+0.163
kins+0.155
Neg logit contributions
 night-0.130
 bit-0.122
 single-0.122
 baby-0.120
 awoke-0.117
Token thing
Token ID1517
Feature activation-1.63e-02

Pos logit contributions
bled+0.046
uddled+0.045
 olds+0.041
ails+0.041
iggle+0.041
Neg logit contributions
 bit-0.026
 Is-0.025
 awoke-0.025
ifying-0.025
 single-0.025
Token happened
Token ID3022
Feature activation+1.89e-02

Pos logit contributions
ifying+0.024
 Ashley+0.022
 Is+0.022
 Debbie+0.021
Jonathan+0.021
Neg logit contributions
uddled-0.030
bled-0.030
iggle-0.027
 olds-0.027
ails-0.027
Token.
Token ID13
Feature activation-3.45e-03

Pos logit contributions
bled+0.040
uddled+0.039
 olds+0.038
castle+0.038
uddle+0.038
Neg logit contributions
ifying-0.021
 Is-0.019
 bit-0.019
 single-0.019
 awoke-0.018
Token The
Token ID383
Feature activation+4.90e-02

Pos logit contributions
ifying+0.004
 Is+0.004
 awoke+0.004
 bit+0.004
 Ashley+0.004
Neg logit contributions
bled-0.007
uddled-0.007
 olds-0.006
iggle-0.006
ails-0.006
Token.
Token ID13
Feature activation+5.70e-02

Pos logit contributions
bled+0.003
uddled+0.003
 olds+0.003
iggle+0.003
ails+0.003
Neg logit contributions
ifying-0.002
 Is-0.002
 bit-0.002
 Ashley-0.002
 awoke-0.002
Token Sally
Token ID25737
Feature activation+1.15e-02

Pos logit contributions
bled+0.129
bles+0.128
agons+0.127
ini+0.126
kins+0.125
Neg logit contributions
 Is-0.058
 bit-0.054
 Will-0.053
ifying-0.052
 While-0.049
Token said
Token ID531
Feature activation+5.37e-04

Pos logit contributions
bled+0.023
uddled+0.022
 olds+0.021
iggle+0.021
ails+0.021
Neg logit contributions
 bit-0.014
ifying-0.014
 Is-0.014
 awoke-0.014
 Ashley-0.014
Token,
Token ID11
Feature activation+1.13e-02

Pos logit contributions
bled+0.001
uddled+0.001
 olds+0.001
iggle+0.001
ails+0.001
Neg logit contributions
ifying-0.001
 Is-0.001
 Ashley-0.001
 bit-0.001
 awoke-0.001
Token "
Token ID366
Feature activation-1.55e-02

Pos logit contributions
bled+0.020
uddled+0.020
iggle+0.019
 olds+0.019
uddle+0.019
Neg logit contributions
ifying-0.016
 Is-0.016
 Ashley-0.015
 bit-0.015
 awoke-0.015
TokenM
Token ID44
Feature activation+9.07e-02

Pos logit contributions
ifying+0.021
 awoke+0.019
 Ashley+0.019
 Is+0.018
 Debbie+0.018
Neg logit contributions
bled-0.030
uddled-0.030
ini-0.028
 golf-0.028
ails-0.028
Tokenmm
Token ID3020
Feature activation+4.54e-02

Pos logit contributions
bled+0.137
bes+0.108
uring+0.107
 burying+0.107
uddled+0.105
Neg logit contributions
ah-0.135
 single-0.133
of-0.132
ifying-0.132
 total-0.129
Token,
Token ID11
Feature activation-1.03e-04

Pos logit contributions
bled+0.108
uddled+0.104
ini+0.100
 olds+0.100
 clubhouse+0.098
Neg logit contributions
 bit-0.040
 care-0.033
 will-0.031
ifying-0.031
 Ashley-0.031
Token this
Token ID428
Feature activation-1.37e-02

Pos logit contributions
ifying+0.000
 Is+0.000
 Ashley+0.000
 bit+0.000
 awoke+0.000
Neg logit contributions
bled-0.000
uddled-0.000
 olds-0.000
iggle-0.000
ails-0.000
Token cake
Token ID12187
Feature activation-5.23e-05

Pos logit contributions
ifying+0.023
 Is+0.021
 Ashley+0.021
Lou+0.020
Jonathan+0.020
Neg logit contributions
bled-0.022
uddled-0.022
castle-0.021
iggle-0.020
uddle-0.020
Token is
Token ID318
Feature activation-8.85e-04

Pos logit contributions
ifying+0.000
 Is+0.000
 Ashley+0.000
 bit+0.000
 awoke+0.000
Neg logit contributions
bled-0.000
uddled-0.000
 olds-0.000
iggle-0.000
ails-0.000
Token wanted
Token ID2227
Feature activation-1.20e-02

Pos logit contributions
bled+0.011
uddled+0.011
 olds+0.010
iggle+0.010
ails+0.010
Neg logit contributions
ifying-0.005
 Is-0.004
 bit-0.004
 awoke-0.004
 Ashley-0.004
Token to
Token ID284
Feature activation+7.68e-03

Pos logit contributions
ifying+0.011
 Is+0.009
 Ashley+0.008
 awoke+0.008
Jonathan+0.008
Neg logit contributions
bled-0.029
uddled-0.028
 olds-0.028
 golf-0.028
ails-0.027
Token make
Token ID787
Feature activation+3.69e-02

Pos logit contributions
bled+0.016
uddled+0.016
 olds+0.015
ails+0.015
iggle+0.015
Neg logit contributions
ifying-0.009
 Is-0.008
 bit-0.008
 care-0.008
 Ashley-0.008
Token his
Token ID465
Feature activation+2.66e-02

Pos logit contributions
uddled+0.094
bled+0.091
ails+0.086
agons+0.085
 olds+0.085
Neg logit contributions
 care-0.029
 Ashley-0.027
 Is-0.026
 Will-0.025
 away-0.025
Token mum
Token ID25682
Feature activation-8.66e-04

Pos logit contributions
bled+0.053
uddled+0.052
ails+0.048
 olds+0.047
bles+0.047
Neg logit contributions
 bit-0.035
 distance-0.035
 single-0.034
 care-0.034
ifying-0.034
Token an
Token ID281
Feature activation+9.06e-02

Pos logit contributions
ifying+0.001
 Is+0.001
 Ashley+0.001
 bit+0.001
 awoke+0.001
Neg logit contributions
bled-0.002
uddled-0.001
 olds-0.001
iggle-0.001
ails-0.001
Token honest
Token ID5508
Feature activation+2.21e-02

Pos logit contributions
uddled+0.171
bled+0.168
bles+0.157
ails+0.155
kins+0.154
Neg logit contributions
 bit-0.128
 care-0.121
 night-0.121
 away-0.120
 bite-0.117
Token offer
Token ID2897
Feature activation-9.53e-03

Pos logit contributions
bled+0.034
uddled+0.033
ires+0.029
 olds+0.029
ails+0.029
Neg logit contributions
 bit-0.039
 care-0.037
ifying-0.037
 Is-0.037
 single-0.037
Token.
Token ID13
Feature activation-1.29e-02

Pos logit contributions
ifying+0.014
 Is+0.013
 Ashley+0.013
 awoke+0.013
 bit+0.012
Neg logit contributions
bled-0.017
uddled-0.017
 olds-0.016
iggle-0.016
ails-0.016
Token
Token ID220
Feature activation+3.60e-02

Pos logit contributions
ifying+0.021
 bit+0.019
 awoke+0.019
 Is+0.019
 Ashley+0.019
Neg logit contributions
bled-0.021
uddled-0.021
 olds-0.020
ails-0.019
iggle-0.019
Token
Token ID198
Feature activation-1.43e-03

Pos logit contributions
bled+0.090
uddled+0.089
castle+0.087
 olds+0.086
iggle+0.085
Neg logit contributions
 Is-0.025
 Ashley-0.024
ifying-0.023
 Debbie-0.022
Jonathan-0.020
Token in
Token ID287
Feature activation-3.57e-03

Pos logit contributions
uddled+0.109
bled+0.107
ini+0.106
mm+0.106
 olds+0.106
Neg logit contributions
 away-0.050
 bit-0.048
 care-0.046
 Is-0.044
 Will-0.043
Token the
Token ID262
Feature activation+4.56e-02

Pos logit contributions
ifying+0.004
 Is+0.003
 Ashley+0.003
 awoke+0.003
 bit+0.003
Neg logit contributions
bled-0.008
uddled-0.008
 olds-0.008
iggle-0.008
 golf-0.007
Token mud
Token ID17492
Feature activation+3.20e-02

Pos logit contributions
bled+0.092
uddled+0.089
bles+0.084
iggle+0.083
ails+0.083
Neg logit contributions
 distance-0.073
 night-0.063
 single-0.062
ifying-0.061
 bit-0.061
Token.
Token ID13
Feature activation-1.01e-02

Pos logit contributions
bled+0.065
uddled+0.063
 olds+0.062
agons+0.060
ires+0.059
Neg logit contributions
 bit-0.038
ifying-0.037
 distance-0.036
 awoke-0.034
 single-0.034
Token What
Token ID1867
Feature activation+4.56e-03

Pos logit contributions
ifying+0.015
 bit+0.014
 awoke+0.013
 Ashley+0.013
 Is+0.013
Neg logit contributions
bled-0.018
uddled-0.018
 olds-0.017
ails-0.017
iggle-0.017
Token an
Token ID281
Feature activation+9.06e-02

Pos logit contributions
bled+0.009
uddled+0.009
 olds+0.009
iggle+0.009
ails+0.009
Neg logit contributions
ifying-0.006
 Is-0.005
 Ashley-0.005
 awoke-0.005
 bit-0.005
Token adventure
Token ID8855
Feature activation+2.76e-02

Pos logit contributions
uddled+0.188
bled+0.178
kins+0.169
mons+0.168
bles+0.164
Neg logit contributions
 night-0.126
 care-0.119
 single-0.119
 need-0.114
 distance-0.110
Token!
Token ID0
Feature activation-1.67e-02

Pos logit contributions
bled+0.058
uddled+0.057
 olds+0.053
iggle+0.053
add+0.053
Neg logit contributions
ifying-0.032
 Is-0.030
 awoke-0.030
 bit-0.030
 Ashley-0.029
Token<|endoftext|>
Token ID50256
Feature activation-1.64e-02

Pos logit contributions
ifying+0.023
 bit+0.022
 awoke+0.020
 total+0.020
 Ashley+0.020
Neg logit contributions
bled-0.031
uddled-0.030
ails-0.029
 olds-0.029
 golf-0.029
TokenOnce
Token ID7454
Feature activation-1.51e-02

Pos logit contributions
ifying+0.019
 bit+0.018
 Is+0.017
 care+0.017
 Ashley+0.017
Neg logit contributions
bled-0.034
uddled-0.033
ails-0.032
 olds-0.031
iggle-0.031
Token upon
Token ID2402
Feature activation-6.14e-04

Pos logit contributions
ifying+0.016
 Is+0.014
 bit+0.014
 Ashley+0.014
 awoke+0.013
Neg logit contributions
bled-0.034
uddled-0.033
iggle-0.032
ails-0.032
 olds-0.031
Token selfish
Token ID20363
Feature activation-3.18e-03

Pos logit contributions
bled+0.013
uddled+0.012
iggle+0.011
 olds+0.011
ails+0.011
Neg logit contributions
ifying-0.013
 bit-0.013
 Is-0.013
 Ashley-0.013
 care-0.013
Token.
Token ID13
Feature activation+3.23e-02

Pos logit contributions
ifying+0.005
 Is+0.004
 Ashley+0.004
 awoke+0.004
 bit+0.004
Neg logit contributions
bled-0.006
uddled-0.006
 olds-0.006
iggle-0.005
ails-0.005
Token One
Token ID1881
Feature activation+7.50e-03

Pos logit contributions
bled+0.083
uddled+0.078
irled+0.073
bles+0.072
agons+0.072
Neg logit contributions
 Is-0.027
 Will-0.026
 Ashley-0.026
ifying-0.025
 distance-0.024
Token day
Token ID1110
Feature activation-1.39e-02

Pos logit contributions
bled+0.017
uddled+0.016
 olds+0.016
iggle+0.015
bles+0.015
Neg logit contributions
ifying-0.008
 bit-0.008
 single-0.008
 Is-0.008
 awoke-0.007
Token,
Token ID11
Feature activation-4.12e-04

Pos logit contributions
ifying+0.024
 awoke+0.022
 distance+0.022
Jonathan+0.022
 Ashley+0.022
Neg logit contributions
bled-0.022
uddled-0.021
ails-0.020
iggle-0.020
 golf-0.020
Token M
Token ID337
Feature activation+8.98e-02

Pos logit contributions
ifying+0.001
 Is+0.001
 Ashley+0.001
 awoke+0.001
 bit+0.001
Neg logit contributions
bled-0.001
uddled-0.001
 olds-0.001
iggle-0.001
ails-0.001
Tokenandy
Token ID10757
Feature activation+4.47e-02

Pos logit contributions
bled+0.180
oving+0.160
 golf+0.152
yard+0.149
renches+0.147
Neg logit contributions
bie-0.114
 night-0.101
y-0.101
 Mae-0.099
 Will-0.094
Token got
Token ID1392
Feature activation-9.55e-06

Pos logit contributions
bled+0.109
ires+0.099
agons+0.099
uddled+0.098
iggle+0.097
Neg logit contributions
 bit-0.046
 awoke-0.042
 heard-0.041
 woke-0.039
 Ashley-0.038
Token an
Token ID281
Feature activation+6.14e-02

Feature not active on prev token;
no predictions were affected.
Token invitation
Token ID17023
Feature activation-8.47e-03

Pos logit contributions
uddled+0.101
bled+0.100
bles+0.088
ails+0.088
uddle+0.088
Neg logit contributions
 bit-0.104
 care-0.098
 single-0.097
 night-0.095
 away-0.093
Token to
Token ID284
Feature activation+4.75e-02

Pos logit contributions
ifying+0.009
 Is+0.008
 Ashley+0.007
 awoke+0.007
 Debbie+0.007
Neg logit contributions
uddled-0.019
bled-0.019
iggle-0.018
 olds-0.018
ails-0.018
Token.
Token ID13
Feature activation+3.80e-03

Pos logit contributions
bled+0.014
uddled+0.014
 olds+0.013
ails+0.013
uddle+0.013
Neg logit contributions
ifying-0.009
 Is-0.009
 bit-0.009
 Ashley-0.009
 care-0.008
Token
Token ID220
Feature activation+3.18e-02

Pos logit contributions
bled+0.007
uddled+0.007
iggle+0.006
 olds+0.006
ails+0.006
Neg logit contributions
ifying-0.006
 Is-0.005
 Ashley-0.005
 awoke-0.005
 bit-0.005
Token
Token ID198
Feature activation-1.19e-02

Pos logit contributions
bled+0.079
uddled+0.079
castle+0.076
 olds+0.076
ini+0.074
Neg logit contributions
 Is-0.022
ifying-0.022
 Ashley-0.020
 Will-0.019
 Debbie-0.018
TokenIt
Token ID1026
Feature activation+1.04e-02

Pos logit contributions
ifying+0.016
 bit+0.015
 Is+0.014
 awoke+0.014
 Ashley+0.014
Neg logit contributions
bled-0.023
uddled-0.023
 olds-0.021
iggle-0.021
ails-0.021
Token was
Token ID373
Feature activation+1.15e-02

Pos logit contributions
bled+0.021
uddled+0.020
 olds+0.019
ails+0.019
iggle+0.019
Neg logit contributions
ifying-0.013
 bit-0.013
 Is-0.013
 awoke-0.012
 Ashley-0.012
Token an
Token ID281
Feature activation+8.94e-02

Pos logit contributions
uddled+0.024
bled+0.024
 olds+0.023
ails+0.023
uddle+0.022
Neg logit contributions
ifying-0.014
 Is-0.013
 bit-0.012
 awoke-0.012
 Ashley-0.012
Token important
Token ID1593
Feature activation+5.62e-02

Pos logit contributions
uddled+0.180
bled+0.168
uddle+0.156
kins+0.149
ails+0.149
Neg logit contributions
 night-0.133
 bit-0.132
 care-0.126
 single-0.122
 distance-0.121
Token day
Token ID1110
Feature activation+3.23e-03

Pos logit contributions
bled+0.127
uddled+0.117
ires+0.109
 olds+0.108
irled+0.107
Neg logit contributions
 single-0.076
 care-0.074
 bit-0.071
 night-0.066
 month-0.065
Token for
Token ID329
Feature activation+3.35e-02

Pos logit contributions
bled+0.006
uddled+0.006
 olds+0.006
iggle+0.006
ails+0.006
Neg logit contributions
ifying-0.004
 Is-0.004
 bit-0.004
 Ashley-0.004
 awoke-0.004
Token the
Token ID262
Feature activation+5.15e-02

Pos logit contributions
uddled+0.078
uddle+0.074
bled+0.074
ini+0.072
irled+0.070
Neg logit contributions
 care-0.033
 Will-0.033
ifying-0.032
 Is-0.031
 bit-0.031
Token man
Token ID582
Feature activation-6.42e-03

Pos logit contributions
irled+0.102
bled+0.102
uddled+0.102
iggle+0.094
add+0.092
Neg logit contributions
 distance-0.075
 care-0.075
 bit-0.073
 night-0.071
 single-0.066
Token a
Token ID257
Feature activation+1.94e-02

Pos logit contributions
ifying+0.004
 Is+0.004
 awoke+0.004
 Ashley+0.004
 bit+0.004
Neg logit contributions
bled-0.006
uddled-0.006
 olds-0.005
iggle-0.005
 golf-0.005
Token very
Token ID845
Feature activation+5.02e-02

Pos logit contributions
bled+0.039
uddled+0.037
 olds+0.035
ails+0.035
iggle+0.034
Neg logit contributions
 bit-0.026
ifying-0.026
 single-0.025
 Ashley-0.024
 total-0.024
Token special
Token ID2041
Feature activation+2.61e-02

Pos logit contributions
bled+0.107
uddled+0.103
 olds+0.091
urg+0.090
icking+0.090
Neg logit contributions
 single-0.067
 bit-0.067
 care-0.064
 month-0.060
 minute-0.058
Token friend
Token ID1545
Feature activation-6.44e-03

Pos logit contributions
bled+0.053
uddled+0.050
 olds+0.048
iggle+0.046
ires+0.046
Neg logit contributions
 bit-0.034
 single-0.033
ifying-0.032
 care-0.032
 awoke-0.031
Token,
Token ID11
Feature activation+7.89e-03

Pos logit contributions
ifying+0.011
 Is+0.010
 awoke+0.010
 Ashley+0.009
 bit+0.009
Neg logit contributions
bled-0.010
uddled-0.010
 olds-0.010
ails-0.010
iggle-0.010
Token an
Token ID281
Feature activation+8.91e-02

Pos logit contributions
bled+0.016
uddled+0.016
 olds+0.015
iggle+0.015
uddle+0.015
Neg logit contributions
ifying-0.010
 Is-0.009
 Ashley-0.009
 bit-0.009
 awoke-0.009
Token old
Token ID1468
Feature activation+2.11e-02

Pos logit contributions
uddled+0.186
bled+0.183
uddle+0.170
ades+0.165
kins+0.160
Neg logit contributions
 night-0.119
 baby-0.112
 awoke-0.107
 bit-0.107
Am-0.106
Token turtle
Token ID28699
Feature activation-1.76e-02

Pos logit contributions
bled+0.045
uddled+0.043
iggle+0.041
 olds+0.040
irled+0.040
Neg logit contributions
 bit-0.025
ifying-0.024
 Ashley-0.024
 Is-0.023
 awoke-0.023
Token.
Token ID13
Feature activation-1.25e-02

Pos logit contributions
ifying+0.033
 Is+0.031
 Ashley+0.030
 awoke+0.030
 Debbie+0.030
Neg logit contributions
uddled-0.024
bled-0.023
ails-0.023
 olds-0.022
 golf-0.022
Token She
Token ID1375
Feature activation+7.85e-03

Pos logit contributions
ifying+0.020
 bit+0.018
 awoke+0.018
 single+0.017
 Is+0.017
Neg logit contributions
uddled-0.020
 olds-0.020
bled-0.020
ails-0.020
 golf-0.019
Token liked
Token ID8288
Feature activation-2.10e-02

Pos logit contributions
bled+0.015
uddled+0.015
 olds+0.014
iggle+0.014
ails+0.014
Neg logit contributions
ifying-0.011
 bit-0.010
 Is-0.010
 awoke-0.010
 Ashley-0.010
Token One
Token ID1881
Feature activation+6.80e-03

Pos logit contributions
bled+0.027
uddled+0.026
iggle+0.025
castle+0.024
 olds+0.024
Neg logit contributions
ifying-0.013
 Is-0.012
 Ashley-0.011
 Will-0.011
 awoke-0.011
Token day
Token ID1110
Feature activation-1.60e-02

Pos logit contributions
bled+0.015
uddled+0.014
 olds+0.014
iggle+0.013
ails+0.013
Neg logit contributions
ifying-0.008
 bit-0.007
 Is-0.007
 awoke-0.007
 single-0.007
Token,
Token ID11
Feature activation-1.05e-02

Pos logit contributions
ifying+0.029
 distance+0.027
 awoke+0.026
 Is+0.026
 Ashley+0.026
Neg logit contributions
bled-0.024
uddled-0.024
ails-0.022
iggle-0.022
 golf-0.022
Token there
Token ID612
Feature activation+3.42e-03

Pos logit contributions
ifying+0.019
 awoke+0.018
 Is+0.018
 Ashley+0.018
 Debbie+0.018
Neg logit contributions
bled-0.015
uddled-0.015
 golf-0.014
 olds-0.013
ails-0.013
Token was
Token ID373
Feature activation-3.28e-03

Pos logit contributions
bled+0.006
uddled+0.006
 olds+0.006
iggle+0.006
ails+0.006
Neg logit contributions
ifying-0.005
 Is-0.005
 bit-0.004
 awoke-0.004
 Ashley-0.004
Token an
Token ID281
Feature activation+8.87e-02

Pos logit contributions
ifying+0.004
 Is+0.004
 awoke+0.004
 Ashley+0.004
 bit+0.004
Neg logit contributions
bled-0.006
uddled-0.006
 olds-0.006
iggle-0.006
ails-0.006
Token earthquake
Token ID16295
Feature activation-8.35e-04

Pos logit contributions
uddled+0.187
bled+0.175
uddle+0.164
ades+0.159
irled+0.157
Neg logit contributions
 night-0.128
 bit-0.119
 month-0.112
 baby-0.112
 distance-0.110
Token and
Token ID290
Feature activation-1.18e-02

Pos logit contributions
ifying+0.001
 Is+0.001
 Ashley+0.001
 awoke+0.001
 bit+0.001
Neg logit contributions
bled-0.002
uddled-0.002
 olds-0.001
iggle-0.001
ails-0.001
Token it
Token ID340
Feature activation-1.65e-02

Pos logit contributions
ifying+0.014
 Is+0.012
 awoke+0.012
 Ashley+0.012
Am+0.012
Neg logit contributions
bled-0.025
uddled-0.024
 olds-0.024
iggle-0.023
 golf-0.023
Token made
Token ID925
Feature activation+8.90e-03

Pos logit contributions
ifying+0.024
 Debbie+0.022
 Ashley+0.022
 Is+0.022
Jonathan+0.021
Neg logit contributions
bled-0.031
uddled-0.030
iggle-0.028
irled-0.028
 olds-0.028
Token them
Token ID606
Feature activation-2.95e-02

Pos logit contributions
uddled+0.019
bled+0.019
 olds+0.018
castle+0.018
uddle+0.018
Neg logit contributions
ifying-0.010
 Is-0.009
 Ashley-0.009
 care-0.009
 bit-0.009
Token come
Token ID1282
Feature activation+4.12e-02

Pos logit contributions
bled+0.062
uddled+0.061
 olds+0.059
ails+0.058
ires+0.057
Neg logit contributions
 Is-0.030
 care-0.030
ifying-0.029
 Ashley-0.029
 bit-0.028
Token and
Token ID290
Feature activation+5.19e-03

Pos logit contributions
bled+0.101
uddled+0.101
mm+0.094
 olds+0.093
agons+0.093
Neg logit contributions
ifying-0.034
 Is-0.032
 away-0.032
 care-0.031
 total-0.030
Token play
Token ID711
Feature activation+5.49e-02

Pos logit contributions
bled+0.011
uddled+0.010
 olds+0.010
iggle+0.010
ails+0.010
Neg logit contributions
ifying-0.006
 Is-0.006
 bit-0.006
 Ashley-0.006
 awoke-0.006
Token.
Token ID13
Feature activation-3.39e-02

Pos logit contributions
bled+0.105
uddled+0.101
ini+0.100
 olds+0.098
agons+0.098
Neg logit contributions
 away-0.078
 bit-0.073
 care-0.072
 heard-0.070
 Is-0.069
Token What
Token ID1867
Feature activation-1.65e-03

Pos logit contributions
ifying+0.053
 bit+0.053
 awoke+0.050
 trace+0.049
 total+0.047
Neg logit contributions
ails-0.052
 olds-0.051
uddled-0.050
 golf-0.050
bled-0.049
Token an
Token ID281
Feature activation+8.86e-02

Pos logit contributions
ifying+0.002
 Is+0.002
 Ashley+0.002
 bit+0.002
 awoke+0.002
Neg logit contributions
bled-0.003
uddled-0.003
 olds-0.003
iggle-0.003
ails-0.003
Token incredible
Token ID8082
Feature activation+2.07e-02

Pos logit contributions
bled+0.172
uddled+0.169
bles+0.162
kins+0.161
mons+0.153
Neg logit contributions
 night-0.119
 month-0.116
 care-0.115
 single-0.111
 fright-0.109
Token key
Token ID1994
Feature activation+2.84e-02

Pos logit contributions
bled+0.044
uddled+0.042
 olds+0.039
bles+0.039
ails+0.039
Neg logit contributions
 bit-0.026
 single-0.025
 care-0.024
 Is-0.024
ifying-0.024
Token!
Token ID0
Feature activation-3.48e-02

Pos logit contributions
bled+0.060
uddled+0.058
 olds+0.055
iggle+0.055
add+0.055
Neg logit contributions
ifying-0.034
 Is-0.031
 bit-0.031
 awoke-0.031
 Ashley-0.029
Token<|endoftext|>
Token ID50256
Feature activation-1.28e-02

Pos logit contributions
 bit+0.054
ifying+0.054
 trace+0.051
 awoke+0.050
 total+0.048
Neg logit contributions
ails-0.053
uddled-0.053
bled-0.052
 olds-0.051
mm-0.051
TokenOnce
Token ID7454
Feature activation-1.91e-02

Pos logit contributions
ifying+0.015
 bit+0.014
 Is+0.013
 Ashley+0.013
 care+0.013
Neg logit contributions
bled-0.027
uddled-0.026
ails-0.025
 olds-0.025
iggle-0.025
Token said
Token ID531
Feature activation+2.90e-02

Pos logit contributions
uddled+0.072
bled+0.070
bles+0.068
agons+0.067
 olds+0.067
Neg logit contributions
 care-0.050
 bit-0.049
 Is-0.047
 stroke-0.047
Oh-0.046
Token Sally
Token ID25737
Feature activation-2.26e-02

Pos logit contributions
uddled+0.047
bled+0.044
uddle+0.043
agons+0.043
 olds+0.043
Neg logit contributions
 Is-0.047
 care-0.047
 Ashley-0.046
 bit-0.046
ifying-0.046
Token.
Token ID13
Feature activation+2.76e-02

Pos logit contributions
ifying+0.039
 awoke+0.034
 Is+0.034
 total+0.033
Oh+0.033
Neg logit contributions
bled-0.037
uddled-0.035
 olds-0.035
iggle-0.035
ails-0.034
Token
Token ID198
Feature activation+3.46e-02

Pos logit contributions
bled+0.070
uddled+0.067
agons+0.067
castle+0.066
bles+0.065
Neg logit contributions
 Is-0.022
 Ashley-0.020
 Debbie-0.020
 Will-0.019
ifying-0.018
Token
Token ID198
Feature activation+4.05e-02

Pos logit contributions
bled+0.093
castle+0.090
 olds+0.089
uddled+0.088
agons+0.087
Neg logit contributions
 Ashley-0.017
Lou-0.017
Am-0.017
 Debbie-0.016
 Is-0.016
TokenM
Token ID44
Feature activation+8.80e-02

Pos logit contributions
bled+0.104
 olds+0.099
iggle+0.097
uddled+0.097
agons+0.096
Neg logit contributions
R-0.026
Am-0.025
Lou-0.025
Oh-0.024
where-0.023
Tokenum
Token ID388
Feature activation+2.10e-02

Pos logit contributions
bled+0.160
bes+0.155
ts+0.153
 structures+0.150
 olds+0.148
Neg logit contributions
 single-0.098
by-0.090
of-0.088
Not-0.088
 care-0.086
Token smiled
Token ID13541
Feature activation+5.66e-03

Pos logit contributions
bled+0.041
uddled+0.040
 olds+0.040
agons+0.040
ails+0.039
Neg logit contributions
 bit-0.026
 awoke-0.025
ifying-0.025
 Is-0.024
 Ashley-0.024
Token and
Token ID290
Feature activation+2.46e-02

Pos logit contributions
bled+0.010
uddled+0.010
 olds+0.010
ails+0.010
iggle+0.010
Neg logit contributions
ifying-0.008
 Is-0.008
 bit-0.008
 awoke-0.007
 Ashley-0.007
Token said
Token ID531
Feature activation+1.62e-02

Pos logit contributions
agons+0.049
bled+0.048
uddled+0.047
ails+0.047
ini+0.047
Neg logit contributions
 bit-0.031
 Is-0.030
 Ashley-0.030
 awoke-0.029
 Debbie-0.029
Token,
Token ID11
Feature activation+1.73e-02

Pos logit contributions
bled+0.032
uddled+0.031
agons+0.031
ini+0.031
 olds+0.030
Neg logit contributions
ifying-0.019
 Is-0.019
 bit-0.019
 Ashley-0.018
 awoke-0.018
Token shoot
Token ID2686
Feature activation-6.14e-02

Pos logit contributions
ifying+0.004
 Is+0.003
 Ashley+0.003
 bit+0.003
 awoke+0.003
Neg logit contributions
bled-0.005
uddled-0.005
iggle-0.005
 olds-0.005
ails-0.005
Token it
Token ID340
Feature activation-8.67e-03

Pos logit contributions
ifying+0.114
 total+0.103
 distance+0.100
use+0.095
bie+0.094
Neg logit contributions
iggle-0.079
bled-0.078
 olds-0.077
 golf-0.075
ires-0.073
Token.
Token ID13
Feature activation-8.77e-02

Pos logit contributions
ifying+0.012
 Ashley+0.011
 Is+0.011
 awoke+0.011
 Debbie+0.011
Neg logit contributions
bled-0.016
uddled-0.016
 olds-0.015
iggle-0.015
 golf-0.015
Token
Token ID220
Feature activation-4.16e-04

Pos logit contributions
ifying+0.196
 bit+0.168
 tant+0.163
 trace+0.163
 single+0.162
Neg logit contributions
ires-0.090
yard-0.089
azes-0.081
ate-0.080
 olds-0.078
Token
Token ID198
Feature activation-6.48e-02

Pos logit contributions
ifying+0.000
 Is+0.000
 bit+0.000
 Ashley+0.000
 awoke+0.000
Neg logit contributions
bled-0.001
uddled-0.001
 olds-0.001
iggle-0.001
ails-0.001
Token
Token ID198
Feature activation-9.56e-02

Pos logit contributions
ifying+0.142
 bit+0.125
 trace+0.111
 awoke+0.109
 single+0.109
Neg logit contributions
iggle-0.077
ails-0.075
bled-0.073
ires-0.073
uddled-0.072
TokenJackson
Token ID31270
Feature activation+2.41e-02

Pos logit contributions
ifying+0.167
bie+0.151
 single+0.143
inea+0.139
burn+0.137
Neg logit contributions
ate-0.132
ires-0.131
uring-0.117
lf-0.116
bles-0.115
Token got
Token ID1392
Feature activation-2.68e-02

Pos logit contributions
bled+0.053
uddled+0.052
 olds+0.050
ires+0.050
iggle+0.049
Neg logit contributions
 bit-0.026
 awoke-0.025
 Is-0.023
 care-0.023
 heard-0.023
Token her
Token ID607
Feature activation+4.39e-03

Pos logit contributions
ifying+0.045
 awoke+0.041
 Ashley+0.039
 Debbie+0.039
Jonathan+0.039
Neg logit contributions
bled-0.045
 golf-0.042
uddled-0.041
 olds-0.040
add-0.040
Token toy
Token ID13373
Feature activation-8.77e-03

Pos logit contributions
bled+0.008
uddled+0.008
 olds+0.007
iggle+0.007
ails+0.007
Neg logit contributions
ifying-0.007
 Is-0.006
 bit-0.006
 Ashley-0.006
 awoke-0.006
Token gun
Token ID2485
Feature activation+2.08e-03

Pos logit contributions
ifying+0.014
 Is+0.014
 Ashley+0.013
 awoke+0.013
 Debbie+0.013
Neg logit contributions
bled-0.014
uddled-0.014
 olds-0.013
iggle-0.013
uddle-0.013
Token loved
Token ID6151
Feature activation-3.20e-02

Pos logit contributions
ifying+0.045
 Is+0.041
 Ashley+0.040
 distance+0.040
 single+0.039
Neg logit contributions
uddled-0.028
bled-0.027
iggle-0.025
ails-0.025
ans-0.025
Token to
Token ID284
Feature activation+1.81e-02

Pos logit contributions
ifying+0.041
 awoke+0.034
 Is+0.033
Jonathan+0.029
 distance+0.029
Neg logit contributions
 golf-0.067
uddled-0.065
ans-0.065
ails-0.064
bled-0.064
Token dance
Token ID9280
Feature activation+6.96e-04

Pos logit contributions
bled+0.042
uddled+0.041
 olds+0.038
uddle+0.038
ails+0.037
Neg logit contributions
 care-0.018
 bit-0.018
ifying-0.018
 Is-0.017
 Ashley-0.017
Token.
Token ID13
Feature activation-6.19e-02

Pos logit contributions
bled+0.001
uddled+0.001
 olds+0.001
iggle+0.001
ails+0.001
Neg logit contributions
ifying-0.001
 Is-0.001
 Ashley-0.001
 bit-0.001
 awoke-0.001
Token Every
Token ID3887
Feature activation-2.27e-02

Pos logit contributions
ifying+0.111
 awoke+0.096
 distance+0.095
 bit+0.095
 trace+0.094
Neg logit contributions
mm-0.089
ails-0.085
ires-0.081
agons-0.079
yard-0.079
Token day
Token ID1110
Feature activation-9.33e-02

Pos logit contributions
ifying+0.038
 Is+0.033
Am+0.032
 Ashley+0.031
 Debbie+0.031
Neg logit contributions
uddled-0.037
iggle-0.037
bled-0.037
uddle-0.035
ails-0.035
Token,
Token ID11
Feature activation-6.99e-02

Pos logit contributions
ifying+0.236
 trace+0.233
Laura+0.224
Lou+0.222
den+0.222
Neg logit contributions
mm-0.062
bled-0.054
uddled-0.050
ails-0.044
 golf-0.031
Token he
Token ID339
Feature activation-1.22e-02

Pos logit contributions
ifying+0.113
 awoke+0.103
talk+0.099
ff+0.099
 distance+0.099
Neg logit contributions
mm-0.113
bled-0.111
ires-0.100
 golf-0.100
ble-0.098
Token danced
Token ID39480
Feature activation-2.26e-02

Pos logit contributions
ifying+0.014
 Is+0.012
 Ashley+0.012
 Debbie+0.011
 total+0.011
Neg logit contributions
uddled-0.026
bled-0.026
iggle-0.025
irled-0.024
 olds-0.024
Token on
Token ID319
Feature activation-3.11e-02

Pos logit contributions
ifying+0.042
 Is+0.038
 Ashley+0.037
Am+0.036
 Debbie+0.036
Neg logit contributions
bled-0.035
uddled-0.034
iggle-0.033
 golf-0.031
bles-0.031
Token the
Token ID262
Feature activation+2.48e-02

Pos logit contributions
ifying+0.048
 Is+0.041
 awoke+0.040
 single+0.038
 bit+0.038
Neg logit contributions
 golf-0.056
 olds-0.053
iggle-0.053
ans-0.053
uddled-0.052
Token,
Token ID11
Feature activation-8.56e-03

Pos logit contributions
bled+0.003
uddled+0.003
 olds+0.003
iggle+0.003
ails+0.003
Neg logit contributions
ifying-0.002
 Is-0.002
 Ashley-0.002
 bit-0.002
 awoke-0.002
Token you
Token ID345
Feature activation-1.25e-02

Pos logit contributions
ifying+0.013
 awoke+0.012
 bit+0.012
 Is+0.012
 single+0.012
Neg logit contributions
bled-0.015
uddled-0.015
 olds-0.014
iggle-0.014
 golf-0.014
Token can
Token ID460
Feature activation-8.10e-03

Pos logit contributions
ifying+0.013
 Ashley+0.012
 Debbie+0.011
 distance+0.011
 Is+0.011
Neg logit contributions
bled-0.028
uddled-0.027
iggle-0.027
bles-0.026
 olds-0.026
Token give
Token ID1577
Feature activation-2.70e-02

Pos logit contributions
ifying+0.011
 Is+0.010
 awoke+0.010
 bit+0.010
 Ashley+0.010
Neg logit contributions
bled-0.015
uddled-0.015
iggle-0.015
 olds-0.014
uddle-0.014
Token me
Token ID502
Feature activation-4.93e-02

Pos logit contributions
ifying+0.045
 awoke+0.040
 Is+0.040
 distance+0.039
 single+0.038
Neg logit contributions
bled-0.044
 olds-0.042
uddled-0.042
iggle-0.040
ans-0.040
Token a
Token ID257
Feature activation-9.18e-02

Pos logit contributions
 Is+0.077
ifying+0.077
 awoke+0.075
 distance+0.073
Lou+0.072
Neg logit contributions
bled-0.079
uddled-0.075
 golf-0.073
 olds-0.070
ans-0.070
Token massage
Token ID26900
Feature activation-5.52e-02

Pos logit contributions
ifying+0.145
ges+0.137
pter+0.134
Jonathan+0.130
Lou+0.129
Neg logit contributions
uddle-0.148
uddled-0.130
 new-0.126
 golf-0.124
iggle-0.121
Token.
Token ID13
Feature activation-8.63e-03

Pos logit contributions
ifying+0.092
Lou+0.090
Jonathan+0.088
 total+0.086
Am+0.085
Neg logit contributions
uddled-0.082
bled-0.078
uddle-0.077
 olds-0.075
 golf-0.074
Token That
Token ID1320
Feature activation-4.61e-02

Pos logit contributions
ifying+0.012
 awoke+0.011
 bit+0.011
 single+0.011
 Is+0.011
Neg logit contributions
bled-0.016
uddled-0.015
 olds-0.015
ails-0.014
iggle-0.014
Token would
Token ID561
Feature activation-3.07e-04

Pos logit contributions
ifying+0.079
ges+0.063
 total+0.062
Am+0.061
 Is+0.060
Neg logit contributions
uddled-0.077
uddle-0.074
iggle-0.072
bled-0.068
 olds-0.068
Token be
Token ID307
Feature activation+4.13e-04

Pos logit contributions
ifying+0.000
 Is+0.000
 Ashley+0.000
 awoke+0.000
 bit+0.000
Neg logit contributions
bled-0.001
uddled-0.001
 olds-0.001
iggle-0.001
ails-0.001
Token a
Token ID257
Feature activation+2.14e-02

Pos logit contributions
uddled+0.070
bled+0.069
 olds+0.068
uddle+0.067
castle+0.067
Neg logit contributions
ifying-0.023
 Ashley-0.022
 Will-0.021
 single-0.020
 awoke-0.020
Token little
Token ID1310
Feature activation+6.19e-03

Pos logit contributions
bled+0.041
uddled+0.040
 olds+0.037
ails+0.037
iggle+0.037
Neg logit contributions
 bit-0.030
ifying-0.030
 Is-0.029
 single-0.028
 Ashley-0.028
Token girl
Token ID2576
Feature activation-5.14e-03

Pos logit contributions
bled+0.013
uddled+0.013
 olds+0.012
iggle+0.012
ails+0.012
Neg logit contributions
ifying-0.007
 bit-0.007
 Is-0.007
 Ashley-0.006
 awoke-0.006
Token named
Token ID3706
Feature activation+9.58e-03

Pos logit contributions
ifying+0.010
 Is+0.010
 Ashley+0.010
 awoke+0.010
 Debbie+0.009
Neg logit contributions
bled-0.006
uddled-0.006
iggle-0.006
 olds-0.006
ails-0.006
Token Jackson
Token ID6612
Feature activation-4.59e-02

Pos logit contributions
bled+0.017
uddled+0.016
castle+0.015
agons+0.015
 olds+0.015
Neg logit contributions
ifying-0.015
 Is-0.014
 Ashley-0.014
 bit-0.014
 awoke-0.014
Token.
Token ID13
Feature activation-9.13e-02

Pos logit contributions
ifying+0.097
 Is+0.089
Oh+0.089
 distance+0.089
 Ashley+0.087
Neg logit contributions
iggle-0.051
uddled-0.051
ails-0.049
bles-0.049
ans-0.049
Token One
Token ID1881
Feature activation-4.79e-03

Pos logit contributions
ifying+0.197
 bit+0.179
 bumped+0.168
 fright+0.168
 distance+0.166
Neg logit contributions
ires-0.104
mm-0.096
azes-0.087
ate-0.085
iggle-0.084
Token day
Token ID1110
Feature activation-8.38e-02

Pos logit contributions
ifying+0.007
 Is+0.006
 Ashley+0.006
 awoke+0.006
 bit+0.006
Neg logit contributions
bled-0.009
uddled-0.009
iggle-0.009
 olds-0.008
ails-0.008
Token,
Token ID11
Feature activation-8.57e-02

Pos logit contributions
 distance+0.232
ifying+0.228
Oh+0.216
 trace+0.209
talk+0.209
Neg logit contributions
bled-0.043
kins-0.042
uddled-0.041
iggle-0.036
mm-0.030
Token Jackson
Token ID6612
Feature activation+2.55e-03

Pos logit contributions
ifying+0.200
ez+0.170
se+0.168
 distance+0.168
use+0.167
Neg logit contributions
kins-0.082
bled-0.076
mm-0.071
ires-0.070
 burying-0.061
Token went
Token ID1816
Feature activation+3.70e-03

Pos logit contributions
bled+0.005
uddled+0.005
 olds+0.005
iggle+0.005
ails+0.005
Neg logit contributions
ifying-0.003
 Is-0.003
 bit-0.003
 awoke-0.003
 Ashley-0.003
Token she
Token ID673
Feature activation-9.62e-03

Pos logit contributions
ifying+0.040
 bit+0.036
 Is+0.035
 awoke+0.035
 care+0.034
Neg logit contributions
bled-0.042
uddled-0.042
 olds-0.038
iggle-0.038
 golf-0.037
Token saw
Token ID2497
Feature activation+1.35e-02

Pos logit contributions
ifying+0.010
 Ashley+0.009
 Is+0.009
 Debbie+0.008
Jonathan+0.008
Neg logit contributions
uddled-0.021
bled-0.021
iggle-0.020
uddle-0.020
ails-0.020
Token a
Token ID257
Feature activation+6.66e-03

Pos logit contributions
bled+0.030
uddled+0.030
 olds+0.028
iggle+0.028
ails+0.028
Neg logit contributions
ifying-0.014
 Is-0.013
 Ashley-0.013
 awoke-0.013
 Debbie-0.012
Token large
Token ID1588
Feature activation+2.09e-02

Pos logit contributions
bled+0.012
uddled+0.012
 olds+0.011
iggle+0.011
ails+0.011
Neg logit contributions
ifying-0.010
 Is-0.009
 bit-0.009
 Ashley-0.009
 awoke-0.009
Token man
Token ID582
Feature activation+8.03e-03

Pos logit contributions
bled+0.041
uddled+0.040
iggle+0.037
irled+0.036
ails+0.036
Neg logit contributions
ifying-0.029
 bit-0.028
 Is-0.027
 single-0.027
 awoke-0.027
Token.
Token ID13
Feature activation-9.05e-02

Pos logit contributions
bled+0.014
uddled+0.014
 olds+0.013
iggle+0.013
ails+0.013
Neg logit contributions
ifying-0.012
 Is-0.011
 bit-0.011
 awoke-0.011
 Ashley-0.011
Token He
Token ID679
Feature activation+1.24e-03

Pos logit contributions
ifying+0.205
inea+0.184
w+0.183
 trace+0.179
Oh+0.178
Neg logit contributions
 olds-0.088
yard-0.068
 old-0.067
uddled-0.059
lee-0.056
Token yelled
Token ID22760
Feature activation-1.05e-02

Pos logit contributions
bled+0.002
uddled+0.002
 olds+0.002
iggle+0.002
ails+0.002
Neg logit contributions
ifying-0.002
 Is-0.001
 Ashley-0.001
 bit-0.001
 awoke-0.001
Token out
Token ID503
Feature activation-2.75e-02

Pos logit contributions
ifying+0.015
 awoke+0.013
 bit+0.013
 Is+0.013
 Ashley+0.013
Neg logit contributions
bled-0.020
uddled-0.020
 olds-0.019
iggle-0.019
 golf-0.018
Token to
Token ID284
Feature activation-5.24e-02

Pos logit contributions
ifying+0.034
 distance+0.030
 awoke+0.030
 Debbie+0.029
 Ashley+0.029
Neg logit contributions
bled-0.055
uddled-0.055
iggle-0.053
 olds-0.053
 golf-0.053
Token her
Token ID607
Feature activation-7.48e-03

Pos logit contributions
ifying+0.112
inea+0.087
 single+0.086
Oh+0.078
 awoke+0.078
Neg logit contributions
iggle-0.070
 olds-0.065
 golf-0.058
bled-0.058
lee-0.057
Tokenmy
Token ID1820
Feature activation-2.83e-02

Pos logit contributions
ifying+0.047
 Is+0.044
Jonathan+0.042
 Debbie+0.042
Lou+0.042
Neg logit contributions
ails-0.048
uddled-0.048
bled-0.048
 golf-0.047
iggle-0.046
Token and
Token ID290
Feature activation-1.37e-02

Pos logit contributions
ifying+0.050
 Is+0.048
 Debbie+0.046
Lou+0.046
Jonathan+0.046
Neg logit contributions
uddled-0.041
bled-0.040
ails-0.039
 golf-0.039
 olds-0.038
Token daddy
Token ID35822
Feature activation-3.86e-02

Pos logit contributions
ifying+0.023
 single+0.020
 Is+0.020
 awoke+0.020
 total+0.019
Neg logit contributions
bled-0.023
uddled-0.023
iggle-0.021
 olds-0.021
castle-0.021
Token.
Token ID13
Feature activation-5.25e-02

Pos logit contributions
ifying+0.074
Lou+0.071
Jonathan+0.070
Georgia+0.070
 Debbie+0.070
Neg logit contributions
bled-0.049
uddled-0.048
 golf-0.047
ails-0.047
iggle-0.045
Token One
Token ID1881
Feature activation-3.14e-03

Pos logit contributions
ifying+0.100
 bit+0.091
 distance+0.087
 trace+0.087
Oh+0.083
Neg logit contributions
ires-0.066
 olds-0.065
uddled-0.065
iggle-0.064
mm-0.063
Token day
Token ID1110
Feature activation-9.00e-02

Pos logit contributions
ifying+0.004
 Is+0.004
 Ashley+0.004
 bit+0.004
 awoke+0.004
Neg logit contributions
bled-0.006
uddled-0.006
iggle-0.006
 olds-0.006
ails-0.006
Token,
Token ID11
Feature activation-8.18e-02

Pos logit contributions
ifying+0.226
 distance+0.216
 trace+0.213
 total+0.198
Oh+0.198
Neg logit contributions
uddled-0.081
bled-0.072
iggle-0.060
mm-0.058
irled-0.057
Token when
Token ID618
Feature activation-7.91e-02

Pos logit contributions
ifying+0.176
ez+0.164
use+0.159
 trace+0.159
 cavity+0.155
Neg logit contributions
uddled-0.093
bled-0.092
mm-0.077
iggle-0.073
ires-0.072
Token Pen
Token ID7507
Feature activation-1.24e-02

Pos logit contributions
ifying+0.178
 tant+0.163
ez+0.161
ff+0.161
talk+0.160
Neg logit contributions
bled-0.079
uddled-0.076
mm-0.076
irled-0.064
iggle-0.063
Tokenelope
Token ID47329
Feature activation+6.51e-03

Pos logit contributions
ifying+0.018
 Is+0.016
 total+0.015
 Ashley+0.015
 single+0.015
Neg logit contributions
uddled-0.023
bled-0.022
iggle-0.021
ails-0.021
irled-0.021
Token was
Token ID373
Feature activation+1.84e-02

Pos logit contributions
bled+0.013
uddled+0.013
 olds+0.012
iggle+0.012
ails+0.012
Neg logit contributions
ifying-0.008
 Is-0.007
 bit-0.007
 awoke-0.007
 Ashley-0.007
Tokenily
Token ID813
Feature activation+3.71e-03

Pos logit contributions
bled+0.099
 olds+0.094
castle+0.091
yard+0.090
 raft+0.089
Neg logit contributions
of-0.083
 care-0.082
 minute-0.081
 nine-0.081
ges-0.081
Token,
Token ID11
Feature activation-6.56e-04

Pos logit contributions
bled+0.007
uddled+0.007
 olds+0.006
iggle+0.006
ails+0.006
Neg logit contributions
ifying-0.005
 Is-0.005
 Ashley-0.005
 bit-0.005
 awoke-0.005
Token can
Token ID460
Feature activation+1.40e-02

Pos logit contributions
ifying+0.001
 Is+0.001
 Ashley+0.001
 awoke+0.001
 bit+0.001
Neg logit contributions
bled-0.001
uddled-0.001
 olds-0.001
iggle-0.001
ails-0.001
Token you
Token ID345
Feature activation+2.12e-03

Pos logit contributions
bled+0.027
uddled+0.027
 olds+0.026
ails+0.026
iggle+0.026
Neg logit contributions
ifying-0.018
 Is-0.018
 Ashley-0.017
 bit-0.017
 Will-0.017
Token answer
Token ID3280
Feature activation-2.96e-02

Pos logit contributions
bled+0.004
uddled+0.004
 olds+0.004
iggle+0.004
ails+0.004
Neg logit contributions
ifying-0.003
 Is-0.003
 bit-0.002
 Ashley-0.002
 awoke-0.002
Token a
Token ID257
Feature activation-8.95e-02

Pos logit contributions
ifying+0.043
 awoke+0.040
 Is+0.040
 distance+0.039
 single+0.039
Neg logit contributions
bled-0.052
uddled-0.051
 olds-0.049
 golf-0.048
iggle-0.048
Token question
Token ID1808
Feature activation-3.42e-02

Pos logit contributions
ifying+0.176
ges+0.150
ez+0.147
R+0.137
talk+0.137
Neg logit contributions
 new-0.105
uddle-0.105
 old-0.099
 golf-0.096
 idea-0.096
Token for
Token ID329
Feature activation-2.16e-02

Pos logit contributions
ifying+0.059
 distance+0.057
Lou+0.057
 awoke+0.056
Jonathan+0.055
Neg logit contributions
bled-0.050
uddled-0.048
ails-0.047
 olds-0.046
iggle-0.045
Token me
Token ID502
Feature activation-1.14e-02

Pos logit contributions
ifying+0.034
 awoke+0.031
 Is+0.030
 distance+0.029
 single+0.029
Neg logit contributions
bled-0.037
 olds-0.035
uddled-0.034
iggle-0.034
 golf-0.034
Token?"
Token ID1701
Feature activation+1.73e-02

Pos logit contributions
ifying+0.019
 Is+0.017
 Ashley+0.017
 awoke+0.017
 bit+0.017
Neg logit contributions
bled-0.018
uddled-0.018
 olds-0.017
iggle-0.017
ails-0.017
Token Lily
Token ID20037
Feature activation+1.20e-02

Pos logit contributions
bled+0.035
uddled+0.032
castle+0.031
agons+0.031
bles+0.031
Neg logit contributions
 Is-0.023
 bit-0.023
 Ashley-0.023
 Will-0.023
 care-0.023
Token world
Token ID995
Feature activation+1.70e-02

Pos logit contributions
bled+0.004
uddled+0.004
 olds+0.003
iggle+0.003
ails+0.003
Neg logit contributions
ifying-0.004
 Is-0.003
 Ashley-0.003
 bit-0.003
 awoke-0.003
Token.
Token ID13
Feature activation-7.21e-02

Pos logit contributions
bled+0.038
uddled+0.037
iggle+0.034
irled+0.034
uddle+0.034
Neg logit contributions
ifying-0.018
 Is-0.017
 bit-0.017
 single-0.017
 distance-0.017
Token
Token ID198
Feature activation-4.95e-02

Pos logit contributions
ifying+0.150
 bit+0.139
 away+0.127
 trace+0.126
 distance+0.123
Neg logit contributions
ires-0.078
iggle-0.077
mm-0.076
 olds-0.075
uddled-0.072
Token
Token ID198
Feature activation-7.83e-02

Pos logit contributions
ifying+0.094
 bit+0.085
 awoke+0.075
 trace+0.075
 Is+0.075
Neg logit contributions
uddled-0.072
iggle-0.070
ails-0.070
bled-0.066
uddle-0.065
TokenOne
Token ID3198
Feature activation-1.36e-02

Pos logit contributions
ifying+0.152
 bit+0.127
 trace+0.123
inea+0.122
 tant+0.122
Neg logit contributions
uddled-0.101
agons-0.094
irling-0.094
mm-0.092
ires-0.090
Token day
Token ID1110
Feature activation-8.83e-02

Pos logit contributions
ifying+0.022
 Is+0.020
 Ashley+0.019
 awoke+0.019
 Debbie+0.019
Neg logit contributions
uddled-0.022
bled-0.022
iggle-0.022
ails-0.021
 olds-0.021
Token,
Token ID11
Feature activation-8.21e-02

Pos logit contributions
ifying+0.228
 distance+0.222
 trace+0.214
 fright+0.205
Oh+0.203
Neg logit contributions
uddled-0.066
bled-0.052
mm-0.046
iggle-0.045
irled-0.041
Token Ashley
Token ID20899
Feature activation+1.60e-02

Pos logit contributions
ifying+0.125
ez+0.112
talk+0.109
 distance+0.109
ff+0.109
Neg logit contributions
uddled-0.138
bled-0.130
mm-0.124
kins-0.117
ires-0.117
Token was
Token ID373
Feature activation+1.14e-02

Pos logit contributions
bled+0.036
uddled+0.034
 olds+0.034
iggle+0.033
agons+0.033
Neg logit contributions
 awoke-0.017
 bit-0.016
ifying-0.016
 Is-0.016
 Ashley-0.016
Token at
Token ID379
Feature activation-1.78e-02

Pos logit contributions
bled+0.018
uddled+0.018
 olds+0.018
castle+0.017
uddle+0.017
Neg logit contributions
ifying-0.018
 Is-0.018
 bit-0.018
 Ashley-0.017
 single-0.017
Token the
Token ID262
Feature activation-1.07e-02

Pos logit contributions
ifying+0.025
 awoke+0.021
 bit+0.021
 Is+0.020
Jonathan+0.020
Neg logit contributions
bled-0.034
 golf-0.033
uddled-0.033
iggle-0.032
 olds-0.032
Token stop
Token ID2245
Feature activation-1.82e-02

Pos logit contributions
ifying+0.057
 Ashley+0.045
Jonathan+0.045
 awoke+0.045
Am+0.044
Neg logit contributions
iggle-0.059
 golf-0.055
bles-0.055
 olds-0.054
ails-0.054
Token!
Token ID0
Feature activation-5.96e-02

Pos logit contributions
ifying+0.030
 awoke+0.027
 Is+0.027
 Ashley+0.027
 total+0.026
Neg logit contributions
bled-0.030
iggle-0.029
uddled-0.028
 olds-0.028
 golf-0.028
Token
Token ID198
Feature activation+6.38e-03

Pos logit contributions
ifying+0.105
 trace+0.101
 bit+0.099
 distance+0.093
use+0.092
Neg logit contributions
uddled-0.079
 olds-0.079
bled-0.078
ails-0.076
add-0.075
Token
Token ID198
Feature activation-4.05e-02

Pos logit contributions
bled+0.014
uddled+0.014
 olds+0.013
iggle+0.013
castle+0.013
Neg logit contributions
ifying-0.007
 Is-0.006
 Ashley-0.006
 awoke-0.006
 Debbie-0.006
TokenFinally
Token ID11158
Feature activation-4.03e-02

Pos logit contributions
ifying+0.076
 distance+0.069
 trace+0.068
 bit+0.068
 care+0.067
Neg logit contributions
uddled-0.059
bled-0.058
irled-0.052
add-0.052
irling-0.050
Token,
Token ID11
Feature activation-8.80e-02

Pos logit contributions
ifying+0.090
 distance+0.080
Oh+0.077
 single+0.077
 volume+0.077
Neg logit contributions
bled-0.046
uddled-0.043
iggle-0.041
mm-0.040
irled-0.038
Token she
Token ID673
Feature activation+3.91e-03

Pos logit contributions
ifying+0.145
use+0.143
ff+0.137
ez+0.134
 trace+0.134
Neg logit contributions
bled-0.132
uddled-0.116
mm-0.113
iggle-0.107
ble-0.105
Token reached
Token ID4251
Feature activation-1.64e-02

Pos logit contributions
bled+0.009
uddled+0.009
 olds+0.008
iggle+0.008
ails+0.008
Neg logit contributions
ifying-0.004
 Is-0.004
 bit-0.004
 awoke-0.004
 Ashley-0.004
Token the
Token ID262
Feature activation-4.42e-02

Pos logit contributions
ifying+0.021
 Is+0.018
 bit+0.018
 awoke+0.018
Jonathan+0.017
Neg logit contributions
bled-0.033
uddled-0.031
iggle-0.031
 olds-0.031
 golf-0.030
Token top
Token ID1353
Feature activation-1.72e-02

Pos logit contributions
 Ashley+0.080
ifying+0.080
ges+0.079
ez+0.079
 Debbie+0.079
Neg logit contributions
castle-0.053
uddle-0.052
bled-0.052
 golf-0.051
 clubhouse-0.049
Token of
Token ID286
Feature activation-2.11e-02

Pos logit contributions
ifying+0.027
 Is+0.025
 Ashley+0.024
 awoke+0.024
 Debbie+0.024
Neg logit contributions
bled-0.028
uddled-0.028
 olds-0.028
iggle-0.027
castle-0.027
Token she
Token ID673
Feature activation+1.84e-02

Pos logit contributions
ifying+0.129
use+0.121
 distance+0.116
se+0.111
 volume+0.109
Neg logit contributions
ires-0.080
bled-0.079
uddled-0.071
mm-0.070
kins-0.069
Token decided
Token ID3066
Feature activation-7.55e-04

Pos logit contributions
bled+0.039
uddled+0.038
iggle+0.038
 olds+0.038
ails+0.037
Neg logit contributions
 bit-0.021
 awoke-0.019
ifying-0.019
 Is-0.019
 Will-0.019
Token to
Token ID284
Feature activation-2.75e-03

Pos logit contributions
ifying+0.000
 Is+0.000
 Ashley+0.000
 bit+0.000
 awoke+0.000
Neg logit contributions
bled-0.002
uddled-0.002
 olds-0.002
iggle-0.002
ails-0.002
Token shoot
Token ID2686
Feature activation-6.14e-02

Pos logit contributions
ifying+0.004
 Is+0.003
 Ashley+0.003
 bit+0.003
 awoke+0.003
Neg logit contributions
bled-0.005
uddled-0.005
iggle-0.005
 olds-0.005
ails-0.005
Token it
Token ID340
Feature activation-8.67e-03

Pos logit contributions
ifying+0.114
 total+0.103
 distance+0.100
use+0.095
bie+0.094
Neg logit contributions
iggle-0.079
bled-0.078
 olds-0.077
 golf-0.075
ires-0.073
Token.
Token ID13
Feature activation-8.77e-02

Pos logit contributions
ifying+0.012
 Ashley+0.011
 Is+0.011
 awoke+0.011
 Debbie+0.011
Neg logit contributions
bled-0.016
uddled-0.016
 olds-0.015
iggle-0.015
 golf-0.015
Token
Token ID220
Feature activation-4.16e-04

Pos logit contributions
ifying+0.196
 bit+0.168
 tant+0.163
 trace+0.163
 single+0.162
Neg logit contributions
ires-0.090
yard-0.089
azes-0.081
ate-0.080
 olds-0.078
Token
Token ID198
Feature activation-6.48e-02

Pos logit contributions
ifying+0.000
 Is+0.000
 bit+0.000
 Ashley+0.000
 awoke+0.000
Neg logit contributions
bled-0.001
uddled-0.001
 olds-0.001
iggle-0.001
ails-0.001
Token
Token ID198
Feature activation-9.56e-02

Pos logit contributions
ifying+0.142
 bit+0.125
 trace+0.111
 awoke+0.109
 single+0.109
Neg logit contributions
iggle-0.077
ails-0.075
bled-0.073
ires-0.073
uddled-0.072
TokenJackson
Token ID31270
Feature activation+2.41e-02

Pos logit contributions
ifying+0.167
bie+0.151
 single+0.143
inea+0.139
burn+0.137
Neg logit contributions
ate-0.132
ires-0.131
uring-0.117
lf-0.116
bles-0.115
Token got
Token ID1392
Feature activation-2.68e-02

Pos logit contributions
bled+0.053
uddled+0.052
 olds+0.050
ires+0.050
iggle+0.049
Neg logit contributions
 bit-0.026
 awoke-0.025
 Is-0.023
 care-0.023
 heard-0.023
Token around
Token ID1088
Feature activation-2.56e-02

Pos logit contributions
ifying+0.019
 Is+0.018
 Ashley+0.017
 awoke+0.017
 Debbie+0.017
Neg logit contributions
bled-0.020
uddled-0.020
iggle-0.019
 olds-0.019
 golf-0.019
Token the
Token ID262
Feature activation-1.09e-02

Pos logit contributions
ifying+0.042
 awoke+0.038
 Is+0.037
 Debbie+0.037
 bit+0.036
Neg logit contributions
iggle-0.042
bled-0.041
 golf-0.040
 olds-0.040
uddled-0.038
Token park
Token ID3952
Feature activation+1.33e-02

Pos logit contributions
ifying+0.019
 Is+0.018
 Ashley+0.017
 bit+0.017
 awoke+0.017
Neg logit contributions
bled-0.016
uddled-0.016
 olds-0.015
iggle-0.015
uddle-0.015
Token.
Token ID13
Feature activation-7.07e-02

Pos logit contributions
bled+0.028
uddled+0.027
 olds+0.025
uddle+0.025
iggle+0.025
Neg logit contributions
ifying-0.016
 bit-0.015
 Is-0.015
 Ashley-0.014
 single-0.014
Token One
Token ID1881
Feature activation-7.66e-03

Pos logit contributions
ifying+0.147
 bit+0.138
 away+0.122
 trace+0.121
 total+0.119
Neg logit contributions
mm-0.087
ires-0.087
iggle-0.079
ails-0.077
uddled-0.074
Token day
Token ID1110
Feature activation-8.76e-02

Pos logit contributions
ifying+0.012
 Is+0.010
 Ashley+0.010
 awoke+0.010
 Debbie+0.010
Neg logit contributions
bled-0.014
uddled-0.014
iggle-0.013
 olds-0.013
ails-0.013
Token,
Token ID11
Feature activation-8.48e-02

Pos logit contributions
ifying+0.230
 trace+0.215
 distance+0.210
 fright+0.208
Lou+0.206
Neg logit contributions
uddled-0.064
mm-0.054
bled-0.054
iggle-0.042
irled-0.041
Token as
Token ID355
Feature activation-8.17e-02

Pos logit contributions
ifying+0.158
ez+0.150
use+0.135
 trace+0.131
inea+0.130
Neg logit contributions
bled-0.120
uddled-0.117
mm-0.116
ires-0.114
iring-0.104
Token she
Token ID673
Feature activation+6.90e-03

Pos logit contributions
ifying+0.149
ez+0.139
 trace+0.136
 tant+0.135
se+0.134
Neg logit contributions
mm-0.113
bled-0.108
ires-0.107
iring-0.105
uddled-0.102
Token was
Token ID373
Feature activation+1.76e-02

Pos logit contributions
bled+0.013
uddled+0.013
 olds+0.012
ails+0.012
iggle+0.012
Neg logit contributions
ifying-0.009
 Is-0.009
 bit-0.009
 awoke-0.009
 Ashley-0.009
Token dancing
Token ID15360
Feature activation-1.46e-02

Pos logit contributions
bled+0.027
uddled+0.026
 olds+0.025
ini+0.025
castle+0.025
Neg logit contributions
ifying-0.029
 Ashley-0.029
 Is-0.029
 bit-0.028
 awoke-0.028
Token if
Token ID611
Feature activation+1.31e-02

Pos logit contributions
ifying+0.061
Oh+0.056
 distance+0.055
 awoke+0.053
Lou+0.053
Neg logit contributions
ails-0.062
bled-0.062
bles-0.062
 olds-0.061
uddled-0.061
Token he
Token ID339
Feature activation-1.05e-02

Pos logit contributions
uddled+0.028
bled+0.028
 olds+0.026
iggle+0.026
agons+0.026
Neg logit contributions
ifying-0.014
 Is-0.014
 Ashley-0.013
 awoke-0.013
 Debbie-0.013
Token could
Token ID714
Feature activation-7.17e-03

Pos logit contributions
ifying+0.013
 Ashley+0.011
 distance+0.011
 Is+0.011
 Debbie+0.011
Neg logit contributions
bled-0.021
uddled-0.021
 olds-0.020
iggle-0.020
ails-0.020
Token give
Token ID1577
Feature activation+3.99e-03

Pos logit contributions
ifying+0.009
 Is+0.008
 Ashley+0.008
 awoke+0.008
 bit+0.008
Neg logit contributions
bled-0.014
uddled-0.014
iggle-0.013
 olds-0.013
ails-0.013
Token her
Token ID607
Feature activation-5.43e-02

Pos logit contributions
bled+0.008
uddled+0.008
 olds+0.008
iggle+0.008
ails+0.008
Neg logit contributions
ifying-0.005
 Is-0.004
 Ashley-0.004
 bit-0.004
 awoke-0.004
Token a
Token ID257
Feature activation-8.71e-02

Pos logit contributions
ifying+0.090
Lou+0.089
 Is+0.086
Jonathan+0.085
 distance+0.084
Neg logit contributions
uddled-0.081
bled-0.081
uddle-0.073
iggle-0.073
 golf-0.070
Token massage
Token ID26900
Feature activation-4.92e-02

Pos logit contributions
ifying+0.136
ges+0.129
Jonathan+0.129
Lou+0.128
Am+0.127
Neg logit contributions
uddle-0.142
uddled-0.126
 golf-0.122
 new-0.121
iggle-0.120
Token.
Token ID13
Feature activation+1.09e-02

Pos logit contributions
ifying+0.084
Lou+0.083
Jonathan+0.081
 total+0.080
Am+0.080
Neg logit contributions
uddled-0.072
bled-0.071
uddle-0.067
iggle-0.065
 golf-0.065
Token Jane
Token ID12091
Feature activation-2.31e-02

Pos logit contributions
bled+0.020
uddled+0.019
iggle+0.018
agons+0.017
 olds+0.017
Neg logit contributions
ifying-0.016
 Is-0.016
 Ashley-0.015
 awoke-0.015
 Debbie-0.015
Token said
Token ID531
Feature activation+5.50e-03

Pos logit contributions
ifying+0.043
 distance+0.036
 single+0.036
 Is+0.036
 total+0.035
Neg logit contributions
uddled-0.037
bled-0.036
irled-0.033
iggle-0.032
 olds-0.032
Token "
Token ID366
Feature activation-4.63e-02

Pos logit contributions
bled+0.009
uddled+0.009
 olds+0.008
iggle+0.008
ails+0.008
Neg logit contributions
ifying-0.009
 Is-0.009
 Ashley-0.009
 bit-0.008
 awoke-0.008
Token named
Token ID3706
Feature activation+9.64e-03

Pos logit contributions
ifying+0.027
 Is+0.025
 Ashley+0.025
 awoke+0.025
 Debbie+0.025
Neg logit contributions
uddled-0.014
bled-0.013
ails-0.013
iggle-0.013
 olds-0.013
Token Pen
Token ID7507
Feature activation-2.66e-02

Pos logit contributions
bled+0.017
uddled+0.017
castle+0.015
 olds+0.015
agons+0.015
Neg logit contributions
ifying-0.015
 Ashley-0.014
 Is-0.014
 bit-0.014
 care-0.014
Tokenelope
Token ID47329
Feature activation-1.28e-02

Pos logit contributions
ifying+0.043
 Is+0.038
 total+0.037
 bit+0.036
 single+0.035
Neg logit contributions
uddled-0.044
bled-0.043
iggle-0.042
ails-0.041
ini-0.040
Token.
Token ID13
Feature activation-7.76e-02

Pos logit contributions
ifying+0.020
 Is+0.019
 Ashley+0.018
 awoke+0.018
 Debbie+0.018
Neg logit contributions
uddled-0.021
bled-0.021
ails-0.020
iggle-0.020
 olds-0.020
Token One
Token ID1881
Feature activation-3.57e-03

Pos logit contributions
ifying+0.169
 trace+0.151
 bit+0.143
 tant+0.143
 distance+0.142
Neg logit contributions
ires-0.077
mm-0.076
 olds-0.076
iggle-0.072
uddled-0.072
Token day
Token ID1110
Feature activation-8.70e-02

Pos logit contributions
ifying+0.005
 Is+0.004
 Ashley+0.004
 awoke+0.004
 bit+0.004
Neg logit contributions
bled-0.007
uddled-0.007
iggle-0.006
 olds-0.006
ails-0.006
Token,
Token ID11
Feature activation-7.53e-02

Pos logit contributions
ifying+0.219
 trace+0.208
 distance+0.205
talk+0.193
 tant+0.193
Neg logit contributions
uddled-0.072
bled-0.065
mm-0.062
irled-0.052
iggle-0.050
Token while
Token ID981
Feature activation-4.17e-02

Pos logit contributions
ifying+0.172
ez+0.154
 trace+0.149
talk+0.146
use+0.144
Neg logit contributions
uddled-0.074
bled-0.071
mm-0.069
ires-0.059
iggle-0.058
Token picking
Token ID10868
Feature activation+6.85e-03

Pos logit contributions
ifying+0.082
 tant+0.073
 distance+0.072
 bit+0.072
se+0.071
Neg logit contributions
bled-0.057
uddled-0.053
mm-0.049
 golf-0.047
irled-0.047
Token fresh
Token ID4713
Feature activation+1.42e-02

Pos logit contributions
bled+0.012
uddled+0.012
 olds+0.011
iggle+0.011
ails+0.011
Neg logit contributions
ifying-0.011
 Is-0.010
 Ashley-0.010
 bit-0.010
 awoke-0.010
Token flowers
Token ID12734
Feature activation+2.24e-02

Pos logit contributions
bled+0.024
uddled+0.024
iggle+0.022
castle+0.022
 olds+0.022
Neg logit contributions
ifying-0.023
 bit-0.021
 care-0.021
 Is-0.021
 awoke-0.020
Token mom
Token ID1995
Feature activation-1.02e-02

Pos logit contributions
ifying+0.010
 Is+0.009
 awoke+0.009
 distance+0.009
 bit+0.009
Neg logit contributions
bled-0.008
uddled-0.008
 olds-0.008
iggle-0.008
uddle-0.008
Token if
Token ID611
Feature activation+1.02e-02

Pos logit contributions
ifying+0.014
 bit+0.012
 Is+0.012
 Ashley+0.012
 awoke+0.012
Neg logit contributions
bled-0.020
uddled-0.019
ails-0.018
 olds-0.018
iggle-0.018
Token she
Token ID673
Feature activation-1.17e-02

Pos logit contributions
bled+0.021
uddled+0.021
 olds+0.020
iggle+0.020
ails+0.020
Neg logit contributions
ifying-0.011
 Is-0.011
 Ashley-0.011
 bit-0.011
 awoke-0.010
Token could
Token ID714
Feature activation-2.69e-02

Pos logit contributions
ifying+0.014
 Ashley+0.012
 distance+0.012
 Is+0.011
 awoke+0.011
Neg logit contributions
uddled-0.025
bled-0.025
 olds-0.023
ails-0.023
iggle-0.023
Token get
Token ID651
Feature activation-4.61e-02

Pos logit contributions
ifying+0.041
Lou+0.039
Jonathan+0.037
 Ashley+0.037
Oh+0.036
Neg logit contributions
bled-0.045
uddled-0.045
iggle-0.044
ails-0.043
castle-0.042
Token a
Token ID257
Feature activation-8.69e-02

Pos logit contributions
ifying+0.079
 awoke+0.077
Lou+0.076
 distance+0.075
Jonathan+0.072
Neg logit contributions
bled-0.076
 golf-0.066
uddled-0.066
bles-0.064
ires-0.062
Token new
Token ID649
Feature activation-5.89e-02

Pos logit contributions
ifying+0.147
Lou+0.130
ges+0.127
Jonathan+0.125
 awoke+0.124
Neg logit contributions
uddle-0.143
uddled-0.126
 new-0.124
 golf-0.123
castle-0.112
Token haircut
Token ID45548
Feature activation+5.24e-04

Pos logit contributions
ifying+0.099
Lou+0.078
 awoke+0.075
Jonathan+0.074
Jackson+0.070
Neg logit contributions
uddle-0.100
uddled-0.100
castle-0.100
 golf-0.099
iggle-0.097
Token.
Token ID13
Feature activation-7.71e-04

Pos logit contributions
bled+0.001
uddled+0.001
 olds+0.001
iggle+0.001
ails+0.001
Neg logit contributions
ifying-0.001
 Is-0.001
 Ashley-0.001
 bit-0.001
 awoke-0.001
Token Her
Token ID2332
Feature activation+1.18e-02

Pos logit contributions
ifying+0.001
 Is+0.001
 Ashley+0.001
 bit+0.001
 awoke+0.001
Neg logit contributions
bled-0.001
uddled-0.001
 olds-0.001
iggle-0.001
ails-0.001
Token mom
Token ID1995
Feature activation+1.18e-02

Pos logit contributions
bled+0.023
uddled+0.022
 olds+0.021
ails+0.021
iggle+0.021
Neg logit contributions
ifying-0.015
 bit-0.015
 Is-0.015
 Ashley-0.014
 care-0.014
Token.
Token ID13
Feature activation-6.53e-02

Pos logit contributions
bled+0.019
uddled+0.019
iggle+0.018
 olds+0.018
uddle+0.018
Neg logit contributions
ifying-0.007
 Is-0.006
 Ashley-0.006
 bit-0.006
 care-0.006
Token
Token ID198
Feature activation-3.25e-02

Pos logit contributions
ifying+0.145
 trace+0.130
 bit+0.128
 total+0.122
 distance+0.121
Neg logit contributions
uddled-0.071
 olds-0.066
ails-0.064
ires-0.064
iggle-0.063
Token
Token ID198
Feature activation-7.11e-02

Pos logit contributions
ifying+0.055
 bit+0.049
 total+0.045
 Is+0.044
 awoke+0.044
Neg logit contributions
uddled-0.054
iggle-0.053
bled-0.053
ails-0.052
 olds-0.049
TokenThe
Token ID464
Feature activation+3.47e-02

Pos logit contributions
ifying+0.128
inea+0.110
bie+0.108
 bit+0.106
 trace+0.106
Neg logit contributions
uddled-0.103
irling-0.098
ggies-0.091
ires-0.091
ves-0.091
Token next
Token ID1306
Feature activation-1.83e-02

Pos logit contributions
ails+0.063
uddled+0.063
iggle+0.062
bled+0.062
 olds+0.060
Neg logit contributions
 distance-0.050
 night-0.050
 bit-0.049
 care-0.047
 single-0.047
Token day
Token ID1110
Feature activation-8.66e-02

Pos logit contributions
ifying+0.033
 Is+0.029
 Ashley+0.028
 awoke+0.028
 Debbie+0.028
Neg logit contributions
 golf-0.027
ails-0.027
bled-0.027
uddled-0.027
castle-0.027
Token,
Token ID11
Feature activation-7.86e-02

Pos logit contributions
ifying+0.234
 trace+0.210
 tant+0.210
use+0.208
 distance+0.207
Neg logit contributions
uddled-0.054
iggle-0.036
bled-0.034
mm-0.033
uddle-0.029
Token Ashley
Token ID20899
Feature activation+1.06e-02

Pos logit contributions
ifying+0.108
use+0.093
talk+0.093
 distance+0.090
ff+0.090
Neg logit contributions
uddled-0.142
bled-0.130
mm-0.128
iggle-0.128
ires-0.125
Token brought
Token ID3181
Feature activation-5.81e-03

Pos logit contributions
bled+0.024
uddled+0.023
 olds+0.022
iggle+0.022
ails+0.022
Neg logit contributions
ifying-0.011
 awoke-0.011
 bit-0.011
 Is-0.011
 Ashley-0.010
Token the
Token ID262
Feature activation+2.05e-02

Pos logit contributions
ifying+0.007
 Is+0.006
 bit+0.006
 awoke+0.006
 Ashley+0.006
Neg logit contributions
bled-0.012
uddled-0.012
iggle-0.012
 olds-0.011
ails-0.011
Token flag
Token ID6056
Feature activation-9.02e-03

Pos logit contributions
uddled+0.031
bled+0.031
 olds+0.029
iggle+0.029
ails+0.029
Neg logit contributions
ifying-0.035
 bit-0.034
 distance-0.034
 single-0.033
 total-0.033
Tokenmy
Token ID1820
Feature activation-2.28e-02

Pos logit contributions
bled+0.003
uddled+0.003
 olds+0.003
iggle+0.003
ails+0.003
Neg logit contributions
ifying-0.001
 Is-0.001
 Ashley-0.001
 bit-0.001
 awoke-0.001
Token,
Token ID11
Feature activation-2.09e-02

Pos logit contributions
ifying+0.044
Am+0.040
 Is+0.040
Pen+0.039
 distance+0.039
Neg logit contributions
uddled-0.031
bled-0.030
iggle-0.030
 golf-0.029
ails-0.029
Token we
Token ID356
Feature activation-3.76e-02

Pos logit contributions
ifying+0.032
 awoke+0.029
 distance+0.029
 Ashley+0.029
 single+0.029
Neg logit contributions
bled-0.036
uddled-0.035
 golf-0.034
iggle-0.033
castle-0.033
Token don
Token ID836
Feature activation+1.63e-02

Pos logit contributions
 Debbie+0.046
 Ashley+0.046
Jonathan+0.044
Lou+0.043
Georgia+0.043
Neg logit contributions
bled-0.074
iggle-0.073
uddled-0.070
add-0.070
 golf-0.070
Token't
Token ID470
Feature activation-3.69e-02

Pos logit contributions
bled+0.039
uddled+0.039
 olds+0.038
ails+0.037
iggle+0.037
Neg logit contributions
ifying-0.014
 Is-0.012
 Ashley-0.011
 awoke-0.011
 bit-0.011
Token have
Token ID423
Feature activation-8.66e-02

Pos logit contributions
ifying+0.050
Jonathan+0.047
 Ashley+0.046
 Debbie+0.046
Lou+0.045
Neg logit contributions
iggle-0.070
 golf-0.067
ails-0.067
ini-0.064
agons-0.063
Token any
Token ID597
Feature activation-3.50e-02

Pos logit contributions
 distance+0.152
ifying+0.151
 Is+0.151
Lou+0.149
Jonathan+0.146
Neg logit contributions
 golf-0.122
bled-0.106
add-0.102
 sandwiches-0.101
uddled-0.099
Token popcorn
Token ID38914
Feature activation-1.64e-02

Pos logit contributions
ifying+0.057
 Is+0.055
Lou+0.055
 awoke+0.053
Am+0.051
Neg logit contributions
uddled-0.054
uddle-0.054
 olds-0.053
 golf-0.053
bled-0.053
Token left
Token ID1364
Feature activation-2.16e-02

Pos logit contributions
ifying+0.022
 Is+0.021
Lou+0.021
 Ashley+0.021
 awoke+0.020
Neg logit contributions
bled-0.031
uddled-0.031
castle-0.029
 olds-0.029
ails-0.029
Token.
Token ID13
Feature activation+1.19e-02

Pos logit contributions
ifying+0.031
 Is+0.031
Lou+0.030
 awoke+0.029
 Ashley+0.029
Neg logit contributions
bled-0.038
uddled-0.036
 olds-0.036
ails-0.035
iggle-0.035
Token We
Token ID775
Feature activation-4.50e-02

Pos logit contributions
bled+0.024
uddled+0.024
iggle+0.023
 olds+0.022
castle+0.022
Neg logit contributions
ifying-0.016
 Is-0.015
 Ashley-0.014
 bit-0.014
 Will-0.013
Token way
Token ID835
Feature activation-9.87e-03

Pos logit contributions
ifying+0.013
 Is+0.012
 Ashley+0.012
 awoke+0.012
 Debbie+0.012
Neg logit contributions
bled-0.015
uddled-0.015
 olds-0.014
iggle-0.014
ails-0.014
Token home
Token ID1363
Feature activation+1.53e-02

Pos logit contributions
ifying+0.012
 Is+0.012
 Ashley+0.011
 awoke+0.011
 Debbie+0.011
Neg logit contributions
bled-0.020
uddled-0.019
 olds-0.019
iggle-0.019
ails-0.018
Token.
Token ID13
Feature activation-2.22e-02

Pos logit contributions
bled+0.034
uddled+0.034
iggle+0.032
 olds+0.032
irled+0.031
Neg logit contributions
ifying-0.016
 bit-0.014
 Is-0.014
 Ashley-0.014
 awoke-0.014
Token
Token ID198
Feature activation-1.01e-02

Pos logit contributions
ifying+0.030
 bit+0.028
 single+0.028
 distance+0.028
 awoke+0.028
Neg logit contributions
bled-0.039
uddled-0.039
 olds-0.038
ails-0.037
 golf-0.036
Token
Token ID198
Feature activation-5.88e-02

Pos logit contributions
ifying+0.015
 bit+0.014
 Is+0.013
 awoke+0.013
 total+0.013
Neg logit contributions
bled-0.018
uddled-0.018
iggle-0.017
 olds-0.017
ails-0.017
TokenSo
Token ID2396
Feature activation-8.66e-02

Pos logit contributions
ifying+0.105
 trace+0.095
 single+0.091
 distance+0.090
 care+0.090
Neg logit contributions
uddled-0.087
bled-0.084
mm-0.080
add-0.079
irling-0.076
Token Pen
Token ID7507
Feature activation-4.20e-02

Pos logit contributions
ifying+0.153
 trace+0.133
 cavity+0.133
 distance+0.131
ez+0.131
Neg logit contributions
mm-0.129
bled-0.122
iggle-0.114
uddled-0.112
 olds-0.111
Tokenelope
Token ID47329
Feature activation+1.28e-02

Pos logit contributions
ifying+0.081
 total+0.068
 Is+0.067
 single+0.064
 distance+0.064
Neg logit contributions
uddled-0.059
bled-0.058
irled-0.054
iggle-0.054
mm-0.052
Token and
Token ID290
Feature activation-2.67e-02

Pos logit contributions
bled+0.025
uddled+0.024
 olds+0.023
iggle+0.023
ails+0.022
Neg logit contributions
ifying-0.017
 bit-0.016
 awoke-0.016
 Is-0.016
 Ashley-0.016
Token her
Token ID607
Feature activation+1.18e-02

Pos logit contributions
ifying+0.045
 single+0.038
 total+0.037
 bit+0.035
Lou+0.035
Neg logit contributions
bled-0.046
iggle-0.045
uddled-0.045
irled-0.042
mm-0.041
Token mom
Token ID1995
Feature activation+7.77e-03

Pos logit contributions
bled+0.023
uddled+0.022
ails+0.021
 olds+0.021
iggle+0.021
Neg logit contributions
ifying-0.016
 bit-0.015
 Is-0.015
 Ashley-0.015
 care-0.014
Token a
Token ID257
Feature activation+1.27e-02

Pos logit contributions
uddled+0.051
bled+0.051
 olds+0.049
uddle+0.049
ails+0.048
Neg logit contributions
ifying-0.021
 Ashley-0.019
 Will-0.018
 awoke-0.018
 Is-0.018
Token little
Token ID1310
Feature activation-4.03e-03

Pos logit contributions
bled+0.023
uddled+0.022
 olds+0.021
ails+0.021
iggle+0.021
Neg logit contributions
ifying-0.019
 bit-0.018
 Is-0.018
 Ashley-0.018
 single-0.017
Token car
Token ID1097
Feature activation-5.50e-03

Pos logit contributions
ifying+0.005
 Is+0.005
 awoke+0.005
 Ashley+0.005
 bit+0.005
Neg logit contributions
bled-0.008
uddled-0.008
 olds-0.007
iggle-0.007
ails-0.007
Token named
Token ID3706
Feature activation-4.59e-03

Pos logit contributions
ifying+0.009
 Is+0.009
 Ashley+0.009
 awoke+0.009
 Debbie+0.009
Neg logit contributions
bled-0.008
uddled-0.008
 olds-0.008
iggle-0.008
ails-0.008
Token Gray
Token ID12723
Feature activation-2.94e-02

Pos logit contributions
ifying+0.008
 Is+0.008
 bit+0.008
 awoke+0.008
 Ashley+0.008
Neg logit contributions
bled-0.007
uddled-0.007
 olds-0.006
iggle-0.006
 golf-0.006
Token.
Token ID13
Feature activation-8.64e-02

Pos logit contributions
ifying+0.053
 Is+0.048
 distance+0.047
Oh+0.047
 single+0.047
Neg logit contributions
uddled-0.043
bled-0.042
ails-0.041
bles-0.040
 olds-0.040
Token Gray
Token ID12723
Feature activation+4.57e-03

Pos logit contributions
ifying+0.122
 bit+0.106
 distance+0.105
bie+0.104
den+0.099
Neg logit contributions
agons-0.147
mm-0.146
 golf-0.135
 olds-0.134
ires-0.134
Token was
Token ID373
Feature activation-1.04e-04

Pos logit contributions
bled+0.008
uddled+0.008
iggle+0.008
 olds+0.008
ails+0.007
Neg logit contributions
ifying-0.007
 Is-0.006
 bit-0.006
 Ashley-0.006
 awoke-0.006
Token very
Token ID845
Feature activation+2.06e-02

Pos logit contributions
ifying+0.000
 Is+0.000
 Ashley+0.000
 bit+0.000
 awoke+0.000
Neg logit contributions
bled-0.000
uddled-0.000
 olds-0.000
iggle-0.000
ails-0.000
Token fast
Token ID3049
Feature activation-1.32e-02

Pos logit contributions
uddled+0.037
bled+0.036
 olds+0.033
ails+0.033
castle+0.033
Neg logit contributions
ifying-0.030
 bit-0.030
 single-0.030
 Is-0.030
 care-0.029
Token and
Token ID290
Feature activation-9.93e-04

Pos logit contributions
ifying+0.023
 Is+0.021
 awoke+0.021
 Ashley+0.021
 Debbie+0.021
Neg logit contributions
uddled-0.020
bled-0.020
 olds-0.019
iggle-0.019
 golf-0.018
TokenTom
Token ID13787
Feature activation-3.64e-03

Pos logit contributions
bled+0.003
uddled+0.003
 olds+0.002
iggle+0.002
ails+0.002
Neg logit contributions
ifying-0.002
 Is-0.002
 Ashley-0.002
 bit-0.002
 awoke-0.002
Token nods
Token ID34833
Feature activation-7.00e-03

Pos logit contributions
ifying+0.005
 Is+0.005
 Ashley+0.005
 bit+0.005
 awoke+0.005
Neg logit contributions
bled-0.007
uddled-0.007
 olds-0.006
iggle-0.006
ails-0.006
Token.
Token ID13
Feature activation+1.48e-02

Pos logit contributions
ifying+0.011
 Is+0.010
 Ashley+0.010
 bit+0.009
 awoke+0.009
Neg logit contributions
bled-0.013
uddled-0.012
iggle-0.011
 olds-0.011
ails-0.011
Token He
Token ID679
Feature activation+1.79e-02

Pos logit contributions
bled+0.033
uddled+0.032
iggle+0.031
ini+0.030
ans+0.030
Neg logit contributions
 Is-0.016
ifying-0.016
 Ashley-0.015
 bit-0.015
 Will-0.014
Token takes
Token ID2753
Feature activation-3.27e-02

Pos logit contributions
bled+0.038
uddled+0.037
 olds+0.035
ini+0.034
iggle+0.034
Neg logit contributions
ifying-0.020
 bit-0.020
 Is-0.020
 Ashley-0.019
 awoke-0.019
Token a
Token ID257
Feature activation-8.63e-02

Pos logit contributions
ifying+0.055
 Is+0.049
 distance+0.049
 awoke+0.048
Jonathan+0.047
Neg logit contributions
bled-0.054
add-0.048
 golf-0.048
iggle-0.047
ails-0.047
Token corn
Token ID11676
Feature activation-4.87e-02

Pos logit contributions
ifying+0.163
 Is+0.151
Oh+0.144
ez+0.140
 fail+0.139
Neg logit contributions
uddle-0.110
iggle-0.105
 raft-0.098
 new-0.094
bles-0.094
Token from
Token ID422
Feature activation-1.81e-04

Pos logit contributions
ifying+0.088
 awoke+0.086
Jonathan+0.086
 Is+0.085
Lou+0.085
Neg logit contributions
bled-0.069
castle-0.066
bles-0.065
ails-0.064
ble-0.063
Token the
Token ID262
Feature activation-5.24e-02

Pos logit contributions
ifying+0.000
 Is+0.000
 Ashley+0.000
 bit+0.000
 awoke+0.000
Neg logit contributions
bled-0.000
uddled-0.000
 olds-0.000
iggle-0.000
ails-0.000
Token plant
Token ID4618
Feature activation-1.60e-02

Pos logit contributions
ifying+0.100
ez+0.099
 awoke+0.099
 Is+0.099
 Will+0.098
Neg logit contributions
 golf-0.058
yard-0.058
ans-0.055
agons-0.054
 raft-0.054
Token.
Token ID13
Feature activation+1.62e-02

Pos logit contributions
ifying+0.025
 Is+0.023
 Ashley+0.023
Jonathan+0.022
 awoke+0.022
Neg logit contributions
bled-0.028
uddled-0.027
ails-0.026
iggle-0.026
 olds-0.025
Token
Token ID220
Feature activation+6.18e-03

Pos logit contributions
 bit+0.127
ifying+0.126
 distance+0.122
 trace+0.118
 away+0.118
Neg logit contributions
ires-0.080
azes-0.077
ails-0.076
 olds-0.075
mm-0.074
Token
Token ID198
Feature activation-2.34e-02

Pos logit contributions
bled+0.014
uddled+0.013
 olds+0.013
iggle+0.013
castle+0.013
Neg logit contributions
ifying-0.007
 Is-0.006
 Ashley-0.006
 awoke-0.006
 Debbie-0.006
Token
Token ID198
Feature activation-5.25e-02

Pos logit contributions
ifying+0.038
 bit+0.035
 awoke+0.031
 total+0.031
 distance+0.031
Neg logit contributions
bled-0.042
uddled-0.040
iggle-0.039
irled-0.038
ails-0.037
TokenThe
Token ID464
Feature activation-9.18e-04

Pos logit contributions
ifying+0.094
 bit+0.088
use+0.082
 distance+0.081
 awoke+0.078
Neg logit contributions
bled-0.083
uddled-0.079
irled-0.079
irling-0.077
iggle-0.074
Token end
Token ID886
Feature activation-2.93e-02

Pos logit contributions
ifying+0.002
 Is+0.001
 Ashley+0.001
 awoke+0.001
 bit+0.001
Neg logit contributions
bled-0.002
uddled-0.001
 olds-0.001
iggle-0.001
ails-0.001
Token.
Token ID13
Feature activation-8.62e-02

Pos logit contributions
ifying+0.053
 awoke+0.048
 Debbie+0.047
 distance+0.047
 bit+0.047
Neg logit contributions
 olds-0.041
uddled-0.039
iggle-0.039
ails-0.038
uddle-0.037
Token<|endoftext|>
Token ID50256
Feature activation-1.14e-02

Pos logit contributions
 bit+0.189
use+0.180
 trace+0.179
ifying+0.177
 away+0.176
Neg logit contributions
mm-0.078
ires-0.074
azes-0.065
iggle-0.063
ails-0.062
TokenOne
Token ID3198
Feature activation-1.28e-02

Pos logit contributions
ifying+0.013
 bit+0.011
 Is+0.011
 Ashley+0.011
 care+0.010
Neg logit contributions
bled-0.024
uddled-0.024
ails-0.023
 olds-0.023
iggle-0.023
Token day
Token ID1110
Feature activation-1.51e-02

Pos logit contributions
ifying+0.019
 Is+0.017
 Ashley+0.017
Am+0.016
 bit+0.016
Neg logit contributions
bled-0.023
uddled-0.022
iggle-0.022
 olds-0.021
 golf-0.021
Token,
Token ID11
Feature activation+9.03e-03

Pos logit contributions
ifying+0.026
 Is+0.023
 distance+0.023
 bit+0.023
 awoke+0.023
Neg logit contributions
bled-0.024
uddled-0.023
iggle-0.022
 olds-0.021
ails-0.021
Token a
Token ID257
Feature activation+1.98e-02

Pos logit contributions
bled+0.018
uddled+0.018
 olds+0.017
iggle+0.017
ails+0.017
Neg logit contributions
ifying-0.012
 Is-0.011
 bit-0.011
 Ashley-0.010
 awoke-0.010
Token.
Token ID13
Feature activation+3.99e-02

Pos logit contributions
bled+0.065
iggle+0.062
irled+0.062
uddled+0.062
agons+0.059
Neg logit contributions
 bit-0.035
ifying-0.035
 Is-0.034
 distance-0.033
 single-0.032
Token Jack
Token ID3619
Feature activation+6.63e-03

Pos logit contributions
agons+0.091
bled+0.091
bles+0.089
uddled+0.089
ans+0.087
Neg logit contributions
ifying-0.044
 Is-0.042
 Will-0.041
 bit-0.038
 awoke-0.036
Token said
Token ID531
Feature activation-1.14e-02

Pos logit contributions
bled+0.014
uddled+0.014
iggle+0.013
 olds+0.013
ails+0.013
Neg logit contributions
ifying-0.008
 bit-0.007
 Is-0.007
 Ashley-0.007
 awoke-0.007
Token,
Token ID11
Feature activation+5.69e-03

Pos logit contributions
ifying+0.019
 Ashley+0.017
 distance+0.017
 Is+0.017
 bit+0.017
Neg logit contributions
bled-0.019
uddled-0.018
 olds-0.017
iggle-0.017
 golf-0.017
Token "
Token ID366
Feature activation-1.82e-02

Pos logit contributions
bled+0.010
uddled+0.010
iggle+0.009
 olds+0.009
ails+0.009
Neg logit contributions
ifying-0.009
 Is-0.008
 Ashley-0.008
 bit-0.008
 awoke-0.008
TokenL
Token ID43
Feature activation+6.32e-02

Pos logit contributions
ifying+0.023
 Ashley+0.021
 bit+0.020
 awoke+0.020
 Debbie+0.020
Neg logit contributions
bled-0.036
uddled-0.035
ini-0.034
 golf-0.034
 olds-0.033
Tokenily
Token ID813
Feature activation+1.84e-03

Pos logit contributions
 olds+0.102
bled+0.101
yard+0.098
uddled+0.096
uddle+0.094
Neg logit contributions
 minute-0.086
 care-0.086
 awoke-0.084
ges-0.084
of-0.083
Token,
Token ID11
Feature activation+1.34e-03

Pos logit contributions
bled+0.003
uddled+0.003
 olds+0.003
iggle+0.003
ails+0.003
Neg logit contributions
ifying-0.003
 Is-0.003
 Ashley-0.002
 bit-0.002
 awoke-0.002
Token let
Token ID1309
Feature activation+1.31e-02

Pos logit contributions
bled+0.003
uddled+0.003
 olds+0.003
iggle+0.003
ails+0.003
Neg logit contributions
ifying-0.002
 Is-0.002
 Ashley-0.001
 bit-0.001
 awoke-0.001
Token's
Token ID338
Feature activation+6.32e-03

Pos logit contributions
bled+0.031
uddled+0.031
 olds+0.030
ails+0.029
castle+0.029
Neg logit contributions
ifying-0.011
 Ashley-0.011
 awoke-0.011
 Is-0.011
 bit-0.010
Token buy
Token ID2822
Feature activation+4.90e-03

Pos logit contributions
bled+0.013
uddled+0.013
 olds+0.012
iggle+0.012
ails+0.012
Neg logit contributions
ifying-0.008
 Is-0.007
 bit-0.007
 Ashley-0.007
 awoke-0.007
Token upon
Token ID2402
Feature activation-2.76e-03

Pos logit contributions
ifying+0.018
 Is+0.016
 bit+0.016
 awoke+0.015
 Ashley+0.015
Neg logit contributions
bled-0.037
uddled-0.037
iggle-0.036
ails-0.035
 olds-0.035
Token a
Token ID257
Feature activation+6.30e-03

Pos logit contributions
ifying+0.004
 Is+0.003
 Ashley+0.003
 bit+0.003
 awoke+0.003
Neg logit contributions
bled-0.005
uddled-0.005
iggle-0.005
 olds-0.005
ails-0.005
Token time
Token ID640
Feature activation+1.72e-02

Pos logit contributions
bled+0.015
uddled+0.014
 olds+0.014
ails+0.013
iggle+0.013
Neg logit contributions
ifying-0.006
 bit-0.006
 Is-0.005
 Ashley-0.005
 awoke-0.005
Token there
Token ID612
Feature activation-3.36e-03

Pos logit contributions
bled+0.035
uddled+0.035
 olds+0.033
ans+0.032
agons+0.032
Neg logit contributions
ifying-0.020
 Is-0.020
 Ashley-0.019
 bit-0.019
 awoke-0.019
Token lived
Token ID5615
Feature activation+1.61e-02

Pos logit contributions
ifying+0.005
 Is+0.005
 Ashley+0.005
 bit+0.005
 awoke+0.005
Neg logit contributions
bled-0.006
uddled-0.006
iggle-0.005
 olds-0.005
ails-0.005
Token an
Token ID281
Feature activation+7.16e-02

Pos logit contributions
uddled+0.035
 olds+0.035
bled+0.035
castle+0.033
ails+0.033
Neg logit contributions
ifying-0.017
 Ashley-0.015
Lou-0.015
Am-0.015
 Is-0.015
Token olive
Token ID19450
Feature activation+1.68e-02

Pos logit contributions
bled+0.135
uddled+0.134
uddle+0.116
ails+0.114
bles+0.111
Neg logit contributions
 night-0.113
Am-0.108
 bit-0.108
 single-0.107
 distance-0.104
Token who
Token ID508
Feature activation-7.53e-03

Pos logit contributions
bled+0.035
uddled+0.033
 olds+0.032
iggle+0.031
irled+0.031
Neg logit contributions
ifying-0.021
 bit-0.020
 awoke-0.019
 Is-0.019
 single-0.019
Token wanted
Token ID2227
Feature activation-1.58e-02

Pos logit contributions
ifying+0.013
 Is+0.012
 Ashley+0.012
 awoke+0.012
 Debbie+0.012
Neg logit contributions
bled-0.012
uddled-0.011
 olds-0.011
iggle-0.011
ails-0.010
Token to
Token ID284
Feature activation-7.90e-03

Pos logit contributions
ifying+0.016
 Is+0.013
 Ashley+0.012
 awoke+0.012
Jonathan+0.012
Neg logit contributions
 golf-0.036
bled-0.036
uddled-0.036
 olds-0.035
ails-0.035
Token do
Token ID466
Feature activation+1.89e-02

Pos logit contributions
ifying+0.011
 Is+0.010
 Ashley+0.010
 awoke+0.010
 bit+0.010
Neg logit contributions
bled-0.014
uddled-0.014
iggle-0.014
 olds-0.013
 golf-0.013
Token miserable
Token ID22444
Feature activation+4.22e-03

Pos logit contributions
ifying+0.002
 Is+0.002
 Ashley+0.002
 awoke+0.002
 bit+0.002
Neg logit contributions
bled-0.003
uddled-0.003
 olds-0.003
iggle-0.003
ails-0.002
Token!
Token ID0
Feature activation+4.47e-03

Pos logit contributions
bled+0.008
uddled+0.008
 olds+0.008
iggle+0.008
ails+0.008
Neg logit contributions
ifying-0.006
 Is-0.005
 bit-0.005
 Ashley-0.005
 awoke-0.005
Token
Token ID220
Feature activation+3.61e-02

Pos logit contributions
bled+0.010
uddled+0.009
 olds+0.009
iggle+0.009
castle+0.009
Neg logit contributions
ifying-0.005
 Is-0.005
 Ashley-0.005
 bit-0.004
 awoke-0.004
Token
Token ID198
Feature activation+4.65e-03

Pos logit contributions
bled+0.091
castle+0.089
uddled+0.089
 olds+0.087
agons+0.086
Neg logit contributions
ifying-0.024
 Is-0.024
 Ashley-0.024
 Debbie-0.020
 Will-0.020
Token
Token ID198
Feature activation-4.06e-03

Pos logit contributions
bled+0.010
uddled+0.010
 olds+0.010
iggle+0.010
castle+0.009
Neg logit contributions
ifying-0.005
 Is-0.005
 Ashley-0.004
 awoke-0.004
 bit-0.004
TokenM
Token ID44
Feature activation+5.92e-02

Pos logit contributions
ifying+0.005
 Is+0.005
 awoke+0.005
 bit+0.005
 Ashley+0.005
Neg logit contributions
bled-0.008
uddled-0.008
 olds-0.007
iggle-0.007
ails-0.007
Tokenama
Token ID1689
Feature activation+9.50e-03

Pos logit contributions
bled+0.096
bles+0.086
 golf+0.086
ans+0.086
 castles+0.083
Neg logit contributions
 single-0.093
Am-0.092
Lou-0.091
ifying-0.091
 month-0.085
Token saw
Token ID2497
Feature activation+6.25e-03

Pos logit contributions
bled+0.019
uddled+0.018
 olds+0.018
iggle+0.017
ails+0.017
Neg logit contributions
ifying-0.012
 bit-0.011
 Is-0.011
 Ashley-0.011
 awoke-0.011
Token what
Token ID644
Feature activation-1.96e-02

Pos logit contributions
bled+0.013
uddled+0.012
 olds+0.012
iggle+0.012
castle+0.012
Neg logit contributions
ifying-0.008
 Is-0.007
 Ashley-0.007
 bit-0.007
 awoke-0.007
Token had
Token ID550
Feature activation+7.72e-04

Pos logit contributions
ifying+0.033
 Is+0.029
Am+0.029
 awoke+0.029
 distance+0.029
Neg logit contributions
bled-0.032
uddled-0.032
iggle-0.030
 olds-0.029
 golf-0.028
Token happened
Token ID3022
Feature activation+2.17e-02

Pos logit contributions
bled+0.001
uddled+0.001
 olds+0.001
iggle+0.001
ails+0.001
Neg logit contributions
ifying-0.001
 Is-0.001
 Ashley-0.001
 bit-0.001
 awoke-0.001
Token You
Token ID921
Feature activation-5.81e-03

Pos logit contributions
bled+0.015
uddled+0.014
iggle+0.014
 olds+0.013
castle+0.013
Neg logit contributions
ifying-0.010
 Is-0.009
 Ashley-0.009
 bit-0.009
 awoke-0.009
Token can
Token ID460
Feature activation+2.02e-03

Pos logit contributions
ifying+0.006
 Is+0.005
 Ashley+0.005
 Debbie+0.005
 awoke+0.004
Neg logit contributions
bled-0.013
uddled-0.013
iggle-0.013
 olds-0.013
uddle-0.013
Token go
Token ID467
Feature activation+2.17e-02

Pos logit contributions
bled+0.004
uddled+0.004
 olds+0.004
iggle+0.004
ails+0.004
Neg logit contributions
ifying-0.003
 Is-0.002
 Ashley-0.002
 bit-0.002
 awoke-0.002
Token outside
Token ID2354
Feature activation+7.17e-04

Pos logit contributions
bled+0.049
uddled+0.048
 olds+0.045
iggle+0.045
ires+0.044
Neg logit contributions
ifying-0.023
 bit-0.023
 care-0.023
 distance-0.022
 away-0.022
Token and
Token ID290
Feature activation+1.03e-02

Pos logit contributions
bled+0.001
uddled+0.001
 olds+0.001
iggle+0.001
ails+0.001
Neg logit contributions
ifying-0.001
 Is-0.001
 Ashley-0.001
 bit-0.001
 awoke-0.001
Token play
Token ID711
Feature activation+5.33e-02

Pos logit contributions
bled+0.022
uddled+0.021
 olds+0.020
ails+0.020
iggle+0.020
Neg logit contributions
ifying-0.013
 Is-0.012
 bit-0.012
 Ashley-0.011
 care-0.011
Token with
Token ID351
Feature activation+2.19e-02

Pos logit contributions
bled+0.134
uddled+0.132
ini+0.128
 olds+0.126
lee+0.125
Neg logit contributions
 away-0.050
 bit-0.045
 care-0.042
 after-0.037
 will-0.036
Token the
Token ID262
Feature activation+4.44e-02

Pos logit contributions
bled+0.052
uddled+0.052
 olds+0.049
castle+0.048
iggle+0.048
Neg logit contributions
ifying-0.020
 bit-0.019
 care-0.018
 Is-0.017
 Ashley-0.016
Token rain
Token ID6290
Feature activation+3.48e-02

Pos logit contributions
bled+0.086
uddled+0.079
iggle+0.076
irled+0.075
mm+0.074
Neg logit contributions
 distance-0.063
 bit-0.062
ifying-0.060
 will-0.059
 care-0.059
Token.
Token ID13
Feature activation+2.13e-02

Pos logit contributions
bled+0.079
uddled+0.074
ires+0.070
ini+0.070
add+0.069
Neg logit contributions
 bit-0.039
ifying-0.036
 care-0.035
 away-0.034
 distance-0.033
Token But
Token ID887
Feature activation+3.83e-02

Pos logit contributions
bled+0.046
uddled+0.045
iggle+0.042
castle+0.042
bles+0.042
Neg logit contributions
ifying-0.027
 Is-0.026
 Ashley-0.024
 Will-0.023
 bit-0.022
Token.
Token ID13
Feature activation-1.18e-02

Pos logit contributions
ifying+0.026
 Is+0.025
 Ashley+0.024
 awoke+0.024
Lou+0.024
Neg logit contributions
bled-0.026
uddled-0.026
 olds-0.024
iggle-0.024
ails-0.024
Token They
Token ID1119
Feature activation-3.34e-03

Pos logit contributions
ifying+0.017
 awoke+0.015
 bit+0.015
 Is+0.015
 distance+0.015
Neg logit contributions
bled-0.021
uddled-0.021
 olds-0.021
iggle-0.020
ails-0.020
Token liked
Token ID8288
Feature activation-7.74e-04

Pos logit contributions
ifying+0.005
 Is+0.005
 Ashley+0.005
 awoke+0.005
 bit+0.005
Neg logit contributions
bled-0.006
uddled-0.006
iggle-0.005
 olds-0.005
ails-0.005
Token to
Token ID284
Feature activation+2.72e-02

Pos logit contributions
ifying+0.001
 Is+0.000
 Ashley+0.000
 bit+0.000
 awoke+0.000
Neg logit contributions
bled-0.002
uddled-0.002
 olds-0.002
iggle-0.002
ails-0.002
Token play
Token ID711
Feature activation+5.98e-02

Pos logit contributions
bled+0.063
uddled+0.063
 olds+0.058
ails+0.056
ires+0.056
Neg logit contributions
 care-0.029
ifying-0.028
 stroke-0.027
 bit-0.027
 Is-0.026
Token dress
Token ID6576
Feature activation+5.90e-02

Pos logit contributions
 olds+0.139
bled+0.138
ini+0.138
uddled+0.134
lee+0.132
Neg logit contributions
 away-0.058
 bit-0.050
 to-0.050
 heard-0.048
where-0.047
Token-
Token ID12
Feature activation+8.58e-03

Pos logit contributions
bled+0.126
add+0.110
 dam+0.109
aded+0.108
ini+0.107
Neg logit contributions
 away-0.066
 to-0.060
of-0.059
 care-0.057
 Ashley-0.057
Tokenup
Token ID929
Feature activation+4.42e-02

Pos logit contributions
bled+0.012
uddled+0.012
 olds+0.011
iggle+0.011
castle+0.011
Neg logit contributions
ifying-0.016
 Is-0.015
 bit-0.015
 care-0.015
 Ashley-0.015
Token with
Token ID351
Feature activation+1.19e-02

Pos logit contributions
bled+0.102
uddled+0.096
add+0.094
ini+0.094
irled+0.090
Neg logit contributions
 bit-0.043
 care-0.042
 away-0.042
 Is-0.039
ifying-0.039
Token their
Token ID511
Feature activation+1.26e-02

Pos logit contributions
bled+0.021
uddled+0.020
 olds+0.019
iggle+0.019
ails+0.019
Neg logit contributions
ifying-0.019
 bit-0.017
 Is-0.017
 care-0.017
 Ashley-0.017
Token dolls
Token ID36062
Feature activation+8.75e-03

Pos logit contributions
bled+0.019
uddled+0.018
iggle+0.017
ails+0.017
 olds+0.017
Neg logit contributions
ifying-0.023
 bit-0.021
 Is-0.021
 care-0.021
 Ashley-0.021
Token threw
Token ID9617
Feature activation-2.72e-02

Pos logit contributions
bled+0.032
uddled+0.031
 olds+0.031
iggle+0.031
agons+0.030
Neg logit contributions
 bit-0.018
 Is-0.017
ifying-0.017
 Ashley-0.016
 awoke-0.016
Token the
Token ID262
Feature activation+8.41e-03

Pos logit contributions
ifying+0.041
 Debbie+0.038
 awoke+0.037
Am+0.037
 Ashley+0.036
Neg logit contributions
 golf-0.052
bled-0.049
 olds-0.046
iggle-0.045
bles-0.044
Token stick
Token ID4859
Feature activation-1.14e-02

Pos logit contributions
uddled+0.015
bled+0.015
 olds+0.014
iggle+0.014
ails+0.014
Neg logit contributions
ifying-0.012
 bit-0.012
 Is-0.011
 care-0.011
 single-0.011
Token far
Token ID1290
Feature activation-3.35e-02

Pos logit contributions
ifying+0.016
 Ashley+0.015
 Is+0.015
 awoke+0.014
 Debbie+0.014
Neg logit contributions
bled-0.022
uddled-0.021
iggle-0.020
 golf-0.020
 olds-0.020
Token away
Token ID1497
Feature activation-5.50e-03

Pos logit contributions
ifying+0.049
 Ashley+0.049
Lou+0.048
 Debbie+0.048
Jonathan+0.047
Neg logit contributions
bled-0.058
 olds-0.056
 golf-0.056
uddled-0.055
bles-0.054
Token.
Token ID13
Feature activation+1.14e-02

Pos logit contributions
ifying+0.008
 Is+0.008
 Ashley+0.008
 awoke+0.008
 Debbie+0.007
Neg logit contributions
bled-0.010
uddled-0.009
 olds-0.009
iggle-0.009
ails-0.009
Token Max
Token ID5436
Feature activation-1.99e-02

Pos logit contributions
bled+0.024
uddled+0.023
iggle+0.022
ini+0.022
castle+0.022
Neg logit contributions
ifying-0.014
 Is-0.013
 Ashley-0.013
 Will-0.012
 bit-0.012
Token saw
Token ID2497
Feature activation+1.78e-02

Pos logit contributions
ifying+0.034
 Debbie+0.028
 single+0.028
 Ashley+0.028
Am+0.028
Neg logit contributions
uddled-0.035
bled-0.034
uddle-0.032
 golf-0.031
irled-0.031
Token the
Token ID262
Feature activation+2.54e-02

Pos logit contributions
uddled+0.041
bled+0.041
ini+0.039
iggle+0.039
ower+0.039
Neg logit contributions
 Is-0.017
ifying-0.016
 bit-0.016
 care-0.016
 Ashley-0.016
Token stick
Token ID4859
Feature activation+2.08e-02

Pos logit contributions
uddled+0.054
bled+0.053
 olds+0.051
irled+0.049
add+0.049
Neg logit contributions
 bit-0.030
 distance-0.028
 single-0.028
 care-0.028
ifying-0.028
Token and
Token ID290
Feature activation-1.00e-02

Pos logit contributions
bled+0.042
uddled+0.041
 olds+0.040
ails+0.039
ini+0.039
Neg logit contributions
ifying-0.026
 bit-0.025
 Is-0.024
 single-0.023
 awoke-0.023
Token me
Token ID502
Feature activation-3.07e-03

Pos logit contributions
bled+0.000
uddled+0.000
 olds+0.000
iggle+0.000
ails+0.000
Neg logit contributions
ifying-0.000
 Is-0.000
 Ashley-0.000
 bit-0.000
 awoke-0.000
Token how
Token ID703
Feature activation-1.47e-02

Pos logit contributions
ifying+0.004
 Is+0.004
 Ashley+0.004
 bit+0.004
 awoke+0.004
Neg logit contributions
bled-0.006
uddled-0.006
 olds-0.005
iggle-0.005
ails-0.005
Token to
Token ID284
Feature activation+1.94e-02

Pos logit contributions
ifying+0.014
 Is+0.012
 awoke+0.012
 Ashley+0.012
 bit+0.012
Neg logit contributions
bled-0.034
uddled-0.034
 olds-0.033
iggle-0.032
ails-0.032
Token be
Token ID307
Feature activation+3.51e-04

Pos logit contributions
bled+0.043
uddled+0.042
 olds+0.039
uddle+0.038
ails+0.038
Neg logit contributions
ifying-0.021
 care-0.021
 bit-0.020
 Is-0.020
 stroke-0.020
Token patient
Token ID5827
Feature activation-5.88e-03

Pos logit contributions
bled+0.001
uddled+0.001
 olds+0.000
iggle+0.000
ails+0.000
Neg logit contributions
ifying-0.001
 Is-0.001
 Ashley-0.001
 bit-0.001
 awoke-0.001
Token,
Token ID11
Feature activation+1.15e-02

Pos logit contributions
ifying+0.009
 Is+0.008
 Ashley+0.008
 awoke+0.008
 bit+0.008
Neg logit contributions
bled-0.010
uddled-0.010
 olds-0.009
iggle-0.009
ails-0.009
Token loyal
Token ID9112
Feature activation-2.56e-02

Pos logit contributions
bled+0.023
uddled+0.023
iggle+0.021
 olds+0.021
ails+0.021
Neg logit contributions
ifying-0.015
 Is-0.014
 bit-0.014
 Ashley-0.014
 awoke-0.013
Token,
Token ID11
Feature activation+1.34e-02

Pos logit contributions
ifying+0.036
 Is+0.031
 awoke+0.031
 Ashley+0.030
 total+0.030
Neg logit contributions
 olds-0.048
uddled-0.047
bled-0.047
ails-0.046
 golf-0.046
Token and
Token ID290
Feature activation+7.05e-03

Pos logit contributions
bled+0.026
uddled+0.026
iggle+0.024
 olds+0.024
ails+0.024
Neg logit contributions
ifying-0.018
 Is-0.017
 bit-0.017
 Ashley-0.016
 awoke-0.016
Token grateful
Token ID14066
Feature activation-7.60e-03

Pos logit contributions
bled+0.014
uddled+0.013
iggle+0.013
 olds+0.013
ails+0.012
Neg logit contributions
ifying-0.010
 Is-0.009
 bit-0.009
 Ashley-0.009
 care-0.008
Token.
Token ID13
Feature activation+4.45e-03

Pos logit contributions
ifying+0.011
 Is+0.010
 Ashley+0.010
 awoke+0.010
 bit+0.010
Neg logit contributions
bled-0.013
uddled-0.013
 olds-0.013
iggle-0.012
ails-0.012
Token then
Token ID788
Feature activation-3.07e-02

Pos logit contributions
ifying+0.045
 awoke+0.040
Lou+0.040
 single+0.040
 distance+0.040
Neg logit contributions
bled-0.029
uddled-0.026
 golf-0.024
iggle-0.024
 olds-0.023
Token,
Token ID11
Feature activation-3.11e-02

Pos logit contributions
ifying+0.060
 distance+0.056
 single+0.053
Am+0.052
 trace+0.051
Neg logit contributions
bled-0.043
uddled-0.040
iggle-0.038
irled-0.038
 olds-0.038
Token Anna
Token ID11735
Feature activation-1.54e-03

Pos logit contributions
ifying+0.052
 single+0.048
 distance+0.048
Lou+0.046
Am+0.046
Neg logit contributions
bled-0.051
uddled-0.047
 golf-0.045
iggle-0.044
 olds-0.042
Token sees
Token ID7224
Feature activation+1.17e-02

Pos logit contributions
ifying+0.002
 Is+0.002
 Ashley+0.002
 bit+0.002
 awoke+0.001
Neg logit contributions
bled-0.003
uddled-0.003
 olds-0.003
iggle-0.003
ails-0.003
Token that
Token ID326
Feature activation-3.14e-02

Pos logit contributions
bled+0.025
uddled+0.025
 olds+0.023
iggle+0.023
castle+0.023
Neg logit contributions
ifying-0.013
 Ashley-0.013
 Is-0.012
 bit-0.012
 awoke-0.012
Token the
Token ID262
Feature activation+1.95e-02

Pos logit contributions
ifying+0.049
 awoke+0.041
 bit+0.041
 total+0.040
 distance+0.040
Neg logit contributions
bled-0.055
uddled-0.053
 golf-0.053
 olds-0.051
iggle-0.051
Token window
Token ID4324
Feature activation+1.23e-02

Pos logit contributions
bled+0.030
uddled+0.029
iggle+0.027
ails+0.026
irled+0.026
Neg logit contributions
ifying-0.035
 bit-0.034
 distance-0.033
 single-0.033
 Is-0.033
Token is
Token ID318
Feature activation-1.58e-02

Pos logit contributions
bled+0.024
uddled+0.024
 olds+0.023
iggle+0.022
ails+0.022
Neg logit contributions
ifying-0.016
 Is-0.015
 bit-0.015
 awoke-0.014
 Ashley-0.014
Token fog
Token ID19558
Feature activation+2.18e-02

Pos logit contributions
ifying+0.025
 Ashley+0.024
 Is+0.024
Jonathan+0.024
 awoke+0.024
Neg logit contributions
bled-0.026
uddled-0.025
 olds-0.023
 golf-0.023
iggle-0.023
Tokengy
Token ID1360
Feature activation-3.02e-03

Pos logit contributions
bled+0.044
uddled+0.043
 olds+0.041
add+0.041
iggle+0.041
Neg logit contributions
ifying-0.029
 bit-0.026
 Is-0.025
 Ashley-0.024
 awoke-0.024
Token.
Token ID13
Feature activation-1.20e-02

Pos logit contributions
ifying+0.004
 Is+0.004
 Ashley+0.004
 awoke+0.004
 bit+0.004
Neg logit contributions
bled-0.006
uddled-0.006
 olds-0.005
iggle-0.005
ails-0.005
Token happy
Token ID3772
Feature activation+2.83e-03

Pos logit contributions
uddled+0.017
bled+0.017
 olds+0.016
ails+0.016
uddle+0.015
Neg logit contributions
ifying-0.017
 Is-0.016
 bit-0.016
 care-0.015
 single-0.015
Token to
Token ID284
Feature activation+4.01e-02

Pos logit contributions
bled+0.006
uddled+0.006
 olds+0.005
iggle+0.005
ails+0.005
Neg logit contributions
ifying-0.004
 Is-0.003
 bit-0.003
 Ashley-0.003
 awoke-0.003
Token be
Token ID307
Feature activation+1.95e-02

Pos logit contributions
uddled+0.111
bled+0.105
uddle+0.102
 olds+0.098
bles+0.097
Neg logit contributions
 care-0.036
 Is-0.032
 single-0.028
 notice-0.028
 awoke-0.028
Token outside
Token ID2354
Feature activation-2.19e-02

Pos logit contributions
uddled+0.037
bled+0.036
 olds+0.035
uddle+0.035
mm+0.035
Neg logit contributions
ifying-0.026
 bit-0.025
 care-0.025
 single-0.025
 distance-0.024
Token with
Token ID351
Feature activation+4.03e-03

Pos logit contributions
ifying+0.034
 Ashley+0.032
Lou+0.032
 Is+0.031
 Debbie+0.031
Neg logit contributions
bled-0.036
uddled-0.035
 golf-0.034
 olds-0.034
iggle-0.034
Token his
Token ID465
Feature activation+2.97e-02

Pos logit contributions
bled+0.009
uddled+0.009
 olds+0.009
iggle+0.009
ails+0.009
Neg logit contributions
ifying-0.004
 Is-0.003
 bit-0.003
 Ashley-0.003
 awoke-0.003
Token basketball
Token ID9669
Feature activation+2.59e-02

Pos logit contributions
bled+0.044
uddled+0.042
ails+0.040
 olds+0.038
iggle+0.038
Neg logit contributions
ifying-0.054
 bit-0.053
 single-0.052
 care-0.051
 total-0.051
Token.
Token ID13
Feature activation+1.66e-02

Pos logit contributions
bled+0.055
uddled+0.053
iggle+0.051
 olds+0.050
irled+0.050
Neg logit contributions
ifying-0.030
 bit-0.029
 awoke-0.027
 Is-0.026
 distance-0.026
Token He
Token ID679
Feature activation+1.71e-02

Pos logit contributions
bled+0.039
uddled+0.039
iggle+0.036
bles+0.036
ans+0.035
Neg logit contributions
ifying-0.016
 Is-0.016
 Will-0.014
 Ashley-0.014
 single-0.014
Token threw
Token ID9617
Feature activation-6.67e-03

Pos logit contributions
bled+0.036
uddled+0.034
 olds+0.033
ails+0.033
iggle+0.033
Neg logit contributions
 bit-0.020
ifying-0.020
 awoke-0.019
 Is-0.019
 Ashley-0.019
Token it
Token ID340
Feature activation-4.29e-03

Pos logit contributions
ifying+0.008
 Is+0.007
 Ashley+0.006
 awoke+0.006
 Debbie+0.006
Neg logit contributions
bled-0.015
uddled-0.014
 golf-0.014
 olds-0.014
iggle-0.014
Token new
Token ID649
Feature activation+1.22e-02

Pos logit contributions
bled+0.034
uddled+0.033
iggle+0.029
irled+0.029
castle+0.028
Neg logit contributions
ifying-0.042
 Is-0.041
 bit-0.041
 care-0.040
 single-0.040
Token.
Token ID13
Feature activation+1.10e-03

Pos logit contributions
bled+0.025
uddled+0.025
iggle+0.023
 olds+0.023
add+0.023
Neg logit contributions
ifying-0.015
 Is-0.014
 bit-0.014
 care-0.013
 single-0.013
Token
Token ID220
Feature activation+3.90e-02

Pos logit contributions
bled+0.002
uddled+0.002
 olds+0.002
iggle+0.002
ails+0.002
Neg logit contributions
ifying-0.001
 Is-0.001
 Ashley-0.001
 bit-0.001
 awoke-0.001
Token
Token ID198
Feature activation+1.27e-02

Pos logit contributions
bled+0.097
uddled+0.096
castle+0.094
 olds+0.092
iggle+0.091
Neg logit contributions
ifying-0.028
 Ashley-0.027
 Is-0.027
 Debbie-0.024
Lou-0.023
Token
Token ID198
Feature activation-1.27e-03

Pos logit contributions
bled+0.029
uddled+0.029
 olds+0.028
castle+0.027
iggle+0.027
Neg logit contributions
ifying-0.012
 Is-0.011
 Ashley-0.011
Am-0.011
Lou-0.011
TokenM
Token ID44
Feature activation+3.58e-02

Pos logit contributions
ifying+0.002
 Is+0.002
 bit+0.002
 Ashley+0.002
 awoke+0.002
Neg logit contributions
bled-0.002
uddled-0.002
 olds-0.002
iggle-0.002
ails-0.002
Tokenilly
Token ID6548
Feature activation+2.72e-02

Pos logit contributions
bled+0.058
 golf+0.052
uddled+0.051
 olds+0.050
ans+0.049
Neg logit contributions
Am-0.059
ifying-0.057
 single-0.057
 awoke-0.055
Lou-0.055
Token was
Token ID373
Feature activation-6.09e-04

Pos logit contributions
bled+0.056
uddled+0.053
agons+0.053
ires+0.052
ails+0.052
Neg logit contributions
 bit-0.032
 awoke-0.031
ifying-0.031
 Is-0.031
 Ashley-0.030
Token forever
Token ID8097
Feature activation-2.61e-03

Pos logit contributions
ifying+0.001
 Is+0.001
 Ashley+0.001
 bit+0.001
 awoke+0.001
Neg logit contributions
bled-0.001
uddled-0.001
 olds-0.001
iggle-0.001
ails-0.001
Token thankful
Token ID25535
Feature activation-6.93e-03

Pos logit contributions
ifying+0.004
 Is+0.004
 Ashley+0.003
 awoke+0.003
 bit+0.003
Neg logit contributions
bled-0.005
uddled-0.005
 olds-0.004
iggle-0.004
ails-0.004
Token to
Token ID284
Feature activation+2.04e-02

Pos logit contributions
ifying+0.011
 Is+0.011
 awoke+0.010
 Ashley+0.010
 bit+0.010
Neg logit contributions
bled-0.011
uddled-0.011
 olds-0.011
iggle-0.010
ails-0.010
Token alone
Token ID3436
Feature activation+1.22e-02

Pos logit contributions
bled+0.000
uddled+0.000
 olds+0.000
iggle+0.000
ails+0.000
Neg logit contributions
ifying-0.000
 Is-0.000
 Ashley-0.000
 bit-0.000
 awoke-0.000
Token and
Token ID290
Feature activation-7.40e-03

Pos logit contributions
bled+0.024
uddled+0.023
 olds+0.022
iggle+0.022
ails+0.022
Neg logit contributions
ifying-0.016
 bit-0.015
 Is-0.015
 single-0.014
 awoke-0.014
Token scared
Token ID12008
Feature activation+4.87e-03

Pos logit contributions
ifying+0.013
 Ashley+0.012
 Is+0.012
 awoke+0.012
 Debbie+0.012
Neg logit contributions
bled-0.011
uddled-0.011
 olds-0.010
iggle-0.010
 golf-0.010
Token.
Token ID13
Feature activation-3.56e-03

Pos logit contributions
bled+0.010
uddled+0.010
 olds+0.009
iggle+0.009
ails+0.009
Neg logit contributions
ifying-0.006
 Is-0.006
 bit-0.006
 Ashley-0.006
 awoke-0.006
Token It
Token ID632
Feature activation-2.87e-02

Pos logit contributions
ifying+0.006
 Is+0.005
 awoke+0.005
 bit+0.005
 Ashley+0.005
Neg logit contributions
bled-0.006
uddled-0.006
 olds-0.006
ails-0.005
iggle-0.005
Token is
Token ID318
Feature activation-1.53e-03

Pos logit contributions
ifying+0.047
 Debbie+0.044
 Ashley+0.043
 Is+0.042
Am+0.042
Neg logit contributions
uddled-0.046
bled-0.044
uddle-0.043
iggle-0.043
 olds-0.043
Token getting
Token ID1972
Feature activation+8.72e-04

Pos logit contributions
ifying+0.002
 Is+0.002
 Ashley+0.002
 awoke+0.002
 bit+0.002
Neg logit contributions
bled-0.003
uddled-0.003
 olds-0.002
iggle-0.002
ails-0.002
Token dark
Token ID3223
Feature activation+2.22e-02

Pos logit contributions
bled+0.002
uddled+0.002
 olds+0.002
iggle+0.002
ails+0.002
Neg logit contributions
ifying-0.001
 Is-0.001
 Ashley-0.001
 bit-0.001
 awoke-0.001
Token and
Token ID290
Feature activation+4.31e-03

Pos logit contributions
bled+0.045
uddled+0.044
iggle+0.041
ails+0.041
uddle+0.041
Neg logit contributions
ifying-0.028
 bit-0.027
 awoke-0.026
 distance-0.025
 care-0.025
Token cold
Token ID4692
Feature activation+2.06e-02

Pos logit contributions
bled+0.008
uddled+0.008
 olds+0.007
iggle+0.007
ails+0.007
Neg logit contributions
ifying-0.006
 Is-0.006
 bit-0.006
 Ashley-0.006
 awoke-0.006
Token.
Token ID13
Feature activation+4.55e-03

Pos logit contributions
bled+0.044
uddled+0.042
irled+0.040
iggle+0.040
ini+0.039
Neg logit contributions
ifying-0.024
 bit-0.023
 distance-0.022
 care-0.022
 awoke-0.022
Token
Token ID198
Feature activation-3.77e-02

Pos logit contributions
ifying+0.034
 bit+0.030
 awoke+0.028
 single+0.027
 total+0.027
Neg logit contributions
bled-0.038
uddled-0.038
iggle-0.036
ails-0.036
 olds-0.036
TokenShe
Token ID3347
Feature activation+1.69e-02

Pos logit contributions
ifying+0.068
 bit+0.058
 single+0.056
 Will+0.055
 care+0.053
Neg logit contributions
uddled-0.059
bled-0.056
 olds-0.054
ower-0.053
irled-0.053
Token was
Token ID373
Feature activation+4.63e-03

Pos logit contributions
bled+0.036
uddled+0.035
 olds+0.034
ails+0.034
agons+0.034
Neg logit contributions
 bit-0.019
ifying-0.019
 Is-0.018
 awoke-0.018
 Ashley-0.017
Token in
Token ID287
Feature activation-4.76e-02

Pos logit contributions
bled+0.008
uddled+0.008
 olds+0.008
iggle+0.008
ails+0.008
Neg logit contributions
ifying-0.007
 Is-0.006
 Ashley-0.006
 bit-0.006
 awoke-0.006
Token shock
Token ID6380
Feature activation-3.52e-03

Pos logit contributions
ifying+0.081
 awoke+0.071
'll+0.070
ah+0.070
use+0.070
Neg logit contributions
 golf-0.070
 olds-0.065
iggle-0.061
 clubhouse-0.060
ires-0.059
Token.
Token ID13
Feature activation-2.07e-02

Pos logit contributions
ifying+0.005
 Is+0.005
 Ashley+0.005
 awoke+0.005
 bit+0.005
Neg logit contributions
bled-0.006
uddled-0.006
 olds-0.006
iggle-0.006
ails-0.006
Token There
Token ID1318
Feature activation+9.16e-03

Pos logit contributions
ifying+0.030
 bit+0.027
 awoke+0.025
 single+0.025
 care+0.025
Neg logit contributions
 olds-0.037
bled-0.037
uddled-0.037
ails-0.035
iggle-0.035
Token was
Token ID373
Feature activation-2.21e-02

Pos logit contributions
bled+0.018
uddled+0.018
 olds+0.017
ails+0.017
iggle+0.017
Neg logit contributions
ifying-0.012
 Is-0.011
 Ashley-0.011
 awoke-0.011
 bit-0.011
Token a
Token ID257
Feature activation-2.96e-02

Pos logit contributions
ifying+0.032
 awoke+0.029
 Is+0.029
 care+0.029
 Ashley+0.028
Neg logit contributions
bled-0.039
uddled-0.038
 olds-0.036
 golf-0.036
iggle-0.035
Token giant
Token ID6175
Feature activation-1.46e-02

Pos logit contributions
ifying+0.048
 Is+0.046
 Ashley+0.045
 Debbie+0.045
 awoke+0.045
Neg logit contributions
uddle-0.044
bled-0.044
 golf-0.044
uddled-0.044
castle-0.042
Token,
Token ID11
Feature activation-5.65e-03

Pos logit contributions
ifying+0.022
 Is+0.020
 Ashley+0.020
 Debbie+0.020
 awoke+0.020
Neg logit contributions
bled-0.025
uddled-0.024
 olds-0.024
castle-0.023
iggle-0.023
Token It
Token ID632
Feature activation+4.53e-03

Pos logit contributions
bled+0.007
uddled+0.006
 olds+0.006
iggle+0.006
ails+0.006
Neg logit contributions
ifying-0.005
 Is-0.004
 Ashley-0.004
 bit-0.004
 awoke-0.004
Token is
Token ID318
Feature activation-6.73e-03

Pos logit contributions
bled+0.010
uddled+0.010
 olds+0.010
iggle+0.010
ails+0.010
Neg logit contributions
ifying-0.005
 Is-0.004
 bit-0.004
 Ashley-0.004
 awoke-0.004
Token very
Token ID845
Feature activation+2.97e-02

Pos logit contributions
ifying+0.010
 Is+0.009
 Ashley+0.009
 awoke+0.009
 bit+0.009
Neg logit contributions
bled-0.012
uddled-0.011
 olds-0.011
iggle-0.011
 golf-0.011
Token good
Token ID922
Feature activation+1.27e-02

Pos logit contributions
uddled+0.055
bled+0.054
ans+0.051
ails+0.051
castle+0.049
Neg logit contributions
 bit-0.043
ifying-0.042
 care-0.042
 single-0.042
 total-0.039
Token.
Token ID13
Feature activation-1.89e-03

Pos logit contributions
bled+0.026
uddled+0.026
 olds+0.024
iggle+0.024
ans+0.024
Neg logit contributions
ifying-0.015
 bit-0.014
 care-0.014
 Is-0.014
 Ashley-0.014
Token You
Token ID921
Feature activation-1.45e-02

Pos logit contributions
ifying+0.003
 Is+0.002
 bit+0.002
 awoke+0.002
 Ashley+0.002
Neg logit contributions
bled-0.003
uddled-0.003
 olds-0.003
iggle-0.003
ails-0.003
Token did
Token ID750
Feature activation-9.19e-03

Pos logit contributions
ifying+0.017
 Ashley+0.015
 Is+0.015
 Debbie+0.015
 single+0.015
Neg logit contributions
bled-0.031
uddled-0.030
iggle-0.029
 olds-0.028
bles-0.028
Token a
Token ID257
Feature activation-2.91e-02

Pos logit contributions
ifying+0.012
 awoke+0.011
 Is+0.011
 distance+0.011
 Ashley+0.011
Neg logit contributions
bled-0.018
uddled-0.017
iggle-0.017
 olds-0.017
 golf-0.017
Token good
Token ID922
Feature activation-1.74e-02

Pos logit contributions
ifying+0.060
 Debbie+0.058
 Ashley+0.056
 awoke+0.056
 Is+0.056
Neg logit contributions
 golf-0.034
uddle-0.032
bled-0.032
iggle-0.032
uddled-0.031
Token job
Token ID1693
Feature activation-5.46e-03

Pos logit contributions
ifying+0.036
 Debbie+0.032
 Ashley+0.032
 awoke+0.032
Pen+0.032
Neg logit contributions
uddled-0.022
iggle-0.022
bled-0.022
 golf-0.021
ails-0.021
Token cleaning
Token ID12724
Feature activation+2.10e-02

Pos logit contributions
ifying+0.008
 Is+0.007
 awoke+0.007
 Ashley+0.007
 bit+0.007
Neg logit contributions
bled-0.010
uddled-0.010
 olds-0.009
iggle-0.009
ails-0.009
Token went
Token ID1816
Feature activation+1.69e-02

Pos logit contributions
bled+0.010
uddled+0.010
 olds+0.009
iggle+0.009
ails+0.009
Neg logit contributions
ifying-0.007
 Is-0.006
 bit-0.006
 awoke-0.006
 Ashley-0.006
Token.
Token ID13
Feature activation-6.92e-03

Pos logit contributions
bled+0.037
uddled+0.036
iggle+0.033
 olds+0.033
uddle+0.033
Neg logit contributions
ifying-0.020
 Is-0.018
 bit-0.018
 single-0.017
 awoke-0.017
Token
Token ID198
Feature activation-8.76e-03

Pos logit contributions
ifying+0.009
 bit+0.008
 Is+0.008
 awoke+0.008
 Ashley+0.008
Neg logit contributions
bled-0.014
uddled-0.013
 olds-0.013
ails-0.013
iggle-0.013
Token
Token ID198
Feature activation-2.42e-02

Pos logit contributions
ifying+0.012
 bit+0.010
 Is+0.010
 awoke+0.010
 Ashley+0.010
Neg logit contributions
bled-0.017
uddled-0.017
iggle-0.016
ails-0.016
 olds-0.016
TokenOne
Token ID3198
Feature activation-1.65e-02

Pos logit contributions
ifying+0.030
 bit+0.027
 awoke+0.026
 Ashley+0.025
 Is+0.025
Neg logit contributions
bled-0.048
 golf-0.048
uddled-0.047
castle-0.045
ails-0.045
Token day
Token ID1110
Feature activation-3.39e-02

Pos logit contributions
ifying+0.028
 Is+0.025
 Ashley+0.025
Lou+0.024
 Debbie+0.024
Neg logit contributions
bled-0.026
iggle-0.025
uddled-0.025
 golf-0.024
ails-0.024
Token,
Token ID11
Feature activation-2.73e-02

Pos logit contributions
ifying+0.067
 distance+0.061
 single+0.060
Pen+0.060
Am+0.059
Neg logit contributions
bled-0.044
 golf-0.042
uddled-0.041
ails-0.038
 olds-0.038
Token Tim
Token ID5045
Feature activation-8.23e-03

Pos logit contributions
ifying+0.043
 awoke+0.040
 single+0.038
 Is+0.037
Lou+0.037
Neg logit contributions
bled-0.045
 golf-0.045
uddled-0.042
 olds-0.041
castle-0.041
Tokenmy
Token ID1820
Feature activation-4.04e-03

Pos logit contributions
ifying+0.010
 Is+0.009
 Ashley+0.009
 single+0.009
 bit+0.008
Neg logit contributions
bled-0.017
uddled-0.017
iggle-0.016
 olds-0.016
ails-0.016
Token was
Token ID373
Feature activation-2.82e-03

Pos logit contributions
ifying+0.005
 Is+0.005
 Ashley+0.004
 bit+0.004
 single+0.004
Neg logit contributions
bled-0.008
uddled-0.008
 olds-0.008
iggle-0.008
ails-0.008
Token playing
Token ID2712
Feature activation+1.13e-02

Pos logit contributions
ifying+0.006
 Is+0.005
 Ashley+0.005
 awoke+0.005
 bit+0.005
Neg logit contributions
bled-0.004
uddled-0.004
iggle-0.003
 olds-0.003
ails-0.003
Token theater
Token ID13766
Feature activation-8.22e-03

Pos logit contributions
ifying+0.001
 Is+0.001
 Ashley+0.001
 bit+0.001
 awoke+0.001
Neg logit contributions
bled-0.001
uddled-0.001
 olds-0.001
iggle-0.001
ails-0.001
Token with
Token ID351
Feature activation-5.58e-03

Pos logit contributions
ifying+0.013
 Ashley+0.012
 Is+0.012
 awoke+0.012
Lou+0.012
Neg logit contributions
uddled-0.014
bled-0.014
 olds-0.013
iggle-0.013
ails-0.013
Token her
Token ID607
Feature activation-3.88e-04

Pos logit contributions
ifying+0.007
 Is+0.006
 bit+0.006
 awoke+0.006
 Ashley+0.006
Neg logit contributions
bled-0.012
uddled-0.011
iggle-0.011
 olds-0.011
 golf-0.011
Token mom
Token ID1995
Feature activation-1.40e-02

Pos logit contributions
ifying+0.001
 Is+0.001
 awoke+0.001
 Ashley+0.001
 bit+0.001
Neg logit contributions
bled-0.001
uddled-0.001
 olds-0.001
iggle-0.001
ails-0.001
Token.
Token ID13
Feature activation-1.60e-02

Pos logit contributions
ifying+0.021
 Is+0.019
Lou+0.019
 Ashley+0.019
 awoke+0.019
Neg logit contributions
bled-0.024
uddled-0.024
ails-0.023
 olds-0.023
iggle-0.023
Token When
Token ID1649
Feature activation-2.54e-02

Pos logit contributions
ifying+0.024
Lou+0.021
 bit+0.021
Jonathan+0.021
 awoke+0.021
Neg logit contributions
uddled-0.028
bled-0.027
 olds-0.027
iggle-0.026
ails-0.025
Token they
Token ID484
Feature activation-4.41e-02

Pos logit contributions
ifying+0.048
 bit+0.043
Lou+0.042
 Is+0.042
 stroke+0.042
Neg logit contributions
bled-0.036
uddled-0.035
 olds-0.032
mm-0.031
iggle-0.031
Token arrived
Token ID5284
Feature activation+1.97e-02

Pos logit contributions
ifying+0.066
Lou+0.065
Pen+0.063
Jonathan+0.062
 Ashley+0.062
Neg logit contributions
uddled-0.072
bled-0.070
add-0.069
castle-0.067
iggle-0.067
Token,
Token ID11
Feature activation-1.50e-02

Pos logit contributions
bled+0.041
agons+0.041
ans+0.040
uddled+0.039
 olds+0.039
Neg logit contributions
ifying-0.023
 Is-0.021
 bit-0.020
 Ashley-0.020
 Debbie-0.019
Token Lucy
Token ID22162
Feature activation-5.91e-03

Pos logit contributions
ifying+0.023
 awoke+0.021
 bit+0.021
 single+0.020
 Is+0.020
Neg logit contributions
bled-0.026
uddled-0.026
 olds-0.024
iggle-0.023
 golf-0.023
Token was
Token ID373
Feature activation+6.05e-03

Pos logit contributions
ifying+0.007
 Is+0.006
 Ashley+0.006
 bit+0.006
 single+0.006
Neg logit contributions
uddled-0.012
bled-0.012
 olds-0.012
iggle-0.012
ails-0.012
Token blue
Token ID4171
Feature activation-1.15e-02

Pos logit contributions
bled+0.037
uddled+0.036
ails+0.034
iggle+0.034
 olds+0.034
Neg logit contributions
ifying-0.023
 bit-0.022
 Is-0.022
 Ashley-0.022
 awoke-0.021
Token eyes
Token ID2951
Feature activation-1.01e-02

Pos logit contributions
ifying+0.017
 Is+0.016
 Ashley+0.016
 awoke+0.015
Jonathan+0.015
Neg logit contributions
bled-0.020
uddled-0.020
 olds-0.019
iggle-0.019
ails-0.019
Token.
Token ID13
Feature activation+5.65e-03

Pos logit contributions
ifying+0.015
 Is+0.014
 Ashley+0.014
Lou+0.013
 bit+0.013
Neg logit contributions
bled-0.018
uddled-0.018
iggle-0.017
 olds-0.017
ails-0.016
Token She
Token ID1375
Feature activation+4.07e-03

Pos logit contributions
bled+0.012
uddled+0.011
iggle+0.011
 olds+0.011
ails+0.010
Neg logit contributions
ifying-0.007
 Is-0.007
 Ashley-0.006
 awoke-0.006
 bit-0.006
Token wished
Token ID16555
Feature activation+1.51e-02

Pos logit contributions
bled+0.008
uddled+0.008
 olds+0.007
iggle+0.007
ails+0.007
Neg logit contributions
ifying-0.005
 Is-0.005
 bit-0.005
 Ashley-0.005
 awoke-0.005
Token Anna
Token ID11735
Feature activation-5.40e-02

Pos logit contributions
bled+0.032
uddled+0.031
uddle+0.030
 olds+0.030
iggle+0.030
Neg logit contributions
ifying-0.017
 Is-0.016
 Ashley-0.016
 bit-0.016
 awoke-0.015
Token was
Token ID373
Feature activation+1.48e-03

Pos logit contributions
ifying+0.077
Lou+0.068
Jonathan+0.067
Oh+0.067
 Is+0.067
Neg logit contributions
uddled-0.096
bled-0.092
 olds-0.086
ails-0.084
iggle-0.083
Token nicer
Token ID36597
Feature activation-3.13e-02

Pos logit contributions
bled+0.002
uddled+0.002
 olds+0.002
iggle+0.002
ails+0.002
Neg logit contributions
ifying-0.002
 Is-0.002
 Ashley-0.002
 bit-0.002
 awoke-0.002
Token to
Token ID284
Feature activation-5.22e-03

Pos logit contributions
ifying+0.047
 Is+0.043
 awoke+0.043
 Ashley+0.043
Lou+0.042
Neg logit contributions
uddled-0.054
bled-0.054
castle-0.051
 olds-0.051
bles-0.051
Token her
Token ID607
Feature activation-8.11e-04

Pos logit contributions
ifying+0.006
 Is+0.006
 bit+0.006
 awoke+0.005
 Ashley+0.005
Neg logit contributions
bled-0.011
uddled-0.011
iggle-0.010
 olds-0.010
ails-0.010
Token.
Token ID13
Feature activation-2.55e-02

Pos logit contributions
ifying+0.001
 Is+0.001
 Ashley+0.001
 awoke+0.001
 bit+0.001
Neg logit contributions
bled-0.001
uddled-0.001
 olds-0.001
iggle-0.001
ails-0.001
Token They
Token ID1119
Feature activation-1.11e-02

Pos logit contributions
ifying+0.015
 bit+0.014
 awoke+0.014
 Is+0.014
 Ashley+0.014
Neg logit contributions
bled-0.016
uddled-0.016
 olds-0.015
ails-0.015
iggle-0.015
Token all
Token ID477
Feature activation-2.30e-02

Pos logit contributions
ifying+0.016
 Is+0.015
 Ashley+0.014
 single+0.014
 Debbie+0.014
Neg logit contributions
bled-0.020
uddled-0.020
iggle-0.019
 olds-0.019
castle-0.018
Token gave
Token ID2921
Feature activation-2.15e-03

Pos logit contributions
ifying+0.036
 Is+0.034
 Ashley+0.033
Jonathan+0.032
Lou+0.032
Neg logit contributions
bled-0.038
uddled-0.037
iggle-0.036
irled-0.036
 olds-0.035
Token each
Token ID1123
Feature activation-3.96e-02

Pos logit contributions
ifying+0.003
 Is+0.002
 Ashley+0.002
 awoke+0.002
 bit+0.002
Neg logit contributions
bled-0.005
uddled-0.004
 olds-0.004
iggle-0.004
ails-0.004
Token other
Token ID584
Feature activation-3.27e-02

Pos logit contributions
ifying+0.077
 Is+0.072
 awoke+0.068
 fail+0.066
Lou+0.065
Neg logit contributions
bled-0.049
uddled-0.048
 olds-0.046
iggle-0.044
 golf-0.044
Token a
Token ID257
Feature activation-4.65e-02

Pos logit contributions
ifying+0.052
 Is+0.051
 awoke+0.047
Lou+0.046
 Ashley+0.046
Neg logit contributions
bled-0.053
uddled-0.053
 olds-0.050
 golf-0.049
iggle-0.048
Token big
Token ID1263
Feature activation-3.40e-02

Pos logit contributions
ifying+0.085
 Is+0.083
 Debbie+0.081
Pen+0.079
 Ashley+0.079
Neg logit contributions
uddle-0.063
uddled-0.061
 golf-0.057
bled-0.054
 olds-0.053
Token hug
Token ID16225
Feature activation-1.16e-02

Pos logit contributions
ifying+0.039
 Is+0.035
 Debbie+0.033
Am+0.033
Lou+0.033
Neg logit contributions
uddled-0.069
bled-0.069
iggle-0.068
 golf-0.068
uddle-0.067
Token and
Token ID290
Feature activation+2.02e-02

Pos logit contributions
ifying+0.018
 Is+0.017
 Ashley+0.017
 awoke+0.016
 bit+0.016
Neg logit contributions
bled-0.020
uddled-0.019
 olds-0.018
iggle-0.018
ails-0.018
Token the
Token ID262
Feature activation+7.71e-03

Pos logit contributions
bled+0.041
uddled+0.040
bles+0.038
ails+0.038
agons+0.038
Neg logit contributions
ifying-0.025
 bit-0.025
 care-0.025
 Is-0.024
 Ashley-0.024
Token little
Token ID1310
Feature activation-9.67e-03

Pos logit contributions
bled+0.013
uddled+0.013
ails+0.012
iggle+0.012
 olds+0.012
Neg logit contributions
ifying-0.012
 bit-0.011
 Ashley-0.011
 Is-0.011
 single-0.011
Token mysterious
Token ID11428
Feature activation+1.34e-02

Pos logit contributions
ifying+0.011
 Is+0.010
 Ashley+0.010
 awoke+0.010
 Debbie+0.010
Neg logit contributions
bled-0.010
uddled-0.010
 olds-0.009
iggle-0.009
uddle-0.009
Token voice
Token ID3809
Feature activation+1.81e-02

Pos logit contributions
bled+0.024
uddled+0.023
ails+0.021
 olds+0.020
bles+0.020
Neg logit contributions
ifying-0.020
 bit-0.020
 Is-0.020
 awoke-0.020
 single-0.019
Token.
Token ID13
Feature activation-3.52e-03

Pos logit contributions
bled+0.039
uddled+0.038
uddle+0.035
 olds+0.035
irled+0.035
Neg logit contributions
ifying-0.021
 Is-0.021
 awoke-0.020
 bit-0.020
 distance-0.019
Token It
Token ID632
Feature activation+1.17e-02

Pos logit contributions
ifying+0.005
 bit+0.004
 Is+0.004
 Ashley+0.004
 awoke+0.004
Neg logit contributions
bled-0.007
uddled-0.007
 olds-0.006
iggle-0.006
ails-0.006
Token said
Token ID531
Feature activation+4.93e-03

Pos logit contributions
bled+0.023
uddled+0.023
 olds+0.022
ails+0.022
iggle+0.022
Neg logit contributions
ifying-0.015
 Is-0.015
 awoke-0.014
 bit-0.014
 Ashley-0.014
Token "
Token ID366
Feature activation-5.20e-02

Pos logit contributions
bled+0.009
uddled+0.009
 olds+0.008
iggle+0.008
ails+0.008
Neg logit contributions
ifying-0.007
 Is-0.007
 Ashley-0.006
 awoke-0.006
 bit-0.006
TokenTell
Token ID24446
Feature activation-3.11e-02

Pos logit contributions
ifying+0.073
 bit+0.061
 Ashley+0.060
 cavity+0.059
 awoke+0.055
Neg logit contributions
ini-0.092
uddled-0.086
bled-0.086
ized-0.084
add-0.083
Token me
Token ID502
Feature activation-5.99e-03

Pos logit contributions
ifying+0.059
 bit+0.053
 single+0.053
 distance+0.052
 awoke+0.051
Neg logit contributions
 olds-0.043
iggle-0.042
bled-0.040
ires-0.039
ans-0.039
Token!
Token ID0
Feature activation-5.43e-03

Pos logit contributions
ifying+0.009
 Is+0.008
 bit+0.008
 Ashley+0.008
 awoke+0.008
Neg logit contributions
bled-0.011
uddled-0.011
 olds-0.010
iggle-0.010
ails-0.010
Token Tell
Token ID14026
Feature activation-1.29e-02

Pos logit contributions
ifying+0.007
 bit+0.006
 awoke+0.006
 Is+0.006
 Ashley+0.006
Neg logit contributions
bled-0.011
uddled-0.011
 olds-0.010
iggle-0.010
ails-0.010
Token me
Token ID502
Feature activation-2.33e-02

Pos logit contributions
ifying+0.022
 Is+0.020
 single+0.020
 bit+0.020
 awoke+0.020
Neg logit contributions
bled-0.020
uddled-0.019
iggle-0.019
 olds-0.019
ans-0.018
Token saw
Token ID2497
Feature activation+1.02e-02

Pos logit contributions
ifying+0.010
 Is+0.009
 Ashley+0.008
 bit+0.008
 single+0.008
Neg logit contributions
bled-0.016
uddled-0.016
iggle-0.015
 olds-0.015
ails-0.015
Token a
Token ID257
Feature activation-1.11e-02

Pos logit contributions
bled+0.021
uddled+0.021
 olds+0.020
iggle+0.020
ails+0.020
Neg logit contributions
ifying-0.012
 Is-0.011
 awoke-0.011
 bit-0.011
 Ashley-0.011
Token beautiful
Token ID4950
Feature activation+2.60e-02

Pos logit contributions
ifying+0.019
 Is+0.018
 Ashley+0.018
 awoke+0.018
 Debbie+0.018
Neg logit contributions
bled-0.016
uddled-0.016
 olds-0.015
uddle-0.015
 golf-0.015
Token golden
Token ID10861
Feature activation+2.52e-02

Pos logit contributions
bled+0.050
uddled+0.048
ails+0.045
agons+0.044
 olds+0.044
Neg logit contributions
ifying-0.037
 bit-0.036
 single-0.035
 awoke-0.035
 distance-0.035
Token crown
Token ID12389
Feature activation+2.63e-03

Pos logit contributions
bled+0.048
uddled+0.048
 olds+0.045
ails+0.045
iggle+0.044
Neg logit contributions
ifying-0.035
 bit-0.033
 single-0.032
 awoke-0.031
 distance-0.031
Token in
Token ID287
Feature activation-4.91e-02

Pos logit contributions
bled+0.005
uddled+0.005
 olds+0.005
iggle+0.005
ails+0.005
Neg logit contributions
ifying-0.004
 Is-0.003
 bit-0.003
 Ashley-0.003
 awoke-0.003
Token the
Token ID262
Feature activation-1.48e-02

Pos logit contributions
 Ashley+0.072
use+0.072
ifying+0.070
Jonathan+0.069
 awoke+0.069
Neg logit contributions
 golf-0.081
bled-0.074
ized-0.070
castle-0.067
 olds-0.067
Token bottom
Token ID4220
Feature activation+3.50e-03

Pos logit contributions
 Is+0.023
ifying+0.023
 Ashley+0.023
Jonathan+0.022
 Debbie+0.022
Neg logit contributions
bled-0.024
uddled-0.023
 golf-0.022
 olds-0.022
uddle-0.022
Token of
Token ID286
Feature activation-9.49e-03

Pos logit contributions
bled+0.007
uddled+0.007
 olds+0.006
iggle+0.006
ails+0.006
Neg logit contributions
ifying-0.005
 Is-0.004
 bit-0.004
 Ashley-0.004
 awoke-0.004
Token the
Token ID262
Feature activation-2.81e-03

Pos logit contributions
ifying+0.010
 Is+0.009
 Ashley+0.009
 awoke+0.009
 bit+0.009
Neg logit contributions
bled-0.021
uddled-0.020
 golf-0.019
 olds-0.019
iggle-0.019
Token pond
Token ID16723
Feature activation+1.74e-02

Pos logit contributions
ifying+0.005
 Is+0.005
 Ashley+0.005
 awoke+0.005
 bit+0.005
Neg logit contributions
bled-0.004
uddled-0.004
 olds-0.004
iggle-0.004
ails-0.004
Token want
Token ID765
Feature activation-6.32e-02

Pos logit contributions
ifying+0.057
Oh+0.050
 Is+0.049
Jonathan+0.049
 Ashley+0.048
Neg logit contributions
iggle-0.062
 olds-0.056
 golf-0.056
ans-0.054
ails-0.054
Token to
Token ID284
Feature activation-3.14e-02

Pos logit contributions
ifying+0.092
 Is+0.086
 awoke+0.078
Ba+0.076
Jonathan+0.075
Neg logit contributions
 olds-0.112
 golf-0.104
ires-0.101
uddled-0.098
iggle-0.097
Token pray
Token ID12472
Feature activation-1.79e-02

Pos logit contributions
ifying+0.051
 single+0.043
Oh+0.043
 Is+0.043
 Ashley+0.043
Neg logit contributions
iggle-0.053
 golf-0.049
castle-0.049
bled-0.049
 olds-0.048
Token.
Token ID13
Feature activation-1.78e-02

Pos logit contributions
ifying+0.027
 Is+0.025
 awoke+0.024
 Ashley+0.024
Lou+0.024
Neg logit contributions
bled-0.031
uddled-0.031
 olds-0.030
iggle-0.029
ails-0.028
Token We
Token ID775
Feature activation-5.06e-02

Pos logit contributions
ifying+0.029
 distance+0.027
 bit+0.027
 single+0.027
 awoke+0.027
Neg logit contributions
bled-0.027
uddled-0.027
 olds-0.027
mm-0.026
ires-0.026
Token want
Token ID765
Feature activation-5.52e-02

Pos logit contributions
ifying+0.076
Oh+0.070
Am+0.069
 Ashley+0.068
 Debbie+0.068
Neg logit contributions
iggle-0.087
ans-0.079
uddled-0.079
agons-0.078
uddle-0.077
Token to
Token ID284
Feature activation-3.85e-02

Pos logit contributions
ifying+0.080
 Is+0.069
 awoke+0.063
 distance+0.063
ah+0.062
Neg logit contributions
 olds-0.099
uddled-0.095
 golf-0.094
ires-0.093
bled-0.092
Token play
Token ID711
Feature activation+1.56e-03

Pos logit contributions
ifying+0.062
Oh+0.054
 single+0.054
 Is+0.053
 Ashley+0.051
Neg logit contributions
iggle-0.064
bled-0.059
castle-0.058
 golf-0.057
 olds-0.057
Token,"
Token ID553
Feature activation-1.91e-03

Pos logit contributions
bled+0.003
uddled+0.003
 olds+0.003
iggle+0.003
ails+0.003
Neg logit contributions
ifying-0.002
 Is-0.002
 Ashley-0.002
 bit-0.002
 awoke-0.002
Token Ben
Token ID3932
Feature activation+6.60e-03

Pos logit contributions
ifying+0.003
 Is+0.002
 awoke+0.002
 bit+0.002
 Ashley+0.002
Neg logit contributions
bled-0.004
uddled-0.004
 olds-0.003
iggle-0.003
ails-0.003
Token says
Token ID1139
Feature activation+1.36e-02

Pos logit contributions
bled+0.012
uddled+0.012
 olds+0.011
iggle+0.011
ails+0.011
Neg logit contributions
ifying-0.010
 Is-0.009
 Ashley-0.009
 bit-0.009
 awoke-0.009