TOP ACTIVATIONS
MAX = 4.645

end , Tom and Mr . Fox stopped being friends because
, Jane y and Mrs . Smith would chat and share
, Fl opsy and Mr . Snow man were the best
should not tease the ost ric hes . They are not
They do not pack ost ric hes anymore . They pack
day , Tim and Mr . Bear went to the park
orman and his tutor Mr . Johnson were in the park
Jen and Sam and Mr . Lee were covered in sticky
Tom and Mr . Fox went on a long
day , Lucy and Mr . Bear went to the park
named Tim . They spent every day together , running and
" I 'm sorry Mr . Raven , I didn 't
on , Lily and Mr . Johnson became friends and he
by side . They spent every day together , playing ,
laughed . Tim and Mr . Squirrel became good friends .
rain . Lily and Mr . Brown ran back to his
" Thank you , Mr . Owl ! I found the
wanted to touch the ost ric hes . They reached out
day , Lily and Mr . Sp ud went to the
visit their neighbor , Mrs . Lee . Mrs . Lee

MIN ACTIVATIONS
MIN = -4.188

too late . <|endoftext|> " Look , Mom my !" exclaimed
so happy . <|endoftext|> " Look !" Mary said to her
the medicine . <|endoftext|> " Look up there , Mama !"
lively adventure ! <|endoftext|> " Look , Mom ! It 's
voice shouted out . " Look ! It is a race
â Look into it , â
quiet anymore . <|endoftext|> " Look Mom my , what can
Look at your beautiful red pet
away . " Look , Ben and Lily ."
shouted to Mum , " Look ! There 's a dove
Look , these are for me
Look , Sam ! Look how
Daddy said , " Look under the bed . That
Mom and said , " Look , Mom , it 's
for others . <|endoftext|> " Look , there 's a stand
ooky . " Look , Lily , look !"
angry and said , " Look what you did , Ben
colorful . " Look , Mom ," Tom said
of different colours . " Look ," said M ummy .
and examined it . " Look at its little claws Ben

INTERVAL 2.437 - 4.645
CONTAINS 0.077%

mom about Max taking his shield . His mom said ,
Lily loved to eat av oc ados , they were her
?" Tom and Mia think . They are kind
What is this , Mr . Owl ?" Benny asked .
he watched the hero 's triumph . He had known all

INTERVAL 0.228 - 2.437
CONTAINS 37.283%

" You have the blue boat !" " I
In the woods , Buddy met a little bird
by himself . He looked around and saw a friendly bird
. " Will you come back ?" Tom asked
band age with pride . <|endoftext|> The teacher stood at the

INTERVAL -1.980 - 0.228
CONTAINS 62.088%

She continued to bring beef to her neighbors , and no
box with many nails inside . Lily wanted to make something
. To his surprise , he saw a little frog !
you can carry it around ." Sarah was so
L ila 's hand . L ila cried out loud .

INTERVAL -4.188 - -1.980
CONTAINS 0.552%

were having fun , but then Sara saw Ben 's car
This is a mineral , children . It is a part
looking for them in the me adow . She was very
. " Thank you , mom my ! You are very
a cozy room with many toys . One day
Token end
Token ID886
Feature activation-2.05e-01

Pos logit contributions
 someone+0.002
ily+0.001
Fore+0.001
Prin+0.001
P+0.001
Neg logit contributions
 dodging-0.004
iled-0.004
oing-0.004
ang-0.004
ples-0.004
Token,
Token ID11
Feature activation-2.85e-02

Pos logit contributions
 someone+0.005
ily+0.005
Fore+0.004
Prin+0.004
 Claire+0.004
Neg logit contributions
 dodging-0.005
iled-0.005
arate-0.005
ang-0.004
ples-0.004
Token Tom
Token ID4186
Feature activation-5.30e-02

Pos logit contributions
ily+0.001
Prin+0.001
P+0.001
Fore+0.001
abby+0.000
Neg logit contributions
 dodging-0.001
iled-0.001
arate-0.001
flies-0.001
oc-0.001
Token and
Token ID290
Feature activation+9.36e-01

Pos logit contributions
 someone+0.001
ily+0.001
Prin+0.001
Fore+0.001
 Claire+0.001
Neg logit contributions
 dodging-0.001
iled-0.001
arate-0.001
ples-0.001
ang-0.001
Token Mr
Token ID1770
Feature activation+1.45e+00

Pos logit contributions
 dodging+0.027
oc+0.026
iosity+0.025
flies+0.025
iled+0.025
Neg logit contributions
ily-0.017
Fore-0.015
 someone-0.014
P-0.014
Prin-0.014
Token.
Token ID13
Feature activation+4.64e+00

Pos logit contributions
 dodging+0.038
arate+0.036
iled+0.035
ang+0.035
ples+0.034
Neg logit contributions
 someone-0.033
ily-0.032
 Claire-0.029
Fore-0.028
Prin-0.028
Token Fox
Token ID5426
Feature activation+2.31e+00

Pos logit contributions
arate+0.115
ang+0.114
 dodging+0.114
iled+0.113
iosity+0.110
Neg logit contributions
 someone-0.110
 Claire-0.098
ily-0.095
Prin-0.093
Fore-0.091
Token stopped
Token ID5025
Feature activation+1.20e+00

Pos logit contributions
 dodging+0.083
itching+0.082
ang+0.082
tain+0.081
oing+0.080
Neg logit contributions
 someone-0.030
ily-0.029
 went-0.027
Prin-0.022
 out-0.021
Token being
Token ID852
Feature activation+1.66e+00

Pos logit contributions
ider+0.034
fy+0.032
ical+0.032
ience+0.031
mer+0.030
Neg logit contributions
 someone-0.029
 out-0.026
 her-0.025
 them-0.025
 Lily-0.025
Token friends
Token ID2460
Feature activation+3.31e-01

Pos logit contributions
 dodging+0.043
ples+0.041
oing+0.041
flies+0.041
tain+0.040
Neg logit contributions
 someone-0.040
ily-0.037
 Claire-0.033
Prin-0.032
P-0.031
Token because
Token ID780
Feature activation+1.29e+00

Pos logit contributions
itching+0.010
 dodging+0.010
ang+0.010
ples+0.010
tain+0.010
Neg logit contributions
 someone-0.006
ily-0.006
 Claire-0.005
 out-0.005
 Sophie-0.005
Token,
Token ID11
Feature activation+3.70e-02

Pos logit contributions
 someone+0.015
ily+0.015
Fore+0.014
Prin+0.014
 Claire+0.013
Neg logit contributions
 dodging-0.014
iled-0.014
arate-0.013
ples-0.013
flies-0.013
Token Jane
Token ID12091
Feature activation-8.70e-01

Pos logit contributions
 dodging+0.001
iled+0.001
flies+0.001
arate+0.001
iosity+0.001
Neg logit contributions
ily-0.001
 someone-0.001
Fore-0.001
Prin-0.001
P-0.001
Tokeny
Token ID88
Feature activation+8.28e-01

Pos logit contributions
ily+0.017
 someone+0.017
Prin+0.015
 Claire+0.014
Fore+0.014
Neg logit contributions
 dodging-0.026
arate-0.023
iosity-0.023
iled-0.023
oing-0.023
Token and
Token ID290
Feature activation+4.14e-01

Pos logit contributions
 dodging+0.023
iled+0.022
arate+0.021
ples+0.021
flies+0.021
Neg logit contributions
 someone-0.017
ily-0.017
Fore-0.015
Prin-0.015
 Claire-0.014
Token Mrs
Token ID9074
Feature activation+2.78e+00

Pos logit contributions
 dodging+0.013
 mastering+0.012
iosity+0.012
arate+0.012
flies+0.012
Neg logit contributions
ily-0.007
Fore-0.006
 someone-0.006
Prin-0.005
ackers-0.005
Token.
Token ID13
Feature activation+4.14e+00

Pos logit contributions
 dodging+0.072
iled+0.068
arate+0.068
ang+0.066
oing+0.065
Neg logit contributions
 someone-0.064
ily-0.063
 Claire-0.056
Fore-0.055
Prin-0.055
Token Smith
Token ID4176
Feature activation+1.47e+00

Pos logit contributions
 dodging+0.106
iled+0.101
arate+0.101
ang+0.099
oing+0.098
Neg logit contributions
 someone-0.094
ily-0.090
 Claire-0.087
Prin-0.082
 Sophie-0.081
Token would
Token ID561
Feature activation-6.33e-01

Pos logit contributions
 dodging+0.051
iosity+0.048
itching+0.048
arate+0.048
tain+0.047
Neg logit contributions
ily-0.021
 someone-0.020
Prin-0.018
 Claire-0.017
ackers-0.017
Token chat
Token ID8537
Feature activation+1.42e+00

Pos logit contributions
 someone+0.011
ily+0.010
 Claire+0.010
 go+0.009
Prin+0.009
Neg logit contributions
 dodging-0.020
ang-0.019
oing-0.019
iled-0.019
arate-0.018
Token and
Token ID290
Feature activation+3.55e-01

Pos logit contributions
 dodging+0.034
iled+0.033
arate+0.032
ples+0.031
ang+0.031
Neg logit contributions
 someone-0.034
ily-0.034
Fore-0.030
Prin-0.030
 Claire-0.030
Token share
Token ID2648
Feature activation-7.16e-03

Pos logit contributions
 dodging+0.009
iled+0.009
ang+0.009
oing+0.008
itching+0.008
Neg logit contributions
 someone-0.008
ily-0.008
 Claire-0.007
Prin-0.007
Fore-0.007
Token,
Token ID11
Feature activation+7.54e-01

Pos logit contributions
 dodging+0.014
iled+0.013
arate+0.013
ang+0.012
flies+0.012
Neg logit contributions
 someone-0.015
ily-0.015
Fore-0.014
Prin-0.014
 Claire-0.013
Token Fl
Token ID1610
Feature activation-9.57e-02

Pos logit contributions
 dodging+0.021
iled+0.020
oc+0.020
ang+0.020
arate+0.019
Neg logit contributions
ily-0.015
 someone-0.014
Fore-0.013
Prin-0.013
P-0.012
Tokenopsy
Token ID44522
Feature activation+2.64e-01

Pos logit contributions
 someone+0.002
ily+0.002
Prin+0.002
 Claire+0.002
Fore+0.002
Neg logit contributions
 dodging-0.002
iled-0.002
ang-0.002
oing-0.002
flies-0.002
Token and
Token ID290
Feature activation+7.33e-01

Pos logit contributions
 dodging+0.009
sing+0.008
itching+0.008
iosity+0.008
 looming+0.008
Neg logit contributions
ily-0.004
 someone-0.004
 went-0.004
Prin-0.004
P-0.003
Token Mr
Token ID1770
Feature activation+1.19e+00

Pos logit contributions
 dodging+0.020
oc+0.018
iosity+0.018
iled+0.018
girls+0.018
Neg logit contributions
ily-0.016
 someone-0.014
Fore-0.014
Prin-0.014
P-0.013
Token.
Token ID13
Feature activation+4.13e+00

Pos logit contributions
 dodging+0.030
iled+0.029
arate+0.028
ples+0.028
ang+0.027
Neg logit contributions
 someone-0.028
ily-0.027
Fore-0.024
Prin-0.024
 Claire-0.024
Token Snow
Token ID7967
Feature activation+2.06e+00

Pos logit contributions
 dodging+0.103
iled+0.099
arate+0.099
ples+0.098
iosity+0.096
Neg logit contributions
 someone-0.096
ily-0.091
Fore-0.084
 Claire-0.083
Prin-0.082
Tokenman
Token ID805
Feature activation+1.67e+00

Pos logit contributions
 dodging+0.051
arate+0.050
iosity+0.047
tain+0.047
itching+0.047
Neg logit contributions
 someone-0.048
ily-0.048
Prin-0.043
Fore-0.042
 Claire-0.041
Token were
Token ID547
Feature activation+2.84e-01

Pos logit contributions
 dodging+0.056
itching+0.056
iosity+0.054
akable+0.054
istent+0.054
Neg logit contributions
ily-0.026
 someone-0.026
 went-0.023
Fore-0.022
 out-0.021
Token the
Token ID262
Feature activation-4.38e-01

Pos logit contributions
 dodging+0.009
iled+0.009
arate+0.009
oing+0.009
ples+0.009
Neg logit contributions
 someone-0.005
ily-0.004
Fore-0.004
Prin-0.004
 Claire-0.004
Token best
Token ID1266
Feature activation+8.41e-01

Pos logit contributions
 someone+0.006
ily+0.006
Prin+0.005
 Claire+0.005
Fore+0.005
Neg logit contributions
 dodging-0.015
iled-0.014
arate-0.014
ang-0.014
flies-0.014
Token should
Token ID815
Feature activation-9.13e-01

Pos logit contributions
 dodging+0.015
itching+0.014
ang+0.014
oing+0.014
tain+0.014
Neg logit contributions
 someone-0.010
ily-0.010
Prin-0.008
Fore-0.008
 Claire-0.008
Token not
Token ID407
Feature activation+9.62e-01

Pos logit contributions
 someone+0.020
ily+0.020
Fore+0.018
Prin+0.018
 Claire+0.017
Neg logit contributions
 dodging-0.024
iled-0.023
arate-0.023
ples-0.022
flies-0.022
Token tease
Token ID34691
Feature activation+9.22e-01

Pos logit contributions
 dodging+0.027
years+0.026
ang+0.026
ience+0.025
iously+0.025
Neg logit contributions
 someone-0.021
ily-0.019
 go-0.017
Prin-0.016
Fore-0.016
Token the
Token ID262
Feature activation+5.52e-01

Pos logit contributions
 dodging+0.026
iled+0.025
flies+0.024
arate+0.024
ang+0.024
Neg logit contributions
ily-0.018
Prin-0.017
 someone-0.017
Fore-0.017
 Claire-0.016
Token ost
Token ID23619
Feature activation+2.29e+00

Pos logit contributions
iled+0.012
 dodging+0.012
arate+0.012
itching+0.012
oing+0.012
Neg logit contributions
 someone-0.014
ily-0.014
Fore-0.013
Prin-0.012
 Claire-0.012
Tokenric
Token ID1173
Feature activation+4.11e+00

Pos logit contributions
 dodging+0.041
arate+0.041
 tug+0.040
manship+0.039
iosity+0.038
Neg logit contributions
ily-0.063
 someone-0.063
Fore-0.057
Prin-0.055
ackers-0.054
Tokenhes
Token ID956
Feature activation+1.16e+00

Pos logit contributions
iled+0.131
ang+0.126
 dodging+0.124
oing+0.122
flies+0.120
Neg logit contributions
 Claire-0.065
 someone-0.063
 Sophie-0.061
 Natalie-0.059
oph-0.059
Token.
Token ID13
Feature activation+2.35e-02

Pos logit contributions
 dodging+0.030
iled+0.029
arate+0.028
ples+0.028
oing+0.028
Neg logit contributions
 someone-0.026
ily-0.025
Fore-0.022
Prin-0.022
 Claire-0.022
Token They
Token ID1119
Feature activation+1.52e-01

Pos logit contributions
 dodging+0.001
iled+0.001
arate+0.001
itching+0.001
oing+0.001
Neg logit contributions
 someone-0.000
ily-0.000
Fore-0.000
Prin-0.000
 Claire-0.000
Token are
Token ID389
Feature activation-4.70e-01

Pos logit contributions
 dodging+0.006
itching+0.006
ang+0.006
oing+0.006
tain+0.006
Neg logit contributions
 someone-0.002
ily-0.002
 rest-0.001
 go-0.001
 other-0.001
Token not
Token ID407
Feature activation-4.14e-01

Pos logit contributions
 someone+0.010
ily+0.009
Fore+0.007
 out+0.007
 Lily+0.007
Neg logit contributions
oing-0.014
oc-0.014
arate-0.014
 dodging-0.013
iled-0.013
Token They
Token ID1119
Feature activation+1.20e+00

Pos logit contributions
 dodging+0.020
itching+0.019
iled+0.019
oing+0.019
ples+0.019
Neg logit contributions
 someone-0.011
ily-0.010
Fore-0.008
 Claire-0.008
Prin-0.008
Token do
Token ID466
Feature activation+6.17e-01

Pos logit contributions
 dodging+0.039
ang+0.038
oing+0.038
ples+0.037
iled+0.037
Neg logit contributions
ily-0.022
 someone-0.021
Prin-0.018
Fore-0.018
 go-0.017
Token not
Token ID407
Feature activation+1.20e+00

Pos logit contributions
 dodging+0.017
iled+0.017
arate+0.016
ples+0.016
ang+0.016
Neg logit contributions
 someone-0.013
ily-0.012
Fore-0.011
Prin-0.011
 Claire-0.011
Token pack
Token ID2353
Feature activation+7.58e-02

Pos logit contributions
 dodging+0.036
ang+0.036
itching+0.035
years+0.035
iously+0.034
Neg logit contributions
 someone-0.024
ily-0.023
 go-0.021
 time-0.020
Prin-0.019
Token ost
Token ID23619
Feature activation+2.12e+00

Pos logit contributions
 dodging+0.002
iled+0.002
arate+0.002
flies+0.002
ples+0.002
Neg logit contributions
ily-0.001
 someone-0.001
Prin-0.001
Fore-0.001
 Claire-0.001
Tokenric
Token ID1173
Feature activation+4.08e+00

Pos logit contributions
 dodging+0.041
oc+0.035
iosity+0.035
 mastering+0.035
 tug+0.035
Neg logit contributions
 someone-0.058
ily-0.056
Fore-0.053
P-0.051
ate-0.051
Tokenhes
Token ID956
Feature activation+1.30e+00

Pos logit contributions
iled+0.131
 dodging+0.129
ang+0.125
flies+0.122
oing+0.121
Neg logit contributions
 someone-0.060
 Claire-0.059
oph-0.056
 Sophie-0.055
Prin-0.054
Token anymore
Token ID7471
Feature activation+8.64e-02

Pos logit contributions
 dodging+0.040
iled+0.038
arate+0.037
ples+0.037
oing+0.037
Neg logit contributions
 someone-0.023
ily-0.023
Fore-0.020
Prin-0.019
 Claire-0.019
Token.
Token ID13
Feature activation+2.70e-01

Pos logit contributions
 dodging+0.003
ples+0.003
arate+0.003
iosity+0.002
akable+0.002
Neg logit contributions
 someone-0.002
ily-0.002
Fore-0.001
 Claire-0.001
Prin-0.001
Token They
Token ID1119
Feature activation+8.81e-01

Pos logit contributions
 dodging+0.007
iled+0.007
arate+0.007
ples+0.007
ang+0.007
Neg logit contributions
 someone-0.006
ily-0.006
Fore-0.005
Prin-0.005
 Claire-0.005
Token pack
Token ID2353
Feature activation-6.46e-01

Pos logit contributions
 dodging+0.031
oing+0.031
ang+0.031
ples+0.030
iled+0.029
Neg logit contributions
ily-0.014
 someone-0.013
 go-0.012
 rest-0.012
 went-0.011
Token day
Token ID1110
Feature activation-1.74e-01

Pos logit contributions
 dodging+0.017
iled+0.017
ang+0.016
oing+0.016
arate+0.016
Neg logit contributions
 someone-0.010
ily-0.009
Prin-0.009
Fore-0.009
 Claire-0.008
Token,
Token ID11
Feature activation+1.31e-01

Pos logit contributions
ily+0.005
 someone+0.005
Prin+0.004
Fore+0.004
 Claire+0.004
Neg logit contributions
 dodging-0.004
iled-0.004
arate-0.003
oc-0.003
oing-0.003
Token Tim
Token ID5045
Feature activation-1.04e+00

Pos logit contributions
 dodging+0.004
iled+0.004
ples+0.004
arate+0.004
oing+0.004
Neg logit contributions
 someone-0.002
ily-0.002
 Claire-0.002
Fore-0.002
Prin-0.002
Token and
Token ID290
Feature activation+4.74e-01

Pos logit contributions
ily+0.023
 someone+0.023
Fore+0.022
 Claire+0.021
Prin+0.021
Neg logit contributions
iled-0.026
 dodging-0.025
oc-0.024
ky-0.023
arate-0.023
Token Mr
Token ID1770
Feature activation+1.03e+00

Pos logit contributions
 dodging+0.014
oc+0.013
iled+0.013
 roaring+0.013
iosity+0.013
Neg logit contributions
ily-0.009
Fore-0.007
 someone-0.007
Prin-0.007
P-0.007
Token.
Token ID13
Feature activation+4.08e+00

Pos logit contributions
 dodging+0.028
arate+0.026
iled+0.026
ang+0.025
iosity+0.025
Neg logit contributions
 someone-0.023
ily-0.022
 Claire-0.019
Fore-0.019
Prin-0.019
Token Bear
Token ID14732
Feature activation+1.97e+00

Pos logit contributions
 dodging+0.102
arate+0.099
iled+0.099
ang+0.098
iosity+0.097
Neg logit contributions
 someone-0.096
ily-0.089
 Claire-0.086
Prin-0.082
Fore-0.081
Token went
Token ID1816
Feature activation-7.05e-01

Pos logit contributions
 dodging+0.075
itching+0.073
arate+0.072
ang+0.072
tain+0.071
Neg logit contributions
 someone-0.025
ily-0.025
 went-0.020
 out-0.017
 Claire-0.016
Token to
Token ID284
Feature activation+1.72e-01

Pos logit contributions
 someone+0.012
ily+0.012
 out+0.011
 Lily+0.009
Prin+0.009
Neg logit contributions
ples-0.023
istent-0.022
vered-0.022
 grams-0.022
oc-0.022
Token the
Token ID262
Feature activation+2.46e-01

Pos logit contributions
 dodging+0.005
arate+0.004
iled+0.004
oc+0.004
 roaring+0.004
Neg logit contributions
ily-0.004
Fore-0.003
 someone-0.003
Prin-0.003
P-0.003
Token park
Token ID3952
Feature activation-7.00e-01

Pos logit contributions
 dodging+0.008
iled+0.008
arate+0.008
flies+0.008
ples+0.008
Neg logit contributions
ily-0.004
 someone-0.004
Fore-0.003
Prin-0.003
 Claire-0.003
Tokenorman
Token ID26183
Feature activation-3.45e-01

Pos logit contributions
 someone+0.041
Fore+0.037
ily+0.037
Prin+0.037
 Claire+0.036
Neg logit contributions
iled-0.027
flies-0.025
ang-0.024
oing-0.024
ples-0.024
Token and
Token ID290
Feature activation+8.01e-01

Pos logit contributions
 someone+0.007
ily+0.007
Fore+0.006
Prin+0.006
 Claire+0.006
Neg logit contributions
iled-0.009
 dodging-0.009
arate-0.009
flies-0.009
ples-0.009
Token his
Token ID465
Feature activation+2.94e-01

Pos logit contributions
iled+0.017
 because+0.017
ishly+0.016
 dodging+0.016
flies+0.016
Neg logit contributions
Fore-0.018
Prin-0.017
ily-0.017
oons-0.016
 flour-0.016
Token tutor
Token ID48680
Feature activation+1.97e+00

Pos logit contributions
 dodging+0.010
iled+0.009
ang+0.009
arate+0.009
flies+0.009
Neg logit contributions
 someone-0.004
ily-0.004
Prin-0.004
Fore-0.004
 Claire-0.003
Token Mr
Token ID1770
Feature activation+1.39e+00

Pos logit contributions
 dodging+0.060
iled+0.057
oing+0.056
ang+0.056
itching+0.055
Neg logit contributions
 someone-0.037
ily-0.036
 Claire-0.031
Prin-0.031
Fore-0.031
Token.
Token ID13
Feature activation+4.05e+00

Pos logit contributions
 dodging+0.036
iled+0.034
arate+0.034
ang+0.033
iosity+0.033
Neg logit contributions
 someone-0.031
ily-0.030
 Claire-0.027
Fore-0.027
Prin-0.027
Token Johnson
Token ID5030
Feature activation+8.42e-01

Pos logit contributions
 dodging+0.097
iled+0.093
arate+0.090
ples+0.089
ang+0.088
Neg logit contributions
 someone-0.098
ily-0.096
Fore-0.088
Prin-0.087
 Claire-0.085
Token were
Token ID547
Feature activation-2.38e-01

Pos logit contributions
 dodging+0.027
oing+0.026
ang+0.025
itching+0.025
arate+0.025
Neg logit contributions
 someone-0.014
ily-0.014
 Claire-0.012
Prin-0.011
Fore-0.011
Token in
Token ID287
Feature activation+3.06e-01

Pos logit contributions
ily+0.007
 someone+0.007
Fore+0.006
Prin+0.006
atalie+0.006
Neg logit contributions
 dodging-0.004
iled-0.004
ang-0.004
arate-0.004
 roaring-0.004
Token the
Token ID262
Feature activation+3.50e-01

Pos logit contributions
oing+0.012
iled+0.012
 dodging+0.012
flies+0.012
ang+0.012
Neg logit contributions
 someone-0.004
 Claire-0.003
ily-0.002
 Sophie-0.002
 time-0.002
Token park
Token ID3952
Feature activation-5.60e-01

Pos logit contributions
 dodging+0.010
arate+0.009
iled+0.009
ky+0.009
 roaring+0.009
Neg logit contributions
ily-0.007
 someone-0.006
Fore-0.006
Prin-0.006
atalie-0.005
TokenJen
Token ID44875
Feature activation+4.00e-01

Pos logit contributions
 dodging+0.000
iled+0.000
ples+0.000
arate+0.000
flies+0.000
Neg logit contributions
 someone-0.000
ily-0.000
Fore-0.000
Prin-0.000
 Claire-0.000
Token and
Token ID290
Feature activation+1.16e+00

Pos logit contributions
 dodging+0.011
arate+0.010
iled+0.010
ples+0.010
oing+0.010
Neg logit contributions
ily-0.009
 someone-0.009
Prin-0.008
Fore-0.008
 Claire-0.007
Token Sam
Token ID3409
Feature activation+2.22e+00

Pos logit contributions
 dodging+0.034
iosity+0.032
iled+0.031
girls+0.031
arate+0.031
Neg logit contributions
ily-0.020
Prin-0.020
Fore-0.020
abby-0.018
 someone-0.018
Token and
Token ID290
Feature activation+1.81e+00

Pos logit contributions
 dodging+0.075
arate+0.074
ang+0.073
itching+0.073
oing+0.072
Neg logit contributions
ily-0.034
 someone-0.034
Prin-0.027
 Claire-0.026
Fore-0.025
Token Mr
Token ID1770
Feature activation+1.80e+00

Pos logit contributions
 dodging+0.044
iled+0.041
flies+0.040
arate+0.040
iosity+0.040
Neg logit contributions
ily-0.042
 someone-0.041
Prin-0.040
Fore-0.040
 Claire-0.038
Token.
Token ID13
Feature activation+4.05e+00

Pos logit contributions
 dodging+0.049
arate+0.047
iled+0.046
ang+0.045
itching+0.045
Neg logit contributions
 someone-0.039
ily-0.039
Fore-0.032
 Claire-0.032
Prin-0.032
Token Lee
Token ID5741
Feature activation+2.12e+00

Pos logit contributions
ang+0.105
arate+0.105
years+0.103
ience+0.103
 dodging+0.102
Neg logit contributions
 someone-0.094
ily-0.081
 Claire-0.077
atalie-0.074
Fore-0.073
Token were
Token ID547
Feature activation+1.67e-03

Pos logit contributions
ang+0.076
itching+0.073
 dodging+0.072
arate+0.071
tain+0.070
Neg logit contributions
 someone-0.032
ily-0.029
 went-0.028
 out-0.027
 go-0.026
Token covered
Token ID5017
Feature activation+1.19e+00

Pos logit contributions
 dodging+0.000
iled+0.000
arate+0.000
oing+0.000
ples+0.000
Neg logit contributions
 someone-0.000
ily-0.000
Fore-0.000
Prin-0.000
 Claire-0.000
Token in
Token ID287
Feature activation+6.32e-01

Pos logit contributions
 dodging+0.031
iled+0.030
arate+0.029
ples+0.029
flies+0.029
Neg logit contributions
 someone-0.026
ily-0.025
Fore-0.023
Prin-0.023
 Claire-0.022
Token sticky
Token ID23408
Feature activation-7.21e-02

Pos logit contributions
 dodging+0.022
oing+0.021
flies+0.021
ang+0.021
arate+0.021
Neg logit contributions
 someone-0.011
ily-0.009
 flour-0.008
 Claire-0.007
Fore-0.007
Token
Token ID198
Feature activation+1.95e-02

Pos logit contributions
ily+0.010
Fore+0.008
 someone+0.008
P+0.008
Prin+0.008
Neg logit contributions
oc-0.015
 dodging-0.014
arate-0.014
iled-0.014
ller-0.013
Token
Token ID198
Feature activation-3.66e-02

Pos logit contributions
oc+0.001
iled+0.000
ang+0.000
oing+0.000
arate+0.000
Neg logit contributions
 Claire-0.000
 someone-0.000
Prin-0.000
P-0.000
ily-0.000
TokenTom
Token ID13787
Feature activation-6.44e-01

Pos logit contributions
 someone+0.001
ily+0.001
 Claire+0.001
Fore+0.001
Prin+0.001
Neg logit contributions
 dodging-0.001
iled-0.001
arate-0.001
oc-0.001
ang-0.001
Token and
Token ID290
Feature activation+4.87e-01

Pos logit contributions
 someone+0.010
ily+0.009
Prin+0.008
Fore+0.008
 Claire+0.007
Neg logit contributions
 dodging-0.022
arate-0.021
ang-0.021
itching-0.021
ples-0.020
Token Mr
Token ID1770
Feature activation+1.37e+00

Pos logit contributions
 dodging+0.016
oc+0.015
iled+0.015
iosity+0.015
arate+0.014
Neg logit contributions
ily-0.008
 someone-0.007
Fore-0.007
Prin-0.006
P-0.006
Token.
Token ID13
Feature activation+4.00e+00

Pos logit contributions
 dodging+0.035
iled+0.033
arate+0.033
ples+0.032
ang+0.032
Neg logit contributions
 someone-0.031
ily-0.030
Fore-0.027
Prin-0.027
 Claire-0.027
Token Fox
Token ID5426
Feature activation+1.89e+00

Pos logit contributions
 dodging+0.094
iled+0.092
arate+0.092
ang+0.092
oing+0.089
Neg logit contributions
 someone-0.100
ily-0.090
 Claire-0.089
Prin-0.087
Fore-0.084
Token went
Token ID1816
Feature activation-9.84e-01

Pos logit contributions
 dodging+0.072
itching+0.070
ang+0.068
arate+0.067
oing+0.066
Neg logit contributions
ily-0.024
 someone-0.024
 went-0.023
 out-0.018
Prin-0.018
Token on
Token ID319
Feature activation+8.13e-01

Pos logit contributions
 someone+0.016
 out+0.016
ily+0.015
Prin+0.012
 Lily+0.012
Neg logit contributions
istent-0.033
ples-0.032
itching-0.031
 grams-0.031
vered-0.031
Token a
Token ID257
Feature activation+7.96e-01

Pos logit contributions
ang+0.032
ples+0.032
flies+0.032
iled+0.032
oing+0.031
Neg logit contributions
 someone-0.009
Fore-0.006
ily-0.006
 time-0.005
 Claire-0.005
Token long
Token ID890
Feature activation+1.40e+00

Pos logit contributions
 dodging+0.020
iled+0.020
arate+0.019
ples+0.019
flies+0.019
Neg logit contributions
 someone-0.018
ily-0.017
Fore-0.016
Prin-0.016
 Claire-0.015
Token day
Token ID1110
Feature activation-1.41e-01

Pos logit contributions
 dodging+0.023
iled+0.022
ang+0.021
arate+0.021
oing+0.021
Neg logit contributions
 someone-0.012
ily-0.012
Prin-0.011
Fore-0.011
 Claire-0.010
Token,
Token ID11
Feature activation+3.69e-01

Pos logit contributions
ily+0.004
Fore+0.004
 someone+0.004
Prin+0.004
P+0.004
Neg logit contributions
iled-0.002
 dodging-0.002
arate-0.002
oc-0.002
 pretending-0.002
Token Lucy
Token ID22162
Feature activation+1.28e-01

Pos logit contributions
 dodging+0.011
iled+0.010
arate+0.010
flies+0.010
ples+0.010
Neg logit contributions
ily-0.007
 someone-0.007
Fore-0.006
Prin-0.006
P-0.006
Token and
Token ID290
Feature activation+4.06e-01

Pos logit contributions
 dodging+0.004
iled+0.004
arate+0.004
ples+0.003
flies+0.003
Neg logit contributions
 someone-0.002
ily-0.002
Fore-0.002
Prin-0.002
 Claire-0.002
Token Mr
Token ID1770
Feature activation+1.24e+00

Pos logit contributions
 dodging+0.013
iosity+0.012
iled+0.012
oc+0.012
tain+0.011
Neg logit contributions
ily-0.007
 someone-0.006
Fore-0.006
Prin-0.006
P-0.005
Token.
Token ID13
Feature activation+3.99e+00

Pos logit contributions
 dodging+0.032
iled+0.030
arate+0.030
ang+0.029
ples+0.029
Neg logit contributions
 someone-0.029
ily-0.028
Fore-0.025
 Claire-0.025
Prin-0.025
Token Bear
Token ID14732
Feature activation+2.23e+00

Pos logit contributions
 dodging+0.100
arate+0.098
iled+0.097
ang+0.096
iosity+0.095
Neg logit contributions
 someone-0.094
ily-0.087
 Claire-0.086
Prin-0.081
Fore-0.079
Token went
Token ID1816
Feature activation-6.57e-01

Pos logit contributions
 dodging+0.083
itching+0.081
arate+0.079
ang+0.079
tain+0.078
Neg logit contributions
 someone-0.030
ily-0.029
 went-0.023
 Claire-0.020
Prin-0.020
Token to
Token ID284
Feature activation+2.28e-01

Pos logit contributions
 someone+0.012
ily+0.011
 out+0.011
Prin+0.009
 Claire+0.009
Neg logit contributions
ples-0.021
istent-0.020
oc-0.020
iled-0.020
vered-0.020
Token the
Token ID262
Feature activation+4.12e-01

Pos logit contributions
 dodging+0.006
arate+0.005
 pretending+0.005
 roaring+0.005
 tug+0.005
Neg logit contributions
ily-0.005
Fore-0.005
P-0.005
Prin-0.004
 someone-0.004
Token park
Token ID3952
Feature activation-6.50e-01

Pos logit contributions
 dodging+0.013
iled+0.013
arate+0.012
flies+0.012
ples+0.012
Neg logit contributions
ily-0.006
 someone-0.006
Fore-0.005
Prin-0.005
 Claire-0.005
Token named
Token ID3706
Feature activation+4.86e-02

Pos logit contributions
 someone+0.027
ily+0.026
 Lily+0.020
+0.020
Prin+0.020
Neg logit contributions
 dodging-0.046
itching-0.045
istent-0.045
akable-0.044
iously-0.043
Token Tim
Token ID5045
Feature activation-4.28e-01

Pos logit contributions
 dodging+0.001
iled+0.001
flies+0.001
arate+0.001
girls+0.001
Neg logit contributions
ily-0.001
 someone-0.001
Fore-0.001
Prin-0.001
 Claire-0.001
Token.
Token ID13
Feature activation+6.46e-01

Pos logit contributions
 someone+0.010
ily+0.009
Fore+0.008
Prin+0.008
 Claire+0.008
Neg logit contributions
 dodging-0.011
iled-0.011
arate-0.010
ples-0.010
flies-0.010
Token They
Token ID1119
Feature activation+3.63e-01

Pos logit contributions
 dodging+0.020
iled+0.019
arate+0.019
ples+0.019
itching+0.019
Neg logit contributions
 someone-0.011
ily-0.011
Fore-0.009
 Claire-0.009
Prin-0.009
Token spent
Token ID3377
Feature activation+1.82e+00

Pos logit contributions
oing+0.012
 dodging+0.012
itching+0.012
arate+0.012
ang+0.011
Neg logit contributions
ily-0.006
 someone-0.005
Prin-0.005
 went-0.005
 Claire-0.005
Token every
Token ID790
Feature activation+3.99e+00

Pos logit contributions
 dodging+0.055
iled+0.052
arate+0.050
oc+0.050
ples+0.049
Neg logit contributions
 someone-0.033
ily-0.032
Fore-0.029
Prin-0.029
 Claire-0.029
Token day
Token ID1110
Feature activation+3.34e-01

Pos logit contributions
 dodging+0.120
iled+0.113
arate+0.111
ang+0.110
iosity+0.108
Neg logit contributions
 someone-0.073
ily-0.071
Prin-0.064
Fore-0.063
 Claire-0.062
Token together
Token ID1978
Feature activation+1.35e+00

Pos logit contributions
flies+0.009
ples+0.009
iosity+0.009
 terrifying+0.009
fly+0.008
Neg logit contributions
 someone-0.006
ily-0.006
P-0.006
 out-0.006
 rest-0.005
Token,
Token ID11
Feature activation+1.05e+00

Pos logit contributions
 dodging+0.039
akable+0.039
iosity+0.038
oing+0.038
ples+0.038
Neg logit contributions
 someone-0.026
ily-0.024
 out-0.021
 Claire-0.021
 time-0.020
Token running
Token ID2491
Feature activation-3.56e-02

Pos logit contributions
ang+0.023
ples+0.022
iled+0.022
itching+0.022
iosity+0.022
Neg logit contributions
 someone-0.027
ily-0.025
 Claire-0.024
 Lily-0.023
 Sophie-0.023
Token and
Token ID290
Feature activation+6.56e-01

Pos logit contributions
 someone+0.001
ily+0.001
Prin+0.001
 Claire+0.001
 out+0.001
Neg logit contributions
ples-0.001
 dodging-0.001
iled-0.001
itching-0.001
oing-0.001
Token "
Token ID366
Feature activation-1.43e-01

Pos logit contributions
arate+0.001
ang+0.001
iled+0.001
ples+0.001
 dodging+0.001
Neg logit contributions
 someone-0.000
ily-0.000
 Claire-0.000
Prin-0.000
Fore-0.000
TokenI
Token ID40
Feature activation-1.13e+00

Pos logit contributions
ily+0.003
 someone+0.003
 Claire+0.003
 out+0.003
 flour+0.003
Neg logit contributions
 dodging-0.003
iled-0.003
oing-0.003
ky-0.003
king-0.003
Token'm
Token ID1101
Feature activation+1.02e+00

Pos logit contributions
 someone+0.026
ily+0.025
Fore+0.022
Prin+0.022
 Claire+0.020
Neg logit contributions
 dodging-0.032
ples-0.029
itching-0.028
iled-0.028
ang-0.028
Token sorry
Token ID7926
Feature activation+9.25e-02

Pos logit contributions
 dodging+0.026
iled+0.025
arate+0.024
ang+0.024
flies+0.024
Neg logit contributions
 someone-0.023
ily-0.022
Prin-0.020
Fore-0.020
 Claire-0.020
Token Mr
Token ID1770
Feature activation+9.87e-01

Pos logit contributions
 dodging+0.002
iled+0.002
arate+0.002
flies+0.002
ples+0.002
Neg logit contributions
 someone-0.002
ily-0.002
Fore-0.002
Prin-0.002
 Claire-0.002
Token.
Token ID13
Feature activation+3.94e+00

Pos logit contributions
iled+0.021
 dodging+0.021
flies+0.021
oc+0.020
oing+0.020
Neg logit contributions
 someone-0.025
ily-0.024
Prin-0.023
Fore-0.023
 Claire-0.022
Token Raven
Token ID12552
Feature activation+1.21e+00

Pos logit contributions
 dodging+0.087
iled+0.079
flies+0.078
arate+0.077
ang+0.076
Neg logit contributions
ily-0.104
 someone-0.102
Fore-0.094
Prin-0.094
ackers-0.091
Token,
Token ID11
Feature activation+1.22e+00

Pos logit contributions
 dodging+0.039
ples+0.037
itching+0.037
akable+0.037
arate+0.036
Neg logit contributions
 someone-0.022
ily-0.021
 out-0.017
 Claire-0.016
 time-0.016
Token I
Token ID314
Feature activation-1.83e+00

Pos logit contributions
 dodging+0.033
iled+0.031
arate+0.030
flies+0.030
oing+0.029
Neg logit contributions
ily-0.026
 someone-0.025
Prin-0.023
Fore-0.023
 Claire-0.023
Token didn
Token ID1422
Feature activation+1.63e+00

Pos logit contributions
 someone+0.042
ily+0.041
Prin+0.037
Fore+0.036
 Claire+0.034
Neg logit contributions
 dodging-0.049
ples-0.045
iled-0.044
tain-0.043
ang-0.043
Token't
Token ID470
Feature activation+7.14e-01

Pos logit contributions
 dodging+0.040
iled+0.038
ples+0.037
arate+0.036
flies+0.036
Neg logit contributions
 someone-0.040
ily-0.039
Fore-0.035
Prin-0.035
 Claire-0.034
Token on
Token ID319
Feature activation+3.75e-01

Pos logit contributions
 dodging+0.011
iled+0.010
ang+0.010
arate+0.010
oing+0.010
Neg logit contributions
 someone-0.010
ily-0.009
 Claire-0.009
Prin-0.009
Fore-0.008
Token,
Token ID11
Feature activation+3.24e-01

Pos logit contributions
 dodging+0.008
iled+0.007
arate+0.007
itching+0.007
iosity+0.007
Neg logit contributions
ily-0.010
 someone-0.010
Fore-0.010
Prin-0.010
 Claire-0.009
Token Lily
Token ID20037
Feature activation-9.12e-01

Pos logit contributions
 dodging+0.011
oc+0.010
arate+0.010
iled+0.010
 pretending+0.010
Neg logit contributions
ily-0.004
 someone-0.004
Fore-0.003
atalie-0.003
P-0.003
Token and
Token ID290
Feature activation+8.02e-01

Pos logit contributions
 someone+0.017
ily+0.017
Prin+0.015
Fore+0.015
 Claire+0.014
Neg logit contributions
 dodging-0.027
arate-0.025
ang-0.025
itching-0.025
iled-0.025
Token Mr
Token ID1770
Feature activation+1.18e+00

Pos logit contributions
 dodging+0.025
iled+0.024
oc+0.024
iosity+0.024
arate+0.024
Neg logit contributions
ily-0.013
 someone-0.012
Fore-0.011
Prin-0.011
P-0.010
Token.
Token ID13
Feature activation+3.94e+00

Pos logit contributions
 dodging+0.030
iled+0.029
arate+0.028
ples+0.028
ang+0.028
Neg logit contributions
 someone-0.027
ily-0.026
Fore-0.024
Prin-0.023
 Claire-0.023
Token Johnson
Token ID5030
Feature activation+1.15e+00

Pos logit contributions
 dodging+0.094
iled+0.092
arate+0.090
ples+0.090
ang+0.089
Neg logit contributions
 someone-0.094
ily-0.090
 Claire-0.083
Fore-0.082
Prin-0.080
Token became
Token ID2627
Feature activation+1.10e+00

Pos logit contributions
itching+0.044
ang+0.043
 dodging+0.043
iosity+0.043
 looming+0.042
Neg logit contributions
 went-0.013
ily-0.012
 someone-0.011
 go-0.010
 out-0.009
Token friends
Token ID2460
Feature activation-7.97e-01

Pos logit contributions
 dodging+0.034
iled+0.032
arate+0.032
ang+0.031
ples+0.031
Neg logit contributions
ily-0.019
 someone-0.018
Prin-0.016
Fore-0.016
 Claire-0.016
Token and
Token ID290
Feature activation+3.32e-01

Pos logit contributions
 someone+0.012
ily+0.010
+0.010
 Claire+0.009
 out+0.009
Neg logit contributions
itching-0.028
 dodging-0.028
iosity-0.028
tain-0.027
sing-0.027
Token he
Token ID339
Feature activation+1.16e+00

Pos logit contributions
 dodging+0.009
itching+0.009
iled+0.009
ang+0.009
arate+0.009
Neg logit contributions
 someone-0.007
ily-0.007
 Claire-0.006
Prin-0.006
Fore-0.006
Token by
Token ID416
Feature activation+1.34e+00

Pos logit contributions
 dodging+0.020
itching+0.019
ples+0.019
istent+0.019
iosity+0.019
Neg logit contributions
 someone-0.010
ily-0.009
 Claire-0.008
 out-0.007
Fore-0.007
Token side
Token ID1735
Feature activation+7.12e-01

Pos logit contributions
ang+0.047
arate+0.047
iled+0.047
oing+0.047
itching+0.046
Neg logit contributions
 someone-0.022
ily-0.017
 Claire-0.016
 time-0.013
Prin-0.013
Token.
Token ID13
Feature activation+6.37e-01

Pos logit contributions
 dodging+0.023
itching+0.022
akable+0.022
arate+0.022
iosity+0.022
Neg logit contributions
 someone-0.013
ily-0.012
 Claire-0.010
 time-0.009
 out-0.009
Token They
Token ID1119
Feature activation+7.45e-01

Pos logit contributions
 dodging+0.017
iled+0.016
arate+0.016
ang+0.016
ples+0.016
Neg logit contributions
 someone-0.013
ily-0.013
Fore-0.012
Prin-0.012
P-0.011
Token spent
Token ID3377
Feature activation+1.88e+00

Pos logit contributions
oing+0.026
itching+0.026
akable+0.025
 dodging+0.025
arate+0.025
Neg logit contributions
ily-0.011
 someone-0.010
 Claire-0.010
 went-0.010
Prin-0.009
Token every
Token ID790
Feature activation+3.94e+00

Pos logit contributions
 dodging+0.059
iled+0.057
arate+0.055
ples+0.055
oing+0.055
Neg logit contributions
 someone-0.032
ily-0.031
Fore-0.027
Prin-0.027
 Claire-0.027
Token day
Token ID1110
Feature activation+3.90e-01

Pos logit contributions
 dodging+0.136
iled+0.133
flies+0.131
ples+0.131
oing+0.130
Neg logit contributions
 someone-0.054
ily-0.052
Fore-0.044
 Claire-0.043
Prin-0.042
Token together
Token ID1978
Feature activation+1.35e+00

Pos logit contributions
flies+0.010
iosity+0.009
ples+0.009
oc+0.009
tain+0.009
Neg logit contributions
 someone-0.008
ily-0.008
 out-0.008
P-0.007
 Claire-0.007
Token,
Token ID11
Feature activation+1.28e+00

Pos logit contributions
 dodging+0.038
akable+0.037
oing+0.037
istered+0.037
iosity+0.036
Neg logit contributions
 someone-0.027
ily-0.025
 Claire-0.023
 time-0.022
 out-0.021
Token playing
Token ID2712
Feature activation+8.57e-01

Pos logit contributions
itching+0.028
ang+0.028
iled+0.028
istered+0.028
oing+0.028
Neg logit contributions
 someone-0.033
ily-0.030
 Claire-0.030
 Sophie-0.028
 Lily-0.027
Token,
Token ID11
Feature activation+1.78e+00

Pos logit contributions
 dodging+0.020
iled+0.019
arate+0.019
ples+0.019
flies+0.018
Neg logit contributions
 someone-0.021
ily-0.021
Fore-0.019
Prin-0.019
 Claire-0.018
Token laughed
Token ID13818
Feature activation+5.68e-01

Pos logit contributions
 dodging+0.011
iled+0.010
ples+0.010
arate+0.010
flies+0.010
Neg logit contributions
 someone-0.014
ily-0.014
Fore-0.013
Prin-0.013
 Claire-0.012
Token.
Token ID13
Feature activation+7.78e-02

Pos logit contributions
arate+0.018
ang+0.017
 dodging+0.017
itching+0.017
ples+0.017
Neg logit contributions
 someone-0.011
ily-0.010
 out-0.009
 Claire-0.009
 Lily-0.008
Token Tim
Token ID5045
Feature activation-5.46e-01

Pos logit contributions
 dodging+0.002
iled+0.002
oc+0.002
arate+0.002
ang+0.002
Neg logit contributions
ily-0.002
 someone-0.002
Prin-0.002
Fore-0.002
P-0.002
Token and
Token ID290
Feature activation+1.18e+00

Pos logit contributions
 someone+0.011
ily+0.010
Prin+0.009
Fore+0.009
 Claire+0.009
Neg logit contributions
 dodging-0.016
iled-0.015
arate-0.015
oing-0.015
ples-0.015
Token Mr
Token ID1770
Feature activation+8.46e-01

Pos logit contributions
 dodging+0.034
iled+0.030
iosity+0.030
arate+0.030
gie+0.030
Neg logit contributions
ily-0.024
 someone-0.023
Fore-0.022
Prin-0.022
P-0.020
Token.
Token ID13
Feature activation+3.89e+00

Pos logit contributions
 dodging+0.022
iled+0.021
arate+0.021
ples+0.020
iosity+0.020
Neg logit contributions
 someone-0.019
ily-0.019
Fore-0.017
Prin-0.016
 Claire-0.016
Token Squirrel
Token ID44259
Feature activation+1.12e+00

Pos logit contributions
 dodging+0.086
ples+0.084
iled+0.084
iosity+0.083
arate+0.083
Neg logit contributions
 someone-0.101
ily-0.091
 Claire-0.087
Fore-0.085
Prin-0.083
Token became
Token ID2627
Feature activation+8.82e-01

Pos logit contributions
itching+0.040
ang+0.040
akable+0.039
years+0.039
 dodging+0.039
Neg logit contributions
 went-0.017
 someone-0.014
ily-0.014
 go-0.013
 Claire-0.012
Token good
Token ID922
Feature activation+1.34e+00

Pos logit contributions
 dodging+0.023
arate+0.022
iled+0.022
ang+0.021
 mastering+0.021
Neg logit contributions
ily-0.018
 someone-0.017
atalie-0.017
Prin-0.016
Fore-0.016
Token friends
Token ID2460
Feature activation-1.15e+00

Pos logit contributions
 dodging+0.047
tain+0.045
ples+0.045
akable+0.044
oing+0.044
Neg logit contributions
 someone-0.020
ily-0.020
 Claire-0.014
P-0.014
Prin-0.013
Token.
Token ID13
Feature activation+3.57e-01

Pos logit contributions
 someone+0.019
ily+0.018
 Claire+0.014
+0.013
Fore+0.013
Neg logit contributions
 dodging-0.038
itching-0.038
flies-0.037
ples-0.037
tain-0.037
Token rain
Token ID6290
Feature activation-6.08e-01

Pos logit contributions
ily+0.016
 someone+0.015
Fore+0.014
Prin+0.014
 Claire+0.014
Neg logit contributions
 dodging-0.020
iled-0.018
flies-0.018
 scree-0.018
tain-0.018
Token.
Token ID13
Feature activation+5.76e-02

Pos logit contributions
 someone+0.014
ily+0.014
Prin+0.012
Fore+0.012
 Claire+0.012
Neg logit contributions
 dodging-0.015
iled-0.015
arate-0.015
ples-0.014
ang-0.014
Token Lily
Token ID20037
Feature activation-8.36e-01

Pos logit contributions
 dodging+0.002
iled+0.002
oc+0.002
ples+0.002
tain+0.002
Neg logit contributions
Fore-0.001
ily-0.001
P-0.001
 someone-0.001
abby-0.001
Token and
Token ID290
Feature activation+3.38e-01

Pos logit contributions
 someone+0.016
ily+0.015
Prin+0.013
Fore+0.013
 Claire+0.013
Neg logit contributions
 dodging-0.025
arate-0.024
iled-0.024
ples-0.023
flies-0.023
Token Mr
Token ID1770
Feature activation+1.21e+00

Pos logit contributions
 dodging+0.010
iosity+0.009
oc+0.009
 bothering+0.009
nered+0.009
Neg logit contributions
ily-0.007
Fore-0.006
 someone-0.006
P-0.005
Prin-0.005
Token.
Token ID13
Feature activation+3.89e+00

Pos logit contributions
 dodging+0.033
arate+0.031
iled+0.030
ang+0.030
ples+0.030
Neg logit contributions
 someone-0.027
ily-0.026
Fore-0.023
 Claire-0.022
Prin-0.022
Token Brown
Token ID4373
Feature activation+1.95e+00

Pos logit contributions
ang+0.106
arate+0.105
iled+0.103
iosity+0.101
oing+0.101
Neg logit contributions
 someone-0.086
ily-0.072
 Claire-0.072
Prin-0.071
Fore-0.069
Token ran
Token ID4966
Feature activation-7.80e-01

Pos logit contributions
 dodging+0.063
arate+0.061
itching+0.060
ples+0.059
ang+0.059
Neg logit contributions
 someone-0.032
ily-0.032
Prin-0.026
Fore-0.025
 Claire-0.024
Token back
Token ID736
Feature activation-5.54e-01

Pos logit contributions
 someone+0.014
 out+0.013
ily+0.013
 Lily+0.011
Prin+0.010
Neg logit contributions
ples-0.025
flies-0.024
itching-0.024
oing-0.024
oc-0.024
Token to
Token ID284
Feature activation-1.72e-01

Pos logit contributions
 someone+0.010
ily+0.009
 out+0.009
 Claire+0.007
 Lily+0.007
Neg logit contributions
ples-0.017
istent-0.017
 dodging-0.016
itching-0.016
oing-0.016
Token his
Token ID465
Feature activation+2.46e-01

Pos logit contributions
ily+0.003
 someone+0.003
Prin+0.003
Fore+0.003
P+0.003
Neg logit contributions
 dodging-0.005
iled-0.005
arate-0.005
iosity-0.005
oing-0.005
Token "
Token ID366
Feature activation+1.86e-01

Pos logit contributions
 someone+0.000
ily+0.000
Fore+0.000
Prin+0.000
 Claire+0.000
Neg logit contributions
 dodging-0.001
iled-0.001
arate-0.001
ples-0.001
ang-0.001
TokenThank
Token ID10449
Feature activation+1.35e+00

Pos logit contributions
oc+0.004
ples+0.004
ky+0.004
iled+0.004
 dodging+0.004
Neg logit contributions
 someone-0.004
ily-0.004
atalie-0.004
 out-0.004
 flour-0.004
Token you
Token ID345
Feature activation+2.93e-01

Pos logit contributions
 dodging+0.041
iled+0.041
arate+0.040
ang+0.040
flies+0.039
Neg logit contributions
 someone-0.022
ily-0.021
Prin-0.020
 Claire-0.019
Fore-0.019
Token,
Token ID11
Feature activation+1.04e+00

Pos logit contributions
 dodging+0.007
iled+0.006
arate+0.006
flies+0.006
ang+0.006
Neg logit contributions
 someone-0.007
ily-0.007
Prin-0.007
Fore-0.007
 Claire-0.007
Token Mr
Token ID1770
Feature activation+2.56e-01

Pos logit contributions
 dodging+0.024
iled+0.023
arate+0.022
ang+0.022
flies+0.022
Neg logit contributions
 someone-0.026
ily-0.026
Prin-0.024
Fore-0.023
 Claire-0.023
Token.
Token ID13
Feature activation+3.88e+00

Pos logit contributions
 dodging+0.007
iled+0.006
arate+0.006
ples+0.006
oing+0.006
Neg logit contributions
 someone-0.006
ily-0.006
Fore-0.005
Prin-0.005
 Claire-0.005
Token Owl
Token ID37007
Feature activation+7.36e-01

Pos logit contributions
 dodging+0.066
iled+0.062
arate+0.060
ples+0.058
oing+0.058
Neg logit contributions
 someone-0.122
ily-0.119
Prin-0.111
Fore-0.111
 Claire-0.110
Token!
Token ID0
Feature activation+1.33e+00

Pos logit contributions
 dodging+0.020
itching+0.019
oing+0.018
arate+0.018
akable+0.018
Neg logit contributions
ily-0.016
 someone-0.016
 Claire-0.014
Prin-0.013
 Lily-0.013
Token I
Token ID314
Feature activation-1.89e+00

Pos logit contributions
iled+0.030
oc+0.029
arate+0.029
 dodging+0.029
ang+0.029
Neg logit contributions
 someone-0.033
ily-0.031
Fore-0.030
Prin-0.030
P-0.028
Token found
Token ID1043
Feature activation-6.78e-01

Pos logit contributions
ily+0.041
 someone+0.039
Prin+0.035
Fore+0.035
 Claire+0.034
Neg logit contributions
 dodging-0.054
oing-0.049
iled-0.048
ples-0.048
itching-0.048
Token the
Token ID262
Feature activation+4.32e-01

Pos logit contributions
ily+0.014
ackers+0.014
atalie+0.014
Prin+0.013
abby+0.013
Neg logit contributions
 dodging-0.016
flies-0.016
iled-0.016
oc-0.016
arate-0.016
Token wanted
Token ID2227
Feature activation+9.25e-02

Pos logit contributions
 dodging+0.014
oing+0.014
arate+0.013
ang+0.013
flies+0.013
Neg logit contributions
ily-0.009
 someone-0.009
Prin-0.007
 Claire-0.007
 went-0.007
Token to
Token ID284
Feature activation+6.07e-01

Pos logit contributions
 dodging+0.002
iled+0.002
ang+0.002
arate+0.002
oing+0.002
Neg logit contributions
 someone-0.002
ily-0.002
Fore-0.002
 Claire-0.002
Prin-0.002
Token touch
Token ID3638
Feature activation+3.34e-01

Pos logit contributions
 dodging+0.015
iled+0.014
arate+0.013
flies+0.013
ples+0.013
Neg logit contributions
 someone-0.015
ily-0.015
Fore-0.013
Prin-0.013
 Claire-0.013
Token the
Token ID262
Feature activation+6.64e-01

Pos logit contributions
 dodging+0.011
iled+0.011
arate+0.011
oing+0.011
ples+0.011
Neg logit contributions
 someone-0.005
ily-0.005
Fore-0.004
Prin-0.004
 Claire-0.004
Token ost
Token ID23619
Feature activation+2.20e+00

Pos logit contributions
iled+0.020
arate+0.020
 dodging+0.019
istent+0.019
 mastering+0.019
Neg logit contributions
 someone-0.013
ily-0.011
Fore-0.011
 other-0.010
Prin-0.010
Tokenric
Token ID1173
Feature activation+3.86e+00

Pos logit contributions
 dodging+0.039
arate+0.038
 tug+0.038
manship+0.036
iosity+0.035
Neg logit contributions
 someone-0.061
ily-0.060
Fore-0.057
ackers-0.056
Prin-0.055
Tokenhes
Token ID956
Feature activation+1.05e+00

Pos logit contributions
iled+0.113
ang+0.110
oing+0.104
 dodging+0.101
ines+0.100
Neg logit contributions
 Claire-0.072
 Natalie-0.070
oph-0.070
 Sophie-0.069
 someone-0.069
Token.
Token ID13
Feature activation-4.78e-01

Pos logit contributions
 dodging+0.028
iled+0.027
arate+0.027
ples+0.026
oing+0.026
Neg logit contributions
 someone-0.023
ily-0.023
Prin-0.020
 Claire-0.020
Fore-0.019
Token They
Token ID1119
Feature activation+2.35e-01

Pos logit contributions
ily+0.010
 someone+0.010
Prin+0.010
abby+0.009
Fore+0.009
Neg logit contributions
 dodging-0.012
iled-0.012
arate-0.011
ang-0.011
oc-0.011
Token reached
Token ID4251
Feature activation-1.06e+00

Pos logit contributions
 dodging+0.007
oing+0.006
arate+0.006
iled+0.006
flies+0.006
Neg logit contributions
ily-0.005
 someone-0.005
Prin-0.004
 Claire-0.004
Fore-0.004
Token out
Token ID503
Feature activation+8.83e-01

Pos logit contributions
Prin+0.016
ily+0.016
 someone+0.015
Fore+0.014
P+0.014
Neg logit contributions
 dodging-0.036
iled-0.033
ang-0.033
flies-0.033
arate-0.032
Token day
Token ID1110
Feature activation-6.63e-01

Pos logit contributions
ang+0.008
oc+0.008
arate+0.008
iled+0.008
 because+0.007
Neg logit contributions
Prin-0.010
Fore-0.010
ily-0.010
 someone-0.010
 Claire-0.010
Token,
Token ID11
Feature activation-1.37e-01

Pos logit contributions
ily+0.018
 someone+0.018
Fore+0.017
Prin+0.017
 Claire+0.016
Neg logit contributions
 dodging-0.013
iled-0.013
arate-0.012
oing-0.011
oc-0.011
Token Lily
Token ID20037
Feature activation-1.29e+00

Pos logit contributions
ily+0.001
 someone+0.001
Fore+0.001
Prin+0.001
 Claire+0.001
Neg logit contributions
 dodging-0.005
iled-0.005
arate-0.005
flies-0.005
ples-0.005
Token and
Token ID290
Feature activation+9.18e-02

Pos logit contributions
Fore+0.032
 someone+0.032
ily+0.032
Prin+0.029
 Claire+0.029
Neg logit contributions
iled-0.027
 dodging-0.026
oc-0.026
ky-0.025
arate-0.025
Token Mr
Token ID1770
Feature activation+1.13e+00

Pos logit contributions
 dodging+0.003
iosity+0.002
 because+0.002
tail+0.002
 tug+0.002
Neg logit contributions
ily-0.002
Fore-0.001
 someone-0.001
Prin-0.001
abby-0.001
Token.
Token ID13
Feature activation+3.86e+00

Pos logit contributions
 dodging+0.031
arate+0.029
iled+0.029
ang+0.028
iosity+0.028
Neg logit contributions
 someone-0.025
ily-0.024
 Claire-0.021
Fore-0.021
Prin-0.021
Token Sp
Token ID1338
Feature activation+1.51e+00

Pos logit contributions
ience+0.125
nered+0.118
iosity+0.117
aring+0.115
oing+0.115
Neg logit contributions
 someone-0.072
 Lily-0.071
 Ben-0.071
ily-0.067
 Sara-0.064
Tokenud
Token ID463
Feature activation-3.14e-01

Pos logit contributions
 dodging+0.035
arate+0.031
ples+0.031
tain+0.031
iled+0.030
Neg logit contributions
ily-0.041
 someone-0.039
Fore-0.036
ackers-0.035
 Claire-0.034
Token went
Token ID1816
Feature activation-9.65e-01

Pos logit contributions
ily+0.006
 someone+0.006
 Claire+0.005
Prin+0.005
ackers+0.005
Neg logit contributions
 dodging-0.009
arate-0.009
itching-0.009
 mastering-0.009
akable-0.009
Token to
Token ID284
Feature activation-8.45e-02

Pos logit contributions
 someone+0.018
ily+0.016
 out+0.016
Prin+0.014
P+0.013
Neg logit contributions
ples-0.030
istent-0.030
oc-0.029
iled-0.029
 dodging-0.029
Token the
Token ID262
Feature activation+3.65e-03

Pos logit contributions
ily+0.002
Fore+0.002
 someone+0.002
P+0.002
Prin+0.002
Neg logit contributions
 dodging-0.002
arate-0.002
 pretending-0.002
 roaring-0.002
tain-0.002
Token visit
Token ID3187
Feature activation-3.60e-02

Pos logit contributions
 dodging+0.001
arate+0.001
 tug+0.001
 pretending+0.001
tail+0.001
Neg logit contributions
ily-0.001
Fore-0.001
abby-0.001
Prin-0.001
P-0.001
Token their
Token ID511
Feature activation+6.14e-01

Pos logit contributions
Prin+0.001
ily+0.001
Fore+0.001
 Claire+0.001
ackers+0.001
Neg logit contributions
 dodging-0.001
iled-0.001
arate-0.001
ky-0.001
iosity-0.001
Token neighbor
Token ID4780
Feature activation+3.79e-01

Pos logit contributions
 dodging+0.022
iled+0.022
ples+0.021
oing+0.021
flies+0.021
Neg logit contributions
 someone-0.009
ily-0.007
Fore-0.006
 Claire-0.006
Prin-0.006
Token,
Token ID11
Feature activation+2.22e+00

Pos logit contributions
 dodging+0.009
iled+0.009
arate+0.009
oc+0.009
flies+0.009
Neg logit contributions
ily-0.009
 someone-0.008
Fore-0.008
Prin-0.008
 Claire-0.007
Token Mrs
Token ID9074
Feature activation+2.15e+00

Pos logit contributions
 dodging+0.060
iled+0.056
arate+0.054
oing+0.053
 pretending+0.053
Neg logit contributions
ily-0.044
Prin-0.042
Fore-0.042
 someone-0.040
P-0.039
Token.
Token ID13
Feature activation+3.85e+00

Pos logit contributions
 dodging+0.042
iled+0.042
flies+0.041
girls+0.039
ples+0.039
Neg logit contributions
 someone-0.058
ily-0.056
Prin-0.056
Fore-0.055
P-0.054
Token Lee
Token ID5741
Feature activation+7.43e-01

Pos logit contributions
 dodging+0.093
iled+0.089
arate+0.088
ang+0.087
ples+0.086
Neg logit contributions
 someone-0.093
ily-0.089
 Claire-0.083
Prin-0.082
Fore-0.081
Token.
Token ID13
Feature activation+1.70e+00

Pos logit contributions
 dodging+0.017
iled+0.017
arate+0.016
ples+0.016
flies+0.016
Neg logit contributions
 someone-0.018
ily-0.018
Prin-0.016
Fore-0.016
 Claire-0.016
Token Mrs
Token ID9074
Feature activation+1.68e+00

Pos logit contributions
 dodging+0.054
iled+0.052
arate+0.051
ples+0.051
ang+0.051
Neg logit contributions
 someone-0.028
ily-0.027
Fore-0.023
Prin-0.023
 Claire-0.023
Token.
Token ID13
Feature activation+3.09e+00

Pos logit contributions
 dodging+0.035
iled+0.035
flies+0.034
ples+0.032
oing+0.032
Neg logit contributions
 someone-0.043
ily-0.042
Prin-0.041
Fore-0.041
P-0.040
Token Lee
Token ID5741
Feature activation+8.17e-01

Pos logit contributions
 dodging+0.072
iled+0.069
arate+0.067
ang+0.066
oing+0.066
Neg logit contributions
 someone-0.077
ily-0.075
 Claire-0.069
Prin-0.068
Fore-0.068
Token too
Token ID1165
Feature activation-2.13e-01

Pos logit contributions
 dodging+0.016
ples+0.015
iled+0.015
itching+0.015
 roaring+0.014
Neg logit contributions
ily-0.014
Fore-0.013
Prin-0.013
 someone-0.013
P-0.012
Token late
Token ID2739
Feature activation-1.23e+00

Pos logit contributions
 someone+0.004
ily+0.004
Fore+0.004
Prin+0.004
 Claire+0.004
Neg logit contributions
 dodging-0.006
iled-0.006
arate-0.006
flies-0.006
ang-0.006
Token.
Token ID13
Feature activation+1.77e-01

Pos logit contributions
 someone+0.026
ily+0.025
 Claire+0.022
Fore+0.021
Prin+0.021
Neg logit contributions
 dodging-0.034
arate-0.033
iled-0.033
ang-0.033
oing-0.032
Token<|endoftext|>
Token ID50256
Feature activation-3.46e-01

Pos logit contributions
iled+0.005
ang+0.005
oc+0.005
oing+0.005
arate+0.005
Neg logit contributions
ily-0.003
 someone-0.003
Prin-0.003
Fore-0.003
P-0.003
Token"
Token ID1
Feature activation-3.65e-01

Pos logit contributions
 someone+0.007
Fore+0.007
abby+0.006
Prin+0.006
ily+0.006
Neg logit contributions
iled-0.009
oc-0.008
ang-0.008
arate-0.008
 dodging-0.008
TokenLook
Token ID8567
Feature activation-4.19e+00

Pos logit contributions
ily+0.008
 someone+0.007
atalie+0.007
Fore+0.007
 Heidi+0.007
Neg logit contributions
ples-0.008
 dodging-0.008
king-0.008
iled-0.008
ky-0.008
Token,
Token ID11
Feature activation-5.68e-01

Pos logit contributions
 someone+0.110
Fore+0.108
Prin+0.107
ily+0.104
atalie+0.098
Neg logit contributions
iled-0.086
tain-0.076
itching-0.076
ang-0.075
flies-0.075
Token Mom
Token ID11254
Feature activation-1.93e+00

Pos logit contributions
Fore+0.011
ily+0.011
Prin+0.010
P+0.010
 someone+0.010
Neg logit contributions
 dodging-0.014
iled-0.014
arate-0.014
 pretending-0.013
flies-0.013
Tokenmy
Token ID1820
Feature activation-1.24e+00

Pos logit contributions
 someone+0.032
ily+0.031
Fore+0.028
Prin+0.027
 Claire+0.026
Neg logit contributions
 dodging-0.060
iled-0.059
ples-0.058
flies-0.057
arate-0.057
Token!"
Token ID2474
Feature activation-1.42e+00

Pos logit contributions
 someone+0.036
ily+0.035
Fore+0.033
Prin+0.033
 Claire+0.032
Neg logit contributions
 dodging-0.024
iled-0.023
arate-0.022
ples-0.021
flies-0.021
Token exclaimed
Token ID33552
Feature activation-8.44e-01

Pos logit contributions
ily+0.027
 someone+0.026
Fore+0.026
Prin+0.025
 Claire+0.023
Neg logit contributions
iled-0.039
 dodging-0.038
arate-0.036
ang-0.036
ples-0.036
Token so
Token ID523
Feature activation+5.72e-01

Pos logit contributions
 dodging+0.014
arate+0.013
iled+0.013
oc+0.013
iosity+0.013
Neg logit contributions
 someone-0.011
ily-0.010
 Claire-0.009
Prin-0.009
Fore-0.008
Token happy
Token ID3772
Feature activation+4.54e-01

Pos logit contributions
 dodging+0.016
arate+0.015
iled+0.014
iosity+0.014
ples+0.014
Neg logit contributions
 someone-0.012
ily-0.012
 Claire-0.011
Fore-0.011
Prin-0.010
Token.
Token ID13
Feature activation+3.18e-01

Pos logit contributions
ang+0.014
 dodging+0.014
oc+0.014
itching+0.014
arate+0.014
Neg logit contributions
 someone-0.009
ily-0.008
 Claire-0.007
 Sophie-0.007
 Lily-0.007
Token<|endoftext|>
Token ID50256
Feature activation-2.61e-01

Pos logit contributions
iled+0.008
ples+0.007
flies+0.007
oing+0.007
 dodging+0.007
Neg logit contributions
 someone-0.007
ily-0.007
Prin-0.006
Fore-0.006
P-0.006
Token"
Token ID1
Feature activation-2.72e-01

Pos logit contributions
Fore+0.005
 someone+0.005
abby+0.005
 ma+0.005
ily+0.005
Neg logit contributions
iled-0.006
oc-0.006
ang-0.006
mer-0.006
arate-0.006
TokenLook
Token ID8567
Feature activation-4.17e+00

Pos logit contributions
ily+0.006
 someone+0.006
atalie+0.005
Fore+0.005
Prin+0.005
Neg logit contributions
 dodging-0.007
ples-0.007
iled-0.006
king-0.006
ky-0.006
Token!"
Token ID2474
Feature activation-9.90e-01

Pos logit contributions
 someone+0.112
Fore+0.109
Prin+0.108
ily+0.106
P+0.102
Neg logit contributions
iled-0.083
tain-0.074
iosity-0.073
ang-0.073
flies-0.073
Token Mary
Token ID5335
Feature activation-7.48e-01

Pos logit contributions
Fore+0.020
ily+0.019
Prin+0.018
 someone+0.018
ackers+0.017
Neg logit contributions
iled-0.025
 dodging-0.025
iosity-0.024
flies-0.024
ples-0.023
Token said
Token ID531
Feature activation+2.53e-01

Pos logit contributions
 someone+0.017
ily+0.017
Fore+0.015
Prin+0.015
 Claire+0.015
Neg logit contributions
 dodging-0.019
iled-0.018
arate-0.018
ples-0.017
ang-0.017
Token to
Token ID284
Feature activation-6.73e-01

Pos logit contributions
 dodging+0.006
iled+0.005
oing+0.005
ples+0.005
arate+0.005
Neg logit contributions
ily-0.006
 someone-0.006
Fore-0.006
Prin-0.006
P-0.006
Token her
Token ID607
Feature activation-3.93e-01

Pos logit contributions
Fore+0.015
Prin+0.015
P+0.014
ackers+0.014
abby+0.013
Neg logit contributions
 pretending-0.017
 mastering-0.016
 because-0.016
 dodging-0.015
 counting-0.015
Token the
Token ID262
Feature activation+3.23e-01

Pos logit contributions
 dodging+0.014
iled+0.013
 roaring+0.013
 mastering+0.013
arate+0.013
Neg logit contributions
ily-0.010
Prin-0.010
Fore-0.009
 someone-0.009
P-0.009
Token medicine
Token ID9007
Feature activation+3.20e-01

Pos logit contributions
 dodging+0.010
iled+0.009
ples+0.009
arate+0.009
flies+0.009
Neg logit contributions
 someone-0.006
ily-0.006
Prin-0.005
Fore-0.005
 Claire-0.005
Token.
Token ID13
Feature activation+2.41e-01

Pos logit contributions
 dodging+0.008
iled+0.008
arate+0.008
ang+0.008
ples+0.008
Neg logit contributions
 someone-0.007
ily-0.007
 Claire-0.006
Prin-0.006
Fore-0.006
Token<|endoftext|>
Token ID50256
Feature activation-3.95e-01

Pos logit contributions
iled+0.007
oc+0.006
arate+0.006
 dodging+0.006
oing+0.006
Neg logit contributions
ily-0.005
Prin-0.004
 someone-0.004
P-0.004
Fore-0.004
Token"
Token ID1
Feature activation-5.19e-01

Pos logit contributions
 someone+0.007
Fore+0.007
ily+0.007
 ma+0.007
Prin+0.007
Neg logit contributions
iled-0.010
oc-0.010
ang-0.009
arate-0.009
mer-0.009
TokenLook
Token ID8567
Feature activation-4.14e+00

Pos logit contributions
ily+0.012
 someone+0.011
atalie+0.011
Fore+0.011
Prin+0.010
Neg logit contributions
 dodging-0.012
iled-0.012
ples-0.012
ang-0.011
king-0.011
Token up
Token ID510
Feature activation-2.78e-01

Pos logit contributions
 someone+0.113
Prin+0.112
Fore+0.110
ily+0.108
 Claire+0.103
Neg logit contributions
iled-0.080
oc-0.071
tain-0.071
ang-0.070
iosity-0.069
Token there
Token ID612
Feature activation-4.54e-01

Pos logit contributions
 someone+0.006
ily+0.006
Fore+0.005
Prin+0.005
 Claire+0.005
Neg logit contributions
 dodging-0.007
iled-0.007
arate-0.007
ples-0.007
oing-0.007
Token,
Token ID11
Feature activation+1.65e-01

Pos logit contributions
ily+0.012
Fore+0.011
Prin+0.011
 someone+0.011
P+0.011
Neg logit contributions
iled-0.009
flies-0.009
 dodging-0.009
oc-0.009
ples-0.008
Token Mama
Token ID40084
Feature activation-7.36e-01

Pos logit contributions
 dodging+0.004
iled+0.004
flies+0.004
 because+0.004
arate+0.004
Neg logit contributions
Fore-0.003
ily-0.003
Prin-0.003
P-0.003
 someone-0.003
Token!"
Token ID2474
Feature activation-1.01e+00

Pos logit contributions
 someone+0.016
ily+0.016
Fore+0.015
Prin+0.014
P+0.013
Neg logit contributions
iled-0.019
 dodging-0.019
flies-0.018
ples-0.018
oc-0.018
Token lively
Token ID29696
Feature activation+1.42e+00

Pos logit contributions
 dodging+0.030
ang+0.029
oing+0.029
iled+0.029
flies+0.028
Neg logit contributions
 someone-0.017
 time-0.014
 Claire-0.014
 other-0.013
 rest-0.013
Token adventure
Token ID8855
Feature activation+1.86e-01

Pos logit contributions
 dodging+0.041
iled+0.040
ang+0.039
flies+0.039
arate+0.038
Neg logit contributions
 someone-0.029
ily-0.025
 Claire-0.025
 time-0.023
Fore-0.023
Token!
Token ID0
Feature activation+3.14e-02

Pos logit contributions
 dodging+0.005
oing+0.005
itching+0.005
arate+0.005
iled+0.004
Neg logit contributions
 someone-0.004
ily-0.004
 Claire-0.004
 Sophie-0.003
Fore-0.003
Token<|endoftext|>
Token ID50256
Feature activation-2.60e-01

Pos logit contributions
iled+0.001
arate+0.001
 dodging+0.001
ang+0.001
oc+0.001
Neg logit contributions
ily-0.000
Prin-0.000
 someone-0.000
P-0.000
Fore-0.000
Token"
Token ID1
Feature activation-2.66e-01

Pos logit contributions
 someone+0.005
Fore+0.005
ily+0.005
 ma+0.005
abby+0.005
Neg logit contributions
iled-0.006
oc-0.006
ang-0.006
arate-0.006
 dodging-0.006
TokenLook
Token ID8567
Feature activation-4.14e+00

Pos logit contributions
ily+0.006
 someone+0.005
atalie+0.005
Fore+0.005
Prin+0.005
Neg logit contributions
 dodging-0.006
iled-0.006
ples-0.006
king-0.006
ky-0.006
Token,
Token ID11
Feature activation-7.97e-02

Pos logit contributions
 someone+0.110
Prin+0.107
Fore+0.107
ily+0.104
P+0.099
Neg logit contributions
iled-0.084
iosity-0.074
tain-0.074
flies-0.074
ples-0.074
Token Mom
Token ID11254
Feature activation-1.53e+00

Pos logit contributions
Fore+0.002
ily+0.002
Prin+0.001
P+0.001
 someone+0.001
Neg logit contributions
 dodging-0.002
iled-0.002
flies-0.002
arate-0.002
 pretending-0.002
Token!
Token ID0
Feature activation+9.35e-01

Pos logit contributions
 someone+0.026
ily+0.026
Fore+0.023
Prin+0.023
 Claire+0.021
Neg logit contributions
 dodging-0.046
iled-0.046
ples-0.044
flies-0.044
arate-0.044
Token It
Token ID632
Feature activation-3.37e-01

Pos logit contributions
 dodging+0.023
iled+0.023
oing+0.022
ples+0.022
flies+0.022
Neg logit contributions
 someone-0.021
ily-0.020
Fore-0.019
Prin-0.018
P-0.017
Token's
Token ID338
Feature activation-3.65e-01

Pos logit contributions
ily+0.006
 someone+0.006
Prin+0.006
Fore+0.006
 Claire+0.005
Neg logit contributions
 dodging-0.009
iled-0.009
flies-0.009
oc-0.009
ples-0.009
Token voice
Token ID3809
Feature activation+5.73e-01

Pos logit contributions
 dodging+0.024
arate+0.023
years+0.023
iled+0.023
itching+0.023
Neg logit contributions
 someone-0.020
ily-0.019
 Claire-0.016
Prin-0.016
 ma-0.015
Token shouted
Token ID17293
Feature activation-4.61e-01

Pos logit contributions
 dodging+0.016
iled+0.015
arate+0.015
iosity+0.015
itching+0.015
Neg logit contributions
 someone-0.012
ily-0.012
Prin-0.010
Fore-0.010
 Claire-0.010
Token out
Token ID503
Feature activation+1.10e+00

Pos logit contributions
 someone+0.008
ily+0.008
Fore+0.007
Prin+0.007
 Claire+0.006
Neg logit contributions
 dodging-0.014
iled-0.014
ples-0.013
oing-0.013
arate-0.013
Token.
Token ID13
Feature activation-7.38e-01

Pos logit contributions
arate+0.037
ience+0.036
ang+0.035
nered+0.035
 dodging+0.035
Neg logit contributions
 someone-0.020
 out-0.017
ily-0.016
 Lily-0.014
 her-0.014
Token "
Token ID366
Feature activation-4.02e-01

Pos logit contributions
ackers+0.021
ily+0.021
P+0.020
Fore+0.020
abby+0.018
Neg logit contributions
iled-0.013
 flashing-0.012
oing-0.012
 dodging-0.012
ples-0.012
TokenLook
Token ID8567
Feature activation-4.10e+00

Pos logit contributions
 someone+0.008
ily+0.008
Fore+0.007
Prin+0.007
 Claire+0.007
Neg logit contributions
 dodging-0.011
iled-0.011
arate-0.011
ples-0.010
ang-0.010
Token!
Token ID0
Feature activation+2.21e-01

Pos logit contributions
 someone+0.095
ily+0.094
Fore+0.084
Prin+0.084
 Claire+0.083
Neg logit contributions
 dodging-0.102
iled-0.098
arate-0.095
ples-0.094
ang-0.093
Token It
Token ID632
Feature activation+3.75e-01

Pos logit contributions
 dodging+0.006
iled+0.005
arate+0.005
flies+0.005
oing+0.005
Neg logit contributions
ily-0.005
 someone-0.005
Fore-0.005
Prin-0.004
 Claire-0.004
Token is
Token ID318
Feature activation+5.97e-01

Pos logit contributions
 dodging+0.012
iled+0.012
arate+0.011
ples+0.011
ang+0.011
Neg logit contributions
 someone-0.006
ily-0.006
Fore-0.005
Prin-0.005
 Claire-0.005
Token a
Token ID257
Feature activation-8.56e-01

Pos logit contributions
 dodging+0.017
iled+0.017
flies+0.016
ples+0.016
arate+0.016
Neg logit contributions
ily-0.011
 someone-0.011
Prin-0.010
Fore-0.010
 Claire-0.010
Token race
Token ID3234
Feature activation+1.70e-01

Pos logit contributions
 someone+0.020
ily+0.018
Fore+0.017
Prin+0.017
 Claire+0.017
Neg logit contributions
iled-0.022
 dodging-0.022
ples-0.021
arate-0.021
iosity-0.021
Token
Token ID198
Feature activation+1.32e-01

Pos logit contributions
ang+0.005
iled+0.005
arate+0.004
oc+0.004
 dodging+0.004
Neg logit contributions
 someone-0.002
ily-0.002
 Claire-0.002
atalie-0.002
 out-0.002
Tokenâ
Token ID22940
Feature activation-8.18e-01

Pos logit contributions
 dodging+0.004
iled+0.004
arate+0.004
ang+0.004
oing+0.004
Neg logit contributions
 someone-0.002
ily-0.002
 Claire-0.002
Fore-0.002
Prin-0.002
Token
Token ID26391
Feature activation-4.20e-01

Pos logit contributions
 someone+0.012
ily+0.012
Prin+0.010
Fore+0.010
 Claire+0.010
Neg logit contributions
 dodging-0.027
iled-0.026
arate-0.026
ples-0.025
flies-0.025
Token
Token ID129
Feature activation-4.31e-01

Pos logit contributions
ily+0.008
 someone+0.008
Fore+0.007
Prin+0.007
 Claire+0.007
Neg logit contributions
 dodging-0.012
iled-0.011
arate-0.011
oing-0.011
ples-0.011
Token
Token ID241
Feature activation-2.30e-01

Pos logit contributions
 someone+0.011
ily+0.011
Prin+0.010
Fore+0.010
 Claire+0.010
Neg logit contributions
 dodging-0.009
flies-0.009
iled-0.009
oing-0.009
arate-0.009
TokenLook
Token ID8567
Feature activation-4.10e+00

Pos logit contributions
 someone+0.004
ily+0.004
Fore+0.003
Prin+0.003
 Claire+0.003
Neg logit contributions
 dodging-0.007
iled-0.007
arate-0.007
ples-0.007
flies-0.007
Token into
Token ID656
Feature activation+8.59e-02

Pos logit contributions
 someone+0.105
ily+0.104
Fore+0.097
Prin+0.096
 Claire+0.094
Neg logit contributions
 dodging-0.090
iled-0.087
arate-0.084
ang-0.083
ples-0.082
Token it
Token ID340
Feature activation+6.29e-02

Pos logit contributions
 dodging+0.003
iled+0.003
oing+0.003
arate+0.003
ples+0.003
Neg logit contributions
 someone-0.001
ily-0.001
 Claire-0.001
Prin-0.001
Fore-0.001
Token,
Token ID11
Feature activation-2.69e-01

Pos logit contributions
 dodging+0.002
iled+0.001
flies+0.001
arate+0.001
ples+0.001
Neg logit contributions
 someone-0.001
ily-0.001
Fore-0.001
Prin-0.001
 Claire-0.001
Tokenâ
Token ID22940
Feature activation-1.00e+00

Pos logit contributions
ily+0.005
Fore+0.005
 someone+0.005
Prin+0.005
P+0.004
Neg logit contributions
 dodging-0.007
iled-0.007
arate-0.007
ang-0.007
flies-0.007
Token
Token ID26391
Feature activation-9.55e-01

Pos logit contributions
 someone+0.017
ily+0.015
Fore+0.014
Prin+0.014
 Claire+0.013
Neg logit contributions
 dodging-0.031
iled-0.031
arate-0.031
ang-0.030
iosity-0.030
Token quiet
Token ID5897
Feature activation+2.28e-01

Pos logit contributions
 dodging+0.012
iled+0.012
arate+0.011
ang+0.011
ples+0.011
Neg logit contributions
 someone-0.011
ily-0.011
Fore-0.010
Prin-0.010
 Claire-0.010
Token anymore
Token ID7471
Feature activation-1.13e+00

Pos logit contributions
 dodging+0.006
arate+0.006
iled+0.006
ang+0.006
itching+0.006
Neg logit contributions
 someone-0.005
ily-0.005
 Claire-0.004
Prin-0.004
 Sophie-0.004
Token.
Token ID13
Feature activation+1.19e-01

Pos logit contributions
 someone+0.019
ily+0.016
 Claire+0.013
 Lily+0.013
 her+0.012
Neg logit contributions
arate-0.040
 dodging-0.039
akable-0.039
ang-0.038
istent-0.038
Token<|endoftext|>
Token ID50256
Feature activation-1.12e-01

Pos logit contributions
ang+0.004
iled+0.004
oc+0.003
arate+0.003
 dodging+0.003
Neg logit contributions
 someone-0.002
ily-0.002
Prin-0.002
ackers-0.002
 Claire-0.002
Token"
Token ID1
Feature activation-4.37e-01

Pos logit contributions
oph+0.002
abby+0.002
Fore+0.002
 someone+0.002
 Sandra+0.002
Neg logit contributions
iled-0.002
ang-0.002
oc-0.002
arate-0.002
mer-0.002
TokenLook
Token ID8567
Feature activation-4.09e+00

Pos logit contributions
ily+0.009
 someone+0.009
atalie+0.009
Fore+0.009
 Heidi+0.008
Neg logit contributions
 dodging-0.010
ples-0.010
iled-0.010
king-0.010
ky-0.010
Token Mom
Token ID11254
Feature activation-5.78e-01

Pos logit contributions
 someone+0.116
Prin+0.114
Fore+0.113
ily+0.111
 Claire+0.107
Neg logit contributions
iled-0.074
tain-0.065
ang-0.065
itching-0.063
flies-0.063
Tokenmy
Token ID1820
Feature activation-9.69e-02

Pos logit contributions
 someone+0.011
ily+0.010
Fore+0.010
Prin+0.009
 Claire+0.009
Neg logit contributions
 dodging-0.017
iled-0.017
ples-0.016
flies-0.016
ang-0.016
Token,
Token ID11
Feature activation+1.57e-01

Pos logit contributions
 someone+0.003
ily+0.003
Fore+0.003
Prin+0.003
 Claire+0.002
Neg logit contributions
 dodging-0.002
iled-0.002
ples-0.002
flies-0.002
arate-0.002
Token what
Token ID644
Feature activation+2.07e-02

Pos logit contributions
 pretending+0.004
 dodging+0.004
iled+0.004
 because+0.004
arate+0.004
Neg logit contributions
Fore-0.003
Prin-0.003
ily-0.003
ackers-0.003
P-0.003
Token can
Token ID460
Feature activation-1.82e+00

Pos logit contributions
iled+0.001
arate+0.001
flies+0.001
oc+0.001
ples+0.001
Neg logit contributions
ily-0.000
 someone-0.000
Prin-0.000
atalie-0.000
ackers-0.000
Token
Token ID6184
Feature activation-1.10e-01

Pos logit contributions
 someone+0.004
ily+0.004
Fore+0.003
Prin+0.003
 Claire+0.003
Neg logit contributions
 dodging-0.004
iled-0.004
arate-0.004
ples-0.004
ang-0.004
Token
Token ID95
Feature activation+1.24e-01

Pos logit contributions
ily+0.003
 someone+0.003
Prin+0.003
 Claire+0.003
Fore+0.003
Neg logit contributions
 dodging-0.002
flies-0.002
iled-0.002
oing-0.002
ples-0.002
Token
Token ID26391
Feature activation-7.14e-01

Pos logit contributions
 dodging+0.003
iled+0.003
flies+0.003
oing+0.003
ang+0.003
Neg logit contributions
ily-0.003
 someone-0.003
Prin-0.003
Fore-0.003
 Claire-0.003
Token
Token ID129
Feature activation-6.38e-01

Pos logit contributions
 someone+0.017
ily+0.017
Prin+0.015
Fore+0.015
 Claire+0.015
Neg logit contributions
 dodging-0.017
iled-0.017
arate-0.016
oing-0.016
ang-0.016
Token
Token ID241
Feature activation-2.81e-01

Pos logit contributions
ily+0.016
 someone+0.015
Prin+0.014
Fore+0.014
 time+0.013
Neg logit contributions
flies-0.015
 dodging-0.014
oing-0.014
fly-0.014
iled-0.013
TokenLook
Token ID8567
Feature activation-4.05e+00

Pos logit contributions
 someone+0.005
ily+0.005
Fore+0.004
Prin+0.004
 Claire+0.004
Neg logit contributions
 dodging-0.008
iled-0.008
flies-0.008
arate-0.008
ples-0.008
Token at
Token ID379
Feature activation+1.62e-01

Pos logit contributions
 someone+0.107
ily+0.105
Fore+0.095
Prin+0.095
 Claire+0.095
Neg logit contributions
 dodging-0.089
iled-0.083
arate-0.081
ples-0.081
oing-0.080
Token your
Token ID534
Feature activation-1.22e+00

Pos logit contributions
 dodging+0.005
iled+0.004
arate+0.004
ples+0.004
ang+0.004
Neg logit contributions
 someone-0.003
ily-0.003
Fore-0.003
Prin-0.003
 Claire-0.003
Token beautiful
Token ID4950
Feature activation+3.93e-01

Pos logit contributions
 someone+0.026
ily+0.025
 Claire+0.022
Prin+0.022
Fore+0.022
Neg logit contributions
 dodging-0.033
iled-0.032
arate-0.031
ang-0.031
ples-0.031
Token red
Token ID2266
Feature activation-7.67e-01

Pos logit contributions
 dodging+0.010
iled+0.010
arate+0.010
ples+0.010
flies+0.010
Neg logit contributions
 someone-0.008
ily-0.008
Fore-0.007
Prin-0.007
 Claire-0.007
Token pet
Token ID4273
Feature activation-1.67e-01

Pos logit contributions
 someone+0.017
ily+0.017
Fore+0.015
Prin+0.015
 Claire+0.015
Neg logit contributions
 dodging-0.020
iled-0.019
flies-0.019
arate-0.019
ples-0.018
Token away
Token ID1497
Feature activation-6.69e-01

Pos logit contributions
 dodging+0.020
ples+0.019
arate+0.019
iled+0.019
ang+0.018
Neg logit contributions
 someone-0.015
ily-0.014
Prin-0.013
Fore-0.012
 Claire-0.012
Token.
Token ID13
Feature activation-2.95e-01

Pos logit contributions
 someone+0.015
ily+0.014
Prin+0.013
Fore+0.012
 Claire+0.012
Neg logit contributions
 dodging-0.018
arate-0.017
iled-0.017
oing-0.017
ples-0.017
Token
Token ID198
Feature activation-4.13e-01

Pos logit contributions
ily+0.005
 someone+0.005
Fore+0.005
Prin+0.005
abby+0.004
Neg logit contributions
 dodging-0.008
iled-0.008
arate-0.008
oing-0.008
ples-0.008
Token
Token ID198
Feature activation-7.91e-01

Pos logit contributions
 Claire+0.006
Prin+0.006
 someone+0.006
ily+0.006
oph+0.006
Neg logit contributions
iled-0.012
 dodging-0.012
oc-0.012
oing-0.012
ang-0.011
Token"
Token ID1
Feature activation+3.54e-01

Pos logit contributions
 someone+0.018
ily+0.017
 Claire+0.017
Prin+0.015
atalie+0.015
Neg logit contributions
iled-0.020
 dodging-0.019
arate-0.019
ang-0.019
oc-0.018
TokenLook
Token ID8567
Feature activation-4.04e+00

Pos logit contributions
 dodging+0.010
iled+0.009
arate+0.009
ples+0.009
oing+0.009
Neg logit contributions
 someone-0.007
ily-0.007
 Claire-0.006
Prin-0.006
Fore-0.006
Token,
Token ID11
Feature activation-1.03e+00

Pos logit contributions
 someone+0.110
ily+0.109
Prin+0.109
Fore+0.107
 Claire+0.101
Neg logit contributions
iled-0.078
 dodging-0.074
arate-0.071
iosity-0.070
ang-0.069
Token Ben
Token ID3932
Feature activation-6.88e-01

Pos logit contributions
Prin+0.017
Fore+0.016
ily+0.015
atalie+0.015
P+0.015
Neg logit contributions
 dodging-0.030
 pretending-0.029
arate-0.029
flies-0.028
iled-0.028
Token and
Token ID290
Feature activation+3.03e-01

Pos logit contributions
 someone+0.021
Fore+0.021
Prin+0.021
ily+0.020
 Claire+0.020
Neg logit contributions
iled-0.012
oc-0.011
 dodging-0.010
flies-0.010
oing-0.010
Token Lily
Token ID20037
Feature activation-4.77e-01

Pos logit contributions
iosity+0.008
 dodging+0.008
 tug+0.008
 because+0.007
ience+0.007
Neg logit contributions
Fore-0.006
Prin-0.006
P-0.005
ate-0.005
abby-0.005
Token."
Token ID526
Feature activation-8.25e-01

Pos logit contributions
 someone+0.012
ily+0.012
Fore+0.012
Prin+0.012
 Claire+0.011
Neg logit contributions
iled-0.010
 dodging-0.010
arate-0.009
flies-0.009
iosity-0.009
Token shouted
Token ID17293
Feature activation-7.72e-01

Pos logit contributions
 someone+0.021
ily+0.020
Prin+0.018
Fore+0.018
 Claire+0.018
Neg logit contributions
 dodging-0.025
iled-0.023
arate-0.023
ples-0.023
oing-0.023
Token to
Token ID284
Feature activation-9.08e-01

Pos logit contributions
 someone+0.015
ily+0.015
 Claire+0.013
Prin+0.012
Fore+0.012
Neg logit contributions
 dodging-0.022
ang-0.021
arate-0.021
iled-0.021
ples-0.021
Token Mum
Token ID38367
Feature activation-1.29e-02

Pos logit contributions
Prin+0.024
ily+0.022
Fore+0.022
P+0.022
ackers+0.021
Neg logit contributions
 dodging-0.020
 roaring-0.018
iosity-0.018
 pretending-0.018
 tug-0.018
Token,
Token ID11
Feature activation+4.61e-01

Pos logit contributions
 someone+0.000
ily+0.000
 Claire+0.000
Fore+0.000
Prin+0.000
Neg logit contributions
 dodging-0.000
iled-0.000
arate-0.000
ples-0.000
ang-0.000
Token "
Token ID366
Feature activation-5.37e-01

Pos logit contributions
 dodging+0.014
iled+0.013
arate+0.013
ples+0.013
flies+0.013
Neg logit contributions
 someone-0.009
ily-0.008
 Claire-0.007
Fore-0.007
Prin-0.007
TokenLook
Token ID8567
Feature activation-4.02e+00

Pos logit contributions
 someone+0.010
ily+0.010
atalie+0.009
 Claire+0.009
Prin+0.009
Neg logit contributions
iled-0.015
 dodging-0.015
oc-0.015
ples-0.014
ky-0.014
Token!
Token ID0
Feature activation+4.89e-02

Pos logit contributions
 someone+0.103
ily+0.101
Prin+0.098
Fore+0.097
 Claire+0.091
Neg logit contributions
iled-0.086
 dodging-0.085
arate-0.081
ang-0.081
flies-0.079
Token There
Token ID1318
Feature activation-8.73e-01

Pos logit contributions
 dodging+0.001
iled+0.001
arate+0.001
flies+0.001
oing+0.001
Neg logit contributions
ily-0.001
 someone-0.001
Fore-0.001
Prin-0.001
 Claire-0.001
Token's
Token ID338
Feature activation-6.48e-01

Pos logit contributions
ily+0.016
 someone+0.016
Prin+0.015
Fore+0.014
 Claire+0.014
Neg logit contributions
 dodging-0.025
iled-0.024
flies-0.024
arate-0.024
oc-0.023
Token a
Token ID257
Feature activation-7.51e-01

Pos logit contributions
ily+0.012
Prin+0.012
ackers+0.012
abby+0.012
atalie+0.011
Neg logit contributions
 dodging-0.018
flies-0.017
 roaring-0.017
 scree-0.017
tain-0.016
Token dove
Token ID38862
Feature activation-2.18e-02

Pos logit contributions
ily+0.017
 someone+0.017
Prin+0.015
 Claire+0.015
Fore+0.015
Neg logit contributions
 dodging-0.019
iled-0.018
ang-0.018
arate-0.017
oing-0.017
Token
Token ID6184
Feature activation-2.43e-01

Pos logit contributions
 someone+0.007
ily+0.007
Fore+0.006
Prin+0.006
 Claire+0.006
Neg logit contributions
 dodging-0.011
iled-0.010
arate-0.010
ples-0.010
ang-0.010
Token
Token ID95
Feature activation+7.39e-02

Pos logit contributions
ily+0.006
 someone+0.006
P+0.005
 Claire+0.005
 Sophie+0.005
Neg logit contributions
flies-0.006
 dodging-0.006
fly-0.006
 roaring-0.006
sing-0.006
Token
Token ID26391
Feature activation-7.74e-01

Pos logit contributions
 dodging+0.002
flies+0.002
oing+0.002
iled+0.002
itching+0.002
Neg logit contributions
ily-0.002
 someone-0.002
Prin-0.001
P-0.001
Fore-0.001
Token
Token ID129
Feature activation-5.02e-01

Pos logit contributions
 someone+0.018
ily+0.017
Fore+0.015
Prin+0.015
 Claire+0.015
Neg logit contributions
 dodging-0.020
flies-0.018
iled-0.018
iosity-0.018
ples-0.018
Token
Token ID241
Feature activation-4.04e-01

Pos logit contributions
ily+0.012
 someone+0.012
 time+0.010
Prin+0.010
 Claire+0.010
Neg logit contributions
flies-0.013
fly-0.012
 dodging-0.011
oing-0.011
arate-0.011
TokenLook
Token ID8567
Feature activation-4.02e+00

Pos logit contributions
 someone+0.007
ily+0.007
Fore+0.006
Prin+0.006
 Claire+0.006
Neg logit contributions
 dodging-0.012
iled-0.012
flies-0.012
arate-0.011
ples-0.011
Token,
Token ID11
Feature activation-1.13e+00

Pos logit contributions
 someone+0.103
ily+0.101
Fore+0.090
 Claire+0.090
Prin+0.090
Neg logit contributions
 dodging-0.094
iled-0.086
arate-0.085
oing-0.084
ples-0.084
Token these
Token ID777
Feature activation-1.17e+00

Pos logit contributions
 someone+0.019
ily+0.019
Fore+0.016
Prin+0.016
 Claire+0.016
Neg logit contributions
 dodging-0.035
iled-0.034
arate-0.033
ples-0.033
ang-0.032
Token are
Token ID389
Feature activation-9.27e-01

Pos logit contributions
 someone+0.025
ily+0.025
Fore+0.022
Prin+0.022
 Claire+0.022
Neg logit contributions
 dodging-0.031
iled-0.030
arate-0.029
ples-0.029
ang-0.028
Token for
Token ID329
Feature activation-4.62e-01

Pos logit contributions
 someone+0.019
ily+0.018
Fore+0.016
 Claire+0.016
Prin+0.016
Neg logit contributions
 dodging-0.026
iled-0.025
oing-0.024
arate-0.024
iosity-0.024
Token me
Token ID502
Feature activation-2.67e+00

Pos logit contributions
 someone+0.008
ily+0.008
Fore+0.007
Prin+0.007
 Claire+0.006
Neg logit contributions
 dodging-0.014
iled-0.014
arate-0.014
oing-0.014
ples-0.014
Token
Token ID6184
Feature activation-8.67e-01

Pos logit contributions
 dodging+0.003
iled+0.003
arate+0.003
ples+0.003
flies+0.003
Neg logit contributions
 someone-0.002
ily-0.002
Fore-0.002
Prin-0.002
 Claire-0.002
Token
Token ID95
Feature activation-4.07e-01

Pos logit contributions
 someone+0.020
ily+0.020
Prin+0.018
 Claire+0.017
Fore+0.017
Neg logit contributions
 dodging-0.022
iled-0.021
ples-0.021
oing-0.021
flies-0.020
Token
Token ID26391
Feature activation-7.71e-01

Pos logit contributions
 someone+0.009
ily+0.009
Fore+0.008
Prin+0.008
 Claire+0.008
Neg logit contributions
 dodging-0.010
iled-0.010
arate-0.010
ang-0.009
ples-0.009
Token
Token ID129
Feature activation-8.56e-01

Pos logit contributions
 someone+0.011
ily+0.011
Fore+0.009
Prin+0.009
 Claire+0.009
Neg logit contributions
iled-0.026
ang-0.026
 dodging-0.025
arate-0.025
oing-0.025
Token
Token ID241
Feature activation-8.27e-02

Pos logit contributions
 someone+0.019
ily+0.019
Prin+0.017
Fore+0.017
 Claire+0.017
Neg logit contributions
 dodging-0.022
iled-0.021
arate-0.020
flies-0.020
ples-0.020
TokenLook
Token ID8567
Feature activation-4.01e+00

Pos logit contributions
 someone+0.001
ily+0.001
Prin+0.001
 Claire+0.001
Fore+0.001
Neg logit contributions
 dodging-0.003
iled-0.002
ang-0.002
arate-0.002
oing-0.002
Token,
Token ID11
Feature activation-2.50e-01

Pos logit contributions
 someone+0.106
ily+0.106
Fore+0.105
Prin+0.104
 Claire+0.099
Neg logit contributions
iled-0.079
 dodging-0.076
ang-0.074
arate-0.074
flies-0.071
Token Sam
Token ID3409
Feature activation-2.13e-01

Pos logit contributions
Fore+0.004
ily+0.004
Prin+0.004
P+0.004
 someone+0.004
Neg logit contributions
 dodging-0.007
iled-0.007
arate-0.007
flies-0.006
ang-0.006
Token!
Token ID0
Feature activation+1.57e-02

Pos logit contributions
 someone+0.004
ily+0.004
Fore+0.004
Prin+0.004
 Claire+0.004
Neg logit contributions
 dodging-0.006
iled-0.006
arate-0.005
ples-0.005
flies-0.005
Token Look
Token ID6803
Feature activation-3.32e+00

Pos logit contributions
 dodging+0.000
iled+0.000
arate+0.000
oc+0.000
ang+0.000
Neg logit contributions
ily-0.000
 someone-0.000
Fore-0.000
Prin-0.000
atalie-0.000
Token how
Token ID703
Feature activation+6.79e-01

Pos logit contributions
 someone+0.083
ily+0.081
Prin+0.073
Fore+0.073
 Claire+0.073
Neg logit contributions
 dodging-0.079
iled-0.074
arate-0.072
ples-0.071
oing-0.071
Token
Token ID198
Feature activation-7.71e-01

Pos logit contributions
ily+0.038
Fore+0.036
Prin+0.036
 someone+0.035
ackers+0.033
Neg logit contributions
 dodging-0.039
iled-0.039
arate-0.038
ang-0.036
iosity-0.036
TokenDaddy
Token ID48280
Feature activation-1.05e+00

Pos logit contributions
ily+0.016
 Claire+0.016
 Clover+0.015
Prin+0.015
 Heidi+0.015
Neg logit contributions
iled-0.018
oing-0.017
oc-0.017
ang-0.016
akable-0.016
Token said
Token ID531
Feature activation-5.45e-01

Pos logit contributions
 someone+0.021
ily+0.021
Prin+0.018
Fore+0.018
 Claire+0.018
Neg logit contributions
 dodging-0.029
iled-0.028
arate-0.027
ples-0.027
oing-0.027
Token,
Token ID11
Feature activation-1.96e-01

Pos logit contributions
 someone+0.013
ily+0.013
Fore+0.013
Prin+0.013
 Claire+0.012
Neg logit contributions
 dodging-0.013
iled-0.012
arate-0.011
oing-0.011
ples-0.011
Token "
Token ID366
Feature activation-1.79e-01

Pos logit contributions
ily+0.004
 someone+0.004
Fore+0.004
Prin+0.004
P+0.004
Neg logit contributions
 dodging-0.005
iled-0.005
ples-0.005
arate-0.005
oing-0.005
TokenLook
Token ID8567
Feature activation-3.99e+00

Pos logit contributions
 someone+0.003
ily+0.003
 Claire+0.003
Prin+0.002
Fore+0.002
Neg logit contributions
 dodging-0.006
iled-0.005
arate-0.005
oing-0.005
ang-0.005
Token under
Token ID739
Feature activation-5.13e-03

Pos logit contributions
 someone+0.091
ily+0.090
Fore+0.081
Prin+0.081
 Claire+0.080
Neg logit contributions
 dodging-0.101
iled-0.096
arate-0.093
ples-0.092
ang-0.092
Token the
Token ID262
Feature activation-8.16e-01

Pos logit contributions
 someone+0.000
ily+0.000
 Claire+0.000
Fore+0.000
Prin+0.000
Neg logit contributions
iled-0.000
 dodging-0.000
oing-0.000
ang-0.000
ples-0.000
Token bed
Token ID3996
Feature activation-3.84e-01

Pos logit contributions
ily+0.016
 someone+0.016
Prin+0.014
Fore+0.014
 Claire+0.014
Neg logit contributions
 dodging-0.023
iled-0.022
ples-0.021
arate-0.021
flies-0.021
Token.
Token ID13
Feature activation-8.35e-02

Pos logit contributions
ily+0.009
 someone+0.009
Fore+0.009
Prin+0.009
 Claire+0.008
Neg logit contributions
 dodging-0.009
iled-0.008
flies-0.008
ples-0.008
arate-0.008
Token That
Token ID1320
Feature activation+6.84e-01

Pos logit contributions
ily+0.002
 someone+0.002
Fore+0.002
Prin+0.002
P+0.002
Neg logit contributions
 dodging-0.002
iled-0.002
arate-0.002
ples-0.002
flies-0.002
Token Mom
Token ID11254
Feature activation-7.01e-01

Pos logit contributions
ily+0.011
P+0.011
 someone+0.011
Fore+0.010
Prin+0.010
Neg logit contributions
 dodging-0.017
ples-0.016
 mastering-0.016
iled-0.016
iosity-0.016
Token and
Token ID290
Feature activation-5.32e-01

Pos logit contributions
 someone+0.014
ily+0.014
 Claire+0.012
Prin+0.011
 out+0.011
Neg logit contributions
 dodging-0.020
arate-0.019
iled-0.019
ang-0.019
iosity-0.019
Token said
Token ID531
Feature activation-8.00e-01

Pos logit contributions
 someone+0.010
ily+0.010
Fore+0.010
Prin+0.010
P+0.009
Neg logit contributions
 dodging-0.014
iled-0.014
arate-0.013
flies-0.013
ples-0.013
Token,
Token ID11
Feature activation-1.20e-01

Pos logit contributions
 someone+0.018
ily+0.018
Fore+0.017
Prin+0.017
P+0.016
Neg logit contributions
 dodging-0.019
iled-0.019
arate-0.018
iosity-0.017
ples-0.017
Token "
Token ID366
Feature activation+4.77e-02

Pos logit contributions
 someone+0.002
ily+0.002
 Claire+0.001
Prin+0.001
Fore+0.001
Neg logit contributions
arate-0.004
 dodging-0.004
ang-0.004
iled-0.004
itching-0.004
TokenLook
Token ID8567
Feature activation-3.98e+00

Pos logit contributions
ples+0.001
iled+0.001
 dodging+0.001
oing+0.001
arate+0.001
Neg logit contributions
 someone-0.001
ily-0.001
 Claire-0.001
Fore-0.001
atalie-0.001
Token,
Token ID11
Feature activation-5.87e-01

Pos logit contributions
 someone+0.097
ily+0.096
Prin+0.088
Fore+0.088
 Claire+0.086
Neg logit contributions
 dodging-0.093
iled-0.090
arate-0.087
ples-0.086
ang-0.085
Token Mom
Token ID11254
Feature activation-1.25e+00

Pos logit contributions
 someone+0.008
ily+0.008
Prin+0.006
Fore+0.006
 Claire+0.006
Neg logit contributions
 dodging-0.020
iled-0.020
arate-0.019
ples-0.019
flies-0.019
Token,
Token ID11
Feature activation-3.31e-01

Pos logit contributions
 someone+0.029
ily+0.027
 Claire+0.023
 Sophie+0.023
 Lily+0.023
Neg logit contributions
 dodging-0.034
itching-0.031
iosity-0.031
arate-0.030
ang-0.030
Token it
Token ID340
Feature activation-1.18e-01

Pos logit contributions
 someone+0.006
ily+0.005
 Claire+0.004
Fore+0.004
Prin+0.004
Neg logit contributions
 dodging-0.011
iled-0.011
ang-0.010
itching-0.010
oing-0.010
Token's
Token ID338
Feature activation-8.78e-02

Pos logit contributions
 someone+0.001
ily+0.001
 Claire+0.001
Prin+0.001
Fore+0.001
Neg logit contributions
 dodging-0.004
iosity-0.004
ang-0.004
arate-0.004
years-0.004
Token for
Token ID329
Feature activation+2.53e-01

Pos logit contributions
 someone+0.007
ily+0.007
Fore+0.006
 Claire+0.006
Prin+0.006
Neg logit contributions
 dodging-0.010
itching-0.009
arate-0.009
ples-0.009
oing-0.009
Token others
Token ID1854
Feature activation-9.29e-01

Pos logit contributions
 dodging+0.007
iled+0.006
arate+0.006
ples+0.006
flies+0.006
Neg logit contributions
 someone-0.005
ily-0.005
Prin-0.005
Fore-0.005
 Claire-0.005
Token.
Token ID13
Feature activation+9.06e-01

Pos logit contributions
 someone+0.019
ily+0.016
 out+0.015
 time+0.014
 Claire+0.014
Neg logit contributions
 dodging-0.028
itching-0.026
ang-0.026
ples-0.026
oing-0.026
Token<|endoftext|>
Token ID50256
Feature activation-3.65e-01

Pos logit contributions
iled+0.028
 dodging+0.028
arate+0.028
oc+0.028
oing+0.027
Neg logit contributions
ily-0.014
 someone-0.013
Prin-0.012
 Claire-0.011
P-0.011
Token"
Token ID1
Feature activation-3.73e-01

Pos logit contributions
abby+0.007
Fore+0.007
atalie+0.007
 someone+0.007
oph+0.007
Neg logit contributions
iled-0.008
oc-0.008
ang-0.008
arate-0.008
mer-0.008
TokenLook
Token ID8567
Feature activation-3.96e+00

Pos logit contributions
ily+0.008
 someone+0.007
atalie+0.007
Fore+0.007
 Claire+0.007
Neg logit contributions
 dodging-0.009
iled-0.009
king-0.009
ples-0.009
oing-0.009
Token,
Token ID11
Feature activation-2.65e-01

Pos logit contributions
Fore+0.109
 someone+0.109
Prin+0.108
ily+0.106
P+0.101
Neg logit contributions
iled-0.073
tain-0.066
ience-0.065
iosity-0.065
ang-0.064
Token there
Token ID612
Feature activation-9.46e-01

Pos logit contributions
ily+0.005
Fore+0.005
 someone+0.005
P+0.005
Prin+0.005
Neg logit contributions
 dodging-0.007
iled-0.007
flies-0.007
arate-0.007
iosity-0.007
Token's
Token ID338
Feature activation-1.17e+00

Pos logit contributions
ily+0.019
 someone+0.018
Prin+0.017
Fore+0.017
P+0.016
Neg logit contributions
iled-0.025
flies-0.024
 dodging-0.024
arate-0.024
oc-0.024
Token a
Token ID257
Feature activation-1.05e+00

Pos logit contributions
Prin+0.021
ily+0.021
ackers+0.020
abby+0.020
atalie+0.020
Neg logit contributions
 dodging-0.029
flies-0.029
iled-0.029
 pretending-0.029
 roaring-0.028
Token stand
Token ID1302
Feature activation-8.66e-01

Pos logit contributions
 someone+0.021
ily+0.020
Fore+0.018
Prin+0.018
 Claire+0.018
Neg logit contributions
 dodging-0.030
iled-0.029
arate-0.028
ples-0.028
flies-0.027
Tokenooky
Token ID29655
Feature activation-4.29e-01

Pos logit contributions
 dodging+0.020
iled+0.019
arate+0.018
ples+0.018
flies+0.018
Neg logit contributions
 someone-0.018
ily-0.018
Fore-0.016
Prin-0.016
 Claire-0.016
Token.
Token ID13
Feature activation-8.88e-01

Pos logit contributions
 someone+0.006
 out+0.005
 Claire+0.004
ily+0.004
 up+0.003
Neg logit contributions
arate-0.017
ang-0.016
years-0.016
iosity-0.015
istent-0.015
Token
Token ID198
Feature activation-6.22e-01

Pos logit contributions
Fore+0.024
abby+0.024
Prin+0.024
ics+0.023
P+0.023
Neg logit contributions
 because-0.013
 Winning-0.012
 Because-0.012
 barking-0.012
 Sharks-0.012
Token
Token ID198
Feature activation-8.07e-01

Pos logit contributions
P+0.014
oph+0.014
 ma+0.014
Prin+0.014
izons+0.014
Neg logit contributions
oc-0.013
iled-0.013
ang-0.011
ines-0.011
oing-0.011
Token"
Token ID1
Feature activation+3.05e-01

Pos logit contributions
 someone+0.021
 Claire+0.020
 Heidi+0.020
atalie+0.020
 av+0.020
Neg logit contributions
iled-0.016
oc-0.013
ang-0.012
arate-0.012
oing-0.012
TokenLook
Token ID8567
Feature activation-3.95e+00

Pos logit contributions
 dodging+0.007
iled+0.007
ples+0.007
oing+0.007
ky+0.007
Neg logit contributions
 someone-0.007
ily-0.006
 out-0.006
Fore-0.006
atalie-0.005
Token,
Token ID11
Feature activation-1.29e+00

Pos logit contributions
 someone+0.106
Prin+0.106
ily+0.105
Fore+0.105
atalie+0.096
Neg logit contributions
iled-0.077
 dodging-0.072
arate-0.070
iosity-0.069
ang-0.068
Token Lily
Token ID20037
Feature activation-1.61e+00

Pos logit contributions
Prin+0.026
Fore+0.025
atalie+0.024
P+0.023
ily+0.023
Neg logit contributions
 dodging-0.032
 pretending-0.032
arate-0.031
iled-0.030
iosity-0.030
Token,
Token ID11
Feature activation-6.10e-01

Pos logit contributions
 someone+0.043
ily+0.043
Fore+0.041
Prin+0.041
 Claire+0.039
Neg logit contributions
 dodging-0.033
iled-0.033
flies-0.031
arate-0.030
oc-0.030
Token look
Token ID804
Feature activation-2.03e+00

Pos logit contributions
 someone+0.011
ily+0.011
Prin+0.010
Fore+0.010
 Claire+0.010
Neg logit contributions
 dodging-0.018
iled-0.017
arate-0.017
flies-0.016
ples-0.016
Token!"
Token ID2474
Feature activation-1.24e+00

Pos logit contributions
 someone+0.052
ily+0.051
Fore+0.046
Prin+0.046
 Claire+0.046
Neg logit contributions
 dodging-0.047
iled-0.044
arate-0.043
ples-0.042
oing-0.042
Token angry
Token ID7954
Feature activation+6.89e-01

Pos logit contributions
 dodging+0.004
iled+0.004
itching+0.004
ang+0.004
ples+0.004
Neg logit contributions
 someone-0.006
ily-0.005
Fore-0.005
Prin-0.005
 Claire-0.005
Token and
Token ID290
Feature activation-4.14e-01

Pos logit contributions
itching+0.021
 dodging+0.021
ples+0.020
ience+0.020
arate+0.020
Neg logit contributions
 someone-0.014
ily-0.013
 out-0.010
 Lily-0.010
-0.009
Token said
Token ID531
Feature activation-3.50e-01

Pos logit contributions
 someone+0.010
ily+0.010
Fore+0.009
Prin+0.009
 Claire+0.009
Neg logit contributions
 dodging-0.010
iled-0.009
arate-0.009
ples-0.009
ang-0.009
Token,
Token ID11
Feature activation-1.99e-01

Pos logit contributions
 someone+0.009
ily+0.009
Fore+0.008
Prin+0.008
 Claire+0.008
Neg logit contributions
 dodging-0.008
iled-0.007
arate-0.007
flies-0.007
oing-0.007
Token "
Token ID366
Feature activation-2.09e-01

Pos logit contributions
 someone+0.003
ily+0.003
Fore+0.003
Prin+0.003
 Claire+0.003
Neg logit contributions
 dodging-0.006
arate-0.006
iled-0.006
ang-0.006
ples-0.006
TokenLook
Token ID8567
Feature activation-3.93e+00

Pos logit contributions
 someone+0.004
ily+0.004
 Claire+0.004
Prin+0.004
Fore+0.003
Neg logit contributions
 dodging-0.006
iled-0.006
arate-0.006
ang-0.006
oing-0.006
Token what
Token ID644
Feature activation+7.68e-01

Pos logit contributions
 someone+0.100
ily+0.099
Prin+0.094
Fore+0.092
 Claire+0.090
Neg logit contributions
 dodging-0.086
iled-0.085
arate-0.081
ang-0.079
iosity-0.079
Token you
Token ID345
Feature activation-7.11e-03

Pos logit contributions
arate+0.020
iled+0.020
 dodging+0.019
 pretending+0.019
flies+0.018
Neg logit contributions
Prin-0.015
atalie-0.015
Fore-0.014
abby-0.014
ily-0.014
Token did
Token ID750
Feature activation-8.62e-03

Pos logit contributions
 someone+0.000
ily+0.000
Prin+0.000
Fore+0.000
 Claire+0.000
Neg logit contributions
 dodging-0.000
iled-0.000
arate-0.000
ples-0.000
ang-0.000
Token,
Token ID11
Feature activation-4.03e-01

Pos logit contributions
 someone+0.000
ily+0.000
 Claire+0.000
Prin+0.000
Fore+0.000
Neg logit contributions
 dodging-0.000
ang-0.000
ples-0.000
oing-0.000
iled-0.000
Token Ben
Token ID3932
Feature activation-1.16e+00

Pos logit contributions
 someone+0.005
ily+0.005
Prin+0.004
Fore+0.004
P+0.004
Neg logit contributions
 dodging-0.014
iled-0.014
ang-0.013
arate-0.013
iosity-0.013
Token colorful
Token ID20239
Feature activation-1.55e+00

Pos logit contributions
 someone+0.028
ily+0.026
Prin+0.024
 Claire+0.023
Fore+0.023
Neg logit contributions
 dodging-0.035
iled-0.034
arate-0.033
ang-0.033
iosity-0.033
Token.
Token ID13
Feature activation-5.69e-01

Pos logit contributions
 someone+0.028
ily+0.024
 out+0.021
Prin+0.021
 Claire+0.020
Neg logit contributions
 dodging-0.049
ang-0.048
iled-0.048
arate-0.048
itching-0.047
Token
Token ID198
Feature activation-3.12e-01

Pos logit contributions
ily+0.012
abby+0.011
Prin+0.011
ackers+0.011
 someone+0.011
Neg logit contributions
 dodging-0.014
iled-0.013
arate-0.013
ples-0.013
ang-0.013
Token
Token ID198
Feature activation-4.58e-01

Pos logit contributions
 Claire+0.007
Prin+0.006
P+0.006
 someone+0.006
 Nek+0.006
Neg logit contributions
oc-0.008
iled-0.007
ang-0.007
oing-0.007
ines-0.007
Token"
Token ID1
Feature activation+4.82e-01

Pos logit contributions
 someone+0.011
 Claire+0.010
ily+0.010
atalie+0.009
ackers+0.009
Neg logit contributions
iled-0.011
 dodging-0.011
ang-0.011
arate-0.011
oc-0.011
TokenLook
Token ID8567
Feature activation-3.93e+00

Pos logit contributions
 dodging+0.011
ples+0.011
arate+0.011
iled+0.011
ky+0.011
Neg logit contributions
 someone-0.011
ily-0.010
 Claire-0.009
atalie-0.009
ackers-0.009
Token,
Token ID11
Feature activation-7.64e-01

Pos logit contributions
 someone+0.104
ily+0.104
Prin+0.100
Fore+0.099
 Claire+0.095
Neg logit contributions
iled-0.079
 dodging-0.078
arate-0.074
iosity-0.073
ang-0.073
Token Mom
Token ID11254
Feature activation-1.46e+00

Pos logit contributions
ily+0.011
 someone+0.011
Prin+0.011
Fore+0.011
atalie+0.010
Neg logit contributions
 dodging-0.024
arate-0.023
iled-0.023
flies-0.022
tain-0.022
Token,"
Token ID553
Feature activation-5.06e-01

Pos logit contributions
 someone+0.032
ily+0.031
Fore+0.027
Prin+0.027
 Claire+0.027
Neg logit contributions
 dodging-0.039
iled-0.037
arate-0.036
oing-0.036
ang-0.035
Token Tom
Token ID4186
Feature activation-4.46e-01

Pos logit contributions
ily+0.008
 someone+0.008
Prin+0.008
Fore+0.008
ackers+0.007
Neg logit contributions
 dodging-0.015
arate-0.015
iled-0.014
iosity-0.014
ang-0.014
Token said
Token ID531
Feature activation-1.76e-01

Pos logit contributions
 someone+0.009
ily+0.009
Prin+0.008
 Claire+0.008
Fore+0.007
Neg logit contributions
 dodging-0.013
iled-0.012
arate-0.012
ples-0.012
flies-0.012
Token of
Token ID286
Feature activation+1.38e-01

Pos logit contributions
 someone+0.010
ily+0.010
 out+0.008
 Claire+0.008
 Sophie+0.008
Neg logit contributions
 dodging-0.018
iosity-0.017
itching-0.017
arate-0.017
iled-0.017
Token different
Token ID1180
Feature activation-1.66e-01

Pos logit contributions
 dodging+0.004
iled+0.004
iosity+0.004
oing+0.004
arate+0.004
Neg logit contributions
 someone-0.003
ily-0.003
 Claire-0.002
Prin-0.002
Fore-0.002
Token colours
Token ID18915
Feature activation-1.41e+00

Pos logit contributions
 someone+0.004
ily+0.003
 Claire+0.003
Prin+0.003
 time+0.003
Neg logit contributions
 dodging-0.005
iled-0.005
ang-0.004
oing-0.004
arate-0.004
Token.
Token ID13
Feature activation-1.01e+00

Pos logit contributions
 someone+0.022
ily+0.019
 out+0.019
 Claire+0.017
 time+0.016
Neg logit contributions
 dodging-0.049
iosity-0.048
arate-0.048
years-0.048
oc-0.047
Token "
Token ID366
Feature activation-4.08e-01

Pos logit contributions
ily+0.022
Fore+0.021
 someone+0.021
Prin+0.020
P+0.019
Neg logit contributions
iled-0.024
ples-0.024
 dodging-0.024
arate-0.023
ang-0.022
TokenLook
Token ID8567
Feature activation-3.91e+00

Pos logit contributions
 someone+0.008
ily+0.008
Fore+0.007
atalie+0.007
 Claire+0.007
Neg logit contributions
ples-0.011
iled-0.011
 dodging-0.011
arate-0.011
ang-0.011
Token,"
Token ID553
Feature activation-1.37e+00

Pos logit contributions
 someone+0.095
ily+0.095
Fore+0.090
Prin+0.088
 Claire+0.082
Neg logit contributions
iled-0.086
 dodging-0.085
ples-0.082
flies-0.082
arate-0.082
Token said
Token ID531
Feature activation-1.11e+00

Pos logit contributions
ily+0.026
Fore+0.025
 someone+0.024
Prin+0.024
ackers+0.022
Neg logit contributions
iled-0.036
 dodging-0.035
ang-0.035
arate-0.034
ples-0.034
Token M
Token ID337
Feature activation-2.25e-01

Pos logit contributions
Fore+0.029
ily+0.026
Prin+0.026
 someone+0.025
P+0.024
Neg logit contributions
iled-0.022
ples-0.021
flies-0.020
ang-0.020
arate-0.020
Tokenummy
Token ID13513
Feature activation+8.66e-02

Pos logit contributions
 someone+0.006
ily+0.005
Prin+0.005
Fore+0.005
P+0.005
Neg logit contributions
iled-0.005
 dodging-0.005
ples-0.005
flies-0.005
oing-0.005
Token.
Token ID13
Feature activation-4.90e-01

Pos logit contributions
 dodging+0.003
ang+0.002
arate+0.002
itching+0.002
oing+0.002
Neg logit contributions
ily-0.002
 someone-0.002
 Claire-0.002
 Sophie-0.001
 Lily-0.001
Token and
Token ID290
Feature activation-1.57e-01

Pos logit contributions
 dodging+0.012
ples+0.011
arate+0.011
iled+0.011
iosity+0.011
Neg logit contributions
 someone-0.009
ily-0.008
 Claire-0.007
Fore-0.007
Prin-0.007
Token examined
Token ID11068
Feature activation-7.99e-02

Pos logit contributions
ily+0.003
 someone+0.003
Fore+0.003
Prin+0.003
 Claire+0.003
Neg logit contributions
 dodging-0.004
iled-0.004
arate-0.004
flies-0.004
ples-0.004
Token it
Token ID340
Feature activation-8.04e-03

Pos logit contributions
 someone+0.001
ily+0.001
 Claire+0.001
Fore+0.001
Prin+0.001
Neg logit contributions
 dodging-0.003
iled-0.003
oing-0.003
arate-0.003
flies-0.003
Token.
Token ID13
Feature activation-1.95e-01

Pos logit contributions
 someone+0.000
 out+0.000
ily+0.000
 up+0.000
 Claire+0.000
Neg logit contributions
 dodging-0.000
arate-0.000
ang-0.000
oing-0.000
iled-0.000
Token "
Token ID366
Feature activation+1.63e-01

Pos logit contributions
ily+0.005
 someone+0.005
Fore+0.004
Prin+0.004
P+0.004
Neg logit contributions
 dodging-0.005
iled-0.004
arate-0.004
ples-0.004
iosity-0.004
TokenLook
Token ID8567
Feature activation-3.90e+00

Pos logit contributions
 dodging+0.004
iled+0.004
arate+0.004
ples+0.004
ang+0.004
Neg logit contributions
 someone-0.003
ily-0.003
 Claire-0.003
Prin-0.003
Fore-0.003
Token at
Token ID379
Feature activation+4.56e-01

Pos logit contributions
 someone+0.100
ily+0.099
Prin+0.094
Fore+0.094
 Claire+0.089
Neg logit contributions
 dodging-0.083
iled-0.083
arate-0.079
ang-0.077
iosity-0.077
Token its
Token ID663
Feature activation-3.06e-01

Pos logit contributions
 dodging+0.012
iled+0.011
arate+0.011
flies+0.011
ples+0.011
Neg logit contributions
ily-0.010
 someone-0.010
Prin-0.009
Fore-0.009
 Claire-0.009
Token little
Token ID1310
Feature activation-1.05e+00

Pos logit contributions
 someone+0.007
ily+0.006
 Claire+0.006
Prin+0.005
Fore+0.005
Neg logit contributions
 dodging-0.008
iled-0.008
arate-0.008
ples-0.008
ang-0.008
Token claws
Token ID28421
Feature activation-3.43e-01

Pos logit contributions
 someone+0.025
ily+0.025
Prin+0.022
Fore+0.022
 Claire+0.022
Neg logit contributions
 dodging-0.026
iled-0.024
arate-0.024
ples-0.023
oing-0.023
Token Ben
Token ID3932
Feature activation-4.18e-01

Pos logit contributions
 someone+0.008
ily+0.008
 Claire+0.007
Prin+0.007
Fore+0.007
Neg logit contributions
 dodging-0.008
iled-0.008
arate-0.008
ang-0.008
ples-0.008
Token mom
Token ID1995
Feature activation-2.20e-01

Pos logit contributions
 dodging+0.015
iled+0.014
arate+0.014
oing+0.014
flies+0.014
Neg logit contributions
 someone-0.009
ily-0.009
Prin-0.008
 Claire-0.008
Fore-0.008
Token about
Token ID546
Feature activation+1.77e+00

Pos logit contributions
 someone+0.005
ily+0.005
Fore+0.005
Prin+0.004
 Claire+0.004
Neg logit contributions
 dodging-0.006
iled-0.005
ples-0.005
ang-0.005
arate-0.005
Token Max
Token ID5436
Feature activation+1.78e+00

Pos logit contributions
 dodging+0.057
iled+0.056
arate+0.054
ples+0.054
ang+0.054
Neg logit contributions
 someone-0.029
ily-0.028
Fore-0.024
Prin-0.023
 Claire-0.023
Token taking
Token ID2263
Feature activation+4.37e-01

Pos logit contributions
ang+0.060
ples+0.060
arate+0.057
oing+0.057
chie+0.057
Neg logit contributions
 someone-0.033
ily-0.031
 time-0.025
 Lily-0.025
 out-0.024
Token his
Token ID465
Feature activation+1.75e+00

Pos logit contributions
 dodging+0.014
iled+0.013
arate+0.013
flies+0.013
ples+0.013
Neg logit contributions
 someone-0.008
ily-0.007
Prin-0.006
Fore-0.006
 Claire-0.006
Token shield
Token ID7614
Feature activation+2.54e+00

Pos logit contributions
 dodging+0.038
iled+0.036
arate+0.034
itching+0.034
iosity+0.034
Neg logit contributions
ily-0.046
 someone-0.045
Prin-0.042
 Claire-0.042
Fore-0.042
Token.
Token ID13
Feature activation+8.76e-01

Pos logit contributions
 dodging+0.068
ples+0.066
iled+0.065
arate+0.064
ang+0.064
Neg logit contributions
 someone-0.056
ily-0.054
Prin-0.047
Fore-0.046
 Claire-0.045
Token His
Token ID2399
Feature activation+8.32e-01

Pos logit contributions
ang+0.033
 dodging+0.031
ples+0.031
arate+0.031
oing+0.031
Neg logit contributions
 someone-0.014
ily-0.011
 Lily-0.011
 Claire-0.009
Prin-0.009
Token mom
Token ID1995
Feature activation+3.74e-01

Pos logit contributions
 dodging+0.023
iled+0.022
itching+0.021
iosity+0.021
flies+0.021
Neg logit contributions
ily-0.017
 someone-0.016
Prin-0.015
 Claire-0.015
Fore-0.015
Token said
Token ID531
Feature activation-9.36e-03

Pos logit contributions
 dodging+0.013
tain+0.012
itching+0.012
flies+0.012
ples+0.012
Neg logit contributions
ily-0.006
 someone-0.006
Fore-0.005
Prin-0.005
 Claire-0.004
Token,
Token ID11
Feature activation+3.15e-01

Pos logit contributions
 someone+0.000
ily+0.000
Prin+0.000
Fore+0.000
 Claire+0.000
Neg logit contributions
 dodging-0.000
iled-0.000
arate-0.000
iosity-0.000
ples-0.000
Token Lily
Token ID20037
Feature activation-1.25e+00

Pos logit contributions
 dodging+0.027
iled+0.026
ples+0.025
arate+0.025
oing+0.024
Neg logit contributions
 someone-0.017
ily-0.016
Fore-0.016
Prin-0.015
 Claire-0.014
Token loved
Token ID6151
Feature activation+8.37e-01

Pos logit contributions
 someone+0.032
ily+0.030
Prin+0.030
 Claire+0.029
Fore+0.029
Neg logit contributions
iled-0.026
 dodging-0.025
oc-0.025
arate-0.024
ples-0.024
Token to
Token ID284
Feature activation+1.15e-01

Pos logit contributions
 dodging+0.023
iled+0.023
ang+0.022
ples+0.022
iosity+0.022
Neg logit contributions
 someone-0.018
ily-0.017
Fore-0.015
Prin-0.015
 Claire-0.015
Token eat
Token ID4483
Feature activation+1.36e-01

Pos logit contributions
 dodging+0.003
iled+0.003
ang+0.003
flies+0.003
iosity+0.003
Neg logit contributions
 someone-0.003
ily-0.003
 Claire-0.002
Prin-0.002
Fore-0.002
Token av
Token ID1196
Feature activation+2.36e-01

Pos logit contributions
 dodging+0.004
iled+0.003
oc+0.003
arate+0.003
itching+0.003
Neg logit contributions
ily-0.003
 someone-0.003
Fore-0.003
Prin-0.003
 Claire-0.002
Tokenoc
Token ID420
Feature activation+2.57e+00

Pos logit contributions
 dodging+0.001
iled+0.001
arate+0.001
oc+0.000
ples+0.000
Neg logit contributions
 someone-0.010
ily-0.010
 Claire-0.010
Fore-0.010
Prin-0.010
Tokenados
Token ID22484
Feature activation-2.74e-02

Pos logit contributions
 dodging+0.051
iled+0.049
arate+0.049
iosity+0.047
itching+0.047
Neg logit contributions
 someone-0.071
ily-0.070
Fore-0.065
Prin-0.065
 Claire-0.064
Token,
Token ID11
Feature activation+7.07e-01

Pos logit contributions
 someone+0.001
ily+0.001
Fore+0.001
Prin+0.001
 Claire+0.001
Neg logit contributions
 dodging-0.001
iled-0.001
arate-0.001
ples-0.001
iosity-0.001
Token they
Token ID484
Feature activation-4.63e-01

Pos logit contributions
 dodging+0.012
 because+0.010
 pretending+0.010
 bothering+0.010
iled+0.010
Neg logit contributions
ily-0.021
Prin-0.020
atalie-0.020
Fore-0.020
 someone-0.020
Token were
Token ID547
Feature activation-1.44e+00

Pos logit contributions
ily+0.008
 someone+0.008
Fore+0.007
 Claire+0.007
Prin+0.007
Neg logit contributions
 dodging-0.015
arate-0.014
iled-0.014
oing-0.014
iosity-0.013
Token her
Token ID607
Feature activation-8.51e-01

Pos logit contributions
 someone+0.032
ily+0.031
Prin+0.029
Fore+0.028
 Claire+0.027
Neg logit contributions
 dodging-0.036
iled-0.035
ples-0.034
flies-0.033
arate-0.033
Token?"
Token ID1701
Feature activation-7.34e-01

Pos logit contributions
 dodging+0.007
iled+0.007
arate+0.007
ples+0.007
ang+0.007
Neg logit contributions
 someone-0.003
ily-0.003
Fore-0.002
Prin-0.002
 Claire-0.002
Token
Token ID198
Feature activation-1.76e-01

Pos logit contributions
Prin+0.011
ily+0.011
 Claire+0.010
abby+0.010
atalie+0.010
Neg logit contributions
 dodging-0.022
arate-0.022
iled-0.022
oc-0.021
ang-0.021
Token
Token ID198
Feature activation-3.09e-01

Pos logit contributions
 Claire+0.004
Prin+0.003
abby+0.003
oph+0.003
atalie+0.003
Neg logit contributions
iled-0.005
ang-0.005
oc-0.005
arate-0.005
 dodging-0.005
TokenTom
Token ID13787
Feature activation+8.28e-01

Pos logit contributions
 someone+0.004
ily+0.004
Fore+0.003
Prin+0.003
P+0.003
Neg logit contributions
 dodging-0.011
flies-0.011
ples-0.011
iled-0.011
iosity-0.011
Token and
Token ID290
Feature activation+1.18e+00

Pos logit contributions
 dodging+0.021
iled+0.020
ples+0.020
arate+0.020
oing+0.020
Neg logit contributions
 someone-0.019
ily-0.019
Fore-0.016
Prin-0.016
 Claire-0.016
Token Mia
Token ID32189
Feature activation+2.82e+00

Pos logit contributions
 dodging+0.035
iled+0.034
arate+0.033
ples+0.033
flies+0.033
Neg logit contributions
 someone-0.021
ily-0.021
Fore-0.019
Prin-0.018
 Claire-0.018
Token think
Token ID892
Feature activation+1.17e+00

Pos logit contributions
 dodging+0.078
iled+0.074
arate+0.072
ples+0.072
ang+0.071
Neg logit contributions
 someone-0.058
ily-0.057
Fore-0.051
Prin-0.051
 Claire-0.050
Token.
Token ID13
Feature activation+5.81e-01

Pos logit contributions
 dodging+0.029
iled+0.028
arate+0.027
ples+0.027
ang+0.027
Neg logit contributions
 someone-0.027
ily-0.027
Prin-0.024
Fore-0.024
 Claire-0.024
Token They
Token ID1119
Feature activation+1.16e+00

Pos logit contributions
 dodging+0.019
itching+0.019
iled+0.018
iosity+0.018
ience+0.018
Neg logit contributions
 someone-0.010
ily-0.009
Fore-0.007
Prin-0.007
 Claire-0.007
Token are
Token ID389
Feature activation+3.06e-02

Pos logit contributions
 dodging+0.034
iled+0.034
oing+0.033
ang+0.033
ples+0.033
Neg logit contributions
ily-0.022
 someone-0.021
Fore-0.018
Prin-0.018
 Claire-0.018
Token kind
Token ID1611
Feature activation+3.14e-01

Pos logit contributions
 dodging+0.001
iled+0.001
arate+0.001
oing+0.001
ples+0.001
Neg logit contributions
 someone-0.001
ily-0.001
Fore-0.001
Prin-0.001
 Claire-0.001
TokenWhat
Token ID2061
Feature activation+3.81e-01

Pos logit contributions
 dodging+0.012
iled+0.012
arate+0.012
ang+0.011
oing+0.011
Neg logit contributions
 someone-0.009
ily-0.009
 Claire-0.008
Fore-0.008
Prin-0.008
Token is
Token ID318
Feature activation+1.45e-01

Pos logit contributions
iled+0.010
 dodging+0.010
flies+0.010
arate+0.010
oc+0.009
Neg logit contributions
ily-0.008
 someone-0.007
Prin-0.007
 Claire-0.007
Fore-0.007
Token this
Token ID428
Feature activation+1.34e+00

Pos logit contributions
oing+0.004
 dodging+0.004
iled+0.004
ples+0.004
ang+0.003
Neg logit contributions
 someone-0.004
ily-0.003
Fore-0.003
 Claire-0.003
Prin-0.003
Token,
Token ID11
Feature activation+1.60e+00

Pos logit contributions
 dodging+0.043
flies+0.042
iled+0.042
oc+0.042
arate+0.041
Neg logit contributions
ily-0.020
 someone-0.018
Fore-0.017
Prin-0.016
 Claire-0.016
Token Mr
Token ID1770
Feature activation+3.06e-02

Pos logit contributions
 dodging+0.040
iled+0.038
flies+0.037
arate+0.037
ang+0.036
Neg logit contributions
ily-0.036
 someone-0.036
Prin-0.033
Fore-0.033
 Claire-0.032
Token.
Token ID13
Feature activation+3.52e+00

Pos logit contributions
flies+0.001
iled+0.001
oc+0.001
fly+0.001
gie+0.001
Neg logit contributions
ily-0.001
 someone-0.001
Prin-0.001
Fore-0.001
 Claire-0.001
Token Owl
Token ID37007
Feature activation+5.01e-01

Pos logit contributions
 dodging+0.053
iled+0.050
arate+0.047
ples+0.046
ang+0.046
Neg logit contributions
 someone-0.117
ily-0.114
Prin-0.107
Fore-0.106
 Claire-0.105
Token?"
Token ID1701
Feature activation-6.50e-01

Pos logit contributions
 dodging+0.019
iled+0.018
arate+0.018
ples+0.018
ang+0.017
Neg logit contributions
 someone-0.005
ily-0.005
Fore-0.004
Prin-0.004
 Claire-0.004
Token Benny
Token ID44275
Feature activation-1.45e-01

Pos logit contributions
ily+0.013
Fore+0.012
abby+0.011
 someone+0.011
ackers+0.011
Neg logit contributions
 dodging-0.017
oc-0.016
iled-0.016
arate-0.016
iosity-0.016
Token asked
Token ID1965
Feature activation-7.57e-01

Pos logit contributions
 someone+0.003
ily+0.003
Prin+0.002
Fore+0.002
 Claire+0.002
Neg logit contributions
 dodging-0.004
iled-0.004
arate-0.004
ples-0.004
ang-0.004
Token.
Token ID13
Feature activation-1.77e-01

Pos logit contributions
Fore+0.023
ily+0.022
P+0.021
Prin+0.021
ackers+0.020
Neg logit contributions
iled-0.013
 dodging-0.012
 roaring-0.011
oc-0.011
ishly-0.011
Token he
Token ID339
Feature activation+4.45e-01

Pos logit contributions
ily+0.009
Prin+0.009
 someone+0.009
P+0.009
Fore+0.009
Neg logit contributions
 dodging-0.009
iled-0.008
arate-0.008
flies-0.008
 roaring-0.008
Token watched
Token ID7342
Feature activation+7.47e-01

Pos logit contributions
 dodging+0.014
oing+0.014
arate+0.013
tain+0.013
iosity+0.013
Neg logit contributions
 someone-0.007
ily-0.007
 went-0.007
Prin-0.006
Fore-0.006
Token the
Token ID262
Feature activation+6.84e-02

Pos logit contributions
 dodging+0.023
iled+0.022
arate+0.022
ang+0.022
flies+0.021
Neg logit contributions
 someone-0.014
ily-0.013
 Claire-0.011
Prin-0.011
Fore-0.011
Token hero
Token ID4293
Feature activation+1.37e+00

Pos logit contributions
iled+0.002
flies+0.002
ared+0.002
ang+0.002
anged+0.002
Neg logit contributions
 someone-0.002
 other-0.002
 time-0.001
 rest-0.001
 people-0.001
Token's
Token ID338
Feature activation+1.50e-01

Pos logit contributions
itching+0.040
 dodging+0.040
arate+0.040
istered+0.039
ang+0.038
Neg logit contributions
 someone-0.026
ily-0.023
 out-0.022
 go-0.021
 Claire-0.021
Token triumph
Token ID15499
Feature activation+3.05e+00

Pos logit contributions
iled+0.004
ang+0.004
arate+0.004
 dodging+0.004
ples+0.004
Neg logit contributions
 someone-0.003
 time-0.003
 other-0.003
 rest-0.003
 go-0.003
Token.
Token ID13
Feature activation+6.90e-01

Pos logit contributions
 dodging+0.078
iled+0.075
ples+0.074
flies+0.072
arate+0.072
Neg logit contributions
ily-0.068
 someone-0.068
Fore-0.061
Prin-0.060
 Claire-0.060
Token He
Token ID679
Feature activation+7.08e-01

Pos logit contributions
iled+0.016
 dodging+0.016
arate+0.015
oc+0.015
ang+0.015
Neg logit contributions
ily-0.017
 someone-0.016
Prin-0.016
 Claire-0.015
Fore-0.015
Token had
Token ID550
Feature activation+3.14e-01

Pos logit contributions
 dodging+0.024
oing+0.023
arate+0.022
iled+0.022
tain+0.022
Neg logit contributions
 someone-0.011
ily-0.010
Fore-0.009
 Claire-0.009
 went-0.009
Token known
Token ID1900
Feature activation-2.50e-01

Pos logit contributions
 dodging+0.009
iled+0.008
arate+0.008
ang+0.008
oing+0.008
Neg logit contributions
 someone-0.007
ily-0.006
Prin-0.006
Fore-0.006
 Claire-0.006
Token all
Token ID477
Feature activation+2.14e-01

Pos logit contributions
 someone+0.006
ily+0.006
 Claire+0.005
Fore+0.005
Prin+0.005
Neg logit contributions
oing-0.006
 dodging-0.006
iled-0.006
ples-0.006
arate-0.006
Token "
Token ID366
Feature activation-2.54e-01

Pos logit contributions
 someone+0.004
ily+0.004
Fore+0.003
Prin+0.003
 Claire+0.003
Neg logit contributions
 dodging-0.011
iled-0.011
arate-0.011
ples-0.010
flies-0.010
TokenYou
Token ID1639
Feature activation+2.33e-01

Pos logit contributions
 someone+0.004
ily+0.004
Fore+0.004
Prin+0.004
 Claire+0.004
Neg logit contributions
 dodging-0.008
iled-0.008
flies-0.007
ples-0.007
arate-0.007
Token have
Token ID423
Feature activation-2.46e-01

Pos logit contributions
 dodging+0.008
oing+0.007
ples+0.007
itching+0.007
 mastering+0.007
Neg logit contributions
 someone-0.004
ily-0.004
 go-0.003
 out-0.003
Fore-0.003
Token the
Token ID262
Feature activation-7.12e-01

Pos logit contributions
 someone+0.005
ily+0.005
Fore+0.004
Prin+0.004
 Claire+0.004
Neg logit contributions
 dodging-0.007
oing-0.007
ang-0.007
iled-0.007
ples-0.007
Token blue
Token ID4171
Feature activation-2.44e-01

Pos logit contributions
 someone+0.012
ily+0.011
 Claire+0.009
Fore+0.009
Prin+0.009
Neg logit contributions
iled-0.024
 dodging-0.024
oing-0.023
oc-0.023
ples-0.023
Token boat
Token ID8848
Feature activation+5.03e-01

Pos logit contributions
 someone+0.006
ily+0.005
Fore+0.005
 Claire+0.005
Prin+0.005
Neg logit contributions
 dodging-0.006
iled-0.006
 mastering-0.006
arate-0.006
ples-0.006
Token!"
Token ID2474
Feature activation-1.08e+00

Pos logit contributions
 dodging+0.014
oing+0.013
 mastering+0.013
itching+0.013
ples+0.013
Neg logit contributions
 someone-0.011
ily-0.010
 out-0.009
 Claire-0.008
 Lily-0.008
Token
Token ID198
Feature activation-1.98e-01

Pos logit contributions
 someone+0.010
ily+0.010
Prin+0.008
Fore+0.008
 Claire+0.008
Neg logit contributions
 dodging-0.041
iled-0.040
arate-0.039
ang-0.039
iosity-0.038
Token
Token ID198
Feature activation-1.78e-01

Pos logit contributions
 Claire+0.004
Prin+0.004
oph+0.004
 Nek+0.004
ackers+0.003
Neg logit contributions
iled-0.005
oc-0.005
ang-0.005
arate-0.005
oing-0.005
Token"
Token ID1
Feature activation-1.06e-01

Pos logit contributions
 someone+0.003
ily+0.003
Fore+0.002
P+0.002
Prin+0.002
Neg logit contributions
 dodging-0.006
flies-0.006
ples-0.006
tain-0.006
oing-0.006
TokenI
Token ID40
Feature activation-1.59e+00

Pos logit contributions
ily+0.002
 someone+0.002
Fore+0.002
Prin+0.002
P+0.002
Neg logit contributions
flies-0.003
 dodging-0.003
tain-0.003
iled-0.003
ples-0.003
Token
Token ID198
Feature activation+4.36e-01

Pos logit contributions
 dodging+0.026
iled+0.025
arate+0.025
ples+0.024
ang+0.024
Neg logit contributions
 someone-0.018
ily-0.018
Fore-0.016
Prin-0.015
 Claire-0.015
Token
Token ID198
Feature activation+7.50e-01

Pos logit contributions
iled+0.013
oc+0.013
ang+0.012
oing+0.012
arate+0.012
Neg logit contributions
 Claire-0.007
ily-0.007
 someone-0.007
Prin-0.007
 ma-0.007
TokenIn
Token ID818
Feature activation+1.35e+00

Pos logit contributions
 dodging+0.022
flies+0.022
ples+0.021
iosity+0.021
tain+0.021
Neg logit contributions
 someone-0.015
ily-0.014
P-0.013
Fore-0.013
Prin-0.012
Token the
Token ID262
Feature activation+1.28e-01

Pos logit contributions
 dodging+0.045
iled+0.043
arate+0.042
ples+0.042
oing+0.042
Neg logit contributions
 someone-0.021
ily-0.020
Fore-0.017
Prin-0.017
 Claire-0.017
Token woods
Token ID16479
Feature activation+3.86e-01

Pos logit contributions
 dodging+0.003
iled+0.003
arate+0.003
oc+0.003
ang+0.003
Neg logit contributions
ily-0.003
 someone-0.003
 Claire-0.002
Prin-0.002
Fore-0.002
Token,
Token ID11
Feature activation+1.02e+00

Pos logit contributions
 dodging+0.011
ples+0.010
iled+0.010
itching+0.010
oing+0.010
Neg logit contributions
 someone-0.009
ily-0.008
Fore-0.007
Prin-0.007
 Claire-0.007
Token Buddy
Token ID36896
Feature activation+9.99e-01

Pos logit contributions
 dodging+0.022
ples+0.022
iled+0.022
arate+0.022
oing+0.021
Neg logit contributions
 someone-0.028
ily-0.026
Fore-0.024
 Claire-0.023
Prin-0.023
Token met
Token ID1138
Feature activation+4.23e-01

Pos logit contributions
 dodging+0.031
iled+0.029
arate+0.029
ples+0.029
ang+0.029
Neg logit contributions
 someone-0.018
ily-0.017
Prin-0.015
Fore-0.014
 Claire-0.014
Token a
Token ID257
Feature activation+4.04e-01

Pos logit contributions
 dodging+0.014
iled+0.013
arate+0.013
flies+0.013
ang+0.013
Neg logit contributions
ily-0.006
 someone-0.006
Prin-0.006
 Claire-0.005
Fore-0.005
Token little
Token ID1310
Feature activation+1.00e-01

Pos logit contributions
 dodging+0.013
iled+0.012
ples+0.012
iosity+0.012
arate+0.012
Neg logit contributions
 someone-0.007
ily-0.007
Prin-0.006
Fore-0.006
 Claire-0.006
Token bird
Token ID6512
Feature activation+2.69e-01

Pos logit contributions
 dodging+0.003
 mastering+0.003
ples+0.003
itching+0.003
istent+0.003
Neg logit contributions
 someone-0.002
ily-0.002
Prin-0.002
 Claire-0.002
 Lily-0.001
Token by
Token ID416
Feature activation+1.05e+00

Pos logit contributions
 dodging+0.053
arate+0.052
oing+0.051
akable+0.051
flies+0.050
Neg logit contributions
 someone-0.025
ily-0.021
 out-0.018
 time-0.016
 other-0.015
Token himself
Token ID2241
Feature activation+2.25e-01

Pos logit contributions
 dodging+0.030
iled+0.029
arate+0.029
ples+0.029
oing+0.028
Neg logit contributions
 someone-0.021
ily-0.020
Fore-0.017
Prin-0.017
 Claire-0.017
Token.
Token ID13
Feature activation+1.71e-01

Pos logit contributions
 dodging+0.006
arate+0.006
iled+0.006
ples+0.006
ang+0.006
Neg logit contributions
 someone-0.005
ily-0.005
Fore-0.004
 Claire-0.004
Prin-0.004
Token He
Token ID679
Feature activation+3.77e-01

Pos logit contributions
oc+0.005
iled+0.004
 dodging+0.004
ang+0.004
flies+0.004
Neg logit contributions
Prin-0.003
ily-0.003
Fore-0.003
P-0.003
abby-0.003
Token looked
Token ID3114
Feature activation-2.57e-01

Pos logit contributions
 dodging+0.010
iled+0.010
arate+0.009
oing+0.009
ples+0.009
Neg logit contributions
 someone-0.008
ily-0.008
Fore-0.007
Prin-0.007
 Claire-0.007
Token around
Token ID1088
Feature activation+5.62e-01

Pos logit contributions
 out+0.004
 someone+0.004
ily+0.004
 Lily+0.003
 up+0.003
Neg logit contributions
oing-0.008
ples-0.008
arians-0.008
flies-0.008
girls-0.008
Token and
Token ID290
Feature activation-7.39e-01

Pos logit contributions
ples+0.019
oing+0.018
ang+0.018
ience+0.018
tails+0.018
Neg logit contributions
 someone-0.010
ily-0.009
 out-0.008
 time-0.008
 her-0.008
Token saw
Token ID2497
Feature activation-6.58e-01

Pos logit contributions
 someone+0.014
ily+0.013
 Claire+0.011
Prin+0.011
Fore+0.010
Neg logit contributions
 dodging-0.022
iled-0.022
oing-0.022
ples-0.022
arate-0.022
Token a
Token ID257
Feature activation-8.10e-01

Pos logit contributions
ily+0.010
 someone+0.010
Prin+0.009
Fore+0.009
 Claire+0.009
Neg logit contributions
 dodging-0.021
iled-0.020
flies-0.020
arate-0.020
ang-0.019
Token friendly
Token ID8030
Feature activation+1.51e+00

Pos logit contributions
 someone+0.015
ily+0.014
Fore+0.013
Prin+0.013
 Claire+0.012
Neg logit contributions
 dodging-0.024
iled-0.023
arate-0.023
ples-0.023
iosity-0.023
Token bird
Token ID6512
Feature activation+7.33e-02

Pos logit contributions
 dodging+0.031
iled+0.029
oing+0.028
arate+0.028
flies+0.028
Neg logit contributions
ily-0.041
 someone-0.040
Fore-0.039
Prin-0.038
 Claire-0.036
Token.
Token ID13
Feature activation-2.97e-01

Pos logit contributions
 dodging+0.007
iled+0.007
arate+0.007
oing+0.007
ples+0.007
Neg logit contributions
 someone-0.006
ily-0.006
Prin-0.005
 Claire-0.005
Fore-0.005
Token
Token ID198
Feature activation+3.36e-01

Pos logit contributions
P+0.007
Prin+0.007
Fore+0.007
ackers+0.006
abby+0.006
Neg logit contributions
 dodging-0.007
iled-0.006
arate-0.006
iosity-0.006
 mastering-0.006
Token
Token ID198
Feature activation+1.48e-01

Pos logit contributions
iled+0.008
oc+0.008
arate+0.008
ines+0.008
ang+0.007
Neg logit contributions
Prin-0.006
 Claire-0.006
 Nek-0.006
 someone-0.006
P-0.006
Token"
Token ID1
Feature activation+4.88e-01

Pos logit contributions
 dodging+0.004
iled+0.004
arate+0.004
ang+0.004
iosity+0.004
Neg logit contributions
 someone-0.003
ily-0.003
 Claire-0.003
Prin-0.003
Fore-0.003
TokenWill
Token ID8743
Feature activation-4.22e-01

Pos logit contributions
 dodging+0.011
arate+0.009
 looming+0.009
king+0.009
 tug+0.009
Neg logit contributions
ackers-0.010
ily-0.010
 out-0.010
 someone-0.010
emo-0.010
Token you
Token ID345
Feature activation+6.02e-01

Pos logit contributions
 someone+0.008
ily+0.008
Fore+0.007
Prin+0.007
 Claire+0.007
Neg logit contributions
 dodging-0.012
iled-0.012
arate-0.012
flies-0.012
ang-0.012
Token come
Token ID1282
Feature activation+3.69e-01

Pos logit contributions
 dodging+0.018
ples+0.018
iled+0.017
oing+0.017
ang+0.017
Neg logit contributions
 someone-0.012
ily-0.011
Prin-0.010
 Claire-0.009
Fore-0.009
Token back
Token ID736
Feature activation+5.84e-01

Pos logit contributions
 dodging+0.013
iled+0.013
ples+0.013
ang+0.013
oing+0.013
Neg logit contributions
 someone-0.005
ily-0.004
 out-0.004
Prin-0.003
 Claire-0.003
Token?"
Token ID1701
Feature activation-5.00e-01

Pos logit contributions
 dodging+0.014
iled+0.013
arate+0.013
flies+0.013
ang+0.013
Neg logit contributions
ily-0.014
 someone-0.013
Fore-0.013
Prin-0.012
P-0.012
Token Tom
Token ID4186
Feature activation-3.75e-01

Pos logit contributions
Fore+0.009
ily+0.009
Prin+0.008
ackers+0.008
 someone+0.008
Neg logit contributions
 dodging-0.013
arate-0.013
iled-0.013
iosity-0.013
 tug-0.013
Token asked
Token ID1965
Feature activation-5.41e-02

Pos logit contributions
 someone+0.007
ily+0.007
Fore+0.007
Prin+0.006
 Claire+0.006
Neg logit contributions
 dodging-0.010
iled-0.010
arate-0.010
ang-0.010
iosity-0.010
Token band
Token ID4097
Feature activation+5.68e-01

Pos logit contributions
 dodging+0.029
iled+0.028
flies+0.027
ples+0.027
arate+0.027
Neg logit contributions
 someone-0.025
ily-0.022
Prin-0.020
Fore-0.020
P-0.020
Tokenage
Token ID496
Feature activation+1.44e+00

Pos logit contributions
 dodging+0.015
arate+0.013
ples+0.013
iled+0.013
flies+0.013
Neg logit contributions
 someone-0.013
ily-0.013
Prin-0.011
Fore-0.011
 Claire-0.011
Token with
Token ID351
Feature activation-3.55e-01

Pos logit contributions
 dodging+0.036
iled+0.034
arate+0.033
ples+0.033
oing+0.033
Neg logit contributions
 someone-0.034
ily-0.033
Prin-0.030
Fore-0.029
 Claire-0.029
Token pride
Token ID11293
Feature activation-2.09e-01

Pos logit contributions
 someone+0.007
ily+0.006
 Claire+0.005
Prin+0.005
Fore+0.005
Neg logit contributions
 dodging-0.011
oing-0.011
arate-0.011
iled-0.011
ang-0.011
Token.
Token ID13
Feature activation+6.72e-01

Pos logit contributions
 someone+0.004
ily+0.004
 Claire+0.003
 out+0.003
 Sophie+0.003
Neg logit contributions
 dodging-0.007
arate-0.006
akable-0.006
itching-0.006
ang-0.006
Token<|endoftext|>
Token ID50256
Feature activation+2.70e-01

Pos logit contributions
 dodging+0.019
iled+0.018
arate+0.018
ples+0.018
oing+0.018
Neg logit contributions
 someone-0.013
ily-0.013
Fore-0.012
Prin-0.011
 Claire-0.011
TokenThe
Token ID464
Feature activation+3.81e-01

Pos logit contributions
oc+0.006
iled+0.006
ang+0.006
arate+0.006
girls+0.006
Neg logit contributions
 ma-0.006
abby-0.006
oph-0.005
 someone-0.005
Fore-0.005
Token teacher
Token ID4701
Feature activation-8.01e-01

Pos logit contributions
 dodging+0.011
iled+0.011
ang+0.011
arate+0.011
flies+0.011
Neg logit contributions
ily-0.006
 someone-0.006
Prin-0.006
Fore-0.006
 Claire-0.005
Token stood
Token ID6204
Feature activation+4.69e-01

Pos logit contributions
 someone+0.017
ily+0.017
Fore+0.015
Prin+0.015
 Claire+0.014
Neg logit contributions
 dodging-0.021
iled-0.020
arate-0.020
ples-0.020
flies-0.019
Token at
Token ID379
Feature activation+1.79e+00

Pos logit contributions
 dodging+0.014
iled+0.014
arate+0.013
ples+0.013
flies+0.013
Neg logit contributions
 someone-0.009
ily-0.008
Prin-0.007
Fore-0.007
 Claire-0.007
Token the
Token ID262
Feature activation+7.01e-01

Pos logit contributions
 dodging+0.052
arate+0.049
flies+0.049
iled+0.048
ky+0.048
Neg logit contributions
ily-0.032
 someone-0.030
Fore-0.029
Prin-0.028
P-0.027
Token She
Token ID1375
Feature activation+4.24e-01

Pos logit contributions
iled+0.010
oc+0.010
 dodging+0.010
arate+0.010
ang+0.009
Neg logit contributions
 someone-0.009
ily-0.009
Fore-0.009
P-0.009
Prin-0.009
Token continued
Token ID3767
Feature activation+3.69e-01

Pos logit contributions
 dodging+0.012
iled+0.012
oing+0.011
arate+0.011
ples+0.011
Neg logit contributions
 someone-0.008
ily-0.008
Fore-0.007
 Claire-0.007
Prin-0.007
Token to
Token ID284
Feature activation-3.83e-01

Pos logit contributions
flies+0.011
ang+0.011
 dodging+0.011
iled+0.011
ples+0.011
Neg logit contributions
 someone-0.007
 Claire-0.006
ily-0.006
 out-0.005
 her-0.005
Token bring
Token ID2222
Feature activation-5.22e-01

Pos logit contributions
 someone+0.009
ily+0.008
 Claire+0.008
Prin+0.007
Fore+0.007
Neg logit contributions
 dodging-0.010
iled-0.010
ang-0.010
oing-0.009
arate-0.009
Token beef
Token ID12023
Feature activation+6.55e-01

Pos logit contributions
ily+0.011
Fore+0.010
Prin+0.010
ackers+0.009
abby+0.009
Neg logit contributions
 dodging-0.013
arate-0.012
iled-0.012
oc-0.012
ples-0.012
Token to
Token ID284
Feature activation-2.69e-02

Pos logit contributions
 dodging+0.018
ples+0.017
iled+0.017
ang+0.017
oing+0.017
Neg logit contributions
 someone-0.014
ily-0.013
 Claire-0.012
Fore-0.011
Prin-0.011
Token her
Token ID607
Feature activation+1.12e-01

Pos logit contributions
ily+0.000
Fore+0.000
Prin+0.000
 someone+0.000
P+0.000
Neg logit contributions
 dodging-0.001
iled-0.001
 mastering-0.001
arate-0.001
 roaring-0.001
Token neighbors
Token ID12020
Feature activation-3.77e-01

Pos logit contributions
iled+0.004
 dodging+0.003
ples+0.003
oing+0.003
istent+0.003
Neg logit contributions
 someone-0.002
 Claire-0.002
ily-0.002
 other-0.002
 time-0.001
Token,
Token ID11
Feature activation+1.40e+00

Pos logit contributions
 someone+0.007
ily+0.007
 Claire+0.007
 out+0.006
 Sophie+0.006
Neg logit contributions
 dodging-0.011
itching-0.010
iosity-0.010
oing-0.010
arate-0.010
Token and
Token ID290
Feature activation-4.33e-01

Pos logit contributions
 dodging+0.034
iled+0.033
arate+0.032
ang+0.032
ples+0.032
Neg logit contributions
 someone-0.034
ily-0.032
 Claire-0.030
Prin-0.029
Fore-0.029
Token no
Token ID645
Feature activation-9.15e-01

Pos logit contributions
ily+0.009
 someone+0.009
Fore+0.008
Prin+0.008
 Claire+0.008
Neg logit contributions
 dodging-0.012
iled-0.011
arate-0.011
flies-0.011
ples-0.010
Token box
Token ID3091
Feature activation+9.83e-01

Pos logit contributions
years+0.028
 dodging+0.028
iled+0.028
ples+0.027
istent+0.027
Neg logit contributions
 someone-0.018
ily-0.016
 Claire-0.015
 out-0.014
 Lily-0.014
Token with
Token ID351
Feature activation-4.62e-01

Pos logit contributions
 dodging+0.031
girls+0.029
iosity+0.029
tain+0.029
ience+0.029
Neg logit contributions
ily-0.018
 someone-0.018
 out-0.014
Fore-0.013
 Claire-0.013
Token many
Token ID867
Feature activation-3.72e-01

Pos logit contributions
 someone+0.007
ily+0.006
Fore+0.005
Prin+0.005
 Claire+0.005
Neg logit contributions
 dodging-0.016
iled-0.016
arate-0.015
oing-0.015
ang-0.015
Token nails
Token ID23361
Feature activation-5.65e-01

Pos logit contributions
 someone+0.006
ily+0.006
 other+0.005
 flour+0.005
 time+0.004
Neg logit contributions
 dodging-0.013
oing-0.013
istent-0.013
ishly-0.013
iled-0.013
Token inside
Token ID2641
Feature activation-5.94e-01

Pos logit contributions
ily+0.010
 someone+0.010
 out+0.008
Fore+0.007
 flour+0.007
Neg logit contributions
 dodging-0.017
arate-0.016
oing-0.016
oc-0.016
iosity-0.016
Token.
Token ID13
Feature activation-1.90e-01

Pos logit contributions
 someone+0.012
ily+0.011
 Claire+0.010
Fore+0.009
 Sophie+0.009
Neg logit contributions
 dodging-0.018
arate-0.017
oing-0.017
iled-0.017
iosity-0.017
Token Lily
Token ID20037
Feature activation-6.58e-01

Pos logit contributions
Fore+0.005
Prin+0.005
P+0.005
abby+0.005
ily+0.005
Neg logit contributions
iled-0.004
 dodging-0.004
arate-0.004
 because-0.003
ang-0.003
Token wanted
Token ID2227
Feature activation-5.22e-02

Pos logit contributions
 someone+0.013
ily+0.013
 Claire+0.011
Fore+0.011
Prin+0.011
Neg logit contributions
 dodging-0.020
oing-0.018
arate-0.018
flies-0.018
ples-0.018
Token to
Token ID284
Feature activation+3.53e-02

Pos logit contributions
 someone+0.001
ily+0.001
Fore+0.001
 out+0.001
 flour+0.001
Neg logit contributions
 dodging-0.002
oing-0.002
ang-0.002
oc-0.002
iosity-0.002
Token make
Token ID787
Feature activation+7.53e-01

Pos logit contributions
 dodging+0.001
iled+0.001
oing+0.001
flies+0.001
ience+0.001
Neg logit contributions
 someone-0.001
ily-0.001
 Claire-0.001
Prin-0.001
Fore-0.001
Token something
Token ID1223
Feature activation+3.83e-01

Pos logit contributions
 dodging+0.027
iled+0.026
arate+0.025
oing+0.025
ples+0.025
Neg logit contributions
 someone-0.010
ily-0.010
Fore-0.008
Prin-0.008
 Claire-0.007
Token.
Token ID13
Feature activation+2.26e-01

Pos logit contributions
 someone+0.002
ily+0.002
 Claire+0.001
 Sophie+0.001
 time+0.001
Neg logit contributions
oing-0.006
flies-0.006
 dodging-0.006
ang-0.006
iled-0.006
Token To
Token ID1675
Feature activation+6.36e-01

Pos logit contributions
 dodging+0.005
iled+0.005
arate+0.005
ples+0.005
ang+0.005
Neg logit contributions
ily-0.005
 someone-0.005
Prin-0.005
Fore-0.005
P-0.005
Token his
Token ID465
Feature activation-5.32e-02

Pos logit contributions
 dodging+0.018
iled+0.018
arate+0.017
ples+0.017
oc+0.017
Neg logit contributions
ily-0.012
 someone-0.011
Prin-0.011
Fore-0.011
 Claire-0.010
Token surprise
Token ID5975
Feature activation-1.80e-01

Pos logit contributions
 someone+0.001
ily+0.001
Fore+0.001
 Claire+0.001
Prin+0.001
Neg logit contributions
 dodging-0.001
iled-0.001
ples-0.001
oing-0.001
ang-0.001
Token,
Token ID11
Feature activation-3.96e-01

Pos logit contributions
Prin+0.005
ily+0.005
Fore+0.005
 someone+0.005
P+0.005
Neg logit contributions
 dodging-0.003
iled-0.003
arate-0.003
iosity-0.003
ang-0.003
Token he
Token ID339
Feature activation+4.17e-02

Pos logit contributions
ily+0.009
Prin+0.009
 someone+0.009
Fore+0.008
 Claire+0.008
Neg logit contributions
 dodging-0.010
iled-0.009
arate-0.009
ang-0.009
flies-0.009
Token saw
Token ID2497
Feature activation-5.29e-01

Pos logit contributions
 dodging+0.001
iled+0.001
arate+0.001
flies+0.001
ples+0.001
Neg logit contributions
ily-0.001
 someone-0.001
Fore-0.001
Prin-0.001
 Claire-0.001
Token a
Token ID257
Feature activation-8.91e-01

Pos logit contributions
ily+0.009
Prin+0.008
Fore+0.008
 someone+0.008
 Claire+0.008
Neg logit contributions
 dodging-0.016
iled-0.015
flies-0.015
arate-0.015
ples-0.014
Token little
Token ID1310
Feature activation-7.97e-01

Pos logit contributions
 someone+0.018
ily+0.015
 Claire+0.015
Prin+0.014
Fore+0.014
Neg logit contributions
 dodging-0.026
iled-0.025
iosity-0.025
ples-0.025
arate-0.025
Token frog
Token ID21264
Feature activation+5.72e-01

Pos logit contributions
 someone+0.015
ily+0.012
 Claire+0.012
Prin+0.011
 out+0.011
Neg logit contributions
 dodging-0.024
itching-0.024
 mastering-0.023
iled-0.023
girls-0.023
Token!
Token ID0
Feature activation-2.40e-01

Pos logit contributions
 dodging+0.016
iosity+0.015
tain+0.015
istent+0.015
ples+0.015
Neg logit contributions
 someone-0.012
ily-0.011
 out-0.010
 Claire-0.009
P-0.009
Token you
Token ID345
Feature activation-1.64e+00

Pos logit contributions
Prin+0.012
ily+0.012
Fore+0.012
 someone+0.011
P+0.011
Neg logit contributions
iled-0.017
arate-0.016
 dodging-0.016
tain-0.016
ang-0.016
Token can
Token ID460
Feature activation-1.79e+00

Pos logit contributions
 someone+0.037
ily+0.036
Fore+0.032
Prin+0.032
 Claire+0.031
Neg logit contributions
 dodging-0.044
iled-0.040
oing-0.040
arate-0.039
ples-0.039
Token carry
Token ID3283
Feature activation-4.67e-01

Pos logit contributions
 someone+0.035
ily+0.034
 Claire+0.029
Prin+0.029
Fore+0.029
Neg logit contributions
 dodging-0.053
iled-0.050
flies-0.049
oing-0.049
ples-0.049
Token it
Token ID340
Feature activation-6.93e-01

Pos logit contributions
ily+0.009
 someone+0.009
Prin+0.008
Fore+0.008
 Claire+0.008
Neg logit contributions
 dodging-0.012
iled-0.012
arate-0.012
iosity-0.012
tain-0.012
Token around
Token ID1088
Feature activation+4.08e-01

Pos logit contributions
ily+0.016
 someone+0.016
Fore+0.015
Prin+0.014
 Claire+0.014
Neg logit contributions
 dodging-0.017
iled-0.016
arate-0.015
flies-0.015
tain-0.015
Token."
Token ID526
Feature activation-6.24e-01

Pos logit contributions
 dodging+0.011
iled+0.011
arate+0.011
ples+0.011
ang+0.010
Neg logit contributions
 someone-0.008
ily-0.008
Fore-0.007
Prin-0.007
 Claire-0.007
Token
Token ID198
Feature activation+3.30e-01

Pos logit contributions
ily+0.009
 someone+0.009
Fore+0.008
Prin+0.008
 Claire+0.008
Neg logit contributions
 dodging-0.020
iled-0.020
arate-0.019
ples-0.019
oing-0.019
Token
Token ID198
Feature activation+2.03e-01

Pos logit contributions
iled+0.010
arate+0.009
oing+0.009
 dodging+0.009
oc+0.009
Neg logit contributions
ily-0.005
 someone-0.005
 Claire-0.005
ackers-0.005
Prin-0.004
TokenSarah
Token ID29284
Feature activation+8.99e-02

Pos logit contributions
 dodging+0.006
flies+0.006
iled+0.006
ang+0.006
itching+0.006
Neg logit contributions
 someone-0.004
ily-0.004
Fore-0.003
Prin-0.003
P-0.003
Token was
Token ID373
Feature activation+5.18e-02

Pos logit contributions
 dodging+0.003
iled+0.003
arate+0.003
oing+0.003
flies+0.003
Neg logit contributions
 someone-0.002
ily-0.002
Prin-0.001
Fore-0.001
 Claire-0.001
Token so
Token ID523
Feature activation+6.59e-01

Pos logit contributions
 dodging+0.001
iled+0.001
arate+0.001
ples+0.001
ang+0.001
Neg logit contributions
 someone-0.001
ily-0.001
Prin-0.001
Fore-0.001
 Claire-0.001
Token L
Token ID406
Feature activation+3.65e-01

Pos logit contributions
 dodging+0.010
iled+0.010
ples+0.009
ang+0.009
flies+0.009
Neg logit contributions
 someone-0.007
 Claire-0.006
ily-0.006
Prin-0.006
Fore-0.006
Tokenila
Token ID10102
Feature activation+1.19e+00

Pos logit contributions
ang+0.011
iled+0.011
arate+0.010
oing+0.010
 dodging+0.010
Neg logit contributions
 Claire-0.007
 someone-0.007
Prin-0.006
Fore-0.006
atalie-0.006
Token's
Token ID338
Feature activation-3.78e-01

Pos logit contributions
 dodging+0.042
arate+0.042
iosity+0.040
ples+0.040
ang+0.040
Neg logit contributions
 someone-0.015
ily-0.015
 out-0.011
Fore-0.010
Prin-0.010
Token hand
Token ID1021
Feature activation+6.61e-01

Pos logit contributions
 someone+0.009
ily+0.009
Fore+0.008
 Claire+0.008
Prin+0.008
Neg logit contributions
 dodging-0.009
iled-0.009
arate-0.009
ples-0.009
ang-0.009
Token.
Token ID13
Feature activation-1.26e-01

Pos logit contributions
arate+0.020
 dodging+0.019
ang+0.019
iosity+0.019
 mastering+0.018
Neg logit contributions
 someone-0.013
ily-0.012
 out-0.011
Fore-0.009
Prin-0.009
Token L
Token ID406
Feature activation-6.70e-01

Pos logit contributions
 someone+0.003
ily+0.002
Fore+0.002
Prin+0.002
 Claire+0.002
Neg logit contributions
 dodging-0.003
iled-0.003
ples-0.003
arate-0.003
ang-0.003
Tokenila
Token ID10102
Feature activation+1.49e-01

Pos logit contributions
 someone+0.014
 Claire+0.013
Prin+0.012
Fore+0.011
atalie+0.011
Neg logit contributions
ang-0.020
iled-0.019
arate-0.018
oing-0.017
 dodging-0.017
Token cried
Token ID16896
Feature activation+8.19e-02

Pos logit contributions
 dodging+0.005
arate+0.005
tain+0.005
iosity+0.005
ples+0.005
Neg logit contributions
ily-0.003
 someone-0.002
Prin-0.002
Fore-0.002
 Claire-0.002
Token out
Token ID503
Feature activation+1.45e+00

Pos logit contributions
 dodging+0.002
arate+0.002
iled+0.002
ang+0.002
ples+0.002
Neg logit contributions
 someone-0.002
ily-0.002
Prin-0.001
Fore-0.001
 Claire-0.001
Token loud
Token ID7812
Feature activation+1.13e+00

Pos logit contributions
arate+0.054
ience+0.053
ci+0.052
istent+0.052
arians+0.052
Neg logit contributions
 someone-0.025
 out-0.022
ily-0.020
 her-0.019
 Lily-0.018
Token.
Token ID13
Feature activation+5.86e-02

Pos logit contributions
ang+0.038
arate+0.037
itching+0.036
 dodging+0.035
oing+0.035
Neg logit contributions
 someone-0.021
ily-0.019
 out-0.015
 Lily-0.015
 her-0.014
Token were
Token ID547
Feature activation-4.06e-01

Pos logit contributions
ily+0.002
 went+0.002
 someone+0.002
 go+0.002
 Lily+0.002
Neg logit contributions
oing-0.005
 dodging-0.005
itching-0.005
arate-0.005
ang-0.005
Token having
Token ID1719
Feature activation+1.29e+00

Pos logit contributions
 someone+0.010
ily+0.010
 Claire+0.009
Prin+0.009
 out+0.008
Neg logit contributions
oing-0.010
oc-0.010
iled-0.009
arate-0.009
 dodging-0.009
Token fun
Token ID1257
Feature activation-3.25e-01

Pos logit contributions
oing+0.041
 dodging+0.041
ang+0.041
iled+0.041
flies+0.041
Neg logit contributions
 someone-0.024
ily-0.022
 Claire-0.019
Fore-0.018
Prin-0.018
Token,
Token ID11
Feature activation+2.70e-01

Pos logit contributions
 someone+0.005
 time+0.004
+0.003
ily+0.003
 everything+0.003
Neg logit contributions
ience-0.012
ifying-0.012
chie-0.012
fly-0.012
oing-0.012
Token but
Token ID475
Feature activation+1.75e-01

Pos logit contributions
iled+0.007
oc+0.007
ang+0.007
oing+0.007
ples+0.007
Neg logit contributions
 someone-0.007
ily-0.006
 Lily-0.005
 Claire-0.005
 Sara-0.005
Token then
Token ID788
Feature activation-2.00e+00

Pos logit contributions
 dodging+0.005
iled+0.005
arate+0.004
ang+0.004
iosity+0.004
Neg logit contributions
ily-0.004
 someone-0.004
Prin-0.003
Fore-0.003
 Claire-0.003
Token Sara
Token ID24799
Feature activation-1.96e-01

Pos logit contributions
ily+0.036
 someone+0.033
Prin+0.033
Fore+0.033
P+0.029
Neg logit contributions
 dodging-0.059
iled-0.056
flies-0.055
arate-0.055
iosity-0.054
Token saw
Token ID2497
Feature activation-5.33e-01

Pos logit contributions
 someone+0.004
ily+0.004
Prin+0.003
 Claire+0.003
Fore+0.003
Neg logit contributions
 dodging-0.006
arate-0.006
iled-0.006
ang-0.005
oing-0.005
Token Ben
Token ID3932
Feature activation+1.19e+00

Pos logit contributions
Prin+0.010
Fore+0.010
abby+0.010
atalie+0.009
ily+0.009
Neg logit contributions
 dodging-0.014
iled-0.014
flies-0.013
arate-0.013
 pretending-0.013
Token's
Token ID338
Feature activation+1.47e-01

Pos logit contributions
ang+0.037
arate+0.037
ples+0.037
years+0.037
 dodging+0.037
Neg logit contributions
 someone-0.020
ily-0.018
 out-0.015
 Claire-0.015
 her-0.014
Token car
Token ID1097
Feature activation+9.63e-01

Pos logit contributions
iled+0.005
 dodging+0.005
ples+0.005
arate+0.005
oing+0.005
Neg logit contributions
 someone-0.003
ily-0.002
 Claire-0.002
 time-0.002
 Lily-0.002
TokenThis
Token ID1212
Feature activation+1.02e+00

Pos logit contributions
 dodging+0.001
iled+0.001
arate+0.001
ples+0.001
ang+0.001
Neg logit contributions
 someone-0.001
ily-0.001
Fore-0.000
 Claire-0.000
Prin-0.000
Token is
Token ID318
Feature activation+5.20e-01

Pos logit contributions
 dodging+0.028
iled+0.027
arate+0.027
flies+0.026
iosity+0.026
Neg logit contributions
ily-0.021
 someone-0.020
Prin-0.019
Fore-0.019
 Claire-0.018
Token a
Token ID257
Feature activation-6.11e-01

Pos logit contributions
 dodging+0.017
iled+0.016
arate+0.016
ples+0.016
flies+0.016
Neg logit contributions
 someone-0.008
ily-0.008
Prin-0.007
Fore-0.007
 Claire-0.007
Token mineral
Token ID18352
Feature activation-3.09e-01

Pos logit contributions
 someone+0.014
ily+0.012
Fore+0.012
Prin+0.011
 Claire+0.011
Neg logit contributions
 dodging-0.016
iled-0.016
ples-0.016
oing-0.016
flies-0.016
Token,
Token ID11
Feature activation+1.26e+00

Pos logit contributions
 someone+0.006
ily+0.006
 out+0.005
 Claire+0.005
 Lily+0.004
Neg logit contributions
 dodging-0.010
years-0.009
oing-0.009
 mastering-0.009
ples-0.009
Token children
Token ID1751
Feature activation-2.64e+00

Pos logit contributions
 dodging+0.038
iled+0.037
oing+0.037
arate+0.037
ang+0.036
Neg logit contributions
 someone-0.025
ily-0.023
 Claire-0.021
 Sophie-0.019
Fore-0.019
Token.
Token ID13
Feature activation-2.77e-01

Pos logit contributions
 someone+0.043
ily+0.043
 out+0.035
 Claire+0.034
 Lily+0.032
Neg logit contributions
 dodging-0.089
itching-0.087
oing-0.085
akable-0.085
 terrifying-0.084
Token It
Token ID632
Feature activation+1.84e-02

Pos logit contributions
 someone+0.005
ily+0.005
 Claire+0.004
Prin+0.004
Fore+0.004
Neg logit contributions
 dodging-0.009
itching-0.008
oing-0.008
ang-0.008
iled-0.008
Token is
Token ID318
Feature activation-1.54e-01

Pos logit contributions
 dodging+0.001
oing+0.001
iled+0.001
ang+0.001
itching+0.001
Neg logit contributions
 someone-0.000
ily-0.000
Fore-0.000
 Claire-0.000
Prin-0.000
Token a
Token ID257
Feature activation-1.06e+00

Pos logit contributions
 someone+0.003
ily+0.003
Fore+0.003
 Claire+0.003
Prin+0.003
Neg logit contributions
 dodging-0.004
iled-0.004
arate-0.004
oing-0.004
ang-0.004
Token part
Token ID636
Feature activation+4.83e-01

Pos logit contributions
 someone+0.023
ily+0.020
Fore+0.020
 Claire+0.019
Prin+0.018
Neg logit contributions
 dodging-0.029
iled-0.029
ples-0.028
oing-0.028
ang-0.028
Token looking
Token ID2045
Feature activation-5.18e-01

Pos logit contributions
 someone+0.032
ily+0.030
 out+0.028
P+0.026
Prin+0.025
Neg logit contributions
istent-0.053
iled-0.053
vered-0.052
 dodging-0.052
oc-0.052
Token for
Token ID329
Feature activation-2.42e-01

Pos logit contributions
 someone+0.014
ily+0.014
Fore+0.013
Prin+0.013
 Claire+0.013
Neg logit contributions
 dodging-0.010
iled-0.010
ang-0.010
arate-0.009
oing-0.009
Token them
Token ID606
Feature activation+8.65e-03

Pos logit contributions
ily+0.005
 someone+0.004
Fore+0.004
Prin+0.004
atalie+0.004
Neg logit contributions
 dodging-0.007
ples-0.006
iled-0.006
arate-0.006
flies-0.006
Token in
Token ID287
Feature activation-2.11e-01

Pos logit contributions
 dodging+0.000
istent+0.000
oc+0.000
ples+0.000
arate+0.000
Neg logit contributions
 someone-0.000
ily-0.000
 out-0.000
 Claire-0.000
 her-0.000
Token the
Token ID262
Feature activation-5.98e-01

Pos logit contributions
 someone+0.003
ily+0.003
Prin+0.003
Fore+0.003
 Claire+0.002
Neg logit contributions
 dodging-0.007
iled-0.007
arate-0.007
ples-0.007
ang-0.007
Token me
Token ID502
Feature activation-2.23e+00

Pos logit contributions
ily+0.013
 someone+0.012
atalie+0.011
Prin+0.011
Fore+0.011
Neg logit contributions
 dodging-0.014
ky-0.014
arate-0.013
oc-0.013
 herd-0.013
Tokenadow
Token ID4584
Feature activation-5.96e-01

Pos logit contributions
 someone+0.059
ily+0.057
Prin+0.055
Fore+0.054
 Claire+0.054
Neg logit contributions
iled-0.048
 dodging-0.046
ang-0.045
arate-0.044
oing-0.044
Token.
Token ID13
Feature activation-3.12e-02

Pos logit contributions
 someone+0.014
ily+0.014
Fore+0.012
Prin+0.012
 Claire+0.012
Neg logit contributions
 dodging-0.015
iled-0.014
arate-0.014
ples-0.014
ang-0.014
Token She
Token ID1375
Feature activation+4.72e-03

Pos logit contributions
P+0.001
Fore+0.001
abby+0.001
ily+0.001
 someone+0.001
Neg logit contributions
 because-0.000
tail-0.000
tain-0.000
oc-0.000
arate-0.000
Token was
Token ID373
Feature activation+1.78e-02

Pos logit contributions
 dodging+0.000
iled+0.000
arate+0.000
ang+0.000
ples+0.000
Neg logit contributions
 someone-0.000
ily-0.000
Fore-0.000
Prin-0.000
 Claire-0.000
Token very
Token ID845
Feature activation+8.67e-01

Pos logit contributions
 dodging+0.000
iled+0.000
arate+0.000
ples+0.000
ang+0.000
Neg logit contributions
 someone-0.000
ily-0.000
Fore-0.000
Prin-0.000
 Claire-0.000
Token.
Token ID13
Feature activation-3.97e-01

Pos logit contributions
 someone+0.005
ily+0.004
 time+0.004
 rest+0.004
 out+0.003
Neg logit contributions
ang-0.009
arate-0.009
 dodging-0.009
flies-0.009
ples-0.009
Token "
Token ID366
Feature activation-1.02e-01

Pos logit contributions
ily+0.008
P+0.008
Fore+0.008
 someone+0.007
Prin+0.007
Neg logit contributions
 dodging-0.010
iled-0.010
flies-0.009
oing-0.009
oc-0.009
TokenThank
Token ID10449
Feature activation+5.75e-01

Pos logit contributions
 someone+0.002
ily+0.002
 Claire+0.002
Prin+0.002
atalie+0.002
Neg logit contributions
 dodging-0.003
iled-0.003
arate-0.002
ang-0.002
oc-0.002
Token you
Token ID345
Feature activation+8.20e-02

Pos logit contributions
iled+0.015
arate+0.014
 dodging+0.014
ang+0.014
iosity+0.014
Neg logit contributions
 someone-0.011
Prin-0.011
ackers-0.011
ily-0.011
atalie-0.011
Token,
Token ID11
Feature activation+4.46e-01

Pos logit contributions
 dodging+0.002
iled+0.002
iosity+0.002
oc+0.002
arate+0.002
Neg logit contributions
 someone-0.002
ily-0.002
Prin-0.002
Fore-0.002
 Claire-0.002
Token mom
Token ID1995
Feature activation-2.29e+00

Pos logit contributions
 dodging+0.016
iled+0.015
iosity+0.015
flies+0.015
arate+0.015
Neg logit contributions
 someone-0.005
ily-0.005
Prin-0.004
Fore-0.004
P-0.004
Tokenmy
Token ID1820
Feature activation-1.46e+00

Pos logit contributions
 someone+0.041
ily+0.041
 Claire+0.037
Fore+0.037
Prin+0.036
Neg logit contributions
iled-0.066
 dodging-0.065
oc-0.064
flies-0.063
ples-0.062
Token!
Token ID0
Feature activation+5.39e-01

Pos logit contributions
 someone+0.029
ily+0.028
Fore+0.025
Prin+0.025
 Claire+0.025
Neg logit contributions
 dodging-0.042
iled-0.040
arate-0.039
ples-0.039
flies-0.038
Token You
Token ID921
Feature activation-4.15e-02

Pos logit contributions
iled+0.013
 dodging+0.013
arate+0.012
ang+0.012
oc+0.012
Neg logit contributions
 someone-0.013
ily-0.012
Prin-0.012
Fore-0.011
 Claire-0.011
Token are
Token ID389
Feature activation-1.96e-01

Pos logit contributions
 someone+0.001
ily+0.001
Fore+0.001
Prin+0.001
 Claire+0.001
Neg logit contributions
 dodging-0.001
iled-0.001
arate-0.001
ples-0.001
flies-0.001
Token very
Token ID845
Feature activation+1.10e+00

Pos logit contributions
 someone+0.004
ily+0.004
Fore+0.003
 Claire+0.003
Prin+0.003
Neg logit contributions
 dodging-0.006
oing-0.006
iled-0.006
arate-0.006
ang-0.006
Token a
Token ID257
Feature activation+8.16e-02

Pos logit contributions
 dodging+0.008
iled+0.007
arate+0.007
oing+0.007
ples+0.007
Neg logit contributions
 someone-0.003
ily-0.002
Fore-0.002
 Claire-0.002
Prin-0.002
Token cozy
Token ID37438
Feature activation+1.53e+00

Pos logit contributions
 dodging+0.003
iled+0.003
flies+0.003
ples+0.002
arate+0.002
Neg logit contributions
 someone-0.001
ily-0.001
Prin-0.001
Fore-0.001
 Claire-0.001
Token room
Token ID2119
Feature activation-1.04e+00

Pos logit contributions
 dodging+0.045
iled+0.044
arate+0.043
ples+0.042
oing+0.042
Neg logit contributions
 someone-0.029
ily-0.028
Prin-0.024
Fore-0.024
 Claire-0.024
Token with
Token ID351
Feature activation-6.43e-01

Pos logit contributions
 someone+0.017
ily+0.016
 Claire+0.012
 out+0.012
 time+0.012
Neg logit contributions
 dodging-0.035
girls-0.035
ience-0.035
istent-0.035
oing-0.034
Token many
Token ID867
Feature activation-1.90e-01

Pos logit contributions
ily+0.010
 someone+0.010
Fore+0.010
Prin+0.009
 Claire+0.009
Neg logit contributions
 dodging-0.019
iled-0.019
ples-0.018
flies-0.018
arate-0.018
Token toys
Token ID14958
Feature activation-2.26e+00

Pos logit contributions
 someone+0.003
ily+0.003
 other+0.002
 time+0.002
 Claire+0.002
Neg logit contributions
 dodging-0.007
iled-0.007
oing-0.006
flies-0.006
arate-0.006
Token.
Token ID13
Feature activation+3.66e-01

Pos logit contributions
ily+0.029
 someone+0.028
 out+0.024
 other+0.018
 went+0.018
Neg logit contributions
oc-0.087
 dodging-0.086
istent-0.086
oing-0.086
akable-0.085
Token
Token ID198
Feature activation+1.30e-01

Pos logit contributions
iled+0.010
oc+0.010
 dodging+0.010
arate+0.010
ang+0.010
Neg logit contributions
ily-0.006
Fore-0.006
 someone-0.006
abby-0.005
atalie-0.005
Token
Token ID198
Feature activation+3.02e-01

Pos logit contributions
oc+0.004
iled+0.004
ang+0.003
oing+0.003
arate+0.003
Neg logit contributions
 someone-0.002
 Claire-0.002
ily-0.002
Prin-0.002
 Nek-0.002
TokenOne
Token ID3198
Feature activation+6.74e-01

Pos logit contributions
 dodging+0.009
flies+0.009
iled+0.009
ples+0.009
iosity+0.009
Neg logit contributions
 someone-0.006
ily-0.006
Prin-0.005
Fore-0.005
 Claire-0.005
Token day
Token ID1110
Feature activation+2.08e-01

Pos logit contributions
 dodging+0.022
iled+0.021
arate+0.021
ples+0.020
ang+0.020
Neg logit contributions
 someone-0.011
ily-0.010
Fore-0.009
Prin-0.009
 Claire-0.009