TOP ACTIVATIONS
MAX = 3.457

there was a silly wood pe cker who wanted to mail
S andy was an inqu is itive little girl . Everywhere
sund ae with lots of topp ings . Tim my was
lion with a big man e and sharp teeth . Lily
around on a tight ro pe , he could even turn
out popped a little wood pe cker . "
horse with a long man e and a tail . And
a princess called R yle e . Every day , she
eyes and a long man e . Tom and Anna smiled
<|endoftext|> Andy was an inqu is itive 3 year old boy
it !" Isa ac gul ped and said ,
sund ae with lots of topp ings . They
horse with a long man e and a tail . It
a z ebra named Stri pe . Stri pe lived in
time there was a wood cut ter . He had an
lion with a fluffy man e . She saw a tall
lion with a golden man e . He looked at her
to wear shorts and T - shirts . They did not
horn and a rainbow man e . She was so happy
walking on a tight ro pe . He had to balance

MIN ACTIVATIONS
MIN = -3.268

it 's important to keep hed ges neat and tidy "
parents could lead to dangerous situations . From that day on
trick - or - t reating anymore . She sadly put
him how to think of situations from different perspectives , and
work . They trimmed the hed ges and cut the grass
things can come from bad situations . <|endoftext|> Once upon a
sandwich . Her mom made her a peanut butter and jelly
ended up in such bad situations . The
eat . Her mom made her a y ummy peanut butter
sandwich . Her mom made her a normal sandwich with peanut
that she had made the little boy so happy . She
. Her mom my made her a sandwich with peanut butter
could help him in difficult situations . <|endoftext|> Once upon a
” The little dog shook his head .
. Her mom my picked her up and said , "
day , her mom made her a y ummy sandwich with
also help you in tough situations . From that day on
" Hello , little girl . What 's your
, Lily 's mom made her some lemon ade , but
Her friends tried to make her feel better , but nothing

INTERVAL 1.776 - 3.457
CONTAINS 0.136%

, there lived an elderly man in a village . He
Jim was a happy little boy . He had a pet
horns . They say m oo ." Tom thinks
two friends who were two circles . One lived far away
set off with a big grin on his face . He

INTERVAL 0.094 - 1.776
CONTAINS 41.548%

, they saw Lily 's grand pa standing there with a
but it couldn 't . It didn 't even fl utter
years old and very brave . One day , his parents
I can sail ." <|endoftext|> Once upon a time , there
. The yellow flower saw a dish washer in

INTERVAL -1.587 - 0.094
CONTAINS 58.134%

could grow big and small . Mia loved her lamp very
a walk . She walked very slow , taking in all
the puppy had tried to bite him . He didn 't
From that day , Sally knew that when she was busy
decide to play a different game ." They started to play

INTERVAL -3.268 - -1.587
CONTAINS 0.182%

feeling better too . Max 's owner also gave him a
us ! Let ’ s share it ." Pat
that her kind heart made her special . They became good
,' she said , giving her daughter a hug . '
his house and saw how humble the man 's house was
Token there
Token ID612
Feature activation-2.26e-01

Pos logit contributions
 next+0.001
 three+0.001
 size+0.001
el+0.001
 only+0.000
Neg logit contributions
 stret-0.002
Charl-0.002
ches-0.002
 treadmill-0.002
 unnecessary-0.002
Token was
Token ID373
Feature activation-3.74e-01

Pos logit contributions
 stret+0.010
ches+0.009
Charl+0.009
 treadmill+0.008
 safer+0.008
Neg logit contributions
 size-0.005
 next-0.005
 three-0.005
el-0.005
 nos-0.005
Token a
Token ID257
Feature activation-2.59e-01

Pos logit contributions
 stret+0.017
ches+0.015
Charl+0.015
 treadmill+0.014
 safer+0.014
Neg logit contributions
 size-0.009
 next-0.009
 three-0.009
ho-0.008
el-0.008
Token silly
Token ID14397
Feature activation+6.44e-01

Pos logit contributions
 stret+0.013
ches+0.011
Charl+0.011
ion+0.011
 treadmill+0.011
Neg logit contributions
 three-0.005
 next-0.005
 size-0.005
 nos-0.005
 only-0.005
Token wood
Token ID4898
Feature activation+1.37e+00

Pos logit contributions
el+0.017
 size+0.017
ad+0.017
 next+0.016
ra+0.016
Neg logit contributions
 stret-0.027
ches-0.022
Charl-0.022
 treadmill-0.022
 wheelchair-0.021
Tokenpe
Token ID431
Feature activation+3.46e+00

Pos logit contributions
el+0.042
 size+0.041
 nos+0.041
ad+0.041
 colors+0.040
Neg logit contributions
 stret-0.050
ches-0.042
ion-0.040
Charl-0.039
 treadmill-0.039
Tokencker
Token ID15280
Feature activation+1.90e+00

Pos logit contributions
 nos+0.104
 three+0.100
 size+0.099
umm+0.099
 3+0.096
Neg logit contributions
 stret-0.127
ches-0.111
ion-0.100
land-0.099
Charl-0.096
Token who
Token ID508
Feature activation-4.88e-01

Pos logit contributions
 size+0.060
ad+0.059
el+0.058
 three+0.058
 nos+0.058
Neg logit contributions
 stret-0.066
ches-0.057
Charl-0.053
 safer-0.052
 wheelchair-0.051
Token wanted
Token ID2227
Feature activation+4.53e-01

Pos logit contributions
 stret+0.018
ches+0.017
Charl+0.016
 treadmill+0.015
 safer+0.015
Neg logit contributions
 next-0.014
 three-0.014
 size-0.014
el-0.014
 3-0.013
Token to
Token ID284
Feature activation+4.79e-01

Pos logit contributions
ra+0.016
el+0.016
flower+0.016
ho+0.015
umm+0.015
Neg logit contributions
 stret-0.015
 safer-0.013
 unnecessary-0.012
ion-0.012
 treadmill-0.012
Token mail
Token ID6920
Feature activation-1.02e-01

Pos logit contributions
 size+0.013
 next+0.013
 three+0.012
ho+0.012
el+0.012
Neg logit contributions
 stret-0.020
ches-0.017
Charl-0.017
 treadmill-0.017
 safer-0.016
TokenS
Token ID50
Feature activation+1.17e+00

Pos logit contributions
 next+0.007
 three+0.007
 size+0.007
el+0.007
ad+0.007
Neg logit contributions
 stret-0.009
ches-0.007
Charl-0.007
 treadmill-0.007
 unnecessary-0.007
Tokenandy
Token ID10757
Feature activation+7.87e-01

Pos logit contributions
 size+0.034
 three+0.034
 next+0.034
 nos+0.033
el+0.033
Neg logit contributions
 stret-0.046
ches-0.039
Charl-0.039
 treadmill-0.037
ion-0.036
Token was
Token ID373
Feature activation+2.75e-01

Pos logit contributions
 size+0.018
 next+0.018
 three+0.018
el+0.017
ad+0.017
Neg logit contributions
 stret-0.036
ches-0.032
Charl-0.031
 treadmill-0.030
 safer-0.030
Token an
Token ID281
Feature activation+6.40e-01

Pos logit contributions
ho+0.008
 size+0.008
 markings+0.008
ad+0.007
umm+0.007
Neg logit contributions
 stret-0.011
Charl-0.010
 safer-0.009
 wheelchair-0.009
ches-0.009
Token inqu
Token ID7949
Feature activation+6.12e-01

Pos logit contributions
 size+0.013
ho+0.013
 next+0.013
ra+0.012
 shining+0.012
Neg logit contributions
 stret-0.030
Charl-0.026
ches-0.026
 treadmill-0.025
 safer-0.025
Tokenis
Token ID271
Feature activation+3.34e+00

Pos logit contributions
 next+0.016
ad+0.015
 size+0.015
 three+0.015
el+0.015
Neg logit contributions
 stret-0.027
Charl-0.023
ches-0.023
 treadmill-0.022
 wheelchair-0.022
Tokenitive
Token ID1800
Feature activation+2.21e-01

Pos logit contributions
 nos+0.075
 size+0.075
 next+0.074
 colors+0.073
 three+0.072
Neg logit contributions
 stret-0.149
ches-0.131
Charl-0.125
ion-0.120
 treadmill-0.119
Token little
Token ID1310
Feature activation-3.24e-01

Pos logit contributions
ped+0.005
ra+0.005
 shining+0.005
el+0.005
ho+0.004
Neg logit contributions
 stret-0.010
Charl-0.008
ches-0.008
 treadmill-0.008
 safer-0.008
Token girl
Token ID2576
Feature activation+1.28e+00

Pos logit contributions
 stret+0.013
Charl+0.011
 treadmill+0.011
ches+0.011
 safer+0.011
Neg logit contributions
ad-0.009
ped-0.008
 size-0.008
 markings-0.008
el-0.008
Token.
Token ID13
Feature activation+2.62e-01

Pos logit contributions
 size+0.038
 next+0.037
 three+0.037
ad+0.036
el+0.036
Neg logit contributions
 stret-0.049
ches-0.042
Charl-0.042
 treadmill-0.039
ion-0.039
Token Everywhere
Token ID48201
Feature activation+2.79e-01

Pos logit contributions
 size+0.007
 nos+0.007
 shining+0.007
 next+0.007
ho+0.007
Neg logit contributions
 stret-0.010
Charl-0.009
 treadmill-0.009
ches-0.009
ion-0.009
Token sund
Token ID37437
Feature activation+2.89e-01

Pos logit contributions
 size+0.001
ad+0.001
 three+0.001
el+0.001
 nos+0.001
Neg logit contributions
 stret-0.001
ches-0.001
Charl-0.001
 treadmill-0.001
ion-0.001
Tokenae
Token ID3609
Feature activation-5.22e-01

Pos logit contributions
 next+0.008
 size+0.008
 three+0.008
el+0.008
ad+0.008
Neg logit contributions
 stret-0.012
Charl-0.010
ches-0.010
 treadmill-0.010
 wheelchair-0.009
Token with
Token ID351
Feature activation+3.66e-01

Pos logit contributions
 stret+0.020
ches+0.017
ion+0.015
 safer+0.015
Charl+0.015
Neg logit contributions
ad-0.016
el-0.016
 nos-0.015
ped-0.015
 size-0.015
Token lots
Token ID6041
Feature activation-2.62e-01

Pos logit contributions
flower+0.011
ho+0.011
 size+0.011
 next+0.011
ra+0.010
Neg logit contributions
 stret-0.013
Charl-0.011
ion-0.011
ches-0.011
text-0.010
Token of
Token ID286
Feature activation+2.86e-01

Pos logit contributions
 stret+0.013
Charl+0.012
ches+0.011
 treadmill+0.011
 unnecessary+0.011
Neg logit contributions
 next-0.005
 three-0.005
 size-0.005
 colors-0.005
el-0.005
Token topp
Token ID23126
Feature activation+3.32e+00

Pos logit contributions
ho+0.010
flower+0.009
 size+0.009
ra+0.009
el+0.008
Neg logit contributions
 stret-0.011
 safer-0.008
ion-0.008
 treadmill-0.008
ches-0.008
Tokenings
Token ID654
Feature activation-4.46e-01

Pos logit contributions
 next+0.127
 3+0.126
 three+0.126
 nos+0.122
 humming+0.122
Neg logit contributions
 stret-0.091
ches-0.074
text-0.071
 treadmill-0.061
uddle-0.061
Token.
Token ID13
Feature activation-9.77e-01

Pos logit contributions
 stret+0.020
Charl+0.017
ches+0.017
 treadmill+0.016
 safer+0.016
Neg logit contributions
 next-0.011
 three-0.011
 size-0.011
el-0.011
ad-0.011
Token Tim
Token ID5045
Feature activation-1.05e+00

Pos logit contributions
 stret+0.048
ches+0.043
Charl+0.042
 unnecessary+0.041
 treadmill+0.040
Neg logit contributions
 next-0.020
 three-0.020
 size-0.019
 only-0.019
el-0.019
Tokenmy
Token ID1820
Feature activation-6.53e-01

Pos logit contributions
 stret+0.047
ches+0.043
Charl+0.041
 treadmill+0.040
 safer+0.039
Neg logit contributions
ho-0.023
 three-0.022
 size-0.022
 next-0.022
 nos-0.022
Token was
Token ID373
Feature activation-3.63e-01

Pos logit contributions
 stret+0.026
ches+0.025
Charl+0.023
 treadmill+0.022
 safer+0.021
Neg logit contributions
 size-0.016
ad-0.016
 shining-0.016
el-0.016
flower-0.016
Token lion
Token ID18744
Feature activation+1.52e+00

Pos logit contributions
ped+0.011
ad+0.011
el+0.011
red+0.010
 ever+0.010
Neg logit contributions
 stret-0.015
 safer-0.012
ches-0.012
 wheelchair-0.012
 treadmill-0.012
Token with
Token ID351
Feature activation+1.28e+00

Pos logit contributions
 size+0.042
 nos+0.042
ho+0.040
el+0.040
umm+0.039
Neg logit contributions
 stret-0.059
ches-0.055
ion-0.047
 safer-0.047
 treadmill-0.047
Token a
Token ID257
Feature activation+7.49e-01

Pos logit contributions
flower+0.041
ok+0.041
ho+0.040
umm+0.040
 next+0.039
Neg logit contributions
 stret-0.038
ches-0.036
ion-0.032
 safer-0.032
text-0.032
Token big
Token ID1263
Feature activation+4.00e-01

Pos logit contributions
ad+0.017
 size+0.017
 next+0.017
el+0.017
ped+0.016
Neg logit contributions
 stret-0.034
ches-0.030
Charl-0.029
 wheelchair-0.028
 treadmill-0.028
Token man
Token ID582
Feature activation+1.02e+00

Pos logit contributions
 next+0.011
ped+0.010
years+0.010
 ever+0.010
ok+0.010
Neg logit contributions
 stret-0.016
ches-0.013
 wheelchair-0.013
 treadmill-0.013
 mat-0.013
Tokene
Token ID68
Feature activation+3.32e+00

Pos logit contributions
 size+0.029
umm+0.029
 nos+0.029
ho+0.028
 rays+0.027
Neg logit contributions
 stret-0.039
ches-0.036
ion-0.032
 Street-0.031
Charl-0.030
Token and
Token ID290
Feature activation+9.84e-01

Pos logit contributions
 size+0.105
el+0.098
 next+0.096
 nos+0.096
 three+0.096
Neg logit contributions
 stret-0.126
ches-0.108
Charl-0.100
 treadmill-0.095
ion-0.093
Token sharp
Token ID7786
Feature activation+7.88e-01

Pos logit contributions
el+0.024
ho+0.024
 next+0.024
 size+0.024
ad+0.023
Neg logit contributions
 stret-0.042
ches-0.037
Charl-0.036
 treadmill-0.035
 safer-0.034
Token teeth
Token ID9941
Feature activation+9.32e-01

Pos logit contributions
 next+0.023
ho+0.021
umm+0.020
years+0.020
 ever+0.020
Neg logit contributions
 stret-0.030
ches-0.028
 wheelchair-0.024
 Street-0.024
land-0.023
Token.
Token ID13
Feature activation-2.20e-01

Pos logit contributions
 size+0.029
 nos+0.029
el+0.029
ho+0.029
 three+0.028
Neg logit contributions
 stret-0.033
ches-0.029
ion-0.027
 treadmill-0.026
Charl-0.026
Token Lily
Token ID20037
Feature activation-5.69e-01

Pos logit contributions
 stret+0.011
Charl+0.010
 unnecessary+0.010
ches+0.009
text+0.009
Neg logit contributions
 three-0.005
 next-0.005
 only-0.005
 colors-0.005
 thought-0.004
Token around
Token ID1088
Feature activation-1.94e-01

Pos logit contributions
 stret+0.016
ches+0.015
 treadmill+0.014
 safer+0.014
Charl+0.013
Neg logit contributions
el-0.013
flower-0.013
 size-0.013
ra-0.012
 e-0.012
Token on
Token ID319
Feature activation+3.03e-02

Pos logit contributions
 stret+0.007
ches+0.006
Charl+0.006
 treadmill+0.006
 safer+0.006
Neg logit contributions
flower-0.006
el-0.006
ho-0.006
 nos-0.006
 size-0.006
Token a
Token ID257
Feature activation+7.12e-04

Pos logit contributions
 size+0.001
 next+0.001
 three+0.001
el+0.001
ho+0.001
Neg logit contributions
 stret-0.001
ches-0.001
Charl-0.001
 treadmill-0.001
 safer-0.001
Token tight
Token ID5381
Feature activation+2.31e-01

Pos logit contributions
 three+0.000
 looking+0.000
 next+0.000
 colors+0.000
 bright+0.000
Neg logit contributions
 stret-0.000
ion-0.000
Charl-0.000
 unnecessary-0.000
text-0.000
Tokenro
Token ID305
Feature activation+1.97e+00

Pos logit contributions
 size+0.006
 three+0.006
 looking+0.006
 next+0.006
 colors+0.006
Neg logit contributions
 stret-0.010
Charl-0.009
 unnecessary-0.008
ches-0.008
 safer-0.008
Tokenpe
Token ID431
Feature activation+3.25e+00

Pos logit contributions
 size+0.036
 next+0.036
 three+0.035
ho+0.034
ad+0.034
Neg logit contributions
 stret-0.098
Charl-0.087
ches-0.087
 treadmill-0.084
 safer-0.083
Token,
Token ID11
Feature activation-1.52e-01

Pos logit contributions
 nos+0.086
 size+0.083
 three+0.082
ad+0.081
el+0.080
Neg logit contributions
 stret-0.137
ches-0.120
Charl-0.108
ion-0.106
 treadmill-0.106
Token he
Token ID339
Feature activation-1.38e-01

Pos logit contributions
 stret+0.006
 safer+0.005
ches+0.005
 treadmill+0.005
 wheelchair+0.005
Neg logit contributions
el-0.005
ad-0.005
umm-0.004
ums-0.004
flower-0.004
Token could
Token ID714
Feature activation+6.42e-01

Pos logit contributions
 stret+0.005
ches+0.005
Charl+0.004
 treadmill+0.004
 safer+0.004
Neg logit contributions
el-0.004
umm-0.004
ad-0.004
 size-0.004
flower-0.004
Token even
Token ID772
Feature activation-2.54e-01

Pos logit contributions
el+0.019
ho+0.018
umm+0.018
 size+0.018
 colors+0.018
Neg logit contributions
 stret-0.025
ches-0.022
 treadmill-0.020
 safer-0.020
 wheelchair-0.020
Token turn
Token ID1210
Feature activation+3.78e-01

Pos logit contributions
 stret+0.010
ches+0.009
 safer+0.009
Charl+0.008
 treadmill+0.008
Neg logit contributions
el-0.007
 size-0.007
ho-0.007
ad-0.007
 next-0.007
Token out
Token ID503
Feature activation+1.84e-01

Pos logit contributions
 stret+0.016
ches+0.014
 safer+0.014
Charl+0.013
ion+0.013
Neg logit contributions
el-0.013
ums-0.013
ad-0.013
 size-0.013
ra-0.013
Token popped
Token ID22928
Feature activation+6.17e-01

Pos logit contributions
flower+0.006
umm+0.006
ra+0.006
ums+0.006
 size+0.006
Neg logit contributions
ches-0.006
 stret-0.005
ion-0.005
 safer-0.005
 else-0.005
Token a
Token ID257
Feature activation+2.02e-01

Pos logit contributions
ho+0.020
 size+0.020
ad+0.019
 next+0.019
flower+0.019
Neg logit contributions
 stret-0.022
ches-0.018
 safer-0.017
ion-0.017
Charl-0.016
Token little
Token ID1310
Feature activation-9.89e-01

Pos logit contributions
el+0.005
ad+0.005
 next+0.005
 size+0.005
ped+0.005
Neg logit contributions
 stret-0.009
ches-0.008
Charl-0.007
 safer-0.007
 treadmill-0.007
Token wood
Token ID4898
Feature activation+7.32e-01

Pos logit contributions
 stret+0.042
ches+0.035
 safer+0.035
Charl+0.034
 treadmill+0.033
Neg logit contributions
ad-0.026
 next-0.025
el-0.025
 size-0.025
ped-0.024
Tokenpe
Token ID431
Feature activation+3.24e+00

Pos logit contributions
 nos+0.021
 colors+0.020
 three+0.020
el+0.020
ad+0.020
Neg logit contributions
 stret-0.029
ches-0.025
ion-0.023
 safer-0.022
 treadmill-0.022
Tokencker
Token ID15280
Feature activation+1.55e+00

Pos logit contributions
 nos+0.084
 three+0.080
umm+0.078
 3+0.078
 size+0.076
Neg logit contributions
 stret-0.137
ches-0.126
land-0.112
 Street-0.109
ion-0.107
Token.
Token ID13
Feature activation+4.77e-01

Pos logit contributions
ad+0.046
 three+0.046
 colors+0.046
 nos+0.046
 size+0.046
Neg logit contributions
 stret-0.059
ches-0.051
 safer-0.045
 Street-0.044
 wheelchair-0.044
Token
Token ID198
Feature activation+1.08e-01

Pos logit contributions
 size+0.012
 next+0.012
el+0.012
 three+0.012
 colors+0.011
Neg logit contributions
 stret-0.021
ches-0.018
Charl-0.018
 treadmill-0.017
ion-0.017
Token
Token ID198
Feature activation+7.77e-02

Pos logit contributions
 next+0.003
 three+0.003
ad+0.003
 nos+0.003
 only+0.003
Neg logit contributions
 stret-0.005
ches-0.004
 treadmill-0.004
Charl-0.004
 unnecessary-0.004
Token"
Token ID1
Feature activation+5.06e-01

Pos logit contributions
 next+0.003
 three+0.003
 nos+0.003
ad+0.003
ho+0.002
Neg logit contributions
 stret-0.003
ches-0.002
 treadmill-0.002
Charl-0.002
 unnecessary-0.002
Token horse
Token ID8223
Feature activation+8.32e-01

Pos logit contributions
el+0.014
ped+0.014
ad+0.014
red+0.013
 size+0.013
Neg logit contributions
 stret-0.022
 safer-0.018
 wheelchair-0.017
 treadmill-0.017
 mat-0.017
Token with
Token ID351
Feature activation+1.11e+00

Pos logit contributions
umm+0.025
 size+0.025
 nos+0.025
ped+0.024
el+0.024
Neg logit contributions
 stret-0.030
ches-0.027
 safer-0.025
 clinic-0.024
ion-0.024
Token a
Token ID257
Feature activation+8.73e-01

Pos logit contributions
umm+0.038
ok+0.038
ho+0.037
flower+0.037
 tone+0.035
Neg logit contributions
 stret-0.031
ches-0.028
ion-0.027
 yoga-0.027
 safer-0.026
Token long
Token ID890
Feature activation+1.10e+00

Pos logit contributions
ad+0.022
el+0.022
 size+0.022
 next+0.021
ho+0.020
Neg logit contributions
 stret-0.040
ches-0.033
Charl-0.033
 wheelchair-0.032
 treadmill-0.032
Token man
Token ID582
Feature activation+7.43e-01

Pos logit contributions
ho+0.028
 next+0.027
umm+0.027
 nos+0.027
years+0.027
Neg logit contributions
 stret-0.049
ches-0.042
 wheelchair-0.039
Charl-0.038
 mat-0.037
Tokene
Token ID68
Feature activation+3.19e+00

Pos logit contributions
umm+0.022
 size+0.022
 nos+0.021
ho+0.020
 rays+0.020
Neg logit contributions
 stret-0.028
ches-0.025
ion-0.023
 Street-0.023
 clinic-0.022
Token and
Token ID290
Feature activation+7.03e-01

Pos logit contributions
 size+0.104
el+0.098
umm+0.095
 3+0.095
 only+0.094
Neg logit contributions
 stret-0.117
ches-0.099
Charl-0.090
land-0.085
ion-0.085
Token a
Token ID257
Feature activation+6.26e-01

Pos logit contributions
ho+0.019
el+0.019
 next+0.018
 size+0.018
ad+0.018
Neg logit contributions
 stret-0.028
Charl-0.024
ches-0.024
 safer-0.023
 wheelchair-0.023
Token tail
Token ID7894
Feature activation+1.41e+00

Pos logit contributions
el+0.017
ad+0.017
 size+0.016
 next+0.016
ho+0.016
Neg logit contributions
 stret-0.028
Charl-0.022
 wheelchair-0.022
ches-0.022
 treadmill-0.021
Token.
Token ID13
Feature activation+3.39e-01

Pos logit contributions
umm+0.051
ra+0.050
el+0.050
 nos+0.050
 size+0.049
Neg logit contributions
 stret-0.041
ches-0.036
ion-0.032
 wheelchair-0.030
 clinic-0.030
Token And
Token ID843
Feature activation-3.75e-01

Pos logit contributions
 three+0.008
 next+0.008
 only+0.007
 colors+0.007
el+0.007
Neg logit contributions
 stret-0.016
ches-0.014
Charl-0.014
 unnecessary-0.013
 treadmill-0.013
Token a
Token ID257
Feature activation-2.37e-02

Pos logit contributions
 stret+0.001
Charl+0.001
ches+0.001
 safer+0.001
ion+0.001
Neg logit contributions
 size-0.001
ho-0.001
el-0.001
flower-0.001
 markings-0.001
Token princess
Token ID21752
Feature activation+7.43e-01

Pos logit contributions
 stret+0.001
Charl+0.001
ches+0.001
 treadmill+0.001
 safer+0.001
Neg logit contributions
el-0.001
 size-0.001
ad-0.001
 next-0.001
 colors-0.001
Token called
Token ID1444
Feature activation+3.71e-01

Pos logit contributions
 size+0.025
 three+0.025
el+0.025
 colors+0.024
ad+0.024
Neg logit contributions
 stret-0.025
ches-0.021
Charl-0.021
 wheelchair-0.019
ion-0.018
Token R
Token ID371
Feature activation+8.23e-01

Pos logit contributions
 size+0.012
el+0.011
 next+0.011
 colors+0.011
 nos+0.011
Neg logit contributions
 stret-0.014
Charl-0.012
ches-0.012
ion-0.011
 treadmill-0.011
Tokenyle
Token ID2349
Feature activation+3.24e-01

Pos logit contributions
 size+0.027
 next+0.026
 three+0.026
el+0.026
ad+0.025
Neg logit contributions
 stret-0.030
ches-0.025
Charl-0.025
 treadmill-0.024
 safer-0.023
Tokene
Token ID68
Feature activation+3.18e+00

Pos logit contributions
 three+0.009
 next+0.009
ad+0.008
ho+0.008
 ever+0.008
Neg logit contributions
 stret-0.014
 treadmill-0.012
Charl-0.012
ches-0.012
 unnecessary-0.012
Token.
Token ID13
Feature activation-5.01e-01

Pos logit contributions
 size+0.092
 next+0.088
 three+0.088
el+0.088
ho+0.086
Neg logit contributions
 stret-0.129
ches-0.110
Charl-0.108
 treadmill-0.101
ion-0.099
Token Every
Token ID3887
Feature activation+8.70e-02

Pos logit contributions
 stret+0.021
ches+0.018
Charl+0.018
 treadmill+0.017
ion+0.017
Neg logit contributions
 size-0.014
 next-0.013
 three-0.013
 nos-0.013
el-0.013
Token day
Token ID1110
Feature activation-3.40e-01

Pos logit contributions
 size+0.002
 nos+0.002
 colors+0.002
flower+0.002
 next+0.002
Neg logit contributions
 stret-0.004
ches-0.004
Charl-0.004
ion-0.003
 wheelchair-0.003
Token,
Token ID11
Feature activation+2.47e-01

Pos logit contributions
 stret+0.012
ches+0.011
Charl+0.011
text+0.010
ion+0.010
Neg logit contributions
 size-0.010
 colors-0.010
 next-0.010
flower-0.010
 nos-0.010
Token she
Token ID673
Feature activation-3.91e-01

Pos logit contributions
 size+0.008
 colors+0.008
ad+0.008
 nos+0.008
flower+0.008
Neg logit contributions
 stret-0.008
ches-0.007
Charl-0.007
 treadmill-0.007
 safer-0.007
Token eyes
Token ID2951
Feature activation+4.93e-01

Pos logit contributions
ho+0.021
 next+0.020
flower+0.019
ra+0.019
 ever+0.019
Neg logit contributions
 stret-0.037
 wheelchair-0.030
 treadmill-0.030
ches-0.030
 unnecessary-0.029
Token and
Token ID290
Feature activation+8.16e-01

Pos logit contributions
 size+0.017
ra+0.016
el+0.016
umm+0.016
ad+0.016
Neg logit contributions
 stret-0.017
ches-0.015
Charl-0.013
 safer-0.013
land-0.013
Token a
Token ID257
Feature activation+1.16e+00

Pos logit contributions
el+0.023
flower+0.023
ad+0.022
ho+0.022
 size+0.022
Neg logit contributions
 stret-0.033
ches-0.028
 safer-0.028
Charl-0.027
 wheelchair-0.027
Token long
Token ID890
Feature activation+1.29e+00

Pos logit contributions
el+0.032
ad+0.032
ped+0.029
 next+0.029
 ever+0.028
Neg logit contributions
 stret-0.054
 wheelchair-0.044
Charl-0.044
 safer-0.043
 treadmill-0.041
Token man
Token ID582
Feature activation+7.50e-01

Pos logit contributions
umm+0.033
years+0.033
 next+0.032
el+0.032
 ever+0.032
Neg logit contributions
 stret-0.061
ches-0.048
 wheelchair-0.048
Charl-0.045
 treadmill-0.044
Tokene
Token ID68
Feature activation+3.15e+00

Pos logit contributions
umm+0.025
 size+0.024
 colors+0.023
 rays+0.023
 colours+0.022
Neg logit contributions
 stret-0.029
ches-0.024
 clinic-0.022
 Street-0.022
 wheelchair-0.021
Token.
Token ID13
Feature activation+4.66e-01

Pos logit contributions
 size+0.100
el+0.096
 three+0.094
 colors+0.094
ad+0.093
Neg logit contributions
 stret-0.123
ches-0.101
Charl-0.094
 treadmill-0.089
land-0.088
Token Tom
Token ID4186
Feature activation+2.77e-01

Pos logit contributions
 three+0.008
 next+0.008
 only+0.008
 nos+0.008
 size+0.008
Neg logit contributions
 stret-0.024
ches-0.021
Charl-0.021
 treadmill-0.021
ion-0.020
Token and
Token ID290
Feature activation+1.85e-01

Pos logit contributions
 size+0.007
el+0.007
umm+0.007
ad+0.007
 three+0.007
Neg logit contributions
 stret-0.011
ches-0.011
Charl-0.010
 treadmill-0.009
 wheelchair-0.009
Token Anna
Token ID11735
Feature activation+7.87e-01

Pos logit contributions
 next+0.007
 three+0.007
 size+0.007
 only+0.006
 nos+0.006
Neg logit contributions
 stret-0.006
Charl-0.005
ches-0.005
 treadmill-0.005
ion-0.005
Token smiled
Token ID13541
Feature activation+6.28e-01

Pos logit contributions
umm+0.024
 size+0.022
el+0.021
 next+0.021
ad+0.020
Neg logit contributions
 stret-0.031
ches-0.027
Charl-0.025
text-0.025
 else-0.025
Token<|endoftext|>
Token ID50256
Feature activation+3.20e-01

Pos logit contributions
 three+0.009
 next+0.009
ad+0.009
 only+0.009
el+0.009
Neg logit contributions
 stret-0.018
ches-0.015
 treadmill-0.014
Charl-0.014
 unnecessary-0.014
TokenAndy
Token ID35314
Feature activation+6.99e-01

Pos logit contributions
 next+0.011
 three+0.011
 size+0.010
el+0.010
ad+0.010
Neg logit contributions
 stret-0.012
ches-0.009
Charl-0.009
 treadmill-0.009
 safer-0.009
Token was
Token ID373
Feature activation-3.16e-03

Pos logit contributions
 size+0.016
 three+0.016
 next+0.016
 nos+0.015
ho+0.015
Neg logit contributions
 stret-0.031
ches-0.028
Charl-0.027
 treadmill-0.026
ion-0.026
Token an
Token ID281
Feature activation+7.14e-01

Pos logit contributions
 stret+0.000
Charl+0.000
 safer+0.000
 wheelchair+0.000
ches+0.000
Neg logit contributions
ho-0.000
 markings-0.000
ad-0.000
umm-0.000
el-0.000
Token inqu
Token ID7949
Feature activation+4.93e-01

Pos logit contributions
ho+0.015
 size+0.015
ra+0.014
 next+0.014
flower+0.014
Neg logit contributions
 stret-0.033
Charl-0.029
ches-0.028
 treadmill-0.028
 safer-0.028
Tokenis
Token ID271
Feature activation+3.15e+00

Pos logit contributions
 next+0.012
ad+0.012
el+0.012
 size+0.011
 three+0.011
Neg logit contributions
 stret-0.022
Charl-0.019
ches-0.019
 treadmill-0.018
 wheelchair-0.018
Tokenitive
Token ID1800
Feature activation+1.69e-01

Pos logit contributions
 next+0.073
 nos+0.073
 size+0.073
 three+0.072
 colors+0.071
Neg logit contributions
 stret-0.138
ches-0.119
Charl-0.114
 treadmill-0.110
ion-0.110
Token 3
Token ID513
Feature activation+2.01e+00

Pos logit contributions
ra+0.004
ped+0.003
ho+0.003
el+0.003
 shining+0.003
Neg logit contributions
 stret-0.008
Charl-0.006
 treadmill-0.006
ches-0.006
 safer-0.006
Token year
Token ID614
Feature activation+3.97e-01

Pos logit contributions
 size+0.052
 next+0.050
 three+0.050
ho+0.050
el+0.049
Neg logit contributions
 stret-0.086
ches-0.073
Charl-0.072
 treadmill-0.071
ion-0.069
Token old
Token ID1468
Feature activation+1.54e+00

Pos logit contributions
 size+0.010
el+0.010
 three+0.010
flower+0.010
ho+0.009
Neg logit contributions
 stret-0.017
ches-0.015
Charl-0.014
 treadmill-0.014
 safer-0.014
Token boy
Token ID2933
Feature activation+1.80e+00

Pos logit contributions
 size+0.047
el+0.047
ad+0.046
 next+0.046
ho+0.046
Neg logit contributions
 stret-0.057
Charl-0.046
ches-0.044
 treadmill-0.044
 wheelchair-0.042
Token it
Token ID340
Feature activation-8.32e-01

Pos logit contributions
 stret+0.012
ches+0.011
Charl+0.011
 treadmill+0.010
ion+0.010
Neg logit contributions
 size-0.009
ho-0.009
 three-0.009
ad-0.009
el-0.009
Token!"
Token ID2474
Feature activation+5.72e-02

Pos logit contributions
 stret+0.029
Charl+0.023
ches+0.023
 treadmill+0.022
 wheelchair+0.022
Neg logit contributions
 next-0.029
 size-0.029
 three-0.029
 only-0.028
 nos-0.028
Token
Token ID198
Feature activation-4.33e-01

Pos logit contributions
 three+0.001
 next+0.001
 only+0.001
el+0.001
 3+0.001
Neg logit contributions
 stret-0.003
 treadmill-0.002
ches-0.002
Charl-0.002
 unnecessary-0.002
Token
Token ID198
Feature activation-4.01e-01

Pos logit contributions
 stret+0.020
 treadmill+0.017
ches+0.017
 unnecessary+0.017
Charl+0.016
Neg logit contributions
 next-0.011
 three-0.011
 only-0.010
ad-0.010
 3-0.010
TokenIsa
Token ID39443
Feature activation+6.77e-01

Pos logit contributions
 stret+0.015
 treadmill+0.013
ches+0.013
 unnecessary+0.012
Charl+0.012
Neg logit contributions
 next-0.014
 three-0.013
 only-0.012
 size-0.012
 3-0.012
Tokenac
Token ID330
Feature activation+3.14e+00

Pos logit contributions
 size+0.013
 three+0.013
 next+0.013
el+0.012
ad+0.012
Neg logit contributions
 stret-0.034
ches-0.029
Charl-0.029
 treadmill-0.029
 safer-0.028
Token gul
Token ID47161
Feature activation+1.55e+00

Pos logit contributions
 next+0.087
 size+0.086
 looking+0.086
 shining+0.085
 nos+0.085
Neg logit contributions
 stret-0.121
ches-0.114
ion-0.105
Charl-0.105
text-0.104
Tokenped
Token ID9124
Feature activation+5.75e-01

Pos logit contributions
 size+0.018
 next+0.018
 three+0.018
 nos+0.017
ho+0.016
Neg logit contributions
 stret-0.088
ches-0.081
Charl-0.078
ion-0.078
text-0.076
Token and
Token ID290
Feature activation+7.77e-02

Pos logit contributions
 size+0.019
 colors+0.018
ho+0.018
flower+0.018
el+0.018
Neg logit contributions
 stret-0.021
ches-0.017
Charl-0.017
 safer-0.016
 treadmill-0.015
Token said
Token ID531
Feature activation+4.47e-01

Pos logit contributions
 size+0.002
 next+0.002
 three+0.002
el+0.002
ad+0.002
Neg logit contributions
 stret-0.004
ches-0.003
Charl-0.003
 treadmill-0.003
 safer-0.003
Token,
Token ID11
Feature activation+2.53e-01

Pos logit contributions
 size+0.013
el+0.012
 next+0.012
 colors+0.012
 three+0.012
Neg logit contributions
 stret-0.018
ches-0.016
Charl-0.015
 treadmill-0.015
 safer-0.015
Token sund
Token ID37437
Feature activation+6.73e-01

Pos logit contributions
 stret+0.008
ches+0.007
Charl+0.007
 treadmill+0.006
ion+0.006
Neg logit contributions
 three-0.004
 size-0.004
 next-0.004
 nos-0.004
 only-0.004
Tokenae
Token ID3609
Feature activation+4.31e-01

Pos logit contributions
 size+0.021
 three+0.020
 next+0.020
 nos+0.020
ho+0.020
Neg logit contributions
 stret-0.026
ches-0.022
Charl-0.021
 treadmill-0.020
ion-0.020
Token with
Token ID351
Feature activation+7.63e-01

Pos logit contributions
ad+0.012
 nos+0.012
ho+0.012
 size+0.012
el+0.012
Neg logit contributions
 stret-0.018
ches-0.014
ion-0.014
 wheelchair-0.013
 safer-0.013
Token lots
Token ID6041
Feature activation-1.02e-01

Pos logit contributions
flower+0.023
 size+0.023
ho+0.023
 next+0.022
 nos+0.021
Neg logit contributions
 stret-0.028
Charl-0.024
ion-0.023
 safer-0.023
ches-0.022
Token of
Token ID286
Feature activation+4.31e-01

Pos logit contributions
 stret+0.005
Charl+0.004
ches+0.004
 treadmill+0.004
 unnecessary+0.004
Neg logit contributions
 next-0.002
 size-0.002
 three-0.002
el-0.002
 colors-0.002
Token topp
Token ID23126
Feature activation+3.14e+00

Pos logit contributions
ho+0.015
flower+0.015
 size+0.014
ra+0.013
 next+0.013
Neg logit contributions
 stret-0.016
ion-0.013
 safer-0.012
 treadmill-0.012
ches-0.012
Tokenings
Token ID654
Feature activation+6.67e-02

Pos logit contributions
 next+0.128
 nos+0.125
 three+0.124
 3+0.124
 shining+0.121
Neg logit contributions
 stret-0.082
ches-0.069
text-0.061
ion-0.059
land-0.054
Token.
Token ID13
Feature activation+1.69e-02

Pos logit contributions
 size+0.002
 three+0.002
 nos+0.002
 next+0.002
 colors+0.002
Neg logit contributions
 stret-0.003
ches-0.002
Charl-0.002
ion-0.002
 treadmill-0.002
Token
Token ID198
Feature activation+9.53e-03

Pos logit contributions
 three+0.000
 only+0.000
 next+0.000
 looking+0.000
 3+0.000
Neg logit contributions
 stret-0.001
ches-0.001
 unnecessary-0.001
 safer-0.001
Charl-0.001
Token
Token ID198
Feature activation-2.56e-02

Pos logit contributions
 next+0.000
 three+0.000
 thought+0.000
 only+0.000
 looking+0.000
Neg logit contributions
 stret-0.001
 unnecessary-0.001
ience-0.000
 treadmill-0.000
 Street-0.000
TokenThey
Token ID2990
Feature activation-3.87e-02

Pos logit contributions
 stret+0.001
 unnecessary+0.001
ches+0.001
 safer+0.001
 treadmill+0.001
Neg logit contributions
 next-0.001
 three-0.001
 nos-0.001
Mom-0.001
 only-0.001
Token horse
Token ID8223
Feature activation+8.06e-01

Pos logit contributions
 stret+0.018
Charl+0.015
 safer+0.015
 wheelchair+0.015
ches+0.015
Neg logit contributions
ped-0.013
ad-0.013
 next-0.013
ho-0.013
 ever-0.013
Token with
Token ID351
Feature activation+9.36e-01

Pos logit contributions
 size+0.022
 colors+0.022
 nos+0.022
 three+0.021
 3+0.020
Neg logit contributions
 stret-0.034
ches-0.029
 safer-0.026
 clinic-0.026
ion-0.026
Token a
Token ID257
Feature activation+5.60e-01

Pos logit contributions
ho+0.029
flower+0.028
 next+0.027
 size+0.026
ok+0.026
Neg logit contributions
 stret-0.035
Charl-0.029
ches-0.029
 safer-0.028
 wheelchair-0.028
Token long
Token ID890
Feature activation+1.08e+00

Pos logit contributions
ad+0.014
 next+0.014
 size+0.014
el+0.013
ho+0.013
Neg logit contributions
 stret-0.026
ches-0.022
Charl-0.021
 wheelchair-0.021
 treadmill-0.021
Token man
Token ID582
Feature activation+8.41e-01

Pos logit contributions
ho+0.027
 next+0.026
 ever+0.026
years+0.025
umm+0.025
Neg logit contributions
 stret-0.050
ches-0.043
 wheelchair-0.040
Charl-0.040
 treadmill-0.039
Tokene
Token ID68
Feature activation+3.13e+00

Pos logit contributions
umm+0.025
 size+0.024
 colors+0.022
 rays+0.022
 nos+0.022
Neg logit contributions
 stret-0.035
ches-0.029
 Street-0.027
ion-0.027
 clinic-0.026
Token and
Token ID290
Feature activation+6.34e-01

Pos logit contributions
 size+0.102
 three+0.096
 next+0.095
 3+0.095
umm+0.094
Neg logit contributions
 stret-0.121
ches-0.098
Charl-0.092
 treadmill-0.085
 wheelchair-0.085
Token a
Token ID257
Feature activation+4.43e-01

Pos logit contributions
ho+0.015
 size+0.015
el+0.015
 next+0.015
ad+0.015
Neg logit contributions
 stret-0.028
ches-0.024
Charl-0.024
 treadmill-0.023
 wheelchair-0.023
Token tail
Token ID7894
Feature activation+1.35e+00

Pos logit contributions
ad+0.012
el+0.011
 next+0.011
 size+0.011
ho+0.011
Neg logit contributions
 stret-0.020
Charl-0.016
 wheelchair-0.016
ches-0.016
 treadmill-0.016
Token.
Token ID13
Feature activation-1.09e-01

Pos logit contributions
umm+0.046
years+0.046
 3+0.045
ra+0.045
 size+0.045
Neg logit contributions
 stret-0.045
ches-0.037
 wheelchair-0.032
ion-0.032
 treadmill-0.031
Token It
Token ID632
Feature activation+1.43e-01

Pos logit contributions
 stret+0.005
ches+0.004
Charl+0.004
 treadmill+0.004
 unnecessary+0.004
Neg logit contributions
 three-0.003
 next-0.003
 only-0.003
 size-0.002
el-0.002
Token a
Token ID257
Feature activation+1.61e-01

Pos logit contributions
 size+0.002
 next+0.002
 three+0.002
ho+0.002
el+0.002
Neg logit contributions
 stret-0.004
Charl-0.004
ches-0.004
 treadmill-0.003
 safer-0.003
Token z
Token ID1976
Feature activation+1.03e+00

Pos logit contributions
 size+0.004
 next+0.004
 three+0.003
el+0.003
ad+0.003
Neg logit contributions
 stret-0.008
ches-0.007
Charl-0.006
 treadmill-0.006
 safer-0.006
Tokenebra
Token ID37052
Feature activation+1.76e+00

Pos logit contributions
 size+0.038
 colors+0.037
 three+0.037
 next+0.037
 nos+0.036
Neg logit contributions
 stret-0.033
ches-0.027
Charl-0.026
 treadmill-0.025
 wheelchair-0.024
Token named
Token ID3706
Feature activation+9.82e-01

Pos logit contributions
 size+0.055
 three+0.053
 next+0.052
el+0.052
 colors+0.052
Neg logit contributions
 stret-0.067
ches-0.057
Charl-0.055
 treadmill-0.053
ion-0.051
Token Stri
Token ID26137
Feature activation+1.11e+00

Pos logit contributions
 size+0.027
 next+0.025
 three+0.025
el+0.025
 nos+0.025
Neg logit contributions
 stret-0.041
ches-0.036
Charl-0.035
 treadmill-0.035
ion-0.033
Tokenpe
Token ID431
Feature activation+3.13e+00

Pos logit contributions
 size+0.024
 three+0.024
 next+0.023
 nos+0.022
 colors+0.022
Neg logit contributions
 stret-0.053
ches-0.047
Charl-0.046
 treadmill-0.044
ion-0.043
Token.
Token ID13
Feature activation-6.17e-01

Pos logit contributions
 nos+0.090
 size+0.088
 three+0.087
umm+0.085
 3+0.084
Neg logit contributions
 stret-0.118
ches-0.104
ion-0.094
Charl-0.093
land-0.089
Token Stri
Token ID26137
Feature activation+5.95e-01

Pos logit contributions
 stret+0.020
ches+0.017
Charl+0.016
 treadmill+0.016
 wheelchair+0.015
Neg logit contributions
 size-0.022
 next-0.022
 nos-0.021
 three-0.021
ho-0.021
Tokenpe
Token ID431
Feature activation+2.48e+00

Pos logit contributions
 size+0.010
 three+0.010
 next+0.009
el+0.009
 nos+0.009
Neg logit contributions
 stret-0.031
ches-0.028
Charl-0.027
 treadmill-0.026
ion-0.026
Token lived
Token ID5615
Feature activation+1.55e-01

Pos logit contributions
 size+0.061
 three+0.060
 nos+0.060
 next+0.058
umm+0.057
Neg logit contributions
 stret-0.102
ches-0.094
Charl-0.085
ion-0.084
text-0.083
Token in
Token ID287
Feature activation-1.54e-01

Pos logit contributions
umm+0.005
 markings+0.005
el+0.004
 size+0.004
ho+0.004
Neg logit contributions
 stret-0.005
Charl-0.005
 safer-0.005
ches-0.005
 wheelchair-0.005
Token time
Token ID640
Feature activation-5.02e-01

Pos logit contributions
 stret+0.060
ches+0.052
Charl+0.051
 treadmill+0.051
ion+0.050
Neg logit contributions
 next-0.026
 three-0.026
 size-0.026
 only-0.025
 nos-0.025
Token there
Token ID612
Feature activation+1.65e-01

Pos logit contributions
 stret+0.020
Charl+0.017
ches+0.017
 treadmill+0.016
ion+0.016
Neg logit contributions
 size-0.015
 three-0.014
 next-0.014
el-0.014
 colors-0.014
Token was
Token ID373
Feature activation-3.22e-02

Pos logit contributions
 size+0.004
 next+0.004
 three+0.004
 nos+0.004
ho+0.004
Neg logit contributions
 stret-0.007
ches-0.006
Charl-0.006
 treadmill-0.006
ion-0.006
Token a
Token ID257
Feature activation-1.06e-01

Pos logit contributions
 stret+0.001
ches+0.001
Charl+0.001
 treadmill+0.001
 safer+0.001
Neg logit contributions
 size-0.001
 next-0.001
 three-0.001
el-0.001
ad-0.001
Token wood
Token ID4898
Feature activation+1.55e+00

Pos logit contributions
 stret+0.005
ches+0.004
Charl+0.004
 treadmill+0.004
ion+0.004
Neg logit contributions
 size-0.002
 three-0.002
 next-0.002
 nos-0.002
el-0.002
Tokencut
Token ID8968
Feature activation+3.13e+00

Pos logit contributions
 size+0.047
el+0.046
 nos+0.046
ra+0.046
ad+0.046
Neg logit contributions
 stret-0.056
ches-0.047
ion-0.046
Charl-0.045
 treadmill-0.044
Tokenter
Token ID353
Feature activation+1.21e+00

Pos logit contributions
 size+0.061
 nos+0.060
 colors+0.058
 three+0.058
 sill+0.058
Neg logit contributions
 stret-0.142
ches-0.129
ion-0.120
Charl-0.120
 treadmill-0.115
Token.
Token ID13
Feature activation-3.52e-01

Pos logit contributions
 size+0.042
 three+0.041
 next+0.041
 nos+0.040
ad+0.040
Neg logit contributions
 stret-0.040
ches-0.034
Charl-0.032
ion-0.030
 treadmill-0.030
Token He
Token ID679
Feature activation+2.98e-01

Pos logit contributions
 stret+0.011
 treadmill+0.009
Charl+0.009
ion+0.009
 safer+0.008
Neg logit contributions
 size-0.013
 next-0.013
ho-0.013
 nos-0.012
ra-0.012
Token had
Token ID550
Feature activation+1.69e-01

Pos logit contributions
 size+0.008
 sill+0.008
flower+0.008
umm+0.008
el+0.008
Neg logit contributions
 stret-0.011
ches-0.011
Charl-0.010
 safer-0.009
text-0.009
Token an
Token ID281
Feature activation+5.08e-01

Pos logit contributions
ho+0.006
umm+0.006
flower+0.006
el+0.006
ad+0.006
Neg logit contributions
 stret-0.005
 safer-0.004
ches-0.004
Charl-0.004
 unnecessary-0.004
Token lion
Token ID18744
Feature activation+1.19e+00

Pos logit contributions
 stret+0.014
 treadmill+0.011
 safer+0.011
ches+0.011
 wheelchair+0.011
Neg logit contributions
ped-0.009
 next-0.009
el-0.009
ad-0.008
ra-0.008
Token with
Token ID351
Feature activation+8.80e-01

Pos logit contributions
 size+0.033
 three+0.033
 colors+0.033
el+0.032
 nos+0.032
Neg logit contributions
 stret-0.048
ches-0.042
Charl-0.038
ion-0.037
 safer-0.037
Token a
Token ID257
Feature activation+5.63e-01

Pos logit contributions
flower+0.030
ok+0.030
umm+0.029
 next+0.028
ho+0.027
Neg logit contributions
 stret-0.027
ches-0.023
Charl-0.023
 unnecessary-0.022
 safer-0.022
Token fluffy
Token ID39145
Feature activation+5.82e-01

Pos logit contributions
el+0.015
 next+0.015
ad+0.015
 size+0.014
ped+0.014
Neg logit contributions
 stret-0.025
ches-0.021
 wheelchair-0.020
Charl-0.020
 treadmill-0.020
Token man
Token ID582
Feature activation+6.16e-01

Pos logit contributions
 next+0.018
ok+0.018
years+0.018
umm+0.017
ho+0.017
Neg logit contributions
 stret-0.023
ches-0.018
 wheelchair-0.018
Charl-0.017
 treadmill-0.016
Tokene
Token ID68
Feature activation+3.12e+00

Pos logit contributions
 size+0.018
umm+0.018
 colors+0.017
 rays+0.017
 nos+0.017
Neg logit contributions
 stret-0.025
ches-0.022
ion-0.020
Charl-0.019
 clinic-0.019
Token.
Token ID13
Feature activation+1.96e-01

Pos logit contributions
 size+0.100
 three+0.094
 3+0.093
el+0.092
 next+0.092
Neg logit contributions
 stret-0.124
ches-0.101
Charl-0.095
 treadmill-0.089
 wheelchair-0.088
Token She
Token ID1375
Feature activation-8.51e-01

Pos logit contributions
 size+0.007
 shining+0.006
 next+0.006
flower+0.006
ra+0.006
Neg logit contributions
 stret-0.007
ion-0.006
ches-0.006
 treadmill-0.006
 safer-0.005
Token saw
Token ID2497
Feature activation-1.44e-01

Pos logit contributions
 stret+0.035
ches+0.033
Charl+0.031
 treadmill+0.028
 wheelchair+0.028
Neg logit contributions
umm-0.023
el-0.021
 size-0.021
 sill-0.021
 rays-0.020
Token a
Token ID257
Feature activation-3.88e-01

Pos logit contributions
 stret+0.005
ion+0.004
ches+0.004
 safer+0.004
 treadmill+0.004
Neg logit contributions
 size-0.005
ra-0.004
flower-0.004
el-0.004
 next-0.004
Token tall
Token ID7331
Feature activation+8.98e-02

Pos logit contributions
 stret+0.017
Charl+0.014
ches+0.014
 treadmill+0.014
 safer+0.014
Neg logit contributions
el-0.010
ad-0.010
 next-0.010
ped-0.010
 size-0.010
Token lion
Token ID18744
Feature activation+1.19e+00

Pos logit contributions
 stret+0.013
 wheelchair+0.010
 safer+0.010
 treadmill+0.010
Charl+0.010
Neg logit contributions
ped-0.010
 next-0.009
ad-0.009
ho-0.009
 ever-0.009
Token with
Token ID351
Feature activation+1.08e+00

Pos logit contributions
 size+0.032
 three+0.032
 colors+0.031
 nos+0.031
 rays+0.031
Neg logit contributions
 stret-0.049
ches-0.042
ion-0.038
 wheelchair-0.038
Charl-0.038
Token a
Token ID257
Feature activation+5.94e-01

Pos logit contributions
flower+0.038
ok+0.036
umm+0.035
 next+0.035
ho+0.034
Neg logit contributions
 stret-0.032
Charl-0.027
text-0.026
ches-0.026
ion-0.026
Token golden
Token ID10861
Feature activation+1.31e+00

Pos logit contributions
ad+0.016
 next+0.015
el+0.015
 size+0.015
ped+0.014
Neg logit contributions
 stret-0.026
ches-0.022
 wheelchair-0.022
Charl-0.021
 treadmill-0.021
Token man
Token ID582
Feature activation+5.13e-01

Pos logit contributions
 next+0.037
years+0.036
ho+0.035
 ever+0.034
ok+0.033
Neg logit contributions
 stret-0.057
 wheelchair-0.044
ches-0.042
 clinic-0.040
 mat-0.040
Tokene
Token ID68
Feature activation+3.11e+00

Pos logit contributions
umm+0.015
 size+0.015
 rays+0.014
 colors+0.014
 nos+0.014
Neg logit contributions
 stret-0.021
ches-0.018
 Street-0.016
ion-0.016
 clinic-0.016
Token.
Token ID13
Feature activation+4.61e-01

Pos logit contributions
 size+0.101
 3+0.095
 three+0.095
umm+0.094
 only+0.093
Neg logit contributions
 stret-0.120
ches-0.096
Charl-0.088
land-0.083
ion-0.082
Token He
Token ID679
Feature activation-3.86e-01

Pos logit contributions
 size+0.012
 next+0.011
ho+0.011
flower+0.011
 colors+0.011
Neg logit contributions
 stret-0.020
Charl-0.018
ches-0.017
ion-0.017
 safer-0.017
Token looked
Token ID3114
Feature activation-1.09e+00

Pos logit contributions
ches+0.013
 stret+0.013
Charl+0.011
 else+0.011
 safer+0.010
Neg logit contributions
umm-0.014
 size-0.012
 rays-0.012
ok-0.012
ums-0.012
Token at
Token ID379
Feature activation+1.11e-01

Pos logit contributions
 stret+0.046
ches+0.040
 safer+0.039
 unnecessary+0.037
Charl+0.037
Neg logit contributions
 size-0.031
flower-0.029
umm-0.029
ra-0.029
ad-0.028
Token her
Token ID607
Feature activation-2.99e-01

Pos logit contributions
 size+0.003
 nos+0.003
 next+0.003
ho+0.003
 three+0.003
Neg logit contributions
 stret-0.004
ches-0.004
Charl-0.004
ion-0.003
 safer-0.003
Token to
Token ID284
Feature activation+9.93e-01

Pos logit contributions
 size+0.007
 next+0.007
 three+0.007
el+0.007
ad+0.007
Neg logit contributions
 stret-0.010
ches-0.009
Charl-0.009
 treadmill-0.008
 safer-0.008
Token wear
Token ID5806
Feature activation+5.91e-01

Pos logit contributions
 three+0.026
 size+0.026
 next+0.026
 only+0.025
ad+0.025
Neg logit contributions
 stret-0.042
Charl-0.037
ches-0.037
 treadmill-0.035
ion-0.035
Token shorts
Token ID22078
Feature activation+2.10e-02

Pos logit contributions
 colors+0.014
 shining+0.014
 next+0.014
 three+0.013
 looking+0.013
Neg logit contributions
 stret-0.027
Charl-0.025
 treadmill-0.024
ches-0.024
ion-0.023
Token and
Token ID290
Feature activation+9.73e-01

Pos logit contributions
 next+0.000
 three+0.000
 only+0.000
 size+0.000
 looking+0.000
Neg logit contributions
 stret-0.001
Charl-0.001
 unnecessary-0.001
ches-0.001
 treadmill-0.001
Token T
Token ID309
Feature activation+1.03e+00

Pos logit contributions
 next+0.028
 three+0.028
 size+0.028
 colors+0.027
 nos+0.027
Neg logit contributions
 stret-0.038
Charl-0.033
ches-0.033
 treadmill-0.031
ion-0.031
Token-
Token ID12
Feature activation+3.10e+00

Pos logit contributions
 size+0.029
 three+0.029
 next+0.028
ho+0.028
flower+0.027
Neg logit contributions
 stret-0.042
ches-0.036
Charl-0.034
 treadmill-0.034
ion-0.033
Tokenshirts
Token ID23231
Feature activation+1.32e+00

Pos logit contributions
 size+0.129
 next+0.127
 three+0.126
el+0.124
ho+0.124
Neg logit contributions
 stret-0.086
ches-0.067
Charl-0.066
 treadmill-0.061
ion-0.060
Token.
Token ID13
Feature activation+2.83e-01

Pos logit contributions
 next+0.031
 size+0.030
 only+0.030
 three+0.030
 looking+0.028
Neg logit contributions
 stret-0.061
Charl-0.055
ches-0.054
 treadmill-0.052
 unnecessary-0.052
Token They
Token ID1119
Feature activation-2.77e-02

Pos logit contributions
 three+0.007
 next+0.007
 only+0.007
 size+0.007
el+0.007
Neg logit contributions
 stret-0.013
ches-0.011
Charl-0.011
 treadmill-0.011
 unnecessary-0.010
Token did
Token ID750
Feature activation+1.48e+00

Pos logit contributions
 stret+0.001
ches+0.001
Charl+0.001
 treadmill+0.001
 safer+0.001
Neg logit contributions
 size-0.001
 next-0.001
el-0.001
ho-0.001
ad-0.001
Token not
Token ID407
Feature activation+7.32e-01

Pos logit contributions
 only+0.013
 next+0.012
 looking+0.011
 size+0.011
 colors+0.009
Neg logit contributions
 stret-0.094
Charl-0.090
ches-0.084
text-0.084
ion-0.083
Token horn
Token ID12718
Feature activation+1.87e+00

Pos logit contributions
ho+0.031
 next+0.031
years+0.030
 nos+0.029
 ever+0.029
Neg logit contributions
 stret-0.052
ches-0.045
Charl-0.042
 wheelchair-0.041
 treadmill-0.039
Token and
Token ID290
Feature activation+7.79e-01

Pos logit contributions
 nos+0.061
years+0.061
ad+0.061
 size+0.059
 next+0.058
Neg logit contributions
 stret-0.061
ches-0.053
ion-0.045
land-0.044
 treadmill-0.041
Token a
Token ID257
Feature activation+8.09e-01

Pos logit contributions
ho+0.020
el+0.020
ad+0.020
 size+0.020
 next+0.019
Neg logit contributions
 stret-0.032
ches-0.028
Charl-0.027
 safer-0.027
 treadmill-0.026
Token rainbow
Token ID27223
Feature activation+3.42e-01

Pos logit contributions
el+0.019
ad+0.019
 next+0.019
ho+0.018
 size+0.018
Neg logit contributions
 stret-0.038
Charl-0.031
ches-0.031
 safer-0.030
 wheelchair-0.030
Token man
Token ID582
Feature activation+1.02e+00

Pos logit contributions
 nos+0.010
years+0.009
 next+0.009
 curiosity+0.009
 size+0.009
Neg logit contributions
 stret-0.014
ches-0.012
ion-0.011
land-0.010
text-0.010
Tokene
Token ID68
Feature activation+3.09e+00

Pos logit contributions
 size+0.026
umm+0.026
 nos+0.026
 next+0.025
ho+0.025
Neg logit contributions
 stret-0.041
ches-0.037
ion-0.033
 Street-0.032
Charl-0.032
Token.
Token ID13
Feature activation+7.46e-02

Pos logit contributions
 size+0.103
 next+0.096
ho+0.096
el+0.095
 nos+0.094
Neg logit contributions
 stret-0.110
ches-0.095
Charl-0.087
land-0.079
 treadmill-0.079
Token She
Token ID1375
Feature activation-3.52e-01

Pos logit contributions
 size+0.002
 next+0.002
 three+0.002
el+0.002
ad+0.002
Neg logit contributions
 stret-0.003
ches-0.003
Charl-0.003
 treadmill-0.003
ion-0.003
Token was
Token ID373
Feature activation-1.83e-01

Pos logit contributions
 stret+0.015
ches+0.014
Charl+0.013
 treadmill+0.013
 safer+0.013
Neg logit contributions
 next-0.008
 size-0.008
el-0.008
ho-0.008
ad-0.008
Token so
Token ID523
Feature activation-3.86e-01

Pos logit contributions
 stret+0.006
 safer+0.005
Charl+0.005
ches+0.005
 treadmill+0.004
Neg logit contributions
ho-0.007
el-0.006
 size-0.006
ad-0.006
 next-0.006
Token happy
Token ID3772
Feature activation+6.21e-01

Pos logit contributions
 stret+0.013
 safer+0.012
Charl+0.012
 else+0.012
ches+0.011
Neg logit contributions
 size-0.012
el-0.011
ho-0.011
ad-0.011
flower-0.011
Token walking
Token ID6155
Feature activation-8.20e-01

Pos logit contributions
 colors+0.010
 size+0.010
umm+0.010
 rays+0.010
sed+0.009
Neg logit contributions
 stret-0.016
ches-0.013
 clinic-0.013
 wheelchair-0.013
 safer-0.012
Token on
Token ID319
Feature activation-6.89e-01

Pos logit contributions
 stret+0.028
ion+0.023
ches+0.023
 wheelchair+0.023
 treadmill+0.022
Neg logit contributions
 size-0.027
umm-0.027
sed-0.027
 markings-0.026
ra-0.025
Token a
Token ID257
Feature activation-2.51e-01

Pos logit contributions
 stret+0.029
Charl+0.024
ches+0.024
 treadmill+0.023
ion+0.023
Neg logit contributions
 size-0.019
 next-0.018
 three-0.018
ho-0.018
el-0.018
Token tight
Token ID5381
Feature activation-1.22e-01

Pos logit contributions
text+0.008
 unnecessary+0.008
ion+0.008
 stret+0.008
Charl+0.008
Neg logit contributions
 looking-0.008
 three-0.008
 next-0.008
 shining-0.008
 brown-0.008
Tokenro
Token ID305
Feature activation+2.07e+00

Pos logit contributions
 stret+0.004
 unnecessary+0.004
Charl+0.004
ches+0.004
 safer+0.004
Neg logit contributions
 size-0.004
 looking-0.004
 three-0.004
 next-0.004
ho-0.004
Tokenpe
Token ID431
Feature activation+3.09e+00

Pos logit contributions
 three+0.040
 size+0.040
 next+0.039
el+0.038
 nos+0.037
Neg logit contributions
 stret-0.103
ches-0.090
Charl-0.089
 treadmill-0.086
 wheelchair-0.084
Token.
Token ID13
Feature activation+5.32e-02

Pos logit contributions
 nos+0.087
umm+0.085
 three+0.083
 size+0.082
 3+0.081
Neg logit contributions
 stret-0.128
ches-0.108
land-0.098
ion-0.095
 Street-0.094
Token He
Token ID679
Feature activation-5.86e-01

Pos logit contributions
 three+0.001
 next+0.001
 looking+0.001
 only+0.001
 nos+0.001
Neg logit contributions
 stret-0.003
ches-0.002
Charl-0.002
 unnecessary-0.002
 safer-0.002
Token had
Token ID550
Feature activation-5.80e-01

Pos logit contributions
 stret+0.024
ches+0.022
Charl+0.021
 treadmill+0.020
 safer+0.019
Neg logit contributions
 size-0.016
umm-0.015
 sill-0.015
el-0.014
 rays-0.014
Token to
Token ID284
Feature activation-3.75e-01

Pos logit contributions
 stret+0.022
Charl+0.019
 wheelchair+0.019
 safer+0.019
ches+0.018
Neg logit contributions
ho-0.018
umm-0.018
flower-0.017
el-0.017
 size-0.016
Token balance
Token ID5236
Feature activation-4.76e-01

Pos logit contributions
 stret+0.016
ches+0.013
Charl+0.013
 treadmill+0.012
 safer+0.012
Neg logit contributions
ho-0.011
 size-0.010
 three-0.010
el-0.010
 next-0.010
Token it
Token ID340
Feature activation+1.81e-01

Pos logit contributions
 stret+0.021
ches+0.018
Charl+0.018
ion+0.018
 safer+0.018
Neg logit contributions
 size-0.019
ho-0.018
ad-0.018
 three-0.018
 next-0.018
Token's
Token ID338
Feature activation-7.14e-01

Pos logit contributions
 size+0.006
ho+0.006
 three+0.006
 next+0.006
ad+0.006
Neg logit contributions
 stret-0.006
ches-0.005
Charl-0.005
 treadmill-0.004
 safer-0.004
Token important
Token ID1593
Feature activation-4.35e-01

Pos logit contributions
 stret+0.029
ches+0.026
 safer+0.025
Charl+0.025
ion+0.024
Neg logit contributions
ho-0.019
ad-0.018
el-0.018
flower-0.018
 next-0.017
Token to
Token ID284
Feature activation+1.02e-02

Pos logit contributions
 stret+0.019
ches+0.016
Charl+0.016
 treadmill+0.016
ion+0.015
Neg logit contributions
 size-0.011
 three-0.011
 next-0.011
el-0.010
ad-0.010
Token keep
Token ID1394
Feature activation-6.53e-01

Pos logit contributions
ho+0.000
flower+0.000
 nos+0.000
 shining+0.000
 size+0.000
Neg logit contributions
 stret-0.000
ches-0.000
Charl-0.000
 safer-0.000
ion-0.000
Token hed
Token ID16418
Feature activation-3.27e+00

Pos logit contributions
 stret+0.023
ches+0.020
Charl+0.020
 unnecessary+0.019
 safer+0.019
Neg logit contributions
ho-0.021
 size-0.021
ad-0.020
 three-0.020
flower-0.020
Tokenges
Token ID3212
Feature activation+5.82e-02

Pos logit contributions
 stret+0.131
ches+0.116
Charl+0.113
ion+0.109
 treadmill+0.106
Neg logit contributions
 size-0.093
 colors-0.090
 nos-0.089
 next-0.088
 three-0.088
Token neat
Token ID15049
Feature activation-8.03e-01

Pos logit contributions
 size+0.002
 next+0.002
 three+0.002
el+0.002
ad+0.002
Neg logit contributions
 stret-0.002
ches-0.002
Charl-0.002
 treadmill-0.002
 safer-0.002
Token and
Token ID290
Feature activation-4.27e-01

Pos logit contributions
 stret+0.034
Charl+0.029
ches+0.029
 treadmill+0.028
 wheelchair+0.027
Neg logit contributions
 size-0.022
 next-0.022
 three-0.021
 colors-0.021
el-0.020
Token tidy
Token ID43044
Feature activation-8.36e-01

Pos logit contributions
 stret+0.017
ches+0.014
 safer+0.014
 treadmill+0.014
Charl+0.014
Neg logit contributions
ad-0.012
el-0.012
ho-0.012
 size-0.012
 three-0.012
Token"
Token ID1
Feature activation-6.59e-01

Pos logit contributions
 stret+0.033
Charl+0.029
ches+0.028
 treadmill+0.027
 wheelchair+0.026
Neg logit contributions
 next-0.026
 size-0.025
 colors-0.025
 three-0.025
el-0.024
Token parents
Token ID3397
Feature activation-1.04e+00

Pos logit contributions
 stret+0.043
ches+0.037
Charl+0.037
 treadmill+0.036
 safer+0.036
Neg logit contributions
ad-0.013
 size-0.013
el-0.013
ho-0.013
 three-0.013
Token could
Token ID714
Feature activation-4.72e-01

Pos logit contributions
 stret+0.041
ches+0.035
Charl+0.035
 treadmill+0.034
 safer+0.033
Neg logit contributions
 size-0.030
 three-0.029
 next-0.029
el-0.029
ad-0.028
Token lead
Token ID1085
Feature activation-1.36e+00

Pos logit contributions
 stret+0.020
ches+0.017
Charl+0.017
 treadmill+0.016
 safer+0.016
Neg logit contributions
 three-0.012
 size-0.012
 colors-0.012
el-0.012
ho-0.012
Token to
Token ID284
Feature activation-7.31e-01

Pos logit contributions
 stret+0.061
ches+0.053
Charl+0.052
 treadmill+0.050
 safer+0.049
Neg logit contributions
 size-0.033
 three-0.032
 next-0.032
el-0.031
ho-0.031
Token dangerous
Token ID4923
Feature activation-7.06e-01

Pos logit contributions
 stret+0.029
ches+0.025
Charl+0.025
 treadmill+0.024
ion+0.023
Neg logit contributions
 size-0.021
 next-0.021
 three-0.021
 nos-0.020
el-0.020
Token situations
Token ID7445
Feature activation-3.26e+00

Pos logit contributions
 stret+0.029
ches+0.025
Charl+0.025
 treadmill+0.024
 unnecessary+0.024
Neg logit contributions
 size-0.018
 three-0.018
 next-0.018
ho-0.018
el-0.018
Token.
Token ID13
Feature activation+1.50e-01

Pos logit contributions
 stret+0.170
Charl+0.149
ches+0.147
text+0.146
 treadmill+0.143
Neg logit contributions
 next-0.072
 ever-0.066
 three-0.063
 size-0.063
 only-0.063
Token From
Token ID3574
Feature activation-8.72e-02

Pos logit contributions
 next+0.003
 only+0.003
 three+0.003
 thought+0.003
el+0.003
Neg logit contributions
 stret-0.008
ches-0.006
Charl-0.006
 unnecessary-0.006
 treadmill-0.006
Token that
Token ID326
Feature activation-1.86e-01

Pos logit contributions
 stret+0.004
ches+0.003
Charl+0.003
 treadmill+0.003
text+0.003
Neg logit contributions
 next-0.003
 three-0.003
 only-0.003
 size-0.003
el-0.003
Token day
Token ID1110
Feature activation+1.53e-01

Pos logit contributions
 stret+0.008
Charl+0.008
ches+0.008
 safer+0.007
 treadmill+0.007
Neg logit contributions
ad-0.004
ra-0.004
 size-0.004
 markings-0.004
el-0.004
Token on
Token ID319
Feature activation+2.59e-01

Pos logit contributions
 next+0.002
 three+0.002
 size+0.002
el+0.002
 only+0.002
Neg logit contributions
 stret-0.008
ches-0.007
Charl-0.007
 treadmill-0.007
ion-0.007
Token trick
Token ID6908
Feature activation-6.86e-01

Pos logit contributions
 next+0.004
 looking+0.004
 only+0.004
 three+0.004
 size+0.004
Neg logit contributions
 stret-0.008
Charl-0.007
text-0.007
ches-0.007
 treadmill-0.007
Token-
Token ID12
Feature activation+3.11e-01

Pos logit contributions
 stret+0.030
ches+0.026
Charl+0.026
 treadmill+0.025
ion+0.024
Neg logit contributions
 size-0.017
 three-0.017
 next-0.017
el-0.016
ho-0.016
Tokenor
Token ID273
Feature activation+6.72e-02

Pos logit contributions
 next+0.012
 three+0.012
ho+0.012
ad+0.011
 only+0.011
Neg logit contributions
 stret-0.010
Charl-0.008
ches-0.008
 treadmill-0.007
 safer-0.007
Token-
Token ID12
Feature activation+2.57e-01

Pos logit contributions
 next+0.002
 three+0.002
ad+0.002
el+0.002
 size+0.002
Neg logit contributions
 stret-0.003
Charl-0.003
 treadmill-0.002
ches-0.002
 safer-0.002
Tokent
Token ID83
Feature activation-1.08e-01

Pos logit contributions
 next+0.010
 three+0.010
ad+0.010
ho+0.010
 only+0.009
Neg logit contributions
 stret-0.008
Charl-0.006
ches-0.006
 treadmill-0.006
 safer-0.006
Tokenreating
Token ID34567
Feature activation-3.24e+00

Pos logit contributions
 stret+0.005
Charl+0.005
ches+0.004
 treadmill+0.004
text+0.004
Neg logit contributions
 next-0.003
 size-0.003
 colors-0.002
ad-0.002
 three-0.002
Token anymore
Token ID7471
Feature activation-1.85e-01

Pos logit contributions
 stret+0.157
Charl+0.140
ches+0.138
 treadmill+0.135
text+0.135
Neg logit contributions
 next-0.083
 ever-0.073
 only-0.071
 looking-0.065
 size-0.064
Token.
Token ID13
Feature activation+2.90e-02

Pos logit contributions
 stret+0.008
ches+0.007
Charl+0.007
 treadmill+0.006
 safer+0.006
Neg logit contributions
 size-0.005
 next-0.005
 three-0.005
el-0.005
 nos-0.005
Token She
Token ID1375
Feature activation-6.87e-01

Pos logit contributions
 next+0.001
 three+0.001
 only+0.001
 nos+0.001
 looking+0.001
Neg logit contributions
 stret-0.001
ches-0.001
 unnecessary-0.001
 treadmill-0.001
Charl-0.001
Token sadly
Token ID21098
Feature activation+1.91e-01

Pos logit contributions
 stret+0.032
Charl+0.027
ches+0.027
 treadmill+0.027
ion+0.026
Neg logit contributions
 size-0.016
 next-0.016
 three-0.016
 only-0.015
 nos-0.015
Token put
Token ID1234
Feature activation-2.15e-01

Pos logit contributions
 size+0.004
 next+0.004
 three+0.004
el+0.004
ad+0.004
Neg logit contributions
 stret-0.009
ches-0.008
Charl-0.008
 treadmill-0.008
ion-0.007
Token him
Token ID683
Feature activation+2.28e-01

Pos logit contributions
 stret+0.012
ches+0.011
Charl+0.011
ion+0.010
 treadmill+0.010
Neg logit contributions
 size-0.008
 next-0.008
flower-0.008
ho-0.008
 nos-0.007
Token how
Token ID703
Feature activation+2.57e-01

Pos logit contributions
 size+0.008
 next+0.008
 three+0.008
el+0.008
ad+0.008
Neg logit contributions
 stret-0.007
ches-0.006
Charl-0.006
 treadmill-0.006
 safer-0.006
Token to
Token ID284
Feature activation-4.35e-01

Pos logit contributions
 size+0.006
el+0.006
 three+0.006
ad+0.006
 next+0.006
Neg logit contributions
 stret-0.011
ches-0.010
Charl-0.010
 safer-0.009
 treadmill-0.009
Token think
Token ID892
Feature activation-8.46e-01

Pos logit contributions
 stret+0.017
ches+0.015
Charl+0.015
 treadmill+0.014
 safer+0.014
Neg logit contributions
 nos-0.012
ho-0.012
 next-0.011
el-0.011
 size-0.011
Token of
Token ID286
Feature activation-2.83e-01

Pos logit contributions
 stret+0.039
Charl+0.033
ches+0.033
 treadmill+0.032
 wheelchair+0.031
Neg logit contributions
 next-0.020
 three-0.020
 size-0.020
 only-0.019
 colors-0.019
Token situations
Token ID7445
Feature activation-3.13e+00

Pos logit contributions
 stret+0.012
Charl+0.010
ches+0.010
 safer+0.010
 treadmill+0.009
Neg logit contributions
ho-0.008
 three-0.007
 size-0.007
 next-0.007
flower-0.007
Token from
Token ID422
Feature activation+3.98e-01

Pos logit contributions
 stret+0.142
ches+0.120
Charl+0.120
 treadmill+0.116
 wheelchair+0.113
Neg logit contributions
 next-0.078
 three-0.077
 size-0.077
 only-0.073
 looking-0.072
Token different
Token ID1180
Feature activation-2.47e-02

Pos logit contributions
 size+0.011
ho+0.011
 three+0.011
flower+0.011
el+0.011
Neg logit contributions
 stret-0.015
ches-0.013
Charl-0.013
 safer-0.013
 treadmill-0.012
Token perspectives
Token ID22582
Feature activation-2.61e-01

Pos logit contributions
 stret+0.001
ches+0.001
Charl+0.001
 treadmill+0.001
 safer+0.001
Neg logit contributions
 next-0.001
 size-0.001
 three-0.001
ho-0.001
el-0.001
Token,
Token ID11
Feature activation-1.83e-01

Pos logit contributions
 stret+0.011
ches+0.010
Charl+0.010
 treadmill+0.009
 safer+0.009
Neg logit contributions
 size-0.007
 next-0.007
 three-0.007
el-0.006
ad-0.006
Token and
Token ID290
Feature activation+5.33e-01

Pos logit contributions
 stret+0.007
 safer+0.006
ches+0.006
 unnecessary+0.006
 treadmill+0.006
Neg logit contributions
flower-0.005
el-0.005
ra-0.005
ho-0.005
ad-0.005
Token work
Token ID670
Feature activation-4.87e-01

Pos logit contributions
 stret+0.005
 treadmill+0.004
Charl+0.004
 safer+0.004
ches+0.003
Neg logit contributions
ho-0.004
ped-0.004
el-0.004
 next-0.003
 size-0.003
Token.
Token ID13
Feature activation-3.01e-01

Pos logit contributions
 stret+0.020
ches+0.017
Charl+0.017
 treadmill+0.016
 safer+0.016
Neg logit contributions
 size-0.013
 three-0.013
 next-0.013
el-0.013
ho-0.013
Token They
Token ID1119
Feature activation-2.68e-01

Pos logit contributions
 stret+0.013
ches+0.011
Charl+0.011
 treadmill+0.010
 safer+0.010
Neg logit contributions
 size-0.008
 three-0.008
 next-0.008
el-0.008
 nos-0.008
Token trimmed
Token ID40325
Feature activation-2.00e-01

Pos logit contributions
 stret+0.011
ches+0.010
 treadmill+0.009
Charl+0.009
ion+0.009
Neg logit contributions
umm-0.007
 size-0.007
 next-0.007
el-0.007
ad-0.007
Token the
Token ID262
Feature activation-4.15e-01

Pos logit contributions
 stret+0.009
Charl+0.007
ches+0.007
 treadmill+0.007
 wheelchair+0.007
Neg logit contributions
 three-0.005
 size-0.005
 next-0.005
el-0.005
 colors-0.005
Token hed
Token ID16418
Feature activation-3.04e+00

Pos logit contributions
 stret+0.017
ches+0.014
 treadmill+0.014
Charl+0.013
 safer+0.013
Neg logit contributions
ad-0.012
 size-0.012
el-0.012
 three-0.011
 next-0.011
Tokenges
Token ID3212
Feature activation+2.53e-01

Pos logit contributions
 stret+0.116
ches+0.100
ion+0.099
Charl+0.098
 treadmill+0.093
Neg logit contributions
 size-0.092
 nos-0.091
 colors-0.090
 next-0.089
 three-0.089
Token and
Token ID290
Feature activation-3.78e-01

Pos logit contributions
 size+0.007
 three+0.006
ad+0.006
 next+0.006
 nos+0.006
Neg logit contributions
 stret-0.011
ches-0.009
Charl-0.009
 treadmill-0.009
ion-0.009
Token cut
Token ID2005
Feature activation-2.77e-01

Pos logit contributions
 stret+0.015
 safer+0.013
 treadmill+0.013
ches+0.013
ion+0.012
Neg logit contributions
el-0.010
ad-0.010
ho-0.010
flower-0.009
ra-0.009
Token the
Token ID262
Feature activation-6.70e-01

Pos logit contributions
 stret+0.011
ches+0.010
Charl+0.010
 treadmill+0.009
 safer+0.009
Neg logit contributions
 size-0.008
 next-0.008
 three-0.008
el-0.007
ad-0.007
Token grass
Token ID8701
Feature activation-7.80e-01

Pos logit contributions
 stret+0.027
ches+0.022
 treadmill+0.022
 safer+0.021
Charl+0.021
Neg logit contributions
ad-0.020
el-0.019
 next-0.019
 three-0.019
 size-0.019
Token things
Token ID1243
Feature activation-6.15e-01

Pos logit contributions
 stret+0.009
 unnecessary+0.009
 safer+0.008
ches+0.008
 wheelchair+0.008
Neg logit contributions
ho-0.009
flower-0.009
ad-0.008
el-0.008
 sill-0.008
Token can
Token ID460
Feature activation+1.85e-01

Pos logit contributions
 stret+0.025
Charl+0.021
ches+0.021
 treadmill+0.020
 wheelchair+0.020
Neg logit contributions
 next-0.018
 size-0.018
 only-0.018
 three-0.017
 colors-0.017
Token come
Token ID1282
Feature activation+9.25e-01

Pos logit contributions
years+0.006
ped+0.006
 3+0.006
ho+0.006
 nos+0.006
Neg logit contributions
 stret-0.006
ches-0.006
 safer-0.005
 earn-0.005
 else-0.005
Token from
Token ID422
Feature activation-8.70e-02

Pos logit contributions
ad+0.025
ho+0.025
 size+0.024
el+0.023
 three+0.023
Neg logit contributions
 stret-0.037
ches-0.034
 safer-0.032
 unnecessary-0.031
Charl-0.030
Token bad
Token ID2089
Feature activation-2.89e-01

Pos logit contributions
 stret+0.003
ches+0.003
 unnecessary+0.003
Charl+0.003
 safer+0.003
Neg logit contributions
ho-0.002
ad-0.002
 three-0.002
flower-0.002
el-0.002
Token situations
Token ID7445
Feature activation-2.97e+00

Pos logit contributions
 stret+0.012
ches+0.011
 unnecessary+0.010
Charl+0.010
 safer+0.010
Neg logit contributions
ho-0.007
 three-0.007
ad-0.007
el-0.007
 next-0.007
Token.
Token ID13
Feature activation-3.68e-02

Pos logit contributions
 stret+0.152
Charl+0.129
ches+0.125
 treadmill+0.124
 wheelchair+0.121
Neg logit contributions
 next-0.067
 three-0.064
 size-0.062
 only-0.061
 ever-0.059
Token<|endoftext|>
Token ID50256
Feature activation+7.18e-02

Pos logit contributions
 stret+0.002
ches+0.001
 treadmill+0.001
 unnecessary+0.001
Charl+0.001
Neg logit contributions
 next-0.001
 three-0.001
 only-0.001
ad-0.001
el-0.001
TokenOnce
Token ID7454
Feature activation+3.54e-01

Pos logit contributions
 next+0.002
 three+0.002
 only+0.002
 looking+0.002
ad+0.002
Neg logit contributions
 stret-0.003
 unnecessary-0.003
 treadmill-0.003
ches-0.003
Charl-0.003
Token upon
Token ID2402
Feature activation-2.13e-01

Pos logit contributions
 next+0.004
 three+0.004
el+0.004
 only+0.004
 size+0.004
Neg logit contributions
 stret-0.021
Charl-0.019
 treadmill-0.018
ches-0.018
 unnecessary-0.018
Token a
Token ID257
Feature activation-1.12e+00

Pos logit contributions
 stret+0.011
ches+0.010
 treadmill+0.010
Charl+0.010
ion+0.010
Neg logit contributions
 three-0.004
 next-0.004
 only-0.004
 looking-0.004
 nos-0.003
Token sandwich
Token ID20433
Feature activation+5.85e-01

Pos logit contributions
 stret+0.010
ches+0.008
Charl+0.008
 treadmill+0.008
 safer+0.008
Neg logit contributions
el-0.009
ad-0.009
 size-0.009
 next-0.009
 colors-0.009
Token.
Token ID13
Feature activation-2.30e-01

Pos logit contributions
 next+0.015
 size+0.015
 three+0.015
el+0.014
 only+0.014
Neg logit contributions
 stret-0.025
ches-0.022
Charl-0.022
 treadmill-0.021
 safer-0.020
Token Her
Token ID2332
Feature activation-1.59e+00

Pos logit contributions
 stret+0.010
ches+0.009
Charl+0.009
 treadmill+0.009
 safer+0.008
Neg logit contributions
 size-0.006
 next-0.006
 three-0.006
el-0.005
 nos-0.005
Token mom
Token ID1995
Feature activation-4.06e-01

Pos logit contributions
 stret+0.063
 safer+0.053
 wheelchair+0.052
 hospital+0.051
 proper+0.051
Neg logit contributions
ad-0.045
el-0.040
 markings-0.040
years-0.039
ho-0.039
Token made
Token ID925
Feature activation-4.12e-01

Pos logit contributions
ches+0.012
 stret+0.010
 else+0.010
ion+0.010
old+0.009
Neg logit contributions
umm-0.016
 size-0.015
 markings-0.015
 blu-0.015
 rays-0.015
Token her
Token ID607
Feature activation-2.95e+00

Pos logit contributions
 stret+0.014
ches+0.013
ion+0.013
 safer+0.012
Charl+0.011
Neg logit contributions
 size-0.014
flower-0.014
ho-0.014
 nos-0.013
 markings-0.012
Token a
Token ID257
Feature activation-6.70e-01

Pos logit contributions
 stret+0.129
ches+0.111
Charl+0.108
 treadmill+0.106
 safer+0.105
Neg logit contributions
 size-0.073
 next-0.072
ho-0.071
ad-0.070
el-0.070
Token peanut
Token ID26636
Feature activation+7.24e-01

Pos logit contributions
 stret+0.025
 safer+0.020
ches+0.020
 treadmill+0.019
Charl+0.019
Neg logit contributions
ad-0.022
el-0.022
ped-0.021
flower-0.021
 next-0.021
Token butter
Token ID9215
Feature activation+7.35e-01

Pos logit contributions
 three+0.021
 size+0.021
ad+0.021
 nos+0.021
ho+0.020
Neg logit contributions
 stret-0.028
ches-0.025
Charl-0.023
 safer-0.022
 treadmill-0.022
Token and
Token ID290
Feature activation-2.93e-01

Pos logit contributions
 nos+0.020
 three+0.020
 size+0.020
 3+0.019
ad+0.019
Neg logit contributions
 stret-0.030
ches-0.027
Charl-0.024
ion-0.024
 safer-0.023
Token jelly
Token ID27644
Feature activation+2.69e-01

Pos logit contributions
 stret+0.011
ches+0.009
 safer+0.009
ion+0.009
Charl+0.009
Neg logit contributions
ad-0.009
el-0.009
 size-0.008
ho-0.008
 next-0.008
Token ended
Token ID4444
Feature activation-3.37e-02

Pos logit contributions
 stret+0.030
ches+0.026
Charl+0.026
 treadmill+0.025
 safer+0.024
Neg logit contributions
 size-0.018
 next-0.018
 three-0.018
el-0.017
ad-0.017
Token up
Token ID510
Feature activation+1.82e-01

Pos logit contributions
 stret+0.001
ches+0.001
Charl+0.001
 treadmill+0.001
ion+0.001
Neg logit contributions
 next-0.001
 size-0.001
 three-0.001
el-0.001
 only-0.001
Token in
Token ID287
Feature activation+1.19e-01

Pos logit contributions
 size+0.005
 three+0.004
 next+0.004
el+0.004
ad+0.004
Neg logit contributions
 stret-0.008
ches-0.007
Charl-0.007
 treadmill-0.007
 safer-0.006
Token such
Token ID884
Feature activation+2.65e-01

Pos logit contributions
 size+0.003
 three+0.003
 next+0.003
ho+0.003
el+0.003
Neg logit contributions
 stret-0.005
Charl-0.004
ches-0.004
 treadmill-0.004
 safer-0.004
Token bad
Token ID2089
Feature activation-2.03e-01

Pos logit contributions
 size+0.006
 next+0.006
 three+0.006
el+0.006
ho+0.006
Neg logit contributions
 stret-0.012
ches-0.011
Charl-0.011
 treadmill-0.010
 safer-0.010
Token situations
Token ID7445
Feature activation-2.93e+00

Pos logit contributions
 stret+0.008
 unnecessary+0.006
 wheelchair+0.006
ches+0.006
 safer+0.006
Neg logit contributions
ho-0.006
 three-0.006
el-0.006
ad-0.006
ra-0.006
Token.
Token ID13
Feature activation-1.50e-01

Pos logit contributions
 stret+0.134
Charl+0.119
ches+0.119
 treadmill+0.114
text+0.111
Neg logit contributions
 next-0.073
 three-0.067
 ever-0.067
 size-0.066
 looking-0.066
Token
Token ID220
Feature activation-7.19e-01

Pos logit contributions
 stret+0.006
ches+0.005
Charl+0.005
 treadmill+0.005
 safer+0.005
Neg logit contributions
 three-0.004
 next-0.004
 size-0.004
el-0.004
ad-0.004
Token
Token ID198
Feature activation-2.10e-01

Pos logit contributions
 stret+0.034
ches+0.029
Charl+0.029
 treadmill+0.028
 unnecessary+0.028
Neg logit contributions
 next-0.016
 three-0.015
 only-0.015
ad-0.014
el-0.014
Token
Token ID198
Feature activation-2.21e-01

Pos logit contributions
 stret+0.008
ches+0.007
Charl+0.007
 treadmill+0.007
 safer+0.006
Neg logit contributions
 next-0.006
 three-0.006
 size-0.006
ad-0.006
el-0.006
TokenThe
Token ID464
Feature activation+5.95e-01

Pos logit contributions
 stret+0.008
ches+0.007
Charl+0.006
 treadmill+0.006
 safer+0.006
Neg logit contributions
 next-0.007
 three-0.007
 size-0.007
ad-0.007
 nos-0.007
Token eat
Token ID4483
Feature activation-2.99e-01

Pos logit contributions
 size+0.008
 next+0.008
 three+0.008
el+0.008
ad+0.008
Neg logit contributions
 stret-0.014
Charl-0.012
ches-0.012
 treadmill-0.012
ion-0.011
Token.
Token ID13
Feature activation+4.50e-02

Pos logit contributions
 stret+0.014
Charl+0.012
ches+0.012
 treadmill+0.012
 wheelchair+0.011
Neg logit contributions
 next-0.007
 size-0.007
 three-0.007
el-0.007
 only-0.007
Token Her
Token ID2332
Feature activation-1.25e+00

Pos logit contributions
 size+0.001
 next+0.001
 three+0.001
el+0.001
 nos+0.001
Neg logit contributions
 stret-0.002
ches-0.002
Charl-0.002
 treadmill-0.002
 safer-0.002
Token mom
Token ID1995
Feature activation-1.33e-01

Pos logit contributions
 stret+0.049
 wheelchair+0.041
ches+0.040
 hospital+0.040
 safer+0.040
Neg logit contributions
ad-0.039
years-0.034
el-0.034
 markings-0.033
 colours-0.032
Token made
Token ID925
Feature activation-2.43e-01

Pos logit contributions
ches+0.005
 stret+0.004
Charl+0.004
ion+0.004
 else+0.004
Neg logit contributions
umm-0.005
 markings-0.004
 size-0.004
ad-0.004
 sill-0.004
Token her
Token ID607
Feature activation-2.92e+00

Pos logit contributions
 stret+0.009
ches+0.008
ion+0.007
 safer+0.007
Charl+0.007
Neg logit contributions
 size-0.008
flower-0.008
ho-0.008
 nos-0.007
ra-0.007
Token a
Token ID257
Feature activation-6.38e-01

Pos logit contributions
 stret+0.129
ches+0.111
Charl+0.109
 treadmill+0.107
 safer+0.105
Neg logit contributions
 size-0.072
 next-0.070
ho-0.069
ad-0.069
el-0.069
Token y
Token ID331
Feature activation+5.17e-01

Pos logit contributions
 stret+0.024
 safer+0.019
 treadmill+0.019
ches+0.019
Charl+0.018
Neg logit contributions
ad-0.021
el-0.021
ped-0.020
flower-0.020
 size-0.020
Tokenummy
Token ID13513
Feature activation-3.39e-01

Pos logit contributions
 next+0.008
 three+0.008
 size+0.007
 nos+0.007
 only+0.007
Neg logit contributions
 stret-0.027
ches-0.024
Charl-0.024
 treadmill-0.024
ion-0.023
Token peanut
Token ID26636
Feature activation+6.22e-01

Pos logit contributions
 stret+0.010
ches+0.008
 safer+0.007
 treadmill+0.007
ion+0.007
Neg logit contributions
ho-0.013
 size-0.013
ad-0.013
el-0.013
ra-0.013
Token butter
Token ID9215
Feature activation+6.15e-01

Pos logit contributions
 three+0.018
 size+0.018
ad+0.018
 nos+0.018
el+0.017
Neg logit contributions
 stret-0.024
ches-0.021
Charl-0.020
 treadmill-0.020
 safer-0.019
Token sandwich
Token ID20433
Feature activation+5.60e-01

Pos logit contributions
 stret+0.009
ches+0.007
Charl+0.007
 treadmill+0.007
 safer+0.007
Neg logit contributions
el-0.008
ad-0.008
 size-0.008
ra-0.008
 colors-0.008
Token.
Token ID13
Feature activation-1.66e-01

Pos logit contributions
 size+0.014
 next+0.014
 three+0.014
el+0.014
ad+0.013
Neg logit contributions
 stret-0.024
ches-0.021
Charl-0.021
 treadmill-0.020
 safer-0.019
Token Her
Token ID2332
Feature activation-1.55e+00

Pos logit contributions
 stret+0.007
ches+0.006
Charl+0.006
 treadmill+0.006
 safer+0.006
Neg logit contributions
 size-0.004
 three-0.004
 next-0.004
el-0.004
 only-0.004
Token mom
Token ID1995
Feature activation-4.27e-01

Pos logit contributions
 stret+0.061
 safer+0.051
 wheelchair+0.049
 proper+0.049
ches+0.049
Neg logit contributions
ad-0.044
el-0.040
 markings-0.040
ho-0.039
years-0.039
Token made
Token ID925
Feature activation-4.31e-01

Pos logit contributions
ches+0.013
 stret+0.012
 else+0.011
ion+0.011
Charl+0.010
Neg logit contributions
umm-0.015
 size-0.015
 markings-0.015
 blu-0.014
 sill-0.014
Token her
Token ID607
Feature activation-2.92e+00

Pos logit contributions
 stret+0.015
ches+0.014
ion+0.013
 safer+0.012
Charl+0.012
Neg logit contributions
 size-0.014
flower-0.014
ho-0.014
 nos-0.013
 next-0.013
Token a
Token ID257
Feature activation-7.06e-01

Pos logit contributions
 stret+0.125
ches+0.107
Charl+0.104
 safer+0.102
 treadmill+0.102
Neg logit contributions
 size-0.075
ho-0.074
 next-0.073
ad-0.072
el-0.072
Token normal
Token ID3487
Feature activation-1.19e+00

Pos logit contributions
 stret+0.025
 safer+0.019
ches+0.019
 treadmill+0.018
Charl+0.018
Neg logit contributions
ad-0.025
el-0.025
ped-0.024
flower-0.024
 size-0.024
Token sandwich
Token ID20433
Feature activation+5.89e-01

Pos logit contributions
 stret+0.030
 safer+0.022
ches+0.022
 yoga+0.021
 clinic+0.020
Neg logit contributions
ad-0.050
el-0.050
years-0.050
ped-0.048
ho-0.047
Token with
Token ID351
Feature activation+5.52e-01

Pos logit contributions
el+0.015
 nos+0.015
ad+0.015
 three+0.015
 size+0.015
Neg logit contributions
 stret-0.024
ches-0.021
 safer-0.020
Charl-0.020
 treadmill-0.020
Token peanut
Token ID26636
Feature activation+7.57e-01

Pos logit contributions
flower+0.018
ok+0.017
 next+0.017
 size+0.017
 nos+0.016
Neg logit contributions
 stret-0.017
ches-0.016
ion-0.015
 safer-0.015
Charl-0.014
Token that
Token ID326
Feature activation+2.57e-01

Pos logit contributions
 stret+0.003
land+0.003
ches+0.003
 yoga+0.003
 safer+0.003
Neg logit contributions
flower-0.006
 blu-0.006
ra-0.006
ho-0.006
el-0.006
Token she
Token ID673
Feature activation-7.74e-02

Pos logit contributions
ra+0.009
 markings+0.009
 size+0.009
ad+0.009
ho+0.009
Neg logit contributions
 stret-0.007
 safer-0.007
Charl-0.006
ches-0.006
ion-0.006
Token had
Token ID550
Feature activation-6.01e-02

Pos logit contributions
 stret+0.004
Charl+0.003
ches+0.003
 treadmill+0.003
 unnecessary+0.003
Neg logit contributions
 next-0.001
 three-0.001
 size-0.001
 only-0.001
 nos-0.001
Token made
Token ID925
Feature activation-2.03e-01

Pos logit contributions
 stret+0.003
ches+0.003
Charl+0.003
 treadmill+0.002
 safer+0.002
Neg logit contributions
 size-0.001
 next-0.001
 three-0.001
el-0.001
ho-0.001
Token the
Token ID262
Feature activation-1.90e+00

Pos logit contributions
 stret+0.008
ches+0.007
Charl+0.007
 safer+0.007
 treadmill+0.007
Neg logit contributions
ho-0.006
 size-0.006
 next-0.005
flower-0.005
el-0.005
Token little
Token ID1310
Feature activation-2.88e+00

Pos logit contributions
 stret+0.074
 safer+0.063
 wheelchair+0.059
Charl+0.059
 hospital+0.059
Neg logit contributions
ad-0.058
el-0.057
ho-0.053
ra-0.051
 markings-0.051
Token boy
Token ID2933
Feature activation-8.51e-01

Pos logit contributions
 stret+0.082
 safer+0.076
 hospital+0.065
 wheelchair+0.065
 salon+0.063
Neg logit contributions
ad-0.108
el-0.100
years-0.098
ild-0.097
ra-0.097
Token so
Token ID523
Feature activation-3.26e-01

Pos logit contributions
 stret+0.036
ches+0.031
 safer+0.030
Charl+0.029
 treadmill+0.029
Neg logit contributions
 size-0.022
ho-0.022
 next-0.021
el-0.021
 three-0.021
Token happy
Token ID3772
Feature activation+4.33e-01

Pos logit contributions
 stret+0.011
 safer+0.011
 else+0.010
ches+0.010
 treadmill+0.010
Neg logit contributions
flower-0.010
 size-0.010
el-0.009
 markings-0.009
ad-0.009
Token.
Token ID13
Feature activation-6.35e-02

Pos logit contributions
el+0.013
 three+0.013
 size+0.013
ho+0.013
 nos+0.013
Neg logit contributions
 stret-0.016
ches-0.013
Charl-0.013
 treadmill-0.013
 safer-0.013
Token She
Token ID1375
Feature activation-2.98e-01

Pos logit contributions
 stret+0.004
ches+0.003
 unnecessary+0.003
Charl+0.003
 treadmill+0.003
Neg logit contributions
 three-0.001
 only-0.001
 next-0.001
 size-0.001
ad-0.001
Token.
Token ID13
Feature activation-1.05e-02

Pos logit contributions
 stret+0.020
ches+0.017
Charl+0.017
 treadmill+0.017
 safer+0.016
Neg logit contributions
 size-0.014
el-0.014
 three-0.014
ad-0.014
ho-0.013
Token Her
Token ID2332
Feature activation-1.33e+00

Pos logit contributions
 stret+0.001
ches+0.000
Charl+0.000
 treadmill+0.000
 unnecessary+0.000
Neg logit contributions
 three-0.000
 next-0.000
 size-0.000
 only-0.000
el-0.000
Token mom
Token ID1995
Feature activation-1.19e-01

Pos logit contributions
 stret+0.053
 wheelchair+0.046
 yoga+0.044
 hospital+0.044
 treadmill+0.043
Neg logit contributions
ad-0.041
el-0.036
years-0.036
 markings-0.034
ho-0.034
Tokenmy
Token ID1820
Feature activation-1.12e+00

Pos logit contributions
ches+0.004
 stret+0.004
Charl+0.003
ion+0.003
 else+0.003
Neg logit contributions
umm-0.004
 markings-0.004
 size-0.004
 rays-0.004
 sill-0.004
Token made
Token ID925
Feature activation-2.82e-01

Pos logit contributions
ches+0.042
 stret+0.040
Charl+0.035
ion+0.034
 safer+0.032
Neg logit contributions
 size-0.036
umm-0.036
 markings-0.034
ad-0.034
 sill-0.034
Token her
Token ID607
Feature activation-2.84e+00

Pos logit contributions
 stret+0.010
ion+0.009
ches+0.008
 safer+0.008
 yoga+0.007
Neg logit contributions
flower-0.010
ho-0.010
 size-0.010
 nos-0.009
sed-0.009
Token a
Token ID257
Feature activation-3.73e-01

Pos logit contributions
 stret+0.126
ches+0.107
Charl+0.106
 treadmill+0.103
 safer+0.102
Neg logit contributions
 size-0.070
 next-0.068
ho-0.068
ad-0.067
el-0.067
Token sandwich
Token ID20433
Feature activation+6.16e-01

Pos logit contributions
 stret+0.016
 safer+0.013
 treadmill+0.013
ches+0.012
Charl+0.012
Neg logit contributions
ad-0.011
el-0.011
 size-0.010
ped-0.010
flower-0.010
Token with
Token ID351
Feature activation+7.24e-01

Pos logit contributions
el+0.016
 nos+0.015
ad+0.015
 three+0.015
 size+0.015
Neg logit contributions
 stret-0.026
ches-0.023
 safer-0.022
 treadmill-0.021
Charl-0.021
Token peanut
Token ID26636
Feature activation+8.34e-01

Pos logit contributions
flower+0.024
ok+0.023
 next+0.022
ho+0.022
ad+0.022
Neg logit contributions
 stret-0.024
ion-0.020
ches-0.020
 safer-0.019
Charl-0.019
Token butter
Token ID9215
Feature activation+7.65e-01

Pos logit contributions
 three+0.026
 size+0.026
ad+0.026
 next+0.025
 nos+0.025
Neg logit contributions
 stret-0.030
ches-0.027
Charl-0.024
ion-0.024
 treadmill-0.024
Token could
Token ID714
Feature activation-5.49e-01

Pos logit contributions
 size+0.015
 three+0.015
 next+0.014
el+0.014
ad+0.014
Neg logit contributions
 stret-0.020
ches-0.018
Charl-0.017
ion-0.017
 treadmill-0.017
Token help
Token ID1037
Feature activation-9.47e-01

Pos logit contributions
 stret+0.022
 safer+0.020
ches+0.020
 treadmill+0.018
Charl+0.018
Neg logit contributions
 3-0.015
 three-0.014
flower-0.014
el-0.014
ho-0.014
Token him
Token ID683
Feature activation+9.71e-02

Pos logit contributions
 stret+0.043
ches+0.038
Charl+0.036
 treadmill+0.036
 safer+0.036
Neg logit contributions
 size-0.021
 three-0.021
ho-0.021
 next-0.021
el-0.020
Token in
Token ID287
Feature activation-6.13e-01

Pos logit contributions
 size+0.003
 three+0.003
el+0.003
ho+0.003
 next+0.002
Neg logit contributions
 stret-0.004
ches-0.003
Charl-0.003
 treadmill-0.003
 safer-0.003
Token difficult
Token ID2408
Feature activation-7.94e-01

Pos logit contributions
 stret+0.023
 safer+0.021
 unnecessary+0.020
Charl+0.020
ches+0.019
Neg logit contributions
ho-0.018
flower-0.017
ra-0.017
el-0.017
ad-0.016
Token situations
Token ID7445
Feature activation-2.84e+00

Pos logit contributions
 stret+0.030
ches+0.026
Charl+0.025
 treadmill+0.025
 safer+0.025
Neg logit contributions
 three-0.023
 size-0.023
 next-0.023
ho-0.022
ad-0.022
Token.
Token ID13
Feature activation+2.29e-01

Pos logit contributions
 stret+0.136
Charl+0.116
ches+0.115
 treadmill+0.113
text+0.110
Neg logit contributions
 next-0.069
 three-0.065
 size-0.065
 only-0.064
 ever-0.062
Token<|endoftext|>
Token ID50256
Feature activation+3.13e-01

Pos logit contributions
 next+0.006
 three+0.006
ad+0.006
 only+0.005
 size+0.005
Neg logit contributions
 stret-0.011
ches-0.009
Charl-0.009
 treadmill-0.009
 unnecessary-0.008
TokenOnce
Token ID7454
Feature activation+3.67e-01

Pos logit contributions
 next+0.009
 three+0.009
 size+0.009
el+0.009
ad+0.009
Neg logit contributions
 stret-0.013
ches-0.011
Charl-0.010
 treadmill-0.010
 unnecessary-0.010
Token upon
Token ID2402
Feature activation-2.77e-01

Pos logit contributions
 next+0.005
 three+0.004
el+0.004
 only+0.004
 size+0.004
Neg logit contributions
 stret-0.022
Charl-0.019
 treadmill-0.019
ches-0.018
 unnecessary-0.018
Token a
Token ID257
Feature activation-1.24e+00

Pos logit contributions
 stret+0.014
ches+0.012
Charl+0.012
 treadmill+0.012
ion+0.012
Neg logit contributions
 three-0.005
 next-0.005
 only-0.005
 looking-0.005
 size-0.005
Tokenâ
Token ID22940
Feature activation-4.46e-01

Pos logit contributions
 stret+0.063
ches+0.054
Charl+0.054
 treadmill+0.052
ion+0.050
Neg logit contributions
 three-0.041
 next-0.040
 size-0.039
 only-0.039
 nos-0.038
Token€
Token ID26391
Feature activation-2.35e+00

Pos logit contributions
 stret+0.016
Charl+0.013
ches+0.013
 treadmill+0.013
 unnecessary+0.013
Neg logit contributions
 next-0.015
ad-0.015
el-0.014
 three-0.014
 only-0.014
Token
Token ID198
Feature activation-8.67e-01

Pos logit contributions
 stret+0.081
ion+0.073
ches+0.069
 unnecessary+0.063
Charl+0.063
Neg logit contributions
 nos-0.084
 next-0.082
 looking-0.081
 only-0.078
 ever-0.078
Token
Token ID198
Feature activation-9.81e-01

Pos logit contributions
 stret+0.043
 unnecessary+0.038
Charl+0.037
ches+0.037
 treadmill+0.036
Neg logit contributions
 next-0.018
 three-0.017
 only-0.016
ad-0.016
 looking-0.016
TokenThe
Token ID464
Feature activation-1.08e+00

Pos logit contributions
 stret+0.036
ches+0.030
 unnecessary+0.029
Charl+0.028
 wheelchair+0.028
Neg logit contributions
 next-0.033
 three-0.033
 only-0.032
 nos-0.031
 colors-0.031
Token little
Token ID1310
Feature activation-2.76e+00

Pos logit contributions
 stret+0.051
 treadmill+0.042
Charl+0.042
 safer+0.042
 wheelchair+0.041
Neg logit contributions
el-0.026
ad-0.026
 size-0.024
ho-0.024
 markings-0.024
Token dog
Token ID3290
Feature activation+4.39e-01

Pos logit contributions
 stret+0.104
Charl+0.084
ches+0.083
 treadmill+0.082
 wheelchair+0.082
Neg logit contributions
ad-0.086
 size-0.083
el-0.081
ho-0.081
 markings-0.081
Token shook
Token ID14682
Feature activation-2.44e-03

Pos logit contributions
 next+0.009
 three+0.009
 size+0.009
 only+0.008
 nos+0.008
Neg logit contributions
 stret-0.022
Charl-0.019
ches-0.019
 treadmill-0.018
ion-0.018
Token his
Token ID465
Feature activation-6.15e-01

Pos logit contributions
 stret+0.000
ches+0.000
Charl+0.000
 treadmill+0.000
ion+0.000
Neg logit contributions
 next-0.000
 three-0.000
 size-0.000
el-0.000
 only-0.000
Token head
Token ID1182
Feature activation+8.85e-01

Pos logit contributions
 stret+0.036
ches+0.032
Charl+0.032
 treadmill+0.031
 safer+0.031
Neg logit contributions
 size-0.006
 next-0.006
 three-0.006
el-0.006
ad-0.006
Token.
Token ID13
Feature activation-6.80e-01

Pos logit contributions
 size+0.023
 three+0.023
ad+0.023
el+0.022
 next+0.022
Neg logit contributions
 stret-0.037
ches-0.032
Charl-0.031
 treadmill-0.030
 safer-0.030
Token.
Token ID13
Feature activation+6.46e-01

Pos logit contributions
 next+0.023
 size+0.022
 only+0.022
 three+0.021
 nos+0.020
Neg logit contributions
 stret-0.046
Charl-0.041
ion-0.039
 treadmill-0.038
ches-0.038
Token Her
Token ID2332
Feature activation-1.28e+00

Pos logit contributions
 size+0.017
 next+0.017
ad+0.016
 three+0.016
ho+0.016
Neg logit contributions
 stret-0.028
Charl-0.024
ches-0.024
 treadmill-0.023
 wheelchair-0.022
Token mom
Token ID1995
Feature activation-7.67e-01

Pos logit contributions
 stret+0.061
 wheelchair+0.053
Charl+0.052
ches+0.050
 safer+0.049
Neg logit contributions
ad-0.029
ho-0.028
 markings-0.025
el-0.025
 size-0.025
Tokenmy
Token ID1820
Feature activation-9.96e-01

Pos logit contributions
 stret+0.031
ches+0.029
Charl+0.027
 safer+0.025
 wheelchair+0.025
Neg logit contributions
 size-0.021
 markings-0.021
umm-0.021
ho-0.020
ra-0.020
Token picked
Token ID6497
Feature activation-8.94e-01

Pos logit contributions
 stret+0.047
ches+0.043
Charl+0.040
 wheelchair+0.038
 safer+0.038
Neg logit contributions
 size-0.023
ho-0.021
 sill-0.021
 markings-0.021
 colors-0.021
Token her
Token ID607
Feature activation-2.76e+00

Pos logit contributions
 stret+0.046
Charl+0.040
 treadmill+0.040
ches+0.039
 wheelchair+0.038
Neg logit contributions
el-0.017
 next-0.017
 three-0.017
 only-0.017
 looking-0.016
Token up
Token ID510
Feature activation-5.25e-02

Pos logit contributions
 stret+0.135
Charl+0.124
ches+0.122
text+0.121
 treadmill+0.120
Neg logit contributions
 next-0.068
 three-0.059
 only-0.057
 looking-0.054
el-0.053
Token and
Token ID290
Feature activation-8.97e-01

Pos logit contributions
 stret+0.002
ches+0.002
Charl+0.002
 treadmill+0.002
 safer+0.002
Neg logit contributions
 next-0.001
 size-0.001
 three-0.001
el-0.001
 only-0.001
Token said
Token ID531
Feature activation-7.28e-01

Pos logit contributions
 stret+0.047
Charl+0.042
 treadmill+0.041
ches+0.041
 unnecessary+0.041
Neg logit contributions
 next-0.016
 thought-0.015
 nos-0.015
 looking-0.015
 only-0.014
Token,
Token ID11
Feature activation-1.89e-01

Pos logit contributions
 stret+0.028
ches+0.023
Charl+0.023
 treadmill+0.022
ion+0.022
Neg logit contributions
 size-0.023
ad-0.021
 next-0.021
flower-0.021
el-0.021
Token "
Token ID366
Feature activation-2.88e-01

Pos logit contributions
 stret+0.007
ches+0.005
Charl+0.005
 treadmill+0.005
ion+0.005
Neg logit contributions
 size-0.007
el-0.007
ad-0.007
flower-0.006
ra-0.006
Token day
Token ID1110
Feature activation-5.11e-01

Pos logit contributions
 next+0.004
 three+0.004
 size+0.003
 only+0.003
 nos+0.003
Neg logit contributions
 stret-0.022
Charl-0.019
ches-0.019
 treadmill-0.019
 unnecessary-0.018
Token,
Token ID11
Feature activation+3.31e-01

Pos logit contributions
 stret+0.029
ches+0.025
rig+0.025
 treadmill+0.024
Charl+0.024
Neg logit contributions
 three-0.009
 looking-0.008
 next-0.008
 only-0.008
 nos-0.008
Token her
Token ID607
Feature activation-8.73e-01

Pos logit contributions
 three+0.009
 next+0.009
 only+0.009
 looking+0.008
 nos+0.008
Neg logit contributions
 stret-0.015
ches-0.013
Charl-0.013
 treadmill-0.012
text-0.012
Token mom
Token ID1995
Feature activation-3.72e-02

Pos logit contributions
 stret+0.046
ches+0.041
Charl+0.041
 treadmill+0.040
 wheelchair+0.039
Neg logit contributions
 size-0.014
ad-0.013
el-0.013
 three-0.013
 next-0.013
Token made
Token ID925
Feature activation-5.19e-01

Pos logit contributions
 stret+0.002
ches+0.002
Charl+0.001
 treadmill+0.001
 safer+0.001
Neg logit contributions
 size-0.001
ho-0.001
ad-0.001
 markings-0.001
el-0.001
Token her
Token ID607
Feature activation-2.75e+00

Pos logit contributions
 stret+0.022
ches+0.019
Charl+0.018
ion+0.018
 treadmill+0.018
Neg logit contributions
 size-0.014
ho-0.014
 next-0.013
flower-0.013
 three-0.013
Token a
Token ID257
Feature activation-6.05e-01

Pos logit contributions
 stret+0.142
Charl+0.134
ches+0.134
ion+0.125
 treadmill+0.122
Neg logit contributions
 three-0.054
 next-0.046
 only-0.046
 3-0.046
 nos-0.046
Token y
Token ID331
Feature activation+4.63e-01

Pos logit contributions
 stret+0.026
ches+0.024
Charl+0.023
ion+0.022
text+0.022
Neg logit contributions
 three-0.016
 size-0.015
 nos-0.015
 next-0.015
 only-0.015
Tokenummy
Token ID13513
Feature activation+9.59e-02

Pos logit contributions
 next+0.006
 three+0.006
 size+0.006
 nos+0.006
 only+0.005
Neg logit contributions
 stret-0.025
ches-0.023
Charl-0.023
 treadmill-0.022
ion-0.022
Token sandwich
Token ID20433
Feature activation+6.66e-01

Pos logit contributions
 size+0.003
ho+0.003
el+0.003
ad+0.003
 next+0.003
Neg logit contributions
 stret-0.003
ches-0.003
 treadmill-0.003
Charl-0.003
ion-0.003
Token with
Token ID351
Feature activation+4.92e-01

Pos logit contributions
 size+0.017
 three+0.016
el+0.016
 next+0.016
 nos+0.016
Neg logit contributions
 stret-0.029
ches-0.025
Charl-0.025
 treadmill-0.024
ion-0.024
Token also
Token ID635
Feature activation-4.34e-01

Pos logit contributions
 three+0.006
 3+0.006
 colors+0.006
 size+0.006
 nos+0.006
Neg logit contributions
 stret-0.012
ches-0.011
 safer-0.010
Charl-0.010
 treadmill-0.010
Token help
Token ID1037
Feature activation-4.78e-01

Pos logit contributions
 stret+0.019
ches+0.016
 safer+0.016
Charl+0.016
 unnecessary+0.015
Neg logit contributions
el-0.011
ho-0.010
flower-0.010
 three-0.010
 size-0.010
Token you
Token ID345
Feature activation-3.04e-01

Pos logit contributions
 stret+0.024
Charl+0.020
 treadmill+0.020
ches+0.019
 wheelchair+0.019
Neg logit contributions
 next-0.011
 size-0.011
 three-0.010
 only-0.010
el-0.010
Token in
Token ID287
Feature activation-2.90e-01

Pos logit contributions
 stret+0.014
Charl+0.012
ches+0.012
 treadmill+0.011
ion+0.011
Neg logit contributions
 next-0.008
 size-0.008
 three-0.008
 only-0.007
 looking-0.007
Token tough
Token ID5802
Feature activation-7.80e-02

Pos logit contributions
 stret+0.013
ches+0.011
Charl+0.011
 treadmill+0.011
ion+0.010
Neg logit contributions
 size-0.007
 next-0.007
 three-0.007
el-0.007
 nos-0.007
Token situations
Token ID7445
Feature activation-2.75e+00

Pos logit contributions
 stret+0.002
ches+0.002
Charl+0.002
 treadmill+0.002
 safer+0.002
Neg logit contributions
 three-0.003
 size-0.003
 next-0.003
ho-0.003
ad-0.003
Token.
Token ID13
Feature activation+6.60e-01

Pos logit contributions
 stret+0.140
Charl+0.119
ches+0.117
 treadmill+0.117
text+0.115
Neg logit contributions
 next-0.063
 three-0.058
 size-0.057
 ever-0.056
 only-0.056
Token From
Token ID3574
Feature activation-9.14e-02

Pos logit contributions
 next+0.016
 three+0.016
el+0.015
 only+0.015
ad+0.015
Neg logit contributions
 stret-0.032
ches-0.026
 treadmill-0.026
Charl-0.025
 unnecessary-0.025
Token that
Token ID326
Feature activation-3.43e-02

Pos logit contributions
 stret+0.003
ches+0.003
Charl+0.003
 treadmill+0.003
 safer+0.003
Neg logit contributions
 size-0.003
 next-0.003
 three-0.003
el-0.003
ad-0.003
Token day
Token ID1110
Feature activation+8.88e-01

Pos logit contributions
 stret+0.001
Charl+0.001
ches+0.001
 safer+0.001
 wheelchair+0.001
Neg logit contributions
ad-0.001
ra-0.001
 markings-0.001
el-0.001
 size-0.001
Token on
Token ID319
Feature activation+3.89e-01

Pos logit contributions
 size+0.017
 next+0.017
 three+0.017
el+0.016
ad+0.016
Neg logit contributions
 stret-0.044
ches-0.039
Charl-0.039
 treadmill-0.037
 safer-0.036
Token
Token ID198
Feature activation-5.17e-01

Pos logit contributions
 stret+0.024
 unnecessary+0.021
ches+0.020
Charl+0.020
vern+0.019
Neg logit contributions
 next-0.007
 three-0.007
 thought-0.007
 only-0.007
 sun-0.006
Token
Token ID198
Feature activation-5.94e-01

Pos logit contributions
 stret+0.029
 unnecessary+0.026
 treadmill+0.025
Charl+0.024
ches+0.023
Neg logit contributions
 next-0.009
 three-0.008
 thought-0.008
 only-0.008
 looking-0.008
Token"
Token ID1
Feature activation+1.71e-01

Pos logit contributions
 stret+0.028
 unnecessary+0.024
text+0.022
ion+0.022
 clinic+0.021
Neg logit contributions
 next-0.018
 colors-0.015
 nos-0.015
 thought-0.015
 three-0.015
TokenHello
Token ID15496
Feature activation-8.58e-01

Pos logit contributions
 size+0.005
el+0.004
ra+0.004
 markings+0.004
ad+0.004
Neg logit contributions
 stret-0.007
Charl-0.007
ches-0.006
Beat-0.006
 safer-0.006
Token,
Token ID11
Feature activation-3.12e-01

Pos logit contributions
 stret+0.035
ches+0.030
Charl+0.030
 treadmill+0.029
 safer+0.028
Neg logit contributions
 size-0.023
 next-0.023
 three-0.023
ad-0.022
el-0.022
Token little
Token ID1310
Feature activation-2.75e+00

Pos logit contributions
 stret+0.014
ches+0.012
Charl+0.012
 treadmill+0.011
 safer+0.011
Neg logit contributions
 size-0.008
el-0.007
 next-0.007
 three-0.007
ad-0.007
Token girl
Token ID2576
Feature activation-9.16e-01

Pos logit contributions
 stret+0.120
Charl+0.104
ches+0.102
 safer+0.101
 treadmill+0.099
Neg logit contributions
 size-0.064
ad-0.063
el-0.062
 next-0.061
 three-0.061
Token.
Token ID13
Feature activation-1.13e+00

Pos logit contributions
 stret+0.036
Charl+0.030
ches+0.030
 treadmill+0.030
 unnecessary+0.028
Neg logit contributions
 next-0.028
 three-0.028
 size-0.027
 nos-0.027
el-0.027
Token What
Token ID1867
Feature activation-7.33e-01

Pos logit contributions
 stret+0.039
Charl+0.036
ches+0.033
 treadmill+0.033
ion+0.032
Neg logit contributions
 size-0.038
el-0.035
 colors-0.035
ad-0.035
 next-0.034
Token's
Token ID338
Feature activation-5.61e-01

Pos logit contributions
 stret+0.033
ches+0.027
Charl+0.026
 treadmill+0.026
ion+0.025
Neg logit contributions
 three-0.021
 nos-0.021
 next-0.021
 only-0.020
 colors-0.020
Token your
Token ID534
Feature activation-7.25e-01

Pos logit contributions
 stret+0.021
ches+0.017
Charl+0.017
 treadmill+0.017
text+0.016
Neg logit contributions
 next-0.021
 looking-0.020
 only-0.020
 three-0.019
 shining-0.019
Token,
Token ID11
Feature activation+3.28e-03

Pos logit contributions
 stret+0.047
ches+0.040
Charl+0.039
 treadmill+0.039
rig+0.039
Neg logit contributions
 three-0.016
 next-0.015
 looking-0.015
 nos-0.014
 only-0.014
Token Lily
Token ID20037
Feature activation-6.21e-01

Pos logit contributions
 size+0.000
 next+0.000
 three+0.000
el+0.000
ad+0.000
Neg logit contributions
 stret-0.000
ches-0.000
Charl-0.000
 treadmill-0.000
 safer-0.000
Token's
Token ID338
Feature activation-8.72e-01

Pos logit contributions
 stret+0.025
Charl+0.021
 treadmill+0.020
ches+0.020
ion+0.020
Neg logit contributions
 next-0.019
 three-0.019
 size-0.018
 only-0.018
 nos-0.018
Token mom
Token ID1995
Feature activation+3.10e-02

Pos logit contributions
 stret+0.037
 treadmill+0.031
Charl+0.030
 wheelchair+0.030
 hospital+0.029
Neg logit contributions
ad-0.022
ped-0.021
 markings-0.020
 ever-0.019
 size-0.019
Token made
Token ID925
Feature activation-4.24e-01

Pos logit contributions
 markings+0.001
ho+0.001
 size+0.001
ad+0.001
 sill+0.001
Neg logit contributions
 stret-0.001
ches-0.001
Charl-0.001
 safer-0.001
 treadmill-0.001
Token her
Token ID607
Feature activation-2.74e+00

Pos logit contributions
 stret+0.017
ches+0.014
ion+0.014
Charl+0.014
 treadmill+0.013
Neg logit contributions
 size-0.012
ho-0.012
flower-0.012
 nos-0.011
 next-0.011
Token some
Token ID617
Feature activation+3.15e-02

Pos logit contributions
 stret+0.132
Charl+0.118
ches+0.117
 treadmill+0.112
ion+0.109
Neg logit contributions
 three-0.060
 next-0.057
 size-0.056
 only-0.055
 nos-0.054
Token lemon
Token ID18873
Feature activation+4.58e-01

Pos logit contributions
 size+0.001
 next+0.001
 three+0.001
ad+0.001
ho+0.001
Neg logit contributions
 stret-0.001
ches-0.001
Charl-0.001
 treadmill-0.001
ion-0.001
Tokenade
Token ID671
Feature activation-2.94e-01

Pos logit contributions
 next+0.019
 size+0.018
 three+0.018
el+0.018
 colors+0.018
Neg logit contributions
 stret-0.013
Charl-0.011
ches-0.010
 treadmill-0.010
 wheelchair-0.010
Token,
Token ID11
Feature activation-6.38e-01

Pos logit contributions
 stret+0.013
ches+0.012
Charl+0.011
 treadmill+0.011
 safer+0.011
Neg logit contributions
 size-0.007
 next-0.007
 three-0.007
el-0.007
ad-0.007
Token but
Token ID475
Feature activation-2.77e-01

Pos logit contributions
 stret+0.026
ches+0.022
 treadmill+0.021
ion+0.021
Charl+0.021
Neg logit contributions
ad-0.018
 size-0.018
el-0.018
ho-0.017
 next-0.017
Token Her
Token ID2332
Feature activation-1.13e+00

Pos logit contributions
 size+0.016
 next+0.015
 three+0.015
ad+0.015
el+0.015
Neg logit contributions
 stret-0.021
Charl-0.018
ches-0.018
 treadmill-0.017
 wheelchair-0.016
Token friends
Token ID2460
Feature activation+9.40e-04

Pos logit contributions
 stret+0.050
 wheelchair+0.044
Charl+0.043
 treadmill+0.041
 yoga+0.041
Neg logit contributions
ad-0.027
ho-0.025
el-0.025
 markings-0.024
 three-0.023
Token tried
Token ID3088
Feature activation-9.14e-01

Pos logit contributions
 size+0.000
umm+0.000
 sill+0.000
 colors+0.000
 markings+0.000
Neg logit contributions
 stret-0.000
ches-0.000
Charl-0.000
 treadmill-0.000
 wheelchair-0.000
Token to
Token ID284
Feature activation-1.32e+00

Pos logit contributions
 stret+0.042
ches+0.037
Charl+0.036
 treadmill+0.035
 safer+0.034
Neg logit contributions
 size-0.021
 three-0.020
 next-0.020
el-0.020
ad-0.020
Token make
Token ID787
Feature activation-1.20e+00

Pos logit contributions
 stret+0.055
ches+0.046
Charl+0.045
 treadmill+0.044
 safer+0.043
Neg logit contributions
ho-0.038
 size-0.037
el-0.036
 three-0.036
 colors-0.035
Token her
Token ID607
Feature activation-2.73e+00

Pos logit contributions
 stret+0.060
ches+0.053
Charl+0.053
 treadmill+0.052
ion+0.050
Neg logit contributions
 three-0.024
 next-0.023
 only-0.023
 looking-0.022
el-0.022
Token feel
Token ID1254
Feature activation-4.50e-01

Pos logit contributions
 stret+0.127
Charl+0.118
ches+0.115
ion+0.110
 treadmill+0.109
Neg logit contributions
 three-0.062
 next-0.061
 only-0.059
 size-0.059
 nos-0.058
Token better
Token ID1365
Feature activation-3.99e-03

Pos logit contributions
 stret+0.017
Charl+0.014
 Street+0.014
 treadmill+0.014
ches+0.013
Neg logit contributions
 only-0.016
 next-0.016
 nos-0.016
 looking-0.015
 three-0.015
Token,
Token ID11
Feature activation-2.68e-01

Pos logit contributions
 stret+0.000
ches+0.000
Charl+0.000
 treadmill+0.000
ion+0.000
Neg logit contributions
 size-0.000
 next-0.000
 three-0.000
 only-0.000
el-0.000
Token but
Token ID475
Feature activation+6.03e-02

Pos logit contributions
 stret+0.010
 safer+0.009
ches+0.009
Charl+0.008
 treadmill+0.008
Neg logit contributions
ad-0.008
el-0.008
ra-0.008
ho-0.008
 size-0.008
Token nothing
Token ID2147
Feature activation+2.42e-01

Pos logit contributions
 next+0.002
 size+0.002
 three+0.002
 only+0.002
 nos+0.002
Neg logit contributions
 stret-0.003
ches-0.002
Charl-0.002
 treadmill-0.002
ion-0.002
Token,
Token ID11
Feature activation+8.00e-01

Pos logit contributions
 stret+0.009
Charl+0.008
ches+0.008
 treadmill+0.007
ion+0.007
Neg logit contributions
 size-0.006
 three-0.006
 next-0.006
 colors-0.006
el-0.006
Token there
Token ID612
Feature activation-1.19e-01

Pos logit contributions
 size+0.017
ad+0.016
 markings+0.016
 colors+0.015
ho+0.015
Neg logit contributions
 stret-0.037
ches-0.033
Charl-0.033
 safer-0.032
ion-0.031
Token lived
Token ID5615
Feature activation+1.72e-01

Pos logit contributions
 stret+0.005
ches+0.005
Charl+0.005
 treadmill+0.004
 safer+0.004
Neg logit contributions
 size-0.003
 next-0.003
 three-0.003
 nos-0.003
el-0.003
Token an
Token ID281
Feature activation-3.70e-02

Pos logit contributions
 size+0.005
ho+0.005
el+0.004
 markings+0.004
ad+0.004
Neg logit contributions
 stret-0.007
Charl-0.006
ches-0.006
 safer-0.006
ion-0.006
Token elderly
Token ID13830
Feature activation-1.83e-01

Pos logit contributions
 stret+0.002
ches+0.001
Charl+0.001
 treadmill+0.001
 safer+0.001
Neg logit contributions
 size-0.001
ho-0.001
ra-0.001
 next-0.001
el-0.001
Token man
Token ID582
Feature activation+1.79e+00

Pos logit contributions
 stret+0.008
Charl+0.007
ches+0.007
 treadmill+0.006
ion+0.006
Neg logit contributions
 size-0.005
 next-0.005
 three-0.005
el-0.005
 colors-0.004
Token in
Token ID287
Feature activation+2.75e-02

Pos logit contributions
 size+0.067
 nos+0.065
 colors+0.064
umm+0.064
 markings+0.063
Neg logit contributions
 stret-0.054
ches-0.045
Charl-0.043
ion-0.042
 clinic-0.040
Token a
Token ID257
Feature activation-4.75e-02

Pos logit contributions
ho+0.001
flower+0.001
 size+0.001
el+0.001
 colors+0.001
Neg logit contributions
 stret-0.001
Charl-0.001
 safer-0.001
ches-0.001
 wheelchair-0.001
Token village
Token ID7404
Feature activation+5.51e-01

Pos logit contributions
 stret+0.002
ches+0.002
ion+0.002
Charl+0.002
 treadmill+0.002
Neg logit contributions
 three-0.001
 next-0.001
 nos-0.001
 size-0.001
 looking-0.001
Token.
Token ID13
Feature activation-3.83e-01

Pos logit contributions
umm+0.020
 markings+0.020
ped+0.019
ra+0.019
 size+0.019
Neg logit contributions
 stret-0.016
 clinic-0.013
 safer-0.013
ion-0.012
 wheelchair-0.012
Token He
Token ID679
Feature activation-3.08e-02

Pos logit contributions
 stret+0.015
Charl+0.013
 treadmill+0.013
ches+0.013
 wheelchair+0.013
Neg logit contributions
 size-0.011
ho-0.011
 next-0.010
ra-0.010
flower-0.010
TokenJim
Token ID18050
Feature activation+1.15e+00

Pos logit contributions
 next+0.004
 three+0.004
 size+0.004
el+0.004
ad+0.004
Neg logit contributions
 stret-0.005
ches-0.004
Charl-0.004
 treadmill-0.004
 safer-0.004
Token was
Token ID373
Feature activation+2.47e-03

Pos logit contributions
 size+0.025
 three+0.025
 next+0.024
 nos+0.023
ho+0.023
Neg logit contributions
 stret-0.053
ches-0.047
Charl-0.046
 treadmill-0.044
ion-0.044
Token a
Token ID257
Feature activation+6.87e-01

Pos logit contributions
ho+0.000
 markings+0.000
ad+0.000
 size+0.000
 colors+0.000
Neg logit contributions
 stret-0.000
Charl-0.000
 safer-0.000
 wheelchair-0.000
ches-0.000
Token happy
Token ID3772
Feature activation+1.16e+00

Pos logit contributions
ad+0.014
 size+0.014
el+0.014
 next+0.014
ho+0.013
Neg logit contributions
 stret-0.033
Charl-0.028
ches-0.028
 treadmill-0.027
 safer-0.027
Token little
Token ID1310
Feature activation-1.25e-01

Pos logit contributions
ped+0.030
ra+0.029
 markings+0.029
el+0.029
ad+0.029
Neg logit contributions
 stret-0.049
Charl-0.041
 wheelchair-0.040
 concert-0.038
 safer-0.038
Token boy
Token ID2933
Feature activation+1.98e+00

Pos logit contributions
 stret+0.005
Charl+0.005
ches+0.005
 treadmill+0.004
 safer+0.004
Neg logit contributions
ad-0.003
 size-0.003
ra-0.003
el-0.003
 next-0.003
Token.
Token ID13
Feature activation-1.75e-01

Pos logit contributions
 size+0.056
 next+0.055
 three+0.054
 nos+0.054
el+0.053
Neg logit contributions
 stret-0.080
ches-0.068
Charl-0.067
 treadmill-0.064
ion-0.063
Token He
Token ID679
Feature activation+9.52e-01

Pos logit contributions
 stret+0.007
Charl+0.006
 treadmill+0.006
 safer+0.006
ches+0.006
Neg logit contributions
ra-0.004
 size-0.004
ho-0.004
el-0.004
 nos-0.004
Token had
Token ID550
Feature activation+2.29e-01

Pos logit contributions
 size+0.019
 next+0.019
 three+0.019
el+0.018
ho+0.018
Neg logit contributions
 stret-0.046
ches-0.041
Charl-0.040
 treadmill-0.039
 safer-0.038
Token a
Token ID257
Feature activation+8.55e-01

Pos logit contributions
ho+0.008
flower+0.007
el+0.007
 size+0.007
ra+0.007
Neg logit contributions
 stret-0.008
Charl-0.007
 wheelchair-0.006
 safer-0.006
ches-0.006
Token pet
Token ID4273
Feature activation+1.09e+00

Pos logit contributions
 size+0.022
ad+0.021
el+0.021
 next+0.021
 three+0.021
Neg logit contributions
 stret-0.037
Charl-0.032
ches-0.031
 treadmill-0.030
 wheelchair-0.030
Token horns
Token ID26771
Feature activation+1.15e+00

Pos logit contributions
 size+0.011
ad+0.011
ho+0.011
el+0.011
flower+0.010
Neg logit contributions
 stret-0.017
ches-0.015
Charl-0.014
 safer-0.014
ion-0.014
Token.
Token ID13
Feature activation-9.82e-01

Pos logit contributions
 size+0.035
 next+0.034
 three+0.033
 nos+0.033
ad+0.033
Neg logit contributions
 stret-0.044
ches-0.038
Charl-0.035
ion-0.035
 treadmill-0.035
Token They
Token ID1119
Feature activation-5.50e-01

Pos logit contributions
 stret+0.040
ches+0.034
Charl+0.034
 treadmill+0.033
 safer+0.032
Neg logit contributions
 size-0.028
 next-0.027
 three-0.027
ho-0.026
ad-0.026
Token say
Token ID910
Feature activation+3.82e-01

Pos logit contributions
 stret+0.018
ches+0.016
 safer+0.015
 treadmill+0.013
 clinic+0.013
Neg logit contributions
umm-0.019
 size-0.019
el-0.019
ad-0.019
 next-0.019
Token m
Token ID285
Feature activation+3.24e-02

Pos logit contributions
 size+0.012
el+0.011
ad+0.011
 next+0.011
flower+0.011
Neg logit contributions
 stret-0.015
ches-0.013
Charl-0.012
 treadmill-0.012
ion-0.012
Tokenoo
Token ID2238
Feature activation+1.96e+00

Pos logit contributions
 size+0.001
 colors+0.001
 shining+0.001
 three+0.001
 looking+0.001
Neg logit contributions
 stret-0.001
ion-0.001
ches-0.001
 treadmill-0.001
Charl-0.001
Token."
Token ID526
Feature activation-1.03e+00

Pos logit contributions
 size+0.058
 next+0.058
 three+0.058
el+0.056
 nos+0.055
Neg logit contributions
 stret-0.077
Charl-0.065
ches-0.065
 treadmill-0.062
 safer-0.060
Token\n
Token ID198
Feature activation-8.79e-01

Pos logit contributions
 stret+0.045
ches+0.040
Charl+0.038
 treadmill+0.037
 unnecessary+0.037
Neg logit contributions
 next-0.027
 three-0.026
 only-0.026
el-0.025
ad-0.025
Token\n
Token ID198
Feature activation-9.97e-01

Pos logit contributions
 stret+0.047
 unnecessary+0.043
ches+0.043
Charl+0.041
 treadmill+0.041
Neg logit contributions
 next-0.016
ad-0.013
 only-0.013
 thought-0.012
 three-0.011
TokenTom
Token ID13787
Feature activation+5.32e-01

Pos logit contributions
 stret+0.050
ches+0.048
 unnecessary+0.045
ion+0.043
text+0.042
Neg logit contributions
 next-0.022
ad-0.018
 nos-0.017
Mom-0.017
 only-0.017
Token thinks
Token ID6834
Feature activation-4.24e-01

Pos logit contributions
 size+0.014
 next+0.014
 three+0.014
 only+0.013
el+0.013
Neg logit contributions
 stret-0.023
Charl-0.020
ches-0.019
 treadmill-0.019
 safer-0.018
Token two
Token ID734
Feature activation+1.25e+00

Pos logit contributions
 size+0.020
ho+0.019
 markings+0.019
ra+0.018
sed+0.018
Neg logit contributions
text-0.029
 stret-0.028
 safer-0.026
Charl-0.026
 Woo-0.025
Token friends
Token ID2460
Feature activation+1.04e+00

Pos logit contributions
ra+0.030
ho+0.028
 markings+0.027
ad+0.026
 sill+0.026
Neg logit contributions
 stret-0.058
 safer-0.050
ches-0.049
 treadmill-0.047
udd-0.046
Token who
Token ID508
Feature activation-9.29e-01

Pos logit contributions
 size+0.037
ra+0.036
ho+0.036
 markings+0.035
 colors+0.035
Neg logit contributions
 stret-0.035
ches-0.029
Charl-0.028
 safer-0.026
ion-0.026
Token were
Token ID547
Feature activation+1.16e+00

Pos logit contributions
 stret+0.035
ches+0.030
Charl+0.029
 safer+0.029
text+0.028
Neg logit contributions
umm-0.028
el-0.027
 size-0.027
 next-0.026
ho-0.025
Token two
Token ID734
Feature activation+1.42e+00

Pos logit contributions
 markings+0.031
ad+0.031
umm+0.031
 size+0.030
ho+0.030
Neg logit contributions
 stret-0.050
 safer-0.043
 hospital-0.040
 treadmill-0.040
 clinic-0.040
Token circles
Token ID13332
Feature activation+2.10e+00

Pos logit contributions
ra+0.029
ad+0.028
ho+0.027
 next+0.027
el+0.026
Neg logit contributions
 stret-0.070
ches-0.060
 safer-0.059
 treadmill-0.058
 unnecessary-0.055
Token.
Token ID13
Feature activation-3.16e-01

Pos logit contributions
 nos+0.071
ra+0.070
 size+0.069
 markings+0.068
ho+0.067
Neg logit contributions
 stret-0.073
ches-0.060
ion-0.058
Charl-0.055
 treadmill-0.052
Token One
Token ID1881
Feature activation+9.31e-01

Pos logit contributions
 stret+0.010
 treadmill+0.008
ion+0.008
Charl+0.008
ches+0.008
Neg logit contributions
 size-0.011
 next-0.011
 nos-0.011
ho-0.011
 shining-0.011
Token lived
Token ID5615
Feature activation-4.27e-02

Pos logit contributions
 size+0.021
ad+0.020
 markings+0.020
ra+0.020
el+0.020
Neg logit contributions
 stret-0.039
ches-0.037
 safer-0.034
ion-0.034
text-0.033
Token far
Token ID1290
Feature activation+4.62e-02

Pos logit contributions
 stret+0.002
Charl+0.002
 safer+0.002
 bases+0.001
ches+0.001
Neg logit contributions
umm-0.001
 markings-0.001
ad-0.001
ho-0.001
 sill-0.001
Token away
Token ID1497
Feature activation+6.35e-02

Pos logit contributions
 size+0.001
 three+0.001
 next+0.001
ad+0.001
el+0.001
Neg logit contributions
 stret-0.002
ches-0.002
Charl-0.002
 treadmill-0.002
ion-0.002
Token set
Token ID900
Feature activation+7.03e-01

Pos logit contributions
 stret+0.009
ches+0.008
Charl+0.008
text+0.007
 safer+0.007
Neg logit contributions
 size-0.005
 nos-0.005
 three-0.005
 next-0.005
 colors-0.005
Token off
Token ID572
Feature activation-2.53e-01

Pos logit contributions
 size+0.028
 nos+0.027
ho+0.027
 colors+0.027
 markings+0.027
Neg logit contributions
 stret-0.021
ches-0.017
 safer-0.016
Charl-0.015
ion-0.015
Token with
Token ID351
Feature activation+5.36e-02

Pos logit contributions
 stret+0.011
ches+0.009
 safer+0.009
 wheelchair+0.008
Charl+0.008
Neg logit contributions
umm-0.007
 size-0.007
ra-0.007
el-0.007
 markings-0.007
Token a
Token ID257
Feature activation+5.39e-01

Pos logit contributions
ho+0.002
ra+0.002
flower+0.002
 size+0.002
 markings+0.002
Neg logit contributions
 stret-0.002
 safer-0.002
Charl-0.002
ches-0.002
 wheelchair-0.001
Token big
Token ID1263
Feature activation+4.58e-01

Pos logit contributions
 size+0.010
 three+0.010
 next+0.010
 nos+0.010
el+0.009
Neg logit contributions
 stret-0.027
ches-0.024
Charl-0.024
 treadmill-0.023
ion-0.023
Token grin
Token ID16770
Feature activation+2.00e+00

Pos logit contributions
ra+0.010
 size+0.010
 next+0.010
ped+0.010
 only+0.009
Neg logit contributions
 stret-0.022
 safer-0.018
 wheelchair-0.018
ches-0.018
Charl-0.018
Token on
Token ID319
Feature activation+1.64e+00

Pos logit contributions
 size+0.048
 colors+0.046
 nos+0.046
ad+0.045
el+0.044
Neg logit contributions
 stret-0.089
ches-0.078
Charl-0.074
ion-0.072
 wheelchair-0.071
Token his
Token ID465
Feature activation+4.34e-01

Pos logit contributions
flower+0.051
 size+0.050
ho+0.049
 nos+0.048
 next+0.047
Neg logit contributions
 stret-0.064
ches-0.052
Charl-0.050
text-0.048
ion-0.048
Token face
Token ID1986
Feature activation+1.77e+00

Pos logit contributions
ho+0.008
 size+0.008
flower+0.008
el+0.008
 only+0.007
Neg logit contributions
 stret-0.022
ches-0.019
 safer-0.018
Charl-0.018
 wheelchair-0.018
Token.
Token ID13
Feature activation+9.67e-01

Pos logit contributions
 size+0.061
 nos+0.061
ra+0.060
 colors+0.058
ho+0.058
Neg logit contributions
 stret-0.056
ches-0.047
ion-0.044
land-0.044
 safer-0.042
Token He
Token ID679
Feature activation+6.34e-01

Pos logit contributions
 three+0.017
 next+0.017
 size+0.017
ad+0.016
el+0.016
Neg logit contributions
 stret-0.051
ches-0.044
Charl-0.044
 treadmill-0.043
ion-0.042
Token,
Token ID11
Feature activation-1.47e-01

Pos logit contributions
 size+0.006
 three+0.006
 next+0.006
ad+0.006
el+0.006
Neg logit contributions
 stret-0.010
ches-0.009
Charl-0.009
 treadmill-0.008
 safer-0.008
Token they
Token ID484
Feature activation-5.92e-02

Pos logit contributions
 stret+0.006
ches+0.005
Charl+0.005
 treadmill+0.005
ion+0.005
Neg logit contributions
 three-0.004
 next-0.004
 size-0.004
 only-0.004
 nos-0.004
Token saw
Token ID2497
Feature activation+5.68e-01

Pos logit contributions
 stret+0.003
ches+0.002
Charl+0.002
 treadmill+0.002
 safer+0.002
Neg logit contributions
 size-0.001
 next-0.001
el-0.001
ad-0.001
 colors-0.001
Token Lily
Token ID20037
Feature activation-8.68e-01

Pos logit contributions
 three+0.014
 next+0.014
 only+0.014
 nos+0.013
 looking+0.013
Neg logit contributions
 stret-0.026
ches-0.023
Charl-0.022
 treadmill-0.021
 wheelchair-0.021
Token's
Token ID338
Feature activation-4.02e-01

Pos logit contributions
 stret+0.031
ches+0.026
Charl+0.024
 treadmill+0.024
 wheelchair+0.023
Neg logit contributions
 size-0.028
ho-0.028
ad-0.028
 three-0.028
 nos-0.027
Token grand
Token ID4490
Feature activation+2.64e-01

Pos logit contributions
 stret+0.016
 wheelchair+0.014
 yoga+0.013
 clinic+0.013
 hospital+0.013
Neg logit contributions
ad-0.012
ped-0.011
el-0.011
ho-0.011
cho-0.011
Tokenpa
Token ID8957
Feature activation+5.61e-02

Pos logit contributions
ho+0.008
flower+0.007
aped+0.007
cho+0.007
 blu+0.007
Neg logit contributions
 stret-0.011
ches-0.009
 concert-0.008
 Street-0.008
land-0.008
Token standing
Token ID5055
Feature activation+3.51e-01

Pos logit contributions
 size+0.002
ho+0.002
 colors+0.002
 rays+0.002
 nos+0.002
Neg logit contributions
 stret-0.002
ches-0.002
Charl-0.002
 safer-0.002
 Street-0.001
Token there
Token ID612
Feature activation-7.48e-01

Pos logit contributions
ho+0.008
 size+0.008
 markings+0.008
ad+0.008
 e+0.007
Neg logit contributions
 stret-0.015
Charl-0.013
 wheelchair-0.013
ches-0.013
 safer-0.013
Token with
Token ID351
Feature activation+2.42e-01

Pos logit contributions
 stret+0.022
ches+0.019
land+0.018
 safer+0.018
Charl+0.018
Neg logit contributions
ra-0.026
 nos-0.026
el-0.025
 markings-0.025
ped-0.025
Token a
Token ID257
Feature activation+4.91e-01

Pos logit contributions
ho+0.008
flower+0.008
ra+0.008
ok+0.007
 next+0.007
Neg logit contributions
 stret-0.009
Charl-0.007
 safer-0.007
 wheelchair-0.007
ion-0.007
Token but
Token ID475
Feature activation-2.89e-01

Pos logit contributions
 stret+0.013
Charl+0.011
ches+0.010
 treadmill+0.010
 wheelchair+0.010
Neg logit contributions
 next-0.011
 three-0.011
 size-0.011
 only-0.010
ho-0.010
Token it
Token ID340
Feature activation+2.26e-01

Pos logit contributions
 stret+0.012
ches+0.011
Charl+0.010
 treadmill+0.010
ion+0.010
Neg logit contributions
 size-0.008
 next-0.008
 three-0.008
el-0.007
 only-0.007
Token couldn
Token ID3521
Feature activation-2.24e-01

Pos logit contributions
 three+0.007
 size+0.007
ad+0.007
 next+0.007
 3+0.006
Neg logit contributions
 stret-0.008
ches-0.008
 safer-0.007
Charl-0.007
 treadmill-0.007
Token't
Token ID470
Feature activation+4.48e-01

Pos logit contributions
 stret+0.014
Charl+0.012
 unnecessary+0.012
 treadmill+0.012
 wheelchair+0.012
Neg logit contributions
 next-0.003
el-0.002
 only-0.002
ad-0.002
 ever-0.002
Token.
Token ID13
Feature activation+6.12e-01

Pos logit contributions
 next+0.010
 three+0.009
 only+0.009
 ever+0.009
ad+0.009
Neg logit contributions
 stret-0.023
Charl-0.021
 treadmill-0.020
 unnecessary-0.020
 wheelchair-0.019
Token It
Token ID632
Feature activation+3.44e-01

Pos logit contributions
 size+0.018
flower+0.018
 next+0.017
 colors+0.017
 shining+0.017
Neg logit contributions
 stret-0.023
Charl-0.020
 treadmill-0.019
ches-0.019
 wheelchair-0.019
Token didn
Token ID1422
Feature activation+1.62e-01

Pos logit contributions
 size+0.009
 three+0.009
ad+0.009
 next+0.009
 looking+0.009
Neg logit contributions
 stret-0.014
ches-0.013
Charl-0.012
 safer-0.011
ion-0.011
Token't
Token ID470
Feature activation+5.79e-01

Pos logit contributions
 next+0.001
 only+0.001
el+0.001
ad+0.001
 ever+0.001
Neg logit contributions
 stret-0.011
Charl-0.010
 unnecessary-0.010
 wheelchair-0.009
 clinic-0.009
Token even
Token ID772
Feature activation-3.60e-01

Pos logit contributions
 size+0.014
 three+0.014
 next+0.014
ad+0.013
el+0.013
Neg logit contributions
 stret-0.026
Charl-0.022
ches-0.022
 treadmill-0.022
ion-0.021
Token fl
Token ID781
Feature activation-2.27e-01

Pos logit contributions
 stret+0.017
Charl+0.015
 treadmill+0.014
ches+0.014
ion+0.014
Neg logit contributions
 three-0.009
 size-0.009
 only-0.009
 nos-0.008
 next-0.008
Tokenutter
Token ID10381
Feature activation+1.59e-01

Pos logit contributions
 stret+0.009
ches+0.008
Charl+0.008
ion+0.008
 unnecessary+0.007
Neg logit contributions
 size-0.006
 colors-0.006
 next-0.006
 three-0.006
 looking-0.006
Token years
Token ID812
Feature activation-1.32e+00

Pos logit contributions
ho+0.021
flower+0.020
 shining+0.020
 size+0.020
ad+0.019
Neg logit contributions
 stret-0.026
ches-0.022
Charl-0.021
 safer-0.021
 treadmill-0.021
Token old
Token ID1468
Feature activation+4.06e-01

Pos logit contributions
 stret+0.051
 safer+0.041
ches+0.041
ion+0.039
Charl+0.039
Neg logit contributions
flower-0.040
ho-0.039
 size-0.039
ad-0.039
ra-0.038
Token and
Token ID290
Feature activation+1.04e+00

Pos logit contributions
 size+0.012
el+0.011
ho+0.011
ad+0.011
 next+0.011
Neg logit contributions
 stret-0.016
Charl-0.013
ches-0.013
 treadmill-0.013
 safer-0.012
Token very
Token ID845
Feature activation-3.20e-01

Pos logit contributions
el+0.028
flower+0.028
ho+0.028
 size+0.028
ad+0.027
Neg logit contributions
 stret-0.040
Charl-0.036
ches-0.035
 safer-0.035
 treadmill-0.034
Token brave
Token ID14802
Feature activation-2.49e-02

Pos logit contributions
 stret+0.016
Charl+0.014
ches+0.014
 safer+0.014
 treadmill+0.014
Neg logit contributions
 size-0.005
ad-0.005
flower-0.005
el-0.005
ho-0.005
Token.
Token ID13
Feature activation+1.28e-01

Pos logit contributions
 stret+0.001
ches+0.001
Charl+0.001
 safer+0.001
 unnecessary+0.001
Neg logit contributions
ho-0.001
flower-0.001
ad-0.001
el-0.001
ra-0.001
Token One
Token ID1881
Feature activation+9.78e-01

Pos logit contributions
 size+0.003
 next+0.003
 three+0.003
ho+0.003
el+0.003
Neg logit contributions
 stret-0.005
Charl-0.005
ches-0.005
 treadmill-0.005
ion-0.004
Token day
Token ID1110
Feature activation-2.22e-01

Pos logit contributions
ra+0.025
 size+0.024
ad+0.023
sed+0.023
ho+0.023
Neg logit contributions
 stret-0.037
ches-0.035
ion-0.035
 safer-0.034
 wheelchair-0.033
Token,
Token ID11
Feature activation+2.38e-01

Pos logit contributions
 stret+0.009
Charl+0.007
ches+0.007
 treadmill+0.007
ion+0.007
Neg logit contributions
 size-0.007
ho-0.006
el-0.006
flower-0.006
 colors-0.006
Token his
Token ID465
Feature activation+4.00e-02

Pos logit contributions
 size+0.007
el+0.007
ho+0.007
ad+0.007
 colors+0.007
Neg logit contributions
 stret-0.009
Charl-0.008
 treadmill-0.008
ches-0.007
 safer-0.007
Token parents
Token ID3397
Feature activation+3.24e-01

Pos logit contributions
ad+0.001
el+0.001
flower+0.001
ped+0.001
ho+0.001
Neg logit contributions
 stret-0.002
Charl-0.002
 treadmill-0.002
ches-0.002
 wheelchair-0.002
Token I
Token ID314
Feature activation-2.66e-01

Pos logit contributions
 stret+0.013
ches+0.011
Charl+0.011
 treadmill+0.010
 safer+0.010
Neg logit contributions
 size-0.011
 next-0.011
 three-0.011
el-0.011
ad-0.011
Token can
Token ID460
Feature activation+2.91e-01

Pos logit contributions
 stret+0.009
ches+0.008
Charl+0.007
 treadmill+0.007
 safer+0.007
Neg logit contributions
 size-0.009
 next-0.009
el-0.009
 three-0.009
 colors-0.009
Token sail
Token ID14936
Feature activation-1.21e+00

Pos logit contributions
 size+0.008
 next+0.008
 three+0.008
ho+0.008
 colors+0.008
Neg logit contributions
 stret-0.012
ches-0.010
Charl-0.010
 treadmill-0.010
 safer-0.009
Token."
Token ID526
Feature activation+1.30e-01

Pos logit contributions
 stret+0.053
Charl+0.045
ches+0.045
 treadmill+0.044
 wheelchair+0.042
Neg logit contributions
 next-0.031
 three-0.031
 size-0.031
el-0.029
 only-0.029
Token<|endoftext|>
Token ID50256
Feature activation-1.63e-01

Pos logit contributions
 size+0.004
 next+0.004
 three+0.004
el+0.004
 nos+0.004
Neg logit contributions
 stret-0.005
ches-0.004
Charl-0.004
 treadmill-0.004
ion-0.004
TokenOnce
Token ID7454
Feature activation+1.94e-01

Pos logit contributions
 stret+0.007
ches+0.006
 treadmill+0.006
Charl+0.006
 unnecessary+0.005
Neg logit contributions
 next-0.005
 three-0.005
ad-0.004
el-0.004
 only-0.004
Token upon
Token ID2402
Feature activation-2.99e-01

Pos logit contributions
 next+0.003
 size+0.003
 three+0.003
el+0.003
ho+0.003
Neg logit contributions
 stret-0.011
ches-0.009
Charl-0.009
 treadmill-0.009
ion-0.009
Token a
Token ID257
Feature activation-1.36e+00

Pos logit contributions
 stret+0.015
ches+0.013
Charl+0.013
 treadmill+0.013
ion+0.013
Neg logit contributions
 three-0.006
 next-0.006
 only-0.006
 size-0.006
 looking-0.005
Token time
Token ID640
Feature activation-3.40e-01

Pos logit contributions
 stret+0.066
ches+0.057
Charl+0.057
 treadmill+0.056
ion+0.056
Neg logit contributions
 next-0.028
 size-0.028
 three-0.028
 only-0.027
 nos-0.026
Token,
Token ID11
Feature activation+4.83e-01

Pos logit contributions
 stret+0.014
ches+0.012
Charl+0.012
 treadmill+0.012
 safer+0.011
Neg logit contributions
 size-0.009
 three-0.009
 next-0.009
el-0.009
ad-0.009
Token there
Token ID612
Feature activation-3.14e-01

Pos logit contributions
ad+0.010
 size+0.010
 markings+0.010
ho+0.009
el+0.009
Neg logit contributions
 stret-0.023
ches-0.020
Charl-0.020
 safer-0.019
 treadmill-0.019
Token.
Token ID13
Feature activation-1.96e-01

Pos logit contributions
 next+0.002
 three+0.001
 size+0.001
 only+0.001
 looking+0.001
Neg logit contributions
 stret-0.003
Charl-0.003
 treadmill-0.003
ches-0.003
 wheelchair-0.003
Token\n
Token ID198
Feature activation-3.48e-01

Pos logit contributions
 stret+0.008
ches+0.007
Charl+0.007
 treadmill+0.006
 safer+0.006
Neg logit contributions
 size-0.006
 next-0.006
ho-0.005
 three-0.005
el-0.005
Token\n
Token ID198
Feature activation-3.93e-01

Pos logit contributions
 stret+0.014
ches+0.012
Charl+0.012
 treadmill+0.011
ion+0.011
Neg logit contributions
 next-0.010
 three-0.010
 size-0.010
el-0.010
ad-0.010
TokenThe
Token ID464
Feature activation-5.43e-03

Pos logit contributions
 stret+0.013
Charl+0.011
ches+0.011
 treadmill+0.010
 safer+0.010
Neg logit contributions
 size-0.014
el-0.014
 three-0.014
ho-0.014
 next-0.014
Token yellow
Token ID7872
Feature activation-5.74e-01

Pos logit contributions
 safer+0.000
 yoga+0.000
 mat+0.000
 stret+0.000
 proper+0.000
Neg logit contributions
ad-0.000
 markings-0.000
ums-0.000
cho-0.000
ped-0.000
Token flower
Token ID15061
Feature activation+5.78e-01

Pos logit contributions
 mat+0.013
 stret+0.013
ches+0.013
ion+0.012
 yoga+0.012
Neg logit contributions
ad-0.022
years-0.020
 markings-0.020
 thinking-0.020
cho-0.020
Token saw
Token ID2497
Feature activation+4.73e-01

Pos logit contributions
ad+0.017
umm+0.017
 size+0.016
 markings+0.016
el+0.015
Neg logit contributions
ches-0.021
 stret-0.021
Charl-0.018
 safer-0.018
 wheelchair-0.018
Token a
Token ID257
Feature activation-5.66e-02

Pos logit contributions
 size+0.014
ra+0.014
ho+0.013
el+0.013
flower+0.013
Neg logit contributions
 stret-0.018
ches-0.015
 safer-0.015
ion-0.015
Charl-0.015
Token dish
Token ID9433
Feature activation+3.83e-01

Pos logit contributions
 stret+0.003
Charl+0.002
ches+0.002
 safer+0.002
 treadmill+0.002
Neg logit contributions
el-0.001
ad-0.001
 size-0.001
 next-0.001
 three-0.001
Tokenwasher
Token ID45146
Feature activation-6.06e-02

Pos logit contributions
ad+0.010
 size+0.010
 markings+0.010
ra+0.010
ped+0.010
Neg logit contributions
 stret-0.014
ion-0.013
ches-0.012
 safer-0.012
 hospital-0.012
Token in
Token ID287
Feature activation+3.10e-01

Pos logit contributions
 stret+0.002
ches+0.002
ion+0.002
 safer+0.002
 hospital+0.002
Neg logit contributions
 markings-0.002
ra-0.002
 nos-0.002
ad-0.002
ho-0.002
Token could
Token ID714
Feature activation+5.22e-01

Pos logit contributions
umm+0.014
 size+0.014
 three+0.014
ad+0.014
ums+0.014
Neg logit contributions
ches-0.016
 stret-0.015
 safer-0.015
Charl-0.014
 else-0.014
Token grow
Token ID1663
Feature activation+2.41e-01

Pos logit contributions
umm+0.016
flower+0.015
ho+0.015
 markings+0.014
 size+0.014
Neg logit contributions
 stret-0.019
ches-0.017
 safer-0.017
 hospital-0.016
 wheelchair-0.015
Token big
Token ID1263
Feature activation+2.35e-02

Pos logit contributions
 next+0.006
 size+0.006
 three+0.006
 colors+0.005
 only+0.005
Neg logit contributions
 stret-0.012
Charl-0.010
 treadmill-0.010
ches-0.010
 wheelchair-0.009
Token and
Token ID290
Feature activation+6.27e-01

Pos logit contributions
ho+0.001
ad+0.001
 size+0.001
 three+0.001
flower+0.001
Neg logit contributions
 stret-0.001
ches-0.001
 safer-0.001
Charl-0.001
 treadmill-0.001
Token small
Token ID1402
Feature activation-3.11e-01

Pos logit contributions
ad+0.017
el+0.016
 size+0.016
flower+0.016
ums+0.016
Neg logit contributions
 stret-0.024
 safer-0.024
ches-0.023
 treadmill-0.022
 proper-0.021
Token.
Token ID13
Feature activation-3.06e-01

Pos logit contributions
 stret+0.010
ches+0.009
 safer+0.009
 treadmill+0.008
Charl+0.008
Neg logit contributions
ad-0.010
ho-0.010
flower-0.010
 next-0.009
el-0.009
Token Mia
Token ID32189
Feature activation-4.53e-01

Pos logit contributions
 stret+0.009
ches+0.008
ion+0.008
 safer+0.008
Charl+0.008
Neg logit contributions
 size-0.011
 shining-0.010
ho-0.010
flower-0.010
ra-0.010
Token loved
Token ID6151
Feature activation+2.74e-01

Pos logit contributions
 stret+0.019
ches+0.018
Charl+0.017
text+0.016
 safer+0.016
Neg logit contributions
 size-0.011
umm-0.011
ad-0.011
 shining-0.010
 sill-0.010
Token her
Token ID607
Feature activation-2.07e-01

Pos logit contributions
flower+0.009
ad+0.008
ho+0.008
ok+0.008
el+0.008
Neg logit contributions
 stret-0.009
 yoga-0.008
 safer-0.008
 wheelchair-0.008
Charl-0.008
Token lamp
Token ID20450
Feature activation+6.30e-01

Pos logit contributions
 stret+0.008
ches+0.007
 safer+0.007
Charl+0.007
 wheelchair+0.007
Neg logit contributions
ho-0.006
ad-0.006
el-0.005
 size-0.005
flower-0.005
Token very
Token ID845
Feature activation+1.05e-01

Pos logit contributions
ho+0.018
ad+0.018
 size+0.018
el+0.018
 three+0.017
Neg logit contributions
 stret-0.024
ches-0.021
 safer-0.020
Charl-0.020
 treadmill-0.020
Token a
Token ID257
Feature activation-1.03e-01

Pos logit contributions
 three+0.004
 next+0.003
 looking+0.003
 only+0.003
ad+0.003
Neg logit contributions
 stret-0.010
ches-0.009
Charl-0.009
 treadmill-0.008
ion-0.008
Token walk
Token ID2513
Feature activation+1.33e-01

Pos logit contributions
 stret+0.005
ion+0.004
ches+0.004
text+0.004
Charl+0.004
Neg logit contributions
 three-0.003
 looking-0.002
 next-0.002
 nos-0.002
 3-0.002
Token.
Token ID13
Feature activation-7.52e-02

Pos logit contributions
 markings+0.004
 size+0.004
el+0.004
ad+0.004
 e+0.004
Neg logit contributions
 stret-0.005
ches-0.004
 treadmill-0.004
Charl-0.004
 safer-0.004
Token She
Token ID1375
Feature activation-4.43e-01

Pos logit contributions
 stret+0.004
ches+0.003
Charl+0.003
 treadmill+0.003
 safer+0.003
Neg logit contributions
 three-0.002
 next-0.002
 size-0.002
 only-0.002
 looking-0.001
Token walked
Token ID6807
Feature activation-3.03e-01

Pos logit contributions
 stret+0.023
Charl+0.020
ches+0.020
 treadmill+0.019
ion+0.019
Neg logit contributions
 three-0.008
 size-0.008
 next-0.008
 only-0.008
el-0.008
Token very
Token ID845
Feature activation-1.57e-01

Pos logit contributions
 stret+0.011
ches+0.010
 treadmill+0.009
Charl+0.009
 safer+0.009
Neg logit contributions
el-0.010
 size-0.009
 markings-0.009
ra-0.009
ad-0.009
Token slow
Token ID3105
Feature activation-7.08e-02

Pos logit contributions
 stret+0.006
ches+0.005
Charl+0.005
 treadmill+0.005
 safer+0.005
Neg logit contributions
 size-0.005
 three-0.004
 next-0.004
el-0.004
ad-0.004
Token,
Token ID11
Feature activation-6.41e-01

Pos logit contributions
 stret+0.003
ches+0.003
Charl+0.003
 treadmill+0.003
ion+0.003
Neg logit contributions
 next-0.002
 size-0.002
 three-0.002
 looking-0.002
 only-0.002
Token taking
Token ID2263
Feature activation+3.19e-01

Pos logit contributions
 stret+0.025
 treadmill+0.021
 safer+0.021
ches+0.021
 wheelchair+0.020
Neg logit contributions
ad-0.020
el-0.020
ra-0.019
flower-0.019
 markings-0.018
Token in
Token ID287
Feature activation-3.82e-01

Pos logit contributions
 size+0.009
 three+0.009
 next+0.009
el+0.008
 nos+0.008
Neg logit contributions
 stret-0.013
ches-0.012
Charl-0.011
 treadmill-0.011
 safer-0.011
Token all
Token ID477
Feature activation-1.10e+00

Pos logit contributions
 stret+0.017
ches+0.015
Charl+0.014
 treadmill+0.014
ion+0.013
Neg logit contributions
 size-0.010
 next-0.010
 three-0.010
el-0.009
 only-0.009
Token the
Token ID262
Feature activation-1.72e-01

Pos logit contributions
 size+0.010
ra+0.010
el+0.010
ho+0.010
flower+0.010
Neg logit contributions
 stret-0.013
 safer-0.011
Charl-0.011
ches-0.011
 treadmill-0.011
Token puppy
Token ID26188
Feature activation-4.64e-01

Pos logit contributions
 stret+0.006
Charl+0.005
 treadmill+0.005
ches+0.004
 safer+0.004
Neg logit contributions
ad-0.006
 size-0.006
el-0.006
ho-0.006
ra-0.006
Token had
Token ID550
Feature activation-3.59e-01

Pos logit contributions
 stret+0.019
ches+0.018
Charl+0.016
 safer+0.016
 treadmill+0.016
Neg logit contributions
 size-0.012
 three-0.012
ad-0.012
el-0.012
 colors-0.011
Token tried
Token ID3088
Feature activation-6.49e-01

Pos logit contributions
 stret+0.015
ches+0.013
 safer+0.013
Charl+0.013
 unnecessary+0.013
Neg logit contributions
el-0.009
flower-0.009
ho-0.009
 size-0.009
 next-0.009
Token to
Token ID284
Feature activation-1.10e+00

Pos logit contributions
 stret+0.028
ches+0.025
Charl+0.024
 safer+0.024
 treadmill+0.024
Neg logit contributions
el-0.016
 size-0.016
 three-0.016
ho-0.015
ad-0.015
Token bite
Token ID13197
Feature activation-9.16e-01

Pos logit contributions
 stret+0.045
ches+0.037
 safer+0.035
Charl+0.034
 treadmill+0.034
Neg logit contributions
 three-0.032
el-0.032
 size-0.032
ho-0.032
 colors-0.031
Token him
Token ID683
Feature activation+2.77e-01

Pos logit contributions
 stret+0.044
Charl+0.039
ches+0.038
 treadmill+0.037
 wheelchair+0.036
Neg logit contributions
 next-0.019
 size-0.019
 three-0.019
el-0.018
 only-0.018
Token.
Token ID13
Feature activation+3.36e-01

Pos logit contributions
 size+0.007
 next+0.007
 three+0.007
el+0.007
ad+0.007
Neg logit contributions
 stret-0.012
ches-0.010
Charl-0.010
 treadmill-0.010
 safer-0.009
Token He
Token ID679
Feature activation-4.33e-02

Pos logit contributions
 size+0.008
 next+0.008
 three+0.008
el+0.008
ho+0.008
Neg logit contributions
 stret-0.015
ches-0.013
Charl-0.013
 treadmill-0.013
 safer-0.012
Token didn
Token ID1422
Feature activation+7.41e-02

Pos logit contributions
 stret+0.002
ches+0.002
Charl+0.002
 safer+0.002
 treadmill+0.001
Neg logit contributions
 size-0.001
flower-0.001
el-0.001
 shining-0.001
 three-0.001
Token't
Token ID470
Feature activation+1.46e-02

Pos logit contributions
 next+0.001
 only+0.001
 ever+0.000
ad+0.000
el+0.000
Neg logit contributions
 stret-0.005
Charl-0.004
 unnecessary-0.004
land-0.004
 Street-0.004
Token From
Token ID3574
Feature activation-7.71e-02

Pos logit contributions
 stret+0.002
ches+0.002
Charl+0.002
 treadmill+0.002
 safer+0.001
Neg logit contributions
 next-0.001
 three-0.001
 size-0.001
 only-0.001
ho-0.001
Token that
Token ID326
Feature activation-3.61e-01

Pos logit contributions
 stret+0.003
ches+0.002
Charl+0.002
 treadmill+0.002
 safer+0.002
Neg logit contributions
 size-0.002
 three-0.002
 next-0.002
el-0.002
ad-0.002
Token day
Token ID1110
Feature activation+2.21e-01

Pos logit contributions
Charl+0.013
 stret+0.013
ches+0.012
 safer+0.012
ion+0.012
Neg logit contributions
 markings-0.010
ra-0.010
 eyes-0.010
 lips-0.010
ad-0.010
Token,
Token ID11
Feature activation-5.94e-01

Pos logit contributions
 size+0.005
 colors+0.005
 three+0.005
ad+0.005
 next+0.005
Neg logit contributions
 stret-0.010
ches-0.009
Charl-0.009
 treadmill-0.008
ion-0.008
Token Sally
Token ID25737
Feature activation-5.29e-01

Pos logit contributions
 stret+0.023
ches+0.020
Charl+0.020
 safer+0.019
 treadmill+0.019
Neg logit contributions
 size-0.017
 three-0.016
el-0.016
 colors-0.016
 next-0.016
Token knew
Token ID2993
Feature activation-1.43e-01

Pos logit contributions
 stret+0.026
ches+0.023
Charl+0.022
 treadmill+0.022
 safer+0.021
Neg logit contributions
 size-0.011
 next-0.010
 three-0.010
el-0.010
ad-0.010
Token that
Token ID326
Feature activation-6.22e-02

Pos logit contributions
 stret+0.005
Charl+0.004
ches+0.004
ion+0.004
 safer+0.004
Neg logit contributions
 size-0.005
 three-0.005
el-0.005
ho-0.005
 colors-0.005
Token when
Token ID618
Feature activation+7.93e-01

Pos logit contributions
 safer+0.002
 stret+0.002
Charl+0.002
 yoga+0.002
ion+0.002
Neg logit contributions
ra-0.002
el-0.002
ped-0.002
ad-0.002
flower-0.002
Token she
Token ID673
Feature activation-6.45e-02

Pos logit contributions
flower+0.022
ad+0.022
ho+0.022
el+0.022
 sill+0.022
Neg logit contributions
 stret-0.028
 unnecessary-0.026
Charl-0.025
 safer-0.025
 yoga-0.024
Token was
Token ID373
Feature activation-3.59e-01

Pos logit contributions
 stret+0.003
ches+0.003
Charl+0.002
 treadmill+0.002
 safer+0.002
Neg logit contributions
 size-0.002
 three-0.002
 next-0.002
el-0.001
ad-0.001
Token busy
Token ID8179
Feature activation+3.19e-01

Pos logit contributions
 stret+0.017
ches+0.015
 safer+0.015
Charl+0.014
 wheelchair+0.014
Neg logit contributions
ho-0.008
 size-0.008
el-0.008
flower-0.007
ra-0.007
Token decide
Token ID5409
Feature activation-9.96e-01

Pos logit contributions
 stret+0.040
ches+0.034
Charl+0.033
 safer+0.032
 treadmill+0.032
Neg logit contributions
 size-0.033
ho-0.033
 three-0.032
el-0.032
ad-0.032
Token to
Token ID284
Feature activation-8.45e-01

Pos logit contributions
 stret+0.048
Charl+0.047
text+0.047
ches+0.044
 treadmill+0.043
Neg logit contributions
 next-0.028
 three-0.023
 looking-0.022
 only-0.022
 ever-0.022
Token play
Token ID711
Feature activation-3.52e-01

Pos logit contributions
 stret+0.034
ches+0.029
Charl+0.028
 treadmill+0.028
 safer+0.027
Neg logit contributions
 size-0.024
ho-0.023
 three-0.023
 next-0.023
el-0.023
Token a
Token ID257
Feature activation+5.94e-03

Pos logit contributions
 stret+0.017
Charl+0.016
ches+0.015
 treadmill+0.015
ion+0.015
Neg logit contributions
 next-0.009
 three-0.008
 only-0.007
 nos-0.007
 looking-0.007
Token different
Token ID1180
Feature activation+4.29e-01

Pos logit contributions
 next+0.000
 three+0.000
 nos+0.000
 only+0.000
 size+0.000
Neg logit contributions
 stret-0.000
Charl-0.000
ion-0.000
text-0.000
ches-0.000
Token game
Token ID983
Feature activation-6.62e-01

Pos logit contributions
 next+0.011
 size+0.011
 colors+0.011
 three+0.011
 looking+0.010
Neg logit contributions
 stret-0.020
Charl-0.018
ches-0.018
 unnecessary-0.017
 treadmill-0.017
Token."
Token ID526
Feature activation-2.85e-01

Pos logit contributions
 stret+0.026
Charl+0.022
ches+0.021
 treadmill+0.021
 unnecessary+0.020
Neg logit contributions
 next-0.022
 three-0.021
 size-0.020
 ever-0.020
 only-0.020
Token They
Token ID1119
Feature activation-1.51e-01

Pos logit contributions
 stret+0.013
ches+0.012
Charl+0.011
 unnecessary+0.011
 safer+0.011
Neg logit contributions
 three-0.007
 next-0.006
 only-0.006
 looking-0.006
 thought-0.006
Token started
Token ID2067
Feature activation-3.08e-01

Pos logit contributions
 stret+0.007
ches+0.006
Charl+0.006
 treadmill+0.006
 safer+0.006
Neg logit contributions
 size-0.004
 next-0.003
 three-0.003
el-0.003
ad-0.003
Token to
Token ID284
Feature activation-5.01e-01

Pos logit contributions
 stret+0.014
ches+0.012
Charl+0.012
 treadmill+0.012
 wheelchair+0.012
Neg logit contributions
 size-0.007
 next-0.007
 three-0.007
el-0.007
ad-0.007
Token play
Token ID711
Feature activation+1.98e-01

Pos logit contributions
 stret+0.022
ches+0.019
Charl+0.018
 treadmill+0.018
 safer+0.017
Neg logit contributions
 size-0.013
 next-0.013
 three-0.013
el-0.012
ad-0.012
Token feeling
Token ID4203
Feature activation-6.37e-01

Pos logit contributions
 size+0.014
 three+0.014
ho+0.014
 next+0.014
el+0.014
Neg logit contributions
 stret-0.022
ches-0.019
Charl-0.018
 treadmill-0.018
 safer-0.018
Token better
Token ID1365
Feature activation+4.17e-01

Pos logit contributions
 stret+0.023
Charl+0.019
ches+0.019
 treadmill+0.019
ion+0.018
Neg logit contributions
 next-0.023
 nos-0.022
 only-0.022
 looking-0.022
 three-0.022
Token too
Token ID1165
Feature activation+1.16e+00

Pos logit contributions
 size+0.011
 three+0.011
 next+0.011
el+0.011
ad+0.011
Neg logit contributions
 stret-0.017
ches-0.015
Charl-0.015
 treadmill-0.014
 safer-0.014
Token.
Token ID13
Feature activation-6.83e-02

Pos logit contributions
 size+0.031
 three+0.031
 next+0.031
el+0.030
ad+0.030
Neg logit contributions
 stret-0.049
ches-0.042
Charl-0.042
 treadmill-0.040
 safer-0.039
Token Max
Token ID5436
Feature activation-2.18e-01

Pos logit contributions
 stret+0.004
 unnecessary+0.003
ion+0.003
ches+0.003
Charl+0.003
Neg logit contributions
 next-0.001
 only-0.001
 thought-0.001
 three-0.001
 nos-0.001
Token's
Token ID338
Feature activation-1.61e+00

Pos logit contributions
 stret+0.011
Charl+0.009
 treadmill+0.009
ches+0.009
 unnecessary+0.009
Neg logit contributions
 next-0.005
 three-0.005
 only-0.005
 thought-0.005
 size-0.005
Token owner
Token ID4870
Feature activation+4.44e-01

Pos logit contributions
 stret+0.076
 wheelchair+0.063
 safer+0.062
 clinic+0.059
 hospital+0.058
Neg logit contributions
el-0.041
ho-0.040
ad-0.039
umm-0.038
years-0.037
Token also
Token ID635
Feature activation+3.28e-01

Pos logit contributions
 size+0.012
el+0.011
ho+0.011
 colors+0.011
 three+0.011
Neg logit contributions
 stret-0.019
ches-0.017
Charl-0.016
 safer-0.016
 treadmill-0.015
Token gave
Token ID2921
Feature activation-1.04e-01

Pos logit contributions
 three+0.006
 next+0.006
 size+0.006
 nos+0.006
 only+0.006
Neg logit contributions
 stret-0.017
ches-0.015
Charl-0.015
 treadmill-0.015
ion-0.014
Token him
Token ID683
Feature activation-7.03e-02

Pos logit contributions
 stret+0.005
ches+0.005
Charl+0.004
 treadmill+0.004
 safer+0.004
Neg logit contributions
 size-0.002
 next-0.002
 three-0.002
el-0.002
ad-0.002
Token a
Token ID257
Feature activation-1.10e-01

Pos logit contributions
 stret+0.003
Charl+0.003
ches+0.003
 treadmill+0.003
 wheelchair+0.003
Neg logit contributions
 three-0.001
 next-0.001
 nos-0.001
 size-0.001
 only-0.001
Token us
Token ID514
Feature activation+9.08e-01

Pos logit contributions
 stret+0.025
Charl+0.021
ches+0.021
 safer+0.020
 treadmill+0.020
Neg logit contributions
 size-0.020
ho-0.019
flower-0.019
 three-0.019
 next-0.019
Token!
Token ID0
Feature activation-1.26e+00

Pos logit contributions
 size+0.028
 next+0.028
 three+0.027
el+0.026
 nos+0.026
Neg logit contributions
 stret-0.035
Charl-0.029
ches-0.029
 treadmill-0.028
ion-0.027
Token Let
Token ID3914
Feature activation-9.17e-01

Pos logit contributions
 stret+0.053
ches+0.046
Charl+0.046
 treadmill+0.044
ion+0.043
Neg logit contributions
 size-0.034
 next-0.033
 three-0.033
el-0.032
 colors-0.032
Tokenâ
Token ID22940
Feature activation-4.81e-01

Pos logit contributions
 stret+0.041
 treadmill+0.036
Charl+0.035
 wheelchair+0.034
ion+0.033
Neg logit contributions
 next-0.026
 only-0.025
el-0.025
 three-0.023
 nos-0.023
Token€
Token ID26391
Feature activation-1.94e+00

Pos logit contributions
 stret+0.020
Charl+0.016
ches+0.016
 treadmill+0.016
 wheelchair+0.016
Neg logit contributions
ad-0.014
 next-0.014
el-0.013
 size-0.013
 three-0.013
Token™
Token ID8151
Feature activation-1.59e+00

Pos logit contributions
 stret+0.069
ion+0.057
ches+0.055
 treadmill+0.051
Charl+0.051
Neg logit contributions
 next-0.069
 nos-0.069
ra-0.067
 ever-0.067
 three-0.067
Tokens
Token ID82
Feature activation-6.97e-01

Pos logit contributions
 stret+0.063
Charl+0.051
 unnecessary+0.050
 treadmill+0.047
 wheelchair+0.047
Neg logit contributions
ho-0.056
 next-0.054
 ever-0.053
el-0.053
 three-0.051
Token share
Token ID2648
Feature activation-1.88e-01

Pos logit contributions
 stret+0.026
ches+0.022
Charl+0.021
 treadmill+0.021
 safer+0.020
Neg logit contributions
 next-0.021
 size-0.021
el-0.020
 three-0.020
ho-0.020
Token it
Token ID340
Feature activation-1.09e+00

Pos logit contributions
 stret+0.009
Charl+0.007
ches+0.007
 treadmill+0.007
 wheelchair+0.007
Neg logit contributions
 next-0.005
 three-0.005
 colors-0.005
 size-0.005
 looking-0.004
Token."
Token ID526
Feature activation-4.52e-01

Pos logit contributions
 stret+0.050
ches+0.043
Charl+0.043
 treadmill+0.042
 safer+0.040
Neg logit contributions
 size-0.025
 next-0.025
 three-0.025
el-0.024
ad-0.024
Token Pat
Token ID3208
Feature activation-1.64e-01

Pos logit contributions
 stret+0.018
ches+0.015
Charl+0.015
 treadmill+0.015
 safer+0.014
Neg logit contributions
 size-0.013
 next-0.013
 three-0.013
el-0.013
ad-0.013
Token that
Token ID326
Feature activation-1.57e-01

Pos logit contributions
 stret+0.039
Charl+0.034
ches+0.033
 treadmill+0.032
 wheelchair+0.031
Neg logit contributions
 next-0.020
 three-0.020
 size-0.020
 nos-0.019
 only-0.019
Token her
Token ID607
Feature activation-4.33e-01

Pos logit contributions
 safer+0.004
 stret+0.003
 hospital+0.003
 yoga+0.003
 else+0.003
Neg logit contributions
ad-0.006
ra-0.006
umm-0.006
el-0.006
 markings-0.006
Token kind
Token ID1611
Feature activation+5.09e-01

Pos logit contributions
 stret+0.022
ches+0.019
Charl+0.019
 treadmill+0.018
 safer+0.018
Neg logit contributions
 size-0.008
 three-0.008
 next-0.008
el-0.008
ad-0.008
Token heart
Token ID2612
Feature activation+8.57e-01

Pos logit contributions
years+0.013
 shining+0.013
ra+0.012
aped+0.012
 next+0.012
Neg logit contributions
 stret-0.019
ion-0.018
 safer-0.017
 wheelchair-0.017
 hospital-0.017
Token made
Token ID925
Feature activation-5.50e-01

Pos logit contributions
 size+0.021
 next+0.021
 three+0.021
el+0.020
ad+0.020
Neg logit contributions
 stret-0.038
ches-0.032
Charl-0.032
 treadmill-0.031
ion-0.030
Token her
Token ID607
Feature activation-1.82e+00

Pos logit contributions
 stret+0.020
ches+0.017
 safer+0.017
Charl+0.016
ion+0.016
Neg logit contributions
ho-0.018
 size-0.017
flower-0.017
 three-0.017
 nos-0.016
Token special
Token ID2041
Feature activation-7.53e-01

Pos logit contributions
 stret+0.084
Charl+0.075
ches+0.074
 treadmill+0.072
ion+0.071
Neg logit contributions
 next-0.044
 three-0.043
 size-0.043
 only-0.042
 nos-0.041
Token.
Token ID13
Feature activation+3.46e-01

Pos logit contributions
 stret+0.031
ches+0.025
Charl+0.025
 safer+0.025
 wheelchair+0.024
Neg logit contributions
ho-0.021
el-0.021
 three-0.020
ad-0.020
 size-0.020
Token They
Token ID1119
Feature activation-5.36e-02

Pos logit contributions
 next+0.007
 three+0.007
 only+0.006
 thought+0.006
ad+0.006
Neg logit contributions
 stret-0.018
 unnecessary-0.016
ches-0.015
 treadmill-0.015
Charl-0.015
Token became
Token ID2627
Feature activation-2.65e-01

Pos logit contributions
 stret+0.002
ches+0.002
Charl+0.002
 safer+0.002
 treadmill+0.002
Neg logit contributions
 size-0.001
 next-0.001
 three-0.001
el-0.001
 nos-0.001
Token good
Token ID922
Feature activation+3.96e-01

Pos logit contributions
 stret+0.014
ches+0.012
Charl+0.012
 treadmill+0.011
ion+0.011
Neg logit contributions
 three-0.005
 next-0.005
 size-0.005
 nos-0.005
 only-0.005
Token,'
Token ID4032
Feature activation+3.88e-01

Pos logit contributions
 size+0.033
el+0.032
ho+0.032
 three+0.032
flower+0.031
Neg logit contributions
 stret-0.028
ches-0.023
Charl-0.022
ion-0.021
 treadmill-0.021
Token she
Token ID673
Feature activation-3.68e-01

Pos logit contributions
 size+0.015
ho+0.013
flower+0.013
sed+0.013
ra+0.013
Neg logit contributions
 stret-0.011
ion-0.010
ches-0.009
Charl-0.009
 safer-0.009
Token said
Token ID531
Feature activation-4.62e-01

Pos logit contributions
 stret+0.015
ches+0.013
Charl+0.012
 safer+0.012
ion+0.012
Neg logit contributions
 size-0.010
el-0.010
ho-0.010
flower-0.009
umm-0.009
Token,
Token ID11
Feature activation-5.82e-02

Pos logit contributions
 stret+0.017
Charl+0.014
ches+0.014
ion+0.013
 safer+0.013
Neg logit contributions
 size-0.015
 colors-0.015
el-0.014
flower-0.014
ho-0.014
Token giving
Token ID3501
Feature activation-2.87e-02

Pos logit contributions
 stret+0.003
Charl+0.002
ches+0.002
 treadmill+0.002
ion+0.002
Neg logit contributions
el-0.002
 size-0.002
 colors-0.002
ad-0.001
 markings-0.001
Token her
Token ID607
Feature activation-1.60e+00

Pos logit contributions
 stret+0.001
ches+0.001
ion+0.001
Charl+0.001
 safer+0.001
Neg logit contributions
 size-0.001
ho-0.001
flower-0.001
 nos-0.001
 markings-0.001
Token daughter
Token ID4957
Feature activation-3.27e-01

Pos logit contributions
 stret+0.077
ches+0.067
Charl+0.066
 treadmill+0.065
 wheelchair+0.062
Neg logit contributions
 three-0.036
 next-0.035
 size-0.034
 only-0.033
 nos-0.033
Token a
Token ID257
Feature activation+3.60e-01

Pos logit contributions
 stret+0.015
ches+0.013
Charl+0.013
 treadmill+0.012
 safer+0.012
Neg logit contributions
 size-0.008
 next-0.008
 three-0.008
el-0.007
ho-0.007
Token hug
Token ID16225
Feature activation+1.22e-01

Pos logit contributions
 three+0.009
 nos+0.008
 next+0.008
 3+0.008
 size+0.008
Neg logit contributions
 stret-0.016
ches-0.015
Charl-0.014
 treadmill-0.014
ion-0.014
Token.
Token ID13
Feature activation+1.31e-01

Pos logit contributions
 size+0.004
 nos+0.004
 colors+0.004
el+0.004
ad+0.004
Neg logit contributions
 stret-0.004
ches-0.004
Charl-0.004
ion-0.004
 safer-0.003
Token '
Token ID705
Feature activation+8.80e-01

Pos logit contributions
 three+0.004
 next+0.004
 size+0.003
 only+0.003
el+0.003
Neg logit contributions
 stret-0.006
ches-0.005
Charl-0.005
 treadmill-0.005
 unnecessary-0.004
Token his
Token ID465
Feature activation-6.38e-01

Pos logit contributions
 stret+0.012
Charl+0.009
ches+0.009
 treadmill+0.009
 safer+0.009
Neg logit contributions
 size-0.008
ho-0.008
 next-0.008
el-0.008
flower-0.008
Token house
Token ID2156
Feature activation-4.54e-01

Pos logit contributions
 stret+0.030
ches+0.027
Charl+0.026
 treadmill+0.026
ion+0.026
Neg logit contributions
 next-0.014
 size-0.014
 three-0.014
 nos-0.013
 only-0.013
Token and
Token ID290
Feature activation-3.85e-01

Pos logit contributions
 stret+0.017
 clinic+0.013
ches+0.013
 safer+0.013
 wheelchair+0.013
Neg logit contributions
 markings-0.014
el-0.014
ra-0.014
 nos-0.014
ad-0.014
Token saw
Token ID2497
Feature activation-3.19e-01

Pos logit contributions
 stret+0.018
ches+0.016
Charl+0.016
 treadmill+0.016
 safer+0.015
Neg logit contributions
 size-0.008
el-0.008
ad-0.008
 next-0.008
 three-0.007
Token how
Token ID703
Feature activation-6.94e-02

Pos logit contributions
 stret+0.016
ches+0.014
Charl+0.014
 treadmill+0.014
 wheelchair+0.014
Neg logit contributions
 three-0.007
 only-0.007
 looking-0.006
 next-0.006
 two-0.006
Token humble
Token ID19695
Feature activation-1.83e+00

Pos logit contributions
 stret+0.002
 safer+0.002
ches+0.002
 treadmill+0.002
 bases+0.002
Neg logit contributions
 size-0.002
flower-0.002
ra-0.002
ad-0.002
el-0.002
Token the
Token ID262
Feature activation-1.26e+00

Pos logit contributions
 stret+0.075
ches+0.064
Charl+0.063
 treadmill+0.062
ion+0.061
Neg logit contributions
 size-0.050
 three-0.048
 next-0.048
ho-0.048
el-0.047
Token man
Token ID582
Feature activation-7.03e-01

Pos logit contributions
 stret+0.047
 treadmill+0.037
 hospital+0.036
 safer+0.036
 wheelchair+0.034
Neg logit contributions
el-0.050
ra-0.048
ho-0.047
ad-0.046
ped-0.044
Token's
Token ID338
Feature activation-8.25e-01

Pos logit contributions
 stret+0.025
ches+0.025
 safer+0.022
ion+0.021
 else+0.021
Neg logit contributions
 size-0.021
ho-0.020
ra-0.019
ad-0.019
 three-0.019
Token house
Token ID2156
Feature activation-8.23e-01

Pos logit contributions
 stret+0.030
 treadmill+0.027
ches+0.023
 bases+0.023
 proper+0.023
Neg logit contributions
ra-0.026
ad-0.024
ho-0.024
el-0.023
 three-0.023
Token was
Token ID373
Feature activation-6.15e-01

Pos logit contributions
 stret+0.031
ches+0.028
 treadmill+0.026
 safer+0.026
ion+0.026
Neg logit contributions
 three-0.023
ho-0.023
 size-0.023
 nos-0.022
ra-0.022