TOP ACTIVATIONS
MAX = 6.781

to wear . It was a flexible skirt that moved when
very much . It was a cute , intelligent t eddy
very much . It was a small , pretty ball er
very much . It was a fragile doll that she always
nice day . It was a fine day , so the
loved using . It was a lamp with a pink top
. Her grandma made her a y ummy cake . Lily
very much . It was a t eddy bear with soft
. It made the sky a deep orange color and the
very much . It was a big , pink t eddy
a lot . It was a red car that went fast
. <|endoftext|> Daniel was having a great day . He wanted
to wear . It was a pretty dress with flowers on
very much . It was a pretty hat with flowers on
! <|endoftext|> Billy was having a lot of fun . He
unique pet . It was a large , green lizard .
she went . It was a big , red hat that
. <|endoftext|> Max was having a great day . He had
Anna . She made Anna a house with sticks and leaves
done . <|endoftext|> Alice made a wish on a star .

MIN ACTIVATIONS
MIN = -9.488

that he wanted . <|endoftext|> Ak io was a 3 year
by and saw how sad Z ippy was . The girl
behind a big rock . Z ippy was so happy and
often ". Co pper thanked the old bird
asked what was wrong and Z ippy told her about her
help the cut heal . Cand ace was so proud of
, a gentle girl named Ci ara stumbled upon a lovely
children didn 't know what Mam ma meant but they ran
From that day on , Z ippy made sure to keep
The tree was gone ! Z eb was happy because his
Daddy nodded and told Tr icia that it was okay
around . <|endoftext|> One day Al ina wanted to explore ,
stories said . Mama helped Ci ara and together they read
ma was very proud of L ila . She said ,
, something bad happened . Z ee split in two .
so much fun . K ait lyn was so happy to
than the park . K ait lyn ran over to the
, lights and screens . Z og showed them how to
, the nurse said that Kay la 's knee wouldn 't
find it . She thanks L ila and Sam for helping

INTERVAL 2.713 - 6.781
CONTAINS 7.755%

more natural . And you should not take Lily 's cr
course . He was wearing a red pl aid shirt and
grabbed a branch and started to wipe it against the bee
balloon . She hoped he would like it .
! They played with the toys all day and shared them

INTERVAL -1.354 - 2.713
CONTAINS 68.359%

shine . All the creatures stared in awe at the magic
They helped her put her bathing suit back on and they
said , " Me too , Tim my ." <|endoftext|> Once
can help you ." L ily learned a
in the park with their ball . They liked to kick

INTERVAL -5.421 - -1.354
CONTAINS 23.462%

to unite . At the airport , they saw
the figure again . This time , she got brave and
web in the corner of the garden . He was very
save the lizard , but they were too slow . The
L ily and her babys itter went to

INTERVAL -9.488 - -5.421
CONTAINS 0.424%

mom my said , " L ily , I told you
The wild wild er be ast had to think for a
sweet strawberries !" S aman tha and Dan thought
. Suddenly , M illy felt someone grab her
shiny again ." L ily was happy to help
Token to
Token ID284
Feature activation+3.01e+00

Pos logit contributions
ie+0.003
ople+0.003
les+0.003
pa+0.003
ing+0.003
Neg logit contributions
 sturdy-0.002
 boun-0.001
 flat-0.001
 cos-0.001
 exc-0.001
Token wear
Token ID5806
Feature activation+8.16e-01

Pos logit contributions
 sturdy+0.054
 flat+0.048
jar+0.048
 stump+0.047
 stret+0.047
Neg logit contributions
ie-0.085
les-0.082
pa-0.074
ipped-0.074
by-0.072
Token.
Token ID13
Feature activation+1.36e+00

Pos logit contributions
 sturdy+0.018
onna+0.017
 exc+0.017
 boun+0.016
 cos+0.016
Neg logit contributions
les-0.020
ie-0.019
ed-0.018
led-0.018
ing-0.018
Token It
Token ID632
Feature activation+4.76e-01

Pos logit contributions
 sturdy+0.026
 flat+0.024
 boun+0.024
 exc+0.023
 sil+0.023
Neg logit contributions
ie-0.035
les-0.034
ople-0.034
pa-0.033
father-0.032
Token was
Token ID373
Feature activation+3.93e+00

Pos logit contributions
 boun+0.012
 sturdy+0.012
 flat+0.012
 cos+0.012
 cozy+0.011
Neg logit contributions
ople-0.009
les-0.009
ie-0.009
pa-0.009
father-0.009
Token a
Token ID257
Feature activation+6.78e+00

Pos logit contributions
 sturdy+0.070
 boun+0.066
 flat+0.066
onna+0.064
 exc+0.064
Neg logit contributions
ie-0.102
les-0.096
pa-0.096
ople-0.093
father-0.093
Token flexible
Token ID12846
Feature activation+2.25e+00

Pos logit contributions
 opener+0.133
 distract+0.132
 bleed+0.129
onna+0.127
resses+0.125
Neg logit contributions
by-0.141
 End-0.140
ie-0.138
ma-0.126
ing-0.116
Token skirt
Token ID23967
Feature activation+4.86e+00

Pos logit contributions
 sturdy+0.044
 cos+0.041
 flat+0.041
 exc+0.041
 boun+0.040
Neg logit contributions
ie-0.057
les-0.056
ople-0.053
pa-0.052
by-0.052
Token that
Token ID326
Feature activation-4.60e-01

Pos logit contributions
 sturdy+0.118
 stret+0.114
 cos+0.113
 sil+0.113
onna+0.112
Neg logit contributions
ie-0.098
ing-0.096
les-0.094
ed-0.093
ople-0.087
Token moved
Token ID3888
Feature activation-1.63e-02

Pos logit contributions
ople+0.013
les+0.013
pa+0.012
ie+0.012
father+0.012
Neg logit contributions
 boun-0.008
 sturdy-0.008
 flat-0.008
 cos-0.008
 cozy-0.008
Token when
Token ID618
Feature activation-1.21e+00

Pos logit contributions
ople+0.000
pa+0.000
father+0.000
les+0.000
ipped+0.000
Neg logit contributions
 boun-0.000
 sturdy-0.000
 flat-0.000
 safe-0.000
 cozy-0.000
Token very
Token ID845
Feature activation+1.97e+00

Pos logit contributions
 sturdy+0.005
 boun+0.004
 cos+0.004
 flat+0.004
 sil+0.004
Neg logit contributions
ie-0.007
pa-0.007
ople-0.006
ing-0.006
les-0.006
Token much
Token ID881
Feature activation+9.72e-01

Pos logit contributions
 sturdy+0.024
 boun+0.023
 cozy+0.023
 flat+0.023
 cos+0.023
Neg logit contributions
ople-0.065
les-0.064
pa-0.062
ipped-0.062
ie-0.062
Token.
Token ID13
Feature activation+1.45e+00

Pos logit contributions
 sturdy+0.021
 boun+0.020
 flat+0.020
 sil+0.020
 cos+0.020
Neg logit contributions
les-0.022
ie-0.022
ing-0.021
ople-0.021
pa-0.021
Token It
Token ID632
Feature activation+5.86e-01

Pos logit contributions
 sturdy+0.027
 boun+0.026
 flat+0.025
 exc+0.024
 cos+0.024
Neg logit contributions
ie-0.037
les-0.037
ople-0.037
pa-0.036
father-0.035
Token was
Token ID373
Feature activation+4.26e+00

Pos logit contributions
 sturdy+0.016
 boun+0.015
 flat+0.015
 cos+0.015
 cozy+0.015
Neg logit contributions
ie-0.010
ople-0.010
les-0.010
pa-0.010
father-0.009
Token a
Token ID257
Feature activation+6.74e+00

Pos logit contributions
 sturdy+0.080
 boun+0.077
 flat+0.076
onna+0.075
 leash+0.075
Neg logit contributions
ie-0.105
pa-0.098
father-0.095
les-0.095
el-0.094
Token cute
Token ID13779
Feature activation+1.91e+00

Pos logit contributions
 distract+0.129
onna+0.129
 opener+0.128
resses+0.128
 bleed+0.128
Neg logit contributions
 End-0.138
by-0.136
ie-0.132
ma-0.126
pa-0.114
Token,
Token ID11
Feature activation+3.61e+00

Pos logit contributions
 sturdy+0.039
 flat+0.036
 exc+0.036
onna+0.035
 boun+0.035
Neg logit contributions
ie-0.048
les-0.047
ing-0.044
pa-0.044
by-0.044
Token intelligent
Token ID12661
Feature activation+7.01e-01

Pos logit contributions
onna+0.053
 opener+0.051
 stret+0.048
 distract+0.047
 sturdy+0.047
Neg logit contributions
ie-0.094
les-0.088
led-0.086
pa-0.086
ma-0.084
Token t
Token ID256
Feature activation-8.55e-01

Pos logit contributions
 sturdy+0.015
onna+0.014
 flat+0.014
 exc+0.014
 cos+0.014
Neg logit contributions
ie-0.016
les-0.016
pa-0.016
ople-0.015
ing-0.015
Tokeneddy
Token ID21874
Feature activation-8.42e-01

Pos logit contributions
ople+0.020
les+0.019
pa+0.018
 End+0.018
ipped+0.018
Neg logit contributions
 boun-0.018
 sturdy-0.017
 flat-0.017
 cos-0.017
onna-0.017
Token very
Token ID845
Feature activation+2.07e+00

Pos logit contributions
 sturdy+0.001
 boun+0.001
 cos+0.001
 flat+0.001
 sil+0.001
Neg logit contributions
ie-0.002
pa-0.002
les-0.002
ople-0.002
father-0.002
Token much
Token ID881
Feature activation+8.42e-01

Pos logit contributions
 sturdy+0.025
 boun+0.025
 flat+0.024
 cozy+0.024
 exc+0.024
Neg logit contributions
ople-0.069
les-0.067
pa-0.066
ipped-0.066
ie-0.065
Token.
Token ID13
Feature activation+1.52e+00

Pos logit contributions
 sturdy+0.018
 boun+0.017
 sil+0.017
 flat+0.017
 cos+0.017
Neg logit contributions
les-0.020
ie-0.019
ing-0.019
ople-0.019
pa-0.018
Token It
Token ID632
Feature activation+5.93e-01

Pos logit contributions
 sturdy+0.027
 boun+0.025
 flat+0.025
 exc+0.024
 cos+0.024
Neg logit contributions
ie-0.041
ople-0.041
les-0.041
pa-0.040
father-0.039
Token was
Token ID373
Feature activation+4.08e+00

Pos logit contributions
 sturdy+0.014
 boun+0.014
 flat+0.014
 cos+0.014
 cozy+0.014
Neg logit contributions
ople-0.012
les-0.012
ie-0.012
pa-0.012
father-0.011
Token a
Token ID257
Feature activation+6.74e+00

Pos logit contributions
 sturdy+0.074
 boun+0.072
 flat+0.069
 exc+0.069
onna+0.069
Neg logit contributions
ie-0.103
pa-0.096
les-0.094
father-0.094
el-0.093
Token small
Token ID1402
Feature activation+3.75e+00

Pos logit contributions
 distract+0.128
resses+0.128
 opener+0.127
onna+0.125
 bleed+0.121
Neg logit contributions
 End-0.138
by-0.137
ie-0.135
ma-0.126
mother-0.115
Token,
Token ID11
Feature activation+3.69e+00

Pos logit contributions
 sturdy+0.076
onna+0.072
 exc+0.071
 consider+0.069
 sil+0.069
Neg logit contributions
ie-0.089
les-0.083
by-0.082
pa-0.082
el-0.081
Token pretty
Token ID2495
Feature activation+2.02e+00

Pos logit contributions
onna+0.057
 opener+0.054
 stret+0.052
 distract+0.050
 exc+0.050
Neg logit contributions
ie-0.095
les-0.089
el-0.086
ing-0.086
led-0.086
Token ball
Token ID2613
Feature activation+3.91e+00

Pos logit contributions
 sturdy+0.040
 exc+0.038
onna+0.037
 cos+0.036
 sil+0.035
Neg logit contributions
ie-0.052
les-0.051
by-0.047
ing-0.047
pa-0.046
Tokener
Token ID263
Feature activation+1.71e+00

Pos logit contributions
 sturdy+0.100
 sil+0.095
 cos+0.093
 stret+0.093
 exc+0.093
Neg logit contributions
ie-0.080
ing-0.075
les-0.074
ed-0.068
ople-0.068
Token very
Token ID845
Feature activation+1.99e+00

Pos logit contributions
 sturdy+0.005
 boun+0.004
 cos+0.004
 flat+0.004
 sil+0.004
Neg logit contributions
ie-0.007
pa-0.007
ople-0.007
ing-0.007
father-0.007
Token much
Token ID881
Feature activation+9.98e-01

Pos logit contributions
 sturdy+0.024
 boun+0.023
 cozy+0.023
 flat+0.023
 cos+0.023
Neg logit contributions
ople-0.067
les-0.065
ipped-0.063
pa-0.063
ie-0.063
Token.
Token ID13
Feature activation+1.50e+00

Pos logit contributions
 sturdy+0.022
 boun+0.021
 flat+0.021
 sil+0.021
 cos+0.021
Neg logit contributions
les-0.023
ie-0.022
ing-0.022
ople-0.021
pa-0.021
Token It
Token ID632
Feature activation+6.41e-01

Pos logit contributions
 sturdy+0.029
 boun+0.028
 flat+0.027
 exc+0.026
 sil+0.026
Neg logit contributions
ie-0.037
ople-0.036
les-0.036
pa-0.036
father-0.035
Token was
Token ID373
Feature activation+4.20e+00

Pos logit contributions
 sturdy+0.017
 boun+0.017
 flat+0.016
 cos+0.016
 cozy+0.016
Neg logit contributions
ople-0.012
ie-0.011
les-0.011
pa-0.011
father-0.011
Token a
Token ID257
Feature activation+6.72e+00

Pos logit contributions
 sturdy+0.080
 boun+0.077
 flat+0.075
onna+0.075
 leash+0.074
Neg logit contributions
ie-0.103
pa-0.096
father-0.093
les-0.093
el-0.092
Token fragile
Token ID21049
Feature activation+2.09e+00

Pos logit contributions
 distract+0.127
 opener+0.127
onna+0.127
 bleed+0.127
resses+0.126
Neg logit contributions
by-0.138
 End-0.137
ie-0.133
ma-0.127
pa-0.117
Token doll
Token ID3654
Feature activation+3.15e+00

Pos logit contributions
 sturdy+0.046
 exc+0.043
onna+0.043
 cos+0.042
 sil+0.042
Neg logit contributions
ie-0.048
les-0.046
ing-0.044
by-0.043
el-0.042
Token that
Token ID326
Feature activation+8.05e-03

Pos logit contributions
 sturdy+0.070
 sil+0.067
 boun+0.066
onna+0.066
 flat+0.066
Neg logit contributions
ie-0.073
ing-0.068
ople-0.067
pa-0.066
les-0.066
Token she
Token ID673
Feature activation-1.08e+00

Pos logit contributions
 sturdy+0.000
 boun+0.000
 flat+0.000
 cos+0.000
 cozy+0.000
Neg logit contributions
ople-0.000
les-0.000
ie-0.000
pa-0.000
father-0.000
Token always
Token ID1464
Feature activation+9.81e-01

Pos logit contributions
ople+0.027
les+0.027
ie+0.027
pa+0.026
father+0.026
Neg logit contributions
 sturdy-0.021
 boun-0.020
 flat-0.020
 cos-0.020
 cozy-0.019
Token nice
Token ID3621
Feature activation-4.66e-01

Pos logit contributions
onna+0.017
 sturdy+0.016
 flat+0.016
 opener+0.015
 sew+0.015
Neg logit contributions
ie-0.017
pa-0.017
les-0.017
ing-0.016
ople-0.016
Token day
Token ID1110
Feature activation+3.86e-01

Pos logit contributions
ie+0.008
les+0.007
pa+0.007
ople+0.007
ing+0.007
Neg logit contributions
 sturdy-0.013
 flat-0.013
 boun-0.013
onna-0.013
 exc-0.013
Token.
Token ID13
Feature activation+1.85e+00

Pos logit contributions
 sturdy+0.007
 boun+0.007
 flat+0.007
 sil+0.006
 exc+0.006
Neg logit contributions
ie-0.010
ople-0.010
les-0.010
pa-0.010
ing-0.010
Token It
Token ID632
Feature activation-3.35e-01

Pos logit contributions
 sturdy+0.037
 boun+0.036
 sil+0.035
 flat+0.035
 exc+0.034
Neg logit contributions
ople-0.044
pa-0.043
les-0.043
ie-0.043
father-0.041
Token was
Token ID373
Feature activation+4.00e+00

Pos logit contributions
ie+0.005
les+0.005
ople+0.005
pa+0.005
ed+0.005
Neg logit contributions
 sturdy-0.010
 boun-0.010
 flat-0.009
onna-0.009
 cos-0.009
Token a
Token ID257
Feature activation+6.71e+00

Pos logit contributions
 sturdy+0.076
onna+0.075
 sew+0.075
 boun+0.073
 stret+0.072
Neg logit contributions
ie-0.094
pa-0.089
 End-0.086
el-0.085
ma-0.084
Token fine
Token ID3734
Feature activation+1.81e+00

Pos logit contributions
 opener+0.123
onna+0.121
 sew+0.121
resses+0.120
 bleed+0.120
Neg logit contributions
by-0.151
 End-0.150
ie-0.143
ma-0.139
cy-0.129
Token day
Token ID1110
Feature activation+1.06e+00

Pos logit contributions
 sturdy+0.057
 flat+0.055
onna+0.055
 boun+0.054
 sew+0.054
Neg logit contributions
ie-0.024
pa-0.022
les-0.021
ing-0.020
by-0.020
Token,
Token ID11
Feature activation+2.63e+00

Pos logit contributions
 sturdy+0.023
 boun+0.023
 flat+0.022
 exc+0.022
onna+0.022
Neg logit contributions
ie-0.023
ople-0.023
pa-0.023
ing-0.023
les-0.023
Token so
Token ID523
Feature activation+1.38e+00

Pos logit contributions
 sturdy+0.036
 boun+0.032
 sil+0.032
 cos+0.032
 flat+0.032
Neg logit contributions
ie-0.080
ople-0.078
les-0.078
pa-0.077
ing-0.077
Token the
Token ID262
Feature activation-2.84e+00

Pos logit contributions
 sturdy+0.024
 cozy+0.024
 boun+0.024
 cos+0.024
 flat+0.023
Neg logit contributions
ople-0.040
les-0.039
ipped-0.037
father-0.037
ie-0.037
Token loved
Token ID6151
Feature activation+2.09e-01

Pos logit contributions
ople+0.021
les+0.021
ie+0.021
pa+0.021
father+0.020
Neg logit contributions
 sturdy-0.016
 boun-0.016
 flat-0.016
 cos-0.016
 cozy-0.015
Token using
Token ID1262
Feature activation-6.33e-01

Pos logit contributions
 sturdy+0.004
 boun+0.003
 cos+0.003
 flat+0.003
 sil+0.003
Neg logit contributions
ie-0.006
pa-0.006
ople-0.006
les-0.006
ing-0.005
Token.
Token ID13
Feature activation+1.04e+00

Pos logit contributions
les+0.013
ie+0.013
pa+0.012
ople+0.012
ing+0.012
Neg logit contributions
 sturdy-0.015
 boun-0.015
onna-0.015
 exc-0.014
 cos-0.014
Token It
Token ID632
Feature activation+3.36e-01

Pos logit contributions
 sturdy+0.020
 boun+0.019
 flat+0.019
 exc+0.018
 sil+0.018
Neg logit contributions
ie-0.026
les-0.026
ople-0.026
pa-0.025
father-0.025
Token was
Token ID373
Feature activation+4.00e+00

Pos logit contributions
 sturdy+0.009
 boun+0.008
 flat+0.008
 cos+0.008
 cozy+0.008
Neg logit contributions
ople-0.006
les-0.006
ie-0.006
pa-0.006
father-0.006
Token a
Token ID257
Feature activation+6.71e+00

Pos logit contributions
 sturdy+0.073
 boun+0.072
 flat+0.070
 exc+0.069
 cos+0.068
Neg logit contributions
ie-0.101
pa-0.096
les-0.095
ople-0.094
father-0.094
Token lamp
Token ID20450
Feature activation+3.36e+00

Pos logit contributions
 bleed+0.126
onna+0.126
 opener+0.126
 distract+0.124
resses+0.122
Neg logit contributions
by-0.142
 End-0.138
ie-0.132
ma-0.128
pa-0.117
Token with
Token ID351
Feature activation-3.16e-01

Pos logit contributions
 sturdy+0.068
 boun+0.067
 flat+0.065
 cos+0.064
 exc+0.064
Neg logit contributions
ie-0.081
ople-0.080
les-0.079
pa-0.077
ing-0.077
Token a
Token ID257
Feature activation+5.19e+00

Pos logit contributions
ople+0.010
ie+0.010
les+0.010
pa+0.010
ipped+0.010
Neg logit contributions
 sturdy-0.005
 boun-0.004
 cozy-0.004
 flat-0.004
 sil-0.004
Token pink
Token ID11398
Feature activation+2.07e+00

Pos logit contributions
onna+0.099
 opener+0.098
 distract+0.097
resses+0.092
 consider+0.091
Neg logit contributions
les-0.104
el-0.104
ing-0.099
 End-0.099
by-0.094
Token top
Token ID1353
Feature activation+3.07e+00

Pos logit contributions
 sturdy+0.041
 exc+0.039
 consider+0.038
 boun+0.037
 cos+0.037
Neg logit contributions
les-0.053
ie-0.051
el-0.051
ing-0.049
ed-0.048
Token.
Token ID13
Feature activation+2.08e+00

Pos logit contributions
 sturdy+0.063
 boun+0.061
 sil+0.060
 flat+0.057
 exc+0.057
Neg logit contributions
ie-0.075
pa-0.071
ing-0.070
ma-0.069
ed-0.068
Token Her
Token ID2332
Feature activation-9.73e-01

Pos logit contributions
 sturdy+0.044
 boun+0.042
 flat+0.040
 exc+0.040
 sil+0.039
Neg logit contributions
ople-0.047
les-0.046
ie-0.046
ed-0.045
pa-0.045
Token grandma
Token ID49890
Feature activation-1.38e+00

Pos logit contributions
ople+0.022
les+0.022
ie+0.022
pa+0.021
father+0.021
Neg logit contributions
 sturdy-0.021
 boun-0.021
 flat-0.020
 cos-0.020
 cozy-0.020
Token made
Token ID925
Feature activation-1.38e+00

Pos logit contributions
ie+0.031
ople+0.030
les+0.030
pa+0.029
father+0.028
Neg logit contributions
 sturdy-0.031
 boun-0.030
 flat-0.030
 cos-0.029
 sil-0.029
Token her
Token ID607
Feature activation+3.13e+00

Pos logit contributions
ople+0.037
les+0.035
father+0.035
pa+0.035
ipped+0.034
Neg logit contributions
 sturdy-0.025
 boun-0.025
 cozy-0.025
 cos-0.024
 flat-0.024
Token a
Token ID257
Feature activation+6.69e+00

Pos logit contributions
 sturdy+0.054
 boun+0.053
onna+0.052
 flat+0.051
 sil+0.050
Neg logit contributions
ie-0.081
pa-0.080
ople-0.080
les-0.079
by-0.077
Token y
Token ID331
Feature activation+2.73e-01

Pos logit contributions
onna+0.126
 opener+0.122
 seek+0.112
 consider+0.112
 sturdy+0.110
Neg logit contributions
by-0.157
ie-0.152
ma-0.144
pa-0.139
les-0.132
Tokenummy
Token ID13513
Feature activation+2.62e+00

Pos logit contributions
 sturdy+0.008
 boun+0.008
 flat+0.008
 cos+0.008
 cozy+0.008
Neg logit contributions
ople-0.004
les-0.004
ie-0.004
pa-0.004
father-0.003
Token cake
Token ID12187
Feature activation+3.60e+00

Pos logit contributions
 sturdy+0.051
 flat+0.047
 boun+0.047
 exc+0.046
 sil+0.046
Neg logit contributions
ie-0.068
les-0.066
pa-0.064
by-0.062
ing-0.062
Token.
Token ID13
Feature activation+1.71e+00

Pos logit contributions
 sturdy+0.083
 sil+0.080
 boun+0.079
 exc+0.078
 cos+0.077
Neg logit contributions
ie-0.084
ing-0.074
ma-0.073
pa-0.073
les-0.071
Token Lily
Token ID20037
Feature activation-6.60e-01

Pos logit contributions
 sturdy+0.043
 exc+0.040
 boun+0.040
 flat+0.040
 sil+0.039
Neg logit contributions
les-0.033
ople-0.032
ie-0.031
ed-0.031
ipped-0.029
Token very
Token ID845
Feature activation+1.72e+00

Pos logit contributions
 sturdy+0.002
 boun+0.002
 cos+0.002
 flat+0.002
 sil+0.002
Neg logit contributions
ie-0.003
pa-0.003
ople-0.003
les-0.003
ing-0.003
Token much
Token ID881
Feature activation+6.93e-01

Pos logit contributions
 sturdy+0.021
 boun+0.020
 flat+0.020
 cozy+0.020
 exc+0.020
Neg logit contributions
ople-0.058
les-0.056
pa-0.055
ipped-0.055
ie-0.054
Token.
Token ID13
Feature activation+1.50e+00

Pos logit contributions
 sturdy+0.015
 boun+0.014
 flat+0.014
 sil+0.014
 cos+0.014
Neg logit contributions
les-0.017
ie-0.016
ing-0.016
ople-0.016
pa-0.016
Token It
Token ID632
Feature activation+5.16e-01

Pos logit contributions
 sturdy+0.028
 boun+0.026
 flat+0.026
 exc+0.025
 sil+0.025
Neg logit contributions
ie-0.039
les-0.039
ople-0.038
pa-0.038
father-0.037
Token was
Token ID373
Feature activation+4.11e+00

Pos logit contributions
 sturdy+0.014
 boun+0.013
 flat+0.013
 cos+0.013
 cozy+0.013
Neg logit contributions
ople-0.009
ie-0.009
les-0.009
pa-0.009
father-0.009
Token a
Token ID257
Feature activation+6.68e+00

Pos logit contributions
 sturdy+0.078
 boun+0.076
 flat+0.074
onna+0.073
 leash+0.073
Neg logit contributions
ie-0.100
pa-0.094
father-0.092
les-0.091
el-0.090
Token t
Token ID256
Feature activation-2.29e-01

Pos logit contributions
 distract+0.127
 bleed+0.127
resses+0.126
onna+0.126
 opener+0.126
Neg logit contributions
 End-0.137
by-0.135
ie-0.131
ma-0.126
pa-0.114
Tokeneddy
Token ID21874
Feature activation-8.61e-01

Pos logit contributions
ople+0.005
les+0.005
pa+0.005
ipped+0.005
father+0.005
Neg logit contributions
 boun-0.005
 sturdy-0.005
 flat-0.005
 cos-0.005
 cozy-0.005
Token bear
Token ID6842
Feature activation+2.24e+00

Pos logit contributions
ople+0.020
les+0.020
ie+0.019
pa+0.019
ipped+0.019
Neg logit contributions
 sturdy-0.018
 boun-0.018
 flat-0.018
 cos-0.017
 cozy-0.017
Token with
Token ID351
Feature activation+4.94e-01

Pos logit contributions
 sturdy+0.051
 sil+0.048
 flat+0.048
 boun+0.048
 exc+0.047
Neg logit contributions
ie-0.051
ing-0.049
les-0.047
pa-0.047
ople-0.046
Token soft
Token ID2705
Feature activation+8.95e-01

Pos logit contributions
 boun+0.007
 cozy+0.007
 sturdy+0.007
 flat+0.007
 sil+0.007
Neg logit contributions
ople-0.016
les-0.015
ie-0.015
pa-0.015
ipped-0.015
Token.
Token ID13
Feature activation+1.10e+00

Pos logit contributions
 sturdy+0.021
 boun+0.021
 flat+0.020
 cos+0.020
 cozy+0.020
Neg logit contributions
ople-0.037
les-0.037
ie-0.036
pa-0.036
father-0.035
Token It
Token ID632
Feature activation-2.08e-01

Pos logit contributions
 sturdy+0.022
 boun+0.022
 seek+0.021
 sew+0.021
 flat+0.021
Neg logit contributions
ople-0.026
les-0.025
pa-0.024
ie-0.024
ing-0.024
Token made
Token ID925
Feature activation-7.48e-01

Pos logit contributions
ople+0.004
les+0.004
ie+0.004
pa+0.004
father+0.003
Neg logit contributions
 sturdy-0.005
 boun-0.005
 flat-0.005
 cos-0.005
 cozy-0.005
Token the
Token ID262
Feature activation+3.88e-01

Pos logit contributions
ople+0.019
father+0.018
les+0.018
ipped+0.018
pa+0.018
Neg logit contributions
 cos-0.015
 sturdy-0.015
 cozy-0.015
 boun-0.014
 sil-0.014
Token sky
Token ID6766
Feature activation+3.59e+00

Pos logit contributions
 sturdy+0.008
 boun+0.007
 flat+0.007
 cos+0.007
 cozy+0.007
Neg logit contributions
ople-0.010
les-0.009
ie-0.009
pa-0.009
father-0.009
Token a
Token ID257
Feature activation+6.66e+00

Pos logit contributions
 boun+0.048
 sturdy+0.048
onna+0.045
 exc+0.044
 flat+0.044
Neg logit contributions
ie-0.109
les-0.106
ing-0.104
ople-0.103
pa-0.102
Token deep
Token ID2769
Feature activation+3.35e+00

Pos logit contributions
 opener+0.125
onna+0.118
 sew+0.112
resses+0.107
 knit+0.105
Neg logit contributions
ie-0.160
 End-0.155
by-0.154
les-0.142
ing-0.140
Token orange
Token ID10912
Feature activation+2.69e+00

Pos logit contributions
 exc+0.068
 sturdy+0.067
onna+0.065
 boun+0.064
 opener+0.064
Neg logit contributions
ie-0.082
les-0.078
ople-0.075
el-0.075
pa-0.074
Token color
Token ID3124
Feature activation+2.69e+00

Pos logit contributions
 sturdy+0.056
 cos+0.052
 exc+0.052
 cozy+0.052
 boun+0.051
Neg logit contributions
ie-0.066
les-0.064
ing-0.063
ople-0.062
pa-0.061
Token and
Token ID290
Feature activation+1.37e+00

Pos logit contributions
 sturdy+0.052
 boun+0.051
 exc+0.050
 cos+0.050
 cozy+0.050
Neg logit contributions
ie-0.067
les-0.066
ing-0.065
ople-0.062
el-0.061
Token the
Token ID262
Feature activation-1.57e+00

Pos logit contributions
 sturdy+0.026
 boun+0.025
 flat+0.025
 cos+0.024
 cozy+0.024
Neg logit contributions
ople-0.035
les-0.034
ie-0.034
pa-0.034
father-0.033
Token very
Token ID845
Feature activation+1.83e+00

Pos logit contributions
 sturdy+0.004
 boun+0.004
 cos+0.004
 flat+0.004
 sil+0.004
Neg logit contributions
ie-0.007
pa-0.007
ople-0.007
les-0.007
ing-0.007
Token much
Token ID881
Feature activation+8.27e-01

Pos logit contributions
 sturdy+0.022
 boun+0.021
 cozy+0.021
 flat+0.021
 exc+0.021
Neg logit contributions
ople-0.062
les-0.060
ipped-0.058
pa-0.058
ie-0.058
Token.
Token ID13
Feature activation+1.49e+00

Pos logit contributions
 sturdy+0.017
 boun+0.017
 flat+0.017
 sil+0.017
 cos+0.016
Neg logit contributions
les-0.019
ie-0.019
ing-0.019
ople-0.018
pa-0.018
Token It
Token ID632
Feature activation+5.14e-01

Pos logit contributions
 sturdy+0.027
 boun+0.026
 flat+0.025
 exc+0.025
 sil+0.024
Neg logit contributions
ie-0.039
les-0.038
ople-0.038
pa-0.037
father-0.036
Token was
Token ID373
Feature activation+4.19e+00

Pos logit contributions
 sturdy+0.013
 boun+0.013
 flat+0.013
 cos+0.013
 cozy+0.013
Neg logit contributions
ople-0.010
les-0.009
ie-0.009
pa-0.009
father-0.009
Token a
Token ID257
Feature activation+6.66e+00

Pos logit contributions
 sturdy+0.077
 boun+0.074
 flat+0.073
onna+0.072
 leash+0.072
Neg logit contributions
ie-0.104
pa-0.098
les-0.095
father-0.095
el-0.094
Token big
Token ID1263
Feature activation+3.99e+00

Pos logit contributions
 opener+0.125
 bleed+0.124
onna+0.124
 distract+0.124
resses+0.123
Neg logit contributions
 End-0.140
by-0.139
ie-0.135
ma-0.128
pa-0.118
Token,
Token ID11
Feature activation+3.84e+00

Pos logit contributions
 sturdy+0.086
onna+0.082
 exc+0.080
 opener+0.078
 cos+0.078
Neg logit contributions
ie-0.091
pa-0.089
les-0.086
ople-0.085
ma-0.083
Token pink
Token ID11398
Feature activation+1.65e+00

Pos logit contributions
onna+0.063
 opener+0.059
 stret+0.056
 distract+0.054
comb+0.054
Neg logit contributions
ie-0.094
les-0.090
led-0.087
ing-0.086
ma-0.086
Token t
Token ID256
Feature activation-5.22e-01

Pos logit contributions
 sturdy+0.035
 cos+0.032
 exc+0.031
 flat+0.031
 sil+0.031
Neg logit contributions
ie-0.041
les-0.040
ople-0.038
ing-0.038
pa-0.038
Tokeneddy
Token ID21874
Feature activation-4.39e-01

Pos logit contributions
ople+0.012
les+0.012
pa+0.011
father+0.011
 End+0.011
Neg logit contributions
 boun-0.011
 flat-0.011
 sturdy-0.011
 cos-0.010
 cozy-0.010
Token a
Token ID257
Feature activation+3.99e+00

Pos logit contributions
 sturdy+0.008
 boun+0.007
 cos+0.007
 sil+0.007
onna+0.007
Neg logit contributions
ie-0.010
les-0.010
pa-0.009
ople-0.009
ing-0.009
Token lot
Token ID1256
Feature activation+5.28e-01

Pos logit contributions
 consider+0.095
 distract+0.090
 sturdy+0.089
 exc+0.088
onna+0.088
Neg logit contributions
by-0.080
les-0.072
ie-0.070
ing-0.066
el-0.059
Token.
Token ID13
Feature activation+1.08e+00

Pos logit contributions
 sturdy+0.011
 boun+0.011
 flat+0.011
 cos+0.011
 cozy+0.011
Neg logit contributions
les-0.012
ople-0.012
ie-0.012
pa-0.012
ing-0.011
Token It
Token ID632
Feature activation+5.63e-01

Pos logit contributions
 sturdy+0.022
 boun+0.021
 sil+0.021
 cos+0.020
 flat+0.020
Neg logit contributions
ople-0.025
pa-0.025
ie-0.025
les-0.024
father-0.024
Token was
Token ID373
Feature activation+4.26e+00

Pos logit contributions
 sturdy+0.016
 flat+0.015
 leash+0.015
 boun+0.015
 cos+0.015
Neg logit contributions
ie-0.009
les-0.008
ople-0.008
pa-0.008
ed-0.008
Token a
Token ID257
Feature activation+6.61e+00

Pos logit contributions
 stret+0.080
 sturdy+0.080
onna+0.079
 sil+0.079
 seek+0.077
Neg logit contributions
ie-0.097
pa-0.092
father-0.089
el-0.088
ied-0.086
Token red
Token ID2266
Feature activation+2.08e+00

Pos logit contributions
 distract+0.124
 bleed+0.124
onna+0.124
 opener+0.122
resses+0.121
Neg logit contributions
by-0.135
 End-0.126
ie-0.120
ma-0.118
ing-0.111
Token car
Token ID1097
Feature activation+3.84e+00

Pos logit contributions
 sturdy+0.045
 cos+0.042
onna+0.041
 sil+0.041
 exc+0.041
Neg logit contributions
ie-0.050
les-0.049
ing-0.048
el-0.047
by-0.046
Token that
Token ID326
Feature activation-1.37e-01

Pos logit contributions
 sturdy+0.082
 sil+0.079
 cos+0.079
onna+0.078
 boun+0.077
Neg logit contributions
ie-0.089
ople-0.087
les-0.086
ing-0.085
pa-0.085
Token went
Token ID1816
Feature activation+9.03e-01

Pos logit contributions
ople+0.004
les+0.004
ie+0.003
pa+0.003
ipped+0.003
Neg logit contributions
 boun-0.003
 sturdy-0.003
 flat-0.003
 cos-0.002
 cozy-0.002
Token fast
Token ID3049
Feature activation+1.56e+00

Pos logit contributions
 sturdy+0.015
 boun+0.014
 cos+0.014
 sil+0.014
 cozy+0.014
Neg logit contributions
les-0.025
ie-0.025
ople-0.025
pa-0.024
ing-0.024
Token.
Token ID13
Feature activation+6.73e-01

Pos logit contributions
 sturdy+0.008
 boun+0.008
 flat+0.008
 cos+0.008
 cozy+0.008
Neg logit contributions
ople-0.015
les-0.015
ie-0.014
pa-0.014
father-0.014
Token<|endoftext|>
Token ID50256
Feature activation-4.62e+00

Pos logit contributions
 boun+0.009
 cozy+0.009
 cos+0.009
 sturdy+0.009
 exc+0.009
Neg logit contributions
les-0.021
ople-0.020
ie-0.020
led-0.020
pa-0.020
TokenDaniel
Token ID19962
Feature activation-4.49e+00

Pos logit contributions
ople+0.131
ie+0.126
atted+0.126
pa+0.124
father+0.123
Neg logit contributions
 boun-0.075
 cozy-0.073
 opener-0.070
 safe-0.070
 wrap-0.070
Token was
Token ID373
Feature activation+4.60e+00

Pos logit contributions
father+0.115
ople+0.115
ipped+0.111
ing+0.110
ays+0.110
Neg logit contributions
 wrap-0.075
 cozy-0.070
 safe-0.070
 sil-0.069
 boun-0.069
Token having
Token ID1719
Feature activation-1.12e-02

Pos logit contributions
 sturdy+0.083
 boun+0.078
onna+0.078
 flat+0.078
 sil+0.076
Neg logit contributions
ie-0.117
ople-0.116
pa-0.115
les-0.114
father-0.112
Token a
Token ID257
Feature activation+6.61e+00

Pos logit contributions
ople+0.000
les+0.000
ie+0.000
pa+0.000
father+0.000
Neg logit contributions
 sturdy-0.000
 boun-0.000
 flat-0.000
 cos-0.000
 cozy-0.000
Token great
Token ID1049
Feature activation+4.55e+00

Pos logit contributions
 sturdy+0.101
onna+0.096
 flat+0.095
 boun+0.094
 sil+0.093
Neg logit contributions
ie-0.185
by-0.182
pa-0.180
les-0.179
ople-0.178
Token day
Token ID1110
Feature activation+2.86e+00

Pos logit contributions
 sturdy+0.127
 boun+0.125
 flat+0.124
 cos+0.122
 cozy+0.122
Neg logit contributions
ople-0.074
ie-0.073
les-0.073
pa-0.071
father-0.068
Token.
Token ID13
Feature activation+1.76e+00

Pos logit contributions
 sturdy+0.068
onna+0.066
 flat+0.063
 exc+0.063
 sil+0.063
Neg logit contributions
ie-0.060
ing-0.060
les-0.059
ople-0.059
pa-0.058
Token He
Token ID679
Feature activation+6.56e-01

Pos logit contributions
 sturdy+0.028
 boun+0.028
 flat+0.027
 cos+0.026
 cozy+0.026
Neg logit contributions
ople-0.050
les-0.050
ie-0.049
pa-0.048
father-0.048
Token wanted
Token ID2227
Feature activation-1.56e+00

Pos logit contributions
 sturdy+0.013
 boun+0.013
 flat+0.013
 cos+0.012
 cozy+0.012
Neg logit contributions
ople-0.016
les-0.016
ie-0.016
pa-0.015
ipped-0.015
Token to
Token ID284
Feature activation+2.99e+00

Pos logit contributions
 sturdy+0.003
 boun+0.002
 flat+0.002
 cos+0.002
 exc+0.002
Neg logit contributions
ie-0.004
pa-0.004
ople-0.004
les-0.004
ing-0.004
Token wear
Token ID5806
Feature activation+6.95e-01

Pos logit contributions
 sturdy+0.057
 hollow+0.051
jar+0.051
 boun+0.051
onna+0.050
Neg logit contributions
ie-0.081
les-0.074
pa-0.072
el-0.070
ipped-0.068
Token.
Token ID13
Feature activation+1.49e+00

Pos logit contributions
 sturdy+0.014
onna+0.014
 exc+0.014
 boun+0.014
 flat+0.013
Neg logit contributions
les-0.017
ie-0.017
ople-0.016
pa-0.016
ing-0.016
Token It
Token ID632
Feature activation+4.16e-01

Pos logit contributions
 sturdy+0.027
 boun+0.025
 flat+0.025
 exc+0.024
 sil+0.024
Neg logit contributions
ie-0.040
les-0.039
ople-0.039
pa-0.038
father-0.037
Token was
Token ID373
Feature activation+4.15e+00

Pos logit contributions
 boun+0.011
 sturdy+0.011
 flat+0.010
 cos+0.010
 cozy+0.010
Neg logit contributions
ople-0.008
les-0.008
ie-0.008
pa-0.008
father-0.007
Token a
Token ID257
Feature activation+6.59e+00

Pos logit contributions
 sturdy+0.074
 boun+0.070
 flat+0.070
onna+0.069
 exc+0.068
Neg logit contributions
ie-0.107
pa-0.101
les-0.100
ople-0.098
father-0.098
Token pretty
Token ID2495
Feature activation+2.41e+00

Pos logit contributions
 opener+0.125
 distract+0.121
onna+0.121
 bleed+0.119
resses+0.114
Neg logit contributions
by-0.145
ie-0.145
 End-0.142
ma-0.137
pa-0.126
Token dress
Token ID6576
Feature activation+3.14e+00

Pos logit contributions
 sturdy+0.039
 exc+0.036
onna+0.036
 flat+0.035
 boun+0.035
Neg logit contributions
ie-0.070
les-0.069
by-0.065
pa-0.064
ing-0.064
Token with
Token ID351
Feature activation-2.36e-01

Pos logit contributions
 sturdy+0.061
 boun+0.059
 flat+0.058
 cos+0.058
 exc+0.057
Neg logit contributions
ie-0.079
les-0.078
ople-0.077
ing-0.075
pa-0.075
Token flowers
Token ID12734
Feature activation+3.11e+00

Pos logit contributions
ople+0.008
pa+0.008
father+0.008
ie+0.007
ipped+0.007
Neg logit contributions
 boun-0.003
 cozy-0.003
 sil-0.003
 sturdy-0.003
 flat-0.003
Token on
Token ID319
Feature activation-3.06e-01

Pos logit contributions
 sturdy+0.081
 exc+0.076
 cos+0.076
 stret+0.075
 sil+0.074
Neg logit contributions
les-0.061
ing-0.061
ie-0.061
ed-0.057
el-0.054
Token very
Token ID845
Feature activation+1.86e+00

Pos logit contributions
 sturdy+0.065
 boun+0.064
onna+0.062
 flat+0.062
 sil+0.061
Neg logit contributions
ie-0.089
les-0.086
ople-0.086
pa-0.085
ing-0.083
Token much
Token ID881
Feature activation+5.89e-01

Pos logit contributions
 sturdy+0.025
 boun+0.025
 flat+0.024
 cos+0.024
 cozy+0.024
Neg logit contributions
ople-0.058
les-0.057
ie-0.056
pa-0.056
father-0.055
Token.
Token ID13
Feature activation+1.54e+00

Pos logit contributions
 sturdy+0.011
 boun+0.011
 flat+0.010
 cos+0.010
 sil+0.010
Neg logit contributions
les-0.015
ie-0.015
ople-0.015
ing-0.015
pa-0.015
Token It
Token ID632
Feature activation+2.87e-01

Pos logit contributions
 sturdy+0.029
 boun+0.027
 flat+0.027
 cos+0.026
 exc+0.026
Neg logit contributions
ie-0.040
les-0.040
ople-0.039
pa-0.039
father-0.037
Token was
Token ID373
Feature activation+3.86e+00

Pos logit contributions
 sturdy+0.008
 boun+0.008
 flat+0.008
 cos+0.007
 cozy+0.007
Neg logit contributions
ie-0.005
ople-0.005
les-0.005
pa-0.005
father-0.005
Token a
Token ID257
Feature activation+6.55e+00

Pos logit contributions
 sturdy+0.065
 boun+0.062
 flat+0.061
 exc+0.060
onna+0.060
Neg logit contributions
ie-0.103
pa-0.098
les-0.097
father-0.095
ople-0.095
Token pretty
Token ID2495
Feature activation+1.97e+00

Pos logit contributions
 distract+0.131
 bleed+0.128
 opener+0.123
resses+0.121
onna+0.120
Neg logit contributions
ie-0.135
by-0.134
ma-0.130
 End-0.129
ing-0.115
Token hat
Token ID6877
Feature activation+3.96e+00

Pos logit contributions
 sturdy+0.033
 exc+0.031
 boun+0.031
 flat+0.030
 cos+0.030
Neg logit contributions
ie-0.056
les-0.055
pa-0.052
by-0.052
ople-0.051
Token with
Token ID351
Feature activation-2.14e-01

Pos logit contributions
 sturdy+0.090
 boun+0.086
 flat+0.084
 sil+0.084
 exc+0.084
Neg logit contributions
ie-0.090
ing-0.086
les-0.086
ople-0.084
pa-0.084
Token flowers
Token ID12734
Feature activation+3.00e+00

Pos logit contributions
ople+0.007
pa+0.007
father+0.007
ipped+0.007
ie+0.007
Neg logit contributions
 boun-0.003
 sil-0.003
 cozy-0.003
 sturdy-0.003
 flat-0.003
Token on
Token ID319
Feature activation-1.84e-01

Pos logit contributions
 sturdy+0.077
 exc+0.074
 cos+0.073
 boun+0.072
 sil+0.071
Neg logit contributions
les-0.060
ie-0.059
ing-0.058
ed-0.055
ople-0.053
Token!
Token ID0
Feature activation+1.30e+00

Pos logit contributions
 sturdy+0.027
 boun+0.025
 flat+0.025
 cos+0.025
 exc+0.025
Neg logit contributions
les-0.036
ie-0.036
ople-0.035
ing-0.035
ed-0.034
Token<|endoftext|>
Token ID50256
Feature activation-4.39e+00

Pos logit contributions
 cozy+0.019
 sturdy+0.019
 cos+0.019
 boun+0.019
 sil+0.019
Neg logit contributions
ople-0.039
les-0.038
ie-0.038
pa-0.037
father-0.037
TokenBilly
Token ID49640
Feature activation-2.39e+00

Pos logit contributions
ople+0.117
ie+0.112
atted+0.111
pa+0.110
father+0.109
Neg logit contributions
 boun-0.079
 cozy-0.078
 wrap-0.076
 cos-0.075
 opener-0.074
Token was
Token ID373
Feature activation+4.20e+00

Pos logit contributions
ople+0.061
les+0.058
father+0.057
ipped+0.057
ing+0.057
Neg logit contributions
 boun-0.045
 cozy-0.044
 cos-0.044
 sturdy-0.043
 sil-0.043
Token having
Token ID1719
Feature activation-2.17e-02

Pos logit contributions
 sturdy+0.076
 boun+0.074
 flat+0.072
 cos+0.071
 cozy+0.070
Neg logit contributions
ople-0.110
les-0.109
ie-0.109
pa-0.107
father-0.104
Token a
Token ID257
Feature activation+6.55e+00

Pos logit contributions
ople+0.001
les+0.001
ie+0.001
pa+0.001
father+0.001
Neg logit contributions
 sturdy-0.000
 boun-0.000
 flat-0.000
 cos-0.000
 cozy-0.000
Token lot
Token ID1256
Feature activation+6.88e-01

Pos logit contributions
 sturdy+0.104
onna+0.099
 flat+0.098
 boun+0.096
 sil+0.095
Neg logit contributions
ie-0.181
by-0.177
pa-0.174
les-0.174
ople-0.173
Token of
Token ID286
Feature activation+9.30e-01

Pos logit contributions
 sturdy+0.018
 boun+0.017
 flat+0.017
 cozy+0.017
 cos+0.017
Neg logit contributions
les-0.013
ie-0.013
ople-0.013
ing-0.012
pa-0.012
Token fun
Token ID1257
Feature activation+1.82e+00

Pos logit contributions
 sturdy+0.011
onna+0.011
 flat+0.010
 boun+0.010
 cos+0.010
Neg logit contributions
pa-0.030
ie-0.029
les-0.029
ople-0.029
father-0.028
Token.
Token ID13
Feature activation+1.64e+00

Pos logit contributions
 sturdy+0.034
 boun+0.032
 flat+0.032
 cos+0.032
 cozy+0.032
Neg logit contributions
ie-0.047
les-0.047
ople-0.047
pa-0.046
ing-0.045
Token He
Token ID679
Feature activation+5.29e-01

Pos logit contributions
 sturdy+0.027
 boun+0.027
 flat+0.026
 cos+0.026
 cozy+0.025
Neg logit contributions
ople-0.045
les-0.044
ie-0.044
pa-0.044
father-0.043
Token unique
Token ID3748
Feature activation+2.91e+00

Pos logit contributions
 sturdy+0.065
 opener+0.061
 sew+0.060
 flat+0.060
 boun+0.060
Neg logit contributions
ie-0.150
les-0.147
by-0.145
pa-0.142
ople-0.142
Token pet
Token ID4273
Feature activation+7.15e-02

Pos logit contributions
 sturdy+0.055
onna+0.050
 flat+0.050
 cos+0.050
 boun+0.050
Neg logit contributions
ie-0.076
by-0.072
les-0.072
pa-0.071
ing-0.067
Token.
Token ID13
Feature activation+1.59e+00

Pos logit contributions
 sturdy+0.001
 boun+0.001
 flat+0.001
 cos+0.001
 cozy+0.001
Neg logit contributions
les-0.002
ie-0.002
ople-0.002
ing-0.002
pa-0.002
Token It
Token ID632
Feature activation+9.43e-01

Pos logit contributions
 sturdy+0.033
 boun+0.031
 flat+0.031
 seek+0.030
 leash+0.030
Neg logit contributions
ie-0.037
ople-0.036
les-0.036
pa-0.036
el-0.035
Token was
Token ID373
Feature activation+4.16e+00

Pos logit contributions
 sturdy+0.026
 boun+0.025
 flat+0.025
 cos+0.025
 cozy+0.024
Neg logit contributions
ie-0.016
les-0.016
ople-0.016
pa-0.015
father-0.015
Token a
Token ID257
Feature activation+6.54e+00

Pos logit contributions
 sturdy+0.069
 boun+0.067
 flat+0.065
 cos+0.064
 exc+0.063
Neg logit contributions
ie-0.113
les-0.109
pa-0.108
ople-0.108
father-0.106
Token large
Token ID1588
Feature activation+3.42e+00

Pos logit contributions
 opener+0.116
 distract+0.114
 sturdy+0.110
onna+0.110
resses+0.109
Neg logit contributions
ie-0.152
by-0.150
 End-0.144
ma-0.139
ing-0.136
Token,
Token ID11
Feature activation+3.30e+00

Pos logit contributions
 sturdy+0.074
 exc+0.068
onna+0.068
 cos+0.067
 opener+0.067
Neg logit contributions
ie-0.081
les-0.075
pa-0.073
ople-0.071
by-0.071
Token green
Token ID4077
Feature activation+1.62e+00

Pos logit contributions
onna+0.052
 opener+0.050
 distract+0.045
 stret+0.045
 exc+0.045
Neg logit contributions
ie-0.086
les-0.083
led-0.080
ing-0.080
el-0.079
Token lizard
Token ID42406
Feature activation+2.20e+00

Pos logit contributions
 sturdy+0.030
 boun+0.028
 cos+0.027
 flat+0.027
 cozy+0.027
Neg logit contributions
ie-0.044
les-0.043
ople-0.042
ing-0.041
pa-0.041
Token.
Token ID13
Feature activation+8.36e-01

Pos logit contributions
 sturdy+0.050
 flat+0.046
 sil+0.045
 cos+0.044
 boun+0.044
Neg logit contributions
ie-0.054
les-0.050
ing-0.049
ed-0.046
by-0.046
Token she
Token ID673
Feature activation-2.18e+00

Pos logit contributions
 sturdy+0.008
 boun+0.008
 flat+0.008
 cos+0.008
onna+0.008
Neg logit contributions
ie-0.011
les-0.011
ing-0.010
ople-0.010
pa-0.010
Token went
Token ID1816
Feature activation+5.82e-01

Pos logit contributions
ople+0.058
ing+0.056
les+0.056
atted+0.056
pa+0.055
Neg logit contributions
 wrap-0.039
 cozy-0.039
 boun-0.039
 exc-0.038
 cos-0.038
Token.
Token ID13
Feature activation+1.87e+00

Pos logit contributions
 sturdy+0.013
onna+0.012
 exc+0.012
 boun+0.012
 sil+0.012
Neg logit contributions
ie-0.014
ing-0.014
les-0.014
pa-0.013
el-0.012
Token It
Token ID632
Feature activation+3.49e-01

Pos logit contributions
 sturdy+0.032
 boun+0.031
 flat+0.030
 cos+0.029
 sil+0.029
Neg logit contributions
ie-0.052
les-0.051
ople-0.051
pa-0.050
father-0.048
Token was
Token ID373
Feature activation+3.77e+00

Pos logit contributions
 sturdy+0.009
 boun+0.009
 flat+0.009
 cos+0.009
 cozy+0.009
Neg logit contributions
ople-0.006
les-0.006
ie-0.006
pa-0.006
father-0.006
Token a
Token ID257
Feature activation+6.53e+00

Pos logit contributions
 sturdy+0.070
 boun+0.068
 flat+0.066
 exc+0.065
 cos+0.065
Neg logit contributions
ie-0.095
pa-0.090
les-0.089
father-0.087
ople-0.087
Token big
Token ID1263
Feature activation+4.04e+00

Pos logit contributions
 opener+0.117
 distract+0.115
resses+0.113
 bleed+0.113
onna+0.113
Neg logit contributions
ie-0.147
by-0.146
 End-0.143
ma-0.141
pa-0.128
Token,
Token ID11
Feature activation+3.52e+00

Pos logit contributions
 sturdy+0.090
onna+0.086
 exc+0.085
 boun+0.084
 flat+0.084
Neg logit contributions
ie-0.089
pa-0.087
les-0.085
ople-0.084
father-0.081
Token red
Token ID2266
Feature activation+1.41e+00

Pos logit contributions
onna+0.052
 opener+0.050
 exc+0.044
 distract+0.044
resses+0.043
Neg logit contributions
ie-0.095
les-0.091
led-0.089
pa-0.087
ing-0.086
Token hat
Token ID6877
Feature activation+3.63e+00

Pos logit contributions
 sturdy+0.023
 boun+0.022
 cos+0.021
 flat+0.021
 cozy+0.021
Neg logit contributions
ie-0.040
les-0.040
ople-0.039
pa-0.039
ipped-0.038
Token that
Token ID326
Feature activation-4.49e-01

Pos logit contributions
 sturdy+0.079
 boun+0.076
 cos+0.075
 flat+0.074
 sil+0.073
Neg logit contributions
ie-0.085
les-0.083
ople-0.082
ing-0.081
pa-0.080
Token.
Token ID13
Feature activation+7.10e-01

Pos logit contributions
ople+0.031
les+0.030
ie+0.030
pa+0.030
father+0.029
Neg logit contributions
 sturdy-0.017
 boun-0.017
 flat-0.016
 cos-0.016
 cozy-0.016
Token<|endoftext|>
Token ID50256
Feature activation-4.61e+00

Pos logit contributions
 sturdy+0.012
 boun+0.012
 flat+0.012
 cos+0.011
 cozy+0.011
Neg logit contributions
ople-0.019
les-0.019
ie-0.019
pa-0.019
father-0.018
TokenMax
Token ID11518
Feature activation-2.10e+00

Pos logit contributions
ople+0.119
ie+0.114
atted+0.112
pa+0.112
father+0.111
Neg logit contributions
 boun-0.088
 cozy-0.084
 opener-0.082
 flat-0.082
 wrap-0.082
Token was
Token ID373
Feature activation+4.08e+00

Pos logit contributions
ople+0.053
les+0.052
ie+0.051
father+0.050
pa+0.050
Neg logit contributions
 boun-0.040
 sturdy-0.040
 flat-0.038
 cos-0.038
 cozy-0.038
Token having
Token ID1719
Feature activation-5.50e-02

Pos logit contributions
 sturdy+0.071
 boun+0.069
 flat+0.067
 cos+0.066
 sil+0.065
Neg logit contributions
ople-0.109
ie-0.108
les-0.108
pa-0.106
father-0.104
Token a
Token ID257
Feature activation+6.53e+00

Pos logit contributions
ople+0.001
les+0.001
ie+0.001
pa+0.001
father+0.001
Neg logit contributions
 sturdy-0.001
 boun-0.001
 flat-0.001
 cos-0.001
 cozy-0.001
Token great
Token ID1049
Feature activation+4.48e+00

Pos logit contributions
 sturdy+0.099
 flat+0.092
onna+0.092
 boun+0.091
 sil+0.090
Neg logit contributions
ie-0.186
by-0.182
pa-0.181
les-0.180
ople-0.179
Token day
Token ID1110
Feature activation+2.68e+00

Pos logit contributions
 sturdy+0.123
 boun+0.122
 flat+0.120
 cos+0.119
 cozy+0.118
Neg logit contributions
ople-0.076
les-0.075
ie-0.074
pa-0.071
father-0.070
Token.
Token ID13
Feature activation+1.51e+00

Pos logit contributions
 sturdy+0.065
onna+0.063
 sil+0.060
 flat+0.060
 exc+0.060
Neg logit contributions
ie-0.055
ing-0.055
ople-0.054
les-0.053
pa-0.053
Token He
Token ID679
Feature activation+4.97e-01

Pos logit contributions
 sturdy+0.029
 boun+0.028
 flat+0.027
 exc+0.027
 sil+0.027
Neg logit contributions
ople-0.038
pa-0.037
les-0.037
ie-0.036
father-0.036
Token had
Token ID550
Feature activation-2.27e-01

Pos logit contributions
 sturdy+0.010
 boun+0.010
 flat+0.010
 cos+0.009
 cozy+0.009
Neg logit contributions
ople-0.012
les-0.012
ie-0.012
pa-0.012
father-0.011
Token Anna
Token ID11735
Feature activation-4.32e-01

Pos logit contributions
ie+0.044
les+0.044
ople+0.044
pa+0.043
ing+0.042
Neg logit contributions
 sturdy-0.038
 boun-0.037
 flat-0.037
 cos-0.036
 cozy-0.035
Token.
Token ID13
Feature activation+7.38e-01

Pos logit contributions
ie+0.011
les+0.011
ople+0.011
pa+0.011
ing+0.011
Neg logit contributions
 sturdy-0.008
 boun-0.008
 flat-0.008
 cos-0.007
 sil-0.007
Token She
Token ID1375
Feature activation-2.69e-01

Pos logit contributions
 sturdy+0.016
 boun+0.015
 flat+0.015
 cos+0.015
 exc+0.015
Neg logit contributions
ople-0.017
les-0.017
ie-0.016
pa-0.016
ipped-0.015
Token made
Token ID925
Feature activation-1.47e+00

Pos logit contributions
ie+0.006
les+0.006
ople+0.006
pa+0.006
ed+0.006
Neg logit contributions
 sturdy-0.006
 flat-0.005
 boun-0.005
 cos-0.005
 cozy-0.005
Token Anna
Token ID11735
Feature activation+2.72e+00

Pos logit contributions
ople+0.045
pa+0.043
father+0.043
ipped+0.042
ie+0.042
Neg logit contributions
 boun-0.022
 sturdy-0.022
 cozy-0.022
 cos-0.021
 flat-0.021
Token a
Token ID257
Feature activation+6.53e+00

Pos logit contributions
 sturdy+0.046
 boun+0.043
onna+0.041
 sil+0.041
 flat+0.041
Neg logit contributions
ie-0.075
les-0.072
ople-0.071
pa-0.070
by-0.069
Token house
Token ID2156
Feature activation+3.97e+00

Pos logit contributions
 opener+0.113
 sturdy+0.106
onna+0.106
resses+0.105
 distract+0.103
Neg logit contributions
by-0.163
ie-0.159
les-0.145
ma-0.140
 End-0.139
Token with
Token ID351
Feature activation+7.19e-02

Pos logit contributions
 sturdy+0.081
 sil+0.077
 boun+0.076
 exc+0.075
onna+0.075
Neg logit contributions
ie-0.094
ing-0.090
les-0.089
ma-0.088
ople-0.087
Token sticks
Token ID16461
Feature activation+3.23e+00

Pos logit contributions
 sturdy+0.001
 boun+0.001
 flat+0.001
 cozy+0.001
 cos+0.001
Neg logit contributions
ople-0.002
les-0.002
ie-0.002
pa-0.002
father-0.002
Token and
Token ID290
Feature activation+2.44e+00

Pos logit contributions
 sturdy+0.073
 cos+0.071
 boun+0.070
 sil+0.068
 cozy+0.068
Neg logit contributions
les-0.073
ie-0.073
ing-0.070
ed-0.068
ople-0.068
Token leaves
Token ID5667
Feature activation+1.54e+00

Pos logit contributions
 sturdy+0.043
 boun+0.042
 flat+0.041
 cos+0.041
 exc+0.040
Neg logit contributions
les-0.063
ople-0.063
ie-0.063
pa-0.061
ed-0.060
Token done
Token ID1760
Feature activation+2.75e-01

Pos logit contributions
 sturdy+0.035
 boun+0.035
 flat+0.033
 sil+0.033
 cos+0.033
Neg logit contributions
ople-0.046
les-0.045
ie-0.045
pa-0.044
ing-0.043
Token.
Token ID13
Feature activation+3.76e-01

Pos logit contributions
 sturdy+0.005
 boun+0.005
 cos+0.005
 flat+0.005
 cozy+0.005
Neg logit contributions
les-0.007
ie-0.007
ople-0.007
pa-0.007
ing-0.007
Token<|endoftext|>
Token ID50256
Feature activation-4.65e+00

Pos logit contributions
 sturdy+0.005
 cos+0.005
 cozy+0.005
 boun+0.005
 flat+0.005
Neg logit contributions
ople-0.011
les-0.011
ie-0.011
led-0.011
ipped-0.011
TokenAlice
Token ID44484
Feature activation-2.46e+00

Pos logit contributions
ople+0.114
ie+0.109
atted+0.108
pa+0.108
father+0.106
Neg logit contributions
 boun-0.094
 cozy-0.091
 opener-0.089
 wrap-0.089
 cos-0.089
Token made
Token ID925
Feature activation-1.02e+00

Pos logit contributions
ople+0.056
les+0.055
ie+0.054
pa+0.053
father+0.052
Neg logit contributions
 sturdy-0.053
 boun-0.053
 flat-0.052
 cos-0.051
 cozy-0.051
Token a
Token ID257
Feature activation+6.51e+00

Pos logit contributions
ople+0.026
les+0.026
ie+0.026
pa+0.025
father+0.025
Neg logit contributions
 sturdy-0.019
 boun-0.018
 flat-0.018
 cos-0.018
 cozy-0.018
Token wish
Token ID4601
Feature activation+1.24e+00

Pos logit contributions
onna+0.108
 opener+0.106
 bleed+0.102
 distract+0.099
 twitch+0.098
Neg logit contributions
by-0.168
ie-0.165
ma-0.154
pa-0.149
 End-0.147
Token on
Token ID319
Feature activation-1.88e+00

Pos logit contributions
 sturdy+0.023
 boun+0.023
 flat+0.022
 cos+0.022
 cozy+0.022
Neg logit contributions
ie-0.032
les-0.032
ople-0.031
pa-0.031
ing-0.030
Token a
Token ID257
Feature activation+3.98e+00

Pos logit contributions
ople+0.044
les+0.043
ie+0.043
pa+0.042
father+0.041
Neg logit contributions
 sturdy-0.040
 boun-0.039
 flat-0.038
 cos-0.038
 cozy-0.038
Token star
Token ID3491
Feature activation+1.48e+00

Pos logit contributions
 sturdy+0.067
 boun+0.064
 flat+0.063
 exc+0.063
onna+0.062
Neg logit contributions
ie-0.107
by-0.102
pa-0.100
les-0.099
ing-0.098
Token.
Token ID13
Feature activation+6.19e-01

Pos logit contributions
 sturdy+0.027
 boun+0.026
 flat+0.025
 cos+0.025
 cozy+0.025
Neg logit contributions
ie-0.039
les-0.039
ople-0.039
pa-0.038
ing-0.037
Token that
Token ID326
Feature activation-1.72e+00

Pos logit contributions
 sturdy+0.040
 cos+0.039
 exc+0.039
 boun+0.039
 cozy+0.039
Neg logit contributions
ie-0.031
les-0.031
ing-0.030
ople-0.030
ed-0.029
Token he
Token ID339
Feature activation-1.61e+00

Pos logit contributions
ople+0.039
les+0.039
ie+0.038
pa+0.037
ipped+0.037
Neg logit contributions
 sturdy-0.038
 boun-0.038
 flat-0.037
 cos-0.036
 cozy-0.036
Token wanted
Token ID2227
Feature activation-1.80e+00

Pos logit contributions
ople+0.040
les+0.039
ie+0.039
father+0.038
pa+0.038
Neg logit contributions
 sturdy-0.031
 boun-0.031
 flat-0.030
 cos-0.030
 cozy-0.030
Token.
Token ID13
Feature activation+9.58e-01

Pos logit contributions
ople+0.053
les+0.052
ie+0.052
pa+0.051
father+0.050
Neg logit contributions
 sturdy-0.027
 boun-0.027
 flat-0.026
 cos-0.026
 cozy-0.025
Token<|endoftext|>
Token ID50256
Feature activation-4.91e+00

Pos logit contributions
 sturdy+0.015
 boun+0.014
 cos+0.014
 cozy+0.014
 flat+0.014
Neg logit contributions
les-0.028
ie-0.028
ople-0.028
pa-0.027
led-0.027
TokenAk
Token ID33901
Feature activation-9.49e+00

Pos logit contributions
ople+0.152
ie+0.146
atted+0.145
pa+0.143
father+0.142
Neg logit contributions
 boun-0.069
 cozy-0.066
 opener-0.066
 wrap-0.064
 safe-0.063
Tokenio
Token ID952
Feature activation-4.37e+00

Pos logit contributions
 End+0.116
ople+0.114
 hearts+0.113
 nights+0.112
iness+0.108
Neg logit contributions
 unt-0.243
 unw-0.242
onna-0.235
 tie-0.231
 tied-0.223
Token was
Token ID373
Feature activation+3.99e+00

Pos logit contributions
ople+0.107
les+0.102
ipped+0.102
 End+0.101
ing+0.099
Neg logit contributions
 sturdy-0.078
onna-0.077
 boun-0.077
 wrap-0.077
 rake-0.077
Token a
Token ID257
Feature activation+5.15e+00

Pos logit contributions
 sturdy+0.070
 boun+0.068
 flat+0.066
 cos+0.065
 sil+0.065
Neg logit contributions
ople-0.106
ie-0.105
les-0.104
pa-0.103
father-0.101
Token 3
Token ID513
Feature activation-1.39e+00

Pos logit contributions
 sturdy+0.096
 boun+0.093
 flat+0.092
onna+0.091
 cos+0.090
Neg logit contributions
ople-0.127
ie-0.127
les-0.126
pa-0.124
by-0.121
Token year
Token ID614
Feature activation+4.91e-01

Pos logit contributions
ays+0.031
ipped+0.030
father+0.029
ople+0.029
les+0.028
Neg logit contributions
 exc-0.030
 boun-0.030
 safe-0.029
 cos-0.029
 sturdy-0.029
Token by
Token ID416
Feature activation-1.45e+00

Pos logit contributions
ople+0.004
les+0.004
ie+0.004
pa+0.004
father+0.004
Neg logit contributions
 boun-0.003
 sturdy-0.003
 flat-0.003
 cozy-0.003
 cos-0.003
Token and
Token ID290
Feature activation+1.05e+00

Pos logit contributions
ople+0.041
les+0.040
ie+0.040
pa+0.039
father+0.039
Neg logit contributions
 sturdy-0.023
 boun-0.023
 flat-0.023
 cozy-0.022
 sil-0.022
Token saw
Token ID2497
Feature activation-3.05e+00

Pos logit contributions
 sturdy+0.020
 boun+0.020
 flat+0.019
 cos+0.019
 cozy+0.019
Neg logit contributions
ople-0.027
les-0.026
ie-0.026
pa-0.026
father-0.025
Token how
Token ID703
Feature activation-9.92e-02

Pos logit contributions
ople+0.092
les+0.089
iness+0.087
ams+0.086
 End+0.086
Neg logit contributions
 wrap-0.044
 tag-0.043
 boun-0.042
 exc-0.041
 rake-0.040
Token sad
Token ID6507
Feature activation-3.97e+00

Pos logit contributions
ople+0.003
father+0.003
les+0.003
pa+0.003
ie+0.003
Neg logit contributions
 boun-0.001
 sturdy-0.001
 cozy-0.001
 flat-0.001
 safe-0.001
Token Z
Token ID1168
Feature activation-9.31e+00

Pos logit contributions
ople+0.123
father+0.121
ipped+0.118
eled+0.117
les+0.117
Neg logit contributions
 boun-0.053
 wrap-0.053
 safe-0.051
 cozy-0.051
 sil-0.050
Tokenippy
Token ID41214
Feature activation-2.91e+00

Pos logit contributions
pa+0.210
ople+0.206
iece+0.206
father+0.203
ams+0.188
Neg logit contributions
onna-0.170
 wrap-0.162
 boun-0.156
 leash-0.153
 tie-0.153
Token was
Token ID373
Feature activation+1.91e+00

Pos logit contributions
ople+0.067
father+0.067
pa+0.066
 End+0.063
eled+0.062
Neg logit contributions
 boun-0.060
 sturdy-0.059
 safe-0.058
 flat-0.057
 cozy-0.057
Token.
Token ID13
Feature activation+1.36e+00

Pos logit contributions
 sturdy+0.042
 boun+0.041
 cos+0.039
 sil+0.039
 flat+0.038
Neg logit contributions
ie-0.044
pa-0.042
les-0.042
ing-0.041
ople-0.041
Token The
Token ID383
Feature activation-2.46e+00

Pos logit contributions
 sturdy+0.020
 boun+0.020
 flat+0.019
 cos+0.019
 cozy+0.018
Neg logit contributions
ople-0.040
les-0.040
ie-0.040
pa-0.039
father-0.038
Token girl
Token ID2576
Feature activation-8.60e-01

Pos logit contributions
ople+0.066
pa+0.064
les+0.063
ie+0.062
father+0.062
Neg logit contributions
 boun-0.047
 cozy-0.046
 sturdy-0.045
 cos-0.045
 wrap-0.045
Token behind
Token ID2157
Feature activation-2.62e+00

Pos logit contributions
ople+0.036
les+0.034
pa+0.034
ie+0.033
father+0.033
Neg logit contributions
 sturdy-0.021
 boun-0.021
 flat-0.021
 safe-0.020
 cos-0.020
Token a
Token ID257
Feature activation+1.91e+00

Pos logit contributions
ople+0.077
les+0.075
atted+0.074
ie+0.074
led+0.073
Neg logit contributions
 wrap-0.040
 safe-0.039
 flat-0.039
 sturdy-0.039
 cozy-0.038
Token big
Token ID1263
Feature activation+1.59e+00

Pos logit contributions
 exc+0.035
 sturdy+0.031
onna+0.031
 cos+0.030
 bleed+0.030
Neg logit contributions
les-0.047
ie-0.046
ing-0.045
by-0.044
ison-0.044
Token rock
Token ID3881
Feature activation+2.43e+00

Pos logit contributions
 exc+0.029
 sturdy+0.028
 cos+0.026
onna+0.026
 boun+0.026
Neg logit contributions
ie-0.042
les-0.041
pa-0.040
ople-0.039
el-0.038
Token.
Token ID13
Feature activation+4.46e-01

Pos logit contributions
 sturdy+0.048
 cos+0.046
 cozy+0.046
 boun+0.046
 sil+0.045
Neg logit contributions
ie-0.063
les-0.061
ople-0.061
ing-0.060
pa-0.058
Token Z
Token ID1168
Feature activation-9.17e+00

Pos logit contributions
 sturdy+0.007
 boun+0.007
 flat+0.007
 cos+0.007
 cozy+0.007
Neg logit contributions
ople-0.012
les-0.012
ie-0.012
pa-0.012
father-0.012
Tokenippy
Token ID41214
Feature activation-2.13e+00

Pos logit contributions
iece+0.221
ople+0.214
pa+0.207
father+0.202
iness+0.197
Neg logit contributions
onna-0.158
 boun-0.150
 brid-0.146
 saddle-0.144
 unt-0.144
Token was
Token ID373
Feature activation+3.24e+00

Pos logit contributions
ople+0.046
les+0.046
ie+0.045
pa+0.045
father+0.044
Neg logit contributions
 sturdy-0.048
 boun-0.047
 flat-0.046
 cos-0.046
 cozy-0.045
Token so
Token ID523
Feature activation+2.93e+00

Pos logit contributions
 sturdy+0.056
 boun+0.055
onna+0.055
 flat+0.054
 opener+0.054
Neg logit contributions
ie-0.080
ople-0.079
ed-0.079
atted-0.077
pa-0.077
Token happy
Token ID3772
Feature activation+2.01e-01

Pos logit contributions
 sturdy+0.040
 boun+0.040
 flat+0.039
 leash+0.038
 rake+0.038
Neg logit contributions
ie-0.085
ople-0.084
ed-0.084
les-0.083
led-0.082
Token and
Token ID290
Feature activation+2.06e+00

Pos logit contributions
 sturdy+0.004
 boun+0.004
 flat+0.004
 cos+0.004
 exc+0.004
Neg logit contributions
ie-0.005
les-0.005
ople-0.005
pa-0.005
ing-0.005
Token often
Token ID1690
Feature activation+1.05e+00

Pos logit contributions
les+0.006
ie+0.006
ople+0.006
pa+0.006
ing+0.006
Neg logit contributions
 boun-0.004
 sturdy-0.004
 flat-0.004
 cos-0.004
 sil-0.004
Token".
Token ID1911
Feature activation-5.33e-01

Pos logit contributions
 sturdy+0.018
 boun+0.018
 cos+0.018
 cozy+0.017
 sil+0.017
Neg logit contributions
les-0.028
ople-0.028
ie-0.028
pa-0.027
ing-0.027
Token
Token ID220
Feature activation-1.61e+00

Pos logit contributions
ople+0.015
les+0.015
ie+0.015
pa+0.014
led+0.014
Neg logit contributions
 sturdy-0.009
 boun-0.009
 flat-0.009
 cos-0.008
 cozy-0.008
Token
Token ID198
Feature activation-9.74e-01

Pos logit contributions
ople+0.044
les+0.043
ie+0.043
pa+0.042
father+0.041
Neg logit contributions
 sturdy-0.028
 boun-0.027
 flat-0.027
 cos-0.026
 cozy-0.026
Token
Token ID198
Feature activation-4.37e-01

Pos logit contributions
ople+0.028
les+0.028
ie+0.027
pa+0.027
father+0.026
Neg logit contributions
 sturdy-0.015
 boun-0.015
 flat-0.015
 cozy-0.014
 cos-0.014
TokenCo
Token ID7222
Feature activation-9.11e+00

Pos logit contributions
ople+0.012
les+0.011
ie+0.011
father+0.011
ing+0.011
Neg logit contributions
 boun-0.008
 sturdy-0.008
 cos-0.007
 flat-0.007
 cozy-0.007
Tokenpper
Token ID2848
Feature activation-1.97e+00

Pos logit contributions
les+0.197
ople+0.196
ie+0.193
atted+0.190
 End+0.188
Neg logit contributions
 sturdy-0.176
 boun-0.170
 unt-0.169
 flat-0.169
 stump-0.168
Token thanked
Token ID26280
Feature activation-2.89e+00

Pos logit contributions
ople+0.039
les+0.039
ie+0.038
pa+0.037
father+0.036
Neg logit contributions
 sturdy-0.048
 boun-0.047
 flat-0.047
 cos-0.046
 cozy-0.046
Token the
Token ID262
Feature activation-2.43e+00

Pos logit contributions
ople+0.083
les+0.082
led+0.080
ie+0.080
ipped+0.079
Neg logit contributions
 sturdy-0.046
 flat-0.044
 boun-0.044
 wrap-0.043
 exc-0.043
Token old
Token ID1468
Feature activation-3.15e+00

Pos logit contributions
ople+0.069
les+0.068
ie+0.068
pa+0.067
father+0.065
Neg logit contributions
 sturdy-0.039
 boun-0.039
 flat-0.038
 cos-0.037
 cozy-0.037
Token bird
Token ID6512
Feature activation+1.57e-01

Pos logit contributions
ie+0.072
pa+0.070
father+0.067
ople+0.066
les+0.066
Neg logit contributions
 sturdy-0.072
 cos-0.070
 boun-0.068
 flat-0.068
 sil-0.067
Token asked
Token ID1965
Feature activation-3.28e+00

Pos logit contributions
ople+0.018
ie+0.018
les+0.018
pa+0.018
father+0.017
Neg logit contributions
 sturdy-0.020
 boun-0.020
 flat-0.019
 cos-0.019
 cozy-0.019
Token what
Token ID644
Feature activation-3.07e+00

Pos logit contributions
ople+0.094
les+0.093
ays+0.091
led+0.091
ipped+0.088
Neg logit contributions
 safe-0.053
 wrap-0.049
 cozy-0.048
 hide-0.048
 tag-0.047
Token was
Token ID373
Feature activation+6.67e-01

Pos logit contributions
ople+0.062
les+0.061
ie+0.060
ipped+0.058
pa+0.057
Neg logit contributions
 boun-0.075
 sturdy-0.075
 flat-0.073
 cos-0.073
 cozy-0.073
Token wrong
Token ID2642
Feature activation-3.65e-01

Pos logit contributions
 sturdy+0.014
 boun+0.014
 flat+0.013
 cos+0.013
 cozy+0.013
Neg logit contributions
ople-0.016
les-0.016
ie-0.016
pa-0.015
father-0.015
Token and
Token ID290
Feature activation-1.86e-01

Pos logit contributions
ople+0.009
les+0.009
ie+0.009
pa+0.009
father+0.009
Neg logit contributions
 sturdy-0.007
 boun-0.007
 flat-0.007
 cos-0.006
 cozy-0.006
Token Z
Token ID1168
Feature activation-9.10e+00

Pos logit contributions
les+0.005
ople+0.005
ipped+0.005
father+0.005
pa+0.005
Neg logit contributions
 sturdy-0.003
 boun-0.003
 cozy-0.003
 safe-0.003
 cos-0.003
Tokenippy
Token ID41214
Feature activation-3.01e+00

Pos logit contributions
iece+0.204
ople+0.193
father+0.182
pa+0.180
 End+0.179
Neg logit contributions
onna-0.173
 boun-0.167
 brid-0.158
 exc-0.157
 wrap-0.157
Token told
Token ID1297
Feature activation-3.02e+00

Pos logit contributions
ople+0.077
les+0.075
father+0.074
pa+0.073
ipped+0.073
Neg logit contributions
 boun-0.053
 sturdy-0.053
 exc-0.051
 cos-0.050
 flat-0.050
Token her
Token ID607
Feature activation-1.82e+00

Pos logit contributions
ople+0.080
les+0.077
led+0.076
ipped+0.075
ays+0.075
Neg logit contributions
 safe-0.056
 wrap-0.055
 sturdy-0.053
 cos-0.053
 boun-0.053
Token about
Token ID546
Feature activation-2.25e+00

Pos logit contributions
ie+0.046
ople+0.045
les+0.045
pa+0.045
ing+0.043
Neg logit contributions
 sturdy-0.035
 boun-0.034
 flat-0.033
 cos-0.033
 sil-0.032
Token her
Token ID607
Feature activation+1.73e-02

Pos logit contributions
ie+0.045
pa+0.044
ople+0.044
les+0.043
father+0.042
Neg logit contributions
 sturdy-0.055
 boun-0.053
 flat-0.052
 cos-0.052
 sil-0.051
Token help
Token ID1037
Feature activation-1.70e+00

Pos logit contributions
 boun+0.005
 sturdy+0.005
 flat+0.005
 sil+0.005
 cos+0.004
Neg logit contributions
ie-0.013
pa-0.012
ople-0.012
les-0.012
father-0.012
Token the
Token ID262
Feature activation-2.81e+00

Pos logit contributions
ople+0.032
les+0.031
ipped+0.030
ie+0.030
atted+0.029
Neg logit contributions
 sturdy-0.043
 flat-0.042
 boun-0.042
 safe-0.042
 cos-0.042
Token cut
Token ID2005
Feature activation-3.10e+00

Pos logit contributions
ople+0.083
les+0.082
ie+0.082
pa+0.080
ing+0.079
Neg logit contributions
 sturdy-0.041
 boun-0.041
 flat-0.039
 cos-0.039
 sil-0.038
Token heal
Token ID12035
Feature activation-2.15e+00

Pos logit contributions
ople+0.081
pa+0.076
father+0.076
ipped+0.075
les+0.075
Neg logit contributions
 sturdy-0.057
 flat-0.056
 safe-0.056
 wrap-0.054
 boun-0.054
Token.
Token ID13
Feature activation+8.67e-01

Pos logit contributions
ople+0.067
ipped+0.065
les+0.064
father+0.063
atted+0.063
Neg logit contributions
 sturdy-0.029
 safe-0.029
 flat-0.028
 cozy-0.028
 cos-0.027
Token Cand
Token ID15518
Feature activation-9.04e+00

Pos logit contributions
 sturdy+0.017
 boun+0.017
 flat+0.017
 cos+0.016
 sil+0.016
Neg logit contributions
ople-0.021
les-0.021
ie-0.020
pa-0.020
ipped-0.020
Tokenace
Token ID558
Feature activation-2.39e+00

Pos logit contributions
father+0.110
pa+0.105
iece+0.094
parents+0.089
mother+0.084
Neg logit contributions
 exc-0.251
 boun-0.243
 sew-0.243
 wrap-0.242
 unt-0.240
Token was
Token ID373
Feature activation+2.84e+00

Pos logit contributions
ople+0.050
les+0.048
ie+0.048
pa+0.047
father+0.047
Neg logit contributions
 sturdy-0.055
 boun-0.055
 cos-0.054
 flat-0.053
 cozy-0.053
Token so
Token ID523
Feature activation+2.60e+00

Pos logit contributions
 boun+0.047
 rake+0.046
 sturdy+0.046
onna+0.046
 opener+0.045
Neg logit contributions
ie-0.073
ople-0.072
ed-0.072
led-0.071
atted-0.071
Token proud
Token ID6613
Feature activation+4.29e-01

Pos logit contributions
 boun+0.036
 sturdy+0.035
 flat+0.034
onna+0.034
 rake+0.033
Neg logit contributions
ie-0.076
led-0.075
les-0.075
ed-0.074
ing-0.074
Token of
Token ID286
Feature activation-2.25e+00

Pos logit contributions
 sturdy+0.010
 boun+0.010
 flat+0.010
 cos+0.009
 cozy+0.009
Neg logit contributions
les-0.009
ie-0.009
ople-0.009
pa-0.009
ing-0.009
Token,
Token ID11
Feature activation+8.86e-02

Pos logit contributions
les+0.006
ople+0.005
father+0.005
pa+0.005
eled+0.005
Neg logit contributions
 boun-0.003
 exc-0.003
 cozy-0.002
 flat-0.002
 safe-0.002
Token a
Token ID257
Feature activation+1.52e+00

Pos logit contributions
 boun+0.001
 flat+0.001
 cozy+0.001
 safe+0.001
 exc+0.001
Neg logit contributions
ople-0.003
les-0.003
pa-0.003
father-0.003
parents-0.003
Token gentle
Token ID10296
Feature activation+2.83e-01

Pos logit contributions
 boun+0.025
 sturdy+0.024
 cozy+0.024
 flat+0.023
 cos+0.023
Neg logit contributions
ople-0.045
les-0.043
ie-0.043
pa-0.042
ipped-0.042
Token girl
Token ID2576
Feature activation-6.19e-01

Pos logit contributions
 boun+0.005
 sturdy+0.005
 flat+0.005
 cos+0.005
 cozy+0.005
Neg logit contributions
ople-0.008
les-0.008
ie-0.007
pa-0.007
father-0.007
Token named
Token ID3706
Feature activation-1.84e+00

Pos logit contributions
les+0.022
ople+0.022
led+0.021
ing+0.021
iness+0.021
Neg logit contributions
 wrap-0.005
 exc-0.005
 boun-0.005
 cozy-0.005
 cos-0.005
Token Ci
Token ID37685
Feature activation-8.98e+00

Pos logit contributions
les+0.050
ople+0.050
father+0.049
pa+0.049
ipped+0.048
Neg logit contributions
 boun-0.032
 cozy-0.032
 sturdy-0.032
 flat-0.031
 cos-0.031
Tokenara
Token ID3301
Feature activation-3.80e+00

Pos logit contributions
atted+0.143
ople+0.143
ied+0.129
eled+0.128
ed+0.128
Neg logit contributions
 wrap-0.229
 tie-0.223
 sew-0.216
 knit-0.214
onna-0.212
Token stumbled
Token ID24241
Feature activation-1.01e+00

Pos logit contributions
ople+0.114
ipped+0.111
les+0.109
atted+0.107
ays+0.106
Neg logit contributions
 exc-0.055
 cozy-0.054
 safe-0.054
 boun-0.052
 wrap-0.051
Token upon
Token ID2402
Feature activation-2.08e+00

Pos logit contributions
ople+0.026
pa+0.025
les+0.025
father+0.025
ie+0.024
Neg logit contributions
 boun-0.019
 flat-0.019
 sturdy-0.019
 cozy-0.018
 safe-0.018
Token a
Token ID257
Feature activation+3.29e+00

Pos logit contributions
ople+0.058
les+0.057
ie+0.057
pa+0.056
father+0.055
Neg logit contributions
 sturdy-0.034
 boun-0.034
 flat-0.033
 cos-0.032
 cozy-0.032
Token lovely
Token ID14081
Feature activation+1.94e+00

Pos logit contributions
 sturdy+0.054
 exc+0.053
 opener+0.052
onna+0.051
 boun+0.049
Neg logit contributions
ie-0.089
by-0.088
les-0.085
ing-0.084
pa-0.081
Token children
Token ID1751
Feature activation-7.48e-01

Pos logit contributions
ays+0.088
les+0.087
ople+0.086
ie+0.086
led+0.086
Neg logit contributions
 boun-0.059
 exc-0.056
 cozy-0.056
 sturdy-0.055
 rake-0.055
Token didn
Token ID1422
Feature activation+1.09e+00

Pos logit contributions
ople+0.017
les+0.017
ie+0.016
pa+0.016
father+0.016
Neg logit contributions
 sturdy-0.016
 boun-0.016
 flat-0.015
 cos-0.015
 exc-0.015
Token't
Token ID470
Feature activation+1.81e+00

Pos logit contributions
 sturdy+0.025
 cos+0.024
 boun+0.024
 cozy+0.024
 sil+0.023
Neg logit contributions
ing-0.023
les-0.023
el-0.022
ipped-0.022
pa-0.022
Token know
Token ID760
Feature activation-3.00e+00

Pos logit contributions
 boun+0.029
 sturdy+0.029
 cos+0.028
 cozy+0.028
 sil+0.027
Neg logit contributions
ie-0.052
les-0.051
ople-0.051
pa-0.050
by-0.049
Token what
Token ID644
Feature activation-2.72e+00

Pos logit contributions
ople+0.077
les+0.073
ed+0.071
ipped+0.071
by+0.070
Neg logit contributions
 sturdy-0.060
 safe-0.058
 cozy-0.057
 exc-0.057
 cos-0.057
Token Mam
Token ID29926
Feature activation-8.97e+00

Pos logit contributions
ople+0.069
ie+0.069
les+0.069
pa+0.069
father+0.067
Neg logit contributions
 sturdy-0.050
 boun-0.050
 flat-0.048
 cos-0.048
 cozy-0.047
Tokenma
Token ID2611
Feature activation-4.59e+00

Pos logit contributions
 End+0.103
text+0.102
iece+0.099
ople+0.097
ays+0.094
Neg logit contributions
 rake-0.244
 wrap-0.239
 exc-0.238
 tie-0.237
 fit-0.236
Token meant
Token ID4001
Feature activation+3.03e-01

Pos logit contributions
ople+0.119
les+0.115
ipped+0.114
ie+0.112
ing+0.110
Neg logit contributions
 sturdy-0.081
 boun-0.079
 wrap-0.078
 cos-0.076
 flat-0.076
Token but
Token ID475
Feature activation-8.86e-01

Pos logit contributions
 sturdy+0.007
 boun+0.007
 cos+0.007
 flat+0.007
 cozy+0.006
Neg logit contributions
ople-0.007
ie-0.007
pa-0.007
les-0.006
ing-0.006
Token they
Token ID484
Feature activation-5.63e-01

Pos logit contributions
ople+0.025
les+0.025
ie+0.024
pa+0.024
father+0.024
Neg logit contributions
 sturdy-0.015
 boun-0.014
 flat-0.014
 cos-0.014
 cozy-0.014
Token ran
Token ID4966
Feature activation+3.01e-01

Pos logit contributions
ople+0.013
ie+0.013
les+0.012
pa+0.012
father+0.012
Neg logit contributions
 sturdy-0.012
 boun-0.012
 flat-0.012
 cos-0.012
 cozy-0.012
Token From
Token ID3574
Feature activation-1.64e+00

Pos logit contributions
 boun+0.015
 sturdy+0.015
 cos+0.015
 cozy+0.015
 flat+0.015
Neg logit contributions
les-0.024
ople-0.024
ie-0.024
pa-0.024
father-0.023
Token that
Token ID326
Feature activation-1.32e+00

Pos logit contributions
ople+0.047
les+0.047
ipped+0.044
father+0.044
ie+0.044
Neg logit contributions
 boun-0.027
 sturdy-0.026
 safe-0.026
 cozy-0.025
 flat-0.025
Token day
Token ID1110
Feature activation-9.75e-01

Pos logit contributions
ople+0.028
les+0.027
ipped+0.025
atted+0.025
led+0.024
Neg logit contributions
 boun-0.033
 cozy-0.033
 sturdy-0.033
 safe-0.032
 flat-0.032
Token on
Token ID319
Feature activation-2.78e+00

Pos logit contributions
ople+0.025
les+0.025
ie+0.025
pa+0.024
by+0.023
Neg logit contributions
 safe-0.018
 flat-0.018
 boun-0.018
 sturdy-0.018
 cozy-0.017
Token,
Token ID11
Feature activation-7.98e-01

Pos logit contributions
ople+0.081
les+0.080
father+0.076
eled+0.076
led+0.075
Neg logit contributions
 boun-0.045
 cozy-0.043
 flat-0.042
 cos-0.042
 safe-0.042
Token Z
Token ID1168
Feature activation-8.94e+00

Pos logit contributions
les+0.024
ople+0.024
pa+0.024
father+0.024
led+0.024
Neg logit contributions
 boun-0.013
 cozy-0.012
 exc-0.012
 safe-0.011
 flat-0.011
Tokenippy
Token ID41214
Feature activation-2.81e+00

Pos logit contributions
iece+0.219
ople+0.216
pa+0.199
father+0.191
iness+0.189
Neg logit contributions
onna-0.157
 exc-0.150
 boun-0.149
 brid-0.145
 leash-0.145
Token made
Token ID925
Feature activation-1.43e+00

Pos logit contributions
ople+0.070
les+0.068
pa+0.067
ie+0.066
father+0.066
Neg logit contributions
 sturdy-0.051
 boun-0.051
 exc-0.050
 cozy-0.049
 flat-0.049
Token sure
Token ID1654
Feature activation-1.90e+00

Pos logit contributions
ople+0.047
ipped+0.044
pa+0.043
les+0.043
ie+0.043
Neg logit contributions
 sturdy-0.020
 cozy-0.020
 safe-0.020
 boun-0.019
 cos-0.019
Token to
Token ID284
Feature activation+2.76e+00

Pos logit contributions
les+0.067
ople+0.066
ie+0.064
pa+0.064
ipped+0.064
Neg logit contributions
 safe-0.018
 sturdy-0.018
 flat-0.017
 boun-0.017
 cozy-0.017
Token keep
Token ID1394
Feature activation-2.03e+00

Pos logit contributions
 boun+0.043
 sturdy+0.042
 flat+0.041
 sil+0.040
 cos+0.038
Neg logit contributions
ie-0.080
pa-0.078
father-0.076
ople-0.074
by-0.074
Token The
Token ID383
Feature activation-1.86e+00

Pos logit contributions
ie+0.003
ople+0.003
les+0.003
pa+0.003
father+0.003
Neg logit contributions
 boun-0.002
 sturdy-0.002
 flat-0.002
 cos-0.002
 cozy-0.002
Token tree
Token ID5509
Feature activation-6.35e-01

Pos logit contributions
father+0.047
les+0.046
ie+0.046
pa+0.046
 End+0.046
Neg logit contributions
onna-0.036
 cos-0.032
 sturdy-0.032
 stret-0.031
 shady-0.031
Token was
Token ID373
Feature activation+3.37e+00

Pos logit contributions
ople+0.013
les+0.013
ie+0.013
pa+0.012
father+0.012
Neg logit contributions
 sturdy-0.015
 boun-0.015
 flat-0.015
 cos-0.014
 cozy-0.014
Token gone
Token ID3750
Feature activation+1.71e+00

Pos logit contributions
 knit+0.060
onna+0.058
 sew+0.058
 opener+0.056
 rake+0.056
Neg logit contributions
atted-0.076
ie-0.076
ed-0.075
led-0.075
 End-0.075
Token!
Token ID0
Feature activation-4.29e-01

Pos logit contributions
 sturdy+0.034
 boun+0.034
 exc+0.033
 cos+0.033
 sil+0.033
Neg logit contributions
ie-0.041
les-0.040
ing-0.039
ople-0.038
pa-0.038
Token Z
Token ID1168
Feature activation-8.87e+00

Pos logit contributions
ie+0.013
les+0.013
ople+0.012
pa+0.012
father+0.012
Neg logit contributions
 boun-0.007
 flat-0.006
 sturdy-0.006
 cozy-0.006
 cos-0.006
Tokeneb
Token ID1765
Feature activation-5.03e+00

Pos logit contributions
ople+0.139
iness+0.132
ie+0.124
ing+0.118
led+0.118
Neg logit contributions
onna-0.234
 exc-0.221
 nib-0.217
 unw-0.214
 unt-0.214
Token was
Token ID373
Feature activation+3.15e+00

Pos logit contributions
ople+0.117
ays+0.116
ipped+0.112
les+0.109
ted+0.109
Neg logit contributions
 exc-0.099
 sturdy-0.097
 bend-0.093
 safe-0.093
onna-0.093
Token happy
Token ID3772
Feature activation+1.07e+00

Pos logit contributions
 sturdy+0.054
 boun+0.053
onna+0.051
 sil+0.051
 flat+0.050
Neg logit contributions
ople-0.083
ie-0.082
les-0.080
pa-0.080
father-0.079
Token because
Token ID780
Feature activation-4.48e-02

Pos logit contributions
 sturdy+0.023
 boun+0.022
 flat+0.022
 cos+0.022
 sil+0.022
Neg logit contributions
ie-0.025
ople-0.024
les-0.024
pa-0.024
ing-0.024
Token his
Token ID465
Feature activation-1.68e+00

Pos logit contributions
ople+0.001
les+0.001
ie+0.001
ipped+0.001
pa+0.001
Neg logit contributions
 boun-0.001
 sturdy-0.001
 exc-0.001
 flat-0.001
 cozy-0.001
Token
Token ID198
Feature activation+2.04e-01

Pos logit contributions
pa+0.016
ie+0.016
ople+0.016
les+0.016
father+0.016
Neg logit contributions
 sturdy-0.014
 boun-0.014
 flat-0.013
 cos-0.013
 cozy-0.013
TokenDaddy
Token ID48280
Feature activation-3.15e+00

Pos logit contributions
 boun+0.005
 sturdy+0.005
 flat+0.005
onna+0.005
 cos+0.004
Neg logit contributions
pa-0.004
father-0.004
ing-0.004
ma-0.004
ople-0.004
Token nodded
Token ID14464
Feature activation+6.69e-01

Pos logit contributions
ople+0.070
les+0.068
ie+0.067
pa+0.066
father+0.065
Neg logit contributions
 boun-0.069
 sturdy-0.068
 cos-0.066
 flat-0.066
 sil-0.066
Token and
Token ID290
Feature activation+1.09e+00

Pos logit contributions
 sturdy+0.013
 boun+0.013
 flat+0.013
 cos+0.012
 cozy+0.012
Neg logit contributions
ie-0.017
les-0.016
ople-0.016
pa-0.016
ing-0.016
Token told
Token ID1297
Feature activation-2.59e+00

Pos logit contributions
 sturdy+0.020
 boun+0.020
 flat+0.020
 cos+0.019
 cozy+0.019
Neg logit contributions
ople-0.028
les-0.027
ie-0.027
pa-0.027
father-0.026
Token Tr
Token ID833
Feature activation-8.79e+00

Pos logit contributions
ople+0.068
les+0.063
ipped+0.062
led+0.061
ie+0.061
Neg logit contributions
 wrap-0.050
 cos-0.049
 cozy-0.049
 sturdy-0.048
 sew-0.048
Tokenicia
Token ID33577
Feature activation-1.66e+00

Pos logit contributions
ted+0.142
ople+0.140
ipped+0.131
iece+0.129
ed+0.127
Neg logit contributions
onna-0.206
 exc-0.203
 fold-0.196
 rake-0.190
 opener-0.188
Token that
Token ID326
Feature activation-2.13e+00

Pos logit contributions
ople+0.042
les+0.042
ie+0.040
pa+0.040
ipped+0.040
Neg logit contributions
 sturdy-0.031
 boun-0.030
 cos-0.030
 flat-0.030
 safe-0.030
Token it
Token ID340
Feature activation-1.95e+00

Pos logit contributions
ople+0.056
les+0.056
ie+0.054
ipped+0.053
pa+0.053
Neg logit contributions
 sturdy-0.039
 boun-0.038
 cos-0.037
 flat-0.037
 cozy-0.037
Token was
Token ID373
Feature activation+1.40e+00

Pos logit contributions
ople+0.038
les+0.037
ie+0.037
pa+0.036
ipped+0.036
Neg logit contributions
 sturdy-0.048
 boun-0.048
 flat-0.047
 cos-0.047
 cozy-0.046
Token okay
Token ID8788
Feature activation-6.67e-01

Pos logit contributions
 sturdy+0.027
 boun+0.027
 leash+0.026
 flat+0.026
 sil+0.025
Neg logit contributions
ie-0.034
pa-0.033
ople-0.032
les-0.032
father-0.032
Token around
Token ID1088
Feature activation-1.70e+00

Pos logit contributions
 boun+0.005
 sturdy+0.005
 safe+0.005
 flat+0.004
 cozy+0.004
Neg logit contributions
ople-0.005
ipped-0.005
les-0.005
father-0.005
pa-0.005
Token.
Token ID13
Feature activation+1.33e+00

Pos logit contributions
ople+0.044
ipped+0.041
les+0.041
father+0.041
atted+0.040
Neg logit contributions
 boun-0.031
 safe-0.030
 flat-0.030
 sturdy-0.030
 wrap-0.030
Token<|endoftext|>
Token ID50256
Feature activation-4.45e+00

Pos logit contributions
 sturdy+0.020
 boun+0.020
 cozy+0.020
 cos+0.020
 flat+0.019
Neg logit contributions
ople-0.039
ie-0.038
les-0.038
pa-0.037
ipped-0.036
TokenOne
Token ID3198
Feature activation-2.76e+00

Pos logit contributions
ople+0.121
pa+0.115
atted+0.115
ie+0.114
father+0.114
Neg logit contributions
 boun-0.078
 cozy-0.076
 wrap-0.074
 opener-0.073
 safe-0.072
Token day
Token ID1110
Feature activation-3.08e-01

Pos logit contributions
ople+0.062
les+0.058
ipped+0.057
father+0.056
io+0.055
Neg logit contributions
 boun-0.067
 cozy-0.066
 sturdy-0.065
 cos-0.065
 safe-0.065
Token Al
Token ID978
Feature activation-8.78e+00

Pos logit contributions
ople+0.008
les+0.007
father+0.007
pa+0.007
ie+0.007
Neg logit contributions
 boun-0.006
 sturdy-0.006
 flat-0.006
 cos-0.006
 cozy-0.006
Tokenina
Token ID1437
Feature activation-2.53e+00

Pos logit contributions
ople+0.123
ipped+0.122
 End+0.120
ted+0.113
atted+0.108
Neg logit contributions
onna-0.225
 open-0.223
 fold-0.219
 wrap-0.218
 bury-0.217
Token wanted
Token ID2227
Feature activation-1.00e+00

Pos logit contributions
ople+0.062
father+0.061
les+0.060
ie+0.060
ipped+0.060
Neg logit contributions
 boun-0.047
 sturdy-0.047
 cos-0.045
 flat-0.045
 sil-0.045
Token to
Token ID284
Feature activation+2.69e+00

Pos logit contributions
pa+0.025
ie+0.025
ople+0.024
ing+0.024
by+0.024
Neg logit contributions
 sturdy-0.018
 boun-0.018
 exc-0.017
 cos-0.017
 sil-0.017
Token explore
Token ID7301
Feature activation+3.86e-02

Pos logit contributions
 sturdy+0.045
 boun+0.044
onna+0.043
jar+0.043
 flat+0.043
Neg logit contributions
pa-0.072
ie-0.071
ople-0.070
el-0.068
ma-0.066
Token,
Token ID11
Feature activation+1.41e+00

Pos logit contributions
 sturdy+0.001
onna+0.001
 exc+0.001
 boun+0.001
 flat+0.001
Neg logit contributions
ie-0.001
les-0.001
pa-0.001
ing-0.001
father-0.001
Token stories
Token ID3923
Feature activation-5.46e-01

Pos logit contributions
ople+0.037
les+0.037
pa+0.037
ie+0.037
father+0.036
Neg logit contributions
 sturdy-0.029
 boun-0.029
onna-0.029
 flat-0.028
 cos-0.028
Token said
Token ID531
Feature activation-2.28e-01

Pos logit contributions
les+0.011
ie+0.011
ople+0.011
pa+0.011
ed+0.010
Neg logit contributions
 sturdy-0.013
 boun-0.013
 flat-0.013
 cos-0.013
 cozy-0.013
Token.
Token ID13
Feature activation+8.63e-01

Pos logit contributions
ople+0.006
les+0.006
ie+0.006
pa+0.006
father+0.006
Neg logit contributions
 sturdy-0.004
 boun-0.004
 flat-0.004
 cos-0.004
 cozy-0.003
Token Mama
Token ID40084
Feature activation-3.97e+00

Pos logit contributions
 sturdy+0.019
 boun+0.018
 flat+0.018
 exc+0.017
 cos+0.017
Neg logit contributions
ople-0.020
les-0.020
ie-0.020
ed-0.020
ing-0.020
Token helped
Token ID4193
Feature activation-3.22e+00

Pos logit contributions
ople+0.104
les+0.102
ipped+0.100
ie+0.098
pa+0.098
Neg logit contributions
 boun-0.068
 sturdy-0.067
 wrap-0.065
 cos-0.065
 cozy-0.064
Token Ci
Token ID37685
Feature activation-8.77e+00

Pos logit contributions
ople+0.084
les+0.082
father+0.081
atted+0.078
led+0.078
Neg logit contributions
 wrap-0.060
 sew-0.058
 cozy-0.055
 safe-0.055
 rake-0.055
Tokenara
Token ID3301
Feature activation+1.12e-02

Pos logit contributions
father+0.161
ople+0.141
 End+0.140
pa+0.139
iece+0.139
Neg logit contributions
 wrap-0.209
 tie-0.199
 knit-0.197
 sew-0.195
onna-0.192
Token and
Token ID290
Feature activation-1.12e+00

Pos logit contributions
 sturdy+0.000
 boun+0.000
 flat+0.000
 cos+0.000
 cozy+0.000
Neg logit contributions
ople-0.000
les-0.000
ie-0.000
pa-0.000
father-0.000
Token together
Token ID1978
Feature activation-5.28e-01

Pos logit contributions
les+0.030
ople+0.030
father+0.029
pa+0.029
ipped+0.028
Neg logit contributions
 boun-0.020
 sturdy-0.019
 cos-0.019
 cozy-0.019
 exc-0.019
Token they
Token ID484
Feature activation+5.18e-01

Pos logit contributions
ople+0.013
les+0.013
ie+0.013
pa+0.013
father+0.013
Neg logit contributions
 sturdy-0.010
 boun-0.010
 flat-0.010
 cos-0.010
 cozy-0.009
Token read
Token ID1100
Feature activation-1.91e+00

Pos logit contributions
 sturdy+0.010
 boun+0.010
 flat+0.010
 cozy+0.010
 cos+0.010
Neg logit contributions
les-0.013
ople-0.013
ie-0.012
ing-0.012
ipped-0.012
Tokenma
Token ID2611
Feature activation-3.04e+00

Pos logit contributions
led+0.088
ed+0.082
les+0.079
rified+0.072
ted+0.067
Neg logit contributions
 sew-0.190
 knit-0.188
 exc-0.184
 wrap-0.184
 swing-0.175
Token was
Token ID373
Feature activation+2.75e+00

Pos logit contributions
ople+0.076
ie+0.074
les+0.073
pa+0.072
father+0.072
Neg logit contributions
 boun-0.057
 sturdy-0.056
 wrap-0.055
 cos-0.054
 flat-0.054
Token very
Token ID845
Feature activation+3.14e+00

Pos logit contributions
 sturdy+0.049
 boun+0.046
onna+0.045
 flat+0.045
 cos+0.044
Neg logit contributions
ople-0.070
pa-0.068
ma-0.068
ed-0.068
les-0.067
Token proud
Token ID6613
Feature activation+5.08e-01

Pos logit contributions
 sturdy+0.050
 boun+0.048
 flat+0.047
 sil+0.047
 cos+0.046
Neg logit contributions
ople-0.085
les-0.085
ie-0.085
ing-0.084
atted-0.083
Token of
Token ID286
Feature activation-3.89e+00

Pos logit contributions
 sturdy+0.012
 boun+0.011
 flat+0.011
 cos+0.011
 sil+0.011
Neg logit contributions
ie-0.011
les-0.011
ople-0.011
pa-0.011
ing-0.011
Token L
Token ID406
Feature activation-8.76e+00

Pos logit contributions
ople+0.122
led+0.119
les+0.118
by+0.113
ed+0.111
Neg logit contributions
 exc-0.053
 cozy-0.047
 sew-0.046
 bend-0.046
 grow-0.046
Tokenila
Token ID10102
Feature activation-2.29e+00

Pos logit contributions
father+0.189
eled+0.186
pa+0.186
by+0.185
ople+0.182
Neg logit contributions
onna-0.176
 exc-0.172
 swing-0.167
 tie-0.167
 rake-0.166
Token.
Token ID13
Feature activation+1.24e+00

Pos logit contributions
ople+0.068
les+0.066
ie+0.065
ipped+0.065
pa+0.065
Neg logit contributions
 sturdy-0.033
 boun-0.033
 flat-0.032
 cozy-0.032
 cos-0.031
Token She
Token ID1375
Feature activation-2.19e-01

Pos logit contributions
 sturdy+0.021
 boun+0.021
 flat+0.020
 cos+0.020
 cozy+0.019
Neg logit contributions
ople-0.033
les-0.033
ie-0.033
pa-0.032
ing-0.032
Token said
Token ID531
Feature activation-1.66e+00

Pos logit contributions
les+0.005
ie+0.004
ople+0.004
pa+0.004
ed+0.004
Neg logit contributions
 sturdy-0.005
 flat-0.005
 cos-0.005
 boun-0.005
 leash-0.005
Token,
Token ID11
Feature activation+1.50e+00

Pos logit contributions
ople+0.040
les+0.040
ie+0.039
pa+0.038
father+0.038
Neg logit contributions
 boun-0.034
 sturdy-0.034
 flat-0.033
 cozy-0.032
 cos-0.032
Token,
Token ID11
Feature activation-8.59e-01

Pos logit contributions
les+0.072
ople+0.071
father+0.070
pa+0.068
ie+0.068
Neg logit contributions
 boun-0.035
 cozy-0.035
 flat-0.035
 safe-0.034
 cos-0.034
Token something
Token ID1223
Feature activation-8.71e-01

Pos logit contributions
les+0.027
ople+0.026
ie+0.026
pa+0.026
father+0.026
Neg logit contributions
 boun-0.013
 flat-0.013
 cozy-0.012
 sturdy-0.012
 safe-0.012
Token bad
Token ID2089
Feature activation-2.54e+00

Pos logit contributions
les+0.019
ie+0.019
led+0.019
ople+0.018
ed+0.018
Neg logit contributions
 sturdy-0.019
 flat-0.018
 cos-0.018
 boun-0.018
onna-0.018
Token happened
Token ID3022
Feature activation+2.47e-01

Pos logit contributions
ople+0.051
ie+0.049
father+0.048
les+0.048
pa+0.048
Neg logit contributions
 boun-0.062
 sturdy-0.060
 flat-0.060
 wrap-0.059
 sil-0.058
Token.
Token ID13
Feature activation+5.08e-01

Pos logit contributions
 sturdy+0.004
 boun+0.004
 cos+0.004
 flat+0.004
 cozy+0.004
Neg logit contributions
ople-0.007
les-0.007
ie-0.007
pa-0.006
ing-0.006
Token Z
Token ID1168
Feature activation-8.75e+00

Pos logit contributions
 sturdy+0.010
 boun+0.010
 flat+0.010
 cos+0.009
 cozy+0.009
Neg logit contributions
ople-0.013
les-0.012
ie-0.012
pa-0.012
father-0.012
Tokenee
Token ID1453
Feature activation-3.77e+00

Pos logit contributions
ople+0.158
ie+0.148
father+0.147
mother+0.136
parents+0.133
Neg logit contributions
 fold-0.206
 tie-0.204
onna-0.202
 bend-0.194
 wrap-0.192
Token split
Token ID6626
Feature activation-3.77e+00

Pos logit contributions
father+0.094
pa+0.094
ople+0.091
ie+0.088
 End+0.087
Neg logit contributions
 boun-0.070
 flat-0.068
 sil-0.068
 sturdy-0.067
 seek-0.066
Token in
Token ID287
Feature activation-1.63e+00

Pos logit contributions
father+0.117
ople+0.115
pa+0.113
parents+0.111
mother+0.108
Neg logit contributions
 flat-0.062
 boun-0.058
 sturdy-0.053
 bend-0.052
 safe-0.051
Token two
Token ID734
Feature activation-1.22e+00

Pos logit contributions
ople+0.040
les+0.040
ie+0.040
pa+0.038
led+0.038
Neg logit contributions
 flat-0.034
 sturdy-0.034
 boun-0.033
 cozy-0.032
 wrap-0.032
Token.
Token ID13
Feature activation+5.53e-01

Pos logit contributions
ople+0.026
les+0.026
ie+0.026
pa+0.025
father+0.025
Neg logit contributions
 sturdy-0.028
 boun-0.027
 flat-0.027
 cos-0.026
 cozy-0.026
Token so
Token ID523
Feature activation+1.74e+00

Pos logit contributions
les+0.029
ople+0.029
ie+0.029
pa+0.028
father+0.028
Neg logit contributions
 boun-0.014
 sturdy-0.014
 cozy-0.014
 safe-0.014
 flat-0.014
Token much
Token ID881
Feature activation+1.29e+00

Pos logit contributions
 cozy+0.014
 safe+0.013
 boun+0.013
 exc+0.013
 sturdy+0.012
Neg logit contributions
ople-0.067
father-0.067
pa-0.066
les-0.066
ie-0.064
Token fun
Token ID1257
Feature activation+1.36e+00

Pos logit contributions
 sturdy+0.013
 flat+0.012
 boun+0.012
 exc+0.011
 cos+0.011
Neg logit contributions
les-0.044
ople-0.044
ie-0.044
pa-0.043
ipped-0.042
Token.
Token ID13
Feature activation+8.57e-01

Pos logit contributions
 boun+0.023
 sturdy+0.022
 flat+0.022
 cos+0.021
 cozy+0.021
Neg logit contributions
ople-0.038
les-0.037
ie-0.037
pa-0.037
father-0.036
Token K
Token ID509
Feature activation-8.48e+00

Pos logit contributions
 sturdy+0.014
 boun+0.014
 cos+0.014
 cozy+0.014
 flat+0.014
Neg logit contributions
les-0.024
ople-0.024
ie-0.024
pa-0.023
father-0.023
Tokenait
Token ID4548
Feature activation-8.72e+00

Pos logit contributions
iece+0.226
ople+0.223
les+0.222
father+0.218
led+0.217
Neg logit contributions
 exc-0.131
 cos-0.117
 sturdy-0.111
 cozy-0.111
 sil-0.110
Tokenlyn
Token ID6213
Feature activation-2.25e+00

Pos logit contributions
ays+0.213
les+0.200
ipped+0.198
ople+0.193
atted+0.192
Neg logit contributions
 safe-0.129
 wrap-0.126
 exc-0.123
 shut-0.121
 dry-0.110
Token was
Token ID373
Feature activation+3.16e+00

Pos logit contributions
ople+0.050
ie+0.050
les+0.049
pa+0.049
father+0.049
Neg logit contributions
 boun-0.047
 sturdy-0.047
 cozy-0.046
 exc-0.045
 cos-0.045
Token so
Token ID523
Feature activation+3.08e+00

Pos logit contributions
 sturdy+0.044
 boun+0.043
 flat+0.042
 cos+0.041
 cozy+0.041
Neg logit contributions
les-0.098
ople-0.098
ie-0.096
pa-0.095
father-0.093
Token happy
Token ID3772
Feature activation+6.08e-01

Pos logit contributions
 sturdy+0.029
 boun+0.029
 cozy+0.028
 exc+0.027
 flat+0.027
Neg logit contributions
les-0.111
ople-0.111
ie-0.109
pa-0.108
father-0.107
Token to
Token ID284
Feature activation+1.84e+00

Pos logit contributions
 sturdy+0.012
 boun+0.012
 flat+0.011
 cos+0.011
 cozy+0.011
Neg logit contributions
ople-0.015
ie-0.015
les-0.015
pa-0.015
ing-0.015
Token than
Token ID621
Feature activation-6.86e-01

Pos logit contributions
 sturdy+0.002
 boun+0.002
 flat+0.002
 cos+0.002
 cozy+0.002
Neg logit contributions
ople-0.003
ie-0.003
les-0.003
pa-0.002
ing-0.002
Token the
Token ID262
Feature activation-1.81e+00

Pos logit contributions
ople+0.019
les+0.018
ie+0.018
pa+0.018
father+0.018
Neg logit contributions
 sturdy-0.012
 flat-0.012
 boun-0.012
 safe-0.012
 cozy-0.012
Token park
Token ID3952
Feature activation-1.03e+00

Pos logit contributions
ople+0.046
les+0.046
ie+0.045
pa+0.044
father+0.043
Neg logit contributions
 sturdy-0.034
 boun-0.034
 flat-0.033
 cos-0.033
 cozy-0.032
Token.
Token ID13
Feature activation+8.29e-01

Pos logit contributions
ople+0.030
les+0.030
ie+0.029
pa+0.029
father+0.029
Neg logit contributions
 sturdy-0.016
 boun-0.016
 flat-0.015
 cos-0.015
 cozy-0.015
Token K
Token ID509
Feature activation-8.08e+00

Pos logit contributions
 sturdy+0.014
 boun+0.013
 cos+0.013
 flat+0.013
 cozy+0.013
Neg logit contributions
ople-0.023
les-0.023
ie-0.023
pa-0.022
father-0.022
Tokenait
Token ID4548
Feature activation-8.72e+00

Pos logit contributions
ople+0.209
les+0.206
iece+0.204
led+0.201
father+0.199
Neg logit contributions
 exc-0.133
 cos-0.125
 sturdy-0.121
 cozy-0.121
 sil-0.120
Tokenlyn
Token ID6213
Feature activation-1.53e+00

Pos logit contributions
ays+0.214
ipped+0.205
les+0.204
atted+0.201
ople+0.200
Neg logit contributions
 wrap-0.129
 shut-0.123
 safe-0.120
 exc-0.119
 branch-0.112
Token ran
Token ID4966
Feature activation+4.97e-01

Pos logit contributions
ople+0.035
les+0.035
ie+0.035
pa+0.034
father+0.034
Neg logit contributions
 boun-0.032
 sturdy-0.032
 flat-0.031
 cos-0.031
 cozy-0.030
Token over
Token ID625
Feature activation-7.55e-01

Pos logit contributions
 sturdy+0.010
 boun+0.010
 flat+0.009
 cos+0.009
 cozy+0.009
Neg logit contributions
ople-0.012
les-0.012
ie-0.012
pa-0.012
ing-0.012
Token to
Token ID284
Feature activation+7.18e-01

Pos logit contributions
ople+0.025
les+0.025
father+0.024
ie+0.024
pa+0.024
Neg logit contributions
 boun-0.009
 sturdy-0.009
 flat-0.009
 safe-0.008
 cozy-0.008
Token the
Token ID262
Feature activation-1.31e+00

Pos logit contributions
 sturdy+0.016
 boun+0.015
 flat+0.015
 cos+0.015
 cozy+0.015
Neg logit contributions
ople-0.016
les-0.016
ie-0.016
pa-0.016
father-0.015
Token,
Token ID11
Feature activation+1.31e+00

Pos logit contributions
 sturdy+0.017
 boun+0.017
 flat+0.017
 cozy+0.016
 cos+0.016
Neg logit contributions
ople-0.028
ie-0.027
les-0.027
pa-0.027
father-0.027
Token lights
Token ID7588
Feature activation+8.00e-01

Pos logit contributions
 sturdy+0.018
 flat+0.017
 boun+0.017
 cozy+0.016
 sil+0.016
Neg logit contributions
ople-0.042
ie-0.042
les-0.041
ipped-0.041
pa-0.041
Token and
Token ID290
Feature activation+8.72e-01

Pos logit contributions
 sturdy+0.016
 boun+0.016
 flat+0.015
 cos+0.015
 cozy+0.015
Neg logit contributions
ople-0.020
ie-0.019
les-0.019
pa-0.019
father-0.019
Token screens
Token ID8947
Feature activation+2.01e+00

Pos logit contributions
 sturdy+0.017
 boun+0.017
 flat+0.017
 cozy+0.016
 sil+0.016
Neg logit contributions
ie-0.022
ople-0.022
les-0.021
pa-0.021
ipped-0.021
Token.
Token ID13
Feature activation+8.37e-01

Pos logit contributions
 sturdy+0.038
 boun+0.037
 cos+0.036
 cozy+0.036
 flat+0.036
Neg logit contributions
les-0.052
ople-0.051
ie-0.050
pa-0.049
ing-0.049
Token Z
Token ID1168
Feature activation-8.71e+00

Pos logit contributions
 sturdy+0.013
 boun+0.013
 flat+0.013
 cos+0.013
 cozy+0.012
Neg logit contributions
ople-0.023
les-0.023
ie-0.023
pa-0.022
ing-0.022
Tokenog
Token ID519
Feature activation-3.69e+00

Pos logit contributions
ie+0.164
father+0.161
pa+0.157
ople+0.156
by+0.153
Neg logit contributions
 boun-0.213
 rake-0.205
 flat-0.204
 tie-0.204
 wrap-0.203
Token showed
Token ID3751
Feature activation-3.66e+00

Pos logit contributions
ople+0.090
les+0.088
father+0.087
pa+0.086
ipped+0.086
Neg logit contributions
 sturdy-0.070
 boun-0.069
 flat-0.069
 wrap-0.066
 sew-0.065
Token them
Token ID606
Feature activation+8.41e-02

Pos logit contributions
ople+0.091
father+0.089
pa+0.087
 End+0.085
ie+0.084
Neg logit contributions
 boun-0.076
 flat-0.074
 sturdy-0.074
 wrap-0.073
 safe-0.072
Token how
Token ID703
Feature activation-2.96e-01

Pos logit contributions
 sturdy+0.002
 boun+0.002
 flat+0.002
 cos+0.002
 cozy+0.002
Neg logit contributions
ople-0.002
les-0.002
ie-0.002
pa-0.002
father-0.002
Token to
Token ID284
Feature activation+1.25e+00

Pos logit contributions
ople+0.010
les+0.010
ie+0.010
pa+0.009
father+0.009
Neg logit contributions
 sturdy-0.004
 boun-0.004
 flat-0.003
 cozy-0.003
 cos-0.003
Token,
Token ID11
Feature activation-4.06e-01

Pos logit contributions
les+0.043
ople+0.043
ie+0.042
father+0.041
pa+0.041
Neg logit contributions
 sturdy-0.028
 flat-0.027
 safe-0.027
 boun-0.027
 cos-0.027
Token the
Token ID262
Feature activation-2.17e+00

Pos logit contributions
les+0.012
ie+0.011
ople+0.011
father+0.011
pa+0.011
Neg logit contributions
 sturdy-0.007
 flat-0.007
 safe-0.007
 cozy-0.006
 boun-0.006
Token nurse
Token ID15849
Feature activation-3.43e+00

Pos logit contributions
les+0.056
ople+0.056
ie+0.055
pa+0.054
father+0.053
Neg logit contributions
 sturdy-0.039
 boun-0.038
onna-0.037
 flat-0.037
 cos-0.036
Token said
Token ID531
Feature activation-1.50e+00

Pos logit contributions
ople+0.084
les+0.082
ie+0.082
pa+0.081
father+0.080
Neg logit contributions
 sturdy-0.066
 boun-0.066
 flat-0.064
 cos-0.063
 wrap-0.063
Token that
Token ID326
Feature activation-3.15e+00

Pos logit contributions
ople+0.038
les+0.038
ie+0.037
pa+0.037
father+0.036
Neg logit contributions
 sturdy-0.029
 boun-0.028
 flat-0.028
 cos-0.027
 cozy-0.027
Token Kay
Token ID17356
Feature activation-8.70e+00

Pos logit contributions
les+0.089
ople+0.087
ipped+0.084
ie+0.083
led+0.083
Neg logit contributions
 sturdy-0.054
 safe-0.052
 boun-0.052
 cos-0.052
 exc-0.052
Tokenla
Token ID5031
Feature activation-4.46e+00

Pos logit contributions
 End+0.162
ople+0.153
father+0.153
ipped+0.148
pa+0.142
Neg logit contributions
 wrap-0.181
 sil-0.179
 tie-0.171
 safe-0.166
 shut-0.165
Token's
Token ID338
Feature activation-6.42e-01

Pos logit contributions
ople+0.102
father+0.099
pa+0.099
les+0.098
ie+0.098
Neg logit contributions
 sturdy-0.092
 boun-0.091
 safe-0.088
 cos-0.087
 flat-0.087
Token knee
Token ID10329
Feature activation-2.69e-01

Pos logit contributions
ie+0.015
les+0.015
ople+0.015
pa+0.015
ipped+0.015
Neg logit contributions
 sturdy-0.013
 boun-0.012
 flat-0.012
onna-0.012
 cos-0.012
Token wouldn
Token ID3636
Feature activation+1.13e+00

Pos logit contributions
ople+0.006
les+0.005
ie+0.005
pa+0.005
father+0.005
Neg logit contributions
 sturdy-0.006
 boun-0.006
 flat-0.006
 cos-0.006
 cozy-0.006
Token't
Token ID470
Feature activation+2.34e+00

Pos logit contributions
 sturdy+0.021
 boun+0.021
 cos+0.020
 cozy+0.020
 flat+0.019
Neg logit contributions
ie-0.029
les-0.029
ople-0.029
ing-0.028
pa-0.028
Token find
Token ID1064
Feature activation-2.56e+00

Pos logit contributions
 sturdy+0.015
 boun+0.015
 flat+0.014
 cos+0.014
 sil+0.013
Neg logit contributions
ie-0.026
pa-0.025
ople-0.025
les-0.025
father-0.024
Token it
Token ID340
Feature activation+1.28e+00

Pos logit contributions
ople+0.073
les+0.068
ie+0.067
led+0.067
ipped+0.066
Neg logit contributions
 wrap-0.045
 safe-0.045
 sturdy-0.044
 cozy-0.043
 boun-0.042
Token.
Token ID13
Feature activation+8.77e-01

Pos logit contributions
 sturdy+0.025
 boun+0.025
 sil+0.024
 cos+0.024
 flat+0.023
Neg logit contributions
ie-0.032
les-0.031
pa-0.030
ed-0.029
ople-0.029
Token She
Token ID1375
Feature activation-9.25e-03

Pos logit contributions
 sturdy+0.014
 boun+0.014
 flat+0.014
 cos+0.013
 cozy+0.013
Neg logit contributions
ople-0.024
les-0.024
ie-0.024
pa-0.024
ing-0.023
Token thanks
Token ID5176
Feature activation-4.05e+00

Pos logit contributions
les+0.000
ople+0.000
ie+0.000
pa+0.000
ipped+0.000
Neg logit contributions
 sturdy-0.000
 flat-0.000
 boun-0.000
 cos-0.000
 sil-0.000
Token L
Token ID406
Feature activation-8.70e+00

Pos logit contributions
ople+0.113
les+0.108
led+0.107
father+0.105
ipped+0.102
Neg logit contributions
 boun-0.068
 exc-0.068
 wrap-0.067
 flat-0.067
 cozy-0.067
Tokenila
Token ID10102
Feature activation-2.12e+00

Pos logit contributions
father+0.196
pa+0.190
ople+0.190
by+0.184
eled+0.182
Neg logit contributions
 exc-0.174
 swing-0.173
onna-0.171
 tie-0.168
 boun-0.165
Token and
Token ID290
Feature activation-1.26e+00

Pos logit contributions
ople+0.062
pa+0.060
les+0.059
father+0.059
ie+0.059
Neg logit contributions
 boun-0.032
 sturdy-0.032
 safe-0.031
 cozy-0.031
 flat-0.031
Token Sam
Token ID3409
Feature activation+1.17e-01

Pos logit contributions
ople+0.032
ie+0.032
les+0.031
father+0.031
led+0.031
Neg logit contributions
 boun-0.026
 exc-0.025
 wrap-0.024
 flat-0.024
 cozy-0.024
Token for
Token ID329
Feature activation-2.09e-01

Pos logit contributions
 sturdy+0.002
 boun+0.002
 flat+0.002
 cos+0.002
 cozy+0.002
Neg logit contributions
ople-0.003
les-0.003
ie-0.003
pa-0.003
father-0.003
Token helping
Token ID5742
Feature activation-2.52e+00

Pos logit contributions
pa+0.005
ople+0.005
ie+0.005
father+0.005
ma+0.004
Neg logit contributions
 sturdy-0.004
 boun-0.004
 cos-0.004
 sil-0.004
 flat-0.004
Token more
Token ID517
Feature activation+1.56e+00

Pos logit contributions
 boun+0.043
 sturdy+0.043
 cos+0.041
 leash+0.041
 sil+0.041
Neg logit contributions
ie-0.064
les-0.063
ing-0.060
ople-0.060
pa-0.059
Token natural
Token ID3288
Feature activation-1.41e+00

Pos logit contributions
 sturdy+0.024
 boun+0.024
 flat+0.023
 cos+0.023
 leash+0.023
Neg logit contributions
les-0.044
ie-0.043
pa-0.042
ople-0.042
ing-0.041
Token.
Token ID13
Feature activation+2.76e-01

Pos logit contributions
les+0.041
ie+0.041
ople+0.041
father+0.041
ipped+0.040
Neg logit contributions
 boun-0.020
 sturdy-0.020
 flat-0.019
 sil-0.019
 exc-0.019
Token And
Token ID843
Feature activation+1.13e+00

Pos logit contributions
 sturdy+0.006
 boun+0.006
 flat+0.005
 cos+0.005
 cozy+0.005
Neg logit contributions
ople-0.007
les-0.006
pa-0.006
father-0.006
ie-0.006
Token you
Token ID345
Feature activation-1.35e+00

Pos logit contributions
 sturdy+0.021
 boun+0.020
 flat+0.020
 cos+0.020
 cozy+0.020
Neg logit contributions
les-0.030
ople-0.030
ie-0.029
pa-0.029
ipped-0.028
Token should
Token ID815
Feature activation+3.43e+00

Pos logit contributions
ople+0.031
les+0.031
ie+0.031
pa+0.030
father+0.029
Neg logit contributions
 boun-0.028
 sturdy-0.028
 cos-0.027
 flat-0.027
 sil-0.027
Token not
Token ID407
Feature activation+3.09e+00

Pos logit contributions
 sturdy+0.058
 boun+0.056
 sil+0.054
 cos+0.054
 flat+0.053
Neg logit contributions
ie-0.095
pa-0.094
ma-0.090
ople-0.090
les-0.087
Token take
Token ID1011
Feature activation-3.49e+00

Pos logit contributions
 sturdy+0.051
 boun+0.049
 cos+0.048
 cozy+0.048
 sil+0.046
Neg logit contributions
ie-0.090
ople-0.088
el-0.086
pa-0.085
les-0.084
Token Lily
Token ID20037
Feature activation-1.42e+00

Pos logit contributions
ie+0.085
ople+0.084
ipped+0.084
eled+0.084
father+0.083
Neg logit contributions
 safe-0.072
 flat-0.070
 sturdy-0.069
 cozy-0.068
 long-0.067
Token's
Token ID338
Feature activation-1.27e+00

Pos logit contributions
ople+0.039
les+0.038
ipped+0.037
pa+0.037
ie+0.037
Neg logit contributions
 sturdy-0.023
 flat-0.023
 boun-0.023
 safe-0.023
 wrap-0.022
Token cr
Token ID1067
Feature activation-1.14e+00

Pos logit contributions
ople+0.033
les+0.033
ie+0.033
pa+0.032
ipped+0.032
Neg logit contributions
 sturdy-0.023
 boun-0.023
 flat-0.022
 cozy-0.022
 cos-0.022
Token course
Token ID1781
Feature activation+8.61e-01

Pos logit contributions
 sturdy+0.022
 boun+0.021
 flat+0.021
 cos+0.021
 cozy+0.020
Neg logit contributions
ople-0.024
les-0.024
ie-0.024
pa-0.023
father-0.023
Token.
Token ID13
Feature activation+1.37e+00

Pos logit contributions
 sturdy+0.016
 boun+0.016
 cos+0.015
 sil+0.015
 flat+0.015
Neg logit contributions
ie-0.022
ople-0.022
les-0.022
pa-0.022
ing-0.021
Token He
Token ID679
Feature activation-3.47e-01

Pos logit contributions
 boun+0.033
 sturdy+0.032
 sil+0.031
 exc+0.031
 flat+0.031
Neg logit contributions
ople-0.027
les-0.026
pa-0.025
ed-0.025
ing-0.024
Token was
Token ID373
Feature activation+3.63e+00

Pos logit contributions
ople+0.007
les+0.007
ie+0.007
pa+0.007
ed+0.007
Neg logit contributions
 sturdy-0.008
 boun-0.008
 flat-0.008
 cos-0.008
 cozy-0.007
Token wearing
Token ID5762
Feature activation-7.57e-01

Pos logit contributions
 sturdy+0.067
onna+0.066
 sil+0.066
 sew+0.064
jar+0.063
Neg logit contributions
ople-0.085
ie-0.083
ed-0.082
ing-0.082
atted-0.081
Token a
Token ID257
Feature activation+4.71e+00

Pos logit contributions
ople+0.021
ie+0.021
les+0.021
pa+0.020
father+0.020
Neg logit contributions
 sturdy-0.013
 boun-0.013
 flat-0.013
 cozy-0.012
 cos-0.012
Token red
Token ID2266
Feature activation+2.64e+00

Pos logit contributions
onna+0.085
 exc+0.081
 opener+0.080
 sew+0.077
 distract+0.076
Neg logit contributions
les-0.111
by-0.108
ie-0.102
el-0.100
ipped-0.099
Token pl
Token ID458
Feature activation-1.24e+00

Pos logit contributions
 sturdy+0.042
 exc+0.040
onna+0.040
 cos+0.039
 cozy+0.038
Neg logit contributions
les-0.076
ie-0.075
ipped-0.073
ople-0.072
pa-0.072
Tokenaid
Token ID1698
Feature activation+1.91e+00

Pos logit contributions
ie+0.023
ople+0.023
pa+0.022
les+0.022
father+0.021
Neg logit contributions
 sturdy-0.031
 boun-0.031
 flat-0.031
 opener-0.030
onna-0.030
Token shirt
Token ID10147
Feature activation+4.25e+00

Pos logit contributions
 sturdy+0.041
 boun+0.041
 flat+0.040
 cos+0.039
 cozy+0.039
Neg logit contributions
ople-0.044
les-0.043
ie-0.043
pa-0.042
father-0.041
Token and
Token ID290
Feature activation+1.26e+00

Pos logit contributions
 sturdy+0.091
 cos+0.087
 exc+0.086
 boun+0.086
 sil+0.085
Neg logit contributions
les-0.100
ie-0.099
ing-0.098
ed-0.097
ople-0.096
Token grabbed
Token ID13176
Feature activation-1.93e+00

Pos logit contributions
 boun+0.049
 flat+0.047
 sturdy+0.047
 cos+0.047
 cozy+0.046
Neg logit contributions
les-0.055
ie-0.053
ed-0.052
led-0.051
ople-0.049
Token a
Token ID257
Feature activation+4.05e+00

Pos logit contributions
ople+0.056
les+0.052
ie+0.052
pa+0.052
father+0.052
Neg logit contributions
 sturdy-0.033
 boun-0.032
 flat-0.032
 strong-0.031
 wrap-0.031
Token branch
Token ID8478
Feature activation+2.45e+00

Pos logit contributions
resses+0.075
 exc+0.069
 boun+0.068
 consider+0.068
 sew+0.068
Neg logit contributions
ie-0.099
by-0.099
les-0.095
ing-0.090
ison-0.087
Token and
Token ID290
Feature activation+1.32e+00

Pos logit contributions
 sturdy+0.051
 boun+0.050
 cozy+0.049
 exc+0.049
 cos+0.049
Neg logit contributions
ie-0.058
ing-0.057
les-0.056
ed-0.055
pa-0.054
Token started
Token ID2067
Feature activation+1.58e+00

Pos logit contributions
 boun+0.025
 sturdy+0.025
 flat+0.025
 cos+0.024
onna+0.024
Neg logit contributions
les-0.032
ie-0.032
ople-0.032
ed-0.032
led-0.031
Token to
Token ID284
Feature activation+3.58e+00

Pos logit contributions
 sturdy+0.030
jar+0.030
 boun+0.029
 cos+0.028
 sil+0.028
Neg logit contributions
ing-0.038
atted-0.038
ed-0.038
ipped-0.037
ie-0.037
Token wipe
Token ID19916
Feature activation-2.73e+00

Pos logit contributions
 boun+0.056
jar+0.053
resses+0.052
 suitable+0.052
 sturdy+0.051
Neg logit contributions
ie-0.103
pa-0.095
by-0.093
el-0.092
les-0.092
Token it
Token ID340
Feature activation+2.31e+00

Pos logit contributions
ople+0.080
father+0.075
ipped+0.073
pa+0.073
mother+0.072
Neg logit contributions
 flat-0.047
 sturdy-0.046
 safe-0.046
 cozy-0.044
 cos-0.044
Token against
Token ID1028
Feature activation-2.06e+00

Pos logit contributions
 boun+0.056
jar+0.056
 exc+0.055
onna+0.055
 sturdy+0.055
Neg logit contributions
ie-0.044
ing-0.042
les-0.041
ed-0.041
led-0.040
Token the
Token ID262
Feature activation-1.88e+00

Pos logit contributions
ople+0.056
les+0.054
ipped+0.052
atted+0.052
ie+0.051
Neg logit contributions
 flat-0.040
 sturdy-0.040
 cozy-0.038
 safe-0.038
 boun-0.038
Token bee
Token ID20697
Feature activation+6.32e-01

Pos logit contributions
ie+0.054
les+0.052
ople+0.051
ing+0.050
el+0.048
Neg logit contributions
 boun-0.030
 sturdy-0.029
 opener-0.028
 exc-0.028
 cos-0.028
Token balloon
Token ID21190
Feature activation+1.61e+00

Pos logit contributions
les+0.045
ople+0.045
pa+0.045
ie+0.043
by+0.043
Neg logit contributions
onna-0.030
 sturdy-0.029
 cos-0.029
 seek-0.028
 opener-0.028
Token.
Token ID13
Feature activation+1.36e+00

Pos logit contributions
 sturdy+0.031
 sil+0.030
 cos+0.029
 boun+0.029
 exc+0.028
Neg logit contributions
ie-0.040
ed-0.038
les-0.038
by-0.038
ing-0.038
Token She
Token ID1375
Feature activation-3.14e-01

Pos logit contributions
 sturdy+0.024
 boun+0.023
 cos+0.022
 flat+0.022
 exc+0.022
Neg logit contributions
ople-0.036
les-0.036
pa-0.035
ie-0.035
father-0.035
Token hoped
Token ID10719
Feature activation-2.34e+00

Pos logit contributions
ople+0.008
les+0.008
ie+0.008
pa+0.007
ed+0.007
Neg logit contributions
 sturdy-0.006
 flat-0.006
 boun-0.006
 cos-0.006
 cozy-0.006
Token he
Token ID339
Feature activation-1.72e+00

Pos logit contributions
ople+0.071
eled+0.070
ipped+0.070
les+0.070
led+0.068
Neg logit contributions
 safe-0.034
 sturdy-0.033
 flat-0.032
 cozy-0.032
 boun-0.032
Token would
Token ID561
Feature activation+4.52e+00

Pos logit contributions
ople+0.045
les+0.044
ie+0.044
pa+0.043
father+0.043
Neg logit contributions
 sturdy-0.032
 boun-0.031
 flat-0.031
 cos-0.030
 cozy-0.030
Token like
Token ID588
Feature activation-1.49e+00

Pos logit contributions
 boun+0.089
 sturdy+0.086
 cos+0.083
 sil+0.082
resses+0.079
Neg logit contributions
ie-0.097
by-0.094
pa-0.092
 End-0.089
ma-0.089
Token it
Token ID340
Feature activation+2.02e-01

Pos logit contributions
ople+0.032
les+0.032
ie+0.031
pa+0.031
father+0.030
Neg logit contributions
 sturdy-0.034
 boun-0.034
 flat-0.033
 cos-0.033
 cozy-0.032
Token.
Token ID13
Feature activation+8.02e-01

Pos logit contributions
 sturdy+0.004
 boun+0.004
 flat+0.003
 cos+0.003
 cozy+0.003
Neg logit contributions
ople-0.005
ie-0.005
les-0.005
pa-0.005
father-0.005
Token
Token ID198
Feature activation-9.35e-01

Pos logit contributions
 sturdy+0.012
 boun+0.011
 flat+0.011
 cos+0.011
 cozy+0.011
Neg logit contributions
ople-0.024
les-0.024
ie-0.024
pa-0.023
father-0.023
Token
Token ID198
Feature activation+6.57e-02

Pos logit contributions
ople+0.028
les+0.027
ie+0.027
ipped+0.026
pa+0.026
Neg logit contributions
 sturdy-0.014
 boun-0.014
 cozy-0.013
 flat-0.013
 exc-0.013
Token!
Token ID0
Feature activation+4.65e-01

Pos logit contributions
ipped+0.062
ople+0.062
atted+0.060
father+0.060
eled+0.060
Neg logit contributions
 safe-0.020
 boun-0.019
 sturdy-0.018
 cozy-0.018
 flat-0.017
Token They
Token ID1119
Feature activation+8.25e-01

Pos logit contributions
 sturdy+0.008
 boun+0.008
 flat+0.008
 cos+0.008
 cozy+0.008
Neg logit contributions
ople-0.013
les-0.013
ie-0.012
pa-0.012
father-0.012
Token played
Token ID2826
Feature activation+3.74e-01

Pos logit contributions
 sturdy+0.022
 flat+0.021
onna+0.021
 cos+0.021
 sil+0.021
Neg logit contributions
ie-0.015
ople-0.015
les-0.015
pa-0.014
ed-0.014
Token with
Token ID351
Feature activation-2.80e+00

Pos logit contributions
onna+0.008
 exc+0.007
 sturdy+0.007
 sil+0.007
 cos+0.007
Neg logit contributions
les-0.009
ed-0.009
ie-0.008
led-0.008
ing-0.008
Token the
Token ID262
Feature activation-1.82e+00

Pos logit contributions
ople+0.075
father+0.072
ipped+0.071
ie+0.071
pa+0.071
Neg logit contributions
 boun-0.055
 sturdy-0.055
 safe-0.055
 cozy-0.054
 flat-0.054
Token toys
Token ID14958
Feature activation+2.81e+00

Pos logit contributions
ople+0.049
les+0.049
ie+0.048
pa+0.047
father+0.046
Neg logit contributions
 sturdy-0.031
 boun-0.030
onna-0.030
 flat-0.030
 cos-0.030
Token all
Token ID477
Feature activation+1.32e-01

Pos logit contributions
 sturdy+0.060
 sil+0.059
 cos+0.059
 boun+0.059
 exc+0.058
Neg logit contributions
ie-0.062
les-0.062
ople-0.062
ing-0.060
ed-0.059
Token day
Token ID1110
Feature activation+1.13e+00

Pos logit contributions
 safe+0.003
 boun+0.003
 sturdy+0.003
 flat+0.003
 cozy+0.003
Neg logit contributions
ople-0.003
ipped-0.003
father-0.003
les-0.003
pa-0.003
Token and
Token ID290
Feature activation-1.52e-01

Pos logit contributions
 exc+0.020
 sturdy+0.019
 sew+0.019
 cos+0.018
 sil+0.018
Neg logit contributions
ing-0.030
les-0.029
ie-0.029
ople-0.028
ed-0.028
Token shared
Token ID4888
Feature activation-1.58e+00

Pos logit contributions
ipped+0.004
les+0.004
ays+0.004
father+0.004
ople+0.004
Neg logit contributions
 boun-0.003
 safe-0.003
 cozy-0.003
 sturdy-0.003
 cos-0.003
Token them
Token ID606
Feature activation+7.95e-01

Pos logit contributions
ople+0.037
ie+0.036
les+0.036
father+0.036
pa+0.036
Neg logit contributions
 sturdy-0.034
 boun-0.033
 flat-0.033
 cozy-0.032
 cos-0.032
Token shine
Token ID18340
Feature activation+5.88e-01

Pos logit contributions
 sturdy+0.059
 boun+0.058
 leash+0.055
 sil+0.055
 flat+0.054
Neg logit contributions
ie-0.119
les-0.116
by-0.113
ing-0.110
ed-0.109
Token.
Token ID13
Feature activation+1.28e+00

Pos logit contributions
 sturdy+0.009
 boun+0.009
 flat+0.008
 cozy+0.008
 cos+0.008
Neg logit contributions
ople-0.018
pa-0.017
father-0.017
les-0.017
ipped-0.017
Token All
Token ID1439
Feature activation-2.52e+00

Pos logit contributions
 sturdy+0.021
 boun+0.020
 flat+0.020
 cos+0.019
 cozy+0.019
Neg logit contributions
ople-0.036
les-0.036
ie-0.036
pa-0.035
father-0.034
Token the
Token ID262
Feature activation-2.55e+00

Pos logit contributions
ople+0.069
io+0.068
father+0.067
ipped+0.066
ael+0.065
Neg logit contributions
 boun-0.043
 cozy-0.041
 safe-0.040
 cos-0.040
 clean-0.040
Token creatures
Token ID8109
Feature activation+8.59e-01

Pos logit contributions
ipped+0.061
ople+0.061
ed+0.060
ays+0.060
ing+0.059
Neg logit contributions
 boun-0.054
 cozy-0.052
 sturdy-0.052
 cos-0.051
 sil-0.051
Token stared
Token ID18484
Feature activation-6.04e-01

Pos logit contributions
 sturdy+0.018
 boun+0.018
 flat+0.018
 cos+0.017
 cozy+0.017
Neg logit contributions
les-0.020
ie-0.020
ople-0.020
pa-0.019
led-0.019
Token in
Token ID287
Feature activation+7.37e-01

Pos logit contributions
ople+0.017
father+0.016
pa+0.016
les+0.015
ipped+0.015
Neg logit contributions
 sturdy-0.011
 boun-0.011
 flat-0.011
 cozy-0.010
 safe-0.010
Token awe
Token ID25030
Feature activation-1.87e+00

Pos logit contributions
 sturdy+0.016
 boun+0.016
 flat+0.015
 cos+0.015
 cozy+0.015
Neg logit contributions
ople-0.017
les-0.017
ie-0.017
pa-0.016
father-0.016
Token at
Token ID379
Feature activation-3.30e+00

Pos logit contributions
ople+0.055
father+0.055
ipped+0.053
pa+0.053
ison+0.052
Neg logit contributions
 sturdy-0.027
 boun-0.026
 sil-0.025
 safe-0.025
 flat-0.025
Token the
Token ID262
Feature activation-1.07e+00

Pos logit contributions
les+0.081
ople+0.080
ipped+0.078
led+0.076
father+0.076
Neg logit contributions
 boun-0.063
 cozy-0.062
 sturdy-0.062
 wrap-0.062
 flat-0.061
Token magic
Token ID5536
Feature activation-9.96e-01

Pos logit contributions
ople+0.028
les+0.027
ie+0.027
pa+0.027
ipped+0.027
Neg logit contributions
 sturdy-0.020
 boun-0.020
 flat-0.019
 cozy-0.019
 cos-0.019
Token They
Token ID1119
Feature activation+5.55e-01

Pos logit contributions
 sturdy+0.030
 boun+0.029
 flat+0.029
 cos+0.028
 exc+0.028
Neg logit contributions
ople-0.034
les-0.033
ie-0.033
pa-0.032
ipped-0.031
Token helped
Token ID4193
Feature activation-3.03e+00

Pos logit contributions
 sturdy+0.012
 boun+0.012
 flat+0.011
 cos+0.011
 cozy+0.011
Neg logit contributions
ople-0.013
les-0.013
ie-0.013
pa-0.012
father-0.012
Token her
Token ID607
Feature activation-2.51e-01

Pos logit contributions
ople+0.078
les+0.077
father+0.077
ipped+0.076
pa+0.074
Neg logit contributions
 wrap-0.056
 sew-0.055
 sturdy-0.054
 boun-0.054
 cozy-0.054
Token put
Token ID1234
Feature activation-3.82e+00

Pos logit contributions
ie+0.008
les+0.008
ople+0.008
pa+0.008
ing+0.008
Neg logit contributions
 sturdy-0.003
 boun-0.003
 flat-0.003
 sil-0.003
 cos-0.003
Token her
Token ID607
Feature activation-1.50e+00

Pos logit contributions
ople+0.110
father+0.108
pa+0.102
by+0.100
ie+0.100
Neg logit contributions
 boun-0.071
 safe-0.069
 flat-0.068
 wrap-0.068
 cozy-0.067
Token bathing
Token ID39153
Feature activation-1.02e+00

Pos logit contributions
ople+0.047
father+0.046
ie+0.045
les+0.045
pa+0.045
Neg logit contributions
 boun-0.022
 cozy-0.022
 flat-0.022
 safe-0.021
 sturdy-0.021
Token suit
Token ID6050
Feature activation+1.36e+00

Pos logit contributions
ople+0.029
father+0.028
pa+0.028
les+0.027
ie+0.027
Neg logit contributions
 boun-0.017
 flat-0.016
 sturdy-0.016
 wrap-0.016
 cos-0.016
Token back
Token ID736
Feature activation-5.40e-01

Pos logit contributions
 sturdy+0.028
 flat+0.027
 boun+0.027
 safe+0.026
 cozy+0.026
Neg logit contributions
ople-0.034
father-0.032
pa-0.032
les-0.032
ie-0.032
Token on
Token ID319
Feature activation-1.71e+00

Pos logit contributions
ople+0.015
les+0.014
father+0.014
pa+0.014
ie+0.014
Neg logit contributions
 sturdy-0.009
 flat-0.009
 boun-0.009
 cozy-0.009
 safe-0.009
Token and
Token ID290
Feature activation+8.31e-01

Pos logit contributions
ople+0.037
les+0.037
ie+0.036
pa+0.035
father+0.035
Neg logit contributions
 sturdy-0.039
 boun-0.039
 flat-0.038
 cos-0.037
 cozy-0.037
Token they
Token ID484
Feature activation+5.13e-01

Pos logit contributions
 boun+0.016
 sturdy+0.016
 flat+0.015
 cos+0.015
 cozy+0.015
Neg logit contributions
ople-0.021
les-0.021
ie-0.021
pa-0.020
father-0.020
Token said
Token ID531
Feature activation-2.01e-01

Pos logit contributions
 sturdy+0.025
 flat+0.025
 boun+0.024
onna+0.024
 sil+0.024
Neg logit contributions
ie-0.026
les-0.026
ople-0.025
pa-0.025
ed-0.025
Token,
Token ID11
Feature activation+3.00e+00

Pos logit contributions
ople+0.004
les+0.004
ie+0.004
pa+0.004
ing+0.004
Neg logit contributions
 sturdy-0.005
 boun-0.004
 flat-0.004
 cos-0.004
 cozy-0.004
Token "
Token ID366
Feature activation-2.47e-01

Pos logit contributions
 sturdy+0.064
 cos+0.058
 boun+0.058
 hollow+0.057
 seek+0.057
Neg logit contributions
ed-0.067
ing-0.066
atted-0.064
ie-0.063
ople-0.063
TokenMe
Token ID5308
Feature activation-3.23e+00

Pos logit contributions
ople+0.007
les+0.007
ie+0.007
pa+0.007
ipped+0.007
Neg logit contributions
 boun-0.004
 sturdy-0.004
 cos-0.004
 sil-0.004
 flat-0.004
Token too
Token ID1165
Feature activation+7.22e-01

Pos logit contributions
les+0.102
ople+0.100
father+0.098
pa+0.097
ie+0.096
Neg logit contributions
 boun-0.045
 flat-0.044
 sturdy-0.044
 safe-0.042
 cos-0.042
Token,
Token ID11
Feature activation+5.63e-01

Pos logit contributions
 sturdy+0.017
 boun+0.017
 rake+0.016
 flat+0.016
 sew+0.016
Neg logit contributions
ie-0.015
ople-0.015
les-0.014
ing-0.014
pa-0.014
Token Tim
Token ID5045
Feature activation-3.71e+00

Pos logit contributions
 sturdy+0.010
 boun+0.010
 flat+0.010
 cos+0.010
 cozy+0.010
Neg logit contributions
ople-0.014
ie-0.014
les-0.014
pa-0.014
father-0.014
Tokenmy
Token ID1820
Feature activation-3.58e-01

Pos logit contributions
ipped+0.052
ople+0.052
les+0.048
ed+0.048
led+0.048
Neg logit contributions
 safe-0.110
 boun-0.109
 sturdy-0.109
 flat-0.109
 cozy-0.108
Token."
Token ID526
Feature activation+1.33e+00

Pos logit contributions
ie+0.010
ople+0.009
pa+0.009
les+0.009
ing+0.009
Neg logit contributions
 sturdy-0.007
 boun-0.007
 flat-0.006
 sil-0.006
 cos-0.006
Token<|endoftext|>
Token ID50256
Feature activation-4.05e+00

Pos logit contributions
 sturdy+0.026
 boun+0.025
 flat+0.024
 sil+0.024
 cos+0.024
Neg logit contributions
ie-0.033
ople-0.033
les-0.033
ed-0.032
pa-0.032
TokenOnce
Token ID7454
Feature activation-3.19e+00

Pos logit contributions
ople+0.133
ie+0.128
les+0.128
father+0.127
pa+0.127
Neg logit contributions
 boun-0.049
 cozy-0.048
 flat-0.046
 safe-0.045
 cos-0.045
Token can
Token ID460
Feature activation+4.31e+00

Pos logit contributions
 sturdy+0.009
 boun+0.009
 sil+0.008
 flat+0.008
 cos+0.008
Neg logit contributions
ie-0.010
ople-0.009
les-0.009
pa-0.009
el-0.009
Token help
Token ID1037
Feature activation-1.33e+00

Pos logit contributions
 sturdy+0.093
 boun+0.091
 flat+0.082
 sil+0.082
 suitable+0.079
Neg logit contributions
ie-0.114
pa-0.109
ople-0.105
el-0.105
les-0.102
Token you
Token ID345
Feature activation+2.02e+00

Pos logit contributions
ie+0.027
ople+0.026
pa+0.025
les+0.025
ing+0.024
Neg logit contributions
 sturdy-0.034
 boun-0.034
 flat-0.033
 sil-0.032
 cos-0.032
Token."
Token ID526
Feature activation+9.95e-01

Pos logit contributions
 boun+0.042
 sturdy+0.038
 sil+0.037
 spe+0.034
 cos+0.033
Neg logit contributions
ie-0.055
les-0.049
by-0.048
ing-0.048
en-0.047
Token
Token ID220
Feature activation-1.05e+00

Pos logit contributions
 sturdy+0.021
 boun+0.021
 flat+0.021
 cos+0.020
 exc+0.020
Neg logit contributions
ople-0.023
les-0.023
ie-0.023
pa-0.022
ipped-0.022
Token
Token ID198
Feature activation-8.13e-02

Pos logit contributions
ople+0.028
les+0.028
ie+0.028
pa+0.027
father+0.027
Neg logit contributions
 sturdy-0.018
 boun-0.018
 flat-0.017
 cos-0.017
 cozy-0.017
Token
Token ID198
Feature activation+1.06e+00

Pos logit contributions
ople+0.002
les+0.002
ie+0.002
pa+0.002
father+0.002
Neg logit contributions
 sturdy-0.001
 boun-0.001
 flat-0.001
 cos-0.001
 cozy-0.001
TokenL
Token ID43
Feature activation-6.21e+00

Pos logit contributions
onna+0.029
 sturdy+0.029
 flat+0.029
 boun+0.029
 hollow+0.028
Neg logit contributions
ie-0.018
les-0.017
ople-0.016
ing-0.015
ma-0.015
Tokenily
Token ID813
Feature activation-1.72e+00

Pos logit contributions
pa+0.155
ople+0.154
father+0.148
by+0.147
eled+0.143
Neg logit contributions
 exc-0.113
 boun-0.111
 sew-0.105
 safe-0.103
 sturdy-0.102
Token learned
Token ID4499
Feature activation-2.20e-01

Pos logit contributions
ople+0.037
les+0.037
ie+0.036
pa+0.036
father+0.035
Neg logit contributions
 sturdy-0.038
 boun-0.038
 flat-0.037
 cos-0.037
 cozy-0.037
Token a
Token ID257
Feature activation+4.07e+00

Pos logit contributions
ie+0.004
pa+0.004
les+0.004
ing+0.004
father+0.004
Neg logit contributions
 sturdy-0.005
 boun-0.005
 cos-0.005
 flat-0.005
 sil-0.005
Token in
Token ID287
Feature activation-2.69e-01

Pos logit contributions
 sturdy+0.020
onna+0.018
 cos+0.018
 flat+0.018
 boun+0.018
Neg logit contributions
ie-0.024
les-0.023
ople-0.023
ing-0.023
pa-0.023
Token the
Token ID262
Feature activation-6.95e-01

Pos logit contributions
ople+0.006
les+0.006
ie+0.006
pa+0.006
ing+0.006
Neg logit contributions
 boun-0.006
 sturdy-0.006
 flat-0.006
 cozy-0.006
 cos-0.006
Token park
Token ID3952
Feature activation+1.95e+00

Pos logit contributions
ople+0.020
les+0.019
ie+0.019
pa+0.019
ipped+0.019
Neg logit contributions
 sturdy-0.012
 boun-0.011
 flat-0.011
 cozy-0.011
 cos-0.011
Token with
Token ID351
Feature activation-5.78e-01

Pos logit contributions
 sturdy+0.034
 boun+0.033
 flat+0.032
 cos+0.032
 cozy+0.032
Neg logit contributions
ople-0.053
ie-0.052
les-0.052
pa-0.051
ing-0.050
Token their
Token ID511
Feature activation-3.46e-03

Pos logit contributions
ople+0.016
les+0.015
ie+0.014
ing+0.014
led+0.014
Neg logit contributions
 boun-0.012
 cozy-0.011
 sturdy-0.011
 cos-0.011
 flat-0.011
Token ball
Token ID2613
Feature activation+2.40e+00

Pos logit contributions
ople+0.000
les+0.000
ie+0.000
pa+0.000
father+0.000
Neg logit contributions
 boun-0.000
 sturdy-0.000
 flat-0.000
 cozy-0.000
 cos-0.000
Token.
Token ID13
Feature activation+2.35e+00

Pos logit contributions
 sturdy+0.045
 cos+0.043
 boun+0.043
 sil+0.042
 flat+0.042
Neg logit contributions
ie-0.063
les-0.062
ople-0.061
ing-0.059
pa-0.059
Token They
Token ID1119
Feature activation+1.54e+00

Pos logit contributions
 sturdy+0.044
 boun+0.041
 flat+0.041
 cos+0.040
 sil+0.040
Neg logit contributions
ople-0.060
les-0.058
ie-0.058
ed-0.056
ipped-0.056
Token liked
Token ID8288
Feature activation-9.87e-01

Pos logit contributions
 sturdy+0.036
onna+0.034
 cos+0.034
 boun+0.034
 flat+0.034
Neg logit contributions
ople-0.032
ie-0.032
les-0.031
pa-0.030
ed-0.030
Token to
Token ID284
Feature activation+3.15e+00

Pos logit contributions
ie+0.028
ople+0.028
les+0.027
pa+0.027
ing+0.027
Neg logit contributions
 sturdy-0.016
 cos-0.015
 boun-0.014
 flat-0.014
 sil-0.014
Token kick
Token ID4829
Feature activation-2.40e+00

Pos logit contributions
jar+0.059
 suitable+0.050
 sturdy+0.049
onna+0.048
 cos+0.047
Neg logit contributions
ie-0.084
les-0.079
pa-0.079
ople-0.076
en-0.073
Token to
Token ID284
Feature activation+5.64e-01

Pos logit contributions
 sturdy+0.043
 boun+0.042
 sil+0.042
 exc+0.041
 cos+0.041
Neg logit contributions
ie-0.060
pa-0.058
ople-0.057
ing-0.057
les-0.057
Token unite
Token ID24558
Feature activation+1.67e-01

Pos logit contributions
 boun+0.009
onna+0.008
 sil+0.008
 sturdy+0.008
 twitch+0.008
Neg logit contributions
ie-0.016
pa-0.016
ople-0.015
ma-0.015
father-0.015
Token.
Token ID13
Feature activation+1.18e+00

Pos logit contributions
 sturdy+0.003
 boun+0.003
 flat+0.003
 cos+0.003
 exc+0.003
Neg logit contributions
ie-0.004
pa-0.004
les-0.004
ople-0.004
ing-0.004
Token
Token ID198
Feature activation-3.31e-01

Pos logit contributions
 sturdy+0.018
 boun+0.018
 cos+0.017
 cozy+0.017
 flat+0.017
Neg logit contributions
les-0.035
ie-0.035
ople-0.034
pa-0.034
led-0.033
Token
Token ID198
Feature activation+8.33e-01

Pos logit contributions
ople+0.009
les+0.009
ie+0.009
pa+0.009
ipped+0.009
Neg logit contributions
 sturdy-0.005
 boun-0.005
 flat-0.005
 cos-0.005
 cozy-0.005
TokenAt
Token ID2953
Feature activation-1.84e+00

Pos logit contributions
 sturdy+0.016
 boun+0.016
 flat+0.016
 cos+0.016
 cozy+0.016
Neg logit contributions
ople-0.020
ie-0.020
les-0.019
father-0.019
ing-0.019
Token the
Token ID262
Feature activation-8.17e-01

Pos logit contributions
les+0.053
ople+0.051
led+0.050
ed+0.049
atted+0.049
Neg logit contributions
 flat-0.034
 safe-0.034
 cozy-0.033
 sturdy-0.033
 boun-0.032
Token airport
Token ID9003
Feature activation-4.44e-01

Pos logit contributions
ople+0.017
les+0.017
atted+0.017
ipped+0.017
led+0.017
Neg logit contributions
 sturdy-0.019
 flat-0.019
 cozy-0.019
 safe-0.019
 boun-0.019
Token,
Token ID11
Feature activation-7.14e-01

Pos logit contributions
les+0.013
father+0.012
ople+0.012
ipped+0.012
eled+0.012
Neg logit contributions
 safe-0.007
 sturdy-0.007
 flat-0.007
 cozy-0.007
 seek-0.007
Token they
Token ID484
Feature activation+4.31e-01

Pos logit contributions
les+0.021
ople+0.020
ie+0.020
pa+0.020
father+0.020
Neg logit contributions
 sturdy-0.012
 boun-0.012
 safe-0.012
 cozy-0.012
 flat-0.012
Token saw
Token ID2497
Feature activation-2.31e+00

Pos logit contributions
 sturdy+0.009
 boun+0.009
 flat+0.009
 cos+0.009
 cozy+0.009
Neg logit contributions
ople-0.010
les-0.010
ie-0.010
pa-0.010
father-0.009
Token the
Token ID262
Feature activation-1.44e+00

Pos logit contributions
ople+0.070
les+0.068
ipped+0.065
by+0.065
io+0.064
Neg logit contributions
 cozy-0.030
 boun-0.029
 sturdy-0.029
 safe-0.029
 wrap-0.028
Token figure
Token ID3785
Feature activation+1.86e+00

Pos logit contributions
ople+0.046
les+0.045
ie+0.045
pa+0.044
ipped+0.043
Neg logit contributions
 sturdy-0.019
 boun-0.019
 flat-0.018
 cozy-0.017
 cos-0.017
Token again
Token ID757
Feature activation+1.68e-01

Pos logit contributions
 sturdy+0.038
 boun+0.037
 flat+0.036
 cos+0.036
 cozy+0.035
Neg logit contributions
ople-0.044
les-0.044
ie-0.044
pa-0.043
ing-0.042
Token.
Token ID13
Feature activation+2.41e+00

Pos logit contributions
 sturdy+0.003
 boun+0.003
 flat+0.003
 cos+0.003
 cozy+0.003
Neg logit contributions
ople-0.005
les-0.005
ie-0.005
pa-0.004
father-0.004
Token This
Token ID770
Feature activation-3.27e+00

Pos logit contributions
 sturdy+0.049
 boun+0.048
 seek+0.046
 exc+0.046
 flat+0.046
Neg logit contributions
ople-0.056
ed-0.055
ie-0.053
ing-0.053
les-0.053
Token time
Token ID640
Feature activation-2.31e+00

Pos logit contributions
ople+0.089
les+0.082
ipped+0.081
ie+0.081
led+0.081
Neg logit contributions
 sturdy-0.063
 wrap-0.063
 boun-0.063
 flat-0.061
 cozy-0.060
Token,
Token ID11
Feature activation+4.87e-01

Pos logit contributions
ople+0.058
les+0.057
ie+0.055
pa+0.055
father+0.055
Neg logit contributions
 sturdy-0.046
 boun-0.046
 flat-0.045
 cozy-0.044
 cos-0.044
Token she
Token ID673
Feature activation-9.57e-01

Pos logit contributions
 sturdy+0.008
 boun+0.008
 flat+0.008
 cozy+0.008
 cos+0.008
Neg logit contributions
ople-0.014
les-0.014
ie-0.014
pa-0.013
father-0.013
Token got
Token ID1392
Feature activation+9.47e-01

Pos logit contributions
ople+0.022
les+0.021
ie+0.021
pa+0.021
father+0.020
Neg logit contributions
 sturdy-0.021
 boun-0.021
 flat-0.020
 cos-0.020
 cozy-0.020
Token brave
Token ID14802
Feature activation+5.90e-01

Pos logit contributions
 sturdy+0.016
 boun+0.016
 flat+0.015
 cos+0.015
 sil+0.015
Neg logit contributions
ople-0.026
ie-0.026
les-0.026
pa-0.025
father-0.025
Token and
Token ID290
Feature activation+2.09e+00

Pos logit contributions
 boun+0.010
 sturdy+0.010
 flat+0.010
 cozy+0.010
 cos+0.010
Neg logit contributions
ople-0.016
ie-0.015
les-0.015
father-0.015
pa-0.015
Tokenweb
Token ID12384
Feature activation+3.18e+00

Pos logit contributions
ople+0.013
ie+0.013
les+0.013
pa+0.012
ing+0.012
Neg logit contributions
 sturdy-0.012
 boun-0.012
 flat-0.012
 cos-0.012
 cozy-0.012
Token in
Token ID287
Feature activation-4.92e-01

Pos logit contributions
 sturdy+0.069
 exc+0.067
 cozy+0.066
 cos+0.065
 boun+0.065
Neg logit contributions
ie-0.074
les-0.073
ing-0.072
ople-0.068
ed-0.068
Token the
Token ID262
Feature activation-1.22e+00

Pos logit contributions
ople+0.011
les+0.011
ie+0.011
pa+0.011
ipped+0.011
Neg logit contributions
 sturdy-0.011
 boun-0.011
 flat-0.010
 cozy-0.010
 cos-0.010
Token corner
Token ID5228
Feature activation-4.36e-01

Pos logit contributions
ople+0.030
ie+0.030
les+0.030
pa+0.029
father+0.028
Neg logit contributions
 sturdy-0.024
 boun-0.024
 flat-0.023
 exc-0.023
 cos-0.022
Token of
Token ID286
Feature activation-1.88e+00

Pos logit contributions
ople+0.012
les+0.012
ie+0.011
pa+0.011
by+0.011
Neg logit contributions
 sturdy-0.008
 boun-0.007
 flat-0.007
 cos-0.007
 safe-0.007
Token the
Token ID262
Feature activation-1.71e+00

Pos logit contributions
ople+0.050
les+0.049
led+0.047
atted+0.047
ipped+0.047
Neg logit contributions
 sturdy-0.036
 flat-0.036
 cozy-0.035
 boun-0.035
 wrap-0.035
Token garden
Token ID11376
Feature activation+1.03e+00

Pos logit contributions
ie+0.043
ople+0.043
les+0.043
ing+0.041
pa+0.041
Neg logit contributions
 sturdy-0.031
 boun-0.031
 exc-0.030
 flat-0.030
onna-0.029
Token.
Token ID13
Feature activation+9.27e-01

Pos logit contributions
 sturdy+0.018
 boun+0.017
 flat+0.017
 cos+0.016
 cozy+0.016
Neg logit contributions
ople-0.028
les-0.028
ie-0.028
pa-0.027
ing-0.027
Token He
Token ID679
Feature activation-1.83e-01

Pos logit contributions
 sturdy+0.019
 exc+0.019
 boun+0.018
 flat+0.018
 seek+0.018
Neg logit contributions
ople-0.021
les-0.021
ed-0.020
ing-0.020
ipped-0.020
Token was
Token ID373
Feature activation+4.14e+00

Pos logit contributions
ople+0.004
ie+0.004
les+0.004
pa+0.004
father+0.004
Neg logit contributions
 sturdy-0.004
 boun-0.004
 flat-0.004
 cos-0.004
 cozy-0.004
Token very
Token ID845
Feature activation+4.37e+00

Pos logit contributions
onna+0.071
 sturdy+0.068
 sew+0.067
 opener+0.066
 knit+0.065
Neg logit contributions
ed-0.102
ople-0.101
ing-0.101
atted-0.100
eled-0.099
Token save
Token ID3613
Feature activation-3.32e+00

Pos logit contributions
 sturdy+0.049
 sil+0.047
 boun+0.046
 flat+0.046
 cos+0.045
Neg logit contributions
ie-0.082
ople-0.073
pa-0.072
el-0.071
les-0.071
Token the
Token ID262
Feature activation-2.81e+00

Pos logit contributions
ople+0.094
father+0.090
les+0.090
ipped+0.089
pa+0.089
Neg logit contributions
 safe-0.059
 flat-0.058
 sturdy-0.058
 boun-0.056
 cozy-0.056
Token lizard
Token ID42406
Feature activation+7.56e-01

Pos logit contributions
ople+0.077
pa+0.074
les+0.073
father+0.072
ipped+0.072
Neg logit contributions
 boun-0.053
 sturdy-0.050
 cozy-0.050
 flat-0.050
 exc-0.049
Token,
Token ID11
Feature activation+2.71e+00

Pos logit contributions
 sturdy+0.015
 sil+0.015
 boun+0.015
 cos+0.014
 flat+0.014
Neg logit contributions
ie-0.019
les-0.018
ople-0.018
ing-0.018
ed-0.017
Token but
Token ID475
Feature activation-1.14e+00

Pos logit contributions
 cos+0.045
 sil+0.045
 sturdy+0.044
 boun+0.043
 seek+0.042
Neg logit contributions
ie-0.068
ople-0.067
les-0.065
pa-0.065
atted-0.064
Token they
Token ID484
Feature activation-1.38e+00

Pos logit contributions
ople+0.025
ie+0.025
les+0.024
pa+0.024
father+0.023
Neg logit contributions
 sturdy-0.026
 boun-0.025
 flat-0.025
 cos-0.025
 sil-0.024
Token were
Token ID547
Feature activation+1.11e+00

Pos logit contributions
ople+0.030
les+0.030
ie+0.029
pa+0.029
father+0.028
Neg logit contributions
 sturdy-0.031
 boun-0.031
 flat-0.031
 cos-0.030
 cozy-0.030
Token too
Token ID1165
Feature activation+2.09e+00

Pos logit contributions
 sew+0.026
 sil+0.025
 knit+0.025
 sturdy+0.025
 cos+0.024
Neg logit contributions
ie-0.023
ople-0.022
led-0.022
atted-0.022
ing-0.022
Token slow
Token ID3105
Feature activation+5.23e-01

Pos logit contributions
 sew+0.031
 wrap+0.030
 knit+0.030
 sturdy+0.030
 boun+0.030
Neg logit contributions
ing-0.059
ie-0.058
les-0.057
by-0.056
ed-0.056
Token.
Token ID13
Feature activation+8.00e-01

Pos logit contributions
 sturdy+0.008
 boun+0.007
 flat+0.007
 cozy+0.007
 cos+0.007
Neg logit contributions
ople-0.016
les-0.015
ie-0.015
pa-0.015
father-0.015
Token The
Token ID383
Feature activation-3.00e+00

Pos logit contributions
 sturdy+0.016
 boun+0.016
 cos+0.015
 sil+0.015
 flat+0.015
Neg logit contributions
ople-0.020
les-0.019
ie-0.019
ed-0.019
ing-0.018
Token
Token ID220
Feature activation-1.32e+00

Pos logit contributions
 sturdy+0.041
 boun+0.040
 flat+0.038
 exc+0.038
 sil+0.038
Neg logit contributions
ople-0.043
les-0.043
ie-0.042
ed-0.041
pa-0.040
Token
Token ID198
Feature activation+3.75e-01

Pos logit contributions
ople+0.042
les+0.041
ie+0.040
pa+0.040
ipped+0.039
Neg logit contributions
 cozy-0.017
 sturdy-0.017
 flat-0.016
 exc-0.016
 boun-0.016
Token
Token ID198
Feature activation+1.44e+00

Pos logit contributions
 sturdy+0.006
 boun+0.006
 flat+0.006
 cozy+0.006
 cos+0.006
Neg logit contributions
ople-0.011
les-0.010
ie-0.010
pa-0.010
father-0.010
TokenL
Token ID43
Feature activation-5.63e+00

Pos logit contributions
 sturdy+0.037
onna+0.037
 flat+0.036
 boun+0.036
 cos+0.035
Neg logit contributions
les-0.026
ie-0.026
ing-0.024
ople-0.024
ed-0.022
Tokenily
Token ID813
Feature activation-1.36e+00

Pos logit contributions
pa+0.138
ople+0.137
father+0.135
by+0.133
eled+0.130
Neg logit contributions
 exc-0.105
 boun-0.099
 wrap-0.096
 rake-0.096
 sew-0.095
Token and
Token ID290
Feature activation-1.90e+00

Pos logit contributions
ople+0.031
les+0.031
ie+0.030
pa+0.030
father+0.030
Neg logit contributions
 sturdy-0.029
 boun-0.028
 cos-0.028
 cozy-0.028
 flat-0.028
Token her
Token ID607
Feature activation-2.04e+00

Pos logit contributions
les+0.051
ople+0.050
ing+0.048
led+0.048
ed+0.047
Neg logit contributions
 wrap-0.035
 sew-0.035
 boun-0.035
 exc-0.034
 cozy-0.034
Token babys
Token ID46711
Feature activation-2.95e+00

Pos logit contributions
les+0.054
ople+0.054
ie+0.051
led+0.050
ing+0.050
Neg logit contributions
 boun-0.038
 cozy-0.037
 exc-0.037
 cos-0.036
 flat-0.036
Tokenitter
Token ID1967
Feature activation-9.53e-01

Pos logit contributions
les+0.067
ople+0.063
pa+0.061
led+0.060
eled+0.059
Neg logit contributions
 sew-0.061
 rake-0.060
 exc-0.059
 sturdy-0.059
 boun-0.059
Token went
Token ID1816
Feature activation+1.41e+00

Pos logit contributions
les+0.021
ople+0.020
father+0.020
 End+0.019
pa+0.019
Neg logit contributions
 wrap-0.019
 rake-0.018
 safe-0.018
 sew-0.018
 sturdy-0.018
Token to
Token ID284
Feature activation+1.07e+00

Pos logit contributions
 sturdy+0.029
 sil+0.028
 exc+0.027
 cos+0.027
 boun+0.026
Neg logit contributions
ie-0.035
ing-0.035
les-0.034
ed-0.034
ople-0.033
Token mom
Token ID1995
Feature activation-2.60e+00

Pos logit contributions
ople+0.009
les+0.009
ie+0.009
ing+0.009
pa+0.009
Neg logit contributions
 boun-0.007
 sturdy-0.007
 cozy-0.007
 flat-0.007
 cos-0.007
Tokenmy
Token ID1820
Feature activation-2.01e+00

Pos logit contributions
ople+0.061
les+0.059
ays+0.059
ipped+0.058
io+0.057
Neg logit contributions
 wrap-0.050
 safe-0.049
 sturdy-0.049
 sew-0.048
 boun-0.048
Token said
Token ID531
Feature activation-1.20e+00

Pos logit contributions
ople+0.052
les+0.050
ie+0.049
pa+0.049
ipped+0.049
Neg logit contributions
 sturdy-0.036
 boun-0.036
 flat-0.035
 cos-0.035
 cozy-0.035
Token,
Token ID11
Feature activation+1.15e+00

Pos logit contributions
ie+0.026
ople+0.025
les+0.025
ing+0.025
pa+0.025
Neg logit contributions
 sturdy-0.028
 boun-0.027
 cos-0.026
 flat-0.026
 cozy-0.026
Token "
Token ID366
Feature activation-1.04e+00

Pos logit contributions
 sturdy+0.022
 boun+0.021
 cos+0.020
 flat+0.019
 seek+0.019
Neg logit contributions
ing-0.030
ed-0.029
ie-0.028
pa-0.027
ople-0.027
TokenL
Token ID43
Feature activation-6.65e+00

Pos logit contributions
ople+0.029
les+0.029
pa+0.029
ie+0.029
father+0.028
Neg logit contributions
 sturdy-0.016
 boun-0.016
 flat-0.016
 sil-0.016
 cos-0.015
Tokenily
Token ID813
Feature activation-3.15e+00

Pos logit contributions
ople+0.156
pa+0.154
eled+0.152
by+0.151
father+0.148
Neg logit contributions
 boun-0.125
 exc-0.124
 sew-0.123
onna-0.121
 rake-0.121
Token,
Token ID11
Feature activation-2.13e-01

Pos logit contributions
ople+0.080
les+0.079
father+0.076
ie+0.076
ipped+0.076
Neg logit contributions
 boun-0.058
 sturdy-0.057
 safe-0.056
 flat-0.056
 cozy-0.056
Token I
Token ID314
Feature activation-4.22e-01

Pos logit contributions
ie+0.005
ople+0.005
les+0.005
pa+0.005
ing+0.005
Neg logit contributions
 sturdy-0.005
 boun-0.004
 flat-0.004
 cos-0.004
 sil-0.004
Token told
Token ID1297
Feature activation-3.83e+00

Pos logit contributions
ople+0.011
ie+0.011
les+0.011
pa+0.011
father+0.011
Neg logit contributions
 sturdy-0.007
 boun-0.007
 flat-0.007
 cos-0.007
 sil-0.007
Token you
Token ID345
Feature activation-2.15e+00

Pos logit contributions
les+0.118
led+0.117
eled+0.115
ed+0.113
father+0.112
Neg logit contributions
 safe-0.056
 cozy-0.055
 clean-0.052
 sew-0.052
 bend-0.051
TokenThe
Token ID464
Feature activation-1.88e+00

Pos logit contributions
ople+0.013
les+0.013
ie+0.013
ipped+0.013
ing+0.013
Neg logit contributions
 sturdy-0.012
 boun-0.011
 flat-0.011
 cozy-0.011
 cos-0.011
Token wild
Token ID4295
Feature activation-5.19e+00

Pos logit contributions
les+0.049
ople+0.047
 End+0.046
ing+0.045
ie+0.045
Neg logit contributions
onna-0.034
 sturdy-0.033
 cos-0.032
 boun-0.031
 sil-0.031
Token wild
Token ID4295
Feature activation-5.07e+00

Pos logit contributions
les+0.139
ople+0.134
ie+0.132
ing+0.131
pa+0.128
Neg logit contributions
 sturdy-0.097
 cozy-0.091
onna-0.091
 boun-0.091
 cos-0.090
Tokener
Token ID263
Feature activation-2.56e+00

Pos logit contributions
les+0.135
ople+0.131
ie+0.129
ing+0.127
pa+0.126
Neg logit contributions
 sturdy-0.095
 cozy-0.090
 boun-0.090
 flat-0.089
onna-0.089
Tokenbe
Token ID1350
Feature activation-4.20e+00

Pos logit contributions
ople+0.062
les+0.061
ie+0.061
pa+0.060
father+0.059
Neg logit contributions
 sturdy-0.051
 boun-0.050
 flat-0.049
 cos-0.048
 exc-0.048
Tokenast
Token ID459
Feature activation-5.95e+00

Pos logit contributions
father+0.084
ople+0.082
ays+0.080
ted+0.080
mother+0.075
Neg logit contributions
 exc-0.091
 safe-0.089
 sturdy-0.088
 fit-0.087
 tie-0.087
Token had
Token ID550
Feature activation-7.11e-01

Pos logit contributions
father+0.097
ople+0.094
pa+0.093
by+0.087
ie+0.086
Neg logit contributions
 wrap-0.159
 flat-0.157
 sturdy-0.155
 tie-0.152
 sew-0.152
Token to
Token ID284
Feature activation+1.16e+00

Pos logit contributions
ed+0.015
les+0.015
atted+0.014
el+0.014
pa+0.014
Neg logit contributions
jar-0.015
 exc-0.014
 boun-0.014
 cos-0.014
 sturdy-0.014
Token think
Token ID892
Feature activation-1.73e+00

Pos logit contributions
 boun+0.017
 sturdy+0.016
 flat+0.016
 sil+0.015
 cozy+0.015
Neg logit contributions
ie-0.035
pa-0.034
ople-0.033
les-0.033
father-0.032
Token for
Token ID329
Feature activation-2.70e+00

Pos logit contributions
ople+0.045
les+0.044
ie+0.044
pa+0.043
father+0.042
Neg logit contributions
 sturdy-0.032
 boun-0.032
 flat-0.031
 cos-0.030
 cozy-0.030
Token a
Token ID257
Feature activation+3.99e+00

Pos logit contributions
ople+0.079
les+0.078
ie+0.076
ipped+0.074
ing+0.074
Neg logit contributions
 sturdy-0.044
 boun-0.042
 cozy-0.042
 cos-0.042
 flat-0.042
Token sweet
Token ID6029
Feature activation-3.77e+00

Pos logit contributions
les+0.042
ie+0.042
ople+0.042
ing+0.040
pa+0.040
Neg logit contributions
 sturdy-0.035
 flat-0.034
 boun-0.034
 cos-0.033
 exc-0.033
Token strawberries
Token ID36973
Feature activation+3.15e-01

Pos logit contributions
ople+0.072
father+0.072
pa+0.066
ipped+0.066
atted+0.065
Neg logit contributions
 wrap-0.096
 boun-0.095
 cozy-0.092
 cos-0.092
 rake-0.091
Token!"
Token ID2474
Feature activation-5.43e-01

Pos logit contributions
 sturdy+0.006
 boun+0.005
 sil+0.005
 flat+0.005
 cos+0.005
Neg logit contributions
ie-0.009
ople-0.008
les-0.008
ing-0.008
pa-0.008
Token
Token ID198
Feature activation-1.71e+00

Pos logit contributions
ople+0.013
les+0.012
ie+0.012
pa+0.012
ipped+0.012
Neg logit contributions
 sturdy-0.012
 boun-0.011
 flat-0.011
 cos-0.011
 cozy-0.011
Token
Token ID198
Feature activation-6.63e-01

Pos logit contributions
les+0.042
ie+0.042
ople+0.042
pa+0.041
ing+0.040
Neg logit contributions
 sturdy-0.034
 boun-0.033
 flat-0.032
 cos-0.032
 cozy-0.031
TokenS
Token ID50
Feature activation-6.80e+00

Pos logit contributions
les+0.013
ople+0.013
pa+0.013
ie+0.013
ing+0.012
Neg logit contributions
 boun-0.016
 sturdy-0.016
 flat-0.016
 cos-0.016
 cozy-0.016
Tokenaman
Token ID10546
Feature activation-5.16e+00

Pos logit contributions
ople+0.143
les+0.136
father+0.134
ie+0.132
led+0.131
Neg logit contributions
 sturdy-0.145
 boun-0.141
 sil-0.140
 leash-0.139
 flat-0.138
Tokentha
Token ID12898
Feature activation-3.75e+00

Pos logit contributions
les+0.128
father+0.114
ie+0.112
ople+0.110
ays+0.108
Neg logit contributions
 sturdy-0.104
 boun-0.102
 safe-0.101
 flat-0.100
 leash-0.097
Token and
Token ID290
Feature activation-2.06e+00

Pos logit contributions
ople+0.099
les+0.097
ie+0.096
father+0.096
pa+0.096
Neg logit contributions
 sturdy-0.064
 boun-0.064
 flat-0.061
 cos-0.061
 sil-0.061
Token Dan
Token ID6035
Feature activation-1.52e+00

Pos logit contributions
les+0.049
ople+0.047
led+0.047
ie+0.046
ipped+0.043
Neg logit contributions
 leash-0.043
 exc-0.043
 tie-0.043
 swing-0.042
 rake-0.042
Token thought
Token ID1807
Feature activation-1.39e+00

Pos logit contributions
ople+0.036
les+0.035
father+0.035
pa+0.035
ie+0.034
Neg logit contributions
 sturdy-0.030
 boun-0.030
 wrap-0.029
 cos-0.029
 exc-0.029
Token.
Token ID13
Feature activation+1.59e+00

Pos logit contributions
father+0.122
ipped+0.121
ople+0.120
atted+0.120
iness+0.119
Neg logit contributions
 boun-0.058
 tag-0.056
 wrap-0.055
 swing-0.054
 branch-0.053
Token
Token ID198
Feature activation+1.95e-01

Pos logit contributions
 sturdy+0.028
 boun+0.027
 flat+0.027
 cos+0.026
 cozy+0.026
Neg logit contributions
ople-0.043
les-0.042
ie-0.042
pa-0.041
father-0.040
Token
Token ID198
Feature activation+1.18e+00

Pos logit contributions
 sturdy+0.003
 boun+0.003
 cos+0.003
 cozy+0.003
 flat+0.003
Neg logit contributions
ople-0.006
les-0.006
ie-0.006
led-0.005
pa-0.005
TokenSuddenly
Token ID38582
Feature activation-2.35e+00

Pos logit contributions
 sturdy+0.023
 boun+0.022
 flat+0.022
 cos+0.021
 cozy+0.021
Neg logit contributions
ople-0.030
ie-0.029
les-0.029
pa-0.029
father-0.029
Token,
Token ID11
Feature activation-3.57e-01

Pos logit contributions
ople+0.062
les+0.061
ie+0.059
father+0.059
pa+0.058
Neg logit contributions
 sturdy-0.043
 boun-0.042
 flat-0.041
 cos-0.041
 cozy-0.041
Token M
Token ID337
Feature activation-5.67e+00

Pos logit contributions
les+0.010
ople+0.010
ie+0.009
ipped+0.009
atted+0.009
Neg logit contributions
 sturdy-0.006
 boun-0.006
 flat-0.006
 cozy-0.006
 cos-0.006
Tokenilly
Token ID6548
Feature activation-1.61e+00

Pos logit contributions
ople+0.078
les+0.076
led+0.071
ed+0.069
father+0.068
Neg logit contributions
onna-0.167
 rake-0.162
 boun-0.160
 sil-0.159
 exc-0.159
Token felt
Token ID2936
Feature activation+2.43e+00

Pos logit contributions
ople+0.037
ie+0.036
les+0.036
pa+0.035
father+0.035
Neg logit contributions
 sturdy-0.034
 boun-0.034
 flat-0.033
 cos-0.033
 cozy-0.033
Token someone
Token ID2130
Feature activation+1.51e+00

Pos logit contributions
 sturdy+0.046
 boun+0.045
 flat+0.044
 exc+0.043
 cos+0.043
Neg logit contributions
les-0.061
ople-0.060
ie-0.060
pa-0.058
ing-0.057
Token grab
Token ID5552
Feature activation-2.60e+00

Pos logit contributions
 sturdy+0.029
 boun+0.028
onna+0.028
 flat+0.027
 sil+0.027
Neg logit contributions
ie-0.040
les-0.038
ing-0.037
ople-0.036
pa-0.036
Token her
Token ID607
Feature activation+9.22e-02

Pos logit contributions
ople+0.074
father+0.071
les+0.070
pa+0.070
ie+0.069
Neg logit contributions
 boun-0.044
 flat-0.044
 safe-0.043
 sturdy-0.043
 wrap-0.043
Token shiny
Token ID22441
Feature activation+9.27e-01

Pos logit contributions
 leash+0.048
 stret+0.043
 boun+0.040
jar+0.039
 hollow+0.039
Neg logit contributions
ie-0.104
les-0.099
ing-0.096
led-0.094
by-0.093
Token again
Token ID757
Feature activation-1.19e-01

Pos logit contributions
 sturdy+0.016
 leash+0.015
 boun+0.015
 cos+0.015
 exc+0.015
Neg logit contributions
les-0.025
ie-0.025
ing-0.024
ople-0.024
ed-0.023
Token."
Token ID526
Feature activation+9.53e-01

Pos logit contributions
ie+0.004
ople+0.004
les+0.004
pa+0.004
ing+0.003
Neg logit contributions
 boun-0.002
 sturdy-0.002
 flat-0.002
 cos-0.002
 sil-0.001
Token
Token ID198
Feature activation-3.09e-01

Pos logit contributions
 sturdy+0.019
 boun+0.019
 flat+0.019
 cos+0.018
 sil+0.018
Neg logit contributions
ople-0.023
les-0.023
ie-0.023
pa-0.022
ipped-0.022
Token
Token ID198
Feature activation+9.61e-01

Pos logit contributions
ople+0.009
les+0.009
ie+0.009
pa+0.008
father+0.008
Neg logit contributions
 sturdy-0.005
 boun-0.005
 flat-0.005
 cozy-0.005
 cos-0.005
TokenL
Token ID43
Feature activation-6.01e+00

Pos logit contributions
onna+0.025
 sturdy+0.025
 flat+0.024
 spe+0.024
 hollow+0.024
Neg logit contributions
ie-0.018
les-0.018
ople-0.017
ing-0.017
led-0.016
Tokenily
Token ID813
Feature activation-1.48e+00

Pos logit contributions
pa+0.148
ople+0.147
father+0.143
by+0.141
eled+0.136
Neg logit contributions
 exc-0.116
 boun-0.113
 sturdy-0.105
 sew-0.105
 wrap-0.104
Token was
Token ID373
Feature activation+3.38e+00

Pos logit contributions
ople+0.032
les+0.032
ie+0.032
pa+0.031
father+0.030
Neg logit contributions
 sturdy-0.034
 boun-0.033
 flat-0.033
 cos-0.032
 cozy-0.032
Token happy
Token ID3772
Feature activation+1.49e+00

Pos logit contributions
onna+0.069
 opener+0.064
 sturdy+0.062
 leash+0.062
jar+0.061
Neg logit contributions
 End-0.079
atted-0.076
ople-0.076
ed-0.075
ie-0.075
Token to
Token ID284
Feature activation+2.22e+00

Pos logit contributions
 sturdy+0.029
 boun+0.027
 flat+0.027
 cos+0.026
 sil+0.026
Neg logit contributions
ie-0.039
ople-0.038
les-0.038
pa-0.038
ing-0.037
Token help
Token ID1037
Feature activation-1.84e+00

Pos logit contributions
 boun+0.035
 sturdy+0.035
 cos+0.034
 sil+0.032
 flat+0.032
Neg logit contributions
ie-0.064
pa-0.062
les-0.060
ipped-0.060
el-0.060