TOP ACTIVATIONS
MAX = 0.089

she heard a knock on the door . She ran to
saw a man sitting on the sidewalk . The man looked
he heard a knock on the door . It was his
a poor man sitting on the street .
can . Just knock on the door . I 'll be
heard their mom knock on the door . "
saw her dad sitting on the couch . She looked up
tiger heard a knock on the door . When he answered
the truck and knock on the door . Their mom opened
the building and knock on the door . Mr . Jones
she heard a knock on the door . She went to
. I can go wherever I want and do whatever I
try to figure out what I can do with it ."
to herself , " should I follow the path ?" She
he heard a knock on the door . He slowly opened
She heard a knock on the door . She opened it
's house and knock on the door . " Mail for
saw a man standing on the corner . He was selling
, " Come see what I can do !" Toby ran
a homeless man sitting on the side of the road .
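
Each context above appears to be a window of five tokens on either side of the highest-activating token (here usually "the"). The following is a minimal sketch of how such windows could be extracted, not the dashboard's actual code. The function name `top_activation_windows` and the parameters `k` and `radius` are hypothetical. The example activations are copied from the per-token table for the first context.

```python
from typing import List, Tuple


def top_activation_windows(
    tokens: List[str],
    activations: List[float],
    k: int = 3,
    radius: int = 5,
) -> List[Tuple[float, str]]:
    """Return the k highest activations, each with its surrounding token window."""
    # Rank positions from highest to lowest activation.
    order = sorted(range(len(tokens)), key=lambda i: activations[i], reverse=True)
    windows = []
    for i in order[:k]:
        lo = max(0, i - radius)                 # clamp at the start of the sequence
        hi = min(len(tokens), i + radius + 1)   # clamp at the end of the sequence
        windows.append((activations[i], " ".join(tokens[lo:hi])))
    return windows


# Values for the first listed context, taken from the per-token table.
tokens = ["she", "heard", "a", "knock", "on", "the", "door", ".", "She", "ran", "to"]
activations = [0.0158, 0.0150, -0.0165, 0.00288, 0.0271, 0.0888,
               0.0276, 0.0347, 0.00824, 0.0419, 0.0360]

top = top_activation_windows(tokens, activations, k=1)
print(top)
```

The radius of 5 is inferred from the listed contexts rather than stated anywhere in the dump, so treat it as an assumption.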

MIN ACTIVATIONS
MIN = -0.095

and proud of himself . Because he was persistent , he
still have fun . But most of all , they have
of her toys . But most of all , she was
if you 're nervous ! Because you can end up with
has different tastes . But most importantly , she learned to
. But most importantly , they discussed what
you need it . And most importantly , she learned that
It 's okay , my de ars . I know you
would still be beautiful . Because beauty is not about what
he is so happy . Because he was an intelligent bear
's courage . The two de ers became good friends after
are not flowers , my de ars . Those are caul
to help other people , because it makes them happy .
were safe and happy . Because Bloom warned them , they
special shell with him . Because it was always a reminder
was a dream , my de ars ," she said .
gave him one banana . Because of this , he was
" Thank you , my de ars . I appreciate your
woods . But most of all , Sam enjoyed
, the farmer in the de ll ". Suddenly , he

INTERVAL 0.043 - 0.089
CONTAINS 3.678%

that birds can fly in their own special way and that
Lily cry . He knew he was wrong .
scissors and went back to the garden . But nothing felt
and dad took her to the hospital . When
Once upon a time , there was a little boy named

INTERVAL -0.003 - 0.043
CONTAINS 52.197%

red balloon . It was floating in the air , tied
saw the train and wanted to help .
around . One day , the youngest duck ling saw a
clothes gl owed brilliantly in the sunshine , just like the
had a problem . He came across a big , deep

INTERVAL -0.049 - -0.003
CONTAINS 42.469%

she kept moving . Suddenly a big , loud noise made
. " Okay , you can try it ,"
star . In a flash , her wish was granted !
They climbed and climbed , until they reached the top of
in our family ." L ily smiled and felt

INTERVAL -0.095 - -0.049
CONTAINS 1.656%

a time , there was a girl called Dawn . One
" What is it , Jake ?" Tom asked .
soft fur . Lily was so happy to have her t
and Dad . They are glad they are not crocod ile
upon a time there was a big , red bird .
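
The four intervals above split the range from MIN (-0.095) to MAX (0.089) into equal-width bins of 0.046, and the CONTAINS percentages sum to 100%. A minimal sketch of that binning follows, assuming each percentage is the share of activations falling in the bin. The function name `interval_shares` is hypothetical and the sample values are made up for illustration; they are not data from this feature.

```python
from typing import List, Tuple


def interval_shares(
    values: List[float], n_bins: int = 4
) -> List[Tuple[Tuple[float, float], float]]:
    """Split [min, max] into equal-width bins; return (bounds, percent) per bin."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / n_bins
    if width == 0:
        raise ValueError("all values are identical; cannot form intervals")
    counts = [0] * n_bins
    for v in values:
        # The maximum value would land in bin n_bins, so clamp it into the last bin.
        idx = min(int((v - lo) / width), n_bins - 1)
        counts[idx] += 1
    total = len(values)
    return [
        ((lo + b * width, lo + (b + 1) * width), 100.0 * counts[b] / total)
        for b in range(n_bins)
    ]


# Made-up activations spanning the same MIN/MAX as the dump above.
sample = [-0.095, -0.02, -0.01, 0.01, 0.02, 0.03, 0.05, 0.089]
shares = interval_shares(sample)
for (low, high), pct in shares:
    print(f"INTERVAL {low:.3f} - {high:.3f}  CONTAINS {pct:.3f}%")
```

The sketch lists bins from lowest to highest, whereas the dump lists them from highest to lowest.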
Token she
Token ID673
Feature activation+1.58e-02

Pos logit contributions
mmm+0.022
ww+0.021
 importantly+0.020
www+0.019
dd+0.017
Neg logit contributions
 asleep-0.030
 driveway-0.030
robe-0.030
seek-0.029
 bank-0.028
Token heard
Token ID2982
Feature activation+1.50e-02

Pos logit contributions
 asleep+0.026
 driveway+0.026
robe+0.025
seek+0.024
 bank+0.024
Neg logit contributions
ww-0.032
mmm-0.032
 importantly-0.030
www-0.030
dd-0.028
Token a
Token ID257
Feature activation-1.65e-02

Pos logit contributions
 driveway+0.026
 asleep+0.025
robe+0.025
 bank+0.024
seek+0.024
Neg logit contributions
mmm-0.030
ww-0.030
 importantly-0.028
www-0.028
dd-0.025
Token knock
Token ID10643
Feature activation+2.88e-03

Pos logit contributions
mmm+0.030
ww+0.027
 importantly+0.026
dd+0.025
www+0.025
Neg logit contributions
 asleep-0.032
 driveway-0.032
 edge-0.031
robe-0.031
 bank-0.031
Token on
Token ID319
Feature activation+2.71e-02

Pos logit contributions
 asleep+0.004
 driveway+0.004
robe+0.004
seek+0.004
 bank+0.004
Neg logit contributions
mmm-0.007
ww-0.007
 importantly-0.006
www-0.006
dd-0.006
Token the
Token ID262
Feature activation+8.88e-02

Pos logit contributions
robe+0.062
 driveway+0.061
pit+0.060
 asleep+0.059
seek+0.059
Neg logit contributions
ww-0.036
mmm-0.036
www-0.033
 importantly-0.031
dd-0.030
Token door
Token ID3420
Feature activation+2.76e-02

Pos logit contributions
seek+0.116
ASH+0.106
izons+0.105
 asleep+0.104
pit+0.104
Neg logit contributions
ww-0.185
mmm-0.177
www-0.174
 importantly-0.174
 hour-0.166
Token.
Token ID13
Feature activation+3.47e-02

Pos logit contributions
 asleep+0.050
 driveway+0.049
robe+0.049
 bank+0.047
seek+0.047
Neg logit contributions
mmm-0.053
ww-0.051
 importantly-0.049
www-0.047
dd-0.042
Token She
Token ID1375
Feature activation+8.24e-03

Pos logit contributions
 asleep+0.066
seek+0.065
 driveway+0.065
pit+0.065
 bank+0.065
Neg logit contributions
mmm-0.059
ww-0.057
www-0.053
 importantly-0.053
 gust-0.047
Token ran
Token ID4966
Feature activation+4.19e-02

Pos logit contributions
 driveway+0.011
 asleep+0.011
robe+0.010
 bank+0.010
stick+0.010
Neg logit contributions
mmm-0.020
ww-0.019
 importantly-0.019
www-0.018
dd-0.017
Token to
Token ID284
Feature activation+3.60e-02

Pos logit contributions
robe+0.053
 asleep+0.052
 driveway+0.051
 bank+0.050
pit+0.048
Neg logit contributions
ww-0.098
mmm-0.096
www-0.094
 importantly-0.090
dd-0.086
Token saw
Token ID2497
Feature activation+3.34e-03

Pos logit contributions
mmm+0.003
ww+0.003
 importantly+0.003
www+0.003
dd+0.003
Neg logit contributions
seek-0.004
robe-0.004
 asleep-0.004
ASH-0.004
 driveway-0.004
Token a
Token ID257
Feature activation-2.13e-02

Pos logit contributions
 asleep+0.006
 driveway+0.006
robe+0.006
seek+0.006
 bank+0.006
Neg logit contributions
mmm-0.006
ww-0.006
 importantly-0.006
www-0.006
dd-0.005
Token man
Token ID582
Feature activation-2.00e-02

Pos logit contributions
mmm+0.044
ww+0.043
 importantly+0.041
www+0.039
dd+0.037
Neg logit contributions
 driveway-0.036
 asleep-0.036
robe-0.035
 bank-0.034
 neighborhood-0.032
Token sitting
Token ID5586
Feature activation+2.93e-02

Pos logit contributions
mmm+0.036
ww+0.034
 importantly+0.034
www+0.031
dd+0.030
Neg logit contributions
robe-0.038
 asleep-0.037
 driveway-0.037
seek-0.035
ASH-0.035
Token on
Token ID319
Feature activation+2.93e-02

Pos logit contributions
seek+0.062
 driveway+0.059
 bank+0.057
stick+0.057
ASH+0.056
Neg logit contributions
 importantly-0.043
 badly-0.041
ww-0.041
mmm-0.040
dd-0.039
Token the
Token ID262
Feature activation+8.87e-02

Pos logit contributions
robe+0.064
pit+0.062
 driveway+0.062
 asleep+0.061
 grounds+0.060
Neg logit contributions
mmm-0.042
ww-0.040
www-0.035
 importantly-0.035
dd-0.032
Token sidewalk
Token ID24058
Feature activation+1.50e-02

Pos logit contributions
pit+0.123
ASH+0.123
 asleep+0.121
seek+0.118
robe+0.109
Neg logit contributions
ww-0.183
mmm-0.176
www-0.174
 annoying-0.161
 hour-0.160
Token.
Token ID13
Feature activation+1.67e-02

Pos logit contributions
robe+0.034
 asleep+0.033
ags+0.033
 driveway+0.032
ASH+0.032
Neg logit contributions
mmm-0.018
 importantly-0.017
 badly-0.015
 hour-0.015
ww-0.015
Token The
Token ID383
Feature activation+1.93e-02

Pos logit contributions
 asleep+0.034
robe+0.033
 driveway+0.033
seek+0.032
 bank+0.032
Neg logit contributions
mmm-0.028
ww-0.027
 importantly-0.026
www-0.025
dd-0.022
Token man
Token ID582
Feature activation-1.93e-02

Pos logit contributions
 asleep+0.041
seek+0.039
stick+0.038
ASH+0.038
robe+0.038
Neg logit contributions
ww-0.029
mmm-0.028
 importantly-0.028
www-0.027
 annoying-0.025
Token looked
Token ID3114
Feature activation+9.65e-03

Pos logit contributions
mmm+0.040
ww+0.039
www+0.036
 importantly+0.036
dd+0.033
Neg logit contributions
 driveway-0.033
 asleep-0.032
 bank-0.032
robe-0.030
seek-0.030
Token he
Token ID339
Feature activation+9.82e-03

Pos logit contributions
mmm+0.039
ww+0.038
 importantly+0.037
www+0.035
 annoying+0.031
Neg logit contributions
 asleep-0.037
 bank-0.034
 driveway-0.034
robe-0.034
 edge-0.034
Token heard
Token ID2982
Feature activation+8.65e-03

Pos logit contributions
 asleep+0.016
 driveway+0.016
robe+0.015
seek+0.015
 bank+0.015
Neg logit contributions
mmm-0.021
ww-0.021
 importantly-0.020
www-0.019
dd-0.018
Token a
Token ID257
Feature activation-1.81e-02

Pos logit contributions
 driveway+0.016
robe+0.015
 asleep+0.015
 bank+0.014
seek+0.014
Neg logit contributions
mmm-0.017
ww-0.017
 importantly-0.016
www-0.015
dd-0.014
Token knock
Token ID10643
Feature activation-6.43e-03

Pos logit contributions
mmm+0.032
ww+0.029
 importantly+0.029
dd+0.027
www+0.026
Neg logit contributions
 driveway-0.036
 asleep-0.036
 edge-0.035
 bank-0.035
robe-0.034
Token on
Token ID319
Feature activation+2.24e-02

Pos logit contributions
mmm+0.014
ww+0.014
 importantly+0.013
www+0.013
dd+0.012
Neg logit contributions
 asleep-0.010
 driveway-0.010
robe-0.010
seek-0.009
 bank-0.009
Token the
Token ID262
Feature activation+8.84e-02

Pos logit contributions
robe+0.052
 driveway+0.051
 asleep+0.050
seek+0.050
pit+0.049
Neg logit contributions
mmm-0.031
ww-0.031
www-0.028
 importantly-0.026
dd-0.026
Token door
Token ID3420
Feature activation+2.24e-02

Pos logit contributions
seek+0.122
ASH+0.113
izons+0.110
pit+0.108
stick+0.105
Neg logit contributions
ww-0.182
mmm-0.173
 importantly-0.172
www-0.172
 hour-0.166
Token.
Token ID13
Feature activation+2.96e-02

Pos logit contributions
 asleep+0.044
robe+0.044
 driveway+0.043
seek+0.042
 bank+0.042
Neg logit contributions
mmm-0.039
ww-0.037
 importantly-0.035
www-0.034
 hour-0.030
Token It
Token ID632
Feature activation+3.19e-02

Pos logit contributions
 asleep+0.053
 driveway+0.052
robe+0.052
seek+0.048
 bank+0.048
Neg logit contributions
mmm-0.059
ww-0.057
www-0.054
 importantly-0.054
dd-0.048
Token was
Token ID373
Feature activation-5.02e-03

Pos logit contributions
 asleep+0.042
 driveway+0.042
robe+0.041
 bank+0.039
seek+0.038
Neg logit contributions
mmm-0.078
ww-0.076
 importantly-0.074
www-0.070
dd-0.066
Token his
Token ID465
Feature activation+2.26e-02

Pos logit contributions
mmm+0.012
ww+0.012
www+0.011
 importantly+0.011
dd+0.010
Neg logit contributions
 asleep-0.008
 driveway-0.007
robe-0.007
 bathroom-0.006
 bank-0.006
Token a
Token ID257
Feature activation-2.95e-02

Pos logit contributions
 driveway+0.026
 asleep+0.026
robe+0.025
seek+0.025
 bank+0.024
Neg logit contributions
ww-0.029
mmm-0.029
 importantly-0.028
www-0.027
dd-0.025
Token poor
Token ID3595
Feature activation-1.98e-02

Pos logit contributions
mmm+0.065
ww+0.060
 importantly+0.059
www+0.056
dd+0.056
Neg logit contributions
 driveway-0.047
 asleep-0.047
robe-0.045
 bank-0.045
 neighborhood-0.043
Token man
Token ID582
Feature activation-1.63e-02

Pos logit contributions
mmm+0.041
ww+0.040
 importantly+0.038
www+0.037
dd+0.034
Neg logit contributions
 asleep-0.034
 driveway-0.033
robe-0.032
 bank-0.031
seek-0.030
Token sitting
Token ID5586
Feature activation+3.18e-02

Pos logit contributions
mmm+0.030
ww+0.029
 importantly+0.028
www+0.026
dd+0.025
Neg logit contributions
robe-0.030
 asleep-0.030
 driveway-0.030
seek-0.028
ASH-0.028
Token on
Token ID319
Feature activation+3.05e-02

Pos logit contributions
seek+0.064
 driveway+0.064
stick+0.060
 bank+0.060
ASH+0.058
Neg logit contributions
ww-0.050
 importantly-0.049
mmm-0.047
 badly-0.047
dd-0.045
Token the
Token ID262
Feature activation+8.78e-02

Pos logit contributions
robe+0.063
 driveway+0.063
pit+0.062
 asleep+0.061
seek+0.060
Neg logit contributions
mmm-0.047
ww-0.047
www-0.041
 importantly-0.040
dd-0.038
Token street
Token ID4675
Feature activation+8.73e-03

Pos logit contributions
ASH+0.113
 asleep+0.113
pit+0.111
seek+0.111
stick+0.110
Neg logit contributions
ww-0.197
mmm-0.188
www-0.186
 importantly-0.169
 annoying-0.164
Token.
Token ID13
Feature activation+2.65e-02

Pos logit contributions
robe+0.020
 asleep+0.019
ASH+0.018
ags+0.018
 driveway+0.018
Neg logit contributions
mmm-0.011
 importantly-0.011
 badly-0.010
ww-0.010
 hour-0.010
Token
Token ID220
Feature activation+3.37e-02

Pos logit contributions
seek+0.049
 asleep+0.049
 driveway+0.048
robe+0.047
ASH+0.047
Neg logit contributions
mmm-0.046
ww-0.045
 importantly-0.043
www-0.041
 gust-0.037
Token
Token ID198
Feature activation-1.38e-02

Pos logit contributions
 asleep+0.058
 driveway+0.057
 bank+0.055
seek+0.055
robe+0.055
Neg logit contributions
mmm-0.068
ww-0.064
 importantly-0.063
www-0.059
dd-0.055
Token
Token ID198
Feature activation-2.22e-02

Pos logit contributions
mmm+0.033
ww+0.033
 importantly+0.031
www+0.031
dd+0.029
Neg logit contributions
 asleep-0.018
 driveway-0.018
robe-0.018
seek-0.017
 bank-0.017
Token can
Token ID460
Feature activation+2.48e-02

Pos logit contributions
 asleep+0.046
 driveway+0.046
robe+0.046
seek+0.045
 bank+0.044
Neg logit contributions
mmm-0.040
ww-0.039
 importantly-0.036
www-0.035
dd-0.031
Token.
Token ID13
Feature activation+1.03e-02

Pos logit contributions
robe+0.053
 asleep+0.052
 driveway+0.050
ASH+0.049
pit+0.048
Neg logit contributions
mmm-0.042
ww-0.039
www-0.037
 importantly-0.035
dd-0.031
Token Just
Token ID2329
Feature activation-9.45e-03

Pos logit contributions
 asleep+0.020
 driveway+0.020
robe+0.020
 bank+0.019
seek+0.019
Neg logit contributions
mmm-0.018
ww-0.018
 importantly-0.017
www-0.016
dd-0.014
Token knock
Token ID10643
Feature activation+4.84e-02

Pos logit contributions
mmm+0.019
ww+0.019
 importantly+0.018
www+0.017
dd+0.016
Neg logit contributions
 asleep-0.016
 driveway-0.016
robe-0.016
seek-0.015
 bank-0.015
Token on
Token ID319
Feature activation+8.95e-03

Pos logit contributions
 driveway+0.080
seek+0.079
robe+0.079
 bank+0.076
 asleep+0.075
Neg logit contributions
ww-0.099
mmm-0.099
www-0.092
 importantly-0.089
dd-0.083
Token the
Token ID262
Feature activation+8.67e-02

Pos logit contributions
 driveway+0.019
 asleep+0.019
robe+0.019
seek+0.019
 bank+0.018
Neg logit contributions
mmm-0.014
ww-0.014
 importantly-0.013
www-0.012
dd-0.011
Token door
Token ID3420
Feature activation+1.20e-02

Pos logit contributions
seek+0.131
robe+0.107
stick+0.105
 asleep+0.102
izons+0.101
Neg logit contributions
ww-0.197
www-0.181
mmm-0.179
 importantly-0.177
 gust-0.175
Token.
Token ID13
Feature activation+1.49e-02

Pos logit contributions
robe+0.024
 driveway+0.024
seek+0.024
 asleep+0.024
 bank+0.023
Neg logit contributions
mmm-0.020
ww-0.018
www-0.017
 importantly-0.017
 hour-0.015
Token I
Token ID314
Feature activation+3.50e-02

Pos logit contributions
 asleep+0.029
 driveway+0.029
robe+0.028
 bank+0.027
seek+0.027
Neg logit contributions
mmm-0.027
ww-0.026
 importantly-0.025
www-0.024
dd-0.021
Token'll
Token ID1183
Feature activation+3.61e-02

Pos logit contributions
 asleep+0.063
 driveway+0.062
robe+0.061
seek+0.061
ASH+0.060
Neg logit contributions
ww-0.064
mmm-0.062
www-0.060
 importantly-0.057
dd-0.054
Token be
Token ID307
Feature activation-5.06e-04

Pos logit contributions
robe+0.067
 asleep+0.065
 driveway+0.062
pit+0.062
ASH+0.062
Neg logit contributions
mmm-0.068
ww-0.064
www-0.061
 importantly-0.059
 annoying-0.055
Token heard
Token ID2982
Feature activation+9.05e-03

Pos logit contributions
 asleep+0.006
 driveway+0.006
robe+0.006
seek+0.005
 bank+0.005
Neg logit contributions
mmm-0.008
ww-0.007
 importantly-0.007
www-0.007
dd-0.006
Token their
Token ID511
Feature activation+1.85e-02

Pos logit contributions
 driveway+0.017
robe+0.017
 asleep+0.016
seek+0.016
 bank+0.016
Neg logit contributions
mmm-0.017
ww-0.017
 importantly-0.016
www-0.015
dd-0.014
Token mom
Token ID1995
Feature activation-2.24e-03

Pos logit contributions
 asleep+0.035
robe+0.035
 driveway+0.035
seek+0.034
 bank+0.033
Neg logit contributions
ww-0.033
mmm-0.033
 importantly-0.031
www-0.029
dd-0.027
Token knock
Token ID10643
Feature activation+2.39e-02

Pos logit contributions
mmm+0.003
 importantly+0.003
ww+0.003
www+0.003
 annoying+0.003
Neg logit contributions
robe-0.005
 driveway-0.005
seek-0.005
 asleep-0.005
 bank-0.004
Token on
Token ID319
Feature activation+2.88e-02

Pos logit contributions
robe+0.041
seek+0.040
 driveway+0.039
 bank+0.038
 asleep+0.037
Neg logit contributions
ww-0.046
mmm-0.046
 importantly-0.044
www-0.043
 hour-0.040
Token the
Token ID262
Feature activation+8.55e-02

Pos logit contributions
robe+0.060
 driveway+0.058
seek+0.058
 asleep+0.056
pit+0.055
Neg logit contributions
mmm-0.047
ww-0.045
 importantly-0.042
www-0.042
dd-0.038
Token door
Token ID3420
Feature activation+2.72e-02

Pos logit contributions
seek+0.120
pit+0.100
ASH+0.097
 asleep+0.096
izons+0.096
Neg logit contributions
ww-0.185
 importantly-0.178
mmm-0.176
www-0.171
 hour-0.165
Token.
Token ID13
Feature activation+2.27e-02

Pos logit contributions
robe+0.056
 asleep+0.055
seek+0.054
 driveway+0.054
 bank+0.054
Neg logit contributions
mmm-0.043
ww-0.040
 importantly-0.039
www-0.037
 hour-0.034
Token
Token ID198
Feature activation-1.85e-02

Pos logit contributions
 asleep+0.039
 driveway+0.039
robe+0.039
seek+0.038
 bank+0.037
Neg logit contributions
mmm-0.046
ww-0.044
 importantly-0.042
www-0.041
dd-0.036
Token
Token ID198
Feature activation-3.39e-02

Pos logit contributions
ww+0.048
mmm+0.048
 importantly+0.046
www+0.045
dd+0.042
Neg logit contributions
 driveway-0.021
 asleep-0.019
robe-0.019
seek-0.018
 edge-0.018
Token"
Token ID1
Feature activation+2.26e-03

Pos logit contributions
mmm+0.059
 importantly+0.059
ww+0.055
 gust+0.051
 annoying+0.050
Neg logit contributions
 driveway-0.064
seek-0.060
 neighborhood-0.059
 edge-0.058
 bathroom-0.057
Token saw
Token ID2497
Feature activation+7.54e-03

Pos logit contributions
robe+0.018
 driveway+0.018
seek+0.017
stick+0.017
pit+0.017
Neg logit contributions
ww-0.015
mmm-0.015
www-0.014
dd-0.014
 importantly-0.014
Token her
Token ID607
Feature activation+1.56e-02

Pos logit contributions
 driveway+0.015
robe+0.014
seek+0.014
 asleep+0.014
 bank+0.014
Neg logit contributions
mmm-0.013
ww-0.013
 importantly-0.013
www-0.012
dd-0.011
Token dad
Token ID9955
Feature activation+5.75e-03

Pos logit contributions
 driveway+0.029
seek+0.029
ASH+0.028
robe+0.027
 asleep+0.027
Neg logit contributions
ww-0.027
mmm-0.027
 importantly-0.027
www-0.024
 annoying-0.023
Token sitting
Token ID5586
Feature activation+3.39e-02

Pos logit contributions
robe+0.010
 driveway+0.010
seek+0.010
 asleep+0.010
ASH+0.009
Neg logit contributions
mmm-0.011
 importantly-0.011
ww-0.010
www-0.010
dd-0.009
Token on
Token ID319
Feature activation+2.66e-02

Pos logit contributions
seek+0.078
 driveway+0.076
 bank+0.075
stick+0.074
 banks+0.070
Neg logit contributions
 importantly-0.045
ww-0.041
 badly-0.040
mmm-0.038
 years-0.038
Token the
Token ID262
Feature activation+8.53e-02

Pos logit contributions
 driveway+0.059
robe+0.057
pit+0.057
 grounds+0.056
seek+0.056
Neg logit contributions
mmm-0.037
ww-0.035
 importantly-0.032
www-0.031
dd-0.029
Token couch
Token ID18507
Feature activation+2.32e-02

Pos logit contributions
pit+0.094
ASH+0.091
seek+0.089
izons+0.088
stick+0.087
Neg logit contributions
ww-0.194
mmm-0.185
 annoying-0.182
www-0.181
 hour-0.179
Token.
Token ID13
Feature activation+2.10e-02

Pos logit contributions
ASH+0.052
 driveway+0.051
ags+0.051
robe+0.050
 meets+0.050
Neg logit contributions
mmm-0.026
 importantly-0.025
ww-0.023
 hour-0.022
 badly-0.022
Token She
Token ID1375
Feature activation+1.47e-03

Pos logit contributions
seek+0.039
ASH+0.039
pit+0.039
 driveway+0.038
 bank+0.038
Neg logit contributions
mmm-0.034
 importantly-0.034
ww-0.032
www-0.030
 hour-0.027
Token looked
Token ID3114
Feature activation+1.45e-02

Pos logit contributions
 asleep+0.002
 driveway+0.002
robe+0.002
 bank+0.002
 neighborhood+0.002
Neg logit contributions
mmm-0.003
ww-0.003
 importantly-0.003
www-0.003
dd-0.002
Token up
Token ID510
Feature activation-2.62e-04

Pos logit contributions
 asleep+0.022
 driveway+0.021
robe+0.021
 bank+0.020
seek+0.019
Neg logit contributions
mmm-0.033
ww-0.033
 importantly-0.031
www-0.030
dd-0.028
Token tiger
Token ID26241
Feature activation-1.28e-02

Pos logit contributions
 asleep+0.016
 driveway+0.016
robe+0.016
seek+0.015
 bank+0.015
Neg logit contributions
mmm-0.014
ww-0.014
 importantly-0.013
www-0.013
dd-0.011
Token heard
Token ID2982
Feature activation+9.72e-03

Pos logit contributions
ww+0.024
mmm+0.024
www+0.022
 importantly+0.022
dd+0.021
Neg logit contributions
 asleep-0.023
 driveway-0.023
robe-0.023
seek-0.021
ASH-0.021
Token a
Token ID257
Feature activation-1.88e-02

Pos logit contributions
 driveway+0.018
robe+0.017
 asleep+0.017
 bank+0.016
seek+0.016
Neg logit contributions
mmm-0.019
ww-0.019
 importantly-0.018
www-0.017
dd-0.016
Token knock
Token ID10643
Feature activation-3.69e-03

Pos logit contributions
mmm+0.033
ww+0.031
 importantly+0.030
dd+0.029
www+0.028
Neg logit contributions
 asleep-0.037
 driveway-0.037
 edge-0.037
 bank-0.036
 grounds-0.034
Token on
Token ID319
Feature activation+2.48e-02

Pos logit contributions
mmm+0.008
ww+0.008
 importantly+0.007
www+0.007
dd+0.007
Neg logit contributions
 driveway-0.006
 asleep-0.006
robe-0.006
seek-0.006
 bank-0.006
Token the
Token ID262
Feature activation+8.49e-02

Pos logit contributions
robe+0.061
 driveway+0.059
seek+0.058
pit+0.058
 asleep+0.057
Neg logit contributions
mmm-0.030
ww-0.030
www-0.028
dd-0.024
 importantly-0.024
Token door
Token ID3420
Feature activation+2.42e-02

Pos logit contributions
seek+0.121
 asleep+0.109
robe+0.105
stick+0.099
izons+0.098
Neg logit contributions
ww-0.185
mmm-0.177
www-0.175
 hour-0.163
 importantly-0.163
Token.
Token ID13
Feature activation+3.16e-02

Pos logit contributions
robe+0.050
 asleep+0.049
 driveway+0.048
seek+0.048
 bank+0.047
Neg logit contributions
mmm-0.039
ww-0.036
 importantly-0.035
www-0.033
 hour-0.031
Token When
Token ID1649
Feature activation-2.36e-02

Pos logit contributions
 driveway+0.058
 asleep+0.058
robe+0.056
seek+0.054
pit+0.054
Neg logit contributions
mmm-0.059
ww-0.057
www-0.054
 importantly-0.053
Run-0.047
Token he
Token ID339
Feature activation+1.50e-02

Pos logit contributions
mmm+0.037
ww+0.036
 importantly+0.034
www+0.032
dd+0.029
Neg logit contributions
 asleep-0.052
 driveway-0.051
robe-0.050
 bank-0.049
seek-0.049
Token answered
Token ID9373
Feature activation+4.07e-03

Pos logit contributions
 asleep+0.018
 driveway+0.018
 bank+0.016
robe+0.016
ASH+0.016
Neg logit contributions
mmm-0.039
ww-0.037
 importantly-0.037
www-0.035
dd-0.033
Token the
Token ID262
Feature activation+4.77e-02

Pos logit contributions
robe+0.001
seek+0.001
 driveway+0.001
 asleep+0.001
 grounds+0.001
Neg logit contributions
mmm-0.001
ww-0.001
 importantly-0.001
www-0.001
dd-0.001
Token truck
Token ID7779
Feature activation+1.54e-02

Pos logit contributions
seek+0.085
robe+0.078
 asleep+0.072
stick+0.072
izons+0.066
Neg logit contributions
ww-0.094
 hour-0.088
 importantly-0.087
www-0.087
mmm-0.086
Token and
Token ID290
Feature activation-1.43e-03

Pos logit contributions
robe+0.033
 asleep+0.032
seek+0.031
ASH+0.030
 driveway+0.030
Neg logit contributions
mmm-0.022
ww-0.020
 importantly-0.020
 hour-0.018
www-0.018
Token knock
Token ID10643
Feature activation+3.21e-02

Pos logit contributions
ww+0.002
mmm+0.002
 importantly+0.002
www+0.002
Run+0.002
Neg logit contributions
robe-0.003
 driveway-0.003
 asleep-0.003
seek-0.003
ASH-0.003
Token on
Token ID319
Feature activation+1.70e-02

Pos logit contributions
 asleep+0.048
 driveway+0.048
robe+0.047
seek+0.046
 bank+0.045
Neg logit contributions
mmm-0.070
ww-0.070
 importantly-0.066
www-0.065
dd-0.061
Token the
Token ID262
Feature activation+8.48e-02

Pos logit contributions
 driveway+0.035
robe+0.034
 asleep+0.034
seek+0.034
 grounds+0.032
Neg logit contributions
mmm-0.028
ww-0.028
 importantly-0.025
www-0.025
dd-0.023
Token door
Token ID3420
Feature activation+2.31e-02

Pos logit contributions
seek+0.135
robe+0.115
 asleep+0.106
stick+0.102
izons+0.101
Neg logit contributions
ww-0.171
mmm-0.161
www-0.161
 hour-0.160
 importantly-0.160
Token.
Token ID13
Feature activation+2.33e-02

Pos logit contributions
robe+0.048
 asleep+0.047
 driveway+0.046
seek+0.046
 bank+0.045
Neg logit contributions
mmm-0.036
ww-0.033
 importantly-0.033
www-0.031
 hour-0.029
Token Their
Token ID5334
Feature activation+2.65e-02

Pos logit contributions
 driveway+0.038
 asleep+0.037
robe+0.037
seek+0.037
 bank+0.035
Neg logit contributions
mmm-0.049
ww-0.048
 importantly-0.046
www-0.044
dd-0.041
Token mom
Token ID1995
Feature activation-1.86e-02

Pos logit contributions
robe+0.066
seek+0.065
 asleep+0.062
stick+0.061
ASH+0.059
Neg logit contributions
ww-0.029
 importantly-0.028
www-0.026
mmm-0.024
 annoying-0.024
Token opened
Token ID4721
Feature activation+1.94e-02

Pos logit contributions
mmm+0.038
ww+0.038
 importantly+0.036
www+0.035
dd+0.032
Neg logit contributions
 asleep-0.031
 driveway-0.030
robe-0.030
seek-0.029
 bank-0.029
Token the
Token ID262
Feature activation+4.29e-02

Pos logit contributions
robe+0.047
pit+0.046
 driveway+0.045
 asleep+0.045
seek+0.044
Neg logit contributions
mmm-0.026
ww-0.022
 importantly-0.022
www-0.020
 annoying-0.019
Token building
Token ID2615
Feature activation+3.75e-02

Pos logit contributions
robe+0.066
 asleep+0.064
stick+0.063
seek+0.060
anced+0.060
Neg logit contributions
ww-0.084
www-0.079
 annoying-0.076
 importantly-0.076
 unexpected-0.075
Token and
Token ID290
Feature activation+1.65e-02

Pos logit contributions
 asleep+0.083
robe+0.082
izons+0.076
seek+0.074
IT+0.073
Neg logit contributions
 hour-0.044
 much-0.041
 though-0.038
mmm-0.038
 badly-0.035
Token knock
Token ID10643
Feature activation+4.44e-02

Pos logit contributions
robe+0.034
 asleep+0.032
ASH+0.032
pit+0.031
seek+0.031
Neg logit contributions
mmm-0.023
 else-0.021
 expect-0.021
 though-0.021
Run-0.021
Token on
Token ID319
Feature activation+1.85e-02

Pos logit contributions
 driveway+0.078
seek+0.077
robe+0.077
 asleep+0.077
stick+0.075
Neg logit contributions
ww-0.081
mmm-0.081
www-0.076
 importantly-0.074
dd-0.071
Token the
Token ID262
Feature activation+8.47e-02

Pos logit contributions
robe+0.042
 driveway+0.041
seek+0.040
 asleep+0.040
pit+0.040
Neg logit contributions
mmm-0.026
ww-0.026
www-0.023
 importantly-0.023
dd-0.021
Token door
Token ID3420
Feature activation+2.08e-02

Pos logit contributions
seek+0.142
robe+0.124
stick+0.120
izons+0.117
pit+0.114
Neg logit contributions
ww-0.154
 importantly-0.149
mmm-0.146
 hour-0.146
 gust-0.145
Token.
Token ID13
Feature activation+2.04e-02

Pos logit contributions
robe+0.046
 asleep+0.045
 driveway+0.044
seek+0.044
 bank+0.043
Neg logit contributions
mmm-0.029
 importantly-0.026
ww-0.025
 hour-0.024
www-0.024
Token Mr
Token ID1770
Feature activation-1.25e-02

Pos logit contributions
 driveway+0.036
robe+0.036
 asleep+0.036
pit+0.036
seek+0.035
Neg logit contributions
mmm-0.038
 importantly-0.036
ww-0.036
www-0.033
Run-0.030
Token.
Token ID13
Feature activation+2.26e-02

Pos logit contributions
mmm+0.020
ww+0.019
 importantly+0.018
www+0.017
dd+0.016
Neg logit contributions
 driveway-0.026
 asleep-0.025
robe-0.024
 bank-0.024
seek-0.024
Token Jones
Token ID5437
Feature activation-6.19e-03

Pos logit contributions
seek+0.038
 grounds+0.038
 driveway+0.038
 bank+0.037
 asleep+0.037
Neg logit contributions
mmm-0.040
ww-0.037
 importantly-0.036
www-0.033
Run-0.030
Token she
Token ID673
Feature activation+2.15e-02

Pos logit contributions
mmm+0.026
ww+0.026
 importantly+0.024
www+0.023
dd+0.020
Neg logit contributions
 asleep-0.042
 driveway-0.041
robe-0.041
seek-0.040
 bank-0.039
Token heard
Token ID2982
Feature activation+1.31e-02

Pos logit contributions
 asleep+0.038
 driveway+0.038
robe+0.037
seek+0.037
stick+0.036
Neg logit contributions
ww-0.041
mmm-0.040
 importantly-0.038
www-0.037
dd-0.036
Token a
Token ID257
Feature activation-1.84e-02

Pos logit contributions
 driveway+0.022
 asleep+0.022
robe+0.022
 bank+0.021
seek+0.021
Neg logit contributions
mmm-0.026
ww-0.026
 importantly-0.025
www-0.024
dd-0.022
Token knock
Token ID10643
Feature activation+2.48e-03

Pos logit contributions
mmm+0.032
ww+0.029
 importantly+0.028
dd+0.027
www+0.026
Neg logit contributions
 asleep-0.038
 driveway-0.038
 edge-0.037
robe-0.037
 bank-0.037
Token on
Token ID319
Feature activation+2.62e-02

Pos logit contributions
 asleep+0.004
 driveway+0.004
robe+0.004
seek+0.003
 bank+0.003
Neg logit contributions
mmm-0.006
ww-0.006
 importantly-0.005
www-0.005
dd-0.005
Token the
Token ID262
Feature activation+8.43e-02

Pos logit contributions
robe+0.061
 driveway+0.060
seek+0.059
pit+0.058
 asleep+0.058
Neg logit contributions
ww-0.035
mmm-0.034
www-0.032
 importantly-0.029
dd-0.027
Token door
Token ID3420
Feature activation+2.73e-02

Pos logit contributions
seek+0.115
 asleep+0.101
izons+0.100
ASH+0.099
pit+0.097
Neg logit contributions
ww-0.181
mmm-0.171
www-0.168
 importantly-0.165
 annoying-0.157
Token.
Token ID13
Feature activation+3.22e-02

Pos logit contributions
 asleep+0.051
robe+0.050
 driveway+0.050
seek+0.049
 bank+0.048
Neg logit contributions
mmm-0.050
ww-0.048
 importantly-0.046
www-0.045
 hour-0.039
Token She
Token ID1375
Feature activation+8.22e-03

Pos logit contributions
 asleep+0.064
 lair+0.062
pit+0.062
seek+0.062
 bank+0.061
Neg logit contributions
mmm-0.052
ww-0.048
www-0.046
 importantly-0.045
 gust-0.042
Token went
Token ID1816
Feature activation+4.96e-02

Pos logit contributions
 driveway+0.013
 asleep+0.013
robe+0.012
 bank+0.012
seek+0.012
Neg logit contributions
mmm-0.019
ww-0.018
 importantly-0.017
www-0.016
dd-0.015
Token to
Token ID284
Feature activation+3.61e-02

Pos logit contributions
ASH+0.079
robe+0.076
izons+0.075
 bank+0.074
 driveway+0.073
Neg logit contributions
ww-0.088
www-0.086
 though-0.083
mmm-0.080
 importantly-0.078
Token.
Token ID13
Feature activation-3.46e-03

Pos logit contributions
mmm+0.070
ww+0.070
 importantly+0.065
www+0.064
dd+0.059
Neg logit contributions
 asleep-0.058
 driveway-0.056
robe-0.055
 bank-0.053
seek-0.053
Token I
Token ID314
Feature activation+1.98e-02

Pos logit contributions
ww+0.006
mmm+0.005
www+0.005
 importantly+0.005
dd+0.005
Neg logit contributions
 asleep-0.007
 driveway-0.007
robe-0.007
seek-0.007
stick-0.007
Token can
Token ID460
Feature activation+3.19e-02

Pos logit contributions
 driveway+0.041
 asleep+0.039
robe+0.038
ASH+0.038
pit+0.037
Neg logit contributions
mmm-0.033
ww-0.033
 importantly-0.030
www-0.030
dd-0.028
Token go
Token ID467
Feature activation+4.48e-02

Pos logit contributions
 asleep+0.059
robe+0.058
 driveway+0.058
ASH+0.057
pit+0.056
Neg logit contributions
mmm-0.061
ww-0.056
 importantly-0.053
www-0.052
 annoying-0.048
Token wherever
Token ID14530
Feature activation-2.75e-02

Pos logit contributions
ASH+0.075
izons+0.072
 driveway+0.072
robe+0.072
IT+0.070
Neg logit contributions
mmm-0.082
ww-0.082
 though-0.076
www-0.075
 importantly-0.073
Token I
Token ID314
Feature activation+8.38e-02

Pos logit contributions
mmm+0.031
ww+0.030
 importantly+0.027
www+0.026
dd+0.021
Neg logit contributions
 asleep-0.072
 driveway-0.072
robe-0.071
seek-0.069
 bank-0.069
Token want
Token ID765
Feature activation-1.25e-02

Pos logit contributions
 asleep+0.175
 driveway+0.174
seek+0.168
robe+0.164
pit+0.162
Neg logit contributions
mmm-0.132
ww-0.125
www-0.116
 importantly-0.112
 expect-0.107
Token and
Token ID290
Feature activation-4.78e-03

Pos logit contributions
mmm+0.024
ww+0.024
 importantly+0.022
www+0.022
dd+0.020
Neg logit contributions
 driveway-0.023
 asleep-0.023
robe-0.022
seek-0.021
 bank-0.021
Token do
Token ID466
Feature activation-1.51e-02

Pos logit contributions
mmm+0.008
ww+0.008
 importantly+0.007
www+0.007
 annoying+0.006
Neg logit contributions
robe-0.010
 driveway-0.009
ASH-0.009
pit-0.009
seek-0.009
Token whatever
Token ID4232
Feature activation-2.76e-02

Pos logit contributions
mmm+0.025
ww+0.024
 importantly+0.023
www+0.022
dd+0.020
Neg logit contributions
 driveway-0.032
robe-0.031
 asleep-0.031
ASH-0.030
seek-0.030
Token I
Token ID314
Feature activation+5.80e-02

Pos logit contributions
mmm+0.033
ww+0.032
 importantly+0.028
www+0.027
dd+0.023
Neg logit contributions
 asleep-0.071
 driveway-0.070
robe-0.069
seek-0.067
 bank-0.067
Token try
Token ID1949
Feature activation+1.96e-02

Pos logit contributions
 asleep+0.059
robe+0.058
 driveway+0.057
seek+0.055
ASH+0.053
Neg logit contributions
mmm-0.075
ww-0.072
 importantly-0.068
www-0.067
dd-0.061
Token to
Token ID284
Feature activation+4.38e-02

Pos logit contributions
robe+0.043
 driveway+0.042
seek+0.039
 asleep+0.038
ASH+0.038
Neg logit contributions
mmm-0.034
ww-0.031
 importantly-0.029
www-0.028
dd-0.025
Token figure
Token ID3785
Feature activation+1.42e-02

Pos logit contributions
robe+0.083
 asleep+0.080
 driveway+0.075
seek+0.073
pit+0.072
Neg logit contributions
mmm-0.090
ww-0.081
 importantly-0.077
www-0.077
 annoying-0.070
Token out
Token ID503
Feature activation-6.31e-03

Pos logit contributions
 driveway+0.026
robe+0.026
 asleep+0.026
seek+0.025
 bank+0.024
Neg logit contributions
mmm-0.027
ww-0.027
www-0.025
 importantly-0.024
dd-0.022
Token what
Token ID644
Feature activation+2.32e-02

Pos logit contributions
mmm+0.008
ww+0.008
 importantly+0.007
www+0.007
dd+0.006
Neg logit contributions
 driveway-0.016
robe-0.016
 asleep-0.015
seek-0.015
 bank-0.015
Token I
Token ID314
Feature activation+8.37e-02

Pos logit contributions
 asleep+0.050
 driveway+0.050
robe+0.050
seek+0.048
 bank+0.048
Neg logit contributions
ww-0.036
mmm-0.036
 importantly-0.033
www-0.032
dd-0.028
Token can
Token ID460
Feature activation+5.29e-02

Pos logit contributions
 asleep+0.184
robe+0.176
 driveway+0.176
seek+0.171
ASH+0.165
Neg logit contributions
mmm-0.134
ww-0.125
 importantly-0.118
www-0.117
dd-0.105
Token do
Token ID466
Feature activation-3.84e-03

Pos logit contributions
 asleep+0.094
robe+0.092
 driveway+0.090
ASH+0.084
seek+0.083
Neg logit contributions
mmm-0.110
ww-0.099
 importantly-0.094
www-0.094
dd-0.086
Token with
Token ID351
Feature activation-1.86e-02

Pos logit contributions
mmm+0.007
ww+0.006
 importantly+0.006
www+0.006
dd+0.005
Neg logit contributions
 driveway-0.008
 asleep-0.008
robe-0.007
 bank-0.007
ASH-0.007
Token it
Token ID340
Feature activation-3.86e-03

Pos logit contributions
ww+0.026
mmm+0.026
 importantly+0.024
www+0.023
dd+0.020
Neg logit contributions
 asleep-0.043
 driveway-0.042
robe-0.041
 bank-0.041
seek-0.040
Token."
Token ID526
Feature activation+9.97e-03

Pos logit contributions
mmm+0.007
ww+0.007
 importantly+0.006
www+0.006
dd+0.005
Neg logit contributions
 driveway-0.008
 asleep-0.007
robe-0.007
seek-0.007
 bank-0.007
Token to
Token ID284
Feature activation-4.16e-03

Pos logit contributions
 driveway+0.014
robe+0.014
 asleep+0.013
 bank+0.013
seek+0.013
Neg logit contributions
mmm-0.013
 importantly-0.012
ww-0.012
www-0.011
dd-0.011
Token herself
Token ID5223
Feature activation-1.25e-02

Pos logit contributions
mmm+0.008
ww+0.007
 importantly+0.007
www+0.007
dd+0.006
Neg logit contributions
 driveway-0.008
 asleep-0.008
robe-0.008
 bank-0.007
seek-0.007
Token,
Token ID11
Feature activation+1.46e-02

Pos logit contributions
mmm+0.023
ww+0.022
 importantly+0.021
www+0.020
dd+0.019
Neg logit contributions
 driveway-0.024
robe-0.023
 asleep-0.023
seek-0.022
ASH-0.022
Token "
Token ID366
Feature activation+2.72e-02

Pos logit contributions
robe+0.029
 driveway+0.028
seek+0.028
 asleep+0.027
 bank+0.026
Neg logit contributions
mmm-0.025
 importantly-0.024
ww-0.023
www-0.021
dd-0.020
Tokenshould
Token ID21754
Feature activation-2.05e-02

Pos logit contributions
 asleep+0.056
 driveway+0.054
 bank+0.053
robe+0.052
seek+0.051
Neg logit contributions
mmm-0.046
ww-0.045
www-0.041
 importantly-0.040
dd-0.038
Token I
Token ID314
Feature activation+8.36e-02

Pos logit contributions
mmm+0.020
ww+0.019
 importantly+0.017
www+0.015
dd+0.012
Neg logit contributions
 driveway-0.057
 asleep-0.057
robe-0.056
seek-0.055
 bank-0.054
Token follow
Token ID1061
Feature activation+6.78e-03

Pos logit contributions
 asleep+0.137
 driveway+0.134
robe+0.131
seek+0.128
ASH+0.124
Neg logit contributions
mmm-0.175
ww-0.170
 importantly-0.159
www-0.159
dd-0.145
Token the
Token ID262
Feature activation+3.40e-02

Pos logit contributions
 driveway+0.015
 asleep+0.015
robe+0.015
seek+0.014
 bank+0.014
Neg logit contributions
mmm-0.011
ww-0.010
 importantly-0.009
www-0.009
dd-0.008
Token path
Token ID3108
Feature activation+3.95e-03

Pos logit contributions
 asleep+0.053
robe+0.052
 driveway+0.051
seek+0.049
ASH+0.048
Neg logit contributions
ww-0.076
mmm-0.074
www-0.071
 importantly-0.070
 annoying-0.062
Token?"
Token ID1701
Feature activation+1.29e-02

Pos logit contributions
 driveway+0.008
robe+0.008
seek+0.008
 asleep+0.007
ASH+0.007
Neg logit contributions
mmm-0.007
ww-0.007
www-0.006
 importantly-0.006
 hour-0.005
Token She
Token ID1375
Feature activation+8.53e-04

Pos logit contributions
 driveway+0.024
robe+0.024
 asleep+0.023
 bank+0.023
pit+0.023
Neg logit contributions
mmm-0.025
 importantly-0.023
ww-0.022
www-0.020
dd-0.020
Token he
Token ID339
Feature activation+2.01e-02

Pos logit contributions
mmm+0.045
ww+0.044
 importantly+0.042
www+0.039
dd+0.034
Neg logit contributions
 driveway-0.073
robe-0.072
 asleep-0.071
seek-0.071
stick-0.070
Token heard
Token ID2982
Feature activation+5.76e-03

Pos logit contributions
 driveway+0.037
robe+0.036
 asleep+0.035
stick+0.035
seek+0.034
Neg logit contributions
ww-0.038
mmm-0.037
 importantly-0.034
www-0.034
dd-0.032
Token a
Token ID257
Feature activation-2.47e-02

Pos logit contributions
 driveway+0.011
robe+0.011
pit+0.010
 asleep+0.010
seek+0.010
Neg logit contributions
mmm-0.011
ww-0.010
 importantly-0.010
www-0.009
dd-0.009
Token knock
Token ID10643
Feature activation-3.09e-03

Pos logit contributions
mmm+0.046
ww+0.043
dd+0.041
 importantly+0.041
www+0.039
Neg logit contributions
 asleep-0.046
 driveway-0.046
 edge-0.045
 bank-0.043
robe-0.043
Token on
Token ID319
Feature activation+1.56e-02

Pos logit contributions
mmm+0.007
ww+0.006
 importantly+0.006
www+0.006
dd+0.006
Neg logit contributions
 driveway-0.005
 asleep-0.005
robe-0.005
seek-0.005
 bank-0.005
Token the
Token ID262
Feature activation+8.34e-02

Pos logit contributions
robe+0.037
 driveway+0.036
pit+0.036
seek+0.036
 asleep+0.034
Neg logit contributions
mmm-0.021
ww-0.020
www-0.018
 importantly-0.018
dd-0.017
Token door
Token ID3420
Feature activation+1.89e-02

Pos logit contributions
seek+0.109
ASH+0.099
pit+0.095
izons+0.093
robe+0.093
Neg logit contributions
ww-0.184
mmm-0.178
 importantly-0.173
www-0.172
 painful-0.164
Token.
Token ID13
Feature activation+3.11e-02

Pos logit contributions
robe+0.038
 asleep+0.038
 driveway+0.038
seek+0.037
 bank+0.037
Neg logit contributions
mmm-0.031
ww-0.029
 importantly-0.029
www-0.027
 hour-0.024
Token He
Token ID679
Feature activation+1.16e-02

Pos logit contributions
 asleep+0.059
pit+0.058
robe+0.058
 driveway+0.057
seek+0.056
Neg logit contributions
mmm-0.056
ww-0.053
 importantly-0.050
www-0.049
Run-0.043
Token slowly
Token ID6364
Feature activation+9.51e-03

Pos logit contributions
 asleep+0.017
 driveway+0.017
robe+0.016
 bank+0.016
seek+0.015
Neg logit contributions
mmm-0.027
ww-0.026
 importantly-0.025
www-0.024
dd-0.022
Token opened
Token ID4721
Feature activation+2.65e-02

Pos logit contributions
robe+0.017
 driveway+0.017
 asleep+0.017
seek+0.016
pit+0.016
Neg logit contributions
ww-0.018
mmm-0.018
www-0.016
 importantly-0.016
dd-0.016
Token She
Token ID1375
Feature activation+1.88e-02

Pos logit contributions
 driveway+0.044
 bank+0.044
stick+0.043
seek+0.041
 grounds+0.040
Neg logit contributions
mmm-0.033
 importantly-0.028
ww-0.026
www-0.023
 much-0.023
Token heard
Token ID2982
Feature activation-1.08e-02

Pos logit contributions
 asleep+0.036
 driveway+0.036
robe+0.036
seek+0.035
 bank+0.034
Neg logit contributions
mmm-0.033
ww-0.032
 importantly-0.031
www-0.030
dd-0.028
Token a
Token ID257
Feature activation-2.05e-02

Pos logit contributions
mmm+0.018
ww+0.017
 importantly+0.017
www+0.015
 annoying+0.014
Neg logit contributions
 driveway-0.023
robe-0.022
seek-0.021
 bank-0.021
ASH-0.021
Token knock
Token ID10643
Feature activation-8.36e-03

Pos logit contributions
mmm+0.036
ww+0.034
 importantly+0.032
www+0.031
dd+0.030
Neg logit contributions
 asleep-0.042
 driveway-0.041
robe-0.040
 bank-0.039
 edge-0.039
Token on
Token ID319
Feature activation+8.03e-03

Pos logit contributions
mmm+0.017
ww+0.017
 importantly+0.015
www+0.015
dd+0.014
Neg logit contributions
 driveway-0.015
robe-0.014
 asleep-0.014
 bank-0.014
seek-0.014
Token the
Token ID262
Feature activation+8.34e-02

Pos logit contributions
 driveway+0.019
robe+0.019
seek+0.019
pit+0.018
 asleep+0.018
Neg logit contributions
mmm-0.010
ww-0.010
www-0.009
 importantly-0.009
dd-0.008
Token door
Token ID3420
Feature activation+1.13e-02

Pos logit contributions
seek+0.123
ASH+0.104
stick+0.102
IT+0.100
izons+0.099
Neg logit contributions
ww-0.165
mmm-0.161
 hour-0.158
 importantly-0.158
www-0.156
Token.
Token ID13
Feature activation+2.24e-02

Pos logit contributions
robe+0.022
 asleep+0.022
 driveway+0.022
 bank+0.022
seek+0.021
Neg logit contributions
mmm-0.019
ww-0.018
 importantly-0.018
www-0.017
 hour-0.016
Token She
Token ID1375
Feature activation+1.07e-02

Pos logit contributions
 driveway+0.042
 bank+0.042
seek+0.040
stick+0.040
 asleep+0.040
Neg logit contributions
mmm-0.043
ww-0.040
 importantly-0.039
www-0.036
Run-0.033
Token opened
Token ID4721
Feature activation+1.69e-02

Pos logit contributions
 asleep+0.017
 driveway+0.017
robe+0.016
 bank+0.015
seek+0.015
Neg logit contributions
mmm-0.023
ww-0.023
 importantly-0.022
www-0.021
dd-0.020
Token it
Token ID340
Feature activation+3.08e-02

Pos logit contributions
robe+0.044
 driveway+0.043
seek+0.043
 bank+0.042
 grounds+0.041
Neg logit contributions
mmm-0.018
ww-0.017
www-0.015
 importantly-0.014
dd-0.014
Token's
Token ID338
Feature activation+4.38e-02

Pos logit contributions
robe+0.048
ASH+0.044
 asleep+0.042
ags+0.042
seek+0.042
Neg logit contributions
mmm-0.040
ww-0.037
 importantly-0.035
www-0.034
 though-0.032
Token house
Token ID2156
Feature activation+1.42e-02

Pos logit contributions
seek+0.073
robe+0.070
 asleep+0.066
ASH+0.066
stick+0.063
Neg logit contributions
 importantly-0.080
ww-0.078
mmm-0.076
www-0.070
 hour-0.070
Token and
Token ID290
Feature activation-9.27e-03

Pos logit contributions
robe+0.032
 driveway+0.031
ASH+0.031
 asleep+0.031
seek+0.031
Neg logit contributions
mmm-0.016
 hour-0.015
 though-0.015
 importantly-0.015
ww-0.014
Token knock
Token ID10643
Feature activation+5.34e-02

Pos logit contributions
mmm+0.014
ww+0.014
 importantly+0.013
www+0.013
Run+0.011
Neg logit contributions
robe-0.019
seek-0.018
 asleep-0.018
ASH-0.018
 driveway-0.018
Token on
Token ID319
Feature activation+1.97e-02

Pos logit contributions
seek+0.085
 driveway+0.084
robe+0.084
 asleep+0.083
 bank+0.080
Neg logit contributions
mmm-0.111
ww-0.109
www-0.102
 importantly-0.102
dd-0.094
Token the
Token ID262
Feature activation+8.30e-02

Pos logit contributions
robe+0.039
 driveway+0.039
 asleep+0.039
seek+0.038
 bank+0.037
Neg logit contributions
mmm-0.034
ww-0.033
 importantly-0.031
www-0.030
dd-0.027
Token door
Token ID3420
Feature activation+3.12e-02

Pos logit contributions
seek+0.137
robe+0.110
ASH+0.106
pit+0.106
izons+0.103
Neg logit contributions
ww-0.169
 importantly-0.160
mmm-0.159
www-0.154
 hour-0.153
Token.
Token ID13
Feature activation+2.28e-02

Pos logit contributions
robe+0.065
 asleep+0.065
seek+0.064
 bank+0.063
 driveway+0.063
Neg logit contributions
mmm-0.047
ww-0.043
 importantly-0.042
www-0.040
 hour-0.039
Token "
Token ID366
Feature activation+1.48e-02

Pos logit contributions
 asleep+0.052
 driveway+0.051
robe+0.051
 bank+0.049
seek+0.049
Neg logit contributions
mmm-0.032
ww-0.029
 importantly-0.028
www-0.027
Run-0.023
TokenMail
Token ID25804
Feature activation+4.20e-03

Pos logit contributions
 asleep+0.026
robe+0.026
 driveway+0.025
 bank+0.025
seek+0.024
Neg logit contributions
ww-0.030
mmm-0.029
www-0.027
 importantly-0.027
dd-0.025
Token for
Token ID329
Feature activation-9.45e-03

Pos logit contributions
 driveway+0.008
 asleep+0.008
seek+0.008
robe+0.008
 bank+0.008
Neg logit contributions
ww-0.008
mmm-0.007
www-0.007
 importantly-0.007
dd-0.006
Token saw
Token ID2497
Feature activation+1.23e-03

Pos logit contributions
robe+0.039
 asleep+0.038
 driveway+0.037
seek+0.037
stick+0.035
Neg logit contributions
mmm-0.031
ww-0.030
www-0.028
 importantly-0.028
dd-0.026
Token a
Token ID257
Feature activation-2.07e-02

Pos logit contributions
 driveway+0.002
 asleep+0.002
robe+0.002
seek+0.002
pit+0.002
Neg logit contributions
ww-0.002
mmm-0.002
 importantly-0.002
www-0.002
dd-0.002
Token man
Token ID582
Feature activation-1.73e-02

Pos logit contributions
mmm+0.042
ww+0.041
 importantly+0.039
www+0.038
dd+0.036
Neg logit contributions
 asleep-0.036
 driveway-0.036
robe-0.035
 bank-0.034
seek-0.032
Token standing
Token ID5055
Feature activation+2.53e-02

Pos logit contributions
mmm+0.032
ww+0.031
 importantly+0.030
www+0.028
dd+0.026
Neg logit contributions
robe-0.032
 asleep-0.031
 driveway-0.031
seek-0.030
ASH-0.029
Token on
Token ID319
Feature activation+2.28e-02

Pos logit contributions
 driveway+0.046
seek+0.046
robe+0.046
 asleep+0.045
 bank+0.044
Neg logit contributions
mmm-0.045
ww-0.044
 importantly-0.043
www-0.041
dd-0.039
Token the
Token ID262
Feature activation+8.29e-02

Pos logit contributions
robe+0.050
 driveway+0.049
 asleep+0.048
pit+0.048
seek+0.047
Neg logit contributions
mmm-0.032
ww-0.031
www-0.028
 importantly-0.028
Run-0.026
Token corner
Token ID5228
Feature activation-7.90e-03

Pos logit contributions
 asleep+0.121
robe+0.119
pit+0.111
stick+0.106
seek+0.106
Neg logit contributions
ww-0.170
www-0.162
 annoying-0.158
 hour-0.157
mmm-0.156
Token.
Token ID13
Feature activation+2.93e-02

Pos logit contributions
mmm+0.011
ww+0.010
 importantly+0.010
www+0.009
dd+0.008
Neg logit contributions
 asleep-0.019
 driveway-0.018
robe-0.018
seek-0.018
 bank-0.018
Token He
Token ID679
Feature activation+2.64e-03

Pos logit contributions
robe+0.050
 asleep+0.050
 driveway+0.049
seek+0.047
pit+0.046
Neg logit contributions
mmm-0.058
ww-0.057
 importantly-0.054
www-0.053
Run-0.047
Token was
Token ID373
Feature activation-1.73e-02

Pos logit contributions
 driveway+0.004
 asleep+0.004
 bank+0.004
robe+0.003
 neighborhood+0.003
Neg logit contributions
mmm-0.006
ww-0.006
 importantly-0.006
www-0.005
dd-0.005
Token selling
Token ID6301
Feature activation+3.99e-03

Pos logit contributions
mmm+0.030
ww+0.030
 importantly+0.029
www+0.027
dd+0.025
Neg logit contributions
 driveway-0.033
robe-0.033
 asleep-0.033
seek-0.032
 bank-0.032
Token,
Token ID11
Feature activation+1.59e-02

Pos logit contributions
 driveway+0.042
 asleep+0.040
robe+0.040
seek+0.039
 bank+0.039
Neg logit contributions
mmm-0.036
ww-0.035
 importantly-0.033
www-0.032
dd-0.030
Token "
Token ID366
Feature activation+2.80e-02

Pos logit contributions
 driveway+0.028
robe+0.027
 asleep+0.027
seek+0.026
 bank+0.026
Neg logit contributions
mmm-0.032
ww-0.031
 importantly-0.029
www-0.028
dd-0.026
TokenCome
Token ID16773
Feature activation+4.64e-02

Pos logit contributions
 asleep+0.050
 driveway+0.050
robe+0.049
seek+0.047
 bank+0.047
Neg logit contributions
ww-0.055
mmm-0.054
www-0.050
 importantly-0.050
dd-0.046
Token see
Token ID766
Feature activation-6.90e-05

Pos logit contributions
 driveway+0.070
 asleep+0.069
robe+0.068
seek+0.065
 bank+0.064
Neg logit contributions
mmm-0.103
ww-0.102
 importantly-0.095
www-0.095
dd-0.088
Token what
Token ID644
Feature activation+2.51e-02

Pos logit contributions
mmm+0.000
ww+0.000
 importantly+0.000
www+0.000
dd+0.000
Neg logit contributions
 asleep-0.000
 driveway-0.000
robe-0.000
 bank-0.000
seek-0.000
Token I
Token ID314
Feature activation+8.29e-02

Pos logit contributions
 asleep+0.034
 driveway+0.034
robe+0.033
 bank+0.032
seek+0.031
Neg logit contributions
mmm-0.059
ww-0.057
 importantly-0.056
www-0.054
dd-0.051
Token can
Token ID460
Feature activation+5.52e-02

Pos logit contributions
 asleep+0.175
 driveway+0.174
seek+0.172
robe+0.167
ASH+0.162
Neg logit contributions
mmm-0.128
ww-0.126
www-0.116
 importantly-0.115
dd-0.103
Token do
Token ID466
Feature activation+1.84e-03

Pos logit contributions
 asleep+0.108
 driveway+0.106
robe+0.104
seek+0.100
ASH+0.100
Neg logit contributions
mmm-0.102
ww-0.091
www-0.086
 importantly-0.083
dd-0.078
Token!"
Token ID2474
Feature activation+1.64e-02

Pos logit contributions
 driveway+0.004
 asleep+0.004
robe+0.004
 bank+0.004
seek+0.004
Neg logit contributions
mmm-0.003
ww-0.003
 importantly-0.002
www-0.002
dd-0.002
Token Toby
Token ID37578
Feature activation-7.23e-04

Pos logit contributions
 asleep+0.031
 driveway+0.031
robe+0.030
seek+0.030
 bank+0.030
Neg logit contributions
mmm-0.029
ww-0.029
 importantly-0.027
www-0.026
dd-0.024
Token ran
Token ID4966
Feature activation+3.36e-02

Pos logit contributions
mmm+0.001
ww+0.001
 importantly+0.001
www+0.001
dd+0.001
Neg logit contributions
 asleep-0.001
 driveway-0.001
robe-0.001
seek-0.001
 bank-0.001
Token a
Token ID257
Feature activation-2.35e-02

Pos logit contributions
mmm+0.013
ww+0.012
 importantly+0.012
www+0.011
dd+0.010
Neg logit contributions
 driveway-0.017
robe-0.017
 asleep-0.016
seek-0.016
ASH-0.016
Token homeless
Token ID10463
Feature activation-7.89e-04

Pos logit contributions
mmm+0.046
ww+0.045
 importantly+0.042
www+0.041
dd+0.038
Neg logit contributions
 asleep-0.042
 driveway-0.042
robe-0.041
 bank-0.039
seek-0.039
Token man
Token ID582
Feature activation-1.05e-02

Pos logit contributions
ww+0.001
mmm+0.001
 importantly+0.001
www+0.001
dd+0.001
Neg logit contributions
 driveway-0.002
ASH-0.001
seek-0.001
robe-0.001
 asleep-0.001
Token sitting
Token ID5586
Feature activation+2.89e-02

Pos logit contributions
mmm+0.019
ww+0.018
 importantly+0.018
www+0.016
dd+0.016
Neg logit contributions
robe-0.020
 driveway-0.019
 asleep-0.019
seek-0.019
ASH-0.018
Token on
Token ID319
Feature activation+2.76e-02

Pos logit contributions
seek+0.060
 driveway+0.058
stick+0.056
 bank+0.056
robe+0.054
Neg logit contributions
 importantly-0.043
ww-0.042
 badly-0.042
mmm-0.041
dd-0.040
Token the
Token ID262
Feature activation+8.28e-02

Pos logit contributions
robe+0.059
pit+0.058
 driveway+0.058
seek+0.055
 grounds+0.055
Neg logit contributions
mmm-0.041
ww-0.040
www-0.036
 importantly-0.035
dd-0.034
Token side
Token ID1735
Feature activation+4.44e-03

Pos logit contributions
pit+0.114
robe+0.108
seek+0.106
ASH+0.105
 asleep+0.104
Neg logit contributions
ww-0.176
www-0.165
mmm-0.164
 hour-0.161
 annoying-0.159
Token of
Token ID286
Feature activation+1.24e-02

Pos logit contributions
 asleep+0.012
robe+0.012
 driveway+0.012
seek+0.012
 bank+0.012
Neg logit contributions
mmm-0.004
ww-0.004
 importantly-0.003
www-0.003
 hour-0.003
Token the
Token ID262
Feature activation+5.73e-02

Pos logit contributions
robe+0.029
seek+0.028
 driveway+0.027
pit+0.027
 grounds+0.027
Neg logit contributions
ww-0.017
mmm-0.016
 importantly-0.016
www-0.015
 hour-0.013
Token road
Token ID2975
Feature activation+1.60e-02

Pos logit contributions
robe+0.085
seek+0.084
pit+0.082
ASH+0.079
stick+0.079
Neg logit contributions
ww-0.122
www-0.113
 hour-0.110
mmm-0.108
 importantly-0.107
Token.
Token ID13
Feature activation+2.52e-02

Pos logit contributions
robe+0.035
 asleep+0.035
ASH+0.034
 driveway+0.033
seek+0.033
Neg logit contributions
mmm-0.020
 importantly-0.019
ww-0.018
 hour-0.018
 badly-0.017
Token and
Token ID290
Feature activation-3.55e-02

Pos logit contributions
mmm+0.047
ww+0.046
 importantly+0.045
www+0.042
dd+0.039
Neg logit contributions
 driveway-0.044
robe-0.043
 asleep-0.042
 bank-0.041
seek-0.041
Token proud
Token ID6613
Feature activation-4.81e-02

Pos logit contributions
mmm+0.056
ww+0.055
 importantly+0.052
www+0.050
dd+0.045
Neg logit contributions
 asleep-0.076
 driveway-0.076
robe-0.075
 bank-0.072
seek-0.072
Token of
Token ID286
Feature activation-2.26e-02

Pos logit contributions
mmm+0.082
ww+0.081
www+0.074
dd+0.065
 importantly+0.061
Neg logit contributions
 asleep-0.106
robe-0.096
seek-0.095
 driveway-0.094
pit-0.092
Token himself
Token ID2241
Feature activation-1.46e-02

Pos logit contributions
mmm+0.057
ww+0.056
 importantly+0.053
www+0.052
dd+0.049
Neg logit contributions
 asleep-0.029
 driveway-0.028
robe-0.028
seek-0.026
 bank-0.026
Token.
Token ID13
Feature activation+6.61e-03

Pos logit contributions
mmm+0.028
ww+0.028
 importantly+0.026
www+0.025
dd+0.023
Neg logit contributions
 driveway-0.027
 asleep-0.026
robe-0.026
seek-0.025
 bank-0.025
Token Because
Token ID4362
Feature activation-8.89e-02

Pos logit contributions
 asleep+0.011
 driveway+0.011
robe+0.011
seek+0.010
 bank+0.010
Neg logit contributions
mmm-0.014
ww-0.013
 importantly-0.013
www-0.012
dd-0.011
Token he
Token ID339
Feature activation+5.95e-03

Pos logit contributions
mmm+0.178
ww+0.176
www+0.165
dd+0.142
Run+0.137
Neg logit contributions
 asleep-0.159
 driveway-0.142
 back-0.140
robe-0.138
 neighborhood-0.138
Token was
Token ID373
Feature activation-4.01e-02

Pos logit contributions
 asleep+0.011
 driveway+0.011
robe+0.010
seek+0.010
 bank+0.010
Neg logit contributions
mmm-0.012
ww-0.011
 importantly-0.011
www-0.010
dd-0.010
Token persistent
Token ID16218
Feature activation-2.44e-02

Pos logit contributions
mmm+0.064
ww+0.063
 importantly+0.061
www+0.057
dd+0.052
Neg logit contributions
 driveway-0.086
robe-0.085
 asleep-0.083
 bank-0.082
seek-0.080
Token,
Token ID11
Feature activation-3.88e-02

Pos logit contributions
mmm+0.037
 importantly+0.037
ww+0.037
www+0.033
dd+0.030
Neg logit contributions
 driveway-0.054
robe-0.051
 bank-0.051
 asleep-0.051
seek-0.050
Token he
Token ID339
Feature activation+1.72e-03

Pos logit contributions
ww+0.068
mmm+0.067
www+0.061
 importantly+0.058
dd+0.054
Neg logit contributions
 asleep-0.079
 driveway-0.076
 bank-0.075
robe-0.074
seek-0.071
Token still
Token ID991
Feature activation+2.03e-02

Pos logit contributions
 asleep+0.003
 driveway+0.003
ASH+0.003
robe+0.003
seek+0.003
Neg logit contributions
mmm-0.002
ww-0.002
 importantly-0.002
www-0.002
 annoying-0.002
Token have
Token ID423
Feature activation-6.00e-03

Pos logit contributions
 driveway+0.043
 asleep+0.042
ASH+0.042
robe+0.042
pit+0.041
Neg logit contributions
ww-0.031
mmm-0.030
 importantly-0.028
www-0.027
 annoying-0.025
Token fun
Token ID1257
Feature activation-1.65e-02

Pos logit contributions
ww+0.009
 importantly+0.009
mmm+0.008
www+0.007
 annoying+0.007
Neg logit contributions
 driveway-0.014
robe-0.014
 asleep-0.013
 edge-0.013
pit-0.013
Token.
Token ID13
Feature activation+5.40e-03

Pos logit contributions
 importantly+0.027
mmm+0.026
ww+0.026
www+0.023
dd+0.022
Neg logit contributions
 driveway-0.034
robe-0.033
seek-0.032
 asleep-0.032
ASH-0.032
Token But
Token ID887
Feature activation-6.48e-02

Pos logit contributions
 driveway+0.006
 asleep+0.006
robe+0.006
seek+0.006
 neighborhood+0.006
Neg logit contributions
ww-0.014
mmm-0.014
www-0.013
 importantly-0.013
dd-0.012
Token most
Token ID749
Feature activation-8.86e-02

Pos logit contributions
ww+0.109
mmm+0.108
www+0.100
 importantly+0.097
dd+0.086
Neg logit contributions
 asleep-0.132
 driveway-0.129
robe-0.126
 bank-0.123
seek-0.121
Token of
Token ID286
Feature activation-2.74e-02

Pos logit contributions
mmm+0.152
ww+0.129
www+0.122
mmmm+0.104
Run+0.100
Neg logit contributions
 neighborhood-0.185
 neighbourhood-0.174
 asleep-0.170
 park-0.165
robe-0.164
Token all
Token ID477
Feature activation-4.32e-02

Pos logit contributions
mmm+0.059
ww+0.058
 importantly+0.055
www+0.054
dd+0.050
Neg logit contributions
 asleep-0.044
 driveway-0.044
robe-0.042
 bank-0.041
seek-0.041
Token,
Token ID11
Feature activation-4.62e-02

Pos logit contributions
mmm+0.100
ww+0.097
www+0.090
 importantly+0.088
dd+0.082
Neg logit contributions
 asleep-0.065
 driveway-0.060
robe-0.059
 bank-0.058
 grounds-0.056
Token they
Token ID484
Feature activation-2.02e-02

Pos logit contributions
mmm+0.071
ww+0.070
www+0.063
 importantly+0.062
dd+0.055
Neg logit contributions
 asleep-0.102
 driveway-0.101
robe-0.099
 bank-0.097
seek-0.095
Token have
Token ID423
Feature activation-1.64e-02

Pos logit contributions
mmm+0.035
ww+0.034
 importantly+0.032
www+0.031
dd+0.030
Neg logit contributions
 driveway-0.040
 asleep-0.039
ASH-0.039
robe-0.039
seek-0.038
Token of
Token ID286
Feature activation-1.42e-02

Pos logit contributions
ww+0.020
mmm+0.019
 importantly+0.019
www+0.018
dd+0.016
Neg logit contributions
robe-0.027
 driveway-0.026
seek-0.026
 bank-0.026
 asleep-0.026
Token her
Token ID607
Feature activation+2.89e-02

Pos logit contributions
mmm+0.023
 importantly+0.021
ww+0.021
www+0.019
dd+0.018
Neg logit contributions
 driveway-0.030
seek-0.030
robe-0.029
 asleep-0.028
 bank-0.028
Token toys
Token ID14958
Feature activation-1.72e-02

Pos logit contributions
seek+0.045
robe+0.042
stick+0.041
 driveway+0.041
 asleep+0.040
Neg logit contributions
 importantly-0.062
ww-0.060
mmm-0.060
www-0.055
 hour-0.054
Token.
Token ID13
Feature activation+6.52e-03

Pos logit contributions
mmm+0.026
ww+0.026
 importantly+0.025
www+0.024
 badly+0.023
Neg logit contributions
 driveway-0.035
ASH-0.034
seek-0.033
robe-0.033
 asleep-0.033
Token But
Token ID887
Feature activation-5.51e-02

Pos logit contributions
 asleep+0.011
 driveway+0.011
 bank+0.010
robe+0.010
seek+0.010
Neg logit contributions
mmm-0.013
ww-0.013
 importantly-0.013
www-0.012
dd-0.011
Token most
Token ID749
Feature activation-8.77e-02

Pos logit contributions
ww+0.098
mmm+0.096
www+0.089
 importantly+0.084
dd+0.077
Neg logit contributions
 asleep-0.113
 driveway-0.108
robe-0.105
 back-0.101
 bank-0.101
Token of
Token ID286
Feature activation-2.98e-02

Pos logit contributions
mmm+0.147
ww+0.127
www+0.118
Run+0.106
heit+0.102
Neg logit contributions
 neighborhood-0.185
 neighbourhood-0.174
 asleep-0.168
 garden-0.165
 grounds-0.164
Token all
Token ID477
Feature activation-4.21e-02

Pos logit contributions
mmm+0.070
ww+0.069
www+0.064
 importantly+0.064
dd+0.059
Neg logit contributions
 asleep-0.043
 driveway-0.042
robe-0.041
 bank-0.039
seek-0.038
Token,
Token ID11
Feature activation-4.02e-02

Pos logit contributions
mmm+0.085
ww+0.082
www+0.074
 importantly+0.071
dd+0.067
Neg logit contributions
 asleep-0.079
 driveway-0.072
 grounds-0.068
 kitchen-0.068
 back-0.068
Token she
Token ID673
Feature activation-1.05e-02

Pos logit contributions
mmm+0.065
ww+0.064
 importantly+0.059
www+0.057
dd+0.051
Neg logit contributions
 asleep-0.086
 driveway-0.085
robe-0.084
 bank-0.081
seek-0.081
Token was
Token ID373
Feature activation-3.02e-02

Pos logit contributions
mmm+0.018
ww+0.018
 importantly+0.017
www+0.016
dd+0.015
Neg logit contributions
 asleep-0.021
 driveway-0.021
robe-0.021
seek-0.020
 bank-0.020
Token if
Token ID611
Feature activation-5.22e-02

Pos logit contributions
mmm+0.089
ww+0.087
 importantly+0.082
www+0.079
dd+0.071
Neg logit contributions
 driveway-0.106
 asleep-0.104
robe-0.103
seek-0.100
 bank-0.098
Token you
Token ID345
Feature activation+2.02e-02

Pos logit contributions
mmm+0.059
ww+0.058
 importantly+0.053
www+0.049
dd+0.042
Neg logit contributions
 driveway-0.136
 asleep-0.136
robe-0.135
seek-0.131
 bank-0.130
Token're
Token ID821
Feature activation-1.70e-02

Pos logit contributions
 driveway+0.042
 asleep+0.041
robe+0.041
seek+0.040
ASH+0.039
Neg logit contributions
mmm-0.035
ww-0.033
www-0.030
 importantly-0.030
dd-0.027
Token nervous
Token ID10927
Feature activation-2.61e-02

Pos logit contributions
mmm+0.029
ww+0.029
 importantly+0.028
www+0.026
dd+0.024
Neg logit contributions
 driveway-0.034
robe-0.034
 asleep-0.033
seek-0.033
ASH-0.032
Token!
Token ID0
Feature activation+7.12e-03

Pos logit contributions
mmm+0.044
ww+0.043
 importantly+0.041
www+0.039
dd+0.035
Neg logit contributions
 driveway-0.054
robe-0.053
 asleep-0.053
seek-0.051
 bank-0.051
Token Because
Token ID4362
Feature activation-8.76e-02

Pos logit contributions
 asleep+0.010
 driveway+0.010
robe+0.009
seek+0.009
 bank+0.009
Neg logit contributions
ww-0.017
mmm-0.017
 importantly-0.016
www-0.015
dd-0.014
Token you
Token ID345
Feature activation-8.47e-03

Pos logit contributions
mmm+0.130
ww+0.129
www+0.115
 importantly+0.111
dd+0.102
Neg logit contributions
 asleep-0.196
 driveway-0.188
robe-0.186
 bank-0.182
seek-0.179
Token can
Token ID460
Feature activation+2.04e-02

Pos logit contributions
mmm+0.015
ww+0.014
 importantly+0.013
www+0.013
dd+0.012
Neg logit contributions
 asleep-0.017
 driveway-0.017
robe-0.017
seek-0.016
 bank-0.016
Token end
Token ID886
Feature activation+8.70e-03

Pos logit contributions
 asleep+0.041
robe+0.041
 driveway+0.039
ASH+0.038
seek+0.037
Neg logit contributions
mmm-0.039
ww-0.035
www-0.033
 importantly-0.033
dd-0.029
Token up
Token ID510
Feature activation-1.29e-02

Pos logit contributions
 driveway+0.018
robe+0.017
 asleep+0.017
seek+0.016
ASH+0.016
Neg logit contributions
mmm-0.016
ww-0.015
 importantly-0.015
www-0.014
dd-0.012
Token with
Token ID351
Feature activation-2.76e-02

Pos logit contributions
mmm+0.022
ww+0.021
 importantly+0.020
www+0.019
dd+0.017
Neg logit contributions
 driveway-0.027
robe-0.027
 asleep-0.026
seek-0.026
 bank-0.025
Token has
Token ID468
Feature activation+4.89e-03

Pos logit contributions
mmm+0.026
ww+0.025
 importantly+0.023
www+0.023
dd+0.021
Neg logit contributions
 asleep-0.021
 driveway-0.020
robe-0.020
 bank-0.019
seek-0.019
Token different
Token ID1180
Feature activation+2.88e-03

Pos logit contributions
 driveway+0.011
robe+0.011
 asleep+0.010
pit+0.010
 edge+0.010
Neg logit contributions
 importantly-0.007
ww-0.007
mmm-0.007
www-0.006
 annoying-0.006
Token tastes
Token ID18221
Feature activation-3.54e-02

Pos logit contributions
 driveway+0.005
robe+0.005
seek+0.005
 asleep+0.005
 bank+0.005
Neg logit contributions
mmm-0.006
 importantly-0.006
ww-0.005
www-0.005
dd-0.004
Token.
Token ID13
Feature activation-5.28e-03

Pos logit contributions
mmm+0.066
ww+0.064
www+0.059
 importantly+0.057
dd+0.053
Neg logit contributions
 asleep-0.069
robe-0.065
 driveway-0.063
 bank-0.062
seek-0.060
Token But
Token ID887
Feature activation-6.55e-02

Pos logit contributions
ww+0.013
mmm+0.013
www+0.012
 importantly+0.012
dd+0.012
Neg logit contributions
 asleep-0.007
 driveway-0.007
robe-0.007
seek-0.006
 neighborhood-0.006
Token most
Token ID749
Feature activation-8.76e-02

Pos logit contributions
ww+0.113
mmm+0.110
www+0.105
 importantly+0.102
dd+0.094
Neg logit contributions
 asleep-0.127
 driveway-0.120
robe-0.118
 bank-0.118
seek-0.117
Token importantly
Token ID11003
Feature activation-4.92e-02

Pos logit contributions
mmm+0.125
ww+0.119
www+0.113
cle+0.095
Run+0.093
Neg logit contributions
 neighborhood-0.190
 neighbourhood-0.180
 garden-0.173
 asleep-0.168
 gardens-0.167
Token,
Token ID11
Feature activation-5.36e-02

Pos logit contributions
mmm+0.092
ww+0.090
www+0.084
 importantly+0.082
 annoying+0.076
Neg logit contributions
 asleep-0.091
robe-0.085
 driveway-0.083
pit-0.080
 bank-0.079
Token she
Token ID673
Feature activation-1.01e-02

Pos logit contributions
mmm+0.082
ww+0.081
 importantly+0.074
www+0.073
dd+0.065
Neg logit contributions
 asleep-0.118
 driveway-0.116
robe-0.115
 bank-0.112
seek-0.111
Token learned
Token ID4499
Feature activation-2.26e-02

Pos logit contributions
mmm+0.018
ww+0.018
 importantly+0.017
www+0.016
dd+0.015
Neg logit contributions
 asleep-0.020
 driveway-0.020
robe-0.019
seek-0.019
 bank-0.018
Token to
Token ID284
Feature activation+1.87e-02

Pos logit contributions
mmm+0.034
ww+0.033
 importantly+0.031
www+0.030
dd+0.026
Neg logit contributions
 driveway-0.051
 asleep-0.051
robe-0.050
seek-0.048
 bank-0.048
Token.
Token ID13
Feature activation+1.95e-03

Pos logit contributions
mmm+0.003
 importantly+0.003
ww+0.003
www+0.002
 hour+0.002
Neg logit contributions
 driveway-0.004
ASH-0.004
robe-0.004
seek-0.004
 asleep-0.004
Token
Token ID220
Feature activation+1.89e-02

Pos logit contributions
 driveway+0.003
seek+0.003
robe+0.003
 asleep+0.003
 bank+0.003
Neg logit contributions
mmm-0.004
ww-0.004
 importantly-0.004
www-0.004
dd-0.003
Token
Token ID198
Feature activation-3.35e-02

Pos logit contributions
 driveway+0.036
 bank+0.036
 asleep+0.035
seek+0.035
pit+0.035
Neg logit contributions
mmm-0.035
 importantly-0.031
ww-0.030
www-0.027
dd-0.026
Token
Token ID198
Feature activation-4.55e-02

Pos logit contributions
mmm+0.076
ww+0.075
 importantly+0.071
www+0.069
dd+0.065
Neg logit contributions
 asleep-0.050
 driveway-0.049
robe-0.048
seek-0.046
 bank-0.046
TokenBut
Token ID1537
Feature activation-5.44e-02

Pos logit contributions
mmm+0.088
ww+0.086
www+0.079
 importantly+0.079
dd+0.070
Neg logit contributions
 asleep-0.081
 driveway-0.079
robe-0.078
seek-0.075
 bank-0.074
Token most
Token ID749
Feature activation-8.73e-02

Pos logit contributions
mmm+0.100
ww+0.098
 importantly+0.091
www+0.090
dd+0.081
Neg logit contributions
 asleep-0.106
 driveway-0.102
robe-0.101
 bank-0.098
seek-0.096
Token importantly
Token ID11003
Feature activation-4.96e-02

Pos logit contributions
mmm+0.138
ww+0.121
www+0.110
mmmm+0.095
Run+0.088
Neg logit contributions
 neighborhood-0.189
 neighbourhood-0.185
 asleep-0.179
 grounds-0.176
 gardens-0.173
Token,
Token ID11
Feature activation-5.48e-02

Pos logit contributions
mmm+0.095
ww+0.092
www+0.085
 importantly+0.082
 annoying+0.076
Neg logit contributions
 asleep-0.093
robe-0.086
 driveway-0.084
 bank-0.081
pit-0.081
Token they
Token ID484
Feature activation-1.32e-02

Pos logit contributions
mmm+0.096
ww+0.094
www+0.085
 importantly+0.083
dd+0.076
Neg logit contributions
 asleep-0.113
robe-0.107
 driveway-0.107
 bank-0.103
stick-0.100
Token discussed
Token ID6693
Feature activation-4.98e-03

Pos logit contributions
mmm+0.022
ww+0.022
 importantly+0.021
www+0.020
dd+0.019
Neg logit contributions
 driveway-0.027
 asleep-0.027
robe-0.026
seek-0.026
ASH-0.025
Token what
Token ID644
Feature activation-9.87e-03

Pos logit contributions
mmm+0.008
ww+0.007
 importantly+0.007
www+0.006
dd+0.006
Neg logit contributions
 driveway-0.011
robe-0.011
 asleep-0.011
seek-0.011
ASH-0.010
Token you
Token ID345
Feature activation+3.51e-02

Pos logit contributions
mmm+0.038
ww+0.037
 importantly+0.035
www+0.031
dd+0.028
Neg logit contributions
 driveway-0.090
robe-0.089
 asleep-0.087
seek-0.087
stick-0.086
Token need
Token ID761
Feature activation+1.20e-02

Pos logit contributions
 driveway+0.073
pit+0.070
robe+0.070
ASH+0.069
 asleep+0.067
Neg logit contributions
mmm-0.056
ww-0.053
 importantly-0.049
www-0.048
 annoying-0.046
Token it
Token ID340
Feature activation+2.02e-02

Pos logit contributions
 driveway+0.025
robe+0.025
pit+0.025
seek+0.024
 asleep+0.024
Neg logit contributions
mmm-0.020
ww-0.020
 importantly-0.018
www-0.018
dd-0.016
Token.
Token ID13
Feature activation+6.71e-03

Pos logit contributions
 driveway+0.045
ASH+0.041
seek+0.041
pit+0.041
robe+0.041
Neg logit contributions
mmm-0.031
ww-0.031
 importantly-0.029
www-0.027
dd-0.024
Token And
Token ID843
Feature activation-5.52e-02

Pos logit contributions
 asleep+0.010
 driveway+0.009
robe+0.009
seek+0.009
 bank+0.009
Neg logit contributions
ww-0.015
mmm-0.015
 importantly-0.014
www-0.014
dd-0.014
Token most
Token ID749
Feature activation-8.71e-02

Pos logit contributions
ww+0.087
mmm+0.086
www+0.079
 importantly+0.076
dd+0.071
Neg logit contributions
 asleep-0.119
robe-0.115
 driveway-0.114
 bank-0.111
seek-0.110
Token importantly
Token ID11003
Feature activation-5.07e-02

Pos logit contributions
mmm+0.102
ww+0.092
www+0.084
Run+0.069
mmmm+0.058
Neg logit contributions
 asleep-0.219
 neighborhood-0.213
 neighbourhood-0.207
robe-0.207
 garden-0.205
Token,
Token ID11
Feature activation-5.37e-02

Pos logit contributions
mmm+0.090
ww+0.088
www+0.081
 importantly+0.080
dd+0.072
Neg logit contributions
 asleep-0.101
robe-0.096
 driveway-0.095
 bank-0.090
seek-0.090
Token she
Token ID673
Feature activation-5.28e-03

Pos logit contributions
mmm+0.079
ww+0.078
 importantly+0.072
www+0.069
dd+0.061
Neg logit contributions
 driveway-0.121
 asleep-0.121
robe-0.119
seek-0.116
 bank-0.116
Token learned
Token ID4499
Feature activation-2.02e-02

Pos logit contributions
ww+0.009
mmm+0.009
 importantly+0.008
www+0.008
dd+0.008
Neg logit contributions
 driveway-0.011
 asleep-0.010
robe-0.010
seek-0.010
 bank-0.010
Token that
Token ID326
Feature activation-1.77e-02

Pos logit contributions
mmm+0.031
ww+0.031
 importantly+0.029
www+0.027
dd+0.024
Neg logit contributions
 driveway-0.045
 asleep-0.044
robe-0.043
 bank-0.042
seek-0.042
TokenIt
Token ID1026
Feature activation+3.32e-02

Pos logit contributions
mmm+0.001
ww+0.001
 importantly+0.001
www+0.001
dd+0.001
Neg logit contributions
 asleep-0.001
 driveway-0.001
robe-0.001
 bank-0.001
seek-0.001
Token's
Token ID338
Feature activation-1.18e-02

Pos logit contributions
 driveway+0.056
 asleep+0.055
seek+0.051
robe+0.050
 bank+0.050
Neg logit contributions
ww-0.071
mmm-0.068
www-0.065
 importantly-0.064
 annoying-0.058
Token okay
Token ID8788
Feature activation-5.01e-02

Pos logit contributions
ww+0.023
Run+0.022
www+0.021
mmm+0.021
 importantly+0.021
Neg logit contributions
 back-0.021
 asleep-0.020
robe-0.019
ags-0.019
 kitchen-0.019
Token,
Token ID11
Feature activation-3.16e-02

Pos logit contributions
mmm+0.112
ww+0.110
 importantly+0.105
www+0.105
dd+0.104
Neg logit contributions
 asleep-0.076
 back-0.071
robe-0.066
 bathroom-0.065
 kitchen-0.064
Token my
Token ID616
Feature activation-1.63e-02

Pos logit contributions
ww+0.055
mmm+0.055
www+0.050
 importantly+0.050
dd+0.049
Neg logit contributions
 asleep-0.064
 driveway-0.061
robe-0.059
 bathroom-0.057
 kitchen-0.057
Token de
Token ID390
Feature activation-8.70e-02

Pos logit contributions
mmm+0.028
ww+0.027
 importantly+0.025
www+0.025
dd+0.023
Neg logit contributions
 asleep-0.033
 driveway-0.033
robe-0.033
 bank-0.031
seek-0.031
Tokenars
Token ID945
Feature activation-2.24e-02

Pos logit contributions
 importantly+0.195
mmm+0.189
ww+0.181
www+0.170
 annoying+0.163
Neg logit contributions
seek-0.111
robe-0.104
 bank-0.102
 driveway-0.101
 grounds-0.099
Token.
Token ID13
Feature activation-7.88e-04

Pos logit contributions
mmm+0.036
ww+0.035
 importantly+0.032
www+0.031
dd+0.028
Neg logit contributions
 driveway-0.047
robe-0.044
 asleep-0.044
 bank-0.044
seek-0.044
Token I
Token ID314
Feature activation+2.32e-02

Pos logit contributions
mmm+0.001
ww+0.001
 importantly+0.001
www+0.001
dd+0.001
Neg logit contributions
 asleep-0.002
 driveway-0.002
robe-0.002
seek-0.002
 bank-0.002
Token know
Token ID760
Feature activation-4.84e-02

Pos logit contributions
 asleep+0.044
 driveway+0.043
robe+0.042
seek+0.040
 grounds+0.040
Neg logit contributions
mmm-0.044
ww-0.043
 importantly-0.040
www-0.039
dd-0.037
Token you
Token ID345
Feature activation+2.73e-02

Pos logit contributions
ww+0.072
mmm+0.070
www+0.065
 importantly+0.062
dd+0.058
Neg logit contributions
 asleep-0.109
 driveway-0.104
robe-0.103
seek-0.101
 bank-0.100
Token would
Token ID561
Feature activation+2.98e-02

Pos logit contributions
mmm+0.008
ww+0.007
 importantly+0.007
www+0.007
dd+0.006
Neg logit contributions
 asleep-0.008
 driveway-0.008
robe-0.008
 bank-0.008
seek-0.007
Token still
Token ID991
Feature activation+2.25e-02

Pos logit contributions
robe+0.053
 driveway+0.052
 asleep+0.052
seek+0.051
ASH+0.051
Neg logit contributions
mmm-0.056
ww-0.055
 importantly-0.052
www-0.051
 annoying-0.046
Token be
Token ID307
Feature activation-2.02e-02

Pos logit contributions
robe+0.041
 asleep+0.040
 driveway+0.040
seek+0.039
ASH+0.039
Neg logit contributions
mmm-0.042
ww-0.041
 importantly-0.039
www-0.037
 annoying-0.034
Token beautiful
Token ID4950
Feature activation-2.73e-02

Pos logit contributions
ww+0.028
 importantly+0.028
mmm+0.027
www+0.025
dd+0.023
Neg logit contributions
robe-0.046
 driveway-0.046
seek-0.045
 bank-0.044
 asleep-0.044
Token.
Token ID13
Feature activation+8.03e-03

Pos logit contributions
mmm+0.053
ww+0.053
 importantly+0.051
www+0.048
dd+0.044
Neg logit contributions
 driveway-0.049
robe-0.046
seek-0.045
 bank-0.045
 asleep-0.044
Token Because
Token ID4362
Feature activation-8.63e-02

Pos logit contributions
 asleep+0.012
 driveway+0.012
robe+0.011
seek+0.011
 bank+0.011
Neg logit contributions
ww-0.018
mmm-0.018
 importantly-0.017
www-0.017
dd-0.016
Token beauty
Token ID8737
Feature activation-3.40e-02

Pos logit contributions
mmm+0.135
ww+0.131
www+0.118
 importantly+0.113
dd+0.105
Neg logit contributions
 asleep-0.193
robe-0.183
 driveway-0.181
 bank-0.172
seek-0.171
Token is
Token ID318
Feature activation-4.65e-02

Pos logit contributions
mmm+0.059
ww+0.058
 importantly+0.054
www+0.053
dd+0.048
Neg logit contributions
 asleep-0.068
 driveway-0.067
robe-0.066
seek-0.064
 bank-0.064
Token not
Token ID407
Feature activation-5.00e-02

Pos logit contributions
mmm+0.080
ww+0.079
 importantly+0.074
www+0.071
dd+0.065
Neg logit contributions
 driveway-0.093
 asleep-0.093
robe-0.091
seek-0.089
 bank-0.088
Token about
Token ID546
Feature activation-9.45e-03

Pos logit contributions
mmm+0.082
ww+0.082
 importantly+0.078
www+0.074
dd+0.066
Neg logit contributions
 driveway-0.104
robe-0.101
 asleep-0.100
seek-0.100
 bank-0.098
Token what
Token ID644
Feature activation-6.72e-03

Pos logit contributions
mmm+0.016
ww+0.016
 importantly+0.016
www+0.014
 annoying+0.013
Neg logit contributions
 driveway-0.019
robe-0.018
seek-0.018
ASH-0.017
 bank-0.017
Token he
Token ID339
Feature activation+1.02e-02

Pos logit contributions
ww+0.037
mmm+0.036
 importantly+0.036
www+0.033
 annoying+0.030
Neg logit contributions
seek-0.057
robe-0.057
 driveway-0.056
ASH-0.055
pit-0.055
Token is
Token ID318
Feature activation-3.12e-02

Pos logit contributions
 driveway+0.021
robe+0.020
 asleep+0.020
seek+0.020
pit+0.020
Neg logit contributions
ww-0.017
mmm-0.016
 importantly-0.015
www-0.015
dd-0.014
Token so
Token ID523
Feature activation-5.33e-02

Pos logit contributions
ww+0.046
mmm+0.045
 importantly+0.045
www+0.041
dd+0.038
Neg logit contributions
 driveway-0.068
robe-0.068
seek-0.067
ASH-0.066
 bank-0.065
Token happy
Token ID3772
Feature activation-2.91e-02

Pos logit contributions
mmm+0.074
ww+0.072
 importantly+0.066
www+0.064
dd+0.056
Neg logit contributions
 asleep-0.126
 driveway-0.125
robe-0.123
seek-0.120
 bank-0.120
Token.
Token ID13
Feature activation+8.27e-03

Pos logit contributions
mmm+0.051
ww+0.050
 importantly+0.048
www+0.045
dd+0.042
Neg logit contributions
 driveway-0.058
robe-0.057
 asleep-0.056
 bank-0.055
seek-0.054
Token Because
Token ID4362
Feature activation-8.57e-02

Pos logit contributions
 asleep+0.013
 driveway+0.013
robe+0.012
seek+0.012
 bank+0.012
Neg logit contributions
mmm-0.018
ww-0.018
 importantly-0.017
www-0.017
dd-0.015
Token he
Token ID339
Feature activation+3.27e-03

Pos logit contributions
mmm+0.139
ww+0.136
www+0.124
dd+0.106
Run+0.105
Neg logit contributions
 asleep-0.193
robe-0.176
 driveway-0.174
 bank-0.168
 neighborhood-0.164
Token was
Token ID373
Feature activation-3.75e-02

Pos logit contributions
 driveway+0.006
 asleep+0.006
robe+0.006
seek+0.006
 bank+0.006
Neg logit contributions
mmm-0.006
ww-0.006
 importantly-0.005
www-0.005
dd-0.005
Token an
Token ID281
Feature activation-4.67e-02

Pos logit contributions
mmm+0.055
ww+0.054
 importantly+0.054
www+0.049
dd+0.045
Neg logit contributions
 driveway-0.084
robe-0.083
 bank-0.080
 asleep-0.080
seek-0.079
Token intelligent
Token ID12661
Feature activation-3.00e-02

Pos logit contributions
ww+0.095
mmm+0.092
www+0.082
 importantly+0.080
Run+0.078
Neg logit contributions
 back-0.082
 driveway-0.079
iddle-0.076
 bank-0.075
 asleep-0.075
Token bear
Token ID6842
Feature activation-3.05e-02

Pos logit contributions
mmm+0.069
ww+0.068
 importantly+0.064
www+0.063
dd+0.059
Neg logit contributions
 asleep-0.044
 driveway-0.043
robe-0.043
 bank-0.040
seek-0.040
Token's
Token ID338
Feature activation+1.83e-02

Pos logit contributions
 driveway+0.006
robe+0.006
 asleep+0.006
seek+0.006
ASH+0.006
Neg logit contributions
mmm-0.006
ww-0.006
 importantly-0.006
www-0.006
 badly-0.005
Token courage
Token ID11917
Feature activation-2.74e-03

Pos logit contributions
robe+0.037
 asleep+0.036
 driveway+0.035
seek+0.035
pit+0.035
Neg logit contributions
 importantly-0.030
mmm-0.029
ww-0.028
 hour-0.027
 annoying-0.026
Token.
Token ID13
Feature activation+7.56e-03

Pos logit contributions
mmm+0.004
 importantly+0.004
ww+0.004
www+0.004
dd+0.004
Neg logit contributions
 driveway-0.006
robe-0.005
ASH-0.005
 asleep-0.005
seek-0.005
Token The
Token ID383
Feature activation-1.52e-02

Pos logit contributions
 asleep+0.014
robe+0.014
 driveway+0.013
pit+0.013
seek+0.013
Neg logit contributions
mmm-0.015
ww-0.014
 importantly-0.014
www-0.013
dd-0.012
Token two
Token ID734
Feature activation-3.77e-02

Pos logit contributions
mmm+0.028
ww+0.027
 importantly+0.025
www+0.025
dd+0.022
Neg logit contributions
 asleep-0.030
 driveway-0.029
robe-0.029
 bank-0.028
seek-0.028
Token de
Token ID390
Feature activation-8.56e-02

Pos logit contributions
mmm+0.086
ww+0.082
www+0.077
 importantly+0.074
dd+0.069
Neg logit contributions
 asleep-0.059
 driveway-0.056
 bank-0.056
 grounds-0.053
robe-0.053
Tokeners
Token ID364
Feature activation-1.83e-02

Pos logit contributions
mmm+0.171
ww+0.166
 importantly+0.164
www+0.154
 annoying+0.139
Neg logit contributions
seek-0.133
 bank-0.126
 driveway-0.126
robe-0.125
 grounds-0.123
Token became
Token ID2627
Feature activation-3.07e-02

Pos logit contributions
mmm+0.032
ww+0.032
 importantly+0.030
www+0.029
dd+0.028
Neg logit contributions
 asleep-0.035
robe-0.035
 driveway-0.034
seek-0.033
ASH-0.032
Token good
Token ID922
Feature activation-1.90e-02

Pos logit contributions
mmm+0.063
ww+0.061
www+0.057
 importantly+0.057
dd+0.052
Neg logit contributions
 asleep-0.054
 driveway-0.052
robe-0.051
 bank-0.050
seek-0.049
Token friends
Token ID2460
Feature activation-1.94e-02

Pos logit contributions
 importantly+0.036
ww+0.036
mmm+0.036
www+0.033
dd+0.031
Neg logit contributions
 driveway-0.034
robe-0.034
 asleep-0.033
seek-0.032
ASH-0.031
Token after
Token ID706
Feature activation-3.99e-02

Pos logit contributions
mmm+0.039
ww+0.038
 importantly+0.035
www+0.035
dd+0.032
Neg logit contributions
 asleep-0.034
 driveway-0.033
robe-0.033
seek-0.032
 bank-0.032
Token are
Token ID389
Feature activation-3.12e-02

Pos logit contributions
 driveway+0.035
 asleep+0.034
robe+0.034
seek+0.033
ASH+0.031
Neg logit contributions
ww-0.046
mmm-0.046
 importantly-0.043
www-0.043
dd-0.039
Token not
Token ID407
Feature activation-3.79e-02

Pos logit contributions
mmm+0.060
ww+0.059
www+0.055
 importantly+0.055
dd+0.054
Neg logit contributions
 asleep-0.054
robe-0.050
 bank-0.050
 kitchen-0.050
 driveway-0.049
Token flowers
Token ID12734
Feature activation-3.72e-02

Pos logit contributions
mmm+0.069
ww+0.068
 importantly+0.064
www+0.062
dd+0.057
Neg logit contributions
 asleep-0.073
 driveway-0.071
robe-0.070
 bank-0.068
seek-0.067
Token,
Token ID11
Feature activation-5.79e-02

Pos logit contributions
mmm+0.065
ww+0.064
 importantly+0.060
www+0.058
dd+0.053
Neg logit contributions
 asleep-0.073
 driveway-0.073
robe-0.072
seek-0.069
 bank-0.069
Token my
Token ID616
Feature activation-3.95e-03

Pos logit contributions
ww+0.102
mmm+0.093
www+0.092
dd+0.090
 importantly+0.089
Neg logit contributions
 asleep-0.105
 bathroom-0.104
 kitchen-0.101
 edge-0.100
robe-0.100
Token de
Token ID390
Feature activation-8.55e-02

Pos logit contributions
mmm+0.008
ww+0.008
 importantly+0.007
www+0.007
dd+0.006
Neg logit contributions
 driveway-0.007
 asleep-0.007
seek-0.007
robe-0.007
stick-0.006
Tokenars
Token ID945
Feature activation-2.38e-02

Pos logit contributions
 importantly+0.189
mmm+0.183
ww+0.180
www+0.166
 annoying+0.156
Neg logit contributions
seek-0.110
 bank-0.108
 grounds-0.108
 driveway-0.106
robe-0.105
Token.
Token ID13
Feature activation+2.46e-03

Pos logit contributions
mmm+0.042
ww+0.041
 importantly+0.038
www+0.037
dd+0.034
Neg logit contributions
 driveway-0.047
 asleep-0.047
robe-0.046
seek-0.045
 bank-0.044
Token Those
Token ID5845
Feature activation+2.44e-02

Pos logit contributions
 asleep+0.004
 driveway+0.004
robe+0.004
 bank+0.004
seek+0.004
Neg logit contributions
mmm-0.005
ww-0.005
 importantly-0.005
www-0.005
dd-0.004
Token are
Token ID389
Feature activation-3.39e-02

Pos logit contributions
 driveway+0.037
 asleep+0.037
robe+0.037
seek+0.035
ASH+0.033
Neg logit contributions
ww-0.054
mmm-0.054
 importantly-0.050
www-0.050
dd-0.045
Token caul
Token ID46087
Feature activation-3.53e-02

Pos logit contributions
mmm+0.064
ww+0.062
www+0.057
 importantly+0.057
dd+0.053
Neg logit contributions
 asleep-0.064
 driveway-0.060
robe-0.060
 bank-0.060
 edge-0.057
Token to
Token ID284
Feature activation+1.70e-02

Pos logit contributions
mmm+0.037
ww+0.037
 importantly+0.036
www+0.035
dd+0.032
Neg logit contributions
 driveway-0.025
 asleep-0.024
robe-0.024
seek-0.023
 bank-0.023
Token help
Token ID1037
Feature activation+2.95e-03

Pos logit contributions
 asleep+0.030
 driveway+0.030
robe+0.030
seek+0.028
 bank+0.028
Neg logit contributions
mmm-0.035
ww-0.034
 importantly-0.032
www-0.031
dd-0.028
Token other
Token ID584
Feature activation-8.52e-03

Pos logit contributions
 asleep+0.005
 driveway+0.005
robe+0.005
seek+0.005
 bank+0.005
Neg logit contributions
mmm-0.006
ww-0.006
 importantly-0.005
www-0.005
dd-0.005
Token people
Token ID661
Feature activation-8.70e-03

Pos logit contributions
mmm+0.017
ww+0.016
 importantly+0.016
www+0.015
dd+0.013
Neg logit contributions
 driveway-0.015
robe-0.015
 asleep-0.015
seek-0.015
 bank-0.014
Token,
Token ID11
Feature activation-2.71e-02

Pos logit contributions
mmm+0.015
ww+0.014
 importantly+0.014
www+0.013
dd+0.012
Neg logit contributions
 driveway-0.017
robe-0.017
 asleep-0.017
seek-0.017
ASH-0.017
Token because
Token ID780
Feature activation-8.53e-02

Pos logit contributions
mmm+0.045
ww+0.045
 importantly+0.041
www+0.041
dd+0.037
Neg logit contributions
 asleep-0.056
 driveway-0.055
robe-0.054
 bank-0.053
seek-0.052
Token it
Token ID340
Feature activation+1.59e-02

Pos logit contributions
mmm+0.123
ww+0.122
www+0.108
 importantly+0.108
dd+0.097
Neg logit contributions
 asleep-0.194
 driveway-0.191
robe-0.188
 bank-0.182
seek-0.180
Token makes
Token ID1838
Feature activation-1.05e-02

Pos logit contributions
 driveway+0.027
 asleep+0.027
seek+0.026
robe+0.026
 bank+0.025
Neg logit contributions
mmm-0.033
ww-0.032
 importantly-0.031
www-0.030
dd-0.027
Token them
Token ID606
Feature activation-3.43e-04

Pos logit contributions
mmm+0.018
ww+0.017
 importantly+0.016
www+0.016
dd+0.014
Neg logit contributions
 driveway-0.022
 asleep-0.022
robe-0.021
seek-0.021
 bank-0.021
Token happy
Token ID3772
Feature activation-7.13e-03

Pos logit contributions
mmm+0.001
ww+0.001
 importantly+0.001
www+0.001
 annoying+0.000
Neg logit contributions
 driveway-0.001
robe-0.001
seek-0.001
 asleep-0.001
ASH-0.001
Token.
Token ID13
Feature activation-4.73e-03

Pos logit contributions
mmm+0.013
ww+0.012
 importantly+0.012
www+0.011
 hour+0.010
Neg logit contributions
 driveway-0.014
seek-0.014
robe-0.014
 bank-0.013
 asleep-0.013
Token were
Token ID547
Feature activation-3.22e-02

Pos logit contributions
mmm+0.023
 importantly+0.022
ww+0.022
www+0.020
dd+0.020
Neg logit contributions
seek-0.031
ASH-0.030
 driveway-0.030
robe-0.030
 asleep-0.029
Token safe
Token ID3338
Feature activation+3.54e-03

Pos logit contributions
mmm+0.053
ww+0.053
 importantly+0.051
www+0.048
dd+0.044
Neg logit contributions
 driveway-0.065
robe-0.065
seek-0.063
 asleep-0.063
 bank-0.062
Token and
Token ID290
Feature activation-3.83e-02

Pos logit contributions
 driveway+0.008
seek+0.008
robe+0.008
ASH+0.008
izons+0.007
Neg logit contributions
 importantly-0.005
ww-0.004
mmm-0.004
 badly-0.004
 though-0.004
Token happy
Token ID3772
Feature activation-1.35e-02

Pos logit contributions
mmm+0.061
ww+0.061
 importantly+0.058
www+0.055
dd+0.049
Neg logit contributions
 driveway-0.081
 asleep-0.081
robe-0.080
seek-0.078
 bank-0.076
Token.
Token ID13
Feature activation+6.72e-03

Pos logit contributions
 importantly+0.022
mmm+0.022
ww+0.021
www+0.019
dd+0.018
Neg logit contributions
 driveway-0.029
robe-0.027
 bank-0.027
ASH-0.026
 workshop-0.026
Token Because
Token ID4362
Feature activation-8.51e-02

Pos logit contributions
 driveway+0.012
 asleep+0.012
robe+0.011
seek+0.011
pit+0.011
Neg logit contributions
mmm-0.013
ww-0.013
 importantly-0.013
www-0.012
dd-0.011
Token Bloom
Token ID11891
Feature activation-5.28e-02

Pos logit contributions
mmm+0.151
ww+0.142
www+0.132
 importantly+0.118
dd+0.117
Neg logit contributions
 asleep-0.181
robe-0.166
 driveway-0.156
stick-0.156
 bank-0.156
Token warned
Token ID7728
Feature activation-1.32e-02

Pos logit contributions
mmm+0.110
ww+0.106
 importantly+0.101
www+0.098
 hour+0.088
Neg logit contributions
 asleep-0.087
robe-0.085
 bank-0.082
 driveway-0.081
seek-0.080
Token them
Token ID606
Feature activation-7.88e-03

Pos logit contributions
mmm+0.023
ww+0.022
 importantly+0.021
www+0.020
dd+0.018
Neg logit contributions
 driveway-0.028
robe-0.026
 asleep-0.026
pit-0.025
ASH-0.025
Token,
Token ID11
Feature activation-1.88e-02

Pos logit contributions
mmm+0.013
ww+0.012
 importantly+0.012
www+0.011
dd+0.010
Neg logit contributions
 driveway-0.017
 asleep-0.016
robe-0.016
ASH-0.016
seek-0.016
Token they
Token ID484
Feature activation-7.97e-04

Pos logit contributions
mmm+0.035
ww+0.034
 importantly+0.032
www+0.031
dd+0.029
Neg logit contributions
 driveway-0.035
 asleep-0.035
robe-0.035
seek-0.033
 bank-0.033
Token special
Token ID2041
Feature activation+3.26e-02

Pos logit contributions
 driveway+0.056
 asleep+0.055
robe+0.054
seek+0.054
stick+0.052
Neg logit contributions
ww-0.066
mmm-0.064
 importantly-0.061
www-0.060
dd-0.054
Token shell
Token ID7582
Feature activation+1.31e-02

Pos logit contributions
 driveway+0.056
 asleep+0.055
seek+0.054
robe+0.053
ASH+0.052
Neg logit contributions
ww-0.066
mmm-0.063
 importantly-0.061
www-0.060
dd-0.054
Token with
Token ID351
Feature activation-1.53e-02

Pos logit contributions
 driveway+0.019
ASH+0.018
seek+0.017
 neighborhood+0.016
IT+0.016
Neg logit contributions
ww-0.028
mmm-0.028
 importantly-0.026
 badly-0.026
www-0.025
Token him
Token ID683
Feature activation+4.11e-04

Pos logit contributions
mmm+0.027
ww+0.026
 importantly+0.026
www+0.024
dd+0.023
Neg logit contributions
 driveway-0.030
robe-0.029
 asleep-0.029
seek-0.029
pit-0.028
Token.
Token ID13
Feature activation+2.37e-02

Pos logit contributions
 driveway+0.001
seek+0.001
 asleep+0.001
robe+0.001
ASH+0.001
Neg logit contributions
mmm-0.001
ww-0.001
 importantly-0.001
www-0.001
dd-0.001
Token Because
Token ID4362
Feature activation-8.50e-02

Pos logit contributions
 asleep+0.042
 driveway+0.042
robe+0.041
seek+0.040
 bank+0.040
Neg logit contributions
mmm-0.047
ww-0.044
 importantly-0.043
www-0.040
dd-0.035
Token it
Token ID340
Feature activation+6.91e-03

Pos logit contributions
mmm+0.135
ww+0.130
www+0.117
 importantly+0.109
dd+0.100
Neg logit contributions
 asleep-0.189
robe-0.179
 driveway-0.178
 bank-0.173
stick-0.168
Token was
Token ID373
Feature activation-3.03e-02

Pos logit contributions
 driveway+0.013
 asleep+0.013
robe+0.012
seek+0.012
ASH+0.012
Neg logit contributions
mmm-0.013
ww-0.013
 importantly-0.012
www-0.012
dd-0.011
Token always
Token ID1464
Feature activation-1.35e-02

Pos logit contributions
mmm+0.054
ww+0.053
 importantly+0.050
www+0.048
dd+0.044
Neg logit contributions
 driveway-0.059
 asleep-0.059
robe-0.058
seek-0.056
 bank-0.055
Token a
Token ID257
Feature activation-3.41e-02

Pos logit contributions
ww+0.023
mmm+0.023
 importantly+0.022
www+0.020
dd+0.018
Neg logit contributions
 driveway-0.028
robe-0.027
 asleep-0.027
seek-0.027
 bank-0.026
Token reminder
Token ID15438
Feature activation-9.33e-03

Pos logit contributions
mmm+0.060
ww+0.058
 importantly+0.054
www+0.053
dd+0.048
Neg logit contributions
 asleep-0.068
 driveway-0.068
robe-0.066
 bank-0.065
seek-0.063
Token was
Token ID373
Feature activation-1.82e-02

Pos logit contributions
 driveway+0.052
 asleep+0.050
seek+0.048
 bank+0.047
robe+0.047
Neg logit contributions
ww-0.068
mmm-0.065
www-0.062
 importantly-0.061
dd-0.055
Token a
Token ID257
Feature activation-1.79e-02

Pos logit contributions
ww+0.040
mmm+0.039
www+0.037
 importantly+0.036
Run+0.035
Neg logit contributions
 asleep-0.031
robe-0.027
 back-0.027
 driveway-0.027
 bathroom-0.026
Token dream
Token ID4320
Feature activation-1.02e-02

Pos logit contributions
mmm+0.032
ww+0.030
 importantly+0.029
www+0.028
dd+0.026
Neg logit contributions
 asleep-0.036
 driveway-0.035
robe-0.035
 bank-0.033
seek-0.033
Token,
Token ID11
Feature activation-3.00e-02

Pos logit contributions
mmm+0.015
ww+0.014
 importantly+0.013
www+0.013
 hour+0.011
Neg logit contributions
 driveway-0.024
 asleep-0.023
robe-0.023
ASH-0.023
 bank-0.022
Token my
Token ID616
Feature activation-1.35e-02

Pos logit contributions
ww+0.057
mmm+0.055
www+0.053
dd+0.051
 importantly+0.049
Neg logit contributions
 asleep-0.057
robe-0.054
 driveway-0.053
 bathroom-0.051
pit-0.050
Token de
Token ID390
Feature activation-8.47e-02

Pos logit contributions
mmm+0.025
ww+0.025
 importantly+0.023
www+0.023
dd+0.021
Neg logit contributions
 asleep-0.025
 driveway-0.025
robe-0.025
seek-0.024
 bank-0.024
Tokenars
Token ID945
Feature activation-2.31e-02

Pos logit contributions
 importantly+0.188
mmm+0.187
ww+0.180
www+0.170
 annoying+0.158
Neg logit contributions
seek-0.109
robe-0.107
 driveway-0.104
 bank-0.101
 grounds-0.100
Token,"
Token ID553
Feature activation-4.20e-02

Pos logit contributions
mmm+0.038
ww+0.037
 importantly+0.034
www+0.033
dd+0.030
Neg logit contributions
 driveway-0.049
 asleep-0.047
robe-0.047
seek-0.046
ASH-0.045
Token she
Token ID673
Feature activation-4.52e-04

Pos logit contributions
ww+0.079
mmm+0.077
www+0.073
 importantly+0.068
dd+0.067
Neg logit contributions
 asleep-0.080
 driveway-0.076
robe-0.075
 bank-0.074
 bathroom-0.074
Token said
Token ID531
Feature activation+1.36e-03

Pos logit contributions
mmm+0.001
ww+0.001
 importantly+0.001
www+0.001
dd+0.001
Neg logit contributions
 asleep-0.001
 driveway-0.001
robe-0.001
 bank-0.001
seek-0.001
Token.
Token ID13
Feature activation+1.71e-02

Pos logit contributions
 driveway+0.003
robe+0.003
 bank+0.003
 asleep+0.003
ASH+0.003
Neg logit contributions
mmm-0.002
 importantly-0.002
ww-0.002
www-0.002
dd-0.002
Token gave
Token ID2921
Feature activation+4.52e-03

Pos logit contributions
mmm+0.001
 importantly+0.001
ww+0.001
www+0.001
 badly+0.001
Neg logit contributions
robe-0.001
 asleep-0.001
 driveway-0.001
ags-0.001
ASH-0.001
Token him
Token ID683
Feature activation-1.75e-03

Pos logit contributions
 driveway+0.011
robe+0.010
seek+0.010
 asleep+0.010
pit+0.010
Neg logit contributions
ww-0.006
mmm-0.006
www-0.006
dd-0.006
 importantly-0.005
Token one
Token ID530
Feature activation-4.53e-03

Pos logit contributions
ww+0.003
mmm+0.003
 importantly+0.003
www+0.003
dd+0.003
Neg logit contributions
 driveway-0.003
robe-0.003
 asleep-0.003
seek-0.003
ASH-0.003
Token banana
Token ID25996
Feature activation-7.35e-03

Pos logit contributions
mmm+0.009
ww+0.009
 importantly+0.008
www+0.008
dd+0.008
Neg logit contributions
 asleep-0.008
 driveway-0.008
robe-0.008
seek-0.007
 bank-0.007
Token.
Token ID13
Feature activation+2.06e-02

Pos logit contributions
ww+0.014
mmm+0.014
www+0.013
 importantly+0.012
dd+0.011
Neg logit contributions
 driveway-0.014
 asleep-0.013
robe-0.012
ASH-0.012
 neighborhood-0.012
Token Because
Token ID4362
Feature activation-8.44e-02

Pos logit contributions
 driveway+0.036
 asleep+0.036
robe+0.036
 grounds+0.034
seek+0.034
Neg logit contributions
mmm-0.041
ww-0.039
 importantly-0.037
www-0.036
dd-0.031
Token of
Token ID286
Feature activation-4.35e-02

Pos logit contributions
mmm+0.138
ww+0.134
www+0.121
 importantly+0.114
dd+0.108
Neg logit contributions
 asleep-0.183
robe-0.171
 driveway-0.171
 bank-0.167
pit-0.163
Token this
Token ID428
Feature activation-1.34e-02

Pos logit contributions
mmm+0.074
ww+0.072
 importantly+0.068
www+0.065
dd+0.059
Neg logit contributions
 asleep-0.090
 driveway-0.090
robe-0.089
seek-0.086
 bank-0.085
Token,
Token ID11
Feature activation-4.69e-02

Pos logit contributions
mmm+0.024
ww+0.024
 importantly+0.022
www+0.022
dd+0.020
Neg logit contributions
 asleep-0.026
 driveway-0.026
robe-0.026
seek-0.025
 bank-0.025
Token he
Token ID339
Feature activation+4.92e-03

Pos logit contributions
mmm+0.073
ww+0.071
 importantly+0.066
www+0.064
dd+0.057
Neg logit contributions
 asleep-0.102
 driveway-0.102
robe-0.100
seek-0.098
 bank-0.097
Token was
Token ID373
Feature activation-3.68e-02

Pos logit contributions
 driveway+0.009
 asleep+0.009
robe+0.009
seek+0.009
ASH+0.009
Neg logit contributions
mmm-0.009
ww-0.009
 importantly-0.008
www-0.008
dd-0.007
Token "
Token ID366
Feature activation-1.19e-02

Pos logit contributions
 driveway+0.014
robe+0.013
 asleep+0.013
 bank+0.013
seek+0.013
Neg logit contributions
mmm-0.015
ww-0.014
 importantly-0.013
www-0.013
dd-0.011
TokenThank
Token ID10449
Feature activation-3.78e-02

Pos logit contributions
mmm+0.022
ww+0.021
 importantly+0.020
www+0.019
dd+0.018
Neg logit contributions
 asleep-0.022
 driveway-0.022
robe-0.022
seek-0.022
 bank-0.021
Token you
Token ID345
Feature activation-3.13e-02

Pos logit contributions
mmm+0.056
ww+0.054
 importantly+0.051
www+0.049
 annoying+0.047
Neg logit contributions
 asleep-0.075
seek-0.074
 back-0.074
robe-0.073
ASH-0.072
Token,
Token ID11
Feature activation-4.33e-02

Pos logit contributions
ww+0.065
mmm+0.064
dd+0.062
 importantly+0.061
www+0.060
Neg logit contributions
 asleep-0.052
 back-0.050
robe-0.048
 driveway-0.047
 kitchen-0.046
Token my
Token ID616
Feature activation-3.51e-02

Pos logit contributions
ww+0.076
dd+0.074
mmm+0.072
www+0.069
 importantly+0.068
Neg logit contributions
 asleep-0.082
 bathroom-0.073
 kitchen-0.073
 driveway-0.073
pit-0.070
Token de
Token ID390
Feature activation-8.39e-02

Pos logit contributions
mmm+0.068
ww+0.067
 importantly+0.063
www+0.062
dd+0.059
Neg logit contributions
 asleep-0.064
robe-0.062
 driveway-0.059
 bank-0.057
seek-0.056
Tokenars
Token ID945
Feature activation-2.33e-02

Pos logit contributions
 importantly+0.196
mmm+0.191
ww+0.186
www+0.174
 gust+0.165
Neg logit contributions
seek-0.101
robe-0.095
 bank-0.092
 driveway-0.091
ags-0.089
Token.
Token ID13
Feature activation-5.24e-03

Pos logit contributions
mmm+0.040
ww+0.039
 importantly+0.036
www+0.035
dd+0.031
Neg logit contributions
 driveway-0.048
 asleep-0.045
 bank-0.045
seek-0.044
robe-0.044
Token I
Token ID314
Feature activation+1.08e-02

Pos logit contributions
mmm+0.009
ww+0.009
 importantly+0.008
www+0.008
dd+0.008
Neg logit contributions
 asleep-0.010
robe-0.010
 driveway-0.010
seek-0.010
ASH-0.010
Token appreciate
Token ID9144
Feature activation-2.44e-02

Pos logit contributions
 driveway+0.021
 asleep+0.021
robe+0.020
seek+0.019
 bank+0.019
Neg logit contributions
mmm-0.020
ww-0.020
 importantly-0.018
www-0.018
dd-0.017
Token your
Token ID534
Feature activation+1.42e-02

Pos logit contributions
mmm+0.041
ww+0.041
www+0.037
 importantly+0.037
dd+0.034
Neg logit contributions
 asleep-0.050
 driveway-0.048
robe-0.046
 bank-0.046
seek-0.045
Token woods
Token ID16479
Feature activation-9.17e-03

Pos logit contributions
seek+0.060
robe+0.057
 asleep+0.056
stick+0.056
ASH+0.047
Neg logit contributions
ww-0.114
www-0.105
mmm-0.103
 hour-0.099
 importantly-0.098
Token.
Token ID13
Feature activation+1.24e-02

Pos logit contributions
mmm+0.014
 importantly+0.013
ww+0.013
www+0.012
 hour+0.012
Neg logit contributions
robe-0.019
 driveway-0.019
 asleep-0.019
ASH-0.018
seek-0.018
Token
Token ID198
Feature activation-2.97e-02

Pos logit contributions
 asleep+0.019
 driveway+0.019
robe+0.019
seek+0.018
 bank+0.018
Neg logit contributions
mmm-0.027
ww-0.027
 importantly-0.026
www-0.025
dd-0.023
Token\n
Token ID198
Feature activation-3.75e-02

Pos logit contributions
mmm+0.066
ww+0.065
 importantly+0.062
www+0.061
dd+0.055
Neg logit contributions
 driveway-0.045
 asleep-0.043
robe-0.043
seek-0.042
 bank-0.041
TokenBut
Token ID1537
Feature activation-4.44e-02

Pos logit contributions
mmm+0.071
ww+0.070
 importantly+0.068
www+0.064
dd+0.058
Neg logit contributions
 driveway-0.067
seek-0.064
 edge-0.062
 neighborhood-0.062
 bank-0.062
Token most
Token ID749
Feature activation-8.38e-02

Pos logit contributions
ww+0.071
mmm+0.070
 importantly+0.065
www+0.064
dd+0.056
Neg logit contributions
 driveway-0.095
 asleep-0.094
robe-0.092
 bank-0.091
seek-0.089
Token of
Token ID286
Feature activation-3.13e-02

Pos logit contributions
mmm+0.149
ww+0.133
www+0.121
mmmm+0.108
Run+0.095
Neg logit contributions
 neighborhood-0.168
 neighbourhood-0.161
 grounds-0.157
 asleep-0.151
 gardens-0.150
Token all
Token ID477
Feature activation-3.99e-02

Pos logit contributions
mmm+0.068
ww+0.068
 importantly+0.064
www+0.063
dd+0.058
Neg logit contributions
 asleep-0.050
 driveway-0.049
robe-0.048
seek-0.046
 bank-0.046
Token,
Token ID11
Feature activation-4.20e-02

Pos logit contributions
mmm+0.078
ww+0.077
www+0.069
 importantly+0.066
dd+0.062
Neg logit contributions
 asleep-0.075
 driveway-0.071
 bank-0.069
 grounds-0.067
robe-0.067
Token Sam
Token ID3409
Feature activation-2.26e-02

Pos logit contributions
mmm+0.071
ww+0.071
www+0.065
 importantly+0.063
dd+0.058
Neg logit contributions
 asleep-0.084
 driveway-0.084
robe-0.083
 bank-0.082
seek-0.080
Token enjoyed
Token ID8359
Feature activation+2.78e-03

Pos logit contributions
mmm+0.040
ww+0.040
 importantly+0.037
www+0.037
dd+0.034
Neg logit contributions
 driveway-0.043
 asleep-0.043
robe-0.043
seek-0.041
ASH-0.040
Token,
Token ID11
Feature activation-4.00e-02

Pos logit contributions
mmm+0.092
 importantly+0.088
ww+0.087
www+0.080
dd+0.078
Neg logit contributions
 asleep-0.105
robe-0.100
 driveway-0.099
seek-0.097
pit-0.096
Token the
Token ID262
Feature activation+1.39e-02

Pos logit contributions
mmm+0.060
ww+0.059
 importantly+0.055
www+0.052
dd+0.048
Neg logit contributions
 asleep-0.088
 driveway-0.086
robe-0.085
 bank-0.083
seek-0.082
Token farmer
Token ID18739
Feature activation-1.66e-02

Pos logit contributions
seek+0.024
robe+0.024
 driveway+0.023
 asleep+0.023
stick+0.022
Neg logit contributions
ww-0.029
mmm-0.028
www-0.027
 importantly-0.026
dd-0.023
Token in
Token ID287
Feature activation-1.83e-03

Pos logit contributions
mmm+0.028
ww+0.028
 importantly+0.025
www+0.025
dd+0.022
Neg logit contributions
 driveway-0.034
 asleep-0.034
robe-0.033
seek-0.032
 bank-0.032
Token the
Token ID262
Feature activation+4.20e-02

Pos logit contributions
mmm+0.003
ww+0.003
 importantly+0.003
www+0.003
dd+0.003
Neg logit contributions
 asleep-0.003
 driveway-0.003
robe-0.003
seek-0.003
 bank-0.003
Token de
Token ID390
Feature activation-8.33e-02

Pos logit contributions
seek+0.066
robe+0.065
stick+0.059
 asleep+0.058
ASH+0.054
Neg logit contributions
ww-0.096
www-0.089
mmm-0.087
 importantly-0.084
 hour-0.081
Tokenll
Token ID297
Feature activation+1.84e-02

Pos logit contributions
 importantly+0.172
mmm+0.169
ww+0.164
www+0.149
 annoying+0.144
Neg logit contributions
seek-0.120
 grounds-0.118
 driveway-0.116
 asleep-0.114
 bank-0.113
Token".
Token ID1911
Feature activation+1.47e-02

Pos logit contributions
 asleep+0.034
 driveway+0.034
robe+0.033
 bank+0.032
seek+0.032
Neg logit contributions
mmm-0.034
ww-0.034
 importantly-0.032
www-0.031
dd-0.028
Token Suddenly
Token ID24975
Feature activation-1.82e-02

Pos logit contributions
 asleep+0.028
robe+0.028
 driveway+0.027
stick+0.026
ASH+0.025
Neg logit contributions
mmm-0.028
ww-0.025
 importantly-0.025
www-0.024
Run-0.022
Token,
Token ID11
Feature activation-2.12e-02

Pos logit contributions
mmm+0.032
ww+0.031
 importantly+0.029
www+0.029
dd+0.025
Neg logit contributions
 asleep-0.036
 driveway-0.035
robe-0.035
 bank-0.034
seek-0.034
Token he
Token ID339
Feature activation+4.51e-04

Pos logit contributions
mmm+0.041
ww+0.040
 importantly+0.036
www+0.036
dd+0.032
Neg logit contributions
 asleep-0.039
 driveway-0.038
robe-0.037
 bank-0.037
 edge-0.036
Token that
Token ID326
Feature activation-8.12e-03

Pos logit contributions
ww+0.017
mmm+0.016
 importantly+0.015
www+0.015
dd+0.014
Neg logit contributions
 driveway-0.020
 asleep-0.019
ASH-0.019
robe-0.019
pit-0.018
Token birds
Token ID10087
Feature activation-5.98e-04

Pos logit contributions
mmm+0.012
ww+0.012
 importantly+0.011
www+0.010
dd+0.009
Neg logit contributions
 driveway-0.019
robe-0.018
 asleep-0.018
seek-0.018
ASH-0.017
Token can
Token ID460
Feature activation+3.48e-02

Pos logit contributions
mmm+0.001
ww+0.001
 importantly+0.001
www+0.001
dd+0.001
Neg logit contributions
 driveway-0.001
 asleep-0.001
robe-0.001
seek-0.001
ASH-0.001
Token fly
Token ID6129
Feature activation+2.67e-02

Pos logit contributions
robe+0.067
 driveway+0.065
 asleep+0.065
ASH+0.064
pit+0.063
Neg logit contributions
mmm-0.065
ww-0.060
www-0.056
 importantly-0.056
 annoying-0.053
Token in
Token ID287
Feature activation+9.09e-03

Pos logit contributions
 driveway+0.053
robe+0.049
seek+0.048
 asleep+0.047
pit+0.047
Neg logit contributions
ww-0.047
mmm-0.045
 importantly-0.042
www-0.041
 though-0.039
Token their
Token ID511
Feature activation+5.74e-02

Pos logit contributions
robe+0.018
 driveway+0.018
 asleep+0.017
seek+0.017
pit+0.016
Neg logit contributions
mmm-0.016
ww-0.016
 importantly-0.015
www-0.014
dd-0.013
Token own
Token ID898
Feature activation+4.55e-02

Pos logit contributions
robe+0.093
seek+0.092
 asleep+0.087
stick+0.082
pit+0.080
Neg logit contributions
ww-0.123
 importantly-0.110
www-0.108
mmm-0.107
 hour-0.104
Token special
Token ID2041
Feature activation+4.58e-02

Pos logit contributions
robe+0.072
seek+0.071
 asleep+0.066
pit+0.063
 driveway+0.062
Neg logit contributions
ww-0.094
 importantly-0.087
mmm-0.086
www-0.084
 hour-0.083
Token way
Token ID835
Feature activation+2.46e-03

Pos logit contributions
seek+0.072
robe+0.072
 asleep+0.067
pit+0.061
 driveway+0.060
Neg logit contributions
ww-0.099
 importantly-0.090
 hour-0.089
www-0.089
mmm-0.088
Token and
Token ID290
Feature activation-2.72e-02

Pos logit contributions
 driveway+0.005
robe+0.005
seek+0.005
 asleep+0.005
ASH+0.004
Neg logit contributions
mmm-0.004
ww-0.004
 importantly-0.004
www-0.004
dd-0.003
Token that
Token ID326
Feature activation-1.87e-02

Pos logit contributions
mmm+0.038
 importantly+0.038
ww+0.038
www+0.033
 annoying+0.031
Neg logit contributions
robe-0.062
 driveway-0.061
ASH-0.060
pit-0.059
 asleep-0.059
Token Lily
Token ID20037
Feature activation+1.13e-02

Pos logit contributions
 driveway+0.012
seek+0.011
robe+0.011
 grounds+0.011
 bank+0.011
Neg logit contributions
mmm-0.008
ww-0.008
 importantly-0.007
 annoying-0.007
www-0.007
Token cry
Token ID3960
Feature activation-1.79e-02

Pos logit contributions
 driveway+0.023
robe+0.022
 asleep+0.022
seek+0.021
 grounds+0.021
Neg logit contributions
mmm-0.018
ww-0.018
 importantly-0.016
 badly-0.015
www-0.015
Token.
Token ID13
Feature activation-1.22e-03

Pos logit contributions
mmm+0.034
ww+0.033
 importantly+0.032
www+0.030
dd+0.028
Neg logit contributions
 driveway-0.033
 asleep-0.032
robe-0.031
seek-0.031
 bank-0.031
Token He
Token ID679
Feature activation+1.51e-02

Pos logit contributions
mmm+0.003
ww+0.003
www+0.003
 importantly+0.003
dd+0.003
Neg logit contributions
 asleep-0.002
 driveway-0.002
robe-0.002
seek-0.002
 bank-0.002
Token knew
Token ID2993
Feature activation-2.88e-02

Pos logit contributions
 asleep+0.029
 driveway+0.029
robe+0.028
 bank+0.028
seek+0.027
Neg logit contributions
mmm-0.028
ww-0.027
 importantly-0.025
www-0.024
dd-0.022
Token he
Token ID339
Feature activation+5.12e-02

Pos logit contributions
ww+0.046
mmm+0.046
www+0.042
 importantly+0.040
dd+0.036
Neg logit contributions
 asleep-0.063
robe-0.061
 driveway-0.061
seek-0.058
 bank-0.057
Token was
Token ID373
Feature activation-1.56e-03

Pos logit contributions
 asleep+0.098
 driveway+0.097
robe+0.093
seek+0.091
 bank+0.090
Neg logit contributions
mmm-0.092
ww-0.092
 importantly-0.087
www-0.084
dd-0.079
Token wrong
Token ID2642
Feature activation-2.80e-03

Pos logit contributions
mmm+0.003
ww+0.003
 importantly+0.003
www+0.002
 annoying+0.002
Neg logit contributions
 driveway-0.003
robe-0.003
 asleep-0.003
seek-0.003
 bank-0.003
Token.
Token ID13
Feature activation+1.18e-03

Pos logit contributions
mmm+0.006
ww+0.006
 importantly+0.005
www+0.005
dd+0.005
Neg logit contributions
 asleep-0.005
 driveway-0.005
robe-0.005
seek-0.004
 bank-0.004
Token\n
Token ID198
Feature activation-2.58e-02

Pos logit contributions
 asleep+0.001
 driveway+0.001
robe+0.001
ASH+0.001
 bathroom+0.001
Neg logit contributions
ww-0.003
mmm-0.003
www-0.003
dd-0.003
 importantly-0.003
Token\n
Token ID198
Feature activation-4.23e-02

Pos logit contributions
ww+0.070
mmm+0.068
www+0.065
 importantly+0.065
dd+0.061
Neg logit contributions
 driveway-0.027
 asleep-0.026
robe-0.025
 edge-0.024
 neighborhood-0.023
Token scissors
Token ID41786
Feature activation+3.36e-03

Pos logit contributions
seek+0.062
robe+0.059
 driveway+0.058
stick+0.057
 asleep+0.057
Neg logit contributions
ww-0.067
mmm-0.064
www-0.062
 importantly-0.062
 hour-0.058
Token and
Token ID290
Feature activation+1.54e-02

Pos logit contributions
robe+0.007
 driveway+0.007
seek+0.007
ASH+0.007
 neighborhood+0.007
Neg logit contributions
mmm-0.005
ww-0.005
 importantly-0.004
dd-0.004
www-0.004
Token went
Token ID1816
Feature activation+4.67e-02

Pos logit contributions
 asleep+0.028
robe+0.027
 driveway+0.027
seek+0.027
ASH+0.026
Neg logit contributions
mmm-0.028
ww-0.028
 importantly-0.026
www-0.026
dd-0.024
Token back
Token ID736
Feature activation+2.81e-02

Pos logit contributions
ASH+0.087
IT+0.087
izons+0.081
bows+0.078
pit+0.076
Neg logit contributions
 much-0.073
ww-0.072
dd-0.065
www-0.065
 of-0.063
Token to
Token ID284
Feature activation+1.90e-02

Pos logit contributions
izons+0.057
robe+0.056
pit+0.055
IT+0.053
ASH+0.053
Neg logit contributions
ww-0.035
 though-0.033
 much-0.033
 because-0.033
dd-0.031
Token the
Token ID262
Feature activation+4.51e-02

Pos logit contributions
robe+0.041
pit+0.039
 driveway+0.038
 grounds+0.038
seek+0.038
Neg logit contributions
mmm-0.029
 importantly-0.027
ww-0.027
www-0.024
 inviting-0.024
Token garden
Token ID11376
Feature activation+1.92e-02

Pos logit contributions
seek+0.050
stick+0.049
robe+0.049
 asleep+0.045
ags+0.041
Neg logit contributions
ww-0.114
www-0.108
mmm-0.106
 importantly-0.100
 unexpected-0.099
Token.
Token ID13
Feature activation+2.33e-02

Pos logit contributions
ags+0.047
IT+0.045
ASH+0.045
robe+0.045
 driveway+0.044
Neg logit contributions
 importantly-0.018
 though-0.018
mmm-0.016
 much-0.016
 badly-0.016
Token But
Token ID887
Feature activation-4.25e-02

Pos logit contributions
 bank+0.043
 asleep+0.043
 driveway+0.042
stick+0.042
robe+0.041
Neg logit contributions
mmm-0.044
 importantly-0.041
ww-0.041
www-0.038
dd-0.033
Token nothing
Token ID2147
Feature activation-3.90e-02

Pos logit contributions
mmm+0.067
ww+0.067
 importantly+0.061
www+0.060
dd+0.053
Neg logit contributions
 asleep-0.092
 driveway-0.091
robe-0.089
 bank-0.087
seek-0.086
Token felt
Token ID2936
Feature activation-5.13e-02

Pos logit contributions
mmm+0.070
ww+0.066
 importantly+0.060
www+0.060
dd+0.052
Neg logit contributions
 asleep-0.079
robe-0.076
 driveway-0.075
seek-0.074
 bank-0.073
Token and
Token ID290
Feature activation-2.71e-02

Pos logit contributions
ww+0.014
mmm+0.014
 importantly+0.013
www+0.013
dd+0.012
Neg logit contributions
 asleep-0.015
 driveway-0.015
robe-0.015
seek-0.015
 bank-0.015
Token dad
Token ID9955
Feature activation+9.96e-04

Pos logit contributions
mmm+0.054
ww+0.052
 importantly+0.049
www+0.048
dd+0.044
Neg logit contributions
 asleep-0.050
 driveway-0.049
robe-0.048
seek-0.046
 bank-0.046
Token took
Token ID1718
Feature activation+2.43e-02

Pos logit contributions
 asleep+0.002
 driveway+0.002
robe+0.002
seek+0.002
 bank+0.002
Neg logit contributions
mmm-0.002
ww-0.002
 importantly-0.002
www-0.002
dd-0.001
Token her
Token ID607
Feature activation+3.72e-02

Pos logit contributions
 driveway+0.046
 asleep+0.045
robe+0.045
 bank+0.044
seek+0.043
Neg logit contributions
ww-0.044
mmm-0.044
 importantly-0.040
www-0.040
dd-0.037
Token to
Token ID284
Feature activation+3.61e-02

Pos logit contributions
pit+0.059
 driveway+0.056
robe+0.055
seek+0.054
ASH+0.054
Neg logit contributions
ww-0.077
 importantly-0.070
mmm-0.069
www-0.069
 hour-0.065
Token the
Token ID262
Feature activation+5.84e-02

Pos logit contributions
robe+0.085
pit+0.082
 driveway+0.080
seek+0.079
 asleep+0.078
Neg logit contributions
mmm-0.052
ww-0.046
 importantly-0.043
www-0.042
 hour-0.037
Token hospital
Token ID4436
Feature activation+2.84e-02

Pos logit contributions
seek+0.079
stick+0.077
 asleep+0.076
robe+0.073
anced+0.073
Neg logit contributions
ww-0.123
 unexpected-0.118
www-0.114
mmm-0.111
 importantly-0.108
Token.
Token ID13
Feature activation+2.48e-02

Pos logit contributions
 driveway+0.060
ASH+0.058
robe+0.058
 asleep+0.057
pit+0.057
Neg logit contributions
mmm-0.041
 importantly-0.037
ww-0.036
 hour-0.034
 though-0.033
Token\n
Token ID198
Feature activation-7.30e-03

Pos logit contributions
 asleep+0.041
 driveway+0.041
robe+0.040
seek+0.039
 bank+0.039
Neg logit contributions
mmm-0.051
ww-0.050
 importantly-0.048
www-0.046
dd-0.042
Token\n
Token ID198
Feature activation-2.16e-02

Pos logit contributions
mmm+0.017
ww+0.017
 importantly+0.016
www+0.016
dd+0.015
Neg logit contributions
 asleep-0.010
 driveway-0.010
robe-0.010
seek-0.009
 bank-0.009
TokenWhen
Token ID2215
Feature activation-4.03e-02

Pos logit contributions
mmm+0.043
ww+0.041
 importantly+0.039
www+0.037
dd+0.036
Neg logit contributions
 driveway-0.038
robe-0.036
 grounds-0.035
 neighborhood-0.035
seek-0.035
TokenOnce
Token ID7454
Feature activation-7.50e-03

Pos logit contributions
 driveway+0.123
stick+0.122
 bank+0.121
seek+0.120
 workshop+0.116
Neg logit contributions
mmm-0.105
ww-0.104
 importantly-0.099
www-0.094
dd-0.091
Token upon
Token ID2402
Feature activation+1.23e-02

Pos logit contributions
mmm+0.020
ww+0.020
www+0.019
 importantly+0.019
dd+0.017
Neg logit contributions
 asleep-0.008
 driveway-0.008
robe-0.008
 bank-0.007
pit-0.007
Token a
Token ID257
Feature activation-4.34e-02

Pos logit contributions
 driveway+0.022
 asleep+0.021
robe+0.021
 bank+0.021
seek+0.020
Neg logit contributions
mmm-0.024
ww-0.024
 importantly-0.023
www-0.022
dd-0.020
Token time
Token ID640
Feature activation+9.38e-03

Pos logit contributions
mmm+0.094
ww+0.089
 importantly+0.086
www+0.084
Run+0.077
Neg logit contributions
 driveway-0.065
 neighborhood-0.065
 asleep-0.064
 neighbourhood-0.060
seek-0.057
Token,
Token ID11
Feature activation-2.98e-03

Pos logit contributions
 asleep+0.017
robe+0.016
 driveway+0.016
pit+0.015
seek+0.015
Neg logit contributions
mmm-0.018
ww-0.018
www-0.017
 importantly-0.017
dd-0.015
Token there
Token ID612
Feature activation+5.03e-02

Pos logit contributions
mmm+0.006
ww+0.006
 importantly+0.006
www+0.006
dd+0.005
Neg logit contributions
 asleep-0.005
robe-0.005
 driveway-0.005
 bank-0.004
 edge-0.004
Token was
Token ID373
Feature activation-6.63e-03

Pos logit contributions
 asleep+0.079
 driveway+0.078
robe+0.076
seek+0.073
 bank+0.073
Neg logit contributions
mmm-0.109
ww-0.108
 importantly-0.102
www-0.100
dd-0.092
Token a
Token ID257
Feature activation-5.52e-02

Pos logit contributions
mmm+0.016
ww+0.016
www+0.015
 importantly+0.015
dd+0.013
Neg logit contributions
 asleep-0.009
 driveway-0.008
robe-0.008
seek-0.008
 bank-0.008
Token little
Token ID1310
Feature activation-5.04e-02

Pos logit contributions
mmm+0.137
ww+0.119
 importantly+0.119
dd+0.114
www+0.114
Neg logit contributions
 neighborhood-0.077
 garden-0.075
 driveway-0.074
 asleep-0.074
 kitchen-0.074
Token boy
Token ID2933
Feature activation-4.00e-02

Pos logit contributions
mmm+0.136
 importantly+0.122
ww+0.114
dd+0.111
www+0.111
Neg logit contributions
 asleep-0.063
robe-0.058
 driveway-0.057
 neighborhood-0.055
 bank-0.055
Token named
Token ID3706
Feature activation+1.16e-02

Pos logit contributions
mmm+0.087
ww+0.079
www+0.075
 importantly+0.071
mmmm+0.068
Neg logit contributions
gy-0.068
 bank-0.067
 neighborhood-0.066
stick-0.066
 driveway-0.066
Token red
Token ID2266
Feature activation+2.38e-03

Pos logit contributions
 driveway+0.012
 asleep+0.012
robe+0.012
 bank+0.011
seek+0.011
Neg logit contributions
mmm-0.015
ww-0.015
 importantly-0.014
www-0.014
dd-0.013
Token balloon
Token ID21190
Feature activation-1.84e-02

Pos logit contributions
 asleep+0.004
 driveway+0.004
robe+0.004
seek+0.004
 grounds+0.003
Neg logit contributions
ww-0.005
mmm-0.005
www-0.005
 importantly-0.005
dd-0.004
Token.
Token ID13
Feature activation+1.91e-02

Pos logit contributions
mmm+0.030
ww+0.029
 importantly+0.028
www+0.026
dd+0.024
Neg logit contributions
 asleep-0.039
 driveway-0.039
robe-0.038
 bank-0.037
seek-0.036
Token It
Token ID632
Feature activation+2.34e-02

Pos logit contributions
 driveway+0.036
 asleep+0.036
robe+0.035
seek+0.035
 bank+0.034
Neg logit contributions
mmm-0.035
ww-0.034
 importantly-0.032
www-0.031
dd-0.028
Token was
Token ID373
Feature activation-1.76e-02

Pos logit contributions
 asleep+0.035
 driveway+0.035
robe+0.034
 bank+0.033
seek+0.032
Neg logit contributions
mmm-0.053
ww-0.052
 importantly-0.050
www-0.048
dd-0.045
Token floating
Token ID12462
Feature activation+8.98e-03

Pos logit contributions
mmm+0.038
ww+0.037
 importantly+0.035
www+0.034
dd+0.032
Neg logit contributions
 asleep-0.029
 driveway-0.028
robe-0.027
 bank-0.026
seek-0.026
Token in
Token ID287
Feature activation-3.33e-03

Pos logit contributions
 asleep+0.013
 driveway+0.013
robe+0.013
seek+0.012
 bank+0.012
Neg logit contributions
mmm-0.020
ww-0.020
 importantly-0.019
www-0.019
dd-0.017
Token the
Token ID262
Feature activation+4.59e-02

Pos logit contributions
mmm+0.006
ww+0.006
 importantly+0.005
www+0.005
dd+0.005
Neg logit contributions
 asleep-0.007
robe-0.007
 driveway-0.007
seek-0.006
 bank-0.006
Token air
Token ID1633
Feature activation+2.29e-03

Pos logit contributions
 asleep+0.059
robe+0.058
seek+0.058
stick+0.051
pit+0.048
Neg logit contributions
ww-0.120
www-0.109
mmm-0.108
 importantly-0.102
Run-0.097
Token,
Token ID11
Feature activation+5.35e-03

Pos logit contributions
 driveway+0.005
robe+0.005
seek+0.005
 asleep+0.005
ASH+0.005
Neg logit contributions
mmm-0.003
ww-0.003
www-0.003
 importantly-0.003
dd-0.002
Token tied
Token ID8165
Feature activation+1.41e-02

Pos logit contributions
robe+0.011
 driveway+0.011
seek+0.011
 asleep+0.010
 grounds+0.010
Neg logit contributions
mmm-0.009
ww-0.009
 importantly-0.008
www-0.008
Run-0.007
Token€
Token ID26391
Feature activation-2.41e-02

Pos logit contributions
 asleep+0.025
 driveway+0.024
robe+0.024
seek+0.023
 bank+0.023
Neg logit contributions
mmm-0.028
ww-0.027
 importantly-0.026
www-0.025
dd-0.023
Token\xe2\x80
Token ID447
Feature activation-6.05e-03

Pos logit contributions
mmm+0.048
ww+0.047
 importantly+0.046
www+0.043
dd+0.040
Neg logit contributions
 driveway-0.041
robe-0.040
 asleep-0.040
seek-0.040
 bank-0.038
Token\x9c
Token ID250
Feature activation+1.28e-02

Pos logit contributions
 importantly+0.012
mmm+0.012
ww+0.011
www+0.011
dd+0.010
Neg logit contributions
 driveway-0.011
 bank-0.010
robe-0.010
 asleep-0.010
ASH-0.010
Token saw
Token ID2497
Feature activation-2.40e-02

Pos logit contributions
 asleep+0.025
 driveway+0.025
robe+0.024
seek+0.024
 bank+0.023
Neg logit contributions
mmm-0.023
ww-0.023
 importantly-0.021
www-0.021
dd-0.019
Token the
Token ID262
Feature activation+2.38e-02

Pos logit contributions
mmm+0.037
ww+0.035
 importantly+0.034
www+0.032
dd+0.029
Neg logit contributions
 asleep-0.053
 driveway-0.051
robe-0.051
 bank-0.050
seek-0.049
Token train
Token ID4512
Feature activation+6.95e-03

Pos logit contributions
 asleep+0.038
seek+0.038
 driveway+0.036
robe+0.036
 grounds+0.035
Neg logit contributions
ww-0.051
mmm-0.048
www-0.047
 importantly-0.047
 annoying-0.044
Token and
Token ID290
Feature activation-3.29e-02

Pos logit contributions
 driveway+0.015
 asleep+0.015
robe+0.015
seek+0.015
ASH+0.014
Neg logit contributions
ww-0.011
mmm-0.011
www-0.010
 importantly-0.009
 annoying-0.008
Token wanted
Token ID2227
Feature activation+6.06e-03

Pos logit contributions
mmm+0.050
ww+0.048
 importantly+0.045
www+0.043
dd+0.039
Neg logit contributions
 asleep-0.073
 driveway-0.072
robe-0.071
 bank-0.070
seek-0.068
Token to
Token ID284
Feature activation+3.89e-02

Pos logit contributions
 driveway+0.008
robe+0.008
 asleep+0.008
pit+0.007
seek+0.007
Neg logit contributions
ww-0.015
mmm-0.015
 importantly-0.014
www-0.014
dd-0.013
Token help
Token ID1037
Feature activation+5.30e-03

Pos logit contributions
 asleep+0.060
 driveway+0.059
robe+0.058
pit+0.055
seek+0.054
Neg logit contributions
mmm-0.086
ww-0.085
www-0.079
 importantly-0.079
dd-0.072
Token.
Token ID13
Feature activation-3.67e-03

Pos logit contributions
 asleep+0.010
 driveway+0.010
robe+0.010
 bank+0.010
seek+0.010
Neg logit contributions
mmm-0.009
ww-0.009
 importantly-0.009
www-0.008
dd-0.008
Token around
Token ID1088
Feature activation+1.11e-02

Pos logit contributions
 driveway+0.080
 asleep+0.077
robe+0.073
seek+0.071
pit+0.070
Neg logit contributions
mmm-0.068
ww-0.067
 importantly-0.063
www-0.061
dd-0.059
Token.
Token ID13
Feature activation+1.78e-02

Pos logit contributions
 driveway+0.020
 asleep+0.020
robe+0.020
seek+0.019
 bank+0.018
Neg logit contributions
mmm-0.022
ww-0.021
 importantly-0.020
www-0.020
dd-0.018
Token One
Token ID1881
Feature activation-1.94e-02

Pos logit contributions
 asleep+0.029
robe+0.028
 driveway+0.028
seek+0.027
 bank+0.026
Neg logit contributions
mmm-0.038
ww-0.037
 importantly-0.035
www-0.034
dd-0.031
Token day
Token ID1110
Feature activation-1.27e-02

Pos logit contributions
mmm+0.054
 importantly+0.048
ww+0.045
www+0.044
dd+0.044
Neg logit contributions
 garden-0.023
 park-0.023
 back-0.022
 driveway-0.022
 backyard-0.022
Token,
Token ID11
Feature activation-1.58e-02

Pos logit contributions
mmm+0.030
 importantly+0.028
ww+0.028
www+0.027
 annoying+0.027
Neg logit contributions
 asleep-0.017
robe-0.015
 back-0.015
 edge-0.015
seek-0.015
Token the
Token ID262
Feature activation+6.47e-03

Pos logit contributions
mmm+0.035
ww+0.033
 importantly+0.032
www+0.031
dd+0.029
Neg logit contributions
 asleep-0.026
robe-0.025
 bank-0.024
 driveway-0.024
 edge-0.023
Token youngest
Token ID18887
Feature activation-1.10e-02

Pos logit contributions
 asleep+0.012
robe+0.012
seek+0.012
 driveway+0.012
stick+0.012
Neg logit contributions
ww-0.012
mmm-0.011
 importantly-0.011
www-0.011
dd-0.010
Token duck
Token ID22045
Feature activation-2.00e-02

Pos logit contributions
mmm+0.023
ww+0.022
 importantly+0.021
www+0.020
dd+0.019
Neg logit contributions
 asleep-0.019
 driveway-0.019
robe-0.018
 bank-0.017
seek-0.017
Tokenling
Token ID1359
Feature activation+6.36e-03

Pos logit contributions
ww+0.041
mmm+0.041
 importantly+0.038
www+0.038
dd+0.036
Neg logit contributions
 asleep-0.033
 driveway-0.033
robe-0.032
seek-0.031
 bank-0.030
Token saw
Token ID2497
Feature activation+5.44e-03

Pos logit contributions
 driveway+0.011
 asleep+0.010
robe+0.010
seek+0.010
 bank+0.010
Neg logit contributions
ww-0.013
mmm-0.013
 importantly-0.012
www-0.012
dd-0.011
Token a
Token ID257
Feature activation-2.45e-02

Pos logit contributions
 driveway+0.009
 asleep+0.009
robe+0.009
seek+0.009
 bank+0.008
Neg logit contributions
mmm-0.011
ww-0.011
 importantly-0.011
www-0.010
dd-0.010
Token clothes
Token ID8242
Feature activation+1.82e-03

Pos logit contributions
 asleep+0.008
 driveway+0.008
robe+0.008
seek+0.008
 bank+0.008
Neg logit contributions
mmm-0.009
ww-0.009
 importantly-0.009
www-0.008
dd-0.008
Token gl
Token ID1278
Feature activation-5.34e-03

Pos logit contributions
ASH+0.004
 driveway+0.004
seek+0.004
 asleep+0.004
robe+0.004
Neg logit contributions
mmm-0.003
ww-0.003
 importantly-0.003
www-0.002
dd-0.002
Tokenowed
Token ID6972
Feature activation-1.60e-02

Pos logit contributions
mmm+0.009
ww+0.008
 importantly+0.008
www+0.007
dd+0.007
Neg logit contributions
 driveway-0.012
 bank-0.011
 asleep-0.011
 kitchen-0.011
 bathroom-0.011
Token brilliantly
Token ID37928
Feature activation-2.34e-02

Pos logit contributions
mmm+0.029
ww+0.029
 importantly+0.027
www+0.026
dd+0.024
Neg logit contributions
 driveway-0.031
 asleep-0.030
robe-0.030
seek-0.029
 bank-0.029
Token in
Token ID287
Feature activation-1.73e-02

Pos logit contributions
mmm+0.043
ww+0.043
 importantly+0.040
www+0.039
dd+0.036
Neg logit contributions
 driveway-0.044
 asleep-0.044
robe-0.043
seek-0.042
 bank-0.041
Token the
Token ID262
Feature activation+3.61e-02

Pos logit contributions
mmm+0.036
ww+0.035
www+0.033
 importantly+0.032
dd+0.030
Neg logit contributions
 asleep-0.030
 driveway-0.030
robe-0.028
 bank-0.027
ASH-0.027
Token sunshine
Token ID34488
Feature activation+1.14e-02

Pos logit contributions
seek+0.056
robe+0.055
 asleep+0.054
stick+0.051
pit+0.050
Neg logit contributions
ww-0.080
mmm-0.073
www-0.073
 importantly-0.072
 hour-0.065
Token,
Token ID11
Feature activation-2.94e-03

Pos logit contributions
 driveway+0.025
seek+0.025
ASH+0.024
robe+0.024
 asleep+0.024
Neg logit contributions
mmm-0.016
ww-0.016
 importantly-0.015
www-0.014
 hour-0.012
Token just
Token ID655
Feature activation-2.73e-02

Pos logit contributions
mmm+0.005
ww+0.005
 importantly+0.005
www+0.004
dd+0.004
Neg logit contributions
seek-0.006
robe-0.006
 driveway-0.006
 asleep-0.006
ASH-0.006
Token like
Token ID588
Feature activation-4.28e-02

Pos logit contributions
mmm+0.036
ww+0.034
 importantly+0.033
www+0.030
dd+0.027
Neg logit contributions
 driveway-0.066
robe-0.065
 asleep-0.064
seek-0.063
 bank-0.062
Token the
Token ID262
Feature activation+5.57e-03

Pos logit contributions
mmm+0.071
ww+0.070
 importantly+0.067
www+0.063
dd+0.057
Neg logit contributions
 driveway-0.088
robe-0.085
 asleep-0.085
seek-0.084
 bank-0.082
Token had
Token ID550
Feature activation+3.55e-02

Pos logit contributions
ww+0.014
mmm+0.014
www+0.013
 importantly+0.013
dd+0.013
Neg logit contributions
robe-0.017
 driveway-0.017
 asleep-0.017
seek-0.017
ASH-0.016
Token a
Token ID257
Feature activation-8.75e-03

Pos logit contributions
robe+0.063
 driveway+0.059
pit+0.059
ASH+0.056
 grounds+0.056
Neg logit contributions
ww-0.068
mmm-0.066
 importantly-0.066
www-0.064
 badly-0.061
Token problem
Token ID1917
Feature activation-1.57e-02

Pos logit contributions
ww+0.015
mmm+0.014
 importantly+0.014
www+0.013
dd+0.012
Neg logit contributions
 asleep-0.018
robe-0.017
 driveway-0.017
seek-0.017
ASH-0.017
Token.
Token ID13
Feature activation+2.75e-02

Pos logit contributions
ww+0.035
mmm+0.035
www+0.032
 importantly+0.031
dd+0.029
Neg logit contributions
 asleep-0.024
 driveway-0.023
robe-0.023
seek-0.022
 bank-0.022
Token He
Token ID679
Feature activation+1.31e-02

Pos logit contributions
robe+0.053
 asleep+0.052
seek+0.051
 bank+0.050
 driveway+0.049
Neg logit contributions
mmm-0.051
 importantly-0.046
ww-0.044
www-0.042
 hour-0.039
Token came
Token ID1625
Feature activation+3.67e-02

Pos logit contributions
 asleep+0.026
 driveway+0.025
robe+0.025
seek+0.024
 bank+0.024
Neg logit contributions
mmm-0.023
ww-0.023
 importantly-0.022
www-0.021
dd-0.019
Token across
Token ID1973
Feature activation+2.46e-02

Pos logit contributions
robe+0.062
 driveway+0.058
ASH+0.056
 bank+0.055
pit+0.054
Neg logit contributions
 badly-0.068
 importantly-0.066
ww-0.066
 much-0.065
 though-0.063
Token a
Token ID257
Feature activation-4.63e-03

Pos logit contributions
robe+0.050
 driveway+0.049
seek+0.048
 asleep+0.048
pit+0.047
Neg logit contributions
mmm-0.040
ww-0.039
 importantly-0.036
www-0.035
Run-0.033
Token big
Token ID1263
Feature activation-8.27e-03

Pos logit contributions
mmm+0.009
ww+0.009
 importantly+0.009
www+0.009
dd+0.008
Neg logit contributions
 asleep-0.008
 driveway-0.008
robe-0.008
seek-0.007
 bank-0.007
Token,
Token ID11
Feature activation+7.10e-03

Pos logit contributions
mmm+0.020
ww+0.020
 importantly+0.019
www+0.019
dd+0.018
Neg logit contributions
 asleep-0.011
 driveway-0.011
robe-0.010
 bank-0.010
seek-0.009
Token deep
Token ID2769
Feature activation+4.93e-03

Pos logit contributions
 asleep+0.015
 driveway+0.015
 bank+0.014
robe+0.014
 edge+0.013
Neg logit contributions
mmm-0.013
ww-0.012
 importantly-0.011
www-0.011
dd-0.010
Token she
Token ID673
Feature activation+3.23e-03

Pos logit contributions
mmm+0.077
ww+0.077
 importantly+0.070
www+0.069
dd+0.061
Neg logit contributions
 asleep-0.106
 driveway-0.104
robe-0.102
seek-0.099
 bank-0.099
Token kept
Token ID4030
Feature activation+2.71e-02

Pos logit contributions
 asleep+0.006
 driveway+0.006
robe+0.006
seek+0.006
 bank+0.006
Neg logit contributions
mmm-0.006
ww-0.006
 importantly-0.006
www-0.006
dd-0.005
Token moving
Token ID3867
Feature activation+1.13e-02

Pos logit contributions
robe+0.054
seek+0.053
 driveway+0.052
 bank+0.050
pit+0.050
Neg logit contributions
mmm-0.046
 importantly-0.045
ww-0.045
dd-0.042
www-0.041
Token.
Token ID13
Feature activation+2.49e-02

Pos logit contributions
 driveway+0.022
robe+0.021
seek+0.021
 bank+0.020
 asleep+0.020
Neg logit contributions
mmm-0.020
ww-0.019
 importantly-0.019
www-0.018
dd-0.017
Token Suddenly
Token ID24975
Feature activation-1.89e-02

Pos logit contributions
robe+0.047
 asleep+0.046
 bank+0.044
 driveway+0.043
stick+0.042
Neg logit contributions
mmm-0.050
 importantly-0.045
ww-0.044
www-0.043
 hour-0.037
Token a
Token ID257
Feature activation-3.85e-02

Pos logit contributions
mmm+0.030
ww+0.030
 importantly+0.027
www+0.027
dd+0.024
Neg logit contributions
 asleep-0.040
 driveway-0.040
robe-0.040
seek-0.038
 bank-0.038
Token big
Token ID1263
Feature activation-2.08e-02

Pos logit contributions
mmm+0.076
ww+0.070
 importantly+0.066
www+0.064
dd+0.063
Neg logit contributions
 asleep-0.070
 driveway-0.070
robe-0.067
 bank-0.065
 edge-0.064
Token,
Token ID11
Feature activation-1.13e-02

Pos logit contributions
mmm+0.040
ww+0.038
 importantly+0.036
www+0.034
dd+0.033
Neg logit contributions
 asleep-0.040
 driveway-0.039
robe-0.036
 bank-0.036
 bathroom-0.036
Token loud
Token ID7812
Feature activation-4.88e-03

Pos logit contributions
mmm+0.023
ww+0.021
 importantly+0.020
www+0.019
dd+0.019
Neg logit contributions
 asleep-0.020
 driveway-0.020
robe-0.018
 bathroom-0.018
 kitchen-0.018
Token noise
Token ID7838
Feature activation-2.16e-02

Pos logit contributions
mmm+0.009
ww+0.009
 importantly+0.009
www+0.008
dd+0.008
Neg logit contributions
 asleep-0.009
 driveway-0.009
robe-0.009
 bank-0.008
seek-0.008
Token made
Token ID925
Feature activation-2.42e-02

Pos logit contributions
mmm+0.051
ww+0.050
 importantly+0.046
www+0.046
dd+0.040
Neg logit contributions
 asleep-0.033
 driveway-0.030
robe-0.030
seek-0.028
stick-0.028
Token.
Token ID13
Feature activation+1.20e-02

Pos logit contributions
mmm+0.033
ww+0.033
 importantly+0.031
www+0.031
dd+0.028
Neg logit contributions
 driveway-0.030
 asleep-0.030
robe-0.029
seek-0.028
 bank-0.028
Token\n
Token ID198
Feature activation-2.30e-02

Pos logit contributions
 driveway+0.016
 asleep+0.016
robe+0.015
 bank+0.015
seek+0.014
Neg logit contributions
ww-0.029
mmm-0.028
 importantly-0.027
www-0.027
dd-0.026
Token\n
Token ID198
Feature activation-3.57e-02

Pos logit contributions
ww+0.058
mmm+0.057
 importantly+0.055
www+0.054
dd+0.050
Neg logit contributions
 driveway-0.028
 asleep-0.026
robe-0.025
 neighborhood-0.025
seek-0.025
Token"
Token ID1
Feature activation-5.94e-03

Pos logit contributions
mmm+0.062
ww+0.060
 importantly+0.060
www+0.054
 gust+0.053
Neg logit contributions
 driveway-0.067
seek-0.064
 neighborhood-0.063
 neighbourhood-0.061
 edge-0.061
TokenOkay
Token ID16454
Feature activation-8.85e-03

Pos logit contributions
mmm+0.011
ww+0.011
 importantly+0.010
www+0.010
dd+0.009
Neg logit contributions
 asleep-0.012
 driveway-0.012
robe-0.011
seek-0.011
 bank-0.011
Token,
Token ID11
Feature activation-3.34e-02

Pos logit contributions
ww+0.015
mmm+0.015
 importantly+0.015
www+0.014
dd+0.013
Neg logit contributions
 asleep-0.018
 driveway-0.017
robe-0.017
 bank-0.016
 edge-0.016
Token you
Token ID345
Feature activation+2.04e-02

Pos logit contributions
ww+0.059
mmm+0.058
 importantly+0.053
www+0.053
dd+0.052
Neg logit contributions
 asleep-0.065
 driveway-0.063
robe-0.060
 edge-0.060
 bank-0.060
Token can
Token ID460
Feature activation+3.32e-02

Pos logit contributions
 asleep+0.037
 driveway+0.037
robe+0.036
seek+0.035
 bank+0.035
Neg logit contributions
mmm-0.040
ww-0.039
 importantly-0.037
www-0.036
dd-0.033
Token try
Token ID1949
Feature activation+2.06e-02

Pos logit contributions
 asleep+0.066
robe+0.065
 driveway+0.062
ASH+0.060
 grounds+0.059
Neg logit contributions
mmm-0.065
ww-0.058
www-0.055
 importantly-0.051
dd-0.047
Token it
Token ID340
Feature activation+3.04e-02

Pos logit contributions
robe+0.046
 driveway+0.045
 asleep+0.042
seek+0.042
 grounds+0.042
Neg logit contributions
mmm-0.034
ww-0.032
 importantly-0.029
www-0.028
 badly-0.025
Token,"
Token ID553
Feature activation-2.27e-02

Pos logit contributions
 driveway+0.077
robe+0.073
 asleep+0.073
seek+0.071
 workshop+0.071
Neg logit contributions
mmm-0.041
ww-0.039
www-0.033
 importantly-0.030
 badly-0.027
Token star
Token ID3491
Feature activation-2.73e-02

Pos logit contributions
 asleep+0.034
seek+0.034
 driveway+0.033
robe+0.033
 grounds+0.032
Neg logit contributions
ww-0.039
mmm-0.039
 importantly-0.037
www-0.036
 hour-0.032
Token.
Token ID13
Feature activation+1.44e-02

Pos logit contributions
mmm+0.052
ww+0.051
 importantly+0.048
www+0.046
dd+0.042
Neg logit contributions
 driveway-0.050
 asleep-0.049
robe-0.048
seek-0.047
 bank-0.047
Token In
Token ID554
Feature activation-1.00e-02

Pos logit contributions
 asleep+0.024
 driveway+0.024
robe+0.024
seek+0.023
 bank+0.023
Neg logit contributions
mmm-0.030
ww-0.029
 importantly-0.027
www-0.027
dd-0.024
Token a
Token ID257
Feature activation-2.56e-02

Pos logit contributions
mmm+0.017
ww+0.016
 importantly+0.015
www+0.015
dd+0.013
Neg logit contributions
 asleep-0.020
 driveway-0.020
robe-0.020
seek-0.020
 bank-0.020
Token flash
Token ID7644
Feature activation-2.66e-02

Pos logit contributions
mmm+0.044
ww+0.044
 importantly+0.041
www+0.040
dd+0.036
Neg logit contributions
 asleep-0.051
 driveway-0.051
robe-0.050
seek-0.048
 bank-0.048
Token,
Token ID11
Feature activation-3.02e-02

Pos logit contributions
mmm+0.038
ww+0.037
 importantly+0.034
www+0.033
dd+0.029
Neg logit contributions
 asleep-0.062
 driveway-0.061
robe-0.060
seek-0.059
 bank-0.059
Token her
Token ID607
Feature activation+2.70e-02

Pos logit contributions
mmm+0.048
ww+0.047
 importantly+0.044
www+0.042
dd+0.038
Neg logit contributions
 driveway-0.065
 asleep-0.064
robe-0.063
seek-0.062
 bank-0.062
Token wish
Token ID4601
Feature activation+1.75e-02

Pos logit contributions
seek+0.048
 asleep+0.046
 driveway+0.045
robe+0.045
 bank+0.044
Neg logit contributions
 importantly-0.051
ww-0.050
mmm-0.050
www-0.045
 annoying-0.043
Token was
Token ID373
Feature activation-1.89e-02

Pos logit contributions
 driveway+0.035
seek+0.034
 asleep+0.034
 bank+0.034
ASH+0.033
Neg logit contributions
ww-0.029
mmm-0.029
 importantly-0.027
www-0.026
dd-0.024
Token granted
Token ID7520
Feature activation-7.15e-03

Pos logit contributions
mmm+0.034
ww+0.034
 importantly+0.032
www+0.031
dd+0.028
Neg logit contributions
 driveway-0.037
 asleep-0.035
robe-0.035
 bank-0.035
seek-0.034
Token!
Token ID0
Feature activation+1.35e-02

Pos logit contributions
mmm+0.013
ww+0.013
 importantly+0.012
www+0.012
dd+0.010
Neg logit contributions
 driveway-0.014
 asleep-0.014
robe-0.013
 bank-0.013
seek-0.013
TokenThey
Token ID2990
Feature activation+8.52e-03

Pos logit contributions
mmm+0.098
 importantly+0.095
ww+0.092
www+0.085
 gust+0.085
Neg logit contributions
 driveway-0.073
 edge-0.070
seek-0.069
 neighborhood-0.068
 bank-0.067
Token climbed
Token ID19952
Feature activation+4.54e-02

Pos logit contributions
 asleep+0.016
robe+0.016
 driveway+0.015
seek+0.015
ASH+0.014
Neg logit contributions
ww-0.015
mmm-0.015
www-0.014
 importantly-0.014
dd-0.013
Token and
Token ID290
Feature activation+2.09e-02

Pos logit contributions
robe+0.093
 driveway+0.092
seek+0.092
stick+0.089
izons+0.087
Neg logit contributions
ww-0.062
mmm-0.060
 though-0.056
Run-0.055
 much-0.054
Token climbed
Token ID19952
Feature activation+4.13e-02

Pos logit contributions
robe+0.043
 asleep+0.042
 neighborhood+0.041
 driveway+0.040
 neighbourhood+0.040
Neg logit contributions
ww-0.033
mmm-0.031
www-0.030
dd-0.029
Run-0.029
Token,
Token ID11
Feature activation+1.33e-02

Pos logit contributions
 driveway+0.097
robe+0.096
seek+0.093
stick+0.092
ASH+0.092
Neg logit contributions
mmm-0.047
ww-0.044
 importantly-0.040
 though-0.038
 much-0.038
Token until
Token ID1566
Feature activation-2.71e-02

Pos logit contributions
robe+0.032
ASH+0.031
seek+0.030
 neighborhood+0.030
 driveway+0.030
Neg logit contributions
 though-0.014
mmm-0.014
ww-0.013
 importantly-0.012
 hour-0.012
Token they
Token ID484
Feature activation+2.07e-02

Pos logit contributions
mmm+0.036
ww+0.035
 importantly+0.033
www+0.031
dd+0.026
Neg logit contributions
 asleep-0.067
 driveway-0.067
robe-0.065
 bank-0.064
seek-0.063
Token reached
Token ID4251
Feature activation+2.28e-02

Pos logit contributions
 asleep+0.041
robe+0.040
 driveway+0.040
stick+0.039
seek+0.039
Neg logit contributions
mmm-0.034
ww-0.033
dd-0.030
 importantly-0.029
www-0.029
Token the
Token ID262
Feature activation+5.45e-02

Pos logit contributions
robe+0.060
stick+0.059
seek+0.058
 driveway+0.057
 bank+0.054
Neg logit contributions
mmm-0.022
ww-0.020
 importantly-0.020
 much-0.019
 unexpected-0.016
Token top
Token ID1353
Feature activation-7.29e-04

Pos logit contributions
seek+0.080
stick+0.077
 asleep+0.075
robe+0.072
ASH+0.068
Neg logit contributions
ww-0.120
mmm-0.115
www-0.110
 importantly-0.107
 hour-0.103
Token of
Token ID286
Feature activation-7.96e-03

Pos logit contributions
mmm+0.001
ww+0.001
 importantly+0.001
www+0.001
 hour+0.001
Neg logit contributions
robe-0.002
 asleep-0.002
 driveway-0.002
seek-0.002
 bank-0.002
Token in
Token ID287
Feature activation-1.68e-02

Pos logit contributions
mmm+0.054
ww+0.054
 importantly+0.050
www+0.049
dd+0.044
Neg logit contributions
 driveway-0.054
 asleep-0.054
robe-0.053
seek-0.051
 bank-0.051
Token our
Token ID674
Feature activation+4.22e-02

Pos logit contributions
mmm+0.029
ww+0.028
 importantly+0.026
www+0.026
dd+0.024
Neg logit contributions
 asleep-0.034
 driveway-0.034
robe-0.033
 bank-0.032
seek-0.032
Token family
Token ID1641
Feature activation-4.57e-03

Pos logit contributions
robe+0.061
seek+0.057
 asleep+0.055
stick+0.054
 driveway+0.048
Neg logit contributions
ww-0.097
mmm-0.094
 importantly-0.093
www-0.091
 hour-0.089
Token."
Token ID526
Feature activation+2.91e-03

Pos logit contributions
mmm+0.008
ww+0.008
 importantly+0.008
www+0.008
dd+0.007
Neg logit contributions
 driveway-0.009
 asleep-0.009
robe-0.009
seek-0.008
ASH-0.008
Token
Token ID198
Feature activation-2.34e-02

Pos logit contributions
 driveway+0.006
 asleep+0.005
robe+0.005
seek+0.005
 bank+0.005
Neg logit contributions
mmm-0.005
ww-0.005
 importantly-0.005
www-0.005
dd-0.004
Token
Token ID198
Feature activation-3.83e-02

Pos logit contributions
mmm+0.052
ww+0.052
www+0.049
 importantly+0.049
dd+0.045
Neg logit contributions
 asleep-0.035
 driveway-0.034
robe-0.034
seek-0.032
 bank-0.032
TokenL
Token ID43
Feature activation-5.00e-02

Pos logit contributions
mmm+0.068
ww+0.066
 importantly+0.063
www+0.062
dd+0.056
Neg logit contributions
robe-0.071
 driveway-0.071
seek-0.069
 asleep-0.068
 neighborhood-0.067
Tokenily
Token ID813
Feature activation-5.70e-03

Pos logit contributions
mmm+0.092
ww+0.092
www+0.084
 importantly+0.084
dd+0.076
Neg logit contributions
robe-0.095
 asleep-0.094
 driveway-0.091
seek-0.088
ASH-0.086
Token smiled
Token ID13541
Feature activation+2.13e-03

Pos logit contributions
mmm+0.011
ww+0.010
 importantly+0.009
www+0.009
dd+0.009
Neg logit contributions
 asleep-0.011
 driveway-0.011
robe-0.011
seek-0.010
 bank-0.010
Token and
Token ID290
Feature activation-1.89e-02

Pos logit contributions
 driveway+0.004
robe+0.004
 asleep+0.004
seek+0.004
 bank+0.004
Neg logit contributions
mmm-0.004
ww-0.004
 importantly-0.004
www-0.003
dd-0.003
Token felt
Token ID2936
Feature activation-5.35e-02

Pos logit contributions
mmm+0.032
ww+0.031
 importantly+0.030
www+0.028
 hour+0.025
Neg logit contributions
 asleep-0.038
 driveway-0.037
robe-0.037
 bank-0.036
seek-0.036
Token a
Token ID257
Feature activation-4.11e-02

Pos logit contributions
 driveway+0.022
 asleep+0.022
robe+0.022
 bank+0.021
seek+0.021
Neg logit contributions
mmm-0.028
ww-0.028
 importantly-0.026
www-0.025
dd-0.024
Token time
Token ID640
Feature activation+1.02e-02

Pos logit contributions
mmm+0.089
ww+0.084
 importantly+0.083
www+0.080
dd+0.072
Neg logit contributions
 driveway-0.061
 neighborhood-0.060
 asleep-0.058
 neighbourhood-0.056
seek-0.055
Token,
Token ID11
Feature activation-4.03e-03

Pos logit contributions
 asleep+0.018
robe+0.017
 driveway+0.017
seek+0.016
pit+0.016
Neg logit contributions
mmm-0.020
ww-0.020
www-0.019
 importantly-0.019
dd-0.017
Token there
Token ID612
Feature activation+5.03e-02

Pos logit contributions
mmm+0.010
ww+0.009
www+0.009
 importantly+0.008
dd+0.008
Neg logit contributions
 asleep-0.006
robe-0.006
 driveway-0.006
 edge-0.005
pit-0.005
Token was
Token ID373
Feature activation-4.72e-03

Pos logit contributions
 asleep+0.077
 driveway+0.076
robe+0.074
seek+0.071
 bank+0.071
Neg logit contributions
mmm-0.111
ww-0.110
 importantly-0.104
www-0.102
dd-0.095
Token a
Token ID257
Feature activation-5.25e-02

Pos logit contributions
mmm+0.012
ww+0.012
www+0.011
 importantly+0.011
dd+0.010
Neg logit contributions
 asleep-0.006
robe-0.005
 driveway-0.005
ASH-0.005
seek-0.005
Token girl
Token ID2576
Feature activation-3.98e-02

Pos logit contributions
mmm+0.128
ww+0.113
 importantly+0.112
dd+0.108
www+0.107
Neg logit contributions
 neighborhood-0.075
 driveway-0.075
 asleep-0.074
 garden-0.073
 kitchen-0.073
Token called
Token ID1444
Feature activation+1.07e-02

Pos logit contributions
mmm+0.085
ww+0.076
www+0.071
 importantly+0.066
mmmm+0.065
Neg logit contributions
 driveway-0.073
 neighborhood-0.072
 kitchen-0.070
 garden-0.070
stick-0.070
Token Dawn
Token ID12258
Feature activation-2.49e-02

Pos logit contributions
 asleep+0.023
 driveway+0.023
robe+0.022
seek+0.022
 bank+0.021
Neg logit contributions
mmm-0.017
ww-0.017
 importantly-0.016
www-0.015
dd-0.014
Token.
Token ID13
Feature activation+2.41e-02

Pos logit contributions
mmm+0.058
ww+0.056
 importantly+0.053
www+0.053
dd+0.048
Neg logit contributions
 asleep-0.038
robe-0.037
 driveway-0.037
 bank-0.035
seek-0.034
Token One
Token ID1881
Feature activation-6.33e-03

Pos logit contributions
 driveway+0.039
 asleep+0.038
robe+0.036
seek+0.036
 bank+0.036
Neg logit contributions
mmm-0.050
ww-0.050
 importantly-0.048
www-0.046
dd-0.042
Token"
Token ID1
Feature activation+2.53e-04

Pos logit contributions
ww+0.065
mmm+0.065
 importantly+0.062
www+0.057
 gust+0.052
Neg logit contributions
 driveway-0.081
 edge-0.076
seek-0.075
 grounds-0.075
 neighborhood-0.074
TokenWhat
Token ID2061
Feature activation+2.43e-02

Pos logit contributions
 driveway+0.000
 asleep+0.000
robe+0.000
seek+0.000
 bank+0.000
Neg logit contributions
mmm-0.000
ww-0.000
 importantly-0.000
www-0.000
dd-0.000
Token is
Token ID318
Feature activation-3.87e-02

Pos logit contributions
robe+0.033
 asleep+0.033
 driveway+0.032
seek+0.031
 neighborhood+0.030
Neg logit contributions
mmm-0.055
ww-0.055
www-0.052
 importantly-0.051
dd-0.049
Token it
Token ID340
Feature activation-4.59e-03

Pos logit contributions
mmm+0.076
 importantly+0.073
ww+0.072
www+0.067
heart+0.063
Neg logit contributions
 asleep-0.064
 kitchen-0.063
 driveway-0.062
 bathroom-0.061
 bank-0.061
Token,
Token ID11
Feature activation-3.26e-02

Pos logit contributions
mmm+0.007
ww+0.007
 importantly+0.006
www+0.006
dd+0.005
Neg logit contributions
 asleep-0.011
robe-0.010
 driveway-0.010
 bank-0.010
seek-0.010
Token Jake
Token ID14757
Feature activation-5.92e-02

Pos logit contributions
ww+0.063
mmm+0.060
www+0.057
 importantly+0.057
dd+0.056
Neg logit contributions
 asleep-0.058
 driveway-0.057
robe-0.057
 bathroom-0.055
 edge-0.054
Token?"
Token ID1701
Feature activation-1.85e-02

Pos logit contributions
mmm+0.102
ww+0.099
 importantly+0.098
www+0.088
dd+0.081
Neg logit contributions
 asleep-0.116
 driveway-0.114
robe-0.112
 bank-0.110
 neighborhood-0.107
Token Tom
Token ID4186
Feature activation-2.66e-02

Pos logit contributions
ww+0.040
mmm+0.038
www+0.036
dd+0.033
 importantly+0.033
Neg logit contributions
 asleep-0.032
 driveway-0.031
 bathroom-0.029
robe-0.029
 neighborhood-0.028
Token asked
Token ID1965
Feature activation+7.04e-03

Pos logit contributions
mmm+0.050
ww+0.050
 importantly+0.046
www+0.045
dd+0.041
Neg logit contributions
 asleep-0.049
 driveway-0.049
robe-0.048
 bank-0.046
seek-0.046
Token.
Token ID13
Feature activation+1.77e-02

Pos logit contributions
 driveway+0.015
 asleep+0.014
robe+0.014
seek+0.013
 bank+0.013
Neg logit contributions
mmm-0.011
 importantly-0.011
ww-0.011
www-0.010
dd-0.010
Token
Token ID198
Feature activation-1.82e-02

Pos logit contributions
 driveway+0.030
 asleep+0.030
 bank+0.029
seek+0.029
robe+0.029
Neg logit contributions
mmm-0.036
ww-0.034
 importantly-0.034
www-0.032
dd-0.029
Token soft
Token ID2705
Feature activation+4.48e-02

Pos logit contributions
seek+0.069
 asleep+0.067
robe+0.067
pit+0.065
 driveway+0.065
Neg logit contributions
ww-0.085
mmm-0.083
 importantly-0.079
www-0.075
 painful-0.073
Token fur
Token ID9230
Feature activation+1.62e-02

Pos logit contributions
seek+0.072
 grounds+0.071
iddle+0.068
pit+0.067
 driveway+0.067
Neg logit contributions
ww-0.085
 importantly-0.073
mmm-0.073
www-0.071
 hour-0.070
Token.
Token ID13
Feature activation+8.63e-03

Pos logit contributions
 driveway+0.040
 meets+0.039
ASH+0.038
 workshop+0.037
 neighborhood+0.037
Neg logit contributions
mmm-0.018
 badly-0.017
ww-0.016
 importantly-0.015
 though-0.015
Token Lily
Token ID20037
Feature activation-1.72e-03

Pos logit contributions
 driveway+0.018
 asleep+0.018
robe+0.018
seek+0.018
 bank+0.017
Neg logit contributions
mmm-0.014
ww-0.014
 importantly-0.013
www-0.012
dd-0.011
Token was
Token ID373
Feature activation-3.55e-02

Pos logit contributions
mmm+0.003
ww+0.003
 importantly+0.003
www+0.003
dd+0.003
Neg logit contributions
 driveway-0.003
 asleep-0.003
robe-0.003
seek-0.003
 bank-0.003
Token so
Token ID523
Feature activation-5.67e-02

Pos logit contributions
mmm+0.055
ww+0.054
 importantly+0.050
www+0.048
dd+0.043
Neg logit contributions
 asleep-0.078
 driveway-0.078
robe-0.076
seek-0.074
 bank-0.074
Token happy
Token ID3772
Feature activation-3.02e-02

Pos logit contributions
mmm+0.106
ww+0.102
www+0.098
 importantly+0.094
Run+0.089
Neg logit contributions
 asleep-0.103
robe-0.100
 driveway-0.097
seek-0.094
 neighborhood-0.092
Token to
Token ID284
Feature activation+1.37e-02

Pos logit contributions
mmm+0.054
ww+0.053
 importantly+0.050
www+0.048
dd+0.044
Neg logit contributions
 driveway-0.059
 asleep-0.059
robe-0.058
 bank-0.056
seek-0.056
Token have
Token ID423
Feature activation+2.74e-03

Pos logit contributions
 asleep+0.023
 driveway+0.023
robe+0.022
seek+0.021
 bank+0.021
Neg logit contributions
mmm-0.028
ww-0.028
 importantly-0.026
www-0.026
dd-0.024
Token her
Token ID607
Feature activation+1.52e-02

Pos logit contributions
 driveway+0.005
robe+0.005
 asleep+0.005
 bank+0.005
pit+0.005
Neg logit contributions
ww-0.005
mmm-0.005
 importantly-0.005
www-0.005
dd-0.004
Token t
Token ID256
Feature activation-1.24e-02

Pos logit contributions
 asleep+0.020
 driveway+0.020
robe+0.019
seek+0.018
 bank+0.018
Neg logit contributions
mmm-0.037
ww-0.037
 importantly-0.035
www-0.034
dd-0.032
Token and
Token ID290
Feature activation-3.34e-02

Pos logit contributions
ww+0.005
mmm+0.005
 importantly+0.005
www+0.005
 though+0.004
Neg logit contributions
 driveway-0.007
robe-0.007
seek-0.007
 asleep-0.007
 edge-0.006
Token Dad
Token ID17415
Feature activation-1.05e-03

Pos logit contributions
ww+0.041
mmm+0.041
www+0.036
 importantly+0.035
dd+0.029
Neg logit contributions
robe-0.083
 asleep-0.080
 driveway-0.080
seek-0.078
stick-0.078
Token.
Token ID13
Feature activation+5.74e-03

Pos logit contributions
 importantly+0.001
ww+0.001
mmm+0.001
 badly+0.001
 though+0.001
Neg logit contributions
 driveway-0.002
robe-0.002
ASH-0.002
seek-0.002
stick-0.002
Token They
Token ID1119
Feature activation-3.31e-05

Pos logit contributions
 asleep+0.008
 driveway+0.008
robe+0.007
seek+0.007
 bank+0.007
Neg logit contributions
mmm-0.014
ww-0.014
www-0.013
 importantly-0.013
dd-0.012
Token are
Token ID389
Feature activation-3.40e-02

Pos logit contributions
mmm+0.000
ww+0.000
 importantly+0.000
www+0.000
dd+0.000
Neg logit contributions
 asleep-0.000
 driveway-0.000
robe-0.000
seek-0.000
ASH-0.000
Token glad
Token ID9675
Feature activation-5.45e-02

Pos logit contributions
mmm+0.054
ww+0.053
 importantly+0.048
www+0.048
dd+0.042
Neg logit contributions
 asleep-0.075
 driveway-0.073
robe-0.072
seek-0.070
 bank-0.070
Token they
Token ID484
Feature activation+2.48e-02

Pos logit contributions
mmm+0.079
ww+0.079
www+0.071
 importantly+0.067
dd+0.061
Neg logit contributions
 asleep-0.129
 driveway-0.122
robe-0.120
seek-0.118
 bank-0.117
Token are
Token ID389
Feature activation-2.83e-02

Pos logit contributions
 driveway+0.048
 asleep+0.048
pit+0.047
robe+0.045
ASH+0.045
Neg logit contributions
ww-0.043
mmm-0.042
www-0.040
dd-0.038
 importantly-0.038
Token not
Token ID407
Feature activation-2.32e-02

Pos logit contributions
mmm+0.049
ww+0.049
 importantly+0.047
www+0.044
dd+0.041
Neg logit contributions
 driveway-0.057
 asleep-0.055
robe-0.055
seek-0.054
 bank-0.054
Token crocod
Token ID37565
Feature activation-3.96e-02

Pos logit contributions
ww+0.036
mmm+0.036
 importantly+0.034
www+0.032
dd+0.029
Neg logit contributions
 driveway-0.050
robe-0.050
 asleep-0.049
seek-0.047
 bank-0.047
Tokenile
Token ID576
Feature activation-2.91e-02

Pos logit contributions
ww+0.050
mmm+0.049
 importantly+0.047
www+0.043
dd+0.040
Neg logit contributions
 driveway-0.102
 asleep-0.096
robe-0.095
 bank-0.093
seek-0.090
Token upon
Token ID2402
Feature activation+1.18e-02

Pos logit contributions
mmm+0.048
ww+0.048
www+0.045
 importantly+0.045
dd+0.042
Neg logit contributions
 asleep-0.019
 driveway-0.017
robe-0.017
 neighborhood-0.015
pit-0.014
Token a
Token ID257
Feature activation-4.64e-02

Pos logit contributions
 driveway+0.020
 asleep+0.020
robe+0.019
 bank+0.019
seek+0.019
Neg logit contributions
mmm-0.024
ww-0.024
 importantly-0.022
www-0.022
dd-0.020
Token time
Token ID640
Feature activation+5.05e-03

Pos logit contributions
mmm+0.100
ww+0.094
 importantly+0.093
www+0.089
Run+0.081
Neg logit contributions
 neighborhood-0.070
 driveway-0.070
 asleep-0.068
 neighbourhood-0.065
robe-0.063
Token there
Token ID612
Feature activation+5.03e-02

Pos logit contributions
 asleep+0.009
robe+0.008
 driveway+0.008
pit+0.008
seek+0.008
Neg logit contributions
mmm-0.010
ww-0.010
www-0.009
 importantly-0.009
dd-0.008
Token was
Token ID373
Feature activation-9.02e-03

Pos logit contributions
 asleep+0.075
 driveway+0.075
robe+0.073
seek+0.070
 bank+0.070
Neg logit contributions
mmm-0.113
ww-0.111
 importantly-0.105
www-0.103
dd-0.096
Token a
Token ID257
Feature activation-5.17e-02

Pos logit contributions
mmm+0.023
ww+0.023
www+0.022
 importantly+0.022
keys+0.019
Neg logit contributions
 asleep-0.011
 driveway-0.010
robe-0.010
 kitchen-0.009
ASH-0.009
Token big
Token ID1263
Feature activation-3.06e-02

Pos logit contributions
mmm+0.126
 importantly+0.112
ww+0.110
www+0.106
dd+0.105
Neg logit contributions
 neighborhood-0.073
 garden-0.071
 driveway-0.071
 asleep-0.071
 kitchen-0.070
Token,
Token ID11
Feature activation-4.49e-03

Pos logit contributions
mmm+0.075
 importantly+0.071
ww+0.069
dd+0.066
www+0.066
Neg logit contributions
 asleep-0.043
 driveway-0.043
 bank-0.039
 neighborhood-0.039
 kitchen-0.039
Token red
Token ID2266
Feature activation-2.90e-03

Pos logit contributions
mmm+0.011
ww+0.010
 importantly+0.010
dd+0.009
www+0.009
Neg logit contributions
 driveway-0.006
 asleep-0.006
 kitchen-0.006
 garden-0.006
robe-0.006
Token bird
Token ID6512
Feature activation-2.39e-02

Pos logit contributions
ww+0.006
mmm+0.006
www+0.006
 importantly+0.006
dd+0.005
Neg logit contributions
 asleep-0.004
 driveway-0.004
seek-0.004
robe-0.004
 bank-0.004
Token.
Token ID13
Feature activation+3.25e-02

Pos logit contributions
mmm+0.042
ww+0.039
www+0.036
 importantly+0.036
dd+0.032
Neg logit contributions
 asleep-0.049
robe-0.049
 driveway-0.048
 bank-0.048
stick-0.047