TOP ACTIVATIONS
MAX = 8.368

When Daisy put the magn ifying glass down , the cater
flashlight , and a magn ifying glass . As
a flashlight and a magn ifying glass to examine their finds
it she had a magn ifying glass , a tiny torch
special folder . Inside the folder was something very weird .
She takes out a magn ifying glass . She looks at
in the fields where the corn grew tall . One day
. They had a magn ifying glass and a small box
. Lily had a magn ifying glass that she found in
in the fields where the hay was kept . He would
the necklace and the magn ifying glass and started to examine
went to the forest to pick some berries . She saw
shiny . " Who lives there ?" Sara asks
He brought his magn ifying glass and looked closely .
She had a big magn ifying glass that made things look
, handing him the magn ifying glass . She looked around
mighty jar . Inside the jar were lots of different things
Who â s there
book case . On the book case was a big jar
mom my went out to pick berries . Fred loved picking

MIN ACTIVATIONS
MIN = -7.437

for you . Goodbye , Mir a and R avi ."
You are too young !" Fre ya was sad but she
that !" Charl otte thanked her mother for
when she was done , Fre ya realized that the stair
too dangerous to use . Fre ya was very disappointed and
impressed . Ang ie was so proud of
foot . Mar ina 's mom said ,
Chicken were happy and thanked Farmer Brown . They brought the
" Don 't worry , Mir a . We can buy
feel better . Sally thanked Doctor Bob and went home with
was so happy and thanked Gig i and Duck ie .
, give it back !" Mir a said . " We
uncle ever ." Un cle Ben hugged Lily and
and said , " Good girl , Lily . Good boy
save the baby birds !" Twe ety said .
They all smiled and congratulated Cas per on his amazing accomplishment
" Sorry , Farmer Joe ," Sam says .
and found her . Poor H ilda never managed to get
welcome here ." Un cle Jack gr umbled and
, " Good job , Pe ppa . You did a

INTERVAL 4.417 - 8.368
CONTAINS 0.922%

having lots of fun ! <|endoftext|> Once upon a time ,
a lot of fun . <|endoftext|> Sam was a tall boy
Once upon a time , there was a princess . She
<|endoftext|> Once upon a time there were two friends , Bob
Once upon a time , there was a brave man called

INTERVAL 0.465 - 4.417
CONTAINS 37.942%

liked to bounce on . He liked to feel the pillow
that was different . It was a toy robot that had
beautiful picture and the slow deer was very proud of it
cat . The cat has shiny fur and bright eyes .
to watch them and he noticed how much they seemed to

INTERVAL -3.486 - 0.465
CONTAINS 59.472%

it 's the best !" Lily says . Mom
" Tom , help me !" Tom ran to
, she said goodbye to her fake kingdom and ran off
. They felt scared and sorry . They said , "
The little boy stared up at the nurse with admiration in

INTERVAL -7.437 - -3.486
CONTAINS 1.665%

mail to send ," he said . He picked up the
wayne for his help . D wayne was still a bit
. You can still be my friend ." And Tim felt
Ben felt more scared and ashamed . He felt alone and
You 're such a talented artist !" <|endoftext|> Once upon a
Token When
Token ID1649
Feature activation+7.17e-01

Pos logit contributions
 hall+0.046
 garden+0.046
 forest+0.046
 hunt+0.046
 outdoors+0.046
Neg logit contributions
 understanding-0.062
 accomplishment-0.059
 apology-0.058
iously-0.057
mmm-0.057
Token Daisy
Token ID40355
Feature activation+7.67e-01

Pos logit contributions
c+0.017
 outdoors+0.017
 forest+0.016
tub+0.016
bowl+0.016
Neg logit contributions
 accomplishment-0.019
 understanding-0.018
iously-0.017
 apology-0.017
mmm-0.016
Token put
Token ID1234
Feature activation+3.86e+00

Pos logit contributions
 outdoors+0.016
tub+0.016
 hall+0.015
 sandbox+0.015
 pots+0.015
Neg logit contributions
 understanding-0.021
iously-0.021
 accomplishment-0.021
 apology-0.020
mmm-0.019
Token the
Token ID262
Feature activation+2.81e+00

Pos logit contributions
tub+0.084
 hunt+0.082
 hall+0.080
 forest+0.077
 met+0.077
Neg logit contributions
 accomplishment-0.113
 understanding-0.109
 apology-0.101
iously-0.097
mmm-0.095
Token magn
Token ID7842
Feature activation+2.64e+00

Pos logit contributions
 met+0.046
c+0.044
 hunt+0.044
 hall+0.043
tub+0.043
Neg logit contributions
 apology-0.092
 understanding-0.091
 accomplishment-0.090
 efforts-0.082
iously-0.081
Tokenifying
Token ID4035
Feature activation+8.37e+00

Pos logit contributions
 forest+0.079
 hall+0.079
 garden+0.078
 park+0.078
 hunt+0.077
Neg logit contributions
iously-0.054
 understanding-0.048
 accomplishment-0.047
mmm-0.046
 Deal-0.045
Token glass
Token ID5405
Feature activation+3.91e+00

Pos logit contributions
 outdoors+0.159
 met+0.148
 hunt+0.147
 hall+0.146
 grounds+0.141
Neg logit contributions
 accomplishment-0.243
mmm-0.241
 understanding-0.234
 apology-0.233
 Deal-0.223
Token down
Token ID866
Feature activation+2.12e+00

Pos logit contributions
 gra+0.098
 met+0.098
 hunt+0.098
 woods+0.096
 forest+0.096
Neg logit contributions
 understanding-0.079
 accomplishment-0.079
iously-0.076
 apology-0.075
mmm-0.073
Token,
Token ID11
Feature activation+1.51e+00

Pos logit contributions
 hunt+0.050
c+0.049
 forest+0.049
 hall+0.049
 outdoors+0.048
Neg logit contributions
 accomplishment-0.055
 understanding-0.055
 apology-0.051
iously-0.051
mmm-0.048
Token the
Token ID262
Feature activation+5.13e-01

Pos logit contributions
tub+0.040
c+0.040
 pool+0.039
 hunt+0.039
bowl+0.039
Neg logit contributions
 understanding-0.035
 accomplishment-0.034
iously-0.029
 efforts-0.029
 apology-0.028
Token cater
Token ID20825
Feature activation+1.09e+00

Pos logit contributions
c+0.009
 hunt+0.008
 outdoors+0.008
bowl+0.008
 met+0.008
Neg logit contributions
 understanding-0.017
 accomplishment-0.016
 apology-0.016
iously-0.015
 efforts-0.015
Token flashlight
Token ID38371
Feature activation+2.78e+00

Pos logit contributions
c+0.013
 outdoors+0.012
 forest+0.012
 met+0.012
 hunt+0.012
Neg logit contributions
 understanding-0.026
 accomplishment-0.026
 apology-0.024
 efforts-0.023
 Deal-0.022
Token,
Token ID11
Feature activation+2.36e+00

Pos logit contributions
bowl+0.067
c+0.065
 outdoors+0.065
 woods+0.064
 forest+0.064
Neg logit contributions
 understanding-0.064
iously-0.064
 accomplishment-0.064
 efforts-0.057
 apology-0.055
Token and
Token ID290
Feature activation+3.92e-01

Pos logit contributions
tub+0.057
 hall+0.056
 forest+0.056
bowl+0.055
 pool+0.054
Neg logit contributions
 understanding-0.062
 accomplishment-0.054
iously-0.053
 apology-0.051
 efforts-0.050
Token a
Token ID257
Feature activation+2.86e-01

Pos logit contributions
tub+0.008
 outdoors+0.008
 forest+0.008
c+0.008
 hall+0.008
Neg logit contributions
 understanding-0.012
 accomplishment-0.011
 apology-0.010
iously-0.010
 efforts-0.010
Token magn
Token ID7842
Feature activation+2.51e+00

Pos logit contributions
c+0.005
 outdoors+0.004
 met+0.004
 hunt+0.004
 hall+0.004
Neg logit contributions
 understanding-0.010
 accomplishment-0.009
 apology-0.009
 efforts-0.009
 Deal-0.008
Tokenifying
Token ID4035
Feature activation+7.90e+00

Pos logit contributions
 garden+0.073
 forest+0.072
 hall+0.072
 sandbox+0.072
 park+0.070
Neg logit contributions
iously-0.055
 understanding-0.049
 accomplishment-0.049
 efforts-0.046
oor-0.046
Token glass
Token ID5405
Feature activation+2.56e+00

Pos logit contributions
 outdoors+0.149
 met+0.143
 hunt+0.141
 grounds+0.137
 hall+0.132
Neg logit contributions
 accomplishment-0.225
 understanding-0.215
mmm-0.213
 apology-0.209
 satisfaction-0.204
Token.
Token ID13
Feature activation+1.51e+00

Pos logit contributions
 met+0.060
 forest+0.057
 outdoors+0.057
 woods+0.057
 gra+0.057
Neg logit contributions
 accomplishment-0.058
 understanding-0.058
iously-0.055
 apology-0.054
 efforts-0.054
Token
Token ID198
Feature activation+1.30e+00

Pos logit contributions
 hunt+0.034
 forest+0.033
 garden+0.033
 hall+0.033
 outdoors+0.033
Neg logit contributions
 understanding-0.041
 accomplishment-0.039
 apology-0.039
mmm-0.038
iously-0.038
Token
Token ID198
Feature activation-5.29e-01

Pos logit contributions
 hall+0.025
 hunt+0.025
 garden+0.025
 forest+0.024
 outdoors+0.024
Neg logit contributions
 understanding-0.040
 accomplishment-0.038
 apology-0.037
mmm-0.037
iously-0.036
TokenAs
Token ID1722
Feature activation-1.44e+00

Pos logit contributions
 understanding+0.012
 accomplishment+0.012
iously+0.012
 apology+0.012
mmm+0.011
Neg logit contributions
 hall-0.014
 garden-0.014
 forest-0.014
 outdoors-0.014
c-0.014
Token a
Token ID257
Feature activation+1.94e+00

Pos logit contributions
tub+0.093
c+0.089
bowl+0.087
ware+0.084
robe+0.084
Neg logit contributions
 understanding-0.091
 efforts-0.078
 accomplishment-0.078
 caring-0.075
 attempts-0.073
Token flashlight
Token ID38371
Feature activation+2.42e+00

Pos logit contributions
c+0.038
tub+0.034
 outdoors+0.034
 fields+0.034
bowl+0.034
Neg logit contributions
 accomplishment-0.056
 understanding-0.056
 apology-0.052
 efforts-0.050
 caring-0.048
Token and
Token ID290
Feature activation+1.17e+00

Pos logit contributions
c+0.066
bowl+0.064
 met+0.064
 outdoors+0.063
 woods+0.063
Neg logit contributions
iously-0.050
 understanding-0.050
 accomplishment-0.049
 efforts-0.045
olding-0.040
Token a
Token ID257
Feature activation+1.60e+00

Pos logit contributions
 outdoors+0.025
c+0.024
 forest+0.023
tub+0.023
 hall+0.022
Neg logit contributions
 understanding-0.034
 efforts-0.031
 accomplishment-0.031
 apology-0.030
iously-0.029
Token magn
Token ID7842
Feature activation+2.86e+00

Pos logit contributions
c+0.028
 outdoors+0.025
rob+0.024
 met+0.024
bowl+0.024
Neg logit contributions
 understanding-0.052
 accomplishment-0.050
 apology-0.048
 efforts-0.045
 Deal-0.043
Tokenifying
Token ID4035
Feature activation+7.48e+00

Pos logit contributions
 garden+0.087
 hall+0.087
 sandbox+0.086
 park+0.086
 forest+0.085
Neg logit contributions
iously-0.054
oor-0.045
 efforts-0.045
 accomplishment-0.044
 Deal-0.044
Token glass
Token ID5405
Feature activation+2.70e+00

Pos logit contributions
 outdoors+0.139
 met+0.131
 paints+0.124
bowl+0.123
c+0.122
Neg logit contributions
 accomplishment-0.216
 understanding-0.206
 apology-0.204
mmm-0.200
 Deal-0.199
Token to
Token ID284
Feature activation+4.47e+00

Pos logit contributions
c+0.061
 met+0.060
 outdoors+0.059
 forest+0.056
 hall+0.056
Neg logit contributions
 accomplishment-0.066
 understanding-0.065
iously-0.063
 apology-0.063
 efforts-0.063
Token examine
Token ID10716
Feature activation+2.55e+00

Pos logit contributions
tub+0.096
 hall+0.087
 outdoors+0.087
 sandbox+0.086
 woods+0.086
Neg logit contributions
 understanding-0.130
 accomplishment-0.124
 apology-0.124
 thank-0.111
mmm-0.109
Token their
Token ID511
Feature activation-4.44e-01

Pos logit contributions
 hall+0.063
 sandbox+0.062
tub+0.062
 met+0.060
 hunt+0.060
Neg logit contributions
 understanding-0.060
 accomplishment-0.057
iously-0.057
 apology-0.055
 efforts-0.052
Token finds
Token ID7228
Feature activation+8.44e-01

Pos logit contributions
 accomplishment+0.014
 understanding+0.014
 apology+0.013
 efforts+0.013
iously+0.013
Neg logit contributions
c-0.008
 outdoors-0.008
tub-0.008
 hall-0.008
rob-0.008
Token it
Token ID340
Feature activation+2.44e+00

Pos logit contributions
tub+0.047
 outdoors+0.046
c+0.046
 hunt+0.046
bowl+0.045
Neg logit contributions
 accomplishment-0.069
 understanding-0.068
 apology-0.063
iously-0.062
 efforts-0.061
Token she
Token ID673
Feature activation+2.76e+00

Pos logit contributions
rob+0.076
bowl+0.070
 paints+0.070
 matches+0.070
 pots+0.069
Neg logit contributions
 accomplishment-0.043
 understanding-0.043
 efforts-0.035
iously-0.034
 decision-0.033
Token had
Token ID550
Feature activation+3.93e+00

Pos logit contributions
 outdoors+0.049
 sofa+0.041
 pots+0.041
 sandbox+0.040
 matches+0.040
Neg logit contributions
 accomplishment-0.085
 understanding-0.085
 efforts-0.080
iously-0.077
 apology-0.076
Token a
Token ID257
Feature activation+1.39e+00

Pos logit contributions
tub+0.088
bowl+0.083
 hall+0.083
 pool+0.083
c+0.079
Neg logit contributions
 understanding-0.115
 accomplishment-0.105
iously-0.100
 efforts-0.099
From-0.093
Token magn
Token ID7842
Feature activation+2.68e+00

Pos logit contributions
c+0.029
tub+0.028
 hall+0.027
 fields+0.027
 outdoors+0.027
Neg logit contributions
 understanding-0.041
 accomplishment-0.040
 apology-0.037
 efforts-0.035
 caring-0.033
Tokenifying
Token ID4035
Feature activation+6.95e+00

Pos logit contributions
 hall+0.080
 garden+0.078
 sandbox+0.077
 forest+0.077
 park+0.077
Neg logit contributions
iously-0.056
 understanding-0.053
 accomplishment-0.051
 efforts-0.048
mmm-0.046
Token glass
Token ID5405
Feature activation+2.92e+00

Pos logit contributions
 outdoors+0.130
 hall+0.120
 paints+0.119
 met+0.118
bowl+0.117
Neg logit contributions
 accomplishment-0.204
 understanding-0.201
 apology-0.194
mmm-0.191
 Deal-0.185
Token,
Token ID11
Feature activation+1.19e+00

Pos logit contributions
 met+0.072
 outdoors+0.072
c+0.071
 gra+0.071
 hall+0.070
Neg logit contributions
 understanding-0.064
 accomplishment-0.062
iously-0.059
 efforts-0.058
 apology-0.056
Token a
Token ID257
Feature activation+1.00e+00

Pos logit contributions
 hall+0.026
tub+0.026
 pool+0.025
 forest+0.025
c+0.025
Neg logit contributions
 understanding-0.035
 accomplishment-0.032
iously-0.030
 apology-0.029
mmm-0.029
Token tiny
Token ID7009
Feature activation+1.11e+00

Pos logit contributions
c+0.019
 outdoors+0.018
 hall+0.017
 fields+0.017
 met+0.017
Neg logit contributions
 understanding-0.032
 accomplishment-0.031
 apology-0.029
 efforts-0.027
mmm-0.027
Token torch
Token ID28034
Feature activation+2.64e+00

Pos logit contributions
 hall+0.018
c+0.018
 outdoors+0.018
 met+0.018
tub+0.017
Neg logit contributions
 understanding-0.036
 accomplishment-0.035
 apology-0.034
 efforts-0.033
iously-0.033
Token special
Token ID2041
Feature activation+2.13e+00

Pos logit contributions
c+0.044
 hall+0.043
tub+0.043
 outdoors+0.043
 fields+0.042
Neg logit contributions
 understanding-0.062
 accomplishment-0.060
 apology-0.058
 efforts-0.055
iously-0.053
Token folder
Token ID9483
Feature activation+2.75e+00

Pos logit contributions
c+0.039
 hall+0.038
 fields+0.037
bowl+0.037
 met+0.037
Neg logit contributions
 understanding-0.066
 accomplishment-0.066
 apology-0.064
 efforts-0.060
iously-0.059
Token.
Token ID13
Feature activation+2.89e+00

Pos logit contributions
 forest+0.058
 garden+0.058
 hall+0.058
 woods+0.057
c+0.057
Neg logit contributions
 understanding-0.077
 accomplishment-0.073
 apology-0.070
iously-0.068
 efforts-0.066
Token Inside
Token ID14384
Feature activation+5.07e+00

Pos logit contributions
c+0.069
 hunt+0.068
 forest+0.066
bowl+0.066
 met+0.066
Neg logit contributions
 understanding-0.072
 accomplishment-0.070
 apology-0.067
mmm-0.065
 Deal-0.064
Token the
Token ID262
Feature activation+4.05e+00

Pos logit contributions
 hunt+0.131
bowl+0.127
 outdoors+0.124
 cloak+0.123
 sandbox+0.122
Neg logit contributions
 accomplishment-0.098
oral-0.089
 understanding-0.088
ama-0.084
mmm-0.082
Token folder
Token ID9483
Feature activation+6.89e+00

Pos logit contributions
c+0.066
 forest+0.065
 outdoors+0.064
 pool+0.064
 hunt+0.063
Neg logit contributions
 understanding-0.127
 accomplishment-0.126
 apology-0.122
 efforts-0.117
oral-0.115
Token was
Token ID373
Feature activation+2.36e+00

Pos logit contributions
 gra+0.172
 outdoors+0.153
 valleys+0.151
bowl+0.148
 hunt+0.145
Neg logit contributions
 understanding-0.153
oral-0.143
 accomplishment-0.138
mmm-0.131
From-0.130
Token something
Token ID1223
Feature activation+2.64e+00

Pos logit contributions
 forest+0.036
 hall+0.036
 outdoors+0.035
 garden+0.035
 hunt+0.035
Neg logit contributions
 understanding-0.081
 accomplishment-0.080
 apology-0.076
iously-0.076
mmm-0.073
Token very
Token ID845
Feature activation+8.67e-01

Pos logit contributions
c+0.060
 hall+0.060
 forest+0.058
 fields+0.057
 outdoors+0.056
Neg logit contributions
 understanding-0.073
 accomplishment-0.069
 apology-0.066
iously-0.062
 efforts-0.061
Token weird
Token ID7650
Feature activation+1.57e+00

Pos logit contributions
 hall+0.020
 forest+0.019
 garden+0.019
 outdoors+0.019
c+0.019
Neg logit contributions
 understanding-0.024
 accomplishment-0.024
 apology-0.023
iously-0.022
mmm-0.021
Token.
Token ID13
Feature activation+3.16e+00

Pos logit contributions
c+0.039
 sandbox+0.037
bowl+0.037
 hall+0.036
 gra+0.036
Neg logit contributions
 understanding-0.037
iously-0.036
 accomplishment-0.036
 apology-0.034
 efforts-0.033
Token She
Token ID1375
Feature activation+1.16e+00

Pos logit contributions
 accomplishment+0.013
 understanding+0.013
 apology+0.012
iously+0.012
mmm+0.012
Neg logit contributions
 garden-0.007
 hall-0.007
 forest-0.007
 outdoors-0.007
c-0.007
Token takes
Token ID2753
Feature activation+7.75e-01

Pos logit contributions
 outdoors+0.026
tub+0.025
 hall+0.025
c+0.025
bowl+0.025
Neg logit contributions
 accomplishment-0.032
 understanding-0.032
 apology-0.031
iously-0.030
 efforts-0.029
Token out
Token ID503
Feature activation+1.92e-03

Pos logit contributions
 hall+0.012
 forest+0.012
 garden+0.012
 hunt+0.012
tub+0.012
Neg logit contributions
 accomplishment-0.027
 understanding-0.026
 apology-0.025
iously-0.025
 efforts-0.024
Token a
Token ID257
Feature activation+1.10e+00

Pos logit contributions
 forest+0.000
 hunt+0.000
 garden+0.000
 hall+0.000
 outdoors+0.000
Neg logit contributions
 understanding-0.000
 accomplishment-0.000
 apology-0.000
iously-0.000
 efforts-0.000
Token magn
Token ID7842
Feature activation+1.67e+00

Pos logit contributions
 fields+0.021
tub+0.021
c+0.020
 met+0.020
rob+0.020
Neg logit contributions
 understanding-0.035
 accomplishment-0.034
 apology-0.032
 efforts-0.029
oral-0.028
Tokenifying
Token ID4035
Feature activation+6.83e+00

Pos logit contributions
 hall+0.046
 garden+0.045
 forest+0.045
 park+0.044
 woods+0.044
Neg logit contributions
iously-0.039
 understanding-0.038
 accomplishment-0.038
 apology-0.035
 efforts-0.035
Token glass
Token ID5405
Feature activation+2.24e+00

Pos logit contributions
 outdoors+0.129
 met+0.118
 hunt+0.116
 hall+0.116
 grounds+0.111
Neg logit contributions
 accomplishment-0.208
 understanding-0.205
 apology-0.200
mmm-0.196
iously-0.187
Token.
Token ID13
Feature activation+1.35e+00

Pos logit contributions
 met+0.052
 outdoors+0.051
 forest+0.049
 woods+0.049
 hunt+0.049
Neg logit contributions
 understanding-0.059
 accomplishment-0.058
 apology-0.056
iously-0.054
 efforts-0.053
Token She
Token ID1375
Feature activation+2.08e+00

Pos logit contributions
 forest+0.028
 hall+0.028
 hunt+0.028
 outdoors+0.027
 garden+0.027
Neg logit contributions
 understanding-0.039
 accomplishment-0.038
 apology-0.037
mmm-0.036
iously-0.036
Token looks
Token ID3073
Feature activation+5.25e-01

Pos logit contributions
 outdoors+0.047
tub+0.046
bowl+0.045
 hall+0.045
c+0.044
Neg logit contributions
 accomplishment-0.058
 understanding-0.058
 apology-0.055
iously-0.054
 efforts-0.052
Token at
Token ID379
Feature activation+1.05e+00

Pos logit contributions
 hall+0.011
 forest+0.011
 garden+0.011
c+0.011
 hunt+0.011
Neg logit contributions
 understanding-0.015
 accomplishment-0.015
 apology-0.014
iously-0.014
 efforts-0.014
Token in
Token ID287
Feature activation+3.75e+00

Pos logit contributions
 met+0.047
 hunt+0.047
tub+0.046
c+0.045
 woods+0.044
Neg logit contributions
 understanding-0.072
 accomplishment-0.067
iously-0.066
mmm-0.062
 efforts-0.061
Token the
Token ID262
Feature activation+4.66e+00

Pos logit contributions
tub+0.104
 met+0.096
robe+0.093
bowl+0.093
 grounds+0.092
Neg logit contributions
 understanding-0.087
 accomplishment-0.077
mmm-0.067
 apology-0.067
From-0.066
Token fields
Token ID7032
Feature activation+3.23e+00

Pos logit contributions
ware+0.107
 hunt+0.099
 matches+0.097
rob+0.091
tub+0.089
Neg logit contributions
 understanding-0.118
 bye-0.115
From-0.109
um-0.108
oral-0.099
Token where
Token ID810
Feature activation+3.86e+00

Pos logit contributions
rob+0.092
 matches+0.088
 met+0.088
 paints+0.087
 sandbox+0.087
Neg logit contributions
 understanding-0.065
 accomplishment-0.055
iously-0.049
olding-0.049
 hour-0.049
Token the
Token ID262
Feature activation+1.78e+00

Pos logit contributions
 hunt+0.114
c+0.106
bowl+0.104
rob+0.102
 outdoors+0.100
Neg logit contributions
 understanding-0.077
 accomplishment-0.065
 accidents-0.063
 bye-0.062
 efforts-0.060
Token corn
Token ID11676
Feature activation+6.60e+00

Pos logit contributions
c+0.032
bowl+0.029
tub+0.029
 matches+0.028
 hunt+0.028
Neg logit contributions
 understanding-0.055
 accomplishment-0.051
iously-0.050
 bye-0.049
 apology-0.049
Token grew
Token ID6348
Feature activation+1.01e+00

Pos logit contributions
 outdoors+0.154
 sandbox+0.129
ptions+0.128
 outer+0.127
rob+0.126
Neg logit contributions
 accomplishment-0.127
iously-0.127
oral-0.126
 understanding-0.123
 actions-0.123
Token tall
Token ID7331
Feature activation+1.41e+00

Pos logit contributions
 hall+0.020
 hunt+0.019
 garden+0.019
c+0.019
 forest+0.019
Neg logit contributions
 understanding-0.030
 accomplishment-0.029
iously-0.028
 apology-0.027
mmm-0.026
Token.
Token ID13
Feature activation+3.28e+00

Pos logit contributions
 met+0.035
 hall+0.035
bowl+0.034
tub+0.034
 outdoors+0.034
Neg logit contributions
iously-0.033
 understanding-0.032
 accomplishment-0.031
mmm-0.028
 apology-0.028
Token One
Token ID1881
Feature activation+1.56e+00

Pos logit contributions
 hunt+0.081
tub+0.081
 matches+0.080
bowl+0.079
 forest+0.078
Neg logit contributions
 understanding-0.075
 apology-0.071
mmm-0.070
 accomplishment-0.069
 Deal-0.068
Token day
Token ID1110
Feature activation+3.36e-01

Pos logit contributions
 garden+0.030
 hall+0.029
 forest+0.029
 outdoors+0.029
c+0.028
Neg logit contributions
 understanding-0.049
 accomplishment-0.049
iously-0.047
 apology-0.047
mmm-0.045
Token.
Token ID13
Feature activation+1.82e+00

Pos logit contributions
 hall+0.030
tub+0.029
 garage+0.029
 woods+0.029
 backyard+0.029
Neg logit contributions
 understanding-0.019
iously-0.018
 accomplishment-0.017
 efforts-0.016
 apology-0.016
Token They
Token ID1119
Feature activation+2.09e+00

Pos logit contributions
 hall+0.038
 forest+0.037
 hunt+0.037
 garden+0.037
 outdoors+0.037
Neg logit contributions
 understanding-0.052
 accomplishment-0.051
 apology-0.049
iously-0.049
mmm-0.048
Token had
Token ID550
Feature activation+2.88e+00

Pos logit contributions
tub+0.042
 outdoors+0.041
 forest+0.040
c+0.040
 pool+0.039
Neg logit contributions
 understanding-0.063
 accomplishment-0.055
 efforts-0.054
iously-0.053
 apologised-0.051
Token a
Token ID257
Feature activation+1.09e+00

Pos logit contributions
tub+0.069
 hall+0.065
c+0.065
bowl+0.064
rob+0.063
Neg logit contributions
 understanding-0.078
 efforts-0.065
 accomplishment-0.065
 caring-0.064
iously-0.063
Token magn
Token ID7842
Feature activation+2.35e+00

Pos logit contributions
c+0.024
 hall+0.023
tub+0.023
 outdoors+0.023
 fields+0.023
Neg logit contributions
 understanding-0.030
 accomplishment-0.029
 apology-0.027
 efforts-0.026
oral-0.025
Tokenifying
Token ID4035
Feature activation+6.60e+00

Pos logit contributions
 hall+0.075
 garden+0.074
 forest+0.073
 park+0.072
 woods+0.072
Neg logit contributions
iously-0.044
 efforts-0.038
 accomplishment-0.037
 understanding-0.037
 Deal-0.036
Token glass
Token ID5405
Feature activation+2.67e+00

Pos logit contributions
 outdoors+0.117
 met+0.109
 paints+0.107
c+0.105
bowl+0.105
Neg logit contributions
 accomplishment-0.197
 understanding-0.192
 apology-0.190
mmm-0.185
 Deal-0.182
Token and
Token ID290
Feature activation+4.45e-01

Pos logit contributions
c+0.060
 outdoors+0.060
 met+0.059
 hall+0.057
 forest+0.057
Neg logit contributions
 understanding-0.066
 accomplishment-0.065
 apology-0.063
iously-0.062
 efforts-0.062
Token a
Token ID257
Feature activation+1.07e+00

Pos logit contributions
 outdoors+0.009
c+0.009
 forest+0.009
 hall+0.009
tub+0.008
Neg logit contributions
 understanding-0.014
 accomplishment-0.013
 efforts-0.012
 apology-0.012
iously-0.012
Token small
Token ID1402
Feature activation+1.27e+00

Pos logit contributions
c+0.018
 outdoors+0.017
 met+0.016
 hall+0.016
 fields+0.016
Neg logit contributions
 understanding-0.036
 accomplishment-0.035
 apology-0.034
 efforts-0.031
 Deal-0.031
Token box
Token ID3091
Feature activation+2.27e+00

Pos logit contributions
c+0.019
 met+0.018
 outdoors+0.018
tub+0.018
 hall+0.018
Neg logit contributions
 understanding-0.043
 accomplishment-0.043
 apology-0.042
 efforts-0.040
iously-0.039
Token.
Token ID13
Feature activation+1.75e+00

Pos logit contributions
 hall+0.004
 sandbox+0.004
 forest+0.003
 woods+0.003
 gra+0.003
Neg logit contributions
 understanding-0.003
iously-0.002
 accomplishment-0.002
 efforts-0.002
mmm-0.002
Token Lily
Token ID20037
Feature activation+2.02e+00

Pos logit contributions
 hunt+0.040
 hall+0.040
 outdoors+0.039
c+0.039
 forest+0.039
Neg logit contributions
 understanding-0.045
 accomplishment-0.045
 apology-0.043
 Deal-0.041
iously-0.041
Token had
Token ID550
Feature activation+3.44e+00

Pos logit contributions
 outdoors+0.046
tub+0.043
 hall+0.042
c+0.042
 pool+0.042
Neg logit contributions
 understanding-0.054
 accomplishment-0.049
iously-0.049
 efforts-0.047
 apologised-0.044
Token a
Token ID257
Feature activation+1.73e+00

Pos logit contributions
tub+0.086
bowl+0.081
c+0.077
 pool+0.077
 hall+0.076
Neg logit contributions
 understanding-0.093
 accomplishment-0.079
 efforts-0.078
iously-0.074
 caring-0.073
Token magn
Token ID7842
Feature activation+2.43e+00

Pos logit contributions
c+0.035
 fields+0.033
 outdoors+0.032
bowl+0.032
 outside+0.031
Neg logit contributions
 understanding-0.052
 accomplishment-0.051
 apology-0.047
 efforts-0.044
 thank-0.043
Tokenifying
Token ID4035
Feature activation+6.53e+00

Pos logit contributions
 hall+0.076
 garden+0.075
 sandbox+0.074
 forest+0.074
 park+0.074
Neg logit contributions
iously-0.047
 understanding-0.042
 efforts-0.041
 accomplishment-0.041
mmm-0.039
Token glass
Token ID5405
Feature activation+2.40e+00

Pos logit contributions
 outdoors+0.127
 met+0.113
 paints+0.111
 fields+0.110
 hall+0.110
Neg logit contributions
 accomplishment-0.191
 apology-0.187
 understanding-0.186
mmm-0.182
 Deal-0.176
Token that
Token ID326
Feature activation+3.78e+00

Pos logit contributions
 outdoors+0.060
c+0.059
 met+0.059
 hall+0.058
 gra+0.057
Neg logit contributions
 understanding-0.054
 accomplishment-0.053
iously-0.051
 apology-0.051
 efforts-0.050
Token she
Token ID673
Feature activation+1.61e+00

Pos logit contributions
c+0.099
 outdoors+0.098
 hall+0.096
 sandbox+0.093
 forest+0.093
Neg logit contributions
 understanding-0.084
 accomplishment-0.081
iously-0.074
mmm-0.074
 efforts-0.071
Token found
Token ID1043
Feature activation+1.31e+00

Pos logit contributions
 hall+0.033
 outdoors+0.033
 pool+0.033
 fields+0.032
 valley+0.032
Neg logit contributions
 understanding-0.046
 accomplishment-0.043
iously-0.042
 efforts-0.041
 apology-0.041
Token in
Token ID287
Feature activation+1.29e+00

Pos logit contributions
 hall+0.031
 sandbox+0.030
 pool+0.029
tub+0.028
bowl+0.028
Neg logit contributions
 understanding-0.032
iously-0.032
 accomplishment-0.031
 efforts-0.029
 apology-0.028
Token in
Token ID287
Feature activation+3.64e+00

Pos logit contributions
 met+0.055
c+0.053
 garden+0.051
tub+0.051
 hall+0.051
Neg logit contributions
 understanding-0.071
 accomplishment-0.067
iously-0.066
 efforts-0.060
 hour-0.060
Token the
Token ID262
Feature activation+4.58e+00

Pos logit contributions
tub+0.101
 met+0.097
 grounds+0.094
c+0.093
bows+0.093
Neg logit contributions
 understanding-0.079
 accomplishment-0.074
 apology-0.062
From-0.060
Less-0.057
Token fields
Token ID7032
Feature activation+3.54e+00

Pos logit contributions
ware+0.099
 hunt+0.090
 matches+0.089
tub+0.083
pter+0.081
Neg logit contributions
 understanding-0.119
 bye-0.117
From-0.111
um-0.107
 relief-0.102
Token where
Token ID810
Feature activation+3.70e+00

Pos logit contributions
 paints+0.099
 met+0.096
 matches+0.095
 sandbox+0.095
 hall+0.094
Neg logit contributions
 understanding-0.069
 accomplishment-0.061
iously-0.054
 hour-0.053
 badly-0.052
Token the
Token ID262
Feature activation+1.74e+00

Pos logit contributions
c+0.108
 hunt+0.103
 outdoors+0.094
bowl+0.093
nob+0.093
Neg logit contributions
 understanding-0.072
 accomplishment-0.065
 bye-0.064
 accidents-0.058
 punishment-0.056
Token hay
Token ID27678
Feature activation+6.52e+00

Pos logit contributions
c+0.033
bowl+0.029
 hunt+0.029
tub+0.029
 hall+0.028
Neg logit contributions
 understanding-0.053
 accomplishment-0.049
 apology-0.047
 bye-0.046
iously-0.046
Token was
Token ID373
Feature activation+2.02e+00

Pos logit contributions
 outdoors+0.158
ptions+0.143
 outer+0.138
c+0.132
rob+0.131
Neg logit contributions
iously-0.118
 accomplishment-0.113
 understanding-0.112
 actions-0.112
 vowed-0.112
Token kept
Token ID4030
Feature activation+7.97e-01

Pos logit contributions
rob+0.048
 hall+0.048
 sandbox+0.047
c+0.047
 paints+0.046
Neg logit contributions
 understanding-0.050
 accomplishment-0.045
 badly-0.041
iously-0.041
 caring-0.041
Token.
Token ID13
Feature activation+3.41e+00

Pos logit contributions
 hall+0.017
 sandbox+0.017
tub+0.016
 met+0.015
 theater+0.015
Neg logit contributions
 understanding-0.023
 accomplishment-0.022
iously-0.021
 caring-0.020
 badly-0.019
Token He
Token ID679
Feature activation+3.05e+00

Pos logit contributions
 hall+0.080
 outdoors+0.080
 garden+0.079
 met+0.078
 paints+0.078
Neg logit contributions
 understanding-0.083
 accomplishment-0.081
 apology-0.081
mmm-0.078
iously-0.074
Token would
Token ID561
Feature activation+1.67e+00

Pos logit contributions
 outdoors+0.061
tub+0.059
 hall+0.057
 pool+0.056
 garden+0.056
Neg logit contributions
 understanding-0.087
 accomplishment-0.081
 efforts-0.079
iously-0.079
 apology-0.075
Token the
Token ID262
Feature activation+1.16e+00

Pos logit contributions
 understanding+0.002
 accomplishment+0.002
 apology+0.002
iously+0.002
 efforts+0.002
Neg logit contributions
 hunt-0.002
tub-0.002
 forest-0.002
 outdoors-0.002
 hall-0.002
Token necklace
Token ID38987
Feature activation+1.54e+00

Pos logit contributions
 outdoors+0.019
 hall+0.019
tub+0.019
 theater+0.019
bowl+0.018
Neg logit contributions
 apology-0.037
 accomplishment-0.037
 understanding-0.036
 efforts-0.033
iously-0.031
Token and
Token ID290
Feature activation+5.71e-01

Pos logit contributions
 forest+0.039
 outdoors+0.039
 woods+0.039
 hall+0.039
 met+0.039
Neg logit contributions
 understanding-0.035
 accomplishment-0.034
iously-0.034
 apology-0.032
 efforts-0.030
Token the
Token ID262
Feature activation+7.32e-03

Pos logit contributions
 outdoors+0.014
tub+0.013
 forest+0.013
c+0.013
 hall+0.013
Neg logit contributions
 understanding-0.015
 accomplishment-0.014
iously-0.013
 apology-0.013
 efforts-0.013
Token magn
Token ID7842
Feature activation+9.99e-01

Pos logit contributions
 outdoors+0.000
c+0.000
tub+0.000
bowl+0.000
 hall+0.000
Neg logit contributions
 understanding-0.000
 accomplishment-0.000
 apology-0.000
 efforts-0.000
iously-0.000
Tokenifying
Token ID4035
Feature activation+6.47e+00

Pos logit contributions
 hall+0.028
 garden+0.027
 forest+0.027
 park+0.027
 woods+0.026
Neg logit contributions
iously-0.023
 accomplishment-0.021
 understanding-0.021
mmm-0.020
oor-0.020
Token glass
Token ID5405
Feature activation+1.93e+00

Pos logit contributions
 outdoors+0.119
 hunt+0.107
 hall+0.107
 met+0.105
bowl+0.102
Neg logit contributions
 accomplishment-0.198
 understanding-0.195
 apology-0.192
mmm-0.187
iously-0.182
Token and
Token ID290
Feature activation+3.03e-02

Pos logit contributions
 outdoors+0.047
 forest+0.046
 hall+0.045
 met+0.045
 woods+0.045
Neg logit contributions
 accomplishment-0.046
 understanding-0.045
iously-0.045
 apology-0.044
 efforts-0.042
Token started
Token ID2067
Feature activation-5.51e-02

Pos logit contributions
 outdoors+0.001
c+0.001
tub+0.001
 hall+0.001
 forest+0.001
Neg logit contributions
 understanding-0.001
 accomplishment-0.001
iously-0.001
 apology-0.001
 efforts-0.001
Token to
Token ID284
Feature activation+5.81e-01

Pos logit contributions
 understanding+0.001
 accomplishment+0.001
iously+0.001
 efforts+0.001
 apology+0.001
Neg logit contributions
tub-0.001
 forest-0.001
bowl-0.001
 hall-0.001
rob-0.001
Token examine
Token ID10716
Feature activation+1.52e+00

Pos logit contributions
tub+0.013
rob+0.012
boards+0.012
 woods+0.012
 mountains+0.011
Neg logit contributions
 accomplishment-0.017
 understanding-0.016
 thank-0.015
 apology-0.015
 promise-0.014
Token went
Token ID1816
Feature activation+2.29e+00

Pos logit contributions
tub+0.017
 outdoors+0.017
 mountains+0.017
 hall+0.017
 pool+0.016
Neg logit contributions
 understanding-0.022
 accomplishment-0.020
 apology-0.019
iously-0.018
mmm-0.018
Token to
Token ID284
Feature activation+3.86e+00

Pos logit contributions
c+0.049
 sandbox+0.047
 forest+0.046
 garden+0.046
 hall+0.046
Neg logit contributions
 understanding-0.063
 accomplishment-0.061
 apology-0.059
iously-0.058
olding-0.055
Token the
Token ID262
Feature activation+4.52e+00

Pos logit contributions
tub+0.097
c+0.092
bowl+0.091
 grounds+0.087
 sandbox+0.086
Neg logit contributions
 understanding-0.101
 accomplishment-0.095
 apology-0.088
mmm-0.081
 Deal-0.081
Token forest
Token ID8222
Feature activation+2.23e+00

Pos logit contributions
c+0.067
bowl+0.059
 shone+0.055
 valleys+0.054
tub+0.053
Neg logit contributions
 understanding-0.151
 caring-0.139
 bye-0.138
oral-0.137
omm-0.135
Token to
Token ID284
Feature activation+4.47e+00

Pos logit contributions
 paints+0.046
 forest+0.046
 matches+0.046
 garden+0.045
 hall+0.045
Neg logit contributions
 understanding-0.063
 accomplishment-0.057
iously-0.057
olding-0.054
 apology-0.054
Token pick
Token ID2298
Feature activation+6.44e+00

Pos logit contributions
tub+0.102
boards+0.095
bridge+0.091
 grounds+0.091
c+0.090
Neg logit contributions
 understanding-0.118
 thank-0.110
 apology-0.107
 accomplishment-0.105
 relief-0.097
Token some
Token ID617
Feature activation+3.11e+00

Pos logit contributions
 hall+0.133
 hallway+0.130
 garage+0.130
tub+0.126
 theater+0.126
Neg logit contributions
 understanding-0.169
 apology-0.165
iously-0.149
 caring-0.147
 lesson-0.145
Token berries
Token ID36322
Feature activation+3.45e+00

Pos logit contributions
 hall+0.047
tub+0.044
 hunt+0.041
 met+0.040
 theater+0.040
Neg logit contributions
 understanding-0.110
 apology-0.107
 accomplishment-0.105
iously-0.100
mmm-0.099
Token.
Token ID13
Feature activation+3.42e+00

Pos logit contributions
 hall+0.075
 forest+0.073
 garden+0.072
 outdoors+0.071
 woods+0.071
Neg logit contributions
 understanding-0.096
 accomplishment-0.093
iously-0.091
 apology-0.091
mmm-0.088
Token She
Token ID1375
Feature activation+3.15e+00

Pos logit contributions
bowl+0.080
 forest+0.080
c+0.080
rob+0.080
 hunt+0.079
Neg logit contributions
 understanding-0.084
 accomplishment-0.079
 apology-0.077
iously-0.075
mmm-0.074
Token saw
Token ID2497
Feature activation+3.82e+00

Pos logit contributions
 outdoors+0.049
tub+0.047
 hall+0.047
 pool+0.045
 forest+0.045
Neg logit contributions
 understanding-0.106
 accomplishment-0.101
iously-0.098
 efforts-0.095
 apology-0.095
Token shiny
Token ID22441
Feature activation+3.89e-01

Pos logit contributions
 outdoors+0.035
tub+0.033
 forest+0.033
c+0.033
bowl+0.032
Neg logit contributions
 understanding-0.067
 accomplishment-0.066
 apology-0.062
mmm-0.061
iously-0.060
Token.
Token ID13
Feature activation+2.01e+00

Pos logit contributions
 hall+0.009
 met+0.009
c+0.009
 outdoors+0.009
 garden+0.009
Neg logit contributions
 understanding-0.010
iously-0.010
 accomplishment-0.010
 apology-0.010
mmm-0.009
Token
Token ID198
Feature activation+1.52e+00

Pos logit contributions
 garden+0.045
 hall+0.045
 forest+0.044
 outdoors+0.044
 park+0.044
Neg logit contributions
 understanding-0.055
 accomplishment-0.055
 apology-0.053
iously-0.052
 efforts-0.050
Token
Token ID198
Feature activation-5.71e-01

Pos logit contributions
 hall+0.027
 garden+0.027
 forest+0.026
 outdoors+0.026
c+0.026
Neg logit contributions
 understanding-0.049
 accomplishment-0.048
 apology-0.047
iously-0.046
mmm-0.045
Token"
Token ID1
Feature activation+2.28e+00

Pos logit contributions
 accomplishment+0.014
 understanding+0.013
 apology+0.013
iously+0.013
 efforts+0.013
Neg logit contributions
 outdoors-0.015
 park-0.015
 hall-0.015
 garden-0.015
 pool-0.015
TokenWho
Token ID8241
Feature activation+6.39e+00

Pos logit contributions
 hall+0.042
 garden+0.042
 forest+0.042
 outdoors+0.041
 hunt+0.041
Neg logit contributions
 understanding-0.071
 accomplishment-0.067
 apology-0.066
iously-0.066
mmm-0.065
Token lives
Token ID3160
Feature activation+1.37e+00

Pos logit contributions
tub+0.118
 outdoors+0.118
 hall+0.112
 forest+0.109
 woods+0.109
Neg logit contributions
 understanding-0.199
 apology-0.193
mmm-0.183
 efforts-0.180
 accomplishment-0.180
Token there
Token ID612
Feature activation+2.42e+00

Pos logit contributions
 hall+0.026
 sandbox+0.026
tub+0.025
c+0.025
 garden+0.025
Neg logit contributions
 understanding-0.042
 accomplishment-0.038
mmm-0.037
 apology-0.036
iously-0.035
Token?"
Token ID1701
Feature activation-1.41e+00

Pos logit contributions
 sandbox+0.059
 hall+0.058
c+0.057
 garden+0.056
tub+0.056
Neg logit contributions
 understanding-0.058
mmm-0.057
 accomplishment-0.052
iously-0.050
 apology-0.049
Token Sara
Token ID24799
Feature activation-2.13e+00

Pos logit contributions
 accomplishment+0.040
 understanding+0.038
 apology+0.038
iously+0.037
 efforts+0.036
Neg logit contributions
 park-0.032
 garden-0.031
 hall-0.031
 pretend-0.031
 forest-0.030
Token asks
Token ID7893
Feature activation-1.72e+00

Pos logit contributions
 understanding+0.057
 accomplishment+0.054
iously+0.054
 apology+0.054
mmm+0.051
Neg logit contributions
 hall-0.049
 garden-0.048
 forest-0.048
 outdoors-0.048
tub-0.048
Token
Token ID198
Feature activation-3.53e-01

Pos logit contributions
 hall+0.023
 garden+0.023
 forest+0.022
 outdoors+0.022
c+0.022
Neg logit contributions
 understanding-0.042
 accomplishment-0.041
 apology-0.040
iously-0.040
mmm-0.039
TokenHe
Token ID1544
Feature activation+6.26e-01

Pos logit contributions
 accomplishment+0.009
 understanding+0.009
iously+0.009
 apology+0.009
mmm+0.008
Neg logit contributions
 hall-0.009
 garden-0.009
 forest-0.009
 outdoors-0.009
c-0.009
Token brought
Token ID3181
Feature activation+3.05e+00

Pos logit contributions
 hall+0.011
 outdoors+0.011
 garden+0.011
tub+0.011
c+0.011
Neg logit contributions
 understanding-0.020
iously-0.019
 accomplishment-0.019
 apology-0.018
 efforts-0.018
Token his
Token ID465
Feature activation+3.13e-01

Pos logit contributions
 hall+0.073
tub+0.067
 sandbox+0.067
 hunt+0.065
c+0.064
Neg logit contributions
 understanding-0.089
 accomplishment-0.086
 apology-0.080
iously-0.078
 efforts-0.076
Token magn
Token ID7842
Feature activation+2.16e+00

Pos logit contributions
 hall+0.008
 outdoors+0.008
c+0.008
 forest+0.008
tub+0.008
Neg logit contributions
 understanding-0.008
 accomplishment-0.008
 apology-0.007
iously-0.007
 efforts-0.007
Tokenifying
Token ID4035
Feature activation+6.38e+00

Pos logit contributions
 hall+0.061
 forest+0.060
 garden+0.059
 park+0.059
 woods+0.059
Neg logit contributions
iously-0.049
 understanding-0.049
 accomplishment-0.046
 efforts-0.044
 apology-0.042
Token glass
Token ID5405
Feature activation+2.82e+00

Pos logit contributions
 outdoors+0.115
 hall+0.109
 met+0.106
 hunt+0.106
 forest+0.102
Neg logit contributions
 understanding-0.197
 apology-0.195
 accomplishment-0.194
mmm-0.182
iously-0.180
Token and
Token ID290
Feature activation+8.76e-01

Pos logit contributions
 hall+0.059
 met+0.059
 outdoors+0.059
 forest+0.058
 woods+0.057
Neg logit contributions
 understanding-0.076
 accomplishment-0.074
 apology-0.073
iously-0.073
 efforts-0.071
Token looked
Token ID3114
Feature activation+7.88e-01

Pos logit contributions
 outdoors+0.020
 hall+0.020
c+0.019
tub+0.019
 theater+0.019
Neg logit contributions
 understanding-0.024
 accomplishment-0.024
iously-0.023
 efforts-0.022
 apology-0.020
Token closely
Token ID7173
Feature activation+3.52e-01

Pos logit contributions
 hall+0.014
 garden+0.013
 forest+0.013
c+0.013
 outdoors+0.013
Neg logit contributions
 understanding-0.026
 accomplishment-0.026
iously-0.024
 apology-0.024
 efforts-0.023
Token.
Token ID13
Feature activation+1.64e+00

Pos logit contributions
 hall+0.008
 forest+0.008
 garden+0.008
c+0.008
 park+0.008
Neg logit contributions
 understanding-0.010
 accomplishment-0.010
iously-0.009
 apology-0.009
 efforts-0.009
Token She
Token ID1375
Feature activation+2.26e+00

Pos logit contributions
 hall+0.038
 hunt+0.038
 forest+0.037
 garden+0.037
 outdoors+0.037
Neg logit contributions
 understanding-0.056
 accomplishment-0.054
 apology-0.053
iously-0.053
mmm-0.052
Token had
Token ID550
Feature activation+3.25e+00

Pos logit contributions
 outdoors+0.051
tub+0.048
 hall+0.048
 hunt+0.047
 pool+0.046
Neg logit contributions
 understanding-0.063
 accomplishment-0.056
 efforts-0.055
iously-0.054
 apology-0.052
Token a
Token ID257
Feature activation+9.94e-01

Pos logit contributions
tub+0.083
bowl+0.080
rob+0.076
c+0.076
robe+0.076
Neg logit contributions
 understanding-0.084
 accomplishment-0.069
 efforts-0.068
 caring-0.068
iously-0.065
Token big
Token ID1263
Feature activation+1.62e+00

Pos logit contributions
c+0.021
tub+0.020
 fields+0.020
 hall+0.020
 hunt+0.020
Neg logit contributions
 understanding-0.029
 accomplishment-0.028
 apology-0.026
 efforts-0.025
 caring-0.024
Token magn
Token ID7842
Feature activation+2.24e+00

Pos logit contributions
tub+0.030
c+0.030
 outdoors+0.030
 met+0.029
bowl+0.029
Neg logit contributions
 accomplishment-0.049
 understanding-0.049
 apology-0.047
 efforts-0.045
iously-0.043
Tokenifying
Token ID4035
Feature activation+6.38e+00

Pos logit contributions
 hall+0.069
 garden+0.067
 forest+0.066
 park+0.066
 woods+0.066
Neg logit contributions
iously-0.047
 understanding-0.040
 efforts-0.039
 accomplishment-0.039
mmm-0.038
Token glass
Token ID5405
Feature activation+2.43e+00

Pos logit contributions
 outdoors+0.115
 met+0.107
 hall+0.103
bowl+0.103
 hunt+0.103
Neg logit contributions
 accomplishment-0.191
 understanding-0.189
 apology-0.185
mmm-0.182
 Deal-0.175
Token that
Token ID326
Feature activation+4.01e+00

Pos logit contributions
 met+0.060
c+0.060
 outdoors+0.059
 gra+0.058
 hall+0.057
Neg logit contributions
 understanding-0.055
 accomplishment-0.053
 efforts-0.051
iously-0.051
 apology-0.050
Token made
Token ID925
Feature activation+1.02e-01

Pos logit contributions
 outdoors+0.090
 pool+0.088
 sandbox+0.087
c+0.087
 forest+0.086
Neg logit contributions
 understanding-0.101
 accomplishment-0.096
 efforts-0.087
iously-0.086
mmm-0.085
Token things
Token ID1243
Feature activation+3.19e-01

Pos logit contributions
bowl+0.002
 hunt+0.002
 hall+0.002
 forest+0.002
rob+0.002
Neg logit contributions
 accomplishment-0.003
 understanding-0.003
iously-0.002
 apology-0.002
 efforts-0.002
Token look
Token ID804
Feature activation+7.25e-01

Pos logit contributions
 hall+0.007
 forest+0.007
c+0.007
 woods+0.007
rob+0.007
Neg logit contributions
 understanding-0.010
 accomplishment-0.009
iously-0.008
 apology-0.008
 efforts-0.008
Token,
Token ID11
Feature activation-3.41e+00

Pos logit contributions
 accomplishment+0.100
 understanding+0.097
mmm+0.097
 apology+0.096
iously+0.096
Neg logit contributions
 outside-0.045
 garden-0.042
 outdoors-0.041
 pretend-0.040
 hall-0.039
Token handing
Token ID22786
Feature activation+1.43e+00

Pos logit contributions
 accomplishment+0.097
 understanding+0.097
 apology+0.094
iously+0.093
mmm+0.089
Neg logit contributions
 garden-0.072
 hall-0.072
 forest-0.071
 outdoors-0.071
c-0.069
Token him
Token ID683
Feature activation-1.15e+00

Pos logit contributions
 hall+0.032
tub+0.032
 garden+0.031
 forest+0.031
 outdoors+0.031
Neg logit contributions
 understanding-0.039
 accomplishment-0.038
 apology-0.038
iously-0.036
mmm-0.034
Token the
Token ID262
Feature activation+1.21e+00

Pos logit contributions
 understanding+0.034
 accomplishment+0.032
 apology+0.032
iously+0.031
 efforts+0.030
Neg logit contributions
 hall-0.024
 forest-0.023
tub-0.023
 woods-0.023
 outdoors-0.023
Token magn
Token ID7842
Feature activation+1.33e+00

Pos logit contributions
c+0.022
bowl+0.021
 valleys+0.021
 met+0.020
 hall+0.020
Neg logit contributions
 apology-0.040
 understanding-0.038
 accomplishment-0.038
 efforts-0.034
 punishment-0.033
Tokenifying
Token ID4035
Feature activation+6.37e+00

Pos logit contributions
 hall+0.043
 garden+0.043
 forest+0.042
 woods+0.042
 park+0.041
Neg logit contributions
iously-0.024
 understanding-0.021
 accomplishment-0.021
 efforts-0.020
 apology-0.020
Token glass
Token ID5405
Feature activation+1.79e+00

Pos logit contributions
 outdoors+0.123
 met+0.114
 hall+0.110
 hunt+0.110
 grounds+0.106
Neg logit contributions
 apology-0.190
 accomplishment-0.189
 understanding-0.187
mmm-0.179
oral-0.174
Token.
Token ID13
Feature activation+6.28e-01

Pos logit contributions
 outdoors+0.041
 forest+0.040
 hall+0.039
 woods+0.039
 met+0.039
Neg logit contributions
 apology-0.047
 understanding-0.047
 accomplishment-0.047
iously-0.045
 efforts-0.043
Token She
Token ID1375
Feature activation+9.81e-01

Pos logit contributions
 hall+0.012
 forest+0.012
 garden+0.012
 outdoors+0.012
c+0.012
Neg logit contributions
 understanding-0.019
 accomplishment-0.019
 apology-0.018
iously-0.018
mmm-0.018
Token looked
Token ID3114
Feature activation-5.73e-02

Pos logit contributions
 garden+0.020
 hall+0.020
 forest+0.020
 outdoors+0.019
c+0.019
Neg logit contributions
 understanding-0.029
 accomplishment-0.029
 apology-0.028
iously-0.028
mmm-0.027
Token around
Token ID1088
Feature activation-1.01e+00

Pos logit contributions
 understanding+0.002
 accomplishment+0.002
iously+0.001
 apology+0.001
 efforts+0.001
Neg logit contributions
 hall-0.001
 forest-0.001
 garden-0.001
c-0.001
 hunt-0.001
Token mighty
Token ID18680
Feature activation+2.10e-02

Pos logit contributions
 accomplishment+0.018
iously+0.018
 understanding+0.018
 apology+0.018
mmm+0.017
Neg logit contributions
 garden-0.017
 forest-0.016
 hall-0.016
 park-0.016
 outdoors-0.016
Token jar
Token ID17379
Feature activation+2.68e+00

Pos logit contributions
 hall+0.000
c+0.000
 outdoors+0.000
 garden+0.000
 forest+0.000
Neg logit contributions
 understanding-0.001
 accomplishment-0.001
 apology-0.001
iously-0.001
mmm-0.001
Token.
Token ID13
Feature activation+3.46e+00

Pos logit contributions
 pretend+0.063
 paints+0.063
 hall+0.063
 garden+0.062
 matches+0.062
Neg logit contributions
 understanding-0.067
 accomplishment-0.059
iously-0.058
olding-0.058
 apology-0.058
Token Inside
Token ID14384
Feature activation+4.70e+00

Pos logit contributions
 garden+0.070
 hall+0.069
 forest+0.069
 outdoors+0.068
 hunt+0.068
Neg logit contributions
 understanding-0.102
 accomplishment-0.100
 apology-0.096
iously-0.096
mmm-0.095
Token the
Token ID262
Feature activation+3.74e+00

Pos logit contributions
 outdoors+0.109
 hunt+0.109
tub+0.105
 sandbox+0.104
 pretend+0.103
Neg logit contributions
 accomplishment-0.107
 understanding-0.103
oral-0.100
mmm-0.100
iously-0.099
Token jar
Token ID17379
Feature activation+6.36e+00

Pos logit contributions
 outdoors+0.049
 hunt+0.048
c+0.048
 hall+0.048
 pool+0.047
Neg logit contributions
 understanding-0.128
 accomplishment-0.127
 apology-0.123
mmm-0.120
 efforts-0.119
Token were
Token ID547
Feature activation+2.13e+00

Pos logit contributions
 gra+0.140
 pretend+0.136
 hunt+0.134
 outdoors+0.133
 valleys+0.127
Neg logit contributions
 understanding-0.158
oral-0.150
 accomplishment-0.142
mmm-0.139
um-0.139
Token lots
Token ID6041
Feature activation-2.91e-01

Pos logit contributions
 hall+0.043
 outdoors+0.042
 hunt+0.042
 forest+0.042
c+0.041
Neg logit contributions
 understanding-0.063
 accomplishment-0.061
iously-0.060
 apology-0.058
mmm-0.058
Token of
Token ID286
Feature activation+1.77e+00

Pos logit contributions
 understanding+0.007
 accomplishment+0.007
 apology+0.006
iously+0.006
mmm+0.006
Neg logit contributions
 hall-0.008
 garden-0.008
 forest-0.008
 outdoors-0.008
c-0.008
Token different
Token ID1180
Feature activation+1.28e+00

Pos logit contributions
 hall+0.034
c+0.033
tub+0.033
 hunt+0.033
 forest+0.032
Neg logit contributions
 understanding-0.055
 accomplishment-0.054
 apology-0.052
iously-0.050
mmm-0.049
Token things
Token ID1243
Feature activation+1.44e+00

Pos logit contributions
 hall+0.022
 hunt+0.021
tub+0.021
c+0.020
 outdoors+0.020
Neg logit contributions
 understanding-0.041
 accomplishment-0.040
iously-0.040
 apology-0.039
 efforts-0.038
Token
Token ID6184
Feature activation-2.28e+00

Pos logit contributions
c+0.023
 hall+0.023
 garden+0.023
 forest+0.022
 outdoors+0.022
Neg logit contributions
 understanding-0.033
 accomplishment-0.032
iously-0.031
 apology-0.031
mmm-0.030
Token
Token ID95
Feature activation-6.91e-01

Pos logit contributions
 understanding+0.059
 accomplishment+0.058
iously+0.056
 apology+0.056
mmm+0.055
Neg logit contributions
 hall-0.054
 garden-0.054
 forest-0.053
 outdoors-0.053
c-0.053
Token
Token ID26391
Feature activation-1.78e+00

Pos logit contributions
 understanding+0.020
 accomplishment+0.019
iously+0.019
 apology+0.019
mmm+0.019
Neg logit contributions
 outdoors-0.014
 hall-0.014
 garden-0.014
 paints-0.014
 pots-0.014
Token
Token ID129
Feature activation-5.88e-01

Pos logit contributions
 understanding+0.046
mmm+0.046
 accomplishment+0.045
iously+0.044
 efforts+0.042
Neg logit contributions
 paints-0.038
 outdoors-0.038
 matches-0.038
 pool-0.038
 garden-0.037
Token
Token ID241
Feature activation+4.61e-01

Pos logit contributions
 understanding+0.014
 accomplishment+0.014
iously+0.014
 apology+0.014
mmm+0.013
Neg logit contributions
 hall-0.015
 garden-0.015
 outdoors-0.014
c-0.014
 forest-0.014
TokenWho
Token ID8241
Feature activation+6.35e+00

Pos logit contributions
 hall+0.010
 garden+0.010
 forest+0.010
 matches+0.010
 hunt+0.010
Neg logit contributions
 understanding-0.013
 accomplishment-0.013
 apology-0.012
iously-0.012
mmm-0.012
Tokenâ
Token ID22940
Feature activation-1.56e+00

Pos logit contributions
 outdoors+0.132
 hall+0.129
tub+0.127
 fields+0.125
 valley+0.123
Neg logit contributions
 understanding-0.186
 apology-0.171
 efforts-0.165
 accomplishment-0.161
iously-0.161
Token
Token ID26391
Feature activation-9.64e-01

Pos logit contributions
 accomplishment+0.037
 understanding+0.037
 apology+0.036
mmm+0.036
iously+0.035
Neg logit contributions
 garden-0.039
 hall-0.039
 forest-0.038
 pots-0.038
 fields-0.038
Token
Token ID8151
Feature activation+8.80e-01

Pos logit contributions
mmm+0.027
 understanding+0.027
 accomplishment+0.026
iously+0.025
 efforts+0.025
Neg logit contributions
 paints-0.020
 matches-0.019
 pots-0.018
 garden-0.018
 sandbox-0.018
Tokens
Token ID82
Feature activation-1.62e-01

Pos logit contributions
 hall+0.018
 garden+0.018
 forest+0.018
 outdoors+0.018
c+0.018
Neg logit contributions
 understanding-0.026
 accomplishment-0.026
 apology-0.025
iously-0.025
mmm-0.023
Token there
Token ID612
Feature activation+2.07e+00

Pos logit contributions
 understanding+0.004
iously+0.004
 accomplishment+0.004
 efforts+0.004
mmm+0.004
Neg logit contributions
c-0.004
rob-0.004
 sandbox-0.004
 paints-0.004
 outdoors-0.004
Token book
Token ID1492
Feature activation+3.13e+00

Pos logit contributions
 hall+0.006
 outdoors+0.006
 garden+0.006
c+0.006
 forest+0.006
Neg logit contributions
 understanding-0.013
 accomplishment-0.013
 apology-0.012
iously-0.012
mmm-0.012
Tokencase
Token ID7442
Feature activation+1.18e+00

Pos logit contributions
 garden+0.066
c+0.065
 outdoors+0.065
 hall+0.064
 forest+0.064
Neg logit contributions
 understanding-0.088
 accomplishment-0.085
 apology-0.081
iously-0.081
mmm-0.079
Token.
Token ID13
Feature activation+3.27e+00

Pos logit contributions
 garden+0.021
 hall+0.021
 outdoors+0.021
 forest+0.021
c+0.021
Neg logit contributions
 understanding-0.037
 accomplishment-0.036
iously-0.035
 apology-0.034
 efforts-0.033
Token On
Token ID1550
Feature activation+3.60e+00

Pos logit contributions
 garden+0.062
 hall+0.061
 forest+0.061
 outdoors+0.060
c+0.060
Neg logit contributions
 understanding-0.101
 accomplishment-0.099
 apology-0.096
iously-0.095
mmm-0.094
Token the
Token ID262
Feature activation+2.53e+00

Pos logit contributions
 outdoors+0.078
bowl+0.076
tub+0.075
 met+0.075
 hunt+0.074
Neg logit contributions
 accomplishment-0.101
 understanding-0.098
iously-0.093
mmm-0.093
 apology-0.090
Token book
Token ID1492
Feature activation+6.35e+00

Pos logit contributions
 garden+0.030
 outdoors+0.030
 hall+0.030
 forest+0.029
c+0.029
Neg logit contributions
 understanding-0.095
 accomplishment-0.095
 apology-0.092
iously-0.092
mmm-0.089
Tokencase
Token ID7442
Feature activation+4.39e+00

Pos logit contributions
 gra+0.128
 pots+0.126
 outdoors+0.124
bowl+0.124
 hunt+0.121
Neg logit contributions
 understanding-0.167
oral-0.163
 accomplishment-0.160
mmm-0.153
 apology-0.150
Token was
Token ID373
Feature activation+2.64e+00

Pos logit contributions
 outdoors+0.082
 pots+0.079
bowl+0.078
 garden+0.078
tub+0.077
Neg logit contributions
 understanding-0.127
 accomplishment-0.121
iously-0.118
 efforts-0.114
 apology-0.113
Token a
Token ID257
Feature activation+6.61e-01

Pos logit contributions
bowl+0.043
c+0.042
 outdoors+0.042
 forest+0.042
 garden+0.042
Neg logit contributions
 understanding-0.089
 accomplishment-0.087
iously-0.081
 apology-0.081
 efforts-0.078
Token big
Token ID1263
Feature activation+7.41e-01

Pos logit contributions
 hall+0.013
 garden+0.013
 outdoors+0.013
c+0.013
 forest+0.013
Neg logit contributions
 understanding-0.020
 accomplishment-0.020
 apology-0.019
iously-0.019
mmm-0.018
Token jar
Token ID17379
Feature activation+3.08e+00

Pos logit contributions
c+0.011
 outdoors+0.011
tub+0.011
bowl+0.010
 met+0.010
Neg logit contributions
 accomplishment-0.026
 understanding-0.026
 apology-0.024
iously-0.023
 efforts-0.023
Token mom
Token ID1995
Feature activation+2.09e+00

Pos logit contributions
 hall+0.011
c+0.011
 garden+0.010
 outdoors+0.010
 forest+0.010
Neg logit contributions
 understanding-0.009
 accomplishment-0.009
 apology-0.009
iously-0.008
 efforts-0.008
Tokenmy
Token ID1820
Feature activation+1.69e+00

Pos logit contributions
 hall+0.037
 hunt+0.037
 outdoors+0.037
tub+0.036
 garden+0.036
Neg logit contributions
 understanding-0.065
 accomplishment-0.060
 apology-0.059
iously-0.059
mmm-0.058
Token went
Token ID1816
Feature activation+2.38e+00

Pos logit contributions
 hall+0.034
tub+0.033
 garden+0.033
 hunt+0.033
c+0.033
Neg logit contributions
 understanding-0.048
 accomplishment-0.045
iously-0.044
 apology-0.043
mmm-0.043
Token out
Token ID503
Feature activation+1.37e+00

Pos logit contributions
c+0.058
 sandbox+0.054
 forest+0.054
 hall+0.053
bowl+0.052
Neg logit contributions
 understanding-0.059
 accomplishment-0.057
 apology-0.054
iously-0.052
olding-0.051
Token to
Token ID284
Feature activation+3.74e+00

Pos logit contributions
 hall+0.025
 garden+0.025
 forest+0.025
c+0.024
 outdoors+0.024
Neg logit contributions
 understanding-0.043
 accomplishment-0.042
 apology-0.041
iously-0.040
 efforts-0.039
Token pick
Token ID2298
Feature activation+6.33e+00

Pos logit contributions
tub+0.096
 grounds+0.094
bridge+0.092
boards+0.090
rob+0.089
Neg logit contributions
 understanding-0.086
 accomplishment-0.078
 apology-0.076
 thank-0.070
 Sister-0.067
Token berries
Token ID36322
Feature activation+3.92e+00

Pos logit contributions
 hall+0.160
 grounds+0.153
 hallway+0.148
 hunt+0.148
 theater+0.144
Neg logit contributions
 apology-0.148
 understanding-0.143
um-0.133
 accomplishment-0.133
mmm-0.131
Token.
Token ID13
Feature activation+3.21e+00

Pos logit contributions
 hall+0.094
 forest+0.091
 garden+0.091
 woods+0.089
 sandbox+0.088
Neg logit contributions
 understanding-0.100
 accomplishment-0.096
iously-0.096
 apology-0.094
mmm-0.093
Token Fred
Token ID8559
Feature activation+2.70e+00

Pos logit contributions
c+0.076
 hall+0.076
 outdoors+0.073
 forest+0.073
 paints+0.071
Neg logit contributions
 understanding-0.083
 accomplishment-0.081
iously-0.079
 apology-0.079
mmm-0.075
Token loved
Token ID6151
Feature activation+1.92e+00

Pos logit contributions
 hall+0.050
tub+0.050
 theater+0.048
 outdoors+0.047
 pool+0.047
Neg logit contributions
 understanding-0.082
iously-0.079
 accomplishment-0.074
mmm-0.073
 efforts-0.073
Token picking
Token ID10868
Feature activation+5.67e+00

Pos logit contributions
tub+0.045
 hall+0.045
 grounds+0.042
 theater+0.042
 garage+0.042
Neg logit contributions
 understanding-0.051
 accomplishment-0.046
iously-0.045
 apology-0.044
 caring-0.044
Token for
Token ID329
Feature activation-1.93e+00

Pos logit contributions
 understanding+0.047
 accomplishment+0.045
 apology+0.042
iously+0.041
mmm+0.041
Neg logit contributions
 hall-0.043
 hunt-0.041
 garden-0.041
 forest-0.040
 woods-0.040
Token you
Token ID345
Feature activation-2.64e+00

Pos logit contributions
 understanding+0.050
 accomplishment+0.049
 apology+0.047
iously+0.046
mmm+0.045
Neg logit contributions
 hall-0.047
 garden-0.047
 forest-0.046
c-0.046
 outdoors-0.046
Token.
Token ID13
Feature activation-6.22e-01

Pos logit contributions
 understanding+0.069
 accomplishment+0.065
iously+0.063
 apology+0.063
 efforts+0.061
Neg logit contributions
 hall-0.063
 forest-0.062
 garden-0.062
 woods-0.061
 sandbox-0.060
Token Goodbye
Token ID44442
Feature activation-3.90e+00

Pos logit contributions
 understanding+0.016
 accomplishment+0.016
 apology+0.015
iously+0.015
mmm+0.014
Neg logit contributions
 hall-0.015
 garden-0.015
 forest-0.015
 outdoors-0.015
c-0.015
Token,
Token ID11
Feature activation-3.73e+00

Pos logit contributions
 understanding+0.126
 accomplishment+0.125
 apology+0.124
iously+0.119
Less+0.117
Neg logit contributions
 garden-0.069
 outside-0.067
 pretend-0.066
 outdoors-0.065
 forest-0.064
Token Mir
Token ID7381
Feature activation-7.44e+00

Pos logit contributions
 understanding+0.100
 accomplishment+0.100
iously+0.097
 apology+0.096
mmm+0.092
Neg logit contributions
 garden-0.085
 hall-0.085
 forest-0.084
 park-0.082
 outdoors-0.082
Tokena
Token ID64
Feature activation-4.27e+00

Pos logit contributions
 accomplishment+0.250
 understanding+0.227
 apology+0.222
 Deal+0.220
iously+0.218
Neg logit contributions
c-0.147
 forest-0.123
 met-0.119
 garden-0.119
 hall-0.116
Token and
Token ID290
Feature activation-4.01e+00

Pos logit contributions
 understanding+0.123
 accomplishment+0.121
 apology+0.117
iously+0.116
mmm+0.112
Neg logit contributions
 hall-0.090
 garden-0.090
 forest-0.088
 outdoors-0.088
c-0.087
Token R
Token ID371
Feature activation-5.48e+00

Pos logit contributions
 accomplishment+0.112
 understanding+0.111
 apology+0.109
iously+0.109
mmm+0.104
Neg logit contributions
 garden-0.089
 forest-0.087
 hall-0.085
 park-0.085
 hunt-0.084
Tokenavi
Token ID15820
Feature activation-2.31e+00

Pos logit contributions
 accomplishment+0.130
 understanding+0.128
 apology+0.117
iously+0.113
 efforts+0.111
Neg logit contributions
 forest-0.145
 garden-0.143
c-0.142
 park-0.141
 outdoors-0.140
Token."
Token ID526
Feature activation-2.08e+00

Pos logit contributions
 understanding+0.067
 accomplishment+0.065
 apology+0.064
iously+0.063
mmm+0.060
Neg logit contributions
 hall-0.048
 garden-0.048
 outdoors-0.047
 forest-0.047
 woods-0.046
Token You
Token ID921
Feature activation+1.90e+00

Pos logit contributions
 hall+0.007
 garden+0.007
 forest+0.007
 outdoors+0.007
 hunt+0.007
Neg logit contributions
 understanding-0.008
 accomplishment-0.008
 apology-0.008
iously-0.007
mmm-0.007
Token are
Token ID389
Feature activation-1.11e+00

Pos logit contributions
 hall+0.039
 fields+0.039
 woods+0.039
 sandbox+0.038
 garden+0.038
Neg logit contributions
 understanding-0.056
 apology-0.053
 accomplishment-0.053
 efforts-0.052
mmm-0.050
Token too
Token ID1165
Feature activation-1.77e+00

Pos logit contributions
 understanding+0.030
 accomplishment+0.029
 apology+0.027
iously+0.027
mmm+0.026
Neg logit contributions
 hall-0.026
 garden-0.026
 forest-0.026
c-0.026
 outdoors-0.026
Token young
Token ID1862
Feature activation-2.74e+00

Pos logit contributions
 accomplishment+0.050
 apology+0.050
iously+0.049
 understanding+0.047
mmm+0.046
Neg logit contributions
 garden-0.038
 outdoors-0.038
 hall-0.038
 forest-0.036
 outside-0.036
Token!"
Token ID2474
Feature activation-9.62e-01

Pos logit contributions
 understanding+0.080
 accomplishment+0.078
 apology+0.075
iously+0.075
 efforts+0.072
Neg logit contributions
 hall-0.057
 garden-0.056
 forest-0.056
c-0.055
 outdoors-0.055
Token Fre
Token ID4848
Feature activation-7.16e+00

Pos logit contributions
 understanding+0.031
 accomplishment+0.031
 apology+0.030
iously+0.029
mmm+0.028
Neg logit contributions
 garden-0.017
 hall-0.017
 forest-0.017
 outdoors-0.017
c-0.016
Tokenya
Token ID3972
Feature activation-2.14e+00

Pos logit contributions
 Deal+0.171
 understanding+0.155
 apology+0.153
omo+0.153
 accomplishment+0.152
Neg logit contributions
 outdoors-0.174
 forest-0.170
 garden-0.167
 land-0.166
c-0.165
Token was
Token ID373
Feature activation-2.25e+00

Pos logit contributions
 accomplishment+0.068
 understanding+0.067
 apology+0.066
iously+0.065
mmm+0.064
Neg logit contributions
 garden-0.037
c-0.037
 hall-0.037
 forest-0.036
 met-0.035
Token sad
Token ID6507
Feature activation-2.36e+00

Pos logit contributions
 understanding+0.055
 accomplishment+0.054
 apology+0.051
iously+0.051
mmm+0.048
Neg logit contributions
 hall-0.057
 garden-0.057
 forest-0.057
 outdoors-0.056
c-0.056
Token but
Token ID475
Feature activation-3.26e+00

Pos logit contributions
 accomplishment+0.074
 apology+0.072
 understanding+0.072
iously+0.071
mmm+0.070
Neg logit contributions
 garden-0.043
 outdoors-0.042
 forest-0.042
 outside-0.041
 hall-0.041
Token she
Token ID673
Feature activation-1.03e+00

Pos logit contributions
 accomplishment+0.101
 apology+0.100
mmm+0.098
iously+0.098
 understanding+0.096
Neg logit contributions
 garden-0.061
 forest-0.059
 hall-0.058
 park-0.058
 outdoors-0.058
Token that
Token ID326
Feature activation+7.95e-01

Pos logit contributions
 understanding+0.013
 accomplishment+0.013
 apology+0.012
iously+0.012
mmm+0.012
Neg logit contributions
 hall-0.010
c-0.010
 forest-0.010
 garden-0.010
 outdoors-0.009
Token!"
Token ID2474
Feature activation-1.38e+00

Pos logit contributions
 hall+0.018
 sandbox+0.017
 forest+0.017
 woods+0.017
 outdoors+0.017
Neg logit contributions
 understanding-0.021
 accomplishment-0.021
 apology-0.020
mmm-0.020
iously-0.019
Token
Token ID220
Feature activation-2.19e+00

Pos logit contributions
 accomplishment+0.049
 understanding+0.048
iously+0.047
 apology+0.047
mmm+0.045
Neg logit contributions
 garden-0.020
 hall-0.020
 forest-0.020
 outdoors-0.019
c-0.019
Token
Token ID198
Feature activation-2.01e+00

Pos logit contributions
 understanding+0.074
 accomplishment+0.073
 apology+0.071
iously+0.071
mmm+0.068
Neg logit contributions
 garden-0.035
 hall-0.035
 forest-0.034
 outdoors-0.034
c-0.033
Token
Token ID198
Feature activation-3.09e+00

Pos logit contributions
 accomplishment+0.071
 understanding+0.071
iously+0.069
 apology+0.068
 efforts+0.066
Neg logit contributions
 garden-0.030
 forest-0.029
 outdoors-0.029
 hall-0.029
 park-0.028
TokenCharl
Token ID24453
Feature activation-7.10e+00

Pos logit contributions
 accomplishment+0.089
iously+0.088
 apology+0.086
 understanding+0.085
 efforts+0.080
Neg logit contributions
 garden-0.068
 outdoors-0.068
 forest-0.066
 park-0.066
 hall-0.064
Tokenotte
Token ID11404
Feature activation-2.46e+00

Pos logit contributions
 apology+0.191
 accomplishment+0.186
 understanding+0.183
mmm+0.166
 efforts+0.163
Neg logit contributions
 outdoors-0.163
 outside-0.156
 hall-0.156
 met-0.152
c-0.150
Token thanked
Token ID26280
Feature activation-2.91e+00

Pos logit contributions
 accomplishment+0.079
 understanding+0.078
 apology+0.077
iously+0.074
mmm+0.074
Neg logit contributions
 garden-0.043
 hall-0.041
 forest-0.041
c-0.041
 park-0.041
Token her
Token ID607
Feature activation-4.90e+00

Pos logit contributions
iously+0.114
aring+0.112
 cents+0.112
mmm+0.111
olding+0.111
Neg logit contributions
 outside-0.034
 garden-0.034
 outdoors-0.031
 park-0.030
 forest-0.029
Token mother
Token ID2802
Feature activation-4.87e+00

Pos logit contributions
mmm+0.120
iously+0.113
aring+0.102
oor+0.100
 accomplishment+0.100
Neg logit contributions
 garden-0.150
 forest-0.134
 box-0.133
 backyard-0.131
 outside-0.131
Token for
Token ID329
Feature activation-1.35e+00

Pos logit contributions
 accomplishment+0.154
 understanding+0.152
mmm+0.151
 apology+0.150
iously+0.148
Neg logit contributions
 garden-0.088
 outside-0.085
c-0.083
 outdoors-0.082
 hall-0.082
Token when
Token ID618
Feature activation-2.56e+00

Pos logit contributions
 apology+0.064
 accomplishment+0.063
 understanding+0.063
iously+0.060
 Deal+0.059
Neg logit contributions
 garden-0.044
 forest-0.043
 outdoors-0.043
 hall-0.042
 woods-0.042
Token she
Token ID673
Feature activation+1.19e-01

Pos logit contributions
 understanding+0.076
 apology+0.075
 accomplishment+0.074
iously+0.072
mmm+0.069
Neg logit contributions
 garden-0.053
 forest-0.053
 hall-0.052
 outdoors-0.051
 woods-0.051
Token was
Token ID373
Feature activation-3.47e-01

Pos logit contributions
 hall+0.002
 garden+0.002
 outdoors+0.002
 forest+0.002
c+0.002
Neg logit contributions
 understanding-0.004
 accomplishment-0.004
iously-0.004
 apology-0.004
mmm-0.004
Token done
Token ID1760
Feature activation-9.19e-01

Pos logit contributions
 understanding+0.008
 accomplishment+0.008
iously+0.008
 apology+0.008
mmm+0.008
Neg logit contributions
 garden-0.009
 hall-0.009
 outdoors-0.009
 forest-0.009
 park-0.009
Token,
Token ID11
Feature activation-2.72e+00

Pos logit contributions
 understanding+0.026
 accomplishment+0.026
iously+0.024
 apology+0.024
mmm+0.023
Neg logit contributions
 forest-0.020
c-0.020
 garden-0.020
 hall-0.020
 woods-0.020
Token Fre
Token ID4848
Feature activation-6.96e+00

Pos logit contributions
 understanding+0.082
 accomplishment+0.082
 apology+0.080
iously+0.078
mmm+0.076
Neg logit contributions
 garden-0.054
 forest-0.053
 hall-0.052
 outdoors-0.052
 woods-0.051
Tokenya
Token ID3972
Feature activation-2.39e+00

Pos logit contributions
 Deal+0.161
 apology+0.155
 understanding+0.149
omo+0.148
 accomplishment+0.144
Neg logit contributions
 outdoors-0.178
 forest-0.173
 garden-0.172
 land-0.171
 outside-0.166
Token realized
Token ID6939
Feature activation-1.23e+00

Pos logit contributions
 understanding+0.074
 accomplishment+0.073
iously+0.071
 apology+0.070
 efforts+0.067
Neg logit contributions
 hall-0.045
 outdoors-0.044
 garden-0.044
 forest-0.044
c-0.043
Token that
Token ID326
Feature activation-3.97e-01

Pos logit contributions
 understanding+0.034
 accomplishment+0.034
iously+0.032
 apology+0.032
mmm+0.031
Neg logit contributions
 hall-0.027
 garden-0.027
 forest-0.027
 outdoors-0.027
c-0.026
Token the
Token ID262
Feature activation-3.68e-01

Pos logit contributions
 understanding+0.011
 accomplishment+0.011
 apology+0.010
 efforts+0.010
iously+0.009
Neg logit contributions
 hunt-0.009
tub-0.009
 pool-0.009
c-0.009
 hall-0.009
Token stair
Token ID18761
Feature activation+2.96e+00

Pos logit contributions
 understanding+0.013
 accomplishment+0.013
iously+0.013
 apology+0.013
mmm+0.012
Neg logit contributions
 garden-0.005
 hall-0.005
 forest-0.005
 outdoors-0.005
 park-0.005
Token too
Token ID1165
Feature activation-8.78e-01

Pos logit contributions
 understanding+0.020
 accomplishment+0.020
 apology+0.020
iously+0.020
mmm+0.019
Neg logit contributions
 garden-0.016
 hall-0.016
 forest-0.016
 outdoors-0.016
c-0.015
Token dangerous
Token ID4923
Feature activation-2.18e+00

Pos logit contributions
iously+0.024
 apology+0.024
 accomplishment+0.024
 understanding+0.024
mmm+0.023
Neg logit contributions
 garden-0.020
 hall-0.020
 outdoors-0.019
 outside-0.019
 forest-0.019
Token to
Token ID284
Feature activation+7.07e-02

Pos logit contributions
 understanding+0.055
 accomplishment+0.054
iously+0.054
 apology+0.050
 efforts+0.048
Neg logit contributions
c-0.054
 hall-0.054
 garden-0.053
 sandbox-0.053
 forest-0.052
Token use
Token ID779
Feature activation+3.42e-01

Pos logit contributions
 hall+0.002
 garden+0.002
c+0.001
 forest+0.001
 woods+0.001
Neg logit contributions
 understanding-0.002
 accomplishment-0.002
 apology-0.002
iously-0.002
mmm-0.002
Token.
Token ID13
Feature activation-1.03e+00

Pos logit contributions
 hall+0.007
 forest+0.007
 hunt+0.007
 garden+0.007
 pool+0.007
Neg logit contributions
 understanding-0.010
 accomplishment-0.010
 apology-0.009
iously-0.009
 efforts-0.009
Token Fre
Token ID4848
Feature activation-6.92e+00

Pos logit contributions
 accomplishment+0.030
 understanding+0.030
 apology+0.029
iously+0.028
mmm+0.027
Neg logit contributions
 garden-0.022
 forest-0.021
 hall-0.021
 outdoors-0.021
 woods-0.021
Tokenya
Token ID3972
Feature activation-2.38e+00

Pos logit contributions
 Deal+0.166
 apology+0.157
omo+0.155
 understanding+0.148
 accomplishment+0.146
Neg logit contributions
 outdoors-0.170
 forest-0.163
 land-0.162
 garden-0.161
 met-0.157
Token was
Token ID373
Feature activation-2.32e+00

Pos logit contributions
 accomplishment+0.076
 understanding+0.075
 apology+0.073
iously+0.072
mmm+0.071
Neg logit contributions
 garden-0.042
 hall-0.042
 forest-0.042
c-0.041
 park-0.040
Token very
Token ID845
Feature activation-3.07e+00

Pos logit contributions
 accomplishment+0.056
 understanding+0.055
 apology+0.054
iously+0.054
mmm+0.052
Neg logit contributions
 garden-0.059
 hall-0.059
 forest-0.058
 outdoors-0.058
c-0.058
Token disappointed
Token ID11679
Feature activation-2.43e+00

Pos logit contributions
mmm+0.081
iously+0.079
 accomplishment+0.075
 Deal+0.071
 apology+0.071
Neg logit contributions
 garden-0.073
 outside-0.070
 met-0.070
 outdoors-0.069
 hall-0.068
Token and
Token ID290
Feature activation-2.66e+00

Pos logit contributions
 accomplishment+0.075
 understanding+0.074
 apology+0.073
iously+0.072
mmm+0.070
Neg logit contributions
 garden-0.046
 outdoors-0.045
 forest-0.045
 hall-0.044
 park-0.043
Token impressed
Token ID12617
Feature activation-3.20e+00

Pos logit contributions
iously+0.095
 apology+0.091
mmm+0.088
 accomplishment+0.088
 Deal+0.085
Neg logit contributions
 garden-0.112
 outdoors-0.110
 outside-0.107
 hall-0.107
tub-0.106
Token.
Token ID13
Feature activation-1.25e+00

Pos logit contributions
 understanding+0.099
 accomplishment+0.098
 apology+0.094
iously+0.094
mmm+0.091
Neg logit contributions
 hall-0.061
 garden-0.061
 forest-0.060
 outdoors-0.059
c-0.059
Token
Token ID220
Feature activation-1.10e+00

Pos logit contributions
 accomplishment+0.040
 understanding+0.040
 apology+0.039
iously+0.039
mmm+0.037
Neg logit contributions
 garden-0.022
 hall-0.022
 forest-0.022
c-0.021
 outdoors-0.021
Token
Token ID198
Feature activation-1.67e+00

Pos logit contributions
 understanding+0.031
 accomplishment+0.031
mmm+0.030
 apology+0.029
iously+0.029
Neg logit contributions
 hunt-0.023
 hall-0.023
 forest-0.023
 garden-0.022
 sandbox-0.022
Token
Token ID198
Feature activation-2.84e+00

Pos logit contributions
 accomplishment+0.058
 understanding+0.058
iously+0.056
 apology+0.056
 efforts+0.054
Neg logit contributions
 garden-0.025
 hall-0.025
 forest-0.025
 outdoors-0.024
c-0.024
TokenAng
Token ID13450
Feature activation-6.90e+00

Pos logit contributions
 accomplishment+0.076
iously+0.075
 apology+0.075
 understanding+0.071
 efforts+0.068
Neg logit contributions
 forest-0.066
 garden-0.066
 outdoors-0.066
c-0.065
 hall-0.065
Tokenie
Token ID494
Feature activation-1.83e+00

Pos logit contributions
 Deal+0.232
 understanding+0.223
 accomplishment+0.220
 apology+0.218
mmm+0.217
Neg logit contributions
c-0.112
 hall-0.103
 paints-0.097
 forest-0.097
 garden-0.097
Token was
Token ID373
Feature activation-3.11e+00

Pos logit contributions
 understanding+0.059
 accomplishment+0.059
 apology+0.057
iously+0.057
mmm+0.055
Neg logit contributions
 garden-0.032
 hall-0.031
 forest-0.031
 outdoors-0.031
c-0.030
Token so
Token ID523
Feature activation-3.75e+00

Pos logit contributions
 understanding+0.074
 accomplishment+0.074
 apology+0.071
iously+0.070
mmm+0.067
Neg logit contributions
 garden-0.081
 hall-0.081
 forest-0.080
 outdoors-0.079
c-0.078
Token proud
Token ID6613
Feature activation-3.27e+00

Pos logit contributions
iously+0.086
 apology+0.083
 accomplishment+0.080
mmm+0.077
 efforts+0.075
Neg logit contributions
 garden-0.104
 outdoors-0.103
 outside-0.100
 hall-0.100
 park-0.099
Token of
Token ID286
Feature activation-2.28e+00

Pos logit contributions
 understanding+0.080
 accomplishment+0.079
 apology+0.076
iously+0.076
mmm+0.072
Neg logit contributions
 garden-0.083
 hall-0.083
 forest-0.083
 outdoors-0.082
c-0.081
Token foot
Token ID2366
Feature activation+3.25e-01

Pos logit contributions
 understanding+0.018
 accomplishment+0.017
 apology+0.017
iously+0.017
 efforts+0.016
Neg logit contributions
 hall-0.014
 garden-0.014
 forest-0.014
 outdoors-0.014
c-0.014
Token.
Token ID13
Feature activation-6.53e-01

Pos logit contributions
 hall+0.007
 forest+0.007
 garden+0.007
 outdoors+0.006
 woods+0.006
Neg logit contributions
 understanding-0.010
 accomplishment-0.009
iously-0.009
 apology-0.009
 efforts-0.009
Token
Token ID220
Feature activation-5.55e-01

Pos logit contributions
 accomplishment+0.021
 understanding+0.021
 apology+0.020
iously+0.020
mmm+0.019
Neg logit contributions
 garden-0.012
 hall-0.011
 forest-0.011
 outdoors-0.011
 woods-0.011
Token
Token ID198
Feature activation-1.44e+00

Pos logit contributions
 understanding+0.017
 accomplishment+0.017
 apology+0.017
iously+0.017
mmm+0.016
Neg logit contributions
 hall-0.010
 forest-0.010
 garden-0.010
 hunt-0.010
 outdoors-0.010
Token
Token ID198
Feature activation-2.56e+00

Pos logit contributions
 accomplishment+0.054
iously+0.053
 understanding+0.052
 apology+0.052
 efforts+0.050
Neg logit contributions
 garden-0.019
 outdoors-0.019
 hall-0.018
 forest-0.018
 park-0.018
TokenMar
Token ID7676
Feature activation-6.85e+00

Pos logit contributions
 accomplishment+0.075
 apology+0.072
iously+0.071
 understanding+0.067
 efforts+0.067
Neg logit contributions
 garden-0.057
 outdoors-0.057
 hall-0.055
 forest-0.055
 park-0.054
Tokenina
Token ID1437
Feature activation-1.98e+00

Pos logit contributions
 accomplishment+0.196
 understanding+0.195
 apology+0.193
mmm+0.186
 efforts+0.179
Neg logit contributions
c-0.153
 hall-0.142
 garden-0.142
 forest-0.138
 outdoors-0.137
Token's
Token ID338
Feature activation-2.32e+00

Pos logit contributions
 accomplishment+0.063
 understanding+0.062
 apology+0.061
iously+0.060
mmm+0.059
Neg logit contributions
 garden-0.036
 hall-0.036
 forest-0.035
c-0.035
 outdoors-0.034
Token mom
Token ID1995
Feature activation-2.15e+00

Pos logit contributions
iously+0.047
 understanding+0.047
mmm+0.047
 accomplishment+0.046
 apology+0.044
Neg logit contributions
 garden-0.069
 hall-0.068
 forest-0.067
 park-0.066
 pool-0.066
Token said
Token ID531
Feature activation-2.15e+00

Pos logit contributions
 understanding+0.066
 accomplishment+0.065
 apology+0.063
iously+0.063
mmm+0.060
Neg logit contributions
 hall-0.042
 garden-0.042
 forest-0.041
 outdoors-0.041
c-0.040
Token,
Token ID11
Feature activation-3.24e+00

Pos logit contributions
 accomplishment+0.064
 understanding+0.064
 apology+0.062
iously+0.061
mmm+0.059
Neg logit contributions
 garden-0.042
 hall-0.042
 forest-0.042
 outdoors-0.042
c-0.041
Token Chicken
Token ID16405
Feature activation-3.96e-01

Pos logit contributions
 accomplishment+0.086
 understanding+0.085
 apology+0.083
iously+0.081
mmm+0.079
Neg logit contributions
 garden-0.081
 forest-0.081
 hall-0.080
 outdoors-0.080
c-0.079
Token were
Token ID547
Feature activation-2.11e+00

Pos logit contributions
 understanding+0.012
 accomplishment+0.011
iously+0.011
 efforts+0.011
 apology+0.011
Neg logit contributions
 hall-0.008
 outdoors-0.008
c-0.008
 forest-0.008
 garden-0.008
Token happy
Token ID3772
Feature activation-3.84e+00

Pos logit contributions
 understanding+0.051
 accomplishment+0.049
iously+0.045
 apology+0.044
 efforts+0.043
Neg logit contributions
 hall-0.057
 hunt-0.056
 forest-0.056
 garden-0.055
c-0.055
Token and
Token ID290
Feature activation-3.03e+00

Pos logit contributions
 understanding+0.121
 accomplishment+0.119
iously+0.114
 apology+0.113
 efforts+0.109
Neg logit contributions
 hall-0.073
 garden-0.072
 forest-0.071
c-0.071
 outdoors-0.070
Token thanked
Token ID26280
Feature activation-3.15e+00

Pos logit contributions
 understanding+0.076
 accomplishment+0.075
 apology+0.072
iously+0.072
mmm+0.068
Neg logit contributions
 hall-0.075
 garden-0.075
 forest-0.074
 outdoors-0.074
c-0.073
Token Farmer
Token ID37755
Feature activation-6.82e+00

Pos logit contributions
 accomplishment+0.091
 understanding+0.090
 apology+0.087
iously+0.087
 efforts+0.082
Neg logit contributions
 forest-0.067
 garden-0.067
 outdoors-0.064
 park-0.064
 woods-0.064
Token Brown
Token ID4373
Feature activation-5.27e+00

Pos logit contributions
 understanding+0.203
 accomplishment+0.203
 apology+0.195
iously+0.190
mmm+0.187
Neg logit contributions
 forest-0.134
 garden-0.134
 hall-0.133
c-0.132
 outdoors-0.131
Token.
Token ID13
Feature activation-1.93e+00

Pos logit contributions
 understanding+0.146
 accomplishment+0.145
 apology+0.138
iously+0.137
mmm+0.135
Neg logit contributions
c-0.116
 garden-0.115
 forest-0.115
 hall-0.114
 outdoors-0.113
Token They
Token ID1119
Feature activation-4.72e-01

Pos logit contributions
 accomplishment+0.066
 understanding+0.065
 apology+0.064
iously+0.063
 efforts+0.061
Neg logit contributions
 garden-0.033
 forest-0.033
 fields-0.032
 woods-0.032
 park-0.030
Token brought
Token ID3181
Feature activation+8.80e-01

Pos logit contributions
 accomplishment+0.014
 understanding+0.014
 apology+0.014
iously+0.014
mmm+0.014
Neg logit contributions
 garden-0.009
 forest-0.009
 hall-0.009
 park-0.009
 met-0.009
Token the
Token ID262
Feature activation+6.01e-02

Pos logit contributions
 hall+0.020
tub+0.018
 hunt+0.018
 forest+0.018
 garden+0.018
Neg logit contributions
 understanding-0.025
 accomplishment-0.025
 apology-0.024
iously-0.024
 efforts-0.023
Token "
Token ID366
Feature activation-9.98e-01

Pos logit contributions
 understanding+0.089
 accomplishment+0.088
 apology+0.085
iously+0.084
mmm+0.081
Neg logit contributions
 hall-0.070
 garden-0.070
 forest-0.069
 outdoors-0.068
c-0.068
TokenDon
Token ID3987
Feature activation-1.94e+00

Pos logit contributions
 understanding+0.026
 accomplishment+0.024
 apology+0.024
iously+0.023
mmm+0.022
Neg logit contributions
 hall-0.025
 garden-0.025
 outdoors-0.024
 hunt-0.024
 forest-0.024
Token't
Token ID470
Feature activation-2.25e+00

Pos logit contributions
 understanding+0.070
 accomplishment+0.068
iously+0.067
 apology+0.065
 efforts+0.064
Neg logit contributions
 hall-0.029
 forest-0.029
 park-0.028
 garden-0.028
 outside-0.027
Token worry
Token ID5490
Feature activation-4.72e+00

Pos logit contributions
 accomplishment+0.046
iously+0.041
 apology+0.041
 understanding+0.041
mmm+0.038
Neg logit contributions
 garden-0.069
 hall-0.067
 pretend-0.067
 hunt-0.067
 outdoors-0.067
Token,
Token ID11
Feature activation-4.96e+00

Pos logit contributions
 understanding+0.160
 accomplishment+0.160
 apology+0.153
iously+0.146
Less+0.144
Neg logit contributions
 garden-0.077
 hall-0.075
 forest-0.075
 outside-0.074
 outdoors-0.071
Token Mir
Token ID7381
Feature activation-6.81e+00

Pos logit contributions
 accomplishment+0.144
 understanding+0.141
 apology+0.138
iously+0.134
 efforts+0.129
Neg logit contributions
 garden-0.106
 hall-0.104
 forest-0.103
 park-0.101
 pretend-0.099
Tokena
Token ID64
Feature activation-3.68e+00

Pos logit contributions
 accomplishment+0.229
 understanding+0.219
 apology+0.211
iously+0.208
 Deal+0.205
Neg logit contributions
c-0.129
 forest-0.112
 hall-0.110
 garden-0.110
 outdoors-0.106
Token.
Token ID13
Feature activation-1.14e+00

Pos logit contributions
 understanding+0.105
 accomplishment+0.103
 apology+0.099
iously+0.098
 efforts+0.094
Neg logit contributions
 hall-0.079
 forest-0.078
 garden-0.078
 outdoors-0.078
 woods-0.077
Token We
Token ID775
Feature activation+8.44e-01

Pos logit contributions
 understanding+0.025
 apology+0.025
mmm+0.024
 accomplishment+0.024
iously+0.024
Neg logit contributions
 hunt-0.031
 outdoors-0.031
 forest-0.030
 hall-0.030
 park-0.029
Token can
Token ID460
Feature activation-8.44e-01

Pos logit contributions
 fields+0.016
 hall+0.016
 woods+0.015
tub+0.015
 mountains+0.015
Neg logit contributions
 understanding-0.026
 apology-0.025
 accomplishment-0.025
 efforts-0.024
olding-0.023
Token buy
Token ID2822
Feature activation+1.14e+00

Pos logit contributions
 understanding+0.024
 accomplishment+0.024
 apology+0.023
mmm+0.022
iously+0.022
Neg logit contributions
 hall-0.018
 woods-0.018
c-0.018
 fields-0.018
 forest-0.017
Token feel
Token ID1254
Feature activation-2.53e+00

Pos logit contributions
 understanding+0.060
 accomplishment+0.056
 apology+0.056
iously+0.054
 efforts+0.052
Neg logit contributions
 forest-0.048
 woods-0.048
 hall-0.048
 mountains-0.047
 fields-0.046
Token better
Token ID1365
Feature activation-2.99e+00

Pos logit contributions
 understanding+0.065
 accomplishment+0.064
iously+0.062
 apology+0.062
mmm+0.059
Neg logit contributions
 garden-0.061
 hall-0.060
 forest-0.060
 outdoors-0.060
c-0.059
Token.
Token ID13
Feature activation-4.01e-01

Pos logit contributions
 understanding+0.094
 accomplishment+0.093
 apology+0.090
iously+0.090
mmm+0.087
Neg logit contributions
 garden-0.055
 hall-0.055
 forest-0.054
 outdoors-0.054
c-0.053
Token Sally
Token ID25737
Feature activation-3.38e-01

Pos logit contributions
 accomplishment+0.013
 understanding+0.013
 apology+0.013
iously+0.013
 efforts+0.012
Neg logit contributions
 garden-0.007
 forest-0.007
 woods-0.006
 park-0.006
 hall-0.006
Token thanked
Token ID26280
Feature activation-2.26e+00

Pos logit contributions
 understanding+0.010
 accomplishment+0.010
iously+0.010
 apology+0.010
mmm+0.009
Neg logit contributions
 hall-0.007
 outdoors-0.007
 garden-0.007
 forest-0.007
c-0.006
Token Doctor
Token ID9356
Feature activation-6.78e+00

Pos logit contributions
 accomplishment+0.071
 understanding+0.071
 apology+0.068
iously+0.068
 efforts+0.065
Neg logit contributions
 garden-0.041
 forest-0.041
 park-0.040
 outdoors-0.040
 hall-0.039
Token Bob
Token ID5811
Feature activation-4.27e+00

Pos logit contributions
 accomplishment+0.185
 understanding+0.185
 apology+0.178
iously+0.173
mmm+0.169
Neg logit contributions
 garden-0.151
 forest-0.150
 hall-0.150
 outdoors-0.147
 park-0.146
Token and
Token ID290
Feature activation-2.13e+00

Pos logit contributions
 understanding+0.106
 accomplishment+0.105
 apology+0.101
iously+0.100
mmm+0.096
Neg logit contributions
 garden-0.106
 hall-0.106
 forest-0.105
 outdoors-0.104
c-0.103
Token went
Token ID1816
Feature activation-1.25e+00

Pos logit contributions
 accomplishment+0.058
 understanding+0.058
 apology+0.058
iously+0.057
mmm+0.055
Neg logit contributions
 garden-0.047
 forest-0.046
 hall-0.045
 park-0.045
 hunt-0.044
Token home
Token ID1363
Feature activation-2.56e+00

Pos logit contributions
 understanding+0.037
 accomplishment+0.036
 apology+0.035
iously+0.034
 efforts+0.032
Neg logit contributions
 forest-0.025
 hall-0.025
 garden-0.025
c-0.025
 outdoors-0.024
Token with
Token ID351
Feature activation-4.59e-01

Pos logit contributions
 accomplishment+0.082
 understanding+0.082
 apology+0.081
iously+0.081
mmm+0.079
Neg logit contributions
 hall-0.044
 garden-0.044
 outdoors-0.043
 forest-0.042
 outside-0.042
Token was
Token ID373
Feature activation-2.15e+00

Pos logit contributions
 outdoors+0.004
 hall+0.003
 pool+0.003
 forest+0.003
tub+0.003
Neg logit contributions
 understanding-0.007
 accomplishment-0.007
iously-0.007
 apology-0.006
 efforts-0.006
Token so
Token ID523
Feature activation-2.97e+00

Pos logit contributions
 understanding+0.054
 accomplishment+0.054
 apology+0.051
iously+0.051
mmm+0.049
Neg logit contributions
 hall-0.053
 garden-0.053
 forest-0.053
 outdoors-0.052
c-0.052
Token happy
Token ID3772
Feature activation-2.51e+00

Pos logit contributions
iously+0.077
 accomplishment+0.076
 apology+0.073
 understanding+0.071
mmm+0.071
Neg logit contributions
 garden-0.072
 hall-0.069
 forest-0.069
 outdoors-0.069
 met-0.069
Token and
Token ID290
Feature activation-2.83e+00

Pos logit contributions
 understanding+0.073
 accomplishment+0.071
iously+0.069
 apology+0.066
 efforts+0.065
Neg logit contributions
 hall-0.054
c-0.053
 garden-0.052
 forest-0.052
 outdoors-0.051
Token thanked
Token ID26280
Feature activation-2.70e+00

Pos logit contributions
 apology+0.076
 accomplishment+0.075
 Deal+0.074
 understanding+0.074
iously+0.074
Neg logit contributions
 garden-0.066
 forest-0.063
 hall-0.063
 met-0.063
 hunt-0.062
Token Gig
Token ID19525
Feature activation-6.77e+00

Pos logit contributions
 accomplishment+0.071
 understanding+0.070
 apology+0.068
iously+0.066
 efforts+0.065
Neg logit contributions
 garden-0.066
 forest-0.066
 park-0.064
 outdoors-0.063
 fields-0.063
Tokeni
Token ID72
Feature activation-5.42e+00

Pos logit contributions
 accomplishment+0.174
 understanding+0.172
 apology+0.167
 Deal+0.153
 efforts+0.150
Neg logit contributions
c-0.176
 forest-0.174
 garden-0.171
 woods-0.165
 fields-0.162
Token and
Token ID290
Feature activation-3.08e+00

Pos logit contributions
 accomplishment+0.148
 understanding+0.147
 apology+0.142
iously+0.139
mmm+0.139
Neg logit contributions
 garden-0.121
c-0.120
 hall-0.119
 forest-0.118
 outdoors-0.118
Token Duck
Token ID21867
Feature activation-3.17e+00

Pos logit contributions
iously+0.092
 Deal+0.092
 apology+0.091
mmm+0.091
 accomplishment+0.090
Neg logit contributions
 garden-0.062
 forest-0.060
 met-0.060
 hall-0.058
 hunt-0.058
Tokenie
Token ID494
Feature activation-3.66e+00

Pos logit contributions
 understanding+0.080
 accomplishment+0.079
 apology+0.076
iously+0.076
mmm+0.071
Neg logit contributions
 hall-0.078
 garden-0.078
 forest-0.077
 outdoors-0.077
c-0.075
Token.
Token ID13
Feature activation-9.95e-01

Pos logit contributions
mmm+0.113
 understanding+0.113
 accomplishment+0.112
 apology+0.110
iously+0.109
Neg logit contributions
 garden-0.068
 outside-0.066
 forest-0.066
 hall-0.065
c-0.065
Token,
Token ID11
Feature activation-1.91e+00

Pos logit contributions
 understanding+0.101
 accomplishment+0.100
 apology+0.096
iously+0.095
mmm+0.091
Neg logit contributions
 hall-0.070
 garden-0.070
 forest-0.069
 outdoors-0.068
c-0.067
Token give
Token ID1577
Feature activation+9.25e-02

Pos logit contributions
 understanding+0.053
 accomplishment+0.051
 apology+0.051
iously+0.050
mmm+0.047
Neg logit contributions
 outdoors-0.043
c-0.043
 garden-0.042
 hall-0.042
 forest-0.042
Token it
Token ID340
Feature activation-4.26e-01

Pos logit contributions
tub+0.002
 hall+0.002
 garden+0.002
 outdoors+0.002
 sandbox+0.002
Neg logit contributions
 understanding-0.002
iously-0.002
 accomplishment-0.002
 apology-0.002
mmm-0.002
Token back
Token ID736
Feature activation-9.68e-01

Pos logit contributions
 understanding+0.012
 accomplishment+0.012
 apology+0.012
iously+0.011
mmm+0.011
Neg logit contributions
 hall-0.009
 garden-0.009
 forest-0.009
 outdoors-0.009
 woods-0.009
Token!"
Token ID2474
Feature activation-1.67e+00

Pos logit contributions
 understanding+0.024
iously+0.023
 accomplishment+0.023
 apology+0.022
mmm+0.021
Neg logit contributions
 hall-0.024
 garden-0.024
 forest-0.024
tub-0.024
 outdoors-0.023
Token Mir
Token ID7381
Feature activation-6.75e+00

Pos logit contributions
 understanding+0.043
 accomplishment+0.041
iously+0.041
 apology+0.040
mmm+0.039
Neg logit contributions
 outdoors-0.041
c-0.040
 garden-0.040
 hall-0.039
 forest-0.039
Tokena
Token ID64
Feature activation-2.71e+00

Pos logit contributions
 accomplishment+0.244
 understanding+0.225
mmm+0.218
 apology+0.215
 efforts+0.214
Neg logit contributions
c-0.112
 forest-0.098
 met-0.097
 garden-0.095
 hall-0.094
Token said
Token ID531
Feature activation-4.11e+00

Pos logit contributions
 accomplishment+0.084
 understanding+0.080
 apology+0.078
mmm+0.076
iously+0.076
Neg logit contributions
 hall-0.053
c-0.053
 garden-0.052
 met-0.052
 forest-0.052
Token.
Token ID13
Feature activation-8.66e-01

Pos logit contributions
 accomplishment+0.142
 understanding+0.140
mmm+0.137
 apology+0.136
iously+0.134
Neg logit contributions
 outside-0.062
 outdoors-0.060
 garden-0.059
 hall-0.058
c-0.057
Token "
Token ID366
Feature activation+1.63e+00

Pos logit contributions
 understanding+0.023
 accomplishment+0.021
iously+0.021
 apology+0.021
mmm+0.020
Neg logit contributions
c-0.020
 hall-0.020
 garden-0.020
 outdoors-0.020
 forest-0.020
TokenWe
Token ID1135
Feature activation+2.11e+00

Pos logit contributions
 garden+0.035
 hall+0.035
 forest+0.035
 outdoors+0.035
c+0.034
Neg logit contributions
 understanding-0.046
 accomplishment-0.045
 apology-0.044
iously-0.043
mmm-0.042
Token uncle
Token ID7711
Feature activation-4.62e+00

Pos logit contributions
 accomplishment+0.130
iously+0.127
 understanding+0.126
mmm+0.121
 apology+0.120
Neg logit contributions
 hall-0.077
 outdoors-0.077
 forest-0.077
 garden-0.077
 park-0.076
Token ever
Token ID1683
Feature activation-3.08e+00

Pos logit contributions
 understanding+0.107
 accomplishment+0.101
 apology+0.099
iously+0.098
mmm+0.092
Neg logit contributions
 hall-0.124
 garden-0.123
 forest-0.121
 outdoors-0.120
 woods-0.119
Token."
Token ID526
Feature activation-1.80e+00

Pos logit contributions
 accomplishment+0.097
 understanding+0.097
iously+0.093
 apology+0.093
 efforts+0.090
Neg logit contributions
 hall-0.057
 forest-0.057
 garden-0.056
 outdoors-0.056
c-0.056
Token
Token ID198
Feature activation-1.46e+00

Pos logit contributions
 accomplishment+0.064
 understanding+0.064
iously+0.063
 apology+0.062
 efforts+0.060
Neg logit contributions
 garden-0.027
 hall-0.026
 forest-0.026
 outdoors-0.025
 park-0.025
Token
Token ID198
Feature activation-2.99e+00

Pos logit contributions
 understanding+0.050
 accomplishment+0.050
iously+0.049
 apology+0.048
 efforts+0.047
Neg logit contributions
 garden-0.023
 hall-0.022
 forest-0.022
 outdoors-0.022
 park-0.022
TokenUn
Token ID3118
Feature activation-6.74e+00

Pos logit contributions
iously+0.093
 accomplishment+0.093
 understanding+0.088
 apology+0.088
 efforts+0.085
Neg logit contributions
 park-0.061
 garden-0.061
 forest-0.060
 hall-0.059
 woods-0.058
Tokencle
Token ID2375
Feature activation-3.89e+00

Pos logit contributions
 accomplishment+0.128
 understanding+0.122
iously+0.121
 apology+0.115
 efforts+0.114
Neg logit contributions
c-0.204
 hunt-0.190
 hall-0.190
 park-0.189
 block-0.188
Token Ben
Token ID3932
Feature activation-8.32e-01

Pos logit contributions
 understanding+0.092
 accomplishment+0.088
iously+0.085
 apology+0.084
mmm+0.080
Neg logit contributions
 outdoors-0.102
 hall-0.102
 garden-0.101
 forest-0.100
c-0.099
Token hugged
Token ID41130
Feature activation-2.96e+00

Pos logit contributions
 understanding+0.022
 accomplishment+0.021
 apology+0.020
iously+0.020
mmm+0.019
Neg logit contributions
 hall-0.020
 garden-0.020
 outdoors-0.020
 forest-0.019
c-0.019
Token Lily
Token ID20037
Feature activation-3.56e+00

Pos logit contributions
 accomplishment+0.094
 understanding+0.092
iously+0.089
mmm+0.088
 apology+0.087
Neg logit contributions
 forest-0.056
 garden-0.056
 hall-0.055
 park-0.055
 outdoors-0.054
Token and
Token ID290
Feature activation-2.86e+00

Pos logit contributions
 accomplishment+0.122
mmm+0.120
 understanding+0.120
iously+0.117
 apology+0.117
Neg logit contributions
 outside-0.056
 garden-0.056
c-0.055
 hall-0.053
 outdoors-0.053
Token and
Token ID290
Feature activation-2.77e+00

Pos logit contributions
 accomplishment+0.156
mmm+0.154
iously+0.152
 understanding+0.148
 apology+0.147
Neg logit contributions
 outside-0.070
 garden-0.064
 outdoors-0.062
 park-0.061
 hall-0.060
Token said
Token ID531
Feature activation-2.88e+00

Pos logit contributions
 accomplishment+0.079
iously+0.078
 understanding+0.077
 apology+0.076
mmm+0.076
Neg logit contributions
 garden-0.059
 park-0.057
 hall-0.057
 forest-0.057
 hunt-0.055
Token,
Token ID11
Feature activation-4.20e+00

Pos logit contributions
 accomplishment+0.088
 understanding+0.088
iously+0.085
 apology+0.084
mmm+0.082
Neg logit contributions
 garden-0.054
 hall-0.053
 outdoors-0.053
 forest-0.052
 outside-0.051
Token "
Token ID366
Feature activation-1.97e+00

Pos logit contributions
 accomplishment+0.129
 understanding+0.125
iously+0.124
 apology+0.123
 efforts+0.118
Neg logit contributions
 garden-0.082
 hall-0.081
 forest-0.078
 outdoors-0.077
 park-0.077
TokenGood
Token ID10248
Feature activation-3.53e+00

Pos logit contributions
 understanding+0.059
 accomplishment+0.058
 apology+0.056
iously+0.056
mmm+0.054
Neg logit contributions
 hall-0.040
 garden-0.040
 forest-0.039
 outdoors-0.039
c-0.038
Token girl
Token ID2576
Feature activation-6.73e+00

Pos logit contributions
iously+0.117
 accomplishment+0.117
 understanding+0.116
mmm+0.110
 efforts+0.108
Neg logit contributions
 garden-0.063
 park-0.062
 forest-0.061
 hall-0.059
 outside-0.059
Token,
Token ID11
Feature activation-4.97e+00

Pos logit contributions
 accomplishment+0.269
iously+0.256
 understanding+0.252
mmm+0.250
 apology+0.250
Neg logit contributions
 box-0.076
 garden-0.075
 outside-0.074
 living-0.071
c-0.069
Token Lily
Token ID20037
Feature activation-5.40e+00

Pos logit contributions
iously+0.148
 accomplishment+0.141
 understanding+0.140
oral+0.135
mmm+0.135
Neg logit contributions
 garden-0.102
 park-0.100
 pretend-0.098
 forest-0.096
 pool-0.095
Token.
Token ID13
Feature activation-2.09e+00

Pos logit contributions
 accomplishment+0.194
iously+0.188
 understanding+0.188
 apology+0.185
mmm+0.182
Neg logit contributions
 garden-0.083
 forest-0.079
 park-0.078
c-0.077
 hall-0.077
Token Good
Token ID4599
Feature activation-3.22e+00

Pos logit contributions
 understanding+0.051
 accomplishment+0.050
 apology+0.048
iously+0.047
mmm+0.046
Neg logit contributions
 hall-0.053
 forest-0.052
 hunt-0.052
 garden-0.052
 outdoors-0.052
Token boy
Token ID2933
Feature activation-5.08e+00

Pos logit contributions
 understanding+0.106
iously+0.106
 accomplishment+0.105
mmm+0.103
 apology+0.099
Neg logit contributions
 garden-0.058
 forest-0.058
 outside-0.056
 pool-0.055
 park-0.055
Token save
Token ID3613
Feature activation+6.26e-01

Pos logit contributions
 hall+0.017
tub+0.016
 outdoors+0.016
c+0.016
 garden+0.016
Neg logit contributions
 understanding-0.021
 accomplishment-0.021
 apology-0.020
iously-0.020
mmm-0.019
Token the
Token ID262
Feature activation-3.93e-01

Pos logit contributions
 hall+0.012
 garden+0.012
 forest+0.012
 outdoors+0.012
c+0.011
Neg logit contributions
 understanding-0.019
 accomplishment-0.019
 apology-0.019
iously-0.018
mmm-0.018
Token baby
Token ID5156
Feature activation-1.67e+00

Pos logit contributions
 understanding+0.012
 accomplishment+0.012
 apology+0.011
iously+0.011
 efforts+0.010
Neg logit contributions
tub-0.008
 hall-0.008
c-0.008
bowl-0.008
 outdoors-0.008
Token birds
Token ID10087
Feature activation-6.10e-01

Pos logit contributions
 understanding+0.051
iously+0.049
 accomplishment+0.049
 apology+0.047
mmm+0.045
Neg logit contributions
tub-0.032
 hall-0.032
 outdoors-0.031
 pool-0.030
 garden-0.030
Token!"
Token ID2474
Feature activation-1.75e+00

Pos logit contributions
 understanding+0.015
 accomplishment+0.014
iously+0.014
 apology+0.013
mmm+0.013
Neg logit contributions
 hall-0.016
 woods-0.016
 forest-0.015
 garden-0.015
tub-0.015
Token Twe
Token ID24205
Feature activation-6.66e+00

Pos logit contributions
 accomplishment+0.048
 understanding+0.047
iously+0.046
 apology+0.046
 efforts+0.044
Neg logit contributions
 garden-0.039
 forest-0.039
 hall-0.039
c-0.038
 outdoors-0.038
Tokenety
Token ID2963
Feature activation-3.54e+00

Pos logit contributions
 accomplishment+0.238
 understanding+0.219
 apology+0.211
 efforts+0.210
iously+0.207
Neg logit contributions
c-0.137
bowl-0.104
 forest-0.102
 met-0.102
 outdoors-0.101
Token said
Token ID531
Feature activation-3.92e+00

Pos logit contributions
 accomplishment+0.123
 Deal+0.115
 apology+0.114
mmm+0.113
oral+0.109
Neg logit contributions
c-0.064
 met-0.057
 forest-0.056
 garden-0.055
bowl-0.053
Token.
Token ID13
Feature activation-1.54e+00

Pos logit contributions
mmm+0.126
 accomplishment+0.125
 apology+0.122
 Deal+0.122
 understanding+0.121
Neg logit contributions
 outside-0.067
 outdoors-0.064
 garden-0.064
c-0.063
 hall-0.062
Token
Token ID220
Feature activation-1.19e+00

Pos logit contributions
 accomplishment+0.054
 understanding+0.054
 apology+0.052
iously+0.052
 efforts+0.051
Neg logit contributions
 garden-0.022
 forest-0.022
 hall-0.022
 outdoors-0.021
c-0.021
Token
Token ID198
Feature activation-9.50e-01

Pos logit contributions
 understanding+0.039
 accomplishment+0.039
 apology+0.038
iously+0.038
mmm+0.036
Neg logit contributions
 hall-0.020
 garden-0.020
 forest-0.020
 outdoors-0.019
c-0.019
Token They
Token ID1119
Feature activation-2.41e-01

Pos logit contributions
 understanding+0.019
 accomplishment+0.019
iously+0.019
 apology+0.019
mmm+0.018
Neg logit contributions
 garden-0.013
 hall-0.013
 forest-0.013
 woods-0.013
 outdoors-0.013
Token all
Token ID477
Feature activation-7.35e-02

Pos logit contributions
 understanding+0.007
 accomplishment+0.007
 apology+0.007
iously+0.007
mmm+0.007
Neg logit contributions
 hall-0.005
 garden-0.005
 forest-0.005
 outdoors-0.005
c-0.005
Token smiled
Token ID13541
Feature activation-3.36e+00

Pos logit contributions
 understanding+0.002
 accomplishment+0.002
 apology+0.002
iously+0.002
mmm+0.002
Neg logit contributions
 hall-0.002
 garden-0.002
 forest-0.002
 outdoors-0.002
c-0.002
Token and
Token ID290
Feature activation-2.75e+00

Pos logit contributions
 apology+0.101
 accomplishment+0.101
 understanding+0.100
iously+0.097
mmm+0.096
Neg logit contributions
 garden-0.065
 outdoors-0.064
 forest-0.064
 outside-0.063
 hall-0.062
Token congratulated
Token ID47630
Feature activation-3.33e+00

Pos logit contributions
 apology+0.078
iously+0.077
 accomplishment+0.077
 understanding+0.076
mmm+0.075
Neg logit contributions
 garden-0.059
 met-0.057
 hall-0.057
 forest-0.056
 park-0.056
Token Cas
Token ID11294
Feature activation-6.66e+00

Pos logit contributions
 understanding+0.109
 accomplishment+0.108
 apology+0.105
iously+0.104
mmm+0.102
Neg logit contributions
 outdoors-0.055
 forest-0.055
 garden-0.055
 hall-0.055
 outside-0.053
Tokenper
Token ID525
Feature activation-3.98e+00

Pos logit contributions
 understanding+0.211
 apology+0.207
 accomplishment+0.205
 Deal+0.201
mmm+0.197
Neg logit contributions
 hunt-0.113
 garden-0.113
 pretend-0.113
tub-0.112
 met-0.112
Token on
Token ID319
Feature activation-1.74e+00

Pos logit contributions
 understanding+0.116
 accomplishment+0.115
 apology+0.110
iously+0.110
mmm+0.105
Neg logit contributions
 hall-0.082
 garden-0.082
 forest-0.081
 outdoors-0.080
c-0.079
Token his
Token ID465
Feature activation-2.68e+00

Pos logit contributions
 apology+0.055
 understanding+0.054
iously+0.054
 accomplishment+0.054
mmm+0.052
Neg logit contributions
 garden-0.032
 hall-0.031
 outdoors-0.031
 forest-0.030
 park-0.030
Token amazing
Token ID4998
Feature activation-2.71e-01

Pos logit contributions
 understanding+0.077
iously+0.076
mmm+0.075
 accomplishment+0.073
 apology+0.072
Neg logit contributions
 garden-0.057
 hall-0.055
 forest-0.055
 park-0.054
 outdoors-0.054
Token accomplishment
Token ID29340
Feature activation-1.64e+00

Pos logit contributions
 accomplishment+0.008
 understanding+0.008
 apology+0.008
iously+0.008
 efforts+0.008
Neg logit contributions
 hall-0.005
 garden-0.005
 forest-0.005
 outdoors-0.005
c-0.005
Token
Token ID198
Feature activation-1.16e+00

Pos logit contributions
 understanding+0.036
 accomplishment+0.035
 apology+0.034
iously+0.034
mmm+0.033
Neg logit contributions
 garden-0.023
 hall-0.022
 forest-0.022
 outdoors-0.022
c-0.022
Token
Token ID198
Feature activation-2.86e+00

Pos logit contributions
 accomplishment+0.042
 understanding+0.042
iously+0.041
 apology+0.040
 efforts+0.039
Neg logit contributions
 garden-0.016
 forest-0.016
 hall-0.016
 outdoors-0.015
 park-0.015
Token"
Token ID1
Feature activation-1.31e+00

Pos logit contributions
 accomplishment+0.078
iously+0.075
 apology+0.073
 understanding+0.072
 efforts+0.070
Neg logit contributions
 garden-0.069
 forest-0.069
 woods-0.067
 hall-0.066
 park-0.066
TokenSorry
Token ID14385
Feature activation-3.69e+00

Pos logit contributions
 accomplishment+0.035
 understanding+0.035
iously+0.033
 apology+0.033
mmm+0.032
Neg logit contributions
 garden-0.030
 hall-0.030
 forest-0.030
 outdoors-0.030
c-0.029
Token,
Token ID11
Feature activation-3.95e+00

Pos logit contributions
 accomplishment+0.116
 understanding+0.113
iously+0.110
 apology+0.109
 efforts+0.104
Neg logit contributions
 garden-0.070
 forest-0.069
 hall-0.068
 outdoors-0.068
 park-0.067
Token Farmer
Token ID37755
Feature activation-6.65e+00

Pos logit contributions
 accomplishment+0.095
iously+0.093
 understanding+0.089
 apology+0.087
 efforts+0.086
Neg logit contributions
 garden-0.103
 forest-0.099
 park-0.099
 hall-0.097
 pool-0.097
Token Joe
Token ID5689
Feature activation-3.57e+00

Pos logit contributions
 understanding+0.154
 accomplishment+0.151
iously+0.145
 apology+0.145
mmm+0.138
Neg logit contributions
 hall-0.178
 garden-0.176
 forest-0.175
 outdoors-0.175
c-0.173
Token,"
Token ID553
Feature activation-3.88e+00

Pos logit contributions
 understanding+0.095
 accomplishment+0.093
 apology+0.091
iously+0.090
mmm+0.085
Neg logit contributions
 hall-0.083
 outdoors-0.082
 garden-0.081
 forest-0.080
 woods-0.080
Token Sam
Token ID3409
Feature activation-2.39e+00

Pos logit contributions
 accomplishment+0.099
 understanding+0.096
iously+0.095
 apology+0.094
 efforts+0.090
Neg logit contributions
 garden-0.096
 forest-0.095
 hall-0.093
 park-0.092
 woods-0.091
Token says
Token ID1139
Feature activation-4.83e+00

Pos logit contributions
 understanding+0.065
 accomplishment+0.064
 apology+0.062
iously+0.061
mmm+0.059
Neg logit contributions
 hall-0.054
 garden-0.054
 forest-0.054
 outdoors-0.054
c-0.053
Token.
Token ID13
Feature activation-2.72e+00

Pos logit contributions
 accomplishment+0.170
 understanding+0.169
mmm+0.166
iously+0.161
 apology+0.161
Neg logit contributions
 outside-0.070
 garden-0.066
 outdoors-0.065
 hall-0.064
c-0.061
Token and
Token ID290
Feature activation-9.96e-01

Pos logit contributions
 understanding+0.002
 accomplishment+0.002
iously+0.002
 apology+0.002
 efforts+0.002
Neg logit contributions
 paints-0.002
 outdoors-0.002
c-0.002
 matches-0.002
bowl-0.002
Token found
Token ID1043
Feature activation+6.52e-01

Pos logit contributions
 understanding+0.029
 accomplishment+0.028
iously+0.027
 apology+0.026
mmm+0.025
Neg logit contributions
 outdoors-0.021
 garden-0.021
 hall-0.021
 forest-0.021
c-0.020
Token her
Token ID607
Feature activation-1.60e+00

Pos logit contributions
 hall+0.014
 garden+0.014
 forest+0.014
 hunt+0.014
 sandbox+0.014
Neg logit contributions
 understanding-0.018
 accomplishment-0.018
 apology-0.017
iously-0.017
 efforts-0.016
Token.
Token ID13
Feature activation-5.30e-01

Pos logit contributions
 accomplishment+0.048
 understanding+0.048
iously+0.045
 apology+0.045
 efforts+0.044
Neg logit contributions
 hall-0.033
 forest-0.031
tub-0.031
 outdoors-0.031
c-0.031
Token Poor
Token ID23676
Feature activation-3.59e+00

Pos logit contributions
 accomplishment+0.018
 understanding+0.018
 apology+0.017
iously+0.017
 efforts+0.016
Neg logit contributions
 hall-0.009
 garden-0.009
 forest-0.009
 outdoors-0.008
 woods-0.008
Token H
Token ID367
Feature activation-6.65e+00

Pos logit contributions
 understanding+0.091
 accomplishment+0.088
 apology+0.088
iously+0.087
mmm+0.086
Neg logit contributions
 hall-0.089
 garden-0.087
 outdoors-0.086
 forest-0.086
 park-0.084
Tokenilda
Token ID27281
Feature activation-3.63e+00

Pos logit contributions
 understanding+0.179
 accomplishment+0.165
 apology+0.163
 Deal+0.156
 relief+0.153
Neg logit contributions
 hall-0.162
c-0.153
tub-0.153
 outdoors-0.151
 paints-0.148
Token never
Token ID1239
Feature activation-1.76e+00

Pos logit contributions
 understanding+0.118
 accomplishment+0.117
 apology+0.114
iously+0.112
mmm+0.110
Neg logit contributions
 garden-0.063
 hall-0.063
 forest-0.062
c-0.061
 outdoors-0.060
Token managed
Token ID5257
Feature activation-5.49e-01

Pos logit contributions
 understanding+0.056
 accomplishment+0.054
iously+0.052
 apology+0.052
 efforts+0.050
Neg logit contributions
 hall-0.033
 garden-0.033
 outdoors-0.032
 forest-0.032
c-0.032
Token to
Token ID284
Feature activation-2.42e-01

Pos logit contributions
 understanding+0.019
 accomplishment+0.018
 apology+0.018
iously+0.018
mmm+0.017
Neg logit contributions
 hall-0.009
 garden-0.009
 forest-0.009
 outdoors-0.008
c-0.008
Token get
Token ID651
Feature activation-7.95e-01

Pos logit contributions
 understanding+0.008
 accomplishment+0.007
 apology+0.007
iously+0.007
mmm+0.007
Neg logit contributions
 hall-0.005
 garden-0.005
 forest-0.005
tub-0.005
 outdoors-0.005
Token welcome
Token ID7062
Feature activation-4.21e+00

Pos logit contributions
 accomplishment+0.067
iously+0.065
oral+0.063
mmm+0.063
 apology+0.062
Neg logit contributions
 outside-0.053
 outdoors-0.053
 garden-0.051
 met-0.051
 park-0.050
Token here
Token ID994
Feature activation-2.20e+00

Pos logit contributions
 accomplishment+0.131
 understanding+0.131
 apology+0.126
iously+0.125
mmm+0.122
Neg logit contributions
 outdoors-0.078
 garden-0.078
 hall-0.077
 forest-0.077
 outside-0.075
Token."
Token ID526
Feature activation-1.38e+00

Pos logit contributions
 understanding+0.068
 accomplishment+0.068
iously+0.065
 apology+0.065
mmm+0.063
Neg logit contributions
 garden-0.041
 hall-0.041
 outdoors-0.041
 forest-0.040
 park-0.039
Token
Token ID198
Feature activation-1.57e+00

Pos logit contributions
 accomplishment+0.049
 understanding+0.049
iously+0.048
 apology+0.047
 efforts+0.046
Neg logit contributions
 garden-0.020
 hall-0.019
 outdoors-0.019
 forest-0.019
 park-0.019
Token
Token ID198
Feature activation-2.98e+00

Pos logit contributions
 accomplishment+0.056
 understanding+0.056
iously+0.055
 apology+0.054
 efforts+0.053
Neg logit contributions
 garden-0.023
 hall-0.022
 forest-0.022
 outdoors-0.022
 park-0.022
TokenUn
Token ID3118
Feature activation-6.64e+00

Pos logit contributions
iously+0.090
 accomplishment+0.089
 efforts+0.084
 apology+0.084
 understanding+0.079
Neg logit contributions
 park-0.065
 garden-0.065
 outdoors-0.063
 hall-0.063
 forest-0.062
Tokencle
Token ID2375
Feature activation-4.22e+00

Pos logit contributions
 accomplishment+0.132
iously+0.122
 understanding+0.121
 apology+0.121
 efforts+0.119
Neg logit contributions
c-0.188
 block-0.179
 box-0.174
 garage-0.174
 hall-0.173
Token Jack
Token ID3619
Feature activation-1.47e+00

Pos logit contributions
 understanding+0.105
 accomplishment+0.099
iously+0.096
 apology+0.095
mmm+0.091
Neg logit contributions
 hall-0.108
 outdoors-0.107
 forest-0.107
 garden-0.107
c-0.105
Token gr
Token ID1036
Feature activation-3.44e+00

Pos logit contributions
 understanding+0.041
 accomplishment+0.041
 apology+0.039
iously+0.039
mmm+0.037
Neg logit contributions
 garden-0.032
 hall-0.032
 forest-0.032
 outdoors-0.031
c-0.031
Tokenumbled
Token ID11137
Feature activation-1.93e+00

Pos logit contributions
 accomplishment+0.088
 understanding+0.080
 Deal+0.076
iously+0.074
mmm+0.073
Neg logit contributions
c-0.088
tub-0.087
bowl-0.083
 forest-0.082
 met-0.082
Token and
Token ID290
Feature activation-1.99e+00

Pos logit contributions
 understanding+0.060
 accomplishment+0.059
 apology+0.057
iously+0.057
mmm+0.055
Neg logit contributions
 hall-0.037
 garden-0.037
 forest-0.036
 outdoors-0.036
c-0.035
Token,
Token ID11
Feature activation-3.52e+00

Pos logit contributions
 accomplishment+0.068
 understanding+0.068
 apology+0.067
iously+0.066
 Deal+0.064
Neg logit contributions
 outdoors-0.039
 garden-0.038
 outside-0.038
 hall-0.038
c-0.037
Token "
Token ID366
Feature activation-7.14e-01

Pos logit contributions
 accomplishment+0.115
 apology+0.115
 efforts+0.109
iously+0.109
 understanding+0.109
Neg logit contributions
 garden-0.059
 outside-0.057
 hall-0.057
 outdoors-0.056
 park-0.054
TokenGood
Token ID10248
Feature activation-3.09e+00

Pos logit contributions
 accomplishment+0.019
 understanding+0.019
 apology+0.018
iously+0.018
 efforts+0.017
Neg logit contributions
 garden-0.017
 outdoors-0.017
 hall-0.017
 forest-0.017
c-0.017
Token job
Token ID1693
Feature activation-3.03e+00

Pos logit contributions
 understanding+0.084
iously+0.080
 accomplishment+0.080
 apology+0.079
 Deal+0.078
Neg logit contributions
 garden-0.069
 outside-0.069
 outdoors-0.069
 pretend-0.068
 forest-0.067
Token,
Token ID11
Feature activation-4.81e+00

Pos logit contributions
 understanding+0.090
 accomplishment+0.089
 apology+0.087
iously+0.085
 efforts+0.081
Neg logit contributions
 garden-0.061
 hall-0.060
 outdoors-0.059
 forest-0.059
 pool-0.057
Token Pe
Token ID2631
Feature activation-6.62e+00

Pos logit contributions
 accomplishment+0.125
 apology+0.123
 understanding+0.123
iously+0.123
 efforts+0.115
Neg logit contributions
 garden-0.114
 park-0.109
 forest-0.109
 woods-0.103
c-0.101
Tokenppa
Token ID44989
Feature activation-4.39e+00

Pos logit contributions
 accomplishment+0.122
 understanding+0.118
 apology+0.115
iously+0.109
 Deal+0.104
Neg logit contributions
c-0.215
 garden-0.204
 outdoors-0.202
 forest-0.200
 hunt-0.199
Token.
Token ID13
Feature activation-1.09e+00

Pos logit contributions
 understanding+0.122
 accomplishment+0.121
iously+0.116
 apology+0.115
mmm+0.111
Neg logit contributions
 hall-0.096
 garden-0.095
 forest-0.095
 outdoors-0.094
 woods-0.093
Token You
Token ID921
Feature activation+6.65e-01

Pos logit contributions
 accomplishment+0.029
 understanding+0.029
iously+0.028
 apology+0.028
 efforts+0.027
Neg logit contributions
 hall-0.025
 garden-0.025
 forest-0.024
c-0.024
 outdoors-0.024
Token did
Token ID750
Feature activation-1.18e+00

Pos logit contributions
 woods+0.013
c+0.013
 hall+0.013
 forest+0.012
 outdoors+0.012
Neg logit contributions
 understanding-0.021
 accomplishment-0.020
 apology-0.019
mmm-0.019
 efforts-0.019
Token a
Token ID257
Feature activation-2.41e+00

Pos logit contributions
 understanding+0.041
 accomplishment+0.040
iously+0.039
 apology+0.039
mmm+0.038
Neg logit contributions
 hall-0.019
 garden-0.018
 forest-0.018
c-0.018
 outdoors-0.018
Token having
Token ID1719
Feature activation+1.10e+00

Pos logit contributions
 understanding+0.051
 accomplishment+0.049
 apology+0.045
 efforts+0.044
iously+0.043
Neg logit contributions
c-0.040
 outdoors-0.039
 hall-0.039
 forest-0.038
 woods-0.037
Token lots
Token ID6041
Feature activation-1.36e+00

Pos logit contributions
 hall+0.026
c+0.025
tub+0.025
 forest+0.025
 woods+0.025
Neg logit contributions
 understanding-0.030
 accomplishment-0.030
 efforts-0.026
 apology-0.026
mmm-0.025
Token of
Token ID286
Feature activation+7.78e-01

Pos logit contributions
 understanding+0.033
 accomplishment+0.032
iously+0.032
 apology+0.031
mmm+0.030
Neg logit contributions
 outdoors-0.035
 garden-0.035
 hall-0.034
 forest-0.034
 park-0.034
Token fun
Token ID1257
Feature activation-9.46e-01

Pos logit contributions
 hall+0.017
 garden+0.017
c+0.017
 forest+0.017
 outdoors+0.016
Neg logit contributions
 understanding-0.022
 accomplishment-0.022
 apology-0.021
iously-0.020
mmm-0.020
Token!
Token ID0
Feature activation-4.78e-01

Pos logit contributions
 accomplishment+0.026
 understanding+0.026
iously+0.024
 apology+0.024
 efforts+0.023
Neg logit contributions
c-0.021
 hall-0.021
 forest-0.021
 garden-0.021
 woods-0.020
Token<|endoftext|>
Token ID50256
Feature activation+5.01e+00

Pos logit contributions
 accomplishment+0.018
 understanding+0.018
iously+0.017
 apology+0.017
 efforts+0.016
Neg logit contributions
 hall-0.006
 forest-0.006
 garden-0.006
c-0.006
 park-0.006
TokenOnce
Token ID7454
Feature activation+3.03e+00

Pos logit contributions
 sandbox+0.112
 hunt+0.110
 pots+0.108
 paints+0.108
 pretend+0.106
Neg logit contributions
oral-0.112
Less-0.111
 understanding-0.111
 Deal-0.109
mmm-0.109
Token upon
Token ID2402
Feature activation+3.81e+00

Pos logit contributions
 garden+0.046
 hall+0.046
 forest+0.045
 outdoors+0.045
c+0.044
Neg logit contributions
 understanding-0.104
 accomplishment-0.104
 apology-0.100
iously-0.100
mmm-0.096
Token a
Token ID257
Feature activation-9.47e-02

Pos logit contributions
 hall+0.086
tub+0.084
 paints+0.083
 forest+0.083
c+0.082
Neg logit contributions
 understanding-0.109
 accomplishment-0.093
 apology-0.092
From-0.090
 badly-0.088
Token time
Token ID640
Feature activation+2.61e+00

Pos logit contributions
 understanding+0.003
 accomplishment+0.003
iously+0.003
 apology+0.003
mmm+0.003
Neg logit contributions
 garden-0.002
 outdoors-0.002
 hall-0.002
 forest-0.002
 park-0.002
Token,
Token ID11
Feature activation+3.47e+00

Pos logit contributions
 garden+0.062
 forest+0.061
 hall+0.061
c+0.060
 woods+0.060
Neg logit contributions
 understanding-0.067
 accomplishment-0.066
iously-0.063
 apology-0.062
mmm-0.062
Token a
Token ID257
Feature activation-1.09e+00

Pos logit contributions
 hall+0.017
 forest+0.016
 outdoors+0.016
 garden+0.016
c+0.016
Neg logit contributions
 understanding-0.027
 accomplishment-0.026
 apology-0.025
iously-0.025
 efforts-0.024
Token lot
Token ID1256
Feature activation-2.15e+00

Pos logit contributions
 understanding+0.030
 accomplishment+0.030
 apology+0.028
iously+0.027
 efforts+0.026
Neg logit contributions
 hall-0.024
c-0.024
 outdoors-0.024
 garden-0.024
 forest-0.024
Token of
Token ID286
Feature activation+3.16e-01

Pos logit contributions
 understanding+0.048
 accomplishment+0.047
 apology+0.045
iously+0.044
mmm+0.042
Neg logit contributions
 hall-0.060
 garden-0.060
 forest-0.059
 outdoors-0.059
c-0.058
Token fun
Token ID1257
Feature activation-1.65e+00

Pos logit contributions
 hall+0.007
 garden+0.007
 forest+0.007
c+0.007
 outdoors+0.007
Neg logit contributions
 understanding-0.009
 accomplishment-0.009
 apology-0.009
iously-0.008
 efforts-0.008
Token.
Token ID13
Feature activation-3.92e-01

Pos logit contributions
 understanding+0.048
 accomplishment+0.048
iously+0.045
 apology+0.044
 efforts+0.043
Neg logit contributions
 hall-0.034
 garden-0.034
c-0.034
 forest-0.034
 hunt-0.033
Token<|endoftext|>
Token ID50256
Feature activation+4.43e+00

Pos logit contributions
 accomplishment+0.014
 understanding+0.014
iously+0.014
 apology+0.013
 efforts+0.013
Neg logit contributions
 garden-0.006
 hall-0.006
 forest-0.006
 park-0.006
 woods-0.006
TokenSam
Token ID16305
Feature activation+1.19e+00

Pos logit contributions
 hunt+0.097
 sandbox+0.095
 pretend+0.094
 pots+0.089
 hall+0.088
Neg logit contributions
oral-0.105
 Deal-0.094
Less-0.094
mmm-0.092
 importantly-0.089
Token was
Token ID373
Feature activation+9.56e-02

Pos logit contributions
tub+0.021
 outdoors+0.020
 hall+0.020
 matches+0.019
 hunt+0.019
Neg logit contributions
 understanding-0.035
 accomplishment-0.034
 apology-0.033
iously-0.033
 efforts-0.032
Token a
Token ID257
Feature activation-1.09e+00

Pos logit contributions
 hunt+0.002
 hall+0.002
 forest+0.002
c+0.002
tub+0.002
Neg logit contributions
 understanding-0.003
 accomplishment-0.003
 apology-0.002
iously-0.002
 Deal-0.002
Token tall
Token ID7331
Feature activation-8.46e-01

Pos logit contributions
 accomplishment+0.030
 understanding+0.030
iously+0.030
 apology+0.030
mmm+0.028
Neg logit contributions
 garden-0.025
 forest-0.024
 hall-0.024
 outdoors-0.024
 park-0.023
Token boy
Token ID2933
Feature activation-1.95e+00

Pos logit contributions
 understanding+0.028
 accomplishment+0.027
 apology+0.026
iously+0.026
mmm+0.025
Neg logit contributions
 garden-0.015
 hall-0.015
 forest-0.014
 outdoors-0.014
c-0.014
TokenOnce
Token ID7454
Feature activation+2.46e+00

Pos logit contributions
 sandbox+0.131
 paints+0.130
 hall+0.124
 hunt+0.124
 pots+0.123
Neg logit contributions
 understanding-0.110
 accomplishment-0.103
 apology-0.100
From-0.096
Less-0.095
Token upon
Token ID2402
Feature activation+3.74e+00

Pos logit contributions
 hall+0.032
 garden+0.032
 forest+0.031
 outdoors+0.031
c+0.030
Neg logit contributions
 understanding-0.091
 accomplishment-0.090
iously-0.089
 apology-0.088
mmm-0.085
Token a
Token ID257
Feature activation+1.75e-02

Pos logit contributions
 hall+0.076
 forest+0.074
 paints+0.073
tub+0.073
c+0.073
Neg logit contributions
 understanding-0.112
 accomplishment-0.099
 apology-0.097
From-0.095
mmm-0.095
Token time
Token ID640
Feature activation+2.59e+00

Pos logit contributions
 outdoors+0.000
 park+0.000
 garden+0.000
 outside+0.000
 forest+0.000
Neg logit contributions
iously-0.001
 understanding-0.001
 accomplishment-0.001
 apology-0.001
mmm-0.001
Token,
Token ID11
Feature activation+3.41e+00

Pos logit contributions
 hall+0.054
 garden+0.054
 forest+0.054
 outdoors+0.053
c+0.053
Neg logit contributions
 understanding-0.075
 accomplishment-0.074
 apology-0.071
iously-0.071
mmm-0.068
Token there
Token ID612
Feature activation+5.14e+00

Pos logit contributions
 hunt+0.047
 hall+0.045
 forest+0.045
 fields+0.044
 garden+0.044
Neg logit contributions
 understanding-0.122
 accomplishment-0.115
 apology-0.110
mmm-0.110
 badly-0.108
Token was
Token ID373
Feature activation+2.98e+00

Pos logit contributions
 outdoors+0.102
 hunt+0.101
 hall+0.100
 mountains+0.098
 fields+0.098
Neg logit contributions
 understanding-0.142
 accomplishment-0.137
mmm-0.130
 apology-0.127
 efforts-0.125
Token a
Token ID257
Feature activation-2.32e-01

Pos logit contributions
 hall+0.049
 hunt+0.048
c+0.046
 forest+0.046
tub+0.045
Neg logit contributions
 understanding-0.099
 accomplishment-0.097
mmm-0.092
 apology-0.091
iously-0.090
Token princess
Token ID21752
Feature activation-5.13e-01

Pos logit contributions
iously+0.007
 accomplishment+0.007
 understanding+0.006
 apology+0.006
mmm+0.006
Neg logit contributions
 garden-0.005
 hall-0.005
 forest-0.005
 park-0.005
 outdoors-0.005
Token.
Token ID13
Feature activation+3.19e+00

Pos logit contributions
 understanding+0.014
 accomplishment+0.013
iously+0.013
 apology+0.013
mmm+0.012
Neg logit contributions
 hall-0.011
 garden-0.011
 outdoors-0.011
 woods-0.011
 pool-0.011
Token She
Token ID1375
Feature activation+2.81e+00

Pos logit contributions
 garden+0.063
 forest+0.062
 outdoors+0.062
 hall+0.062
c+0.062
Neg logit contributions
 understanding-0.094
 accomplishment-0.090
 apology-0.088
iously-0.088
mmm-0.086
Token<|endoftext|>
Token ID50256
Feature activation+5.11e+00

Pos logit contributions
 accomplishment+0.015
 understanding+0.015
 apology+0.015
iously+0.015
 efforts+0.014
Neg logit contributions
 garden-0.007
 hall-0.007
 forest-0.006
 outdoors-0.006
c-0.006
TokenOnce
Token ID7454
Feature activation+2.88e+00

Pos logit contributions
 pots+0.116
 hunt+0.115
 paints+0.115
 sandbox+0.115
 matches+0.111
Neg logit contributions
oral-0.112
Less-0.110
From-0.108
 understanding-0.108
mmm-0.107
Token upon
Token ID2402
Feature activation+3.63e+00

Pos logit contributions
 hall+0.043
 garden+0.043
 forest+0.042
 outdoors+0.042
c+0.041
Neg logit contributions
 understanding-0.100
 accomplishment-0.099
 apology-0.096
iously-0.095
mmm-0.092
Token a
Token ID257
Feature activation-4.35e-01

Pos logit contributions
 hall+0.079
tub+0.078
 forest+0.077
 paints+0.077
c+0.076
Neg logit contributions
 understanding-0.106
 accomplishment-0.092
 apology-0.090
From-0.087
 badly-0.086
Token time
Token ID640
Feature activation+2.29e+00

Pos logit contributions
 understanding+0.014
 accomplishment+0.014
iously+0.014
 apology+0.014
mmm+0.013
Neg logit contributions
 outdoors-0.008
 garden-0.008
 park-0.008
 forest-0.008
 hall-0.008
Token there
Token ID612
Feature activation+5.05e+00

Pos logit contributions
 garden+0.054
 forest+0.054
 hall+0.053
c+0.053
 hunt+0.053
Neg logit contributions
 understanding-0.059
 accomplishment-0.058
iously-0.055
 apology-0.055
mmm-0.054
Token were
Token ID547
Feature activation+2.04e+00

Pos logit contributions
 hunt+0.114
 outdoors+0.111
 fields+0.109
 pots+0.109
 paints+0.108
Neg logit contributions
 understanding-0.126
 accomplishment-0.119
mmm-0.113
From-0.109
 apology-0.108
Token two
Token ID734
Feature activation-7.65e-01

Pos logit contributions
 hall+0.041
 hunt+0.041
c+0.040
 garden+0.039
 forest+0.039
Neg logit contributions
 understanding-0.060
 accomplishment-0.058
iously-0.055
mmm-0.055
 apology-0.055
Token friends
Token ID2460
Feature activation-7.26e-01

Pos logit contributions
 accomplishment+0.021
 understanding+0.020
 apology+0.020
iously+0.020
mmm+0.020
Neg logit contributions
 garden-0.018
 forest-0.018
 hall-0.018
 park-0.017
 outdoors-0.017
Token,
Token ID11
Feature activation+1.31e+00

Pos logit contributions
 accomplishment+0.025
 apology+0.024
iously+0.023
 understanding+0.023
mmm+0.023
Neg logit contributions
 garden-0.012
 hall-0.012
 outside-0.012
 forest-0.012
 outdoors-0.012
Token Bob
Token ID5811
Feature activation-1.38e+00

Pos logit contributions
 hall+0.025
 garden+0.025
 forest+0.025
 outdoors+0.025
c+0.025
Neg logit contributions
 understanding-0.040
 accomplishment-0.039
 apology-0.037
iously-0.037
mmm-0.036
TokenOnce
Token ID7454
Feature activation+2.34e+00

Pos logit contributions
 pots+0.142
 sandbox+0.141
 paints+0.139
 hunt+0.139
 matches+0.134
Neg logit contributions
 understanding-0.096
mmm-0.092
 accomplishment-0.092
From-0.091
oral-0.091
Token upon
Token ID2402
Feature activation+3.60e+00

Pos logit contributions
 hall+0.036
 garden+0.036
 forest+0.035
 outdoors+0.035
 hunt+0.034
Neg logit contributions
 understanding-0.080
 accomplishment-0.079
 apology-0.077
iously-0.076
mmm-0.074
Token a
Token ID257
Feature activation-1.71e-01

Pos logit contributions
 hall+0.082
tub+0.080
 paints+0.078
c+0.078
 hunt+0.077
Neg logit contributions
 understanding-0.103
 accomplishment-0.089
 apology-0.086
From-0.085
 badly-0.083
Token time
Token ID640
Feature activation+2.17e+00

Pos logit contributions
 understanding+0.005
 accomplishment+0.005
iously+0.005
 apology+0.005
mmm+0.005
Neg logit contributions
 garden-0.003
 outdoors-0.003
 forest-0.003
 park-0.003
 hall-0.003
Token,
Token ID11
Feature activation+2.56e+00

Pos logit contributions
 garden+0.048
 hall+0.048
 forest+0.047
 outdoors+0.047
c+0.047
Neg logit contributions
 understanding-0.060
 accomplishment-0.059
iously-0.057
 apology-0.056
mmm-0.055
Token there
Token ID612
Feature activation+4.66e+00

Pos logit contributions
 hunt+0.040
 hall+0.035
 forest+0.035
 fields+0.034
 garden+0.033
Neg logit contributions
 understanding-0.090
 accomplishment-0.085
iously-0.082
 badly-0.081
mmm-0.081
Token was
Token ID373
Feature activation+2.74e+00

Pos logit contributions
 hunt+0.105
 outdoors+0.102
 paints+0.102
 mountains+0.100
 pots+0.099
Neg logit contributions
 understanding-0.116
 accomplishment-0.110
From-0.103
mmm-0.102
 efforts-0.100
Token a
Token ID257
Feature activation-5.16e-04

Pos logit contributions
 hunt+0.048
 hall+0.048
c+0.045
tub+0.044
 forest+0.044
Neg logit contributions
 understanding-0.088
 accomplishment-0.086
mmm-0.082
 apology-0.080
iously-0.080
Token brave
Token ID14802
Feature activation-2.62e-01

Pos logit contributions
 accomplishment+0.000
iously+0.000
 understanding+0.000
 apology+0.000
mmm+0.000
Neg logit contributions
 garden-0.000
 hall-0.000
 forest-0.000
 outdoors-0.000
 park-0.000
Token man
Token ID582
Feature activation-1.05e+00

Pos logit contributions
 understanding+0.008
 accomplishment+0.008
iously+0.007
 apology+0.007
mmm+0.007
Neg logit contributions
 garden-0.005
 hall-0.005
 forest-0.005
 outdoors-0.005
 park-0.005
Token called
Token ID1444
Feature activation+1.05e+00

Pos logit contributions
 understanding+0.024
iously+0.021
 accomplishment+0.020
 apology+0.020
olding+0.020
Neg logit contributions
tub-0.027
 sandbox-0.027
 hall-0.027
 garden-0.027
 woods-0.027
Token liked
Token ID8288
Feature activation+1.39e+00

Pos logit contributions
 hall+0.053
 garden+0.050
tub+0.050
 pool+0.050
 sandbox+0.050
Neg logit contributions
 understanding-0.068
 accomplishment-0.066
iously-0.062
 apology-0.061
 efforts-0.060
Token to
Token ID284
Feature activation+3.38e+00

Pos logit contributions
 hall+0.029
c+0.028
 garden+0.028
 sandbox+0.028
 paints+0.027
Neg logit contributions
 understanding-0.040
 accomplishment-0.038
iously-0.037
 efforts-0.035
 apology-0.034
Token bounce
Token ID20110
Feature activation+3.51e+00

Pos logit contributions
 hall+0.063
 woods+0.062
 mountains+0.062
 fields+0.062
 forest+0.061
Neg logit contributions
 understanding-0.103
 accomplishment-0.102
 apology-0.101
mmm-0.093
 efforts-0.091
Token on
Token ID319
Feature activation+2.78e+00

Pos logit contributions
 hall+0.073
 garden+0.070
 sandbox+0.069
 met+0.067
 backyard+0.067
Neg logit contributions
 accomplishment-0.095
 understanding-0.093
 badly-0.088
 apology-0.087
iously-0.085
Token.
Token ID13
Feature activation+2.99e+00

Pos logit contributions
 met+0.068
tub+0.067
 hall+0.064
c+0.063
bowl+0.062
Neg logit contributions
 accomplishment-0.070
 understanding-0.069
iously-0.065
 apology-0.061
 efforts-0.060
Token He
Token ID679
Feature activation+2.78e+00

Pos logit contributions
 hall+0.072
 hunt+0.072
 outdoors+0.071
 garden+0.071
 pots+0.071
Neg logit contributions
 understanding-0.072
 accomplishment-0.069
 apology-0.068
mmm-0.065
iously-0.064
Token liked
Token ID8288
Feature activation+1.41e+00

Pos logit contributions
 outdoors+0.052
 hall+0.051
 garden+0.050
tub+0.049
 forest+0.049
Neg logit contributions
 understanding-0.083
 accomplishment-0.081
iously-0.075
 apology-0.075
 efforts-0.075
Token to
Token ID284
Feature activation+3.17e+00

Pos logit contributions
c+0.032
 hall+0.032
tub+0.031
 grounds+0.030
 garden+0.030
Neg logit contributions
 understanding-0.039
 accomplishment-0.037
iously-0.034
 efforts-0.033
 apology-0.033
Token feel
Token ID1254
Feature activation-3.32e-01

Pos logit contributions
 woods+0.061
 forest+0.058
tub+0.058
 hall+0.057
 fields+0.057
Neg logit contributions
 accomplishment-0.099
 understanding-0.096
 apology-0.094
 efforts-0.085
iously-0.083
Token the
Token ID262
Feature activation+1.13e+00

Pos logit contributions
 understanding+0.010
 accomplishment+0.010
 apology+0.010
iously+0.009
 efforts+0.009
Neg logit contributions
 hall-0.007
 garden-0.006
 forest-0.006
c-0.006
 hunt-0.006
Token pillow
Token ID28774
Feature activation+1.78e+00

Pos logit contributions
c+0.024
 hall+0.023
 hunt+0.023
 met+0.023
 paints+0.022
Neg logit contributions
 understanding-0.033
 accomplishment-0.033
 apology-0.031
 Deal-0.029
 efforts-0.029
Token that
Token ID326
Feature activation+2.58e+00

Pos logit contributions
c+0.017
 outdoors+0.017
 garden+0.017
 hall+0.016
 forest+0.016
Neg logit contributions
 understanding-0.030
 accomplishment-0.030
iously-0.030
 apology-0.029
 efforts-0.028
Token was
Token ID373
Feature activation+1.37e+00

Pos logit contributions
 outdoors+0.069
 garden+0.063
c+0.063
 hall+0.063
 hunt+0.061
Neg logit contributions
 understanding-0.058
 accomplishment-0.055
 apology-0.050
 efforts-0.050
iously-0.048
Token different
Token ID1180
Feature activation-6.48e-01

Pos logit contributions
 hall+0.028
 garden+0.028
 outdoors+0.028
 forest+0.028
c+0.027
Neg logit contributions
 understanding-0.040
 accomplishment-0.040
 apology-0.038
iously-0.037
 efforts-0.036
Token.
Token ID13
Feature activation+1.53e+00

Pos logit contributions
 understanding+0.017
 accomplishment+0.017
iously+0.016
 apology+0.016
 efforts+0.015
Neg logit contributions
 garden-0.015
c-0.015
 hall-0.015
 forest-0.015
 outdoors-0.015
Token It
Token ID632
Feature activation+3.29e+00

Pos logit contributions
 hall+0.033
 garden+0.033
 forest+0.033
 outdoors+0.033
c+0.032
Neg logit contributions
 understanding-0.043
 accomplishment-0.042
 apology-0.041
iously-0.040
mmm-0.039
Token was
Token ID373
Feature activation+1.19e+00

Pos logit contributions
 outdoors+0.073
 garden+0.067
 fields+0.067
 sandbox+0.064
 mountains+0.064
Neg logit contributions
 understanding-0.091
 accomplishment-0.086
iously-0.085
 apology-0.084
 apologised-0.081
Token a
Token ID257
Feature activation+3.31e-01

Pos logit contributions
 hall+0.022
 garden+0.022
 forest+0.022
 outdoors+0.022
c+0.022
Neg logit contributions
 understanding-0.037
 accomplishment-0.037
 apology-0.035
iously-0.035
mmm-0.034
Token toy
Token ID13373
Feature activation+8.53e-01

Pos logit contributions
 hall+0.006
 garden+0.006
 outdoors+0.006
 forest+0.006
c+0.006
Neg logit contributions
 understanding-0.011
 accomplishment-0.011
 apology-0.010
iously-0.010
mmm-0.010
Token robot
Token ID9379
Feature activation+5.03e-01

Pos logit contributions
c+0.013
 outdoors+0.013
 garden+0.013
 hall+0.012
 forest+0.012
Neg logit contributions
 understanding-0.029
 accomplishment-0.028
 apology-0.028
iously-0.027
mmm-0.026
Token that
Token ID326
Feature activation+3.79e+00

Pos logit contributions
 outdoors+0.012
 garden+0.012
 fields+0.012
 woods+0.011
 forest+0.011
Neg logit contributions
 understanding-0.013
 accomplishment-0.012
iously-0.012
 apology-0.011
 efforts-0.011
Token had
Token ID550
Feature activation+3.03e+00

Pos logit contributions
 outdoors+0.088
c+0.081
 garden+0.080
 hall+0.079
 valley+0.079
Neg logit contributions
 understanding-0.096
 accomplishment-0.089
 apology-0.084
 apologised-0.082
 caring-0.081
Token beautiful
Token ID4950
Feature activation-3.80e-01

Pos logit contributions
iously+0.056
 understanding+0.053
 apology+0.053
 accomplishment+0.053
mmm+0.052
Neg logit contributions
 garden-0.037
 forest-0.036
 hall-0.035
 park-0.035
 pretend-0.033
Token picture
Token ID4286
Feature activation-2.97e-01

Pos logit contributions
 accomplishment+0.012
 understanding+0.012
 apology+0.012
iously+0.012
mmm+0.011
Neg logit contributions
 hall-0.006
 outdoors-0.006
 forest-0.006
c-0.006
 garden-0.006
Token and
Token ID290
Feature activation-1.57e+00

Pos logit contributions
 accomplishment+0.006
 understanding+0.006
 efforts+0.005
iously+0.005
 apology+0.005
Neg logit contributions
 hall-0.009
 met-0.009
 forest-0.009
 woods-0.009
 garden-0.009
Token the
Token ID262
Feature activation-1.93e+00

Pos logit contributions
 understanding+0.045
 accomplishment+0.044
iously+0.039
 apology+0.038
 efforts+0.038
Neg logit contributions
 outdoors-0.036
 hall-0.035
c-0.035
 forest-0.033
 hunt-0.033
Token slow
Token ID3105
Feature activation-1.18e+00

Pos logit contributions
 understanding+0.059
 accomplishment+0.056
 apology+0.052
iously+0.050
 efforts+0.049
Neg logit contributions
c-0.042
tub-0.041
bowl-0.039
 hall-0.038
 pots-0.038
Token deer
Token ID20096
Feature activation+9.07e-01

Pos logit contributions
 accomplishment+0.034
 understanding+0.033
iously+0.032
 efforts+0.030
 apology+0.030
Neg logit contributions
 hall-0.025
bowl-0.025
tub-0.024
 pots-0.024
c-0.024
Token was
Token ID373
Feature activation-2.21e+00

Pos logit contributions
 outdoors+0.018
 woods+0.018
 hall+0.018
 forest+0.018
tub+0.017
Neg logit contributions
 understanding-0.027
 accomplishment-0.026
iously-0.025
 efforts-0.024
 apology-0.024
Token very
Token ID845
Feature activation-3.19e+00

Pos logit contributions
 understanding+0.052
 accomplishment+0.051
iously+0.047
 apology+0.047
mmm+0.045
Neg logit contributions
 hall-0.059
 forest-0.059
 garden-0.059
 outdoors-0.058
 hunt-0.058
Token proud
Token ID6613
Feature activation-3.10e+00

Pos logit contributions
iously+0.090
 apology+0.084
 Deal+0.084
mmm+0.083
oral+0.083
Neg logit contributions
 garden-0.072
 pretend-0.071
 outdoors-0.070
 outside-0.068
 hall-0.067
Token of
Token ID286
Feature activation-1.44e+00

Pos logit contributions
 understanding+0.087
 accomplishment+0.086
 apology+0.083
iously+0.083
mmm+0.079
Neg logit contributions
 garden-0.068
 hall-0.068
 forest-0.067
 outdoors-0.067
c-0.065
Token it
Token ID340
Feature activation-1.68e+00

Pos logit contributions
 understanding+0.041
 accomplishment+0.040
 apology+0.039
iously+0.039
mmm+0.037
Neg logit contributions
 hall-0.032
 garden-0.031
 forest-0.031
 outdoors-0.031
c-0.031
Token cat
Token ID3797
Feature activation+1.41e+00

Pos logit contributions
 met+0.045
c+0.045
 outdoors+0.043
bowl+0.043
 pots+0.042
Neg logit contributions
 understanding-0.075
 apology-0.073
 accomplishment-0.072
iously-0.071
mmm-0.069
Token.
Token ID13
Feature activation+2.16e+00

Pos logit contributions
tub+0.040
 pots+0.039
 paints+0.039
 valleys+0.039
 fields+0.039
Neg logit contributions
 understanding-0.026
 badly-0.023
mmm-0.023
iously-0.022
 accomplishment-0.022
Token The
Token ID383
Feature activation+3.26e-01

Pos logit contributions
 garden+0.042
 hall+0.042
 forest+0.042
 outdoors+0.041
c+0.041
Neg logit contributions
 understanding-0.066
 accomplishment-0.066
 apology-0.063
iously-0.063
mmm-0.060
Token cat
Token ID3797
Feature activation+3.36e+00

Pos logit contributions
 outdoors+0.006
bowl+0.006
tub+0.006
 hunt+0.006
 met+0.006
Neg logit contributions
 understanding-0.010
 apology-0.009
 accomplishment-0.009
 efforts-0.009
 Deal-0.008
Token has
Token ID468
Feature activation+4.36e+00

Pos logit contributions
tub+0.070
 outdoors+0.068
bowl+0.061
 hall+0.059
 port+0.058
Neg logit contributions
 understanding-0.096
iously-0.092
 apology-0.086
 accomplishment-0.085
mmm-0.085
Token shiny
Token ID22441
Feature activation+3.22e+00

Pos logit contributions
tub+0.109
 hall+0.101
bowl+0.097
 valley+0.097
 outdoors+0.097
Neg logit contributions
 understanding-0.125
iously-0.102
 apology-0.101
 accomplishment-0.101
 efforts-0.101
Token fur
Token ID9230
Feature activation+2.30e+00

Pos logit contributions
 hall+0.082
tub+0.082
 outdoors+0.078
 hunt+0.078
 met+0.077
Neg logit contributions
 understanding-0.078
iously-0.077
 apology-0.075
 accomplishment-0.075
mmm-0.070
Token and
Token ID290
Feature activation+2.00e+00

Pos logit contributions
 hall+0.062
 outdoors+0.062
 met+0.062
 forest+0.061
 woods+0.060
Neg logit contributions
iously-0.049
 understanding-0.047
 badly-0.043
 accomplishment-0.041
mmm-0.040
Token bright
Token ID6016
Feature activation+1.10e+00

Pos logit contributions
tub+0.045
 outdoors+0.045
 hall+0.043
bowl+0.042
 forest+0.041
Neg logit contributions
 understanding-0.058
 accomplishment-0.054
iously-0.052
 apology-0.052
mmm-0.050
Token eyes
Token ID2951
Feature activation+1.67e-01

Pos logit contributions
 hall+0.031
 hunt+0.031
tub+0.031
 met+0.029
 forest+0.029
Neg logit contributions
 understanding-0.025
iously-0.024
 accomplishment-0.023
 apology-0.022
 efforts-0.021
Token.
Token ID13
Feature activation+1.59e+00

Pos logit contributions
 hall+0.005
 forest+0.004
 sandbox+0.004
 woods+0.004
tub+0.004
Neg logit contributions
 understanding-0.004
iously-0.004
 accomplishment-0.004
mmm-0.003
 apology-0.003
Token to
Token ID284
Feature activation+1.43e+00

Pos logit contributions
 understanding+0.015
 accomplishment+0.014
iously+0.014
 efforts+0.013
 apology+0.013
Neg logit contributions
 hall-0.012
c-0.012
tub-0.011
 hunt-0.011
 forest-0.011
Token watch
Token ID2342
Feature activation+7.18e-01

Pos logit contributions
tub+0.029
 hall+0.029
 woods+0.029
c+0.029
 forest+0.028
Neg logit contributions
 understanding-0.043
 accomplishment-0.042
 apology-0.041
iously-0.038
mmm-0.037
Token them
Token ID606
Feature activation-9.50e-01

Pos logit contributions
 hall+0.016
c+0.016
 garden+0.016
 forest+0.015
 sandbox+0.015
Neg logit contributions
 understanding-0.020
 accomplishment-0.020
iously-0.019
 apology-0.019
mmm-0.018
Token and
Token ID290
Feature activation-6.84e-01

Pos logit contributions
 understanding+0.026
 accomplishment+0.025
iously+0.024
 apology+0.023
mmm+0.022
Neg logit contributions
 hall-0.022
 forest-0.021
 woods-0.021
 met-0.021
tub-0.021
Token he
Token ID339
Feature activation+7.72e-01

Pos logit contributions
 understanding+0.018
 accomplishment+0.017
iously+0.016
 efforts+0.015
 apology+0.015
Neg logit contributions
c-0.017
 outdoors-0.017
 hall-0.016
tub-0.016
 forest-0.016
Token noticed
Token ID6810
Feature activation+1.24e+00

Pos logit contributions
 outdoors+0.018
 hall+0.017
 forest+0.016
 garden+0.016
tub+0.016
Neg logit contributions
 understanding-0.022
 accomplishment-0.020
iously-0.020
 efforts-0.019
 apology-0.019
Token how
Token ID703
Feature activation-1.49e+00

Pos logit contributions
 hall+0.028
 hunt+0.027
c+0.027
 garden+0.026
 outdoors+0.026
Neg logit contributions
 understanding-0.036
 accomplishment-0.033
iously-0.033
 apology-0.031
 efforts-0.030
Token much
Token ID881
Feature activation-3.04e-01

Pos logit contributions
 understanding+0.038
 accomplishment+0.037
 apology+0.035
iously+0.034
 efforts+0.033
Neg logit contributions
 hall-0.038
 garden-0.038
 forest-0.037
 hunt-0.037
c-0.037
Token they
Token ID484
Feature activation+1.64e+00

Pos logit contributions
 understanding+0.008
 accomplishment+0.008
 apology+0.007
 efforts+0.007
iously+0.007
Neg logit contributions
c-0.007
 hall-0.007
tub-0.007
 garden-0.007
 outdoors-0.007
Token seemed
Token ID3947
Feature activation-7.26e-01

Pos logit contributions
 hall+0.035
 garden+0.034
 outdoors+0.034
tub+0.034
c+0.033
Neg logit contributions
 understanding-0.048
 accomplishment-0.044
iously-0.042
 efforts-0.041
 apology-0.041
Token to
Token ID284
Feature activation+4.11e-01

Pos logit contributions
 understanding+0.021
 accomplishment+0.020
iously+0.019
 efforts+0.018
 apology+0.018
Neg logit contributions
 hall-0.016
c-0.016
 forest-0.015
 sandbox-0.015
rob-0.015
Token it
Token ID340
Feature activation+7.08e-01

Pos logit contributions
 accomplishment+0.082
 understanding+0.082
iously+0.081
 apology+0.078
mmm+0.075
Neg logit contributions
 garden-0.061
 hall-0.060
 park-0.058
 forest-0.058
 pretend-0.058
Token's
Token ID338
Feature activation-5.90e-01

Pos logit contributions
 outdoors+0.012
 woods+0.011
 forest+0.011
 fields+0.011
 hall+0.011
Neg logit contributions
 understanding-0.023
 apology-0.023
 accomplishment-0.023
iously-0.023
mmm-0.022
Token the
Token ID262
Feature activation+1.69e-01

Pos logit contributions
 understanding+0.015
 accomplishment+0.015
 apology+0.015
iously+0.015
oral+0.014
Neg logit contributions
 garden-0.014
 outside-0.014
 hall-0.014
 outdoors-0.014
 park-0.014
Token best
Token ID1266
Feature activation-1.23e+00

Pos logit contributions
c+0.003
 met+0.003
bowl+0.003
tub+0.003
 hunt+0.003
Neg logit contributions
 accomplishment-0.005
 understanding-0.005
 apology-0.005
 Deal-0.005
 efforts-0.005
Token!"
Token ID2474
Feature activation-2.08e+00

Pos logit contributions
 accomplishment+0.035
 understanding+0.035
iously+0.034
 apology+0.033
 efforts+0.032
Neg logit contributions
c-0.025
 met-0.025
 hall-0.024
 forest-0.024
 garden-0.024
Token Lily
Token ID20037
Feature activation-2.25e+00

Pos logit contributions
iously+0.073
mmm+0.066
 accomplishment+0.066
 understanding+0.065
istent+0.064
Neg logit contributions
 hall-0.038
 garden-0.038
 park-0.037
 pool-0.035
 pretend-0.035
Token says
Token ID1139
Feature activation-3.99e+00

Pos logit contributions
 understanding+0.061
 accomplishment+0.060
 apology+0.058
iously+0.058
mmm+0.055
Neg logit contributions
 hall-0.051
 garden-0.051
 forest-0.050
 outdoors-0.050
c-0.049
Token.
Token ID13
Feature activation-1.15e+00

Pos logit contributions
mmm+0.145
 accomplishment+0.144
 understanding+0.142
iously+0.140
oral+0.139
Neg logit contributions
 outside-0.056
 outdoors-0.048
 garden-0.047
 pretend-0.047
 hall-0.047
Token
Token ID198
Feature activation-4.57e-01

Pos logit contributions
 accomplishment+0.042
 understanding+0.041
iously+0.041
 apology+0.040
 efforts+0.039
Neg logit contributions
 garden-0.017
 hall-0.016
 forest-0.016
 park-0.016
 outdoors-0.016
Token
Token ID198
Feature activation-1.98e+00

Pos logit contributions
 accomplishment+0.017
iously+0.017
 understanding+0.017
 efforts+0.017
 apology+0.016
Neg logit contributions
 garden-0.006
 park-0.005
 hall-0.005
 forest-0.005
 outside-0.005
TokenMom
Token ID29252
Feature activation-1.28e+00

Pos logit contributions
 accomplishment+0.058
iously+0.058
 efforts+0.054
 apology+0.054
 understanding+0.053
Neg logit contributions
 garden-0.046
 park-0.045
 hall-0.043
 outside-0.043
 forest-0.042
Token "
Token ID366
Feature activation+1.27e+00

Pos logit contributions
 accomplishment+0.035
 understanding+0.034
 apology+0.033
iously+0.033
mmm+0.032
Neg logit contributions
 garden-0.022
 hall-0.022
 forest-0.022
 outdoors-0.021
 park-0.021
TokenTom
Token ID13787
Feature activation-1.96e+00

Pos logit contributions
 hall+0.027
 garden+0.027
 forest+0.027
 outdoors+0.026
c+0.026
Neg logit contributions
 understanding-0.036
 accomplishment-0.036
iously-0.035
 apology-0.035
mmm-0.033
Token,
Token ID11
Feature activation-7.10e-01

Pos logit contributions
 understanding+0.060
 accomplishment+0.059
iously+0.058
 apology+0.057
mmm+0.055
Neg logit contributions
 hall-0.037
 garden-0.037
 outdoors-0.037
 forest-0.037
 woods-0.036
Token help
Token ID1037
Feature activation-4.46e-01

Pos logit contributions
 understanding+0.019
 accomplishment+0.018
 apology+0.018
iously+0.017
mmm+0.017
Neg logit contributions
 garden-0.017
 hall-0.017
 forest-0.017
 outdoors-0.017
c-0.016
Token me
Token ID502
Feature activation-1.62e+00

Pos logit contributions
 understanding+0.014
 accomplishment+0.014
iously+0.013
 apology+0.013
mmm+0.013
Neg logit contributions
 hall-0.009
 garden-0.009
 forest-0.009
 outdoors-0.009
c-0.008
Token!"
Token ID2474
Feature activation-1.18e+00

Pos logit contributions
 understanding+0.047
 accomplishment+0.047
 apology+0.045
iously+0.044
mmm+0.043
Neg logit contributions
 garden-0.034
 hall-0.034
 outdoors-0.034
 forest-0.034
c-0.033
Token
Token ID198
Feature activation-4.99e-01

Pos logit contributions
 understanding+0.040
 accomplishment+0.040
iously+0.038
 apology+0.038
mmm+0.037
Neg logit contributions
 garden-0.019
 hall-0.019
 forest-0.019
 outdoors-0.018
 park-0.018
Token
Token ID198
Feature activation-1.76e+00

Pos logit contributions
 accomplishment+0.019
iously+0.018
 understanding+0.018
 apology+0.018
 efforts+0.018
Neg logit contributions
 garden-0.006
 forest-0.006
 hall-0.006
 park-0.006
 outdoors-0.006
TokenTom
Token ID13787
Feature activation-1.86e+00

Pos logit contributions
 accomplishment+0.052
iously+0.050
 apology+0.048
 understanding+0.048
 efforts+0.048
Neg logit contributions
 park-0.040
 garden-0.040
 hall-0.040
 forest-0.039
 woods-0.038
Token ran
Token ID4966
Feature activation-1.15e+00

Pos logit contributions
 understanding+0.059
iously+0.056
 accomplishment+0.056
 apology+0.055
 efforts+0.053
Neg logit contributions
 outdoors-0.034
 hall-0.033
 garden-0.033
 forest-0.033
 woods-0.032
Token to
Token ID284
Feature activation-4.51e-03

Pos logit contributions
mmm+0.042
 accomplishment+0.042
 understanding+0.041
iously+0.040
 apology+0.040
Neg logit contributions
 outside-0.018
 hall-0.017
 park-0.016
 outdoors-0.016
 garden-0.016
Token,
Token ID11
Feature activation-1.70e+00

Pos logit contributions
 hall+0.007
 garden+0.007
 forest+0.007
 outdoors+0.007
c+0.007
Neg logit contributions
 understanding-0.009
 accomplishment-0.009
iously-0.009
 apology-0.009
mmm-0.008
Token she
Token ID673
Feature activation-7.17e-01

Pos logit contributions
 understanding+0.044
 accomplishment+0.043
 apology+0.042
iously+0.042
mmm+0.039
Neg logit contributions
 garden-0.041
 hall-0.041
 outdoors-0.040
 forest-0.040
 park-0.039
Token said
Token ID531
Feature activation-1.91e+00

Pos logit contributions
 understanding+0.021
 accomplishment+0.021
 apology+0.020
iously+0.020
mmm+0.019
Neg logit contributions
 garden-0.015
 hall-0.015
 forest-0.015
 outdoors-0.014
c-0.014
Token goodbye
Token ID24829
Feature activation-2.24e+00

Pos logit contributions
 accomplishment+0.027
 understanding+0.027
 apology+0.025
iously+0.025
mmm+0.023
Neg logit contributions
 garden-0.067
 outdoors-0.067
 hall-0.067
 forest-0.066
 outside-0.065
Token to
Token ID284
Feature activation-5.89e-01

Pos logit contributions
 understanding+0.073
 accomplishment+0.072
iously+0.070
 apology+0.069
mmm+0.067
Neg logit contributions
 hall-0.039
 garden-0.038
 forest-0.038
 outdoors-0.038
c-0.037
Token her
Token ID607
Feature activation-3.25e+00

Pos logit contributions
 understanding+0.020
 accomplishment+0.020
iously+0.020
 apology+0.020
mmm+0.019
Neg logit contributions
 garden-0.009
 hall-0.009
 park-0.009
 forest-0.009
 outdoors-0.009
Token fake
Token ID8390
Feature activation-7.95e-01

Pos logit contributions
 understanding+0.078
iously+0.076
 accomplishment+0.076
mmm+0.075
 apology+0.074
Neg logit contributions
 garden-0.086
 hall-0.085
 forest-0.083
 park-0.082
 outdoors-0.081
Token kingdom
Token ID13239
Feature activation-6.78e-01

Pos logit contributions
 accomplishment+0.029
 understanding+0.027
 apology+0.027
iously+0.026
 efforts+0.026
Neg logit contributions
c-0.012
bowl-0.011
 outdoors-0.011
 gra-0.010
 fields-0.010
Token and
Token ID290
Feature activation-2.29e+00

Pos logit contributions
 understanding+0.020
 accomplishment+0.020
iously+0.019
 apology+0.019
 efforts+0.018
Neg logit contributions
 hall-0.014
 garden-0.013
 forest-0.013
 outdoors-0.013
c-0.013
Token ran
Token ID4966
Feature activation-9.26e-01

Pos logit contributions
 understanding+0.064
 accomplishment+0.064
 apology+0.062
iously+0.061
mmm+0.059
Neg logit contributions
 garden-0.050
 hall-0.050
 forest-0.049
 outdoors-0.048
 park-0.048
Token off
Token ID572
Feature activation-1.61e+00

Pos logit contributions
 understanding+0.026
 accomplishment+0.026
 apology+0.025
iously+0.025
mmm+0.024
Neg logit contributions
 hall-0.020
 garden-0.020
 forest-0.020
c-0.019
 outdoors-0.019
Token.
Token ID13
Feature activation+2.01e-01

Pos logit contributions
 forest+0.010
 outdoors+0.010
 garden+0.010
 woods+0.010
 hall+0.010
Neg logit contributions
 understanding-0.014
 accomplishment-0.013
iously-0.013
 apology-0.013
 efforts-0.012
Token They
Token ID1119
Feature activation+4.61e-01

Pos logit contributions
 hunt+0.004
c+0.004
 outdoors+0.004
 forest+0.004
 hall+0.004
Neg logit contributions
 understanding-0.006
 accomplishment-0.005
 apology-0.005
iously-0.005
mmm-0.005
Token felt
Token ID2936
Feature activation-1.22e+00

Pos logit contributions
 outdoors+0.010
 hall+0.010
 forest+0.009
 garden+0.009
c+0.009
Neg logit contributions
 understanding-0.014
 accomplishment-0.013
 apology-0.012
iously-0.012
 efforts-0.012
Token scared
Token ID12008
Feature activation-2.66e+00

Pos logit contributions
 understanding+0.020
 accomplishment+0.019
 apology+0.018
iously+0.016
 efforts+0.015
Neg logit contributions
 hall-0.043
 hunt-0.042
c-0.042
 forest-0.042
 outdoors-0.041
Token and
Token ID290
Feature activation-2.62e+00

Pos logit contributions
 accomplishment+0.086
 understanding+0.084
 apology+0.082
iously+0.082
mmm+0.081
Neg logit contributions
 garden-0.047
 outdoors-0.047
 forest-0.047
 outside-0.047
 hall-0.047
Token sorry
Token ID7926
Feature activation-1.76e+00

Pos logit contributions
 understanding+0.044
 accomplishment+0.043
 apology+0.040
iously+0.039
mmm+0.036
Neg logit contributions
 hall-0.087
 garden-0.087
 forest-0.087
 outdoors-0.087
c-0.086
Token.
Token ID13
Feature activation-5.84e-01

Pos logit contributions
 accomplishment+0.058
 understanding+0.057
iously+0.056
 apology+0.055
mmm+0.054
Neg logit contributions
 garden-0.030
 hall-0.029
 forest-0.029
 outside-0.028
 outdoors-0.028
Token They
Token ID1119
Feature activation-2.24e-01

Pos logit contributions
 understanding+0.018
 accomplishment+0.018
 apology+0.017
iously+0.017
mmm+0.016
Neg logit contributions
 hall-0.011
 garden-0.011
 forest-0.011
 outdoors-0.011
c-0.011
Token said
Token ID531
Feature activation-1.20e+00

Pos logit contributions
 accomplishment+0.007
iously+0.006
 understanding+0.006
 apology+0.006
mmm+0.006
Neg logit contributions
 garden-0.005
 forest-0.005
 hall-0.005
 park-0.005
 met-0.005
Token,
Token ID11
Feature activation-2.74e+00

Pos logit contributions
 accomplishment+0.033
 understanding+0.033
iously+0.032
 apology+0.031
mmm+0.031
Neg logit contributions
 garden-0.026
 hall-0.026
 forest-0.026
 outdoors-0.026
 park-0.025
Token "
Token ID366
Feature activation-4.29e-01

Pos logit contributions
 accomplishment+0.079
 understanding+0.079
iously+0.076
 apology+0.076
mmm+0.073
Neg logit contributions
 garden-0.057
 hall-0.057
 forest-0.056
 outdoors-0.055
 park-0.054
Token The
Token ID383
Feature activation-2.51e+00

Pos logit contributions
 understanding+0.007
 accomplishment+0.006
 apology+0.006
iously+0.006
mmm+0.006
Neg logit contributions
 hall-0.004
 garden-0.004
 forest-0.004
 outdoors-0.004
c-0.004
Token little
Token ID1310
Feature activation-3.18e+00

Pos logit contributions
 understanding+0.064
iously+0.064
 accomplishment+0.064
mmm+0.062
 apology+0.061
Neg logit contributions
 garden-0.062
 hall-0.062
 forest-0.062
 park-0.061
 woods-0.060
Token boy
Token ID2933
Feature activation-9.05e-01

Pos logit contributions
 understanding+0.083
 accomplishment+0.081
 apology+0.077
iously+0.075
 efforts+0.074
Neg logit contributions
 hunt-0.075
 outdoors-0.074
 garden-0.073
 hall-0.073
c-0.072
Token stared
Token ID18484
Feature activation-2.65e+00

Pos logit contributions
 understanding+0.029
 accomplishment+0.028
iously+0.027
 apology+0.027
 efforts+0.026
Neg logit contributions
 hall-0.016
 outdoors-0.016
 garden-0.016
 forest-0.016
 hunt-0.016
Token up
Token ID510
Feature activation-2.28e+00

Pos logit contributions
 understanding+0.073
 accomplishment+0.071
iously+0.068
 apology+0.066
 efforts+0.064
Neg logit contributions
 hall-0.062
 hunt-0.061
 forest-0.060
c-0.060
 garden-0.060
Token at
Token ID379
Feature activation-1.21e-02

Pos logit contributions
 understanding+0.066
 accomplishment+0.065
 apology+0.063
iously+0.063
mmm+0.061
Neg logit contributions
 garden-0.048
 hall-0.047
 forest-0.047
 outdoors-0.047
c-0.046
Token the
Token ID262
Feature activation-8.06e-01

Pos logit contributions
 accomplishment+0.000
 understanding+0.000
 apology+0.000
iously+0.000
 efforts+0.000
Neg logit contributions
 hall-0.000
c-0.000
tub-0.000
 hunt-0.000
 outdoors-0.000
Token nurse
Token ID15849
Feature activation-2.62e+00

Pos logit contributions
 accomplishment+0.023
 understanding+0.023
 apology+0.021
 efforts+0.020
iously+0.020
Neg logit contributions
c-0.018
bowl-0.018
 hunt-0.018
tub-0.017
 met-0.017
Token with
Token ID351
Feature activation-6.35e-01

Pos logit contributions
 understanding+0.072
 accomplishment+0.071
iously+0.068
 apology+0.064
 efforts+0.063
Neg logit contributions
 hall-0.059
 forest-0.059
 woods-0.059
tub-0.058
 outdoors-0.058
Token admiration
Token ID28506
Feature activation-3.80e+00

Pos logit contributions
 understanding+0.015
 accomplishment+0.014
 apology+0.013
 efforts+0.013
iously+0.012
Neg logit contributions
 hall-0.018
c-0.018
tub-0.017
 garden-0.017
 forest-0.017
Token in
Token ID287
Feature activation-1.64e+00

Pos logit contributions
 understanding+0.113
 accomplishment+0.113
 apology+0.110
iously+0.108
mmm+0.106
Neg logit contributions
 garden-0.074
 outdoors-0.074
 hall-0.073
 forest-0.073
c-0.071
Token mail
Token ID6920
Feature activation+6.00e-02

Pos logit contributions
c+0.012
 hall+0.012
 forest+0.012
 outdoors+0.012
 hunt+0.012
Neg logit contributions
 understanding-0.023
 accomplishment-0.023
 apology-0.022
iously-0.021
 efforts-0.021
Token to
Token ID284
Feature activation+1.27e+00

Pos logit contributions
 hall+0.001
 forest+0.001
 garden+0.001
 woods+0.001
 outdoors+0.001
Neg logit contributions
 understanding-0.002
 accomplishment-0.001
 apology-0.001
iously-0.001
mmm-0.001
Token send
Token ID3758
Feature activation+4.95e-01

Pos logit contributions
 hall+0.026
 garden+0.026
 forest+0.026
 outdoors+0.026
c+0.026
Neg logit contributions
 understanding-0.037
 accomplishment-0.036
 apology-0.035
iously-0.034
mmm-0.034
Token,"
Token ID553
Feature activation-2.92e+00

Pos logit contributions
 hall+0.011
 forest+0.011
 garden+0.010
 woods+0.010
 outdoors+0.010
Neg logit contributions
 understanding-0.014
 apology-0.014
 accomplishment-0.014
iously-0.013
mmm-0.013
Token he
Token ID339
Feature activation-2.30e+00

Pos logit contributions
 accomplishment+0.092
iously+0.091
 understanding+0.091
 apology+0.088
 efforts+0.086
Neg logit contributions
 garden-0.054
 hall-0.053
 forest-0.052
 park-0.052
 pool-0.050
Token said
Token ID531
Feature activation-4.14e+00

Pos logit contributions
 accomplishment+0.077
 apology+0.071
mmm+0.071
iously+0.071
 understanding+0.071
Neg logit contributions
 met-0.040
 hall-0.039
 garden-0.039
 forest-0.038
 hunt-0.038
Token.
Token ID13
Feature activation-1.77e+00

Pos logit contributions
 accomplishment+0.145
mmm+0.144
 understanding+0.143
iously+0.139
 apology+0.138
Neg logit contributions
 outside-0.060
 outdoors-0.056
 hall-0.056
 garden-0.056
tub-0.053
Token He
Token ID679
Feature activation-2.68e-01

Pos logit contributions
 accomplishment+0.065
iously+0.064
 understanding+0.063
 apology+0.062
mmm+0.061
Neg logit contributions
 garden-0.025
 hall-0.024
 forest-0.024
 park-0.023
 woods-0.023
Token picked
Token ID6497
Feature activation+1.82e+00

Pos logit contributions
 accomplishment+0.009
 understanding+0.009
 apology+0.009
iously+0.009
mmm+0.009
Neg logit contributions
 garden-0.004
 hall-0.004
 forest-0.004
 park-0.004
 met-0.004
Token up
Token ID510
Feature activation-8.74e-01

Pos logit contributions
 hunt+0.033
 hall+0.033
 forest+0.032
 sandbox+0.031
 woods+0.031
Neg logit contributions
 accomplishment-0.058
 apology-0.058
 understanding-0.057
iously-0.054
 efforts-0.050
Token the
Token ID262
Feature activation+6.78e-01

Pos logit contributions
 understanding+0.026
 accomplishment+0.025
 apology+0.024
iously+0.023
 efforts+0.023
Neg logit contributions
 outdoors-0.018
 forest-0.018
 hunt-0.018
 hall-0.018
 garden-0.018
Tokenwayne
Token ID43932
Feature activation-3.95e+00

Pos logit contributions
 understanding+0.102
 accomplishment+0.100
iously+0.093
 apology+0.093
 efforts+0.087
Neg logit contributions
 outdoors-0.131
 garden-0.129
 forest-0.128
 hall-0.128
c-0.127
Token for
Token ID329
Feature activation-4.63e-01

Pos logit contributions
 understanding+0.106
 accomplishment+0.104
iously+0.100
 apology+0.099
 efforts+0.094
Neg logit contributions
 hall-0.092
 garden-0.091
 forest-0.091
 outdoors-0.089
 woods-0.089
Token his
Token ID465
Feature activation-2.92e+00

Pos logit contributions
iously+0.013
 accomplishment+0.012
 understanding+0.012
 apology+0.012
mmm+0.012
Neg logit contributions
 garden-0.010
 hall-0.010
 forest-0.010
 outdoors-0.010
 park-0.010
Token help
Token ID1037
Feature activation-1.99e+00

Pos logit contributions
mmm+0.080
iously+0.080
 understanding+0.079
 accomplishment+0.078
 apology+0.075
Neg logit contributions
 garden-0.062
 outdoors-0.061
 park-0.060
 pretend-0.059
 forest-0.059
Token.
Token ID13
Feature activation-5.35e-01

Pos logit contributions
 understanding+0.059
 accomplishment+0.058
 apology+0.056
iously+0.056
mmm+0.054
Neg logit contributions
 hall-0.040
 garden-0.040
 forest-0.039
 outdoors-0.039
c-0.039
Token D
Token ID360
Feature activation-3.93e+00

Pos logit contributions
 understanding+0.016
 accomplishment+0.016
 apology+0.016
iously+0.015
 efforts+0.015
Neg logit contributions
 garden-0.011
 hall-0.010
 outdoors-0.010
 forest-0.010
c-0.010
Tokenwayne
Token ID43932
Feature activation-8.03e-01

Pos logit contributions
 understanding+0.086
 accomplishment+0.084
 apology+0.080
iously+0.080
mmm+0.075
Neg logit contributions
 hall-0.111
 garden-0.111
 forest-0.111
 outdoors-0.110
c-0.109
Token was
Token ID373
Feature activation-2.23e+00

Pos logit contributions
 understanding+0.024
 accomplishment+0.024
iously+0.023
 apology+0.023
 efforts+0.022
Neg logit contributions
 hall-0.016
 garden-0.016
 forest-0.016
 outdoors-0.016
 woods-0.016
Token still
Token ID991
Feature activation-2.52e+00

Pos logit contributions
 understanding+0.058
 accomplishment+0.057
 apology+0.055
iously+0.055
mmm+0.052
Neg logit contributions
 hall-0.053
 garden-0.053
 forest-0.053
 outdoors-0.052
c-0.051
Token a
Token ID257
Feature activation-2.84e+00

Pos logit contributions
 understanding+0.068
 accomplishment+0.068
 apology+0.066
iously+0.066
mmm+0.063
Neg logit contributions
 garden-0.057
 hall-0.057
 outdoors-0.056
 forest-0.056
c-0.055
Token bit
Token ID1643
Feature activation-3.57e+00

Pos logit contributions
iously+0.077
mmm+0.075
 apology+0.074
 accomplishment+0.074
 understanding+0.074
Neg logit contributions
 garden-0.066
 hall-0.065
 forest-0.065
 park-0.064
 outdoors-0.063
Token.
Token ID13
Feature activation-9.05e-01

Pos logit contributions
 understanding+0.052
 accomplishment+0.051
 apology+0.050
iously+0.049
 efforts+0.048
Neg logit contributions
 hall-0.031
 garden-0.031
 outdoors-0.031
 forest-0.030
 woods-0.030
Token You
Token ID921
Feature activation+1.08e+00

Pos logit contributions
 accomplishment+0.025
 understanding+0.025
iously+0.024
 apology+0.023
mmm+0.023
Neg logit contributions
 hall-0.020
 garden-0.020
 forest-0.019
c-0.019
 outdoors-0.019
Token can
Token ID460
Feature activation-4.22e-01

Pos logit contributions
 hall+0.021
 garden+0.020
 woods+0.020
 outdoors+0.020
 forest+0.020
Neg logit contributions
 understanding-0.034
 accomplishment-0.032
 apology-0.032
 efforts-0.031
iously-0.031
Token still
Token ID991
Feature activation-4.20e-01

Pos logit contributions
 understanding+0.013
 accomplishment+0.012
 apology+0.012
iously+0.012
mmm+0.011
Neg logit contributions
 hall-0.009
 garden-0.009
 forest-0.008
 outdoors-0.008
tub-0.008
Token be
Token ID307
Feature activation-2.05e+00

Pos logit contributions
 understanding+0.012
 accomplishment+0.012
 apology+0.012
iously+0.011
mmm+0.011
Neg logit contributions
 hall-0.009
tub-0.009
 garden-0.009
 woods-0.009
 forest-0.009
Token my
Token ID616
Feature activation-3.49e+00

Pos logit contributions
 understanding+0.052
 accomplishment+0.049
 apology+0.047
iously+0.046
 efforts+0.044
Neg logit contributions
 hall-0.053
 garden-0.052
 forest-0.051
tub-0.050
 woods-0.050
Token friend
Token ID1545
Feature activation-2.92e+00

Pos logit contributions
 understanding+0.086
 accomplishment+0.085
 apology+0.081
iously+0.080
 efforts+0.076
Neg logit contributions
 hall-0.089
 garden-0.088
 forest-0.087
c-0.087
 outdoors-0.087
Token."
Token ID526
Feature activation-8.60e-01

Pos logit contributions
 understanding+0.081
 accomplishment+0.079
 apology+0.077
iously+0.076
mmm+0.073
Neg logit contributions
 hall-0.064
 garden-0.064
 forest-0.063
 outdoors-0.063
 woods-0.062
Token And
Token ID843
Feature activation-4.18e+00

Pos logit contributions
 understanding+0.027
 accomplishment+0.027
iously+0.026
 apology+0.026
 efforts+0.025
Neg logit contributions
 garden-0.016
 hall-0.016
 forest-0.016
 park-0.016
 pool-0.015
Token Tim
Token ID5045
Feature activation-1.36e+00

Pos logit contributions
 Deal+0.131
 apology+0.130
iously+0.129
 accomplishment+0.129
oral+0.127
Neg logit contributions
 outside-0.074
 garden-0.073
 pretend-0.071
 hall-0.071
 met-0.070
Token felt
Token ID2936
Feature activation-2.59e+00

Pos logit contributions
 understanding+0.040
 accomplishment+0.040
 apology+0.039
iously+0.038
mmm+0.038
Neg logit contributions
 garden-0.027
 forest-0.027
 hall-0.027
 park-0.027
c-0.027
TokenBen
Token ID11696
Feature activation-2.09e+00

Pos logit contributions
 accomplishment+0.067
 understanding+0.065
iously+0.065
 apology+0.063
 efforts+0.061
Neg logit contributions
 garden-0.053
 hall-0.053
 forest-0.052
 outdoors-0.052
 park-0.052
Token felt
Token ID2936
Feature activation-2.88e+00

Pos logit contributions
 understanding+0.060
 accomplishment+0.059
iously+0.057
 apology+0.057
mmm+0.054
Neg logit contributions
 hall-0.044
 garden-0.044
 outdoors-0.044
 forest-0.044
c-0.043
Token more
Token ID517
Feature activation-3.73e+00

Pos logit contributions
 understanding+0.063
 accomplishment+0.060
 apology+0.058
iously+0.056
mmm+0.053
Neg logit contributions
 hall-0.083
 garden-0.082
 forest-0.082
 outdoors-0.081
c-0.081
Token scared
Token ID12008
Feature activation-3.66e+00

Pos logit contributions
 accomplishment+0.089
 understanding+0.088
iously+0.086
 apology+0.085
mmm+0.082
Neg logit contributions
 garden-0.097
 outdoors-0.096
 hall-0.095
 forest-0.095
 park-0.093
Token and
Token ID290
Feature activation-4.42e+00

Pos logit contributions
 accomplishment+0.118
 understanding+0.116
iously+0.113
 apology+0.113
mmm+0.111
Neg logit contributions
 garden-0.065
 forest-0.065
 hall-0.065
 outdoors-0.064
 outside-0.063
Token ashamed
Token ID22461
Feature activation-4.21e+00

Pos logit contributions
 understanding+0.089
 accomplishment+0.088
iously+0.084
 apology+0.083
mmm+0.079
Neg logit contributions
 garden-0.131
 hall-0.131
 forest-0.130
 outdoors-0.129
c-0.127
Token.
Token ID13
Feature activation-6.42e-01

Pos logit contributions
 accomplishment+0.163
mmm+0.160
iously+0.156
 understanding+0.154
 Deal+0.152
Neg logit contributions
 outside-0.051
 garden-0.045
 forest-0.044
 park-0.043
 pretend-0.043
Token He
Token ID679
Feature activation-8.47e-01

Pos logit contributions
 understanding+0.019
 accomplishment+0.018
 apology+0.018
iously+0.018
mmm+0.017
Neg logit contributions
 hall-0.013
 garden-0.013
 outdoors-0.013
 forest-0.013
c-0.013
Token felt
Token ID2936
Feature activation-2.40e+00

Pos logit contributions
 accomplishment+0.028
 understanding+0.027
iously+0.026
 apology+0.026
mmm+0.026
Neg logit contributions
 garden-0.015
 forest-0.015
 hall-0.015
 park-0.014
 met-0.014
Token alone
Token ID3436
Feature activation-1.58e+00

Pos logit contributions
 understanding+0.045
 accomplishment+0.042
 apology+0.041
iously+0.039
 efforts+0.036
Neg logit contributions
 hall-0.077
 garden-0.076
 forest-0.076
c-0.076
 hunt-0.075
Token and
Token ID290
Feature activation-2.80e+00

Pos logit contributions
 understanding+0.048
 accomplishment+0.047
 apology+0.045
iously+0.045
mmm+0.043
Neg logit contributions
 hall-0.031
 garden-0.031
 forest-0.031
 outdoors-0.031
c-0.030
Token You
Token ID921
Feature activation+2.56e-01

Pos logit contributions
 accomplishment+0.043
 understanding+0.042
iously+0.041
 apology+0.040
 efforts+0.039
Neg logit contributions
 garden-0.033
 hall-0.032
c-0.032
 forest-0.032
 outdoors-0.032
Token're
Token ID821
Feature activation-2.49e+00

Pos logit contributions
 hall+0.004
 forest+0.004
 garden+0.004
 woods+0.004
 outdoors+0.004
Neg logit contributions
 understanding-0.009
 accomplishment-0.008
 apology-0.008
iously-0.008
mmm-0.008
Token such
Token ID884
Feature activation-2.30e+00

Pos logit contributions
 understanding+0.069
 accomplishment+0.069
iously+0.067
 apology+0.067
mmm+0.064
Neg logit contributions
 garden-0.055
 hall-0.054
 forest-0.054
 outdoors-0.054
 park-0.053
Token a
Token ID257
Feature activation-3.39e+00

Pos logit contributions
iously+0.087
 accomplishment+0.085
 apology+0.084
 understanding+0.084
mmm+0.082
Neg logit contributions
 garden-0.028
 outdoors-0.028
 forest-0.025
 outside-0.025
 pretend-0.025
Token talented
Token ID12356
Feature activation-3.05e+00

Pos logit contributions
iously+0.103
mmm+0.093
 apology+0.091
 accomplishment+0.090
 Deal+0.086
Neg logit contributions
 garden-0.079
 pretend-0.075
 park-0.074
 forest-0.074
 outdoors-0.074
Token artist
Token ID6802
Feature activation-4.07e+00

Pos logit contributions
 understanding+0.086
 accomplishment+0.085
iously+0.082
 apology+0.082
mmm+0.079
Neg logit contributions
 garden-0.066
 hall-0.066
 forest-0.065
 outdoors-0.065
c-0.063
Token!"
Token ID2474
Feature activation-1.07e+00

Pos logit contributions
 understanding+0.113
 accomplishment+0.111
 apology+0.106
iously+0.105
mmm+0.101
Neg logit contributions
 garden-0.090
 hall-0.090
 forest-0.089
 outdoors-0.088
 woods-0.087
Token<|endoftext|>
Token ID50256
Feature activation+3.94e+00

Pos logit contributions
 accomplishment+0.040
 understanding+0.039
iously+0.039
 apology+0.038
 efforts+0.038
Neg logit contributions
 garden-0.014
c-0.014
 hall-0.014
 forest-0.013
 outdoors-0.013
TokenOnce
Token ID7454
Feature activation+2.26e+00

Pos logit contributions
 sandbox+0.101
 hunt+0.097
 paints+0.097
 pots+0.096
 pretend+0.096
Neg logit contributions
 understanding-0.077
oral-0.075
mmm-0.074
Less-0.074
From-0.074
Token upon
Token ID2402
Feature activation+3.17e+00

Pos logit contributions
 hall+0.031
 garden+0.031
 forest+0.030
 outdoors+0.030
c+0.029
Neg logit contributions
 understanding-0.082
 accomplishment-0.081
iously-0.079
 apology-0.079
mmm-0.076
Token a
Token ID257
Feature activation-5.44e-01

Pos logit contributions
 hall+0.069
 forest+0.067
c+0.067
tub+0.067
 paints+0.066
Neg logit contributions
 understanding-0.092
 accomplishment-0.079
 apology-0.078
From-0.076
mmm-0.075