TOP ACTIVATIONS
MAX = 8.478

. It said " DO NOT T OU CH ". But
to be ' C omet Expl orers '. "
Lucy suddenly heard another " SC REAM ". This time ,
Lucy heard a loud " SC REAM "! She felt a
button that said " DO NOT PRESS ." She couldn 't
big bird sitting on the pr une tree . The bird
is a seed in the pr une . It bit me
's go home now . Expl oring can be adventurous ,
the air , leaving a spir aling trail of black ink
and then a loud " CR ASH !" echoed around the
reached up , grabbed the pend ulum , and then slowly
that said " No T resp assing ". "
to the edge of the tr amp oline and took a
his elephant the name " BI GG EST ". He was
Tommy quietly stepped off the tr amp oline and looked around
. She stepped onto the tr icycle â s
out , you little r asc als ," the driver said
It was hiding under a seaw eed . Lily said ,
She walked through the tall st alks of corn and discovered
that said " No T resp assing ". And behind the

MIN ACTIVATIONS
MIN = -6.066

wrong . Tim my told her that the w affle was
going on . Tim told her about the talking cat and
meet new friends and teach them what they knew .
wrong . Tim my told her about his computer trouble .
her M ummy for reminding her . Then , she jumped
and asked grandma to tell her some stories . Grand ma
ask her mom to tell her a bed time story .
wrong . Tim my told her about his broken car .
was wrong . Lily told her that she wanted to bring
was wrong . Lily told her about the butterfly and her
thanked her mom for telling her the truth . From that
what happened . Lily told her that she broke her triangle
to their mom and told her what they did .
wrong . Tim my told her he couldn 't find his
wrong . Tim my told her about the broken whistle .
was wrong . Lily told her that she couldn 't find
was wrong . Tim told her that he did not understand
her mom my and told her what happened . " Mom
to her mom and told her what happened . Her mom
wrong . Tim my told her about his bad dream and

INTERVAL 4.842 - 8.478
CONTAINS 0.093%

airplane that flew high in the sky . The airplane was
must never go near the chim ney ! Do you understand
" Hey , stop ! That 's our snow man
stars shining in the in ky sky . He turned to
, the lion and mouse stepped out of the maze and

INTERVAL 1.206 - 4.842
CONTAINS 18.791%

a bath ," Lily said , trying to resist .
. The wind was still blowing all around the
have one . " But Dane ," Mom my
while , the noise stopped and Josh stopped being scared too
day , something special happened . In the cave , the

INTERVAL -2.430 - 1.206
CONTAINS 77.407%

fast and that he should never do something dangerous like run
. He loved to swim and explore the ocean . One
good time . Suddenly , Tim threw his ball
look , it was actually a fancy wrap that was made
life to the tree . Its branches were soon filled with

INTERVAL -6.066 - -2.430
CONTAINS 3.710%

and started to bark . Her friend smiled and said ,
was happy and thanked Max for his help . They became
She wanted to cry but remembered how tough she was .
at the clouds and asked lots of questions . He also
?" L izzy thought about what the butterfly said
Token.
Token ID13
Feature activation+9.96e-01

Pos logit contributions
 valleys+0.028
ASH+0.027
 oceans+0.026
 rivers+0.026
 hills+0.026
Neg logit contributions
 important-0.010
ggy-0.009
 lesson-0.008
 caring-0.008
 about-0.008
Token It
Token ID632
Feature activation+3.09e-01

Pos logit contributions
 valleys+0.032
 oceans+0.031
 hills+0.031
 rivers+0.031
ASH+0.030
Neg logit contributions
 important-0.019
 lesson-0.019
 caring-0.019
ggy-0.018
 valuable-0.017
Token said
Token ID531
Feature activation-3.68e-01

Pos logit contributions
 valleys+0.010
 hills+0.010
 rivers+0.010
 oceans+0.010
 mountains+0.010
Neg logit contributions
 caring-0.006
 important-0.005
 lesson-0.005
ggy-0.005
 forgiven-0.005
Token "
Token ID366
Feature activation+3.65e+00

Pos logit contributions
 caring+0.007
 important+0.007
 lesson+0.006
ggy+0.006
 forgiven+0.006
Neg logit contributions
 valleys-0.012
 hills-0.012
 rivers-0.012
 oceans-0.012
 mountains-0.011
TokenDO
Token ID18227
Feature activation+3.58e+00

Pos logit contributions
 valleys+0.108
 hills+0.102
 oceans+0.100
 mountains+0.100
 rivers+0.100
Neg logit contributions
 caring-0.075
 lesson-0.074
 important-0.074
ggy-0.071
 understand-0.070
Token NOT
Token ID5626
Feature activation+8.48e+00

Pos logit contributions
 valleys+0.113
 hills+0.109
 oceans+0.107
 rivers+0.107
ASH+0.104
Neg logit contributions
 important-0.076
 caring-0.072
ggy-0.072
 lesson-0.070
 forgiven-0.067
Token T
Token ID309
Feature activation+3.08e+00

Pos logit contributions
 valleys+0.221
 oceans+0.202
 hills+0.200
 lakes+0.198
 rivers+0.197
Neg logit contributions
 important-0.201
 caring-0.197
 bad-0.191
 valuable-0.189
 lesson-0.186
TokenOU
Token ID2606
Feature activation+5.53e+00

Pos logit contributions
 valleys+0.098
 hills+0.092
 rivers+0.091
 oceans+0.090
 lakes+0.087
Neg logit contributions
ggy-0.069
 important-0.067
 lesson-0.065
 caring-0.064
ggie-0.063
TokenCH
Token ID3398
Feature activation+3.29e+00

Pos logit contributions
 valleys+0.155
 oceans+0.146
 rivers+0.144
 hills+0.143
 mountains+0.137
Neg logit contributions
ggy-0.138
 caring-0.131
ggie-0.128
 important-0.128
 lesson-0.127
Token".
Token ID1911
Feature activation+2.87e-01

Pos logit contributions
 valleys+0.095
 oceans+0.089
 hills+0.088
 rivers+0.088
 lakes+0.084
Neg logit contributions
ggy-0.083
ggie-0.077
 important-0.077
 caring-0.075
 lesson-0.075
Token But
Token ID887
Feature activation-1.20e+00

Pos logit contributions
 valleys+0.009
 hills+0.009
 rivers+0.008
 oceans+0.008
ASH+0.008
Neg logit contributions
 caring-0.006
 important-0.006
ggy-0.006
 lesson-0.006
 forgiven-0.005
Token to
Token ID284
Feature activation+9.38e-01

Pos logit contributions
 caring+0.034
 important+0.034
 lesson+0.033
ggy+0.033
 forgiven+0.031
Neg logit contributions
 valleys-0.058
 hills-0.056
 oceans-0.055
 rivers-0.055
ASH-0.053
Token be
Token ID307
Feature activation+5.51e-01

Pos logit contributions
 valleys+0.033
 oceans+0.031
ting+0.031
 hills+0.031
ays+0.031
Neg logit contributions
ggy-0.016
 caring-0.015
 not-0.015
 important-0.015
 understand-0.015
Token '
Token ID705
Feature activation+2.47e+00

Pos logit contributions
 valleys+0.018
 hills+0.018
ASH+0.018
 oceans+0.017
 railing+0.017
Neg logit contributions
 important-0.010
 caring-0.010
ggy-0.010
 lesson-0.009
 forgiven-0.009
TokenC
Token ID34
Feature activation+3.06e+00

Pos logit contributions
 valleys+0.067
 oceans+0.064
 hills+0.063
 rivers+0.063
 railing+0.062
Neg logit contributions
 important-0.058
 caring-0.057
ggy-0.056
 lesson-0.055
 forgiven-0.054
Tokenomet
Token ID908
Feature activation+1.82e+00

Pos logit contributions
 valleys+0.081
 railing+0.077
 hills+0.077
 rivers+0.074
 oceans+0.074
Neg logit contributions
ggy-0.076
 important-0.075
 caring-0.075
ggie-0.073
live-0.069
Token Expl
Token ID5905
Feature activation+8.37e+00

Pos logit contributions
 valleys+0.057
 hills+0.054
 rivers+0.053
ASH+0.052
 oceans+0.052
Neg logit contributions
 caring-0.039
 important-0.039
ggy-0.039
 lesson-0.037
ggie-0.036
Tokenorers
Token ID28089
Feature activation+2.58e+00

Pos logit contributions
 valleys+0.209
 rivers+0.200
 hills+0.197
 lakes+0.197
 streams+0.190
Neg logit contributions
 important-0.193
 forgiven-0.188
 valuable-0.186
 lesson-0.183
:-0.182
Token'.
Token ID4458
Feature activation+3.97e-01

Pos logit contributions
 valleys+0.079
 hills+0.073
 rivers+0.072
 oceans+0.071
ASH+0.071
Neg logit contributions
 important-0.057
ggy-0.056
 caring-0.056
ggie-0.055
 lesson-0.055
Token
Token ID198
Feature activation-1.76e+00

Pos logit contributions
 valleys+0.014
 hills+0.014
 oceans+0.014
 rivers+0.014
 lakes+0.013
Neg logit contributions
 caring-0.006
 important-0.006
ggy-0.006
 lesson-0.006
 forgiven-0.006
Token
Token ID198
Feature activation-5.35e-01

Pos logit contributions
 important+0.031
 caring+0.031
 lesson+0.029
ggy+0.029
 forgiven+0.028
Neg logit contributions
 valleys-0.060
 hills-0.059
 rivers-0.057
 oceans-0.057
 mountains-0.055
Token"
Token ID1
Feature activation+2.67e+00

Pos logit contributions
 caring+0.008
 important+0.008
ggy+0.008
 lesson+0.008
 forgiven+0.007
Neg logit contributions
 valleys-0.020
 hills-0.019
 oceans-0.019
 rivers-0.019
 mountains-0.018
Token Lucy
Token ID22162
Feature activation+1.80e-01

Pos logit contributions
 caring+0.001
 important+0.001
 lesson+0.001
ggy+0.001
 forgiven+0.001
Neg logit contributions
 valleys-0.003
 hills-0.003
 oceans-0.003
 rivers-0.003
ASH-0.003
Token suddenly
Token ID6451
Feature activation+1.61e+00

Pos logit contributions
 valleys+0.007
 oceans+0.006
 hills+0.006
 rivers+0.006
 lakes+0.006
Neg logit contributions
ggy-0.003
 understood-0.003
 caring-0.003
 important-0.003
 not-0.002
Token heard
Token ID2982
Feature activation+1.13e+00

Pos logit contributions
 valleys+0.057
ASH+0.056
 hills+0.056
 oceans+0.055
 rivers+0.054
Neg logit contributions
 important-0.024
ggy-0.024
 caring-0.023
 understand-0.023
 forgiven-0.022
Token another
Token ID1194
Feature activation-6.74e-01

Pos logit contributions
 valleys+0.042
ASH+0.041
 hills+0.040
 oceans+0.040
 rivers+0.039
Neg logit contributions
 important-0.016
 caring-0.016
 lesson-0.015
ggy-0.015
 wonderful-0.014
Token "
Token ID366
Feature activation+3.95e+00

Pos logit contributions
 caring+0.016
 important+0.016
ggy+0.016
 lesson+0.016
 understand+0.015
Neg logit contributions
 valleys-0.019
 hills-0.018
 oceans-0.018
 rivers-0.018
ASH-0.018
TokenSC
Token ID6173
Feature activation+8.23e+00

Pos logit contributions
 valleys+0.111
 hills+0.109
 oceans+0.107
 mountains+0.106
 rivers+0.103
Neg logit contributions
ggy-0.083
 lesson-0.083
 caring-0.082
 understand-0.082
 important-0.082
TokenREAM
Token ID32235
Feature activation+5.69e+00

Pos logit contributions
 valleys+0.170
 hills+0.153
 oceans+0.149
 mountains+0.146
 lakes+0.144
Neg logit contributions
 forgiven-0.224
 caring-0.222
 important-0.213
ggy-0.206
isi-0.200
Token".
Token ID1911
Feature activation+1.80e-01

Pos logit contributions
 valleys+0.172
 oceans+0.160
 hills+0.159
 rivers+0.157
ASH+0.156
Neg logit contributions
ggy-0.132
ggie-0.122
 important-0.118
 lesson-0.111
 bad-0.109
Token This
Token ID770
Feature activation-8.62e-01

Pos logit contributions
 valleys+0.006
 oceans+0.006
 rivers+0.006
 hills+0.006
 lakes+0.006
Neg logit contributions
 important-0.003
 caring-0.003
ggy-0.003
 lesson-0.003
 forgiven-0.003
Token time
Token ID640
Feature activation-1.44e+00

Pos logit contributions
 caring+0.017
 important+0.016
ggy+0.016
 lesson+0.015
 forgiven+0.015
Neg logit contributions
 valleys-0.028
 hills-0.027
 rivers-0.026
 oceans-0.026
 mountains-0.026
Token,
Token ID11
Feature activation-5.44e-01

Pos logit contributions
 important+0.024
 caring+0.024
 lesson+0.023
ggy+0.023
 forgiven+0.021
Neg logit contributions
 valleys-0.051
 hills-0.049
 oceans-0.049
 rivers-0.048
ASH-0.048
Token Lucy
Token ID22162
Feature activation+1.16e+00

Pos logit contributions
 valleys+0.031
 hills+0.030
 oceans+0.030
 rivers+0.030
ASH+0.030
Neg logit contributions
 important-0.014
 caring-0.014
ggy-0.014
 lesson-0.014
 forgiven-0.012
Token heard
Token ID2982
Feature activation+9.61e-01

Pos logit contributions
 valleys+0.039
 oceans+0.038
 hills+0.037
 rivers+0.037
 lakes+0.037
Neg logit contributions
ggy-0.020
 important-0.018
ggie-0.017
 forgiven-0.017
 not-0.017
Token a
Token ID257
Feature activation+8.16e-01

Pos logit contributions
 valleys+0.033
 hills+0.033
 oceans+0.032
 rivers+0.032
ASH+0.031
Neg logit contributions
 caring-0.016
 important-0.016
 lesson-0.015
ggy-0.015
 forgiven-0.014
Token loud
Token ID7812
Feature activation+3.06e+00

Pos logit contributions
 valleys+0.029
 hills+0.027
 oceans+0.027
ASH+0.027
 rivers+0.027
Neg logit contributions
 caring-0.015
 important-0.014
ggy-0.014
 lesson-0.013
 bad-0.013
Token "
Token ID366
Feature activation+4.31e+00

Pos logit contributions
 valleys+0.107
 oceans+0.101
 hills+0.099
 rivers+0.098
ASH+0.097
Neg logit contributions
ggy-0.059
 caring-0.058
 bad-0.056
 important-0.053
 lesson-0.052
TokenSC
Token ID6173
Feature activation+8.19e+00

Pos logit contributions
 valleys+0.118
 hills+0.113
 oceans+0.113
 rivers+0.111
 mountains+0.109
Neg logit contributions
ggy-0.100
 lesson-0.099
 caring-0.099
 important-0.099
 forgiven-0.093
TokenREAM
Token ID32235
Feature activation+5.27e+00

Pos logit contributions
 valleys+0.169
 hills+0.148
 oceans+0.147
 rivers+0.142
 lakes+0.141
Neg logit contributions
 caring-0.229
 forgiven-0.227
ggy-0.221
 important-0.220
isi-0.217
Token"!
Token ID40484
Feature activation+1.07e-01

Pos logit contributions
 valleys+0.157
 oceans+0.147
 hills+0.144
 rivers+0.144
 lakes+0.141
Neg logit contributions
ggy-0.128
ggie-0.117
 important-0.116
 lesson-0.111
 caring-0.110
Token She
Token ID1375
Feature activation+3.83e-01

Pos logit contributions
 valleys+0.004
 oceans+0.004
 hills+0.004
 rivers+0.003
 lakes+0.003
Neg logit contributions
 important-0.002
 caring-0.002
ggy-0.002
 lesson-0.002
 forgiven-0.002
Token felt
Token ID2936
Feature activation+1.64e+00

Pos logit contributions
 valleys+0.012
 hills+0.012
 oceans+0.011
 rivers+0.011
ASH+0.011
Neg logit contributions
 caring-0.008
 important-0.008
ggy-0.008
 lesson-0.007
 forgiven-0.007
Token a
Token ID257
Feature activation+1.16e+00

Pos logit contributions
 valleys+0.047
 hills+0.046
 rivers+0.045
 oceans+0.044
 mountains+0.043
Neg logit contributions
 caring-0.037
 important-0.036
 lesson-0.036
ggy-0.036
 forgiven-0.034
Token button
Token ID4936
Feature activation+9.49e-01

Pos logit contributions
 valleys+0.053
 oceans+0.047
 rivers+0.047
 lakes+0.044
 hills+0.044
Neg logit contributions
 lesson-0.032
 important-0.030
ggy-0.030
 caring-0.029
 bad-0.029
Token that
Token ID326
Feature activation-7.08e-02

Pos logit contributions
 valleys+0.035
 oceans+0.033
 rivers+0.032
 streams+0.032
ASH+0.032
Neg logit contributions
ggy-0.016
 important-0.016
 about-0.016
 not-0.015
ggie-0.014
Token said
Token ID531
Feature activation-6.70e-01

Pos logit contributions
 caring+0.001
 important+0.001
 lesson+0.001
ggy+0.001
 forgiven+0.001
Neg logit contributions
 valleys-0.002
 hills-0.002
 oceans-0.002
 rivers-0.002
ASH-0.002
Token "
Token ID366
Feature activation+3.17e+00

Pos logit contributions
 caring+0.013
 important+0.013
 lesson+0.013
ggy+0.012
 forgiven+0.012
Neg logit contributions
 valleys-0.021
 hills-0.021
 rivers-0.020
 oceans-0.020
 mountains-0.020
TokenDO
Token ID18227
Feature activation+3.50e+00

Pos logit contributions
 valleys+0.098
 hills+0.092
 rivers+0.091
 oceans+0.091
 mountains+0.091
Neg logit contributions
 caring-0.065
 important-0.064
 lesson-0.064
ggy-0.061
isi-0.060
Token NOT
Token ID5626
Feature activation+8.18e+00

Pos logit contributions
 valleys+0.115
 hills+0.110
 oceans+0.109
 rivers+0.108
 lakes+0.105
Neg logit contributions
 important-0.073
ggy-0.070
 caring-0.069
 lesson-0.066
 forgiven-0.064
Token PRESS
Token ID44310
Feature activation+2.56e+00

Pos logit contributions
 valleys+0.227
 oceans+0.209
 hills+0.206
 lakes+0.205
 rivers+0.205
Neg logit contributions
 important-0.183
 caring-0.181
 bad-0.178
 valuable-0.173
ggy-0.170
Token."
Token ID526
Feature activation+9.57e-01

Pos logit contributions
 valleys+0.077
 oceans+0.072
 rivers+0.070
 lakes+0.070
 hills+0.070
Neg logit contributions
 important-0.058
ggy-0.058
 lesson-0.056
ggie-0.054
 forgiven-0.053
Token She
Token ID1375
Feature activation+8.20e-02

Pos logit contributions
 valleys+0.034
 oceans+0.032
 rivers+0.032
 lakes+0.031
 hills+0.031
Neg logit contributions
 important-0.016
 lesson-0.016
 caring-0.015
 forgiven-0.015
ggy-0.015
Token couldn
Token ID3521
Feature activation-1.90e-01

Pos logit contributions
 valleys+0.002
 hills+0.002
 rivers+0.002
 oceans+0.002
ASH+0.002
Neg logit contributions
 caring-0.002
 important-0.002
 lesson-0.002
ggy-0.002
 forgiven-0.002
Token't
Token ID470
Feature activation-1.37e+00

Pos logit contributions
ggy+0.003
 lesson+0.003
 important+0.003
 caring+0.003
ggie+0.003
Neg logit contributions
 valleys-0.007
 oceans-0.007
 rivers-0.007
 hills-0.007
 lakes-0.007
Token big
Token ID1263
Feature activation+1.65e+00

Pos logit contributions
 caring+0.001
 important+0.001
 lesson+0.001
ggy+0.001
 forgiven+0.001
Neg logit contributions
 valleys-0.001
 hills-0.001
 oceans-0.001
 rivers-0.001
ASH-0.001
Token bird
Token ID6512
Feature activation+1.96e+00

Pos logit contributions
 valleys+0.056
ASH+0.051
 rivers+0.051
 oceans+0.051
 hills+0.050
Neg logit contributions
 important-0.033
 bad-0.033
 caring-0.033
 lesson-0.030
 kind-0.029
Token sitting
Token ID5586
Feature activation+2.98e+00

Pos logit contributions
 valleys+0.063
 hills+0.059
 oceans+0.059
ASH+0.059
 rivers+0.058
Neg logit contributions
ggy-0.042
 important-0.040
 caring-0.040
 not-0.037
 bad-0.036
Token on
Token ID319
Feature activation+2.77e+00

Pos logit contributions
 valleys+0.086
 oceans+0.081
 hills+0.081
 rivers+0.081
 lakes+0.079
Neg logit contributions
 important-0.071
 caring-0.067
 lesson-0.066
ggy-0.064
 forgiven-0.063
Token the
Token ID262
Feature activation+4.12e+00

Pos logit contributions
 valleys+0.107
 oceans+0.102
 hills+0.101
 rivers+0.101
 railing+0.099
Neg logit contributions
 caring-0.034
 lesson-0.034
 important-0.033
ggy-0.033
 bad-0.030
Token pr
Token ID778
Feature activation+7.82e+00

Pos logit contributions
 valleys+0.082
 oceans+0.074
 rivers+0.072
 hills+0.072
 lakes+0.070
Neg logit contributions
 important-0.133
 lesson-0.131
ggy-0.130
 caring-0.129
 bad-0.124
Tokenune
Token ID1726
Feature activation+3.44e+00

Pos logit contributions
 valleys+0.250
 oceans+0.241
 rivers+0.235
 hills+0.235
 lakes+0.229
Neg logit contributions
ggy-0.157
ggie-0.139
 important-0.138
 lesson-0.137
isi-0.137
Token tree
Token ID5509
Feature activation+2.41e+00

Pos logit contributions
 valleys+0.105
 oceans+0.100
ASH+0.098
 rivers+0.097
 lakes+0.096
Neg logit contributions
 important-0.072
 lesson-0.072
ggy-0.070
 bad-0.069
 not-0.068
Token.
Token ID13
Feature activation+1.78e+00

Pos logit contributions
 valleys+0.096
 oceans+0.092
ASH+0.091
 rivers+0.089
 lakes+0.089
Neg logit contributions
 important-0.030
 not-0.029
 about-0.027
ggy-0.026
 caring-0.025
Token The
Token ID383
Feature activation+2.69e-01

Pos logit contributions
 valleys+0.059
 oceans+0.056
 rivers+0.055
 hills+0.055
 streams+0.054
Neg logit contributions
 important-0.034
 caring-0.033
 lesson-0.032
ggy-0.031
 forgiven-0.030
Token bird
Token ID6512
Feature activation+1.36e+00

Pos logit contributions
 hills+0.006
 valleys+0.006
 rivers+0.006
 oceans+0.006
 mountains+0.005
Neg logit contributions
 caring-0.008
ggy-0.008
 important-0.008
 forgiven-0.007
ggie-0.007
Token is
Token ID318
Feature activation+2.22e+00

Pos logit contributions
 valleys+0.022
 hills+0.021
 oceans+0.021
 rivers+0.020
 mountains+0.020
Neg logit contributions
 important-0.017
 caring-0.017
 lesson-0.017
ggy-0.016
 forgiven-0.016
Token a
Token ID257
Feature activation+1.47e+00

Pos logit contributions
 valleys+0.084
 oceans+0.081
 hills+0.080
 rivers+0.080
ASH+0.078
Neg logit contributions
 important-0.032
 caring-0.031
 lesson-0.030
 bad-0.030
 not-0.030
Token seed
Token ID9403
Feature activation+2.50e+00

Pos logit contributions
 valleys+0.050
 hills+0.048
 oceans+0.048
 rivers+0.047
ASH+0.046
Neg logit contributions
 important-0.028
 caring-0.028
 lesson-0.027
ggy-0.026
 bad-0.026
Token in
Token ID287
Feature activation+2.46e+00

Pos logit contributions
 valleys+0.087
 oceans+0.083
ASH+0.082
 hills+0.082
 rivers+0.081
Neg logit contributions
 bad-0.048
 important-0.047
 not-0.043
ggy-0.042
 caring-0.040
Token the
Token ID262
Feature activation+4.24e+00

Pos logit contributions
 valleys+0.085
 hills+0.083
 oceans+0.082
 rivers+0.081
ASH+0.079
Neg logit contributions
 important-0.040
 caring-0.040
 lesson-0.038
ggy-0.037
 bad-0.036
Token pr
Token ID778
Feature activation+7.65e+00

Pos logit contributions
 valleys+0.099
 oceans+0.094
 hills+0.092
ays+0.090
 rivers+0.090
Neg logit contributions
 important-0.119
 bad-0.116
 lesson-0.113
 caring-0.109
 valuable-0.106
Tokenune
Token ID1726
Feature activation+3.48e+00

Pos logit contributions
 valleys+0.278
 oceans+0.275
 hills+0.267
 rivers+0.264
 lakes+0.261
Neg logit contributions
ggy-0.104
 important-0.095
 forgiven-0.092
 lesson-0.091
isi-0.090
Token.
Token ID13
Feature activation+3.90e+00

Pos logit contributions
 valleys+0.132
 oceans+0.130
ASH+0.125
 rivers+0.125
 lakes+0.123
Neg logit contributions
 bad-0.051
 important-0.049
ggy-0.046
 not-0.044
 that-0.044
Token It
Token ID632
Feature activation+1.85e+00

Pos logit contributions
 valleys+0.115
 hills+0.111
 oceans+0.110
 rivers+0.109
 mountains+0.104
Neg logit contributions
 caring-0.090
 important-0.087
 lesson-0.084
ggy-0.084
 forgiven-0.082
Token bit
Token ID1643
Feature activation+1.89e+00

Pos logit contributions
 valleys+0.058
 hills+0.056
 oceans+0.055
 rivers+0.055
 lakes+0.053
Neg logit contributions
 important-0.041
 caring-0.039
 bad-0.038
 lesson-0.038
 not-0.037
Token me
Token ID502
Feature activation+4.26e-01

Pos logit contributions
 valleys+0.049
 oceans+0.045
 hills+0.044
 rivers+0.044
ASH+0.044
Neg logit contributions
 important-0.051
 caring-0.049
 bad-0.049
 lesson-0.048
 not-0.045
Token's
Token ID338
Feature activation-5.49e-01

Pos logit contributions
 valleys+0.010
 hills+0.010
ASH+0.009
 rivers+0.009
 oceans+0.009
Neg logit contributions
 caring-0.012
 important-0.012
 lesson-0.012
ggy-0.012
 valuable-0.011
Token go
Token ID467
Feature activation+1.28e+00

Pos logit contributions
ggy+0.010
 important+0.010
 caring+0.010
 understand+0.010
 lesson+0.010
Neg logit contributions
 valleys-0.019
 hills-0.018
 oceans-0.018
 rivers-0.018
ASH-0.017
Token home
Token ID1363
Feature activation-4.85e-01

Pos logit contributions
 valleys+0.046
 oceans+0.044
ASH+0.043
 rivers+0.043
 hills+0.042
Neg logit contributions
ggy-0.021
 forgive-0.021
 understand-0.020
 important-0.020
 lesson-0.019
Token now
Token ID783
Feature activation-1.80e-01

Pos logit contributions
 important+0.009
 not+0.009
 caring+0.009
 lesson+0.008
ggy+0.008
Neg logit contributions
 valleys-0.017
 oceans-0.017
 rivers-0.016
 hills-0.016
 lakes-0.016
Token.
Token ID13
Feature activation+5.02e-01

Pos logit contributions
 important+0.003
ggy+0.003
 lesson+0.003
 caring+0.003
 not+0.003
Neg logit contributions
 valleys-0.007
 oceans-0.006
 hills-0.006
 rivers-0.006
 mountains-0.006
Token Expl
Token ID5905
Feature activation+7.58e+00

Pos logit contributions
 valleys+0.016
 hills+0.015
 oceans+0.015
 rivers+0.015
 mountains+0.014
Neg logit contributions
 caring-0.011
 important-0.010
 lesson-0.010
ggy-0.010
 forgiven-0.010
Tokenoring
Token ID3255
Feature activation-7.62e-02

Pos logit contributions
 rivers+0.182
 valleys+0.182
 lakes+0.178
 oceans+0.176
 hills+0.175
Neg logit contributions
 important-0.182
 valuable-0.175
 forgiven-0.175
 resolved-0.173
 lesson-0.172
Token can
Token ID460
Feature activation+2.22e-01

Pos logit contributions
 important+0.002
 caring+0.002
 lesson+0.002
ggy+0.001
 valuable+0.001
Neg logit contributions
 valleys-0.002
 hills-0.002
 oceans-0.002
 rivers-0.002
ASH-0.002
Token be
Token ID307
Feature activation+8.88e-01

Pos logit contributions
 valleys+0.008
 oceans+0.008
 hills+0.007
ASH+0.007
 rivers+0.007
Neg logit contributions
 important-0.004
 caring-0.004
ggy-0.004
 lesson-0.004
 understand-0.004
Token adventurous
Token ID39770
Feature activation-3.85e-01

Pos logit contributions
 valleys+0.032
 hills+0.032
 oceans+0.031
 railing+0.031
ASH+0.031
Neg logit contributions
 important-0.015
 caring-0.014
ggy-0.013
 forgiven-0.013
 lesson-0.013
Token,
Token ID11
Feature activation+2.88e-01

Pos logit contributions
 important+0.008
 caring+0.008
 lesson+0.007
 wonderful+0.007
 valuable+0.007
Neg logit contributions
 valleys-0.012
ASH-0.012
 hills-0.012
ACK-0.012
 oceans-0.011
Token the
Token ID262
Feature activation+3.61e+00

Pos logit contributions
 valleys+0.043
 hills+0.042
 oceans+0.041
 rivers+0.041
ASH+0.040
Neg logit contributions
 caring-0.025
 important-0.024
 lesson-0.024
ggy-0.023
 forgiven-0.021
Token air
Token ID1633
Feature activation+9.75e-01

Pos logit contributions
ays+0.087
rol+0.079
unks+0.074
 valleys+0.074
ASH+0.074
Neg logit contributions
 bad-0.103
 important-0.100
 wonderful-0.099
 kind-0.095
 lesson-0.093
Token,
Token ID11
Feature activation+2.49e+00

Pos logit contributions
 valleys+0.039
ASH+0.037
 oceans+0.037
 hills+0.037
 rivers+0.036
Neg logit contributions
 important-0.012
 lesson-0.011
ggy-0.011
 caring-0.011
 not-0.011
Token leaving
Token ID4305
Feature activation+1.47e-01

Pos logit contributions
 valleys+0.084
ASH+0.084
ays+0.083
 oceans+0.078
 hills+0.078
Neg logit contributions
 not-0.051
 caring-0.048
 important-0.046
 about-0.042
 lesson-0.042
Token a
Token ID257
Feature activation+1.69e+00

Pos logit contributions
 valleys+0.005
 hills+0.005
 rivers+0.005
 oceans+0.005
ASH+0.005
Neg logit contributions
 caring-0.002
 important-0.002
ggy-0.002
 lesson-0.002
 forgiven-0.002
Token spir
Token ID9158
Feature activation+7.44e+00

Pos logit contributions
 valleys+0.057
ASH+0.056
ays+0.055
 oceans+0.054
 rivers+0.053
Neg logit contributions
 caring-0.033
 important-0.032
 bad-0.031
 lesson-0.031
 valuable-0.031
Tokenaling
Token ID4272
Feature activation+2.59e+00

Pos logit contributions
 valleys+0.156
 hills+0.154
 slope+0.142
 oceans+0.141
 rivers+0.138
Neg logit contributions
isi-0.196
 forgiven-0.195
ily-0.183
aid-0.173
isy-0.172
Token trail
Token ID8025
Feature activation+1.67e+00

Pos logit contributions
 valleys+0.067
ASH+0.066
ACK+0.062
 oceans+0.062
 lakes+0.060
Neg logit contributions
 lesson-0.059
 bad-0.057
ggy-0.057
 important-0.055
 that-0.053
Token of
Token ID286
Feature activation+8.02e-01

Pos logit contributions
ASH+0.056
 valleys+0.054
ACK+0.051
 oceans+0.051
 lakes+0.050
Neg logit contributions
 important-0.033
ggy-0.031
 lesson-0.030
 not-0.029
 valuable-0.029
Token black
Token ID2042
Feature activation+3.35e+00

Pos logit contributions
ASH+0.023
 valleys+0.022
ACK+0.022
ays+0.021
 oceans+0.021
Neg logit contributions
 important-0.018
 caring-0.018
 wonderful-0.018
 lesson-0.018
 bad-0.018
Token ink
Token ID16882
Feature activation+2.29e+00

Pos logit contributions
 railing+0.064
 south+0.062
ASH+0.062
 platform+0.061
ACK+0.061
Neg logit contributions
 happened-0.087
 about-0.084
 important-0.081
 lesson-0.081
 bad-0.081
Token and
Token ID290
Feature activation+2.40e+00

Pos logit contributions
 valleys+0.069
ASH+0.067
 oceans+0.065
 lakes+0.064
 rivers+0.064
Neg logit contributions
 not-0.029
 important-0.028
 about-0.028
 lesson-0.028
 caring-0.028
Token then
Token ID788
Feature activation+1.19e+00

Pos logit contributions
 valleys+0.075
ASH+0.074
 oceans+0.072
 railing+0.071
 spaces+0.070
Neg logit contributions
 caring-0.046
 not-0.042
ggy-0.042
 important-0.040
 why-0.039
Token a
Token ID257
Feature activation+2.08e-01

Pos logit contributions
 valleys+0.038
ASH+0.038
 oceans+0.037
 hills+0.036
 rivers+0.036
Neg logit contributions
ggy-0.023
 caring-0.023
 important-0.022
 bad-0.021
 not-0.021
Token loud
Token ID7812
Feature activation+2.81e+00

Pos logit contributions
 valleys+0.007
ASH+0.007
 oceans+0.007
 hills+0.007
 rivers+0.007
Neg logit contributions
 caring-0.004
 important-0.004
ggy-0.004
 lesson-0.004
 valuable-0.003
Token "
Token ID366
Feature activation+4.61e+00

Pos logit contributions
 valleys+0.089
 oceans+0.084
ays+0.083
ASH+0.082
 hills+0.082
Neg logit contributions
ggy-0.063
 bad-0.060
 caring-0.059
 lesson-0.057
 important-0.056
TokenCR
Token ID9419
Feature activation+7.43e+00

Pos logit contributions
 valleys+0.131
 hills+0.128
 oceans+0.127
 mountains+0.127
 rivers+0.123
Neg logit contributions
ggy-0.095
 caring-0.095
 lesson-0.094
 forgiven-0.091
 understand-0.089
TokenASH
Token ID11211
Feature activation+4.69e+00

Pos logit contributions
 valleys+0.117
 oceans+0.098
 rivers+0.091
 lakes+0.089
 hills+0.089
Neg logit contributions
ggy-0.252
 forgiven-0.251
 lesson-0.243
ggie-0.242
 friend-0.241
Token!"
Token ID2474
Feature activation+8.19e-01

Pos logit contributions
 valleys+0.147
 oceans+0.139
 rivers+0.136
 hills+0.134
 lakes+0.132
Neg logit contributions
ggy-0.110
 important-0.100
ggie-0.099
 lesson-0.096
 forgiven-0.095
Token echoed
Token ID22211
Feature activation+2.43e+00

Pos logit contributions
 valleys+0.031
 oceans+0.031
 rivers+0.030
ASH+0.030
 lakes+0.029
Neg logit contributions
ggy-0.011
 caring-0.010
 lesson-0.010
 important-0.010
 not-0.009
Token around
Token ID1088
Feature activation+2.28e+00

Pos logit contributions
 valleys+0.062
ASH+0.060
 oceans+0.059
 hills+0.058
 rivers+0.057
Neg logit contributions
 important-0.061
ggy-0.060
 caring-0.060
 lesson-0.059
 bad-0.057
Token the
Token ID262
Feature activation+3.07e+00

Pos logit contributions
ASH+0.084
 valleys+0.084
ays+0.083
 oceans+0.082
ACK+0.081
Neg logit contributions
 caring-0.028
 bad-0.026
ggy-0.025
 not-0.025
 important-0.025
Token reached
Token ID4251
Feature activation+2.06e+00

Pos logit contributions
 valleys+0.048
 oceans+0.048
 hills+0.047
ASH+0.047
 rivers+0.046
Neg logit contributions
ggy-0.030
 caring-0.029
 forgiven-0.029
 important-0.028
 understood-0.028
Token up
Token ID510
Feature activation+1.14e+00

Pos logit contributions
 valleys+0.061
ASH+0.060
 oceans+0.059
 lakes+0.058
 hills+0.057
Neg logit contributions
 important-0.043
 what-0.041
 lesson-0.041
 not-0.040
 caring-0.040
Token,
Token ID11
Feature activation+3.25e+00

Pos logit contributions
 valleys+0.041
ASH+0.040
 oceans+0.040
 lakes+0.039
 rivers+0.038
Neg logit contributions
 caring-0.018
 lesson-0.018
 important-0.017
 not-0.016
ggy-0.015
Token grabbed
Token ID13176
Feature activation+1.95e+00

Pos logit contributions
 valleys+0.110
ASH+0.109
 oceans+0.109
 lakes+0.107
ays+0.105
Neg logit contributions
 not-0.064
 caring-0.057
 what-0.055
 why-0.054
 about-0.052
Token the
Token ID262
Feature activation+1.12e+00

Pos logit contributions
 valleys+0.070
 oceans+0.068
 hills+0.067
 rivers+0.066
ASH+0.066
Neg logit contributions
 caring-0.030
 important-0.029
 lesson-0.029
ggy-0.027
 bad-0.026
Token pend
Token ID44017
Feature activation+7.41e+00

Pos logit contributions
 valleys+0.032
 oceans+0.030
ASH+0.030
ays+0.029
 rivers+0.029
Neg logit contributions
 lesson-0.029
 important-0.028
 bad-0.027
 wonderful-0.026
ggy-0.026
Tokenulum
Token ID14452
Feature activation+9.65e-01

Pos logit contributions
rol+0.179
sc+0.171
 valleys+0.170
 south+0.165
 streams+0.164
Neg logit contributions
ggy-0.162
Jerry-0.152
ggie-0.151
:-0.148
 not-0.147
Token,
Token ID11
Feature activation+3.12e+00

Pos logit contributions
 valleys+0.037
 oceans+0.035
 rivers+0.034
 lakes+0.034
ASH+0.034
Neg logit contributions
 not-0.015
 about-0.015
 caring-0.014
 important-0.014
 lesson-0.014
Token and
Token ID290
Feature activation+2.16e+00

Pos logit contributions
 valleys+0.105
 oceans+0.104
ASH+0.101
 hills+0.100
 lakes+0.099
Neg logit contributions
 not-0.064
 caring-0.057
 what-0.051
 understood-0.051
 understand-0.050
Token then
Token ID788
Feature activation+7.18e-01

Pos logit contributions
 valleys+0.072
 oceans+0.072
ASH+0.071
 rivers+0.069
 hills+0.069
Neg logit contributions
 caring-0.038
 important-0.036
 not-0.036
ggy-0.033
 understood-0.033
Token slowly
Token ID6364
Feature activation+1.43e+00

Pos logit contributions
 valleys+0.025
 oceans+0.024
ASH+0.024
 hills+0.024
 rivers+0.024
Neg logit contributions
 caring-0.013
ggy-0.012
 important-0.012
 lesson-0.012
 not-0.011
Token that
Token ID326
Feature activation+1.92e-01

Pos logit contributions
 caring+0.003
 important+0.003
ggy+0.003
 lesson+0.003
 forgiven+0.003
Neg logit contributions
 valleys-0.009
 hills-0.009
 oceans-0.009
 rivers-0.009
ASH-0.009
Token said
Token ID531
Feature activation-1.90e-01

Pos logit contributions
 valleys+0.007
 hills+0.007
 oceans+0.007
 rivers+0.007
ASH+0.007
Neg logit contributions
 important-0.003
 caring-0.003
ggy-0.002
 lesson-0.002
 forgiven-0.002
Token "
Token ID366
Feature activation+3.74e+00

Pos logit contributions
 caring+0.004
 important+0.004
 lesson+0.003
ggy+0.003
 forgiven+0.003
Neg logit contributions
 valleys-0.006
 hills-0.006
 oceans-0.006
 rivers-0.006
 mountains-0.006
TokenNo
Token ID2949
Feature activation-1.10e+00

Pos logit contributions
 valleys+0.103
 hills+0.096
 oceans+0.095
 mountains+0.094
 rivers+0.094
Neg logit contributions
 lesson-0.085
ggy-0.085
 caring-0.085
 important-0.082
 understand-0.079
Token T
Token ID309
Feature activation+2.54e+00

Pos logit contributions
 caring+0.026
 important+0.026
 lesson+0.025
ggy+0.025
 forgiven+0.023
Neg logit contributions
 valleys-0.032
 hills-0.030
 oceans-0.030
 rivers-0.030
ASH-0.029
Tokenresp
Token ID4363
Feature activation+7.40e+00

Pos logit contributions
 valleys+0.089
 hills+0.083
 oceans+0.082
 rivers+0.082
 mountains+0.079
Neg logit contributions
ggy-0.049
 important-0.046
 lesson-0.045
 caring-0.045
ggie-0.043
Tokenassing
Token ID19696
Feature activation+1.52e+00

Pos logit contributions
 valleys+0.190
 oceans+0.178
 railing+0.178
 asleep+0.177
ASH+0.176
Neg logit contributions
Jerry-0.162
 caring-0.158
isi-0.154
ggy-0.148
 Jerry-0.142
Token".
Token ID1911
Feature activation+6.97e-01

Pos logit contributions
 valleys+0.046
 oceans+0.043
ASH+0.042
 rivers+0.042
 hills+0.042
Neg logit contributions
 important-0.033
 lesson-0.033
ggy-0.032
 caring-0.031
ggie-0.029
Token
Token ID198
Feature activation-1.45e+00

Pos logit contributions
 valleys+0.025
 oceans+0.024
 hills+0.024
 rivers+0.024
 lakes+0.023
Neg logit contributions
 caring-0.011
 important-0.011
 lesson-0.011
ggy-0.010
 forgiven-0.010
Token
Token ID198
Feature activation-1.50e-01

Pos logit contributions
 caring+0.033
 important+0.032
 forgiven+0.031
 lesson+0.031
isi+0.030
Neg logit contributions
 hills-0.043
 valleys-0.043
 rivers-0.040
 oceans-0.039
 lakes-0.038
Token"
Token ID1
Feature activation+3.07e+00

Pos logit contributions
 caring+0.003
 important+0.003
 lesson+0.003
ggy+0.003
 forgiven+0.002
Neg logit contributions
 valleys-0.005
 hills-0.005
 rivers-0.005
 oceans-0.005
 mountains-0.005
Token to
Token ID284
Feature activation+2.19e+00

Pos logit contributions
 valleys+0.084
ASH+0.084
 oceans+0.082
 lakes+0.080
ACK+0.080
Neg logit contributions
 lesson-0.031
 not-0.025
 caring-0.025
 bad-0.024
 important-0.024
Token the
Token ID262
Feature activation+2.46e+00

Pos logit contributions
 valleys+0.084
 oceans+0.082
ASH+0.079
 rivers+0.079
 hills+0.079
Neg logit contributions
 lesson-0.026
 caring-0.025
 understand-0.024
 important-0.024
 wonderful-0.023
Token edge
Token ID5743
Feature activation+1.53e+00

Pos logit contributions
 valleys+0.072
 oceans+0.069
 rivers+0.067
ASH+0.066
 lakes+0.065
Neg logit contributions
 lesson-0.057
 important-0.057
 bad-0.054
 caring-0.053
ggy-0.051
Token of
Token ID286
Feature activation+1.11e+00

Pos logit contributions
 valleys+0.065
ASH+0.063
 oceans+0.062
 rivers+0.061
ACK+0.060
Neg logit contributions
 lesson-0.012
 important-0.012
 about-0.010
 caring-0.010
ggy-0.010
Token the
Token ID262
Feature activation+2.78e+00

Pos logit contributions
 valleys+0.042
ASH+0.041
 oceans+0.041
als+0.040
 rivers+0.040
Neg logit contributions
 caring-0.012
 lesson-0.012
 important-0.011
 bad-0.011
 not-0.011
Token tr
Token ID491
Feature activation+7.38e+00

Pos logit contributions
 valleys+0.068
 oceans+0.066
ASH+0.065
 rivers+0.064
 lakes+0.061
Neg logit contributions
 important-0.073
 lesson-0.073
 bad-0.071
 wonderful-0.068
 caring-0.067
Tokenamp
Token ID696
Feature activation+5.01e+00

Pos logit contributions
 valleys+0.133
 rivers+0.126
 oceans+0.126
 hills+0.120
 lakes+0.117
Neg logit contributions
ggy-0.232
live-0.218
 important-0.218
ggie-0.217
 lesson-0.216
Tokenoline
Token ID14453
Feature activation+3.53e+00

Pos logit contributions
 valleys+0.133
 oceans+0.124
 south+0.122
 rivers+0.122
 lakes+0.119
Neg logit contributions
 lesson-0.117
ggy-0.116
 important-0.110
ggie-0.106
:-0.106
Token and
Token ID290
Feature activation+2.63e+00

Pos logit contributions
 valleys+0.140
ASH+0.133
 oceans+0.131
ACK+0.129
 south+0.129
Neg logit contributions
 about-0.038
 that-0.038
:-0.036
 not-0.032
ggy-0.032
Token took
Token ID1718
Feature activation+1.20e+00

Pos logit contributions
 valleys+0.086
ASH+0.086
 oceans+0.085
 rivers+0.082
ays+0.082
Neg logit contributions
 not-0.046
 caring-0.044
ggy-0.043
 important-0.042
 lesson-0.042
Token a
Token ID257
Feature activation-3.35e-01

Pos logit contributions
 valleys+0.039
 oceans+0.038
 hills+0.037
 rivers+0.037
 lakes+0.036
Neg logit contributions
ggy-0.023
 lesson-0.022
 important-0.022
 caring-0.022
ggie-0.021
Token his
Token ID465
Feature activation-6.82e-01

Pos logit contributions
 valleys+0.057
ASH+0.056
 oceans+0.054
 hills+0.053
 rivers+0.053
Neg logit contributions
 caring-0.030
 important-0.028
ggy-0.028
 not-0.027
 lesson-0.027
Token elephant
Token ID20950
Feature activation-2.14e-01

Pos logit contributions
 important+0.014
 caring+0.014
ggy+0.014
 lesson+0.014
 forgiven+0.012
Neg logit contributions
 valleys-0.023
 oceans-0.021
 rivers-0.021
 hills-0.021
ASH-0.021
Token the
Token ID262
Feature activation+1.56e+00

Pos logit contributions
 important+0.004
 caring+0.004
ggy+0.004
 lesson+0.003
 forgiven+0.003
Neg logit contributions
 valleys-0.008
 oceans-0.007
 hills-0.007
 rivers-0.007
ASH-0.007
Token name
Token ID1438
Feature activation+3.77e-01

Pos logit contributions
 valleys+0.047
ays+0.043
 oceans+0.042
ASH+0.042
 hills+0.042
Neg logit contributions
 important-0.037
 caring-0.036
 lesson-0.036
 bad-0.033
ggy-0.033
Token "
Token ID366
Feature activation+3.56e+00

Pos logit contributions
 valleys+0.014
 hills+0.013
 oceans+0.013
 rivers+0.013
ASH+0.013
Neg logit contributions
 important-0.006
ggy-0.006
 lesson-0.006
 caring-0.006
ggie-0.005
TokenBI
Token ID3483
Feature activation+7.34e+00

Pos logit contributions
 valleys+0.103
 hills+0.097
 railing+0.097
 mountains+0.095
 oceans+0.095
Neg logit contributions
 caring-0.077
 lesson-0.076
 important-0.075
isi-0.074
 understand-0.074
TokenGG
Token ID11190
Feature activation+6.42e+00

Pos logit contributions
 valleys+0.217
 hills+0.207
 rivers+0.205
 railing+0.204
ASH+0.203
Neg logit contributions
 caring-0.146
Jerry-0.141
ggy-0.140
ggie-0.133
isi-0.132
TokenEST
Token ID6465
Feature activation+3.69e+00

Pos logit contributions
 valleys+0.178
 hills+0.163
 rivers+0.161
 oceans+0.158
ASH+0.158
Neg logit contributions
ggy-0.169
ggie-0.160
 caring-0.152
 important-0.147
isi-0.142
Token".
Token ID1911
Feature activation+1.06e+00

Pos logit contributions
 valleys+0.107
 hills+0.101
ASH+0.100
 oceans+0.100
 rivers+0.099
Neg logit contributions
ggy-0.088
ggie-0.082
 important-0.082
 lesson-0.080
 caring-0.079
Token He
Token ID679
Feature activation+1.00e+00

Pos logit contributions
 valleys+0.036
 oceans+0.035
ASH+0.035
 rivers+0.034
 hills+0.034
Neg logit contributions
ggy-0.017
 important-0.017
 lesson-0.016
 caring-0.016
 understand-0.015
Token was
Token ID373
Feature activation+5.73e-01

Pos logit contributions
 valleys+0.035
 hills+0.034
 oceans+0.034
ASH+0.034
 rivers+0.033
Neg logit contributions
 caring-0.017
ggy-0.017
 important-0.017
 lesson-0.016
ggie-0.015
Token Tommy
Token ID19919
Feature activation+3.53e-01

Pos logit contributions
 valleys+0.071
 oceans+0.068
 rivers+0.067
 hills+0.066
 lakes+0.066
Neg logit contributions
 important-0.035
 lesson-0.034
 caring-0.033
ggy-0.033
 forgiven-0.032
Token quietly
Token ID12703
Feature activation+1.38e+00

Pos logit contributions
 valleys+0.013
 oceans+0.012
 hills+0.012
 rivers+0.012
 lakes+0.012
Neg logit contributions
ggy-0.006
 caring-0.006
 important-0.006
 not-0.005
 understood-0.005
Token stepped
Token ID10764
Feature activation+5.41e+00

Pos logit contributions
 valleys+0.042
 hills+0.040
ASH+0.040
 oceans+0.039
 rivers+0.038
Neg logit contributions
 caring-0.028
ggy-0.028
 understand-0.027
 important-0.026
 understood-0.026
Token off
Token ID572
Feature activation+1.88e+00

Pos logit contributions
 valleys+0.163
ASH+0.152
 oceans+0.151
 spaces+0.149
 lakes+0.148
Neg logit contributions
:-0.110
 important-0.101
 what-0.099
 bad-0.099
 forgive-0.099
Token the
Token ID262
Feature activation+2.47e+00

Pos logit contributions
 valleys+0.078
ASH+0.076
 oceans+0.075
 lakes+0.073
 rivers+0.072
Neg logit contributions
 lesson-0.021
 bad-0.019
 important-0.018
 not-0.018
 caring-0.018
Token tr
Token ID491
Feature activation+7.33e+00

Pos logit contributions
 valleys+0.060
ays+0.057
 oceans+0.056
 rivers+0.054
ASH+0.054
Neg logit contributions
 lesson-0.067
 bad-0.067
 important-0.066
 caring-0.063
 wonderful-0.063
Tokenamp
Token ID696
Feature activation+5.10e+00

Pos logit contributions
 valleys+0.131
 oceans+0.123
 rivers+0.123
 hills+0.119
 lakes+0.115
Neg logit contributions
ggy-0.239
 important-0.226
 lesson-0.224
ggie-0.223
 caring-0.222
Tokenoline
Token ID14453
Feature activation+3.42e+00

Pos logit contributions
 valleys+0.136
 oceans+0.127
 rivers+0.124
 lakes+0.123
 south+0.122
Neg logit contributions
ggy-0.121
 lesson-0.120
 important-0.112
ggie-0.110
:-0.108
Token and
Token ID290
Feature activation+2.96e+00

Pos logit contributions
 valleys+0.129
ASH+0.122
 oceans+0.122
 lakes+0.119
 rivers+0.117
Neg logit contributions
 about-0.046
 that-0.045
 not-0.044
ggy-0.043
:-0.042
Token looked
Token ID3114
Feature activation+3.38e+00

Pos logit contributions
ays+0.103
 valleys+0.103
ASH+0.102
 oceans+0.101
 hills+0.100
Neg logit contributions
 not-0.046
 bad-0.044
 understood-0.044
 important-0.042
 why-0.042
Token around
Token ID1088
Feature activation+1.96e+00

Pos logit contributions
ASH+0.096
 valleys+0.090
ACK+0.087
 oceans+0.086
 hills+0.086
Neg logit contributions
 important-0.087
 wonderful-0.082
 what-0.082
 bad-0.082
 why-0.081
Token.
Token ID13
Feature activation+1.34e+00

Pos logit contributions
 important+0.005
 caring+0.005
ggy+0.005
 lesson+0.004
 not+0.004
Neg logit contributions
 valleys-0.010
 oceans-0.010
 hills-0.009
 rivers-0.009
ASH-0.009
Token She
Token ID1375
Feature activation+1.17e-01

Pos logit contributions
 valleys+0.045
 oceans+0.043
 rivers+0.043
ASH+0.043
 lakes+0.042
Neg logit contributions
 important-0.022
 lesson-0.021
 caring-0.021
 valuable-0.019
 understand-0.019
Token stepped
Token ID10764
Feature activation+4.90e+00

Pos logit contributions
 valleys+0.004
 oceans+0.004
 rivers+0.004
ASH+0.004
 hills+0.004
Neg logit contributions
 important-0.002
 lesson-0.002
 caring-0.002
ggy-0.002
 understand-0.002
Token onto
Token ID4291
Feature activation+1.96e+00

Pos logit contributions
ASH+0.149
 valleys+0.146
 oceans+0.134
 lakes+0.134
 plains+0.133
Neg logit contributions
 important-0.095
 what-0.091
 lesson-0.091
:-0.089
 about-0.089
Token the
Token ID262
Feature activation+3.08e+00

Pos logit contributions
 valleys+0.078
ASH+0.075
 oceans+0.075
 hills+0.073
 rivers+0.073
Neg logit contributions
 caring-0.022
 lesson-0.021
 important-0.021
ggy-0.019
 not-0.019
Token tr
Token ID491
Feature activation+7.32e+00

Pos logit contributions
ays+0.082
 valleys+0.081
 oceans+0.077
ASH+0.073
 rivers+0.072
Neg logit contributions
 important-0.082
 wonderful-0.076
 lesson-0.075
 bad-0.075
 caring-0.074
Tokenicycle
Token ID35298
Feature activation+2.57e+00

Pos logit contributions
 valleys+0.142
 oceans+0.136
 rivers+0.135
 hills+0.132
 lakes+0.128
Neg logit contributions
ggy-0.224
 important-0.211
ggie-0.209
 lesson-0.208
 caring-0.207
Tokenâ
Token ID22940
Feature activation+4.99e+00

Pos logit contributions
 valleys+0.101
 oceans+0.096
 lakes+0.094
 rivers+0.093
ASH+0.093
Neg logit contributions
 not-0.036
 about-0.035
 lesson-0.035
 that-0.034
 important-0.033
Token
Token ID26391
Feature activation+3.01e+00

Pos logit contributions
 valleys+0.192
 hills+0.187
 oceans+0.185
 rivers+0.182
 mountains+0.182
Neg logit contributions
ggy-0.075
 important-0.072
ggie-0.070
 lesson-0.066
 caring-0.066
Token
Token ID8151
Feature activation-2.39e-01

Pos logit contributions
 valleys+0.090
 oceans+0.088
 rivers+0.086
 hills+0.086
 streams+0.084
Neg logit contributions
ggy-0.069
ggie-0.061
 important-0.059
 not-0.058
 lesson-0.057
Tokens
Token ID82
Feature activation+4.63e-01

Pos logit contributions
 important+0.004
 caring+0.004
 lesson+0.004
ggy+0.004
 forgiven+0.004
Neg logit contributions
 valleys-0.008
 oceans-0.008
 rivers-0.008
 hills-0.008
ASH-0.008
Token out
Token ID503
Feature activation+2.13e+00

Pos logit contributions
 valleys+0.084
 oceans+0.080
 rivers+0.077
 mountains+0.077
 hills+0.076
Neg logit contributions
 not-0.054
ggy-0.052
 what-0.052
 forgive-0.052
 why-0.050
Token,
Token ID11
Feature activation+3.65e+00

Pos logit contributions
 valleys+0.075
 oceans+0.073
ASH+0.073
 rivers+0.070
 mountains+0.070
Neg logit contributions
 important-0.036
 bad-0.034
 caring-0.034
 not-0.033
 lesson-0.033
Token you
Token ID345
Feature activation+1.91e+00

Pos logit contributions
 valleys+0.132
 oceans+0.123
 hills+0.122
ASH+0.119
 rivers+0.118
Neg logit contributions
 why-0.069
 not-0.069
 what-0.066
 forgive-0.062
 okay-0.062
Token little
Token ID1310
Feature activation+2.11e+00

Pos logit contributions
 valleys+0.064
 oceans+0.062
 hills+0.060
 rivers+0.060
 lakes+0.057
Neg logit contributions
 important-0.039
ggy-0.038
 bad-0.038
 understand-0.037
 not-0.037
Token r
Token ID374
Feature activation+4.28e+00

Pos logit contributions
 valleys+0.071
 oceans+0.065
 rivers+0.064
 hills+0.063
 lakes+0.063
Neg logit contributions
 bad-0.038
 important-0.036
 caring-0.034
 lesson-0.033
 forgiveness-0.032
Tokenasc
Token ID3372
Feature activation+7.32e+00

Pos logit contributions
 valleys+0.110
 oceans+0.105
 hills+0.104
 rivers+0.102
 mountains+0.101
Neg logit contributions
ggy-0.108
 important-0.103
 forgiven-0.100
 caring-0.100
ggie-0.098
Tokenals
Token ID874
Feature activation+2.35e+00

Pos logit contributions
 valleys+0.118
 oceans+0.110
 south+0.108
hole+0.105
 rivers+0.104
Neg logit contributions
ggy-0.237
:-0.218
ggie-0.215
 bad-0.211
 mom-0.204
Token,"
Token ID553
Feature activation-1.48e-01

Pos logit contributions
 valleys+0.090
ASH+0.086
 oceans+0.086
 rivers+0.083
ACK+0.082
Neg logit contributions
ggy-0.032
 about-0.031
 bad-0.031
 that-0.030
 not-0.030
Token the
Token ID262
Feature activation+7.64e-01

Pos logit contributions
 caring+0.002
 important+0.002
ggy+0.002
 lesson+0.002
 forgiveness+0.002
Neg logit contributions
 valleys-0.005
 hills-0.005
 oceans-0.005
 rivers-0.005
ASH-0.005
Token driver
Token ID4639
Feature activation+2.52e-01

Pos logit contributions
 valleys+0.021
 oceans+0.020
 rivers+0.019
 hills+0.019
 lakes+0.019
Neg logit contributions
 important-0.019
 caring-0.019
ggy-0.019
 lesson-0.019
 bad-0.018
Token said
Token ID531
Feature activation-1.13e+00

Pos logit contributions
 valleys+0.008
 hills+0.008
 oceans+0.008
 rivers+0.008
ASH+0.008
Neg logit contributions
 caring-0.005
 important-0.005
 lesson-0.005
ggy-0.005
 forgiven-0.004
Token It
Token ID632
Feature activation+1.22e+00

Pos logit contributions
 valleys+0.054
 hills+0.052
 oceans+0.052
 rivers+0.051
ASH+0.050
Neg logit contributions
 caring-0.034
 lesson-0.033
 important-0.033
ggy-0.032
 valuable-0.031
Token was
Token ID373
Feature activation+1.76e+00

Pos logit contributions
 valleys+0.046
 hills+0.044
 oceans+0.044
 rivers+0.043
ASH+0.043
Neg logit contributions
 important-0.018
 caring-0.017
ggy-0.017
 lesson-0.017
 valuable-0.015
Token hiding
Token ID11816
Feature activation+2.53e+00

Pos logit contributions
 valleys+0.056
 hills+0.054
ASH+0.053
 oceans+0.053
 rivers+0.052
Neg logit contributions
 important-0.037
 caring-0.036
ggy-0.034
 lesson-0.034
 valuable-0.032
Token under
Token ID739
Feature activation+2.25e+00

Pos logit contributions
 valleys+0.067
 hills+0.063
ASH+0.063
 oceans+0.063
 rivers+0.062
Neg logit contributions
 important-0.061
ggy-0.059
 lesson-0.058
 not-0.057
 caring-0.057
Token a
Token ID257
Feature activation+2.86e+00

Pos logit contributions
 valleys+0.087
ASH+0.084
 oceans+0.084
 hills+0.083
hole+0.083
Neg logit contributions
ggy-0.024
 caring-0.023
 not-0.023
 lesson-0.022
 bad-0.021
Token seaw
Token ID32085
Feature activation+7.28e+00

Pos logit contributions
ASH+0.086
 valleys+0.084
 oceans+0.081
ACK+0.079
 rivers+0.079
Neg logit contributions
 bad-0.062
ggy-0.058
 lesson-0.058
 important-0.058
 valuable-0.058
Tokeneed
Token ID2308
Feature activation+3.41e+00

Pos logit contributions
 valleys+0.205
 rivers+0.190
 hills+0.189
 oceans+0.187
 lakes+0.181
Neg logit contributions
ggy-0.179
 important-0.174
 lesson-0.172
 caring-0.167
 valuable-0.166
Token.
Token ID13
Feature activation+1.28e+00

Pos logit contributions
 valleys+0.118
ASH+0.117
 oceans+0.107
 rivers+0.107
ACK+0.105
Neg logit contributions
 not-0.058
 that-0.055
 bad-0.054
ggy-0.053
 lesson-0.052
Token Lily
Token ID20037
Feature activation+1.08e+00

Pos logit contributions
 valleys+0.041
 hills+0.039
 oceans+0.039
 rivers+0.039
ASH+0.038
Neg logit contributions
 caring-0.024
 important-0.024
 lesson-0.023
ggy-0.023
 forgiven-0.022
Token said
Token ID531
Feature activation-9.06e-01

Pos logit contributions
 valleys+0.040
 oceans+0.039
 rivers+0.038
 hills+0.038
 lakes+0.037
Neg logit contributions
ggy-0.016
 caring-0.016
 not-0.015
 important-0.015
 understood-0.015
Token,
Token ID11
Feature activation+5.39e-01

Pos logit contributions
 caring+0.018
 important+0.018
 lesson+0.017
ggy+0.017
 forgiven+0.016
Neg logit contributions
 valleys-0.029
 hills-0.028
 oceans-0.028
 rivers-0.028
 mountains-0.027
Token She
Token ID1375
Feature activation+6.21e-01

Pos logit contributions
 valleys+0.039
 oceans+0.038
ASH+0.037
 rivers+0.037
 lakes+0.037
Neg logit contributions
 caring-0.022
 lesson-0.021
 important-0.021
 forgiven-0.020
 understand-0.020
Token walked
Token ID6807
Feature activation+3.88e+00

Pos logit contributions
ASH+0.021
 valleys+0.021
 oceans+0.020
 hills+0.020
ACK+0.020
Neg logit contributions
 important-0.011
ggy-0.011
 caring-0.010
 understood-0.010
 lesson-0.010
Token through
Token ID832
Feature activation+2.11e+00

Pos logit contributions
ASH+0.111
ays+0.104
ACK+0.102
 valleys+0.101
 spaces+0.099
Neg logit contributions
 lesson-0.085
 important-0.085
 caring-0.082
 not-0.081
 wonderful-0.079
Token the
Token ID262
Feature activation+3.40e+00

Pos logit contributions
ASH+0.085
ACK+0.082
ays+0.082
idd+0.078
rol+0.077
Neg logit contributions
 wonderful-0.021
 caring-0.021
 important-0.021
 lesson-0.021
 bad-0.019
Token tall
Token ID7331
Feature activation+4.74e+00

Pos logit contributions
ASH+0.080
ays+0.078
ACK+0.077
idd+0.072
rol+0.071
Neg logit contributions
 important-0.101
 wonderful-0.095
 bad-0.094
 lesson-0.090
 kind-0.090
Token st
Token ID336
Feature activation+7.25e+00

Pos logit contributions
ASH+0.118
rol+0.115
ACK+0.113
ays+0.111
idd+0.105
Neg logit contributions
 important-0.123
 lesson-0.116
 bad-0.116
 kind-0.113
 wonderful-0.111
Tokenalks
Token ID23833
Feature activation+1.90e+00

Pos logit contributions
ASH+0.180
 valleys+0.177
 oceans+0.174
ting+0.170
 lakes+0.169
Neg logit contributions
ggy-0.168
 important-0.155
 not-0.154
:-0.152
 valuable-0.151
Token of
Token ID286
Feature activation+9.82e-01

Pos logit contributions
ASH+0.068
ACK+0.064
 valleys+0.061
 oceans+0.058
 lakes+0.058
Neg logit contributions
 about-0.030
 important-0.029
 not-0.028
:-0.027
 caring-0.026
Token corn
Token ID11676
Feature activation+3.48e+00

Pos logit contributions
ASH+0.035
ACK+0.033
ays+0.033
 valleys+0.032
als+0.031
Neg logit contributions
 caring-0.015
 important-0.015
 wonderful-0.015
 lesson-0.015
 bad-0.014
Token and
Token ID290
Feature activation+2.00e+00

Pos logit contributions
ASH+0.092
ACK+0.088
 valleys+0.076
RE+0.075
ays+0.074
Neg logit contributions
 not-0.079
 caring-0.077
 important-0.076
 that-0.076
:-0.076
Token discovered
Token ID5071
Feature activation-1.14e-01

Pos logit contributions
ASH+0.065
ays+0.064
ACK+0.061
 valleys+0.060
 railing+0.060
Neg logit contributions
 caring-0.037
 important-0.035
 not-0.034
 wonderful-0.032
ggy-0.032
Token that
Token ID326
Feature activation+4.90e-01

Pos logit contributions
 valleys+0.016
ASH+0.015
 oceans+0.015
 rivers+0.015
 hills+0.015
Neg logit contributions
 important-0.005
ggy-0.005
 caring-0.005
 lesson-0.004
 bad-0.004
Token said
Token ID531
Feature activation-1.11e-01

Pos logit contributions
 valleys+0.019
 oceans+0.018
 hills+0.018
 rivers+0.018
ASH+0.018
Neg logit contributions
 important-0.006
 caring-0.006
 lesson-0.006
ggy-0.006
 forgiven-0.005
Token "
Token ID366
Feature activation+3.20e+00

Pos logit contributions
 caring+0.002
 important+0.002
 lesson+0.002
ggy+0.002
 forgiven+0.002
Neg logit contributions
 valleys-0.004
 hills-0.004
 oceans-0.004
 rivers-0.004
ASH-0.003
TokenNo
Token ID2949
Feature activation-1.24e+00

Pos logit contributions
 valleys+0.092
 oceans+0.087
 hills+0.086
 rivers+0.086
 mountains+0.085
Neg logit contributions
 lesson-0.072
ggy-0.072
 caring-0.071
 important-0.070
ggie-0.066
Token T
Token ID309
Feature activation+1.96e+00

Pos logit contributions
 caring+0.028
 important+0.028
 lesson+0.027
ggy+0.027
 forgiven+0.026
Neg logit contributions
 valleys-0.035
 hills-0.034
 rivers-0.033
 oceans-0.033
 mountains-0.032
Tokenresp
Token ID4363
Feature activation+7.23e+00

Pos logit contributions
 valleys+0.063
 hills+0.060
 oceans+0.059
 rivers+0.059
ASH+0.057
Neg logit contributions
ggy-0.041
 important-0.040
 caring-0.039
 lesson-0.039
ggie-0.037
Tokenassing
Token ID19696
Feature activation+1.32e+00

Pos logit contributions
 valleys+0.183
 oceans+0.172
 hills+0.171
 railing+0.170
 spaces+0.169
Neg logit contributions
 caring-0.164
Jerry-0.163
isi-0.163
ggy-0.154
 forgiven-0.149
Token".
Token ID1911
Feature activation+6.04e-01

Pos logit contributions
 valleys+0.039
 oceans+0.037
 rivers+0.036
 hills+0.036
ASH+0.036
Neg logit contributions
 important-0.029
 lesson-0.029
ggy-0.028
 caring-0.028
ggie-0.026
Token And
Token ID843
Feature activation-1.05e+00

Pos logit contributions
 valleys+0.018
 oceans+0.018
 rivers+0.017
 hills+0.017
 lakes+0.017
Neg logit contributions
 important-0.012
 lesson-0.012
 caring-0.012
ggy-0.012
 not-0.011
Token behind
Token ID2157
Feature activation+8.58e-01

Pos logit contributions
 caring+0.016
 important+0.016
ggy+0.015
 lesson+0.015
 forgiven+0.014
Neg logit contributions
 valleys-0.038
 hills-0.037
 oceans-0.037
 rivers-0.037
ASH-0.036
Token the
Token ID262
Feature activation+1.15e+00

Pos logit contributions
 valleys+0.030
ASH+0.029
 oceans+0.029
 hills+0.028
 rivers+0.028
Neg logit contributions
ggy-0.015
 important-0.014
 not-0.013
 lesson-0.013
 caring-0.013
Token wrong
Token ID2642
Feature activation-2.11e+00

Pos logit contributions
 valleys+0.031
 hills+0.030
 rivers+0.029
 oceans+0.029
 mountains+0.029
Neg logit contributions
 caring-0.013
 lesson-0.012
 important-0.012
ggy-0.012
 forgiven-0.012
Token.
Token ID13
Feature activation+6.99e-01

Pos logit contributions
 caring+0.048
 important+0.047
 lesson+0.046
ggy+0.046
 forgiven+0.045
Neg logit contributions
 valleys-0.058
 hills-0.057
 rivers-0.055
 oceans-0.055
 mountains-0.053
Token Tim
Token ID5045
Feature activation-2.03e+00

Pos logit contributions
 valleys+0.026
 oceans+0.025
 rivers+0.025
 hills+0.024
 lakes+0.024
Neg logit contributions
 caring-0.011
 important-0.011
 lesson-0.010
ggy-0.010
 forgiven-0.010
Tokenmy
Token ID1820
Feature activation-1.88e+00

Pos logit contributions
 caring+0.067
ggy+0.067
 lesson+0.066
 valuable+0.066
 rewarding+0.065
Neg logit contributions
ASH-0.036
 window-0.029
ACK-0.028
 bur-0.027
 hills-0.027
Token told
Token ID1297
Feature activation-2.33e+00

Pos logit contributions
 rewarding+0.043
vern+0.042
 lesson+0.041
 importantly+0.040
 caring+0.039
Neg logit contributions
 bur-0.044
ASH-0.043
 window-0.039
 hills-0.039
 fields-0.039
Token her
Token ID607
Feature activation-6.07e+00

Pos logit contributions
 rewarding+0.068
 valuable+0.068
 important+0.065
 importantly+0.063
 lesson+0.062
Neg logit contributions
 hills-0.045
ASH-0.045
 rivers-0.044
 mountains-0.044
ting-0.042
Token that
Token ID326
Feature activation-2.65e+00

Pos logit contributions
 rewarding+0.147
live+0.147
isi+0.143
Jerry+0.141
Days+0.137
Neg logit contributions
 down-0.119
 window-0.116
 asleep-0.108
 hills-0.105
 out-0.105
Token the
Token ID262
Feature activation-1.86e+00

Pos logit contributions
 caring+0.047
 important+0.047
 lesson+0.045
ggy+0.045
 forgiven+0.042
Neg logit contributions
 valleys-0.090
 hills-0.087
 oceans-0.086
 rivers-0.086
 mountains-0.083
Token w
Token ID266
Feature activation+9.98e-01

Pos logit contributions
ggy+0.053
 caring+0.053
 forgiven+0.052
ggie+0.050
 important+0.050
Neg logit contributions
 hills-0.041
 valleys-0.038
 rivers-0.037
 mountains-0.037
 oceans-0.037
Tokenaffle
Token ID30697
Feature activation-9.21e-01

Pos logit contributions
 valleys+0.038
 hills+0.037
 oceans+0.036
 rivers+0.036
ASH+0.035
Neg logit contributions
 caring-0.014
 important-0.014
 lesson-0.013
ggy-0.013
 forgiven-0.012
Token was
Token ID373
Feature activation-5.11e-01

Pos logit contributions
 caring+0.016
 important+0.016
 lesson+0.015
ggy+0.015
 forgiven+0.014
Neg logit contributions
 valleys-0.032
 hills-0.031
 oceans-0.030
 rivers-0.030
ASH-0.029
Token going
Token ID1016
Feature activation-8.21e-01

Pos logit contributions
 valleys+0.011
 hills+0.011
 mountains+0.011
 rivers+0.011
 oceans+0.011
Neg logit contributions
 caring-0.006
 lesson-0.006
ggy-0.006
 forgiven-0.006
 understand-0.006
Token on
Token ID319
Feature activation-8.36e-01

Pos logit contributions
 caring+0.026
ggy+0.026
 important+0.025
 lesson+0.025
ggie+0.025
Neg logit contributions
 hills-0.016
 valleys-0.015
 oceans-0.015
 rivers-0.014
 mountains-0.014
Token.
Token ID13
Feature activation+1.78e-01

Pos logit contributions
 caring+0.018
 important+0.017
 lesson+0.017
ggy+0.017
 forgiven+0.017
Neg logit contributions
 valleys-0.024
 hills-0.024
 rivers-0.023
 oceans-0.023
 mountains-0.023
Token Tim
Token ID5045
Feature activation-8.24e-01

Pos logit contributions
 valleys+0.007
 oceans+0.006
 rivers+0.006
 hills+0.006
 lakes+0.006
Neg logit contributions
 caring-0.003
 important-0.003
 lesson-0.003
ggy-0.003
 forgiven-0.003
Token told
Token ID1297
Feature activation-2.08e+00

Pos logit contributions
 important+0.015
 caring+0.015
 lesson+0.015
ggy+0.014
 valuable+0.014
Neg logit contributions
 valleys-0.025
 hills-0.025
 rivers-0.025
 oceans-0.024
 mountains-0.024
Token her
Token ID607
Feature activation-5.98e+00

Pos logit contributions
 valuable+0.054
 important+0.053
 rewarding+0.051
 lesson+0.050
ggy+0.049
Neg logit contributions
 hills-0.052
 rivers-0.051
 mountains-0.050
 valleys-0.049
 oceans-0.049
Token about
Token ID546
Feature activation-3.26e+00

Pos logit contributions
live+0.120
Jerry+0.114
 rewarding+0.112
isi+0.112
Days+0.106
Neg logit contributions
 mountains-0.154
 hills-0.152
 window-0.147
 rivers-0.144
 lakes-0.140
Token the
Token ID262
Feature activation-1.15e+00

Pos logit contributions
ggy+0.072
 lesson+0.071
 important+0.071
 forgiven+0.068
ggie+0.067
Neg logit contributions
 valleys-0.094
 hills-0.093
 rivers-0.092
 oceans-0.091
 mountains-0.090
Token talking
Token ID3375
Feature activation-9.49e-01

Pos logit contributions
ggy+0.026
 caring+0.025
ggie+0.025
 important+0.024
 forgiven+0.024
Neg logit contributions
 hills-0.032
 valleys-0.031
 rivers-0.030
 oceans-0.030
 mountains-0.030
Token cat
Token ID3797
Feature activation-2.62e+00

Pos logit contributions
 caring+0.023
 important+0.023
ggy+0.022
 lesson+0.022
 forgiven+0.021
Neg logit contributions
 hills-0.025
 valleys-0.025
 rivers-0.024
 oceans-0.024
 mountains-0.024
Token and
Token ID290
Feature activation-1.18e+00

Pos logit contributions
 caring+0.061
 important+0.060
 lesson+0.060
ggy+0.058
 forgiven+0.056
Neg logit contributions
 valleys-0.070
 hills-0.069
 rivers-0.068
 oceans-0.067
 mountains-0.066
Token meet
Token ID1826
Feature activation-9.50e-01

Pos logit contributions
 caring+0.017
 important+0.017
 lesson+0.016
ggy+0.016
 forgiven+0.016
Neg logit contributions
 valleys-0.023
 hills-0.022
 oceans-0.022
 rivers-0.022
ASH-0.021
Token new
Token ID649
Feature activation-2.17e+00

Pos logit contributions
 important+0.020
 caring+0.020
ggy+0.020
 lesson+0.019
 forgiven+0.019
Neg logit contributions
 valleys-0.027
 hills-0.027
 oceans-0.027
 rivers-0.027
 mountains-0.027
Token friends
Token ID2460
Feature activation-2.69e+00

Pos logit contributions
 caring+0.029
 important+0.028
 lesson+0.027
ggy+0.027
 forgiven+0.024
Neg logit contributions
 valleys-0.085
 hills-0.082
 oceans-0.081
 rivers-0.081
ASH-0.080
Token and
Token ID290
Feature activation-1.98e+00

Pos logit contributions
 caring+0.058
 important+0.057
ggy+0.056
 lesson+0.055
ggie+0.052
Neg logit contributions
 valleys-0.084
 hills-0.079
 oceans-0.078
ASH-0.078
 rivers-0.078
Token teach
Token ID4545
Feature activation-2.71e+00

Pos logit contributions
 important+0.036
ggy+0.036
 caring+0.036
 lesson+0.034
ggie+0.033
Neg logit contributions
 valleys-0.061
 hills-0.061
 rivers-0.059
 oceans-0.059
 mountains-0.058
Token them
Token ID606
Feature activation-5.96e+00

Pos logit contributions
 important+0.069
ggy+0.068
 caring+0.067
 lesson+0.067
 forgiven+0.066
Neg logit contributions
 valleys-0.068
 hills-0.068
 rivers-0.066
 oceans-0.066
 mountains-0.064
Token what
Token ID644
Feature activation-4.32e+00

Pos logit contributions
 caring+0.091
 important+0.091
ggy+0.090
 forgiven+0.089
 lesson+0.086
Neg logit contributions
 hills-0.203
 valleys-0.201
 rivers-0.197
 oceans-0.195
 mountains-0.194
Token they
Token ID484
Feature activation-1.57e+00

Pos logit contributions
isi+0.095
isy+0.093
Days+0.090
 forgiven+0.089
vern+0.087
Neg logit contributions
 hills-0.114
 mountains-0.106
 valleys-0.103
 rivers-0.103
 oceans-0.103
Token knew
Token ID2993
Feature activation-3.08e+00

Pos logit contributions
ggy+0.036
 caring+0.034
 lesson+0.033
ggie+0.033
 important+0.032
Neg logit contributions
 valleys-0.044
 hills-0.044
 oceans-0.043
 rivers-0.042
 mountains-0.041
Token.
Token ID13
Feature activation-3.72e-01

Pos logit contributions
 caring+0.061
 important+0.059
ggy+0.058
 lesson+0.058
 forgiven+0.055
Neg logit contributions
 valleys-0.096
 hills-0.094
 rivers-0.092
 oceans-0.092
 mountains-0.089
Token
Token ID198
Feature activation-1.69e+00

Pos logit contributions
 caring+0.009
 important+0.009
ggy+0.009
 lesson+0.008
 forgiven+0.008
Neg logit contributions
 valleys-0.011
 hills-0.010
 oceans-0.010
 rivers-0.010
 mountains-0.010
Token wrong
Token ID2642
Feature activation-2.51e+00

Pos logit contributions
 valleys+0.020
 hills+0.020
 oceans+0.019
 rivers+0.019
ASH+0.019
Neg logit contributions
 caring-0.007
 lesson-0.007
 important-0.007
ggy-0.007
 forgiven-0.006
Token.
Token ID13
Feature activation+1.54e-01

Pos logit contributions
 caring+0.063
 forgiven+0.061
isi+0.061
ggy+0.060
 important+0.060
Neg logit contributions
 hills-0.061
 valleys-0.060
 rivers-0.057
 oceans-0.056
 mountains-0.056
Token Tim
Token ID5045
Feature activation-2.13e+00

Pos logit contributions
 valleys+0.005
 hills+0.005
 oceans+0.005
 rivers+0.005
 lakes+0.005
Neg logit contributions
 caring-0.003
 important-0.003
 lesson-0.003
ggy-0.003
 forgiven-0.002
Tokenmy
Token ID1820
Feature activation-2.05e+00

Pos logit contributions
 caring+0.072
ggy+0.071
 lesson+0.070
 valuable+0.069
 rewarding+0.068
Neg logit contributions
ASH-0.038
 window-0.031
ACK-0.030
 hills-0.029
 valleys-0.028
Token told
Token ID1297
Feature activation-2.42e+00

Pos logit contributions
 rewarding+0.047
vern+0.047
 importantly+0.045
 lesson+0.045
 caring+0.044
Neg logit contributions
ASH-0.047
 bur-0.045
 window-0.043
 hills-0.043
 mountains-0.042
Token her
Token ID607
Feature activation-5.94e+00

Pos logit contributions
 valuable+0.070
 rewarding+0.069
 important+0.067
 lesson+0.064
 importantly+0.064
Neg logit contributions
 hills-0.050
ASH-0.049
 rivers-0.049
 mountains-0.048
 valleys-0.046
Token about
Token ID546
Feature activation-2.70e+00

Pos logit contributions
live+0.149
isi+0.141
 rewarding+0.138
Jerry+0.135
Deal+0.123
Neg logit contributions
 window-0.126
 down-0.123
 hills-0.118
 railing-0.115
 door-0.112
Token his
Token ID465
Feature activation-1.94e+00

Pos logit contributions
 important+0.053
ggy+0.052
 lesson+0.052
 caring+0.052
 forgiven+0.050
Neg logit contributions
 valleys-0.084
 hills-0.083
 rivers-0.081
 oceans-0.081
 mountains-0.079
Token computer
Token ID3644
Feature activation-1.73e+00

Pos logit contributions
ggy+0.050
 caring+0.049
isi+0.047
ggie+0.047
 forgiven+0.046
Neg logit contributions
 hills-0.048
 valleys-0.045
ASH-0.044
 oceans-0.043
 mountains-0.043
Token trouble
Token ID5876
Feature activation-1.55e+00

Pos logit contributions
 important+0.040
 caring+0.039
 lesson+0.038
ggy+0.038
 forgiven+0.036
Neg logit contributions
 valleys-0.051
 hills-0.048
 oceans-0.048
 rivers-0.048
ASH-0.046
Token.
Token ID13
Feature activation-5.56e-01

Pos logit contributions
 important+0.037
 caring+0.036
 lesson+0.035
ggy+0.035
 forgiven+0.033
Neg logit contributions
 valleys-0.045
 hills-0.042
 oceans-0.042
 rivers-0.042
ASH-0.040
Token her
Token ID607
Feature activation-3.29e+00

Pos logit contributions
 caring+0.046
 important+0.046
 lesson+0.045
ggy+0.045
 forgiven+0.042
Neg logit contributions
 valleys-0.064
 hills-0.062
 oceans-0.061
 rivers-0.061
 mountains-0.059
Token M
Token ID337
Feature activation-8.01e-01

Pos logit contributions
ggy+0.052
 caring+0.049
 important+0.048
 lesson+0.047
ggie+0.046
Neg logit contributions
 valleys-0.116
 hills-0.115
 oceans-0.112
 rivers-0.112
 mountains-0.109
Tokenummy
Token ID13513
Feature activation-3.40e+00

Pos logit contributions
 caring+0.015
 important+0.015
 lesson+0.015
ggy+0.015
 forgiven+0.014
Neg logit contributions
 valleys-0.025
 hills-0.025
 oceans-0.024
 rivers-0.024
ASH-0.024
Token for
Token ID329
Feature activation-2.67e+00

Pos logit contributions
 caring+0.074
 important+0.074
ggy+0.073
 lesson+0.071
 forgiven+0.068
Neg logit contributions
 valleys-0.100
 hills-0.097
 oceans-0.095
 rivers-0.095
 mountains-0.092
Token reminding
Token ID30047
Feature activation-3.05e+00

Pos logit contributions
 important+0.052
 caring+0.052
 lesson+0.051
ggy+0.051
 forgiven+0.047
Neg logit contributions
 valleys-0.084
 hills-0.082
 rivers-0.080
 oceans-0.080
 mountains-0.077
Token her
Token ID607
Feature activation-5.92e+00

Pos logit contributions
 important+0.060
 caring+0.059
ggy+0.059
 lesson+0.059
 valuable+0.055
Neg logit contributions
 valleys-0.095
 hills-0.093
 rivers-0.092
 oceans-0.091
ASH-0.088
Token.
Token ID13
Feature activation-5.81e-01

Pos logit contributions
Jerry+0.126
isi+0.125
ggy+0.119
live+0.112
ggie+0.111
Neg logit contributions
 hills-0.157
 valleys-0.152
 railing-0.151
 rivers-0.150
 oceans-0.148
Token Then
Token ID3244
Feature activation-1.85e+00

Pos logit contributions
 caring+0.012
 important+0.011
 lesson+0.011
 valuable+0.010
 forgiven+0.010
Neg logit contributions
 valleys-0.019
 oceans-0.018
 rivers-0.018
 hills-0.018
 lakes-0.017
Token,
Token ID11
Feature activation-1.01e+00

Pos logit contributions
 caring+0.031
 important+0.031
 lesson+0.030
ggy+0.029
 forgiven+0.027
Neg logit contributions
 valleys-0.065
 hills-0.063
 oceans-0.062
 rivers-0.062
ASH-0.061
Token she
Token ID673
Feature activation-6.15e-01

Pos logit contributions
 caring+0.016
 important+0.015
 lesson+0.014
ggy+0.013
 forgiven+0.013
Neg logit contributions
 valleys-0.038
 hills-0.037
 oceans-0.037
 rivers-0.036
ASH-0.036
Token jumped
Token ID11687
Feature activation+3.21e+00

Pos logit contributions
 caring+0.011
 important+0.011
 lesson+0.011
ggy+0.011
 forgiven+0.010
Neg logit contributions
 valleys-0.021
 hills-0.020
 oceans-0.020
 rivers-0.020
ASH-0.020
Token and
Token ID290
Feature activation-1.23e+00

Pos logit contributions
 valleys+0.014
ASH+0.013
 hills+0.013
 oceans+0.013
 rivers+0.013
Neg logit contributions
 caring-0.008
 important-0.008
 not-0.007
 lesson-0.007
 understand-0.007
Token asked
Token ID1965
Feature activation-2.03e+00

Pos logit contributions
 caring+0.022
 important+0.022
 lesson+0.020
ggy+0.020
 forgiven+0.019
Neg logit contributions
 valleys-0.043
 hills-0.042
 oceans-0.041
 rivers-0.041
ASH-0.040
Token grandma
Token ID49890
Feature activation-4.35e+00

Pos logit contributions
ggy+0.041
 caring+0.040
 important+0.040
 lesson+0.039
 valuable+0.037
Neg logit contributions
 valleys-0.061
 hills-0.061
 rivers-0.059
 oceans-0.059
 mountains-0.058
Token to
Token ID284
Feature activation-1.04e+00

Pos logit contributions
ggy+0.088
 lesson+0.087
 important+0.086
 caring+0.081
 valuable+0.080
Neg logit contributions
 hills-0.130
 rivers-0.128
 valleys-0.128
 oceans-0.128
 mountains-0.127
Token tell
Token ID1560
Feature activation-2.12e+00

Pos logit contributions
 caring+0.021
 important+0.021
 lesson+0.020
ggy+0.020
 forgiven+0.019
Neg logit contributions
 valleys-0.033
 hills-0.032
 oceans-0.031
 rivers-0.031
ASH-0.030
Token her
Token ID607
Feature activation-5.90e+00

Pos logit contributions
 caring+0.038
 important+0.038
 lesson+0.036
ggy+0.036
 forgiven+0.034
Neg logit contributions
 valleys-0.071
 hills-0.069
 rivers-0.068
 oceans-0.068
 mountains-0.066
Token some
Token ID617
Feature activation-3.43e+00

Pos logit contributions
Jerry+0.092
live+0.085
isi+0.084
 caring+0.083
ggy+0.083
Neg logit contributions
 hills-0.189
 mountains-0.188
 rivers-0.183
 valleys-0.183
 oceans-0.179
Token stories
Token ID3923
Feature activation-2.39e+00

Pos logit contributions
 important+0.052
 caring+0.052
 lesson+0.050
ggy+0.048
 forgiven+0.045
Neg logit contributions
 valleys-0.128
 hills-0.123
 oceans-0.121
 rivers-0.121
ASH-0.120
Token.
Token ID13
Feature activation-1.81e-01

Pos logit contributions
 caring+0.040
 important+0.039
 lesson+0.038
ggy+0.038
 forgiven+0.035
Neg logit contributions
 valleys-0.084
 hills-0.081
 oceans-0.080
 rivers-0.080
ASH-0.077
Token Grand
Token ID5675
Feature activation-1.74e+00

Pos logit contributions
 important+0.003
 caring+0.003
 lesson+0.003
ggy+0.003
 forgiven+0.003
Neg logit contributions
 valleys-0.006
 rivers-0.006
 hills-0.006
 oceans-0.006
 streams-0.006
Tokenma
Token ID2611
Feature activation-2.27e+00

Pos logit contributions
 important+0.037
 lesson+0.037
 caring+0.036
ggy+0.036
 valuable+0.036
Neg logit contributions
ASH-0.049
 hills-0.045
 valleys-0.044
 oceans-0.044
 rivers-0.043
Token ask
Token ID1265
Feature activation-1.79e+00

Pos logit contributions
 valleys+0.004
 oceans+0.004
 hills+0.004
ASH+0.004
 rivers+0.004
Neg logit contributions
 caring-0.003
 important-0.002
ggy-0.002
 understand-0.002
 forgiven-0.002
Token her
Token ID607
Feature activation-4.51e+00

Pos logit contributions
ggy+0.037
 caring+0.036
 important+0.035
 lesson+0.034
 forgiven+0.034
Neg logit contributions
 hills-0.054
 valleys-0.053
 rivers-0.052
 oceans-0.052
 mountains-0.051
Token mom
Token ID1995
Feature activation-3.29e+00

Pos logit contributions
ggy+0.072
Jerry+0.070
ggie+0.065
 forgiven+0.064
live+0.063
Neg logit contributions
 hills-0.154
 mountains-0.151
 window-0.146
 rivers-0.146
 oceans-0.145
Token to
Token ID284
Feature activation-9.85e-01

Pos logit contributions
 caring+0.073
 important+0.072
ggy+0.071
 lesson+0.071
 forgiven+0.067
Neg logit contributions
 valleys-0.097
 hills-0.094
 oceans-0.092
 rivers-0.092
 mountains-0.089
Token tell
Token ID1560
Feature activation-2.12e+00

Pos logit contributions
 caring+0.021
 important+0.021
 lesson+0.020
ggy+0.020
 forgiven+0.019
Neg logit contributions
 valleys-0.030
 hills-0.029
 oceans-0.029
 rivers-0.029
ASH-0.028
Token her
Token ID607
Feature activation-5.88e+00

Pos logit contributions
 caring+0.039
 important+0.039
ggy+0.038
 lesson+0.038
 forgiven+0.036
Neg logit contributions
 valleys-0.068
 hills-0.067
 rivers-0.066
 oceans-0.066
 mountains-0.064
Token a
Token ID257
Feature activation-3.89e+00

Pos logit contributions
Jerry+0.121
isi+0.109
live+0.107
 forgiven+0.103
ggy+0.101
Neg logit contributions
 hills-0.166
 mountains-0.163
 rivers-0.160
 valleys-0.157
 oceans-0.155
Token bed
Token ID3996
Feature activation-1.65e+00

Pos logit contributions
ggy+0.055
isi+0.055
 forgiven+0.055
live+0.053
 caring+0.053
Neg logit contributions
 hills-0.129
 valleys-0.123
 mountains-0.123
 rivers-0.121
 railing-0.121
Tokentime
Token ID2435
Feature activation-3.10e+00

Pos logit contributions
 caring+0.033
 important+0.032
 lesson+0.031
ggy+0.031
 forgiven+0.030
Neg logit contributions
 valleys-0.051
 hills-0.050
 oceans-0.049
 rivers-0.049
 mountains-0.048
Token story
Token ID1621
Feature activation-2.12e+00

Pos logit contributions
 caring+0.040
ggy+0.037
 forgiven+0.037
 important+0.036
 forgiveness+0.034
Neg logit contributions
 hills-0.111
 valleys-0.108
 rivers-0.107
 oceans-0.106
 mountains-0.105
Token.
Token ID13
Feature activation-6.29e-01

Pos logit contributions
 caring+0.044
 important+0.042
 lesson+0.041
ggy+0.041
 forgiven+0.040
Neg logit contributions
 valleys-0.062
 hills-0.062
 rivers-0.060
 oceans-0.060
 mountains-0.058
Token wrong
Token ID2642
Feature activation-2.50e+00

Pos logit contributions
 valleys+0.023
 hills+0.023
 oceans+0.022
 rivers+0.022
ASH+0.022
Neg logit contributions
 caring-0.011
 lesson-0.010
ggy-0.010
 important-0.009
isi-0.009
Token.
Token ID13
Feature activation-5.14e-02

Pos logit contributions
 caring+0.065
isi+0.064
 forgiven+0.063
ggy+0.062
 important+0.062
Neg logit contributions
 hills-0.058
 valleys-0.056
 rivers-0.054
 oceans-0.052
ASH-0.052
Token Tim
Token ID5045
Feature activation-2.26e+00

Pos logit contributions
 caring+0.001
 important+0.001
 lesson+0.001
ggy+0.001
 forgiven+0.001
Neg logit contributions
 valleys-0.002
 hills-0.002
 rivers-0.002
 oceans-0.002
ASH-0.002
Tokenmy
Token ID1820
Feature activation-2.16e+00

Pos logit contributions
ggy+0.080
 caring+0.079
 rewarding+0.078
 valuable+0.077
vern+0.077
Neg logit contributions
ASH-0.034
 window-0.027
ACK-0.025
 down-0.024
 bur-0.024
Token told
Token ID1297
Feature activation-2.55e+00

Pos logit contributions
 rewarding+0.056
vern+0.054
 importantly+0.052
ggy+0.051
isi+0.051
Neg logit contributions
ASH-0.045
 bur-0.043
 window-0.040
ACK-0.038
 hills-0.038
Token her
Token ID607
Feature activation-5.88e+00

Pos logit contributions
 rewarding+0.075
 valuable+0.074
 important+0.070
 importantly+0.069
 mattered+0.068
Neg logit contributions
 hills-0.051
 rivers-0.049
ASH-0.048
 mountains-0.048
 window-0.047
Token about
Token ID546
Feature activation-2.79e+00

Pos logit contributions
live+0.153
 rewarding+0.144
isi+0.142
Jerry+0.132
Deal+0.130
Neg logit contributions
 window-0.118
 down-0.117
 hills-0.109
 door-0.107
 mountains-0.104
Token his
Token ID465
Feature activation-1.38e+00

Pos logit contributions
ggy+0.056
 lesson+0.056
 important+0.055
 forgiven+0.054
 caring+0.053
Neg logit contributions
 hills-0.083
 valleys-0.082
 rivers-0.081
 oceans-0.080
 mountains-0.080
Token broken
Token ID5445
Feature activation+4.28e-01

Pos logit contributions
 caring+0.037
ggy+0.036
 important+0.035
 forgiven+0.035
isi+0.035
Neg logit contributions
 hills-0.033
 valleys-0.033
 oceans-0.031
 rivers-0.031
ASH-0.031
Token car
Token ID1097
Feature activation-1.20e+00

Pos logit contributions
 valleys+0.011
 oceans+0.010
 rivers+0.010
 hills+0.009
 streams+0.009
Neg logit contributions
 important-0.012
 lesson-0.012
 caring-0.012
ggy-0.011
 bad-0.011
Token.
Token ID13
Feature activation-6.41e-01

Pos logit contributions
 caring+0.030
 important+0.029
ggy+0.029
 lesson+0.029
 forgiven+0.027
Neg logit contributions
 valleys-0.031
 hills-0.031
 rivers-0.029
 oceans-0.029
 mountains-0.029
Token was
Token ID373
Feature activation+2.98e-01

Pos logit contributions
isi+0.087
oto+0.085
 caring+0.080
isy+0.080
Days+0.077
Neg logit contributions
 hills-0.063
 mountains-0.058
 valleys-0.056
 lands-0.055
 rivers-0.051
Token wrong
Token ID2642
Feature activation-2.40e+00

Pos logit contributions
 valleys+0.011
 hills+0.010
 rivers+0.010
 oceans+0.010
 mountains+0.010
Neg logit contributions
 caring-0.005
 lesson-0.005
ggy-0.004
 important-0.004
 forgiven-0.004
Token.
Token ID13
Feature activation+6.06e-01

Pos logit contributions
 caring+0.060
isi+0.059
 forgiven+0.059
ggy+0.058
 important+0.058
Neg logit contributions
 hills-0.057
 valleys-0.055
 rivers-0.053
 oceans-0.052
ASH-0.052
Token Lily
Token ID20037
Feature activation-1.86e+00

Pos logit contributions
 valleys+0.022
 hills+0.021
 oceans+0.021
 rivers+0.020
 lakes+0.020
Neg logit contributions
 caring-0.010
 important-0.010
 lesson-0.010
ggy-0.009
 forgiven-0.009
Token told
Token ID1297
Feature activation-2.02e+00

Pos logit contributions
 lesson+0.037
 rewarding+0.036
 caring+0.036
 important+0.035
ggy+0.035
Neg logit contributions
ASH-0.050
 hills-0.048
 valleys-0.048
 bur-0.048
 mountains-0.047
Token her
Token ID607
Feature activation-5.87e+00

Pos logit contributions
 valuable+0.056
 important+0.055
 rewarding+0.054
 lesson+0.053
ggy+0.052
Neg logit contributions
 hills-0.045
 rivers-0.044
ASH-0.044
 mountains-0.043
 valleys-0.043
Token that
Token ID326
Feature activation-2.35e+00

Pos logit contributions
live+0.154
Jerry+0.153
 rewarding+0.149
isi+0.147
Fin+0.138
Neg logit contributions
 window-0.125
 curtains-0.111
 door-0.111
 down-0.108
 railing-0.100
Token she
Token ID673
Feature activation-1.79e+00

Pos logit contributions
 caring+0.043
 important+0.042
 lesson+0.041
ggy+0.040
 forgiven+0.038
Neg logit contributions
 valleys-0.080
 hills-0.077
 oceans-0.076
 rivers-0.076
ASH-0.073
Token wanted
Token ID2227
Feature activation-1.80e+00

Pos logit contributions
 caring+0.037
ggy+0.034
 lesson+0.034
isi+0.034
oto+0.032
Neg logit contributions
 valleys-0.049
 hills-0.049
ASH-0.048
 mountains-0.047
 rivers-0.047
Token to
Token ID284
Feature activation-1.35e+00

Pos logit contributions
 caring+0.037
 important+0.036
 lesson+0.035
ggy+0.035
 forgiven+0.033
Neg logit contributions
 valleys-0.058
 hills-0.056
 oceans-0.055
 rivers-0.054
ASH-0.053
Token bring
Token ID2222
Feature activation-1.06e+00

Pos logit contributions
 caring+0.023
 important+0.023
 lesson+0.022
ggy+0.021
 valuable+0.020
Neg logit contributions
 valleys-0.045
 hills-0.045
 rivers-0.043
 oceans-0.043
ASH-0.042
Token was
Token ID373
Feature activation+8.29e-01

Pos logit contributions
isi+0.069
Days+0.067
oto+0.066
 caring+0.065
isy+0.064
Neg logit contributions
 hills-0.062
 mountains-0.058
 valleys-0.058
 lands-0.057
 rivers-0.055
Token wrong
Token ID2642
Feature activation-2.34e+00

Pos logit contributions
 valleys+0.029
 hills+0.029
 oceans+0.028
 rivers+0.028
 mountains+0.027
Neg logit contributions
 caring-0.014
 lesson-0.013
 important-0.012
 forgiven-0.012
ggy-0.012
Token.
Token ID13
Feature activation+1.89e-01

Pos logit contributions
 caring+0.054
 important+0.053
 lesson+0.052
 forgiven+0.052
ggy+0.052
Neg logit contributions
 hills-0.061
 valleys-0.061
 rivers-0.059
 oceans-0.058
 mountains-0.057
Token Lily
Token ID20037
Feature activation-1.67e+00

Pos logit contributions
 valleys+0.007
 hills+0.007
 oceans+0.006
 rivers+0.006
ASH+0.006
Neg logit contributions
 caring-0.003
 important-0.003
 lesson-0.003
ggy-0.003
 forgiven-0.003
Token told
Token ID1297
Feature activation-2.11e+00

Pos logit contributions
 important+0.030
 lesson+0.030
 caring+0.030
ggy+0.028
 valuable+0.028
Neg logit contributions
ASH-0.048
 valleys-0.048
 hills-0.048
 mountains-0.047
 rivers-0.047
Token her
Token ID607
Feature activation-5.86e+00

Pos logit contributions
 important+0.057
 valuable+0.057
 rewarding+0.056
 lesson+0.054
ggy+0.053
Neg logit contributions
 hills-0.047
 rivers-0.046
ASH-0.045
 mountains-0.045
 valleys-0.045
Token about
Token ID546
Feature activation-2.50e+00

Pos logit contributions
live+0.142
 rewarding+0.140
Jerry+0.138
isi+0.131
Deal+0.126
Neg logit contributions
 window-0.137
 curtains-0.120
 down-0.118
 door-0.118
 asleep-0.114
Token the
Token ID262
Feature activation-5.63e-01

Pos logit contributions
 important+0.051
 caring+0.050
 lesson+0.050
ggy+0.049
 forgiven+0.047
Neg logit contributions
 valleys-0.077
 hills-0.076
 rivers-0.074
 oceans-0.074
 mountains-0.072
Token butterfly
Token ID35113
Feature activation-2.17e+00

Pos logit contributions
 caring+0.015
 important+0.015
ggy+0.015
 lesson+0.014
 forgiven+0.014
Neg logit contributions
 valleys-0.014
 hills-0.014
 oceans-0.013
 rivers-0.013
 mountains-0.013
Token and
Token ID290
Feature activation-1.01e+00

Pos logit contributions
 caring+0.053
 important+0.053
 lesson+0.051
ggy+0.051
 forgiven+0.049
Neg logit contributions
 valleys-0.059
 hills-0.057
 oceans-0.055
 rivers-0.055
 mountains-0.053
Token her
Token ID607
Feature activation-3.35e+00

Pos logit contributions
 important+0.017
ggy+0.017
 lesson+0.017
 caring+0.016
ggie+0.015
Neg logit contributions
 valleys-0.034
 hills-0.033
 rivers-0.032
 oceans-0.032
 mountains-0.032
Token thanked
Token ID26280
Feature activation-1.52e+00

Pos logit contributions
ggy+0.011
 important+0.011
 caring+0.011
 lesson+0.011
ggie+0.011
Neg logit contributions
 valleys-0.018
 hills-0.018
 rivers-0.018
 oceans-0.017
 mountains-0.017
Token her
Token ID607
Feature activation-3.90e+00

Pos logit contributions
 important+0.030
 caring+0.029
 lesson+0.029
ggy+0.029
 forgiven+0.027
Neg logit contributions
 valleys-0.049
 hills-0.048
 rivers-0.047
 oceans-0.047
 mountains-0.046
Token mom
Token ID1995
Feature activation-3.36e+00

Pos logit contributions
live+0.075
Jerry+0.075
ggie+0.073
ggy+0.072
isi+0.072
Neg logit contributions
 window-0.126
 hills-0.114
 railing-0.111
 mountains-0.108
 curtains-0.107
Token for
Token ID329
Feature activation-2.19e+00

Pos logit contributions
ggy+0.082
 caring+0.080
 important+0.080
 lesson+0.078
ggie+0.076
Neg logit contributions
 valleys-0.087
 hills-0.087
 rivers-0.084
 oceans-0.084
 mountains-0.082
Token telling
Token ID5149
Feature activation-2.27e+00

Pos logit contributions
 important+0.047
 lesson+0.046
ggy+0.045
 caring+0.044
 valuable+0.043
Neg logit contributions
 valleys-0.065
 hills-0.063
 rivers-0.062
 oceans-0.062
 mountains-0.059
Token her
Token ID607
Feature activation-5.86e+00

Pos logit contributions
ggy+0.050
 important+0.048
 lesson+0.047
 caring+0.046
ggie+0.045
Neg logit contributions
 valleys-0.064
 hills-0.063
 rivers-0.062
 oceans-0.061
 mountains-0.060
Token the
Token ID262
Feature activation-2.90e+00

Pos logit contributions
Jerry+0.110
isi+0.107
live+0.107
ggy+0.091
Mic+0.089
Neg logit contributions
 hills-0.170
 rivers-0.165
 mountains-0.164
 window-0.162
 valleys-0.162
Token truth
Token ID3872
Feature activation-3.14e+00

Pos logit contributions
 caring+0.039
ggy+0.039
 forgiven+0.036
 important+0.036
isi+0.035
Neg logit contributions
 hills-0.104
 valleys-0.103
 rivers-0.101
 mountains-0.100
 oceans-0.100
Token.
Token ID13
Feature activation-5.66e-02

Pos logit contributions
 caring+0.069
 important+0.068
 lesson+0.066
ggy+0.065
 forgiven+0.064
Neg logit contributions
 valleys-0.088
 hills-0.086
 rivers-0.084
 oceans-0.083
 mountains-0.081
Token From
Token ID3574
Feature activation-1.88e+00

Pos logit contributions
 caring+0.001
 important+0.001
 lesson+0.001
ggy+0.001
 forgiven+0.001
Neg logit contributions
 valleys-0.002
 hills-0.002
 oceans-0.002
 rivers-0.002
ASH-0.002
Token that
Token ID326
Feature activation-2.18e+00

Pos logit contributions
 lesson+0.034
 forgiven+0.033
ggy+0.033
 caring+0.032
 understand+0.032
Neg logit contributions
 hills-0.060
 mountains-0.059
 valleys-0.058
 oceans-0.058
 rivers-0.057
Token what
Token ID644
Feature activation-3.12e+00

Pos logit contributions
ggy+0.018
ggie+0.017
 caring+0.017
isi+0.017
 lesson+0.017
Neg logit contributions
 hills-0.023
 valleys-0.023
 rivers-0.022
 oceans-0.022
 mountains-0.022
Token happened
Token ID3022
Feature activation-1.07e+00

Pos logit contributions
isi+0.081
Days+0.079
oto+0.078
Years+0.075
isy+0.074
Neg logit contributions
 hills-0.066
 lands-0.060
 mountains-0.060
 valleys-0.059
 clouds-0.055
Token.
Token ID13
Feature activation+6.79e-01

Pos logit contributions
 important+0.020
 caring+0.020
 lesson+0.019
ggy+0.019
 forgiven+0.018
Neg logit contributions
 valleys-0.036
 hills-0.035
 oceans-0.034
 rivers-0.034
 mountains-0.033
Token Lily
Token ID20037
Feature activation-1.29e+00

Pos logit contributions
 valleys+0.022
 hills+0.022
 oceans+0.021
 rivers+0.021
ASH+0.020
Neg logit contributions
 caring-0.013
ggy-0.013
 important-0.013
 lesson-0.012
 forgiven-0.012
Token told
Token ID1297
Feature activation-2.20e+00

Pos logit contributions
 lesson+0.025
 caring+0.024
ggy+0.024
 important+0.024
 valuable+0.023
Neg logit contributions
ASH-0.037
 valleys-0.037
 hills-0.036
 mountains-0.035
 rivers-0.035
Token her
Token ID607
Feature activation-5.85e+00

Pos logit contributions
 valuable+0.064
 important+0.062
 rewarding+0.062
ggy+0.061
 lesson+0.061
Neg logit contributions
 hills-0.045
 rivers-0.044
ASH-0.043
 mountains-0.043
 valleys-0.043
Token that
Token ID326
Feature activation-2.44e+00

Pos logit contributions
live+0.149
Jerry+0.145
isi+0.143
 rewarding+0.138
ggie+0.132
Neg logit contributions
 window-0.141
 down-0.124
 curtains-0.121
 door-0.118
 railing-0.115
Token she
Token ID673
Feature activation-1.45e+00

Pos logit contributions
 caring+0.048
 important+0.048
ggy+0.048
 lesson+0.046
 forgiven+0.045
Neg logit contributions
 valleys-0.077
 hills-0.075
 oceans-0.073
 rivers-0.073
 mountains-0.071
Token broke
Token ID6265
Feature activation+3.04e-01

Pos logit contributions
 caring+0.031
ggy+0.030
isi+0.030
 lesson+0.030
ggie+0.029
Neg logit contributions
 valleys-0.039
ASH-0.038
 hills-0.038
 oceans-0.036
 mountains-0.036
Token her
Token ID607
Feature activation-4.20e-01

Pos logit contributions
 valleys+0.011
 hills+0.010
 oceans+0.010
 rivers+0.010
 lakes+0.010
Neg logit contributions
 caring-0.005
 important-0.005
 lesson-0.005
ggy-0.005
 not-0.004
Token triangle
Token ID22950
Feature activation-6.74e-01

Pos logit contributions
 caring+0.012
 important+0.012
 lesson+0.012
ggy+0.012
 forgiven+0.011
Neg logit contributions
 valleys-0.010
 hills-0.009
 oceans-0.009
 rivers-0.009
ASH-0.009
Token to
Token ID284
Feature activation-3.99e-01

Pos logit contributions
 valleys+0.005
ASH+0.005
 oceans+0.005
 rivers+0.005
 lakes+0.004
Neg logit contributions
 lesson-0.002
 important-0.002
 caring-0.002
ggy-0.002
 not-0.002
Token their
Token ID511
Feature activation-5.45e-01

Pos logit contributions
 important+0.008
 caring+0.008
ggy+0.008
 lesson+0.007
 forgiven+0.007
Neg logit contributions
 valleys-0.013
 hills-0.013
 rivers-0.012
 oceans-0.012
 mountains-0.012
Token mom
Token ID1995
Feature activation-9.18e-01

Pos logit contributions
 caring+0.010
 important+0.010
 lesson+0.010
ggy+0.010
 valuable+0.009
Neg logit contributions
 valleys-0.018
 oceans-0.017
 hills-0.017
 rivers-0.017
ASH-0.017
Token and
Token ID290
Feature activation-6.17e-01

Pos logit contributions
 caring+0.020
 important+0.019
ggy+0.018
 forgiven+0.018
 lesson+0.018
Neg logit contributions
 valleys-0.028
 hills-0.026
 oceans-0.026
 rivers-0.026
ASH-0.026
Token told
Token ID1297
Feature activation-1.67e+00

Pos logit contributions
 important+0.012
 lesson+0.012
ggy+0.012
 valuable+0.011
ggie+0.011
Neg logit contributions
 hills-0.018
 valleys-0.018
 rivers-0.017
 window-0.017
 mountains-0.017
Token her
Token ID607
Feature activation-5.85e+00

Pos logit contributions
ggy+0.041
 valuable+0.041
live+0.041
 important+0.040
 rewarding+0.039
Neg logit contributions
 hills-0.038
 rivers-0.037
 valleys-0.037
 mountains-0.036
 oceans-0.035
Token what
Token ID644
Feature activation-3.63e+00

Pos logit contributions
isi+0.108
live+0.107
Jerry+0.101
Days+0.091
ggy+0.091
Neg logit contributions
 hills-0.174
 mountains-0.170
 valleys-0.170
 rivers-0.168
 oceans-0.166
Token they
Token ID484
Feature activation-1.28e+00

Pos logit contributions
isi+0.072
Days+0.070
isy+0.069
 forgiven+0.069
 caring+0.067
Neg logit contributions
 hills-0.102
 mountains-0.099
 valleys-0.097
 rivers-0.096
 oceans-0.096
Token did
Token ID750
Feature activation-2.22e+00

Pos logit contributions
ggy+0.030
 caring+0.030
ggie+0.028
 lesson+0.028
 forgiveness+0.027
Neg logit contributions
 valleys-0.035
 hills-0.034
 rivers-0.033
 oceans-0.033
 mountains-0.033
Token.
Token ID13
Feature activation-5.95e-01

Pos logit contributions
 caring+0.052
ggy+0.050
 important+0.049
 lesson+0.048
isi+0.047
Neg logit contributions
 valleys-0.060
 hills-0.060
 oceans-0.058
 rivers-0.058
 mountains-0.055
Token
Token ID198
Feature activation-2.08e+00

Pos logit contributions
 caring+0.010
 important+0.010
ggy+0.010
 lesson+0.009
ggie+0.009
Neg logit contributions
 valleys-0.020
 hills-0.020
 rivers-0.019
 oceans-0.019
 mountains-0.019
Token wrong
Token ID2642
Feature activation-2.44e+00

Pos logit contributions
 valleys+0.026
 hills+0.026
 oceans+0.025
 rivers+0.025
 mountains+0.025
Neg logit contributions
 caring-0.011
 lesson-0.010
ggy-0.010
 important-0.010
 forgiven-0.009
Token.
Token ID13
Feature activation+3.06e-01

Pos logit contributions
 caring+0.061
 forgiven+0.059
isi+0.058
 important+0.058
ggy+0.058
Neg logit contributions
 hills-0.060
 valleys-0.059
 rivers-0.057
 oceans-0.056
 mountains-0.056
Token Tim
Token ID5045
Feature activation-2.10e+00

Pos logit contributions
 valleys+0.011
 oceans+0.010
 hills+0.010
 rivers+0.010
 lakes+0.010
Neg logit contributions
 caring-0.005
 important-0.005
ggy-0.005
 lesson-0.005
 forgiven-0.005
Tokenmy
Token ID1820
Feature activation-1.99e+00

Pos logit contributions
 caring+0.070
 lesson+0.070
ggy+0.070
 valuable+0.069
 rewarding+0.067
Neg logit contributions
ASH-0.036
 window-0.029
ACK-0.028
 hills-0.027
hole-0.027
Token told
Token ID1297
Feature activation-2.19e+00

Pos logit contributions
 rewarding+0.043
 lesson+0.042
vern+0.042
 caring+0.041
 importantly+0.040
Neg logit contributions
ASH-0.048
 bur-0.047
 hills-0.045
 window-0.044
 valleys-0.043
Token her
Token ID607
Feature activation-5.84e+00

Pos logit contributions
 valuable+0.062
 rewarding+0.061
 important+0.059
 lesson+0.057
 importantly+0.056
Neg logit contributions
 hills-0.047
ASH-0.045
 rivers-0.045
 mountains-0.045
 valleys-0.043
Token he
Token ID339
Feature activation-1.88e+00

Pos logit contributions
live+0.144
isi+0.141
 rewarding+0.140
Jerry+0.135
Deal+0.129
Neg logit contributions
 window-0.119
 down-0.117
 hills-0.114
 mountains-0.111
 railing-0.110
Token couldn
Token ID3521
Feature activation-1.02e+00

Pos logit contributions
 caring+0.045
 lesson+0.041
ggy+0.041
vern+0.040
isi+0.040
Neg logit contributions
 valleys-0.045
 hills-0.045
ASH-0.044
 mountains-0.044
 rivers-0.043
Token't
Token ID470
Feature activation-2.03e+00

Pos logit contributions
 important+0.019
 caring+0.019
ggy+0.019
 lesson+0.018
 forgiven+0.017
Neg logit contributions
 valleys-0.034
 hills-0.033
 oceans-0.033
 rivers-0.033
 lakes-0.032
Token find
Token ID1064
Feature activation+1.89e-01

Pos logit contributions
 caring+0.055
 important+0.053
 lesson+0.052
ggy+0.051
 forgiven+0.050
Neg logit contributions
 valleys-0.048
 hills-0.048
 rivers-0.045
 oceans-0.045
ASH-0.044
Token his
Token ID465
Feature activation-2.39e+00

Pos logit contributions
 valleys+0.006
 hills+0.006
 rivers+0.005
 oceans+0.005
 mountains+0.005
Neg logit contributions
 caring-0.004
 lesson-0.004
 important-0.004
ggy-0.004
 forgiven-0.004
Token wrong
Token ID2642
Feature activation-2.48e+00

Pos logit contributions
 valleys+0.026
 hills+0.025
 oceans+0.025
 rivers+0.025
ASH+0.025
Neg logit contributions
 caring-0.011
 lesson-0.011
ggy-0.010
 important-0.010
 forgiven-0.010
Token.
Token ID13
Feature activation+2.49e-01

Pos logit contributions
 caring+0.063
isi+0.061
 forgiven+0.060
ggy+0.060
 important+0.059
Neg logit contributions
 hills-0.060
 valleys-0.058
 rivers-0.056
 oceans-0.055
 mountains-0.055
Token Tim
Token ID5045
Feature activation-2.07e+00

Pos logit contributions
 valleys+0.009
 hills+0.008
 oceans+0.008
 rivers+0.008
 lakes+0.008
Neg logit contributions
 caring-0.005
 important-0.004
 lesson-0.004
ggy-0.004
 forgiven-0.004
Tokenmy
Token ID1820
Feature activation-2.03e+00

Pos logit contributions
ggy+0.069
 caring+0.069
 valuable+0.067
 lesson+0.067
ggie+0.066
Neg logit contributions
ASH-0.036
 window-0.029
 hills-0.028
ACK-0.028
 valleys-0.027
Token told
Token ID1297
Feature activation-2.38e+00

Pos logit contributions
 rewarding+0.047
vern+0.046
 importantly+0.045
ggy+0.044
 lesson+0.044
Neg logit contributions
ASH-0.046
 bur-0.045
 hills-0.042
 window-0.042
 mountains-0.041
Token her
Token ID607
Feature activation-5.84e+00

Pos logit contributions
 valuable+0.071
 rewarding+0.070
 important+0.067
 importantly+0.065
 mattered+0.064
Neg logit contributions
 hills-0.048
ASH-0.046
 rivers-0.046
 mountains-0.046
 valleys-0.043
Token about
Token ID546
Feature activation-2.98e+00

Pos logit contributions
live+0.148
 rewarding+0.140
isi+0.139
Jerry+0.133
Deal+0.130
Neg logit contributions
 down-0.115
 window-0.115
 hills-0.110
 mountains-0.106
 asleep-0.106
Token the
Token ID262
Feature activation-3.98e-01

Pos logit contributions
ggy+0.061
 important+0.061
 lesson+0.060
 forgiven+0.060
oral+0.059
Neg logit contributions
 hills-0.084
 valleys-0.083
 oceans-0.082
 rivers-0.081
 mountains-0.081
Token broken
Token ID5445
Feature activation+9.31e-01

Pos logit contributions
 caring+0.011
 important+0.011
ggy+0.010
 lesson+0.010
 forgiven+0.010
Neg logit contributions
 valleys-0.010
 hills-0.010
 oceans-0.009
 rivers-0.009
ASH-0.009
Token whistle
Token ID16121
Feature activation-1.37e+00

Pos logit contributions
 valleys+0.025
 hills+0.023
 rivers+0.023
 oceans+0.023
 streams+0.022
Neg logit contributions
 important-0.025
 lesson-0.025
 caring-0.024
ggy-0.023
 bad-0.022
Token.
Token ID13
Feature activation-1.84e-01

Pos logit contributions
 caring+0.033
 important+0.033
 lesson+0.032
ggy+0.032
 forgiven+0.031
Neg logit contributions
 valleys-0.038
 hills-0.036
 oceans-0.036
 rivers-0.035
ASH-0.034
Token was
Token ID373
Feature activation+3.97e-01

Pos logit contributions
isi+0.089
oto+0.086
Days+0.084
isy+0.083
oral+0.081
Neg logit contributions
 hills-0.055
 lands-0.051
 mountains-0.050
 valleys-0.050
 rivers-0.045
Token wrong
Token ID2642
Feature activation-2.52e+00

Pos logit contributions
 valleys+0.014
 hills+0.014
 rivers+0.013
 oceans+0.013
 mountains+0.013
Neg logit contributions
 caring-0.006
 lesson-0.006
ggy-0.006
 important-0.006
 understand-0.006
Token.
Token ID13
Feature activation+2.36e-01

Pos logit contributions
 caring+0.062
 important+0.060
 forgiven+0.060
isi+0.060
ggy+0.059
Neg logit contributions
 hills-0.061
 valleys-0.060
 rivers-0.058
 oceans-0.056
ASH-0.056
Token Lily
Token ID20037
Feature activation-1.58e+00

Pos logit contributions
 valleys+0.008
 hills+0.008
 oceans+0.007
 rivers+0.007
ASH+0.007
Neg logit contributions
 caring-0.004
ggy-0.004
 important-0.004
 lesson-0.004
ggie-0.004
Token told
Token ID1297
Feature activation-2.22e+00

Pos logit contributions
 lesson+0.030
 important+0.029
 caring+0.029
ggy+0.028
 valuable+0.027
Neg logit contributions
 valleys-0.047
ASH-0.046
 hills-0.046
 rivers-0.045
 mountains-0.044
Token her
Token ID607
Feature activation-5.84e+00

Pos logit contributions
 valuable+0.066
 rewarding+0.065
 important+0.065
 lesson+0.063
ggy+0.062
Neg logit contributions
 hills-0.042
 rivers-0.041
 mountains-0.041
ASH-0.041
ting-0.040
Token that
Token ID326
Feature activation-2.54e+00

Pos logit contributions
live+0.141
Jerry+0.139
isi+0.136
 rewarding+0.135
Deal+0.127
Neg logit contributions
 window-0.132
 curtains-0.124
 asleep-0.120
 down-0.117
 door-0.114
Token she
Token ID673
Feature activation-1.79e+00

Pos logit contributions
 caring+0.049
 important+0.048
ggy+0.048
 lesson+0.047
 forgiven+0.045
Neg logit contributions
 valleys-0.082
 hills-0.080
 rivers-0.078
 oceans-0.078
 mountains-0.076
Token couldn
Token ID3521
Feature activation-1.11e+00

Pos logit contributions
 caring+0.038
 lesson+0.036
ggy+0.036
isi+0.035
ggie+0.034
Neg logit contributions
 valleys-0.049
 hills-0.048
ASH-0.046
 mountains-0.046
 oceans-0.046
Token't
Token ID470
Feature activation-1.73e+00

Pos logit contributions
 caring+0.022
 important+0.022
 lesson+0.021
ggy+0.021
 forgiven+0.020
Neg logit contributions
 valleys-0.036
 hills-0.035
 oceans-0.034
 rivers-0.034
 mountains-0.033
Token find
Token ID1064
Feature activation+1.02e-01

Pos logit contributions
 caring+0.039
 important+0.039
 lesson+0.038
ggy+0.037
 forgiven+0.036
Neg logit contributions
 valleys-0.050
 hills-0.049
 rivers-0.047
 oceans-0.047
ASH-0.046
Token was
Token ID373
Feature activation+4.76e-01

Pos logit contributions
Days+0.085
isy+0.081
isi+0.080
 caring+0.077
Years+0.077
Neg logit contributions
 hills-0.082
 mountains-0.078
 lands-0.071
 valleys-0.070
 lakes-0.070
Token wrong
Token ID2642
Feature activation-2.16e+00

Pos logit contributions
 valleys+0.016
 hills+0.016
 oceans+0.016
 rivers+0.016
 mountains+0.015
Neg logit contributions
 caring-0.008
 lesson-0.008
ggy-0.007
isi-0.007
 forgiven-0.007
Token.
Token ID13
Feature activation+3.82e-01

Pos logit contributions
 caring+0.050
 important+0.049
ggy+0.048
 lesson+0.048
 forgiven+0.047
Neg logit contributions
 valleys-0.059
 hills-0.059
 rivers-0.057
 oceans-0.056
 mountains-0.055
Token Tim
Token ID5045
Feature activation-1.59e+00

Pos logit contributions
 valleys+0.014
 oceans+0.013
 rivers+0.013
 hills+0.013
 lakes+0.013
Neg logit contributions
 caring-0.006
 important-0.006
ggy-0.006
 lesson-0.006
 forgiven-0.006
Token told
Token ID1297
Feature activation-2.12e+00

Pos logit contributions
 caring+0.035
 lesson+0.034
 important+0.033
 valuable+0.033
ggy+0.033
Neg logit contributions
ASH-0.042
 hills-0.040
 valleys-0.039
 rivers-0.038
 mountains-0.038
Token her
Token ID607
Feature activation-5.83e+00

Pos logit contributions
 valuable+0.061
 rewarding+0.060
 important+0.058
 importantly+0.056
 caring+0.054
Neg logit contributions
 hills-0.047
ASH-0.046
 mountains-0.045
 rivers-0.044
 valleys-0.042
Token that
Token ID326
Feature activation-2.47e+00

Pos logit contributions
live+0.152
 rewarding+0.146
isi+0.141
Jerry+0.138
Days+0.136
Neg logit contributions
 window-0.114
 down-0.109
 door-0.106
 curtains-0.101
 hills-0.101
Token he
Token ID339
Feature activation-1.71e+00

Pos logit contributions
 caring+0.043
 important+0.043
ggy+0.042
 lesson+0.041
 forgiven+0.039
Neg logit contributions
 valleys-0.083
 hills-0.082
 oceans-0.080
 rivers-0.080
 mountains-0.078
Token did
Token ID750
Feature activation-1.65e+00

Pos logit contributions
 caring+0.036
ggy+0.034
 lesson+0.033
isi+0.032
ggie+0.032
Neg logit contributions
 hills-0.046
 valleys-0.046
ASH-0.045
 mountains-0.044
 oceans-0.044
Token not
Token ID407
Feature activation-1.76e+00

Pos logit contributions
isi+0.030
live+0.026
oto+0.026
uddy+0.023
Jerry+0.023
Neg logit contributions
 hills-0.049
 valleys-0.048
 rivers-0.047
 oceans-0.046
ASH-0.045
Token understand
Token ID1833
Feature activation-2.64e+00

Pos logit contributions
ggy+0.033
 caring+0.032
 important+0.031
 lesson+0.031
ggie+0.030
Neg logit contributions
 hills-0.053
 valleys-0.053
 oceans-0.052
 rivers-0.052
ASH-0.050
Token her
Token ID607
Feature activation-1.17e+00

Pos logit contributions
 valleys+0.008
 hills+0.007
 oceans+0.007
 rivers+0.007
ASH+0.007
Neg logit contributions
 caring-0.004
 important-0.004
 lesson-0.004
ggy-0.004
 forgiven-0.004
Token mom
Token ID1995
Feature activation-1.24e+00

Pos logit contributions
 caring+0.018
 lesson+0.018
 important+0.017
isi+0.017
ggy+0.017
Neg logit contributions
 hills-0.041
 window-0.039
 valleys-0.039
 mountains-0.039
ASH-0.038
Tokenmy
Token ID1820
Feature activation-9.93e-01

Pos logit contributions
 caring+0.033
 important+0.033
ggy+0.033
 lesson+0.033
ggie+0.031
Neg logit contributions
 valleys-0.030
 hills-0.029
 rivers-0.028
 oceans-0.028
 mountains-0.027
Token and
Token ID290
Feature activation-4.09e-01

Pos logit contributions
 caring+0.026
 important+0.025
 lesson+0.024
ggy+0.024
 forgiven+0.024
Neg logit contributions
 valleys-0.026
 hills-0.025
 oceans-0.024
 rivers-0.024
ASH-0.024
Token told
Token ID1297
Feature activation-1.94e+00

Pos logit contributions
 lesson+0.010
ggy+0.010
 important+0.009
ggie+0.009
 valuable+0.009
Neg logit contributions
 window-0.010
 hills-0.010
 valleys-0.010
ASH-0.010
 down-0.010
Token her
Token ID607
Feature activation-5.83e+00

Pos logit contributions
 valuable+0.049
ggy+0.049
 important+0.049
 rewarding+0.048
live+0.048
Neg logit contributions
 hills-0.043
 rivers-0.042
 valleys-0.042
ASH-0.041
 mountains-0.041
Token what
Token ID644
Feature activation-3.27e+00

Pos logit contributions
Jerry+0.145
live+0.143
isi+0.140
scar+0.124
ggie+0.122
Neg logit contributions
 window-0.145
 down-0.139
 curtains-0.138
 asleep-0.132
 railing-0.125
Token happened
Token ID3022
Feature activation-1.17e+00

Pos logit contributions
isi+0.059
Days+0.056
oto+0.053
Years+0.053
 caring+0.052
Neg logit contributions
 hills-0.095
 mountains-0.089
 valleys-0.088
 lands-0.087
 rivers-0.086
Token.
Token ID13
Feature activation-4.80e-01

Pos logit contributions
 caring+0.024
 important+0.024
ggy+0.024
 lesson+0.023
 forgiven+0.022
Neg logit contributions
 valleys-0.036
 hills-0.035
 oceans-0.034
 rivers-0.034
ASH-0.033
Token "
Token ID366
Feature activation+2.33e+00

Pos logit contributions
 caring+0.008
 important+0.008
 lesson+0.007
ggy+0.007
 forgiven+0.007
Neg logit contributions
 valleys-0.017
 hills-0.017
 oceans-0.016
 rivers-0.016
ASH-0.016
TokenMom
Token ID29252
Feature activation-5.41e-01

Pos logit contributions
 valleys+0.066
 hills+0.063
ASH+0.062
 oceans+0.062
 rivers+0.062
Neg logit contributions
 important-0.053
 caring-0.053
 lesson-0.053
ggy-0.052
 forgiven-0.049
Token to
Token ID284
Feature activation-5.99e-02

Pos logit contributions
 valleys+0.065
ASH+0.063
 oceans+0.061
 rivers+0.060
 lakes+0.059
Neg logit contributions
 important-0.045
 caring-0.044
ggy-0.042
 lesson-0.042
 not-0.041
Token her
Token ID607
Feature activation-6.94e-01

Pos logit contributions
 caring+0.002
 important+0.001
 lesson+0.001
ggy+0.001
 understand+0.001
Neg logit contributions
 valleys-0.002
 oceans-0.002
 rivers-0.002
 hills-0.002
 lakes-0.002
Token mom
Token ID1995
Feature activation-7.91e-01

Pos logit contributions
 caring+0.015
 important+0.015
 lesson+0.014
ggy+0.014
 forgiven+0.014
Neg logit contributions
 valleys-0.021
 hills-0.020
 rivers-0.020
 oceans-0.019
 mountains-0.019
Token and
Token ID290
Feature activation-4.09e-01

Pos logit contributions
 caring+0.018
 important+0.018
 forgiven+0.017
 lesson+0.016
ggy+0.016
Neg logit contributions
 valleys-0.024
 hills-0.022
 oceans-0.022
 rivers-0.022
ASH-0.021
Token told
Token ID1297
Feature activation-2.14e+00

Pos logit contributions
 important+0.009
 lesson+0.009
ggy+0.009
 valuable+0.009
ggie+0.009
Neg logit contributions
 hills-0.011
 valleys-0.010
 window-0.010
ASH-0.010
 mountains-0.010
Token her
Token ID607
Feature activation-5.83e+00

Pos logit contributions
ggy+0.055
live+0.054
 valuable+0.052
 rewarding+0.052
Jerry+0.052
Neg logit contributions
 hills-0.047
 rivers-0.046
 mountains-0.046
 valleys-0.046
 oceans-0.045
Token what
Token ID644
Feature activation-3.73e+00

Pos logit contributions
live+0.122
Jerry+0.120
isi+0.119
scar+0.103
ggie+0.102
Neg logit contributions
 window-0.155
 hills-0.154
 mountains-0.151
 down-0.150
 asleep-0.149
Token happened
Token ID3022
Feature activation-1.25e+00

Pos logit contributions
isi+0.062
oto+0.062
Days+0.060
 caring+0.059
 lesson+0.058
Neg logit contributions
 hills-0.108
 mountains-0.105
 valleys-0.104
 rivers-0.102
 oceans-0.102
Token.
Token ID13
Feature activation-9.33e-01

Pos logit contributions
 important+0.024
 caring+0.024
 lesson+0.023
ggy+0.023
 forgiven+0.022
Neg logit contributions
 valleys-0.041
 hills-0.040
 oceans-0.039
 rivers-0.039
ASH-0.038
Token Her
Token ID2332
Feature activation-3.64e+00

Pos logit contributions
 caring+0.018
 important+0.018
ggy+0.018
 lesson+0.017
ggie+0.016
Neg logit contributions
 valleys-0.029
 hills-0.029
 oceans-0.028
ASH-0.028
 rivers-0.028
Token mom
Token ID1995
Feature activation-1.71e+00

Pos logit contributions
isi+0.066
Jerry+0.057
ggie+0.056
 caring+0.055
live+0.054
Neg logit contributions
 window-0.118
 hills-0.115
 railing-0.112
ASH-0.110
 mountains-0.107
Token wrong
Token ID2642
Feature activation-2.27e+00

Pos logit contributions
 valleys+0.046
 hills+0.044
 oceans+0.044
 rivers+0.044
ASH+0.043
Neg logit contributions
 important-0.014
 caring-0.013
ggy-0.013
 lesson-0.013
 forgiven-0.011
Token.
Token ID13
Feature activation+5.11e-01

Pos logit contributions
 caring+0.051
 important+0.049
 lesson+0.048
ggy+0.048
 forgiven+0.047
Neg logit contributions
 valleys-0.064
 hills-0.064
 rivers-0.062
 oceans-0.061
 mountains-0.060
Token Tim
Token ID5045
Feature activation-1.51e+00

Pos logit contributions
 valleys+0.019
 hills+0.018
 oceans+0.018
 rivers+0.018
 lakes+0.017
Neg logit contributions
 caring-0.008
 important-0.008
 lesson-0.008
ggy-0.008
 forgiven-0.007
Tokenmy
Token ID1820
Feature activation-1.18e+00

Pos logit contributions
 caring+0.045
 lesson+0.044
ggy+0.044
 important+0.043
 valuable+0.043
Neg logit contributions
ASH-0.032
 valleys-0.030
 hills-0.029
 rivers-0.028
 oceans-0.028
Token told
Token ID1297
Feature activation-2.27e+00

Pos logit contributions
 lesson+0.024
 caring+0.023
 important+0.023
ggy+0.022
 valuable+0.022
Neg logit contributions
 valleys-0.033
ASH-0.033
 hills-0.032
 mountains-0.032
 rivers-0.031
Token her
Token ID607
Feature activation-5.82e+00

Pos logit contributions
 valuable+0.062
 rewarding+0.062
 important+0.060
 lesson+0.058
live+0.057
Neg logit contributions
 hills-0.050
 rivers-0.049
 mountains-0.049
ASH-0.048
 valleys-0.047
Token about
Token ID546
Feature activation-2.45e+00

Pos logit contributions
live+0.128
Jerry+0.126
 rewarding+0.123
isi+0.122
Deal+0.115
Neg logit contributions
 window-0.136
 asleep-0.135
 mountains-0.135
 hills-0.134
 curtains-0.130
Token his
Token ID465
Feature activation-1.52e+00

Pos logit contributions
 caring+0.044
 important+0.044
 lesson+0.043
ggy+0.042
 forgiven+0.040
Neg logit contributions
 valleys-0.082
 hills-0.080
 oceans-0.078
 rivers-0.078
 mountains-0.076
Token bad
Token ID2089
Feature activation-4.50e-01

Pos logit contributions
 important+0.031
 caring+0.031
 lesson+0.030
ggy+0.029
 forgiven+0.027
Neg logit contributions
 valleys-0.049
 hills-0.047
 oceans-0.046
 rivers-0.046
ASH-0.045
Token dream
Token ID4320
Feature activation-9.49e-01

Pos logit contributions
 caring+0.009
 important+0.008
ggy+0.008
 lesson+0.008
 forgiven+0.008
Neg logit contributions
 valleys-0.015
 hills-0.014
 oceans-0.014
 rivers-0.014
 mountains-0.014
Token and
Token ID290
Feature activation-1.27e+00

Pos logit contributions
 important+0.021
 caring+0.021
 lesson+0.021
ggy+0.021
 forgiven+0.019
Neg logit contributions
 valleys-0.028
 hills-0.027
 oceans-0.026
 rivers-0.026
ASH-0.026
Token airplane
Token ID19401
Feature activation+1.27e+00

Pos logit contributions
 valleys+0.033
ASH+0.031
 rivers+0.030
 oceans+0.030
 hills+0.030
Neg logit contributions
 important-0.022
 caring-0.022
 bad-0.021
 lesson-0.020
 wonderful-0.020
Token that
Token ID326
Feature activation+7.37e-01

Pos logit contributions
 valleys+0.048
ASH+0.046
 oceans+0.045
 hills+0.045
 rivers+0.045
Neg logit contributions
ggy-0.019
 important-0.019
 caring-0.017
 lesson-0.017
ggie-0.016
Token flew
Token ID13112
Feature activation+2.73e+00

Pos logit contributions
 valleys+0.021
 hills+0.021
ASH+0.021
 oceans+0.020
 rivers+0.020
Neg logit contributions
 important-0.017
 lesson-0.016
 caring-0.015
 not-0.015
 valuable-0.015
Token high
Token ID1029
Feature activation+3.27e+00

Pos logit contributions
ASH+0.084
 valleys+0.080
 railing+0.076
ACK+0.076
 lakes+0.073
Neg logit contributions
 important-0.064
 lesson-0.058
 valuable-0.056
 not-0.055
 caring-0.054
Token in
Token ID287
Feature activation+2.54e+00

Pos logit contributions
ASH+0.101
 valleys+0.101
 railing+0.097
ACK+0.095
 lakes+0.095
Neg logit contributions
 important-0.065
 lesson-0.058
 not-0.054
 about-0.052
 bad-0.052
Token the
Token ID262
Feature activation+5.05e+00

Pos logit contributions
als+0.097
ays+0.095
rol+0.095
 railing+0.095
ting+0.094
Neg logit contributions
 important-0.028
 wonderful-0.025
 lesson-0.024
 caring-0.023
 bad-0.021
Token sky
Token ID6766
Feature activation+1.68e+00

Pos logit contributions
 valleys+0.081
ASH+0.078
ting+0.078
 railing+0.076
 hills+0.076
Neg logit contributions
 important-0.171
 wonderful-0.159
 lesson-0.158
 bad-0.158
 valuable-0.154
Token.
Token ID13
Feature activation+1.78e+00

Pos logit contributions
 valleys+0.069
ASH+0.068
ACK+0.066
 railing+0.065
 oceans+0.065
Neg logit contributions
 important-0.017
 about-0.015
 not-0.014
ggy-0.014
 lesson-0.013
Token The
Token ID383
Feature activation+5.03e-01

Pos logit contributions
 valleys+0.062
ASH+0.059
 oceans+0.059
 rivers+0.058
 streams+0.058
Neg logit contributions
 important-0.026
 lesson-0.025
 caring-0.024
 understand-0.024
 about-0.023
Token airplane
Token ID19401
Feature activation+1.20e+00

Pos logit contributions
 valleys+0.011
 hills+0.010
ASH+0.010
 oceans+0.010
 rivers+0.010
Neg logit contributions
 important-0.015
 caring-0.015
 lesson-0.015
ggy-0.014
 valuable-0.014
Token was
Token ID373
Feature activation+1.30e+00

Pos logit contributions
 valleys+0.046
 hills+0.044
 oceans+0.044
ASH+0.044
 rivers+0.044
Neg logit contributions
 important-0.017
 lesson-0.016
ggy-0.015
 caring-0.015
 valuable-0.013
Token must
Token ID1276
Feature activation-6.14e-01

Pos logit contributions
 caring+0.014
 important+0.014
 lesson+0.014
ggy+0.013
 forgiven+0.012
Neg logit contributions
 valleys-0.039
 hills-0.038
 rivers-0.037
 oceans-0.037
 mountains-0.036
Token never
Token ID1239
Feature activation-7.46e-01

Pos logit contributions
 caring+0.010
 important+0.010
 lesson+0.010
ggy+0.010
 forgiven+0.009
Neg logit contributions
 valleys-0.021
 hills-0.020
 rivers-0.020
 oceans-0.020
ASH-0.020
Token go
Token ID467
Feature activation+1.61e+00

Pos logit contributions
 caring+0.016
 important+0.016
ggy+0.016
 lesson+0.016
 forgiven+0.015
Neg logit contributions
 valleys-0.023
 hills-0.022
 oceans-0.021
 rivers-0.021
 mountains-0.020
Token near
Token ID1474
Feature activation+1.40e+00

Pos logit contributions
 valleys+0.049
ASH+0.047
 oceans+0.045
 rivers+0.045
 lakes+0.045
Neg logit contributions
 important-0.036
 caring-0.035
 not-0.035
 bad-0.034
ggy-0.032
Token the
Token ID262
Feature activation+3.10e+00

Pos logit contributions
 valleys+0.051
 oceans+0.048
 hills+0.047
 rivers+0.047
ASH+0.046
Neg logit contributions
 important-0.023
 caring-0.022
ggy-0.021
 bad-0.021
 lesson-0.020
Token chim
Token ID18205
Feature activation+4.99e+00

Pos logit contributions
 valleys+0.073
ays+0.068
 oceans+0.066
ASH+0.064
 rivers+0.063
Neg logit contributions
 bad-0.089
 important-0.088
 lesson-0.086
 caring-0.079
 wonderful-0.079
Tokenney
Token ID1681
Feature activation+3.13e+00

Pos logit contributions
 valleys+0.142
 oceans+0.137
 rivers+0.134
 hills+0.132
 streams+0.130
Neg logit contributions
 important-0.111
 lesson-0.106
ggy-0.106
 caring-0.106
 forgiven-0.100
Token!
Token ID0
Feature activation+1.29e+00

Pos logit contributions
 valleys+0.108
 oceans+0.102
 rivers+0.100
 lakes+0.098
 hills+0.097
Neg logit contributions
 important-0.052
 lesson-0.051
 about-0.051
 not-0.050
 bad-0.050
Token Do
Token ID2141
Feature activation-9.06e-01

Pos logit contributions
 valleys+0.034
 hills+0.033
 oceans+0.032
 rivers+0.032
 mountains+0.031
Neg logit contributions
 caring-0.033
 important-0.033
ggy-0.032
 lesson-0.032
 forgiven-0.031
Token you
Token ID345
Feature activation-2.29e+00

Pos logit contributions
 valuable+0.027
 lesson+0.027
 forgiven+0.026
 important+0.026
 caring+0.026
Neg logit contributions
 hills-0.015
 mountains-0.014
 oceans-0.014
 rivers-0.013
 valleys-0.013
Token understand
Token ID1833
Feature activation-2.16e+00

Pos logit contributions
 caring+0.031
 important+0.030
ggy+0.028
 lesson+0.028
 forgiven+0.025
Neg logit contributions
 valleys-0.085
 hills-0.083
 rivers-0.081
 oceans-0.081
ASH-0.079
Token
Token ID198
Feature activation+2.27e-01

Pos logit contributions
 caring+0.028
 important+0.028
 forgiven+0.026
 lesson+0.026
isi+0.026
Neg logit contributions
 hills-0.036
 valleys-0.036
 rivers-0.033
 oceans-0.033
 mountains-0.032
Token"
Token ID1
Feature activation+2.95e+00

Pos logit contributions
 valleys+0.008
 hills+0.008
 rivers+0.008
 oceans+0.008
 mountains+0.007
Neg logit contributions
 caring-0.004
 important-0.004
 lesson-0.004
ggy-0.004
 forgiven-0.004
TokenHey
Token ID10814
Feature activation+1.78e+00

Pos logit contributions
 valleys+0.073
 hills+0.070
 oceans+0.068
 rivers+0.068
ASH+0.066
Neg logit contributions
 caring-0.080
 important-0.080
 lesson-0.078
ggy-0.078
 forgiven-0.075
Token,
Token ID11
Feature activation+2.66e+00

Pos logit contributions
 valleys+0.052
 hills+0.051
 rivers+0.049
 oceans+0.049
ASH+0.049
Neg logit contributions
 important-0.040
 caring-0.040
 lesson-0.039
 valuable-0.038
 forgiven-0.037
Token stop
Token ID2245
Feature activation+2.44e+00

Pos logit contributions
 valleys+0.084
 hills+0.082
 rivers+0.080
 oceans+0.080
ASH+0.077
Neg logit contributions
 caring-0.054
 important-0.054
 lesson-0.052
ggy-0.051
 forgiven-0.049
Token!
Token ID0
Feature activation+5.76e+00

Pos logit contributions
 valleys+0.086
 hills+0.082
 oceans+0.081
 rivers+0.081
ASH+0.079
Neg logit contributions
 caring-0.041
ggy-0.041
 important-0.038
 bad-0.038
 not-0.037
Token That
Token ID1320
Feature activation+2.06e+00

Pos logit contributions
 valleys+0.167
 oceans+0.161
 hills+0.160
 rivers+0.160
 mountains+0.149
Neg logit contributions
ggy-0.131
 caring-0.122
 bad-0.118
 lesson-0.118
 forgiveness-0.118
Token's
Token ID338
Feature activation+1.43e+00

Pos logit contributions
 valleys+0.061
 hills+0.058
 oceans+0.058
 rivers+0.057
 lakes+0.055
Neg logit contributions
 important-0.047
ggy-0.046
 caring-0.046
 lesson-0.046
 forgiven-0.043
Token our
Token ID674
Feature activation-1.13e+00

Pos logit contributions
ASH+0.042
 valleys+0.042
 hills+0.042
 rivers+0.042
 oceans+0.041
Neg logit contributions
oral-0.027
 lesson-0.026
 understand-0.025
 caring-0.025
 forgiveness-0.025
Token snow
Token ID6729
Feature activation+1.23e+00

Pos logit contributions
 lesson+0.031
 important+0.030
 bad+0.028
 caring+0.027
ggy+0.027
Neg logit contributions
 valleys-0.032
 oceans-0.030
 rivers-0.029
ting-0.029
 hills-0.029
Tokenman
Token ID805
Feature activation-3.06e-01

Pos logit contributions
 valleys+0.045
 oceans+0.043
 rivers+0.042
 hills+0.042
 streams+0.041
Neg logit contributions
ggy-0.023
 lesson-0.020
 important-0.020
ggie-0.019
 caring-0.019
Token stars
Token ID5788
Feature activation+1.71e+00

Pos logit contributions
ASH+0.060
ACK+0.058
ays+0.054
 railing+0.054
 valleys+0.054
Neg logit contributions
 wonderful-0.048
 important-0.047
 bad-0.046
 caring-0.045
 kind-0.043
Token shining
Token ID22751
Feature activation+2.98e+00

Pos logit contributions
ASH+0.060
ACK+0.056
 valleys+0.054
 hills+0.052
 railing+0.051
Neg logit contributions
 caring-0.029
 important-0.028
 about-0.028
ggy-0.028
 that-0.027
Token in
Token ID287
Feature activation+1.79e+00

Pos logit contributions
ASH+0.094
ACK+0.089
 railing+0.086
 valleys+0.084
idd+0.080
Neg logit contributions
 important-0.059
 wonderful-0.057
 caring-0.053
 bad-0.053
 lesson-0.052
Token the
Token ID262
Feature activation+3.65e+00

Pos logit contributions
 valleys+0.064
ASH+0.063
 hills+0.062
 railing+0.061
 oceans+0.061
Neg logit contributions
 caring-0.025
 important-0.024
 lesson-0.023
 wonderful-0.022
 bad-0.021
Token in
Token ID287
Feature activation+1.64e+00

Pos logit contributions
ASH+0.079
ays+0.074
ACK+0.073
 valleys+0.072
unks+0.067
Neg logit contributions
 important-0.112
 wonderful-0.111
 bad-0.108
 lesson-0.105
 valuable-0.100
Tokenky
Token ID2584
Feature activation+4.94e+00

Pos logit contributions
 valleys+0.056
 hills+0.055
ASH+0.054
 oceans+0.053
 rivers+0.053
Neg logit contributions
 caring-0.026
 important-0.026
 lesson-0.026
ggy-0.024
 wonderful-0.023
Token sky
Token ID6766
Feature activation+7.27e-01

Pos logit contributions
ASH+0.126
ACK+0.126
ays+0.123
 valleys+0.120
 railing+0.117
Neg logit contributions
 important-0.109
 wonderful-0.104
ggy-0.099
 kind-0.099
 story-0.097
Token.
Token ID13
Feature activation+2.35e-01

Pos logit contributions
ASH+0.028
 valleys+0.027
ACK+0.026
 hills+0.026
 railing+0.026
Neg logit contributions
 important-0.009
 caring-0.009
ggy-0.008
 wonderful-0.008
 about-0.008
Token He
Token ID679
Feature activation+4.16e-01

Pos logit contributions
 valleys+0.008
 hills+0.007
 oceans+0.007
 rivers+0.007
ASH+0.007
Neg logit contributions
 caring-0.005
 important-0.005
 lesson-0.004
ggy-0.004
 forgiven-0.004
Token turned
Token ID2900
Feature activation+3.20e+00

Pos logit contributions
 valleys+0.016
ASH+0.015
 hills+0.015
 oceans+0.015
 rivers+0.015
Neg logit contributions
 caring-0.006
 important-0.006
ggy-0.006
 lesson-0.005
 understood-0.005
Token to
Token ID284
Feature activation+1.47e+00

Pos logit contributions
ASH+0.099
 valleys+0.091
ACK+0.091
 hills+0.086
 railing+0.085
Neg logit contributions
 important-0.070
 caring-0.068
 lesson-0.067
 valuable-0.065
 wonderful-0.065
Token,
Token ID11
Feature activation+7.35e-01

Pos logit contributions
 caring+0.014
 important+0.014
 lesson+0.013
ggy+0.013
 bad+0.012
Neg logit contributions
 valleys-0.035
ASH-0.034
 oceans-0.033
 hills-0.033
 rivers-0.033
Token the
Token ID262
Feature activation+4.33e-01

Pos logit contributions
ASH+0.028
 valleys+0.027
 oceans+0.026
 hills+0.026
 railing+0.026
Neg logit contributions
 caring-0.010
 lesson-0.010
 important-0.010
ggy-0.009
 bad-0.009
Token lion
Token ID18744
Feature activation+8.37e-01

Pos logit contributions
 valleys+0.013
 oceans+0.012
 hills+0.012
ASH+0.012
 rivers+0.012
Neg logit contributions
 important-0.010
 caring-0.010
 lesson-0.010
ggy-0.009
 bad-0.009
Token and
Token ID290
Feature activation+2.29e-01

Pos logit contributions
 valleys+0.028
ASH+0.027
 hills+0.027
 oceans+0.027
 rivers+0.027
Neg logit contributions
ggy-0.015
 important-0.014
 caring-0.014
 lesson-0.014
 forgiven-0.013
Token mouse
Token ID10211
Feature activation+3.26e-02

Pos logit contributions
 valleys+0.007
 hills+0.007
 oceans+0.007
ASH+0.007
 rivers+0.007
Neg logit contributions
 caring-0.004
 important-0.004
 lesson-0.004
ggy-0.004
 forgiven-0.004
Token stepped
Token ID10764
Feature activation+4.95e+00

Pos logit contributions
 valleys+0.001
ASH+0.001
 hills+0.001
 oceans+0.001
 rivers+0.001
Neg logit contributions
ggy-0.001
 important-0.001
 caring-0.001
 lesson-0.001
 forgiven-0.001
Token out
Token ID503
Feature activation+9.83e-01

Pos logit contributions
ASH+0.142
ACK+0.128
 valleys+0.127
hole+0.119
icer+0.118
Neg logit contributions
 important-0.100
 wonderful-0.098
 bad-0.097
 about-0.094
 lesson-0.094
Token of
Token ID286
Feature activation+5.76e-01

Pos logit contributions
ASH+0.037
 valleys+0.035
ACK+0.035
 oceans+0.034
 hills+0.034
Neg logit contributions
 lesson-0.013
 caring-0.013
 important-0.013
 wonderful-0.012
 bad-0.011
Token the
Token ID262
Feature activation+3.52e+00

Pos logit contributions
 valleys+0.019
 hills+0.019
 oceans+0.018
 rivers+0.018
ASH+0.018
Neg logit contributions
 caring-0.010
 important-0.010
 lesson-0.010
ggy-0.010
 forgiven-0.009
Token maze
Token ID31237
Feature activation+3.32e+00

Pos logit contributions
 valleys+0.089
ASH+0.087
ays+0.085
 oceans+0.083
 rivers+0.081
Neg logit contributions
 important-0.093
 lesson-0.090
 bad-0.087
 wonderful-0.085
 valuable-0.084
Token and
Token ID290
Feature activation+2.13e+00

Pos logit contributions
ASH+0.116
ACK+0.111
 valleys+0.110
 oceans+0.108
 lakes+0.107
Neg logit contributions
 important-0.045
 about-0.045
 lesson-0.045
 not-0.041
 that-0.040
Token a
Token ID257
Feature activation-1.49e+00

Pos logit contributions
 valleys+0.031
 oceans+0.029
 hills+0.029
 rivers+0.029
ASH+0.028
Neg logit contributions
 important-0.019
 caring-0.019
 lesson-0.018
 bad-0.017
ggy-0.017
Token bath
Token ID7837
Feature activation+4.36e-01

Pos logit contributions
 caring+0.039
 important+0.039
 lesson+0.038
ggy+0.037
 valuable+0.036
Neg logit contributions
 valleys-0.040
 hills-0.037
 rivers-0.037
 oceans-0.037
ASH-0.035
Token,"
Token ID553
Feature activation-1.39e+00

Pos logit contributions
 valleys+0.014
 rivers+0.014
 oceans+0.013
 hills+0.013
 lakes+0.013
Neg logit contributions
 important-0.009
 caring-0.008
 lesson-0.008
ggy-0.008
 forgiven-0.008
Token Lily
Token ID20037
Feature activation-1.05e+00

Pos logit contributions
vern+0.028
 important+0.027
 lesson+0.027
ggy+0.027
isi+0.027
Neg logit contributions
 hills-0.038
 valleys-0.038
 rivers-0.036
 oceans-0.036
 mountains-0.036
Token said
Token ID531
Feature activation-2.40e+00

Pos logit contributions
 lesson+0.021
 caring+0.021
 important+0.020
ggy+0.019
 valuable+0.019
Neg logit contributions
ASH-0.029
 hills-0.029
 valleys-0.028
 rivers-0.027
 oceans-0.027
Token,
Token ID11
Feature activation+1.24e+00

Pos logit contributions
 lesson+0.058
 important+0.057
 caring+0.057
isi+0.056
oral+0.056
Neg logit contributions
 hills-0.057
 valleys-0.057
ASH-0.053
 rivers-0.053
 oceans-0.053
Token trying
Token ID2111
Feature activation-2.28e+00

Pos logit contributions
 valleys+0.041
 hills+0.039
 oceans+0.039
 rivers+0.038
 lakes+0.038
Neg logit contributions
 caring-0.026
 important-0.025
 not-0.024
ggy-0.023
 lesson-0.023
Token to
Token ID284
Feature activation+1.06e-01

Pos logit contributions
 caring+0.053
 important+0.053
ggy+0.052
isi+0.051
 forgiven+0.051
Neg logit contributions
 hills-0.058
 valleys-0.057
 rivers-0.054
 oceans-0.054
ASH-0.053
Token resist
Token ID4180
Feature activation-4.87e-01

Pos logit contributions
 valleys+0.003
 hills+0.003
 oceans+0.003
 rivers+0.003
 mountains+0.003
Neg logit contributions
 caring-0.002
 important-0.002
 lesson-0.002
ggy-0.002
 forgiven-0.002
Token.
Token ID13
Feature activation-4.48e-01

Pos logit contributions
 caring+0.009
 important+0.009
ggy+0.008
 lesson+0.008
 bad+0.007
Neg logit contributions
 valleys-0.017
 oceans-0.017
 hills-0.016
 rivers-0.016
ASH-0.016
Token
Token ID198
Feature activation-2.47e+00

Pos logit contributions
 caring+0.008
 important+0.008
 lesson+0.007
ggy+0.007
 forgiven+0.007
Neg logit contributions
 valleys-0.016
 hills-0.015
 oceans-0.015
 rivers-0.015
 lakes-0.014
Token.
Token ID13
Feature activation+2.36e-01

Pos logit contributions
 valleys+0.067
ASH+0.063
 hills+0.062
 oceans+0.062
 railing+0.062
Neg logit contributions
 important-0.023
 caring-0.022
 about-0.021
 lesson-0.021
ggy-0.021
Token
Token ID198
Feature activation-1.51e+00

Pos logit contributions
 valleys+0.008
 oceans+0.008
 rivers+0.008
 hills+0.008
 lakes+0.008
Neg logit contributions
 caring-0.004
 important-0.004
ggy-0.004
 lesson-0.004
 forgiven-0.003
Token
Token ID198
Feature activation-3.78e-01

Pos logit contributions
 important+0.028
 caring+0.028
 lesson+0.027
 forgiven+0.026
ggy+0.025
Neg logit contributions
 valleys-0.050
 hills-0.049
 rivers-0.047
 oceans-0.046
 mountains-0.045
TokenThe
Token ID464
Feature activation-2.25e-01

Pos logit contributions
 caring+0.006
 important+0.006
ggy+0.006
 lesson+0.006
 forgiven+0.006
Neg logit contributions
 valleys-0.013
 hills-0.013
 oceans-0.013
 rivers-0.013
ASH-0.012
Token wind
Token ID2344
Feature activation+1.46e+00

Pos logit contributions
 important+0.005
 lesson+0.005
 caring+0.005
 bad+0.004
ggy+0.004
Neg logit contributions
 valleys-0.007
ASH-0.007
 oceans-0.007
 rivers-0.007
 hills-0.007
Token was
Token ID373
Feature activation+1.34e+00

Pos logit contributions
 valleys+0.051
 hills+0.049
 oceans+0.049
 rivers+0.048
 mountains+0.047
Neg logit contributions
ggy-0.026
 important-0.025
 lesson-0.025
 caring-0.025
 forgiven-0.023
Token still
Token ID991
Feature activation+1.15e+00

Pos logit contributions
 valleys+0.037
 hills+0.036
 oceans+0.035
 rivers+0.035
ASH+0.034
Neg logit contributions
 caring-0.033
 important-0.033
 lesson-0.032
ggy-0.031
 forgiven-0.030
Token blowing
Token ID19280
Feature activation+2.21e+00

Pos logit contributions
 valleys+0.035
 hills+0.032
ASH+0.032
 oceans+0.032
 rivers+0.032
Neg logit contributions
 important-0.028
 caring-0.027
 not-0.025
 bad-0.025
 lesson-0.025
Token all
Token ID477
Feature activation+1.82e-01

Pos logit contributions
 valleys+0.075
ASH+0.073
 oceans+0.072
ACK+0.070
 railing+0.069
Neg logit contributions
 important-0.041
 bad-0.039
 caring-0.038
 lesson-0.037
 wonderful-0.036
Token around
Token ID1088
Feature activation+1.65e+00

Pos logit contributions
 valleys+0.006
 oceans+0.006
 hills+0.006
 rivers+0.006
ASH+0.006
Neg logit contributions
 important-0.004
 lesson-0.004
 caring-0.004
ggy-0.003
 bad-0.003
Token the
Token ID262
Feature activation+2.76e+00

Pos logit contributions
 valleys+0.063
 oceans+0.062
ASH+0.061
 hills+0.060
 rivers+0.060
Neg logit contributions
 caring-0.020
 important-0.019
ggy-0.019
 lesson-0.018
 bad-0.018
Token have
Token ID423
Feature activation-7.22e-01

Pos logit contributions
 caring+0.012
 important+0.012
 lesson+0.012
ggy+0.012
 forgiven+0.011
Neg logit contributions
 valleys-0.023
 hills-0.022
 oceans-0.021
 rivers-0.021
ASH-0.021
Token one
Token ID530
Feature activation-2.96e+00

Pos logit contributions
 caring+0.018
 important+0.018
ggy+0.017
 lesson+0.017
 forgiven+0.016
Neg logit contributions
 valleys-0.018
 hills-0.018
 rivers-0.017
 oceans-0.017
 mountains-0.017
Token.
Token ID13
Feature activation-5.72e-01

Pos logit contributions
 caring+0.070
ggy+0.067
 important+0.066
 forgiven+0.065
 lesson+0.064
Neg logit contributions
 hills-0.078
 valleys-0.076
ASH-0.073
 rivers-0.073
 oceans-0.072
Token
Token ID198
Feature activation-2.78e+00

Pos logit contributions
 caring+0.010
 important+0.010
ggy+0.010
 lesson+0.009
 forgiven+0.009
Neg logit contributions
 valleys-0.019
 hills-0.019
 oceans-0.018
 rivers-0.018
ASH-0.018
Token
Token ID198
Feature activation-6.62e-01

Pos logit contributions
 caring+0.068
 important+0.065
 forgiveness+0.063
 generosity+0.062
ggy+0.062
Neg logit contributions
 hills-0.074
 valleys-0.069
 oceans-0.065
 mountains-0.064
 rivers-0.064
Token"
Token ID1
Feature activation+1.43e+00

Pos logit contributions
 caring+0.012
 important+0.012
 lesson+0.012
ggy+0.012
 forgiven+0.011
Neg logit contributions
 valleys-0.022
 hills-0.021
 oceans-0.021
 rivers-0.021
 mountains-0.020
TokenBut
Token ID1537
Feature activation-8.58e-01

Pos logit contributions
 valleys+0.040
 hills+0.040
 oceans+0.038
 rivers+0.038
ASH+0.038
Neg logit contributions
 important-0.033
ggy-0.032
 caring-0.032
 lesson-0.031
 forgiven-0.030
Token Dane
Token ID42999
Feature activation-7.98e-01

Pos logit contributions
 caring+0.014
 important+0.014
 lesson+0.013
ggy+0.013
 forgiven+0.012
Neg logit contributions
 valleys-0.031
 hills-0.030
 oceans-0.030
 rivers-0.030
ASH-0.029
Token,"
Token ID553
Feature activation-1.83e+00

Pos logit contributions
 caring+0.017
 lesson+0.016
 important+0.016
ggy+0.016
 forgiven+0.015
Neg logit contributions
 valleys-0.023
 hills-0.022
 rivers-0.022
ASH-0.022
 oceans-0.022
Token Mom
Token ID11254
Feature activation-2.78e+00

Pos logit contributions
ggy+0.036
 rewarding+0.035
vern+0.035
 forgiven+0.035
 important+0.034
Neg logit contributions
 hills-0.048
ASH-0.047
 valleys-0.045
 spaces-0.045
 railing-0.044
Tokenmy
Token ID1820
Feature activation-1.49e+00

Pos logit contributions
ggy+0.090
 lesson+0.088
 caring+0.086
 important+0.086
 valuable+0.086
Neg logit contributions
ASH-0.049
hole-0.044
 window-0.044
 hills-0.041
ACK-0.039
Token while
Token ID981
Feature activation-8.73e-01

Pos logit contributions
 forgiven+0.028
ggy+0.027
live+0.027
 important+0.026
oto+0.026
Neg logit contributions
 hills-0.023
 valleys-0.021
 oceans-0.021
 rivers-0.021
 south-0.019
Token,
Token ID11
Feature activation-5.78e-01

Pos logit contributions
 caring+0.015
 important+0.015
ggy+0.015
 lesson+0.014
 forgiven+0.013
Neg logit contributions
 valleys-0.030
 hills-0.029
 oceans-0.029
 rivers-0.029
ASH-0.028
Token the
Token ID262
Feature activation+1.71e-01

Pos logit contributions
 caring+0.011
 important+0.011
 lesson+0.011
ggy+0.011
 forgiven+0.010
Neg logit contributions
 valleys-0.019
 hills-0.018
 oceans-0.018
 rivers-0.018
ASH-0.017
Token noise
Token ID7838
Feature activation+1.94e-01

Pos logit contributions
 valleys+0.005
ASH+0.005
 oceans+0.005
 rivers+0.005
 hills+0.005
Neg logit contributions
 important-0.004
 lesson-0.004
 caring-0.004
 bad-0.004
ggy-0.004
Token stopped
Token ID5025
Feature activation+2.29e+00

Pos logit contributions
 valleys+0.006
 hills+0.006
 oceans+0.006
 rivers+0.006
ASH+0.006
Neg logit contributions
 caring-0.004
 important-0.004
 lesson-0.004
ggy-0.004
 forgiven-0.003
Token and
Token ID290
Feature activation+1.29e+00

Pos logit contributions
 valleys+0.077
ASH+0.074
als+0.074
 hills+0.073
 oceans+0.072
Neg logit contributions
 caring-0.041
 important-0.041
 not-0.038
 bad-0.037
 understand-0.037
Token Josh
Token ID8518
Feature activation+4.10e-02

Pos logit contributions
 valleys+0.042
 hills+0.042
 oceans+0.041
ASH+0.040
 streams+0.040
Neg logit contributions
 not-0.022
 caring-0.021
 lesson-0.021
 important-0.020
 why-0.020
Token stopped
Token ID5025
Feature activation+1.83e+00

Pos logit contributions
 valleys+0.001
 hills+0.001
 oceans+0.001
 rivers+0.001
 lakes+0.001
Neg logit contributions
ggy-0.001
 caring-0.001
 important-0.001
 forgiven-0.001
 lesson-0.001
Token being
Token ID852
Feature activation+8.24e-01

Pos logit contributions
 valleys+0.064
ASH+0.062
 hills+0.061
als+0.061
 oceans+0.061
Neg logit contributions
 caring-0.031
 important-0.029
 understand-0.028
 not-0.027
 bad-0.027
Token scared
Token ID12008
Feature activation-2.23e-01

Pos logit contributions
 valleys+0.026
 hills+0.025
 oceans+0.025
 rivers+0.025
 lakes+0.024
Neg logit contributions
 important-0.018
 caring-0.017
 forgiven-0.016
ggy-0.016
 lesson-0.016
Token too
Token ID1165
Feature activation-6.66e-01

Pos logit contributions
 important+0.004
 caring+0.004
 lesson+0.003
ggy+0.003
 bad+0.003
Neg logit contributions
 valleys-0.008
 hills-0.008
 oceans-0.008
 rivers-0.008
ASH-0.008
Token day
Token ID1110
Feature activation-8.24e-02

Pos logit contributions
 caring+0.040
 forgiven+0.039
ggy+0.038
 forgiveness+0.037
isi+0.037
Neg logit contributions
 valleys-0.060
 hills-0.060
 rivers-0.058
 oceans-0.057
 mountains-0.056
Token,
Token ID11
Feature activation-6.35e-02

Pos logit contributions
 caring+0.001
 important+0.001
ggy+0.001
 lesson+0.001
 forgiven+0.001
Neg logit contributions
 valleys-0.003
 hills-0.003
 oceans-0.003
 rivers-0.003
ASH-0.003
Token something
Token ID1223
Feature activation-1.47e+00

Pos logit contributions
 caring+0.001
 important+0.001
 lesson+0.001
ggy+0.001
 bad+0.001
Neg logit contributions
 valleys-0.002
 oceans-0.002
ASH-0.002
 rivers-0.002
 hills-0.002
Token special
Token ID2041
Feature activation-6.72e-01

Pos logit contributions
isi+0.026
 forgiveness+0.023
 caring+0.023
Jerry+0.022
 forgiven+0.022
Neg logit contributions
 hills-0.046
 valleys-0.045
 mountains-0.044
 rivers-0.044
 lands-0.043
Token happened
Token ID3022
Feature activation+9.13e-01

Pos logit contributions
 caring+0.006
 important+0.006
ggy+0.005
 lesson+0.005
 forgiven+0.005
Neg logit contributions
 valleys-0.028
 hills-0.028
 oceans-0.027
 rivers-0.027
 mountains-0.027
Token.
Token ID13
Feature activation+2.62e+00

Pos logit contributions
 valleys+0.038
 oceans+0.035
 hills+0.035
ASH+0.035
 rivers+0.035
Neg logit contributions
 important-0.012
 not-0.011
ggy-0.010
 caring-0.010
:-0.009
Token In
Token ID554
Feature activation+1.64e+00

Pos logit contributions
 valleys+0.089
 oceans+0.085
 rivers+0.084
ASH+0.083
 lands+0.083
Neg logit contributions
 important-0.043
 caring-0.041
 lesson-0.040
 forgiven-0.039
ggy-0.039
Token the
Token ID262
Feature activation+2.82e+00

Pos logit contributions
 valleys+0.056
 hills+0.054
 oceans+0.054
 rivers+0.054
ASH+0.053
Neg logit contributions
 important-0.028
 caring-0.028
 lesson-0.027
ggy-0.027
 forgiven-0.025
Token cave
Token ID11527
Feature activation+2.03e+00

Pos logit contributions
ASH+0.060
 valleys+0.059
ays+0.058
ACK+0.056
ting+0.055
Neg logit contributions
 important-0.084
 bad-0.082
 lesson-0.079
 wonderful-0.078
 caring-0.076
Token,
Token ID11
Feature activation+1.12e+00

Pos logit contributions
 valleys+0.076
ASH+0.074
 oceans+0.074
 rivers+0.073
 hills+0.073
Neg logit contributions
 important-0.029
ggy-0.027
 lesson-0.026
 bad-0.025
 caring-0.025
Token the
Token ID262
Feature activation+3.77e-01

Pos logit contributions
ASH+0.041
 valleys+0.040
 oceans+0.040
 rivers+0.039
 hills+0.038
Neg logit contributions
 important-0.016
 lesson-0.015
 not-0.015
 bad-0.015
 caring-0.014
Token fast
Token ID3049
Feature activation+1.46e+00

Pos logit contributions
 valleys+0.029
ASH+0.028
 oceans+0.027
 rivers+0.027
 hills+0.027
Neg logit contributions
 important-0.023
 lesson-0.022
 caring-0.022
 bad-0.020
 valuable-0.020
Token and
Token ID290
Feature activation+2.38e-01

Pos logit contributions
 valleys+0.047
ASH+0.047
 railing+0.045
 oceans+0.045
 rivers+0.044
Neg logit contributions
 important-0.028
 lesson-0.026
 bad-0.025
 caring-0.025
 valuable-0.023
Token that
Token ID326
Feature activation-1.12e+00

Pos logit contributions
 valleys+0.009
ASH+0.008
 oceans+0.008
 rivers+0.008
 hills+0.008
Neg logit contributions
 caring-0.004
 important-0.004
 lesson-0.004
 not-0.004
 valuable-0.004
Token he
Token ID339
Feature activation-1.40e+00

Pos logit contributions
 important+0.020
 caring+0.020
 lesson+0.019
 not+0.017
ggy+0.017
Neg logit contributions
 valleys-0.039
ASH-0.039
 oceans-0.037
 hills-0.037
 rivers-0.037
Token should
Token ID815
Feature activation-1.18e+00

Pos logit contributions
 important+0.023
 caring+0.022
 lesson+0.022
ggy+0.021
 forgiven+0.020
Neg logit contributions
 valleys-0.051
 hills-0.049
 oceans-0.048
 rivers-0.048
ASH-0.047
Token never
Token ID1239
Feature activation-4.53e-01

Pos logit contributions
 important+0.017
 caring+0.017
 lesson+0.016
ggy+0.016
 understand+0.015
Neg logit contributions
 valleys-0.046
 hills-0.044
 oceans-0.044
 rivers-0.044
ASH-0.043
Token do
Token ID466
Feature activation-1.07e+00

Pos logit contributions
 caring+0.010
 important+0.010
 lesson+0.009
ggy+0.009
 forgiven+0.009
Neg logit contributions
 valleys-0.014
 hills-0.013
 oceans-0.013
 rivers-0.013
ASH-0.013
Token something
Token ID1223
Feature activation-1.40e+00

Pos logit contributions
 important+0.021
 caring+0.021
ggy+0.020
 lesson+0.020
 valuable+0.018
Neg logit contributions
 valleys-0.036
 hills-0.034
 oceans-0.034
 rivers-0.034
ASH-0.033
Token dangerous
Token ID4923
Feature activation+5.14e-01

Pos logit contributions
 caring+0.018
 important+0.018
 lesson+0.017
ggy+0.017
 forgiven+0.015
Neg logit contributions
 valleys-0.055
 hills-0.053
 oceans-0.053
 rivers-0.052
ASH-0.051
Token like
Token ID588
Feature activation+6.28e-01

Pos logit contributions
 valleys+0.017
 hills+0.016
 rivers+0.016
 oceans+0.016
ASH+0.015
Neg logit contributions
 important-0.011
 lesson-0.010
 caring-0.010
ggy-0.010
 bad-0.009
Token run
Token ID1057
Feature activation+3.00e+00

Pos logit contributions
 valleys+0.023
ASH+0.021
 oceans+0.021
 rivers+0.021
 hills+0.021
Neg logit contributions
 important-0.011
 lesson-0.010
 caring-0.010
ggy-0.010
 bad-0.009
Token.
Token ID13
Feature activation+2.50e+00

Pos logit contributions
 valleys+0.002
 hills+0.002
 rivers+0.002
 oceans+0.002
 mountains+0.002
Neg logit contributions
 caring-0.002
 lesson-0.002
 important-0.002
 forgiven-0.002
ggy-0.002
Token He
Token ID679
Feature activation+1.19e+00

Pos logit contributions
 valleys+0.073
 hills+0.070
 oceans+0.069
 rivers+0.069
ASH+0.067
Neg logit contributions
 caring-0.056
 important-0.056
 lesson-0.054
ggy-0.054
 forgiven-0.051
Token loved
Token ID6151
Feature activation-8.66e-01

Pos logit contributions
 valleys+0.041
 hills+0.040
 oceans+0.039
 rivers+0.039
 mountains+0.038
Neg logit contributions
 caring-0.021
 important-0.020
 lesson-0.020
ggy-0.020
 forgiven-0.019
Token to
Token ID284
Feature activation+1.43e+00

Pos logit contributions
 caring+0.016
 important+0.016
 lesson+0.015
ggy+0.013
 valuable+0.013
Neg logit contributions
 valleys-0.030
ASH-0.029
 hills-0.027
 rivers-0.027
 oceans-0.027
Token swim
Token ID9422
Feature activation+2.03e+00

Pos logit contributions
 valleys+0.041
ASH+0.038
 hills+0.038
 oceans+0.038
 rivers+0.037
Neg logit contributions
 caring-0.034
ggy-0.033
 important-0.033
 lesson-0.032
 understand-0.032
Token and
Token ID290
Feature activation+1.12e+00

Pos logit contributions
 valleys+0.057
ASH+0.054
 hills+0.053
 rivers+0.053
 oceans+0.053
Neg logit contributions
 important-0.049
 lesson-0.047
 caring-0.047
ggy-0.045
 valuable-0.044
Token explore
Token ID7301
Feature activation+1.17e+00

Pos logit contributions
ASH+0.036
ays+0.035
 valleys+0.033
ACK+0.033
hole+0.033
Neg logit contributions
 not-0.023
 caring-0.022
 important-0.022
 understand-0.022
 lesson-0.022
Token the
Token ID262
Feature activation+1.93e+00

Pos logit contributions
 valleys+0.041
ASH+0.040
 hills+0.039
 oceans+0.038
ACK+0.038
Neg logit contributions
 important-0.019
 caring-0.019
 lesson-0.018
ggy-0.017
 wonderful-0.016
Token ocean
Token ID9151
Feature activation+1.49e+00

Pos logit contributions
ASH+0.050
ACK+0.047
ays+0.047
hole+0.046
 valleys+0.044
Neg logit contributions
 important-0.057
 wonderful-0.052
 lesson-0.051
 bad-0.050
 valuable-0.049
Token.
Token ID13
Feature activation+1.11e+00

Pos logit contributions
ASH+0.060
ACK+0.059
 valleys+0.056
 railing+0.053
ays+0.053
Neg logit contributions
 important-0.022
 not-0.018
 lesson-0.016
 caring-0.016
 kind-0.016
Token One
Token ID1881
Feature activation-1.57e+00

Pos logit contributions
 valleys+0.039
 oceans+0.038
 rivers+0.037
 hills+0.037
ASH+0.037
Neg logit contributions
 important-0.018
 lesson-0.017
 caring-0.017
 understand-0.015
ggy-0.015
Token good
Token ID922
Feature activation-1.07e+00

Pos logit contributions
 important+0.015
 caring+0.015
ggy+0.015
 forgiven+0.015
isi+0.014
Neg logit contributions
 valleys-0.036
 hills-0.036
 oceans-0.034
 rivers-0.034
 mountains-0.034
Token time
Token ID640
Feature activation-1.56e+00

Pos logit contributions
 caring+0.018
ggy+0.016
 forgiven+0.016
 important+0.016
isi+0.016
Neg logit contributions
 hills-0.036
 valleys-0.035
ASH-0.035
 oceans-0.034
 rivers-0.034
Token.
Token ID13
Feature activation-8.07e-01

Pos logit contributions
 caring+0.025
 important+0.024
ggy+0.024
 lesson+0.020
 forgiven+0.020
Neg logit contributions
 valleys-0.057
ASH-0.056
 oceans-0.054
 hills-0.054
 rivers-0.053
Token
Token ID198
Feature activation-2.91e+00

Pos logit contributions
 caring+0.018
 important+0.018
ggy+0.017
 forgiveness+0.017
 lesson+0.017
Neg logit contributions
 hills-0.023
 valleys-0.023
 rivers-0.022
 mountains-0.022
ASH-0.021
Token
Token ID198
Feature activation-5.46e-01

Pos logit contributions
 caring+0.085
 forgiveness+0.084
 generosity+0.084
 unfair+0.083
 forgiven+0.081
Neg logit contributions
 hills-0.066
 valleys-0.059
 mountains-0.053
 rivers-0.053
 oceans-0.049
TokenSuddenly
Token ID38582
Feature activation-2.09e-01

Pos logit contributions
 forgiveness+0.013
 forgiven+0.012
 caring+0.012
 important+0.012
 generosity+0.012
Neg logit contributions
 hills-0.015
 valleys-0.015
 rivers-0.014
 oceans-0.013
 lakes-0.013
Token,
Token ID11
Feature activation+3.38e-01

Pos logit contributions
 caring+0.004
 important+0.004
 lesson+0.004
ggy+0.004
 forgiven+0.003
Neg logit contributions
 valleys-0.007
 hills-0.007
 oceans-0.007
 rivers-0.007
ASH-0.006
Token Tim
Token ID5045
Feature activation+1.09e+00

Pos logit contributions
 valleys+0.011
 hills+0.011
 rivers+0.010
 oceans+0.010
 mountains+0.010
Neg logit contributions
 caring-0.007
 important-0.006
ggy-0.006
 lesson-0.006
 forgiven-0.006
Token threw
Token ID9617
Feature activation+2.16e+00

Pos logit contributions
 valleys+0.036
 oceans+0.033
 rivers+0.033
 hills+0.033
 lakes+0.033
Neg logit contributions
 important-0.021
ggy-0.021
 caring-0.020
 forgiven-0.020
 not-0.019
Token his
Token ID465
Feature activation-1.46e+00

Pos logit contributions
 valleys+0.080
 oceans+0.076
 hills+0.075
ASH+0.075
 rivers+0.075
Neg logit contributions
 lesson-0.031
 caring-0.031
 important-0.031
 bad-0.029
 wonderful-0.026
Token ball
Token ID2613
Feature activation+7.96e-01

Pos logit contributions
 important+0.036
 caring+0.036
 lesson+0.035
ggy+0.034
 forgiven+0.033
Neg logit contributions
 valleys-0.041
 hills-0.038
 oceans-0.038
 rivers-0.038
ASH-0.036
Token look
Token ID804
Feature activation+1.45e+00

Pos logit contributions
 valleys+0.010
 hills+0.010
 oceans+0.010
 rivers+0.010
ASH+0.009
Neg logit contributions
 caring-0.008
 important-0.008
 lesson-0.008
ggy-0.008
 forgiven-0.007
Token,
Token ID11
Feature activation+1.02e+00

Pos logit contributions
 valleys+0.053
ASH+0.051
 hills+0.051
 oceans+0.050
 rivers+0.050
Neg logit contributions
 important-0.023
 caring-0.022
 lesson-0.021
ggy-0.021
 bad-0.020
Token it
Token ID340
Feature activation+1.72e+00

Pos logit contributions
 valleys+0.037
ASH+0.037
 oceans+0.036
 hills+0.036
 rivers+0.036
Neg logit contributions
 caring-0.015
 important-0.014
 lesson-0.014
ggy-0.013
 bad-0.013
Token was
Token ID373
Feature activation+1.40e+00

Pos logit contributions
 valleys+0.058
 hills+0.056
 oceans+0.055
 rivers+0.055
ASH+0.054
Neg logit contributions
 important-0.031
 caring-0.031
ggy-0.030
 lesson-0.029
 forgiven-0.027
Token actually
Token ID1682
Feature activation+5.98e-01

Pos logit contributions
 valleys+0.046
 hills+0.045
 rivers+0.044
 oceans+0.044
 mountains+0.043
Neg logit contributions
 caring-0.026
 important-0.026
 lesson-0.026
ggy-0.025
 forgiven-0.024
Token a
Token ID257
Feature activation-6.92e-02

Pos logit contributions
 valleys+0.020
 hills+0.020
 rivers+0.019
 oceans+0.019
 mountains+0.019
Neg logit contributions
 caring-0.011
 important-0.010
 lesson-0.010
ggy-0.010
 forgiven-0.010
Token fancy
Token ID14996
Feature activation+1.62e-01

Pos logit contributions
 important+0.001
 caring+0.001
 lesson+0.001
 valuable+0.001
ggy+0.001
Neg logit contributions
 valleys-0.002
 rivers-0.002
 oceans-0.002
 hills-0.002
ASH-0.002
Token wrap
Token ID14441
Feature activation-2.75e-01

Pos logit contributions
 valleys+0.006
 rivers+0.005
 oceans+0.005
 hills+0.005
 lakes+0.005
Neg logit contributions
 important-0.003
 lesson-0.003
 caring-0.003
ggy-0.003
 valuable-0.003
Token that
Token ID326
Feature activation-4.85e-01

Pos logit contributions
 important+0.005
ggy+0.005
 lesson+0.005
 caring+0.005
ggie+0.004
Neg logit contributions
 valleys-0.010
 oceans-0.009
 hills-0.009
 rivers-0.009
ASH-0.009
Token was
Token ID373
Feature activation+1.15e-01

Pos logit contributions
 important+0.009
 caring+0.009
 lesson+0.009
ggy+0.009
 forgiven+0.008
Neg logit contributions
 valleys-0.016
 hills-0.015
 oceans-0.015
 rivers-0.015
ASH-0.015
Token made
Token ID925
Feature activation-3.17e-01

Pos logit contributions
 valleys+0.004
 hills+0.003
 oceans+0.003
 rivers+0.003
ASH+0.003
Neg logit contributions
 important-0.003
 caring-0.003
 lesson-0.002
ggy-0.002
 forgiven-0.002
Token life
Token ID1204
Feature activation-1.71e+00

Pos logit contributions
 caring+0.048
 important+0.048
 lesson+0.046
ggy+0.045
 forgiven+0.044
Neg logit contributions
 valleys-0.063
 hills-0.060
 oceans-0.059
 rivers-0.059
ASH-0.057
Token to
Token ID284
Feature activation-2.30e-01

Pos logit contributions
 important+0.030
 caring+0.029
 lesson+0.028
ggy+0.027
 valuable+0.026
Neg logit contributions
 valleys-0.060
ASH-0.058
 hills-0.057
 oceans-0.057
 rivers-0.056
Token the
Token ID262
Feature activation+1.62e+00

Pos logit contributions
 caring+0.004
 important+0.004
 lesson+0.004
ggy+0.004
 forgiven+0.004
Neg logit contributions
 valleys-0.008
 hills-0.008
 oceans-0.007
 rivers-0.007
ASH-0.007
Token tree
Token ID5509
Feature activation-5.39e-01

Pos logit contributions
 valleys+0.042
ASH+0.038
 oceans+0.038
 hills+0.037
 rivers+0.037
Neg logit contributions
 important-0.046
 caring-0.044
 lesson-0.043
 wonderful-0.041
 bad-0.041
Token.
Token ID13
Feature activation+1.29e+00

Pos logit contributions
 important+0.008
 caring+0.007
 lesson+0.007
ggy+0.007
 forgiven+0.007
Neg logit contributions
 valleys-0.021
ASH-0.020
 oceans-0.020
 hills-0.020
 rivers-0.019
Token Its
Token ID6363
Feature activation-3.44e-01

Pos logit contributions
 valleys+0.046
 rivers+0.043
 oceans+0.043
 lakes+0.042
 railing+0.042
Neg logit contributions
 important-0.021
 caring-0.021
 lesson-0.020
 forgiven-0.020
 valuable-0.019
Token branches
Token ID13737
Feature activation+1.44e+00

Pos logit contributions
 important+0.009
 caring+0.009
 lesson+0.008
ggy+0.008
 forgiven+0.008
Neg logit contributions
 valleys-0.009
 hills-0.009
 oceans-0.009
 rivers-0.009
ASH-0.009
Token were
Token ID547
Feature activation+5.55e-01

Pos logit contributions
 valleys+0.055
ASH+0.055
 oceans+0.054
 hills+0.053
 rivers+0.053
Neg logit contributions
 important-0.018
 lesson-0.018
 caring-0.017
 not-0.016
 understand-0.016
Token soon
Token ID2582
Feature activation+1.28e+00

Pos logit contributions
 valleys+0.018
 hills+0.017
 oceans+0.017
 rivers+0.017
ASH+0.016
Neg logit contributions
 caring-0.011
 important-0.011
 lesson-0.011
ggy-0.010
 forgiven-0.010
Token filled
Token ID5901
Feature activation+1.20e+00

Pos logit contributions
 valleys+0.038
ASH+0.037
 hills+0.037
 oceans+0.036
 rivers+0.036
Neg logit contributions
 important-0.028
 caring-0.027
 lesson-0.026
 forgiven-0.025
 bad-0.025
Token with
Token ID351
Feature activation+9.57e-01

Pos logit contributions
 valleys+0.037
 hills+0.035
 oceans+0.034
 rivers+0.034
 railing+0.034
Neg logit contributions
 important-0.027
 caring-0.026
 lesson-0.025
ggy-0.025
ggie-0.023
Token and
Token ID290
Feature activation-1.11e+00

Pos logit contributions
 caring+0.044
 important+0.044
ggy+0.043
 lesson+0.043
 forgiven+0.041
Neg logit contributions
 valleys-0.058
 hills-0.057
 oceans-0.055
 rivers-0.055
 mountains-0.053
Token started
Token ID2067
Feature activation-4.00e-01

Pos logit contributions
 caring+0.022
 important+0.021
 lesson+0.020
ggy+0.020
 forgiven+0.019
Neg logit contributions
 valleys-0.036
 hills-0.035
 oceans-0.035
 rivers-0.035
ASH-0.034
Token to
Token ID284
Feature activation+4.99e-01

Pos logit contributions
 caring+0.008
 important+0.008
 lesson+0.008
ggy+0.007
 forgiven+0.007
Neg logit contributions
 valleys-0.013
 hills-0.012
 oceans-0.012
 rivers-0.012
ASH-0.012
Token bark
Token ID21405
Feature activation+1.05e+00

Pos logit contributions
 valleys+0.016
 hills+0.015
 oceans+0.015
 rivers+0.015
 mountains+0.015
Neg logit contributions
 caring-0.011
 important-0.010
ggy-0.010
 lesson-0.010
 understand-0.010
Token.
Token ID13
Feature activation-2.25e-01

Pos logit contributions
 valleys+0.034
 oceans+0.032
 hills+0.032
ASH+0.032
 rivers+0.032
Neg logit contributions
 important-0.021
 caring-0.021
 lesson-0.020
ggy-0.020
 forgiven-0.019
Token Her
Token ID2332
Feature activation-2.67e+00

Pos logit contributions
 caring+0.004
 important+0.004
 lesson+0.004
ggy+0.004
 forgiven+0.004
Neg logit contributions
 valleys-0.007
 hills-0.007
 oceans-0.007
 rivers-0.007
ASH-0.007
Token friend
Token ID1545
Feature activation-9.01e-01

Pos logit contributions
 caring+0.034
isi+0.034
 important+0.034
oral+0.033
 lesson+0.032
Neg logit contributions
 hills-0.091
 valleys-0.087
 rivers-0.087
 mountains-0.086
 bur-0.086
Token smiled
Token ID13541
Feature activation+4.59e-01

Pos logit contributions
 caring+0.017
 important+0.017
 lesson+0.016
ggy+0.016
 forgiven+0.015
Neg logit contributions
 valleys-0.030
 hills-0.029
 rivers-0.028
 oceans-0.028
ASH-0.027
Token and
Token ID290
Feature activation-5.05e-01

Pos logit contributions
 valleys+0.012
 hills+0.012
 oceans+0.011
 rivers+0.011
ASH+0.011
Neg logit contributions
 caring-0.012
 important-0.012
 lesson-0.011
ggy-0.011
 forgiven-0.011
Token said
Token ID531
Feature activation-1.81e+00

Pos logit contributions
 caring+0.009
 important+0.009
 lesson+0.009
ggy+0.009
 forgiven+0.008
Neg logit contributions
 valleys-0.017
 hills-0.017
 oceans-0.016
 rivers-0.016
ASH-0.016
Token,
Token ID11
Feature activation-7.13e-01

Pos logit contributions
 important+0.037
 caring+0.036
 lesson+0.036
ggy+0.035
isi+0.034
Neg logit contributions
 valleys-0.053
 hills-0.052
 rivers-0.050
 oceans-0.050
 mountains-0.049
Token was
Token ID373
Feature activation-2.11e-01

Pos logit contributions
 valleys+0.032
 oceans+0.031
 hills+0.030
 rivers+0.030
ASH+0.029
Neg logit contributions
 caring-0.018
 important-0.018
ggy-0.018
 lesson-0.018
 forgiven-0.017
Token happy
Token ID3772
Feature activation-1.69e+00

Pos logit contributions
 lesson+0.004
oral+0.004
ggy+0.004
 caring+0.004
 important+0.004
Neg logit contributions
 valleys-0.006
 hills-0.006
 rivers-0.006
 oceans-0.006
 mountains-0.006
Token and
Token ID290
Feature activation-3.17e-01

Pos logit contributions
 caring+0.038
 important+0.037
 lesson+0.036
ggy+0.035
 forgiven+0.034
Neg logit contributions
 valleys-0.051
 hills-0.048
 oceans-0.048
 rivers-0.048
ASH-0.046
Token thanked
Token ID26280
Feature activation-9.81e-01

Pos logit contributions
 caring+0.007
 important+0.006
 lesson+0.006
ggy+0.006
 forgiven+0.006
Neg logit contributions
 valleys-0.010
 hills-0.010
 oceans-0.010
 rivers-0.010
ASH-0.009
Token Max
Token ID5436
Feature activation-3.01e+00

Pos logit contributions
 important+0.019
 caring+0.019
 lesson+0.018
ggy+0.018
 forgiven+0.018
Neg logit contributions
 valleys-0.032
 hills-0.031
 rivers-0.030
 oceans-0.030
 mountains-0.030
Token for
Token ID329
Feature activation-2.45e+00

Pos logit contributions
 caring+0.062
 important+0.062
 lesson+0.060
ggy+0.060
 forgiven+0.057
Neg logit contributions
 valleys-0.093
 hills-0.090
 rivers-0.088
 oceans-0.088
 mountains-0.085
Token his
Token ID465
Feature activation-3.12e+00

Pos logit contributions
 important+0.052
ggy+0.051
 lesson+0.051
 valuable+0.049
isi+0.049
Neg logit contributions
 valleys-0.070
 hills-0.068
 rivers-0.067
 oceans-0.066
 mountains-0.064
Token help
Token ID1037
Feature activation-1.87e+00

Pos logit contributions
 caring+0.045
ggy+0.045
 important+0.044
isi+0.044
live+0.042
Neg logit contributions
 hills-0.106
 valleys-0.103
 rivers-0.101
ASH-0.100
 oceans-0.100
Token.
Token ID13
Feature activation+5.73e-02

Pos logit contributions
 caring+0.033
 important+0.033
 lesson+0.031
ggy+0.031
 forgiven+0.030
Neg logit contributions
 valleys-0.066
 hills-0.062
 oceans-0.062
 rivers-0.061
 mountains-0.060
Token They
Token ID1119
Feature activation-9.11e-01

Pos logit contributions
 valleys+0.002
 hills+0.002
 oceans+0.002
 rivers+0.002
ASH+0.002
Neg logit contributions
 caring-0.001
 important-0.001
 lesson-0.001
ggy-0.001
 forgiven-0.001
Token became
Token ID2627
Feature activation-4.88e-01

Pos logit contributions
 important+0.023
 caring+0.023
ggy+0.022
 lesson+0.022
 valuable+0.021
Neg logit contributions
 valleys-0.024
 hills-0.024
 rivers-0.023
 oceans-0.023
 mountains-0.022
Token She
Token ID1375
Feature activation-6.46e-01

Pos logit contributions
 caring+0.009
 important+0.009
ggy+0.009
 lesson+0.009
 forgiven+0.008
Neg logit contributions
 valleys-0.011
 hills-0.011
 rivers-0.011
 oceans-0.011
ASH-0.010
Token wanted
Token ID2227
Feature activation-2.31e+00

Pos logit contributions
 caring+0.013
 important+0.013
ggy+0.013
 lesson+0.013
 forgiven+0.012
Neg logit contributions
 valleys-0.020
 hills-0.019
 oceans-0.019
 rivers-0.019
ASH-0.018
Token to
Token ID284
Feature activation-4.17e-01

Pos logit contributions
ggy+0.056
 important+0.054
 caring+0.054
 lesson+0.053
oral+0.053
Neg logit contributions
 hills-0.062
 valleys-0.059
 rivers-0.059
 mountains-0.058
 oceans-0.057
Token cry
Token ID3960
Feature activation-8.97e-01

Pos logit contributions
 important+0.011
 caring+0.011
 lesson+0.010
ggy+0.010
 valuable+0.010
Neg logit contributions
 valleys-0.011
 hills-0.010
 rivers-0.010
 oceans-0.010
ASH-0.010
Token but
Token ID475
Feature activation-2.69e+00

Pos logit contributions
 important+0.017
 caring+0.017
 lesson+0.015
ggy+0.015
 bad+0.014
Neg logit contributions
 valleys-0.031
 hills-0.029
 oceans-0.029
 rivers-0.029
ASH-0.028
Token remembered
Token ID12086
Feature activation-3.30e+00

Pos logit contributions
 lesson+0.058
ggy+0.058
 important+0.058
oral+0.057
 caring+0.057
Neg logit contributions
 valleys-0.071
 hills-0.071
 rivers-0.069
 mountains-0.067
 oceans-0.066
Token how
Token ID703
Feature activation-2.30e+00

Pos logit contributions
 lesson+0.055
 important+0.054
ggy+0.053
 caring+0.052
 valuable+0.050
Neg logit contributions
 hills-0.109
 valleys-0.109
 rivers-0.108
 mountains-0.106
 oceans-0.105
Token tough
Token ID5802
Feature activation-2.33e+00

Pos logit contributions
isi+0.044
oto+0.039
ggie+0.038
ggy+0.038
 caring+0.037
Neg logit contributions
 hills-0.070
 valleys-0.070
 rivers-0.066
 streams-0.064
 oceans-0.064
Token she
Token ID673
Feature activation-1.89e+00

Pos logit contributions
 caring+0.045
 important+0.045
 lesson+0.043
ggy+0.043
 forgiven+0.041
Neg logit contributions
 valleys-0.075
 hills-0.073
 rivers-0.071
 oceans-0.071
 mountains-0.069
Token was
Token ID373
Feature activation-1.82e+00

Pos logit contributions
 caring+0.041
ggy+0.039
 lesson+0.038
vern+0.038
 forgiveness+0.036
Neg logit contributions
 valleys-0.055
 hills-0.055
 mountains-0.052
 rivers-0.052
 oceans-0.051
Token.
Token ID13
Feature activation-1.21e+00

Pos logit contributions
 caring+0.038
 important+0.037
 lesson+0.036
ggy+0.036
 forgiven+0.034
Neg logit contributions
 valleys-0.056
 hills-0.054
 oceans-0.053
 rivers-0.053
 mountains-0.051
Token at
Token ID379
Feature activation-4.31e-01

Pos logit contributions
 valleys+0.037
ASH+0.036
 hills+0.035
 rivers+0.035
 oceans+0.034
Neg logit contributions
 important-0.031
 caring-0.030
 lesson-0.029
 valuable-0.027
 forgiven-0.027
Token the
Token ID262
Feature activation+4.95e-01

Pos logit contributions
ggy+0.011
ggie+0.010
 caring+0.010
 important+0.010
 forgiven+0.009
Neg logit contributions
 hills-0.012
 valleys-0.012
 rivers-0.012
 oceans-0.012
 mountains-0.011
Token clouds
Token ID15114
Feature activation+1.12e+00

Pos logit contributions
 valleys+0.012
 hills+0.011
 oceans+0.011
 rivers+0.011
ASH+0.011
Neg logit contributions
 important-0.014
 lesson-0.014
 caring-0.014
ggy-0.013
 bad-0.013
Token and
Token ID290
Feature activation+8.30e-01

Pos logit contributions
ASH+0.040
 valleys+0.039
ACK+0.038
 railing+0.037
 hills+0.036
Neg logit contributions
 about-0.018
 caring-0.018
 that-0.017
 important-0.017
 not-0.016
Token asked
Token ID1965
Feature activation-1.97e+00

Pos logit contributions
 valleys+0.028
 hills+0.027
 oceans+0.027
 rivers+0.027
ASH+0.027
Neg logit contributions
 caring-0.015
 important-0.015
 lesson-0.014
ggy-0.013
 forgiven-0.013
Token lots
Token ID6041
Feature activation-2.58e+00

Pos logit contributions
ggy+0.055
live+0.053
ggie+0.052
isi+0.051
Jerry+0.048
Neg logit contributions
 hills-0.049
 valleys-0.047
 rivers-0.046
 oceans-0.046
 mountains-0.046
Token of
Token ID286
Feature activation-2.00e+00

Pos logit contributions
 caring+0.038
 important+0.037
ggy+0.036
 lesson+0.036
 forgiven+0.033
Neg logit contributions
 valleys-0.095
 hills-0.093
 oceans-0.091
 rivers-0.091
 mountains-0.089
Token questions
Token ID2683
Feature activation-2.72e+00

Pos logit contributions
 caring+0.036
 important+0.035
 lesson+0.034
ggy+0.032
 valuable+0.030
Neg logit contributions
 valleys-0.070
 hills-0.067
ASH-0.067
 oceans-0.066
 rivers-0.066
Token.
Token ID13
Feature activation-9.77e-02

Pos logit contributions
 caring+0.038
 important+0.037
 lesson+0.035
ggy+0.033
 forgiven+0.031
Neg logit contributions
 valleys-0.105
ASH-0.101
 hills-0.100
 oceans-0.099
 rivers-0.099
Token He
Token ID679
Feature activation-1.27e+00

Pos logit contributions
 caring+0.002
 important+0.002
 lesson+0.002
ggy+0.002
 forgiven+0.002
Neg logit contributions
 valleys-0.003
 hills-0.003
 oceans-0.003
 rivers-0.003
ASH-0.003
Token also
Token ID635
Feature activation+1.19e-01

Pos logit contributions
 caring+0.020
 important+0.020
ggy+0.019
 lesson+0.019
 forgiven+0.018
Neg logit contributions
 valleys-0.044
 hills-0.043
 rivers-0.042
 oceans-0.042
 mountains-0.042
Token?"
Token ID1701
Feature activation-1.09e+00

Pos logit contributions
 important+0.006
 caring+0.006
 lesson+0.006
ggy+0.006
 understand+0.006
Neg logit contributions
 valleys-0.012
 hills-0.011
 oceans-0.011
 rivers-0.011
ASH-0.011
Token
Token ID198
Feature activation-2.31e+00

Pos logit contributions
 caring+0.018
 important+0.018
 lesson+0.017
ggy+0.017
 forgiven+0.016
Neg logit contributions
 valleys-0.038
 hills-0.037
 oceans-0.036
 rivers-0.036
ASH-0.035
Token
Token ID198
Feature activation-1.16e+00

Pos logit contributions
 important+0.046
 caring+0.045
 lesson+0.043
 forgiven+0.041
 valuable+0.040
Neg logit contributions
 valleys-0.073
 hills-0.072
 rivers-0.069
 oceans-0.068
 mountains-0.066
TokenL
Token ID43
Feature activation+5.99e-01

Pos logit contributions
 important+0.020
 caring+0.019
 lesson+0.019
ggy+0.018
 forgiven+0.017
Neg logit contributions
 valleys-0.041
 hills-0.039
 rivers-0.039
 oceans-0.039
ASH-0.037
Tokenizzy
Token ID40593
Feature activation+5.82e-01

Pos logit contributions
 valleys+0.023
 hills+0.022
 oceans+0.022
 rivers+0.022
ASH+0.022
Neg logit contributions
 caring-0.008
 important-0.008
 lesson-0.007
ggy-0.007
 forgiven-0.007
Token thought
Token ID1807
Feature activation-2.88e+00

Pos logit contributions
 valleys+0.021
 hills+0.021
 oceans+0.021
 rivers+0.020
 lakes+0.020
Neg logit contributions
 important-0.009
ggy-0.009
 caring-0.009
 lesson-0.009
 forgiven-0.008
Token about
Token ID546
Feature activation-3.25e+00

Pos logit contributions
 important+0.047
 caring+0.046
 lesson+0.045
ggy+0.044
 forgiven+0.040
Neg logit contributions
 valleys-0.104
 hills-0.101
 oceans-0.099
 rivers-0.099
ASH-0.098
Token what
Token ID644
Feature activation-4.34e+00

Pos logit contributions
 important+0.060
 caring+0.059
ggy+0.058
 lesson+0.057
 forgiven+0.056
Neg logit contributions
 valleys-0.107
 hills-0.104
 rivers-0.102
 oceans-0.102
 mountains-0.100
Token the
Token ID262
Feature activation-9.52e-01

Pos logit contributions
isi+0.086
 caring+0.083
ggy+0.082
 forgiven+0.082
 important+0.081
Neg logit contributions
 hills-0.117
 valleys-0.116
 mountains-0.113
 rivers-0.112
 oceans-0.109
Token butterfly
Token ID35113
Feature activation-1.55e+00

Pos logit contributions
 important+0.019
 lesson+0.019
 caring+0.019
ggy+0.017
 valuable+0.017
Neg logit contributions
 valleys-0.031
 hills-0.029
 oceans-0.029
 rivers-0.029
ASH-0.029
Token said
Token ID531
Feature activation-2.39e+00

Pos logit contributions
 caring+0.024
 important+0.024
 lesson+0.023
ggy+0.023
 forgiven+0.021
Neg logit contributions
 valleys-0.056
 hills-0.055
 oceans-0.054
 rivers-0.054
ASH-0.053