TOP ACTIVATIONS
MAX = 12.506

THE BE LL WHEN YOU FE EL LIKE IT ".
bouncing around , an unpleasant od our wa ft ed into
theater and Larry felt his eyel ids become heavy . He
. He blinked his long eyel ashes and kept hopping .
long until Chloe felt her eyel ids getting heavy . She
thought it was the best sc ooter in the whole world
" It is an old wishing well , M andy .
a time there was a Stub born Old Woman . She
. It was a magical wishing star ! The
That 's a very nice sc ooter . Did you polish
seems like a depend able sc ooter ." L
, I 'm a magical wishing star . Do you want
You are an amazing resc uer ! I bet nobody
and said , " G lad you had fun , Sam
and said : " G lad to help ".
had a bike , a sc ooter and a tr icycle
and said , " G lad we could help . Have
mom replied , " G lad you like it !"
Mom said , " G lad to see you both made
Tim . Tim had a sc ooter that he loved to

MIN ACTIVATIONS
MIN = -10.428

would <|endoftext|> John b iked to the park with his friends
The naughty boy slowly walked to his bedroom , but he
to play on the swing at the park . She wanted
S ally climbed to the top of the big
The boy cautiously walked to the hive and asked in
my and Daddy took baby to the store . They wanted
to go on the slides at the park . She asked
her mom my took all of the pills . They went
ama and Papa took baby to the supermarket . Mama wanted
ring , so he climbed to the highest branch to see
's hand and they walked to the oven . They smelled
. She walked to her classroom . She saw
<|endoftext|> Jimmy wanted to climb to the top of the tree
had been able to climb to the top . After a
to it . He climbed to the top and stepped onto
heard Tweet ie and ran to help . " Hold on
Peter would read to the surf board . They
the tooth fairy would come at night . Tim put the
L ily wanted to come to the party with her big
S ara and Ben jog to the swings . They push

INTERVAL 6.772 - 12.506
CONTAINS 0.436%

enny felt sad because she thought she could never dance like
feeling even more curious and wondering what the morning would bring
very excited and couldn 't wait to see what it was
" That , my de ars , is a cave
on the slides . Jack thought that maybe it was special

INTERVAL 1.039 - 6.772
CONTAINS 36.797%

is my best friend . Do you have a best friend
think these cookies are delicious . You should try them again
hungry . " Hello , puppy ! Where did
win . Everyone was amazed and cheered for her .
her friends all about their secret adventure . They all thought

INTERVAL -4.695 - 1.039
CONTAINS 53.521%

. Billy didn 't want to play with Tim
are friends . They go to the swing .
big and round wheel was rolling down the hill . The
and ro ars again . It looks for something else to
to walk together . As they walked , the sun started

INTERVAL -10.428 - -4.695
CONTAINS 9.246%

big boy . They ran to the open . The open
didn 't want to go to the beach anymore . He
herbs and put the pie into the oven . And when
dirty , so it decided to take a bath . But
find the truth . The end . <|endoftext|>
Token THE
Token ID3336
Feature activation-2.31e-01

Pos logit contributions
 sorry+0.049
 dear+0.049
 darling+0.047
 okay+0.045
 ashamed+0.044
Neg logit contributions
|-0.042
growth-0.039
 land-0.037
 camp-0.036
ground-0.034
Token BE
Token ID9348
Feature activation+5.04e+00

Pos logit contributions
growth+0.005
|+0.005
 land+0.004
 camp+0.004
 forests+0.004
Neg logit contributions
 sorry-0.006
 dear-0.005
 darling-0.005
 okay-0.005
 ashamed-0.005
TokenLL
Token ID3069
Feature activation+7.12e+00

Pos logit contributions
 sorry+0.109
 dear+0.106
 darling+0.104
 ashamed+0.101
 ok+0.101
Neg logit contributions
|-0.118
 land-0.110
e-0.107
ground-0.103
growth-0.103
Token WHEN
Token ID42099
Feature activation+6.29e+00

Pos logit contributions
 sorry+0.187
 dear+0.185
 darling+0.180
 okay+0.175
 ashamed+0.173
Neg logit contributions
|-0.135
growth-0.127
 land-0.120
 camp-0.113
ground-0.112
Token YOU
Token ID7013
Feature activation+3.63e+00

Pos logit contributions
 sorry+0.119
 dear+0.111
 okay+0.108
 darling+0.105
 unfair+0.103
Neg logit contributions
|-0.167
growth-0.162
 land-0.156
 camp-0.151
 forests-0.148
Token FE
Token ID18630
Feature activation+1.25e+01

Pos logit contributions
 sorry+0.094
 dear+0.091
 darling+0.088
 okay+0.088
 ok+0.085
Neg logit contributions
|-0.071
growth-0.070
 land-0.065
 camp-0.063
 forests-0.063
TokenEL
Token ID3698
Feature activation+3.35e+00

Pos logit contributions
 sorry+0.266
 dear+0.254
 ashamed+0.250
 okay+0.249
 darling+0.246
Neg logit contributions
|-0.297
growth-0.262
of-0.258
 land-0.256
ground-0.254
Token LIKE
Token ID34178
Feature activation+5.07e+00

Pos logit contributions
 sorry+0.081
 dear+0.079
 darling+0.076
 unfair+0.076
 ashamed+0.075
Neg logit contributions
|-0.066
growth-0.066
 land-0.060
 camp-0.060
ground-0.057
Token IT
Token ID7283
Feature activation+1.51e+00

Pos logit contributions
 sorry+0.103
 dear+0.099
 darling+0.094
 okay+0.094
 ashamed+0.091
Neg logit contributions
|-0.129
growth-0.124
 land-0.119
 camp-0.115
 forests-0.112
Token".
Token ID1911
Feature activation-9.64e-01

Pos logit contributions
 sorry+0.037
 dear+0.035
 darling+0.034
 okay+0.034
 ashamed+0.033
Neg logit contributions
|-0.032
growth-0.031
 land-0.028
 camp-0.027
 forests-0.027
Token
Token ID198
Feature activation-6.38e+00

Pos logit contributions
|+0.021
growth+0.020
 land+0.019
 camp+0.019
 forests+0.018
Neg logit contributions
 sorry-0.023
 dear-0.022
 okay-0.022
 ashamed-0.021
 darling-0.021
Token bouncing
Token ID30713
Feature activation+2.88e-02

Pos logit contributions
 sorry+0.032
 okay+0.030
 dear+0.030
 ashamed+0.029
 darling+0.028
Neg logit contributions
growth-0.029
|-0.028
 land-0.025
 valleys-0.025
 forests-0.024
Token around
Token ID1088
Feature activation-6.10e+00

Pos logit contributions
 sorry+0.001
 okay+0.001
 dear+0.001
 darling+0.001
 ashamed+0.001
Neg logit contributions
|-0.001
growth-0.001
 land-0.000
ground-0.000
 camp-0.000
Token,
Token ID11
Feature activation+3.84e+00

Pos logit contributions
|+0.132
growth+0.119
of+0.116
Ray+0.103
 land+0.096
Neg logit contributions
 sorry-0.152
 okay-0.140
 dear-0.135
 guilty-0.132
 ok-0.131
Token an
Token ID281
Feature activation+1.37e+00

Pos logit contributions
 sorry+0.095
 dear+0.091
 okay+0.088
 darling+0.088
 ashamed+0.086
Neg logit contributions
|-0.082
growth-0.077
 land-0.073
 camp-0.070
ground-0.069
Token unpleasant
Token ID22029
Feature activation+4.94e+00

Pos logit contributions
 sorry+0.028
 okay+0.027
 darling+0.027
 dear+0.027
 ok+0.026
Neg logit contributions
 land-0.033
growth-0.032
|-0.031
 camp-0.030
 forest-0.029
Token od
Token ID16298
Feature activation+1.13e+01

Pos logit contributions
 sorry+0.107
 dear+0.104
 darling+0.099
 okay+0.097
 ashamed+0.096
Neg logit contributions
|-0.119
growth-0.114
 land-0.106
 camp-0.103
ground-0.102
Tokenour
Token ID454
Feature activation+2.14e+00

Pos logit contributions
 okay+0.253
 sorry+0.243
 ok+0.243
 dear+0.239
 ashamed+0.228
Neg logit contributions
 land-0.228
 camp-0.224
|-0.220
growth-0.220
 enclosure-0.219
Token wa
Token ID2082
Feature activation+5.99e+00

Pos logit contributions
 dear+0.052
ower+0.051
 sorry+0.050
 darling+0.049
 okay+0.048
Neg logit contributions
|-0.046
growth-0.040
 camp-0.038
 forests-0.036
 hunt-0.035
Tokenft
Token ID701
Feature activation+5.58e-01

Pos logit contributions
 sorry+0.118
 dear+0.116
 darling+0.110
 ashamed+0.105
 unfair+0.104
Neg logit contributions
|-0.143
 land-0.140
growth-0.133
of-0.131
 camp-0.130
Tokened
Token ID276
Feature activation-3.46e-01

Pos logit contributions
 sorry+0.010
 okay+0.009
 ashamed+0.009
 dear+0.009
 guilty+0.009
Neg logit contributions
|-0.016
growth-0.015
 camp-0.014
 land-0.014
 hunt-0.014
Token into
Token ID656
Feature activation-6.79e+00

Pos logit contributions
|+0.007
growth+0.006
 land+0.006
ground+0.006
of+0.005
Neg logit contributions
 sorry-0.009
 okay-0.009
 dear-0.009
 ashamed-0.009
 ok-0.008
Token theater
Token ID13766
Feature activation-6.20e-01

Pos logit contributions
growth+0.052
|+0.051
of+0.047
isk+0.045
Ray+0.044
Neg logit contributions
 sorry-0.077
 dear-0.073
 guilty-0.071
 ashamed-0.070
 darling-0.069
Token and
Token ID290
Feature activation+5.22e-01

Pos logit contributions
|+0.013
growth+0.012
 land+0.010
 camp+0.010
 forests+0.010
Neg logit contributions
 dear-0.016
 sorry-0.016
 darling-0.015
 okay-0.015
 ashamed-0.015
Token Larry
Token ID13633
Feature activation-2.47e-01

Pos logit contributions
 sorry+0.014
 dear+0.013
 okay+0.013
 ashamed+0.013
 darling+0.013
Neg logit contributions
|-0.011
growth-0.010
of-0.009
ground-0.009
 land-0.008
Token felt
Token ID2936
Feature activation+5.08e+00

Pos logit contributions
|+0.005
growth+0.004
 land+0.004
 camp+0.004
ground+0.004
Neg logit contributions
 sorry-0.007
 dear-0.007
 darling-0.006
 ashamed-0.006
 okay-0.006
Token his
Token ID465
Feature activation+3.38e+00

Pos logit contributions
 sorry+0.098
 dear+0.094
 darling+0.090
 okay+0.089
 ashamed+0.087
Neg logit contributions
|-0.134
growth-0.130
 land-0.124
 camp-0.121
ground-0.118
Token eyel
Token ID29708
Feature activation+1.11e+01

Pos logit contributions
 sorry+0.074
 dear+0.072
 darling+0.069
 okay+0.069
 ashamed+0.067
Neg logit contributions
|-0.078
growth-0.075
 land-0.073
 camp-0.071
ground-0.067
Tokenids
Token ID2340
Feature activation-1.99e+00

Pos logit contributions
 sorry+0.232
 dear+0.230
 unfair+0.227
 darling+0.222
 okay+0.218
Neg logit contributions
|-0.265
growth-0.257
 land-0.237
e-0.236
 camp-0.225
Token become
Token ID1716
Feature activation+2.30e+00

Pos logit contributions
|+0.040
growth+0.038
 land+0.033
 camp+0.033
 forests+0.031
Neg logit contributions
 sorry-0.052
 okay-0.050
 dear-0.050
 ok-0.048
 darling-0.048
Token heavy
Token ID4334
Feature activation+2.87e+00

Pos logit contributions
 sorry+0.050
 dear+0.049
 darling+0.047
 okay+0.046
 ashamed+0.045
Neg logit contributions
|-0.054
growth-0.052
 land-0.050
 camp-0.048
ground-0.047
Token.
Token ID13
Feature activation+1.10e+00

Pos logit contributions
 sorry+0.090
 dear+0.088
 darling+0.085
 ashamed+0.084
 okay+0.084
Neg logit contributions
|-0.037
 land-0.036
growth-0.035
 camp-0.034
 forests-0.030
Token He
Token ID679
Feature activation-1.36e+00

Pos logit contributions
 sorry+0.024
 dear+0.023
 darling+0.023
 okay+0.022
 ashamed+0.022
Neg logit contributions
|-0.026
growth-0.024
 land-0.024
 camp-0.023
ground-0.022
Token.
Token ID13
Feature activation-1.19e-01

Pos logit contributions
 sorry+0.042
 dear+0.042
 darling+0.041
 okay+0.040
 ashamed+0.039
Neg logit contributions
|-0.027
growth-0.026
 land-0.024
 camp-0.023
ground-0.022
Token He
Token ID679
Feature activation-1.58e+00

Pos logit contributions
|+0.002
growth+0.002
 land+0.002
 camp+0.002
e+0.002
Neg logit contributions
 sorry-0.003
 dear-0.003
 darling-0.003
 okay-0.003
 ashamed-0.003
Token blinked
Token ID40360
Feature activation+1.02e+00

Pos logit contributions
|+0.031
growth+0.029
ground+0.026
 outer+0.024
of+0.024
Neg logit contributions
 sorry-0.038
 dear-0.038
 okay-0.038
 ashamed-0.037
 stubborn-0.037
Token his
Token ID465
Feature activation+1.95e+00

Pos logit contributions
 sorry+0.028
 dear+0.028
 okay+0.027
 darling+0.027
 ashamed+0.026
Neg logit contributions
|-0.019
growth-0.017
 land-0.016
ground-0.015
 camp-0.015
Token long
Token ID890
Feature activation+3.36e-01

Pos logit contributions
 sorry+0.034
 dear+0.033
 darling+0.031
 okay+0.031
 ashamed+0.030
Neg logit contributions
|-0.053
growth-0.052
 land-0.051
 camp-0.050
 forests-0.048
Token eyel
Token ID29708
Feature activation+1.09e+01

Pos logit contributions
 sorry+0.007
 dear+0.007
 darling+0.007
 okay+0.007
 ashamed+0.007
Neg logit contributions
|-0.008
growth-0.008
 land-0.007
 camp-0.007
ground-0.007
Tokenashes
Token ID7465
Feature activation+2.86e-01

Pos logit contributions
 sorry+0.239
 dear+0.233
 unfair+0.231
 darling+0.225
 okay+0.221
Neg logit contributions
growth-0.240
|-0.239
 land-0.218
e-0.215
of-0.207
Token and
Token ID290
Feature activation-7.47e-02

Pos logit contributions
 sorry+0.007
 dear+0.007
 okay+0.007
 darling+0.007
 ashamed+0.006
Neg logit contributions
|-0.006
growth-0.006
 land-0.005
 camp-0.005
ground-0.005
Token kept
Token ID4030
Feature activation-8.75e-01

Pos logit contributions
|+0.002
growth+0.001
ground+0.001
 camp+0.001
of+0.001
Neg logit contributions
 sorry-0.002
 okay-0.002
 dear-0.002
 ashamed-0.002
 darling-0.002
Token hopping
Token ID47153
Feature activation-9.45e-01

Pos logit contributions
|+0.020
growth+0.018
 land+0.017
 camp+0.017
ground+0.016
Neg logit contributions
 sorry-0.021
 dear-0.020
 okay-0.019
 darling-0.019
 ashamed-0.019
Token.
Token ID13
Feature activation-4.04e-01

Pos logit contributions
|+0.020
growth+0.020
 camp+0.017
 land+0.017
ground+0.017
Neg logit contributions
 sorry-0.022
 okay-0.021
 dear-0.021
 darling-0.021
 ok-0.021
Token long
Token ID890
Feature activation+1.06e+00

Pos logit contributions
 sorry+0.040
 dear+0.040
 darling+0.038
 ashamed+0.037
 okay+0.037
Neg logit contributions
|-0.038
 land-0.035
growth-0.034
 camp-0.034
ground-0.031
Token until
Token ID1566
Feature activation+2.01e+00

Pos logit contributions
 sorry+0.030
 dear+0.029
 darling+0.028
 okay+0.028
 ashamed+0.028
Neg logit contributions
|-0.018
growth-0.017
 land-0.016
 camp-0.015
ground-0.014
Token Chloe
Token ID29476
Feature activation+2.02e+00

Pos logit contributions
 sorry+0.049
 dear+0.048
 darling+0.046
 okay+0.046
 ashamed+0.044
Neg logit contributions
|-0.043
growth-0.042
 land-0.039
 camp-0.037
 forests-0.037
Token felt
Token ID2936
Feature activation+4.68e+00

Pos logit contributions
 dear+0.056
 sorry+0.056
 darling+0.055
 okay+0.054
 ashamed+0.053
Neg logit contributions
|-0.034
growth-0.033
 land-0.029
 forests-0.029
 camp-0.028
Token her
Token ID607
Feature activation+3.20e+00

Pos logit contributions
 sorry+0.093
 dear+0.091
 darling+0.087
 okay+0.085
 ashamed+0.083
Neg logit contributions
|-0.118
growth-0.114
 land-0.110
 camp-0.107
ground-0.103
Token eyel
Token ID29708
Feature activation+1.09e+01

Pos logit contributions
 sorry+0.072
 unfair+0.070
 dear+0.070
 ashamed+0.068
 darling+0.067
Neg logit contributions
|-0.066
 land-0.066
growth-0.063
 camp-0.063
 visit-0.057
Tokenids
Token ID2340
Feature activation-1.96e+00

Pos logit contributions
 sorry+0.217
 dear+0.214
 unfair+0.213
 darling+0.206
 okay+0.199
Neg logit contributions
|-0.267
growth-0.256
e-0.237
 land-0.235
 camp-0.228
Token getting
Token ID1972
Feature activation+1.35e+00

Pos logit contributions
|+0.039
growth+0.037
 camp+0.033
 land+0.033
 forests+0.031
Neg logit contributions
 sorry-0.051
 okay-0.050
 dear-0.049
 ok-0.048
 darling-0.047
Token heavy
Token ID4334
Feature activation+1.41e+00

Pos logit contributions
 sorry+0.029
 dear+0.029
 darling+0.028
 okay+0.027
 ashamed+0.027
Neg logit contributions
|-0.031
growth-0.030
 land-0.029
 camp-0.028
ground-0.027
Token.
Token ID13
Feature activation+1.38e+00

Pos logit contributions
 sorry+0.044
 dear+0.043
 darling+0.042
 ashamed+0.041
 unfair+0.041
Neg logit contributions
|-0.018
 land-0.018
growth-0.017
 camp-0.017
ground-0.015
Token She
Token ID1375
Feature activation-1.33e+00

Pos logit contributions
 sorry+0.040
 dear+0.039
 unfair+0.038
 okay+0.038
 darling+0.037
Neg logit contributions
|-0.022
growth-0.020
 land-0.019
 camp-0.019
ground-0.017
Token thought
Token ID1807
Feature activation+7.62e+00

Pos logit contributions
|+0.003
growth+0.003
 land+0.003
 camp+0.003
ground+0.003
Neg logit contributions
 sorry-0.005
 dear-0.005
 darling-0.004
 okay-0.004
 ashamed-0.004
Token it
Token ID340
Feature activation+3.01e+00

Pos logit contributions
 sorry+0.220
 dear+0.214
 darling+0.214
 ashamed+0.213
 concerned+0.212
Neg logit contributions
growth-0.105
|-0.104
 land-0.104
 the-0.099
 camp-0.097
Token was
Token ID373
Feature activation+4.55e+00

Pos logit contributions
 sorry+0.090
 dear+0.088
 okay+0.087
 darling+0.086
 unfair+0.085
Neg logit contributions
growth-0.044
|-0.044
 land-0.038
 camp-0.037
ground-0.036
Token the
Token ID262
Feature activation+7.38e-01

Pos logit contributions
 sorry+0.110
 dear+0.108
 darling+0.105
 ashamed+0.101
 okay+0.101
Neg logit contributions
|-0.094
 land-0.088
growth-0.087
 camp-0.084
 forests-0.079
Token best
Token ID1266
Feature activation+7.61e+00

Pos logit contributions
 sorry+0.013
 dear+0.012
 darling+0.012
 concerned+0.011
 ashamed+0.011
Neg logit contributions
 land-0.020
|-0.019
 camp-0.019
growth-0.018
 distance-0.018
Token sc
Token ID629
Feature activation+1.08e+01

Pos logit contributions
 sorry+0.217
 dear+0.211
 darling+0.207
 ashamed+0.202
 concerned+0.202
Neg logit contributions
 land-0.124
|-0.122
 camp-0.109
growth-0.109
 visit-0.105
Tokenooter
Token ID25141
Feature activation+3.32e+00

Pos logit contributions
 sorry+0.173
 darling+0.167
 dear+0.163
 okay+0.160
 unfair+0.160
Neg logit contributions
|-0.298
 land-0.284
of-0.271
growth-0.266
e-0.266
Token in
Token ID287
Feature activation-5.64e+00

Pos logit contributions
 sorry+0.100
 dear+0.096
 darling+0.095
 okay+0.093
 concerned+0.093
Neg logit contributions
|-0.050
 land-0.046
growth-0.044
e-0.041
 camp-0.040
Token the
Token ID262
Feature activation-2.94e+00

Pos logit contributions
|+0.116
 enclosure+0.110
 hunt+0.103
growth+0.103
 trek+0.103
Neg logit contributions
 sorry-0.130
 okay-0.127
 dear-0.124
 darling-0.122
 unfair-0.119
Token whole
Token ID2187
Feature activation-1.52e+00

Pos logit contributions
growth+0.057
|+0.055
ground+0.048
of+0.046
 hunt+0.045
Neg logit contributions
 sorry-0.077
 okay-0.073
 unfair-0.072
 ashamed-0.072
 guilty-0.072
Token world
Token ID995
Feature activation+4.76e-01

Pos logit contributions
growth+0.034
|+0.033
ground+0.029
 hunt+0.028
 enclosure+0.028
Neg logit contributions
 sorry-0.032
 okay-0.031
 important-0.031
 dear-0.031
 unfair-0.031
Token "
Token ID366
Feature activation-1.82e-01

Pos logit contributions
 dear+0.094
 darling+0.091
 sorry+0.090
 unfair+0.089
 fault+0.087
Neg logit contributions
 land-0.091
|-0.090
growth-0.088
 the-0.087
 camp-0.087
TokenIt
Token ID1026
Feature activation+7.16e-01

Pos logit contributions
|+0.004
growth+0.004
 land+0.003
 camp+0.003
 hunt+0.003
Neg logit contributions
 dear-0.004
 sorry-0.004
 darling-0.004
 okay-0.004
 ok-0.004
Token is
Token ID318
Feature activation+3.59e+00

Pos logit contributions
 sorry+0.021
 dear+0.020
 darling+0.019
 okay+0.019
rier+0.019
Neg logit contributions
|-0.011
 land-0.011
growth-0.010
 camp-0.010
 hunt-0.009
Token an
Token ID281
Feature activation+3.84e+00

Pos logit contributions
 sorry+0.112
 dear+0.110
 darling+0.107
 ashamed+0.105
rier+0.103
Neg logit contributions
|-0.046
 land-0.045
growth-0.043
 camp-0.042
ground-0.038
Token old
Token ID1468
Feature activation+4.20e+00

Pos logit contributions
 sorry+0.084
 okay+0.078
 dear+0.077
 ashamed+0.076
ouncing+0.076
Neg logit contributions
 land-0.085
growth-0.082
|-0.079
 forest-0.077
 town-0.076
Token wishing
Token ID24433
Feature activation+1.05e+01

Pos logit contributions
 sorry+0.114
 okay+0.111
 dear+0.110
 ok+0.107
 darling+0.107
Neg logit contributions
 land-0.073
|-0.067
 camp-0.064
growth-0.064
 hunt-0.062
Token well
Token ID880
Feature activation+2.67e+00

Pos logit contributions
 sorry+0.272
 dear+0.264
 okay+0.259
 darling+0.258
 ashamed+0.254
Neg logit contributions
 land-0.191
growth-0.182
|-0.180
 camp-0.173
ground-0.164
Token,
Token ID11
Feature activation+5.07e+00

Pos logit contributions
 sorry+0.074
 dear+0.073
 darling+0.070
 okay+0.070
 ashamed+0.069
Neg logit contributions
|-0.046
growth-0.043
 land-0.042
 camp-0.040
ground-0.038
Token M
Token ID337
Feature activation+4.52e+00

Pos logit contributions
 sorry+0.129
ower+0.124
 unfair+0.121
 ashamed+0.120
 dear+0.120
Neg logit contributions
 land-0.087
 the-0.084
 camp-0.081
growth-0.078
|-0.077
Tokenandy
Token ID10757
Feature activation+2.61e+00

Pos logit contributions
 sorry+0.137
 ashamed+0.126
 dear+0.126
 okay+0.123
 unfair+0.123
Neg logit contributions
|-0.070
growth-0.066
 land-0.063
 camp-0.060
of-0.059
Token.
Token ID13
Feature activation+3.68e+00

Pos logit contributions
 sorry+0.076
 dear+0.074
 okay+0.072
 darling+0.072
 ashamed+0.071
Neg logit contributions
|-0.040
 land-0.038
growth-0.037
 camp-0.035
ground-0.033
Token a
Token ID257
Feature activation-2.66e-01

Pos logit contributions
|+0.111
growth+0.104
ground+0.103
 enclosure+0.097
field+0.096
Neg logit contributions
 okay-0.144
 sorry-0.142
 dear-0.141
 darling-0.132
 ashamed-0.130
Token time
Token ID640
Feature activation+9.35e-01

Pos logit contributions
|+0.005
growth+0.004
 land+0.004
 camp+0.004
ground+0.004
Neg logit contributions
 sorry-0.007
 dear-0.007
 darling-0.007
 okay-0.007
 ashamed-0.007
Token there
Token ID612
Feature activation-3.51e+00

Pos logit contributions
 sorry+0.025
 dear+0.024
 okay+0.023
 darling+0.023
 ashamed+0.023
Neg logit contributions
|-0.018
growth-0.017
 land-0.016
 camp-0.015
ground-0.015
Token was
Token ID373
Feature activation+9.37e-01

Pos logit contributions
growth+0.079
|+0.071
 forests+0.069
 hunt+0.067
 valleys+0.067
Neg logit contributions
 okay-0.080
 sorry-0.076
 dear-0.076
 ok-0.073
 darling-0.073
Token a
Token ID257
Feature activation+7.12e-01

Pos logit contributions
 sorry+0.031
 dear+0.030
 darling+0.029
 okay+0.029
 ashamed+0.029
Neg logit contributions
|-0.012
growth-0.011
 land-0.010
 camp-0.010
ground-0.009
Token Stub
Token ID41135
Feature activation+1.05e+01

Pos logit contributions
 sorry+0.017
 dear+0.016
 darling+0.016
 ok+0.016
 okay+0.016
Neg logit contributions
 land-0.015
|-0.014
 camp-0.014
growth-0.013
 town-0.013
Tokenborn
Token ID6286
Feature activation+3.24e+00

Pos logit contributions
 sorry+0.188
 dear+0.183
 darling+0.177
 okay+0.172
 ashamed+0.167
Neg logit contributions
|-0.288
growth-0.272
 land-0.268
 camp-0.256
ground-0.253
Token Old
Token ID5706
Feature activation+2.01e+00

Pos logit contributions
 dear+0.084
 sorry+0.082
 darling+0.081
 okay+0.078
 ashamed+0.076
Neg logit contributions
|-0.064
growth-0.061
 land-0.054
ground-0.053
 camp-0.052
Token Woman
Token ID14381
Feature activation+2.23e+00

Pos logit contributions
 dear+0.050
 sorry+0.049
 darling+0.048
 okay+0.047
 ashamed+0.045
Neg logit contributions
|-0.042
growth-0.040
 land-0.036
 camp-0.034
ground-0.034
Token.
Token ID13
Feature activation+1.51e+00

Pos logit contributions
 dear+0.058
 sorry+0.056
 darling+0.055
 okay+0.053
 ashamed+0.053
Neg logit contributions
|-0.044
growth-0.041
 land-0.037
 camp-0.037
 miles-0.036
Token She
Token ID1375
Feature activation-1.50e+00

Pos logit contributions
 sorry+0.035
 dear+0.035
 darling+0.033
 okay+0.033
 ashamed+0.032
Neg logit contributions
|-0.033
growth-0.031
 land-0.030
 camp-0.028
ground-0.028
Token.
Token ID13
Feature activation+1.86e+00

Pos logit contributions
 sorry+0.121
 dear+0.118
 darling+0.115
 okay+0.113
 ashamed+0.112
Neg logit contributions
|-0.073
growth-0.069
 land-0.066
 camp-0.063
ground-0.060
Token It
Token ID632
Feature activation-3.41e-01

Pos logit contributions
 sorry+0.042
 dear+0.041
 darling+0.039
 okay+0.039
 ashamed+0.038
Neg logit contributions
|-0.042
growth-0.041
 land-0.039
 camp-0.038
ground-0.037
Token was
Token ID373
Feature activation+3.35e+00

Pos logit contributions
|+0.006
growth+0.006
ground+0.005
 camp+0.005
 land+0.004
Neg logit contributions
 sorry-0.009
 dear-0.009
 ashamed-0.009
 darling-0.009
 okay-0.009
Token a
Token ID257
Feature activation+2.35e+00

Pos logit contributions
 sorry+0.099
 dear+0.096
 okay+0.094
 darling+0.094
 ok+0.091
Neg logit contributions
|-0.058
growth-0.052
 land-0.047
of-0.045
 camp-0.045
Token magical
Token ID10883
Feature activation+3.73e+00

Pos logit contributions
 sorry+0.059
 dear+0.058
 okay+0.056
 darling+0.056
 ashamed+0.055
Neg logit contributions
|-0.045
 land-0.043
growth-0.043
 camp-0.041
ground-0.039
Token wishing
Token ID24433
Feature activation+1.04e+01

Pos logit contributions
 sorry+0.100
 dear+0.098
 darling+0.095
 okay+0.094
 ashamed+0.092
Neg logit contributions
|-0.068
growth-0.065
 land-0.062
 camp-0.059
ground-0.056
Token star
Token ID3491
Feature activation+9.36e-01

Pos logit contributions
 sorry+0.264
 dear+0.259
 darling+0.250
 okay+0.247
 ashamed+0.243
Neg logit contributions
|-0.206
growth-0.197
 land-0.189
 camp-0.181
ground-0.174
Token!
Token ID0
Feature activation+1.31e+00

Pos logit contributions
 sorry+0.025
 dear+0.025
 darling+0.024
 okay+0.024
 ashamed+0.023
Neg logit contributions
|-0.017
growth-0.016
 land-0.015
 camp-0.014
ground-0.014
Token
Token ID198
Feature activation-5.66e+00

Pos logit contributions
 sorry+0.033
 dear+0.032
 darling+0.031
 okay+0.031
 ashamed+0.030
Neg logit contributions
|-0.027
growth-0.025
 land-0.024
 camp-0.023
ground-0.022
Token
Token ID198
Feature activation-4.66e+00

Pos logit contributions
|+0.096
growth+0.091
 land+0.087
 camp+0.082
 hunt+0.080
Neg logit contributions
 sorry-0.156
 dear-0.148
 darling-0.148
 okay-0.147
 concerned-0.146
TokenThe
Token ID464
Feature activation-8.49e-01

Pos logit contributions
|+0.093
growth+0.087
 land+0.084
 camp+0.081
 hunt+0.077
Neg logit contributions
 sorry-0.118
 dear-0.114
 darling-0.112
 okay-0.111
 ashamed-0.109
TokenThat
Token ID2504
Feature activation+3.66e+00

Pos logit contributions
|+0.002
growth+0.002
 camp+0.002
 land+0.002
 hunt+0.002
Neg logit contributions
 dear-0.002
 darling-0.002
 sorry-0.002
 okay-0.002
 ok-0.002
Token's
Token ID338
Feature activation+3.12e+00

Pos logit contributions
 sorry+0.108
 dear+0.104
 darling+0.101
 okay+0.101
 ashamed+0.099
Neg logit contributions
|-0.056
 land-0.055
growth-0.051
 camp-0.050
 hunt-0.047
Token a
Token ID257
Feature activation+3.14e+00

Pos logit contributions
 sorry+0.075
 dear+0.074
 darling+0.071
 okay+0.070
 ashamed+0.069
Neg logit contributions
|-0.067
growth-0.063
 land-0.062
 camp-0.059
ground-0.057
Token very
Token ID845
Feature activation+7.52e+00

Pos logit contributions
 sorry+0.070
 dear+0.069
 darling+0.067
 ashamed+0.065
rier+0.065
Neg logit contributions
 land-0.069
|-0.067
 camp-0.063
 visit-0.060
growth-0.060
Token nice
Token ID3621
Feature activation+6.02e+00

Pos logit contributions
 sorry+0.143
 fault+0.142
 darling+0.141
rier+0.141
 concerned+0.140
Neg logit contributions
 land-0.177
|-0.163
 camp-0.157
 hunt-0.155
 play-0.155
Token sc
Token ID629
Feature activation+1.04e+01

Pos logit contributions
 sorry+0.165
 dear+0.156
 okay+0.155
 darling+0.153
rier+0.150
Neg logit contributions
 land-0.107
|-0.100
 camp-0.096
 visit-0.090
 town-0.089
Tokenooter
Token ID25141
Feature activation+3.06e+00

Pos logit contributions
 sorry+0.169
 darling+0.163
 okay+0.161
 dear+0.159
 ok+0.158
Neg logit contributions
|-0.277
 land-0.267
of-0.261
 hunt-0.251
e-0.248
Token.
Token ID13
Feature activation+4.50e+00

Pos logit contributions
 sorry+0.090
 dear+0.086
 darling+0.085
 okay+0.085
 concerned+0.084
Neg logit contributions
|-0.045
 land-0.043
growth-0.041
 camp-0.040
e-0.038
Token Did
Token ID7731
Feature activation+3.18e+00

Pos logit contributions
 sorry+0.091
 concerned+0.087
 dear+0.086
 ashamed+0.085
 unfair+0.084
Neg logit contributions
|-0.109
growth-0.106
 land-0.098
e-0.096
ground-0.096
Token you
Token ID345
Feature activation+1.42e+00

Pos logit contributions
 sorry+0.070
 dear+0.069
 darling+0.065
 okay+0.065
 ashamed+0.062
Neg logit contributions
|-0.077
growth-0.073
 land-0.069
 camp-0.067
ground-0.066
Token polish
Token ID25245
Feature activation-6.08e-01

Pos logit contributions
 sorry+0.037
 okay+0.036
 dear+0.036
 ok+0.034
 darling+0.034
Neg logit contributions
growth-0.028
|-0.028
 forests-0.025
 valleys-0.025
ground-0.025
Token seems
Token ID2331
Feature activation+8.01e+00

Pos logit contributions
 sorry+0.007
 dear+0.007
 darling+0.007
 okay+0.006
 ashamed+0.006
Neg logit contributions
|-0.005
growth-0.005
 land-0.005
 camp-0.004
 forests-0.004
Token like
Token ID588
Feature activation+2.43e+00

Pos logit contributions
ower+0.184
 sorry+0.184
 dear+0.182
 darling+0.178
ushing+0.177
Neg logit contributions
 land-0.157
|-0.152
 the-0.149
 camp-0.147
 hunt-0.135
Token a
Token ID257
Feature activation+1.86e+00

Pos logit contributions
 sorry+0.054
 dear+0.053
 darling+0.051
 okay+0.050
 ashamed+0.049
Neg logit contributions
|-0.057
growth-0.054
 land-0.051
 camp-0.050
ground-0.049
Token depend
Token ID4745
Feature activation+4.33e+00

Pos logit contributions
 sorry+0.045
 dear+0.044
 darling+0.043
 concerned+0.042
ushing+0.042
Neg logit contributions
 land-0.037
|-0.035
 camp-0.034
growth-0.033
 hunt-0.032
Tokenable
Token ID540
Feature activation+5.52e+00

Pos logit contributions
 sorry+0.103
 dear+0.100
 darling+0.097
 okay+0.097
 ok+0.095
Neg logit contributions
|-0.096
 land-0.089
growth-0.084
of-0.082
e-0.082
Token sc
Token ID629
Feature activation+1.04e+01

Pos logit contributions
 sorry+0.146
 dear+0.138
 darling+0.137
 concerned+0.136
 okay+0.136
Neg logit contributions
 land-0.100
|-0.093
 camp-0.089
 visit-0.085
 hunt-0.085
Tokenooter
Token ID25141
Feature activation+3.00e+00

Pos logit contributions
 sorry+0.170
 darling+0.164
 dear+0.159
 okay+0.158
 ok+0.157
Neg logit contributions
|-0.275
 land-0.269
of-0.259
 hunt-0.250
e-0.247
Token."
Token ID526
Feature activation-1.03e+00

Pos logit contributions
 sorry+0.081
 concerned+0.077
 dear+0.077
 darling+0.076
 okay+0.075
Neg logit contributions
|-0.052
 land-0.051
growth-0.046
 camp-0.046
 hunt-0.046
Token
Token ID198
Feature activation-6.26e+00

Pos logit contributions
|+0.018
 land+0.014
 play+0.013
 the+0.013
 camp+0.013
Neg logit contributions
 sorry-0.030
rier-0.030
 dear-0.029
 ok-0.029
 concerned-0.029
Token
Token ID198
Feature activation-5.13e+00

Pos logit contributions
growth+0.098
|+0.090
 land+0.086
 camp+0.081
 forests+0.080
Neg logit contributions
 sorry-0.185
 dear-0.182
 okay-0.180
 darling-0.179
 ashamed-0.172
TokenL
Token ID43
Feature activation+8.71e-01

Pos logit contributions
|+0.050
 land+0.040
of+0.036
The+0.035
growth+0.034
Neg logit contributions
 sorry-0.185
 dear-0.179
 fault-0.178
rier-0.174
 darling-0.174
Token,
Token ID11
Feature activation+4.36e+00

Pos logit contributions
 sorry+0.072
 dear+0.070
 unfair+0.068
 ashamed+0.067
 okay+0.067
Neg logit contributions
|-0.032
 land-0.031
e-0.028
growth-0.028
 camp-0.028
Token I
Token ID314
Feature activation+7.36e-02

Pos logit contributions
 sorry+0.107
 dear+0.105
 darling+0.101
 okay+0.100
 ashamed+0.099
Neg logit contributions
|-0.089
growth-0.085
 land-0.082
 camp-0.079
ground-0.075
Token'm
Token ID1101
Feature activation+5.26e+00

Pos logit contributions
 sorry+0.002
 dear+0.002
 okay+0.002
 darling+0.002
 ashamed+0.002
Neg logit contributions
growth-0.001
|-0.001
 forests-0.001
 camp-0.001
 valleys-0.001
Token a
Token ID257
Feature activation+3.03e+00

Pos logit contributions
 dear+0.128
 sorry+0.126
 darling+0.124
 unfair+0.120
ower+0.120
Neg logit contributions
|-0.101
 land-0.096
growth-0.096
 camp-0.091
e-0.088
Token magical
Token ID10883
Feature activation+4.01e+00

Pos logit contributions
 sorry+0.075
ouncing+0.074
 dear+0.074
 okay+0.073
ushing+0.072
Neg logit contributions
 land-0.056
|-0.054
 camp-0.052
growth-0.050
 town-0.046
Token wishing
Token ID24433
Feature activation+1.03e+01

Pos logit contributions
 sorry+0.110
 dear+0.107
 okay+0.104
 darling+0.104
 ashamed+0.102
Neg logit contributions
|-0.070
growth-0.067
 land-0.065
 camp-0.062
ground-0.058
Token star
Token ID3491
Feature activation+1.08e+00

Pos logit contributions
 sorry+0.272
 dear+0.265
 darling+0.257
 okay+0.256
 ashamed+0.253
Neg logit contributions
|-0.185
 land-0.178
growth-0.177
 camp-0.167
ground-0.157
Token.
Token ID13
Feature activation+3.04e+00

Pos logit contributions
 sorry+0.028
 dear+0.028
 darling+0.027
 okay+0.026
 ashamed+0.026
Neg logit contributions
|-0.021
growth-0.020
 land-0.018
 camp-0.018
ground-0.017
Token Do
Token ID2141
Feature activation+1.99e+00

Pos logit contributions
 sorry+0.062
 dear+0.060
 unfair+0.058
 ashamed+0.057
rier+0.057
Neg logit contributions
|-0.073
growth-0.071
 land-0.066
 camp-0.065
ground-0.065
Token you
Token ID345
Feature activation+2.14e+00

Pos logit contributions
 dear+0.040
 sorry+0.040
 okay+0.038
 darling+0.038
 ok+0.036
Neg logit contributions
|-0.051
growth-0.050
 land-0.047
ground-0.046
 camp-0.046
Token want
Token ID765
Feature activation+5.63e-01

Pos logit contributions
 sorry+0.041
 dear+0.040
 okay+0.038
 darling+0.038
 ok+0.037
Neg logit contributions
|-0.056
growth-0.055
 land-0.052
 camp-0.051
ground-0.050
Token
Token ID241
Feature activation+9.61e-01

Pos logit contributions
 sorry+0.046
 dear+0.046
 darling+0.044
 okay+0.043
 ashamed+0.042
Neg logit contributions
|-0.054
growth-0.051
 land-0.050
 camp-0.047
e-0.047
TokenYou
Token ID1639
Feature activation+3.90e+00

Pos logit contributions
 sorry+0.025
 dear+0.023
 okay+0.023
 darling+0.022
 ashamed+0.022
Neg logit contributions
|-0.019
growth-0.018
 land-0.017
 camp-0.016
ground-0.016
Token are
Token ID389
Feature activation+2.38e+00

Pos logit contributions
 sorry+0.098
 dear+0.097
 darling+0.094
 okay+0.093
 ashamed+0.092
Neg logit contributions
|-0.079
growth-0.077
 land-0.071
 camp-0.070
 forests-0.069
Token an
Token ID281
Feature activation+3.81e+00

Pos logit contributions
 sorry+0.056
 dear+0.056
 darling+0.054
 okay+0.053
 ashamed+0.052
Neg logit contributions
|-0.051
growth-0.048
 land-0.047
 camp-0.045
ground-0.043
Token amazing
Token ID4998
Feature activation+4.72e+00

Pos logit contributions
ower+0.071
 sorry+0.070
ouncing+0.069
idents+0.066
 dear+0.064
Neg logit contributions
|-0.091
 land-0.090
e-0.088
growth-0.086
 camp-0.085
Token resc
Token ID6811
Feature activation+1.02e+01

Pos logit contributions
 sorry+0.120
 unfair+0.114
 dear+0.113
 okay+0.113
 ashamed+0.113
Neg logit contributions
 land-0.088
 camp-0.084
|-0.080
growth-0.078
 visit-0.075
Tokenuer
Token ID15573
Feature activation+4.00e+00

Pos logit contributions
 sorry+0.181
 okay+0.179
 dear+0.179
 darling+0.175
 ashamed+0.173
Neg logit contributions
|-0.252
 land-0.239
ground-0.236
 hunt-0.228
e-0.226
Token!
Token ID0
Feature activation+2.83e+00

Pos logit contributions
 sorry+0.106
 dear+0.103
 darling+0.099
 okay+0.099
 ashamed+0.098
Neg logit contributions
|-0.073
growth-0.069
 land-0.068
 camp-0.065
ground-0.062
Token I
Token ID314
Feature activation-4.38e-01

Pos logit contributions
 sorry+0.055
 dear+0.052
 unfair+0.050
 ashamed+0.049
 okay+0.049
Neg logit contributions
|-0.073
growth-0.069
 land-0.065
ground-0.065
 camp-0.064
Token bet
Token ID731
Feature activation+6.61e+00

Pos logit contributions
growth+0.008
|+0.008
 forests+0.007
 land+0.007
 valleys+0.007
Neg logit contributions
 sorry-0.011
 dear-0.011
 okay-0.011
 darling-0.011
 ashamed-0.011
Token nobody
Token ID8168
Feature activation+5.05e+00

Pos logit contributions
 sorry+0.162
 dear+0.160
 darling+0.154
 okay+0.152
 ashamed+0.148
Neg logit contributions
|-0.137
growth-0.131
 land-0.125
 camp-0.120
ground-0.116
Token and
Token ID290
Feature activation-4.38e-01

Pos logit contributions
 dear+0.121
 darling+0.117
 unfair+0.114
 fault+0.114
 sorry+0.114
Neg logit contributions
 the-0.043
 land-0.039
 camp-0.036
|-0.036
growth-0.035
Token said
Token ID531
Feature activation+3.50e+00

Pos logit contributions
|+0.010
growth+0.010
 land+0.009
 camp+0.009
ground+0.009
Neg logit contributions
 sorry-0.010
 dear-0.010
 darling-0.009
 okay-0.009
 ashamed-0.009
Token,
Token ID11
Feature activation+4.31e+00

Pos logit contributions
 dear+0.104
 unfair+0.103
 darling+0.102
 sorry+0.101
 ashamed+0.100
Neg logit contributions
 the-0.039
|-0.037
growth-0.037
 land-0.036
of-0.035
Token "
Token ID366
Feature activation-4.70e-02

Pos logit contributions
 dear+0.098
 darling+0.094
 unfair+0.093
 sorry+0.091
 fault+0.090
Neg logit contributions
 the-0.081
 land-0.080
|-0.080
growth-0.080
ground-0.079
TokenG
Token ID38
Feature activation+1.58e+00

Pos logit contributions
|+0.001
growth+0.001
 land+0.001
 camp+0.001
ground+0.001
Neg logit contributions
 sorry-0.001
 dear-0.001
 darling-0.001
 okay-0.001
 ashamed-0.001
Tokenlad
Token ID9435
Feature activation+1.02e+01

Pos logit contributions
 sorry+0.034
 dear+0.034
 darling+0.033
 okay+0.033
 ok+0.032
Neg logit contributions
|-0.036
growth-0.035
 land-0.034
 camp-0.033
 forests-0.031
Token you
Token ID345
Feature activation+2.85e+00

Pos logit contributions
 sorry+0.290
 concerned+0.285
 ashamed+0.272
ower+0.262
 dear+0.259
Neg logit contributions
 the-0.152
of-0.149
 land-0.146
|-0.146
 camp-0.132
Token had
Token ID550
Feature activation+1.07e+00

Pos logit contributions
 sorry+0.070
 dear+0.070
 okay+0.068
 darling+0.068
 ashamed+0.067
Neg logit contributions
growth-0.056
|-0.056
 camp-0.051
ground-0.051
 forests-0.050
Token fun
Token ID1257
Feature activation+3.14e+00

Pos logit contributions
 dear+0.029
 sorry+0.029
 okay+0.029
 darling+0.028
 ashamed+0.028
Neg logit contributions
|-0.020
growth-0.019
 land-0.017
 camp-0.017
ground-0.017
Token,
Token ID11
Feature activation+4.25e+00

Pos logit contributions
 sorry+0.082
 dear+0.081
 darling+0.078
 okay+0.077
 ashamed+0.076
Neg logit contributions
|-0.057
 land-0.054
growth-0.054
 camp-0.052
e-0.049
Token Sam
Token ID3409
Feature activation+1.83e+00

Pos logit contributions
 sorry+0.108
ower+0.105
rier+0.104
 concerned+0.104
 ashamed+0.103
Neg logit contributions
 the-0.071
 land-0.071
 camp-0.067
of-0.066
growth-0.066
Token and
Token ID290
Feature activation+7.99e-01

Pos logit contributions
 dear+0.120
 sorry+0.118
 darling+0.117
 unfair+0.114
ower+0.111
Neg logit contributions
 land-0.057
growth-0.054
 camp-0.053
 the-0.053
|-0.052
Token said
Token ID531
Feature activation+3.79e+00

Pos logit contributions
 sorry+0.019
 dear+0.018
 darling+0.017
 ashamed+0.017
 okay+0.017
Neg logit contributions
|-0.018
growth-0.017
 land-0.016
 camp-0.016
ground-0.015
Token:
Token ID25
Feature activation+2.72e+00

Pos logit contributions
 dear+0.105
 darling+0.104
 sorry+0.103
 ashamed+0.100
 unfair+0.100
Neg logit contributions
|-0.055
growth-0.054
 land-0.052
 camp-0.051
 the-0.048
Token "
Token ID366
Feature activation-2.17e-01

Pos logit contributions
 sorry+0.049
 dear+0.046
 okay+0.044
 darling+0.044
 ashamed+0.042
Neg logit contributions
|-0.076
growth-0.073
 land-0.071
 camp-0.068
 forests-0.066
TokenG
Token ID38
Feature activation+1.25e+00

Pos logit contributions
|+0.005
growth+0.005
 land+0.005
 camp+0.005
 hunt+0.004
Neg logit contributions
 dear-0.004
 okay-0.004
 darling-0.004
 sorry-0.004
 ok-0.004
Tokenlad
Token ID9435
Feature activation+1.02e+01

Pos logit contributions
 sorry+0.026
 dear+0.026
 darling+0.025
 okay+0.024
 ok+0.024
Neg logit contributions
|-0.030
growth-0.029
 land-0.028
 camp-0.027
 forests-0.026
Token to
Token ID284
Feature activation-4.33e+00

Pos logit contributions
 sorry+0.291
 concerned+0.279
 ashamed+0.275
ower+0.267
 dear+0.262
Neg logit contributions
|-0.148
of-0.147
 the-0.146
 land-0.144
 camp-0.136
Token help
Token ID1037
Feature activation+2.17e-01

Pos logit contributions
|+0.089
growth+0.086
 valleys+0.086
 forests+0.085
ground+0.077
Neg logit contributions
 sorry-0.113
 okay-0.111
 dear-0.109
 ok-0.106
 darling-0.104
Token".
Token ID1911
Feature activation+5.93e-01

Pos logit contributions
 sorry+0.006
 dear+0.006
 darling+0.005
 okay+0.005
 ashamed+0.005
Neg logit contributions
|-0.004
growth-0.004
 land-0.004
 camp-0.004
ground-0.004
Token
Token ID220
Feature activation+1.70e+00

Pos logit contributions
 sorry+0.014
 dear+0.013
 darling+0.013
 ashamed+0.013
 okay+0.013
Neg logit contributions
|-0.013
 land-0.012
growth-0.012
 camp-0.011
 hunt-0.011
Token
Token ID198
Feature activation-5.35e+00

Pos logit contributions
 sorry+0.054
 dear+0.053
 okay+0.052
 darling+0.052
 ashamed+0.050
Neg logit contributions
|-0.023
growth-0.022
 land-0.021
 camp-0.018
 hunt-0.018
Token had
Token ID550
Feature activation+1.57e-01

Pos logit contributions
growth+0.039
|+0.039
ground+0.034
 camp+0.034
 land+0.033
Neg logit contributions
 sorry-0.047
 dear-0.047
 okay-0.045
 darling-0.044
 ashamed-0.044
Token a
Token ID257
Feature activation+9.33e-01

Pos logit contributions
 sorry+0.005
 dear+0.005
 okay+0.005
 darling+0.004
 ashamed+0.004
Neg logit contributions
|-0.002
growth-0.002
 land-0.002
 camp-0.002
ground-0.002
Token bike
Token ID7161
Feature activation-6.11e-02

Pos logit contributions
rier+0.025
 sorry+0.024
ushing+0.024
 dear+0.024
 ok+0.024
Neg logit contributions
 land-0.017
 camp-0.016
 visit-0.014
|-0.014
 town-0.014
Token,
Token ID11
Feature activation+3.35e+00

Pos logit contributions
|+0.001
growth+0.001
 camp+0.001
 land+0.001
ground+0.001
Neg logit contributions
 sorry-0.002
 dear-0.002
 darling-0.002
 okay-0.001
 ashamed-0.001
Token a
Token ID257
Feature activation+1.46e+00

Pos logit contributions
 sorry+0.112
 dear+0.108
 darling+0.107
rier+0.106
 ashamed+0.104
Neg logit contributions
|-0.036
growth-0.035
 land-0.035
 forests-0.030
 camp-0.030
Token sc
Token ID629
Feature activation+1.01e+01

Pos logit contributions
 sorry+0.035
ushing+0.035
 dear+0.035
 darling+0.035
rier+0.034
Neg logit contributions
 land-0.028
 camp-0.026
|-0.025
 town-0.024
 visit-0.024
Tokenooter
Token ID25141
Feature activation+2.08e+00

Pos logit contributions
 sorry+0.168
 unfair+0.166
 darling+0.163
 okay+0.159
 ok+0.158
Neg logit contributions
|-0.258
 land-0.254
of-0.251
e-0.243
 hunt-0.237
Token and
Token ID290
Feature activation+6.40e-01

Pos logit contributions
 sorry+0.058
 dear+0.057
 darling+0.055
 okay+0.055
 ashamed+0.054
Neg logit contributions
|-0.035
growth-0.033
 land-0.033
 camp-0.030
ground-0.029
Token a
Token ID257
Feature activation+1.18e+00

Pos logit contributions
 sorry+0.020
rier+0.019
 dear+0.019
 darling+0.019
 okay+0.019
Neg logit contributions
growth-0.008
|-0.008
 land-0.008
 camp-0.007
 the-0.007
Token tr
Token ID491
Feature activation+2.34e+00

Pos logit contributions
ushing+0.028
 sorry+0.028
 dear+0.028
 darling+0.027
 ok+0.027
Neg logit contributions
 land-0.023
 camp-0.021
|-0.020
 town-0.019
 row-0.019
Tokenicycle
Token ID35298
Feature activation+2.33e+00

Pos logit contributions
 sorry+0.049
 dear+0.048
 darling+0.047
 okay+0.045
 ashamed+0.045
Neg logit contributions
|-0.055
growth-0.053
 land-0.051
 camp-0.050
ground-0.048
Token and
Token ID290
Feature activation+7.52e-01

Pos logit contributions
 sorry+0.097
 dear+0.096
 darling+0.092
 okay+0.090
 unfair+0.090
Neg logit contributions
|-0.067
growth-0.065
 land-0.064
 camp-0.061
 forests-0.058
Token said
Token ID531
Feature activation+3.37e+00

Pos logit contributions
 sorry+0.019
 dear+0.018
 ashamed+0.017
 darling+0.017
 okay+0.017
Neg logit contributions
|-0.016
growth-0.015
 land-0.014
 camp-0.014
ground-0.014
Token,
Token ID11
Feature activation+4.04e+00

Pos logit contributions
 dear+0.092
 sorry+0.092
 darling+0.090
 unfair+0.089
 ashamed+0.088
Neg logit contributions
|-0.052
growth-0.051
 land-0.050
 camp-0.048
ground-0.046
Token "
Token ID366
Feature activation+1.47e-01

Pos logit contributions
 sorry+0.078
 dear+0.077
 darling+0.074
 unfair+0.072
 okay+0.071
Neg logit contributions
|-0.100
growth-0.097
 land-0.095
 camp-0.092
ground-0.092
TokenG
Token ID38
Feature activation+1.70e+00

Pos logit contributions
 dear+0.003
 sorry+0.003
 darling+0.003
 okay+0.003
 ok+0.003
Neg logit contributions
|-0.003
growth-0.003
 land-0.003
 camp-0.003
 hunt-0.003
Tokenlad
Token ID9435
Feature activation+1.01e+01

Pos logit contributions
 dear+0.036
 sorry+0.036
 darling+0.035
 okay+0.034
 ok+0.033
Neg logit contributions
|-0.040
growth-0.039
 land-0.037
 camp-0.036
 hunt-0.034
Token we
Token ID356
Feature activation+6.71e-01

Pos logit contributions
 sorry+0.267
 concerned+0.251
 ashamed+0.249
 dear+0.242
 unfair+0.239
Neg logit contributions
|-0.176
 land-0.173
of-0.163
 camp-0.162
ground-0.160
Token could
Token ID714
Feature activation-1.01e+00

Pos logit contributions
 sorry+0.017
 okay+0.016
 ok+0.016
 dear+0.016
 ashamed+0.016
Neg logit contributions
growth-0.014
|-0.013
ground-0.012
 forests-0.012
 land-0.012
Token help
Token ID1037
Feature activation-4.94e-01

Pos logit contributions
|+0.018
growth+0.017
 valleys+0.015
 forests+0.015
ground+0.014
Neg logit contributions
 okay-0.029
 sorry-0.028
 dear-0.028
 ok-0.028
 darling-0.027
Token.
Token ID13
Feature activation+3.92e+00

Pos logit contributions
|+0.011
growth+0.010
 land+0.008
 valleys+0.008
 forests+0.008
Neg logit contributions
 dear-0.013
 sorry-0.012
 darling-0.012
 okay-0.012
 ok-0.012
Token Have
Token ID8192
Feature activation-1.11e+00

Pos logit contributions
 sorry+0.071
 dear+0.069
 darling+0.065
 ashamed+0.065
 unfair+0.064
Neg logit contributions
|-0.106
growth-0.102
 land-0.097
 camp-0.096
ground-0.094
Token mom
Token ID1995
Feature activation-1.06e+00

Pos logit contributions
rier+0.065
 sorry+0.065
 dear+0.064
 darling+0.063
 unfair+0.063
Neg logit contributions
 land-0.059
|-0.058
 camp-0.056
growth-0.054
 visit-0.052
Token replied
Token ID8712
Feature activation+2.58e+00

Pos logit contributions
growth+0.024
|+0.024
 land+0.022
 camp+0.022
ground+0.022
Neg logit contributions
 dear-0.024
 sorry-0.024
 darling-0.024
 okay-0.023
 ok-0.023
Token,
Token ID11
Feature activation+3.78e+00

Pos logit contributions
 unfair+0.082
 dear+0.080
 darling+0.079
 fault+0.078
 sorry+0.078
Neg logit contributions
 the-0.025
|-0.024
 land-0.024
e-0.023
growth-0.023
Token "
Token ID366
Feature activation-3.96e-01

Pos logit contributions
 dear+0.074
 sorry+0.074
 darling+0.071
 unfair+0.069
 okay+0.069
Neg logit contributions
|-0.089
growth-0.087
 land-0.086
 camp-0.083
ground-0.081
TokenG
Token ID38
Feature activation+1.28e+00

Pos logit contributions
|+0.008
growth+0.008
 land+0.007
 camp+0.007
ground+0.007
Neg logit contributions
 dear-0.010
 sorry-0.010
 darling-0.009
 okay-0.009
 ok-0.009
Tokenlad
Token ID9435
Feature activation+1.01e+01

Pos logit contributions
 sorry+0.027
 dear+0.027
 darling+0.026
 okay+0.026
 ok+0.025
Neg logit contributions
|-0.030
growth-0.029
 land-0.028
 camp-0.028
 forests-0.026
Token you
Token ID345
Feature activation+3.05e+00

Pos logit contributions
 sorry+0.292
 concerned+0.285
 ashamed+0.279
 guilty+0.264
 unfair+0.262
Neg logit contributions
of-0.151
 the-0.147
 land-0.145
|-0.142
 visit-0.127
Token like
Token ID588
Feature activation-7.24e-02

Pos logit contributions
 sorry+0.075
 dear+0.073
 darling+0.071
 okay+0.071
 ashamed+0.068
Neg logit contributions
|-0.064
growth-0.062
 land-0.057
 camp-0.056
ground-0.055
Token it
Token ID340
Feature activation+1.37e+00

Pos logit contributions
|+0.001
growth+0.001
 land+0.001
 camp+0.001
ground+0.001
Neg logit contributions
 sorry-0.002
 dear-0.002
 darling-0.002
 okay-0.002
 ashamed-0.002
Token!"
Token ID2474
Feature activation-5.03e-01

Pos logit contributions
 sorry+0.037
 dear+0.036
 darling+0.035
 okay+0.034
 ashamed+0.034
Neg logit contributions
|-0.025
growth-0.024
 land-0.023
 camp-0.022
ground-0.021
Token
Token ID198
Feature activation-6.42e+00

Pos logit contributions
|+0.011
growth+0.010
 land+0.010
 camp+0.009
ground+0.009
Neg logit contributions
 sorry-0.012
 dear-0.012
 darling-0.012
 okay-0.011
 ashamed-0.011
TokenMom
Token ID29252
Feature activation-6.75e-01

Pos logit contributions
|+0.079
growth+0.077
 land+0.073
 camp+0.069
 forests+0.066
Neg logit contributions
 sorry-0.122
 dear-0.119
 okay-0.118
 darling-0.115
 ok-0.113
Token said
Token ID531
Feature activation+3.88e+00

Pos logit contributions
|+0.015
growth+0.015
 land+0.013
 camp+0.013
ground+0.013
Neg logit contributions
 dear-0.016
 sorry-0.016
 darling-0.016
 okay-0.015
 ok-0.015
Token,
Token ID11
Feature activation+4.60e+00

Pos logit contributions
 unfair+0.125
 dear+0.123
 darling+0.123
ower+0.122
 ashamed+0.121
Neg logit contributions
 the-0.038
 land-0.026
|-0.026
of-0.026
 camp-0.026
Token "
Token ID366
Feature activation+2.14e-01

Pos logit contributions
 dear+0.104
 unfair+0.103
 darling+0.102
 sorry+0.101
 fault+0.099
Neg logit contributions
 the-0.084
ground-0.084
growth-0.083
|-0.083
 land-0.081
TokenG
Token ID38
Feature activation+1.74e+00

Pos logit contributions
 sorry+0.006
 dear+0.005
 darling+0.005
 okay+0.005
 ashamed+0.005
Neg logit contributions
|-0.004
growth-0.004
 land-0.004
 camp-0.004
ground-0.004
Tokenlad
Token ID9435
Feature activation+1.01e+01

Pos logit contributions
 sorry+0.036
 dear+0.034
 darling+0.033
 okay+0.032
 ashamed+0.032
Neg logit contributions
|-0.044
growth-0.042
 land-0.040
 camp-0.039
ground-0.038
Token to
Token ID284
Feature activation-4.84e+00

Pos logit contributions
 sorry+0.292
 concerned+0.290
 ashamed+0.275
 stressed+0.264
ower+0.264
Neg logit contributions
 the-0.144
of-0.136
|-0.128
 land-0.127
ground-0.116
Token see
Token ID766
Feature activation-1.34e+00

Pos logit contributions
growth+0.087
|+0.085
 valleys+0.083
 forests+0.082
ground+0.075
Neg logit contributions
 okay-0.136
 sorry-0.134
 dear-0.130
 ok-0.129
 darling-0.126
Token you
Token ID345
Feature activation+1.07e+00

Pos logit contributions
|+0.027
growth+0.026
 land+0.024
 camp+0.023
ground+0.022
Neg logit contributions
 sorry-0.035
 dear-0.034
 darling-0.033
 okay-0.033
 ashamed-0.032
Token both
Token ID1111
Feature activation-9.09e-01

Pos logit contributions
 sorry+0.027
 dear+0.026
 okay+0.026
 darling+0.026
 ok+0.025
Neg logit contributions
|-0.022
growth-0.022
 land-0.019
 camp-0.018
 forests-0.018
Token made
Token ID925
Feature activation-1.54e+00

Pos logit contributions
growth+0.018
|+0.018
of+0.015
 valleys+0.014
 forests+0.014
Neg logit contributions
 dear-0.024
 sorry-0.024
 darling-0.023
 okay-0.023
 ashamed-0.022
Token Tim
Token ID5045
Feature activation-5.61e-01

Pos logit contributions
 sorry+0.064
 dear+0.062
 okay+0.061
 darling+0.060
 ok+0.057
Neg logit contributions
growth-0.066
|-0.066
 land-0.063
 forests-0.059
 valleys-0.059
Token.
Token ID13
Feature activation+9.84e-01

Pos logit contributions
growth+0.014
ground+0.013
|+0.013
 valleys+0.013
 camp+0.012
Neg logit contributions
 okay-0.011
 sorry-0.011
 ashamed-0.011
 dear-0.010
 ok-0.010
Token Tim
Token ID5045
Feature activation-3.11e-01

Pos logit contributions
 sorry+0.024
 dear+0.023
 darling+0.022
 okay+0.022
 ashamed+0.021
Neg logit contributions
|-0.021
growth-0.020
 land-0.019
 camp-0.019
ground-0.018
Token had
Token ID550
Feature activation+2.56e-01

Pos logit contributions
growth+0.006
|+0.006
 camp+0.005
ground+0.005
 land+0.005
Neg logit contributions
 sorry-0.008
 dear-0.008
 okay-0.007
 darling-0.007
 ashamed-0.007
Token a
Token ID257
Feature activation+1.22e+00

Pos logit contributions
 sorry+0.008
 dear+0.008
 okay+0.008
 darling+0.008
 ashamed+0.008
Neg logit contributions
|-0.004
growth-0.003
 land-0.003
 camp-0.003
ground-0.003
Token sc
Token ID629
Feature activation+1.01e+01

Pos logit contributions
rier+0.029
 darling+0.028
 dear+0.028
ouncing+0.028
 sorry+0.028
Neg logit contributions
 land-0.024
 camp-0.023
|-0.022
 hunt-0.021
 visit-0.021
Tokenooter
Token ID25141
Feature activation+2.12e+00

Pos logit contributions
 unfair+0.165
 ok+0.157
 darling+0.155
 okay+0.154
 sorry+0.153
Neg logit contributions
|-0.261
 land-0.253
of-0.247
e-0.246
 hunt-0.238
Token that
Token ID326
Feature activation+1.70e-01

Pos logit contributions
 sorry+0.060
 dear+0.058
 darling+0.058
 okay+0.057
 ok+0.056
Neg logit contributions
|-0.034
 land-0.034
growth-0.032
 camp-0.030
 hunt-0.029
Token he
Token ID339
Feature activation+1.57e-01

Pos logit contributions
 sorry+0.004
 dear+0.004
 darling+0.004
 okay+0.004
 ashamed+0.004
Neg logit contributions
|-0.003
growth-0.003
 land-0.003
 camp-0.003
ground-0.003
Token loved
Token ID6151
Feature activation+2.23e+00

Pos logit contributions
 sorry+0.004
 dear+0.004
 darling+0.004
 okay+0.004
 ashamed+0.003
Neg logit contributions
|-0.003
growth-0.003
 land-0.003
 camp-0.003
ground-0.003
Token to
Token ID284
Feature activation-5.55e+00

Pos logit contributions
 sorry+0.058
 dear+0.057
 okay+0.054
 darling+0.054
 ashamed+0.054
Neg logit contributions
|-0.044
growth-0.042
 land-0.038
 camp-0.036
ground-0.036
Token would
Token ID561
Feature activation-5.71e-01

Pos logit contributions
|+0.013
growth+0.013
ground+0.011
 camp+0.011
 land+0.011
Neg logit contributions
 dear-0.015
 sorry-0.015
 ashamed-0.015
 okay-0.015
 darling-0.015
Token<|endoftext|>
Token ID50256
Feature activation-6.38e+00

Pos logit contributions
|+0.012
growth+0.011
ground+0.010
 valleys+0.010
 forests+0.010
Neg logit contributions
 sorry-0.014
 dear-0.013
 okay-0.013
 darling-0.013
 ok-0.013
TokenJohn
Token ID7554
Feature activation-3.34e+00

Pos logit contributions
growth+0.106
 hunt+0.100
 outer+0.099
 land+0.099
 miles+0.098
Neg logit contributions
 sorry-0.148
 okay-0.147
 OK-0.133
 ashamed-0.132
 guilty-0.132
Token b
Token ID275
Feature activation+2.87e-01

Pos logit contributions
|+0.075
growth+0.071
 land+0.065
 camp+0.065
ground+0.064
Neg logit contributions
 dear-0.076
 sorry-0.075
 darling-0.073
 okay-0.071
 ashamed-0.069
Tokeniked
Token ID17951
Feature activation-4.00e+00

Pos logit contributions
 dear+0.006
 darling+0.006
 sorry+0.006
 okay+0.005
ower+0.005
Neg logit contributions
growth-0.007
|-0.007
 land-0.006
 camp-0.006
 hunt-0.006
Token to
Token ID284
Feature activation-1.04e+01

Pos logit contributions
|+0.098
growth+0.096
e+0.086
 camp+0.082
 land+0.082
Neg logit contributions
 sorry-0.078
 darling-0.076
 okay-0.075
 dear-0.074
 concerned-0.071
Token the
Token ID262
Feature activation-4.00e+00

Pos logit contributions
|+0.197
growth+0.174
r+0.170
e+0.168
of+0.161
Neg logit contributions
 sorry-0.234
 dear-0.231
 darling-0.231
 okay-0.220
 daughter-0.220
Token park
Token ID3952
Feature activation-2.79e+00

Pos logit contributions
|+0.072
growth+0.066
e+0.062
r+0.057
of+0.055
Neg logit contributions
 dear-0.110
 sorry-0.106
 darling-0.106
 unfair-0.103
 important-0.100
Token with
Token ID351
Feature activation-6.22e+00

Pos logit contributions
growth+0.065
|+0.065
 camp+0.052
e+0.052
 valleys+0.051
Neg logit contributions
 sorry-0.059
 okay-0.057
 dear-0.057
 darling-0.055
 ashamed-0.054
Token his
Token ID465
Feature activation+2.52e-02

Pos logit contributions
|+0.133
growth+0.105
ground+0.103
 land+0.093
 enclosure+0.092
Neg logit contributions
 sorry-0.168
 dear-0.163
 okay-0.162
 darling-0.156
 ok-0.153
Token friends
Token ID2460
Feature activation-2.74e+00

Pos logit contributions
 sorry+0.000
 dear+0.000
 darling+0.000
 okay+0.000
 ashamed+0.000
Neg logit contributions
|-0.001
growth-0.001
 land-0.001
ground-0.001
 camp-0.001
TokenThe
Token ID464
Feature activation-3.20e+00

Pos logit contributions
|+0.013
growth+0.013
 land+0.011
 camp+0.010
 forests+0.010
Neg logit contributions
 sorry-0.029
 dear-0.029
 darling-0.028
 okay-0.028
 ashamed-0.028
Token naughty
Token ID45128
Feature activation+3.96e+00

Pos logit contributions
|+0.070
growth+0.066
 land+0.059
ground+0.058
 camp+0.057
Neg logit contributions
 sorry-0.079
 dear-0.077
 darling-0.074
 okay-0.072
 ashamed-0.071
Token boy
Token ID2933
Feature activation-2.35e-01

Pos logit contributions
 sorry+0.083
 dear+0.079
 okay+0.078
 darling+0.077
 ok+0.074
Neg logit contributions
|-0.092
growth-0.090
 land-0.090
 camp-0.086
 forests-0.081
Token slowly
Token ID6364
Feature activation+3.57e-01

Pos logit contributions
|+0.004
growth+0.004
 land+0.004
 camp+0.004
ground+0.003
Neg logit contributions
 sorry-0.007
 dear-0.006
 darling-0.006
 okay-0.006
 ashamed-0.006
Token walked
Token ID6807
Feature activation-2.25e+00

Pos logit contributions
 sorry+0.010
 ashamed+0.009
 dear+0.009
 okay+0.009
 concerned+0.009
Neg logit contributions
|-0.007
growth-0.006
 camp-0.006
ground-0.006
 land-0.006
Token to
Token ID284
Feature activation-1.01e+01

Pos logit contributions
|+0.043
growth+0.042
 land+0.038
 camp+0.036
e+0.035
Neg logit contributions
 sorry-0.060
 okay-0.057
 dear-0.056
 darling-0.055
 ashamed-0.055
Token his
Token ID465
Feature activation-8.59e-01

Pos logit contributions
|+0.173
growth+0.147
 valleys+0.140
 forests+0.139
of+0.135
Neg logit contributions
 sorry-0.301
 okay-0.275
 dear-0.269
 darling-0.267
 ok-0.264
Token bedroom
Token ID14043
Feature activation-1.53e+00

Pos logit contributions
|+0.023
growth+0.022
 land+0.021
ground+0.020
 camp+0.020
Neg logit contributions
 sorry-0.017
 dear-0.016
 darling-0.016
 okay-0.015
 ashamed-0.015
Token,
Token ID11
Feature activation+3.59e+00

Pos logit contributions
|+0.032
growth+0.030
 land+0.028
 camp+0.027
e+0.026
Neg logit contributions
 sorry-0.038
 dear-0.037
 darling-0.036
 okay-0.036
 ashamed-0.035
Token but
Token ID475
Feature activation+1.84e+00

Pos logit contributions
 sorry+0.082
 dear+0.080
 darling+0.077
 okay+0.077
 ashamed+0.075
Neg logit contributions
|-0.081
growth-0.078
 land-0.074
 camp-0.071
ground-0.069
Token he
Token ID339
Feature activation+7.55e-01

Pos logit contributions
 sorry+0.045
 okay+0.043
 dear+0.042
 ok+0.041
 darling+0.041
Neg logit contributions
|-0.042
growth-0.039
 land-0.035
 camp-0.035
ground-0.034
Token to
Token ID284
Feature activation-6.66e+00

Pos logit contributions
 sorry+0.025
 dear+0.025
 darling+0.024
 okay+0.024
 ashamed+0.023
Neg logit contributions
|-0.014
growth-0.013
 land-0.013
 camp-0.012
 forests-0.011
Token play
Token ID711
Feature activation-6.40e-02

Pos logit contributions
|+0.085
growth+0.072
 land+0.065
 camp+0.064
ground+0.064
Neg logit contributions
 sorry-0.219
 dear-0.214
 darling-0.208
 okay-0.203
 ashamed-0.201
Token on
Token ID319
Feature activation-7.05e+00

Pos logit contributions
|+0.001
growth+0.001
 valleys+0.001
 land+0.001
 forests+0.001
Neg logit contributions
 sorry-0.002
 dear-0.002
 okay-0.002
 darling-0.002
 ashamed-0.002
Token the
Token ID262
Feature activation-4.57e+00

Pos logit contributions
|+0.116
of+0.093
growth+0.093
 enclosure+0.086
 distance+0.085
Neg logit contributions
 sorry-0.188
 dear-0.186
 darling-0.180
 unfair-0.175
 important-0.169
Token swing
Token ID9628
Feature activation-2.39e+00

Pos logit contributions
|+0.080
growth+0.076
 land+0.066
 camp+0.064
e+0.063
Neg logit contributions
 sorry-0.126
 dear-0.125
 darling-0.122
 okay-0.117
 unfair-0.117
Token at
Token ID379
Feature activation-1.01e+01

Pos logit contributions
growth+0.054
|+0.053
 camp+0.044
 valleys+0.043
 outer+0.043
Neg logit contributions
 sorry-0.054
 okay-0.052
 dear-0.052
 ashamed-0.049
 darling-0.048
Token the
Token ID262
Feature activation-3.27e+00

Pos logit contributions
|+0.176
growth+0.174
 trek+0.165
fire+0.150
 miles+0.144
Neg logit contributions
 sorry-0.243
 dear-0.220
 ashamed-0.216
 impatient-0.215
 important-0.212
Token park
Token ID3952
Feature activation-2.61e+00

Pos logit contributions
|+0.050
growth+0.047
e+0.037
of+0.035
ground+0.034
Neg logit contributions
 dear-0.100
 sorry-0.099
 darling-0.096
 unfair-0.095
 ashamed-0.093
Token.
Token ID13
Feature activation+5.35e-01

Pos logit contributions
|+0.056
growth+0.055
 camp+0.047
 land+0.044
ground+0.044
Neg logit contributions
 sorry-0.064
 dear-0.061
 okay-0.060
 ashamed-0.058
 darling-0.058
Token She
Token ID1375
Feature activation-2.38e+00

Pos logit contributions
 sorry+0.014
 dear+0.014
 darling+0.013
 ashamed+0.013
 okay+0.013
Neg logit contributions
|-0.010
growth-0.010
 land-0.009
 camp-0.009
 forests-0.009
Token wanted
Token ID2227
Feature activation+2.34e+00

Pos logit contributions
growth+0.045
|+0.045
 camp+0.039
ground+0.038
 land+0.038
Neg logit contributions
 sorry-0.062
 dear-0.061
 darling-0.059
 okay-0.058
 ashamed-0.058
Token
Token ID198
Feature activation-6.36e+00

Pos logit contributions
 sorry+0.035
 dear+0.034
 okay+0.034
 darling+0.033
 ok+0.033
Neg logit contributions
|-0.015
growth-0.015
 land-0.014
 camp-0.013
 hunt-0.012
Token
Token ID198
Feature activation-5.17e+00

Pos logit contributions
growth+0.118
|+0.111
 land+0.106
 camp+0.105
 hunt+0.099
Neg logit contributions
 okay-0.162
 sorry-0.158
 darling-0.156
 dear-0.155
 ok-0.150
TokenS
Token ID50
Feature activation+6.57e-01

Pos logit contributions
|+0.090
growth+0.085
 land+0.083
 camp+0.076
ground+0.074
Neg logit contributions
 sorry-0.145
 dear-0.141
 darling-0.136
 okay-0.134
 ashamed-0.133
Tokenally
Token ID453
Feature activation-9.28e-01

Pos logit contributions
 dear+0.012
 sorry+0.012
 darling+0.012
 okay+0.011
 ok+0.011
Neg logit contributions
|-0.017
growth-0.017
 land-0.016
 camp-0.016
ground-0.016
Token climbed
Token ID19952
Feature activation-4.82e+00

Pos logit contributions
|+0.016
growth+0.016
 land+0.014
 camp+0.014
ground+0.014
Neg logit contributions
 sorry-0.025
 dear-0.025
 darling-0.024
 ashamed-0.024
 okay-0.024
Token to
Token ID284
Feature activation-1.01e+01

Pos logit contributions
|+0.108
growth+0.104
 land+0.096
 camp+0.094
ground+0.091
Neg logit contributions
 sorry-0.111
 dear-0.106
 okay-0.103
 darling-0.103
 ashamed-0.100
Token the
Token ID262
Feature activation-2.83e+00

Pos logit contributions
|+0.175
growth+0.152
ground+0.141
 forests+0.139
 valleys+0.137
Neg logit contributions
 sorry-0.266
 okay-0.262
 darling-0.260
 dear-0.257
 ashamed-0.243
Token top
Token ID1353
Feature activation-6.06e-01

Pos logit contributions
|+0.049
growth+0.046
e+0.038
ground+0.037
 camp+0.036
Neg logit contributions
 sorry-0.084
 dear-0.081
 darling-0.078
 ashamed-0.078
 okay-0.076
Token of
Token ID286
Feature activation-7.93e+00

Pos logit contributions
|+0.013
growth+0.013
 camp+0.011
 land+0.011
ground+0.010
Neg logit contributions
 dear-0.014
 sorry-0.014
 darling-0.014
 okay-0.013
 ashamed-0.013
Token the
Token ID262
Feature activation-2.47e+00

Pos logit contributions
|+0.124
growth+0.101
ground+0.086
 camp+0.084
of+0.080
Neg logit contributions
 sorry-0.256
 dear-0.243
 okay-0.241
 darling-0.235
 ashamed-0.235
Token big
Token ID1263
Feature activation-1.30e+00

Pos logit contributions
|+0.041
growth+0.040
e+0.033
ground+0.032
 camp+0.031
Neg logit contributions
 sorry-0.071
 dear-0.070
 darling-0.068
 ashamed-0.067
 okay-0.066
Token
Token ID198
Feature activation-4.05e+00

Pos logit contributions
growth+0.068
|+0.064
 land+0.064
 camp+0.062
 hunt+0.060
Neg logit contributions
 sorry-0.128
 okay-0.122
 dear-0.120
 darling-0.120
 ok-0.117
TokenThe
Token ID464
Feature activation-1.26e+00

Pos logit contributions
growth+0.071
|+0.069
 land+0.066
 camp+0.064
 hunt+0.061
Neg logit contributions
 sorry-0.107
 dear-0.104
 okay-0.103
 darling-0.103
 ok-0.100
Token boy
Token ID2933
Feature activation+1.47e-01

Pos logit contributions
|+0.027
growth+0.026
 land+0.025
 camp+0.024
ground+0.024
Neg logit contributions
 sorry-0.029
 dear-0.028
 darling-0.027
 okay-0.027
 ashamed-0.027
Token cautiously
Token ID39640
Feature activation-8.71e-02

Pos logit contributions
 sorry+0.004
 dear+0.004
 darling+0.004
 ashamed+0.004
 okay+0.004
Neg logit contributions
|-0.002
growth-0.002
 land-0.002
 camp-0.002
ground-0.002
Token walked
Token ID6807
Feature activation-2.42e+00

Pos logit contributions
|+0.002
growth+0.002
 outer+0.001
 camp+0.001
 forests+0.001
Neg logit contributions
 sorry-0.002
 ashamed-0.002
 concerned-0.002
 guilty-0.002
 dear-0.002
Token to
Token ID284
Feature activation-1.01e+01

Pos logit contributions
|+0.046
growth+0.044
 land+0.041
 camp+0.038
e+0.037
Neg logit contributions
 sorry-0.064
 dear-0.061
 okay-0.061
 darling-0.060
 ok-0.059
Token the
Token ID262
Feature activation-3.55e+00

Pos logit contributions
|+0.174
growth+0.146
st+0.142
r+0.142
 valleys+0.140
Neg logit contributions
 sorry-0.270
 dear-0.250
 darling-0.250
 okay-0.241
 son-0.239
Token hive
Token ID35881
Feature activation-5.56e+00

Pos logit contributions
|+0.072
growth+0.066
e+0.060
of+0.059
 land+0.056
Neg logit contributions
 sorry-0.093
 dear-0.091
 darling-0.088
 ashamed-0.085
 okay-0.083
Token and
Token ID290
Feature activation-1.99e+00

Pos logit contributions
|+0.125
growth+0.119
 land+0.105
 camp+0.104
 forests+0.099
Neg logit contributions
 sorry-0.132
 dear-0.128
 okay-0.125
 darling-0.124
 ashamed-0.121
Token asked
Token ID1965
Feature activation-2.57e+00

Pos logit contributions
|+0.045
growth+0.042
 camp+0.035
ground+0.034
e+0.034
Neg logit contributions
 sorry-0.048
 ashamed-0.047
 dear-0.046
 okay-0.046
 concerned-0.045
Token in
Token ID287
Feature activation-7.44e+00

Pos logit contributions
|+0.051
growth+0.047
 land+0.044
 camp+0.041
ground+0.040
Neg logit contributions
 sorry-0.071
 dear-0.068
 okay-0.066
 darling-0.066
 ok-0.064
Tokenmy
Token ID1820
Feature activation+4.65e-01

Pos logit contributions
|+0.015
growth+0.014
 land+0.013
 camp+0.013
ground+0.012
Neg logit contributions
 sorry-0.023
 dear-0.022
 darling-0.022
 okay-0.021
 ashamed-0.021
Token and
Token ID290
Feature activation-1.23e+00

Pos logit contributions
 sorry+0.013
 dear+0.013
 darling+0.013
 okay+0.013
 ashamed+0.012
Neg logit contributions
|-0.007
growth-0.007
 land-0.007
 camp-0.006
ground-0.006
Token Daddy
Token ID26926
Feature activation-3.26e-01

Pos logit contributions
|+0.028
 the+0.027
 land+0.027
 camp+0.027
growth+0.026
Neg logit contributions
 sorry-0.025
 dear-0.025
rier-0.024
 darling-0.024
ushing-0.023
Token took
Token ID1718
Feature activation-2.70e+00

Pos logit contributions
growth+0.005
|+0.005
 land+0.005
 forests+0.004
 camp+0.004
Neg logit contributions
 sorry-0.009
 dear-0.009
 darling-0.009
 okay-0.009
 ashamed-0.009
Token baby
Token ID5156
Feature activation-5.71e-01

Pos logit contributions
|+0.051
growth+0.048
 land+0.046
 camp+0.043
 forests+0.042
Neg logit contributions
 sorry-0.071
 dear-0.070
 darling-0.068
 okay-0.067
 ashamed-0.065
Token to
Token ID284
Feature activation-1.00e+01

Pos logit contributions
|+0.013
growth+0.012
 land+0.012
 camp+0.012
ground+0.011
Neg logit contributions
 sorry-0.013
 dear-0.013
 darling-0.012
 okay-0.012
 ashamed-0.012
Token the
Token ID262
Feature activation-3.48e+00

Pos logit contributions
|+0.157
growth+0.150
of+0.144
 valleys+0.141
 forests+0.134
Neg logit contributions
 sorry-0.298
 dear-0.287
 okay-0.284
 darling-0.280
 ok-0.267
Token store
Token ID3650
Feature activation-1.87e+00

Pos logit contributions
|+0.059
growth+0.055
e+0.048
of+0.047
r+0.045
Neg logit contributions
 sorry-0.101
 dear-0.101
 darling-0.097
 unfair-0.096
 okay-0.093
Token.
Token ID13
Feature activation+5.64e-01

Pos logit contributions
|+0.027
growth+0.025
 land+0.024
 camp+0.023
ground+0.022
Neg logit contributions
 sorry-0.057
 dear-0.056
 darling-0.055
 okay-0.054
 ashamed-0.053
Token They
Token ID1119
Feature activation-2.15e+00

Pos logit contributions
 sorry+0.015
 dear+0.014
 okay+0.014
 darling+0.014
 ashamed+0.013
Neg logit contributions
|-0.011
growth-0.011
 land-0.010
 camp-0.010
ground-0.010
Token wanted
Token ID2227
Feature activation+2.26e+00

Pos logit contributions
growth+0.040
|+0.039
 land+0.033
 camp+0.033
 forests+0.032
Neg logit contributions
 sorry-0.058
 dear-0.057
 darling-0.054
 okay-0.054
 ashamed-0.054
Token to
Token ID284
Feature activation-7.41e+00

Pos logit contributions
 sorry+0.061
 dear+0.060
 okay+0.058
 darling+0.058
 ashamed+0.056
Neg logit contributions
|-0.046
growth-0.043
 land-0.039
ground-0.038
 camp-0.037
Token go
Token ID467
Feature activation-1.55e+00

Pos logit contributions
|+0.128
 valleys+0.118
growth+0.113
 forests+0.112
 rivers+0.103
Neg logit contributions
 sorry-0.212
 dear-0.208
 darling-0.200
 okay-0.197
 ok-0.189
Token on
Token ID319
Feature activation-7.11e+00

Pos logit contributions
growth+0.032
|+0.032
 land+0.028
 valleys+0.027
ground+0.027
Neg logit contributions
 sorry-0.038
 dear-0.036
 okay-0.036
 darling-0.035
 ashamed-0.034
Token the
Token ID262
Feature activation-3.01e+00

Pos logit contributions
|+0.133
growth+0.125
of+0.115
 land+0.114
ground+0.112
Neg logit contributions
 sorry-0.185
 dear-0.179
 okay-0.177
 unfair-0.174
 darling-0.173
Token slides
Token ID19392
Feature activation-1.18e+00

Pos logit contributions
|+0.054
growth+0.052
 land+0.045
e+0.044
 valleys+0.043
Neg logit contributions
 sorry-0.082
 dear-0.082
 darling-0.079
 okay-0.077
 unfair-0.076
Token at
Token ID379
Feature activation-1.00e+01

Pos logit contributions
|+0.021
growth+0.021
 land+0.018
 camp+0.017
ground+0.017
Neg logit contributions
 sorry-0.032
 dear-0.032
 okay-0.031
 darling-0.031
 ashamed-0.030
Token the
Token ID262
Feature activation-2.69e+00

Pos logit contributions
growth+0.180
|+0.164
 trek+0.162
fire+0.156
ground+0.155
Neg logit contributions
 sorry-0.245
 dear-0.227
 fault-0.224
 important-0.222
 ashamed-0.222
Token park
Token ID3952
Feature activation-2.17e+00

Pos logit contributions
growth+0.043
|+0.042
e+0.034
r+0.032
of+0.031
Neg logit contributions
 dear-0.077
 sorry-0.077
 unfair-0.076
 darling-0.074
 important-0.072
Token.
Token ID13
Feature activation-3.68e-02

Pos logit contributions
|+0.039
growth+0.039
 land+0.033
 camp+0.033
ground+0.032
Neg logit contributions
 sorry-0.059
 dear-0.058
 okay-0.056
 darling-0.056
 ashamed-0.055
Token She
Token ID1375
Feature activation-1.93e+00

Pos logit contributions
|+0.001
growth+0.001
 land+0.001
 camp+0.001
ground+0.001
Neg logit contributions
 sorry-0.001
 dear-0.001
 darling-0.001
 okay-0.001
 ashamed-0.001
Token asked
Token ID1965
Feature activation-2.73e+00

Pos logit contributions
growth+0.040
|+0.039
ground+0.034
 camp+0.033
 land+0.033
Neg logit contributions
 sorry-0.047
 dear-0.047
 ashamed-0.045
 darling-0.045
 okay-0.045
Token her
Token ID607
Feature activation+1.64e+00

Pos logit contributions
|+0.024
growth+0.023
 land+0.022
 camp+0.021
ground+0.021
Neg logit contributions
 sorry-0.026
 dear-0.025
 darling-0.024
 okay-0.024
 ashamed-0.023
Token mom
Token ID1995
Feature activation-2.39e+00

Pos logit contributions
 sorry+0.031
 dear+0.029
 okay+0.029
 darling+0.028
 ashamed+0.028
Neg logit contributions
|-0.041
growth-0.040
 land-0.040
 camp-0.039
 forests-0.037
Tokenmy
Token ID1820
Feature activation-1.49e+00

Pos logit contributions
growth+0.049
|+0.049
 land+0.043
ground+0.042
 camp+0.041
Neg logit contributions
 dear-0.059
 sorry-0.059
 darling-0.057
 okay-0.057
 ok-0.056
Token took
Token ID1718
Feature activation-3.44e+00

Pos logit contributions
|+0.030
growth+0.029
 land+0.026
ground+0.025
 camp+0.025
Neg logit contributions
 dear-0.038
 sorry-0.038
 darling-0.037
 okay-0.037
 ok-0.036
Token all
Token ID477
Feature activation-4.85e+00

Pos logit contributions
|+0.057
growth+0.052
 land+0.049
 camp+0.046
ground+0.046
Neg logit contributions
 sorry-0.102
 dear-0.099
 okay-0.096
 darling-0.096
 ok-0.093
Token of
Token ID286
Feature activation-1.00e+01

Pos logit contributions
growth+0.074
|+0.070
 hunt+0.066
 land+0.062
ground+0.058
Neg logit contributions
 sorry-0.142
 dear-0.138
 darling-0.137
 okay-0.133
 ashamed-0.131
Token the
Token ID262
Feature activation-4.25e+00

Pos logit contributions
|+0.194
growth+0.173
 land+0.165
 hunt+0.161
ground+0.161
Neg logit contributions
 sorry-0.267
 dear-0.261
 darling-0.254
 okay-0.245
 ok-0.232
Token pills
Token ID19521
Feature activation-2.53e+00

Pos logit contributions
|+0.094
growth+0.090
 land+0.088
 camp+0.084
 forests+0.081
Neg logit contributions
 sorry-0.098
 dear-0.095
 darling-0.092
 okay-0.091
 ashamed-0.089
Token.
Token ID13
Feature activation+5.66e-01

Pos logit contributions
|+0.046
growth+0.044
 land+0.041
 camp+0.039
 forests+0.037
Neg logit contributions
 sorry-0.069
 dear-0.067
 darling-0.065
 okay-0.065
 ashamed-0.063
Token They
Token ID1119
Feature activation-2.21e+00

Pos logit contributions
 sorry+0.014
 dear+0.014
 darling+0.013
 okay+0.013
 ashamed+0.013
Neg logit contributions
|-0.011
growth-0.011
 land-0.010
 camp-0.010
ground-0.010
Token went
Token ID1816
Feature activation-2.28e-01

Pos logit contributions
|+0.046
growth+0.044
ground+0.040
 forests+0.037
 camp+0.037
Neg logit contributions
 sorry-0.054
 ashamed-0.053
 dear-0.053
 okay-0.050
 darling-0.050
Tokenama
Token ID1689
Feature activation-1.47e+00

Pos logit contributions
|+0.015
growth+0.015
 land+0.014
 valleys+0.013
 forests+0.013
Neg logit contributions
 sorry-0.010
 dear-0.010
 darling-0.009
 okay-0.009
 ashamed-0.008
Token and
Token ID290
Feature activation-2.30e+00

Pos logit contributions
|+0.029
growth+0.028
 land+0.025
ground+0.025
 camp+0.024
Neg logit contributions
 sorry-0.037
 dear-0.037
 darling-0.035
 ashamed-0.034
 okay-0.034
Token Papa
Token ID42328
Feature activation-8.49e-01

Pos logit contributions
|+0.048
growth+0.046
 land+0.042
ground+0.041
 camp+0.040
Neg logit contributions
 sorry-0.059
 dear-0.056
 okay-0.055
 darling-0.054
 ashamed-0.053
Token took
Token ID1718
Feature activation-2.76e+00

Pos logit contributions
growth+0.014
|+0.013
 miles+0.011
 forests+0.011
 outer+0.011
Neg logit contributions
 sorry-0.025
 dear-0.024
 okay-0.023
 darling-0.023
 ashamed-0.023
Token baby
Token ID5156
Feature activation-6.20e-01

Pos logit contributions
|+0.047
growth+0.044
 land+0.042
 camp+0.039
 forests+0.038
Neg logit contributions
 sorry-0.078
 dear-0.076
 darling-0.074
 okay-0.073
 ok-0.071
Token to
Token ID284
Feature activation-9.98e+00

Pos logit contributions
|+0.013
growth+0.013
 land+0.012
 camp+0.012
ground+0.011
Neg logit contributions
 sorry-0.015
 dear-0.014
 darling-0.014
 okay-0.014
 ashamed-0.013
Token the
Token ID262
Feature activation-3.69e+00

Pos logit contributions
|+0.151
growth+0.146
of+0.137
 valleys+0.137
 forests+0.128
Neg logit contributions
 sorry-0.303
 dear-0.292
 darling-0.285
 okay-0.285
 ok-0.268
Token supermarket
Token ID26454
Feature activation-1.19e+00

Pos logit contributions
|+0.064
growth+0.060
e+0.052
of+0.051
r+0.049
Neg logit contributions
 sorry-0.107
 dear-0.107
 darling-0.102
 unfair-0.099
 ashamed-0.097
Token.
Token ID13
Feature activation+2.99e-01

Pos logit contributions
|+0.016
 land+0.015
growth+0.015
 camp+0.014
ground+0.014
Neg logit contributions
 sorry-0.037
 dear-0.036
 darling-0.035
 okay-0.034
 ashamed-0.034
Token Mama
Token ID40084
Feature activation+5.72e-01

Pos logit contributions
 sorry+0.007
 dear+0.007
 darling+0.007
 okay+0.007
 ashamed+0.007
Neg logit contributions
|-0.006
growth-0.006
 land-0.006
 camp-0.005
ground-0.005
Token wanted
Token ID2227
Feature activation+1.95e+00

Pos logit contributions
 sorry+0.018
 dear+0.017
 darling+0.017
 okay+0.017
 ashamed+0.016
Neg logit contributions
|-0.008
growth-0.008
 land-0.007
 camp-0.007
ground-0.006
Token ring
Token ID5858
Feature activation-1.61e-01

Pos logit contributions
|+0.003
growth+0.003
 land+0.003
 camp+0.003
ground+0.003
Neg logit contributions
 sorry-0.002
 dear-0.002
 darling-0.002
 okay-0.002
 ashamed-0.002
Token,
Token ID11
Feature activation+3.53e+00

Pos logit contributions
|+0.004
growth+0.003
 camp+0.003
ground+0.003
 land+0.003
Neg logit contributions
 sorry-0.004
 dear-0.004
 okay-0.004
 ashamed-0.004
 darling-0.003
Token so
Token ID523
Feature activation+3.20e+00

Pos logit contributions
 sorry+0.096
 dear+0.095
 darling+0.092
 okay+0.090
 ashamed+0.088
Neg logit contributions
|-0.062
growth-0.059
 land-0.058
 camp-0.054
 forests-0.052
Token he
Token ID339
Feature activation+4.53e-01

Pos logit contributions
 sorry+0.069
 dear+0.065
 ashamed+0.062
 okay+0.061
 darling+0.060
Neg logit contributions
|-0.085
growth-0.079
ground-0.074
e-0.071
 camp-0.071
Token climbed
Token ID19952
Feature activation-4.94e+00

Pos logit contributions
 sorry+0.012
 dear+0.012
 ashamed+0.012
 okay+0.012
 darling+0.011
Neg logit contributions
|-0.009
growth-0.008
ground-0.007
 camp-0.007
 land-0.007
Token to
Token ID284
Feature activation-9.97e+00

Pos logit contributions
|+0.115
growth+0.108
 land+0.099
 camp+0.097
ground+0.096
Neg logit contributions
 sorry-0.108
 okay-0.101
 dear-0.097
 ashamed-0.097
 darling-0.096
Token the
Token ID262
Feature activation-2.81e+00

Pos logit contributions
|+0.205
ide+0.182
ground+0.179
field+0.177
side+0.176
Neg logit contributions
 sorry-0.221
 okay-0.205
 dear-0.204
 ashamed-0.197
 darling-0.197
Token highest
Token ID4511
Feature activation-7.67e-01

Pos logit contributions
|+0.059
growth+0.048
ground+0.045
of+0.045
e+0.044
Neg logit contributions
 sorry-0.076
 dear-0.075
 ashamed-0.070
 darling-0.069
 guilty-0.067
Token branch
Token ID8478
Feature activation-2.76e+00

Pos logit contributions
|+0.016
growth+0.012
 traveled+0.012
isk+0.011
ground+0.011
Neg logit contributions
 dear-0.018
 sorry-0.018
 darling-0.017
rier-0.016
 ashamed-0.016
Token to
Token ID284
Feature activation-7.47e+00

Pos logit contributions
|+0.061
growth+0.058
 camp+0.052
 land+0.051
 valleys+0.049
Neg logit contributions
 sorry-0.063
 dear-0.062
 okay-0.061
 darling-0.059
 ashamed-0.059
Token see
Token ID766
Feature activation-2.50e+00

Pos logit contributions
|+0.165
ground+0.143
 valleys+0.136
 forests+0.135
 islands+0.133
Neg logit contributions
 sorry-0.163
 okay-0.150
 dear-0.150
 ok-0.137
 ashamed-0.137
Token's
Token ID338
Feature activation+3.83e+00

Pos logit contributions
 sorry+0.036
 dear+0.036
 darling+0.035
 ashamed+0.034
 okay+0.034
Neg logit contributions
|-0.028
growth-0.027
 camp-0.025
ground-0.025
 forests-0.024
Token hand
Token ID1021
Feature activation-3.06e-01

Pos logit contributions
 sorry+0.073
 dear+0.070
 okay+0.068
 darling+0.068
 ashamed+0.067
Neg logit contributions
|-0.090
 land-0.090
growth-0.088
 camp-0.087
 hunt-0.082
Token and
Token ID290
Feature activation-4.81e-01

Pos logit contributions
|+0.005
growth+0.005
 land+0.005
 camp+0.005
 forests+0.004
Neg logit contributions
 sorry-0.008
 dear-0.008
 darling-0.008
 okay-0.008
 ashamed-0.008
Token they
Token ID484
Feature activation-1.70e+00

Pos logit contributions
|+0.010
growth+0.010
 land+0.009
 camp+0.009
ground+0.009
Neg logit contributions
 sorry-0.012
 dear-0.012
 darling-0.011
 okay-0.011
 ashamed-0.011
Token walked
Token ID6807
Feature activation-2.27e+00

Pos logit contributions
growth+0.031
|+0.031
ground+0.028
 forests+0.027
 camp+0.027
Neg logit contributions
 sorry-0.047
 okay-0.044
 ashamed-0.044
 dear-0.044
 darling-0.042
Token to
Token ID284
Feature activation-9.95e+00

Pos logit contributions
growth+0.044
|+0.044
 land+0.039
 camp+0.037
e+0.036
Neg logit contributions
 sorry-0.059
 dear-0.056
 okay-0.056
 darling-0.055
 ashamed-0.054
Token the
Token ID262
Feature activation-3.88e+00

Pos logit contributions
|+0.166
growth+0.155
 valleys+0.149
 forests+0.147
r+0.138
Neg logit contributions
 sorry-0.279
 darling-0.258
 dear-0.258
 okay-0.252
 ashamed-0.241
Token oven
Token ID14361
Feature activation-3.32e+00

Pos logit contributions
|+0.081
growth+0.075
e+0.068
of+0.067
 land+0.063
Neg logit contributions
 sorry-0.101
 dear-0.098
 darling-0.094
 ashamed-0.091
 unfair-0.089
Token.
Token ID13
Feature activation+1.46e+00

Pos logit contributions
|+0.069
growth+0.067
 land+0.058
 camp+0.057
 forests+0.056
Neg logit contributions
 sorry-0.083
 dear-0.082
 darling-0.079
 okay-0.078
 ashamed-0.076
Token They
Token ID1119
Feature activation-2.41e+00

Pos logit contributions
 sorry+0.036
 dear+0.035
 darling+0.034
 okay+0.034
 ok+0.033
Neg logit contributions
|-0.030
growth-0.029
 land-0.027
 camp-0.026
ground-0.026
Token smelled
Token ID37609
Feature activation+2.44e+00

Pos logit contributions
|+0.050
growth+0.047
 outer+0.043
ground+0.042
 camp+0.041
Neg logit contributions
 ashamed-0.061
 sorry-0.061
 dear-0.060
 okay-0.057
 guilty-0.056
Token.
Token ID13
Feature activation+5.39e-01

Pos logit contributions
growth+0.084
|+0.084
 land+0.071
 camp+0.071
of+0.070
Neg logit contributions
 sorry-0.089
 dear-0.088
 okay-0.086
 darling-0.085
 ashamed-0.082
Token
Token ID198
Feature activation-5.51e+00

Pos logit contributions
 sorry+0.015
 dear+0.015
 darling+0.014
 okay+0.014
 ashamed+0.014
Neg logit contributions
|-0.009
growth-0.009
 land-0.008
 camp-0.008
ground-0.007
Token
Token ID198
Feature activation-4.43e+00

Pos logit contributions
growth+0.097
|+0.093
 land+0.089
 hunt+0.088
 camp+0.086
Neg logit contributions
 sorry-0.146
 okay-0.139
 dear-0.136
 darling-0.134
 concerned-0.132
TokenShe
Token ID3347
Feature activation-2.49e+00

Pos logit contributions
growth+0.092
|+0.092
 land+0.085
 camp+0.084
 hunt+0.081
Neg logit contributions
 sorry-0.105
 okay-0.100
 dear-0.100
 darling-0.098
 ok-0.096
Token walked
Token ID6807
Feature activation-1.71e+00

Pos logit contributions
|+0.047
growth+0.045
ground+0.040
 land+0.039
 camp+0.038
Neg logit contributions
 sorry-0.066
 dear-0.064
 ashamed-0.063
 darling-0.062
 okay-0.061
Token to
Token ID284
Feature activation-9.93e+00

Pos logit contributions
growth+0.036
|+0.035
 land+0.030
e+0.030
 camp+0.029
Neg logit contributions
 sorry-0.042
 okay-0.040
 dear-0.040
 darling-0.039
 concerned-0.039
Token her
Token ID607
Feature activation-7.87e-01

Pos logit contributions
|+0.194
 valleys+0.169
growth+0.168
st+0.164
 traveled+0.164
Neg logit contributions
 sorry-0.260
 dear-0.241
 okay-0.232
 darling-0.232
 daughter-0.223
Token classroom
Token ID15806
Feature activation-5.11e-01

Pos logit contributions
|+0.022
growth+0.021
 land+0.020
ground+0.019
 camp+0.019
Neg logit contributions
 sorry-0.014
 dear-0.014
 darling-0.013
 okay-0.013
 ashamed-0.013
Token.
Token ID13
Feature activation+2.19e+00

Pos logit contributions
|+0.010
growth+0.010
 land+0.009
 camp+0.009
ground+0.008
Neg logit contributions
 sorry-0.013
 dear-0.013
 darling-0.012
 okay-0.012
 ashamed-0.012
Token She
Token ID1375
Feature activation-2.10e+00

Pos logit contributions
 sorry+0.046
 dear+0.044
 darling+0.043
 okay+0.042
 ashamed+0.041
Neg logit contributions
|-0.053
growth-0.052
 land-0.049
 camp-0.048
ground-0.047
Token saw
Token ID2497
Feature activation-1.63e+00

Pos logit contributions
growth+0.040
|+0.040
ground+0.033
 camp+0.032
 outer+0.032
Neg logit contributions
 dear-0.055
 sorry-0.055
 ashamed-0.055
 darling-0.052
 okay-0.052
Token<|endoftext|>
Token ID50256
Feature activation-6.09e+00

Pos logit contributions
|+0.007
growth+0.006
 land+0.006
 camp+0.005
ground+0.005
Neg logit contributions
 sorry-0.019
 dear-0.019
 darling-0.019
 okay-0.018
 unfair-0.018
TokenJimmy
Token ID40335
Feature activation-2.08e+00

Pos logit contributions
growth+0.095
 hunt+0.085
 miles+0.083
 land+0.083
wered+0.082
Neg logit contributions
 sorry-0.154
 okay-0.151
 darling-0.145
 ashamed-0.144
 dear-0.143
Token wanted
Token ID2227
Feature activation+1.31e+00

Pos logit contributions
|+0.038
growth+0.036
 land+0.034
 camp+0.033
ground+0.032
Neg logit contributions
 sorry-0.056
 dear-0.055
 darling-0.053
 okay-0.052
 ashamed-0.051
Token to
Token ID284
Feature activation-8.23e+00

Pos logit contributions
 sorry+0.034
 dear+0.033
 okay+0.032
 darling+0.032
 ashamed+0.031
Neg logit contributions
|-0.027
growth-0.025
 land-0.023
ground-0.022
 camp-0.022
Token climb
Token ID12080
Feature activation-4.26e+00

Pos logit contributions
|+0.140
growth+0.126
 valleys+0.122
 forests+0.119
 camp+0.107
Neg logit contributions
 sorry-0.234
 dear-0.229
 darling-0.225
 okay-0.222
 ok-0.216
Token to
Token ID284
Feature activation-9.93e+00

Pos logit contributions
|+0.072
growth+0.067
of+0.057
e+0.057
 land+0.055
Neg logit contributions
 sorry-0.117
 okay-0.113
 dear-0.111
 darling-0.111
 ashamed-0.107
Token the
Token ID262
Feature activation-3.48e+00

Pos logit contributions
|+0.187
growth+0.147
of+0.138
e+0.137
r+0.136
Neg logit contributions
 sorry-0.255
 dear-0.245
 darling-0.241
 okay-0.238
 ashamed-0.226
Token top
Token ID1353
Feature activation-6.80e-01

Pos logit contributions
|+0.061
growth+0.055
e+0.051
r+0.046
of+0.045
Neg logit contributions
 dear-0.099
 sorry-0.097
 darling-0.095
 unfair-0.094
 ashamed-0.089
Token of
Token ID286
Feature activation-7.64e+00

Pos logit contributions
|+0.014
growth+0.014
 land+0.012
 camp+0.012
e+0.011
Neg logit contributions
 dear-0.016
 sorry-0.016
 darling-0.016
 okay-0.015
 ashamed-0.015
Token the
Token ID262
Feature activation-2.95e+00

Pos logit contributions
|+0.149
of+0.113
growth+0.109
r+0.101
ide+0.100
Neg logit contributions
 sorry-0.220
 dear-0.202
 okay-0.201
 darling-0.196
 ashamed-0.192
Token tree
Token ID5509
Feature activation-4.75e+00

Pos logit contributions
|+0.058
growth+0.055
e+0.050
of+0.045
r+0.044
Neg logit contributions
 dear-0.077
 darling-0.074
 sorry-0.074
 unfair-0.071
 important-0.070
Token had
Token ID550
Feature activation-3.80e-01

Pos logit contributions
|+0.004
growth+0.004
 land+0.004
 camp+0.004
ground+0.004
Neg logit contributions
 sorry-0.005
 dear-0.005
 darling-0.005
 ashamed-0.005
 okay-0.005
Token been
Token ID587
Feature activation+3.19e+00

Pos logit contributions
|+0.008
growth+0.007
 land+0.006
 camp+0.006
ground+0.006
Neg logit contributions
 sorry-0.010
 dear-0.010
 okay-0.010
 ashamed-0.010
 darling-0.010
Token able
Token ID1498
Feature activation+4.69e-01

Pos logit contributions
 sorry+0.072
 dear+0.070
 darling+0.067
 okay+0.067
 ashamed+0.066
Neg logit contributions
|-0.073
growth-0.071
 land-0.067
 camp-0.065
ground-0.063
Token to
Token ID284
Feature activation-7.69e+00

Pos logit contributions
 sorry+0.013
 dear+0.013
 darling+0.013
 okay+0.012
 ashamed+0.012
Neg logit contributions
|-0.008
growth-0.008
 land-0.007
 camp-0.007
ground-0.007
Token climb
Token ID12080
Feature activation-5.60e+00

Pos logit contributions
|+0.154
 valleys+0.139
growth+0.139
 forests+0.139
 rivers+0.124
Neg logit contributions
 sorry-0.207
 okay-0.196
 ok-0.188
 dear-0.187
 darling-0.183
Token to
Token ID284
Feature activation-9.93e+00

Pos logit contributions
|+0.095
growth+0.089
 land+0.075
e+0.075
of+0.073
Neg logit contributions
 sorry-0.154
 okay-0.148
 dear-0.146
 darling-0.146
 ok-0.142
Token the
Token ID262
Feature activation-3.31e+00

Pos logit contributions
|+0.167
growth+0.129
of+0.124
r+0.118
 traveled+0.116
Neg logit contributions
 sorry-0.279
 okay-0.264
 dear-0.263
 darling-0.259
 ashamed-0.250
Token top
Token ID1353
Feature activation-2.93e-01

Pos logit contributions
|+0.061
growth+0.054
e+0.048
of+0.047
ground+0.046
Neg logit contributions
 dear-0.093
 sorry-0.093
 darling-0.089
 okay-0.086
 unfair-0.085
Token.
Token ID13
Feature activation+6.88e-01

Pos logit contributions
|+0.007
growth+0.007
 camp+0.005
e+0.005
 hunt+0.005
Neg logit contributions
 dear-0.006
 darling-0.006
 sorry-0.006
 okay-0.006
 ashamed-0.006
Token After
Token ID2293
Feature activation-2.98e+00

Pos logit contributions
 sorry+0.019
 dear+0.018
 darling+0.018
 okay+0.018
 ashamed+0.017
Neg logit contributions
|-0.012
growth-0.012
 land-0.011
 camp-0.011
ground-0.010
Token a
Token ID257
Feature activation+4.06e-01

Pos logit contributions
|+0.061
growth+0.055
 land+0.050
of+0.049
 hunt+0.046
Neg logit contributions
 sorry-0.081
 dear-0.076
 okay-0.075
 darling-0.073
 ashamed-0.071
Token to
Token ID284
Feature activation-8.68e+00

Pos logit contributions
 dear+0.017
 sorry+0.017
 darling+0.017
 okay+0.016
 ok+0.016
Neg logit contributions
|-0.008
 land-0.007
growth-0.006
 camp-0.006
 the-0.006
Token it
Token ID340
Feature activation-2.16e+00

Pos logit contributions
|+0.173
growth+0.163
ground+0.138
 forests+0.138
 valleys+0.138
Neg logit contributions
 sorry-0.223
 dear-0.209
 darling-0.208
 okay-0.206
 ashamed-0.199
Token.
Token ID13
Feature activation+7.39e-02

Pos logit contributions
|+0.033
growth+0.031
 land+0.029
 camp+0.028
ground+0.026
Neg logit contributions
 sorry-0.065
 dear-0.064
 darling-0.062
 okay-0.061
 ashamed-0.060
Token He
Token ID679
Feature activation-1.49e+00

Pos logit contributions
 sorry+0.002
 dear+0.002
 darling+0.002
 okay+0.002
 ok+0.002
Neg logit contributions
|-0.001
growth-0.001
 land-0.001
 camp-0.001
ground-0.001
Token climbed
Token ID19952
Feature activation-4.79e+00

Pos logit contributions
|+0.028
growth+0.027
 camp+0.024
 land+0.024
ground+0.023
Neg logit contributions
 sorry-0.039
 dear-0.039
 darling-0.038
 ashamed-0.037
 okay-0.037
Token to
Token ID284
Feature activation-9.92e+00

Pos logit contributions
|+0.103
growth+0.099
 land+0.093
 camp+0.090
ground+0.087
Neg logit contributions
 sorry-0.114
 dear-0.110
 darling-0.107
 okay-0.106
 ashamed-0.103
Token the
Token ID262
Feature activation-2.69e+00

Pos logit contributions
|+0.173
growth+0.147
field+0.130
ide+0.128
ground+0.127
Neg logit contributions
 sorry-0.256
 darling-0.254
 okay-0.251
 dear-0.250
 ashamed-0.238
Token top
Token ID1353
Feature activation-4.22e-01

Pos logit contributions
|+0.045
growth+0.042
e+0.035
 camp+0.034
ground+0.033
Neg logit contributions
 sorry-0.080
 dear-0.079
 darling-0.076
 ashamed-0.075
 okay-0.074
Token and
Token ID290
Feature activation-1.87e+00

Pos logit contributions
|+0.010
growth+0.009
 camp+0.008
 land+0.008
 hunt+0.008
Neg logit contributions
 dear-0.010
 darling-0.009
 sorry-0.009
 okay-0.009
 ashamed-0.009
Token stepped
Token ID10764
Feature activation+1.21e+00

Pos logit contributions
|+0.035
growth+0.034
 camp+0.030
 land+0.029
ground+0.029
Neg logit contributions
 sorry-0.049
 dear-0.048
 ashamed-0.047
 darling-0.047
 okay-0.046
Token onto
Token ID4291
Feature activation-6.25e+00

Pos logit contributions
 dear+0.035
 sorry+0.035
 darling+0.033
 unfair+0.033
 ashamed+0.032
Neg logit contributions
|-0.019
growth-0.019
 land-0.018
 camp-0.017
ground-0.016
Token heard
Token ID2982
Feature activation-1.07e+00

Pos logit contributions
|+0.058
growth+0.055
ground+0.050
 camp+0.050
e+0.048
Neg logit contributions
 sorry-0.060
 dear-0.057
 ashamed-0.055
 darling-0.054
 okay-0.054
Token Tweet
Token ID18752
Feature activation-3.62e+00

Pos logit contributions
|+0.019
growth+0.017
ground+0.016
 land+0.016
 camp+0.015
Neg logit contributions
 sorry-0.031
 dear-0.030
 ashamed-0.029
 darling-0.029
 okay-0.029
Tokenie
Token ID494
Feature activation-2.04e+00

Pos logit contributions
growth+0.075
|+0.070
ground+0.070
 valleys+0.064
 forests+0.062
Neg logit contributions
 sorry-0.095
 okay-0.090
 ashamed-0.085
 dear-0.085
 ok-0.084
Token and
Token ID290
Feature activation-1.07e+00

Pos logit contributions
|+0.042
growth+0.042
ground+0.037
 land+0.037
 camp+0.036
Neg logit contributions
 sorry-0.053
 dear-0.049
 okay-0.049
 ashamed-0.048
 darling-0.047
Token ran
Token ID4966
Feature activation-8.11e-01

Pos logit contributions
|+0.023
growth+0.022
ground+0.020
e+0.018
 camp+0.018
Neg logit contributions
 sorry-0.028
 ashamed-0.027
 dear-0.027
 okay-0.026
 darling-0.025
Token to
Token ID284
Feature activation-9.92e+00

Pos logit contributions
|+0.014
growth+0.014
 land+0.013
 camp+0.012
ground+0.012
Neg logit contributions
 sorry-0.022
 dear-0.022
 okay-0.021
 darling-0.021
 ashamed-0.021
Token help
Token ID1037
Feature activation-1.90e+00

Pos logit contributions
growth+0.187
|+0.180
field+0.171
of+0.170
ground+0.170
Neg logit contributions
 sorry-0.279
 okay-0.249
 ashamed-0.236
 dear-0.233
 ok-0.230
Token.
Token ID13
Feature activation+1.16e-01

Pos logit contributions
|+0.038
growth+0.037
 valleys+0.033
 forests+0.033
ground+0.033
Neg logit contributions
 sorry-0.050
 okay-0.048
 dear-0.046
 ok-0.045
 darling-0.045
Token "
Token ID366
Feature activation-2.17e+00

Pos logit contributions
 sorry+0.003
 dear+0.003
 darling+0.003
 okay+0.003
 ashamed+0.003
Neg logit contributions
|-0.002
growth-0.002
 land-0.002
 camp-0.002
ground-0.002
TokenHold
Token ID26807
Feature activation-6.31e-01

Pos logit contributions
 camp+0.044
 land+0.043
|+0.043
growth+0.042
ground+0.041
Neg logit contributions
 okay-0.050
 ok-0.048
 sorry-0.048
 dear-0.046
 darling-0.046
Token on
Token ID319
Feature activation-6.86e+00

Pos logit contributions
|+0.010
growth+0.009
 land+0.009
 camp+0.009
e+0.008
Neg logit contributions
 dear-0.018
 sorry-0.018
 darling-0.018
 ashamed-0.017
 okay-0.017
Token
Token ID198
Feature activation-6.50e+00

Pos logit contributions
|+0.005
growth+0.005
 land+0.005
e+0.005
 miles+0.004
Neg logit contributions
 sorry-0.009
 dear-0.009
 darling-0.009
 okay-0.009
 ashamed-0.008
Token
Token ID198
Feature activation-5.25e+00

Pos logit contributions
growth+0.127
|+0.116
 land+0.115
 valleys+0.109
 forests+0.108
Neg logit contributions
 sorry-0.147
 okay-0.145
 dear-0.140
 important-0.137
 darling-0.137
TokenPeter
Token ID19727
Feature activation+2.18e-02

Pos logit contributions
|+0.071
growth+0.066
 land+0.060
 camp+0.054
of+0.054
Neg logit contributions
 important-0.149
 okay-0.148
 unfair-0.148
 sorry-0.147
 ok-0.147
Token would
Token ID561
Feature activation-7.90e-01

Pos logit contributions
 dear+0.001
 sorry+0.001
 darling+0.001
 ashamed+0.001
 okay+0.000
Neg logit contributions
growth-0.000
|-0.000
 land-0.000
 camp-0.000
ground-0.000
Token read
Token ID1100
Feature activation-3.42e+00

Pos logit contributions
|+0.013
growth+0.012
ground+0.010
 valleys+0.010
 land+0.010
Neg logit contributions
 sorry-0.024
 dear-0.022
 okay-0.022
 darling-0.022
 ok-0.021
Token to
Token ID284
Feature activation-9.90e+00

Pos logit contributions
|+0.064
growth+0.057
of+0.045
field+0.044
 land+0.044
Neg logit contributions
 sorry-0.097
 dear-0.093
 okay-0.093
 important-0.088
 ok-0.086
Token the
Token ID262
Feature activation-2.27e+00

Pos logit contributions
|+0.167
growth+0.143
of+0.127
 valleys+0.122
ground+0.121
Neg logit contributions
 dear-0.294
 sorry-0.292
 darling-0.279
 ashamed-0.265
 okay-0.263
Token surf
Token ID9053
Feature activation+7.20e-01

Pos logit contributions
|+0.034
growth+0.032
 land+0.030
 camp+0.028
ground+0.027
Neg logit contributions
 sorry-0.069
 dear-0.068
 darling-0.066
 okay-0.065
 ashamed-0.065
Tokenboard
Token ID3526
Feature activation-1.64e+00

Pos logit contributions
 sorry+0.014
 dear+0.013
 darling+0.012
 ashamed+0.012
 okay+0.012
Neg logit contributions
|-0.019
growth-0.019
 land-0.016
 camp-0.016
 forests-0.016
Token.
Token ID13
Feature activation+1.78e+00

Pos logit contributions
|+0.034
growth+0.032
 land+0.028
 camp+0.028
ground+0.027
Neg logit contributions
 sorry-0.042
 dear-0.041
 okay-0.039
 darling-0.039
 ashamed-0.038
Token They
Token ID1119
Feature activation-1.97e+00

Pos logit contributions
 sorry+0.042
 dear+0.041
 darling+0.039
 okay+0.039
 ashamed+0.038
Neg logit contributions
|-0.039
growth-0.038
 land-0.036
 camp-0.035
 forests-0.034
Token the
Token ID262
Feature activation+2.79e-01

Pos logit contributions
 sorry+0.084
 dear+0.082
 okay+0.079
 darling+0.079
 ashamed+0.077
Neg logit contributions
|-0.054
growth-0.051
 land-0.047
 camp-0.046
ground-0.044
Token tooth
Token ID16162
Feature activation+1.29e+00

Pos logit contributions
 sorry+0.006
 dear+0.006
 darling+0.006
 okay+0.006
 ashamed+0.005
Neg logit contributions
|-0.007
growth-0.006
 land-0.006
 camp-0.006
ground-0.006
Token fairy
Token ID25607
Feature activation+1.17e+00

Pos logit contributions
 sorry+0.039
 dear+0.038
 darling+0.037
 okay+0.037
 ashamed+0.036
Neg logit contributions
|-0.020
growth-0.018
 land-0.017
 camp-0.016
ground-0.015
Token would
Token ID561
Feature activation+8.78e-02

Pos logit contributions
 sorry+0.036
 dear+0.035
 darling+0.034
 okay+0.034
 ashamed+0.033
Neg logit contributions
|-0.017
growth-0.016
 land-0.015
 camp-0.014
ground-0.014
Token come
Token ID1282
Feature activation-1.95e+00

Pos logit contributions
 sorry+0.002
 okay+0.002
 dear+0.002
 ashamed+0.002
 darling+0.002
Neg logit contributions
growth-0.002
|-0.002
 valleys-0.002
ground-0.002
 forests-0.002
Token at
Token ID379
Feature activation-9.88e+00

Pos logit contributions
growth+0.040
|+0.039
 camp+0.035
ground+0.035
 land+0.035
Neg logit contributions
 sorry-0.049
 okay-0.048
 dear-0.047
 darling-0.046
 ok-0.045
Token night
Token ID1755
Feature activation+7.57e-01

Pos logit contributions
|+0.170
of+0.168
growth+0.164
 enclosure+0.141
ground+0.140
Neg logit contributions
 sorry-0.255
 dear-0.241
 okay-0.241
 important-0.234
 guilty-0.229
Token.
Token ID13
Feature activation+6.04e-02

Pos logit contributions
 dear+0.017
 sorry+0.017
 darling+0.016
 okay+0.016
 ashamed+0.016
Neg logit contributions
growth-0.016
|-0.016
of-0.014
Ray-0.014
wered-0.014
Token Tim
Token ID5045
Feature activation-7.36e-01

Pos logit contributions
 sorry+0.002
 dear+0.002
 darling+0.002
 okay+0.002
 ashamed+0.002
Neg logit contributions
|-0.001
growth-0.001
 land-0.001
 camp-0.001
ground-0.001
Token put
Token ID1234
Feature activation-2.33e+00

Pos logit contributions
growth+0.014
|+0.014
ground+0.013
 camp+0.012
 miles+0.012
Neg logit contributions
 sorry-0.019
 dear-0.018
 ashamed-0.018
 darling-0.018
 okay-0.018
Token the
Token ID262
Feature activation-1.91e+00

Pos logit contributions
|+0.032
growth+0.030
 land+0.026
 camp+0.026
ground+0.025
Neg logit contributions
 sorry-0.077
 dear-0.074
 okay-0.072
 darling-0.071
 ashamed-0.070
TokenL
Token ID43
Feature activation-1.67e+00

Pos logit contributions
growth+0.110
 outer+0.102
 hunt+0.101
wered+0.098
 miles+0.098
Neg logit contributions
 sorry-0.129
 okay-0.126
 OK-0.114
 ashamed-0.110
 guilty-0.110
Tokenily
Token ID813
Feature activation-2.05e+00

Pos logit contributions
|+0.043
growth+0.042
 land+0.040
 forests+0.038
 valleys+0.038
Neg logit contributions
 sorry-0.031
 dear-0.031
 darling-0.029
 okay-0.029
 ashamed-0.027
Token wanted
Token ID2227
Feature activation+1.27e+00

Pos logit contributions
|+0.037
growth+0.036
ground+0.031
 camp+0.031
 land+0.030
Neg logit contributions
 dear-0.053
 sorry-0.053
 darling-0.051
 ashamed-0.051
 okay-0.050
Token to
Token ID284
Feature activation-8.68e+00

Pos logit contributions
 sorry+0.032
 dear+0.031
 okay+0.030
 darling+0.030
 ashamed+0.030
Neg logit contributions
|-0.027
growth-0.025
 land-0.023
ground-0.022
 valleys-0.022
Token come
Token ID1282
Feature activation-3.47e+00

Pos logit contributions
|+0.154
 valleys+0.142
growth+0.136
 forests+0.136
 rivers+0.123
Neg logit contributions
 sorry-0.237
 dear-0.229
 darling-0.222
 okay-0.218
 unfair-0.208
Token to
Token ID284
Feature activation-9.88e+00

Pos logit contributions
|+0.067
growth+0.065
 land+0.058
 camp+0.055
ground+0.054
Neg logit contributions
 sorry-0.091
 dear-0.088
 okay-0.087
 darling-0.085
 ok-0.083
Token the
Token ID262
Feature activation-3.67e+00

Pos logit contributions
|+0.185
growth+0.178
 valleys+0.170
fire+0.160
ide+0.151
Neg logit contributions
 dear-0.238
 sorry-0.236
 darling-0.224
 daughter-0.216
 okay-0.212
Token party
Token ID2151
Feature activation-1.94e+00

Pos logit contributions
|+0.069
growth+0.065
e+0.055
of+0.054
r+0.053
Neg logit contributions
 dear-0.103
 sorry-0.102
 darling-0.097
 unfair-0.094
 ashamed-0.092
Token with
Token ID351
Feature activation-6.02e+00

Pos logit contributions
|+0.036
growth+0.035
 land+0.031
 valleys+0.030
 forests+0.029
Neg logit contributions
 sorry-0.052
 dear-0.051
 okay-0.050
 darling-0.049
 ok-0.048
Token her
Token ID607
Feature activation-5.24e-01

Pos logit contributions
|+0.147
growth+0.135
 valleys+0.131
ground+0.126
 forests+0.126
Neg logit contributions
 sorry-0.141
 dear-0.134
 okay-0.131
 darling-0.124
 unfair-0.118
Token big
Token ID1263
Feature activation-9.21e-02

Pos logit contributions
|+0.014
growth+0.014
 land+0.013
 camp+0.013
ground+0.012
Neg logit contributions
 sorry-0.010
 dear-0.010
 darling-0.009
 okay-0.009
 ashamed-0.009
TokenS
Token ID50
Feature activation+1.19e+00

Pos logit contributions
growth+0.071
|+0.069
 land+0.066
 camp+0.064
 hunt+0.061
Neg logit contributions
 sorry-0.107
 dear-0.104
 okay-0.103
 darling-0.103
 ok-0.100
Tokenara
Token ID3301
Feature activation-7.70e-01

Pos logit contributions
 dear+0.027
 darling+0.026
 sorry+0.026
 okay+0.025
 ok+0.024
Neg logit contributions
|-0.027
growth-0.026
 land-0.025
 camp-0.025
ground-0.024
Token and
Token ID290
Feature activation-5.13e-01

Pos logit contributions
|+0.015
growth+0.015
 land+0.014
ground+0.013
 camp+0.013
Neg logit contributions
 sorry-0.019
 dear-0.019
 darling-0.019
 ashamed-0.018
 okay-0.018
Token Ben
Token ID3932
Feature activation-2.28e-01

Pos logit contributions
|+0.014
ground+0.013
growth+0.013
 valleys+0.012
 land+0.012
Neg logit contributions
 sorry-0.011
 dear-0.010
 ashamed-0.009
 okay-0.009
 darling-0.009
Token jog
Token ID48342
Feature activation-7.33e-02

Pos logit contributions
|+0.004
growth+0.004
ground+0.004
 outer+0.003
 forests+0.003
Neg logit contributions
 sorry-0.007
 ashamed-0.006
 okay-0.006
 dear-0.006
 ok-0.006
Token to
Token ID284
Feature activation-9.87e+00

Pos logit contributions
growth+0.002
|+0.001
 land+0.001
 forests+0.001
 valleys+0.001
Neg logit contributions
 sorry-0.002
 okay-0.002
 dear-0.002
 darling-0.002
 ashamed-0.002
Token the
Token ID262
Feature activation-3.36e+00

Pos logit contributions
|+0.160
growth+0.156
r+0.144
 valleys+0.144
of+0.144
Neg logit contributions
 sorry-0.301
 dear-0.272
 okay-0.264
 darling-0.258
 ashamed-0.252
Token swings
Token ID26728
Feature activation-3.92e+00

Pos logit contributions
|+0.056
growth+0.055
e+0.046
of+0.046
r+0.043
Neg logit contributions
 sorry-0.101
 dear-0.097
 darling-0.093
 ashamed-0.092
 okay-0.091
Token.
Token ID13
Feature activation+1.78e+00

Pos logit contributions
growth+0.095
|+0.090
 camp+0.078
 forests+0.077
 valleys+0.077
Neg logit contributions
 sorry-0.087
 okay-0.084
 dear-0.080
 darling-0.076
 ashamed-0.076
Token They
Token ID1119
Feature activation-2.40e+00

Pos logit contributions
 sorry+0.041
 dear+0.040
 darling+0.038
 okay+0.038
 ashamed+0.037
Neg logit contributions
|-0.040
growth-0.039
 land-0.037
 camp-0.036
ground-0.034
Token push
Token ID4574
Feature activation-2.16e+00

Pos logit contributions
growth+0.050
|+0.044
ground+0.041
 outer+0.039
 forests+0.039
Neg logit contributions
 sorry-0.063
 ashamed-0.057
 okay-0.056
 guilty-0.055
 dear-0.055
Tokenenny
Token ID11870
Feature activation+1.44e+00

Pos logit contributions
 sorry+0.047
 dear+0.044
 darling+0.043
 okay+0.042
 ashamed+0.041
Neg logit contributions
|-0.060
growth-0.058
 land-0.056
of-0.055
 camp-0.054
Token felt
Token ID2936
Feature activation+4.71e+00

Pos logit contributions
 sorry+0.036
 dear+0.036
 darling+0.035
 ashamed+0.034
 okay+0.034
Neg logit contributions
|-0.028
growth-0.028
 land-0.025
 camp-0.025
ground-0.024
Token sad
Token ID6507
Feature activation+6.11e+00

Pos logit contributions
 dear+0.073
 darling+0.070
 sorry+0.070
 okay+0.064
ower+0.063
Neg logit contributions
|-0.136
growth-0.131
 land-0.131
 camp-0.126
 forests-0.121
Token because
Token ID780
Feature activation+2.77e+00

Pos logit contributions
 sorry+0.159
 dear+0.156
 darling+0.153
 unfair+0.152
 ok+0.149
Neg logit contributions
 land-0.104
|-0.099
 camp-0.099
growth-0.096
 the-0.090
Token she
Token ID673
Feature activation+8.67e-01

Pos logit contributions
 sorry+0.055
 dear+0.052
 okay+0.050
 darling+0.050
 ashamed+0.049
Neg logit contributions
|-0.072
growth-0.069
 land-0.065
 camp-0.064
ground-0.063
Token thought
Token ID1807
Feature activation+7.70e+00

Pos logit contributions
 sorry+0.022
 dear+0.021
 darling+0.020
 okay+0.020
rier+0.020
Neg logit contributions
|-0.018
 land-0.016
growth-0.016
 camp-0.015
e-0.015
Token she
Token ID673
Feature activation+1.76e+00

Pos logit contributions
 sorry+0.216
 dear+0.212
 darling+0.207
 okay+0.203
 ashamed+0.200
Neg logit contributions
|-0.130
growth-0.124
 land-0.119
 camp-0.113
 forests-0.106
Token could
Token ID714
Feature activation-1.28e+00

Pos logit contributions
 sorry+0.051
 dear+0.049
 darling+0.048
 okay+0.047
 ashamed+0.046
Neg logit contributions
|-0.030
growth-0.028
 land-0.027
 camp-0.025
 forests-0.023
Token never
Token ID1239
Feature activation-1.29e+00

Pos logit contributions
growth+0.025
|+0.023
 valleys+0.022
ground+0.022
 forests+0.022
Neg logit contributions
 sorry-0.036
 okay-0.035
 ok-0.032
 dear-0.032
 darling-0.031
Token dance
Token ID9280
Feature activation+2.54e+00

Pos logit contributions
growth+0.025
ground+0.022
|+0.022
 valleys+0.021
 forests+0.021
Neg logit contributions
 sorry-0.036
 okay-0.035
 ashamed-0.033
 ok-0.033
 dear-0.033
Token like
Token ID588
Feature activation-5.13e-02

Pos logit contributions
 sorry+0.067
 okay+0.063
 dear+0.063
 darling+0.060
 ashamed+0.060
Neg logit contributions
growth-0.050
|-0.050
 land-0.044
ground-0.043
 camp-0.042
Token feeling
Token ID4203
Feature activation+4.16e+00

Pos logit contributions
 sorry+0.069
 dear+0.067
 darling+0.065
 okay+0.064
 ashamed+0.062
Neg logit contributions
|-0.077
growth-0.074
 land-0.072
 camp-0.069
ground-0.067
Token even
Token ID772
Feature activation+3.39e+00

Pos logit contributions
 sorry+0.075
 dear+0.069
 okay+0.066
 ashamed+0.066
 darling+0.065
Neg logit contributions
|-0.119
growth-0.115
 land-0.109
 camp-0.107
ground-0.105
Token more
Token ID517
Feature activation+2.84e+00

Pos logit contributions
 sorry+0.083
 dear+0.080
 okay+0.078
 darling+0.077
 ashamed+0.076
Neg logit contributions
|-0.073
growth-0.071
 land-0.065
 camp-0.063
ground-0.062
Token curious
Token ID11040
Feature activation+4.53e+00

Pos logit contributions
 sorry+0.055
 dear+0.054
 darling+0.052
 okay+0.051
ouncing+0.051
Neg logit contributions
 land-0.069
growth-0.065
|-0.064
 camp-0.062
 forests-0.061
Token and
Token ID290
Feature activation+1.59e+00

Pos logit contributions
 sorry+0.123
 dear+0.119
 darling+0.116
 okay+0.115
 ashamed+0.114
Neg logit contributions
|-0.079
growth-0.077
 land-0.075
 camp-0.071
 forests-0.068
Token wondering
Token ID11263
Feature activation+6.83e+00

Pos logit contributions
 sorry+0.034
 dear+0.033
 darling+0.031
 okay+0.031
 ashamed+0.031
Neg logit contributions
|-0.039
growth-0.038
 land-0.035
 camp-0.035
ground-0.034
Token what
Token ID644
Feature activation+2.18e+00

Pos logit contributions
 sorry+0.170
 dear+0.165
 darling+0.159
 okay+0.158
 ashamed+0.155
Neg logit contributions
|-0.144
growth-0.136
 land-0.128
 camp-0.124
ground-0.120
Token the
Token ID262
Feature activation+7.47e-01

Pos logit contributions
 sorry+0.058
 dear+0.056
 okay+0.054
 darling+0.054
 ashamed+0.054
Neg logit contributions
|-0.043
growth-0.040
 land-0.036
 camp-0.035
ground-0.035
Token morning
Token ID3329
Feature activation+3.53e+00

Pos logit contributions
 sorry+0.018
 dear+0.018
 darling+0.017
 okay+0.017
 ashamed+0.017
Neg logit contributions
|-0.015
growth-0.015
 land-0.015
 camp-0.014
 forests-0.013
Token would
Token ID561
Feature activation+1.31e+00

Pos logit contributions
 dear+0.097
 sorry+0.097
 okay+0.095
 darling+0.093
 ok+0.092
Neg logit contributions
|-0.061
growth-0.056
 camp-0.050
 land-0.047
of-0.046
Token bring
Token ID2222
Feature activation-1.85e+00

Pos logit contributions
 sorry+0.033
 okay+0.032
 dear+0.031
 ok+0.031
 darling+0.031
Neg logit contributions
|-0.027
growth-0.025
 forests-0.023
 valleys-0.022
 camp-0.022
Token very
Token ID845
Feature activation+6.31e+00

Pos logit contributions
 darling+0.079
 dear+0.078
ower+0.078
rier+0.076
ouncing+0.076
Neg logit contributions
 the-0.066
 land-0.065
|-0.063
e-0.061
 camp-0.059
Token excited
Token ID6568
Feature activation+5.06e+00

Pos logit contributions
 fault+0.144
rier+0.133
idents+0.127
ouncing+0.125
aser+0.123
Neg logit contributions
 land-0.134
 the-0.122
 live-0.118
 play-0.117
 town-0.117
Token and
Token ID290
Feature activation+1.71e+00

Pos logit contributions
 sorry+0.161
 dear+0.149
rier+0.148
 ashamed+0.147
 okay+0.145
Neg logit contributions
 land-0.067
|-0.067
 hunt-0.062
 camp-0.061
 the-0.060
Token couldn
Token ID3521
Feature activation+1.39e+00

Pos logit contributions
rier+0.049
 fault+0.049
 sorry+0.045
ower+0.045
 dear+0.045
Neg logit contributions
 land-0.027
 the-0.025
 play-0.024
|-0.024
 visit-0.023
Token't
Token ID470
Feature activation-9.98e-01

Pos logit contributions
 sorry+0.033
 dear+0.031
 okay+0.031
 darling+0.030
 ok+0.029
Neg logit contributions
growth-0.029
|-0.029
 land-0.028
 forests-0.027
 valleys-0.027
Token wait
Token ID4043
Feature activation+7.78e+00

Pos logit contributions
|+0.020
growth+0.019
 land+0.018
 camp+0.017
ground+0.016
Neg logit contributions
 sorry-0.026
 dear-0.025
 darling-0.024
 okay-0.024
 ashamed-0.024
Token to
Token ID284
Feature activation-6.85e+00

Pos logit contributions
 sorry+0.234
 dear+0.231
 darling+0.224
 ashamed+0.223
 unfair+0.219
Neg logit contributions
|-0.108
growth-0.105
 land-0.104
 camp-0.100
 hunt-0.092
Token see
Token ID766
Feature activation-2.07e+00

Pos logit contributions
|+0.137
growth+0.119
 valleys+0.117
 forests+0.112
ground+0.111
Neg logit contributions
 sorry-0.184
 dear-0.174
 okay-0.168
 darling-0.168
 ok-0.161
Token what
Token ID644
Feature activation+2.11e+00

Pos logit contributions
|+0.042
growth+0.039
 land+0.036
 camp+0.035
ground+0.034
Neg logit contributions
 sorry-0.055
 dear-0.053
 darling-0.051
 okay-0.051
 ashamed-0.050
Token it
Token ID340
Feature activation+2.27e+00

Pos logit contributions
 sorry+0.061
 dear+0.060
 darling+0.058
 okay+0.057
 ashamed+0.056
Neg logit contributions
|-0.035
growth-0.034
 land-0.031
 camp-0.029
ground-0.028
Token was
Token ID373
Feature activation+3.66e+00

Pos logit contributions
 dear+0.061
 okay+0.061
 sorry+0.060
 unfair+0.059
 ashamed+0.059
Neg logit contributions
growth-0.040
|-0.038
ground-0.033
 forests-0.033
 valleys-0.032
Token
Token ID198
Feature activation-4.05e+00

Pos logit contributions
|+0.072
growth+0.072
 land+0.065
 camp+0.061
 hunt+0.060
Neg logit contributions
 sorry-0.142
 dear-0.136
 okay-0.133
 darling-0.132
 ashamed-0.130
Token"
Token ID1
Feature activation+1.35e-01

Pos logit contributions
|+0.081
growth+0.078
 land+0.074
 camp+0.071
ground+0.068
Neg logit contributions
 sorry-0.103
 dear-0.100
 darling-0.097
 okay-0.096
 ashamed-0.094
TokenThat
Token ID2504
Feature activation+3.45e+00

Pos logit contributions
 sorry+0.004
 dear+0.003
 darling+0.003
 okay+0.003
 ashamed+0.003
Neg logit contributions
|-0.003
growth-0.002
 land-0.002
 camp-0.002
ground-0.002
Token,
Token ID11
Feature activation+5.23e+00

Pos logit contributions
 sorry+0.096
 dear+0.093
 okay+0.090
 darling+0.090
 ashamed+0.088
Neg logit contributions
|-0.061
growth-0.058
 land-0.057
 camp-0.054
ground-0.051
Token my
Token ID616
Feature activation+2.77e+00

Pos logit contributions
ower+0.128
 concerned+0.123
 sorry+0.121
 ashamed+0.119
ometer+0.117
Neg logit contributions
 the-0.092
 land-0.091
 camp-0.091
|-0.087
growth-0.085
Token de
Token ID390
Feature activation+7.05e+00

Pos logit contributions
 sorry+0.046
 okay+0.044
 ok+0.044
 concerned+0.043
rier+0.043
Neg logit contributions
 land-0.073
|-0.072
 camp-0.071
growth-0.069
ground-0.066
Tokenars
Token ID945
Feature activation+1.12e+00

Pos logit contributions
 sorry+0.181
 dear+0.176
 okay+0.169
 darling+0.168
 ashamed+0.165
Neg logit contributions
|-0.141
growth-0.137
 land-0.127
 camp-0.124
 forests-0.119
Token,
Token ID11
Feature activation+5.81e+00

Pos logit contributions
 sorry+0.027
 dear+0.027
 darling+0.026
 okay+0.025
 ashamed+0.024
Neg logit contributions
growth-0.024
|-0.023
 forests-0.021
 valleys-0.020
 land-0.020
Token is
Token ID318
Feature activation+3.21e+00

Pos logit contributions
ower+0.154
 concerned+0.150
 sorry+0.149
 ashamed+0.148
 darling+0.147
Neg logit contributions
 the-0.087
 land-0.086
|-0.085
 camp-0.083
of-0.081
Token a
Token ID257
Feature activation+2.63e+00

Pos logit contributions
 sorry+0.090
 dear+0.089
 darling+0.086
 okay+0.085
 ashamed+0.084
Neg logit contributions
|-0.054
growth-0.052
 land-0.050
 camp-0.048
ground-0.045
Token cave
Token ID11527
Feature activation+2.39e-01

Pos logit contributions
 sorry+0.064
 dear+0.064
 darling+0.062
 okay+0.061
 ashamed+0.060
Neg logit contributions
|-0.051
 land-0.050
growth-0.049
 camp-0.048
 hunt-0.045
Token on
Token ID319
Feature activation-6.45e+00

Pos logit contributions
 sorry+0.017
 dear+0.017
 darling+0.016
 okay+0.016
 ashamed+0.016
Neg logit contributions
|-0.009
growth-0.009
 land-0.008
 camp-0.008
ground-0.008
Token the
Token ID262
Feature activation-3.04e+00

Pos logit contributions
|+0.118
growth+0.078
 traveled+0.077
of+0.077
 explored+0.076
Neg logit contributions
 sorry-0.176
 dear-0.175
 darling-0.173
 unfair-0.170
 okay-0.166
Token slides
Token ID19392
Feature activation-1.12e+00

Pos logit contributions
|+0.055
growth+0.052
 land+0.046
 camp+0.046
ground+0.044
Neg logit contributions
 sorry-0.084
 dear-0.083
 darling-0.081
 okay-0.078
 ashamed-0.077
Token.
Token ID13
Feature activation+3.82e-01

Pos logit contributions
|+0.022
growth+0.022
 camp+0.018
 land+0.018
ground+0.018
Neg logit contributions
 sorry-0.028
 dear-0.028
 darling-0.027
 okay-0.026
ower-0.026
Token Jack
Token ID3619
Feature activation+9.21e-02

Pos logit contributions
 sorry+0.010
 dear+0.010
 darling+0.009
 okay+0.009
 ashamed+0.009
Neg logit contributions
|-0.007
growth-0.007
 land-0.007
 camp-0.007
 forests-0.006
Token thought
Token ID1807
Feature activation+7.58e+00

Pos logit contributions
 sorry+0.002
 dear+0.002
 darling+0.002
 ashamed+0.002
 okay+0.002
Neg logit contributions
|-0.002
growth-0.002
 land-0.002
 camp-0.002
ground-0.002
Token that
Token ID326
Feature activation+9.61e-01

Pos logit contributions
 sorry+0.219
 dear+0.217
 darling+0.213
 okay+0.207
 ashamed+0.205
Neg logit contributions
|-0.117
growth-0.113
 land-0.110
 camp-0.105
 forests-0.097
Token maybe
Token ID3863
Feature activation+4.96e+00

Pos logit contributions
 sorry+0.028
 dear+0.027
 okay+0.027
 darling+0.026
 ok+0.026
Neg logit contributions
|-0.016
growth-0.016
 land-0.013
ground-0.013
 camp-0.012
Token it
Token ID340
Feature activation+2.62e+00

Pos logit contributions
 sorry+0.131
 dear+0.128
 okay+0.125
 darling+0.123
 ok+0.119
Neg logit contributions
|-0.099
growth-0.092
 land-0.081
 camp-0.079
ground-0.079
Token was
Token ID373
Feature activation+3.23e+00

Pos logit contributions
 sorry+0.076
 dear+0.075
 okay+0.074
 darling+0.073
 ashamed+0.072
Neg logit contributions
growth-0.041
|-0.041
 land-0.035
ground-0.035
 camp-0.034
Token special
Token ID2041
Feature activation+3.00e+00

Pos logit contributions
 sorry+0.089
 dear+0.088
 darling+0.086
 okay+0.083
 ashamed+0.082
Neg logit contributions
|-0.055
 land-0.052
growth-0.052
 camp-0.049
 forests-0.045
Token is
Token ID318
Feature activation+3.15e+00

Pos logit contributions
|+0.008
growth+0.007
 land+0.007
 camp+0.007
ground+0.006
Neg logit contributions
 sorry-0.010
 dear-0.010
 darling-0.010
 okay-0.009
 ashamed-0.009
Token my
Token ID616
Feature activation+3.14e+00

Pos logit contributions
 sorry+0.077
 dear+0.075
 darling+0.073
 okay+0.072
 ashamed+0.070
Neg logit contributions
|-0.065
growth-0.062
 land-0.060
 camp-0.058
ground-0.055
Token best
Token ID1266
Feature activation+7.84e+00

Pos logit contributions
 sorry+0.067
 ashamed+0.063
 concerned+0.063
 unfair+0.062
ower+0.062
Neg logit contributions
 land-0.070
 camp-0.067
|-0.066
 visit-0.062
 hunt-0.061
Token friend
Token ID1545
Feature activation+2.39e+00

Pos logit contributions
 sorry+0.197
 unfair+0.188
 dear+0.186
 ashamed+0.184
 concerned+0.183
Neg logit contributions
|-0.148
 land-0.148
 camp-0.140
growth-0.135
 visit-0.131
Token.
Token ID13
Feature activation+3.89e+00

Pos logit contributions
 sorry+0.067
 dear+0.066
 darling+0.064
 ashamed+0.063
 concerned+0.063
Neg logit contributions
|-0.039
 land-0.036
growth-0.034
 camp-0.033
e-0.033
Token Do
Token ID2141
Feature activation+2.19e+00

Pos logit contributions
 sorry+0.077
 concerned+0.073
 dear+0.073
 ashamed+0.073
 unfair+0.073
Neg logit contributions
|-0.096
growth-0.091
 land-0.086
 camp-0.084
of-0.081
Token you
Token ID345
Feature activation+2.15e+00

Pos logit contributions
 sorry+0.045
 dear+0.045
 okay+0.042
 darling+0.042
 ok+0.039
Neg logit contributions
|-0.055
growth-0.055
 land-0.051
 camp-0.050
ground-0.049
Token have
Token ID423
Feature activation-1.12e+00

Pos logit contributions
 sorry+0.039
 dear+0.038
 darling+0.036
 okay+0.035
 ashamed+0.035
Neg logit contributions
|-0.059
growth-0.056
 land-0.055
 camp-0.053
ground-0.051
Token a
Token ID257
Feature activation+1.81e+00

Pos logit contributions
|+0.021
growth+0.020
 land+0.018
ground+0.018
 camp+0.018
Neg logit contributions
 sorry-0.031
 dear-0.030
 okay-0.029
 darling-0.029
 ashamed-0.028
Token best
Token ID1266
Feature activation+6.64e+00

Pos logit contributions
 sorry+0.045
 dear+0.044
 darling+0.043
 okay+0.043
 ashamed+0.043
Neg logit contributions
|-0.035
 land-0.033
 camp-0.032
growth-0.031
 hunt-0.030
Token friend
Token ID1545
Feature activation+9.55e-01

Pos logit contributions
 sorry+0.153
 dear+0.148
 darling+0.143
 okay+0.141
 ashamed+0.140
Neg logit contributions
|-0.146
growth-0.139
 land-0.137
 camp-0.131
 hunt-0.124
Token think
Token ID892
Feature activation+8.51e+00

Pos logit contributions
 sorry+0.000
 dear+0.000
 darling+0.000
 okay+0.000
 ashamed+0.000
Neg logit contributions
|-0.000
growth-0.000
 land-0.000
 camp-0.000
 forests-0.000
Token these
Token ID777
Feature activation+2.18e+00

Pos logit contributions
 sorry+0.222
 dear+0.217
 concerned+0.217
 darling+0.216
 ashamed+0.212
Neg logit contributions
|-0.137
 land-0.132
growth-0.129
 the-0.128
 camp-0.128
Token cookies
Token ID14746
Feature activation-1.80e+00

Pos logit contributions
 sorry+0.051
 dear+0.050
 darling+0.048
 okay+0.047
 ashamed+0.046
Neg logit contributions
|-0.047
growth-0.045
 land-0.043
 camp-0.042
ground-0.040
Token are
Token ID389
Feature activation+3.18e+00

Pos logit contributions
|+0.026
growth+0.025
 land+0.023
 camp+0.022
ground+0.021
Neg logit contributions
 sorry-0.055
 dear-0.054
 darling-0.052
 okay-0.052
 ashamed-0.051
Token delicious
Token ID12625
Feature activation+2.88e+00

Pos logit contributions
 sorry+0.066
 dear+0.065
 darling+0.063
 ashamed+0.060
 okay+0.060
Neg logit contributions
|-0.073
 land-0.071
growth-0.069
 camp-0.068
 hunt-0.064
Token.
Token ID13
Feature activation+3.29e+00

Pos logit contributions
 sorry+0.088
 concerned+0.086
 ashamed+0.082
 dear+0.081
 darling+0.079
Neg logit contributions
 land-0.039
|-0.037
 camp-0.037
e-0.034
 hunt-0.034
Token You
Token ID921
Feature activation+3.46e+00

Pos logit contributions
 concerned+0.067
 sorry+0.066
rier+0.064
 guilty+0.063
ower+0.063
Neg logit contributions
|-0.076
growth-0.069
 camp-0.066
 visit-0.066
 meet-0.064
Token should
Token ID815
Feature activation-2.20e+00

Pos logit contributions
 sorry+0.081
 dear+0.079
 okay+0.077
 darling+0.076
 ashamed+0.075
Neg logit contributions
|-0.077
growth-0.076
 forests-0.068
 land-0.068
 camp-0.068
Token try
Token ID1949
Feature activation+1.75e+00

Pos logit contributions
|+0.036
growth+0.034
 land+0.031
 camp+0.030
 forests+0.029
Neg logit contributions
 sorry-0.064
 dear-0.063
 darling-0.061
 okay-0.061
 ashamed-0.059
Token them
Token ID606
Feature activation-1.21e+00

Pos logit contributions
 sorry+0.050
 dear+0.049
 darling+0.048
 okay+0.047
 ashamed+0.046
Neg logit contributions
|-0.030
growth-0.028
 land-0.026
 camp-0.025
ground-0.024
Token again
Token ID757
Feature activation-1.14e+00

Pos logit contributions
|+0.025
growth+0.024
 land+0.022
 camp+0.021
 forests+0.020
Neg logit contributions
 sorry-0.031
 okay-0.030
 dear-0.030
 darling-0.029
 ok-0.029
Token hungry
Token ID14720
Feature activation+3.08e+00

Pos logit contributions
 sorry+0.045
 dear+0.044
 darling+0.042
 okay+0.041
 ashamed+0.040
Neg logit contributions
|-0.078
growth-0.075
 land-0.073
 camp-0.071
ground-0.069
Token.
Token ID13
Feature activation+2.11e+00

Pos logit contributions
 sorry+0.092
 dear+0.090
 darling+0.087
 okay+0.087
 ashamed+0.085
Neg logit contributions
|-0.048
growth-0.045
 land-0.042
 camp-0.040
ground-0.038
Token
Token ID198
Feature activation-5.02e+00

Pos logit contributions
 sorry+0.058
 dear+0.057
 darling+0.055
 okay+0.054
 ashamed+0.054
Neg logit contributions
|-0.038
growth-0.036
 land-0.034
 camp-0.033
ground-0.031
Token
Token ID198
Feature activation-4.04e+00

Pos logit contributions
growth+0.093
|+0.091
 land+0.085
 hunt+0.082
 camp+0.081
Neg logit contributions
 sorry-0.131
 okay-0.125
 dear-0.122
 darling-0.120
 ashamed-0.119
Token"
Token ID1
Feature activation-9.17e-02

Pos logit contributions
|+0.086
growth+0.084
 land+0.077
 camp+0.076
 hunt+0.073
Neg logit contributions
 sorry-0.098
 dear-0.094
 okay-0.092
 darling-0.091
 ashamed-0.090
TokenHello
Token ID15496
Feature activation+3.05e+00

Pos logit contributions
|+0.002
 land+0.002
growth+0.002
 camp+0.002
 hunt+0.002
Neg logit contributions
 sorry-0.002
 okay-0.002
 dear-0.002
 ashamed-0.002
 darling-0.002
Token,
Token ID11
Feature activation+4.94e+00

Pos logit contributions
 sorry+0.079
 dear+0.078
 darling+0.075
 okay+0.075
 ashamed+0.072
Neg logit contributions
|-0.061
growth-0.059
 land-0.053
 camp-0.051
ground-0.050
Token puppy
Token ID26188
Feature activation+4.69e-01

Pos logit contributions
 sorry+0.110
 dear+0.107
 darling+0.105
 okay+0.102
 ashamed+0.101
Neg logit contributions
|-0.109
growth-0.103
 land-0.102
 camp-0.099
ground-0.094
Token!
Token ID0
Feature activation+5.31e+00

Pos logit contributions
 sorry+0.012
 dear+0.011
 darling+0.011
 okay+0.011
 ashamed+0.011
Neg logit contributions
|-0.010
growth-0.010
 land-0.009
 camp-0.008
 forests-0.008
Token Where
Token ID6350
Feature activation+3.63e+00

Pos logit contributions
 sorry+0.100
 dear+0.096
 darling+0.091
 okay+0.090
 ashamed+0.086
Neg logit contributions
|-0.140
growth-0.137
 land-0.134
 camp-0.129
 forests-0.126
Token did
Token ID750
Feature activation+1.79e+00

Pos logit contributions
 sorry+0.109
 dear+0.106
 okay+0.105
 darling+0.101
 ok+0.100
Neg logit contributions
growth-0.059
|-0.058
 forests-0.051
 valleys-0.051
 land-0.051
Token win
Token ID1592
Feature activation+7.42e-01

Pos logit contributions
 sorry+0.019
 dear+0.018
 okay+0.018
 darling+0.018
 ok+0.017
Neg logit contributions
|-0.020
growth-0.019
 land-0.019
 camp-0.018
 forests-0.017
Token.
Token ID13
Feature activation+8.58e-01

Pos logit contributions
 sorry+0.019
 dear+0.019
 darling+0.018
 okay+0.018
 ok+0.017
Neg logit contributions
|-0.015
growth-0.014
 camp-0.013
 land-0.012
ground-0.012
Token Everyone
Token ID11075
Feature activation-1.31e+00

Pos logit contributions
 sorry+0.023
 dear+0.023
 darling+0.022
 okay+0.021
 ashamed+0.021
Neg logit contributions
|-0.015
growth-0.015
 land-0.014
 camp-0.014
 hunt-0.013
Token was
Token ID373
Feature activation+3.40e+00

Pos logit contributions
growth+0.023
|+0.023
 camp+0.020
 land+0.019
ground+0.019
Neg logit contributions
 sorry-0.036
 dear-0.035
 ashamed-0.034
 darling-0.034
 concerned-0.033
Token amazed
Token ID24562
Feature activation+3.79e+00

Pos logit contributions
 sorry+0.062
 dear+0.059
 darling+0.056
 okay+0.056
 ashamed+0.055
Neg logit contributions
|-0.093
growth-0.091
 land-0.087
 camp-0.085
ground-0.082
Token and
Token ID290
Feature activation+1.93e+00

Pos logit contributions
 sorry+0.105
 dear+0.100
 darling+0.097
 okay+0.096
 ashamed+0.095
Neg logit contributions
|-0.064
 land-0.063
growth-0.060
 camp-0.056
 forests-0.056
Token cheered
Token ID34767
Feature activation+3.54e+00

Pos logit contributions
 sorry+0.043
 dear+0.042
 darling+0.040
 okay+0.040
 ashamed+0.039
Neg logit contributions
|-0.045
growth-0.044
 land-0.041
 camp-0.040
ground-0.039
Token for
Token ID329
Feature activation-4.65e+00

Pos logit contributions
 sorry+0.103
 dear+0.100
 ashamed+0.098
 darling+0.096
 unfair+0.095
Neg logit contributions
|-0.052
 land-0.050
growth-0.047
 camp-0.047
 the-0.044
Token her
Token ID607
Feature activation+3.32e+00

Pos logit contributions
|+0.081
growth+0.074
 land+0.066
 camp+0.064
ground+0.063
Neg logit contributions
 sorry-0.139
 dear-0.136
 darling-0.131
 okay-0.131
 ok-0.127
Token.
Token ID13
Feature activation+7.47e-01

Pos logit contributions
 sorry+0.093
 dear+0.091
 unfair+0.089
 darling+0.088
 ashamed+0.087
Neg logit contributions
|-0.048
 land-0.048
growth-0.046
 camp-0.045
 the-0.042
Token
Token ID198
Feature activation-5.77e+00

Pos logit contributions
 sorry+0.020
 dear+0.020
 darling+0.019
 okay+0.019
 ashamed+0.019
Neg logit contributions
|-0.013
growth-0.013
 land-0.013
 camp-0.012
 hunt-0.012
Token her
Token ID607
Feature activation+2.58e+00

Pos logit contributions
|+0.003
growth+0.002
 land+0.002
 valleys+0.002
ground+0.002
Neg logit contributions
 sorry-0.004
 dear-0.004
 darling-0.004
 okay-0.003
 ashamed-0.003
Token friends
Token ID2460
Feature activation-2.69e+00

Pos logit contributions
 sorry+0.056
 dear+0.055
 darling+0.053
 okay+0.053
 ok+0.051
Neg logit contributions
|-0.058
 land-0.056
growth-0.056
 camp-0.054
ground-0.050
Token all
Token ID477
Feature activation-1.46e+00

Pos logit contributions
|+0.066
growth+0.063
 land+0.060
 camp+0.058
ground+0.057
Neg logit contributions
 sorry-0.057
 dear-0.055
 darling-0.053
 okay-0.053
 ashamed-0.051
Token about
Token ID546
Feature activation-4.67e+00

Pos logit contributions
|+0.041
growth+0.040
 land+0.038
 camp+0.037
ground+0.036
Neg logit contributions
 sorry-0.026
 dear-0.025
 darling-0.023
 okay-0.023
 ashamed-0.023
Token their
Token ID511
Feature activation+8.25e-01

Pos logit contributions
|+0.067
growth+0.062
 land+0.051
of+0.050
ground+0.050
Neg logit contributions
 sorry-0.151
 dear-0.145
 okay-0.143
 darling-0.141
 ashamed-0.138
Token secret
Token ID3200
Feature activation+1.87e+00

Pos logit contributions
 sorry+0.020
 dear+0.020
 darling+0.019
 okay+0.019
 ashamed+0.019
Neg logit contributions
|-0.017
growth-0.016
 land-0.015
 camp-0.015
ground-0.014
Token adventure
Token ID8855
Feature activation+8.70e-01

Pos logit contributions
 sorry+0.050
 dear+0.050
 darling+0.049
 okay+0.047
 ashamed+0.046
Neg logit contributions
|-0.035
growth-0.033
 land-0.028
 camp-0.028
of-0.028
Token.
Token ID13
Feature activation+1.98e+00

Pos logit contributions
 sorry+0.024
 dear+0.023
 darling+0.023
 okay+0.023
 ashamed+0.022
Neg logit contributions
|-0.016
growth-0.015
 land-0.014
 camp-0.013
 forests-0.012
Token They
Token ID1119
Feature activation-1.76e+00

Pos logit contributions
 sorry+0.049
 dear+0.048
 okay+0.046
 darling+0.045
 ok+0.045
Neg logit contributions
|-0.041
growth-0.039
 land-0.036
ground-0.035
 camp-0.035
Token all
Token ID477
Feature activation-1.59e+00

Pos logit contributions
|+0.038
growth+0.038
ground+0.033
 land+0.032
 valleys+0.032
Neg logit contributions
 sorry-0.044
 ashamed-0.042
 dear-0.042
 okay-0.041
 darling-0.040
Token thought
Token ID1807
Feature activation+7.26e+00

Pos logit contributions
growth+0.040
|+0.038
ground+0.036
 camp+0.035
 forests+0.035
Neg logit contributions
 sorry-0.033
 ashamed-0.032
 dear-0.031
 darling-0.030
 concerned-0.029
Token.
Token ID13
Feature activation+1.55e+00

Pos logit contributions
 dear+0.099
 unfair+0.099
 darling+0.097
 fault+0.093
ouncing+0.092
Neg logit contributions
 the-0.020
 land-0.016
e-0.016
|-0.015
 play-0.013
Token
Token ID198
Feature activation-5.43e+00

Pos logit contributions
 sorry+0.046
 dear+0.046
 darling+0.045
 unfair+0.044
rier+0.044
Neg logit contributions
|-0.022
growth-0.020
 land-0.019
 camp-0.019
ground-0.018
Token
Token ID198
Feature activation-4.51e+00

Pos logit contributions
growth+0.086
|+0.084
 land+0.081
 camp+0.074
 forests+0.072
Neg logit contributions
 sorry-0.158
 okay-0.154
 dear-0.152
 darling-0.150
 ok-0.147
TokenBilly
Token ID49640
Feature activation-5.96e-01

Pos logit contributions
|+0.075
growth+0.071
 land+0.067
 camp+0.063
ground+0.062
Neg logit contributions
 sorry-0.131
 dear-0.128
 darling-0.124
 okay-0.121
 ashamed-0.121
Token didn
Token ID1422
Feature activation+4.47e+00

Pos logit contributions
|+0.011
growth+0.011
 land+0.010
 camp+0.010
ground+0.010
Neg logit contributions
 sorry-0.016
 dear-0.015
 ashamed-0.015
 darling-0.015
 okay-0.015
Token't
Token ID470
Feature activation+1.46e-01

Pos logit contributions
 sorry+0.127
 dear+0.127
 darling+0.123
 okay+0.120
 ashamed+0.119
Neg logit contributions
|-0.075
growth-0.070
 land-0.066
 camp-0.063
ground-0.061
Token want
Token ID765
Feature activation+1.93e+00

Pos logit contributions
 dear+0.004
 sorry+0.004
 darling+0.004
 concerned+0.003
ower+0.003
Neg logit contributions
|-0.003
 land-0.003
growth-0.003
of-0.003
 camp-0.002
Token to
Token ID284
Feature activation-5.63e+00

Pos logit contributions
 sorry+0.057
 dear+0.056
 darling+0.055
 okay+0.054
 ashamed+0.053
Neg logit contributions
|-0.030
growth-0.029
 land-0.027
 camp-0.026
ground-0.024
Token play
Token ID711
Feature activation+2.60e-01

Pos logit contributions
|+0.082
growth+0.078
 land+0.072
 camp+0.068
ground+0.065
Neg logit contributions
 sorry-0.174
 dear-0.171
 darling-0.166
 okay-0.165
 ashamed-0.162
Token with
Token ID351
Feature activation-4.12e+00

Pos logit contributions
 sorry+0.008
 dear+0.007
 okay+0.007
 ashamed+0.007
 darling+0.007
Neg logit contributions
|-0.004
growth-0.004
 land-0.004
 camp-0.004
 forests-0.004
Token Tim
Token ID5045
Feature activation-1.88e+00

Pos logit contributions
|+0.055
growth+0.051
 land+0.047
 camp+0.044
ground+0.041
Neg logit contributions
 sorry-0.134
 dear-0.131
 darling-0.128
 okay-0.127
 ashamed-0.125
Token are
Token ID389
Feature activation+1.39e+00

Pos logit contributions
growth+0.046
|+0.044
ground+0.043
 forests+0.042
 valleys+0.041
Neg logit contributions
 sorry-0.058
 okay-0.054
 dear-0.054
 ashamed-0.053
 darling-0.051
Token friends
Token ID2460
Feature activation+4.65e-01

Pos logit contributions
 dear+0.027
 darling+0.027
ower+0.025
rier+0.025
 unfair+0.024
Neg logit contributions
 land-0.033
 the-0.032
|-0.032
 camp-0.031
e-0.030
Token.
Token ID13
Feature activation+3.06e+00

Pos logit contributions
 sorry+0.014
 dear+0.014
 darling+0.013
 okay+0.013
 ashamed+0.013
Neg logit contributions
|-0.007
growth-0.007
 land-0.006
 camp-0.006
ground-0.006
Token
Token ID198
Feature activation-4.80e+00

Pos logit contributions
 sorry+0.079
 dear+0.077
 darling+0.075
 okay+0.074
 ashamed+0.072
Neg logit contributions
|-0.060
growth-0.057
 land-0.055
 camp-0.052
ground-0.050
Token
Token ID198
Feature activation-4.09e+00

Pos logit contributions
|+0.082
growth+0.079
 land+0.071
 miles+0.068
 valleys+0.068
Neg logit contributions
 sorry-0.141
 okay-0.129
 dear-0.128
 ashamed-0.126
 darling-0.123
TokenThey
Token ID2990
Feature activation-2.50e+00

Pos logit contributions
|+0.088
growth+0.085
 camp+0.078
 land+0.078
 hunt+0.073
Neg logit contributions
 sorry-0.099
 okay-0.094
 dear-0.092
 ashamed-0.089
 darling-0.089
Token go
Token ID467
Feature activation-1.46e+00

Pos logit contributions
growth+0.050
|+0.047
ground+0.046
 forests+0.044
 valleys+0.044
Neg logit contributions
 sorry-0.064
 okay-0.057
 ashamed-0.055
 dear-0.054
 guilty-0.054
Token to
Token ID284
Feature activation-8.47e+00

Pos logit contributions
growth+0.029
|+0.029
 land+0.025
 camp+0.024
ground+0.024
Neg logit contributions
 sorry-0.038
 dear-0.035
 okay-0.035
 darling-0.034
 ashamed-0.034
Token the
Token ID262
Feature activation-2.61e+00

Pos logit contributions
growth+0.138
|+0.131
 valleys+0.124
r+0.122
 forests+0.118
Neg logit contributions
 sorry-0.259
 okay-0.228
 ashamed-0.219
 dear-0.213
 guilty-0.212
Token swing
Token ID9628
Feature activation-2.60e+00

Pos logit contributions
|+0.040
growth+0.039
e+0.033
of+0.030
r+0.030
Neg logit contributions
 sorry-0.083
 dear-0.079
 ashamed-0.076
 darling-0.075
 okay-0.075
Token.
Token ID13
Feature activation+3.02e+00

Pos logit contributions
growth+0.067
|+0.065
 outer+0.055
ground+0.055
 valleys+0.054
Neg logit contributions
 sorry-0.053
 okay-0.049
 dear-0.047
 ashamed-0.046
rier-0.044
Token big
Token ID1263
Feature activation+1.33e+00

Pos logit contributions
 sorry+0.014
 dear+0.014
 darling+0.014
 ok+0.013
 okay+0.013
Neg logit contributions
|-0.013
 land-0.013
growth-0.013
 camp-0.013
ground-0.012
Token and
Token ID290
Feature activation+1.41e+00

Pos logit contributions
 sorry+0.031
 dear+0.031
 darling+0.030
 okay+0.030
 ok+0.029
Neg logit contributions
|-0.028
 land-0.027
growth-0.026
 camp-0.026
 forests-0.024
Token round
Token ID2835
Feature activation+1.62e+00

Pos logit contributions
 fault+0.035
 dear+0.034
 darling+0.034
ushing+0.033
 sorry+0.032
Neg logit contributions
 land-0.028
|-0.027
 the-0.025
 camp-0.025
 play-0.024
Token wheel
Token ID7825
Feature activation-4.41e-01

Pos logit contributions
 sorry+0.037
 dear+0.036
 darling+0.034
 okay+0.033
 ashamed+0.033
Neg logit contributions
|-0.037
growth-0.035
 land-0.033
 camp-0.032
ground-0.031
Token was
Token ID373
Feature activation+2.56e+00

Pos logit contributions
growth+0.008
|+0.007
 camp+0.006
 forests+0.006
 valleys+0.006
Neg logit contributions
 sorry-0.012
 ashamed-0.012
 dear-0.012
 okay-0.011
 guilty-0.011
Token rolling
Token ID10708
Feature activation-2.06e-01

Pos logit contributions
 dear+0.063
 darling+0.062
 sorry+0.061
 okay+0.058
 ok+0.057
Neg logit contributions
|-0.051
 land-0.050
growth-0.048
 camp-0.047
ground-0.045
Token down
Token ID866
Feature activation-3.70e+00

Pos logit contributions
growth+0.004
|+0.004
 land+0.003
 camp+0.003
 hunt+0.003
Neg logit contributions
 sorry-0.005
 okay-0.005
 dear-0.005
 ashamed-0.005
 darling-0.005
Token the
Token ID262
Feature activation-3.41e+00

Pos logit contributions
|+0.066
of+0.064
growth+0.060
 hunt+0.056
isk+0.055
Neg logit contributions
 sorry-0.098
 ashamed-0.088
 guilty-0.084
 important-0.084
 dear-0.082
Token hill
Token ID12788
Feature activation-2.49e+00

Pos logit contributions
|+0.065
growth+0.064
e+0.055
 land+0.054
of+0.054
Neg logit contributions
 sorry-0.086
 dear-0.085
 darling-0.082
 ashamed-0.080
 okay-0.079
Token.
Token ID13
Feature activation+1.29e+00

Pos logit contributions
growth+0.054
|+0.050
 camp+0.045
 forests+0.043
of+0.043
Neg logit contributions
 sorry-0.058
 okay-0.055
 ashamed-0.054
 dear-0.054
ower-0.052
Token The
Token ID383
Feature activation-1.04e+00

Pos logit contributions
 sorry+0.036
 dear+0.035
 darling+0.034
 okay+0.033
 ashamed+0.033
Neg logit contributions
|-0.023
growth-0.022
 land-0.021
 camp-0.020
ground-0.019
Token and
Token ID290
Feature activation-4.37e-01

Pos logit contributions
 sorry+0.016
 dear+0.016
 darling+0.016
 okay+0.015
 ashamed+0.015
Neg logit contributions
|-0.011
growth-0.011
 land-0.010
 camp-0.010
ground-0.009
Token ro
Token ID686
Feature activation+1.54e+00

Pos logit contributions
|+0.010
growth+0.009
ground+0.008
 land+0.008
of+0.008
Neg logit contributions
 sorry-0.011
 dear-0.010
 ashamed-0.010
 darling-0.010
 okay-0.010
Tokenars
Token ID945
Feature activation-2.46e+00

Pos logit contributions
 sorry+0.039
 dear+0.037
 okay+0.036
 ashamed+0.036
 darling+0.035
Neg logit contributions
|-0.031
growth-0.031
 land-0.029
 camp-0.028
 forests-0.026
Token again
Token ID757
Feature activation-1.33e+00

Pos logit contributions
|+0.045
growth+0.042
 land+0.038
 camp+0.037
ground+0.036
Neg logit contributions
 sorry-0.069
 dear-0.065
 okay-0.065
 ashamed-0.063
 darling-0.063
Token.
Token ID13
Feature activation+2.88e+00

Pos logit contributions
|+0.030
growth+0.028
ground+0.024
wered+0.024
 camp+0.024
Neg logit contributions
 sorry-0.032
 okay-0.030
 dear-0.030
 ashamed-0.029
 darling-0.029
Token It
Token ID632
Feature activation-1.42e+00

Pos logit contributions
 sorry+0.071
 dear+0.069
 darling+0.066
 okay+0.066
 ashamed+0.065
Neg logit contributions
|-0.061
growth-0.057
 land-0.056
 camp-0.054
ground-0.051
Token looks
Token ID3073
Feature activation+5.94e+00

Pos logit contributions
|+0.033
growth+0.031
ground+0.029
 land+0.029
 camp+0.028
Neg logit contributions
 sorry-0.032
 dear-0.030
 ashamed-0.030
 darling-0.029
 okay-0.029
Token for
Token ID329
Feature activation-4.65e+00

Pos logit contributions
 dear+0.129
ower+0.127
 darling+0.127
ouncing+0.118
rier+0.117
Neg logit contributions
 land-0.127
 the-0.121
|-0.121
 camp-0.119
growth-0.114
Token something
Token ID1223
Feature activation+4.43e+00

Pos logit contributions
|+0.109
of+0.082
growth+0.081
ground+0.077
field+0.075
Neg logit contributions
 sorry-0.132
 dear-0.124
 okay-0.121
 ashamed-0.115
 darling-0.112
Token else
Token ID2073
Feature activation+1.75e+00

Pos logit contributions
 sorry+0.117
 dear+0.117
 darling+0.114
 okay+0.109
 ashamed+0.108
Neg logit contributions
|-0.076
 land-0.074
growth-0.073
 camp-0.070
 hunt-0.067
Token to
Token ID284
Feature activation-5.10e+00

Pos logit contributions
 sorry+0.051
 dear+0.051
 darling+0.049
 okay+0.048
 ashamed+0.048
Neg logit contributions
|-0.027
growth-0.026
 land-0.025
 camp-0.023
ground-0.022
Token to
Token ID284
Feature activation-6.78e+00

Pos logit contributions
 sorry+0.046
 dear+0.043
 okay+0.043
 ashamed+0.042
 darling+0.042
Neg logit contributions
|-0.035
growth-0.034
 land-0.030
ground-0.030
of-0.029
Token walk
Token ID2513
Feature activation-2.85e+00

Pos logit contributions
|+0.116
growth+0.115
ground+0.103
of+0.096
 forests+0.090
Neg logit contributions
 sorry-0.196
 okay-0.189
 dear-0.182
 darling-0.178
 ok-0.176
Token together
Token ID1978
Feature activation-1.55e+00

Pos logit contributions
growth+0.065
|+0.061
ground+0.055
of+0.053
wered+0.053
Neg logit contributions
 sorry-0.064
 okay-0.060
 ashamed-0.058
 dear-0.057
 darling-0.056
Token.
Token ID13
Feature activation+4.05e-01

Pos logit contributions
|+0.037
growth+0.037
wered+0.030
of+0.030
ground+0.029
Neg logit contributions
 sorry-0.033
 okay-0.033
 ok-0.030
 dear-0.029
 ashamed-0.029
Token As
Token ID1081
Feature activation+1.29e+00

Pos logit contributions
 sorry+0.011
 dear+0.011
 darling+0.011
 unfair+0.011
 ok+0.011
Neg logit contributions
|-0.007
growth-0.006
 land-0.006
 camp-0.006
ground-0.006
Token they
Token ID484
Feature activation-1.36e+00

Pos logit contributions
 sorry+0.027
 dear+0.026
 okay+0.026
 darling+0.024
 ashamed+0.024
Neg logit contributions
|-0.033
growth-0.032
e-0.029
 enclosure-0.028
 hunt-0.028
Token walked
Token ID6807
Feature activation-1.35e+00

Pos logit contributions
growth+0.024
|+0.023
ground+0.022
 outer+0.021
 hunt+0.021
Neg logit contributions
 sorry-0.028
 ashamed-0.028
 okay-0.028
 guilty-0.027
 stubborn-0.027
Token,
Token ID11
Feature activation+2.98e+00

Pos logit contributions
growth+0.029
|+0.029
 land+0.025
e+0.025
ground+0.024
Neg logit contributions
 sorry-0.034
 dear-0.031
 okay-0.031
 darling-0.030
 ashamed-0.030
Token the
Token ID262
Feature activation-2.34e-01

Pos logit contributions
 sorry+0.071
 dear+0.067
 okay+0.065
 darling+0.063
 ashamed+0.063
Neg logit contributions
|-0.066
growth-0.065
 land-0.060
 camp-0.058
of-0.057
Token sun
Token ID4252
Feature activation-8.50e-01

Pos logit contributions
|+0.004
growth+0.004
ground+0.003
of+0.003
e+0.003
Neg logit contributions
 sorry-0.007
 dear-0.007
 okay-0.006
 darling-0.006
 ashamed-0.006
Token started
Token ID2067
Feature activation+2.71e+00

Pos logit contributions
|+0.017
growth+0.015
 hunt+0.013
ground+0.013
 camp+0.012
Neg logit contributions
 sorry-0.022
 dear-0.022
 okay-0.021
 ashamed-0.021
 concerned-0.020
Token big
Token ID1263
Feature activation+6.94e-01

Pos logit contributions
|+0.006
growth+0.006
 land+0.005
 camp+0.005
ground+0.005
Neg logit contributions
 sorry-0.012
 dear-0.011
 darling-0.011
 okay-0.011
 ashamed-0.011
Token boy
Token ID2933
Feature activation+6.29e-01

Pos logit contributions
 sorry+0.015
 dear+0.015
 darling+0.014
 okay+0.014
 ashamed+0.014
Neg logit contributions
|-0.016
growth-0.016
 land-0.015
 camp-0.015
 forests-0.014
Token.
Token ID13
Feature activation+2.04e+00

Pos logit contributions
 sorry+0.018
 dear+0.018
 darling+0.017
 okay+0.017
 ashamed+0.017
Neg logit contributions
|-0.010
growth-0.010
 land-0.009
 camp-0.009
ground-0.008
Token They
Token ID1119
Feature activation-2.03e+00

Pos logit contributions
 sorry+0.051
 dear+0.050
 darling+0.049
 okay+0.048
 ok+0.047
Neg logit contributions
|-0.041
growth-0.039
 land-0.037
 camp-0.036
ground-0.034
Token ran
Token ID4966
Feature activation+1.99e-01

Pos logit contributions
|+0.045
growth+0.044
ground+0.040
 camp+0.040
 land+0.040
Neg logit contributions
 sorry-0.049
 dear-0.046
 ashamed-0.045
 okay-0.045
 darling-0.044
Token to
Token ID284
Feature activation-8.78e+00

Pos logit contributions
 sorry+0.006
 dear+0.005
 okay+0.005
 darling+0.005
 ashamed+0.005
Neg logit contributions
|-0.003
growth-0.003
 land-0.003
 camp-0.003
ground-0.003
Token the
Token ID262
Feature activation-2.90e+00

Pos logit contributions
|+0.146
growth+0.143
 valleys+0.130
 forests+0.129
field+0.120
Neg logit contributions
 sorry-0.274
 okay-0.250
 ashamed-0.237
 ok-0.232
 guilty-0.230
Token open
Token ID1280
Feature activation-2.92e+00

Pos logit contributions
|+0.060
growth+0.058
e+0.052
ground+0.049
of+0.048
Neg logit contributions
 sorry-0.078
 dear-0.074
 ashamed-0.072
 darling-0.070
 okay-0.070
Token.
Token ID13
Feature activation+1.57e+00

Pos logit contributions
|+0.068
growth+0.060
e+0.055
of+0.051
isk+0.050
Neg logit contributions
 dear-0.065
 sorry-0.063
 darling-0.062
 ashamed-0.061
rier-0.057
Token The
Token ID383
Feature activation-9.51e-01

Pos logit contributions
 sorry+0.037
 dear+0.036
 darling+0.035
 okay+0.034
 ashamed+0.034
Neg logit contributions
|-0.034
growth-0.033
 land-0.032
 camp-0.031
 forests-0.029
Token open
Token ID1280
Feature activation-1.48e+00

Pos logit contributions
|+0.023
growth+0.022
 land+0.021
 camp+0.021
 forests+0.020
Neg logit contributions
 sorry-0.020
 dear-0.020
 darling-0.019
 okay-0.019
 ok-0.018
Token didn
Token ID1422
Feature activation+3.78e+00

Pos logit contributions
 sorry+0.031
 dear+0.031
 darling+0.030
 okay+0.029
 ashamed+0.029
Neg logit contributions
|-0.020
growth-0.019
 land-0.018
 camp-0.017
ground-0.017
Token't
Token ID470
Feature activation-3.36e-01

Pos logit contributions
 sorry+0.102
 dear+0.099
 darling+0.096
 okay+0.095
 ashamed+0.093
Neg logit contributions
|-0.069
growth-0.066
 land-0.063
 camp-0.060
 forests-0.058
Token want
Token ID765
Feature activation+1.60e+00

Pos logit contributions
|+0.007
growth+0.007
 land+0.007
 camp+0.007
ground+0.006
Neg logit contributions
 sorry-0.008
 dear-0.008
 darling-0.007
 okay-0.007
 ashamed-0.007
Token to
Token ID284
Feature activation-5.82e+00

Pos logit contributions
 sorry+0.041
 dear+0.040
 okay+0.039
 darling+0.038
 ashamed+0.038
Neg logit contributions
|-0.032
growth-0.030
 land-0.029
ground-0.028
 valleys-0.027
Token go
Token ID467
Feature activation-5.29e-01

Pos logit contributions
growth+0.092
|+0.088
 valleys+0.081
 forests+0.080
ground+0.079
Neg logit contributions
 sorry-0.179
 okay-0.173
 dear-0.168
 darling-0.164
 ok-0.164
Token to
Token ID284
Feature activation-7.86e+00

Pos logit contributions
growth+0.010
|+0.010
ground+0.009
 land+0.009
 forests+0.009
Neg logit contributions
 sorry-0.014
 okay-0.013
 dear-0.013
 ashamed-0.013
 darling-0.012
Token the
Token ID262
Feature activation-2.26e+00

Pos logit contributions
|+0.113
ground+0.104
of+0.104
 valleys+0.103
growth+0.101
Neg logit contributions
 sorry-0.244
 okay-0.218
 ashamed-0.217
 dear-0.214
 fault-0.213
Token beach
Token ID10481
Feature activation-1.08e+00

Pos logit contributions
|+0.040
growth+0.037
e+0.032
of+0.031
ground+0.030
Neg logit contributions
 sorry-0.067
 dear-0.064
 ashamed-0.062
 unfair-0.062
 darling-0.061
Token anymore
Token ID7471
Feature activation-5.30e-01

Pos logit contributions
growth+0.020
|+0.020
 land+0.017
ground+0.016
 camp+0.016
Neg logit contributions
 sorry-0.029
 okay-0.028
 dear-0.028
 ashamed-0.027
 darling-0.027
Token.
Token ID13
Feature activation+2.15e+00

Pos logit contributions
|+0.009
growth+0.009
 land+0.008
 camp+0.008
ground+0.007
Neg logit contributions
 sorry-0.015
 dear-0.015
 darling-0.014
 okay-0.014
 ashamed-0.014
Token He
Token ID679
Feature activation-9.21e-01

Pos logit contributions
rier+0.062
 sorry+0.060
 concerned+0.060
 dollars+0.059
asers+0.058
Neg logit contributions
|-0.030
growth-0.028
 land-0.027
 visit-0.027
 play-0.026
Token herbs
Token ID25411
Feature activation-2.60e+00

Pos logit contributions
|+0.060
growth+0.057
 land+0.054
 camp+0.052
ground+0.051
Neg logit contributions
 sorry-0.063
 dear-0.062
 darling-0.060
 okay-0.058
 ashamed-0.057
Token and
Token ID290
Feature activation-2.36e-01

Pos logit contributions
|+0.046
growth+0.044
 land+0.042
 camp+0.040
ground+0.038
Neg logit contributions
 sorry-0.071
 dear-0.070
 darling-0.067
 okay-0.067
 ashamed-0.066
Token put
Token ID1234
Feature activation-2.93e+00

Pos logit contributions
|+0.005
growth+0.005
 land+0.005
 camp+0.004
ground+0.004
Neg logit contributions
 dear-0.006
 sorry-0.006
 darling-0.005
 okay-0.005
 ashamed-0.005
Token the
Token ID262
Feature activation-1.28e+00

Pos logit contributions
|+0.041
growth+0.036
 land+0.031
 camp+0.031
 hunt+0.029
Neg logit contributions
 sorry-0.093
 dear-0.090
 okay-0.089
 darling-0.089
 ok-0.087
Token pie
Token ID2508
Feature activation-3.51e+00

Pos logit contributions
|+0.028
growth+0.027
 land+0.027
 camp+0.026
 forests+0.025
Neg logit contributions
 sorry-0.029
 dear-0.028
 darling-0.027
 okay-0.027
 ashamed-0.027
Token into
Token ID656
Feature activation-7.24e+00

Pos logit contributions
|+0.059
growth+0.056
 land+0.051
 camp+0.049
 forests+0.046
Neg logit contributions
 sorry-0.100
 dear-0.098
 okay-0.096
 darling-0.095
 ok-0.092
Token the
Token ID262
Feature activation-1.74e+00

Pos logit contributions
|+0.129
growth+0.102
 land+0.091
of+0.089
 camp+0.089
Neg logit contributions
 sorry-0.215
 dear-0.214
 darling-0.210
 okay-0.205
 ashamed-0.197
Token oven
Token ID14361
Feature activation-2.22e+00

Pos logit contributions
|+0.035
growth+0.033
 land+0.032
 camp+0.031
ground+0.029
Neg logit contributions
 sorry-0.044
 dear-0.043
 darling-0.042
 okay-0.041
 ashamed-0.040
Token.
Token ID13
Feature activation+1.79e+00

Pos logit contributions
|+0.036
growth+0.034
 land+0.032
 camp+0.030
ground+0.029
Neg logit contributions
 sorry-0.064
 dear-0.063
 darling-0.061
 okay-0.061
 ashamed-0.060
Token And
Token ID843
Feature activation+3.60e+00

Pos logit contributions
 sorry+0.046
 dear+0.044
 darling+0.042
 okay+0.042
 ashamed+0.042
Neg logit contributions
|-0.036
growth-0.034
 land-0.031
 camp-0.031
ground-0.030
Token when
Token ID618
Feature activation+1.87e+00

Pos logit contributions
 sorry+0.095
 dear+0.092
 darling+0.089
 okay+0.088
 ashamed+0.088
Neg logit contributions
|-0.066
growth-0.063
 land-0.061
 camp-0.058
ground-0.056
Token dirty
Token ID11841
Feature activation+3.90e+00

Pos logit contributions
 dear+0.052
 darling+0.050
 sorry+0.049
rier+0.046
ower+0.044
Neg logit contributions
|-0.086
 land-0.083
 camp-0.080
 the-0.080
growth-0.080
Token,
Token ID11
Feature activation+5.14e+00

Pos logit contributions
 sorry+0.112
 dear+0.111
 darling+0.107
 ashamed+0.104
 concerned+0.103
Neg logit contributions
|-0.060
 land-0.056
growth-0.055
 camp-0.053
 forests-0.049
Token so
Token ID523
Feature activation+3.82e+00

Pos logit contributions
 dear+0.109
 sorry+0.108
 darling+0.105
ower+0.100
 okay+0.099
Neg logit contributions
|-0.116
 land-0.112
growth-0.109
 camp-0.106
 forests-0.104
Token it
Token ID340
Feature activation+1.75e+00

Pos logit contributions
 sorry+0.101
 dear+0.098
 darling+0.094
 okay+0.094
 ashamed+0.092
Neg logit contributions
|-0.076
growth-0.072
 land-0.066
 camp-0.065
ground-0.063
Token decided
Token ID3066
Feature activation+6.20e+00

Pos logit contributions
 sorry+0.046
 dear+0.045
 okay+0.043
 darling+0.043
 ashamed+0.043
Neg logit contributions
|-0.033
growth-0.032
 land-0.030
 camp-0.029
ground-0.028
Token to
Token ID284
Feature activation-6.94e+00

Pos logit contributions
 dear+0.194
 sorry+0.193
 darling+0.189
ower+0.181
 ashamed+0.181
Neg logit contributions
|-0.073
 land-0.072
growth-0.068
 camp-0.063
 the-0.062
Token take
Token ID1011
Feature activation-2.90e+00

Pos logit contributions
|+0.115
growth+0.113
 forests+0.103
 valleys+0.102
 land+0.097
Neg logit contributions
 sorry-0.204
 dear-0.194
 okay-0.192
 darling-0.191
 ok-0.187
Token a
Token ID257
Feature activation+1.21e-01

Pos logit contributions
|+0.051
growth+0.046
 land+0.043
 hunt+0.042
 camp+0.042
Neg logit contributions
 sorry-0.082
 dear-0.078
 okay-0.078
 darling-0.076
 ok-0.075
Token bath
Token ID7837
Feature activation+1.17e+00

Pos logit contributions
 sorry+0.004
 dear+0.004
 darling+0.003
 okay+0.003
 ashamed+0.003
Neg logit contributions
|-0.002
growth-0.002
 land-0.002
 camp-0.002
ground-0.002
Token.
Token ID13
Feature activation+1.67e+00

Pos logit contributions
 sorry+0.030
 dear+0.030
 okay+0.029
 darling+0.029
 ashamed+0.028
Neg logit contributions
|-0.022
growth-0.022
 land-0.020
 camp-0.019
 forests-0.019
Token But
Token ID887
Feature activation+2.30e+00

Pos logit contributions
 sorry+0.040
 dear+0.040
 darling+0.038
 okay+0.038
 ashamed+0.037
Neg logit contributions
|-0.035
growth-0.034
 land-0.032
 camp-0.031
ground-0.030
Token find
Token ID1064
Feature activation-1.56e+00

Pos logit contributions
|+0.034
growth+0.032
ground+0.028
 valleys+0.028
 camp+0.027
Neg logit contributions
 sorry-0.048
 dear-0.047
 okay-0.046
 darling-0.045
 ok-0.044
Token the
Token ID262
Feature activation-1.02e+00

Pos logit contributions
|+0.028
growth+0.024
 land+0.022
ground+0.022
 camp+0.022
Neg logit contributions
 sorry-0.045
 dear-0.044
 okay-0.043
 darling-0.043
 ok-0.042
Token truth
Token ID3872
Feature activation+1.71e+00

Pos logit contributions
|+0.022
 land+0.022
growth+0.021
 camp+0.020
 hunt+0.019
Neg logit contributions
 sorry-0.023
 dear-0.023
 darling-0.022
 okay-0.021
 ashamed-0.021
Token.
Token ID13
Feature activation+1.12e+00

Pos logit contributions
 sorry+0.049
 dear+0.048
 darling+0.046
 okay+0.046
 ashamed+0.045
Neg logit contributions
|-0.028
growth-0.027
 land-0.025
 camp-0.024
ground-0.023
Token
Token ID220
Feature activation+1.16e+00

Pos logit contributions
 sorry+0.025
 dear+0.024
 darling+0.023
 okay+0.023
 ashamed+0.023
Neg logit contributions
|-0.026
growth-0.025
 land-0.024
 camp-0.023
ground-0.022
Token
Token ID198
Feature activation-5.95e+00

Pos logit contributions
 sorry+0.039
 dear+0.038
 okay+0.037
 darling+0.037
 ok+0.036
Neg logit contributions
|-0.014
growth-0.014
 land-0.013
 camp-0.012
 forests-0.011
Token
Token ID198
Feature activation-4.97e+00

Pos logit contributions
growth+0.109
 land+0.104
 camp+0.101
|+0.099
 hunt+0.095
Neg logit contributions
 sorry-0.148
 okay-0.143
 dear-0.141
 darling-0.140
 concerned-0.135
TokenThe
Token ID464
Feature activation-8.00e-01

Pos logit contributions
|+0.089
growth+0.088
 land+0.082
 camp+0.082
 hunt+0.075
Neg logit contributions
 sorry-0.131
 dear-0.129
 okay-0.128
 darling-0.126
 ok-0.124
Token end
Token ID886
Feature activation+3.23e+00

Pos logit contributions
|+0.015
 land+0.014
growth+0.014
 camp+0.013
 town+0.013
Neg logit contributions
 dear-0.020
 sorry-0.020
 darling-0.020
 okay-0.019
rier-0.019
Token.
Token ID13
Feature activation+2.07e+00

Pos logit contributions
 sorry+0.093
 dear+0.091
 darling+0.089
 okay+0.088
 ashamed+0.086
Neg logit contributions
|-0.054
growth-0.052
 land-0.048
 camp-0.045
 forests-0.044
Token<|endoftext|>
Token ID50256
Feature activation-5.16e+00

Pos logit contributions
 dear+0.061
 sorry+0.060
 okay+0.060
 darling+0.059
 ashamed+0.057
Neg logit contributions
|-0.029
 land-0.029
growth-0.029
 camp-0.028
 hunt-0.027