TOP ACTIVATIONS
MAX = 4.969

as he pushed . M omm a saw Tim my and
she asked . " Sure , honey . I
pictures together ?" " Of course !" said Lily
it together ?" " Yes , we can ,
Inspired , Emily asked her pap a , " How can
be friends ?" " Of course , oct opus
Ben asked . " Of course we can .
?" Her dad replied , " No , we can 't
Her mom my replied , " Maybe another day , sweet
my smiled and said , " Sure , let 's go
my love ," said M omm a . " Let 's
own story ?" " Of course , we can
of yours ?" " Of course , you can
my smiled and said , " Sure , let 's have
. Ranger ?" " Sure , Lily , but
Lily asks . " Yes , you can .
waves again ?" " Of course , we can
girl asked . " Sure , you can play
Ben asked . " Of course , I can
Her mom my said , " Yes , let 's go

MIN ACTIVATIONS
MIN = -5.206

colourful flowers , through the lush forests deep in thought .
- rolling green hills , lush grass , trees and flowers
. It ran through a lush green me adow , with
was running around on the lush grass , enjoying the sunshine
sight . There was a lush green patch of grass and
good to roll in the lush green grass and jump high
farmer who lived on a lush farm with many animals and
grass was a deep , lush green . The little girl
to set . <|endoftext|> Little Johnny was so excited . He
lived in a big , wide forest and enjoyed exploring every
Finally , they reached a vast stretch of beach . As
middle of a dark , wide sea . All the animals
, there was a big wide house . Inside the house
lived in a big , wide field . As he hopped
, she came to a wide , open field . In
, the beetle sailed across the sea . She was
She spotted a big , wide tree in the distance and
stay happy . <|endoftext|> Little Sammy always liked to explore ,
a time , in a peaceful land , there lived a
pocket . <|endoftext|> Little Tim my was walking in the park

INTERVAL 2.426 - 4.969
CONTAINS 0.611%

Daisy 's mom was very proud of her .
my smiled and said , " No , sweet ie ,
to make Sara and mom angry . He said sorry to
she called . " Yes , mom , we
hugged him and said , " I love you , but

INTERVAL -0.118 - 2.426
CONTAINS 54.755%

leave Lily and Ben alone . The boy and
a little girl named Lily . She loved to play with
warm and happy . They forget about the fall and the
't want to buy from him because he thought the bucket
a nice nap . <|endoftext|> L ily and Ben like to

INTERVAL -2.662 - -0.118
CONTAINS 43.966%

parade , but he was not very good at walking and
her . One day , her teacher said ,
mineral all day , bending and flex ing it . But
and watch the navy show . They learn more about the
sky . He wanted to fly too ! The

INTERVAL -5.206 - -2.662
CONTAINS 0.669%

playing , he saw a big , mean cat coming near
friends forever . <|endoftext|> It was a cold , icy day
were flying their k ite on the hill . Harry has
mom walked around the park . He waved at the other
, Tim my was playing in his garden with his toy
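
The four INTERVAL ranges above appear to be equal-width bins spanning MIN to MAX. That reading is inferred from the numbers, not stated in the dump. The sketch below is a minimal check of the inference: it recomputes the bin edges from the listed MIN and MAX, compares them with the listed boundaries, and sums the CONTAINS percentages. The helper name `equal_width_edges` is hypothetical and only for illustration.

```python
def equal_width_edges(lo, hi, n_bins):
    """Return n_bins + 1 equally spaced edges from lo to hi (inclusive)."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]


# Values copied from the MIN / MAX / INTERVAL / CONTAINS lines above.
MIN_ACT, MAX_ACT = -5.206, 4.969
listed_edges = [-5.206, -2.662, -0.118, 2.426, 4.969]
shares_percent = [0.669, 43.966, 54.755, 0.611]  # lowest bin first

# Assumption: four bins of equal width between MIN and MAX.
edges = equal_width_edges(MIN_ACT, MAX_ACT, 4)

# The dump shows three decimals, so allow for rounding when comparing.
max_gap = max(abs(c - s) for c, s in zip(edges, listed_edges))
total_share = sum(shares_percent)

print("largest gap between computed and listed edges:", max_gap)
print("sum of CONTAINS percentages:", total_share)
```

If the equal-width reading is right, the largest gap stays below the display rounding (0.001) and the shares sum to roughly 100%. About 98.7% of activations fall in the two middle bins, so both the top and bottom example lists above are drawn from thin tails of the distribution.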
Token as
Token ID355
Feature activation+2.38e-01

Pos logit contributions
Of+0.002
olding+0.002
 forgiveness+0.002
aid+0.002
 apology+0.002
Neg logit contributions
 lake-0.002
neath-0.002
 camp-0.002
 shore-0.002
 creek-0.002
Token he
Token ID339
Feature activation+3.04e-01

Pos logit contributions
Of+0.010
aid+0.010
 forgiveness+0.009
olding+0.009
 apology+0.009
Neg logit contributions
 lake-0.012
neath-0.012
 camp-0.012
 shore-0.012
 river-0.011
Token pushed
Token ID7121
Feature activation-7.41e-02

Pos logit contributions
Of+0.016
aid+0.015
olding+0.015
 forgiveness+0.015
 apology+0.015
Neg logit contributions
 lake-0.012
 camp-0.012
neath-0.012
 shore-0.012
 river-0.011
Token.
Token ID13
Feature activation-5.98e-01

Pos logit contributions
 lake+0.002
neath+0.002
 camp+0.002
 creek+0.002
 river+0.002
Neg logit contributions
Of-0.004
aid-0.004
 forgiveness-0.004
olding-0.004
 apology-0.004
Token M
Token ID337
Feature activation+5.98e-01

Pos logit contributions
 lake+0.026
 camp+0.026
 canyon+0.025
neath+0.025
 ocean+0.025
Neg logit contributions
Of-0.030
olding-0.027
aid-0.027
 forgiveness-0.027
 apology-0.027
Tokenomm
Token ID2002
Feature activation+4.95e+00

Pos logit contributions
Of+0.028
aid+0.027
 forgiveness+0.026
olding+0.026
 apology+0.026
Neg logit contributions
 lake-0.027
 camp-0.026
neath-0.026
 ocean-0.026
 shore-0.026
Tokena
Token ID64
Feature activation+4.59e-01

Pos logit contributions
 apology+0.249
 honesty+0.248
 forgiveness+0.245
 facts+0.242
Of+0.234
Neg logit contributions
 set-0.116
 lake-0.114
 camp-0.113
gy-0.112
 tim-0.111
Token saw
Token ID2497
Feature activation-3.55e-01

Pos logit contributions
Of+0.023
aid+0.022
 forgiveness+0.022
olding+0.022
 apology+0.021
Neg logit contributions
 lake-0.020
 camp-0.019
neath-0.019
 shore-0.018
 river-0.018
Token Tim
Token ID5045
Feature activation+8.68e-02

Pos logit contributions
 lake+0.015
 camp+0.015
 canyon+0.014
 creek+0.014
neath+0.014
Neg logit contributions
Of-0.018
aid-0.018
 forgiveness-0.018
olding-0.017
 apology-0.016
Tokenmy
Token ID1820
Feature activation-8.20e-01

Pos logit contributions
Of+0.004
aid+0.004
olding+0.004
 forgiveness+0.003
 apology+0.003
Neg logit contributions
 lake-0.004
 camp-0.004
neath-0.004
 shore-0.004
 creek-0.004
Token and
Token ID290
Feature activation+5.09e-01

Pos logit contributions
 lake+0.038
 camp+0.037
 creek+0.037
neath+0.037
 shore+0.037
Neg logit contributions
olding-0.035
Of-0.035
 forgiveness-0.035
aid-0.034
 else-0.033
Token she
Token ID673
Feature activation+1.78e+00

Pos logit contributions
asing+0.150
aid+0.148
olding+0.146
Deal+0.142
aser+0.141
Neg logit contributions
neath-0.057
 camp-0.056
 shore-0.056
 lake-0.054
 creek-0.053
Token asked
Token ID1965
Feature activation+1.99e+00

Pos logit contributions
Of+0.103
aid+0.102
olding+0.101
 forgiveness+0.099
 apology+0.098
Neg logit contributions
 lake-0.059
 camp-0.056
 river-0.054
 shore-0.054
neath-0.054
Token.
Token ID13
Feature activation+2.06e+00

Pos logit contributions
Of+0.124
aid+0.120
Deal+0.115
olding+0.114
grand+0.113
Neg logit contributions
 camp-0.053
neath-0.053
 shore-0.052
 lake-0.050
 fields-0.049
Token
Token ID198
Feature activation+1.46e+00

Pos logit contributions
asing+0.140
aid+0.138
olding+0.136
 forgiveness+0.135
asers+0.135
Neg logit contributions
neath-0.046
 lake-0.044
 shore-0.042
 creek-0.041
 camp-0.041
Token
Token ID198
Feature activation+1.79e+00

Pos logit contributions
Of+0.091
aid+0.089
 forgiveness+0.089
asing+0.089
olding+0.089
Neg logit contributions
 lake-0.041
neath-0.040
 camp-0.040
 shore-0.038
 clearing-0.036
Token"
Token ID1
Feature activation+4.92e+00

Pos logit contributions
asers+0.091
 forgiveness+0.091
 apology+0.089
asing+0.088
Of+0.087
Neg logit contributions
 lake-0.068
neath-0.065
 clearing-0.064
 creek-0.063
 river-0.063
TokenSure
Token ID19457
Feature activation+1.57e+00

Pos logit contributions
iness+0.207
 supervision+0.200
 actions+0.197
asers+0.192
aser+0.190
Neg logit contributions
 lake-0.198
 creek-0.188
 clearing-0.174
neath-0.172
 field-0.171
Token,
Token ID11
Feature activation+1.67e+00

Pos logit contributions
Of+0.119
aid+0.118
olding+0.116
asing+0.116
asers+0.113
Neg logit contributions
 lake-0.025
 camp-0.024
 shore-0.019
 clearing-0.018
 river-0.018
Token honey
Token ID12498
Feature activation+9.23e-01

Pos logit contributions
aid+0.085
olding+0.085
Of+0.082
asing+0.082
olded+0.080
Neg logit contributions
 lake-0.066
 camp-0.065
 shore-0.061
 ocean-0.060
 river-0.060
Token.
Token ID13
Feature activation+1.26e+00

Pos logit contributions
Of+0.055
aid+0.052
olding+0.052
 forgiveness+0.051
 apology+0.051
Neg logit contributions
 lake-0.031
 camp-0.030
neath-0.028
 shore-0.028
 river-0.028
Token I
Token ID314
Feature activation+1.20e+00

Pos logit contributions
aid+0.063
 forgiveness+0.061
olding+0.061
Of+0.061
 apology+0.059
Neg logit contributions
 lake-0.051
neath-0.049
 camp-0.048
 creek-0.047
 river-0.047
Token pictures
Token ID5986
Feature activation+5.15e-01

Pos logit contributions
Of+0.037
olding+0.036
aid+0.036
 forgiveness+0.036
 apology+0.035
Neg logit contributions
 lake-0.025
 camp-0.024
neath-0.023
 shore-0.023
 ocean-0.023
Token together
Token ID1978
Feature activation-8.80e-01

Pos logit contributions
Of+0.028
aid+0.027
olding+0.027
 forgiveness+0.027
 apology+0.027
Neg logit contributions
 lake-0.019
 camp-0.018
neath-0.018
 shore-0.018
 river-0.018
Token?"
Token ID1701
Feature activation+1.43e+00

Pos logit contributions
 lake+0.033
neath+0.032
 valleys+0.032
 shore+0.032
 canyon+0.032
Neg logit contributions
Of-0.046
 forgiveness-0.045
aid-0.043
olding-0.043
asing-0.042
Token
Token ID198
Feature activation+1.62e+00

Pos logit contributions
asing+0.090
olding+0.090
 forgiveness+0.090
aid+0.088
 apology+0.087
Neg logit contributions
 lake-0.038
 shore-0.035
neath-0.034
 camp-0.034
 river-0.033
Token
Token ID198
Feature activation+2.11e+00

Pos logit contributions
asing+0.104
 forgiveness+0.103
olding+0.102
 apology+0.100
aid+0.099
Neg logit contributions
 lake-0.042
 camp-0.040
neath-0.038
 shore-0.037
 pool-0.036
Token"
Token ID1
Feature activation+4.91e+00

Pos logit contributions
 forgiveness+0.100
 apology+0.095
asers+0.092
asing+0.092
 actions+0.091
Neg logit contributions
 lake-0.086
 river-0.083
 creek-0.081
 clearing-0.080
 shore-0.080
TokenOf
Token ID5189
Feature activation+1.85e+00

Pos logit contributions
 actions+0.213
 supervision+0.208
 instincts+0.203
 apology+0.202
iness+0.200
Neg logit contributions
 lake-0.213
neath-0.195
 clearing-0.189
 creek-0.187
 field-0.180
Token course
Token ID1781
Feature activation+1.09e+00

Pos logit contributions
olding+0.109
Of+0.104
asers+0.104
asing+0.103
olded+0.102
Neg logit contributions
 lake-0.065
neath-0.057
 clearing-0.055
 ocean-0.054
 camp-0.053
Token!"
Token ID2474
Feature activation+1.51e+00

Pos logit contributions
aid+0.071
Of+0.070
olding+0.069
 forgiveness+0.069
 apology+0.068
Neg logit contributions
 lake-0.029
 camp-0.027
 shore-0.026
 river-0.025
neath-0.025
Token said
Token ID531
Feature activation+1.33e-01

Pos logit contributions
olding+0.066
 forgiveness+0.065
asing+0.065
 apology+0.062
aser+0.062
Neg logit contributions
 lake-0.069
neath-0.066
 shore-0.065
 camp-0.065
 pool-0.064
Token Lily
Token ID20037
Feature activation+7.20e-01

Pos logit contributions
Of+0.007
 forgiveness+0.007
aid+0.007
olding+0.007
 apology+0.007
Neg logit contributions
 lake-0.005
neath-0.005
 camp-0.005
 river-0.005
 shore-0.005
Token it
Token ID340
Feature activation-9.41e-02

Pos logit contributions
neath+0.041
 ditch+0.039
 lake+0.039
 canyon+0.038
 clearing+0.038
Neg logit contributions
 forgiveness-0.051
Of-0.050
aid-0.046
 apology-0.046
 else-0.046
Token together
Token ID1978
Feature activation-4.45e-01

Pos logit contributions
 lake+0.003
neath+0.003
 camp+0.003
 shore+0.003
 creek+0.003
Neg logit contributions
Of-0.005
aid-0.005
 forgiveness-0.005
olding-0.005
 apology-0.005
Token?"
Token ID1701
Feature activation+1.40e+00

Pos logit contributions
 lake+0.016
neath+0.016
 camp+0.015
 creek+0.015
 shore+0.015
Neg logit contributions
Of-0.024
 forgiveness-0.024
aid-0.024
olding-0.023
 apology-0.023
Token
Token ID198
Feature activation+1.03e+00

Pos logit contributions
olding+0.090
aid+0.088
Of+0.087
 forgiveness+0.087
aser+0.087
Neg logit contributions
 lake-0.040
 camp-0.039
 creek-0.036
neath-0.036
 shore-0.036
Token
Token ID198
Feature activation+1.83e+00

Pos logit contributions
Of+0.060
olding+0.060
 forgiveness+0.059
aid+0.058
 apology+0.057
Neg logit contributions
 lake-0.034
 camp-0.034
neath-0.032
 shore-0.031
 creek-0.031
Token"
Token ID1
Feature activation+4.86e+00

Pos logit contributions
olding+0.097
 apology+0.095
 forgiveness+0.093
aser+0.092
asers+0.089
Neg logit contributions
 lake-0.074
 camp-0.072
 clearing-0.069
 creek-0.067
 valleys-0.067
TokenYes
Token ID5297
Feature activation+5.88e-01

Pos logit contributions
aser+0.196
asers+0.189
olds+0.189
olding+0.185
 actions+0.184
Neg logit contributions
 lake-0.236
 clearing-0.206
 pool-0.201
 pond-0.199
 creek-0.197
Token,
Token ID11
Feature activation+2.82e+00

Pos logit contributions
Of+0.038
aid+0.038
olding+0.037
 forgiveness+0.037
 apology+0.036
Neg logit contributions
 lake-0.016
 camp-0.015
neath-0.015
 shore-0.015
 river-0.014
Token we
Token ID356
Feature activation+5.30e-01

Pos logit contributions
olding+0.154
kind+0.149
asing+0.148
aid+0.148
asers+0.147
Neg logit contributions
 camp-0.113
 lake-0.101
 pool-0.094
 river-0.089
 creek-0.086
Token can
Token ID460
Feature activation-1.51e-01

Pos logit contributions
Of+0.022
 forgiveness+0.022
olding+0.022
aid+0.022
 apology+0.021
Neg logit contributions
 lake-0.027
 camp-0.026
neath-0.026
 shore-0.025
 river-0.025
Token,
Token ID11
Feature activation+1.51e+00

Pos logit contributions
 lake+0.007
neath+0.007
 camp+0.007
 creek+0.007
 canyon+0.007
Neg logit contributions
Of-0.007
aid-0.007
olding-0.007
 forgiveness-0.007
 apology-0.007
Token Inspired
Token ID45827
Feature activation+7.07e-01

Pos logit contributions
Of+0.026
aid+0.026
olding+0.025
 forgiveness+0.025
 apology+0.025
Neg logit contributions
 lake-0.016
neath-0.015
 camp-0.015
 shore-0.015
 creek-0.014
Token,
Token ID11
Feature activation+1.78e+00

Pos logit contributions
Of+0.040
aid+0.040
olding+0.039
 forgiveness+0.039
 apology+0.038
Neg logit contributions
 lake-0.025
neath-0.024
 camp-0.024
 ocean-0.023
 creek-0.023
Token Emily
Token ID17608
Feature activation-9.86e-02

Pos logit contributions
aid+0.095
olding+0.095
Of+0.092
 apology+0.089
Deal+0.089
Neg logit contributions
neath-0.069
 lake-0.068
 camp-0.068
 creek-0.066
 fields-0.065
Token asked
Token ID1965
Feature activation+5.50e-01

Pos logit contributions
 lake+0.005
 camp+0.005
neath+0.005
 shore+0.005
 creek+0.005
Neg logit contributions
Of-0.004
aid-0.004
 forgiveness-0.004
olding-0.004
 apology-0.004
Token her
Token ID607
Feature activation+8.93e-01

Pos logit contributions
Of+0.030
aid+0.030
olding+0.029
 forgiveness+0.029
 apology+0.028
Neg logit contributions
 lake-0.021
 camp-0.020
neath-0.020
 shore-0.019
 creek-0.019
Token pap
Token ID20461
Feature activation+4.80e+00

Pos logit contributions
aid+0.038
Of+0.038
olding+0.037
 forgiveness+0.036
 apology+0.035
Neg logit contributions
 lake-0.045
 camp-0.045
 shore-0.044
 creek-0.043
 ocean-0.043
Tokena
Token ID64
Feature activation+1.84e+00

Pos logit contributions
 apology+0.213
 supervision+0.212
 importantly+0.211
Of+0.211
 meanings+0.211
Neg logit contributions
 clearing-0.138
 lake-0.129
 ocean-0.126
 reef-0.119
 bay-0.118
Token,
Token ID11
Feature activation+9.03e-01

Pos logit contributions
aid+0.089
Of+0.089
olding+0.085
 forgiveness+0.084
 apology+0.084
Neg logit contributions
 camp-0.079
 lake-0.078
 creek-0.074
neath-0.073
 river-0.073
Token "
Token ID366
Feature activation+1.29e+00

Pos logit contributions
aid+0.048
Of+0.047
 forgiveness+0.047
olding+0.047
 apology+0.046
Neg logit contributions
 lake-0.034
neath-0.033
 camp-0.033
 shore-0.032
 river-0.032
TokenHow
Token ID2437
Feature activation+7.95e-01

Pos logit contributions
olding+0.060
 apology+0.059
aid+0.058
 forgiveness+0.058
Of+0.056
Neg logit contributions
 lake-0.061
neath-0.060
 clearing-0.057
 shore-0.056
 ocean-0.056
Token can
Token ID460
Feature activation+1.05e+00

Pos logit contributions
Of+0.037
aid+0.036
 forgiveness+0.035
olding+0.035
 apology+0.034
Neg logit contributions
 lake-0.036
 camp-0.036
neath-0.035
 creek-0.034
 river-0.034
Token be
Token ID307
Feature activation+2.74e-01

Pos logit contributions
neath+0.022
 lake+0.022
 valleys+0.021
 camp+0.021
 canyon+0.021
Neg logit contributions
Of-0.026
 forgiveness-0.025
aid-0.025
olding-0.024
 apology-0.024
Token friends
Token ID2460
Feature activation+1.32e+00

Pos logit contributions
Of+0.009
aid+0.009
 forgiveness+0.008
olding+0.008
 apology+0.008
Neg logit contributions
 lake-0.016
neath-0.016
 camp-0.016
 shore-0.016
 creek-0.016
Token?"
Token ID1701
Feature activation+1.63e+00

Pos logit contributions
Of+0.087
olding+0.086
aid+0.085
 apology+0.084
 forgiveness+0.083
Neg logit contributions
 lake-0.034
 camp-0.032
neath-0.030
 ocean-0.030
 river-0.029
Token
Token ID198
Feature activation+1.18e+00

Pos logit contributions
olding+0.109
asing+0.108
aser+0.106
 forgiveness+0.105
 apology+0.104
Neg logit contributions
 lake-0.040
 shore-0.037
neath-0.035
 camp-0.035
 river-0.034
Token
Token ID198
Feature activation+1.34e+00

Pos logit contributions
Of+0.070
olding+0.069
 forgiveness+0.068
asing+0.067
aid+0.066
Neg logit contributions
 lake-0.038
 camp-0.037
 shore-0.035
neath-0.034
 river-0.034
Token"
Token ID1
Feature activation+4.77e+00

Pos logit contributions
olding+0.057
 forgiveness+0.057
Of+0.055
 apology+0.055
asing+0.054
Neg logit contributions
 lake-0.066
 river-0.061
 camp-0.061
 valleys-0.061
 clearing-0.061
TokenOf
Token ID5189
Feature activation+2.15e+00

Pos logit contributions
grand+0.172
olding+0.165
olds+0.163
iness+0.162
 actions+0.162
Neg logit contributions
 lake-0.240
neath-0.209
 creek-0.207
 river-0.206
 pond-0.205
Token course
Token ID1781
Feature activation+1.43e+00

Pos logit contributions
olding+0.137
Of+0.132
olded+0.128
asing+0.127
asers+0.126
Neg logit contributions
 lake-0.068
neath-0.056
 camp-0.055
 ocean-0.054
 clearing-0.053
Token,
Token ID11
Feature activation+1.37e+00

Pos logit contributions
aid+0.093
Of+0.092
olding+0.090
 forgiveness+0.089
asing+0.089
Neg logit contributions
 lake-0.038
 camp-0.036
 shore-0.033
 river-0.033
 pool-0.031
Token oct
Token ID19318
Feature activation+5.24e-01

Pos logit contributions
aid+0.061
olding+0.061
Of+0.059
asing+0.057
Deal+0.057
Neg logit contributions
 lake-0.067
 camp-0.064
 shore-0.063
 river-0.062
 ocean-0.061
Tokenopus
Token ID25790
Feature activation+1.61e+00

Pos logit contributions
Of+0.023
 forgiveness+0.022
aid+0.022
olding+0.021
 apology+0.021
Neg logit contributions
 lake-0.026
 camp-0.025
neath-0.025
 shore-0.024
 creek-0.024
Token Ben
Token ID3932
Feature activation+1.52e+00

Pos logit contributions
Of+0.007
 forgiveness+0.007
aid+0.007
olding+0.007
 apology+0.007
Neg logit contributions
 lake-0.005
neath-0.004
 camp-0.004
 shore-0.004
 canyon-0.004
Token asked
Token ID1965
Feature activation+1.86e+00

Pos logit contributions
Of+0.075
aid+0.075
olding+0.074
 forgiveness+0.073
 apology+0.072
Neg logit contributions
 lake-0.063
 ocean-0.058
 camp-0.058
 river-0.058
 shore-0.058
Token.
Token ID13
Feature activation+1.53e+00

Pos logit contributions
Of+0.114
aid+0.111
Deal+0.108
olding+0.107
asing+0.105
Neg logit contributions
 camp-0.054
 shore-0.052
neath-0.051
 lake-0.051
 pool-0.049
Token
Token ID198
Feature activation+1.21e+00

Pos logit contributions
asing+0.099
 forgiveness+0.097
aid+0.097
olding+0.096
Deal+0.094
Neg logit contributions
 lake-0.041
 camp-0.039
neath-0.038
 shore-0.038
 river-0.038
Token
Token ID198
Feature activation+1.18e+00

Pos logit contributions
Of+0.073
 forgiveness+0.071
aid+0.071
olding+0.071
asing+0.070
Neg logit contributions
 lake-0.037
 camp-0.036
neath-0.035
 shore-0.034
 river-0.033
Token"
Token ID1
Feature activation+4.77e+00

Pos logit contributions
 forgiveness+0.054
Of+0.054
 apology+0.053
olding+0.053
aid+0.052
Neg logit contributions
 lake-0.054
 camp-0.051
 river-0.051
 creek-0.050
neath-0.050
TokenOf
Token ID5189
Feature activation+1.90e+00

Pos logit contributions
iness+0.201
 actions+0.200
aser+0.199
asers+0.198
 forgiveness+0.196
Neg logit contributions
 lake-0.201
 creek-0.191
neath-0.179
 river-0.176
 pond-0.172
Token course
Token ID1781
Feature activation+1.36e+00

Pos logit contributions
olding+0.115
Of+0.112
asing+0.110
asers+0.109
olded+0.109
Neg logit contributions
 lake-0.066
neath-0.058
 clearing-0.055
 ocean-0.055
 shore-0.054
Token we
Token ID356
Feature activation+5.48e-01

Pos logit contributions
aid+0.081
Of+0.079
olding+0.077
asing+0.075
 forgiveness+0.075
Neg logit contributions
 lake-0.045
 camp-0.044
 shore-0.042
 river-0.040
 pool-0.040
Token can
Token ID460
Feature activation-1.56e-01

Pos logit contributions
Of+0.024
aid+0.023
 forgiveness+0.023
olding+0.023
 apology+0.022
Neg logit contributions
 lake-0.027
 camp-0.026
neath-0.026
 shore-0.025
 river-0.025
Token.
Token ID13
Feature activation+1.25e+00

Pos logit contributions
 lake+0.007
neath+0.007
 camp+0.007
 canyon+0.006
 shore+0.006
Neg logit contributions
Of-0.007
olding-0.007
 forgiveness-0.007
aid-0.007
 apology-0.007
Token?"
Token ID1701
Feature activation+6.29e-01

Pos logit contributions
 lake+0.029
neath+0.028
 valleys+0.028
 shore+0.028
 canyon+0.028
Neg logit contributions
Of-0.038
 forgiveness-0.037
aid-0.037
olding-0.036
aser-0.035
Token Her
Token ID2332
Feature activation+1.20e+00

Pos logit contributions
Of+0.030
aid+0.030
olding+0.029
 forgiveness+0.029
 apology+0.028
Neg logit contributions
 lake-0.028
neath-0.027
 camp-0.027
 shore-0.026
 river-0.026
Token dad
Token ID9955
Feature activation+1.33e+00

Pos logit contributions
aid+0.057
Of+0.055
olding+0.053
 forgiveness+0.052
asing+0.052
Neg logit contributions
 lake-0.058
 shore-0.057
 ocean-0.056
 camp-0.055
 pool-0.053
Token replied
Token ID8712
Feature activation+1.20e+00

Pos logit contributions
olding+0.054
Of+0.054
aid+0.054
 forgiveness+0.054
 apology+0.053
Neg logit contributions
 lake-0.066
neath-0.064
 camp-0.062
 river-0.060
 clearing-0.060
Token,
Token ID11
Feature activation+5.88e-02

Pos logit contributions
Of+0.083
aid+0.082
 forgiveness+0.080
olding+0.080
 apology+0.079
Neg logit contributions
 lake-0.025
 camp-0.024
 shore-0.024
neath-0.023
 river-0.022
Token "
Token ID366
Feature activation+4.77e+00

Pos logit contributions
Of+0.003
aid+0.003
olding+0.003
 forgiveness+0.003
 apology+0.003
Neg logit contributions
 lake-0.003
neath-0.003
 camp-0.003
 shore-0.003
 creek-0.002
TokenNo
Token ID2949
Feature activation+1.11e+00

Pos logit contributions
 forgiveness+0.222
 apology+0.219
 actions+0.218
 supervision+0.217
 instincts+0.199
Neg logit contributions
 lake-0.205
 river-0.192
 clearing-0.188
 hills-0.186
 valleys-0.183
Token,
Token ID11
Feature activation+1.68e+00

Pos logit contributions
Of+0.078
aid+0.078
olding+0.075
asing+0.073
 forgiveness+0.072
Neg logit contributions
 lake-0.027
 camp-0.025
 river-0.023
 shore-0.023
 pool-0.022
Token we
Token ID356
Feature activation-2.83e-01

Pos logit contributions
olding+0.087
aid+0.086
Of+0.083
asing+0.082
 forgiveness+0.080
Neg logit contributions
 lake-0.068
 camp-0.065
 river-0.063
 shore-0.062
 pool-0.061
Token can
Token ID460
Feature activation-2.54e-01

Pos logit contributions
 lake+0.015
 camp+0.014
 shore+0.014
neath+0.014
 valleys+0.014
Neg logit contributions
Of-0.011
olding-0.011
aid-0.011
 forgiveness-0.011
 apology-0.010
Token't
Token ID470
Feature activation-1.21e-01

Pos logit contributions
 lake+0.011
 camp+0.010
 valleys+0.010
neath+0.010
 canyon+0.010
Neg logit contributions
Of-0.012
aid-0.012
 forgiveness-0.012
olding-0.012
 apology-0.012
Token Her
Token ID2332
Feature activation+1.79e+00

Pos logit contributions
aid+0.068
olding+0.066
asing+0.065
aser+0.063
asers+0.063
Neg logit contributions
 lake-0.061
neath-0.060
 camp-0.059
 shore-0.058
 pool-0.057
Token mom
Token ID1995
Feature activation+2.12e+00

Pos logit contributions
aid+0.094
asing+0.094
Of+0.091
olding+0.091
olded+0.086
Neg logit contributions
 lake-0.078
 camp-0.076
 pool-0.074
 shore-0.070
 ocean-0.070
Tokenmy
Token ID1820
Feature activation+1.47e+00

Pos logit contributions
 forgiveness+0.129
 apology+0.127
olding+0.127
Of+0.126
asers+0.123
Neg logit contributions
 lake-0.054
neath-0.050
 clearing-0.049
 camp-0.047
 river-0.044
Token replied
Token ID8712
Feature activation+1.14e+00

Pos logit contributions
olding+0.059
Of+0.058
aid+0.058
 forgiveness+0.058
 apology+0.057
Neg logit contributions
 lake-0.073
neath-0.069
 camp-0.068
 river-0.066
 clearing-0.066
Token,
Token ID11
Feature activation+3.22e-01

Pos logit contributions
Of+0.078
aid+0.077
 forgiveness+0.076
olding+0.075
 apology+0.074
Neg logit contributions
 lake-0.024
 camp-0.023
 shore-0.022
neath-0.022
 river-0.020
Token "
Token ID366
Feature activation+4.76e+00

Pos logit contributions
Of+0.016
aid+0.016
olding+0.015
 forgiveness+0.015
 apology+0.015
Neg logit contributions
 lake-0.014
 camp-0.013
neath-0.013
 shore-0.013
 river-0.013
TokenMaybe
Token ID13300
Feature activation+9.86e-01

Pos logit contributions
 apology+0.212
 actions+0.211
aser+0.211
 forgiveness+0.209
asers+0.205
Neg logit contributions
 lake-0.189
 clearing-0.181
neath-0.167
 field-0.167
 hills-0.162
Token another
Token ID1194
Feature activation+9.07e-01

Pos logit contributions
aid+0.047
Of+0.046
olding+0.045
 forgiveness+0.043
 apology+0.043
Neg logit contributions
 lake-0.045
 camp-0.044
 shore-0.044
 ocean-0.042
 river-0.042
Token day
Token ID1110
Feature activation+8.35e-01

Pos logit contributions
aid+0.046
Of+0.046
 forgiveness+0.044
olding+0.044
asing+0.043
Neg logit contributions
 lake-0.039
 camp-0.038
 shore-0.037
 ocean-0.036
 river-0.036
Token,
Token ID11
Feature activation+1.53e+00

Pos logit contributions
Of+0.042
aid+0.041
 forgiveness+0.040
olding+0.040
 apology+0.039
Neg logit contributions
 lake-0.035
 camp-0.034
neath-0.033
 shore-0.033
 river-0.032
Token sweet
Token ID6029
Feature activation+3.95e-01

Pos logit contributions
olding+0.072
aid+0.072
Of+0.069
 apology+0.067
asing+0.067
Neg logit contributions
 lake-0.068
 camp-0.067
 river-0.063
 shore-0.063
neath-0.062
Tokenmy
Token ID1820
Feature activation+2.08e+00

Pos logit contributions
 apology+0.176
olding+0.175
 forgiveness+0.173
asers+0.169
oral+0.169
Neg logit contributions
 lake-0.049
 clearing-0.045
neath-0.043
 camp-0.040
 river-0.037
Token smiled
Token ID13541
Feature activation+6.30e-01

Pos logit contributions
olding+0.094
 apology+0.091
 forgiveness+0.090
olds+0.088
asing+0.088
Neg logit contributions
 lake-0.092
neath-0.086
 camp-0.084
 clearing-0.082
 river-0.081
Token and
Token ID290
Feature activation+1.17e+00

Pos logit contributions
Of+0.040
aid+0.039
olding+0.039
 forgiveness+0.039
 apology+0.038
Neg logit contributions
 lake-0.018
 camp-0.017
neath-0.017
 shore-0.017
 ocean-0.016
Token said
Token ID531
Feature activation+2.08e+00

Pos logit contributions
olding+0.045
 forgiveness+0.043
 apology+0.043
aid+0.042
Of+0.042
Neg logit contributions
 lake-0.061
 camp-0.059
 shore-0.058
neath-0.058
 river-0.057
Token,
Token ID11
Feature activation+9.85e-01

Pos logit contributions
Of+0.156
olding+0.154
aid+0.154
grand+0.151
asing+0.151
Neg logit contributions
 camp-0.018
 lake-0.016
neath-0.016
 shore-0.015
 pool-0.013
Token "
Token ID366
Feature activation+4.75e+00

Pos logit contributions
aid+0.053
 forgiveness+0.053
olding+0.052
Of+0.052
 apology+0.052
Neg logit contributions
 lake-0.036
 shore-0.034
 camp-0.034
neath-0.034
 river-0.033
TokenSure
Token ID19457
Feature activation+1.43e+00

Pos logit contributions
aser+0.192
asers+0.191
 apology+0.190
 supervision+0.190
 actions+0.189
Neg logit contributions
 lake-0.212
 clearing-0.201
neath-0.192
 river-0.190
 field-0.190
Token,
Token ID11
Feature activation+2.09e+00

Pos logit contributions
Of+0.106
aid+0.105
olding+0.103
asing+0.102
asers+0.100
Neg logit contributions
 lake-0.025
 camp-0.025
 shore-0.020
 river-0.019
 pool-0.019
Token let
Token ID1309
Feature activation-1.94e-01

Pos logit contributions
olding+0.104
aid+0.104
olds+0.100
olded+0.099
asing+0.097
Neg logit contributions
 camp-0.084
 lake-0.084
 shore-0.078
 river-0.078
 pool-0.076
Token's
Token ID338
Feature activation-1.56e+00

Pos logit contributions
 lake+0.010
 camp+0.009
 shore+0.009
neath+0.009
 creek+0.009
Neg logit contributions
Of-0.008
 forgiveness-0.008
aid-0.008
olding-0.008
 apology-0.008
Token go
Token ID467
Feature activation-1.05e+00

Pos logit contributions
 valleys+0.075
neath+0.071
 clearing+0.070
 canyon+0.068
 lake+0.068
Neg logit contributions
 forgiveness-0.067
Of-0.065
 forgive-0.065
 promise-0.064
aid-0.063
Token my
Token ID616
Feature activation+5.43e-01

Pos logit contributions
neath+0.031
 valleys+0.030
 lake+0.029
 canyon+0.029
 camp+0.029
Neg logit contributions
Of-0.040
 forgiveness-0.038
olding-0.037
 apology-0.037
aid-0.036
Token love
Token ID1842
Feature activation+4.02e-01

Pos logit contributions
Of+0.021
aid+0.021
 forgiveness+0.020
olding+0.020
 apology+0.020
Neg logit contributions
 lake-0.029
 camp-0.028
neath-0.028
 shore-0.027
 river-0.027
Token,"
Token ID553
Feature activation+1.47e+00

Pos logit contributions
Of+0.021
aid+0.020
olding+0.020
 forgiveness+0.020
 apology+0.019
Neg logit contributions
 lake-0.017
 camp-0.016
neath-0.016
 shore-0.016
 river-0.015
Token said
Token ID531
Feature activation+4.40e-02

Pos logit contributions
olding+0.081
 apology+0.077
asing+0.076
Of+0.076
 forgiveness+0.075
Neg logit contributions
 lake-0.055
 camp-0.053
 shore-0.052
neath-0.051
 river-0.050
Token M
Token ID337
Feature activation-4.30e-02

Pos logit contributions
Of+0.002
 forgiveness+0.002
aid+0.002
olding+0.002
 apology+0.002
Neg logit contributions
 lake-0.002
neath-0.002
 camp-0.002
 shore-0.001
 river-0.001
Tokenomm
Token ID2002
Feature activation+4.74e+00

Pos logit contributions
 lake+0.002
neath+0.002
 camp+0.002
 shore+0.002
 river+0.002
Neg logit contributions
Of-0.002
olding-0.002
 forgiveness-0.002
aid-0.002
 apology-0.002
Tokena
Token ID64
Feature activation+1.70e+00

Pos logit contributions
 interests+0.213
Of+0.205
Deal+0.203
 narrative+0.202
 passions+0.202
Neg logit contributions
 tim-0.163
 camp-0.149
 clearing-0.142
 polar-0.137
 lake-0.135
Token.
Token ID13
Feature activation-4.61e-01

Pos logit contributions
Of+0.097
 forgiveness+0.094
aid+0.093
 apology+0.093
olding+0.093
Neg logit contributions
 lake-0.054
 camp-0.052
neath-0.051
 shore-0.051
 river-0.050
Token "
Token ID366
Feature activation+2.11e+00

Pos logit contributions
 lake+0.022
 camp+0.021
neath+0.021
 ocean+0.021
 canyon+0.021
Neg logit contributions
Of-0.022
olding-0.020
aid-0.020
 forgiveness-0.020
 apology-0.019
TokenLet
Token ID5756
Feature activation+1.98e-01

Pos logit contributions
 apology+0.102
olding+0.100
 forgiveness+0.099
aser+0.097
olded+0.097
Neg logit contributions
 lake-0.087
neath-0.087
 valleys-0.083
 ocean-0.081
 shore-0.081
Token's
Token ID338
Feature activation-1.73e+00

Pos logit contributions
Of+0.009
 forgiveness+0.009
aid+0.009
olding+0.009
 apology+0.009
Neg logit contributions
 lake-0.009
 camp-0.009
neath-0.009
 shore-0.008
 creek-0.008
Token own
Token ID898
Feature activation+5.36e-01

Pos logit contributions
aid+0.086
Of+0.084
olding+0.083
olded+0.077
asing+0.077
Neg logit contributions
 lake-0.080
 camp-0.080
 ocean-0.078
 pool-0.075
 shore-0.075
Token story
Token ID1621
Feature activation+1.38e+00

Pos logit contributions
Of+0.021
aid+0.020
olding+0.020
 forgiveness+0.020
 apology+0.019
Neg logit contributions
 lake-0.029
 camp-0.028
neath-0.027
 shore-0.027
 ocean-0.027
Token?"
Token ID1701
Feature activation+1.40e+00

Pos logit contributions
Of+0.072
aid+0.070
olding+0.070
 forgiveness+0.070
 apology+0.069
Neg logit contributions
 camp-0.054
 lake-0.053
neath-0.050
 pool-0.050
 shore-0.049
Token
Token ID198
Feature activation+1.40e+00

Pos logit contributions
olding+0.090
aid+0.089
asing+0.088
 forgiveness+0.087
 apology+0.087
Neg logit contributions
 lake-0.037
neath-0.036
 camp-0.035
 shore-0.035
 creek-0.034
Token
Token ID198
Feature activation+1.61e+00

Pos logit contributions
olding+0.088
Of+0.087
asing+0.087
 forgiveness+0.085
aid+0.085
Neg logit contributions
 lake-0.039
 camp-0.038
neath-0.037
 shore-0.036
 river-0.034
Token"
Token ID1
Feature activation+4.74e+00

Pos logit contributions
olding+0.077
 apology+0.077
asers+0.076
asing+0.075
 forgiveness+0.075
Neg logit contributions
 lake-0.069
neath-0.066
 camp-0.065
 shore-0.065
 river-0.064
TokenOf
Token ID5189
Feature activation+1.99e+00

Pos logit contributions
 actions+0.199
iness+0.194
 instincts+0.190
 supervision+0.185
 behavior+0.179
Neg logit contributions
 lake-0.205
 creek-0.183
 clearing-0.182
 pool-0.181
neath-0.180
Token course
Token ID1781
Feature activation+1.58e+00

Pos logit contributions
olding+0.122
Of+0.117
asing+0.116
olded+0.114
asers+0.114
Neg logit contributions
 lake-0.067
neath-0.058
 clearing-0.055
 camp-0.055
 rav-0.052
Token,
Token ID11
Feature activation+2.73e+00

Pos logit contributions
aid+0.111
Of+0.108
olding+0.107
asing+0.106
 forgiveness+0.105
Neg logit contributions
 lake-0.033
 camp-0.033
 shore-0.028
 pool-0.027
 river-0.027
Token we
Token ID356
Feature activation+6.74e-01

Pos logit contributions
olding+0.154
kind+0.146
asing+0.146
olds+0.146
oral+0.146
Neg logit contributions
 camp-0.088
 lake-0.086
 pool-0.079
 river-0.075
 creek-0.072
Token can
Token ID460
Feature activation-3.31e-02

Pos logit contributions
Of+0.030
 forgiveness+0.029
olding+0.029
aid+0.029
 apology+0.028
Neg logit contributions
 lake-0.032
neath-0.031
 camp-0.031
 river-0.030
 shore-0.030
Token of
Token ID286
Feature activation+1.51e+00

Pos logit contributions
Of+0.125
olding+0.124
 forgiveness+0.115
Deal+0.114
aid+0.113
Neg logit contributions
 lake-0.083
 ocean-0.080
 camp-0.078
 river-0.077
 pool-0.077
Token yours
Token ID12431
Feature activation+1.04e+00

Pos logit contributions
olding+0.097
Of+0.096
aid+0.092
Deal+0.090
olded+0.089
Neg logit contributions
 lake-0.045
 shore-0.042
 camp-0.042
 ocean-0.040
neath-0.040
Token?"
Token ID1701
Feature activation+6.46e-01

Pos logit contributions
Of+0.066
olding+0.064
aid+0.064
 forgiveness+0.063
 apology+0.063
Neg logit contributions
 lake-0.030
 camp-0.029
neath-0.028
 shore-0.027
 ocean-0.027
Token\n
Token ID198
Feature activation+8.31e-01

Pos logit contributions
Of+0.036
aid+0.036
olding+0.036
 forgiveness+0.036
 apology+0.035
Neg logit contributions
 lake-0.023
 camp-0.022
neath-0.022
 shore-0.022
 creek-0.021
Token\n
Token ID198
Feature activation+5.28e-01

Pos logit contributions
Of+0.047
 forgiveness+0.046
aid+0.046
olding+0.046
 apology+0.045
Neg logit contributions
 lake-0.029
 camp-0.028
neath-0.028
 shore-0.027
 river-0.026
Token"
Token ID1
Feature activation+4.73e+00

Pos logit contributions
Of+0.023
 forgiveness+0.022
aid+0.022
olding+0.022
 apology+0.021
Neg logit contributions
 lake-0.026
neath-0.025
 camp-0.025
 shore-0.025
 river-0.025
TokenOf
Token ID5189
Feature activation+1.91e+00

Pos logit contributions
 actions+0.215
 instincts+0.213
 forgiveness+0.211
iness+0.210
 apology+0.196
Neg logit contributions
 lake-0.181
neath-0.177
 creek-0.168
 clearing-0.164
 hills-0.162
Token course
Token ID1781
Feature activation+1.54e+00

Pos logit contributions
olding+0.116
Of+0.113
asing+0.109
olded+0.108
asers+0.108
Neg logit contributions
 lake-0.064
neath-0.057
 clearing-0.053
 camp-0.052
 rav-0.051
Token,
Token ID11
Feature activation+1.64e+00

Pos logit contributions
aid+0.101
Of+0.099
olding+0.097
 forgiveness+0.096
asing+0.095
Neg logit contributions
 lake-0.040
 camp-0.039
 shore-0.035
 pool-0.034
 river-0.034
Token you
Token ID345
Feature activation+6.71e-01

Pos logit contributions
aid+0.077
olding+0.076
Of+0.074
 forgiveness+0.074
oral+0.071
Neg logit contributions
 lake-0.072
 camp-0.069
 shore-0.066
 pool-0.065
 ocean-0.064
Token can
Token ID460
Feature activation+1.07e+00

Pos logit contributions
Of+0.030
aid+0.030
 forgiveness+0.030
olding+0.029
 apology+0.028
Neg logit contributions
 lake-0.032
neath-0.031
 camp-0.030
 river-0.030
 shore-0.029
Tokenmy
Token ID1820
Feature activation+1.95e+00

Pos logit contributions
 apology+0.140
 forgiveness+0.140
olding+0.137
oral+0.136
grand+0.133
Neg logit contributions
 lake-0.058
neath-0.053
 clearing-0.052
 camp-0.048
 pool-0.047
Token smiled
Token ID13541
Feature activation+7.44e-01

Pos logit contributions
olding+0.085
 apology+0.085
 forgiveness+0.085
grand+0.080
aid+0.080
Neg logit contributions
 lake-0.087
neath-0.082
 camp-0.078
 clearing-0.077
 river-0.077
Token and
Token ID290
Feature activation+1.14e+00

Pos logit contributions
Of+0.048
aid+0.047
olding+0.047
 forgiveness+0.047
 apology+0.046
Neg logit contributions
 lake-0.021
 camp-0.019
neath-0.019
 shore-0.019
 ocean-0.019
Token said
Token ID531
Feature activation+1.85e+00

Pos logit contributions
olding+0.044
 forgiveness+0.043
 apology+0.042
aid+0.041
asing+0.040
Neg logit contributions
 lake-0.060
neath-0.058
 shore-0.057
 camp-0.057
 river-0.056
Token,
Token ID11
Feature activation+4.23e-01

Pos logit contributions
aid+0.132
Of+0.131
olding+0.129
 forgiveness+0.129
 apology+0.128
Neg logit contributions
 lake-0.027
 camp-0.026
 shore-0.025
neath-0.025
 pool-0.023
Token "
Token ID366
Feature activation+4.72e+00

Pos logit contributions
Of+0.021
aid+0.021
 forgiveness+0.021
olding+0.021
 apology+0.020
Neg logit contributions
 lake-0.018
 camp-0.017
neath-0.017
 shore-0.017
 river-0.016
TokenSure
Token ID19457
Feature activation+1.28e+00

Pos logit contributions
aser+0.202
 apology+0.201
 actions+0.198
 forgiveness+0.198
asers+0.197
Neg logit contributions
 lake-0.210
 clearing-0.192
neath-0.182
 river-0.179
 valleys-0.179
Token,
Token ID11
Feature activation+1.74e+00

Pos logit contributions
aid+0.092
Of+0.091
olding+0.089
asing+0.088
 forgiveness+0.086
Neg logit contributions
 lake-0.026
 camp-0.025
 shore-0.022
 river-0.021
 pool-0.020
Token let
Token ID1309
Feature activation-4.92e-01

Pos logit contributions
aid+0.082
olding+0.082
olds+0.078
asing+0.077
 apology+0.076
Neg logit contributions
 lake-0.076
 camp-0.075
 shore-0.070
 river-0.070
 pool-0.069
Token's
Token ID338
Feature activation-1.24e+00

Pos logit contributions
 lake+0.026
 creek+0.025
 canyon+0.025
 camp+0.025
 valleys+0.025
Neg logit contributions
Of-0.019
 forgiveness-0.018
aid-0.018
 apology-0.018
olding-0.018
Token have
Token ID423
Feature activation+9.39e-02

Pos logit contributions
 valleys+0.055
neath+0.053
 clearing+0.052
 lake+0.052
 canyon+0.052
Neg logit contributions
Of-0.056
 forgiveness-0.056
aid-0.055
 apology-0.053
olding-0.052
Token.
Token ID13
Feature activation-8.27e-01

Pos logit contributions
Of+0.004
 forgiveness+0.004
aid+0.004
olding+0.004
 apology+0.004
Neg logit contributions
 lake-0.004
neath-0.004
 camp-0.004
 shore-0.004
 river-0.004
Token Ranger
Token ID21888
Feature activation+5.80e-02

Pos logit contributions
neath+0.033
 lake+0.031
 valleys+0.029
 shore+0.029
 camp+0.029
Neg logit contributions
Of-0.044
olding-0.042
 forgiveness-0.042
 apology-0.040
aid-0.040
Token?"
Token ID1701
Feature activation-6.62e-02

Pos logit contributions
Of+0.003
aid+0.003
olding+0.003
 forgiveness+0.003
 apology+0.003
Neg logit contributions
 lake-0.002
neath-0.002
 camp-0.002
 shore-0.002
 creek-0.002
Token\n
Token ID198
Feature activation+3.04e-01

Pos logit contributions
 lake+0.003
neath+0.003
 camp+0.003
 canyon+0.003
 shore+0.003
Neg logit contributions
Of-0.003
 forgiveness-0.003
aid-0.003
olding-0.003
 apology-0.003
Token\n
Token ID198
Feature activation-1.89e-01

Pos logit contributions
Of+0.016
aid+0.015
olding+0.015
 forgiveness+0.015
 apology+0.015
Neg logit contributions
 lake-0.012
 camp-0.012
neath-0.012
 shore-0.011
 river-0.011
Token"
Token ID1
Feature activation+4.71e+00

Pos logit contributions
 lake+0.010
neath+0.010
 camp+0.010
 ocean+0.010
 shore+0.010
Neg logit contributions
Of-0.007
aid-0.007
 forgiveness-0.007
olding-0.007
 apology-0.006
TokenSure
Token ID19457
Feature activation+1.58e+00

Pos logit contributions
 supervision+0.181
 actions+0.178
iness+0.175
aser+0.169
asing+0.169
Neg logit contributions
 lake-0.234
 creek-0.222
 clearing-0.208
 river-0.204
neath-0.202
Token,
Token ID11
Feature activation+1.37e+00

Pos logit contributions
Of+0.117
asing+0.115
olding+0.115
aid+0.115
asers+0.111
Neg logit contributions
 lake-0.029
 camp-0.029
 shore-0.023
 river-0.022
 clearing-0.021
Token Lily
Token ID20037
Feature activation+9.45e-01

Pos logit contributions
olding+0.063
aid+0.062
Of+0.061
asing+0.060
 apology+0.059
Neg logit contributions
 lake-0.062
 camp-0.062
 shore-0.058
 river-0.057
 creek-0.057
Token,
Token ID11
Feature activation+6.47e-01

Pos logit contributions
Of+0.056
 forgiveness+0.054
aid+0.054
olding+0.053
 apology+0.052
Neg logit contributions
 lake-0.031
 camp-0.030
neath-0.028
 river-0.028
 shore-0.028
Token but
Token ID475
Feature activation+9.05e-01

Pos logit contributions
Of+0.026
aid+0.026
olding+0.025
 forgiveness+0.025
 apology+0.024
Neg logit contributions
 lake-0.034
 camp-0.033
neath-0.032
 shore-0.032
 river-0.031
Token Lily
Token ID20037
Feature activation+1.21e-01

Pos logit contributions
aid+0.054
olding+0.053
Of+0.053
asing+0.052
 forgiveness+0.052
Neg logit contributions
 lake-0.039
 camp-0.039
 shore-0.037
neath-0.037
 creek-0.037
Token asks
Token ID7893
Feature activation+1.56e+00

Pos logit contributions
Of+0.005
aid+0.005
olding+0.005
 forgiveness+0.005
 apology+0.005
Neg logit contributions
 lake-0.006
neath-0.006
 camp-0.006
 shore-0.006
 river-0.006
Token.
Token ID13
Feature activation+1.46e+00

Pos logit contributions
Of+0.092
aid+0.089
olding+0.085
Deal+0.083
 apology+0.083
Neg logit contributions
 camp-0.052
 shore-0.051
 lake-0.051
neath-0.050
 creek-0.048
Token\n
Token ID198
Feature activation+4.99e-01

Pos logit contributions
aid+0.092
olding+0.092
 forgiveness+0.091
asing+0.091
 apology+0.090
Neg logit contributions
 lake-0.041
neath-0.040
 camp-0.039
 shore-0.039
 creek-0.038
Token\n
Token ID198
Feature activation+5.34e-01

Pos logit contributions
Of+0.027
aid+0.026
olding+0.026
 forgiveness+0.026
 apology+0.025
Neg logit contributions
 lake-0.019
 camp-0.018
neath-0.018
 shore-0.018
 creek-0.017
Token"
Token ID1
Feature activation+4.71e+00

Pos logit contributions
Of+0.023
aid+0.022
 forgiveness+0.022
olding+0.022
 apology+0.021
Neg logit contributions
 lake-0.027
 camp-0.026
neath-0.026
 shore-0.025
 creek-0.025
TokenYes
Token ID5297
Feature activation+7.23e-01

Pos logit contributions
aser+0.196
grand+0.191
asers+0.189
iness+0.189
asing+0.185
Neg logit contributions
 lake-0.199
 creek-0.186
 shore-0.175
 river-0.174
 pond-0.172
Token,
Token ID11
Feature activation+2.72e+00

Pos logit contributions
Of+0.048
aid+0.048
olding+0.047
 forgiveness+0.046
asing+0.045
Neg logit contributions
 lake-0.019
 camp-0.018
neath-0.017
 shore-0.017
 river-0.016
Token you
Token ID345
Feature activation+8.58e-01

Pos logit contributions
olding+0.143
aid+0.143
kind+0.142
asers+0.138
olds+0.137
Neg logit contributions
 lake-0.099
 camp-0.093
 ocean-0.092
 shore-0.092
 pool-0.090
Token can
Token ID460
Feature activation+6.32e-01

Pos logit contributions
Of+0.042
olding+0.041
aid+0.041
 forgiveness+0.040
 apology+0.038
Neg logit contributions
 lake-0.037
 camp-0.036
neath-0.036
 river-0.035
 shore-0.035
Token.
Token ID13
Feature activation+1.58e+00

Pos logit contributions
Of+0.033
aid+0.032
olding+0.032
 forgiveness+0.032
 apology+0.031
Neg logit contributions
 lake-0.025
 camp-0.024
neath-0.024
 shore-0.024
 river-0.023
Token waves
Token ID9813
Feature activation+1.16e-01

Pos logit contributions
neath+0.071
 tim+0.067
 Everest+0.061
 canyon+0.060
 clearing+0.060
Neg logit contributions
Of-0.105
 forgiveness-0.102
 else-0.099
 apology-0.098
aid-0.098
Token again
Token ID757
Feature activation-1.05e+00

Pos logit contributions
Of+0.006
aid+0.006
olding+0.006
 forgiveness+0.006
 apology+0.006
Neg logit contributions
 lake-0.004
neath-0.004
 camp-0.004
 shore-0.004
 creek-0.004
Token?"
Token ID1701
Feature activation+1.42e+00

Pos logit contributions
neath+0.042
 lake+0.042
 valleys+0.040
 canyon+0.040
 creek+0.040
Neg logit contributions
Of-0.051
 forgiveness-0.049
aid-0.047
olding-0.047
 else-0.046
Token\n
Token ID198
Feature activation+1.21e+00

Pos logit contributions
olding+0.088
asing+0.086
aid+0.086
aser+0.084
asers+0.084
Neg logit contributions
 lake-0.043
neath-0.041
 camp-0.040
 shore-0.039
 pool-0.038
Token\n
Token ID198
Feature activation+1.35e+00

Pos logit contributions
Of+0.071
olding+0.071
aid+0.070
asing+0.070
 forgiveness+0.069
Neg logit contributions
 lake-0.039
 camp-0.038
neath-0.037
 shore-0.036
 pool-0.034
Token"
Token ID1
Feature activation+4.70e+00

Pos logit contributions
asers+0.047
olding+0.047
 forgiveness+0.047
Of+0.047
aid+0.046
Neg logit contributions
 lake-0.074
neath-0.072
 camp-0.072
 shore-0.070
 river-0.069
TokenOf
Token ID5189
Feature activation+2.00e+00

Pos logit contributions
asers+0.195
 actions+0.192
aser+0.191
olds+0.189
 apology+0.189
Neg logit contributions
 lake-0.221
 clearing-0.205
 pool-0.193
neath-0.190
 creek-0.189
Token course
Token ID1781
Feature activation+9.52e-01

Pos logit contributions
olding+0.118
asers+0.114
asing+0.113
Of+0.111
olded+0.110
Neg logit contributions
 lake-0.070
neath-0.061
 camp-0.059
 clearing-0.059
 ocean-0.058
Token,
Token ID11
Feature activation+1.80e+00

Pos logit contributions
aid+0.058
Of+0.057
olding+0.056
 forgiveness+0.056
 apology+0.055
Neg logit contributions
 lake-0.029
 camp-0.028
 shore-0.027
neath-0.026
 river-0.026
Token we
Token ID356
Feature activation+1.47e-01

Pos logit contributions
olding+0.098
aid+0.096
asing+0.095
olds+0.094
asers+0.091
Neg logit contributions
 lake-0.065
 camp-0.065
 river-0.059
 pool-0.059
 shore-0.058
Token can
Token ID460
Feature activation-1.30e+00

Pos logit contributions
Of+0.006
aid+0.006
olding+0.006
 forgiveness+0.006
 apology+0.005
Neg logit contributions
 lake-0.008
neath-0.008
 camp-0.007
 shore-0.007
 creek-0.007
Token girl
Token ID2576
Feature activation+1.24e+00

Pos logit contributions
Of+0.048
aid+0.048
olding+0.048
asing+0.047
 forgiveness+0.046
Neg logit contributions
 lake-0.057
 camp-0.054
 creek-0.054
 pool-0.054
 river-0.054
Token asked
Token ID1965
Feature activation+1.32e+00

Pos logit contributions
Of+0.059
aid+0.058
 forgiveness+0.058
olding+0.058
 apology+0.058
Neg logit contributions
 lake-0.055
 camp-0.051
neath-0.051
 shore-0.051
 river-0.050
Token.
Token ID13
Feature activation+7.48e-01

Pos logit contributions
Of+0.073
aid+0.072
olding+0.069
 forgiveness+0.068
 apology+0.068
Neg logit contributions
 lake-0.048
 camp-0.048
neath-0.047
 shore-0.046
 valleys-0.045
Token\n
Token ID198
Feature activation+4.17e-01

Pos logit contributions
aid+0.041
Of+0.041
 forgiveness+0.041
olding+0.041
 apology+0.040
Neg logit contributions
 lake-0.027
neath-0.026
 camp-0.026
 shore-0.025
 creek-0.025
Token\n
Token ID198
Feature activation+5.44e-01

Pos logit contributions
Of+0.022
aid+0.022
olding+0.022
 forgiveness+0.022
 apology+0.021
Neg logit contributions
 lake-0.016
 camp-0.016
neath-0.016
 shore-0.015
 river-0.015
Token"
Token ID1
Feature activation+4.68e+00

Pos logit contributions
Of+0.024
olding+0.023
 forgiveness+0.023
aid+0.023
 apology+0.023
Neg logit contributions
 lake-0.027
neath-0.026
 camp-0.026
 shore-0.025
 river-0.025
TokenSure
Token ID19457
Feature activation+1.36e+00

Pos logit contributions
 actions+0.216
iness+0.211
 choices+0.206
 instincts+0.204
asers+0.199
Neg logit contributions
 lake-0.188
neath-0.173
 creek-0.167
Look-0.159
 pond-0.157
Token,
Token ID11
Feature activation+1.63e+00

Pos logit contributions
Of+0.094
aid+0.092
olding+0.092
asing+0.090
 forgiveness+0.089
Neg logit contributions
 lake-0.033
 camp-0.031
 shore-0.028
 clearing-0.027
 river-0.026
Token you
Token ID345
Feature activation+3.31e-01

Pos logit contributions
aid+0.075
olding+0.074
Of+0.072
olds+0.070
olded+0.070
Neg logit contributions
 lake-0.074
 camp-0.072
 shore-0.069
 river-0.068
 ocean-0.067
Token can
Token ID460
Feature activation+3.79e-01

Pos logit contributions
Of+0.014
aid+0.014
 forgiveness+0.013
olding+0.013
 apology+0.013
Neg logit contributions
 lake-0.017
 camp-0.016
neath-0.016
 shore-0.016
 river-0.016
Token play
Token ID711
Feature activation+2.81e-01

Pos logit contributions
Of+0.021
aid+0.021
olding+0.021
 forgiveness+0.021
 apology+0.020
Neg logit contributions
 lake-0.014
 camp-0.013
neath-0.013
 shore-0.013
 river-0.013
Token Ben
Token ID3932
Feature activation+1.55e+00

Pos logit contributions
Of+0.020
aid+0.019
olding+0.019
 forgiveness+0.019
 apology+0.019
Neg logit contributions
 lake-0.013
 camp-0.013
neath-0.013
 shore-0.012
 river-0.012
Token asked
Token ID1965
Feature activation+1.85e+00

Pos logit contributions
Of+0.080
aid+0.078
 forgiveness+0.078
olding+0.078
 apology+0.077
Neg logit contributions
 lake-0.061
 camp-0.057
 ocean-0.057
 river-0.056
 shore-0.056
Token.
Token ID13
Feature activation+1.63e+00

Pos logit contributions
Of+0.110
aid+0.107
Deal+0.105
olding+0.104
grand+0.101
Neg logit contributions
 camp-0.057
 shore-0.056
 lake-0.055
neath-0.053
 valleys-0.051
Token\n
Token ID198
Feature activation+8.38e-01

Pos logit contributions
asing+0.108
aid+0.106
 forgiveness+0.105
olding+0.104
Of+0.103
Neg logit contributions
 lake-0.042
 shore-0.039
 camp-0.039
neath-0.039
 river-0.039
Token\n
Token ID198
Feature activation+7.23e-01

Pos logit contributions
Of+0.048
olding+0.047
 forgiveness+0.047
aid+0.047
 apology+0.045
Neg logit contributions
 lake-0.029
 camp-0.028
neath-0.028
 shore-0.027
 river-0.027
Token"
Token ID1
Feature activation+4.67e+00

Pos logit contributions
Of+0.032
 forgiveness+0.032
olding+0.032
aid+0.031
 apology+0.031
Neg logit contributions
 lake-0.034
 camp-0.033
neath-0.033
 shore-0.033
 river-0.032
TokenOf
Token ID5189
Feature activation+2.08e+00

Pos logit contributions
iness+0.174
asing+0.172
 actions+0.171
aser+0.169
grand+0.169
Neg logit contributions
 lake-0.212
 shore-0.199
 creek-0.198
 river-0.191
neath-0.189
Token course
Token ID1781
Feature activation+1.47e+00

Pos logit contributions
olding+0.131
Of+0.126
asing+0.125
olded+0.123
asers+0.121
Neg logit contributions
 lake-0.069
neath-0.059
 shore-0.056
 ocean-0.055
 clearing-0.055
Token,
Token ID11
Feature activation+1.45e+00

Pos logit contributions
aid+0.094
Of+0.093
olding+0.091
asing+0.090
 forgiveness+0.088
Neg logit contributions
 lake-0.042
 camp-0.040
 shore-0.037
 river-0.036
 creek-0.035
Token I
Token ID314
Feature activation+2.12e+00

Pos logit contributions
aid+0.069
olding+0.067
Of+0.066
asing+0.064
Deal+0.064
Neg logit contributions
 lake-0.066
 camp-0.063
 shore-0.063
 ocean-0.062
 river-0.060
Token can
Token ID460
Feature activation+2.20e+00

Pos logit contributions
asers+0.103
 forgiveness+0.101
aid+0.100
asing+0.099
Of+0.098
Neg logit contributions
 camp-0.083
 lake-0.082
 river-0.080
 pool-0.079
 creek-0.077
Token Her
Token ID2332
Feature activation+1.29e+00

Pos logit contributions
aid+0.045
olding+0.044
Of+0.044
 forgiveness+0.043
asing+0.043
Neg logit contributions
 lake-0.039
 camp-0.037
neath-0.037
 shore-0.036
 river-0.036
Token mom
Token ID1995
Feature activation+1.48e+00

Pos logit contributions
aid+0.057
Of+0.057
olding+0.055
asing+0.055
Deal+0.051
Neg logit contributions
 lake-0.067
 camp-0.065
 shore-0.063
 pool-0.063
 river-0.062
Tokenmy
Token ID1820
Feature activation+6.93e-01

Pos logit contributions
olding+0.080
Of+0.079
 forgiveness+0.079
aid+0.078
 apology+0.078
Neg logit contributions
 lake-0.053
neath-0.050
 camp-0.049
 clearing-0.048
 ocean-0.047
Token said
Token ID531
Feature activation+1.64e+00

Pos logit contributions
Of+0.025
aid+0.024
olding+0.024
 forgiveness+0.024
 apology+0.023
Neg logit contributions
 lake-0.039
neath-0.037
 camp-0.037
 shore-0.036
 river-0.036
Token,
Token ID11
Feature activation+2.01e-01

Pos logit contributions
aid+0.103
Of+0.102
asing+0.100
olding+0.100
 forgiveness+0.099
Neg logit contributions
 lake-0.040
 camp-0.039
 shore-0.038
neath-0.038
 pool-0.035
Token "
Token ID366
Feature activation+4.66e+00

Pos logit contributions
Of+0.010
aid+0.009
olding+0.009
 forgiveness+0.009
 apology+0.009
Neg logit contributions
 lake-0.009
 camp-0.009
neath-0.009
 shore-0.008
 creek-0.008
TokenYes
Token ID5297
Feature activation+9.00e-01

Pos logit contributions
 actions+0.205
aser+0.202
 apology+0.202
asers+0.196
 forgiveness+0.195
Neg logit contributions
 lake-0.195
 clearing-0.188
neath-0.177
 field-0.172
 river-0.168
Token,
Token ID11
Feature activation+2.37e+00

Pos logit contributions
aid+0.060
Of+0.060
 forgiveness+0.058
olding+0.058
asing+0.056
Neg logit contributions
 lake-0.023
 camp-0.022
neath-0.021
 shore-0.020
 river-0.020
Token let
Token ID1309
Feature activation-4.50e-01

Pos logit contributions
aid+0.114
olding+0.111
olds+0.110
asing+0.109
asers+0.108
Neg logit contributions
 lake-0.099
 camp-0.096
 river-0.091
 pool-0.089
 shore-0.089
Token's
Token ID338
Feature activation-1.77e+00

Pos logit contributions
 lake+0.023
 valleys+0.023
 creek+0.023
 shore+0.023
 canyon+0.023
Neg logit contributions
Of-0.018
 forgiveness-0.017
olding-0.017
aid-0.017
 apology-0.017
Token go
Token ID467
Feature activation-1.50e+00

Pos logit contributions
 valleys+0.083
neath+0.080
 clearing+0.078
 ditch+0.078
 canyon+0.076
Neg logit contributions
Of-0.074
 forgiveness-0.074
 promise-0.071
 forgive-0.069
 else-0.069
Token colourful
Token ID44799
Feature activation-1.31e+00

Pos logit contributions
neath+0.032
 lake+0.031
 canyon+0.030
 camp+0.029
 creek+0.029
Neg logit contributions
 forgiveness-0.040
Of-0.040
aid-0.039
 apology-0.038
olding-0.038
Token flowers
Token ID12734
Feature activation-9.96e-01

Pos logit contributions
neath+0.055
 lake+0.049
 canyon+0.049
 tim+0.048
 ditch+0.047
Neg logit contributions
aid-0.063
Of-0.063
 forgiveness-0.062
 apology-0.059
olding-0.058
Token,
Token ID11
Feature activation-1.27e+00

Pos logit contributions
neath+0.046
 lake+0.045
 camp+0.044
 canyon+0.044
 shore+0.043
Neg logit contributions
Of-0.042
aid-0.041
olding-0.041
 forgiveness-0.041
asing-0.038
Token through
Token ID832
Feature activation-1.83e+00

Pos logit contributions
neath+0.050
 lake+0.047
 camp+0.047
 canyon+0.046
 ditch+0.046
Neg logit contributions
Of-0.066
 forgiveness-0.061
aid-0.061
olding-0.058
 respectful-0.058
Token the
Token ID262
Feature activation-1.39e+00

Pos logit contributions
neath+0.057
 tim+0.056
 canyon+0.055
 creek+0.054
 ditch+0.053
Neg logit contributions
 forgiveness-0.098
 else-0.094
Of-0.092
aid-0.088
 actions-0.087
Token lush
Token ID37408
Feature activation-5.21e+00

Pos logit contributions
neath+0.044
 tim+0.034
 camp+0.033
 lake+0.033
 ditch+0.032
Neg logit contributions
Of-0.082
aid-0.077
 forgiveness-0.077
olding-0.077
 respectful-0.076
Token forests
Token ID17039
Feature activation-1.98e+00

Pos logit contributions
neath+0.217
 tim+0.203
 cinema+0.179
acket+0.177
 ditch+0.168
Neg logit contributions
 actions-0.190
 valuable-0.189
 languages-0.182
 words-0.180
 importantly-0.179
Token deep
Token ID2769
Feature activation-3.59e+00

Pos logit contributions
neath+0.094
 lake+0.089
 canyon+0.088
 creek+0.086
 camp+0.086
Neg logit contributions
olding-0.078
aid-0.073
Of-0.072
 forgiveness-0.072
 else-0.071
Token in
Token ID287
Feature activation-2.49e+00

Pos logit contributions
neath+0.111
 tim+0.106
 cinema+0.105
latable+0.101
acket+0.098
Neg logit contributions
ups-0.156
 actions-0.155
able-0.154
ly-0.148
 interests-0.147
Token thought
Token ID1807
Feature activation+4.60e-01

Pos logit contributions
acket+0.105
 tim+0.103
latable+0.099
 canyon+0.092
 ditch+0.090
Neg logit contributions
 forgiveness-0.098
 else-0.092
Of-0.085
aid-0.083
 apology-0.083
Token.
Token ID13
Feature activation-1.15e+00

Pos logit contributions
Of+0.021
aid+0.020
 forgiveness+0.020
olding+0.020
 apology+0.019
Neg logit contributions
 lake-0.022
 camp-0.021
neath-0.021
 shore-0.021
 river-0.021
Token -
Token ID532
Feature activation-1.81e+00

Pos logit contributions
 lake+0.050
neath+0.050
 canyon+0.049
 camp+0.049
 shore+0.048
Neg logit contributions
Of-0.044
olding-0.044
 forgiveness-0.043
aid-0.042
 else-0.041
Token rolling
Token ID10708
Feature activation-1.98e+00

Pos logit contributions
neath+0.070
 canyon+0.068
 ditch+0.064
 lake+0.064
 camp+0.064
Neg logit contributions
Of-0.095
aid-0.087
 forgiveness-0.086
 else-0.083
olding-0.083
Token green
Token ID4077
Feature activation-2.63e+00

Pos logit contributions
neath+0.070
 canyon+0.070
 ocean+0.067
 shore+0.067
 camp+0.067
Neg logit contributions
Of-0.096
aid-0.093
 else-0.090
 apology-0.087
ups-0.086
Token hills
Token ID18639
Feature activation-2.53e+00

Pos logit contributions
neath+0.093
 tim+0.086
 ditch+0.086
 canyon+0.086
 bay+0.083
Neg logit contributions
aid-0.125
ups-0.121
Of-0.120
asers-0.119
 else-0.116
Token,
Token ID11
Feature activation-2.13e+00

Pos logit contributions
neath+0.121
 canyon+0.120
 creek+0.117
 camp+0.116
 lake+0.115
Neg logit contributions
olding-0.088
aid-0.082
Of-0.082
 else-0.082
ups-0.082
Token lush
Token ID37408
Feature activation-5.20e+00

Pos logit contributions
neath+0.084
 camp+0.081
 ditch+0.080
 canyon+0.079
 bay+0.077
Neg logit contributions
Of-0.101
aid-0.093
ups-0.088
 else-0.088
 respectful-0.086
Token grass
Token ID8701
Feature activation-3.22e+00

Pos logit contributions
neath+0.235
 cinema+0.208
 tim+0.208
 ditch+0.202
acket+0.199
Neg logit contributions
ups-0.174
 else-0.173
 words-0.172
 facts-0.163
 songs-0.162
Token,
Token ID11
Feature activation-2.42e+00

Pos logit contributions
neath+0.162
 canyon+0.158
 ditch+0.155
 camp+0.151
 creek+0.147
Neg logit contributions
olding-0.095
ups-0.093
as-0.092
aser-0.088
 else-0.087
Token trees
Token ID7150
Feature activation-2.66e+00

Pos logit contributions
neath+0.111
 ditch+0.109
 camp+0.108
 canyon+0.106
 creek+0.104
Neg logit contributions
Of-0.097
aid-0.087
 respectful-0.082
ups-0.082
 else-0.081
Token and
Token ID290
Feature activation-1.07e+00

Pos logit contributions
neath+0.140
 canyon+0.136
 creek+0.132
 ditch+0.131
 camp+0.128
Neg logit contributions
aid-0.081
olding-0.078
 else-0.077
Of-0.077
 respectful-0.069
Token flowers
Token ID12734
Feature activation-1.30e+00

Pos logit contributions
neath+0.039
 lake+0.037
 canyon+0.036
 camp+0.036
 creek+0.035
Neg logit contributions
Of-0.059
aid-0.056
 forgiveness-0.056
 else-0.053
 apology-0.052
Token.
Token ID13
Feature activation-2.71e+00

Pos logit contributions
neath+0.072
 canyon+0.070
 ditch+0.069
 camp+0.069
 creek+0.068
Neg logit contributions
Of-0.042
 forgiveness-0.042
aid-0.042
olding-0.041
aser-0.038
Token It
Token ID632
Feature activation-1.06e+00

Pos logit contributions
 canyon+0.139
 pasture+0.131
icer+0.130
 camp+0.129
 ditch+0.129
Neg logit contributions
Of-0.109
 forgive-0.087
aid-0.084
olding-0.084
 forgiveness-0.083
Token ran
Token ID4966
Feature activation-2.54e+00

Pos logit contributions
neath+0.053
 lake+0.052
 shore+0.051
 clearing+0.051
 camp+0.051
Neg logit contributions
Of-0.040
 else-0.039
aid-0.037
olding-0.036
 forgiveness-0.035
Token through
Token ID832
Feature activation-2.48e+00

Pos logit contributions
neath+0.134
icer+0.129
 pasture+0.125
 canyon+0.121
 lake+0.119
Neg logit contributions
Of-0.094
 lesson-0.087
olding-0.085
 apology-0.077
 else-0.077
Token a
Token ID257
Feature activation-1.73e+00

Pos logit contributions
acket+0.124
 tim+0.122
neath+0.116
 cinema+0.110
 ditch+0.110
Neg logit contributions
 else-0.101
 forgiveness-0.095
 forgive-0.082
Of-0.080
 understanding-0.077
Token lush
Token ID37408
Feature activation-5.15e+00

Pos logit contributions
neath+0.072
 tim+0.058
 Everest+0.055
 canyon+0.054
acket+0.054
Neg logit contributions
 else-0.090
Of-0.082
 respectful-0.077
 forgiveness-0.076
aid-0.076
Token green
Token ID4077
Feature activation-3.05e+00

Pos logit contributions
neath+0.249
 tim+0.223
 cinema+0.199
acket+0.198
 Everest+0.183
Neg logit contributions
 valuable-0.165
 important-0.158
 else-0.156
 kind-0.156
 respectful-0.147
Token me
Token ID502
Feature activation-6.18e-01

Pos logit contributions
neath+0.102
 tim+0.090
 Everest+0.078
 Robbie+0.071
 gonna+0.071
Neg logit contributions
 forgiveness-0.141
 friend-0.141
aid-0.140
 actions-0.137
 else-0.136
Tokenadow
Token ID4584
Feature activation-1.66e+00

Pos logit contributions
neath+0.028
 lake+0.028
 camp+0.027
 creek+0.026
 shore+0.026
Neg logit contributions
olding-0.029
aid-0.028
Of-0.028
 forgiveness-0.027
asing-0.027
Token,
Token ID11
Feature activation-2.09e+00

Pos logit contributions
neath+0.097
 lake+0.090
 camp+0.090
 canyon+0.089
 ditch+0.088
Neg logit contributions
olding-0.054
Of-0.048
 forgiveness-0.047
 respectful-0.045
aid-0.045
Token with
Token ID351
Feature activation-1.11e-01

Pos logit contributions
neath+0.096
 camp+0.091
 canyon+0.089
 ditch+0.088
icer+0.087
Neg logit contributions
Of-0.096
olding-0.085
 respectful-0.083
 forgiveness-0.079
 else-0.078
Token was
Token ID373
Feature activation-1.09e+00

Pos logit contributions
 lake+0.030
neath+0.029
 camp+0.029
 canyon+0.028
 shore+0.028
Neg logit contributions
Of-0.028
aid-0.026
olding-0.025
 forgiveness-0.025
 else-0.024
Token running
Token ID2491
Feature activation-1.48e+00

Pos logit contributions
 lake+0.053
neath+0.051
 creek+0.051
 camp+0.049
 canyon+0.049
Neg logit contributions
Of-0.049
olding-0.046
aid-0.046
 respectful-0.045
 forgiveness-0.044
Token around
Token ID1088
Feature activation-1.80e+00

Pos logit contributions
neath+0.071
 lake+0.069
 canyon+0.067
 ocean+0.065
 camp+0.064
Neg logit contributions
Of-0.065
olding-0.060
aid-0.057
 else-0.057
olded-0.054
Token on
Token ID319
Feature activation-2.34e+00

Pos logit contributions
neath+0.078
 canyon+0.075
 lake+0.075
 ditch+0.075
 river+0.073
Neg logit contributions
Of-0.079
 forgiveness-0.078
olding-0.074
 else-0.071
 apology-0.069
Token the
Token ID262
Feature activation-1.92e+00

Pos logit contributions
neath+0.090
 tim+0.088
acket+0.087
 canyon+0.087
 ditch+0.083
Neg logit contributions
 forgiveness-0.109
Of-0.100
 else-0.099
 classmates-0.096
 apology-0.094
Token lush
Token ID37408
Feature activation-4.89e+00

Pos logit contributions
neath+0.074
 canyon+0.061
 camp+0.059
 clearing+0.058
 valleys+0.057
Neg logit contributions
Of-0.103
 apology-0.096
olding-0.096
aid-0.094
 else-0.093
Token grass
Token ID8701
Feature activation-1.42e+00

Pos logit contributions
neath+0.234
 tim+0.193
 takeoff+0.177
acket+0.174
icer+0.173
Neg logit contributions
 else-0.162
ups-0.162
 respectful-0.162
 apology-0.158
 actions-0.158
Token,
Token ID11
Feature activation-1.65e+00

Pos logit contributions
neath+0.061
 lake+0.059
 canyon+0.058
 camp+0.058
 ditch+0.056
Neg logit contributions
olding-0.065
Of-0.065
 forgiveness-0.061
aid-0.060
asing-0.059
Token enjoying
Token ID13226
Feature activation-2.06e+00

Pos logit contributions
neath+0.069
 camp+0.069
 lake+0.067
 canyon+0.066
 ditch+0.065
Neg logit contributions
Of-0.084
olding-0.073
 respectful-0.072
aid-0.070
 else-0.068
Token the
Token ID262
Feature activation-1.11e+00

Pos logit contributions
neath+0.087
 ditch+0.082
 canyon+0.077
acket+0.076
 river+0.075
Neg logit contributions
Of-0.096
 forgiveness-0.095
 caring-0.092
 respectful-0.090
 else-0.087
Token sunshine
Token ID34488
Feature activation-6.17e-01

Pos logit contributions
neath+0.045
 canyon+0.039
 camp+0.038
 ditch+0.038
 lake+0.037
Neg logit contributions
Of-0.059
aid-0.055
 forgiveness-0.055
 apology-0.054
olding-0.054
Token sight
Token ID6504
Feature activation-2.40e-01

Pos logit contributions
neath+0.108
 valleys+0.090
 ditch+0.080
 takeoff+0.078
 tim+0.078
Neg logit contributions
 forgiveness-0.117
 apology-0.116
olding-0.113
Of-0.112
 respectful-0.109
Token.
Token ID13
Feature activation-1.37e+00

Pos logit contributions
 lake+0.011
neath+0.011
 camp+0.011
 canyon+0.010
 shore+0.010
Neg logit contributions
Of-0.011
olding-0.010
 forgiveness-0.010
aid-0.010
 apology-0.010
Token There
Token ID1318
Feature activation-5.86e-01

Pos logit contributions
neath+0.061
 lake+0.061
 camp+0.061
 canyon+0.060
 shore+0.059
Neg logit contributions
Of-0.066
olding-0.060
 apology-0.057
olded-0.056
 forgiveness-0.056
Token was
Token ID373
Feature activation-1.53e+00

Pos logit contributions
 lake+0.027
 canyon+0.027
 shore+0.026
 camp+0.026
neath+0.026
Neg logit contributions
 forgiveness-0.025
Of-0.025
aid-0.023
olding-0.023
 apology-0.023
Token a
Token ID257
Feature activation-1.25e+00

Pos logit contributions
neath+0.057
 lake+0.053
 canyon+0.053
 camp+0.051
 creek+0.051
Neg logit contributions
Of-0.079
 forgiveness-0.079
aid-0.074
 else-0.073
olding-0.073
Token lush
Token ID37408
Feature activation-4.80e+00

Pos logit contributions
neath+0.053
 valleys+0.046
 camp+0.044
 lake+0.044
 fields+0.044
Neg logit contributions
Of-0.063
 forgiveness-0.060
olding-0.059
aid-0.058
 apology-0.058
Token green
Token ID4077
Feature activation-2.72e+00

Pos logit contributions
neath+0.227
 tim+0.214
 cinema+0.190
ASH+0.181
 takeoff+0.178
Neg logit contributions
 valuable-0.152
 respectful-0.148
 important-0.148
 understanding-0.146
 importantly-0.146
Token patch
Token ID8529
Feature activation-1.78e+00

Pos logit contributions
neath+0.098
 tim+0.079
 valleys+0.078
 ditch+0.075
 takeoff+0.073
Neg logit contributions
Of-0.134
 forgiveness-0.131
 apology-0.130
aid-0.129
 actions-0.126
Token of
Token ID286
Feature activation-1.65e+00

Pos logit contributions
 camp+0.112
neath+0.111
 lake+0.110
 canyon+0.110
 valleys+0.109
Neg logit contributions
olding-0.043
Of-0.041
aid-0.040
 forgiveness-0.039
aser-0.038
Token grass
Token ID8701
Feature activation-2.82e+00

Pos logit contributions
neath+0.061
 tim+0.056
 canyon+0.054
 ditch+0.053
 camp+0.051
Neg logit contributions
Of-0.087
 forgiveness-0.085
aid-0.084
 apology-0.081
 else-0.078
Token and
Token ID290
Feature activation-1.56e+00

Pos logit contributions
 camp+0.121
 tim+0.118
 canyon+0.118
neath+0.115
 ditch+0.114
Neg logit contributions
aser-0.106
olding-0.106
 else-0.103
Of-0.102
 forgiveness-0.100
Token good
Token ID922
Feature activation+1.62e-01

Pos logit contributions
neath+0.055
 lake+0.054
 canyon+0.051
 shore+0.051
 clearing+0.050
Neg logit contributions
Of-0.037
olding-0.036
aid-0.035
 else-0.034
 respectful-0.032
Token to
Token ID284
Feature activation-1.13e+00

Pos logit contributions
Of+0.007
olding+0.007
aid+0.007
 forgiveness+0.007
 apology+0.007
Neg logit contributions
 lake-0.008
neath-0.007
 camp-0.007
 shore-0.007
 river-0.007
Token roll
Token ID4836
Feature activation-4.77e-01

Pos logit contributions
neath+0.056
 lake+0.054
 ditch+0.054
 valleys+0.054
 fields+0.053
Neg logit contributions
 forgiveness-0.045
 apology-0.044
Of-0.044
aid-0.041
 forgive-0.041
Token in
Token ID287
Feature activation-1.11e+00

Pos logit contributions
 lake+0.019
neath+0.018
 camp+0.018
 canyon+0.018
 shore+0.018
Neg logit contributions
Of-0.025
aid-0.024
olding-0.024
 forgiveness-0.023
 apology-0.023
Token the
Token ID262
Feature activation-1.54e+00

Pos logit contributions
neath+0.038
 canyon+0.038
 lake+0.036
 ocean+0.036
 creek+0.036
Neg logit contributions
 forgiveness-0.059
Of-0.055
 else-0.054
 apology-0.054
olding-0.052
Token lush
Token ID37408
Feature activation-4.78e+00

Pos logit contributions
neath+0.050
 shore+0.038
ground+0.038
 tim+0.037
 takeoff+0.037
Neg logit contributions
Of-0.089
aid-0.083
olding-0.081
Deal-0.081
 forgiveness-0.081
Token green
Token ID4077
Feature activation-2.34e+00

Pos logit contributions
neath+0.207
 tim+0.176
acket+0.172
 takeoff+0.156
 cinema+0.156
Neg logit contributions
 actions-0.184
 valuable-0.178
 languages-0.173
 feelings-0.172
ups-0.171
Token grass
Token ID8701
Feature activation-7.94e-01

Pos logit contributions
neath+0.083
 tim+0.064
 Everest+0.062
 takeoff+0.061
acket+0.058
Neg logit contributions
 actions-0.116
ups-0.113
 forgiveness-0.108
 apology-0.108
 feelings-0.108
Token and
Token ID290
Feature activation-8.35e-01

Pos logit contributions
neath+0.037
 lake+0.037
 camp+0.036
 canyon+0.036
 shore+0.035
Neg logit contributions
olding-0.034
Of-0.033
 forgiveness-0.033
aid-0.032
asing-0.031
Token jump
Token ID4391
Feature activation-1.86e+00

Pos logit contributions
neath+0.031
 lake+0.029
 canyon+0.029
 camp+0.028
 ocean+0.028
Neg logit contributions
Of-0.046
 forgiveness-0.043
aid-0.042
olding-0.041
Deal-0.041
Token high
Token ID1029
Feature activation-2.05e+00

Pos logit contributions
 lake+0.091
neath+0.089
 ocean+0.087
 canyon+0.087
 clearing+0.085
Neg logit contributions
olding-0.070
aid-0.069
Of-0.069
 else-0.067
 responsibly-0.063
Token farmer
Token ID18739
Feature activation+5.87e-01

Pos logit contributions
neath+0.008
 lake+0.007
 camp+0.007
 valleys+0.007
 canyon+0.007
Neg logit contributions
Of-0.005
olding-0.004
 forgiveness-0.004
aid-0.004
 apology-0.004
Token who
Token ID508
Feature activation-5.50e-01

Pos logit contributions
Of+0.028
aid+0.027
 forgiveness+0.027
olding+0.027
 apology+0.026
Neg logit contributions
 lake-0.027
 camp-0.025
neath-0.025
 shore-0.025
 river-0.025
Token lived
Token ID5615
Feature activation-1.01e+00

Pos logit contributions
neath+0.027
 lake+0.027
 shore+0.027
 canyon+0.026
 camp+0.026
Neg logit contributions
Of-0.023
aid-0.022
 forgiveness-0.022
olding-0.022
Deal-0.021
Token on
Token ID319
Feature activation-3.23e+00

Pos logit contributions
 lake+0.044
neath+0.043
 creek+0.042
 canyon+0.042
 river+0.042
Neg logit contributions
Of-0.048
aid-0.048
olding-0.047
 forgiveness-0.046
Deal-0.043
Token a
Token ID257
Feature activation-1.66e+00

Pos logit contributions
 canyon+0.156
neath+0.153
 ditch+0.147
 waterfall+0.147
 slope+0.146
Neg logit contributions
 forgiveness-0.106
 feelings-0.093
 caring-0.092
 lesson-0.090
grand-0.090
Token lush
Token ID37408
Feature activation-4.77e+00

Pos logit contributions
neath+0.061
 valleys+0.051
 canyon+0.049
 clearing+0.046
 lake+0.045
Neg logit contributions
Of-0.092
 else-0.090
 respectful-0.086
olding-0.086
 forgiveness-0.085
Token farm
Token ID5318
Feature activation-1.10e-01

Pos logit contributions
neath+0.220
 valleys+0.180
 tim+0.175
 takeoff+0.163
 GO+0.159
Neg logit contributions
 important-0.185
 valuable-0.177
 caring-0.170
 respectful-0.169
 kind-0.161
Token with
Token ID351
Feature activation+1.82e-01

Pos logit contributions
 lake+0.005
neath+0.005
 camp+0.005
 shore+0.005
 creek+0.005
Neg logit contributions
Of-0.005
olding-0.005
 forgiveness-0.005
aid-0.005
 apology-0.004
Token many
Token ID867
Feature activation+4.24e-01

Pos logit contributions
Of+0.010
aid+0.010
 forgiveness+0.010
olding+0.010
 apology+0.009
Neg logit contributions
 lake-0.007
neath-0.007
 camp-0.006
 shore-0.006
 creek-0.006
Token animals
Token ID4695
Feature activation+6.65e-01

Pos logit contributions
Of+0.019
aid+0.019
olding+0.018
 forgiveness+0.018
 apology+0.018
Neg logit contributions
 lake-0.020
 camp-0.019
neath-0.019
 shore-0.019
 river-0.019
Token and
Token ID290
Feature activation+1.70e-01

Pos logit contributions
Of+0.033
aid+0.032
 forgiveness+0.032
olding+0.031
 apology+0.031
Neg logit contributions
 lake-0.029
 camp-0.028
neath-0.027
 shore-0.027
 river-0.027
Token grass
Token ID8701
Feature activation-1.02e+00

Pos logit contributions
 canyon+0.055
 camp+0.054
 lake+0.053
 shore+0.052
neath+0.051
Neg logit contributions
 forgiveness-0.076
Of-0.075
aid-0.072
 apology-0.071
olding-0.071
Token was
Token ID373
Feature activation-1.14e+00

Pos logit contributions
 lake+0.048
neath+0.047
 camp+0.047
 canyon+0.046
 creek+0.045
Neg logit contributions
Of-0.042
olding-0.042
 forgiveness-0.041
aid-0.040
 apology-0.038
Token a
Token ID257
Feature activation-4.87e-01

Pos logit contributions
 lake+0.043
neath+0.043
 creek+0.042
 camp+0.042
 canyon+0.042
Neg logit contributions
 forgiveness-0.057
Of-0.057
aid-0.056
olding-0.055
 else-0.053
Token deep
Token ID2769
Feature activation-3.12e+00

Pos logit contributions
neath+0.021
 lake+0.020
 camp+0.019
 canyon+0.019
 valleys+0.019
Neg logit contributions
Of-0.024
 forgiveness-0.023
aid-0.022
olding-0.022
 apology-0.022
Token,
Token ID11
Feature activation-1.38e+00

Pos logit contributions
neath+0.116
 valleys+0.101
 takeoff+0.100
 camp+0.100
 tim+0.099
Neg logit contributions
 forgiveness-0.125
Of-0.121
olding-0.121
 promise-0.120
 apology-0.120
Token lush
Token ID37408
Feature activation-4.71e+00

Pos logit contributions
neath+0.057
 camp+0.055
 lake+0.054
 canyon+0.054
 shore+0.052
Neg logit contributions
Of-0.069
 forgiveness-0.065
aid-0.063
olding-0.062
 apology-0.061
Token green
Token ID4077
Feature activation-2.80e+00

Pos logit contributions
neath+0.222
 tim+0.197
 cinema+0.191
 takeoff+0.179
 Everest+0.176
Neg logit contributions
 respectful-0.153
 valuable-0.152
 precious-0.148
 importantly-0.146
 caring-0.143
Token.
Token ID13
Feature activation+1.09e+00

Pos logit contributions
neath+0.102
 tim+0.081
 canyon+0.080
 ditch+0.080
 takeoff+0.079
Neg logit contributions
Of-0.135
 forgiveness-0.135
aid-0.132
 apology-0.130
 actions-0.129
Token The
Token ID383
Feature activation+2.63e-01

Pos logit contributions
aid+0.066
 forgiveness+0.064
olding+0.064
Of+0.064
 apology+0.062
Neg logit contributions
neath-0.034
 lake-0.034
 camp-0.032
 clearing-0.031
 river-0.031
Token little
Token ID1310
Feature activation-2.26e-01

Pos logit contributions
Of+0.014
aid+0.014
 forgiveness+0.014
olding+0.014
 apology+0.013
Neg logit contributions
 lake-0.010
neath-0.010
 camp-0.010
 shore-0.010
 creek-0.010
Token girl
Token ID2576
Feature activation+3.45e-01

Pos logit contributions
neath+0.014
 lake+0.013
 camp+0.013
 canyon+0.013
 shore+0.013
Neg logit contributions
Of-0.007
 forgiveness-0.007
aid-0.006
olding-0.006
 apology-0.006
Token to
Token ID284
Feature activation-2.17e+00

Pos logit contributions
 lake+0.042
neath+0.041
 camp+0.040
 creek+0.040
 canyon+0.040
Neg logit contributions
Of-0.026
olding-0.025
 forgiveness-0.025
aid-0.024
 apology-0.023
Token set
Token ID900
Feature activation-1.41e+00

Pos logit contributions
 ditch+0.080
 canyon+0.074
neath+0.072
icer+0.072
 camp+0.072
Neg logit contributions
Of-0.107
 forgiveness-0.102
 apology-0.100
 forgive-0.098
aid-0.096
Token.
Token ID13
Feature activation-1.01e-02

Pos logit contributions
neath+0.071
 lake+0.071
 canyon+0.070
 camp+0.068
 creek+0.068
Neg logit contributions
 forgiveness-0.051
olding-0.051
Of-0.051
aid-0.049
 apology-0.048
Token<|endoftext|>
Token ID50256
Feature activation-2.87e+00

Pos logit contributions
 lake+0.000
 camp+0.000
neath+0.000
 shore+0.000
 canyon+0.000
Neg logit contributions
Of-0.001
olding-0.001
aid-0.001
 forgiveness-0.001
 apology-0.001
TokenLittle
Token ID22253
Feature activation-1.37e+00

Pos logit contributions
 canyon+0.131
 shore+0.126
 river+0.123
 ditch+0.122
 lake+0.119
Neg logit contributions
Of-0.117
aid-0.105
 forgiveness-0.104
 else-0.102
olding-0.100
Token Johnny
Token ID15470
Feature activation-4.67e+00

Pos logit contributions
 canyon+0.054
 camp+0.053
 valleys+0.053
 ditch+0.052
 shore+0.052
Neg logit contributions
Of-0.072
olding-0.065
 forgiveness-0.065
aid-0.062
 apology-0.062
Token was
Token ID373
Feature activation-2.77e+00

Pos logit contributions
 canyon+0.265
 ditch+0.258
 rav+0.247
 clearing+0.244
 camp+0.241
Neg logit contributions
 cared-0.123
Of-0.121
olded-0.118
 cares-0.113
 else-0.113
Token so
Token ID523
Feature activation-1.02e+00

Pos logit contributions
 creek+0.128
 lake+0.127
 canyon+0.122
 valleys+0.121
 river+0.121
Neg logit contributions
olding-0.111
Of-0.110
olded-0.103
 understanding-0.102
 respectful-0.101
Token excited
Token ID6568
Feature activation-9.42e-01

Pos logit contributions
 lake+0.058
neath+0.056
 canyon+0.055
 clearing+0.055
 camp+0.054
Neg logit contributions
Of-0.037
olding-0.035
 else-0.032
aid-0.032
olded-0.031
Token.
Token ID13
Feature activation-1.81e+00

Pos logit contributions
neath+0.048
 lake+0.047
 valleys+0.046
 canyon+0.046
 creek+0.045
Neg logit contributions
olding-0.037
Of-0.037
 forgiveness-0.036
aid-0.033
 apology-0.033
Token He
Token ID679
Feature activation-2.06e+00

Pos logit contributions
 canyon+0.087
 camp+0.086
 lake+0.085
neath+0.083
 clearing+0.081
Neg logit contributions
Of-0.076
olding-0.071
 apology-0.066
 forgiveness-0.064
olded-0.063
Token lived
Token ID5615
Feature activation-6.73e-01

Pos logit contributions
neath+0.031
 lake+0.030
 shore+0.030
 canyon+0.030
 camp+0.030
Neg logit contributions
Of-0.022
aid-0.021
 forgiveness-0.020
 else-0.020
Deal-0.020
Token in
Token ID287
Feature activation-1.96e+00

Pos logit contributions
neath+0.029
 lake+0.029
 river+0.028
 camp+0.028
 shore+0.028
Neg logit contributions
Of-0.033
aid-0.032
olding-0.031
 forgiveness-0.031
Deal-0.029
Token a
Token ID257
Feature activation-2.12e+00

Pos logit contributions
neath+0.084
 creek+0.083
 canyon+0.083
 shore+0.082
 river+0.082
Neg logit contributions
 forgiveness-0.086
aid-0.080
Of-0.077
 else-0.073
 apology-0.073
Token big
Token ID1263
Feature activation-3.47e+00

Pos logit contributions
neath+0.104
acket+0.088
 valleys+0.088
 canyon+0.086
 takeoff+0.086
Neg logit contributions
 else-0.088
Of-0.082
 respectful-0.078
 forgiveness-0.074
 caring-0.074
Token,
Token ID11
Feature activation-2.51e+00

Pos logit contributions
neath+0.149
acket+0.130
ground+0.126
 valleys+0.123
 takeoff+0.123
Neg logit contributions
 promise-0.137
 forgiveness-0.134
 actions-0.132
 respectful-0.130
 friend-0.129
Token wide
Token ID3094
Feature activation-4.64e+00

Pos logit contributions
neath+0.128
 shore+0.117
 valleys+0.117
ground+0.116
 canyon+0.115
Neg logit contributions
Of-0.099
olding-0.089
 respectful-0.089
 valuable-0.088
 else-0.085
Token forest
Token ID8222
Feature activation-8.15e-01

Pos logit contributions
neath+0.201
 takeoff+0.160
 valleys+0.151
acket+0.139
ground+0.138
Neg logit contributions
 friend-0.193
 owner-0.185
 actions-0.176
 promise-0.175
 respectful-0.175
Token and
Token ID290
Feature activation-1.78e-01

Pos logit contributions
neath+0.042
 lake+0.041
 camp+0.040
 shore+0.039
 canyon+0.039
Neg logit contributions
olding-0.032
Of-0.031
 forgiveness-0.030
aid-0.029
 respectful-0.028
Token enjoyed
Token ID8359
Feature activation-1.83e+00

Pos logit contributions
neath+0.008
 lake+0.008
 camp+0.008
 shore+0.008
 canyon+0.008
Neg logit contributions
Of-0.008
aid-0.008
 forgiveness-0.008
olding-0.008
 apology-0.007
Token exploring
Token ID13504
Feature activation-1.31e+00

Pos logit contributions
neath+0.092
 ditch+0.076
acket+0.075
 canyon+0.073
 lake+0.070
Neg logit contributions
 forgiveness-0.082
 caring-0.079
 respectful-0.076
Of-0.075
Deal-0.073
Token every
Token ID790
Feature activation-8.77e-01

Pos logit contributions
neath+0.066
 canyon+0.062
 lake+0.061
 ditch+0.061
 river+0.059
Neg logit contributions
 forgiveness-0.050
Of-0.049
 else-0.046
 apology-0.044
Deal-0.044
TokenFinally
Token ID11158
Feature activation-1.04e+00

Pos logit contributions
 lake+0.006
 camp+0.006
neath+0.006
 shore+0.006
 ocean+0.006
Neg logit contributions
Of-0.005
aid-0.005
olding-0.005
 forgiveness-0.005
 apology-0.005
Token,
Token ID11
Feature activation-6.20e-01

Pos logit contributions
 lake+0.042
 canyon+0.040
neath+0.040
 river+0.039
 valleys+0.039
Neg logit contributions
 forgiveness-0.053
Of-0.051
 apology-0.048
aid-0.048
olding-0.047
Token they
Token ID484
Feature activation-1.35e+00

Pos logit contributions
 lake+0.023
 camp+0.022
 canyon+0.021
neath+0.021
 valleys+0.021
Neg logit contributions
Of-0.034
 forgiveness-0.032
aid-0.031
 apology-0.031
olding-0.031
Token reached
Token ID4251
Feature activation-2.54e+00

Pos logit contributions
 lake+0.056
 canyon+0.056
 ditch+0.054
neath+0.054
 creek+0.053
Neg logit contributions
Of-0.063
aid-0.058
 else-0.056
 forgiveness-0.053
olding-0.053
Token a
Token ID257
Feature activation-1.67e+00

Pos logit contributions
 canyon+0.110
 slope+0.104
 lake+0.103
 river+0.103
neath+0.102
Neg logit contributions
 forgiveness-0.100
 honesty-0.092
Of-0.089
 kindness-0.088
 apology-0.088
Token vast
Token ID5909
Feature activation-4.62e+00

Pos logit contributions
neath+0.066
 valleys+0.058
 canyon+0.054
 tim+0.054
 fields+0.054
Neg logit contributions
Of-0.087
 else-0.083
 forgiveness-0.081
olding-0.078
aid-0.078
Token stretch
Token ID7539
Feature activation-1.35e+00

Pos logit contributions
neath+0.180
 tim+0.153
 takeoff+0.136
unks+0.133
ics+0.130
Neg logit contributions
 actions-0.189
 respectful-0.188
 promise-0.183
 valuable-0.178
 apology-0.177
Token of
Token ID286
Feature activation-1.30e+00

Pos logit contributions
neath+0.070
 lake+0.069
 camp+0.068
 valleys+0.068
 canyon+0.068
Neg logit contributions
olding-0.048
Of-0.046
 forgiveness-0.044
aid-0.044
aser-0.043
Token beach
Token ID10481
Feature activation-1.75e+00

Pos logit contributions
neath+0.044
 tim+0.043
 canyon+0.041
 lake+0.039
 camp+0.039
Neg logit contributions
 forgiveness-0.071
 apology-0.068
Of-0.068
aid-0.066
olding-0.062
Token.
Token ID13
Feature activation-1.89e+00

Pos logit contributions
neath+0.087
 canyon+0.078
 lake+0.077
 camp+0.077
 ditch+0.076
Neg logit contributions
olding-0.068
 forgiveness-0.065
Of-0.064
ups-0.063
 respectful-0.062
Token As
Token ID1081
Feature activation-7.88e-01

Pos logit contributions
 canyon+0.074
 lake+0.074
 camp+0.073
 valleys+0.072
 ocean+0.070
Neg logit contributions
Of-0.098
 apology-0.088
 forgiveness-0.088
olding-0.087
olded-0.085
Token middle
Token ID3504
Feature activation+1.13e-01

Pos logit contributions
neath+0.063
 tim+0.055
 canyon+0.049
 camp+0.046
 creek+0.045
Neg logit contributions
Of-0.124
 else-0.119
 forgiveness-0.118
 respectful-0.115
oral-0.114
Token of
Token ID286
Feature activation-2.73e+00

Pos logit contributions
Of+0.003
aid+0.003
olding+0.003
 forgiveness+0.003
 apology+0.003
Neg logit contributions
 lake-0.007
neath-0.007
 camp-0.007
 shore-0.007
 creek-0.007
Token a
Token ID257
Feature activation-2.12e+00

Pos logit contributions
acket+0.130
 tim+0.129
latable+0.122
neath+0.121
amp+0.119
Neg logit contributions
 forgiveness-0.093
 caring-0.082
Of-0.080
 understanding-0.078
 honesty-0.077
Token dark
Token ID3223
Feature activation-2.89e+00

Pos logit contributions
neath+0.079
 valleys+0.056
 tim+0.055
 canyon+0.055
latable+0.053
Neg logit contributions
 else-0.118
Of-0.114
 respectful-0.107
 forgiveness-0.102
olding-0.102
Token,
Token ID11
Feature activation-2.28e+00

Pos logit contributions
neath+0.115
 Everest+0.086
 tim+0.084
latable+0.081
 GO+0.078
Neg logit contributions
grand-0.123
 actions-0.117
olding-0.117
 else-0.116
 respectful-0.116
Token wide
Token ID3094
Feature activation-4.60e+00

Pos logit contributions
neath+0.115
 camp+0.100
growth+0.099
 canyon+0.098
 valleys+0.097
Neg logit contributions
Of-0.089
 valuable-0.085
 respectful-0.083
olding-0.082
 precious-0.080
Token sea
Token ID5417
Feature activation-2.99e-01

Pos logit contributions
neath+0.179
 tim+0.124
IT+0.120
unks+0.119
pped+0.116
Neg logit contributions
keeper-0.193
 gift-0.190
 promise-0.184
 owner-0.183
 friend-0.181
Token.
Token ID13
Feature activation-2.27e+00

Pos logit contributions
neath+0.015
 lake+0.014
 camp+0.014
 canyon+0.014
 shore+0.014
Neg logit contributions
Of-0.012
olding-0.012
aid-0.012
 forgiveness-0.012
 apology-0.011
Token All
Token ID1439
Feature activation-2.45e-01

Pos logit contributions
 canyon+0.105
 shore+0.104
neath+0.101
 waterfall+0.099
 camp+0.097
Neg logit contributions
Of-0.102
 forgiveness-0.081
olding-0.079
Deal-0.078
Okay-0.076
Token the
Token ID262
Feature activation-6.96e-01

Pos logit contributions
 lake+0.008
 camp+0.007
neath+0.007
 shore+0.007
 canyon+0.007
Neg logit contributions
Of-0.015
aid-0.014
 forgiveness-0.014
olding-0.014
 apology-0.014
Token animals
Token ID4695
Feature activation-2.14e-01

Pos logit contributions
neath+0.037
 lake+0.033
 canyon+0.033
 creek+0.033
 camp+0.033
Neg logit contributions
Of-0.027
 forgiveness-0.026
aid-0.025
 apology-0.025
olding-0.025
Token,
Token ID11
Feature activation-7.70e-01

Pos logit contributions
 lake+0.035
neath+0.033
 shore+0.033
 canyon+0.033
 river+0.033
Neg logit contributions
Of-0.045
olding-0.044
 forgiveness-0.044
 else-0.042
 apology-0.041
Token there
Token ID612
Feature activation-1.63e+00

Pos logit contributions
 lake+0.033
 shore+0.032
 canyon+0.032
 river+0.031
 camp+0.031
Neg logit contributions
Of-0.038
 forgiveness-0.035
olding-0.035
 else-0.035
aid-0.035
Token was
Token ID373
Feature activation-2.00e+00

Pos logit contributions
 canyon+0.098
 ditch+0.094
 shore+0.093
 valleys+0.093
 river+0.092
Neg logit contributions
 else-0.048
 forgiveness-0.047
Of-0.046
olding-0.041
 apology-0.040
Token a
Token ID257
Feature activation-8.51e-01

Pos logit contributions
 canyon+0.092
 river+0.086
 lake+0.085
 valleys+0.084
 shore+0.084
Neg logit contributions
Of-0.089
 forgiveness-0.087
aid-0.084
 else-0.078
 caring-0.077
Token big
Token ID1263
Feature activation-2.88e+00

Pos logit contributions
neath+0.042
 valleys+0.039
 lake+0.038
 canyon+0.038
 shore+0.037
Neg logit contributions
Of-0.038
 forgiveness-0.035
aid-0.035
olding-0.035
 apology-0.034
Token wide
Token ID3094
Feature activation-4.57e+00

Pos logit contributions
neath+0.139
 valleys+0.135
ground+0.116
 canyon+0.116
growth+0.115
Neg logit contributions
 forgiveness-0.116
Of-0.113
 apology-0.107
 promise-0.105
 caring-0.104
Token house
Token ID2156
Feature activation-2.05e-01

Pos logit contributions
neath+0.190
 valleys+0.179
unks+0.146
 afloat+0.141
 takeoff+0.141
Neg logit contributions
Of-0.193
 promise-0.182
 gift-0.181
olding-0.179
 friend-0.178
Token.
Token ID13
Feature activation-2.55e+00

Pos logit contributions
 lake+0.009
neath+0.009
 camp+0.009
 shore+0.009
 canyon+0.009
Neg logit contributions
Of-0.009
olding-0.009
aid-0.009
 forgiveness-0.009
 apology-0.008
Token Inside
Token ID14384
Feature activation-2.05e+00

Pos logit contributions
 shore+0.122
 canyon+0.114
 lake+0.114
neath+0.110
 waterfall+0.109
Neg logit contributions
Of-0.108
olding-0.099
 forgiveness-0.098
aid-0.093
 apology-0.091
Token the
Token ID262
Feature activation-2.51e-01

Pos logit contributions
 ditch+0.086
 canyon+0.084
 shore+0.078
 rav+0.076
 valleys+0.076
Neg logit contributions
 else-0.092
 forgiveness-0.088
Of-0.085
oral-0.083
Deal-0.083
Token house
Token ID2156
Feature activation+4.16e-01

Pos logit contributions
neath+0.008
 lake+0.008
 camp+0.007
 canyon+0.007
 shore+0.007
Neg logit contributions
Of-0.015
olding-0.014
 forgiveness-0.014
aid-0.014
 apology-0.014
Token lived
Token ID5615
Feature activation-2.34e-01

Pos logit contributions
Of+0.011
aid+0.011
olding+0.011
 forgiveness+0.011
 apology+0.010
Neg logit contributions
 lake-0.012
neath-0.011
 camp-0.011
 shore-0.011
 creek-0.011
Token in
Token ID287
Feature activation-1.42e+00

Pos logit contributions
 lake+0.009
neath+0.009
 camp+0.009
 shore+0.009
 river+0.009
Neg logit contributions
Of-0.012
aid-0.012
olding-0.012
 forgiveness-0.012
 apology-0.011
Token a
Token ID257
Feature activation-1.59e+00

Pos logit contributions
 shore+0.053
 canyon+0.052
 lake+0.052
acket+0.052
 creek+0.051
Neg logit contributions
 forgiveness-0.072
Of-0.068
aid-0.065
 caring-0.063
 apology-0.063
Token big
Token ID1263
Feature activation-3.13e+00

Pos logit contributions
neath+0.069
 shore+0.059
 canyon+0.059
 valleys+0.058
 tim+0.058
Neg logit contributions
Of-0.072
 else-0.072
 forgiveness-0.065
 respectful-0.065
 caring-0.063
Token,
Token ID11
Feature activation-2.55e+00

Pos logit contributions
neath+0.123
 tim+0.110
ground+0.109
acket+0.105
 takeoff+0.105
Neg logit contributions
 promise-0.132
 forgiveness-0.131
Of-0.126
 actions-0.126
 respectful-0.124
Token wide
Token ID3094
Feature activation-4.57e+00

Pos logit contributions
neath+0.130
 valleys+0.124
 shore+0.121
 takeoff+0.120
 canyon+0.120
Neg logit contributions
Of-0.097
 valuable-0.090
olding-0.087
 caring-0.086
 forgiveness-0.085
Token field
Token ID2214
Feature activation+3.26e-01

Pos logit contributions
neath+0.180
 takeoff+0.153
 tim+0.144
 valleys+0.141
ground+0.130
Neg logit contributions
 friend-0.201
 owner-0.189
 promise-0.180
 mother-0.178
Of-0.178
Token.
Token ID13
Feature activation-2.29e+00

Pos logit contributions
Of+0.016
aid+0.015
olding+0.015
 forgiveness+0.015
 apology+0.015
Neg logit contributions
 lake-0.014
 camp-0.014
neath-0.014
 shore-0.013
 river-0.013
Token As
Token ID1081
Feature activation-8.68e-01

Pos logit contributions
 canyon+0.113
 shore+0.111
neath+0.109
 waterfall+0.106
 valleys+0.105
Neg logit contributions
Of-0.093
olding-0.073
 forgiveness-0.073
 Friends-0.070
 apology-0.070
Token he
Token ID339
Feature activation-1.50e+00

Pos logit contributions
neath+0.049
 canyon+0.047
 lake+0.046
 ditch+0.046
 clearing+0.045
Neg logit contributions
Of-0.029
 forgiveness-0.027
 else-0.026
aid-0.024
Deal-0.023
Token hopped
Token ID45230
Feature activation-1.02e+00

Pos logit contributions
 shore+0.069
neath+0.066
 ditch+0.066
 canyon+0.066
 lake+0.065
Neg logit contributions
Of-0.062
aid-0.061
 else-0.059
Deal-0.058
 forgiveness-0.055
Token,
Token ID11
Feature activation-4.02e-01

Pos logit contributions
 lake+0.021
neath+0.020
 ocean+0.019
 canyon+0.019
 shore+0.019
Neg logit contributions
Of-0.019
 forgiveness-0.019
aid-0.018
olding-0.018
 apology-0.017
Token she
Token ID673
Feature activation-5.56e-01

Pos logit contributions
 lake+0.021
 camp+0.020
neath+0.020
 canyon+0.020
 ocean+0.020
Neg logit contributions
Of-0.016
 forgiveness-0.015
aid-0.014
olding-0.014
 apology-0.014
Token came
Token ID1625
Feature activation-1.02e+00

Pos logit contributions
 lake+0.025
neath+0.025
 creek+0.025
 camp+0.024
 shore+0.024
Neg logit contributions
Of-0.025
aid-0.024
olding-0.024
 forgiveness-0.023
 else-0.023
Token to
Token ID284
Feature activation-1.41e+00

Pos logit contributions
 lake+0.049
neath+0.046
 camp+0.045
 creek+0.045
 canyon+0.045
Neg logit contributions
Of-0.046
olding-0.045
 forgiveness-0.043
aid-0.041
asing-0.040
Token a
Token ID257
Feature activation-1.97e+00

Pos logit contributions
neath+0.051
 canyon+0.050
 lake+0.049
 ditch+0.048
 ocean+0.046
Neg logit contributions
 forgiveness-0.077
Of-0.077
 apology-0.072
 forgive-0.067
 honesty-0.066
Token wide
Token ID3094
Feature activation-4.56e+00

Pos logit contributions
neath+0.082
 valleys+0.070
 tim+0.066
 canyon+0.066
 fields+0.063
Neg logit contributions
Of-0.099
 else-0.095
 forgiveness-0.093
 respectful-0.088
olding-0.087
Token,
Token ID11
Feature activation-1.47e+00

Pos logit contributions
neath+0.167
 tim+0.134
 marsh+0.129
unks+0.128
 valleys+0.124
Neg logit contributions
Of-0.198
 promise-0.198
keeper-0.197
 respectful-0.194
 friend-0.193
Token open
Token ID1280
Feature activation-2.21e+00

Pos logit contributions
neath+0.064
 camp+0.061
 lake+0.059
 shore+0.058
 canyon+0.057
Neg logit contributions
Of-0.072
olding-0.065
 forgiveness-0.064
 respectful-0.063
aid-0.063
Token field
Token ID2214
Feature activation-1.12e+00

Pos logit contributions
neath+0.065
 tim+0.063
 camp+0.055
 shore+0.054
acket+0.053
Neg logit contributions
 forgiveness-0.118
Of-0.116
olding-0.113
aser-0.112
aid-0.111
Token.
Token ID13
Feature activation-1.05e+00

Pos logit contributions
neath+0.056
 lake+0.055
 camp+0.055
 canyon+0.054
 creek+0.052
Neg logit contributions
olding-0.044
Of-0.043
 forgiveness-0.042
aid-0.040
 apology-0.039
Token In
Token ID554
Feature activation-1.35e+00

Pos logit contributions
 lake+0.043
 camp+0.043
 canyon+0.042
 ocean+0.041
 shore+0.041
Neg logit contributions
Of-0.055
olding-0.050
 forgiveness-0.050
 apology-0.049
aid-0.048
Token,
Token ID11
Feature activation-1.55e+00

Pos logit contributions
 lake+0.056
 canyon+0.054
 river+0.053
 clearing+0.053
neath+0.052
Neg logit contributions
Of-0.060
olding-0.059
 else-0.058
 forgiveness-0.058
olded-0.055
Token the
Token ID262
Feature activation-9.31e-01

Pos logit contributions
 canyon+0.063
 lake+0.061
 camp+0.059
 river+0.059
 shore+0.057
Neg logit contributions
Of-0.080
 else-0.076
 forgiveness-0.073
olding-0.073
aid-0.070
Token beetle
Token ID45489
Feature activation-1.16e+00

Pos logit contributions
neath+0.039
 valleys+0.036
 canyon+0.036
 lake+0.035
 clearing+0.035
Neg logit contributions
Of-0.048
 forgiveness-0.046
aid-0.045
olding-0.045
 apology-0.044
Token sailed
Token ID33835
Feature activation-2.95e-01

Pos logit contributions
neath+0.055
 lake+0.055
 camp+0.054
 shore+0.053
 canyon+0.053
Neg logit contributions
Of-0.047
olding-0.045
 else-0.045
aid-0.044
 forgiveness-0.044
Token across
Token ID1973
Feature activation-3.58e+00

Pos logit contributions
 lake+0.012
neath+0.012
 camp+0.012
 canyon+0.011
 shore+0.011
Neg logit contributions
Of-0.015
aid-0.014
olding-0.014
 forgiveness-0.014
 apology-0.014
Token the
Token ID262
Feature activation-4.53e+00

Pos logit contributions
acket+0.197
 tim+0.179
latable+0.175
icer+0.166
neath+0.163
Neg logit contributions
 forgiveness-0.121
 else-0.104
 she-0.091
aid-0.091
 lesson-0.090
Token sea
Token ID5417
Feature activation-1.40e+00

Pos logit contributions
neath+0.171
 tim+0.149
 OUT+0.140
und+0.135
anced+0.127
Neg logit contributions
 valuable-0.196
 owner-0.188
 important-0.187
 respectful-0.187
 else-0.184
Token.
Token ID13
Feature activation-2.29e+00

Pos logit contributions
neath+0.079
 lake+0.073
 camp+0.071
 canyon+0.071
 ditch+0.069
Neg logit contributions
olding-0.047
Of-0.044
 else-0.044
 respectful-0.043
 forgiveness-0.042
Token
Token ID220
Feature activation+1.03e+00

Pos logit contributions
 camp+0.109
neath+0.108
 lake+0.107
 canyon+0.107
 waterfall+0.104
Neg logit contributions
Of-0.101
olding-0.084
 apology-0.082
aid-0.081
 forgiveness-0.079
Token She
Token ID1375
Feature activation-1.60e+00

Pos logit contributions
Of+0.058
aid+0.057
 forgiveness+0.057
olding+0.056
 apology+0.055
Neg logit contributions
 lake-0.036
neath-0.035
 camp-0.034
 shore-0.033
 creek-0.032
Token was
Token ID373
Feature activation-2.16e+00

Pos logit contributions
neath+0.084
 camp+0.078
 creek+0.078
 clearing+0.077
 lake+0.077
Neg logit contributions
 else-0.058
Of-0.057
 forgiveness-0.052
olding-0.051
aid-0.051
Token She
Token ID1375
Feature activation-1.24e+00

Pos logit contributions
 lake+0.030
 camp+0.029
neath+0.028
 river+0.028
 canyon+0.028
Neg logit contributions
Of-0.035
olding-0.032
 apology-0.031
 forgiveness-0.031
Deal-0.031
Token spotted
Token ID13489
Feature activation-7.64e-01

Pos logit contributions
neath+0.059
 lake+0.059
 shore+0.057
 canyon+0.057
 ditch+0.056
Neg logit contributions
Of-0.053
 else-0.048
olding-0.047
aid-0.047
olded-0.046
Token a
Token ID257
Feature activation-2.01e+00

Pos logit contributions
 lake+0.025
 canyon+0.024
neath+0.024
 camp+0.024
 river+0.024
Neg logit contributions
Of-0.044
 forgiveness-0.044
aid-0.043
 apology-0.040
olding-0.040
Token big
Token ID1263
Feature activation-3.45e+00

Pos logit contributions
neath+0.089
 valleys+0.082
 canyon+0.076
 camp+0.073
 fields+0.073
Neg logit contributions
Of-0.096
 forgiveness-0.091
 else-0.089
aid-0.086
olding-0.086
Token,
Token ID11
Feature activation-1.66e+00

Pos logit contributions
neath+0.153
 valleys+0.148
 canyon+0.126
 tim+0.125
growth+0.125
Neg logit contributions
 forgiveness-0.148
 apology-0.140
Of-0.137
 promise-0.137
aser-0.129
Token wide
Token ID3094
Feature activation-4.50e+00

Pos logit contributions
neath+0.078
 shore+0.073
 camp+0.073
 lake+0.072
 valleys+0.071
Neg logit contributions
Of-0.076
 forgiveness-0.069
aid-0.068
olding-0.068
 apology-0.066
Token tree
Token ID5509
Feature activation-1.31e+00

Pos logit contributions
neath+0.201
 valleys+0.176
unders+0.152
unks+0.150
 takeoff+0.150
Neg logit contributions
Of-0.177
 promise-0.171
 apology-0.169
olding-0.167
 understanding-0.165
Token in
Token ID287
Feature activation-1.64e+00

Pos logit contributions
neath+0.061
 camp+0.060
 lake+0.058
 shore+0.058
 canyon+0.056
Neg logit contributions
olding-0.055
Of-0.053
aid-0.051
 forgiveness-0.050
asing-0.049
Token the
Token ID262
Feature activation-1.25e+00

Pos logit contributions
 tim+0.065
acket+0.065
 canyon+0.062
 camp+0.061
 river+0.060
Neg logit contributions
 forgiveness-0.077
Of-0.076
 apology-0.069
aid-0.069
 understanding-0.069
Token distance
Token ID5253
Feature activation-2.98e-03

Pos logit contributions
neath+0.039
 camp+0.037
 canyon+0.035
 lake+0.034
 shore+0.033
Neg logit contributions
Of-0.075
aid-0.071
 forgiveness-0.069
olding-0.068
Deal-0.067
Token and
Token ID290
Feature activation-4.63e-01

Pos logit contributions
 lake+0.000
neath+0.000
 camp+0.000
 creek+0.000
 canyon+0.000
Neg logit contributions
Of-0.000
olding-0.000
 forgiveness-0.000
aid-0.000
 apology-0.000
Token stay
Token ID2652
Feature activation+2.58e-01

Pos logit contributions
aid+0.090
Of+0.090
olding+0.089
aser+0.085
 apology+0.083
Neg logit contributions
 lake-0.078
 shore-0.077
 camp-0.075
 river-0.074
 creek-0.073
Token happy
Token ID3772
Feature activation+9.51e-01

Pos logit contributions
Of+0.013
aid+0.012
 forgiveness+0.012
olding+0.012
 apology+0.012
Neg logit contributions
 lake-0.011
 camp-0.011
neath-0.011
 shore-0.011
 creek-0.010
Token.
Token ID13
Feature activation+6.77e-01

Pos logit contributions
Of+0.054
aid+0.052
olding+0.051
 forgiveness+0.050
 apology+0.050
Neg logit contributions
 lake-0.035
 camp-0.034
 shore-0.033
neath-0.032
 river-0.032
Token<|endoftext|>
Token ID50256
Feature activation-2.74e+00

Pos logit contributions
Of+0.037
aid+0.037
olding+0.036
 forgiveness+0.036
 apology+0.035
Neg logit contributions
 lake-0.025
neath-0.024
 camp-0.024
 shore-0.023
 river-0.023
TokenLittle
Token ID22253
Feature activation-1.49e+00

Pos logit contributions
 canyon+0.136
 river+0.133
 shore+0.132
 ditch+0.128
 lake+0.128
Neg logit contributions
Of-0.099
 forgiveness-0.092
aid-0.091
olding-0.088
 else-0.088
Token Sammy
Token ID44067
Feature activation-4.49e+00

Pos logit contributions
 ditch+0.058
 canyon+0.057
 shore+0.057
 valleys+0.056
 camp+0.056
Neg logit contributions
Of-0.078
olding-0.072
 forgiveness-0.071
 apology-0.068
aid-0.068
Token always
Token ID1464
Feature activation-1.13e+00

Pos logit contributions
 ditch+0.247
 canyon+0.242
 clearing+0.235
 camp+0.229
 takeoff+0.229
Neg logit contributions
Of-0.125
 cared-0.125
 cares-0.123
 else-0.119
Deal-0.119
Token liked
Token ID8288
Feature activation-8.88e-01

Pos logit contributions
 lake+0.059
neath+0.059
 creek+0.057
 canyon+0.057
 camp+0.057
Neg logit contributions
Of-0.043
 forgiveness-0.041
olding-0.040
aid-0.040
 respectful-0.038
Token to
Token ID284
Feature activation-1.07e+00

Pos logit contributions
neath+0.048
 lake+0.046
 clearing+0.045
 canyon+0.045
 camp+0.044
Neg logit contributions
Of-0.033
 forgiveness-0.033
olding-0.031
aid-0.030
 apology-0.030
Token explore
Token ID7301
Feature activation-1.53e+00

Pos logit contributions
neath+0.044
 lake+0.043
 clearing+0.042
 valleys+0.042
 canyon+0.042
Neg logit contributions
 forgiveness-0.053
Of-0.052
 apology-0.051
aid-0.051
olding-0.049
Token,
Token ID11
Feature activation-1.14e+00

Pos logit contributions
neath+0.058
 canyon+0.057
 lake+0.055
 ditch+0.055
 shore+0.053
Neg logit contributions
 forgiveness-0.077
Of-0.073
 else-0.072
 apology-0.068
aid-0.067
Token a
Token ID257
Feature activation-2.10e-01

Pos logit contributions
 lake+0.035
neath+0.035
 canyon+0.034
 river+0.033
 shore+0.033
Neg logit contributions
Of-0.057
aid-0.057
 forgiveness-0.056
 else-0.054
olding-0.053
Token time
Token ID640
Feature activation-5.39e-01

Pos logit contributions
 lake+0.007
neath+0.007
 shore+0.007
 camp+0.007
 river+0.007
Neg logit contributions
Of-0.012
olding-0.012
aid-0.011
 forgiveness-0.011
 apology-0.011
Token,
Token ID11
Feature activation-2.43e-01

Pos logit contributions
 lake+0.020
 shore+0.019
neath+0.019
 river+0.019
 canyon+0.019
Neg logit contributions
Of-0.029
olding-0.029
 forgiveness-0.028
aid-0.028
 apology-0.027
Token in
Token ID287
Feature activation-1.29e+00

Pos logit contributions
 lake+0.009
 shore+0.009
 camp+0.009
 river+0.009
 canyon+0.009
Neg logit contributions
Of-0.013
olding-0.013
aid-0.013
 forgiveness-0.013
 apology-0.012
Token a
Token ID257
Feature activation-1.94e+00

Pos logit contributions
 shore+0.047
 river+0.046
 lake+0.045
 canyon+0.045
 ocean+0.044
Neg logit contributions
Of-0.072
aid-0.068
 forgiveness-0.067
 apology-0.064
olding-0.063
Token peaceful
Token ID12309
Feature activation-4.49e+00

Pos logit contributions
neath+0.095
 canyon+0.086
 shore+0.086
 waterfall+0.086
acket+0.085
Neg logit contributions
 else-0.077
Of-0.074
 respectful-0.073
Deal-0.066
 caring-0.065
Token land
Token ID1956
Feature activation-9.00e-02

Pos logit contributions
neath+0.243
 ditch+0.212
 valleys+0.202
unks+0.201
 gonna+0.201
Neg logit contributions
 friend-0.139
 song-0.127
Of-0.122
 respectful-0.117
 else-0.117
Token,
Token ID11
Feature activation-1.21e+00

Pos logit contributions
 lake+0.004
neath+0.004
 camp+0.004
 shore+0.004
 creek+0.003
Neg logit contributions
Of-0.005
olding-0.004
aid-0.004
 forgiveness-0.004
 apology-0.004
Token there
Token ID612
Feature activation-1.92e+00

Pos logit contributions
 canyon+0.055
 shore+0.055
 lake+0.055
 camp+0.053
 river+0.053
Neg logit contributions
Of-0.054
 else-0.051
 forgiveness-0.050
aid-0.049
olding-0.048
Token lived
Token ID5615
Feature activation-1.49e+00

Pos logit contributions
 canyon+0.119
 ditch+0.117
 shore+0.112
 creek+0.111
 rav+0.111
Neg logit contributions
 else-0.054
 forgiveness-0.051
Of-0.045
Deal-0.044
 forgive-0.044
Token a
Token ID257
Feature activation-1.06e+00

Pos logit contributions
 lake+0.063
 canyon+0.062
neath+0.062
 river+0.061
 shore+0.060
Neg logit contributions
Of-0.073
aid-0.072
 forgiveness-0.071
 forgive-0.068
 else-0.066
Token pocket
Token ID10000
Feature activation+1.05e+00

Pos logit contributions
Of+0.046
aid+0.045
olding+0.044
 forgiveness+0.043
asing+0.042
Neg logit contributions
 lake-0.038
 camp-0.036
 shore-0.035
 river-0.034
 ocean-0.034
Token.
Token ID13
Feature activation-2.74e-01

Pos logit contributions
Of+0.055
aid+0.053
 forgiveness+0.052
olding+0.052
 apology+0.052
Neg logit contributions
 lake-0.042
 camp-0.040
neath-0.039
 shore-0.039
 river-0.039
Token<|endoftext|>
Token ID50256
Feature activation-2.91e+00

Pos logit contributions
 lake+0.011
 camp+0.011
neath+0.011
 canyon+0.011
 shore+0.011
Neg logit contributions
Of-0.014
 forgiveness-0.013
olding-0.013
aid-0.013
 apology-0.013
TokenLittle
Token ID22253
Feature activation-1.70e+00

Pos logit contributions
 canyon+0.152
 river+0.146
 shore+0.145
 ditch+0.145
 lake+0.141
Neg logit contributions
Of-0.097
 forgiveness-0.090
aid-0.089
 else-0.086
olding-0.084
Token Tim
Token ID5045
Feature activation-3.36e+00

Pos logit contributions
 canyon+0.063
 ditch+0.062
 camp+0.062
 valleys+0.062
 shore+0.061
Neg logit contributions
Of-0.092
olding-0.085
 forgiveness-0.084
aid-0.081
 apology-0.081
Tokenmy
Token ID1820
Feature activation-4.47e+00

Pos logit contributions
 canyon+0.234
 ditch+0.227
 shore+0.226
 creek+0.225
 waterfall+0.224
Neg logit contributions
olding-0.050
aid-0.049
Of-0.048
 cared-0.046
 cares-0.044
Token was
Token ID373
Feature activation-2.35e+00

Pos logit contributions
 canyon+0.260
 waterfall+0.252
 ditch+0.251
 clearing+0.249
 rav+0.247
Neg logit contributions
 cared-0.115
 cares-0.107
grand-0.107
 else-0.105
 actions-0.099
Token walking
Token ID6155
Feature activation-1.34e+00

Pos logit contributions
 lake+0.108
 creek+0.106
neath+0.104
 valleys+0.102
 waterfall+0.102
Neg logit contributions
olding-0.099
Of-0.095
olded-0.090
aid-0.089
 forgiveness-0.089
Token in
Token ID287
Feature activation-2.66e+00

Pos logit contributions
neath+0.052
 canyon+0.052
 lake+0.050
 ditch+0.047
 camp+0.047
Neg logit contributions
Of-0.068
olding-0.063
aid-0.059
 forgiveness-0.059
 apology-0.057
Token the
Token ID262
Feature activation-2.69e+00

Pos logit contributions
acket+0.119
 canyon+0.109
neath+0.108
 rav+0.102
 tim+0.102
Neg logit contributions
 forgiveness-0.103
Of-0.096
 understanding-0.090
 honesty-0.089
 questions-0.089
Token park
Token ID3952
Feature activation-9.78e-01

Pos logit contributions
 tim+0.084
neath+0.083
 canyon+0.075
acket+0.072
 takeoff+0.071
Neg logit contributions
Of-0.140
 forgiveness-0.132
 else-0.127
 respectful-0.123
 owner-0.118
Token Daisy
Token ID40355
Feature activation+5.68e-02

Pos logit contributions
 lake+0.014
 camp+0.014
 canyon+0.013
 shore+0.013
neath+0.013
Neg logit contributions
Of-0.019
 forgiveness-0.017
aid-0.017
olding-0.017
 apology-0.017
Token's
Token ID338
Feature activation+1.81e+00

Pos logit contributions
Of+0.003
aid+0.003
olding+0.003
 forgiveness+0.003
 apology+0.002
Neg logit contributions
 lake-0.003
 camp-0.003
neath-0.002
 shore-0.002
 creek-0.002
Token mom
Token ID1995
Feature activation+1.12e+00

Pos logit contributions
aid+0.082
Of+0.079
olding+0.078
asing+0.074
olded+0.074
Neg logit contributions
 lake-0.090
 ocean-0.085
 river-0.085
 pool-0.085
 shore-0.084
Token was
Token ID373
Feature activation+1.32e-01

Pos logit contributions
Of+0.059
 forgiveness+0.059
 apology+0.059
olding+0.059
aid+0.058
Neg logit contributions
 lake-0.043
neath-0.042
 camp-0.040
 river-0.039
 ocean-0.039
Token very
Token ID845
Feature activation+1.70e+00

Pos logit contributions
Of+0.006
aid+0.006
olding+0.006
 forgiveness+0.006
 apology+0.006
Neg logit contributions
 lake-0.006
 camp-0.006
neath-0.006
 creek-0.006
 shore-0.006
Token proud
Token ID6613
Feature activation+2.56e+00

Pos logit contributions
 forgiveness+0.082
aid+0.082
 apology+0.080
Of+0.079
asing+0.076
Neg logit contributions
 shore-0.068
 lake-0.068
 camp-0.066
 river-0.065
 ocean-0.065
Token of
Token ID286
Feature activation+4.79e-01

Pos logit contributions
aid+0.140
 apology+0.138
Deal+0.132
Of+0.132
olding+0.128
Neg logit contributions
 ocean-0.099
 lake-0.098
 camp-0.095
 pool-0.094
 shore-0.091
Token her
Token ID607
Feature activation+1.72e+00

Pos logit contributions
Of+0.024
aid+0.023
olding+0.023
 forgiveness+0.022
 apology+0.022
Neg logit contributions
 lake-0.021
neath-0.020
 camp-0.020
 shore-0.020
 creek-0.020
Token.
Token ID13
Feature activation-3.75e-01

Pos logit contributions
aid+0.104
Of+0.101
 forgiveness+0.094
olding+0.093
olded+0.092
Neg logit contributions
 lake-0.056
 camp-0.054
 river-0.052
 shore-0.052
neath-0.051
Token
Token ID198
Feature activation-7.56e-01

Pos logit contributions
 lake+0.016
 camp+0.016
 canyon+0.015
neath+0.015
 shore+0.015
Neg logit contributions
Of-0.019
olding-0.017
 forgiveness-0.017
aid-0.017
 apology-0.017
Token
Token ID198
Feature activation-1.10e-01

Pos logit contributions
 lake+0.036
 canyon+0.035
 camp+0.035
neath+0.035
 shore+0.034
Neg logit contributions
Of-0.033
aid-0.031
olding-0.031
 forgiveness-0.030
 apology-0.030
Tokenmy
Token ID1820
Feature activation+1.14e+00

Pos logit contributions
Of+0.055
 forgiveness+0.054
olding+0.053
 apology+0.053
aid+0.053
Neg logit contributions
 lake-0.044
neath-0.042
 camp-0.042
 river-0.040
 ocean-0.040
Token smiled
Token ID13541
Feature activation+1.54e-01

Pos logit contributions
Of+0.041
 forgiveness+0.041
olding+0.041
aid+0.041
 apology+0.040
Neg logit contributions
 lake-0.062
neath-0.059
 camp-0.059
 river-0.057
 shore-0.057
Token and
Token ID290
Feature activation+9.33e-02

Pos logit contributions
Of+0.009
aid+0.009
 forgiveness+0.009
olding+0.009
 apology+0.008
Neg logit contributions
 lake-0.005
neath-0.005
 camp-0.005
 shore-0.005
 river-0.005
Token said
Token ID531
Feature activation+8.82e-01

Pos logit contributions
Of+0.003
aid+0.003
 forgiveness+0.003
olding+0.003
 apology+0.003
Neg logit contributions
 lake-0.006
 camp-0.006
neath-0.006
 shore-0.005
 creek-0.005
Token,
Token ID11
Feature activation-2.37e-02

Pos logit contributions
Of+0.058
aid+0.056
olding+0.056
 forgiveness+0.056
 apology+0.055
Neg logit contributions
 lake-0.022
 camp-0.021
neath-0.021
 shore-0.021
 river-0.020
Token "
Token ID366
Feature activation+3.27e+00

Pos logit contributions
 lake+0.001
 camp+0.001
neath+0.001
 shore+0.001
 creek+0.001
Neg logit contributions
Of-0.001
aid-0.001
olding-0.001
 forgiveness-0.001
 apology-0.001
TokenNo
Token ID2949
Feature activation+6.90e-01

Pos logit contributions
 apology+0.117
 forgiveness+0.117
aser+0.114
 classmates+0.110
 actions+0.110
Neg logit contributions
 lake-0.175
 clearing-0.170
neath-0.168
 valleys-0.161
 river-0.160
Token,
Token ID11
Feature activation+3.47e-01

Pos logit contributions
Of+0.045
aid+0.044
olding+0.043
 forgiveness+0.042
 apology+0.041
Neg logit contributions
 lake-0.020
 camp-0.019
neath-0.018
 shore-0.017
 river-0.017
Token sweet
Token ID6029
Feature activation+2.49e-01

Pos logit contributions
Of+0.016
aid+0.015
olding+0.015
 forgiveness+0.015
 apology+0.015
Neg logit contributions
 lake-0.016
 camp-0.016
neath-0.016
 shore-0.016
 river-0.015
Tokenie
Token ID494
Feature activation+2.28e-01

Pos logit contributions
Of+0.009
aid+0.008
olding+0.008
 forgiveness+0.008
 apology+0.008
Neg logit contributions
 lake-0.014
neath-0.014
 camp-0.014
 shore-0.014
 river-0.013
Token,
Token ID11
Feature activation+8.71e-01

Pos logit contributions
Of+0.012
aid+0.012
olding+0.012
 forgiveness+0.012
 apology+0.011
Neg logit contributions
 lake-0.009
neath-0.009
 camp-0.008
 shore-0.008
 creek-0.008
Token to
Token ID284
Feature activation+1.51e+00

Pos logit contributions
Of+0.073
aid+0.070
 forgiveness+0.068
olding+0.068
 apology+0.067
Neg logit contributions
 camp-0.049
 lake-0.048
 shore-0.048
 fields-0.046
 river-0.046
Token make
Token ID787
Feature activation+4.65e-01

Pos logit contributions
Of+0.086
olding+0.083
aid+0.081
asing+0.080
aser+0.079
Neg logit contributions
 lake-0.054
 shore-0.053
 river-0.053
 camp-0.052
 creek-0.050
Token Sara
Token ID24799
Feature activation+1.18e+00

Pos logit contributions
Of+0.024
aid+0.023
olding+0.023
 forgiveness+0.023
 apology+0.022
Neg logit contributions
 lake-0.019
 camp-0.019
neath-0.018
 shore-0.018
 river-0.018
Token and
Token ID290
Feature activation+2.63e+00

Pos logit contributions
Of+0.057
aid+0.053
olding+0.052
 forgiveness+0.052
Deal+0.051
Neg logit contributions
 lake-0.051
 camp-0.051
neath-0.049
 shore-0.049
 river-0.049
Token mom
Token ID1995
Feature activation+2.41e+00

Pos logit contributions
oral+0.155
olding+0.152
Of+0.151
aid+0.146
aser+0.146
Neg logit contributions
 camp-0.073
 river-0.071
 shore-0.069
 pool-0.068
 lake-0.067
Token angry
Token ID7954
Feature activation+2.94e+00

Pos logit contributions
Of+0.133
aid+0.119
Deal+0.119
aser+0.119
 forgiveness+0.117
Neg logit contributions
 lake-0.079
 pool-0.078
 camp-0.078
 river-0.077
 shore-0.076
Token.
Token ID13
Feature activation+4.86e-01

Pos logit contributions
Of+0.210
aid+0.204
grand+0.195
Deal+0.194
asers+0.188
Neg logit contributions
 shore-0.057
 ocean-0.049
 sea-0.047
 lake-0.045
 river-0.045
Token He
Token ID679
Feature activation+1.77e+00

Pos logit contributions
Of+0.030
aid+0.029
 forgiveness+0.029
olding+0.029
 apology+0.028
Neg logit contributions
 lake-0.015
 camp-0.015
neath-0.014
 shore-0.014
 river-0.014
Token said
Token ID531
Feature activation+1.28e+00

Pos logit contributions
 forgiveness+0.086
olding+0.085
asing+0.082
Of+0.081
 apology+0.080
Neg logit contributions
 lake-0.077
 river-0.071
 camp-0.069
 shore-0.068
 clearing-0.068
Token sorry
Token ID7926
Feature activation+2.44e+00

Pos logit contributions
Of+0.061
olding+0.060
asing+0.057
 apology+0.056
aid+0.056
Neg logit contributions
 camp-0.055
 lake-0.054
 shore-0.054
 river-0.052
 ocean-0.051
Token to
Token ID284
Feature activation+1.02e+00

Pos logit contributions
Of+0.156
 forgiveness+0.153
asers+0.151
Deal+0.150
aid+0.149
Neg logit contributions
 camp-0.056
 river-0.051
 pool-0.049
 lake-0.048
 shore-0.048
Token she
Token ID673
Feature activation+2.00e-01

Pos logit contributions
Of+0.029
aid+0.028
olding+0.028
 forgiveness+0.028
 apology+0.027
Neg logit contributions
 lake-0.032
 camp-0.031
neath-0.031
 shore-0.031
 creek-0.030
Token called
Token ID1444
Feature activation-6.47e-01

Pos logit contributions
Of+0.008
aid+0.008
 forgiveness+0.008
olding+0.008
 apology+0.007
Neg logit contributions
 lake-0.010
neath-0.010
 camp-0.010
 shore-0.010
 creek-0.010
Token.
Token ID13
Feature activation+2.31e-01

Pos logit contributions
 lake+0.032
neath+0.031
 ocean+0.030
 canyon+0.030
 river+0.029
Neg logit contributions
 forgiveness-0.028
olding-0.027
 apology-0.027
Of-0.027
aid-0.026
Token
Token ID198
Feature activation+1.00e-01

Pos logit contributions
Of+0.012
aid+0.012
olding+0.012
 forgiveness+0.011
 apology+0.011
Neg logit contributions
 lake-0.009
 camp-0.009
neath-0.009
 shore-0.009
 river-0.009
Token
Token ID198
Feature activation+2.11e-01

Pos logit contributions
Of+0.005
aid+0.005
 forgiveness+0.005
olding+0.005
 apology+0.005
Neg logit contributions
 lake-0.004
neath-0.004
 camp-0.004
 shore-0.004
 river-0.004
Token"
Token ID1
Feature activation+2.53e+00

Pos logit contributions
Of+0.008
aid+0.007
olding+0.007
 forgiveness+0.007
 apology+0.007
Neg logit contributions
 lake-0.012
 camp-0.012
neath-0.012
 shore-0.011
 river-0.011
TokenYes
Token ID5297
Feature activation-2.04e-01

Pos logit contributions
asing+0.123
olding+0.122
 apology+0.120
aser+0.119
asers+0.118
Neg logit contributions
 lake-0.107
 creek-0.101
 pool-0.101
neath-0.098
 camp-0.097
Token,
Token ID11
Feature activation+1.32e+00

Pos logit contributions
 lake+0.007
neath+0.007
 valleys+0.007
 camp+0.007
 canyon+0.007
Neg logit contributions
Of-0.012
 forgiveness-0.011
olding-0.011
 apology-0.011
aid-0.011
Token mom
Token ID1995
Feature activation+1.73e-01

Pos logit contributions
aid+0.068
olding+0.067
Of+0.067
asing+0.066
 forgiveness+0.065
Neg logit contributions
 lake-0.053
 camp-0.051
 shore-0.050
 river-0.049
 creek-0.049
Token,
Token ID11
Feature activation+1.05e+00

Pos logit contributions
Of+0.010
aid+0.010
olding+0.010
 forgiveness+0.010
 apology+0.010
Neg logit contributions
 lake-0.006
neath-0.006
 camp-0.006
 shore-0.005
 creek-0.005
Token we
Token ID356
Feature activation-2.47e-02

Pos logit contributions
Of+0.052
aid+0.051
olding+0.051
 forgiveness+0.049
asing+0.049
Neg logit contributions
 lake-0.045
 camp-0.043
 shore-0.042
 river-0.042
 ocean-0.041
Token hugged
Token ID41130
Feature activation+6.60e-01

Pos logit contributions
olding+0.096
asers+0.095
 apology+0.094
asing+0.093
 forgiveness+0.093
Neg logit contributions
 lake-0.089
neath-0.086
 clearing-0.081
 camp-0.081
 pool-0.079
Token him
Token ID683
Feature activation+1.17e+00

Pos logit contributions
Of+0.041
olding+0.041
aid+0.040
 forgiveness+0.040
 apology+0.039
Neg logit contributions
 lake-0.020
 camp-0.019
neath-0.019
 shore-0.018
 ocean-0.017
Token and
Token ID290
Feature activation+2.20e+00

Pos logit contributions
Of+0.068
aid+0.066
 forgiveness+0.066
olding+0.065
asing+0.063
Neg logit contributions
 lake-0.038
 camp-0.037
neath-0.036
 shore-0.035
 river-0.034
Token said
Token ID531
Feature activation+1.65e+00

Pos logit contributions
olding+0.103
aser+0.098
asing+0.097
asers+0.094
 apology+0.094
Neg logit contributions
 lake-0.097
 camp-0.093
neath-0.091
 fields-0.091
 shore-0.091
Token,
Token ID11
Feature activation+1.02e+00

Pos logit contributions
Of+0.122
aid+0.117
olding+0.117
 apology+0.114
grand+0.114
Neg logit contributions
 camp-0.024
 lake-0.024
 shore-0.022
neath-0.020
 pool-0.019
Token "
Token ID366
Feature activation+3.24e+00

Pos logit contributions
aid+0.055
Of+0.055
 forgiveness+0.055
olding+0.054
 apology+0.054
Neg logit contributions
 lake-0.037
 camp-0.035
 shore-0.035
neath-0.035
 river-0.034
TokenI
Token ID40
Feature activation+1.69e+00

Pos logit contributions
 apology+0.175
olding+0.174
aser+0.172
asers+0.171
asing+0.164
Neg logit contributions
 lake-0.120
 clearing-0.105
 river-0.101
 valleys-0.101
 fields-0.101
Token love
Token ID1842
Feature activation+1.10e+00

Pos logit contributions
aser+0.078
asers+0.077
asing+0.076
olding+0.073
 forgiveness+0.072
Neg logit contributions
 camp-0.073
 lake-0.071
 pool-0.066
 river-0.066
 clearing-0.065
Token you
Token ID345
Feature activation+1.76e+00

Pos logit contributions
olding+0.051
Of+0.051
aid+0.050
 apology+0.047
olded+0.046
Neg logit contributions
 lake-0.052
 camp-0.051
 shore-0.050
 ocean-0.049
 fields-0.048
Token,
Token ID11
Feature activation+8.08e-01

Pos logit contributions
Of+0.113
aid+0.113
olding+0.110
grand+0.109
 apology+0.105
Neg logit contributions
 camp-0.045
 lake-0.044
 ocean-0.040
 pool-0.040
 river-0.037
Token but
Token ID475
Feature activation+1.96e+00

Pos logit contributions
Of+0.041
aid+0.041
olding+0.040
 forgiveness+0.039
asing+0.038
Neg logit contributions
 lake-0.034
 camp-0.032
 shore-0.031
 ocean-0.031
 river-0.031
Token leave
Token ID2666
Feature activation+3.61e-01

Pos logit contributions
neath+0.014
 lake+0.013
 valleys+0.013
 camp+0.013
 canyon+0.013
Neg logit contributions
Of-0.015
 forgiveness-0.015
aid-0.015
olding-0.014
 apology-0.014
Token Lily
Token ID20037
Feature activation+4.10e-01

Pos logit contributions
Of+0.021
aid+0.021
olding+0.021
 forgiveness+0.020
 apology+0.020
Neg logit contributions
 lake-0.012
 camp-0.012
neath-0.012
 shore-0.011
 river-0.011
Token and
Token ID290
Feature activation+1.73e+00

Pos logit contributions
Of+0.022
aid+0.021
 forgiveness+0.021
olding+0.021
 apology+0.021
Neg logit contributions
 lake-0.016
 camp-0.015
neath-0.015
 shore-0.015
 river-0.014
Token Ben
Token ID3932
Feature activation+7.09e-01

Pos logit contributions
Of+0.089
olding+0.089
aid+0.088
 apology+0.085
aser+0.082
Neg logit contributions
 camp-0.071
 lake-0.068
 shore-0.068
 river-0.065
 ocean-0.065
Token alone
Token ID3436
Feature activation+1.36e+00

Pos logit contributions
Of+0.037
aid+0.036
 forgiveness+0.036
olding+0.036
 apology+0.035
Neg logit contributions
 lake-0.028
 camp-0.027
neath-0.027
 shore-0.027
 river-0.026
Token.
Token ID13
Feature activation+6.00e-01

Pos logit contributions
Of+0.076
aid+0.074
 forgiveness+0.071
olding+0.071
 apology+0.071
Neg logit contributions
 camp-0.050
 lake-0.050
 shore-0.048
 river-0.046
 pool-0.046
Token
Token ID198
Feature activation+8.14e-01

Pos logit contributions
Of+0.035
aid+0.035
 forgiveness+0.034
olding+0.034
 apology+0.033
Neg logit contributions
 lake-0.020
 camp-0.019
neath-0.019
 shore-0.019
 river-0.018
Token
Token ID198
Feature activation+1.01e+00

Pos logit contributions
Of+0.046
aid+0.045
 forgiveness+0.045
olding+0.044
 apology+0.043
Neg logit contributions
 lake-0.029
 camp-0.028
neath-0.027
 shore-0.027
 creek-0.026
TokenThe
Token ID464
Feature activation+1.12e+00

Pos logit contributions
 forgiveness+0.041
Of+0.040
aid+0.039
 apology+0.039
olding+0.039
Neg logit contributions
 lake-0.052
 camp-0.050
neath-0.050
 clearing-0.049
 shore-0.049
Token boy
Token ID2933
Feature activation+6.72e-01

Pos logit contributions
Of+0.055
aid+0.053
olding+0.053
 forgiveness+0.052
asing+0.051
Neg logit contributions
 lake-0.050
 camp-0.048
 shore-0.047
 river-0.047
 creek-0.047
Token and
Token ID290
Feature activation+2.89e-01

Pos logit contributions
Of+0.035
aid+0.033
 forgiveness+0.033
olding+0.033
 apology+0.033
Neg logit contributions
 lake-0.028
 camp-0.026
neath-0.026
 shore-0.026
 river-0.025
Token a
Token ID257
Feature activation-7.92e-01

Pos logit contributions
 canyon+0.087
 river+0.082
 valleys+0.082
 lake+0.081
 creek+0.079
Neg logit contributions
Of-0.081
 forgiveness-0.079
aid-0.076
 else-0.073
 apology-0.072
Token little
Token ID1310
Feature activation-7.18e-01

Pos logit contributions
neath+0.039
 valleys+0.036
 lake+0.036
 canyon+0.035
 shore+0.035
Neg logit contributions
Of-0.036
aid-0.033
 forgiveness-0.033
olding-0.033
 apology-0.032
Token girl
Token ID2576
Feature activation+1.29e+00

Pos logit contributions
neath+0.045
 valleys+0.043
 lake+0.043
 canyon+0.042
 shore+0.042
Neg logit contributions
Of-0.022
aid-0.021
 forgiveness-0.020
olding-0.019
 apology-0.018
Token named
Token ID3706
Feature activation+4.00e-01

Pos logit contributions
Of+0.078
 apology+0.076
aid+0.075
 forgiveness+0.074
olding+0.073
Neg logit contributions
 lake-0.042
 camp-0.040
 river-0.039
 ocean-0.038
 pool-0.038
Token Lily
Token ID20037
Feature activation+8.64e-01

Pos logit contributions
Of+0.019
aid+0.019
 forgiveness+0.019
olding+0.019
 apology+0.018
Neg logit contributions
 lake-0.018
 camp-0.017
neath-0.017
 shore-0.017
 creek-0.016
Token.
Token ID13
Feature activation+1.62e-01

Pos logit contributions
Of+0.046
aid+0.044
 forgiveness+0.043
 apology+0.043
olding+0.043
Neg logit contributions
 lake-0.035
 camp-0.033
neath-0.033
 shore-0.033
 river-0.032
Token She
Token ID1375
Feature activation+2.36e-01

Pos logit contributions
Of+0.009
aid+0.008
olding+0.008
 forgiveness+0.008
 apology+0.008
Neg logit contributions
 lake-0.007
 camp-0.006
neath-0.006
 shore-0.006
 river-0.006
Token loved
Token ID6151
Feature activation-2.91e-01

Pos logit contributions
Of+0.010
aid+0.010
olding+0.010
 forgiveness+0.010
 apology+0.009
Neg logit contributions
 lake-0.012
neath-0.011
 camp-0.011
 shore-0.011
 creek-0.011
Token to
Token ID284
Feature activation-1.13e+00

Pos logit contributions
neath+0.015
 lake+0.015
 canyon+0.014
 camp+0.014
 clearing+0.014
Neg logit contributions
Of-0.012
 forgiveness-0.011
aid-0.011
olding-0.011
 apology-0.010
Token play
Token ID711
Feature activation-6.85e-01

Pos logit contributions
neath+0.052
 valleys+0.052
 clearing+0.051
 ditch+0.050
 lake+0.050
Neg logit contributions
 forgiveness-0.051
aid-0.051
Of-0.051
 apology-0.049
 forgive-0.044
Token with
Token ID351
Feature activation-7.72e-02

Pos logit contributions
neath+0.026
 lake+0.026
 valleys+0.025
 shore+0.025
 canyon+0.025
Neg logit contributions
Of-0.036
aid-0.035
 forgiveness-0.035
olding-0.033
 else-0.032
Token warm
Token ID5814
Feature activation-1.18e+00

Pos logit contributions
Of+0.007
aid+0.007
 forgiveness+0.007
olding+0.007
 apology+0.007
Neg logit contributions
 lake-0.008
neath-0.008
 camp-0.007
 shore-0.007
 river-0.007
Token and
Token ID290
Feature activation+6.50e-01

Pos logit contributions
neath+0.054
 creek+0.051
 lake+0.050
 canyon+0.050
 camp+0.050
Neg logit contributions
aid-0.056
 forgiveness-0.054
Of-0.053
olding-0.050
Deal-0.049
Token happy
Token ID3772
Feature activation+7.73e-01

Pos logit contributions
Of+0.035
olding+0.034
aid+0.034
 forgiveness+0.034
 apology+0.033
Neg logit contributions
 lake-0.025
 camp-0.024
neath-0.023
 shore-0.023
 river-0.023
Token.
Token ID13
Feature activation+3.48e-01

Pos logit contributions
Of+0.041
aid+0.039
olding+0.039
 forgiveness+0.038
 apology+0.038
Neg logit contributions
 lake-0.032
 camp-0.031
 shore-0.030
 ocean-0.030
 river-0.029
Token They
Token ID1119
Feature activation-6.99e-01

Pos logit contributions
Of+0.020
aid+0.020
olding+0.019
 forgiveness+0.019
 apology+0.019
Neg logit contributions
 lake-0.012
 camp-0.012
neath-0.012
 shore-0.011
 river-0.011
Token forget
Token ID6044
Feature activation+2.10e-01

Pos logit contributions
 lake+0.035
neath+0.034
 canyon+0.034
 valleys+0.034
 clearing+0.034
Neg logit contributions
Of-0.030
aid-0.029
 forgiveness-0.027
 apology-0.027
olding-0.027
Token about
Token ID546
Feature activation-1.86e-01

Pos logit contributions
Of+0.009
 forgiveness+0.008
aid+0.008
olding+0.008
 apology+0.008
Neg logit contributions
 lake-0.011
neath-0.010
 camp-0.010
 shore-0.010
 river-0.010
Token the
Token ID262
Feature activation+6.26e-01

Pos logit contributions
 lake+0.006
neath+0.006
 canyon+0.006
 camp+0.005
 clearing+0.005
Neg logit contributions
 forgiveness-0.011
Of-0.011
 apology-0.011
aid-0.011
olding-0.011
Token fall
Token ID2121
Feature activation-1.69e+00

Pos logit contributions
Of+0.034
aid+0.033
olding+0.033
 forgiveness+0.032
 apology+0.032
Neg logit contributions
 lake-0.025
 camp-0.023
 shore-0.023
 river-0.023
neath-0.023
Token and
Token ID290
Feature activation-2.44e-01

Pos logit contributions
 lake+0.077
 clearing+0.075
neath+0.074
 canyon+0.074
 shore+0.072
Neg logit contributions
aid-0.071
olding-0.070
 forgiveness-0.070
aser-0.068
Of-0.067
Token the
Token ID262
Feature activation+3.42e-01

Pos logit contributions
 lake+0.009
neath+0.008
 camp+0.008
 canyon+0.008
 creek+0.008
Neg logit contributions
Of-0.014
 forgiveness-0.014
aid-0.013
olding-0.013
 apology-0.013
Token't
Token ID470
Feature activation+1.22e+00

Pos logit contributions
 lake+0.072
 valleys+0.070
 fields+0.069
 ocean+0.069
 clearing+0.068
Neg logit contributions
olding-0.052
Of-0.051
 apology-0.051
aid-0.050
 forgiveness-0.050
Token want
Token ID765
Feature activation+9.56e-01

Pos logit contributions
Of+0.066
 forgiveness+0.064
aid+0.061
asing+0.061
olding+0.061
Neg logit contributions
 camp-0.044
 lake-0.044
 shore-0.043
 river-0.043
 creek-0.041
Token to
Token ID284
Feature activation+1.06e-01

Pos logit contributions
Of+0.049
olding+0.047
aid+0.047
 forgiveness+0.045
 apology+0.045
Neg logit contributions
 lake-0.041
 camp-0.040
 shore-0.039
neath-0.038
 ocean-0.038
Token buy
Token ID2822
Feature activation+1.81e-01

Pos logit contributions
Of+0.005
aid+0.005
 forgiveness+0.005
olding+0.005
 apology+0.005
Neg logit contributions
 lake-0.005
neath-0.005
 camp-0.005
 creek-0.005
 shore-0.005
Token from
Token ID422
Feature activation-1.32e-01

Pos logit contributions
Of+0.010
aid+0.009
 forgiveness+0.009
olding+0.009
 apology+0.009
Neg logit contributions
 lake-0.007
neath-0.007
 camp-0.007
 shore-0.007
 creek-0.007
Token him
Token ID683
Feature activation+4.50e-01

Pos logit contributions
 lake+0.005
neath+0.004
 camp+0.004
 creek+0.004
 canyon+0.004
Neg logit contributions
 forgiveness-0.007
Of-0.007
aid-0.007
olding-0.007
 apology-0.007
Token because
Token ID780
Feature activation+6.73e-01

Pos logit contributions
Of+0.021
aid+0.021
olding+0.020
 forgiveness+0.020
 apology+0.020
Neg logit contributions
 lake-0.020
 camp-0.019
neath-0.019
 shore-0.019
 river-0.019
Token he
Token ID339
Feature activation+3.83e-01

Pos logit contributions
Of+0.030
aid+0.029
olding+0.029
 forgiveness+0.029
 apology+0.028
Neg logit contributions
 lake-0.032
 camp-0.031
neath-0.031
 shore-0.031
 river-0.030
Token thought
Token ID1807
Feature activation+9.29e-01

Pos logit contributions
Of+0.018
aid+0.017
 forgiveness+0.017
olding+0.017
 apology+0.017
Neg logit contributions
 lake-0.017
 camp-0.017
neath-0.017
 shore-0.016
 river-0.016
Token the
Token ID262
Feature activation+4.29e-01

Pos logit contributions
Of+0.048
aid+0.046
olding+0.045
 forgiveness+0.044
 apology+0.043
Neg logit contributions
 lake-0.039
 camp-0.038
neath-0.037
 shore-0.037
 river-0.036
Token bucket
Token ID19236
Feature activation+1.99e+00

Pos logit contributions
Of+0.026
aid+0.025
olding+0.025
 forgiveness+0.025
 apology+0.024
Neg logit contributions
 lake-0.014
 camp-0.013
neath-0.013
 shore-0.013
 river-0.013
Token a
Token ID257
Feature activation+6.79e-01

Pos logit contributions
neath+0.049
 canyon+0.047
 creek+0.047
 lake+0.046
 river+0.045
Neg logit contributions
Of-0.055
 forgiveness-0.051
aid-0.051
 else-0.048
Deal-0.048
Token nice
Token ID3621
Feature activation-1.16e+00

Pos logit contributions
Of+0.033
olding+0.033
aid+0.033
 forgiveness+0.032
asing+0.031
Neg logit contributions
 lake-0.030
 camp-0.029
 shore-0.028
neath-0.028
 river-0.027
Token nap
Token ID25422
Feature activation+7.21e-01

Pos logit contributions
neath+0.056
 valleys+0.052
 canyon+0.050
 lake+0.048
 ditch+0.047
Neg logit contributions
Of-0.054
 forgiveness-0.050
olding-0.050
aid-0.050
 apology-0.050
Token.
Token ID13
Feature activation-1.94e-01

Pos logit contributions
Of+0.036
aid+0.034
olding+0.034
 forgiveness+0.034
 apology+0.033
Neg logit contributions
 lake-0.031
 camp-0.031
neath-0.030
 shore-0.030
 ocean-0.029
Token<|endoftext|>
Token ID50256
Feature activation-2.62e+00

Pos logit contributions
 lake+0.008
neath+0.008
 camp+0.008
 canyon+0.008
 shore+0.008
Neg logit contributions
Of-0.010
olding-0.010
aid-0.009
 forgiveness-0.009
 apology-0.009
TokenL
Token ID43
Feature activation-8.74e-02

Pos logit contributions
 canyon+0.122
 river+0.120
 shore+0.119
 lake+0.115
 ditch+0.114
Neg logit contributions
Of-0.101
aid-0.099
 forgiveness-0.096
 else-0.095
oral-0.091
Tokenily
Token ID813
Feature activation-2.99e+00

Pos logit contributions
 lake+0.005
neath+0.005
 river+0.005
 camp+0.005
 creek+0.005
Neg logit contributions
Of-0.003
aid-0.003
olding-0.003
 forgiveness-0.003
 apology-0.003
Token and
Token ID290
Feature activation-7.50e-02

Pos logit contributions
 canyon+0.164
 ditch+0.158
 waterfall+0.157
 creek+0.157
neath+0.157
Neg logit contributions
Of-0.095
grand-0.093
olding-0.093
 cares-0.092
olded-0.092
Token Ben
Token ID3932
Feature activation-3.14e+00

Pos logit contributions
 lake+0.003
 camp+0.003
neath+0.003
 canyon+0.003
 river+0.003
Neg logit contributions
Of-0.004
 forgiveness-0.003
aid-0.003
olding-0.003
 apology-0.003
Token like
Token ID588
Feature activation-3.42e-01

Pos logit contributions
 canyon+0.191
 ditch+0.181
 clearing+0.178
 waterfall+0.178
 lake+0.175
Neg logit contributions
 cares-0.092
Of-0.092
 cared-0.091
 forgive-0.088
 else-0.088
Token to
Token ID284
Feature activation-2.02e+00

Pos logit contributions
neath+0.019
 lake+0.018
 canyon+0.018
 clearing+0.018
 camp+0.017
Neg logit contributions
Of-0.013
 forgiveness-0.012
aid-0.012
olding-0.012
 apology-0.011
Token parade
Token ID16134
Feature activation+6.92e-01

Pos logit contributions
Of+0.008
aid+0.008
 forgiveness+0.008
olding+0.008
 apology+0.008
Neg logit contributions
 lake-0.005
neath-0.005
 camp-0.005
 shore-0.004
 creek-0.004
Token,
Token ID11
Feature activation-5.90e-01

Pos logit contributions
Of+0.035
aid+0.034
 forgiveness+0.033
olding+0.033
 apology+0.033
Neg logit contributions
 lake-0.030
 camp-0.029
 shore-0.028
neath-0.028
 river-0.028
Token but
Token ID475
Feature activation+7.97e-01

Pos logit contributions
neath+0.026
 lake+0.026
 camp+0.025
 canyon+0.025
 creek+0.024
Neg logit contributions
Of-0.029
 forgiveness-0.027
olding-0.026
aid-0.026
 apology-0.025
Token he
Token ID339
Feature activation+4.82e-01

Pos logit contributions
Of+0.038
aid+0.038
olding+0.038
 apology+0.037
 forgiveness+0.037
Neg logit contributions
 lake-0.036
 camp-0.034
 shore-0.034
neath-0.034
 river-0.033
Token was
Token ID373
Feature activation-4.91e-01

Pos logit contributions
Of+0.022
aid+0.021
olding+0.021
 forgiveness+0.021
 apology+0.020
Neg logit contributions
 lake-0.023
 camp-0.022
neath-0.022
 shore-0.022
 river-0.022
Token not
Token ID407
Feature activation-5.62e-01

Pos logit contributions
 lake+0.026
neath+0.026
 creek+0.025
 camp+0.025
 valleys+0.025
Neg logit contributions
Of-0.019
olding-0.019
 forgiveness-0.018
aid-0.018
 else-0.017
Token very
Token ID845
Feature activation+5.17e-01

Pos logit contributions
neath+0.030
 lake+0.029
 creek+0.028
 camp+0.027
 river+0.027
Neg logit contributions
aid-0.023
Of-0.022
olding-0.022
 forgiveness-0.022
 apology-0.021
Token good
Token ID922
Feature activation+1.41e+00

Pos logit contributions
Of+0.022
aid+0.021
 forgiveness+0.021
olding+0.020
 apology+0.020
Neg logit contributions
 lake-0.026
 camp-0.026
 shore-0.025
neath-0.025
 river-0.025
Token at
Token ID379
Feature activation-3.75e-01

Pos logit contributions
Of+0.083
aid+0.081
 forgiveness+0.077
 apology+0.077
olding+0.075
Neg logit contributions
 camp-0.048
 shore-0.047
 ocean-0.046
 lake-0.046
 creek-0.044
Token walking
Token ID6155
Feature activation-4.40e-01

Pos logit contributions
neath+0.013
 lake+0.013
 creek+0.013
 clearing+0.012
 valleys+0.012
Neg logit contributions
 forgiveness-0.021
Of-0.020
aid-0.020
 apology-0.020
olding-0.020
Token and
Token ID290
Feature activation+1.10e-01

Pos logit contributions
 lake+0.021
neath+0.021
 camp+0.020
 shore+0.020
 creek+0.020
Neg logit contributions
Of-0.018
olding-0.018
 forgiveness-0.018
aid-0.018
 apology-0.017
Token her
Token ID607
Feature activation+8.82e-01

Pos logit contributions
Of+0.008
 forgiveness+0.007
aid+0.007
olding+0.007
 apology+0.007
Neg logit contributions
 lake-0.006
neath-0.006
 camp-0.006
 shore-0.006
 river-0.006
Token.
Token ID13
Feature activation+3.62e-01

Pos logit contributions
Of+0.049
aid+0.048
olding+0.047
 forgiveness+0.046
 apology+0.045
Neg logit contributions
 lake-0.033
 camp-0.033
 shore-0.031
 creek-0.031
neath-0.030
Token\n
Token ID198
Feature activation+8.82e-01

Pos logit contributions
Of+0.019
aid+0.019
 forgiveness+0.019
olding+0.019
 apology+0.018
Neg logit contributions
 lake-0.014
 camp-0.014
neath-0.013
 shore-0.013
 river-0.013
Token\n
Token ID198
Feature activation+9.81e-01

Pos logit contributions
Of+0.051
 forgiveness+0.050
aid+0.050
olding+0.050
 apology+0.049
Neg logit contributions
 lake-0.030
 camp-0.030
neath-0.029
 clearing-0.028
 shore-0.028
TokenOne
Token ID3198
Feature activation+9.17e-01

Pos logit contributions
 forgiveness+0.047
aid+0.047
Of+0.047
 apology+0.046
olding+0.046
Neg logit contributions
 lake-0.043
 camp-0.042
neath-0.042
 clearing-0.041
 creek-0.041
Token day
Token ID1110
Feature activation-6.66e-01

Pos logit contributions
Of+0.047
aid+0.047
 forgiveness+0.046
olding+0.044
 apology+0.044
Neg logit contributions
 camp-0.039
 lake-0.039
 ocean-0.037
 pool-0.037
 creek-0.037
Token,
Token ID11
Feature activation+3.46e-01

Pos logit contributions
 lake+0.027
 river+0.026
 canyon+0.026
neath+0.025
 shore+0.025
Neg logit contributions
Of-0.034
 forgiveness-0.033
olding-0.033
aid-0.033
 else-0.032
Token her
Token ID607
Feature activation+1.29e+00

Pos logit contributions
Of+0.016
aid+0.016
olding+0.016
 forgiveness+0.016
 apology+0.015
Neg logit contributions
 lake-0.016
 camp-0.015
neath-0.015
 shore-0.015
 river-0.015
Token teacher
Token ID4701
Feature activation+1.06e-01

Pos logit contributions
aid+0.049
Of+0.048
 forgiveness+0.045
olding+0.045
Deal+0.043
Neg logit contributions
 camp-0.074
 lake-0.073
 pool-0.071
 shore-0.070
 ocean-0.070
Token said
Token ID531
Feature activation+9.41e-01

Pos logit contributions
Of+0.005
olding+0.005
aid+0.005
 forgiveness+0.005
 apology+0.005
Neg logit contributions
 lake-0.005
neath-0.005
 camp-0.005
 shore-0.005
 creek-0.005
Token,
Token ID11
Feature activation+4.57e-02

Pos logit contributions
Of+0.054
aid+0.053
olding+0.052
 forgiveness+0.051
 apology+0.051
Neg logit contributions
 lake-0.032
 camp-0.032
 shore-0.031
neath-0.031
 river-0.030
Token mineral
Token ID18352
Feature activation+4.03e-01

Pos logit contributions
 lake+0.005
 camp+0.005
neath+0.005
 canyon+0.004
 shore+0.004
Neg logit contributions
Of-0.007
aid-0.007
 forgiveness-0.007
olding-0.007
 apology-0.007
Token all
Token ID477
Feature activation+3.85e-01

Pos logit contributions
Of+0.023
aid+0.022
olding+0.022
 forgiveness+0.022
 apology+0.022
Neg logit contributions
 lake-0.014
 camp-0.014
neath-0.013
 shore-0.013
 river-0.013
Token day
Token ID1110
Feature activation-8.22e-01

Pos logit contributions
Of+0.018
aid+0.017
 forgiveness+0.017
olding+0.017
 apology+0.016
Neg logit contributions
 lake-0.018
neath-0.018
 camp-0.017
 shore-0.017
 river-0.017
Token,
Token ID11
Feature activation-5.07e-01

Pos logit contributions
neath+0.036
 lake+0.035
 camp+0.034
 creek+0.033
 canyon+0.033
Neg logit contributions
olding-0.038
Of-0.038
 forgiveness-0.036
aid-0.036
 else-0.036
Token bending
Token ID30808
Feature activation-1.85e+00

Pos logit contributions
neath+0.021
 lake+0.021
 camp+0.020
 canyon+0.020
 creek+0.020
Neg logit contributions
Of-0.026
 forgiveness-0.024
olding-0.024
aid-0.024
 apology-0.023
Token and
Token ID290
Feature activation-8.22e-01

Pos logit contributions
 camp+0.089
 canyon+0.086
 clearing+0.085
 creek+0.084
 lake+0.084
Neg logit contributions
Of-0.067
aid-0.065
aser-0.065
 forgiveness-0.063
asing-0.060
Token flex
Token ID7059
Feature activation+7.63e-02

Pos logit contributions
 lake+0.028
neath+0.028
 camp+0.028
 creek+0.027
 canyon+0.026
Neg logit contributions
Of-0.045
 forgiveness-0.044
aid-0.043
olding-0.042
asing-0.042
Tokening
Token ID278
Feature activation-4.35e-01

Pos logit contributions
Of+0.003
aid+0.003
 forgiveness+0.003
olding+0.003
 apology+0.003
Neg logit contributions
 lake-0.004
 camp-0.004
neath-0.004
 shore-0.003
 river-0.003
Token it
Token ID340
Feature activation+1.07e+00

Pos logit contributions
 lake+0.017
neath+0.017
 camp+0.016
 clearing+0.016
 canyon+0.016
Neg logit contributions
aid-0.022
Of-0.022
 forgiveness-0.022
 apology-0.021
olding-0.021
Token.
Token ID13
Feature activation-1.18e-01

Pos logit contributions
Of+0.059
aid+0.057
 forgiveness+0.057
olding+0.056
 apology+0.056
Neg logit contributions
 lake-0.040
 camp-0.039
neath-0.038
 shore-0.037
 river-0.037
Token But
Token ID887
Feature activation+6.65e-01

Pos logit contributions
 lake+0.005
 camp+0.005
neath+0.004
 canyon+0.004
 shore+0.004
Neg logit contributions
Of-0.006
olding-0.006
aid-0.006
 forgiveness-0.006
 apology-0.006
Token and
Token ID290
Feature activation-7.26e-01

Pos logit contributions
Of+0.002
olding+0.002
aid+0.002
 forgiveness+0.002
 apology+0.002
Neg logit contributions
 lake-0.001
neath-0.001
 camp-0.001
 shore-0.001
 creek-0.001
Token watch
Token ID2342
Feature activation-6.62e-01

Pos logit contributions
 lake+0.031
neath+0.030
 canyon+0.029
 ocean+0.029
 clearing+0.029
Neg logit contributions
Of-0.036
 forgiveness-0.034
aid-0.034
 apology-0.033
olding-0.033
Token the
Token ID262
Feature activation+6.02e-01

Pos logit contributions
 lake+0.022
neath+0.022
 canyon+0.021
 clearing+0.021
 creek+0.021
Neg logit contributions
 forgiveness-0.037
Of-0.037
 apology-0.037
aid-0.036
olding-0.035
Token navy
Token ID23956
Feature activation+2.31e-01

Pos logit contributions
Of+0.033
aid+0.032
olding+0.032
 forgiveness+0.032
 apology+0.031
Neg logit contributions
 lake-0.024
 camp-0.023
 shore-0.022
neath-0.022
 river-0.022
Token show
Token ID905
Feature activation+1.75e+00

Pos logit contributions
Of+0.012
aid+0.012
 forgiveness+0.012
olding+0.012
 apology+0.011
Neg logit contributions
 lake-0.009
neath-0.009
 camp-0.009
 shore-0.009
 creek-0.009
Token.
Token ID13
Feature activation-1.72e-01

Pos logit contributions
Of+0.099
aid+0.096
olding+0.092
 forgiveness+0.091
 apology+0.090
Neg logit contributions
 camp-0.065
 lake-0.063
 shore-0.062
 pool-0.060
 ocean-0.059
Token They
Token ID1119
Feature activation-1.02e+00

Pos logit contributions
 lake+0.007
neath+0.006
 canyon+0.006
 camp+0.006
 shore+0.006
Neg logit contributions
Of-0.009
olding-0.009
aid-0.009
 forgiveness-0.009
 apology-0.009
Token learn
Token ID2193
Feature activation+1.04e+00

Pos logit contributions
 lake+0.048
neath+0.048
 clearing+0.047
 ditch+0.046
 canyon+0.046
Neg logit contributions
Of-0.044
aid-0.042
 forgiveness-0.041
 apology-0.041
olding-0.040
Token more
Token ID517
Feature activation+7.24e-01

Pos logit contributions
Of+0.056
olding+0.055
aid+0.054
 forgiveness+0.052
 apology+0.051
Neg logit contributions
 lake-0.042
 camp-0.041
 shore-0.039
 pool-0.038
 creek-0.038
Token about
Token ID546
Feature activation-6.37e-01

Pos logit contributions
Of+0.033
aid+0.031
olding+0.031
 forgiveness+0.030
 apology+0.029
Neg logit contributions
 lake-0.035
 camp-0.034
 shore-0.033
 creek-0.033
neath-0.033
Token the
Token ID262
Feature activation+4.80e-01

Pos logit contributions
neath+0.023
 lake+0.022
 canyon+0.021
 clearing+0.021
 creek+0.021
Neg logit contributions
 forgiveness-0.035
Of-0.034
 apology-0.033
aid-0.033
olding-0.032
Token sky
Token ID6766
Feature activation-1.64e+00

Pos logit contributions
neath+0.044
 camp+0.043
 canyon+0.039
 shore+0.038
 creek+0.038
Neg logit contributions
Of-0.078
aid-0.078
olding-0.069
 else-0.069
 forgiveness-0.069
Token.
Token ID13
Feature activation-1.50e+00

Pos logit contributions
neath+0.096
 creek+0.090
 ditch+0.089
 canyon+0.089
 lake+0.088
Neg logit contributions
olding-0.049
Of-0.048
 forgiveness-0.047
aid-0.047
 else-0.045
Token He
Token ID679
Feature activation-1.04e+00

Pos logit contributions
 canyon+0.077
 camp+0.074
 lake+0.073
neath+0.072
 ocean+0.072
Neg logit contributions
Of-0.064
olding-0.050
aid-0.049
olded-0.049
 apology-0.047
Token wanted
Token ID2227
Feature activation-8.08e-03

Pos logit contributions
neath+0.051
 lake+0.050
 creek+0.049
 camp+0.049
 canyon+0.049
Neg logit contributions
Of-0.044
 else-0.041
aid-0.040
 forgiveness-0.038
olding-0.038
Token to
Token ID284
Feature activation-1.07e+00

Pos logit contributions
 lake+0.000
neath+0.000
 camp+0.000
 creek+0.000
 canyon+0.000
Neg logit contributions
Of-0.000
 forgiveness-0.000
aid-0.000
olding-0.000
 apology-0.000
Token fly
Token ID6129
Feature activation-5.81e-01

Pos logit contributions
neath+0.046
 valleys+0.044
 camp+0.042
 lake+0.042
 fields+0.042
Neg logit contributions
Of-0.050
 forgiveness-0.050
 apology-0.050
aid-0.049
 else-0.046
Token too
Token ID1165
Feature activation-1.08e+00

Pos logit contributions
neath+0.025
 lake+0.024
 camp+0.023
 canyon+0.023
 shore+0.023
Neg logit contributions
Of-0.029
aid-0.027
 forgiveness-0.027
olding-0.026
 else-0.025
Token!
Token ID0
Feature activation-2.08e-01

Pos logit contributions
neath+0.054
 lake+0.052
 camp+0.050
 ocean+0.049
 creek+0.049
Neg logit contributions
Of-0.044
olding-0.044
aid-0.043
 else-0.042
 forgiveness-0.040
Token\n
Token ID198
Feature activation+7.69e-01

Pos logit contributions
 lake+0.009
 camp+0.009
neath+0.009
 shore+0.008
 canyon+0.008
Neg logit contributions
Of-0.011
olding-0.010
aid-0.010
 forgiveness-0.010
 apology-0.009
Token\n
Token ID198
Feature activation+8.87e-02

Pos logit contributions
Of+0.043
 forgiveness+0.042
aid+0.042
olding+0.042
 apology+0.041
Neg logit contributions
 lake-0.028
 camp-0.026
neath-0.026
 shore-0.025
 clearing-0.025
TokenThe
Token ID464
Feature activation+5.90e-01

Pos logit contributions
Of+0.004
aid+0.004
olding+0.004
 forgiveness+0.003
 apology+0.003
Neg logit contributions
 lake-0.004
neath-0.004
 camp-0.004
 shore-0.004
 ocean-0.004
Token playing
Token ID2712
Feature activation-1.09e+00

Pos logit contributions
neath+0.134
 lake+0.131
icer+0.123
 valleys+0.123
 exp+0.120
Neg logit contributions
 else-0.093
olding-0.090
 respectful-0.085
 forgiveness-0.080
olded-0.080
Token,
Token ID11
Feature activation-1.65e+00

Pos logit contributions
neath+0.047
 lake+0.045
 valleys+0.044
 canyon+0.043
 shore+0.043
Neg logit contributions
 forgiveness-0.052
Of-0.051
aid-0.051
olding-0.049
 apology-0.047
Token he
Token ID339
Feature activation-1.60e+00

Pos logit contributions
neath+0.095
 canyon+0.092
 lake+0.090
 valleys+0.090
 bay+0.089
Neg logit contributions
Of-0.055
 forgiveness-0.050
aid-0.049
 else-0.047
Deal-0.042
Token saw
Token ID2497
Feature activation-3.15e-01

Pos logit contributions
neath+0.081
 lake+0.078
 camp+0.075
 canyon+0.074
 creek+0.073
Neg logit contributions
 else-0.063
Of-0.059
ups-0.058
olded-0.057
olding-0.055
Token a
Token ID257
Feature activation-1.05e+00

Pos logit contributions
 lake+0.010
neath+0.010
 camp+0.010
 canyon+0.009
 creek+0.009
Neg logit contributions
aid-0.019
 forgiveness-0.019
Of-0.019
 apology-0.018
olding-0.018
Token big
Token ID1263
Feature activation-3.09e+00

Pos logit contributions
neath+0.046
 valleys+0.043
 lake+0.041
 camp+0.040
 shore+0.040
Neg logit contributions
Of-0.050
 forgiveness-0.049
aid-0.047
olding-0.046
 apology-0.046
Token,
Token ID11
Feature activation-1.88e+00

Pos logit contributions
neath+0.148
 valleys+0.144
ground+0.122
growth+0.118
 streams+0.117
Neg logit contributions
 forgiveness-0.127
 apology-0.121
aser-0.115
 promise-0.114
 caring-0.110
Token mean
Token ID1612
Feature activation-2.10e+00

Pos logit contributions
neath+0.091
 camp+0.086
 valleys+0.084
 shore+0.084
 lake+0.082
Neg logit contributions
Of-0.079
 forgiveness-0.075
 apology-0.074
olding-0.073
aid-0.073
Token cat
Token ID3797
Feature activation-5.34e-01

Pos logit contributions
neath+0.135
 valleys+0.126
ground+0.114
 fields+0.112
 canyon+0.111
Neg logit contributions
 apology-0.068
 forgiveness-0.064
Of-0.063
olding-0.063
aid-0.062
Token coming
Token ID2406
Feature activation-4.28e-01

Pos logit contributions
neath+0.027
 lake+0.026
 camp+0.026
 shore+0.025
 valleys+0.025
Neg logit contributions
olding-0.022
 forgiveness-0.021
aid-0.021
Of-0.021
asing-0.020
Token near
Token ID1474
Feature activation-1.57e+00

Pos logit contributions
 lake+0.017
neath+0.017
 camp+0.016
 ocean+0.016
 canyon+0.016
Neg logit contributions
Of-0.022
olding-0.022
aid-0.021
 forgiveness-0.021
 apology-0.021
Token friends
Token ID2460
Feature activation+1.85e+00

Pos logit contributions
Of+0.058
olding+0.057
aid+0.053
asing+0.050
olded+0.050
Neg logit contributions
 lake-0.078
 camp-0.077
 shore-0.076
 ocean-0.073
 pool-0.071
Token forever
Token ID8097
Feature activation-6.43e-02

Pos logit contributions
aid+0.104
Of+0.103
 apology+0.102
olding+0.099
 forgiveness+0.098
Neg logit contributions
 lake-0.065
 camp-0.065
 shore-0.061
 ocean-0.060
neath-0.059
Token.
Token ID13
Feature activation+7.84e-01

Pos logit contributions
 lake+0.003
neath+0.003
 camp+0.003
 creek+0.003
 canyon+0.003
Neg logit contributions
Of-0.003
 forgiveness-0.003
olding-0.003
aid-0.003
 apology-0.003
Token<|endoftext|>
Token ID50256
Feature activation-2.52e+00

Pos logit contributions
aid+0.044
Of+0.044
olding+0.043
 forgiveness+0.043
 apology+0.042
Neg logit contributions
 lake-0.028
 camp-0.027
neath-0.027
 shore-0.026
 river-0.026
TokenIt
Token ID1026
Feature activation-1.85e+00

Pos logit contributions
 canyon+0.143
 river+0.139
 shore+0.138
 lake+0.135
 ditch+0.134
Neg logit contributions
Of-0.075
aid-0.069
 forgiveness-0.069
 else-0.066
olding-0.063
Token was
Token ID373
Feature activation-2.72e+00

Pos logit contributions
 ditch+0.101
 canyon+0.100
 shore+0.099
 lake+0.097
 creek+0.095
Neg logit contributions
Of-0.057
 else-0.054
olding-0.051
 forgiveness-0.051
Deal-0.048
Token a
Token ID257
Feature activation-1.81e+00

Pos logit contributions
 canyon+0.132
 valleys+0.119
 creek+0.118
neath+0.117
 lake+0.117
Neg logit contributions
Of-0.110
 forgiveness-0.104
aid-0.103
 caring-0.101
 lesson-0.096
Token cold
Token ID4692
Feature activation-1.80e+00

Pos logit contributions
neath+0.093
 valleys+0.087
 canyon+0.083
 ditch+0.081
 shore+0.080
Neg logit contributions
Of-0.078
Deal-0.066
 respectful-0.066
 else-0.065
 supportive-0.064
Token,
Token ID11
Feature activation-3.46e+00

Pos logit contributions
neath+0.103
 valleys+0.100
 canyon+0.095
 ditch+0.093
 shore+0.091
Neg logit contributions
Of-0.061
olding-0.058
 forgiveness-0.056
aid-0.052
 apology-0.052
Token icy
Token ID30284
Feature activation-2.43e+00

Pos logit contributions
neath+0.164
 canyon+0.161
 valleys+0.159
 shore+0.152
 takeoff+0.150
Neg logit contributions
Of-0.127
 else-0.124
 importantly-0.122
 respectful-0.121
 caring-0.119
Token day
Token ID1110
Feature activation-1.96e+00

Pos logit contributions
neath+0.143
 valleys+0.131
 ditch+0.122
 Everest+0.121
 canyon+0.121
Neg logit contributions
Of-0.074
olding-0.070
 forgiveness-0.069
Deal-0.066
 apology-0.065
Token were
Token ID547
Feature activation-1.55e+00

Pos logit contributions
 lake+0.029
 canyon+0.028
 creek+0.028
 camp+0.028
neath+0.027
Neg logit contributions
Of-0.026
olding-0.026
aid-0.026
 forgiveness-0.025
 apology-0.024
Token flying
Token ID7348
Feature activation-1.18e+00

Pos logit contributions
 lake+0.064
 canyon+0.062
 creek+0.062
neath+0.062
 clearing+0.062
Neg logit contributions
asing-0.070
olding-0.069
 forgiveness-0.068
Of-0.067
olded-0.067
Token their
Token ID511
Feature activation-7.86e-01

Pos logit contributions
neath+0.050
 lake+0.050
 shore+0.049
 canyon+0.048
 ocean+0.048
Neg logit contributions
Of-0.051
aid-0.048
 forgiveness-0.048
olding-0.047
 else-0.046
Token k
Token ID479
Feature activation+1.44e-01

Pos logit contributions
neath+0.030
 clearing+0.029
 lake+0.028
 canyon+0.028
 camp+0.028
Neg logit contributions
Of-0.040
olding-0.038
 forgiveness-0.038
 apology-0.037
aser-0.036
Tokenite
Token ID578
Feature activation-8.50e-01

Pos logit contributions
Of+0.005
aid+0.005
olding+0.005
 forgiveness+0.005
 apology+0.005
Neg logit contributions
 lake-0.008
neath-0.008
 camp-0.008
 shore-0.008
 creek-0.008
Token on
Token ID319
Feature activation-2.75e+00

Pos logit contributions
 lake+0.039
neath+0.038
 camp+0.038
 shore+0.037
 canyon+0.037
Neg logit contributions
Of-0.036
olding-0.035
 forgiveness-0.035
aid-0.034
asing-0.033
Token the
Token ID262
Feature activation-2.73e+00

Pos logit contributions
 tim+0.109
 canyon+0.109
acket+0.104
asis+0.100
neath+0.100
Neg logit contributions
 forgiveness-0.117
 else-0.099
 apology-0.097
 lesson-0.096
 caring-0.096
Token hill
Token ID12788
Feature activation-3.51e-01

Pos logit contributions
neath+0.101
 tim+0.089
 valleys+0.088
 canyon+0.084
 clearing+0.084
Neg logit contributions
Of-0.126
ups-0.124
 else-0.122
 lesson-0.121
 respectful-0.120
Token.
Token ID13
Feature activation-2.09e+00

Pos logit contributions
 lake+0.017
neath+0.017
 camp+0.017
 canyon+0.016
 shore+0.016
Neg logit contributions
Of-0.014
olding-0.014
 forgiveness-0.014
aid-0.014
 apology-0.013
Token Harry
Token ID5850
Feature activation-1.55e+00

Pos logit contributions
 tim+0.098
 canyon+0.096
 camp+0.096
 valley+0.090
 clearing+0.089
Neg logit contributions
Of-0.084
 interests-0.071
 apology-0.068
olding-0.066
olded-0.065
Token has
Token ID468
Feature activation-9.20e-01

Pos logit contributions
 canyon+0.087
 camp+0.086
 shore+0.085
 lake+0.084
 creek+0.084
Neg logit contributions
Of-0.054
 else-0.050
olded-0.046
olding-0.046
aid-0.045
Token mom
Token ID1995
Feature activation-1.06e+00

Pos logit contributions
aid+0.052
Of+0.052
olding+0.052
asing+0.050
 forgiveness+0.049
Neg logit contributions
 lake-0.072
 camp-0.069
 shore-0.067
 river-0.067
 pool-0.067
Token walked
Token ID6807
Feature activation-8.45e-01

Pos logit contributions
 creek+0.046
 lake+0.046
 canyon+0.046
 camp+0.045
neath+0.044
Neg logit contributions
Of-0.049
aid-0.048
olding-0.046
olded-0.046
asing-0.045
Token around
Token ID1088
Feature activation-1.87e+00

Pos logit contributions
neath+0.038
 lake+0.038
 canyon+0.037
 river+0.036
 creek+0.036
Neg logit contributions
Of-0.039
olding-0.036
aid-0.034
 forgiveness-0.033
 apology-0.033
Token the
Token ID262
Feature activation-2.42e+00

Pos logit contributions
 canyon+0.071
neath+0.071
 ditch+0.069
acket+0.067
 lake+0.066
Neg logit contributions
 forgiveness-0.090
Of-0.088
olding-0.081
 classmates-0.081
 apology-0.080
Token park
Token ID3952
Feature activation-8.02e-01

Pos logit contributions
 takeoff+0.089
 canyon+0.088
neath+0.087
 tim+0.084
 rav+0.082
Neg logit contributions
Of-0.119
 else-0.106
Deal-0.106
Okay-0.104
 owner-0.102
Token.
Token ID13
Feature activation-2.70e+00

Pos logit contributions
neath+0.036
 lake+0.033
 canyon+0.033
 camp+0.032
 creek+0.032
Neg logit contributions
Of-0.037
olding-0.037
 forgiveness-0.036
aid-0.034
 apology-0.034
Token He
Token ID679
Feature activation-1.49e+00

Pos logit contributions
 canyon+0.131
 camp+0.124
 lake+0.121
 waterfall+0.119
 valley+0.118
Neg logit contributions
Of-0.111
olding-0.099
olded-0.095
olds-0.094
 apology-0.091
Token waved
Token ID26834
Feature activation-2.86e-02

Pos logit contributions
neath+0.067
 canyon+0.067
 creek+0.066
 lake+0.066
 shore+0.065
Neg logit contributions
Of-0.065
 else-0.063
aid-0.058
olded-0.058
olding-0.057
Token at
Token ID379
Feature activation-1.62e-01

Pos logit contributions
 lake+0.001
neath+0.001
 camp+0.001
 river+0.001
 creek+0.001
Neg logit contributions
Of-0.001
aid-0.001
 forgiveness-0.001
olding-0.001
 apology-0.001
Token the
Token ID262
Feature activation-5.02e-01

Pos logit contributions
 lake+0.005
 camp+0.005
 canyon+0.004
 creek+0.004
neath+0.004
Neg logit contributions
Of-0.010
 forgiveness-0.010
aid-0.010
 apology-0.010
olding-0.010
Token other
Token ID584
Feature activation-1.29e+00

Pos logit contributions
neath+0.017
 camp+0.016
 canyon+0.016
 lake+0.016
 clearing+0.015
Neg logit contributions
Of-0.029
aid-0.028
 forgiveness-0.027
 apology-0.027
olding-0.027
Token,
Token ID11
Feature activation-1.57e+00

Pos logit contributions
 lake+0.052
 canyon+0.049
 river+0.048
 valleys+0.047
 clearing+0.047
Neg logit contributions
Of-0.053
olding-0.053
 else-0.052
 forgiveness-0.051
aid-0.049
Token Tim
Token ID5045
Feature activation-2.54e+00

Pos logit contributions
 canyon+0.068
 lake+0.066
 river+0.065
 shore+0.064
 camp+0.063
Neg logit contributions
Of-0.078
 else-0.073
olding-0.070
aid-0.069
 forgiveness-0.067
Tokenmy
Token ID1820
Feature activation-2.83e+00

Pos logit contributions
 canyon+0.183
 waterfall+0.180
 valleys+0.177
 creek+0.175
 shore+0.172
Neg logit contributions
olding-0.034
 cared-0.034
grand-0.032
 interests-0.032
aid-0.032
Token was
Token ID373
Feature activation-1.40e+00

Pos logit contributions
 waterfall+0.168
 canyon+0.168
 creek+0.159
 rav+0.159
 valleys+0.158
Neg logit contributions
grand-0.078
 cared-0.077
 else-0.076
 forgiveness-0.071
aid-0.071
Token playing
Token ID2712
Feature activation-9.92e-01

Pos logit contributions
 lake+0.057
neath+0.056
 canyon+0.054
 valleys+0.053
 waterfall+0.053
Neg logit contributions
Of-0.069
olding-0.067
aid-0.066
 forgiveness-0.065
olded-0.062
Token in
Token ID287
Feature activation-2.90e+00

Pos logit contributions
neath+0.041
 lake+0.041
 valleys+0.040
 canyon+0.040
 camp+0.039
Neg logit contributions
Of-0.048
 forgiveness-0.047
aid-0.046
olding-0.044
 else-0.044
Token his
Token ID465
Feature activation-1.24e+00

Pos logit contributions
acket+0.147
 canyon+0.142
 tim+0.134
neath+0.131
 creek+0.129
Neg logit contributions
 forgiveness-0.094
 else-0.091
aid-0.088
Of-0.082
Her-0.082
Token garden
Token ID11376
Feature activation-8.32e-01

Pos logit contributions
neath+0.045
 canyon+0.041
 valleys+0.041
 lake+0.038
 clearing+0.038
Neg logit contributions
Of-0.066
 forgiveness-0.064
olding-0.061
aid-0.061
 else-0.061
Token with
Token ID351
Feature activation-1.49e-01

Pos logit contributions
neath+0.044
 lake+0.042
 camp+0.041
 canyon+0.041
 valleys+0.041
Neg logit contributions
olding-0.031
Of-0.031
 forgiveness-0.030
aid-0.029
 else-0.027
Token his
Token ID465
Feature activation+9.59e-01

Pos logit contributions
 lake+0.006
neath+0.005
 camp+0.005
 canyon+0.005
 shore+0.005
Neg logit contributions
Of-0.008
aid-0.008
 forgiveness-0.008
olding-0.007
 apology-0.007
Token toy
Token ID13373
Feature activation+2.35e-01

Pos logit contributions
olding+0.047
Of+0.047
aid+0.046
 forgiveness+0.044
 apology+0.044
Neg logit contributions
 lake-0.044
 camp-0.042
 river-0.041
 creek-0.041
 pool-0.041