TOP ACTIVATIONS
MAX = 0.122

as he pushed . M omm a saw Tim my and
pictures together ?" " Of course !" said Lily
she asked . " Sure , honey . I
Inspired , Emily asked her pap a , " How can
it together ?" " Yes , we can ,
my love ," said M omm a . " Let 's
?" Her dad replied , " No , we can 't
be friends ?" " Of course , oct opus
Her mom my replied , " Maybe another day , sweet
my smiled and said , " Sure , let 's go
Ben asked . " Of course we can .
my smiled and said , " Sure , let 's have
own story ?" " Of course , we can
of yours ?" " Of course , you can
day , Emma asked her pap a if they could go
waves again ?" " Of course , we can
Lily asks . " Yes , you can .
. Ranger ?" " Sure , Lily , but
girl asked . " Sure , you can play
Her mom my said , " Yes , let 's go

MIN ACTIVATIONS
MIN = -0.128

- rolling green hills , lush grass , trees and flowers
colourful flowers , through the lush forests deep in thought .
. It ran through a lush green me adow , with
was running around on the lush grass , enjoying the sunshine
sight . There was a lush green patch of grass and
good to roll in the lush green grass and jump high
farmer who lived on a lush farm with many animals and
grass was a deep , lush green . The little girl
to set . <|endoftext|> Little Johnny was so excited . He
lived in a big , wide forest and enjoyed exploring every
Finally , they reached a vast stretch of beach . As
, there was a big wide house . Inside the house
middle of a dark , wide sea . All the animals
, she came to a wide , open field . In
lived in a big , wide field . As he hopped
She spotted a big , wide tree in the distance and
, the beetle sailed across the sea . She was
Suddenly , she spotted a wide field of flowers . It
stay happy . <|endoftext|> Little Sammy always liked to explore ,
day , he found a wide door . He hopped over

INTERVAL 0.060 - 0.122
CONTAINS 0.599%

should be more careful and respectful . " Can
hugged them and said , " Don 't worry , I
and picked her up . " You 're a very good
seems too good to be true . If something seems too
All of a sudden , something magical happened

INTERVAL -0.003 - 0.060
CONTAINS 54.705%

get enough sunlight ", said Mom my . Jack felt sad
was stuck on the slide . She felt a pain in
adorable , soft toy cat . The cat was silver and
, in a far away corner , he saw a big
of your business !" He gets back in the

INTERVAL -0.065 - -0.003
CONTAINS 44.022%

day when they were playing , Rabbit noticed Bear had a
When Mom is busy in the kitchen , they
She got scared and began to panic . The wave was
play and splash in the water all day . The hipp
eat snacks . One day , Dave realised that he wanted

INTERVAL -0.128 - -0.065
CONTAINS 0.674%

forward . Finally it soared up to the top of the
the noise of the blocks falling , they ran back to
are worth looking after . <|endoftext|> Once , there was a
, " Wow , that fire work is so pretty !"
alive than ever before ! <|endoftext|> Once upon a time there
Token as
Token ID355
Feature activation+5.44e-03

Pos logit contributions
Of+0.003
 forgiveness+0.003
olding+0.002
aid+0.002
 apology+0.002
Neg logit contributions
 lake-0.003
neath-0.003
 camp-0.002
 creek-0.002
 river-0.002
Token he
Token ID339
Feature activation+6.89e-03

Pos logit contributions
Of+0.009
 forgiveness+0.009
aid+0.009
olding+0.009
 apology+0.009
Neg logit contributions
 lake-0.011
neath-0.011
 camp-0.011
 creek-0.011
 river-0.011
Token pushed
Token ID7121
Feature activation-2.18e-03

Pos logit contributions
Of+0.015
 forgiveness+0.014
olding+0.014
aid+0.014
 apology+0.014
Neg logit contributions
 lake-0.011
neath-0.011
 camp-0.011
 creek-0.011
 river-0.011
Token.
Token ID13
Feature activation-1.53e-02

Pos logit contributions
 lake+0.003
neath+0.003
 camp+0.003
 creek+0.003
 river+0.003
Neg logit contributions
Of-0.005
 forgiveness-0.005
aid-0.005
olding-0.005
 apology-0.005
Token M
Token ID337
Feature activation+1.46e-02

Pos logit contributions
 lake+0.027
 camp+0.027
neath+0.027
 canyon+0.027
 creek+0.026
Neg logit contributions
Of-0.031
olding-0.029
 forgiveness-0.028
aid-0.028
 apology-0.028
Tokenomm
Token ID2002
Feature activation+1.21e-01

Pos logit contributions
Of+0.028
aid+0.027
 forgiveness+0.027
olding+0.027
 apology+0.026
Neg logit contributions
 lake-0.027
neath-0.026
 camp-0.026
 river-0.025
 ocean-0.025
Tokena
Token ID64
Feature activation+1.08e-02

Pos logit contributions
 honesty+0.252
 apology+0.250
 forgiveness+0.247
 facts+0.245
Of+0.234
Neg logit contributions
 set-0.117
 lake-0.113
 clearing-0.111
 tim-0.110
 row-0.110
Token saw
Token ID2497
Feature activation-9.43e-03

Pos logit contributions
Of+0.022
 forgiveness+0.021
olding+0.021
aid+0.021
 apology+0.021
Neg logit contributions
 lake-0.019
neath-0.018
 camp-0.018
 river-0.017
 creek-0.017
Token Tim
Token ID5045
Feature activation+2.15e-03

Pos logit contributions
 lake+0.016
 camp+0.016
 canyon+0.016
 creek+0.016
neath+0.016
Neg logit contributions
Of-0.019
 forgiveness-0.019
aid-0.019
olding-0.018
 apology-0.018
Tokenmy
Token ID1820
Feature activation-2.00e-02

Pos logit contributions
Of+0.004
 forgiveness+0.004
olding+0.004
aid+0.004
 apology+0.003
Neg logit contributions
 lake-0.004
neath-0.004
 camp-0.004
 creek-0.004
 shore-0.004
Token and
Token ID290
Feature activation+1.30e-02

Pos logit contributions
 lake+0.038
neath+0.038
 creek+0.037
 camp+0.037
 shore+0.036
Neg logit contributions
olding-0.035
 forgiveness-0.035
Of-0.035
aid-0.034
asing-0.033
Token pictures
Token ID5986
Feature activation+1.11e-02

Pos logit contributions
Of+0.036
olding+0.035
 forgiveness+0.035
aid+0.035
 apology+0.034
Neg logit contributions
 lake-0.024
neath-0.023
 camp-0.022
 river-0.022
 creek-0.022
Token together
Token ID1978
Feature activation-2.27e-02

Pos logit contributions
Of+0.025
 forgiveness+0.024
olding+0.024
aid+0.024
 apology+0.024
Neg logit contributions
 lake-0.017
neath-0.017
 camp-0.016
 river-0.016
 creek-0.016
Token?"
Token ID1701
Feature activation+3.50e-02

Pos logit contributions
 lake+0.035
neath+0.035
 valleys+0.033
 canyon+0.033
 ditch+0.033
Neg logit contributions
Of-0.049
 forgiveness-0.048
olding-0.046
aid-0.046
asing-0.045
Token
Token ID198
Feature activation+3.94e-02

Pos logit contributions
asing+0.091
 forgiveness+0.090
olding+0.090
 apology+0.088
aid+0.087
Neg logit contributions
 lake-0.038
neath-0.035
 shore-0.034
 river-0.033
 creek-0.033
Token
Token ID198
Feature activation+5.09e-02

Pos logit contributions
asing+0.103
 forgiveness+0.102
olding+0.102
 apology+0.099
asers+0.099
Neg logit contributions
 lake-0.042
 camp-0.040
neath-0.039
 shore-0.036
 pool-0.036
Token"
Token ID1
Feature activation+1.19e-01

Pos logit contributions
 forgiveness+0.100
 apology+0.094
asers+0.091
 actions+0.091
asing+0.091
Neg logit contributions
 lake-0.084
 river-0.081
 creek-0.079
 clearing-0.077
neath-0.077
TokenOf
Token ID5189
Feature activation+4.53e-02

Pos logit contributions
 actions+0.211
 supervision+0.207
 instincts+0.201
 apology+0.199
olding+0.195
Neg logit contributions
 lake-0.211
neath-0.198
 clearing-0.187
 creek-0.187
 field-0.178
Token course
Token ID1781
Feature activation+2.69e-02

Pos logit contributions
olding+0.109
Of+0.104
asers+0.104
asing+0.103
olded+0.103
Neg logit contributions
 lake-0.064
neath-0.058
 clearing-0.055
 ocean-0.053
 camp-0.052
Token!"
Token ID2474
Feature activation+3.70e-02

Pos logit contributions
aid+0.072
Of+0.071
 forgiveness+0.070
olding+0.070
 apology+0.069
Neg logit contributions
 lake-0.028
 camp-0.026
neath-0.025
 shore-0.025
 river-0.025
Token said
Token ID531
Feature activation+3.98e-03

Pos logit contributions
olding+0.066
 forgiveness+0.065
asing+0.065
 apology+0.063
aser+0.061
Neg logit contributions
 lake-0.068
neath-0.067
 shore-0.065
 camp-0.064
 pool-0.063
Token Lily
Token ID20037
Feature activation+1.94e-02

Pos logit contributions
Of+0.009
 forgiveness+0.009
olding+0.009
aid+0.009
 apology+0.009
Neg logit contributions
 lake-0.006
neath-0.006
 camp-0.006
 creek-0.006
 river-0.006
Token she
Token ID673
Feature activation+4.36e-02

Pos logit contributions
asing+0.150
aid+0.147
olding+0.146
Deal+0.142
 apology+0.141
Neg logit contributions
neath-0.058
 camp-0.054
 lake-0.054
 shore-0.054
 creek-0.053
Token asked
Token ID1965
Feature activation+4.90e-02

Pos logit contributions
Of+0.103
aid+0.101
olding+0.101
 forgiveness+0.100
 apology+0.098
Neg logit contributions
 lake-0.060
neath-0.055
 camp-0.055
 river-0.055
 canyon-0.054
Token.
Token ID13
Feature activation+4.95e-02

Pos logit contributions
Of+0.125
aid+0.120
Deal+0.115
olding+0.114
grand+0.114
Neg logit contributions
neath-0.055
 camp-0.052
 shore-0.051
 lake-0.050
 fields-0.048
Token
Token ID198
Feature activation+3.56e-02

Pos logit contributions
asing+0.137
aid+0.134
olding+0.133
 forgiveness+0.132
asers+0.131
Neg logit contributions
neath-0.047
 lake-0.044
 creek-0.042
 shore-0.041
 river-0.040
Token
Token ID198
Feature activation+4.30e-02

Pos logit contributions
Of+0.090
 forgiveness+0.088
asing+0.088
olding+0.088
aid+0.088
Neg logit contributions
 lake-0.041
neath-0.041
 camp-0.039
 shore-0.038
 clearing-0.036
Token"
Token ID1
Feature activation+1.19e-01

Pos logit contributions
 forgiveness+0.090
asers+0.089
 apology+0.088
asing+0.086
Of+0.086
Neg logit contributions
 lake-0.066
neath-0.064
 creek-0.062
 clearing-0.062
 river-0.061
TokenSure
Token ID19457
Feature activation+3.84e-02

Pos logit contributions
iness+0.200
 supervision+0.198
 actions+0.195
 choices+0.188
asers+0.188
Neg logit contributions
 lake-0.196
 creek-0.188
neath-0.175
 clearing-0.172
 field-0.169
Token,
Token ID11
Feature activation+4.02e-02

Pos logit contributions
Of+0.120
aid+0.118
olding+0.117
asing+0.116
olded+0.114
Neg logit contributions
 lake-0.024
 camp-0.022
 shore-0.018
 clearing-0.018
 river-0.018
Token honey
Token ID12498
Feature activation+2.24e-02

Pos logit contributions
olding+0.084
aid+0.083
asing+0.081
Of+0.081
olds+0.079
Neg logit contributions
 lake-0.065
 camp-0.063
 shore-0.059
 river-0.059
 creek-0.059
Token.
Token ID13
Feature activation+3.02e-02

Pos logit contributions
Of+0.054
aid+0.052
olding+0.051
 forgiveness+0.051
 apology+0.050
Neg logit contributions
 lake-0.031
 camp-0.030
neath-0.029
 river-0.028
 creek-0.028
Token I
Token ID314
Feature activation+2.85e-02

Pos logit contributions
aid+0.061
 forgiveness+0.060
olding+0.060
Of+0.059
 apology+0.058
Neg logit contributions
 lake-0.049
neath-0.049
 camp-0.047
 creek-0.046
 river-0.046
Token Inspired
Token ID45827
Feature activation+1.75e-02

Pos logit contributions
Of+0.026
aid+0.026
 forgiveness+0.026
olding+0.026
 apology+0.025
Neg logit contributions
 lake-0.016
neath-0.016
 camp-0.015
 creek-0.015
 river-0.015
Token,
Token ID11
Feature activation+4.34e-02

Pos logit contributions
Of+0.041
aid+0.040
olding+0.040
 forgiveness+0.040
 apology+0.039
Neg logit contributions
neath-0.025
 lake-0.025
 camp-0.024
 creek-0.023
 river-0.023
Token Emily
Token ID17608
Feature activation-1.78e-03

Pos logit contributions
olding+0.096
aid+0.095
Of+0.092
 apology+0.089
Deal+0.088
Neg logit contributions
neath-0.070
 lake-0.067
 camp-0.066
 creek-0.066
 fields-0.064
Token asked
Token ID1965
Feature activation+1.34e-02

Pos logit contributions
 lake+0.004
neath+0.004
 camp+0.003
 creek+0.003
 shore+0.003
Neg logit contributions
Of-0.003
 forgiveness-0.003
olding-0.003
aid-0.003
 apology-0.003
Token her
Token ID607
Feature activation+2.28e-02

Pos logit contributions
Of+0.030
aid+0.029
olding+0.029
 forgiveness+0.029
 apology+0.028
Neg logit contributions
 lake-0.020
neath-0.020
 camp-0.019
 shore-0.019
 creek-0.019
Token pap
Token ID20461
Feature activation+1.18e-01

Pos logit contributions
aid+0.040
Of+0.039
olding+0.038
 forgiveness+0.038
 apology+0.037
Neg logit contributions
 lake-0.048
 camp-0.046
 creek-0.045
 shore-0.045
 river-0.045
Tokena
Token ID64
Feature activation+4.63e-02

Pos logit contributions
 supervision+0.218
 apology+0.216
 actions+0.216
 importantly+0.215
 meanings+0.215
Neg logit contributions
 clearing-0.137
 lake-0.128
 ocean-0.122
 reef-0.117
 bay-0.115
Token,
Token ID11
Feature activation+2.14e-02

Pos logit contributions
aid+0.092
Of+0.091
olding+0.087
 forgiveness+0.087
 apology+0.087
Neg logit contributions
 camp-0.079
 lake-0.079
neath-0.076
 creek-0.075
 river-0.075
Token "
Token ID366
Feature activation+3.06e-02

Pos logit contributions
aid+0.046
Of+0.046
 forgiveness+0.046
olding+0.045
 apology+0.045
Neg logit contributions
 lake-0.033
neath-0.032
 camp-0.031
 river-0.031
 shore-0.031
TokenHow
Token ID2437
Feature activation+1.88e-02

Pos logit contributions
olding+0.058
 apology+0.058
 forgiveness+0.057
aid+0.056
Of+0.054
Neg logit contributions
 lake-0.058
neath-0.058
 clearing-0.055
 river-0.053
 shore-0.053
Token can
Token ID460
Feature activation+2.47e-02

Pos logit contributions
Of+0.036
aid+0.035
 forgiveness+0.034
olding+0.034
 apology+0.033
Neg logit contributions
 lake-0.035
 camp-0.034
neath-0.034
 creek-0.033
 river-0.033
Token it
Token ID340
Feature activation-2.82e-03

Pos logit contributions
neath+0.040
 ditch+0.038
 lake+0.037
 canyon+0.037
 creek+0.037
Neg logit contributions
 forgiveness-0.050
Of-0.049
 apology-0.045
aid-0.045
 else-0.044
Token together
Token ID1978
Feature activation-1.16e-02

Pos logit contributions
 lake+0.004
neath+0.004
 creek+0.004
 camp+0.004
 shore+0.004
Neg logit contributions
Of-0.006
 forgiveness-0.006
aid-0.006
olding-0.006
 apology-0.006
Token?"
Token ID1701
Feature activation+3.53e-02

Pos logit contributions
neath+0.017
 lake+0.017
 creek+0.016
 river+0.016
 camp+0.016
Neg logit contributions
Of-0.026
 forgiveness-0.026
aid-0.025
olding-0.025
 apology-0.025
Token
Token ID198
Feature activation+2.57e-02

Pos logit contributions
olding+0.093
 forgiveness+0.090
aid+0.090
Of+0.090
asing+0.090
Neg logit contributions
 lake-0.040
 camp-0.039
neath-0.037
 creek-0.037
 shore-0.035
Token
Token ID198
Feature activation+4.44e-02

Pos logit contributions
Of+0.061
olding+0.061
 forgiveness+0.061
aid+0.059
 apology+0.059
Neg logit contributions
 lake-0.035
 camp-0.034
neath-0.033
 creek-0.031
 shore-0.031
Token"
Token ID1
Feature activation+1.18e-01

Pos logit contributions
olding+0.097
 apology+0.095
 forgiveness+0.094
aser+0.091
olded+0.089
Neg logit contributions
 lake-0.072
 camp-0.070
 clearing-0.067
 creek-0.066
neath-0.065
TokenYes
Token ID5297
Feature activation+1.40e-02

Pos logit contributions
aser+0.190
olds+0.188
asers+0.186
 actions+0.183
olding+0.183
Neg logit contributions
 lake-0.234
 clearing-0.205
 pool-0.198
 pond-0.196
 creek-0.196
Token,
Token ID11
Feature activation+6.89e-02

Pos logit contributions
Of+0.037
aid+0.036
 forgiveness+0.036
olding+0.036
 apology+0.035
Neg logit contributions
 lake-0.016
neath-0.015
 camp-0.015
 shore-0.014
 river-0.014
Token we
Token ID356
Feature activation+1.26e-02

Pos logit contributions
olding+0.154
kind+0.148
asing+0.148
olded+0.148
olds+0.147
Neg logit contributions
 camp-0.111
 lake-0.100
 pool-0.093
 river-0.089
 spring-0.086
Token can
Token ID460
Feature activation-3.26e-03

Pos logit contributions
Of+0.022
 forgiveness+0.021
olding+0.021
aid+0.021
 apology+0.020
Neg logit contributions
 lake-0.026
neath-0.025
 camp-0.025
 river-0.024
 creek-0.024
Token,
Token ID11
Feature activation+3.72e-02

Pos logit contributions
neath+0.006
 lake+0.006
 creek+0.006
 canyon+0.006
 camp+0.006
Neg logit contributions
Of-0.006
 forgiveness-0.006
olding-0.006
aid-0.006
 apology-0.006
Token my
Token ID616
Feature activation+1.34e-02

Pos logit contributions
neath+0.033
 lake+0.031
 valleys+0.031
 canyon+0.030
 camp+0.029
Neg logit contributions
Of-0.041
 forgiveness-0.040
olding-0.039
 apology-0.039
aid-0.037
Token love
Token ID1842
Feature activation+9.44e-03

Pos logit contributions
Of+0.022
 forgiveness+0.021
aid+0.021
olding+0.020
 apology+0.020
Neg logit contributions
 lake-0.029
neath-0.028
 camp-0.028
 river-0.027
 creek-0.027
Token,"
Token ID553
Feature activation+3.59e-02

Pos logit contributions
Of+0.020
 forgiveness+0.019
olding+0.019
aid+0.019
 apology+0.019
Neg logit contributions
 lake-0.016
neath-0.015
 camp-0.015
 river-0.015
 creek-0.015
Token said
Token ID531
Feature activation+1.03e-03

Pos logit contributions
olding+0.081
 apology+0.077
asing+0.076
 forgiveness+0.075
Of+0.075
Neg logit contributions
 lake-0.055
neath-0.052
 camp-0.052
 shore-0.050
 river-0.050
Token M
Token ID337
Feature activation-1.35e-03

Pos logit contributions
Of+0.002
 forgiveness+0.002
aid+0.002
olding+0.002
 apology+0.002
Neg logit contributions
 lake-0.002
neath-0.002
 camp-0.001
 creek-0.001
 river-0.001
Tokenomm
Token ID2002
Feature activation+1.17e-01

Pos logit contributions
 lake+0.003
neath+0.003
 camp+0.003
 creek+0.003
 river+0.003
Neg logit contributions
Of-0.002
olding-0.002
 forgiveness-0.002
aid-0.002
 apology-0.002
Tokena
Token ID64
Feature activation+4.25e-02

Pos logit contributions
 interests+0.219
Of+0.206
 passions+0.206
Deal+0.204
 forgiveness+0.204
Neg logit contributions
 tim-0.162
 camp-0.146
 clearing-0.141
 lake-0.134
 polar-0.134
Token.
Token ID13
Feature activation-1.16e-02

Pos logit contributions
Of+0.099
 forgiveness+0.097
 apology+0.096
olding+0.095
aid+0.095
Neg logit contributions
 lake-0.055
neath-0.052
 camp-0.052
 river-0.051
 shore-0.050
Token "
Token ID366
Feature activation+5.11e-02

Pos logit contributions
 lake+0.022
neath+0.022
 camp+0.022
 canyon+0.021
 creek+0.021
Neg logit contributions
Of-0.022
olding-0.021
 forgiveness-0.020
aid-0.020
 apology-0.019
TokenLet
Token ID5756
Feature activation+4.46e-03

Pos logit contributions
 apology+0.102
olding+0.100
 forgiveness+0.099
olded+0.097
aser+0.095
Neg logit contributions
neath-0.087
 lake-0.085
 valleys-0.080
 clearing-0.079
 river-0.078
Token's
Token ID338
Feature activation-4.28e-02

Pos logit contributions
Of+0.009
 forgiveness+0.008
olding+0.008
aid+0.008
 apology+0.008
Neg logit contributions
 lake-0.008
neath-0.008
 camp-0.008
 creek-0.008
 river-0.008
Token?"
Token ID1701
Feature activation+1.65e-02

Pos logit contributions
 lake+0.030
neath+0.030
 valleys+0.029
 canyon+0.029
 shore+0.029
Neg logit contributions
Of-0.041
 forgiveness-0.040
aid-0.039
olding-0.038
asing-0.037
Token Her
Token ID2332
Feature activation+3.09e-02

Pos logit contributions
Of+0.032
aid+0.032
olding+0.032
 forgiveness+0.032
 apology+0.031
Neg logit contributions
 lake-0.029
neath-0.029
 camp-0.028
 shore-0.028
 river-0.027
Token dad
Token ID9955
Feature activation+3.22e-02

Pos logit contributions
aid+0.060
Of+0.058
olding+0.056
 forgiveness+0.055
asing+0.055
Neg logit contributions
 lake-0.060
 shore-0.059
 ocean-0.058
 camp-0.057
 creek-0.056
Token replied
Token ID8712
Feature activation+2.90e-02

Pos logit contributions
olding+0.053
 forgiveness+0.053
Of+0.053
aid+0.052
 apology+0.052
Neg logit contributions
 lake-0.065
neath-0.064
 camp-0.060
 river-0.059
 clearing-0.059
Token,
Token ID11
Feature activation+1.08e-03

Pos logit contributions
Of+0.082
aid+0.081
 forgiveness+0.079
olding+0.079
 apology+0.079
Neg logit contributions
 lake-0.024
neath-0.023
 camp-0.022
 shore-0.022
 river-0.021
Token "
Token ID366
Feature activation+1.16e-01

Pos logit contributions
Of+0.002
 forgiveness+0.002
olding+0.002
aid+0.002
 apology+0.002
Neg logit contributions
 lake-0.002
neath-0.002
 camp-0.002
 creek-0.002
 canyon-0.002
TokenNo
Token ID2949
Feature activation+2.75e-02

Pos logit contributions
 forgiveness+0.223
 actions+0.219
 apology+0.219
 supervision+0.219
 instincts+0.200
Neg logit contributions
 lake-0.202
 river-0.191
 clearing-0.186
 hills-0.183
 creek-0.181
Token,
Token ID11
Feature activation+4.03e-02

Pos logit contributions
Of+0.079
aid+0.078
olding+0.077
asing+0.075
 forgiveness+0.074
Neg logit contributions
 lake-0.027
 camp-0.025
 river-0.023
 shore-0.022
 pool-0.021
Token we
Token ID356
Feature activation-7.81e-03

Pos logit contributions
olding+0.085
aid+0.084
Of+0.081
asing+0.080
 forgiveness+0.079
Neg logit contributions
 lake-0.066
 camp-0.063
 river-0.061
 shore-0.060
 pool-0.059
Token can
Token ID460
Feature activation-7.16e-03

Pos logit contributions
 lake+0.017
neath+0.016
 camp+0.016
 shore+0.016
 creek+0.016
Neg logit contributions
Of-0.013
olding-0.012
 forgiveness-0.012
aid-0.012
 apology-0.012
Token't
Token ID470
Feature activation-3.77e-03

Pos logit contributions
 lake+0.012
neath+0.012
 valleys+0.012
 canyon+0.012
 camp+0.012
Neg logit contributions
Of-0.014
 forgiveness-0.014
olding-0.014
aid-0.014
 apology-0.014
Token be
Token ID307
Feature activation+7.48e-03

Pos logit contributions
neath+0.023
 lake+0.022
 valleys+0.022
 creek+0.021
 camp+0.021
Neg logit contributions
Of-0.026
 forgiveness-0.026
aid-0.026
 apology-0.025
olding-0.025
Token friends
Token ID2460
Feature activation+3.15e-02

Pos logit contributions
Of+0.010
 forgiveness+0.009
olding+0.009
aid+0.009
 apology+0.009
Neg logit contributions
 lake-0.018
neath-0.018
 camp-0.018
 creek-0.018
 river-0.018
Token?"
Token ID1701
Feature activation+4.02e-02

Pos logit contributions
Of+0.085
olding+0.085
 apology+0.083
aid+0.083
 forgiveness+0.081
Neg logit contributions
 lake-0.033
 camp-0.030
neath-0.030
 river-0.028
 ocean-0.028
Token
Token ID198
Feature activation+2.93e-02

Pos logit contributions
olding+0.110
asing+0.109
 forgiveness+0.107
aser+0.107
 apology+0.106
Neg logit contributions
 lake-0.040
 shore-0.036
neath-0.036
 river-0.034
 camp-0.033
Token
Token ID198
Feature activation+3.25e-02

Pos logit contributions
Of+0.070
olding+0.070
 forgiveness+0.069
asing+0.068
aid+0.067
Neg logit contributions
 lake-0.039
 camp-0.036
neath-0.036
 shore-0.035
 river-0.034
Token"
Token ID1
Feature activation+1.16e-01

Pos logit contributions
olding+0.057
 forgiveness+0.057
 apology+0.055
Of+0.055
asing+0.054
Neg logit contributions
 lake-0.064
 river-0.060
 clearing-0.059
 valleys-0.059
neath-0.059
TokenOf
Token ID5189
Feature activation+5.25e-02

Pos logit contributions
grand+0.170
olding+0.162
olds+0.162
 actions+0.161
 supervision+0.161
Neg logit contributions
 lake-0.237
neath-0.211
 creek-0.206
 river-0.204
 pond-0.202
Token course
Token ID1781
Feature activation+3.51e-02

Pos logit contributions
olding+0.137
Of+0.132
olded+0.128
asing+0.127
asers+0.126
Neg logit contributions
 lake-0.067
neath-0.057
 camp-0.053
 rav-0.052
 ocean-0.052
Token,
Token ID11
Feature activation+3.34e-02

Pos logit contributions
aid+0.093
Of+0.092
olding+0.091
 forgiveness+0.089
asing+0.089
Neg logit contributions
 lake-0.038
 camp-0.035
 river-0.032
 shore-0.032
neath-0.031
Token oct
Token ID19318
Feature activation+1.25e-02

Pos logit contributions
olding+0.061
aid+0.060
Of+0.058
asing+0.057
olded+0.057
Neg logit contributions
 lake-0.066
 camp-0.062
 shore-0.061
 river-0.061
 ocean-0.059
Tokenopus
Token ID25790
Feature activation+4.03e-02

Pos logit contributions
Of+0.022
 forgiveness+0.021
olding+0.021
aid+0.021
 apology+0.021
Neg logit contributions
 lake-0.025
neath-0.024
 camp-0.024
 creek-0.023
 river-0.023
Token Her
Token ID2332
Feature activation+4.42e-02

Pos logit contributions
aid+0.069
olding+0.067
asing+0.067
asers+0.064
 forgiveness+0.064
Neg logit contributions
neath-0.062
 lake-0.062
 camp-0.059
 shore-0.058
 river-0.058
Token mom
Token ID1995
Feature activation+5.18e-02

Pos logit contributions
asing+0.095
aid+0.094
Of+0.092
olding+0.091
olded+0.088
Neg logit contributions
 lake-0.079
 camp-0.076
 pool-0.075
 river-0.071
 ocean-0.070
Tokenmy
Token ID1820
Feature activation+3.58e-02

Pos logit contributions
 forgiveness+0.129
 apology+0.127
olding+0.127
Of+0.126
asers+0.123
Neg logit contributions
 lake-0.054
neath-0.051
 clearing-0.048
 camp-0.045
 river-0.044
Token replied
Token ID8712
Feature activation+2.75e-02

Pos logit contributions
olding+0.058
 forgiveness+0.058
Of+0.058
aid+0.057
 apology+0.056
Neg logit contributions
 lake-0.073
neath-0.070
 camp-0.067
 river-0.066
 clearing-0.065
Token,
Token ID11
Feature activation+7.48e-03

Pos logit contributions
Of+0.077
aid+0.076
 forgiveness+0.075
olding+0.074
 apology+0.073
Neg logit contributions
 lake-0.023
neath-0.023
 camp-0.022
 shore-0.021
 river-0.020
Token "
Token ID366
Feature activation+1.16e-01

Pos logit contributions
Of+0.015
 forgiveness+0.015
olding+0.015
aid+0.015
 apology+0.014
Neg logit contributions
 lake-0.013
neath-0.013
 camp-0.012
 creek-0.012
 river-0.012
TokenMaybe
Token ID13300
Feature activation+2.40e-02

Pos logit contributions
 actions+0.211
 apology+0.210
 forgiveness+0.208
aser+0.205
 supervision+0.203
Neg logit contributions
 lake-0.187
 clearing-0.180
neath-0.170
 field-0.165
 hills-0.160
Token another
Token ID1194
Feature activation+2.22e-02

Pos logit contributions
aid+0.047
Of+0.046
olding+0.046
 forgiveness+0.044
 apology+0.044
Neg logit contributions
 lake-0.045
 camp-0.043
 shore-0.043
neath-0.042
 river-0.041
Token day
Token ID1110
Feature activation+2.06e-02

Pos logit contributions
aid+0.046
Of+0.046
 forgiveness+0.045
olding+0.044
asing+0.043
Neg logit contributions
 lake-0.040
 camp-0.037
 river-0.036
 shore-0.036
 ocean-0.036
Token,
Token ID11
Feature activation+3.64e-02

Pos logit contributions
Of+0.042
aid+0.042
 forgiveness+0.041
olding+0.041
 apology+0.040
Neg logit contributions
 lake-0.035
neath-0.034
 camp-0.034
 shore-0.032
 river-0.032
Token sweet
Token ID6029
Feature activation+1.01e-02

Pos logit contributions
olding+0.070
aid+0.069
Of+0.067
 apology+0.066
 forgiveness+0.066
Neg logit contributions
 lake-0.065
 camp-0.064
neath-0.061
 river-0.061
 creek-0.060
Tokenmy
Token ID1820
Feature activation+5.06e-02

Pos logit contributions
 apology+0.175
olding+0.174
 forgiveness+0.173
asers+0.168
oral+0.167
Neg logit contributions
 lake-0.049
neath-0.045
 clearing-0.045
 camp-0.038
 river-0.037
Token smiled
Token ID13541
Feature activation+1.55e-02

Pos logit contributions
olding+0.093
 apology+0.091
 forgiveness+0.090
olds+0.089
asing+0.087
Neg logit contributions
 lake-0.092
neath-0.087
 camp-0.083
 clearing-0.082
 river-0.081
Token and
Token ID290
Feature activation+2.90e-02

Pos logit contributions
Of+0.040
olding+0.039
 forgiveness+0.039
aid+0.039
 apology+0.038
Neg logit contributions
 lake-0.018
neath-0.018
 camp-0.017
 shore-0.016
 river-0.016
Token said
Token ID531
Feature activation+5.04e-02

Pos logit contributions
olding+0.046
 forgiveness+0.045
 apology+0.044
aid+0.043
Of+0.042
Neg logit contributions
 lake-0.062
neath-0.060
 camp-0.059
 shore-0.059
 river-0.058
Token,
Token ID11
Feature activation+2.33e-02

Pos logit contributions
Of+0.155
olding+0.153
aid+0.152
grand+0.150
asing+0.150
Neg logit contributions
neath-0.017
 camp-0.017
 lake-0.016
 shore-0.013
 pool-0.012
Token "
Token ID366
Feature activation+1.15e-01

Pos logit contributions
 forgiveness+0.051
aid+0.051
olding+0.051
Of+0.050
 apology+0.050
Neg logit contributions
 lake-0.035
neath-0.034
 river-0.032
 shore-0.032
 camp-0.032
TokenSure
Token ID19457
Feature activation+3.50e-02

Pos logit contributions
 supervision+0.190
 actions+0.189
 apology+0.188
asers+0.188
aser+0.186
Neg logit contributions
 lake-0.211
 clearing-0.199
neath-0.195
 river-0.190
 field-0.188
Token,
Token ID11
Feature activation+5.04e-02

Pos logit contributions
Of+0.106
aid+0.105
olding+0.103
asing+0.102
asers+0.100
Neg logit contributions
 lake-0.025
 camp-0.024
 river-0.019
 shore-0.019
 pool-0.018
Token let
Token ID1309
Feature activation-4.49e-03

Pos logit contributions
olding+0.103
aid+0.102
olds+0.100
olded+0.098
asing+0.095
Neg logit contributions
 lake-0.082
 camp-0.081
 river-0.076
 shore-0.075
 creek-0.074
Token's
Token ID338
Feature activation-3.83e-02

Pos logit contributions
 lake+0.009
neath+0.009
 creek+0.009
 camp+0.009
 canyon+0.009
Neg logit contributions
Of-0.008
 forgiveness-0.008
aid-0.007
olding-0.007
 apology-0.007
Token go
Token ID467
Feature activation-2.54e-02

Pos logit contributions
 valleys+0.074
neath+0.072
 clearing+0.069
 canyon+0.068
 lake+0.067
Neg logit contributions
 forgiveness-0.068
Of-0.066
 forgive-0.065
 promise-0.065
 apology-0.063
Token Ben
Token ID3932
Feature activation+3.76e-02

Pos logit contributions
Of+0.009
 forgiveness+0.009
olding+0.009
aid+0.009
 apology+0.009
Neg logit contributions
 lake-0.006
neath-0.006
 camp-0.006
 creek-0.005
 river-0.005
Token asked
Token ID1965
Feature activation+4.62e-02

Pos logit contributions
Of+0.076
aid+0.076
olding+0.075
 forgiveness+0.075
 apology+0.073
Neg logit contributions
 lake-0.063
neath-0.059
 river-0.059
 ocean-0.058
 camp-0.058
Token.
Token ID13
Feature activation+3.71e-02

Pos logit contributions
Of+0.116
aid+0.112
Deal+0.109
olding+0.108
asing+0.107
Neg logit contributions
 camp-0.053
neath-0.053
 shore-0.051
 lake-0.051
 pool-0.049
Token
Token ID198
Feature activation+2.95e-02

Pos logit contributions
asing+0.098
 forgiveness+0.096
olding+0.095
aid+0.095
Deal+0.093
Neg logit contributions
 lake-0.041
neath-0.039
 river-0.038
 creek-0.038
 camp-0.038
Token
Token ID198
Feature activation+2.83e-02

Pos logit contributions
Of+0.072
 forgiveness+0.071
olding+0.071
aid+0.070
asing+0.070
Neg logit contributions
 lake-0.037
neath-0.036
 camp-0.035
 shore-0.034
 river-0.033
Token"
Token ID1
Feature activation+1.15e-01

Pos logit contributions
 forgiveness+0.054
Of+0.053
 apology+0.053
olding+0.052
aid+0.051
Neg logit contributions
 lake-0.052
neath-0.049
 river-0.049
 creek-0.049
 camp-0.048
TokenOf
Token ID5189
Feature activation+4.64e-02

Pos logit contributions
 actions+0.198
iness+0.195
asers+0.194
 forgiveness+0.194
aser+0.193
Neg logit contributions
 lake-0.199
 creek-0.189
neath-0.181
 river-0.175
 pool-0.169
Token course
Token ID1781
Feature activation+3.34e-02

Pos logit contributions
olding+0.115
Of+0.112
asing+0.110
olded+0.109
asers+0.109
Neg logit contributions
 lake-0.065
neath-0.059
 clearing-0.054
 ocean-0.053
 shore-0.053
Token we
Token ID356
Feature activation+1.27e-02

Pos logit contributions
aid+0.081
Of+0.079
olding+0.078
 forgiveness+0.076
asing+0.076
Neg logit contributions
 lake-0.045
 camp-0.043
 shore-0.040
 river-0.040
neath-0.040
Token can
Token ID460
Feature activation-3.94e-03

Pos logit contributions
Of+0.023
 forgiveness+0.022
olding+0.022
aid+0.022
 apology+0.021
Neg logit contributions
 lake-0.025
neath-0.025
 camp-0.024
 river-0.024
 creek-0.024
Token.
Token ID13
Feature activation+3.03e-02

Pos logit contributions
neath+0.007
 lake+0.007
 camp+0.007
 creek+0.007
 canyon+0.007
Neg logit contributions
Of-0.008
 forgiveness-0.008
olding-0.007
aid-0.007
 apology-0.007
Tokenmy
Token ID1820
Feature activation+4.72e-02

Pos logit contributions
 apology+0.140
 forgiveness+0.140
olding+0.136
oral+0.135
grand+0.133
Neg logit contributions
 lake-0.057
neath-0.054
 clearing-0.051
 pool-0.046
 camp-0.046
Token smiled
Token ID13541
Feature activation+1.85e-02

Pos logit contributions
 forgiveness+0.084
 apology+0.084
olding+0.084
grand+0.079
olds+0.079
Neg logit contributions
 lake-0.087
neath-0.083
 clearing-0.077
 camp-0.077
 river-0.077
Token and
Token ID290
Feature activation+2.82e-02

Pos logit contributions
Of+0.048
 forgiveness+0.048
olding+0.048
aid+0.048
 apology+0.047
Neg logit contributions
 lake-0.021
neath-0.020
 camp-0.019
 shore-0.019
 creek-0.018
Token said
Token ID531
Feature activation+4.50e-02

Pos logit contributions
olding+0.044
 forgiveness+0.044
 apology+0.043
aid+0.041
asing+0.041
Neg logit contributions
 lake-0.061
neath-0.059
 shore-0.057
 camp-0.057
 river-0.056
Token,
Token ID11
Feature activation+9.80e-03

Pos logit contributions
aid+0.131
Of+0.130
 forgiveness+0.129
olding+0.128
 apology+0.128
Neg logit contributions
 lake-0.026
neath-0.026
 camp-0.025
 shore-0.024
 pool-0.022
Token "
Token ID366
Feature activation+1.15e-01

Pos logit contributions
Of+0.020
 forgiveness+0.020
olding+0.020
aid+0.020
 apology+0.019
Neg logit contributions
 lake-0.017
neath-0.016
 camp-0.016
 river-0.015
 shore-0.015
TokenSure
Token ID19457
Feature activation+3.14e-02

Pos logit contributions
 apology+0.200
 actions+0.198
 forgiveness+0.197
aser+0.197
 supervision+0.196
Neg logit contributions
 lake-0.209
 clearing-0.190
neath-0.185
 river-0.179
 field-0.177
Token,
Token ID11
Feature activation+4.19e-02

Pos logit contributions
aid+0.092
Of+0.092
olding+0.090
asing+0.089
 forgiveness+0.087
Neg logit contributions
 lake-0.026
 camp-0.024
 shore-0.021
 river-0.020
 clearing-0.020
Token let
Token ID1309
Feature activation-1.16e-02

Pos logit contributions
olding+0.081
aid+0.080
olds+0.078
asing+0.076
 apology+0.075
Neg logit contributions
 lake-0.074
 camp-0.072
 river-0.068
 shore-0.068
neath-0.067
Token's
Token ID338
Feature activation-3.04e-02

Pos logit contributions
 lake+0.025
 creek+0.024
neath+0.024
 canyon+0.024
 valleys+0.024
Neg logit contributions
Of-0.018
 forgiveness-0.018
 apology-0.017
aid-0.017
olding-0.017
Token have
Token ID423
Feature activation+3.01e-03

Pos logit contributions
 valleys+0.054
neath+0.054
 clearing+0.052
 lake+0.051
 canyon+0.051
Neg logit contributions
 forgiveness-0.056
Of-0.056
aid-0.054
 apology-0.054
olding-0.053
Token own
Token ID898
Feature activation+1.53e-02

Pos logit contributions
aid+0.090
Of+0.089
olding+0.088
olded+0.083
asing+0.082
Neg logit contributions
 lake-0.085
 camp-0.084
 ocean-0.082
 pool-0.079
 shore-0.078
Token story
Token ID1621
Feature activation+3.29e-02

Pos logit contributions
Of+0.025
olding+0.024
aid+0.024
 forgiveness+0.023
 apology+0.022
Neg logit contributions
 lake-0.034
 camp-0.032
neath-0.032
 shore-0.032
 river-0.032
Token?"
Token ID1701
Feature activation+3.41e-02

Pos logit contributions
Of+0.070
 forgiveness+0.068
olding+0.068
aid+0.068
 apology+0.067
Neg logit contributions
 camp-0.052
 lake-0.052
neath-0.050
 pool-0.049
 river-0.048
Token
Token ID198
Feature activation+3.39e-02

Pos logit contributions
olding+0.089
aid+0.088
asing+0.088
 forgiveness+0.087
 apology+0.086
Neg logit contributions
 lake-0.037
neath-0.037
 camp-0.034
 shore-0.034
 creek-0.034
Token
Token ID198
Feature activation+3.85e-02

Pos logit contributions
olding+0.087
Of+0.086
asing+0.086
 forgiveness+0.085
aid+0.084
Neg logit contributions
 lake-0.039
neath-0.038
 camp-0.037
 shore-0.035
 river-0.034
Token"
Token ID1
Feature activation+1.14e-01

Pos logit contributions
olding+0.076
 apology+0.075
asers+0.075
 forgiveness+0.074
asing+0.074
Neg logit contributions
 lake-0.066
neath-0.064
 camp-0.062
 river-0.061
 shore-0.061
TokenOf
Token ID5189
Feature activation+4.86e-02

Pos logit contributions
 actions+0.197
 instincts+0.189
iness+0.188
 supervision+0.184
 behavior+0.178
Neg logit contributions
 lake-0.203
neath-0.183
 creek-0.182
 clearing-0.180
 pool-0.178
Token course
Token ID1781
Feature activation+3.86e-02

Pos logit contributions
olding+0.121
Of+0.117
asing+0.115
olded+0.114
asers+0.114
Neg logit contributions
 lake-0.066
neath-0.059
 clearing-0.054
 camp-0.053
 rav-0.052
Token,
Token ID11
Feature activation+6.58e-02

Pos logit contributions
aid+0.111
Of+0.108
olding+0.107
 forgiveness+0.106
asing+0.106
Neg logit contributions
 lake-0.033
 camp-0.031
neath-0.027
 shore-0.027
 river-0.027
Token we
Token ID356
Feature activation+1.54e-02

Pos logit contributions
olding+0.151
olds+0.145
asing+0.144
kind+0.144
oral+0.143
Neg logit contributions
 lake-0.085
 camp-0.085
 pool-0.077
 river-0.074
 creek-0.072
Token can
Token ID460
Feature activation-7.37e-04

Pos logit contributions
Of+0.028
 forgiveness+0.028
olding+0.028
aid+0.027
 apology+0.027
Neg logit contributions
 lake-0.030
neath-0.029
 camp-0.028
 river-0.028
 creek-0.028
Token of
Token ID286
Feature activation+3.64e-02

Pos logit contributions
Of+0.122
olding+0.122
 forgiveness+0.113
Deal+0.111
 apology+0.111
Neg logit contributions
 lake-0.081
 ocean-0.077
 camp-0.075
 river-0.075
 pool-0.074
Token yours
Token ID12431
Feature activation+2.39e-02

Pos logit contributions
olding+0.095
Of+0.094
aid+0.090
olded+0.088
 forgiveness+0.088
Neg logit contributions
 lake-0.044
neath-0.040
 shore-0.040
 camp-0.040
 ocean-0.039
Token?"
Token ID1701
Feature activation+1.61e-02

Pos logit contributions
Of+0.062
olding+0.061
 forgiveness+0.060
aid+0.060
 apology+0.059
Neg logit contributions
 lake-0.028
neath-0.027
 camp-0.026
 shore-0.025
 river-0.025
Token
Token ID198
Feature activation+2.02e-02

Pos logit contributions
Of+0.037
 forgiveness+0.036
aid+0.036
olding+0.036
 apology+0.035
Neg logit contributions
 lake-0.024
neath-0.023
 camp-0.022
 creek-0.022
 shore-0.022
Token
Token ID198
Feature activation+1.21e-02

Pos logit contributions
Of+0.047
 forgiveness+0.046
olding+0.046
aid+0.045
 apology+0.044
Neg logit contributions
 lake-0.029
neath-0.028
 camp-0.027
 shore-0.027
 creek-0.026
Token"
Token ID1
Feature activation+1.14e-01

Pos logit contributions
Of+0.021
 forgiveness+0.021
olding+0.021
aid+0.020
 apology+0.020
Neg logit contributions
 lake-0.024
neath-0.024
 camp-0.023
 creek-0.023
 river-0.023
TokenOf
Token ID5189
Feature activation+4.66e-02

Pos logit contributions
 actions+0.213
 instincts+0.211
 forgiveness+0.209
iness+0.204
 apology+0.194
Neg logit contributions
 lake-0.179
neath-0.178
 creek-0.167
 clearing-0.161
Look-0.160
Token course
Token ID1781
Feature activation+3.77e-02

Pos logit contributions
olding+0.115
Of+0.112
asing+0.109
olded+0.108
asers+0.107
Neg logit contributions
 lake-0.063
neath-0.057
 clearing-0.053
 camp-0.051
 rav-0.051
Token,
Token ID11
Feature activation+3.96e-02

Pos logit contributions
aid+0.101
Of+0.099
 forgiveness+0.097
olding+0.097
asing+0.096
Neg logit contributions
 lake-0.040
 camp-0.037
 shore-0.033
 river-0.033
neath-0.033
Token you
Token ID345
Feature activation+1.53e-02

Pos logit contributions
olding+0.075
aid+0.075
 forgiveness+0.074
Of+0.073
olded+0.070
Neg logit contributions
 lake-0.071
 camp-0.067
 shore-0.063
 pool-0.063
neath-0.063
Token can
Token ID460
Feature activation+2.57e-02

Pos logit contributions
Of+0.028
 forgiveness+0.028
olding+0.027
aid+0.027
 apology+0.026
Neg logit contributions
 lake-0.029
neath-0.029
 camp-0.028
 river-0.027
 creek-0.027
Token day
Token ID1110
Feature activation-3.38e-02

Pos logit contributions
Of+0.029
 forgiveness+0.028
aid+0.028
olding+0.027
 apology+0.027
Neg logit contributions
 lake-0.028
neath-0.027
 camp-0.027
 creek-0.026
 river-0.026
Token,
Token ID11
Feature activation-1.31e-02

Pos logit contributions
 lake+0.056
 canyon+0.055
neath+0.054
 river+0.053
 creek+0.051
Neg logit contributions
Of-0.069
 forgiveness-0.069
aid-0.066
olding-0.066
olded-0.063
Token Emma
Token ID18966
Feature activation-6.28e-02

Pos logit contributions
 lake+0.021
neath+0.020
 canyon+0.020
 river+0.019
 camp+0.019
Neg logit contributions
Of-0.029
 forgiveness-0.028
aid-0.027
olding-0.026
 apology-0.026
Token asked
Token ID1965
Feature activation+9.15e-03

Pos logit contributions
 canyon+0.136
 lake+0.133
neath+0.133
 valleys+0.131
 ditch+0.130
Neg logit contributions
olded-0.084
Of-0.081
olding-0.081
 else-0.080
ups-0.076
Token her
Token ID607
Feature activation+3.60e-02

Pos logit contributions
Of+0.018
 forgiveness+0.018
olding+0.018
aid+0.018
 apology+0.017
Neg logit contributions
 lake-0.016
neath-0.016
 camp-0.016
 creek-0.015
 shore-0.015
Token pap
Token ID20461
Feature activation+1.14e-01

Pos logit contributions
aid+0.060
Of+0.059
olding+0.059
asing+0.058
Deal+0.057
Neg logit contributions
 lake-0.076
 camp-0.076
 creek-0.073
 shore-0.073
 river-0.072
Tokena
Token ID64
Feature activation+4.75e-02

Pos logit contributions
Of+0.206
 actions+0.196
 importantly+0.193
olds+0.193
 apology+0.192
Neg logit contributions
 clearing-0.150
 lake-0.140
 ocean-0.135
 camp-0.132
 tim-0.126
Token if
Token ID611
Feature activation+2.34e-02

Pos logit contributions
aid+0.099
Of+0.098
 forgiveness+0.094
olding+0.094
 apology+0.094
Neg logit contributions
 camp-0.076
 lake-0.076
neath-0.074
 creek-0.072
 river-0.071
Token they
Token ID484
Feature activation+2.75e-02

Pos logit contributions
Of+0.033
aid+0.033
olding+0.033
 apology+0.032
 forgiveness+0.031
Neg logit contributions
 lake-0.054
neath-0.053
 camp-0.053
 shore-0.052
 creek-0.051
Token could
Token ID714
Feature activation-1.94e-02

Pos logit contributions
Of+0.051
olding+0.050
 forgiveness+0.050
aid+0.049
 apology+0.049
Neg logit contributions
 lake-0.051
neath-0.051
 camp-0.049
 river-0.049
 creek-0.048
Token go
Token ID467
Feature activation-2.93e-02

Pos logit contributions
 lake+0.034
neath+0.034
 valleys+0.034
 canyon+0.033
 clearing+0.033
Neg logit contributions
 forgiveness-0.037
Of-0.036
olding-0.036
 apology-0.036
aid-0.036
Token waves
Token ID9813
Feature activation+2.48e-03

Pos logit contributions
neath+0.072
 tim+0.067
 Everest+0.061
 canyon+0.059
 clearing+0.059
Neg logit contributions
Of-0.105
 forgiveness-0.103
 apology-0.099
 else-0.099
aid-0.098
Token again
Token ID757
Feature activation-2.57e-02

Pos logit contributions
Of+0.005
 forgiveness+0.005
olding+0.005
aid+0.005
 apology+0.005
Neg logit contributions
 lake-0.004
neath-0.004
 camp-0.004
 creek-0.004
 river-0.004
Token?"
Token ID1701
Feature activation+3.52e-02

Pos logit contributions
neath+0.043
 lake+0.042
 creek+0.040
 canyon+0.039
 valleys+0.039
Neg logit contributions
Of-0.052
 forgiveness-0.050
olding-0.048
aid-0.048
 else-0.046
Token
Token ID198
Feature activation+2.97e-02

Pos logit contributions
olding+0.089
asing+0.087
aid+0.086
asers+0.085
 apology+0.084
Neg logit contributions
 lake-0.043
neath-0.043
 camp-0.039
 shore-0.039
 pool-0.037
Token
Token ID198
Feature activation+3.24e-02

Pos logit contributions
Of+0.071
olding+0.071
asing+0.070
 forgiveness+0.070
aid+0.070
Neg logit contributions
 lake-0.039
neath-0.038
 camp-0.037
 shore-0.035
 river-0.034
Token"
Token ID1
Feature activation+1.14e-01

Pos logit contributions
olding+0.047
 forgiveness+0.046
asers+0.046
Of+0.046
asing+0.046
Neg logit contributions
 lake-0.072
neath-0.071
 camp-0.069
 river-0.068
 creek-0.067
TokenOf
Token ID5189
Feature activation+4.90e-02

Pos logit contributions
 actions+0.191
asers+0.191
olds+0.188
 apology+0.186
aser+0.185
Neg logit contributions
 lake-0.219
 clearing-0.203
neath-0.192
 pool-0.190
 creek-0.189
Token course
Token ID1781
Feature activation+2.34e-02

Pos logit contributions
olding+0.119
asers+0.114
asing+0.113
Of+0.111
olded+0.111
Neg logit contributions
 lake-0.069
neath-0.062
 clearing-0.058
 camp-0.057
 ocean-0.056
Token,
Token ID11
Feature activation+4.34e-02

Pos logit contributions
aid+0.058
Of+0.057
olding+0.057
 forgiveness+0.057
 apology+0.056
Neg logit contributions
 lake-0.029
 camp-0.028
neath-0.027
 shore-0.026
 river-0.026
Token we
Token ID356
Feature activation+3.19e-03

Pos logit contributions
olding+0.097
aid+0.094
olds+0.093
asing+0.093
asers+0.090
Neg logit contributions
 lake-0.064
 camp-0.063
 river-0.058
 pool-0.058
 creek-0.057
Token can
Token ID460
Feature activation-3.15e-02

Pos logit contributions
Of+0.005
olding+0.005
 forgiveness+0.005
aid+0.005
 apology+0.005
Neg logit contributions
 lake-0.007
neath-0.007
 camp-0.006
 creek-0.006
 shore-0.006
Token Lily
Token ID20037
Feature activation+3.79e-03

Pos logit contributions
aid+0.054
olding+0.054
Of+0.054
asing+0.053
 forgiveness+0.053
Neg logit contributions
 lake-0.039
 camp-0.038
neath-0.038
 creek-0.037
 shore-0.037
Token asks
Token ID7893
Feature activation+3.85e-02

Pos logit contributions
Of+0.006
 forgiveness+0.006
olding+0.006
aid+0.006
 apology+0.006
Neg logit contributions
 lake-0.008
neath-0.008
 camp-0.007
 creek-0.007
 river-0.007
Token.
Token ID13
Feature activation+3.53e-02

Pos logit contributions
Of+0.092
aid+0.088
olding+0.085
 apology+0.083
 forgiveness+0.083
Neg logit contributions
 camp-0.052
neath-0.051
 lake-0.051
 shore-0.051
 creek-0.048
Token
Token ID198
Feature activation+1.22e-02

Pos logit contributions
olding+0.090
 forgiveness+0.090
aid+0.089
asing+0.089
 apology+0.089
Neg logit contributions
 lake-0.041
neath-0.040
 creek-0.038
 camp-0.038
 shore-0.037
Token
Token ID198
Feature activation+1.24e-02

Pos logit contributions
Of+0.027
 forgiveness+0.026
olding+0.026
aid+0.026
 apology+0.025
Neg logit contributions
 lake-0.019
neath-0.018
 camp-0.018
 shore-0.017
 creek-0.017
Token"
Token ID1
Feature activation+1.14e-01

Pos logit contributions
Of+0.022
 forgiveness+0.021
olding+0.021
aid+0.021
 apology+0.021
Neg logit contributions
 lake-0.025
neath-0.024
 camp-0.024
 creek-0.023
 river-0.023
TokenYes
Token ID5297
Feature activation+1.67e-02

Pos logit contributions
aser+0.190
grand+0.188
asers+0.187
iness+0.184
asing+0.183
Neg logit contributions
 lake-0.196
 creek-0.185
 river-0.172
neath-0.171
 shore-0.171
Token,
Token ID11
Feature activation+6.56e-02

Pos logit contributions
Of+0.045
aid+0.045
olding+0.044
 forgiveness+0.044
 apology+0.043
Neg logit contributions
 lake-0.018
neath-0.017
 camp-0.016
 shore-0.016
 river-0.016
Token you
Token ID345
Feature activation+2.01e-02

Pos logit contributions
olding+0.141
kind+0.139
aid+0.139
olds+0.136
asers+0.135
Neg logit contributions
 lake-0.097
 camp-0.090
 ocean-0.089
 shore-0.089
 pool-0.088
Token can
Token ID460
Feature activation+1.51e-02

Pos logit contributions
Of+0.040
olding+0.039
 forgiveness+0.039
aid+0.039
 apology+0.037
Neg logit contributions
 lake-0.035
neath-0.035
 camp-0.034
 river-0.033
 creek-0.033
Token.
Token ID13
Feature activation+3.84e-02

Pos logit contributions
Of+0.032
 forgiveness+0.031
olding+0.031
aid+0.031
 apology+0.030
Neg logit contributions
 lake-0.025
neath-0.024
 camp-0.024
 river-0.023
 shore-0.023
Token.
Token ID13
Feature activation-2.02e-02

Pos logit contributions
Of+0.006
 forgiveness+0.005
olding+0.005
aid+0.005
 apology+0.005
Neg logit contributions
 lake-0.005
neath-0.005
 camp-0.005
 creek-0.005
 river-0.005
Token Ranger
Token ID21888
Feature activation+1.57e-03

Pos logit contributions
neath+0.033
 lake+0.031
 valleys+0.029
 creek+0.029
 shore+0.029
Neg logit contributions
Of-0.044
olding-0.042
 forgiveness-0.042
 apology-0.040
aid-0.039
Token?"
Token ID1701
Feature activation-9.05e-04

Pos logit contributions
Of+0.004
 forgiveness+0.004
olding+0.004
aid+0.004
 apology+0.004
Neg logit contributions
 lake-0.002
neath-0.002
 camp-0.002
 shore-0.002
 creek-0.002
Token
Token ID198
Feature activation+7.99e-03

Pos logit contributions
 lake+0.002
neath+0.002
 camp+0.002
 canyon+0.001
 creek+0.001
Neg logit contributions
Of-0.002
 forgiveness-0.002
aid-0.002
olding-0.002
 apology-0.002
Token
Token ID198
Feature activation-4.69e-03

Pos logit contributions
Of+0.017
 forgiveness+0.017
olding+0.017
aid+0.017
 apology+0.016
Neg logit contributions
 lake-0.013
neath-0.013
 camp-0.012
 creek-0.012
 river-0.012
Token"
Token ID1
Feature activation+1.14e-01

Pos logit contributions
neath+0.010
 lake+0.010
 camp+0.010
 canyon+0.010
 creek+0.010
Neg logit contributions
Of-0.007
aid-0.007
 forgiveness-0.007
olding-0.007
 apology-0.007
TokenSure
Token ID19457
Feature activation+3.91e-02

Pos logit contributions
 supervision+0.180
 actions+0.176
iness+0.169
asing+0.166
 forgiveness+0.165
Neg logit contributions
 lake-0.231
 creek-0.220
 clearing-0.206
neath-0.204
 river-0.202
Token,
Token ID11
Feature activation+3.35e-02

Pos logit contributions
Of+0.118
asing+0.117
olding+0.117
aid+0.116
olded+0.113
Neg logit contributions
 lake-0.029
 camp-0.028
 river-0.022
 shore-0.021
 clearing-0.021
Token Lily
Token ID20037
Feature activation+2.37e-02

Pos logit contributions
olding+0.063
aid+0.062
Of+0.061
asing+0.060
 apology+0.059
Neg logit contributions
 lake-0.061
 camp-0.061
 river-0.057
 creek-0.057
 shore-0.056
Token,
Token ID11
Feature activation+1.57e-02

Pos logit contributions
Of+0.058
 forgiveness+0.056
aid+0.055
olding+0.055
 apology+0.054
Neg logit contributions
 lake-0.032
 camp-0.030
neath-0.029
 river-0.029
 creek-0.028
Token but
Token ID475
Feature activation+2.07e-02

Pos logit contributions
Of+0.026
olding+0.025
aid+0.025
 forgiveness+0.025
 apology+0.024
Neg logit contributions
 lake-0.033
neath-0.032
 camp-0.032
 shore-0.031
 river-0.031
Token girl
Token ID2576
Feature activation+3.09e-02

Pos logit contributions
Of+0.048
olding+0.048
aid+0.048
asing+0.048
 forgiveness+0.046
Neg logit contributions
 lake-0.058
 creek-0.054
 camp-0.054
 river-0.054
 pool-0.054
Token asked
Token ID1965
Feature activation+3.31e-02

Pos logit contributions
Of+0.060
 forgiveness+0.060
 apology+0.059
olding+0.059
aid+0.059
Neg logit contributions
 lake-0.055
neath-0.053
 camp-0.051
 river-0.051
 shore-0.051
Token.
Token ID13
Feature activation+1.79e-02

Pos logit contributions
Of+0.075
aid+0.073
olding+0.071
 forgiveness+0.070
 apology+0.069
Neg logit contributions
neath-0.049
 lake-0.049
 camp-0.048
 shore-0.046
 creek-0.046
Token
Token ID198
Feature activation+1.00e-02

Pos logit contributions
 forgiveness+0.040
Of+0.040
aid+0.040
olding+0.040
 apology+0.039
Neg logit contributions
 lake-0.026
neath-0.026
 camp-0.025
 creek-0.024
 river-0.024
Token
Token ID198
Feature activation+1.25e-02

Pos logit contributions
Of+0.022
 forgiveness+0.021
olding+0.021
aid+0.021
 apology+0.021
Neg logit contributions
 lake-0.016
neath-0.016
 camp-0.015
 river-0.015
 creek-0.015
Token"
Token ID1
Feature activation+1.13e-01

Pos logit contributions
Of+0.022
 forgiveness+0.022
olding+0.022
aid+0.022
 apology+0.021
Neg logit contributions
 lake-0.025
neath-0.024
 camp-0.024
 creek-0.023
 river-0.023
TokenSure
Token ID19457
Feature activation+3.35e-02

Pos logit contributions
 actions+0.215
 choices+0.206
iness+0.206
 instincts+0.204
 forgiveness+0.197
Neg logit contributions
 lake-0.185
neath-0.175
 creek-0.166
Look-0.158
 clearing-0.154
Token,
Token ID11
Feature activation+3.97e-02

Pos logit contributions
Of+0.095
olding+0.093
aid+0.093
asing+0.091
 forgiveness+0.090
Neg logit contributions
 lake-0.033
 camp-0.030
 shore-0.026
 clearing-0.026
 river-0.026
Token you
Token ID345
Feature activation+7.50e-03

Pos logit contributions
olding+0.074
aid+0.074
Of+0.071
olds+0.071
olded+0.070
Neg logit contributions
 lake-0.073
 camp-0.071
 shore-0.067
 river-0.067
 creek-0.066
Token can
Token ID460
Feature activation+9.25e-03

Pos logit contributions
Of+0.013
 forgiveness+0.013
olding+0.013
aid+0.013
 apology+0.012
Neg logit contributions
 lake-0.015
neath-0.015
 camp-0.015
 creek-0.014
 river-0.014
Token play
Token ID711
Feature activation+6.43e-03

Pos logit contributions
Of+0.021
 forgiveness+0.021
olding+0.021
aid+0.021
 apology+0.020
Neg logit contributions
 lake-0.014
neath-0.013
 camp-0.013
 river-0.013
 creek-0.012
Token Her
Token ID2332
Feature activation+3.25e-02

Pos logit contributions
olding+0.046
aid+0.046
Of+0.045
 forgiveness+0.045
asing+0.044
Neg logit contributions
 lake-0.040
neath-0.038
 camp-0.037
 shore-0.037
 river-0.036
Token mom
Token ID1995
Feature activation+3.63e-02

Pos logit contributions
Of+0.059
aid+0.058
olding+0.058
asing+0.057
Deal+0.053
Neg logit contributions
 lake-0.069
 camp-0.066
 pool-0.064
 river-0.064
 shore-0.063
Tokenmy
Token ID1820
Feature activation+1.69e-02

Pos logit contributions
olding+0.080
 forgiveness+0.080
Of+0.079
 apology+0.078
aid+0.078
Neg logit contributions
 lake-0.053
neath-0.051
 camp-0.047
 clearing-0.047
 river-0.046
Token said
Token ID531
Feature activation+3.98e-02

Pos logit contributions
Of+0.025
olding+0.024
 forgiveness+0.024
aid+0.024
 apology+0.023
Neg logit contributions
 lake-0.039
neath-0.038
 camp-0.037
 river-0.036
 shore-0.036
Token,
Token ID11
Feature activation+4.39e-03

Pos logit contributions
aid+0.101
Of+0.101
asing+0.100
olding+0.099
 forgiveness+0.099
Neg logit contributions
 lake-0.039
neath-0.038
 camp-0.037
 shore-0.037
 pool-0.034
Token "
Token ID366
Feature activation+1.13e-01

Pos logit contributions
Of+0.009
 forgiveness+0.008
olding+0.008
aid+0.008
 apology+0.008
Neg logit contributions
 lake-0.008
neath-0.008
 camp-0.007
 creek-0.007
 river-0.007
TokenYes
Token ID5297
Feature activation+2.11e-02

Pos logit contributions
 actions+0.206
 apology+0.201
aser+0.198
 supervision+0.196
 forgiveness+0.195
Neg logit contributions
 lake-0.193
 clearing-0.186
neath-0.180
 field-0.170
 river-0.167
Token,
Token ID11
Feature activation+5.68e-02

Pos logit contributions
Of+0.057
aid+0.057
 forgiveness+0.055
olding+0.055
asing+0.054
Neg logit contributions
 lake-0.022
neath-0.021
 camp-0.020
 shore-0.019
 river-0.019
Token let
Token ID1309
Feature activation-1.07e-02

Pos logit contributions
aid+0.111
olding+0.109
olds+0.109
asing+0.107
asers+0.106
Neg logit contributions
 lake-0.096
 camp-0.092
 river-0.089
 pool-0.086
 shore-0.085
Token's
Token ID338
Feature activation-4.32e-02

Pos logit contributions
 lake+0.023
neath+0.022
 creek+0.022
 valleys+0.022
 canyon+0.022
Neg logit contributions
Of-0.017
 forgiveness-0.017
olding-0.016
 apology-0.016
aid-0.016
Token go
Token ID467
Feature activation-3.64e-02

Pos logit contributions
 valleys+0.082
neath+0.080
 clearing+0.078
 ditch+0.077
 canyon+0.075
Neg logit contributions
 forgiveness-0.075
Of-0.074
 promise-0.072
 forgive-0.070
 apology-0.070
Token -
Token ID532
Feature activation-4.47e-02

Pos logit contributions
neath+0.051
 lake+0.050
 canyon+0.050
 creek+0.049
 camp+0.048
Neg logit contributions
Of-0.045
olding-0.044
 forgiveness-0.044
aid-0.042
asing-0.041
Token rolling
Token ID10708
Feature activation-4.91e-02

Pos logit contributions
neath+0.072
 canyon+0.068
 ditch+0.065
 lake+0.065
 clearing+0.063
Neg logit contributions
Of-0.095
aid-0.087
 forgiveness-0.087
olding-0.084
 else-0.083
Token green
Token ID4077
Feature activation-6.50e-02

Pos logit contributions
neath+0.073
 canyon+0.071
 ocean+0.067
 clearing+0.067
 lake+0.067
Neg logit contributions
Of-0.097
aid-0.093
 else-0.091
 apology-0.088
ups-0.088
Token hills
Token ID18639
Feature activation-6.26e-02

Pos logit contributions
neath+0.096
 ditch+0.088
 tim+0.087
 canyon+0.087
 bay+0.083
Neg logit contributions
aid-0.125
ups-0.123
Of-0.120
asers-0.120
 facts-0.119
Token,
Token ID11
Feature activation-5.27e-02

Pos logit contributions
neath+0.124
 canyon+0.121
 creek+0.118
 lake+0.116
 camp+0.116
Neg logit contributions
olding-0.089
ups-0.083
Of-0.083
 else-0.082
aid-0.082
Token lush
Token ID37408
Feature activation-1.28e-01

Pos logit contributions
neath+0.086
 ditch+0.082
 camp+0.081
 canyon+0.080
 creek+0.079
Neg logit contributions
Of-0.101
aid-0.092
ups-0.089
 else-0.088
 respectful-0.088
Token grass
Token ID8701
Feature activation-7.92e-02

Pos logit contributions
neath+0.240
 tim+0.208
 cinema+0.207
 ditch+0.203
acket+0.199
Neg logit contributions
ups-0.175
 words-0.173
 else-0.172
 facts-0.166
 songs-0.164
Token,
Token ID11
Feature activation-5.97e-02

Pos logit contributions
neath+0.165
 canyon+0.158
 ditch+0.156
 camp+0.150
 creek+0.148
Neg logit contributions
olding-0.096
ups-0.095
as-0.092
asing-0.087
aser-0.087
Token trees
Token ID7150
Feature activation-6.54e-02

Pos logit contributions
neath+0.114
 ditch+0.110
 canyon+0.107
 camp+0.107
 creek+0.105
Neg logit contributions
Of-0.097
aid-0.086
 respectful-0.084
ups-0.083
 else-0.081
Token and
Token ID290
Feature activation-2.58e-02

Pos logit contributions
neath+0.142
 canyon+0.136
 creek+0.132
 ditch+0.131
 camp+0.127
Neg logit contributions
aid-0.080
olding-0.079
Of-0.077
 else-0.076
 respectful-0.071
Token flowers
Token ID12734
Feature activation-3.15e-02

Pos logit contributions
neath+0.039
 lake+0.036
 canyon+0.036
 camp+0.035
 creek+0.034
Neg logit contributions
Of-0.058
aid-0.055
 forgiveness-0.055
 else-0.052
 apology-0.052
Token colourful
Token ID44799
Feature activation-3.14e-02

Pos logit contributions
neath+0.031
 lake+0.029
 canyon+0.029
 creek+0.028
 camp+0.028
Neg logit contributions
 forgiveness-0.039
Of-0.038
aid-0.037
 apology-0.037
olding-0.036
Token flowers
Token ID12734
Feature activation-2.37e-02

Pos logit contributions
neath+0.055
 lake+0.048
 canyon+0.048
 ditch+0.046
 creek+0.046
Neg logit contributions
 forgiveness-0.062
aid-0.062
Of-0.061
 apology-0.058
olding-0.057
Token,
Token ID11
Feature activation-3.09e-02

Pos logit contributions
neath+0.045
 lake+0.044
 canyon+0.043
 camp+0.042
 creek+0.042
Neg logit contributions
Of-0.041
olding-0.040
 forgiveness-0.040
aid-0.040
asing-0.038
Token through
Token ID832
Feature activation-4.45e-02

Pos logit contributions
neath+0.051
 lake+0.047
 camp+0.046
 canyon+0.046
 ditch+0.046
Neg logit contributions
Of-0.066
 forgiveness-0.062
aid-0.060
olding-0.059
 respectful-0.058
Token the
Token ID262
Feature activation-3.38e-02

Pos logit contributions
neath+0.058
 tim+0.055
 canyon+0.055
 creek+0.054
 ditch+0.053
Neg logit contributions
 forgiveness-0.098
 else-0.093
Of-0.091
 actions-0.088
 honesty-0.088
Token lush
Token ID37408
Feature activation-1.27e-01

Pos logit contributions
neath+0.045
 tim+0.033
 lake+0.033
 camp+0.032
 ditch+0.032
Neg logit contributions
Of-0.081
 forgiveness-0.077
olding-0.076
 respectful-0.076
aid-0.076
Token forests
Token ID17039
Feature activation-4.82e-02

Pos logit contributions
neath+0.220
 tim+0.202
 cinema+0.177
acket+0.176
 ditch+0.168
Neg logit contributions
 actions-0.192
 valuable-0.188
 languages-0.185
 words-0.181
 importantly-0.180
Token deep
Token ID2769
Feature activation-8.78e-02

Pos logit contributions
neath+0.094
 lake+0.088
 canyon+0.087
 creek+0.086
 camp+0.084
Neg logit contributions
olding-0.078
 forgiveness-0.073
Of-0.072
aid-0.072
ups-0.071
Token in
Token ID287
Feature activation-6.11e-02

Pos logit contributions
neath+0.114
 tim+0.106
 cinema+0.104
latable+0.101
acket+0.098
Neg logit contributions
 actions-0.156
ups-0.156
able-0.152
 interests-0.148
 importantly-0.147
Token thought
Token ID1807
Feature activation+1.06e-02

Pos logit contributions
acket+0.105
 tim+0.103
latable+0.099
 canyon+0.092
 ditch+0.091
Neg logit contributions
 forgiveness-0.099
 else-0.092
Of-0.085
 apology-0.084
 forgive-0.083
Token.
Token ID13
Feature activation-2.77e-02

Pos logit contributions
Of+0.019
 forgiveness+0.019
aid+0.019
olding+0.019
 apology+0.018
Neg logit contributions
 lake-0.021
neath-0.020
 camp-0.020
 creek-0.019
 river-0.019
Token.
Token ID13
Feature activation-6.73e-02

Pos logit contributions
neath+0.071
 canyon+0.069
 ditch+0.067
 creek+0.067
 camp+0.067
Neg logit contributions
 forgiveness-0.042
Of-0.042
aid-0.041
olding-0.041
asing-0.037
Token It
Token ID632
Feature activation-2.68e-02

Pos logit contributions
 canyon+0.141
icer+0.134
neath+0.131
 ditch+0.131
 pasture+0.131
Neg logit contributions
Of-0.110
 forgive-0.088
olding-0.085
 forgiveness-0.085
aid-0.084
Token ran
Token ID4966
Feature activation-6.25e-02

Pos logit contributions
neath+0.056
 lake+0.054
 ditch+0.053
 clearing+0.052
 canyon+0.052
Neg logit contributions
Of-0.041
 else-0.040
aid-0.037
olding-0.037
 forgiveness-0.036
Token through
Token ID832
Feature activation-6.05e-02

Pos logit contributions
neath+0.137
icer+0.131
 pasture+0.124
 canyon+0.122
 lake+0.120
Neg logit contributions
Of-0.094
 lesson-0.087
olding-0.086
 apology-0.078
 else-0.076
Token a
Token ID257
Feature activation-4.23e-02

Pos logit contributions
acket+0.123
 tim+0.121
neath+0.117
 ditch+0.109
 cinema+0.108
Neg logit contributions
 else-0.100
 forgiveness-0.095
 forgive-0.082
Of-0.080
 understanding-0.079
Token lush
Token ID37408
Feature activation-1.26e-01

Pos logit contributions
neath+0.073
 tim+0.057
 Everest+0.054
 canyon+0.054
acket+0.053
Neg logit contributions
 else-0.089
Of-0.082
 respectful-0.078
 forgiveness-0.077
aid-0.076
Token green
Token ID4077
Feature activation-7.46e-02

Pos logit contributions
neath+0.252
 tim+0.222
acket+0.197
 cinema+0.196
 OUT+0.183
Neg logit contributions
 valuable-0.165
 important-0.158
 kind-0.157
 else-0.155
 respectful-0.150
Token me
Token ID502
Feature activation-1.39e-02

Pos logit contributions
neath+0.105
 tim+0.090
 Everest+0.077
 gonna+0.071
 takeoff+0.070
Neg logit contributions
 friend-0.142
 forgiveness-0.142
 actions-0.138
aid-0.138
 interests-0.137
Tokenadow
Token ID4584
Feature activation-4.02e-02

Pos logit contributions
neath+0.027
 lake+0.026
 creek+0.024
 camp+0.024
 river+0.024
Neg logit contributions
olding-0.026
aid-0.026
Of-0.025
 forgiveness-0.025
asing-0.024
Token,
Token ID11
Feature activation-5.14e-02

Pos logit contributions
neath+0.097
 lake+0.089
 camp+0.088
 canyon+0.088
 ditch+0.087
Neg logit contributions
olding-0.053
Of-0.048
 forgiveness-0.047
 respectful-0.046
aid-0.044
Token with
Token ID351
Feature activation-2.21e-03

Pos logit contributions
neath+0.098
 camp+0.090
 canyon+0.089
 ditch+0.089
icer+0.088
Neg logit contributions
Of-0.096
olding-0.086
 respectful-0.085
 forgiveness-0.080
 else-0.078
Token was
Token ID373
Feature activation-2.55e-02

Pos logit contributions
neath+0.029
 lake+0.029
 canyon+0.028
 camp+0.028
 creek+0.028
Neg logit contributions
Of-0.027
olding-0.025
aid-0.025
 forgiveness-0.025
 apology-0.024
Token running
Token ID2491
Feature activation-3.62e-02

Pos logit contributions
 lake+0.050
neath+0.050
 creek+0.049
 river+0.047
 canyon+0.047
Neg logit contributions
Of-0.047
olding-0.045
aid-0.044
 forgiveness-0.043
 respectful-0.043
Token around
Token ID1088
Feature activation-4.37e-02

Pos logit contributions
neath+0.071
 lake+0.068
 canyon+0.066
 ocean+0.064
 river+0.064
Neg logit contributions
Of-0.065
olding-0.060
aid-0.057
 else-0.057
olded-0.054
Token on
Token ID319
Feature activation-5.68e-02

Pos logit contributions
neath+0.078
 lake+0.074
 canyon+0.074
 ditch+0.074
 river+0.072
Neg logit contributions
Of-0.079
 forgiveness-0.078
olding-0.074
 classmates-0.071
 else-0.070
Token the
Token ID262
Feature activation-4.71e-02

Pos logit contributions
neath+0.091
 tim+0.086
acket+0.086
 canyon+0.086
 ditch+0.083
Neg logit contributions
 forgiveness-0.109
Of-0.100
 else-0.098
 classmates-0.098
 honesty-0.096
Token lush
Token ID37408
Feature activation-1.20e-01

Pos logit contributions
neath+0.075
 canyon+0.061
 clearing+0.058
 camp+0.058
 valleys+0.056
Neg logit contributions
Of-0.103
 apology-0.097
olding-0.097
aid-0.094
 respectful-0.093
Token grass
Token ID8701
Feature activation-3.49e-02

Pos logit contributions
neath+0.237
 tim+0.192
 takeoff+0.175
icer+0.175
acket+0.172
Neg logit contributions
 respectful-0.165
ups-0.164
 else-0.162
 actions-0.161
 feelings-0.161
Token,
Token ID11
Feature activation-4.04e-02

Pos logit contributions
neath+0.062
 lake+0.059
 canyon+0.058
 camp+0.057
 ditch+0.056
Neg logit contributions
olding-0.065
Of-0.065
 forgiveness-0.062
aid-0.060
asing-0.060
Token enjoying
Token ID13226
Feature activation-4.95e-02

Pos logit contributions
neath+0.070
 camp+0.067
 lake+0.067
 canyon+0.066
 ditch+0.065
Neg logit contributions
Of-0.083
olding-0.073
 respectful-0.072
aid-0.069
 forgiveness-0.068
Token the
Token ID262
Feature activation-2.65e-02

Pos logit contributions
neath+0.086
 ditch+0.080
 canyon+0.075
acket+0.073
 river+0.073
Neg logit contributions
Of-0.095
 forgiveness-0.095
 caring-0.091
 respectful-0.090
 understanding-0.087
Token sunshine
Token ID34488
Feature activation-1.53e-02

Pos logit contributions
neath+0.044
 canyon+0.038
 ditch+0.037
 lake+0.036
 camp+0.036
Neg logit contributions
Of-0.058
 forgiveness-0.054
 apology-0.054
aid-0.054
olding-0.053
Token sight
Token ID6504
Feature activation-5.61e-03

Pos logit contributions
neath+0.109
 valleys+0.089
 ditch+0.080
 takeoff+0.078
 tim+0.078
Neg logit contributions
 forgiveness-0.118
 apology-0.117
olding-0.114
Of-0.112
 respectful-0.110
Token.
Token ID13
Feature activation-3.35e-02

Pos logit contributions
neath+0.011
 lake+0.011
 camp+0.010
 creek+0.010
 river+0.010
Neg logit contributions
Of-0.010
olding-0.010
 forgiveness-0.010
aid-0.010
 apology-0.009
Token There
Token ID1318
Feature activation-1.50e-02

Pos logit contributions
neath+0.061
 lake+0.060
 camp+0.059
 canyon+0.059
 shore+0.057
Neg logit contributions
Of-0.066
olding-0.060
 apology-0.058
 forgiveness-0.057
olded-0.057
Token was
Token ID373
Feature activation-3.73e-02

Pos logit contributions
 lake+0.028
 canyon+0.028
neath+0.028
 creek+0.027
 shore+0.027
Neg logit contributions
 forgiveness-0.027
Of-0.026
olding-0.025
aid-0.025
 apology-0.024
Token a
Token ID257
Feature activation-3.12e-02

Pos logit contributions
neath+0.057
 lake+0.053
 canyon+0.053
 creek+0.051
 camp+0.050
Neg logit contributions
 forgiveness-0.079
Of-0.079
aid-0.074
olding-0.073
 else-0.072
Token lush
Token ID37408
Feature activation-1.18e-01

Pos logit contributions
neath+0.054
 valleys+0.046
 lake+0.045
 camp+0.044
 canyon+0.044
Neg logit contributions
Of-0.064
 forgiveness-0.062
olding-0.060
 apology-0.060
aid-0.059
Token green
Token ID4077
Feature activation-6.75e-02

Pos logit contributions
neath+0.232
 tim+0.214
 cinema+0.190
ASH+0.182
 takeoff+0.178
Neg logit contributions
 valuable-0.153
 respectful-0.152
 understanding-0.150
 important-0.149
 actions-0.148
Token patch
Token ID8529
Feature activation-4.53e-02

Pos logit contributions
neath+0.101
 tim+0.079
 valleys+0.078
 ditch+0.077
 takeoff+0.074
Neg logit contributions
Of-0.135
 forgiveness-0.133
 apology-0.132
aid-0.129
 actions-0.129
Token of
Token ID286
Feature activation-4.07e-02

Pos logit contributions
neath+0.117
 camp+0.115
 lake+0.114
 canyon+0.114
 valleys+0.113
Neg logit contributions
olding-0.045
Of-0.042
 forgiveness-0.041
aid-0.041
aser-0.038
Token grass
Token ID8701
Feature activation-7.02e-02

Pos logit contributions
neath+0.063
 tim+0.056
 canyon+0.054
 ditch+0.053
ground+0.051
Neg logit contributions
Of-0.087
 forgiveness-0.086
aid-0.084
 apology-0.082
 else-0.078
Token and
Token ID290
Feature activation-3.80e-02

Pos logit contributions
 camp+0.121
 tim+0.120
 canyon+0.119
neath+0.119
 ditch+0.116
Neg logit contributions
olding-0.108
aser-0.106
 else-0.103
Of-0.103
 forgiveness-0.103
Token good
Token ID922
Feature activation+4.27e-03

Pos logit contributions
neath+0.058
 lake+0.055
 canyon+0.052
 shore+0.052
 clearing+0.052
Neg logit contributions
Of-0.038
olding-0.037
aid-0.036
 else-0.034
 respectful-0.034
Token to
Token ID284
Feature activation-2.78e-02

Pos logit contributions
Of+0.008
 forgiveness+0.008
olding+0.008
aid+0.008
 apology+0.007
Neg logit contributions
 lake-0.008
neath-0.008
 camp-0.008
 river-0.008
 creek-0.008
Token roll
Token ID4836
Feature activation-1.29e-02

Pos logit contributions
neath+0.057
 ditch+0.054
 lake+0.054
 valleys+0.053
 fields+0.052
Neg logit contributions
 forgiveness-0.045
 apology-0.045
Of-0.044
 forgive-0.041
olding-0.041
Token in
Token ID287
Feature activation-2.77e-02

Pos logit contributions
 lake+0.021
neath+0.021
 canyon+0.020
 camp+0.020
 shore+0.019
Neg logit contributions
Of-0.027
aid-0.026
olding-0.026
 forgiveness-0.026
 apology-0.025
Token the
Token ID262
Feature activation-3.80e-02

Pos logit contributions
neath+0.040
 canyon+0.038
 lake+0.037
 creek+0.037
 river+0.036
Neg logit contributions
 forgiveness-0.060
Of-0.055
 apology-0.055
 else-0.055
olding-0.053
Token lush
Token ID37408
Feature activation-1.17e-01

Pos logit contributions
neath+0.052
ground+0.039
 shore+0.037
 tim+0.037
 takeoff+0.037
Neg logit contributions
Of-0.089
aid-0.083
olding-0.082
 forgiveness-0.081
Deal-0.081
Token green
Token ID4077
Feature activation-5.75e-02

Pos logit contributions
neath+0.210
 tim+0.175
acket+0.172
 takeoff+0.155
�+0.155
Neg logit contributions
 actions-0.187
 valuable-0.178
 languages-0.175
 feelings-0.175
ups-0.172
Token grass
Token ID8701
Feature activation-1.96e-02

Pos logit contributions
neath+0.085
 tim+0.064
 Everest+0.062
 takeoff+0.061
acket+0.059
Neg logit contributions
 actions-0.117
ups-0.113
 feelings-0.109
 forgiveness-0.109
 interests-0.109
Token and
Token ID290
Feature activation-2.04e-02

Pos logit contributions
neath+0.038
 lake+0.038
 camp+0.036
 canyon+0.036
 creek+0.035
Neg logit contributions
olding-0.034
Of-0.034
 forgiveness-0.033
aid-0.032
 apology-0.031
Token jump
Token ID4391
Feature activation-4.65e-02

Pos logit contributions
neath+0.031
 lake+0.029
 canyon+0.028
 camp+0.028
 ditch+0.027
Neg logit contributions
Of-0.046
 forgiveness-0.043
aid-0.042
olding-0.041
 respectful-0.041
Token high
Token ID1029
Feature activation-5.00e-02

Pos logit contributions
 lake+0.093
neath+0.092
 canyon+0.089
 ocean+0.088
 clearing+0.086
Neg logit contributions
olding-0.072
Of-0.070
aid-0.069
 else-0.067
 apology-0.064
Token farmer
Token ID18739
Feature activation+1.58e-02

Pos logit contributions
neath+0.007
 lake+0.007
 camp+0.007
 canyon+0.007
 valleys+0.007
Neg logit contributions
Of-0.005
 forgiveness-0.004
olding-0.004
aid-0.004
 apology-0.004
Token who
Token ID508
Feature activation-1.32e-02

Pos logit contributions
Of+0.031
 forgiveness+0.030
aid+0.030
olding+0.030
 apology+0.029
Neg logit contributions
 lake-0.029
neath-0.028
 camp-0.027
 river-0.027
 shore-0.027
Token lived
Token ID5615
Feature activation-2.40e-02

Pos logit contributions
neath+0.027
 lake+0.026
 canyon+0.026
 shore+0.026
 creek+0.025
Neg logit contributions
Of-0.023
aid-0.022
 forgiveness-0.022
olding-0.021
Deal-0.021
Token on
Token ID319
Feature activation-7.82e-02

Pos logit contributions
 lake+0.043
neath+0.042
 creek+0.040
 canyon+0.040
 river+0.040
Neg logit contributions
Of-0.047
aid-0.047
olding-0.046
 forgiveness-0.045
Deal-0.042
Token a
Token ID257
Feature activation-4.03e-02

Pos logit contributions
neath+0.154
 canyon+0.153
 slope+0.146
 ditch+0.145
 waterfall+0.144
Neg logit contributions
 forgiveness-0.107
 feelings-0.094
 caring-0.092
 honesty-0.090
grand-0.090
Token lush
Token ID37408
Feature activation-1.17e-01

Pos logit contributions
neath+0.062
 valleys+0.051
 canyon+0.049
 clearing+0.045
 lake+0.045
Neg logit contributions
Of-0.091
 else-0.088
 respectful-0.086
olding-0.085
 forgiveness-0.084
Token farm
Token ID5318
Feature activation-1.48e-03

Pos logit contributions
neath+0.223
 valleys+0.178
 tim+0.174
 takeoff+0.162
 GO+0.159
Neg logit contributions
 important-0.184
 valuable-0.176
 caring-0.171
 respectful-0.171
 kind-0.161
Token with
Token ID351
Feature activation+5.51e-03

Pos logit contributions
neath+0.003
 lake+0.003
 camp+0.003
 creek+0.003
 canyon+0.003
Neg logit contributions
Of-0.003
olding-0.003
 forgiveness-0.003
aid-0.003
 apology-0.002
Token many
Token ID867
Feature activation+1.14e-02

Pos logit contributions
Of+0.012
 forgiveness+0.012
aid+0.012
olding+0.012
 apology+0.012
Neg logit contributions
 lake-0.008
neath-0.008
 camp-0.008
 creek-0.008
 river-0.008
Token animals
Token ID4695
Feature activation+1.79e-02

Pos logit contributions
Of+0.021
 forgiveness+0.020
olding+0.020
aid+0.020
 apology+0.020
Neg logit contributions
 lake-0.022
neath-0.021
 camp-0.021
 river-0.021
 creek-0.021
Token and
Token ID290
Feature activation+5.43e-03

Pos logit contributions
Of+0.037
 forgiveness+0.035
aid+0.035
olding+0.035
 apology+0.035
Neg logit contributions
 lake-0.032
neath-0.030
 camp-0.030
 river-0.029
 shore-0.029
Token grass
Token ID8701
Feature activation-2.48e-02

Pos logit contributions
 canyon+0.056
 camp+0.054
neath+0.053
 lake+0.053
 ditch+0.052
Neg logit contributions
 forgiveness-0.078
Of-0.077
aid-0.073
 apology-0.073
olding-0.073
Token was
Token ID373
Feature activation-2.77e-02

Pos logit contributions
neath+0.048
 lake+0.048
 camp+0.046
 canyon+0.046
 creek+0.045
Neg logit contributions
Of-0.042
olding-0.042
 forgiveness-0.041
aid-0.040
 apology-0.038
Token a
Token ID257
Feature activation-1.18e-02

Pos logit contributions
neath+0.044
 lake+0.043
 creek+0.042
 canyon+0.041
 camp+0.041
Neg logit contributions
 forgiveness-0.057
Of-0.056
aid-0.055
olding-0.055
 respectful-0.053
Token deep
Token ID2769
Feature activation-7.64e-02

Pos logit contributions
neath+0.021
 lake+0.020
 camp+0.019
 canyon+0.019
 creek+0.019
Neg logit contributions
Of-0.024
 forgiveness-0.023
olding-0.022
aid-0.022
 apology-0.022
Token,
Token ID11
Feature activation-3.48e-02

Pos logit contributions
neath+0.118
 takeoff+0.100
 valleys+0.099
ground+0.098
 tim+0.098
Neg logit contributions
 forgiveness-0.126
olding-0.122
Of-0.121
 promise-0.121
 apology-0.121
Token lush
Token ID37408
Feature activation-1.15e-01

Pos logit contributions
neath+0.060
 camp+0.055
 lake+0.055
 canyon+0.055
 creek+0.053
Neg logit contributions
Of-0.071
 forgiveness-0.067
olding-0.064
aid-0.064
 apology-0.063
Token green
Token ID4077
Feature activation-6.85e-02

Pos logit contributions
neath+0.224
 tim+0.195
 cinema+0.188
 takeoff+0.177
 Everest+0.174
Neg logit contributions
 respectful-0.156
 valuable-0.152
 precious-0.149
 importantly-0.148
 understanding-0.146
Token.
Token ID13
Feature activation+2.71e-02

Pos logit contributions
neath+0.103
 tim+0.080
 ditch+0.080
 canyon+0.080
 takeoff+0.078
Neg logit contributions
 forgiveness-0.136
Of-0.135
aid-0.131
 apology-0.131
 actions-0.130
Token The
Token ID383
Feature activation+7.00e-03

Pos logit contributions
aid+0.066
 forgiveness+0.065
olding+0.065
Of+0.064
 apology+0.063
Neg logit contributions
neath-0.035
 lake-0.034
 clearing-0.032
 camp-0.031
 river-0.031
Token little
Token ID1310
Feature activation-5.05e-03

Pos logit contributions
Of+0.015
 forgiveness+0.015
olding+0.015
aid+0.015
 apology+0.014
Neg logit contributions
 lake-0.011
neath-0.011
 camp-0.011
 creek-0.010
 river-0.010
Token girl
Token ID2576
Feature activation+9.12e-03

Pos logit contributions
neath+0.013
 lake+0.012
 camp+0.012
 canyon+0.012
 shore+0.012
Neg logit contributions
Of-0.007
 forgiveness-0.006
aid-0.006
olding-0.006
 apology-0.006
Token to
Token ID284
Feature activation-5.39e-02

Pos logit contributions
 lake+0.041
neath+0.041
 creek+0.039
 canyon+0.039
 camp+0.039
Neg logit contributions
Of-0.025
olding-0.025
 forgiveness-0.024
aid-0.023
 apology-0.023
Token set
Token ID900
Feature activation-3.46e-02

Pos logit contributions
 ditch+0.081
 canyon+0.075
neath+0.075
icer+0.074
 pool+0.071
Neg logit contributions
Of-0.108
 forgiveness-0.104
 apology-0.102
 forgive-0.100
aid-0.096
Token.
Token ID13
Feature activation+1.81e-04

Pos logit contributions
neath+0.072
 lake+0.071
 canyon+0.070
 creek+0.068
 camp+0.067
Neg logit contributions
 forgiveness-0.052
olding-0.051
Of-0.051
 apology-0.049
aid-0.048
Token<|endoftext|>
Token ID50256
Feature activation-7.04e-02

Pos logit contributions
Of+0.000
olding+0.000
 forgiveness+0.000
aid+0.000
 apology+0.000
Neg logit contributions
 lake-0.000
neath-0.000
 camp-0.000
 canyon-0.000
 creek-0.000
TokenLittle
Token ID22253
Feature activation-3.38e-02

Pos logit contributions
 canyon+0.130
 shore+0.124
 river+0.123
 ditch+0.122
 lake+0.118
Neg logit contributions
Of-0.117
 forgiveness-0.105
aid-0.105
 else-0.102
olding-0.101
Token Johnny
Token ID15470
Feature activation-1.14e-01

Pos logit contributions
 canyon+0.054
 ditch+0.053
 valleys+0.052
 camp+0.052
 lake+0.052
Neg logit contributions
Of-0.072
 forgiveness-0.066
olding-0.066
 apology-0.063
aid-0.062
Token was
Token ID373
Feature activation-6.65e-02

Pos logit contributions
 canyon+0.263
 ditch+0.257
 rav+0.246
 clearing+0.242
 creek+0.239
Neg logit contributions
 cared-0.126
Of-0.122
olded-0.120
 cares-0.117
grand-0.115
Token so
Token ID523
Feature activation-2.41e-02

Pos logit contributions
 creek+0.126
 lake+0.125
 canyon+0.120
neath+0.119
 river+0.119
Neg logit contributions
olding-0.109
Of-0.107
olded-0.102
 understanding-0.101
 respectful-0.100
Token excited
Token ID6568
Feature activation-2.19e-02

Pos logit contributions
 lake+0.056
neath+0.055
 canyon+0.053
 clearing+0.053
 creek+0.052
Neg logit contributions
Of-0.035
olding-0.034
 else-0.030
 forgiveness-0.030
olded-0.030
Token.
Token ID13
Feature activation-4.45e-02

Pos logit contributions
neath+0.046
 lake+0.045
 canyon+0.043
 valleys+0.043
 creek+0.043
Neg logit contributions
olding-0.035
Of-0.035
 forgiveness-0.035
 apology-0.032
aid-0.032
Token He
Token ID679
Feature activation-5.05e-02

Pos logit contributions
 canyon+0.087
 lake+0.085
 camp+0.085
neath+0.084
 clearing+0.081
Neg logit contributions
Of-0.076
olding-0.071
 apology-0.067
 forgiveness-0.065
olded-0.064
Token lived
Token ID5615
Feature activation-1.64e-02

Pos logit contributions
neath+0.032
 lake+0.031
 shore+0.030
 canyon+0.030
 creek+0.030
Neg logit contributions
Of-0.022
aid-0.021
 forgiveness-0.021
olding-0.020
 else-0.020
Token in
Token ID287
Feature activation-4.79e-02

Pos logit contributions
neath+0.030
 lake+0.029
 river+0.028
 creek+0.027
 canyon+0.027
Neg logit contributions
Of-0.032
aid-0.031
olding-0.031
 forgiveness-0.031
asing-0.029
Token a
Token ID257
Feature activation-5.22e-02

Pos logit contributions
neath+0.085
 creek+0.083
 canyon+0.083
 river+0.082
 shore+0.081
Neg logit contributions
 forgiveness-0.086
aid-0.079
Of-0.077
 honesty-0.073
 apology-0.073
Token big
Token ID1263
Feature activation-8.54e-02

Pos logit contributions
neath+0.106
acket+0.089
 valleys+0.088
 canyon+0.086
 takeoff+0.086
Neg logit contributions
 else-0.088
Of-0.083
 respectful-0.080
 caring-0.075
 forgiveness-0.075
Token,
Token ID11
Feature activation-6.29e-02

Pos logit contributions
neath+0.153
acket+0.130
ground+0.128
 takeoff+0.123
 valleys+0.123
Neg logit contributions
 promise-0.138
 forgiveness-0.135
 actions-0.134
 respectful-0.132
 friend-0.132
Token wide
Token ID3094
Feature activation-1.14e-01

Pos logit contributions
neath+0.133
ground+0.119
 valleys+0.119
 shore+0.118
 canyon+0.118
Neg logit contributions
Of-0.101
 respectful-0.092
olding-0.091
 valuable-0.089
 precious-0.087
Token forest
Token ID8222
Feature activation-1.95e-02

Pos logit contributions
neath+0.205
 takeoff+0.160
 valleys+0.150
pped+0.140
ground+0.140
Neg logit contributions
 friend-0.196
 owner-0.186
 actions-0.179
 respectful-0.178
 promise-0.177
Token and
Token ID290
Feature activation-4.58e-03

Pos logit contributions
neath+0.042
 lake+0.039
 camp+0.038
 shore+0.038
 creek+0.037
Neg logit contributions
olding-0.032
Of-0.030
 forgiveness-0.029
aid-0.029
 respectful-0.028
Token enjoyed
Token ID8359
Feature activation-4.35e-02

Pos logit contributions
neath+0.009
 lake+0.008
 camp+0.008
 canyon+0.008
 creek+0.008
Neg logit contributions
Of-0.009
 forgiveness-0.008
aid-0.008
olding-0.008
 apology-0.008
Token exploring
Token ID13504
Feature activation-3.17e-02

Pos logit contributions
neath+0.089
 ditch+0.073
acket+0.071
 canyon+0.071
 creek+0.068
Neg logit contributions
 forgiveness-0.080
 caring-0.077
 respectful-0.075
Of-0.073
 honesty-0.071
Token every
Token ID790
Feature activation-2.16e-02

Pos logit contributions
neath+0.066
 canyon+0.061
 lake+0.061
 ditch+0.060
 river+0.059
Neg logit contributions
 forgiveness-0.050
Of-0.049
 else-0.045
 apology-0.044
Deal-0.043
TokenFinally
Token ID11158
Feature activation-2.58e-02

Pos logit contributions
 lake+0.008
neath+0.008
 camp+0.008
 shore+0.008
 canyon+0.008
Neg logit contributions
Of-0.007
aid-0.007
olding-0.007
 forgiveness-0.007
 apology-0.006
Token,
Token ID11
Feature activation-1.57e-02

Pos logit contributions
 lake+0.042
neath+0.040
 canyon+0.040
 river+0.039
 valleys+0.039
Neg logit contributions
 forgiveness-0.054
Of-0.052
 apology-0.049
olding-0.048
aid-0.048
Token they
Token ID484
Feature activation-3.32e-02

Pos logit contributions
 lake+0.024
neath+0.023
 camp+0.022
 canyon+0.022
 river+0.022
Neg logit contributions
Of-0.035
 forgiveness-0.034
 apology-0.032
aid-0.032
olding-0.032
Token reached
Token ID4251
Feature activation-6.09e-02

Pos logit contributions
 lake+0.056
 canyon+0.056
neath+0.055
 ditch+0.054
 creek+0.053
Neg logit contributions
Of-0.063
aid-0.058
 else-0.056
 forgiveness-0.054
olded-0.054
Token a
Token ID257
Feature activation-4.19e-02

Pos logit contributions
 canyon+0.106
 slope+0.102
neath+0.101
 lake+0.100
 river+0.100
Neg logit contributions
 forgiveness-0.100
 honesty-0.092
Of-0.089
 kindness-0.088
 generosity-0.088
Token vast
Token ID5909
Feature activation-1.13e-01

Pos logit contributions
neath+0.069
 valleys+0.059
 canyon+0.056
 tim+0.055
 fields+0.054
Neg logit contributions
Of-0.089
 else-0.085
 forgiveness-0.084
olding-0.080
 apology-0.079
Token stretch
Token ID7539
Feature activation-3.33e-02

Pos logit contributions
neath+0.183
 tim+0.152
 takeoff+0.135
unks+0.133
ics+0.127
Neg logit contributions
 actions-0.192
 respectful-0.191
 promise-0.184
 apology-0.178
 valuable-0.178
Token of
Token ID286
Feature activation-3.16e-02

Pos logit contributions
neath+0.071
 lake+0.069
 canyon+0.068
 camp+0.068
 valleys+0.067
Neg logit contributions
olding-0.049
Of-0.047
 forgiveness-0.046
aid-0.044
 apology-0.044
Token beach
Token ID10481
Feature activation-4.18e-02

Pos logit contributions
neath+0.045
 tim+0.042
 canyon+0.040
 lake+0.039
 valleys+0.038
Neg logit contributions
 forgiveness-0.071
 apology-0.068
Of-0.068
aid-0.065
 honesty-0.062
Token.
Token ID13
Feature activation-4.61e-02

Pos logit contributions
neath+0.086
 canyon+0.076
 lake+0.075
 ditch+0.074
 camp+0.074
Neg logit contributions
olding-0.067
 forgiveness-0.065
Of-0.063
ups-0.062
 respectful-0.062
Token As
Token ID1081
Feature activation-1.95e-02

Pos logit contributions
 canyon+0.074
 lake+0.073
 camp+0.072
 valleys+0.071
neath+0.070
Neg logit contributions
Of-0.097
 apology-0.088
 forgiveness-0.088
olding-0.087
olded-0.085
Token,
Token ID11
Feature activation-2.00e-02

Pos logit contributions
 lake+0.036
neath+0.035
 canyon+0.034
 river+0.034
 shore+0.034
Neg logit contributions
Of-0.046
olding-0.045
 forgiveness-0.045
 else-0.043
 apology-0.043
Token there
Token ID612
Feature activation-4.14e-02

Pos logit contributions
 lake+0.035
 shore+0.034
 canyon+0.034
 river+0.033
 creek+0.032
Neg logit contributions
Of-0.040
 forgiveness-0.038
olding-0.037
 else-0.037
aid-0.037
Token was
Token ID373
Feature activation-4.86e-02

Pos logit contributions
 canyon+0.101
 ditch+0.097
 shore+0.095
 valleys+0.095
 river+0.095
Neg logit contributions
 forgiveness-0.050
 else-0.049
Of-0.048
olding-0.042
 apology-0.042
Token a
Token ID257
Feature activation-2.14e-02

Pos logit contributions
 canyon+0.091
 river+0.086
 lake+0.084
neath+0.083
 valleys+0.083
Neg logit contributions
Of-0.088
 forgiveness-0.087
aid-0.083
 caring-0.077
 else-0.076
Token big
Token ID1263
Feature activation-7.11e-02

Pos logit contributions
neath+0.044
 valleys+0.040
 lake+0.040
 canyon+0.039
 shore+0.038
Neg logit contributions
Of-0.039
 forgiveness-0.036
olding-0.035
aid-0.035
 apology-0.034
Token wide
Token ID3094
Feature activation-1.13e-01

Pos logit contributions
neath+0.142
 valleys+0.134
ground+0.118
 canyon+0.117
 ditch+0.116
Neg logit contributions
 forgiveness-0.117
Of-0.113
 apology-0.108
 promise-0.107
 actions-0.106
Token house
Token ID2156
Feature activation-5.41e-03

Pos logit contributions
neath+0.195
 valleys+0.179
unks+0.147
 canyon+0.142
 takeoff+0.142
Neg logit contributions
Of-0.194
 promise-0.184
 friend-0.183
 gift-0.182
olding-0.181
Token.
Token ID13
Feature activation-6.30e-02

Pos logit contributions
neath+0.010
 lake+0.010
 camp+0.010
 shore+0.010
 creek+0.010
Neg logit contributions
Of-0.010
olding-0.010
 forgiveness-0.010
aid-0.010
 apology-0.009
Token Inside
Token ID14384
Feature activation-5.09e-02

Pos logit contributions
 shore+0.121
 canyon+0.115
 lake+0.115
neath+0.113
 waterfall+0.111
Neg logit contributions
Of-0.108
olding-0.100
 forgiveness-0.100
aid-0.093
 apology-0.092
Token the
Token ID262
Feature activation-6.65e-03

Pos logit contributions
 ditch+0.087
 canyon+0.086
 rav+0.078
 shore+0.078
 valleys+0.077
Neg logit contributions
 else-0.092
 forgiveness-0.089
Of-0.085
oral-0.083
Deal-0.083
Token house
Token ID2156
Feature activation+8.66e-03

Pos logit contributions
neath+0.009
 lake+0.009
 canyon+0.008
 camp+0.008
 creek+0.008
Neg logit contributions
Of-0.016
olding-0.015
 forgiveness-0.015
 apology-0.015
aid-0.015
Token middle
Token ID3504
Feature activation+2.66e-03

Pos logit contributions
neath+0.064
 tim+0.054
 canyon+0.049
 creek+0.045
 boulder+0.045
Neg logit contributions
Of-0.123
 forgiveness-0.118
 else-0.117
 respectful-0.115
ups-0.112
Token of
Token ID286
Feature activation-6.67e-02

Pos logit contributions
Of+0.003
 forgiveness+0.003
olding+0.003
aid+0.003
 apology+0.003
Neg logit contributions
 lake-0.007
neath-0.007
 camp-0.007
 creek-0.006
 shore-0.006
Token a
Token ID257
Feature activation-5.20e-02

Pos logit contributions
acket+0.130
 tim+0.128
neath+0.124
latable+0.121
amp+0.120
Neg logit contributions
 forgiveness-0.094
 caring-0.083
 understanding-0.080
Of-0.079
 honesty-0.079
Token dark
Token ID3223
Feature activation-7.06e-02

Pos logit contributions
neath+0.080
 valleys+0.055
 canyon+0.055
 tim+0.055
latable+0.052
Neg logit contributions
 else-0.118
Of-0.114
 respectful-0.108
 forgiveness-0.103
 caring-0.103
Token,
Token ID11
Feature activation-5.72e-02

Pos logit contributions
neath+0.117
 Everest+0.085
 tim+0.084
latable+0.081
 GO+0.078
Neg logit contributions
grand-0.122
 actions-0.118
 respectful-0.117
olding-0.116
 forgiveness-0.116
Token wide
Token ID3094
Feature activation-1.13e-01

Pos logit contributions
neath+0.120
growth+0.102
 camp+0.101
 canyon+0.101
 takeoff+0.099
Neg logit contributions
Of-0.090
 valuable-0.088
 respectful-0.087
olding-0.084
 precious-0.082
Token sea
Token ID5417
Feature activation-6.92e-03

Pos logit contributions
neath+0.183
 tim+0.124
IT+0.120
pped+0.119
unks+0.119
Neg logit contributions
keeper-0.193
 gift-0.190
 promise-0.185
 friend-0.184
 owner-0.184
Token.
Token ID13
Feature activation-5.58e-02

Pos logit contributions
neath+0.014
 lake+0.014
 camp+0.013
 canyon+0.013
 creek+0.013
Neg logit contributions
Of-0.011
olding-0.011
 forgiveness-0.011
aid-0.011
 apology-0.010
Token All
Token ID1439
Feature activation-5.82e-03

Pos logit contributions
 canyon+0.105
neath+0.103
 shore+0.103
 waterfall+0.099
 lake+0.097
Neg logit contributions
Of-0.102
 forgiveness-0.082
olding-0.080
Deal-0.077
 apology-0.077
Token the
Token ID262
Feature activation-1.62e-02

Pos logit contributions
 lake+0.007
neath+0.007
 camp+0.007
 shore+0.007
 canyon+0.007
Neg logit contributions
Of-0.014
 forgiveness-0.014
aid-0.014
olding-0.014
 apology-0.013
Token animals
Token ID4695
Feature activation-5.10e-03

Pos logit contributions
neath+0.036
 lake+0.031
 creek+0.031
 canyon+0.031
 clearing+0.031
Neg logit contributions
Of-0.026
 forgiveness-0.025
 apology-0.024
aid-0.024
olding-0.024
Token,
Token ID11
Feature activation-1.02e-02

Pos logit contributions
 lake+0.020
neath+0.019
 river+0.019
 canyon+0.019
 creek+0.018
Neg logit contributions
Of-0.019
 forgiveness-0.018
aid-0.018
olding-0.017
 apology-0.017
Token she
Token ID673
Feature activation-1.35e-02

Pos logit contributions
 lake+0.022
neath+0.021
 canyon+0.021
 camp+0.021
 river+0.021
Neg logit contributions
Of-0.016
 forgiveness-0.016
aid-0.015
olding-0.014
 apology-0.014
Token came
Token ID1625
Feature activation-2.39e-02

Pos logit contributions
 lake+0.025
neath+0.025
 creek+0.024
 camp+0.024
 canyon+0.023
Neg logit contributions
Of-0.025
aid-0.024
olding-0.024
 forgiveness-0.023
olded-0.023
Token to
Token ID284
Feature activation-3.34e-02

Pos logit contributions
 lake+0.046
neath+0.044
 creek+0.043
 canyon+0.042
 camp+0.042
Neg logit contributions
Of-0.044
olding-0.043
 forgiveness-0.041
aid-0.039
asing-0.039
Token a
Token ID257
Feature activation-4.87e-02

Pos logit contributions
neath+0.050
 canyon+0.048
 lake+0.048
 ditch+0.046
icer+0.044
Neg logit contributions
 forgiveness-0.076
Of-0.075
 apology-0.070
 forgive-0.066
 honesty-0.065
Token wide
Token ID3094
Feature activation-1.12e-01

Pos logit contributions
neath+0.084
 valleys+0.070
 canyon+0.066
 tim+0.066
 ditch+0.064
Neg logit contributions
Of-0.100
 else-0.095
 forgiveness-0.094
 respectful-0.090
olding-0.088
Token,
Token ID11
Feature activation-3.67e-02

Pos logit contributions
neath+0.171
 tim+0.134
 marsh+0.130
unks+0.128
 valleys+0.123
Neg logit contributions
 promise-0.200
Of-0.198
keeper-0.198
 respectful-0.197
 friend-0.197
Token open
Token ID1280
Feature activation-5.42e-02

Pos logit contributions
neath+0.066
 camp+0.061
 lake+0.060
 canyon+0.058
 shore+0.058
Neg logit contributions
Of-0.073
olding-0.066
 forgiveness-0.066
 respectful-0.066
 apology-0.064
Token field
Token ID2214
Feature activation-2.70e-02

Pos logit contributions
neath+0.067
 tim+0.063
 camp+0.054
 canyon+0.053
acket+0.053
Neg logit contributions
 forgiveness-0.119
Of-0.116
olding-0.114
aser-0.111
 respectful-0.110
Token.
Token ID13
Feature activation-2.55e-02

Pos logit contributions
neath+0.056
 lake+0.054
 camp+0.053
 canyon+0.053
 creek+0.051
Neg logit contributions
olding-0.044
Of-0.043
 forgiveness-0.041
aid-0.039
 apology-0.038
Token In
Token ID554
Feature activation-3.37e-02

Pos logit contributions
 lake+0.043
 camp+0.041
 canyon+0.041
 ocean+0.040
 shore+0.040
Neg logit contributions
Of-0.055
 forgiveness-0.050
olding-0.050
 apology-0.048
aid-0.047
Token lived
Token ID5615
Feature activation-5.10e-03

Pos logit contributions
Of+0.011
 forgiveness+0.011
aid+0.011
olding+0.011
 apology+0.010
Neg logit contributions
 lake-0.012
neath-0.012
 camp-0.011
 shore-0.011
 creek-0.011
Token in
Token ID287
Feature activation-3.44e-02

Pos logit contributions
neath+0.008
 lake+0.008
 camp+0.008
 river+0.008
 creek+0.008
Neg logit contributions
Of-0.011
olding-0.011
aid-0.011
 forgiveness-0.011
 apology-0.010
Token a
Token ID257
Feature activation-3.88e-02

Pos logit contributions
 canyon+0.052
 lake+0.052
neath+0.052
 shore+0.051
 creek+0.051
Neg logit contributions
 forgiveness-0.071
Of-0.067
aid-0.064
 caring-0.062
 apology-0.062
Token big
Token ID1263
Feature activation-7.66e-02

Pos logit contributions
neath+0.070
 canyon+0.058
 shore+0.058
 tim+0.057
 valleys+0.057
Neg logit contributions
Of-0.072
 else-0.071
 respectful-0.066
 forgiveness-0.066
 caring-0.064
Token,
Token ID11
Feature activation-6.35e-02

Pos logit contributions
neath+0.125
ground+0.110
 tim+0.110
acket+0.104
 takeoff+0.104
Neg logit contributions
 promise-0.132
 forgiveness-0.132
 actions-0.127
 respectful-0.126
 friend-0.126
Token wide
Token ID3094
Feature activation-1.12e-01

Pos logit contributions
neath+0.134
 valleys+0.126
 takeoff+0.122
ground+0.122
 canyon+0.122
Neg logit contributions
Of-0.098
 valuable-0.092
olding-0.088
 caring-0.088
 respectful-0.088
Token field
Token ID2214
Feature activation+8.53e-03

Pos logit contributions
neath+0.184
 takeoff+0.152
 tim+0.144
 valleys+0.139
ground+0.132
Neg logit contributions
 friend-0.204
 owner-0.189
 promise-0.181
 mother-0.180
 actions-0.179
Token.
Token ID13
Feature activation-5.60e-02

Pos logit contributions
Of+0.017
 forgiveness+0.016
olding+0.016
aid+0.016
 apology+0.016
Neg logit contributions
 lake-0.015
neath-0.015
 camp-0.014
 creek-0.014
 river-0.014
Token As
Token ID1081
Feature activation-2.15e-02

Pos logit contributions
 canyon+0.113
neath+0.111
 shore+0.109
 waterfall+0.106
 valleys+0.104
Neg logit contributions
Of-0.093
 forgiveness-0.074
olding-0.073
 Friends-0.071
 apology-0.071
Token he
Token ID339
Feature activation-3.71e-02

Pos logit contributions
neath+0.050
 canyon+0.047
 lake+0.047
 ditch+0.046
 clearing+0.045
Neg logit contributions
Of-0.029
 forgiveness-0.028
 else-0.026
aid-0.024
 respectful-0.024
Token hopped
Token ID45230
Feature activation-2.59e-02

Pos logit contributions
 shore+0.070
neath+0.068
 ditch+0.067
 canyon+0.067
 lake+0.066
Neg logit contributions
Of-0.062
aid-0.061
 else-0.059
Deal-0.058
 forgiveness-0.056
Token She
Token ID1375
Feature activation-3.03e-02

Pos logit contributions
 lake+0.028
neath+0.027
 camp+0.027
 river+0.026
 canyon+0.026
Neg logit contributions
Of-0.033
olding-0.031
 apology-0.030
 forgiveness-0.030
Deal-0.029
Token spotted
Token ID13489
Feature activation-1.86e-02

Pos logit contributions
neath+0.059
 lake+0.058
 canyon+0.056
 shore+0.056
 ditch+0.056
Neg logit contributions
Of-0.053
olding-0.048
 else-0.047
aid-0.046
olded-0.046
Token a
Token ID257
Feature activation-5.05e-02

Pos logit contributions
neath+0.025
 lake+0.025
 canyon+0.024
 camp+0.024
 river+0.024
Neg logit contributions
Of-0.044
 forgiveness-0.044
aid-0.043
 apology-0.040
olding-0.040
Token big
Token ID1263
Feature activation-8.58e-02

Pos logit contributions
neath+0.092
 valleys+0.084
 canyon+0.078
 camp+0.074
 fields+0.073
Neg logit contributions
Of-0.098
 forgiveness-0.094
 else-0.091
olding-0.088
aid-0.087
Token,
Token ID11
Feature activation-4.18e-02

Pos logit contributions
neath+0.157
 valleys+0.149
 canyon+0.127
ground+0.126
growth+0.126
Neg logit contributions
 forgiveness-0.152
 apology-0.142
 promise-0.140
Of-0.139
 honesty-0.131
Token wide
Token ID3094
Feature activation-1.11e-01

Pos logit contributions
neath+0.081
 shore+0.074
 lake+0.074
 camp+0.074
 valleys+0.073
Neg logit contributions
Of-0.078
 forgiveness-0.071
olding-0.069
aid-0.069
 apology-0.068
Token tree
Token ID5509
Feature activation-3.24e-02

Pos logit contributions
neath+0.206
 valleys+0.175
unders+0.153
unks+0.151
 takeoff+0.151
Neg logit contributions
Of-0.178
 promise-0.174
 apology-0.172
 understanding-0.169
olding-0.169
Token in
Token ID287
Feature activation-4.03e-02

Pos logit contributions
neath+0.062
 camp+0.060
 lake+0.059
 shore+0.057
 creek+0.057
Neg logit contributions
olding-0.056
Of-0.053
 forgiveness-0.051
aid-0.051
asing-0.049
Token the
Token ID262
Feature activation-3.12e-02

Pos logit contributions
acket+0.065
 tim+0.065
 canyon+0.063
 river+0.060
 camp+0.060
Neg logit contributions
 forgiveness-0.078
Of-0.076
 apology-0.070
 understanding-0.070
aid-0.069
Token distance
Token ID5253
Feature activation+9.19e-05

Pos logit contributions
neath+0.041
 camp+0.037
 canyon+0.035
 lake+0.034
 shore+0.033
Neg logit contributions
Of-0.076
aid-0.072
 forgiveness-0.070
olding-0.070
Deal-0.068
Token and
Token ID290
Feature activation-1.06e-02

Pos logit contributions
Of+0.000
olding+0.000
 forgiveness+0.000
aid+0.000
 apology+0.000
Neg logit contributions
 lake-0.000
neath-0.000
 camp-0.000
 creek-0.000
 canyon-0.000
Token,
Token ID11
Feature activation-3.87e-02

Pos logit contributions
 lake+0.056
 canyon+0.054
neath+0.053
 river+0.053
 clearing+0.052
Neg logit contributions
Of-0.060
olding-0.060
 else-0.059
 forgiveness-0.059
olded-0.057
Token the
Token ID262
Feature activation-2.27e-02

Pos logit contributions
 canyon+0.065
 lake+0.062
 river+0.060
 camp+0.060
neath+0.059
Neg logit contributions
Of-0.081
 else-0.077
 forgiveness-0.074
olding-0.074
aid-0.071
Token beetle
Token ID45489
Feature activation-2.81e-02

Pos logit contributions
neath+0.039
 canyon+0.035
 valleys+0.035
 lake+0.035
 clearing+0.034
Neg logit contributions
Of-0.047
 forgiveness-0.046
olding-0.045
aid-0.044
 apology-0.044
Token sailed
Token ID33835
Feature activation-7.00e-03

Pos logit contributions
neath+0.055
 lake+0.053
 camp+0.052
 canyon+0.052
 shore+0.052
Neg logit contributions
Of-0.047
olding-0.045
 forgiveness-0.045
 else-0.044
aid-0.044
Token across
Token ID1973
Feature activation-8.67e-02

Pos logit contributions
neath+0.012
 lake+0.012
 camp+0.011
 canyon+0.011
 creek+0.011
Neg logit contributions
Of-0.015
aid-0.014
 forgiveness-0.014
olding-0.014
 apology-0.013
Token the
Token ID262
Feature activation-1.11e-01

Pos logit contributions
acket+0.193
 tim+0.175
latable+0.171
icer+0.165
neath+0.163
Neg logit contributions
 forgiveness-0.122
 else-0.103
 honesty-0.091
aid-0.090
 caring-0.090
Token sea
Token ID5417
Feature activation-3.34e-02

Pos logit contributions
neath+0.174
 tim+0.148
 OUT+0.140
und+0.135
anced+0.125
Neg logit contributions
 valuable-0.196
 respectful-0.189
 owner-0.189
 important-0.187
 else-0.182
Token.
Token ID13
Feature activation-5.63e-02

Pos logit contributions
neath+0.077
 lake+0.071
 canyon+0.069
 camp+0.068
 ditch+0.067
Neg logit contributions
olding-0.047
Of-0.044
 else-0.043
 respectful-0.042
 forgiveness-0.042
Token
Token ID220
Feature activation+2.54e-02

Pos logit contributions
neath+0.111
 camp+0.108
 lake+0.107
 canyon+0.107
 waterfall+0.104
Neg logit contributions
Of-0.101
olding-0.085
 apology-0.083
aid-0.081
 forgiveness-0.081
Token She
Token ID1375
Feature activation-3.91e-02

Pos logit contributions
Of+0.059
 forgiveness+0.058
aid+0.057
olding+0.057
 apology+0.056
Neg logit contributions
 lake-0.036
neath-0.036
 camp-0.034
 shore-0.033
 creek-0.033
Token was
Token ID373
Feature activation-5.17e-02

Pos logit contributions
neath+0.084
 creek+0.077
 camp+0.077
 clearing+0.076
 lake+0.076
Neg logit contributions
 else-0.057
Of-0.057
 forgiveness-0.053
olding-0.052
 respectful-0.051
TokenSuddenly
Token ID38582
Feature activation-4.55e-02

Pos logit contributions
Of+0.012
 forgiveness+0.011
olding+0.011
aid+0.011
 apology+0.011
Neg logit contributions
 lake-0.013
neath-0.012
 camp-0.012
 creek-0.012
 shore-0.012
Token,
Token ID11
Feature activation-7.67e-03

Pos logit contributions
 lake+0.088
 ocean+0.085
 shore+0.085
 canyon+0.082
 river+0.082
Neg logit contributions
 forgiveness-0.078
Of-0.075
aid-0.074
 else-0.071
olding-0.070
Token she
Token ID673
Feature activation-3.26e-02

Pos logit contributions
 lake+0.014
neath+0.013
 ocean+0.013
 shore+0.013
 river+0.013
Neg logit contributions
Of-0.015
 forgiveness-0.015
aid-0.014
olding-0.014
 apology-0.013
Token spotted
Token ID13489
Feature activation-2.88e-02

Pos logit contributions
 lake+0.065
neath+0.064
 creek+0.062
 camp+0.061
 canyon+0.060
Neg logit contributions
 else-0.052
Of-0.050
olding-0.050
 forgiveness-0.050
aid-0.049
Token a
Token ID257
Feature activation-5.53e-02

Pos logit contributions
 lake+0.043
neath+0.043
 canyon+0.043
 creek+0.042
 river+0.042
Neg logit contributions
 forgiveness-0.064
Of-0.060
aid-0.060
 apology-0.056
olding-0.056
Token wide
Token ID3094
Feature activation-1.11e-01

Pos logit contributions
neath+0.104
 valleys+0.095
 canyon+0.087
 camp+0.085
 fields+0.084
Neg logit contributions
Of-0.101
 forgiveness-0.100
 else-0.094
olding-0.092
aid-0.092
Token field
Token ID2214
Feature activation-2.20e-02

Pos logit contributions
neath+0.203
 valleys+0.184
unks+0.163
icer+0.155
unders+0.154
Neg logit contributions
 promise-0.171
 understanding-0.169
 apology-0.166
 respectful-0.164
olding-0.164
Token of
Token ID286
Feature activation-5.45e-02

Pos logit contributions
neath+0.046
 camp+0.045
 lake+0.045
 canyon+0.043
 valleys+0.043
Neg logit contributions
olding-0.036
Of-0.034
 forgiveness-0.033
aid-0.033
 apology-0.031
Token flowers
Token ID12734
Feature activation-3.70e-02

Pos logit contributions
neath+0.090
 canyon+0.084
 tim+0.082
gy+0.079
acket+0.075
Neg logit contributions
Of-0.107
 forgiveness-0.098
 respectful-0.097
 caring-0.095
aid-0.095
Token.
Token ID13
Feature activation-2.11e-02

Pos logit contributions
neath+0.077
 camp+0.075
 lake+0.075
 canyon+0.073
 ditch+0.072
Neg logit contributions
olding-0.054
Of-0.051
 forgiveness-0.051
aid-0.050
 else-0.049
Token It
Token ID632
Feature activation-1.91e-02

Pos logit contributions
 lake+0.037
 shore+0.036
 ocean+0.036
neath+0.035
 camp+0.035
Neg logit contributions
Of-0.042
olding-0.040
 forgiveness-0.040
aid-0.039
 apology-0.039
Token stay
Token ID2652
Feature activation+6.92e-03

Pos logit contributions
Of+0.092
aid+0.092
olding+0.091
aser+0.086
 apology+0.085
Neg logit contributions
 lake-0.080
 shore-0.077
 river-0.076
 camp-0.075
 creek-0.074
Token happy
Token ID3772
Feature activation+2.40e-02

Pos logit contributions
Of+0.014
 forgiveness+0.013
olding+0.013
aid+0.013
 apology+0.013
Neg logit contributions
 lake-0.012
neath-0.012
 camp-0.012
 creek-0.011
 river-0.011
Token.
Token ID13
Feature activation+1.76e-02

Pos logit contributions
Of+0.056
aid+0.054
olding+0.053
 forgiveness+0.053
 apology+0.052
Neg logit contributions
 lake-0.036
 camp-0.034
neath-0.034
 shore-0.033
 river-0.033
Token<|endoftext|>
Token ID50256
Feature activation-6.71e-02

Pos logit contributions
Of+0.040
aid+0.039
 forgiveness+0.039
olding+0.039
 apology+0.038
Neg logit contributions
 lake-0.026
neath-0.026
 camp-0.025
 river-0.024
 creek-0.024
TokenLittle
Token ID22253
Feature activation-3.63e-02

Pos logit contributions
 canyon+0.136
 river+0.132
 shore+0.130
 ditch+0.128
 lake+0.127
Neg logit contributions
Of-0.099
 forgiveness-0.093
aid-0.091
olding-0.089
olds-0.088
Token Sammy
Token ID44067
Feature activation-1.09e-01

Pos logit contributions
 ditch+0.057
 canyon+0.057
 lake+0.056
 valleys+0.055
 shore+0.055
Neg logit contributions
Of-0.078
olding-0.072
 forgiveness-0.072
 apology-0.068
aid-0.067
Token always
Token ID1464
Feature activation-2.69e-02

Pos logit contributions
 ditch+0.244
 canyon+0.239
 clearing+0.232
 takeoff+0.225
 camp+0.224
Neg logit contributions
 cared-0.127
 cares-0.126
Of-0.126
 actions-0.122
 else-0.119
Token liked
Token ID8288
Feature activation-2.05e-02

Pos logit contributions
neath+0.058
 lake+0.057
 creek+0.055
 canyon+0.055
 camp+0.054
Neg logit contributions
Of-0.042
 forgiveness-0.041
olding-0.040
aid-0.038
 respectful-0.038
Token to
Token ID284
Feature activation-2.56e-02

Pos logit contributions
neath+0.046
 lake+0.043
 clearing+0.042
 canyon+0.042
 camp+0.041
Neg logit contributions
 forgiveness-0.032
Of-0.032
olding-0.029
aid-0.029
 apology-0.029
Token explore
Token ID7301
Feature activation-3.77e-02

Pos logit contributions
neath+0.044
 lake+0.042
 clearing+0.041
 valleys+0.041
 canyon+0.040
Neg logit contributions
 forgiveness-0.053
Of-0.051
 apology-0.051
aid-0.050
olding-0.048
Token,
Token ID11
Feature activation-2.75e-02

Pos logit contributions
neath+0.060
 canyon+0.058
 ditch+0.055
 lake+0.055
 river+0.053
Neg logit contributions
 forgiveness-0.078
Of-0.073
 else-0.072
 apology-0.069
olding-0.067
Token day
Token ID1110
Feature activation-1.04e-02

Pos logit contributions
Of+0.001
olding+0.001
 forgiveness+0.001
aid+0.001
 apology+0.001
Neg logit contributions
neath-0.002
 lake-0.002
 camp-0.002
 shore-0.002
 canyon-0.002
Token,
Token ID11
Feature activation+4.11e-03

Pos logit contributions
 lake+0.016
neath+0.015
 shore+0.015
 river+0.015
 canyon+0.015
Neg logit contributions
Of-0.023
 forgiveness-0.022
aid-0.022
olding-0.022
 apology-0.021
Token he
Token ID339
Feature activation-4.28e-02

Pos logit contributions
Of+0.009
 forgiveness+0.008
aid+0.008
olding+0.008
 apology+0.008
Neg logit contributions
 lake-0.007
neath-0.007
 camp-0.007
 river-0.006
 shore-0.006
Token found
Token ID1043
Feature activation-6.01e-03

Pos logit contributions
neath+0.090
 lake+0.086
 canyon+0.084
 marsh+0.084
 shore+0.082
Neg logit contributions
Of-0.061
 else-0.060
ups-0.056
olding-0.055
olded-0.055
Token a
Token ID257
Feature activation-2.71e-02

Pos logit contributions
 lake+0.008
neath+0.007
 camp+0.007
 creek+0.007
 canyon+0.007
Neg logit contributions
Of-0.015
 forgiveness-0.015
aid-0.015
olding-0.014
 apology-0.014
Token wide
Token ID3094
Feature activation-1.09e-01

Pos logit contributions
neath+0.049
 valleys+0.046
 canyon+0.042
 lake+0.042
 camp+0.042
Neg logit contributions
Of-0.054
 forgiveness-0.053
olding-0.049
 apology-0.049
aid-0.049
Token door
Token ID3420
Feature activation-5.36e-03

Pos logit contributions
neath+0.210
 valleys+0.197
 takeoff+0.162
 tim+0.159
unders+0.159
Neg logit contributions
 promise-0.168
 gift-0.160
 forgiveness-0.156
 apology-0.155
 valuable-0.154
Token.
Token ID13
Feature activation-2.85e-02

Pos logit contributions
 lake+0.009
neath+0.009
 camp+0.009
 creek+0.009
 shore+0.009
Neg logit contributions
Of-0.011
olding-0.011
 forgiveness-0.010
aid-0.010
 apology-0.010
Token He
Token ID679
Feature activation-6.71e-03

Pos logit contributions
 lake+0.051
 shore+0.050
 ocean+0.049
 canyon+0.049
 waterfall+0.048
Neg logit contributions
Of-0.057
 forgiveness-0.052
aid-0.052
olding-0.051
 apology-0.050
Token hopped
Token ID45230
Feature activation-3.74e-02

Pos logit contributions
 lake+0.012
neath+0.012
 shore+0.012
 creek+0.011
 camp+0.011
Neg logit contributions
Of-0.013
aid-0.013
olding-0.012
 forgiveness-0.012
 apology-0.012
Token over
Token ID625
Feature activation-2.60e-02

Pos logit contributions
 lake+0.068
 canyon+0.064
 river+0.064
 ocean+0.064
 camp+0.062
Neg logit contributions
Of-0.073
olding-0.069
aid-0.064
 apology-0.062
olded-0.061
Token should
Token ID815
Feature activation+1.85e-02

Pos logit contributions
Of+0.051
 forgiveness+0.050
olding+0.049
aid+0.048
 apology+0.048
Neg logit contributions
 lake-0.047
neath-0.046
 camp-0.045
 river-0.043
 pool-0.043
Token be
Token ID307
Feature activation+5.43e-02

Pos logit contributions
Of+0.034
olding+0.032
aid+0.032
 forgiveness+0.032
 apology+0.031
Neg logit contributions
 lake-0.036
neath-0.035
 camp-0.034
 river-0.034
 shore-0.034
Token more
Token ID517
Feature activation+4.49e-02

Pos logit contributions
aser+0.092
grand+0.092
oral+0.091
 apology+0.089
olding+0.089
Neg logit contributions
 lake-0.090
 clearing-0.089
 fields-0.088
 pool-0.088
 shore-0.087
Token careful
Token ID8161
Feature activation+4.46e-02

Pos logit contributions
Of+0.071
 forgiveness+0.064
 apology+0.062
aser+0.062
oral+0.061
Neg logit contributions
 lake-0.089
 shore-0.088
 camp-0.088
 river-0.086
 ocean-0.085
Token and
Token ID290
Feature activation+7.56e-02

Pos logit contributions
Of+0.110
aid+0.101
 forgiveness+0.096
 apology+0.096
olding+0.094
Neg logit contributions
 lake-0.060
 shore-0.057
 ocean-0.056
 camp-0.056
 river-0.055
Token respectful
Token ID25923
Feature activation+6.93e-02

Pos logit contributions
aser+0.120
asers+0.118
oral+0.117
asing+0.107
fulness+0.105
Neg logit contributions
 river-0.136
 camp-0.136
 lake-0.135
 shore-0.133
 creek-0.131
Token.
Token ID13
Feature activation+1.51e-02

Pos logit contributions
Of+0.196
aid+0.183
 apology+0.173
 forgiveness+0.170
Deal+0.169
Neg logit contributions
 camp-0.059
 pool-0.056
 lake-0.055
 shore-0.052
 ocean-0.052
Token
Token ID198
Feature activation+1.31e-02

Pos logit contributions
Of+0.034
aid+0.034
 forgiveness+0.034
olding+0.033
 apology+0.033
Neg logit contributions
 lake-0.022
neath-0.022
 camp-0.021
 creek-0.021
 shore-0.020
Token
Token ID198
Feature activation+1.98e-02

Pos logit contributions
Of+0.029
 forgiveness+0.029
olding+0.028
aid+0.028
 apology+0.028
Neg logit contributions
 lake-0.020
neath-0.020
 camp-0.019
 shore-0.019
 creek-0.019
Token"
Token ID1
Feature activation+6.19e-02

Pos logit contributions
 forgiveness+0.036
Of+0.036
 apology+0.035
olding+0.035
aid+0.035
Neg logit contributions
 lake-0.038
neath-0.037
 creek-0.036
 camp-0.036
 river-0.036
TokenCan
Token ID6090
Feature activation+5.06e-02

Pos logit contributions
olding+0.132
 apology+0.130
olded+0.129
aser+0.129
 supervision+0.128
Neg logit contributions
 lake-0.096
 creek-0.088
 shore-0.083
 river-0.082
neath-0.082
Token hugged
Token ID41130
Feature activation+2.20e-02

Pos logit contributions
Of+0.032
 forgiveness+0.032
olding+0.031
aid+0.031
 apology+0.031
Neg logit contributions
 lake-0.035
neath-0.035
 camp-0.033
 river-0.033
 shore-0.032
Token them
Token ID606
Feature activation+5.84e-03

Pos logit contributions
olding+0.055
Of+0.054
 forgiveness+0.053
aid+0.053
asing+0.051
Neg logit contributions
neath-0.029
 lake-0.029
 camp-0.027
 shore-0.026
 creek-0.026
Token and
Token ID290
Feature activation+3.79e-02

Pos logit contributions
Of+0.014
 forgiveness+0.014
olding+0.014
aid+0.014
 apology+0.013
Neg logit contributions
 lake-0.008
neath-0.008
 camp-0.007
 creek-0.007
 river-0.007
Token said
Token ID531
Feature activation+2.69e-02

Pos logit contributions
olding+0.067
 apology+0.062
asing+0.062
aser+0.062
Of+0.061
Neg logit contributions
 lake-0.075
neath-0.073
 camp-0.071
 shore-0.070
 fields-0.069
Token,
Token ID11
Feature activation+2.20e-02

Pos logit contributions
Of+0.077
aid+0.074
olding+0.074
 forgiveness+0.073
 apology+0.072
Neg logit contributions
 lake-0.022
neath-0.022
 camp-0.021
 shore-0.020
 creek-0.019
Token "
Token ID366
Feature activation+6.68e-02

Pos logit contributions
Of+0.048
 forgiveness+0.048
aid+0.048
olding+0.047
 apology+0.046
Neg logit contributions
 lake-0.033
neath-0.032
 river-0.031
 shore-0.031
 camp-0.031
TokenDon
Token ID3987
Feature activation+1.19e-02

Pos logit contributions
 apology+0.152
 forgiveness+0.148
olding+0.148
aser+0.145
 classmates+0.143
Neg logit contributions
 lake-0.096
neath-0.089
 creek-0.083
 river-0.083
 valleys-0.082
Token't
Token ID470
Feature activation+9.72e-03

Pos logit contributions
Of+0.027
 forgiveness+0.027
aid+0.026
olding+0.026
 apology+0.026
Neg logit contributions
 lake-0.018
neath-0.017
 camp-0.017
 river-0.016
 creek-0.016
Token worry
Token ID5490
Feature activation+5.03e-03

Pos logit contributions
Of+0.015
aid+0.015
 forgiveness+0.015
olding+0.015
 apology+0.014
Neg logit contributions
 lake-0.021
neath-0.021
 camp-0.021
 river-0.020
 creek-0.020
Token,
Token ID11
Feature activation+1.65e-02

Pos logit contributions
Of+0.013
 forgiveness+0.012
olding+0.012
aid+0.012
 apology+0.012
Neg logit contributions
 lake-0.006
neath-0.006
 camp-0.006
 river-0.006
 creek-0.006
Token I
Token ID314
Feature activation+2.89e-02

Pos logit contributions
Of+0.028
aid+0.028
olding+0.027
 forgiveness+0.027
 apology+0.026
Neg logit contributions
 lake-0.034
neath-0.033
 camp-0.033
 river-0.032
 creek-0.032
Token and
Token ID290
Feature activation+1.64e-02

Pos logit contributions
Of+0.029
 forgiveness+0.028
olding+0.028
aid+0.028
 apology+0.028
Neg logit contributions
 lake-0.015
neath-0.015
 camp-0.015
 shore-0.014
 river-0.014
Token picked
Token ID6497
Feature activation-2.72e-03

Pos logit contributions
olding+0.023
Of+0.023
 forgiveness+0.022
 apology+0.022
aid+0.022
Neg logit contributions
 lake-0.038
neath-0.037
 camp-0.037
 shore-0.036
 river-0.036
Token her
Token ID607
Feature activation-8.31e-03

Pos logit contributions
 lake+0.003
neath+0.003
 river+0.003
 camp+0.003
 canyon+0.003
Neg logit contributions
Of-0.007
 forgiveness-0.007
aid-0.007
olding-0.007
 apology-0.007
Token up
Token ID510
Feature activation+2.73e-02

Pos logit contributions
 lake+0.009
 river+0.008
 camp+0.008
neath+0.008
 canyon+0.008
Neg logit contributions
Of-0.022
 forgiveness-0.022
olding-0.021
aid-0.021
 apology-0.021
Token.
Token ID13
Feature activation-4.70e-03

Pos logit contributions
Of+0.063
aid+0.061
olding+0.060
 forgiveness+0.060
asing+0.060
Neg logit contributions
 lake-0.040
neath-0.039
 camp-0.038
 shore-0.037
 creek-0.036
Token "
Token ID366
Feature activation+6.73e-02

Pos logit contributions
 lake+0.009
neath+0.008
 camp+0.008
 canyon+0.008
 shore+0.008
Neg logit contributions
Of-0.009
 forgiveness-0.009
olding-0.009
aid-0.009
 apology-0.008
TokenYou
Token ID1639
Feature activation-2.14e-02

Pos logit contributions
 apology+0.143
 forgiveness+0.138
olding+0.137
 actions+0.136
asers+0.136
Neg logit contributions
neath-0.106
 clearing-0.104
 lake-0.102
 valleys-0.094
 ocean-0.089
Token're
Token ID821
Feature activation-3.66e-02

Pos logit contributions
 lake+0.043
 creek+0.042
 shore+0.042
neath+0.041
 canyon+0.041
Neg logit contributions
aid-0.038
Of-0.037
 forgiveness-0.036
olded-0.034
olding-0.034
Token a
Token ID257
Feature activation+5.20e-03

Pos logit contributions
 creek+0.079
neath+0.079
 valleys+0.079
 ditch+0.078
 lake+0.077
Neg logit contributions
 forgiveness-0.059
Of-0.057
 caring-0.055
 respectful-0.054
aid-0.052
Token very
Token ID845
Feature activation+3.42e-02

Pos logit contributions
Of+0.009
 forgiveness+0.009
olding+0.009
aid+0.009
 apology+0.008
Neg logit contributions
 lake-0.011
neath-0.011
 camp-0.010
 creek-0.010
 river-0.010
Token good
Token ID922
Feature activation+2.25e-02

Pos logit contributions
 forgiveness+0.059
Of+0.057
aid+0.057
asing+0.055
 apology+0.053
Neg logit contributions
 lake-0.067
 camp-0.065
 river-0.065
 creek-0.064
 shore-0.063
Token seems
Token ID2331
Feature activation+3.12e-02

Pos logit contributions
Of+0.066
aid+0.063
olding+0.062
 forgiveness+0.062
 apology+0.061
Neg logit contributions
 lake-0.064
neath-0.063
 camp-0.061
 river-0.060
 creek-0.059
Token too
Token ID1165
Feature activation-5.45e-03

Pos logit contributions
Of+0.063
aid+0.062
 forgiveness+0.061
olding+0.060
 apology+0.060
Neg logit contributions
 lake-0.054
neath-0.052
 camp-0.052
 river-0.051
 shore-0.051
Token good
Token ID922
Feature activation+4.13e-02

Pos logit contributions
 lake+0.010
neath+0.010
 camp+0.010
 creek+0.010
 shore+0.010
Neg logit contributions
Of-0.010
olding-0.010
 forgiveness-0.010
aid-0.010
 apology-0.009
Token to
Token ID284
Feature activation+1.45e-02

Pos logit contributions
Of+0.095
aid+0.093
 forgiveness+0.088
olding+0.087
 apology+0.087
Neg logit contributions
 lake-0.059
 shore-0.057
 camp-0.057
neath-0.056
 ocean-0.056
Token be
Token ID307
Feature activation+2.39e-02

Pos logit contributions
Of+0.029
aid+0.028
olding+0.028
 forgiveness+0.028
 apology+0.027
Neg logit contributions
 lake-0.026
neath-0.025
 camp-0.024
 shore-0.024
 river-0.024
Token true
Token ID2081
Feature activation+6.18e-02

Pos logit contributions
Of+0.048
olding+0.046
aid+0.046
 forgiveness+0.045
 apology+0.044
Neg logit contributions
 lake-0.042
neath-0.041
 camp-0.040
 river-0.040
 shore-0.039
Token.
Token ID13
Feature activation+2.10e-02

Pos logit contributions
Of+0.152
aid+0.149
olding+0.142
 apology+0.138
 forgiveness+0.137
Neg logit contributions
 lake-0.081
 camp-0.073
 shore-0.072
 river-0.071
 ocean-0.070
Token If
Token ID1002
Feature activation+2.60e-02

Pos logit contributions
Of+0.047
aid+0.047
 forgiveness+0.046
olding+0.046
 apology+0.045
Neg logit contributions
 lake-0.031
neath-0.030
 camp-0.029
 river-0.029
 creek-0.028
Token something
Token ID1223
Feature activation+5.35e-02

Pos logit contributions
Of+0.047
aid+0.046
olding+0.046
 apology+0.043
 forgiveness+0.043
Neg logit contributions
 lake-0.051
 camp-0.050
neath-0.049
 shore-0.049
 river-0.048
Token seems
Token ID2331
Feature activation+2.04e-02

Pos logit contributions
Of+0.107
aid+0.107
olding+0.104
asing+0.101
asers+0.100
Neg logit contributions
 lake-0.091
 camp-0.087
 creek-0.084
 river-0.084
neath-0.083
Token too
Token ID1165
Feature activation-5.51e-03

Pos logit contributions
Of+0.040
aid+0.039
 forgiveness+0.038
olding+0.038
 apology+0.037
Neg logit contributions
 lake-0.037
neath-0.036
 camp-0.035
 river-0.035
 creek-0.034
Token
Token ID220
Feature activation-7.49e-03

Pos logit contributions
 lake+0.005
neath+0.004
 camp+0.004
 creek+0.004
 canyon+0.004
Neg logit contributions
Of-0.006
olding-0.006
 forgiveness-0.006
aid-0.006
 apology-0.006
Token
Token ID198
Feature activation+8.17e-03

Pos logit contributions
 lake+0.014
neath+0.013
 camp+0.013
 creek+0.013
 canyon+0.013
Neg logit contributions
Of-0.015
olding-0.014
 forgiveness-0.014
aid-0.014
 apology-0.014
Token
Token ID198
Feature activation+1.31e-02

Pos logit contributions
Of+0.018
 forgiveness+0.017
olding+0.017
aid+0.017
 apology+0.017
Neg logit contributions
 lake-0.013
neath-0.013
 camp-0.012
 creek-0.012
 river-0.012
TokenAll
Token ID3237
Feature activation+2.60e-02

Pos logit contributions
Of+0.024
 forgiveness+0.023
aid+0.023
olding+0.023
 apology+0.022
Neg logit contributions
 lake-0.026
neath-0.025
 camp-0.024
 shore-0.024
 river-0.024
Token of
Token ID286
Feature activation-4.31e-03

Pos logit contributions
Of+0.043
 forgiveness+0.042
aid+0.041
olding+0.040
 apology+0.040
Neg logit contributions
 lake-0.055
neath-0.054
 camp-0.053
 ocean-0.052
 shore-0.051
Token a
Token ID257
Feature activation+7.04e-02

Pos logit contributions
neath+0.006
 lake+0.006
 river+0.005
 creek+0.005
 canyon+0.005
Neg logit contributions
Of-0.011
 forgiveness-0.010
aid-0.010
olding-0.010
 apology-0.010
Token sudden
Token ID4802
Feature activation-4.20e-02

Pos logit contributions
asing+0.180
olding+0.171
aid+0.167
 apology+0.166
asers+0.165
Neg logit contributions
 lake-0.100
 river-0.092
 camp-0.091
 shore-0.089
 clearing-0.088
Token,
Token ID11
Feature activation-3.41e-02

Pos logit contributions
 lake+0.077
neath+0.076
 creek+0.074
 clearing+0.074
 canyon+0.073
Neg logit contributions
 forgiveness-0.076
aid-0.072
 else-0.072
Of-0.071
olded-0.069
Token something
Token ID1223
Feature activation+3.01e-02

Pos logit contributions
 canyon+0.055
 lake+0.054
 river+0.054
neath+0.054
 clearing+0.052
Neg logit contributions
Of-0.072
 forgiveness-0.068
olding-0.064
aid-0.064
 else-0.063
Token magical
Token ID10883
Feature activation+6.37e-03

Pos logit contributions
Of+0.062
olding+0.060
 forgiveness+0.060
aid+0.059
asing+0.059
Neg logit contributions
 lake-0.052
neath-0.050
 camp-0.049
 shore-0.048
 creek-0.048
Token happened
Token ID3022
Feature activation+5.06e-03

Pos logit contributions
Of+0.011
 forgiveness+0.010
olding+0.010
aid+0.010
 apology+0.010
Neg logit contributions
 lake-0.013
neath-0.013
 camp-0.013
 creek-0.013
 river-0.013
Token get
Token ID651
Feature activation-8.21e-04

Pos logit contributions
Of+0.003
 forgiveness+0.003
olding+0.003
aid+0.003
 apology+0.003
Neg logit contributions
 lake-0.002
neath-0.002
 camp-0.002
 creek-0.002
 canyon-0.002
Token enough
Token ID1576
Feature activation-7.28e-03

Pos logit contributions
 lake+0.001
neath+0.001
 camp+0.001
 creek+0.001
 river+0.001
Neg logit contributions
Of-0.002
 forgiveness-0.002
aid-0.002
olding-0.002
 apology-0.002
Token sunlight
Token ID19606
Feature activation-2.17e-02

Pos logit contributions
neath+0.013
 lake+0.013
 camp+0.012
 creek+0.012
 canyon+0.012
Neg logit contributions
Of-0.014
 forgiveness-0.014
olding-0.013
aid-0.013
 apology-0.013
Token",
Token ID1600
Feature activation+3.89e-02

Pos logit contributions
 lake+0.038
neath+0.037
 creek+0.036
 clearing+0.036
 camp+0.036
Neg logit contributions
Of-0.041
 forgiveness-0.041
olding-0.041
aid-0.041
 apology-0.040
Token said
Token ID531
Feature activation+2.41e-02

Pos logit contributions
olding+0.066
Of+0.066
aid+0.066
asing+0.064
 forgiveness+0.063
Neg logit contributions
 lake-0.077
neath-0.075
 shore-0.075
 camp-0.072
 creek-0.072
Token Mom
Token ID11254
Feature activation+4.50e-02

Pos logit contributions
Of+0.061
olding+0.060
aid+0.060
 forgiveness+0.058
 apology+0.058
Neg logit contributions
neath-0.030
 lake-0.029
 camp-0.027
 shore-0.027
 creek-0.026
Tokenmy
Token ID1820
Feature activation+3.54e-02

Pos logit contributions
Of+0.104
 forgiveness+0.098
 apology+0.098
aid+0.097
olding+0.095
Neg logit contributions
neath-0.060
 lake-0.060
 camp-0.059
 river-0.057
 clearing-0.056
Token.
Token ID13
Feature activation-1.21e-02

Pos logit contributions
Of+0.079
aid+0.075
 forgiveness+0.075
 apology+0.073
olding+0.072
Neg logit contributions
 lake-0.052
neath-0.052
 camp-0.051
 shore-0.049
 river-0.049
Token Jack
Token ID3619
Feature activation-8.98e-03

Pos logit contributions
 lake+0.018
 camp+0.018
 canyon+0.018
neath+0.018
 creek+0.017
Neg logit contributions
Of-0.028
 forgiveness-0.026
olding-0.026
 apology-0.025
olded-0.025
Token felt
Token ID2936
Feature activation+1.66e-02

Pos logit contributions
 lake+0.019
neath+0.019
 camp+0.018
 creek+0.018
 canyon+0.018
Neg logit contributions
Of-0.015
aid-0.014
olding-0.014
 forgiveness-0.014
 apology-0.013
Token sad
Token ID6507
Feature activation+4.55e-02

Pos logit contributions
Of+0.033
olding+0.032
 forgiveness+0.032
aid+0.032
 apology+0.031
Neg logit contributions
 lake-0.029
neath-0.028
 camp-0.027
 shore-0.027
 creek-0.027
Token was
Token ID373
Feature activation-1.76e-03

Pos logit contributions
Of+0.051
olding+0.050
 forgiveness+0.050
 apology+0.049
aid+0.049
Neg logit contributions
neath-0.041
 lake-0.041
 camp-0.038
 river-0.038
 creek-0.038
Token stuck
Token ID7819
Feature activation-3.82e-02

Pos logit contributions
 lake+0.003
neath+0.003
 camp+0.003
 creek+0.003
 river+0.003
Neg logit contributions
Of-0.004
 forgiveness-0.004
aid-0.004
olding-0.004
 apology-0.003
Token on
Token ID319
Feature activation-3.73e-02

Pos logit contributions
 lake+0.073
neath+0.068
 creek+0.068
 river+0.068
 canyon+0.068
Neg logit contributions
olding-0.070
 forgiveness-0.067
Of-0.066
asing-0.065
aser-0.065
Token the
Token ID262
Feature activation-1.25e-02

Pos logit contributions
 lake+0.062
 canyon+0.062
 ocean+0.060
 camp+0.060
 river+0.059
Neg logit contributions
 forgiveness-0.073
 apology-0.067
Of-0.067
olding-0.065
aid-0.064
Token slide
Token ID10649
Feature activation-2.15e-02

Pos logit contributions
neath+0.016
 camp+0.014
 lake+0.014
 clearing+0.014
 canyon+0.014
Neg logit contributions
Of-0.031
 forgiveness-0.031
 apology-0.031
olding-0.031
aid-0.030
Token.
Token ID13
Feature activation+1.76e-02

Pos logit contributions
 lake+0.041
neath+0.040
 camp+0.039
 canyon+0.039
 shore+0.039
Neg logit contributions
olding-0.038
Of-0.037
 forgiveness-0.036
aid-0.036
aser-0.035
Token She
Token ID1375
Feature activation+1.62e-02

Pos logit contributions
Of+0.039
aid+0.039
 forgiveness+0.039
olding+0.038
 apology+0.037
Neg logit contributions
neath-0.026
 lake-0.026
 creek-0.025
 camp-0.025
 river-0.024
Token felt
Token ID2936
Feature activation+8.54e-03

Pos logit contributions
Of+0.033
olding+0.032
 forgiveness+0.032
aid+0.032
 apology+0.031
Neg logit contributions
 lake-0.028
neath-0.028
 camp-0.026
 river-0.026
 shore-0.026
Token a
Token ID257
Feature activation+1.30e-02

Pos logit contributions
Of+0.016
 forgiveness+0.016
olding+0.016
aid+0.016
 apology+0.015
Neg logit contributions
 lake-0.016
neath-0.015
 camp-0.015
 creek-0.015
 river-0.015
Token pain
Token ID2356
Feature activation-2.76e-02

Pos logit contributions
Of+0.027
 forgiveness+0.027
olding+0.027
aid+0.026
 apology+0.026
Neg logit contributions
 lake-0.021
neath-0.021
 camp-0.020
 creek-0.020
 river-0.020
Token in
Token ID287
Feature activation+5.80e-03

Pos logit contributions
 lake+0.049
neath+0.047
 creek+0.045
 valleys+0.045
 camp+0.045
Neg logit contributions
olding-0.053
 forgiveness-0.053
aid-0.052
Of-0.052
aser-0.050
Token adorable
Token ID29012
Feature activation-1.11e-03

Pos logit contributions
Of+0.016
 forgiveness+0.016
olding+0.016
aid+0.016
 apology+0.015
Neg logit contributions
 lake-0.018
neath-0.017
 camp-0.017
 river-0.016
 creek-0.016
Token,
Token ID11
Feature activation+1.62e-02

Pos logit contributions
neath+0.002
 lake+0.002
 camp+0.002
 canyon+0.002
 valleys+0.002
Neg logit contributions
Of-0.002
 forgiveness-0.002
olding-0.002
 apology-0.002
aid-0.002
Token soft
Token ID2705
Feature activation-3.63e-02

Pos logit contributions
Of+0.037
olding+0.036
 forgiveness+0.036
aid+0.036
 apology+0.036
Neg logit contributions
 lake-0.023
neath-0.022
 camp-0.022
 river-0.021
 creek-0.021
Token toy
Token ID13373
Feature activation+5.23e-02

Pos logit contributions
neath+0.073
 valleys+0.067
 canyon+0.063
 ditch+0.060
 lake+0.059
Neg logit contributions
 forgiveness-0.067
 apology-0.065
Of-0.064
aid-0.061
olding-0.060
Token cat
Token ID3797
Feature activation+2.56e-02

Pos logit contributions
Of+0.117
aid+0.117
olding+0.115
 apology+0.112
 forgiveness+0.112
Neg logit contributions
 lake-0.085
 river-0.080
 camp-0.079
 pool-0.078
 ocean-0.076
Token.
Token ID13
Feature activation+3.10e-03

Pos logit contributions
Of+0.058
 apology+0.056
 forgiveness+0.056
olding+0.055
aid+0.055
Neg logit contributions
 lake-0.040
 camp-0.037
neath-0.037
 river-0.036
 pool-0.036
Token The
Token ID383
Feature activation+3.44e-02

Pos logit contributions
Of+0.007
 forgiveness+0.007
olding+0.007
aid+0.007
 apology+0.007
Neg logit contributions
 lake-0.005
neath-0.004
 camp-0.004
 shore-0.004
 creek-0.004
Token cat
Token ID3797
Feature activation+1.68e-02

Pos logit contributions
aid+0.080
Of+0.079
asing+0.078
olding+0.077
olded+0.077
Neg logit contributions
 lake-0.055
 pool-0.050
 camp-0.050
 river-0.050
 creek-0.049
Token was
Token ID373
Feature activation-3.77e-03

Pos logit contributions
Of+0.035
 forgiveness+0.034
olding+0.034
aid+0.034
 apology+0.033
Neg logit contributions
 lake-0.029
neath-0.028
 camp-0.027
 river-0.027
 creek-0.027
Token silver
Token ID8465
Feature activation+3.68e-03

Pos logit contributions
neath+0.007
 lake+0.007
 creek+0.006
 shore+0.006
 canyon+0.006
Neg logit contributions
Of-0.008
 forgiveness-0.007
aid-0.007
olding-0.007
 apology-0.007
Token and
Token ID290
Feature activation+1.82e-02

Pos logit contributions
Of+0.008
 forgiveness+0.008
olding+0.008
aid+0.008
 apology+0.008
Neg logit contributions
 lake-0.005
neath-0.005
 camp-0.005
 shore-0.005
 creek-0.005
Token,
Token ID11
Feature activation-1.44e-02

Pos logit contributions
 lake+0.081
neath+0.079
 canyon+0.078
 river+0.076
 creek+0.076
Neg logit contributions
 forgiveness-0.075
Of-0.072
aid-0.070
 else-0.069
olding-0.068
Token in
Token ID287
Feature activation-8.57e-03

Pos logit contributions
 lake+0.023
neath+0.023
 canyon+0.022
 creek+0.022
 river+0.022
Neg logit contributions
Of-0.031
 forgiveness-0.030
aid-0.028
olding-0.028
 apology-0.027
Token a
Token ID257
Feature activation-8.81e-03

Pos logit contributions
 lake+0.010
neath+0.009
 river+0.009
 shore+0.009
 creek+0.009
Neg logit contributions
 forgiveness-0.022
Of-0.022
aid-0.021
 apology-0.021
olding-0.021
Token far
Token ID1290
Feature activation-9.27e-05

Pos logit contributions
neath+0.013
 lake+0.012
 camp+0.012
 shore+0.012
 creek+0.012
Neg logit contributions
Of-0.020
 forgiveness-0.019
aid-0.019
olding-0.019
 apology-0.018
Token away
Token ID1497
Feature activation-3.62e-02

Pos logit contributions
neath+0.000
 lake+0.000
 camp+0.000
 creek+0.000
 shore+0.000
Neg logit contributions
Of-0.000
olding-0.000
 forgiveness-0.000
aid-0.000
 apology-0.000
Token corner
Token ID5228
Feature activation+1.67e-02

Pos logit contributions
neath+0.048
 lake+0.040
 tim+0.038
 canyon+0.037
 ditch+0.036
Neg logit contributions
 forgiveness-0.083
olding-0.081
Of-0.080
 apology-0.079
 else-0.077
Token,
Token ID11
Feature activation-5.13e-02

Pos logit contributions
Of+0.029
aid+0.028
 forgiveness+0.028
olding+0.028
 apology+0.027
Neg logit contributions
 lake-0.034
neath-0.033
 camp-0.033
 river-0.032
 shore-0.032
Token he
Token ID339
Feature activation-2.34e-02

Pos logit contributions
neath+0.098
 lake+0.093
 canyon+0.093
 river+0.088
 creek+0.088
Neg logit contributions
 forgiveness-0.091
Of-0.090
olding-0.079
 else-0.078
aid-0.077
Token saw
Token ID2497
Feature activation-7.69e-03

Pos logit contributions
neath+0.044
 lake+0.043
 creek+0.042
 camp+0.041
 canyon+0.041
Neg logit contributions
Of-0.041
olding-0.040
 forgiveness-0.039
olded-0.038
aid-0.038
Token a
Token ID257
Feature activation-3.57e-02

Pos logit contributions
neath+0.010
 lake+0.010
 camp+0.010
 canyon+0.009
 creek+0.009
Neg logit contributions
 forgiveness-0.018
Of-0.018
aid-0.018
 apology-0.017
olding-0.017
Token big
Token ID1263
Feature activation-7.55e-02

Pos logit contributions
neath+0.068
 valleys+0.059
 lake+0.057
 canyon+0.056
 camp+0.056
Neg logit contributions
Of-0.068
 forgiveness-0.066
 apology-0.062
olding-0.062
aid-0.061
Token of
Token ID286
Feature activation-3.36e-03

Pos logit contributions
neath+0.003
 lake+0.003
 camp+0.003
 creek+0.003
 canyon+0.003
Neg logit contributions
Of-0.003
 forgiveness-0.003
olding-0.003
aid-0.003
 apology-0.003
Token your
Token ID534
Feature activation+1.45e-02

Pos logit contributions
neath+0.005
 lake+0.005
 camp+0.005
 creek+0.005
 canyon+0.005
Neg logit contributions
 forgiveness-0.007
Of-0.007
aid-0.007
 apology-0.007
olding-0.007
Token business
Token ID1597
Feature activation-2.58e-02

Pos logit contributions
Of+0.032
aid+0.031
 forgiveness+0.030
olding+0.030
 apology+0.029
Neg logit contributions
 lake-0.024
 camp-0.022
neath-0.022
 river-0.022
 shore-0.022
Token!"
Token ID2474
Feature activation+2.10e-02

Pos logit contributions
neath+0.042
 lake+0.040
 camp+0.038
 canyon+0.038
 clearing+0.038
Neg logit contributions
olding-0.054
 forgiveness-0.053
 apology-0.052
Of-0.052
aid-0.051
Token
Token ID198
Feature activation+1.22e-02

Pos logit contributions
Of+0.050
olding+0.049
aid+0.049
 forgiveness+0.048
asing+0.048
Neg logit contributions
 lake-0.029
neath-0.028
 camp-0.028
 shore-0.027
 river-0.027
Token
Token ID198
Feature activation+2.40e-02

Pos logit contributions
Of+0.027
 forgiveness+0.026
olding+0.026
aid+0.026
 apology+0.025
Neg logit contributions
 lake-0.019
neath-0.019
 camp-0.018
 creek-0.018
 river-0.018
TokenHe
Token ID1544
Feature activation-2.37e-02

Pos logit contributions
Of+0.046
olding+0.045
 forgiveness+0.045
 apology+0.045
aid+0.043
Neg logit contributions
 lake-0.044
neath-0.042
 river-0.042
 valleys-0.041
 creek-0.041
Token gets
Token ID3011
Feature activation+1.19e-02

Pos logit contributions
 lake+0.045
neath+0.045
 shore+0.043
 river+0.042
 ocean+0.042
Neg logit contributions
Of-0.043
 apology-0.042
olding-0.041
 forgiveness-0.040
aid-0.040
Token back
Token ID736
Feature activation-1.48e-02

Pos logit contributions
Of+0.029
 forgiveness+0.028
olding+0.028
aid+0.028
 apology+0.028
Neg logit contributions
 lake-0.016
neath-0.015
 camp-0.015
 shore-0.014
 river-0.014
Token in
Token ID287
Feature activation-8.54e-03

Pos logit contributions
 lake+0.030
neath+0.029
 creek+0.028
 canyon+0.028
 river+0.028
Neg logit contributions
olding-0.026
 forgiveness-0.026
aid-0.025
Of-0.025
 apology-0.024
Token the
Token ID262
Feature activation-1.55e-02

Pos logit contributions
 lake+0.012
neath+0.012
 creek+0.011
 canyon+0.011
 river+0.011
Neg logit contributions
 forgiveness-0.020
Of-0.020
 apology-0.020
aid-0.019
olding-0.019
Token day
Token ID1110
Feature activation-2.53e-02

Pos logit contributions
Of+0.004
olding+0.004
 forgiveness+0.004
aid+0.004
 apology+0.004
Neg logit contributions
neath-0.005
 lake-0.005
 camp-0.005
 shore-0.005
 canyon-0.005
Token when
Token ID618
Feature activation-2.34e-02

Pos logit contributions
 lake+0.046
 canyon+0.045
neath+0.045
 river+0.044
 creek+0.044
Neg logit contributions
Of-0.046
 forgiveness-0.046
olding-0.044
 else-0.043
aid-0.043
Token they
Token ID484
Feature activation-2.61e-02

Pos logit contributions
 canyon+0.044
neath+0.043
 lake+0.043
 clearing+0.042
 ocean+0.041
Neg logit contributions
 forgiveness-0.040
Of-0.039
 else-0.035
 apology-0.034
 caring-0.034
Token were
Token ID547
Feature activation-4.77e-02

Pos logit contributions
 lake+0.052
 canyon+0.051
 clearing+0.050
 valleys+0.050
 ditch+0.050
Neg logit contributions
Of-0.042
 else-0.040
 forgiveness-0.040
aid-0.039
olding-0.037
Token playing
Token ID2712
Feature activation-1.94e-02

Pos logit contributions
neath+0.083
 lake+0.079
 creek+0.073
 valleys+0.072
 exp+0.070
Neg logit contributions
olding-0.094
 forgiveness-0.092
asing-0.090
aid-0.090
 classmates-0.089
Token,
Token ID11
Feature activation-2.46e-02

Pos logit contributions
neath+0.031
 lake+0.030
 canyon+0.028
 valleys+0.028
 shore+0.027
Neg logit contributions
 forgiveness-0.042
Of-0.041
aid-0.040
olding-0.039
 apology-0.039
Token Rabbit
Token ID25498
Feature activation-7.09e-02

Pos logit contributions
neath+0.045
 canyon+0.045
 lake+0.044
 ocean+0.043
 clearing+0.042
Neg logit contributions
Of-0.045
 forgiveness-0.045
 apology-0.039
aid-0.039
 else-0.039
Token noticed
Token ID6810
Feature activation-1.31e-02

Pos logit contributions
 canyon+0.155
neath+0.155
 valleys+0.152
icer+0.150
 lake+0.147
Neg logit contributions
 else-0.091
olded-0.090
Of-0.084
olding-0.084
 forgiveness-0.083
Token Bear
Token ID14732
Feature activation-4.60e-02

Pos logit contributions
neath+0.022
 lake+0.022
 canyon+0.022
 valleys+0.022
 camp+0.021
Neg logit contributions
 forgiveness-0.026
Of-0.025
aid-0.025
 apology-0.023
olding-0.023
Token had
Token ID550
Feature activation-2.32e-02

Pos logit contributions
neath+0.073
 lake+0.072
 valleys+0.072
 rav+0.070
 canyon+0.070
Neg logit contributions
 forgiveness-0.092
olding-0.089
 else-0.084
aid-0.084
asing-0.084
Token a
Token ID257
Feature activation-2.22e-02

Pos logit contributions
neath+0.036
 lake+0.036
 creek+0.035
 canyon+0.034
 camp+0.033
Neg logit contributions
 forgiveness-0.050
aid-0.049
Of-0.048
olding-0.046
 apology-0.045
Token
Token ID198
Feature activation+2.23e-02

Pos logit contributions
Of+0.039
 forgiveness+0.038
aid+0.038
olding+0.038
 apology+0.037
Neg logit contributions
 lake-0.025
neath-0.024
 camp-0.023
 creek-0.023
 shore-0.023
Token
Token ID198
Feature activation+1.76e-02

Pos logit contributions
Of+0.052
 forgiveness+0.052
olding+0.051
aid+0.051
asing+0.050
Neg logit contributions
 lake-0.031
neath-0.030
 camp-0.029
 shore-0.029
 creek-0.028
TokenWhen
Token ID2215
Feature activation-1.10e-02

Pos logit contributions
 forgiveness+0.030
Of+0.030
 apology+0.029
olding+0.029
aid+0.029
Neg logit contributions
 lake-0.036
neath-0.035
 camp-0.034
 shore-0.034
 creek-0.033
Token Mom
Token ID11254
Feature activation-4.59e-02

Pos logit contributions
neath+0.027
 lake+0.027
 canyon+0.026
 river+0.026
 creek+0.026
Neg logit contributions
Of-0.014
 forgiveness-0.014
aid-0.012
 apology-0.012
olding-0.012
Token is
Token ID318
Feature activation-7.04e-03

Pos logit contributions
 shore+0.093
 lake+0.092
 canyon+0.091
 creek+0.090
 river+0.090
Neg logit contributions
olding-0.070
olds-0.070
Of-0.067
olded-0.067
 forgiveness-0.067
Token busy
Token ID8179
Feature activation-3.43e-02

Pos logit contributions
 lake+0.014
neath+0.014
 creek+0.013
 river+0.013
 camp+0.013
Neg logit contributions
Of-0.012
olding-0.012
 forgiveness-0.012
aid-0.012
 apology-0.012
Token in
Token ID287
Feature activation-7.14e-03

Pos logit contributions
neath+0.056
 lake+0.056
 clearing+0.052
 valleys+0.052
 canyon+0.052
Neg logit contributions
 forgiveness-0.073
olding-0.072
Of-0.067
 apology-0.067
olds-0.065
Token the
Token ID262
Feature activation+1.31e-03

Pos logit contributions
 lake+0.008
neath+0.008
 river+0.007
 creek+0.007
 canyon+0.007
Neg logit contributions
 forgiveness-0.018
Of-0.018
olding-0.018
 apology-0.018
aid-0.018
Token kitchen
Token ID9592
Feature activation-2.90e-02

Pos logit contributions
Of+0.004
 forgiveness+0.003
olding+0.003
aid+0.003
 apology+0.003
Neg logit contributions
neath-0.001
 lake-0.001
 camp-0.001
 canyon-0.001
 shore-0.001
Token,
Token ID11
Feature activation-2.23e-02

Pos logit contributions
neath+0.048
 lake+0.047
 canyon+0.045
 clearing+0.045
 river+0.043
Neg logit contributions
 forgiveness-0.059
olding-0.058
 apology-0.056
Of-0.056
olds-0.054
Token they
Token ID484
Feature activation-5.67e-02

Pos logit contributions
neath+0.053
 canyon+0.053
 lake+0.053
 river+0.052
 clearing+0.052
Neg logit contributions
Of-0.031
 forgiveness-0.027
olding-0.026
 else-0.025
aid-0.025
Token She
Token ID1375
Feature activation-3.84e-03

Pos logit contributions
 canyon+0.109
 camp+0.108
 lake+0.107
 ocean+0.106
 camps+0.106
Neg logit contributions
Of-0.072
olding-0.058
olded-0.052
aid-0.052
olds-0.052
Token got
Token ID1392
Feature activation+6.46e-03

Pos logit contributions
neath+0.007
 lake+0.007
 camp+0.007
 creek+0.007
 shore+0.007
Neg logit contributions
Of-0.007
 forgiveness-0.007
aid-0.006
olding-0.006
 apology-0.006
Token scared
Token ID12008
Feature activation+9.64e-05

Pos logit contributions
Of+0.014
 forgiveness+0.013
olding+0.013
aid+0.013
 apology+0.013
Neg logit contributions
 lake-0.011
neath-0.010
 camp-0.010
 creek-0.010
 river-0.010
Token and
Token ID290
Feature activation-3.02e-03

Pos logit contributions
Of+0.000
 forgiveness+0.000
olding+0.000
aid+0.000
 apology+0.000
Neg logit contributions
 lake-0.000
neath-0.000
 camp-0.000
 creek-0.000
 river-0.000
Token began
Token ID2540
Feature activation+1.72e-03

Pos logit contributions
 lake+0.006
neath+0.006
 camp+0.005
 creek+0.005
 shore+0.005
Neg logit contributions
Of-0.006
 forgiveness-0.006
olding-0.005
aid-0.005
 apology-0.005
Token to
Token ID284
Feature activation-4.45e-02

Pos logit contributions
Of+0.003
 forgiveness+0.003
olding+0.003
aid+0.003
 apology+0.003
Neg logit contributions
 lake-0.004
neath-0.004
 camp-0.003
 river-0.003
 creek-0.003
Token panic
Token ID13619
Feature activation-8.37e-03

Pos logit contributions
neath+0.097
 valleys+0.093
icer+0.093
ground+0.091
 lake+0.090
Neg logit contributions
 apology-0.066
 forgiveness-0.065
Of-0.062
olding-0.059
 forgive-0.058
Token.
Token ID13
Feature activation-1.29e-03

Pos logit contributions
neath+0.018
 lake+0.018
 camp+0.016
 creek+0.016
 ocean+0.016
Neg logit contributions
 forgiveness-0.013
Of-0.013
olding-0.013
aid-0.013
 apology-0.013
Token The
Token ID383
Feature activation+1.61e-02

Pos logit contributions
 lake+0.002
neath+0.002
 camp+0.002
 creek+0.002
 canyon+0.002
Neg logit contributions
Of-0.003
olding-0.003
 forgiveness-0.003
aid-0.003
 apology-0.002
Token wave
Token ID6769
Feature activation+4.08e-02

Pos logit contributions
Of+0.038
aid+0.037
 forgiveness+0.037
olding+0.037
 apology+0.035
Neg logit contributions
 lake-0.024
 shore-0.022
 creek-0.022
 river-0.022
 camp-0.022
Token was
Token ID373
Feature activation-6.41e-03

Pos logit contributions
Of+0.098
 forgiveness+0.096
aid+0.095
olding+0.094
 apology+0.093
Neg logit contributions
 lake-0.057
neath-0.054
 river-0.053
 camp-0.052
 shore-0.050
Token play
Token ID711
Feature activation-2.62e-02

Pos logit contributions
neath+0.104
 valleys+0.093
 clearing+0.092
icer+0.088
 ditch+0.086
Neg logit contributions
 forgiveness-0.094
 apology-0.094
aid-0.093
 forgive-0.087
 respectful-0.086
Token and
Token ID290
Feature activation-2.52e-02

Pos logit contributions
neath+0.054
 valleys+0.048
 lake+0.048
 canyon+0.047
 shore+0.046
Neg logit contributions
Of-0.044
aid-0.043
 forgiveness-0.043
Deal-0.041
 else-0.041
Token splash
Token ID22870
Feature activation-1.27e-02

Pos logit contributions
neath+0.043
 lake+0.038
 clearing+0.037
 canyon+0.036
 camp+0.035
Neg logit contributions
Of-0.051
 forgiveness-0.051
 apology-0.049
aid-0.049
olding-0.047
Token in
Token ID287
Feature activation-5.93e-02

Pos logit contributions
neath+0.021
 lake+0.020
 clearing+0.019
 canyon+0.019
 camp+0.019
Neg logit contributions
Of-0.026
aid-0.025
 forgiveness-0.025
olding-0.024
 apology-0.024
Token the
Token ID262
Feature activation-5.92e-02

Pos logit contributions
acket+0.117
neath+0.114
 tim+0.100
 canyon+0.097
 Everest+0.095
Neg logit contributions
 else-0.096
 forgiveness-0.092
 apology-0.082
 forgive-0.076
 caring-0.076
Token water
Token ID1660
Feature activation-2.14e-02

Pos logit contributions
neath+0.103
 Everest+0.092
 tim+0.091
 OUT+0.083
 takeoff+0.082
Neg logit contributions
aid-0.101
Of-0.096
 else-0.096
ups-0.093
 forgiveness-0.087
Token all
Token ID477
Feature activation-7.78e-03

Pos logit contributions
neath+0.048
 lake+0.045
 clearing+0.044
 canyon+0.044
 camp+0.043
Neg logit contributions
olding-0.031
 forgiveness-0.030
Of-0.030
aid-0.030
 apology-0.028
Token day
Token ID1110
Feature activation-2.80e-02

Pos logit contributions
 lake+0.016
neath+0.016
 camp+0.016
 canyon+0.015
 creek+0.015
Neg logit contributions
Of-0.012
olding-0.012
aid-0.012
 forgiveness-0.012
 apology-0.011
Token.
Token ID13
Feature activation-1.80e-02

Pos logit contributions
neath+0.061
 lake+0.058
 clearing+0.055
 camp+0.055
 canyon+0.054
Neg logit contributions
olding-0.043
Of-0.041
 forgiveness-0.040
 else-0.039
aid-0.039
Token The
Token ID383
Feature activation-7.33e-03

Pos logit contributions
neath+0.033
 lake+0.031
 canyon+0.031
 shore+0.030
 ditch+0.029
Neg logit contributions
Of-0.036
olding-0.033
 forgiveness-0.032
 apology-0.032
aid-0.031
Token hipp
Token ID18568
Feature activation+1.08e-02

Pos logit contributions
neath+0.013
 lake+0.011
 camp+0.011
 canyon+0.011
 creek+0.011
Neg logit contributions
Of-0.015
 apology-0.015
 forgiveness-0.015
aid-0.015
olding-0.014
Token eat
Token ID4483
Feature activation-3.08e-02

Pos logit contributions
neath+0.052
 lake+0.048
 valleys+0.048
 canyon+0.047
 clearing+0.047
Neg logit contributions
 forgiveness-0.060
 apology-0.059
aid-0.059
Of-0.058
olding-0.055
Token snacks
Token ID28487
Feature activation+7.95e-05

Pos logit contributions
neath+0.061
 canyon+0.055
 valleys+0.053
 lake+0.053
 clearing+0.052
Neg logit contributions
 forgiveness-0.056
Of-0.054
aid-0.054
 apology-0.053
 else-0.052
Token.
Token ID13
Feature activation-3.41e-02

Pos logit contributions
Of+0.000
 forgiveness+0.000
aid+0.000
olding+0.000
 apology+0.000
Neg logit contributions
 lake-0.000
neath-0.000
 camp-0.000
 creek-0.000
 river-0.000
Token One
Token ID1881
Feature activation-7.35e-03

Pos logit contributions
neath+0.060
 canyon+0.058
 camp+0.058
 lake+0.056
 river+0.054
Neg logit contributions
Of-0.068
olding-0.059
 apology-0.059
Deal-0.057
 forgiveness-0.056
Token day
Token ID1110
Feature activation-2.49e-02

Pos logit contributions
neath+0.017
 lake+0.016
 canyon+0.016
 valleys+0.015
 shore+0.015
Neg logit contributions
Of-0.011
olding-0.011
aid-0.010
 forgiveness-0.010
 apology-0.010
Token,
Token ID11
Feature activation-1.93e-02

Pos logit contributions
 lake+0.041
neath+0.040
 canyon+0.039
 river+0.039
 camp+0.038
Neg logit contributions
Of-0.052
olding-0.050
 forgiveness-0.049
aid-0.048
 apology-0.048
Token Dave
Token ID9935
Feature activation-5.37e-02

Pos logit contributions
neath+0.030
 lake+0.030
 canyon+0.030
 camp+0.030
 river+0.029
Neg logit contributions
Of-0.043
olding-0.039
 forgiveness-0.038
 apology-0.037
aid-0.037
Token realised
Token ID19169
Feature activation-2.33e-02

Pos logit contributions
neath+0.119
 canyon+0.118
 lake+0.116
 valleys+0.115
 waterfall+0.114
Neg logit contributions
 else-0.070
olded-0.069
olding-0.068
ups-0.067
Of-0.065
Token that
Token ID326
Feature activation-1.61e-02

Pos logit contributions
neath+0.052
 lake+0.051
 valleys+0.050
 camp+0.050
 canyon+0.050
Neg logit contributions
 forgiveness-0.035
Of-0.033
aid-0.032
olding-0.031
 apology-0.030
Token he
Token ID339
Feature activation-5.47e-03

Pos logit contributions
neath+0.028
 lake+0.027
 canyon+0.026
 camp+0.026
 clearing+0.026
Neg logit contributions
 forgiveness-0.032
Of-0.032
olding-0.030
aid-0.030
 apology-0.030
Token wanted
Token ID2227
Feature activation-1.20e-02

Pos logit contributions
 lake+0.011
neath+0.011
 camp+0.011
 shore+0.011
 creek+0.011
Neg logit contributions
Of-0.009
 forgiveness-0.009
olding-0.009
aid-0.009
 apology-0.008
Token forward
Token ID2651
Feature activation-2.72e-02

Pos logit contributions
 lake+0.027
neath+0.027
 camp+0.026
 canyon+0.025
 creek+0.025
Neg logit contributions
Of-0.029
 forgiveness-0.029
olding-0.028
aid-0.028
 apology-0.027
Token.
Token ID13
Feature activation-1.59e-02

Pos logit contributions
neath+0.052
 lake+0.052
 creek+0.048
 canyon+0.048
 camp+0.048
Neg logit contributions
olding-0.048
 forgiveness-0.047
Of-0.047
aid-0.045
 apology-0.045
Token Finally
Token ID9461
Feature activation-2.95e-02

Pos logit contributions
 lake+0.027
 camp+0.026
 canyon+0.026
 ocean+0.026
 creek+0.026
Neg logit contributions
Of-0.033
olding-0.031
 forgiveness-0.031
aid-0.031
 apology-0.030
Token it
Token ID340
Feature activation-1.31e-02

Pos logit contributions
 lake+0.054
neath+0.053
 canyon+0.052
 creek+0.051
 ditch+0.051
Neg logit contributions
 forgiveness-0.055
Of-0.051
aid-0.049
 apology-0.049
 else-0.048
Token soared
Token ID35292
Feature activation-5.34e-02

Pos logit contributions
 lake+0.023
neath+0.023
 creek+0.023
 camp+0.022
 shore+0.022
Neg logit contributions
Of-0.025
aid-0.024
olding-0.024
 forgiveness-0.023
 apology-0.023
Token up
Token ID510
Feature activation-8.02e-02

Pos logit contributions
neath+0.089
 lake+0.087
 canyon+0.084
 ditch+0.083
 camp+0.081
Neg logit contributions
Of-0.101
 else-0.088
ups-0.088
olding-0.087
aid-0.087
Token to
Token ID284
Feature activation-4.36e-02

Pos logit contributions
latable+0.162
acket+0.160
 ditch+0.151
icer+0.147
 tim+0.146
Neg logit contributions
Of-0.095
 else-0.092
 wisely-0.091
 lesson-0.089
olding-0.080
Token the
Token ID262
Feature activation-1.72e-02

Pos logit contributions
neath+0.062
 ditch+0.060
 canyon+0.058
 creek+0.055
 lake+0.054
Neg logit contributions
 forgiveness-0.097
Of-0.096
 apology-0.092
 honesty-0.086
aid-0.085
Token top
Token ID1353
Feature activation-3.27e-02

Pos logit contributions
neath+0.021
 camp+0.018
 lake+0.018
 canyon+0.017
 shore+0.017
Neg logit contributions
Of-0.043
 forgiveness-0.042
olding-0.042
 apology-0.041
aid-0.040
Token of
Token ID286
Feature activation-3.68e-02

Pos logit contributions
neath+0.078
 lake+0.076
 camp+0.075
 creek+0.075
 canyon+0.074
Neg logit contributions
Of-0.037
olding-0.037
aid-0.035
 forgiveness-0.035
 else-0.035
Token the
Token ID262
Feature activation-1.78e-02

Pos logit contributions
neath+0.050
 creek+0.049
 tim+0.049
 canyon+0.048
 ditch+0.046
Neg logit contributions
 forgiveness-0.084
Of-0.077
 apology-0.076
aid-0.076
 honesty-0.073
Token the
Token ID262
Feature activation-1.72e-03

Pos logit contributions
Of+0.032
olding+0.031
 forgiveness+0.031
aid+0.031
 apology+0.030
Neg logit contributions
 lake-0.023
neath-0.022
 camp-0.021
 river-0.021
 shore-0.021
Token noise
Token ID7838
Feature activation-2.47e-03

Pos logit contributions
neath+0.003
 lake+0.003
 camp+0.002
 creek+0.002
 canyon+0.002
Neg logit contributions
Of-0.004
 forgiveness-0.004
olding-0.004
aid-0.004
 apology-0.004
Token of
Token ID286
Feature activation-3.08e-02

Pos logit contributions
 lake+0.004
neath+0.004
 camp+0.004
 creek+0.004
 river+0.004
Neg logit contributions
Of-0.005
 forgiveness-0.005
olding-0.005
aid-0.005
 apology-0.005
Token the
Token ID262
Feature activation-3.25e-03

Pos logit contributions
neath+0.041
 lake+0.039
 canyon+0.038
 creek+0.037
 river+0.037
Neg logit contributions
 forgiveness-0.075
 apology-0.071
Of-0.069
aid-0.069
olding-0.067
Token blocks
Token ID7021
Feature activation-3.15e-02

Pos logit contributions
neath+0.005
 lake+0.005
 camp+0.004
 creek+0.004
 canyon+0.004
Neg logit contributions
Of-0.007
 forgiveness-0.007
olding-0.007
aid-0.007
 apology-0.007
Token falling
Token ID7463
Feature activation-7.84e-02

Pos logit contributions
 lake+0.060
neath+0.059
 clearing+0.058
 camp+0.058
 canyon+0.057
Neg logit contributions
 forgiveness-0.054
olding-0.054
aid-0.053
Of-0.053
 apology-0.051
Token,
Token ID11
Feature activation-2.99e-02

Pos logit contributions
 clearing+0.140
 lake+0.138
 canyon+0.135
neath+0.135
 creek+0.130
Neg logit contributions
ups-0.120
olding-0.119
aid-0.117
aser-0.117
 forgiveness-0.115
Token they
Token ID484
Feature activation-1.71e-02

Pos logit contributions
neath+0.052
 lake+0.051
 camp+0.050
 canyon+0.049
 creek+0.049
Neg logit contributions
Of-0.058
 forgiveness-0.056
olding-0.055
 apology-0.054
aid-0.054
Token ran
Token ID4966
Feature activation-3.60e-02

Pos logit contributions
 lake+0.031
neath+0.030
 camp+0.030
 clearing+0.029
 canyon+0.029
Neg logit contributions
Of-0.032
olding-0.031
aid-0.031
 forgiveness-0.031
 apology-0.030
Token back
Token ID736
Feature activation-3.47e-02

Pos logit contributions
neath+0.072
 lake+0.071
 canyon+0.069
 ocean+0.067
 river+0.066
Neg logit contributions
Of-0.061
olding-0.059
aid-0.055
olded-0.054
 forgiveness-0.053
Token to
Token ID284
Feature activation-2.38e-02

Pos logit contributions
 lake+0.076
neath+0.075
 canyon+0.074
 ocean+0.072
 river+0.072
Neg logit contributions
olding-0.050
Of-0.048
 forgiveness-0.047
aid-0.045
 apology-0.045
Token are
Token ID389
Feature activation+7.29e-03

Pos logit contributions
Of+0.060
olding+0.060
 forgiveness+0.058
aid+0.058
 apology+0.057
Neg logit contributions
 lake-0.057
neath-0.057
 camp-0.055
 river-0.054
 creek-0.053
Token worth
Token ID2861
Feature activation+2.52e-02

Pos logit contributions
Of+0.013
 forgiveness+0.013
olding+0.013
aid+0.013
 apology+0.012
Neg logit contributions
 lake-0.014
neath-0.014
 camp-0.014
 creek-0.013
 river-0.013
Token looking
Token ID2045
Feature activation+2.63e-03

Pos logit contributions
Of+0.058
olding+0.055
aid+0.055
 apology+0.053
 forgiveness+0.053
Neg logit contributions
 lake-0.038
neath-0.037
 shore-0.036
 camp-0.035
 valleys-0.035
Token after
Token ID706
Feature activation-3.55e-03

Pos logit contributions
Of+0.005
 forgiveness+0.005
olding+0.005
aid+0.005
 apology+0.005
Neg logit contributions
 lake-0.005
neath-0.005
 camp-0.005
 creek-0.004
 river-0.004
Token.
Token ID13
Feature activation+1.23e-02

Pos logit contributions
 lake+0.006
neath+0.006
 camp+0.006
 creek+0.006
 river+0.006
Neg logit contributions
 forgiveness-0.007
Of-0.007
olding-0.007
aid-0.007
 apology-0.007
Token<|endoftext|>
Token ID50256
Feature activation-6.71e-02

Pos logit contributions
Of+0.027
aid+0.026
 forgiveness+0.026
olding+0.026
 apology+0.026
Neg logit contributions
 lake-0.019
neath-0.019
 camp-0.018
 creek-0.018
 river-0.017
TokenOnce
Token ID7454
Feature activation+3.90e-02

Pos logit contributions
 canyon+0.120
 river+0.114
 shore+0.113
 ditch+0.112
 lake+0.110
Neg logit contributions
Of-0.115
 forgiveness-0.109
aid-0.103
olding-0.102
 else-0.101
Token,
Token ID11
Feature activation+1.01e-02

Pos logit contributions
aid+0.099
Of+0.099
 forgiveness+0.096
olding+0.094
 apology+0.094
Neg logit contributions
neath-0.048
 lake-0.047
 camp-0.047
 creek-0.044
 ocean-0.043
Token there
Token ID612
Feature activation-2.52e-02

Pos logit contributions
Of+0.025
 forgiveness+0.024
aid+0.024
olding+0.024
 apology+0.024
Neg logit contributions
 lake-0.013
neath-0.013
 camp-0.012
 creek-0.012
 river-0.012
Token was
Token ID373
Feature activation-3.26e-02

Pos logit contributions
 canyon+0.053
 lake+0.052
neath+0.052
 creek+0.051
 river+0.051
Neg logit contributions
Of-0.039
 forgiveness-0.039
 else-0.036
olding-0.036
 apology-0.035
Token a
Token ID257
Feature activation-1.33e-02

Pos logit contributions
 canyon+0.050
 lake+0.048
neath+0.047
 river+0.047
 creek+0.046
Neg logit contributions
Of-0.072
 forgiveness-0.070
aid-0.069
 apology-0.065
olding-0.063
Token,
Token ID11
Feature activation-1.11e-02

Pos logit contributions
Of+0.077
olding+0.075
aid+0.075
 forgiveness+0.075
 apology+0.075
Neg logit contributions
 lake-0.025
neath-0.025
 shore-0.023
 camp-0.023
 river-0.022
Token "
Token ID366
Feature activation+5.19e-02

Pos logit contributions
 lake+0.022
neath+0.022
 camp+0.021
 canyon+0.021
 creek+0.020
Neg logit contributions
Of-0.020
olding-0.019
 forgiveness-0.019
aid-0.018
 apology-0.018
TokenWow
Token ID22017
Feature activation-3.55e-02

Pos logit contributions
 apology+0.103
 forgiveness+0.102
olding+0.098
asers+0.098
aser+0.097
Neg logit contributions
neath-0.092
 lake-0.091
 clearing-0.087
 creek-0.083
 river-0.081
Token,
Token ID11
Feature activation+1.87e-02

Pos logit contributions
 lake+0.068
 clearing+0.065
neath+0.065
 river+0.064
 canyon+0.063
Neg logit contributions
Of-0.063
 forgiveness-0.063
olding-0.063
 else-0.061
aser-0.061
Token that
Token ID326
Feature activation-2.73e-02

Pos logit contributions
Of+0.037
olding+0.037
aid+0.036
 forgiveness+0.036
 apology+0.035
Neg logit contributions
 lake-0.033
neath-0.032
 camp-0.032
 shore-0.031
 creek-0.031
Token fire
Token ID2046
Feature activation-7.32e-02

Pos logit contributions
neath+0.059
 valleys+0.056
 clearing+0.055
 canyon+0.055
 shore+0.054
Neg logit contributions
 forgiveness-0.043
Of-0.040
 apology-0.038
 else-0.038
 respectful-0.037
Tokenwork
Token ID1818
Feature activation-3.62e-02

Pos logit contributions
 valleys+0.156
 clearing+0.151
 ditch+0.151
 shore+0.147
 valley+0.146
Neg logit contributions
 forgiveness-0.101
aser-0.101
asing-0.100
olding-0.099
 cares-0.095
Token is
Token ID318
Feature activation-5.77e-02

Pos logit contributions
neath+0.067
 lake+0.064
 clearing+0.063
 valleys+0.063
 shore+0.063
Neg logit contributions
 forgiveness-0.065
 else-0.063
Of-0.063
olding-0.062
asing-0.061
Token so
Token ID523
Feature activation-2.82e-02

Pos logit contributions
neath+0.123
 valleys+0.113
 creek+0.112
 canyon+0.111
 valley+0.109
Neg logit contributions
 else-0.082
 caring-0.081
 forgiveness-0.080
 valuable-0.077
 respectful-0.077
Token pretty
Token ID2495
Feature activation-3.61e-02

Pos logit contributions
neath+0.057
 lake+0.050
 canyon+0.050
 clearing+0.050
 ditch+0.049
Neg logit contributions
Of-0.049
 else-0.048
olding-0.047
 apology-0.046
 forgiveness-0.046
Token!"
Token ID2474
Feature activation+1.62e-02

Pos logit contributions
neath+0.064
 lake+0.061
 ditch+0.061
 canyon+0.060
 clearing+0.060
Neg logit contributions
Of-0.065
 forgiveness-0.065
olding-0.063
 else-0.063
asing-0.062
Token alive
Token ID6776
Feature activation-3.74e-02

Pos logit contributions
Of+0.003
olding+0.003
aid+0.003
 forgiveness+0.003
 apology+0.003
Neg logit contributions
 lake-0.003
neath-0.003
 camp-0.003
 creek-0.003
 river-0.003
Token than
Token ID621
Feature activation+5.51e-03

Pos logit contributions
neath+0.067
 lake+0.063
 creek+0.061
 canyon+0.060
 river+0.060
Neg logit contributions
 forgiveness-0.069
Of-0.069
olding-0.067
aid-0.066
 respectful-0.063
Token ever
Token ID1683
Feature activation-9.44e-03

Pos logit contributions
Of+0.009
 forgiveness+0.008
olding+0.008
aid+0.008
 apology+0.008
Neg logit contributions
 lake-0.012
neath-0.012
 camp-0.012
 creek-0.011
 river-0.011
Token before
Token ID878
Feature activation-4.76e-02

Pos logit contributions
neath+0.020
 lake+0.020
 creek+0.019
 river+0.019
 camp+0.019
Neg logit contributions
Of-0.015
 forgiveness-0.015
olding-0.015
aid-0.014
 apology-0.014
Token!
Token ID0
Feature activation-4.41e-03

Pos logit contributions
neath+0.104
 lake+0.098
 creek+0.098
 canyon+0.097
 river+0.095
Neg logit contributions
 forgiveness-0.063
 else-0.057
 importantly-0.055
olding-0.053
 honesty-0.052
Token<|endoftext|>
Token ID50256
Feature activation-7.04e-02

Pos logit contributions
 lake+0.007
neath+0.007
 camp+0.007
 canyon+0.007
 creek+0.007
Neg logit contributions
Of-0.010
olding-0.009
 forgiveness-0.009
aid-0.009
 apology-0.009
TokenOnce
Token ID7454
Feature activation+3.09e-02

Pos logit contributions
 canyon+0.119
 river+0.114
 shore+0.112
 ditch+0.111
 lake+0.109
Neg logit contributions
Of-0.124
 forgiveness-0.117
aid-0.115
olding-0.113
 else-0.111
Token upon
Token ID2402
Feature activation-2.98e-02

Pos logit contributions
Of+0.076
aid+0.075
 forgiveness+0.074
olding+0.072
 apology+0.072
Neg logit contributions
neath-0.040
 lake-0.040
 camp-0.039
 creek-0.037
 ocean-0.037
Token a
Token ID257
Feature activation-3.49e-03

Pos logit contributions
neath+0.046
 lake+0.045
 canyon+0.043
 river+0.043
 creek+0.042
Neg logit contributions
Of-0.066
 forgiveness-0.065
aid-0.065
 else-0.062
 apology-0.061
Token time
Token ID640
Feature activation-1.70e-02

Pos logit contributions
 lake+0.005
neath+0.005
 shore+0.005
 river+0.005
 creek+0.005
Neg logit contributions
Of-0.008
olding-0.008
 forgiveness-0.008
aid-0.008
 apology-0.008
Token there
Token ID612
Feature activation-3.25e-02

Pos logit contributions
 lake+0.026
neath+0.026
 canyon+0.025
 river+0.025
 shore+0.024
Neg logit contributions
Of-0.037
 forgiveness-0.036
olding-0.036
aid-0.034
 apology-0.034