TOP ACTIVATIONS
MAX = 13.138

THE BE LL WHEN YOU FE EL LIKE IT ".
The voice said , " cong ratulations , you can have
was not the answer . Ap olog izing and being nice
bouncing around , an unpleasant od our wa ft ed into
T - Rex , a St eg osaurus , and even
it ! Enc our aged by her success
a time there was a Stub born Old Woman . She
long until Chloe felt her eyel ids getting heavy . She
and said : " G lad to help ".
Mom said , " G lad to see you both made
he shouted , " A P IG !!" His
and said , " G lad you had fun , Sam
and said , " G lad we could help . Have
mom said , " G lad you enjoyed it ."
girl who was feeling un conf ident . She wanted to
mom replied , " G lad you like it !"
theater and Larry felt his eyel ids become heavy . He
a situation may seem . Enc our agement can come from
. They got some bar bec ued meat . M ummy
You are an amazing resc uer ! I bet nobody

MIN ACTIVATIONS
MIN = -10.772

would <|endoftext|> John b iked to the park with his friends
<|endoftext|> P olly was playing at the park . She loved
<|endoftext|> I zzy was playing at home . She saw a
day , Amy was playing at the park with her friends
Billy went to play baseball at the park . Tim my
<|endoftext|> S andy was playing at the park with her friends
<|endoftext|> Jimmy wanted to climb to the top of the tree
. She wanted to go to bed , but she had
<|endoftext|> Sus ie was playing at the beach . She picked
He was going to go to the big , enormous zoo
olly and Max were playing at the beach . The sun
. They were taking John to the park . As they
to play on the swing at the park . She wanted
L ily wanted to come to the party with her big
ily and Ben were playing at the beach . They liked
remembering <|endoftext|> John was preparing to go fishing . He grabbed
um and Dad took Daisy to school one day . Daisy
large cop brought his car to the park . He wanted
Lily were eager to go to the beach . They liked
<|endoftext|> He idi was playing at the beach . She saw

INTERVAL 7.160 - 13.138
CONTAINS 0.512%

sit in the corner and think about what you did .
. She is sad and mad . She runs after Tom
That sounds like a great idea !" They both
smiled and said , " Yes , please king . I
declared that it was the funn est day ever ! <|endoftext|>

INTERVAL 1.183 - 7.160
CONTAINS 33.525%

was done , her mom said ,
, Jill spotted something very different in the landscape . It
saw that it was a big box . She tried to
boy named Tim my . Tim my loved to play with
all danced together to the slow beat of the music .

INTERVAL -4.794 - 1.183
CONTAINS 58.709%

tape to measure the table and a meter stick to measure
tri pped over his own feet . He had lots of
saw the living room , she was not happy . She
the neighborhood . He even rode it in the rain and
I have no friends to play with ." The

INTERVAL -10.772 - -4.794
CONTAINS 7.254%

" I lost my toy in the fountain ." Tim wanted
called Clara . She loved to go to the opera .
He had a new toy to play with outside . He
Lots of people went there to fly in big planes .
that was cool and decided to try soaking more things .
Token THE
Token ID3336
Feature activation+1.97e-01

Pos logit contributions
 darling+0.068
 concerned+0.067
 dear+0.067
ealing+0.067
 sorry+0.066
Neg logit contributions
|-0.064
 hunt-0.056
 land-0.056
 camp-0.053
 distance-0.052
Token BE
Token ID9348
Feature activation+5.65e+00

Pos logit contributions
 darling+0.005
 sorry+0.005
 dear+0.005
 concerned+0.005
ealing+0.005
Neg logit contributions
|-0.005
 land-0.004
 hunt-0.004
 camp-0.004
 distance-0.004
TokenLL
Token ID3069
Feature activation+8.29e+00

Pos logit contributions
 fault+0.126
 concerned+0.125
 darling+0.124
rier+0.124
 sorry+0.120
Neg logit contributions
|-0.152
 land-0.137
 hunt-0.136
 camp-0.124
 play-0.123
Token WHEN
Token ID42099
Feature activation+8.35e+00

Pos logit contributions
ealing+0.217
 darling+0.217
 sorry+0.217
 concerned+0.216
 dear+0.215
Neg logit contributions
|-0.184
 land-0.163
 hunt-0.163
 camp-0.156
 distance-0.150
Token YOU
Token ID7013
Feature activation+3.86e+00

Pos logit contributions
ealing+0.155
 sorry+0.154
rier+0.151
 darling+0.151
 concerned+0.150
Neg logit contributions
|-0.250
 hunt-0.228
 land-0.226
 distance-0.223
 camp-0.217
Token FE
Token ID18630
Feature activation+1.31e+01

Pos logit contributions
ealing+0.099
 darling+0.099
 sorry+0.099
 concerned+0.098
 dear+0.098
Neg logit contributions
|-0.088
 land-0.077
 hunt-0.077
 camp-0.074
 distance-0.074
TokenEL
Token ID3698
Feature activation+3.81e+00

Pos logit contributions
 concerned+0.286
rier+0.280
 darling+0.273
 sorry+0.272
 dear+0.265
Neg logit contributions
|-0.358
 hunt-0.316
 land-0.302
 camp-0.287
 play-0.281
Token LIKE
Token ID34178
Feature activation+6.05e+00

Pos logit contributions
ealing+0.098
 concerned+0.095
 darling+0.094
 dear+0.093
ushing+0.093
Neg logit contributions
|-0.085
 hunt-0.076
 land-0.074
 camp-0.073
 distance-0.070
Token IT
Token ID7283
Feature activation+1.92e+00

Pos logit contributions
 darling+0.124
 concerned+0.124
ealing+0.122
 sorry+0.122
 dear+0.122
Neg logit contributions
|-0.168
 hunt-0.152
 land-0.152
 camp-0.146
 distance-0.141
Token".
Token ID1911
Feature activation-1.48e+00

Pos logit contributions
ealing+0.048
 sorry+0.048
 darling+0.048
 concerned+0.048
rier+0.047
Neg logit contributions
|-0.045
 hunt-0.039
 land-0.038
 distance-0.037
 camp-0.037
Token
Token ID198
Feature activation-6.32e+00

Pos logit contributions
|+0.038
 hunt+0.035
 land+0.035
 camp+0.034
 distance+0.032
Neg logit contributions
 sorry-0.034
ealing-0.033
 dear-0.033
 darling-0.033
 fault-0.032
TokenThe
Token ID464
Feature activation+4.06e-01

Pos logit contributions
|+0.077
 hunt+0.074
 land+0.071
 camp+0.067
 distance+0.063
Neg logit contributions
 concerned-0.115
 darling-0.114
ealing-0.111
 sorry-0.110
 dear-0.110
Token voice
Token ID3809
Feature activation+2.16e+00

Pos logit contributions
 darling+0.010
 concerned+0.010
 sorry+0.010
 dear+0.010
ealing+0.009
Neg logit contributions
|-0.010
 hunt-0.009
 land-0.009
 camp-0.008
 distance-0.008
Token said
Token ID531
Feature activation+5.66e+00

Pos logit contributions
 concerned+0.056
 darling+0.055
 sorry+0.055
 dear+0.055
ealing+0.054
Neg logit contributions
|-0.050
 hunt-0.045
 land-0.043
 camp-0.042
 distance-0.040
Token,
Token ID11
Feature activation+4.54e+00

Pos logit contributions
 darling+0.146
 concerned+0.146
ealing+0.145
 sorry+0.144
 dear+0.144
Neg logit contributions
|-0.128
 land-0.112
 hunt-0.112
 camp-0.107
 distance-0.103
Token "
Token ID366
Feature activation+7.99e-01

Pos logit contributions
 fault+0.093
ealing+0.093
 darling+0.091
ushing+0.090
 dear+0.089
Neg logit contributions
|-0.123
 land-0.113
 hunt-0.113
 camp-0.109
 distance-0.105
Tokencong
Token ID36801
Feature activation+1.17e+01

Pos logit contributions
 darling+0.020
 concerned+0.020
ealing+0.019
 dear+0.019
 ok+0.019
Neg logit contributions
|-0.018
 hunt-0.017
 land-0.016
 camp-0.016
 distance-0.014
Tokenratulations
Token ID30167
Feature activation+4.02e+00

Pos logit contributions
 concerned+0.249
rier+0.238
 sorry+0.238
ealing+0.238
 darling+0.228
Neg logit contributions
|-0.314
 hunt-0.290
 land-0.288
 camp-0.277
 play-0.270
Token,
Token ID11
Feature activation+4.99e+00

Pos logit contributions
 concerned+0.120
 sorry+0.118
 darling+0.118
ealing+0.118
rier+0.117
Neg logit contributions
|-0.072
 land-0.065
 hunt-0.064
 camp-0.061
 distance-0.055
Token you
Token ID345
Feature activation+3.24e+00

Pos logit contributions
 darling+0.128
 dear+0.126
 sorry+0.126
 concerned+0.125
ealing+0.123
Neg logit contributions
|-0.117
 hunt-0.102
 land-0.101
 camp-0.096
 distance-0.095
Token can
Token ID460
Feature activation+1.97e-01

Pos logit contributions
 darling+0.089
 dear+0.088
 sorry+0.087
ealing+0.086
 concerned+0.085
Neg logit contributions
|-0.073
 hunt-0.061
 land-0.060
 distance-0.059
 camp-0.057
Token have
Token ID423
Feature activation-1.09e+00

Pos logit contributions
 darling+0.005
 sorry+0.005
 ok+0.005
 dear+0.005
 fault+0.005
Neg logit contributions
|-0.004
 distance-0.004
 camp-0.004
 land-0.003
fire-0.003
Token was
Token ID373
Feature activation+4.48e+00

Pos logit contributions
 darling+0.161
 concerned+0.161
ealing+0.160
 sorry+0.159
 dear+0.159
Neg logit contributions
|-0.085
 land-0.071
 hunt-0.071
 camp-0.066
 distance-0.063
Token not
Token ID407
Feature activation+2.54e+00

Pos logit contributions
ealing+0.107
rier+0.106
ushing+0.105
 darling+0.105
 concerned+0.104
Neg logit contributions
|-0.106
 land-0.097
 hunt-0.097
 camp-0.094
 distance-0.089
Token the
Token ID262
Feature activation+2.15e+00

Pos logit contributions
 concerned+0.061
 darling+0.061
ealing+0.061
ushing+0.060
rier+0.060
Neg logit contributions
|-0.061
 land-0.055
 hunt-0.055
 camp-0.053
 distance-0.050
Token answer
Token ID3280
Feature activation+3.87e+00

Pos logit contributions
ealing+0.051
 darling+0.051
 concerned+0.050
ushing+0.050
 dear+0.049
Neg logit contributions
|-0.053
 land-0.049
 hunt-0.048
 camp-0.046
 distance-0.045
Token.
Token ID13
Feature activation+2.27e+00

Pos logit contributions
 concerned+0.126
ealing+0.126
 darling+0.125
 sorry+0.123
rier+0.123
Neg logit contributions
|-0.059
 land-0.051
 hunt-0.051
 camp-0.046
 distance-0.044
Token Ap
Token ID5949
Feature activation+1.12e+01

Pos logit contributions
 concerned+0.060
ealing+0.060
rier+0.059
 darling+0.059
ushing+0.059
Neg logit contributions
|-0.050
 land-0.041
 camp-0.040
 hunt-0.040
 distance-0.037
Tokenolog
Token ID928
Feature activation+4.60e+00

Pos logit contributions
 concerned+0.211
 fault+0.208
 sorry+0.204
rier+0.202
 mistakes+0.198
Neg logit contributions
|-0.320
 hunt-0.289
 land-0.277
 play-0.269
ground-0.267
Tokenizing
Token ID2890
Feature activation+3.27e+00

Pos logit contributions
 fault+0.105
 dear+0.104
 darling+0.104
 sorry+0.102
 concerned+0.100
Neg logit contributions
|-0.122
 land-0.109
 hunt-0.108
 camp-0.099
 distance-0.098
Token and
Token ID290
Feature activation-9.98e-01

Pos logit contributions
ealing+0.103
ented+0.101
 concerned+0.100
rier+0.100
 darling+0.100
Neg logit contributions
|-0.053
 land-0.047
 hunt-0.045
 camp-0.045
 play-0.039
Token being
Token ID852
Feature activation+2.19e+00

Pos logit contributions
|+0.023
 hunt+0.020
 land+0.020
 camp+0.019
 distance+0.019
Neg logit contributions
 sorry-0.026
 darling-0.026
 concerned-0.026
 dear-0.025
ealing-0.025
Token nice
Token ID3621
Feature activation+4.19e+00

Pos logit contributions
 darling+0.046
rier+0.045
ealing+0.044
 concerned+0.044
 dear+0.044
Neg logit contributions
|-0.059
 land-0.055
 hunt-0.054
 camp-0.052
 distance-0.051
Token bouncing
Token ID30713
Feature activation-1.56e-01

Pos logit contributions
 sorry+0.020
 concerned+0.019
ealing+0.019
ushing+0.018
 dear+0.018
Neg logit contributions
|-0.019
growth-0.016
 hunt-0.016
 land-0.015
 camp-0.015
Token around
Token ID1088
Feature activation-5.29e+00

Pos logit contributions
|+0.003
 hunt+0.003
 land+0.003
 camp+0.003
 distance+0.002
Neg logit contributions
 sorry-0.004
 concerned-0.004
 darling-0.004
ealing-0.004
rier-0.004
Token,
Token ID11
Feature activation+2.82e+00

Pos logit contributions
|+0.136
 hunt+0.102
 distance+0.100
Ray+0.097
growth+0.095
Neg logit contributions
 sorry-0.127
 concerned-0.121
 darling-0.115
 dear-0.113
 mistakes-0.112
Token an
Token ID281
Feature activation+7.81e-01

Pos logit contributions
 sorry+0.081
 concerned+0.079
 darling+0.077
 dear+0.076
ealing+0.075
Neg logit contributions
|-0.064
 hunt-0.049
 land-0.047
 distance-0.044
 camp-0.044
Token unpleasant
Token ID22029
Feature activation+4.45e+00

Pos logit contributions
 fault+0.017
ealing+0.017
 darling+0.017
rier+0.016
ushing+0.016
Neg logit contributions
 land-0.020
|-0.020
 hunt-0.019
 forest-0.019
 camp-0.019
Token od
Token ID16298
Feature activation+1.12e+01

Pos logit contributions
 concerned+0.103
 sorry+0.102
 darling+0.102
 dear+0.102
rier+0.100
Neg logit contributions
|-0.116
 hunt-0.100
 land-0.095
 camp-0.092
 distance-0.090
Tokenour
Token ID454
Feature activation+1.03e+00

Pos logit contributions
 ok+0.219
 apology+0.218
 darling+0.214
 dear+0.212
 Promise+0.212
Neg logit contributions
|-0.273
 hunt-0.272
 land-0.272
 camp-0.265
field-0.248
Token wa
Token ID2082
Feature activation+5.18e+00

Pos logit contributions
 concerned+0.027
 dear+0.027
 darling+0.027
ushing+0.027
rier+0.026
Neg logit contributions
|-0.024
 hunt-0.020
 camp-0.019
 land-0.018
 play-0.017
Tokenft
Token ID701
Feature activation+4.66e-01

Pos logit contributions
 darling+0.099
 concerned+0.098
 fault+0.097
 sorry+0.097
 dear+0.097
Neg logit contributions
|-0.149
 land-0.137
 hunt-0.136
 camp-0.129
 distance-0.127
Tokened
Token ID276
Feature activation-2.51e-01

Pos logit contributions
ented+0.007
ealing+0.007
rier+0.007
ushing+0.007
 concerned+0.007
Neg logit contributions
|-0.015
 hunt-0.014
 land-0.013
 camp-0.013
 visit-0.013
Token into
Token ID656
Feature activation-6.43e+00

Pos logit contributions
|+0.006
 hunt+0.005
 land+0.004
 camp+0.004
 visit+0.004
Neg logit contributions
 concerned-0.007
 sorry-0.007
rier-0.007
 darling-0.007
ented-0.007
Token T
Token ID309
Feature activation+7.77e+00

Pos logit contributions
|+0.027
 hunt+0.022
 land+0.022
 camp+0.021
growth+0.020
Neg logit contributions
 darling-0.028
 dear-0.028
 concerned-0.028
 sorry-0.028
ealing-0.027
Token-
Token ID12
Feature activation+4.18e+00

Pos logit contributions
 concerned+0.179
 sorry+0.165
 darling+0.164
ealing+0.163
 fault+0.159
Neg logit contributions
|-0.204
 hunt-0.192
 land-0.182
 camp-0.178
 play-0.166
TokenRex
Token ID47389
Feature activation+1.68e+00

Pos logit contributions
 darling+0.094
 dear+0.093
 concerned+0.092
ealing+0.092
rier+0.091
Neg logit contributions
|-0.106
 hunt-0.097
 land-0.096
 camp-0.095
 distance-0.089
Token,
Token ID11
Feature activation+3.60e+00

Pos logit contributions
 sorry+0.043
 concerned+0.043
ealing+0.042
 darling+0.042
 dear+0.042
Neg logit contributions
|-0.039
 land-0.033
 distance-0.032
 hunt-0.032
 camp-0.030
Token a
Token ID257
Feature activation-2.06e-01

Pos logit contributions
 sorry+0.106
 concerned+0.105
 darling+0.105
 dear+0.104
ealing+0.101
Neg logit contributions
|-0.073
 hunt-0.060
 land-0.060
 camp-0.056
 distance-0.054
Token St
Token ID520
Feature activation+1.11e+01

Pos logit contributions
|+0.005
 hunt+0.004
 land+0.004
 camp+0.004
 distance+0.004
Neg logit contributions
 darling-0.005
 sorry-0.005
 dear-0.005
 concerned-0.005
ealing-0.005
Tokeneg
Token ID1533
Feature activation+3.50e+00

Pos logit contributions
 concerned+0.240
 darling+0.222
 sorry+0.219
rier+0.212
 mistakes+0.211
Neg logit contributions
|-0.297
 hunt-0.278
 land-0.273
 camp-0.265
 play-0.247
Tokenosaurus
Token ID47650
Feature activation+1.01e+00

Pos logit contributions
 darling+0.076
 dear+0.076
ealing+0.075
 sorry+0.075
ushing+0.074
Neg logit contributions
|-0.094
 land-0.084
 distance-0.083
 hunt-0.081
 camp-0.079
Token,
Token ID11
Feature activation+3.33e+00

Pos logit contributions
 dear+0.027
 darling+0.027
 sorry+0.026
 concerned+0.026
ealing+0.026
Neg logit contributions
|-0.024
 land-0.019
 hunt-0.019
 distance-0.019
 camp-0.018
Token and
Token ID290
Feature activation-1.28e+00

Pos logit contributions
 darling+0.085
 sorry+0.085
 dear+0.085
 concerned+0.084
ealing+0.079
Neg logit contributions
|-0.083
 land-0.067
 hunt-0.067
 camp-0.064
 distance-0.063
Token even
Token ID772
Feature activation+1.41e+00

Pos logit contributions
|+0.027
 hunt+0.021
 land+0.021
 distance+0.020
 camp+0.020
Neg logit contributions
 dear-0.038
 darling-0.037
 sorry-0.037
 concerned-0.037
ealing-0.036
Token it
Token ID340
Feature activation-4.49e-01

Pos logit contributions
 darling+0.049
 concerned+0.048
 sorry+0.048
 dear+0.048
ealing+0.048
Neg logit contributions
|-0.029
 hunt-0.024
 land-0.024
 distance-0.022
 camp-0.022
Token!
Token ID0
Feature activation-3.23e-01

Pos logit contributions
|+0.009
 hunt+0.008
 land+0.008
 camp+0.007
 distance+0.007
Neg logit contributions
 darling-0.013
 concerned-0.013
 dear-0.012
 sorry-0.012
ealing-0.012
Token
Token ID220
Feature activation+1.71e+00

Pos logit contributions
|+0.005
 land+0.004
 hunt+0.004
 camp+0.004
 distance+0.003
Neg logit contributions
 darling-0.011
 concerned-0.011
ealing-0.011
 sorry-0.011
 dear-0.011
Token
Token ID198
Feature activation-5.60e+00

Pos logit contributions
 darling+0.061
ealing+0.060
 sorry+0.060
 concerned+0.060
 dear+0.060
Neg logit contributions
|-0.022
 hunt-0.019
 land-0.019
 camp-0.017
 distance-0.016
Token
Token ID198
Feature activation-4.43e+00

Pos logit contributions
|+0.088
 hunt+0.086
 land+0.079
 camp+0.074
 distance+0.070
Neg logit contributions
 sorry-0.180
 concerned-0.179
 darling-0.179
ealing-0.177
 dear-0.174
TokenEnc
Token ID27195
Feature activation+1.11e+01

Pos logit contributions
|+0.089
 hunt+0.082
 land+0.077
 camp+0.075
 distance+0.069
Neg logit contributions
 concerned-0.127
ealing-0.126
 darling-0.126
 sorry-0.124
rier-0.122
Tokenour
Token ID454
Feature activation+2.74e+00

Pos logit contributions
 fault+0.193
 concerned+0.188
 mistakes+0.188
 darling+0.187
 dear+0.186
Neg logit contributions
|-0.334
 land-0.312
 hunt-0.303
field-0.293
ground-0.287
Tokenaged
Token ID1886
Feature activation+1.60e+00

Pos logit contributions
 darling+0.054
 concerned+0.054
ealing+0.053
 dear+0.053
 sorry+0.052
Neg logit contributions
|-0.081
 hunt-0.072
 land-0.071
 camp-0.069
 distance-0.069
Token by
Token ID416
Feature activation-1.33e+00

Pos logit contributions
 fault+0.047
 sorry+0.046
ealing+0.046
 concerned+0.046
 darling+0.046
Neg logit contributions
|-0.029
 land-0.026
 camp-0.024
 hunt-0.024
 distance-0.023
Token her
Token ID607
Feature activation+3.01e+00

Pos logit contributions
|+0.026
 land+0.022
 hunt+0.021
 camp+0.021
 distance+0.020
Neg logit contributions
ealing-0.039
 concerned-0.038
 fault-0.038
 darling-0.038
 dear-0.038
Token success
Token ID1943
Feature activation+2.84e+00

Pos logit contributions
rier+0.061
ealing+0.059
ares+0.058
 concerned+0.058
 darling+0.057
Neg logit contributions
 land-0.081
|-0.081
 camp-0.078
 hunt-0.077
 play-0.073
Token a
Token ID257
Feature activation-9.76e-01

Pos logit contributions
|+0.136
field+0.127
 hunt+0.122
ground+0.117
 distance+0.116
Neg logit contributions
 dear-0.131
 sorry-0.128
 darling-0.128
 okay-0.125
 ok-0.121
Token time
Token ID640
Feature activation+1.22e+00

Pos logit contributions
|+0.020
 hunt+0.017
 land+0.017
 camp+0.016
 distance+0.015
Neg logit contributions
 darling-0.028
 concerned-0.027
ealing-0.027
 dear-0.027
 sorry-0.027
Token there
Token ID612
Feature activation-3.21e+00

Pos logit contributions
 darling+0.034
 concerned+0.034
ealing+0.033
 sorry+0.033
 dear+0.033
Neg logit contributions
|-0.025
 hunt-0.022
 land-0.022
 camp-0.021
 distance-0.020
Token was
Token ID373
Feature activation+5.51e-01

Pos logit contributions
|+0.066
 hunt+0.064
 distance+0.055
fire+0.055
growth+0.055
Neg logit contributions
 darling-0.082
 dear-0.081
 fault-0.080
 sorry-0.080
 ok-0.078
Token a
Token ID257
Feature activation-1.58e+00

Pos logit contributions
 darling+0.019
 sorry+0.019
 dear+0.018
 concerned+0.018
ealing+0.018
Neg logit contributions
|-0.008
 hunt-0.007
 land-0.006
 distance-0.006
 camp-0.006
Token Stub
Token ID41135
Feature activation+1.10e+01

Pos logit contributions
|+0.040
 land+0.037
 hunt+0.036
 camp+0.035
 distance+0.034
Neg logit contributions
 darling-0.036
 concerned-0.036
ealing-0.036
 fault-0.035
 sorry-0.035
Tokenborn
Token ID6286
Feature activation+4.07e+00

Pos logit contributions
 darling+0.194
 concerned+0.193
ealing+0.190
 sorry+0.189
 dear+0.189
Neg logit contributions
|-0.343
 hunt-0.312
 land-0.311
 camp-0.299
 distance-0.292
Token Old
Token ID5706
Feature activation+4.40e+00

Pos logit contributions
 dear+0.108
 darling+0.108
ealing+0.101
 concerned+0.101
ushing+0.099
Neg logit contributions
|-0.095
 hunt-0.079
 land-0.074
 distance-0.073
 camp-0.072
Token Woman
Token ID14381
Feature activation+2.88e+00

Pos logit contributions
 darling+0.114
 dear+0.114
ealing+0.112
ushing+0.110
 concerned+0.110
Neg logit contributions
|-0.102
 hunt-0.086
 land-0.085
 distance-0.082
 camp-0.080
Token.
Token ID13
Feature activation+5.45e-01

Pos logit contributions
 darling+0.080
 dear+0.080
ushing+0.078
 concerned+0.077
 sorry+0.076
Neg logit contributions
|-0.061
 distance-0.050
 hunt-0.049
 camp-0.048
 land-0.048
Token She
Token ID1375
Feature activation-1.95e+00

Pos logit contributions
 darling+0.013
 concerned+0.013
 dear+0.013
ealing+0.013
 sorry+0.013
Neg logit contributions
|-0.013
 hunt-0.011
 land-0.011
 camp-0.011
 distance-0.011
Token long
Token ID890
Feature activation+1.45e+00

Pos logit contributions
 darling+0.048
 sorry+0.048
 concerned+0.048
 dear+0.047
ealing+0.047
Neg logit contributions
|-0.044
 hunt-0.040
 land-0.039
 camp-0.037
 distance-0.036
Token until
Token ID1566
Feature activation+1.96e+00

Pos logit contributions
 darling+0.044
 concerned+0.044
ealing+0.044
 sorry+0.043
 dear+0.043
Neg logit contributions
|-0.026
 hunt-0.022
 land-0.022
 camp-0.021
 distance-0.020
Token Chloe
Token ID29476
Feature activation+3.32e+00

Pos logit contributions
 darling+0.050
 sorry+0.050
 dear+0.050
 concerned+0.048
ealing+0.047
Neg logit contributions
|-0.047
 hunt-0.041
 land-0.039
 distance-0.037
 camp-0.037
Token felt
Token ID2936
Feature activation+5.04e+00

Pos logit contributions
 darling+0.100
 dear+0.098
ealing+0.097
 concerned+0.097
 sorry+0.096
Neg logit contributions
|-0.061
 hunt-0.051
 land-0.048
 camp-0.047
 distance-0.047
Token her
Token ID607
Feature activation+4.08e+00

Pos logit contributions
 sorry+0.116
 concerned+0.115
 darling+0.114
 dear+0.112
ealing+0.112
Neg logit contributions
|-0.133
 hunt-0.120
 land-0.116
 camp-0.112
 distance-0.107
Token eyel
Token ID29708
Feature activation+1.09e+01

Pos logit contributions
ealing+0.098
 concerned+0.097
ushing+0.094
rier+0.094
 darling+0.094
Neg logit contributions
|-0.099
 land-0.092
 hunt-0.089
 camp-0.087
 distance-0.083
Tokenids
Token ID2340
Feature activation-1.99e+00

Pos logit contributions
 concerned+0.210
 darling+0.202
rier+0.202
 dear+0.196
 unfair+0.196
Neg logit contributions
|-0.320
 land-0.274
 hunt-0.271
 camp-0.264
growth-0.252
Token getting
Token ID1972
Feature activation+1.82e+00

Pos logit contributions
|+0.043
 hunt+0.035
 camp+0.033
 land+0.033
growth+0.031
Neg logit contributions
 concerned-0.055
 sorry-0.055
 darling-0.054
ushing-0.054
 dear-0.053
Token heavy
Token ID4334
Feature activation+2.11e+00

Pos logit contributions
 concerned+0.044
 sorry+0.043
 darling+0.043
ealing+0.043
 dear+0.042
Neg logit contributions
|-0.046
 hunt-0.041
 land-0.040
 camp-0.038
 distance-0.037
Token.
Token ID13
Feature activation+6.14e-01

Pos logit contributions
ealing+0.070
 concerned+0.070
 darling+0.070
 sorry+0.069
 dear+0.069
Neg logit contributions
|-0.031
 land-0.027
 hunt-0.026
 camp-0.025
 distance-0.023
Token She
Token ID1375
Feature activation-1.74e+00

Pos logit contributions
 concerned+0.018
ealing+0.018
 darling+0.018
 sorry+0.018
rier+0.018
Neg logit contributions
|-0.012
 land-0.010
 hunt-0.010
 camp-0.009
 distance-0.009
Token and
Token ID290
Feature activation-3.63e-01

Pos logit contributions
 darling+0.168
 fault+0.168
ealing+0.167
 dear+0.166
 apology+0.164
Neg logit contributions
|-0.090
 land-0.089
 hunt-0.082
 camp-0.081
 distance-0.079
Token said
Token ID531
Feature activation+5.29e+00

Pos logit contributions
|+0.009
 hunt+0.008
 land+0.008
 camp+0.008
 distance+0.007
Neg logit contributions
 darling-0.009
 concerned-0.009
 sorry-0.009
 dear-0.009
ealing-0.009
Token:
Token ID25
Feature activation+2.63e+00

Pos logit contributions
ealing+0.156
 darling+0.155
ushing+0.154
 fault+0.154
 concerned+0.154
Neg logit contributions
|-0.091
 land-0.081
 hunt-0.080
 camp-0.079
 play-0.070
Token "
Token ID366
Feature activation+5.20e-01

Pos logit contributions
 sorry+0.049
 darling+0.049
 concerned+0.049
 dear+0.047
ealing+0.047
Neg logit contributions
|-0.080
 hunt-0.072
 land-0.072
 camp-0.069
 distance-0.069
TokenG
Token ID38
Feature activation+3.88e+00

Pos logit contributions
 darling+0.012
 concerned+0.012
ealing+0.012
 dear+0.012
ushing+0.012
Neg logit contributions
|-0.013
 hunt-0.012
 land-0.012
 camp-0.011
 distance-0.010
Tokenlad
Token ID9435
Feature activation+1.09e+01

Pos logit contributions
 concerned+0.076
 sorry+0.076
 darling+0.075
 dear+0.074
rier+0.072
Neg logit contributions
|-0.114
 hunt-0.102
 land-0.102
 camp-0.098
 distance-0.094
Token to
Token ID284
Feature activation-2.13e+00

Pos logit contributions
 concerned+0.344
ealing+0.331
 sorry+0.327
ushing+0.322
 fault+0.321
Neg logit contributions
|-0.178
 land-0.159
 hunt-0.153
 camp-0.147
 visit-0.137
Token help
Token ID1037
Feature activation+1.87e+00

Pos logit contributions
|+0.050
 camp+0.041
 distance+0.040
 land+0.039
 hunt+0.038
Neg logit contributions
 sorry-0.055
 darling-0.055
 dear-0.055
 apology-0.053
 ok-0.053
Token".
Token ID1911
Feature activation+2.79e-01

Pos logit contributions
 darling+0.052
 concerned+0.052
ealing+0.051
 sorry+0.051
 dear+0.051
Neg logit contributions
|-0.039
 hunt-0.034
 land-0.034
 camp-0.032
 distance-0.031
Token
Token ID220
Feature activation+2.71e+00

Pos logit contributions
 darling+0.007
 concerned+0.007
ealing+0.007
 sorry+0.007
 dear+0.007
Neg logit contributions
|-0.007
 hunt-0.006
 land-0.006
 camp-0.006
 distance-0.006
Token
Token ID198
Feature activation-4.79e+00

Pos logit contributions
 darling+0.098
ealing+0.098
 concerned+0.097
 sorry+0.097
 dear+0.096
Neg logit contributions
|-0.034
 hunt-0.028
 land-0.027
 distance-0.024
 camp-0.024
TokenMom
Token ID29252
Feature activation+1.63e+00

Pos logit contributions
|+0.074
 hunt+0.066
 land+0.066
 camp+0.062
 distance+0.058
Neg logit contributions
ealing-0.103
 concerned-0.101
 darling-0.100
 sorry-0.100
 dear-0.099
Token said
Token ID531
Feature activation+5.64e+00

Pos logit contributions
 darling+0.043
 concerned+0.042
ealing+0.042
 dear+0.042
 sorry+0.041
Neg logit contributions
|-0.037
 hunt-0.032
 land-0.032
 camp-0.030
 distance-0.030
Token,
Token ID11
Feature activation+4.95e+00

Pos logit contributions
 fault+0.199
ushing+0.196
ented+0.196
ares+0.193
ealing+0.193
Neg logit contributions
|-0.049
 the-0.049
 land-0.045
 camp-0.043
 hunt-0.042
Token "
Token ID366
Feature activation+1.36e+00

Pos logit contributions
 fault+0.130
ealing+0.126
ushing+0.122
 mistakes+0.122
 darling+0.122
Neg logit contributions
|-0.101
 hunt-0.096
 land-0.093
 camp-0.091
 visit-0.087
TokenG
Token ID38
Feature activation+4.62e+00

Pos logit contributions
 sorry+0.038
 concerned+0.037
ealing+0.037
 darling+0.037
 dear+0.037
Neg logit contributions
|-0.029
 land-0.025
 hunt-0.024
 distance-0.023
 camp-0.023
Tokenlad
Token ID9435
Feature activation+1.08e+01

Pos logit contributions
 concerned+0.088
 sorry+0.088
 darling+0.084
 dear+0.083
rier+0.082
Neg logit contributions
|-0.140
 hunt-0.124
 land-0.124
 camp-0.118
ground-0.114
Token to
Token ID284
Feature activation-2.53e+00

Pos logit contributions
 concerned+0.357
ealing+0.334
 sorry+0.331
 daughters+0.328
ushing+0.327
Neg logit contributions
|-0.160
 land-0.144
 hunt-0.137
 camp-0.129
 visit-0.128
Token see
Token ID766
Feature activation+4.01e-01

Pos logit contributions
|+0.052
 hunt+0.041
 camp+0.041
 land+0.040
 forests+0.039
Neg logit contributions
 darling-0.070
 sorry-0.070
 dear-0.069
 ok-0.069
 concerned-0.065
Token you
Token ID345
Feature activation+1.51e+00

Pos logit contributions
 darling+0.011
 concerned+0.011
ealing+0.011
 sorry+0.011
 dear+0.011
Neg logit contributions
|-0.008
 hunt-0.007
 land-0.007
 camp-0.007
 distance-0.007
Token both
Token ID1111
Feature activation-3.83e-01

Pos logit contributions
 darling+0.040
 sorry+0.040
ealing+0.039
 dear+0.039
 concerned+0.039
Neg logit contributions
|-0.035
 hunt-0.029
 land-0.029
 distance-0.028
 camp-0.027
Token made
Token ID925
Feature activation-5.64e-01

Pos logit contributions
|+0.008
 hunt+0.007
growth+0.007
 land+0.007
 distance+0.006
Neg logit contributions
 darling-0.010
 dear-0.010
 sorry-0.010
ealing-0.010
 concerned-0.010
Token he
Token ID339
Feature activation-1.64e-01

Pos logit contributions
 darling+0.074
 concerned+0.074
 sorry+0.073
ealing+0.073
 dear+0.073
Neg logit contributions
|-0.039
 hunt-0.032
 land-0.031
 camp-0.029
 distance-0.028
Token shouted
Token ID17293
Feature activation+4.92e+00

Pos logit contributions
|+0.004
 hunt+0.003
 land+0.003
 camp+0.003
 distance+0.003
Neg logit contributions
 concerned-0.004
 darling-0.004
ealing-0.004
 dear-0.004
 sorry-0.004
Token,
Token ID11
Feature activation+3.56e+00

Pos logit contributions
 fault+0.145
 darling+0.143
 concerned+0.141
 dear+0.141
ealing+0.141
Neg logit contributions
|-0.084
 land-0.078
 hunt-0.075
 camp-0.074
 play-0.070
Token "
Token ID366
Feature activation+1.07e+00

Pos logit contributions
ealing+0.074
 darling+0.074
 fault+0.074
 concerned+0.073
 dear+0.072
Neg logit contributions
|-0.096
 land-0.088
 hunt-0.088
 camp-0.084
 distance-0.082
TokenA
Token ID32
Feature activation+1.61e+00

Pos logit contributions
 darling+0.027
 concerned+0.027
ealing+0.027
 dear+0.026
rier+0.026
Neg logit contributions
|-0.025
 hunt-0.023
 land-0.022
 camp-0.022
 distance-0.020
Token P
Token ID350
Feature activation+1.08e+01

Pos logit contributions
 darling+0.040
 dear+0.040
 concerned+0.040
 sorry+0.040
ealing+0.039
Neg logit contributions
|-0.039
 land-0.033
 hunt-0.033
 camp-0.032
 distance-0.031
TokenIG
Token ID3528
Feature activation+8.68e+00

Pos logit contributions
 concerned+0.235
 sorry+0.221
rier+0.202
 ashamed+0.199
 amused+0.194
Neg logit contributions
|-0.294
 hunt-0.280
 land-0.272
 camp-0.270
 visit-0.247
Token!!"
Token ID37160
Feature activation+6.99e-01

Pos logit contributions
 concerned+0.201
 darling+0.196
 sorry+0.196
 fault+0.194
ealing+0.194
Neg logit contributions
|-0.212
 hunt-0.195
 land-0.194
 camp-0.187
 play-0.172
Token
Token ID198
Feature activation-4.95e+00

Pos logit contributions
 darling+0.022
 concerned+0.022
ealing+0.022
 sorry+0.022
 dear+0.022
Neg logit contributions
|-0.012
 hunt-0.010
 land-0.010
 camp-0.009
 distance-0.009
Token
Token ID198
Feature activation-4.22e+00

Pos logit contributions
|+0.077
 hunt+0.069
 land+0.065
 camp+0.058
 distance+0.058
Neg logit contributions
 sorry-0.165
 concerned-0.164
ealing-0.162
 darling-0.162
 apology-0.158
TokenHis
Token ID6653
Feature activation+5.28e+00

Pos logit contributions
|+0.084
 hunt+0.073
 land+0.071
 camp+0.068
 distance+0.064
Neg logit contributions
 concerned-0.123
ealing-0.123
 sorry-0.121
 darling-0.121
ushing-0.120
Token and
Token ID290
Feature activation-1.28e+00

Pos logit contributions
 fault+0.180
 darling+0.173
 dear+0.172
ealing+0.172
 apology+0.169
Neg logit contributions
|-0.069
 land-0.066
 play-0.061
 hunt-0.061
 camp-0.060
Token said
Token ID531
Feature activation+4.81e+00

Pos logit contributions
|+0.032
 hunt+0.028
 land+0.028
 camp+0.027
 distance+0.026
Neg logit contributions
 concerned-0.030
 darling-0.030
 sorry-0.030
ealing-0.030
 dear-0.030
Token,
Token ID11
Feature activation+4.07e+00

Pos logit contributions
 fault+0.156
ented+0.154
ealing+0.154
 concerned+0.154
 mistakes+0.152
Neg logit contributions
|-0.060
 land-0.054
 hunt-0.052
 camp-0.049
 play-0.049
Token "
Token ID366
Feature activation+6.55e-01

Pos logit contributions
 fault+0.107
ealing+0.103
 darling+0.101
ushing+0.100
 mistakes+0.100
Neg logit contributions
|-0.082
 land-0.078
 hunt-0.078
 camp-0.074
 play-0.073
TokenG
Token ID38
Feature activation+4.10e+00

Pos logit contributions
 darling+0.017
 concerned+0.017
ealing+0.017
 dear+0.016
 sorry+0.016
Neg logit contributions
|-0.015
 hunt-0.013
 land-0.013
 camp-0.013
 distance-0.012
Tokenlad
Token ID9435
Feature activation+1.08e+01

Pos logit contributions
 darling+0.082
 concerned+0.082
ealing+0.081
 sorry+0.081
 dear+0.080
Neg logit contributions
|-0.117
 hunt-0.106
 land-0.106
 camp-0.101
 distance-0.099
Token you
Token ID345
Feature activation+3.34e+00

Pos logit contributions
 concerned+0.342
ealing+0.325
 sorry+0.321
rier+0.316
 daughters+0.316
Neg logit contributions
|-0.178
 land-0.162
 hunt-0.156
 camp-0.145
 visit-0.140
Token had
Token ID550
Feature activation+1.66e+00

Pos logit contributions
ealing+0.089
 darling+0.089
 dear+0.087
 sorry+0.085
ented+0.084
Neg logit contributions
|-0.072
 camp-0.062
 distance-0.062
 land-0.061
 hunt-0.061
Token fun
Token ID1257
Feature activation+3.08e+00

Pos logit contributions
 darling+0.050
 dear+0.050
 sorry+0.049
ealing+0.048
 concerned+0.047
Neg logit contributions
|-0.034
 land-0.027
 camp-0.026
 distance-0.026
 hunt-0.026
Token,
Token ID11
Feature activation+4.15e+00

Pos logit contributions
 concerned+0.086
ealing+0.086
 darling+0.085
rier+0.085
 dear+0.084
Neg logit contributions
|-0.062
 land-0.056
 hunt-0.055
 camp-0.053
 play-0.049
Token Sam
Token ID3409
Feature activation+3.73e+00

Pos logit contributions
 concerned+0.118
rier+0.118
ealing+0.116
 fault+0.113
ushing+0.113
Neg logit contributions
 land-0.073
 hunt-0.072
|-0.071
 camp-0.068
 play-0.067
Token and
Token ID290
Feature activation-1.15e-01

Pos logit contributions
 darling+0.147
 concerned+0.147
ealing+0.146
 dear+0.145
 sorry+0.145
Neg logit contributions
|-0.116
 land-0.104
 hunt-0.103
 camp-0.099
 distance-0.095
Token said
Token ID531
Feature activation+4.88e+00

Pos logit contributions
|+0.003
 hunt+0.002
 land+0.002
 camp+0.002
 distance+0.002
Neg logit contributions
 sorry-0.003
 darling-0.003
 concerned-0.003
 dear-0.003
ealing-0.003
Token,
Token ID11
Feature activation+3.97e+00

Pos logit contributions
 concerned+0.141
 fault+0.141
 darling+0.140
ealing+0.139
ushing+0.139
Neg logit contributions
|-0.089
 land-0.081
 hunt-0.078
 camp-0.077
 play-0.068
Token "
Token ID366
Feature activation+8.81e-01

Pos logit contributions
 darling+0.081
ealing+0.080
 concerned+0.080
 fault+0.080
 dear+0.079
Neg logit contributions
|-0.110
 hunt-0.101
 land-0.100
 camp-0.097
 distance-0.094
TokenG
Token ID38
Feature activation+4.44e+00

Pos logit contributions
 darling+0.022
 concerned+0.021
 dear+0.021
ealing+0.021
rier+0.021
Neg logit contributions
|-0.021
 hunt-0.019
 land-0.019
 camp-0.018
 distance-0.017
Tokenlad
Token ID9435
Feature activation+1.07e+01

Pos logit contributions
 concerned+0.091
 sorry+0.090
 darling+0.089
 dear+0.088
ealing+0.087
Neg logit contributions
|-0.125
 land-0.114
 hunt-0.113
 camp-0.108
 distance-0.105
Token we
Token ID356
Feature activation+8.92e-01

Pos logit contributions
 concerned+0.306
ealing+0.297
 sorry+0.295
 darling+0.291
 fault+0.291
Neg logit contributions
|-0.211
 land-0.190
 hunt-0.186
 camp-0.177
 distance-0.167
Token could
Token ID714
Feature activation-8.66e-01

Pos logit contributions
 sorry+0.024
ealing+0.024
 ok+0.024
 concerned+0.023
 darling+0.023
Neg logit contributions
|-0.019
 distance-0.016
 land-0.015
 hunt-0.015
 camp-0.015
Token help
Token ID1037
Feature activation+7.51e-01

Pos logit contributions
|+0.019
 distance+0.014
growth+0.013
 camp+0.013
 land+0.013
Neg logit contributions
 darling-0.024
 dear-0.024
 ok-0.024
 sorry-0.023
ushing-0.022
Token.
Token ID13
Feature activation+2.64e+00

Pos logit contributions
 darling+0.021
 dear+0.020
rier+0.020
 sorry+0.020
 concerned+0.019
Neg logit contributions
|-0.018
 hunt-0.013
 distance-0.013
 land-0.013
 camp-0.012
Token Have
Token ID8192
Feature activation-3.96e-01

Pos logit contributions
 concerned+0.053
ealing+0.052
 darling+0.051
rier+0.051
 sorry+0.051
Neg logit contributions
|-0.076
 land-0.068
 hunt-0.068
 camp-0.066
 distance-0.063
Token mom
Token ID1995
Feature activation+7.72e-01

Pos logit contributions
rier+0.104
ares+0.100
ealing+0.098
ushing+0.097
ented+0.096
Neg logit contributions
|-0.098
 land-0.097
 camp-0.094
 play-0.091
 hunt-0.087
Token said
Token ID531
Feature activation+4.69e+00

Pos logit contributions
 darling+0.017
 concerned+0.017
 dear+0.017
ealing+0.017
 sorry+0.017
Neg logit contributions
|-0.021
 hunt-0.019
 land-0.018
 camp-0.018
 distance-0.017
Token,
Token ID11
Feature activation+3.96e+00

Pos logit contributions
 fault+0.159
ealing+0.153
ented+0.152
ushing+0.151
ares+0.151
Neg logit contributions
|-0.058
 land-0.052
 play-0.048
 hunt-0.047
 camp-0.047
Token "
Token ID366
Feature activation+4.35e-01

Pos logit contributions
ealing+0.080
 darling+0.080
 fault+0.080
 concerned+0.079
 dear+0.078
Neg logit contributions
|-0.110
 land-0.100
 hunt-0.100
 camp-0.096
 distance-0.093
TokenG
Token ID38
Feature activation+3.97e+00

Pos logit contributions
 darling+0.011
 dear+0.010
 concerned+0.010
ealing+0.010
rier+0.010
Neg logit contributions
|-0.010
 hunt-0.010
 land-0.009
 camp-0.009
 distance-0.009
Tokenlad
Token ID9435
Feature activation+1.07e+01

Pos logit contributions
 concerned+0.080
 darling+0.080
 sorry+0.079
ealing+0.079
 dear+0.078
Neg logit contributions
|-0.113
 land-0.102
 hunt-0.102
 camp-0.098
 distance-0.095
Token you
Token ID345
Feature activation+3.56e+00

Pos logit contributions
 concerned+0.308
ealing+0.300
 sorry+0.295
ushing+0.291
 fault+0.289
Neg logit contributions
|-0.211
 land-0.185
 camp-0.178
 hunt-0.176
 visit-0.163
Token enjoyed
Token ID8359
Feature activation+9.80e-01

Pos logit contributions
 darling+0.098
ealing+0.097
 dear+0.096
 sorry+0.095
 concerned+0.093
Neg logit contributions
|-0.074
 hunt-0.064
 land-0.063
 camp-0.062
 distance-0.062
Token it
Token ID340
Feature activation-1.19e+00

Pos logit contributions
 darling+0.030
 sorry+0.029
 dear+0.029
 concerned+0.029
ealing+0.029
Neg logit contributions
|-0.019
 hunt-0.016
 land-0.015
 distance-0.014
 camp-0.014
Token."
Token ID526
Feature activation-8.49e-02

Pos logit contributions
|+0.026
 hunt+0.022
 land+0.022
 camp+0.021
 distance+0.021
Neg logit contributions
 darling-0.032
 concerned-0.032
 sorry-0.032
 dear-0.032
ealing-0.031
Token
Token ID220
Feature activation+2.60e+00

Pos logit contributions
|+0.002
 hunt+0.001
 land+0.001
 distance+0.001
 camp+0.001
Neg logit contributions
 darling-0.003
 dear-0.002
 concerned-0.002
 sorry-0.002
ealing-0.002
Token girl
Token ID2576
Feature activation+1.53e+00

Pos logit contributions
 fault+0.075
ealing+0.074
 concerned+0.073
 darling+0.072
ushing+0.072
Neg logit contributions
|-0.084
 land-0.080
 hunt-0.076
 camp-0.076
 distance-0.072
Token who
Token ID508
Feature activation-1.58e-01

Pos logit contributions
 sorry+0.045
 concerned+0.045
 darling+0.044
 dear+0.044
ushing+0.043
Neg logit contributions
|-0.029
 hunt-0.025
 camp-0.024
 land-0.023
 distance-0.022
Token was
Token ID373
Feature activation+2.13e+00

Pos logit contributions
|+0.004
 hunt+0.003
 camp+0.003
 land+0.003
 distance+0.003
Neg logit contributions
 sorry-0.004
 concerned-0.004
 darling-0.004
 dear-0.004
ealing-0.004
Token feeling
Token ID4203
Feature activation+4.44e+00

Pos logit contributions
 concerned+0.055
 darling+0.055
 sorry+0.055
ealing+0.054
 dear+0.054
Neg logit contributions
|-0.048
 hunt-0.043
 land-0.042
 camp-0.040
 distance-0.039
Token un
Token ID555
Feature activation+3.71e+00

Pos logit contributions
 darling+0.102
 concerned+0.101
ealing+0.101
 dear+0.099
 fault+0.099
Neg logit contributions
|-0.114
 land-0.102
 hunt-0.101
 camp-0.097
 distance-0.094
Tokenconf
Token ID10414
Feature activation+1.07e+01

Pos logit contributions
 darling+0.084
 dear+0.083
 fault+0.083
 sorry+0.083
ealing+0.083
Neg logit contributions
|-0.098
 hunt-0.086
 land-0.086
 camp-0.081
 distance-0.079
Tokenident
Token ID738
Feature activation+5.15e+00

Pos logit contributions
 darling+0.234
 dear+0.230
 concerned+0.229
 fault+0.227
 sorry+0.225
Neg logit contributions
|-0.277
 hunt-0.252
 land-0.246
 camp-0.227
 play-0.225
Token.
Token ID13
Feature activation-6.23e-01

Pos logit contributions
 darling+0.170
 concerned+0.170
ealing+0.169
 sorry+0.168
 dear+0.168
Neg logit contributions
|-0.080
 hunt-0.065
 land-0.065
 camp-0.060
 distance-0.057
Token She
Token ID1375
Feature activation-2.41e+00

Pos logit contributions
|+0.016
 land+0.015
 hunt+0.015
 camp+0.014
 distance+0.014
Neg logit contributions
 concerned-0.014
 darling-0.014
ealing-0.014
 sorry-0.014
 dear-0.014
Token wanted
Token ID2227
Feature activation+1.87e+00

Pos logit contributions
|+0.054
 hunt+0.048
 distance+0.048
 camp+0.046
 land+0.045
Neg logit contributions
 concerned-0.061
 sorry-0.061
 dear-0.060
 darling-0.060
ealing-0.060
Token to
Token ID284
Feature activation-7.31e+00

Pos logit contributions
 fault+0.053
 sorry+0.052
 dear+0.052
 darling+0.052
 concerned+0.052
Neg logit contributions
|-0.040
 hunt-0.032
 land-0.031
 camp-0.030
 distance-0.029
Token mom
Token ID1995
Feature activation+6.49e-01

Pos logit contributions
ealing+0.096
rier+0.095
ares+0.093
ushing+0.092
 concerned+0.092
Neg logit contributions
|-0.099
 land-0.094
 camp-0.090
 hunt-0.089
 play-0.088
Token replied
Token ID8712
Feature activation+3.96e+00

Pos logit contributions
 darling+0.015
 concerned+0.015
 dear+0.015
ealing+0.015
ushing+0.015
Neg logit contributions
|-0.017
 hunt-0.015
 land-0.015
 camp-0.014
 distance-0.014
Token,
Token ID11
Feature activation+3.71e+00

Pos logit contributions
 fault+0.136
ealing+0.129
ented+0.129
asers+0.128
 darling+0.127
Neg logit contributions
|-0.048
 land-0.044
 hunt-0.039
 play-0.039
 camp-0.038
Token "
Token ID366
Feature activation+1.79e-01

Pos logit contributions
 fault+0.081
ealing+0.079
 darling+0.078
 concerned+0.077
ushing+0.077
Neg logit contributions
|-0.097
 land-0.090
 hunt-0.089
 camp-0.086
 distance-0.083
TokenG
Token ID38
Feature activation+3.68e+00

Pos logit contributions
 darling+0.005
 dear+0.004
 concerned+0.004
ealing+0.004
ushing+0.004
Neg logit contributions
|-0.004
 hunt-0.004
 land-0.004
 camp-0.004
 distance-0.003
Tokenlad
Token ID9435
Feature activation+1.06e+01

Pos logit contributions
 concerned+0.075
 sorry+0.074
 darling+0.074
ealing+0.073
 dear+0.073
Neg logit contributions
|-0.104
 land-0.094
 hunt-0.094
 camp-0.089
 distance-0.087
Token you
Token ID345
Feature activation+3.52e+00

Pos logit contributions
 concerned+0.338
ealing+0.322
 sorry+0.319
ushing+0.312
 fault+0.311
Neg logit contributions
|-0.176
 land-0.162
 hunt-0.150
 camp-0.143
 visit-0.141
Token like
Token ID588
Feature activation+4.67e-01

Pos logit contributions
 darling+0.091
ealing+0.090
 dear+0.090
 sorry+0.090
 concerned+0.088
Neg logit contributions
|-0.081
 hunt-0.069
 land-0.069
 distance-0.068
 camp-0.067
Token it
Token ID340
Feature activation+9.06e-01

Pos logit contributions
ealing+0.014
 concerned+0.014
 darling+0.014
 sorry+0.014
rier+0.014
Neg logit contributions
|-0.009
 land-0.007
 hunt-0.007
 camp-0.007
 distance-0.007
Token!"
Token ID2474
Feature activation-6.07e-01

Pos logit contributions
 darling+0.025
 concerned+0.025
 dear+0.025
 sorry+0.025
ealing+0.025
Neg logit contributions
|-0.019
 hunt-0.016
 land-0.016
 camp-0.015
 distance-0.015
Token
Token ID198
Feature activation-6.13e+00

Pos logit contributions
|+0.015
 hunt+0.013
 land+0.013
 camp+0.012
 distance+0.012
Neg logit contributions
 darling-0.015
 concerned-0.015
ealing-0.015
 sorry-0.015
 dear-0.015
Token theater
Token ID13766
Feature activation-2.56e+00

Pos logit contributions
|+0.062
 hunt+0.054
Buzz+0.050
Ray+0.049
 roam+0.048
Neg logit contributions
 sorry-0.080
 dear-0.077
 darling-0.077
 concerned-0.075
 mistakes-0.073
Token and
Token ID290
Feature activation-1.48e+00

Pos logit contributions
|+0.058
 hunt+0.045
growth+0.043
 land+0.043
 camp+0.042
Neg logit contributions
 darling-0.070
 dear-0.069
 concerned-0.067
 sorry-0.067
ushing-0.066
Token Larry
Token ID13633
Feature activation+2.51e-01

Pos logit contributions
|+0.033
 hunt+0.026
 land+0.024
 camp+0.023
growth+0.022
Neg logit contributions
 sorry-0.042
 concerned-0.041
 darling-0.040
 dear-0.039
 ok-0.039
Token felt
Token ID2936
Feature activation+4.97e+00

Pos logit contributions
 concerned+0.007
 darling+0.007
ealing+0.007
 sorry+0.007
 dear+0.007
Neg logit contributions
|-0.005
 hunt-0.004
 land-0.004
 camp-0.004
 distance-0.004
Token his
Token ID465
Feature activation+3.98e+00

Pos logit contributions
 sorry+0.111
 concerned+0.110
 darling+0.107
ealing+0.106
 dear+0.106
Neg logit contributions
|-0.137
 hunt-0.123
 land-0.118
 camp-0.114
 distance-0.110
Token eyel
Token ID29708
Feature activation+1.06e+01

Pos logit contributions
 darling+0.086
 sorry+0.085
 dear+0.085
 concerned+0.084
 fault+0.083
Neg logit contributions
|-0.109
 hunt-0.098
 land-0.096
 camp-0.091
 distance-0.090
Tokenids
Token ID2340
Feature activation-2.44e+00

Pos logit contributions
 concerned+0.213
 darling+0.207
rier+0.202
 dear+0.200
 mistakes+0.198
Neg logit contributions
|-0.308
 land-0.268
 hunt-0.265
 camp-0.253
growth-0.244
Token become
Token ID1716
Feature activation+2.57e+00

Pos logit contributions
|+0.053
 hunt+0.041
 land+0.040
 camp+0.040
growth+0.037
Neg logit contributions
 concerned-0.069
 sorry-0.068
 darling-0.067
ushing-0.067
rier-0.066
Token heavy
Token ID4334
Feature activation+2.92e+00

Pos logit contributions
 concerned+0.061
 sorry+0.061
rier+0.059
ealing+0.059
 darling+0.059
Neg logit contributions
|-0.067
 hunt-0.059
 land-0.055
 camp-0.053
 distance-0.052
Token.
Token ID13
Feature activation+1.02e-01

Pos logit contributions
 concerned+0.097
 darling+0.097
ealing+0.097
 sorry+0.096
 dear+0.096
Neg logit contributions
|-0.044
 land-0.037
 hunt-0.037
 camp-0.034
 distance-0.032
Token He
Token ID679
Feature activation-1.96e+00

Pos logit contributions
 darling+0.003
 concerned+0.003
 sorry+0.003
ealing+0.003
 dear+0.003
Neg logit contributions
|-0.002
 hunt-0.002
 land-0.002
 camp-0.002
 distance-0.002
Token a
Token ID257
Feature activation+3.15e-01

Pos logit contributions
 concerned+0.113
 darling+0.112
ealing+0.112
 sorry+0.111
 dear+0.110
Neg logit contributions
|-0.085
 land-0.076
 hunt-0.076
 camp-0.071
 distance-0.068
Token situation
Token ID3074
Feature activation+2.91e+00

Pos logit contributions
 darling+0.007
 concerned+0.007
 sorry+0.007
ealing+0.007
 dear+0.007
Neg logit contributions
|-0.008
 hunt-0.007
 land-0.007
 camp-0.007
 distance-0.007
Token may
Token ID743
Feature activation+1.43e+00

Pos logit contributions
 darling+0.079
 dear+0.078
 sorry+0.078
 concerned+0.078
ealing+0.078
Neg logit contributions
|-0.063
 hunt-0.053
 land-0.053
 camp-0.050
 distance-0.049
Token seem
Token ID1283
Feature activation+4.05e+00

Pos logit contributions
 fault+0.037
 sorry+0.036
 ok+0.035
 darling+0.035
 mistakes+0.034
Neg logit contributions
|-0.033
 camp-0.026
growth-0.026
 hunt-0.025
 distance-0.025
Token.
Token ID13
Feature activation+6.98e-01

Pos logit contributions
 darling+0.123
 concerned+0.122
ealing+0.121
 sorry+0.121
 dear+0.121
Neg logit contributions
|-0.074
 hunt-0.062
 land-0.062
 camp-0.058
 distance-0.055
Token Enc
Token ID14711
Feature activation+1.06e+01

Pos logit contributions
 concerned+0.022
 darling+0.022
ealing+0.022
 sorry+0.022
 dear+0.022
Neg logit contributions
|-0.011
 hunt-0.009
 land-0.009
 camp-0.009
 distance-0.008
Tokenour
Token ID454
Feature activation+3.23e+00

Pos logit contributions
 concerned+0.160
 darling+0.160
 fault+0.155
 dear+0.155
 sorry+0.149
Neg logit contributions
|-0.343
 land-0.320
 hunt-0.314
 camp-0.300
 distance-0.290
Tokenagement
Token ID5082
Feature activation+4.56e+00

Pos logit contributions
 darling+0.073
 concerned+0.073
ealing+0.073
rier+0.073
 sorry+0.072
Neg logit contributions
|-0.080
 land-0.074
 hunt-0.073
 camp-0.069
 distance-0.066
Token can
Token ID460
Feature activation+5.89e-01

Pos logit contributions
 concerned+0.114
ealing+0.114
 darling+0.113
 sorry+0.112
 dear+0.111
Neg logit contributions
|-0.104
 land-0.095
 hunt-0.094
 camp-0.089
 distance-0.086
Token come
Token ID1282
Feature activation+9.04e-01

Pos logit contributions
 sorry+0.016
 ok+0.016
 darling+0.016
 dear+0.015
 concerned+0.015
Neg logit contributions
|-0.013
 land-0.010
 camp-0.010
 distance-0.010
 hunt-0.010
Token from
Token ID422
Feature activation-4.09e+00

Pos logit contributions
 darling+0.023
 sorry+0.023
 concerned+0.023
ealing+0.023
 dear+0.023
Neg logit contributions
|-0.021
 hunt-0.018
 land-0.018
 camp-0.017
 distance-0.017
Token.
Token ID13
Feature activation+1.26e+00

Pos logit contributions
ealing+0.027
 concerned+0.026
 sorry+0.026
 darling+0.026
ented+0.026
Neg logit contributions
|-0.016
 land-0.015
 hunt-0.015
 camp-0.014
 distance-0.013
Token They
Token ID1119
Feature activation-2.39e+00

Pos logit contributions
 darling+0.031
 concerned+0.031
 dear+0.031
 sorry+0.031
ealing+0.030
Neg logit contributions
|-0.030
 land-0.027
 hunt-0.027
 camp-0.025
 distance-0.024
Token got
Token ID1392
Feature activation+2.05e+00

Pos logit contributions
|+0.056
 hunt+0.046
 distance+0.046
 outer+0.045
growth+0.045
Neg logit contributions
 concerned-0.057
ented-0.056
 darling-0.056
 dear-0.055
ealing-0.055
Token some
Token ID617
Feature activation-2.65e+00

Pos logit contributions
 darling+0.063
 concerned+0.062
ealing+0.062
 dear+0.062
 sorry+0.062
Neg logit contributions
|-0.037
 land-0.031
 hunt-0.031
 camp-0.029
 distance-0.028
Token bar
Token ID2318
Feature activation+2.83e+00

Pos logit contributions
|+0.055
 land+0.049
 hunt+0.049
 camp+0.046
 distance+0.044
Neg logit contributions
 concerned-0.073
 darling-0.072
ealing-0.072
 sorry-0.072
 dear-0.071
Tokenbec
Token ID9423
Feature activation+1.05e+01

Pos logit contributions
 darling+0.075
 concerned+0.075
 sorry+0.074
 fault+0.074
 dear+0.073
Neg logit contributions
|-0.057
 land-0.054
 hunt-0.053
 camp-0.050
 distance-0.049
Tokenued
Token ID1739
Feature activation+1.54e+00

Pos logit contributions
 fault+0.209
 darling+0.199
 mistakes+0.194
 concerned+0.194
omm+0.191
Neg logit contributions
|-0.274
 hunt-0.257
 land-0.255
fire-0.246
ground-0.238
Token meat
Token ID6174
Feature activation-2.28e+00

Pos logit contributions
 darling+0.046
 concerned+0.046
ealing+0.046
 sorry+0.045
 dear+0.045
Neg logit contributions
|-0.029
 land-0.025
 hunt-0.025
 camp-0.023
 distance-0.022
Token.
Token ID13
Feature activation+9.48e-01

Pos logit contributions
|+0.050
 land+0.042
 hunt+0.041
 camp+0.039
 distance+0.039
Neg logit contributions
 darling-0.062
 concerned-0.062
ealing-0.061
 sorry-0.061
 dear-0.061
Token M
Token ID337
Feature activation+6.21e+00

Pos logit contributions
 darling+0.024
 dear+0.023
 concerned+0.023
 sorry+0.023
ealing+0.023
Neg logit contributions
|-0.022
 hunt-0.020
 land-0.020
 camp-0.019
 distance-0.018
Tokenummy
Token ID13513
Feature activation+7.38e-01

Pos logit contributions
 concerned+0.115
 sorry+0.111
rier+0.106
ealing+0.095
 ashamed+0.094
Neg logit contributions
|-0.187
 hunt-0.174
 camp-0.167
 land-0.166
 visit-0.161
Token
Token ID241
Feature activation+1.76e+00

Pos logit contributions
 darling+0.085
rier+0.084
 dear+0.083
 concerned+0.082
 sorry+0.082
Neg logit contributions
|-0.116
 hunt-0.107
 land-0.105
 camp-0.098
 play-0.095
TokenYou
Token ID1639
Feature activation+4.36e+00

Pos logit contributions
 darling+0.045
 concerned+0.045
ealing+0.044
 sorry+0.044
 dear+0.044
Neg logit contributions
|-0.041
 hunt-0.036
 land-0.036
 camp-0.034
 distance-0.033
Token are
Token ID389
Feature activation+3.72e+00

Pos logit contributions
 darling+0.116
 dear+0.114
 sorry+0.113
ealing+0.113
 concerned+0.112
Neg logit contributions
|-0.098
 hunt-0.085
 land-0.085
 distance-0.082
 camp-0.082
Token an
Token ID281
Feature activation+4.31e+00

Pos logit contributions
 darling+0.100
 concerned+0.100
ealing+0.099
 sorry+0.099
 dear+0.099
Neg logit contributions
|-0.081
 hunt-0.070
 land-0.070
 camp-0.066
 distance-0.064
Token amazing
Token ID4998
Feature activation+5.31e+00

Pos logit contributions
ouncing+0.093
ushing+0.093
ucker+0.093
ealing+0.092
 concerned+0.090
Neg logit contributions
|-0.108
 land-0.104
 hunt-0.100
 camp-0.097
 forest-0.095
Token resc
Token ID6811
Feature activation+1.05e+01

Pos logit contributions
ealing+0.143
ented+0.140
 concerned+0.139
ushing+0.138
ares+0.136
Neg logit contributions
 land-0.108
|-0.108
 hunt-0.104
 camp-0.102
 play-0.094
Tokenuer
Token ID15573
Feature activation+4.19e+00

Pos logit contributions
 darling+0.216
 mistakes+0.215
 concerned+0.211
 fault+0.209
 dear+0.208
Neg logit contributions
|-0.271
 hunt-0.255
 land-0.246
fire-0.241
 camp-0.224
Token!
Token ID0
Feature activation+2.21e+00

Pos logit contributions
 concerned+0.117
 darling+0.117
ealing+0.117
 sorry+0.116
 dear+0.115
Neg logit contributions
|-0.085
 land-0.074
 hunt-0.074
 camp-0.069
 distance-0.066
Token I
Token ID314
Feature activation-4.56e-01

Pos logit contributions
 concerned+0.046
 fault+0.045
ealing+0.045
rier+0.045
ushing+0.044
Neg logit contributions
|-0.062
 land-0.054
 hunt-0.054
 camp-0.053
 visit-0.051
Token bet
Token ID731
Feature activation+6.58e+00

Pos logit contributions
|+0.009
 hunt+0.008
 land+0.008
 distance+0.008
 camp+0.008
Neg logit contributions
 darling-0.012
 concerned-0.012
 sorry-0.012
ealing-0.012
 dear-0.012
Token nobody
Token ID8168
Feature activation+4.30e+00

Pos logit contributions
 darling+0.175
 dear+0.173
 concerned+0.172
 sorry+0.172
ealing+0.171
Neg logit contributions
|-0.147
 hunt-0.127
 land-0.126
 camp-0.120
 distance-0.118
Token would
Token ID561
Feature activation-6.62e-01

Pos logit contributions
|+0.014
 hunt+0.012
 land+0.011
 camp+0.011
 distance+0.011
Neg logit contributions
 concerned-0.018
 darling-0.018
 dear-0.018
ushing-0.018
ealing-0.018
Token<|endoftext|>
Token ID50256
Feature activation-5.11e+00

Pos logit contributions
|+0.015
 hunt+0.011
 distance+0.011
 land+0.010
 camp+0.010
Neg logit contributions
 darling-0.018
 dear-0.017
 sorry-0.017
 concerned-0.017
 ok-0.017
TokenJohn
Token ID7554
Feature activation-3.35e-01

Pos logit contributions
 hunt+0.104
|+0.094
 land+0.093
 distance+0.092
 outer+0.091
Neg logit contributions
ealing-0.111
 sorry-0.111
oking-0.109
 darling-0.106
 concerned-0.105
Token b
Token ID275
Feature activation+2.43e+00

Pos logit contributions
|+0.008
 hunt+0.007
 land+0.007
 camp+0.007
 distance+0.007
Neg logit contributions
 darling-0.008
 concerned-0.008
 dear-0.008
ealing-0.008
 sorry-0.008
Tokeniked
Token ID17951
Feature activation-3.16e+00

Pos logit contributions
ealing+0.052
 darling+0.051
ushing+0.050
 concerned+0.050
 dear+0.049
Neg logit contributions
|-0.066
 hunt-0.062
 land-0.061
 camp-0.059
 distance-0.057
Token to
Token ID284
Feature activation-9.69e+00

Pos logit contributions
|+0.088
 hunt+0.075
 land+0.071
 camp+0.070
growth+0.070
Neg logit contributions
 concerned-0.064
 darling-0.064
rier-0.062
 sorry-0.060
 fault-0.059
Token the
Token ID262
Feature activation-4.37e+00

Pos logit contributions
|+0.210
fire+0.186
Buzz+0.156
Boo+0.155
st+0.151
Neg logit contributions
 daughter-0.235
 darling-0.234
 dear-0.223
 sorry-0.221
 son-0.221
Token park
Token ID3952
Feature activation-3.73e+00

Pos logit contributions
|+0.095
 hunt+0.065
growth+0.065
e+0.064
 miles+0.063
Neg logit contributions
 darling-0.119
 dear-0.119
 concerned-0.114
 sorry-0.111
 unfair-0.110
Token with
Token ID351
Feature activation-5.28e+00

Pos logit contributions
|+0.094
growth+0.076
 hunt+0.071
 camp+0.071
Ar+0.070
Neg logit contributions
 concerned-0.084
 darling-0.083
 sorry-0.082
ushing-0.081
 dear-0.080
Token his
Token ID465
Feature activation+1.27e-01

Pos logit contributions
|+0.112
 hunt+0.080
fire+0.079
 land+0.076
ground+0.072
Neg logit contributions
 sorry-0.157
 darling-0.156
 dear-0.155
 concerned-0.155
 ok-0.147
Token friends
Token ID2460
Feature activation-2.23e+00

Pos logit contributions
 darling+0.002
 concerned+0.002
 sorry+0.002
 dear+0.002
ealing+0.002
Neg logit contributions
|-0.004
 hunt-0.004
 land-0.004
 camp-0.003
 distance-0.003
Token<|endoftext|>
Token ID50256
Feature activation-4.80e+00

Pos logit contributions
 darling+0.010
 sorry+0.010
 dear+0.010
 concerned+0.010
ealing+0.010
Neg logit contributions
|-0.005
 hunt-0.005
 land-0.005
 camp-0.004
 distance-0.004
TokenP
Token ID47
Feature activation+1.74e+00

Pos logit contributions
 hunt+0.103
 land+0.089
 miles+0.089
 roam+0.088
|+0.087
Neg logit contributions
ealing-0.099
 sorry-0.098
aughter-0.097
 darling-0.095
oke-0.095
Tokenolly
Token ID5098
Feature activation+1.38e+00

Pos logit contributions
ealing+0.044
 darling+0.044
 concerned+0.043
 dear+0.043
ushing+0.043
Neg logit contributions
|-0.040
 hunt-0.036
 land-0.036
 distance-0.034
 camp-0.033
Token was
Token ID373
Feature activation+1.26e+00

Pos logit contributions
 darling+0.035
 concerned+0.035
 dear+0.035
ealing+0.034
 sorry+0.034
Neg logit contributions
|-0.032
 hunt-0.027
 land-0.027
 camp-0.026
 distance-0.026
Token playing
Token ID2712
Feature activation-7.62e-01

Pos logit contributions
 darling+0.038
ealing+0.038
rier+0.038
 concerned+0.038
 dear+0.038
Neg logit contributions
|-0.023
 land-0.020
 hunt-0.019
 camp-0.018
 distance-0.018
Token at
Token ID379
Feature activation-9.37e+00

Pos logit contributions
|+0.018
 distance+0.014
 hunt+0.013
field+0.013
Ray+0.013
Neg logit contributions
 sorry-0.020
 concerned-0.019
 darling-0.018
 mistakes-0.018
 ok-0.018
Token the
Token ID262
Feature activation-4.63e+00

Pos logit contributions
|+0.186
fire+0.175
 hunt+0.158
Buzz+0.151
 trek+0.149
Neg logit contributions
 fault-0.237
 mistakes-0.224
 darling-0.221
aughter-0.215
 sorry-0.214
Token park
Token ID3952
Feature activation-3.53e+00

Pos logit contributions
|+0.094
growth+0.066
 miles+0.064
 hunt+0.063
fire+0.062
Neg logit contributions
 darling-0.128
 dear-0.125
 concerned-0.124
 mistakes-0.121
 fault-0.121
Token.
Token ID13
Feature activation-1.11e+00

Pos logit contributions
|+0.083
 hunt+0.066
 camp+0.065
 land+0.064
growth+0.060
Neg logit contributions
 concerned-0.091
 darling-0.090
 sorry-0.089
ushing-0.089
 dear-0.089
Token She
Token ID1375
Feature activation-2.83e+00

Pos logit contributions
|+0.025
 land+0.022
 hunt+0.021
 camp+0.021
 play+0.020
Neg logit contributions
rier-0.029
 fault-0.029
ushing-0.029
 darling-0.029
 dear-0.029
Token loved
Token ID6151
Feature activation+2.11e+00

Pos logit contributions
|+0.046
 hunt+0.036
 camp+0.034
 land+0.033
 distance+0.032
Neg logit contributions
 concerned-0.090
 darling-0.088
 sorry-0.088
ealing-0.088
 dear-0.087
Token<|endoftext|>
Token ID50256
Feature activation-4.83e+00

Pos logit contributions
|+0.000
 hunt+0.000
 land+0.000
 camp+0.000
 distance+0.000
Neg logit contributions
 darling-0.000
 sorry-0.000
 concerned-0.000
ealing-0.000
 dear-0.000
TokenI
Token ID40
Feature activation-2.20e+00

Pos logit contributions
 hunt+0.103
 miles+0.086
|+0.086
 land+0.085
 roam+0.085
Neg logit contributions
ealing-0.105
 sorry-0.103
aughter-0.102
 darling-0.101
oke-0.100
Tokenzzy
Token ID31570
Feature activation+7.49e-01

Pos logit contributions
|+0.051
 hunt+0.046
 land+0.045
 camp+0.044
 distance+0.042
Neg logit contributions
 concerned-0.053
ealing-0.053
 sorry-0.053
 darling-0.053
 dear-0.051
Token was
Token ID373
Feature activation+1.64e+00

Pos logit contributions
 darling+0.021
 dear+0.021
 concerned+0.020
ealing+0.020
ushing+0.019
Neg logit contributions
|-0.015
 hunt-0.013
 camp-0.012
 distance-0.012
 land-0.012
Token playing
Token ID2712
Feature activation-8.96e-01

Pos logit contributions
 sorry+0.044
 concerned+0.044
 darling+0.043
 dear+0.043
ealing+0.043
Neg logit contributions
|-0.036
 hunt-0.032
 land-0.031
 camp-0.030
 distance-0.029
Token at
Token ID379
Feature activation-9.36e+00

Pos logit contributions
|+0.020
 distance+0.016
 hunt+0.015
growth+0.015
Ray+0.015
Neg logit contributions
 sorry-0.024
 concerned-0.023
rier-0.023
 darling-0.023
 mistakes-0.022
Token home
Token ID1363
Feature activation-3.82e+00

Pos logit contributions
|+0.181
fire+0.176
 hunt+0.153
Buzz+0.147
 trek+0.143
Neg logit contributions
 fault-0.238
 mistakes-0.230
 darling-0.213
 sorry-0.209
 older-0.206
Token.
Token ID13
Feature activation-1.14e+00

Pos logit contributions
|+0.091
 hunt+0.072
 distance+0.068
 land+0.067
 camp+0.066
Neg logit contributions
 darling-0.103
 dear-0.100
rier-0.100
 concerned-0.098
ushing-0.098
Token She
Token ID1375
Feature activation-2.89e+00

Pos logit contributions
|+0.028
 hunt+0.025
 land+0.025
 camp+0.024
 distance+0.023
Neg logit contributions
 darling-0.027
 concerned-0.027
ealing-0.027
 sorry-0.027
 dear-0.027
Token saw
Token ID2497
Feature activation-8.86e-01

Pos logit contributions
|+0.050
 hunt+0.040
 camp+0.038
 land+0.038
 distance+0.037
Neg logit contributions
 darling-0.089
 concerned-0.089
 dear-0.088
 sorry-0.088
ealing-0.087
Token a
Token ID257
Feature activation-2.55e+00

Pos logit contributions
|+0.018
 hunt+0.014
 land+0.013
 camp+0.012
 distance+0.012
Neg logit contributions
 dear-0.027
 darling-0.027
 sorry-0.027
 concerned-0.027
 fault-0.026
Token day
Token ID1110
Feature activation+5.08e-01

Pos logit contributions
|+0.046
 hunt+0.042
growth+0.040
fire+0.037
ground+0.037
Neg logit contributions
 darling-0.051
 dear-0.051
 sorry-0.051
 concerned-0.049
ealing-0.049
Token,
Token ID11
Feature activation+1.67e+00

Pos logit contributions
 darling+0.014
 concerned+0.014
ealing+0.014
 sorry+0.014
 dear+0.013
Neg logit contributions
|-0.011
 hunt-0.010
 land-0.010
 camp-0.009
 distance-0.009
Token Amy
Token ID14235
Feature activation+7.04e-01

Pos logit contributions
 sorry+0.052
 darling+0.051
 concerned+0.050
 dear+0.050
ealing+0.050
Neg logit contributions
|-0.029
 hunt-0.027
 land-0.025
 camp-0.024
 distance-0.023
Token was
Token ID373
Feature activation+1.87e+00

Pos logit contributions
 darling+0.018
 concerned+0.017
 dear+0.017
ealing+0.017
 sorry+0.017
Neg logit contributions
|-0.015
 hunt-0.013
 camp-0.013
 distance-0.012
 land-0.012
Token playing
Token ID2712
Feature activation-9.68e-01

Pos logit contributions
 darling+0.059
ealing+0.059
rier+0.059
 concerned+0.058
 dear+0.058
Neg logit contributions
|-0.032
 land-0.028
 hunt-0.027
 camp-0.025
 distance-0.024
Token at
Token ID379
Feature activation-9.33e+00

Pos logit contributions
|+0.023
 distance+0.019
field+0.019
Ray+0.018
 traveled+0.017
Neg logit contributions
 sorry-0.025
 concerned-0.023
 mistakes-0.022
 darling-0.021
ushing-0.021
Token the
Token ID262
Feature activation-4.38e+00

Pos logit contributions
|+0.174
fire+0.166
 hunt+0.149
 trek+0.146
 distance+0.140
Neg logit contributions
 fault-0.229
 sorry-0.222
 mistakes-0.222
 darling-0.215
 daughters-0.211
Token park
Token ID3952
Feature activation-3.90e+00

Pos logit contributions
|+0.088
growth+0.061
 miles+0.060
fire+0.059
 distance+0.058
Neg logit contributions
 darling-0.117
 dear-0.115
 concerned-0.115
 fault-0.114
 sorry-0.112
Token with
Token ID351
Feature activation-5.11e+00

Pos logit contributions
|+0.085
 camp+0.069
 hunt+0.067
 land+0.065
growth+0.064
Neg logit contributions
 concerned-0.103
 sorry-0.103
ushing-0.102
 darling-0.101
 dear-0.100
Token her
Token ID607
Feature activation-6.16e-01

Pos logit contributions
|+0.139
 hunt+0.119
fire+0.116
 land+0.115
field+0.114
Neg logit contributions
 sorry-0.114
 darling-0.111
 concerned-0.110
 dear-0.110
ealing-0.105
Token friends
Token ID2460
Feature activation-2.36e+00

Pos logit contributions
|+0.014
 land+0.014
 hunt+0.014
 camp+0.013
 play+0.013
Neg logit contributions
rier-0.015
ealing-0.015
 darling-0.015
 concerned-0.015
 sorry-0.015
Token Billy
Token ID15890
Feature activation+7.15e-02

Pos logit contributions
|+0.033
 hunt+0.028
 land+0.028
 camp+0.027
 distance+0.026
Neg logit contributions
 darling-0.039
 concerned-0.039
ealing-0.039
 sorry-0.039
 dear-0.039
Token went
Token ID1816
Feature activation-3.77e-01

Pos logit contributions
 sorry+0.002
 concerned+0.002
ealing+0.002
 darling+0.002
ushing+0.002
Neg logit contributions
|-0.001
 land-0.001
 hunt-0.001
 camp-0.001
 distance-0.001
Token to
Token ID284
Feature activation-8.34e+00

Pos logit contributions
|+0.007
 land+0.006
 hunt+0.006
 distance+0.006
 camp+0.006
Neg logit contributions
 sorry-0.011
 concerned-0.011
ealing-0.011
 darling-0.011
 fault-0.011
Token play
Token ID711
Feature activation-8.47e-01

Pos logit contributions
|+0.125
fire+0.118
field+0.102
 distance+0.101
 land+0.101
Neg logit contributions
 sorry-0.261
 darling-0.247
 fault-0.246
 apology-0.244
 concerned-0.243
Token baseball
Token ID9283
Feature activation-4.07e+00

Pos logit contributions
|+0.021
 land+0.018
 distance+0.018
 hunt+0.018
 camp+0.017
Neg logit contributions
 sorry-0.021
ealing-0.020
 concerned-0.020
ushing-0.019
 darling-0.019
Token at
Token ID379
Feature activation-9.31e+00

Pos logit contributions
|+0.084
 distance+0.071
 land+0.067
growth+0.067
 outer+0.063
Neg logit contributions
 sorry-0.106
 mistakes-0.105
ushing-0.104
 concerned-0.103
ealing-0.103
Token the
Token ID262
Feature activation-3.14e+00

Pos logit contributions
|+0.157
fire+0.131
 trek+0.130
 miles+0.120
 roam+0.119
Neg logit contributions
 fault-0.256
 sorry-0.251
 mistakes-0.245
aughter-0.244
 sons-0.240
Token park
Token ID3952
Feature activation-3.99e+00

Pos logit contributions
|+0.055
 land+0.036
 hunt+0.036
 miles+0.034
growth+0.034
Neg logit contributions
 fault-0.098
 sorry-0.097
 concerned-0.097
ealing-0.097
 darling-0.096
Token.
Token ID13
Feature activation-1.17e+00

Pos logit contributions
|+0.098
growth+0.088
 outer+0.081
 camp+0.078
 land+0.077
Neg logit contributions
 sorry-0.087
 fault-0.087
ushing-0.086
 concerned-0.085
olding-0.083
Token Tim
Token ID5045
Feature activation+1.12e+00

Pos logit contributions
|+0.020
 hunt+0.017
 land+0.016
 camp+0.015
 distance+0.015
Neg logit contributions
 darling-0.037
 concerned-0.037
rier-0.037
ealing-0.037
 dear-0.037
Tokenmy
Token ID1820
Feature activation+1.23e+00

Pos logit contributions
rier+0.034
 darling+0.034
 sorry+0.034
 dear+0.034
 concerned+0.033
Neg logit contributions
|-0.021
 hunt-0.018
 land-0.018
 camp-0.016
 distance-0.016
Token<|endoftext|>
Token ID50256
Feature activation-4.25e+00

Pos logit contributions
 darling+0.047
 sorry+0.047
 concerned+0.046
 dear+0.046
ealing+0.045
Neg logit contributions
|-0.021
 hunt-0.020
 land-0.019
 camp-0.018
 miles-0.016
TokenS
Token ID50
Feature activation+2.15e+00

Pos logit contributions
 hunt+0.086
|+0.073
 land+0.072
 miles+0.072
 roam+0.069
Neg logit contributions
ealing-0.102
 sorry-0.098
 darling-0.098
oking-0.097
 concerned-0.097
Tokenandy
Token ID10757
Feature activation+1.63e+00

Pos logit contributions
ealing+0.042
 darling+0.042
 dear+0.040
 concerned+0.040
omm+0.040
Neg logit contributions
|-0.060
 hunt-0.056
 land-0.056
fire-0.054
 camp-0.053
Token was
Token ID373
Feature activation+1.48e+00

Pos logit contributions
 darling+0.044
 concerned+0.043
 dear+0.043
ealing+0.043
ushing+0.042
Neg logit contributions
|-0.034
 hunt-0.029
 land-0.028
 camp-0.028
 distance-0.028
Token playing
Token ID2712
Feature activation-6.96e-01

Pos logit contributions
rier+0.043
ealing+0.043
 darling+0.043
ushing+0.042
 dear+0.042
Neg logit contributions
|-0.028
 land-0.025
 camp-0.023
 hunt-0.023
 distance-0.022
Token at
Token ID379
Feature activation-9.31e+00

Pos logit contributions
|+0.016
 distance+0.013
Ray+0.012
 hunt+0.012
field+0.012
Neg logit contributions
 sorry-0.018
 concerned-0.018
 darling-0.017
 mistakes-0.017
 ok-0.016
Token the
Token ID262
Feature activation-4.63e+00

Pos logit contributions
|+0.181
fire+0.173
 hunt+0.154
Buzz+0.148
Sun+0.138
Neg logit contributions
 fault-0.238
 mistakes-0.231
 darling-0.224
 daughter-0.215
 sorry-0.215
Token park
Token ID3952
Feature activation-3.48e+00

Pos logit contributions
|+0.094
growth+0.066
 hunt+0.064
 miles+0.063
fire+0.062
Neg logit contributions
 darling-0.130
 dear-0.126
 concerned-0.125
 fault-0.120
 mistakes-0.120
Token with
Token ID351
Feature activation-5.08e+00

Pos logit contributions
|+0.080
 hunt+0.064
 camp+0.063
 land+0.063
 distance+0.059
Neg logit contributions
 concerned-0.091
 darling-0.090
 sorry-0.089
ushing-0.089
 dear-0.089
Token her
Token ID607
Feature activation-6.18e-01

Pos logit contributions
|+0.143
 hunt+0.121
fire+0.119
field+0.114
 land+0.114
Neg logit contributions
 sorry-0.107
 concerned-0.106
 darling-0.106
 dear-0.103
 mistakes-0.100
Token friends
Token ID2460
Feature activation-2.01e+00

Pos logit contributions
|+0.017
 land+0.016
 hunt+0.016
 camp+0.015
 distance+0.015
Neg logit contributions
 darling-0.013
 concerned-0.013
ealing-0.013
 sorry-0.013
 dear-0.012
Token<|endoftext|>
Token ID50256
Feature activation-4.69e+00

Pos logit contributions
|+0.007
 hunt+0.005
 land+0.005
 camp+0.005
 distance+0.004
Neg logit contributions
 darling-0.017
 concerned-0.017
ealing-0.017
 sorry-0.017
 dear-0.017
TokenJimmy
Token ID40335
Feature activation+1.02e+00

Pos logit contributions
 hunt+0.091
|+0.080
 land+0.078
 miles+0.076
 outer+0.075
Neg logit contributions
ealing-0.114
 darling-0.111
 sorry-0.110
oking-0.109
 concerned-0.109
Token wanted
Token ID2227
Feature activation+8.99e-01

Pos logit contributions
 concerned+0.029
 darling+0.029
ealing+0.029
 sorry+0.029
rier+0.029
Neg logit contributions
|-0.020
 land-0.017
 hunt-0.017
 camp-0.016
 distance-0.016
Token to
Token ID284
Feature activation-7.92e+00

Pos logit contributions
 darling+0.025
 dear+0.025
 concerned+0.025
 sorry+0.025
 fault+0.025
Neg logit contributions
|-0.020
 hunt-0.015
 land-0.015
 camp-0.014
 distance-0.014
Token climb
Token ID12080
Feature activation-3.50e+00

Pos logit contributions
|+0.151
 distance+0.113
 hunt+0.113
 land+0.112
 camp+0.110
Neg logit contributions
 darling-0.236
 sorry-0.232
 dear-0.231
 concerned-0.223
ealing-0.220
Token to
Token ID284
Feature activation-9.31e+00

Pos logit contributions
|+0.076
 hunt+0.059
fire+0.056
 land+0.054
 distance+0.053
Neg logit contributions
 fault-0.089
 darling-0.089
 sorry-0.088
 concerned-0.088
 mistakes-0.086
Token the
Token ID262
Feature activation-4.11e+00

Pos logit contributions
|+0.208
fire+0.169
Buzz+0.148
Ar+0.146
Sky+0.137
Neg logit contributions
 darling-0.236
 sorry-0.233
 dear-0.228
 older-0.220
 daughter-0.220
Token top
Token ID1353
Feature activation-1.62e+00

Pos logit contributions
|+0.091
 hunt+0.063
e+0.060
growth+0.060
fire+0.058
Neg logit contributions
 darling-0.115
 dear-0.115
 concerned-0.110
 sorry-0.109
 unfair-0.108
Token of
Token ID286
Feature activation-6.74e+00

Pos logit contributions
|+0.042
 hunt+0.035
growth+0.032
 camp+0.031
 land+0.031
Neg logit contributions
 darling-0.038
 dear-0.037
rier-0.036
ushing-0.036
 concerned-0.035
Token the
Token ID262
Feature activation-3.63e+00

Pos logit contributions
|+0.147
fire+0.098
 hunt+0.097
Buzz+0.091
 land+0.084
Neg logit contributions
 sorry-0.196
 darling-0.188
 concerned-0.185
 dear-0.185
 fault-0.179
Token tree
Token ID5509
Feature activation-5.79e+00

Pos logit contributions
|+0.086
 hunt+0.063
growth+0.063
e+0.060
 traveled+0.058
Neg logit contributions
 darling-0.095
 dear-0.094
 concerned-0.087
 sorry-0.086
 unfair-0.085
Token.
Token ID13
Feature activation+7.25e-02

Pos logit contributions
 sorry+0.111
 darling+0.110
 dear+0.109
 concerned+0.108
rier+0.108
Neg logit contributions
|-0.092
 hunt-0.071
 land-0.071
 distance-0.069
growth-0.068
Token She
Token ID1375
Feature activation-3.06e+00

Pos logit contributions
 concerned+0.002
ealing+0.002
 fault+0.002
 darling+0.002
rier+0.002
Neg logit contributions
|-0.002
 land-0.002
 hunt-0.002
 camp-0.001
 distance-0.001
Token wanted
Token ID2227
Feature activation+1.62e+00

Pos logit contributions
|+0.062
 hunt+0.053
 land+0.052
 camp+0.049
 distance+0.049
Neg logit contributions
 darling-0.086
 concerned-0.086
 sorry-0.086
ealing-0.085
 dear-0.085
Token to
Token ID284
Feature activation-8.04e+00

Pos logit contributions
 darling+0.042
 dear+0.041
 fault+0.041
 sorry+0.041
 mistakes+0.040
Neg logit contributions
|-0.039
 hunt-0.030
 land-0.029
 distance-0.028
fire-0.028
Token go
Token ID467
Feature activation-1.56e+00

Pos logit contributions
|+0.179
 valleys+0.148
 distance+0.142
 clearing+0.140
 forests+0.140
Neg logit contributions
 sorry-0.208
 darling-0.204
 dear-0.201
 ok-0.191
 apology-0.187
Token to
Token ID284
Feature activation-9.25e+00

Pos logit contributions
|+0.034
growth+0.028
 land+0.027
fire+0.027
 distance+0.027
Neg logit contributions
 sorry-0.040
 concerned-0.038
 darling-0.037
ealing-0.037
 dear-0.037
Token bed
Token ID3996
Feature activation-1.66e+00

Pos logit contributions
|+0.207
fire+0.198
Buzz+0.159
 distance+0.159
Ray+0.159
Neg logit contributions
 dear-0.234
 sorry-0.233
 darling-0.233
 daughter-0.224
 mistakes-0.221
Token,
Token ID11
Feature activation+2.13e+00

Pos logit contributions
|+0.037
 distance+0.029
 hunt+0.029
 land+0.028
growth+0.028
Neg logit contributions
 sorry-0.046
 dear-0.046
rier-0.045
 darling-0.045
 concerned-0.045
Token but
Token ID475
Feature activation+1.48e+00

Pos logit contributions
 darling+0.043
 dear+0.043
 sorry+0.043
 ok+0.041
 concerned+0.040
Neg logit contributions
|-0.059
 land-0.052
 hunt-0.051
fire-0.051
growth-0.051
Token she
Token ID673
Feature activation-4.45e-01

Pos logit contributions
rier+0.040
ealing+0.040
 concerned+0.039
 darling+0.039
 fault+0.039
Neg logit contributions
|-0.032
 land-0.029
 hunt-0.028
 camp-0.027
 distance-0.026
Token had
Token ID550
Feature activation-5.50e-01

Pos logit contributions
|+0.008
 hunt+0.007
 land+0.006
 camp+0.006
 distance+0.006
Neg logit contributions
 concerned-0.014
 darling-0.014
 dear-0.014
 sorry-0.013
ealing-0.013
Token<|endoftext|>
Token ID50256
Feature activation-5.37e+00

Pos logit contributions
|+0.007
 hunt+0.007
 land+0.006
 camp+0.006
 distance+0.006
Neg logit contributions
 darling-0.011
 sorry-0.011
 dear-0.011
ealing-0.010
 fault-0.010
TokenSus
Token ID30746
Feature activation+3.23e+00

Pos logit contributions
 hunt+0.122
 outer+0.109
 roam+0.109
 land+0.108
 miles+0.107
Neg logit contributions
 sorry-0.100
ealing-0.099
 fault-0.098
oke-0.098
oking-0.096
Tokenie
Token ID494
Feature activation+6.84e-01

Pos logit contributions
ushing+0.075
 darling+0.075
 concerned+0.075
ealing+0.075
 fault+0.074
Neg logit contributions
|-0.077
 hunt-0.071
 land-0.066
 camp-0.066
 distance-0.065
Token was
Token ID373
Feature activation+1.77e+00

Pos logit contributions
 darling+0.020
 dear+0.020
 concerned+0.020
 sorry+0.020
ealing+0.020
Neg logit contributions
|-0.013
 hunt-0.011
 land-0.010
 camp-0.010
 distance-0.010
Token playing
Token ID2712
Feature activation-7.05e-01

Pos logit contributions
 darling+0.052
ealing+0.052
ushing+0.051
 dear+0.051
rier+0.051
Neg logit contributions
|-0.034
 land-0.030
 camp-0.028
 hunt-0.028
 distance-0.026
Token at
Token ID379
Feature activation-9.25e+00

Pos logit contributions
|+0.015
 distance+0.012
 hunt+0.012
growth+0.011
Ray+0.011
Neg logit contributions
 sorry-0.019
 concerned-0.019
 darling-0.019
rier-0.019
 dear-0.018
Token the
Token ID262
Feature activation-4.22e+00

Pos logit contributions
|+0.179
fire+0.162
 hunt+0.155
 trek+0.137
 roam+0.134
Neg logit contributions
 fault-0.244
 mistakes-0.233
 darling-0.229
 sorry-0.223
 dear-0.222
Token beach
Token ID10481
Feature activation-3.13e+00

Pos logit contributions
|+0.086
growth+0.062
 hunt+0.059
 miles+0.057
fire+0.056
Neg logit contributions
 darling-0.116
 dear-0.114
 concerned-0.112
 fault-0.112
 mistakes-0.111
Token.
Token ID13
Feature activation-1.53e+00

Pos logit contributions
|+0.068
 hunt+0.050
 camp+0.050
 land+0.048
growth+0.048
Neg logit contributions
ushing-0.087
 darling-0.086
 concerned-0.086
rier-0.086
 dear-0.085
Token She
Token ID1375
Feature activation-2.90e+00

Pos logit contributions
|+0.038
 land+0.034
 hunt+0.033
 camp+0.032
 distance+0.031
Neg logit contributions
ealing-0.037
 concerned-0.037
rier-0.036
 darling-0.036
 sorry-0.036
Token picked
Token ID6497
Feature activation-8.18e-01

Pos logit contributions
|+0.048
 hunt+0.037
 camp+0.035
 land+0.034
 distance+0.033
Neg logit contributions
 concerned-0.092
 darling-0.091
ealing-0.091
 sorry-0.091
 dear-0.090
Token He
Token ID679
Feature activation-2.88e+00

Pos logit contributions
|+0.031
 hunt+0.029
 land+0.028
 camp+0.026
 distance+0.026
Neg logit contributions
 darling-0.031
 dear-0.031
ealing-0.030
 sorry-0.030
 concerned-0.030
Token was
Token ID373
Feature activation+2.25e+00

Pos logit contributions
|+0.058
 hunt+0.047
 distance+0.046
 land+0.045
 camp+0.045
Neg logit contributions
 darling-0.080
 dear-0.079
 concerned-0.078
ealing-0.077
 sorry-0.077
Token going
Token ID1016
Feature activation+1.44e+00

Pos logit contributions
 darling+0.053
ealing+0.053
 concerned+0.052
 dear+0.052
rier+0.051
Neg logit contributions
|-0.057
 land-0.051
 hunt-0.051
 camp-0.049
 distance-0.047
Token to
Token ID284
Feature activation-8.79e+00

Pos logit contributions
 darling+0.041
 concerned+0.041
 sorry+0.041
 dear+0.041
ealing+0.040
Neg logit contributions
|-0.029
 land-0.023
 hunt-0.022
 distance-0.021
 camp-0.021
Token go
Token ID467
Feature activation-1.83e+00

Pos logit contributions
|+0.170
fire+0.142
 distance+0.126
field+0.123
Buzz+0.120
Neg logit contributions
 darling-0.235
 dear-0.233
 sorry-0.233
 mistakes-0.215
 apology-0.214
Token to
Token ID284
Feature activation-9.25e+00

Pos logit contributions
|+0.038
 land+0.031
growth+0.031
 distance+0.030
 hunt+0.030
Neg logit contributions
 darling-0.048
 concerned-0.048
 sorry-0.048
ealing-0.047
 dear-0.047
Token the
Token ID262
Feature activation-3.82e+00

Pos logit contributions
|+0.190
fire+0.177
Buzz+0.152
 distance+0.150
Sky+0.141
Neg logit contributions
 darling-0.247
 dear-0.239
 sorry-0.231
 daughter-0.231
 mistakes-0.222
Token big
Token ID1263
Feature activation-1.69e+00

Pos logit contributions
|+0.083
growth+0.059
 miles+0.055
 hunt+0.055
fire+0.055
Neg logit contributions
 dear-0.105
 darling-0.104
 sorry-0.101
 concerned-0.100
 unfair-0.099
Token,
Token ID11
Feature activation+1.71e+00

Pos logit contributions
|+0.031
 hunt+0.022
growth+0.022
 land+0.021
 distance+0.021
Neg logit contributions
 darling-0.052
 dear-0.051
 concerned-0.050
 sorry-0.049
ealing-0.049
Token enormous
Token ID9812
Feature activation+2.33e-01

Pos logit contributions
ealing+0.044
 fault+0.043
rier+0.043
 darling+0.043
ushing+0.043
Neg logit contributions
|-0.040
 land-0.036
 hunt-0.035
 camp-0.034
 distance-0.033
Token zoo
Token ID26626
Feature activation-1.83e+00

Pos logit contributions
 darling+0.007
 dear+0.007
 concerned+0.007
 sorry+0.007
ealing+0.007
Neg logit contributions
|-0.004
 hunt-0.003
 land-0.003
 distance-0.003
growth-0.003
Tokenolly
Token ID5098
Feature activation+1.57e+00

Pos logit contributions
ealing+0.105
 darling+0.103
ushing+0.100
 concerned+0.100
 dear+0.099
Neg logit contributions
|-0.135
 hunt-0.127
 land-0.124
 distance-0.119
 camp-0.119
Token and
Token ID290
Feature activation-3.56e+00

Pos logit contributions
 darling+0.043
 concerned+0.043
 dear+0.043
ealing+0.042
 sorry+0.042
Neg logit contributions
|-0.033
 hunt-0.028
 land-0.028
 camp-0.027
 distance-0.026
Token Max
Token ID5436
Feature activation+1.27e+00

Pos logit contributions
|+0.078
 hunt+0.069
 land+0.068
 camp+0.065
 distance+0.063
Neg logit contributions
 darling-0.095
 concerned-0.094
 sorry-0.094
 dear-0.094
ealing-0.093
Token were
Token ID547
Feature activation+8.99e-01

Pos logit contributions
 darling+0.040
ealing+0.039
 dear+0.039
 sorry+0.039
 concerned+0.039
Neg logit contributions
|-0.021
 hunt-0.018
 land-0.017
 distance-0.016
 camp-0.016
Token playing
Token ID2712
Feature activation-9.13e-01

Pos logit contributions
rier+0.023
 apology+0.023
 darling+0.022
ealing+0.022
 dear+0.022
Neg logit contributions
|-0.021
 land-0.019
 camp-0.018
 play-0.017
 hunt-0.017
Token at
Token ID379
Feature activation-9.25e+00

Pos logit contributions
|+0.020
 distance+0.016
 hunt+0.015
field+0.015
Ray+0.015
Neg logit contributions
 sorry-0.024
 concerned-0.023
 mistakes-0.023
 darling-0.023
ushing-0.022
Token the
Token ID262
Feature activation-4.63e+00

Pos logit contributions
|+0.173
fire+0.162
 hunt+0.144
Buzz+0.137
Sun+0.133
Neg logit contributions
 fault-0.245
 mistakes-0.235
 sorry-0.223
 darling-0.222
 daughters-0.220
Token beach
Token ID10481
Feature activation-3.45e+00

Pos logit contributions
|+0.090
growth+0.062
 hunt+0.061
fire+0.058
 distance+0.057
Neg logit contributions
 darling-0.133
 concerned-0.130
 dear-0.128
 fault-0.127
 mistakes-0.126
Token.
Token ID13
Feature activation-9.89e-01

Pos logit contributions
|+0.074
 camp+0.054
 hunt+0.053
 land+0.053
growth+0.052
Neg logit contributions
 concerned-0.096
ushing-0.096
 darling-0.095
rier-0.095
 dear-0.093
Token The
Token ID383
Feature activation-1.38e+00

Pos logit contributions
|+0.021
 hunt+0.018
 land+0.018
 camp+0.018
 distance+0.016
Neg logit contributions
rier-0.028
ushing-0.027
 darling-0.027
 concerned-0.027
 fault-0.027
Token sun
Token ID4252
Feature activation-1.61e+00

Pos logit contributions
|+0.024
 land+0.021
 hunt+0.020
 camp+0.020
 distance+0.019
Neg logit contributions
 concerned-0.043
 darling-0.043
 fault-0.042
ealing-0.042
 sorry-0.042
Token.
Token ID13
Feature activation-9.47e-01

Pos logit contributions
 concerned+0.148
ealing+0.148
 sorry+0.147
 darling+0.146
rier+0.145
Neg logit contributions
|-0.064
 hunt-0.053
 land-0.053
 camp-0.049
 distance-0.044
Token They
Token ID1119
Feature activation-2.65e+00

Pos logit contributions
|+0.021
 land+0.019
 hunt+0.019
 camp+0.018
 distance+0.017
Neg logit contributions
 concerned-0.025
 darling-0.025
ealing-0.025
 sorry-0.024
 dear-0.024
Token were
Token ID547
Feature activation+1.67e+00

Pos logit contributions
|+0.053
 hunt+0.044
 distance+0.042
growth+0.041
 camp+0.041
Neg logit contributions
 darling-0.074
 dear-0.073
 sorry-0.073
 concerned-0.073
ealing-0.072
Token taking
Token ID2263
Feature activation-7.57e-01

Pos logit contributions
 darling+0.042
ealing+0.042
ushing+0.041
 fault+0.041
 concerned+0.041
Neg logit contributions
|-0.039
 land-0.037
 hunt-0.035
 camp-0.035
 play-0.032
Token John
Token ID1757
Feature activation-1.04e+00

Pos logit contributions
|+0.015
 hunt+0.013
 land+0.012
 distance+0.012
field+0.012
Neg logit contributions
ealing-0.021
 darling-0.021
 dear-0.021
 sorry-0.021
 concerned-0.020
Token to
Token ID284
Feature activation-9.25e+00

Pos logit contributions
|+0.022
 land+0.019
 hunt+0.018
 camp+0.018
 distance+0.017
Neg logit contributions
 darling-0.028
 concerned-0.028
ealing-0.028
 fault-0.028
 dear-0.028
Token the
Token ID262
Feature activation-3.49e+00

Pos logit contributions
|+0.197
st+0.156
fire+0.155
Buzz+0.154
Sky+0.151
Neg logit contributions
 sorry-0.237
 dear-0.229
 darling-0.228
 daughter-0.227
 son-0.217
Token park
Token ID3952
Feature activation-2.95e+00

Pos logit contributions
|+0.079
growth+0.054
 hunt+0.052
 miles+0.052
 distance+0.051
Neg logit contributions
 dear-0.096
 sorry-0.095
 unfair-0.093
 darling-0.092
 concerned-0.091
Token.
Token ID13
Feature activation-1.75e+00

Pos logit contributions
|+0.073
growth+0.065
 distance+0.059
Buzz+0.058
 miles+0.057
Neg logit contributions
 darling-0.062
ushing-0.061
 dear-0.061
 fault-0.060
 concerned-0.059
Token As
Token ID1081
Feature activation+6.48e-02

Pos logit contributions
|+0.035
 hunt+0.031
 land+0.030
 camp+0.028
 distance+0.027
Neg logit contributions
 darling-0.050
 concerned-0.049
ealing-0.049
 dear-0.049
 sorry-0.049
Token they
Token ID484
Feature activation-2.02e+00

Pos logit contributions
 darling+0.001
 dear+0.001
 sorry+0.001
 concerned+0.001
rier+0.001
Neg logit contributions
|-0.002
 hunt-0.001
 land-0.001
growth-0.001
 distance-0.001
Token to
Token ID284
Feature activation-6.74e+00

Pos logit contributions
 darling+0.017
 sorry+0.017
 concerned+0.017
 dear+0.017
 fault+0.017
Neg logit contributions
|-0.010
 hunt-0.008
 land-0.008
 camp-0.008
 distance-0.008
Token play
Token ID711
Feature activation+7.69e-02

Pos logit contributions
|+0.098
 distance+0.070
 land+0.069
 hunt+0.069
 camp+0.068
Neg logit contributions
 darling-0.230
 sorry-0.230
 dear-0.227
 concerned-0.222
ealing-0.220
Token on
Token ID319
Feature activation-6.14e+00

Pos logit contributions
 sorry+0.002
ushing+0.002
 concerned+0.002
ealing+0.002
 darling+0.002
Neg logit contributions
|-0.002
 distance-0.001
 hunt-0.001
 land-0.001
 camp-0.001
Token the
Token ID262
Feature activation-5.20e+00

Pos logit contributions
|+0.123
fire+0.097
 distance+0.097
 hunt+0.093
 traveled+0.084
Neg logit contributions
 darling-0.164
 dear-0.161
 sorry-0.160
 mistakes-0.160
 fault-0.154
Token swing
Token ID9628
Feature activation-3.60e+00

Pos logit contributions
|+0.107
 hunt+0.084
 land+0.084
 camp+0.080
 distance+0.079
Neg logit contributions
 darling-0.147
 concerned-0.145
 dear-0.144
 sorry-0.142
 fault-0.142
Token at
Token ID379
Feature activation-9.24e+00

Pos logit contributions
|+0.088
growth+0.071
 hunt+0.070
 distance+0.070
 camp+0.068
Neg logit contributions
 sorry-0.084
ushing-0.084
ented-0.083
rier-0.083
ares-0.082
Token the
Token ID262
Feature activation-3.83e+00

Pos logit contributions
|+0.176
fire+0.161
 hunt+0.149
 trek+0.141
 prow+0.131
Neg logit contributions
 fault-0.246
aughter-0.245
 sorry-0.241
 apology-0.232
 mistakes-0.231
Token park
Token ID3952
Feature activation-3.75e+00

Pos logit contributions
|+0.070
 hunt+0.049
growth+0.047
fire+0.045
 land+0.044
Neg logit contributions
 darling-0.119
 dear-0.118
 concerned-0.117
 fault-0.117
 sorry-0.116
Token.
Token ID13
Feature activation-6.44e-01

Pos logit contributions
|+0.089
 camp+0.069
growth+0.068
 hunt+0.067
 outer+0.065
Neg logit contributions
 sorry-0.094
rier-0.093
ushing-0.093
 dear-0.092
 concerned-0.092
Token She
Token ID1375
Feature activation-2.99e+00

Pos logit contributions
|+0.014
 land+0.012
 hunt+0.012
 camp+0.011
 distance+0.011
Neg logit contributions
 concerned-0.018
ealing-0.018
 darling-0.017
ushing-0.017
 sorry-0.017
Token wanted
Token ID2227
Feature activation+2.00e+00

Pos logit contributions
|+0.056
 hunt+0.047
 land+0.045
 camp+0.045
 distance+0.043
Neg logit contributions
 concerned-0.088
 darling-0.087
 dear-0.087
 sorry-0.086
ealing-0.086
TokenL
Token ID43
Feature activation+2.31e+00

Pos logit contributions
 hunt+0.088
 outer+0.078
|+0.077
 roam+0.077
 distance+0.077
Neg logit contributions
 sorry-0.086
ealing-0.082
oking-0.079
ubby-0.078
 mistakes-0.078
Tokenily
Token ID813
Feature activation+6.61e-01

Pos logit contributions
 darling+0.039
 concerned+0.038
ealing+0.038
 dear+0.038
 sorry+0.038
Neg logit contributions
|-0.074
 hunt-0.068
 land-0.067
 camp-0.065
 distance-0.064
Token wanted
Token ID2227
Feature activation+1.04e+00

Pos logit contributions
 darling+0.019
 dear+0.019
ealing+0.019
 concerned+0.019
ushing+0.019
Neg logit contributions
|-0.012
 hunt-0.010
 land-0.010
 camp-0.010
 distance-0.010
Token to
Token ID284
Feature activation-8.37e+00

Pos logit contributions
 darling+0.029
 fault+0.029
 dear+0.028
 sorry+0.028
 concerned+0.028
Neg logit contributions
|-0.023
 hunt-0.018
 land-0.018
 distance-0.017
 camp-0.017
Token come
Token ID1282
Feature activation-2.15e+00

Pos logit contributions
|+0.157
fire+0.116
 distance+0.116
 land+0.112
 camp+0.111
Neg logit contributions
 darling-0.246
 sorry-0.245
 dear-0.242
 apology-0.233
 concerned-0.231
Token to
Token ID284
Feature activation-9.23e+00

Pos logit contributions
|+0.044
 hunt+0.036
 land+0.035
 distance+0.034
 camp+0.033
Neg logit contributions
 sorry-0.060
 darling-0.060
 concerned-0.060
ealing-0.060
 dear-0.059
Token the
Token ID262
Feature activation-4.09e+00

Pos logit contributions
|+0.204
fire+0.200
 distance+0.166
asis+0.159
field+0.153
Neg logit contributions
 daughter-0.226
 dear-0.223
 darling-0.222
 sorry-0.217
 son-0.211
Token party
Token ID2151
Feature activation-2.28e+00

Pos logit contributions
|+0.094
growth+0.069
 hunt+0.066
 traveled+0.064
 trek+0.062
Neg logit contributions
 dear-0.112
 darling-0.110
 sorry-0.109
 concerned-0.106
 unfair-0.101
Token with
Token ID351
Feature activation-5.19e+00

Pos logit contributions
|+0.049
growth+0.038
 land+0.037
 distance+0.037
 miles+0.036
Neg logit contributions
 dear-0.062
 darling-0.062
ushing-0.062
 sorry-0.061
rier-0.060
Token her
Token ID607
Feature activation-3.70e-01

Pos logit contributions
|+0.142
 hunt+0.123
 land+0.120
fire+0.119
 distance+0.118
Neg logit contributions
 sorry-0.118
 dear-0.115
 darling-0.114
 concerned-0.108
 daughters-0.106
Token big
Token ID1263
Feature activation-6.77e-03

Pos logit contributions
|+0.012
 hunt+0.010
 land+0.010
 camp+0.010
 distance+0.010
Neg logit contributions
 darling-0.007
 concerned-0.007
 sorry-0.006
 dear-0.006
ealing-0.006
Tokenily
Token ID813
Feature activation+3.45e-01

Pos logit contributions
 darling+0.042
 concerned+0.042
ealing+0.042
 sorry+0.041
 dear+0.041
Neg logit contributions
|-0.079
 hunt-0.073
 land-0.072
 camp-0.070
 distance-0.068
Token and
Token ID290
Feature activation-3.51e+00

Pos logit contributions
 darling+0.010
 dear+0.010
 concerned+0.010
ealing+0.010
 sorry+0.009
Neg logit contributions
|-0.007
 hunt-0.006
 land-0.005
 distance-0.005
 camp-0.005
Token Ben
Token ID3932
Feature activation+1.31e+00

Pos logit contributions
|+0.077
 hunt+0.067
 land+0.066
 camp+0.064
 distance+0.061
Neg logit contributions
 sorry-0.095
 darling-0.095
 concerned-0.094
 dear-0.094
ealing-0.093
Token were
Token ID547
Feature activation+5.28e-01

Pos logit contributions
ealing+0.039
 darling+0.039
 sorry+0.039
 dear+0.039
 concerned+0.038
Neg logit contributions
|-0.023
 hunt-0.019
 distance-0.018
 land-0.018
 camp-0.018
Token playing
Token ID2712
Feature activation-6.30e-01

Pos logit contributions
rier+0.012
 darling+0.012
ealing+0.011
 dear+0.011
 apology+0.011
Neg logit contributions
|-0.014
 land-0.013
 camp-0.013
 hunt-0.012
 play-0.012
Token at
Token ID379
Feature activation-9.19e+00

Pos logit contributions
|+0.015
 distance+0.013
Ray+0.011
 hunt+0.011
field+0.011
Neg logit contributions
 sorry-0.016
 concerned-0.014
 mistakes-0.014
rier-0.014
ushing-0.014
Token the
Token ID262
Feature activation-4.82e+00

Pos logit contributions
|+0.173
fire+0.161
 hunt+0.148
Buzz+0.139
 trek+0.137
Neg logit contributions
 fault-0.241
 sorry-0.220
 mistakes-0.219
aughter-0.215
 daughters-0.214
Token beach
Token ID10481
Feature activation-3.43e+00

Pos logit contributions
|+0.110
growth+0.076
 miles+0.074
fire+0.073
 traveled+0.072
Neg logit contributions
 darling-0.121
 dear-0.118
 fault-0.118
 concerned-0.115
 unfair-0.113
Token.
Token ID13
Feature activation-5.47e-01

Pos logit contributions
|+0.082
growth+0.063
 camp+0.059
 outer+0.056
 hunt+0.056
Neg logit contributions
rier-0.088
ushing-0.086
 concerned-0.085
 darling-0.084
 sorry-0.083
Token They
Token ID1119
Feature activation-2.77e+00

Pos logit contributions
|+0.012
 land+0.010
 hunt+0.010
 camp+0.010
 distance+0.009
Neg logit contributions
 darling-0.015
 concerned-0.015
rier-0.015
 dear-0.015
ealing-0.015
Token liked
Token ID8288
Feature activation+1.32e+00

Pos logit contributions
|+0.052
 camp+0.041
 hunt+0.041
 land+0.039
growth+0.038
Neg logit contributions
 concerned-0.082
 sorry-0.081
ealing-0.080
 darling-0.079
 dear-0.078
Token remembering
Token ID24865
Feature activation+2.27e+00

Pos logit contributions
 darling+0.084
ealing+0.084
 concerned+0.083
 sorry+0.083
 dear+0.083
Neg logit contributions
|-0.088
 land-0.079
 hunt-0.078
 camp-0.075
 distance-0.073
Token<|endoftext|>
Token ID50256
Feature activation-4.73e+00

Pos logit contributions
 concerned+0.063
 darling+0.063
 sorry+0.063
 dear+0.062
 fault+0.060
Neg logit contributions
|-0.053
 hunt-0.042
 land-0.040
 camp-0.039
 distance-0.038
TokenJohn
Token ID7554
Feature activation-3.22e-01

Pos logit contributions
 hunt+0.085
|+0.082
 land+0.073
 distance+0.071
fire+0.070
Neg logit contributions
ealing-0.122
 concerned-0.120
 darling-0.119
 sorry-0.118
oking-0.118
Token was
Token ID373
Feature activation+1.44e+00

Pos logit contributions
|+0.007
 land+0.006
 hunt+0.006
 camp+0.006
 distance+0.006
Neg logit contributions
 concerned-0.009
 darling-0.009
ealing-0.009
 sorry-0.008
 dear-0.008
Token preparing
Token ID10629
Feature activation-2.34e+00

Pos logit contributions
 darling+0.039
 concerned+0.039
ealing+0.039
 sorry+0.038
 dear+0.038
Neg logit contributions
|-0.031
 hunt-0.027
 land-0.027
 camp-0.026
 distance-0.025
Token to
Token ID284
Feature activation-9.18e+00

Pos logit contributions
|+0.048
 hunt+0.037
 land+0.036
 distance+0.034
 camp+0.033
Neg logit contributions
 darling-0.070
 dear-0.069
 sorry-0.069
 concerned-0.069
 fault-0.066
Token go
Token ID467
Feature activation-1.60e+00

Pos logit contributions
|+0.197
 distance+0.144
 land+0.141
 hunt+0.141
 camp+0.139
Neg logit contributions
 darling-0.255
 sorry-0.252
 dear-0.246
 concerned-0.241
 ok-0.238
Token fishing
Token ID12478
Feature activation-4.62e+00

Pos logit contributions
|+0.034
 land+0.028
 hunt+0.027
 distance+0.026
growth+0.025
Neg logit contributions
 sorry-0.044
 concerned-0.043
 darling-0.043
ealing-0.042
 dear-0.042
Token.
Token ID13
Feature activation-5.28e-01

Pos logit contributions
|+0.101
growth+0.075
 land+0.074
 hunt+0.072
 distance+0.071
Neg logit contributions
 darling-0.124
 concerned-0.123
ealing-0.122
rier-0.120
 sorry-0.120
Token He
Token ID679
Feature activation-2.66e+00

Pos logit contributions
|+0.011
 land+0.009
 hunt+0.009
 camp+0.009
 distance+0.009
Neg logit contributions
 concerned-0.015
ealing-0.015
 sorry-0.015
ushing-0.015
 darling-0.015
Token grabbed
Token ID13176
Feature activation-6.52e-01

Pos logit contributions
|+0.051
 hunt+0.039
 land+0.039
 distance+0.037
 camp+0.036
Neg logit contributions
 concerned-0.079
ealing-0.079
 darling-0.078
 dear-0.077
 sorry-0.076
Tokenum
Token ID388
Feature activation-5.14e-01

Pos logit contributions
 darling+0.054
 concerned+0.054
ealing+0.054
 dear+0.054
 sorry+0.053
Neg logit contributions
|-0.058
 hunt-0.052
 land-0.052
 camp-0.049
 distance-0.048
Token and
Token ID290
Feature activation-3.22e+00

Pos logit contributions
|+0.012
 hunt+0.010
 land+0.010
 distance+0.010
 camp+0.010
Neg logit contributions
 darling-0.013
 dear-0.013
 concerned-0.013
ealing-0.013
 sorry-0.013
Token Dad
Token ID17415
Feature activation-1.04e-01

Pos logit contributions
|+0.061
 hunt+0.052
 land+0.051
 camp+0.048
 distance+0.046
Neg logit contributions
 darling-0.096
 concerned-0.096
 sorry-0.096
 dear-0.095
ealing-0.095
Token took
Token ID1718
Feature activation-1.07e+00

Pos logit contributions
|+0.002
 hunt+0.002
 land+0.002
 camp+0.002
 distance+0.002
Neg logit contributions
 darling-0.003
 concerned-0.003
ealing-0.003
 sorry-0.003
 dear-0.003
Token Daisy
Token ID40355
Feature activation-2.63e+00

Pos logit contributions
|+0.023
 hunt+0.021
 land+0.021
 distance+0.019
 camp+0.019
Neg logit contributions
 darling-0.028
ealing-0.028
 sorry-0.028
 dear-0.028
 concerned-0.028
Token to
Token ID284
Feature activation-9.17e+00

Pos logit contributions
|+0.056
 land+0.048
 hunt+0.046
 distance+0.046
 miles+0.045
Neg logit contributions
 concerned-0.070
 darling-0.069
 fault-0.067
 sorry-0.067
 dear-0.067
Token school
Token ID1524
Feature activation-1.78e+00

Pos logit contributions
|+0.185
fire+0.148
 distance+0.144
field+0.142
asis+0.140
Neg logit contributions
 sorry-0.258
 darling-0.257
 dear-0.253
 daughter-0.249
 son-0.235
Token one
Token ID530
Feature activation-1.89e+00

Pos logit contributions
|+0.034
 land+0.027
 hunt+0.027
 distance+0.026
 camp+0.025
Neg logit contributions
 darling-0.053
 dear-0.053
 sorry-0.053
 concerned-0.053
ealing-0.052
Token day
Token ID1110
Feature activation+1.30e-01

Pos logit contributions
|+0.039
 hunt+0.032
 land+0.030
 camp+0.030
growth+0.028
Neg logit contributions
 darling-0.054
 dear-0.053
 concerned-0.053
 sorry-0.053
ushing-0.052
Token.
Token ID13
Feature activation-9.53e-01

Pos logit contributions
 darling+0.004
 concerned+0.004
ealing+0.004
 sorry+0.004
 dear+0.004
Neg logit contributions
|-0.002
 hunt-0.002
 land-0.002
 camp-0.002
 distance-0.002
Token Daisy
Token ID40355
Feature activation+3.69e-01

Pos logit contributions
|+0.021
 hunt+0.018
 land+0.018
 camp+0.018
 distance+0.017
Neg logit contributions
 concerned-0.025
ealing-0.025
 darling-0.025
rier-0.025
 fault-0.025
Token large
Token ID1588
Feature activation+1.00e+00

Pos logit contributions
|+0.042
 hunt+0.037
 land+0.037
 camp+0.035
 distance+0.034
Neg logit contributions
 darling-0.033
 concerned-0.033
 sorry-0.033
 dear-0.033
ealing-0.032
Token cop
Token ID2243
Feature activation+5.14e-02

Pos logit contributions
 darling+0.026
 concerned+0.025
 sorry+0.025
 dear+0.025
ealing+0.025
Neg logit contributions
|-0.023
 hunt-0.020
 land-0.020
 camp-0.019
 distance-0.018
Token brought
Token ID3181
Feature activation-1.71e+00

Pos logit contributions
 sorry+0.001
 dear+0.001
 concerned+0.001
 darling+0.001
ealing+0.001
Neg logit contributions
|-0.001
 distance-0.001
 camp-0.001
 hunt-0.001
 land-0.001
Token his
Token ID465
Feature activation-5.18e-01

Pos logit contributions
|+0.032
 hunt+0.026
 land+0.026
 distance+0.024
 camp+0.024
Neg logit contributions
 sorry-0.052
ealing-0.051
 concerned-0.051
 darling-0.050
 mistakes-0.050
Token car
Token ID1097
Feature activation-2.70e+00

Pos logit contributions
|+0.014
 hunt+0.013
 land+0.013
 camp+0.012
 distance+0.012
Neg logit contributions
 darling-0.011
 concerned-0.011
ealing-0.011
 sorry-0.011
 dear-0.011
Token to
Token ID284
Feature activation-9.16e+00

Pos logit contributions
|+0.062
growth+0.049
 land+0.049
 camp+0.048
 miles+0.047
Neg logit contributions
 sorry-0.067
 dear-0.067
ushing-0.066
 darling-0.066
ented-0.066
Token the
Token ID262
Feature activation-3.35e+00

Pos logit contributions
|+0.212
field+0.162
st+0.158
fire+0.149
asis+0.147
Neg logit contributions
 sorry-0.260
 dear-0.239
 apology-0.235
 ok-0.225
 darling-0.223
Token park
Token ID3952
Feature activation-3.23e+00

Pos logit contributions
|+0.071
 hunt+0.050
growth+0.049
 miles+0.047
of+0.045
Neg logit contributions
 sorry-0.094
 dear-0.094
 unfair-0.091
 darling-0.090
 mistakes-0.089
Token.
Token ID13
Feature activation-3.79e-01

Pos logit contributions
|+0.078
growth+0.066
 camp+0.061
 land+0.060
 miles+0.060
Neg logit contributions
 sorry-0.077
 dear-0.074
 concerned-0.074
 darling-0.074
ushing-0.073
Token He
Token ID679
Feature activation-2.25e+00

Pos logit contributions
|+0.009
 hunt+0.008
 land+0.008
 camp+0.007
 miles+0.007
Neg logit contributions
 sorry-0.009
 dear-0.009
 darling-0.009
 concerned-0.009
ealing-0.009
Token wanted
Token ID2227
Feature activation+2.17e+00

Pos logit contributions
|+0.048
growth+0.042
 distance+0.041
 hunt+0.039
 land+0.038
Neg logit contributions
 dear-0.056
 sorry-0.056
 concerned-0.056
 darling-0.055
ealing-0.053
Token Lily
Token ID20037
Feature activation+6.49e-01

Pos logit contributions
|+0.057
 hunt+0.048
 land+0.047
 camp+0.046
 distance+0.043
Neg logit contributions
 sorry-0.113
 darling-0.110
 dear-0.110
 concerned-0.108
ealing-0.108
Token were
Token ID547
Feature activation+7.02e-01

Pos logit contributions
ealing+0.018
 sorry+0.017
 darling+0.017
 dear+0.017
 fault+0.017
Neg logit contributions
|-0.013
 hunt-0.011
 distance-0.011
 camp-0.010
 land-0.010
Token eager
Token ID11069
Feature activation+4.38e+00

Pos logit contributions
rier+0.018
 darling+0.017
 dear+0.016
ealing+0.016
 apology+0.016
Neg logit contributions
|-0.017
 camp-0.015
 land-0.015
 play-0.014
 hunt-0.014
Token to
Token ID284
Feature activation-7.02e+00

Pos logit contributions
 darling+0.132
 dear+0.132
 sorry+0.131
 fault+0.130
 concerned+0.129
Neg logit contributions
|-0.082
 land-0.067
 hunt-0.066
 camp-0.062
 distance-0.061
Token go
Token ID467
Feature activation-2.08e+00

Pos logit contributions
|+0.121
 distance+0.091
fire+0.088
 land+0.084
 hunt+0.083
Neg logit contributions
 sorry-0.220
 darling-0.218
 dear-0.216
 concerned-0.209
ealing-0.208
Token to
Token ID284
Feature activation-9.16e+00

Pos logit contributions
|+0.042
 land+0.035
growth+0.034
 distance+0.034
 hunt+0.033
Neg logit contributions
 sorry-0.055
 concerned-0.054
 darling-0.054
ealing-0.053
 dear-0.053
Token the
Token ID262
Feature activation-4.72e+00

Pos logit contributions
|+0.179
fire+0.169
 distance+0.145
Buzz+0.130
growth+0.128
Neg logit contributions
 darling-0.257
 dear-0.255
 sorry-0.251
 daughter-0.249
 fault-0.243
Token beach
Token ID10481
Feature activation-3.66e+00

Pos logit contributions
|+0.103
growth+0.073
e+0.072
 miles+0.071
 hunt+0.069
Neg logit contributions
 dear-0.128
 darling-0.126
 sorry-0.126
 concerned-0.124
 unfair-0.122
Token.
Token ID13
Feature activation-1.27e+00

Pos logit contributions
|+0.080
growth+0.066
 distance+0.060
 camp+0.059
 land+0.058
Neg logit contributions
rier-0.094
 darling-0.094
ushing-0.094
 concerned-0.094
 sorry-0.093
Token They
Token ID1119
Feature activation-3.28e+00

Pos logit contributions
|+0.024
 land+0.020
 play+0.020
 hunt+0.020
 camp+0.020
Neg logit contributions
rier-0.039
 concerned-0.039
ushing-0.038
ealing-0.038
 darling-0.037
Token liked
Token ID8288
Feature activation+1.18e+00

Pos logit contributions
|+0.064
 hunt+0.053
 land+0.052
 camp+0.050
 distance+0.049
Neg logit contributions
 darling-0.094
 concerned-0.094
ealing-0.093
 dear-0.093
 sorry-0.093
Token<|endoftext|>
Token ID50256
Feature activation-4.66e+00

Pos logit contributions
 darling+0.028
 concerned+0.028
ealing+0.028
 sorry+0.028
 dear+0.027
Neg logit contributions
|-0.017
 hunt-0.015
 land-0.014
 camp-0.013
 distance-0.013
TokenHe
Token ID1544
Feature activation-3.25e+00

Pos logit contributions
 hunt+0.090
|+0.080
 land+0.078
 miles+0.075
 distance+0.075
Neg logit contributions
ealing-0.114
 darling-0.112
 sorry-0.110
 fault-0.109
oking-0.109
Tokenidi
Token ID19830
Feature activation-4.36e-01

Pos logit contributions
|+0.065
 hunt+0.052
 camp+0.050
 distance+0.050
 land+0.050
Neg logit contributions
 darling-0.094
 sorry-0.093
 concerned-0.093
 dear-0.092
ealing-0.092
Token was
Token ID373
Feature activation+1.60e+00

Pos logit contributions
|+0.008
 hunt+0.007
 land+0.007
 camp+0.006
 distance+0.006
Neg logit contributions
 darling-0.013
 concerned-0.013
 dear-0.013
ealing-0.013
 sorry-0.013
Token playing
Token ID2712
Feature activation-7.50e-01

Pos logit contributions
 darling+0.043
ealing+0.043
 concerned+0.043
 dear+0.043
 sorry+0.043
Neg logit contributions
|-0.034
 land-0.030
 hunt-0.029
 camp-0.028
 distance-0.027
Token at
Token ID379
Feature activation-9.15e+00

Pos logit contributions
|+0.016
 distance+0.013
 hunt+0.013
Ray+0.012
field+0.012
Neg logit contributions
 sorry-0.020
 concerned-0.020
 darling-0.019
 ok-0.019
 mistakes-0.018
Token the
Token ID262
Feature activation-4.31e+00

Pos logit contributions
|+0.175
fire+0.165
 hunt+0.150
Buzz+0.135
 trek+0.135
Neg logit contributions
 fault-0.240
 mistakes-0.238
 darling-0.223
 sorry-0.217
 daughters-0.214
Token beach
Token ID10481
Feature activation-3.14e+00

Pos logit contributions
|+0.084
growth+0.060
 hunt+0.059
 land+0.055
 miles+0.055
Neg logit contributions
 darling-0.124
 concerned-0.121
 dear-0.121
 mistakes-0.118
 fault-0.117
Token.
Token ID13
Feature activation-6.74e-01

Pos logit contributions
|+0.072
 hunt+0.053
 camp+0.053
 land+0.052
growth+0.052
Neg logit contributions
 darling-0.083
 concerned-0.083
ushing-0.083
rier-0.082
 dear-0.081
Token She
Token ID1375
Feature activation-2.74e+00

Pos logit contributions
|+0.016
 land+0.014
 hunt+0.014
 distance+0.014
 camp+0.013
Neg logit contributions
ushing-0.017
 darling-0.017
 concerned-0.017
 dear-0.016
 sorry-0.016
Token saw
Token ID2497
Feature activation-4.98e-01

Pos logit contributions
|+0.048
 hunt+0.038
 camp+0.036
 land+0.035
 distance+0.034
Neg logit contributions
 concerned-0.084
 darling-0.083
ealing-0.083
 sorry-0.082
 dear-0.081
Token sit
Token ID1650
Feature activation-9.65e-01

Pos logit contributions
|+0.140
 hunt+0.114
growth+0.113
 land+0.109
 camp+0.108
Neg logit contributions
 sorry-0.171
 darling-0.164
 dear-0.162
 apology-0.161
 ok-0.160
Token in
Token ID287
Feature activation-6.03e+00

Pos logit contributions
|+0.018
 land+0.016
 hunt+0.016
 camp+0.015
 distance+0.014
Neg logit contributions
 darling-0.029
 concerned-0.029
ealing-0.028
 dear-0.028
 sorry-0.028
Token the
Token ID262
Feature activation-2.64e+00

Pos logit contributions
|+0.102
 hunt+0.085
 land+0.072
 camp+0.071
fire+0.070
Neg logit contributions
 sorry-0.191
 darling-0.189
 mistakes-0.187
 dear-0.186
 fault-0.186
Token corner
Token ID5228
Feature activation-3.42e+00

Pos logit contributions
|+0.052
 hunt+0.045
 land+0.041
 camp+0.039
fire+0.038
Neg logit contributions
 sorry-0.078
 darling-0.076
 concerned-0.075
 dear-0.075
 fault-0.075
Token and
Token ID290
Feature activation-3.00e+00

Pos logit contributions
|+0.073
 hunt+0.062
 land+0.060
 camp+0.058
 distance+0.055
Neg logit contributions
 darling-0.094
 dear-0.093
 sorry-0.093
 concerned-0.093
ealing-0.091
Token think
Token ID892
Feature activation+7.43e+00

Pos logit contributions
|+0.062
growth+0.053
 hunt+0.047
 camp+0.047
ground+0.046
Neg logit contributions
 sorry-0.089
 dear-0.081
 darling-0.081
 concerned-0.078
 apology-0.078
Token about
Token ID546
Feature activation-2.78e+00

Pos logit contributions
 darling+0.169
 concerned+0.168
 sorry+0.167
ealing+0.167
 dear+0.166
Neg logit contributions
|-0.192
 hunt-0.172
 land-0.171
 camp-0.163
 distance-0.160
Token what
Token ID644
Feature activation+1.89e+00

Pos logit contributions
|+0.076
 distance+0.066
 hunt+0.063
 land+0.062
 camp+0.060
Neg logit contributions
 darling-0.065
 dear-0.064
 sorry-0.064
 fault-0.062
 mistakes-0.061
Token you
Token ID345
Feature activation+3.12e+00

Pos logit contributions
 darling+0.045
 sorry+0.044
 concerned+0.044
ealing+0.044
 dear+0.044
Neg logit contributions
|-0.047
 hunt-0.042
 land-0.042
 camp-0.040
 distance-0.039
Token did
Token ID750
Feature activation+2.27e+00

Pos logit contributions
ealing+0.060
 sorry+0.059
 dear+0.056
 darling+0.055
 concerned+0.055
Neg logit contributions
|-0.091
 distance-0.080
growth-0.079
 land-0.078
 camp-0.078
Token.
Token ID13
Feature activation+1.86e+00

Pos logit contributions
 concerned+0.076
 darling+0.075
ealing+0.075
 dear+0.074
 sorry+0.074
Neg logit contributions
|-0.034
 land-0.029
 hunt-0.029
 camp-0.026
 distance-0.025
Token.
Token ID13
Feature activation+2.28e+00

Pos logit contributions
|+0.042
 land+0.034
 camp+0.034
 hunt+0.034
 distance+0.032
Neg logit contributions
 fault-0.057
 sorry-0.056
 concerned-0.056
 darling-0.056
ushing-0.056
Token She
Token ID1375
Feature activation-2.22e+00

Pos logit contributions
 concerned+0.059
ealing+0.059
rier+0.059
 darling+0.059
ushing+0.058
Neg logit contributions
|-0.051
 land-0.045
 hunt-0.044
 camp-0.042
 distance-0.041
Token is
Token ID318
Feature activation+3.00e+00

Pos logit contributions
|+0.063
 land+0.058
 hunt+0.058
 camp+0.056
 play+0.053
Neg logit contributions
rier-0.044
 darling-0.044
 concerned-0.044
ealing-0.043
 dear-0.043
Token sad
Token ID6507
Feature activation+6.23e+00

Pos logit contributions
 darling+0.059
ealing+0.058
 concerned+0.058
 dear+0.057
rier+0.057
Neg logit contributions
|-0.086
 land-0.079
 hunt-0.078
 camp-0.075
 distance-0.074
Token and
Token ID290
Feature activation+6.56e-01

Pos logit contributions
 darling+0.203
 concerned+0.203
ealing+0.202
 sorry+0.201
 dear+0.200
Neg logit contributions
|-0.101
 hunt-0.084
 land-0.083
 camp-0.077
 distance-0.073
Token mad
Token ID8805
Feature activation+8.04e+00

Pos logit contributions
 darling+0.009
ealing+0.009
 dear+0.008
 concerned+0.008
 fault+0.008
Neg logit contributions
|-0.023
 land-0.021
 hunt-0.021
 camp-0.021
 distance-0.020
Token.
Token ID13
Feature activation+2.67e+00

Pos logit contributions
 darling+0.311
 concerned+0.310
 fault+0.301
 dear+0.301
 apology+0.299
Neg logit contributions
 land-0.059
|-0.052
 hunt-0.047
 play-0.042
 camp-0.040
Token She
Token ID1375
Feature activation-1.31e+00

Pos logit contributions
rier+0.074
ealing+0.072
 concerned+0.072
 darling+0.071
ushing+0.071
Neg logit contributions
|-0.057
 land-0.049
 hunt-0.048
 distance-0.046
 play-0.045
Token runs
Token ID4539
Feature activation+1.94e+00

Pos logit contributions
|+0.040
 land+0.038
 hunt+0.038
 camp+0.036
 play+0.035
Neg logit contributions
rier-0.023
 darling-0.023
 concerned-0.022
ushing-0.022
 fault-0.022
Token after
Token ID706
Feature activation-3.96e+00

Pos logit contributions
 concerned+0.059
 sorry+0.058
ealing+0.058
 darling+0.058
 dear+0.057
Neg logit contributions
|-0.036
 hunt-0.030
 land-0.030
 camp-0.028
 distance-0.026
Token Tom
Token ID4186
Feature activation+4.59e-01

Pos logit contributions
|+0.099
 hunt+0.082
 land+0.081
 distance+0.079
fire+0.076
Neg logit contributions
 sorry-0.104
 dear-0.094
 darling-0.093
 concerned-0.092
 mistakes-0.090
TokenThat
Token ID2504
Feature activation+4.06e+00

Pos logit contributions
 darling+0.004
 concerned+0.004
 dear+0.004
ealing+0.004
rier+0.004
Neg logit contributions
|-0.005
 hunt-0.005
 land-0.004
 camp-0.004
 distance-0.004
Token sounds
Token ID5238
Feature activation+3.99e+00

Pos logit contributions
 concerned+0.109
 darling+0.109
ealing+0.109
 sorry+0.108
 dear+0.108
Neg logit contributions
|-0.088
 hunt-0.077
 land-0.077
 camp-0.073
 distance-0.070
Token like
Token ID588
Feature activation+1.91e+00

Pos logit contributions
 concerned+0.101
ealing+0.100
rier+0.098
ushing+0.098
 darling+0.097
Neg logit contributions
|-0.094
 land-0.084
 hunt-0.084
 camp-0.081
 distance-0.076
Token a
Token ID257
Feature activation+1.18e-01

Pos logit contributions
 darling+0.056
 dear+0.055
 sorry+0.055
 fault+0.053
 concerned+0.053
Neg logit contributions
|-0.040
 hunt-0.033
 land-0.032
 distance-0.031
 camp-0.029
Token great
Token ID1049
Feature activation+5.01e+00

Pos logit contributions
 concerned+0.003
 darling+0.003
ealing+0.003
 sorry+0.003
 dear+0.003
Neg logit contributions
|-0.003
 land-0.003
 hunt-0.003
 camp-0.002
 distance-0.002
Token idea
Token ID2126
Feature activation+7.34e+00

Pos logit contributions
ealing+0.093
 concerned+0.092
ushing+0.091
rier+0.090
ented+0.089
Neg logit contributions
|-0.151
 land-0.143
 hunt-0.141
 camp-0.139
 play-0.130
Token!"
Token ID2474
Feature activation-1.19e+00

Pos logit contributions
 concerned+0.235
ealing+0.233
ented+0.227
 sorry+0.227
rier+0.226
Neg logit contributions
|-0.121
 hunt-0.108
 land-0.106
 camp-0.098
 distance-0.089
Token
Token ID198
Feature activation-5.38e+00

Pos logit contributions
|+0.024
 hunt+0.022
 land+0.022
 camp+0.020
 distance+0.020
Neg logit contributions
 darling-0.034
 dear-0.033
 sorry-0.033
 concerned-0.032
ealing-0.032
Token
Token ID198
Feature activation-4.28e+00

Pos logit contributions
 hunt+0.094
 land+0.087
|+0.084
 distance+0.079
 camp+0.074
Neg logit contributions
 darling-0.162
 concerned-0.161
 sorry-0.160
 dear-0.156
 mistakes-0.156
TokenThey
Token ID2990
Feature activation-2.51e+00

Pos logit contributions
 hunt+0.084
|+0.083
 land+0.079
 camp+0.074
 distance+0.071
Neg logit contributions
 darling-0.118
 concerned-0.118
ealing-0.115
 dear-0.113
 mistakes-0.112
Token both
Token ID1111
Feature activation+3.78e-02

Pos logit contributions
|+0.058
 hunt+0.049
 distance+0.048
 land+0.044
fire+0.044
Neg logit contributions
 concerned-0.062
ented-0.062
ealing-0.060
 sorry-0.060
 daughters-0.059
Token smiled
Token ID13541
Feature activation+5.70e+00

Pos logit contributions
 concerned+0.034
 darling+0.034
 dear+0.033
ealing+0.033
 sorry+0.033
Neg logit contributions
|-0.029
 hunt-0.025
 land-0.025
 camp-0.024
 distance-0.024
Token and
Token ID290
Feature activation-2.28e-01

Pos logit contributions
 fault+0.170
 darling+0.168
ealing+0.167
 dear+0.166
ented+0.164
Neg logit contributions
|-0.094
 land-0.091
 hunt-0.085
 camp-0.083
 distance-0.081
Token said
Token ID531
Feature activation+5.27e+00

Pos logit contributions
|+0.006
 hunt+0.005
 land+0.005
 camp+0.005
 distance+0.005
Neg logit contributions
 darling-0.005
 concerned-0.005
 sorry-0.005
 dear-0.005
ealing-0.005
Token,
Token ID11
Feature activation+4.19e+00

Pos logit contributions
 fault+0.153
 darling+0.153
 concerned+0.153
ealing+0.152
ushing+0.150
Neg logit contributions
|-0.094
 land-0.085
 hunt-0.084
 camp-0.080
 distance-0.072
Token "
Token ID366
Feature activation+1.00e+00

Pos logit contributions
 fault+0.093
ealing+0.092
 darling+0.091
ushing+0.090
 concerned+0.089
Neg logit contributions
|-0.106
 hunt-0.098
 land-0.098
 camp-0.094
 distance-0.091
TokenYes
Token ID5297
Feature activation+8.36e+00

Pos logit contributions
 darling+0.025
 concerned+0.025
 dear+0.024
ealing+0.024
rier+0.024
Neg logit contributions
|-0.024
 hunt-0.021
 land-0.021
 camp-0.021
 distance-0.019
Token,
Token ID11
Feature activation+5.60e+00

Pos logit contributions
 concerned+0.253
ealing+0.249
 fault+0.247
ushing+0.243
ented+0.241
Neg logit contributions
|-0.145
 land-0.131
 hunt-0.128
 camp-0.125
 play-0.111
Token please
Token ID3387
Feature activation+1.47e+00

Pos logit contributions
ealing+0.136
 concerned+0.135
 fault+0.132
ushing+0.131
 darling+0.130
Neg logit contributions
|-0.133
 land-0.122
 hunt-0.121
 camp-0.116
 distance-0.111
Token king
Token ID5822
Feature activation+1.16e+00

Pos logit contributions
 darling+0.040
 dear+0.039
 sorry+0.039
 concerned+0.039
ealing+0.039
Neg logit contributions
|-0.033
 land-0.028
 hunt-0.028
 camp-0.026
 distance-0.026
Token.
Token ID13
Feature activation+3.17e+00

Pos logit contributions
 darling+0.031
 dear+0.031
 fault+0.029
ealing+0.029
ushing+0.029
Neg logit contributions
|-0.027
 land-0.023
 hunt-0.022
 distance-0.022
 camp-0.021
Token I
Token ID314
Feature activation+2.66e-02

Pos logit contributions
 concerned+0.067
ealing+0.066
 darling+0.064
 fault+0.064
rier+0.064
Neg logit contributions
|-0.087
 hunt-0.078
 land-0.078
 camp-0.075
 distance-0.072
Token declared
Token ID6875
Feature activation+3.86e+00

Pos logit contributions
|+0.043
 hunt+0.037
 land+0.037
 camp+0.035
 distance+0.035
Neg logit contributions
 concerned-0.042
 darling-0.042
 sorry-0.042
 dear-0.041
ealing-0.041
Token that
Token ID326
Feature activation+1.12e+00

Pos logit contributions
ealing+0.113
 fault+0.113
ushing+0.113
 darling+0.112
ented+0.112
Neg logit contributions
|-0.069
 land-0.060
 hunt-0.060
 camp-0.058
 play-0.053
Token it
Token ID340
Feature activation+1.91e+00

Pos logit contributions
 darling+0.029
 concerned+0.029
 sorry+0.028
 dear+0.028
ealing+0.028
Neg logit contributions
|-0.026
 hunt-0.023
 land-0.022
 camp-0.021
 distance-0.021
Token was
Token ID373
Feature activation+3.09e+00

Pos logit contributions
ealing+0.068
 darling+0.068
 concerned+0.068
rier+0.068
 sorry+0.068
Neg logit contributions
|-0.025
 land-0.019
 hunt-0.019
 camp-0.017
 distance-0.016
Token the
Token ID262
Feature activation+2.76e-01

Pos logit contributions
ealing+0.089
 darling+0.089
 concerned+0.088
rier+0.088
ushing+0.088
Neg logit contributions
|-0.060
 land-0.053
 hunt-0.052
 camp-0.050
 distance-0.047
Token funn
Token ID36090
Feature activation+7.53e+00

Pos logit contributions
ealing+0.006
ushing+0.006
 concerned+0.006
 darling+0.006
 sorry+0.006
Neg logit contributions
|-0.007
 land-0.007
 hunt-0.007
 camp-0.006
 distance-0.006
Tokenest
Token ID395
Feature activation+5.80e+00

Pos logit contributions
 concerned+0.180
 sorry+0.171
ealing+0.170
 darling+0.170
ented+0.167
Neg logit contributions
|-0.187
 land-0.171
 hunt-0.165
 camp-0.162
 distance-0.149
Token day
Token ID1110
Feature activation+1.94e+00

Pos logit contributions
 concerned+0.151
ealing+0.150
 darling+0.150
 sorry+0.149
 dear+0.147
Neg logit contributions
|-0.130
 hunt-0.115
 land-0.115
 camp-0.109
 distance-0.105
Token ever
Token ID1683
Feature activation+1.83e+00

Pos logit contributions
 darling+0.058
 sorry+0.058
 dear+0.057
 concerned+0.057
ealing+0.056
Neg logit contributions
|-0.036
 hunt-0.030
 land-0.030
 camp-0.029
 distance-0.027
Token!
Token ID0
Feature activation-6.96e-01

Pos logit contributions
 darling+0.053
 dear+0.052
ealing+0.051
ushing+0.051
 fault+0.050
Neg logit contributions
|-0.037
 hunt-0.031
 land-0.030
 camp-0.030
 distance-0.029
Token<|endoftext|>
Token ID50256
Feature activation-5.09e+00

Pos logit contributions
|+0.010
 hunt+0.008
 land+0.008
 camp+0.008
 distance+0.007
Neg logit contributions
 darling-0.024
 concerned-0.023
 sorry-0.023
 dear-0.023
ealing-0.023
Token was
Token ID373
Feature activation+2.85e+00

Pos logit contributions
|+0.029
 hunt+0.025
 land+0.024
 distance+0.023
 camp+0.023
Neg logit contributions
 darling-0.040
 concerned-0.039
 dear-0.039
 sorry-0.039
ushing-0.038
Token done
Token ID1760
Feature activation+1.40e+00

Pos logit contributions
 darling+0.072
ealing+0.071
 concerned+0.071
 dear+0.071
rier+0.070
Neg logit contributions
|-0.066
 land-0.059
 hunt-0.058
 camp-0.056
 distance-0.054
Token,
Token ID11
Feature activation+3.08e+00

Pos logit contributions
ealing+0.042
ented+0.042
 darling+0.041
ushing+0.041
rier+0.041
Neg logit contributions
|-0.025
 land-0.023
 camp-0.023
 hunt-0.022
 play-0.021
Token her
Token ID607
Feature activation+3.96e+00

Pos logit contributions
 darling+0.070
 dear+0.069
 sorry+0.069
 concerned+0.067
 ok+0.064
Neg logit contributions
|-0.084
 hunt-0.077
 land-0.071
 distance-0.068
 camp-0.067
Token mom
Token ID1995
Feature activation+5.17e-01

Pos logit contributions
ealing+0.088
rier+0.086
ushing+0.084
 concerned+0.084
ares+0.084
Neg logit contributions
|-0.103
 land-0.099
 camp-0.095
 hunt-0.094
 play-0.090
Token said
Token ID531
Feature activation+5.09e+00

Pos logit contributions
 darling+0.014
 concerned+0.014
ealing+0.014
 sorry+0.014
 dear+0.014
Neg logit contributions
|-0.011
 hunt-0.010
 land-0.010
 camp-0.009
 distance-0.009
Token,
Token ID11
Feature activation+4.33e+00

Pos logit contributions
 fault+0.175
ushing+0.173
ealing+0.170
ented+0.170
ares+0.168
Neg logit contributions
|-0.055
 land-0.053
 camp-0.050
 the-0.049
 hunt-0.048
Token
Token ID6184
Feature activation+7.53e+00

Pos logit contributions
 fault+0.107
ealing+0.104
ushing+0.102
rier+0.099
ented+0.098
Neg logit contributions
|-0.100
 land-0.091
 hunt-0.090
 camp-0.089
 visit-0.084
Token
Token ID95
Feature activation+4.34e+00

Pos logit contributions
 concerned+0.178
 fault+0.176
 darling+0.175
 sorry+0.173
rier+0.173
Neg logit contributions
|-0.189
 hunt-0.168
 land-0.168
 camp-0.160
 distance-0.150
Token
Token ID26391
Feature activation+5.51e+00

Pos logit contributions
 concerned+0.098
 darling+0.097
 sorry+0.096
ealing+0.096
 dear+0.095
Neg logit contributions
|-0.114
 hunt-0.101
 land-0.101
 camp-0.096
 distance-0.094
Token
Token ID129
Feature activation+3.50e+00

Pos logit contributions
 darling+0.136
 concerned+0.135
ealing+0.135
 sorry+0.134
 dear+0.133
Neg logit contributions
|-0.132
 hunt-0.117
 land-0.116
 camp-0.111
 distance-0.108
Token,
Token ID11
Feature activation+2.86e+00

Pos logit contributions
rier+0.077
 concerned+0.077
 fault+0.076
ealing+0.076
 darling+0.075
Neg logit contributions
|-0.042
 land-0.039
 hunt-0.035
 camp-0.034
 distance-0.033
Token Jill
Token ID22772
Feature activation+2.08e+00

Pos logit contributions
 darling+0.082
 sorry+0.081
 dear+0.081
 concerned+0.081
ealing+0.079
Neg logit contributions
|-0.059
 hunt-0.050
 land-0.048
 camp-0.046
 distance-0.044
Token spotted
Token ID13489
Feature activation-4.50e-01

Pos logit contributions
 concerned+0.060
 darling+0.059
ealing+0.059
ushing+0.059
 dear+0.058
Neg logit contributions
|-0.042
 hunt-0.034
 camp-0.033
 land-0.033
 distance-0.031
Token something
Token ID1223
Feature activation+4.36e-01

Pos logit contributions
|+0.010
 hunt+0.008
 land+0.007
 camp+0.007
 distance+0.007
Neg logit contributions
 dear-0.013
 darling-0.013
 sorry-0.013
 concerned-0.013
 fault-0.013
Token very
Token ID845
Feature activation+4.22e+00

Pos logit contributions
ealing+0.012
 darling+0.012
 dear+0.011
 concerned+0.011
 fault+0.011
Neg logit contributions
|-0.009
 land-0.008
 hunt-0.008
 camp-0.008
 distance-0.008
Token different
Token ID1180
Feature activation+1.68e+00

Pos logit contributions
 fault+0.093
ealing+0.092
 darling+0.092
 sorry+0.091
 dear+0.091
Neg logit contributions
|-0.104
 land-0.103
 hunt-0.097
 distance-0.093
 camp-0.093
Token in
Token ID287
Feature activation-4.69e+00

Pos logit contributions
 darling+0.052
 concerned+0.051
 dear+0.051
ealing+0.051
rier+0.051
Neg logit contributions
|-0.031
 hunt-0.025
 land-0.024
 camp-0.023
 distance-0.022
Token the
Token ID262
Feature activation-2.90e+00

Pos logit contributions
|+0.081
 hunt+0.063
fire+0.051
 camp+0.050
 land+0.049
Neg logit contributions
 concerned-0.149
 darling-0.149
 dear-0.145
 sorry-0.144
 mistakes-0.141
Token landscape
Token ID10747
Feature activation-4.89e+00

Pos logit contributions
|+0.053
 hunt+0.040
growth+0.034
Buzz+0.034
 camp+0.033
Neg logit contributions
 darling-0.087
 dear-0.086
ented-0.085
 concerned-0.085
ealing-0.084
Token.
Token ID13
Feature activation+7.31e-01

Pos logit contributions
|+0.107
 hunt+0.085
 camp+0.082
 land+0.078
growth+0.076
Neg logit contributions
rier-0.132
 darling-0.132
 concerned-0.131
ushing-0.129
 dear-0.129
Token It
Token ID632
Feature activation-2.50e+00

Pos logit contributions
rier+0.019
ealing+0.019
 concerned+0.019
 fault+0.018
 sorry+0.018
Neg logit contributions
|-0.016
 land-0.015
 hunt-0.014
 camp-0.013
 visit-0.013
Token saw
Token ID2497
Feature activation+4.02e-01

Pos logit contributions
|+0.055
 camp+0.042
 hunt+0.041
 land+0.041
 distance+0.039
Neg logit contributions
 concerned-0.077
 sorry-0.075
 darling-0.074
 dear-0.073
 ashamed-0.072
Token that
Token ID326
Feature activation+2.73e-01

Pos logit contributions
 sorry+0.013
 dear+0.012
 concerned+0.012
 darling+0.012
rier+0.011
Neg logit contributions
|-0.009
 hunt-0.007
 distance-0.006
 land-0.006
 camp-0.006
Token it
Token ID340
Feature activation-6.06e-02

Pos logit contributions
 sorry+0.008
 concerned+0.007
 dear+0.007
 darling+0.007
 ok+0.007
Neg logit contributions
|-0.006
 hunt-0.005
growth-0.004
ground-0.004
 camp-0.004
Token was
Token ID373
Feature activation+2.63e+00

Pos logit contributions
|+0.001
 camp+0.001
 hunt+0.001
 distance+0.001
 land+0.001
Neg logit contributions
 concerned-0.002
 sorry-0.002
ushing-0.002
 darling-0.002
 dear-0.002
Token a
Token ID257
Feature activation+2.29e-01

Pos logit contributions
 sorry+0.078
 concerned+0.076
 darling+0.074
 ok+0.074
 dear+0.074
Neg logit contributions
|-0.057
 hunt-0.046
 land-0.043
 camp-0.042
 distance-0.040
Token big
Token ID1263
Feature activation+1.62e+00

Pos logit contributions
 darling+0.006
 sorry+0.006
 dear+0.006
 concerned+0.006
rier+0.005
Neg logit contributions
|-0.006
 hunt-0.005
 land-0.005
 camp-0.005
 distance-0.004
Token box
Token ID3091
Feature activation-1.19e+00

Pos logit contributions
ealing+0.048
 concerned+0.047
 darling+0.047
ushing+0.046
rier+0.046
Neg logit contributions
|-0.030
 land-0.029
 hunt-0.028
 camp-0.026
 distance-0.025
Token.
Token ID13
Feature activation+1.10e+00

Pos logit contributions
|+0.022
 land+0.017
 hunt+0.017
 camp+0.017
 distance+0.016
Neg logit contributions
 concerned-0.036
 sorry-0.036
 darling-0.036
 dear-0.036
rier-0.036
Token She
Token ID1375
Feature activation-2.05e+00

Pos logit contributions
ealing+0.028
 fault+0.028
 concerned+0.027
ushing+0.027
rier+0.027
Neg logit contributions
|-0.025
 hunt-0.023
 land-0.022
 camp-0.022
 play-0.021
Token tried
Token ID3088
Feature activation+1.68e+00

Pos logit contributions
|+0.045
 hunt+0.038
 land+0.038
 camp+0.037
 distance+0.036
Neg logit contributions
 darling-0.056
 concerned-0.056
 dear-0.055
ealing-0.054
 sorry-0.054
Token to
Token ID284
Feature activation-6.82e+00

Pos logit contributions
 concerned+0.053
 darling+0.052
 sorry+0.052
ealing+0.052
 dear+0.051
Neg logit contributions
|-0.030
 hunt-0.024
 land-0.024
 camp-0.023
 distance-0.022
Token boy
Token ID2933
Feature activation+1.45e+00

Pos logit contributions
 fault+0.066
ealing+0.066
 concerned+0.065
 darling+0.064
ushing+0.064
Neg logit contributions
|-0.078
 land-0.074
 hunt-0.071
 camp-0.071
 distance-0.067
Token named
Token ID3706
Feature activation+3.24e+00

Pos logit contributions
 sorry+0.041
 concerned+0.040
 dear+0.039
ushing+0.039
 darling+0.039
Neg logit contributions
|-0.029
 camp-0.025
 hunt-0.024
 distance-0.024
 land-0.023
Token Tim
Token ID5045
Feature activation+1.51e+00

Pos logit contributions
 sorry+0.071
 darling+0.070
 dear+0.069
 ok+0.064
ented+0.063
Neg logit contributions
|-0.085
 land-0.081
 distance-0.078
 hunt-0.076
field-0.074
Tokenmy
Token ID1820
Feature activation+1.53e+00

Pos logit contributions
 fault+0.031
ushing+0.029
 concerned+0.029
 darling+0.029
 sorry+0.029
Neg logit contributions
|-0.041
ground-0.037
 camp-0.037
fire-0.037
 hunt-0.036
Token.
Token ID13
Feature activation+1.65e-01

Pos logit contributions
 sorry+0.040
 concerned+0.040
ushing+0.039
 dear+0.039
 darling+0.039
Neg logit contributions
|-0.034
 camp-0.028
 hunt-0.027
ground-0.027
 miles-0.027
Token Tim
Token ID5045
Feature activation+1.60e+00

Pos logit contributions
 darling+0.004
 sorry+0.004
 concerned+0.004
ealing+0.004
 dear+0.004
Neg logit contributions
|-0.004
 land-0.003
 hunt-0.003
 camp-0.003
 distance-0.003
Tokenmy
Token ID1820
Feature activation+1.82e+00

Pos logit contributions
 fault+0.037
 darling+0.037
ealing+0.036
 concerned+0.036
 sorry+0.036
Neg logit contributions
|-0.038
 hunt-0.035
 camp-0.034
 land-0.034
ground-0.032
Token loved
Token ID6151
Feature activation+2.01e+00

Pos logit contributions
 darling+0.048
 sorry+0.048
 concerned+0.047
 dear+0.047
ealing+0.047
Neg logit contributions
|-0.039
 hunt-0.034
 land-0.033
 camp-0.033
 distance-0.032
Token to
Token ID284
Feature activation-5.71e+00

Pos logit contributions
 sorry+0.055
 dear+0.055
 concerned+0.054
 darling+0.054
 fault+0.054
Neg logit contributions
|-0.043
 land-0.034
 hunt-0.033
 distance-0.033
fire-0.031
Token play
Token ID711
Feature activation+5.93e-02

Pos logit contributions
|+0.081
 distance+0.057
 land+0.049
 clearing+0.049
fire+0.047
Neg logit contributions
 sorry-0.202
 dear-0.200
 darling-0.199
 apology-0.189
 concerned-0.188
Token with
Token ID351
Feature activation-3.88e+00

Pos logit contributions
 sorry+0.002
ushing+0.001
 concerned+0.001
 dear+0.001
ealing+0.001
Neg logit contributions
|-0.001
 distance-0.001
field-0.001
growth-0.001
 land-0.001
Token all
Token ID477
Feature activation-1.99e+00

Pos logit contributions
|+0.034
 hunt+0.031
growth+0.030
ground+0.030
 distance+0.029
Neg logit contributions
 sorry-0.045
 concerned-0.042
ealing-0.042
 ashamed-0.041
ented-0.041
Token danced
Token ID39480
Feature activation+4.94e-01

Pos logit contributions
 hunt+0.038
growth+0.036
|+0.036
 camp+0.033
ground+0.032
Neg logit contributions
 sorry-0.055
ealing-0.053
 ashamed-0.052
 concerned-0.051
 darling-0.051
Token together
Token ID1978
Feature activation-9.51e-01

Pos logit contributions
 sorry+0.012
ealing+0.012
 darling+0.012
 concerned+0.012
rier+0.011
Neg logit contributions
|-0.012
 hunt-0.011
 land-0.010
 distance-0.010
growth-0.010
Token to
Token ID284
Feature activation-6.53e+00

Pos logit contributions
|+0.021
 hunt+0.017
growth+0.016
 distance+0.016
 camp+0.016
Neg logit contributions
 sorry-0.025
 fault-0.025
 concerned-0.025
rier-0.024
 darling-0.024
Token the
Token ID262
Feature activation-1.80e+00

Pos logit contributions
|+0.130
 distance+0.097
field+0.091
fire+0.091
 hunt+0.090
Neg logit contributions
 sorry-0.202
 apology-0.182
 mistakes-0.181
 fault-0.181
 dear-0.181
Token slow
Token ID3105
Feature activation+3.42e+00

Pos logit contributions
|+0.039
 hunt+0.030
growth+0.028
 land+0.028
 camp+0.026
Neg logit contributions
 darling-0.050
 concerned-0.050
 sorry-0.050
 dear-0.050
 mistakes-0.049
Token beat
Token ID4405
Feature activation+1.78e+00

Pos logit contributions
 darling+0.089
 dear+0.088
 concerned+0.087
ealing+0.087
 fault+0.086
Neg logit contributions
|-0.079
 hunt-0.066
 land-0.064
 camp-0.061
 distance-0.060
Token of
Token ID286
Feature activation-4.20e+00

Pos logit contributions
 sorry+0.049
 concerned+0.048
ealing+0.048
 darling+0.048
rier+0.048
Neg logit contributions
|-0.037
 hunt-0.033
 land-0.031
 camp-0.029
 distance-0.029
Token the
Token ID262
Feature activation-1.25e+00

Pos logit contributions
|+0.084
 hunt+0.065
 land+0.060
 distance+0.058
 camp+0.056
Neg logit contributions
 sorry-0.128
 concerned-0.125
 fault-0.123
 mistakes-0.123
 darling-0.123
Token music
Token ID2647
Feature activation-1.03e+00

Pos logit contributions
|+0.026
 hunt+0.022
 land+0.021
 camp+0.020
 distance+0.019
Neg logit contributions
 concerned-0.035
 darling-0.035
 sorry-0.035
 dear-0.035
ealing-0.035
Token.
Token ID13
Feature activation-1.22e+00

Pos logit contributions
|+0.022
 hunt+0.017
 land+0.016
growth+0.016
 camp+0.015
Neg logit contributions
 sorry-0.028
 fault-0.028
 darling-0.028
 dear-0.028
 concerned-0.028
Token tape
Token ID9154
Feature activation-4.09e+00

Pos logit contributions
|+0.053
 hunt+0.044
 land+0.043
 camp+0.042
 distance+0.041
Neg logit contributions
 darling-0.053
rier-0.052
 dear-0.052
ealing-0.051
 concerned-0.051
Token to
Token ID284
Feature activation-7.25e+00

Pos logit contributions
|+0.083
 hunt+0.068
 land+0.067
 camp+0.065
 distance+0.062
Neg logit contributions
 darling-0.117
 concerned-0.116
rier-0.116
 dear-0.115
 sorry-0.115
Token measure
Token ID3953
Feature activation-3.61e+00

Pos logit contributions
|+0.198
 camp+0.150
 hunt+0.142
 distance+0.134
ground+0.133
Neg logit contributions
 sorry-0.175
 dear-0.174
 darling-0.173
rier-0.170
 concerned-0.164
Token the
Token ID262
Feature activation-1.84e+00

Pos logit contributions
|+0.073
 hunt+0.060
 land+0.057
 camp+0.055
 distance+0.054
Neg logit contributions
 darling-0.107
 dear-0.105
rier-0.105
 concerned-0.104
ealing-0.104
Token table
Token ID3084
Feature activation-5.44e+00

Pos logit contributions
|+0.045
 hunt+0.039
 land+0.038
 camp+0.037
 distance+0.036
Neg logit contributions
 darling-0.046
 dear-0.045
 concerned-0.045
ealing-0.044
 sorry-0.044
Token and
Token ID290
Feature activation-3.12e+00

Pos logit contributions
|+0.097
 hunt+0.076
 land+0.074
 camp+0.071
 distance+0.068
Neg logit contributions
 darling-0.171
 concerned-0.170
rier-0.170
 sorry-0.169
 dear-0.169
Token a
Token ID257
Feature activation-1.45e+00

Pos logit contributions
|+0.076
 hunt+0.061
 camp+0.059
 land+0.059
 distance+0.057
Neg logit contributions
 darling-0.081
 dear-0.080
 sorry-0.080
 concerned-0.080
rier-0.078
Token meter
Token ID16430
Feature activation-7.55e-01

Pos logit contributions
|+0.034
 hunt+0.030
 land+0.030
 camp+0.028
 distance+0.027
Neg logit contributions
 darling-0.037
 concerned-0.036
 sorry-0.036
 dear-0.036
ealing-0.036
Token stick
Token ID4859
Feature activation-3.42e+00

Pos logit contributions
|+0.011
 land+0.010
 hunt+0.010
 camp+0.009
 distance+0.008
Neg logit contributions
 concerned-0.025
ealing-0.025
 darling-0.025
 sorry-0.025
 dear-0.024
Token to
Token ID284
Feature activation-7.47e+00

Pos logit contributions
|+0.057
 hunt+0.046
 land+0.045
 camp+0.043
 distance+0.041
Neg logit contributions
 darling-0.109
 concerned-0.109
 sorry-0.108
ealing-0.108
 dear-0.108
Token measure
Token ID3953
Feature activation-3.60e+00

Pos logit contributions
|+0.184
 camp+0.138
 hunt+0.132
 distance+0.123
 land+0.123
Neg logit contributions
 dear-0.202
 darling-0.201
 sorry-0.200
rier-0.194
 apology-0.191
Token tri
Token ID1333
Feature activation+3.08e+00

Pos logit contributions
 sorry+0.020
 darling+0.020
 concerned+0.020
 dear+0.020
ealing+0.020
Neg logit contributions
|-0.016
 hunt-0.014
 land-0.013
 camp-0.013
 distance-0.013
Tokenpped
Token ID1496
Feature activation-1.77e+00

Pos logit contributions
 concerned+0.059
ealing+0.056
ushing+0.056
ented+0.054
 sorry+0.053
Neg logit contributions
|-0.087
 hunt-0.086
 land-0.081
 visit-0.081
 camp-0.079
Token over
Token ID625
Feature activation-2.70e+00

Pos logit contributions
|+0.038
 hunt+0.033
 land+0.032
 camp+0.030
growth+0.028
Neg logit contributions
 concerned-0.047
 sorry-0.047
 fault-0.047
rier-0.046
 darling-0.045
Token his
Token ID465
Feature activation+1.85e+00

Pos logit contributions
|+0.050
 hunt+0.042
 land+0.038
field+0.037
fire+0.034
Neg logit contributions
 sorry-0.082
 fault-0.082
 darling-0.082
 mistakes-0.081
 concerned-0.081
Token own
Token ID898
Feature activation+9.20e-01

Pos logit contributions
 darling+0.038
 concerned+0.038
 sorry+0.037
 dear+0.037
ealing+0.037
Neg logit contributions
|-0.053
 hunt-0.047
 land-0.047
 camp-0.045
 distance-0.044
Token feet
Token ID3625
Feature activation-1.16e+00

Pos logit contributions
 darling+0.022
 concerned+0.021
 dear+0.021
 sorry+0.021
 fault+0.021
Neg logit contributions
|-0.023
 hunt-0.020
 land-0.020
 camp-0.019
 distance-0.019
Token.
Token ID13
Feature activation-5.59e-01

Pos logit contributions
|+0.021
 hunt+0.017
 land+0.017
 camp+0.016
 distance+0.015
Neg logit contributions
 concerned-0.035
 darling-0.035
 sorry-0.035
rier-0.035
 fault-0.035
Token He
Token ID679
Feature activation-1.74e+00

Pos logit contributions
|+0.011
 land+0.009
 hunt+0.009
 camp+0.009
 distance+0.008
Neg logit contributions
 darling-0.016
 concerned-0.016
 sorry-0.016
 dear-0.016
ealing-0.016
Token had
Token ID550
Feature activation-5.66e-02

Pos logit contributions
|+0.038
 hunt+0.033
 camp+0.031
ground+0.030
 land+0.030
Neg logit contributions
 concerned-0.046
 sorry-0.044
 darling-0.044
 dear-0.043
ealing-0.043
Token lots
Token ID6041
Feature activation+1.01e+00

Pos logit contributions
|+0.001
 hunt+0.001
 land+0.001
 camp+0.001
ground+0.001
Neg logit contributions
 fault-0.002
 sorry-0.002
 concerned-0.002
 ok-0.002
 darling-0.002
Token of
Token ID286
Feature activation-4.24e+00

Pos logit contributions
 darling+0.029
 concerned+0.029
ealing+0.029
 sorry+0.029
 dear+0.029
Neg logit contributions
|-0.020
 hunt-0.017
 land-0.017
 camp-0.016
 distance-0.015
Token saw
Token ID2497
Feature activation+1.38e-01

Pos logit contributions
|+0.015
 outer+0.014
 hunt+0.014
 forests+0.013
 valleys+0.013
Neg logit contributions
 sorry-0.019
 dear-0.018
 concerned-0.018
 darling-0.018
 ashamed-0.018
Token the
Token ID262
Feature activation-2.40e-01

Pos logit contributions
 sorry+0.004
 dear+0.004
 darling+0.004
 concerned+0.004
 fault+0.004
Neg logit contributions
|-0.003
 hunt-0.002
 land-0.002
 distance-0.002
 camp-0.002
Token living
Token ID2877
Feature activation+3.70e-01

Pos logit contributions
|+0.006
 hunt+0.005
 land+0.005
 camp+0.005
 distance+0.005
Neg logit contributions
 sorry-0.006
 darling-0.006
 concerned-0.006
 dear-0.006
rier-0.006
Token room
Token ID2119
Feature activation-3.39e+00

Pos logit contributions
 concerned+0.010
 darling+0.009
 sorry+0.009
rier+0.009
 dear+0.009
Neg logit contributions
|-0.008
 hunt-0.007
 land-0.007
 camp-0.007
 distance-0.007
Token,
Token ID11
Feature activation+3.30e+00

Pos logit contributions
|+0.071
 hunt+0.062
 land+0.062
 camp+0.058
 distance+0.056
Neg logit contributions
 darling-0.093
 concerned-0.093
ealing-0.092
 sorry-0.092
 dear-0.091
Token she
Token ID673
Feature activation-9.07e-01

Pos logit contributions
 sorry+0.079
 dear+0.077
 darling+0.076
 concerned+0.074
rier+0.072
Neg logit contributions
|-0.089
 hunt-0.077
 land-0.072
 distance-0.071
 camp-0.069
Token was
Token ID373
Feature activation+3.50e+00

Pos logit contributions
|+0.019
 hunt+0.016
 distance+0.015
 camp+0.015
 outer+0.015
Neg logit contributions
 concerned-0.026
 sorry-0.026
 dear-0.025
 darling-0.025
ealing-0.025
Token not
Token ID407
Feature activation+1.66e+00

Pos logit contributions
 sorry+0.075
 concerned+0.074
ealing+0.071
 darling+0.071
 dear+0.070
Neg logit contributions
|-0.099
 hunt-0.090
 land-0.087
 camp-0.084
 distance-0.083
Token happy
Token ID3772
Feature activation+5.06e+00

Pos logit contributions
 darling+0.034
 concerned+0.034
ealing+0.034
 dear+0.033
 sorry+0.033
Neg logit contributions
|-0.047
 land-0.043
 hunt-0.042
 camp-0.041
 distance-0.040
Token.
Token ID13
Feature activation+1.51e+00

Pos logit contributions
 concerned+0.168
ealing+0.168
 darling+0.168
 sorry+0.166
 dear+0.165
Neg logit contributions
|-0.076
 land-0.065
 hunt-0.064
 camp-0.059
 distance-0.055
Token She
Token ID1375
Feature activation-1.93e+00

Pos logit contributions
 concerned+0.038
ealing+0.038
 darling+0.037
rier+0.037
ushing+0.037
Neg logit contributions
|-0.035
 land-0.031
 hunt-0.030
 camp-0.029
 distance-0.028
Token the
Token ID262
Feature activation-3.70e+00

Pos logit contributions
|+0.128
 hunt+0.106
 traveled+0.098
 distance+0.098
 trek+0.096
Neg logit contributions
 sorry-0.155
 mistakes-0.155
 darling-0.154
 fault-0.154
rier-0.149
Token neighborhood
Token ID6232
Feature activation-3.12e+00

Pos logit contributions
|+0.076
 hunt+0.059
growth+0.056
Ray+0.052
 roam+0.050
Neg logit contributions
 darling-0.101
 dear-0.100
 mistakes-0.100
 fault-0.097
 concerned-0.095
Token.
Token ID13
Feature activation-7.11e-01

Pos logit contributions
|+0.066
 hunt+0.051
 camp+0.050
 land+0.049
growth+0.049
Neg logit contributions
 darling-0.088
 dear-0.087
 concerned-0.087
 sorry-0.086
rier-0.085
Token He
Token ID679
Feature activation-2.03e+00

Pos logit contributions
|+0.014
 hunt+0.013
 land+0.013
 camp+0.012
 distance+0.012
Neg logit contributions
 darling-0.020
 dear-0.019
 sorry-0.019
 concerned-0.019
ealing-0.019
Token even
Token ID772
Feature activation+1.51e+00

Pos logit contributions
|+0.045
 hunt+0.036
 camp+0.036
growth+0.035
 distance+0.035
Neg logit contributions
 concerned-0.052
 dear-0.051
 darling-0.051
 sorry-0.050
ealing-0.049
Token rode
Token ID22075
Feature activation-7.90e-01

Pos logit contributions
 concerned+0.038
 dear+0.037
 sorry+0.037
ealing+0.037
 darling+0.036
Neg logit contributions
|-0.034
 hunt-0.029
 camp-0.029
 land-0.027
field-0.027
Token it
Token ID340
Feature activation-3.62e+00

Pos logit contributions
|+0.017
 hunt+0.015
 land+0.014
 camp+0.014
growth+0.013
Neg logit contributions
 sorry-0.022
 concerned-0.021
rier-0.021
ealing-0.021
 darling-0.021
Token in
Token ID287
Feature activation-6.64e+00

Pos logit contributions
|+0.083
growth+0.068
 camp+0.066
 hunt+0.066
 land+0.065
Neg logit contributions
 concerned-0.087
 sorry-0.087
 fault-0.086
 darling-0.084
ented-0.084
Token the
Token ID262
Feature activation-3.38e+00

Pos logit contributions
|+0.125
 hunt+0.095
field+0.090
fire+0.089
Buzz+0.087
Neg logit contributions
 fault-0.191
 darling-0.190
 sorry-0.190
 mistakes-0.188
 dear-0.185
Token rain
Token ID6290
Feature activation-1.95e+00

Pos logit contributions
|+0.080
 hunt+0.071
 camp+0.062
fire+0.061
 land+0.060
Neg logit contributions
 sorry-0.088
 concerned-0.084
 dear-0.084
ealing-0.083
 darling-0.083
Token and
Token ID290
Feature activation-3.02e+00

Pos logit contributions
|+0.047
growth+0.041
 hunt+0.040
 camp+0.039
 land+0.038
Neg logit contributions
 fault-0.045
 darling-0.045
rier-0.044
 dear-0.044
ushing-0.043
Token I
Token ID314
Feature activation-3.04e-01

Pos logit contributions
 sorry+0.061
 darling+0.059
 dear+0.059
 concerned+0.057
rier+0.057
Neg logit contributions
|-0.068
 hunt-0.059
 land-0.057
 distance-0.056
 camp-0.055
Token have
Token ID423
Feature activation-3.84e-01

Pos logit contributions
|+0.007
 hunt+0.006
 distance+0.006
 land+0.006
 camp+0.006
Neg logit contributions
 sorry-0.007
 fault-0.007
 concerned-0.007
 darling-0.007
ealing-0.007
Token no
Token ID645
Feature activation+1.96e+00

Pos logit contributions
|+0.009
 hunt+0.007
 land+0.007
 distance+0.007
 camp+0.007
Neg logit contributions
 fault-0.010
 sorry-0.010
 dear-0.010
 darling-0.010
 mistakes-0.010
Token friends
Token ID2460
Feature activation-3.10e-01

Pos logit contributions
 darling+0.043
 sorry+0.043
 dear+0.042
 fault+0.042
 concerned+0.042
Neg logit contributions
|-0.054
 hunt-0.046
 land-0.046
 camp-0.044
 distance-0.043
Token to
Token ID284
Feature activation-5.38e+00

Pos logit contributions
|+0.006
 hunt+0.005
 land+0.005
 camp+0.004
 distance+0.004
Neg logit contributions
 darling-0.009
 concerned-0.009
ealing-0.009
 sorry-0.009
 dear-0.009
Token play
Token ID711
Feature activation-5.61e-01

Pos logit contributions
|+0.098
 valleys+0.081
growth+0.081
 clearing+0.078
ground+0.078
Neg logit contributions
 sorry-0.172
 apology-0.157
 dear-0.156
 ok-0.156
 darling-0.156
Token with
Token ID351
Feature activation-3.73e+00

Pos logit contributions
|+0.011
 distance+0.010
 trek+0.009
 land+0.009
field+0.009
Neg logit contributions
 sorry-0.017
rier-0.016
 darling-0.015
 concerned-0.015
 dear-0.015
Token."
Token ID526
Feature activation-3.65e-01

Pos logit contributions
|+0.094
 hunt+0.079
 distance+0.078
 land+0.076
 camp+0.075
Neg logit contributions
 sorry-0.091
 darling-0.089
 dear-0.089
 mistakes-0.086
rier-0.085
Token
Token ID198
Feature activation-4.94e+00

Pos logit contributions
|+0.006
 land+0.006
 hunt+0.006
 camp+0.005
 distance+0.005
Neg logit contributions
 darling-0.011
 sorry-0.011
 dear-0.011
ealing-0.011
 concerned-0.011
Token
Token ID198
Feature activation-4.01e+00

Pos logit contributions
|+0.094
 hunt+0.092
 land+0.087
 distance+0.082
 camp+0.081
Neg logit contributions
 concerned-0.132
 sorry-0.131
 apology-0.130
 darling-0.130
ealing-0.129
TokenThe
Token ID464
Feature activation+7.13e-01

Pos logit contributions
|+0.063
 hunt+0.053
 camp+0.050
 land+0.050
 distance+0.043
Neg logit contributions
 concerned-0.133
ealing-0.130
 darling-0.128
 sorry-0.126
ushing-0.126
Token "
Token ID366
Feature activation+8.27e-01

Pos logit contributions
ealing+0.097
 fault+0.096
 darling+0.096
 concerned+0.094
ushing+0.094
Neg logit contributions
|-0.116
 hunt-0.106
 land-0.105
 camp-0.102
 distance-0.098
TokenI
Token ID40
Feature activation+1.02e+00

Pos logit contributions
 concerned+0.026
 darling+0.026
 dear+0.025
ealing+0.025
 sorry+0.025
Neg logit contributions
|-0.015
 hunt-0.013
 land-0.012
 camp-0.012
 distance-0.011
Token lost
Token ID2626
Feature activation+2.71e+00

Pos logit contributions
 darling+0.030
 concerned+0.030
ealing+0.030
 sorry+0.029
 dear+0.029
Neg logit contributions
|-0.020
 hunt-0.017
 land-0.017
 camp-0.016
 distance-0.015
Token my
Token ID616
Feature activation+2.64e+00

Pos logit contributions
 darling+0.061
 sorry+0.059
 dear+0.059
 concerned+0.058
rier+0.057
Neg logit contributions
|-0.073
 hunt-0.065
 land-0.064
 camp-0.061
 distance-0.060
Token toy
Token ID13373
Feature activation+5.86e-01

Pos logit contributions
ealing+0.065
 concerned+0.063
ushing+0.063
 darling+0.062
ented+0.062
Neg logit contributions
|-0.063
 land-0.059
 hunt-0.058
 camp-0.055
 distance-0.054
Token in
Token ID287
Feature activation-5.96e+00

Pos logit contributions
 darling+0.016
 sorry+0.016
 dear+0.016
 concerned+0.016
rier+0.016
Neg logit contributions
|-0.012
 land-0.010
 hunt-0.010
 camp-0.010
 distance-0.009
Token the
Token ID262
Feature activation-3.08e+00

Pos logit contributions
|+0.145
 hunt+0.125
 trek+0.116
fire+0.114
 camp+0.110
Neg logit contributions
 darling-0.144
 dear-0.143
 sorry-0.143
 mistakes-0.137
 fault-0.136
Token fountain
Token ID29420
Feature activation-2.74e+00

Pos logit contributions
|+0.063
 hunt+0.053
growth+0.048
 trek+0.047
 camp+0.047
Neg logit contributions
 sorry-0.089
 concerned-0.086
ealing-0.084
 darling-0.084
 dear-0.084
Token."
Token ID526
Feature activation-2.43e-01

Pos logit contributions
|+0.057
 hunt+0.045
 land+0.045
 camp+0.044
growth+0.043
Neg logit contributions
 darling-0.077
 dear-0.076
rier-0.076
 sorry-0.075
 concerned-0.075
Token Tim
Token ID5045
Feature activation+1.38e+00

Pos logit contributions
|+0.005
 hunt+0.004
 land+0.004
 camp+0.004
 distance+0.004
Neg logit contributions
 darling-0.007
 concerned-0.007
ealing-0.007
 sorry-0.007
 dear-0.007
Token wanted
Token ID2227
Feature activation+2.40e+00

Pos logit contributions
 concerned+0.037
 darling+0.037
ealing+0.037
 sorry+0.036
 dear+0.036
Neg logit contributions
|-0.030
 hunt-0.026
 land-0.026
 camp-0.025
 distance-0.024
Token called
Token ID1444
Feature activation+2.63e+00

Pos logit contributions
 concerned+0.040
 darling+0.039
 sorry+0.039
 dear+0.038
ushing+0.037
Neg logit contributions
|-0.024
 hunt-0.020
 camp-0.019
 land-0.018
 distance-0.018
Token Clara
Token ID27443
Feature activation+2.76e+00

Pos logit contributions
 sorry+0.058
 darling+0.058
 dear+0.056
 concerned+0.055
 fault+0.053
Neg logit contributions
|-0.070
 hunt-0.064
 land-0.063
 distance-0.061
 camp-0.059
Token.
Token ID13
Feature activation+3.10e-01

Pos logit contributions
 darling+0.079
 concerned+0.078
 dear+0.077
 sorry+0.076
ealing+0.076
Neg logit contributions
|-0.058
 hunt-0.050
 camp-0.046
 land-0.045
 distance-0.045
Token She
Token ID1375
Feature activation-1.81e+00

Pos logit contributions
 concerned+0.007
ealing+0.007
 darling+0.007
 sorry+0.007
 dear+0.007
Neg logit contributions
|-0.008
 land-0.007
 hunt-0.007
 camp-0.006
 distance-0.006
Token loved
Token ID6151
Feature activation+3.00e+00

Pos logit contributions
|+0.036
 hunt+0.031
 land+0.029
 camp+0.029
 distance+0.029
Neg logit contributions
 darling-0.053
 concerned-0.052
 dear-0.052
 sorry-0.051
ealing-0.050
Token to
Token ID284
Feature activation-5.06e+00

Pos logit contributions
 darling+0.089
 concerned+0.089
 dear+0.088
 sorry+0.088
 fault+0.087
Neg logit contributions
|-0.058
 hunt-0.048
 land-0.047
 camp-0.044
 distance-0.044
Token go
Token ID467
Feature activation-2.14e-01

Pos logit contributions
|+0.081
 distance+0.058
 land+0.052
 camp+0.051
 hunt+0.051
Neg logit contributions
 darling-0.168
 sorry-0.167
 dear-0.167
 concerned-0.161
 apology-0.158
Token to
Token ID284
Feature activation-7.93e+00

Pos logit contributions
|+0.004
 land+0.004
growth+0.003
 hunt+0.003
 distance+0.003
Neg logit contributions
 sorry-0.006
 concerned-0.006
 darling-0.006
ealing-0.006
 dear-0.006
Token the
Token ID262
Feature activation-2.94e+00

Pos logit contributions
|+0.149
fire+0.133
 distance+0.110
Ar+0.099
Buzz+0.099
Neg logit contributions
 fault-0.233
 sorry-0.232
 dear-0.232
 darling-0.231
 mistakes-0.230
Token opera
Token ID27296
Feature activation-1.61e+00

Pos logit contributions
|+0.064
 hunt+0.043
growth+0.042
 miles+0.040
fire+0.039
Neg logit contributions
 dear-0.083
 darling-0.082
 sorry-0.080
 concerned-0.080
 fault-0.079
Token.
Token ID13
Feature activation-7.32e-01

Pos logit contributions
|+0.036
growth+0.028
 hunt+0.027
 camp+0.026
 distance+0.026
Neg logit contributions
 dear-0.043
 darling-0.043
 concerned-0.042
 sorry-0.042
rier-0.041
Token He
Token ID679
Feature activation-2.30e+00

Pos logit contributions
 concerned+0.002
ealing+0.002
 sorry+0.002
 fault+0.002
 darling+0.002
Neg logit contributions
|-0.001
 land-0.001
 camp-0.001
 hunt-0.001
 play-0.001
Token had
Token ID550
Feature activation-1.10e+00

Pos logit contributions
|+0.045
 hunt+0.037
 land+0.036
 distance+0.036
 camp+0.035
Neg logit contributions
 darling-0.066
 dear-0.065
 concerned-0.064
 sorry-0.063
ealing-0.063
Token a
Token ID257
Feature activation-1.98e+00

Pos logit contributions
|+0.023
 hunt+0.019
fire+0.018
 camp+0.017
 land+0.017
Neg logit contributions
 dear-0.030
 sorry-0.030
 fault-0.030
 darling-0.030
 concerned-0.029
Token new
Token ID649
Feature activation+1.71e+00

Pos logit contributions
|+0.041
 land+0.037
 hunt+0.037
 camp+0.035
 distance+0.033
Neg logit contributions
ealing-0.055
rier-0.054
 darling-0.054
 concerned-0.054
ushing-0.054
Token toy
Token ID13373
Feature activation+3.53e-01

Pos logit contributions
ealing+0.044
 concerned+0.044
 darling+0.043
 sorry+0.043
rier+0.043
Neg logit contributions
|-0.039
 land-0.035
 hunt-0.035
 camp-0.033
 distance-0.032
Token to
Token ID284
Feature activation-6.22e+00

Pos logit contributions
 darling+0.010
 sorry+0.010
 dear+0.010
 concerned+0.010
rier+0.009
Neg logit contributions
|-0.008
 land-0.006
 hunt-0.006
 camp-0.006
 distance-0.006
Token play
Token ID711
Feature activation-1.40e-01

Pos logit contributions
|+0.107
 distance+0.084
 valleys+0.076
field+0.073
 forests+0.072
Neg logit contributions
 sorry-0.196
 dear-0.196
 darling-0.195
 apology-0.189
ealing-0.178
Token with
Token ID351
Feature activation-4.53e+00

Pos logit contributions
|+0.003
 distance+0.003
field+0.002
growth+0.002
 hunt+0.002
Neg logit contributions
 sorry-0.004
rier-0.004
 dear-0.003
 concerned-0.003
 darling-0.003
Token outside
Token ID2354
Feature activation-4.52e+00

Pos logit contributions
|+0.108
 distance+0.085
 hunt+0.082
 land+0.080
growth+0.080
Neg logit contributions
 darling-0.112
 mistakes-0.112
 dear-0.110
rier-0.108
 concerned-0.108
Token.
Token ID13
Feature activation-1.07e+00

Pos logit contributions
|+0.094
 hunt+0.076
 land+0.071
 distance+0.069
 camp+0.068
Neg logit contributions
 sorry-0.129
rier-0.128
 concerned-0.127
 darling-0.126
 dear-0.123
Token He
Token ID679
Feature activation-2.29e+00

Pos logit contributions
|+0.025
 hunt+0.022
 land+0.022
 distance+0.021
 camp+0.021
Neg logit contributions
 darling-0.027
 dear-0.026
 sorry-0.026
 concerned-0.026
ealing-0.026
Token Lots
Token ID25024
Feature activation-6.60e-01

Pos logit contributions
|+0.007
 hunt+0.007
 land+0.007
field+0.007
 distance+0.007
Neg logit contributions
 dear-0.007
 darling-0.007
 sorry-0.007
ealing-0.006
ented-0.006
Token of
Token ID286
Feature activation-2.75e+00

Pos logit contributions
|+0.016
 hunt+0.014
 land+0.013
 camp+0.013
growth+0.013
Neg logit contributions
 dear-0.016
 darling-0.016
ushing-0.015
 sorry-0.015
 concerned-0.015
Token people
Token ID661
Feature activation-3.23e+00

Pos logit contributions
|+0.069
 hunt+0.061
 land+0.060
 camp+0.058
 distance+0.056
Neg logit contributions
 darling-0.065
 concerned-0.065
 sorry-0.065
ealing-0.064
 dear-0.064
Token went
Token ID1816
Feature activation+1.12e+00

Pos logit contributions
|+0.062
growth+0.051
 hunt+0.050
 camp+0.049
 distance+0.047
Neg logit contributions
 sorry-0.090
 concerned-0.090
ealing-0.089
 dear-0.088
 darling-0.087
Token there
Token ID612
Feature activation-5.35e+00

Pos logit contributions
 sorry+0.032
 concerned+0.032
ealing+0.031
 dear+0.031
 darling+0.031
Neg logit contributions
|-0.023
 hunt-0.018
growth-0.018
 land-0.018
 distance-0.017
Token to
Token ID284
Feature activation-6.87e+00

Pos logit contributions
|+0.135
growth+0.111
 distance+0.103
 hunt+0.101
 land+0.098
Neg logit contributions
 fault-0.128
 sorry-0.126
 darling-0.124
 ok-0.123
 dear-0.123
Token fly
Token ID6129
Feature activation-3.72e+00

Pos logit contributions
|+0.159
st+0.129
Buzz+0.127
growth+0.125
fire+0.121
Neg logit contributions
 sorry-0.175
 dear-0.166
 apology-0.153
 darling-0.152
 ok-0.150
Token in
Token ID287
Feature activation-6.94e+00

Pos logit contributions
|+0.087
growth+0.084
 hunt+0.075
Buzz+0.071
field+0.070
Neg logit contributions
 fault-0.071
 darling-0.071
 sorry-0.071
 concerned-0.068
ealing-0.068
Token big
Token ID1263
Feature activation-1.52e+00

Pos logit contributions
|+0.143
 hunt+0.113
Buzz+0.108
growth+0.103
fire+0.101
Neg logit contributions
 sorry-0.188
 darling-0.188
 dear-0.186
 mistakes-0.184
 fault-0.183
Token planes
Token ID13016
Feature activation-7.22e+00

Pos logit contributions
|+0.034
 hunt+0.029
 land+0.029
 camp+0.028
 distance+0.027
Neg logit contributions
 darling-0.040
 concerned-0.040
ealing-0.039
 sorry-0.039
 dear-0.039
Token.
Token ID13
Feature activation-1.34e+00

Pos logit contributions
|+0.194
growth+0.172
Buzz+0.161
Ro+0.157
Sky+0.154
Neg logit contributions
ushing-0.131
 sorry-0.130
 dear-0.127
 fault-0.126
 concerned-0.125
Token that
Token ID326
Feature activation+4.21e-01

Pos logit contributions
rier+0.227
ealing+0.222
 concerned+0.213
 darling+0.212
ushing+0.210
Neg logit contributions
 camp-0.105
|-0.103
 land-0.100
 play-0.096
 the-0.095
Token was
Token ID373
Feature activation+5.19e+00

Pos logit contributions
rier+0.016
ealing+0.016
 darling+0.016
 concerned+0.016
 dear+0.016
Neg logit contributions
|-0.005
 land-0.004
 hunt-0.004
 camp-0.003
 distance-0.003
Token cool
Token ID3608
Feature activation+5.10e+00

Pos logit contributions
rier+0.166
ares+0.164
ealing+0.162
ented+0.162
 fault+0.161
Neg logit contributions
|-0.082
 land-0.076
 camp-0.070
 hunt-0.067
 play-0.067
Token and
Token ID290
Feature activation+2.81e-01

Pos logit contributions
 Deal+0.172
ealing+0.171
 apology+0.170
omm+0.170
 concerned+0.168
Neg logit contributions
 land-0.061
 camp-0.057
 play-0.056
 hunt-0.054
|-0.054
Token decided
Token ID3066
Feature activation+5.71e+00

Pos logit contributions
rier+0.008
 fault+0.008
omm+0.008
ares+0.008
ealing+0.008
Neg logit contributions
|-0.005
 land-0.005
 play-0.005
 hunt-0.005
 camp-0.004
Token to
Token ID284
Feature activation-4.91e+00

Pos logit contributions
ealing+0.218
rier+0.217
ushing+0.214
ented+0.214
ares+0.214
Neg logit contributions
|-0.050
 hunt-0.048
 land-0.047
 play-0.042
 visit-0.038
Token try
Token ID1949
Feature activation+8.96e-01

Pos logit contributions
|+0.100
 hunt+0.084
 land+0.082
 distance+0.079
 camp+0.078
Neg logit contributions
 darling-0.141
 sorry-0.139
 dear-0.138
 concerned-0.137
ealing-0.136
Token soaking
Token ID41714
Feature activation-8.96e-02

Pos logit contributions
 darling+0.027
 concerned+0.027
ealing+0.027
 sorry+0.027
 dear+0.026
Neg logit contributions
|-0.017
 hunt-0.014
 land-0.014
 camp-0.013
 distance-0.013
Token more
Token ID517
Feature activation-3.23e-01

Pos logit contributions
|+0.001
 land+0.001
 camp+0.001
 hunt+0.001
 distance+0.001
Neg logit contributions
ealing-0.003
 dear-0.003
 concerned-0.003
 darling-0.003
 fault-0.003
Token things
Token ID1243
Feature activation-1.70e+00

Pos logit contributions
|+0.006
 land+0.005
 hunt+0.005
 camp+0.005
 distance+0.004
Neg logit contributions
 concerned-0.010
ealing-0.010
 sorry-0.010
 darling-0.010
 fault-0.010
Token.
Token ID13
Feature activation-1.13e-01

Pos logit contributions
|+0.030
 hunt+0.025
 land+0.024
 camp+0.023
 distance+0.022
Neg logit contributions
 darling-0.053
 concerned-0.053
 sorry-0.052
ealing-0.052
 dear-0.052