TOP ACTIVATIONS
MAX = 11.849

time there was a big chamber . Inside the chamber there
time , in a big garden , there lived a little
time , in a small garden , there was a little
time , in a small garden , there was a little
time , in a big garden , there was a caul
time , in a big garden , there was a wealthy
time , in a small garden , there lived a little
time , in a big garden , there was a caul
big chamber . Inside the chamber there were many things that
small chamber . Inside this chamber , lived a modest lady
time , in a green garden , there lived a restless
outside in her backyard . Yesterday , she saw a big
time , in a small garden , there was a fragile
a tiny bird in a backyard . The bird 's feathers
time , in a small garden , there lived a gr
time , in a small garden , there lived a lazy
time , in a small garden , there was a little
time , in a small garden , there was a magic
Once upon a time , there was a little girl called
time , in a wild garden , there lived a little

MIN ACTIVATIONS
MIN = -14.173

" ch oo cho o cho o ". They
I am glad you love Grand ma . She loves you
temper and often threw temper tant rums . One day ,
not know . We love Grand ma . She is compassionate
them and never threw a tant rum again . <|endoftext|> Once
did not want to make Mrs . Lee sad . What
. You want to show Grand ma and Grand pa your
And only if we bring Mr . Fl uff ."
other . And we love Grand ma . And we love
Mom . And we love Grand ma . And we love
baby back and she thanked Twe ety and the owl for
can still remind me of Grand ma . She was good
, Mom . We love Grand ma too . We want
ushed and thanked the woman prof use ly . The woman
I just wanted to make Grand ma happy ."
healthy . And we want Grand ma to be clean and
and said " thank you mom my " and ate her
said , " Thank you Grand ma , for the best
" Thank you for sharing Mitt ens with me . I
" And we will call Grand ma and say thank you

INTERVAL 5.343 to 11.849
CONTAINS 4.617%

get to the top . As she got to the top
. <|endoftext|> Once upon a time there was a big ,
felt sad . A big , patient truck named
kins safe from bugs . One day , he decided to
Tom , who lived in a big house with his family

INTERVAL -1.162 to 5.343
CONTAINS 59.761%

garden , there lived a little ant named Andy . Andy
he realised he had left something behind . His favourite red
go back and see what was happening . When they reached
. The big tree fell down . The raven was still
until all the dirt was gone . When the

INTERVAL -7.668 to -1.162
CONTAINS 35.285%

that he would always make sure to put left overs in
They swap their planes and throw them again . But this
Ben is teasing me ." Mia said . Mom
be able to heal the leather . Maybe the shop keeper
waited for their mom to come home . When she saw

INTERVAL -14.173 to -7.668
CONTAINS 0.338%

fun ," said Mom my . "
it on your toes ," mom says . Anna
leaf . " L ila , Tom , it
home , he told his mom all about his fun day
feeling scared and asked her mom if she could sleep in
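
The four interval boundaries above are consistent with splitting the range from MIN to MAX into four equal-width bins. The sketch below reproduces them under that assumption. The function name `equal_width_edges` is hypothetical and not part of any dashboard tool, and because the printed MIN and MAX are already rounded to three decimals, the computed edges can differ from the printed ones in the last digit.

```python
def equal_width_edges(lo, hi, n_bins):
    """Return n_bins + 1 evenly spaced edges from lo to hi (assumed binning)."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]


# MIN and MAX as printed in the dashboard (rounded to 3 decimals).
edges = equal_width_edges(-14.173, 11.849, 4)

# Shares printed under each interval heading, highest interval first.
shares = [4.617, 59.761, 35.285, 0.338]

# Pair each share with its (low, high) bin, walking from the top bin down.
bins_high_to_low = list(reversed(list(zip(edges, edges[1:]))))
for (low, high), share in zip(bins_high_to_low, shares):
    print(f"{low:8.3f} to {high:8.3f}  {share:6.3f}%")
```

Read this way, about 95% of the sampled activations fall in the two middle bins, and the top and bottom bins that supply the TOP and MIN examples are sparse tails.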
Token time
Token ID640
Feature activation+5.99e+00

Pos logit contributions
 valley+0.119
 woods+0.112
 mountain+0.111
 forest+0.109
 field+0.107
Neg logit contributions
 supervision-0.431
 wounds-0.409
 permission-0.389
olds-0.357
 apologize-0.355
Token there
Token ID612
Feature activation+9.11e+00

Pos logit contributions
 valley+0.109
 woods+0.106
 mountain+0.105
 lake+0.102
 field+0.100
Neg logit contributions
 supervision-0.286
 wounds-0.266
 permission-0.255
 apologize-0.236
olds-0.236
Token was
Token ID373
Feature activation+7.16e+00

Pos logit contributions
 valley+0.179
 mountain+0.173
 woods+0.163
 field+0.162
 lake+0.160
Neg logit contributions
 supervision-0.404
 wounds-0.377
 permission-0.354
 apologize-0.336
olds-0.331
Token a
Token ID257
Feature activation+8.42e+00

Pos logit contributions
 mountain+0.145
 valley+0.138
 woods+0.127
 lake+0.125
 hill+0.125
Neg logit contributions
 supervision-0.320
 wounds-0.305
 permission-0.292
 apologize-0.270
ppa-0.269
Token big
Token ID1263
Feature activation+6.24e+00

Pos logit contributions
 woods+0.164
 valley+0.161
 mountain+0.152
 sky+0.151
 forest+0.146
Neg logit contributions
 wounds-0.338
 supervision-0.333
ppa-0.309
 permission-0.299
anut-0.294
Token chamber
Token ID11847
Feature activation+1.18e+01

Pos logit contributions
 woods+0.118
case+0.116
 Under+0.113
 valley+0.112
 clearing+0.111
Neg logit contributions
 supervision-0.247
 wounds-0.237
 permission-0.223
 apologize-0.209
 apology-0.207
Token.
Token ID13
Feature activation+5.64e+00

Pos logit contributions
 valley+0.367
 lake+0.341
 woods+0.339
 landscape+0.318
 creek+0.318
Neg logit contributions
 supervision-0.284
 wounds-0.268
 apologize-0.258
olds-0.254
 permission-0.249
Token Inside
Token ID14384
Feature activation+9.73e+00

Pos logit contributions
 valley+0.144
 mountain+0.135
 lake+0.129
 field+0.124
 woods+0.124
Neg logit contributions
 supervision-0.195
 permission-0.184
and-0.170
olds-0.167
 apology-0.163
Token the
Token ID262
Feature activation+4.80e+00

Pos logit contributions
 clearing+0.270
 valley+0.269
ifying+0.265
 tiger+0.260
Under+0.259
Neg logit contributions
 supervision-0.239
 wounds-0.198
 permission-0.197
ma-0.186
pa-0.180
Token chamber
Token ID11847
Feature activation+1.15e+01

Pos logit contributions
�+0.090
 lake+0.082
storm+0.078
 construction+0.076
 mountain+0.076
Neg logit contributions
 wounds-0.163
 attention-0.154
 permission-0.149
 apologize-0.141
 supervision-0.140
Token there
Token ID612
Feature activation+9.00e+00

Pos logit contributions
 valley+0.278
 lake+0.276
 tiger+0.259
 creek+0.255
 clearing+0.251
Neg logit contributions
 supervision-0.369
olds-0.331
 wounds-0.331
 permission-0.320
 apologize-0.315
Token time
Token ID640
Feature activation+5.89e+00

Pos logit contributions
 valley+0.128
 woods+0.121
 mountain+0.120
 forest+0.116
 field+0.114
Neg logit contributions
 supervision-0.427
 wounds-0.405
 permission-0.384
olds-0.352
ppa-0.351
Token,
Token ID11
Feature activation+4.82e+00

Pos logit contributions
 valley+0.095
 woods+0.090
 mountain+0.090
 lake+0.085
 forest+0.085
Neg logit contributions
 supervision-0.298
 wounds-0.282
 permission-0.267
 apologize-0.247
olds-0.247
Token in
Token ID287
Feature activation+7.06e+00

Pos logit contributions
 mountain+0.115
 valley+0.113
 lake+0.109
 woods+0.107
 forest+0.102
Neg logit contributions
 supervision-0.184
 wounds-0.180
 permission-0.167
 apologize-0.162
wed-0.153
Token a
Token ID257
Feature activation+8.88e+00

Pos logit contributions
 valley+0.163
 mountain+0.162
 woods+0.152
 forest+0.146
 hill+0.143
Neg logit contributions
 supervision-0.277
 wounds-0.276
 permission-0.253
 apologize-0.233
ppa-0.221
Token big
Token ID1263
Feature activation+7.22e+00

Pos logit contributions
 mountain+0.161
 valley+0.155
 lake+0.145
 sky+0.145
 woods+0.143
Neg logit contributions
 supervision-0.377
 wounds-0.376
 permission-0.331
 apologize-0.328
ppa-0.311
Token garden
Token ID11376
Feature activation+1.18e+01

Pos logit contributions
case+0.104
 A+0.099
 Under+0.099
�+0.095
Under+0.094
Neg logit contributions
 supervision-0.300
 wounds-0.291
 permission-0.268
 apologize-0.257
 promises-0.237
Token,
Token ID11
Feature activation+5.14e+00

Pos logit contributions
 valley+0.296
 mountain+0.291
 tiger+0.282
 lake+0.281
 woods+0.272
Neg logit contributions
 supervision-0.392
 wounds-0.371
olds-0.351
 permission-0.349
 apologize-0.344
Token there
Token ID612
Feature activation+8.92e+00

Pos logit contributions
 mountain+0.117
 lake+0.112
 valley+0.111
 woods+0.104
 forest+0.102
Neg logit contributions
 wounds-0.194
 supervision-0.192
 apologize-0.176
 permission-0.175
olds-0.166
Token lived
Token ID5615
Feature activation+7.52e+00

Pos logit contributions
 valley+0.198
 mountain+0.194
 lake+0.175
 woods+0.174
 field+0.173
Neg logit contributions
 supervision-0.350
 wounds-0.326
 permission-0.314
 apologize-0.305
olds-0.292
Token a
Token ID257
Feature activation+7.44e+00

Pos logit contributions
 mountain+0.141
 valley+0.137
 lake+0.125
 woods+0.124
 forest+0.118
Neg logit contributions
 supervision-0.346
 wounds-0.340
 permission-0.327
 apologize-0.300
olds-0.296
Token little
Token ID1310
Feature activation+5.54e+00

Pos logit contributions
case+0.174
 woods+0.173
 sky+0.170
 valley+0.169
 mountain+0.168
Neg logit contributions
 wounds-0.250
 supervision-0.228
ppa-0.216
 apologize-0.214
anut-0.209
Token time
Token ID640
Feature activation+6.07e+00

Pos logit contributions
 valley+0.128
 woods+0.120
 mountain+0.119
 forest+0.116
 field+0.114
Neg logit contributions
 supervision-0.442
 wounds-0.419
 permission-0.397
olds-0.365
ppa-0.364
Token,
Token ID11
Feature activation+4.81e+00

Pos logit contributions
 valley+0.101
 woods+0.096
 mountain+0.095
 lake+0.091
 forest+0.091
Neg logit contributions
 supervision-0.304
 wounds-0.286
 permission-0.272
 apologize-0.252
olds-0.251
Token in
Token ID287
Feature activation+7.24e+00

Pos logit contributions
 mountain+0.121
 valley+0.119
 lake+0.113
 woods+0.113
 forest+0.108
Neg logit contributions
 supervision-0.176
 wounds-0.171
 permission-0.158
 apologize-0.158
 apology-0.144
Token a
Token ID257
Feature activation+9.06e+00

Pos logit contributions
 valley+0.158
 mountain+0.155
 woods+0.149
 forest+0.143
 hill+0.135
Neg logit contributions
 supervision-0.304
 wounds-0.297
 permission-0.275
 apologize-0.257
 apology-0.245
Token small
Token ID1402
Feature activation+8.33e+00

Pos logit contributions
 mountain+0.170
 valley+0.167
 woods+0.157
 lake+0.154
 sky+0.153
Neg logit contributions
 supervision-0.388
 wounds-0.381
 permission-0.338
 apologize-0.335
ppa-0.320
Token garden
Token ID11376
Feature activation+1.18e+01

Pos logit contributions
case+0.176
�+0.175
 Under+0.175
 A+0.164
 Pretty+0.161
Neg logit contributions
 supervision-0.257
 wounds-0.255
 permission-0.236
 apologize-0.215
ppa-0.207
Token,
Token ID11
Feature activation+4.86e+00

Pos logit contributions
 valley+0.275
 mountain+0.267
 woods+0.259
 lake+0.255
 tiger+0.246
Neg logit contributions
 supervision-0.445
 wounds-0.411
 permission-0.395
 apologize-0.381
olds-0.377
Token there
Token ID612
Feature activation+8.92e+00

Pos logit contributions
 mountain+0.116
 lake+0.110
 valley+0.108
 woods+0.102
 forest+0.100
Neg logit contributions
 supervision-0.177
 wounds-0.176
 apologize-0.163
 permission-0.160
olds-0.148
Token was
Token ID373
Feature activation+6.74e+00

Pos logit contributions
 valley+0.194
 mountain+0.189
 woods+0.172
 lake+0.170
 field+0.168
Neg logit contributions
 supervision-0.357
 wounds-0.325
 permission-0.318
 apologize-0.307
olds-0.292
Token a
Token ID257
Feature activation+8.12e+00

Pos logit contributions
 mountain+0.183
 valley+0.173
 hill+0.165
 lake+0.160
 forest+0.148
Neg logit contributions
 wounds-0.232
 supervision-0.231
 permission-0.228
 apologize-0.207
olds-0.192
Token little
Token ID1310
Feature activation+6.00e+00

Pos logit contributions
 woods+0.176
case+0.173
 valley+0.171
 sky+0.168
 On+0.167
Neg logit contributions
 wounds-0.291
 supervision-0.272
 apologize-0.251
ppa-0.249
 permission-0.245
Token time
Token ID640
Feature activation+5.87e+00

Pos logit contributions
 valley+0.128
 woods+0.120
 mountain+0.120
 forest+0.116
 lake+0.113
Neg logit contributions
 supervision-0.414
 wounds-0.393
 permission-0.372
 apologize-0.341
olds-0.341
Token,
Token ID11
Feature activation+4.70e+00

Pos logit contributions
 valley+0.109
 woods+0.106
 mountain+0.105
 lake+0.102
 field+0.100
Neg logit contributions
 supervision-0.278
 wounds-0.258
 permission-0.247
 apologize-0.230
olds-0.230
Token in
Token ID287
Feature activation+7.40e+00

Pos logit contributions
 mountain+0.106
 valley+0.105
 woods+0.103
 lake+0.100
 forest+0.097
Neg logit contributions
 supervision-0.184
 wounds-0.180
 permission-0.166
ppa-0.166
anut-0.158
Token a
Token ID257
Feature activation+9.13e+00

Pos logit contributions
 valley+0.161
 mountain+0.161
 woods+0.151
 forest+0.146
 hill+0.145
Neg logit contributions
 supervision-0.305
 wounds-0.293
 permission-0.280
 apologize-0.253
ppa-0.243
Token small
Token ID1402
Feature activation+7.87e+00

Pos logit contributions
 mountain+0.163
 valley+0.154
 woods+0.147
 sky+0.143
 lake+0.141
Neg logit contributions
 wounds-0.380
 supervision-0.377
 permission-0.332
 apologize-0.326
ppa-0.326
Token garden
Token ID11376
Feature activation+1.17e+01

Pos logit contributions
 Under+0.150
case+0.148
 A+0.141
 mountain+0.140
�+0.136
Neg logit contributions
 supervision-0.291
 wounds-0.285
 permission-0.266
ppa-0.241
 apologize-0.236
Token,
Token ID11
Feature activation+4.92e+00

Pos logit contributions
 valley+0.297
 mountain+0.291
 tiger+0.282
 lake+0.274
 woods+0.274
Neg logit contributions
 supervision-0.402
 wounds-0.366
 permission-0.350
olds-0.343
 apologize-0.335
Token there
Token ID612
Feature activation+9.79e+00

Pos logit contributions
 mountain+0.110
 valley+0.104
 lake+0.103
 woods+0.102
 forest+0.099
Neg logit contributions
 wounds-0.185
 supervision-0.185
 permission-0.171
 apologize-0.165
ppa-0.157
Token was
Token ID373
Feature activation+7.11e+00

Pos logit contributions
 mountain+0.211
 valley+0.211
 lake+0.187
 woods+0.185
 field+0.185
Neg logit contributions
 supervision-0.389
 wounds-0.365
 permission-0.347
 apologize-0.336
olds-0.325
Token a
Token ID257
Feature activation+8.41e+00

Pos logit contributions
 mountain+0.196
 valley+0.177
 hill+0.175
 lake+0.165
 tiger+0.159
Neg logit contributions
 wounds-0.224
 permission-0.221
 supervision-0.220
and-0.201
 apologize-0.200
Token little
Token ID1310
Feature activation+6.01e+00

Pos logit contributions
case+0.172
 woods+0.170
 sky+0.165
 On+0.164
 valley+0.163
Neg logit contributions
 wounds-0.305
 supervision-0.289
ppa-0.268
 apologize-0.265
 permission-0.262
Token time
Token ID640
Feature activation+5.63e+00

Pos logit contributions
 valley+0.118
 woods+0.111
 mountain+0.110
 forest+0.107
 field+0.106
Neg logit contributions
 supervision-0.418
 wounds-0.397
 permission-0.377
olds-0.346
ppa-0.345
Token,
Token ID11
Feature activation+4.50e+00

Pos logit contributions
 valley+0.084
 mountain+0.078
 woods+0.078
 forest+0.075
 lake+0.074
Neg logit contributions
 supervision-0.293
 wounds-0.279
 permission-0.263
 apologize-0.244
olds-0.243
Token in
Token ID287
Feature activation+6.87e+00

Pos logit contributions
 mountain+0.111
 valley+0.109
 lake+0.105
 woods+0.104
 forest+0.100
Neg logit contributions
 supervision-0.168
 wounds-0.161
 permission-0.149
 apologize-0.147
and-0.138
Token a
Token ID257
Feature activation+8.73e+00

Pos logit contributions
 valley+0.143
 mountain+0.142
 woods+0.137
 forest+0.131
 lake+0.125
Neg logit contributions
 supervision-0.295
 wounds-0.289
 permission-0.268
 apologize-0.249
 apology-0.240
Token big
Token ID1263
Feature activation+7.21e+00

Pos logit contributions
 mountain+0.157
 valley+0.152
 woods+0.144
 sky+0.143
 lake+0.141
Neg logit contributions
 supervision-0.373
 wounds-0.370
 permission-0.328
 apologize-0.324
ppa-0.310
Token garden
Token ID11376
Feature activation+1.17e+01

Pos logit contributions
case+0.106
 A+0.101
 Under+0.099
Under+0.097
�+0.094
Neg logit contributions
 supervision-0.294
 wounds-0.280
 permission-0.266
 apologize-0.252
 promises-0.233
Token,
Token ID11
Feature activation+5.05e+00

Pos logit contributions
 valley+0.283
 mountain+0.277
 lake+0.265
 tiger+0.265
 woods+0.265
Neg logit contributions
 supervision-0.415
 wounds-0.387
 permission-0.369
olds-0.364
 apologize-0.359
Token there
Token ID612
Feature activation+8.71e+00

Pos logit contributions
 mountain+0.114
 lake+0.109
 valley+0.107
 woods+0.100
 forest+0.099
Neg logit contributions
 wounds-0.188
 supervision-0.188
 permission-0.172
 apologize-0.171
olds-0.160
Token was
Token ID373
Feature activation+6.67e+00

Pos logit contributions
 valley+0.183
 mountain+0.178
 lake+0.161
 woods+0.159
 field+0.159
Neg logit contributions
 supervision-0.361
 wounds-0.334
 permission-0.324
 apologize-0.312
olds-0.300
Token a
Token ID257
Feature activation+7.93e+00

Pos logit contributions
 mountain+0.184
 valley+0.171
 hill+0.163
 lake+0.161
 tiger+0.148
Neg logit contributions
 wounds-0.225
 permission-0.221
 supervision-0.220
 apologize-0.203
and-0.191
Token caul
Token ID46087
Feature activation+2.55e+00

Pos logit contributions
 woods+0.167
case+0.167
 sky+0.162
 valley+0.162
 On+0.160
Neg logit contributions
 wounds-0.281
 supervision-0.259
 apologize-0.244
ppa-0.240
 permission-0.235
Token time
Token ID640
Feature activation+5.55e+00

Pos logit contributions
 valley+0.121
 woods+0.114
 mountain+0.114
 forest+0.110
 lake+0.108
Neg logit contributions
 supervision-0.407
 wounds-0.386
 permission-0.366
olds-0.336
ppa-0.335
Token,
Token ID11
Feature activation+4.36e+00

Pos logit contributions
 valley+0.091
 woods+0.086
 mountain+0.086
 forest+0.082
 lake+0.082
Neg logit contributions
 supervision-0.279
 wounds-0.264
 permission-0.250
 apologize-0.232
olds-0.231
Token in
Token ID287
Feature activation+6.78e+00

Pos logit contributions
 mountain+0.110
 valley+0.109
 woods+0.103
 lake+0.102
 forest+0.099
Neg logit contributions
 supervision-0.157
 wounds-0.153
 apologize-0.142
 permission-0.141
and-0.131
Token a
Token ID257
Feature activation+8.65e+00

Pos logit contributions
 valley+0.153
 mountain+0.145
 woods+0.145
 forest+0.137
 sky+0.131
Neg logit contributions
 supervision-0.273
 wounds-0.271
 permission-0.247
 apologize-0.232
 apology-0.220
Token big
Token ID1263
Feature activation+7.18e+00

Pos logit contributions
 mountain+0.156
 valley+0.151
 woods+0.141
 sky+0.141
 lake+0.138
Neg logit contributions
 supervision-0.363
 wounds-0.362
 apologize-0.317
 permission-0.317
ppa-0.302
Token garden
Token ID11376
Feature activation+1.16e+01

Pos logit contributions
case+0.105
 Under+0.100
 A+0.099
Under+0.098
 On+0.092
Neg logit contributions
 supervision-0.289
 wounds-0.277
 permission-0.257
 apologize-0.249
 promises-0.232
Token,
Token ID11
Feature activation+4.64e+00

Pos logit contributions
 valley+0.300
 mountain+0.292
 tiger+0.280
 lake+0.277
 woods+0.274
Neg logit contributions
 supervision-0.380
 wounds-0.351
olds-0.341
 permission-0.335
 apologize-0.335
Token there
Token ID612
Feature activation+8.62e+00

Pos logit contributions
 mountain+0.113
 lake+0.106
 valley+0.105
 woods+0.100
 forest+0.099
Neg logit contributions
 wounds-0.164
 supervision-0.162
 apologize-0.151
 permission-0.147
olds-0.138
Token was
Token ID373
Feature activation+6.52e+00

Pos logit contributions
 valley+0.197
 mountain+0.194
 field+0.173
 lake+0.172
 woods+0.172
Neg logit contributions
 supervision-0.327
 wounds-0.302
 permission-0.293
 apologize-0.286
olds-0.272
Token a
Token ID257
Feature activation+7.86e+00

Pos logit contributions
 mountain+0.193
 valley+0.179
 hill+0.174
 lake+0.165
 tiger+0.158
Neg logit contributions
 wounds-0.194
 permission-0.193
 supervision-0.186
 apologize-0.174
and-0.169
Token wealthy
Token ID11574
Feature activation+5.34e+00

Pos logit contributions
 woods+0.164
case+0.163
 valley+0.159
 On+0.159
 sky+0.156
Neg logit contributions
 wounds-0.280
 supervision-0.260
 apologize-0.243
ppa-0.239
 permission-0.235
Token time
Token ID640
Feature activation+5.93e+00

Pos logit contributions
 valley+0.134
 woods+0.126
 mountain+0.126
 forest+0.121
 lake+0.119
Neg logit contributions
 supervision-0.421
 wounds-0.399
 permission-0.377
 apologize-0.347
olds-0.346
Token,
Token ID11
Feature activation+4.77e+00

Pos logit contributions
 valley+0.106
 woods+0.100
 mountain+0.100
 lake+0.096
 field+0.095
Neg logit contributions
 supervision-0.288
 wounds-0.270
 permission-0.257
olds-0.239
 apologize-0.239
Token in
Token ID287
Feature activation+7.09e+00

Pos logit contributions
 mountain+0.127
 valley+0.123
 lake+0.118
 woods+0.117
 field+0.114
Neg logit contributions
 supervision-0.157
 wounds-0.155
 apologize-0.145
 permission-0.140
 mad-0.133
Token a
Token ID257
Feature activation+8.92e+00

Pos logit contributions
 valley+0.168
 mountain+0.161
 woods+0.156
 forest+0.150
 hill+0.145
Neg logit contributions
 supervision-0.271
 wounds-0.270
 permission-0.247
 apologize-0.232
 apology-0.217
Token small
Token ID1402
Feature activation+8.22e+00

Pos logit contributions
 mountain+0.166
 valley+0.158
 sky+0.151
 lake+0.147
 woods+0.145
Neg logit contributions
 supervision-0.360
 wounds-0.359
 apologize-0.318
 permission-0.312
ppa-0.292
Token garden
Token ID11376
Feature activation+1.16e+01

Pos logit contributions
case+0.185
�+0.179
 Under+0.171
storm+0.164
 A+0.159
Neg logit contributions
 supervision-0.248
 wounds-0.248
 permission-0.231
 apologize-0.209
 mad-0.202
Token,
Token ID11
Feature activation+4.69e+00

Pos logit contributions
 valley+0.281
 mountain+0.277
 lake+0.264
 woods+0.259
 tiger+0.259
Neg logit contributions
 supervision-0.406
 wounds-0.378
 permission-0.360
olds-0.357
 apologize-0.352
Token there
Token ID612
Feature activation+8.87e+00

Pos logit contributions
 mountain+0.112
 lake+0.105
 valley+0.104
 woods+0.097
 field+0.097
Neg logit contributions
 wounds-0.170
 supervision-0.170
 apologize-0.157
 permission-0.153
olds-0.146
Token lived
Token ID5615
Feature activation+7.48e+00

Pos logit contributions
 valley+0.199
 mountain+0.198
 field+0.177
 lake+0.176
 woods+0.174
Neg logit contributions
 supervision-0.338
 wounds-0.313
 permission-0.303
 apologize-0.296
olds-0.281
Token a
Token ID257
Feature activation+7.41e+00

Pos logit contributions
 mountain+0.148
 valley+0.141
 lake+0.129
 woods+0.128
 hill+0.124
Neg logit contributions
 supervision-0.336
 wounds-0.331
 permission-0.320
 apologize-0.293
olds-0.288
Token little
Token ID1310
Feature activation+5.56e+00

Pos logit contributions
case+0.172
 sky+0.164
 woods+0.164
 valley+0.161
 mountain+0.161
Neg logit contributions
 wounds-0.251
 supervision-0.228
ppa-0.217
 apologize-0.216
anut-0.212
Token time
Token ID640
Feature activation+5.69e+00

Pos logit contributions
 valley+0.116
 woods+0.109
 mountain+0.108
 forest+0.106
 field+0.104
Neg logit contributions
 supervision-0.423
 wounds-0.401
 permission-0.381
olds-0.351
 apologize-0.348
Token,
Token ID11
Feature activation+4.57e+00

Pos logit contributions
 valley+0.104
 woods+0.101
 mountain+0.100
 lake+0.097
 field+0.096
Neg logit contributions
 supervision-0.271
 wounds-0.252
 permission-0.242
olds-0.224
 apologize-0.224
Token in
Token ID287
Feature activation+7.06e+00

Pos logit contributions
 mountain+0.109
 valley+0.106
 lake+0.104
 woods+0.104
 forest+0.099
Neg logit contributions
 supervision-0.169
 wounds-0.165
ppa-0.153
 permission-0.151
anut-0.146
Token a
Token ID257
Feature activation+8.91e+00

Pos logit contributions
 valley+0.146
 mountain+0.146
 woods+0.138
 forest+0.132
 lake+0.131
Neg logit contributions
 supervision-0.303
 wounds-0.291
 permission-0.277
 apologize-0.250
ppa-0.243
Token big
Token ID1263
Feature activation+6.98e+00

Pos logit contributions
 mountain+0.157
 valley+0.151
 woods+0.144
 sky+0.139
 lake+0.138
Neg logit contributions
 wounds-0.380
 supervision-0.378
 permission-0.334
ppa-0.328
 apologize-0.324
Token garden
Token ID11376
Feature activation+1.16e+01

Pos logit contributions
 Under+0.097
case+0.096
 A+0.095
 woods+0.093
 valley+0.091
Neg logit contributions
 supervision-0.308
 wounds-0.294
 permission-0.275
 apologize-0.258
ppa-0.248
Token,
Token ID11
Feature activation+4.94e+00

Pos logit contributions
 valley+0.301
 mountain+0.294
 tiger+0.287
 lake+0.279
 woods+0.276
Neg logit contributions
 supervision-0.380
 wounds-0.349
 permission-0.332
olds-0.330
 apologize-0.319
Token there
Token ID612
Feature activation+9.44e+00

Pos logit contributions
 mountain+0.118
 lake+0.111
 valley+0.109
 woods+0.108
 forest+0.106
Neg logit contributions
 wounds-0.171
 supervision-0.167
 permission-0.155
 apologize-0.154
ppa-0.145
Token was
Token ID373
Feature activation+6.83e+00

Pos logit contributions
 valley+0.203
 mountain+0.203
 lake+0.180
 field+0.179
 woods+0.176
Neg logit contributions
 supervision-0.373
 wounds-0.351
 permission-0.336
 apologize-0.324
olds-0.313
Token a
Token ID257
Feature activation+8.15e+00

Pos logit contributions
 mountain+0.181
 valley+0.167
 hill+0.162
 lake+0.156
 tiger+0.145
Neg logit contributions
 wounds-0.223
 supervision-0.219
 permission-0.218
and-0.199
 apologize-0.197
Token caul
Token ID46087
Feature activation+2.71e+00

Pos logit contributions
 woods+0.161
case+0.160
 On+0.156
 valley+0.155
 sky+0.153
Neg logit contributions
 wounds-0.300
 supervision-0.285
ppa-0.263
 apologize-0.261
 permission-0.257
Token big
Token ID1263
Feature activation+6.24e+00

Pos logit contributions
 woods+0.164
 valley+0.161
 mountain+0.152
 sky+0.151
 forest+0.146
Neg logit contributions
 wounds-0.338
 supervision-0.333
ppa-0.309
 permission-0.299
anut-0.294
Token chamber
Token ID11847
Feature activation+1.18e+01

Pos logit contributions
 woods+0.118
case+0.116
 Under+0.113
 valley+0.112
 clearing+0.111
Neg logit contributions
 supervision-0.247
 wounds-0.237
 permission-0.223
 apologize-0.209
 apology-0.207
Token.
Token ID13
Feature activation+5.64e+00

Pos logit contributions
 valley+0.367
 lake+0.341
 woods+0.339
 landscape+0.318
 creek+0.318
Neg logit contributions
 supervision-0.284
 wounds-0.268
 apologize-0.258
olds-0.254
 permission-0.249
Token Inside
Token ID14384
Feature activation+9.73e+00

Pos logit contributions
 valley+0.144
 mountain+0.135
 lake+0.129
 field+0.124
 woods+0.124
Neg logit contributions
 supervision-0.195
 permission-0.184
and-0.170
olds-0.167
 apology-0.163
Token the
Token ID262
Feature activation+4.80e+00

Pos logit contributions
 clearing+0.270
 valley+0.269
ifying+0.265
 tiger+0.260
Under+0.259
Neg logit contributions
 supervision-0.239
 wounds-0.198
 permission-0.197
ma-0.186
pa-0.180
Token chamber
Token ID11847
Feature activation+1.15e+01

Pos logit contributions
�+0.090
 lake+0.082
storm+0.078
 construction+0.076
 mountain+0.076
Neg logit contributions
 wounds-0.163
 attention-0.154
 permission-0.149
 apologize-0.141
 supervision-0.140
Token there
Token ID612
Feature activation+9.00e+00

Pos logit contributions
 valley+0.278
 lake+0.276
 tiger+0.259
 creek+0.255
 clearing+0.251
Neg logit contributions
 supervision-0.369
olds-0.331
 wounds-0.331
 permission-0.320
 apologize-0.315
Token were
Token ID547
Feature activation+6.95e+00

Pos logit contributions
 valley+0.211
 mountain+0.207
 lake+0.190
 tiger+0.187
 field+0.187
Neg logit contributions
 supervision-0.331
 permission-0.306
 wounds-0.299
 apologize-0.279
olds-0.271
Token many
Token ID867
Feature activation+6.10e+00

Pos logit contributions
 mountain+0.184
 valley+0.176
 lake+0.175
 field+0.170
 hill+0.170
Neg logit contributions
 wounds-0.225
 supervision-0.219
 permission-0.217
and-0.205
olds-0.197
Token things
Token ID1243
Feature activation+4.55e+00

Pos logit contributions
 mountain+0.170
 lake+0.169
 valley+0.168
 field+0.163
�+0.160
Neg logit contributions
 wounds-0.205
 supervision-0.185
 permission-0.171
olds-0.160
 attention-0.154
Token that
Token ID326
Feature activation+3.45e+00

Pos logit contributions
 valley+0.123
 mountain+0.118
 woods+0.113
 lake+0.112
 forest+0.111
Neg logit contributions
 supervision-0.180
 wounds-0.163
 permission-0.158
olds-0.144
 apologize-0.143
Token small
Token ID1402
Feature activation+7.06e+00

Pos logit contributions
 woods+0.165
 valley+0.163
 mountain+0.150
 sky+0.150
 forest+0.148
Neg logit contributions
 wounds-0.322
 supervision-0.317
ppa-0.285
 permission-0.285
 apologize-0.282
Token chamber
Token ID11847
Feature activation+1.11e+01

Pos logit contributions
case+0.162
 Under+0.157
 On+0.155
There+0.155
 Inside+0.155
Neg logit contributions
 supervision-0.228
 wounds-0.224
 permission-0.211
 apology-0.195
 apologize-0.190
Token.
Token ID13
Feature activation+5.29e+00

Pos logit contributions
 valley+0.358
 lake+0.327
 woods+0.319
 mountain+0.310
 landscape+0.306
Neg logit contributions
 supervision-0.263
 wounds-0.249
olds-0.241
 apologize-0.239
 permission-0.229
Token Inside
Token ID14384
Feature activation+9.41e+00

Pos logit contributions
 valley+0.148
 mountain+0.132
 lake+0.130
 field+0.123
 woods+0.121
Neg logit contributions
 supervision-0.174
 permission-0.161
olds-0.152
 apologize-0.150
 wounds-0.150
Token this
Token ID428
Feature activation+6.77e+00

Pos logit contributions
 clearing+0.259
 valley+0.259
ifying+0.253
Under+0.252
 Under+0.248
Neg logit contributions
 supervision-0.254
 permission-0.206
 wounds-0.203
pa-0.196
 apologize-0.195
Token chamber
Token ID11847
Feature activation+1.15e+01

Pos logit contributions
ifying+0.148
 hawk+0.124
Wow+0.121
plets+0.121
 construction+0.118
Neg logit contributions
 promise-0.192
 attention-0.192
 apologize-0.190
 promises-0.190
 wounds-0.180
Token,
Token ID11
Feature activation+4.33e+00

Pos logit contributions
 valley+0.294
 lake+0.293
 tiger+0.278
 creek+0.275
 clearing+0.273
Neg logit contributions
 supervision-0.332
olds-0.299
 permission-0.291
 wounds-0.287
 apologize-0.285
Token lived
Token ID5615
Feature activation+6.25e+00

Pos logit contributions
 valley+0.110
 lake+0.107
 mountain+0.104
 field+0.103
 woods+0.102
Neg logit contributions
 supervision-0.150
 wounds-0.148
 permission-0.144
 apologize-0.133
 forgiveness-0.121
Token a
Token ID257
Feature activation+6.68e+00

Pos logit contributions
 mountain+0.129
 valley+0.127
 lake+0.121
 woods+0.112
 field+0.109
Neg logit contributions
 wounds-0.265
 supervision-0.260
 permission-0.257
 apologize-0.230
and-0.229
Token modest
Token ID12949
Feature activation+5.20e+00

Pos logit contributions
und+0.157
neath+0.156
 woods+0.153
 valley+0.153
 On+0.153
Neg logit contributions
 wounds-0.200
 supervision-0.179
 apologize-0.178
 forg-0.176
ppa-0.174
Token lady
Token ID10846
Feature activation-1.38e+00

Pos logit contributions
�+0.108
There+0.106
ifying+0.104
neath+0.103
und+0.096
Neg logit contributions
 wounds-0.164
 apology-0.161
 permission-0.159
 mistake-0.147
 supervision-0.147
Token time
Token ID640
Feature activation+5.69e+00

Pos logit contributions
 valley+0.127
 woods+0.120
 mountain+0.119
 forest+0.115
 lake+0.113
Neg logit contributions
 supervision-0.417
 wounds-0.395
 permission-0.374
olds-0.343
ppa-0.343
Token,
Token ID11
Feature activation+4.53e+00

Pos logit contributions
 valley+0.089
 woods+0.084
 mountain+0.084
 forest+0.080
 lake+0.079
Neg logit contributions
 supervision-0.292
 wounds-0.276
 permission-0.261
 apologize-0.242
olds-0.242
Token in
Token ID287
Feature activation+6.98e+00

Pos logit contributions
 mountain+0.114
 valley+0.110
 lake+0.108
 woods+0.103
 field+0.102
Neg logit contributions
 supervision-0.161
 wounds-0.158
 apologize-0.148
 permission-0.143
olds-0.135
Token a
Token ID257
Feature activation+8.79e+00

Pos logit contributions
 valley+0.159
 mountain+0.154
 woods+0.146
 forest+0.141
 hill+0.137
Neg logit contributions
 supervision-0.279
 wounds-0.276
 permission-0.254
 apologize-0.239
 apology-0.223
Token green
Token ID4077
Feature activation+8.12e+00

Pos logit contributions
 mountain+0.157
 valley+0.150
 sky+0.143
 lake+0.141
 woods+0.138
Neg logit contributions
 supervision-0.365
 wounds-0.364
 apologize-0.320
 permission-0.319
ppa-0.299
Token garden
Token ID11376
Feature activation+1.14e+01

Pos logit contributions
 must+0.137
 Under+0.130
 Pretty+0.127
case+0.126
�+0.124
Neg logit contributions
 wounds-0.262
 permission-0.260
 supervision-0.259
 apologize-0.243
ppa-0.220
Token,
Token ID11
Feature activation+4.40e+00

Pos logit contributions
 valley+0.287
 mountain+0.277
 lake+0.266
 tiger+0.263
 woods+0.262
Neg logit contributions
 supervision-0.387
 wounds-0.360
olds-0.343
 permission-0.342
 apologize-0.339
Token there
Token ID612
Feature activation+8.60e+00

Pos logit contributions
 mountain+0.104
 lake+0.098
 valley+0.097
 field+0.090
 woods+0.090
Neg logit contributions
 supervision-0.162
 wounds-0.162
 apologize-0.151
 permission-0.146
olds-0.139
Token lived
Token ID5615
Feature activation+7.23e+00

Pos logit contributions
 valley+0.186
 mountain+0.182
 field+0.164
 lake+0.164
 woods+0.163
Neg logit contributions
 supervision-0.343
 wounds-0.317
 permission-0.307
 apologize-0.300
olds-0.285
Token a
Token ID257
Feature activation+7.19e+00

Pos logit contributions
 mountain+0.140
 valley+0.135
 lake+0.123
 woods+0.122
 hill+0.117
Neg logit contributions
 supervision-0.328
 wounds-0.322
 permission-0.311
 apologize-0.285
olds-0.281
Token restless
Token ID42537
Feature activation+7.64e+00

Pos logit contributions
case+0.165
 woods+0.163
 sky+0.161
 valley+0.160
 mountain+0.159
Neg logit contributions
 wounds-0.242
 supervision-0.218
ppa-0.210
 apologize-0.208
anut-0.203
Token outside
Token ID2354
Feature activation+4.03e+00

Pos logit contributions
 mountain+0.096
 woods+0.094
 valley+0.093
 sky+0.090
 lake+0.087
Neg logit contributions
 wounds-0.146
 supervision-0.136
 permission-0.126
 apologize-0.122
ppa-0.116
Token in
Token ID287
Feature activation+3.18e+00

Pos logit contributions
 mountain+0.101
 lake+0.101
 valley+0.100
 woods+0.100
 clearing+0.099
Neg logit contributions
 wounds-0.134
 supervision-0.128
 permission-0.124
and-0.114
 apologize-0.111
Token her
Token ID607
Feature activation+1.23e+00

Pos logit contributions
 valley+0.084
 mountain+0.082
 woods+0.081
 lake+0.079
 forest+0.078
Neg logit contributions
 supervision-0.123
 wounds-0.117
 permission-0.108
 apologize-0.098
olds-0.094
Token backyard
Token ID24296
Feature activation+8.20e+00

Pos logit contributions
 mountain+0.019
 valley+0.019
 lake+0.018
 clearing+0.016
 woods+0.016
Neg logit contributions
 supervision-0.057
 wounds-0.056
 permission-0.051
 apologize-0.048
ppa-0.048
Token.
Token ID13
Feature activation+3.43e+00

Pos logit contributions
 valley+0.198
 mountain+0.191
 woods+0.183
 lake+0.183
 tiger+0.175
Neg logit contributions
 supervision-0.325
 wounds-0.305
 permission-0.294
 apologize-0.266
olds-0.264
Token Yesterday
Token ID34668
Feature activation+1.14e+01

Pos logit contributions
 mountain+0.059
 valley+0.059
 woods+0.054
 lake+0.053
 forest+0.052
Neg logit contributions
 supervision-0.163
 wounds-0.152
 permission-0.146
 apologize-0.138
olds-0.135
Token,
Token ID11
Feature activation+5.08e+00

Pos logit contributions
 valley+0.268
 mountain+0.257
 woods+0.246
 lake+0.244
 clearing+0.238
Neg logit contributions
 supervision-0.417
 wounds-0.373
 apologize-0.364
 permission-0.363
olds-0.340
Token she
Token ID673
Feature activation+6.01e+00

Pos logit contributions
 mountain+0.156
 lake+0.152
 woods+0.150
 valley+0.150
 forest+0.149
Neg logit contributions
 supervision-0.156
 wounds-0.149
 permission-0.139
 apologize-0.128
 attention-0.121
Token saw
Token ID2497
Feature activation+5.84e+00

Pos logit contributions
 valley+0.147
 mountain+0.144
 woods+0.138
 lake+0.137
 forest+0.133
Neg logit contributions
 supervision-0.239
 wounds-0.229
 permission-0.212
 apologize-0.199
ppa-0.199
Token a
Token ID257
Feature activation+7.21e+00

Pos logit contributions
 valley+0.109
 mountain+0.108
 woods+0.100
 lake+0.099
 forest+0.097
Neg logit contributions
 supervision-0.278
 wounds-0.267
 permission-0.252
 apologize-0.231
olds-0.228
Token big
Token ID1263
Feature activation+5.60e+00

Pos logit contributions
 On+0.144
 Inside+0.135
 woods+0.133
 Under+0.133
 Pretty+0.132
Neg logit contributions
 wounds-0.261
 supervision-0.245
 apologize-0.221
ppa-0.220
 mistake-0.216
Token time
Token ID640
Feature activation+5.54e+00

Pos logit contributions
 valley+0.122
 woods+0.114
 mountain+0.114
 forest+0.110
 lake+0.108
Neg logit contributions
 supervision-0.412
 wounds-0.391
 permission-0.370
olds-0.340
ppa-0.340
Token,
Token ID11
Feature activation+4.35e+00

Pos logit contributions
 valley+0.086
 woods+0.081
 mountain+0.081
 forest+0.077
 lake+0.077
Neg logit contributions
 supervision-0.285
 wounds-0.270
 permission-0.255
 apologize-0.236
olds-0.236
Token in
Token ID287
Feature activation+6.74e+00

Pos logit contributions
 mountain+0.106
 valley+0.104
 lake+0.101
 woods+0.100
 forest+0.096
Neg logit contributions
 supervision-0.163
 wounds-0.156
 apologize-0.145
 permission-0.144
and-0.137
Token a
Token ID257
Feature activation+8.69e+00

Pos logit contributions
 valley+0.147
 mountain+0.146
 woods+0.142
 forest+0.135
 lake+0.130
Neg logit contributions
 supervision-0.276
 wounds-0.269
 permission-0.249
 apologize-0.232
 apology-0.223
Token small
Token ID1402
Feature activation+7.95e+00

Pos logit contributions
 mountain+0.159
 valley+0.155
 woods+0.147
 sky+0.144
 lake+0.142
Neg logit contributions
 supervision-0.367
 wounds-0.363
 permission-0.323
 apologize-0.318
ppa-0.304
Token garden
Token ID11376
Feature activation+1.14e+01

Pos logit contributions
case+0.170
 Under+0.165
�+0.160
Under+0.152
storm+0.149
Neg logit contributions
 supervision-0.253
 wounds-0.249
 permission-0.237
 apologize-0.213
 promises-0.203
Token,
Token ID11
Feature activation+4.41e+00

Pos logit contributions
 valley+0.275
 mountain+0.269
 woods+0.255
 tiger+0.254
 lake+0.253
Neg logit contributions
 supervision-0.410
 wounds-0.374
 permission-0.360
 apologize-0.352
olds-0.350
Token there
Token ID612
Feature activation+8.52e+00

Pos logit contributions
 mountain+0.103
 lake+0.098
 valley+0.096
 woods+0.092
 forest+0.090
Neg logit contributions
 supervision-0.162
 wounds-0.162
 apologize-0.149
 permission-0.146
and-0.135
Token was
Token ID373
Feature activation+6.35e+00

Pos logit contributions
 valley+0.177
 mountain+0.173
 woods+0.157
 lake+0.156
 field+0.155
Neg logit contributions
 supervision-0.356
 wounds-0.330
 permission-0.318
 apologize-0.306
olds-0.294
Token a
Token ID257
Feature activation+7.84e+00

Pos logit contributions
 mountain+0.173
 valley+0.160
 hill+0.154
 lake+0.149
 forest+0.139
Neg logit contributions
 wounds-0.217
 supervision-0.215
 permission-0.213
 apologize-0.195
and-0.180
Token fragile
Token ID21049
Feature activation+6.29e+00

Pos logit contributions
case+0.165
 woods+0.165
 valley+0.161
 sky+0.158
 mountain+0.157
Neg logit contributions
 wounds-0.282
 supervision-0.267
 apologize-0.247
 permission-0.242
ppa-0.241
Token a
Token ID257
Feature activation+8.33e+00

Pos logit contributions
 mountain+0.157
 valley+0.149
 hill+0.137
 lake+0.134
 woods+0.133
Neg logit contributions
 supervision-0.274
 wounds-0.265
 permission-0.257
 apologize-0.238
and-0.224
Token tiny
Token ID7009
Feature activation+7.02e+00

Pos logit contributions
 woods+0.171
 valley+0.170
 sky+0.158
case+0.157
 mountain+0.157
Neg logit contributions
 wounds-0.313
 supervision-0.305
ppa-0.277
 apologize-0.276
 permission-0.274
Token bird
Token ID6512
Feature activation+2.98e+00

Pos logit contributions
case+0.154
 woods+0.147
 Pretty+0.145
 On+0.145
There+0.142
Neg logit contributions
 wounds-0.234
 supervision-0.225
 permission-0.213
 apology-0.201
ppa-0.198
Token in
Token ID287
Feature activation+5.47e+00

Pos logit contributions
 valley+0.079
 mountain+0.077
 woods+0.075
 lake+0.073
 field+0.070
Neg logit contributions
 supervision-0.111
 wounds-0.107
 permission-0.098
ppa-0.094
 apologize-0.091
Token a
Token ID257
Feature activation+7.77e+00

Pos logit contributions
 valley+0.141
 mountain+0.133
 woods+0.131
case+0.130
 field+0.128
Neg logit contributions
 supervision-0.204
 wounds-0.192
 permission-0.183
 apologize-0.168
 apology-0.151
Token backyard
Token ID24296
Feature activation+1.13e+01

Pos logit contributions
 mountain+0.126
 Under+0.120
 valley+0.117
 On+0.116
 Inside+0.115
Neg logit contributions
 wounds-0.315
 supervision-0.308
 apologize-0.273
ppa-0.267
 permission-0.264
Token.
Token ID13
Feature activation+4.93e+00

Pos logit contributions
 valley+0.317
 woods+0.300
 mountain+0.300
 lake+0.290
 Under+0.280
Neg logit contributions
 supervision-0.367
 wounds-0.338
 permission-0.329
 apologize-0.304
olds-0.299
Token The
Token ID383
Feature activation+3.89e+00

Pos logit contributions
 valley+0.159
 mountain+0.144
 lake+0.141
 woods+0.140
 cabin+0.136
Neg logit contributions
 supervision-0.141
 apologize-0.124
 wounds-0.123
 permission-0.122
olds-0.116
Token bird
Token ID6512
Feature activation+1.61e+00

Pos logit contributions
 valley+0.067
 woods+0.065
 mountain+0.063
 clearing+0.061
 field+0.060
Neg logit contributions
 wounds-0.169
 supervision-0.168
 permission-0.158
 apology-0.147
 apologize-0.142
Token's
Token ID338
Feature activation+2.47e+00

Pos logit contributions
 valley+0.033
 mountain+0.032
 lake+0.031
 woods+0.031
 field+0.029
Neg logit contributions
 supervision-0.067
 wounds-0.065
 permission-0.059
ppa-0.058
 apologize-0.057
Token feathers
Token ID27400
Feature activation+1.53e+00

Pos logit contributions
 valley+0.069
 mountain+0.068
 woods+0.065
 lake+0.063
 forest+0.062
Neg logit contributions
 supervision-0.093
 wounds-0.089
 permission-0.084
 apologize-0.074
 apology-0.073
Token time
Token ID640
Feature activation+5.61e+00

Pos logit contributions
 valley+0.112
 woods+0.105
 mountain+0.103
 forest+0.102
 field+0.101
Neg logit contributions
 supervision-0.422
 wounds-0.401
 permission-0.381
olds-0.351
ppa-0.348
Token,
Token ID11
Feature activation+4.60e+00

Pos logit contributions
 valley+0.092
 woods+0.087
 mountain+0.087
 lake+0.083
 forest+0.082
Neg logit contributions
 supervision-0.282
 wounds-0.266
 permission-0.252
 apologize-0.234
olds-0.233
Token in
Token ID287
Feature activation+6.81e+00

Pos logit contributions
 mountain+0.108
 valley+0.105
 lake+0.103
 woods+0.101
 forest+0.098
Neg logit contributions
 supervision-0.176
 wounds-0.172
 permission-0.158
 apologize-0.152
and-0.149
Token a
Token ID257
Feature activation+8.78e+00

Pos logit contributions
 valley+0.147
 mountain+0.147
 woods+0.140
 forest+0.135
 lake+0.130
Neg logit contributions
 supervision-0.283
 wounds-0.275
 permission-0.257
 apologize-0.235
 apology-0.226
Token small
Token ID1402
Feature activation+7.82e+00

Pos logit contributions
 mountain+0.161
 valley+0.155
 woods+0.146
 lake+0.143
 sky+0.142
Neg logit contributions
 supervision-0.375
 wounds-0.374
 permission-0.329
 apologize-0.321
ppa-0.315
Token garden
Token ID11376
Feature activation+1.13e+01

Pos logit contributions
 Under+0.155
case+0.155
 A+0.142
�+0.141
 Inside+0.140
Neg logit contributions
 supervision-0.274
 wounds-0.272
 permission-0.251
ppa-0.228
 apologize-0.225
Token,
Token ID11
Feature activation+4.58e+00

Pos logit contributions
 valley+0.278
 mountain+0.274
 tiger+0.258
 lake+0.257
 woods+0.256
Neg logit contributions
 supervision-0.397
 wounds-0.365
 permission-0.346
olds-0.342
 apologize-0.340
Token there
Token ID612
Feature activation+8.98e+00

Pos logit contributions
 mountain+0.110
 lake+0.104
 valley+0.102
 woods+0.098
 forest+0.097
Neg logit contributions
 wounds-0.165
 supervision-0.162
 apologize-0.149
 permission-0.146
and-0.137
Token lived
Token ID5615
Feature activation+7.23e+00

Pos logit contributions
 valley+0.187
 mountain+0.185
 lake+0.166
 woods+0.165
 field+0.165
Neg logit contributions
 supervision-0.370
 wounds-0.344
 permission-0.329
 apologize-0.320
olds-0.307
Token a
Token ID257
Feature activation+7.30e+00

Pos logit contributions
 mountain+0.135
 valley+0.130
 lake+0.118
 woods+0.118
 forest+0.113
Neg logit contributions
 supervision-0.335
 wounds-0.329
 permission-0.317
 apologize-0.290
olds-0.286
Token gr
Token ID1036
Feature activation+1.77e+00

Pos logit contributions
 woods+0.153
case+0.153
 valley+0.149
 sky+0.149
 mountain+0.148
Neg logit contributions
 wounds-0.261
 supervision-0.244
ppa-0.232
 apologize-0.226
anut-0.222
Token time
Token ID640
Feature activation+5.48e+00

Pos logit contributions
 valley+0.113
 woods+0.106
 mountain+0.104
 forest+0.102
 field+0.101
Neg logit contributions
 supervision-0.404
 wounds-0.383
 permission-0.363
olds-0.334
ppa-0.332
Token,
Token ID11
Feature activation+4.39e+00

Pos logit contributions
 valley+0.093
 woods+0.089
 mountain+0.088
 lake+0.085
 field+0.084
Neg logit contributions
 supervision-0.271
 wounds-0.254
 permission-0.242
 apologize-0.225
olds-0.224
Token in
Token ID287
Feature activation+6.61e+00

Pos logit contributions
 mountain+0.103
 valley+0.100
 lake+0.098
 woods+0.095
 forest+0.092
Neg logit contributions
 supervision-0.168
 wounds-0.166
 permission-0.151
 apologize-0.146
and-0.142
Token a
Token ID257
Feature activation+8.51e+00

Pos logit contributions
 valley+0.144
 mountain+0.142
 woods+0.135
 forest+0.130
 lake+0.129
Neg logit contributions
 supervision-0.273
 wounds-0.266
 permission-0.248
 apologize-0.227
 apology-0.218
Token small
Token ID1402
Feature activation+7.56e+00

Pos logit contributions
 mountain+0.159
 valley+0.152
 woods+0.141
 lake+0.140
 sky+0.140
Neg logit contributions
 supervision-0.359
 wounds-0.359
 permission-0.316
 apologize-0.310
ppa-0.300
Token garden
Token ID11376
Feature activation+1.13e+01

Pos logit contributions
case+0.156
 Under+0.154
�+0.146
 Inside+0.140
 On+0.140
Neg logit contributions
 wounds-0.255
 supervision-0.254
 permission-0.234
 apologize-0.208
ppa-0.206
Token,
Token ID11
Feature activation+4.43e+00

Pos logit contributions
 valley+0.275
 mountain+0.271
 lake+0.256
 tiger+0.256
 woods+0.255
Neg logit contributions
 supervision-0.393
 wounds-0.364
 permission-0.346
olds-0.340
 apologize-0.338
Token there
Token ID612
Feature activation+8.78e+00

Pos logit contributions
 mountain+0.104
 lake+0.099
 valley+0.097
 woods+0.092
 field+0.092
Neg logit contributions
 wounds-0.162
 supervision-0.159
 apologize-0.148
 permission-0.145
and-0.135
Token lived
Token ID5615
Feature activation+7.18e+00

Pos logit contributions
 valley+0.191
 mountain+0.189
 lake+0.170
 field+0.169
 woods+0.168
Neg logit contributions
 supervision-0.346
 wounds-0.322
 permission-0.308
 apologize-0.300
olds-0.287
Token a
Token ID257
Feature activation+7.15e+00

Pos logit contributions
 mountain+0.136
 valley+0.131
 lake+0.120
 woods+0.119
 hill+0.114
Neg logit contributions
 supervision-0.327
 wounds-0.322
 permission-0.311
 apologize-0.284
olds-0.280
Token lazy
Token ID16931
Feature activation+6.09e+00

Pos logit contributions
case+0.152
 woods+0.151
 valley+0.148
 sky+0.147
 mountain+0.147
Neg logit contributions
 wounds-0.252
 supervision-0.234
ppa-0.223
 apologize-0.217
anut-0.214
Token time
Token ID640
Feature activation+5.55e+00

Pos logit contributions
 valley+0.111
 woods+0.103
 mountain+0.103
 forest+0.100
 field+0.100
Neg logit contributions
 supervision-0.418
 wounds-0.397
 permission-0.377
olds-0.347
ppa-0.344
Token,
Token ID11
Feature activation+4.34e+00

Pos logit contributions
 valley+0.097
 woods+0.093
 mountain+0.092
 lake+0.088
 field+0.088
Neg logit contributions
 supervision-0.271
 wounds-0.254
 permission-0.242
 apologize-0.225
olds-0.225
Token in
Token ID287
Feature activation+6.79e+00

Pos logit contributions
 mountain+0.104
 valley+0.102
 lake+0.101
 woods+0.098
 forest+0.095
Neg logit contributions
 supervision-0.159
 wounds-0.156
 permission-0.143
 apologize-0.139
and-0.137
Token a
Token ID257
Feature activation+8.69e+00

Pos logit contributions
 valley+0.146
 mountain+0.143
 woods+0.140
 forest+0.134
 lake+0.129
Neg logit contributions
 supervision-0.281
 wounds-0.274
 permission-0.256
 apologize-0.234
 apology-0.224
Token small
Token ID1402
Feature activation+7.76e+00

Pos logit contributions
 mountain+0.150
 valley+0.147
 woods+0.140
 sky+0.136
 lake+0.136
Neg logit contributions
 supervision-0.380
 wounds-0.377
 permission-0.337
 apologize-0.326
ppa-0.322
Token garden
Token ID11376
Feature activation+1.12e+01

Pos logit contributions
case+0.154
 Under+0.151
�+0.144
 A+0.138
 Inside+0.137
Neg logit contributions
 wounds-0.267
 supervision-0.266
 permission-0.249
ppa-0.224
 apologize-0.221
Token,
Token ID11
Feature activation+4.39e+00

Pos logit contributions
 valley+0.275
 mountain+0.268
 lake+0.257
 tiger+0.255
 woods+0.255
Neg logit contributions
 supervision-0.390
 wounds-0.359
olds-0.341
 permission-0.340
 apologize-0.335
Token there
Token ID612
Feature activation+8.79e+00

Pos logit contributions
 mountain+0.104
 lake+0.100
 valley+0.096
 woods+0.094
 forest+0.092
Neg logit contributions
 wounds-0.156
 supervision-0.152
 apologize-0.142
 permission-0.140
and-0.130
Token was
Token ID373
Feature activation+6.38e+00

Pos logit contributions
 valley+0.193
 mountain+0.190
 lake+0.170
 woods+0.170
 field+0.169
Neg logit contributions
 supervision-0.342
 wounds-0.320
 permission-0.304
 apologize-0.298
olds-0.284
Token a
Token ID257
Feature activation+7.75e+00

Pos logit contributions
 mountain+0.174
 valley+0.160
 hill+0.157
 lake+0.150
 tiger+0.141
Neg logit contributions
 wounds-0.206
 permission-0.202
 supervision-0.199
 apologize-0.182
and-0.177
Token little
Token ID1310
Feature activation+5.57e+00

Pos logit contributions
 woods+0.156
case+0.155
 valley+0.149
 sky+0.149
 On+0.148
Neg logit contributions
 wounds-0.287
 supervision-0.271
ppa-0.249
 apologize-0.248
 permission-0.247
Token time
Token ID640
Feature activation+5.47e+00

Pos logit contributions
 valley+0.102
 woods+0.096
 mountain+0.094
 forest+0.094
 field+0.093
Neg logit contributions
 supervision-0.413
 wounds-0.392
 permission-0.374
olds-0.344
ppa-0.340
Token,
Token ID11
Feature activation+4.45e+00

Pos logit contributions
 valley+0.091
 woods+0.086
 mountain+0.086
 lake+0.082
 forest+0.081
Neg logit contributions
 supervision-0.274
 wounds-0.258
 permission-0.245
 apologize-0.227
olds-0.227
Token in
Token ID287
Feature activation+6.78e+00

Pos logit contributions
 mountain+0.101
 valley+0.099
 lake+0.097
 woods+0.094
 forest+0.091
Neg logit contributions
 supervision-0.176
 wounds-0.172
 permission-0.159
 apologize-0.151
ppa-0.151
Token a
Token ID257
Feature activation+8.48e+00

Pos logit contributions
 valley+0.145
 mountain+0.144
 woods+0.137
 forest+0.131
 lake+0.131
Neg logit contributions
 supervision-0.283
 wounds-0.274
 permission-0.258
 apologize-0.235
 apology-0.225
Token small
Token ID1402
Feature activation+7.58e+00

Pos logit contributions
 mountain+0.149
 valley+0.143
 woods+0.136
 lake+0.133
 sky+0.132
Neg logit contributions
 wounds-0.369
 supervision-0.368
 permission-0.325
 apologize-0.315
ppa-0.313
Token garden
Token ID11376
Feature activation+1.12e+01

Pos logit contributions
case+0.147
 Under+0.147
 A+0.134
 On+0.133
�+0.132
Neg logit contributions
 wounds-0.273
 supervision-0.270
 permission-0.250
ppa-0.230
 apologize-0.222
Token,
Token ID11
Feature activation+4.42e+00

Pos logit contributions
 valley+0.280
 mountain+0.276
 tiger+0.263
 lake+0.260
 woods+0.259
Neg logit contributions
 supervision-0.392
 wounds-0.361
 permission-0.341
olds-0.336
 apologize-0.333
Token there
Token ID612
Feature activation+8.88e+00

Pos logit contributions
 mountain+0.103
 lake+0.099
 valley+0.096
 woods+0.093
 forest+0.091
Neg logit contributions
 wounds-0.161
 supervision-0.158
 apologize-0.146
 permission-0.144
and-0.133
Token was
Token ID373
Feature activation+6.32e+00

Pos logit contributions
 valley+0.186
 mountain+0.184
 field+0.165
 lake+0.165
 woods+0.163
Neg logit contributions
 supervision-0.362
 wounds-0.336
 permission-0.321
 apologize-0.313
olds-0.301
Token a
Token ID257
Feature activation+7.61e+00

Pos logit contributions
 mountain+0.172
 valley+0.159
 hill+0.155
 lake+0.149
 tiger+0.138
Neg logit contributions
 wounds-0.205
 permission-0.199
 supervision-0.199
 apologize-0.181
and-0.178
Token magic
Token ID5536
Feature activation+6.88e+00

Pos logit contributions
case+0.154
 woods+0.153
 On+0.150
 valley+0.149
 sky+0.147
Neg logit contributions
 wounds-0.279
 supervision-0.263
 apologize-0.241
ppa-0.241
 permission-0.238
TokenOnce
Token ID7454
Feature activation+3.69e+00

Pos logit contributions
 mountain+0.055
 valley+0.053
 field+0.048
 lake+0.047
 hill+0.046
Neg logit contributions
 supervision-0.114
 wounds-0.107
 permission-0.102
olds-0.100
and-0.099
Token upon
Token ID2402
Feature activation+7.67e+00

Pos logit contributions
 valley+0.091
 mountain+0.088
 woods+0.088
 lake+0.086
 forest+0.085
Neg logit contributions
 supervision-0.154
 wounds-0.145
 permission-0.135
ppa-0.123
 apologize-0.122
Token a
Token ID257
Feature activation+8.97e+00

Pos logit contributions
 valley+0.165
 mountain+0.162
 lake+0.147
 woods+0.145
 hill+0.144
Neg logit contributions
 supervision-0.328
 wounds-0.323
 permission-0.307
 apologize-0.284
and-0.274
Token time
Token ID640
Feature activation+7.06e+00

Pos logit contributions
 valley+0.131
 woods+0.123
 mountain+0.122
 forest+0.120
 field+0.117
Neg logit contributions
 supervision-0.477
 wounds-0.452
 permission-0.428
olds-0.393
 apologize-0.393
Token,
Token ID11
Feature activation+5.67e+00

Pos logit contributions
 valley+0.139
 mountain+0.137
 woods+0.136
 lake+0.132
 field+0.129
Neg logit contributions
 supervision-0.320
 wounds-0.295
 permission-0.285
 apologize-0.265
olds-0.264
Token there
Token ID612
Feature activation+1.12e+01

Pos logit contributions
 mountain+0.115
 valley+0.114
 woods+0.113
 lake+0.108
 forest+0.106
Neg logit contributions
 supervision-0.238
 wounds-0.231
 permission-0.216
ppa-0.214
anut-0.202
Token was
Token ID373
Feature activation+8.29e+00

Pos logit contributions
 valley+0.222
 mountain+0.220
 woods+0.193
 lake+0.192
 field+0.191
Neg logit contributions
 supervision-0.494
 wounds-0.472
 permission-0.442
 apologize-0.422
olds-0.414
Token a
Token ID257
Feature activation+9.40e+00

Pos logit contributions
 mountain+0.174
 valley+0.165
 hill+0.152
 woods+0.150
 lake+0.150
Neg logit contributions
 supervision-0.363
 wounds-0.351
 permission-0.334
 apologize-0.310
ppa-0.304
Token little
Token ID1310
Feature activation+6.94e+00

Pos logit contributions
 woods+0.187
 valley+0.176
 mountain+0.174
 sky+0.173
case+0.165
Neg logit contributions
 wounds-0.371
 supervision-0.359
ppa-0.333
 permission-0.326
 apologize-0.320
Token girl
Token ID2576
Feature activation+4.26e+00

Pos logit contributions
case+0.174
 woods+0.170
 Under+0.165
 On+0.164
 Inside+0.161
Neg logit contributions
 supervision-0.240
 wounds-0.232
 permission-0.214
ppa-0.202
 apology-0.195
Token called
Token ID1444
Feature activation+8.38e-01

Pos logit contributions
 valley+0.104
 mountain+0.100
 woods+0.100
 forest+0.097
 lake+0.097
Neg logit contributions
 supervision-0.182
 wounds-0.171
 permission-0.159
 apologize-0.144
ppa-0.144
Token time
Token ID640
Feature activation+5.49e+00

Pos logit contributions
 valley+0.119
 woods+0.112
 mountain+0.111
 forest+0.108
 field+0.106
Neg logit contributions
 supervision-0.404
 wounds-0.383
 permission-0.363
olds-0.333
ppa-0.332
Token,
Token ID11
Feature activation+4.47e+00

Pos logit contributions
 valley+0.083
 mountain+0.078
 woods+0.078
 forest+0.074
 lake+0.074
Neg logit contributions
 supervision-0.284
 wounds-0.270
 permission-0.255
 apologize-0.236
olds-0.235
Token in
Token ID287
Feature activation+6.77e+00

Pos logit contributions
 mountain+0.103
 valley+0.099
 lake+0.096
 woods+0.094
 forest+0.091
Neg logit contributions
 supervision-0.173
 wounds-0.171
 permission-0.155
 apologize-0.155
and-0.146
Token a
Token ID257
Feature activation+8.53e+00

Pos logit contributions
 mountain+0.145
 valley+0.145
 woods+0.136
 forest+0.131
 lake+0.128
Neg logit contributions
 supervision-0.284
 wounds-0.278
 permission-0.257
 apologize-0.239
 apology-0.227
Token wild
Token ID4295
Feature activation+6.26e+00

Pos logit contributions
 mountain+0.157
 valley+0.150
 lake+0.140
 woods+0.139
 sky+0.138
Neg logit contributions
 supervision-0.363
 wounds-0.360
 permission-0.319
 apologize-0.315
ppa-0.299
Token garden
Token ID11376
Feature activation+1.12e+01

Pos logit contributions
�+0.114
case+0.098
 Under+0.094
 mountain+0.090
br+0.090
Neg logit contributions
 wounds-0.222
 supervision-0.212
 permission-0.207
and-0.195
 apologize-0.193
Token,
Token ID11
Feature activation+4.25e+00

Pos logit contributions
 valley+0.271
 mountain+0.270
 tiger+0.258
 lake+0.252
 woods+0.249
Neg logit contributions
 supervision-0.384
 wounds-0.363
olds-0.342
 permission-0.342
 apologize-0.334
Token there
Token ID612
Feature activation+8.44e+00

Pos logit contributions
 mountain+0.097
 lake+0.092
 valley+0.091
 woods+0.085
 field+0.085
Neg logit contributions
 wounds-0.160
 supervision-0.156
 apologize-0.145
 permission-0.142
and-0.134
Token lived
Token ID5615
Feature activation+7.02e+00

Pos logit contributions
 valley+0.175
 mountain+0.175
 lake+0.155
 woods+0.155
 field+0.153
Neg logit contributions
 supervision-0.339
 wounds-0.321
 permission-0.308
 apologize-0.298
olds-0.285
Token a
Token ID257
Feature activation+6.98e+00

Pos logit contributions
 mountain+0.125
 valley+0.122
 lake+0.111
 woods+0.111
 forest+0.106
Neg logit contributions
 supervision-0.332
 wounds-0.325
 permission-0.311
 apologize-0.286
olds-0.282
Token little
Token ID1310
Feature activation+5.09e+00

Pos logit contributions
case+0.160
 woods+0.156
 mountain+0.154
 sky+0.154
 valley+0.154
Neg logit contributions
 wounds-0.234
 supervision-0.211
ppa-0.203
 apologize-0.201
anut-0.197
Token "
Token ID366
Feature activation-2.35e+00

Pos logit contributions
 supervision+0.054
 wounds+0.051
 permission+0.048
 apologize+0.044
olds+0.042
Neg logit contributions
 valley-0.039
 mountain-0.038
 woods-0.038
 forest-0.037
 lake-0.036
Tokench
Token ID354
Feature activation-3.57e+00

Pos logit contributions
 supervision+0.098
 wounds+0.093
 permission+0.086
 apologize+0.078
ppa+0.077
Neg logit contributions
 valley-0.059
 woods-0.057
 mountain-0.057
 forest-0.055
 lake-0.055
Tokenoo
Token ID2238
Feature activation-5.75e+00

Pos logit contributions
 supervision+0.141
 wounds+0.132
 permission+0.122
olds+0.110
 apologize+0.110
Neg logit contributions
 valley-0.097
 woods-0.094
 mountain-0.093
 forest-0.091
 lake-0.090
Token cho
Token ID1727
Feature activation-1.33e+01

Pos logit contributions
 supervision+0.236
 wounds+0.221
 permission+0.207
 apologize+0.185
ppa+0.185
Neg logit contributions
 valley-0.151
 woods-0.145
 mountain-0.144
 forest-0.141
 lake-0.138
Tokeno
Token ID78
Feature activation-6.46e+00

Pos logit contributions
 supervision+0.502
 wounds+0.479
 permission+0.430
 apologize+0.391
 apology+0.378
Neg logit contributions
 valley-0.357
 mountain-0.354
 woods-0.350
 sky-0.346
 field-0.346
Token cho
Token ID1727
Feature activation-1.39e+01

Pos logit contributions
 supervision+0.260
 wounds+0.244
 permission+0.228
 apologize+0.204
ppa+0.203
Neg logit contributions
 valley-0.173
 woods-0.167
 mountain-0.165
 forest-0.162
 lake-0.160
Tokeno
Token ID78
Feature activation-7.08e+00

Pos logit contributions
 supervision+0.533
 wounds+0.505
 permission+0.455
 apologize+0.414
 apology+0.401
Neg logit contributions
 valley-0.373
 mountain-0.368
 woods-0.365
 sky-0.364
 lake-0.362
Token".
Token ID1911
Feature activation-6.53e-01

Pos logit contributions
 supervision+0.285
 wounds+0.268
 permission+0.249
 apologize+0.223
ppa+0.223
Neg logit contributions
 valley-0.188
 woods-0.182
 mountain-0.180
 forest-0.177
 lake-0.175
Token\n
Token ID198
Feature activation-2.80e+00

Pos logit contributions
 supervision+0.029
 wounds+0.028
 permission+0.026
 apologize+0.024
olds+0.024
Neg logit contributions
 valley-0.014
 mountain-0.014
 woods-0.014
 forest-0.013
 lake-0.013
Token\n
Token ID198
Feature activation-1.48e+00

Pos logit contributions
 supervision+0.152
 wounds+0.142
 permission+0.138
beh+0.125
ppa+0.124
Neg logit contributions
 woods-0.036
 valley-0.036
 forest-0.034
 lake-0.034
 mountain-0.034
TokenThey
Token ID2990
Feature activation+2.86e+00

Pos logit contributions
 supervision+0.058
 wounds+0.054
 permission+0.050
 apologize+0.045
olds+0.044
Neg logit contributions
 valley-0.041
 mountain-0.040
 woods-0.040
 forest-0.039
 lake-0.039
TokenI
Token ID40
Feature activation-1.48e+00

Pos logit contributions
 supervision+0.204
 wounds+0.192
 permission+0.178
ppa+0.165
olds+0.164
Neg logit contributions
 valley-0.106
 mountain-0.102
 woods-0.099
 forest-0.097
 lake-0.097
Token am
Token ID716
Feature activation-7.73e-01

Pos logit contributions
 supervision+0.058
 wounds+0.055
 permission+0.048
ppa+0.045
beh+0.044
Neg logit contributions
 valley-0.041
 woods-0.041
 forest-0.040
 lake-0.039
 mountain-0.039
Token glad
Token ID9675
Feature activation-3.57e+00

Pos logit contributions
 supervision+0.022
 wounds+0.020
 permission+0.018
 apologize+0.015
olds+0.015
Neg logit contributions
 valley-0.030
 mountain-0.029
 woods-0.029
 forest-0.029
 lake-0.028
Token you
Token ID345
Feature activation-6.28e+00

Pos logit contributions
 supervision+0.128
 wounds+0.120
 permission+0.112
olds+0.100
 apologize+0.098
Neg logit contributions
 valley-0.112
 woods-0.107
 mountain-0.106
 field-0.105
 lake-0.104
Token love
Token ID1842
Feature activation-5.12e+00

Pos logit contributions
 supervision+0.269
 wounds+0.253
 permission+0.231
ppa+0.210
olds+0.210
Neg logit contributions
 valley-0.152
 woods-0.148
 forest-0.145
 mountain-0.145
 lake-0.143
Token Grand
Token ID5675
Feature activation-1.37e+01

Pos logit contributions
 supervision+0.201
 wounds+0.188
 permission+0.173
 apologize+0.156
ppa+0.155
Neg logit contributions
 valley-0.143
 woods-0.139
 mountain-0.138
 forest-0.136
 lake-0.134
Tokenma
Token ID2611
Feature activation-9.85e+00

Pos logit contributions
 supervision+0.398
 wounds+0.374
 permission+0.307
 apologize+0.280
ppa+0.271
Neg logit contributions
 forest-0.491
 mountain-0.482
 lake-0.480
 woods-0.480
 sky-0.476
Token.
Token ID13
Feature activation-2.39e+00

Pos logit contributions
 supervision+0.460
 wounds+0.436
 permission+0.408
 apologize+0.373
ppa+0.372
Neg logit contributions
 valley-0.197
 mountain-0.188
 woods-0.188
 forest-0.182
 lake-0.181
Token She
Token ID1375
Feature activation+1.04e-01

Pos logit contributions
 supervision+0.109
 wounds+0.100
 permission+0.096
olds+0.089
 apologize+0.088
Neg logit contributions
 valley-0.052
 mountain-0.049
 lake-0.048
 field-0.047
 forest-0.047
Token loves
Token ID10408
Feature activation-2.55e+00

Pos logit contributions
 valley+0.002
 mountain+0.002
 woods+0.002
 field+0.002
 lake+0.002
Neg logit contributions
 supervision-0.005
 wounds-0.004
 permission-0.004
 apologize-0.004
olds-0.004
Token you
Token ID345
Feature activation-5.98e+00

Pos logit contributions
 supervision+0.099
 wounds+0.092
 permission+0.084
 apologize+0.077
ppa+0.076
Neg logit contributions
 valley-0.071
 woods-0.070
 mountain-0.069
 forest-0.068
 lake-0.067
Token temper
Token ID4124
Feature activation+1.31e+00

Pos logit contributions
 supervision+0.029
 wounds+0.027
 permission+0.025
 apologize+0.023
olds+0.022
Neg logit contributions
 valley-0.021
 mountain-0.020
 woods-0.020
 forest-0.020
 lake-0.020
Token and
Token ID290
Feature activation-1.83e+00

Pos logit contributions
 valley+0.031
 mountain+0.029
 woods+0.028
 lake+0.028
 forest+0.027
Neg logit contributions
 supervision-0.057
 wounds-0.054
 permission-0.051
olds-0.046
 apologize-0.045
Token often
Token ID1690
Feature activation+1.42e+00

Pos logit contributions
 supervision+0.082
 wounds+0.077
 permission+0.072
 apologize+0.066
olds+0.066
Neg logit contributions
 valley-0.042
 mountain-0.039
 woods-0.038
 lake-0.038
 forest-0.037
Token threw
Token ID9617
Feature activation-3.58e-01

Pos logit contributions
 valley+0.039
 mountain+0.037
 woods+0.037
 lake+0.036
 forest+0.036
Neg logit contributions
 supervision-0.057
 wounds-0.052
 permission-0.049
 apologize-0.045
ppa-0.044
Token temper
Token ID4124
Feature activation-1.19e+00

Pos logit contributions
 supervision+0.016
 wounds+0.015
 permission+0.014
ppa+0.013
olds+0.013
Neg logit contributions
 valley-0.008
 woods-0.008
 forest-0.008
 lake-0.008
 mountain-0.008
Token tant
Token ID24246
Feature activation-1.33e+01

Pos logit contributions
 supervision+0.046
 wounds+0.043
 permission+0.040
ppa+0.036
 apologize+0.036
Neg logit contributions
 valley-0.033
 woods-0.032
 mountain-0.032
 forest-0.031
 lake-0.031
Tokenrums
Token ID45241
Feature activation-3.57e+00

Pos logit contributions
 supervision+0.336
 wounds+0.322
 apologize+0.277
 permission+0.249
 compliments+0.222
Neg logit contributions
 tiger-0.442
 mountain-0.442
 forest-0.439
 volcano-0.422
 woods-0.421
Token.
Token ID13
Feature activation+2.08e+00

Pos logit contributions
 supervision+0.157
 wounds+0.145
 permission+0.138
ppa+0.124
olds+0.123
Neg logit contributions
 valley-0.087
 mountain-0.082
 woods-0.081
 lake-0.079
 forest-0.079
Token One
Token ID1881
Feature activation+5.71e+00

Pos logit contributions
 valley+0.044
 mountain+0.040
 field+0.038
 woods+0.038
 lake+0.038
Neg logit contributions
 supervision-0.092
 wounds-0.086
 permission-0.083
olds-0.078
 apologize-0.076
Token day
Token ID1110
Feature activation+7.98e+00

Pos logit contributions
 valley+0.126
 mountain+0.111
 lake+0.111
 woods+0.108
 field+0.105
Neg logit contributions
 supervision-0.245
 wounds-0.225
 permission-0.225
 apologize-0.201
 apology-0.200
Token,
Token ID11
Feature activation+4.26e+00

Pos logit contributions
 valley+0.207
 lake+0.190
 forest+0.186
 woods+0.186
 field+0.185
Neg logit contributions
 supervision-0.319
 permission-0.300
 wounds-0.284
olds-0.260
 apologize-0.259
Token not
Token ID407
Feature activation-6.17e+00

Pos logit contributions
 valley+0.002
 mountain+0.002
 woods+0.002
 forest+0.002
 lake+0.002
Neg logit contributions
 supervision-0.002
 wounds-0.002
 permission-0.002
ppa-0.002
olds-0.002
Token know
Token ID760
Feature activation-3.41e+00

Pos logit contributions
 supervision+0.224
 wounds+0.212
 permission+0.185
ppa+0.168
olds+0.164
Neg logit contributions
 forest-0.185
 woods-0.185
 valley-0.183
 lake-0.179
 mountain-0.177
Token.
Token ID13
Feature activation-1.90e+00

Pos logit contributions
 supervision+0.144
 wounds+0.135
 permission+0.126
 apologize+0.113
olds+0.112
Neg logit contributions
 valley-0.087
 woods-0.084
 mountain-0.083
 forest-0.081
 field-0.080
Token We
Token ID775
Feature activation-3.68e+00

Pos logit contributions
 supervision+0.077
 wounds+0.071
 permission+0.065
 apologize+0.060
olds+0.060
Neg logit contributions
 valley-0.052
 mountain-0.051
 woods-0.050
 forest-0.050
 field-0.049
Token love
Token ID1842
Feature activation-4.19e+00

Pos logit contributions
 supervision+0.154
 wounds+0.144
 permission+0.136
 apologize+0.124
olds+0.122
Neg logit contributions
 valley-0.093
 mountain-0.091
 woods-0.089
 forest-0.087
 lake-0.086
Token Grand
Token ID5675
Feature activation-1.32e+01

Pos logit contributions
 supervision+0.171
 wounds+0.161
 permission+0.147
ppa+0.137
olds+0.134
Neg logit contributions
 valley-0.107
 woods-0.105
 mountain-0.103
 forest-0.103
 lake-0.103
Tokenma
Token ID2611
Feature activation-9.07e+00

Pos logit contributions
 supervision+0.360
 wounds+0.338
 permission+0.275
 apology+0.244
 apologize+0.244
Neg logit contributions
 forest-0.494
 lake-0.490
 mountain-0.484
 valley-0.483
 woods-0.480
Token.
Token ID13
Feature activation-1.76e+00

Pos logit contributions
 supervision+0.428
 wounds+0.405
 permission+0.379
 apologize+0.348
olds+0.347
Neg logit contributions
 valley-0.177
 mountain-0.169
 woods-0.168
 forest-0.163
 field-0.162
Token She
Token ID1375
Feature activation+3.85e-01

Pos logit contributions
 supervision+0.074
 wounds+0.068
 permission+0.064
 apologize+0.060
olds+0.060
Neg logit contributions
 valley-0.044
 mountain-0.044
 forest-0.042
 lake-0.042
 woods-0.042
Token is
Token ID318
Feature activation-4.24e-01

Pos logit contributions
 valley+0.009
 mountain+0.009
 woods+0.009
 field+0.008
 lake+0.008
Neg logit contributions
 supervision-0.017
 wounds-0.016
 permission-0.015
 apologize-0.014
olds-0.014
Token compassionate
Token ID32533
Feature activation-3.23e+00

Pos logit contributions
 supervision+0.017
 wounds+0.016
 permission+0.015
 apologize+0.014
ppa+0.014
Neg logit contributions
 valley-0.011
 mountain-0.011
 woods-0.011
 lake-0.010
 forest-0.010
Token them
Token ID606
Feature activation-5.45e+00

Pos logit contributions
 supervision+0.120
 wounds+0.112
 permission+0.103
olds+0.094
 apologize+0.094
Neg logit contributions
 valley-0.076
 woods-0.074
 mountain-0.073
 forest-0.073
 lake-0.072
Token and
Token ID290
Feature activation-3.39e+00

Pos logit contributions
 supervision+0.246
 wounds+0.232
 permission+0.218
 apologize+0.198
ppa+0.197
Neg logit contributions
 valley-0.119
 mountain-0.114
 woods-0.114
 forest-0.110
 lake-0.110
Token never
Token ID1239
Feature activation-4.50e+00

Pos logit contributions
 supervision+0.135
 wounds+0.126
 permission+0.117
ppa+0.107
olds+0.105
Neg logit contributions
 valley-0.090
 woods-0.088
 mountain-0.087
 forest-0.086
 field-0.085
Token threw
Token ID9617
Feature activation-2.88e+00

Pos logit contributions
 supervision+0.165
 wounds+0.150
 permission+0.140
 apologize+0.127
olds+0.123
Neg logit contributions
 valley-0.140
 mountain-0.135
 woods-0.134
 lake-0.131
 forest-0.130
Token a
Token ID257
Feature activation-7.58e-02

Pos logit contributions
 supervision+0.105
 wounds+0.097
 permission+0.089
ppa+0.080
olds+0.080
Neg logit contributions
 valley-0.088
 woods-0.085
 mountain-0.084
 forest-0.084
 lake-0.083
Token tant
Token ID24246
Feature activation-1.31e+01

Pos logit contributions
 supervision+0.003
 wounds+0.003
 permission+0.003
ppa+0.003
olds+0.003
Neg logit contributions
 valley-0.002
 mountain-0.002
 woods-0.002
 lake-0.002
 forest-0.002
Tokenrum
Token ID6582
Feature activation-1.92e+00

Pos logit contributions
 supervision+0.360
 wounds+0.352
 apologize+0.299
 permission+0.271
 compliments+0.255
Neg logit contributions
 tiger-0.375
 forest-0.368
 mountain-0.366
 snake-0.356
 volcano-0.353
Token again
Token ID757
Feature activation-2.21e+00

Pos logit contributions
 supervision+0.083
 wounds+0.077
 permission+0.073
olds+0.066
 apologize+0.066
Neg logit contributions
 valley-0.048
 mountain-0.046
 lake-0.044
 woods-0.044
 field-0.043
Token.
Token ID13
Feature activation+3.78e-01

Pos logit contributions
 supervision+0.097
 wounds+0.091
 permission+0.086
 apologize+0.077
ppa+0.077
Neg logit contributions
 valley-0.052
 mountain-0.049
 woods-0.049
 lake-0.048
 forest-0.047
Token<|endoftext|>
Token ID50256
Feature activation+1.55e+00

Pos logit contributions
 valley+0.008
 woods+0.007
 mountain+0.007
 forest+0.007
 lake+0.007
Neg logit contributions
 supervision-0.018
 wounds-0.017
 permission-0.016
ppa-0.014
 apologize-0.014
TokenOnce
Token ID7454
Feature activation+2.34e+00

Pos logit contributions
 valley+0.038
 mountain+0.037
 woods+0.034
 field+0.034
 lake+0.034
Neg logit contributions
 supervision-0.062
 wounds-0.059
 permission-0.054
olds-0.052
 apologize-0.051
Token did
Token ID750
Feature activation+3.86e-01

Pos logit contributions
 supervision+0.077
 wounds+0.073
 permission+0.068
 apologize+0.062
olds+0.062
Neg logit contributions
 valley-0.036
 mountain-0.035
 woods-0.035
 forest-0.034
 lake-0.033
Token not
Token ID407
Feature activation-6.11e+00

Pos logit contributions
 valley+0.010
 mountain+0.010
 woods+0.010
 forest+0.010
 top+0.010
Neg logit contributions
 supervision-0.015
 wounds-0.014
 permission-0.013
ppa-0.012
olds-0.012
Token want
Token ID765
Feature activation-3.52e+00

Pos logit contributions
 supervision+0.240
 wounds+0.227
 permission+0.194
ppa+0.185
olds+0.174
Neg logit contributions
 forest-0.163
 woods-0.159
 valley-0.153
 lake-0.153
 mountain-0.150
Token to
Token ID284
Feature activation-6.32e+00

Pos logit contributions
 supervision+0.158
 wounds+0.149
 permission+0.138
ppa+0.128
 apologize+0.126
Neg logit contributions
 valley-0.078
 woods-0.075
 mountain-0.075
 forest-0.073
 lake-0.072
Token make
Token ID787
Feature activation-4.54e+00

Pos logit contributions
 supervision+0.256
 wounds+0.240
 permission+0.220
ppa+0.202
olds+0.195
Neg logit contributions
 valley-0.161
 forest-0.158
 lake-0.158
 mountain-0.157
 woods-0.154
Token Mrs
Token ID9074
Feature activation-1.31e+01

Pos logit contributions
 supervision+0.199
 wounds+0.185
 permission+0.173
ppa+0.163
olds+0.156
Neg logit contributions
 valley-0.104
 woods-0.099
 mountain-0.099
 forest-0.099
 lake-0.097
Token.
Token ID13
Feature activation-3.26e+00

Pos logit contributions
 supervision+0.620
 wounds+0.585
 permission+0.550
 apologize+0.503
olds+0.501
Neg logit contributions
 valley-0.259
 woods-0.248
 mountain-0.247
 forest-0.240
 lake-0.236
Token Lee
Token ID5741
Feature activation-6.91e+00

Pos logit contributions
 supervision+0.095
 wounds+0.085
 permission+0.081
 apologize+0.067
olds+0.066
Neg logit contributions
 valley-0.121
 woods-0.117
 mountain-0.117
 lake-0.116
 field-0.116
Token sad
Token ID6507
Feature activation-2.91e+00

Pos logit contributions
 supervision+0.305
 wounds+0.287
 permission+0.268
 apologize+0.244
olds+0.243
Neg logit contributions
 valley-0.158
 woods-0.151
 mountain-0.151
 forest-0.147
 lake-0.146
Token.
Token ID13
Feature activation-1.85e+00

Pos logit contributions
 supervision+0.142
 wounds+0.133
 permission+0.126
ppa+0.120
beh+0.114
Neg logit contributions
 valley-0.050
 woods-0.049
 mountain-0.048
 forest-0.048
 sky-0.046
Token What
Token ID1867
Feature activation+2.88e+00

Pos logit contributions
 supervision+0.089
 wounds+0.083
 permission+0.078
olds+0.074
 apologize+0.073
Neg logit contributions
 valley-0.036
 mountain-0.035
 lake-0.033
 woods-0.033
 field-0.032
Token.
Token ID13
Feature activation-6.20e-01

Pos logit contributions
 supervision+0.059
 wounds+0.055
 permission+0.052
 apologize+0.048
ppa+0.047
Neg logit contributions
 valley-0.033
 woods-0.032
 lake-0.031
 forest-0.031
 mountain-0.030
Token You
Token ID921
Feature activation-2.43e+00

Pos logit contributions
 supervision+0.026
 wounds+0.024
 permission+0.023
 apologize+0.021
olds+0.021
Neg logit contributions
 valley-0.016
 mountain-0.015
 woods-0.015
 forest-0.015
 lake-0.014
Token want
Token ID765
Feature activation-2.05e+00

Pos logit contributions
 supervision+0.095
 wounds+0.088
 permission+0.084
 apologize+0.079
olds+0.075
Neg logit contributions
 valley-0.068
 woods-0.066
 mountain-0.065
 lake-0.064
 field-0.064
Token to
Token ID284
Feature activation-5.19e+00

Pos logit contributions
 supervision+0.090
 wounds+0.085
 permission+0.079
ppa+0.072
 apologize+0.072
Neg logit contributions
 valley-0.047
 woods-0.046
 mountain-0.046
 forest-0.044
 lake-0.044
Token show
Token ID905
Feature activation-4.67e+00

Pos logit contributions
 supervision+0.189
 wounds+0.176
 permission+0.163
 apologize+0.148
olds+0.144
Neg logit contributions
 valley-0.160
 woods-0.155
 mountain-0.154
 forest-0.151
 field-0.149
Token Grand
Token ID5675
Feature activation-1.30e+01

Pos logit contributions
 supervision+0.196
 wounds+0.182
 permission+0.163
beh+0.160
ppa+0.157
Neg logit contributions
 valley-0.105
 forest-0.103
 mountain-0.103
 woods-0.103
 top-0.102
Tokenma
Token ID2611
Feature activation-8.70e+00

Pos logit contributions
 supervision+0.392
 wounds+0.359
 permission+0.306
 apologize+0.276
 apology+0.271
Neg logit contributions
 forest-0.463
 mountain-0.448
 woods-0.442
 lake-0.440
 valley-0.437
Token and
Token ID290
Feature activation-5.40e+00

Pos logit contributions
 supervision+0.395
 wounds+0.374
 permission+0.348
 apologize+0.319
olds+0.317
Neg logit contributions
 valley-0.184
 mountain-0.177
 woods-0.177
 forest-0.172
 field-0.171
Token Grand
Token ID5675
Feature activation-1.24e+01

Pos logit contributions
 supervision+0.195
 wounds+0.186
 permission+0.166
ppa+0.152
olds+0.150
Neg logit contributions
 valley-0.157
 mountain-0.155
 woods-0.155
 forest-0.152
 field-0.152
Tokenpa
Token ID8957
Feature activation-7.57e+00

Pos logit contributions
 supervision+0.351
 wounds+0.340
shadow+0.259
 permission+0.257
 rid+0.249
Neg logit contributions
 forest-0.429
 mountain-0.404
 woods-0.403
 lake-0.402
 sky-0.396
Token your
Token ID534
Feature activation-4.61e+00

Pos logit contributions
 supervision+0.346
 wounds+0.327
 permission+0.306
 apologize+0.279
ppa+0.278
Neg logit contributions
 valley-0.160
 mountain-0.153
 woods-0.153
 forest-0.148
 lake-0.147
Token And
Token ID843
Feature activation-4.44e+00

Pos logit contributions
 supervision+0.046
 wounds+0.043
 permission+0.040
 apologize+0.036
olds+0.036
Neg logit contributions
 valley-0.032
 mountain-0.031
 woods-0.030
 forest-0.030
 field-0.029
Token only
Token ID691
Feature activation-2.12e+00

Pos logit contributions
 supervision+0.171
 wounds+0.159
 permission+0.148
 apologize+0.133
olds+0.129
Neg logit contributions
 valley-0.129
 woods-0.125
 mountain-0.124
 forest-0.122
 lake-0.120
Token if
Token ID611
Feature activation-2.34e+00

Pos logit contributions
 supervision+0.081
 wounds+0.073
 permission+0.070
 apologize+0.063
olds+0.060
Neg logit contributions
 valley-0.064
 woods-0.061
 mountain-0.061
 forest-0.059
 lake-0.059
Token we
Token ID356
Feature activation-5.42e+00

Pos logit contributions
 supervision+0.087
 wounds+0.080
 permission+0.076
 apologize+0.067
olds+0.064
Neg logit contributions
 valley-0.071
 woods-0.068
 mountain-0.068
 lake-0.067
 forest-0.067
Token bring
Token ID2222
Feature activation-4.12e+00

Pos logit contributions
 supervision+0.198
 wounds+0.183
 permission+0.174
 apologize+0.156
olds+0.148
Neg logit contributions
 valley-0.164
 woods-0.161
 mountain-0.158
 forest-0.156
 lake-0.153
Token Mr
Token ID1770
Feature activation-1.29e+01

Pos logit contributions
 supervision+0.183
 wounds+0.168
 permission+0.156
ppa+0.153
beh+0.152
Neg logit contributions
 lake-0.084
 valley-0.082
 field-0.081
 woods-0.081
 forest-0.080
Token.
Token ID13
Feature activation-2.73e+00

Pos logit contributions
 supervision+0.582
 wounds+0.552
 permission+0.517
 apologize+0.467
ppa+0.467
Neg logit contributions
 valley-0.290
 mountain-0.276
 woods-0.273
 lake-0.264
 forest-0.263
Token Fl
Token ID1610
Feature activation-8.57e+00

Pos logit contributions
 supervision+0.102
 wounds+0.094
 permission+0.090
 apologize+0.079
 apology+0.076
Neg logit contributions
 valley-0.079
 woods-0.075
 field-0.075
 mountain-0.074
 forest-0.072
Tokenuff
Token ID1648
Feature activation-6.68e+00

Pos logit contributions
 supervision+0.303
 wounds+0.289
 permission+0.263
olds+0.231
ppa+0.230
Neg logit contributions
 valley-0.267
 woods-0.264
 mountain-0.263
 forest-0.255
 lake-0.255
Token."
Token ID526
Feature activation-2.88e+00

Pos logit contributions
 supervision+0.286
 wounds+0.266
 permission+0.256
olds+0.228
 apologize+0.226
Neg logit contributions
 valley-0.167
 woods-0.163
 forest-0.156
 mountain-0.156
 lake-0.154
Token\n
Token ID198
Feature activation-4.74e+00

Pos logit contributions
 supervision+0.150
 wounds+0.141
 permission+0.134
beh+0.123
ppa+0.122
Neg logit contributions
 valley-0.045
 woods-0.044
 forest-0.042
 sky-0.041
 mountain-0.041
Token other
Token ID584
Feature activation-5.88e+00

Pos logit contributions
 supervision+0.275
 wounds+0.257
 permission+0.233
ppa+0.220
beh+0.217
Neg logit contributions
 valley-0.137
 woods-0.137
 forest-0.135
 mountain-0.134
 field-0.133
Token.
Token ID13
Feature activation-1.81e+00

Pos logit contributions
 supervision+0.288
 wounds+0.268
 permission+0.255
ppa+0.238
olds+0.237
Neg logit contributions
 valley-0.100
 mountain-0.094
 sky-0.094
 forest-0.094
 lake-0.093
Token And
Token ID843
Feature activation-4.76e+00

Pos logit contributions
 supervision+0.077
 wounds+0.072
 permission+0.067
 apologize+0.062
olds+0.061
Neg logit contributions
 valley-0.045
 mountain-0.043
 woods-0.043
 forest-0.042
 field-0.041
Token we
Token ID356
Feature activation-3.35e+00

Pos logit contributions
 supervision+0.209
 wounds+0.196
 permission+0.183
 apologize+0.167
olds+0.164
Neg logit contributions
 valley-0.114
 woods-0.110
 mountain-0.108
 forest-0.106
 lake-0.104
Token love
Token ID1842
Feature activation-4.16e+00

Pos logit contributions
 supervision+0.146
 wounds+0.137
 permission+0.128
 apologize+0.116
ppa+0.116
Neg logit contributions
 valley-0.079
 woods-0.076
 mountain-0.076
 forest-0.074
 lake-0.073
Token Grand
Token ID5675
Feature activation-1.29e+01

Pos logit contributions
 supervision+0.166
 wounds+0.154
 permission+0.142
ppa+0.132
olds+0.129
Neg logit contributions
 valley-0.110
 woods-0.109
 mountain-0.108
 forest-0.107
 lake-0.107
Tokenma
Token ID2611
Feature activation-9.07e+00

Pos logit contributions
 supervision+0.385
 wounds+0.361
 permission+0.306
ppa+0.275
shadow+0.267
Neg logit contributions
 forest-0.449
 lake-0.442
 sky-0.429
 mountain-0.428
 valley-0.424
Token.
Token ID13
Feature activation-1.91e+00

Pos logit contributions
 supervision+0.433
 wounds+0.408
 permission+0.382
 apologize+0.351
olds+0.350
Neg logit contributions
 valley-0.169
 mountain-0.162
 woods-0.161
 forest-0.158
 field-0.158
Token And
Token ID843
Feature activation-4.92e+00

Pos logit contributions
 supervision+0.082
 wounds+0.076
 permission+0.070
olds+0.068
 apologize+0.067
Neg logit contributions
 valley-0.046
 mountain-0.046
 woods-0.043
 lake-0.043
 forest-0.043
Token we
Token ID356
Feature activation-3.36e+00

Pos logit contributions
 supervision+0.208
 wounds+0.193
 permission+0.182
 apologize+0.166
olds+0.162
Neg logit contributions
 valley-0.126
 woods-0.123
 mountain-0.119
 forest-0.118
 lake-0.117
Token love
Token ID1842
Feature activation-4.25e+00

Pos logit contributions
 supervision+0.134
 wounds+0.126
 permission+0.116
ppa+0.104
 apologize+0.104
Neg logit contributions
 valley-0.091
 woods-0.088
 mountain-0.088
 forest-0.086
 lake-0.086
Token Mom
Token ID11254
Feature activation-6.33e+00

Pos logit contributions
 supervision+0.084
 wounds+0.078
 permission+0.072
 apologize+0.063
olds+0.062
Neg logit contributions
 valley-0.080
 woods-0.078
 mountain-0.076
 forest-0.076
 lake-0.075
Token.
Token ID13
Feature activation-1.16e+00

Pos logit contributions
 supervision+0.284
 wounds+0.265
 permission+0.251
 apologize+0.228
ppa+0.227
Neg logit contributions
 valley-0.143
 woods-0.136
 mountain-0.135
 forest-0.130
 lake-0.130
Token And
Token ID843
Feature activation-4.30e+00

Pos logit contributions
 supervision+0.044
 wounds+0.039
 permission+0.037
 apologize+0.034
olds+0.034
Neg logit contributions
 valley-0.035
 mountain-0.034
 woods-0.033
 forest-0.033
 field-0.033
Token we
Token ID356
Feature activation-2.78e+00

Pos logit contributions
 supervision+0.173
 wounds+0.158
 permission+0.154
 apologize+0.136
 apology+0.130
Neg logit contributions
 valley-0.119
 woods-0.119
 forest-0.114
 mountain-0.110
 field-0.108
Token love
Token ID1842
Feature activation-4.02e+00

Pos logit contributions
 supervision+0.096
 wounds+0.088
 permission+0.083
 apologize+0.074
ppa+0.072
Neg logit contributions
 valley-0.091
 mountain-0.089
 woods-0.088
 forest-0.086
 lake-0.085
Token Grand
Token ID5675
Feature activation-1.29e+01

Pos logit contributions
 supervision+0.158
 wounds+0.147
 permission+0.136
ppa+0.122
 apologize+0.122
Neg logit contributions
 valley-0.112
 mountain-0.109
 woods-0.109
 forest-0.106
 lake-0.106
Tokenma
Token ID2611
Feature activation-9.16e+00

Pos logit contributions
 supervision+0.363
 wounds+0.343
 permission+0.287
ppa+0.259
 apologize+0.251
Neg logit contributions
 forest-0.467
 lake-0.459
 mountain-0.456
 valley-0.453
 woods-0.449
Token.
Token ID13
Feature activation-1.79e+00

Pos logit contributions
 supervision+0.426
 wounds+0.403
 permission+0.377
 apologize+0.345
olds+0.344
Neg logit contributions
 valley-0.186
 mountain-0.178
 woods-0.178
 forest-0.172
 lake-0.170
Token And
Token ID843
Feature activation-4.76e+00

Pos logit contributions
 supervision+0.072
 wounds+0.065
 permission+0.061
olds+0.058
 apologize+0.058
Neg logit contributions
 valley-0.048
 mountain-0.047
 forest-0.045
 woods-0.044
 field-0.044
Token we
Token ID356
Feature activation-3.05e+00

Pos logit contributions
 supervision+0.194
 wounds+0.177
 permission+0.171
 apologize+0.153
olds+0.147
Neg logit contributions
 valley-0.129
 woods-0.128
 forest-0.123
 mountain-0.121
 lake-0.120
Token love
Token ID1842
Feature activation-4.23e+00

Pos logit contributions
 supervision+0.104
 wounds+0.095
 permission+0.089
 apologize+0.079
ppa+0.077
Neg logit contributions
 valley-0.102
 mountain-0.099
 woods-0.099
 forest-0.096
 lake-0.096
Token baby
Token ID5156
Feature activation-6.41e+00

Pos logit contributions
 supervision+0.155
 wounds+0.145
 permission+0.134
 apologize+0.120
ppa+0.119
Neg logit contributions
 valley-0.113
 mountain-0.109
 woods-0.109
 forest-0.106
 lake-0.106
Token back
Token ID736
Feature activation-3.68e+00

Pos logit contributions
 supervision+0.278
 wounds+0.260
 permission+0.241
olds+0.222
ppa+0.221
Neg logit contributions
 valley-0.152
 mountain-0.146
 woods-0.146
 forest-0.144
 field-0.141
Token and
Token ID290
Feature activation-2.88e+00

Pos logit contributions
 supervision+0.157
 wounds+0.147
 permission+0.141
 apologize+0.126
ppa+0.124
Neg logit contributions
 valley-0.089
 mountain-0.085
 woods-0.082
 lake-0.082
 forest-0.079
Token she
Token ID673
Feature activation+9.97e-01

Pos logit contributions
 supervision+0.117
 wounds+0.109
 permission+0.101
ppa+0.091
olds+0.091
Neg logit contributions
 valley-0.077
 woods-0.074
 mountain-0.074
 forest-0.073
 lake-0.072
Token thanked
Token ID26280
Feature activation-5.98e+00

Pos logit contributions
 valley+0.033
 woods+0.032
 mountain+0.032
 forest+0.031
 lake+0.031
Neg logit contributions
 supervision-0.034
 wounds-0.031
 permission-0.029
 apologize-0.025
ppa-0.025
Token Twe
Token ID24205
Feature activation-1.29e+01

Pos logit contributions
 supervision+0.254
 wounds+0.236
 permission+0.215
olds+0.201
ppa+0.196
Neg logit contributions
 valley-0.152
 mountain-0.144
 woods-0.143
 forest-0.143
 lake-0.142
Tokenety
Token ID2963
Feature activation-7.91e+00

Pos logit contributions
 supervision+0.440
 wounds+0.418
 permission+0.364
 apologize+0.343
olds+0.323
Neg logit contributions
 woods-0.381
 valley-0.378
 forest-0.377
 field-0.371
 lake-0.363
Token and
Token ID290
Feature activation-3.96e+00

Pos logit contributions
 supervision+0.326
 wounds+0.312
 permission+0.272
 elbows+0.260
olds+0.259
Neg logit contributions
 mountain-0.193
 valley-0.191
 forest-0.187
 field-0.186
 woods-0.186
Token the
Token ID262
Feature activation-2.56e+00

Pos logit contributions
 supervision+0.157
 wounds+0.148
 permission+0.136
olds+0.123
ppa+0.122
Neg logit contributions
 valley-0.109
 mountain-0.105
 woods-0.105
 forest-0.103
 field-0.102
Token owl
Token ID39610
Feature activation-4.03e+00

Pos logit contributions
 supervision+0.117
 wounds+0.111
 permission+0.103
 apologize+0.094
ppa+0.094
Neg logit contributions
 valley-0.056
 mountain-0.053
 woods-0.053
 forest-0.051
 lake-0.051
Token for
Token ID329
Feature activation-2.34e+00

Pos logit contributions
 supervision+0.175
 wounds+0.165
 permission+0.153
 apologize+0.139
olds+0.139
Neg logit contributions
 valley-0.095
 mountain-0.091
 woods-0.091
 forest-0.088
 lake-0.087
Token can
Token ID460
Feature activation-3.34e+00

Pos logit contributions
 supervision+0.138
 wounds+0.128
 permission+0.123
 apologize+0.115
olds+0.110
Neg logit contributions
 valley-0.088
 woods-0.087
 mountain-0.085
 field-0.083
 lake-0.082
Token still
Token ID991
Feature activation-4.65e+00

Pos logit contributions
 supervision+0.122
 wounds+0.115
 permission+0.106
 apologize+0.099
olds+0.095
Neg logit contributions
 valley-0.099
 woods-0.098
 mountain-0.096
 field-0.093
 forest-0.093
Token remind
Token ID7101
Feature activation-7.41e+00

Pos logit contributions
 supervision+0.176
 wounds+0.165
 permission+0.153
 apologize+0.138
olds+0.136
Neg logit contributions
 valley-0.134
 woods-0.130
 mountain-0.129
 forest-0.126
 lake-0.125
Token me
Token ID502
Feature activation-9.12e+00

Pos logit contributions
 supervision+0.332
 wounds+0.312
 permission+0.282
ppa+0.264
olds+0.260
Neg logit contributions
 forest-0.156
 valley-0.152
 woods-0.151
 lake-0.148
 sky-0.145
Token of
Token ID286
Feature activation-3.80e+00

Pos logit contributions
 supervision+0.400
 wounds+0.376
 permission+0.347
ppa+0.319
olds+0.316
Neg logit contributions
 valley-0.209
 mountain-0.201
 woods-0.198
 forest-0.195
 lake-0.193
Token Grand
Token ID5675
Feature activation-1.29e+01

Pos logit contributions
 supervision+0.147
 wounds+0.138
 permission+0.126
olds+0.115
ppa+0.115
Neg logit contributions
 valley-0.104
 mountain-0.101
 woods-0.101
 forest-0.101
 lake-0.099
Tokenma
Token ID2611
Feature activation-8.85e+00

Pos logit contributions
 supervision+0.378
 wounds+0.352
 permission+0.289
olded+0.278
 apologize+0.268
Neg logit contributions
 forest-0.456
 mountain-0.437
 lake-0.433
 valley-0.427
 woods-0.422
Token.
Token ID13
Feature activation-1.36e+00

Pos logit contributions
 supervision+0.416
 wounds+0.396
 permission+0.367
 apologize+0.338
olds+0.336
Neg logit contributions
 valley-0.168
 mountain-0.162
 woods-0.161
 forest-0.158
 field-0.157
Token She
Token ID1375
Feature activation+1.18e+00

Pos logit contributions
 supervision+0.060
 wounds+0.055
 permission+0.053
olds+0.049
 apologize+0.049
Neg logit contributions
 valley-0.032
 mountain-0.031
 woods-0.030
 field-0.029
 lake-0.029
Token was
Token ID373
Feature activation-7.62e-02

Pos logit contributions
 valley+0.026
 mountain+0.025
 woods+0.025
 field+0.024
 lake+0.024
Neg logit contributions
 supervision-0.053
 wounds-0.050
 permission-0.046
 apologize-0.044
olds-0.043
Token good
Token ID922
Feature activation-3.47e+00

Pos logit contributions
 supervision+0.003
 wounds+0.003
 permission+0.003
 apologize+0.003
ppa+0.003
Neg logit contributions
 valley-0.002
 mountain-0.002
 woods-0.002
 forest-0.002
 lake-0.002
Token,
Token ID11
Feature activation-2.65e+00

Pos logit contributions
 supervision+0.282
 wounds+0.267
 permission+0.251
 apologize+0.228
ppa+0.226
Neg logit contributions
 valley-0.137
 woods-0.131
 mountain-0.131
 forest-0.127
 lake-0.125
Token Mom
Token ID11254
Feature activation-6.76e+00

Pos logit contributions
 supervision+0.093
 wounds+0.087
 permission+0.080
 apologize+0.070
olds+0.069
Neg logit contributions
 valley-0.085
 woods-0.083
 mountain-0.082
 forest-0.080
 lake-0.079
Token.
Token ID13
Feature activation-1.06e+00

Pos logit contributions
 supervision+0.300
 wounds+0.282
 permission+0.265
 apologize+0.241
olds+0.240
Neg logit contributions
 valley-0.156
 woods-0.148
 mountain-0.147
 forest-0.142
 lake-0.141
Token We
Token ID775
Feature activation-2.68e+00

Pos logit contributions
 supervision+0.040
 wounds+0.036
 permission+0.034
 apologize+0.031
olds+0.031
Neg logit contributions
 valley-0.031
 mountain-0.030
 forest-0.029
 woods-0.029
 field-0.029
Token love
Token ID1842
Feature activation-4.00e+00

Pos logit contributions
 supervision+0.092
 wounds+0.085
 permission+0.078
 apologize+0.068
ppa+0.068
Neg logit contributions
 valley-0.087
 woods-0.085
 mountain-0.085
 forest-0.083
 lake-0.083
Token Grand
Token ID5675
Feature activation-1.28e+01

Pos logit contributions
 supervision+0.167
 wounds+0.157
 permission+0.144
ppa+0.132
 apologize+0.131
Neg logit contributions
 valley-0.101
 mountain-0.098
 woods-0.098
 forest-0.096
 lake-0.096
Tokenma
Token ID2611
Feature activation-8.91e+00

Pos logit contributions
 supervision+0.375
 wounds+0.356
 permission+0.293
ppa+0.264
 apologize+0.259
Neg logit contributions
 forest-0.456
 mountain-0.451
 lake-0.446
 valley-0.445
 sky-0.442
Token too
Token ID1165
Feature activation-3.32e+00

Pos logit contributions
 supervision+0.422
 wounds+0.399
 permission+0.374
 apologize+0.343
olds+0.342
Neg logit contributions
 valley-0.172
 mountain-0.164
 woods-0.164
 forest-0.159
 field-0.158
Token.
Token ID13
Feature activation-1.55e+00

Pos logit contributions
 supervision+0.135
 wounds+0.128
 permission+0.121
olds+0.110
 apologize+0.108
Neg logit contributions
 valley-0.088
 woods-0.083
 mountain-0.082
 forest-0.081
 lake-0.079
Token We
Token ID775
Feature activation-3.02e+00

Pos logit contributions
 supervision+0.062
 wounds+0.056
 permission+0.053
olds+0.049
 apologize+0.049
Neg logit contributions
 valley-0.042
 mountain-0.041
 forest-0.040
 woods-0.040
 field-0.039
Token want
Token ID765
Feature activation-1.91e+00

Pos logit contributions
 supervision+0.106
 wounds+0.098
 permission+0.089
ppa+0.079
olds+0.079
Neg logit contributions
 valley-0.096
 woods-0.093
 mountain-0.093
 forest-0.092
 lake-0.091
Tokenushed
Token ID7474
Feature activation-2.15e+00

Pos logit contributions
 supervision+0.096
 wounds+0.088
 permission+0.077
olds+0.066
ppa+0.066
Neg logit contributions
 valley-0.146
 mountain-0.143
 woods-0.142
 forest-0.140
 lake-0.139
Token and
Token ID290
Feature activation-2.87e+00

Pos logit contributions
 supervision+0.088
 wounds+0.084
 permission+0.080
 apologize+0.072
ppa+0.071
Neg logit contributions
 valley-0.054
 mountain-0.053
 woods-0.052
 forest-0.050
 lake-0.049
Token thanked
Token ID26280
Feature activation-4.72e+00

Pos logit contributions
 supervision+0.106
 wounds+0.099
 permission+0.091
 apologize+0.082
ppa+0.081
Neg logit contributions
 valley-0.086
 mountain-0.083
 woods-0.083
 forest-0.081
 lake-0.080
Token the
Token ID262
Feature activation-3.28e+00

Pos logit contributions
 supervision+0.189
 wounds+0.176
 permission+0.158
ppa+0.147
olds+0.146
Neg logit contributions
 valley-0.130
 mountain-0.126
 woods-0.125
 forest-0.124
 lake-0.122
Token woman
Token ID2415
Feature activation-6.18e+00

Pos logit contributions
 supervision+0.144
 wounds+0.136
 permission+0.128
 apologize+0.116
olds+0.115
Neg logit contributions
 valley-0.075
 mountain-0.072
 woods-0.071
 forest-0.069
 field-0.069
Token prof
Token ID1534
Feature activation-1.28e+01

Pos logit contributions
 supervision+0.272
 wounds+0.257
 permission+0.236
ppa+0.217
 apologize+0.216
Neg logit contributions
 valley-0.140
 mountain-0.136
 woods-0.134
 forest-0.131
 field-0.131
Tokenuse
Token ID1904
Feature activation-8.75e+00

Pos logit contributions
 supervision+0.393
 wounds+0.392
 permission+0.304
 apology+0.284
and+0.279
Neg logit contributions
 valley-0.426
 field-0.415
 woods-0.413
 lake-0.412
 forest-0.406
Tokenly
Token ID306
Feature activation-1.92e+00

Pos logit contributions
 supervision+0.368
 wounds+0.357
beh+0.299
 permission+0.296
 apologize+0.295
Neg logit contributions
 forest-0.195
 field-0.195
 woods-0.190
 sky-0.188
 mountain-0.185
Token.
Token ID13
Feature activation-9.03e-01

Pos logit contributions
 supervision+0.091
 wounds+0.087
 permission+0.080
 apologize+0.075
ppa+0.074
Neg logit contributions
 valley-0.036
 mountain-0.035
 woods-0.034
 field-0.034
 forest-0.033
Token The
Token ID383
Feature activation+6.42e-01

Pos logit contributions
 supervision+0.040
 wounds+0.036
 permission+0.035
olds+0.033
 apologize+0.032
Neg logit contributions
 valley-0.020
 mountain-0.019
 woods-0.019
 lake-0.018
 forest-0.018
Token woman
Token ID2415
Feature activation-3.04e+00

Pos logit contributions
 valley+0.018
 mountain+0.017
 woods+0.017
 forest+0.017
 lake+0.017
Neg logit contributions
 supervision-0.025
 wounds-0.023
 permission-0.022
 apologize-0.019
ppa-0.019
Token I
Token ID314
Feature activation-1.23e+00

Pos logit contributions
 supervision+0.039
 wounds+0.035
 permission+0.033
 apologize+0.032
olds+0.032
Neg logit contributions
 valley-0.018
 woods-0.017
 mountain-0.017
 field-0.016
 lake-0.016
Token just
Token ID655
Feature activation-3.69e+00

Pos logit contributions
 supervision+0.049
 wounds+0.046
 permission+0.043
 apologize+0.039
olds+0.038
Neg logit contributions
 valley-0.033
 mountain-0.032
 woods-0.032
 forest-0.031
 lake-0.031
Token wanted
Token ID2227
Feature activation-8.52e-01

Pos logit contributions
 supervision+0.148
 wounds+0.133
 permission+0.130
 apologize+0.118
olds+0.113
Neg logit contributions
 valley-0.104
 mountain-0.099
 woods-0.099
 lake-0.096
 forest-0.096
Token to
Token ID284
Feature activation-4.33e+00

Pos logit contributions
 supervision+0.038
 wounds+0.036
 permission+0.033
ppa+0.030
 apologize+0.030
Neg logit contributions
 valley-0.019
 woods-0.018
 mountain-0.018
 forest-0.018
 lake-0.018
Token make
Token ID787
Feature activation-3.20e+00

Pos logit contributions
 supervision+0.154
 wounds+0.142
 permission+0.133
 apologize+0.122
 apology+0.115
Neg logit contributions
 valley-0.138
 woods-0.135
 mountain-0.131
 forest-0.129
 field-0.129
Token Grand
Token ID5675
Feature activation-1.28e+01

Pos logit contributions
 supervision+0.143
 wounds+0.134
 permission+0.125
ppa+0.117
olds+0.116
Neg logit contributions
 valley-0.068
 mountain-0.066
 woods-0.065
 forest-0.065
 lake-0.064
Tokenma
Token ID2611
Feature activation-9.55e+00

Pos logit contributions
 supervision+0.357
 wounds+0.337
 permission+0.278
ppa+0.242
 apologize+0.242
Neg logit contributions
 forest-0.476
 mountain-0.474
 valley-0.468
 lake-0.467
 woods-0.462
Token happy
Token ID3772
Feature activation-3.37e+00

Pos logit contributions
 supervision+0.353
 wounds+0.328
 permission+0.300
olds+0.270
ppa+0.269
Neg logit contributions
 valley-0.277
 mountain-0.271
 field-0.264
 woods-0.263
 forest-0.263
Token."
Token ID526
Feature activation-2.58e+00

Pos logit contributions
 supervision+0.132
 wounds+0.124
 permission+0.117
 apologize+0.105
olds+0.102
Neg logit contributions
 valley-0.093
 woods-0.089
 mountain-0.088
 lake-0.087
 field-0.086
Token
Token ID198
Feature activation-4.24e+00

Pos logit contributions
 supervision+0.142
 wounds+0.138
 permission+0.127
beh+0.121
ppa+0.121
Neg logit contributions
 sky-0.032
 woods-0.030
 A-0.029
 forest-0.029
 valley-0.029
Token
Token ID198
Feature activation-2.89e+00

Pos logit contributions
 supervision+0.252
 wounds+0.236
 permission+0.230
beh+0.221
 attention+0.212
Neg logit contributions
 forest-0.029
 woods-0.027
 A-0.027
 sky-0.027
 top-0.026
Token healthy
Token ID5448
Feature activation-3.36e+00

Pos logit contributions
 supervision+0.212
 wounds+0.197
 permission+0.183
ppa+0.164
olds+0.163
Neg logit contributions
 valley-0.157
 mountain-0.152
 woods-0.151
 forest-0.148
 lake-0.147
Token.
Token ID13
Feature activation-1.55e+00

Pos logit contributions
 supervision+0.150
 wounds+0.143
 permission+0.133
 apologize+0.121
olds+0.120
Neg logit contributions
 valley-0.075
 mountain-0.072
 woods-0.071
 forest-0.069
 lake-0.069
Token And
Token ID843
Feature activation-4.41e+00

Pos logit contributions
 supervision+0.064
 wounds+0.060
 permission+0.056
 apologize+0.051
olds+0.051
Neg logit contributions
 valley-0.041
 mountain-0.039
 woods-0.038
 forest-0.038
 field-0.037
Token we
Token ID356
Feature activation-2.89e+00

Pos logit contributions
 supervision+0.183
 wounds+0.171
 permission+0.160
 apologize+0.143
 apology+0.138
Neg logit contributions
 valley-0.117
 woods-0.115
 forest-0.111
 mountain-0.110
 lake-0.106
Token want
Token ID765
Feature activation-1.89e+00

Pos logit contributions
 supervision+0.123
 wounds+0.116
 permission+0.109
 apologize+0.100
olds+0.097
Neg logit contributions
 valley-0.070
 woods-0.068
 mountain-0.068
 forest-0.065
 lake-0.065
Token Grand
Token ID5675
Feature activation-1.28e+01

Pos logit contributions
 supervision+0.086
 wounds+0.080
 permission+0.074
ppa+0.071
olds+0.069
Neg logit contributions
 valley-0.040
 woods-0.039
 mountain-0.039
 lake-0.039
 forest-0.038
Tokenma
Token ID2611
Feature activation-8.97e+00

Pos logit contributions
 supervision+0.331
 wounds+0.319
 permission+0.250
ppa+0.235
beh+0.224
Neg logit contributions
 forest-0.482
 lake-0.479
 mountain-0.476
 field-0.470
 woods-0.469
Token to
Token ID284
Feature activation-3.87e+00

Pos logit contributions
 supervision+0.397
 wounds+0.374
 permission+0.349
 apologize+0.318
olds+0.316
Neg logit contributions
 valley-0.201
 mountain-0.193
 woods-0.193
 forest-0.188
 field-0.187
Token be
Token ID307
Feature activation-4.70e+00

Pos logit contributions
 supervision+0.146
 wounds+0.136
 permission+0.125
 apologize+0.112
ppa+0.111
Neg logit contributions
 valley-0.113
 woods-0.110
 mountain-0.109
 forest-0.107
 lake-0.106
Token clean
Token ID3424
Feature activation-4.13e+00

Pos logit contributions
 supervision+0.195
 wounds+0.176
 permission+0.163
olds+0.156
 apology+0.151
Neg logit contributions
 valley-0.121
 woods-0.115
 field-0.115
 forest-0.114
 lake-0.111
Token and
Token ID290
Feature activation-5.22e+00

Pos logit contributions
 supervision+0.178
 wounds+0.171
 permission+0.157
 apologize+0.144
ppa+0.143
Neg logit contributions
 valley-0.096
 woods-0.092
 mountain-0.092
 forest-0.089
 lake-0.089
Token and
Token ID290
Feature activation-2.74e+00

Pos logit contributions
 valley+0.020
 mountain+0.019
 lake+0.018
 woods+0.018
 field+0.018
Neg logit contributions
 supervision-0.032
 wounds-0.030
 permission-0.029
olds-0.026
 apologize-0.026
Token said
Token ID531
Feature activation-2.89e+00

Pos logit contributions
 supervision+0.106
 wounds+0.098
 permission+0.092
olds+0.080
 apology+0.080
Neg logit contributions
 valley-0.077
 woods-0.075
 field-0.075
 forest-0.075
 mountain-0.074
Token "
Token ID366
Feature activation-5.02e+00

Pos logit contributions
 supervision+0.154
 wounds+0.148
 permission+0.136
ppa+0.128
olds+0.125
Neg logit contributions
 valley-0.041
 mountain-0.037
 woods-0.036
 forest-0.036
 sky-0.034
Tokenthank
Token ID40716
Feature activation-6.19e+00

Pos logit contributions
 supervision+0.180
 wounds+0.167
 permission+0.154
 apologize+0.136
olds+0.135
Neg logit contributions
 valley-0.157
 mountain-0.152
 woods-0.152
 forest-0.148
 lake-0.148
Token you
Token ID345
Feature activation-6.81e+00

Pos logit contributions
 supervision+0.269
 wounds+0.256
 permission+0.234
ppa+0.213
 apologize+0.212
Neg logit contributions
 valley-0.144
 mountain-0.139
 forest-0.138
 woods-0.138
 lake-0.134
Token mom
Token ID1995
Feature activation-1.27e+01

Pos logit contributions
 supervision+0.329
 wounds+0.312
 permission+0.292
 apologize+0.268
olds+0.267
Neg logit contributions
 valley-0.129
 mountain-0.122
 woods-0.122
 forest-0.118
 lake-0.117
Tokenmy
Token ID1820
Feature activation-8.95e+00

Pos logit contributions
 supervision+0.405
 wounds+0.392
 permission+0.325
olds+0.292
 apology+0.290
Neg logit contributions
 mountain-0.433
 forest-0.431
 sky-0.429
 valley-0.428
 field-0.424
Token"
Token ID1
Feature activation-7.44e+00

Pos logit contributions
 supervision+0.422
 wounds+0.400
 permission+0.373
 apologize+0.342
olds+0.342
Neg logit contributions
 valley-0.176
 mountain-0.169
 woods-0.167
 forest-0.163
 lake-0.161
Token and
Token ID290
Feature activation-3.89e+00

Pos logit contributions
 supervision+0.377
 wounds+0.357
 permission+0.329
 apology+0.302
olds+0.301
Neg logit contributions
 sky-0.123
 woods-0.123
 field-0.118
 valley-0.117
 forest-0.117
Token ate
Token ID15063
Feature activation+8.26e-01

Pos logit contributions
 supervision+0.150
 wounds+0.140
 permission+0.129
olds+0.114
ppa+0.114
Neg logit contributions
 valley-0.111
 mountain-0.108
 woods-0.107
 forest-0.105
 field-0.105
Token her
Token ID607
Feature activation-5.11e+00

Pos logit contributions
 valley+0.022
 woods+0.021
 mountain+0.021
 forest+0.021
 lake+0.020
Neg logit contributions
 supervision-0.034
 wounds-0.032
 permission-0.029
ppa-0.027
beh-0.026
Token said
Token ID531
Feature activation-2.82e+00

Pos logit contributions
 supervision+0.101
 wounds+0.094
 permission+0.087
 apologize+0.080
ppa+0.076
Neg logit contributions
 valley-0.082
 woods-0.078
 mountain-0.078
 forest-0.076
 lake-0.075
Token,
Token ID11
Feature activation-1.00e+00

Pos logit contributions
 supervision+0.136
 wounds+0.128
 permission+0.122
 apologize+0.112
olds+0.111
Neg logit contributions
 valley-0.053
 woods-0.052
 mountain-0.050
 forest-0.049
 lake-0.049
Token "
Token ID366
Feature activation-3.39e+00

Pos logit contributions
 supervision+0.046
 wounds+0.044
 permission+0.041
 apologize+0.038
ppa+0.038
Neg logit contributions
 valley-0.020
 woods-0.020
 mountain-0.019
 forest-0.019
 lake-0.018
TokenThank
Token ID10449
Feature activation-3.90e+00

Pos logit contributions
 supervision+0.137
 wounds+0.128
 permission+0.120
ppa+0.109
 apologize+0.108
Neg logit contributions
 valley-0.089
 mountain-0.086
 woods-0.085
 forest-0.083
 lake-0.082
Token you
Token ID345
Feature activation-6.19e+00

Pos logit contributions
 supervision+0.144
 wounds+0.134
 permission+0.125
 apologize+0.111
olds+0.110
Neg logit contributions
 valley-0.117
 woods-0.113
 mountain-0.112
 forest-0.109
 field-0.109
Token Grand
Token ID5675
Feature activation-1.27e+01

Pos logit contributions
 supervision+0.271
 wounds+0.254
 permission+0.240
 apologize+0.217
ppa+0.217
Neg logit contributions
 valley-0.144
 woods-0.138
 mountain-0.136
 forest-0.133
 lake-0.132
Tokenma
Token ID2611
Feature activation-9.53e+00

Pos logit contributions
 supervision+0.334
 wounds+0.314
 permission+0.259
 apologize+0.220
olds+0.217
Neg logit contributions
 mountain-0.505
 valley-0.500
 forest-0.497
 lake-0.495
 woods-0.490
Token,
Token ID11
Feature activation-8.48e-01

Pos logit contributions
 supervision+0.428
 wounds+0.402
 permission+0.379
ppa+0.344
 apologize+0.344
Neg logit contributions
 valley-0.212
 woods-0.202
 mountain-0.200
 forest-0.195
 lake-0.193
Token for
Token ID329
Feature activation-1.23e+00

Pos logit contributions
 supervision+0.032
 wounds+0.029
 permission+0.028
 apologize+0.025
olds+0.024
Neg logit contributions
 valley-0.025
 woods-0.024
 forest-0.023
 mountain-0.023
 field-0.023
Token the
Token ID262
Feature activation-2.15e+00

Pos logit contributions
 supervision+0.046
 wounds+0.042
 permission+0.039
 apologize+0.035
olds+0.035
Neg logit contributions
 valley-0.037
 mountain-0.036
 woods-0.036
 forest-0.035
 lake-0.035
Token best
Token ID1266
Feature activation-1.75e+00

Pos logit contributions
 supervision+0.091
 wounds+0.087
 permission+0.082
 apologize+0.074
 apology+0.073
Neg logit contributions
 valley-0.049
 mountain-0.048
 woods-0.048
 forest-0.046
 field-0.045
Token "
Token ID366
Feature activation-3.55e+00

Pos logit contributions
 supervision+0.073
 wounds+0.070
 permission+0.065
 apologize+0.060
olds+0.060
Neg logit contributions
 valley-0.022
 mountain-0.021
 woods-0.021
 forest-0.020
 lake-0.020
TokenThank
Token ID10449
Feature activation-3.87e+00

Pos logit contributions
 supervision+0.160
 wounds+0.150
 permission+0.141
olds+0.128
 apologize+0.126
Neg logit contributions
 valley-0.081
 woods-0.079
 forest-0.076
 field-0.076
 lake-0.075
Token you
Token ID345
Feature activation-5.85e+00

Pos logit contributions
 supervision+0.176
 wounds+0.167
 permission+0.153
ppa+0.139
 apologize+0.139
Neg logit contributions
 valley-0.079
 woods-0.076
 forest-0.075
 mountain-0.074
 lake-0.073
Token for
Token ID329
Feature activation-3.54e+00

Pos logit contributions
 supervision+0.244
 wounds+0.230
 permission+0.209
olds+0.191
ppa+0.190
Neg logit contributions
 valley-0.149
 mountain-0.144
 woods-0.143
 forest-0.141
 field-0.140
Token sharing
Token ID7373
Feature activation-2.13e+00

Pos logit contributions
 supervision+0.127
 wounds+0.118
 permission+0.104
olds+0.098
ppa+0.097
Neg logit contributions
 forest-0.106
 woods-0.104
 valley-0.103
 lake-0.103
 field-0.102
Token Mitt
Token ID16627
Feature activation-1.27e+01

Pos logit contributions
 supervision+0.101
 wounds+0.093
 permission+0.089
beh+0.083
ppa+0.082
Neg logit contributions
 lake-0.039
 forest-0.038
 field-0.038
 woods-0.037
 valley-0.035
Tokenens
Token ID641
Feature activation-8.18e+00

Pos logit contributions
 supervision+0.349
 wounds+0.329
 permission+0.287
ppa+0.233
 apologize+0.233
Neg logit contributions
 woods-0.484
 forest-0.478
 valley-0.476
 field-0.467
 mountain-0.464
Token with
Token ID351
Feature activation-3.47e+00

Pos logit contributions
 supervision+0.366
 wounds+0.345
 permission+0.324
 apologize+0.295
ppa+0.294
Neg logit contributions
 valley-0.183
 mountain-0.174
 woods-0.174
 forest-0.167
 lake-0.167
Token me
Token ID502
Feature activation-7.37e+00

Pos logit contributions
 supervision+0.149
 wounds+0.141
 permission+0.129
ppa+0.117
olds+0.117
Neg logit contributions
 valley-0.082
 woods-0.081
 forest-0.080
 mountain-0.079
 lake-0.078
Token.
Token ID13
Feature activation-1.34e-01

Pos logit contributions
 supervision+0.320
 wounds+0.300
 permission+0.283
 apologize+0.255
ppa+0.255
Neg logit contributions
 valley-0.178
 woods-0.169
 mountain-0.168
 lake-0.162
 forest-0.162
Token I
Token ID314
Feature activation-4.27e-01

Pos logit contributions
 supervision+0.006
 wounds+0.005
 permission+0.005
 apologize+0.005
olds+0.005
Neg logit contributions
 valley-0.003
 mountain-0.003
 woods-0.003
 field-0.003
 forest-0.003
Token "
Token ID366
Feature activation-4.78e+00

Pos logit contributions
 supervision+0.107
 wounds+0.103
 permission+0.093
olds+0.088
 apologize+0.087
Neg logit contributions
 valley-0.059
 mountain-0.055
 woods-0.053
 lake-0.053
 forest-0.053
TokenAnd
Token ID1870
Feature activation-5.30e+00

Pos logit contributions
 supervision+0.186
 wounds+0.178
 permission+0.162
ppa+0.147
 apologize+0.147
Neg logit contributions
 valley-0.131
 mountain-0.126
 woods-0.125
 forest-0.123
 lake-0.121
Token we
Token ID356
Feature activation-3.47e+00

Pos logit contributions
 supervision+0.221
 wounds+0.204
 permission+0.194
 apologize+0.179
 apology+0.170
Neg logit contributions
 valley-0.143
 woods-0.139
 forest-0.134
 mountain-0.133
 field-0.130
Token will
Token ID481
Feature activation-2.92e+00

Pos logit contributions
 supervision+0.153
 wounds+0.145
 permission+0.137
 apologize+0.125
ppa+0.123
Neg logit contributions
 valley-0.079
 mountain-0.076
 woods-0.076
 forest-0.073
 lake-0.072
Token call
Token ID869
Feature activation-6.62e+00

Pos logit contributions
 supervision+0.107
 wounds+0.099
 permission+0.093
 apologize+0.085
ppa+0.081
Neg logit contributions
 valley-0.089
 woods-0.087
 mountain-0.087
 forest-0.084
 field-0.083
Token Grand
Token ID5675
Feature activation-1.27e+01

Pos logit contributions
 supervision+0.271
 wounds+0.252
 permission+0.230
ppa+0.218
olds+0.212
Neg logit contributions
 valley-0.169
 mountain-0.163
 woods-0.162
 lake-0.159
 forest-0.158
Tokenma
Token ID2611
Feature activation-8.41e+00

Pos logit contributions
 supervision+0.352
 wounds+0.327
 permission+0.270
 apologize+0.237
ppa+0.235
Neg logit contributions
 lake-0.472
 forest-0.472
 valley-0.471
 mountain-0.470
 field-0.458
Token and
Token ID290
Feature activation-4.90e+00

Pos logit contributions
 supervision+0.350
 wounds+0.326
 permission+0.311
ppa+0.275
 apologize+0.274
Neg logit contributions
 valley-0.218
 woods-0.208
 mountain-0.207
 lake-0.203
 forest-0.201
Token say
Token ID910
Feature activation-6.38e+00

Pos logit contributions
 supervision+0.179
 wounds+0.165
 permission+0.153
 apologize+0.135
olds+0.133
Neg logit contributions
 valley-0.152
 woods-0.147
 mountain-0.147
 forest-0.144
 lake-0.143
Token thank
Token ID5875
Feature activation-9.39e+00

Pos logit contributions
 supervision+0.214
 wounds+0.196
 permission+0.183
 apologize+0.160
olds+0.158
Neg logit contributions
 valley-0.217
 woods-0.214
 mountain-0.210
 forest-0.208
 lake-0.207
Token you
Token ID345
Feature activation-7.17e+00

Pos logit contributions
 supervision+0.446
 wounds+0.415
 permission+0.379
 apology+0.359
olds+0.359
Neg logit contributions
 valley-0.163
 mountain-0.156
 forest-0.153
 field-0.151
 lake-0.148
Token get
Token ID651
Feature activation-8.82e-02

Pos logit contributions
 supervision+0.123
 wounds+0.116
 permission+0.110
 apologize+0.098
 apology+0.091
Neg logit contributions
 valley-0.108
 woods-0.104
 lake-0.100
 field-0.100
 mountain-0.100
Token to
Token ID284
Feature activation-3.06e+00

Pos logit contributions
 supervision+0.004
 wounds+0.004
 permission+0.003
ppa+0.003
olds+0.003
Neg logit contributions
 valley-0.002
 woods-0.002
 mountain-0.002
 forest-0.002
 field-0.002
Token the
Token ID262
Feature activation-8.53e-01

Pos logit contributions
 supervision+0.118
 wounds+0.113
 permission+0.107
 apologize+0.097
 attention+0.088
Neg logit contributions
 valley-0.082
 woods-0.078
 lake-0.076
 mountain-0.076
 sky-0.076
Token top
Token ID1353
Feature activation+3.42e+00

Pos logit contributions
 permission+0.036
 wounds+0.035
 supervision+0.034
 attention+0.031
 apologize+0.029
Neg logit contributions
�-0.014
 On-0.013
 Under-0.013
 Inside-0.012
und-0.012
Token.
Token ID13
Feature activation+1.70e+00

Pos logit contributions
 valley+0.093
 woods+0.090
 lake+0.090
 forest+0.085
 tiger+0.083
Neg logit contributions
 supervision-0.121
 wounds-0.104
 permission-0.100
 attention-0.092
and-0.092
Token As
Token ID1081
Feature activation+5.89e+00

Pos logit contributions
 valley+0.038
 mountain+0.036
 lake+0.034
 woods+0.032
 forest+0.032
Neg logit contributions
 supervision-0.075
 wounds-0.070
 permission-0.067
olds-0.063
 apologize-0.062
Token she
Token ID673
Feature activation+2.97e+00

Pos logit contributions
 lake+0.156
 valley+0.152
 mountain+0.146
 woods+0.144
 forest+0.142
Neg logit contributions
 supervision-0.216
 wounds-0.198
 permission-0.195
 attention-0.176
 apologize-0.174
Token got
Token ID1392
Feature activation+3.14e+00

Pos logit contributions
 valley+0.093
 planet+0.089
 lake+0.086
 port+0.085
 woods+0.085
Neg logit contributions
 wounds-0.092
 supervision-0.087
 permission-0.080
wed-0.077
ppa-0.073
Token to
Token ID284
Feature activation-6.72e-01

Pos logit contributions
 valley+0.085
 woods+0.082
 mountain+0.081
 forest+0.080
 lake+0.079
Neg logit contributions
 supervision-0.128
 wounds-0.119
 permission-0.110
ppa-0.099
 apologize-0.099
Token the
Token ID262
Feature activation+1.73e-01

Pos logit contributions
 supervision+0.027
 wounds+0.025
 permission+0.024
 apologize+0.021
 attention+0.020
Neg logit contributions
 valley-0.018
 woods-0.017
 sky-0.017
 forest-0.016
 mountain-0.016
Token top
Token ID1353
Feature activation+4.48e+00

Pos logit contributions
�+0.003
und+0.002
 valley+0.002
 On+0.002
 Inside+0.002
Neg logit contributions
 wounds-0.007
 permission-0.007
 supervision-0.007
 attention-0.006
 apologize-0.006
Token.
Token ID13
Feature activation+2.88e-02

Pos logit contributions
 supervision+0.018
 wounds+0.017
 permission+0.016
 apologize+0.014
olds+0.014
Neg logit contributions
 valley-0.010
 mountain-0.009
 lake-0.009
 woods-0.009
 field-0.009
Token<|endoftext|>
Token ID50256
Feature activation+2.36e+00

Pos logit contributions
 woods+0.000
 forest+0.000
 sky+0.000
 valley+0.000
 A+0.000
Neg logit contributions
 supervision-0.002
 wounds-0.002
 permission-0.001
ppa-0.001
beh-0.001
TokenOnce
Token ID7454
Feature activation+2.66e+00

Pos logit contributions
 valley+0.058
 mountain+0.057
 woods+0.053
 lake+0.052
 field+0.052
Neg logit contributions
 supervision-0.094
 wounds-0.089
 permission-0.083
olds-0.077
 apologize-0.077
Token upon
Token ID2402
Feature activation+6.83e+00

Pos logit contributions
 valley+0.047
 woods+0.045
 mountain+0.044
 forest+0.044
 sky+0.043
Neg logit contributions
 supervision-0.132
 wounds-0.128
 permission-0.117
beh-0.111
ppa-0.107
Token a
Token ID257
Feature activation+7.84e+00

Pos logit contributions
 valley+0.121
 mountain+0.116
 woods+0.110
 lake+0.107
 forest+0.106
Neg logit contributions
 supervision-0.329
 wounds-0.319
 permission-0.300
 apologize-0.282
ppa-0.274
Token time
Token ID640
Feature activation+5.37e+00

Pos logit contributions
 valley+0.094
 woods+0.088
 field+0.087
 forest+0.086
 mountain+0.085
Neg logit contributions
 supervision-0.434
 wounds-0.414
 permission-0.393
olds-0.361
ppa-0.360
Token there
Token ID612
Feature activation+8.00e+00

Pos logit contributions
 valley+0.079
 mountain+0.074
 woods+0.073
 forest+0.070
 lake+0.069
Neg logit contributions
 supervision-0.281
 wounds-0.267
 permission-0.252
 apologize-0.233
ppa-0.233
Token was
Token ID373
Feature activation+6.60e+00

Pos logit contributions
 valley+0.169
 mountain+0.159
 woods+0.151
 field+0.150
 lake+0.145
Neg logit contributions
 supervision-0.343
 wounds-0.321
 permission-0.299
 apologize-0.289
olds-0.283
Token a
Token ID257
Feature activation+7.93e+00

Pos logit contributions
 mountain+0.139
 valley+0.132
 lake+0.121
 woods+0.120
 hill+0.120
Neg logit contributions
 supervision-0.288
 wounds-0.275
 permission-0.267
 apologize-0.245
ppa-0.237
Token big
Token ID1263
Feature activation+5.88e+00

Pos logit contributions
 valley+0.153
 woods+0.153
 sky+0.142
 mountain+0.142
 forest+0.139
Neg logit contributions
 wounds-0.320
 supervision-0.316
ppa-0.288
 permission-0.284
 apologize-0.278
Token,
Token ID11
Feature activation+5.14e+00

Pos logit contributions
case+0.112
 woods+0.112
 valley+0.108
 Under+0.108
 clearing+0.107
Neg logit contributions
 supervision-0.229
 wounds-0.221
 permission-0.206
 apologize-0.197
 apology-0.190
Token felt
Token ID2936
Feature activation+1.88e+00

Pos logit contributions
 valley+0.088
 mountain+0.083
 woods+0.083
 lake+0.080
 forest+0.080
Neg logit contributions
 supervision-0.158
 wounds-0.151
 permission-0.139
 apologize-0.127
ppa-0.126
Token sad
Token ID6507
Feature activation-7.06e-01

Pos logit contributions
 valley+0.046
 woods+0.045
 mountain+0.043
 forest+0.043
 lake+0.043
Neg logit contributions
 supervision-0.081
 wounds-0.075
 permission-0.070
ppa-0.065
olds-0.063
Token.
Token ID13
Feature activation+2.93e-01

Pos logit contributions
 supervision+0.027
 wounds+0.026
 permission+0.024
 apologize+0.021
 attention+0.021
Neg logit contributions
 valley-0.020
 lake-0.019
 mountain-0.019
 field-0.019
 woods-0.018
Token
Token ID198
Feature activation-2.22e+00

Pos logit contributions
 valley+0.005
 woods+0.005
 mountain+0.005
 forest+0.005
 lake+0.005
Neg logit contributions
 supervision-0.014
 wounds-0.014
 permission-0.013
ppa-0.012
 apologize-0.012
Token
Token ID198
Feature activation-1.05e+00

Pos logit contributions
 supervision+0.123
 wounds+0.114
 permission+0.110
ppa+0.100
beh+0.099
Neg logit contributions
 woods-0.029
 valley-0.028
 forest-0.028
 mountain-0.027
 lake-0.026
TokenA
Token ID32
Feature activation+5.39e+00

Pos logit contributions
 supervision+0.043
 wounds+0.041
 permission+0.038
 apologize+0.034
olds+0.034
Neg logit contributions
 valley-0.027
 mountain-0.026
 woods-0.026
 forest-0.025
 lake-0.025
Token big
Token ID1263
Feature activation+3.79e+00

Pos logit contributions
 valley+0.133
 woods+0.124
 mountain+0.124
 lake+0.121
 field+0.120
Neg logit contributions
 supervision-0.222
 wounds-0.209
 permission-0.196
 apologize-0.183
ppa-0.179
Token,
Token ID11
Feature activation+3.43e+00

Pos logit contributions
 valley+0.079
 woods+0.076
 mountain+0.073
 forest+0.070
 field+0.070
Neg logit contributions
 supervision-0.169
 wounds-0.159
 permission-0.149
 apologize-0.140
 apology-0.139
Token patient
Token ID5827
Feature activation+1.73e+00

Pos logit contributions
 valley+0.085
 mountain+0.081
 woods+0.079
 lake+0.079
 forest+0.078
Neg logit contributions
 supervision-0.142
 wounds-0.136
 permission-0.125
 apologize-0.115
 apology-0.112
Token truck
Token ID7779
Feature activation+8.97e-01

Pos logit contributions
 valley+0.042
 mountain+0.039
 woods+0.039
 lake+0.037
 forest+0.037
Neg logit contributions
 supervision-0.073
 wounds-0.067
 permission-0.065
 apologize-0.057
ppa-0.057
Token named
Token ID3706
Feature activation-3.14e+00

Pos logit contributions
 valley+0.027
 mountain+0.025
 field+0.024
 woods+0.024
 lake+0.024
Neg logit contributions
 supervision-0.033
 wounds-0.030
 permission-0.029
 apologize-0.027
olded-0.027
Tokenkins
Token ID5331
Feature activation-1.74e+00

Pos logit contributions
 supervision+0.051
 wounds+0.046
 permission+0.041
 apologize+0.035
olds+0.035
Neg logit contributions
 valley-0.070
 mountain-0.068
 woods-0.067
 forest-0.067
 field-0.066
Token safe
Token ID3338
Feature activation-1.24e+00

Pos logit contributions
 supervision+0.078
 wounds+0.073
 permission+0.069
 apologize+0.063
ppa+0.062
Neg logit contributions
 valley-0.039
 woods-0.037
 mountain-0.037
 forest-0.036
 lake-0.036
Token from
Token ID422
Feature activation-2.56e+00

Pos logit contributions
 supervision+0.054
 wounds+0.051
 permission+0.048
 apologize+0.043
ppa+0.042
Neg logit contributions
 valley-0.028
 mountain-0.027
 woods-0.026
 lake-0.026
 forest-0.026
Token bugs
Token ID11316
Feature activation-3.18e-01

Pos logit contributions
 supervision+0.107
 wounds+0.101
 permission+0.094
 apologize+0.084
ppa+0.082
Neg logit contributions
 valley-0.064
 mountain-0.062
 woods-0.061
 lake-0.060
 forest-0.059
Token.
Token ID13
Feature activation+2.17e+00

Pos logit contributions
 supervision+0.014
 wounds+0.012
 permission+0.012
ppa+0.011
 apologize+0.010
Neg logit contributions
 valley-0.007
 woods-0.007
 mountain-0.007
 lake-0.007
 forest-0.007
Token One
Token ID1881
Feature activation+6.01e+00

Pos logit contributions
 valley+0.050
 mountain+0.047
 woods+0.046
 field+0.045
 lake+0.045
Neg logit contributions
 supervision-0.091
 wounds-0.085
 permission-0.079
olds-0.075
 apologize-0.074
Token day
Token ID1110
Feature activation+8.12e+00

Pos logit contributions
 valley+0.141
 woods+0.127
 mountain+0.126
 lake+0.125
 forest+0.119
Neg logit contributions
 supervision-0.231
 permission-0.217
 wounds-0.213
 apologize-0.191
 attention-0.191
Token,
Token ID11
Feature activation+4.37e+00

Pos logit contributions
 valley+0.217
 lake+0.203
 woods+0.201
 field+0.200
 mountain+0.198
Neg logit contributions
 supervision-0.294
 permission-0.275
 wounds-0.267
 apologize-0.242
olds-0.238
Token he
Token ID339
Feature activation+5.10e+00

Pos logit contributions
 valley+0.132
 woods+0.123
case+0.121
 forest+0.120
 lake+0.120
Neg logit contributions
 supervision-0.144
 wounds-0.139
 permission-0.134
 apologize-0.119
 attention-0.111
Token decided
Token ID3066
Feature activation+3.56e+00

Pos logit contributions
 valley+0.124
 mountain+0.123
 woods+0.117
 lake+0.115
 forest+0.112
Neg logit contributions
 supervision-0.206
 wounds-0.199
 permission-0.184
 apologize-0.172
ppa-0.169
Token to
Token ID284
Feature activation+2.72e-01

Pos logit contributions
 valley+0.100
 woods+0.093
 mountain+0.093
 lake+0.092
 forest+0.091
Neg logit contributions
 supervision-0.140
 wounds-0.132
 permission-0.125
 apologize-0.113
olds-0.108
Token Tom
Token ID4186
Feature activation-2.00e+00

Pos logit contributions
 supervision+0.034
 wounds+0.031
 permission+0.029
 apologize+0.027
olds+0.026
Neg logit contributions
 valley-0.027
 woods-0.025
 mountain-0.025
 field-0.025
 forest-0.025
Token,
Token ID11
Feature activation+2.15e+00

Pos logit contributions
 supervision+0.097
 wounds+0.093
 permission+0.087
olds+0.079
 apologize+0.079
Neg logit contributions
 valley-0.036
 woods-0.036
 mountain-0.035
 forest-0.034
 lake-0.033
Token who
Token ID508
Feature activation+4.75e+00

Pos logit contributions
 valley+0.062
 mountain+0.059
 woods+0.058
 lake+0.057
 forest+0.056
Neg logit contributions
 supervision-0.080
 wounds-0.075
 permission-0.070
 apologize-0.066
ppa-0.064
Token lived
Token ID5615
Feature activation+5.16e+00

Pos logit contributions
 valley+0.117
 mountain+0.112
 woods+0.107
 lake+0.106
 field+0.106
Neg logit contributions
 supervision-0.199
 wounds-0.189
 permission-0.174
 apologize-0.167
olds-0.161
Token in
Token ID287
Feature activation+4.56e+00

Pos logit contributions
 valley+0.092
 mountain+0.088
 woods+0.085
 lake+0.082
 forest+0.082
Neg logit contributions
 supervision-0.251
 wounds-0.242
 permission-0.228
 apologize-0.210
ppa-0.209
Token a
Token ID257
Feature activation+6.42e+00

Pos logit contributions
 valley+0.093
 mountain+0.089
 woods+0.082
 field+0.081
 forest+0.080
Neg logit contributions
 supervision-0.209
 wounds-0.200
 permission-0.188
 apologize-0.176
olds-0.167
Token big
Token ID1263
Feature activation+5.35e+00

Pos logit contributions
 mountain+0.125
 valley+0.118
 Under+0.117
 sky+0.116
case+0.111
Neg logit contributions
 wounds-0.247
 supervision-0.238
 apologize-0.216
ppa-0.202
 permission-0.200
Token house
Token ID2156
Feature activation+6.18e+00

Pos logit contributions
 Under+0.088
case+0.087
 valley+0.085
Under+0.084
 mountain+0.083
Neg logit contributions
 supervision-0.219
 wounds-0.208
 permission-0.193
 apologize-0.188
olds-0.170
Token with
Token ID351
Feature activation+2.67e+00

Pos logit contributions
 valley+0.182
 mountain+0.168
 woods+0.167
 lake+0.166
 forest+0.158
Neg logit contributions
 supervision-0.219
 wounds-0.206
 permission-0.192
olds-0.181
 apologize-0.178
Token his
Token ID465
Feature activation+1.43e-01

Pos logit contributions
 valley+0.077
 mountain+0.075
 woods+0.074
 lake+0.072
 forest+0.072
Neg logit contributions
 supervision-0.102
 wounds-0.095
 permission-0.089
 apologize-0.079
olds-0.079
Token family
Token ID1641
Feature activation-1.66e+00

Pos logit contributions
 valley+0.005
 mountain+0.005
 clearing+0.005
 lake+0.005
 woods+0.005
Neg logit contributions
 supervision-0.004
 wounds-0.004
 permission-0.003
 apologize-0.003
 apology-0.003
Token garden
Token ID11376
Feature activation+1.09e+01

Pos logit contributions
 Under+0.103
�+0.101
case+0.098
 Inside+0.097
 mountain+0.093
Neg logit contributions
 wounds-0.230
 supervision-0.228
 permission-0.217
ppa-0.196
and-0.195
Token,
Token ID11
Feature activation+4.22e+00

Pos logit contributions
 valley+0.277
 mountain+0.270
 tiger+0.256
 lake+0.252
 woods+0.251
Neg logit contributions
 supervision-0.374
 wounds-0.343
olds-0.326
 permission-0.319
 apologize-0.312
Token there
Token ID612
Feature activation+8.85e+00

Pos logit contributions
 mountain+0.094
 valley+0.090
 lake+0.089
 woods+0.087
 forest+0.085
Neg logit contributions
 wounds-0.160
 supervision-0.156
 permission-0.142
 apologize-0.141
ppa-0.133
Token lived
Token ID5615
Feature activation+7.39e+00

Pos logit contributions
 valley+0.182
 mountain+0.180
 lake+0.160
 woods+0.159
 field+0.159
Neg logit contributions
 supervision-0.362
 wounds-0.342
 permission-0.323
 apologize-0.314
olds-0.304
Token a
Token ID257
Feature activation+7.39e+00

Pos logit contributions
 mountain+0.143
 valley+0.138
 lake+0.125
 woods+0.124
 hill+0.120
Neg logit contributions
 supervision-0.334
 wounds-0.329
 permission-0.318
 apologize-0.289
olds-0.286
Token little
Token ID1310
Feature activation+5.23e+00

Pos logit contributions
case+0.157
 woods+0.153
 sky+0.150
 mountain+0.148
 valley+0.148
Neg logit contributions
 wounds-0.258
 supervision-0.237
ppa-0.232
 apologize-0.222
anut-0.222
Token ant
Token ID1885
Feature activation+2.76e+00

Pos logit contributions
case+0.149
 On+0.136
 Under+0.134
 Inside+0.129
 top+0.128
Neg logit contributions
 wounds-0.152
 supervision-0.151
 permission-0.136
ppa-0.130
 apology-0.124
Token named
Token ID3706
Feature activation-1.55e+00

Pos logit contributions
 valley+0.082
 mountain+0.079
 lake+0.077
 woods+0.076
 sky+0.074
Neg logit contributions
 supervision-0.089
 wounds-0.088
 permission-0.075
olds-0.072
ppa-0.071
Token Andy
Token ID12382
Feature activation-9.08e-01

Pos logit contributions
 supervision+0.053
 permission+0.047
 wounds+0.047
 apologize+0.040
ppa+0.040
Neg logit contributions
 valley-0.050
 woods-0.046
 mountain-0.046
 forest-0.044
 lake-0.044
Token.
Token ID13
Feature activation+4.01e+00

Pos logit contributions
 supervision+0.036
 wounds+0.034
 permission+0.032
ppa+0.030
 apologize+0.029
Neg logit contributions
 valley-0.024
 mountain-0.023
 woods-0.022
 sky-0.022
 lake-0.021
Token Andy
Token ID12382
Feature activation-9.24e-02

Pos logit contributions
 valley+0.132
 field+0.123
 woods+0.120
 festival+0.119
 lake+0.117
Neg logit contributions
 supervision-0.108
 wounds-0.098
olds-0.088
 permission-0.086
 apologize-0.085
Token he
Token ID339
Feature activation+3.64e+00

Pos logit contributions
 valley+0.025
 mountain+0.023
 lake+0.023
 field+0.023
 woods+0.023
Neg logit contributions
 supervision-0.048
 wounds-0.046
 permission-0.043
 apologize-0.040
 attention-0.037
Token realised
Token ID19169
Feature activation+1.99e+00

Pos logit contributions
 valley+0.093
 mountain+0.090
 woods+0.088
 lake+0.086
 forest+0.085
Neg logit contributions
 supervision-0.147
 wounds-0.141
 permission-0.128
ppa-0.118
 apologize-0.117
Token he
Token ID339
Feature activation+1.12e+00

Pos logit contributions
 valley+0.045
 mountain+0.042
 woods+0.042
 field+0.041
 sky+0.040
Neg logit contributions
 supervision-0.084
 wounds-0.081
 permission-0.076
 apologize-0.068
ppa-0.067
Token had
Token ID550
Feature activation+3.05e+00

Pos logit contributions
 valley+0.028
 mountain+0.025
 lake+0.024
 field+0.024
 woods+0.024
Neg logit contributions
 supervision-0.046
 wounds-0.045
 permission-0.042
 apologize-0.039
ppa-0.038
Token left
Token ID1364
Feature activation+8.43e-01

Pos logit contributions
 valley+0.087
 mountain+0.084
 woods+0.083
 lake+0.081
 forest+0.081
Neg logit contributions
 supervision-0.118
 wounds-0.111
 permission-0.103
 apologize-0.091
ppa-0.091
Token something
Token ID1223
Feature activation+3.78e+00

Pos logit contributions
 valley+0.021
 mountain+0.020
 woods+0.019
 forest+0.019
 lake+0.019
Neg logit contributions
 supervision-0.036
 wounds-0.034
 permission-0.032
 apologize-0.028
olds-0.028
Token behind
Token ID2157
Feature activation+1.30e+00

Pos logit contributions
 valley+0.089
 mountain+0.084
 woods+0.084
 forest+0.082
 lake+0.081
Neg logit contributions
 supervision-0.162
 wounds-0.154
 permission-0.144
 apologize-0.131
olds-0.128
Token.
Token ID13
Feature activation+1.33e+00

Pos logit contributions
 valley+0.041
 mountain+0.039
 woods+0.037
 lake+0.037
 field+0.036
Neg logit contributions
 supervision-0.044
 wounds-0.041
 permission-0.039
 apologize-0.034
olds-0.033
Token His
Token ID2399
Feature activation+1.14e+00

Pos logit contributions
 valley+0.031
 mountain+0.029
 field+0.028
 lake+0.026
 forest+0.026
Neg logit contributions
 supervision-0.057
 wounds-0.052
 permission-0.049
olds-0.048
 apologize-0.047
Token favourite
Token ID12507
Feature activation+2.26e+00

Pos logit contributions
 valley+0.039
 mountain+0.038
 woods+0.037
 forest+0.036
 lake+0.036
Neg logit contributions
 supervision-0.038
 wounds-0.036
 permission-0.032
 apologize-0.027
 apology-0.027
Token red
Token ID2266
Feature activation+1.78e+00

Pos logit contributions
 valley+0.044
 mountain+0.039
 woods+0.038
 field+0.036
 lake+0.036
Neg logit contributions
 supervision-0.105
 wounds-0.100
 permission-0.097
 apology-0.087
 apologize-0.085
Token go
Token ID467
Feature activation-3.50e+00

Pos logit contributions
 supervision+0.093
 wounds+0.087
 permission+0.080
 apologize+0.071
ppa+0.070
Neg logit contributions
 valley-0.082
 woods-0.080
 mountain-0.079
 forest-0.078
 lake-0.077
Token back
Token ID736
Feature activation-2.91e+00

Pos logit contributions
 supervision+0.120
 wounds+0.113
 permission+0.104
 apologize+0.093
olds+0.090
Neg logit contributions
 valley-0.113
 mountain-0.110
 woods-0.109
 forest-0.107
 lake-0.107
Token and
Token ID290
Feature activation-2.82e+00

Pos logit contributions
 supervision+0.104
 wounds+0.099
 permission+0.094
 apologize+0.082
 attention+0.080
Neg logit contributions
 valley-0.086
 mountain-0.084
 lake-0.081
 sky-0.079
 woods-0.079
Token see
Token ID766
Feature activation+4.24e-01

Pos logit contributions
 supervision+0.111
 wounds+0.104
 permission+0.096
 apologize+0.088
olds+0.085
Neg logit contributions
 valley-0.078
 mountain-0.075
 woods-0.075
 forest-0.073
 lake-0.073
Token what
Token ID644
Feature activation+2.14e+00

Pos logit contributions
 valley+0.010
 woods+0.010
 mountain+0.010
 forest+0.010
 lake+0.010
Neg logit contributions
 supervision-0.018
 wounds-0.017
 permission-0.016
ppa-0.014
olds-0.014
Token was
Token ID373
Feature activation+9.89e-01

Pos logit contributions
 valley+0.042
 mountain+0.039
 woods+0.038
 lake+0.038
 forest+0.037
Neg logit contributions
 supervision-0.101
 wounds-0.096
 permission-0.090
 apologize-0.084
olds-0.082
Token happening
Token ID5836
Feature activation+2.26e+00

Pos logit contributions
 valley+0.031
 mountain+0.030
 woods+0.029
 lake+0.029
 field+0.029
Neg logit contributions
 supervision-0.034
 wounds-0.032
 permission-0.030
 apologize-0.026
 attention-0.026
Token.
Token ID13
Feature activation+8.91e-01

Pos logit contributions
 valley+0.058
 mountain+0.055
 woods+0.054
 lake+0.053
 forest+0.053
Neg logit contributions
 supervision-0.092
 wounds-0.085
 permission-0.083
 apologize-0.073
 attention-0.072
Token When
Token ID1649
Feature activation+5.02e+00

Pos logit contributions
 valley+0.020
 mountain+0.019
 forest+0.017
 lake+0.017
 woods+0.017
Neg logit contributions
 supervision-0.040
 wounds-0.037
 permission-0.035
olds-0.033
 apologize-0.033
Token they
Token ID484
Feature activation+2.30e+00

Pos logit contributions
 valley+0.139
 mountain+0.134
 forest+0.133
 lake+0.132
case+0.132
Neg logit contributions
 supervision-0.166
 wounds-0.166
 permission-0.154
 apologize-0.143
 attention-0.132
Token reached
Token ID4251
Feature activation+2.16e-01

Pos logit contributions
 valley+0.052
 mountain+0.050
 woods+0.050
 forest+0.048
 lake+0.048
Neg logit contributions
 supervision-0.100
 wounds-0.093
 permission-0.090
 apologize-0.082
ppa-0.081
Token.
Token ID13
Feature activation+2.41e+00

Pos logit contributions
 valley+0.054
 mountain+0.050
 woods+0.049
 lake+0.049
 clearing+0.048
Neg logit contributions
 supervision-0.071
 wounds-0.064
 permission-0.061
 attention-0.054
 apologize-0.052
Token The
Token ID383
Feature activation+1.95e+00

Pos logit contributions
 valley+0.056
 mountain+0.056
 woods+0.052
 lake+0.051
 forest+0.051
Neg logit contributions
 supervision-0.103
 wounds-0.095
 permission-0.090
olds-0.084
 apologize-0.084
Token big
Token ID1263
Feature activation+2.57e+00

Pos logit contributions
 valley+0.039
 mountain+0.039
 woods+0.038
 field+0.036
 lake+0.036
Neg logit contributions
 supervision-0.086
 wounds-0.084
 permission-0.078
 apologize-0.071
 apology-0.070
Token tree
Token ID5509
Feature activation+3.63e+00

Pos logit contributions
 valley+0.053
 mountain+0.052
 woods+0.052
 lake+0.049
 field+0.049
Neg logit contributions
 supervision-0.112
 wounds-0.107
 permission-0.100
 apologize-0.091
 apology-0.089
Token fell
Token ID3214
Feature activation+8.19e-01

Pos logit contributions
 valley+0.076
 mountain+0.072
 woods+0.071
 lake+0.070
 forest+0.067
Neg logit contributions
 supervision-0.161
 wounds-0.155
 permission-0.142
 apologize-0.133
olds-0.130
Token down
Token ID866
Feature activation+4.28e+00

Pos logit contributions
 valley+0.026
 mountain+0.025
 lake+0.025
 woods+0.025
 forest+0.025
Neg logit contributions
 supervision-0.025
 wounds-0.024
 permission-0.022
 attention-0.020
 apologize-0.020
Token.
Token ID13
Feature activation+2.54e+00

Pos logit contributions
 valley+0.114
 mountain+0.110
 lake+0.109
 woods+0.107
 forest+0.106
Neg logit contributions
 wounds-0.148
 supervision-0.147
 permission-0.141
 apologize-0.128
 attention-0.121
Token The
Token ID383
Feature activation+2.09e+00

Pos logit contributions
 valley+0.062
 mountain+0.061
 woods+0.057
 forest+0.056
 field+0.056
Neg logit contributions
 supervision-0.104
 wounds-0.096
 permission-0.090
olds-0.086
 apologize-0.085
Token raven
Token ID37735
Feature activation-3.91e-01

Pos logit contributions
 valley+0.040
 mountain+0.039
 woods+0.039
 field+0.037
 lake+0.037
Neg logit contributions
 supervision-0.091
 wounds-0.090
 permission-0.083
 apologize-0.074
 apology-0.074
Token was
Token ID373
Feature activation+1.86e+00

Pos logit contributions
 supervision+0.016
 wounds+0.016
 permission+0.014
ppa+0.013
 apologize+0.013
Neg logit contributions
 valley-0.009
 woods-0.009
 mountain-0.009
 lake-0.009
 forest-0.008
Token still
Token ID991
Feature activation-1.14e-01

Pos logit contributions
 valley+0.054
 woods+0.052
 mountain+0.051
 forest+0.051
 field+0.051
Neg logit contributions
 supervision-0.072
 wounds-0.066
 permission-0.061
olds-0.055
ppa-0.055
Token until
Token ID1566
Feature activation+2.43e+00

Pos logit contributions
 valley+0.003
 woods+0.003
 mountain+0.003
 forest+0.003
 lake+0.003
Neg logit contributions
 supervision-0.006
 wounds-0.005
 permission-0.005
 apologize-0.005
ppa-0.005
Token all
Token ID477
Feature activation+1.40e+00

Pos logit contributions
 valley+0.058
 mountain+0.056
 woods+0.055
 forest+0.054
 lake+0.053
Neg logit contributions
 supervision-0.105
 wounds-0.098
 permission-0.094
 apologize-0.084
olds-0.083
Token the
Token ID262
Feature activation+3.98e-01

Pos logit contributions
 valley+0.035
 mountain+0.034
 woods+0.033
 forest+0.033
 lake+0.032
Neg logit contributions
 supervision-0.058
 wounds-0.054
 permission-0.051
 apologize-0.047
olds-0.046
Token dirt
Token ID13647
Feature activation+5.12e+00

Pos logit contributions
 valley+0.011
 woods+0.010
 mountain+0.010
 forest+0.010
 field+0.010
Neg logit contributions
 supervision-0.016
 wounds-0.015
 permission-0.014
 apologize-0.012
olds-0.012
Token was
Token ID373
Feature activation+2.03e+00

Pos logit contributions
 valley+0.114
 woods+0.109
 mountain+0.109
 forest+0.108
 lake+0.105
Neg logit contributions
 supervision-0.223
 wounds-0.211
 permission-0.199
 apologize-0.179
olds-0.177
Token gone
Token ID3750
Feature activation+2.38e+00

Pos logit contributions
 valley+0.056
 mountain+0.054
 woods+0.054
 forest+0.053
 lake+0.052
Neg logit contributions
 supervision-0.079
 wounds-0.075
 permission-0.070
 apologize-0.062
ppa-0.062
Token.
Token ID13
Feature activation+1.26e+00

Pos logit contributions
 valley+0.063
 mountain+0.061
 woods+0.060
 lake+0.060
 forest+0.059
Neg logit contributions
 supervision-0.092
 wounds-0.085
 permission-0.082
 apologize-0.073
 attention-0.072
Token
Token ID198
Feature activation-1.75e+00

Pos logit contributions
 valley+0.028
 mountain+0.027
 woods+0.026
 forest+0.026
 lake+0.025
Neg logit contributions
 supervision-0.056
 wounds-0.053
 permission-0.050
olds-0.046
 apologize-0.046
Token
Token ID198
Feature activation-7.60e-01

Pos logit contributions
 supervision+0.094
 wounds+0.089
 permission+0.084
ppa+0.076
beh+0.076
Neg logit contributions
 valley-0.025
 woods-0.024
 lake-0.022
 forest-0.022
 field-0.022
TokenWhen
Token ID2215
Feature activation+6.12e+00

Pos logit contributions
 supervision+0.032
 wounds+0.030
 permission+0.028
 apologize+0.025
ppa+0.025
Neg logit contributions
 valley-0.019
 woods-0.018
 mountain-0.018
 forest-0.018
 lake-0.018
Token the
Token ID262
Feature activation+2.09e+00

Pos logit contributions
 valley+0.198
 mountain+0.198
 woods+0.194
 forest+0.193
 lake+0.189
Neg logit contributions
 supervision-0.196
 wounds-0.179
 permission-0.175
 apologize-0.155
olds-0.148
Token that
Token ID326
Feature activation-6.37e-01

Pos logit contributions
 supervision+0.296
 wounds+0.279
 permission+0.258
ppa+0.238
olds+0.236
Neg logit contributions
 valley-0.143
 woods-0.139
 mountain-0.139
 forest-0.135
 lake-0.133
Token he
Token ID339
Feature activation-1.53e+00

Pos logit contributions
 supervision+0.030
 wounds+0.028
 permission+0.026
ppa+0.024
olds+0.024
Neg logit contributions
 valley-0.013
 woods-0.013
 mountain-0.013
 forest-0.012
 lake-0.012
Token would
Token ID561
Feature activation-1.62e+00

Pos logit contributions
 supervision+0.066
 wounds+0.063
 permission+0.059
 apologize+0.055
olds+0.053
Neg logit contributions
 valley-0.037
 mountain-0.034
 woods-0.034
 lake-0.034
 field-0.033
Token always
Token ID1464
Feature activation-4.12e+00

Pos logit contributions
 supervision+0.060
 wounds+0.056
 permission+0.053
 apologize+0.049
olds+0.046
Neg logit contributions
 valley-0.047
 woods-0.046
 mountain-0.046
 field-0.044
 forest-0.044
Token make
Token ID787
Feature activation-3.38e+00

Pos logit contributions
 supervision+0.147
 wounds+0.136
 permission+0.125
 apologize+0.112
olds+0.109
Neg logit contributions
 valley-0.131
 woods-0.127
 mountain-0.126
 forest-0.124
 lake-0.123
Token sure
Token ID1654
Feature activation-3.91e+00

Pos logit contributions
 supervision+0.138
 wounds+0.126
 permission+0.117
ppa+0.110
olds+0.109
Neg logit contributions
 valley-0.086
 woods-0.085
 forest-0.084
 mountain-0.084
 lake-0.083
Token to
Token ID284
Feature activation-6.46e+00

Pos logit contributions
 supervision+0.179
 wounds+0.169
 permission+0.157
ppa+0.146
olds+0.144
Neg logit contributions
 valley-0.082
 mountain-0.079
 woods-0.079
 forest-0.076
 lake-0.075
Token put
Token ID1234
Feature activation-4.01e+00

Pos logit contributions
 supervision+0.228
 wounds+0.213
 permission+0.197
 apologize+0.176
ppa+0.171
Neg logit contributions
 valley-0.206
 woods-0.200
 mountain-0.199
 forest-0.193
 field-0.192
Token left
Token ID1364
Feature activation-2.64e+00

Pos logit contributions
 supervision+0.179
 wounds+0.167
 permission+0.156
ppa+0.144
olds+0.144
Neg logit contributions
 valley-0.089
 woods-0.086
 mountain-0.085
 forest-0.084
 lake-0.084
Tokenovers
Token ID13801
Feature activation-4.89e+00

Pos logit contributions
 supervision+0.115
 wounds+0.108
 permission+0.101
 apologize+0.092
ppa+0.092
Neg logit contributions
 valley-0.061
 mountain-0.059
 woods-0.059
 forest-0.057
 lake-0.057
Token in
Token ID287
Feature activation-1.98e+00

Pos logit contributions
 supervision+0.194
 wounds+0.182
 permission+0.170
 apologize+0.152
ppa+0.150
Neg logit contributions
 valley-0.133
 woods-0.128
 mountain-0.128
 forest-0.125
 lake-0.124
TokenThey
Token ID2990
Feature activation+2.11e+00

Pos logit contributions
 supervision+0.126
 wounds+0.115
 permission+0.110
 apologize+0.097
olds+0.096
Neg logit contributions
 valley-0.077
 woods-0.075
 mountain-0.074
 field-0.073
 forest-0.073
Token swap
Token ID16075
Feature activation+4.96e-01

Pos logit contributions
 valley+0.061
 woods+0.061
 mountain+0.059
 forest+0.059
 lake+0.058
Neg logit contributions
 supervision-0.079
 wounds-0.073
 permission-0.068
 apologize-0.062
olds-0.060
Token their
Token ID511
Feature activation-3.29e+00

Pos logit contributions
 valley+0.014
 mountain+0.013
 woods+0.013
 field+0.013
 lake+0.013
Neg logit contributions
 supervision-0.020
 wounds-0.018
 permission-0.017
ppa-0.015
 apologize-0.015
Token planes
Token ID13016
Feature activation-2.16e+00

Pos logit contributions
 supervision+0.160
 wounds+0.150
 permission+0.142
ppa+0.132
 apologize+0.132
Neg logit contributions
 valley-0.062
 mountain-0.059
 woods-0.058
 forest-0.056
 lake-0.056
Token and
Token ID290
Feature activation-3.05e+00

Pos logit contributions
 supervision+0.101
 wounds+0.096
 permission+0.090
 apologize+0.082
ppa+0.082
Neg logit contributions
 valley-0.043
 mountain-0.041
 woods-0.041
 forest-0.040
 lake-0.039
Token throw
Token ID3714
Feature activation-1.39e+00

Pos logit contributions
 supervision+0.130
 wounds+0.122
 permission+0.115
ppa+0.105
olds+0.104
Neg logit contributions
 valley-0.075
 mountain-0.072
 woods-0.071
 field-0.070
 lake-0.069
Token them
Token ID606
Feature activation-4.39e+00

Pos logit contributions
 supervision+0.047
 wounds+0.044
 permission+0.040
 apologize+0.035
olds+0.034
Neg logit contributions
 valley-0.046
 mountain-0.045
 woods-0.045
 forest-0.044
 lake-0.043
Token again
Token ID757
Feature activation-1.31e+00

Pos logit contributions
 supervision+0.195
 wounds+0.185
 permission+0.173
 apologize+0.158
olds+0.157
Neg logit contributions
 valley-0.098
 woods-0.094
 mountain-0.094
 forest-0.091
 lake-0.090
Token.
Token ID13
Feature activation+1.04e-01

Pos logit contributions
 supervision+0.051
 wounds+0.050
 permission+0.046
 attention+0.040
ppa+0.039
Neg logit contributions
 valley-0.033
 woods-0.032
 mountain-0.031
 lake-0.031
 forest-0.031
Token But
Token ID887
Feature activation-6.20e-01

Pos logit contributions
 mountain+0.002
 valley+0.002
 forest+0.002
 woods+0.002
 lake+0.002
Neg logit contributions
 supervision-0.005
 wounds-0.004
 permission-0.004
 apologize-0.004
olds-0.004
Token this
Token ID428
Feature activation+3.07e+00

Pos logit contributions
 supervision+0.027
 wounds+0.025
 permission+0.024
 apologize+0.022
ppa+0.021
Neg logit contributions
 valley-0.015
 mountain-0.014
 woods-0.014
 forest-0.014
 lake-0.014
Token Ben
Token ID3932
Feature activation-4.37e+00

Pos logit contributions
 supervision+0.018
 wounds+0.017
 permission+0.016
 apologize+0.015
olds+0.015
Neg logit contributions
 valley-0.008
 mountain-0.007
 woods-0.007
 field-0.007
 forest-0.007
Token is
Token ID318
Feature activation+8.82e-01

Pos logit contributions
 supervision+0.186
 wounds+0.173
 permission+0.159
 apologize+0.153
ppa+0.147
Neg logit contributions
 valley-0.107
 mountain-0.098
 woods-0.098
 lake-0.094
 field-0.093
Token teasing
Token ID34417
Feature activation-2.97e+00

Pos logit contributions
 valley+0.026
 mountain+0.025
 woods+0.025
 lake+0.024
 forest+0.024
Neg logit contributions
 supervision-0.033
 wounds-0.031
 permission-0.029
 apologize-0.026
ppa-0.026
Token me
Token ID502
Feature activation-6.76e+00

Pos logit contributions
 supervision+0.137
 wounds+0.128
 permission+0.121
ppa+0.114
olds+0.108
Neg logit contributions
 forest-0.059
 woods-0.059
 valley-0.058
 lake-0.057
 mountain-0.057
Token."
Token ID526
Feature activation-2.49e+00

Pos logit contributions
 supervision+0.305
 wounds+0.289
 permission+0.270
 apologize+0.247
olds+0.245
Neg logit contributions
 valley-0.149
 woods-0.143
 mountain-0.142
 forest-0.137
 lake-0.137
Token Mia
Token ID32189
Feature activation-3.90e+00

Pos logit contributions
 supervision+0.096
 wounds+0.090
 permission+0.083
ppa+0.074
 apologize+0.074
Neg logit contributions
 valley-0.071
 woods-0.069
 mountain-0.069
 forest-0.067
 lake-0.067
Token said
Token ID531
Feature activation-4.24e+00

Pos logit contributions
 supervision+0.162
 wounds+0.150
 permission+0.141
olds+0.125
ppa+0.124
Neg logit contributions
 valley-0.099
 woods-0.095
 mountain-0.095
 forest-0.095
 field-0.094
Token.
Token ID13
Feature activation-1.62e+00

Pos logit contributions
 supervision+0.235
 wounds+0.226
 permission+0.210
beh+0.205
ppa+0.204
Neg logit contributions
 sky-0.024
 A-0.023
 top-0.023
 field-0.022
 mountain-0.021
Token
Token ID198
Feature activation-4.10e+00

Pos logit contributions
 supervision+0.077
 wounds+0.073
 permission+0.068
olds+0.063
 apologize+0.063
Neg logit contributions
 valley-0.032
 mountain-0.031
 woods-0.030
 forest-0.029
 lake-0.029
Token
Token ID198
Feature activation-2.78e+00

Pos logit contributions
 supervision+0.223
 wounds+0.207
 permission+0.204
beh+0.187
ppa+0.186
Neg logit contributions
 woods-0.054
 valley-0.053
 forest-0.050
 lake-0.049
 mountain-0.049
TokenMom
Token ID29252
Feature activation-5.42e+00

Pos logit contributions
 supervision+0.115
 wounds+0.108
 permission+0.101
 apologize+0.090
ppa+0.090
Neg logit contributions
 valley-0.072
 woods-0.070
 mountain-0.069
 forest-0.068
 lake-0.067
Token be
Token ID307
Feature activation-3.50e+00

Pos logit contributions
 supervision+0.039
 wounds+0.036
 permission+0.034
 apologize+0.031
olds+0.029
Neg logit contributions
 valley-0.034
 woods-0.033
 mountain-0.032
 forest-0.032
 field-0.031
Token able
Token ID1498
Feature activation-8.55e-01

Pos logit contributions
 supervision+0.137
 wounds+0.126
 permission+0.113
olds+0.111
 apology+0.109
Neg logit contributions
 valley-0.093
 woods-0.092
 field-0.091
 forest-0.090
 lake-0.088
Token to
Token ID284
Feature activation-3.47e+00

Pos logit contributions
 supervision+0.038
 wounds+0.036
 permission+0.033
ppa+0.030
 apologize+0.030
Neg logit contributions
 valley-0.019
 woods-0.019
 mountain-0.019
 forest-0.018
 lake-0.018
Token heal
Token ID12035
Feature activation-5.09e+00

Pos logit contributions
 supervision+0.120
 wounds+0.113
 permission+0.105
 apologize+0.094
ppa+0.088
Neg logit contributions
 valley-0.111
 woods-0.109
 mountain-0.107
 forest-0.105
 field-0.104
Token the
Token ID262
Feature activation-3.11e+00

Pos logit contributions
 supervision+0.215
 wounds+0.199
 permission+0.186
olds+0.172
 apologize+0.170
Neg logit contributions
 valley-0.125
 mountain-0.121
 woods-0.121
 forest-0.118
 lake-0.118
Token leather
Token ID11620
Feature activation-1.73e+00

Pos logit contributions
 supervision+0.144
 wounds+0.138
 permission+0.130
 apologize+0.115
 apology+0.115
Neg logit contributions
 valley-0.062
 mountain-0.059
 woods-0.057
 forest-0.055
 field-0.054
Token.
Token ID13
Feature activation+1.63e-01

Pos logit contributions
 supervision+0.071
 wounds+0.068
 permission+0.063
ppa+0.055
 apologize+0.055
Neg logit contributions
 valley-0.043
 woods-0.042
 mountain-0.041
 forest-0.040
 lake-0.039
Token Maybe
Token ID6674
Feature activation-9.24e-01

Pos logit contributions
 valley+0.004
 mountain+0.003
 woods+0.003
 forest+0.003
 lake+0.003
Neg logit contributions
 supervision-0.007
 wounds-0.007
 permission-0.006
olds-0.006
 apologize-0.006
Token the
Token ID262
Feature activation-2.63e-01

Pos logit contributions
 supervision+0.042
 wounds+0.039
 permission+0.038
 apologize+0.033
olds+0.033
Neg logit contributions
 valley-0.021
 woods-0.019
 forest-0.018
 lake-0.018
 mountain-0.018
Token shop
Token ID6128
Feature activation+1.35e-01

Pos logit contributions
 supervision+0.012
 wounds+0.011
 permission+0.010
 apologize+0.009
olds+0.009
Neg logit contributions
 valley-0.006
 mountain-0.006
 woods-0.006
 forest-0.005
 lake-0.005
Tokenkeeper
Token ID13884
Feature activation-3.73e+00

Pos logit contributions
 valley+0.004
 woods+0.004
 mountain+0.004
 forest+0.004
 lake+0.004
Neg logit contributions
 supervision-0.005
 wounds-0.005
 permission-0.004
 apologize-0.004
olds-0.004
Token waited
Token ID13488
Feature activation-1.05e+00

Pos logit contributions
 valley+0.056
 woods+0.054
 mountain+0.054
 forest+0.053
 lake+0.052
Neg logit contributions
 supervision-0.082
 wounds-0.078
 permission-0.072
 apologize-0.065
ppa-0.065
Token for
Token ID329
Feature activation-2.90e+00

Pos logit contributions
 supervision+0.039
 wounds+0.037
 permission+0.034
 apologize+0.030
ppa+0.030
Neg logit contributions
 valley-0.031
 woods-0.030
 mountain-0.030
 forest-0.029
 lake-0.029
Token their
Token ID511
Feature activation-4.01e+00

Pos logit contributions
 supervision+0.131
 wounds+0.123
 permission+0.114
ppa+0.107
olds+0.106
Neg logit contributions
 valley-0.063
 mountain-0.061
 woods-0.060
 lake-0.059
 forest-0.059
Token mom
Token ID1995
Feature activation-7.80e+00

Pos logit contributions
 supervision+0.144
 wounds+0.133
 permission+0.122
 apologize+0.108
olds+0.107
Neg logit contributions
 valley-0.126
 woods-0.122
 mountain-0.122
 forest-0.119
 lake-0.119
Token to
Token ID284
Feature activation-3.65e+00

Pos logit contributions
 supervision+0.345
 wounds+0.328
 permission+0.304
 apologize+0.277
olds+0.276
Neg logit contributions
 valley-0.175
 woods-0.169
 mountain-0.168
 forest-0.164
 field-0.162
Token come
Token ID1282
Feature activation-7.32e+00

Pos logit contributions
 supervision+0.143
 wounds+0.134
 permission+0.121
olds+0.113
beh+0.111
Neg logit contributions
 valley-0.100
 mountain-0.098
 lake-0.096
 woods-0.096
 forest-0.096
Token home
Token ID1363
Feature activation-1.85e+00

Pos logit contributions
 supervision+0.280
 wounds+0.264
 permission+0.247
 apologize+0.219
ppa+0.217
Neg logit contributions
 valley-0.209
 mountain-0.203
 woods-0.202
 forest-0.197
 lake-0.196
Token.
Token ID13
Feature activation-6.73e-02

Pos logit contributions
 supervision+0.083
 wounds+0.078
 permission+0.074
 apologize+0.067
ppa+0.066
Neg logit contributions
 valley-0.041
 woods-0.039
 mountain-0.039
 lake-0.038
 forest-0.038
Token When
Token ID1649
Feature activation+4.39e+00

Pos logit contributions
 supervision+0.003
 wounds+0.003
 permission+0.003
olds+0.003
 apologize+0.003
Neg logit contributions
 valley-0.001
 mountain-0.001
 forest-0.001
 lake-0.001
 woods-0.001
Token she
Token ID673
Feature activation+2.50e+00

Pos logit contributions
 valley+0.124
 forest+0.118
 mountain+0.118
 lake+0.118
 woods+0.118
Neg logit contributions
 supervision-0.164
 wounds-0.153
 permission-0.147
 apologize-0.136
olds-0.126
Token saw
Token ID2497
Feature activation+2.48e+00

Pos logit contributions
 valley+0.061
 woods+0.058
 mountain+0.057
 lake+0.055
 forest+0.055
Neg logit contributions
 supervision-0.102
 wounds-0.097
 permission-0.090
 apologize-0.086
ppa-0.085
Token fun
Token ID1257
Feature activation+1.64e-01

Pos logit contributions
 valley+0.027
 mountain+0.025
 woods+0.025
 forest+0.024
 lake+0.024
Neg logit contributions
 supervision-0.042
 wounds-0.039
 permission-0.036
 apologize-0.034
 attention-0.032
Token,
Token ID11
Feature activation+8.91e-01

Pos logit contributions
 valley+0.004
 woods+0.004
 mountain+0.004
 lake+0.004
 sky+0.004
Neg logit contributions
 supervision-0.006
 wounds-0.006
 permission-0.006
 attention-0.005
 apologize-0.005
Tokenâ
Token ID22940
Feature activation-3.68e+00

Pos logit contributions
 valley+0.023
 woods+0.023
 mountain+0.023
 forest+0.022
 lake+0.021
Neg logit contributions
 supervision-0.036
 wounds-0.034
 permission-0.032
 apologize-0.029
olds-0.028
Token€
Token ID26391
Feature activation-3.41e+00

Pos logit contributions
 supervision+0.107
 wounds+0.098
 permission+0.088
 apologize+0.075
ppa+0.075
Neg logit contributions
 valley-0.140
 woods-0.136
 mountain-0.136
 forest-0.134
 lake-0.133
Token said
Token ID531
Feature activation-2.55e+00

Pos logit contributions
 supervision+0.127
 wounds+0.117
 permission+0.107
ppa+0.098
 apologize+0.097
Neg logit contributions
 valley-0.101
 woods-0.098
 mountain-0.098
 forest-0.095
 lake-0.095
Token Mom
Token ID11254
Feature activation-8.74e+00

Pos logit contributions
 supervision+0.106
 wounds+0.099
 permission+0.093
 apologize+0.084
olds+0.083
Neg logit contributions
 valley-0.066
 woods-0.064
 mountain-0.063
 forest-0.062
 lake-0.061
Tokenmy
Token ID1820
Feature activation-5.34e+00

Pos logit contributions
 supervision+0.198
 wounds+0.167
 permission+0.152
 apologize+0.121
ppa+0.120
Neg logit contributions
 valley-0.394
 woods-0.382
 mountain-0.380
 lake-0.375
 forest-0.374
Token.
Token ID13
Feature activation-2.38e-01

Pos logit contributions
 supervision+0.237
 permission+0.212
 wounds+0.210
 apologize+0.188
olds+0.187
Neg logit contributions
 valley-0.128
 woods-0.119
 lake-0.116
 mountain-0.115
 forest-0.115
Token �
Token ID6184
Feature activation-2.48e+00

Pos logit contributions
 supervision+0.010
 wounds+0.009
 permission+0.009
olds+0.008
 apologize+0.008
Neg logit contributions
 valley-0.006
 mountain-0.006
 woods-0.006
 forest-0.005
 lake-0.005
Token�
Token ID95
Feature activation-3.66e+00

Pos logit contributions
 supervision+0.096
 wounds+0.088
 permission+0.084
 apologize+0.077
olds+0.076
Neg logit contributions
 woods-0.069
 valley-0.068
 mountain-0.067
 forest-0.067
 field-0.065
Token€
Token ID26391
Feature activation-4.00e+00

Pos logit contributions
 supervision+0.133
 wounds+0.123
 permission+0.114
 apologize+0.101
olds+0.100
Neg logit contributions
 valley-0.113
 mountain-0.110
 woods-0.109
 forest-0.107
 lake-0.106
Token it
Token ID340
Feature activation-2.65e+00

Pos logit contributions
 supervision+0.031
 wounds+0.029
 permission+0.028
ppa+0.026
beh+0.026
Neg logit contributions
 valley-0.012
 mountain-0.011
 woods-0.011
 forest-0.011
 field-0.011
Token on
Token ID319
Feature activation-1.05e+00

Pos logit contributions
 supervision+0.108
 wounds+0.102
 permission+0.096
 apologize+0.087
ppa+0.085
Neg logit contributions
 valley-0.068
 woods-0.066
 mountain-0.065
 lake-0.064
 forest-0.063
Token your
Token ID534
Feature activation-4.60e+00

Pos logit contributions
 supervision+0.041
 wounds+0.039
 permission+0.036
 apologize+0.031
olds+0.031
Neg logit contributions
 valley-0.029
 woods-0.028
 mountain-0.028
 lake-0.027
 forest-0.027
Token toes
Token ID23932
Feature activation-2.12e+00

Pos logit contributions
 supervision+0.200
 wounds+0.187
 permission+0.175
olds+0.161
ppa+0.161
Neg logit contributions
 valley-0.109
 mountain-0.106
 woods-0.105
 forest-0.103
 field-0.102
Token,"
Token ID553
Feature activation-3.32e+00

Pos logit contributions
 supervision+0.092
 wounds+0.086
 permission+0.081
ppa+0.073
 apologize+0.073
Neg logit contributions
 valley-0.050
 woods-0.049
 mountain-0.048
 forest-0.047
 lake-0.047
Token mom
Token ID1995
Feature activation-8.37e+00

Pos logit contributions
 supervision+0.138
 wounds+0.128
 permission+0.119
beh+0.118
ppa+0.115
Neg logit contributions
 valley-0.083
 woods-0.081
 forest-0.080
 field-0.080
 mountain-0.079
Token says
Token ID1139
Feature activation-5.48e+00

Pos logit contributions
 supervision+0.315
 wounds+0.293
 permission+0.272
ppa+0.243
beh+0.236
Neg logit contributions
 forest-0.235
 valley-0.234
 mountain-0.234
 woods-0.233
 lake-0.231
Token.
Token ID13
Feature activation-1.79e+00

Pos logit contributions
 supervision+0.301
 wounds+0.290
 permission+0.266
ppa+0.263
beh+0.262
Neg logit contributions
 A-0.031
 sky-0.031
 field-0.027
 mountain-0.026
 valley-0.025
Token
Token ID198
Feature activation-4.65e+00

Pos logit contributions
 supervision+0.087
 wounds+0.082
 permission+0.077
 apologize+0.071
ppa+0.071
Neg logit contributions
 valley-0.033
 woods-0.032
 mountain-0.031
 forest-0.031
 lake-0.030
Token
Token ID198
Feature activation-3.27e+00

Pos logit contributions
 supervision+0.257
 permission+0.236
 wounds+0.234
beh+0.217
ppa+0.213
Neg logit contributions
 woods-0.060
 valley-0.060
 forest-0.057
 field-0.054
 lake-0.054
TokenAnna
Token ID31160
Feature activation-3.18e+00

Pos logit contributions
 supervision+0.142
 wounds+0.132
 permission+0.125
 apologize+0.112
ppa+0.112
Neg logit contributions
 valley-0.079
 woods-0.076
 mountain-0.075
 forest-0.073
 field-0.073
Token leaf
Token ID12835
Feature activation+2.58e+00

Pos logit contributions
 woods+0.044
 valley+0.044
 forest+0.037
 A+0.037
 sky+0.036
Neg logit contributions
 supervision-0.122
 wounds-0.115
 permission-0.106
 apologize-0.101
 apology-0.099
Token.
Token ID13
Feature activation+5.45e-01

Pos logit contributions
 valley+0.070
 woods+0.070
 A+0.066
 lake+0.064
 mountain+0.064
Neg logit contributions
 supervision-0.090
 wounds-0.078
 permission-0.074
 apologize-0.067
ppa-0.066
Token
Token ID198
Feature activation-2.67e+00

Pos logit contributions
 valley+0.012
 mountain+0.012
 woods+0.011
 forest+0.011
 field+0.011
Neg logit contributions
 supervision-0.024
 wounds-0.023
 permission-0.021
olds-0.020
 apologize-0.020
Token
Token ID198
Feature activation-1.63e+00

Pos logit contributions
 supervision+0.141
 wounds+0.131
 permission+0.128
ppa+0.115
beh+0.114
Neg logit contributions
 valley-0.041
 woods-0.040
 forest-0.038
 mountain-0.038
 lake-0.038
Token"
Token ID1
Feature activation-3.18e+00

Pos logit contributions
 supervision+0.072
 wounds+0.068
 permission+0.064
 apologize+0.058
ppa+0.058
Neg logit contributions
 valley-0.037
 mountain-0.036
 woods-0.036
 forest-0.035
 lake-0.034
TokenL
Token ID43
Feature activation-7.79e+00

Pos logit contributions
 supervision+0.131
 wounds+0.124
 permission+0.115
 apologize+0.104
olds+0.103
Neg logit contributions
 valley-0.082
 mountain-0.080
 woods-0.079
 forest-0.077
 field-0.076
Tokenila
Token ID10102
Feature activation-5.18e+00

Pos logit contributions
 supervision+0.226
 wounds+0.205
 permission+0.189
olds+0.159
 apologize+0.158
Neg logit contributions
 valley-0.300
 mountain-0.292
 woods-0.292
 lake-0.286
 forest-0.286
Token,
Token ID11
Feature activation-1.14e+00

Pos logit contributions
 supervision+0.222
 wounds+0.206
 permission+0.193
ppa+0.179
olds+0.177
Neg logit contributions
 valley-0.127
 woods-0.118
 mountain-0.113
 lake-0.111
 forest-0.108
Token Tom
Token ID4186
Feature activation-4.85e+00

Pos logit contributions
 supervision+0.044
 wounds+0.040
 permission+0.038
 apologize+0.034
olds+0.032
Neg logit contributions
 valley-0.033
 woods-0.033
 mountain-0.032
 forest-0.031
 field-0.031
Token,
Token ID11
Feature activation-1.05e+00

Pos logit contributions
 supervision+0.223
 wounds+0.206
 permission+0.195
ppa+0.182
 apologize+0.181
Neg logit contributions
 valley-0.105
 woods-0.094
 mountain-0.094
 lake-0.089
 field-0.088
Token it
Token ID340
Feature activation+2.14e+00

Pos logit contributions
 supervision+0.039
 wounds+0.036
 permission+0.035
 apologize+0.030
olds+0.029
Neg logit contributions
 valley-0.032
 woods-0.031
 mountain-0.030
 forest-0.030
 field-0.030
Token home
Token ID1363
Feature activation-1.77e-01

Pos logit contributions
 valley+0.062
 woods+0.061
 mountain+0.060
 forest+0.059
 field+0.059
Neg logit contributions
 supervision-0.078
 wounds-0.073
 permission-0.067
olds-0.059
ppa-0.059
Token,
Token ID11
Feature activation+8.46e-01

Pos logit contributions
 supervision+0.009
 wounds+0.009
 permission+0.008
 apologize+0.007
olds+0.007
Neg logit contributions
 valley-0.003
 woods-0.003
 mountain-0.003
 forest-0.003
 lake-0.003
Token he
Token ID339
Feature activation+2.48e+00

Pos logit contributions
 valley+0.018
 mountain+0.017
 woods+0.017
 forest+0.016
 lake+0.016
Neg logit contributions
 supervision-0.039
 wounds-0.036
 permission-0.034
 apologize-0.032
ppa-0.031
Token told
Token ID1297
Feature activation-3.34e+00

Pos logit contributions
 valley+0.063
 mountain+0.061
 woods+0.061
 forest+0.059
 lake+0.059
Neg logit contributions
 supervision-0.103
 wounds-0.097
 permission-0.090
 apologize-0.081
olds-0.081
Token his
Token ID465
Feature activation-2.72e+00

Pos logit contributions
 supervision+0.156
 wounds+0.140
ppa+0.124
beh+0.122
 permission+0.121
Neg logit contributions
 valley-0.066
 forest-0.066
 woods-0.063
 lake-0.062
 mountain-0.061
Token mom
Token ID1995
Feature activation-8.35e+00

Pos logit contributions
 supervision+0.100
 wounds+0.091
 permission+0.084
ppa+0.076
 apologize+0.074
Neg logit contributions
 valley-0.085
 mountain-0.083
 woods-0.082
 forest-0.081
 lake-0.080
Token all
Token ID477
Feature activation-1.29e+00

Pos logit contributions
 supervision+0.340
 wounds+0.318
 permission+0.297
 apologize+0.268
olds+0.266
Neg logit contributions
 valley-0.220
 mountain-0.211
 woods-0.211
 forest-0.205
 lake-0.205
Token about
Token ID546
Feature activation+6.31e-03

Pos logit contributions
 supervision+0.052
 wounds+0.049
 permission+0.045
ppa+0.040
olds+0.040
Neg logit contributions
 valley-0.035
 woods-0.034
 mountain-0.033
 forest-0.033
 lake-0.032
Token his
Token ID465
Feature activation-2.31e+00

Pos logit contributions
 valley+0.000
 woods+0.000
 mountain+0.000
 forest+0.000
 lake+0.000
Neg logit contributions
 supervision-0.000
 wounds-0.000
 permission-0.000
ppa-0.000
olds-0.000
Token fun
Token ID1257
Feature activation-1.51e+00

Pos logit contributions
 supervision+0.098
 wounds+0.093
 permission+0.086
 apologize+0.078
olds+0.077
Neg logit contributions
 valley-0.057
 mountain-0.055
 woods-0.055
 forest-0.053
 lake-0.053
Token day
Token ID1110
Feature activation+4.46e-01

Pos logit contributions
 supervision+0.072
 wounds+0.068
 permission+0.064
ppa+0.059
 apologize+0.059
Neg logit contributions
 valley-0.029
 woods-0.028
 mountain-0.028
 forest-0.027
 lake-0.027
Token feeling
Token ID4203
Feature activation+1.78e+00

Pos logit contributions
 valley+0.095
 mountain+0.093
 lake+0.090
 woods+0.090
 forest+0.087
Neg logit contributions
 supervision-0.127
 wounds-0.120
 permission-0.114
ppa-0.101
 apologize-0.100
Token scared
Token ID12008
Feature activation-2.00e-01

Pos logit contributions
 valley+0.048
 mountain+0.046
 woods+0.046
 lake+0.045
 forest+0.045
Neg logit contributions
 supervision-0.071
 wounds-0.066
 permission-0.062
 apologize-0.056
ppa-0.055
Token and
Token ID290
Feature activation-3.41e+00

Pos logit contributions
 supervision+0.008
 wounds+0.007
 permission+0.007
 attention+0.006
 apologize+0.006
Neg logit contributions
 valley-0.005
 mountain-0.005
 lake-0.005
 woods-0.005
 field-0.005
Token asked
Token ID1965
Feature activation-3.73e+00

Pos logit contributions
 supervision+0.123
 wounds+0.115
 permission+0.107
 apologize+0.097
olds+0.092
Neg logit contributions
 valley-0.104
 mountain-0.102
 lake-0.100
 woods-0.099
 forest-0.097
Token her
Token ID607
Feature activation-3.20e+00

Pos logit contributions
 supervision+0.149
 wounds+0.140
ppa+0.119
beh+0.118
 permission+0.117
Neg logit contributions
 woods-0.099
 valley-0.098
 forest-0.097
 mountain-0.092
 field-0.092
Token mom
Token ID1995
Feature activation-8.85e+00

Pos logit contributions
 supervision+0.110
 wounds+0.101
 permission+0.092
 apologize+0.081
ppa+0.081
Neg logit contributions
 valley-0.106
 mountain-0.103
 woods-0.103
 forest-0.101
 lake-0.101
Token if
Token ID611
Feature activation-1.81e+00

Pos logit contributions
 supervision+0.319
 wounds+0.305
 permission+0.266
ppa+0.243
beh+0.241
Neg logit contributions
 valley-0.266
 woods-0.263
 mountain-0.261
 forest-0.260
 field-0.256
Token she
Token ID673
Feature activation-9.10e-01

Pos logit contributions
 supervision+0.082
 wounds+0.076
 permission+0.073
 apologize+0.066
ppa+0.065
Neg logit contributions
 valley-0.039
 mountain-0.038
 woods-0.038
 lake-0.037
 forest-0.037
Token could
Token ID714
Feature activation-1.69e+00

Pos logit contributions
 supervision+0.042
 wounds+0.039
 permission+0.037
 apologize+0.034
ppa+0.034
Neg logit contributions
 valley-0.020
 mountain-0.018
 woods-0.018
 lake-0.018
 forest-0.018
Token sleep
Token ID3993
Feature activation-2.18e-01

Pos logit contributions
 supervision+0.057
 wounds+0.053
 permission+0.051
 apologize+0.048
olds+0.043
Neg logit contributions
 valley-0.053
 woods-0.053
 mountain-0.051
 field-0.051
 lake-0.050
Token in
Token ID287
Feature activation-1.08e-01

Pos logit contributions
 supervision+0.009
 wounds+0.009
 permission+0.008
 apologize+0.008
ppa+0.007
Neg logit contributions
 valley-0.005
 mountain-0.005
 woods-0.005
 lake-0.005
 forest-0.005