TOP ACTIVATIONS
MAX = 0.336

He began to paddle across the bath . He
his boat and began to paddle away , eagerly looking for
didn 't know how to paddle . Suddenly
red . It has a paddle . Lily and Tom want
the boat and started to paddle . She explored deeper and
started to do do ggy paddle , she loved the way
was scared and tried to paddle away . But the bear
boat . He took a paddle and started to paddle .
their own kay ak and paddle as fast as they can
a paddle and started to paddle . Suddenly , he heard
to hold the stick and paddle together . " Are you
a real boat with a paddle . He quickly row ed
ak . They liked to paddle and see the fish and
they can sit on and paddle . They pretend they are
ar , and started to paddle . The
wanted to go for a paddle . But Tom was too
I 'm going for a paddle in the lake ," replied
Rob how to use a paddle to move the boat .
the boat and took the paddle . He pushed and pulled
his mom and started to paddle away with the lumber .

MIN ACTIVATIONS
MIN = -0.272

," it said , " you won 't regret it ".
smiles too and says " you are welcome , Tom .
agrees and says , " Deal . Let 's make a
said . " Deal !" the twins said .
. She said , " Deal , mom . Thank you
. He said , " Deal . I promise . Let
said . " Deal !" Lily said .
. I ... miss ... you ... very ... much .
nod and say , " Deal , mom . Thank you
S ara said , " Deal ." She put her tablet
She smiled and said " you 're right , John !
again ." " Deal ," the judge said .
and Ben say , " Deal , mom . We will
. She said , " Deal , Anna . But you
I ... h ope ... you ... are ... well ...
. She said , " Deal , Mom . Thank you
. I ... sent ... you ... a ... sur prise
, she softly said " you 'll be okay . I
. And you say ' you 're welcome ' when you
him and said , " you don 't have to weep

INTERVAL 0.184 - 0.336
CONTAINS 0.609%

obby nodded and they both moved inside the cave . The
he decided to investigate . He moved closer to get a
outside and the sun was shining so brightly . Hannah 's
a farmer called Pete . He was growing onions in his
bit more careful , this stream could be dangerous ".

INTERVAL 0.032 - 0.184
CONTAINS 32.358%

was very sad . Tim my 's mom saw
. From that day on , the diamond became Lucy 's
were very happy . The switch was happy too because it
and stopped feeling worried . She thanked her mum and ran
all for herself . The lady said , "

INTERVAL -0.120 - 0.032
CONTAINS 60.329%

. He was happy and good . One day
ily was happy to help the lost puppy . She held
from the sky and land on the ground . She ran
me think of a joke . I know ! What did
forgot the big , white cloud . She will always care

INTERVAL -0.272 - -0.120
CONTAINS 6.702%

there was a little duck who loved to swim in the
said yes , so they all got together and made a
" Don 't worry , we can buy some new blocks
and friendship . He asked all the animals to unite and
But you should ask before you touch something that 's not
Token
Token ID198
Feature activation+7.66e-02

Pos logit contributions
 virtue+0.072
 eater+0.069
 mistake+0.066
nie+0.066
olding+0.066
Neg logit contributions
 valleys-0.022
He-0.020
 sails-0.020
His-0.019
rolled-0.018
Token
Token ID198
Feature activation+7.55e-02

Pos logit contributions
 valleys+0.079
 sails+0.073
itched+0.071
 sail+0.067
He+0.065
Neg logit contributions
 virtue-0.161
 eater-0.159
nie-0.150
 tutor-0.150
olding-0.148
TokenHe
Token ID1544
Feature activation+1.83e-01

Pos logit contributions
 valleys+0.073
 sails+0.068
itched+0.063
He+0.062
 sail+0.062
Neg logit contributions
 virtue-0.162
 eater-0.160
 tutor-0.153
nie-0.153
olding-0.150
Token began
Token ID2540
Feature activation+1.10e-01

Pos logit contributions
 valleys+0.229
 sails+0.216
itched+0.208
 sail+0.200
 widened+0.188
Neg logit contributions
 virtue-0.342
 eater-0.336
nie-0.322
 tutor-0.318
olding-0.314
Token to
Token ID284
Feature activation-4.78e-02

Pos logit contributions
 valleys+0.232
 sails+0.226
itched+0.218
 sail+0.215
 widened+0.210
Neg logit contributions
 virtue-0.111
 eater-0.108
 tutor-0.098
nie-0.097
olding-0.096
Token paddle
Token ID39517
Feature activation+3.36e-01

Pos logit contributions
 virtue+0.070
 eater+0.069
nie+0.065
 tutor+0.064
olding+0.062
Neg logit contributions
 valleys-0.079
 sails-0.075
itched-0.073
 sail-0.070
 widened-0.068
Token across
Token ID1973
Feature activation+1.41e-01

Pos logit contributions
 valleys+0.492
 sails+0.475
itched+0.458
 sail+0.444
 gl+0.423
Neg logit contributions
 virtue-0.554
 eater-0.540
nie-0.512
 tutor-0.505
olding-0.504
Token the
Token ID262
Feature activation+8.18e-03

Pos logit contributions
 valleys+0.208
 sails+0.200
itched+0.189
 sail+0.187
 gl+0.178
Neg logit contributions
 virtue-0.235
 eater-0.229
nie-0.217
 tutor-0.214
olding-0.213
Token bath
Token ID7837
Feature activation+1.16e-01

Pos logit contributions
 valleys+0.008
 sails+0.008
itched+0.008
 sail+0.007
 widened+0.007
Neg logit contributions
 virtue-0.017
 eater-0.017
nie-0.016
 tutor-0.016
olding-0.016
Token.
Token ID13
Feature activation-3.12e-02

Pos logit contributions
 valleys+0.160
 sails+0.154
 sail+0.144
itched+0.142
 gl+0.138
Neg logit contributions
 virtue-0.204
 eater-0.196
nie-0.191
 tutor-0.190
olding-0.186
Token He
Token ID679
Feature activation+1.82e-01

Pos logit contributions
 virtue+0.074
 eater+0.072
nie+0.070
 promise+0.069
 mistake+0.068
Neg logit contributions
 valleys-0.025
 sails-0.023
 sail-0.021
itched-0.020
rolled-0.019
Token his
Token ID465
Feature activation+1.22e-01

Pos logit contributions
 valleys+0.143
 sails+0.141
itched+0.140
 sail+0.132
rolling+0.130
Neg logit contributions
 eater-0.130
 virtue-0.127
 tutor-0.123
nie-0.123
 student-0.118
Token boat
Token ID8848
Feature activation+1.68e-01

Pos logit contributions
 valleys+0.197
 widened+0.197
itched+0.182
 widen+0.176
 sails+0.174
Neg logit contributions
 tutor-0.171
 eater-0.165
 promise-0.162
 cousin-0.158
 sister-0.155
Token and
Token ID290
Feature activation-2.24e-02

Pos logit contributions
 valleys+0.282
 sails+0.268
itched+0.260
 widened+0.250
 sail+0.249
Neg logit contributions
 virtue-0.237
 eater-0.235
nie-0.223
 tutor-0.221
olding-0.216
Token began
Token ID2540
Feature activation+1.08e-01

Pos logit contributions
 eater+0.039
 virtue+0.039
nie+0.038
 tutor+0.037
 ever+0.036
Neg logit contributions
 valleys-0.028
itched-0.027
 sails-0.027
 sail-0.024
 widened-0.024
Token to
Token ID284
Feature activation-5.09e-02

Pos logit contributions
 valleys+0.246
 widened+0.244
 sails+0.243
itched+0.242
ink+0.236
Neg logit contributions
 eater-0.065
 ever-0.062
 virtue-0.061
 tutor-0.061
 before-0.059
Token paddle
Token ID39517
Feature activation+3.35e-01

Pos logit contributions
 eater+0.074
 virtue+0.071
nie+0.071
 promise+0.070
 mistake+0.068
Neg logit contributions
itched-0.078
 valleys-0.077
 sails-0.073
ink-0.070
rolled-0.068
Token away
Token ID1497
Feature activation+7.16e-02

Pos logit contributions
 valleys+0.533
 sails+0.515
itched+0.501
 sail+0.483
 widened+0.465
Neg logit contributions
 virtue-0.509
 eater-0.495
nie-0.469
olding-0.459
 tutor-0.459
Token,
Token ID11
Feature activation+1.17e-01

Pos logit contributions
 valleys+0.127
 sails+0.125
itched+0.122
 sail+0.117
 widened+0.114
Neg logit contributions
 virtue-0.093
 eater-0.089
nie-0.083
olding-0.083
 tutor-0.082
Token eagerly
Token ID30130
Feature activation+2.79e-02

Pos logit contributions
 valleys+0.203
 sails+0.195
 sail+0.186
itched+0.183
 gl+0.179
Neg logit contributions
 virtue-0.170
 eater-0.166
 tutor-0.154
nie-0.153
olding-0.150
Token looking
Token ID2045
Feature activation+7.45e-02

Pos logit contributions
 valleys+0.043
 sails+0.042
 sail+0.041
itched+0.039
 gl+0.038
Neg logit contributions
 virtue-0.045
 eater-0.044
 tutor-0.042
nie-0.041
olding-0.040
Token for
Token ID329
Feature activation-1.77e-01

Pos logit contributions
 valleys+0.163
 sails+0.158
 sail+0.151
itched+0.150
 gl+0.147
Neg logit contributions
 virtue-0.074
 eater-0.071
nie-0.063
 tutor-0.062
olding-0.060
Token didn
Token ID1422
Feature activation-1.63e-01

Pos logit contributions
 virtue+0.011
 eater+0.010
 tutor+0.010
nie+0.010
olding+0.010
Neg logit contributions
 valleys-0.011
 sails-0.011
itched-0.011
 sail-0.010
 gl-0.010
Token't
Token ID470
Feature activation-1.21e-01

Pos logit contributions
 virtue+0.145
 ever+0.127
 tutor+0.122
 eater+0.121
olding+0.119
Neg logit contributions
 valleys-0.375
 sails-0.354
 widened-0.331
 forests-0.329
 gl-0.328
Token know
Token ID760
Feature activation+8.70e-02

Pos logit contributions
 virtue+0.187
 eater+0.181
 tutor+0.171
nie+0.169
olding+0.160
Neg logit contributions
 sail-0.183
 valleys-0.182
 sails-0.181
itched-0.170
He-0.170
Token how
Token ID703
Feature activation-1.52e-01

Pos logit contributions
 valleys+0.150
 sails+0.146
itched+0.140
 sail+0.138
 gl+0.133
Neg logit contributions
 virtue-0.123
 eater-0.119
nie-0.111
 tutor-0.110
olding-0.109
Token to
Token ID284
Feature activation-4.91e-02

Pos logit contributions
 virtue+0.203
 eater+0.200
 tutor+0.184
nie+0.184
olding+0.179
Neg logit contributions
 valleys-0.274
 sails-0.267
itched-0.255
 sail-0.254
 gl-0.243
Token paddle
Token ID39517
Feature activation+3.35e-01

Pos logit contributions
 promise+0.068
 ever+0.065
 virtue+0.064
 eater+0.061
 rule+0.061
Neg logit contributions
 valleys-0.089
itched-0.079
 widened-0.078
 sails-0.076
 forests-0.075
Token.
Token ID13
Feature activation-3.11e-02

Pos logit contributions
 valleys+0.488
 sails+0.478
 sail+0.457
itched+0.433
 gl+0.428
Neg logit contributions
 virtue-0.567
 eater-0.559
 tutor-0.537
nie-0.529
olding-0.514
Token
Token ID220
Feature activation+9.54e-02

Pos logit contributions
 virtue+0.070
 eater+0.068
nie+0.065
 tutor+0.065
olding+0.064
Neg logit contributions
 valleys-0.030
 sails-0.027
itched-0.025
rolled-0.024
 sail-0.024
Token
Token ID198
Feature activation+7.59e-02

Pos logit contributions
 valleys+0.106
 sails+0.100
itched+0.095
 sail+0.092
 gl+0.086
Neg logit contributions
 virtue-0.192
 eater-0.189
nie-0.180
 tutor-0.178
olding-0.176
Token
Token ID198
Feature activation+7.57e-02

Pos logit contributions
 valleys+0.097
 sails+0.096
 sail+0.089
itched+0.088
 gl+0.083
Neg logit contributions
 virtue-0.138
 eater-0.133
nie-0.130
olding-0.128
 tutor-0.127
TokenSuddenly
Token ID38582
Feature activation+6.54e-02

Pos logit contributions
 valleys+0.084
 sails+0.079
itched+0.075
 sail+0.072
 gl+0.068
Neg logit contributions
 virtue-0.153
 eater-0.151
nie-0.143
 tutor-0.143
olding-0.141
Token red
Token ID2266
Feature activation+1.50e-01

Pos logit contributions
 virtue+0.041
 eater+0.041
olding+0.038
 tutor+0.038
nie+0.038
Neg logit contributions
 valleys-0.027
 sails-0.026
 sail-0.024
 gl-0.023
itched-0.023
Token.
Token ID13
Feature activation-2.94e-02

Pos logit contributions
 sails+0.219
 valleys+0.218
 sail+0.204
itched+0.199
 gl+0.193
Neg logit contributions
 virtue-0.255
 eater-0.242
nie-0.236
 tutor-0.232
olding-0.231
Token It
Token ID632
Feature activation+7.10e-02

Pos logit contributions
 virtue+0.055
 eater+0.054
nie+0.052
 tutor+0.051
olding+0.051
Neg logit contributions
 valleys-0.036
 sails-0.035
itched-0.033
 sail-0.032
 gl-0.030
Token has
Token ID468
Feature activation-4.59e-02

Pos logit contributions
 valleys+0.127
 sails+0.124
 sail+0.117
itched+0.117
 gl+0.113
Neg logit contributions
 virtue-0.097
 eater-0.094
nie-0.087
 tutor-0.087
olding-0.086
Token a
Token ID257
Feature activation+7.91e-03

Pos logit contributions
 virtue+0.076
 ever+0.073
 eater+0.071
nie+0.069
 tutor+0.069
Neg logit contributions
itched-0.062
 valleys-0.060
rolling-0.058
rolled-0.056
 widened-0.056
Token paddle
Token ID39517
Feature activation+3.35e-01

Pos logit contributions
 valleys+0.010
itched+0.008
 sails+0.008
 widened+0.008
rolled+0.008
Neg logit contributions
 virtue-0.015
 eater-0.014
 tutor-0.014
 mistake-0.014
 promise-0.014
Token.
Token ID13
Feature activation-3.64e-02

Pos logit contributions
 valleys+0.533
 sails+0.524
 sail+0.495
itched+0.493
 gl+0.471
Neg logit contributions
 virtue-0.514
 eater-0.499
nie-0.472
 tutor-0.472
olding-0.459
Token Lily
Token ID20037
Feature activation-1.21e-01

Pos logit contributions
 virtue+0.066
 eater+0.065
 tutor+0.062
nie+0.061
olding+0.061
Neg logit contributions
 valleys-0.047
 sails-0.045
itched-0.042
 sail-0.041
 widened-0.039
Token and
Token ID290
Feature activation-2.29e-02

Pos logit contributions
 virtue+0.173
 eater+0.167
 tutor+0.155
nie+0.155
olding+0.154
Neg logit contributions
 valleys-0.207
 sails-0.203
itched-0.191
 sail-0.191
 gl-0.184
Token Tom
Token ID4186
Feature activation-4.86e-02

Pos logit contributions
 virtue+0.037
 eater+0.033
olding+0.032
nie+0.032
 occasion+0.031
Neg logit contributions
 valleys-0.038
 sails-0.038
 sail-0.037
 gl-0.036
itched-0.035
Token want
Token ID765
Feature activation-5.43e-02

Pos logit contributions
 virtue+0.063
 eater+0.061
nie+0.057
 tutor+0.057
olding+0.056
Neg logit contributions
 valleys-0.089
 sails-0.087
itched-0.083
 sail-0.083
 gl-0.079
Token the
Token ID262
Feature activation+9.01e-03

Pos logit contributions
 valleys+0.135
 sails+0.129
itched+0.122
 sail+0.121
 gl+0.116
Neg logit contributions
 virtue-0.153
 eater-0.149
nie-0.141
olding-0.139
 tutor-0.139
Token boat
Token ID8848
Feature activation+1.66e-01

Pos logit contributions
itched+0.012
 valleys+0.011
 widened+0.010
ink+0.010
rolled+0.010
Neg logit contributions
 virtue-0.015
 promise-0.015
nie-0.015
 eater-0.014
 tutor-0.014
Token and
Token ID290
Feature activation-2.17e-02

Pos logit contributions
 valleys+0.261
 sails+0.251
itched+0.241
 sail+0.235
 widened+0.225
Neg logit contributions
 virtue-0.256
 eater-0.250
nie-0.236
 tutor-0.234
olding-0.232
Token started
Token ID2067
Feature activation+9.10e-02

Pos logit contributions
 virtue+0.038
 eater+0.038
nie+0.037
 tutor+0.036
olding+0.035
Neg logit contributions
 valleys-0.028
 sails-0.027
itched-0.026
 sail-0.025
 widened-0.024
Token to
Token ID284
Feature activation-4.98e-02

Pos logit contributions
 valleys+0.191
 widened+0.189
 sails+0.184
itched+0.181
ink+0.180
Neg logit contributions
 ever-0.067
 before-0.066
 eater-0.063
 virtue-0.061
 tutor-0.061
Token paddle
Token ID39517
Feature activation+3.34e-01

Pos logit contributions
 promise+0.082
nie+0.079
 eater+0.078
 mistake+0.077
 virtue+0.075
Neg logit contributions
 valleys-0.064
itched-0.064
ink-0.060
br-0.058
He-0.056
Token.
Token ID13
Feature activation-3.22e-02

Pos logit contributions
 valleys+0.541
 sails+0.524
itched+0.508
 sail+0.491
 widened+0.473
Neg logit contributions
 virtue-0.497
 eater-0.482
nie-0.455
olding-0.447
 tutor-0.446
Token She
Token ID1375
Feature activation+1.21e-01

Pos logit contributions
 virtue+0.069
 eater+0.068
nie+0.066
 tutor+0.064
olding+0.064
Neg logit contributions
 valleys-0.033
 sails-0.030
itched-0.028
 sail-0.027
rolled-0.025
Token explored
Token ID18782
Feature activation+8.38e-02

Pos logit contributions
 valleys+0.155
itched+0.146
 sails+0.144
 sail+0.133
 widened+0.129
Neg logit contributions
 virtue-0.214
 eater-0.208
nie-0.207
 ever-0.198
olding-0.195
Token deeper
Token ID9211
Feature activation+2.17e-01

Pos logit contributions
itched+0.152
 sails+0.146
ink+0.143
rolling+0.142
 valleys+0.141
Neg logit contributions
 before-0.093
 virtue-0.091
 ever-0.090
 eater-0.088
nie-0.088
Token and
Token ID290
Feature activation-2.16e-02

Pos logit contributions
itched+0.389
rolling+0.376
 sails+0.375
 sail+0.373
rolled+0.368
Neg logit contributions
nie-0.235
 before-0.235
 virtue-0.233
 ever-0.225
 eater-0.224
Token started
Token ID2067
Feature activation+9.12e-02

Pos logit contributions
 sails+0.145
 valleys+0.142
 sail+0.134
 gl+0.129
 sailed+0.127
Neg logit contributions
 virtue-0.239
 eater-0.233
 tutor-0.223
olding-0.221
nie-0.213
Token to
Token ID284
Feature activation-5.05e-02

Pos logit contributions
 valleys+0.169
 sails+0.164
itched+0.159
 sail+0.155
 widened+0.153
Neg logit contributions
 virtue-0.111
 eater-0.109
nie-0.101
 tutor-0.101
olding-0.099
Token do
Token ID466
Feature activation-2.63e-02

Pos logit contributions
 virtue+0.089
 eater+0.087
nie+0.083
 tutor+0.082
olding+0.081
Neg logit contributions
 valleys-0.069
 sails-0.066
itched-0.063
 sail-0.062
 gl-0.058
Token do
Token ID466
Feature activation-2.64e-02

Pos logit contributions
 virtue+0.047
 eater+0.046
nie+0.044
olding+0.043
 tutor+0.043
Neg logit contributions
 valleys-0.037
 sails-0.035
itched-0.033
 sail-0.032
 gl-0.032
Tokenggy
Token ID19970
Feature activation-6.18e-02

Pos logit contributions
 virtue+0.046
 eater+0.046
nie+0.043
olding+0.043
 tutor+0.043
Neg logit contributions
 valleys-0.038
 sails-0.036
itched-0.034
 sail-0.033
 gl-0.033
Token paddle
Token ID39517
Feature activation+3.34e-01

Pos logit contributions
 virtue+0.123
nie+0.108
 eater+0.107
 occasion+0.103
 tutor+0.103
Neg logit contributions
 valleys-0.081
 sails-0.076
 sail-0.074
 gl-0.072
itched-0.072
Token,
Token ID11
Feature activation+1.17e-01

Pos logit contributions
 valleys+0.479
 sails+0.455
 sail+0.427
 gl+0.417
itched+0.412
Neg logit contributions
 virtue-0.582
 eater-0.563
 tutor-0.540
nie-0.532
olding-0.525
Token she
Token ID673
Feature activation+3.75e-02

Pos logit contributions
 valleys+0.165
 sails+0.162
 sail+0.155
 gl+0.147
itched+0.142
Neg logit contributions
 virtue-0.211
 eater-0.201
nie-0.189
 tutor-0.189
olding-0.187
Token loved
Token ID6151
Feature activation-6.87e-02

Pos logit contributions
 sails+0.049
 valleys+0.049
 sail+0.045
 gl+0.043
 sailed+0.043
Neg logit contributions
 virtue-0.071
 eater-0.069
 tutor-0.067
olding-0.065
nie-0.063
Token the
Token ID262
Feature activation+7.41e-03

Pos logit contributions
 virtue+0.142
olding+0.131
 eater+0.131
nie+0.129
 promise+0.125
Neg logit contributions
 valleys-0.086
 gl-0.078
 sails-0.077
 soaring-0.074
itched-0.072
Token way
Token ID835
Feature activation-3.24e-02

Pos logit contributions
 sail+0.008
 sails+0.008
 valleys+0.007
 gl+0.007
 waterfall+0.006
Neg logit contributions
 virtue-0.016
 eater-0.016
olding-0.016
 tutor-0.015
nie-0.015
Token was
Token ID373
Feature activation+2.19e-02

Pos logit contributions
 virtue+0.096
 eater+0.092
 tutor+0.091
olding+0.088
 occasion+0.085
Neg logit contributions
 valleys-0.070
 sails-0.069
itched-0.064
 sail-0.063
 swoop-0.062
Token scared
Token ID12008
Feature activation+5.30e-03

Pos logit contributions
 valleys+0.039
 sails+0.037
itched+0.036
 sail+0.035
rolled+0.034
Neg logit contributions
 virtue-0.029
 eater-0.028
 ever-0.026
nie-0.026
olding-0.026
Token and
Token ID290
Feature activation-2.41e-02

Pos logit contributions
 valleys+0.008
 sails+0.008
itched+0.007
 sail+0.007
 gl+0.007
Neg logit contributions
 virtue-0.009
 eater-0.008
nie-0.008
 tutor-0.008
olding-0.008
Token tried
Token ID3088
Feature activation+3.84e-02

Pos logit contributions
 virtue+0.045
 eater+0.043
 tutor+0.041
nie+0.041
olding+0.040
Neg logit contributions
 valleys-0.031
 sails-0.031
 sail-0.030
itched-0.029
 gl-0.028
Token to
Token ID284
Feature activation-5.21e-02

Pos logit contributions
 valleys+0.076
 sails+0.073
itched+0.071
 sail+0.071
 gl+0.068
Neg logit contributions
 virtue-0.044
 eater-0.043
nie-0.040
 tutor-0.039
olding-0.038
Token paddle
Token ID39517
Feature activation+3.33e-01

Pos logit contributions
 virtue+0.084
 eater+0.082
nie+0.078
 tutor+0.077
olding+0.076
Neg logit contributions
 valleys-0.080
 sails-0.076
itched-0.073
 sail-0.071
 gl-0.068
Token away
Token ID1497
Feature activation+6.73e-02

Pos logit contributions
 valleys+0.452
 sails+0.432
 sail+0.409
itched+0.398
He+0.384
Neg logit contributions
 virtue-0.597
 eater-0.586
nie-0.568
 tutor-0.566
olding-0.538
Token.
Token ID13
Feature activation-3.23e-02

Pos logit contributions
 valleys+0.104
 sails+0.100
itched+0.096
 sail+0.094
 gl+0.090
Neg logit contributions
 virtue-0.107
 eater-0.105
nie-0.099
 tutor-0.098
olding-0.097
Token But
Token ID887
Feature activation-6.03e-02

Pos logit contributions
 virtue+0.062
 eater+0.061
nie+0.058
 tutor+0.058
olding+0.057
Neg logit contributions
 valleys-0.040
 sails-0.038
itched-0.036
 sail-0.035
 gl-0.033
Token the
Token ID262
Feature activation+2.22e-03

Pos logit contributions
 virtue+0.098
 eater+0.096
nie+0.091
 tutor+0.090
olding+0.089
Neg logit contributions
 valleys-0.091
 sails-0.087
itched-0.083
 sail-0.081
 gl-0.078
Token bear
Token ID6842
Feature activation-1.40e-02

Pos logit contributions
 valleys+0.003
itched+0.003
 sails+0.003
rolled+0.003
rolling+0.003
Neg logit contributions
 eater-0.003
 virtue-0.003
 tutor-0.003
nie-0.003
 promise-0.003
Token boat
Token ID8848
Feature activation+1.67e-01

Pos logit contributions
itched+0.013
 widened+0.012
 valleys+0.011
ink+0.011
rolled+0.011
Neg logit contributions
 promise-0.012
 kid-0.011
 rule-0.011
 day-0.011
 virtue-0.011
Token.
Token ID13
Feature activation-3.10e-02

Pos logit contributions
 valleys+0.281
 sails+0.274
itched+0.261
 sail+0.260
 gl+0.248
Neg logit contributions
 virtue-0.242
 eater-0.234
nie-0.219
 tutor-0.218
olding-0.214
Token He
Token ID679
Feature activation+1.80e-01

Pos logit contributions
 virtue+0.069
 eater+0.067
nie+0.065
 tutor+0.065
olding+0.064
Neg logit contributions
 valleys-0.028
 sails-0.026
itched-0.025
 sail-0.024
 gl-0.022
Token took
Token ID1718
Feature activation+2.72e-02

Pos logit contributions
 valleys+0.225
 sails+0.221
 sail+0.204
itched+0.203
 gl+0.193
Neg logit contributions
 virtue-0.338
 eater-0.332
 tutor-0.315
nie-0.312
olding-0.311
Token a
Token ID257
Feature activation+1.28e-02

Pos logit contributions
 valleys+0.036
 sails+0.034
itched+0.033
 sail+0.032
 widened+0.031
Neg logit contributions
 virtue-0.047
 eater-0.047
nie-0.046
 ever-0.044
 tutor-0.044
Token paddle
Token ID39517
Feature activation+3.33e-01

Pos logit contributions
 valleys+0.018
itched+0.016
 sails+0.016
ink+0.015
 widened+0.015
Neg logit contributions
 virtue-0.022
nie-0.021
 promise-0.021
 tutor-0.021
 eater-0.020
Token and
Token ID290
Feature activation-2.35e-02

Pos logit contributions
 valleys+0.569
 sails+0.533
itched+0.530
 widened+0.504
 sail+0.496
Neg logit contributions
 virtue-0.471
 eater-0.455
olding-0.428
nie-0.425
 tutor-0.413
Token started
Token ID2067
Feature activation+8.83e-02

Pos logit contributions
 before+0.040
 ever+0.039
nie+0.039
 virtue+0.039
 eater+0.038
Neg logit contributions
itched-0.028
 valleys-0.028
ink-0.026
 sails-0.025
rolling-0.024
Token to
Token ID284
Feature activation-5.15e-02

Pos logit contributions
 valleys+0.147
 widened+0.146
itched+0.138
ink+0.138
 sails+0.136
Neg logit contributions
 before-0.110
 ever-0.106
 virtue-0.102
olding-0.097
nie-0.097
Token paddle
Token ID39517
Feature activation+3.33e-01

Pos logit contributions
 promise+0.101
 virtue+0.098
nie+0.097
 eater+0.095
 mistake+0.094
Neg logit contributions
 valleys-0.051
itched-0.050
ink-0.045
 widened-0.043
 sails-0.042
Token.
Token ID13
Feature activation-3.14e-02

Pos logit contributions
 valleys+0.527
 sails+0.508
itched+0.485
 sail+0.478
 gl+0.457
Neg logit contributions
 virtue-0.516
 eater-0.503
nie-0.474
 tutor-0.471
olding-0.465
Token their
Token ID511
Feature activation+1.23e-01

Pos logit contributions
 eater+0.140
 virtue+0.139
nie+0.134
olding+0.130
 tutor+0.126
Neg logit contributions
 valleys-0.082
 sails-0.079
 sail-0.074
itched-0.074
 widened-0.069
Token own
Token ID898
Feature activation-6.38e-02

Pos logit contributions
 valleys+0.163
 sails+0.162
 sail+0.149
itched+0.140
 gl+0.138
Neg logit contributions
 virtue-0.233
 eater-0.222
nie-0.214
olding-0.211
 ever-0.203
Token kay
Token ID34681
Feature activation+2.95e-01

Pos logit contributions
 virtue+0.108
 eater+0.106
nie+0.099
olding+0.097
 tutor+0.094
Neg logit contributions
 valleys-0.095
 sails-0.094
 sail-0.088
itched-0.085
 gl-0.082
Tokenak
Token ID461
Feature activation+2.02e-01

Pos logit contributions
 valleys+0.262
 sails+0.245
itched+0.228
 sail+0.223
 widened+0.212
Neg logit contributions
 virtue-0.661
 eater-0.641
nie-0.627
olding-0.622
 tutor-0.600
Token and
Token ID290
Feature activation-2.53e-02

Pos logit contributions
 valleys+0.307
 sails+0.303
itched+0.285
 sail+0.284
 gl+0.273
Neg logit contributions
 virtue-0.322
 eater-0.317
 tutor-0.297
nie-0.291
olding-0.288
Token paddle
Token ID39517
Feature activation+3.33e-01

Pos logit contributions
 virtue+0.049
 eater+0.047
nie+0.045
olding+0.045
 tutor+0.044
Neg logit contributions
 valleys-0.032
 sails-0.032
 sail-0.031
itched-0.028
 gl-0.028
Token as
Token ID355
Feature activation-6.96e-02

Pos logit contributions
 valleys+0.480
 sails+0.468
 sail+0.432
itched+0.415
 gl+0.408
Neg logit contributions
 eater-0.581
 virtue-0.574
 tutor-0.555
nie-0.538
 student-0.525
Token fast
Token ID3049
Feature activation+1.21e-02

Pos logit contributions
 eater+0.145
 virtue+0.143
nie+0.135
 tutor+0.135
olding+0.132
Neg logit contributions
 valleys-0.080
 sails-0.076
 sail-0.076
 sailed-0.066
itched-0.066
Token as
Token ID355
Feature activation-6.83e-02

Pos logit contributions
 valleys+0.022
 sails+0.021
itched+0.021
 sail+0.020
 gl+0.020
Neg logit contributions
 virtue-0.015
 eater-0.015
 ever-0.014
 tutor-0.013
olding-0.013
Token they
Token ID484
Feature activation-6.67e-03

Pos logit contributions
 virtue+0.123
 eater+0.122
nie+0.114
 tutor+0.113
olding+0.112
Neg logit contributions
 valleys-0.096
 sails-0.091
 sail-0.088
itched-0.084
 gl-0.082
Token can
Token ID460
Feature activation-5.68e-02

Pos logit contributions
 virtue+0.010
 eater+0.009
nie+0.009
 tutor+0.009
olding+0.009
Neg logit contributions
 valleys-0.011
 sails-0.011
 sail-0.010
itched-0.010
 gl-0.010
Token a
Token ID257
Feature activation+1.28e-02

Pos logit contributions
 valleys+0.036
 sails+0.034
itched+0.033
 sail+0.032
 widened+0.031
Neg logit contributions
 virtue-0.047
 eater-0.047
nie-0.046
 ever-0.044
 tutor-0.044
Token paddle
Token ID39517
Feature activation+3.33e-01

Pos logit contributions
 valleys+0.018
itched+0.016
 sails+0.016
ink+0.015
 widened+0.015
Neg logit contributions
 virtue-0.022
nie-0.021
 promise-0.021
 tutor-0.021
 eater-0.020
Token and
Token ID290
Feature activation-2.35e-02

Pos logit contributions
 valleys+0.569
 sails+0.533
itched+0.530
 widened+0.504
 sail+0.496
Neg logit contributions
 virtue-0.471
 eater-0.455
olding-0.428
nie-0.425
 tutor-0.413
Token started
Token ID2067
Feature activation+8.83e-02

Pos logit contributions
 before+0.040
 ever+0.039
nie+0.039
 virtue+0.039
 eater+0.038
Neg logit contributions
itched-0.028
 valleys-0.028
ink-0.026
 sails-0.025
rolling-0.024
Token to
Token ID284
Feature activation-5.15e-02

Pos logit contributions
 valleys+0.147
 widened+0.146
itched+0.138
ink+0.138
 sails+0.136
Neg logit contributions
 before-0.110
 ever-0.106
 virtue-0.102
olding-0.097
nie-0.097
Token paddle
Token ID39517
Feature activation+3.33e-01

Pos logit contributions
 promise+0.101
 virtue+0.098
nie+0.097
 eater+0.095
 mistake+0.094
Neg logit contributions
 valleys-0.051
itched-0.050
ink-0.045
 widened-0.043
 sails-0.042
Token.
Token ID13
Feature activation-3.14e-02

Pos logit contributions
 valleys+0.527
 sails+0.508
itched+0.485
 sail+0.478
 gl+0.457
Neg logit contributions
 virtue-0.516
 eater-0.503
nie-0.474
 tutor-0.471
olding-0.465
Token Suddenly
Token ID24975
Feature activation+1.12e-01

Pos logit contributions
 virtue+0.057
 eater+0.055
nie+0.054
 tutor+0.054
olding+0.053
Neg logit contributions
 valleys-0.039
 sails-0.038
itched-0.036
 sail-0.035
 widened-0.034
Token,
Token ID11
Feature activation+1.17e-01

Pos logit contributions
 valleys+0.157
 sails+0.154
itched+0.152
rolling+0.145
 sail+0.143
Neg logit contributions
 ever-0.166
 before-0.164
 tutor-0.164
 virtue-0.164
nie-0.162
Token he
Token ID339
Feature activation+9.52e-02

Pos logit contributions
 valleys+0.137
 sails+0.128
 sail+0.119
itched+0.115
 gl+0.112
Neg logit contributions
 virtue-0.238
 eater-0.233
olding-0.221
nie-0.220
 tutor-0.215
Token heard
Token ID2982
Feature activation+4.00e-02

Pos logit contributions
 valleys+0.136
itched+0.127
 sails+0.125
 sail+0.118
rolled+0.115
Neg logit contributions
 virtue-0.152
 eater-0.148
nie-0.144
 ever-0.140
 tutor-0.137
Token to
Token ID284
Feature activation-5.17e-02

Pos logit contributions
 virtue+0.190
 eater+0.187
nie+0.169
 tutor+0.168
olding+0.163
Neg logit contributions
 valleys-0.298
 sails-0.291
 sail-0.278
itched-0.275
 gl-0.266
Token hold
Token ID1745
Feature activation+1.71e-01

Pos logit contributions
 promise+0.078
 virtue+0.076
 rule+0.072
 behave+0.071
 eater+0.071
Neg logit contributions
 valleys-0.082
itched-0.074
 widened-0.071
 sails-0.070
ann-0.067
Token the
Token ID262
Feature activation+5.93e-03

Pos logit contributions
 valleys+0.246
 sails+0.226
itched+0.224
 widened+0.217
rolling+0.212
Neg logit contributions
 virtue-0.280
 eater-0.271
nie-0.262
olding-0.261
 ever-0.255
Token stick
Token ID4859
Feature activation+1.56e-01

Pos logit contributions
 valleys+0.006
 sails+0.005
itched+0.005
 sail+0.005
 widened+0.005
Neg logit contributions
 virtue-0.013
 eater-0.012
nie-0.012
 tutor-0.012
olding-0.012
Token and
Token ID290
Feature activation-2.39e-02

Pos logit contributions
 valleys+0.223
 sails+0.222
 sail+0.212
itched+0.206
 gl+0.198
Neg logit contributions
 virtue-0.261
 eater-0.256
 tutor-0.240
nie-0.240
olding-0.236
Token paddle
Token ID39517
Feature activation+3.33e-01

Pos logit contributions
 virtue+0.038
 eater+0.037
 tutor+0.035
nie+0.035
olding+0.034
Neg logit contributions
 valleys-0.036
itched-0.034
 sails-0.033
ink-0.031
 widened-0.031
Token together
Token ID1978
Feature activation-2.90e-02

Pos logit contributions
 valleys+0.530
 sails+0.511
itched+0.490
 sail+0.480
 gl+0.460
Neg logit contributions
 virtue-0.509
 eater-0.495
nie-0.466
 tutor-0.461
olding-0.458
Token.
Token ID13
Feature activation-2.92e-02

Pos logit contributions
 virtue+0.045
 eater+0.044
nie+0.042
 tutor+0.041
olding+0.040
Neg logit contributions
 valleys-0.046
 sails-0.045
 sail-0.043
itched-0.043
 gl-0.041
Token "
Token ID366
Feature activation-6.32e-02

Pos logit contributions
 virtue+0.044
 eater+0.043
 tutor+0.043
olding+0.042
 cousin+0.041
Neg logit contributions
 valleys-0.044
 sails-0.043
itched-0.041
 soared-0.041
 swoop-0.040
TokenAre
Token ID8491
Feature activation+4.62e-02

Pos logit contributions
 virtue+0.094
 eater+0.091
nie+0.085
 ever+0.084
 student+0.083
Neg logit contributions
itched-0.103
 sails-0.098
 valleys-0.097
rolled-0.091
 gl-0.091
Token you
Token ID345
Feature activation-2.36e-01

Pos logit contributions
 valleys+0.108
 sails+0.106
 sail+0.103
 gl+0.100
itched+0.099
Neg logit contributions
 virtue-0.043
 eater-0.040
 tutor-0.033
olding-0.033
 student-0.032
Token a
Token ID257
Feature activation+1.27e-02

Pos logit contributions
 virtue+0.105
 eater+0.102
nie+0.096
 tutor+0.095
olding+0.094
Neg logit contributions
 valleys-0.107
 sails-0.104
itched-0.099
 sail-0.097
 gl-0.093
Token real
Token ID1103
Feature activation+5.00e-02

Pos logit contributions
 valleys+0.015
 sails+0.014
itched+0.014
 sail+0.013
 widened+0.013
Neg logit contributions
 virtue-0.024
 eater-0.024
 tutor-0.023
nie-0.022
olding-0.022
Token boat
Token ID8848
Feature activation+1.66e-01

Pos logit contributions
 valleys+0.058
 sails+0.057
 sail+0.053
itched+0.049
 gl+0.048
Neg logit contributions
 virtue-0.100
 eater-0.097
nie-0.096
olding-0.094
 tutor-0.091
Token with
Token ID351
Feature activation-8.30e-02

Pos logit contributions
 sails+0.236
 valleys+0.233
 sail+0.226
 gl+0.215
itched+0.214
Neg logit contributions
 virtue-0.291
 eater-0.277
nie-0.268
 tutor-0.265
olding-0.254
Token a
Token ID257
Feature activation+1.21e-02

Pos logit contributions
 virtue+0.134
 eater+0.131
 ever+0.129
 tutor+0.124
nie+0.123
Neg logit contributions
 valleys-0.113
itched-0.107
rolling-0.105
 sails-0.100
ink-0.098
Token paddle
Token ID39517
Feature activation+3.33e-01

Pos logit contributions
 valleys+0.015
 sails+0.014
itched+0.013
rolled+0.012
 widened+0.012
Neg logit contributions
 virtue-0.023
 eater-0.022
 tutor-0.022
nie-0.021
 mistake-0.021
Token.
Token ID13
Feature activation-3.40e-02

Pos logit contributions
 sails+0.530
 valleys+0.529
 sail+0.501
itched+0.491
 gl+0.473
Neg logit contributions
 virtue-0.511
 eater-0.496
nie-0.474
 tutor-0.473
olding-0.452
Token He
Token ID679
Feature activation+1.77e-01

Pos logit contributions
 virtue+0.083
 eater+0.081
nie+0.079
 tutor+0.078
olding+0.077
Neg logit contributions
 valleys-0.024
 sails-0.023
itched-0.020
 sail-0.019
 gl-0.017
Token quickly
Token ID2952
Feature activation-1.65e-02

Pos logit contributions
 valleys+0.254
 sails+0.250
itched+0.233
 sail+0.233
 gl+0.223
Neg logit contributions
 virtue-0.300
 eater-0.294
 tutor-0.277
nie-0.275
olding-0.272
Token row
Token ID5752
Feature activation+2.01e-01

Pos logit contributions
olding+0.023
 ever+0.023
 virtue+0.023
 eater+0.023
 before+0.022
Neg logit contributions
 valleys-0.025
itched-0.023
rolling-0.022
ink-0.022
 sails-0.021
Tokened
Token ID276
Feature activation+7.31e-02

Pos logit contributions
 valleys+0.261
 sails+0.249
itched+0.237
 sail+0.231
 gl+0.219
Neg logit contributions
 virtue-0.363
 eater-0.356
nie-0.339
 tutor-0.336
olding-0.335
Tokenak
Token ID461
Feature activation+2.02e-01

Pos logit contributions
 valleys+0.285
 sails+0.262
 widened+0.243
itched+0.239
 sail+0.238
Neg logit contributions
 virtue-0.641
 eater-0.626
nie-0.613
olding-0.610
 promise-0.578
Token.
Token ID13
Feature activation-3.85e-02

Pos logit contributions
 sails+0.298
 valleys+0.291
 sail+0.280
itched+0.274
 gl+0.269
Neg logit contributions
 virtue-0.338
 eater-0.316
 tutor-0.301
nie-0.298
olding-0.298
Token They
Token ID1119
Feature activation+7.20e-02

Pos logit contributions
 eater+0.061
 virtue+0.058
nie+0.056
olding+0.055
 tutor+0.055
Neg logit contributions
 valleys-0.058
 sails-0.055
itched-0.055
 soared-0.052
 gl-0.052
Token liked
Token ID8288
Feature activation-6.40e-02

Pos logit contributions
 valleys+0.114
 sails+0.107
itched+0.107
 sail+0.099
 widened+0.096
Neg logit contributions
 eater-0.108
 virtue-0.107
nie-0.105
 ever-0.099
 tutor-0.099
Token to
Token ID284
Feature activation-5.28e-02

Pos logit contributions
 virtue+0.048
 eater+0.047
 ever+0.044
 tutor+0.043
nie+0.042
Neg logit contributions
 valleys-0.144
itched-0.142
 widened-0.137
 sails-0.136
ink-0.135
Token paddle
Token ID39517
Feature activation+3.32e-01

Pos logit contributions
 promise+0.076
 eater+0.074
 ever+0.070
 rule+0.070
 virtue+0.070
Neg logit contributions
 valleys-0.085
itched-0.080
 widened-0.075
ann-0.072
 sails-0.071
Token and
Token ID290
Feature activation-2.46e-02

Pos logit contributions
 valleys+0.545
 sails+0.522
itched+0.516
 sail+0.492
 widened+0.478
Neg logit contributions
 virtue-0.485
 eater-0.471
nie-0.445
olding-0.436
 tutor-0.430
Token see
Token ID766
Feature activation+1.56e-01

Pos logit contributions
 virtue+0.043
 eater+0.042
nie+0.040
 tutor+0.040
olding+0.039
Neg logit contributions
 valleys-0.034
 sails-0.033
itched-0.031
 sail-0.030
 gl-0.029
Token the
Token ID262
Feature activation+5.71e-03

Pos logit contributions
itched+0.322
ink+0.312
rolling+0.301
 widened+0.300
 valleys+0.294
Neg logit contributions
 before-0.143
 mistake-0.137
 promise-0.132
 eater-0.132
 virtue-0.129
Token fish
Token ID5916
Feature activation+1.20e-01

Pos logit contributions
 valleys+0.006
 sails+0.006
itched+0.005
 sail+0.005
 widened+0.005
Neg logit contributions
 virtue-0.012
 eater-0.012
nie-0.011
 tutor-0.011
olding-0.011
Token and
Token ID290
Feature activation-2.50e-02

Pos logit contributions
 valleys+0.188
 sails+0.176
itched+0.175
 widened+0.167
rolled+0.163
Neg logit contributions
 eater-0.180
 virtue-0.177
 ever-0.166
nie-0.165
olding-0.164
Token they
Token ID484
Feature activation-8.78e-03

Pos logit contributions
 virtue+0.131
 eater+0.127
nie+0.120
 tutor+0.119
olding+0.118
Neg logit contributions
 valleys-0.144
 sails-0.139
 sail-0.132
itched-0.131
 gl-0.127
Token can
Token ID460
Feature activation-5.75e-02

Pos logit contributions
 virtue+0.013
 eater+0.013
 tutor+0.012
nie+0.012
olding+0.012
Neg logit contributions
 valleys-0.015
 sails-0.014
 sail-0.014
itched-0.013
 gl-0.013
Token sit
Token ID1650
Feature activation+8.37e-02

Pos logit contributions
 ever+0.084
 promise+0.077
 rule+0.075
 behave+0.070
 you+0.070
Neg logit contributions
 valleys-0.103
ann-0.089
 forests-0.088
 widened-0.087
itched-0.084
Token on
Token ID319
Feature activation-4.05e-02

Pos logit contributions
 valleys+0.141
 sails+0.137
itched+0.132
 widened+0.128
 sail+0.126
Neg logit contributions
 virtue-0.113
 eater-0.112
nie-0.106
 ever-0.103
olding-0.100
Token and
Token ID290
Feature activation-2.49e-02

Pos logit contributions
 eater+0.077
 virtue+0.076
olding+0.076
nie+0.074
 tutor+0.071
Neg logit contributions
 valleys-0.053
 sails-0.051
 sail-0.048
itched-0.047
 gl-0.044
Token paddle
Token ID39517
Feature activation+3.32e-01

Pos logit contributions
 virtue+0.043
 eater+0.041
nie+0.039
 tutor+0.039
olding+0.039
Neg logit contributions
 valleys-0.036
 sails-0.035
 sail-0.033
itched-0.032
 gl-0.031
Token.
Token ID13
Feature activation-3.41e-02

Pos logit contributions
 valleys+0.484
 sails+0.464
 sail+0.428
 sailed+0.401
 gl+0.398
Neg logit contributions
 virtue-0.597
 eater-0.593
 tutor-0.591
nie-0.561
olding-0.560
Token They
Token ID1119
Feature activation+7.09e-02

Pos logit contributions
 virtue+0.062
 eater+0.060
nie+0.057
 tutor+0.057
olding+0.057
Neg logit contributions
 valleys-0.045
 sails-0.043
itched-0.041
 sail-0.040
 gl-0.038
Token pretend
Token ID16614
Feature activation+7.40e-02

Pos logit contributions
 valleys+0.122
 sails+0.119
itched+0.113
 sail+0.112
 gl+0.108
Neg logit contributions
 virtue-0.100
 eater-0.097
nie-0.090
 tutor-0.090
olding-0.089
Token they
Token ID484
Feature activation-8.65e-03

Pos logit contributions
 sails+0.100
 valleys+0.095
 sail+0.095
 gl+0.094
 sailed+0.086
Neg logit contributions
 virtue-0.149
 eater-0.145
nie-0.142
olding-0.139
aughter-0.134
Token are
Token ID389
Feature activation+4.32e-02

Pos logit contributions
 ever+0.010
 virtue+0.009
 eater+0.009
 promise+0.009
nie+0.009
Neg logit contributions
itched-0.016
 valleys-0.015
 sails-0.015
His-0.015
rolling-0.015
Tokenar
Token ID283
Feature activation+6.26e-02

Pos logit contributions
 valleys+0.176
 sails+0.169
itched+0.160
 sail+0.158
 gl+0.151
Neg logit contributions
 virtue-0.186
 eater-0.181
nie-0.171
 tutor-0.169
olding-0.169
Token,
Token ID11
Feature activation+1.22e-01

Pos logit contributions
 valleys+0.089
 sails+0.089
itched+0.082
 sail+0.080
 gl+0.079
Neg logit contributions
 virtue-0.112
 eater-0.107
 tutor-0.102
nie-0.098
olding-0.098
Token and
Token ID290
Feature activation-2.13e-02

Pos logit contributions
 valleys+0.180
 sails+0.176
 sail+0.166
itched+0.165
 gl+0.160
Neg logit contributions
 virtue-0.204
 eater-0.199
nie-0.186
 tutor-0.185
olding-0.181
Token started
Token ID2067
Feature activation+9.19e-02

Pos logit contributions
 virtue+0.041
 eater+0.040
 tutor+0.038
olding+0.038
nie+0.038
Neg logit contributions
 valleys-0.026
 sails-0.026
 sail-0.024
 gl-0.023
itched-0.023
Token to
Token ID284
Feature activation-4.96e-02

Pos logit contributions
 valleys+0.173
 sails+0.167
itched+0.160
 sail+0.158
 widened+0.153
Neg logit contributions
 virtue-0.113
 eater-0.109
nie-0.101
 tutor-0.101
olding-0.100
Token paddle
Token ID39517
Feature activation+3.32e-01

Pos logit contributions
 virtue+0.074
 eater+0.072
nie+0.068
 tutor+0.067
olding+0.067
Neg logit contributions
 valleys-0.081
 sails-0.078
itched-0.075
 sail-0.074
 gl-0.071
Token.
Token ID13
Feature activation-2.90e-02

Pos logit contributions
 valleys+0.463
 sails+0.440
 sail+0.409
itched+0.398
 gl+0.391
Neg logit contributions
 virtue-0.591
 eater-0.584
 tutor-0.559
nie-0.551
olding-0.534
Token
Token ID220
Feature activation+9.62e-02

Pos logit contributions
 virtue+0.060
 eater+0.059
nie+0.056
 tutor+0.056
olding+0.055
Neg logit contributions
 valleys-0.031
 sails-0.030
itched-0.028
 sail-0.027
rolled-0.026
Token
Token ID198
Feature activation+7.57e-02

Pos logit contributions
 valleys+0.124
 sails+0.120
 sail+0.110
itched+0.109
 gl+0.106
Neg logit contributions
 virtue-0.176
 eater-0.169
olding-0.165
nie-0.163
 tutor-0.161
Token
Token ID198
Feature activation+7.57e-02

Pos logit contributions
 valleys+0.102
 sails+0.100
itched+0.091
 sail+0.090
 gl+0.088
Neg logit contributions
 virtue-0.137
 eater-0.130
olding-0.127
nie-0.126
 tutor-0.123
TokenThe
Token ID464
Feature activation+2.17e-02

Pos logit contributions
 valleys+0.084
 sails+0.081
itched+0.075
 sail+0.073
 gl+0.069
Neg logit contributions
 virtue-0.154
 eater-0.150
nie-0.144
olding-0.143
 tutor-0.142
Token wanted
Token ID2227
Feature activation-5.23e-02

Pos logit contributions
 valleys+0.269
 sails+0.260
itched+0.247
 sail+0.243
 gl+0.232
Neg logit contributions
 virtue-0.295
 eater-0.288
nie-0.272
 tutor-0.270
olding-0.267
Token to
Token ID284
Feature activation-5.17e-02

Pos logit contributions
 virtue+0.063
 eater+0.061
nie+0.056
olding+0.055
 tutor+0.054
Neg logit contributions
 valleys-0.102
 sails-0.101
 sail-0.096
itched-0.095
 gl-0.092
Token go
Token ID467
Feature activation+9.37e-02

Pos logit contributions
 virtue+0.071
 eater+0.069
nie+0.064
 tutor+0.063
olding+0.063
Neg logit contributions
 valleys-0.092
 sails-0.087
itched-0.084
 widened-0.080
 sail-0.079
Token for
Token ID329
Feature activation-1.78e-01

Pos logit contributions
 valleys+0.168
itched+0.163
 widened+0.161
 sails+0.152
 prope+0.143
Neg logit contributions
 before-0.123
 ever-0.121
 promise-0.114
olding-0.114
 virtue-0.110
Token a
Token ID257
Feature activation+1.32e-02

Pos logit contributions
 virtue+0.264
 ever+0.256
 tutor+0.241
 before+0.239
 eater+0.238
Neg logit contributions
 valleys-0.273
itched-0.260
 sails-0.257
 widened-0.248
rolling-0.241
Token paddle
Token ID39517
Feature activation+3.32e-01

Pos logit contributions
 valleys+0.020
 sails+0.019
itched+0.019
 widened+0.017
 sail+0.017
Neg logit contributions
 virtue-0.021
 eater-0.021
nie-0.020
 tutor-0.020
olding-0.019
Token.
Token ID13
Feature activation-3.07e-02

Pos logit contributions
 sails+0.442
 sail+0.431
 valleys+0.426
 gl+0.399
bar+0.386
Neg logit contributions
 virtue-0.582
 eater-0.566
 tutor-0.558
nie-0.549
icia-0.531
Token But
Token ID887
Feature activation-6.16e-02

Pos logit contributions
 virtue+0.070
 eater+0.068
 anniversary+0.063
 ever+0.062
 occasion+0.062
Neg logit contributions
 valleys-0.027
 sails-0.026
His-0.026
He-0.026
rolled-0.024
Token Tom
Token ID4186
Feature activation-4.06e-02

Pos logit contributions
 virtue+0.132
 eater+0.123
olding+0.119
 occasion+0.116
 anniversary+0.114
Neg logit contributions
 sail-0.063
 sails-0.063
His-0.063
 valleys-0.062
 sailed-0.058
Token was
Token ID373
Feature activation+2.08e-02

Pos logit contributions
 virtue+0.072
 eater+0.069
 occasion+0.065
 anniversary+0.065
 tutor+0.064
Neg logit contributions
 sails-0.056
itched-0.053
 gl-0.052
 bent-0.052
 swoop-0.051
Token too
Token ID1165
Feature activation-1.92e-01

Pos logit contributions
 valleys+0.033
 sails+0.032
 sail+0.031
itched+0.030
 gl+0.030
Neg logit contributions
 virtue-0.032
 eater-0.031
nie-0.028
 tutor-0.028
olding-0.028
TokenI
Token ID40
Feature activation-1.42e-01

Pos logit contributions
 virtue+0.024
 eater+0.023
 tutor+0.022
nie+0.022
olding+0.021
Neg logit contributions
 valleys-0.032
 sails-0.031
itched-0.030
 sail-0.029
 gl-0.028
Token'm
Token ID1101
Feature activation+3.81e-02

Pos logit contributions
 virtue+0.202
 eater+0.196
nie+0.184
 tutor+0.184
olding+0.179
Neg logit contributions
 valleys-0.241
 sails-0.237
 sail-0.230
itched-0.228
 gl-0.219
Token going
Token ID1016
Feature activation-1.04e-02

Pos logit contributions
 valleys+0.062
 sails+0.058
itched+0.055
 widened+0.055
 sail+0.052
Neg logit contributions
 virtue-0.056
 eater-0.055
nie-0.054
 tutor-0.053
 ever-0.053
Token for
Token ID329
Feature activation-1.78e-01

Pos logit contributions
 virtue+0.013
 eater+0.012
nie+0.011
 tutor+0.011
olding+0.011
Neg logit contributions
 valleys-0.020
 sails-0.020
 sail-0.019
itched-0.019
 gl-0.018
Token a
Token ID257
Feature activation+1.23e-02

Pos logit contributions
 virtue+0.300
 eater+0.291
 tutor+0.281
nie+0.280
 ever+0.275
Neg logit contributions
 valleys-0.251
 sails-0.238
itched-0.229
 sail-0.218
 widened-0.216
Token paddle
Token ID39517
Feature activation+3.32e-01

Pos logit contributions
 valleys+0.016
 sails+0.015
itched+0.015
 sail+0.014
 widened+0.013
Neg logit contributions
 virtue-0.022
 eater-0.022
nie-0.021
 tutor-0.021
olding-0.021
Token in
Token ID287
Feature activation-6.67e-02

Pos logit contributions
 sails+0.506
 sail+0.491
 valleys+0.475
bar+0.450
 gl+0.442
Neg logit contributions
 virtue-0.534
 eater-0.513
 tutor-0.499
icia-0.480
olding-0.474
Token the
Token ID262
Feature activation+7.78e-03

Pos logit contributions
 virtue+0.108
 eater+0.105
nie+0.100
olding+0.099
 tutor+0.098
Neg logit contributions
 valleys-0.101
 sails-0.097
itched-0.092
 sail-0.092
 gl-0.087
Token lake
Token ID13546
Feature activation+1.33e-01

Pos logit contributions
itched+0.011
 widened+0.009
ink+0.008
 valleys+0.008
rolled+0.008
Neg logit contributions
 virtue-0.014
 promise-0.014
 kid-0.013
 mistake-0.013
 rule-0.013
Token,"
Token ID553
Feature activation+6.47e-02

Pos logit contributions
 sails+0.184
 valleys+0.181
 sail+0.179
 gl+0.166
itched+0.157
Neg logit contributions
 virtue-0.234
 eater-0.218
 tutor-0.217
olding-0.212
 student-0.207
Token replied
Token ID8712
Feature activation+1.09e-01

Pos logit contributions
 valleys+0.090
 sails+0.086
itched+0.082
 sail+0.080
 gl+0.076
Neg logit contributions
 virtue-0.112
 eater-0.109
nie-0.104
 tutor-0.103
olding-0.102
Token Rob
Token ID3851
Feature activation+6.86e-02

Pos logit contributions
 valleys+0.075
itched+0.071
 sails+0.071
 sail+0.068
rolling+0.067
Neg logit contributions
 virtue-0.085
 eater-0.084
nie-0.084
 tutor-0.082
 ever-0.079
Token how
Token ID703
Feature activation-1.52e-01

Pos logit contributions
 sails+0.111
 valleys+0.109
 sail+0.103
itched+0.103
 gl+0.098
Neg logit contributions
 virtue-0.106
 eater-0.102
 tutor-0.094
olding-0.094
nie-0.093
Token to
Token ID284
Feature activation-5.27e-02

Pos logit contributions
 eater+0.211
 virtue+0.209
nie+0.187
 tutor+0.185
 student+0.180
Neg logit contributions
 valleys-0.270
 sails-0.267
 sail-0.255
itched-0.252
He-0.247
Token use
Token ID779
Feature activation+8.14e-02

Pos logit contributions
 virtue+0.073
 eater+0.072
nie+0.069
 promise+0.069
 mistake+0.067
Neg logit contributions
 valleys-0.090
 sails-0.081
itched-0.080
 widened-0.075
rolled-0.075
Token a
Token ID257
Feature activation+1.08e-02

Pos logit contributions
 valleys+0.108
rolling+0.100
ink+0.100
 widened+0.099
itched+0.098
Neg logit contributions
 virtue-0.131
 tutor-0.130
 before-0.126
 ever-0.125
 eater-0.124
Token paddle
Token ID39517
Feature activation+3.32e-01

Pos logit contributions
 valleys+0.016
itched+0.014
 sails+0.014
 widened+0.013
ink+0.013
Neg logit contributions
 virtue-0.018
 tutor-0.018
 promise-0.017
nie-0.017
 mistake-0.017
Token to
Token ID284
Feature activation-5.26e-02

Pos logit contributions
 valleys+0.545
 sails+0.526
itched+0.505
 sail+0.495
 gl+0.475
Neg logit contributions
 virtue-0.493
 eater-0.479
nie-0.450
 tutor-0.446
olding-0.442
Token move
Token ID1445
Feature activation+1.91e-01

Pos logit contributions
 virtue+0.082
 promise+0.079
 mistake+0.078
 eater+0.078
nie+0.076
Neg logit contributions
 valleys-0.076
itched-0.070
ink-0.068
 widened-0.067
 sails-0.067
Token the
Token ID262
Feature activation+5.18e-03

Pos logit contributions
 valleys+0.253
 sails+0.243
 sail+0.224
itched+0.223
 gl+0.212
Neg logit contributions
 virtue-0.354
 eater-0.347
nie-0.329
 tutor-0.327
olding-0.324
Token boat
Token ID8848
Feature activation+1.66e-01

Pos logit contributions
 valleys+0.005
itched+0.005
 sails+0.005
rolled+0.005
 widened+0.004
Neg logit contributions
 virtue-0.011
 eater-0.010
nie-0.010
 tutor-0.010
olding-0.010
Token.
Token ID13
Feature activation-3.54e-02

Pos logit contributions
 valleys+0.266
 sails+0.258
itched+0.245
 sail+0.243
 gl+0.232
Neg logit contributions
 virtue-0.253
 eater-0.246
nie-0.232
 tutor-0.230
olding-0.227
Token the
Token ID262
Feature activation+5.37e-03

Pos logit contributions
 virtue+0.123
 eater+0.120
nie+0.116
olding+0.113
 tutor+0.112
Neg logit contributions
 valleys-0.093
 sails-0.088
 sail-0.084
itched-0.083
 gl-0.079
Token boat
Token ID8848
Feature activation+1.66e-01

Pos logit contributions
itched+0.009
 widened+0.008
ink+0.008
bridge+0.007
rolled+0.007
Neg logit contributions
 promise-0.008
 mistake-0.008
 kid-0.007
 day-0.007
 lesson-0.007
Token and
Token ID290
Feature activation-2.50e-02

Pos logit contributions
 valleys+0.239
 sails+0.230
itched+0.219
 sail+0.214
 gl+0.204
Neg logit contributions
 virtue-0.280
 eater-0.273
nie-0.259
 tutor-0.257
olding-0.254
Token took
Token ID1718
Feature activation+2.48e-02

Pos logit contributions
 virtue+0.046
 eater+0.045
 tutor+0.042
nie+0.042
olding+0.042
Neg logit contributions
 valleys-0.033
 sails-0.032
 sail-0.030
itched-0.030
 gl-0.028
Token the
Token ID262
Feature activation+5.27e-03

Pos logit contributions
 valleys+0.038
itched+0.036
 sails+0.036
rolling+0.035
 widened+0.034
Neg logit contributions
 virtue-0.037
 ever-0.036
 eater-0.036
nie-0.036
 tutor-0.033
Token paddle
Token ID39517
Feature activation+3.32e-01

Pos logit contributions
 valleys+0.006
itched+0.005
 sails+0.005
 widened+0.005
ink+0.005
Neg logit contributions
 virtue-0.010
 eater-0.010
nie-0.010
 tutor-0.010
olding-0.010
Token.
Token ID13
Feature activation-3.41e-02

Pos logit contributions
 valleys+0.555
 sails+0.537
itched+0.515
 sail+0.506
 gl+0.486
Neg logit contributions
 virtue-0.483
 eater-0.469
nie-0.441
 tutor-0.436
olding-0.432
Token He
Token ID679
Feature activation+1.76e-01

Pos logit contributions
 virtue+0.063
 eater+0.060
nie+0.059
 tutor+0.059
olding+0.058
Neg logit contributions
 valleys-0.042
 sails-0.040
itched-0.040
 gl-0.037
 sail-0.036
Token pushed
Token ID7121
Feature activation+1.05e-01

Pos logit contributions
 valleys+0.243
 sails+0.233
itched+0.222
 sail+0.217
 gl+0.206
Neg logit contributions
 virtue-0.306
 eater-0.299
nie-0.284
 tutor-0.282
olding-0.279
Token and
Token ID290
Feature activation-2.54e-02

Pos logit contributions
 valleys+0.161
 sails+0.157
itched+0.151
 sail+0.147
 gl+0.142
Neg logit contributions
 virtue-0.163
 eater-0.158
nie-0.153
 tutor-0.148
olding-0.148
Token pulled
Token ID5954
Feature activation+1.02e-01

Pos logit contributions
 virtue+0.038
 ever+0.037
nie+0.036
 before+0.036
 eater+0.035
Neg logit contributions
itched-0.038
 valleys-0.037
 sails-0.036
ink-0.035
rolling-0.034
Token his
Token ID465
Feature activation+1.22e-01

Pos logit contributions
 virtue+0.106
 eater+0.100
nie+0.098
olding+0.096
 tutor+0.094
Neg logit contributions
 sail-0.055
 valleys-0.054
 sails-0.053
itched-0.045
 paddle-0.045
Token mom
Token ID1995
Feature activation-1.85e-01

Pos logit contributions
 sail+0.224
 sails+0.218
 valleys+0.198
itched+0.197
 gl+0.192
Neg logit contributions
 virtue-0.177
nie-0.163
 eater-0.160
olding-0.152
icia-0.148
Token and
Token ID290
Feature activation-2.04e-02

Pos logit contributions
 virtue+0.320
 eater+0.305
 tutor+0.286
nie+0.284
olding+0.282
Neg logit contributions
 valleys-0.266
 sails-0.259
itched-0.246
 sail-0.246
 gl-0.233
Token started
Token ID2067
Feature activation+8.86e-02

Pos logit contributions
 virtue+0.037
 eater+0.036
nie+0.034
 tutor+0.034
olding+0.034
Neg logit contributions
 valleys-0.027
 sails-0.026
itched-0.024
 sail-0.024
 gl-0.023
Token to
Token ID284
Feature activation-5.07e-02

Pos logit contributions
 valleys+0.175
 sails+0.166
 widened+0.166
ink+0.156
itched+0.154
Neg logit contributions
 tutor-0.086
 ever-0.085
 eater-0.085
 virtue-0.082
 mistake-0.082
Token paddle
Token ID39517
Feature activation+3.32e-01

Pos logit contributions
 eater+0.067
 virtue+0.066
 promise+0.064
 tutor+0.063
 mistake+0.062
Neg logit contributions
 valleys-0.088
 sails-0.082
itched-0.081
 widened-0.076
ink-0.076
Token away
Token ID1497
Feature activation+6.98e-02

Pos logit contributions
 valleys+0.506
 sails+0.489
itched+0.471
 sail+0.456
 gl+0.437
Neg logit contributions
 virtue-0.527
 eater-0.514
nie-0.484
 tutor-0.480
olding-0.480
Token with
Token ID351
Feature activation-7.85e-02

Pos logit contributions
 valleys+0.115
 sails+0.111
itched+0.106
 sail+0.105
 gl+0.101
Neg logit contributions
 virtue-0.104
 eater-0.101
nie-0.096
 tutor-0.095
olding-0.093
Token the
Token ID262
Feature activation+6.38e-03

Pos logit contributions
 virtue+0.138
 eater+0.136
 tutor+0.132
 ever+0.129
nie+0.127
Neg logit contributions
 valleys-0.101
 sails-0.093
itched-0.092
rolling-0.086
ink-0.086
Token lumber
Token ID34840
Feature activation+9.56e-02

Pos logit contributions
 valleys+0.006
 sails+0.006
itched+0.006
 sail+0.006
 gl+0.005
Neg logit contributions
 virtue-0.014
 eater-0.013
nie-0.013
 tutor-0.013
olding-0.013
Token.
Token ID13
Feature activation-2.80e-02

Pos logit contributions
 valleys+0.143
 sails+0.141
 sail+0.134
itched+0.130
 gl+0.130
Neg logit contributions
 virtue-0.157
 eater-0.152
nie-0.146
 tutor-0.143
olding-0.139
Token,"
Token ID553
Feature activation+5.63e-02

Pos logit contributions
 virtue+0.082
 eater+0.081
olding+0.080
 occasion+0.077
 anniversary+0.075
Neg logit contributions
 sails-0.029
 sail-0.029
 valleys-0.026
itched-0.026
ink-0.022
Token it
Token ID340
Feature activation-9.20e-03

Pos logit contributions
 sails+0.062
 valleys+0.061
 sail+0.060
He+0.058
 gl+0.057
Neg logit contributions
 virtue-0.123
 eater-0.111
olding-0.107
nie-0.105
 mistake-0.104
Token said
Token ID531
Feature activation+7.52e-02

Pos logit contributions
 virtue+0.018
 eater+0.017
olding+0.016
 tutor+0.016
 anniversary+0.015
Neg logit contributions
 valleys-0.011
 sails-0.011
 sail-0.011
 gl-0.010
itched-0.010
Token,
Token ID11
Feature activation+1.17e-01

Pos logit contributions
itched+0.097
 sail+0.092
 sails+0.091
 valleys+0.091
 widened+0.086
Neg logit contributions
 virtue-0.140
 eater-0.133
nie-0.125
 occasion-0.123
 tutor-0.123
Token "
Token ID366
Feature activation-6.23e-02

Pos logit contributions
 valleys+0.162
 sails+0.158
itched+0.152
 sail+0.152
 gl+0.143
Neg logit contributions
 virtue-0.209
 eater-0.199
nie-0.187
olding-0.185
 tutor-0.184
Tokenyou
Token ID5832
Feature activation-2.56e-01

Pos logit contributions
 virtue+0.101
 eater+0.098
 mistake+0.096
 tutor+0.094
nie+0.089
Neg logit contributions
 valleys-0.096
He-0.095
His-0.091
 sails-0.086
 sail-0.083
Token won
Token ID1839
Feature activation-1.43e-01

Pos logit contributions
 eater+0.438
 virtue+0.416
 tutor+0.394
olding+0.391
becca+0.390
Neg logit contributions
 sails-0.404
 sail-0.389
 valleys-0.366
 gl-0.357
 sailed-0.354
Token't
Token ID470
Feature activation-1.25e-01

Pos logit contributions
 eater+0.234
 virtue+0.228
olding+0.211
nie+0.206
 tutor+0.202
Neg logit contributions
 valleys-0.219
itched-0.217
 sails-0.216
 sail-0.209
rolled-0.199
Token regret
Token ID13721
Feature activation-3.78e-02

Pos logit contributions
 virtue+0.209
 eater+0.207
 tutor+0.198
aughter+0.192
nie+0.188
Neg logit contributions
 sail-0.185
 swoop-0.172
 sails-0.172
 valleys-0.164
He-0.164
Token it
Token ID340
Feature activation-4.34e-03

Pos logit contributions
 virtue+0.072
 eater+0.071
 occasion+0.068
olding+0.065
 hour+0.062
Neg logit contributions
itched-0.049
 soaring-0.047
 sails-0.045
 valleys-0.045
 sail-0.045
Token".
Token ID1911
Feature activation-1.29e-01

Pos logit contributions
 eater+0.007
 virtue+0.007
 tutor+0.007
 student+0.007
 occasion+0.007
Neg logit contributions
 sail-0.005
itched-0.005
 sails-0.005
 valleys-0.005
He-0.005
Token smiles
Token ID21845
Feature activation+7.29e-02

Pos logit contributions
 virtue+0.017
 eater+0.016
nie+0.016
olding+0.015
 occasion+0.015
Neg logit contributions
 sails-0.015
 valleys-0.015
 sail-0.014
itched-0.014
 gl-0.014
Token too
Token ID1165
Feature activation-1.92e-01

Pos logit contributions
 valleys+0.156
 sails+0.147
rolled+0.144
rolling+0.143
itched+0.140
Neg logit contributions
 tutor-0.070
 eater-0.069
 ever-0.066
 before-0.065
 virtue-0.064
Token and
Token ID290
Feature activation-2.50e-02

Pos logit contributions
 virtue+0.309
 eater+0.302
nie+0.284
 tutor+0.279
 occasion+0.273
Neg logit contributions
 valleys-0.291
 sails-0.283
itched-0.276
 sail-0.270
 widened-0.258
Token says
Token ID1139
Feature activation+1.08e-01

Pos logit contributions
 tutor+0.027
 eater+0.027
 virtue+0.025
 promise+0.024
olding+0.024
Neg logit contributions
 valleys-0.049
itched-0.047
 sails-0.045
rolled-0.045
rolling-0.044
Token "
Token ID366
Feature activation-6.54e-02

Pos logit contributions
 valleys+0.139
 sails+0.136
itched+0.135
 sail+0.130
 gl+0.122
Neg logit contributions
 virtue-0.195
 eater-0.187
nie-0.185
olding-0.174
 tutor-0.172
Tokenyou
Token ID5832
Feature activation-2.56e-01

Pos logit contributions
 virtue+0.095
 eater+0.090
nie+0.088
 tutor+0.088
olding+0.087
Neg logit contributions
 valleys-0.110
 sails-0.103
He-0.101
 sail-0.099
itched-0.099
Token are
Token ID389
Feature activation+4.12e-02

Pos logit contributions
 virtue+0.474
 eater+0.467
nie+0.448
olding+0.442
 tutor+0.435
Neg logit contributions
 sails-0.330
 sail-0.314
 valleys-0.301
 gl-0.296
itched-0.286
Token welcome
Token ID7062
Feature activation+2.08e-03

Pos logit contributions
 valleys+0.069
 sails+0.066
itched+0.064
 sail+0.062
 widened+0.060
Neg logit contributions
 virtue-0.061
 eater-0.059
nie-0.056
 tutor-0.055
olding-0.054
Token,
Token ID11
Feature activation+1.11e-01

Pos logit contributions
 sails+0.001
 valleys+0.001
itched+0.001
ink+0.001
 sail+0.001
Neg logit contributions
 virtue-0.005
nie-0.004
olding-0.004
 occasion-0.004
 eater-0.004
Token Tom
Token ID4186
Feature activation-4.76e-02

Pos logit contributions
He+0.131
 gl+0.126
 sails+0.125
 sail+0.122
 bent+0.121
Neg logit contributions
 virtue-0.219
grand-0.198
 occasion-0.196
 eater-0.193
icia-0.191
Token.
Token ID13
Feature activation-2.92e-02

Pos logit contributions
 virtue+0.096
 occasion+0.092
olding+0.088
 tutor+0.088
 anniversary+0.088
Neg logit contributions
He-0.050
 sails-0.049
itched-0.046
 valleys-0.046
 swoop-0.045
Token agrees
Token ID14386
Feature activation+6.89e-02

Pos logit contributions
 virtue+0.195
 eater+0.185
nie+0.176
olding+0.174
 tutor+0.172
Neg logit contributions
 valleys-0.162
 sails-0.160
itched-0.148
 sail-0.147
 gl-0.144
Token and
Token ID290
Feature activation-2.62e-02

Pos logit contributions
 valleys+0.081
 sails+0.079
itched+0.078
 sail+0.077
He+0.077
Neg logit contributions
 virtue-0.134
 eater-0.126
nie-0.121
 tutor-0.116
icia-0.116
Token says
Token ID1139
Feature activation+1.05e-01

Pos logit contributions
 virtue+0.035
 eater+0.035
 tutor+0.032
nie+0.032
olding+0.032
Neg logit contributions
 valleys-0.047
 sails-0.044
itched-0.043
 sail-0.042
 widened-0.040
Token,
Token ID11
Feature activation+1.09e-01

Pos logit contributions
 valleys+0.125
 sails+0.119
itched+0.112
 sail+0.109
 gl+0.103
Neg logit contributions
 virtue-0.205
 eater-0.200
nie-0.191
 tutor-0.190
olding-0.188
Token "
Token ID366
Feature activation-6.67e-02

Pos logit contributions
 valleys+0.153
 sails+0.147
itched+0.140
 sail+0.138
 gl+0.132
Neg logit contributions
 virtue-0.190
 eater-0.184
nie-0.175
 tutor-0.172
olding-0.171
TokenDeal
Token ID45776
Feature activation-2.55e-01

Pos logit contributions
 virtue+0.055
 eater+0.052
nie+0.048
 tutor+0.048
olding+0.048
Neg logit contributions
 valleys-0.152
 sails-0.145
 sail-0.141
He-0.141
itched-0.140
Token.
Token ID13
Feature activation-2.76e-02

Pos logit contributions
 virtue+0.429
 eater+0.412
nie+0.411
becca+0.393
olding+0.392
Neg logit contributions
 sail-0.358
 sails-0.338
 valleys-0.333
itched-0.325
 gl-0.302
Token Let
Token ID3914
Feature activation-1.83e-01

Pos logit contributions
 mistake+0.036
 eater+0.035
 virtue+0.034
nie+0.034
 tutor+0.033
Neg logit contributions
 valleys-0.048
 sails-0.045
 sail-0.043
 gl-0.042
He-0.042
Token's
Token ID338
Feature activation-1.40e-02

Pos logit contributions
 virtue+0.316
 eater+0.305
olding+0.294
nie+0.285
 tutor+0.284
Neg logit contributions
 valleys-0.258
 sails-0.256
itched-0.240
 sail-0.240
 gl-0.233
Token make
Token ID787
Feature activation+2.19e-02

Pos logit contributions
 virtue+0.022
 eater+0.021
nie+0.020
olding+0.020
 tutor+0.020
Neg logit contributions
 valleys-0.022
 sails-0.022
 sail-0.021
itched-0.020
 gl-0.019
Token a
Token ID257
Feature activation+8.99e-03

Pos logit contributions
 valleys+0.027
 sails+0.027
itched+0.025
 sail+0.025
 gl+0.024
Neg logit contributions
 virtue-0.041
 eater-0.040
nie-0.038
olding-0.038
 tutor-0.038
Token said
Token ID531
Feature activation+6.83e-02

Pos logit contributions
 valleys+0.148
 sails+0.138
itched+0.133
 sail+0.130
rolled+0.126
Neg logit contributions
 eater-0.127
 virtue-0.125
nie-0.122
 tutor-0.121
olding-0.117
Token.
Token ID13
Feature activation-3.63e-02

Pos logit contributions
 valleys+0.112
 sails+0.109
itched+0.107
 sail+0.105
 gl+0.099
Neg logit contributions
 virtue-0.103
 eater-0.097
nie-0.092
 tutor-0.089
olding-0.088
Token
Token ID198
Feature activation+7.00e-02

Pos logit contributions
 eater+0.060
 virtue+0.060
 tutor+0.059
olding+0.056
 student+0.055
Neg logit contributions
 valleys-0.052
 sails-0.050
itched-0.047
 sail-0.045
 soared-0.045
Token
Token ID198
Feature activation+7.00e-02

Pos logit contributions
 valleys+0.109
 sails+0.108
 gl+0.097
 sail+0.096
itched+0.096
Neg logit contributions
 virtue-0.104
 eater-0.103
 tutor-0.100
olding-0.098
nie-0.096
Token"
Token ID1
Feature activation-2.66e-02

Pos logit contributions
 valleys+0.122
 sails+0.121
itched+0.115
 sail+0.111
 gl+0.111
Neg logit contributions
 virtue-0.097
 eater-0.096
nie-0.089
 tutor-0.089
olding-0.088
TokenDeal
Token ID45776
Feature activation-2.54e-01

Pos logit contributions
 virtue+0.020
 eater+0.019
nie+0.017
 tutor+0.016
olding+0.016
Neg logit contributions
 valleys-0.062
 sails-0.061
 sail-0.059
itched-0.059
 gl-0.058
Token!"
Token ID2474
Feature activation-1.02e-01

Pos logit contributions
 virtue+0.523
 eater+0.496
nie+0.483
icia+0.479
becca+0.472
Neg logit contributions
 sail-0.268
 sails-0.250
 valleys-0.246
He-0.230
itched-0.224
Token the
Token ID262
Feature activation-4.24e-04

Pos logit contributions
 virtue+0.155
nie+0.138
 eater+0.137
olding+0.131
 tutor+0.125
Neg logit contributions
 valleys-0.169
 sails-0.168
 sail-0.165
 widened-0.157
He-0.156
Token twins
Token ID20345
Feature activation-1.60e-01

Pos logit contributions
 virtue+0.001
 eater+0.001
nie+0.001
ccoli+0.001
 ever+0.001
Neg logit contributions
 sails-0.001
 sail-0.001
 valleys-0.001
 gl-0.001
His-0.001
Token said
Token ID531
Feature activation+6.79e-02

Pos logit contributions
 virtue+0.288
 eater+0.273
nie+0.258
 tutor+0.258
olding+0.253
Neg logit contributions
 sails-0.217
 valleys-0.214
 sail-0.206
itched-0.203
 gl-0.199
Token.
Token ID13
Feature activation-3.65e-02

Pos logit contributions
 sails+0.101
 valleys+0.101
itched+0.100
 sail+0.098
 widened+0.090
Neg logit contributions
 virtue-0.112
 eater-0.105
nie-0.099
 tutor-0.096
 occasion-0.095
Token.
Token ID13
Feature activation-3.45e-02

Pos logit contributions
 valleys+0.155
 sails+0.151
itched+0.146
 sail+0.143
 gl+0.138
Neg logit contributions
 virtue-0.138
 eater-0.131
nie-0.124
 tutor-0.122
olding-0.121
Token She
Token ID1375
Feature activation+1.14e-01

Pos logit contributions
 virtue+0.079
 eater+0.077
nie+0.075
olding+0.073
 tutor+0.072
Neg logit contributions
 valleys-0.030
 sails-0.027
itched-0.025
 sail-0.024
 gl-0.022
Token said
Token ID531
Feature activation+6.82e-02

Pos logit contributions
 sails+0.143
 gl+0.138
 valleys+0.138
 widened+0.135
 sail+0.133
Neg logit contributions
 virtue-0.210
 eater-0.199
olding-0.192
 tutor-0.191
nie-0.189
Token,
Token ID11
Feature activation+1.08e-01

Pos logit contributions
 valleys+0.077
 sails+0.073
itched+0.068
 sail+0.067
 gl+0.062
Neg logit contributions
 virtue-0.137
 eater-0.134
nie-0.129
 tutor-0.128
olding-0.127
Token "
Token ID366
Feature activation-6.85e-02

Pos logit contributions
 valleys+0.142
 sails+0.136
itched+0.133
 sail+0.130
 gl+0.124
Neg logit contributions
 virtue-0.196
 eater-0.189
nie-0.182
olding-0.176
 tutor-0.176
TokenDeal
Token ID45776
Feature activation-2.54e-01

Pos logit contributions
 virtue+0.059
 eater+0.057
nie+0.053
olding+0.052
 tutor+0.051
Neg logit contributions
 valleys-0.149
He-0.143
 sails-0.142
His-0.139
 gl-0.139
Token,
Token ID11
Feature activation+1.08e-01

Pos logit contributions
 virtue+0.505
nie+0.488
 eater+0.488
icia+0.471
becca+0.466
Neg logit contributions
 sail-0.279
 sails-0.252
 valleys-0.247
itched-0.241
 gl-0.223
Token mom
Token ID1995
Feature activation-1.89e-01

Pos logit contributions
 sail+0.145
 gl+0.141
 sails+0.132
He+0.132
 wood+0.127
Neg logit contributions
 virtue-0.182
olding-0.167
t-0.165
 occasion-0.165
 eater-0.164
Token.
Token ID13
Feature activation-3.49e-02

Pos logit contributions
 virtue+0.414
becca+0.370
icia+0.363
olding+0.360
 eater+0.359
Neg logit contributions
 valleys-0.169
He-0.166
 sails-0.162
 gl-0.153
itched-0.145
Token Thank
Token ID6952
Feature activation-1.06e-01

Pos logit contributions
 mistake+0.048
 virtue+0.046
nie+0.045
 eater+0.044
olding+0.042
Neg logit contributions
 valleys-0.060
 sails-0.055
 sail-0.054
 gl-0.053
 mountains-0.051
Token you
Token ID345
Feature activation-2.40e-01

Pos logit contributions
 virtue+0.113
 eater+0.094
nie+0.092
 mistake+0.090
olding+0.084
Neg logit contributions
 valleys-0.222
 sails-0.217
itched-0.215
 sail-0.213
ink-0.212
Token.
Token ID13
Feature activation-3.61e-02

Pos logit contributions
 valleys+0.166
 sails+0.159
itched+0.152
 sail+0.150
 gl+0.144
Neg logit contributions
 virtue-0.128
 eater-0.126
nie-0.118
 tutor-0.117
olding-0.115
Token He
Token ID679
Feature activation+1.71e-01

Pos logit contributions
 virtue+0.092
 eater+0.090
nie+0.088
olding+0.086
 tutor+0.086
Neg logit contributions
 valleys-0.021
 sails-0.019
itched-0.017
 sail-0.016
He-0.014
Token said
Token ID531
Feature activation+6.87e-02

Pos logit contributions
 sails+0.224
 valleys+0.220
 gl+0.212
 widened+0.207
 sail+0.207
Neg logit contributions
 virtue-0.309
 eater-0.295
olding-0.281
nie-0.278
 tutor-0.278
Token,
Token ID11
Feature activation+1.08e-01

Pos logit contributions
 valleys+0.084
 sails+0.079
itched+0.074
 sail+0.073
 widened+0.069
Neg logit contributions
 virtue-0.132
 eater-0.130
 tutor-0.123
nie-0.122
olding-0.122
Token "
Token ID366
Feature activation-6.75e-02

Pos logit contributions
 valleys+0.144
 sails+0.138
itched+0.134
 sail+0.131
 gl+0.125
Neg logit contributions
 virtue-0.195
 eater-0.188
nie-0.181
 tutor-0.176
olding-0.175
TokenDeal
Token ID45776
Feature activation-2.54e-01

Pos logit contributions
 virtue+0.055
 eater+0.053
 tutor+0.049
nie+0.048
olding+0.047
Neg logit contributions
 valleys-0.152
 sails-0.147
He-0.145
His-0.142
 sail-0.142
Token.
Token ID13
Feature activation-3.67e-02

Pos logit contributions
 virtue+0.493
 eater+0.475
icia+0.475
nie+0.471
becca+0.467
Neg logit contributions
 sail-0.292
 sails-0.262
 valleys-0.255
He-0.246
itched-0.242
Token I
Token ID314
Feature activation-1.78e-01

Pos logit contributions
 mistake+0.045
 virtue+0.045
 eater+0.045
icia+0.044
nie+0.042
Neg logit contributions
 valleys-0.063
 sails-0.061
He-0.060
His-0.059
 sail-0.059
Token promise
Token ID6991
Feature activation-1.28e-01

Pos logit contributions
 eater+0.243
 tutor+0.238
nie+0.234
 virtue+0.230
becca+0.219
Neg logit contributions
 sail-0.279
 sails-0.273
He-0.268
 widened-0.266
 gl-0.266
Token.
Token ID13
Feature activation-3.67e-02

Pos logit contributions
 virtue+0.228
 eater+0.227
 student+0.211
olding+0.211
icia+0.209
Neg logit contributions
 sails-0.158
 sail-0.157
itched-0.151
 valleys-0.151
He-0.150
Token Let
Token ID3914
Feature activation-1.83e-01

Pos logit contributions
 virtue+0.045
 mistake+0.044
 eater+0.044
nie+0.042
icia+0.041
Neg logit contributions
 valleys-0.064
 sails-0.064
 sail-0.062
He-0.061
His-0.060
Token said
Token ID531
Feature activation+6.84e-02

Pos logit contributions
 virtue+0.056
 eater+0.052
olding+0.049
 tutor+0.049
nie+0.049
Neg logit contributions
 sails-0.036
 valleys-0.035
itched-0.033
 sail-0.033
 gl-0.033
Token.
Token ID13
Feature activation-3.65e-02

Pos logit contributions
itched+0.095
 sails+0.094
 sail+0.093
 valleys+0.092
His+0.085
Neg logit contributions
 virtue-0.119
 eater-0.109
nie-0.104
 tutor-0.101
becca-0.101
Token
Token ID198
Feature activation+7.19e-02

Pos logit contributions
 eater+0.059
 virtue+0.058
 tutor+0.057
olding+0.055
 student+0.054
Neg logit contributions
 valleys-0.054
 sails-0.052
itched-0.050
 soared-0.047
 sail-0.047
Token
Token ID198
Feature activation+7.16e-02

Pos logit contributions
 valleys+0.116
 sails+0.114
 descended+0.103
 soared+0.103
 gl+0.103
Neg logit contributions
 virtue-0.103
 eater-0.102
olding-0.099
 tutor-0.097
nie-0.096
Token"
Token ID1
Feature activation-2.61e-02

Pos logit contributions
 valleys+0.109
 sails+0.106
itched+0.102
rolled+0.097
 sail+0.097
Neg logit contributions
 virtue-0.117
 eater-0.115
olding-0.107
nie-0.107
 tutor-0.104
TokenDeal
Token ID45776
Feature activation-2.54e-01

Pos logit contributions
 virtue+0.022
 eater+0.021
nie+0.018
 tutor+0.018
olding+0.018
Neg logit contributions
 valleys-0.061
 sails-0.059
itched-0.058
 sail-0.057
rolled-0.056
Token!"
Token ID2474
Feature activation-1.03e-01

Pos logit contributions
 virtue+0.508
 eater+0.481
nie+0.472
icia+0.465
becca+0.456
Neg logit contributions
 sail-0.285
 sails-0.266
 valleys-0.263
itched-0.252
He-0.237
Token Lily
Token ID20037
Feature activation-1.16e-01

Pos logit contributions
 virtue+0.134
nie+0.120
 eater+0.116
olding+0.113
 tutor+0.106
Neg logit contributions
 sails-0.194
 valleys-0.192
 sail-0.192
He-0.181
 gl-0.180
Token said
Token ID531
Feature activation+6.83e-02

Pos logit contributions
 virtue+0.240
 eater+0.221
olding+0.209
nie+0.209
 tutor+0.208
Neg logit contributions
 sails-0.129
itched-0.120
 gl-0.120
 widened-0.119
 valleys-0.117
Token.
Token ID13
Feature activation-3.69e-02

Pos logit contributions
 valleys+0.110
 sails+0.109
itched+0.107
 sail+0.104
 gl+0.099
Neg logit contributions
 virtue-0.103
 eater-0.098
nie-0.092
 tutor-0.090
olding-0.089
Token
Token ID198
Feature activation+7.20e-02

Pos logit contributions
 eater+0.062
 virtue+0.061
 tutor+0.061
olding+0.059
 student+0.056
Neg logit contributions
 sails-0.052
 valleys-0.052
 sail-0.048
itched-0.047
 soared-0.046
Token.
Token ID13
Feature activation-2.76e-02

Pos logit contributions
 sails+0.003
ink+0.003
 swoop+0.002
 valleys+0.002
itched+0.002
Neg logit contributions
 virtue-0.005
 occasion-0.005
olding-0.005
 eater-0.005
becca-0.005
Token I
Token ID314
Feature activation-1.78e-01

Pos logit contributions
 virtue+0.039
 eater+0.037
nie+0.035
 tutor+0.034
olding+0.033
Neg logit contributions
 valleys-0.048
 sails-0.047
 sail-0.045
itched-0.044
He-0.042
Token...
Token ID986
Feature activation+7.60e-02

Pos logit contributions
 virtue+0.293
 eater+0.284
 tutor+0.282
nie+0.277
olding+0.264
Neg logit contributions
 sails-0.255
 sail-0.254
 gl-0.249
 widened-0.241
 sailed-0.238
Tokenmiss
Token ID3927
Feature activation-2.94e-02

Pos logit contributions
 valleys+0.113
 sails+0.107
He+0.105
His+0.100
 sail+0.099
Neg logit contributions
 virtue-0.128
 eater-0.121
 mistake-0.120
 tutor-0.115
nie-0.115
Token...
Token ID986
Feature activation+7.61e-02

Pos logit contributions
 virtue+0.056
 eater+0.055
 tutor+0.054
idents+0.053
olding+0.052
Neg logit contributions
 valleys-0.034
 sails-0.034
ink-0.032
 soared-0.032
 swoop-0.031
Tokenyou
Token ID5832
Feature activation-2.54e-01

Pos logit contributions
 valleys+0.114
 sails+0.106
He+0.100
 soared+0.097
 sail+0.096
Neg logit contributions
 virtue-0.131
 eater-0.127
 tutor-0.121
 mistake-0.120
nie-0.117
Token...
Token ID986
Feature activation+7.63e-02

Pos logit contributions
 eater+0.432
 virtue+0.426
 tutor+0.392
nie+0.387
olding+0.384
Neg logit contributions
 sails-0.379
 sail-0.355
 valleys-0.351
 gl-0.343
 sailed-0.340
Tokenvery
Token ID548
Feature activation+5.49e-03

Pos logit contributions
 valleys+0.117
 sails+0.107
He+0.103
 soared+0.101
itched+0.099
Neg logit contributions
 virtue-0.127
 eater-0.124
 mistake-0.117
 tutor-0.116
 promise-0.115
Token...
Token ID986
Feature activation+7.66e-02

Pos logit contributions
ink+0.008
 sails+0.007
 gl+0.007
 sail+0.007
itched+0.007
Neg logit contributions
 virtue-0.010
 eater-0.009
olding-0.009
 tutor-0.009
nie-0.008
Tokenmuch
Token ID29482
Feature activation-3.85e-02

Pos logit contributions
 valleys+0.117
 sails+0.108
He+0.103
 soared+0.100
itched+0.099
Neg logit contributions
 virtue-0.128
 eater-0.125
 mistake-0.118
 promise-0.116
 tutor-0.115
Token.
Token ID13
Feature activation-2.66e-02

Pos logit contributions
 eater+0.062
 virtue+0.059
nie+0.056
becca+0.055
 tutor+0.054
Neg logit contributions
 sails-0.060
 valleys-0.058
 sail-0.056
 swoop-0.053
 gl-0.053
Token nod
Token ID18666
Feature activation+1.30e-01

Pos logit contributions
 virtue+0.140
 eater+0.135
nie+0.128
 tutor+0.127
olding+0.125
Neg logit contributions
 valleys-0.118
 sails-0.116
 sail-0.110
itched-0.108
 widened-0.107
Token and
Token ID290
Feature activation-2.59e-02

Pos logit contributions
 valleys+0.196
 sails+0.192
itched+0.186
 sail+0.182
 widened+0.175
Neg logit contributions
 virtue-0.209
 eater-0.202
nie-0.192
 tutor-0.188
olding-0.186
Token say
Token ID910
Feature activation+9.40e-02

Pos logit contributions
 virtue+0.040
 eater+0.038
nie+0.036
olding+0.035
 tutor+0.035
Neg logit contributions
 valleys-0.042
 sails-0.042
 sail-0.040
itched-0.039
 gl-0.038
Token,
Token ID11
Feature activation+1.10e-01

Pos logit contributions
 valleys+0.106
 sails+0.102
itched+0.098
 sail+0.094
 gl+0.088
Neg logit contributions
 virtue-0.187
 eater-0.183
nie-0.176
 tutor-0.173
olding-0.172
Token "
Token ID366
Feature activation-6.58e-02

Pos logit contributions
 valleys+0.119
 sails+0.112
itched+0.112
 sail+0.109
He+0.106
Neg logit contributions
 virtue-0.222
 eater-0.216
nie-0.212
 tutor-0.200
 student-0.200
TokenDeal
Token ID45776
Feature activation-2.54e-01

Pos logit contributions
 virtue+0.063
 eater+0.063
 tutor+0.060
olding+0.059
nie+0.059
Neg logit contributions
 valleys-0.135
 sails-0.131
He-0.130
 widened-0.127
 sail-0.127
Token,
Token ID11
Feature activation+1.10e-01

Pos logit contributions
 eater+0.423
 virtue+0.422
nie+0.419
becca+0.408
icia+0.406
Neg logit contributions
 sail-0.349
 sails-0.322
 valleys-0.311
itched-0.307
rolled-0.282
Token mom
Token ID1995
Feature activation-1.90e-01

Pos logit contributions
 sail+0.161
 sails+0.147
 gl+0.139
 wood+0.130
 glitter+0.129
Neg logit contributions
olding-0.181
 virtue-0.174
nie-0.172
 eater-0.170
t-0.164
Token.
Token ID13
Feature activation-2.92e-02

Pos logit contributions
 virtue+0.399
becca+0.365
olding+0.353
icia+0.353
 tutor+0.350
Neg logit contributions
He-0.173
 sails-0.173
 valleys-0.172
itched-0.157
 sail-0.157
Token Thank
Token ID6952
Feature activation-1.05e-01

Pos logit contributions
 mistake+0.043
 eater+0.041
nie+0.041
 tutor+0.040
 exam+0.040
Neg logit contributions
 valleys-0.047
 sails-0.046
 sail-0.045
 sailed-0.041
 gl-0.040
Token you
Token ID345
Feature activation-2.40e-01

Pos logit contributions
 virtue+0.123
 eater+0.108
nie+0.105
 mistake+0.104
olding+0.098
Neg logit contributions
 valleys-0.203
ink-0.202
itched-0.202
 sails-0.201
 sail-0.197
TokenS
Token ID50
Feature activation-4.52e-02

Pos logit contributions
 sails+0.152
 valleys+0.148
 sail+0.141
itched+0.140
 gl+0.136
Neg logit contributions
 virtue-0.074
 eater-0.072
olding-0.069
becca-0.063
 promise-0.063
Tokenara
Token ID3301
Feature activation-4.92e-02

Pos logit contributions
 virtue+0.065
 eater+0.060
 tutor+0.060
nie+0.059
 student+0.056
Neg logit contributions
 valleys-0.074
itched-0.073
 sails-0.071
ink-0.071
 widened-0.067
Token said
Token ID531
Feature activation+6.90e-02

Pos logit contributions
 virtue+0.105
 eater+0.099
 tutor+0.095
olding+0.094
nie+0.093
Neg logit contributions
 sails-0.050
 valleys-0.048
itched-0.047
 gl-0.045
 widened-0.044
Token,
Token ID11
Feature activation+1.10e-01

Pos logit contributions
 valleys+0.072
 sails+0.069
itched+0.066
 sail+0.063
 gl+0.059
Neg logit contributions
 virtue-0.144
 eater-0.140
nie-0.135
 tutor-0.132
olding-0.132
Token "
Token ID366
Feature activation-6.82e-02

Pos logit contributions
 valleys+0.120
itched+0.114
 sails+0.112
 sail+0.112
 gl+0.109
Neg logit contributions
 virtue-0.222
 eater-0.207
nie-0.205
olding-0.199
 mistake-0.198
TokenDeal
Token ID45776
Feature activation-2.54e-01

Pos logit contributions
 virtue+0.055
 eater+0.053
nie+0.049
olding+0.049
 tutor+0.048
Neg logit contributions
 valleys-0.153
He-0.145
 sails-0.145
 gl-0.141
 sail-0.141
Token."
Token ID526
Feature activation-1.38e-01

Pos logit contributions
 virtue+0.486
 eater+0.462
nie+0.459
olding+0.451
icia+0.450
Neg logit contributions
 sail-0.295
 valleys-0.270
 sails-0.269
itched-0.252
 gl-0.241
Token She
Token ID1375
Feature activation+1.14e-01

Pos logit contributions
 virtue+0.268
 eater+0.265
 tutor+0.255
olding+0.253
nie+0.248
Neg logit contributions
 valleys-0.164
 sails-0.160
 sail-0.145
itched-0.145
 gl-0.136
Token put
Token ID1234
Feature activation+9.25e-02

Pos logit contributions
 valleys+0.185
 sails+0.173
itched+0.168
 sail+0.163
 widened+0.154
Neg logit contributions
 virtue-0.175
 eater-0.173
nie-0.164
 tutor-0.162
olding-0.159
Token her
Token ID607
Feature activation+2.87e-02

Pos logit contributions
 valleys+0.120
 sails+0.114
itched+0.106
 sail+0.106
 gl+0.101
Neg logit contributions
 virtue-0.178
 eater-0.167
nie-0.162
olding-0.162
 tutor-0.156
Token tablet
Token ID14147
Feature activation-9.52e-02

Pos logit contributions
 sails+0.049
 sail+0.048
 hood+0.043
 gl+0.043
itched+0.043
Neg logit contributions
 virtue-0.048
 eater-0.041
olding-0.039
 inconvenience-0.038
 occasion-0.038
Token She
Token ID1375
Feature activation+1.20e-01

Pos logit contributions
 virtue+0.049
 eater+0.047
nie+0.046
olding+0.045
 tutor+0.045
Neg logit contributions
 valleys-0.027
 sails-0.025
itched-0.024
 sail-0.023
rolled-0.022
Token smiled
Token ID13541
Feature activation+8.09e-02

Pos logit contributions
 sails+0.148
 valleys+0.146
 gl+0.139
itched+0.135
 sail+0.135
Neg logit contributions
 virtue-0.235
 eater-0.225
 tutor-0.215
olding-0.215
nie-0.209
Token and
Token ID290
Feature activation-1.73e-02

Pos logit contributions
 valleys+0.149
 sails+0.140
itched+0.134
 sail+0.130
rolled+0.129
Neg logit contributions
 eater-0.103
 virtue-0.099
 tutor-0.097
nie-0.094
 ever-0.092
Token said
Token ID531
Feature activation+8.14e-02

Pos logit contributions
 virtue+0.030
 eater+0.030
nie+0.028
 tutor+0.028
olding+0.028
Neg logit contributions
 valleys-0.024
 sails-0.023
itched-0.022
 sail-0.021
 gl-0.020
Token "
Token ID366
Feature activation-5.83e-02

Pos logit contributions
 valleys+0.101
 sails+0.097
itched+0.091
 sail+0.089
 gl+0.083
Neg logit contributions
 virtue-0.153
 eater-0.150
nie-0.142
 tutor-0.142
olding-0.141
Tokenyou
Token ID5832
Feature activation-2.54e-01

Pos logit contributions
 virtue+0.097
 eater+0.093
 tutor+0.092
 mistake+0.090
olding+0.089
Neg logit contributions
 valleys-0.086
 sails-0.079
He-0.078
 sail-0.077
 swoop-0.076
Token're
Token ID821
Feature activation+3.27e-02

Pos logit contributions
 eater+0.458
 virtue+0.454
 tutor+0.437
olding+0.437
nie+0.423
Neg logit contributions
 sails-0.352
 sail-0.330
 gl-0.323
 valleys-0.322
 soared-0.301
Token right
Token ID826
Feature activation-1.01e-01

Pos logit contributions
 valleys+0.046
 sails+0.046
 gl+0.044
 sail+0.043
itched+0.042
Neg logit contributions
 eater-0.054
 virtue-0.054
olding-0.050
 tutor-0.050
nie-0.048
Token,
Token ID11
Feature activation+1.25e-01

Pos logit contributions
 virtue+0.187
 eater+0.178
olding+0.178
 occasion+0.171
oker+0.170
Neg logit contributions
 valleys-0.118
 sails-0.116
itched-0.114
ink-0.105
 sail-0.103
Token John
Token ID1757
Feature activation-1.28e-02

Pos logit contributions
 sail+0.180
 gl+0.175
 sails+0.174
 valleys+0.165
 soaring+0.163
Neg logit contributions
 virtue-0.199
olding-0.181
 eater-0.180
 occasion-0.166
grand-0.165
Token!
Token ID0
Feature activation-4.93e-02

Pos logit contributions
 virtue+0.023
 tutor+0.022
olding+0.022
 occasion+0.022
 exam+0.022
Neg logit contributions
ink-0.015
itched-0.014
 sails-0.014
 swoop-0.014
 gl-0.014
Token again
Token ID757
Feature activation-1.26e-01

Pos logit contributions
 virtue+0.156
 eater+0.146
nie+0.141
 tutor+0.140
becca+0.138
Neg logit contributions
 sail-0.125
 sails-0.124
 valleys-0.122
itched-0.116
 gl-0.114
Token."
Token ID526
Feature activation-1.41e-01

Pos logit contributions
 virtue+0.206
 eater+0.192
olding+0.185
 tutor+0.184
nie+0.176
Neg logit contributions
 valleys-0.193
 sails-0.187
 sail-0.185
itched-0.180
He-0.171
Token
Token ID198
Feature activation+7.34e-02

Pos logit contributions
 tutor+0.221
 eater+0.220
 virtue+0.215
olding+0.208
 student+0.202
Neg logit contributions
 valleys-0.202
 sails-0.200
itched-0.188
 soared-0.182
 swoop-0.181
Token
Token ID198
Feature activation+7.36e-02

Pos logit contributions
 valleys+0.123
 sails+0.119
 descended+0.113
 widened+0.111
 gl+0.110
Neg logit contributions
 virtue-0.097
 tutor-0.097
 eater-0.094
 mistake-0.092
olding-0.092
Token"
Token ID1
Feature activation-2.31e-02

Pos logit contributions
 valleys+0.111
itched+0.108
 sails+0.108
 gl+0.104
rolled+0.103
Neg logit contributions
 virtue-0.114
 eater-0.109
 tutor-0.107
 student-0.107
olding-0.102
TokenDeal
Token ID45776
Feature activation-2.53e-01

Pos logit contributions
 virtue+0.024
 eater+0.023
nie+0.021
 tutor+0.021
olding+0.021
Neg logit contributions
 valleys-0.048
 sails-0.047
itched-0.045
 sail-0.045
 gl-0.043
Token,"
Token ID553
Feature activation+6.14e-02

Pos logit contributions
 virtue+0.513
 eater+0.482
olding+0.469
nie+0.459
icia+0.457
Neg logit contributions
 sail-0.283
 sails-0.269
 valleys-0.259
 swoop-0.243
itched-0.242
Token the
Token ID262
Feature activation+6.75e-03

Pos logit contributions
 valleys+0.093
 sails+0.091
 sail+0.086
itched+0.085
 gl+0.082
Neg logit contributions
 virtue-0.102
 eater-0.097
nie-0.093
olding-0.091
 tutor-0.090
Token judge
Token ID5052
Feature activation-3.83e-02

Pos logit contributions
 valleys+0.015
 sails+0.015
itched+0.015
 sail+0.015
 gl+0.014
Neg logit contributions
 virtue-0.006
 eater-0.005
nie-0.005
 tutor-0.005
olding-0.005
Token said
Token ID531
Feature activation+6.95e-02

Pos logit contributions
 virtue+0.067
 eater+0.064
nie+0.060
 tutor+0.060
olding+0.059
Neg logit contributions
 sails-0.053
 valleys-0.053
itched-0.050
 sail-0.049
 gl-0.048
Token.
Token ID13
Feature activation-3.48e-02

Pos logit contributions
 valleys+0.113
 sails+0.111
itched+0.107
 sail+0.105
 gl+0.100
Neg logit contributions
 virtue-0.106
 eater-0.100
nie-0.094
 tutor-0.093
olding-0.091
Token and
Token ID290
Feature activation-2.81e-02

Pos logit contributions
 virtue+0.210
 eater+0.198
nie+0.188
olding+0.184
 occasion+0.184
Neg logit contributions
 sails-0.148
 valleys-0.145
itched-0.134
 sail-0.132
 gl-0.131
Token Ben
Token ID3932
Feature activation-2.78e-02

Pos logit contributions
 virtue+0.032
 eater+0.030
nie+0.028
olding+0.027
 tutor+0.027
Neg logit contributions
 valleys-0.057
 sails-0.056
itched-0.054
 sail-0.053
 gl-0.051
Token say
Token ID910
Feature activation+9.51e-02

Pos logit contributions
 virtue+0.045
 eater+0.043
nie+0.041
 tutor+0.040
olding+0.040
Neg logit contributions
 valleys-0.042
 sails-0.041
 sail-0.039
itched-0.039
 gl-0.037
Token,
Token ID11
Feature activation+1.09e-01

Pos logit contributions
 valleys+0.129
 sails+0.122
itched+0.113
 sail+0.112
 gl+0.106
Neg logit contributions
 virtue-0.172
 eater-0.168
 tutor-0.160
nie-0.159
olding-0.158
Token "
Token ID366
Feature activation-6.61e-02

Pos logit contributions
 valleys+0.123
 sails+0.116
itched+0.114
 sail+0.113
 gl+0.109
Neg logit contributions
 virtue-0.217
 eater-0.207
nie-0.203
 mistake-0.195
olding-0.194
TokenDeal
Token ID45776
Feature activation-2.53e-01

Pos logit contributions
 virtue+0.056
 eater+0.054
 tutor+0.050
nie+0.050
olding+0.049
Neg logit contributions
 valleys-0.147
 sails-0.141
 sail-0.137
He-0.137
itched-0.135
Token,
Token ID11
Feature activation+1.09e-01

Pos logit contributions
 virtue+0.414
 eater+0.408
nie+0.404
becca+0.386
olding+0.385
Neg logit contributions
 sail-0.358
 sails-0.336
 valleys-0.331
itched-0.318
 gl-0.294
Token mom
Token ID1995
Feature activation-1.90e-01

Pos logit contributions
 sail+0.176
 sails+0.169
 gl+0.157
 valleys+0.151
 wood+0.150
Neg logit contributions
 virtue-0.159
olding-0.159
nie-0.155
t-0.149
 eater-0.149
Token.
Token ID13
Feature activation-2.91e-02

Pos logit contributions
 virtue+0.406
becca+0.370
olding+0.361
 virus+0.358
 tutor+0.358
Neg logit contributions
He-0.165
 valleys-0.164
 sails-0.160
ink-0.145
 sail-0.144
Token We
Token ID775
Feature activation-1.74e-01

Pos logit contributions
 mistake+0.039
nie+0.039
 eater+0.039
 virtue+0.038
olding+0.037
Neg logit contributions
 valleys-0.051
 sails-0.048
 sail-0.047
 gl-0.043
 mountains-0.043
Token will
Token ID481
Feature activation-7.25e-02

Pos logit contributions
 eater+0.230
 virtue+0.226
 tutor+0.221
olding+0.216
nie+0.215
Neg logit contributions
 sails-0.296
 sail-0.291
 valleys-0.284
 gl-0.277
 widened-0.272
Token.
Token ID13
Feature activation-3.52e-02

Pos logit contributions
 valleys+0.140
 sails+0.134
itched+0.129
 sail+0.126
rolled+0.125
Neg logit contributions
 eater-0.093
 virtue-0.093
 tutor-0.087
nie-0.085
olding-0.085
Token She
Token ID1375
Feature activation+1.14e-01

Pos logit contributions
 virtue+0.077
 eater+0.075
nie+0.072
 tutor+0.071
olding+0.071
Neg logit contributions
 valleys-0.034
 sails-0.031
itched-0.029
 sail-0.028
 gl-0.026
Token said
Token ID531
Feature activation+6.92e-02

Pos logit contributions
 valleys+0.176
 sails+0.170
itched+0.162
 sail+0.159
 gl+0.152
Neg logit contributions
 virtue-0.180
 eater-0.175
nie-0.166
 tutor-0.164
olding-0.162
Token,
Token ID11
Feature activation+1.09e-01

Pos logit contributions
 valleys+0.068
 sails+0.065
itched+0.062
 sail+0.058
 gl+0.056
Neg logit contributions
 virtue-0.149
 eater-0.144
nie-0.139
 tutor-0.136
olding-0.136
Token "
Token ID366
Feature activation-6.79e-02

Pos logit contributions
 valleys+0.153
 sails+0.147
itched+0.142
 sail+0.138
 gl+0.132
Neg logit contributions
 virtue-0.188
 eater-0.182
nie-0.173
 tutor-0.170
olding-0.169
TokenDeal
Token ID45776
Feature activation-2.53e-01

Pos logit contributions
 virtue+0.072
 eater+0.072
 mistake+0.066
olding+0.065
nie+0.065
Neg logit contributions
 valleys-0.136
He-0.134
His-0.127
 sails-0.123
 gl-0.123
Token,
Token ID11
Feature activation+1.09e-01

Pos logit contributions
 virtue+0.602
 eater+0.592
nie+0.586
icia+0.573
becca+0.566
Neg logit contributions
 sail-0.158
 valleys-0.148
 sails-0.132
itched-0.123
He-0.117
Token Anna
Token ID11735
Feature activation-1.20e-01

Pos logit contributions
He+0.149
 smaller+0.140
She+0.140
 valleys+0.133
 glitter+0.131
Neg logit contributions
 virtue-0.172
grand-0.171
ci-0.166
 eater-0.156
olding-0.156
Token.
Token ID13
Feature activation-3.60e-02

Pos logit contributions
 virtue+0.286
 occasion+0.277
becca+0.276
 anniversary+0.269
olding+0.269
Neg logit contributions
,-0.063
He-0.063
ink-0.059
 valleys-0.057
 gl-0.055
Token But
Token ID887
Feature activation-6.59e-02

Pos logit contributions
 mistake+0.049
 eater+0.047
 virtue+0.047
nie+0.046
 tutor+0.044
Neg logit contributions
 valleys-0.063
 sails-0.055
 gl-0.054
 sail-0.053
He-0.053
Token you
Token ID345
Feature activation-2.39e-01

Pos logit contributions
 eater+0.082
 virtue+0.078
 tutor+0.074
nie+0.071
child+0.070
Neg logit contributions
 gl-0.117
 valleys-0.116
 sail-0.115
 sails-0.114
 wood-0.108
Token I
Token ID314
Feature activation-1.78e-01

Pos logit contributions
 virtue+0.038
 eater+0.037
nie+0.035
 mistake+0.033
 tutor+0.033
Neg logit contributions
 valleys-0.045
 sails-0.044
itched-0.041
 sail-0.041
He-0.040
Token...
Token ID986
Feature activation+7.65e-02

Pos logit contributions
 virtue+0.263
 eater+0.261
 tutor+0.249
nie+0.245
olding+0.232
Neg logit contributions
 sails-0.289
 sail-0.284
 valleys-0.274
 gl-0.274
itched-0.274
Tokenh
Token ID71
Feature activation+9.70e-02

Pos logit contributions
 valleys+0.111
 sails+0.104
He+0.100
 sail+0.095
itched+0.095
Neg logit contributions
 virtue-0.131
 eater-0.129
 mistake-0.123
 tutor-0.121
nie-0.120
Tokenope
Token ID3008
Feature activation+1.29e-02

Pos logit contributions
itched+0.139
 valleys+0.139
ink+0.139
 sails+0.136
 sail+0.123
Neg logit contributions
 virtue-0.156
 eater-0.154
 tutor-0.153
nie-0.140
 occasion-0.138
Token...
Token ID986
Feature activation+7.70e-02

Pos logit contributions
 valleys+0.017
 sails+0.017
ink+0.017
itched+0.016
br+0.016
Neg logit contributions
 eater-0.022
 virtue-0.022
 tutor-0.020
 occasion-0.020
 anniversary-0.019
Tokenyou
Token ID5832
Feature activation-2.53e-01

Pos logit contributions
 valleys+0.112
 sails+0.102
He+0.100
itched+0.094
 gl+0.092
Neg logit contributions
 virtue-0.134
 eater-0.133
 mistake-0.124
 promise-0.124
 tutor-0.124
Token...
Token ID986
Feature activation+7.71e-02

Pos logit contributions
 eater+0.438
 virtue+0.427
 tutor+0.397
olding+0.392
 occasion+0.392
Neg logit contributions
 sails-0.379
 sail-0.355
 valleys-0.344
 sailed-0.341
 gl-0.340
Tokenare
Token ID533
Feature activation+4.35e-02

Pos logit contributions
 valleys+0.119
 sails+0.106
He+0.104
 soared+0.101
itched+0.099
Neg logit contributions
 virtue-0.127
 eater-0.126
 mistake-0.119
 promise-0.118
 tutor-0.117
Token...
Token ID986
Feature activation+7.74e-02

Pos logit contributions
ink+0.063
 valleys+0.061
 sails+0.060
 swoop+0.060
 soared+0.059
Neg logit contributions
 virtue-0.073
 eater-0.070
 tutor-0.067
idents-0.064
 occasion-0.063
Tokenwell
Token ID4053
Feature activation+6.52e-03

Pos logit contributions
 valleys+0.118
 sails+0.106
He+0.104
 soared+0.099
itched+0.098
Neg logit contributions
 virtue-0.130
 eater-0.128
 mistake-0.121
 promise-0.119
 tutor-0.119
Token...
Token ID986
Feature activation+7.78e-02

Pos logit contributions
 sails+0.009
ink+0.009
itched+0.009
 valleys+0.008
 sailed+0.008
Neg logit contributions
 eater-0.011
 virtue-0.010
 tutor-0.010
 student-0.009
idents-0.009
Token.
Token ID13
Feature activation-3.51e-02

Pos logit contributions
 valleys+0.158
 sails+0.153
itched+0.147
 sail+0.145
 gl+0.139
Neg logit contributions
 virtue-0.137
 eater-0.133
nie-0.125
 tutor-0.123
olding-0.122
Token She
Token ID1375
Feature activation+1.15e-01

Pos logit contributions
 virtue+0.079
 eater+0.077
nie+0.075
 tutor+0.073
olding+0.073
Neg logit contributions
 valleys-0.032
 sails-0.029
itched-0.027
 sail-0.026
 gl-0.024
Token said
Token ID531
Feature activation+7.06e-02

Pos logit contributions
 sails+0.142
 gl+0.138
 widened+0.134
 valleys+0.133
 sail+0.132
Neg logit contributions
 virtue-0.217
 eater-0.201
 tutor-0.198
olding-0.197
nie-0.192
Token,
Token ID11
Feature activation+1.09e-01

Pos logit contributions
 valleys+0.085
 sails+0.080
itched+0.075
 sail+0.073
 widened+0.069
Neg logit contributions
 virtue-0.137
 eater-0.135
 tutor-0.128
nie-0.127
olding-0.127
Token "
Token ID366
Feature activation-6.72e-02

Pos logit contributions
 valleys+0.141
 sails+0.135
itched+0.132
 sail+0.129
 gl+0.124
Neg logit contributions
 virtue-0.201
 eater-0.192
nie-0.187
olding-0.180
 tutor-0.180
TokenDeal
Token ID45776
Feature activation-2.53e-01

Pos logit contributions
 virtue+0.057
 eater+0.054
 tutor+0.050
nie+0.050
olding+0.049
Neg logit contributions
 valleys-0.149
 sails-0.143
He-0.141
 sail-0.139
itched-0.138
Token,
Token ID11
Feature activation+1.10e-01

Pos logit contributions
 virtue+0.486
nie+0.466
 eater+0.465
icia+0.455
becca+0.446
Neg logit contributions
 sail-0.292
 sails-0.268
 valleys-0.265
itched-0.256
 gl-0.237
Token Mom
Token ID11254
Feature activation-1.38e-01

Pos logit contributions
 sail+0.151
 gl+0.146
 sails+0.141
 bent+0.136
He+0.134
Neg logit contributions
 virtue-0.186
olding-0.171
 eater-0.167
nie-0.165
 topic-0.163
Token.
Token ID13
Feature activation-3.58e-02

Pos logit contributions
 virtue+0.297
becca+0.268
 tutor+0.262
olding+0.261
 virus+0.258
Neg logit contributions
 valleys-0.119
 sails-0.119
 gl-0.118
He-0.115
 sail-0.110
Token Thank
Token ID6952
Feature activation-1.04e-01

Pos logit contributions
 mistake+0.051
 virtue+0.049
nie+0.048
 eater+0.047
olding+0.045
Neg logit contributions
 valleys-0.059
 sails-0.055
 sail-0.054
 gl-0.053
He-0.051
Token you
Token ID345
Feature activation-2.39e-01

Pos logit contributions
 virtue+0.121
 eater+0.100
 mistake+0.099
nie+0.098
olding+0.092
Neg logit contributions
 valleys-0.208
ink-0.203
 sails-0.203
itched-0.202
 sail-0.201
Token.
Token ID13
Feature activation-2.44e-02

Pos logit contributions
 sails+0.059
 valleys+0.057
 sail+0.056
itched+0.054
ink+0.054
Neg logit contributions
 virtue-0.050
nie-0.048
 eater-0.048
olding-0.045
leton-0.044
Token I
Token ID314
Feature activation-1.79e-01

Pos logit contributions
 virtue+0.036
 eater+0.035
nie+0.033
 tutor+0.031
 mistake+0.031
Neg logit contributions
 valleys-0.040
 sails-0.039
itched-0.037
 sail-0.037
He-0.036
Token...
Token ID986
Feature activation+7.80e-02

Pos logit contributions
 virtue+0.264
 eater+0.261
 tutor+0.253
nie+0.248
olding+0.236
Neg logit contributions
 sails-0.288
 sail-0.284
 valleys-0.274
 gl-0.273
itched-0.270
Tokensent
Token ID34086
Feature activation+6.53e-02

Pos logit contributions
 valleys+0.115
 sails+0.106
He+0.103
 sail+0.097
 soared+0.097
Neg logit contributions
 virtue-0.134
 eater-0.130
 mistake-0.126
 tutor-0.122
 promise-0.122
Token...
Token ID986
Feature activation+7.83e-02

Pos logit contributions
 valleys+0.091
 sails+0.089
itched+0.086
 sail+0.082
 swoop+0.080
Neg logit contributions
 virtue-0.113
 eater-0.111
nie-0.102
 tutor-0.102
olding-0.102
Tokenyou
Token ID5832
Feature activation-2.53e-01

Pos logit contributions
 valleys+0.114
 sails+0.105
He+0.102
 sail+0.096
 soared+0.095
Neg logit contributions
 virtue-0.135
 eater-0.132
 mistake-0.126
 promise-0.122
nie-0.122
Token...
Token ID986
Feature activation+7.81e-02

Pos logit contributions
 eater+0.430
 virtue+0.418
 tutor+0.384
nie+0.379
olding+0.379
Neg logit contributions
 sails-0.382
 sail-0.361
 valleys-0.350
 soared-0.344
 sailed-0.344
Tokena
Token ID64
Feature activation+1.91e-03

Pos logit contributions
 valleys+0.120
 sails+0.106
He+0.104
 soared+0.103
 swoop+0.100
Neg logit contributions
 virtue-0.128
 eater-0.128
 mistake-0.122
 promise-0.119
 tutor-0.118
Token...
Token ID986
Feature activation+7.84e-02

Pos logit contributions
 sails+0.003
 valleys+0.003
ink+0.003
itched+0.003
 sail+0.002
Neg logit contributions
 virtue-0.003
 eater-0.003
 tutor-0.003
nie-0.003
olding-0.003
Tokensur
Token ID11793
Feature activation+1.86e-02

Pos logit contributions
 valleys+0.118
 sails+0.105
He+0.104
 soared+0.100
itched+0.098
Neg logit contributions
 virtue-0.131
 eater-0.129
 mistake-0.124
 promise-0.122
 tutor-0.119
Tokenprise
Token ID7919
Feature activation-1.27e-02

Pos logit contributions
 sails+0.028
 valleys+0.028
ink+0.027
itched+0.027
 sail+0.027
Neg logit contributions
 eater-0.027
 virtue-0.026
 tutor-0.024
nie-0.024
idents-0.024
Token,
Token ID11
Feature activation+1.18e-01

Pos logit contributions
 virtue+0.205
 eater+0.183
 occasion+0.182
 inconvenience+0.181
olding+0.177
Neg logit contributions
 valleys-0.100
 sails-0.097
 gl-0.096
itched-0.092
 sail-0.091
Token she
Token ID673
Feature activation+3.94e-02

Pos logit contributions
 valleys+0.166
 gl+0.157
 widened+0.140
 soaring+0.138
 sails+0.138
Neg logit contributions
 virtue-0.230
 eater-0.211
olding-0.203
 occasion-0.198
nie-0.195
Token softly
Token ID26625
Feature activation+4.90e-02

Pos logit contributions
 valleys+0.059
 sails+0.056
itched+0.054
 sail+0.053
 gl+0.050
Neg logit contributions
 virtue-0.065
 eater-0.063
nie-0.060
 tutor-0.059
olding-0.059
Token said
Token ID531
Feature activation+7.72e-02

Pos logit contributions
 valleys+0.074
 sails+0.067
itched+0.062
 sail+0.061
He+0.059
Neg logit contributions
 eater-0.075
 tutor-0.072
nie-0.072
 virtue-0.072
olding-0.070
Token "
Token ID366
Feature activation-5.95e-02

Pos logit contributions
 valleys+0.125
 sails+0.118
itched+0.109
 sail+0.108
 widened+0.104
Neg logit contributions
 virtue-0.120
 eater-0.116
 tutor-0.112
olding-0.109
nie-0.108
Tokenyou
Token ID5832
Feature activation-2.52e-01

Pos logit contributions
 virtue+0.108
 mistake+0.104
 eater+0.103
 tutor+0.101
olding+0.100
Neg logit contributions
 valleys-0.081
He-0.077
 soared-0.074
 swoop-0.074
His-0.070
Token'll
Token ID1183
Feature activation-6.86e-02

Pos logit contributions
 eater+0.470
 virtue+0.453
 tutor+0.431
olding+0.426
nie+0.424
Neg logit contributions
 sails-0.352
 sail-0.334
 gl-0.312
 valleys-0.311
 sailed-0.301
Token be
Token ID307
Feature activation+6.09e-02

Pos logit contributions
 virtue+0.108
 eater+0.105
nie+0.099
 tutor+0.098
olding+0.097
Neg logit contributions
 valleys-0.107
 sails-0.103
itched-0.099
 sail-0.097
 gl-0.093
Token okay
Token ID8788
Feature activation-1.65e-01

Pos logit contributions
 valleys+0.119
 sails+0.114
itched+0.110
 sail+0.108
rolled+0.106
Neg logit contributions
 virtue-0.072
nie-0.067
 eater-0.067
 tutor-0.064
 ever-0.062
Token.
Token ID13
Feature activation-3.02e-02

Pos logit contributions
 virtue+0.365
olding+0.354
 occasion+0.352
becca+0.348
 eater+0.346
Neg logit contributions
itched-0.127
He-0.113
 valleys-0.109
 sail-0.108
 sails-0.105
Token I
Token ID314
Feature activation-1.73e-01

Pos logit contributions
 eater+0.044
 virtue+0.043
 mistake+0.040
nie+0.039
 tutor+0.038
Neg logit contributions
 valleys-0.048
 sails-0.047
itched-0.046
 sail-0.046
He-0.045
Token.
Token ID13
Feature activation-2.86e-02

Pos logit contributions
 virtue+0.043
becca+0.042
 eater+0.041
leton+0.041
nie+0.040
Neg logit contributions
 sail-0.029
 sails-0.026
 gl-0.025
ink-0.025
itched-0.024
Token And
Token ID843
Feature activation-1.04e-01

Pos logit contributions
 virtue+0.041
 eater+0.040
nie+0.038
 tutor+0.037
 mistake+0.036
Neg logit contributions
 valleys-0.047
 sails-0.046
 sail-0.044
itched-0.044
 gl-0.041
Token you
Token ID345
Feature activation-2.40e-01

Pos logit contributions
 eater+0.139
 virtue+0.137
 tutor+0.131
nie+0.128
olding+0.126
Neg logit contributions
 sails-0.176
 sail-0.176
 valleys-0.172
 gl-0.172
 soared-0.157
Token say
Token ID910
Feature activation+9.27e-02

Pos logit contributions
 eater+0.387
 virtue+0.373
 tutor+0.362
olding+0.362
nie+0.350
Neg logit contributions
 sails-0.369
 sail-0.357
 valleys-0.342
 gl-0.332
 sailed-0.324
Token '
Token ID705
Feature activation-4.14e-02

Pos logit contributions
 sails+0.108
itched+0.105
 sail+0.104
 valleys+0.097
 gl+0.095
Neg logit contributions
 virtue-0.174
 eater-0.174
nie-0.172
olding-0.165
 student-0.157
Tokenyou
Token ID5832
Feature activation-2.52e-01

Pos logit contributions
 virtue+0.071
 tutor+0.071
 eater+0.069
 mistake+0.068
olding+0.065
Neg logit contributions
 valleys-0.058
 sails-0.052
He-0.051
 swoop-0.051
rolled-0.050
Token're
Token ID821
Feature activation+2.59e-02

Pos logit contributions
 eater+0.453
 virtue+0.418
 prank+0.417
nie+0.415
olding+0.409
Neg logit contributions
 sails-0.321
 sail-0.308
ink-0.278
 gl-0.272
 valleys-0.271
Token welcome
Token ID7062
Feature activation+5.48e-04

Pos logit contributions
 sails+0.029
 gl+0.029
 valleys+0.029
 sail+0.028
 soaring+0.026
Neg logit contributions
 eater-0.048
 virtue-0.047
olding-0.044
 occasion-0.043
nie-0.043
Token'
Token ID6
Feature activation-2.15e-02

Pos logit contributions
ink+0.000
itched+0.000
 sails+0.000
 sail+0.000
 valleys+0.000
Neg logit contributions
 virtue-0.001
nie-0.001
olding-0.001
 eater-0.001
leton-0.001
Token when
Token ID618
Feature activation-1.19e-01

Pos logit contributions
leton+0.038
 prank+0.037
 eater+0.036
becca+0.034
 virtue+0.034
Neg logit contributions
 sails-0.023
ink-0.022
He-0.021
 wide-0.020
itched-0.020
Token you
Token ID345
Feature activation-2.39e-01

Pos logit contributions
 virtue+0.150
 eater+0.145
olding+0.138
nie+0.129
 tutor+0.118
Neg logit contributions
 valleys-0.229
 sail-0.221
 sails-0.214
 wood-0.210
 mountains-0.209
Token him
Token ID683
Feature activation+6.34e-02

Pos logit contributions
 valleys+0.005
 sails+0.005
itched+0.004
He+0.004
 sail+0.004
Neg logit contributions
 virtue-0.015
 eater-0.015
olding-0.014
 tutor-0.014
nie-0.014
Token and
Token ID290
Feature activation-1.72e-02

Pos logit contributions
itched+0.065
 sails+0.065
 sail+0.064
He+0.063
 valleys+0.062
Neg logit contributions
 virtue-0.127
 inconvenience-0.123
 occasion-0.123
 eater-0.121
 anniversary-0.117
Token said
Token ID531
Feature activation+7.86e-02

Pos logit contributions
 virtue+0.027
 eater+0.027
nie+0.025
 tutor+0.025
olding+0.025
Neg logit contributions
 valleys-0.026
 sails-0.026
itched-0.024
 sail-0.024
 gl-0.023
Token,
Token ID11
Feature activation+1.24e-01

Pos logit contributions
 valleys+0.100
 sails+0.094
 sail+0.086
itched+0.086
 widened+0.080
Neg logit contributions
 virtue-0.148
 eater-0.145
nie-0.138
 tutor-0.138
olding-0.137
Token "
Token ID366
Feature activation-5.92e-02

Pos logit contributions
 valleys+0.167
 sails+0.161
itched+0.155
 sail+0.151
 gl+0.144
Neg logit contributions
 virtue-0.219
 eater-0.215
nie-0.202
 tutor-0.200
olding-0.197
Tokenyou
Token ID5832
Feature activation-2.52e-01

Pos logit contributions
 eater+0.111
 virtue+0.110
 tutor+0.105
 mistake+0.103
olding+0.102
Neg logit contributions
 valleys-0.075
He-0.071
 sails-0.067
His-0.065
 gl-0.062
Token don
Token ID836
Feature activation-1.81e-01

Pos logit contributions
 eater+0.450
 virtue+0.420
 tutor+0.416
olding+0.405
 student+0.401
Neg logit contributions
 sails-0.366
 sail-0.343
 gl-0.332
 sailed-0.322
 valleys-0.322
Token't
Token ID470
Feature activation-1.24e-01

Pos logit contributions
 virtue+0.161
 eater+0.140
 tutor+0.138
nie+0.136
 ever+0.136
Neg logit contributions
 valleys-0.413
 sails-0.392
 sail-0.366
 gl-0.365
itched-0.363
Token have
Token ID423
Feature activation-4.66e-02

Pos logit contributions
 eater+0.201
 tutor+0.197
becca+0.189
ae+0.188
aughter+0.183
Neg logit contributions
 sail-0.177
 sails-0.166
 swoop-0.162
He-0.162
 valleys-0.154
Token to
Token ID284
Feature activation-4.85e-02

Pos logit contributions
 eater+0.079
 virtue+0.073
aughter+0.070
olding+0.070
nie+0.068
Neg logit contributions
 sails-0.069
 valleys-0.066
 gl-0.061
 sail-0.061
 soared-0.060
Token weep
Token ID49671
Feature activation+5.50e-02

Pos logit contributions
 virtue+0.069
 eater+0.069
 tutor+0.065
nie+0.065
olding+0.062
Neg logit contributions
 sails-0.078
 sail-0.077
 valleys-0.076
itched-0.075
 gl-0.070
Tokenobby
Token ID11369
Feature activation+3.34e-02

Pos logit contributions
itched+0.058
 valleys+0.058
 sails+0.057
rolled+0.057
 sail+0.055
Neg logit contributions
 eater-0.039
 virtue-0.039
 tutor-0.037
 student-0.034
 mistake-0.034
Token nodded
Token ID14464
Feature activation+1.05e-01

Pos logit contributions
 valleys+0.043
 sails+0.043
itched+0.040
 sail+0.039
 gl+0.038
Neg logit contributions
 virtue-0.062
 eater-0.059
 tutor-0.056
olding-0.055
nie-0.055
Token and
Token ID290
Feature activation-1.94e-02

Pos logit contributions
 valleys+0.210
 sails+0.191
 soared+0.179
ink+0.178
br+0.174
Neg logit contributions
 eater-0.122
 before-0.121
 ever-0.119
 daughter-0.116
nie-0.115
Token they
Token ID484
Feature activation-8.06e-03

Pos logit contributions
 virtue+0.031
 eater+0.030
nie+0.029
 tutor+0.028
olding+0.028
Neg logit contributions
 valleys-0.030
 sails-0.028
itched-0.027
 sail-0.026
 widened-0.025
Token both
Token ID1111
Feature activation-1.33e-01

Pos logit contributions
 ever+0.011
nie+0.011
 promise+0.011
 eater+0.011
 daughter+0.010
Neg logit contributions
 valleys-0.013
itched-0.012
br-0.011
 sails-0.011
eeper-0.011
Token moved
Token ID3888
Feature activation+1.94e-01

Pos logit contributions
nie+0.209
 ever+0.206
 eater+0.200
 promise+0.197
 virtue+0.193
Neg logit contributions
 valleys-0.192
itched-0.189
 sails-0.173
rolling-0.168
ink-0.164
Token inside
Token ID2641
Feature activation+4.07e-02

Pos logit contributions
itched+0.319
 valleys+0.317
ink+0.312
 sails+0.309
rolling+0.293
Neg logit contributions
 ever-0.251
 virtue-0.232
nie-0.228
 before-0.228
 eater-0.227
Token the
Token ID262
Feature activation+8.70e-03

Pos logit contributions
 valleys+0.057
 sails+0.054
 sail+0.053
itched+0.052
 gl+0.050
Neg logit contributions
 virtue-0.075
 eater-0.073
nie-0.069
olding-0.067
 tutor-0.066
Token cave
Token ID11527
Feature activation+1.05e-01

Pos logit contributions
 valleys+0.010
 sails+0.009
itched+0.009
 sail+0.008
 widened+0.008
Neg logit contributions
 virtue-0.017
 eater-0.017
nie-0.016
 tutor-0.016
olding-0.016
Token.
Token ID13
Feature activation-2.75e-02

Pos logit contributions
 valleys+0.166
 sails+0.160
 sail+0.153
itched+0.151
 gl+0.149
Neg logit contributions
 virtue-0.169
 eater-0.165
nie-0.155
 tutor-0.153
olding-0.148
Token The
Token ID383
Feature activation+2.82e-02

Pos logit contributions
 virtue+0.047
 eater+0.046
nie+0.044
 tutor+0.043
olding+0.043
Neg logit contributions
 valleys-0.038
 sails-0.037
itched-0.035
 sail-0.034
 gl-0.033
Token he
Token ID339
Feature activation+1.03e-01

Pos logit contributions
 virtue+0.161
 eater+0.159
nie+0.150
olding+0.150
 tutor+0.147
Neg logit contributions
 valleys-0.090
 sails-0.082
itched-0.079
 sail-0.078
 gl-0.073
Token decided
Token ID3066
Feature activation+6.85e-02

Pos logit contributions
 valleys+0.132
 sails+0.131
 sail+0.122
 gl+0.118
itched+0.115
Neg logit contributions
 virtue-0.195
 eater-0.189
olding-0.178
 tutor-0.178
nie-0.173
Token to
Token ID284
Feature activation-4.42e-02

Pos logit contributions
 valleys+0.121
 sails+0.120
 sail+0.116
itched+0.111
 gl+0.111
Neg logit contributions
 virtue-0.095
 eater-0.095
nie-0.086
olding-0.082
 tutor-0.080
Token investigate
Token ID9161
Feature activation+1.08e-01

Pos logit contributions
 virtue+0.078
 eater+0.075
olding+0.073
nie+0.072
 tutor+0.069
Neg logit contributions
 sail-0.061
 sails-0.061
 valleys-0.059
itched-0.054
 gl-0.054
Token.
Token ID13
Feature activation-3.00e-02

Pos logit contributions
 valleys+0.159
 sails+0.151
 sail+0.140
 gl+0.134
itched+0.133
Neg logit contributions
 virtue-0.195
 eater-0.189
olding-0.171
nie-0.171
 tutor-0.169
Token He
Token ID679
Feature activation+1.87e-01

Pos logit contributions
 virtue+0.068
 eater+0.066
nie+0.063
 tutor+0.063
olding+0.062
Neg logit contributions
 valleys-0.028
 sails-0.025
itched-0.023
 sail-0.022
He-0.021
Token moved
Token ID3888
Feature activation+1.94e-01

Pos logit contributions
 valleys+0.230
 sails+0.222
 sail+0.203
itched+0.201
 gl+0.197
Neg logit contributions
 virtue-0.360
 eater-0.351
olding-0.331
 tutor-0.331
nie-0.328
Token closer
Token ID5699
Feature activation+1.36e-01

Pos logit contributions
 valleys+0.269
 sails+0.263
itched+0.260
 sail+0.246
ink+0.236
Neg logit contributions
 virtue-0.320
 eater-0.312
nie-0.306
 tutor-0.302
 ever-0.295
Token to
Token ID284
Feature activation-4.72e-02

Pos logit contributions
 valleys+0.251
 sails+0.246
itched+0.239
 sail+0.235
 widened+0.226
Neg logit contributions
 virtue-0.163
 eater-0.158
nie-0.151
 tutor-0.149
olding-0.145
Token get
Token ID651
Feature activation+4.44e-02

Pos logit contributions
 virtue+0.105
olding+0.096
 eater+0.096
nie+0.091
 tutor+0.088
Neg logit contributions
 valleys-0.056
 sail-0.053
 sails-0.051
 gl-0.045
 forests-0.043
Token a
Token ID257
Feature activation+1.93e-02

Pos logit contributions
 valleys+0.048
 sails+0.046
 sail+0.043
 gl+0.041
itched+0.038
Neg logit contributions
 virtue-0.094
 eater-0.093
nie-0.087
olding-0.086
 tutor-0.086
Token outside
Token ID2354
Feature activation-1.13e-03

Pos logit contributions
 valleys+0.034
 sails+0.032
 gl+0.030
 sail+0.029
itched+0.029
Neg logit contributions
 virtue-0.045
 eater-0.043
 tutor-0.040
olding-0.040
nie-0.040
Token and
Token ID290
Feature activation-1.99e-02

Pos logit contributions
 virtue+0.002
 eater+0.002
olding+0.002
nie+0.002
 tutor+0.002
Neg logit contributions
 valleys-0.002
 sails-0.001
itched-0.001
 sail-0.001
 gl-0.001
Token the
Token ID262
Feature activation+6.12e-03

Pos logit contributions
 virtue+0.039
 eater+0.038
olding+0.038
 student+0.034
 tutor+0.034
Neg logit contributions
 valleys-0.025
 gl-0.024
 sails-0.022
 sail-0.022
 widened-0.022
Token sun
Token ID4252
Feature activation+8.19e-03

Pos logit contributions
 valleys+0.006
 sails+0.005
 gl+0.005
 mountains+0.005
 sail+0.004
Neg logit contributions
 virtue-0.015
 eater-0.014
olding-0.014
 tutor-0.013
nie-0.013
Token was
Token ID373
Feature activation+2.50e-02

Pos logit contributions
 valleys+0.011
 sails+0.010
itched+0.010
 sail+0.010
 gl+0.010
Neg logit contributions
 virtue-0.015
 eater-0.014
 tutor-0.014
nie-0.014
olding-0.014
Token shining
Token ID22751
Feature activation+1.93e-01

Pos logit contributions
 valleys+0.035
 sails+0.034
itched+0.032
 sail+0.031
rolled+0.029
Neg logit contributions
 virtue-0.042
 eater-0.041
nie-0.040
 tutor-0.039
 ever-0.038
Token so
Token ID523
Feature activation-8.90e-02

Pos logit contributions
 valleys+0.338
 sails+0.330
itched+0.325
 sail+0.308
rolled+0.307
Neg logit contributions
 virtue-0.248
 eater-0.246
nie-0.238
 ever-0.225
 tutor-0.223
Token brightly
Token ID35254
Feature activation+4.96e-02

Pos logit contributions
 eater+0.170
 virtue+0.170
olding+0.161
 tutor+0.159
nie+0.157
Neg logit contributions
 valleys-0.110
 sails-0.101
 gl-0.099
 sail-0.099
 widened-0.094
Token.
Token ID13
Feature activation-2.71e-02

Pos logit contributions
 valleys+0.094
 sails+0.093
itched+0.089
 sail+0.086
rolling+0.085
Neg logit contributions
 virtue-0.057
 eater-0.056
nie-0.054
 ever-0.053
 before-0.051
Token Hannah
Token ID26013
Feature activation-1.79e-02

Pos logit contributions
 virtue+0.043
 eater+0.042
nie+0.039
 tutor+0.039
olding+0.039
Neg logit contributions
 valleys-0.042
 sails-0.041
itched-0.039
 sail-0.038
 gl-0.036
Token's
Token ID338
Feature activation-8.60e-03

Pos logit contributions
 virtue+0.037
 eater+0.035
olding+0.034
 tutor+0.034
 policy+0.033
Neg logit contributions
 sails-0.020
 valleys-0.020
 gl-0.019
 bent-0.019
 widened-0.018
Token a
Token ID257
Feature activation+1.59e-02

Pos logit contributions
 valleys+0.043
 sails+0.041
itched+0.039
rolled+0.038
rolling+0.038
Neg logit contributions
 eater-0.042
 virtue-0.040
nie-0.040
 tutor-0.039
 ever-0.039
Token farmer
Token ID18739
Feature activation+1.63e-03

Pos logit contributions
 valleys+0.026
 sails+0.026
 sail+0.026
itched+0.025
 gl+0.024
Neg logit contributions
 virtue-0.023
 eater-0.021
olding-0.021
nie-0.020
 tutor-0.019
Token called
Token ID1444
Feature activation+2.41e-02

Pos logit contributions
 valleys+0.002
 sails+0.002
 sail+0.002
itched+0.002
 gl+0.002
Neg logit contributions
 virtue-0.003
 eater-0.003
 tutor-0.003
olding-0.003
idents-0.003
Token Pete
Token ID17542
Feature activation+6.02e-02

Pos logit contributions
 valleys+0.037
 sails+0.036
 sail+0.035
itched+0.034
rolled+0.032
Neg logit contributions
 virtue-0.040
 eater-0.037
olding-0.035
 mistake-0.035
 promise-0.034
Token.
Token ID13
Feature activation-2.87e-02

Pos logit contributions
 sails+0.087
 valleys+0.087
 sail+0.084
itched+0.083
 gl+0.079
Neg logit contributions
 virtue-0.101
 eater-0.093
olding-0.090
 tutor-0.090
nie-0.086
Token He
Token ID679
Feature activation+1.85e-01

Pos logit contributions
 eater+0.037
 tutor+0.030
 tonight+0.030
 Molly+0.029
 virtue+0.029
Neg logit contributions
 widened-0.050
 sails-0.049
 prope-0.049
 valleys-0.049
 descended-0.048
Token was
Token ID373
Feature activation+2.92e-02

Pos logit contributions
 valleys+0.285
 sails+0.276
itched+0.263
 sail+0.260
 gl+0.250
Neg logit contributions
 virtue-0.295
 eater-0.285
 tutor-0.269
olding-0.268
nie-0.268
Token growing
Token ID3957
Feature activation+8.59e-02

Pos logit contributions
 valleys+0.047
 sails+0.045
itched+0.044
 widened+0.043
 sail+0.042
Neg logit contributions
 eater-0.044
 virtue-0.042
nie-0.042
 tutor-0.041
 ever-0.039
Token onions
Token ID24681
Feature activation-2.52e-02

Pos logit contributions
 valleys+0.126
 sails+0.121
itched+0.119
 sail+0.114
 widened+0.112
Neg logit contributions
 virtue-0.141
 eater-0.140
nie-0.134
 tutor-0.128
olding-0.126
Token in
Token ID287
Feature activation-6.20e-02

Pos logit contributions
 virtue+0.036
 eater+0.036
nie+0.034
 tutor+0.033
 ever+0.032
Neg logit contributions
 valleys-0.042
 sails-0.040
itched-0.039
 sail-0.038
 widened-0.037
Token his
Token ID465
Feature activation+1.25e-01

Pos logit contributions
 eater+0.085
nie+0.081
 virtue+0.081
 ever+0.078
 mistake+0.076
Neg logit contributions
itched-0.100
 sails-0.098
rolling-0.097
 valleys-0.097
 widened-0.097
Token bit
Token ID1643
Feature activation-9.99e-02

Pos logit contributions
 sail+0.029
 gl+0.026
 sails+0.026
 swoop+0.024
 kay+0.024
Neg logit contributions
olding-0.031
iders-0.030
you-0.030
 eater-0.029
icia-0.029
Token more
Token ID517
Feature activation-2.92e-02

Pos logit contributions
 virtue+0.165
 eater+0.160
 anniversary+0.151
becca+0.150
 tutor+0.149
Neg logit contributions
 sails-0.130
 sail-0.126
 gl-0.121
 valleys-0.119
He-0.119
Token careful
Token ID8161
Feature activation-4.09e-02

Pos logit contributions
 virtue+0.045
 eater+0.044
idents+0.042
aughter+0.042
olding+0.041
Neg logit contributions
 sails-0.042
 sail-0.041
 gl-0.039
 wood-0.037
 valleys-0.037
Token,
Token ID11
Feature activation+1.25e-01

Pos logit contributions
 virtue+0.068
 eater+0.062
olding+0.061
becca+0.059
leton+0.059
Neg logit contributions
 sails-0.056
 sail-0.055
itched-0.052
 valleys-0.049
ink-0.048
Token this
Token ID428
Feature activation-4.67e-02

Pos logit contributions
 sail+0.176
 sails+0.176
 gl+0.165
He+0.158
 valleys+0.153
Neg logit contributions
 virtue-0.203
 eater-0.196
olding-0.187
 tutor-0.181
leton-0.180
Token stream
Token ID4269
Feature activation+2.45e-01

Pos logit contributions
olding+0.070
otte+0.069
 eater+0.067
becca+0.066
acher+0.065
Neg logit contributions
 gl-0.072
 sails-0.070
 sail-0.066
 shone-0.063
 valleys-0.062
Token could
Token ID714
Feature activation-5.32e-02

Pos logit contributions
 sails+0.359
 valleys+0.337
 gl+0.334
 sail+0.334
ink+0.315
Neg logit contributions
 eater-0.382
 virtue-0.372
 student-0.368
 tutor-0.367
nie-0.356
Token be
Token ID307
Feature activation+6.57e-02

Pos logit contributions
 virtue+0.091
 eater+0.090
nie+0.084
 tutor+0.084
olding+0.083
Neg logit contributions
 sails-0.074
 valleys-0.074
 sail-0.073
itched-0.069
 gl-0.068
Token dangerous
Token ID4923
Feature activation+5.99e-03

Pos logit contributions
 valleys+0.094
 sails+0.091
 sail+0.084
 gl+0.082
itched+0.081
Neg logit contributions
 eater-0.115
 virtue-0.112
 tutor-0.109
olding-0.107
 student-0.104
Token".
Token ID1911
Feature activation-1.24e-01

Pos logit contributions
 sails+0.008
 valleys+0.008
 sail+0.008
 wood+0.007
 mountains+0.007
Neg logit contributions
 virtue-0.010
olding-0.010
becca-0.010
 eater-0.010
bek-0.009
Token
Token ID198
Feature activation+7.49e-02

Pos logit contributions
 virtue+0.249
 eater+0.240
nie+0.231
olding+0.227
 tutor+0.221
Neg logit contributions
 valleys-0.138
itched-0.130
 sails-0.130
 sail-0.126
rolled-0.119
Token was
Token ID373
Feature activation+2.32e-02

Pos logit contributions
 valleys+0.275
 sails+0.265
itched+0.253
 sail+0.248
 gl+0.237
Neg logit contributions
 virtue-0.288
 eater-0.280
nie-0.265
 tutor-0.263
olding-0.260
Token very
Token ID845
Feature activation-6.89e-02

Pos logit contributions
 valleys+0.039
 sails+0.036
itched+0.034
 sail+0.034
 widened+0.033
Neg logit contributions
 virtue-0.035
 eater-0.033
nie-0.032
 tutor-0.032
olding-0.031
Token sad
Token ID6507
Feature activation-1.94e-02

Pos logit contributions
 virtue+0.092
 eater+0.089
nie+0.083
 tutor+0.083
olding+0.083
Neg logit contributions
 valleys-0.126
 sails-0.120
itched-0.114
 sail-0.112
 widened-0.110
Token.
Token ID13
Feature activation-3.34e-02

Pos logit contributions
 virtue+0.035
 eater+0.034
nie+0.032
 tutor+0.032
becca+0.031
Neg logit contributions
 valleys-0.024
 sails-0.024
itched-0.024
 sail-0.024
He-0.023
Token
Token ID198
Feature activation+7.64e-02

Pos logit contributions
 virtue+0.058
 eater+0.057
nie+0.055
 tutor+0.055
olding+0.055
Neg logit contributions
 valleys-0.045
 sails-0.044
 sail-0.040
itched-0.040
 widened-0.038
Token
Token ID198
Feature activation+7.65e-02

Pos logit contributions
 valleys+0.101
 sails+0.098
 sail+0.089
itched+0.088
 gl+0.085
Neg logit contributions
 virtue-0.137
 eater-0.134
nie-0.129
 tutor-0.128
olding-0.128
TokenTim
Token ID14967
Feature activation-4.15e-02

Pos logit contributions
 valleys+0.143
 sails+0.139
 gl+0.125
 sail+0.124
itched+0.123
Neg logit contributions
 virtue-0.102
nie-0.098
 eater-0.096
olding-0.096
 tutor-0.089
Tokenmy
Token ID1820
Feature activation-4.03e-02

Pos logit contributions
 virtue+0.068
 eater+0.063
olding+0.060
 tutor+0.060
 occasion+0.057
Neg logit contributions
 valleys-0.063
 sails-0.062
itched-0.058
 swoop-0.057
 sail-0.057
Token's
Token ID338
Feature activation-1.14e-02

Pos logit contributions
 virtue+0.064
 eater+0.063
nie+0.060
 tutor+0.059
olding+0.058
Neg logit contributions
 valleys-0.062
 sails-0.059
itched-0.057
 sail-0.055
 gl-0.053
Token mom
Token ID1995
Feature activation-1.87e-01

Pos logit contributions
 virtue+0.017
nie+0.016
 eater+0.015
olding+0.015
you+0.015
Neg logit contributions
 sails-0.019
 sail-0.019
itched-0.017
 gl-0.017
 hood-0.017
Token saw
Token ID2497
Feature activation+9.69e-02

Pos logit contributions
 virtue+0.337
 eater+0.328
nie+0.311
 tutor+0.309
olding+0.306
Neg logit contributions
 valleys-0.250
 sails-0.240
itched-0.228
 sail-0.222
 gl-0.212
Token.
Token ID13
Feature activation-3.11e-02

Pos logit contributions
 virtue+0.072
nie+0.069
olding+0.067
 eater+0.067
 tutor+0.065
Neg logit contributions
 valleys-0.051
 sails-0.050
itched-0.048
 sail-0.048
 gl-0.045
Token From
Token ID3574
Feature activation-6.33e-02

Pos logit contributions
 eater+0.076
 virtue+0.074
 tutor+0.071
 exam+0.071
 student+0.071
Neg logit contributions
 valleys-0.021
itched-0.017
rolled-0.016
He-0.016
 sails-0.016
Token that
Token ID326
Feature activation-8.46e-02

Pos logit contributions
 virtue+0.098
 eater+0.096
nie+0.089
olding+0.087
 tutor+0.087
Neg logit contributions
 valleys-0.100
 sails-0.099
 sail-0.094
itched-0.094
 gl-0.091
Token day
Token ID1110
Feature activation-1.15e-01

Pos logit contributions
olding+0.118
 virtue+0.116
 eater+0.116
itor+0.114
idents+0.110
Neg logit contributions
 gl-0.133
 sail-0.128
 valleys-0.125
itched-0.121
 shining-0.121
Token on
Token ID319
Feature activation-3.82e-02

Pos logit contributions
 virtue+0.198
 eater+0.193
nie+0.184
 tutor+0.183
olding+0.181
Neg logit contributions
 valleys-0.161
 sails-0.156
itched-0.144
 sail-0.144
 widened-0.137
Token,
Token ID11
Feature activation+1.17e-01

Pos logit contributions
 virtue+0.097
 eater+0.095
olding+0.093
nie+0.090
 poet+0.088
Neg logit contributions
 valleys-0.024
itched-0.023
 sails-0.020
 sail-0.019
 gl-0.019
Token the
Token ID262
Feature activation+3.66e-03

Pos logit contributions
 valleys+0.140
 valley+0.118
 gl+0.110
 mountains+0.108
 sail+0.107
Neg logit contributions
 virtue-0.231
olding-0.227
 mistake-0.215
 poet-0.213
 apology-0.205
Token diamond
Token ID15291
Feature activation+1.78e-01

Pos logit contributions
 valleys+0.004
 sails+0.004
 sail+0.004
 gl+0.004
itched+0.004
Neg logit contributions
 virtue-0.007
 eater-0.007
olding-0.007
nie-0.007
 tutor-0.006
Token became
Token ID2627
Feature activation+1.39e-01

Pos logit contributions
 sails+0.237
 valleys+0.232
 gl+0.228
 sail+0.224
itched+0.216
Neg logit contributions
 virtue-0.326
 eater-0.306
olding-0.303
nie-0.297
 tutor-0.296
Token Lucy
Token ID22162
Feature activation-1.04e-01

Pos logit contributions
 valleys+0.212
 sails+0.204
itched+0.195
 sail+0.191
 gl+0.182
Neg logit contributions
 virtue-0.222
 eater-0.216
nie-0.204
 tutor-0.203
olding-0.200
Token's
Token ID338
Feature activation-8.20e-03

Pos logit contributions
 inconvenience+0.265
 virtue+0.248
 apology+0.245
 occasion+0.240
 topic+0.236
Neg logit contributions
 gl-0.063
ink-0.054
itched-0.050
,-0.044
 wide-0.043
Token were
Token ID547
Feature activation+6.40e-02

Pos logit contributions
 virtue+0.020
olding+0.019
 eater+0.019
 tutor+0.018
nie+0.018
Neg logit contributions
 valleys-0.013
 sails-0.013
 gl-0.012
 sailed-0.012
 sail-0.012
Token very
Token ID845
Feature activation-6.58e-02

Pos logit contributions
 valleys+0.087
 sails+0.085
 sail+0.080
itched+0.080
 gl+0.078
Neg logit contributions
 virtue-0.114
 eater-0.112
olding-0.107
 occasion-0.106
 tutor-0.104
Token happy
Token ID3772
Feature activation-3.83e-02

Pos logit contributions
 eater+0.127
 virtue+0.126
 tutor+0.121
 occasion+0.121
nie+0.119
Neg logit contributions
 valleys-0.074
 sail-0.074
 sails-0.073
 gl-0.069
itched-0.069
Token.
Token ID13
Feature activation-2.96e-02

Pos logit contributions
nie+0.087
 virtue+0.087
aughter+0.087
icia+0.085
becca+0.084
Neg logit contributions
 sail-0.033
 valleys-0.031
 sails-0.031
 gl-0.031
 soaring-0.029
Token The
Token ID383
Feature activation+2.78e-02

Pos logit contributions
 eater+0.066
 virtue+0.064
nie+0.064
 tutor+0.063
olding+0.063
Neg logit contributions
 valleys-0.027
 sails-0.025
 sail-0.022
itched-0.022
rolled-0.021
Token switch
Token ID5078
Feature activation+1.41e-01

Pos logit contributions
 sails+0.044
 sail+0.043
 valleys+0.041
 gl+0.041
 sparkling+0.037
Neg logit contributions
 eater-0.046
 virtue-0.044
nie-0.041
ccoli-0.040
 tonight-0.039
Token was
Token ID373
Feature activation+2.22e-02

Pos logit contributions
 sails+0.175
 gl+0.172
 valleys+0.162
 sail+0.162
 sailed+0.160
Neg logit contributions
 virtue-0.277
 eater-0.268
olding-0.258
nie-0.252
 tutor-0.247
Token happy
Token ID3772
Feature activation-3.85e-02

Pos logit contributions
 valleys+0.032
 sails+0.032
 sail+0.030
itched+0.030
 gl+0.030
Neg logit contributions
 eater-0.037
 virtue-0.036
nie-0.034
olding-0.034
 tutor-0.034
Token too
Token ID1165
Feature activation-1.91e-01

Pos logit contributions
nie+0.075
aughter+0.075
 virtue+0.075
icia+0.074
 eater+0.073
Neg logit contributions
 gl-0.045
 valleys-0.042
 sails-0.041
 sail-0.041
 soaring-0.039
Token because
Token ID780
Feature activation-1.66e-01

Pos logit contributions
 eater+0.367
 virtue+0.357
 tutor+0.330
nie+0.330
 occasion+0.325
Neg logit contributions
 valleys-0.237
 sails-0.235
 sail-0.235
itched-0.225
 widened-0.220
Token it
Token ID340
Feature activation+7.92e-06

Pos logit contributions
 eater+0.321
 virtue+0.309
nie+0.303
olding+0.293
 tutor+0.290
Neg logit contributions
 valleys-0.205
 sail-0.203
 sails-0.203
 gl-0.188
 sailed-0.178
Token and
Token ID290
Feature activation-2.29e-02

Pos logit contributions
 virtue+0.090
 eater+0.083
nie+0.083
icia+0.080
olding+0.079
Neg logit contributions
 gl-0.035
 valleys-0.033
 soaring-0.032
itched-0.031
 sails-0.031
Token stopped
Token ID5025
Feature activation+8.03e-02

Pos logit contributions
 virtue+0.043
 eater+0.042
nie+0.039
 tutor+0.039
olding+0.039
Neg logit contributions
 valleys-0.028
 sails-0.028
 gl-0.028
 sail-0.028
itched-0.027
Token feeling
Token ID4203
Feature activation-4.87e-02

Pos logit contributions
 valleys+0.117
 sails+0.113
itched+0.110
 sail+0.108
 gl+0.104
Neg logit contributions
 virtue-0.139
 eater-0.133
nie-0.127
olding-0.122
 tutor-0.122
Token worried
Token ID7960
Feature activation+2.35e-02

Pos logit contributions
 virtue+0.083
 eater+0.081
nie+0.077
 tutor+0.075
olding+0.075
Neg logit contributions
 valleys-0.071
 sails-0.068
 sail-0.065
itched-0.064
 gl-0.063
Token.
Token ID13
Feature activation-3.26e-02

Pos logit contributions
 valleys+0.029
 gl+0.028
 sails+0.028
 widened+0.027
 sail+0.027
Neg logit contributions
 virtue-0.044
 eater-0.042
nie-0.040
becca-0.040
 tutor-0.039
Token She
Token ID1375
Feature activation+1.23e-01

Pos logit contributions
 virtue+0.070
 eater+0.069
nie+0.067
 tutor+0.066
olding+0.065
Neg logit contributions
 valleys-0.032
 sails-0.029
itched-0.028
 sail-0.026
 gl-0.025
Token thanked
Token ID26280
Feature activation+2.59e-02

Pos logit contributions
 sails+0.163
 valleys+0.156
 gl+0.155
 sail+0.153
itched+0.148
Neg logit contributions
 virtue-0.223
 eater-0.219
olding-0.205
 tutor-0.203
nie-0.198
Token her
Token ID607
Feature activation+3.19e-02

Pos logit contributions
 sail+0.023
 valleys+0.022
 sails+0.022
 gl+0.021
itched+0.021
Neg logit contributions
 virtue-0.061
olding-0.055
nie-0.054
 eater-0.054
 student-0.051
Token mum
Token ID25682
Feature activation-7.87e-02

Pos logit contributions
 sail+0.047
 sails+0.046
 valleys+0.044
 gl+0.043
itched+0.043
Neg logit contributions
 virtue-0.053
nie-0.049
 eater-0.048
olding-0.046
 occasion-0.046
Token and
Token ID290
Feature activation-2.28e-02

Pos logit contributions
 virtue+0.165
 inconvenience+0.148
oker+0.147
 virus+0.147
 topic+0.145
Neg logit contributions
 sail-0.077
itched-0.073
 sails-0.073
 valleys-0.072
 gl-0.070
Token ran
Token ID4966
Feature activation+9.37e-02

Pos logit contributions
 virtue+0.042
 eater+0.040
nie+0.038
olding+0.038
 tutor+0.038
Neg logit contributions
 valleys-0.030
 sails-0.029
 sail-0.028
itched-0.027
 gl-0.027
Token all
Token ID477
Feature activation-1.98e-01

Pos logit contributions
 virtue+0.000
 eater+0.000
 tutor+0.000
 student+0.000
olding+0.000
Neg logit contributions
 sail-0.000
 sails-0.000
 valleys-0.000
 gl-0.000
itched-0.000
Token for
Token ID329
Feature activation-1.79e-01

Pos logit contributions
 virtue+0.399
 eater+0.394
 tutor+0.387
icia+0.376
 students+0.375
Neg logit contributions
 gl-0.200
 sail-0.196
 valleys-0.187
 soaring-0.183
 swoop-0.173
Token herself
Token ID5223
Feature activation+2.55e-02

Pos logit contributions
 virtue+0.397
 eater+0.390
olding+0.382
nie+0.381
 tutor+0.359
Neg logit contributions
itched-0.153
 valleys-0.147
 sail-0.126
 wood-0.122
 sails-0.121
Token.
Token ID13
Feature activation-3.16e-02

Pos logit contributions
 gl+0.017
 sail+0.016
 valleys+0.015
 soaring+0.015
itched+0.015
Neg logit contributions
 virtue-0.058
becca-0.055
 poet-0.054
 eater-0.054
 apology-0.054
Token
Token ID198
Feature activation+7.56e-02

Pos logit contributions
 virtue+0.070
 eater+0.068
nie+0.065
olding+0.064
 tutor+0.064
Neg logit contributions
 valleys-0.030
 sails-0.027
itched-0.027
 sail-0.025
rolled-0.024
Token
Token ID198
Feature activation+7.58e-02

Pos logit contributions
 valleys+0.083
 sails+0.079
itched+0.075
 sail+0.072
 gl+0.068
Neg logit contributions
 virtue-0.152
 eater-0.149
nie-0.142
 tutor-0.141
olding-0.140
TokenThe
Token ID464
Feature activation+2.52e-02

Pos logit contributions
 valleys+0.113
 sails+0.107
itched+0.107
He+0.103
 sail+0.102
Neg logit contributions
 virtue-0.122
 eater-0.120
nie-0.111
 tutor-0.110
olding-0.108
Token lady
Token ID10846
Feature activation-4.29e-02

Pos logit contributions
 sail+0.041
 sails+0.041
 valleys+0.041
 gl+0.040
itched+0.039
Neg logit contributions
 virtue-0.034
 eater-0.033
nie-0.033
icia-0.028
 behaviour-0.027
Token said
Token ID531
Feature activation+7.20e-02

Pos logit contributions
 virtue+0.091
 eater+0.081
 occasion+0.076
 behaviour+0.076
olding+0.076
Neg logit contributions
 gl-0.046
 sails-0.046
 bent-0.045
 swoop-0.044
 valleys-0.042
Token,
Token ID11
Feature activation+1.16e-01

Pos logit contributions
itched+0.054
 sails+0.050
 valleys+0.050
 sail+0.047
 gl+0.045
Neg logit contributions
 virtue-0.170
 eater-0.162
nie-0.161
 student-0.153
olding-0.153
Token "
Token ID366
Feature activation-6.38e-02

Pos logit contributions
 valleys+0.116
itched+0.112
 sails+0.111
 sail+0.106
 gl+0.105
Neg logit contributions
 virtue-0.250
 eater-0.238
nie-0.224
 promise-0.222
olding-0.220
Token.
Token ID13
Feature activation-3.15e-02

Pos logit contributions
 sails+0.076
 sail+0.075
 gl+0.072
 valleys+0.071
itched+0.069
Neg logit contributions
 virtue-0.111
olding-0.102
 eater-0.100
becca-0.098
nie-0.096
Token He
Token ID679
Feature activation+1.82e-01

Pos logit contributions
 virtue+0.072
 eater+0.069
olding+0.067
nie+0.067
 tutor+0.065
Neg logit contributions
 valleys-0.028
 sails-0.026
 sail-0.023
itched-0.023
He-0.022
Token was
Token ID373
Feature activation+2.68e-02

Pos logit contributions
 sails+0.243
 gl+0.241
 valleys+0.237
 sailed+0.232
 soared+0.226
Neg logit contributions
 virtue-0.334
olding-0.320
 tutor-0.301
 eater-0.298
aughter-0.297
Token happy
Token ID3772
Feature activation-3.79e-02

Pos logit contributions
 valleys+0.040
 sails+0.038
itched+0.036
 sail+0.036
 gl+0.034
Neg logit contributions
 virtue-0.045
 eater-0.043
olding-0.041
 tutor-0.041
nie-0.040
Token and
Token ID290
Feature activation-2.06e-02

Pos logit contributions
 virtue+0.079
aughter+0.073
icia+0.072
olding+0.071
nie+0.069
Neg logit contributions
 gl-0.044
 valleys-0.044
 sails-0.042
 sail-0.042
 mountains-0.041
Token good
Token ID922
Feature activation-7.83e-02

Pos logit contributions
 virtue+0.034
 eater+0.032
 tutor+0.031
olding+0.031
nie+0.030
Neg logit contributions
 valleys-0.031
 sails-0.030
 sail-0.029
itched-0.028
 gl-0.028
Token.
Token ID13
Feature activation-3.29e-02

Pos logit contributions
 virtue+0.167
 mistake+0.149
nie+0.149
 promise+0.145
icia+0.144
Neg logit contributions
 sail-0.083
 sails-0.078
 sailed-0.076
 gl-0.073
itched-0.073
Token
Token ID198
Feature activation+7.51e-02

Pos logit contributions
 virtue+0.074
 eater+0.071
olding+0.069
nie+0.069
 occasion+0.067
Neg logit contributions
 valleys-0.030
 sails-0.028
 sail-0.025
itched-0.025
He-0.025
Token
Token ID198
Feature activation+7.53e-02

Pos logit contributions
 valleys+0.096
 sails+0.094
itched+0.086
 sail+0.085
 gl+0.081
Neg logit contributions
 virtue-0.137
 eater-0.134
nie-0.129
 tutor-0.129
olding-0.126
TokenOne
Token ID3198
Feature activation+7.70e-02

Pos logit contributions
 valleys+0.083
 sails+0.080
itched+0.075
 sail+0.072
 gl+0.069
Neg logit contributions
 virtue-0.153
 eater-0.150
nie-0.145
 tutor-0.144
olding-0.141
Token day
Token ID1110
Feature activation-1.14e-01

Pos logit contributions
 sail+0.135
 gl+0.130
 valleys+0.128
He+0.127
 sailed+0.126
Neg logit contributions
 virtue-0.107
idents-0.093
 eater-0.089
 tutor-0.086
olding-0.086
Tokenily
Token ID813
Feature activation-1.13e-01

Pos logit contributions
 virtue+0.126
 tutor+0.118
 eater+0.118
idents+0.117
nie+0.110
Neg logit contributions
He-0.141
 valleys-0.138
itched-0.135
His-0.133
 sails-0.131
Token was
Token ID373
Feature activation+3.38e-02

Pos logit contributions
 virtue+0.219
 eater+0.204
olding+0.201
 occasion+0.200
 tutor+0.197
Neg logit contributions
 sails-0.138
 valleys-0.131
 gl-0.127
 widened-0.126
itched-0.125
Token happy
Token ID3772
Feature activation-3.25e-02

Pos logit contributions
 valleys+0.053
 sails+0.051
itched+0.049
 sail+0.048
 gl+0.046
Neg logit contributions
 virtue-0.053
 eater-0.052
nie-0.048
 tutor-0.048
olding-0.048
Token to
Token ID284
Feature activation-4.23e-02

Pos logit contributions
 virtue+0.077
icia+0.074
nie+0.073
aughter+0.072
bek+0.069
Neg logit contributions
 gl-0.026
He-0.024
 sail-0.024
itched-0.023
 bent-0.022
Token help
Token ID1037
Feature activation-2.27e-02

Pos logit contributions
 eater+0.058
 virtue+0.058
 ever+0.056
 tutor+0.055
 promise+0.053
Neg logit contributions
 valleys-0.076
 sails-0.071
itched-0.065
 gl-0.063
 widened-0.063
Token the
Token ID262
Feature activation+1.24e-02

Pos logit contributions
 virtue+0.056
olding+0.051
nie+0.051
 eater+0.051
leton+0.049
Neg logit contributions
 wood-0.015
itched-0.013
 sail-0.012
 gl-0.011
 his-0.011
Token lost
Token ID2626
Feature activation+9.03e-02

Pos logit contributions
 sail+0.019
 sails+0.018
 stream+0.017
 gl+0.016
 wood+0.016
Neg logit contributions
nie-0.021
 virtue-0.020
olding-0.019
 eater-0.019
icia-0.018
Token puppy
Token ID26188
Feature activation-3.61e-02

Pos logit contributions
 sail+0.097
 valleys+0.095
 sails+0.094
 gl+0.089
 wood+0.089
Neg logit contributions
 virtue-0.194
olding-0.179
nie-0.176
becca-0.167
Deal-0.165
Token.
Token ID13
Feature activation-3.14e-02

Pos logit contributions
 virtue+0.071
becca+0.066
nie+0.065
olding+0.063
 eater+0.063
Neg logit contributions
 sail-0.045
 sails-0.040
 valleys-0.039
 gl-0.038
itched-0.037
Token She
Token ID1375
Feature activation+1.23e-01

Pos logit contributions
 virtue+0.074
 eater+0.073
nie+0.072
olding+0.069
 tutor+0.068
Neg logit contributions
 valleys-0.024
 sails-0.021
itched-0.020
 sail-0.019
He-0.018
Token held
Token ID2714
Feature activation+1.46e-01

Pos logit contributions
 valleys+0.206
 sails+0.191
itched+0.187
 sail+0.178
rolling+0.172
Neg logit contributions
 virtue-0.182
 eater-0.179
nie-0.168
 tutor-0.167
 ever-0.162
Token from
Token ID422
Feature activation-1.26e-02

Pos logit contributions
 valleys+0.099
 sails+0.096
itched+0.090
 sail+0.089
rolling+0.087
Neg logit contributions
 virtue-0.082
 ever-0.080
 eater-0.079
nie-0.079
 before-0.077
Token the
Token ID262
Feature activation+6.34e-03

Pos logit contributions
 virtue+0.021
 eater+0.021
nie+0.020
 tutor+0.020
olding+0.019
Neg logit contributions
 valleys-0.018
 sails-0.017
itched-0.016
 sail-0.016
 widened-0.015
Token sky
Token ID6766
Feature activation+6.89e-02

Pos logit contributions
 valleys+0.007
 sails+0.006
 sail+0.006
 gl+0.005
 wood+0.005
Neg logit contributions
 virtue-0.014
 eater-0.013
olding-0.013
nie-0.013
 tutor-0.012
Token and
Token ID290
Feature activation-1.89e-02

Pos logit contributions
 valleys+0.100
 sails+0.097
itched+0.094
 gl+0.093
 sail+0.092
Neg logit contributions
 virtue-0.120
 eater-0.113
 tutor-0.109
nie-0.108
olding-0.105
Token land
Token ID1956
Feature activation+1.16e-01

Pos logit contributions
 virtue+0.041
 eater+0.040
olding+0.038
nie+0.038
 tutor+0.038
Neg logit contributions
 valleys-0.020
 sails-0.019
 sail-0.017
 gl-0.017
 swoop-0.017
Token on
Token ID319
Feature activation-3.79e-02

Pos logit contributions
 valleys+0.141
 sails+0.132
itched+0.131
 gl+0.126
 sail+0.121
Neg logit contributions
 virtue-0.231
 eater-0.223
nie-0.215
 tutor-0.213
 occasion-0.206
Token the
Token ID262
Feature activation+5.27e-03

Pos logit contributions
 virtue+0.073
 eater+0.071
nie+0.069
olding+0.069
 tutor+0.066
Neg logit contributions
 valleys-0.049
 sails-0.045
itched-0.043
 sail-0.042
 gl-0.041
Token ground
Token ID2323
Feature activation+1.18e-02

Pos logit contributions
 valleys+0.006
 sails+0.006
 sail+0.006
 gl+0.005
 wood+0.005
Neg logit contributions
 virtue-0.011
 eater-0.011
olding-0.011
 tutor-0.010
nie-0.010
Token.
Token ID13
Feature activation-2.94e-02

Pos logit contributions
 valleys+0.020
 sails+0.019
itched+0.018
 sail+0.018
 gl+0.018
Neg logit contributions
 virtue-0.017
 eater-0.017
nie-0.016
 tutor-0.016
olding-0.015
Token She
Token ID1375
Feature activation+1.22e-01

Pos logit contributions
 virtue+0.056
 eater+0.054
nie+0.053
olding+0.051
 tutor+0.050
Neg logit contributions
 valleys-0.037
 sails-0.035
itched-0.034
 sail-0.032
 gl-0.031
Token ran
Token ID4966
Feature activation+9.20e-02

Pos logit contributions
 valleys+0.175
 sails+0.168
itched+0.160
 sail+0.157
 gl+0.150
Neg logit contributions
 virtue-0.207
 eater-0.202
nie-0.192
 tutor-0.190
olding-0.188
Token me
Token ID502
Feature activation-5.90e-02

Pos logit contributions
 virtue+0.248
 eater+0.242
olding+0.232
 tutor+0.222
 student+0.218
Neg logit contributions
 sails-0.144
 sail-0.134
 gl-0.130
 valleys-0.130
 soared-0.125
Token think
Token ID892
Feature activation-1.74e-02

Pos logit contributions
 virtue+0.106
 eater+0.100
nie+0.095
olding+0.091
becca+0.088
Neg logit contributions
 sail-0.087
 sails-0.083
itched-0.076
rolled-0.074
 gl-0.074
Token of
Token ID286
Feature activation-5.71e-02

Pos logit contributions
 eater+0.032
 virtue+0.031
olding+0.027
 occasion+0.026
icia+0.026
Neg logit contributions
 sails-0.024
 sail-0.024
 valleys-0.023
 gl-0.022
 soaring-0.022
Token a
Token ID257
Feature activation+1.96e-02

Pos logit contributions
 eater+0.103
 virtue+0.103
olding+0.101
nie+0.092
 tutor+0.091
Neg logit contributions
 valleys-0.081
 sails-0.079
 sail-0.073
 gl-0.072
itched-0.070
Token joke
Token ID9707
Feature activation-3.04e-02

Pos logit contributions
 sail+0.027
 sails+0.025
 gl+0.024
 valleys+0.023
 swoop+0.023
Neg logit contributions
 eater-0.034
olding-0.034
 virtue-0.033
nie-0.032
becca-0.032
Token.
Token ID13
Feature activation-2.51e-02

Pos logit contributions
 virtue+0.055
olding+0.052
becca+0.050
nie+0.050
 eater+0.049
Neg logit contributions
 sails-0.037
br-0.036
 gl-0.036
 sail-0.035
He-0.033
Token I
Token ID314
Feature activation-1.73e-01

Pos logit contributions
 virtue+0.038
 eater+0.035
 mistake+0.035
 occasion+0.034
nie+0.034
Neg logit contributions
 sails-0.040
 valleys-0.039
 sail-0.038
He-0.037
itched-0.036
Token know
Token ID760
Feature activation+8.64e-02

Pos logit contributions
 eater+0.236
 virtue+0.235
 tutor+0.230
nie+0.218
olding+0.210
Neg logit contributions
 sails-0.291
 sail-0.284
 valleys-0.278
itched-0.276
 gl-0.274
Token!
Token ID0
Feature activation-5.37e-02

Pos logit contributions
 sails+0.124
 valleys+0.119
 gl+0.114
 sail+0.113
 soaring+0.108
Neg logit contributions
 virtue-0.155
 eater-0.150
olding-0.143
 occasion-0.136
nie-0.135
Token What
Token ID1867
Feature activation-9.52e-02

Pos logit contributions
 virtue+0.079
 mistake+0.075
 eater+0.075
 occasion+0.071
nie+0.070
Neg logit contributions
 sails-0.086
 valleys-0.083
He-0.083
 sail-0.082
His-0.079
Token did
Token ID750
Feature activation-7.56e-02

Pos logit contributions
 virtue+0.168
olding+0.162
 eater+0.161
 occasion+0.153
becca+0.148
Neg logit contributions
 sails-0.131
 gl-0.127
 sail-0.123
 valleys-0.118
 soared-0.117
Token forgot
Token ID16453
Feature activation-1.16e-01

Pos logit contributions
 virtue+0.251
 eater+0.242
 tutor+0.230
nie+0.223
olding+0.219
Neg logit contributions
 valleys-0.217
 sails-0.208
 sail-0.201
 gl-0.200
 soared-0.195
Token the
Token ID262
Feature activation+7.34e-03

Pos logit contributions
 virtue+0.239
 eater+0.237
nie+0.222
olding+0.220
 occasion+0.212
Neg logit contributions
 valleys-0.131
 sails-0.127
 gl-0.123
 soaring-0.113
 sail-0.113
Token big
Token ID1263
Feature activation+9.23e-02

Pos logit contributions
 gl+0.008
 sails+0.008
 sail+0.008
 valleys+0.008
 spark+0.007
Neg logit contributions
 eater-0.016
 virtue-0.015
enny-0.014
olding-0.014
otte-0.014
Token,
Token ID11
Feature activation+1.18e-01

Pos logit contributions
 sails+0.096
 valleys+0.095
 gl+0.094
 sail+0.092
itched+0.081
Neg logit contributions
 virtue-0.199
 eater-0.190
olding-0.189
becca-0.178
nie-0.178
Token white
Token ID2330
Feature activation+1.72e-01

Pos logit contributions
 gl+0.092
 valleys+0.089
 sails+0.080
 spark+0.073
 sparkling+0.073
Neg logit contributions
 virtue-0.304
 eater-0.293
olding-0.279
 anniversary-0.268
icia-0.268
Token cloud
Token ID6279
Feature activation-3.44e-04

Pos logit contributions
 sails+0.175
 gl+0.174
 valleys+0.171
 sail+0.166
itched+0.156
Neg logit contributions
 virtue-0.389
olding-0.373
 eater-0.370
nie-0.347
becca-0.347
Token.
Token ID13
Feature activation-3.18e-02

Pos logit contributions
 virtue+0.001
 eater+0.001
nie+0.001
 tutor+0.001
olding+0.001
Neg logit contributions
 gl-0.000
 valleys-0.000
 sail-0.000
 sails-0.000
itched-0.000
Token She
Token ID1375
Feature activation+1.20e-01

Pos logit contributions
 virtue+0.059
 eater+0.057
nie+0.054
 tutor+0.054
olding+0.054
Neg logit contributions
 valleys-0.040
 sails-0.040
itched-0.037
 sail-0.037
 widened-0.034
Token will
Token ID481
Feature activation-7.79e-02

Pos logit contributions
 gl+0.147
 sails+0.143
 soared+0.139
 valleys+0.138
 sailed+0.135
Neg logit contributions
 eater-0.233
 virtue-0.233
 tutor-0.226
olding-0.226
aughter-0.216
Token always
Token ID1464
Feature activation-9.38e-02

Pos logit contributions
 eater+0.114
 virtue+0.111
nie+0.106
olding+0.104
 tutor+0.102
Neg logit contributions
 sail-0.130
 valleys-0.129
 sails-0.128
 gl-0.125
 swoop-0.121
Token care
Token ID1337
Feature activation+4.18e-03

Pos logit contributions
 virtue+0.140
 eater+0.138
 tutor+0.126
olding+0.126
nie+0.125
Neg logit contributions
 sails-0.153
 sail-0.153
 valleys-0.149
 gl-0.143
 swoop-0.139
Token there
Token ID612
Feature activation-3.75e-04

Pos logit contributions
 sails+0.182
 valleys+0.181
itched+0.165
 sail+0.162
 widened+0.157
Neg logit contributions
 eater-0.172
 virtue-0.169
nie-0.168
 ever-0.161
 tutor-0.159
Token was
Token ID373
Feature activation+1.93e-02

Pos logit contributions
 eater+0.000
 virtue+0.000
 ever+0.000
nie+0.000
 mistake+0.000
Neg logit contributions
 valleys-0.001
 sails-0.001
 sail-0.001
itched-0.001
rolling-0.001
Token a
Token ID257
Feature activation+1.04e-02

Pos logit contributions
 valleys+0.029
 sails+0.028
itched+0.027
 sail+0.025
rolling+0.025
Neg logit contributions
 eater-0.031
 virtue-0.030
 tutor-0.029
nie-0.029
 ever-0.028
Token little
Token ID1310
Feature activation-9.03e-02

Pos logit contributions
 sail+0.016
 valleys+0.016
 sails+0.015
 gl+0.015
itched+0.014
Neg logit contributions
 virtue-0.017
olding-0.015
 eater-0.015
nie-0.015
 behaviour-0.014
Token duck
Token ID22045
Feature activation+6.17e-02

Pos logit contributions
 virtue+0.108
olding+0.093
 eater+0.092
nie+0.088
 tutor+0.083
Neg logit contributions
 valleys-0.176
 sails-0.175
 sail-0.173
 gl-0.165
itched-0.162
Token who
Token ID508
Feature activation-1.66e-01

Pos logit contributions
 valleys+0.088
 sails+0.086
itched+0.084
 sail+0.083
 gl+0.079
Neg logit contributions
 virtue-0.104
 eater-0.098
 tutor-0.095
nie-0.094
olding-0.092
Token loved
Token ID6151
Feature activation-7.39e-02

Pos logit contributions
 eater+0.213
 virtue+0.210
 ever+0.205
nie+0.202
 promise+0.179
Neg logit contributions
 valleys-0.301
 sails-0.289
itched-0.279
rolling-0.271
rolled-0.264
Token to
Token ID284
Feature activation-5.36e-02

Pos logit contributions
 virtue+0.098
 eater+0.091
nie+0.089
olding+0.088
 tutor+0.083
Neg logit contributions
 valleys-0.136
 sails-0.136
 sail-0.130
itched-0.125
 gl-0.125
Token swim
Token ID9422
Feature activation+1.95e-01

Pos logit contributions
 eater+0.074
 promise+0.068
 ever+0.065
 occasion+0.065
 tutor+0.065
Neg logit contributions
 valleys-0.096
 widened-0.082
itched-0.082
 sails-0.082
He-0.079
Token in
Token ID287
Feature activation-6.80e-02

Pos logit contributions
 valleys+0.299
 sails+0.291
 sail+0.281
itched+0.266
 gl+0.262
Neg logit contributions
 virtue-0.321
 eater-0.306
 tutor-0.292
nie-0.289
olding-0.288
Token the
Token ID262
Feature activation+6.10e-03

Pos logit contributions
 eater+0.087
 virtue+0.085
 tutor+0.082
 mistake+0.079
nie+0.079
Neg logit contributions
 valleys-0.115
itched-0.114
 sails-0.114
rolled-0.108
rolling-0.108
Token said
Token ID531
Feature activation+7.79e-02

Pos logit contributions
 virtue+0.303
 eater+0.278
olding+0.274
 tutor+0.274
 mistake+0.267
Neg logit contributions
 sails-0.157
 sail-0.151
 valleys-0.150
 sailed-0.148
 gl-0.147
Token yes
Token ID3763
Feature activation-6.04e-02

Pos logit contributions
 sails+0.102
 valleys+0.099
 sail+0.096
 gl+0.094
itched+0.094
Neg logit contributions
 virtue-0.142
nie-0.134
 eater-0.134
olding-0.130
 occasion-0.123
Token,
Token ID11
Feature activation+1.19e-01

Pos logit contributions
 eater+0.135
 virtue+0.132
 tutor+0.123
 student+0.120
olding+0.119
Neg logit contributions
itched-0.048
 sails-0.047
He-0.046
 sail-0.043
 gl-0.042
Token so
Token ID523
Feature activation-8.67e-02

Pos logit contributions
 sails+0.138
 valleys+0.134
 sail+0.128
 gl+0.125
He+0.116
Neg logit contributions
 virtue-0.250
 eater-0.233
nie-0.221
olding-0.219
 tutor-0.214
Token they
Token ID484
Feature activation-7.99e-03

Pos logit contributions
 virtue+0.183
 eater+0.176
olding+0.175
nie+0.171
 occasion+0.168
Neg logit contributions
 valleys-0.100
 gl-0.083
 sail-0.082
itched-0.081
 widened-0.080
Token all
Token ID477
Feature activation-1.95e-01

Pos logit contributions
 virtue+0.016
olding+0.015
 mistake+0.015
 eater+0.015
 tutor+0.015
Neg logit contributions
 sails-0.010
 sail-0.009
 sailed-0.009
 valleys-0.009
 gl-0.009
Token got
Token ID1392
Feature activation+7.86e-02

Pos logit contributions
 virtue+0.400
 eater+0.374
 tutor+0.370
olding+0.367
 mistake+0.352
Neg logit contributions
 valleys-0.221
 sails-0.218
 sail-0.206
 gl-0.199
 sailed-0.198
Token together
Token ID1978
Feature activation-2.78e-02

Pos logit contributions
 sails+0.106
 valleys+0.106
 sail+0.101
 gl+0.097
 sailed+0.094
Neg logit contributions
 virtue-0.150
 eater-0.137
nie-0.133
olding-0.133
 tutor-0.130
Token and
Token ID290
Feature activation-1.97e-02

Pos logit contributions
 virtue+0.050
 eater+0.047
nie+0.045
olding+0.045
 tutor+0.044
Neg logit contributions
 valleys-0.039
 sails-0.038
 sail-0.037
 gl-0.035
itched-0.035
Token made
Token ID925
Feature activation-1.21e-02

Pos logit contributions
 virtue+0.040
olding+0.037
 eater+0.036
 tutor+0.036
 mistake+0.035
Neg logit contributions
 valleys-0.025
 sails-0.024
 sail-0.023
 gl-0.023
 sailed-0.023
Token a
Token ID257
Feature activation+1.68e-02

Pos logit contributions
 virtue+0.024
olding+0.023
 eater+0.023
nie+0.022
 tutor+0.022
Neg logit contributions
 valleys-0.015
 sails-0.014
itched-0.013
 sail-0.013
 gl-0.013
Token"
Token ID1
Feature activation-2.39e-02

Pos logit contributions
 sails+0.139
 valleys+0.132
 sail+0.125
 sailed+0.122
 gl+0.120
Neg logit contributions
olding-0.094
 virtue-0.093
 tutor-0.088
 student-0.084
 mistake-0.083
TokenDon
Token ID3987
Feature activation-1.00e-01

Pos logit contributions
 virtue+0.038
 eater+0.037
 tutor+0.035
nie+0.035
olding+0.035
Neg logit contributions
 valleys-0.037
 sails-0.036
itched-0.035
 sail-0.034
rolled-0.032
Token't
Token ID470
Feature activation-1.29e-01

Pos logit contributions
 virtue+0.109
 eater+0.102
 tutor+0.097
nie+0.096
olding+0.094
Neg logit contributions
 valleys-0.206
 sails-0.199
itched-0.190
 sail-0.188
 gl-0.183
Token worry
Token ID5490
Feature activation-3.30e-02

Pos logit contributions
 virtue+0.194
 eater+0.182
aughter+0.174
 tutor+0.173
nie+0.170
Neg logit contributions
 sail-0.212
itched-0.201
 sails-0.200
 swoop-0.196
He-0.191
Token,
Token ID11
Feature activation+1.11e-01

Pos logit contributions
 virtue+0.073
 eater+0.072
 tutor+0.068
nie+0.067
olding+0.067
Neg logit contributions
 valleys-0.030
 sails-0.029
itched-0.028
 sail-0.026
 gl-0.024
Token we
Token ID356
Feature activation-1.94e-01

Pos logit contributions
 sails+0.203
 valleys+0.195
 sail+0.191
 gl+0.185
itched+0.180
Neg logit contributions
 virtue-0.156
 eater-0.146
nie-0.135
olding-0.130
 occasion-0.130
Token can
Token ID460
Feature activation-6.36e-02

Pos logit contributions
 eater+0.314
nie+0.303
 virtue+0.302
 tutor+0.300
becca+0.297
Neg logit contributions
 sails-0.277
 sail-0.270
 valleys-0.263
itched-0.258
aved-0.253
Token buy
Token ID2822
Feature activation+4.83e-02

Pos logit contributions
 virtue+0.083
 eater+0.081
nie+0.075
 tutor+0.074
olding+0.074
Neg logit contributions
 valleys-0.117
 sails-0.112
itched-0.107
 sail-0.105
 gl-0.103
Token some
Token ID617
Feature activation-4.91e-02

Pos logit contributions
 sails+0.063
 valleys+0.062
 sail+0.057
 gl+0.053
itched+0.053
Neg logit contributions
 virtue-0.093
 eater-0.089
olding-0.087
nie-0.084
 occasion-0.081
Token new
Token ID649
Feature activation+3.98e-02

Pos logit contributions
 virtue+0.091
 eater+0.086
nie+0.085
olding+0.081
icia+0.080
Neg logit contributions
 sails-0.067
 valleys-0.064
 sail-0.063
 gl-0.058
 wood-0.057
Token blocks
Token ID7021
Feature activation+8.93e-03

Pos logit contributions
 valleys+0.053
 sails+0.052
 sail+0.047
itched+0.045
 gl+0.044
Neg logit contributions
 virtue-0.075
 eater-0.072
nie-0.069
olding-0.069
 tutor-0.067
Token and
Token ID290
Feature activation-2.35e-02

Pos logit contributions
 virtue+0.163
 eater+0.153
nie+0.152
olding+0.151
becca+0.148
Neg logit contributions
 valleys-0.161
 sails-0.158
 gl-0.146
 sail-0.144
 mountains-0.141
Token friendship
Token ID14738
Feature activation-6.83e-02

Pos logit contributions
 virtue+0.040
 eater+0.039
nie+0.037
olding+0.037
 tutor+0.036
Neg logit contributions
 valleys-0.034
 sails-0.034
 sail-0.032
 gl-0.031
itched-0.031
Token.
Token ID13
Feature activation-3.06e-02

Pos logit contributions
 virtue+0.115
 eater+0.112
nie+0.108
olding+0.107
 tutor+0.104
Neg logit contributions
 valleys-0.102
 sails-0.099
 sail-0.093
 gl-0.092
itched-0.089
Token He
Token ID679
Feature activation+1.79e-01

Pos logit contributions
 virtue+0.072
 eater+0.069
nie+0.068
 promise+0.067
olding+0.067
Neg logit contributions
 valleys-0.026
 sails-0.023
 sail-0.020
itched-0.019
He-0.019
Token asked
Token ID1965
Feature activation+9.53e-02

Pos logit contributions
 sails+0.262
 valleys+0.252
 sail+0.246
 sailed+0.243
 gl+0.240
Neg logit contributions
 virtue-0.306
 eater-0.295
olding-0.290
 tutor-0.286
nie-0.272
Token all
Token ID477
Feature activation-1.98e-01

Pos logit contributions
 valleys+0.106
 sails+0.097
 sail+0.090
 wood+0.087
 forests+0.087
Neg logit contributions
 virtue-0.202
 eater-0.181
olding-0.181
idents-0.175
nie-0.175
Token the
Token ID262
Feature activation+3.48e-03

Pos logit contributions
 virtue+0.425
 eater+0.390
idents+0.373
olding+0.372
becca+0.363
Neg logit contributions
 valleys-0.217
 sails-0.213
 sail-0.203
 wood-0.184
 gl-0.183
Token animals
Token ID4695
Feature activation-9.19e-02

Pos logit contributions
 valleys+0.005
 sails+0.005
 sail+0.004
 wood+0.004
 gl+0.004
Neg logit contributions
 virtue-0.006
 eater-0.006
olding-0.006
nie-0.005
 tutor-0.005
Token to
Token ID284
Feature activation-5.21e-02

Pos logit contributions
 virtue+0.149
 eater+0.140
nie+0.133
 tutor+0.127
becca+0.126
Neg logit contributions
 sails-0.141
 valleys-0.141
 sail-0.137
 gl-0.130
itched-0.125
Token unite
Token ID24558
Feature activation+5.63e-02

Pos logit contributions
 virtue+0.070
 eater+0.068
 tutor+0.064
 ever+0.062
nie+0.062
Neg logit contributions
 valleys-0.093
 sails-0.087
itched-0.086
 widened-0.081
 gl-0.080
Token and
Token ID290
Feature activation-2.44e-02

Pos logit contributions
 valleys+0.071
 sails+0.071
 sail+0.066
 gl+0.062
itched+0.060
Neg logit contributions
 virtue-0.109
 eater-0.104
olding-0.102
nie-0.101
becca-0.097
TokenBut
Token ID1537
Feature activation-8.50e-02

Pos logit contributions
 virtue+0.118
 eater+0.113
nie+0.110
 tutor+0.110
 mistake+0.106
Neg logit contributions
 valleys-0.087
He-0.079
 sails-0.078
 sail-0.075
His-0.073
Token you
Token ID345
Feature activation-2.36e-01

Pos logit contributions
 virtue+0.100
 eater+0.096
 tutor+0.087
olding+0.086
nie+0.086
Neg logit contributions
 sails-0.167
 valleys-0.166
 sail-0.163
 gl-0.158
itched-0.155
Token should
Token ID815
Feature activation-7.44e-02

Pos logit contributions
 virtue+0.355
 eater+0.353
 tutor+0.327
olding+0.327
nie+0.322
Neg logit contributions
 sails-0.392
 valleys-0.380
 sail-0.374
itched-0.354
 gl-0.354
Token ask
Token ID1265
Feature activation+6.61e-02

Pos logit contributions
 virtue+0.090
 eater+0.087
nie+0.081
 tutor+0.080
olding+0.079
Neg logit contributions
 valleys-0.142
 sails-0.138
itched-0.133
 sail-0.131
 gl-0.127
Token before
Token ID878
Feature activation-2.08e-01

Pos logit contributions
 sails+0.082
 sail+0.080
 wood+0.079
 valleys+0.077
 mountains+0.074
Neg logit contributions
 virtue-0.120
 eater-0.120
oker-0.110
nie-0.105
 poet-0.103
Token you
Token ID345
Feature activation-2.37e-01

Pos logit contributions
 virtue+0.213
 eater+0.206
nie+0.189
olding+0.182
 tutor+0.172
Neg logit contributions
 valleys-0.455
 sails-0.440
itched-0.429
 sail-0.429
 gl-0.411
Token touch
Token ID3638
Feature activation+1.17e-01

Pos logit contributions
 virtue+0.386
 eater+0.375
 tutor+0.356
nie+0.354
olding+0.343
Neg logit contributions
 sails-0.367
 valleys-0.354
 sail-0.352
itched-0.332
 gl-0.326
Token something
Token ID1223
Feature activation-1.77e-02

Pos logit contributions
 valleys+0.169
 sails+0.164
 sail+0.150
itched+0.149
 gl+0.143
Neg logit contributions
 virtue-0.209
 eater-0.207
olding-0.194
nie-0.189
 tutor-0.187
Token that
Token ID326
Feature activation-8.88e-02

Pos logit contributions
 eater+0.027
 virtue+0.027
nie+0.025
 tutor+0.025
 student+0.024
Neg logit contributions
 sails-0.029
 valleys-0.028
 sail-0.027
 gl-0.027
itched-0.026
Token's
Token ID338
Feature activation-1.05e-02

Pos logit contributions
 virtue+0.147
 eater+0.144
olding+0.134
 tutor+0.132
nie+0.132
Neg logit contributions
 sails-0.140
 gl-0.135
 valleys-0.133
 sail-0.132
 soared-0.129
Token not
Token ID407
Feature activation-1.98e-01

Pos logit contributions
 eater+0.019
 virtue+0.018
 tutor+0.017
olding+0.017
nie+0.017
Neg logit contributions
 sails-0.015
 valleys-0.014
 sail-0.014
 gl-0.014
 sailed-0.013