TOP ACTIVATIONS
MAX = 0.331

Mom my explained how to paddle the board and move in
it was struggling to stay afloat , and Sandy felt bad
the boat and started to paddle . She explored deeper and
little n ooks and cr ann ies , and a cozy
ready . She started to paddle faster and when she got
back in the boat and paddle back to shore .
legs and how to stay afloat . N ora
him how to do ggy paddle . The naughty brother smiled
his mom and started to paddle away with the lumber .
his boat and began to paddle away , eagerly looking for
started to do do ggy paddle , she loved the way
in tight circles , or paddle her feet to travel around
She tried to paddle out , but the waves
ar , and started to paddle . The
Rob how to use a paddle to move the boat .
the boat and took the paddle . He pushed and pulled
wanted to go for a paddle . But Tom was too
so relieved and started to paddle towards the shore with the
a paddle and started to paddle . Suddenly , he heard
the n ooks and cr ann ies . After

MIN ACTIVATIONS
MIN = -0.296

not good . Ben , you need to share your toys
They are serious . Do you understand ?" Anna
We are friends . Do you want to be our friend
Go away , dog , you are bad !" Ben says
. " Can you lift the rock ?" L
of you . Anna , you need to share your desk
says , " Ben , you are very rude . You
" Ben , you need to drink water .
book for you . Do you want to hear a story
." Anna says you 're welcome to Ben .
, I am . Do you want to go downstairs ?"
you . They will make you strong and happy ."
from the book . Do you want to hear them ?"
farmer says . " Do you want to meet the bull
bed . They will keep you company and protect you from
your sister . Sara , you need to ask nicely and
our friendship stronger . Do you promise ?" Ben
Anna says . " But you have to be fair .
Maybe . Why don 't you try ? Make a wish
, Ben . And thank you , Lily . You did

INTERVAL 0.175 - 0.331
CONTAINS 1.847%

safe and everyone thanked the metal man for helping them .
bucket and the mole . He ran to his mom and
know what to do . He thought maybe he should keep
her happiness spread like a wave . Next ,
he heard a sound . He looked up and saw a

INTERVAL 0.018 - 0.175
CONTAINS 37.198%

people and things ," mom said . " Wow
and he lived in the garden . Every day he would
ara and Ben smile . They say , " That sounds
laughed and hugged them . " Of course ,
a time , there was a sweet dog named Spot .

INTERVAL -0.139 - 0.018
CONTAINS 55.593%

fast asleep . Suddenly , the wind picked up and sway
's go ," Mom said . She took Ben 's hand
her mom and she couldn 't wait to do it again
plate under their mom 's bed and waited for her to
smiled and said , " That 's my good , obedient

INTERVAL -0.296 - -0.139
CONTAINS 5.362%

One day , Lily 's mom asked her to stay in
them . And they are not better than forks . For
on top of me . I am warm and cozy ."
a special promise . His mom said , " Sure !
L ily and Sam are friends . They like to play

Token Mom
Token ID11254
Feature activation-1.11e-01

Pos logit contributions
 eater+0.168
 virtue+0.167
olding+0.162
 occasion+0.162
 anniversary+0.160
Neg logit contributions
 sail-0.067
 sails-0.064
 valleys-0.062
 gl-0.057
He-0.057
Tokenmy
Token ID1820
Feature activation-1.24e-02

Pos logit contributions
 eater+0.154
 virtue+0.149
 anniversary+0.139
 occasion+0.137
olding+0.133
Neg logit contributions
 sails-0.198
 valleys-0.191
 sail-0.190
 widened-0.186
 gl-0.185
Token explained
Token ID4893
Feature activation+1.19e-01

Pos logit contributions
 eater+0.022
 ever+0.021
 virtue+0.020
olding+0.020
 occasion+0.020
Neg logit contributions
 valleys-0.016
 sails-0.016
 widened-0.015
 sail-0.015
He-0.014
Token how
Token ID703
Feature activation-1.27e-01

Pos logit contributions
 sails+0.186
 valleys+0.180
 sail+0.176
 widened+0.173
 gl+0.168
Neg logit contributions
 eater-0.194
 virtue-0.184
 occasion-0.176
 anniversary-0.175
 ever-0.173
Token to
Token ID284
Feature activation-2.86e-02

Pos logit contributions
 eater+0.238
 virtue+0.220
 anniversary+0.215
 occasion+0.214
olding+0.209
Neg logit contributions
 sails-0.172
 valleys-0.166
 sail-0.165
 gl-0.157
He-0.155
Token paddle
Token ID39517
Feature activation+3.31e-01

Pos logit contributions
 eater+0.048
 ever+0.046
 virtue+0.045
 occasion+0.045
 anniversary+0.045
Neg logit contributions
 valleys-0.040
 widened-0.038
 sails-0.038
 hollow-0.035
 gl-0.034
Token the
Token ID262
Feature activation+1.04e-02

Pos logit contributions
 sails+0.452
 valleys+0.435
 sail+0.430
 widened+0.408
 gl+0.407
Neg logit contributions
 eater-0.614
 virtue-0.581
 anniversary-0.569
 occasion-0.561
olding-0.548
Token board
Token ID3096
Feature activation+9.09e-02

Pos logit contributions
 sails+0.010
 sail+0.009
 valleys+0.008
 gl+0.008
 widened+0.007
Neg logit contributions
 eater-0.025
 virtue-0.024
olding-0.023
 anniversary-0.023
 occasion-0.023
Token and
Token ID290
Feature activation-1.19e-02

Pos logit contributions
 sails+0.129
 sail+0.124
 valleys+0.123
 gl+0.120
 widened+0.117
Neg logit contributions
 eater-0.166
 virtue-0.156
 anniversary-0.152
olding-0.148
 occasion-0.147
Token move
Token ID1445
Feature activation+2.03e-01

Pos logit contributions
 eater+0.022
 virtue+0.021
olding+0.020
 occasion+0.020
 anniversary+0.020
Neg logit contributions
 sails-0.016
 valleys-0.016
 sail-0.016
 widened-0.015
 gl-0.015
Token in
Token ID287
Feature activation-4.02e-02

Pos logit contributions
 sails+0.280
 valleys+0.275
 sail+0.264
 widened+0.260
 gl+0.252
Neg logit contributions
 eater-0.367
 virtue-0.348
 anniversary-0.337
 occasion-0.334
olding-0.329
Token it
Token ID340
Feature activation+2.20e-02

Pos logit contributions
 eater+0.118
 virtue+0.115
 anniversary+0.109
 occasion+0.108
olding+0.106
Neg logit contributions
 sails-0.106
 valleys-0.105
 sail-0.103
 gl-0.101
 widened-0.099
Token was
Token ID373
Feature activation+1.94e-02

Pos logit contributions
 sails+0.029
 gl+0.029
 widened+0.028
 valleys+0.028
 sail+0.027
Neg logit contributions
 eater-0.041
 anniversary-0.039
 virtue-0.039
 occasion-0.038
olding-0.037
Token struggling
Token ID9648
Feature activation+7.06e-02

Pos logit contributions
 sails+0.031
 valleys+0.030
 gl+0.030
 sail+0.030
 widened+0.030
Neg logit contributions
 eater-0.031
 anniversary-0.028
 virtue-0.027
 occasion-0.027
olding-0.026
Token to
Token ID284
Feature activation-2.66e-02

Pos logit contributions
 sails+0.104
 valleys+0.102
 sail+0.099
 widened+0.098
 gl+0.097
Neg logit contributions
 eater-0.120
 anniversary-0.111
 virtue-0.109
 occasion-0.108
 guests-0.104
Token stay
Token ID2652
Feature activation+2.78e-02

Pos logit contributions
 eater+0.045
 virtue+0.042
 ever+0.041
 occasion+0.041
 anniversary+0.041
Neg logit contributions
 sails-0.039
 valleys-0.038
 widened-0.036
 sail-0.036
 gl-0.035
Token afloat
Token ID45759
Feature activation+3.30e-01

Pos logit contributions
 sails+0.041
 valleys+0.041
 widened+0.039
 sail+0.037
He+0.036
Neg logit contributions
 eater-0.042
 virtue-0.042
 ever-0.041
 mistake-0.041
 before-0.040
Token,
Token ID11
Feature activation+1.18e-01

Pos logit contributions
 sails+0.406
 valleys+0.404
 sail+0.383
 widened+0.376
 gl+0.368
Neg logit contributions
 eater-0.634
 virtue-0.611
 anniversary-0.600
 occasion-0.583
olding-0.579
Token and
Token ID290
Feature activation-9.92e-03

Pos logit contributions
 valleys+0.154
 sails+0.153
 gl+0.148
 sail+0.147
 widened+0.144
Neg logit contributions
 eater-0.225
 virtue-0.213
 anniversary-0.210
 occasion-0.206
olding-0.200
Token Sandy
Token ID17581
Feature activation+5.34e-02

Pos logit contributions
 eater+0.019
 virtue+0.019
 anniversary+0.018
 tutor+0.018
kids+0.018
Neg logit contributions
 valleys-0.012
 gl-0.012
 sails-0.012
 sail-0.012
 bent-0.011
Token felt
Token ID2936
Feature activation+8.32e-02

Pos logit contributions
 sails+0.068
 valleys+0.064
 sail+0.063
 widened+0.063
 gl+0.063
Neg logit contributions
 eater-0.105
 anniversary-0.101
 virtue-0.101
 occasion-0.098
olding-0.095
Token bad
Token ID2089
Feature activation-5.41e-02

Pos logit contributions
 sails+0.138
 valleys+0.134
 sail+0.128
 widened+0.126
itched+0.125
Neg logit contributions
 virtue-0.115
 ever-0.115
 eater-0.114
 before-0.108
olding-0.105
Token the
Token ID262
Feature activation+9.21e-03

Pos logit contributions
 sails+0.147
 valleys+0.144
 sail+0.139
 widened+0.138
 gl+0.132
Neg logit contributions
 eater-0.173
 virtue-0.164
 anniversary-0.157
 occasion-0.157
 ever-0.156
Token boat
Token ID8848
Feature activation+1.66e-01

Pos logit contributions
 widened+0.013
itched+0.012
 widen+0.011
 valleys+0.011
He+0.011
Neg logit contributions
 eater-0.014
 kid-0.014
 promise-0.014
 mistake-0.014
 virtue-0.014
Token and
Token ID290
Feature activation-1.58e-02

Pos logit contributions
 sails+0.254
 valleys+0.251
 widened+0.242
 sail+0.239
 gl+0.230
Neg logit contributions
 eater-0.268
 virtue-0.252
 ever-0.242
 occasion-0.241
 anniversary-0.241
Token started
Token ID2067
Feature activation+9.35e-02

Pos logit contributions
 eater+0.029
 ever+0.027
 virtue+0.027
 occasion+0.026
 anniversary+0.026
Neg logit contributions
 sails-0.020
 valleys-0.020
 widened-0.019
 sail-0.018
He-0.018
Token to
Token ID284
Feature activation-3.04e-02

Pos logit contributions
 widened+0.195
 sails+0.183
 valleys+0.183
 widen+0.181
He+0.179
Neg logit contributions
 ever-0.088
 eater-0.083
 before-0.082
 mistake-0.073
 guests-0.071
Token paddle
Token ID39517
Feature activation+3.30e-01

Pos logit contributions
 behave+0.049
 promise+0.049
 eater+0.049
 mistake+0.047
 ever+0.046
Neg logit contributions
 valleys-0.039
He-0.038
 widened-0.038
itched-0.037
 sails-0.035
Token.
Token ID13
Feature activation-7.33e-03

Pos logit contributions
 sails+0.511
 valleys+0.498
 widened+0.485
 sail+0.481
 gl+0.462
Neg logit contributions
 eater-0.522
 virtue-0.494
 ever-0.485
 occasion-0.468
olding-0.466
Token She
Token ID1375
Feature activation+1.50e-01

Pos logit contributions
 eater+0.017
 virtue+0.016
 anniversary+0.015
 occasion+0.015
 ever+0.015
Neg logit contributions
 sails-0.007
 valleys-0.007
 sail-0.006
 widened-0.006
 gl-0.006
Token explored
Token ID18782
Feature activation+7.84e-02

Pos logit contributions
 valleys+0.183
 sails+0.181
 widened+0.175
 sail+0.171
 hollow+0.163
Neg logit contributions
 eater-0.272
 ever-0.263
 virtue-0.260
 occasion-0.249
nie-0.248
Token deeper
Token ID9211
Feature activation+2.14e-01

Pos logit contributions
 sails+0.136
 widened+0.133
itched+0.131
 sail+0.128
 widen+0.126
Neg logit contributions
 ever-0.096
 before-0.095
 eater-0.092
 mistake-0.085
 occasion-0.085
Token and
Token ID290
Feature activation-1.63e-02

Pos logit contributions
 sail+0.371
 sails+0.370
 widened+0.364
itched+0.353
 gl+0.348
Neg logit contributions
 ever-0.253
 before-0.252
 eater-0.248
 mistake-0.235
 virtue-0.229
Token little
Token ID1310
Feature activation-7.17e-02

Pos logit contributions
 sails+0.109
 valleys+0.107
 sail+0.103
 widened+0.101
 gl+0.098
Neg logit contributions
 eater-0.162
 virtue-0.154
 anniversary-0.148
 occasion-0.148
 ever-0.147
Token n
Token ID299
Feature activation+6.33e-02

Pos logit contributions
 eater+0.136
 virtue+0.134
 anniversary+0.129
olding+0.128
becca+0.127
Neg logit contributions
 valleys-0.095
 sails-0.094
 sail-0.086
 widened-0.085
 gl-0.083
Tokenooks
Token ID31085
Feature activation+6.89e-02

Pos logit contributions
 sails+0.121
 widened+0.117
 valleys+0.115
 sail+0.115
 gl+0.111
Neg logit contributions
 eater-0.085
 virtue-0.078
olding-0.076
 ever-0.076
 anniversary-0.073
Token and
Token ID290
Feature activation-1.79e-02

Pos logit contributions
 valleys+0.088
 sails+0.087
 widened+0.083
 gl+0.081
 sail+0.079
Neg logit contributions
 virtue-0.133
 eater-0.131
 anniversary-0.125
 occasion-0.123
becca-0.120
Token cr
Token ID1067
Feature activation+2.95e-02

Pos logit contributions
 eater+0.034
 virtue+0.032
 anniversary+0.031
 occasion+0.031
 ever+0.031
Neg logit contributions
 sails-0.023
 valleys-0.022
 sail-0.021
 widened-0.021
 gl-0.020
Tokenann
Token ID1236
Feature activation+3.30e-01

Pos logit contributions
 sail+0.052
 valleys+0.048
 sailed+0.045
 paddle+0.045
 sails+0.045
Neg logit contributions
 ever-0.039
olding-0.036
 guests-0.036
t-0.036
kids-0.035
Tokenies
Token ID444
Feature activation+1.15e-02

Pos logit contributions
 sails+0.620
 valleys+0.606
 sail+0.585
 widened+0.583
 gl+0.570
Neg logit contributions
 eater-0.428
 virtue-0.407
 occasion-0.377
 anniversary-0.376
olding-0.365
Token,
Token ID11
Feature activation+1.10e-01

Pos logit contributions
 sails+0.017
 valleys+0.017
 sail+0.016
 widened+0.016
 gl+0.016
Neg logit contributions
 eater-0.019
 virtue-0.019
 anniversary-0.018
 occasion-0.018
olding-0.017
Token and
Token ID290
Feature activation-1.82e-02

Pos logit contributions
 sails+0.151
 valleys+0.148
 sail+0.139
 widened+0.138
 gl+0.137
Neg logit contributions
 eater-0.204
 virtue-0.198
 anniversary-0.187
 occasion-0.187
olding-0.185
Token a
Token ID257
Feature activation+3.49e-02

Pos logit contributions
 eater+0.034
 virtue+0.034
olding+0.032
 occasion+0.032
 anniversary+0.032
Neg logit contributions
 sails-0.024
 valleys-0.023
 sail-0.022
 gl-0.022
 widened-0.021
Token cozy
Token ID37438
Feature activation+9.12e-02

Pos logit contributions
 sails+0.044
 sail+0.043
 valleys+0.042
 gl+0.041
 widened+0.040
Neg logit contributions
 eater-0.066
 virtue-0.063
 anniversary-0.062
olding-0.062
 occasion-0.061
Token ready
Token ID3492
Feature activation-1.56e-02

Pos logit contributions
 valleys+0.085
 sails+0.084
 widened+0.082
 sail+0.077
itched+0.076
Neg logit contributions
 eater-0.082
 ever-0.079
 virtue-0.076
olding-0.073
 guests-0.073
Token.
Token ID13
Feature activation-7.63e-03

Pos logit contributions
 eater+0.022
 virtue+0.020
 anniversary+0.019
 occasion+0.019
olding+0.018
Neg logit contributions
 sails-0.028
 sail-0.028
 valleys-0.027
 widened-0.026
 gl-0.026
Token She
Token ID1375
Feature activation+1.49e-01

Pos logit contributions
 eater+0.019
 virtue+0.018
 anniversary+0.018
 ever+0.017
 occasion+0.017
Neg logit contributions
 valleys-0.005
 sails-0.005
 sail-0.005
 widened-0.005
 gl-0.004
Token started
Token ID2067
Feature activation+9.27e-02

Pos logit contributions
 sails+0.199
 valleys+0.193
 sail+0.187
 widened+0.185
 gl+0.177
Neg logit contributions
 eater-0.274
 virtue-0.260
 anniversary-0.250
 occasion-0.249
 ever-0.248
Token to
Token ID284
Feature activation-3.01e-02

Pos logit contributions
 widened+0.172
 valleys+0.168
 sails+0.166
 widen+0.157
He+0.157
Neg logit contributions
 eater-0.108
 ever-0.105
 mistake-0.101
 behaviour-0.099
 before-0.099
Token paddle
Token ID39517
Feature activation+3.29e-01

Pos logit contributions
 eater+0.052
 virtue+0.049
 ever+0.049
 mistake+0.048
 anniversary+0.048
Neg logit contributions
 valleys-0.040
 sails-0.038
 widened-0.038
He-0.036
itched-0.035
Token faster
Token ID5443
Feature activation+6.68e-02

Pos logit contributions
 sails+0.476
 valleys+0.461
 sail+0.448
 widened+0.446
 gl+0.425
Neg logit contributions
 eater-0.556
 virtue-0.526
 ever-0.515
 occasion-0.502
olding-0.498
Token and
Token ID290
Feature activation-1.71e-02

Pos logit contributions
 sails+0.108
 valleys+0.105
 sail+0.102
 widened+0.102
 gl+0.098
Neg logit contributions
 eater-0.102
 virtue-0.096
 ever-0.093
 anniversary-0.091
 occasion-0.091
Token when
Token ID618
Feature activation-1.10e-01

Pos logit contributions
 eater+0.030
 virtue+0.029
 ever+0.029
 occasion+0.028
olding+0.027
Neg logit contributions
 sails-0.023
 valleys-0.022
 widened-0.021
 sail-0.021
itched-0.020
Token she
Token ID673
Feature activation+5.85e-02

Pos logit contributions
 eater+0.234
 virtue+0.220
olding+0.215
 anniversary+0.211
nie+0.209
Neg logit contributions
 valleys-0.133
 sail-0.121
 gl-0.121
 sails-0.118
 widened-0.112
Token got
Token ID1392
Feature activation+5.31e-02

Pos logit contributions
 sails+0.079
 valleys+0.078
 sail+0.074
 widened+0.074
 gl+0.070
Neg logit contributions
 eater-0.104
 virtue-0.098
 ever-0.097
 occasion-0.094
 anniversary-0.094
Token back
Token ID736
Feature activation-1.10e-02

Pos logit contributions
 sails+0.035
 sail+0.033
 valleys+0.033
 widened+0.031
 gl+0.031
Neg logit contributions
 eater-0.054
 virtue-0.051
 anniversary-0.049
 occasion-0.049
olding-0.048
Token in
Token ID287
Feature activation-4.18e-02

Pos logit contributions
 eater+0.020
 virtue+0.019
 anniversary+0.018
 occasion+0.018
olding+0.017
Neg logit contributions
 sails-0.016
 valleys-0.015
 sail-0.015
 widened-0.014
 gl-0.014
Token the
Token ID262
Feature activation+9.28e-03

Pos logit contributions
 eater+0.088
 virtue+0.082
 anniversary+0.080
olding+0.079
becca+0.079
Neg logit contributions
 sails-0.048
 sail-0.047
 valleys-0.046
 widened-0.043
 gl-0.042
Token boat
Token ID8848
Feature activation+1.65e-01

Pos logit contributions
 widened+0.014
itched+0.014
 valleys+0.013
 sails+0.013
 gl+0.012
Neg logit contributions
 virtue-0.014
 eater-0.013
 mistake-0.013
 occasion-0.013
 promise-0.013
Token and
Token ID290
Feature activation-1.91e-02

Pos logit contributions
 sails+0.227
 sail+0.223
 valleys+0.213
 gl+0.206
 widened+0.202
Neg logit contributions
 eater-0.306
 virtue-0.290
 anniversary-0.281
 occasion-0.274
becca-0.271
Token paddle
Token ID39517
Feature activation+3.29e-01

Pos logit contributions
 eater+0.037
 virtue+0.035
olding+0.034
 occasion+0.034
 anniversary+0.033
Neg logit contributions
 sails-0.025
 sail-0.025
 valleys-0.023
 widened-0.022
 gl-0.022
Token back
Token ID736
Feature activation-1.03e-02

Pos logit contributions
 sails+0.432
 valleys+0.418
 sail+0.407
 widened+0.393
 gl+0.381
Neg logit contributions
 eater-0.631
 virtue-0.592
 anniversary-0.587
 occasion-0.576
 exam-0.559
Token to
Token ID284
Feature activation-2.96e-02

Pos logit contributions
 eater+0.019
 virtue+0.018
 anniversary+0.017
 occasion+0.017
olding+0.016
Neg logit contributions
 sails-0.014
 valleys-0.014
 sail-0.014
 widened-0.013
 gl-0.013
Token shore
Token ID15191
Feature activation+1.77e-01

Pos logit contributions
 eater+0.062
 virtue+0.059
olding+0.057
 anniversary+0.056
 occasion+0.056
Neg logit contributions
 sail-0.035
 sails-0.034
 valleys-0.033
 gl-0.030
 widened-0.029
Token.
Token ID13
Feature activation-6.06e-03

Pos logit contributions
 sails+0.231
 valleys+0.230
 sail+0.229
He+0.207
 gl+0.205
Neg logit contributions
 eater-0.333
 virtue-0.324
 anniversary-0.313
 occasion-0.308
 exam-0.299
Token
Token ID198
Feature activation+8.16e-02

Pos logit contributions
 eater+0.013
 virtue+0.012
 ever+0.012
 anniversary+0.012
 occasion+0.012
Neg logit contributions
 sails-0.007
 valleys-0.006
 sail-0.006
 widened-0.006
 gl-0.006
Token legs
Token ID7405
Feature activation+8.81e-02

Pos logit contributions
 sails+0.034
 valleys+0.032
 sail+0.032
 widened+0.031
 gl+0.030
Neg logit contributions
 eater-0.052
 virtue-0.049
 anniversary-0.047
 occasion-0.047
 ever-0.047
Token and
Token ID290
Feature activation-8.40e-03

Pos logit contributions
 sails+0.133
 valleys+0.130
 sail+0.127
 widened+0.125
 gl+0.122
Neg logit contributions
 eater-0.145
 virtue-0.136
 anniversary-0.131
 occasion-0.130
 ever-0.128
Token how
Token ID703
Feature activation-1.23e-01

Pos logit contributions
 eater+0.015
 virtue+0.014
 anniversary+0.014
 occasion+0.014
olding+0.014
Neg logit contributions
 sails-0.012
 valleys-0.011
 sail-0.011
 widened-0.011
 gl-0.011
Token to
Token ID284
Feature activation-2.44e-02

Pos logit contributions
 eater+0.211
 anniversary+0.192
 virtue+0.191
 occasion+0.189
kids+0.182
Neg logit contributions
 valleys-0.187
 sails-0.184
 sail-0.180
 gl-0.174
 widened-0.172
Token stay
Token ID2652
Feature activation+3.07e-02

Pos logit contributions
 eater+0.042
 virtue+0.040
 ever+0.039
 occasion+0.039
 anniversary+0.038
Neg logit contributions
 sails-0.034
 valleys-0.034
 widened-0.032
 sail-0.031
 gl-0.030
Token afloat
Token ID45759
Feature activation+3.29e-01

Pos logit contributions
 valleys+0.049
 sails+0.049
 widened+0.047
 sail+0.045
 gl+0.044
Neg logit contributions
 eater-0.045
 virtue-0.044
 ever-0.042
 mistake-0.041
 anniversary-0.040
Token.
Token ID13
Feature activation+4.19e-03

Pos logit contributions
 sails+0.443
 valleys+0.439
 sail+0.432
 widened+0.413
 gl+0.411
Neg logit contributions
 eater-0.588
 virtue-0.574
 anniversary-0.560
olding-0.549
kids-0.539
Token
Token ID198
Feature activation+8.98e-02

Pos logit contributions
 sails+0.005
 valleys+0.005
 sail+0.005
 widened+0.005
 gl+0.004
Neg logit contributions
 eater-0.008
 virtue-0.008
 occasion-0.008
 anniversary-0.008
 ever-0.008
Token
Token ID198
Feature activation+8.86e-02

Pos logit contributions
 sails+0.126
 valleys+0.119
 sail+0.114
 widened+0.114
 gl+0.109
Neg logit contributions
 eater-0.158
 virtue-0.153
 ever-0.149
olding-0.148
 occasion-0.147
TokenN
Token ID45
Feature activation+1.20e-02

Pos logit contributions
 sails+0.152
 valleys+0.147
 sail+0.142
 widened+0.139
 gl+0.136
Neg logit contributions
 eater-0.132
 virtue-0.127
 occasion-0.120
 ever-0.118
olding-0.117
Tokenora
Token ID5799
Feature activation+1.15e-01

Pos logit contributions
 sails+0.020
 valleys+0.019
 sail+0.019
 widened+0.018
 gl+0.018
Neg logit contributions
 eater-0.018
 virtue-0.017
 anniversary-0.016
 occasion-0.016
 ever-0.016
Token him
Token ID683
Feature activation+4.23e-02

Pos logit contributions
 sails+0.068
 valleys+0.064
 widened+0.063
 sail+0.061
 gl+0.061
Neg logit contributions
 virtue-0.153
 eater-0.152
 occasion-0.142
olding-0.141
 hour-0.140
Token how
Token ID703
Feature activation-1.29e-01

Pos logit contributions
 sails+0.039
 sail+0.038
 gl+0.033
 valleys+0.032
 wood+0.031
Neg logit contributions
 virtue-0.093
 occasion-0.093
 inconvenience-0.092
 hour-0.090
 eater-0.090
Token to
Token ID284
Feature activation-3.16e-02

Pos logit contributions
 eater+0.252
 virtue+0.231
 anniversary+0.228
 occasion+0.228
kids+0.226
Neg logit contributions
He-0.160
 sails-0.159
 sail-0.159
 gl-0.159
 valleys-0.157
Token do
Token ID466
Feature activation-2.26e-02

Pos logit contributions
 eater+0.059
 virtue+0.056
 ever+0.054
 occasion+0.054
 anniversary+0.054
Neg logit contributions
 sails-0.040
 valleys-0.040
 widened-0.038
 sail-0.037
 gl-0.036
Tokenggy
Token ID19970
Feature activation-8.33e-02

Pos logit contributions
 eater+0.043
 virtue+0.041
 anniversary+0.039
 occasion+0.039
olding+0.039
Neg logit contributions
 sails-0.030
 valleys-0.029
 sail-0.028
 widened-0.027
 gl-0.027
Token paddle
Token ID39517
Feature activation+3.29e-01

Pos logit contributions
 virtue+0.167
 anniversary+0.163
 occasion+0.162
 eater+0.160
kids+0.159
Neg logit contributions
 sails-0.093
 sail-0.093
 gl-0.089
 widened-0.088
 valleys-0.086
Token.
Token ID13
Feature activation-5.90e-03

Pos logit contributions
 sails+0.416
 valleys+0.397
 sail+0.388
He+0.379
 gl+0.375
Neg logit contributions
 eater-0.636
 anniversary-0.629
 virtue-0.610
 occasion-0.606
becca-0.594
Token The
Token ID383
Feature activation+3.57e-02

Pos logit contributions
 eater+0.011
 virtue+0.010
 anniversary+0.010
 occasion+0.010
 ever+0.010
Neg logit contributions
 sails-0.008
 valleys-0.008
 sail-0.008
 widened-0.008
 gl-0.007
Token naughty
Token ID45128
Feature activation-1.36e-01

Pos logit contributions
 sails+0.054
 sail+0.053
He+0.050
 gl+0.049
 widened+0.047
Neg logit contributions
 virtue-0.060
 eater-0.059
 anniversary-0.056
olding-0.055
 ever-0.055
Token brother
Token ID3956
Feature activation-9.69e-02

Pos logit contributions
 virtue+0.224
bek+0.190
Deal+0.189
 anniversary+0.186
nes+0.184
Neg logit contributions
 sails-0.223
 sail-0.223
 spark-0.205
 widened-0.203
 gl-0.201
Token smiled
Token ID13541
Feature activation+9.27e-02

Pos logit contributions
 eater+0.194
 occasion+0.193
 anniversary+0.193
 virtue+0.192
 exam+0.186
Neg logit contributions
 sails-0.125
 sail-0.114
 widened-0.113
 gl-0.111
 sailed-0.107
Token his
Token ID465
Feature activation+1.09e-01

Pos logit contributions
 eater+0.063
 virtue+0.062
 anniversary+0.059
olding+0.058
 occasion+0.058
Neg logit contributions
 sail-0.032
 sails-0.031
 valleys-0.029
 gl-0.026
 widened-0.026
Token mom
Token ID1995
Feature activation-1.78e-01

Pos logit contributions
 sail+0.203
 sails+0.197
 gl+0.175
 hollow+0.171
 valleys+0.171
Neg logit contributions
 virtue-0.156
 eater-0.155
you-0.148
 occasion-0.146
olding-0.140
Token and
Token ID290
Feature activation-1.46e-02

Pos logit contributions
 eater+0.321
 virtue+0.315
 anniversary+0.293
 occasion+0.292
olding+0.286
Neg logit contributions
 sails-0.245
 sail-0.236
 valleys-0.235
 widened-0.226
 gl-0.223
Token started
Token ID2067
Feature activation+9.36e-02

Pos logit contributions
 eater+0.028
 virtue+0.026
 anniversary+0.025
 occasion+0.025
 ever+0.025
Neg logit contributions
 sails-0.019
 valleys-0.018
 sail-0.018
 widened-0.017
 gl-0.017
Token to
Token ID284
Feature activation-2.97e-02

Pos logit contributions
 widened+0.184
 valleys+0.180
 sails+0.179
He+0.168
 widen+0.167
Neg logit contributions
 ever-0.101
 eater-0.101
 mistake-0.092
 before-0.092
 anniversary-0.090
Token paddle
Token ID39517
Feature activation+3.28e-01

Pos logit contributions
 eater+0.042
 ever+0.039
 mistake+0.038
 anniversary+0.038
 occasion+0.037
Neg logit contributions
 valleys-0.050
 sails-0.049
 widened-0.048
He-0.046
itched-0.045
Token away
Token ID1497
Feature activation+8.44e-02

Pos logit contributions
 sails+0.482
 valleys+0.470
 widened+0.455
 sail+0.453
 gl+0.434
Neg logit contributions
 eater-0.551
 virtue-0.520
 ever-0.505
 occasion-0.496
olding-0.495
Token with
Token ID351
Feature activation-6.77e-02

Pos logit contributions
 sails+0.134
 valleys+0.132
 sail+0.128
 widened+0.127
 gl+0.122
Neg logit contributions
 eater-0.133
 virtue-0.125
 anniversary-0.120
 occasion-0.119
 ever-0.117
Token the
Token ID262
Feature activation+9.86e-03

Pos logit contributions
 eater+0.123
 ever+0.119
 virtue+0.116
 behaviour+0.113
 before+0.112
Neg logit contributions
 valleys-0.084
 sails-0.082
 widened-0.080
 sail-0.074
itched-0.072
Token lumber
Token ID34840
Feature activation+9.68e-02

Pos logit contributions
 sails+0.010
 valleys+0.010
 sail+0.009
 widened+0.009
 gl+0.009
Neg logit contributions
 eater-0.021
 virtue-0.020
 occasion-0.020
 ever-0.020
 anniversary-0.020
Token.
Token ID13
Feature activation-3.08e-03

Pos logit contributions
 sails+0.140
 valleys+0.134
 sail+0.134
 gl+0.130
 widened+0.127
Neg logit contributions
 eater-0.169
 virtue-0.161
 anniversary-0.157
 occasion-0.154
 guests-0.149
Token his
Token ID465
Feature activation+9.92e-02

Pos logit contributions
 sails+0.146
 widened+0.142
 valleys+0.140
 sail+0.139
 gl+0.132
Neg logit contributions
 eater-0.150
 ever-0.136
 occasion-0.134
 virtue-0.133
 anniversary-0.132
Token boat
Token ID8848
Feature activation+1.66e-01

Pos logit contributions
 widened+0.167
 valleys+0.153
 widen+0.151
 sails+0.144
He+0.143
Neg logit contributions
 eater-0.147
 guests-0.134
 kid-0.134
 anniversary-0.133
 tutor-0.132
Token and
Token ID290
Feature activation-2.82e-02

Pos logit contributions
 sails+0.269
 valleys+0.269
 widened+0.264
 sail+0.251
 gl+0.243
Neg logit contributions
 eater-0.249
 virtue-0.227
 ever-0.223
 occasion-0.221
 anniversary-0.219
Token began
Token ID2540
Feature activation+1.07e-01

Pos logit contributions
 eater+0.052
 ever+0.049
 occasion+0.047
 virtue+0.047
nie+0.047
Neg logit contributions
 sails-0.035
 valleys-0.034
 widened-0.033
itched-0.032
 sail-0.032
Token to
Token ID284
Feature activation-3.67e-02

Pos logit contributions
 widened+0.235
 sails+0.227
 valleys+0.221
 widen+0.218
 sail+0.214
Neg logit contributions
 eater-0.093
 ever-0.091
 before-0.081
 guests-0.077
 mistake-0.077
Token paddle
Token ID39517
Feature activation+3.28e-01

Pos logit contributions
 eater+0.056
 ever+0.051
 exam+0.051
 occasion+0.051
 mistake+0.050
Neg logit contributions
 valleys-0.054
 widened-0.054
 sails-0.054
itched-0.053
He-0.052
Token away
Token ID1497
Feature activation+8.32e-02

Pos logit contributions
 sails+0.502
 valleys+0.490
 widened+0.482
 sail+0.473
 gl+0.455
Neg logit contributions
 eater-0.520
 virtue-0.491
 ever-0.488
olding-0.465
 occasion-0.465
Token,
Token ID11
Feature activation+9.66e-02

Pos logit contributions
 sails+0.146
 valleys+0.141
 widened+0.139
 sail+0.138
 gl+0.133
Neg logit contributions
 eater-0.112
 virtue-0.106
 ever-0.106
 before-0.101
olding-0.100
Token eagerly
Token ID30130
Feature activation+3.23e-02

Pos logit contributions
 sails+0.159
 valleys+0.157
 sail+0.153
 widened+0.149
 gl+0.147
Neg logit contributions
 eater-0.151
 virtue-0.142
 anniversary-0.137
 occasion-0.135
olding-0.131
Token looking
Token ID2045
Feature activation+7.81e-02

Pos logit contributions
 sails+0.049
 valleys+0.048
 sail+0.048
 gl+0.044
 widened+0.044
Neg logit contributions
 eater-0.055
 virtue-0.052
 anniversary-0.050
 occasion-0.050
olding-0.048
Token for
Token ID329
Feature activation-1.84e-01

Pos logit contributions
 sails+0.174
 valleys+0.172
 sail+0.168
 widened+0.167
 gl+0.163
Neg logit contributions
 eater-0.074
 virtue-0.067
 anniversary-0.063
 occasion-0.062
 ever-0.060
Token started
Token ID2067
Feature activation+9.12e-02

Pos logit contributions
 sails+0.176
 sail+0.164
 valleys+0.160
 gl+0.158
 widened+0.156
Neg logit contributions
 eater-0.295
 virtue-0.284
 anniversary-0.276
 occasion-0.272
olding-0.271
Token to
Token ID284
Feature activation-3.37e-02

Pos logit contributions
 sails+0.162
 valleys+0.159
 widened+0.156
 sail+0.154
 gl+0.148
Neg logit contributions
 eater-0.124
 virtue-0.114
 ever-0.112
 anniversary-0.109
 occasion-0.109
Token do
Token ID466
Feature activation-2.17e-02

Pos logit contributions
 eater+0.060
 virtue+0.057
 anniversary+0.055
 occasion+0.055
 ever+0.055
Neg logit contributions
 sails-0.046
 valleys-0.045
 sail-0.043
 widened-0.043
 gl-0.041
Token do
Token ID466
Feature activation-2.17e-02

Pos logit contributions
 eater+0.042
 virtue+0.039
 anniversary+0.038
olding+0.038
 occasion+0.037
Neg logit contributions
 valleys-0.028
 sails-0.028
 sail-0.026
 widened-0.026
 gl-0.026
Tokenggy
Token ID19970
Feature activation-8.85e-02

Pos logit contributions
 eater+0.041
 virtue+0.039
 anniversary+0.037
olding+0.037
 occasion+0.037
Neg logit contributions
 sails-0.029
 valleys-0.029
 sail-0.027
 widened-0.027
 gl-0.027
Token paddle
Token ID39517
Feature activation+3.28e-01

Pos logit contributions
 virtue+0.178
 eater+0.165
 occasion+0.164
 anniversary+0.161
 policy+0.159
Neg logit contributions
 sails-0.108
 valleys-0.108
 sail-0.106
 gl-0.105
 widened-0.102
Token,
Token ID11
Feature activation+1.03e-01

Pos logit contributions
 sails+0.433
 valleys+0.429
 sail+0.410
 widened+0.402
 gl+0.401
Neg logit contributions
 eater-0.609
 virtue-0.585
 anniversary-0.579
 occasion-0.565
olding-0.549
Token she
Token ID673
Feature activation+5.49e-02

Pos logit contributions
 sails+0.147
 sail+0.142
 valleys+0.141
 gl+0.135
 widened+0.129
Neg logit contributions
 eater-0.188
 virtue-0.184
 anniversary-0.176
 occasion-0.173
olding-0.169
Token loved
Token ID6151
Feature activation-6.30e-02

Pos logit contributions
 sails+0.072
 sail+0.068
 valleys+0.067
 gl+0.065
 widened+0.062
Neg logit contributions
 eater-0.108
 virtue-0.103
 anniversary-0.102
 occasion-0.100
olding-0.098
Token the
Token ID262
Feature activation+6.08e-03

Pos logit contributions
 virtue+0.134
 eater+0.131
olding+0.127
 anniversary+0.123
aughter+0.120
Neg logit contributions
 valleys-0.071
 gl-0.071
 sails-0.068
 bent-0.064
 sail-0.063
Token way
Token ID835
Feature activation-3.53e-02

Pos logit contributions
 sail+0.007
 sails+0.007
 gl+0.006
 valleys+0.006
 spark+0.005
Neg logit contributions
 eater-0.013
 anniversary-0.013
 virtue-0.013
olding-0.013
grand-0.013
Token in
Token ID287
Feature activation-4.56e-02

Pos logit contributions
 sails+0.254
 valleys+0.250
 sail+0.242
 widened+0.238
 gl+0.230
Neg logit contributions
 eater-0.302
 virtue-0.287
 anniversary-0.277
 occasion-0.275
olding-0.271
Token tight
Token ID5381
Feature activation-2.45e-02

Pos logit contributions
 eater+0.084
 virtue+0.081
olding+0.077
 anniversary+0.077
 occasion+0.076
Neg logit contributions
 sails-0.061
 valleys-0.061
 sail-0.058
 widened-0.056
 gl-0.056
Token circles
Token ID13332
Feature activation+1.21e-01

Pos logit contributions
 eater+0.042
 virtue+0.039
 ever+0.038
 occasion+0.038
 anniversary+0.038
Neg logit contributions
 sails-0.035
 valleys-0.034
 sail-0.033
 widened-0.033
 gl-0.032
Token,
Token ID11
Feature activation+9.83e-02

Pos logit contributions
 sails+0.191
 valleys+0.186
 sail+0.181
 widened+0.177
 hollow+0.169
Neg logit contributions
 eater-0.187
 ever-0.177
 virtue-0.170
 occasion-0.167
 anniversary-0.164
Token or
Token ID393
Feature activation-6.11e-02

Pos logit contributions
 sails+0.146
 valleys+0.144
 sail+0.139
 widened+0.136
 gl+0.133
Neg logit contributions
 eater-0.167
 virtue-0.160
 anniversary-0.153
 occasion-0.152
olding-0.150
Token paddle
Token ID39517
Feature activation+3.27e-01

Pos logit contributions
 virtue+0.111
 eater+0.110
olding+0.104
 occasion+0.102
 anniversary+0.102
Neg logit contributions
 sail-0.085
 sails-0.085
 valleys-0.084
 gl-0.079
 widened-0.077
Token her
Token ID607
Feature activation+1.78e-02

Pos logit contributions
 valleys+0.450
 sails+0.443
 widened+0.415
 sail+0.413
 gl+0.410
Neg logit contributions
 eater-0.601
 virtue-0.569
 anniversary-0.561
 occasion-0.556
olding-0.542
Token feet
Token ID3625
Feature activation+6.40e-02

Pos logit contributions
 sails+0.023
 sail+0.022
 gl+0.021
 valleys+0.020
 spark+0.018
Neg logit contributions
 eater-0.036
 virtue-0.036
olding-0.035
becca-0.034
aughter-0.034
Token to
Token ID284
Feature activation-3.34e-02

Pos logit contributions
 valleys+0.107
 sails+0.106
 widened+0.102
 sail+0.098
 hollow+0.094
Neg logit contributions
 eater-0.089
 ever-0.088
 virtue-0.084
 before-0.081
olding-0.081
Token travel
Token ID3067
Feature activation+1.59e-01

Pos logit contributions
 eater+0.052
 ever+0.048
 occasion+0.047
 virtue+0.047
 mistake+0.046
Neg logit contributions
 sails-0.052
 widened-0.050
 valleys-0.050
itched-0.047
He-0.047
Token around
Token ID1088
Feature activation+3.45e-02

Pos logit contributions
 valleys+0.218
 sails+0.210
 sail+0.203
 widened+0.199
 gl+0.197
Neg logit contributions
 eater-0.300
 virtue-0.278
 anniversary-0.276
olding-0.272
 occasion-0.264
Token
Token ID198
Feature activation+6.26e-02

Pos logit contributions
 eater+0.032
 virtue+0.031
 occasion+0.030
 ever+0.030
olding+0.030
Neg logit contributions
 sails-0.022
 valleys-0.020
 sail-0.020
 widened-0.020
 gl-0.019
Token
Token ID198
Feature activation+6.33e-02

Pos logit contributions
 sails+0.082
 valleys+0.077
 sail+0.074
 widened+0.073
 gl+0.070
Neg logit contributions
 eater-0.116
 virtue-0.111
 ever-0.111
 occasion-0.109
olding-0.108
TokenShe
Token ID3347
Feature activation+1.39e-01

Pos logit contributions
 sails+0.072
 valleys+0.069
 sail+0.067
 widened+0.065
 gl+0.063
Neg logit contributions
 eater-0.129
 virtue-0.124
 occasion-0.120
 anniversary-0.119
 ever-0.119
Token tried
Token ID3088
Feature activation+4.32e-02

Pos logit contributions
 valleys+0.180
 sails+0.177
 widened+0.170
 sail+0.167
itched+0.160
Neg logit contributions
 eater-0.249
 virtue-0.236
 ever-0.232
 occasion-0.228
 anniversary-0.227
Token to
Token ID284
Feature activation-3.63e-02

Pos logit contributions
 sails+0.085
 valleys+0.083
 widened+0.083
 sail+0.081
 gl+0.078
Neg logit contributions
 eater-0.050
 virtue-0.047
 ever-0.046
 anniversary-0.044
 mistake-0.043
Token paddle
Token ID39517
Feature activation+3.26e-01

Pos logit contributions
 eater+0.060
 virtue+0.056
 ever+0.056
 anniversary+0.055
 occasion+0.055
Neg logit contributions
 valleys-0.054
 sails-0.053
 widened-0.051
 gl-0.049
 sail-0.048
Token out
Token ID503
Feature activation+1.13e-01

Pos logit contributions
 sails+0.454
 valleys+0.442
 sail+0.428
 widened+0.424
 gl+0.405
Neg logit contributions
 eater-0.576
 virtue-0.547
 ever-0.527
 occasion-0.522
 anniversary-0.522
Token,
Token ID11
Feature activation+9.90e-02

Pos logit contributions
 sails+0.169
 valleys+0.166
 sail+0.160
 widened+0.158
 gl+0.153
Neg logit contributions
 eater-0.190
 virtue-0.179
 anniversary-0.172
 occasion-0.171
olding-0.168
Token but
Token ID475
Feature activation-5.30e-02

Pos logit contributions
 sails+0.169
 valleys+0.166
 sail+0.162
 widened+0.158
 gl+0.156
Neg logit contributions
 eater-0.148
 virtue-0.140
 anniversary-0.133
 occasion-0.131
 ever-0.129
Token the
Token ID262
Feature activation+3.47e-03

Pos logit contributions
 eater+0.111
 virtue+0.104
 anniversary+0.103
 occasion+0.100
olding+0.099
Neg logit contributions
 valleys-0.063
 gl-0.060
 sails-0.060
 sail-0.059
 widened-0.056
Token waves
Token ID9813
Feature activation+2.10e-01

Pos logit contributions
 sails+0.004
 sail+0.003
 valleys+0.003
 gl+0.003
 widened+0.003
Neg logit contributions
 eater-0.008
 virtue-0.008
 anniversary-0.007
olding-0.007
 ever-0.007
Tokenar
Token ID283
Feature activation+2.55e-02

Pos logit contributions
 sails+0.132
 valleys+0.127
 sail+0.126
 widened+0.122
 gl+0.121
Neg logit contributions
 eater-0.121
 virtue-0.114
 anniversary-0.110
 occasion-0.108
 ever-0.107
Token,
Token ID11
Feature activation+1.06e-01

Pos logit contributions
 sails+0.036
 valleys+0.034
 sail+0.033
 widened+0.033
 gl+0.032
Neg logit contributions
 eater-0.047
 virtue-0.046
 anniversary-0.043
 occasion-0.043
olding-0.042
Token and
Token ID290
Feature activation-1.92e-02

Pos logit contributions
 sails+0.156
 valleys+0.152
 sail+0.148
 widened+0.146
 gl+0.142
Neg logit contributions
 eater-0.182
 virtue-0.172
 anniversary-0.165
 occasion-0.164
 ever-0.162
Token started
Token ID2067
Feature activation+9.35e-02

Pos logit contributions
 eater+0.038
 virtue+0.037
 anniversary+0.035
 occasion+0.035
olding+0.035
Neg logit contributions
 sails-0.023
 valleys-0.022
 sail-0.022
 widened-0.021
 gl-0.021
Token to
Token ID284
Feature activation-3.24e-02

Pos logit contributions
 sails+0.167
 valleys+0.165
 widened+0.160
 sail+0.160
 gl+0.153
Neg logit contributions
 eater-0.126
 virtue-0.117
 ever-0.113
 anniversary-0.111
 occasion-0.110
Token paddle
Token ID39517
Feature activation+3.26e-01

Pos logit contributions
 eater+0.050
 virtue+0.047
 ever+0.045
 anniversary+0.044
 occasion+0.044
Neg logit contributions
 sails-0.052
 valleys-0.052
 widened-0.049
 sail-0.049
 gl-0.047
Token.
Token ID13
Feature activation-9.22e-03

Pos logit contributions
 sails+0.428
 valleys+0.422
 sail+0.402
 widened+0.392
 gl+0.383
Neg logit contributions
 eater-0.616
 virtue-0.582
 anniversary-0.574
 occasion-0.564
olding-0.546
Token
Token ID220
Feature activation+9.84e-02

Pos logit contributions
 eater+0.020
 virtue+0.019
 anniversary+0.019
 occasion+0.019
 ever+0.019
Neg logit contributions
 sails-0.009
 valleys-0.009
 sail-0.008
 widened-0.008
He-0.008
Token
Token ID198
Feature activation+7.96e-02

Pos logit contributions
 sails+0.117
 valleys+0.112
 widened+0.108
 sail+0.108
 gl+0.103
Neg logit contributions
 eater-0.191
 virtue-0.186
olding-0.180
 ever-0.177
 anniversary-0.173
Token
Token ID198
Feature activation+7.94e-02

Pos logit contributions
 sails+0.096
 valleys+0.092
 sail+0.087
 widened+0.087
 gl+0.085
Neg logit contributions
 eater-0.156
 virtue-0.152
olding-0.146
 ever-0.145
 occasion-0.145
TokenThe
Token ID464
Feature activation+2.40e-02

Pos logit contributions
 sails+0.082
 valleys+0.080
 sail+0.076
 widened+0.075
 gl+0.070
Neg logit contributions
 eater-0.169
 virtue-0.161
 anniversary-0.157
 occasion-0.156
 ever-0.155
Token Rob
Token ID3851
Feature activation+5.78e-02

Pos logit contributions
 sails+0.100
 valleys+0.100
 widened+0.099
 sail+0.098
 gl+0.094
Neg logit contributions
 eater-0.109
 ever-0.106
 virtue-0.101
nie-0.099
 guests-0.099
Token how
Token ID703
Feature activation-1.31e-01

Pos logit contributions
 sails+0.093
 valleys+0.089
 sail+0.088
 widened+0.085
 gl+0.084
Neg logit contributions
 eater-0.092
 virtue-0.087
 occasion-0.084
 anniversary-0.083
 ever-0.081
Token to
Token ID284
Feature activation-3.41e-02

Pos logit contributions
 eater+0.197
 virtue+0.181
 anniversary+0.176
 occasion+0.175
olding+0.163
Neg logit contributions
 sails-0.226
 valleys-0.219
 sail-0.217
He-0.209
 gl-0.209
Token use
Token ID779
Feature activation+8.41e-02

Pos logit contributions
 eater+0.051
 ever+0.048
 occasion+0.047
 virtue+0.047
 mistake+0.046
Neg logit contributions
 valleys-0.055
 sails-0.052
 widened-0.051
 gl-0.048
He-0.047
Token a
Token ID257
Feature activation+3.19e-02

Pos logit contributions
 widened+0.113
 valleys+0.109
He+0.098
rolling+0.098
 sails+0.097
Neg logit contributions
 ever-0.137
 before-0.135
 eater-0.133
 you-0.133
 homework-0.132
Token paddle
Token ID39517
Feature activation+3.26e-01

Pos logit contributions
 valleys+0.047
 widened+0.043
 sails+0.041
He+0.038
itched+0.037
Neg logit contributions
 virtue-0.051
 mistake-0.051
 promise-0.051
 eater-0.050
 occasion-0.049
Token to
Token ID284
Feature activation-3.44e-02

Pos logit contributions
 sails+0.509
 valleys+0.501
 sail+0.483
 widened+0.482
 gl+0.463
Neg logit contributions
 eater-0.518
 virtue-0.489
 anniversary-0.465
 occasion-0.464
 ever-0.463
Token move
Token ID1445
Feature activation+1.98e-01

Pos logit contributions
 eater+0.054
 mistake+0.053
 ever+0.052
 behave+0.052
 virtue+0.052
Neg logit contributions
 valleys-0.048
 widened-0.048
 sails-0.045
He-0.043
 hollow-0.043
Token the
Token ID262
Feature activation+4.35e-03

Pos logit contributions
 sails+0.252
 valleys+0.245
 sail+0.235
 widened+0.229
 gl+0.222
Neg logit contributions
 eater-0.383
 virtue-0.364
 anniversary-0.352
 occasion-0.351
olding-0.345
Token boat
Token ID8848
Feature activation+1.62e-01

Pos logit contributions
 widened+0.005
 valleys+0.005
 sails+0.004
itched+0.004
He+0.004
Neg logit contributions
 eater-0.009
 virtue-0.008
 occasion-0.008
 ever-0.008
 mistake-0.008
Token.
Token ID13
Feature activation-1.27e-02

Pos logit contributions
 sails+0.248
 valleys+0.244
 sail+0.235
 widened+0.234
 gl+0.225
Neg logit contributions
 eater-0.261
 virtue-0.246
 anniversary-0.235
 ever-0.234
 occasion-0.234
Token the
Token ID262
Feature activation+4.06e-03

Pos logit contributions
 eater+0.087
 virtue+0.083
 anniversary+0.080
 occasion+0.079
olding+0.079
Neg logit contributions
 sails-0.064
 valleys-0.062
 sail-0.060
 widened-0.059
 gl-0.057
Token boat
Token ID8848
Feature activation+1.61e-01

Pos logit contributions
 widened+0.007
itched+0.006
 widen+0.006
iddled+0.006
 perched+0.006
Neg logit contributions
 day-0.006
 kid-0.006
 mistake-0.006
 promise-0.006
 job-0.005
Token and
Token ID290
Feature activation-2.22e-02

Pos logit contributions
 sails+0.230
 valleys+0.226
 sail+0.217
 widened+0.216
 gl+0.207
Neg logit contributions
 eater-0.278
 virtue-0.264
 ever-0.252
 anniversary-0.252
 occasion-0.252
Token took
Token ID1718
Feature activation+2.05e-02

Pos logit contributions
 eater+0.041
 virtue+0.039
 anniversary+0.038
 occasion+0.037
 ever+0.037
Neg logit contributions
 sails-0.029
 valleys-0.028
 sail-0.027
 widened-0.027
 gl-0.026
Token the
Token ID262
Feature activation+3.47e-03

Pos logit contributions
 widened+0.029
 sails+0.029
 valleys+0.029
 sail+0.027
 gl+0.027
Neg logit contributions
 ever-0.034
 eater-0.033
 virtue-0.032
 guests-0.030
 before-0.030
Token paddle
Token ID39517
Feature activation+3.25e-01

Pos logit contributions
 widened+0.004
 valleys+0.004
 sails+0.004
itched+0.004
He+0.003
Neg logit contributions
 eater-0.007
 virtue-0.006
 occasion-0.006
 mistake-0.006
 ever-0.006
Token.
Token ID13
Feature activation-1.27e-02

Pos logit contributions
 sails+0.515
 valleys+0.505
 sail+0.489
 widened+0.486
 gl+0.468
Neg logit contributions
 eater-0.513
 virtue-0.484
 anniversary-0.461
 occasion-0.460
 ever-0.458
Token He
Token ID679
Feature activation+1.99e-01

Pos logit contributions
 eater+0.024
 virtue+0.024
 ever+0.023
olding+0.023
 before+0.022
Neg logit contributions
 sails-0.014
 valleys-0.014
 widened-0.014
 gl-0.013
 sail-0.013
Token pushed
Token ID7121
Feature activation+1.06e-01

Pos logit contributions
 sails+0.265
 valleys+0.263
 sail+0.250
 widened+0.250
 gl+0.237
Neg logit contributions
 eater-0.356
 virtue-0.339
 ever-0.328
 occasion-0.322
 anniversary-0.322
Token and
Token ID290
Feature activation-2.30e-02

Pos logit contributions
 sails+0.157
 valleys+0.152
 widened+0.148
 sail+0.148
 gl+0.142
Neg logit contributions
 eater-0.178
 virtue-0.169
 ever-0.165
 occasion-0.160
 anniversary-0.160
Token pulled
Token ID5954
Feature activation+1.03e-01

Pos logit contributions
 ever+0.036
 before+0.035
 eater+0.033
 virtue+0.033
 occasion+0.031
Neg logit contributions
 sails-0.033
itched-0.032
 valleys-0.032
 hollow-0.031
 widen-0.031
Token wanted
Token ID2227
Feature activation-4.00e-02

Pos logit contributions
 valleys+0.290
 sails+0.286
 sail+0.271
 widened+0.271
 gl+0.258
Neg logit contributions
 eater-0.340
 virtue-0.322
 ever-0.311
 occasion-0.306
 anniversary-0.304
Token to
Token ID284
Feature activation-3.44e-02

Pos logit contributions
 eater+0.053
 virtue+0.050
 anniversary+0.047
 occasion+0.047
olding+0.046
Neg logit contributions
 sails-0.074
 valleys-0.073
 sail-0.071
 widened-0.070
 gl-0.069
Token go
Token ID467
Feature activation+7.13e-02

Pos logit contributions
 eater+0.049
 virtue+0.046
 ever+0.046
 occasion+0.045
 anniversary+0.044
Neg logit contributions
 valleys-0.060
 sails-0.058
 widened-0.056
 gl-0.053
 sail-0.053
Token for
Token ID329
Feature activation-1.83e-01

Pos logit contributions
 widened+0.125
 valleys+0.120
 sails+0.115
itched+0.111
 darkened+0.106
Neg logit contributions
 ever-0.103
 before-0.100
olding-0.092
 you-0.092
 eater-0.090
Token a
Token ID257
Feature activation+2.94e-02

Pos logit contributions
 ever+0.309
 virtue+0.296
 eater+0.295
 occasion+0.295
 anniversary+0.290
Neg logit contributions
 widened-0.245
 sails-0.244
 valleys-0.243
 gl-0.223
 sail-0.222
Token paddle
Token ID39517
Feature activation+3.25e-01

Pos logit contributions
 sails+0.044
 valleys+0.043
 widened+0.042
 sail+0.039
 gl+0.039
Neg logit contributions
 eater-0.049
 virtue-0.046
 occasion-0.045
 ever-0.045
 anniversary-0.045
Token.
Token ID13
Feature activation-1.42e-02

Pos logit contributions
 sails+0.432
 sail+0.425
 gl+0.392
 valleys+0.388
He+0.377
Neg logit contributions
 eater-0.597
 virtue-0.570
 anniversary-0.566
 occasion-0.549
becca-0.548
Token But
Token ID887
Feature activation-5.12e-02

Pos logit contributions
 eater+0.033
 virtue+0.032
 anniversary+0.032
 occasion+0.031
 ever+0.031
Neg logit contributions
 sails-0.012
He-0.012
 valleys-0.011
 widened-0.011
 He-0.010
Token Tom
Token ID4186
Feature activation-6.20e-02

Pos logit contributions
 virtue+0.110
 eater+0.109
 anniversary+0.106
 occasion+0.106
olding+0.103
Neg logit contributions
 sail-0.053
 sails-0.052
He-0.050
His-0.049
 bent-0.048
Token was
Token ID373
Feature activation+3.17e-03

Pos logit contributions
 eater+0.114
 anniversary+0.112
 virtue+0.110
 occasion+0.110
 inconvenience+0.103
Neg logit contributions
 sails-0.084
 bent-0.079
 gl-0.079
 widened-0.079
He-0.078
Token too
Token ID1165
Feature activation-1.93e-01

Pos logit contributions
 sails+0.005
 valleys+0.005
 sail+0.005
 widened+0.005
 gl+0.005
Neg logit contributions
 eater-0.005
 virtue-0.005
 anniversary-0.005
 occasion-0.005
olding-0.004
Token so
Token ID523
Feature activation-8.03e-02

Pos logit contributions
 valleys+0.022
 sails+0.021
 sail+0.020
 widened+0.019
 mast+0.019
Neg logit contributions
 ever-0.012
 virtue-0.011
 before-0.010
 eater-0.010
 mistake-0.010
Token relieved
Token ID22598
Feature activation+4.12e-02

Pos logit contributions
 ever+0.078
 promise+0.076
olding+0.071
 eater+0.063
 virtue+0.061
Neg logit contributions
 valleys-0.182
 sails-0.181
 hood-0.166
 armour-0.165
 mast-0.165
Token and
Token ID290
Feature activation-1.94e-02

Pos logit contributions
 sails+0.054
 valleys+0.052
 sail+0.052
 gl+0.051
 widened+0.050
Neg logit contributions
 eater-0.078
 virtue-0.075
 anniversary-0.070
 occasion-0.069
nie-0.068
Token started
Token ID2067
Feature activation+9.03e-02

Pos logit contributions
 eater+0.031
 ever+0.029
 virtue+0.028
olding+0.028
 anniversary+0.027
Neg logit contributions
 sails-0.030
 valleys-0.029
 widened-0.027
 sail-0.027
itched-0.026
Token to
Token ID284
Feature activation-3.25e-02

Pos logit contributions
 widened+0.191
 valleys+0.190
 sails+0.184
 widen+0.176
He+0.175
Neg logit contributions
 ever-0.083
 eater-0.079
 mistake-0.076
 before-0.075
olding-0.074
Token paddle
Token ID39517
Feature activation+3.25e-01

Pos logit contributions
 mistake+0.048
 promise+0.048
 eater+0.047
 behave+0.047
 ever+0.046
Neg logit contributions
 valleys-0.051
 sails-0.047
 widened-0.047
He-0.045
itched-0.044
Token towards
Token ID3371
Feature activation+7.42e-02

Pos logit contributions
 sails+0.481
 valleys+0.463
 widened+0.451
 sail+0.451
 gl+0.430
Neg logit contributions
 eater-0.537
 virtue-0.509
 ever-0.503
olding-0.485
 occasion-0.481
Token the
Token ID262
Feature activation+1.05e-02

Pos logit contributions
 valleys+0.079
 sail+0.070
 sails+0.069
 gl+0.065
 wood+0.063
Neg logit contributions
 eater-0.174
 virtue-0.169
olding-0.161
 anniversary-0.156
nie-0.156
Token shore
Token ID15191
Feature activation+1.74e-01

Pos logit contributions
 widened+0.012
 valleys+0.012
itched+0.012
 sails+0.011
He+0.010
Neg logit contributions
 eater-0.020
 virtue-0.020
 mistake-0.019
 occasion-0.019
 ever-0.019
Token with
Token ID351
Feature activation-7.00e-02

Pos logit contributions
 sails+0.266
 valleys+0.261
 sail+0.256
 widened+0.243
 gl+0.242
Neg logit contributions
 eater-0.290
 virtue-0.278
 anniversary-0.266
 occasion-0.263
olding-0.256
Token the
Token ID262
Feature activation+1.17e-02

Pos logit contributions
 eater+0.113
 ever+0.109
 virtue+0.107
 anniversary+0.104
 occasion+0.103
Neg logit contributions
 sails-0.104
 valleys-0.102
 widened-0.099
 sail-0.098
He-0.093
Token a
Token ID257
Feature activation+2.91e-02

Pos logit contributions
 sails+0.023
 valleys+0.023
 widened+0.023
 sail+0.022
 gl+0.021
Neg logit contributions
 eater-0.035
 ever-0.034
 virtue-0.033
 occasion-0.032
 guests-0.032
Token paddle
Token ID39517
Feature activation+3.23e-01

Pos logit contributions
 valleys+0.040
 widened+0.037
 sails+0.036
He+0.033
itched+0.033
Neg logit contributions
 occasion-0.048
 eater-0.048
 promise-0.048
 virtue-0.048
nie-0.047
Token and
Token ID290
Feature activation-2.60e-02

Pos logit contributions
 valleys+0.527
 sails+0.517
 widened+0.515
 sail+0.485
 gl+0.475
Neg logit contributions
 eater-0.482
 virtue-0.456
 ever-0.445
olding-0.434
 occasion-0.428
Token started
Token ID2067
Feature activation+8.47e-02

Pos logit contributions
 ever+0.046
 before+0.045
 eater+0.043
 virtue+0.041
nie+0.041
Neg logit contributions
 valleys-0.031
 widened-0.030
 widen-0.030
itched-0.030
 hollow-0.029
Token to
Token ID284
Feature activation-3.75e-02

Pos logit contributions
 widened+0.145
 valleys+0.133
 widen+0.131
 hollow+0.131
 sails+0.130
Neg logit contributions
 ever-0.115
 before-0.114
 eater-0.105
olding-0.100
 virtue-0.099
Token paddle
Token ID39517
Feature activation+3.24e-01

Pos logit contributions
 promise+0.071
 behave+0.071
 eater+0.070
 ever+0.069
 mistake+0.068
Neg logit contributions
 widened-0.037
 valleys-0.037
itched-0.035
He-0.034
 hollow-0.033
Token.
Token ID13
Feature activation-1.60e-02

Pos logit contributions
 sails+0.489
 valleys+0.478
 sail+0.462
 widened+0.461
 gl+0.443
Neg logit contributions
 eater-0.533
 virtue-0.505
 ever-0.484
 occasion-0.479
 anniversary-0.479
Token Suddenly
Token ID24975
Feature activation+1.23e-01

Pos logit contributions
 eater+0.029
 virtue+0.029
 ever+0.028
olding+0.028
 before+0.027
Neg logit contributions
 sails-0.020
 widened-0.019
 valleys-0.018
 sail-0.018
 gl-0.017
Token,
Token ID11
Feature activation+1.02e-01

Pos logit contributions
 sails+0.182
 widened+0.177
 valleys+0.174
 sail+0.171
 widen+0.164
Neg logit contributions
 ever-0.186
 before-0.179
 eater-0.178
 you-0.168
 guests-0.167
Token he
Token ID339
Feature activation+1.08e-01

Pos logit contributions
 sails+0.116
 valleys+0.114
 sail+0.109
 widened+0.107
 gl+0.102
Neg logit contributions
 eater-0.209
 virtue-0.200
 anniversary-0.193
 occasion-0.192
olding-0.190
Token heard
Token ID2982
Feature activation+5.10e-02

Pos logit contributions
 valleys+0.143
 sails+0.139
 widened+0.134
 sail+0.133
 hollow+0.131
Neg logit contributions
 eater-0.182
 ever-0.177
 virtue-0.172
 before-0.165
 mistake-0.164
Token the
Token ID262
Feature activation+1.32e-02

Pos logit contributions
 eater+0.265
 virtue+0.249
 ever+0.244
 occasion+0.241
olding+0.236
Neg logit contributions
 sails-0.263
 valleys-0.256
 widened-0.249
 sail-0.248
 gl-0.232
Token n
Token ID299
Feature activation+6.54e-02

Pos logit contributions
 valleys+0.019
 sails+0.019
 gl+0.018
 widened+0.018
 wood+0.018
Neg logit contributions
 eater-0.024
 virtue-0.022
 anniversary-0.021
becca-0.021
 ever-0.021
Tokenooks
Token ID31085
Feature activation+6.65e-02

Pos logit contributions
 sails+0.118
 valleys+0.116
 sail+0.112
 widened+0.111
 gl+0.108
Neg logit contributions
 eater-0.090
 virtue-0.083
 anniversary-0.079
 occasion-0.079
 ever-0.079
Token and
Token ID290
Feature activation-1.95e-02

Pos logit contributions
 sails+0.090
 valleys+0.087
 widened+0.083
 sail+0.082
 gl+0.081
Neg logit contributions
 eater-0.123
 virtue-0.121
 anniversary-0.115
 occasion-0.113
becca-0.111
Token cr
Token ID1067
Feature activation+3.17e-02

Pos logit contributions
 eater+0.034
 virtue+0.032
 ever+0.031
 occasion+0.031
 anniversary+0.031
Neg logit contributions
 sails-0.027
 valleys-0.026
 sail-0.025
 widened-0.025
 gl-0.024
Tokenann
Token ID1236
Feature activation+3.24e-01

Pos logit contributions
 sail+0.051
 valleys+0.048
 sails+0.045
 paddle+0.044
 sailed+0.044
Neg logit contributions
 ever-0.046
olding-0.045
 guests-0.045
 virtue-0.044
 eater-0.044
Tokenies
Token ID444
Feature activation+7.86e-03

Pos logit contributions
 sails+0.578
 valleys+0.558
 widened+0.541
itched+0.539
 sail+0.536
Neg logit contributions
 eater-0.455
 virtue-0.435
 occasion-0.402
 anniversary-0.398
 ever-0.383
Token.
Token ID13
Feature activation-8.60e-03

Pos logit contributions
 sails+0.012
 valleys+0.012
 sail+0.012
 widened+0.011
 gl+0.011
Neg logit contributions
 eater-0.013
 virtue-0.012
 anniversary-0.012
 occasion-0.012
olding-0.012
Token
Token ID198
Feature activation+7.92e-02

Pos logit contributions
 eater+0.017
 virtue+0.016
 ever+0.016
 occasion+0.015
olding+0.015
Neg logit contributions
 sails-0.010
 valleys-0.010
 sail-0.009
 widened-0.009
 gl-0.009
Token
Token ID198
Feature activation+7.88e-02

Pos logit contributions
 sails+0.112
 valleys+0.105
 sail+0.103
 widened+0.100
 gl+0.098
Neg logit contributions
 eater-0.135
 virtue-0.132
 ever-0.129
olding-0.126
 occasion-0.124
TokenAfter
Token ID3260
Feature activation-7.22e-02

Pos logit contributions
 sails+0.135
 valleys+0.130
 sail+0.129
 widened+0.125
 gl+0.123
Neg logit contributions
 eater-0.115
 virtue-0.109
 ever-0.103
 occasion-0.101
olding-0.100
Token not
Token ID407
Feature activation-2.27e-01

Pos logit contributions
 eater+0.114
 occasion+0.105
kids+0.101
ae+0.100
 anniversary+0.099
Neg logit contributions
 sails-0.086
 sail-0.082
 gl-0.080
 glitter-0.075
 widened-0.074
Token good
Token ID922
Feature activation-1.18e-01

Pos logit contributions
 eater+0.390
 occasion+0.354
 virtue+0.353
 anniversary+0.346
becca+0.337
Neg logit contributions
 sails-0.346
 sail-0.331
 valleys-0.310
 gl-0.308
 widened-0.306
Token.
Token ID13
Feature activation-6.01e-02

Pos logit contributions
ae+0.226
becca+0.222
 eater+0.220
bek+0.217
otte+0.217
Neg logit contributions
 sails-0.148
 sail-0.136
 widened-0.120
He-0.119
itched-0.117
Token Ben
Token ID3932
Feature activation-8.17e-02

Pos logit contributions
 eater+0.086
 virtue+0.080
 occasion+0.077
 anniversary+0.075
 guests+0.074
Neg logit contributions
 sails-0.104
 valleys-0.100
 sail-0.100
 widened-0.098
He-0.096
Token,
Token ID11
Feature activation+5.19e-02

Pos logit contributions
 occasion+0.163
 anniversary+0.159
 eater+0.158
 virtue+0.154
 hour+0.153
Neg logit contributions
 sails-0.105
 widened-0.098
 gl-0.097
He-0.093
 sail-0.091
Token you
Token ID345
Feature activation-2.82e-01

Pos logit contributions
 sails+0.068
 sail+0.068
He+0.067
 gl+0.065
 valleys+0.060
Neg logit contributions
 virtue-0.103
 eater-0.092
 occasion-0.091
aughter-0.089
becca-0.088
Token need
Token ID761
Feature activation-6.76e-02

Pos logit contributions
 eater+0.521
 occasion+0.495
 anniversary+0.481
 virtue+0.479
becca+0.456
Neg logit contributions
 sails-0.415
 sail-0.390
 widened-0.378
 gl-0.372
He-0.359
Token to
Token ID284
Feature activation-6.75e-02

Pos logit contributions
 eater+0.111
 virtue+0.098
nie+0.096
 anniversary+0.095
becca+0.094
Neg logit contributions
 sails-0.109
 sail-0.103
 valleys-0.098
 gl-0.097
He-0.096
Token share
Token ID2648
Feature activation-4.99e-02

Pos logit contributions
 eater+0.096
 virtue+0.089
 anniversary+0.084
 ever+0.084
 occasion+0.084
Neg logit contributions
 sails-0.117
 sail-0.114
 valleys-0.112
 widened-0.110
 gl-0.107
Token your
Token ID534
Feature activation-5.91e-02

Pos logit contributions
 eater+0.102
 virtue+0.098
 occasion+0.096
 anniversary+0.094
olding+0.093
Neg logit contributions
 sails-0.058
 sail-0.054
He-0.053
 widened-0.051
His-0.049
Token toys
Token ID14958
Feature activation-1.11e-01

Pos logit contributions
 virtue+0.106
nie+0.102
otte+0.100
ae+0.100
icia+0.099
Neg logit contributions
 sail-0.090
 sails-0.090
 arrows-0.078
 glitter-0.075
 feathers-0.075
Token They
Token ID1119
Feature activation+3.15e-02

Pos logit contributions
 eater+0.109
 virtue+0.105
 mistake+0.100
 occasion+0.099
 behaviour+0.097
Neg logit contributions
 sails-0.105
He-0.103
 sail-0.102
 valleys-0.100
 widened-0.095
Token are
Token ID389
Feature activation-7.45e-03

Pos logit contributions
 sails+0.047
 sail+0.046
 widened+0.044
 gl+0.044
 valleys+0.041
Neg logit contributions
 eater-0.056
 virtue-0.052
 occasion-0.052
 anniversary-0.050
olding-0.049
Token serious
Token ID2726
Feature activation-3.49e-03

Pos logit contributions
 eater+0.014
 occasion+0.014
 virtue+0.013
kids+0.013
 anniversary+0.012
Neg logit contributions
 sails-0.010
 sail-0.010
 gl-0.009
 bent-0.009
 hollow-0.009
Token.
Token ID13
Feature activation-5.90e-02

Pos logit contributions
 virtue+0.008
becca+0.008
 eater+0.008
bek+0.008
 occasion+0.007
Neg logit contributions
 sails-0.003
He-0.003
 valleys-0.003
 widened-0.003
 gl-0.002
Token Do
Token ID2141
Feature activation-9.02e-02

Pos logit contributions
 eater+0.137
 virtue+0.133
 occasion+0.128
 exam+0.126
 anniversary+0.125
Neg logit contributions
 sails-0.050
 valleys-0.049
 sail-0.045
He-0.045
 widened-0.043
Token you
Token ID345
Feature activation-2.78e-01

Pos logit contributions
 eater+0.111
 virtue+0.104
 occasion+0.095
olding+0.087
nie+0.085
Neg logit contributions
 sails-0.185
 sail-0.182
He-0.177
 gl-0.175
 valleys-0.171
Token understand
Token ID1833
Feature activation+4.55e-02

Pos logit contributions
 eater+0.499
 virtue+0.467
 occasion+0.462
becca+0.460
 anniversary+0.456
Neg logit contributions
 sail-0.394
 sails-0.390
He-0.369
 widened-0.352
 widen-0.347
Token?"
Token ID1701
Feature activation-7.90e-02

Pos logit contributions
 sails+0.060
 sail+0.057
 gl+0.056
 glitter+0.053
He+0.053
Neg logit contributions
 virtue-0.083
 eater-0.082
olding-0.077
 occasion-0.074
 behaviour-0.074
Token
Token ID198
Feature activation+3.99e-02

Pos logit contributions
 virtue+0.176
 eater+0.175
 occasion+0.164
 anniversary+0.164
olding+0.161
Neg logit contributions
 sails-0.071
 valleys-0.071
 sail-0.069
He-0.068
 widened-0.067
Token
Token ID198
Feature activation+4.10e-02

Pos logit contributions
 sails+0.053
 valleys+0.050
 widened+0.048
 sail+0.048
 gl+0.046
Neg logit contributions
 eater-0.073
 virtue-0.068
 ever-0.068
olding-0.067
 anniversary-0.067
TokenAnna
Token ID31160
Feature activation-1.52e-01

Pos logit contributions
 sails+0.071
 valleys+0.069
 sail+0.067
 widened+0.067
 gl+0.065
Neg logit contributions
 eater-0.059
 virtue-0.055
 anniversary-0.053
 occasion-0.053
 ever-0.052
Token We
Token ID775
Feature activation-1.97e-01

Pos logit contributions
 eater+0.121
 virtue+0.116
 anniversary+0.112
 occasion+0.112
 ever+0.111
Neg logit contributions
 sails-0.055
 valleys-0.053
 sail-0.051
 widened-0.050
 gl-0.047
Token are
Token ID389
Feature activation-8.78e-03

Pos logit contributions
 eater+0.323
 occasion+0.307
 anniversary+0.306
 virtue+0.306
becca+0.297
Neg logit contributions
 sails-0.308
 sail-0.300
 widened-0.281
 gl-0.278
He-0.270
Token friends
Token ID2460
Feature activation-1.62e-01

Pos logit contributions
 eater+0.017
 occasion+0.017
 virtue+0.017
 behaviour+0.015
olding+0.015
Neg logit contributions
 sails-0.012
 sail-0.011
 gl-0.010
 bent-0.010
 sailed-0.009
Token.
Token ID13
Feature activation-6.39e-02

Pos logit contributions
 virtue+0.346
 occasion+0.342
becca+0.339
 exam+0.333
icia+0.328
Neg logit contributions
 sails-0.173
He-0.172
 sail-0.166
 gl-0.153
 widened-0.141
Token Do
Token ID2141
Feature activation-9.09e-02

Pos logit contributions
 eater+0.083
 virtue+0.078
 occasion+0.073
 mistake+0.073
 exam+0.071
Neg logit contributions
 sails-0.121
 sail-0.118
 valleys-0.116
 widened-0.111
He-0.110
Token you
Token ID345
Feature activation-2.77e-01

Pos logit contributions
 eater+0.084
 virtue+0.075
 occasion+0.066
olding+0.063
nie+0.060
Neg logit contributions
 sails-0.213
 sail-0.210
 gl-0.202
He-0.199
 valleys-0.198
Token want
Token ID765
Feature activation-7.76e-02

Pos logit contributions
 eater+0.453
 virtue+0.442
becca+0.422
 anniversary+0.408
 occasion+0.407
Neg logit contributions
 sail-0.447
 sails-0.436
He-0.391
 gl-0.389
 widened-0.388
Token to
Token ID284
Feature activation-6.43e-02

Pos logit contributions
 eater+0.127
 virtue+0.121
olding+0.114
nie+0.109
 anniversary+0.109
Neg logit contributions
 sails-0.127
 sail-0.122
 gl-0.115
 valleys-0.114
 widened-0.109
Token be
Token ID307
Feature activation+1.97e-03

Pos logit contributions
 eater+0.086
 ever+0.084
 anniversary+0.082
 occasion+0.079
 guests+0.078
Neg logit contributions
 valleys-0.116
 widened-0.110
 sails-0.108
 gl-0.102
He-0.101
Token our
Token ID674
Feature activation-7.33e-02

Pos logit contributions
 sails+0.003
 sail+0.003
 valleys+0.003
 gl+0.003
He+0.003
Neg logit contributions
 eater-0.003
 virtue-0.003
 occasion-0.003
olding-0.003
 behaviour-0.003
Token friend
Token ID1545
Feature activation-1.35e-01

Pos logit contributions
 eater+0.119
you+0.119
nie+0.118
t+0.118
oney+0.117
Neg logit contributions
 sail-0.117
 sails-0.116
 gl-0.096
 wood-0.094
 raft-0.092
TokenGo
Token ID5247
Feature activation-1.17e-01

Pos logit contributions
 eater+0.127
 virtue+0.119
 anniversary+0.113
 ever+0.112
 occasion+0.112
Neg logit contributions
 sails-0.138
 valleys-0.136
 sail-0.131
 widened-0.131
 gl-0.126
Token away
Token ID1497
Feature activation+4.26e-02

Pos logit contributions
 eater+0.251
 virtue+0.241
 anniversary+0.239
 actions+0.237
 students+0.233
Neg logit contributions
 sail-0.108
 sails-0.100
He-0.100
 swoop-0.099
 bent-0.096
Token,
Token ID11
Feature activation+6.60e-02

Pos logit contributions
 sail+0.033
 sails+0.032
 valleys+0.031
 widened+0.031
He+0.029
Neg logit contributions
 eater-0.099
 virtue-0.097
 occasion-0.096
 anniversary-0.093
olding-0.092
Token dog
Token ID3290
Feature activation-1.53e-01

Pos logit contributions
 sail+0.109
 sails+0.096
 gl+0.092
 zoom+0.092
 spark+0.090
Neg logit contributions
 virtue-0.123
 eater-0.105
olding-0.101
 occasion-0.100
nie-0.100
Token,
Token ID11
Feature activation+6.61e-02

Pos logit contributions
 virtue+0.354
 occasion+0.346
 anniversary+0.342
 eater+0.342
 behaviour+0.326
Neg logit contributions
 sails-0.129
 sail-0.122
 swoop-0.120
 valleys-0.114
He-0.112
Token you
Token ID345
Feature activation-2.77e-01

Pos logit contributions
 sail+0.103
 sails+0.094
He+0.086
 zoom+0.085
 gl+0.084
Neg logit contributions
 virtue-0.128
 eater-0.119
nie-0.118
 occasion-0.114
becca-0.110
Token are
Token ID389
Feature activation-1.10e-02

Pos logit contributions
 eater+0.489
 virtue+0.450
 occasion+0.450
 anniversary+0.446
becca+0.424
Neg logit contributions
 sails-0.422
 sail-0.405
 valleys-0.386
 gl-0.384
 widened-0.384
Token bad
Token ID2089
Feature activation-9.05e-02

Pos logit contributions
 eater+0.021
 occasion+0.020
 virtue+0.019
 anniversary+0.018
olding+0.018
Neg logit contributions
 sails-0.015
 sail-0.014
 gl-0.014
 valleys-0.014
 hollow-0.013
Token!"
Token ID2474
Feature activation-1.22e-01

Pos logit contributions
 virtue+0.175
becca+0.162
 eater+0.162
 occasion+0.161
bek+0.160
Neg logit contributions
 sails-0.119
 sail-0.108
 valleys-0.098
 gl-0.096
He-0.094
Token Ben
Token ID3932
Feature activation-7.11e-02

Pos logit contributions
 virtue+0.186
 eater+0.180
 anniversary+0.161
 occasion+0.161
olding+0.153
Neg logit contributions
 sails-0.211
 sail-0.208
 widened-0.205
He-0.204
 valleys-0.202
Token says
Token ID1139
Feature activation+8.77e-02

Pos logit contributions
 eater+0.138
 virtue+0.137
 occasion+0.132
 anniversary+0.131
 ever+0.124
Neg logit contributions
 sails-0.092
 widened-0.086
 valleys-0.083
 sail-0.083
 gl-0.081
Token.
Token ID13
Feature activation-6.33e-02

Pos logit contributions
 eater+0.158
 occasion+0.149
kids+0.148
 virtue+0.145
becca+0.144
Neg logit contributions
 sails-0.102
 sail-0.098
 gl-0.096
 bent-0.085
 wide-0.084
Token\n
Token ID198
Feature activation+3.47e-02

Pos logit contributions
 eater+0.102
 virtue+0.097
 occasion+0.092
 anniversary+0.092
 ever+0.091
Neg logit contributions
 sails-0.098
 valleys-0.096
 sail-0.093
 widened-0.091
He-0.089
Token\n
Token ID198
Feature activation+4.22e-02

Pos logit contributions
 sails+0.042
 valleys+0.040
 widened+0.040
 sail+0.039
 gl+0.038
Neg logit contributions
 eater-0.066
 virtue-0.064
olding-0.062
 ever-0.061
 anniversary-0.061
Token"
Token ID1
Feature activation-7.87e-02

Pos logit contributions
 sails+0.056
 valleys+0.055
 sail+0.053
 widened+0.052
 gl+0.050
Neg logit contributions
 eater-0.078
 virtue-0.074
 anniversary-0.071
 occasion-0.071
 ever-0.070
TokenCan
Token ID6090
Feature activation-1.71e-01

Pos logit contributions
 eater+0.141
 virtue+0.133
 anniversary+0.127
olding+0.126
 ever+0.126
Neg logit contributions
 sails-0.108
 valleys-0.107
 widened-0.103
 sail-0.102
He-0.101
Token you
Token ID345
Feature activation-2.76e-01

Pos logit contributions
 eater+0.234
 virtue+0.221
olding+0.193
 occasion+0.192
 behaviour+0.191
Neg logit contributions
 sail-0.332
 sails-0.321
He-0.310
 gl-0.306
His-0.301
Token lift
Token ID10303
Feature activation+1.53e-01

Pos logit contributions
 eater+0.381
 virtue+0.355
 anniversary+0.337
 occasion+0.336
 ever+0.335
Neg logit contributions
 sails-0.490
 valleys-0.484
 sail-0.467
 widened-0.465
 gl-0.451
Token the
Token ID262
Feature activation-2.28e-02

Pos logit contributions
 sails+0.233
 valleys+0.224
 sail+0.220
 gl+0.212
 widened+0.210
Neg logit contributions
 eater-0.257
 virtue-0.244
 occasion-0.233
 anniversary-0.232
olding-0.232
Token rock
Token ID3881
Feature activation+1.54e-01

Pos logit contributions
 eater+0.044
 virtue+0.043
olding+0.043
becca+0.042
 ever+0.041
Neg logit contributions
 sails-0.030
 sail-0.029
 gl-0.026
 wood-0.026
 valleys-0.026
Token?"
Token ID1701
Feature activation-7.76e-02

Pos logit contributions
 sail+0.231
 sails+0.226
 gl+0.214
 valleys+0.210
He+0.209
Neg logit contributions
 eater-0.249
 virtue-0.249
 anniversary-0.239
kids-0.233
 occasion-0.232
Token L
Token ID406
Feature activation-7.41e-02

Pos logit contributions
 eater+0.110
 virtue+0.107
 anniversary+0.098
 occasion+0.097
olding+0.096
Neg logit contributions
 sails-0.136
 valleys-0.134
 sail-0.132
 widened-0.130
 gl-0.127
Token of
Token ID286
Feature activation-6.13e-02

Pos logit contributions
 eater+0.064
 ever+0.060
 occasion+0.059
 anniversary+0.059
 virtue+0.059
Neg logit contributions
 sails-0.050
 valleys-0.048
 widened-0.047
 sail-0.046
He-0.044
Token you
Token ID345
Feature activation-2.67e-01

Pos logit contributions
 eater+0.115
 virtue+0.110
olding+0.110
 anniversary+0.104
becca+0.103
Neg logit contributions
 sails-0.083
 valleys-0.083
 sail-0.078
 gl-0.076
 wood-0.074
Token.
Token ID13
Feature activation-5.36e-02

Pos logit contributions
 eater+0.477
becca+0.470
 virtue+0.440
 anniversary+0.439
olding+0.436
Neg logit contributions
 sails-0.365
 sail-0.365
 gl-0.350
 valleys-0.332
 widened-0.324
Token Anna
Token ID11735
Feature activation-1.66e-01

Pos logit contributions
 eater+0.068
 virtue+0.064
 occasion+0.060
 anniversary+0.060
 ever+0.059
Neg logit contributions
 sails-0.100
 valleys-0.098
 sail-0.097
 widened-0.093
He-0.092
Token,
Token ID11
Feature activation+6.02e-02

Pos logit contributions
 eater+0.302
 virtue+0.295
 anniversary+0.295
 occasion+0.295
 hour+0.284
Neg logit contributions
 sails-0.224
 gl-0.218
 widened-0.210
 valleys-0.209
 sail-0.203
Token you
Token ID345
Feature activation-2.75e-01

Pos logit contributions
 sail+0.092
 sails+0.092
 gl+0.091
 valleys+0.087
He+0.086
Neg logit contributions
 virtue-0.101
 eater-0.090
becca-0.089
olding-0.089
 occasion-0.087
Token need
Token ID761
Feature activation-6.01e-02

Pos logit contributions
 eater+0.498
 virtue+0.458
 anniversary+0.455
 occasion+0.454
olding+0.438
Neg logit contributions
 sails-0.407
 sail-0.393
 widened-0.380
 gl-0.379
 valleys-0.375
Token to
Token ID284
Feature activation-6.01e-02

Pos logit contributions
 eater+0.106
 virtue+0.093
olding+0.092
becca+0.092
 anniversary+0.091
Neg logit contributions
 sails-0.090
 sail-0.084
 gl-0.081
 valleys-0.081
 wood-0.080
Token share
Token ID2648
Feature activation-4.16e-02

Pos logit contributions
 eater+0.090
 virtue+0.084
 anniversary+0.080
 occasion+0.079
olding+0.079
Neg logit contributions
 sails-0.100
 sail-0.097
 valleys-0.097
 widened-0.094
 gl-0.091
Token your
Token ID534
Feature activation-5.37e-02

Pos logit contributions
 eater+0.081
 virtue+0.080
olding+0.078
 inconvenience+0.077
 occasion+0.076
Neg logit contributions
 sails-0.048
He-0.045
 sail-0.044
 widened-0.044
 valleys-0.043
Token desk
Token ID6915
Feature activation-4.73e-02

Pos logit contributions
 virtue+0.096
olding+0.092
becca+0.089
icia+0.089
t+0.089
Neg logit contributions
 sails-0.082
 sail-0.082
 wood-0.070
 glitter-0.070
 gl-0.069
Token says
Token ID1139
Feature activation+1.03e-01

Pos logit contributions
 sails+0.155
 widened+0.145
 gl+0.144
 valleys+0.142
 sail+0.141
Neg logit contributions
 eater-0.221
 virtue-0.214
 occasion-0.208
 anniversary-0.206
olding-0.201
Token,
Token ID11
Feature activation+7.07e-02

Pos logit contributions
 sails+0.117
 sail+0.110
 gl+0.108
 widened+0.107
 valleys+0.107
Neg logit contributions
 eater-0.211
 virtue-0.208
 occasion-0.198
nie-0.194
olding-0.193
Token "
Token ID366
Feature activation-7.61e-02

Pos logit contributions
 sail+0.066
 sails+0.066
 valleys+0.061
 gl+0.059
He+0.059
Neg logit contributions
 virtue-0.157
 eater-0.157
 occasion-0.150
 anniversary-0.148
nie-0.147
TokenBen
Token ID11696
Feature activation-4.07e-02

Pos logit contributions
 eater+0.132
 virtue+0.132
 mistake+0.130
 behaviour+0.123
olding+0.120
Neg logit contributions
He-0.111
 valleys-0.106
His-0.099
 sails-0.099
 sail-0.097
Token,
Token ID11
Feature activation+7.24e-02

Pos logit contributions
 occasion+0.106
 anniversary+0.105
 eater+0.104
 virtue+0.104
becca+0.101
Neg logit contributions
 sails-0.019
 gl-0.017
itched-0.014
 widened-0.014
 sail-0.014
Token you
Token ID345
Feature activation-2.75e-01

Pos logit contributions
 sail+0.109
 gl+0.102
 sails+0.100
He+0.097
 zoom+0.095
Neg logit contributions
 virtue-0.125
 eater-0.112
nie-0.110
kids-0.109
olding-0.109
Token are
Token ID389
Feature activation-1.05e-02

Pos logit contributions
 eater+0.560
 occasion+0.533
 virtue+0.528
 anniversary+0.522
olding+0.502
Neg logit contributions
 sails-0.351
 sail-0.335
 gl-0.322
 widened-0.312
 valleys-0.291
Token very
Token ID845
Feature activation-9.13e-02

Pos logit contributions
 eater+0.022
 occasion+0.021
 virtue+0.021
olding+0.020
 behaviour+0.020
Neg logit contributions
 gl-0.012
 sails-0.012
 valleys-0.011
 hollow-0.011
He-0.011
Token rude
Token ID22066
Feature activation-7.06e-02

Pos logit contributions
 eater+0.167
 occasion+0.161
 virtue+0.160
 anniversary+0.153
kids+0.149
Neg logit contributions
 gl-0.115
 sails-0.114
 sail-0.113
He-0.112
 valleys-0.110
Token.
Token ID13
Feature activation-4.41e-02

Pos logit contributions
 occasion+0.161
 virtue+0.158
becca+0.155
 eater+0.153
 anniversary+0.153
Neg logit contributions
 sails-0.060
He-0.058
 He-0.048
 valleys-0.043
 sail-0.042
Token You
Token ID921
Feature activation-2.39e-01

Pos logit contributions
 virtue+0.054
 eater+0.052
 occasion+0.049
 mistake+0.048
 guests+0.047
Neg logit contributions
 sails-0.081
 sail-0.078
 valleys-0.077
He-0.076
 gl-0.074
Token\n
Token ID198
Feature activation+3.58e-02

Pos logit contributions
 eater+0.117
 virtue+0.112
 anniversary+0.108
 occasion+0.108
 ever+0.108
Neg logit contributions
 sails-0.054
 valleys-0.053
 sail-0.050
 widened-0.049
 gl-0.047
Token\n
Token ID198
Feature activation+3.68e-02

Pos logit contributions
 sails+0.047
 valleys+0.045
 widened+0.044
 sail+0.043
 gl+0.042
Neg logit contributions
 eater-0.066
 virtue-0.064
olding-0.062
 ever-0.061
 anniversary-0.061
Token"
Token ID1
Feature activation-8.22e-02

Pos logit contributions
 sails+0.046
 valleys+0.045
 sail+0.043
 widened+0.043
 gl+0.041
Neg logit contributions
 eater-0.071
 virtue-0.068
 anniversary-0.065
 occasion-0.065
 ever-0.064
TokenBen
Token ID11696
Feature activation-5.04e-02

Pos logit contributions
 eater+0.141
 virtue+0.136
olding+0.130
 anniversary+0.129
 exam+0.128
Neg logit contributions
 sails-0.118
 valleys-0.115
 widened-0.113
He-0.113
 sail-0.111
Token,
Token ID11
Feature activation+6.22e-02

Pos logit contributions
 eater+0.120
 occasion+0.119
 anniversary+0.118
 virtue+0.116
 hour+0.113
Neg logit contributions
 sails-0.039
 gl-0.035
 widened-0.033
 sail-0.033
 valleys-0.031
Token you
Token ID345
Feature activation-2.75e-01

Pos logit contributions
 sail+0.093
 sails+0.085
 gl+0.081
He+0.081
 zoom+0.076
Neg logit contributions
 virtue-0.112
 eater-0.102
kids-0.102
child-0.101
ookie-0.101
Token need
Token ID761
Feature activation-6.39e-02

Pos logit contributions
 eater+0.519
 occasion+0.497
 virtue+0.483
 anniversary+0.477
olding+0.467
Neg logit contributions
 sails-0.381
 sail-0.359
 gl-0.340
 widened-0.338
 valleys-0.334
Token to
Token ID284
Feature activation-6.36e-02

Pos logit contributions
 eater+0.104
 virtue+0.093
 anniversary+0.090
olding+0.090
becca+0.088
Neg logit contributions
 sails-0.104
 sail-0.097
 valleys-0.095
 gl-0.093
 wood-0.092
Token drink
Token ID4144
Feature activation-1.06e-02

Pos logit contributions
 eater+0.096
 virtue+0.091
 anniversary+0.086
 occasion+0.085
 ever+0.085
Neg logit contributions
 sails-0.104
 valleys-0.101
 sail-0.100
 widened-0.098
 gl-0.095
Token water
Token ID1660
Feature activation+9.65e-02

Pos logit contributions
 eater+0.020
 virtue+0.019
 anniversary+0.019
olding+0.019
 occasion+0.018
Neg logit contributions
 sails-0.013
 sail-0.013
 valleys-0.012
 gl-0.012
 widened-0.012
Token.
Token ID13
Feature activation-5.68e-02

Pos logit contributions
 sails+0.122
 sail+0.120
He+0.111
 gl+0.110
 widened+0.110
Neg logit contributions
 virtue-0.174
 eater-0.174
 students-0.167
 anniversary-0.167
becca-0.166
Token book
Token ID1492
Feature activation-1.25e-01

Pos logit contributions
 eater+0.012
you+0.012
ople+0.012
olding+0.012
kids+0.012
Neg logit contributions
 sail-0.011
 sails-0.010
 gl-0.010
 spark-0.009
 glitter-0.009
Token for
Token ID329
Feature activation-2.10e-01

Pos logit contributions
 eater+0.216
 virtue+0.207
 behaviour+0.204
 occasion+0.197
kids+0.196
Neg logit contributions
 sails-0.167
 sail-0.163
He-0.161
 gl-0.159
 widened-0.155
Token you
Token ID345
Feature activation-2.71e-01

Pos logit contributions
 virtue+0.331
 eater+0.325
olding+0.307
bek+0.295
ito+0.275
Neg logit contributions
 sails-0.339
 sail-0.334
 valleys-0.324
 glitter-0.318
 gl-0.312
Token.
Token ID13
Feature activation-6.16e-02

Pos logit contributions
becca+0.540
 eater+0.537
 virtue+0.519
leton+0.494
bek+0.493
Neg logit contributions
 sails-0.295
 sail-0.294
 wide-0.280
 gl-0.266
He-0.258
Token Do
Token ID2141
Feature activation-8.94e-02

Pos logit contributions
 eater+0.087
 virtue+0.086
 mistake+0.084
 behaviour+0.080
 occasion+0.078
Neg logit contributions
 sails-0.100
 sail-0.097
He-0.097
 valleys-0.096
 widened-0.092
Token you
Token ID345
Feature activation-2.75e-01

Pos logit contributions
 eater+0.089
 virtue+0.087
olding+0.073
 occasion+0.069
 behaviour+0.064
Neg logit contributions
 sails-0.193
 sail-0.190
 gl-0.188
He-0.188
His-0.182
Token want
Token ID765
Feature activation-7.80e-02

Pos logit contributions
becca+0.474
 virtue+0.471
 eater+0.469
 anniversary+0.444
otte+0.444
Neg logit contributions
 sail-0.403
 sails-0.382
He-0.350
 gl-0.337
 widened-0.336
Token to
Token ID284
Feature activation-6.50e-02

Pos logit contributions
 eater+0.140
 virtue+0.136
olding+0.131
 behaviour+0.124
nie+0.122
Neg logit contributions
 sails-0.109
 sail-0.104
 gl-0.102
 glitter-0.098
 valleys-0.095
Token hear
Token ID3285
Feature activation+5.04e-02

Pos logit contributions
 eater+0.099
 virtue+0.092
 ever+0.090
 anniversary+0.090
 occasion+0.089
Neg logit contributions
 sails-0.106
 valleys-0.106
 widened-0.101
 sail-0.099
 gl-0.097
Token a
Token ID257
Feature activation-3.06e-03

Pos logit contributions
 sails+0.065
 gl+0.061
 sail+0.060
 glitter+0.058
 valleys+0.057
Neg logit contributions
 virtue-0.098
 eater-0.096
olding-0.091
 behaviour-0.085
 occasion-0.085
Token story
Token ID1621
Feature activation-1.34e-01

Pos logit contributions
you+0.005
olding+0.005
 virtue+0.005
 eater+0.005
ople+0.005
Neg logit contributions
 sail-0.005
 gl-0.004
 sails-0.004
 spark-0.004
 wood-0.004
Token."
Token ID526
Feature activation-1.57e-01

Pos logit contributions
 virtue+0.080
becca+0.079
icia+0.078
otte+0.078
 eater+0.075
Neg logit contributions
 sail-0.074
 sails-0.069
 gl-0.061
He-0.057
 wood-0.056
Token\n
Token ID198
Feature activation+3.92e-02

Pos logit contributions
 virtue+0.381
 eater+0.369
 anniversary+0.354
 occasion+0.353
 mistake+0.350
Neg logit contributions
He-0.120
 sail-0.114
 sails-0.114
 valleys-0.111
 widened-0.106
Token\n
Token ID198
Feature activation+4.11e-02

Pos logit contributions
 sails+0.040
 valleys+0.039
 sail+0.037
 widened+0.036
 gl+0.034
Neg logit contributions
 eater-0.084
 virtue-0.080
 anniversary-0.078
 occasion-0.077
 ever-0.077
TokenAnna
Token ID31160
Feature activation-1.51e-01

Pos logit contributions
He+0.047
 valleys+0.046
 sails+0.046
 sail+0.045
 widened+0.043
Neg logit contributions
 eater-0.082
 virtue-0.077
 anniversary-0.075
 behaviour-0.075
 ever-0.074
Token says
Token ID1139
Feature activation+9.99e-02

Pos logit contributions
 eater+0.291
 virtue+0.290
 anniversary+0.282
 occasion+0.282
 hour+0.269
Neg logit contributions
 sails-0.192
 widened-0.186
 gl-0.177
 valleys-0.174
 sail-0.170
Token you
Token ID345
Feature activation-2.74e-01

Pos logit contributions
 sails+0.093
 sail+0.087
 widened+0.087
 valleys+0.086
 gl+0.085
Neg logit contributions
 eater-0.223
 virtue-0.213
 occasion-0.209
 anniversary-0.208
nie-0.201
Token're
Token ID821
Feature activation-1.13e-02

Pos logit contributions
 eater+0.529
 occasion+0.508
 anniversary+0.505
 virtue+0.496
becca+0.492
Neg logit contributions
 sails-0.359
 sail-0.345
 widened-0.335
He-0.323
 gl-0.321
Token welcome
Token ID7062
Feature activation-2.90e-02

Pos logit contributions
 eater+0.018
 occasion+0.017
 virtue+0.016
 anniversary+0.016
olding+0.015
Neg logit contributions
 sails-0.018
 sail-0.018
 valleys-0.018
 gl-0.017
 hollow-0.017
Token to
Token ID284
Feature activation-5.50e-02

Pos logit contributions
 virtue+0.074
 occasion+0.069
 eater+0.067
 inconvenience+0.065
becca+0.064
Neg logit contributions
He-0.018
 sails-0.018
 wide-0.018
 sail-0.016
 valleys-0.016
Token Ben
Token ID3932
Feature activation-5.79e-02

Pos logit contributions
 virtue+0.124
olding+0.113
 eater+0.111
aughter+0.107
 occasion+0.107
Neg logit contributions
 sail-0.068
 sails-0.060
 valleys-0.055
 bent-0.054
 gl-0.053
Token.
Token ID13
Feature activation-3.99e-02

Pos logit contributions
 virtue+0.133
 inconvenience+0.128
 occasion+0.126
becca+0.126
 anniversary+0.121
Neg logit contributions
 sail-0.050
 sails-0.048
He-0.046
 gl-0.044
 wide-0.042
Token,
Token ID11
Feature activation+8.47e-02

Pos logit contributions
 sails+0.073
 sail+0.071
 gl+0.070
 widened+0.067
 valleys+0.065
Neg logit contributions
 eater-0.093
 virtue-0.087
olding-0.084
 anniversary-0.083
 occasion-0.082
Token I
Token ID314
Feature activation-1.79e-01

Pos logit contributions
 sail+0.115
 sails+0.113
 gl+0.106
 valleys+0.106
He+0.104
Neg logit contributions
 virtue-0.161
 eater-0.156
olding-0.148
 occasion-0.147
 anniversary-0.146
Token am
Token ID716
Feature activation+3.71e-02

Pos logit contributions
 eater+0.280
 anniversary+0.264
 virtue+0.261
becca+0.257
 occasion+0.253
Neg logit contributions
 sails-0.284
 sail-0.282
 gl-0.273
 widened-0.271
 valleys-0.261
Token.
Token ID13
Feature activation-5.62e-02

Pos logit contributions
 sail+0.048
 gl+0.047
 sails+0.046
 bent+0.043
 perched+0.041
Neg logit contributions
 eater-0.071
 occasion-0.067
 virtue-0.065
 anniversary-0.065
 behaviour-0.064
Token Do
Token ID2141
Feature activation-8.99e-02

Pos logit contributions
 eater+0.074
 virtue+0.072
 mistake+0.071
 occasion+0.066
 behaviour+0.065
Neg logit contributions
 sails-0.103
 sail-0.100
 valleys-0.099
He-0.095
 gl-0.094
Token you
Token ID345
Feature activation-2.74e-01

Pos logit contributions
 eater+0.069
 virtue+0.064
 occasion+0.054
olding+0.054
 anniversary+0.046
Neg logit contributions
 sails-0.222
 sail-0.218
 gl-0.214
 valleys-0.212
He-0.208
Token want
Token ID765
Feature activation-7.73e-02

Pos logit contributions
 virtue+0.455
 eater+0.453
becca+0.439
aughter+0.420
 anniversary+0.419
Neg logit contributions
 sail-0.444
 sails-0.421
 gl-0.385
 widened-0.371
He-0.366
Token to
Token ID284
Feature activation-6.10e-02

Pos logit contributions
 eater+0.124
 virtue+0.119
olding+0.113
 anniversary+0.109
becca+0.107
Neg logit contributions
 sails-0.126
 sail-0.121
 gl-0.115
 valleys-0.114
 widened-0.108
Token go
Token ID467
Feature activation+4.85e-02

Pos logit contributions
 eater+0.078
 ever+0.075
 anniversary+0.073
 occasion+0.071
 exam+0.070
Neg logit contributions
 valleys-0.115
 widened-0.109
 sails-0.108
 gl-0.102
He-0.100
Token downstairs
Token ID34624
Feature activation-1.75e-02

Pos logit contributions
 sails+0.083
 valleys+0.081
 sail+0.079
 widened+0.078
 gl+0.076
Neg logit contributions
 eater-0.071
 virtue-0.067
 anniversary-0.064
 occasion-0.063
 ever-0.063
Token?"
Token ID1701
Feature activation-7.21e-02

Pos logit contributions
 virtue+0.037
olding+0.035
 eater+0.034
 behaviour+0.034
 inconvenience+0.034
Neg logit contributions
 sail-0.019
 sails-0.018
 wide-0.017
 valleys-0.016
 gl-0.016
Token you
Token ID345
Feature activation-2.79e-01

Pos logit contributions
 eater+0.412
 virtue+0.386
olding+0.385
aughter+0.363
nie+0.360
Neg logit contributions
 sails-0.302
 sail-0.292
 valleys-0.286
 wood-0.270
 gl-0.260
Token.
Token ID13
Feature activation-5.97e-02

Pos logit contributions
becca+0.508
 eater+0.507
aughter+0.479
 principal+0.471
kids+0.469
Neg logit contributions
 sail-0.376
 sails-0.360
 gl-0.322
He-0.320
 wide-0.319
Token They
Token ID1119
Feature activation+3.49e-02

Pos logit contributions
 eater+0.083
 virtue+0.079
 occasion+0.074
 anniversary+0.074
 ever+0.074
Neg logit contributions
 sails-0.104
 valleys-0.102
 sail-0.101
He-0.096
 widened-0.096
Token will
Token ID481
Feature activation-9.02e-02

Pos logit contributions
 sails+0.050
 sail+0.048
 gl+0.046
 widened+0.046
 valleys+0.044
Neg logit contributions
 eater-0.063
 virtue-0.062
 occasion-0.060
olding-0.059
 anniversary-0.058
Token make
Token ID787
Feature activation+2.39e-03

Pos logit contributions
 eater+0.146
 virtue+0.136
olding+0.131
 occasion+0.128
 behaviour+0.126
Neg logit contributions
 sail-0.144
 sails-0.143
 gl-0.131
 widened-0.131
 valleys-0.131
Token you
Token ID345
Feature activation-2.73e-01

Pos logit contributions
 sails+0.004
 sail+0.004
 valleys+0.004
 gl+0.004
 wood+0.003
Neg logit contributions
 eater-0.004
 virtue-0.004
olding-0.004
 occasion-0.004
 hour-0.003
Token strong
Token ID1913
Feature activation+3.42e-02

Pos logit contributions
 eater+0.476
 virtue+0.456
 occasion+0.435
 anniversary+0.432
olding+0.429
Neg logit contributions
 sails-0.391
 valleys-0.377
 sail-0.376
 widened-0.363
 gl-0.352
Token and
Token ID290
Feature activation-5.57e-02

Pos logit contributions
 sails+0.051
 valleys+0.049
 sail+0.048
 widened+0.046
 gl+0.044
Neg logit contributions
 eater-0.058
 virtue-0.056
 occasion-0.056
olding-0.055
becca-0.055
Token happy
Token ID3772
Feature activation-5.72e-02

Pos logit contributions
 eater+0.110
 virtue+0.104
 occasion+0.103
 anniversary+0.101
kids+0.099
Neg logit contributions
 sails-0.070
 sail-0.069
 valleys-0.066
 widened-0.063
 gl-0.063
Token."
Token ID526
Feature activation-1.54e-01

Pos logit contributions
becca+0.097
 virtue+0.097
olding+0.096
 eater+0.095
aughter+0.093
Neg logit contributions
 sails-0.086
 sail-0.085
 valleys-0.081
 gl-0.078
 widened-0.075
Token\n
Token ID198
Feature activation+4.57e-02

Pos logit contributions
 virtue+0.360
 eater+0.353
 occasion+0.333
 anniversary+0.331
 behaviour+0.328
Neg logit contributions
 sails-0.126
 sail-0.125
He-0.124
 valleys-0.124
 widened-0.113
Token from
Token ID422
Feature activation-4.37e-02

Pos logit contributions
 sails+0.135
 valleys+0.133
 sail+0.130
 widened+0.127
 gl+0.126
Neg logit contributions
 eater-0.103
 virtue-0.098
 anniversary-0.092
 occasion-0.091
olding-0.090
Token the
Token ID262
Feature activation-3.03e-02

Pos logit contributions
 eater+0.089
 virtue+0.086
olding+0.082
 occasion+0.080
 anniversary+0.078
Neg logit contributions
 sails-0.053
 valleys-0.051
 sail-0.050
 gl-0.046
 wood-0.044
Token book
Token ID1492
Feature activation-1.09e-01

Pos logit contributions
 eater+0.056
olding+0.055
 virtue+0.054
becca+0.053
you+0.052
Neg logit contributions
 sail-0.041
 sails-0.041
 gl-0.037
 wood-0.037
 valleys-0.037
Token.
Token ID13
Feature activation-4.26e-02

Pos logit contributions
 eater+0.215
 virtue+0.204
 anniversary+0.193
becca+0.192
 occasion+0.190
Neg logit contributions
 sails-0.129
 gl-0.129
 widened-0.122
 sail-0.119
He-0.119
Token Do
Token ID2141
Feature activation-8.74e-02

Pos logit contributions
 eater+0.094
 virtue+0.091
 anniversary+0.087
 occasion+0.087
 ever+0.086
Neg logit contributions
 sails-0.040
 valleys-0.040
 sail-0.038
 widened-0.036
He-0.036
Token you
Token ID345
Feature activation-2.73e-01

Pos logit contributions
 eater+0.080
 virtue+0.076
 occasion+0.065
olding+0.063
nie+0.057
Neg logit contributions
 sails-0.205
 sail-0.201
He-0.198
 gl-0.198
 valleys-0.191
Token want
Token ID765
Feature activation-7.52e-02

Pos logit contributions
 eater+0.426
 virtue+0.419
becca+0.399
 anniversary+0.394
 occasion+0.388
Neg logit contributions
 sail-0.456
 sails-0.446
 gl-0.408
He-0.404
 widened-0.402
Token to
Token ID284
Feature activation-6.06e-02

Pos logit contributions
 eater+0.128
 virtue+0.122
olding+0.117
 anniversary+0.110
nie+0.110
Neg logit contributions
 sails-0.117
 sail-0.112
 gl-0.108
 valleys-0.104
 glitter-0.101
Token hear
Token ID3285
Feature activation+5.32e-02

Pos logit contributions
 eater+0.091
 ever+0.086
 anniversary+0.085
 virtue+0.083
 occasion+0.083
Neg logit contributions
 valleys-0.100
 sails-0.097
 widened-0.095
 gl-0.089
 sail-0.088
Token them
Token ID606
Feature activation+7.91e-03

Pos logit contributions
 sails+0.074
 sail+0.069
 gl+0.067
 valleys+0.067
 widened+0.063
Neg logit contributions
 eater-0.100
 virtue-0.098
olding-0.091
 occasion-0.089
 anniversary-0.087
Token?"
Token ID1701
Feature activation-7.72e-02

Pos logit contributions
 sail+0.012
 sails+0.012
 gl+0.011
He+0.011
 widened+0.011
Neg logit contributions
 virtue-0.013
 eater-0.013
 occasion-0.012
olding-0.012
 anniversary-0.012
Token farmer
Token ID18739
Feature activation+4.42e-03

Pos logit contributions
 sails+0.110
 gl+0.104
 valleys+0.104
 sail+0.104
 widened+0.103
Neg logit contributions
 eater-0.057
 virtue-0.054
olding-0.053
 you-0.050
 hour-0.048
Token says
Token ID1139
Feature activation+9.92e-02

Pos logit contributions
 sails+0.006
 valleys+0.005
 sail+0.005
 gl+0.005
 widened+0.005
Neg logit contributions
 virtue-0.009
 eater-0.008
 anniversary-0.008
 occasion-0.008
olding-0.008
Token.
Token ID13
Feature activation-4.22e-02

Pos logit contributions
 sails+0.110
 gl+0.105
 sail+0.102
 widened+0.097
He+0.095
Neg logit contributions
 virtue-0.204
 eater-0.200
 occasion-0.191
 anniversary-0.188
olding-0.184
Token "
Token ID366
Feature activation-8.19e-02

Pos logit contributions
 eater+0.092
 virtue+0.089
 anniversary+0.086
 occasion+0.086
 ever+0.085
Neg logit contributions
 sails-0.041
 valleys-0.040
 sail-0.038
 widened-0.037
 gl-0.035
TokenDo
Token ID5211
Feature activation-1.00e-01

Pos logit contributions
 eater+0.139
 virtue+0.134
 anniversary+0.129
 guests+0.127
 occasion+0.126
Neg logit contributions
 valleys-0.117
 sails-0.116
He-0.116
 sail-0.111
 widened-0.109
Token you
Token ID345
Feature activation-2.73e-01

Pos logit contributions
 eater+0.107
 virtue+0.095
 occasion+0.086
olding+0.085
nie+0.077
Neg logit contributions
 sails-0.222
 sail-0.218
He-0.216
 gl-0.214
His-0.208
Token want
Token ID765
Feature activation-7.08e-02

Pos logit contributions
 virtue+0.415
 eater+0.409
becca+0.391
 anniversary+0.376
 occasion+0.372
Neg logit contributions
 sail-0.465
 sails-0.456
 widened-0.420
 gl-0.419
He-0.411
Token to
Token ID284
Feature activation-5.59e-02

Pos logit contributions
 eater+0.122
 virtue+0.117
olding+0.114
 anniversary+0.107
nie+0.106
Neg logit contributions
 sails-0.109
 sail-0.104
 gl-0.099
 valleys-0.093
 glitter-0.093
Token meet
Token ID1826
Feature activation-2.72e-02

Pos logit contributions
 eater+0.083
 virtue+0.076
 ever+0.076
 anniversary+0.075
 occasion+0.075
Neg logit contributions
 valleys-0.093
 sails-0.092
 widened-0.089
 sail-0.085
 gl-0.085
Token the
Token ID262
Feature activation-1.69e-02

Pos logit contributions
 virtue+0.052
 eater+0.051
olding+0.048
 anniversary+0.046
 occasion+0.046
Neg logit contributions
 sails-0.036
 sail-0.034
 valleys-0.033
 gl-0.032
 widened-0.031
Token bull
Token ID6473
Feature activation-4.96e-02

Pos logit contributions
olding+0.033
 virtue+0.032
oney+0.031
 eater+0.031
you+0.031
Neg logit contributions
 sail-0.022
 sails-0.021
 wood-0.020
 gl-0.019
 valleys-0.018
Token bed
Token ID3996
Feature activation-4.40e-02

Pos logit contributions
 virtue+0.167
 eater+0.167
olding+0.166
becca+0.160
you+0.159
Neg logit contributions
 sails-0.156
 sail-0.152
 gl-0.137
 hood-0.132
 wood-0.130
Token.
Token ID13
Feature activation-6.50e-02

Pos logit contributions
 virtue+0.077
 eater+0.076
olding+0.075
becca+0.074
bek+0.072
Neg logit contributions
 gl-0.055
 sails-0.055
 sail-0.053
 widened-0.051
He-0.050
Token They
Token ID1119
Feature activation+2.88e-02

Pos logit contributions
 eater+0.081
 virtue+0.076
 occasion+0.070
 anniversary+0.070
 mistake+0.070
Neg logit contributions
 sails-0.124
 valleys-0.122
 sail-0.120
He-0.116
 widened-0.115
Token will
Token ID481
Feature activation-9.10e-02

Pos logit contributions
 sails+0.042
 sail+0.040
 gl+0.040
 widened+0.039
 valleys+0.037
Neg logit contributions
 eater-0.052
 virtue-0.051
 occasion-0.048
olding-0.048
becca-0.048
Token keep
Token ID1394
Feature activation-1.50e-02

Pos logit contributions
 eater+0.149
 virtue+0.140
olding+0.135
 occasion+0.130
 behaviour+0.129
Neg logit contributions
 sail-0.142
 sails-0.141
 gl-0.132
 widened-0.132
 valleys-0.131
Token you
Token ID345
Feature activation-2.73e-01

Pos logit contributions
 eater+0.029
 virtue+0.029
olding+0.027
 occasion+0.026
oker+0.026
Neg logit contributions
 sails-0.021
 sail-0.020
 valleys-0.019
 gl-0.019
 wood-0.018
Token company
Token ID1664
Feature activation-1.13e-01

Pos logit contributions
 eater+0.555
 virtue+0.528
 occasion+0.513
becca+0.504
olding+0.503
Neg logit contributions
 sail-0.338
 sails-0.323
 valleys-0.292
 gl-0.290
 widened-0.284
Token and
Token ID290
Feature activation-4.63e-02

Pos logit contributions
 eater+0.207
becca+0.202
 virtue+0.198
olding+0.194
 occasion+0.193
Neg logit contributions
 sail-0.140
 sails-0.137
He-0.133
 widened-0.127
 wide-0.125
Token protect
Token ID1805
Feature activation-8.59e-03

Pos logit contributions
 eater+0.079
 virtue+0.079
olding+0.077
aughter+0.076
ito+0.075
Neg logit contributions
 sail-0.072
 sails-0.068
 gl-0.062
 valleys-0.060
 widened-0.059
Token you
Token ID345
Feature activation-2.62e-01

Pos logit contributions
 eater+0.019
 virtue+0.018
olding+0.017
 occasion+0.017
 inconvenience+0.017
Neg logit contributions
 sails-0.008
 valleys-0.008
 sail-0.008
 wood-0.007
He-0.007
Token from
Token ID422
Feature activation-3.60e-02

Pos logit contributions
becca+0.480
 eater+0.474
 occasion+0.464
 virtue+0.462
kids+0.462
Neg logit contributions
 sail-0.360
 sails-0.345
 valleys-0.316
 widened-0.303
He-0.301
Token your
Token ID534
Feature activation-5.25e-02

Pos logit contributions
 eater+0.194
 virtue+0.189
olding+0.182
 occasion+0.176
ae+0.173
Neg logit contributions
 sails-0.169
 sail-0.158
 wood-0.150
 glitter-0.147
 arrows-0.147
Token sister
Token ID6621
Feature activation-1.41e-01

Pos logit contributions
ae+0.099
 virtue+0.098
becca+0.094
 eater+0.093
you+0.093
Neg logit contributions
 sails-0.079
 sail-0.077
 hood-0.065
 wood-0.064
 arrows-0.063
Token.
Token ID13
Feature activation-5.42e-02

Pos logit contributions
becca+0.316
 virtue+0.314
rav+0.311
bek+0.309
 occasion+0.307
Neg logit contributions
 sails-0.114
He-0.097
 widened-0.096
 sail-0.095
 wide-0.093
Token Sara
Token ID24799
Feature activation-9.00e-02

Pos logit contributions
 eater+0.083
 virtue+0.079
 occasion+0.075
 anniversary+0.074
 ever+0.074
Neg logit contributions
 sails-0.087
 valleys-0.084
 sail-0.083
 widened-0.082
He-0.081
Token,
Token ID11
Feature activation+6.01e-02

Pos logit contributions
 occasion+0.182
 eater+0.174
 anniversary+0.174
 virtue+0.171
ae+0.170
Neg logit contributions
 sails-0.114
 gl-0.105
 sail-0.101
 bent-0.093
 valleys-0.093
Token you
Token ID345
Feature activation-2.73e-01

Pos logit contributions
 sail+0.109
 sails+0.106
He+0.105
 gl+0.104
 glitter+0.100
Neg logit contributions
 virtue-0.084
 eater-0.079
aughter-0.072
kids-0.071
becca-0.071
Token need
Token ID761
Feature activation-6.14e-02

Pos logit contributions
 eater+0.531
becca+0.514
 occasion+0.504
ae+0.502
 anniversary+0.492
Neg logit contributions
 sails-0.369
 sail-0.368
 gl-0.324
He-0.317
 widened-0.311
Token to
Token ID284
Feature activation-6.35e-02

Pos logit contributions
 eater+0.097
 virtue+0.086
olding+0.083
 anniversary+0.083
nie+0.082
Neg logit contributions
 sails-0.102
 sail-0.096
 valleys-0.094
 gl-0.092
 widened-0.090
Token ask
Token ID1265
Feature activation+6.76e-02

Pos logit contributions
 eater+0.091
 virtue+0.085
 anniversary+0.081
 occasion+0.080
 ever+0.080
Neg logit contributions
 sails-0.109
 valleys-0.106
 sail-0.105
 widened-0.103
 gl-0.100
Token nicely
Token ID16576
Feature activation-1.65e-01

Pos logit contributions
 his+0.064
 wide+0.058
 wood+0.056
 sails+0.054
 bent+0.052
Neg logit contributions
 virtue-0.142
oker-0.139
 eater-0.138
bek-0.134
rav-0.127
Token and
Token ID290
Feature activation-5.26e-02

Pos logit contributions
becca+0.344
 eater+0.341
 inconvenience+0.340
 occasion+0.336
 virtue+0.335
Neg logit contributions
He-0.132
 sails-0.125
 wide-0.121
 wood-0.119
His-0.111
Token our
Token ID674
Feature activation-8.85e-02

Pos logit contributions
 eater+0.100
 virtue+0.098
 occasion+0.093
olding+0.092
 anniversary+0.091
Neg logit contributions
 sails-0.050
 sail-0.046
 valleys-0.044
 gl-0.042
 widened-0.040
Token friendship
Token ID14738
Feature activation-1.05e-01

Pos logit contributions
olding+0.176
 virtue+0.174
 occasion+0.166
 eater+0.164
you+0.164
Neg logit contributions
 sails-0.130
 sail-0.124
 gl-0.104
 valleys-0.104
 feathers-0.099
Token stronger
Token ID7387
Feature activation+3.44e-02

Pos logit contributions
 eater+0.184
 virtue+0.175
 occasion+0.168
 anniversary+0.167
olding+0.167
Neg logit contributions
 sails-0.151
 sail-0.147
 valleys-0.144
 widened-0.141
 gl-0.138
Token.
Token ID13
Feature activation-5.23e-02

Pos logit contributions
 sail+0.053
 sails+0.053
 valleys+0.047
 gl+0.045
 widened+0.044
Neg logit contributions
becca-0.059
olding-0.058
 eater-0.056
otte-0.056
aughter-0.056
Token Do
Token ID2141
Feature activation-9.01e-02

Pos logit contributions
 eater+0.087
 virtue+0.082
 anniversary+0.079
 ever+0.079
 occasion+0.078
Neg logit contributions
 sails-0.077
 valleys-0.076
 widened-0.073
 sail-0.072
 gl-0.070
Token you
Token ID345
Feature activation-2.73e-01

Pos logit contributions
 eater+0.086
 virtue+0.079
 occasion+0.072
olding+0.071
nie+0.065
Neg logit contributions
 sails-0.209
 sail-0.204
 gl-0.200
He-0.199
 valleys-0.192
Token promise
Token ID6991
Feature activation-1.44e-01

Pos logit contributions
 eater+0.439
 virtue+0.425
becca+0.414
 anniversary+0.412
 occasion+0.406
Neg logit contributions
 sails-0.433
 sail-0.432
 widened-0.396
 gl-0.394
He-0.392
Token?"
Token ID1701
Feature activation-7.19e-02

Pos logit contributions
 eater+0.194
 virtue+0.181
 anniversary+0.170
 occasion+0.169
olding+0.167
Neg logit contributions
 sails-0.262
 valleys-0.255
 sail-0.253
 widened-0.248
 gl-0.242
Token
Token ID198
Feature activation+4.27e-02

Pos logit contributions
 eater+0.159
 virtue+0.158
 anniversary+0.149
 occasion+0.148
olding+0.146
Neg logit contributions
 sails-0.066
 valleys-0.065
 sail-0.064
 widened-0.062
He-0.061
Token
Token ID198
Feature activation+4.30e-02

Pos logit contributions
 sails+0.055
 valleys+0.052
 widened+0.050
 sail+0.050
 gl+0.048
Neg logit contributions
 eater-0.079
 virtue-0.075
 ever-0.074
olding-0.073
 occasion-0.072
TokenBen
Token ID11696
Feature activation-4.40e-02

Pos logit contributions
 sails+0.077
 valleys+0.075
 sail+0.073
 widened+0.072
 gl+0.070
Neg logit contributions
 eater-0.060
 virtue-0.056
 anniversary-0.053
 occasion-0.053
 ever-0.053
Token Anna
Token ID11735
Feature activation-1.51e-01

Pos logit contributions
 sails+0.073
 sail+0.072
 gl+0.069
 valleys+0.069
 widened+0.067
Neg logit contributions
 virtue-0.083
 eater-0.080
olding-0.074
 anniversary-0.073
 occasion-0.073
Token says
Token ID1139
Feature activation+9.18e-02

Pos logit contributions
 eater+0.295
 virtue+0.293
 anniversary+0.277
 occasion+0.275
olding+0.264
Neg logit contributions
 sails-0.190
 widened-0.179
 sail-0.174
 valleys-0.173
 gl-0.172
Token.
Token ID13
Feature activation-5.02e-02

Pos logit contributions
 sails+0.122
 sail+0.119
 widened+0.117
 valleys+0.114
 gl+0.112
Neg logit contributions
 eater-0.168
 virtue-0.164
 occasion-0.155
 anniversary-0.153
becca-0.148
Token "
Token ID366
Feature activation-8.82e-02

Pos logit contributions
 eater+0.081
 virtue+0.075
 ever+0.074
olding+0.074
 tutor+0.073
Neg logit contributions
 sails-0.075
 valleys-0.072
 sail-0.071
 widened-0.071
 gl-0.067
TokenBut
Token ID1537
Feature activation-1.09e-01

Pos logit contributions
 eater+0.162
 virtue+0.156
 mistake+0.153
 behaviour+0.152
 exam+0.148
Neg logit contributions
 valleys-0.119
He-0.118
 sails-0.112
 sail-0.110
 widened-0.102
Token you
Token ID345
Feature activation-2.73e-01

Pos logit contributions
 eater+0.154
 virtue+0.146
kids+0.134
olding+0.134
child+0.132
Neg logit contributions
 sails-0.198
 sail-0.197
 gl-0.194
 valleys-0.188
 glitter-0.179
Token have
Token ID423
Feature activation-7.24e-02

Pos logit contributions
 eater+0.525
 occasion+0.495
 anniversary+0.479
 virtue+0.476
olding+0.465
Neg logit contributions
 sails-0.397
 sail-0.393
 gl-0.356
 widened-0.352
 spark-0.332
Token to
Token ID284
Feature activation-5.84e-02

Pos logit contributions
 eater+0.147
olding+0.135
aughter+0.133
 virtue+0.131
 occasion+0.130
Neg logit contributions
 sails-0.096
 valleys-0.085
 wood-0.080
 sail-0.079
 gl-0.079
Token be
Token ID307
Feature activation+6.04e-03

Pos logit contributions
 eater+0.096
 virtue+0.090
olding+0.084
nie+0.082
 occasion+0.082
Neg logit contributions
 sail-0.091
 sails-0.088
 widened-0.083
He-0.080
 valleys-0.080
Token fair
Token ID3148
Feature activation-4.30e-02

Pos logit contributions
 sails+0.008
 valleys+0.008
 sail+0.008
 widened+0.008
He+0.008
Neg logit contributions
 eater-0.011
 occasion-0.010
 virtue-0.010
 anniversary-0.010
 behaviour-0.009
Token.
Token ID13
Feature activation-4.14e-02

Pos logit contributions
 virtue+0.085
 eater+0.083
 occasion+0.083
 inconvenience+0.083
becca+0.082
Neg logit contributions
 sails-0.047
 sail-0.045
 valleys-0.041
He-0.040
 wide-0.040
TokenMaybe
Token ID13300
Feature activation-6.97e-02

Pos logit contributions
 eater+0.097
 virtue+0.089
 occasion+0.085
 anniversary+0.084
 ever+0.083
Neg logit contributions
 sails-0.164
 valleys-0.161
 sail-0.156
 widened-0.155
 gl-0.150
Token.
Token ID13
Feature activation-5.42e-02

Pos logit contributions
 eater+0.098
 virtue+0.095
olding+0.087
 occasion+0.086
 anniversary+0.085
Neg logit contributions
 sails-0.125
 sail-0.122
 valleys-0.120
 gl-0.116
 widened-0.115
Token Why
Token ID4162
Feature activation-1.88e-01

Pos logit contributions
 eater+0.087
 virtue+0.084
 occasion+0.081
 mistake+0.080
nie+0.079
Neg logit contributions
 sails-0.082
 valleys-0.081
 sail-0.080
He-0.075
 widened-0.074
Token don
Token ID836
Feature activation-2.19e-01

Pos logit contributions
 eater+0.310
 occasion+0.309
 virtue+0.301
olding+0.292
child+0.281
Neg logit contributions
 sail-0.293
 sails-0.289
 gl-0.287
 widened-0.274
 glitter-0.263
Token't
Token ID470
Feature activation-1.60e-01

Pos logit contributions
 eater+0.178
 ever+0.168
 virtue+0.160
 you+0.152
 anniversary+0.145
Neg logit contributions
 valleys-0.501
 sails-0.494
 widened-0.477
 sail-0.463
 gl-0.459
Token you
Token ID345
Feature activation-2.73e-01

Pos logit contributions
 virtue+0.126
 eater+0.121
olding+0.108
 occasion+0.102
 anniversary+0.093
Neg logit contributions
 sail-0.389
 sails-0.381
 gl-0.368
 valleys-0.363
 widened-0.363
Token try
Token ID1949
Feature activation+6.24e-02

Pos logit contributions
 eater+0.474
 virtue+0.460
 anniversary+0.427
 occasion+0.426
olding+0.419
Neg logit contributions
 sail-0.407
 sails-0.401
 widened-0.378
 gl-0.360
 widen-0.349
Token?
Token ID30
Feature activation-9.26e-02

Pos logit contributions
 sails+0.113
 valleys+0.112
 widened+0.109
 sail+0.107
 gl+0.104
Neg logit contributions
 eater-0.083
 virtue-0.077
 ever-0.074
 occasion-0.073
 anniversary-0.073
Token Make
Token ID6889
Feature activation-2.09e-02

Pos logit contributions
 eater+0.157
 virtue+0.152
 occasion+0.147
 ever+0.145
 anniversary+0.143
Neg logit contributions
 sails-0.134
 sail-0.128
 valleys-0.127
 widened-0.122
He-0.121
Token a
Token ID257
Feature activation+2.21e-03

Pos logit contributions
 eater+0.038
 virtue+0.036
 anniversary+0.035
 occasion+0.035
 ever+0.035
Neg logit contributions
 sails-0.028
 valleys-0.027
 sail-0.026
 widened-0.026
 gl-0.025
Token wish
Token ID4601
Feature activation+7.05e-02

Pos logit contributions
 sails+0.003
 sail+0.003
 gl+0.003
 valleys+0.003
 widened+0.003
Neg logit contributions
 eater-0.004
 virtue-0.003
 ever-0.003
olding-0.003
 occasion-0.003
Token,
Token ID11
Feature activation+5.29e-02

Pos logit contributions
 eater+0.504
becca+0.475
 virtue+0.462
 anniversary+0.454
kids+0.451
Neg logit contributions
 sail-0.442
 sails-0.436
 gl-0.415
 widened-0.379
 sailed-0.373
Token Ben
Token ID3932
Feature activation-8.39e-02

Pos logit contributions
 sail+0.086
 sails+0.079
 gl+0.075
He+0.071
 spark+0.070
Neg logit contributions
 virtue-0.085
olding-0.080
 eater-0.079
kids-0.079
child-0.078
Token.
Token ID13
Feature activation-6.66e-02

Pos logit contributions
olding+0.153
 occasion+0.153
 anniversary+0.153
becca+0.150
 virtue+0.148
Neg logit contributions
 sails-0.107
 sail-0.104
 gl-0.095
He-0.095
 sailed-0.093
Token And
Token ID843
Feature activation-1.61e-01

Pos logit contributions
 eater+0.077
 virtue+0.077
 mistake+0.073
 occasion+0.072
 anniversary+0.070
Neg logit contributions
 sails-0.130
 sail-0.126
 valleys-0.122
He-0.121
 gl-0.118
Token thank
Token ID5875
Feature activation-1.54e-02

Pos logit contributions
 eater+0.200
kids+0.188
 virtue+0.184
child+0.183
olding+0.181
Neg logit contributions
 sail-0.314
 sails-0.305
 gl-0.295
 valleys-0.279
He-0.279
Token you
Token ID345
Feature activation-2.73e-01

Pos logit contributions
 virtue+0.018
 eater+0.016
olding+0.015
nie+0.015
 occasion+0.014
Neg logit contributions
 sail-0.032
 sails-0.032
 gl-0.030
 valleys-0.029
itched-0.029
Token,
Token ID11
Feature activation+6.19e-02

Pos logit contributions
 eater+0.612
becca+0.608
 occasion+0.603
 anniversary+0.600
 virtue+0.590
Neg logit contributions
 sail-0.249
 sails-0.235
He-0.211
 wide-0.195
 gl-0.195
Token Lily
Token ID20037
Feature activation-1.62e-01

Pos logit contributions
 sail+0.097
 sails+0.084
He+0.084
 gl+0.078
 wide+0.074
Neg logit contributions
 virtue-0.110
 occasion-0.100
olding-0.098
bek-0.096
days-0.095
Token.
Token ID13
Feature activation-6.02e-02

Pos logit contributions
 occasion+0.307
 anniversary+0.304
 virtue+0.302
olding+0.300
becca+0.299
Neg logit contributions
 sails-0.199
 sail-0.192
He-0.186
 gl-0.182
 valleys-0.177
Token You
Token ID921
Feature activation-2.44e-01

Pos logit contributions
 eater+0.073
 virtue+0.068
 occasion+0.064
 anniversary+0.063
 ever+0.063
Neg logit contributions
 sails-0.117
 valleys-0.115
 sail-0.113
 widened-0.111
 gl-0.108
Token did
Token ID750
Feature activation-1.10e-01

Pos logit contributions
 eater+0.437
 virtue+0.412
 anniversary+0.400
 occasion+0.399
olding+0.396
Neg logit contributions
 sails-0.354
 sail-0.337
 widened-0.327
 gl-0.324
 valleys-0.323
Token safe
Token ID3338
Feature activation-5.04e-03

Pos logit contributions
 eater+0.064
 virtue+0.060
 anniversary+0.058
 occasion+0.058
olding+0.057
Neg logit contributions
 sails-0.047
 valleys-0.046
 sail-0.045
 widened-0.044
 gl-0.042
Token and
Token ID290
Feature activation-1.78e-02

Pos logit contributions
 eater+0.010
 virtue+0.009
olding+0.009
becca+0.009
 anniversary+0.009
Neg logit contributions
 sails-0.007
 valleys-0.007
 sail-0.006
 gl-0.006
 widened-0.006
Token everyone
Token ID2506
Feature activation-1.81e-01

Pos logit contributions
 eater+0.030
 virtue+0.028
 anniversary+0.027
 occasion+0.027
olding+0.026
Neg logit contributions
 sails-0.027
 valleys-0.026
 sail-0.025
 widened-0.025
 gl-0.024
Token thanked
Token ID26280
Feature activation+4.72e-02

Pos logit contributions
 eater+0.343
 virtue+0.328
 anniversary+0.320
 occasion+0.318
olding+0.314
Neg logit contributions
 sails-0.238
 gl-0.231
 valleys-0.230
 sail-0.230
 widened-0.224
Token the
Token ID262
Feature activation+1.66e-02

Pos logit contributions
 sails+0.044
 sail+0.043
 valleys+0.043
He+0.037
 widened+0.036
Neg logit contributions
 eater-0.110
 virtue-0.109
olding-0.108
 occasion-0.101
 anniversary-0.100
Token metal
Token ID6147
Feature activation+2.65e-01

Pos logit contributions
 sails+0.026
 valleys+0.025
 sail+0.025
 gl+0.022
 wood+0.022
Neg logit contributions
 eater-0.029
 virtue-0.026
 anniversary-0.026
 occasion-0.026
otte-0.025
Token man
Token ID582
Feature activation+2.11e-02

Pos logit contributions
 sails+0.393
 sail+0.372
 gl+0.353
 valleys+0.351
He+0.330
Neg logit contributions
 eater-0.474
 virtue-0.448
 anniversary-0.444
becca-0.437
 occasion-0.433
Token for
Token ID329
Feature activation-1.79e-01

Pos logit contributions
 sail+0.026
 sails+0.025
 valleys+0.024
He+0.023
 gl+0.022
Neg logit contributions
 virtue-0.040
 inconvenience-0.039
becca-0.039
 occasion-0.038
 virus-0.038
Token helping
Token ID5742
Feature activation-7.25e-02

Pos logit contributions
 eater+0.313
 virtue+0.292
olding+0.291
 occasion+0.277
 anniversary+0.275
Neg logit contributions
 sail-0.258
 sails-0.257
 valleys-0.252
 gl-0.245
He-0.229
Token them
Token ID606
Feature activation+3.98e-02

Pos logit contributions
 virtue+0.175
 eater+0.174
olding+0.168
becca+0.162
 inconvenience+0.161
Neg logit contributions
 sail-0.060
 sails-0.056
 valleys-0.051
 gl-0.046
He-0.045
Token.
Token ID13
Feature activation-3.14e-03

Pos logit contributions
 sails+0.063
 sail+0.061
 valleys+0.061
 widened+0.058
 gl+0.057
Neg logit contributions
 eater-0.064
 virtue-0.060
 anniversary-0.057
 occasion-0.057
olding-0.056
Token bucket
Token ID19236
Feature activation+1.62e-01

Pos logit contributions
 eater+0.011
 virtue+0.011
 anniversary+0.011
becca+0.010
 occasion+0.010
Neg logit contributions
 sails-0.009
 sail-0.009
 gl-0.008
 valleys-0.008
 hollow-0.008
Token and
Token ID290
Feature activation-4.09e-02

Pos logit contributions
 sails+0.252
 valleys+0.248
 sail+0.238
 widened+0.237
 gl+0.227
Neg logit contributions
 eater-0.260
 virtue-0.244
 ever-0.235
 anniversary-0.232
 occasion-0.231
Token the
Token ID262
Feature activation-4.74e-03

Pos logit contributions
 eater+0.072
 virtue+0.067
 ever+0.067
 anniversary+0.065
 occasion+0.064
Neg logit contributions
 sails-0.056
 valleys-0.055
 sail-0.052
 widened-0.052
 gl-0.049
Token mole
Token ID9411
Feature activation-2.41e-02

Pos logit contributions
 eater+0.009
 virtue+0.009
becca+0.009
nie+0.009
kids+0.009
Neg logit contributions
 sail-0.006
 sails-0.006
 valleys-0.005
 gl-0.005
 wood-0.005
Token.
Token ID13
Feature activation-3.99e-02

Pos logit contributions
 eater+0.039
 virtue+0.037
 anniversary+0.036
 occasion+0.036
 ever+0.035
Neg logit contributions
 sails-0.037
 valleys-0.036
 sail-0.035
 widened-0.034
 gl-0.033
Token He
Token ID679
Feature activation+1.82e-01

Pos logit contributions
 eater+0.080
 virtue+0.076
 ever+0.074
olding+0.073
 mistake+0.072
Neg logit contributions
 sails-0.045
 valleys-0.044
 sail-0.041
 widened-0.040
 gl-0.039
Token ran
Token ID4966
Feature activation+6.38e-02

Pos logit contributions
 sails+0.252
 valleys+0.247
 sail+0.238
 widened+0.235
 gl+0.225
Neg logit contributions
 eater-0.322
 virtue-0.304
 anniversary-0.292
 ever-0.292
 occasion-0.291
Token to
Token ID284
Feature activation-4.88e-02

Pos logit contributions
 sails+0.099
 valleys+0.098
 sail+0.095
 widened+0.093
 gl+0.090
Neg logit contributions
 eater-0.103
 virtue-0.097
 anniversary-0.093
 occasion-0.093
olding-0.090
Token his
Token ID465
Feature activation+9.11e-02

Pos logit contributions
 eater+0.102
 virtue+0.101
 occasion+0.094
 anniversary+0.094
nie+0.092
Neg logit contributions
 sail-0.057
 sails-0.054
 valleys-0.051
He-0.050
 widened-0.049
Token mom
Token ID1995
Feature activation-2.00e-01

Pos logit contributions
 sails+0.173
 valleys+0.168
 sail+0.167
 widened+0.162
 gl+0.159
Neg logit contributions
 eater-0.118
 virtue-0.110
 occasion-0.104
 anniversary-0.103
 ever-0.102
Token and
Token ID290
Feature activation-4.10e-02

Pos logit contributions
 eater+0.313
 virtue+0.288
 anniversary+0.281
 ever+0.280
 occasion+0.278
Neg logit contributions
 sails-0.320
 valleys-0.315
 sail-0.304
 widened-0.302
 gl-0.291
Token know
Token ID760
Feature activation+1.11e-01

Pos logit contributions
 eater+0.327
 virtue+0.322
becca+0.302
 anniversary+0.296
olding+0.293
Neg logit contributions
 sail-0.264
 sails-0.261
 valleys-0.253
 widened-0.252
 gl-0.247
Token what
Token ID644
Feature activation-7.61e-02

Pos logit contributions
 sails+0.188
 valleys+0.185
 widened+0.180
 sail+0.180
 gl+0.171
Neg logit contributions
 eater-0.160
 virtue-0.150
 ever-0.146
 anniversary-0.143
olding-0.141
Token to
Token ID284
Feature activation-2.98e-02

Pos logit contributions
 eater+0.146
 virtue+0.138
olding+0.134
 anniversary+0.134
 occasion+0.130
Neg logit contributions
 sails-0.098
 gl-0.095
 valleys-0.094
 sail-0.094
He-0.088
Token do
Token ID466
Feature activation-1.97e-02

Pos logit contributions
 eater+0.036
 occasion+0.032
 anniversary+0.032
 virtue+0.032
 ever+0.032
Neg logit contributions
 valleys-0.061
 sails-0.059
 widened-0.056
 sail-0.055
 gl-0.054
Token.
Token ID13
Feature activation-9.80e-03

Pos logit contributions
 eater+0.042
 virtue+0.040
kids+0.039
becca+0.039
ookie+0.038
Neg logit contributions
He-0.020
 gl-0.019
 widened-0.019
 sail-0.018
 sails-0.018
Token He
Token ID679
Feature activation+2.03e-01

Pos logit contributions
 eater+0.026
 virtue+0.024
 anniversary+0.024
 occasion+0.024
 ever+0.024
Neg logit contributions
 valleys-0.006
 sails-0.006
He-0.005
 sail-0.005
 widened-0.005
Token thought
Token ID1807
Feature activation-2.18e-02

Pos logit contributions
 sails+0.293
 valleys+0.291
 sail+0.278
 widened+0.273
 gl+0.261
Neg logit contributions
 eater-0.347
 virtue-0.329
 ever-0.317
 occasion-0.313
 anniversary-0.313
Token maybe
Token ID3863
Feature activation-6.84e-02

Pos logit contributions
 eater+0.039
 virtue+0.037
 anniversary+0.036
 occasion+0.036
olding+0.034
Neg logit contributions
 sails-0.031
 sail-0.030
 valleys-0.029
 gl-0.028
 widened-0.028
Token he
Token ID339
Feature activation+1.12e-01

Pos logit contributions
 eater+0.119
 ever+0.111
 virtue+0.111
 anniversary+0.109
 occasion+0.109
Neg logit contributions
 sails-0.095
 valleys-0.094
 widened-0.092
 sail-0.087
itched-0.084
Token should
Token ID815
Feature activation-3.76e-02

Pos logit contributions
 sails+0.179
 valleys+0.172
 widened+0.170
 sail+0.169
 gl+0.167
Neg logit contributions
 eater-0.179
 virtue-0.167
 anniversary-0.165
 occasion-0.163
olding-0.159
Token keep
Token ID1394
Feature activation+1.72e-02

Pos logit contributions
 eater+0.059
 virtue+0.055
 anniversary+0.053
 occasion+0.052
olding+0.052
Neg logit contributions
 sails-0.060
 sail-0.058
 valleys-0.057
 widened-0.056
 gl-0.055
Token her
Token ID607
Feature activation+1.80e-02

Pos logit contributions
 valleys+0.115
 gl+0.110
 sail+0.107
 widened+0.103
 sails+0.103
Neg logit contributions
 virtue-0.234
 eater-0.229
olding-0.223
 anniversary-0.218
 occasion-0.211
Token happiness
Token ID12157
Feature activation-3.57e-02

Pos logit contributions
 gl+0.025
 sails+0.025
 sail+0.025
 spark+0.023
 wide+0.023
Neg logit contributions
 eater-0.034
 virtue-0.033
olding-0.032
 inconvenience-0.032
 occasion-0.031
Token spread
Token ID4104
Feature activation+8.53e-02

Pos logit contributions
 eater+0.071
 virtue+0.068
 anniversary+0.066
 occasion+0.065
olding+0.065
Neg logit contributions
 sails-0.042
 valleys-0.041
 sail-0.039
 widened-0.039
 gl-0.037
Token like
Token ID588
Feature activation-2.11e-02

Pos logit contributions
 sails+0.112
 valleys+0.109
 sail+0.106
 widened+0.105
 gl+0.104
Neg logit contributions
 eater-0.160
 virtue-0.151
 anniversary-0.147
 occasion-0.145
olding-0.144
Token a
Token ID257
Feature activation+3.24e-02

Pos logit contributions
 eater+0.047
 virtue+0.045
olding+0.044
nie+0.042
 anniversary+0.042
Neg logit contributions
 sails-0.021
 gl-0.021
 valleys-0.021
 sail-0.020
 widened-0.019
Token wave
Token ID6769
Feature activation+2.27e-01

Pos logit contributions
 sail+0.042
 gl+0.039
 sails+0.038
 spark+0.037
 widened+0.036
Neg logit contributions
 eater-0.063
olding-0.061
kids-0.059
 anniversary-0.059
aughter-0.059
Token.
Token ID13
Feature activation-1.70e-02

Pos logit contributions
 sail+0.304
 gl+0.301
 sails+0.297
 widened+0.280
 valleys+0.271
Neg logit contributions
 virtue-0.420
 eater-0.415
becca-0.392
 anniversary-0.388
olding-0.379
Token
Token ID198
Feature activation+6.27e-02

Pos logit contributions
 eater+0.033
 virtue+0.031
 occasion+0.031
 anniversary+0.030
olding+0.030
Neg logit contributions
 sails-0.021
 valleys-0.019
 sail-0.019
 widened-0.019
 gl-0.018
Token
Token ID198
Feature activation+6.38e-02

Pos logit contributions
 sails+0.072
 valleys+0.068
 sail+0.066
 widened+0.065
 gl+0.061
Neg logit contributions
 eater-0.126
 virtue-0.121
 ever-0.118
 occasion-0.118
 anniversary-0.117
TokenNext
Token ID10019
Feature activation+1.17e-02

Pos logit contributions
 valleys+0.060
 sails+0.059
 widened+0.056
 sail+0.055
He+0.055
Neg logit contributions
 eater-0.143
 virtue-0.135
 anniversary-0.131
 ever-0.130
 occasion-0.129
Token,
Token ID11
Feature activation+1.11e-01

Pos logit contributions
 valleys+0.010
 sails+0.010
 gl+0.010
 widened+0.009
 sail+0.009
Neg logit contributions
 eater-0.026
 virtue-0.026
olding-0.025
 anniversary-0.024
 occasion-0.024
Token he
Token ID339
Feature activation+1.09e-01

Pos logit contributions
 eater+0.253
 virtue+0.244
 anniversary+0.235
olding+0.234
 occasion+0.234
Neg logit contributions
 sails-0.110
 valleys-0.107
 sail-0.103
 widened-0.097
 gl-0.094
Token heard
Token ID2982
Feature activation+5.52e-02

Pos logit contributions
 sails+0.141
 valleys+0.137
 sail+0.131
 widened+0.130
 gl+0.122
Neg logit contributions
 eater-0.205
 virtue-0.192
 ever-0.189
 occasion-0.186
 anniversary-0.185
Token a
Token ID257
Feature activation+3.10e-02

Pos logit contributions
 valleys+0.069
 sails+0.068
 widened+0.066
 sail+0.064
 gl+0.061
Neg logit contributions
 eater-0.101
 virtue-0.094
 ever-0.094
 anniversary-0.093
 occasion-0.093
Token sound
Token ID2128
Feature activation-6.21e-03

Pos logit contributions
 sails+0.044
 valleys+0.044
 widened+0.041
 sail+0.040
 gl+0.038
Neg logit contributions
 eater-0.053
 virtue-0.050
 occasion-0.048
 anniversary-0.048
 ever-0.048
Token.
Token ID13
Feature activation-1.53e-02

Pos logit contributions
 eater+0.008
 ever+0.007
 before+0.007
 occasion+0.007
olding+0.007
Neg logit contributions
 valleys-0.012
 sails-0.012
 sail-0.011
 widened-0.011
 hollow-0.010
Token He
Token ID679
Feature activation+1.97e-01

Pos logit contributions
 eater+0.032
 virtue+0.030
olding+0.030
 ever+0.030
 occasion+0.029
Neg logit contributions
 sails-0.015
 valleys-0.015
 widened-0.014
 sail-0.014
 gl-0.013
Token looked
Token ID3114
Feature activation+1.79e-01

Pos logit contributions
 valleys+0.311
 sails+0.296
itched+0.291
 widen+0.276
 afloat+0.276
Neg logit contributions
 ever-0.294
nie-0.274
 kid-0.258
 eater-0.250
 before-0.247
Token up
Token ID510
Feature activation+1.08e-01

Pos logit contributions
 sails+0.261
 valleys+0.257
 sail+0.244
 widened+0.241
itched+0.239
Neg logit contributions
 eater-0.288
 ever-0.268
 virtue-0.266
olding-0.262
nie-0.261
Token and
Token ID290
Feature activation-2.54e-02

Pos logit contributions
 sails+0.216
 valleys+0.209
 widened+0.203
 sail+0.199
itched+0.196
Neg logit contributions
 ever-0.116
 before-0.116
 eater-0.100
 after-0.094
 you-0.094
Token saw
Token ID2497
Feature activation+1.09e-01

Pos logit contributions
 you+0.033
 before+0.033
 ever+0.031
nie+0.030
 eater+0.030
Neg logit contributions
 sails-0.042
itched-0.041
 sail-0.039
 prope-0.037
 valleys-0.037
Token a
Token ID257
Feature activation+3.02e-02

Pos logit contributions
 valleys+0.150
 widened+0.150
 sails+0.148
 sail+0.142
itched+0.138
Neg logit contributions
 eater-0.183
 ever-0.173
 virtue-0.166
 guests-0.164
 students-0.163
Token people
Token ID661
Feature activation-1.29e-01

Pos logit contributions
 eater+0.077
 virtue+0.071
 anniversary+0.068
 occasion+0.067
olding+0.067
Neg logit contributions
 sails-0.067
 valleys-0.065
 sail-0.062
 widened-0.061
 gl-0.059
Token and
Token ID290
Feature activation-4.40e-02

Pos logit contributions
 eater+0.233
 virtue+0.223
 occasion+0.208
becca+0.207
olding+0.202
Neg logit contributions
 sails-0.188
 sail-0.180
 valleys-0.178
 gl-0.166
 widened-0.160
Token things
Token ID1243
Feature activation-1.02e-01

Pos logit contributions
 eater+0.079
 virtue+0.075
olding+0.071
 occasion+0.070
 anniversary+0.069
Neg logit contributions
 sails-0.065
 valleys-0.063
 sail-0.062
 widened-0.058
 gl-0.057
Token,"
Token ID553
Feature activation+6.24e-02

Pos logit contributions
 eater+0.150
 virtue+0.141
 anniversary+0.135
 ever+0.134
 occasion+0.134
Neg logit contributions
 sails-0.170
 valleys-0.167
 sail-0.162
 widened-0.161
 gl-0.155
Token mom
Token ID1995
Feature activation-1.96e-01

Pos logit contributions
 sails+0.107
 sail+0.106
 gl+0.102
He+0.102
 widened+0.100
Neg logit contributions
 virtue-0.090
 eater-0.088
 occasion-0.081
olding-0.081
 anniversary-0.079
Token said
Token ID531
Feature activation+7.14e-02

Pos logit contributions
 eater+0.331
 virtue+0.307
olding+0.302
 ever+0.302
 occasion+0.300
Neg logit contributions
 sails-0.294
 valleys-0.293
 sail-0.281
 widened-0.267
 gl-0.256
Token.
Token ID13
Feature activation-3.18e-02

Pos logit contributions
 sails+0.088
 widened+0.084
 sail+0.084
 valleys+0.083
 gl+0.080
Neg logit contributions
 eater-0.136
 virtue-0.134
 occasion-0.128
 anniversary-0.125
becca-0.121
Token
Token ID198
Feature activation+5.64e-02

Pos logit contributions
 eater+0.049
olding+0.047
 tutor+0.046
 virtue+0.045
 before+0.044
Neg logit contributions
 sails-0.051
 sail-0.048
 valleys-0.048
 widened-0.046
 gl-0.045
Token
Token ID198
Feature activation+5.61e-02

Pos logit contributions
 sails+0.090
 valleys+0.085
 widened+0.083
 gl+0.083
 sail+0.082
Neg logit contributions
 eater-0.085
 mistake-0.085
 virtue-0.083
olding-0.083
 you-0.079
Token"
Token ID1
Feature activation-6.52e-02

Pos logit contributions
 sails+0.105
 valleys+0.099
 sail+0.099
 widened+0.099
 gl+0.098
Neg logit contributions
 eater-0.075
 virtue-0.071
olding-0.068
 anniversary-0.066
 mistake-0.066
TokenWow
Token ID22017
Feature activation-3.11e-02

Pos logit contributions
 eater+0.088
 virtue+0.083
 occasion+0.081
 anniversary+0.080
 ever+0.079
Neg logit contributions
 sails-0.117
 valleys-0.114
 sail-0.111
 widened-0.111
itched-0.109
Token and
Token ID290
Feature activation-2.84e-02

Pos logit contributions
 virtue+0.044
 eater+0.043
 occasion+0.043
 anniversary+0.042
 inconvenience+0.041
Neg logit contributions
 sails-0.034
 sail-0.032
He-0.032
 gl-0.032
 valleys-0.032
Token he
Token ID339
Feature activation+1.07e-01

Pos logit contributions
 eater+0.059
 virtue+0.056
 ever+0.054
 anniversary+0.054
 occasion+0.054
Neg logit contributions
 sails-0.031
 valleys-0.030
 widened-0.028
 sail-0.028
 gl-0.026
Token lived
Token ID5615
Feature activation-1.81e-02

Pos logit contributions
 sails+0.148
 valleys+0.142
 sail+0.141
 gl+0.141
 widened+0.139
Neg logit contributions
 eater-0.190
 virtue-0.184
 anniversary-0.180
 occasion-0.178
olding-0.177
Token in
Token ID287
Feature activation-4.70e-02

Pos logit contributions
 eater+0.031
 virtue+0.031
 anniversary+0.030
 occasion+0.030
 behaviour+0.028
Neg logit contributions
 sails-0.025
 valleys-0.025
 sail-0.024
 gl-0.024
He-0.023
Token the
Token ID262
Feature activation+1.50e-04

Pos logit contributions
 eater+0.093
 virtue+0.088
 ever+0.086
 anniversary+0.085
 occasion+0.085
Neg logit contributions
 sails-0.056
 valleys-0.054
 widened-0.052
 sail-0.052
 gl-0.049
Token garden
Token ID11376
Feature activation+6.32e-02

Pos logit contributions
 sails+0.000
 widened+0.000
 valleys+0.000
 sail+0.000
itched+0.000
Neg logit contributions
 eater-0.000
 virtue-0.000
 occasion-0.000
 anniversary-0.000
 ever-0.000
Token.
Token ID13
Feature activation-2.62e-02

Pos logit contributions
 sails+0.098
 valleys+0.096
 sail+0.094
 widened+0.091
 gl+0.090
Neg logit contributions
 eater-0.101
 virtue-0.097
 anniversary-0.092
 occasion-0.092
olding-0.090
Token Every
Token ID3887
Feature activation+5.01e-02

Pos logit contributions
 eater+0.046
 ever+0.042
 occasion+0.041
 virtue+0.041
 hour+0.040
Neg logit contributions
 sails-0.036
 widened-0.034
 valleys-0.034
 sail-0.033
 gl-0.033
Token day
Token ID1110
Feature activation-1.45e-01

Pos logit contributions
 sail+0.104
 gl+0.102
 sails+0.101
 valleys+0.098
He+0.096
Neg logit contributions
olding-0.050
 eater-0.050
 virtue-0.049
 anniversary-0.049
 principal-0.047
Token he
Token ID339
Feature activation+1.07e-01

Pos logit contributions
 eater+0.259
 ever+0.251
 virtue+0.236
 occasion+0.231
 anniversary+0.229
Neg logit contributions
 sails-0.198
 valleys-0.190
 widened-0.185
 sail-0.183
 gl-0.174
Token would
Token ID561
Feature activation-5.81e-02

Pos logit contributions
 sails+0.175
 valleys+0.168
 sail+0.167
 gl+0.164
 widened+0.164
Neg logit contributions
 virtue-0.166
 eater-0.166
olding-0.158
 anniversary-0.157
 occasion-0.155
Tokenara
Token ID3301
Feature activation-8.73e-02

Pos logit contributions
 eater+0.144
 virtue+0.143
 ever+0.136
 occasion+0.135
nie+0.131
Neg logit contributions
 sails-0.154
itched-0.152
 widened-0.149
 valleys-0.146
 gl-0.141
Token and
Token ID290
Feature activation-3.81e-02

Pos logit contributions
 eater+0.146
 virtue+0.139
 anniversary+0.133
 occasion+0.132
 ever+0.131
Neg logit contributions
 sails-0.130
 valleys-0.127
 sail-0.123
 widened-0.122
 gl-0.118
Token Ben
Token ID3932
Feature activation-4.93e-02

Pos logit contributions
 virtue+0.060
 eater+0.055
 occasion+0.055
olding+0.053
 anniversary+0.052
Neg logit contributions
 sails-0.064
 sail-0.063
 valleys-0.061
 widened-0.059
 gl-0.058
Token smile
Token ID8212
Feature activation+1.35e-01

Pos logit contributions
 eater+0.072
 virtue+0.065
 ever+0.064
 anniversary+0.064
olding+0.063
Neg logit contributions
 valleys-0.083
 sails-0.081
 sail-0.076
 widened-0.075
itched-0.075
Token.
Token ID13
Feature activation-2.68e-02

Pos logit contributions
 valleys+0.261
 sails+0.255
 sail+0.239
 hollow+0.234
 widened+0.231
Neg logit contributions
 eater-0.161
 ever-0.150
 virtue-0.144
olding-0.140
 mistake-0.139
Token They
Token ID1119
Feature activation+6.12e-02

Pos logit contributions
 eater+0.054
 virtue+0.051
 anniversary+0.050
 occasion+0.049
 ever+0.049
Neg logit contributions
 sails-0.031
 valleys-0.030
 sail-0.029
 widened-0.029
 gl-0.027
Token say
Token ID910
Feature activation+9.82e-02

Pos logit contributions
 sails+0.105
 valleys+0.105
 sail+0.099
 widened+0.099
 gl+0.096
Neg logit contributions
 eater-0.088
 virtue-0.082
 anniversary-0.077
 ever-0.077
 occasion-0.077
Token,
Token ID11
Feature activation+8.80e-02

Pos logit contributions
 valleys+0.125
 sails+0.125
 widened+0.117
 sail+0.115
 gl+0.110
Neg logit contributions
 eater-0.186
 virtue-0.175
 ever-0.171
 anniversary-0.170
olding-0.167
Token "
Token ID366
Feature activation-7.48e-02

Pos logit contributions
 sails+0.120
 valleys+0.117
 sail+0.113
 widened+0.112
 gl+0.107
Neg logit contributions
 eater-0.158
 virtue-0.150
 anniversary-0.144
 occasion-0.144
 ever-0.143
TokenThat
Token ID2504
Feature activation-8.92e-02

Pos logit contributions
 eater+0.094
 virtue+0.087
 anniversary+0.082
 occasion+0.082
 ever+0.081
Neg logit contributions
 sails-0.144
 valleys-0.142
 sail-0.138
 widened-0.136
 gl-0.133
Token sounds
Token ID5238
Feature activation+7.91e-02

Pos logit contributions
 eater+0.160
 virtue+0.151
 anniversary+0.146
 occasion+0.145
olding+0.145
Neg logit contributions
 sails-0.123
 valleys-0.119
 sail-0.116
 widened-0.114
 gl-0.111
Token laughed
Token ID13818
Feature activation+2.74e-02

Pos logit contributions
 eater+0.045
 ever+0.042
 virtue+0.040
olding+0.040
 guests+0.040
Neg logit contributions
 valleys-0.047
 sails-0.046
 sail-0.044
 widened-0.042
 hollow-0.040
Token and
Token ID290
Feature activation-3.64e-02

Pos logit contributions
 valleys+0.050
 sails+0.050
 sail+0.046
 widened+0.045
 hollow+0.045
Neg logit contributions
 eater-0.037
 ever-0.033
 before-0.032
 virtue-0.031
olding-0.031
Token hugged
Token ID41130
Feature activation+8.69e-03

Pos logit contributions
 eater+0.053
 ever+0.049
 virtue+0.048
 guests+0.047
olding+0.047
Neg logit contributions
 valleys-0.060
 sails-0.060
 sail-0.057
 widened-0.055
itched-0.054
Token them
Token ID606
Feature activation+2.70e-02

Pos logit contributions
 sails+0.008
 valleys+0.008
 widened+0.007
 sail+0.007
He+0.007
Neg logit contributions
 virtue-0.020
 eater-0.019
olding-0.018
 occasion-0.018
 anniversary-0.018
Token.
Token ID13
Feature activation-2.29e-02

Pos logit contributions
 sail+0.030
 sails+0.028
 wide+0.028
He+0.027
 widened+0.027
Neg logit contributions
 virtue-0.057
 inconvenience-0.056
 occasion-0.056
 eater-0.055
 anniversary-0.054
Token
Token ID198
Feature activation+6.82e-02

Pos logit contributions
 eater+0.037
olding+0.034
 tutor+0.034
 Nurse+0.034
 virtue+0.034
Neg logit contributions
 sails-0.035
 valleys-0.032
 sail-0.032
 soared-0.031
 gl-0.031
Token
Token ID198
Feature activation+6.80e-02

Pos logit contributions
 sails+0.110
 valleys+0.103
 gl+0.098
 sail+0.098
 widened+0.097
Neg logit contributions
 eater-0.103
olding-0.099
 virtue-0.098
 ever-0.097
 you-0.097
Token"
Token ID1
Feature activation-5.45e-02

Pos logit contributions
 sails+0.120
 valleys+0.113
 sail+0.110
 gl+0.109
 widened+0.108
Neg logit contributions
 eater-0.098
 virtue-0.093
olding-0.092
 anniversary-0.089
 occasion-0.089
TokenOf
Token ID5189
Feature activation-2.06e-01

Pos logit contributions
 eater+0.092
 virtue+0.088
 anniversary+0.083
 ever+0.083
 occasion+0.082
Neg logit contributions
 sails-0.081
 valleys-0.079
 widened-0.077
 sail-0.077
 gl-0.074
Token course
Token ID1781
Feature activation-1.16e-01

Pos logit contributions
 eater+0.354
 virtue+0.333
 anniversary+0.315
olding+0.313
 occasion+0.307
Neg logit contributions
 sails-0.297
 valleys-0.289
 widened-0.282
 gl-0.278
 sail-0.278
Token,
Token ID11
Feature activation+9.01e-02

Pos logit contributions
 eater+0.330
 virtue+0.326
 anniversary+0.310
 occasion+0.310
 principal+0.304
Neg logit contributions
 sail-0.040
 gl-0.036
 bent-0.031
 sails-0.029
 wide-0.029
Token a
Token ID257
Feature activation+2.29e-02

Pos logit contributions
 eater+0.059
 virtue+0.057
 anniversary+0.055
 occasion+0.055
olding+0.055
Neg logit contributions
 sails-0.028
 valleys-0.027
 sail-0.026
 widened-0.025
 gl-0.024
Token time
Token ID640
Feature activation-1.24e-01

Pos logit contributions
 sails+0.041
 widened+0.040
 valleys+0.040
 sail+0.040
He+0.039
Neg logit contributions
 eater-0.029
 virtue-0.027
 anniversary-0.026
 occasion-0.026
olding-0.025
Token,
Token ID11
Feature activation+8.82e-02

Pos logit contributions
 eater+0.229
 ever+0.218
 virtue+0.216
olding+0.210
 anniversary+0.210
Neg logit contributions
 sails-0.165
 valleys-0.159
 sail-0.151
 widened-0.151
 gl-0.141
Token there
Token ID612
Feature activation+3.54e-02

Pos logit contributions
 sails+0.149
 valleys+0.140
 widened+0.136
 sail+0.133
 gl+0.125
Neg logit contributions
 eater-0.134
 ever-0.130
nie-0.121
 virtue-0.120
olding-0.119
Token was
Token ID373
Feature activation-9.32e-04

Pos logit contributions
 sails+0.056
 valleys+0.055
 sail+0.053
 widened+0.052
 gl+0.050
Neg logit contributions
 eater-0.056
 virtue-0.053
 ever-0.051
 occasion-0.050
 anniversary-0.050
Token a
Token ID257
Feature activation+2.30e-02

Pos logit contributions
 eater+0.002
 ever+0.002
 anniversary+0.002
 occasion+0.001
 virtue+0.001
Neg logit contributions
 sails-0.001
 valleys-0.001
 widened-0.001
 sail-0.001
 gl-0.001
Token sweet
Token ID6029
Feature activation-4.50e-02

Pos logit contributions
 sail+0.036
 sails+0.035
 valleys+0.034
 gl+0.033
 widened+0.033
Neg logit contributions
 virtue-0.036
 eater-0.036
 occasion-0.034
olding-0.034
 anniversary-0.034
Token dog
Token ID3290
Feature activation-1.29e-01

Pos logit contributions
 virtue+0.082
olding+0.073
 inconvenience+0.070
 ever+0.069
 homework+0.068
Neg logit contributions
 sail-0.067
 sails-0.062
 gl-0.062
 spark-0.058
 wood-0.055
Token named
Token ID3706
Feature activation-9.95e-02

Pos logit contributions
 virtue+0.266
 eater+0.253
 behaviour+0.249
 occasion+0.242
 anniversary+0.240
Neg logit contributions
 sail-0.144
 sails-0.138
 gl-0.136
He-0.129
 valleys-0.126
Token Spot
Token ID15899
Feature activation-9.48e-02

Pos logit contributions
 eater+0.155
 virtue+0.153
olding+0.139
 anniversary+0.138
 occasion+0.138
Neg logit contributions
 sails-0.159
 sail-0.159
 valleys-0.155
 widened-0.149
He-0.149
Token.
Token ID13
Feature activation-3.00e-02

Pos logit contributions
 eater+0.146
 virtue+0.138
 anniversary+0.131
 occasion+0.131
 ever+0.129
Neg logit contributions
 sails-0.154
 valleys-0.151
 sail-0.147
 widened-0.145
 gl-0.141
Token fast
Token ID3049
Feature activation+1.95e-02

Pos logit contributions
 sails+0.153
 valleys+0.148
 widened+0.144
 sail+0.140
itched+0.129
Neg logit contributions
 ever-0.146
 before-0.140
 eater-0.138
 virtue-0.134
olding-0.125
Token asleep
Token ID16039
Feature activation+5.48e-02

Pos logit contributions
 sails+0.027
 valleys+0.027
 sail+0.026
 widened+0.026
 gl+0.025
Neg logit contributions
 eater-0.035
 virtue-0.033
 anniversary-0.032
 occasion-0.032
olding-0.031
Token.
Token ID13
Feature activation-6.32e-03

Pos logit contributions
 valleys+0.073
 sail+0.072
 gl+0.071
 widened+0.071
 sails+0.071
Neg logit contributions
 eater-0.098
 virtue-0.095
 anniversary-0.094
becca-0.091
olding-0.091
Token Suddenly
Token ID24975
Feature activation+1.28e-01

Pos logit contributions
 eater+0.013
 virtue+0.012
 anniversary+0.012
 occasion+0.012
 ever+0.012
Neg logit contributions
 sails-0.007
 valleys-0.007
 sail-0.007
 widened-0.007
 gl-0.006
Token,
Token ID11
Feature activation+1.10e-01

Pos logit contributions
 sails+0.231
 widened+0.225
 valleys+0.221
 widen+0.220
 sail+0.212
Neg logit contributions
 ever-0.159
 before-0.154
 you-0.147
 eater-0.133
 guests-0.128
Token the
Token ID262
Feature activation+6.30e-03

Pos logit contributions
 sails+0.149
 valleys+0.144
 widened+0.139
 sail+0.137
 gl+0.131
Neg logit contributions
 eater-0.196
 virtue-0.184
 ever-0.181
 occasion-0.177
 anniversary-0.176
Token wind
Token ID2344
Feature activation+2.36e-02

Pos logit contributions
 sails+0.008
 valleys+0.008
 sail+0.008
 gl+0.007
 widened+0.007
Neg logit contributions
 eater-0.012
 virtue-0.012
 anniversary-0.011
olding-0.011
 occasion-0.011
Token picked
Token ID6497
Feature activation+1.25e-01

Pos logit contributions
 valleys+0.032
 sails+0.031
 sail+0.029
 widened+0.028
 hollow+0.027
Neg logit contributions
 eater-0.041
 ever-0.041
 before-0.038
 virtue-0.037
nie-0.036
Token up
Token ID510
Feature activation+1.09e-01

Pos logit contributions
 sails+0.186
 widened+0.183
 valleys+0.183
 sail+0.177
 hollow+0.167
Neg logit contributions
 eater-0.194
 ever-0.191
 before-0.182
 virtue-0.180
 you-0.174
Token and
Token ID290
Feature activation-1.80e-02

Pos logit contributions
 sails+0.140
 valleys+0.135
 sail+0.131
 widened+0.129
 gl+0.127
Neg logit contributions
 eater-0.210
 virtue-0.203
 anniversary-0.196
 occasion-0.196
olding-0.192
Token sway
Token ID20009
Feature activation+2.19e-01

Pos logit contributions
 eater+0.033
 ever+0.032
 before+0.031
 virtue+0.031
 mistake+0.030
Neg logit contributions
 sails-0.022
 valleys-0.021
 widened-0.020
itched-0.019
 sail-0.019
Token's
Token ID338
Feature activation-2.95e-02

Pos logit contributions
 eater+0.245
 virtue+0.239
 occasion+0.225
olding+0.225
 anniversary+0.224
Neg logit contributions
 sails-0.191
 valleys-0.181
 sail-0.181
 gl-0.173
 widened-0.173
Token go
Token ID467
Feature activation+4.98e-02

Pos logit contributions
 eater+0.056
 virtue+0.054
olding+0.051
 occasion+0.051
 anniversary+0.051
Neg logit contributions
 sails-0.038
 sail-0.036
 valleys-0.036
 widened-0.034
 gl-0.034
Token,"
Token ID553
Feature activation+6.03e-02

Pos logit contributions
 sails+0.088
 valleys+0.086
 sail+0.084
 widened+0.083
 gl+0.081
Neg logit contributions
 eater-0.070
 virtue-0.065
 anniversary-0.062
 occasion-0.062
 ever-0.061
Token Mom
Token ID11254
Feature activation-1.56e-01

Pos logit contributions
 sail+0.105
 sails+0.102
 gl+0.099
He+0.099
His+0.096
Neg logit contributions
 virtue-0.092
 eater-0.083
olding-0.082
 behaviour-0.078
 anniversary-0.078
Token said
Token ID531
Feature activation+6.51e-02

Pos logit contributions
 eater+0.298
 virtue+0.297
 anniversary+0.280
 occasion+0.275
 ever+0.271
Neg logit contributions
 sails-0.190
 widened-0.179
 sail-0.178
 valleys-0.177
 gl-0.175
Token.
Token ID13
Feature activation-3.95e-02

Pos logit contributions
 sails+0.089
 sail+0.086
 widened+0.084
 valleys+0.083
 gl+0.082
Neg logit contributions
 virtue-0.114
 eater-0.114
 occasion-0.106
 anniversary-0.106
becca-0.103
Token She
Token ID1375
Feature activation+1.26e-01

Pos logit contributions
 eater+0.060
olding+0.056
 virtue+0.054
 tutor+0.053
 before+0.053
Neg logit contributions
 sails-0.065
 sail-0.061
 valleys-0.061
 widened-0.059
 gl-0.058
Token took
Token ID1718
Feature activation+1.44e-03

Pos logit contributions
 sails+0.175
 valleys+0.175
 sail+0.165
 widened+0.163
 gl+0.155
Neg logit contributions
 eater-0.221
 virtue-0.207
 ever-0.201
 occasion-0.199
 anniversary-0.198
Token Ben
Token ID3932
Feature activation-5.51e-02

Pos logit contributions
 sails+0.002
 valleys+0.002
 sail+0.002
 widened+0.002
 gl+0.002
Neg logit contributions
 eater-0.003
 virtue-0.003
olding-0.002
 occasion-0.002
 anniversary-0.002
Token's
Token ID338
Feature activation-2.78e-02

Pos logit contributions
 eater+0.098
 virtue+0.096
 occasion+0.091
 anniversary+0.088
 inconvenience+0.088
Neg logit contributions
 sails-0.077
 sail-0.074
 valleys-0.073
 gl-0.069
He-0.067
Token hand
Token ID1021
Feature activation-9.79e-03

Pos logit contributions
 virtue+0.047
 eater+0.046
 occasion+0.042
 ever+0.041
olding+0.041
Neg logit contributions
 sails-0.044
 sail-0.042
 valleys-0.040
 gl-0.039
 hood-0.037
Token her
Token ID607
Feature activation+2.49e-02

Pos logit contributions
 eater+0.146
 virtue+0.143
olding+0.137
 occasion+0.134
 anniversary+0.133
Neg logit contributions
 sails-0.080
 valleys-0.078
 sail-0.076
 gl-0.076
 widened-0.074
Token mom
Token ID1995
Feature activation-1.74e-01

Pos logit contributions
 sail+0.041
 gl+0.040
 sails+0.039
 spark+0.038
 wide+0.037
Neg logit contributions
 eater-0.041
 virtue-0.040
nie-0.037
 inconvenience-0.037
olding-0.037
Token and
Token ID290
Feature activation-1.69e-02

Pos logit contributions
 virtue+0.389
 inconvenience+0.387
 eater+0.360
 occasion+0.359
oker+0.358
Neg logit contributions
 gl-0.166
 sail-0.153
 sails-0.152
 wide-0.151
 valleys-0.150
Token she
Token ID673
Feature activation+5.91e-02

Pos logit contributions
 virtue+0.031
 eater+0.030
olding+0.029
 occasion+0.028
 inconvenience+0.027
Neg logit contributions
 sail-0.024
 sails-0.023
 gl-0.022
 valleys-0.022
 bent-0.021
Token couldn
Token ID3521
Feature activation-1.42e-01

Pos logit contributions
 sails+0.080
 gl+0.077
 valleys+0.077
 sail+0.076
 widened+0.073
Neg logit contributions
 eater-0.110
 virtue-0.105
 anniversary-0.104
 occasion-0.103
olding-0.102
Token't
Token ID470
Feature activation-1.26e-01

Pos logit contributions
 eater+0.163
 virtue+0.150
 anniversary+0.141
 occasion+0.140
 ever+0.138
Neg logit contributions
 sails-0.285
 valleys-0.280
 sail-0.274
 widened-0.272
 gl-0.265
Token wait
Token ID4043
Feature activation-7.47e-02

Pos logit contributions
 eater+0.141
 virtue+0.138
olding+0.121
 guests+0.120
 anniversary+0.117
Neg logit contributions
 sail-0.265
 sails-0.253
 widened-0.247
 valleys-0.239
 gl-0.238
Token to
Token ID284
Feature activation-2.85e-02

Pos logit contributions
 eater+0.112
 virtue+0.107
 anniversary+0.098
 occasion+0.097
olding+0.097
Neg logit contributions
 sails-0.126
 valleys-0.125
 sail-0.124
 widened-0.118
 gl-0.118
Token do
Token ID466
Feature activation-1.95e-02

Pos logit contributions
 eater+0.045
 ever+0.042
 virtue+0.042
 anniversary+0.041
 occasion+0.041
Neg logit contributions
 sails-0.044
 valleys-0.044
 widened-0.043
 sail-0.040
 gl-0.040
Token it
Token ID340
Feature activation+2.15e-02

Pos logit contributions
 eater+0.037
 virtue+0.035
 anniversary+0.033
 occasion+0.033
olding+0.033
Neg logit contributions
 sails-0.026
 valleys-0.025
 sail-0.025
 widened-0.024
 gl-0.024
Token again
Token ID757
Feature activation-1.09e-01

Pos logit contributions
 sails+0.035
 valleys+0.035
 sail+0.035
 widened+0.034
 gl+0.034
Neg logit contributions
 eater-0.033
 virtue-0.031
 anniversary-0.030
olding-0.029
 occasion-0.029
Token plate
Token ID7480
Feature activation-9.31e-02

Pos logit contributions
isk+0.007
days+0.007
 virtue+0.007
olding+0.007
nes+0.006
Neg logit contributions
 wide-0.006
 sails-0.006
 gl-0.006
 shiny-0.006
 spark-0.006
Token under
Token ID739
Feature activation-8.26e-02

Pos logit contributions
 eater+0.173
 virtue+0.166
 anniversary+0.162
 occasion+0.162
kids+0.160
Neg logit contributions
 sails-0.120
 sail-0.117
 valleys-0.116
 gl-0.114
 widened-0.112
Token their
Token ID511
Feature activation+9.41e-02

Pos logit contributions
 eater+0.157
 virtue+0.149
 ever+0.145
 anniversary+0.145
 occasion+0.144
Neg logit contributions
 sails-0.103
 valleys-0.100
 sail-0.096
 widened-0.095
 gl-0.091
Token mom
Token ID1995
Feature activation-1.92e-01

Pos logit contributions
 sails+0.148
 sail+0.142
 valleys+0.137
 gl+0.133
 hood+0.127
Neg logit contributions
 virtue-0.157
 eater-0.152
olding-0.144
 inconvenience-0.139
 occasion-0.138
Token's
Token ID338
Feature activation-2.06e-02

Pos logit contributions
 eater+0.330
 virtue+0.327
 occasion+0.298
olding+0.294
 anniversary+0.293
Neg logit contributions
 sails-0.278
 valleys-0.268
 sail-0.263
 widened-0.259
 gl-0.256
Token bed
Token ID3996
Feature activation-5.65e-03

Pos logit contributions
 virtue+0.032
 eater+0.031
 inconvenience+0.029
olding+0.029
 occasion+0.029
Neg logit contributions
 sails-0.034
 sail-0.033
 valleys-0.032
 gl-0.031
 hood-0.030
Token and
Token ID290
Feature activation-3.28e-02

Pos logit contributions
 eater+0.011
 virtue+0.011
rav+0.010
 anniversary+0.010
 inconvenience+0.010
Neg logit contributions
 sails-0.006
 widened-0.006
 gl-0.006
 valleys-0.006
 sail-0.006
Token waited
Token ID13488
Feature activation-5.36e-02

Pos logit contributions
 virtue+0.067
 eater+0.066
olding+0.064
 occasion+0.063
 hour+0.060
Neg logit contributions
 sails-0.039
 gl-0.038
 sail-0.038
 widened-0.037
 bent-0.035
Token for
Token ID329
Feature activation-1.90e-01

Pos logit contributions
 eater+0.072
 virtue+0.067
 anniversary+0.062
 occasion+0.061
becca+0.059
Neg logit contributions
 sails-0.100
 sail-0.098
 widened-0.096
 valleys-0.096
 gl-0.095
Token her
Token ID607
Feature activation+1.01e-02

Pos logit contributions
 eater+0.318
 virtue+0.302
 ever+0.295
 anniversary+0.294
 occasion+0.292
Neg logit contributions
 sails-0.277
 valleys-0.273
 widened-0.261
 sail-0.259
 gl-0.250
Token to
Token ID284
Feature activation-4.04e-02

Pos logit contributions
 sail+0.011
 sails+0.011
 gl+0.010
 glitter+0.010
 wide+0.010
Neg logit contributions
 virtue-0.022
becca-0.021
core-0.020
 eater-0.020
nie-0.020
Token smiled
Token ID13541
Feature activation+1.02e-01

Pos logit contributions
 sails+0.027
 valleys+0.026
 widened+0.025
 sail+0.025
 gl+0.024
Neg logit contributions
 eater-0.037
 virtue-0.036
 anniversary-0.034
 occasion-0.034
 ever-0.033
Token and
Token ID290
Feature activation-4.45e-03

Pos logit contributions
 valleys+0.200
 sails+0.190
 soared+0.180
 sail+0.177
aved+0.175
Neg logit contributions
 eater-0.128
 before-0.111
 ever-0.110
olding-0.107
 tutor-0.107
Token said
Token ID531
Feature activation+1.08e-01

Pos logit contributions
 eater+0.007
 ever+0.006
 virtue+0.006
olding+0.006
becca+0.006
Neg logit contributions
 valleys-0.007
 sails-0.007
 sail-0.007
 widened-0.007
itched-0.006
Token,
Token ID11
Feature activation+1.22e-01

Pos logit contributions
 sails+0.141
 valleys+0.140
 sail+0.131
 widened+0.128
 gl+0.122
Neg logit contributions
 eater-0.201
 virtue-0.190
 ever-0.187
olding-0.183
 anniversary-0.182
Token "
Token ID366
Feature activation-4.33e-02

Pos logit contributions
 sails+0.160
 valleys+0.156
 sail+0.150
 widened+0.148
 gl+0.142
Neg logit contributions
 eater-0.224
 virtue-0.213
 anniversary-0.205
 occasion-0.204
 ever-0.204
TokenThat
Token ID2504
Feature activation-7.10e-02

Pos logit contributions
 eater+0.072
 virtue+0.067
 anniversary+0.067
 mistake+0.066
 exam+0.065
Neg logit contributions
 valleys-0.064
He-0.064
 sails-0.063
 widened-0.060
 sail-0.059
Token's
Token ID338
Feature activation+3.49e-03

Pos logit contributions
 eater+0.127
 virtue+0.118
 anniversary+0.118
olding+0.117
 occasion+0.115
Neg logit contributions
 sails-0.103
 sail-0.097
 gl-0.094
 valleys-0.093
 widened-0.090
Token my
Token ID616
Feature activation-7.62e-02

Pos logit contributions
 sails+0.005
 sail+0.005
 valleys+0.005
He+0.005
 gl+0.005
Neg logit contributions
 eater-0.006
 occasion-0.005
 anniversary-0.005
olding-0.005
 virtue-0.005
Token good
Token ID922
Feature activation-8.45e-02

Pos logit contributions
 eater+0.123
otte+0.119
you+0.116
nes+0.114
becca+0.114
Neg logit contributions
 sail-0.121
 sails-0.119
 gl-0.109
 wood-0.104
 spark-0.104
Token,
Token ID11
Feature activation+1.09e-01

Pos logit contributions
 virtue+0.107
 eater+0.105
becca+0.104
olding+0.101
 anniversary+0.101
Neg logit contributions
 sail-0.152
 sails-0.150
 gl-0.135
He-0.130
 wood-0.130
Token obedient
Token ID41652
Feature activation-1.06e-01

Pos logit contributions
He+0.146
 sails+0.139
 sail+0.130
 hollow+0.128
 He+0.125
Neg logit contributions
 eater-0.198
 occasion-0.182
 anniversary-0.182
ital-0.179
 virtue-0.178
Token One
Token ID1881
Feature activation+1.17e-01

Pos logit contributions
 eater+0.060
 virtue+0.057
 occasion+0.056
 anniversary+0.055
olding+0.055
Neg logit contributions
 sails-0.030
 valleys-0.029
 sail-0.028
 widened-0.027
 gl-0.025
Token day
Token ID1110
Feature activation-1.48e-01

Pos logit contributions
 sail+0.230
 gl+0.222
He+0.219
 wide+0.218
His+0.214
Neg logit contributions
 virtue-0.137
 eater-0.124
 inconvenience-0.120
 anniversary-0.120
otte-0.120
Token,
Token ID11
Feature activation+9.33e-02

Pos logit contributions
 eater+0.275
 ever+0.256
 virtue+0.256
 occasion+0.249
 anniversary+0.249
Neg logit contributions
 sails-0.200
 valleys-0.195
 sail-0.183
 widened-0.181
 gl-0.172
Token Lily
Token ID20037
Feature activation-1.29e-01

Pos logit contributions
 sail+0.116
 widened+0.115
 gl+0.111
 bent+0.107
He+0.106
Neg logit contributions
 virtue-0.186
 eater-0.171
olding-0.167
 anniversary-0.166
 behaviour-0.165
Token's
Token ID338
Feature activation-2.27e-02

Pos logit contributions
 eater+0.235
 virtue+0.231
 anniversary+0.219
olding+0.218
 occasion+0.218
Neg logit contributions
 sails-0.171
 valleys-0.163
 sail-0.161
 widened-0.157
 gl-0.154
Token mom
Token ID1995
Feature activation-1.84e-01

Pos logit contributions
 virtue+0.031
 eater+0.031
 occasion+0.028
 inconvenience+0.028
 ever+0.028
Neg logit contributions
 sails-0.040
 sail-0.039
 gl-0.038
 valleys-0.036
 wood-0.036
Token asked
Token ID1965
Feature activation+1.16e-01

Pos logit contributions
 virtue+0.336
 eater+0.333
 anniversary+0.314
 occasion+0.309
olding+0.305
Neg logit contributions
 sails-0.238
 gl-0.230
 sail-0.227
 widened-0.223
 bent-0.221
Token her
Token ID607
Feature activation+1.30e-02

Pos logit contributions
 bent+0.048
 widened+0.044
 valleys+0.042
He+0.039
 wide+0.039
Neg logit contributions
 virtue-0.312
 eater-0.279
 inconvenience-0.273
oker-0.273
becca-0.273
Token to
Token ID284
Feature activation-3.81e-02

Pos logit contributions
 sail+0.017
 sails+0.017
 gl+0.016
He+0.016
 bent+0.015
Neg logit contributions
 virtue-0.022
 eater-0.022
 occasion-0.020
kids-0.020
ito-0.020
Token stay
Token ID2652
Feature activation+2.01e-02

Pos logit contributions
 eater+0.062
 virtue+0.061
olding+0.056
 anniversary+0.055
 occasion+0.054
Neg logit contributions
 sail-0.058
 sails-0.058
 widened-0.053
 gl-0.053
 valleys-0.053
Token in
Token ID287
Feature activation-4.93e-02

Pos logit contributions
 sails+0.032
 valleys+0.031
 widened+0.030
 sail+0.030
 gl+0.029
Neg logit contributions
 eater-0.032
 virtue-0.030
 anniversary-0.029
 occasion-0.029
 ever-0.029
Token them
Token ID606
Feature activation+1.82e-02

Pos logit contributions
olding+0.190
 virtue+0.188
 eater+0.180
 occasion+0.180
bek+0.180
Neg logit contributions
 sails-0.127
 wood-0.116
 sail-0.114
 valleys-0.114
 glitter-0.104
Token.
Token ID13
Feature activation-4.54e-02

Pos logit contributions
 wide+0.015
He+0.015
 sails+0.014
 sail+0.014
 widened+0.013
Neg logit contributions
becca-0.041
kids-0.040
 occasion-0.039
child-0.039
bek-0.039
Token And
Token ID843
Feature activation-1.48e-01

Pos logit contributions
 virtue+0.098
 occasion+0.096
 eater+0.096
 mistake+0.095
 exam+0.094
Neg logit contributions
 sails-0.043
He-0.040
 sail-0.038
 valleys-0.037
 widened-0.035
Token they
Token ID484
Feature activation-3.03e-02

Pos logit contributions
 virtue+0.217
 eater+0.204
olding+0.203
child+0.200
kids+0.200
Neg logit contributions
 sails-0.258
 sail-0.254
 gl-0.238
 valleys-0.231
 glitter-0.226
Token are
Token ID389
Feature activation+4.96e-03

Pos logit contributions
 eater+0.066
 occasion+0.064
ae+0.064
becca+0.064
kids+0.063
Neg logit contributions
 sails-0.031
 gl-0.030
 sail-0.029
 spark-0.026
 widened-0.025
Token not
Token ID407
Feature activation-2.14e-01

Pos logit contributions
 sails+0.006
 wood+0.006
 sail+0.006
 shiny+0.006
 gl+0.006
Neg logit contributions
 occasion-0.010
enny-0.009
rav-0.009
child-0.009
 eater-0.009
Token better
Token ID1365
Feature activation+4.99e-03

Pos logit contributions
 occasion+0.389
becca+0.375
 virtue+0.374
olding+0.370
ital+0.367
Neg logit contributions
 sails-0.285
 sail-0.275
 wood-0.269
 gl-0.251
 shiny-0.248
Token than
Token ID621
Feature activation-3.40e-02

Pos logit contributions
 sails+0.006
 sail+0.006
 wood+0.005
itched+0.005
He+0.005
Neg logit contributions
becca-0.009
aughter-0.009
leton-0.009
bek-0.009
nie-0.009
Token forks
Token ID43378
Feature activation-1.77e-02

Pos logit contributions
 virtue+0.058
 eater+0.058
bek+0.058
becca+0.057
aughter+0.055
Neg logit contributions
 sails-0.047
 sail-0.046
 glitter-0.043
 wood-0.043
 bent-0.040
Token.
Token ID13
Feature activation-4.65e-02

Pos logit contributions
becca+0.038
 virtue+0.036
bek+0.035
 occasion+0.035
kids+0.035
Neg logit contributions
 sails-0.018
 sail-0.018
He-0.016
 gl-0.015
 widened-0.015
Token For
Token ID1114
Feature activation-1.14e-01

Pos logit contributions
 eater+0.104
 virtue+0.104
 occasion+0.102
 mistake+0.101
 exam+0.101
Neg logit contributions
 sails-0.040
He-0.036
 valleys-0.035
 sail-0.035
 widened-0.033
Token on
Token ID319
Feature activation-2.77e-02

Pos logit contributions
 sail+0.003
 valleys+0.003
 sails+0.003
 widened+0.003
 gl+0.003
Neg logit contributions
olding-0.004
 eater-0.004
 anniversary-0.004
 virtue-0.004
child-0.004
Token top
Token ID1353
Feature activation+2.90e-03

Pos logit contributions
olding+0.061
 eater+0.060
 virtue+0.059
oker+0.054
 anniversary+0.054
Neg logit contributions
 sails-0.030
 valleys-0.030
 sail-0.028
 wood-0.027
 gl-0.025
Token of
Token ID286
Feature activation-5.58e-02

Pos logit contributions
 valleys+0.004
 sails+0.004
 sail+0.003
 gl+0.003
He+0.003
Neg logit contributions
 virtue-0.005
 eater-0.005
olding-0.005
 anniversary-0.005
becca-0.005
Token me
Token ID502
Feature activation-7.94e-02

Pos logit contributions
 eater+0.114
olding+0.110
 virtue+0.108
 behaviour+0.100
oker+0.099
Neg logit contributions
 valleys-0.070
 sails-0.070
 sail-0.067
 wood-0.063
 gl-0.062
Token.
Token ID13
Feature activation-2.36e-02

Pos logit contributions
 eater+0.153
 virtue+0.151
becca+0.142
 occasion+0.139
 anniversary+0.138
Neg logit contributions
 sails-0.096
 valleys-0.096
 sail-0.093
 gl-0.091
 hollow-0.085
Token I
Token ID314
Feature activation-1.59e-01

Pos logit contributions
 eater+0.037
 mistake+0.036
 virtue+0.035
 behaviour+0.034
 occasion+0.033
Neg logit contributions
 sails-0.037
 valleys-0.036
 sail-0.035
He-0.035
 hollow-0.032
Token am
Token ID716
Feature activation+5.23e-02

Pos logit contributions
 eater+0.263
becca+0.250
 anniversary+0.246
 virtue+0.238
 occasion+0.235
Neg logit contributions
 sail-0.240
 sails-0.239
 gl-0.233
 widened-0.227
 valleys-0.222
Token warm
Token ID5814
Feature activation+7.02e-02

Pos logit contributions
 sail+0.061
 sails+0.060
 gl+0.060
 valleys+0.056
 hollow+0.056
Neg logit contributions
 eater-0.106
 occasion-0.101
 virtue-0.099
 behaviour-0.098
 anniversary-0.097
Token and
Token ID290
Feature activation-3.58e-02

Pos logit contributions
 sails+0.084
 valleys+0.082
 sail+0.081
He+0.074
 gl+0.074
Neg logit contributions
 virtue-0.139
olding-0.133
becca-0.131
 occasion-0.127
 anniversary-0.126
Token cozy
Token ID37438
Feature activation+8.24e-02

Pos logit contributions
 eater+0.073
 virtue+0.069
olding+0.066
 occasion+0.066
 anniversary+0.066
Neg logit contributions
 sails-0.041
 valleys-0.041
 sail-0.040
 gl-0.040
 widened-0.038
Token."
Token ID526
Feature activation-1.26e-01

Pos logit contributions
 valleys+0.129
 sail+0.126
 sails+0.125
 gl+0.116
 hollow+0.115
Neg logit contributions
becca-0.129
olding-0.127
 virtue-0.127
ito-0.123
 principal-0.120
Token a
Token ID257
Feature activation+2.99e-02

Pos logit contributions
 valleys+0.020
 widened+0.020
 sails+0.020
 sail+0.018
He+0.018
Neg logit contributions
 eater-0.019
 ever-0.017
 anniversary-0.017
 exam-0.017
 homework-0.017
Token special
Token ID2041
Feature activation+3.36e-02

Pos logit contributions
 sails+0.047
 valleys+0.046
 sail+0.045
 widened+0.044
 gl+0.043
Neg logit contributions
 eater-0.047
 virtue-0.045
 ever-0.043
 anniversary-0.042
 occasion-0.042
Token promise
Token ID6991
Feature activation-1.22e-01

Pos logit contributions
 sail+0.057
 sails+0.056
 gl+0.053
 valleys+0.051
 hollow+0.051
Neg logit contributions
 eater-0.048
 virtue-0.046
 ever-0.044
olding-0.043
becca-0.042
Token.
Token ID13
Feature activation-1.40e-02

Pos logit contributions
 eater+0.227
 virtue+0.215
becca+0.210
kids+0.203
 inconvenience+0.202
Neg logit contributions
 sail-0.158
 sails-0.151
He-0.141
 gl-0.138
itched-0.129
Token His
Token ID2399
Feature activation+1.85e-01

Pos logit contributions
 eater+0.028
 occasion+0.026
 virtue+0.025
 tutor+0.025
 ever+0.025
Neg logit contributions
 sails-0.017
 valleys-0.016
 widened-0.015
 sail-0.015
 gl-0.014
Token mom
Token ID1995
Feature activation-1.80e-01

Pos logit contributions
 sail+0.322
 sails+0.320
 gl+0.306
 bent+0.288
itched+0.288
Neg logit contributions
 eater-0.256
 virtue-0.247
 occasion-0.234
olding-0.228
 inconvenience-0.226
Token said
Token ID531
Feature activation+8.51e-02

Pos logit contributions
 eater+0.365
 virtue+0.358
 anniversary+0.346
 occasion+0.338
 inconvenience+0.337
Neg logit contributions
 gl-0.194
 sails-0.193
 bent-0.188
 widened-0.183
 sail-0.180
Token,
Token ID11
Feature activation+9.96e-02

Pos logit contributions
 sails+0.099
 sail+0.094
 valleys+0.092
 widened+0.091
 gl+0.089
Neg logit contributions
 eater-0.170
 virtue-0.164
 anniversary-0.157
 occasion-0.157
olding-0.155
Token "
Token ID366
Feature activation-6.04e-02

Pos logit contributions
 sails+0.111
 sail+0.105
 valleys+0.105
 gl+0.102
 widened+0.102
Neg logit contributions
 eater-0.203
 virtue-0.196
 anniversary-0.188
 occasion-0.185
 ever-0.182
TokenSure
Token ID19457
Feature activation-9.60e-02

Pos logit contributions
 eater+0.103
 virtue+0.099
 anniversary+0.099
 mistake+0.097
olding+0.096
Neg logit contributions
 valleys-0.087
He-0.087
 sails-0.084
 gl-0.082
 widened-0.080
Token!
Token ID0
Feature activation-4.83e-02

Pos logit contributions
 eater+0.261
 anniversary+0.257
 virtue+0.256
 occasion+0.253
olding+0.252
Neg logit contributions
 sail-0.027
 sails-0.027
 gl-0.023
 wood-0.022
 hollow-0.016
TokenL
Token ID43
Feature activation-1.46e-01

Pos logit contributions
 eater+0.130
 ever+0.121
 virtue+0.120
 occasion+0.118
 anniversary+0.117
Neg logit contributions
 sails-0.075
 valleys-0.070
 sail-0.069
 widened-0.069
 gl-0.063
Tokenily
Token ID813
Feature activation-1.30e-01

Pos logit contributions
 virtue+0.232
 eater+0.230
 occasion+0.202
 anniversary+0.202
olding+0.201
Neg logit contributions
 sails-0.220
 valleys-0.214
 sail-0.209
 widened-0.208
He-0.208
Token and
Token ID290
Feature activation-4.19e-02

Pos logit contributions
 eater+0.220
 virtue+0.207
 anniversary+0.198
 occasion+0.198
 ever+0.198
Neg logit contributions
 sails-0.191
 valleys-0.188
 sail-0.181
 widened-0.179
 gl-0.172
Token Sam
Token ID3409
Feature activation-2.52e-02

Pos logit contributions
 eater+0.067
 virtue+0.062
 ever+0.061
 anniversary+0.061
 occasion+0.060
Neg logit contributions
 sails-0.066
 valleys-0.065
 widened-0.062
 sail-0.062
 gl-0.060
Token are
Token ID389
Feature activation+5.83e-03

Pos logit contributions
 eater+0.041
 ever+0.039
 virtue+0.036
 anniversary+0.036
nie+0.035
Neg logit contributions
 valleys-0.037
 sails-0.037
 sail-0.035
itched-0.034
 hollow-0.033
Token friends
Token ID2460
Feature activation-1.48e-01

Pos logit contributions
 sails+0.009
 valleys+0.008
 sail+0.008
 gl+0.008
itched+0.008
Neg logit contributions
 virtue-0.010
 eater-0.010
 occasion-0.009
 inconvenience-0.009
 behaviour-0.008
Token.
Token ID13
Feature activation-3.70e-02

Pos logit contributions
 eater+0.231
 virtue+0.221
 anniversary+0.208
 occasion+0.208
olding+0.204
Neg logit contributions
 sails-0.237
 valleys-0.232
 sail-0.226
 widened-0.225
 gl-0.217
Token They
Token ID1119
Feature activation+5.31e-02

Pos logit contributions
 eater+0.077
 virtue+0.071
 occasion+0.070
 anniversary+0.070
 ever+0.070
Neg logit contributions
 sails-0.042
 valleys-0.040
 sail-0.038
 widened-0.037
 gl-0.036
Token like
Token ID588
Feature activation-4.09e-02

Pos logit contributions
 sails+0.085
 valleys+0.082
 sail+0.081
 widened+0.079
 gl+0.077
Neg logit contributions
 eater-0.084
 virtue-0.080
 anniversary-0.075
 occasion-0.075
olding-0.074
Token to
Token ID284
Feature activation-5.06e-02

Pos logit contributions
 virtue+0.067
 eater+0.065
olding+0.062
 mistake+0.060
becca+0.059
Neg logit contributions
 sails-0.066
 sail-0.066
 valleys-0.062
 gl-0.060
He-0.057
Token play
Token ID711
Feature activation-5.61e-03

Pos logit contributions
 eater+0.064
 ever+0.058
 occasion+0.056
 anniversary+0.056
 exam+0.056
Neg logit contributions
 valleys-0.097
 sails-0.094
 widened-0.092
He-0.087
 gl-0.087