TOP ACTIVATIONS
MAX = 0.573

. John loved seeing the ingredients go into the
horses and now she was seeing fake horses . She asked
. He didn 't like seeing the boy being treated badly
. She didn 't like seeing her mom my sad .
had had so much fun seeing the parade and watching the
up into the sky and seeing the clouds around her .
butter looked all around , seeing what it could do .
herself in the mirror and seeing her deep blue eyes .
the beach . He loved seeing them and it made him
loved summer . She loved seeing the sunshine and playing outside
fun travelling at speed and seeing the world .
Yes , and I enjoy seeing you too !" <|endoftext|> Once
a bit jealous , but seeing her brother feeling less scared
't believe what he was seeing . He wondered if staying
throwing pe b bles and seeing how many times they could
job ! I really missed seeing your drawings ." Lily smiled
decided she didn 't like seeing all the people go hungry
visit , the children loved seeing it up close !
Max always looked forward to seeing the kind lady in the
the island , and for seeing all the beauty it had

MIN ACTIVATIONS
MIN = -0.541

felt squ ishy and com fy . They laughed
curl up in her com fy crib and be ready for
But it looks so com fy and sweet ".
and fell . The ash tr ay flew out of her
is so soft and com fy ." The rabbit thought this
. He drops the ash tr ay . It breaks .
never come back . <|endoftext|> L ila and Tom are friends
. It 's really com fy !". Lu cy
wanted to play . <|endoftext|> L ily was a little girl
. It was so com fy and cozy .
her after all . <|endoftext|> L ily was a little girl
to collect things . <|endoftext|> L ily was hungry . She
show her an expensive ash tr ay he got as a
and jumped into her com fy bed . Lucy had big
the swing . L ily felt sad and didn
- mess and L - L ily got m - mad
. She felt so com fy , just like a big
bike and said , " L ily , always remember to
her mom said , " L ily , it 's important
clouds and friends . <|endoftext|> L ily and Ben were playing

INTERVAL 0.295 - 0.573
CONTAINS 0.804%

the good times to keep going . He had to find
to the kitchen . He sees the cake on the table
ran straight outside and began exploring . She noticed
a slide and had fun going down it together . Lily
Da isy became fascinated with the steam and she

INTERVAL 0.016 - 0.295
CONTAINS 43.940%

They can 't wait to show it to Mom and share
felt safe and proud . She had achieved her mission !
excited to have the license . She went outside and started
on her old princess dress and felt much better . But
let it bake . When it was done she took it

INTERVAL -0.262 - 0.016
CONTAINS 53.840%

full tum my . <|endoftext|> Once upon a time , there
Ben and Lily cried . They were sorry and they said
and Alex said no and thanked her . The
and it was available . He picked it up and soon
the spot , but it would always come back .

INTERVAL -0.541 - -0.262
CONTAINS 1.416%

. Lily has a large card with a red heart .
journey . Together , Tim and Joe continued
up Lily 's brother . Lily was happy because she got
paws to push the spear away . As the spear moved
a trash . L ily burst into tears and
Token.
Token ID13
Feature activation+4.31e-02

Pos logit contributions
tr+0.112
brush+0.110
L+0.107
fy+0.104
paste+0.100
Neg logit contributions
 seeing-0.160
 wondering-0.153
 observing-0.152
 discovering-0.151
 wonders-0.147
Token
Token ID198
Feature activation+1.15e-01

Pos logit contributions
 seeing+0.096
 discovering+0.090
 cultures+0.090
 believing+0.089
 observing+0.089
Neg logit contributions
L-0.051
tr-0.051
tered-0.048
brush-0.045
Max-0.044
Token
Token ID198
Feature activation+1.16e-01

Pos logit contributions
 seeing+0.249
 cultures+0.243
 wondering+0.238
 discovering+0.237
 observing+0.235
Neg logit contributions
L-0.157
tr-0.137
Max-0.134
tered-0.132
tted-0.131
TokenJohn
Token ID7554
Feature activation+1.13e-01

Pos logit contributions
 seeing+0.214
 wondering+0.199
 observing+0.197
 discovering+0.197
 wonders+0.192
Neg logit contributions
tr-0.187
L-0.186
brush-0.180
fy-0.180
tered-0.170
Token loved
Token ID6151
Feature activation+1.78e-01

Pos logit contributions
 seeing+0.220
 wondering+0.205
 observing+0.203
 discovering+0.202
 wonders+0.200
Neg logit contributions
tr-0.170
L-0.165
fy-0.162
brush-0.162
 gloss-0.150
Token seeing
Token ID4379
Feature activation+5.73e-01

Pos logit contributions
 cultures+0.358
 wonders+0.334
 wondering+0.326
 seeing+0.323
 observing+0.321
Neg logit contributions
 juice-0.227
 cookie-0.210
L-0.202
brush-0.201
 chip-0.198
Token the
Token ID262
Feature activation+7.73e-02

Pos logit contributions
 seeing+1.212
 cultures+1.203
 discovering+1.181
 observing+1.178
 wondering+1.152
Neg logit contributions
brush-0.636
 tighter-0.625
 juice-0.618
L-0.604
 half-0.592
Token ingredients
Token ID9391
Feature activation+8.24e-02

Pos logit contributions
 seeing+0.179
 discovering+0.173
 observing+0.170
 wondering+0.166
 cultures+0.166
Neg logit contributions
brush-0.091
 juice-0.087
tr-0.087
 cookie-0.084
L-0.081
Token go
Token ID467
Feature activation-1.02e-01

Pos logit contributions
 seeing+0.179
 observing+0.172
 wondering+0.170
 discovering+0.169
 cultures+0.168
Neg logit contributions
brush-0.101
tr-0.096
fy-0.096
L-0.093
 tighter-0.092
Token into
Token ID656
Feature activation-2.13e-02

Pos logit contributions
tr+0.152
L+0.149
brush+0.146
fy+0.146
 tighter+0.136
Neg logit contributions
 seeing-0.201
 wondering-0.187
 observing-0.186
 discovering-0.185
 wonders-0.181
Token the
Token ID262
Feature activation+7.82e-02

Pos logit contributions
L+0.027
 tighter+0.024
 half+0.024
brush+0.024
 juice+0.023
Neg logit contributions
 seeing-0.044
 observing-0.043
 cultures-0.043
 discovering-0.043
 wondering-0.042
Token horses
Token ID14260
Feature activation+2.21e-01

Pos logit contributions
 cultures+0.321
 wondering+0.320
 wonders+0.318
 seeing+0.317
 observing+0.316
Neg logit contributions
 Mitt-0.206
L-0.200
 Midnight-0.195
 Lily-0.195
 juice-0.191
Token and
Token ID290
Feature activation+8.98e-02

Pos logit contributions
 wonders+0.471
 wondering+0.466
 seeing+0.466
 observing+0.464
 cultures+0.454
Neg logit contributions
brush-0.232
tr-0.230
fy-0.219
L-0.208
tered-0.206
Token now
Token ID783
Feature activation+1.08e-02

Pos logit contributions
 wondering+0.138
 observing+0.134
 wonders+0.133
 seeing+0.132
 discovering+0.126
Neg logit contributions
tr-0.138
brush-0.132
L-0.129
 juice-0.129
 swapped-0.127
Token she
Token ID673
Feature activation+1.01e-01

Pos logit contributions
 wondering+0.021
 observing+0.021
 seeing+0.021
 wonders+0.021
 discovering+0.021
Neg logit contributions
brush-0.013
tr-0.013
L-0.012
fy-0.012
 gloss-0.012
Token was
Token ID373
Feature activation+3.25e-02

Pos logit contributions
 seeing+0.218
 observing+0.211
 discovering+0.209
 wondering+0.206
 cultures+0.204
Neg logit contributions
tr-0.132
L-0.128
fy-0.128
brush-0.125
bo-0.116
Token seeing
Token ID4379
Feature activation+5.72e-01

Pos logit contributions
 seeing+0.060
 wondering+0.056
 cultures+0.055
 wonders+0.055
 discovering+0.055
Neg logit contributions
tr-0.050
L-0.049
brush-0.049
fy-0.048
 tighter-0.046
Token fake
Token ID8390
Feature activation+5.09e-02

Pos logit contributions
 seeing+1.202
 observing+1.150
 discovering+1.146
 wondering+1.133
 cultures+1.114
Neg logit contributions
L-0.706
tr-0.688
brush-0.671
 tighter-0.657
 juice-0.630
Token horses
Token ID14260
Feature activation+2.23e-01

Pos logit contributions
 discovering+0.090
 seeing+0.085
 observing+0.085
 ey+0.084
 wondering+0.083
Neg logit contributions
tr-0.068
fy-0.068
tered-0.068
paste-0.066
 tighter-0.065
Token.
Token ID13
Feature activation+3.63e-02

Pos logit contributions
 seeing+0.465
 wonders+0.460
 wondering+0.455
 observing+0.454
 cultures+0.449
Neg logit contributions
tr-0.258
brush-0.257
fy-0.245
L-0.234
 tighter-0.232
Token She
Token ID1375
Feature activation+8.64e-02

Pos logit contributions
 seeing+0.078
 wondering+0.073
 observing+0.072
 discovering+0.072
 cultures+0.070
Neg logit contributions
tr-0.049
L-0.049
fy-0.046
brush-0.045
 gloss-0.043
Token asked
Token ID1965
Feature activation+1.58e-01

Pos logit contributions
 seeing+0.170
 observing+0.161
 discovering+0.160
 wondering+0.158
 cultures+0.153
Neg logit contributions
tr-0.128
L-0.126
brush-0.122
fy-0.122
 tighter-0.115
Token.
Token ID13
Feature activation+2.16e-02

Pos logit contributions
 seeing+0.170
 wondering+0.158
 observing+0.156
 discovering+0.155
 wonders+0.151
Neg logit contributions
tr-0.145
L-0.141
fy-0.140
brush-0.140
 tighter-0.130
Token He
Token ID679
Feature activation+6.69e-03

Pos logit contributions
 seeing+0.049
 discovering+0.046
 observing+0.046
 cultures+0.045
 wondering+0.045
Neg logit contributions
L-0.027
tr-0.027
fy-0.023
brush-0.023
tered-0.023
Token didn
Token ID1422
Feature activation+2.02e-02

Pos logit contributions
 seeing+0.011
 wondering+0.010
 wonders+0.010
 discovering+0.010
 observing+0.010
Neg logit contributions
tr-0.012
brush-0.012
L-0.012
fy-0.012
 gloss-0.011
Token't
Token ID470
Feature activation+1.64e-03

Pos logit contributions
 seeing+0.029
 observing+0.025
 discovering+0.024
 wondering+0.024
 wonders+0.024
Neg logit contributions
fy-0.041
tr-0.039
brush-0.038
 tighter-0.038
L-0.037
Token like
Token ID588
Feature activation+8.58e-02

Pos logit contributions
 seeing+0.002
 wondering+0.002
 observing+0.002
 discovering+0.002
 wonders+0.002
Neg logit contributions
tr-0.003
L-0.003
brush-0.003
fy-0.003
 tighter-0.003
Token seeing
Token ID4379
Feature activation+5.72e-01

Pos logit contributions
 wonders+0.163
 observing+0.163
 wondering+0.162
 cultures+0.161
 seeing+0.157
Neg logit contributions
L-0.105
 juice-0.103
 Lily-0.102
 gloss-0.093
 swapped-0.091
Token the
Token ID262
Feature activation+8.46e-02

Pos logit contributions
 seeing+1.135
 observing+1.070
 wondering+1.069
 discovering+1.068
 wonders+1.035
Neg logit contributions
tr-0.811
L-0.808
brush-0.773
fy-0.755
 tighter-0.732
Token boy
Token ID2933
Feature activation-4.49e-02

Pos logit contributions
 seeing+0.187
 wondering+0.175
 observing+0.175
 discovering+0.175
 wonders+0.168
Neg logit contributions
tr-0.108
L-0.106
brush-0.101
fy-0.101
 tighter-0.094
Token being
Token ID852
Feature activation+2.95e-01

Pos logit contributions
tr+0.055
L+0.054
fy+0.053
brush+0.051
 tighter+0.050
Neg logit contributions
 seeing-0.096
 observing-0.092
 wondering-0.092
 discovering-0.091
 wonders-0.090
Token treated
Token ID5716
Feature activation+1.04e-01

Pos logit contributions
 seeing+0.505
 wondering+0.463
 observing+0.458
 discovering+0.456
 wonders+0.445
Neg logit contributions
tr-0.529
L-0.513
brush-0.510
fy-0.506
 tighter-0.475
Token badly
Token ID11234
Feature activation-1.44e-01

Pos logit contributions
 seeing+0.187
 wondering+0.176
 wonders+0.170
 discovering+0.169
 observing+0.167
Neg logit contributions
tr-0.178
brush-0.174
fy-0.168
L-0.165
 gloss-0.157
Token.
Token ID13
Feature activation+2.62e-02

Pos logit contributions
L+0.148
fy+0.147
tr+0.142
tered+0.142
brush+0.139
Neg logit contributions
 seeing-0.220
 discovering-0.210
 observing-0.210
 cultures-0.205
 wondering-0.204
Token She
Token ID1375
Feature activation+8.26e-02

Pos logit contributions
 seeing+0.053
 observing+0.049
 discovering+0.048
 wondering+0.048
 cultures+0.046
Neg logit contributions
tr-0.040
L-0.039
brush-0.037
fy-0.036
tered-0.035
Token didn
Token ID1422
Feature activation+2.31e-02

Pos logit contributions
 seeing+0.139
 observing+0.127
 discovering+0.127
 wondering+0.127
 wonders+0.121
Neg logit contributions
tr-0.149
L-0.146
fy-0.143
brush-0.143
 tighter-0.135
Token't
Token ID470
Feature activation+5.33e-03

Pos logit contributions
 seeing+0.036
 observing+0.032
 wondering+0.031
 discovering+0.031
 wonders+0.030
Neg logit contributions
fy-0.045
tr-0.042
L-0.041
 tighter-0.041
brush-0.041
Token like
Token ID588
Feature activation+8.49e-02

Pos logit contributions
 seeing+0.007
 wondering+0.006
 observing+0.006
 discovering+0.006
 wonders+0.006
Neg logit contributions
tr-0.011
L-0.011
brush-0.011
fy-0.011
 tighter-0.011
Token seeing
Token ID4379
Feature activation+5.71e-01

Pos logit contributions
 observing+0.161
 wonders+0.160
adows+0.160
adow+0.159
 cultures+0.158
Neg logit contributions
 juice-0.101
 Lily-0.095
 cash-0.091
L-0.089
 cookie-0.088
Token her
Token ID607
Feature activation+1.05e-01

Pos logit contributions
 seeing+1.123
 discovering+1.063
 observing+1.060
 wondering+1.050
 cultures+1.023
Neg logit contributions
tr-0.814
L-0.794
brush-0.786
 tighter-0.748
fy-0.738
Token mom
Token ID1995
Feature activation-1.15e-01

Pos logit contributions
 seeing+0.249
 wondering+0.235
 observing+0.234
 discovering+0.233
 wonders+0.229
Neg logit contributions
tr-0.117
L-0.113
brush-0.110
fy-0.110
 tighter-0.100
Tokenmy
Token ID1820
Feature activation-3.36e-02

Pos logit contributions
tr+0.142
L+0.137
brush+0.134
fy+0.133
 tighter+0.125
Neg logit contributions
 seeing-0.257
 wondering-0.242
 observing-0.241
 discovering-0.241
 wonders-0.237
Token sad
Token ID6507
Feature activation+3.27e-02

Pos logit contributions
tr+0.037
fy+0.035
L+0.035
brush+0.035
 tighter+0.033
Neg logit contributions
 seeing-0.078
 observing-0.074
 discovering-0.074
 wondering-0.073
 wonders-0.073
Token.
Token ID13
Feature activation+2.70e-02

Pos logit contributions
 seeing+0.066
 observing+0.062
 discovering+0.062
 wondering+0.062
 wonders+0.060
Neg logit contributions
tr-0.046
brush-0.045
L-0.045
fy-0.044
 gloss-0.042
Token had
Token ID550
Feature activation-3.65e-02

Pos logit contributions
 seeing+0.164
 observing+0.158
 discovering+0.157
 wondering+0.154
 cultures+0.153
Neg logit contributions
L-0.101
fy-0.100
tr-0.098
brush-0.095
 swapped-0.090
Token had
Token ID550
Feature activation-3.59e-02

Pos logit contributions
L+0.047
tr+0.045
fy+0.044
brush+0.044
 swapped+0.042
Neg logit contributions
 seeing-0.077
 observing-0.074
 cultures-0.073
 wondering-0.073
 discovering-0.072
Token so
Token ID523
Feature activation-2.91e-02

Pos logit contributions
L+0.043
tr+0.042
brush+0.041
fy+0.041
 tighter+0.038
Neg logit contributions
 seeing-0.079
 wondering-0.075
 observing-0.075
 discovering-0.074
 cultures-0.073
Token much
Token ID881
Feature activation-1.52e-01

Pos logit contributions
L+0.023
fy+0.023
tr+0.022
brush+0.021
 tighter+0.019
Neg logit contributions
 seeing-0.077
 wondering-0.073
 discovering-0.073
 observing-0.073
 wonders-0.071
Token fun
Token ID1257
Feature activation+5.86e-02

Pos logit contributions
fy+0.182
L+0.177
 juice+0.177
brush+0.174
tr+0.167
Neg logit contributions
 seeing-0.323
 wondering-0.306
 discovering-0.304
 cultures-0.301
 observing-0.301
Token seeing
Token ID4379
Feature activation+5.71e-01

Pos logit contributions
 wonders+0.114
 cultures+0.112
 wondering+0.105
 seeing+0.102
 ey+0.100
Neg logit contributions
L-0.071
 gloss-0.068
brush-0.067
 juice-0.065
fy-0.064
Token the
Token ID262
Feature activation+7.60e-02

Pos logit contributions
 seeing+1.175
 observing+1.138
 wondering+1.117
 discovering+1.115
 cultures+1.108
Neg logit contributions
L-0.685
brush-0.647
 tighter-0.615
 gloss-0.613
 juice-0.610
Token parade
Token ID16134
Feature activation+1.13e-01

Pos logit contributions
 seeing+0.144
 discovering+0.134
 observing+0.134
 wondering+0.132
 cultures+0.125
Neg logit contributions
L-0.117
fy-0.108
brush-0.108
tr-0.107
 juice-0.103
Token and
Token ID290
Feature activation+8.84e-02

Pos logit contributions
 seeing+0.238
 wondering+0.230
 observing+0.229
 wonders+0.226
 discovering+0.224
Neg logit contributions
tr-0.144
brush-0.139
L-0.138
fy-0.136
tered-0.128
Token watching
Token ID4964
Feature activation+4.36e-01

Pos logit contributions
 wonders+0.115
 cultures+0.115
iles+0.109
 wondering+0.109
 seeing+0.106
Neg logit contributions
 swapped-0.149
brush-0.148
tr-0.148
L-0.147
 gloss-0.147
Token the
Token ID262
Feature activation+7.48e-02

Pos logit contributions
 seeing+0.848
 observing+0.822
 wondering+0.810
 discovering+0.809
 wonders+0.785
Neg logit contributions
L-0.616
tr-0.593
tered-0.585
brush-0.583
 tighter-0.555
Token up
Token ID510
Feature activation+5.93e-02

Pos logit contributions
 seeing+0.631
 observing+0.592
 wondering+0.591
 discovering+0.585
 wonders+0.567
Neg logit contributions
tr-0.489
L-0.480
brush-0.467
fy-0.463
 tighter-0.444
Token into
Token ID656
Feature activation-2.55e-02

Pos logit contributions
 seeing+0.122
 wondering+0.117
 observing+0.116
 cultures+0.115
 discovering+0.115
Neg logit contributions
L-0.079
tr-0.078
brush-0.074
tered-0.072
fy-0.071
Token the
Token ID262
Feature activation+7.09e-02

Pos logit contributions
L+0.036
brush+0.032
tr+0.031
 tighter+0.031
tered+0.030
Neg logit contributions
 seeing-0.051
 observing-0.049
 wondering-0.049
 discovering-0.048
 cultures-0.047
Token sky
Token ID6766
Feature activation+1.38e-01

Pos logit contributions
 seeing+0.118
 observing+0.108
 wondering+0.108
 discovering+0.108
 wonders+0.101
Neg logit contributions
tr-0.130
L-0.127
brush-0.126
fy-0.125
 tighter-0.118
Token and
Token ID290
Feature activation+8.83e-02

Pos logit contributions
 seeing+0.283
 wondering+0.270
 cultures+0.270
 observing+0.269
 wonders+0.268
Neg logit contributions
tr-0.182
brush-0.174
L-0.170
fy-0.167
 tighter-0.159
Token seeing
Token ID4379
Feature activation+5.70e-01

Pos logit contributions
 cultures+0.159
iles+0.151
 wonders+0.148
 whales+0.148
 seeing+0.147
Neg logit contributions
tr-0.118
L-0.116
 gloss-0.108
tered-0.107
 Lily-0.106
Token the
Token ID262
Feature activation+7.19e-02

Pos logit contributions
 observing+1.304
 dealing+1.296
 seeing+1.295
reetings+1.292
azes+1.266
Neg logit contributions
 more-0.468
 tighter-0.444
 off-0.431
 tightly-0.399
 Lily-0.395
Token clouds
Token ID15114
Feature activation+1.97e-01

Pos logit contributions
 seeing+0.133
 observing+0.128
 discovering+0.128
 wondering+0.122
 examining+0.122
Neg logit contributions
tr-0.109
brush-0.105
L-0.097
 tighter-0.094
fy-0.090
Token around
Token ID1088
Feature activation+4.76e-02

Pos logit contributions
 seeing+0.374
 wondering+0.373
 cultures+0.368
 observing+0.368
 wonders+0.367
Neg logit contributions
brush-0.261
tr-0.260
fy-0.241
tered-0.236
 tighter-0.232
Token her
Token ID607
Feature activation+1.01e-01

Pos logit contributions
 cultures+0.101
 seeing+0.100
 observing+0.098
 wondering+0.097
 discovering+0.097
Neg logit contributions
L-0.053
 Lily-0.047
tered-0.046
brush-0.045
tr-0.044
Token.
Token ID13
Feature activation+3.23e-02

Pos logit contributions
 seeing+0.212
 wondering+0.201
 observing+0.199
 discovering+0.199
 cultures+0.193
Neg logit contributions
tr-0.139
brush-0.124
L-0.118
fy-0.116
 tighter-0.111
Token butter
Token ID9215
Feature activation-8.99e-02

Pos logit contributions
tr+0.061
fy+0.057
L+0.055
brush+0.053
tered+0.052
Neg logit contributions
 seeing-0.088
 wondering-0.083
 wonders-0.082
 observing-0.081
 discovering-0.081
Token looked
Token ID3114
Feature activation-7.63e-03

Pos logit contributions
tr+0.124
brush+0.122
L+0.121
fy+0.110
tered+0.108
Neg logit contributions
 seeing-0.191
 wondering-0.181
 observing-0.180
 discovering-0.179
 cultures-0.175
Token all
Token ID477
Feature activation+2.13e-02

Pos logit contributions
tr+0.011
L+0.010
fy+0.010
brush+0.010
 tighter+0.009
Neg logit contributions
 seeing-0.016
 wondering-0.015
 observing-0.015
 discovering-0.015
 wonders-0.014
Token around
Token ID1088
Feature activation+4.66e-02

Pos logit contributions
 seeing+0.039
 observing+0.035
 discovering+0.035
 wondering+0.035
 wonders+0.035
Neg logit contributions
tr-0.036
L-0.034
brush-0.034
fy-0.033
 gloss-0.032
Token,
Token ID11
Feature activation-2.57e-02

Pos logit contributions
 cultures+0.120
ules+0.109
 property+0.109
 expect+0.108
 culture+0.107
Neg logit contributions
 Lily-0.035
L-0.031
time-0.027
tered-0.027
 longer-0.026
Token seeing
Token ID4379
Feature activation+5.70e-01

Pos logit contributions
tered+0.036
L+0.035
 juice+0.033
tr+0.033
 Lily+0.032
Neg logit contributions
 cultures-0.049
ules-0.044
 wonders-0.044
 expect-0.043
 seeing-0.043
Token what
Token ID644
Feature activation+2.72e-01

Pos logit contributions
 seeing+1.204
 observing+1.180
 discovering+1.171
 cultures+1.165
 wondering+1.157
Neg logit contributions
L-0.669
 tighter-0.652
brush-0.649
 juice-0.648
tr-0.644
Token it
Token ID340
Feature activation+1.10e-01

Pos logit contributions
 seeing+0.542
 wondering+0.515
 observing+0.512
 discovering+0.509
 cultures+0.480
Neg logit contributions
L-0.397
tr-0.394
brush-0.388
fy-0.372
 gloss-0.357
Token could
Token ID714
Feature activation+3.13e-02

Pos logit contributions
 seeing+0.275
 observing+0.275
 cultures+0.272
 discovering+0.268
 wondering+0.265
Neg logit contributions
L-0.107
tr-0.101
brush-0.093
bo-0.093
 juice-0.087
Token do
Token ID466
Feature activation+1.48e-01

Pos logit contributions
 seeing+0.065
 wondering+0.065
 observing+0.064
 cultures+0.063
 discovering+0.063
Neg logit contributions
brush-0.040
L-0.039
tr-0.039
tered-0.037
 juice-0.036
Token.
Token ID13
Feature activation+3.37e-02

Pos logit contributions
 seeing+0.352
 wondering+0.350
 observing+0.342
 wonders+0.342
 cultures+0.337
Neg logit contributions
L-0.147
tered-0.131
tr-0.126
brush-0.122
tted-0.121
Token herself
Token ID5223
Feature activation+1.35e-01

Pos logit contributions
 seeing+0.230
 observing+0.221
 cultures+0.214
 wondering+0.213
 discovering+0.212
Neg logit contributions
L-0.130
brush-0.128
 tighter-0.123
 Lily-0.121
 tightly-0.120
Token in
Token ID287
Feature activation+1.02e-02

Pos logit contributions
 seeing+0.277
 wondering+0.259
 observing+0.259
 wonders+0.257
 discovering+0.257
Neg logit contributions
tr-0.187
L-0.182
brush-0.180
fy-0.178
tered-0.169
Token the
Token ID262
Feature activation+7.34e-02

Pos logit contributions
 seeing+0.021
 observing+0.020
 discovering+0.020
 wondering+0.020
ules+0.019
Neg logit contributions
L-0.013
brush-0.012
 half-0.012
 tighter-0.011
 juice-0.011
Token mirror
Token ID10162
Feature activation+1.86e-01

Pos logit contributions
 seeing+0.131
 wondering+0.122
 observing+0.121
 discovering+0.119
 wonders+0.118
Neg logit contributions
tr-0.124
fy-0.120
L-0.118
brush-0.117
 tighter-0.114
Token and
Token ID290
Feature activation+8.61e-02

Pos logit contributions
 seeing+0.348
 wondering+0.324
 observing+0.324
 discovering+0.322
 wonders+0.319
Neg logit contributions
tr-0.295
brush-0.285
L-0.283
fy-0.281
 gloss-0.264
Token seeing
Token ID4379
Feature activation+5.70e-01

Pos logit contributions
 cultures+0.153
 seeing+0.148
 observing+0.148
ules+0.147
 dangers+0.146
Neg logit contributions
 swapped-0.114
tr-0.114
L-0.113
brush-0.111
 gloss-0.110
Token her
Token ID607
Feature activation+1.01e-01

Pos logit contributions
 seeing+1.194
 observing+1.192
 discovering+1.141
 cultures+1.123
 dealing+1.122
Neg logit contributions
L-0.602
 tighter-0.601
brush-0.601
 juice-0.580
 Lily-0.569
Token deep
Token ID2769
Feature activation+7.72e-02

Pos logit contributions
 seeing+0.246
 discovering+0.235
 observing+0.233
 wondering+0.227
 cultures+0.224
Neg logit contributions
tr-0.101
brush-0.094
L-0.093
fy-0.088
 gloss-0.086
Token blue
Token ID4171
Feature activation-2.91e-03

Pos logit contributions
 seeing+0.154
 discovering+0.143
 observing+0.142
 wondering+0.141
 wonders+0.136
Neg logit contributions
tr-0.113
brush-0.110
L-0.108
fy-0.107
 tighter-0.101
Token eyes
Token ID2951
Feature activation+1.13e-01

Pos logit contributions
tr+0.004
brush+0.004
L+0.004
 gloss+0.003
 juice+0.003
Neg logit contributions
 seeing-0.006
 discovering-0.006
 wondering-0.006
 observing-0.006
 cultures-0.006
Token.
Token ID13
Feature activation+2.95e-02

Pos logit contributions
 seeing+0.207
 wondering+0.191
 observing+0.187
 discovering+0.186
 wonders+0.183
Neg logit contributions
tr-0.186
L-0.184
fy-0.179
brush-0.176
 tighter-0.166
Token the
Token ID262
Feature activation+7.62e-02

Pos logit contributions
 cultures+0.096
ules+0.092
ule+0.090
 wondering+0.087
 observing+0.087
Neg logit contributions
 more-0.037
 juice-0.036
 Lily-0.035
 less-0.034
 longer-0.034
Token beach
Token ID10481
Feature activation+2.93e-01

Pos logit contributions
 seeing+0.100
 observing+0.091
 wondering+0.091
 discovering+0.090
 cultures+0.086
Neg logit contributions
L-0.162
tr-0.160
brush-0.155
fy-0.152
 tighter-0.149
Token.
Token ID13
Feature activation+2.40e-02

Pos logit contributions
 seeing+0.615
 observing+0.604
 wondering+0.602
 wonders+0.594
 discovering+0.589
Neg logit contributions
tr-0.350
L-0.343
brush-0.336
fy-0.326
 tighter-0.307
Token He
Token ID679
Feature activation+6.05e-03

Pos logit contributions
 seeing+0.048
 wondering+0.045
 observing+0.044
 discovering+0.044
 wonders+0.043
Neg logit contributions
tr-0.035
brush-0.035
L-0.035
fy-0.035
 tighter-0.032
Token loved
Token ID6151
Feature activation+1.70e-01

Pos logit contributions
 cultures+0.012
 seeing+0.012
 observing+0.011
 discovering+0.011
 wondering+0.011
Neg logit contributions
L-0.009
 swapped-0.008
fy-0.008
 tighter-0.007
bo-0.007
Token seeing
Token ID4379
Feature activation+5.69e-01

Pos logit contributions
eling+0.329
 cultures+0.323
 wonders+0.320
 cottage+0.320
 expect+0.317
Neg logit contributions
 Lily-0.185
 juice-0.181
 more-0.162
L-0.154
 Mitt-0.153
Token them
Token ID606
Feature activation+1.28e-01

Pos logit contributions
 seeing+1.185
 observing+1.184
 wondering+1.156
 discovering+1.148
 cultures+1.129
Neg logit contributions
 juice-0.598
L-0.575
 tighter-0.554
 Lily-0.544
 half-0.521
Token and
Token ID290
Feature activation+8.43e-02

Pos logit contributions
 seeing+0.249
 wondering+0.248
 observing+0.246
 cultures+0.242
 wonders+0.242
Neg logit contributions
 tighter-0.145
 juice-0.145
 swap-0.139
L-0.138
tered-0.137
Token it
Token ID340
Feature activation+1.07e-01

Pos logit contributions
 cultures+0.127
 wonders+0.124
ules+0.119
 heights+0.119
 wondering+0.117
Neg logit contributions
 swapped-0.123
L-0.122
tr-0.119
 juice-0.117
 tighter-0.117
Token made
Token ID925
Feature activation-4.20e-02

Pos logit contributions
 seeing+0.234
 observing+0.225
 wondering+0.222
 discovering+0.221
 cultures+0.220
Neg logit contributions
L-0.130
tr-0.126
fy-0.117
brush-0.116
tered-0.114
Token him
Token ID683
Feature activation+2.05e-01

Pos logit contributions
L+0.074
tered+0.070
tr+0.069
fy+0.069
brush+0.069
Neg logit contributions
 seeing-0.068
 observing-0.065
 wondering-0.064
 discovering-0.064
 cultures-0.063
Token loved
Token ID6151
Feature activation+1.71e-01

Pos logit contributions
 seeing+0.245
 wondering+0.223
 observing+0.221
 discovering+0.220
 wonders+0.213
Neg logit contributions
tr-0.325
L-0.320
fy-0.313
brush-0.313
 tighter-0.296
Token summer
Token ID3931
Feature activation+1.19e-01

Pos logit contributions
 wondering+0.363
 seeing+0.361
 wonders+0.361
 cultures+0.353
 observing+0.347
Neg logit contributions
 juice-0.189
tr-0.186
L-0.181
 gloss-0.168
brush-0.166
Token.
Token ID13
Feature activation+3.46e-02

Pos logit contributions
 seeing+0.203
 observing+0.185
 discovering+0.185
 wondering+0.184
 wonders+0.178
Neg logit contributions
tr-0.213
L-0.206
brush-0.203
fy-0.200
tted-0.197
Token She
Token ID1375
Feature activation+7.71e-02

Pos logit contributions
 seeing+0.076
 believing+0.072
 observing+0.072
 wondering+0.072
 discovering+0.070
Neg logit contributions
L-0.044
tr-0.041
brush-0.040
fy-0.038
tered-0.037
Token loved
Token ID6151
Feature activation+1.72e-01

Pos logit contributions
 seeing+0.146
 cultures+0.139
 discovering+0.137
 wondering+0.137
 observing+0.136
Neg logit contributions
L-0.126
fy-0.118
tr-0.117
brush-0.113
Fl-0.107
Token seeing
Token ID4379
Feature activation+5.69e-01

Pos logit contributions
 cultures+0.389
 wondering+0.385
 wonders+0.384
reetings+0.376
eling+0.374
Neg logit contributions
 juice-0.163
L-0.144
tr-0.136
 cookie-0.129
 gloss-0.128
Token the
Token ID262
Feature activation+7.82e-02

Pos logit contributions
 seeing+1.206
 observing+1.181
 wondering+1.178
 discovering+1.173
 dealing+1.154
Neg logit contributions
 juice-0.643
L-0.627
brush-0.598
 Lily-0.565
tr-0.538
Token sunshine
Token ID34488
Feature activation+3.62e-02

Pos logit contributions
 seeing+0.146
 wondering+0.137
 discovering+0.137
 observing+0.137
 believing+0.128
Neg logit contributions
L-0.125
tr-0.123
 juice-0.116
brush-0.113
Mom-0.108
Token and
Token ID290
Feature activation+8.95e-02

Pos logit contributions
 seeing+0.072
 wondering+0.071
 observing+0.069
 discovering+0.068
 wonders+0.067
Neg logit contributions
tr-0.050
brush-0.049
L-0.045
 gloss-0.045
fy-0.044
Token playing
Token ID2712
Feature activation+2.87e-01

Pos logit contributions
 cultures+0.146
iles+0.145
 property+0.144
 dealing+0.144
reetings+0.143
Neg logit contributions
 juice-0.117
L-0.116
 tighter-0.104
 swapped-0.102
brush-0.102
Token outside
Token ID2354
Feature activation+1.12e-01

Pos logit contributions
 seeing+0.650
 wondering+0.626
 observing+0.614
 discovering+0.610
 cultures+0.607
Neg logit contributions
L-0.349
tr-0.318
brush-0.312
tered-0.290
fy-0.282
Token fun
Token ID1257
Feature activation+5.85e-02

Pos logit contributions
L+0.162
 juice+0.159
fy+0.158
brush+0.158
tr+0.148
Neg logit contributions
 seeing-0.320
 wondering-0.307
 observing-0.303
 cultures-0.302
 discovering-0.300
Token travelling
Token ID16574
Feature activation+4.45e-01

Pos logit contributions
 wonders+0.111
 cultures+0.111
 ey+0.106
avi+0.105
eling+0.104
Neg logit contributions
L-0.063
 longer-0.059
tered-0.059
house-0.058
sc-0.057
Token at
Token ID379
Feature activation+1.12e-01

Pos logit contributions
 seeing+0.926
 observing+0.891
 wondering+0.880
 wonders+0.859
 discovering+0.854
Neg logit contributions
L-0.565
tr-0.559
fy-0.558
tered-0.551
brush-0.548
Token speed
Token ID2866
Feature activation+5.87e-02

Pos logit contributions
 seeing+0.238
 wondering+0.233
 cultures+0.232
 observing+0.231
 discovering+0.220
Neg logit contributions
L-0.123
 tighter-0.106
 Lily-0.106
tr-0.103
 half-0.100
Token and
Token ID290
Feature activation+8.94e-02

Pos logit contributions
 seeing+0.127
 observing+0.124
 wondering+0.120
 discovering+0.118
 wonders+0.115
Neg logit contributions
L-0.071
fy-0.068
brush-0.067
tered-0.067
tr-0.067
Token seeing
Token ID4379
Feature activation+5.69e-01

Pos logit contributions
 seeing+0.136
 observing+0.130
 wondering+0.129
 cultures+0.127
 wonders+0.125
Neg logit contributions
L-0.140
tered-0.138
tr-0.138
fy-0.136
 tighter-0.134
Token the
Token ID262
Feature activation+8.02e-02

Pos logit contributions
 seeing+1.200
 observing+1.198
ule+1.147
 wondering+1.145
ules+1.141
Neg logit contributions
 tighter-0.594
L-0.557
 more-0.550
 Lily-0.531
 less-0.528
Token world
Token ID995
Feature activation+3.93e-01

Pos logit contributions
 seeing+0.129
 observing+0.120
 wondering+0.113
 discovering+0.113
 visiting+0.108
Neg logit contributions
L-0.155
fy-0.140
 tighter-0.140
brush-0.139
tr-0.136
Token.
Token ID13
Feature activation+3.31e-02

Pos logit contributions
 seeing+0.792
 wondering+0.742
 observing+0.739
 discovering+0.731
 wonders+0.715
Neg logit contributions
tr-0.570
L-0.557
brush-0.545
fy-0.545
 tighter-0.509
Token
Token ID220
Feature activation-2.13e-02

Pos logit contributions
 seeing+0.077
 observing+0.071
 wondering+0.071
 discovering+0.070
 cultures+0.069
Neg logit contributions
tr-0.039
L-0.038
fy-0.036
brush-0.035
tered-0.035
Token
Token ID198
Feature activation+1.13e-01

Pos logit contributions
L+0.029
tr+0.028
tted+0.028
brush+0.025
Max+0.024
Neg logit contributions
 seeing-0.047
 wondering-0.044
 cultures-0.043
 observing-0.043
 discovering-0.043
TokenYes
Token ID5297
Feature activation-1.24e-01

Pos logit contributions
 seeing+0.291
 wondering+0.275
 discovering+0.271
 observing+0.271
 wonders+0.269
Neg logit contributions
tr-0.144
fy-0.137
brush-0.135
L-0.130
 tighter-0.126
Token,
Token ID11
Feature activation-2.57e-02

Pos logit contributions
tr+0.215
brush+0.195
L+0.194
fy+0.193
tted+0.192
Neg logit contributions
 seeing-0.232
 wondering-0.212
 observing-0.204
 discovering-0.200
 wonders-0.196
Token and
Token ID290
Feature activation+8.69e-02

Pos logit contributions
tr+0.037
L+0.036
brush+0.035
fy+0.035
 tighter+0.033
Neg logit contributions
 seeing-0.052
 wondering-0.049
 observing-0.049
 discovering-0.049
 wonders-0.048
Token I
Token ID314
Feature activation+7.40e-02

Pos logit contributions
 seeing+0.167
 wondering+0.157
 observing+0.156
 discovering+0.155
 wonders+0.152
Neg logit contributions
tr-0.132
L-0.130
brush-0.128
fy-0.127
 tighter-0.119
Token enjoy
Token ID2883
Feature activation+1.33e-01

Pos logit contributions
 seeing+0.143
 observing+0.137
 discovering+0.136
 wondering+0.135
 cultures+0.133
Neg logit contributions
L-0.116
tr-0.115
fy-0.110
brush-0.109
tered-0.102
Token seeing
Token ID4379
Feature activation+5.69e-01

Pos logit contributions
 cultures+0.271
 wonders+0.270
 ey+0.263
 wondering+0.257
 property+0.243
Neg logit contributions
L-0.149
Fl-0.133
 Lily-0.132
 juice-0.131
brush-0.130
Token you
Token ID345
Feature activation+1.17e-01

Pos logit contributions
 discovering+1.145
 seeing+1.141
 observing+1.139
 wondering+1.130
 cultures+1.092
Neg logit contributions
L-0.698
brush-0.617
tr-0.612
 tighter-0.594
Fl-0.593
Token too
Token ID1165
Feature activation-1.09e-01

Pos logit contributions
 seeing+0.249
 wondering+0.243
 observing+0.242
 discovering+0.239
 wonders+0.234
Neg logit contributions
fy-0.133
L-0.132
tr-0.131
tered-0.125
 tighter-0.123
Token!"
Token ID2474
Feature activation+1.21e-01

Pos logit contributions
fy+0.148
L+0.144
tered+0.131
tr+0.130
brush+0.124
Neg logit contributions
 seeing-0.213
 discovering-0.208
 observing-0.207
 cultures-0.207
 wondering-0.204
Token<|endoftext|>
Token ID50256
Feature activation+3.92e-02

Pos logit contributions
 seeing+0.229
 wondering+0.209
 observing+0.207
 discovering+0.206
 wonders+0.197
Neg logit contributions
brush-0.189
fy-0.189
tr-0.188
 tighter-0.177
L-0.177
TokenOnce
Token ID7454
Feature activation-1.12e-01

Pos logit contributions
 seeing+0.072
 wondering+0.067
 discovering+0.066
 observing+0.066
 wonders+0.064
Neg logit contributions
L-0.065
tr-0.064
brush-0.061
fy-0.060
tered-0.057
Token a
Token ID257
Feature activation-5.18e-02

Pos logit contributions
L+0.077
 tighter+0.076
brush+0.076
tr+0.075
fy+0.072
Neg logit contributions
 cultures-0.098
 seeing-0.096
 wonders-0.089
 discovering-0.089
 observing-0.088
Token bit
Token ID1643
Feature activation-2.90e-01

Pos logit contributions
tr+0.075
brush+0.072
L+0.070
 gloss+0.069
paste+0.069
Neg logit contributions
 seeing-0.107
 wondering-0.102
 observing-0.098
 discovering-0.096
 wonders-0.095
Token jealous
Token ID19354
Feature activation+9.96e-02

Pos logit contributions
tr+0.646
brush+0.593
L+0.593
tered+0.564
 gloss+0.560
Neg logit contributions
 seeing-0.401
 wondering-0.386
 wonders-0.372
 observing-0.350
 discovering-0.344
Token,
Token ID11
Feature activation-2.45e-02

Pos logit contributions
 seeing+0.191
 wondering+0.178
 observing+0.176
 discovering+0.175
 wonders+0.171
Neg logit contributions
tr-0.157
L-0.152
fy-0.149
brush-0.149
 tighter-0.139
Token but
Token ID475
Feature activation+1.17e-01

Pos logit contributions
L+0.035
brush+0.034
tr+0.034
fy+0.034
 tighter+0.033
Neg logit contributions
 seeing-0.046
 cultures-0.045
 observing-0.044
 wonders-0.043
 discovering-0.043
Token seeing
Token ID4379
Feature activation+5.69e-01

Pos logit contributions
 seeing+0.201
 wondering+0.187
 discovering+0.183
 observing+0.182
 wonders+0.178
Neg logit contributions
tr-0.221
brush-0.208
fy-0.207
L-0.207
tted-0.198
Token her
Token ID607
Feature activation+9.81e-02

Pos logit contributions
 seeing+1.135
 observing+1.083
 wondering+1.063
 discovering+1.056
 cultures+1.040
Neg logit contributions
L-0.759
brush-0.742
tr-0.739
 tighter-0.719
 gloss-0.695
Token brother
Token ID3956
Feature activation+6.21e-03

Pos logit contributions
 seeing+0.209
 discovering+0.196
 observing+0.193
 wondering+0.190
 cultures+0.184
Neg logit contributions
tr-0.133
brush-0.125
fy-0.124
L-0.121
 tighter-0.120
Token feeling
Token ID4203
Feature activation+1.90e-01

Pos logit contributions
 seeing+0.012
 cultures+0.012
 observing+0.011
 wonders+0.011
 discovering+0.011
Neg logit contributions
tr-0.008
L-0.008
brush-0.008
fy-0.008
 swapped-0.008
Token less
Token ID1342
Feature activation-2.77e-01

Pos logit contributions
 seeing+0.395
 observing+0.368
 discovering+0.366
 wondering+0.366
 cultures+0.356
Neg logit contributions
L-0.258
tr-0.254
fy-0.252
brush-0.252
 tighter-0.236
Token scared
Token ID12008
Feature activation+1.88e-01

Pos logit contributions
fy+0.459
 tighter+0.437
brush+0.433
L+0.432
 swapped+0.422
Neg logit contributions
 seeing-0.465
 discovering-0.448
 cultures-0.441
 observing-0.438
 wondering-0.403
Token't
Token ID470
Feature activation+2.65e-03

Pos logit contributions
 seeing+0.245
 wondering+0.229
 observing+0.228
 discovering+0.226
 wonders+0.221
Neg logit contributions
tr-0.171
L-0.166
fy-0.165
brush-0.163
 tighter-0.153
Token believe
Token ID1975
Feature activation+4.38e-02

Pos logit contributions
 seeing+0.005
 wondering+0.005
 observing+0.005
 discovering+0.005
 wonders+0.005
Neg logit contributions
tr-0.004
L-0.004
brush-0.004
fy-0.003
tered-0.003
Token what
Token ID644
Feature activation+2.73e-01

Pos logit contributions
 seeing+0.083
 wonders+0.080
 observing+0.080
 discovering+0.079
 wondering+0.078
Neg logit contributions
L-0.061
tered-0.058
tr-0.057
brush-0.057
 tighter-0.055
Token he
Token ID339
Feature activation+5.56e-02

Pos logit contributions
 seeing+0.559
 observing+0.539
 discovering+0.531
 wondering+0.530
 wonders+0.521
Neg logit contributions
L-0.356
brush-0.334
tr-0.330
fy-0.307
 gloss-0.305
Token was
Token ID373
Feature activation+3.60e-02

Pos logit contributions
 observing+0.137
 seeing+0.136
 discovering+0.134
 cultures+0.133
 wondering+0.130
Neg logit contributions
L-0.051
tr-0.050
fy-0.050
bo-0.049
tered-0.045
Token seeing
Token ID4379
Feature activation+5.69e-01

Pos logit contributions
 cultures+0.063
 ey+0.060
 heights+0.057
ric+0.056
ules+0.056
Neg logit contributions
tered-0.043
�-0.042
's-0.040
 cutter-0.040
 away-0.040
Token.
Token ID13
Feature activation+4.02e-02

Pos logit contributions
 seeing+1.246
 observing+1.209
 discovering+1.193
 cultures+1.188
 wondering+1.186
Neg logit contributions
L-0.679
tr-0.633
brush-0.631
tered-0.609
fy-0.596
Token He
Token ID679
Feature activation+1.56e-02

Pos logit contributions
 seeing+0.090
 observing+0.084
 discovering+0.084
 cultures+0.084
 wondering+0.082
Neg logit contributions
L-0.050
tr-0.048
brush-0.044
fy-0.043
tted-0.043
Token wondered
Token ID14028
Feature activation+3.39e-01

Pos logit contributions
 seeing+0.033
 observing+0.032
 discovering+0.031
 wondering+0.031
 cultures+0.031
Neg logit contributions
L-0.020
tr-0.020
fy-0.020
brush-0.020
tered-0.018
Token if
Token ID611
Feature activation+1.44e-01

Pos logit contributions
 wonders+0.556
 cultures+0.535
 observing+0.534
 seeing+0.525
 discovering+0.515
Neg logit contributions
 tighter-0.517
brush-0.513
tered-0.483
L-0.482
 tightly-0.478
Token staying
Token ID10589
Feature activation+3.91e-01

Pos logit contributions
 seeing+0.277
 observing+0.259
 wondering+0.259
 discovering+0.256
 wonders+0.252
Neg logit contributions
L-0.216
tr-0.209
brush-0.205
fy-0.203
 tighter-0.196
Token throwing
Token ID9644
Feature activation+2.00e-01

Pos logit contributions
L+0.116
tr+0.115
brush+0.112
 gloss+0.110
 tighter+0.110
Neg logit contributions
 cultures-0.096
 wonders-0.095
 seeing-0.095
 wondering-0.092
 observing-0.088
Token pe
Token ID613
Feature activation+2.73e-01

Pos logit contributions
 seeing+0.353
 wondering+0.327
 observing+0.326
 discovering+0.323
 wonders+0.311
Neg logit contributions
tr-0.340
L-0.337
brush-0.330
fy-0.326
 tighter-0.314
Tokenb
Token ID65
Feature activation+1.45e-01

Pos logit contributions
 seeing+0.510
 wondering+0.494
 observing+0.490
 discovering+0.481
 cultures+0.461
Neg logit contributions
 tighter-0.402
brush-0.396
L-0.388
bo-0.387
fy-0.371
Tokenbles
Token ID7689
Feature activation+4.60e-02

Pos logit contributions
 seeing+0.331
 wondering+0.320
 observing+0.315
 discovering+0.312
 wonders+0.303
Neg logit contributions
brush-0.173
tr-0.169
L-0.168
paste-0.154
 gloss-0.154
Token and
Token ID290
Feature activation+9.03e-02

Pos logit contributions
 seeing+0.087
 discovering+0.080
 wondering+0.080
 observing+0.080
 wonders+0.078
Neg logit contributions
tr-0.073
L-0.071
fy-0.070
brush-0.069
tered-0.065
Token seeing
Token ID4379
Feature activation+5.69e-01

Pos logit contributions
 seeing+0.123
 wondering+0.112
 observing+0.111
 discovering+0.109
 wonders+0.108
Neg logit contributions
tr-0.187
L-0.184
brush-0.181
fy-0.179
 tighter-0.173
Token how
Token ID703
Feature activation+1.40e-01

Pos logit contributions
 seeing+1.167
 observing+1.153
 discovering+1.124
 wondering+1.118
 cultures+1.089
Neg logit contributions
L-0.687
 tighter-0.683
tr-0.669
brush-0.636
 juice-0.606
Token many
Token ID867
Feature activation+8.86e-02

Pos logit contributions
 seeing+0.265
 wondering+0.246
 observing+0.245
 discovering+0.243
 wonders+0.237
Neg logit contributions
tr-0.219
L-0.217
fy-0.212
brush-0.211
 tighter-0.198
Token times
Token ID1661
Feature activation+1.40e-01

Pos logit contributions
 observing+0.159
 seeing+0.157
 wondering+0.155
 discovering+0.151
 visiting+0.143
Neg logit contributions
L-0.140
 juice-0.127
brush-0.118
 half-0.117
fy-0.117
Token they
Token ID484
Feature activation-4.61e-02

Pos logit contributions
 seeing+0.312
 wondering+0.300
 observing+0.299
 discovering+0.294
 wonders+0.288
Neg logit contributions
tr-0.169
brush-0.169
L-0.168
fy-0.161
tered-0.153
Token could
Token ID714
Feature activation+3.24e-02

Pos logit contributions
 swapped+0.049
L+0.049
fy+0.047
tted+0.046
tered+0.045
Neg logit contributions
 observing-0.108
 seeing-0.106
 wondering-0.105
 discovering-0.102
 visiting-0.099
Token job
Token ID1693
Feature activation-7.21e-02

Pos logit contributions
fy+0.030
brush+0.029
 tighter+0.029
 swapped+0.028
tted+0.028
Neg logit contributions
 seeing-0.036
 wonders-0.032
 discovering-0.032
 observing-0.032
 wondering-0.032
Token!
Token ID0
Feature activation+1.11e-01

Pos logit contributions
tr+0.105
L+0.100
brush+0.099
fy+0.098
 gloss+0.092
Neg logit contributions
 wonders-0.136
 cultures-0.134
 seeing-0.133
 wondering-0.131
 observing-0.129
Token I
Token ID314
Feature activation+7.18e-02

Pos logit contributions
 seeing+0.147
 observing+0.135
 discovering+0.134
 wondering+0.134
 wonders+0.127
Neg logit contributions
 tighter-0.229
fy-0.223
brush-0.223
tr-0.217
kin-0.210
Token really
Token ID1107
Feature activation-3.55e-02

Pos logit contributions
 seeing+0.135
 wondering+0.125
 observing+0.125
 discovering+0.124
 wonders+0.120
Neg logit contributions
tr-0.115
L-0.113
brush-0.111
fy-0.110
 gloss-0.103
Token missed
Token ID6825
Feature activation+5.48e-02

Pos logit contributions
tr+0.061
L+0.060
fy+0.059
brush+0.058
tered+0.055
Neg logit contributions
 seeing-0.062
 observing-0.057
 wondering-0.057
 discovering-0.057
 wonders-0.054
Token seeing
Token ID4379
Feature activation+5.68e-01

Pos logit contributions
 seeing+0.107
 wondering+0.102
 observing+0.102
 cultures+0.101
 wonders+0.100
Neg logit contributions
L-0.076
tr-0.068
brush-0.068
 tighter-0.067
fy-0.067
Token your
Token ID534
Feature activation+1.22e-01

Pos logit contributions
 seeing+1.094
 wondering+1.034
 discovering+1.032
 observing+1.031
 wonders+0.998
Neg logit contributions
L-0.863
tr-0.844
brush-0.819
fy-0.787
 gloss-0.763
Token drawings
Token ID23388
Feature activation+1.18e-01

Pos logit contributions
 seeing+0.245
 discovering+0.227
 wondering+0.227
 observing+0.226
 cultures+0.216
Neg logit contributions
tr-0.184
L-0.172
fy-0.168
brush-0.167
 gloss-0.158
Token."
Token ID526
Feature activation+6.55e-02

Pos logit contributions
 seeing+0.248
 observing+0.241
 wondering+0.238
 discovering+0.235
 cultures+0.231
Neg logit contributions
brush-0.160
tr-0.158
fy-0.150
L-0.146
 tighter-0.136
Token Lily
Token ID20037
Feature activation-2.63e-01

Pos logit contributions
 seeing+0.148
 wondering+0.144
 wonders+0.140
 discovering+0.137
 observing+0.135
Neg logit contributions
brush-0.076
fy-0.075
 tighter-0.071
tted-0.069
tr-0.069
Token smiled
Token ID13541
Feature activation-2.91e-02

Pos logit contributions
tr+0.443
fy+0.435
brush+0.427
L+0.418
tted+0.400
Neg logit contributions
 seeing-0.476
 wondering-0.439
 wonders-0.433
 observing-0.428
 discovering-0.425
Token decided
Token ID3066
Feature activation+1.31e-02

Pos logit contributions
L+0.058
tr+0.055
brush+0.051
 swapped+0.049
fy+0.049
Neg logit contributions
 seeing-0.091
 cultures-0.089
 wondering-0.089
 discovering-0.088
 observing-0.087
Token she
Token ID673
Feature activation+9.60e-02

Pos logit contributions
 cultures+0.031
 wonders+0.031
 observing+0.030
eling+0.029
 discovering+0.029
Neg logit contributions
L-0.012
 Lily-0.010
tr-0.010
brush-0.010
 tighter-0.009
Token didn
Token ID1422
Feature activation+1.88e-02

Pos logit contributions
 seeing+0.185
 observing+0.185
 discovering+0.182
 cultures+0.179
 wondering+0.177
Neg logit contributions
tr-0.144
fy-0.138
L-0.137
tered-0.132
brush-0.132
Token't
Token ID470
Feature activation+1.53e-03

Pos logit contributions
 seeing+0.039
 wondering+0.037
 observing+0.037
 discovering+0.036
 wonders+0.036
Neg logit contributions
tr-0.026
L-0.025
fy-0.025
brush-0.025
 tighter-0.023
Token like
Token ID588
Feature activation+8.45e-02

Pos logit contributions
 seeing+0.003
 wondering+0.003
 observing+0.003
 discovering+0.003
 wonders+0.002
Neg logit contributions
tr-0.002
L-0.002
brush-0.002
fy-0.002
 tighter-0.002
Token seeing
Token ID4379
Feature activation+5.68e-01

Pos logit contributions
 cultures+0.192
adow+0.185
 observing+0.184
reetings+0.183
 wondering+0.182
Neg logit contributions
 juice-0.086
 Lily-0.073
 cookie-0.067
 Mitt-0.060
 Midnight-0.058
Token all
Token ID477
Feature activation+2.01e-02

Pos logit contributions
 seeing+1.178
 discovering+1.169
 cultures+1.163
 observing+1.153
 wondering+1.131
Neg logit contributions
L-0.653
 juice-0.646
brush-0.636
tr-0.629
 Lily-0.623
Token the
Token ID262
Feature activation+7.64e-02

Pos logit contributions
 wondering+0.043
 seeing+0.043
 observing+0.042
 discovering+0.040
 cultures+0.040
Neg logit contributions
L-0.022
 juice-0.021
brush-0.020
 Lily-0.020
 tighter-0.020
Token people
Token ID661
Feature activation+2.10e-01

Pos logit contributions
 observing+0.155
azes+0.153
 seeing+0.153
 discovering+0.152
 believing+0.149
Neg logit contributions
 juice-0.091
 cookie-0.078
ny-0.078
L-0.074
 tighter-0.074
Token go
Token ID467
Feature activation-1.07e-01

Pos logit contributions
 cultures+0.418
 wonders+0.417
 seeing+0.412
 observing+0.411
 wondering+0.408
Neg logit contributions
 swapped-0.229
fy-0.227
 swap-0.223
brush-0.221
bo-0.220
Token hungry
Token ID14720
Feature activation+2.01e-01

Pos logit contributions
L+0.119
 tighter+0.118
fy+0.117
 away+0.113
tr+0.109
Neg logit contributions
 seeing-0.220
 cultures-0.216
 wonders-0.214
 observing-0.212
 wondering-0.210
Token visit
Token ID3187
Feature activation+1.70e-01

Pos logit contributions
 wondering+0.089
 discovering+0.088
 seeing+0.087
 observing+0.087
 cultures+0.086
Neg logit contributions
L-0.051
tr-0.048
 swap-0.046
 tighter-0.045
brush-0.045
Token,
Token ID11
Feature activation-2.96e-02

Pos logit contributions
 wonders+0.346
 seeing+0.345
 discovering+0.345
 cultures+0.342
 wondering+0.342
Neg logit contributions
L-0.189
tered-0.187
tr-0.182
 tighter-0.175
brush-0.173
Token the
Token ID262
Feature activation+7.67e-02

Pos logit contributions
L+0.029
brush+0.028
tered+0.027
tr+0.027
 tighter+0.025
Neg logit contributions
 wonders-0.065
 seeing-0.065
 discovering-0.064
 wondering-0.063
 observing-0.063
Token children
Token ID1751
Feature activation+2.78e-01

Pos logit contributions
 seeing+0.140
 discovering+0.135
 observing+0.132
 wondering+0.129
 cultures+0.127
Neg logit contributions
tr-0.124
L-0.121
brush-0.118
fy-0.112
 tighter-0.111
Token loved
Token ID6151
Feature activation+1.70e-01

Pos logit contributions
 heights+0.650
 ey+0.643
 wondering+0.641
 discovering+0.641
 wonders+0.637
Neg logit contributions
tr-0.268
 swapped-0.245
brush-0.230
L-0.227
 swap-0.225
Token seeing
Token ID4379
Feature activation+5.68e-01

Pos logit contributions
reetings+0.368
 ey+0.367
eling+0.361
 wonders+0.354
 wondering+0.340
Neg logit contributions
 juice-0.158
 Lily-0.151
tr-0.138
 Mitt-0.135
 cookie-0.132
Token it
Token ID340
Feature activation+1.08e-01

Pos logit contributions
 seeing+1.164
 observing+1.150
 discovering+1.138
 wondering+1.107
 dealing+1.059
Neg logit contributions
 juice-0.651
L-0.650
 tighter-0.642
tr-0.631
brush-0.623
Token up
Token ID510
Feature activation+5.91e-02

Pos logit contributions
 seeing+0.235
 observing+0.231
 wondering+0.227
 discovering+0.225
 cultures+0.219
Neg logit contributions
tr-0.121
 tighter-0.115
tered-0.112
L-0.110
fy-0.108
Token close
Token ID1969
Feature activation-5.70e-02

Pos logit contributions
 seeing+0.131
 discovering+0.125
 observing+0.125
 wondering+0.124
 cultures+0.121
Neg logit contributions
L-0.070
 tighter-0.067
tr-0.065
fy-0.064
tered-0.062
Token!
Token ID0
Feature activation+1.13e-01

Pos logit contributions
 tighter+0.063
fy+0.061
 tightly+0.057
brush+0.054
tr+0.054
Neg logit contributions
 seeing-0.126
 observing-0.123
 discovering-0.121
 wondering-0.117
 believing-0.114
Token
Token ID198
Feature activation+1.08e-01

Pos logit contributions
 seeing+0.265
 wondering+0.253
 discovering+0.251
 observing+0.249
 believing+0.248
Neg logit contributions
tr-0.135
L-0.126
brush-0.115
fy-0.108
 swap-0.102
Token Max
Token ID5436
Feature activation-1.67e-01

Pos logit contributions
 seeing+0.194
 observing+0.179
 discovering+0.179
 wondering+0.179
 wonders+0.175
Neg logit contributions
tr-0.094
brush-0.090
fy-0.090
L-0.083
tted-0.082
Token always
Token ID1464
Feature activation+9.19e-02

Pos logit contributions
tr+0.250
brush+0.241
fy+0.239
L+0.238
 tighter+0.227
Neg logit contributions
 seeing-0.337
 discovering-0.309
 observing-0.305
 wondering-0.305
 wonders-0.298
Token looked
Token ID3114
Feature activation-1.19e-02

Pos logit contributions
 seeing+0.185
 wondering+0.170
 discovering+0.169
 observing+0.169
 wonders+0.165
Neg logit contributions
tr-0.141
brush-0.138
fy-0.133
L-0.132
paste-0.128
Token forward
Token ID2651
Feature activation-1.40e-01

Pos logit contributions
tr+0.021
brush+0.021
L+0.020
fy+0.020
 gloss+0.019
Neg logit contributions
 seeing-0.021
 wondering-0.019
 discovering-0.019
 observing-0.019
 wonders-0.018
Token to
Token ID284
Feature activation+4.10e-02

Pos logit contributions
tr+0.165
L+0.158
fy+0.154
 tighter+0.147
brush+0.145
Neg logit contributions
 seeing-0.304
 wondering-0.297
 wonders-0.294
 cultures-0.289
 observing-0.288
Token seeing
Token ID4379
Feature activation+5.68e-01

Pos logit contributions
 seeing+0.062
 wondering+0.061
 wonders+0.060
 observing+0.059
 cultures+0.057
Neg logit contributions
L-0.071
tr-0.069
fy-0.065
 gloss-0.064
 swapped-0.064
Token the
Token ID262
Feature activation+7.36e-02

Pos logit contributions
 seeing+1.160
 wondering+1.096
 observing+1.092
 discovering+1.083
 wonders+1.046
Neg logit contributions
L-0.780
tr-0.773
fy-0.730
brush-0.730
 tighter-0.719
Token kind
Token ID1611
Feature activation+2.76e-02

Pos logit contributions
 seeing+0.128
 wondering+0.119
 discovering+0.118
 observing+0.118
 believing+0.108
Neg logit contributions
L-0.127
tr-0.127
brush-0.118
fy-0.115
 swapped-0.114
Token lady
Token ID10846
Feature activation-4.27e-02

Pos logit contributions
 seeing+0.060
 wondering+0.057
 observing+0.056
 discovering+0.056
 wonders+0.054
Neg logit contributions
tr-0.036
L-0.035
brush-0.034
fy-0.034
 tighter-0.031
Token in
Token ID287
Feature activation+6.95e-03

Pos logit contributions
tr+0.060
L+0.057
fy+0.056
brush+0.056
 gloss+0.053
Neg logit contributions
 seeing-0.087
 wondering-0.083
 observing-0.081
 wonders-0.081
 discovering-0.081
Token the
Token ID262
Feature activation+7.47e-02

Pos logit contributions
 seeing+0.013
 wondering+0.012
 observing+0.012
 discovering+0.012
 wonders+0.012
Neg logit contributions
tr-0.011
L-0.011
brush-0.010
fy-0.010
 tighter-0.010
Token the
Token ID262
Feature activation+7.92e-02

Pos logit contributions
 seeing+0.814
 observing+0.792
 wondering+0.764
 discovering+0.745
 cultures+0.729
Neg logit contributions
L-0.510
 tighter-0.485
brush-0.480
 juice-0.453
fy-0.445
Token island
Token ID7022
Feature activation+2.21e-01

Pos logit contributions
 seeing+0.141
 observing+0.131
azes+0.128
 wondering+0.124
 discovering+0.123
Neg logit contributions
fy-0.115
 tighter-0.114
L-0.112
 juice-0.109
 half-0.108
Token,
Token ID11
Feature activation-2.65e-02

Pos logit contributions
 seeing+0.456
 observing+0.439
 wondering+0.434
 wonders+0.425
 discovering+0.418
Neg logit contributions
brush-0.289
tr-0.283
fy-0.275
L-0.268
 tighter-0.264
Token and
Token ID290
Feature activation+8.84e-02

Pos logit contributions
L+0.038
brush+0.037
tr+0.037
fy+0.035
 tighter+0.035
Neg logit contributions
 seeing-0.050
 observing-0.047
 cultures-0.047
 wonders-0.047
 wondering-0.046
Token for
Token ID329
Feature activation+4.99e-02

Pos logit contributions
 seeing+0.176
 wondering+0.165
 observing+0.165
 discovering+0.163
 wonders+0.161
Neg logit contributions
tr-0.126
L-0.124
brush-0.122
fy-0.120
 tighter-0.114
Token seeing
Token ID4379
Feature activation+5.68e-01

Pos logit contributions
 seeing+0.086
 wondering+0.080
 observing+0.080
 wonders+0.079
 discovering+0.078
Neg logit contributions
L-0.083
tr-0.081
brush-0.079
fy-0.078
 tighter-0.077
Token all
Token ID477
Feature activation+2.01e-02

Pos logit contributions
 seeing+1.123
 observing+1.085
 wondering+1.059
 discovering+1.031
 cultures+1.016
Neg logit contributions
L-0.769
brush-0.742
tr-0.720
 tighter-0.705
tered-0.685
Token the
Token ID262
Feature activation+8.11e-02

Pos logit contributions
 seeing+0.041
 wondering+0.040
 observing+0.039
 wonders+0.038
 cultures+0.038
Neg logit contributions
L-0.024
 tighter-0.023
fy-0.022
brush-0.022
 juice-0.021
Token beauty
Token ID8737
Feature activation+2.96e-01

Pos logit contributions
 seeing+0.147
 observing+0.140
azes+0.134
 wondering+0.131
 discovering+0.127
Neg logit contributions
L-0.114
fy-0.112
 tighter-0.111
 juice-0.106
 half-0.102
Token it
Token ID340
Feature activation+1.10e-01

Pos logit contributions
 seeing+0.717
 wondering+0.685
 observing+0.684
 discovering+0.671
 wonders+0.660
Neg logit contributions
brush-0.300
fy-0.298
tr-0.292
L-0.290
 tighter-0.266
Token had
Token ID550
Feature activation-3.35e-02

Pos logit contributions
 observing+0.254
 seeing+0.254
 wondering+0.245
 cultures+0.244
 discovering+0.242
Neg logit contributions
brush-0.101
tered-0.099
L-0.099
tr-0.098
fy-0.097
Token felt
Token ID2936
Feature activation-1.11e-01

Pos logit contributions
 seeing+0.098
 wondering+0.093
 observing+0.092
 discovering+0.091
 cultures+0.090
Neg logit contributions
L-0.073
tr-0.072
brush-0.069
 tighter-0.066
fy-0.066
Token squ
Token ID2809
Feature activation-9.41e-02

Pos logit contributions
L+0.143
brush+0.137
fy+0.135
 tighter+0.134
tered+0.132
Neg logit contributions
 seeing-0.229
 cultures-0.217
 observing-0.215
 discovering-0.213
 wondering-0.208
Tokenishy
Token ID49785
Feature activation-1.79e-01

Pos logit contributions
tr+0.163
brush+0.149
L+0.147
fy+0.145
tered+0.139
Neg logit contributions
 seeing-0.170
 wondering-0.160
 wonders-0.158
 observing-0.157
 discovering-0.155
Token and
Token ID290
Feature activation-2.22e-02

Pos logit contributions
tr+0.281
L+0.277
brush+0.273
fy+0.271
 tighter+0.256
Neg logit contributions
 seeing-0.337
 wondering-0.313
 observing-0.313
 discovering-0.312
 wonders-0.303
Token com
Token ID401
Feature activation-1.42e-02

Pos logit contributions
L+0.031
fy+0.031
tr+0.029
brush+0.029
 tighter+0.029
Neg logit contributions
 seeing-0.045
 observing-0.042
 discovering-0.042
 cultures-0.042
 wondering-0.041
Tokenfy
Token ID24928
Feature activation-5.21e-01

Pos logit contributions
tr+0.015
L+0.013
brush+0.013
Max+0.011
Wh+0.011
Neg logit contributions
 seeing-0.034
 wondering-0.033
 wonders-0.032
 observing-0.031
 discovering-0.031
Token.
Token ID13
Feature activation-6.89e-02

Pos logit contributions
brush+0.667
tr+0.643
fy+0.636
L+0.636
paste+0.597
Neg logit contributions
 seeing-1.069
 wonders-1.040
 cultures-1.039
 discovering-1.021
 observing-1.017
Token
Token ID198
Feature activation+1.86e-02

Pos logit contributions
L+0.090
tr+0.086
brush+0.082
fy+0.080
 gloss+0.077
Neg logit contributions
 seeing-0.151
 discovering-0.140
 observing-0.140
 wondering-0.139
 cultures-0.139
Token
Token ID198
Feature activation+2.41e-02

Pos logit contributions
 seeing+0.039
 wondering+0.036
 cultures+0.036
 discovering+0.036
 observing+0.036
Neg logit contributions
L-0.029
tr-0.024
Fl-0.024
brush-0.023
Max-0.023
TokenThey
Token ID2990
Feature activation-1.53e-01

Pos logit contributions
 seeing+0.057
 wondering+0.054
 observing+0.054
 discovering+0.054
 wonders+0.052
Neg logit contributions
tr-0.027
fy-0.025
brush-0.025
L-0.024
 tighter-0.023
Token laughed
Token ID13818
Feature activation-1.31e-01

Pos logit contributions
tr+0.210
L+0.203
fy+0.199
brush+0.199
 tighter+0.184
Neg logit contributions
 seeing-0.322
 wondering-0.301
 observing-0.298
 discovering-0.297
 wonders-0.292
Token curl
Token ID29249
Feature activation-2.02e-01

Pos logit contributions
tr+0.122
brush+0.122
L+0.117
fy+0.116
 swap+0.109
Neg logit contributions
 seeing-0.203
 observing-0.194
 discovering-0.194
 wondering-0.193
 cultures-0.190
Token up
Token ID510
Feature activation-4.61e-02

Pos logit contributions
tr+0.332
L+0.326
brush+0.320
fy+0.320
 tighter+0.302
Neg logit contributions
 seeing-0.371
 wondering-0.343
 observing-0.342
 discovering-0.341
 wonders-0.332
Token in
Token ID287
Feature activation-9.30e-02

Pos logit contributions
fy+0.058
tr+0.057
brush+0.057
L+0.056
 tighter+0.056
Neg logit contributions
 seeing-0.097
 observing-0.092
 discovering-0.091
 wondering-0.091
 cultures-0.090
Token her
Token ID607
Feature activation+5.14e-03

Pos logit contributions
L+0.125
brush+0.120
fy+0.118
 tighter+0.117
tr+0.117
Neg logit contributions
 seeing-0.189
 observing-0.178
 cultures-0.176
 wondering-0.176
 discovering-0.176
Token com
Token ID401
Feature activation-3.46e-04

Pos logit contributions
 seeing+0.011
 discovering+0.010
 wondering+0.010
 observing+0.010
 wonders+0.010
Neg logit contributions
tr-0.007
brush-0.007
fy-0.007
L-0.007
 tighter-0.006
Tokenfy
Token ID24928
Feature activation-5.05e-01

Pos logit contributions
tr+0.000
L+0.000
brush+0.000
tered+0.000
tted+0.000
Neg logit contributions
 seeing-0.001
 wonders-0.001
 wondering-0.001
 observing-0.001
 cultures-0.001
Token crib
Token ID48083
Feature activation-1.13e-01

Pos logit contributions
tr+0.707
L+0.691
brush+0.687
fy+0.682
 tighter+0.629
Neg logit contributions
 seeing-1.038
 wondering-0.972
 observing-0.965
 discovering-0.963
 wonders-0.940
Token and
Token ID290
Feature activation+3.76e-03

Pos logit contributions
tr+0.175
L+0.170
brush+0.167
fy+0.167
 tighter+0.156
Neg logit contributions
 seeing-0.217
 wondering-0.202
 observing-0.201
 discovering-0.200
 wonders-0.195
Token be
Token ID307
Feature activation-1.13e-01

Pos logit contributions
 seeing+0.008
 discovering+0.007
 observing+0.007
 wondering+0.007
 cultures+0.007
Neg logit contributions
tr-0.005
fy-0.005
brush-0.005
L-0.005
 tighter-0.005
Token ready
Token ID3492
Feature activation-8.24e-02

Pos logit contributions
tr+0.169
L+0.166
brush+0.163
fy+0.162
 tighter+0.152
Neg logit contributions
 seeing-0.222
 wondering-0.206
 observing-0.205
 discovering-0.205
 wonders-0.201
Token for
Token ID329
Feature activation-3.46e-02

Pos logit contributions
fy+0.105
brush+0.104
tr+0.103
L+0.100
 tighter+0.094
Neg logit contributions
 seeing-0.173
 observing-0.167
 wonders-0.166
 cultures-0.166
 wondering-0.165
TokenBut
Token ID1537
Feature activation-7.84e-02

Pos logit contributions
 seeing+0.025
 wondering+0.024
 observing+0.023
 discovering+0.023
 wonders+0.023
Neg logit contributions
tr-0.016
fy-0.016
brush-0.016
L-0.015
 tighter-0.014
Token it
Token ID340
Feature activation+5.62e-03

Pos logit contributions
tr+0.108
L+0.105
brush+0.104
fy+0.095
 tighter+0.094
Neg logit contributions
 seeing-0.156
 wondering-0.149
 discovering-0.148
 wonders-0.147
 cultures-0.147
Token looks
Token ID3073
Feature activation+3.50e-02

Pos logit contributions
 seeing+0.014
 observing+0.013
 discovering+0.013
 wondering+0.013
 cultures+0.013
Neg logit contributions
tr-0.006
L-0.005
brush-0.005
fy-0.005
 tighter-0.005
Token so
Token ID523
Feature activation-1.27e-01

Pos logit contributions
 seeing+0.072
 observing+0.069
 discovering+0.068
 cultures+0.066
 wondering+0.066
Neg logit contributions
fy-0.048
L-0.046
tr-0.046
brush-0.044
 tighter-0.044
Token com
Token ID401
Feature activation+5.90e-03

Pos logit contributions
fy+0.161
L+0.153
 juice+0.139
tr+0.137
brush+0.134
Neg logit contributions
 seeing-0.285
 observing-0.272
 discovering-0.272
 wondering-0.263
 cultures-0.262
Tokenfy
Token ID24928
Feature activation-5.03e-01

Pos logit contributions
 seeing+0.017
 wondering+0.016
 wonders+0.016
 observing+0.016
 discovering+0.016
Neg logit contributions
L-0.003
tr-0.003
brush-0.002
Fl-0.002
Wh-0.002
Token and
Token ID290
Feature activation-2.70e-03

Pos logit contributions
brush+0.691
fy+0.681
L+0.662
tr+0.661
tered+0.638
Neg logit contributions
 seeing-0.986
 cultures-0.977
 wonders-0.957
 observing-0.946
 discovering-0.944
Token sweet
Token ID6029
Feature activation-1.21e-01

Pos logit contributions
fy+0.004
brush+0.004
tr+0.004
L+0.003
 tighter+0.003
Neg logit contributions
 seeing-0.006
 observing-0.005
 discovering-0.005
 wondering-0.005
 cultures-0.005
Token".
Token ID1911
Feature activation+1.02e-02

Pos logit contributions
fy+0.175
brush+0.171
tr+0.166
L+0.165
 juice+0.156
Neg logit contributions
 seeing-0.239
 observing-0.229
 discovering-0.229
 wondering-0.227
 cultures-0.219
Token
Token ID198
Feature activation+2.26e-02

Pos logit contributions
 seeing+0.020
 wondering+0.019
 observing+0.018
 discovering+0.018
 wonders+0.018
Neg logit contributions
tr-0.015
fy-0.015
L-0.015
brush-0.015
 tighter-0.014
Token
Token ID198
Feature activation+2.62e-02

Pos logit contributions
 seeing+0.043
 wondering+0.040
 discovering+0.040
 observing+0.040
 wonders+0.038
Neg logit contributions
L-0.037
tr-0.035
brush-0.034
fy-0.033
Fl-0.032
Token and
Token ID290
Feature activation-1.05e-02

Pos logit contributions
tr+0.293
fy+0.290
L+0.283
brush+0.278
 gloss+0.263
Neg logit contributions
 seeing-0.346
 wondering-0.319
 observing-0.314
 discovering-0.313
 wonders-0.309
Token fell
Token ID3214
Feature activation-2.15e-01

Pos logit contributions
tr+0.014
fy+0.014
brush+0.013
L+0.013
 gloss+0.012
Neg logit contributions
 seeing-0.022
 wondering-0.020
 wonders-0.020
 observing-0.020
 discovering-0.020
Token.
Token ID13
Feature activation-6.51e-02

Pos logit contributions
tr+0.335
fy+0.333
L+0.329
brush+0.319
 swapped+0.304
Neg logit contributions
 seeing-0.402
 wondering-0.376
 observing-0.365
 discovering-0.362
 wonders-0.360
Token The
Token ID383
Feature activation-1.43e-01

Pos logit contributions
L+0.090
tr+0.085
brush+0.079
 gloss+0.077
Fl+0.073
Neg logit contributions
 seeing-0.143
 cultures-0.137
 discovering-0.136
 observing-0.132
 wondering-0.130
Token ash
Token ID12530
Feature activation+4.18e-02

Pos logit contributions
tr+0.206
L+0.202
fy+0.199
brush+0.197
 tighter+0.183
Neg logit contributions
 seeing-0.286
 wondering-0.268
 observing-0.266
 discovering-0.264
 wonders-0.259
Tokentr
Token ID2213
Feature activation-5.02e-01

Pos logit contributions
 seeing+0.091
 wonders+0.084
 wondering+0.083
 observing+0.082
 discovering+0.080
Neg logit contributions
fy-0.050
L-0.048
 tighter-0.044
tered-0.042
 swap-0.041
Tokenay
Token ID323
Feature activation-3.10e-01

Pos logit contributions
fy+0.587
 tighter+0.586
L+0.582
tered+0.566
Fl+0.563
Neg logit contributions
 seeing-0.980
 wonders-0.928
 wondering-0.925
 observing-0.898
 doing-0.855
Token flew
Token ID13112
Feature activation-7.04e-02

Pos logit contributions
fy+0.416
tr+0.413
L+0.408
brush+0.397
 tighter+0.381
Neg logit contributions
 seeing-0.642
 wondering-0.591
 observing-0.589
 discovering-0.581
 wonders-0.578
Token out
Token ID503
Feature activation-1.12e-01

Pos logit contributions
 gloss+0.117
fy+0.117
paste+0.113
brush+0.112
tr+0.112
Neg logit contributions
 seeing-0.111
 wondering-0.102
 wonders-0.099
 discovering-0.094
 observing-0.093
Token of
Token ID286
Feature activation-2.06e-01

Pos logit contributions
tr+0.157
L+0.150
fy+0.150
brush+0.148
 gloss+0.140
Neg logit contributions
 seeing-0.234
 wondering-0.218
 observing-0.215
 discovering-0.214
 wonders-0.212
Token her
Token ID607
Feature activation+1.71e-02

Pos logit contributions
tr+0.328
L+0.324
brush+0.316
fy+0.311
 tighter+0.296
Neg logit contributions
 seeing-0.384
 wondering-0.357
 observing-0.355
 discovering-0.354
 wonders-0.344
Token is
Token ID318
Feature activation+7.48e-02

Pos logit contributions
tr+0.125
L+0.118
brush+0.116
fy+0.115
bo+0.108
Neg logit contributions
 seeing-0.190
 observing-0.181
 discovering-0.180
 wondering-0.175
 cultures-0.174
Token so
Token ID523
Feature activation-1.24e-01

Pos logit contributions
 seeing+0.158
 cultures+0.150
 discovering+0.149
 observing+0.149
 wondering+0.147
Neg logit contributions
tr-0.098
brush-0.093
L-0.092
fy-0.090
 tighter-0.088
Token soft
Token ID2705
Feature activation-1.33e-01

Pos logit contributions
fy+0.162
L+0.154
brush+0.141
tr+0.139
 juice+0.138
Neg logit contributions
 seeing-0.277
 observing-0.261
 discovering-0.259
 cultures-0.253
 wondering-0.251
Token and
Token ID290
Feature activation-4.68e-03

Pos logit contributions
brush+0.188
tr+0.187
L+0.182
fy+0.180
tered+0.171
Neg logit contributions
 seeing-0.262
 observing-0.249
 cultures-0.248
 discovering-0.248
 wondering-0.245
Token com
Token ID401
Feature activation+3.78e-03

Pos logit contributions
tr+0.007
L+0.007
fy+0.007
brush+0.006
 tighter+0.006
Neg logit contributions
 seeing-0.010
 observing-0.009
 wondering-0.009
 discovering-0.009
 wonders-0.009
Tokenfy
Token ID24928
Feature activation-5.00e-01

Pos logit contributions
 seeing+0.011
 wonders+0.011
 wondering+0.011
 observing+0.011
 discovering+0.011
Neg logit contributions
tr-0.002
L-0.002
brush-0.001
 gloss-0.001
tered-0.001
Token."
Token ID526
Feature activation-2.18e-02

Pos logit contributions
brush+0.628
tr+0.610
fy+0.599
L+0.594
tered+0.576
Neg logit contributions
 seeing-1.040
 wonders-1.009
 cultures-1.009
 observing-0.996
 discovering-0.990
Token The
Token ID383
Feature activation-1.29e-01

Pos logit contributions
L+0.031
tr+0.027
brush+0.027
fy+0.027
 gloss+0.026
Neg logit contributions
 seeing-0.046
 discovering-0.043
 observing-0.043
 cultures-0.042
 wondering-0.042
Token rabbit
Token ID22746
Feature activation-9.38e-02

Pos logit contributions
L+0.181
tr+0.180
brush+0.174
fy+0.169
 tighter+0.162
Neg logit contributions
 seeing-0.266
 discovering-0.248
 observing-0.247
 wondering-0.246
 wonders-0.238
Token thought
Token ID1807
Feature activation+1.17e-01

Pos logit contributions
L+0.118
tr+0.118
brush+0.114
fy+0.112
 tighter+0.112
Neg logit contributions
 seeing-0.201
 discovering-0.197
 observing-0.195
 cultures-0.189
 wondering-0.184
Token this
Token ID428
Feature activation-1.07e-03

Pos logit contributions
 seeing+0.236
 observing+0.226
 cultures+0.223
 discovering+0.223
 wonders+0.222
Neg logit contributions
brush-0.165
fy-0.162
L-0.161
tr-0.159
tered-0.154
Token.
Token ID13
Feature activation-7.25e-02

Pos logit contributions
 seeing+0.247
 discovering+0.234
 cultures+0.233
 observing+0.232
 wondering+0.231
Neg logit contributions
tr-0.190
L-0.188
brush-0.183
fy-0.180
tered-0.176
Token He
Token ID679
Feature activation-9.30e-02

Pos logit contributions
L+0.084
tr+0.077
 Lily+0.068
Fl+0.067
brush+0.067
Neg logit contributions
 seeing-0.171
 cultures-0.168
 discovering-0.165
 observing-0.160
 believing-0.160
Token drops
Token ID10532
Feature activation-5.07e-02

Pos logit contributions
tr+0.154
L+0.151
brush+0.144
fy+0.144
bo+0.143
Neg logit contributions
 discovering-0.166
 seeing-0.164
 observing-0.163
 cultures-0.158
 believing-0.148
Token the
Token ID262
Feature activation-2.26e-02

Pos logit contributions
tr+0.104
fy+0.101
brush+0.096
 gloss+0.094
L+0.094
Neg logit contributions
 seeing-0.074
 wonders-0.067
 wondering-0.066
 observing-0.064
 discovering-0.064
Token ash
Token ID12530
Feature activation+4.55e-02

Pos logit contributions
tr+0.031
L+0.030
brush+0.030
fy+0.030
 tighter+0.028
Neg logit contributions
 seeing-0.047
 wondering-0.044
 observing-0.044
 discovering-0.044
 wonders-0.042
Tokentr
Token ID2213
Feature activation-4.99e-01

Pos logit contributions
 seeing+0.082
 wonders+0.074
 wondering+0.074
 observing+0.073
 discovering+0.071
Neg logit contributions
L-0.070
fy-0.069
 swapped-0.064
tered-0.063
tted-0.062
Tokenay
Token ID323
Feature activation-3.05e-01

Pos logit contributions
tered+0.634
L+0.632
Mid+0.621
Fl+0.612
 swapped+0.611
Neg logit contributions
 seeing-0.914
 wondering-0.873
 wonders-0.859
 observing-0.849
 ey-0.805
Token.
Token ID13
Feature activation-5.02e-02

Pos logit contributions
L+0.509
fy+0.507
tr+0.490
 swapped+0.478
brush+0.476
Neg logit contributions
 seeing-0.519
 wondering-0.468
 observing-0.467
 wonders-0.456
 discovering-0.455
Token It
Token ID632
Feature activation-7.08e-02

Pos logit contributions
L+0.060
tr+0.057
brush+0.053
tered+0.048
tted+0.048
Neg logit contributions
 seeing-0.118
 discovering-0.113
 cultures-0.112
 observing-0.111
 wondering-0.110
Token breaks
Token ID9457
Feature activation-2.23e-01

Pos logit contributions
tr+0.119
L+0.117
brush+0.115
fy+0.114
 tighter+0.108
Neg logit contributions
 seeing-0.127
 wondering-0.118
 observing-0.117
 discovering-0.117
 wonders-0.113
Token.
Token ID13
Feature activation-4.21e-02

Pos logit contributions
tr+0.365
L+0.354
fy+0.353
brush+0.350
 gloss+0.328
Neg logit contributions
 seeing-0.413
 wondering-0.383
 observing-0.378
 discovering-0.376
 wonders-0.370
Token never
Token ID1239
Feature activation-1.55e-01

Pos logit contributions
 seeing+0.157
 discovering+0.149
 observing+0.149
 wondering+0.147
 cultures+0.141
Neg logit contributions
tr-0.152
 tighter-0.148
brush-0.148
L-0.147
 swap-0.142
Token come
Token ID1282
Feature activation-9.33e-02

Pos logit contributions
 tighter+0.192
L+0.192
fy+0.190
tr+0.189
brush+0.185
Neg logit contributions
 seeing-0.290
 observing-0.290
 wondering-0.287
 cultures-0.285
 discovering-0.280
Token back
Token ID736
Feature activation-3.42e-01

Pos logit contributions
L+0.055
tr+0.054
 tighter+0.049
brush+0.046
fy+0.045
Neg logit contributions
 seeing-0.251
 wondering-0.243
 observing-0.242
 cultures-0.241
 discovering-0.241
Token.
Token ID13
Feature activation-8.16e-02

Pos logit contributions
L+0.295
tr+0.274
 tighter+0.268
fy+0.267
brush+0.241
Neg logit contributions
 seeing-0.779
 cultures-0.776
 discovering-0.763
 observing-0.763
 wondering-0.763
Token<|endoftext|>
Token ID50256
Feature activation-7.17e-02

Pos logit contributions
L+0.115
tr+0.114
brush+0.106
fy+0.106
 gloss+0.098
Neg logit contributions
 seeing-0.169
 observing-0.156
 wondering-0.156
 discovering-0.156
 wonders-0.151
TokenL
Token ID43
Feature activation-4.99e-01

Pos logit contributions
L+0.125
tr+0.118
tted+0.110
brush+0.110
Fl+0.109
Neg logit contributions
 seeing-0.132
 wondering-0.124
 cultures-0.122
 discovering-0.121
 observing-0.119
Tokenila
Token ID10102
Feature activation+9.63e-02

Pos logit contributions
L+0.670
tr+0.630
brush+0.612
fy+0.600
 tighter+0.561
Neg logit contributions
 seeing-1.085
 wondering-1.024
 observing-1.022
 discovering-1.012
 wonders-0.989
Token and
Token ID290
Feature activation-8.28e-03

Pos logit contributions
 cultures+0.220
 discovering+0.218
 seeing+0.218
 observing+0.216
 wondering+0.211
Neg logit contributions
L-0.113
tr-0.100
bo-0.097
brush-0.094
fy-0.088
Token Tom
Token ID4186
Feature activation-1.40e-01

Pos logit contributions
L+0.010
tr+0.010
brush+0.009
tered+0.009
fy+0.008
Neg logit contributions
 seeing-0.018
 discovering-0.018
 wondering-0.018
 cultures-0.017
 wonders-0.017
Token are
Token ID389
Feature activation+2.20e-03

Pos logit contributions
L+0.161
 swapped+0.149
bo+0.145
tr+0.142
 swap+0.139
Neg logit contributions
 seeing-0.324
 discovering-0.322
 observing-0.316
 wondering-0.314
 cultures-0.313
Token friends
Token ID2460
Feature activation-2.44e-04

Pos logit contributions
 seeing+0.004
 cultures+0.003
 wonders+0.003
 discovering+0.003
 wondering+0.003
Neg logit contributions
L-0.004
tr-0.004
fy-0.004
brush-0.004
 tighter-0.004
Token.
Token ID13
Feature activation-7.88e-02

Pos logit contributions
L+0.233
tr+0.228
brush+0.227
fy+0.222
 tighter+0.216
Neg logit contributions
 seeing-0.285
 discovering-0.269
 observing-0.269
 wondering-0.266
 cultures-0.253
Token It
Token ID632
Feature activation-8.07e-02

Pos logit contributions
tr+0.131
fy+0.131
brush+0.129
L+0.127
 tighter+0.126
Neg logit contributions
 seeing-0.133
 wondering-0.126
 observing-0.126
 wonders-0.124
 discovering-0.122
Token's
Token ID338
Feature activation-2.54e-01

Pos logit contributions
tr+0.115
brush+0.105
L+0.104
fy+0.103
 tighter+0.097
Neg logit contributions
 seeing-0.177
 discovering-0.167
 observing-0.167
 wondering-0.165
 cultures-0.158
Token really
Token ID1107
Feature activation-1.35e-01

Pos logit contributions
tr+0.316
brush+0.277
 tighter+0.271
fy+0.271
L+0.263
Neg logit contributions
 seeing-0.586
 discovering-0.561
 observing-0.557
 wondering-0.551
 cultures-0.550
Token com
Token ID401
Feature activation-1.27e-03

Pos logit contributions
fy+0.173
tr+0.168
L+0.158
brush+0.151
 tighter+0.145
Neg logit contributions
 seeing-0.306
 discovering-0.298
 observing-0.291
 cultures-0.288
 wondering-0.285
Tokenfy
Token ID24928
Feature activation-4.99e-01

Pos logit contributions
L+0.001
tr+0.001
tered+0.000
brush+0.000
 gloss+0.000
Neg logit contributions
 seeing-0.004
 wonders-0.004
 wondering-0.003
 cultures-0.003
 observing-0.003
Token!".
Token ID48220
Feature activation+5.01e-02

Pos logit contributions
brush+0.693
tr+0.688
fy+0.669
L+0.651
 tighter+0.610
Neg logit contributions
 seeing-1.001
 observing-0.949
 discovering-0.945
 wondering-0.942
 cultures-0.936
Token
Token ID198
Feature activation+2.52e-02

Pos logit contributions
 seeing+0.105
 discovering+0.097
 observing+0.097
 wondering+0.097
 wonders+0.095
Neg logit contributions
L-0.071
tr-0.067
brush-0.065
fy-0.064
 gloss-0.061
Token
Token ID198
Feature activation+3.08e-02

Pos logit contributions
 seeing+0.052
 discovering+0.048
 wondering+0.048
 observing+0.048
 cultures+0.048
Neg logit contributions
L-0.040
tr-0.034
brush-0.033
Fl-0.033
Max-0.032
TokenLu
Token ID25596
Feature activation-2.86e-01

Pos logit contributions
 seeing+0.075
 wondering+0.071
 observing+0.071
 discovering+0.071
 wonders+0.069
Neg logit contributions
tr-0.032
fy-0.030
brush-0.030
L-0.028
tered-0.026
Tokency
Token ID948
Feature activation-2.33e-01

Pos logit contributions
tr+0.239
L+0.232
fy+0.222
brush+0.217
 tighter+0.198
Neg logit contributions
 seeing-0.763
 wondering-0.721
 discovering-0.720
 observing-0.717
 wonders-0.700
Token wanted
Token ID2227
Feature activation+9.43e-02

Pos logit contributions
L+0.353
tr+0.338
brush+0.326
fy+0.322
 tighter+0.307
Neg logit contributions
 seeing-0.503
 observing-0.481
 wondering-0.479
 discovering-0.476
 cultures-0.455
Token to
Token ID284
Feature activation-8.18e-02

Pos logit contributions
 seeing+0.208
 wondering+0.205
 cultures+0.205
 observing+0.201
 wonders+0.201
Neg logit contributions
tr-0.099
L-0.093
brush-0.091
 juice-0.091
fy-0.087
Token play
Token ID711
Feature activation-1.43e-01

Pos logit contributions
brush+0.100
L+0.099
tr+0.098
 swap+0.094
 juice+0.088
Neg logit contributions
 seeing-0.173
 wondering-0.171
 observing-0.167
 discovering-0.166
 cultures-0.164
Token.
Token ID13
Feature activation-8.46e-02

Pos logit contributions
tr+0.183
L+0.178
brush+0.174
fy+0.174
 tighter+0.159
Neg logit contributions
 seeing-0.314
 wondering-0.295
 observing-0.293
 discovering-0.292
 wonders-0.286
Token<|endoftext|>
Token ID50256
Feature activation-7.41e-02

Pos logit contributions
tr+0.115
L+0.114
brush+0.109
fy+0.109
 gloss+0.101
Neg logit contributions
 seeing-0.179
 wondering-0.167
 observing-0.166
 discovering-0.166
 wonders-0.162
TokenL
Token ID43
Feature activation-4.99e-01

Pos logit contributions
L+0.111
tr+0.100
tted+0.092
Fl+0.092
Max+0.092
Neg logit contributions
 seeing-0.157
 wondering-0.151
 cultures-0.148
 discovering-0.146
 observing-0.144
Tokenily
Token ID813
Feature activation-2.43e-01

Pos logit contributions
L+0.650
tr+0.569
brush+0.567
fy+0.527
bo+0.516
Neg logit contributions
 seeing-1.111
 wondering-1.071
 observing-1.060
 discovering-1.043
 cultures-1.028
Token was
Token ID373
Feature activation-7.12e-02

Pos logit contributions
L+0.263
tr+0.236
bo+0.220
brush+0.213
 swapped+0.203
Neg logit contributions
 cultures-0.577
 discovering-0.564
 seeing-0.558
 observing-0.553
 heights-0.545
Token a
Token ID257
Feature activation-1.50e-01

Pos logit contributions
tr+0.139
L+0.137
brush+0.131
fy+0.130
 tighter+0.125
Neg logit contributions
 seeing-0.106
 cultures-0.102
 discovering-0.094
 wonders-0.094
 wondering-0.093
Token little
Token ID1310
Feature activation-1.38e-01

Pos logit contributions
tr+0.275
brush+0.266
fy+0.264
 gloss+0.256
paste+0.256
Neg logit contributions
 seeing-0.244
 wondering-0.233
 wonders-0.228
 observing-0.226
 discovering-0.211
Token girl
Token ID2576
Feature activation-1.34e-01

Pos logit contributions
tr+0.221
brush+0.208
fy+0.207
L+0.204
tered+0.201
Neg logit contributions
 seeing-0.260
 wondering-0.251
 wonders-0.244
 observing-0.243
 discovering-0.234
Token.
Token ID13
Feature activation-6.68e-02

Pos logit contributions
tr+0.140
L+0.137
fy+0.134
brush+0.134
 tighter+0.126
Neg logit contributions
 seeing-0.163
 wondering-0.151
 observing-0.150
 discovering-0.149
 wonders-0.146
Token It
Token ID632
Feature activation-7.84e-02

Pos logit contributions
L+0.091
tr+0.090
brush+0.085
fy+0.085
 gloss+0.079
Neg logit contributions
 seeing-0.142
 wondering-0.131
 observing-0.131
 discovering-0.131
 wonders-0.128
Token was
Token ID373
Feature activation-6.45e-02

Pos logit contributions
tr+0.142
L+0.137
brush+0.136
fy+0.134
 tighter+0.127
Neg logit contributions
 seeing-0.135
 observing-0.124
 wondering-0.124
 discovering-0.123
 cultures-0.116
Token so
Token ID523
Feature activation-1.26e-01

Pos logit contributions
tr+0.085
brush+0.081
L+0.080
fy+0.079
 tighter+0.076
Neg logit contributions
 seeing-0.137
 observing-0.129
 discovering-0.129
 cultures-0.128
 wondering-0.128
Token com
Token ID401
Feature activation+2.37e-03

Pos logit contributions
fy+0.165
L+0.160
tr+0.154
brush+0.148
tered+0.141
Neg logit contributions
 seeing-0.282
 observing-0.268
 discovering-0.266
 wondering-0.262
 cultures-0.255
Tokenfy
Token ID24928
Feature activation-4.97e-01

Pos logit contributions
 seeing+0.006
 wonders+0.006
 wondering+0.006
 observing+0.006
 discovering+0.006
Neg logit contributions
tr-0.001
L-0.001
brush-0.001
Wh-0.001
Fl-0.001
Token and
Token ID290
Feature activation+4.19e-03

Pos logit contributions
brush+0.661
tr+0.653
fy+0.635
L+0.629
tered+0.601
Neg logit contributions
 seeing-0.975
 cultures-0.974
 wonders-0.962
 observing-0.936
 wondering-0.935
Token cozy
Token ID37438
Feature activation-6.65e-02

Pos logit contributions
 seeing+0.008
 wondering+0.008
 observing+0.008
 discovering+0.008
 wonders+0.008
Neg logit contributions
tr-0.006
L-0.006
fy-0.006
brush-0.006
 tighter-0.005
Token.
Token ID13
Feature activation-4.58e-02

Pos logit contributions
tr+0.108
L+0.106
fy+0.104
brush+0.103
 gloss+0.098
Neg logit contributions
 seeing-0.124
 wondering-0.115
 observing-0.114
 discovering-0.113
 wonders-0.110
Token
Token ID198
Feature activation+2.75e-02

Pos logit contributions
tr+0.060
L+0.060
brush+0.057
fy+0.057
 gloss+0.052
Neg logit contributions
 seeing-0.099
 wondering-0.092
 observing-0.092
 discovering-0.092
 wonders-0.090
Token
Token ID198
Feature activation+3.15e-02

Pos logit contributions
 seeing+0.060
 cultures+0.056
 wondering+0.056
 discovering+0.056
 observing+0.055
Neg logit contributions
L-0.042
Fl-0.034
Max-0.033
tr-0.033
brush-0.032
Token her
Token ID607
Feature activation-3.39e-02

Pos logit contributions
L+0.217
brush+0.217
tr+0.204
 tighter+0.197
 juice+0.194
Neg logit contributions
 seeing-0.310
 wondering-0.306
 observing-0.302
 discovering-0.301
 cultures-0.296
Token after
Token ID706
Feature activation-2.48e-01

Pos logit contributions
tr+0.045
brush+0.043
L+0.041
fy+0.040
 tighter+0.038
Neg logit contributions
 seeing-0.073
 wondering-0.069
 discovering-0.069
 observing-0.069
 cultures-0.067
Token all
Token ID477
Feature activation-1.03e-01

Pos logit contributions
 juice+0.318
L+0.316
brush+0.312
tr+0.309
 Lily+0.286
Neg logit contributions
 cultures-0.485
 wonders-0.469
 wondering-0.469
 seeing-0.469
 discovering-0.461
Token.
Token ID13
Feature activation-8.87e-02

Pos logit contributions
L+0.093
 juice+0.092
brush+0.090
tr+0.085
fy+0.084
Neg logit contributions
 wondering-0.240
 seeing-0.240
 observing-0.236
 cultures-0.231
 discovering-0.230
Token<|endoftext|>
Token ID50256
Feature activation-6.94e-02

Pos logit contributions
tr+0.131
L+0.131
brush+0.123
fy+0.122
 gloss+0.115
Neg logit contributions
 seeing-0.178
 wondering-0.165
 observing-0.164
 discovering-0.164
 wonders-0.159
TokenL
Token ID43
Feature activation-4.96e-01

Pos logit contributions
L+0.098
tr+0.091
brush+0.084
tted+0.083
Fl+0.082
Neg logit contributions
 seeing-0.150
 wondering-0.143
 cultures-0.142
 discovering-0.140
 observing-0.138
Tokenily
Token ID813
Feature activation-2.42e-01

Pos logit contributions
L+0.675
tr+0.625
brush+0.611
fy+0.598
 tighter+0.560
Neg logit contributions
 seeing-1.069
 wondering-1.015
 observing-1.012
 discovering-1.000
 wonders-0.973
Token was
Token ID373
Feature activation-7.06e-02

Pos logit contributions
L+0.267
tr+0.240
bo+0.235
brush+0.215
fy+0.211
Neg logit contributions
 cultures-0.571
 discovering-0.552
 seeing-0.548
 observing-0.540
 heights-0.530
Token a
Token ID257
Feature activation-1.52e-01

Pos logit contributions
tr+0.139
L+0.137
brush+0.132
fy+0.131
 tighter+0.127
Neg logit contributions
 seeing-0.104
 cultures-0.097
 discovering-0.093
 wonders-0.092
 wondering-0.092
Token little
Token ID1310
Feature activation-1.41e-01

Pos logit contributions
tr+0.276
brush+0.269
fy+0.263
paste+0.258
 gloss+0.257
Neg logit contributions
 seeing-0.252
 wondering-0.240
 observing-0.233
 wonders-0.231
 discovering-0.218
Token girl
Token ID2576
Feature activation-1.35e-01

Pos logit contributions
tr+0.234
brush+0.221
fy+0.216
tted+0.215
L+0.215
Neg logit contributions
 seeing-0.256
 wondering-0.249
 wonders-0.241
 observing-0.240
 discovering-0.229
Token to
Token ID284
Feature activation-9.38e-02

Pos logit contributions
 juice+0.079
 Lily+0.073
 cookie+0.070
tr+0.068
L+0.068
Neg logit contributions
 wondering-0.123
 cultures-0.122
 wonders-0.122
 observing-0.121
 discovering-0.120
Token collect
Token ID2824
Feature activation+7.74e-02

Pos logit contributions
brush+0.103
 swap+0.101
tr+0.097
L+0.097
 juice+0.093
Neg logit contributions
 wondering-0.201
 seeing-0.200
 discovering-0.197
 observing-0.196
 cultures-0.194
Token things
Token ID1243
Feature activation+1.15e-01

Pos logit contributions
 seeing+0.141
 observing+0.133
 wondering+0.130
 discovering+0.130
 cultures+0.121
Neg logit contributions
L-0.127
brush-0.126
fy-0.123
tr-0.119
 juice-0.116
Token.
Token ID13
Feature activation-8.34e-02

Pos logit contributions
 seeing+0.253
 wondering+0.244
 observing+0.243
 discovering+0.237
 wonders+0.235
Neg logit contributions
brush-0.145
L-0.138
tr-0.138
fy-0.136
 tighter-0.123
Token<|endoftext|>
Token ID50256
Feature activation-6.73e-02

Pos logit contributions
L+0.105
tr+0.102
brush+0.095
fy+0.094
 gloss+0.088
Neg logit contributions
 seeing-0.187
 wondering-0.173
 discovering-0.173
 observing-0.173
 cultures-0.168
TokenL
Token ID43
Feature activation-4.96e-01

Pos logit contributions
L+0.099
tr+0.091
Fl+0.084
brush+0.083
tted+0.082
Neg logit contributions
 seeing-0.142
 wondering-0.136
 cultures-0.134
 discovering-0.133
 observing-0.130
Tokenily
Token ID813
Feature activation-2.41e-01

Pos logit contributions
L+0.716
tr+0.632
brush+0.631
fy+0.600
bo+0.586
Neg logit contributions
 seeing-1.043
 wondering-0.997
 observing-0.990
 discovering-0.975
 cultures-0.956
Token was
Token ID373
Feature activation-7.08e-02

Pos logit contributions
L+0.246
tr+0.220
bo+0.211
brush+0.193
 swapped+0.189
Neg logit contributions
 cultures-0.574
 heights-0.554
 seeing-0.553
 discovering-0.552
 observing-0.542
Token hungry
Token ID14720
Feature activation+1.04e-01

Pos logit contributions
tr+0.136
L+0.135
brush+0.129
fy+0.128
 tighter+0.124
Neg logit contributions
 seeing-0.106
 cultures-0.101
 discovering-0.095
 wonders-0.094
 wondering-0.093
Token.
Token ID13
Feature activation-5.68e-02

Pos logit contributions
 seeing+0.192
 wondering+0.178
 discovering+0.177
 observing+0.177
 wonders+0.174
Neg logit contributions
tr-0.169
L-0.164
brush-0.163
fy-0.162
 tighter-0.152
Token She
Token ID1375
Feature activation-9.43e-03

Pos logit contributions
tr+0.102
fy+0.100
brush+0.098
L+0.097
 tighter+0.096
Neg logit contributions
 seeing-0.095
 wondering-0.088
 observing-0.087
 discovering-0.086
 wonders-0.084
Token show
Token ID905
Feature activation-2.06e-02

Pos logit contributions
brush+0.100
L+0.099
tr+0.098
 swap+0.094
 juice+0.088
Neg logit contributions
 seeing-0.173
 wondering-0.171
 observing-0.167
 discovering-0.166
 cultures-0.164
Token her
Token ID607
Feature activation-1.32e-02

Pos logit contributions
L+0.031
tr+0.030
brush+0.030
fy+0.030
 tighter+0.029
Neg logit contributions
 seeing-0.039
 observing-0.037
 wondering-0.037
 discovering-0.036
 wonders-0.035
Token an
Token ID281
Feature activation+1.09e-02

Pos logit contributions
tr+0.017
brush+0.017
L+0.016
 gloss+0.016
fy+0.016
Neg logit contributions
 seeing-0.028
 observing-0.026
 wondering-0.026
 discovering-0.026
 wonders-0.025
Token expensive
Token ID5789
Feature activation+1.91e-02

Pos logit contributions
 seeing+0.019
 observing+0.018
 discovering+0.017
 wondering+0.017
 cultures+0.016
Neg logit contributions
tr-0.018
brush-0.018
fy-0.018
 tighter-0.017
L-0.017
Token ash
Token ID12530
Feature activation+5.24e-02

Pos logit contributions
 seeing+0.042
 discovering+0.040
 observing+0.040
 wondering+0.040
 wonders+0.039
Neg logit contributions
brush-0.023
tr-0.023
 juice-0.021
L-0.020
 cookie-0.019
Tokentr
Token ID2213
Feature activation-4.96e-01

Pos logit contributions
 seeing+0.129
 wondering+0.123
 observing+0.122
 discovering+0.122
 wonders+0.120
Neg logit contributions
tr-0.054
brush-0.050
L-0.050
fy-0.049
 gloss-0.044
Tokenay
Token ID323
Feature activation-3.08e-01

Pos logit contributions
L+0.548
 swapped+0.507
fy+0.505
 tighter+0.502
tered+0.500
Neg logit contributions
 seeing-1.075
 wondering-1.022
 wonders-1.004
 observing-1.002
 discovering-0.982
Token he
Token ID339
Feature activation-4.23e-02

Pos logit contributions
tr+0.441
brush+0.426
L+0.402
fy+0.392
 gloss+0.388
Neg logit contributions
 seeing-0.633
 observing-0.607
 wondering-0.604
 wonders-0.601
 discovering-0.598
Token got
Token ID1392
Feature activation-2.53e-01

Pos logit contributions
L+0.049
tr+0.049
fy+0.047
brush+0.046
tered+0.041
Neg logit contributions
 observing-0.095
 seeing-0.094
 cultures-0.093
 discovering-0.092
 heights-0.090
Token as
Token ID355
Feature activation-2.30e-01

Pos logit contributions
brush+0.325
tr+0.316
L+0.315
fy+0.311
 tighter+0.295
Neg logit contributions
 seeing-0.524
 cultures-0.506
 observing-0.506
 discovering-0.494
 wonders-0.487
Token a
Token ID257
Feature activation-1.35e-01

Pos logit contributions
L+0.283
brush+0.272
fy+0.265
 tighter+0.261
tr+0.260
Neg logit contributions
 seeing-0.501
 observing-0.479
 discovering-0.473
 cultures-0.462
 wondering-0.461
Token and
Token ID290
Feature activation-6.32e-03

Pos logit contributions
tr+0.352
L+0.341
brush+0.340
fy+0.334
 tighter+0.313
Neg logit contributions
 seeing-0.423
 wondering-0.396
 observing-0.394
 discovering-0.392
 wonders-0.386
Token jumped
Token ID11687
Feature activation-7.68e-02

Pos logit contributions
tr+0.009
L+0.008
brush+0.008
fy+0.008
 tighter+0.008
Neg logit contributions
 seeing-0.013
 wondering-0.012
 observing-0.012
 discovering-0.012
 wonders-0.012
Token into
Token ID656
Feature activation-1.21e-01

Pos logit contributions
tr+0.118
L+0.114
fy+0.112
brush+0.111
 gloss+0.105
Neg logit contributions
 seeing-0.148
 wondering-0.139
 observing-0.136
 discovering-0.135
 wonders-0.134
Token her
Token ID607
Feature activation+1.26e-02

Pos logit contributions
L+0.159
brush+0.142
 tighter+0.139
 juice+0.137
fy+0.136
Neg logit contributions
 seeing-0.251
 cultures-0.240
 observing-0.238
 discovering-0.235
 wondering-0.233
Token com
Token ID401
Feature activation+8.12e-03

Pos logit contributions
 seeing+0.025
 wondering+0.023
 discovering+0.023
 observing+0.023
 wonders+0.022
Neg logit contributions
tr-0.019
L-0.018
fy-0.018
brush-0.018
 tighter-0.017
Tokenfy
Token ID24928
Feature activation-4.95e-01

Pos logit contributions
 seeing+0.022
 wonders+0.021
 cultures+0.021
 wondering+0.021
 observing+0.021
Neg logit contributions
tr-0.005
L-0.004
tered-0.004
Wh-0.004
brush-0.004
Token bed
Token ID3996
Feature activation-6.51e-02

Pos logit contributions
tr+0.661
brush+0.647
L+0.645
fy+0.639
 tighter+0.585
Neg logit contributions
 seeing-1.040
 wondering-0.978
 observing-0.969
 discovering-0.969
 wonders-0.946
Token.
Token ID13
Feature activation-5.16e-02

Pos logit contributions
tr+0.108
L+0.106
fy+0.103
brush+0.102
 gloss+0.097
Neg logit contributions
 seeing-0.118
 wondering-0.109
 observing-0.107
 discovering-0.107
 wonders-0.103
Token Lucy
Token ID22162
Feature activation-1.62e-01

Pos logit contributions
tr+0.073
L+0.072
brush+0.069
fy+0.069
 gloss+0.064
Neg logit contributions
 seeing-0.106
 wondering-0.099
 observing-0.099
 discovering-0.099
 wonders-0.096
Token had
Token ID550
Feature activation-1.12e-01

Pos logit contributions
L+0.221
tr+0.220
fy+0.215
brush+0.212
 tighter+0.200
Neg logit contributions
 seeing-0.339
 observing-0.319
 discovering-0.318
 wondering-0.314
 cultures-0.309
Token big
Token ID1263
Feature activation-5.55e-02

Pos logit contributions
L+0.134
tr+0.131
fy+0.130
brush+0.129
 tighter+0.118
Neg logit contributions
 seeing-0.252
 observing-0.239
 cultures-0.238
 discovering-0.236
 wondering-0.235
Token the
Token ID262
Feature activation-4.56e-02

Pos logit contributions
L+0.297
 juice+0.272
brush+0.262
 half+0.252
 Lily+0.248
Neg logit contributions
 seeing-0.506
 wondering-0.501
 observing-0.500
 discovering-0.493
 cultures-0.476
Token swing
Token ID9628
Feature activation-8.90e-02

Pos logit contributions
tr+0.059
brush+0.057
L+0.054
 juice+0.053
fy+0.052
Neg logit contributions
 seeing-0.102
 discovering-0.094
 observing-0.094
 wondering-0.094
 wonders-0.089
Token.
Token ID13
Feature activation-9.07e-02

Pos logit contributions
tr+0.144
L+0.143
fy+0.138
brush+0.136
 tighter+0.131
Neg logit contributions
 seeing-0.160
 discovering-0.146
 wondering-0.146
 observing-0.145
 wonders-0.140
Token
Token ID198
Feature activation+3.32e-03

Pos logit contributions
tr+0.126
L+0.125
brush+0.120
fy+0.119
 gloss+0.111
Neg logit contributions
 seeing-0.189
 wondering-0.176
 observing-0.175
 discovering-0.175
 wonders-0.170
Token
Token ID198
Feature activation+1.21e-02

Pos logit contributions
 seeing+0.007
 wondering+0.007
 observing+0.007
 cultures+0.007
 discovering+0.007
Neg logit contributions
L-0.005
tr-0.004
Fl-0.004
Max-0.004
Mom-0.004
TokenL
Token ID43
Feature activation-4.95e-01

Pos logit contributions
 seeing+0.028
 observing+0.027
 wondering+0.027
 discovering+0.027
 wonders+0.026
Neg logit contributions
tr-0.014
fy-0.013
brush-0.013
L-0.012
 gloss-0.012
Tokenily
Token ID813
Feature activation-2.38e-01

Pos logit contributions
L+0.460
tr+0.438
brush+0.410
fy+0.398
 gloss+0.357
Neg logit contributions
 seeing-1.285
 wondering-1.225
 observing-1.219
 discovering-1.211
 wonders-1.186
Token felt
Token ID2936
Feature activation-5.66e-02

Pos logit contributions
L+0.330
tr+0.327
brush+0.310
fy+0.306
 swapped+0.298
Neg logit contributions
 seeing-0.489
 observing-0.467
 discovering-0.466
 wondering-0.458
 cultures-0.454
Token sad
Token ID6507
Feature activation-6.35e-02

Pos logit contributions
L+0.084
tr+0.084
brush+0.082
fy+0.082
 tighter+0.077
Neg logit contributions
 seeing-0.111
 observing-0.103
 discovering-0.103
 wondering-0.103
 wonders-0.100
Token and
Token ID290
Feature activation-2.70e-03

Pos logit contributions
tr+0.098
L+0.097
brush+0.094
fy+0.093
 tighter+0.089
Neg logit contributions
 seeing-0.121
 observing-0.114
 discovering-0.114
 wondering-0.113
 cultures-0.109
Token didn
Token ID1422
Feature activation-7.40e-02

Pos logit contributions
L+0.004
tr+0.004
fy+0.004
 tighter+0.004
brush+0.004
Neg logit contributions
 seeing-0.005
 discovering-0.004
 observing-0.004
 cultures-0.004
 wondering-0.004
Token-
Token ID12
Feature activation-2.07e-01

Pos logit contributions
tr+0.079
L+0.077
brush+0.076
fy+0.076
 tighter+0.071
Neg logit contributions
 seeing-0.094
 wondering-0.087
 observing-0.087
 discovering-0.086
 wonders-0.084
Tokenmess
Token ID37348
Feature activation-6.39e-02

Pos logit contributions
tr+0.277
fy+0.266
L+0.265
brush+0.253
 tighter+0.248
Neg logit contributions
 seeing-0.426
 wondering-0.401
 observing-0.398
 discovering-0.393
 wonders-0.385
Token and
Token ID290
Feature activation-1.55e-02

Pos logit contributions
tr+0.075
brush+0.070
fy+0.069
L+0.068
bo+0.065
Neg logit contributions
 seeing-0.148
 discovering-0.141
 wondering-0.140
 observing-0.139
 wonders-0.139
Token L
Token ID406
Feature activation-1.75e-01

Pos logit contributions
tr+0.022
brush+0.021
L+0.021
fy+0.021
 tighter+0.019
Neg logit contributions
 seeing-0.032
 observing-0.030
 discovering-0.030
 wondering-0.029
 cultures-0.029
Token-
Token ID12
Feature activation-1.92e-01

Pos logit contributions
L+0.286
tr+0.278
brush+0.272
fy+0.266
 gloss+0.257
Neg logit contributions
 seeing-0.328
 wondering-0.308
 discovering-0.306
 observing-0.305
 wonders-0.303
TokenL
Token ID43
Feature activation-4.95e-01

Pos logit contributions
tr+0.254
fy+0.244
L+0.242
brush+0.241
 tighter+0.225
Neg logit contributions
 seeing-0.410
 wondering-0.383
 observing-0.381
 discovering-0.378
 wonders-0.370
Tokenily
Token ID813
Feature activation-2.41e-01

Pos logit contributions
L+0.715
tr+0.706
brush+0.676
fy+0.672
 gloss+0.629
Neg logit contributions
 seeing-1.019
 wondering-0.958
 observing-0.947
 discovering-0.945
 wonders-0.930
Token got
Token ID1392
Feature activation-2.66e-01

Pos logit contributions
tr+0.340
brush+0.328
L+0.326
fy+0.313
 gloss+0.300
Neg logit contributions
 seeing-0.499
 discovering-0.478
 observing-0.475
 cultures-0.473
 wondering-0.470
Token m
Token ID285
Feature activation-2.86e-02

Pos logit contributions
tr+0.361
L+0.358
brush+0.351
fy+0.347
 gloss+0.329
Neg logit contributions
 seeing-0.554
 discovering-0.515
 observing-0.515
 wondering-0.511
 cultures-0.508
Token-
Token ID12
Feature activation-1.87e-01

Pos logit contributions
tr+0.047
L+0.046
brush+0.044
fy+0.043
tered+0.043
Neg logit contributions
 seeing-0.052
 observing-0.049
 wondering-0.048
 discovering-0.047
 wonders-0.044
Tokenmad
Token ID9937
Feature activation-1.35e-01

Pos logit contributions
tr+0.258
fy+0.237
brush+0.236
kin+0.233
tered+0.223
Neg logit contributions
 seeing-0.383
 observing-0.357
 wondering-0.353
 discovering-0.349
 wonders-0.332
Token.
Token ID13
Feature activation-6.24e-02

Pos logit contributions
 seeing+0.030
 wondering+0.028
 observing+0.027
 discovering+0.026
 wonders+0.026
Neg logit contributions
tr-0.032
L-0.031
brush-0.029
fy-0.029
 gloss-0.029
Token She
Token ID1375
Feature activation-7.20e-03

Pos logit contributions
tr+0.089
L+0.089
brush+0.083
fy+0.082
 gloss+0.077
Neg logit contributions
 seeing-0.130
 observing-0.120
 discovering-0.120
 wondering-0.119
 wonders-0.115
Token felt
Token ID2936
Feature activation-4.35e-02

Pos logit contributions
tr+0.009
brush+0.009
L+0.009
fy+0.009
 tighter+0.008
Neg logit contributions
 seeing-0.016
 discovering-0.015
 observing-0.015
 cultures-0.015
 wondering-0.014
Token so
Token ID523
Feature activation-1.13e-01

Pos logit contributions
fy+0.052
L+0.051
brush+0.051
 tighter+0.049
tr+0.048
Neg logit contributions
 seeing-0.095
 cultures-0.090
 observing-0.089
 discovering-0.089
 wondering-0.087
Token com
Token ID401
Feature activation+9.93e-03

Pos logit contributions
fy+0.149
L+0.138
brush+0.132
tted+0.132
 tighter+0.131
Neg logit contributions
 seeing-0.242
 cultures-0.234
 discovering-0.228
 observing-0.224
 whales-0.224
Tokenfy
Token ID24928
Feature activation-4.95e-01

Pos logit contributions
 seeing+0.026
 wondering+0.026
 wonders+0.025
 observing+0.025
 discovering+0.025
Neg logit contributions
tr-0.006
L-0.006
brush-0.005
Wh-0.004
kin-0.004
Token,
Token ID11
Feature activation-1.03e-01

Pos logit contributions
brush+0.666
tr+0.640
fy+0.630
L+0.625
tered+0.606
Neg logit contributions
 cultures-0.971
 seeing-0.966
 wonders-0.963
 observing-0.921
 discovering-0.915
Token just
Token ID655
Feature activation-1.07e-01

Pos logit contributions
brush+0.148
L+0.146
fy+0.143
tr+0.143
 tighter+0.142
Neg logit contributions
 cultures-0.190
 seeing-0.190
 wonders-0.181
 observing-0.180
 discovering-0.177
Token like
Token ID588
Feature activation+5.60e-03

Pos logit contributions
fy+0.128
 tighter+0.125
tr+0.121
brush+0.117
tered+0.112
Neg logit contributions
 cultures-0.220
 seeing-0.214
 wonders-0.205
 discovering-0.205
 observing-0.203
Token a
Token ID257
Feature activation-1.24e-01

Pos logit contributions
 seeing+0.011
 observing+0.010
 wonders+0.010
 discovering+0.010
 wondering+0.010
Neg logit contributions
L-0.008
tr-0.007
brush-0.007
fy-0.007
 gloss-0.007
Token big
Token ID1263
Feature activation-5.12e-02

Pos logit contributions
tr+0.181
L+0.176
fy+0.173
brush+0.172
tered+0.163
Neg logit contributions
 seeing-0.249
 wondering-0.234
 observing-0.230
 discovering-0.227
 wonders-0.226
Token bike
Token ID7161
Feature activation-1.47e-01

Pos logit contributions
tr+0.071
brush+0.068
L+0.067
fy+0.066
 tighter+0.062
Neg logit contributions
 seeing-0.107
 observing-0.099
 discovering-0.099
 wondering-0.099
 wonders-0.096
Token and
Token ID290
Feature activation-2.91e-02

Pos logit contributions
tr+0.250
L+0.249
fy+0.241
brush+0.240
 gloss+0.230
Neg logit contributions
 seeing-0.254
 wondering-0.234
 discovering-0.231
 observing-0.230
 wonders-0.222
Token said
Token ID531
Feature activation-2.52e-02

Pos logit contributions
tr+0.041
L+0.040
brush+0.039
fy+0.039
 tighter+0.036
Neg logit contributions
 seeing-0.060
 wondering-0.057
 observing-0.056
 discovering-0.056
 wonders-0.055
Token,
Token ID11
Feature activation-1.35e-01

Pos logit contributions
tr+0.036
L+0.035
fy+0.035
brush+0.034
 tighter+0.032
Neg logit contributions
 seeing-0.051
 wondering-0.048
 observing-0.048
 discovering-0.047
 wonders-0.046
Token "
Token ID366
Feature activation+2.55e-02

Pos logit contributions
tr+0.164
L+0.163
brush+0.155
 gloss+0.145
tered+0.144
Neg logit contributions
 cultures-0.291
 wonders-0.283
 seeing-0.277
 discovering-0.273
 observing-0.269
TokenL
Token ID43
Feature activation-4.95e-01

Pos logit contributions
 seeing+0.050
 observing+0.046
 wondering+0.046
 discovering+0.046
 wonders+0.046
Neg logit contributions
tr-0.038
fy-0.037
brush-0.036
L-0.034
 tighter-0.034
Tokenily
Token ID813
Feature activation-2.41e-01

Pos logit contributions
tr+0.636
L+0.609
fy+0.602
brush+0.601
 tighter+0.554
Neg logit contributions
 seeing-1.090
 wondering-1.021
 observing-1.017
 discovering-1.014
 wonders-0.991
Token,
Token ID11
Feature activation-1.24e-01

Pos logit contributions
tr+0.271
L+0.263
brush+0.260
fy+0.253
 gloss+0.224
Neg logit contributions
 seeing-0.558
 discovering-0.534
 wondering-0.531
 observing-0.529
 cultures-0.525
Token always
Token ID1464
Feature activation-1.67e-03

Pos logit contributions
L+0.178
brush+0.175
tr+0.175
fy+0.158
 swap+0.154
Neg logit contributions
 seeing-0.246
 wondering-0.239
 observing-0.237
 wonders-0.234
 discovering-0.233
Token remember
Token ID3505
Feature activation+2.50e-02

Pos logit contributions
L+0.003
tr+0.003
fy+0.003
 tighter+0.003
brush+0.002
Neg logit contributions
 seeing-0.003
 wondering-0.003
 cultures-0.003
 wonders-0.003
 observing-0.003
Token to
Token ID284
Feature activation-5.13e-02

Pos logit contributions
 wonders+0.051
 cultures+0.049
 wondering+0.049
 observing+0.047
 seeing+0.047
Neg logit contributions
brush-0.028
tr-0.028
L-0.028
 juice-0.026
 tighter-0.025
Token her
Token ID607
Feature activation-2.67e-02

Pos logit contributions
tr+0.080
brush+0.080
L+0.078
fy+0.076
 tighter+0.074
Neg logit contributions
 seeing-0.113
 observing-0.109
 discovering-0.108
 wondering-0.106
 cultures-0.104
Token mom
Token ID1995
Feature activation-2.36e-01

Pos logit contributions
tr+0.035
brush+0.033
L+0.032
fy+0.032
 tighter+0.030
Neg logit contributions
 seeing-0.059
 discovering-0.055
 observing-0.055
 wondering-0.054
 cultures-0.052
Token said
Token ID531
Feature activation-2.94e-02

Pos logit contributions
tr+0.293
brush+0.279
L+0.275
fy+0.270
bo+0.257
Neg logit contributions
 seeing-0.508
 discovering-0.497
 observing-0.492
 cultures-0.491
 wondering-0.485
Token,
Token ID11
Feature activation-1.36e-01

Pos logit contributions
tr+0.040
L+0.039
brush+0.038
fy+0.038
tered+0.036
Neg logit contributions
 seeing-0.061
 wondering-0.058
 discovering-0.057
 observing-0.057
 wonders-0.057
Token "
Token ID366
Feature activation+2.17e-02

Pos logit contributions
tr+0.198
L+0.197
brush+0.190
tered+0.180
 gloss+0.178
Neg logit contributions
 seeing-0.258
 wonders-0.255
 cultures-0.254
 discovering-0.246
 observing-0.245
TokenL
Token ID43
Feature activation-4.95e-01

Pos logit contributions
 seeing+0.041
 observing+0.039
 discovering+0.038
 wondering+0.038
 wonders+0.038
Neg logit contributions
tr-0.032
fy-0.032
brush-0.031
tered-0.029
 tighter-0.029
Tokenily
Token ID813
Feature activation-2.41e-01

Pos logit contributions
tr+0.552
L+0.518
fy+0.518
brush+0.516
 tighter+0.473
Neg logit contributions
 seeing-1.170
 wondering-1.099
 observing-1.098
 discovering-1.095
 wonders-1.070
Token,
Token ID11
Feature activation-1.24e-01

Pos logit contributions
tr+0.283
brush+0.276
L+0.276
fy+0.271
 gloss+0.240
Neg logit contributions
 seeing-0.549
 discovering-0.521
 wondering-0.520
 observing-0.517
 wonders-0.509
Token it
Token ID340
Feature activation+8.94e-03

Pos logit contributions
L+0.186
tr+0.185
brush+0.183
fy+0.173
 gloss+0.165
Neg logit contributions
 seeing-0.240
 wondering-0.228
 observing-0.227
 discovering-0.224
 wonders-0.223
Token's
Token ID338
Feature activation-2.49e-01

Pos logit contributions
 seeing+0.020
 wondering+0.019
 observing+0.019
 discovering+0.019
 wonders+0.018
Neg logit contributions
tr-0.011
L-0.011
brush-0.011
fy-0.010
 tighter-0.010
Token important
Token ID1593
Feature activation-2.12e-03

Pos logit contributions
tr+0.342
L+0.321
brush+0.317
fy+0.314
 tighter+0.290
Neg logit contributions
 seeing-0.532
 wondering-0.499
 discovering-0.498
 observing-0.497
 cultures-0.489
Token clouds
Token ID15114
Feature activation+7.16e-02

Pos logit contributions
L+0.046
tr+0.046
brush+0.044
 tighter+0.043
fy+0.043
Neg logit contributions
 seeing-0.061
 observing-0.057
 wondering-0.057
 discovering-0.057
 wonders-0.055
Token and
Token ID290
Feature activation-2.46e-02

Pos logit contributions
 cultures+0.154
 wonders+0.154
 discovering+0.152
 seeing+0.152
 wondering+0.151
Neg logit contributions
tr-0.084
brush-0.078
tered-0.075
fy-0.073
bo-0.069
Token friends
Token ID2460
Feature activation-1.54e-02

Pos logit contributions
tr+0.037
brush+0.035
L+0.034
 tighter+0.034
 juice+0.034
Neg logit contributions
 seeing-0.041
 wondering-0.041
 observing-0.040
 discovering-0.040
ules-0.039
Token.
Token ID13
Feature activation-7.85e-02

Pos logit contributions
brush+0.016
tr+0.015
L+0.015
fy+0.014
 swapped+0.014
Neg logit contributions
 seeing-0.033
 wonders-0.033
 observing-0.032
 ey-0.032
 wondering-0.032
Token<|endoftext|>
Token ID50256
Feature activation-6.25e-02

Pos logit contributions
L+0.100
tr+0.096
brush+0.089
fy+0.088
Fl+0.084
Neg logit contributions
 seeing-0.174
 wondering-0.161
 discovering-0.161
 observing-0.160
 cultures-0.157
TokenL
Token ID43
Feature activation-4.94e-01

Pos logit contributions
L+0.092
tr+0.083
bo+0.077
Fl+0.077
Max+0.076
Neg logit contributions
 seeing-0.132
 wondering-0.128
 cultures-0.127
 discovering-0.125
 observing-0.122
Tokenily
Token ID813
Feature activation-2.42e-01

Pos logit contributions
L+0.554
tr+0.522
brush+0.505
fy+0.492
 tighter+0.452
Neg logit contributions
 seeing-1.168
 wondering-1.113
 observing-1.108
 discovering-1.098
 wonders-1.072
Token and
Token ID290
Feature activation-5.44e-03

Pos logit contributions
L+0.249
tr+0.224
bo+0.216
brush+0.198
ny+0.194
Neg logit contributions
 cultures-0.570
 discovering-0.556
 seeing-0.551
 observing-0.545
 wondering-0.544
Token Ben
Token ID3932
Feature activation-8.87e-02

Pos logit contributions
L+0.007
tr+0.007
brush+0.007
fy+0.006
tered+0.006
Neg logit contributions
 seeing-0.011
 discovering-0.011
 wondering-0.011
 observing-0.011
 wonders-0.011
Token were
Token ID547
Feature activation-3.66e-02

Pos logit contributions
L+0.104
 swapped+0.090
bo+0.087
tr+0.086
 swap+0.085
Neg logit contributions
 discovering-0.203
 seeing-0.203
 observing-0.200
 cultures-0.199
 wondering-0.199
Token playing
Token ID2712
Feature activation+1.91e-01

Pos logit contributions
tr+0.067
L+0.066
fy+0.064
brush+0.064
 tighter+0.061
Neg logit contributions
 seeing-0.058
 cultures-0.053
 discovering-0.051
 wonders-0.051
 wondering-0.051
Token the
Token ID262
Feature activation+4.42e-03

Pos logit contributions
 seeing+0.337
 wondering+0.327
 observing+0.325
 cultures+0.322
 discovering+0.319
Neg logit contributions
tr-0.195
L-0.192
brush-0.186
 tighter-0.183
fy-0.176
Token good
Token ID922
Feature activation-1.91e-01

Pos logit contributions
 seeing+0.008
 observing+0.007
 discovering+0.007
 wondering+0.007
 cultures+0.007
Neg logit contributions
tr-0.007
L-0.007
fy-0.007
brush-0.007
 tighter-0.007
Token times
Token ID1661
Feature activation+6.63e-02

Pos logit contributions
L+0.275
fy+0.274
tr+0.266
brush+0.266
 tighter+0.253
Neg logit contributions
 seeing-0.372
 observing-0.351
 wondering-0.349
 discovering-0.345
 wonders-0.329
Token to
Token ID284
Feature activation-2.24e-02

Pos logit contributions
 seeing+0.139
 observing+0.132
 wondering+0.131
 discovering+0.130
 wonders+0.129
Neg logit contributions
brush-0.087
tr-0.087
fy-0.085
L-0.084
 tighter-0.081
Token keep
Token ID1394
Feature activation-1.04e-01

Pos logit contributions
L+0.027
tr+0.027
brush+0.026
fy+0.025
tered+0.024
Neg logit contributions
 seeing-0.049
 wondering-0.048
 observing-0.047
 discovering-0.047
 wonders-0.046
Token going
Token ID1016
Feature activation+3.21e-01

Pos logit contributions
 tighter+0.127
 tightly+0.121
 juice+0.120
 off+0.117
L+0.117
Neg logit contributions
 cultures-0.204
 seeing-0.196
 wonders-0.191
 observing-0.183
 discovering-0.182
Token.
Token ID13
Feature activation-2.95e-02

Pos logit contributions
 seeing+0.661
 observing+0.621
 wondering+0.618
 discovering+0.614
 wonders+0.608
Neg logit contributions
tr-0.437
L-0.433
fy-0.428
brush-0.426
 tighter-0.404
Token He
Token ID679
Feature activation-5.15e-02

Pos logit contributions
L+0.038
tr+0.038
brush+0.036
fy+0.036
 tighter+0.033
Neg logit contributions
 seeing-0.065
 observing-0.061
 discovering-0.060
 wondering-0.060
 cultures-0.059
Token had
Token ID550
Feature activation-9.63e-02

Pos logit contributions
tr+0.082
L+0.080
brush+0.079
fy+0.078
 gloss+0.073
Neg logit contributions
 seeing-0.097
 wondering-0.091
 observing-0.090
 discovering-0.089
 wonders-0.088
Token to
Token ID284
Feature activation-1.77e-02

Pos logit contributions
L+0.115
brush+0.113
tr+0.111
 tighter+0.110
 swapped+0.110
Neg logit contributions
 seeing-0.205
 observing-0.199
 cultures-0.197
 discovering-0.194
 wondering-0.193
Token find
Token ID1064
Feature activation+1.24e-01

Pos logit contributions
tr+0.024
L+0.024
brush+0.023
fy+0.023
 tighter+0.022
Neg logit contributions
 seeing-0.037
 wondering-0.035
 observing-0.035
 discovering-0.034
 wonders-0.034
Token to
Token ID284
Feature activation+2.47e-02

Pos logit contributions
 seeing+0.172
 wondering+0.150
 wonders+0.148
 discovering+0.147
 observing+0.144
Neg logit contributions
tr-0.214
brush-0.207
fy-0.200
 gloss-0.199
kin-0.193
Token the
Token ID262
Feature activation+5.80e-02

Pos logit contributions
 seeing+0.056
 wondering+0.053
 observing+0.053
 discovering+0.053
 wonders+0.051
Neg logit contributions
tr-0.030
L-0.029
brush-0.028
fy-0.028
 tighter-0.026
Token kitchen
Token ID9592
Feature activation+1.47e-01

Pos logit contributions
 seeing+0.093
 wondering+0.085
 observing+0.084
 discovering+0.084
 wonders+0.082
Neg logit contributions
tr-0.108
L-0.107
fy-0.106
brush-0.105
tered-0.100
Token.
Token ID13
Feature activation+1.16e-02

Pos logit contributions
 seeing+0.286
 observing+0.274
 discovering+0.273
 wondering+0.270
 cultures+0.267
Neg logit contributions
tr-0.213
L-0.208
brush-0.205
fy-0.196
 tighter-0.191
Token He
Token ID679
Feature activation-6.95e-03

Pos logit contributions
 seeing+0.023
 wondering+0.021
 observing+0.021
 wonders+0.021
 discovering+0.021
Neg logit contributions
tr-0.018
fy-0.017
brush-0.017
L-0.017
 tighter-0.016
Token sees
Token ID7224
Feature activation+3.46e-01

Pos logit contributions
tr+0.012
 swapped+0.011
fy+0.011
L+0.011
 swap+0.011
Neg logit contributions
 seeing-0.011
 discovering-0.011
 observing-0.011
 cultures-0.011
 believing-0.011
Token the
Token ID262
Feature activation+6.15e-02

Pos logit contributions
 seeing+0.780
 wondering+0.731
 observing+0.725
 discovering+0.719
 wonders+0.714
Neg logit contributions
tr-0.431
fy-0.416
L-0.408
brush-0.406
 tighter-0.369
Token cake
Token ID12187
Feature activation-8.22e-02

Pos logit contributions
 seeing+0.155
 discovering+0.152
 observing+0.147
 wondering+0.146
 cultures+0.145
Neg logit contributions
tr-0.063
brush-0.058
 juice-0.055
 cookie-0.054
L-0.052
Token on
Token ID319
Feature activation-6.08e-02

Pos logit contributions
tr+0.112
brush+0.110
L+0.105
fy+0.100
 tighter+0.096
Neg logit contributions
 seeing-0.170
 wondering-0.165
 observing-0.165
 discovering-0.165
 cultures-0.162
Token the
Token ID262
Feature activation+6.12e-02

Pos logit contributions
fy+0.129
tr+0.126
brush+0.118
 gloss+0.117
tted+0.117
Neg logit contributions
 seeing-0.091
 wonders-0.079
 wondering-0.078
 observing-0.076
 discovering-0.076
Token table
Token ID3084
Feature activation+3.47e-02

Pos logit contributions
 seeing+0.107
 wondering+0.098
 observing+0.097
 discovering+0.097
 wonders+0.095
Neg logit contributions
L-0.103
tr-0.103
fy-0.102
brush-0.101
 gloss-0.097
Token ran
Token ID4966
Feature activation-2.71e-02

Pos logit contributions
 seeing+0.193
 observing+0.182
 wondering+0.181
 discovering+0.180
 wonders+0.175
Neg logit contributions
tr-0.136
L-0.135
fy-0.133
brush-0.130
tered-0.122
Token straight
Token ID3892
Feature activation-6.36e-02

Pos logit contributions
tr+0.052
brush+0.049
L+0.048
fy+0.048
paste+0.047
Neg logit contributions
 seeing-0.047
 discovering-0.043
 wondering-0.042
 observing-0.042
 wonders-0.039
Token outside
Token ID2354
Feature activation+1.10e-01

Pos logit contributions
L+0.075
tr+0.075
fy+0.075
 tighter+0.073
brush+0.073
Neg logit contributions
 seeing-0.136
 wondering-0.128
 cultures-0.128
 observing-0.128
 discovering-0.125
Token and
Token ID290
Feature activation+8.69e-02

Pos logit contributions
 cultures+0.233
 seeing+0.228
 wonders+0.222
 wondering+0.216
 observing+0.216
Neg logit contributions
L-0.130
tr-0.119
 tighter-0.117
brush-0.117
fy-0.113
Token began
Token ID2540
Feature activation-1.03e-01

Pos logit contributions
 seeing+0.185
 wondering+0.174
 observing+0.174
 discovering+0.172
 cultures+0.171
Neg logit contributions
tr-0.111
L-0.110
fy-0.109
brush-0.105
 tighter-0.100
Token exploring
Token ID13504
Feature activation+4.72e-01

Pos logit contributions
L+0.118
tr+0.111
 tighter+0.106
brush+0.105
 gloss+0.105
Neg logit contributions
 cultures-0.217
 wonders-0.211
 seeing-0.206
 wondering-0.201
 observing-0.194
Token.
Token ID13
Feature activation+3.61e-02

Pos logit contributions
 seeing+1.054
 cultures+1.018
 observing+1.013
 wondering+1.013
ules+0.982
Neg logit contributions
L-0.512
 tighter-0.476
 tightly-0.415
brush-0.386
tered-0.386
Token
Token ID198
Feature activation+1.10e-01

Pos logit contributions
 seeing+0.084
 cultures+0.081
eling+0.080
 believing+0.079
 whales+0.077
Neg logit contributions
L-0.042
tr-0.040
fy-0.034
 Lily-0.032
 gloss-0.032
Token
Token ID198
Feature activation+1.11e-01

Pos logit contributions
 seeing+0.253
 cultures+0.251
 wondering+0.241
 observing+0.238
 discovering+0.238
Neg logit contributions
L-0.149
Max-0.113
Fl-0.113
tr-0.112
Mom-0.109
TokenShe
Token ID3347
Feature activation+8.77e-02

Pos logit contributions
 seeing+0.275
 wondering+0.263
 observing+0.261
 discovering+0.257
 cultures+0.255
Neg logit contributions
L-0.124
tr-0.102
brush-0.095
Fl-0.095
fy-0.093
Token noticed
Token ID6810
Feature activation+2.25e-01

Pos logit contributions
 seeing+0.174
 wondering+0.163
 observing+0.162
 discovering+0.161
 wonders+0.156
Neg logit contributions
tr-0.129
L-0.128
fy-0.124
brush-0.124
 tighter-0.115
Token a
Token ID257
Feature activation-5.15e-02

Pos logit contributions
 seeing+0.136
 wondering+0.130
 wonders+0.128
 observing+0.127
 discovering+0.125
Neg logit contributions
fy-0.135
tr-0.128
brush-0.127
L-0.126
 gloss-0.120
Token slide
Token ID10649
Feature activation+4.42e-02

Pos logit contributions
tr+0.076
L+0.075
fy+0.074
brush+0.074
tered+0.068
Neg logit contributions
 seeing-0.102
 wondering-0.095
 observing-0.094
 discovering-0.094
 wonders-0.092
Token and
Token ID290
Feature activation+8.43e-02

Pos logit contributions
 seeing+0.070
 wondering+0.062
 discovering+0.061
 observing+0.061
 wonders+0.059
Neg logit contributions
tr-0.082
L-0.082
fy-0.082
brush-0.079
 gloss-0.078
Token had
Token ID550
Feature activation-3.73e-02

Pos logit contributions
 seeing+0.157
 discovering+0.140
 wondering+0.138
 observing+0.135
 wonders+0.132
Neg logit contributions
tr-0.144
brush-0.135
fy-0.135
ainer-0.132
inky-0.132
Token fun
Token ID1257
Feature activation+4.99e-02

Pos logit contributions
tr+0.051
L+0.050
brush+0.049
fy+0.048
 tighter+0.045
Neg logit contributions
 seeing-0.079
 wondering-0.074
 observing-0.074
 discovering-0.073
 wonders-0.072
Token going
Token ID1016
Feature activation+3.83e-01

Pos logit contributions
 cultures+0.086
 wonders+0.083
 seeing+0.082
 wondering+0.080
 culture+0.076
Neg logit contributions
L-0.077
brush-0.075
tr-0.073
 gloss-0.071
 juice-0.070
Token down
Token ID866
Feature activation+4.12e-02

Pos logit contributions
 seeing+0.789
 wondering+0.738
 observing+0.733
 discovering+0.730
 wonders+0.715
Neg logit contributions
tr-0.544
L-0.530
brush-0.520
fy-0.519
 tighter-0.481
Token it
Token ID340
Feature activation+1.07e-01

Pos logit contributions
 seeing+0.080
 wondering+0.077
 observing+0.077
 cultures+0.075
 discovering+0.075
Neg logit contributions
L-0.059
tr-0.056
brush-0.056
fy-0.054
 tighter-0.053
Token together
Token ID1978
Feature activation-1.41e-01

Pos logit contributions
 seeing+0.210
 wondering+0.196
 observing+0.195
 discovering+0.194
 wonders+0.189
Neg logit contributions
tr-0.162
L-0.159
brush-0.156
fy-0.155
 tighter-0.145
Token.
Token ID13
Feature activation+2.82e-02

Pos logit contributions
L+0.171
tr+0.169
brush+0.167
fy+0.163
tered+0.158
Neg logit contributions
 seeing-0.301
 observing-0.287
 wondering-0.287
 cultures-0.282
 wonders-0.281
Token Lily
Token ID20037
Feature activation-2.61e-01

Pos logit contributions
 seeing+0.052
 wondering+0.048
 wonders+0.047
 discovering+0.047
 observing+0.046
Neg logit contributions
brush-0.047
kin-0.043
paste-0.043
fy-0.043
tr-0.042
Token
Token ID198
Feature activation+1.04e-01

Pos logit contributions
 seeing+0.061
 wondering+0.056
 observing+0.056
 discovering+0.056
 wonders+0.055
Neg logit contributions
tr-0.039
L-0.039
fy-0.037
brush-0.037
tered-0.035
Token
Token ID198
Feature activation+1.04e-01

Pos logit contributions
 seeing+0.211
 wondering+0.198
 cultures+0.197
 observing+0.196
 discovering+0.195
Neg logit contributions
L-0.158
tr-0.143
brush-0.139
fy-0.138
tted-0.134
TokenDa
Token ID26531
Feature activation-1.57e-01

Pos logit contributions
 seeing+0.233
 discovering+0.220
 wonders+0.219
 wondering+0.219
 observing+0.217
Neg logit contributions
tr-0.135
brush-0.121
fy-0.116
paste-0.112
 gloss-0.111
Tokenisy
Token ID13560
Feature activation-8.40e-02

Pos logit contributions
tr+0.240
brush+0.236
L+0.220
 swapped+0.214
fy+0.213
Neg logit contributions
 seeing-0.308
 discovering-0.287
 wondering-0.285
 wonders-0.284
 observing-0.280
Token became
Token ID2627
Feature activation-1.38e-01

Pos logit contributions
L+0.112
tr+0.108
brush+0.104
fy+0.103
bo+0.099
Neg logit contributions
 seeing-0.177
 observing-0.168
 discovering-0.168
 cultures-0.166
 wondering-0.164
Token fascinated
Token ID30103
Feature activation+3.60e-01

Pos logit contributions
L+0.197
tr+0.193
brush+0.188
 tighter+0.184
fy+0.183
Neg logit contributions
 seeing-0.276
 discovering-0.253
 observing-0.252
 wondering-0.251
 cultures-0.249
Token with
Token ID351
Feature activation-9.19e-02

Pos logit contributions
 seeing+0.745
 observing+0.715
 discovering+0.706
 wondering+0.697
 cultures+0.674
Neg logit contributions
L-0.497
fy-0.458
tr-0.448
 tighter-0.442
brush-0.440
Token the
Token ID262
Feature activation+6.81e-02

Pos logit contributions
 tighter+0.090
brush+0.089
tered+0.088
 juice+0.087
L+0.087
Neg logit contributions
 seeing-0.191
 cultures-0.188
 observing-0.181
 discovering-0.178
 wondering-0.178
Token steam
Token ID13324
Feature activation+1.12e-01

Pos logit contributions
 seeing+0.137
 cultures+0.130
 discovering+0.130
 observing+0.126
 wondering+0.125
Neg logit contributions
tr-0.096
fy-0.080
 juice-0.079
L-0.079
brush-0.078
Token and
Token ID290
Feature activation+8.30e-02

Pos logit contributions
 seeing+0.221
 wondering+0.211
 discovering+0.211
 observing+0.210
 wonders+0.208
Neg logit contributions
tr-0.171
brush-0.158
tered-0.142
paste-0.141
fy-0.141
Token she
Token ID673
Feature activation+9.40e-02

Pos logit contributions
 cultures+0.146
 observing+0.145
 seeing+0.144
 discovering+0.140
 wondering+0.139
Neg logit contributions
L-0.103
 tighter-0.100
tr-0.097
brush-0.096
 swapped-0.096
Token They
Token ID1119
Feature activation-3.42e-02

Pos logit contributions
 seeing+0.043
 wondering+0.040
 observing+0.040
 discovering+0.040
 wonders+0.039
Neg logit contributions
tr-0.024
L-0.024
fy-0.022
brush-0.022
tered-0.021
Token can
Token ID460
Feature activation+4.10e-02

Pos logit contributions
brush+0.049
tr+0.048
fy+0.047
L+0.047
 swapped+0.046
Neg logit contributions
 seeing-0.069
 observing-0.067
 wondering-0.066
 discovering-0.066
 cultures-0.062
Token't
Token ID470
Feature activation-1.26e-02

Pos logit contributions
 seeing+0.081
 wondering+0.075
 observing+0.075
 discovering+0.074
 wonders+0.073
Neg logit contributions
tr-0.061
L-0.060
fy-0.059
brush-0.057
 tighter-0.055
Token wait
Token ID4043
Feature activation-1.35e-01

Pos logit contributions
tr+0.017
fy+0.016
L+0.016
brush+0.016
 gloss+0.015
Neg logit contributions
 seeing-0.027
 wondering-0.024
 wonders-0.024
 observing-0.024
 discovering-0.024
Token to
Token ID284
Feature activation+3.19e-02

Pos logit contributions
fy+0.147
L+0.145
brush+0.139
tr+0.138
tered+0.138
Neg logit contributions
 cultures-0.286
 wonders-0.284
 seeing-0.280
 observing-0.278
 discovering-0.275
Token show
Token ID905
Feature activation+8.47e-02

Pos logit contributions
 seeing+0.058
 wondering+0.053
 observing+0.053
 discovering+0.053
 wonders+0.052
Neg logit contributions
tr-0.053
L-0.052
fy-0.051
brush-0.051
 tighter-0.048
Token it
Token ID340
Feature activation+8.93e-02

Pos logit contributions
 seeing+0.137
 wondering+0.124
 wonders+0.121
 observing+0.119
 discovering+0.118
Neg logit contributions
tr-0.164
fy-0.158
L-0.158
brush-0.154
tted-0.153
Token to
Token ID284
Feature activation+3.24e-02

Pos logit contributions
 seeing+0.193
 wondering+0.187
 observing+0.186
 discovering+0.183
 wonders+0.178
Neg logit contributions
tr-0.113
brush-0.107
L-0.106
fy-0.104
 tighter-0.100
Token Mom
Token ID11254
Feature activation-1.58e-01

Pos logit contributions
 seeing+0.081
 wondering+0.078
 discovering+0.078
 observing+0.078
 wonders+0.076
Neg logit contributions
L-0.030
tr-0.028
brush-0.026
fy-0.025
tered-0.025
Token and
Token ID290
Feature activation+7.87e-02

Pos logit contributions
tr+0.219
L+0.214
brush+0.214
fy+0.205
 tighter+0.196
Neg logit contributions
 seeing-0.317
 wondering-0.306
 observing-0.303
 discovering-0.301
 wonders-0.300
Token share
Token ID2648
Feature activation-1.13e-01

Pos logit contributions
 seeing+0.149
 wondering+0.142
 discovering+0.141
 observing+0.141
 wonders+0.137
Neg logit contributions
tr-0.120
L-0.117
brush-0.115
fy-0.113
 tighter-0.109
Token felt
Token ID2936
Feature activation+3.76e-02

Pos logit contributions
L+0.199
tr+0.198
brush+0.191
fy+0.191
 tighter+0.180
Neg logit contributions
 seeing-0.289
 observing-0.271
 wondering-0.270
 discovering-0.269
 wonders-0.261
Token safe
Token ID3338
Feature activation-6.86e-02

Pos logit contributions
 seeing+0.078
 observing+0.073
 discovering+0.073
 wondering+0.073
 cultures+0.072
Neg logit contributions
L-0.050
fy-0.049
brush-0.049
tr-0.049
 tighter-0.046
Token and
Token ID290
Feature activation+8.23e-02

Pos logit contributions
tr+0.088
brush+0.087
 tighter+0.085
L+0.083
fy+0.083
Neg logit contributions
 seeing-0.136
 cultures-0.134
 wonders-0.132
 observing-0.130
 wondering-0.128
Token proud
Token ID6613
Feature activation+1.11e-01

Pos logit contributions
 seeing+0.162
 observing+0.150
 wondering+0.150
 discovering+0.149
 wonders+0.145
Neg logit contributions
L-0.119
tr-0.118
fy-0.117
brush-0.115
 tighter-0.111
Token.
Token ID13
Feature activation+2.56e-02

Pos logit contributions
 seeing+0.253
 observing+0.243
 wondering+0.241
 cultures+0.240
 discovering+0.239
Neg logit contributions
L-0.117
fy-0.113
brush-0.112
tr-0.111
 tighter-0.104
Token She
Token ID1375
Feature activation+7.52e-02

Pos logit contributions
 seeing+0.048
 wondering+0.044
 observing+0.044
 discovering+0.044
 wonders+0.043
Neg logit contributions
tr-0.041
L-0.040
brush-0.039
fy-0.039
 gloss-0.036
Token had
Token ID550
Feature activation-3.78e-02

Pos logit contributions
 seeing+0.154
 observing+0.148
 wondering+0.144
 discovering+0.144
 cultures+0.144
Neg logit contributions
L-0.103
tr-0.101
fy-0.099
brush-0.098
 tighter-0.093
Token achieved
Token ID8793
Feature activation+1.51e-01

Pos logit contributions
L+0.051
brush+0.048
tr+0.048
fy+0.047
 tighter+0.046
Neg logit contributions
 seeing-0.077
 observing-0.074
 cultures-0.073
 wondering-0.073
 discovering-0.071
Token her
Token ID607
Feature activation+1.01e-01

Pos logit contributions
 seeing+0.306
 cultures+0.291
 observing+0.290
 wondering+0.287
 discovering+0.284
Neg logit contributions
brush-0.193
L-0.192
tr-0.187
 tighter-0.183
fy-0.168
Token mission
Token ID4365
Feature activation+6.07e-02

Pos logit contributions
 seeing+0.176
 wondering+0.162
 observing+0.161
 discovering+0.160
 wonders+0.155
Neg logit contributions
tr-0.175
L-0.171
fy-0.168
brush-0.168
 tighter-0.159
Token!
Token ID0
Feature activation+1.13e-01

Pos logit contributions
 seeing+0.130
 observing+0.130
 cultures+0.126
 wondering+0.126
 whales+0.123
Neg logit contributions
brush-0.070
tr-0.066
 tighter-0.066
L-0.066
fy-0.063
Token excited
Token ID6568
Feature activation+1.00e-01

Pos logit contributions
L+0.073
tr+0.072
fy+0.072
brush+0.070
tered+0.067
Neg logit contributions
 seeing-0.080
 discovering-0.074
 wondering-0.074
 observing-0.073
 cultures-0.071
Token to
Token ID284
Feature activation+3.32e-02

Pos logit contributions
 cultures+0.239
 wondering+0.228
 wonders+0.228
 observing+0.227
eling+0.225
Neg logit contributions
L-0.091
fy-0.079
 juice-0.077
 Lily-0.077
tr-0.074
Token have
Token ID423
Feature activation-3.57e-03

Pos logit contributions
 seeing+0.066
 wondering+0.063
 observing+0.062
 discovering+0.062
 wonders+0.061
Neg logit contributions
tr-0.047
L-0.046
brush-0.046
fy-0.045
tered-0.043
Token the
Token ID262
Feature activation+6.40e-02

Pos logit contributions
L+0.004
brush+0.004
fy+0.004
 juice+0.004
 swapped+0.004
Neg logit contributions
 seeing-0.008
 wondering-0.008
 cultures-0.008
 observing-0.008
 discovering-0.007
Token license
Token ID5964
Feature activation-1.60e-01

Pos logit contributions
 seeing+0.157
 wondering+0.148
 observing+0.148
 discovering+0.147
 wonders+0.144
Neg logit contributions
tr-0.066
L-0.063
brush-0.062
fy-0.061
 tighter-0.056
Token.
Token ID13
Feature activation+1.86e-02

Pos logit contributions
tr+0.216
brush+0.212
L+0.202
fy+0.199
 gloss+0.187
Neg logit contributions
 seeing-0.333
 wondering-0.318
 observing-0.318
 wonders-0.312
 discovering-0.311
Token She
Token ID1375
Feature activation+6.70e-02

Pos logit contributions
 seeing+0.037
 observing+0.034
 discovering+0.034
 wondering+0.034
 wonders+0.032
Neg logit contributions
tr-0.029
L-0.027
brush-0.027
fy-0.027
tered-0.025
Token went
Token ID1816
Feature activation-1.21e-01

Pos logit contributions
 seeing+0.140
 observing+0.132
 discovering+0.132
 wondering+0.131
 cultures+0.128
Neg logit contributions
tr-0.092
L-0.091
fy-0.089
brush-0.088
tered-0.082
Token outside
Token ID2354
Feature activation+9.88e-02

Pos logit contributions
tr+0.176
L+0.170
brush+0.169
fy+0.167
 tighter+0.159
Neg logit contributions
 seeing-0.238
 wondering-0.224
 wonders-0.222
 observing-0.221
 cultures-0.219
Token and
Token ID290
Feature activation+7.64e-02

Pos logit contributions
 cultures+0.223
 seeing+0.206
 wonders+0.204
 wondering+0.200
eling+0.199
Neg logit contributions
L-0.108
tr-0.099
 tighter-0.094
brush-0.091
fy-0.090
Token started
Token ID2067
Feature activation-1.23e-01

Pos logit contributions
 seeing+0.141
 wondering+0.131
 observing+0.130
 discovering+0.130
 cultures+0.129
Neg logit contributions
tr-0.122
L-0.119
fy-0.117
brush-0.116
 tighter-0.110
Token on
Token ID319
Feature activation-5.13e-02

Pos logit contributions
tr+0.085
fy+0.082
tted+0.075
brush+0.075
L+0.075
Neg logit contributions
 seeing-0.064
 wondering-0.060
 observing-0.058
 wonders-0.058
 discovering-0.058
Token her
Token ID607
Feature activation+8.99e-02

Pos logit contributions
tr+0.114
fy+0.110
brush+0.107
tted+0.107
L+0.105
Neg logit contributions
 seeing-0.070
 wondering-0.064
 discovering-0.062
 observing-0.061
 wonders-0.061
Token old
Token ID1468
Feature activation+1.75e-01

Pos logit contributions
 seeing+0.187
 wondering+0.176
 observing+0.174
 discovering+0.174
 wonders+0.171
Neg logit contributions
tr-0.126
L-0.120
brush-0.120
fy-0.119
tered-0.111
Token princess
Token ID21752
Feature activation-3.88e-02

Pos logit contributions
 seeing+0.399
 wondering+0.375
 observing+0.373
 discovering+0.372
 wonders+0.365
Neg logit contributions
tr-0.207
L-0.201
brush-0.196
fy-0.195
 tighter-0.178
Token dress
Token ID6576
Feature activation-1.18e-01

Pos logit contributions
tr+0.052
fy+0.050
L+0.050
tered+0.048
brush+0.047
Neg logit contributions
 seeing-0.082
 discovering-0.076
 observing-0.076
 wondering-0.076
 wonders-0.073
Token and
Token ID290
Feature activation+7.81e-02

Pos logit contributions
L+0.228
tr+0.227
fy+0.226
tted+0.216
Fl+0.214
Neg logit contributions
 seeing-0.174
 wondering-0.155
 discovering-0.151
 observing-0.149
 wonders-0.140
Token felt
Token ID2936
Feature activation+3.05e-02

Pos logit contributions
 seeing+0.149
 wondering+0.135
 discovering+0.132
 wonders+0.131
 observing+0.129
Neg logit contributions
tr-0.125
tted-0.123
fy-0.122
L-0.117
Fl-0.116
Token much
Token ID881
Feature activation-1.65e-01

Pos logit contributions
 seeing+0.057
 wondering+0.053
 discovering+0.052
 observing+0.052
 wonders+0.051
Neg logit contributions
tr-0.052
L-0.049
brush-0.048
fy-0.048
 gloss-0.045
Token better
Token ID1365
Feature activation-2.41e-01

Pos logit contributions
tr+0.214
Fl+0.177
L+0.177
Wh+0.172
tered+0.170
Neg logit contributions
 seeing-0.370
 wonders-0.368
 wondering-0.363
 observing-0.338
 discovering-0.335
Token.
Token ID13
Feature activation+2.39e-02

Pos logit contributions
L+0.331
tr+0.328
fy+0.322
brush+0.321
 tighter+0.304
Neg logit contributions
 seeing-0.484
 observing-0.456
 wondering-0.453
 discovering-0.452
 cultures-0.445
Token But
Token ID887
Feature activation-1.49e-02

Pos logit contributions
 seeing+0.043
 wondering+0.040
 observing+0.040
 discovering+0.040
 wonders+0.039
Neg logit contributions
tr-0.039
brush-0.038
fy-0.038
L-0.038
 tighter-0.036
Token let
Token ID1309
Feature activation-1.60e-01

Pos logit contributions
 seeing+0.171
 observing+0.164
 discovering+0.164
 wondering+0.163
 cultures+0.161
Neg logit contributions
tr-0.102
L-0.100
brush-0.099
fy-0.094
tered-0.091
Token it
Token ID340
Feature activation+9.97e-02

Pos logit contributions
fy+0.289
tr+0.287
L+0.281
brush+0.274
 tighter+0.265
Neg logit contributions
 seeing-0.268
 wondering-0.238
 wonders-0.236
 observing-0.235
 discovering-0.233
Token bake
Token ID28450
Feature activation+1.56e-02

Pos logit contributions
 seeing+0.207
 observing+0.188
 wondering+0.187
 discovering+0.186
 wonders+0.185
Neg logit contributions
L-0.136
tr-0.135
fy-0.135
brush-0.128
 tighter-0.125
Token.
Token ID13
Feature activation+2.86e-02

Pos logit contributions
 seeing+0.028
 wondering+0.024
 observing+0.024
 discovering+0.024
 wonders+0.024
Neg logit contributions
tr-0.027
fy-0.026
L-0.026
brush-0.025
 tighter-0.024
Token When
Token ID1649
Feature activation-8.35e-02

Pos logit contributions
 seeing+0.060
 believing+0.057
 discovering+0.057
 observing+0.057
 wondering+0.057
Neg logit contributions
tr-0.039
tered-0.034
L-0.034
brush-0.034
fy-0.032
Token it
Token ID340
Feature activation+9.98e-02

Pos logit contributions
L+0.125
tr+0.122
brush+0.121
fy+0.115
 tighter+0.113
Neg logit contributions
 seeing-0.152
 wondering-0.148
 discovering-0.146
 observing-0.145
 cultures-0.143
Token was
Token ID373
Feature activation+2.35e-02

Pos logit contributions
 seeing+0.190
 wondering+0.174
 observing+0.173
 discovering+0.171
 wonders+0.170
Neg logit contributions
tr-0.156
L-0.152
fy-0.152
brush-0.150
 tighter-0.143
Token done
Token ID1760
Feature activation+9.41e-02

Pos logit contributions
 seeing+0.048
 wondering+0.044
 observing+0.044
 discovering+0.044
 wonders+0.043
Neg logit contributions
tr-0.034
L-0.033
fy-0.033
brush-0.033
 tighter-0.030
Token she
Token ID673
Feature activation+9.44e-02

Pos logit contributions
 seeing+0.204
 wondering+0.194
 wonders+0.192
 discovering+0.192
 observing+0.192
Neg logit contributions
tr-0.118
L-0.116
brush-0.114
fy-0.110
tered-0.105
Token took
Token ID1718
Feature activation-7.42e-02

Pos logit contributions
 seeing+0.226
 discovering+0.219
 cultures+0.219
 observing+0.219
 wondering+0.216
Neg logit contributions
tr-0.098
fy-0.095
brush-0.095
L-0.094
 swapped-0.087
Token it
Token ID340
Feature activation+1.01e-01

Pos logit contributions
tr+0.131
fy+0.123
L+0.123
brush+0.118
 gloss+0.115
Neg logit contributions
 seeing-0.134
 wondering-0.121
 wonders-0.121
 observing-0.120
 discovering-0.119
Token full
Token ID1336
Feature activation-3.42e-02

Pos logit contributions
tr+0.085
L+0.083
fy+0.081
brush+0.081
tered+0.078
Neg logit contributions
 seeing-0.096
 wondering-0.089
 observing-0.088
 discovering-0.088
 wonders-0.086
Token tum
Token ID11814
Feature activation-1.83e-01

Pos logit contributions
brush+0.031
tr+0.031
fy+0.030
L+0.030
 tighter+0.027
Neg logit contributions
 seeing-0.086
 observing-0.082
 wondering-0.082
 discovering-0.082
 cultures-0.080
Tokenmy
Token ID1820
Feature activation-3.80e-02

Pos logit contributions
tr+0.252
L+0.245
fy+0.242
brush+0.238
tered+0.223
Neg logit contributions
 seeing-0.383
 wondering-0.357
 observing-0.356
 discovering-0.354
 wonders-0.347
Token.
Token ID13
Feature activation+2.46e-02

Pos logit contributions
tr+0.054
brush+0.053
fy+0.052
L+0.052
 tighter+0.049
Neg logit contributions
 seeing-0.077
 observing-0.072
 wondering-0.072
 discovering-0.072
 wonders-0.070
Token<|endoftext|>
Token ID50256
Feature activation+3.80e-02

Pos logit contributions
 seeing+0.046
 wondering+0.043
 observing+0.043
 discovering+0.042
 wonders+0.042
Neg logit contributions
tr-0.040
brush-0.039
L-0.038
fy-0.038
 tighter-0.036
TokenOnce
Token ID7454
Feature activation-1.11e-01

Pos logit contributions
 seeing+0.072
 wondering+0.067
 observing+0.066
 discovering+0.066
 cultures+0.065
Neg logit contributions
L-0.061
tr-0.060
brush-0.057
fy-0.056
tered-0.053
Token upon
Token ID2402
Feature activation+3.49e-01

Pos logit contributions
L+0.254
tr+0.241
fy+0.239
brush+0.236
 tighter+0.229
Neg logit contributions
 seeing-0.130
 discovering-0.117
 wondering-0.117
 observing-0.116
 cultures-0.116
Token a
Token ID257
Feature activation-5.07e-02

Pos logit contributions
 seeing+0.739
 wondering+0.685
 observing+0.684
 discovering+0.679
 wonders+0.668
Neg logit contributions
tr-0.479
fy-0.464
brush-0.460
L-0.449
 gloss-0.423
Token time
Token ID640
Feature activation+1.50e-02

Pos logit contributions
tr+0.081
brush+0.080
fy+0.077
L+0.076
 gloss+0.075
Neg logit contributions
 seeing-0.094
 wondering-0.087
 observing-0.087
 discovering-0.086
 wonders-0.085
Token,
Token ID11
Feature activation-2.62e-02

Pos logit contributions
 seeing+0.033
 wondering+0.031
 discovering+0.031
 observing+0.031
 wonders+0.031
Neg logit contributions
tr-0.019
L-0.019
brush-0.017
fy-0.017
tered-0.016
Token there
Token ID612
Feature activation+1.78e-01

Pos logit contributions
tr+0.056
brush+0.055
fy+0.055
L+0.054
 gloss+0.052
Neg logit contributions
 seeing-0.036
 wondering-0.032
 observing-0.032
 discovering-0.031
 wonders-0.030
Token Ben
Token ID3932
Feature activation-1.93e-02

Pos logit contributions
 seeing+0.026
 discovering+0.024
 observing+0.024
 wondering+0.024
 cultures+0.024
Neg logit contributions
L-0.020
tr-0.019
brush-0.019
fy-0.018
tered-0.017
Token and
Token ID290
Feature activation+6.41e-02

Pos logit contributions
L+0.030
tr+0.029
brush+0.028
fy+0.027
tered+0.026
Neg logit contributions
 seeing-0.038
 observing-0.036
 discovering-0.036
 cultures-0.036
 wondering-0.035
Token Lily
Token ID20037
Feature activation-2.85e-01

Pos logit contributions
 seeing+0.143
 wonders+0.136
 wondering+0.132
 discovering+0.126
 observing+0.125
Neg logit contributions
fy-0.083
brush-0.082
tr-0.082
tted-0.077
inky-0.069
Token cried
Token ID16896
Feature activation-2.23e-02

Pos logit contributions
tr+0.446
L+0.438
brush+0.430
fy+0.427
 tighter+0.400
Neg logit contributions
 seeing-0.543
 wondering-0.504
 observing-0.503
 discovering-0.502
 wonders-0.487
Token.
Token ID13
Feature activation+1.42e-02

Pos logit contributions
tr+0.047
fy+0.046
brush+0.046
tted+0.044
L+0.044
Neg logit contributions
 seeing-0.030
 wondering-0.029
 wonders-0.027
 observing-0.026
 discovering-0.025
Token They
Token ID1119
Feature activation-4.63e-02

Pos logit contributions
 seeing+0.029
 wondering+0.027
 discovering+0.027
 observing+0.027
 wonders+0.026
Neg logit contributions
L-0.020
tr-0.020
brush-0.019
fy-0.019
tered-0.018
Token were
Token ID547
Feature activation+3.41e-02

Pos logit contributions
tr+0.090
fy+0.088
brush+0.086
L+0.085
 gloss+0.084
Neg logit contributions
 seeing-0.071
 wondering-0.066
 wonders-0.065
 observing-0.060
 discovering-0.060
Token sorry
Token ID7926
Feature activation-5.26e-02

Pos logit contributions
 seeing+0.063
 wondering+0.059
 observing+0.058
 discovering+0.057
 wonders+0.056
Neg logit contributions
tr-0.057
L-0.056
fy-0.055
brush-0.055
 gloss-0.051
Token and
Token ID290
Feature activation+6.75e-02

Pos logit contributions
L+0.075
tr+0.074
fy+0.070
brush+0.070
 tighter+0.068
Neg logit contributions
 seeing-0.107
 observing-0.102
 discovering-0.101
 wondering-0.101
 cultures-0.098
Token they
Token ID484
Feature activation-7.36e-02

Pos logit contributions
 seeing+0.117
 wondering+0.108
 observing+0.106
 discovering+0.106
 wonders+0.104
Neg logit contributions
tr-0.120
L-0.116
brush-0.115
fy-0.114
 gloss-0.108
Token said
Token ID531
Feature activation+6.18e-02

Pos logit contributions
tr+0.130
brush+0.128
fy+0.127
L+0.127
 gloss+0.123
Neg logit contributions
 seeing-0.121
 wondering-0.114
 wonders-0.110
 observing-0.107
 discovering-0.105
Token and
Token ID290
Feature activation+1.40e-02

Pos logit contributions
L+0.204
tr+0.199
brush+0.192
fy+0.191
 tighter+0.184
Neg logit contributions
 seeing-0.258
 observing-0.242
 discovering-0.242
 wondering-0.238
 cultures-0.233
Token Alex
Token ID4422
Feature activation-6.44e-02

Pos logit contributions
 seeing+0.030
 wondering+0.029
 discovering+0.029
 observing+0.028
 wonders+0.028
Neg logit contributions
L-0.018
tr-0.017
brush-0.016
 gloss-0.015
 tighter-0.015
Token said
Token ID531
Feature activation+1.78e-02

Pos logit contributions
tr+0.099
L+0.098
fy+0.097
brush+0.096
tered+0.091
Neg logit contributions
 seeing-0.123
 observing-0.114
 wondering-0.113
 discovering-0.113
 wonders-0.109
Token no
Token ID645
Feature activation-2.61e-01

Pos logit contributions
 seeing+0.043
 wondering+0.040
 observing+0.040
 discovering+0.039
 wonders+0.037
Neg logit contributions
tr-0.021
L-0.020
fy-0.020
brush-0.019
bo-0.017
Token and
Token ID290
Feature activation+2.38e-02

Pos logit contributions
tr+0.432
brush+0.425
L+0.412
fy+0.407
tted+0.402
Neg logit contributions
 seeing-0.498
 wondering-0.461
 observing-0.453
 discovering-0.451
 wonders-0.445
Token thanked
Token ID26280
Feature activation-5.57e-02

Pos logit contributions
 seeing+0.047
 wondering+0.043
 discovering+0.042
 observing+0.042
 wonders+0.040
Neg logit contributions
tr-0.039
fy-0.037
brush-0.036
L-0.036
tted-0.035
Token her
Token ID607
Feature activation+3.97e-02

Pos logit contributions
tr+0.094
L+0.091
fy+0.091
brush+0.090
 tighter+0.084
Neg logit contributions
 seeing-0.101
 wondering-0.093
 observing-0.092
 discovering-0.092
 wonders-0.090
Token.
Token ID13
Feature activation-2.94e-02

Pos logit contributions
 seeing+0.075
 wondering+0.069
 observing+0.069
 discovering+0.069
 wonders+0.067
Neg logit contributions
tr-0.064
L-0.063
fy-0.062
brush-0.061
 gloss-0.057
Token
Token ID198
Feature activation+4.75e-02

Pos logit contributions
tr+0.041
L+0.040
brush+0.039
fy+0.039
 tighter+0.036
Neg logit contributions
 seeing-0.061
 wondering-0.057
 observing-0.057
 discovering-0.057
 wonders-0.055
Token
Token ID198
Feature activation+5.06e-02

Pos logit contributions
 seeing+0.095
 wondering+0.089
 observing+0.088
 discovering+0.088
 cultures+0.086
Neg logit contributions
L-0.076
tr-0.069
brush-0.065
Fl-0.064
fy-0.064
TokenThe
Token ID464
Feature activation-1.77e-01

Pos logit contributions
 seeing+0.107
 discovering+0.101
 observing+0.100
 wonders+0.100
 wondering+0.099
Neg logit contributions
tr-0.069
fy-0.066
brush-0.064
 tighter-0.058
 gloss-0.058
Token and
Token ID290
Feature activation+8.13e-02

Pos logit contributions
 seeing+0.555
 observing+0.542
 discovering+0.526
 cultures+0.525
 wondering+0.522
Neg logit contributions
tr-0.225
brush-0.222
fy-0.209
 juice-0.195
tered-0.190
Token it
Token ID340
Feature activation+1.01e-01

Pos logit contributions
 seeing+0.163
 observing+0.159
 discovering+0.156
 cultures+0.155
 believing+0.150
Neg logit contributions
L-0.101
tr-0.099
fy-0.097
 tighter-0.097
brush-0.095
Token was
Token ID373
Feature activation+2.40e-02

Pos logit contributions
 seeing+0.232
 observing+0.223
 discovering+0.220
 wondering+0.217
 cultures+0.216
Neg logit contributions
tr-0.121
L-0.116
brush-0.111
fy-0.108
tered-0.105
Token available
Token ID1695
Feature activation+1.19e-01

Pos logit contributions
 cultures+0.050
 seeing+0.048
 observing+0.047
 discovering+0.047
 wonders+0.045
Neg logit contributions
tr-0.030
brush-0.027
 tighter-0.027
L-0.026
 juice-0.026
Token.
Token ID13
Feature activation+3.23e-02

Pos logit contributions
 seeing+0.253
 observing+0.246
 wonders+0.244
 wondering+0.244
 cultures+0.244
Neg logit contributions
tr-0.143
fy-0.133
brush-0.132
L-0.130
 tighter-0.128
Token He
Token ID679
Feature activation+3.37e-03

Pos logit contributions
 seeing+0.066
 wondering+0.062
 observing+0.062
 discovering+0.061
 wonders+0.060
Neg logit contributions
tr-0.046
L-0.045
brush-0.045
fy-0.045
 tighter-0.041
Token picked
Token ID6497
Feature activation+4.53e-02

Pos logit contributions
 seeing+0.006
 observing+0.005
 wondering+0.005
 discovering+0.005
 wonders+0.005
Neg logit contributions
tr-0.006
L-0.006
fy-0.006
brush-0.006
 tighter-0.006
Token it
Token ID340
Feature activation+1.03e-01

Pos logit contributions
 seeing+0.094
 cultures+0.089
 observing+0.088
 discovering+0.087
 wondering+0.086
Neg logit contributions
L-0.058
tr-0.057
 tighter-0.057
brush-0.057
tered-0.055
Token up
Token ID510
Feature activation+5.47e-02

Pos logit contributions
 seeing+0.180
 wondering+0.167
 observing+0.165
 discovering+0.165
 wonders+0.161
Neg logit contributions
tr-0.177
L-0.174
brush-0.171
fy-0.171
 gloss-0.160
Token and
Token ID290
Feature activation+8.40e-02

Pos logit contributions
 seeing+0.094
 wondering+0.087
 observing+0.086
 discovering+0.086
 wonders+0.083
Neg logit contributions
tr-0.097
L-0.095
brush-0.093
fy-0.093
 gloss-0.088
Token soon
Token ID2582
Feature activation-1.37e-01

Pos logit contributions
 seeing+0.175
 observing+0.170
 discovering+0.168
 cultures+0.168
 wondering+0.165
Neg logit contributions
L-0.104
tr-0.104
brush-0.100
 tighter-0.097
 swapped-0.097
Token the
Token ID262
Feature activation-2.10e-02

Pos logit contributions
L+0.275
brush+0.274
 tighter+0.259
tr+0.259
 juice+0.258
Neg logit contributions
 seeing-0.449
 observing-0.426
 wondering-0.424
 discovering-0.421
 cultures-0.418
Token spot
Token ID4136
Feature activation-7.62e-02

Pos logit contributions
tr+0.033
brush+0.032
L+0.031
fy+0.030
 tighter+0.029
Neg logit contributions
 seeing-0.041
 observing-0.038
 discovering-0.038
 wondering-0.038
 cultures-0.036
Token,
Token ID11
Feature activation-1.26e-01

Pos logit contributions
brush+0.101
tr+0.101
fy+0.097
L+0.096
 tighter+0.091
Neg logit contributions
 seeing-0.160
 observing-0.152
 wondering-0.152
 discovering-0.149
 wonders-0.149
Token but
Token ID475
Feature activation+1.64e-02

Pos logit contributions
brush+0.177
tr+0.177
 tighter+0.170
L+0.170
fy+0.165
Neg logit contributions
 seeing-0.240
 cultures-0.230
 observing-0.228
 wonders-0.226
 discovering-0.222
Token it
Token ID340
Feature activation+1.66e-02

Pos logit contributions
 seeing+0.033
 wondering+0.031
 observing+0.031
 discovering+0.030
 wonders+0.030
Neg logit contributions
tr-0.024
L-0.023
brush-0.023
fy-0.023
 tighter-0.021
Token would
Token ID561
Feature activation-8.08e-02

Pos logit contributions
 seeing+0.033
 wondering+0.031
 observing+0.031
 discovering+0.031
 wonders+0.030
Neg logit contributions
tr-0.024
L-0.024
brush-0.023
fy-0.023
 tighter-0.022
Token always
Token ID1464
Feature activation+4.33e-03

Pos logit contributions
tr+0.099
brush+0.097
L+0.096
fy+0.091
 tighter+0.087
Neg logit contributions
 seeing-0.177
 observing-0.172
 wondering-0.170
 discovering-0.169
 cultures-0.165
Token come
Token ID1282
Feature activation-4.91e-02

Pos logit contributions
 seeing+0.010
 observing+0.010
 discovering+0.010
 wondering+0.009
 cultures+0.009
Neg logit contributions
tr-0.005
brush-0.004
 tighter-0.004
fy-0.004
 swap-0.004
Token back
Token ID736
Feature activation-2.95e-01

Pos logit contributions
tr+0.063
L+0.060
brush+0.059
fy+0.059
 tighter+0.056
Neg logit contributions
 seeing-0.108
 wondering-0.101
 observing-0.101
 discovering-0.100
 wonders-0.098
Token.
Token ID13
Feature activation-6.01e-02

Pos logit contributions
tr+0.330
 tighter+0.320
brush+0.315
fy+0.314
L+0.308
Neg logit contributions
 seeing-0.642
 wondering-0.616
 observing-0.613
 discovering-0.605
 cultures-0.597
Token
Token ID198
Feature activation+2.01e-02

Pos logit contributions
tr+0.086
L+0.086
fy+0.082
brush+0.082
 gloss+0.076
Neg logit contributions
 seeing-0.123
 wondering-0.113
 observing-0.113
 discovering-0.113
 wonders-0.110
Token.
Token ID13
Feature activation-1.30e-02

Pos logit contributions
tr+0.119
brush+0.116
L+0.116
fy+0.105
 gloss+0.101
Neg logit contributions
 seeing-0.194
 wondering-0.189
 observing-0.186
 discovering-0.183
 wonders-0.182
Token Lily
Token ID20037
Feature activation-3.11e-01

Pos logit contributions
tr+0.019
brush+0.019
fy+0.019
paste+0.018
 tighter+0.018
Neg logit contributions
 seeing-0.025
 wonders-0.024
 observing-0.024
 wondering-0.024
 discovering-0.023
Token has
Token ID468
Feature activation+6.44e-02

Pos logit contributions
L+0.532
 swapped+0.513
 swap+0.509
fy+0.504
tr+0.498
Neg logit contributions
 seeing-0.522
 observing-0.513
 discovering-0.483
 wondering-0.478
 believing-0.473
Token a
Token ID257
Feature activation-9.59e-02

Pos logit contributions
 seeing+0.143
 wondering+0.136
 observing+0.135
 discovering+0.134
 wonders+0.130
Neg logit contributions
tr-0.080
L-0.079
brush-0.075
fy-0.074
 tighter-0.068
Token large
Token ID1588
Feature activation+7.11e-02

Pos logit contributions
tr+0.146
L+0.144
fy+0.144
brush+0.141
 tighter+0.136
Neg logit contributions
 seeing-0.182
 wondering-0.171
 observing-0.167
 wonders-0.166
 discovering-0.165
Token card
Token ID2657
Feature activation-2.74e-01

Pos logit contributions
 seeing+0.153
 wondering+0.143
 observing+0.142
 discovering+0.141
 wonders+0.139
Neg logit contributions
tr-0.093
L-0.092
fy-0.090
brush-0.088
 tighter-0.083
Token with
Token ID351
Feature activation-1.31e-01

Pos logit contributions
fy+0.391
tr+0.390
L+0.390
brush+0.372
 tighter+0.367
Neg logit contributions
 seeing-0.552
 wondering-0.507
 observing-0.506
 discovering-0.506
 wonders-0.487
Token a
Token ID257
Feature activation-9.24e-02

Pos logit contributions
tr+0.199
fy+0.190
brush+0.190
L+0.190
 tighter+0.177
Neg logit contributions
 seeing-0.259
 wondering-0.240
 discovering-0.237
 observing-0.236
 wonders-0.235
Token red
Token ID2266
Feature activation-4.28e-02

Pos logit contributions
tr+0.137
L+0.133
fy+0.131
brush+0.131
 tighter+0.122
Neg logit contributions
 seeing-0.184
 wondering-0.171
 observing-0.170
 discovering-0.169
 wonders-0.166
Token heart
Token ID2612
Feature activation-8.12e-02

Pos logit contributions
fy+0.058
L+0.056
tr+0.055
tted+0.054
 tighter+0.054
Neg logit contributions
 seeing-0.088
 wondering-0.080
 observing-0.080
 wonders-0.079
 discovering-0.079
Token.
Token ID13
Feature activation-1.11e-02

Pos logit contributions
L+0.134
tr+0.132
fy+0.131
brush+0.125
 tighter+0.124
Neg logit contributions
 seeing-0.147
 observing-0.133
 wondering-0.133
 discovering-0.133
 wonders-0.128
Token journey
Token ID7002
Feature activation+1.59e-02

Pos logit contributions
 seeing+0.010
 wondering+0.009
 observing+0.009
 discovering+0.009
 wonders+0.009
Neg logit contributions
tr-0.007
L-0.007
fy-0.007
brush-0.007
 tighter-0.007
Token.
Token ID13
Feature activation-4.45e-02

Pos logit contributions
 seeing+0.032
 wondering+0.030
 observing+0.030
 wonders+0.030
 discovering+0.030
Neg logit contributions
tr-0.022
brush-0.021
fy-0.021
L-0.021
tered-0.020
Token
Token ID220
Feature activation-9.38e-02

Pos logit contributions
tr+0.049
L+0.049
brush+0.045
fy+0.043
tted+0.043
Neg logit contributions
 seeing-0.104
 observing-0.096
 wondering-0.096
 cultures-0.095
 discovering-0.095
Token
Token ID198
Feature activation+3.99e-02

Pos logit contributions
tr+0.118
L+0.115
tted+0.113
brush+0.106
Max+0.099
Neg logit contributions
 seeing-0.213
 wondering-0.200
 cultures-0.199
 observing-0.196
 discovering-0.196
Token
Token ID198
Feature activation+4.16e-02

Pos logit contributions
 seeing+0.086
 wondering+0.081
 cultures+0.080
 observing+0.079
 discovering+0.078
Neg logit contributions
L-0.058
tted-0.050
tr-0.049
brush-0.048
Max-0.047
TokenTogether
Token ID41631
Feature activation-3.20e-01

Pos logit contributions
 seeing+0.091
 wondering+0.086
 observing+0.085
 discovering+0.084
 wonders+0.083
Neg logit contributions
L-0.054
tr-0.053
brush-0.051
fy-0.050
 tighter-0.046
Token,
Token ID11
Feature activation-9.32e-02

Pos logit contributions
tr+0.669
L+0.593
Fl+0.581
brush+0.578
Wh+0.576
Neg logit contributions
 seeing-0.529
 discovering-0.475
 wondering-0.469
 observing-0.461
 exploring-0.443
Token Tim
Token ID5045
Feature activation-1.85e-01

Pos logit contributions
L+0.165
fy+0.161
tr+0.159
brush+0.157
 tighter+0.155
Neg logit contributions
 seeing-0.146
 wondering-0.137
 wonders-0.135
 observing-0.134
 cultures-0.133
Token and
Token ID290
Feature activation+2.11e-02

Pos logit contributions
fy+0.228
L+0.223
tr+0.223
brush+0.215
 tighter+0.211
Neg logit contributions
 seeing-0.404
 observing-0.385
 wondering-0.383
 discovering-0.376
 cultures-0.369
Token Joe
Token ID5689
Feature activation+2.02e-01

Pos logit contributions
 seeing+0.036
 wondering+0.033
 observing+0.033
 discovering+0.033
 wonders+0.032
Neg logit contributions
tr-0.038
L-0.037
fy-0.037
brush-0.037
 tighter-0.034
Token continued
Token ID3767
Feature activation-2.61e-01

Pos logit contributions
 seeing+0.449
 wondering+0.430
 observing+0.429
 discovering+0.419
 wonders+0.412
Neg logit contributions
fy-0.237
tr-0.237
brush-0.232
L-0.229
tered-0.215
Token up
Token ID510
Feature activation+4.71e-02

Pos logit contributions
 seeing+0.141
 wondering+0.131
 wonders+0.131
 observing+0.129
 discovering+0.128
Neg logit contributions
tr-0.139
L-0.132
brush-0.131
fy-0.131
 gloss-0.122
Token Lily
Token ID20037
Feature activation-2.72e-01

Pos logit contributions
 seeing+0.084
 wondering+0.077
 discovering+0.077
 observing+0.077
 wonders+0.075
Neg logit contributions
tr-0.081
L-0.078
brush-0.077
fy-0.077
 gloss-0.072
Token's
Token ID338
Feature activation-1.62e-01

Pos logit contributions
tr+0.304
L+0.298
brush+0.289
fy+0.287
 tighter+0.259
Neg logit contributions
 seeing-0.637
 wondering-0.600
 observing-0.597
 discovering-0.596
 wonders-0.583
Token brother
Token ID3956
Feature activation-6.10e-03

Pos logit contributions
tr+0.184
brush+0.171
L+0.169
fy+0.167
 tighter+0.153
Neg logit contributions
 seeing-0.389
 wondering-0.364
 discovering-0.363
 observing-0.361
 wonders-0.354
Token.
Token ID13
Feature activation+2.31e-02

Pos logit contributions
tr+0.009
L+0.009
brush+0.009
fy+0.009
 tighter+0.008
Neg logit contributions
 seeing-0.012
 wondering-0.011
 observing-0.011
 discovering-0.011
 wonders-0.011
Token Lily
Token ID20037
Feature activation-2.71e-01

Pos logit contributions
 seeing+0.045
 wondering+0.043
 observing+0.042
 discovering+0.041
 wonders+0.040
Neg logit contributions
brush-0.035
fy-0.033
tr-0.033
 tighter-0.032
paste-0.032
Token was
Token ID373
Feature activation+2.00e-02

Pos logit contributions
tr+0.357
L+0.355
fy+0.341
brush+0.338
 swapped+0.319
Neg logit contributions
 seeing-0.569
 observing-0.541
 discovering-0.539
 wondering-0.534
 cultures-0.534
Token happy
Token ID3772
Feature activation+9.23e-02

Pos logit contributions
 seeing+0.037
 wondering+0.035
 discovering+0.034
 observing+0.034
 wonders+0.034
Neg logit contributions
tr-0.032
L-0.031
brush-0.030
fy-0.030
 tighter-0.029
Token because
Token ID780
Feature activation+7.06e-02

Pos logit contributions
 cultures+0.209
 wonders+0.196
 seeing+0.196
 observing+0.194
eling+0.194
Neg logit contributions
L-0.102
brush-0.089
 gloss-0.088
tr-0.087
 tighter-0.084
Token she
Token ID673
Feature activation+9.02e-02

Pos logit contributions
 seeing+0.119
 wondering+0.109
 observing+0.108
 discovering+0.108
 wonders+0.104
Neg logit contributions
tr-0.131
fy-0.126
brush-0.125
L-0.124
tted-0.118
Token got
Token ID1392
Feature activation-1.70e-01

Pos logit contributions
 seeing+0.187
 wondering+0.177
 observing+0.177
 cultures+0.176
 discovering+0.175
Neg logit contributions
L-0.126
tr-0.124
fy-0.120
brush-0.115
tted-0.112
Token paws
Token ID49193
Feature activation-1.36e-01

Pos logit contributions
 seeing+0.079
 discovering+0.073
 wondering+0.073
 observing+0.073
 cultures+0.071
Neg logit contributions
tr-0.047
fy-0.045
brush-0.044
L-0.043
 tighter-0.042
Token to
Token ID284
Feature activation+2.92e-02

Pos logit contributions
tr+0.208
L+0.202
brush+0.201
fy+0.200
 tighter+0.188
Neg logit contributions
 seeing-0.263
 wondering-0.244
 observing-0.243
 discovering-0.241
 wonders-0.238
Token push
Token ID4574
Feature activation-1.05e-01

Pos logit contributions
 seeing+0.061
 wondering+0.057
 observing+0.056
 discovering+0.056
 wonders+0.055
Neg logit contributions
tr-0.041
L-0.040
brush-0.039
fy-0.039
 tighter-0.036
Token the
Token ID262
Feature activation+6.15e-02

Pos logit contributions
tr+0.225
brush+0.208
fy+0.206
L+0.200
 gloss+0.199
Neg logit contributions
 seeing-0.149
 wonders-0.139
 wondering-0.137
 discovering-0.133
 observing-0.131
Token spear
Token ID17514
Feature activation+1.27e-01

Pos logit contributions
 seeing+0.095
 wondering+0.088
 wonders+0.087
 observing+0.086
 discovering+0.085
Neg logit contributions
tr-0.118
L-0.117
brush-0.114
fy-0.113
 gloss-0.109
Token away
Token ID1497
Feature activation-2.69e-01

Pos logit contributions
 seeing+0.245
 wondering+0.230
 discovering+0.227
 observing+0.224
 wonders+0.222
Neg logit contributions
L-0.196
tr-0.194
brush-0.186
fy-0.186
 gloss-0.175
Token.
Token ID13
Feature activation+1.86e-02

Pos logit contributions
tr+0.448
L+0.440
fy+0.433
brush+0.428
 gloss+0.407
Neg logit contributions
 seeing-0.496
 wondering-0.456
 discovering-0.452
 observing-0.447
 wonders-0.440
Token As
Token ID1081
Feature activation-9.81e-02

Pos logit contributions
 seeing+0.039
 wondering+0.036
 observing+0.036
 discovering+0.036
 wonders+0.035
Neg logit contributions
tr-0.026
L-0.025
brush-0.025
fy-0.025
 tighter-0.023
Token the
Token ID262
Feature activation+6.43e-02

Pos logit contributions
tr+0.159
L+0.155
brush+0.152
fy+0.152
 tighter+0.142
Neg logit contributions
 seeing-0.185
 wondering-0.171
 observing-0.170
 discovering-0.170
 wonders-0.165
Token spear
Token ID17514
Feature activation+1.29e-01

Pos logit contributions
 seeing+0.094
 observing+0.085
 wonders+0.084
 wondering+0.083
 discovering+0.082
Neg logit contributions
brush-0.129
tr-0.127
L-0.126
fy-0.126
 gloss-0.122
Token moved
Token ID3888
Feature activation+6.59e-02

Pos logit contributions
 seeing+0.279
 wondering+0.261
 observing+0.260
 discovering+0.259
 wonders+0.254
Neg logit contributions
tr-0.168
L-0.164
brush-0.160
fy-0.160
 tighter-0.147
Token a
Token ID257
Feature activation-1.19e-01

Pos logit contributions
 seeing+0.016
 observing+0.015
 wondering+0.015
 discovering+0.015
 wonders+0.015
Neg logit contributions
brush-0.010
tr-0.010
L-0.010
fy-0.009
 juice-0.009
Token trash
Token ID13913
Feature activation-7.03e-02

Pos logit contributions
tr+0.176
L+0.172
brush+0.169
fy+0.168
 tighter+0.156
Neg logit contributions
 seeing-0.237
 wondering-0.221
 observing-0.220
 discovering-0.220
 wonders-0.214
Token.
Token ID13
Feature activation-4.30e-02

Pos logit contributions
tr+0.106
L+0.104
fy+0.102
brush+0.102
 tighter+0.096
Neg logit contributions
 seeing-0.137
 wondering-0.127
 observing-0.126
 discovering-0.125
 wonders-0.123
Token
Token ID198
Feature activation+3.44e-02

Pos logit contributions
L+0.057
tr+0.054
brush+0.051
fy+0.050
tered+0.047
Neg logit contributions
 seeing-0.096
 discovering-0.090
 observing-0.089
 wondering-0.088
 cultures-0.087
Token
Token ID198
Feature activation+3.90e-02

Pos logit contributions
 seeing+0.079
 cultures+0.077
 discovering+0.076
 wondering+0.076
 observing+0.075
Neg logit contributions
L-0.048
Fl-0.038
Max-0.037
Mom-0.035
tr-0.035
TokenL
Token ID43
Feature activation-4.61e-01

Pos logit contributions
 seeing+0.091
 observing+0.085
 wondering+0.085
 discovering+0.085
 wonders+0.083
Neg logit contributions
tr-0.047
fy-0.043
brush-0.042
 gloss-0.039
L-0.038
Tokenily
Token ID813
Feature activation-2.11e-01

Pos logit contributions
L+0.490
tr+0.468
brush+0.445
fy+0.435
 gloss+0.397
Neg logit contributions
 seeing-1.134
 wondering-1.078
 observing-1.072
 discovering-1.066
 wonders-1.041
Token burst
Token ID11173
Feature activation-2.14e-01

Pos logit contributions
L+0.322
tr+0.322
brush+0.308
fy+0.306
 tighter+0.288
Neg logit contributions
 seeing-0.410
 observing-0.386
 discovering-0.385
 wondering-0.381
 cultures-0.371
Token into
Token ID656
Feature activation-9.08e-02

Pos logit contributions
tr+0.389
fy+0.363
brush+0.357
L+0.346
 gloss+0.338
Neg logit contributions
 seeing-0.367
 wondering-0.357
 wonders-0.355
 observing-0.337
 discovering-0.332
Token tears
Token ID10953
Feature activation-6.38e-02

Pos logit contributions
L+0.134
tr+0.130
brush+0.126
fy+0.124
 tighter+0.120
Neg logit contributions
 seeing-0.181
 observing-0.169
 discovering-0.168
 wondering-0.167
 cultures-0.163
Token and
Token ID290
Feature activation+2.41e-02

Pos logit contributions
tr+0.098
L+0.097
brush+0.095
fy+0.094
 tighter+0.088
Neg logit contributions
 seeing-0.123
 wondering-0.115
 observing-0.115
 discovering-0.115
 wonders-0.111