TOP ACTIVATIONS
MAX = 0.185

treasure is not gold or jewels ," L ila says .
to be gold is not gold , but something else that
Maybe the treasure is not gold or jewels ," L ila
. I don 't need gold ice cream . I have
found a chest filled with gold and jewels . From that
real treasure chest filled with gold coins and jewels .
, maybe we will find gold and jewels and pear ls
to be restored to its former glory ."
a treasure chest full of gold and jewels ! Tim my
, you will get more gold ." The boy was greedy
found a chest filled with gold and jewels . From that
Maybe the birds can still hatch them . But you have
treasure chest full of shiny gold coins ! Tim , Sally
slide is not made of gold , but it is brilliant
good to have so much clutter . You can trip and
- a chest filled with gold and treasures ! <|endoftext|> Once
friendship are more precious than gold ." L ily
Maybe we can find more treasures ." They closed
the village back to its former glory !"
chest and found lots of gold coins and shiny jewels .

MIN ACTIVATIONS
MIN = -0.160

with me , Mom my ." Her mom smiled back and
it to its mom my ." Twe ety
have to say sorry first ." L ily did
much fun playing with it ." L ily
can share and play together ." Blue was happy , and
knee for a little while ." Mom my said .
sorry for not sharing before ." Her friend was
play with our shapes together ." They played with
want to play with me ." Mom kissed Anna
won 't do it again ." Her mom hugged
wants to play with us ." Tom comes back
wanted to make new friends ." L ily
deserve to ride on me ." Tim and Mia
't mean to break them ." Their mom sighed
won 't do that again ." <|endoftext|> L ily liked to
want to make a fort ." M ia said
themselves for who they are ." Ab bie thought
look pretty and y ummy ." They add everything
I couldn 't help sooner ." Qu ackers said , "
with your nice rubber ball ." L ily thought

INTERVAL 0.099 - 0.185
CONTAINS 4.299%

, pointing to a skysc raper . " Wow
decided to find an o asis . He followed his nose
paint brush and some beautiful colors . With a few sweeps
m ittens . They had disappeared . They felt
, you made a wonderful castle !" Mom said .

INTERVAL 0.013 - 0.099
CONTAINS 37.478%

more cel ery . Maybe it will increase our height ,"
new game and discovered even more exciting smells in the ocean
that he couldn 't play with his truck until he ate
a beautiful pattern in the clouds . The pattern was made
plane takes people from one place to another place . It

INTERVAL -0.074 - 0.013
CONTAINS 46.029%

The girl said , " My name is
<|endoftext|> Once upon a time there was a happy little child
went for a long walk in the forest , but quickly
Just then , Bree saw something on the creek 's
That strange man was right . Mark 's mar

INTERVAL -0.160 - -0.074
CONTAINS 12.195%

, help !" Tom cried . He was not happy .
Duck asked him to do . The family of ducks were
still there was no luck . Just then
on those toys ! <|endoftext|> Once upon a time , there
but Sara 's car stops . She looks at it and
Token treasure
Token ID14068
Feature activation+1.22e-01

Pos logit contributions
 Int+0.008
 Sadly+0.008
 Curious+0.008
ete+0.008
Sadly+0.008
Neg logit contributions
els-0.008
ps-0.007
 on-0.006
ws-0.006
 inside-0.006
Token is
Token ID318
Feature activation+2.90e-02

Pos logit contributions
els+0.206
ps+0.169
 colored+0.159
ws+0.159
ries+0.157
Neg logit contributions
 Int-0.207
ete-0.203
 Sadly-0.201
 Curious-0.194
 Everything-0.190
Token not
Token ID407
Feature activation+3.99e-02

Pos logit contributions
els+0.057
ries+0.044
ps+0.043
ws+0.042
ases+0.041
Neg logit contributions
 couldn-0.043
 Sadly-0.042
ete-0.042
 Int-0.040
 Barkley-0.040
Token gold
Token ID3869
Feature activation+1.83e-01

Pos logit contributions
els+0.081
ps+0.061
ries+0.061
ws+0.059
bones+0.058
Neg logit contributions
 couldn-0.058
 Int-0.057
 Sadly-0.056
 Barkley-0.056
ete-0.055
Token or
Token ID393
Feature activation+3.20e-02

Pos logit contributions
els+0.267
bones+0.220
ps+0.219
cks+0.219
ws+0.202
Neg logit contributions
ete-0.332
 He-0.312
 So-0.294
 couldn-0.288
So-0.284
Token jewels
Token ID42701
Feature activation+1.85e-01

Pos logit contributions
els+0.052
ps+0.043
ries+0.040
ws+0.039
ier+0.039
Neg logit contributions
 Int-0.057
ete-0.055
 Curious-0.055
 Sadly-0.055
 couldn-0.053
Token,"
Token ID553
Feature activation-8.39e-02

Pos logit contributions
els+0.299
ps+0.236
bones+0.228
cks+0.223
ws+0.219
Neg logit contributions
ete-0.331
 couldn-0.321
-0.305
So-0.302
 Sadly-0.302
Token L
Token ID406
Feature activation-1.62e-02

Pos logit contributions
 Int+0.152
 Sadly+0.148
 Curious+0.145
ete+0.144
Sadly+0.142
Neg logit contributions
els-0.147
ps-0.128
 on-0.123
 inside-0.121
ws-0.117
Tokenila
Token ID10102
Feature activation-1.47e-02

Pos logit contributions
 Int+0.036
 Sadly+0.036
 Curious+0.035
ete+0.035
Sadly+0.034
Neg logit contributions
els-0.022
ps-0.018
 on-0.017
 inside-0.016
.-0.016
Token says
Token ID1139
Feature activation+6.77e-03

Pos logit contributions
 Int+0.028
 Sadly+0.027
 Curious+0.027
ete+0.026
Sadly+0.026
Neg logit contributions
els-0.024
ps-0.021
 on-0.020
 inside-0.019
ws-0.019
Token.
Token ID13
Feature activation-8.49e-02

Pos logit contributions
els+0.007
 colored+0.005
ws+0.005
ries+0.005
 coloured+0.004
Neg logit contributions
 Int-0.016
 Curious-0.016
 Sadly-0.016
ete-0.016
 �-0.016
Token to
Token ID284
Feature activation+2.35e-02

Pos logit contributions
els+0.096
ps+0.085
 frost+0.079
 sticker+0.079
 icing+0.078
Neg logit contributions
 Int-0.099
ete-0.097
 Sadly-0.097
 Everything-0.095
 couldn-0.095
Token be
Token ID307
Feature activation+4.70e-02

Pos logit contributions
els+0.044
ps+0.039
 on+0.037
 inside+0.037
ws+0.036
Neg logit contributions
 Int-0.038
 Sadly-0.037
ete-0.037
 Curious-0.036
Sadly-0.036
Token gold
Token ID3869
Feature activation+1.69e-01

Pos logit contributions
els+0.096
ries+0.081
ps+0.080
 sticker+0.075
ws+0.075
Neg logit contributions
 Sadly-0.068
 Int-0.066
 Curious-0.065
Sadly-0.064
 Everything-0.063
Token is
Token ID318
Feature activation+4.81e-02

Pos logit contributions
els+0.222
ps+0.190
bones+0.185
ws+0.180
cks+0.174
Neg logit contributions
ete-0.334
 Int-0.314
 Sadly-0.308
 couldn-0.306
 Barkley-0.304
Token not
Token ID407
Feature activation+5.65e-02

Pos logit contributions
els+0.088
ries+0.074
ps+0.072
 icing+0.068
 frost+0.067
Neg logit contributions
 couldn-0.075
 Sadly-0.074
ete-0.074
 shouldn-0.072
 Int-0.071
Token gold
Token ID3869
Feature activation+1.84e-01

Pos logit contributions
els+0.105
ries+0.085
ips+0.082
ps+0.080
ases+0.080
Neg logit contributions
 couldn-0.084
 shouldn-0.084
ete-0.077
 Joey-0.076
 Sadly-0.075
Token,
Token ID11
Feature activation+2.22e-02

Pos logit contributions
els+0.267
cks+0.247
bones+0.244
ps+0.241
ws+0.230
Neg logit contributions
ete-0.319
 He-0.286
 couldn-0.284
 shouldn-0.278
 Barkley-0.263
Token but
Token ID475
Feature activation-2.26e-02

Pos logit contributions
els+0.032
ps+0.027
 on+0.026
 inside+0.025
ws+0.024
Neg logit contributions
 Int-0.047
 Sadly-0.046
ete-0.045
 Curious-0.045
Sadly-0.044
Token something
Token ID1223
Feature activation+3.91e-02

Pos logit contributions
 Sadly+0.034
 Int+0.034
ete+0.034
 Curious+0.033
 Everything+0.033
Neg logit contributions
els-0.043
ps-0.039
ws-0.036
 on-0.035
ries-0.034
Token else
Token ID2073
Feature activation+5.07e-02

Pos logit contributions
els+0.072
ps+0.062
ries+0.059
ws+0.058
 on+0.056
Neg logit contributions
 Int-0.064
 Sadly-0.062
ete-0.062
 Curious-0.060
 Everything-0.059
Token that
Token ID326
Feature activation+9.52e-03

Pos logit contributions
els+0.090
ps+0.078
ries+0.072
ws+0.072
 sticker+0.071
Neg logit contributions
 Int-0.083
ete-0.082
 Sadly-0.080
 Everything-0.078
 Curious-0.078
TokenMaybe
Token ID13300
Feature activation+2.97e-03

Pos logit contributions
 Int+0.050
 Sadly+0.047
 Curious+0.047
Sadly+0.045
ete+0.044
Neg logit contributions
els-0.111
ps-0.100
 on-0.097
 inside-0.097
ws-0.095
Token the
Token ID262
Feature activation-4.47e-03

Pos logit contributions
els+0.007
ps+0.007
ws+0.007
 channel+0.006
cks+0.006
Neg logit contributions
 Int-0.003
ete-0.003
 Curious-0.003
Of-0.003
 couldn-0.003
Token treasure
Token ID14068
Feature activation+1.22e-01

Pos logit contributions
 Int+0.008
 Sadly+0.008
 Curious+0.008
ete+0.008
Sadly+0.008
Neg logit contributions
els-0.008
ps-0.007
 on-0.006
ws-0.006
 inside-0.006
Token is
Token ID318
Feature activation+2.90e-02

Pos logit contributions
els+0.206
ps+0.169
 colored+0.159
ws+0.159
ries+0.157
Neg logit contributions
 Int-0.207
ete-0.203
 Sadly-0.201
 Curious-0.194
 Everything-0.190
Token not
Token ID407
Feature activation+3.99e-02

Pos logit contributions
els+0.057
ries+0.044
ps+0.043
ws+0.042
ases+0.041
Neg logit contributions
 couldn-0.043
 Sadly-0.042
ete-0.042
 Int-0.040
 Barkley-0.040
Token gold
Token ID3869
Feature activation+1.83e-01

Pos logit contributions
els+0.081
ps+0.061
ries+0.061
ws+0.059
bones+0.058
Neg logit contributions
 couldn-0.058
 Int-0.057
 Sadly-0.056
 Barkley-0.056
ete-0.055
Token or
Token ID393
Feature activation+3.20e-02

Pos logit contributions
els+0.267
bones+0.220
ps+0.219
cks+0.219
ws+0.202
Neg logit contributions
ete-0.332
 He-0.312
 So-0.294
 couldn-0.288
So-0.284
Token jewels
Token ID42701
Feature activation+1.85e-01

Pos logit contributions
els+0.052
ps+0.043
ries+0.040
ws+0.039
ier+0.039
Neg logit contributions
 Int-0.057
ete-0.055
 Curious-0.055
 Sadly-0.055
 couldn-0.053
Token,"
Token ID553
Feature activation-8.39e-02

Pos logit contributions
els+0.299
ps+0.236
bones+0.228
cks+0.223
ws+0.219
Neg logit contributions
ete-0.331
 couldn-0.321
-0.305
So-0.302
 Sadly-0.302
Token L
Token ID406
Feature activation-1.62e-02

Pos logit contributions
 Int+0.152
 Sadly+0.148
 Curious+0.145
ete+0.144
Sadly+0.142
Neg logit contributions
els-0.147
ps-0.128
 on-0.123
 inside-0.121
ws-0.117
Tokenila
Token ID10102
Feature activation-1.47e-02

Pos logit contributions
 Int+0.036
 Sadly+0.036
 Curious+0.035
ete+0.035
Sadly+0.034
Neg logit contributions
els-0.022
ps-0.018
 on-0.017
 inside-0.016
.-0.016
Token.
Token ID13
Feature activation-3.97e-02

Pos logit contributions
els+0.107
ps+0.094
cks+0.082
ws+0.076
bones+0.074
Neg logit contributions
 Int-0.117
Of-0.116
 Curious-0.113
 Sadly-0.112
ete-0.110
Token I
Token ID314
Feature activation+3.94e-03

Pos logit contributions
 Int+0.044
 Sadly+0.043
 Curious+0.041
ete+0.041
Sadly+0.040
Neg logit contributions
els-0.096
ps-0.087
 on-0.085
 inside-0.084
ws-0.082
Token don
Token ID836
Feature activation+2.91e-02

Pos logit contributions
els+0.008
ps+0.007
ries+0.007
cks+0.006
 on+0.006
Neg logit contributions
 Sadly-0.006
 Int-0.006
ete-0.006
 Curious-0.006
Sadly-0.006
Token't
Token ID470
Feature activation+2.02e-02

Pos logit contributions
els+0.071
ps+0.064
 on+0.063
 inside+0.062
ws+0.061
Neg logit contributions
 Int-0.032
 Sadly-0.030
ete-0.029
 Curious-0.029
Sadly-0.029
Token need
Token ID761
Feature activation+8.69e-02

Pos logit contributions
els+0.042
ps+0.036
ries+0.036
cks+0.033
 inside+0.031
Neg logit contributions
 Sadly-0.030
ete-0.029
Sadly-0.028
Of-0.027
 Int-0.027
Token gold
Token ID3869
Feature activation+1.82e-01

Pos logit contributions
els+0.144
ps+0.131
packs+0.123
 colored+0.121
ws+0.120
Neg logit contributions
 Int-0.142
ete-0.139
Of-0.136
 Everything-0.133
 Not-0.130
Token ice
Token ID4771
Feature activation+1.59e-01

Pos logit contributions
els+0.223
cks+0.214
ps+0.209
 colored+0.209
 channel+0.190
Neg logit contributions
 He-0.319
ete-0.317
Of-0.309
 So-0.296
 His-0.293
Token cream
Token ID8566
Feature activation+1.40e-01

Pos logit contributions
els+0.205
ps+0.188
cks+0.168
 harm+0.162
 envelop+0.157
Neg logit contributions
Of-0.286
ete-0.284
 He-0.268
Sure-0.253
 So-0.253
Token.
Token ID13
Feature activation-4.29e-02

Pos logit contributions
els+0.207
cks+0.179
ps+0.179
 colored+0.167
 coloured+0.151
Neg logit contributions
Of-0.266
ete-0.260
 He-0.255
 So-0.246
So-0.241
Token I
Token ID314
Feature activation+3.73e-03

Pos logit contributions
 Int+0.048
 Sadly+0.046
ete+0.045
 Curious+0.044
Sadly+0.044
Neg logit contributions
els-0.103
ps-0.093
 on-0.092
 inside-0.090
ws-0.088
Token have
Token ID423
Feature activation+3.63e-02

Pos logit contributions
els+0.007
ps+0.006
ries+0.006
 on+0.006
 inside+0.006
Neg logit contributions
 Sadly-0.006
 Int-0.006
 Curious-0.006
ete-0.006
Sadly-0.006
Token found
Token ID1043
Feature activation+4.83e-02

Pos logit contributions
 Int+0.061
 Sadly+0.059
ete+0.058
 Curious+0.058
Sadly+0.057
Neg logit contributions
els-0.063
ps-0.055
 on-0.055
 inside-0.054
.-0.051
Token a
Token ID257
Feature activation+3.93e-02

Pos logit contributions
els+0.079
ps+0.066
ws+0.062
ries+0.061
 on+0.060
Neg logit contributions
 Int-0.091
 Sadly-0.089
ete-0.089
 Curious-0.089
 Everything-0.087
Token chest
Token ID7721
Feature activation+1.17e-01

Pos logit contributions
els+0.067
ps+0.057
ws+0.052
ries+0.052
 on+0.051
Neg logit contributions
 Int-0.074
 Sadly-0.074
 Curious-0.073
ete-0.072
Sadly-0.072
Token filled
Token ID5901
Feature activation+8.54e-02

Pos logit contributions
els+0.193
ws+0.160
ps+0.157
ries+0.151
cks+0.139
Neg logit contributions
ete-0.224
 Sadly-0.217
 Curious-0.213
 Int-0.212
 couldn-0.212
Token with
Token ID351
Feature activation+6.65e-02

Pos logit contributions
els+0.121
ps+0.106
ws+0.095
ries+0.089
ier+0.088
Neg logit contributions
 Int-0.174
ete-0.173
 Sadly-0.173
 Curious-0.170
Sadly-0.167
Token gold
Token ID3869
Feature activation+1.79e-01

Pos logit contributions
els+0.105
ps+0.087
ws+0.081
ries+0.079
cks+0.077
Neg logit contributions
 Int-0.127
 Curious-0.126
 Sadly-0.126
ete-0.124
 Everything-0.120
Token and
Token ID290
Feature activation+6.94e-03

Pos logit contributions
els+0.245
ws+0.224
ps+0.206
cks+0.206
ases+0.198
Neg logit contributions
ete-0.327
 He-0.303
hog-0.294
 Curious-0.292
 couldn-0.289
Token jewels
Token ID42701
Feature activation+1.60e-01

Pos logit contributions
els+0.010
ps+0.008
ws+0.008
 on+0.008
ries+0.007
Neg logit contributions
 Int-0.015
 Sadly-0.014
 Curious-0.014
ete-0.014
Sadly-0.014
Token.
Token ID13
Feature activation-9.83e-02

Pos logit contributions
els+0.202
ws+0.169
ps+0.147
cks+0.137
ries+0.137
Neg logit contributions
 couldn-0.347
ete-0.345
 Int-0.336
 Curious-0.336
 Sadly-0.324
Token From
Token ID3574
Feature activation+4.12e-03

Pos logit contributions
Da+0.121
Sadly+0.114
Mos+0.109
Bet+0.107
mber+0.107
Neg logit contributions
.-0.201
 on-0.197
 inside-0.194
els-0.191
."-0.184
Token that
Token ID326
Feature activation-2.99e-02

Pos logit contributions
els+0.007
ps+0.006
 on+0.006
 inside+0.006
.+0.005
Neg logit contributions
 Int-0.008
 Sadly-0.008
 Curious-0.007
ete-0.007
Sadly-0.007
Token real
Token ID1103
Feature activation+9.59e-02

Pos logit contributions
els+0.079
ps+0.067
ws+0.062
ries+0.061
cks+0.059
Neg logit contributions
 Sadly-0.079
 Int-0.078
 Curious-0.077
Sadly-0.076
ete-0.075
Token treasure
Token ID14068
Feature activation+1.47e-01

Pos logit contributions
els+0.184
cks+0.153
ps+0.153
ws+0.153
ries+0.151
Neg logit contributions
 Curious-0.150
 Sadly-0.150
 Int-0.149
ete-0.147
 couldn-0.143
Token chest
Token ID7721
Feature activation+1.32e-01

Pos logit contributions
els+0.204
ws+0.168
ps+0.159
ries+0.152
cks+0.145
Neg logit contributions
hog-0.282
 couldn-0.281
ete-0.273
 Sadly-0.272
 Int-0.264
Token filled
Token ID5901
Feature activation+8.84e-02

Pos logit contributions
els+0.174
ws+0.141
ps+0.125
ries+0.124
cks+0.112
Neg logit contributions
 couldn-0.291
hog-0.268
ete-0.267
 Sadly-0.262
 hadn-0.259
Token with
Token ID351
Feature activation+6.34e-02

Pos logit contributions
els+0.126
ps+0.110
ws+0.100
ier+0.095
ries+0.091
Neg logit contributions
Da-0.178
 Sadly-0.174
 Int-0.172
ete-0.169
From-0.168
Token gold
Token ID3869
Feature activation+1.78e-01

Pos logit contributions
els+0.097
ps+0.078
ws+0.076
ries+0.074
cks+0.073
Neg logit contributions
 Int-0.117
 Curious-0.117
 Sadly-0.115
ete-0.112
 couldn-0.110
Token coins
Token ID10796
Feature activation+1.66e-01

Pos logit contributions
els+0.248
ws+0.221
cks+0.208
 harm+0.201
ps+0.200
Neg logit contributions
hog-0.287
 He-0.282
ete-0.274
 couldn-0.264
 Curious-0.257
Token and
Token ID290
Feature activation+5.73e-03

Pos logit contributions
els+0.216
ws+0.187
ps+0.165
ries+0.162
 channel+0.161
Neg logit contributions
 couldn-0.324
ete-0.308
hog-0.294
-0.285
 Barkley-0.284
Token jewels
Token ID42701
Feature activation+1.57e-01

Pos logit contributions
els+0.008
ps+0.006
ws+0.006
ries+0.005
cks+0.005
Neg logit contributions
 Int-0.012
 Curious-0.012
 Sadly-0.012
Sadly-0.012
ete-0.012
Token.
Token ID13
Feature activation-1.18e-01

Pos logit contributions
els+0.206
ws+0.176
ps+0.145
cks+0.144
ries+0.140
Neg logit contributions
 couldn-0.323
hog-0.302
ete-0.291
 Curious-0.287
ishly-0.286
Token
Token ID220
Feature activation-5.66e-02

Pos logit contributions
ete+0.217
rolling+0.203
Harry+0.196
Sadly+0.191
David+0.188
Neg logit contributions
.-0.203
 inside-0.190
 on-0.190
."-0.178
els-0.161
Token,
Token ID11
Feature activation-1.60e-02

Pos logit contributions
 Sadly+0.014
ete+0.014
 Int+0.014
 So+0.014
 Curious+0.014
Neg logit contributions
els-0.009
ps-0.007
ries-0.006
 icing-0.006
bones-0.006
Token maybe
Token ID3863
Feature activation-4.88e-03

Pos logit contributions
 Int+0.032
 Sadly+0.032
ete+0.031
 Curious+0.031
Sadly+0.030
Neg logit contributions
els-0.024
 on-0.022
 inside-0.022
ps-0.021
.-0.020
Token we
Token ID356
Feature activation+1.04e-03

Pos logit contributions
 couldn+0.004
 Int+0.004
ete+0.004
 Curious+0.004
Of+0.003
Neg logit contributions
ps-0.012
els-0.012
ws-0.011
 channel-0.011
bones-0.011
Token will
Token ID481
Feature activation+1.21e-02

Pos logit contributions
els+0.002
ps+0.002
 on+0.002
 inside+0.001
ries+0.001
Neg logit contributions
 Int-0.002
 Sadly-0.002
ete-0.002
 Curious-0.002
Sadly-0.002
Token find
Token ID1064
Feature activation+7.64e-02

Pos logit contributions
els+0.027
ries+0.023
ps+0.023
bones+0.022
ws+0.021
Neg logit contributions
 Sadly-0.016
 Int-0.016
ete-0.016
 Curious-0.016
Sadly-0.015
Token gold
Token ID3869
Feature activation+1.77e-01

Pos logit contributions
els+0.140
 channel+0.131
ge+0.124
ps+0.121
ier+0.118
Neg logit contributions
 Int-0.106
ete-0.103
 Everything-0.099
Of-0.095
 Curious-0.095
Token and
Token ID290
Feature activation+1.86e-02

Pos logit contributions
els+0.230
ps+0.220
 channel+0.212
bones+0.196
cks+0.194
Neg logit contributions
ete-0.312
 He-0.284
So-0.279
 So-0.276
 Int-0.272
Token jewels
Token ID42701
Feature activation+1.65e-01

Pos logit contributions
els+0.026
ps+0.022
ries+0.020
ier+0.019
 on+0.019
Neg logit contributions
 Int-0.037
 Curious-0.036
 Sadly-0.036
ete-0.036
 couldn-0.035
Token and
Token ID290
Feature activation+1.94e-02

Pos logit contributions
els+0.258
ps+0.211
bones+0.201
cks+0.196
ws+0.191
Neg logit contributions
ete-0.290
Of-0.282
So-0.279
 couldn-0.277
 Sadly-0.273
Token pear
Token ID25286
Feature activation+1.53e-01

Pos logit contributions
els+0.027
ps+0.022
ries+0.020
ier+0.019
ws+0.019
Neg logit contributions
 Int-0.039
 Sadly-0.038
 Curious-0.038
ete-0.037
Sadly-0.037
Tokenls
Token ID7278
Feature activation+1.18e-01

Pos logit contributions
ps+0.262
els+0.258
 channel+0.251
epad+0.241
 size+0.226
Neg logit contributions
ete-0.190
hog-0.182
 He-0.180
 Fin-0.178
 Nut-0.174
Token to
Token ID284
Feature activation+3.41e-02

Pos logit contributions
els+0.222
ps+0.185
cks+0.171
ries+0.167
 frost+0.162
Neg logit contributions
 Sadly-0.303
ete-0.299
 Everywhere-0.294
 Int-0.293
 Everything-0.287
Token be
Token ID307
Feature activation+6.86e-02

Pos logit contributions
els+0.061
ps+0.052
 on+0.049
ries+0.048
ws+0.048
Neg logit contributions
 Sadly-0.060
 Int-0.059
Sadly-0.057
ete-0.056
 Curious-0.056
Token restored
Token ID15032
Feature activation+1.16e-01

Pos logit contributions
els+0.133
ps+0.113
ries+0.109
ws+0.103
bones+0.103
Neg logit contributions
 Sadly-0.105
 Int-0.101
 Curious-0.096
Sadly-0.095
 couldn-0.093
Token to
Token ID284
Feature activation+2.81e-02

Pos logit contributions
els+0.152
ps+0.125
cks+0.120
bones+0.115
ws+0.112
Neg logit contributions
 Sadly-0.236
ete-0.230
 Everywhere-0.229
 couldn-0.224
 Int-0.223
Token its
Token ID663
Feature activation+3.20e-02

Pos logit contributions
els+0.051
ps+0.044
ries+0.041
ws+0.040
 on+0.039
Neg logit contributions
 Sadly-0.047
 Everything-0.045
 Int-0.045
Sadly-0.044
ete-0.043
Token former
Token ID1966
Feature activation+1.77e-01

Pos logit contributions
els+0.063
ps+0.056
cks+0.052
ws+0.052
ries+0.051
Neg logit contributions
 Sadly-0.052
Sadly-0.047
 Curious-0.047
 Int-0.047
ete-0.047
Token glory
Token ID13476
Feature activation+1.32e-01

Pos logit contributions
els+0.316
 on+0.301
ps+0.289
 inside+0.280
 sticker+0.274
Neg logit contributions
 Sadly-0.285
ete-0.267
 Everywhere-0.264
 Int-0.264
Sadly-0.260
Token."
Token ID526
Feature activation-1.23e-01

Pos logit contributions
els+0.135
ps+0.114
ws+0.103
cks+0.094
bones+0.093
Neg logit contributions
ete-0.304
 Int-0.298
 Sadly-0.296
 Everywhere-0.295
 Everything-0.291
Token
Token ID220
Feature activation-6.71e-02

Pos logit contributions
Sadly+0.196
 Barkley+0.187
rolling+0.180
Nut+0.180
 Int+0.178
Neg logit contributions
.-0.237
 on-0.212
els-0.210
 inside-0.200
."-0.195
Token
Token ID198
Feature activation-8.59e-02

Pos logit contributions
 Int+0.104
Of+0.103
Sadly+0.102
 Curious+0.101
Da+0.098
Neg logit contributions
els-0.121
.-0.120
 inside-0.112
 on-0.112
."-0.106
Token
Token ID198
Feature activation-9.89e-02

Pos logit contributions
ishly+0.155
 Int+0.147
 Barkley+0.146
 Curious+0.140
 Sadly+0.140
Neg logit contributions
.-0.143
 on-0.133
els-0.129
 inside-0.123
."-0.122
Token a
Token ID257
Feature activation+3.47e-02

Pos logit contributions
els+0.069
 on+0.061
 inside+0.060
ps+0.059
.+0.054
Neg logit contributions
 Int-0.111
 Sadly-0.109
Sadly-0.107
 Curious-0.106
ete-0.105
Token treasure
Token ID14068
Feature activation+1.39e-01

Pos logit contributions
els+0.060
ps+0.052
 on+0.050
 inside+0.049
ws+0.048
Neg logit contributions
 Int-0.063
 Sadly-0.061
ete-0.060
 Curious-0.060
Sadly-0.059
Token chest
Token ID7721
Feature activation+1.20e-01

Pos logit contributions
els+0.247
ws+0.214
ps+0.212
ries+0.191
cks+0.189
Neg logit contributions
ete-0.233
 couldn-0.218
 Int-0.216
 Sadly-0.215
 Curious-0.209
Token full
Token ID1336
Feature activation+7.22e-02

Pos logit contributions
els+0.208
ws+0.173
ps+0.170
ries+0.160
cks+0.152
Neg logit contributions
ete-0.213
 couldn-0.210
 Sadly-0.208
 Int-0.205
 Curious-0.202
Token of
Token ID286
Feature activation+6.65e-02

Pos logit contributions
els+0.134
ps+0.118
ws+0.113
cks+0.106
ries+0.105
Neg logit contributions
ete-0.123
 Int-0.120
 Sadly-0.117
 Curious-0.115
 Everything-0.112
Token gold
Token ID3869
Feature activation+1.77e-01

Pos logit contributions
els+0.102
ps+0.088
ws+0.080
cks+0.076
ries+0.075
Neg logit contributions
 Int-0.128
 Curious-0.126
ete-0.126
 Sadly-0.124
 Everything-0.121
Token and
Token ID290
Feature activation+1.83e-03

Pos logit contributions
els+0.261
ws+0.246
ps+0.235
cks+0.225
 channel+0.225
Neg logit contributions
ete-0.300
 couldn-0.270
 He-0.248
 Barkley-0.246
hog-0.245
Token jewels
Token ID42701
Feature activation+1.52e-01

Pos logit contributions
els+0.003
ps+0.002
ws+0.002
 on+0.002
cks+0.002
Neg logit contributions
 Int-0.004
 Curious-0.004
 Sadly-0.004
ete-0.004
Sadly-0.004
Token!
Token ID0
Feature activation-1.14e-01

Pos logit contributions
els+0.204
ws+0.173
ps+0.147
cks+0.138
 channel+0.137
Neg logit contributions
 couldn-0.327
ete-0.314
 Int-0.299
 Curious-0.295
rolling-0.293
Token Tim
Token ID5045
Feature activation-3.72e-02

Pos logit contributions
mber+0.215
ete+0.212
Da+0.201
Ol+0.192
Sadly+0.189
Neg logit contributions
.-0.181
 inside-0.174
 on-0.165
."-0.158
els-0.157
Tokenmy
Token ID1820
Feature activation-2.36e-02

Pos logit contributions
 Int+0.078
 Sadly+0.076
Sadly+0.074
 Curious+0.074
ete+0.074
Neg logit contributions
els-0.056
ps-0.047
 on-0.047
 inside-0.045
.-0.044
Token,
Token ID11
Feature activation-2.17e-02

Pos logit contributions
els+0.124
ps+0.119
cks+0.112
bones+0.101
ws+0.101
Neg logit contributions
 Curious-0.094
 Int-0.093
 His-0.093
 Sadly-0.092
ete-0.090
Token you
Token ID345
Feature activation+1.04e-02

Pos logit contributions
 Int+0.034
 Sadly+0.033
ete+0.032
 Curious+0.032
Sadly+0.032
Neg logit contributions
els-0.043
ps-0.038
 on-0.037
 inside-0.036
ws-0.035
Token will
Token ID481
Feature activation+4.80e-03

Pos logit contributions
els+0.018
 on+0.016
ps+0.016
 inside+0.016
.+0.015
Neg logit contributions
 Int-0.019
 Curious-0.018
 Sadly-0.018
ete-0.018
Sadly-0.017
Token get
Token ID651
Feature activation+6.63e-02

Pos logit contributions
els+0.009
ps+0.008
 on+0.007
 inside+0.007
ws+0.007
Neg logit contributions
 Int-0.008
 Sadly-0.008
ete-0.008
Sadly-0.008
 Curious-0.008
Token more
Token ID517
Feature activation+8.89e-02

Pos logit contributions
els+0.115
ps+0.097
 colored+0.089
packs+0.089
ws+0.088
Neg logit contributions
 Sadly-0.116
 Int-0.112
ete-0.110
Sadly-0.108
 Curious-0.107
Token gold
Token ID3869
Feature activation+1.76e-01

Pos logit contributions
els+0.171
ps+0.167
bones+0.144
cks+0.139
 channel+0.136
Neg logit contributions
ete-0.127
 Sadly-0.118
Luckily-0.115
 couldn-0.114
mber-0.114
Token."
Token ID526
Feature activation-1.14e-01

Pos logit contributions
els+0.258
bones+0.225
cks+0.222
ps+0.213
 colored+0.208
Neg logit contributions
ete-0.327
 He-0.285
mber-0.277
 So-0.276
So-0.273
Token The
Token ID383
Feature activation-3.63e-02

Pos logit contributions
 Int+0.170
ete+0.164
Sadly+0.163
 Curious+0.159
rolling+0.159
Neg logit contributions
els-0.213
.-0.205
 on-0.204
 inside-0.197
."-0.188
Token boy
Token ID2933
Feature activation-1.37e-02

Pos logit contributions
 Curious+0.047
 Int+0.046
 Sadly+0.045
ete+0.042
 Everything+0.042
Neg logit contributions
els-0.085
ps-0.075
cks-0.073
ws-0.072
ries-0.070
Token was
Token ID373
Feature activation-7.72e-03

Pos logit contributions
ete+0.023
 Int+0.023
rolling+0.022
 Barkley+0.022
 Everything+0.022
Neg logit contributions
els-0.024
 on-0.023
ps-0.022
.-0.022
 inside-0.022
Token greedy
Token ID31828
Feature activation+8.49e-02

Pos logit contributions
 Int+0.012
 Sadly+0.012
ete+0.011
 Curious+0.011
Sadly+0.011
Neg logit contributions
els-0.015
ps-0.014
 on-0.014
 inside-0.013
.-0.013
Token found
Token ID1043
Feature activation+4.60e-02

Pos logit contributions
 Int+0.066
 Sadly+0.064
ete+0.062
 Curious+0.062
Sadly+0.061
Neg logit contributions
els-0.068
ps-0.060
 on-0.059
 inside-0.058
.-0.055
Token a
Token ID257
Feature activation+3.66e-02

Pos logit contributions
els+0.075
ps+0.063
ws+0.059
ries+0.058
 on+0.057
Neg logit contributions
 Int-0.087
 Sadly-0.085
ete-0.085
 Curious-0.085
 Everything-0.083
Token chest
Token ID7721
Feature activation+1.13e-01

Pos logit contributions
els+0.063
ps+0.053
ws+0.049
ries+0.048
 on+0.048
Neg logit contributions
 Int-0.069
 Sadly-0.069
 Curious-0.067
ete-0.067
Sadly-0.066
Token filled
Token ID5901
Feature activation+8.22e-02

Pos logit contributions
els+0.190
ws+0.158
ps+0.155
ries+0.150
cks+0.137
Neg logit contributions
ete-0.215
 Sadly-0.207
 Curious-0.204
 couldn-0.203
 Int-0.203
Token with
Token ID351
Feature activation+6.34e-02

Pos logit contributions
els+0.116
ps+0.100
ws+0.091
ries+0.084
ier+0.084
Neg logit contributions
 Int-0.169
ete-0.168
 Sadly-0.168
 Curious-0.165
Sadly-0.162
Token gold
Token ID3869
Feature activation+1.75e-01

Pos logit contributions
els+0.099
ps+0.082
ws+0.077
ries+0.075
 on+0.073
Neg logit contributions
 Int-0.122
 Curious-0.121
 Sadly-0.120
ete-0.119
 Everything-0.115
Token and
Token ID290
Feature activation+3.64e-03

Pos logit contributions
els+0.240
ws+0.220
cks+0.202
ps+0.202
ases+0.195
Neg logit contributions
ete-0.318
 He-0.295
hog-0.287
 Curious-0.283
 couldn-0.282
Token jewels
Token ID42701
Feature activation+1.56e-01

Pos logit contributions
els+0.005
ps+0.004
ws+0.004
 on+0.004
ries+0.004
Neg logit contributions
 Int-0.008
 Sadly-0.007
 Curious-0.007
ete-0.007
Sadly-0.007
Token.
Token ID13
Feature activation-1.01e-01

Pos logit contributions
els+0.197
ws+0.165
ps+0.143
cks+0.134
ries+0.133
Neg logit contributions
 couldn-0.340
ete-0.338
 Int-0.329
 Curious-0.327
rolling-0.317
Token From
Token ID3574
Feature activation+4.04e-04

Pos logit contributions
Da+0.126
Sadly+0.120
Mos+0.114
Bet+0.111
mber+0.111
Neg logit contributions
.-0.206
 on-0.200
 inside-0.197
els-0.195
."-0.187
Token that
Token ID326
Feature activation-3.48e-02

Pos logit contributions
els+0.001
ps+0.001
 on+0.001
 inside+0.001
.+0.001
Neg logit contributions
 Int-0.001
 Sadly-0.001
 Curious-0.001
ete-0.001
Sadly-0.001
Token Maybe
Token ID6674
Feature activation+5.01e-03

Pos logit contributions
 Int+0.088
rolling+0.085
ete+0.085
 Curious+0.082
Sadly+0.082
Neg logit contributions
els-0.125
ps-0.111
 on-0.109
.-0.109
."-0.108
Token the
Token ID262
Feature activation-6.20e-03

Pos logit contributions
els+0.012
ps+0.011
ws+0.011
cks+0.010
bones+0.010
Neg logit contributions
ete-0.005
 Int-0.005
Of-0.005
 Sadly-0.005
 couldn-0.005
Token birds
Token ID10087
Feature activation+9.07e-02

Pos logit contributions
 Sadly+0.012
Sadly+0.012
 Int+0.012
 Curious+0.011
 Everything+0.011
Neg logit contributions
els-0.009
ps-0.008
ws-0.007
cks-0.007
bones-0.007
Token can
Token ID460
Feature activation+2.27e-02

Pos logit contributions
els+0.166
ps+0.137
 colored+0.131
ries+0.130
ws+0.130
Neg logit contributions
 Int-0.146
 Sadly-0.146
Sadly-0.138
 Curious-0.137
ete-0.136
Token still
Token ID991
Feature activation+6.59e-02

Pos logit contributions
els+0.042
ps+0.035
ries+0.034
 on+0.034
ws+0.033
Neg logit contributions
 Sadly-0.038
 Int-0.037
Sadly-0.036
ete-0.035
 Curious-0.035
Token hatch
Token ID25834
Feature activation+1.75e-01

Pos logit contributions
els+0.113
ps+0.094
ries+0.087
ws+0.086
cks+0.085
Neg logit contributions
 Int-0.110
 Sadly-0.109
Sadly-0.104
 Curious-0.104
 Everything-0.102
Token them
Token ID606
Feature activation+9.64e-02

Pos logit contributions
els+0.212
ps+0.180
ws+0.178
bones+0.177
 colored+0.167
Neg logit contributions
ete-0.337
 Sadly-0.321
 couldn-0.317
So-0.315
 Int-0.311
Token.
Token ID13
Feature activation-6.03e-02

Pos logit contributions
els+0.122
ps+0.091
bones+0.090
ries+0.084
cks+0.083
Neg logit contributions
 Sadly-0.210
 Int-0.206
ete-0.200
So-0.199
Sadly-0.198
Token But
Token ID887
Feature activation-3.06e-02

Pos logit contributions
ete+0.089
rolling+0.088
 Int+0.083
mber+0.081
Sadly+0.080
Neg logit contributions
els-0.111
.-0.106
."-0.106
 on-0.103
ps-0.100
Token you
Token ID345
Feature activation+3.63e-04

Pos logit contributions
 Int+0.043
 Sadly+0.042
ete+0.042
Of+0.041
 Everything+0.041
Neg logit contributions
els-0.064
ps-0.057
ws-0.053
cks-0.052
 colored-0.052
Token have
Token ID423
Feature activation+1.65e-02

Pos logit contributions
els+0.001
 on+0.001
ps+0.001
 inside+0.000
.+0.000
Neg logit contributions
 Int-0.001
 Curious-0.001
ete-0.001
 Sadly-0.001
Sadly-0.001
Token treasure
Token ID14068
Feature activation+1.33e-01

Pos logit contributions
els+0.044
ps+0.038
ws+0.035
ries+0.034
cks+0.034
Neg logit contributions
 Sadly-0.042
 Int-0.042
 Curious-0.041
Sadly-0.040
ete-0.040
Token chest
Token ID7721
Feature activation+1.16e-01

Pos logit contributions
els+0.253
ws+0.226
ps+0.218
cks+0.200
ries+0.197
Neg logit contributions
ete-0.207
 Sadly-0.202
 Int-0.195
 couldn-0.195
 Curious-0.194
Token full
Token ID1336
Feature activation+6.81e-02

Pos logit contributions
els+0.226
ws+0.195
ps+0.190
ries+0.178
cks+0.168
Neg logit contributions
 couldn-0.190
ete-0.189
 Sadly-0.187
 Curious-0.180
 Int-0.180
Token of
Token ID286
Feature activation+6.15e-02

Pos logit contributions
els+0.144
ps+0.133
ws+0.129
cks+0.123
ries+0.116
Neg logit contributions
ete-0.096
 Int-0.094
 Sadly-0.092
 Everything-0.090
 Curious-0.089
Token shiny
Token ID22441
Feature activation+1.46e-01

Pos logit contributions
els+0.092
ps+0.081
ws+0.076
cks+0.071
ier+0.069
Neg logit contributions
 Curious-0.117
 Int-0.116
 Sadly-0.115
 Everything-0.114
ete-0.111
Token gold
Token ID3869
Feature activation+1.74e-01

Pos logit contributions
ps+0.221
cks+0.221
ws+0.218
ases+0.199
els+0.198
Neg logit contributions
 He-0.226
 Curious-0.212
 Int-0.200
 From-0.199
 So-0.198
Token coins
Token ID10796
Feature activation+1.65e-01

Pos logit contributions
els+0.262
ws+0.256
cks+0.241
ps+0.234
ases+0.228
Neg logit contributions
 He-0.275
ete-0.265
 couldn-0.258
hog-0.247
 So-0.237
Token!
Token ID0
Feature activation-1.03e-01

Pos logit contributions
els+0.242
ws+0.223
ps+0.205
ases+0.188
 channel+0.184
Neg logit contributions
 couldn-0.316
ete-0.299
 Barkley-0.273
 shouldn-0.273
 He-0.269
Token Tim
Token ID5045
Feature activation-2.65e-02

Pos logit contributions
ete+0.186
Sadly+0.183
ishly+0.178
Ol+0.178
rolling+0.177
Neg logit contributions
.-0.158
 inside-0.148
 on-0.147
."-0.143
els-0.133
Token,
Token ID11
Feature activation-3.12e-02

Pos logit contributions
 Int+0.062
ete+0.061
Sadly+0.060
 Sadly+0.060
 Everything+0.060
Neg logit contributions
els-0.030
 inside-0.027
 on-0.027
.-0.026
ps-0.025
Token Sally
Token ID25737
Feature activation-1.27e-02

Pos logit contributions
 Int+0.052
ete+0.051
 Sadly+0.050
Sadly+0.050
 Curious+0.048
Neg logit contributions
els-0.057
 on-0.053
 inside-0.051
.-0.050
ps-0.049
Token slide
Token ID10649
Feature activation+7.67e-02

Pos logit contributions
 Sadly+0.029
 Int+0.029
 Curious+0.029
Of+0.027
Sadly+0.027
Neg logit contributions
els-0.032
ps-0.027
cks-0.026
ws-0.026
 on-0.025
Token is
Token ID318
Feature activation+1.44e-02

Pos logit contributions
els+0.119
ps+0.102
 inside+0.096
 on+0.094
ws+0.093
Neg logit contributions
 Int-0.147
 Sadly-0.145
 Curious-0.139
ete-0.139
Sadly-0.138
Token not
Token ID407
Feature activation+2.73e-02

Pos logit contributions
els+0.032
ps+0.026
ries+0.026
ws+0.025
le+0.025
Neg logit contributions
 Sadly-0.018
Sadly-0.018
 couldn-0.018
 Int-0.017
 �-0.017
Token made
Token ID925
Feature activation+6.64e-02

Pos logit contributions
els+0.058
ps+0.046
ws+0.044
ries+0.044
bones+0.043
Neg logit contributions
 Int-0.037
 couldn-0.036
 Sadly-0.036
 Curious-0.034
 Everything-0.034
Token of
Token ID286
Feature activation+5.68e-02

Pos logit contributions
els+0.112
ps+0.091
ws+0.091
cks+0.088
bones+0.085
Neg logit contributions
 Everything-0.117
 Int-0.117
ete-0.116
 Sadly-0.113
Sadly-0.111
Token gold
Token ID3869
Feature activation+1.74e-01

Pos logit contributions
els+0.079
ps+0.067
ws+0.064
le+0.061
 on+0.061
Neg logit contributions
 Int-0.102
 Sadly-0.102
 Everything-0.100
ete-0.100
Sadly-0.097
Token,
Token ID11
Feature activation+2.30e-02

Pos logit contributions
els+0.232
cks+0.211
ps+0.210
ws+0.195
 colored+0.190
Neg logit contributions
 He-0.289
ete-0.277
 So-0.267
 couldn-0.267
-0.248
Token but
Token ID475
Feature activation-1.93e-02

Pos logit contributions
els+0.039
ps+0.033
 on+0.031
 inside+0.031
ws+0.030
Neg logit contributions
 Int-0.043
 Sadly-0.042
 Curious-0.041
ete-0.041
Sadly-0.040
Token it
Token ID340
Feature activation+1.28e-02

Pos logit contributions
 Int+0.029
 Sadly+0.029
 Everything+0.029
ete+0.029
 couldn+0.028
Neg logit contributions
els-0.035
ps-0.032
ws-0.030
cks-0.028
ries-0.027
Token is
Token ID318
Feature activation+7.53e-03

Pos logit contributions
els+0.020
ps+0.017
 on+0.017
 inside+0.016
ws+0.016
Neg logit contributions
 Int-0.025
 Sadly-0.024
ete-0.024
 Curious-0.024
Sadly-0.023
Token brilliant
Token ID10457
Feature activation+9.38e-02

Pos logit contributions
els+0.017
ries+0.014
ps+0.014
ases+0.013
ws+0.013
Neg logit contributions
 Sadly-0.009
 couldn-0.009
Sadly-0.009
 Int-0.008
ete-0.008
Token good
Token ID922
Feature activation+7.05e-02

Pos logit contributions
els+0.068
cks+0.053
bones+0.053
ps+0.053
ries+0.052
Neg logit contributions
 Sadly-0.046
Sadly-0.044
 couldn-0.043
 Everything-0.043
 Int-0.043
Token to
Token ID284
Feature activation+2.58e-02

Pos logit contributions
els+0.142
cks+0.127
bones+0.114
ws+0.107
ck+0.104
Neg logit contributions
 couldn-0.101
 Everything-0.095
 Sadly-0.095
 She-0.093
 He-0.093
Token have
Token ID423
Feature activation+5.58e-02

Pos logit contributions
els+0.051
ps+0.046
bones+0.043
ries+0.043
 on+0.042
Neg logit contributions
 Sadly-0.038
Sadly-0.038
 Int-0.036
 Everything-0.036
ete-0.034
Token so
Token ID523
Feature activation+2.76e-02

Pos logit contributions
els+0.099
cks+0.077
ps+0.074
ws+0.073
bones+0.073
Neg logit contributions
 Sadly-0.094
Sadly-0.089
 Everything-0.088
 Int-0.086
 couldn-0.085
Token much
Token ID881
Feature activation+1.07e-01

Pos logit contributions
els+0.059
ps+0.053
 colored+0.048
en+0.047
ries+0.047
Neg logit contributions
 Sadly-0.038
Sadly-0.037
 Everything-0.034
 Int-0.033
Of-0.032
Token clutter
Token ID45343
Feature activation+1.74e-01

Pos logit contributions
els+0.176
cks+0.162
bones+0.144
ps+0.139
et+0.137
Neg logit contributions
 Sadly-0.188
 Everything-0.181
 He-0.175
 Int-0.174
 She-0.174
Token.
Token ID13
Feature activation-7.02e-02

Pos logit contributions
els+0.235
cks+0.205
ck+0.190
ws+0.179
bones+0.177
Neg logit contributions
 couldn-0.298
 He-0.283
 She-0.272
ete-0.268
 Sadly-0.268
Token You
Token ID921
Feature activation+1.20e-02

Pos logit contributions
 Int+0.087
ete+0.084
 Sadly+0.083
 Curious+0.081
Sadly+0.080
Neg logit contributions
els-0.159
ps-0.143
 on-0.142
 inside-0.139
.-0.135
Token can
Token ID460
Feature activation-9.16e-03

Pos logit contributions
els+0.020
ps+0.017
 on+0.017
 inside+0.016
ws+0.016
Neg logit contributions
 Int-0.022
 Sadly-0.022
ete-0.021
 Curious-0.021
Sadly-0.021
Token trip
Token ID5296
Feature activation+1.13e-01

Pos logit contributions
 Int+0.016
 Sadly+0.016
 Curious+0.016
ete+0.015
Sadly+0.015
Neg logit contributions
els-0.016
ps-0.014
 on-0.014
 inside-0.014
.-0.013
Token and
Token ID290
Feature activation-3.02e-03

Pos logit contributions
els+0.185
ps+0.147
cks+0.140
ries+0.139
ws+0.131
Neg logit contributions
 Sadly-0.216
 Int-0.211
Sadly-0.205
ete-0.205
Of-0.204
Token -
Token ID532
Feature activation-1.71e-02

Pos logit contributions
els+0.127
ps+0.100
ws+0.093
cks+0.086
ries+0.086
Neg logit contributions
ete-0.246
 Int-0.241
 Sadly-0.240
 Curious-0.234
Sadly-0.230
Token a
Token ID257
Feature activation+2.84e-02

Pos logit contributions
 Int+0.034
 Sadly+0.033
ete+0.032
 Curious+0.032
Sadly+0.032
Neg logit contributions
els-0.027
ps-0.023
 on-0.022
 inside-0.021
ws-0.021
Token chest
Token ID7721
Feature activation+1.09e-01

Pos logit contributions
els+0.050
ps+0.043
ws+0.039
 on+0.039
ries+0.039
Neg logit contributions
 Int-0.052
 Sadly-0.052
 Curious-0.050
ete-0.050
Sadly-0.049
Token filled
Token ID5901
Feature activation+8.43e-02

Pos logit contributions
els+0.163
ps+0.134
ws+0.126
ries+0.120
 on+0.119
Neg logit contributions
 Int-0.218
ete-0.217
 Sadly-0.216
 Curious-0.211
Sadly-0.206
Token with
Token ID351
Feature activation+6.28e-02

Pos logit contributions
els+0.103
ps+0.086
ws+0.075
ries+0.069
 on+0.068
Neg logit contributions
ete-0.191
 Sadly-0.190
 Int-0.189
 Curious-0.184
Sadly-0.184
Token gold
Token ID3869
Feature activation+1.74e-01

Pos logit contributions
els+0.104
ps+0.088
ws+0.081
cks+0.079
 on+0.079
Neg logit contributions
 Int-0.114
 Sadly-0.114
 Curious-0.112
ete-0.112
Sadly-0.108
Token and
Token ID290
Feature activation+6.81e-03

Pos logit contributions
els+0.258
ws+0.212
cks+0.211
ps+0.209
ries+0.193
Neg logit contributions
ete-0.335
 Int-0.305
 Curious-0.302
 �-0.294
 He-0.291
Token treasures
Token ID29561
Feature activation+1.54e-01

Pos logit contributions
els+0.011
ps+0.009
ws+0.009
 on+0.008
ries+0.008
Neg logit contributions
 Int-0.013
 Sadly-0.013
 Curious-0.013
ete-0.012
Sadly-0.012
Token!
Token ID0
Feature activation-9.93e-02

Pos logit contributions
els+0.186
ws+0.138
ps+0.136
cks+0.123
ries+0.122
Neg logit contributions
ete-0.351
 Int-0.340
 Curious-0.333
 Sadly-0.331
 couldn-0.329
Token<|endoftext|>
Token ID50256
Feature activation-3.01e-02

Pos logit contributions
 Int+0.155
Sadly+0.155
rolling+0.149
 Curious+0.145
ete+0.145
Neg logit contributions
.-0.166
 on-0.159
els-0.153
."-0.149
 inside-0.143
TokenOnce
Token ID7454
Feature activation-9.54e-02

Pos logit contributions
 Int+0.047
 Sadly+0.046
 Curious+0.044
ete+0.044
Sadly+0.043
Neg logit contributions
els-0.060
ps-0.053
 on-0.051
 inside-0.050
ws-0.049
Token friendship
Token ID14738
Feature activation+9.47e-02

Pos logit contributions
 Sadly+0.013
ete+0.013
 Int+0.013
 Curious+0.012
 Everything+0.012
Neg logit contributions
els-0.012
ps-0.010
ries-0.009
 on-0.009
 inside-0.009
Token are
Token ID389
Feature activation+3.17e-02

Pos logit contributions
els+0.110
ries+0.075
ps+0.073
cks+0.072
bones+0.070
Neg logit contributions
ete-0.215
 Everything-0.211
 Sadly-0.210
 couldn-0.205
 Int-0.205
Token more
Token ID517
Feature activation+6.95e-02

Pos logit contributions
els+0.062
ps+0.051
ries+0.051
ws+0.049
cks+0.047
Neg logit contributions
ete-0.045
 couldn-0.045
 Sadly-0.044
Sadly-0.043
 From-0.042
Token precious
Token ID14186
Feature activation+1.21e-01

Pos logit contributions
els+0.142
ries+0.121
ps+0.117
bones+0.106
ws+0.106
Neg logit contributions
 From-0.106
 shouldn-0.103
So-0.103
 Sadly-0.102
From-0.101
Token than
Token ID621
Feature activation+5.73e-02

Pos logit contributions
els+0.189
cks+0.167
ps+0.154
bones+0.146
ws+0.141
Neg logit contributions
 couldn-0.209
 So-0.204
 From-0.204
 He-0.203
ete-0.202
Token gold
Token ID3869
Feature activation+1.73e-01

Pos logit contributions
els+0.095
ps+0.076
ws+0.075
cks+0.075
 colored+0.070
Neg logit contributions
ete-0.096
 couldn-0.093
 Sadly-0.092
 Everything-0.091
 Int-0.090
Token."
Token ID526
Feature activation-1.27e-01

Pos logit contributions
els+0.208
cks+0.193
ps+0.190
bones+0.185
 colored+0.173
Neg logit contributions
ete-0.311
 He-0.307
 couldn-0.292
 So-0.286
 shouldn-0.277
Token
Token ID198
Feature activation-8.91e-02

Pos logit contributions
 Int+0.223
ete+0.220
rolling+0.217
Sadly+0.212
 Curious+0.207
Neg logit contributions
.-0.220
 on-0.203
els-0.197
."-0.193
 inside-0.193
Token
Token ID198
Feature activation-1.09e-01

Pos logit contributions
 Int+0.164
ishly+0.158
 Curious+0.155
rolling+0.154
ete+0.152
Neg logit contributions
.-0.154
 on-0.144
els-0.138
 inside-0.136
."-0.127
TokenL
Token ID43
Feature activation-3.43e-02

Pos logit contributions
 Int+0.202
 Curious+0.191
 Everything+0.184
ete+0.184
rolling+0.183
Neg logit contributions
els-0.179
.-0.172
 on-0.170
 inside-0.157
 colored-0.151
Tokenily
Token ID813
Feature activation-4.55e-02

Pos logit contributions
 Int+0.060
 Sadly+0.059
ete+0.058
 Curious+0.057
Sadly+0.057
Neg logit contributions
els-0.062
ps-0.053
 on-0.052
 inside-0.051
ws-0.049
Token Maybe
Token ID6674
Feature activation+1.18e-02

Pos logit contributions
ete+0.070
 Int+0.069
Sadly+0.067
 Sadly+0.067
 Curious+0.063
Neg logit contributions
els-0.133
 on-0.122
 inside-0.121
ps-0.120
.-0.117
Token we
Token ID356
Feature activation+7.08e-03

Pos logit contributions
ps+0.028
els+0.028
ws+0.027
bones+0.025
 channel+0.025
Neg logit contributions
 Int-0.011
ete-0.011
 couldn-0.011
Of-0.010
 Curious-0.010
Token can
Token ID460
Feature activation+5.00e-04

Pos logit contributions
els+0.014
ps+0.012
ries+0.012
bones+0.011
cks+0.011
Neg logit contributions
 Sadly-0.011
 Int-0.010
ete-0.010
 Curious-0.010
 Everything-0.010
Token find
Token ID1064
Feature activation+7.38e-02

Pos logit contributions
els+0.001
ps+0.001
ries+0.001
bones+0.001
 on+0.001
Neg logit contributions
 Int-0.001
ete-0.001
 Sadly-0.001
 Curious-0.001
Sadly-0.001
Token more
Token ID517
Feature activation+9.19e-02

Pos logit contributions
els+0.129
ps+0.116
ge+0.111
ws+0.109
bones+0.109
Neg logit contributions
 Int-0.113
 Everything-0.109
ete-0.107
Of-0.103
 Curious-0.102
Token treasures
Token ID29561
Feature activation+1.73e-01

Pos logit contributions
ps+0.153
ier+0.149
els+0.145
 spills+0.134
 cutter+0.134
Neg logit contributions
 From-0.144
 So-0.143
 Int-0.142
 Curious-0.141
 Everywhere-0.140
Token."
Token ID526
Feature activation-1.09e-01

Pos logit contributions
els+0.270
ps+0.226
cks+0.220
bones+0.219
ws+0.214
Neg logit contributions
ete-0.299
 couldn-0.281
Of-0.280
 She-0.265
 He-0.264
Token
Token ID198
Feature activation-6.97e-02

Pos logit contributions
Sadly+0.170
 Int+0.165
ete+0.160
 Sadly+0.157
 Curious+0.157
Neg logit contributions
els-0.201
.-0.200
 on-0.193
 inside-0.183
."-0.172
Token
Token ID198
Feature activation-8.64e-02

Pos logit contributions
 Int+0.113
 Curious+0.109
 Sadly+0.107
 Everything+0.103
ete+0.102
Neg logit contributions
els-0.130
.-0.128
 on-0.123
 inside-0.118
ps-0.112
TokenThey
Token ID2990
Feature activation-3.72e-02

Pos logit contributions
 Int+0.147
 Curious+0.140
 Sadly+0.140
 Everything+0.140
 Everywhere+0.129
Neg logit contributions
els-0.166
 on-0.150
.-0.146
 inside-0.142
 cream-0.138
Token closed
Token ID4838
Feature activation+6.70e-02

Pos logit contributions
 Int+0.059
 Sadly+0.057
ete+0.056
 Curious+0.055
Sadly+0.055
Neg logit contributions
els-0.074
 on-0.065
ps-0.064
 inside-0.064
.-0.060
Token the
Token ID262
Feature activation+1.07e-02

Pos logit contributions
els+0.220
ps+0.205
ws+0.181
cks+0.180
bones+0.178
Neg logit contributions
ete-0.207
 Sadly-0.193
 Everything-0.192
 Int-0.186
Sadly-0.185
Token village
Token ID7404
Feature activation+1.12e-01

Pos logit contributions
els+0.021
ps+0.019
ws+0.018
cks+0.017
 on+0.017
Neg logit contributions
 Sadly-0.017
 Int-0.017
 Curious-0.017
Sadly-0.016
ete-0.016
Token back
Token ID736
Feature activation+5.52e-02

Pos logit contributions
els+0.141
ps+0.113
ws+0.102
ries+0.094
 colored+0.094
Neg logit contributions
 Sadly-0.249
 Int-0.249
ete-0.245
 Curious-0.242
Sadly-0.236
Token to
Token ID284
Feature activation+2.49e-02

Pos logit contributions
els+0.079
ps+0.067
ws+0.058
cks+0.057
 colored+0.057
Neg logit contributions
 Sadly-0.111
 Int-0.109
ete-0.108
 Curious-0.107
 Everything-0.105
Token its
Token ID663
Feature activation+3.23e-02

Pos logit contributions
els+0.044
ps+0.036
ries+0.034
ws+0.033
 on+0.033
Neg logit contributions
 Sadly-0.044
 Everything-0.043
ete-0.043
Sadly-0.042
 Int-0.042
Token former
Token ID1966
Feature activation+1.72e-01

Pos logit contributions
els+0.061
ps+0.053
ws+0.049
ries+0.048
cks+0.048
Neg logit contributions
 Sadly-0.054
ete-0.051
 Int-0.051
Sadly-0.051
 Curious-0.050
Token glory
Token ID13476
Feature activation+1.25e-01

Pos logit contributions
els+0.316
 on+0.298
ps+0.281
 inside+0.278
 sticker+0.267
Neg logit contributions
 Sadly-0.275
ete-0.261
Sadly-0.255
 Int-0.250
 Everywhere-0.246
Token!"
Token ID2474
Feature activation-1.14e-01

Pos logit contributions
els+0.128
ps+0.098
ws+0.091
cks+0.079
ries+0.079
Neg logit contributions
ete-0.301
 Int-0.293
 Sadly-0.292
 Curious-0.286
 Everything-0.285
Token
Token ID220
Feature activation-5.85e-02

Pos logit contributions
 Int+0.123
Sadly+0.121
ete+0.121
 Sadly+0.116
 Curious+0.114
Neg logit contributions
els-0.261
 on-0.249
.-0.248
 inside-0.236
ps-0.235
Token
Token ID198
Feature activation-7.53e-02

Pos logit contributions
 Int+0.085
Da+0.084
Of+0.084
 Curious+0.083
Sadly+0.080
Neg logit contributions
els-0.110
.-0.107
 on-0.102
ps-0.101
 inside-0.100
Token
Token ID198
Feature activation-8.50e-02

Pos logit contributions
ishly+0.128
 Int+0.127
 Curious+0.120
ete+0.120
 Sadly+0.117
Neg logit contributions
.-0.119
 on-0.117
els-0.110
."-0.106
 inside-0.102
Token chest
Token ID7721
Feature activation+1.00e-01

Pos logit contributions
els+0.012
ps+0.010
ries+0.010
ws+0.009
 colored+0.009
Neg logit contributions
 Int-0.012
 Curious-0.012
 Sadly-0.012
ete-0.011
Sadly-0.011
Token and
Token ID290
Feature activation-3.60e-02

Pos logit contributions
els+0.141
ps+0.117
ws+0.110
ries+0.101
cks+0.099
Neg logit contributions
 Sadly-0.215
 Int-0.215
ete-0.209
 Curious-0.209
Sadly-0.201
Token found
Token ID1043
Feature activation+4.58e-02

Pos logit contributions
 Int+0.068
 Sadly+0.067
ete+0.065
 Curious+0.065
Sadly+0.065
Neg logit contributions
els-0.059
 on-0.051
 inside-0.050
ps-0.050
.-0.046
Token lots
Token ID6041
Feature activation+1.07e-01

Pos logit contributions
els+0.072
ps+0.060
ws+0.056
 on+0.056
ries+0.054
Neg logit contributions
 Int-0.090
 Sadly-0.089
ete-0.087
 Curious-0.086
 Everything-0.084
Token of
Token ID286
Feature activation+7.61e-02

Pos logit contributions
els+0.233
ps+0.225
ws+0.217
cks+0.207
 channel+0.189
Neg logit contributions
 Int-0.128
ete-0.128
 Curious-0.123
 From-0.122
 Sadly-0.118
Token gold
Token ID3869
Feature activation+1.72e-01

Pos logit contributions
els+0.115
ps+0.105
ws+0.093
 cutter+0.088
cks+0.088
Neg logit contributions
 Int-0.147
 Curious-0.145
 Sadly-0.140
ete-0.135
 Everything-0.134
Token coins
Token ID10796
Feature activation+1.61e-01

Pos logit contributions
 channel+0.234
els+0.230
ws+0.224
ps+0.214
cks+0.204
Neg logit contributions
ete-0.289
 He-0.273
 couldn-0.261
hog-0.256
He-0.247
Token and
Token ID290
Feature activation+2.85e-03

Pos logit contributions
els+0.191
ws+0.166
 channel+0.162
ps+0.156
 colored+0.130
Neg logit contributions
ete-0.347
 couldn-0.338
rolling-0.298
 Barkley-0.297
 Int-0.297
Token shiny
Token ID22441
Feature activation+1.45e-01

Pos logit contributions
els+0.004
ps+0.003
ws+0.003
cks+0.003
 on+0.003
Neg logit contributions
 Int-0.006
 Curious-0.006
 Sadly-0.006
ete-0.006
Sadly-0.006
Token jewels
Token ID42701
Feature activation+1.54e-01

Pos logit contributions
 channel+0.214
cks+0.199
ps+0.198
ws+0.198
els+0.193
Neg logit contributions
 He-0.216
 From-0.195
 His-0.194
 Curious-0.194
 Int-0.190
Token.
Token ID13
Feature activation-1.11e-01

Pos logit contributions
els+0.194
ws+0.169
ps+0.142
 channel+0.141
cks+0.130
Neg logit contributions
 couldn-0.336
ete-0.322
hog-0.304
 Int-0.297
ishly-0.296
Token with
Token ID351
Feature activation-2.47e-03

Pos logit contributions
els+0.042
ps+0.031
ws+0.027
cks+0.027
 on+0.026
Neg logit contributions
 Sadly-0.095
 Int-0.095
ete-0.094
 Curious-0.093
Sadly-0.092
Token me
Token ID502
Feature activation+2.60e-02

Pos logit contributions
 Sadly+0.005
ete+0.005
 Int+0.005
 Everything+0.004
Sadly+0.004
Neg logit contributions
els-0.004
ps-0.003
 inside-0.003
 on-0.003
ws-0.003
Token,
Token ID11
Feature activation-5.08e-02

Pos logit contributions
els+0.023
ps+0.017
ws+0.014
 inside+0.013
ries+0.013
Neg logit contributions
 Int-0.067
 Sadly-0.067
ete-0.067
 Curious-0.065
Sadly-0.064
Token Mom
Token ID11254
Feature activation-4.32e-02

Pos logit contributions
 Int+0.095
 Sadly+0.093
ete+0.091
 Curious+0.090
Sadly+0.090
Neg logit contributions
els-0.082
 on-0.075
ps-0.072
 inside-0.072
.-0.070
Tokenmy
Token ID1820
Feature activation-2.76e-02

Pos logit contributions
 Int+0.096
 Sadly+0.096
ete+0.093
 Curious+0.093
 Everything+0.091
Neg logit contributions
els-0.055
ps-0.044
 on-0.040
ws-0.039
 inside-0.039
Token."
Token ID526
Feature activation-1.56e-01

Pos logit contributions
 Int+0.063
 Sadly+0.062
 Curious+0.061
ete+0.061
Sadly+0.060
Neg logit contributions
els-0.034
ps-0.027
 on-0.025
 inside-0.024
ws-0.024
Token Her
Token ID2332
Feature activation-7.15e-02

Pos logit contributions
 Int+0.227
ete+0.226
Sadly+0.223
 Curious+0.208
 Sadly+0.206
Neg logit contributions
.-0.296
 on-0.283
els-0.281
."-0.269
 inside-0.266
Token mom
Token ID1995
Feature activation-4.93e-02

Pos logit contributions
 Int+0.122
 Sadly+0.119
ete+0.116
 Curious+0.115
Sadly+0.115
Neg logit contributions
els-0.131
ps-0.115
 on-0.113
 inside-0.111
.-0.107
Token smiled
Token ID13541
Feature activation+7.39e-03

Pos logit contributions
 Int+0.092
 Sadly+0.089
ete+0.088
 Curious+0.087
Sadly+0.087
Neg logit contributions
els-0.083
ps-0.072
 on-0.071
 inside-0.069
.-0.067
Token back
Token ID736
Feature activation-1.68e-02

Pos logit contributions
els+0.009
ps+0.007
 on+0.007
ws+0.006
 inside+0.006
Neg logit contributions
 Int-0.017
 Sadly-0.017
 Curious-0.017
ete-0.017
Sadly-0.016
Token and
Token ID290
Feature activation-7.15e-02

Pos logit contributions
 From+0.031
 Curious+0.030
 Sadly+0.030
 �+0.029
 Int+0.029
Neg logit contributions
els-0.028
ws-0.024
ps-0.024
bones-0.023
cks-0.023
Token it
Token ID340
Feature activation+2.49e-02

Pos logit contributions
els+0.124
ps+0.106
cks+0.097
ws+0.096
ries+0.088
Neg logit contributions
ete-0.164
 Sadly-0.162
 Int-0.161
 Everything-0.160
 Curious-0.155
Token to
Token ID284
Feature activation-2.16e-02

Pos logit contributions
els+0.040
ps+0.033
ries+0.030
ws+0.030
cks+0.029
Neg logit contributions
 Int-0.047
 Sadly-0.047
ete-0.046
 Curious-0.045
Sadly-0.045
Token its
Token ID663
Feature activation-4.67e-03

Pos logit contributions
 Int+0.043
 Sadly+0.043
ete+0.043
 Curious+0.042
Sadly+0.042
Neg logit contributions
els-0.032
ps-0.027
 on-0.025
 inside-0.025
ws-0.024
Token mom
Token ID1995
Feature activation+8.20e-03

Pos logit contributions
 Sadly+0.008
 Int+0.008
ete+0.008
Sadly+0.008
 Curious+0.008
Neg logit contributions
els-0.008
ps-0.007
ws-0.007
ries-0.007
 on-0.007
Tokenmy
Token ID1820
Feature activation+1.99e-02

Pos logit contributions
els+0.010
ps+0.008
ws+0.007
cks+0.007
ries+0.007
Neg logit contributions
 Sadly-0.018
 Int-0.018
ete-0.018
 Curious-0.018
 Everything-0.017
Token."
Token ID526
Feature activation-1.56e-01

Pos logit contributions
els+0.022
ps+0.018
ws+0.015
cks+0.015
ries+0.014
Neg logit contributions
 Sadly-0.046
 Int-0.046
ete-0.045
 Curious-0.045
 Everything-0.044
Token
Token ID220
Feature activation-9.07e-02

Pos logit contributions
 Int+0.240
ete+0.237
Sadly+0.235
 Curious+0.223
rolling+0.221
Neg logit contributions
.-0.289
els-0.265
 on-0.265
."-0.252
 inside-0.247
Token
Token ID198
Feature activation-1.08e-01

Pos logit contributions
 Int+0.143
 Curious+0.139
Sadly+0.138
Of+0.136
 Sadly+0.134
Neg logit contributions
els-0.168
.-0.162
 on-0.152
 inside-0.148
."-0.145
Token
Token ID198
Feature activation-1.17e-01

Pos logit contributions
 Int+0.181
ishly+0.179
 Curious+0.175
ete+0.174
 Sadly+0.173
Neg logit contributions
.-0.173
els-0.163
 on-0.154
."-0.152
 inside-0.141
TokenTwe
Token ID32665
Feature activation-4.52e-02

Pos logit contributions
 Int+0.175
 Curious+0.167
 Everything+0.167
 Sadly+0.166
ishly+0.165
Neg logit contributions
els-0.200
.-0.193
 on-0.183
 colored-0.177
."-0.173
Tokenety
Token ID2963
Feature activation-4.81e-02

Pos logit contributions
 Int+0.074
 Sadly+0.072
 Curious+0.070
Sadly+0.069
ete+0.069
Neg logit contributions
els-0.088
ps-0.075
 on-0.073
 inside-0.073
ws-0.071
Token have
Token ID423
Feature activation-4.30e-03

Pos logit contributions
 Int+0.011
 Sadly+0.010
ete+0.010
 Curious+0.010
Sadly+0.010
Neg logit contributions
els-0.010
ps-0.009
 on-0.009
 inside-0.009
ws-0.008
Token to
Token ID284
Feature activation-3.12e-02

Pos logit contributions
 Sadly+0.008
 Int+0.008
ete+0.008
Sadly+0.008
 Curious+0.008
Neg logit contributions
els-0.007
ps-0.006
ws-0.005
cks-0.005
 inside-0.005
Token say
Token ID910
Feature activation+2.07e-03

Pos logit contributions
 Sadly+0.052
 Int+0.052
ete+0.051
Sadly+0.050
 Curious+0.049
Neg logit contributions
els-0.057
ps-0.048
 on-0.046
ries-0.045
 inside-0.045
Token sorry
Token ID7926
Feature activation+8.98e-03

Pos logit contributions
els+0.004
ps+0.003
ws+0.003
 inside+0.003
 on+0.003
Neg logit contributions
 Int-0.003
 Sadly-0.003
ete-0.003
 Curious-0.003
 Everything-0.003
Token first
Token ID717
Feature activation+1.43e-02

Pos logit contributions
els+0.013
ps+0.010
cks+0.009
ries+0.009
bones+0.009
Neg logit contributions
 Int-0.019
 Sadly-0.019
ete-0.018
 Curious-0.018
 Everything-0.018
Token."
Token ID526
Feature activation-1.55e-01

Pos logit contributions
els+0.016
cks+0.010
ps+0.010
ws+0.010
ries+0.010
Neg logit contributions
 Sadly-0.034
 Everything-0.034
ete-0.034
 couldn-0.034
 Int-0.033
Token
Token ID198
Feature activation-1.24e-01

Pos logit contributions
 Int+0.247
Sadly+0.244
ete+0.243
rolling+0.237
 Curious+0.230
Neg logit contributions
.-0.284
 on-0.256
els-0.252
."-0.250
 inside-0.236
Token
Token ID198
Feature activation-1.29e-01

Pos logit contributions
 Int+0.211
ishly+0.206
 Curious+0.201
ete+0.199
 Sadly+0.198
Neg logit contributions
.-0.213
els-0.204
 on-0.191
."-0.186
 inside-0.183
TokenL
Token ID43
Feature activation-5.91e-02

Pos logit contributions
 Int+0.224
 Everything+0.215
 Curious+0.212
ete+0.211
 Sadly+0.210
Neg logit contributions
els-0.205
.-0.196
 on-0.186
aks-0.178
 sticker-0.175
Tokenily
Token ID813
Feature activation-7.00e-02

Pos logit contributions
 Int+0.090
ete+0.089
 Sadly+0.088
Sadly+0.088
 Curious+0.084
Neg logit contributions
els-0.115
 on-0.101
ps-0.099
 inside-0.098
ries-0.094
Token did
Token ID750
Feature activation-4.36e-02

Pos logit contributions
ete+0.144
 Int+0.139
Sadly+0.136
rolling+0.136
 Sadly+0.135
Neg logit contributions
els-0.098
.-0.092
 on-0.091
ps-0.090
 inside-0.087
Token much
Token ID881
Feature activation+9.38e-03

Pos logit contributions
 Int+0.171
 Sadly+0.164
Sadly+0.160
ete+0.159
 Curious+0.158
Neg logit contributions
els-0.167
 on-0.166
.-0.159
 inside-0.159
ps-0.150
Token fun
Token ID1257
Feature activation+1.83e-02

Pos logit contributions
els+0.020
ps+0.018
ws+0.017
cks+0.017
bones+0.015
Neg logit contributions
 Int-0.014
 Sadly-0.014
Of-0.014
 Curious-0.014
ete-0.014
Token playing
Token ID2712
Feature activation+3.47e-02

Pos logit contributions
els+0.023
ps+0.018
ws+0.017
cks+0.015
ries+0.014
Neg logit contributions
 Int-0.041
 Sadly-0.040
ete-0.040
 Curious-0.039
 Everything-0.039
Token with
Token ID351
Feature activation-1.01e-02

Pos logit contributions
els+0.044
ps+0.034
ws+0.031
ries+0.030
cks+0.029
Neg logit contributions
 Int-0.078
 Sadly-0.077
 Curious-0.075
ete-0.075
Sadly-0.075
Token it
Token ID340
Feature activation+1.12e-02

Pos logit contributions
 Int+0.019
 Sadly+0.019
ete+0.019
 Curious+0.018
 Everything+0.018
Neg logit contributions
els-0.016
ps-0.013
ws-0.013
 on-0.012
 inside-0.012
Token."
Token ID526
Feature activation-1.54e-01

Pos logit contributions
els+0.010
ps+0.007
ws+0.006
ries+0.006
cks+0.006
Neg logit contributions
 Sadly-0.029
 Int-0.029
ete-0.029
 Curious-0.028
Sadly-0.028
Token
Token ID220
Feature activation-9.70e-02

Pos logit contributions
 Int+0.231
Sadly+0.228
ete+0.223
rolling+0.214
 Curious+0.214
Neg logit contributions
.-0.293
 on-0.271
els-0.268
."-0.257
 inside-0.247
Token
Token ID198
Feature activation-1.13e-01

Pos logit contributions
 Int+0.153
Sadly+0.149
 Curious+0.148
 Sadly+0.144
ete+0.143
Neg logit contributions
els-0.174
.-0.172
 on-0.164
 inside-0.157
."-0.155
Token
Token ID198
Feature activation-1.18e-01

Pos logit contributions
 Int+0.192
ishly+0.185
 Curious+0.184
 Sadly+0.183
rolling+0.181
Neg logit contributions
.-0.184
els-0.167
 on-0.162
."-0.161
 inside-0.146
TokenL
Token ID43
Feature activation-6.11e-02

Pos logit contributions
 Int+0.178
 Sadly+0.167
 Everything+0.167
 Curious+0.167
ishly+0.163
Neg logit contributions
.-0.198
els-0.195
 on-0.185
."-0.177
 colored-0.173
Tokenily
Token ID813
Feature activation-6.46e-02

Pos logit contributions
 Int+0.103
 Sadly+0.100
ete+0.099
Sadly+0.097
 Curious+0.097
Neg logit contributions
els-0.113
ps-0.098
 on-0.097
 inside-0.095
ws-0.092
Token can
Token ID460
Feature activation-4.94e-02

Pos logit contributions
ete+0.069
Of+0.066
 Int+0.065
rolling+0.065
mber+0.063
Neg logit contributions
 on-0.058
 inside-0.057
els-0.056
.-0.055
ps-0.052
Token share
Token ID2648
Feature activation+4.62e-02

Pos logit contributions
 Int+0.091
 Sadly+0.089
ete+0.087
 Curious+0.087
Sadly+0.086
Neg logit contributions
els-0.083
ps-0.072
 on-0.071
 inside-0.069
.-0.066
Token and
Token ID290
Feature activation-2.99e-02

Pos logit contributions
els+0.078
ps+0.075
bones+0.067
ws+0.065
ier+0.065
Neg logit contributions
Da-0.068
Of-0.068
From-0.068
 From-0.068
 Her-0.067
Token play
Token ID711
Feature activation+3.54e-02

Pos logit contributions
 Sadly+0.048
 Int+0.047
Sadly+0.046
 Curious+0.045
Of+0.044
Neg logit contributions
els-0.058
ps-0.050
ries-0.046
 on-0.046
ws-0.045
Token together
Token ID1978
Feature activation+3.83e-02

Pos logit contributions
els+0.057
ps+0.044
ries+0.043
cks+0.040
ws+0.040
Neg logit contributions
 Sadly-0.069
 Int-0.068
Sadly-0.068
Of-0.067
From-0.066
Token."
Token ID526
Feature activation-1.54e-01

Pos logit contributions
els+0.052
ps+0.040
bones+0.039
ries+0.038
cks+0.035
Neg logit contributions
So-0.078
ete-0.077
 Sadly-0.076
 couldn-0.076
From-0.075
Token Blue
Token ID4518
Feature activation-5.53e-02

Pos logit contributions
 Int+0.244
Sadly+0.243
ete+0.241
 Curious+0.228
rolling+0.227
Neg logit contributions
.-0.282
els-0.264
 on-0.259
."-0.249
 inside-0.242
Token was
Token ID373
Feature activation-4.84e-02

Pos logit contributions
 Int+0.110
ete+0.107
 Sadly+0.106
Sadly+0.105
 Curious+0.103
Neg logit contributions
els-0.085
.-0.076
ps-0.076
 on-0.075
 inside-0.073
Token happy
Token ID3772
Feature activation-3.82e-03

Pos logit contributions
 Int+0.076
 Sadly+0.073
ete+0.072
 Curious+0.071
Sadly+0.070
Neg logit contributions
els-0.095
ps-0.085
 on-0.084
 inside-0.082
.-0.079
Token,
Token ID11
Feature activation-5.32e-02

Pos logit contributions
 Int+0.008
 Sadly+0.008
 Curious+0.008
ete+0.008
Sadly+0.007
Neg logit contributions
els-0.006
ps-0.005
ws-0.004
 on-0.004
cks-0.004
Token and
Token ID290
Feature activation-7.83e-02

Pos logit contributions
ete+0.130
 Int+0.128
Sadly+0.126
 Sadly+0.126
 Everything+0.124
Neg logit contributions
 on-0.054
els-0.049
.-0.048
 inside-0.047
 than-0.043
Token knee
Token ID10329
Feature activation+6.24e-02

Pos logit contributions
els+0.000
ps+0.000
 on+0.000
ws+0.000
 inside+0.000
Neg logit contributions
 Int-0.000
 Sadly-0.000
ete-0.000
 Curious-0.000
Sadly-0.000
Token for
Token ID329
Feature activation+4.54e-03

Pos logit contributions
els+0.064
ps+0.045
ws+0.038
cks+0.038
ries+0.037
Neg logit contributions
 Int-0.156
 Sadly-0.155
 Curious-0.151
Sadly-0.149
ete-0.148
Token a
Token ID257
Feature activation+3.81e-03

Pos logit contributions
els+0.009
ws+0.007
ps+0.007
packs+0.007
 colored+0.007
Neg logit contributions
 Int-0.007
Sadly-0.007
ete-0.006
 Sadly-0.006
 Everything-0.006
Token little
Token ID1310
Feature activation-3.06e-03

Pos logit contributions
els+0.007
ps+0.006
ries+0.006
ws+0.006
cks+0.005
Neg logit contributions
 Int-0.007
 Sadly-0.007
 Curious-0.007
Sadly-0.006
ete-0.006
Token while
Token ID981
Feature activation-1.65e-02

Pos logit contributions
 Int+0.005
 Sadly+0.005
 Curious+0.005
Of+0.005
Sadly+0.005
Neg logit contributions
els-0.006
ps-0.005
cks-0.005
ries-0.005
bones-0.005
Token."
Token ID526
Feature activation-1.54e-01

Pos logit contributions
 Int+0.030
 Sadly+0.029
ete+0.029
 Curious+0.029
Sadly+0.028
Neg logit contributions
els-0.028
ps-0.024
ws-0.023
 colored-0.022
cks-0.021
Token Mom
Token ID11254
Feature activation-7.09e-02

Pos logit contributions
Sadly+0.228
 Int+0.226
ete+0.221
rolling+0.212
 Curious+0.209
Neg logit contributions
.-0.288
 on-0.268
els-0.265
."-0.254
 inside-0.247
Tokenmy
Token ID1820
Feature activation-5.77e-02

Pos logit contributions
 Int+0.123
 Sadly+0.121
Sadly+0.119
ete+0.119
 Curious+0.117
Neg logit contributions
els-0.127
 on-0.111
ps-0.111
 inside-0.108
.-0.107
Token said
Token ID531
Feature activation-4.06e-02

Pos logit contributions
 Sadly+0.097
 Int+0.096
ete+0.094
Sadly+0.094
 Curious+0.091
Neg logit contributions
els-0.105
 on-0.094
ps-0.093
.-0.092
 inside-0.090
Token.
Token ID13
Feature activation-1.37e-01

Pos logit contributions
 Int+0.102
 Sadly+0.101
 Curious+0.099
ete+0.099
Sadly+0.097
Neg logit contributions
els-0.043
ps-0.031
ws-0.029
 on-0.027
 inside-0.027
Token
Token ID198
Feature activation-1.03e-01

Pos logit contributions
ete+0.202
Sadly+0.202
 Int+0.190
 Sadly+0.185
 Barkley+0.178
Neg logit contributions
.-0.254
 on-0.241
els-0.240
 inside-0.229
."-0.227
Token sorry
Token ID7926
Feature activation-1.58e-02

Pos logit contributions
els+0.061
ps+0.050
ws+0.049
ries+0.047
cks+0.046
Neg logit contributions
 Int-0.053
 Curious-0.050
 Sadly-0.050
ete-0.049
Sadly-0.048
Token for
Token ID329
Feature activation+2.19e-03

Pos logit contributions
 Int+0.042
 Sadly+0.042
ete+0.041
 Curious+0.041
Sadly+0.041
Neg logit contributions
els-0.014
ps-0.010
 on-0.010
 inside-0.009
ws-0.008
Token not
Token ID407
Feature activation-1.42e-02

Pos logit contributions
els+0.004
ws+0.003
ps+0.003
 colored+0.003
packs+0.003
Neg logit contributions
 Sadly-0.003
Sadly-0.003
ete-0.003
 Int-0.003
 Everything-0.003
Token sharing
Token ID7373
Feature activation+7.23e-02

Pos logit contributions
 Sadly+0.023
 Int+0.022
Sadly+0.022
 Everything+0.021
 Curious+0.021
Neg logit contributions
els-0.028
ps-0.022
ries-0.021
cks-0.021
bones-0.020
Token before
Token ID878
Feature activation-5.46e-03

Pos logit contributions
els+0.094
ps+0.078
ws+0.067
 colored+0.066
cks+0.066
Neg logit contributions
Of-0.135
 Sadly-0.135
Don-0.134
ete-0.134
 His-0.133
Token."
Token ID526
Feature activation-1.54e-01

Pos logit contributions
 Sadly+0.011
 Int+0.010
ete+0.010
Sadly+0.010
 Curious+0.010
Neg logit contributions
els-0.008
ps-0.007
ws-0.007
 colored-0.006
 inside-0.006
Token
Token ID198
Feature activation-1.23e-01

Pos logit contributions
 Int+0.238
Sadly+0.235
ete+0.235
rolling+0.223
 Curious+0.220
Neg logit contributions
.-0.286
 on-0.266
els-0.257
."-0.252
 inside-0.246
Token
Token ID198
Feature activation-1.27e-01

Pos logit contributions
 Int+0.208
ishly+0.203
 Curious+0.198
ete+0.194
 Sadly+0.193
Neg logit contributions
.-0.210
els-0.196
 on-0.189
."-0.180
 inside-0.179
TokenHer
Token ID9360
Feature activation-7.47e-02

Pos logit contributions
 Int+0.218
 Everything+0.208
 Curious+0.206
ishly+0.204
 Sadly+0.201
Neg logit contributions
.-0.193
els-0.192
 on-0.182
aks-0.178
."-0.169
Token friend
Token ID1545
Feature activation-3.74e-02

Pos logit contributions
 Int+0.125
 Sadly+0.121
ete+0.118
 Curious+0.118
Sadly+0.116
Neg logit contributions
els-0.141
ps-0.123
 on-0.121
 inside-0.118
ws-0.113
Token was
Token ID373
Feature activation-3.93e-02

Pos logit contributions
 Int+0.069
Sadly+0.069
 Sadly+0.068
ete+0.067
 Curious+0.065
Neg logit contributions
els-0.060
 on-0.056
.-0.055
 inside-0.053
ps-0.053
Token play
Token ID711
Feature activation+3.47e-02

Pos logit contributions
 Int+0.087
 Sadly+0.085
ete+0.083
 Curious+0.083
Sadly+0.082
Neg logit contributions
els-0.078
ps-0.067
 on-0.065
 inside-0.063
ws-0.061
Token with
Token ID351
Feature activation-4.67e-03

Pos logit contributions
els+0.049
ps+0.038
ries+0.036
ws+0.034
cks+0.033
Neg logit contributions
 Int-0.074
 Sadly-0.074
Sadly-0.072
 Curious-0.072
ete-0.071
Token our
Token ID674
Feature activation-1.71e-02

Pos logit contributions
 Int+0.008
ete+0.008
 Sadly+0.008
Sadly+0.008
 Everything+0.008
Neg logit contributions
els-0.008
ps-0.007
ws-0.006
 on-0.006
 inside-0.006
Token shapes
Token ID15268
Feature activation+8.64e-02

Pos logit contributions
 Sadly+0.036
 Int+0.036
ete+0.035
 Curious+0.035
Sadly+0.034
Neg logit contributions
els-0.022
ps-0.018
ws-0.017
 on-0.016
 inside-0.016
Token together
Token ID1978
Feature activation+5.06e-02

Pos logit contributions
els+0.106
ps+0.078
ws+0.073
bones+0.072
cks+0.071
Neg logit contributions
ete-0.190
 Sadly-0.185
 Int-0.184
Sadly-0.178
 Curious-0.177
Token."
Token ID526
Feature activation-1.54e-01

Pos logit contributions
els+0.067
ps+0.051
bones+0.051
ries+0.048
cks+0.046
Neg logit contributions
ete-0.107
 couldn-0.106
So-0.103
 Sadly-0.102
 Int-0.100
Token
Token ID198
Feature activation-1.10e-01

Pos logit contributions
Sadly+0.241
 Int+0.239
ete+0.236
 Curious+0.224
rolling+0.223
Neg logit contributions
.-0.285
els-0.265
 on-0.263
."-0.253
 inside-0.243
Token
Token ID198
Feature activation-1.18e-01

Pos logit contributions
ishly+0.202
 Int+0.195
 Curious+0.190
 Sadly+0.187
 Everything+0.182
Neg logit contributions
.-0.180
els-0.168
 on-0.160
."-0.154
 inside-0.147
TokenThey
Token ID2990
Feature activation-7.03e-02

Pos logit contributions
 Int+0.200
 Everything+0.197
 Curious+0.194
 Sadly+0.193
ishly+0.193
Neg logit contributions
els-0.192
.-0.177
 on-0.172
aks-0.168
 sticker-0.163
Token played
Token ID2826
Feature activation+2.62e-02

Pos logit contributions
 Int+0.128
 Sadly+0.124
 Curious+0.121
ete+0.120
Sadly+0.120
Neg logit contributions
els-0.122
ps-0.107
 on-0.106
 inside-0.103
.-0.100
Token with
Token ID351
Feature activation+1.08e-03

Pos logit contributions
els+0.035
ps+0.026
ries+0.025
ws+0.024
 inside+0.022
Neg logit contributions
 Int-0.059
 Sadly-0.058
 Curious-0.057
Sadly-0.057
ete-0.056
Token want
Token ID765
Feature activation+2.17e-02

Pos logit contributions
 Int+0.052
 Sadly+0.051
 Curious+0.050
ete+0.050
Sadly+0.049
Neg logit contributions
els-0.059
ps-0.051
 on-0.047
 inside-0.047
ws-0.047
Token to
Token ID284
Feature activation-2.17e-02

Pos logit contributions
els+0.039
ps+0.034
ws+0.031
 inside+0.031
 colored+0.030
Neg logit contributions
 Int-0.037
 Sadly-0.036
ete-0.036
 Everything-0.035
 Curious-0.035
Token play
Token ID711
Feature activation+4.64e-02

Pos logit contributions
 Int+0.034
 Sadly+0.034
ete+0.033
Sadly+0.033
 Curious+0.032
Neg logit contributions
els-0.042
ps-0.036
 on-0.034
ws-0.034
 inside-0.034
Token with
Token ID351
Feature activation+1.16e-02

Pos logit contributions
els+0.066
ps+0.051
ws+0.048
ries+0.047
cks+0.045
Neg logit contributions
 Int-0.098
 Sadly-0.096
Sadly-0.094
 Curious-0.093
 Everything-0.092
Token me
Token ID502
Feature activation+3.69e-02

Pos logit contributions
els+0.018
ps+0.016
ws+0.015
 on+0.014
 inside+0.014
Neg logit contributions
 Int-0.021
 Sadly-0.020
ete-0.020
Sadly-0.020
 Everything-0.020
Token."
Token ID526
Feature activation-1.53e-01

Pos logit contributions
els+0.040
ps+0.032
bones+0.027
ws+0.027
cks+0.025
Neg logit contributions
 Int-0.087
 Sadly-0.085
ete-0.084
 Curious-0.083
 couldn-0.081
Token
Token ID198
Feature activation-1.15e-01

Pos logit contributions
ete+0.238
Sadly+0.237
 Int+0.233
rolling+0.220
 Curious+0.217
Neg logit contributions
.-0.286
 on-0.261
els-0.258
."-0.249
 inside-0.240
Token
Token ID198
Feature activation-1.22e-01

Pos logit contributions
ishly+0.199
 Int+0.194
 Curious+0.187
ete+0.184
 Sadly+0.183
Neg logit contributions
.-0.191
els-0.180
 on-0.169
."-0.163
 inside-0.158
TokenMom
Token ID29252
Feature activation-5.41e-02

Pos logit contributions
 Int+0.197
 Everything+0.192
ishly+0.190
 Sadly+0.188
 Curious+0.187
Neg logit contributions
els-0.197
.-0.189
 on-0.177
aks-0.170
 than-0.165
Token kissed
Token ID27946
Feature activation+1.80e-02

Pos logit contributions
 Int+0.102
 Sadly+0.101
ete+0.099
Sadly+0.099
 Curious+0.098
Neg logit contributions
els-0.090
ps-0.077
 on-0.076
 inside-0.074
.-0.072
Token Anna
Token ID11735
Feature activation-3.63e-03

Pos logit contributions
els+0.029
ps+0.025
 on+0.024
 inside+0.023
.+0.022
Neg logit contributions
 Int-0.035
 Sadly-0.034
ete-0.034
 Curious-0.034
Sadly-0.033
Token won
Token ID1839
Feature activation+4.67e-03

Pos logit contributions
ete+0.086
Da+0.083
 Int+0.082
Sadly+0.080
mber+0.079
Neg logit contributions
els-0.136
 on-0.128
 inside-0.127
.-0.122
."-0.122
Token't
Token ID470
Feature activation-2.00e-02

Pos logit contributions
els+0.006
ps+0.005
 on+0.004
 inside+0.004
ws+0.004
Neg logit contributions
 Int-0.011
 Sadly-0.011
ete-0.010
 Curious-0.010
Sadly-0.010
Token do
Token ID466
Feature activation+5.57e-03

Pos logit contributions
 Int+0.034
 Sadly+0.033
ete+0.032
 Curious+0.032
Sadly+0.032
Neg logit contributions
els-0.037
ps-0.032
 on-0.031
 inside-0.030
ws-0.030
Token it
Token ID340
Feature activation+3.25e-02

Pos logit contributions
els+0.008
ps+0.007
 on+0.006
ws+0.006
 inside+0.006
Neg logit contributions
 Int-0.012
 Sadly-0.011
ete-0.011
 Curious-0.011
Sadly-0.011
Token again
Token ID757
Feature activation+2.18e-02

Pos logit contributions
els+0.038
ps+0.027
ws+0.025
ries+0.024
cks+0.023
Neg logit contributions
 Int-0.077
 Sadly-0.075
ete-0.074
 Curious-0.074
Sadly-0.073
Token."
Token ID526
Feature activation-1.53e-01

Pos logit contributions
els+0.023
ps+0.018
ws+0.015
bones+0.014
cks+0.013
Neg logit contributions
 Int-0.052
 Sadly-0.052
 Curious-0.052
ete-0.051
 couldn-0.050
Token
Token ID198
Feature activation-1.12e-01

Pos logit contributions
 Int+0.227
Sadly+0.225
ete+0.223
rolling+0.212
 Curious+0.211
Neg logit contributions
.-0.289
 on-0.265
els-0.264
."-0.252
 inside-0.245
Token
Token ID198
Feature activation-1.18e-01

Pos logit contributions
 Int+0.189
ishly+0.186
 Curious+0.181
rolling+0.175
 Sadly+0.174
Neg logit contributions
.-0.188
els-0.176
 on-0.166
."-0.157
 inside-0.156
TokenHer
Token ID9360
Feature activation-7.41e-02

Pos logit contributions
 Int+0.176
ishly+0.170
 Curious+0.166
 Everything+0.166
 Sadly+0.160
Neg logit contributions
.-0.202
els-0.201
 on-0.186
aks-0.178
 inside-0.175
Token mom
Token ID1995
Feature activation-4.14e-02

Pos logit contributions
 Int+0.126
 Sadly+0.122
 Curious+0.119
ete+0.119
Sadly+0.118
Neg logit contributions
els-0.137
ps-0.119
 on-0.118
 inside-0.116
.-0.112
Token hugged
Token ID41130
Feature activation+1.88e-03

Pos logit contributions
 Int+0.069
 Sadly+0.068
ete+0.067
Sadly+0.066
 Curious+0.065
Neg logit contributions
els-0.078
ps-0.068
 on-0.068
 inside-0.066
.-0.066
Token wants
Token ID3382
Feature activation-1.92e-03

Pos logit contributions
els+0.067
ps+0.058
 on+0.057
 inside+0.055
ws+0.053
Neg logit contributions
 Int-0.075
 Sadly-0.073
ete-0.071
 Curious-0.071
Sadly-0.070
Token to
Token ID284
Feature activation-3.06e-02

Pos logit contributions
 Int+0.004
 Sadly+0.003
ete+0.003
 Curious+0.003
Sadly+0.003
Neg logit contributions
els-0.003
ps-0.003
ws-0.003
 on-0.003
 inside-0.003
Token play
Token ID711
Feature activation+3.50e-02

Pos logit contributions
 Int+0.054
 Sadly+0.052
ete+0.052
 Curious+0.051
Sadly+0.050
Neg logit contributions
els-0.054
ps-0.047
 on-0.045
 inside-0.044
ws-0.043
Token with
Token ID351
Feature activation+2.58e-03

Pos logit contributions
els+0.052
ps+0.042
ws+0.039
ries+0.037
cks+0.036
Neg logit contributions
 Int-0.072
 Sadly-0.070
 Curious-0.069
Sadly-0.068
 Everything-0.068
Token us
Token ID514
Feature activation+3.75e-02

Pos logit contributions
els+0.004
ps+0.004
ws+0.004
 on+0.003
 inside+0.003
Neg logit contributions
 Int-0.005
 Sadly-0.004
ete-0.004
 Curious-0.004
 Everything-0.004
Token."
Token ID526
Feature activation-1.53e-01

Pos logit contributions
els+0.034
ps+0.025
ws+0.021
bones+0.021
cks+0.019
Neg logit contributions
 Int-0.094
 Sadly-0.092
ete-0.092
 Curious-0.091
Of-0.089
Token
Token ID198
Feature activation-1.17e-01

Pos logit contributions
Sadly+0.247
ete+0.246
 Int+0.241
rolling+0.234
 Sadly+0.229
Neg logit contributions
.-0.279
els-0.257
 on-0.256
."-0.243
 inside-0.235
Token
Token ID198
Feature activation-1.24e-01

Pos logit contributions
 Int+0.184
ishly+0.180
 Sadly+0.177
ete+0.177
 Curious+0.176
Neg logit contributions
els-0.212
.-0.209
 on-0.196
 inside-0.187
."-0.184
TokenTom
Token ID13787
Feature activation-6.97e-02

Pos logit contributions
 Int+0.211
 Sadly+0.205
 Everything+0.203
 Curious+0.201
ete+0.201
Neg logit contributions
els-0.207
.-0.190
 on-0.183
aks-0.172
 sticker-0.171
Token comes
Token ID2058
Feature activation-1.43e-02

Pos logit contributions
ete+0.161
 Sadly+0.160
 Int+0.159
Sadly+0.159
 Curious+0.153
Neg logit contributions
els-0.087
ps-0.074
 on-0.071
.-0.069
 inside-0.066
Token back
Token ID736
Feature activation-4.48e-03

Pos logit contributions
 Int+0.030
 Curious+0.029
 Sadly+0.028
ete+0.027
 Everything+0.027
Neg logit contributions
els-0.021
ws-0.017
ps-0.017
ries-0.016
 frost-0.016
Token wanted
Token ID2227
Feature activation-2.49e-02

Pos logit contributions
ete+0.177
 Int+0.173
Sadly+0.164
 Curious+0.161
 Sadly+0.158
Neg logit contributions
els-0.267
 on-0.254
.-0.253
 inside-0.247
ps-0.233
Token to
Token ID284
Feature activation-4.59e-02

Pos logit contributions
 Int+0.050
 Sadly+0.049
ete+0.048
 Curious+0.048
Sadly+0.047
Neg logit contributions
els-0.038
ps-0.032
 on-0.032
 inside-0.031
ws-0.029
Token make
Token ID787
Feature activation+2.57e-02

Pos logit contributions
 Int+0.087
 Sadly+0.085
 Curious+0.083
ete+0.083
Sadly+0.082
Neg logit contributions
els-0.075
ps-0.065
 on-0.064
 inside-0.062
.-0.059
Token new
Token ID649
Feature activation+4.62e-02

Pos logit contributions
els+0.038
ps+0.032
 on+0.031
ws+0.030
 inside+0.029
Neg logit contributions
 Int-0.052
ete-0.050
 Sadly-0.050
 Curious-0.049
 Everything-0.049
Token friends
Token ID2460
Feature activation+4.02e-02

Pos logit contributions
els+0.070
ps+0.062
ws+0.060
 on+0.055
cks+0.055
Neg logit contributions
 Int-0.088
 Curious-0.085
 Sadly-0.083
ete-0.082
 Everything-0.081
Token."
Token ID526
Feature activation-1.53e-01

Pos logit contributions
els+0.042
ps+0.032
ws+0.030
ries+0.024
cks+0.024
Neg logit contributions
 Int-0.099
 Curious-0.097
ete-0.095
 Sadly-0.094
 Everything-0.093
Token
Token ID220
Feature activation-9.27e-02

Pos logit contributions
Sadly+0.227
 Int+0.225
ete+0.218
rolling+0.208
 Curious+0.207
Neg logit contributions
.-0.297
els-0.272
 on-0.272
."-0.257
 inside-0.250
Token
Token ID198
Feature activation-1.02e-01

Pos logit contributions
 Int+0.143
Sadly+0.140
 Curious+0.139
 Sadly+0.138
Of+0.133
Neg logit contributions
els-0.174
.-0.170
 on-0.160
 inside-0.152
."-0.151
Token
Token ID198
Feature activation-1.12e-01

Pos logit contributions
 Int+0.164
ishly+0.159
 Sadly+0.159
 Curious+0.156
rolling+0.155
Neg logit contributions
.-0.176
els-0.166
 on-0.160
."-0.152
 inside-0.144
TokenL
Token ID43
Feature activation-6.25e-02

Pos logit contributions
 Int+0.154
 Sadly+0.150
ete+0.145
ishly+0.144
 Everything+0.144
Neg logit contributions
els-0.205
.-0.203
 on-0.192
 colored-0.183
."-0.179
Tokenily
Token ID813
Feature activation-6.66e-02

Pos logit contributions
 Int+0.103
 Sadly+0.100
ete+0.100
 Curious+0.098
Sadly+0.097
Neg logit contributions
els-0.117
ps-0.102
 on-0.101
 inside-0.098
ws-0.095
Token deserve
Token ID10925
Feature activation+5.05e-02

Pos logit contributions
 Int+0.051
 Sadly+0.050
 Curious+0.049
ete+0.048
Sadly+0.048
Neg logit contributions
els-0.064
ps-0.053
ries-0.051
ws-0.048
 on-0.047
Token to
Token ID284
Feature activation-1.10e-02

Pos logit contributions
els+0.087
ps+0.069
ws+0.067
 colored+0.067
ries+0.066
Neg logit contributions
ete-0.083
 Everything-0.081
 Int-0.080
 Sadly-0.078
 Barkley-0.077
Token ride
Token ID6594
Feature activation+8.72e-02

Pos logit contributions
 Sadly+0.017
ete+0.017
 Int+0.017
Sadly+0.017
 Everything+0.017
Neg logit contributions
els-0.021
ps-0.017
 on-0.017
ws-0.017
ries-0.017
Token on
Token ID319
Feature activation+2.05e-02

Pos logit contributions
els+0.101
ps+0.079
ws+0.074
 colored+0.067
 frost+0.066
Neg logit contributions
 Int-0.197
ete-0.193
 Sadly-0.193
 Curious-0.190
 Everything-0.188
Token me
Token ID502
Feature activation+3.26e-02

Pos logit contributions
els+0.030
ps+0.026
ws+0.024
 inside+0.022
 colored+0.022
Neg logit contributions
 Int-0.041
 Sadly-0.041
 Curious-0.040
ete-0.039
Sadly-0.039
Token."
Token ID526
Feature activation-1.53e-01

Pos logit contributions
els+0.030
ps+0.022
ws+0.019
 colored+0.018
ries+0.018
Neg logit contributions
 Int-0.085
 Sadly-0.083
ete-0.083
 Curious-0.082
Sadly-0.081
Token
Token ID198
Feature activation-1.17e-01

Pos logit contributions
 Int+0.234
Sadly+0.233
ete+0.232
rolling+0.224
 Curious+0.219
Neg logit contributions
.-0.284
 on-0.254
els-0.252
."-0.245
 inside-0.235
Token
Token ID198
Feature activation-1.24e-01

Pos logit contributions
ishly+0.210
 Int+0.204
 Curious+0.197
rolling+0.193
ete+0.190
Neg logit contributions
.-0.191
els-0.175
 on-0.165
."-0.162
 inside-0.156
TokenTim
Token ID14967
Feature activation-6.60e-02

Pos logit contributions
 Int+0.200
ishly+0.194
 Everything+0.193
 Curious+0.191
 Sadly+0.187
Neg logit contributions
els-0.198
.-0.193
 on-0.178
aks-0.178
."-0.169
Token and
Token ID290
Feature activation-9.13e-02

Pos logit contributions
 Int+0.124
ete+0.122
 Sadly+0.119
Sadly+0.119
 Curious+0.116
Neg logit contributions
els-0.108
ps-0.096
 on-0.096
.-0.094
 inside-0.091
Token Mia
Token ID32189
Feature activation-4.83e-02

Pos logit contributions
 Int+0.176
ete+0.171
Sadly+0.171
 Sadly+0.171
 Curious+0.165
Neg logit contributions
els-0.145
 on-0.129
ps-0.127
 inside-0.125
.-0.124
Token't
Token ID470
Feature activation-4.66e-02

Pos logit contributions
els+0.141
cks+0.113
ps+0.112
ws+0.107
bones+0.101
Neg logit contributions
 Sadly-0.173
 Int-0.155
 Curious-0.152
ete-0.151
 So-0.147
Token mean
Token ID1612
Feature activation+2.59e-02

Pos logit contributions
 Int+0.077
 Sadly+0.075
 Curious+0.075
ete+0.075
 Everything+0.071
Neg logit contributions
els-0.090
ps-0.073
ries-0.070
ws-0.069
 colored-0.067
Token to
Token ID284
Feature activation-3.55e-02

Pos logit contributions
els+0.043
ps+0.037
 inside+0.033
 on+0.033
 colored+0.033
Neg logit contributions
 Int-0.048
 Sadly-0.046
ete-0.046
 Curious-0.046
 Everything-0.045
Token break
Token ID2270
Feature activation+9.57e-02

Pos logit contributions
 Int+0.064
 Sadly+0.063
ete+0.062
 Curious+0.061
Sadly+0.060
Neg logit contributions
els-0.061
ps-0.051
 on-0.050
 inside-0.049
ws-0.048
Token them
Token ID606
Feature activation+4.14e-02

Pos logit contributions
els+0.141
ps+0.122
ws+0.122
 colored+0.120
 sticker+0.119
Neg logit contributions
 Int-0.154
 Sadly-0.152
Da-0.151
Sadly-0.150
Don-0.148
Token."
Token ID526
Feature activation-1.53e-01

Pos logit contributions
els+0.047
ps+0.031
bones+0.031
ocol+0.031
ries+0.029
Neg logit contributions
 Sadly-0.096
 Int-0.093
 couldn-0.091
So-0.090
Sadly-0.089
Token
Token ID198
Feature activation-1.17e-01

Pos logit contributions
ete+0.236
 Int+0.234
Sadly+0.233
rolling+0.224
 Curious+0.218
Neg logit contributions
.-0.283
 on-0.258
els-0.257
."-0.246
 inside-0.238
Token
Token ID198
Feature activation-1.22e-01

Pos logit contributions
ishly+0.197
 Int+0.195
ete+0.187
 Curious+0.187
 Sadly+0.184
Neg logit contributions
.-0.198
els-0.191
 on-0.177
."-0.172
 inside-0.169
TokenTheir
Token ID14574
Feature activation-6.34e-02

Pos logit contributions
 Int+0.199
 Everything+0.191
 Sadly+0.189
 Curious+0.189
ete+0.189
Neg logit contributions
els-0.201
.-0.190
 on-0.180
aks-0.174
 inside-0.170
Token mom
Token ID1995
Feature activation-2.62e-02

Pos logit contributions
 Int+0.115
 Sadly+0.109
 Curious+0.108
ete+0.107
Sadly+0.105
Neg logit contributions
els-0.112
ps-0.099
 on-0.096
 inside-0.095
.-0.091
Token sighed
Token ID21893
Feature activation+9.63e-03

Pos logit contributions
 Int+0.052
ete+0.051
 Sadly+0.051
Sadly+0.049
 Curious+0.049
Neg logit contributions
els-0.041
ps-0.036
 on-0.035
 inside-0.034
.-0.033
Token won
Token ID1839
Feature activation+2.35e-02

Pos logit contributions
 Int+0.100
 Sadly+0.098
 Curious+0.095
ete+0.095
Sadly+0.094
Neg logit contributions
els-0.106
ps-0.092
 on-0.088
 inside-0.086
ws-0.084
Token't
Token ID470
Feature activation-9.50e-03

Pos logit contributions
els+0.056
ps+0.050
ws+0.049
 inside+0.048
 on+0.048
Neg logit contributions
 Int-0.026
 Sadly-0.025
 Curious-0.024
Sadly-0.023
ete-0.023
Token do
Token ID466
Feature activation+9.06e-03

Pos logit contributions
 Int+0.016
 Sadly+0.016
Sadly+0.016
ete+0.016
 Curious+0.015
Neg logit contributions
els-0.018
ps-0.015
 on-0.014
ries-0.014
 inside-0.014
Token that
Token ID326
Feature activation-9.47e-04

Pos logit contributions
els+0.014
ps+0.011
ws+0.010
 inside+0.010
 colored+0.010
Neg logit contributions
 Int-0.018
 Sadly-0.018
Sadly-0.017
ete-0.017
 Curious-0.017
Token again
Token ID757
Feature activation+1.56e-02

Pos logit contributions
 Int+0.002
 Sadly+0.002
 Curious+0.002
Sadly+0.002
 Everything+0.002
Neg logit contributions
els-0.001
ps-0.001
ws-0.001
cks-0.001
ries-0.001
Token."
Token ID526
Feature activation-1.53e-01

Pos logit contributions
els+0.019
ps+0.016
ws+0.013
bones+0.013
ries+0.012
Neg logit contributions
 Int-0.035
 Sadly-0.035
 Curious-0.034
 couldn-0.033
ete-0.033
Token<|endoftext|>
Token ID50256
Feature activation-7.34e-02

Pos logit contributions
 Int+0.222
ete+0.220
Sadly+0.218
rolling+0.210
 Curious+0.206
Neg logit contributions
.-0.292
els-0.264
 on-0.264
."-0.256
 inside-0.248
TokenL
Token ID43
Feature activation-8.35e-02

Pos logit contributions
 Int+0.088
 Sadly+0.085
 Curious+0.082
ete+0.081
Sadly+0.078
Neg logit contributions
els-0.173
ps-0.157
 on-0.150
 inside-0.149
ws-0.147
Tokenily
Token ID813
Feature activation-8.53e-02

Pos logit contributions
 Int+0.140
 Sadly+0.136
 Curious+0.133
ete+0.131
Sadly+0.131
Neg logit contributions
els-0.157
ps-0.138
 on-0.135
 inside-0.132
ws-0.127
Token liked
Token ID8288
Feature activation-4.10e-02

Pos logit contributions
 Int+0.165
 Sadly+0.161
ete+0.158
 Curious+0.157
Sadly+0.156
Neg logit contributions
els-0.136
ps-0.116
 on-0.114
 inside-0.112
.-0.107
Token to
Token ID284
Feature activation-5.79e-02

Pos logit contributions
 Int+0.082
 Sadly+0.080
ete+0.078
 Curious+0.078
Sadly+0.077
Neg logit contributions
els-0.063
ps-0.054
 on-0.052
 inside-0.051
ws-0.049
Token want
Token ID765
Feature activation-2.12e-03

Pos logit contributions
els+0.008
ps+0.007
ws+0.007
 on+0.006
 inside+0.006
Neg logit contributions
 Int-0.005
 Sadly-0.005
ete-0.005
 Curious-0.005
Sadly-0.005
Token to
Token ID284
Feature activation-2.86e-02

Pos logit contributions
 Int+0.004
 Sadly+0.004
ete+0.004
 Curious+0.004
Sadly+0.004
Neg logit contributions
els-0.004
ps-0.003
 on-0.003
 inside-0.003
ws-0.003
Token make
Token ID787
Feature activation+3.05e-02

Pos logit contributions
 Int+0.049
 Sadly+0.048
ete+0.048
 Curious+0.047
Sadly+0.046
Neg logit contributions
els-0.052
ps-0.045
 on-0.043
 inside-0.042
ws-0.041
Token a
Token ID257
Feature activation+1.90e-02

Pos logit contributions
els+0.045
ps+0.038
ws+0.036
 on+0.036
 inside+0.035
Neg logit contributions
 Int-0.060
ete-0.058
 Sadly-0.058
 Everything-0.057
 Curious-0.057
Token fort
Token ID6285
Feature activation+9.59e-02

Pos logit contributions
els+0.031
ps+0.027
ws+0.024
ries+0.024
cks+0.023
Neg logit contributions
 Sadly-0.036
 Int-0.035
Sadly-0.034
ete-0.034
 Curious-0.034
Token."
Token ID526
Feature activation-1.53e-01

Pos logit contributions
els+0.121
ps+0.088
ries+0.080
ws+0.080
cks+0.076
Neg logit contributions
 Sadly-0.211
 Int-0.206
ete-0.204
Sadly-0.203
 Everything-0.197
Token
Token ID198
Feature activation-1.13e-01

Pos logit contributions
Sadly+0.241
ete+0.235
 Int+0.232
rolling+0.222
 Sadly+0.218
Neg logit contributions
.-0.283
 on-0.262
els-0.259
."-0.247
 inside-0.240
Token
Token ID198
Feature activation-1.20e-01

Pos logit contributions
ishly+0.195
 Int+0.189
 Curious+0.182
 Sadly+0.182
ete+0.179
Neg logit contributions
.-0.192
els-0.185
 on-0.172
."-0.164
 inside-0.161
TokenM
Token ID44
Feature activation-8.55e-02

Pos logit contributions
 Int+0.194
 Sadly+0.191
 Everything+0.190
ishly+0.189
ete+0.186
Neg logit contributions
els-0.200
.-0.186
 on-0.177
aks-0.168
 inside-0.166
Tokenia
Token ID544
Feature activation-6.77e-02

Pos logit contributions
 Int+0.139
 Sadly+0.134
 Curious+0.132
Sadly+0.128
ete+0.126
Neg logit contributions
els-0.169
ps-0.145
 on-0.142
 inside-0.139
.-0.138
Token said
Token ID531
Feature activation-4.92e-02

Pos logit contributions
 Int+0.124
 Sadly+0.123
ete+0.122
Sadly+0.121
 Curious+0.118
Neg logit contributions
els-0.115
ps-0.100
 on-0.099
 inside-0.096
.-0.096
Token themselves
Token ID2405
Feature activation+2.22e-02

Pos logit contributions
els+0.025
ps+0.023
ws+0.022
cks+0.021
bones+0.019
Neg logit contributions
 Everything-0.026
 Int-0.026
 Sadly-0.025
 couldn-0.025
Of-0.025
Token for
Token ID329
Feature activation-1.29e-02

Pos logit contributions
els+0.024
ps+0.018
ws+0.016
cks+0.015
bones+0.015
Neg logit contributions
 Int-0.053
 Sadly-0.052
ete-0.052
 Curious-0.051
 couldn-0.050
Token who
Token ID508
Feature activation-3.64e-02

Pos logit contributions
ete+0.021
 Sadly+0.021
 Int+0.020
Sadly+0.020
 Everything+0.020
Neg logit contributions
els-0.023
ps-0.018
ws-0.018
ier-0.017
 inside-0.017
Token they
Token ID484
Feature activation-4.48e-02

Pos logit contributions
 Int+0.060
 Sadly+0.058
ete+0.057
 Curious+0.057
Sadly+0.056
Neg logit contributions
els-0.069
ps-0.060
 on-0.060
 inside-0.059
.-0.057
Token are
Token ID389
Feature activation+9.99e-04

Pos logit contributions
 Int+0.071
 Sadly+0.068
 Curious+0.067
ete+0.067
Sadly+0.066
Neg logit contributions
els-0.088
ps-0.078
 on-0.078
 inside-0.077
.-0.074
Token."
Token ID526
Feature activation-1.53e-01

Pos logit contributions
els+0.002
ws+0.002
ps+0.002
ries+0.001
cks+0.001
Neg logit contributions
 Sadly-0.002
ete-0.001
 Curious-0.001
 Everything-0.001
 Int-0.001
Token
Token ID198
Feature activation-1.18e-01

Pos logit contributions
 Int+0.238
Sadly+0.234
ete+0.232
 Curious+0.222
rolling+0.219
Neg logit contributions
.-0.283
els-0.268
 on-0.261
."-0.247
 inside-0.242
Token
Token ID198
Feature activation-1.23e-01

Pos logit contributions
 Int+0.202
ishly+0.200
 Curious+0.195
 Sadly+0.189
ete+0.187
Neg logit contributions
.-0.199
els-0.195
 on-0.176
."-0.170
 inside-0.168
TokenAb
Token ID4826
Feature activation-6.01e-02

Pos logit contributions
 Int+0.211
 Everything+0.203
 Curious+0.202
 Sadly+0.197
ishly+0.196
Neg logit contributions
els-0.201
.-0.188
 on-0.177
aks-0.169
 inside-0.166
Tokenbie
Token ID12590
Feature activation-6.39e-02

Pos logit contributions
 Int+0.107
 Sadly+0.106
ete+0.106
 Curious+0.102
Sadly+0.100
Neg logit contributions
els-0.101
ps-0.090
 on-0.087
 inside-0.085
ws-0.081
Token thought
Token ID1807
Feature activation-3.41e-02

Pos logit contributions
 Int+0.104
ete+0.100
 Sadly+0.099
 Curious+0.098
Sadly+0.097
Neg logit contributions
els-0.121
 on-0.108
ps-0.108
.-0.107
 inside-0.104
Token look
Token ID804
Feature activation+2.43e-02

Pos logit contributions
els+0.066
ps+0.049
ws+0.044
 inside+0.040
cks+0.040
Neg logit contributions
 Int-0.155
ete-0.153
 Sadly-0.152
 Curious-0.150
 Everything-0.148
Token pretty
Token ID2495
Feature activation+7.11e-02

Pos logit contributions
els+0.040
ps+0.034
ws+0.032
ries+0.030
 inside+0.030
Neg logit contributions
 Int-0.044
ete-0.044
 Sadly-0.043
 Curious-0.043
 Everything-0.041
Token and
Token ID290
Feature activation-3.53e-02

Pos logit contributions
els+0.078
ws+0.068
ps+0.067
cks+0.064
bones+0.056
Neg logit contributions
 Int-0.151
ete-0.150
 Curious-0.149
 couldn-0.147
mber-0.145
Token y
Token ID331
Feature activation+5.88e-02

Pos logit contributions
 Int+0.060
 Curious+0.058
 Sadly+0.057
ete+0.056
 Everything+0.055
Neg logit contributions
els-0.066
ps-0.057
ws-0.052
 on-0.052
 inside-0.051
Tokenummy
Token ID13513
Feature activation+4.28e-02

Pos logit contributions
els+0.148
ps+0.145
 on+0.141
ws+0.139
 sticker+0.138
Neg logit contributions
ete-0.052
 Sadly-0.038
 Int-0.036
 Everything-0.036
etsy-0.035
Token."
Token ID526
Feature activation-1.53e-01

Pos logit contributions
els+0.061
ps+0.047
cks+0.047
ws+0.046
bones+0.040
Neg logit contributions
ete-0.085
 Int-0.082
mber-0.081
 Curious-0.081
 Everything-0.080
Token
Token ID198
Feature activation-1.10e-01

Pos logit contributions
Sadly+0.241
 Int+0.240
ete+0.228
 Curious+0.222
 Sadly+0.222
Neg logit contributions
.-0.285
 on-0.266
els-0.262
."-0.249
 inside-0.243
Token
Token ID198
Feature activation-1.16e-01

Pos logit contributions
 Int+0.186
ishly+0.183
 Curious+0.179
 Sadly+0.175
rolling+0.170
Neg logit contributions
.-0.189
els-0.184
 on-0.170
."-0.163
 inside-0.160
TokenThey
Token ID2990
Feature activation-6.92e-02

Pos logit contributions
 Int+0.188
 Sadly+0.183
 Curious+0.179
 Everything+0.179
ishly+0.174
Neg logit contributions
els-0.202
.-0.188
 on-0.175
 sticker-0.166
 colored-0.166
Token add
Token ID751
Feature activation+6.08e-02

Pos logit contributions
 Int+0.122
 Sadly+0.118
 Curious+0.115
Sadly+0.114
ete+0.114
Neg logit contributions
els-0.123
ps-0.109
 on-0.108
 inside-0.105
.-0.103
Token everything
Token ID2279
Feature activation+4.64e-02

Pos logit contributions
els+0.101
cks+0.091
ps+0.090
ws+0.084
 harm+0.083
Neg logit contributions
 Int-0.091
 Everything-0.090
Of-0.088
 Sadly-0.087
 Everywhere-0.087
Token I
Token ID314
Feature activation-5.17e-02

Pos logit contributions
 Int+0.042
 Sadly+0.042
ete+0.041
 Curious+0.041
Sadly+0.041
Neg logit contributions
els-0.014
ps-0.010
 on-0.010
 inside-0.009
ws-0.008
Token couldn
Token ID3521
Feature activation-1.13e-02

Pos logit contributions
 Int+0.091
 Sadly+0.087
ete+0.087
 Curious+0.086
Sadly+0.085
Neg logit contributions
els-0.088
 on-0.081
ps-0.079
 inside-0.078
.-0.077
Token't
Token ID470
Feature activation-1.32e-02

Pos logit contributions
 Int+0.017
ete+0.016
 Curious+0.016
Of+0.016
 Sadly+0.016
Neg logit contributions
els-0.022
 on-0.022
.-0.021
ps-0.020
 inside-0.020
Token help
Token ID1037
Feature activation+2.07e-02

Pos logit contributions
 Int+0.025
 Sadly+0.025
 Curious+0.024
ete+0.024
Sadly+0.024
Neg logit contributions
els-0.021
ps-0.018
 on-0.018
 inside-0.018
.-0.017
Token sooner
Token ID14556
Feature activation+4.37e-02

Pos logit contributions
els+0.039
ps+0.034
ws+0.032
 on+0.031
 colored+0.031
Neg logit contributions
 Int-0.034
 Sadly-0.033
ete-0.033
 Curious-0.032
Sadly-0.032
Token."
Token ID526
Feature activation-1.52e-01

Pos logit contributions
els+0.047
ps+0.035
ws+0.033
bones+0.028
cks+0.028
Neg logit contributions
 Sadly-0.104
 Int-0.103
 Curious-0.101
Sadly-0.100
ete-0.100
Token Qu
Token ID2264
Feature activation-5.44e-02

Pos logit contributions
ete+0.224
 Int+0.222
Sadly+0.216
rolling+0.214
 Curious+0.204
Neg logit contributions
.-0.294
 on-0.270
."-0.258
els-0.254
 inside-0.250
Tokenackers
Token ID28874
Feature activation-4.69e-02

Pos logit contributions
 Int+0.086
 Curious+0.082
 Sadly+0.080
 Everything+0.080
Sadly+0.079
Neg logit contributions
els-0.100
.-0.096
 on-0.092
ws-0.092
ps-0.091
Token said
Token ID531
Feature activation-4.99e-02

Pos logit contributions
 Int+0.079
Sadly+0.074
 Sadly+0.074
ete+0.073
 Curious+0.073
Neg logit contributions
els-0.082
.-0.081
 on-0.078
ps-0.078
 inside-0.074
Token,
Token ID11
Feature activation-7.03e-02

Pos logit contributions
 Int+0.124
 Sadly+0.122
ete+0.120
 Curious+0.120
Sadly+0.119
Neg logit contributions
els-0.053
ps-0.041
 on-0.039
 inside-0.037
ws-0.035
Token "
Token ID366
Feature activation-8.02e-02

Pos logit contributions
 Int+0.093
ete+0.092
 Sadly+0.092
Sadly+0.091
 Curious+0.086
Neg logit contributions
 on-0.143
els-0.140
.-0.139
 inside-0.136
ps-0.134
Token with
Token ID351
Feature activation+1.29e-02

Pos logit contributions
els+0.072
ps+0.058
ws+0.052
ries+0.051
cks+0.049
Neg logit contributions
 Int-0.118
 Sadly-0.114
 Curious-0.113
Sadly-0.113
ete-0.112
Token your
Token ID534
Feature activation-7.48e-03

Pos logit contributions
els+0.023
ps+0.020
ws+0.019
 on+0.019
 inside+0.018
Neg logit contributions
 Int-0.021
ete-0.020
 Sadly-0.020
Sadly-0.020
 Curious-0.020
Token nice
Token ID3621
Feature activation+4.14e-02

Pos logit contributions
 Int+0.015
 Sadly+0.015
 Curious+0.015
ete+0.015
Sadly+0.014
Neg logit contributions
els-0.010
ps-0.009
ws-0.008
 on-0.008
 inside-0.008
Token rubber
Token ID14239
Feature activation+7.13e-02

Pos logit contributions
els+0.065
ps+0.057
cks+0.054
ws+0.052
bones+0.050
Neg logit contributions
 Int-0.077
 Curious-0.076
 Sadly-0.076
ete-0.076
 Everything-0.074
Token ball
Token ID2613
Feature activation+7.87e-02

Pos logit contributions
els+0.095
ps+0.079
ws+0.072
cks+0.070
 inside+0.069
Neg logit contributions
 Int-0.148
ete-0.147
 Sadly-0.146
 Curious-0.143
Of-0.142
Token."
Token ID526
Feature activation-1.52e-01

Pos logit contributions
els+0.079
ps+0.060
ws+0.050
cks+0.048
ries+0.046
Neg logit contributions
 Sadly-0.192
 Int-0.192
ete-0.191
 Curious-0.186
Sadly-0.185
Token
Token ID198
Feature activation-1.06e-01

Pos logit contributions
Sadly+0.242
 Int+0.239
ete+0.237
rolling+0.226
 Curious+0.223
Neg logit contributions
.-0.281
 on-0.257
els-0.253
."-0.246
 inside-0.237
Token
Token ID198
Feature activation-1.15e-01

Pos logit contributions
ishly+0.190
 Int+0.187
 Curious+0.182
 Sadly+0.178
rolling+0.175
Neg logit contributions
.-0.175
els-0.161
 on-0.153
."-0.147
 inside-0.141
TokenL
Token ID43
Feature activation-5.65e-02

Pos logit contributions
 Int+0.192
 Everything+0.189
 Sadly+0.185
 Curious+0.185
ishly+0.180
Neg logit contributions
els-0.187
.-0.180
 on-0.168
 sticker-0.160
aks-0.156
Tokenily
Token ID813
Feature activation-6.68e-02

Pos logit contributions
 Int+0.093
 Sadly+0.091
ete+0.090
Sadly+0.088
 Curious+0.088
Neg logit contributions
els-0.105
ps-0.092
 on-0.092
 inside-0.089
ws-0.085
Token thought
Token ID1807
Feature activation-3.48e-02

Pos logit contributions
ete+0.125
 Int+0.124
Sadly+0.121
 Sadly+0.121
 Everything+0.117
Neg logit contributions
els-0.107
.-0.097
 on-0.096
ps-0.094
 inside-0.093
Token,
Token ID11
Feature activation-2.15e-02

Pos logit contributions
 Int+0.025
 Sadly+0.024
 Curious+0.024
ete+0.024
 Everything+0.023
Neg logit contributions
els-0.012
ws-0.009
ps-0.008
ries-0.008
cks-0.007
Token pointing
Token ID10609
Feature activation+4.62e-02

Pos logit contributions
 Int+0.044
 Curious+0.042
 Sadly+0.041
ete+0.040
Sadly+0.040
Neg logit contributions
els-0.034
ps-0.027
ws-0.027
 on-0.026
 inside-0.025
Token to
Token ID284
Feature activation-6.89e-03

Pos logit contributions
els+0.066
ps+0.055
 on+0.052
ws+0.050
 inside+0.050
Neg logit contributions
 Int-0.097
 Sadly-0.094
 Curious-0.094
ete-0.094
 Everything-0.090
Token a
Token ID257
Feature activation+2.98e-02

Pos logit contributions
 Int+0.015
 Sadly+0.015
 Curious+0.014
ete+0.014
Sadly+0.014
Neg logit contributions
els-0.010
ps-0.008
 on-0.008
 inside-0.007
ws-0.007
Token skysc
Token ID37885
Feature activation+8.36e-02

Pos logit contributions
els+0.055
ps+0.049
ws+0.046
ries+0.044
cks+0.043
Neg logit contributions
 Int-0.051
 Curious-0.050
 Sadly-0.050
ete-0.048
Sadly-0.047
Tokenraper
Token ID38545
Feature activation+1.15e-01

Pos logit contributions
els+0.143
 on+0.131
ps+0.127
 inside+0.124
ws+0.120
Neg logit contributions
 Int-0.145
ete-0.139
 Sadly-0.138
Sadly-0.137
 Curious-0.134
Token.
Token ID13
Feature activation-1.01e-01

Pos logit contributions
els+0.122
ws+0.117
ps+0.105
cks+0.084
ishing+0.076
Neg logit contributions
 Int-0.251
 couldn-0.246
 He-0.239
 Curious-0.239
 Sadly-0.238
Token
Token ID198
Feature activation-6.72e-02

Pos logit contributions
Sadly+0.148
 Barkley+0.148
ishly+0.146
ete+0.145
Nut+0.141
Neg logit contributions
.-0.187
 on-0.180
els-0.173
 inside-0.170
ps-0.162
Token
Token ID198
Feature activation-8.54e-02

Pos logit contributions
 Int+0.102
ishly+0.100
 Curious+0.098
 Sadly+0.096
 Barkley+0.095
Neg logit contributions
els-0.124
.-0.121
 on-0.119
 inside-0.116
."-0.110
Token"
Token ID1
Feature activation-5.18e-02

Pos logit contributions
 Int+0.180
 Sadly+0.178
 Barkley+0.171
 Curious+0.171
ishly+0.168
Neg logit contributions
els-0.114
 on-0.106
 inside-0.100
.-0.100
ps-0.096
TokenWow
Token ID22017
Feature activation+1.37e-02

Pos logit contributions
ete+0.078
 Sadly+0.077
 Int+0.068
Sadly+0.068
ishly+0.067
Neg logit contributions
els-0.099
 on-0.096
 colored-0.091
 colors-0.090
 inside-0.089
Token decided
Token ID3066
Feature activation+2.55e-03

Pos logit contributions
 Int+0.089
Of+0.088
ete+0.087
 Sadly+0.085
Sadly+0.085
Neg logit contributions
 on-0.068
 inside-0.068
els-0.064
.-0.060
 colored-0.058
Token to
Token ID284
Feature activation-2.65e-02

Pos logit contributions
 on+0.003
 inside+0.003
els+0.003
.+0.003
ps+0.002
Neg logit contributions
 Int-0.006
 Sadly-0.006
ete-0.006
Sadly-0.006
 Curious-0.006
Token find
Token ID1064
Feature activation+4.40e-02

Pos logit contributions
 Int+0.055
Of+0.054
ete+0.052
 Sadly+0.052
 Curious+0.052
Neg logit contributions
els-0.036
 on-0.033
.-0.032
 inside-0.032
ps-0.030
Token an
Token ID281
Feature activation+4.74e-02

Pos logit contributions
els+0.075
ps+0.062
ries+0.057
ws+0.056
 on+0.056
Neg logit contributions
 Int-0.083
 Sadly-0.081
 Curious-0.080
ete-0.079
 Everything-0.078
Token o
Token ID267
Feature activation+5.54e-02

Pos logit contributions
els+0.083
ps+0.073
 on+0.070
 inside+0.068
ws+0.067
Neg logit contributions
 Int-0.084
 Sadly-0.082
 Curious-0.080
ete-0.080
Sadly-0.079
Tokenasis
Token ID17765
Feature activation+1.14e-01

Pos logit contributions
els+0.095
 on+0.082
ps+0.082
 inside+0.079
.+0.077
Neg logit contributions
 Int-0.100
 Sadly-0.098
 Curious-0.096
ete-0.095
Sadly-0.094
Token.
Token ID13
Feature activation-1.07e-01

Pos logit contributions
els+0.136
ps+0.105
ws+0.095
cks+0.083
 frost+0.082
Neg logit contributions
 Sadly-0.263
 Int-0.263
 Curious-0.258
ete-0.254
 couldn-0.249
Token He
Token ID679
Feature activation-4.26e-02

Pos logit contributions
ishly+0.157
Sadly+0.154
Ag+0.150
ete+0.147
Da+0.142
Neg logit contributions
.-0.210
 on-0.196
 inside-0.193
."-0.180
 sticker-0.163
Token followed
Token ID3940
Feature activation+3.23e-02

Pos logit contributions
 Int+0.074
ete+0.073
 Sadly+0.073
Sadly+0.072
 Everything+0.070
Neg logit contributions
els-0.075
 inside-0.069
 on-0.069
 colored-0.066
ps-0.065
Token his
Token ID465
Feature activation+5.21e-05

Pos logit contributions
els+0.054
ps+0.044
ws+0.043
ries+0.042
cks+0.040
Neg logit contributions
 Int-0.063
 Curious-0.060
 Sadly-0.059
ete-0.059
 Everything-0.057
Token nose
Token ID9686
Feature activation+7.42e-02

Pos logit contributions
els+0.000
ps+0.000
 on+0.000
 inside+0.000
.+0.000
Neg logit contributions
 Int-0.000
 Sadly-0.000
 Curious-0.000
ete-0.000
Sadly-0.000
Token paint
Token ID7521
Feature activation+1.28e-01

Pos logit contributions
els+0.031
ps+0.028
ws+0.028
bones+0.027
cks+0.027
Neg logit contributions
 Int-0.038
 Curious-0.037
Of-0.036
 Sadly-0.036
ete-0.036
Tokenbrush
Token ID32680
Feature activation+9.31e-02

Pos logit contributions
els+0.192
ps+0.187
ws+0.183
cks+0.179
 harm+0.167
Neg logit contributions
 Int-0.232
 Sadly-0.220
 Everywhere-0.219
 Curious-0.217
 Exc-0.213
Token and
Token ID290
Feature activation-2.36e-02

Pos logit contributions
els+0.134
ps+0.118
ws+0.112
cks+0.104
bones+0.100
Neg logit contributions
 Sadly-0.189
 Int-0.185
ete-0.180
Sadly-0.179
 Curious-0.178
Token some
Token ID617
Feature activation+7.11e-02

Pos logit contributions
 Int+0.037
 couldn+0.036
 Curious+0.035
 Sadly+0.034
Sadly+0.032
Neg logit contributions
els-0.043
ps-0.041
cks-0.039
ws-0.039
 channel-0.038
Token beautiful
Token ID4950
Feature activation+8.23e-02

Pos logit contributions
cks+0.092
ws+0.089
ps+0.088
els+0.088
bones+0.079
Neg logit contributions
 Int-0.138
Of-0.135
 Curious-0.134
 Everything-0.131
 Her-0.131
Token colors
Token ID7577
Feature activation+1.02e-01

Pos logit contributions
ws+0.125
cks+0.117
ps+0.115
els+0.100
ier+0.099
Neg logit contributions
 Int-0.146
 Her-0.145
 Curious-0.141
 Everything-0.140
 couldn-0.138
Token.
Token ID13
Feature activation-1.02e-01

Pos logit contributions
els+0.114
ws+0.106
cks+0.099
ps+0.091
 problems+0.090
Neg logit contributions
 couldn-0.216
 �-0.211
 She-0.210
 Int-0.206
 Her-0.206
Token With
Token ID2080
Feature activation+1.19e-02

Pos logit contributions
ete+0.113
Sadly+0.107
 Int+0.101
 Sadly+0.097
 Curious+0.091
Neg logit contributions
els-0.239
.-0.225
 on-0.225
 inside-0.222
."-0.207
Token a
Token ID257
Feature activation+2.92e-02

Pos logit contributions
els+0.018
ps+0.016
 inside+0.015
ws+0.015
 on+0.015
Neg logit contributions
 Int-0.023
ete-0.022
 Sadly-0.022
 Curious-0.022
 Everything-0.022
Token few
Token ID1178
Feature activation+7.33e-02

Pos logit contributions
els+0.055
ps+0.048
ws+0.045
ries+0.044
 inside+0.043
Neg logit contributions
 Int-0.051
 Curious-0.049
 Sadly-0.049
Sadly-0.048
ete-0.047
Token sweeps
Token ID46778
Feature activation+1.09e-01

Pos logit contributions
ws+0.093
els+0.093
bones+0.091
cks+0.089
ps+0.086
Neg logit contributions
 Int-0.141
 Curious-0.136
 Everything-0.131
Da-0.129
Sadly-0.128
Token m
Token ID285
Feature activation+6.10e-02

Pos logit contributions
els+0.085
ries+0.068
ws+0.068
cks+0.067
ps+0.067
Neg logit contributions
 Sadly-0.075
 Int-0.072
 Curious-0.071
 couldn-0.070
ete-0.070
Tokenittens
Token ID34978
Feature activation+9.87e-02

Pos logit contributions
els+0.137
ps+0.122
 on+0.120
 inside+0.117
.+0.114
Neg logit contributions
 Int-0.080
 Sadly-0.078
 Curious-0.075
ete-0.075
Sadly-0.074
Token.
Token ID13
Feature activation-8.71e-02

Pos logit contributions
els+0.083
ws+0.056
 colored+0.051
ps+0.050
cks+0.043
Neg logit contributions
ete-0.250
 Sadly-0.248
 couldn-0.245
ishly-0.242
 Int-0.240
Token They
Token ID1119
Feature activation-3.06e-02

Pos logit contributions
rolling+0.125
 Int+0.123
ete+0.122
Sadly+0.120
mber+0.118
Neg logit contributions
els-0.159
.-0.156
 on-0.155
 inside-0.152
."-0.141
Token had
Token ID550
Feature activation+1.12e-02

Pos logit contributions
 Int+0.051
ete+0.050
 Sadly+0.049
Sadly+0.048
 Curious+0.047
Neg logit contributions
els-0.057
 on-0.053
 inside-0.052
ps-0.051
.-0.050
Token disappeared
Token ID12120
Feature activation+1.14e-01

Pos logit contributions
els+0.019
ps+0.016
 on+0.016
 inside+0.015
ws+0.015
Neg logit contributions
 Int-0.021
 Sadly-0.021
ete-0.020
 Curious-0.020
Sadly-0.020
Token.
Token ID13
Feature activation-8.65e-02

Pos logit contributions
els+0.137
ws+0.110
 sticker+0.095
bones+0.094
ps+0.092
Neg logit contributions
ete-0.239
 Int-0.239
 Sadly-0.234
 couldn-0.234
 Curious-0.233
Token
Token ID198
Feature activation-6.36e-02

Pos logit contributions
 Int+0.118
ete+0.115
Sadly+0.112
rolling+0.111
 Curious+0.108
Neg logit contributions
els-0.171
 on-0.162
.-0.161
 inside-0.158
ps-0.150
Token
Token ID198
Feature activation-8.35e-02

Pos logit contributions
 Int+0.099
 Curious+0.093
ete+0.093
ishly+0.091
 Sadly+0.089
Neg logit contributions
els-0.120
.-0.115
 on-0.113
 inside-0.112
ps-0.103
TokenThey
Token ID2990
Feature activation-3.23e-02

Pos logit contributions
 Int+0.147
 Curious+0.138
ete+0.136
 Sadly+0.134
 Everything+0.132
Neg logit contributions
els-0.148
 on-0.138
 inside-0.136
.-0.130
ps-0.124
Token felt
Token ID2936
Feature activation+2.34e-02

Pos logit contributions
 Int+0.055
ete+0.053
 Sadly+0.052
 Curious+0.051
 Everything+0.051
Neg logit contributions
els-0.058
 on-0.055
 inside-0.054
ps-0.053
.-0.052
Token,
Token ID11
Feature activation-1.28e-02

Pos logit contributions
 Sadly+0.009
 Int+0.009
 Curious+0.009
ete+0.008
Sadly+0.008
Neg logit contributions
els-0.005
ps-0.004
ws-0.004
ries-0.004
bones-0.004
Token you
Token ID345
Feature activation-4.33e-03

Pos logit contributions
 Int+0.028
 Sadly+0.028
 Curious+0.027
Sadly+0.027
ete+0.027
Neg logit contributions
els-0.017
 on-0.016
 inside-0.015
.-0.014
ps-0.014
Token made
Token ID925
Feature activation+4.98e-02

Pos logit contributions
 Int+0.008
 Sadly+0.007
ete+0.007
 Curious+0.007
Sadly+0.007
Neg logit contributions
els-0.008
ps-0.007
 on-0.007
 inside-0.007
ws-0.006
Token a
Token ID257
Feature activation+3.00e-02

Pos logit contributions
els+0.079
ps+0.065
ws+0.061
 on+0.061
 inside+0.058
Neg logit contributions
 Int-0.097
ete-0.094
 Sadly-0.094
 Curious-0.092
Sadly-0.091
Token wonderful
Token ID7932
Feature activation+8.00e-02

Pos logit contributions
els+0.053
ps+0.045
 on+0.043
 inside+0.042
ries+0.041
Neg logit contributions
 Int-0.054
 Sadly-0.054
ete-0.052
Sadly-0.052
 Curious-0.052
Token castle
Token ID16669
Feature activation+1.27e-01

Pos logit contributions
els+0.162
ps+0.143
ries+0.132
ws+0.131
cks+0.131
Neg logit contributions
 Sadly-0.124
 Int-0.121
 Curious-0.121
 Everything-0.116
ete-0.113
Token!"
Token ID2474
Feature activation-1.05e-01

Pos logit contributions
els+0.184
ps+0.132
ws+0.130
ries+0.123
cks+0.109
Neg logit contributions
 Sadly-0.246
 Int-0.240
hog-0.238
ete-0.236
Sadly-0.234
Token Mom
Token ID11254
Feature activation-3.02e-02

Pos logit contributions
 Int+0.219
 Sadly+0.215
ete+0.214
Sadly+0.212
 Curious+0.209
Neg logit contributions
els-0.152
.-0.136
 on-0.133
 inside-0.127
ps-0.118
Token said
Token ID531
Feature activation-1.66e-03

Pos logit contributions
 Int+0.048
 Sadly+0.045
 Curious+0.045
ete+0.045
Sadly+0.043
Neg logit contributions
els-0.059
ps-0.052
 on-0.050
ws-0.050
 inside-0.049
Token.
Token ID13
Feature activation-9.98e-02

Pos logit contributions
 Int+0.004
 Sadly+0.004
 Curious+0.004
ete+0.004
ishly+0.004
Neg logit contributions
els-0.002
ws-0.001
ps-0.001
 colored-0.001
cks-0.001
Token
Token ID198
Feature activation-8.75e-02

Pos logit contributions
ete+0.125
 Int+0.122
 Sadly+0.118
Sadly+0.117
 Curious+0.111
Neg logit contributions
els-0.223
 on-0.209
.-0.206
 inside-0.204
ps-0.200
Token more
Token ID517
Feature activation+7.77e-02

Pos logit contributions
els+0.174
ps+0.161
epad+0.152
 channel+0.150
packs+0.146
Neg logit contributions
 Int-0.134
Of-0.134
ete-0.133
So-0.130
 Fro-0.130
Token cel
Token ID18725
Feature activation+1.27e-01

Pos logit contributions
ps+0.138
els+0.129
bones+0.120
ier+0.114
epad+0.112
Neg logit contributions
 So-0.114
So-0.113
 Dod-0.110
 Int-0.107
Da-0.106
Tokenery
Token ID1924
Feature activation+1.24e-01

Pos logit contributions
els+0.204
 inside+0.160
ries+0.159
ps+0.155
 colored+0.155
Neg logit contributions
ete-0.234
Sadly-0.217
Da-0.209
So-0.208
 Sadly-0.205
Token.
Token ID13
Feature activation-5.90e-02

Pos logit contributions
els+0.166
ps+0.128
cks+0.115
bones+0.115
ws+0.107
Neg logit contributions
ete-0.249
So-0.241
 Sadly-0.238
 So-0.235
hog-0.235
Token Maybe
Token ID6674
Feature activation+4.96e-03

Pos logit contributions
 Int+0.065
 Sadly+0.062
ete+0.061
 Curious+0.060
Sadly+0.059
Neg logit contributions
els-0.142
ps-0.129
 on-0.127
 inside-0.125
ws-0.122
Token it
Token ID340
Feature activation+2.19e-02

Pos logit contributions
ps+0.012
els+0.012
 channel+0.011
ws+0.011
bones+0.010
Neg logit contributions
ete-0.005
Of-0.005
 couldn-0.005
 Int-0.004
 Dod-0.004
Token will
Token ID481
Feature activation+1.35e-02

Pos logit contributions
els+0.037
ps+0.031
 on+0.029
 inside+0.029
 colored+0.028
Neg logit contributions
 Int-0.041
 Sadly-0.039
 Curious-0.038
Sadly-0.038
ete-0.037
Token increase
Token ID2620
Feature activation+1.11e-01

Pos logit contributions
els+0.028
ries+0.023
ps+0.023
 inside+0.021
bones+0.021
Neg logit contributions
 Sadly-0.020
Sadly-0.020
 Int-0.019
ete-0.019
 Curious-0.019
Token our
Token ID674
Feature activation+4.14e-02

Pos logit contributions
els+0.162
ps+0.147
bones+0.133
 sticker+0.130
 channel+0.123
Neg logit contributions
ete-0.191
Da-0.184
So-0.183
From-0.179
 couldn-0.177
Token height
Token ID6001
Feature activation+1.45e-01

Pos logit contributions
els+0.071
ps+0.058
 inside+0.057
bones+0.056
ries+0.056
Neg logit contributions
 Sadly-0.071
ete-0.070
 Int-0.068
Sadly-0.065
 Curious-0.065
Token,"
Token ID553
Feature activation-9.20e-02

Pos logit contributions
els+0.187
ps+0.163
bones+0.136
cks+0.136
ws+0.130
Neg logit contributions
ete-0.288
So-0.282
-0.270
 So-0.269
 couldn-0.268
Token new
Token ID649
Feature activation+8.57e-02

Pos logit contributions
els+0.063
ps+0.052
ws+0.052
ries+0.049
cks+0.049
Neg logit contributions
 Int-0.061
 Curious-0.059
ete-0.058
 Sadly-0.057
 Everything-0.056
Token game
Token ID983
Feature activation+9.81e-02

Pos logit contributions
els+0.145
ps+0.123
cks+0.122
ws+0.119
ries+0.115
Neg logit contributions
 Curious-0.149
 Int-0.149
ete-0.147
 Sadly-0.145
 Everything-0.142
Token and
Token ID290
Feature activation-1.76e-02

Pos logit contributions
els+0.133
ps+0.110
ws+0.099
ries+0.095
 on+0.095
Neg logit contributions
 Int-0.211
 Sadly-0.210
ete-0.208
 Curious-0.203
Sadly-0.202
Token discovered
Token ID5071
Feature activation+3.95e-02

Pos logit contributions
 Int+0.029
 Sadly+0.028
ete+0.028
 Curious+0.028
Sadly+0.027
Neg logit contributions
els-0.033
ps-0.029
 on-0.029
 inside-0.028
ws-0.027
Token even
Token ID772
Feature activation+3.81e-02

Pos logit contributions
els+0.061
ps+0.052
 on+0.050
 inside+0.048
ws+0.047
Neg logit contributions
 Int-0.078
 Sadly-0.077
ete-0.075
 Curious-0.075
Sadly-0.074
Token more
Token ID517
Feature activation+8.09e-02

Pos logit contributions
els+0.065
ps+0.058
cks+0.053
ries+0.048
ws+0.048
Neg logit contributions
Of-0.071
 Curious-0.070
 Int-0.070
Don-0.066
 Sadly-0.065
Token exciting
Token ID7895
Feature activation+1.11e-01

Pos logit contributions
els+0.141
ws+0.136
ps+0.135
cks+0.122
ries+0.114
Neg logit contributions
ete-0.123
 Curious-0.123
 Everything-0.122
 Int-0.121
 couldn-0.116
Token smells
Token ID25760
Feature activation+1.16e-01

Pos logit contributions
els+0.145
ws+0.134
ps+0.131
cks+0.119
 sticker+0.116
Neg logit contributions
 Int-0.215
 Curious-0.212
 Everything-0.210
ete-0.206
 couldn-0.205
Token in
Token ID287
Feature activation+4.48e-02

Pos logit contributions
els+0.185
ps+0.159
ws+0.158
cks+0.144
ries+0.134
Neg logit contributions
ete-0.220
 Int-0.214
 Everything-0.210
 Curious-0.209
 Everywhere-0.203
Token the
Token ID262
Feature activation+1.90e-02

Pos logit contributions
els+0.083
ps+0.075
ws+0.074
ier+0.067
ries+0.067
Neg logit contributions
ete-0.068
 Sadly-0.064
 Int-0.063
 Everything-0.063
 Everywhere-0.063
Token ocean
Token ID9151
Feature activation+1.05e-01

Pos logit contributions
els+0.038
ps+0.033
ws+0.031
 on+0.031
ries+0.031
Neg logit contributions
 Int-0.030
 Sadly-0.030
ete-0.029
 Curious-0.029
 Everything-0.028
Token that
Token ID326
Feature activation-2.13e-02

Pos logit contributions
els+0.006
 on+0.006
 inside+0.006
.+0.006
ps+0.005
Neg logit contributions
 Int-0.010
 Curious-0.010
 Sadly-0.010
Sadly-0.009
ete-0.009
Token he
Token ID339
Feature activation-1.52e-02

Pos logit contributions
 Int+0.029
 Sadly+0.029
 Curious+0.028
ete+0.027
Sadly+0.027
Neg logit contributions
els-0.045
ps-0.041
ws-0.038
cks-0.037
 inside-0.036
Token couldn
Token ID3521
Feature activation+7.63e-03

Pos logit contributions
 Int+0.025
 Curious+0.023
ete+0.022
Da+0.022
Sadly+0.022
Neg logit contributions
els-0.030
 on-0.028
 inside-0.027
.-0.027
ps-0.026
Token't
Token ID470
Feature activation+1.62e-02

Pos logit contributions
els+0.015
 on+0.014
.+0.014
ps+0.013
 inside+0.013
Neg logit contributions
 Int-0.012
Of-0.012
 Curious-0.012
ete-0.011
 Sadly-0.011
Token play
Token ID711
Feature activation+8.09e-02

Pos logit contributions
els+0.029
ps+0.025
 on+0.023
ws+0.023
ries+0.023
Neg logit contributions
 Sadly-0.028
 Int-0.028
ete-0.028
Sadly-0.028
 Curious-0.027
Token with
Token ID351
Feature activation+4.03e-02

Pos logit contributions
els+0.131
ps+0.109
ws+0.105
cks+0.091
 angles+0.090
Neg logit contributions
 Int-0.155
 Sadly-0.152
 Curious-0.147
 Everything-0.145
Sadly-0.143
Token his
Token ID465
Feature activation+3.41e-02

Pos logit contributions
els+0.065
ps+0.059
ws+0.059
ier+0.054
 colored+0.053
Neg logit contributions
 Sadly-0.062
 Int-0.062
 His-0.060
 Everything-0.060
ete-0.059
Token truck
Token ID7779
Feature activation+9.63e-02

Pos logit contributions
els+0.056
ps+0.049
ws+0.048
cks+0.047
bones+0.046
Neg logit contributions
 Sadly-0.058
 Curious-0.055
 Int-0.054
 His-0.054
 couldn-0.052
Token until
Token ID1566
Feature activation-1.64e-02

Pos logit contributions
els+0.137
ps+0.117
cks+0.106
ws+0.106
 colored+0.096
Neg logit contributions
 Sadly-0.201
 Int-0.189
ete-0.188
Sadly-0.186
 Curious-0.184
Token he
Token ID339
Feature activation-1.34e-02

Pos logit contributions
 Sadly+0.023
 Int+0.022
ete+0.022
 Everything+0.022
 Curious+0.021
Neg logit contributions
els-0.035
ps-0.032
ws-0.030
cks-0.029
 colored-0.028
Token ate
Token ID15063
Feature activation+9.66e-02

Pos logit contributions
 Int+0.020
 Curious+0.019
 Sadly+0.019
Sadly+0.019
ete+0.018
Neg logit contributions
els-0.027
 on-0.025
ps-0.024
 inside-0.024
.-0.024
Token a
Token ID257
Feature activation+2.12e-02

Pos logit contributions
els+0.019
 on+0.017
ps+0.016
 inside+0.016
.+0.015
Neg logit contributions
ete-0.037
 Int-0.036
 Sadly-0.036
Sadly-0.035
 Curious-0.035
Token beautiful
Token ID4950
Feature activation+7.14e-02

Pos logit contributions
els+0.037
ps+0.033
 on+0.031
 inside+0.030
ws+0.030
Neg logit contributions
 Int-0.038
 Sadly-0.037
 Curious-0.036
ete-0.036
Sadly-0.036
Token pattern
Token ID3912
Feature activation+1.19e-01

Pos logit contributions
els+0.137
ps+0.126
ws+0.123
cks+0.113
ries+0.106
Neg logit contributions
 Int-0.116
 Curious-0.113
 Sadly-0.110
Da-0.106
 Everything-0.105
Token in
Token ID287
Feature activation+2.42e-02

Pos logit contributions
els+0.135
ws+0.133
ps+0.110
 sticker+0.096
ries+0.094
Neg logit contributions
 Int-0.253
 Curious-0.243
hog-0.243
 couldn-0.241
Over-0.235
Token the
Token ID262
Feature activation-4.40e-03

Pos logit contributions
els+0.044
ws+0.041
ps+0.041
ier+0.040
ishing+0.038
Neg logit contributions
 Int-0.038
Da-0.037
 Curious-0.037
From-0.035
 Sadly-0.035
Token clouds
Token ID15114
Feature activation+9.08e-02

Pos logit contributions
 Int+0.008
 Sadly+0.008
 Curious+0.007
ete+0.007
Sadly+0.007
Neg logit contributions
els-0.008
ps-0.007
 on-0.007
 inside-0.006
ws-0.006
Token.
Token ID13
Feature activation-1.17e-01

Pos logit contributions
ws+0.101
els+0.100
ps+0.094
 problems+0.080
 sticker+0.079
Neg logit contributions
 Int-0.194
 She-0.188
 couldn-0.187
 Curious-0.183
 Everything-0.182
Token The
Token ID383
Feature activation-3.08e-02

Pos logit contributions
ete+0.220
mber+0.184
Sadly+0.173
David+0.173
 concession+0.170
Neg logit contributions
.-0.228
 on-0.221
 inside-0.210
."-0.201
els-0.192
Token pattern
Token ID3912
Feature activation+6.51e-02

Pos logit contributions
 Int+0.065
 Sadly+0.064
 Curious+0.063
ete+0.063
Sadly+0.062
Neg logit contributions
els-0.044
ps-0.037
 on-0.036
 inside-0.034
ws-0.033
Token was
Token ID373
Feature activation+9.80e-03

Pos logit contributions
els+0.131
 on+0.116
ps+0.115
 inside+0.113
.+0.109
Neg logit contributions
 Int-0.099
 Sadly-0.097
ete-0.095
 Curious-0.094
Sadly-0.093
Token made
Token ID925
Feature activation+5.06e-02

Pos logit contributions
els+0.020
ws+0.017
ps+0.017
ries+0.016
cks+0.015
Neg logit contributions
 Int-0.015
 Sadly-0.014
 Curious-0.014
Sadly-0.014
 Everything-0.013
Token plane
Token ID6614
Feature activation+6.12e-02

Pos logit contributions
 Int+0.045
Sadly+0.043
 Sadly+0.043
 Curious+0.042
 Everything+0.041
Neg logit contributions
els-0.041
ps-0.034
ws-0.033
ries-0.033
 inside-0.032
Token takes
Token ID2753
Feature activation+8.90e-02

Pos logit contributions
els+0.104
ps+0.088
ws+0.084
 on+0.083
 inside+0.083
Neg logit contributions
 Int-0.109
 Sadly-0.108
 Curious-0.104
Sadly-0.103
ete-0.101
Token people
Token ID661
Feature activation+8.44e-02

Pos logit contributions
els+0.147
ries+0.126
cks+0.114
ws+0.113
ier+0.112
Neg logit contributions
 Int-0.153
 Everything-0.151
 Sadly-0.150
Sadly-0.147
 Curious-0.143
Token from
Token ID422
Feature activation+2.66e-02

Pos logit contributions
els+0.132
ps+0.111
ws+0.105
ries+0.099
bones+0.099
Neg logit contributions
 Int-0.155
ete-0.153
 Sadly-0.151
 Curious-0.147
 Everything-0.145
Token one
Token ID530
Feature activation+5.98e-02

Pos logit contributions
els+0.045
ps+0.038
ws+0.037
ier+0.037
ries+0.036
Neg logit contributions
 Sadly-0.041
 Int-0.040
 couldn-0.039
Sadly-0.039
 Everything-0.039
Token place
Token ID1295
Feature activation+6.69e-02

Pos logit contributions
els+0.105
ps+0.104
ws+0.091
ries+0.087
bones+0.086
Neg logit contributions
 Sadly-0.093
So-0.093
Sadly-0.092
 Int-0.092
ete-0.091
Token to
Token ID284
Feature activation+9.54e-03

Pos logit contributions
els+0.082
ps+0.064
ws+0.058
cks+0.055
ries+0.055
Neg logit contributions
 Sadly-0.153
 Int-0.147
ete-0.147
Sadly-0.143
 Curious-0.143
Token another
Token ID1194
Feature activation+4.62e-02

Pos logit contributions
els+0.018
ries+0.015
ps+0.015
 on+0.014
ws+0.014
Neg logit contributions
 Int-0.015
 Sadly-0.015
Sadly-0.015
 Everything-0.015
ete-0.014
Token place
Token ID1295
Feature activation+6.79e-02

Pos logit contributions
els+0.083
ps+0.078
ws+0.071
ries+0.070
bones+0.063
Neg logit contributions
 Sadly-0.081
 Int-0.079
Sadly-0.078
So-0.076
 Curious-0.075
Token.
Token ID13
Feature activation-7.11e-02

Pos logit contributions
els+0.080
ps+0.061
ws+0.054
ries+0.051
cks+0.051
Neg logit contributions
 Sadly-0.157
ete-0.152
 Int-0.151
 Curious-0.147
 Everything-0.147
Token It
Token ID632
Feature activation-6.35e-03

Pos logit contributions
ete+0.082
 Int+0.079
Da+0.079
Sadly+0.077
 Sadly+0.076
Neg logit contributions
els-0.161
 on-0.147
ps-0.145
 inside-0.145
.-0.142
Token
Token ID220
Feature activation-5.59e-02

Pos logit contributions
ete+0.164
Sadly+0.159
 Sadly+0.148
 Int+0.147
rolling+0.147
Neg logit contributions
.-0.221
els-0.220
 on-0.219
 inside-0.204
 sticker-0.197
Token
Token ID198
Feature activation-7.37e-02

Pos logit contributions
 Int+0.083
Sadly+0.082
ete+0.081
ishly+0.081
Of+0.081
Neg logit contributions
els-0.103
.-0.102
 on-0.099
 inside-0.095
."-0.092
Token
Token ID198
Feature activation-8.76e-02

Pos logit contributions
ete+0.113
 Sadly+0.109
 Int+0.108
ishly+0.108
Sadly+0.105
Neg logit contributions
 on-0.135
els-0.133
.-0.132
 inside-0.125
 sticker-0.122
TokenThe
Token ID464
Feature activation-4.51e-02

Pos logit contributions
ete+0.149
 Sadly+0.144
 Int+0.140
ishly+0.136
 Everything+0.135
Neg logit contributions
els-0.144
 on-0.144
.-0.142
 sticker-0.136
 inside-0.130
Token girl
Token ID2576
Feature activation-1.27e-02

Pos logit contributions
 Int+0.068
 Curious+0.066
 Sadly+0.065
Of+0.062
Sadly+0.062
Neg logit contributions
els-0.095
ps-0.083
ws-0.081
 on-0.079
cks-0.078
Token said
Token ID531
Feature activation-8.99e-03

Pos logit contributions
Sadly+0.024
 Sadly+0.023
 Int+0.023
ete+0.023
 Curious+0.022
Neg logit contributions
els-0.021
 on-0.019
 inside-0.019
.-0.019
 colored-0.018
Token,
Token ID11
Feature activation-4.10e-02

Pos logit contributions
 Int+0.024
 Sadly+0.024
Sadly+0.023
ete+0.023
 Curious+0.023
Neg logit contributions
els-0.008
 on-0.006
ps-0.006
 inside-0.006
.-0.005
Token "
Token ID366
Feature activation-4.22e-02

Pos logit contributions
Sadly+0.057
 Sadly+0.056
ete+0.056
 Int+0.054
 Curious+0.050
Neg logit contributions
els-0.083
 on-0.083
.-0.080
 inside-0.079
ps-0.076
TokenMy
Token ID3666
Feature activation+9.86e-03

Pos logit contributions
ete+0.074
 Sadly+0.069
Sadly+0.064
apa+0.057
rolling+0.056
Neg logit contributions
 on-0.076
.-0.070
els-0.067
 colored-0.065
 than-0.064
Token name
Token ID1438
Feature activation+7.05e-02

Pos logit contributions
els+0.017
ps+0.015
 on+0.014
 inside+0.014
ws+0.014
Neg logit contributions
 Int-0.018
 Sadly-0.017
 Curious-0.017
ete-0.017
Sadly-0.017
Token is
Token ID318
Feature activation+2.91e-03

Pos logit contributions
els+0.100
ps+0.083
 on+0.079
 inside+0.078
ws+0.075
Neg logit contributions
 Int-0.149
 Sadly-0.145
 Curious-0.143
ete-0.142
Sadly-0.140
Token<|endoftext|>
Token ID50256
Feature activation-3.20e-02

Pos logit contributions
Sadly+0.141
ete+0.137
 Int+0.132
ishly+0.127
 Sadly+0.125
Neg logit contributions
.-0.165
 on-0.158
els-0.154
."-0.153
 inside-0.148
TokenOnce
Token ID7454
Feature activation-1.01e-01

Pos logit contributions
 Int+0.051
 Sadly+0.050
 Curious+0.049
ete+0.048
Sadly+0.047
Neg logit contributions
els-0.062
ps-0.055
 on-0.053
 inside-0.052
ws-0.051
Token upon
Token ID2402
Feature activation-5.52e-02

Pos logit contributions
 Int+0.146
Sadly+0.143
 Sadly+0.139
 Curious+0.138
ete+0.132
Neg logit contributions
els-0.208
 on-0.194
.-0.190
 inside-0.190
ps-0.174
Token a
Token ID257
Feature activation-4.94e-02

Pos logit contributions
ete+0.137
 Sadly+0.136
Sadly+0.132
 Int+0.132
ishly+0.129
Neg logit contributions
 on-0.052
.-0.050
els-0.049
 inside-0.047
ps-0.043
Token time
Token ID640
Feature activation-5.33e-02

Pos logit contributions
 Int+0.143
 Sadly+0.140
ete+0.139
 Curious+0.139
Sadly+0.138
Neg logit contributions
els-0.025
 on-0.019
.-0.018
 inside-0.016
ps-0.014
Token there
Token ID612
Feature activation-4.13e-02

Pos logit contributions
Sadly+0.152
 Sadly+0.151
 Int+0.150
 Curious+0.148
ete+0.147
Neg logit contributions
els-0.033
 on-0.027
.-0.024
 inside-0.023
ps-0.017
Token was
Token ID373
Feature activation-4.15e-02

Pos logit contributions
 Int+0.073
Sadly+0.072
 Sadly+0.071
ete+0.071
Of+0.070
Neg logit contributions
els-0.071
 on-0.065
.-0.064
 inside-0.063
 colored-0.060
Token a
Token ID257
Feature activation-1.37e-02

Pos logit contributions
 Int+0.090
 Sadly+0.089
ete+0.088
 Curious+0.086
Sadly+0.086
Neg logit contributions
els-0.056
 on-0.047
ps-0.046
 inside-0.045
.-0.043
Token happy
Token ID3772
Feature activation+2.88e-02

Pos logit contributions
 Int+0.023
 Sadly+0.022
ete+0.022
Sadly+0.022
 Curious+0.022
Neg logit contributions
els-0.025
 on-0.022
 inside-0.022
ps-0.021
.-0.021
Token little
Token ID1310
Feature activation-4.87e-03

Pos logit contributions
els+0.054
 on+0.047
ps+0.047
 inside+0.046
.+0.045
Neg logit contributions
 Int-0.047
 Sadly-0.046
ete-0.045
 Curious-0.045
Sadly-0.044
Token child
Token ID1200
Feature activation+6.69e-03

Pos logit contributions
 Int+0.008
 Sadly+0.008
 Curious+0.007
ete+0.007
Sadly+0.007
Neg logit contributions
els-0.009
ps-0.008
 on-0.008
 inside-0.008
ws-0.008
Token went
Token ID1816
Feature activation+4.70e-03

Pos logit contributions
Of+0.106
ishly+0.101
Sadly+0.101
 Int+0.100
ete+0.097
Neg logit contributions
 inside-0.072
 on-0.066
.-0.066
 colored-0.053
els-0.053
Token for
Token ID329
Feature activation+3.98e-03

Pos logit contributions
els+0.006
ps+0.005
 on+0.005
 inside+0.005
ws+0.005
Neg logit contributions
 Int-0.010
 Sadly-0.010
 Curious-0.010
ete-0.010
Sadly-0.010
Token a
Token ID257
Feature activation+3.44e-03

Pos logit contributions
els+0.007
ps+0.006
ws+0.005
ries+0.005
 on+0.005
Neg logit contributions
 Int-0.008
 Sadly-0.007
 Curious-0.007
ete-0.007
Sadly-0.007
Token long
Token ID890
Feature activation+6.04e-02

Pos logit contributions
 on+0.005
els+0.005
 inside+0.005
.+0.005
 sticker+0.004
Neg logit contributions
 Sadly-0.006
 Int-0.006
Sadly-0.006
ete-0.006
 Everything-0.006
Token walk
Token ID2513
Feature activation+7.69e-02

Pos logit contributions
els+0.086
ps+0.073
 on+0.066
 inside+0.065
ws+0.064
Neg logit contributions
 Int-0.128
 Sadly-0.126
 Curious-0.123
ete-0.123
Sadly-0.122
Token in
Token ID287
Feature activation+4.79e-03

Pos logit contributions
els+0.074
ps+0.056
 on+0.049
 inside+0.047
ws+0.047
Neg logit contributions
 Int-0.200
 Sadly-0.197
 Curious-0.193
ete-0.193
Sadly-0.191
Token the
Token ID262
Feature activation-1.55e-02

Pos logit contributions
els+0.007
ps+0.006
ws+0.006
ier+0.005
ries+0.005
Neg logit contributions
 Int-0.010
 Sadly-0.010
 Curious-0.010
 Everything-0.009
ete-0.009
Token forest
Token ID8222
Feature activation+5.98e-02

Pos logit contributions
ete+0.036
 Int+0.034
ishly+0.033
etsy+0.033
 Sadly+0.032
Neg logit contributions
 inside-0.017
 on-0.017
.-0.016
els-0.016
 paper-0.014
Token,
Token ID11
Feature activation-3.61e-02

Pos logit contributions
els+0.052
ps+0.038
ws+0.031
ries+0.028
 on+0.028
Neg logit contributions
 Int-0.162
 Sadly-0.160
 Curious-0.157
ete-0.154
Sadly-0.153
Token but
Token ID475
Feature activation-6.54e-02

Pos logit contributions
ete+0.080
Sadly+0.079
 Sadly+0.078
 Int+0.075
 Everything+0.074
Neg logit contributions
 on-0.048
 inside-0.046
.-0.042
els-0.038
 behind-0.037
Token quickly
Token ID2952
Feature activation+9.99e-03

Pos logit contributions
 Int+0.126
 Sadly+0.120
Sadly+0.118
 Curious+0.118
ete+0.117
Neg logit contributions
 on-0.102
 inside-0.097
els-0.095
.-0.089
ps-0.083
Token
Token ID198
Feature activation-8.46e-02

Pos logit contributions
ishly+0.116
 Int+0.111
ete+0.111
 Curious+0.106
Of+0.104
Neg logit contributions
.-0.116
 on-0.116
 inside-0.109
els-0.108
."-0.099
TokenJust
Token ID5703
Feature activation-8.67e-03

Pos logit contributions
ete+0.132
 Int+0.117
ishly+0.114
 Sadly+0.111
 Everything+0.108
Neg logit contributions
 on-0.167
els-0.162
.-0.152
 inside-0.150
 colored-0.142
Token then
Token ID788
Feature activation-3.32e-02

Pos logit contributions
 Int+0.016
 Sadly+0.016
 Curious+0.015
ete+0.015
Sadly+0.015
Neg logit contributions
els-0.015
ps-0.012
ries-0.012
ws-0.011
 on-0.011
Token,
Token ID11
Feature activation-5.92e-02

Pos logit contributions
 Int+0.088
 Sadly+0.087
ete+0.086
Sadly+0.085
 Curious+0.085
Neg logit contributions
els-0.028
 on-0.022
ps-0.021
 inside-0.020
.-0.018
Token Bree
Token ID40110
Feature activation-2.31e-03

Pos logit contributions
ete+0.131
 Int+0.127
 Sadly+0.127
Sadly+0.122
 Curious+0.121
Neg logit contributions
 on-0.075
els-0.075
 inside-0.069
.-0.066
ps-0.061
Token saw
Token ID2497
Feature activation+8.61e-03

Pos logit contributions
 Sadly+0.004
 Int+0.004
Of+0.004
Sadly+0.004
ete+0.004
Neg logit contributions
els-0.004
 inside-0.004
.-0.004
 on-0.004
ps-0.004
Token something
Token ID1223
Feature activation+4.39e-02

Pos logit contributions
els+0.010
 on+0.009
 inside+0.008
.+0.008
ps+0.008
Neg logit contributions
ete-0.020
 Sadly-0.020
 Int-0.020
Sadly-0.019
 Curious-0.019
Token on
Token ID319
Feature activation+3.16e-02

Pos logit contributions
els+0.077
ps+0.070
ws+0.068
ries+0.060
cks+0.058
Neg logit contributions
 Int-0.077
 Curious-0.075
ete-0.073
 Sadly-0.071
Da-0.070
Token the
Token ID262
Feature activation+7.53e-03

Pos logit contributions
els+0.049
ps+0.044
ws+0.043
ier+0.039
 inside+0.038
Neg logit contributions
 Int-0.059
 Curious-0.057
Da-0.057
 Sadly-0.056
Sadly-0.055
Token creek
Token ID39459
Feature activation+8.50e-02

Pos logit contributions
els+0.010
 on+0.010
 inside+0.009
ps+0.009
.+0.009
Neg logit contributions
 Int-0.016
 Sadly-0.016
 Curious-0.015
Sadly-0.015
ete-0.015
Token's
Token ID338
Feature activation+3.58e-02

Pos logit contributions
els+0.037
ps+0.017
 on+0.014
 inside+0.011
ws+0.006
Neg logit contributions
 Int-0.264
 Sadly-0.260
ete-0.257
 Curious-0.257
Sadly-0.255
Token
Token ID198
Feature activation-7.84e-02

Pos logit contributions
 Int+0.090
Sadly+0.090
Da+0.088
 Curious+0.088
Of+0.086
Neg logit contributions
els-0.112
.-0.107
 on-0.105
 inside-0.101
ps-0.098
Token
Token ID198
Feature activation-9.47e-02

Pos logit contributions
ishly+0.138
 Int+0.126
 Sadly+0.122
ete+0.121
 Curious+0.120
Neg logit contributions
.-0.121
 on-0.121
els-0.117
aks-0.109
 inside-0.108
TokenThat
Token ID2504
Feature activation-4.44e-02

Pos logit contributions
ishly+0.147
 Int+0.145
ete+0.144
 Sadly+0.140
cient+0.135
Neg logit contributions
 on-0.158
els-0.154
.-0.146
aks-0.142
 inside-0.139
Token strange
Token ID6283
Feature activation+3.85e-02

Pos logit contributions
 Int+0.104
 Sadly+0.101
 Curious+0.100
ete+0.099
Sadly+0.098
Neg logit contributions
els-0.053
 on-0.045
 inside-0.043
ps-0.042
.-0.041
Token man
Token ID582
Feature activation+2.34e-02

Pos logit contributions
els+0.058
ps+0.049
ws+0.046
 on+0.044
cks+0.044
Neg logit contributions
 Int-0.077
 Sadly-0.076
ete-0.075
 Curious-0.074
 Everything-0.074
Token was
Token ID373
Feature activation-1.02e-02

Pos logit contributions
els+0.047
ps+0.041
 on+0.041
 inside+0.040
.+0.039
Neg logit contributions
 Int-0.036
 Sadly-0.035
 Curious-0.034
ete-0.034
Sadly-0.033
Token right
Token ID826
Feature activation+2.75e-02

Pos logit contributions
 Int+0.017
 Sadly+0.017
ete+0.017
 Curious+0.017
Sadly+0.016
Neg logit contributions
els-0.019
ps-0.016
ws-0.015
 on-0.015
 inside-0.015
Token.
Token ID13
Feature activation-1.04e-01

Pos logit contributions
els+0.036
ws+0.032
cks+0.030
ps+0.029
ries+0.026
Neg logit contributions
ete-0.059
 Sadly-0.057
 Curious-0.056
 Int-0.055
 couldn-0.054
Token Mark
Token ID2940
Feature activation-3.41e-02

Pos logit contributions
ete+0.107
Sadly+0.101
Da+0.099
rolling+0.098
 Int+0.097
Neg logit contributions
els-0.228
.-0.227
 on-0.216
."-0.212
 inside-0.211
Token's
Token ID338
Feature activation-7.24e-03

Pos logit contributions
 Int+0.059
 Sadly+0.056
 Curious+0.055
Sadly+0.054
ete+0.054
Neg logit contributions
els-0.063
 on-0.056
ps-0.055
 inside-0.054
.-0.054
Token mar
Token ID1667
Feature activation+9.24e-02

Pos logit contributions
 Int+0.013
 Sadly+0.013
ete+0.013
 Curious+0.013
Sadly+0.013
Neg logit contributions
els-0.012
ps-0.011
 on-0.010
 inside-0.010
ws-0.010
Token,
Token ID11
Feature activation-4.06e-02

Pos logit contributions
els+0.064
bones+0.055
ps+0.055
ws+0.053
cks+0.049
Neg logit contributions
 Int-0.085
ete-0.084
 Curious-0.078
Da-0.077
 Fro-0.077
Token help
Token ID1037
Feature activation+2.87e-02

Pos logit contributions
 Int+0.074
 Sadly+0.072
 Curious+0.070
ete+0.070
Sadly+0.069
Neg logit contributions
els-0.072
ps-0.059
ws-0.056
ries-0.056
 on-0.056
Token!"
Token ID2474
Feature activation-1.02e-01

Pos logit contributions
els+0.031
ps+0.024
 on+0.021
ws+0.021
 inside+0.019
Neg logit contributions
 Int-0.070
ete-0.069
 Sadly-0.068
Sadly-0.067
 Curious-0.066
Token Tom
Token ID4186
Feature activation-4.19e-02

Pos logit contributions
ete+0.207
 Int+0.204
 Sadly+0.199
Sadly+0.197
 Curious+0.196
Neg logit contributions
els-0.144
 on-0.135
.-0.132
 inside-0.128
ps-0.127
Token cried
Token ID16896
Feature activation+2.43e-02

Pos logit contributions
 Int+0.069
ete+0.069
 Sadly+0.068
 Curious+0.066
 Everything+0.066
Neg logit contributions
els-0.077
 on-0.070
ps-0.068
 inside-0.067
.-0.067
Token.
Token ID13
Feature activation-9.03e-02

Pos logit contributions
els+0.023
ws+0.015
cks+0.015
ps+0.014
ries+0.014
Neg logit contributions
 Int-0.065
 Sadly-0.063
 Curious-0.063
 His-0.061
ete-0.061
Token He
Token ID679
Feature activation-4.18e-02

Pos logit contributions
ete+0.105
 Int+0.091
Sadly+0.088
Of+0.088
rolling+0.086
Neg logit contributions
.-0.212
 on-0.208
els-0.203
 inside-0.198
ps-0.197
Token was
Token ID373
Feature activation-1.16e-02

Pos logit contributions
ete+0.070
 Int+0.069
 Sadly+0.068
 Curious+0.066
Sadly+0.065
Neg logit contributions
els-0.078
ps-0.070
 on-0.069
 inside-0.067
.-0.064
Token not
Token ID407
Feature activation+5.44e-03

Pos logit contributions
 Int+0.017
 Sadly+0.017
 Curious+0.016
Sadly+0.016
 Everything+0.015
Neg logit contributions
els-0.024
ws-0.020
ries-0.019
 colored-0.019
ps-0.019
Token happy
Token ID3772
Feature activation+4.49e-02

Pos logit contributions
els+0.013
bones+0.011
ries+0.011
ws+0.011
 icing+0.011
Neg logit contributions
 Int-0.006
 Curious-0.006
Luckily-0.005
Da-0.005
 Barkley-0.005
Token.
Token ID13
Feature activation-8.31e-02

Pos logit contributions
els+0.063
ws+0.052
ps+0.049
 colored+0.045
aks+0.043
Neg logit contributions
 He-0.092
 His-0.088
 couldn-0.086
 Int-0.084
ete-0.084
Token Duck
Token ID21867
Feature activation+6.29e-02

Pos logit contributions
els+0.047
ps+0.041
 on+0.040
 inside+0.039
ws+0.038
Neg logit contributions
 Int-0.041
 Sadly-0.040
ete-0.039
 Curious-0.039
Sadly-0.038
Token asked
Token ID1965
Feature activation+2.51e-02

Pos logit contributions
els+0.121
ps+0.107
 on+0.105
 inside+0.102
ws+0.099
Neg logit contributions
 Int-0.101
 Sadly-0.098
ete-0.095
 Curious-0.095
Sadly-0.094
Token him
Token ID683
Feature activation+4.17e-02

Pos logit contributions
els+0.032
ps+0.026
 on+0.025
 inside+0.024
ws+0.023
Neg logit contributions
 Int-0.057
 Sadly-0.056
ete-0.055
 Curious-0.055
Sadly-0.054
Token to
Token ID284
Feature activation+1.69e-02

Pos logit contributions
els+0.051
ps+0.041
 on+0.039
 inside+0.038
ws+0.036
Neg logit contributions
 Int-0.096
 Sadly-0.094
ete-0.093
 Curious-0.093
Sadly-0.091
Token do
Token ID466
Feature activation+4.56e-02

Pos logit contributions
els+0.028
ps+0.024
 on+0.023
 inside+0.022
ws+0.022
Neg logit contributions
 Int-0.032
 Sadly-0.031
ete-0.031
 Curious-0.030
Sadly-0.030
Token.
Token ID13
Feature activation-1.12e-01

Pos logit contributions
els+0.053
ps+0.040
ws+0.038
cks+0.035
 colored+0.034
Neg logit contributions
ete-0.104
 Sadly-0.104
 Int-0.102
Sadly-0.102
 Everything-0.101
Token The
Token ID383
Feature activation-3.69e-02

Pos logit contributions
 Int+0.201
rolling+0.198
ete+0.189
 Curious+0.188
Quick+0.187
Neg logit contributions
.-0.193
."-0.175
 on-0.159
 chocolate-0.146
 inside-0.137
Token family
Token ID1641
Feature activation+5.39e-03

Pos logit contributions
 Int+0.084
 Sadly+0.079
 Curious+0.079
ete+0.078
 Everything+0.076
Neg logit contributions
els-0.047
 on-0.043
ps-0.041
.-0.040
 inside-0.039
Token of
Token ID286
Feature activation+2.66e-02

Pos logit contributions
els+0.010
 on+0.009
 inside+0.009
ps+0.009
.+0.009
Neg logit contributions
 Int-0.009
 Curious-0.009
ete-0.008
 Sadly-0.008
Sadly-0.008
Token ducks
Token ID39694
Feature activation+4.92e-02

Pos logit contributions
els+0.038
ps+0.033
 inside+0.031
 on+0.030
 sticker+0.030
Neg logit contributions
 Int-0.053
 Sadly-0.052
ete-0.052
Sadly-0.051
 Curious-0.051
Token were
Token ID547
Feature activation+8.79e-03

Pos logit contributions
els+0.097
 on+0.089
ps+0.087
 inside+0.087
.+0.084
Neg logit contributions
 Int-0.078
 Curious-0.073
 Sadly-0.073
Sadly-0.071
ete-0.071
Token still
Token ID991
Feature activation-1.64e-03

Pos logit contributions
 Int+0.106
ete+0.105
 Sadly+0.104
 Curious+0.102
Sadly+0.101
Neg logit contributions
els-0.068
 on-0.061
 inside-0.058
ps-0.055
.-0.053
Token there
Token ID612
Feature activation+1.94e-02

Pos logit contributions
 Int+0.002
 Sadly+0.002
 Curious+0.002
ete+0.002
Sadly+0.002
Neg logit contributions
els-0.004
ps-0.003
ws-0.003
 on-0.003
ries-0.003
Token was
Token ID373
Feature activation+4.00e-03

Pos logit contributions
els+0.039
 on+0.035
 inside+0.034
ps+0.034
.+0.034
Neg logit contributions
 Int-0.030
 Sadly-0.029
ete-0.029
 Curious-0.028
Sadly-0.028
Token no
Token ID645
Feature activation+4.99e-02

Pos logit contributions
els+0.007
ps+0.006
 on+0.006
 inside+0.006
.+0.005
Neg logit contributions
 Int-0.007
 Sadly-0.007
ete-0.007
 Curious-0.007
Sadly-0.007
Token luck
Token ID8458
Feature activation+1.21e-01

Pos logit contributions
els+0.091
cks+0.087
ries+0.077
ps+0.076
ws+0.072
Neg logit contributions
 Curious-0.081
 Int-0.080
 Everything-0.073
Of-0.073
 His-0.071
Token.
Token ID13
Feature activation-1.05e-01

Pos logit contributions
els+0.157
ws+0.122
ries+0.111
 colored+0.107
ps+0.106
Neg logit contributions
 couldn-0.263
ete-0.258
 Curious-0.242
 �-0.240
 Int-0.239
Token
Token ID220
Feature activation-4.70e-02

Pos logit contributions
ete+0.181
Sadly+0.156
mber+0.155
 concession+0.153
Fin+0.151
Neg logit contributions
.-0.187
 inside-0.176
 on-0.176
."-0.161
els-0.151
Token
Token ID198
Feature activation-6.56e-02

Pos logit contributions
ete+0.071
 Int+0.069
Sadly+0.068
 Curious+0.067
mber+0.067
Neg logit contributions
els-0.087
.-0.086
 on-0.084
 inside-0.080
."-0.076
Token
Token ID198
Feature activation-8.60e-02

Pos logit contributions
ete+0.115
ishly+0.114
 Int+0.106
 Curious+0.102
 Sadly+0.101
Neg logit contributions
.-0.110
 on-0.107
 inside-0.101
els-0.096
."-0.094
TokenJust
Token ID5703
Feature activation-8.49e-03

Pos logit contributions
ete+0.126
 Fro+0.118
ishly+0.108
 Int+0.106
 Sadly+0.101
Neg logit contributions
 on-0.166
 inside-0.163
.-0.162
els-0.160
 colored-0.146
Token then
Token ID788
Feature activation-3.89e-02

Pos logit contributions
 Int+0.011
 Sadly+0.010
 Curious+0.010
Sadly+0.010
ete+0.010
Neg logit contributions
els-0.019
ps-0.017
ws-0.016
ries-0.016
cks-0.016
Token on
Token ID319
Feature activation+3.96e-02

Pos logit contributions
els+0.134
ps+0.111
ws+0.106
cks+0.103
bones+0.098
Neg logit contributions
ete-0.181
 Int-0.178
 Sadly-0.176
 Curious-0.172
Of-0.166
Token those
Token ID883
Feature activation+5.70e-02

Pos logit contributions
els+0.069
ps+0.061
ws+0.061
cks+0.056
ier+0.054
Neg logit contributions
ete-0.067
 Int-0.067
 Sadly-0.066
 Curious-0.065
 Everywhere-0.062
Token toys
Token ID14958
Feature activation+9.93e-02

Pos logit contributions
els+0.084
ps+0.074
ws+0.072
cks+0.065
 sticker+0.064
Neg logit contributions
 Int-0.111
ete-0.107
 Curious-0.107
 Sadly-0.106
 Everything-0.102
Token!
Token ID0
Feature activation-1.18e-01

Pos logit contributions
els+0.124
ps+0.101
ws+0.093
cks+0.087
 colored+0.081
Neg logit contributions
 Sadly-0.222
ete-0.222
 Int-0.221
 Curious-0.212
 couldn-0.209
Token<|endoftext|>
Token ID50256
Feature activation-5.16e-02

Pos logit contributions
Sadly+0.218
rolling+0.216
mber+0.215
ete+0.211
Da+0.210
Neg logit contributions
.-0.177
."-0.153
 on-0.146
 inside-0.138
els-0.115
TokenOnce
Token ID7454
Feature activation-1.16e-01

Pos logit contributions
 Int+0.053
 Sadly+0.052
 Curious+0.049
ete+0.048
Sadly+0.046
Neg logit contributions
els-0.131
ps-0.118
ws-0.112
 on-0.112
 inside-0.112
Token upon
Token ID2402
Feature activation-6.64e-02

Pos logit contributions
 Int+0.162
Sadly+0.155
 Sadly+0.154
 Curious+0.153
ete+0.146
Neg logit contributions
els-0.248
 on-0.230
.-0.225
 inside-0.224
ps-0.212
Token a
Token ID257
Feature activation-6.31e-02

Pos logit contributions
ete+0.164
 Sadly+0.163
 Int+0.159
Sadly+0.157
ishly+0.155
Neg logit contributions
 on-0.063
.-0.061
els-0.060
 inside-0.057
ps-0.053
Token time
Token ID640
Feature activation-7.34e-02

Pos logit contributions
 Int+0.183
 Sadly+0.178
 Curious+0.178
ete+0.176
 Everything+0.176
Neg logit contributions
els-0.035
 on-0.024
.-0.022
ps-0.021
 inside-0.020
Token,
Token ID11
Feature activation-1.01e-01

Pos logit contributions
 Int+0.202
 Sadly+0.202
Sadly+0.201
 Curious+0.199
ete+0.198
Neg logit contributions
els-0.052
 on-0.042
 inside-0.037
.-0.036
ps-0.033
Token there
Token ID612
Feature activation-5.52e-02

Pos logit contributions
 Int+0.222
 Sadly+0.219
 Curious+0.215
ete+0.211
Sadly+0.210
Neg logit contributions
els-0.139
ps-0.115
ws-0.103
ries-0.099
 on-0.098
Token but
Token ID475
Feature activation-4.15e-02

Pos logit contributions
 Int+0.013
 Sadly+0.012
 Curious+0.012
Sadly+0.012
 Everything+0.012
Neg logit contributions
els-0.009
ries-0.006
ps-0.006
ws-0.006
 colored-0.006
Token Sara
Token ID24799
Feature activation-5.56e-03

Pos logit contributions
 Int+0.072
 Sadly+0.071
ete+0.070
 Curious+0.069
Sadly+0.068
Neg logit contributions
els-0.074
ps-0.064
 on-0.061
 inside-0.060
ws-0.059
Token's
Token ID338
Feature activation+2.34e-03

Pos logit contributions
 Int+0.010
 Sadly+0.010
 Curious+0.010
ete+0.010
Sadly+0.010
Neg logit contributions
els-0.009
ps-0.008
ries-0.007
 inside-0.007
 on-0.007
Token car
Token ID1097
Feature activation+7.43e-02

Pos logit contributions
els+0.004
ps+0.003
ries+0.003
ws+0.003
cks+0.003
Neg logit contributions
 Int-0.004
 Sadly-0.004
 Curious-0.004
Sadly-0.004
 Everything-0.004
Token stops
Token ID9911
Feature activation+5.16e-02

Pos logit contributions
els+0.116
ps+0.095
ries+0.093
 inside+0.092
 frost+0.090
Neg logit contributions
 Int-0.142
 Sadly-0.139
 Curious-0.134
Sadly-0.133
 Everything-0.131
Token.
Token ID13
Feature activation-8.82e-02

Pos logit contributions
ws+0.066
els+0.064
ps+0.059
 gifts+0.056
iest+0.054
Neg logit contributions
 �-0.094
rolling-0.094
 Int-0.093
 She-0.089
 symp-0.089
Token She
Token ID1375
Feature activation-4.39e-02

Pos logit contributions
ete+0.126
Sadly+0.124
 Int+0.114
 Sadly+0.112
mber+0.109
Neg logit contributions
els-0.173
 on-0.171
.-0.164
 inside-0.159
ps-0.148
Token looks
Token ID3073
Feature activation+2.19e-02

Pos logit contributions
 Int+0.092
 Sadly+0.089
 Curious+0.088
ete+0.087
Sadly+0.086
Neg logit contributions
els-0.064
ps-0.053
 on-0.051
 inside-0.050
ws-0.048
Token at
Token ID379
Feature activation+3.24e-03

Pos logit contributions
els+0.034
ps+0.028
ries+0.026
ws+0.026
bones+0.024
Neg logit contributions
 Int-0.044
 Curious-0.042
 Sadly-0.041
 Everything-0.040
ete-0.039
Token it
Token ID340
Feature activation+3.97e-02

Pos logit contributions
els+0.005
ps+0.004
 on+0.004
 inside+0.004
ws+0.004
Neg logit contributions
 Int-0.007
ete-0.006
 Sadly-0.006
 Curious-0.006
 Everything-0.006
Token and
Token ID290
Feature activation-3.60e-02

Pos logit contributions
els+0.059
ws+0.049
ries+0.048
ps+0.047
ocol+0.046
Neg logit contributions
 Int-0.080
 Curious-0.075
 Sadly-0.074
Sadly-0.073
So-0.073