TOP ACTIVATIONS
MAX = 0.175

to be gold is not gold , but something else that
good to have so much clutter . You can trip and
treasure is not gold or jewels ," L ila says .
dreamed of rain bows and gold and sour apples . <|endoftext|>
Maybe the treasure is not gold or jewels ," L ila
nice and not fight over gold . They decide
, you will get more gold ." The boy was greedy
. I don 't need gold ice cream . I have
and ice cre ams and cand ies and fruits and vegetables
together . They cannot afford gold ice cream .
. And you wasted the cand ies I bought for your
real treasure chest filled with gold coins and jewels .
to be restored to its former glory ."
friendship are more precious than gold ." L ily
found a chest filled with gold and jewels . From that
that it is not real gold . It is chocolate !
the village back to its former glory !"
, maybe we will find gold and jewels and pear ls
Maybe the birds can still hatch them . But you have
a treasure chest full of gold and jewels ! Tim my

MIN ACTIVATIONS
MIN = -0.148

instead of the sharp rock ." So the duck
much fun playing with it ." L ily
knee for a little while ." Mom my said .
store to buy some food ." L ily replied
scared of a fake wolf ." L ily had
needs to be done today ." Cl ara accepted
wanted to make new friends ." L ily
a cartoon with this box ." Tim drew pictures on the
especially when things seem scary ." The beetle
to share my meal with ." The kitten was so
it to its mom my ." Twe ety
and don 't know why ." Mom smiled and
longer until they cool down ." But Lily didn
's raining : an umbrella !" F ido was so excited
deserve to ride on me ." Tim and Mia
to go to the park ." Dad said ,
I couldn 't help sooner ." Qu ackers said , "
this drain by cleaning it ." He ran inside and asked
with me for so long ." Her mom said
want to make a fort ." M ia said

INTERVAL 0.094 - 0.175
CONTAINS 4.741%

Rex . He drops the ball . He w ags his
tree . The dog cannot climb . It stays under the
The party was filled with balloons and music and lots of
likes to look at the pictures and the words on the
mom reminded him to finish cleaning up . Tim my picked

INTERVAL 0.013 - 0.094
CONTAINS 36.704%

't know how to get back home . Just when he
mom my drove to the pet shop . The pet shop
the boy always came to see the cows and he was
, running from place to place . One day he was
They saw a big pond with ducks and fish . They

INTERVAL -0.067 - 0.013
CONTAINS 45.542%

and delicious . He thanked the old lady and
, but a responsibility . They also learned that Fl uffy
She had a warm blanket that she loved to sn uggle
think they are almost at the park . But
the beach with her mom and dad . She saw some

INTERVAL -0.148 - -0.067
CONTAINS 13.013%

many dresses in the store . Some were red , some
the sun and the waves . The gentle old boat ate
had fun together . At the end of the
. One day , they saw a big boat
, " Hi , Tim ! I was scared too ,
Token to
Token ID284
Feature activation+1.64e-02

Pos logit contributions
els+0.093
ps+0.081
 sticker+0.078
 frost+0.077
 icing+0.076
Neg logit contributions
 Int-0.107
 Everywhere-0.100
 Everything-0.098
 Curious-0.095
 Sadly-0.093
Token be
Token ID307
Feature activation+4.69e-02

Pos logit contributions
els+0.031
ps+0.027
 on+0.026
 cream+0.026
 inside+0.026
Neg logit contributions
 Int-0.029
 Curious-0.026
 Everything-0.026
 Everywhere-0.025
 Sadly-0.025
Token gold
Token ID3869
Feature activation+1.59e-01

Pos logit contributions
els+0.093
ries+0.077
ps+0.075
 sticker+0.074
 icing+0.073
Neg logit contributions
 Int-0.077
 Curious-0.073
 Everything-0.070
 Sadly-0.070
 Everywhere-0.068
Token is
Token ID318
Feature activation+4.34e-02

Pos logit contributions
els+0.205
bones+0.178
ps+0.170
 colored+0.167
cks+0.162
Neg logit contributions
 Int-0.329
 Everywhere-0.311
 Everything-0.303
 Curious-0.303
ete-0.302
Token not
Token ID407
Feature activation+5.27e-02

Pos logit contributions
els+0.078
ries+0.064
ps+0.061
 icing+0.060
bones+0.059
Neg logit contributions
 Int-0.074
 Everywhere-0.071
 Everything-0.069
 Curious-0.068
 Sadly-0.068
Token gold
Token ID3869
Feature activation+1.75e-01

Pos logit contributions
els+0.097
ries+0.077
bones+0.074
cks+0.072
 sticker+0.072
Neg logit contributions
 Everywhere-0.078
 Int-0.077
 Everything-0.076
 He-0.076
 couldn-0.075
Token,
Token ID11
Feature activation+1.20e-02

Pos logit contributions
els+0.258
bones+0.243
cks+0.241
 colored+0.232
ps+0.228
Neg logit contributions
 He-0.285
ete-0.279
 Everywhere-0.268
 Int-0.259
 Nice-0.256
Token but
Token ID475
Feature activation-2.81e-02

Pos logit contributions
els+0.018
ps+0.015
 colored+0.014
 inside+0.014
 on+0.014
Neg logit contributions
 Int-0.027
 Curious-0.024
 Everything-0.024
 Everywhere-0.024
 Sadly-0.024
Token something
Token ID1223
Feature activation+3.63e-02

Pos logit contributions
 Int+0.047
 Everything+0.044
 Curious+0.043
 Sadly+0.042
 Everywhere+0.042
Neg logit contributions
els-0.053
ps-0.048
 colored-0.045
 cream-0.044
ws-0.044
Token else
Token ID2073
Feature activation+4.70e-02

Pos logit contributions
els+0.066
ps+0.055
ries+0.052
 colored+0.052
 cream+0.051
Neg logit contributions
 Int-0.068
 Curious-0.060
 Everything-0.060
 Everywhere-0.060
 Sadly-0.058
Token that
Token ID326
Feature activation+2.11e-03

Pos logit contributions
els+0.084
ps+0.071
 cream+0.068
 colored+0.068
 sticker+0.068
Neg logit contributions
 Int-0.085
 Everything-0.077
 Curious-0.076
 Everywhere-0.074
 Sadly-0.073
Token good
Token ID922
Feature activation+6.48e-02

Pos logit contributions
els+0.070
bones+0.054
cks+0.054
ps+0.052
ries+0.052
Neg logit contributions
 Int-0.056
 Everything-0.052
 Sadly-0.051
 Curious-0.048
 Everywhere-0.048
Token to
Token ID284
Feature activation+2.34e-02

Pos logit contributions
els+0.130
cks+0.115
bones+0.105
ck+0.103
aks+0.096
Neg logit contributions
 Everything-0.096
 She-0.094
 Int-0.093
 He-0.092
 As-0.089
Token have
Token ID423
Feature activation+5.30e-02

Pos logit contributions
els+0.046
ps+0.041
bones+0.040
ries+0.038
 on+0.038
Neg logit contributions
 Int-0.038
 Everything-0.035
 Sadly-0.035
 Curious-0.033
 She-0.033
Token so
Token ID523
Feature activation+2.36e-02

Pos logit contributions
els+0.092
cks+0.071
bones+0.068
ps+0.066
ater+0.066
Neg logit contributions
 Int-0.095
 Everything-0.092
 Sadly-0.091
 Everywhere-0.086
 Her-0.082
Token much
Token ID881
Feature activation+1.07e-01

Pos logit contributions
els+0.051
ps+0.044
 colored+0.042
 coloured+0.040
cks+0.039
Neg logit contributions
 Int-0.033
 Sadly-0.032
 Everything-0.031
Sadly-0.030
 Curious-0.029
Token clutter
Token ID45343
Feature activation+1.74e-01

Pos logit contributions
els+0.168
cks+0.153
bones+0.138
ck+0.135
ps+0.128
Neg logit contributions
 Int-0.207
 Everything-0.201
 Sadly-0.196
 Everywhere-0.196
 She-0.195
Token.
Token ID13
Feature activation-8.95e-02

Pos logit contributions
els+0.236
ck+0.213
cks+0.205
 colored+0.188
 stamp+0.185
Neg logit contributions
 He-0.299
 Everywhere-0.299
 She-0.291
 Int-0.283
 couldn-0.280
Token You
Token ID921
Feature activation+7.85e-03

Pos logit contributions
 Int+0.126
 Curious+0.108
 Sadly+0.101
 Everywhere+0.098
 Everything+0.098
Neg logit contributions
els-0.206
 on-0.184
ps-0.182
 inside-0.181
 colored-0.177
Token can
Token ID460
Feature activation-9.00e-03

Pos logit contributions
els+0.013
ps+0.011
 on+0.011
 colored+0.011
 inside+0.011
Neg logit contributions
 Int-0.016
 Curious-0.015
 Sadly-0.014
 Everything-0.014
 Everywhere-0.014
Token trip
Token ID5296
Feature activation+1.07e-01

Pos logit contributions
 Int+0.018
 Curious+0.016
 Sadly+0.015
 Everything+0.015
 Everywhere+0.015
Neg logit contributions
els-0.016
ps-0.014
 on-0.014
 inside-0.013
 colored-0.013
Token and
Token ID290
Feature activation-1.21e-02

Pos logit contributions
els+0.175
ps+0.135
cks+0.132
 colored+0.130
ries+0.128
Neg logit contributions
 Int-0.224
 Everywhere-0.206
 Sadly-0.205
 Everything-0.204
 Curious-0.203
Token treasure
Token ID14068
Feature activation+1.09e-01

Pos logit contributions
 Int+0.030
 Curious+0.028
 Everything+0.027
 Sadly+0.026
 Everywhere+0.026
Neg logit contributions
els-0.027
ps-0.022
 on-0.021
 colored-0.021
 inside-0.021
Token is
Token ID318
Feature activation+2.50e-02

Pos logit contributions
els+0.178
 colored+0.145
ps+0.142
 cream+0.133
 on+0.132
Neg logit contributions
 Int-0.215
 Curious-0.193
 Everything-0.189
 Sadly-0.186
 Everywhere-0.186
Token not
Token ID407
Feature activation+3.45e-02

Pos logit contributions
els+0.049
ries+0.037
ishing+0.036
ater+0.036
ps+0.035
Neg logit contributions
 Int-0.040
 Everywhere-0.037
 Everything-0.037
 Curious-0.037
 Sadly-0.036
Token gold
Token ID3869
Feature activation+1.71e-01

Pos logit contributions
els+0.069
ps+0.050
ries+0.050
bones+0.049
 colored+0.048
Neg logit contributions
 Int-0.058
 Curious-0.052
 Everything-0.051
 Everywhere-0.051
 Sadly-0.049
Token or
Token ID393
Feature activation+2.05e-02

Pos logit contributions
els+0.245
bones+0.205
cks+0.200
 colored+0.197
ps+0.194
Neg logit contributions
 He-0.312
 Int-0.297
ete-0.296
 So-0.288
 Everywhere-0.283
Token jewels
Token ID42701
Feature activation+1.73e-01

Pos logit contributions
els+0.033
ps+0.026
ishing+0.025
 colored+0.024
ier+0.024
Neg logit contributions
 Int-0.041
 Curious-0.038
 Everywhere-0.038
 Everything-0.036
 Sadly-0.035
Token,"
Token ID553
Feature activation-8.46e-02

Pos logit contributions
els+0.276
bones+0.214
ps+0.211
 colored+0.208
cks+0.205
Neg logit contributions
 Int-0.316
 He-0.302
 She-0.299
ete-0.297
 couldn-0.289
Token L
Token ID406
Feature activation-2.20e-02

Pos logit contributions
 Int+0.169
 Curious+0.153
 Everything+0.147
 Sadly+0.147
 Everywhere+0.145
Neg logit contributions
els-0.150
ps-0.127
 on-0.125
 inside-0.124
 colored-0.124
Tokenila
Token ID10102
Feature activation-1.54e-02

Pos logit contributions
 Int+0.054
 Curious+0.050
 Sadly+0.049
 Everything+0.049
 Everywhere+0.048
Neg logit contributions
els-0.029
ps-0.023
 on-0.023
 inside-0.022
 colored-0.022
Token says
Token ID1139
Feature activation+1.44e-02

Pos logit contributions
 Int+0.033
 Curious+0.030
 Everything+0.029
 Sadly+0.029
 Everywhere+0.029
Neg logit contributions
els-0.024
ps-0.020
 on-0.020
 inside-0.020
 colored-0.020
Token.
Token ID13
Feature activation-9.17e-02

Pos logit contributions
els+0.014
 colored+0.013
 coloured+0.011
 cream+0.010
ries+0.010
Neg logit contributions
 Int-0.038
 Curious-0.036
 Her-0.034
 Sadly-0.034
 Everything-0.034
Token dreamed
Token ID27947
Feature activation+6.95e-02

Pos logit contributions
 Int+0.062
 Curious+0.055
 Everywhere+0.054
 Everything+0.054
 Sadly+0.052
Neg logit contributions
els-0.056
 on-0.053
 inside-0.052
ps-0.050
 colored-0.050
Token of
Token ID286
Feature activation+5.58e-02

Pos logit contributions
els+0.131
 colored+0.107
cks+0.106
ps+0.106
ws+0.101
Neg logit contributions
 Int-0.132
 Everything-0.122
 Curious-0.118
 Sadly-0.118
 Everywhere-0.118
Token rain
Token ID6290
Feature activation+1.21e-01

Pos logit contributions
els+0.095
ater+0.080
ier+0.078
ishing+0.077
ws+0.076
Neg logit contributions
 Int-0.096
 Everything-0.090
 Everywhere-0.089
 Curious-0.086
 Sadly-0.086
Tokenbows
Token ID25435
Feature activation+1.55e-01

Pos logit contributions
els+0.178
cks+0.127
ps+0.123
ws+0.116
ocol+0.114
Neg logit contributions
 Int-0.274
 Everything-0.251
 Curious-0.250
 Everywhere-0.245
 Sadly-0.245
Token and
Token ID290
Feature activation+1.88e-02

Pos logit contributions
els+0.225
ws+0.172
cks+0.167
ps+0.163
bones+0.154
Neg logit contributions
 She-0.301
 Int-0.299
 All-0.284
 Everything-0.282
 Everywhere-0.281
Token gold
Token ID3869
Feature activation+1.73e-01

Pos logit contributions
els+0.032
ps+0.025
 inside+0.024
 on+0.024
 cream+0.024
Neg logit contributions
 Int-0.038
 Curious-0.035
 Everything-0.034
 Everywhere-0.033
 Sadly-0.033
Token and
Token ID290
Feature activation+2.07e-02

Pos logit contributions
els+0.189
cks+0.181
bones+0.165
ws+0.164
ck+0.164
Neg logit contributions
 He-0.329
 She-0.315
 Int-0.310
 Everything-0.295
 Everywhere-0.289
Token sour
Token ID11348
Feature activation+1.23e-01

Pos logit contributions
els+0.035
ps+0.028
ishing+0.026
 inside+0.026
ries+0.026
Neg logit contributions
 Int-0.042
 Curious-0.039
 Everything-0.038
 Everywhere-0.037
 Sadly-0.037
Token apples
Token ID22514
Feature activation+1.36e-01

Pos logit contributions
els+0.153
cks+0.150
ck+0.134
ps+0.122
ge+0.121
Neg logit contributions
 Int-0.249
 Everything-0.244
 He-0.242
 Curious-0.239
 She-0.235
Token.
Token ID13
Feature activation-9.13e-02

Pos logit contributions
els+0.175
ps+0.126
cks+0.123
 colored+0.121
ishing+0.117
Neg logit contributions
 She-0.291
 Int-0.289
 He-0.280
 Everywhere-0.275
 Sadly-0.271
Token<|endoftext|>
Token ID50256
Feature activation-2.31e-02

Pos logit contributions
 Int+0.138
ete+0.129
rolling+0.125
ishly+0.124
Quick+0.114
Neg logit contributions
.-0.187
 inside-0.170
 on-0.169
."-0.158
 chocolate-0.154
TokenMaybe
Token ID13300
Feature activation+1.01e-03

Pos logit contributions
 Int+0.067
 Curious+0.057
 Everything+0.053
 Sadly+0.053
 Everywhere+0.052
Neg logit contributions
els-0.125
ps-0.110
 on-0.109
 inside-0.109
 colored-0.108
Token the
Token ID262
Feature activation-1.51e-02

Pos logit contributions
els+0.002
ps+0.002
ws+0.002
bones+0.002
cks+0.002
Neg logit contributions
 Int-0.001
 Curious-0.001
 Everything-0.001
Of-0.001
 Everywhere-0.001
Token treasure
Token ID14068
Feature activation+1.09e-01

Pos logit contributions
 Int+0.030
 Curious+0.028
 Everything+0.027
 Sadly+0.026
 Everywhere+0.026
Neg logit contributions
els-0.027
ps-0.022
 on-0.021
 colored-0.021
 inside-0.021
Token is
Token ID318
Feature activation+2.50e-02

Pos logit contributions
els+0.178
 colored+0.145
ps+0.142
 cream+0.133
 on+0.132
Neg logit contributions
 Int-0.215
 Curious-0.193
 Everything-0.189
 Sadly-0.186
 Everywhere-0.186
Token not
Token ID407
Feature activation+3.45e-02

Pos logit contributions
els+0.049
ries+0.037
ishing+0.036
ater+0.036
ps+0.035
Neg logit contributions
 Int-0.040
 Everywhere-0.037
 Everything-0.037
 Curious-0.037
 Sadly-0.036
Token gold
Token ID3869
Feature activation+1.71e-01

Pos logit contributions
els+0.069
ps+0.050
ries+0.050
bones+0.049
 colored+0.048
Neg logit contributions
 Int-0.058
 Curious-0.052
 Everything-0.051
 Everywhere-0.051
 Sadly-0.049
Token or
Token ID393
Feature activation+2.05e-02

Pos logit contributions
els+0.245
bones+0.205
cks+0.200
 colored+0.197
ps+0.194
Neg logit contributions
 He-0.312
 Int-0.297
ete-0.296
 So-0.288
 Everywhere-0.283
Token jewels
Token ID42701
Feature activation+1.73e-01

Pos logit contributions
els+0.033
ps+0.026
ishing+0.025
 colored+0.024
ier+0.024
Neg logit contributions
 Int-0.041
 Curious-0.038
 Everywhere-0.038
 Everything-0.036
 Sadly-0.035
Token,"
Token ID553
Feature activation-8.46e-02

Pos logit contributions
els+0.276
bones+0.214
ps+0.211
 colored+0.208
cks+0.205
Neg logit contributions
 Int-0.316
 He-0.302
 She-0.299
ete-0.297
 couldn-0.289
Token L
Token ID406
Feature activation-2.20e-02

Pos logit contributions
 Int+0.169
 Curious+0.153
 Everything+0.147
 Sadly+0.147
 Everywhere+0.145
Neg logit contributions
els-0.150
ps-0.127
 on-0.125
 inside-0.124
 colored-0.124
Tokenila
Token ID10102
Feature activation-1.54e-02

Pos logit contributions
 Int+0.054
 Curious+0.050
 Sadly+0.049
 Everything+0.049
 Everywhere+0.048
Neg logit contributions
els-0.029
ps-0.023
 on-0.023
 inside-0.022
 colored-0.022
Token nice
Token ID3621
Feature activation+8.54e-02

Pos logit contributions
els+0.059
ps+0.045
 colored+0.045
ries+0.043
 inside+0.042
Neg logit contributions
 Int-0.089
 Curious-0.082
 Sadly-0.082
 Everything-0.081
Of-0.081
Token and
Token ID290
Feature activation-1.59e-04

Pos logit contributions
els+0.148
cks+0.131
bones+0.121
ps+0.119
 colored+0.119
Neg logit contributions
 Int-0.152
 Everything-0.146
 Sadly-0.144
 Curious-0.141
 She-0.140
Token not
Token ID407
Feature activation+2.59e-02

Pos logit contributions
 Int+0.000
 Curious+0.000
 Sadly+0.000
Of+0.000
 Everything+0.000
Neg logit contributions
els-0.000
ps-0.000
 colored-0.000
 on-0.000
 inside-0.000
Token fight
Token ID1907
Feature activation+9.39e-02

Pos logit contributions
els+0.052
ps+0.043
ries+0.040
bones+0.039
 sticker+0.039
Neg logit contributions
 Int-0.042
 Sadly-0.039
 Curious-0.038
 Everything-0.038
Of-0.037
Token over
Token ID625
Feature activation+4.41e-02

Pos logit contributions
els+0.134
 colored+0.108
 frost+0.105
 coloured+0.104
 icing+0.103
Neg logit contributions
 Int-0.188
 Everywhere-0.183
 Everything-0.182
 He-0.176
 She-0.172
Token gold
Token ID3869
Feature activation+1.70e-01

Pos logit contributions
els+0.055
ps+0.055
ws+0.052
 colored+0.050
 cream+0.048
Neg logit contributions
 Int-0.089
 Everything-0.083
 Curious-0.082
 Everywhere-0.079
 Sadly-0.078
Token.
Token ID13
Feature activation-8.84e-02

Pos logit contributions
cks+0.196
ps+0.195
els+0.187
 colored+0.185
 coloured+0.172
Neg logit contributions
 He-0.317
 So-0.288
 Int-0.286
hog-0.286
 She-0.278
Token
Token ID198
Feature activation-5.50e-02

Pos logit contributions
 Int+0.133
 Curious+0.113
ete+0.108
rolling+0.103
 Everything+0.103
Neg logit contributions
els-0.186
.-0.180
 on-0.175
 inside-0.170
 sticker-0.166
Token
Token ID198
Feature activation-7.87e-02

Pos logit contributions
 Int+0.095
 Curious+0.085
 Everything+0.079
 Sadly+0.076
 Everywhere+0.076
Neg logit contributions
els-0.108
.-0.104
 on-0.100
 inside-0.098
 sticker-0.093
TokenThey
Token ID2990
Feature activation-3.23e-02

Pos logit contributions
 Int+0.149
 Curious+0.134
 Everything+0.128
 Everywhere+0.127
 Sadly+0.122
Neg logit contributions
els-0.151
 on-0.137
 cream-0.133
.-0.131
 inside-0.130
Token decide
Token ID5409
Feature activation-6.36e-04

Pos logit contributions
 Int+0.071
 Curious+0.065
 Everything+0.062
 Everywhere+0.061
 Sadly+0.061
Neg logit contributions
els-0.050
 on-0.043
 inside-0.043
ps-0.043
 colored-0.041
Token,
Token ID11
Feature activation-3.18e-02

Pos logit contributions
els+0.104
ps+0.099
cks+0.094
bones+0.086
ws+0.079
Neg logit contributions
 Int-0.095
 Curious-0.091
 Exc-0.087
 He-0.087
 Everywhere-0.087
Token you
Token ID345
Feature activation+7.89e-03

Pos logit contributions
 Int+0.054
 Curious+0.048
 Everything+0.047
 Sadly+0.046
 Everywhere+0.045
Neg logit contributions
els-0.065
ps-0.057
 colored-0.055
 on-0.054
 inside-0.054
Token will
Token ID481
Feature activation+2.34e-03

Pos logit contributions
els+0.014
ps+0.012
 on+0.012
 inside+0.012
 colored+0.012
Neg logit contributions
 Int-0.015
 Curious-0.014
 Everything-0.013
 Sadly-0.013
 Everywhere-0.013
Token get
Token ID651
Feature activation+6.39e-02

Pos logit contributions
els+0.004
ps+0.004
 on+0.004
 colored+0.004
 inside+0.004
Neg logit contributions
 Int-0.004
 Curious-0.004
 Sadly-0.004
 Everything-0.004
 Everywhere-0.004
Token more
Token ID517
Feature activation+8.73e-02

Pos logit contributions
els+0.111
 colored+0.091
ps+0.091
 coloured+0.083
cks+0.082
Neg logit contributions
 Int-0.121
 Sadly-0.112
 Curious-0.110
 Everything-0.107
Of-0.103
Token gold
Token ID3869
Feature activation+1.70e-01

Pos logit contributions
els+0.162
ps+0.156
bones+0.138
cks+0.130
ries+0.121
Neg logit contributions
 Int-0.130
 Curious-0.123
ete-0.123
 Everywhere-0.121
 He-0.121
Token."
Token ID526
Feature activation-1.13e-01

Pos logit contributions
els+0.245
bones+0.215
 colored+0.211
cks+0.209
ps+0.196
Neg logit contributions
ete-0.304
 He-0.297
 Int-0.289
 So-0.282
 Everywhere-0.280
Token The
Token ID383
Feature activation-4.19e-02

Pos logit contributions
 Int+0.197
 Curious+0.174
rolling+0.170
 Everything+0.164
 Everywhere+0.162
Neg logit contributions
.-0.203
els-0.199
 on-0.196
 inside-0.191
 cream-0.183
Token boy
Token ID2933
Feature activation-1.43e-02

Pos logit contributions
 Int+0.062
 Curious+0.061
 Everything+0.054
 Sadly+0.053
 Everywhere+0.051
Neg logit contributions
els-0.098
cks-0.086
ps-0.085
ries-0.080
ws-0.080
Token was
Token ID373
Feature activation+1.19e-03

Pos logit contributions
 Int+0.027
 Everywhere+0.025
 Everything+0.024
 Curious+0.024
 Sadly+0.023
Neg logit contributions
els-0.025
 on-0.023
 colored-0.022
 cream-0.022
ps-0.022
Token greedy
Token ID31828
Feature activation+8.77e-02

Pos logit contributions
els+0.002
ps+0.002
 on+0.002
 colored+0.002
 inside+0.002
Neg logit contributions
 Int-0.002
 Curious-0.002
 Sadly-0.002
 Everything-0.002
 Everywhere-0.002
Token.
Token ID13
Feature activation-6.39e-02

Pos logit contributions
els+0.088
ps+0.076
cks+0.067
 colored+0.063
bones+0.061
Neg logit contributions
 Int-0.113
 Curious-0.104
 Everywhere-0.102
Of-0.102
 Everything-0.099
Token I
Token ID314
Feature activation-5.74e-03

Pos logit contributions
 Int+0.083
 Curious+0.071
 Sadly+0.066
 Everything+0.066
 Everywhere+0.064
Neg logit contributions
els-0.157
ps-0.139
 on-0.139
 inside-0.137
 colored-0.137
Token don
Token ID836
Feature activation+2.48e-02

Pos logit contributions
 Int+0.009
 Sadly+0.009
 Curious+0.009
 Everything+0.008
Of+0.008
Neg logit contributions
els-0.012
ps-0.010
ries-0.010
cks-0.010
 colored-0.009
Token't
Token ID470
Feature activation+1.10e-02

Pos logit contributions
els+0.061
ps+0.054
 on+0.054
 colored+0.054
 inside+0.054
Neg logit contributions
 Int-0.032
 Curious-0.027
 Everything-0.025
 Sadly-0.025
 Everywhere-0.025
Token need
Token ID761
Feature activation+7.78e-02

Pos logit contributions
els+0.023
ries+0.020
ps+0.020
cks+0.019
 colored+0.018
Neg logit contributions
 Int-0.017
 Sadly-0.016
 Everything-0.016
 Curious-0.016
 Everywhere-0.015
Token gold
Token ID3869
Feature activation+1.69e-01

Pos logit contributions
els+0.126
 colored+0.111
ps+0.111
ishing+0.107
ating+0.104
Neg logit contributions
 Int-0.147
 Everywhere-0.132
 Everything-0.131
Of-0.127
 Curious-0.122
Token ice
Token ID4771
Feature activation+1.49e-01

Pos logit contributions
els+0.205
 colored+0.205
cks+0.195
ps+0.186
 coloured+0.185
Neg logit contributions
 He-0.315
Of-0.294
 So-0.288
 Int-0.284
ete-0.282
Token cream
Token ID8566
Feature activation+1.27e-01

Pos logit contributions
els+0.172
ps+0.151
 colored+0.139
cks+0.137
 harm+0.131
Neg logit contributions
Of-0.293
 He-0.288
ete-0.273
 So-0.268
 Everywhere-0.264
Token.
Token ID13
Feature activation-6.64e-02

Pos logit contributions
els+0.186
 colored+0.160
cks+0.159
ps+0.156
 coloured+0.148
Neg logit contributions
Of-0.244
 He-0.243
 So-0.230
 Everywhere-0.227
 Int-0.226
Token I
Token ID314
Feature activation-4.56e-03

Pos logit contributions
 Int+0.086
 Curious+0.073
 Sadly+0.068
 Everything+0.067
 Everywhere+0.066
Neg logit contributions
els-0.161
 on-0.144
ps-0.143
 inside-0.141
 colored-0.139
Token have
Token ID423
Feature activation+3.12e-02

Pos logit contributions
 Int+0.008
 Curious+0.007
 Sadly+0.007
 Everything+0.007
 Everywhere+0.007
Neg logit contributions
els-0.009
ps-0.008
ries-0.007
 colored-0.007
 cream-0.007
Token and
Token ID290
Feature activation+1.94e-02

Pos logit contributions
els+0.208
ishing+0.173
ocol+0.166
 harm+0.162
ps+0.161
Neg logit contributions
 She-0.241
 He-0.229
 ï¿½-0.220
 So-0.216
 Her-0.215
Token ice
Token ID4771
Feature activation+1.39e-01

Pos logit contributions
els+0.032
ishing+0.025
ps+0.025
cks+0.024
ier+0.024
Neg logit contributions
 Int-0.038
 Curious-0.035
 Everything-0.033
 Everywhere-0.032
 Sadly-0.032
Token cre
Token ID1126
Feature activation+1.28e-01

Pos logit contributions
els+0.154
 harm+0.148
ening+0.143
rum+0.136
 trace+0.128
Neg logit contributions
 Int-0.219
 So-0.216
ables-0.212
 He-0.211
Of-0.210
Tokenams
Token ID4105
Feature activation+1.12e-01

Pos logit contributions
 thing+0.221
 sticker+0.219
 happen+0.213
 paper+0.211
ps+0.211
Neg logit contributions
 Int-0.176
 Sadly-0.153
Lots-0.146
Da-0.143
 Later-0.143
Token and
Token ID290
Feature activation+1.73e-02

Pos logit contributions
els+0.106
ps+0.076
 colored+0.075
cks+0.072
ws+0.071
Neg logit contributions
 Int-0.278
 Everything-0.259
 Curious-0.257
 Everywhere-0.252
 She-0.251
Token cand
Token ID2658
Feature activation+1.69e-01

Pos logit contributions
els+0.029
ier+0.023
ishing+0.022
ries+0.021
ps+0.021
Neg logit contributions
 Int-0.030
 Curious-0.028
 Everything-0.027
Of-0.026
 bust-0.026
Tokenies
Token ID444
Feature activation+1.05e-01

Pos logit contributions
bones+0.204
els+0.202
 thing+0.197
 harm+0.197
 colored+0.188
Neg logit contributions
ete-0.275
 Int-0.262
rolling-0.260
So-0.251
 couldn-0.236
Token and
Token ID290
Feature activation+2.12e-02

Pos logit contributions
els+0.129
ws+0.089
 colored+0.089
ps+0.085
ishing+0.084
Neg logit contributions
 Int-0.225
 She-0.217
 Curious-0.215
 Everything-0.214
 Everywhere-0.211
Token fruits
Token ID15921
Feature activation+1.36e-01

Pos logit contributions
els+0.032
ier+0.025
ishing+0.024
cks+0.024
ps+0.024
Neg logit contributions
 Int-0.041
 Curious-0.038
 Everything-0.037
 Everywhere-0.037
 Sadly-0.036
Token and
Token ID290
Feature activation+2.23e-02

Pos logit contributions
els+0.178
ps+0.138
 colored+0.137
cks+0.129
ishing+0.127
Neg logit contributions
 She-0.279
 He-0.269
 Int-0.266
 ï¿½-0.252
 Her-0.252
Token vegetables
Token ID13701
Feature activation+1.28e-01

Pos logit contributions
els+0.033
ps+0.026
cks+0.025
ishing+0.025
ier+0.025
Neg logit contributions
 Int-0.045
 Curious-0.042
 Everything-0.041
 Everywhere-0.040
Of-0.039
Token together
Token ID1978
Feature activation+3.53e-02

Pos logit contributions
els+0.128
 colored+0.101
ps+0.100
cks+0.091
 coloured+0.086
Neg logit contributions
 Int-0.250
 Everywhere-0.234
 Everything-0.233
 Sadly-0.233
ete-0.230
Token.
Token ID13
Feature activation-1.23e-01

Pos logit contributions
els+0.043
ps+0.032
 colored+0.029
ries+0.027
cks+0.027
Neg logit contributions
 Int-0.085
 Everywhere-0.080
 Curious-0.079
 Everything-0.079
 Sadly-0.079
Token They
Token ID1119
Feature activation-5.45e-02

Pos logit contributions
 Int+0.170
 Curious+0.144
ete+0.136
rolling+0.135
Sadly+0.134
Neg logit contributions
els-0.253
.-0.251
 on-0.242
."-0.232
 inside-0.230
Token cannot
Token ID2314
Feature activation-1.77e-03

Pos logit contributions
 Int+0.107
 Curious+0.097
 Sadly+0.093
 Everything+0.093
 Everywhere+0.092
Neg logit contributions
els-0.097
ps-0.082
 on-0.082
 inside-0.080
 colored-0.080
Token afford
Token ID5368
Feature activation+8.13e-02

Pos logit contributions
 Int+0.003
 Sadly+0.003
 Curious+0.003
 Everything+0.003
 Everywhere+0.003
Neg logit contributions
els-0.003
ps-0.003
 colored-0.003
ries-0.003
 on-0.003
Token gold
Token ID3869
Feature activation+1.69e-01

Pos logit contributions
els+0.166
ishing+0.140
ps+0.140
 colored+0.139
cks+0.136
Neg logit contributions
 Everything-0.115
 Everywhere-0.110
 Int-0.106
 Sadly-0.102
 Unfortunately-0.098
Token ice
Token ID4771
Feature activation+1.30e-01

Pos logit contributions
cks+0.228
ps+0.223
els+0.221
 colored+0.214
cking+0.197
Neg logit contributions
 He-0.275
ete-0.255
 Everywhere-0.247
 So-0.236
 could-0.230
Token cream
Token ID8566
Feature activation+1.17e-01

Pos logit contributions
els+0.149
ps+0.139
 colored+0.110
cks+0.108
 harm+0.104
Neg logit contributions
ete-0.251
 He-0.244
Of-0.242
 Int-0.238
 Everywhere-0.238
Token.
Token ID13
Feature activation-1.10e-01

Pos logit contributions
els+0.167
ps+0.136
cks+0.132
 colored+0.127
ck+0.122
Neg logit contributions
 Int-0.232
ete-0.228
 Everywhere-0.224
 He-0.222
 Sadly-0.222
Token
Token ID198
Feature activation-7.42e-02

Pos logit contributions
 Int+0.176
 Curious+0.153
rolling+0.146
ete+0.142
Of+0.140
Neg logit contributions
els-0.208
.-0.204
 on-0.189
."-0.186
 inside-0.185
Token
Token ID198
Feature activation-9.11e-02

Pos logit contributions
 Int+0.133
 Curious+0.119
 Everything+0.109
ishly+0.108
 Everywhere+0.106
Neg logit contributions
els-0.137
.-0.133
 on-0.126
 inside-0.125
 sticker-0.115
Token.
Token ID13
Feature activation-7.20e-02

Pos logit contributions
els+0.106
ps+0.082
 colored+0.081
bones+0.078
 coloured+0.075
Neg logit contributions
 Int-0.167
 Sadly-0.159
 So-0.156
 Curious-0.155
�-0.150
Token And
Token ID843
Feature activation-2.49e-02

Pos logit contributions
 Int+0.107
 Curious+0.092
 Sadly+0.084
rolling+0.083
ete+0.082
Neg logit contributions
els-0.162
 on-0.143
 inside-0.141
.-0.140
ps-0.140
Token you
Token ID345
Feature activation+5.89e-03

Pos logit contributions
 Int+0.040
 Curious+0.036
 Everything+0.035
 Sadly+0.035
 Everywhere+0.034
Neg logit contributions
els-0.052
ps-0.046
 colored-0.044
 inside-0.043
 on-0.043
Token wasted
Token ID18359
Feature activation+1.16e-01

Pos logit contributions
els+0.011
 on+0.010
ps+0.010
 inside+0.009
 colored+0.009
Neg logit contributions
 Int-0.011
 Curious-0.010
 Everything-0.009
 Sadly-0.009
 Everywhere-0.009
Token the
Token ID262
Feature activation+3.29e-02

Pos logit contributions
els+0.202
cks+0.161
ps+0.157
 colored+0.156
ishing+0.152
Neg logit contributions
 Int-0.206
 Everywhere-0.193
 Everything-0.192
 Sadly-0.188
 Curious-0.182
Token cand
Token ID2658
Feature activation+1.68e-01

Pos logit contributions
els+0.052
cks+0.041
ps+0.039
bones+0.036
 colored+0.036
Neg logit contributions
 Int-0.071
 Curious-0.067
 Sadly-0.066
 Everywhere-0.063
Of-0.063
Tokenies
Token ID444
Feature activation+1.03e-01

Pos logit contributions
els+0.196
bones+0.184
 colored+0.179
ps+0.174
cks+0.166
Neg logit contributions
ete-0.307
 Int-0.300
mber-0.283
 Sadly-0.279
 couldn-0.277
Token I
Token ID314
Feature activation+1.05e-02

Pos logit contributions
els+0.155
bones+0.119
 colored+0.118
cks+0.114
ps+0.113
Neg logit contributions
 Int-0.204
 Sadly-0.196
�-0.191
 Everywhere-0.190
 Curious-0.187
Token bought
Token ID5839
Feature activation+9.76e-02

Pos logit contributions
els+0.022
ps+0.018
ries+0.018
 on+0.018
 colored+0.017
Neg logit contributions
 Int-0.017
 Curious-0.015
 Sadly-0.015
 Everything-0.014
Of-0.014
Token for
Token ID329
Feature activation+4.58e-02

Pos logit contributions
els+0.136
 colored+0.105
cks+0.103
ps+0.101
 coloured+0.091
Neg logit contributions
 Int-0.201
 Everywhere-0.191
 Everything-0.189
 Sadly-0.187
ete-0.184
Token your
Token ID534
Feature activation+4.12e-02

Pos logit contributions
els+0.089
 colored+0.075
ps+0.075
 inside+0.073
bones+0.072
Neg logit contributions
 Int-0.071
 Everything-0.063
ete-0.063
 Sadly-0.063
 Everywhere-0.061
Token real
Token ID1103
Feature activation+8.23e-02

Pos logit contributions
els+0.060
ps+0.049
ries+0.046
cks+0.045
ws+0.044
Neg logit contributions
 Int-0.067
 Curious-0.063
 Sadly-0.061
 Everything-0.058
Of-0.057
Token treasure
Token ID14068
Feature activation+1.35e-01

Pos logit contributions
els+0.156
cks+0.129
ps+0.127
ries+0.125
ws+0.122
Neg logit contributions
 Int-0.147
 Curious-0.140
 Sadly-0.131
 Everything-0.126
 He-0.124
Token chest
Token ID7721
Feature activation+1.26e-01

Pos logit contributions
els+0.185
ws+0.141
ps+0.140
ries+0.134
 colored+0.132
Neg logit contributions
 Int-0.275
 Curious-0.256
hog-0.255
 Sadly-0.253
 couldn-0.250
Token filled
Token ID5901
Feature activation+8.59e-02

Pos logit contributions
els+0.173
ws+0.132
ries+0.122
ps+0.120
cks+0.112
Neg logit contributions
 couldn-0.264
 Int-0.258
hog-0.246
 Curious-0.246
 Sadly-0.243
Token with
Token ID351
Feature activation+5.41e-02

Pos logit contributions
els+0.124
ps+0.106
ier+0.096
ws+0.092
ries+0.088
Neg logit contributions
 Int-0.182
 Curious-0.169
Da-0.168
 Everywhere-0.168
 Exc-0.167
Token gold
Token ID3869
Feature activation+1.68e-01

Pos logit contributions
els+0.082
ps+0.064
cks+0.062
ries+0.061
ws+0.060
Neg logit contributions
 Int-0.111
 Curious-0.106
 Everywhere-0.102
 Everything-0.099
 Sadly-0.098
Token coins
Token ID10796
Feature activation+1.58e-01

Pos logit contributions
els+0.231
cks+0.193
ws+0.191
bones+0.186
 harm+0.186
Neg logit contributions
 He-0.288
 Int-0.277
hog-0.266
 Curious-0.263
 So-0.257
Token and
Token ID290
Feature activation-5.59e-03

Pos logit contributions
els+0.212
ws+0.173
ries+0.157
ps+0.157
 colored+0.152
Neg logit contributions
 couldn-0.290
 Int-0.287
 He-0.277
 Curious-0.276
 She-0.270
Token jewels
Token ID42701
Feature activation+1.53e-01

Pos logit contributions
 Int+0.013
 Curious+0.013
 Everywhere+0.012
 Sadly+0.012
 Everything+0.012
Neg logit contributions
els-0.008
ps-0.006
cks-0.005
ws-0.005
ries-0.005
Token.
Token ID13
Feature activation-1.19e-01

Pos logit contributions
els+0.206
ws+0.165
cks+0.146
ps+0.140
ries+0.138
Neg logit contributions
 Int-0.300
 couldn-0.295
 She-0.290
 Curious-0.286
 He-0.286
Token
Token ID220
Feature activation-5.15e-02

Pos logit contributions
ete+0.210
rolling+0.209
 Int+0.203
Harry+0.191
Quick+0.190
Neg logit contributions
.-0.206
 inside-0.192
 on-0.190
."-0.169
 toys-0.159
Token to
Token ID284
Feature activation+2.06e-02

Pos logit contributions
els+0.202
ps+0.164
cks+0.155
 colored+0.154
 frost+0.149
Neg logit contributions
 Int-0.301
 Everywhere-0.299
 Everything-0.280
 Sadly-0.279
 Curious-0.271
Token be
Token ID307
Feature activation+6.16e-02

Pos logit contributions
els+0.037
ps+0.031
 colored+0.030
 on+0.030
 cream+0.029
Neg logit contributions
 Int-0.040
 Curious-0.035
 Sadly-0.035
 Everything-0.035
 Everywhere-0.034
Token restored
Token ID15032
Feature activation+1.11e-01

Pos logit contributions
els+0.115
ps+0.095
ries+0.092
bones+0.090
 icing+0.088
Neg logit contributions
 Int-0.107
 Sadly-0.098
 Curious-0.096
 Everything-0.091
 Everywhere-0.090
Token to
Token ID284
Feature activation+1.94e-02

Pos logit contributions
els+0.147
ps+0.117
cks+0.115
bones+0.113
 colored+0.108
Neg logit contributions
 Everywhere-0.238
 Int-0.235
 Sadly-0.224
 Everything-0.222
 Curious-0.220
Token its
Token ID663
Feature activation+2.00e-02

Pos logit contributions
els+0.035
ps+0.029
 cream+0.028
ries+0.027
 colored+0.027
Neg logit contributions
 Int-0.036
 Everything-0.034
 Sadly-0.032
 Curious-0.032
 Everywhere-0.032
Token former
Token ID1966
Feature activation+1.68e-01

Pos logit contributions
els+0.039
ps+0.034
cks+0.032
 on+0.032
 colored+0.032
Neg logit contributions
 Int-0.034
 Sadly-0.032
 Curious-0.032
 Everywhere-0.031
 Everything-0.030
Token glory
Token ID13476
Feature activation+1.22e-01

Pos logit contributions
els+0.308
 on+0.291
ps+0.276
 inside+0.274
 sticker+0.272
Neg logit contributions
 Int-0.278
 Everywhere-0.274
 Sadly-0.261
 Curious-0.250
 Everything-0.250
Token."
Token ID526
Feature activation-1.24e-01

Pos logit contributions
els+0.129
ps+0.105
bones+0.092
cks+0.090
 colored+0.089
Neg logit contributions
 Int-0.297
 Everywhere-0.291
 Everything-0.278
 Curious-0.277
 Sadly-0.268
Token
Token ID220
Feature activation-6.03e-02

Pos logit contributions
 Int+0.207
Sadly+0.192
 concession+0.192
 Barkley+0.190
rolling+0.190
Neg logit contributions
.-0.242
 on-0.209
els-0.203
 inside-0.198
."-0.184
Token
Token ID198
Feature activation-7.77e-02

Pos logit contributions
 Int+0.106
 Curious+0.098
Of+0.095
Quick+0.091
Lots+0.087
Neg logit contributions
.-0.109
els-0.108
 inside-0.101
 on-0.100
."-0.090
Token
Token ID198
Feature activation-9.32e-02

Pos logit contributions
 Int+0.149
ishly+0.136
 Curious+0.135
 Everything+0.131
 Barkley+0.130
Neg logit contributions
.-0.131
 on-0.119
els-0.115
 inside-0.111
."-0.103
Token friendship
Token ID14738
Feature activation+9.54e-02

Pos logit contributions
 Int+0.019
 Curious+0.018
 Everything+0.018
 Sadly+0.018
 Everywhere+0.018
Neg logit contributions
els-0.017
ps-0.013
ries-0.013
 colored-0.013
 inside-0.012
Token are
Token ID389
Feature activation+3.32e-02

Pos logit contributions
els+0.114
 colored+0.077
ries+0.075
cks+0.075
bones+0.075
Neg logit contributions
 Int-0.224
 Everything-0.220
 Everywhere-0.219
 Curious-0.210
 Sadly-0.208
Token more
Token ID517
Feature activation+7.24e-02

Pos logit contributions
els+0.064
ries+0.051
ps+0.050
cks+0.048
bones+0.047
Neg logit contributions
 Sadly-0.047
 Int-0.047
 Everywhere-0.047
 Everything-0.047
 couldn-0.046
Token precious
Token ID14186
Feature activation+1.22e-01

Pos logit contributions
els+0.147
ries+0.123
ps+0.118
bones+0.111
cks+0.109
Neg logit contributions
 He-0.112
 From-0.112
 Everywhere-0.108
Of-0.107
 Sadly-0.107
Token than
Token ID621
Feature activation+5.77e-02

Pos logit contributions
els+0.190
cks+0.167
ps+0.150
bones+0.148
ck+0.144
Neg logit contributions
 He-0.219
 Everywhere-0.216
 So-0.216
Of-0.208
 Nice-0.206
Token gold
Token ID3869
Feature activation+1.68e-01

Pos logit contributions
els+0.096
cks+0.075
 colored+0.075
ps+0.075
ws+0.071
Neg logit contributions
 Int-0.103
 Everything-0.098
 Everywhere-0.096
 Curious-0.092
 Sadly-0.092
Token."
Token ID526
Feature activation-1.25e-01

Pos logit contributions
els+0.207
cks+0.193
bones+0.189
 colored+0.188
ps+0.185
Neg logit contributions
 He-0.308
 So-0.282
ete-0.279
 Everywhere-0.273
 People-0.266
Token
Token ID198
Feature activation-8.16e-02

Pos logit contributions
 Int+0.251
rolling+0.225
 Curious+0.221
ete+0.211
 Everywhere+0.203
Neg logit contributions
.-0.216
 on-0.194
 inside-0.185
els-0.183
."-0.176
Token
Token ID198
Feature activation-9.90e-02

Pos logit contributions
 Int+0.168
 Curious+0.151
rolling+0.142
 Everything+0.140
ishly+0.135
Neg logit contributions
.-0.138
 on-0.130
els-0.127
 inside-0.124
 cream-0.114
TokenL
Token ID43
Feature activation-2.60e-02

Pos logit contributions
 Int+0.203
 Curious+0.183
 Everything+0.177
 Everywhere+0.174
rolling+0.165
Neg logit contributions
els-0.166
.-0.155
 on-0.154
 cream-0.152
 colored-0.147
Tokenily
Token ID813
Feature activation-4.04e-02

Pos logit contributions
 Int+0.050
 Curious+0.045
 Everything+0.044
 Sadly+0.043
 Everywhere+0.043
Neg logit contributions
els-0.047
ps-0.040
 on-0.040
 inside-0.039
 colored-0.039
Token found
Token ID1043
Feature activation+4.39e-02

Pos logit contributions
 Int+0.081
 Curious+0.073
 Everything+0.070
 Sadly+0.070
 Everywhere+0.069
Neg logit contributions
els-0.073
 on-0.064
 inside-0.063
ps-0.063
 colored-0.062
Token a
Token ID257
Feature activation+2.71e-02

Pos logit contributions
els+0.072
ps+0.058
 colored+0.057
 cream+0.057
ries+0.055
Neg logit contributions
 Int-0.092
 Curious-0.085
 Everything-0.084
 Everywhere-0.083
 Sadly-0.081
Token chest
Token ID7721
Feature activation+1.11e-01

Pos logit contributions
els+0.047
ps+0.039
ries+0.036
 colored+0.035
cks+0.034
Neg logit contributions
 Int-0.057
 Curious-0.053
 Sadly-0.051
 Everything-0.050
 Everywhere-0.049
Token filled
Token ID5901
Feature activation+7.94e-02

Pos logit contributions
els+0.188
ps+0.149
ws+0.149
ries+0.147
cks+0.136
Neg logit contributions
 Int-0.219
 Curious-0.210
 Sadly-0.201
ete-0.200
rolling-0.194
Token with
Token ID351
Feature activation+5.44e-02

Pos logit contributions
els+0.117
ps+0.101
ier+0.090
ws+0.087
ries+0.085
Neg logit contributions
 Int-0.173
 Curious-0.162
 Everywhere-0.161
 Exc-0.156
 Sadly-0.155
Token gold
Token ID3869
Feature activation+1.67e-01

Pos logit contributions
els+0.085
ps+0.069
ries+0.063
cks+0.063
 cream+0.063
Neg logit contributions
 Int-0.115
 Curious-0.109
 Everywhere-0.106
 Everything-0.104
 Sadly-0.102
Token and
Token ID290
Feature activation-7.38e-03

Pos logit contributions
els+0.227
ws+0.195
cks+0.191
ps+0.185
ases+0.177
Neg logit contributions
 He-0.304
 Int-0.301
ete-0.290
 Curious-0.289
 Everywhere-0.274
Token jewels
Token ID42701
Feature activation+1.54e-01

Pos logit contributions
 Int+0.017
 Curious+0.016
 Everything+0.015
 Sadly+0.015
 Everywhere+0.015
Neg logit contributions
els-0.011
ps-0.009
 on-0.008
 cream-0.007
cks-0.007
Token.
Token ID13
Feature activation-1.07e-01

Pos logit contributions
els+0.199
ws+0.156
ps+0.139
cks+0.136
 cream+0.133
Neg logit contributions
 Int-0.350
 Curious-0.334
 He-0.322
 couldn-0.318
ete-0.312
Token From
Token ID3574
Feature activation+3.66e-03

Pos logit contributions
 Int+0.131
Da+0.129
Mos+0.118
Sadly+0.114
Quick+0.114
Neg logit contributions
.-0.223
 on-0.215
 inside-0.214
els-0.205
."-0.194
Token that
Token ID326
Feature activation-3.41e-02

Pos logit contributions
els+0.006
ps+0.005
 on+0.005
 colored+0.005
 inside+0.005
Neg logit contributions
 Int-0.008
 Curious-0.007
 Sadly-0.007
 Everything-0.007
 Everywhere-0.007
Token that
Token ID326
Feature activation-1.38e-02

Pos logit contributions
els+0.032
ps+0.027
 colored+0.026
 sticker+0.026
 on+0.025
Neg logit contributions
 Int-0.049
 Curious-0.045
 Everything-0.044
 Everywhere-0.043
 Sadly-0.042
Token it
Token ID340
Feature activation+1.34e-02

Pos logit contributions
 Int+0.029
 Curious+0.027
 Everything+0.027
 Sadly+0.026
 Everywhere+0.025
Neg logit contributions
els-0.022
ps-0.019
 colored-0.018
ishing-0.017
 cream-0.017
Token is
Token ID318
Feature activation+1.44e-02

Pos logit contributions
els+0.020
 cream+0.017
ps+0.016
 colored+0.016
 inside+0.016
Neg logit contributions
 Int-0.029
 Curious-0.027
 Everything-0.026
 Sadly-0.026
 Everywhere-0.025
Token not
Token ID407
Feature activation+2.18e-02

Pos logit contributions
els+0.030
ws+0.026
ishing+0.026
ries+0.025
ps+0.024
Neg logit contributions
 Int-0.018
 Curious-0.017
 Everywhere-0.017
 Everything-0.016
 Sadly-0.016
Token real
Token ID1103
Feature activation+9.80e-02

Pos logit contributions
els+0.046
ries+0.034
ws+0.033
ating+0.033
ases+0.032
Neg logit contributions
 Int-0.032
 Curious-0.029
 He-0.029
 His-0.026
 couldn-0.026
Token gold
Token ID3869
Feature activation+1.66e-01

Pos logit contributions
els+0.133
 cream+0.128
ep+0.123
cks+0.121
ps+0.119
Neg logit contributions
 He-0.194
 So-0.171
 Nice-0.169
 His-0.168
 Int-0.162
Token.
Token ID13
Feature activation-9.41e-02

Pos logit contributions
els+0.221
cks+0.210
 colored+0.199
ck+0.190
cking+0.189
Neg logit contributions
 He-0.312
ete-0.266
 So-0.264
 could-0.254
 Nice-0.249
Token It
Token ID632
Feature activation-1.91e-02

Pos logit contributions
 Int+0.106
rolling+0.086
 Curious+0.085
ete+0.083
 Sadly+0.081
Neg logit contributions
els-0.231
.-0.216
 on-0.216
 inside-0.213
 sticker-0.201
Token is
Token ID318
Feature activation+6.03e-03

Pos logit contributions
 Int+0.046
 Curious+0.042
 Everything+0.041
 Sadly+0.041
 Everywhere+0.041
Neg logit contributions
els-0.026
ps-0.021
 on-0.021
 inside-0.020
 colored-0.020
Token chocolate
Token ID11311
Feature activation+1.12e-01

Pos logit contributions
els+0.011
ishing+0.009
ries+0.009
ps+0.009
ws+0.009
Neg logit contributions
 Int-0.010
 Everywhere-0.009
 Curious-0.009
 Everything-0.009
 ï¿½-0.008
Token!
Token ID0
Feature activation-9.62e-02

Pos logit contributions
els+0.155
ws+0.128
ps+0.126
cks+0.118
ishing+0.113
Neg logit contributions
 He-0.209
 She-0.199
 Int-0.195
 couldn-0.193
ete-0.191
Token the
Token ID262
Feature activation-5.73e-03

Pos logit contributions
els+0.206
ps+0.188
 colored+0.173
bones+0.169
cks+0.168
Neg logit contributions
 Int-0.207
 Everything-0.201
 Everywhere-0.195
ete-0.190
 Sadly-0.187
Token village
Token ID7404
Feature activation+9.81e-02

Pos logit contributions
 Int+0.010
 Curious+0.009
 Sadly+0.009
 Everything+0.009
 Everywhere+0.008
Neg logit contributions
els-0.011
ps-0.010
cks-0.009
 colored-0.009
ws-0.009
Token back
Token ID736
Feature activation+4.60e-02

Pos logit contributions
els+0.124
ps+0.096
 colored+0.090
 cream+0.084
ws+0.081
Neg logit contributions
 Int-0.238
 Curious-0.221
 Sadly-0.217
 Everything-0.216
 Everywhere-0.216
Token to
Token ID284
Feature activation+1.82e-02

Pos logit contributions
els+0.066
ps+0.054
 colored+0.051
 cream+0.048
bones+0.048
Neg logit contributions
 Int-0.101
 Curious-0.094
 Everywhere-0.094
 Everything-0.093
 Sadly-0.093
Token its
Token ID663
Feature activation+2.02e-02

Pos logit contributions
els+0.032
ps+0.026
 cream+0.025
 colored+0.024
ries+0.024
Neg logit contributions
 Int-0.035
 Everything-0.034
 Sadly-0.033
 Everywhere-0.032
 Curious-0.031
Token former
Token ID1966
Feature activation+1.66e-01

Pos logit contributions
els+0.038
ps+0.032
 colored+0.031
cks+0.030
 on+0.030
Neg logit contributions
 Int-0.036
 Sadly-0.034
 Curious-0.034
 Everywhere-0.033
 Everything-0.032
Token glory
Token ID13476
Feature activation+1.18e-01

Pos logit contributions
els+0.317
 on+0.299
 inside+0.281
ps+0.277
 sticker+0.275
Neg logit contributions
 Int-0.264
 Everywhere-0.258
 Sadly-0.252
 Curious-0.239
 Everything-0.237
Token!"
Token ID2474
Feature activation-1.19e-01

Pos logit contributions
els+0.127
ps+0.094
 colored+0.085
 cream+0.083
bones+0.082
Neg logit contributions
 Int-0.293
 Everywhere-0.279
 Curious-0.275
 Everything-0.275
 Sadly-0.268
Token
Token ID220
Feature activation-5.66e-02

Pos logit contributions
 Int+0.153
 Curious+0.131
rolling+0.121
 Sadly+0.119
 Everything+0.119
Neg logit contributions
els-0.271
.-0.266
 on-0.262
 inside-0.248
ps-0.240
Token
Token ID198
Feature activation-7.18e-02

Pos logit contributions
 Int+0.093
 Curious+0.085
Of+0.082
Da+0.078
Quick+0.076
Neg logit contributions
els-0.107
.-0.105
 on-0.099
 inside-0.098
ps-0.097
Token
Token ID198
Feature activation-8.37e-02

Pos logit contributions
 Int+0.133
 Curious+0.120
 Everything+0.116
ishly+0.113
 Everywhere+0.110
Neg logit contributions
.-0.116
 on-0.114
els-0.107
 sticker-0.101
 cream-0.100
Token,
Token ID11
Feature activation-2.06e-02

Pos logit contributions
 Int+0.026
 He+0.025
 So+0.025
 She+0.025
 Sadly+0.025
Neg logit contributions
els-0.016
ps-0.012
bones-0.012
 icing-0.012
 frost-0.012
Token maybe
Token ID3863
Feature activation-1.17e-02

Pos logit contributions
 Int+0.045
 Curious+0.041
 Sadly+0.040
 Everywhere+0.039
 Everything+0.039
Neg logit contributions
els-0.031
 on-0.029
 inside-0.028
ps-0.026
.-0.026
Token we
Token ID356
Feature activation-6.26e-03

Pos logit contributions
 Int+0.012
 Curious+0.010
 Everything+0.009
 couldn+0.009
 So+0.009
Neg logit contributions
ps-0.028
els-0.028
bones-0.026
ws-0.026
 channel-0.025
Token will
Token ID481
Feature activation+7.47e-03

Pos logit contributions
 Int+0.012
 Curious+0.011
 Sadly+0.011
 Everything+0.011
 Everywhere+0.010
Neg logit contributions
els-0.011
ps-0.010
 cream-0.009
 colored-0.009
 on-0.009
Token find
Token ID1064
Feature activation+6.84e-02

Pos logit contributions
els+0.017
ries+0.015
 cream+0.014
ps+0.014
bones+0.014
Neg logit contributions
 Int-0.011
 Curious-0.010
 Sadly-0.010
 Everywhere-0.009
 Everything-0.009
Token gold
Token ID3869
Feature activation+1.66e-01

Pos logit contributions
els+0.120
ge+0.105
 colored+0.103
ater+0.102
ps+0.101
Neg logit contributions
 Int-0.115
 Everything-0.101
 Curious-0.098
 Everywhere-0.095
Of-0.093
Token and
Token ID290
Feature activation+6.82e-03

Pos logit contributions
els+0.208
ps+0.192
bones+0.178
 colored+0.176
cks+0.172
Neg logit contributions
 Int-0.303
 He-0.293
ete-0.288
 So-0.279
 Everywhere-0.277
Token jewels
Token ID42701
Feature activation+1.60e-01

Pos logit contributions
els+0.009
ps+0.008
ier+0.007
ishing+0.007
ries+0.007
Neg logit contributions
 Int-0.015
 Curious-0.014
 Everywhere-0.014
 Sadly-0.013
 Everything-0.013
Token and
Token ID290
Feature activation+8.31e-03

Pos logit contributions
els+0.246
ps+0.194
bones+0.193
cks+0.184
 colored+0.182
Neg logit contributions
 Int-0.307
 She-0.288
Of-0.285
 He-0.277
 Curious-0.276
Token pear
Token ID25286
Feature activation+1.46e-01

Pos logit contributions
els+0.011
ps+0.009
ishing+0.008
ier+0.008
ries+0.008
Neg logit contributions
 Int-0.019
 Curious-0.017
 Everywhere-0.017
 Everything-0.017
 Sadly-0.017
Tokenls
Token ID7278
Feature activation+1.14e-01

Pos logit contributions
els+0.244
ps+0.243
 colored+0.221
epad+0.214
 channel+0.213
Neg logit contributions
 Int-0.201
 He-0.190
 She-0.180
 Exc-0.179
 Everywhere-0.173
Token Maybe
Token ID6674
Feature activation+5.20e-04

Pos logit contributions
 Int+0.129
 Curious+0.113
rolling+0.112
ete+0.101
 Sadly+0.100
Neg logit contributions
els-0.167
.-0.146
ps-0.146
 on-0.146
 inside-0.142
Token the
Token ID262
Feature activation-1.49e-02

Pos logit contributions
els+0.001
ps+0.001
bones+0.001
cks+0.001
ws+0.001
Neg logit contributions
 Int-0.001
 Everything-0.001
Of-0.001
 Curious-0.001
 Sadly-0.001
Token birds
Token ID10087
Feature activation+7.78e-02

Pos logit contributions
 Int+0.031
 Curious+0.029
 Sadly+0.029
 Everything+0.028
 Everywhere+0.028
Neg logit contributions
els-0.023
ps-0.018
cks-0.018
bones-0.018
 colored-0.017
Token can
Token ID460
Feature activation+1.64e-02

Pos logit contributions
els+0.143
 colored+0.120
ps+0.115
 cream+0.114
 coloured+0.111
Neg logit contributions
 Int-0.140
 Curious-0.125
 Sadly-0.123
 Everything-0.123
 Everywhere-0.117
Token still
Token ID991
Feature activation+5.68e-02

Pos logit contributions
els+0.031
ps+0.025
 cream+0.025
 colored+0.025
ries+0.025
Neg logit contributions
 Int-0.030
 Sadly-0.027
 Curious-0.027
 Everything-0.026
 Everywhere-0.026
Token hatch
Token ID25834
Feature activation+1.66e-01

Pos logit contributions
els+0.097
ps+0.078
 colored+0.076
 cream+0.076
bones+0.073
Neg logit contributions
 Int-0.107
 Curious-0.096
 Everything-0.095
 Sadly-0.094
 Everywhere-0.094
Token them
Token ID606
Feature activation+8.14e-02

Pos logit contributions
els+0.204
 colored+0.175
bones+0.174
ps+0.167
cks+0.160
Neg logit contributions
 Int-0.328
 Everywhere-0.306
 She-0.305
 Curious-0.304
 He-0.303
Token.
Token ID13
Feature activation-8.10e-02

Pos logit contributions
els+0.100
bones+0.073
 colored+0.073
ps+0.072
 cream+0.068
Neg logit contributions
 Int-0.195
 Everywhere-0.180
 Sadly-0.180
 Curious-0.179
 Everything-0.174
Token But
Token ID887
Feature activation-4.08e-02

Pos logit contributions
 Int+0.125
rolling+0.120
ete+0.110
 Curious+0.109
mber+0.103
Neg logit contributions
els-0.149
.-0.146
."-0.140
 on-0.140
 inside-0.137
Token you
Token ID345
Feature activation-1.57e-03

Pos logit contributions
 Int+0.065
 Everything+0.057
 Curious+0.056
 Sadly+0.055
Of+0.054
Neg logit contributions
els-0.087
ps-0.075
 colored-0.073
bones-0.071
cks-0.070
Token have
Token ID423
Feature activation+1.86e-02

Pos logit contributions
 Int+0.003
 Curious+0.003
 Everything+0.003
 Everywhere+0.003
 Sadly+0.003
Neg logit contributions
els-0.003
 on-0.002
ps-0.002
 inside-0.002
 colored-0.002
Token a
Token ID257
Feature activation+2.72e-02

Pos logit contributions
els+0.067
 on+0.057
 inside+0.056
ps+0.055
 colored+0.053
Neg logit contributions
 Int-0.117
 Curious-0.107
 Sadly-0.104
 Everything-0.103
 Everywhere-0.102
Token treasure
Token ID14068
Feature activation+1.31e-01

Pos logit contributions
els+0.049
ps+0.041
 on+0.038
 colored+0.038
 inside+0.037
Neg logit contributions
 Int-0.055
 Curious-0.050
 Sadly-0.048
 Everything-0.047
 Everywhere-0.047
Token chest
Token ID7721
Feature activation+1.18e-01

Pos logit contributions
els+0.233
ps+0.195
ws+0.192
cks+0.178
ries+0.177
Neg logit contributions
 Int-0.229
 Curious-0.209
ete-0.208
 Sadly-0.199
 Everywhere-0.195
Token full
Token ID1336
Feature activation+6.58e-02

Pos logit contributions
els+0.211
ws+0.169
ps+0.168
ries+0.160
cks+0.154
Neg logit contributions
 Int-0.219
 Curious-0.205
 Sadly-0.199
 couldn-0.198
ete-0.195
Token of
Token ID286
Feature activation+5.59e-02

Pos logit contributions
els+0.122
ps+0.105
ws+0.097
cks+0.096
 colored+0.095
Neg logit contributions
 Int-0.123
 Curious-0.112
 Everywhere-0.111
 Everything-0.109
ete-0.106
Token gold
Token ID3869
Feature activation+1.66e-01

Pos logit contributions
els+0.085
ps+0.072
cks+0.064
ws+0.063
 cutter+0.061
Neg logit contributions
 Int-0.119
 Curious-0.112
 Everything-0.108
 Everywhere-0.106
 Sadly-0.103
Token and
Token ID290
Feature activation-1.11e-02

Pos logit contributions
els+0.239
ws+0.214
ps+0.211
cks+0.208
bones+0.198
Neg logit contributions
ete-0.269
 Int-0.264
 He-0.254
 couldn-0.242
 Curious-0.239
Token jewels
Token ID42701
Feature activation+1.49e-01

Pos logit contributions
 Int+0.026
 Curious+0.025
 Sadly+0.023
 Everywhere+0.023
 Everything+0.023
Neg logit contributions
els-0.016
ps-0.013
cks-0.011
ws-0.011
 on-0.011
Token!
Token ID0
Feature activation-1.12e-01

Pos logit contributions
els+0.204
ws+0.164
ps+0.142
cks+0.140
ries+0.129
Neg logit contributions
 Int-0.316
 couldn-0.304
 Curious-0.297
ete-0.287
rolling-0.283
Token Tim
Token ID5045
Feature activation-3.13e-02

Pos logit contributions
 Int+0.210
mber+0.207
ete+0.199
Da+0.193
 bust+0.190
Neg logit contributions
.-0.177
 inside-0.170
 on-0.161
els-0.154
."-0.144
Tokenmy
Token ID1820
Feature activation-1.80e-02

Pos logit contributions
 Int+0.069
 Curious+0.062
 Everything+0.061
 Sadly+0.060
 Everywhere+0.060
Neg logit contributions
els-0.050
 on-0.042
ps-0.042
 colored-0.041
 inside-0.041
Token instead
Token ID2427
Feature activation+3.39e-02

Pos logit contributions
els+0.072
ps+0.054
 colored+0.052
 cream+0.050
 inside+0.048
Neg logit contributions
 Int-0.155
 Curious-0.144
 Everything-0.140
 Everywhere-0.140
 Sadly-0.140
Token of
Token ID286
Feature activation+9.48e-03

Pos logit contributions
els+0.033
ps+0.022
 colored+0.021
 cream+0.020
ws+0.019
Neg logit contributions
 Int-0.092
 Curious-0.085
 Everything-0.085
 Everywhere-0.085
 Sadly-0.084
Token the
Token ID262
Feature activation-2.56e-02

Pos logit contributions
els+0.015
ier+0.012
ishing+0.012
 inside+0.011
 colored+0.011
Neg logit contributions
 Int-0.018
 Everything-0.016
 Everywhere-0.016
 Curious-0.015
 ï¿½-0.015
Token sharp
Token ID7786
Feature activation+6.68e-02

Pos logit contributions
 Int+0.050
 Curious+0.048
 Everything+0.046
 Sadly+0.045
 Everywhere+0.045
Neg logit contributions
els-0.043
ps-0.034
ries-0.033
cks-0.033
 colored-0.032
Token rock
Token ID3881
Feature activation+7.56e-02

Pos logit contributions
els+0.096
ries+0.076
cks+0.076
ps+0.072
 frost+0.071
Neg logit contributions
 Int-0.130
 Everything-0.124
 Curious-0.123
 He-0.122
 Everywhere-0.118
Token."
Token ID526
Feature activation-1.45e-01

Pos logit contributions
els+0.074
ps+0.050
 colored+0.049
 cream+0.045
 sticker+0.043
Neg logit contributions
 Int-0.200
 Curious-0.186
 Sadly-0.184
 Everywhere-0.182
 Everything-0.182
Token
Token ID198
Feature activation-9.48e-02

Pos logit contributions
 Int+0.238
 Curious+0.207
rolling+0.198
 Everything+0.198
Sadly+0.194
Neg logit contributions
.-0.277
els-0.260
 on-0.258
 inside-0.240
."-0.231
Token
Token ID198
Feature activation-1.00e-01

Pos logit contributions
 Int+0.174
 Curious+0.157
 Everything+0.151
ishly+0.149
 Sadly+0.145
Neg logit contributions
.-0.160
els-0.151
 on-0.142
 inside-0.135
aks-0.130
TokenSo
Token ID2396
Feature activation-5.59e-02

Pos logit contributions
 Int+0.179
 Curious+0.159
 Everything+0.159
 Everywhere+0.156
 Sadly+0.152
Neg logit contributions
els-0.161
.-0.156
 on-0.147
 cream-0.146
aks-0.143
Token the
Token ID262
Feature activation-6.50e-02

Pos logit contributions
 Int+0.138
 Curious+0.126
 Everything+0.122
 Sadly+0.122
 Everywhere+0.121
Neg logit contributions
els-0.072
 on-0.061
 inside-0.058
.-0.056
ps-0.056
Token duck
Token ID22045
Feature activation-7.15e-03

Pos logit contributions
 Int+0.125
 Curious+0.113
 Everything+0.109
 Sadly+0.108
 Everywhere+0.106
Neg logit contributions
els-0.119
ps-0.101
 on-0.100
 inside-0.098
 colored-0.098
Token much
Token ID881
Feature activation+2.81e-02

Pos logit contributions
 Int+0.163
 Curious+0.144
 Everything+0.141
 Sadly+0.140
 Everywhere+0.138
Neg logit contributions
els-0.140
 on-0.135
 inside-0.130
.-0.129
 cream-0.125
Token fun
Token ID1257
Feature activation+4.10e-02

Pos logit contributions
els+0.057
ps+0.050
cks+0.047
ws+0.045
bones+0.042
Neg logit contributions
 Int-0.052
 Curious-0.048
Of-0.046
 Sadly-0.045
 Everything-0.045
Token playing
Token ID2712
Feature activation+5.96e-02

Pos logit contributions
els+0.051
ps+0.038
ws+0.034
cks+0.032
 sticker+0.031
Neg logit contributions
 Int-0.100
 Curious-0.093
 Everything-0.092
 Sadly-0.091
 Everywhere-0.090
Token with
Token ID351
Feature activation+1.19e-02

Pos logit contributions
els+0.077
ps+0.056
ries+0.051
 colored+0.051
 sticker+0.049
Neg logit contributions
 Int-0.147
 Curious-0.135
 Everything-0.133
 Sadly-0.132
 Everywhere-0.131
Token it
Token ID340
Feature activation+2.80e-02

Pos logit contributions
els+0.019
ps+0.015
 on+0.015
 inside+0.014
 colored+0.014
Neg logit contributions
 Int-0.024
 Everything-0.022
 Curious-0.022
 Everywhere-0.022
 Sadly-0.022
Token."
Token ID526
Feature activation-1.45e-01

Pos logit contributions
els+0.025
ps+0.017
 colored+0.014
bones+0.013
cks+0.013
Neg logit contributions
 Int-0.079
 Curious-0.074
 Sadly-0.073
 Everywhere-0.073
 Everything-0.072
Token
Token ID220
Feature activation-7.92e-02

Pos logit contributions
 Int+0.247
 Curious+0.215
rolling+0.204
 Everything+0.202
Sadly+0.201
Neg logit contributions
.-0.281
 on-0.257
els-0.252
 inside-0.235
."-0.233
Token
Token ID198
Feature activation-9.80e-02

Pos logit contributions
 Int+0.140
 Curious+0.128
Of+0.118
Quick+0.117
 Sadly+0.116
Neg logit contributions
els-0.143
.-0.142
 on-0.134
 inside-0.130
ps-0.121
Token
Token ID198
Feature activation-1.05e-01

Pos logit contributions
 Int+0.185
 Curious+0.167
 Everything+0.162
rolling+0.156
 Sadly+0.156
Neg logit contributions
.-0.163
els-0.146
 on-0.142
."-0.133
 inside-0.129
TokenL
Token ID43
Feature activation-4.47e-02

Pos logit contributions
 Int+0.184
 Everything+0.164
 Curious+0.163
 Everywhere+0.157
 Sadly+0.152
Neg logit contributions
.-0.171
els-0.170
 cream-0.160
 on-0.159
 colored-0.158
Tokenily
Token ID813
Feature activation-5.20e-02

Pos logit contributions
 Int+0.085
 Curious+0.076
 Everything+0.074
 Sadly+0.073
 Everywhere+0.072
Neg logit contributions
els-0.082
 on-0.071
ps-0.070
 inside-0.069
 colored-0.069
Token knee
Token ID10329
Feature activation+6.28e-02

Pos logit contributions
 Int+0.007
 Curious+0.007
 Everywhere+0.007
 Everything+0.006
 Sadly+0.006
Neg logit contributions
els-0.005
ps-0.004
 colored-0.004
 inside-0.004
 on-0.004
Token for
Token ID329
Feature activation+1.28e-02

Pos logit contributions
els+0.064
ps+0.043
 colored+0.041
cks+0.037
bones+0.037
Neg logit contributions
 Int-0.171
 Curious-0.159
 Sadly-0.156
 Everything-0.155
 Everywhere-0.155
Token a
Token ID257
Feature activation+1.02e-02

Pos logit contributions
els+0.026
 colored+0.021
 inside+0.020
ishing+0.020
ps+0.020
Neg logit contributions
 Int-0.022
 Everything-0.019
 Curious-0.019
 Everywhere-0.018
 Sadly-0.018
Token little
Token ID1310
Feature activation+3.58e-03

Pos logit contributions
els+0.019
ps+0.016
 colored+0.015
ries+0.014
cks+0.014
Neg logit contributions
 Int-0.021
 Curious-0.019
 Sadly-0.018
 Everywhere-0.018
 Everything-0.018
Token while
Token ID981
Feature activation-4.53e-03

Pos logit contributions
els+0.007
ps+0.006
cks+0.006
bones+0.005
ries+0.005
Neg logit contributions
 Int-0.007
 Curious-0.007
 Sadly-0.006
Of-0.006
 Exc-0.006
Token."
Token ID526
Feature activation-1.45e-01

Pos logit contributions
 Int+0.009
 Curious+0.008
 Everything+0.008
 Sadly+0.008
 Everywhere+0.008
Neg logit contributions
els-0.008
ps-0.007
 colored-0.006
 cream-0.006
ws-0.006
Token Mom
Token ID11254
Feature activation-5.86e-02

Pos logit contributions
 Int+0.243
 Curious+0.211
 Everything+0.204
rolling+0.203
Sadly+0.202
Neg logit contributions
.-0.276
 on-0.252
els-0.247
 inside-0.234
."-0.229
Tokenmy
Token ID1820
Feature activation-4.20e-02

Pos logit contributions
 Int+0.115
 Curious+0.105
 Everywhere+0.102
 Sadly+0.101
 Everything+0.100
Neg logit contributions
els-0.103
 on-0.090
 colored-0.088
 inside-0.088
ps-0.088
Token said
Token ID531
Feature activation-2.79e-02

Pos logit contributions
 Int+0.080
 Curious+0.072
 Everywhere+0.072
 Sadly+0.071
 Everything+0.070
Neg logit contributions
els-0.076
 on-0.067
 colored-0.067
ps-0.066
.-0.065
Token.
Token ID13
Feature activation-1.32e-01

Pos logit contributions
 Int+0.076
 Curious+0.071
 Everything+0.069
 Sadly+0.069
 Everywhere+0.067
Neg logit contributions
els-0.031
 colored-0.021
ps-0.021
ws-0.019
cks-0.019
Token
Token ID198
Feature activation-9.27e-02

Pos logit contributions
 Int+0.210
Sadly+0.186
ete+0.184
 Curious+0.182
 Sadly+0.178
Neg logit contributions
.-0.244
 on-0.227
els-0.223
 inside-0.217
."-0.205
Token store
Token ID3650
Feature activation+4.86e-02

Pos logit contributions
 Int+0.101
 Curious+0.093
 Sadly+0.089
 Everything+0.089
 Everywhere+0.088
Neg logit contributions
els-0.089
ps-0.074
 colored-0.071
 on-0.071
 inside-0.070
Token to
Token ID284
Feature activation-3.55e-02

Pos logit contributions
els+0.048
ps+0.035
 on+0.033
 colored+0.032
 inside+0.032
Neg logit contributions
 Int-0.135
 Curious-0.125
 Sadly-0.122
 Everything-0.122
 Everywhere-0.121
Token buy
Token ID2822
Feature activation+7.11e-02

Pos logit contributions
 Int+0.074
 Curious+0.068
 Sadly+0.065
 Everything+0.065
 Everywhere+0.064
Neg logit contributions
els-0.058
 on-0.049
ps-0.049
 inside-0.049
 colored-0.048
Token some
Token ID617
Feature activation+6.36e-02

Pos logit contributions
els+0.109
ps+0.095
cks+0.091
ws+0.085
 colored+0.082
Neg logit contributions
 Int-0.147
 Everything-0.139
 Curious-0.134
 Everywhere-0.134
 Sadly-0.129
Token food
Token ID2057
Feature activation+1.01e-01

Pos logit contributions
els+0.089
cks+0.082
ps+0.081
ws+0.072
bones+0.069
Neg logit contributions
 Int-0.134
 Everywhere-0.127
 Everything-0.125
 Curious-0.124
Of-0.122
Token."
Token ID526
Feature activation-1.45e-01

Pos logit contributions
els+0.107
ps+0.081
cks+0.071
 colored+0.070
ws+0.065
Neg logit contributions
 Int-0.255
 Everywhere-0.240
 Everything-0.238
 Curious-0.237
 Sadly-0.237
Token
Token ID198
Feature activation-9.07e-02

Pos logit contributions
 Int+0.245
 Curious+0.213
rolling+0.211
Sadly+0.207
 Everything+0.205
Neg logit contributions
.-0.274
 on-0.246
els-0.241
 inside-0.227
."-0.222
Token
Token ID198
Feature activation-9.89e-02

Pos logit contributions
 Int+0.171
 Curious+0.153
 Everything+0.149
ishly+0.148
rolling+0.146
Neg logit contributions
.-0.153
els-0.139
 on-0.136
 inside-0.126
 cream-0.123
TokenL
Token ID43
Feature activation-4.00e-02

Pos logit contributions
 Int+0.175
 Everything+0.158
 Curious+0.155
 Everywhere+0.153
 Sadly+0.150
Neg logit contributions
els-0.161
.-0.157
 cream-0.155
 colored-0.148
 on-0.148
Tokenily
Token ID813
Feature activation-4.90e-02

Pos logit contributions
 Int+0.075
 Curious+0.067
 Everything+0.065
 Sadly+0.064
 Everywhere+0.064
Neg logit contributions
els-0.074
 on-0.064
ps-0.063
 inside-0.062
 colored-0.062
Token replied
Token ID8712
Feature activation-2.59e-02

Pos logit contributions
 Int+0.102
 Curious+0.091
 Everywhere+0.090
 Everything+0.090
Of+0.087
Neg logit contributions
els-0.079
 colored-0.071
.-0.070
 inside-0.069
 on-0.068
Token scared
Token ID12008
Feature activation+2.58e-02

Pos logit contributions
els+0.008
ps+0.007
 colored+0.006
 on+0.006
 cream+0.006
Neg logit contributions
 Int-0.008
 Curious-0.007
 Everything-0.007
 Sadly-0.007
 Everywhere-0.007
Token of
Token ID286
Feature activation+2.74e-03

Pos logit contributions
els+0.019
 on+0.013
ps+0.013
 inside+0.013
.+0.012
Neg logit contributions
 Int-0.078
 Curious-0.073
 Sadly-0.071
 Everything-0.071
 Everywhere-0.070
Token a
Token ID257
Feature activation+4.31e-03

Pos logit contributions
els+0.004
 colored+0.003
ps+0.003
 sticker+0.003
 on+0.003
Neg logit contributions
 Int-0.006
 Everything-0.005
 Curious-0.005
 Everywhere-0.005
 Sadly-0.005
Token fake
Token ID8390
Feature activation+5.31e-02

Pos logit contributions
els+0.008
ps+0.007
 colored+0.006
cks+0.006
ries+0.006
Neg logit contributions
 Int-0.008
 Curious-0.008
 Sadly-0.007
 Everything-0.007
 Everywhere-0.007
Token wolf
Token ID17481
Feature activation+5.68e-02

Pos logit contributions
els+0.087
ps+0.070
cks+0.069
ws+0.066
 colored+0.065
Neg logit contributions
 Int-0.107
 Curious-0.100
 Everything-0.096
 Sadly-0.096
 Everywhere-0.095
Token."
Token ID526
Feature activation-1.45e-01

Pos logit contributions
els+0.064
ps+0.044
 colored+0.042
 cream+0.041
ries+0.038
Neg logit contributions
 Int-0.149
 Curious-0.139
 Sadly-0.135
 Everything-0.134
 Everywhere-0.131
Token
Token ID198
Feature activation-1.00e-01

Pos logit contributions
 Int+0.232
 Curious+0.200
rolling+0.195
 Everything+0.194
 Everywhere+0.192
Neg logit contributions
.-0.284
 on-0.263
els-0.256
 inside-0.244
."-0.237
Token
Token ID198
Feature activation-1.06e-01

Pos logit contributions
 Int+0.179
 Curious+0.160
 Everything+0.158
ishly+0.152
rolling+0.151
Neg logit contributions
.-0.173
els-0.162
 on-0.154
 inside-0.146
."-0.141
TokenL
Token ID43
Feature activation-4.04e-02

Pos logit contributions
 Int+0.185
 Everything+0.170
 Everywhere+0.164
 Curious+0.163
 Sadly+0.159
Neg logit contributions
els-0.169
.-0.168
 on-0.154
 cream-0.154
aks-0.150
Tokenily
Token ID813
Feature activation-5.13e-02

Pos logit contributions
 Int+0.080
 Curious+0.072
 Everything+0.070
 Sadly+0.069
 Everywhere+0.068
Neg logit contributions
els-0.072
 on-0.061
ps-0.061
 inside-0.060
 colored-0.060
Token had
Token ID550
Feature activation-1.90e-02

Pos logit contributions
 Int+0.105
 Everywhere+0.094
 Everything+0.093
 Curious+0.093
 Sadly+0.091
Neg logit contributions
els-0.083
.-0.074
 colored-0.074
 on-0.073
ps-0.073
Token needs
Token ID2476
Feature activation+4.94e-02

Pos logit contributions
 Int+0.052
 Everything+0.049
 Curious+0.048
 Sadly+0.048
 Everywhere+0.047
Neg logit contributions
els-0.036
ps-0.030
cks-0.028
 inside-0.026
ries-0.026
Token to
Token ID284
Feature activation-3.75e-03

Pos logit contributions
els+0.083
ps+0.067
cks+0.065
 colored+0.064
bones+0.062
Neg logit contributions
 Int-0.102
 Everything-0.093
 Curious-0.092
 Everywhere-0.092
 Sadly-0.091
Token be
Token ID307
Feature activation+3.84e-02

Pos logit contributions
 Int+0.007
 Everything+0.006
 Sadly+0.006
 Curious+0.006
 Everywhere+0.006
Neg logit contributions
els-0.007
ps-0.006
 on-0.006
 colored-0.006
 inside-0.006
Token done
Token ID1760
Feature activation+4.16e-02

Pos logit contributions
els+0.074
ries+0.057
ps+0.057
bones+0.056
 sticker+0.055
Neg logit contributions
 Int-0.066
 Sadly-0.061
 Curious-0.060
 Everything-0.059
 Everywhere-0.055
Token today
Token ID1909
Feature activation+6.88e-02

Pos logit contributions
els+0.054
ries+0.038
cks+0.038
ps+0.038
 colored+0.037
Neg logit contributions
 Int-0.096
 Sadly-0.092
 Everything-0.091
 Curious-0.089
 Everywhere-0.089
Token."
Token ID526
Feature activation-1.45e-01

Pos logit contributions
els+0.081
ps+0.061
 colored+0.057
cks+0.054
ws+0.053
Neg logit contributions
 Int-0.175
 Sadly-0.163
 Curious-0.163
 Everywhere-0.162
 Everything-0.161
Token
Token ID198
Feature activation-9.25e-02

Pos logit contributions
 Int+0.239
rolling+0.208
 Curious+0.208
 Everything+0.197
 Everywhere+0.195
Neg logit contributions
.-0.274
 on-0.246
els-0.242
 inside-0.229
."-0.222
Token
Token ID198
Feature activation-1.02e-01

Pos logit contributions
 Int+0.171
 Curious+0.153
 Everything+0.147
ishly+0.145
rolling+0.144
Neg logit contributions
.-0.158
els-0.147
 on-0.140
 inside-0.132
."-0.125
TokenCl
Token ID2601
Feature activation-4.64e-02

Pos logit contributions
 Int+0.176
 Curious+0.155
 Everything+0.155
 Everywhere+0.151
 Sadly+0.143
Neg logit contributions
els-0.175
.-0.169
 cream-0.159
 on-0.159
aks-0.153
Tokenara
Token ID3301
Feature activation-5.27e-02

Pos logit contributions
 Int+0.082
 Curious+0.073
 Sadly+0.070
 Everything+0.070
 Everywhere+0.069
Neg logit contributions
els-0.093
ps-0.080
 on-0.080
 inside-0.079
 colored-0.078
Token accepted
Token ID6292
Feature activation+1.04e-02

Pos logit contributions
 Int+0.109
 Everywhere+0.099
 Curious+0.098
 Everything+0.096
Of+0.095
Neg logit contributions
els-0.086
.-0.075
 on-0.075
ps-0.075
 inside-0.075
Token wanted
Token ID2227
Feature activation-1.25e-02

Pos logit contributions
 Int+0.199
 Curious+0.173
ete+0.167
 Everything+0.162
 Everywhere+0.161
Neg logit contributions
els-0.257
.-0.252
 on-0.249
 inside-0.243
 colored-0.230
Token to
Token ID284
Feature activation-3.79e-02

Pos logit contributions
 Int+0.028
 Curious+0.026
 Everything+0.025
 Sadly+0.025
 Everywhere+0.024
Neg logit contributions
els-0.019
ps-0.016
 on-0.016
 inside-0.015
 colored-0.015
Token make
Token ID787
Feature activation+3.47e-02

Pos logit contributions
 Int+0.079
 Curious+0.071
 Sadly+0.069
 Everything+0.069
 Everywhere+0.068
Neg logit contributions
els-0.064
ps-0.053
 on-0.053
 inside-0.052
 colored-0.052
Token new
Token ID649
Feature activation+5.35e-02

Pos logit contributions
els+0.050
ps+0.041
 colored+0.040
 on+0.040
 inside+0.039
Neg logit contributions
 Int-0.078
 Curious-0.070
 Everything-0.070
 Sadly-0.068
 Everywhere-0.067
Token friends
Token ID2460
Feature activation+4.57e-02

Pos logit contributions
els+0.076
ps+0.065
ws+0.060
cks+0.059
 on+0.059
Neg logit contributions
 Int-0.118
 Curious-0.109
 Everything-0.105
 Everywhere-0.102
 Sadly-0.101
Token."
Token ID526
Feature activation-1.45e-01

Pos logit contributions
els+0.049
ps+0.036
ws+0.031
 colored+0.029
bones+0.029
Neg logit contributions
 Int-0.122
 Curious-0.114
 Everything-0.110
 Everywhere-0.107
Of-0.105
Token
Token ID220
Feature activation-7.67e-02

Pos logit contributions
 Int+0.242
 Curious+0.210
Sadly+0.202
 Everything+0.200
rolling+0.199
Neg logit contributions
.-0.285
 on-0.258
els-0.256
 inside-0.238
."-0.233
Token
Token ID198
Feature activation-8.81e-02

Pos logit contributions
 Int+0.133
 Curious+0.121
 Sadly+0.111
Of+0.110
 Everything+0.108
Neg logit contributions
els-0.146
.-0.143
 on-0.134
 inside-0.128
ps-0.121
Token
Token ID198
Feature activation-9.93e-02

Pos logit contributions
 Int+0.157
 Curious+0.141
 Everything+0.138
 Sadly+0.134
rolling+0.133
Neg logit contributions
.-0.154
els-0.145
 on-0.140
 cream-0.129
 colored-0.127
TokenL
Token ID43
Feature activation-4.72e-02

Pos logit contributions
 Int+0.162
 Everything+0.142
 Curious+0.141
 Sadly+0.136
 Everywhere+0.135
Neg logit contributions
els-0.179
.-0.175
 cream-0.167
 colored-0.167
 on-0.165
Tokenily
Token ID813
Feature activation-5.48e-02

Pos logit contributions
 Int+0.089
 Curious+0.079
 Everything+0.077
 Sadly+0.076
 Everywhere+0.075
Neg logit contributions
els-0.088
 on-0.076
ps-0.075
 inside-0.074
 colored-0.073
Token a
Token ID257
Feature activation+7.97e-03

Pos logit contributions
els+0.036
ps+0.028
 colored+0.028
 inside+0.028
 on+0.027
Neg logit contributions
 Int-0.053
 Everything-0.049
 Curious-0.048
 Sadly-0.047
 Everywhere-0.047
Token cartoon
Token ID16251
Feature activation+9.81e-02

Pos logit contributions
els+0.012
ps+0.010
 colored+0.009
 on+0.009
 inside+0.009
Neg logit contributions
 Int-0.018
 Curious-0.016
 Sadly-0.016
 Everything-0.016
 Everywhere-0.015
Token with
Token ID351
Feature activation+1.23e-02

Pos logit contributions
els+0.120
ps+0.087
 colored+0.083
ries+0.076
cks+0.076
Neg logit contributions
 Int-0.246
 Curious-0.227
 Everything-0.225
 Sadly-0.220
 Everywhere-0.218
Token this
Token ID428
Feature activation+2.66e-02

Pos logit contributions
els+0.020
 inside+0.015
ps+0.015
 colored+0.015
 on+0.014
Neg logit contributions
 Int-0.025
 Everything-0.023
 Everywhere-0.023
 Curious-0.022
 Sadly-0.022
Token box
Token ID3091
Feature activation+8.08e-02

Pos logit contributions
els+0.035
ps+0.026
bones+0.026
cks+0.025
ries+0.024
Neg logit contributions
 Int-0.062
 Curious-0.058
 Everywhere-0.058
 Everything-0.056
 Sadly-0.056
Token."
Token ID526
Feature activation-1.45e-01

Pos logit contributions
els+0.094
ps+0.068
cks+0.062
 colored+0.061
bones+0.061
Neg logit contributions
 Int-0.197
 Everywhere-0.185
 Curious-0.184
 Everything-0.182
 Sadly-0.182
Token Tim
Token ID5045
Feature activation-6.58e-02

Pos logit contributions
 Int+0.241
 Curious+0.209
Sadly+0.204
rolling+0.203
 Everything+0.201
Neg logit contributions
.-0.277
 on-0.257
els-0.251
 inside-0.234
."-0.229
Token drew
Token ID9859
Feature activation+4.16e-02

Pos logit contributions
 Int+0.140
 Curious+0.127
 Everything+0.125
 Everywhere+0.124
 Sadly+0.123
Neg logit contributions
els-0.105
 colored-0.091
 on-0.089
.-0.089
ps-0.088
Token pictures
Token ID5986
Feature activation+8.12e-02

Pos logit contributions
els+0.072
ps+0.057
ws+0.056
ries+0.056
 colored+0.056
Neg logit contributions
 Int-0.083
 Curious-0.077
 Everything-0.075
 Everywhere-0.073
 Sadly-0.071
Token on
Token ID319
Feature activation+1.41e-02

Pos logit contributions
els+0.131
ps+0.109
ws+0.104
cks+0.102
 cream+0.100
Neg logit contributions
 Int-0.163
 Everywhere-0.153
 Everything-0.151
 Curious-0.148
 He-0.141
Token the
Token ID262
Feature activation-2.75e-02

Pos logit contributions
els+0.021
ps+0.019
ier+0.018
ws+0.018
 inside+0.018
Neg logit contributions
 Int-0.028
 Everywhere-0.027
 Curious-0.026
 Everything-0.026
 Exc-0.025
Token especially
Token ID2592
Feature activation+3.25e-03

Pos logit contributions
 Int+0.083
 Curious+0.073
 Sadly+0.073
 Everything+0.072
 Everywhere+0.070
Neg logit contributions
els-0.066
 on-0.063
 inside-0.060
.-0.060
 colored-0.057
Token when
Token ID618
Feature activation-4.96e-02

Pos logit contributions
els+0.005
ps+0.004
 sticker+0.004
ws+0.004
 frost+0.004
Neg logit contributions
 Int-0.007
 Curious-0.006
 Everywhere-0.006
 Everything-0.006
Of-0.006
Token things
Token ID1243
Feature activation+3.75e-02

Pos logit contributions
 Int+0.095
 Curious+0.086
 Everything+0.084
 Sadly+0.083
 Everywhere+0.082
Neg logit contributions
els-0.091
ps-0.077
 colored-0.074
 inside-0.073
 cream-0.073
Token seem
Token ID1283
Feature activation+4.61e-02

Pos logit contributions
els+0.063
ps+0.052
 on+0.052
 colored+0.051
 inside+0.051
Neg logit contributions
 Int-0.077
 Curious-0.070
 Everything-0.068
 Sadly-0.067
 Everywhere-0.067
Token scary
Token ID14343
Feature activation+8.31e-02

Pos logit contributions
els+0.080
ps+0.065
 sticker+0.061
bones+0.060
 colored+0.060
Neg logit contributions
 Int-0.093
 Curious-0.085
 Everywhere-0.082
 Everything-0.082
 Sadly-0.079
Token."
Token ID526
Feature activation-1.45e-01

Pos logit contributions
els+0.077
 sticker+0.053
 cream+0.050
ps+0.050
 colored+0.048
Neg logit contributions
 Int-0.228
 Curious-0.212
 Everything-0.212
 Everywhere-0.210
 Sadly-0.202
Token
Token ID220
Feature activation-7.68e-02

Pos logit contributions
 Int+0.236
 Curious+0.204
 Everything+0.194
rolling+0.192
 Everywhere+0.191
Neg logit contributions
.-0.282
 on-0.259
els-0.258
 inside-0.245
."-0.234
Token
Token ID198
Feature activation-9.46e-02

Pos logit contributions
 Int+0.132
 Curious+0.121
Of+0.114
Quick+0.109
 Sadly+0.108
Neg logit contributions
els-0.143
.-0.140
 on-0.133
 inside-0.129
ps-0.121
Token
Token ID198
Feature activation-1.04e-01

Pos logit contributions
 Int+0.172
 Curious+0.155
 Everything+0.150
 Sadly+0.145
rolling+0.145
Neg logit contributions
.-0.161
els-0.148
 on-0.142
 inside-0.133
."-0.131
TokenThe
Token ID464
Feature activation-6.29e-02

Pos logit contributions
 Int+0.176
 Curious+0.156
 Everything+0.155
 Everywhere+0.149
 Sadly+0.148
Neg logit contributions
.-0.178
els-0.177
 colored-0.166
 cream-0.165
 on-0.164
Token beetle
Token ID45489
Feature activation-6.41e-03

Pos logit contributions
 Int+0.123
 Curious+0.112
 Everything+0.107
 Sadly+0.107
 Everywhere+0.106
Neg logit contributions
els-0.113
ps-0.095
 on-0.094
 inside-0.093
 colored-0.093
Token to
Token ID284
Feature activation-2.49e-02

Pos logit contributions
els+0.087
ps+0.070
 colored+0.066
 on+0.064
 inside+0.063
Neg logit contributions
 Int-0.160
 Curious-0.148
 Everything-0.143
 Sadly-0.142
 Everywhere-0.142
Token share
Token ID2648
Feature activation+6.01e-02

Pos logit contributions
 Int+0.053
 Curious+0.049
 Sadly+0.047
 Everything+0.047
 Everywhere+0.046
Neg logit contributions
els-0.040
 on-0.034
ps-0.033
 inside-0.033
 colored-0.033
Token my
Token ID616
Feature activation+1.46e-02

Pos logit contributions
els+0.091
ps+0.086
ier+0.080
ws+0.078
ishing+0.076
Neg logit contributions
 Int-0.111
 Everywhere-0.103
 Curious-0.101
 She-0.101
 Everything-0.100
Token meal
Token ID9799
Feature activation+1.04e-01

Pos logit contributions
els+0.020
ps+0.017
 inside+0.016
 on+0.016
 colored+0.015
Neg logit contributions
 Int-0.033
 Curious-0.031
 Everywhere-0.029
 Everything-0.029
 Sadly-0.029
Token with
Token ID351
Feature activation+1.28e-02

Pos logit contributions
els+0.127
ps+0.102
 colored+0.090
ws+0.084
cks+0.084
Neg logit contributions
 Int-0.240
ete-0.232
 Everywhere-0.230
 Sadly-0.228
Of-0.225
Token."
Token ID526
Feature activation-1.45e-01

Pos logit contributions
els+0.019
ps+0.016
 inside+0.015
 colored+0.015
 on+0.015
Neg logit contributions
 Int-0.028
 Curious-0.026
 Everything-0.026
 Everywhere-0.025
 Sadly-0.025
Token
Token ID198
Feature activation-9.09e-02

Pos logit contributions
 Int+0.247
 Curious+0.215
rolling+0.212
Sadly+0.211
 Everything+0.208
Neg logit contributions
.-0.273
 on-0.250
els-0.245
 inside-0.229
."-0.224
TokenThe
Token ID464
Feature activation-6.62e-02

Pos logit contributions
 Int+0.170
 Curious+0.153
 Everything+0.150
 Sadly+0.145
rolling+0.145
Neg logit contributions
.-0.157
els-0.146
 on-0.140
 inside-0.129
 cream-0.127
Token kitten
Token ID42143
Feature activation-7.85e-03

Pos logit contributions
 Int+0.124
 Curious+0.114
 Everything+0.108
 Sadly+0.107
 Everywhere+0.106
Neg logit contributions
els-0.123
ps-0.104
 on-0.102
 colored-0.101
 inside-0.101
Token was
Token ID373
Feature activation-1.86e-02

Pos logit contributions
 Int+0.016
 Everywhere+0.014
 Everything+0.014
 Curious+0.014
 Sadly+0.013
Neg logit contributions
els-0.013
.-0.012
 on-0.012
 colored-0.012
 inside-0.012
Token so
Token ID523
Feature activation-3.90e-02

Pos logit contributions
 Int+0.033
 Curious+0.030
 Sadly+0.029
 Everything+0.028
 Everywhere+0.028
Neg logit contributions
els-0.036
ps-0.031
 on-0.031
 colored-0.030
 inside-0.030
Token it
Token ID340
Feature activation+2.41e-02

Pos logit contributions
els+0.132
ps+0.109
cks+0.104
 colored+0.101
ws+0.094
Neg logit contributions
 Int-0.195
 Everything-0.184
 Everywhere-0.182
 Curious-0.178
 Sadly-0.175
Token to
Token ID284
Feature activation-2.28e-02

Pos logit contributions
els+0.038
ps+0.030
 colored+0.029
cks+0.028
 cream+0.028
Neg logit contributions
 Int-0.051
 Curious-0.047
 Everywhere-0.046
 Everything-0.045
 Sadly-0.045
Token its
Token ID663
Feature activation-9.31e-03

Pos logit contributions
 Int+0.051
 Curious+0.047
 Everything+0.046
 Sadly+0.046
 Everywhere+0.045
Neg logit contributions
els-0.034
ps-0.027
 on-0.026
 colored-0.026
 inside-0.026
Token mom
Token ID1995
Feature activation+7.27e-03

Pos logit contributions
 Int+0.018
 Curious+0.016
 Sadly+0.016
 Everywhere+0.016
 Everything+0.016
Neg logit contributions
els-0.017
ps-0.014
 colored-0.013
cks-0.013
ries-0.013
Tokenmy
Token ID1820
Feature activation+2.41e-02

Pos logit contributions
els+0.009
ps+0.007
 colored+0.007
 cream+0.006
cks+0.006
Neg logit contributions
 Int-0.018
 Curious-0.016
 Everything-0.016
 Everywhere-0.016
 Sadly-0.016
Token."
Token ID526
Feature activation-1.44e-01

Pos logit contributions
els+0.027
ps+0.021
 colored+0.019
cks+0.018
 cream+0.018
Neg logit contributions
 Int-0.061
 Curious-0.057
 Everywhere-0.056
 Everything-0.056
 Sadly-0.056
Token
Token ID220
Feature activation-7.56e-02

Pos logit contributions
 Int+0.251
 Curious+0.220
 Everything+0.209
 Everywhere+0.207
rolling+0.206
Neg logit contributions
.-0.273
 on-0.247
els-0.247
 inside-0.232
."-0.224
Token
Token ID198
Feature activation-9.23e-02

Pos logit contributions
 Int+0.133
 Curious+0.121
Of+0.113
Quick+0.109
 Sadly+0.108
Neg logit contributions
els-0.142
.-0.138
 on-0.129
 inside-0.126
ps-0.119
Token
Token ID198
Feature activation-1.03e-01

Pos logit contributions
 Int+0.172
 Curious+0.157
 Everything+0.152
rolling+0.145
 Sadly+0.145
Neg logit contributions
.-0.151
els-0.142
 on-0.133
 cream-0.125
."-0.124
TokenTwe
Token ID32665
Feature activation-2.83e-02

Pos logit contributions
 Int+0.179
 Curious+0.161
 Everything+0.160
 Everywhere+0.155
 Sadly+0.149
Neg logit contributions
els-0.174
.-0.167
 cream-0.163
 colored-0.162
 on-0.158
Tokenety
Token ID2963
Feature activation-3.62e-02

Pos logit contributions
 Int+0.053
 Curious+0.047
 Everything+0.046
 Sadly+0.045
 Everywhere+0.045
Neg logit contributions
els-0.055
ps-0.046
 on-0.046
 inside-0.045
 colored-0.045
Token and
Token ID290
Feature activation-6.23e-02

Pos logit contributions
els+0.048
ps+0.032
 colored+0.032
 sticker+0.031
 cream+0.031
Neg logit contributions
 Int-0.135
 Curious-0.126
 Everything-0.125
 Everywhere-0.122
 Sadly-0.121
Token don
Token ID836
Feature activation-1.27e-02

Pos logit contributions
 Int+0.125
 Curious+0.111
 Everywhere+0.111
ete+0.110
 Sadly+0.109
Neg logit contributions
els-0.099
 on-0.095
 inside-0.094
.-0.093
ps-0.091
Token't
Token ID470
Feature activation-2.53e-02

Pos logit contributions
 Int+0.021
 Curious+0.019
Of+0.018
 Sadly+0.018
 Everything+0.017
Neg logit contributions
els-0.026
 on-0.025
.-0.024
ps-0.023
 inside-0.023
Token know
Token ID760
Feature activation+2.89e-02

Pos logit contributions
 Int+0.048
 Curious+0.043
 Everything+0.042
 Sadly+0.041
 Everywhere+0.040
Neg logit contributions
els-0.048
ps-0.040
 colored-0.039
 on-0.039
 cream-0.039
Token why
Token ID1521
Feature activation+2.25e-03

Pos logit contributions
els+0.047
ps+0.040
bones+0.040
 colored+0.039
 sticker+0.039
Neg logit contributions
 Int-0.055
 Everything-0.051
 Curious-0.048
 Everywhere-0.048
 Sadly-0.046
Token."
Token ID526
Feature activation-1.44e-01

Pos logit contributions
els+0.003
ps+0.003
 colored+0.003
 cream+0.003
 inside+0.003
Neg logit contributions
 Int-0.005
 Curious-0.005
 Everything-0.005
 Everywhere-0.004
 Sadly-0.004
Token
Token ID198
Feature activation-9.96e-02

Pos logit contributions
 Int+0.236
 Curious+0.206
rolling+0.199
 Everywhere+0.197
 Everything+0.195
Neg logit contributions
.-0.280
 on-0.252
els-0.251
 inside-0.237
."-0.230
Token
Token ID198
Feature activation-1.08e-01

Pos logit contributions
 Int+0.182
 Curious+0.165
 Everything+0.159
rolling+0.153
ishly+0.152
Neg logit contributions
.-0.171
els-0.161
 on-0.150
 inside-0.142
aks-0.136
TokenMom
Token ID29252
Feature activation-3.59e-02

Pos logit contributions
 Int+0.187
 Everything+0.167
 Curious+0.165
 Everywhere+0.164
 Sadly+0.154
Neg logit contributions
els-0.181
.-0.177
 on-0.163
 cream-0.163
aks-0.160
Token smiled
Token ID13541
Feature activation+2.01e-02

Pos logit contributions
 Int+0.074
 Curious+0.067
 Sadly+0.065
 Everywhere+0.065
 Everything+0.064
Neg logit contributions
els-0.061
 on-0.052
ps-0.052
 inside-0.051
 colored-0.051
Token and
Token ID290
Feature activation-6.01e-02

Pos logit contributions
els+0.024
ps+0.019
 colored+0.018
 cream+0.018
 on+0.017
Neg logit contributions
 Int-0.052
 Curious-0.048
 Everything-0.046
 Sadly-0.046
 Everywhere-0.046
Token longer
Token ID2392
Feature activation+1.75e-02

Pos logit contributions
 Int+0.054
 Curious+0.049
 Everything+0.045
 Sadly+0.045
 Everywhere+0.043
Neg logit contributions
els-0.070
ps-0.061
cks-0.057
ws-0.057
 colored-0.056
Token until
Token ID1566
Feature activation-5.24e-02

Pos logit contributions
els+0.026
ps+0.020
ries+0.019
cks+0.018
ws+0.018
Neg logit contributions
 Int-0.039
 Curious-0.037
 Everything-0.035
 Sadly-0.035
 Everywhere-0.034
Token they
Token ID484
Feature activation-4.11e-02

Pos logit contributions
 Int+0.106
 Curious+0.097
 Everything+0.096
 Sadly+0.094
 Everywhere+0.091
Neg logit contributions
els-0.091
ps-0.077
 colored-0.072
ws-0.070
cks-0.070
Token cool
Token ID3608
Feature activation+4.09e-02

Pos logit contributions
 Int+0.077
 Curious+0.069
 Everywhere+0.064
 Everything+0.064
 Sadly+0.063
Neg logit contributions
els-0.077
 inside-0.072
 on-0.072
.-0.068
ps-0.068
Token down
Token ID866
Feature activation+3.10e-02

Pos logit contributions
els+0.057
ps+0.043
 colored+0.041
 sticker+0.040
cks+0.040
Neg logit contributions
 Int-0.094
 Curious-0.087
 Everything-0.085
 Sadly-0.082
 Everywhere-0.082
Token."
Token ID526
Feature activation-1.44e-01

Pos logit contributions
els+0.045
ps+0.033
ws+0.033
cks+0.032
 colored+0.032
Neg logit contributions
 Int-0.069
 Everywhere-0.064
 Curious-0.064
 Everything-0.062
 Sadly-0.062
Token
Token ID198
Feature activation-9.59e-02

Pos logit contributions
 Int+0.235
 Curious+0.205
 Everything+0.191
rolling+0.191
Sadly+0.188
Neg logit contributions
.-0.276
els-0.257
 on-0.251
 inside-0.238
."-0.230
Token
Token ID198
Feature activation-1.03e-01

Pos logit contributions
 Int+0.183
 Curious+0.166
 Everything+0.158
ishly+0.153
 Sadly+0.152
Neg logit contributions
.-0.157
els-0.145
 on-0.138
 inside-0.131
aks-0.125
TokenBut
Token ID1537
Feature activation-7.24e-02

Pos logit contributions
 Int+0.183
 Curious+0.162
 Everything+0.160
 Everywhere+0.157
 Sadly+0.152
Neg logit contributions
els-0.172
.-0.167
 on-0.157
 cream-0.155
 colored-0.154
Token Lily
Token ID20037
Feature activation-4.46e-02

Pos logit contributions
 Int+0.147
 Curious+0.134
 Sadly+0.130
 Everything+0.130
 Everywhere+0.127
Neg logit contributions
els-0.123
ps-0.104
 colored-0.100
 on-0.098
 cream-0.098
Token didn
Token ID1422
Feature activation-1.35e-02

Pos logit contributions
 Int+0.080
 Curious+0.070
 Everywhere+0.069
 Everything+0.068
 Sadly+0.066
Neg logit contributions
els-0.086
 on-0.077
.-0.077
 inside-0.076
ps-0.075
Token's
Token ID338
Feature activation-1.03e-02

Pos logit contributions
 Int+0.006
 Curious+0.005
 Sadly+0.005
 Everything+0.005
 Everywhere+0.005
Neg logit contributions
els-0.007
ps-0.006
 on-0.006
 inside-0.006
 colored-0.006
Token raining
Token ID43079
Feature activation+8.50e-02

Pos logit contributions
 Int+0.020
 Curious+0.018
 Sadly+0.018
 Everything+0.018
 Everywhere+0.018
Neg logit contributions
els-0.018
ps-0.014
 colored-0.014
bones-0.014
cks-0.013
Token:
Token ID25
Feature activation-5.44e-02

Pos logit contributions
els+0.091
ps+0.064
 colored+0.059
 cream+0.057
 on+0.056
Neg logit contributions
 Int-0.229
 Curious-0.212
 Everywhere-0.209
 Everything-0.209
 Sadly-0.207
Token an
Token ID281
Feature activation+2.10e-02

Pos logit contributions
 Int+0.113
 Curious+0.102
 Sadly+0.099
 Everywhere+0.097
 Everything+0.097
Neg logit contributions
els-0.089
 on-0.079
 inside-0.075
ps-0.074
 colored-0.074
Token umbrella
Token ID25510
Feature activation+6.46e-02

Pos logit contributions
els+0.034
ps+0.029
 on+0.028
cks+0.027
 inside+0.026
Neg logit contributions
 Int-0.044
 Curious-0.041
 Everything-0.040
 Everywhere-0.040
 Sadly-0.039
Token!"
Token ID2474
Feature activation-1.44e-01

Pos logit contributions
els+0.081
ps+0.063
 colored+0.060
 on+0.060
 inside+0.059
Neg logit contributions
 Int-0.161
 Curious-0.149
 Everything-0.145
 Sadly-0.145
 Everywhere-0.143
Token F
Token ID376
Feature activation-5.44e-02

Pos logit contributions
 Int+0.247
 Curious+0.217
rolling+0.216
 Everywhere+0.212
 Everything+0.211
Neg logit contributions
.-0.274
 on-0.258
els-0.245
 inside-0.232
."-0.220
Tokenido
Token ID17305
Feature activation-4.63e-02

Pos logit contributions
 Int+0.107
 Curious+0.097
 Everything+0.095
 Sadly+0.093
Sadly+0.089
Neg logit contributions
els-0.089
.-0.083
 inside-0.078
ps-0.077
 on-0.076
Token was
Token ID373
Feature activation-2.61e-02

Pos logit contributions
 Int+0.096
 Everywhere+0.089
 Everything+0.087
 Sadly+0.087
Sadly+0.087
Neg logit contributions
.-0.068
els-0.067
 on-0.064
 inside-0.063
 colored-0.060
Token so
Token ID523
Feature activation-4.41e-02

Pos logit contributions
 Int+0.046
 Curious+0.041
 Everything+0.039
 Sadly+0.039
 Everywhere+0.039
Neg logit contributions
els-0.050
 on-0.046
 inside-0.045
ps-0.045
 colored-0.044
Token excited
Token ID6568
Feature activation+1.42e-02

Pos logit contributions
 Int+0.089
 Everything+0.078
 Curious+0.078
 Everywhere+0.078
 Sadly+0.076
Neg logit contributions
.-0.067
 on-0.067
els-0.067
 inside-0.065
 colored-0.061
Token deserve
Token ID10925
Feature activation+5.93e-02

Pos logit contributions
 Int+0.045
 Curious+0.041
 Everything+0.040
 Sadly+0.039
 Everywhere+0.038
Neg logit contributions
els-0.050
ps-0.040
ries-0.039
 colored-0.038
 cream-0.037
Token to
Token ID284
Feature activation-4.06e-03

Pos logit contributions
els+0.101
 colored+0.082
ps+0.077
bones+0.075
 sticker+0.075
Neg logit contributions
 Int-0.108
 Everything-0.103
 Everywhere-0.099
 Curious-0.095
ete-0.094
Token ride
Token ID6594
Feature activation+1.00e-01

Pos logit contributions
 Int+0.007
 Everything+0.006
 Curious+0.006
 Sadly+0.006
 Everywhere+0.006
Neg logit contributions
els-0.008
ps-0.006
 on-0.006
 colored-0.006
ries-0.006
Token on
Token ID319
Feature activation+3.22e-02

Pos logit contributions
els+0.118
ps+0.089
 colored+0.087
 sticker+0.079
 frost+0.078
Neg logit contributions
 Int-0.246
 Curious-0.229
 Everything-0.226
 Everywhere-0.226
 Sadly-0.220
Token me
Token ID502
Feature activation+3.49e-02

Pos logit contributions
els+0.046
ps+0.038
 colored+0.036
ws+0.035
 cream+0.034
Neg logit contributions
 Int-0.072
 Everywhere-0.067
 Curious-0.066
 Everything-0.065
 Sadly-0.064
Token."
Token ID526
Feature activation-1.44e-01

Pos logit contributions
els+0.031
ps+0.022
 colored+0.021
 cream+0.019
 inside+0.019
Neg logit contributions
 Int-0.100
 Curious-0.093
 Everywhere-0.090
 Sadly-0.090
 Everything-0.090
Token
Token ID198
Feature activation-1.01e-01

Pos logit contributions
 Int+0.249
 Curious+0.219
rolling+0.213
 Everything+0.211
 Everywhere+0.206
Neg logit contributions
.-0.272
 on-0.240
els-0.237
 inside-0.224
."-0.220
Token
Token ID198
Feature activation-1.09e-01

Pos logit contributions
 Int+0.193
 Curious+0.177
 Everything+0.170
ishly+0.168
rolling+0.164
Neg logit contributions
.-0.170
els-0.156
 on-0.147
 inside-0.140
aks-0.137
TokenTim
Token ID14967
Feature activation-5.27e-02

Pos logit contributions
 Int+0.205
 Everything+0.186
 Curious+0.185
 Everywhere+0.178
 Sadly+0.170
Neg logit contributions
els-0.172
.-0.165
 cream-0.157
aks-0.154
 on-0.153
Token and
Token ID290
Feature activation-7.12e-02

Pos logit contributions
 Int+0.110
 Curious+0.099
 Everywhere+0.098
 Everything+0.098
 Sadly+0.095
Neg logit contributions
els-0.086
 on-0.076
ps-0.075
.-0.074
 colored-0.074
Token Mia
Token ID32189
Feature activation-3.65e-02

Pos logit contributions
 Int+0.151
 Curious+0.135
 Sadly+0.131
 Everywhere+0.130
 Everything+0.130
Neg logit contributions
els-0.115
 on-0.101
 inside-0.099
ps-0.098
.-0.097
Token to
Token ID284
Feature activation-3.79e-02

Pos logit contributions
 Int+0.028
 Curious+0.026
 Everything+0.025
 Sadly+0.025
 Everywhere+0.024
Neg logit contributions
els-0.019
ps-0.016
 on-0.016
 inside-0.015
 colored-0.015
Token go
Token ID467
Feature activation+3.01e-02

Pos logit contributions
 Int+0.079
 Curious+0.071
 Sadly+0.069
 Everything+0.069
 Everywhere+0.068
Neg logit contributions
els-0.064
ps-0.053
 on-0.053
 inside-0.052
 colored-0.052
Token to
Token ID284
Feature activation-2.23e-02

Pos logit contributions
els+0.044
ps+0.034
 colored+0.033
 frost+0.030
ries+0.030
Neg logit contributions
 Int-0.071
 Curious-0.065
 Everything-0.062
 Everywhere-0.062
 Sadly-0.062
Token the
Token ID262
Feature activation-3.83e-02

Pos logit contributions
 Int+0.057
 Curious+0.053
 Sadly+0.051
 Everything+0.051
 Everywhere+0.050
Neg logit contributions
els-0.027
ps-0.021
 on-0.021
 inside-0.020
 colored-0.020
Token park
Token ID3952
Feature activation+5.62e-02

Pos logit contributions
 Int+0.075
 Curious+0.068
 Everything+0.065
 Sadly+0.065
 Everywhere+0.064
Neg logit contributions
els-0.069
ps-0.058
 on-0.057
 colored-0.057
 inside-0.057
Token."
Token ID526
Feature activation-1.44e-01

Pos logit contributions
els+0.053
ps+0.039
 colored+0.032
 on+0.029
 cream+0.029
Neg logit contributions
 Int-0.159
 Curious-0.149
 Sadly-0.145
 Everything-0.144
 Everywhere-0.143
Token
Token ID198
Feature activation-9.13e-02

Pos logit contributions
 Int+0.254
Sadly+0.227
rolling+0.226
 Curious+0.223
 bust+0.219
Neg logit contributions
.-0.261
 on-0.227
els-0.216
 inside-0.208
."-0.202
Token
Token ID198
Feature activation-1.01e-01

Pos logit contributions
 Int+0.173
 Curious+0.157
ishly+0.157
 Everything+0.155
rolling+0.150
Neg logit contributions
.-0.152
els-0.137
 on-0.135
 inside-0.125
aks-0.121
TokenDad
Token ID46270
Feature activation-3.64e-02

Pos logit contributions
 Int+0.183
 Everything+0.169
 Curious+0.164
 Everywhere+0.164
 Sadly+0.161
Neg logit contributions
els-0.161
.-0.155
 cream-0.150
 on-0.148
 colored-0.145
Token said
Token ID531
Feature activation-2.73e-02

Pos logit contributions
 Int+0.070
 Curious+0.063
 Sadly+0.061
 Everything+0.061
 Everywhere+0.061
Neg logit contributions
els-0.066
 on-0.057
ps-0.057
 inside-0.056
 colored-0.055
Token,
Token ID11
Feature activation-5.47e-02

Pos logit contributions
 Int+0.075
 Curious+0.070
 Everything+0.068
 Sadly+0.067
 Everywhere+0.067
Neg logit contributions
els-0.029
ps-0.021
 colored-0.020
 on-0.019
 inside-0.019
Token I
Token ID314
Feature activation-4.04e-02

Pos logit contributions
els+0.002
ps+0.002
 on+0.002
 inside+0.002
 colored+0.002
Neg logit contributions
 Int-0.008
 Curious-0.007
 Everything-0.007
 Sadly-0.007
 Everywhere-0.007
Token couldn
Token ID3521
Feature activation-2.18e-03

Pos logit contributions
 Int+0.081
 Curious+0.073
 Everything+0.070
 Everywhere+0.069
 Sadly+0.069
Neg logit contributions
els-0.069
 on-0.061
 inside-0.060
ps-0.060
 colored-0.058
Token't
Token ID470
Feature activation-7.38e-03

Pos logit contributions
 Int+0.004
 Curious+0.003
Of+0.003
 Everywhere+0.003
 Sadly+0.003
Neg logit contributions
els-0.004
 on-0.004
.-0.004
 inside-0.004
ps-0.004
Token help
Token ID1037
Feature activation+2.95e-02

Pos logit contributions
 Int+0.016
 Curious+0.014
 Sadly+0.014
 Everything+0.014
 Everywhere+0.013
Neg logit contributions
els-0.012
ps-0.010
 on-0.010
 inside-0.010
 colored-0.010
Token sooner
Token ID14556
Feature activation+5.19e-02

Pos logit contributions
els+0.055
ps+0.048
 colored+0.047
 cream+0.045
 on+0.045
Neg logit contributions
 Int-0.054
 Curious-0.048
 Everything-0.047
 Everywhere-0.047
 Sadly-0.046
Token."
Token ID526
Feature activation-1.44e-01

Pos logit contributions
els+0.055
ps+0.040
 colored+0.036
ws+0.034
bones+0.033
Neg logit contributions
 Int-0.135
 Curious-0.126
 Sadly-0.125
 Everything-0.124
 Everywhere-0.122
Token Qu
Token ID2264
Feature activation-3.78e-02

Pos logit contributions
 Int+0.237
 Curious+0.204
rolling+0.202
 Everything+0.198
 Everywhere+0.197
Neg logit contributions
.-0.283
 on-0.258
els-0.241
 inside-0.240
."-0.235
Tokenackers
Token ID28874
Feature activation-3.49e-02

Pos logit contributions
 Int+0.067
 Curious+0.060
 Everything+0.059
 Everywhere+0.057
 Sadly+0.055
Neg logit contributions
els-0.071
.-0.067
 on-0.065
ps-0.063
ws-0.061
Token said
Token ID531
Feature activation-3.49e-02

Pos logit contributions
 Int+0.067
 Everywhere+0.060
 Curious+0.059
 Everything+0.058
 Sadly+0.056
Neg logit contributions
els-0.061
.-0.058
 on-0.057
ps-0.056
 colored-0.055
Token,
Token ID11
Feature activation-6.43e-02

Pos logit contributions
 Int+0.094
 Curious+0.088
 Sadly+0.085
 Everything+0.085
 Everywhere+0.083
Neg logit contributions
els-0.040
ps-0.028
 colored-0.027
 cream-0.025
 inside-0.025
Token "
Token ID366
Feature activation-8.01e-02

Pos logit contributions
 Int+0.099
 Curious+0.085
 Everything+0.084
 Sadly+0.084
 Everywhere+0.083
Neg logit contributions
els-0.130
 on-0.130
.-0.126
 inside-0.125
ps-0.121
Token this
Token ID428
Feature activation+2.90e-02

Pos logit contributions
els+0.086
ps+0.067
 colored+0.062
 sticker+0.061
cks+0.061
Neg logit contributions
 Int-0.163
 Everywhere-0.155
 Everything-0.153
 Curious-0.150
 Sadly-0.147
Token drain
Token ID14782
Feature activation+1.04e-01

Pos logit contributions
els+0.043
ps+0.036
cks+0.034
bones+0.032
ries+0.032
Neg logit contributions
 Int-0.062
 Everywhere-0.058
 Curious-0.057
 Sadly-0.056
 Everything-0.056
Token by
Token ID416
Feature activation+2.99e-03

Pos logit contributions
els+0.118
ps+0.087
 colored+0.080
cks+0.078
 sticker+0.076
Neg logit contributions
 Int-0.259
 Everywhere-0.240
 Curious-0.239
 Everything-0.239
 Sadly-0.236
Token cleaning
Token ID12724
Feature activation+1.02e-01

Pos logit contributions
els+0.005
ries+0.004
 cream+0.004
 sticker+0.004
 inside+0.004
Neg logit contributions
 Int-0.006
 Everything-0.005
 Everywhere-0.005
 Sadly-0.005
 Curious-0.005
Token it
Token ID340
Feature activation+3.65e-02

Pos logit contributions
els+0.129
ps+0.110
cks+0.109
bones+0.106
 sticker+0.105
Neg logit contributions
 Int-0.215
 Everything-0.209
 Everywhere-0.209
 Sadly-0.197
 Curious-0.196
Token."
Token ID526
Feature activation-1.44e-01

Pos logit contributions
els+0.042
bones+0.029
ps+0.029
cks+0.028
 colored+0.027
Neg logit contributions
 Int-0.092
 Curious-0.085
 Everywhere-0.085
 Sadly-0.085
 Everything-0.084
Token He
Token ID679
Feature activation-7.12e-02

Pos logit contributions
 Int+0.242
 Curious+0.211
 Everything+0.201
rolling+0.201
Sadly+0.199
Neg logit contributions
.-0.280
 on-0.257
els-0.252
 inside-0.236
."-0.229
Token ran
Token ID4966
Feature activation+2.41e-02

Pos logit contributions
 Int+0.143
 Curious+0.130
 Everything+0.125
 Sadly+0.125
 Everywhere+0.124
Neg logit contributions
els-0.123
ps-0.105
 on-0.105
 colored-0.103
 inside-0.103
Token inside
Token ID2641
Feature activation+1.80e-02

Pos logit contributions
els+0.043
cks+0.035
ps+0.033
ries+0.033
 colored+0.032
Neg logit contributions
 Int-0.051
 Curious-0.047
 Sadly-0.045
 Everywhere-0.045
 Everything-0.045
Token and
Token ID290
Feature activation-6.89e-02

Pos logit contributions
els+0.028
ps+0.023
cks+0.021
 colored+0.021
ws+0.020
Neg logit contributions
 Int-0.041
 Curious-0.038
 Sadly-0.036
 Everywhere-0.036
 Everything-0.036
Token asked
Token ID1965
Feature activation-3.39e-02

Pos logit contributions
 Int+0.136
 Curious+0.123
 Sadly+0.118
 Everything+0.118
 Everywhere+0.115
Neg logit contributions
els-0.124
ps-0.105
 on-0.103
 colored-0.101
 inside-0.101
Token with
Token ID351
Feature activation+6.44e-03

Pos logit contributions
els+0.026
bones+0.020
ps+0.019
ws+0.019
ries+0.019
Neg logit contributions
 Int-0.028
 Sadly-0.025
 Curious-0.025
 Everything-0.024
ete-0.024
Token me
Token ID502
Feature activation+2.77e-02

Pos logit contributions
els+0.010
ps+0.008
 inside+0.008
 on+0.008
 colored+0.008
Neg logit contributions
 Int-0.013
 Everything-0.012
 Curious-0.012
 Everywhere-0.012
 Sadly-0.012
Token for
Token ID329
Feature activation+1.29e-02

Pos logit contributions
els+0.030
 colored+0.023
ps+0.022
 cream+0.021
bones+0.021
Neg logit contributions
 Int-0.069
 Curious-0.064
 Sadly-0.064
 Everything-0.064
 Everywhere-0.064
Token so
Token ID523
Feature activation-3.03e-02

Pos logit contributions
els+0.023
 colored+0.018
ishing+0.018
ps+0.017
 cream+0.017
Neg logit contributions
 Int-0.024
 Everything-0.022
 Curious-0.022
 Sadly-0.022
 Everywhere-0.022
Token long
Token ID890
Feature activation+5.29e-02

Pos logit contributions
 Int+0.057
 Curious+0.052
 Sadly+0.051
 Everything+0.051
 Everywhere+0.050
Neg logit contributions
els-0.056
ps-0.047
 colored-0.046
 inside-0.045
 on-0.045
Token."
Token ID526
Feature activation-1.44e-01

Pos logit contributions
els+0.059
 sticker+0.044
ws+0.042
 colored+0.040
 stamp+0.038
Neg logit contributions
 Int-0.128
 Curious-0.119
 Everywhere-0.117
 Everything-0.116
 Sadly-0.115
Token
Token ID198
Feature activation-9.84e-02

Pos logit contributions
 Int+0.242
 Curious+0.211
rolling+0.207
 Everything+0.202
Sadly+0.200
Neg logit contributions
.-0.272
 on-0.247
els-0.242
 inside-0.229
."-0.225
Token
Token ID198
Feature activation-1.05e-01

Pos logit contributions
 Int+0.185
 Curious+0.166
 Everything+0.162
ishly+0.161
rolling+0.158
Neg logit contributions
.-0.164
els-0.149
 on-0.144
 inside-0.135
."-0.131
TokenHer
Token ID9360
Feature activation-5.44e-02

Pos logit contributions
 Int+0.188
 Everything+0.168
 Curious+0.166
 Everywhere+0.164
 Sadly+0.160
Neg logit contributions
.-0.163
els-0.163
 cream-0.152
 on-0.151
 colored-0.146
Token mom
Token ID1995
Feature activation-1.96e-02

Pos logit contributions
 Int+0.100
 Curious+0.091
 Sadly+0.087
 Everything+0.087
 Everywhere+0.086
Neg logit contributions
els-0.103
ps-0.089
 on-0.087
 colored-0.086
 inside-0.086
Token said
Token ID531
Feature activation-2.29e-02

Pos logit contributions
 Int+0.036
 Curious+0.032
 Sadly+0.031
 Everything+0.031
 Everywhere+0.031
Neg logit contributions
els-0.038
ps-0.033
 on-0.032
 colored-0.032
 inside-0.032
Token want
Token ID765
Feature activation+1.25e-02

Pos logit contributions
els+0.017
ps+0.015
 colored+0.015
 cream+0.014
 on+0.014
Neg logit contributions
 Int-0.014
 Curious-0.012
 Everything-0.012
 Sadly-0.012
 Everywhere-0.011
Token to
Token ID284
Feature activation-1.94e-02

Pos logit contributions
els+0.021
ps+0.017
 colored+0.017
 inside+0.016
 on+0.016
Neg logit contributions
 Int-0.026
 Curious-0.024
 Everything-0.023
 Sadly-0.023
 Everywhere-0.023
Token make
Token ID787
Feature activation+3.97e-02

Pos logit contributions
 Int+0.036
 Curious+0.033
 Everything+0.032
 Everywhere+0.032
 Sadly+0.032
Neg logit contributions
els-0.036
ps-0.030
 on-0.029
 colored-0.029
 inside-0.029
Token a
Token ID257
Feature activation+2.10e-02

Pos logit contributions
els+0.058
ps+0.048
 colored+0.046
 on+0.045
 inside+0.044
Neg logit contributions
 Int-0.087
 Everything-0.080
 Curious-0.078
 Sadly-0.075
 Everywhere-0.075
Token fort
Token ID6285
Feature activation+1.00e-01

Pos logit contributions
els+0.035
ps+0.030
 colored+0.026
cks+0.026
ries+0.026
Neg logit contributions
 Int-0.043
 Curious-0.040
 Sadly-0.039
 Everything-0.038
Of-0.037
Token."
Token ID526
Feature activation-1.44e-01

Pos logit contributions
els+0.123
ps+0.085
 colored+0.081
ries+0.077
cks+0.076
Neg logit contributions
 Int-0.241
 Sadly-0.224
 Everywhere-0.224
 Everything-0.222
 Curious-0.221
Token
Token ID198
Feature activation-9.78e-02

Pos logit contributions
 Int+0.248
 Curious+0.217
Sadly+0.215
rolling+0.213
 Everything+0.209
Neg logit contributions
.-0.271
 on-0.246
els-0.241
 inside-0.226
."-0.221
Token
Token ID198
Feature activation-1.06e-01

Pos logit contributions
 Int+0.181
 Curious+0.165
 Everything+0.160
ishly+0.157
 Sadly+0.153
Neg logit contributions
.-0.169
els-0.163
 on-0.151
 inside-0.143
aks-0.139
TokenM
Token ID44
Feature activation-7.27e-02

Pos logit contributions
 Int+0.199
 Everything+0.185
 Curious+0.180
 Everywhere+0.179
 Sadly+0.173
Neg logit contributions
els-0.173
.-0.160
 cream-0.154
 on-0.152
 colored-0.149
Tokenia
Token ID544
Feature activation-5.32e-02

Pos logit contributions
 Int+0.137
 Curious+0.124
 Everything+0.118
 Sadly+0.117
 Everywhere+0.116
Neg logit contributions
els-0.139
ps-0.117
 on-0.116
 colored-0.115
 inside-0.115
Token said
Token ID531
Feature activation-3.42e-02

Pos logit contributions
 Int+0.111
 Curious+0.100
 Everything+0.098
 Sadly+0.098
 Everywhere+0.097
Neg logit contributions
els-0.088
 on-0.075
 colored-0.075
ps-0.075
 inside-0.074
Token Rex
Token ID17853
Feature activation+4.21e-02

Pos logit contributions
els+0.074
ps+0.062
 colored+0.061
 on+0.060
 inside+0.060
Neg logit contributions
 Int-0.100
 Curious-0.091
 Everything-0.088
 Sadly-0.088
 Everywhere-0.087
Token.
Token ID13
Feature activation-7.80e-02

Pos logit contributions
els+0.038
 colored+0.024
ps+0.020
ries+0.019
 coloured+0.019
Neg logit contributions
 Int-0.120
 Curious-0.113
 Sadly-0.109
 Everything-0.108
 Everywhere-0.106
Token He
Token ID679
Feature activation-1.66e-02

Pos logit contributions
 Int+0.126
ete+0.117
mber+0.111
Da+0.110
Sadly+0.108
Neg logit contributions
.-0.150
 on-0.149
 inside-0.135
els-0.134
ps-0.128
Token drops
Token ID10532
Feature activation+1.05e-01

Pos logit contributions
 Int+0.039
 Curious+0.036
 Everything+0.035
 Sadly+0.035
 Everywhere+0.034
Neg logit contributions
els-0.023
ps-0.019
 on-0.019
 inside-0.018
 colored-0.018
Token the
Token ID262
Feature activation+1.43e-02

Pos logit contributions
els+0.155
ps+0.124
 colored+0.124
 inside+0.117
 cream+0.116
Neg logit contributions
 Int-0.236
 Curious-0.215
 Everything-0.213
 Everywhere-0.211
 Sadly-0.205
Token ball
Token ID2613
Feature activation+9.71e-02

Pos logit contributions
els+0.028
cks+0.022
ps+0.022
 colored+0.022
bones+0.022
Neg logit contributions
 Int-0.024
 Curious-0.023
 Everywhere-0.022
 Everything-0.021
 Exc-0.021
Token.
Token ID13
Feature activation-7.49e-02

Pos logit contributions
els+0.118
ries+0.088
 colored+0.082
ocol+0.075
 coloured+0.074
Neg logit contributions
 Int-0.234
 He-0.219
 Sadly-0.218
 Curious-0.215
 Everywhere-0.214
Token He
Token ID679
Feature activation-1.79e-02

Pos logit contributions
 Int+0.106
ete+0.092
 Curious+0.089
Sadly+0.088
rolling+0.088
Neg logit contributions
.-0.157
 on-0.155
 inside-0.143
els-0.143
 sticker-0.141
Token w
Token ID266
Feature activation+7.56e-02

Pos logit contributions
 Int+0.041
 Curious+0.038
 Sadly+0.037
 Everything+0.037
 Everywhere+0.036
Neg logit contributions
els-0.026
ps-0.021
 on-0.021
 inside-0.020
 colored-0.020
Tokenags
Token ID3775
Feature activation+8.32e-02

Pos logit contributions
els+0.091
 on+0.070
ps+0.069
 inside+0.069
 colored+0.069
Neg logit contributions
 Int-0.193
 Curious-0.178
 Everything-0.174
 Sadly-0.173
 Everywhere-0.172
Token his
Token ID465
Feature activation+3.43e-02

Pos logit contributions
els+0.165
ps+0.133
ries+0.130
ws+0.130
 icing+0.129
Neg logit contributions
 Int-0.145
 Curious-0.130
 Everything-0.122
 His-0.121
 Sadly-0.119
Token tree
Token ID5509
Feature activation+7.94e-02

Pos logit contributions
 Int+0.032
 Curious+0.029
 Sadly+0.028
 Everything+0.028
 Everywhere+0.027
Neg logit contributions
els-0.024
ps-0.020
 colored-0.020
 on-0.019
 inside-0.019
Token.
Token ID13
Feature activation-9.41e-02

Pos logit contributions
els+0.093
ps+0.068
 colored+0.065
ries+0.060
 cream+0.060
Neg logit contributions
 Int-0.206
 Sadly-0.192
 Curious-0.191
 Everywhere-0.182
 Unfortunately-0.182
Token The
Token ID383
Feature activation-2.71e-02

Pos logit contributions
 Int+0.131
 Curious+0.109
Da+0.108
ete+0.106
Sadly+0.104
Neg logit contributions
 on-0.195
els-0.194
 inside-0.191
.-0.190
ps-0.181
Token dog
Token ID3290
Feature activation+1.85e-02

Pos logit contributions
 Int+0.065
 Everywhere+0.057
 Sadly+0.057
 Curious+0.057
ete+0.057
Neg logit contributions
 on-0.034
els-0.034
 cream-0.032
 inside-0.032
 sticker-0.030
Token cannot
Token ID2314
Feature activation+1.78e-02

Pos logit contributions
els+0.026
 on+0.022
ps+0.022
 inside+0.021
.+0.020
Neg logit contributions
 Int-0.043
 Curious-0.039
 Everything-0.039
 Sadly-0.039
 Everywhere-0.039
Token climb
Token ID12080
Feature activation+1.14e-01

Pos logit contributions
els+0.025
 inside+0.023
.+0.023
 on+0.023
ps+0.022
Neg logit contributions
 Int-0.040
 Curious-0.036
 Everything-0.036
 Sadly-0.036
 Everywhere-0.035
Token.
Token ID13
Feature activation-9.13e-02

Pos logit contributions
els+0.134
 colored+0.109
 sticker+0.107
 stamp+0.106
 frost+0.104
Neg logit contributions
 Int-0.275
 Everywhere-0.258
 Curious-0.249
 Sadly-0.244
 Everything-0.243
Token It
Token ID632
Feature activation-2.14e-02

Pos logit contributions
 Int+0.166
rolling+0.161
Ell+0.161
Da+0.158
Bet+0.153
Neg logit contributions
.-0.145
 on-0.144
 inside-0.132
els-0.126
."-0.125
Token stays
Token ID14768
Feature activation+6.96e-02

Pos logit contributions
 Int+0.053
ete+0.052
 Everything+0.051
 Everywhere+0.050
 Curious+0.050
Neg logit contributions
els-0.023
 on-0.022
ps-0.021
.-0.020
 inside-0.020
Token under
Token ID739
Feature activation+1.79e-02

Pos logit contributions
els+0.109
 icing+0.082
ries+0.082
 frost+0.078
ps+0.078
Neg logit contributions
 Int-0.141
 Curious-0.135
 Sadly-0.132
 Barkley-0.123
Don-0.122
Token the
Token ID262
Feature activation-6.23e-03

Pos logit contributions
els+0.027
 colored+0.022
 cream+0.021
ps+0.021
 inside+0.020
Neg logit contributions
 Int-0.040
 Curious-0.037
 Everything-0.036
 Everywhere-0.036
 Sadly-0.035
TokenThe
Token ID464
Feature activation-5.40e-02

Pos logit contributions
 Int+0.153
ishly+0.136
 Curious+0.135
 Sadly+0.134
 Everything+0.131
Neg logit contributions
 on-0.151
els-0.149
 colored-0.148
 cream-0.142
.-0.141
Token party
Token ID2151
Feature activation+3.96e-02

Pos logit contributions
 Int+0.107
 Curious+0.098
 Everything+0.094
 Sadly+0.093
 Everywhere+0.092
Neg logit contributions
els-0.097
ps-0.081
 on-0.078
 inside-0.077
 colored-0.077
Token was
Token ID373
Feature activation-4.83e-03

Pos logit contributions
els+0.078
ps+0.068
 on+0.067
 inside+0.066
 colored+0.066
Neg logit contributions
 Int-0.070
 Curious-0.063
 Everything-0.060
 Sadly-0.060
 Everywhere-0.059
Token filled
Token ID5901
Feature activation+6.09e-02

Pos logit contributions
 Int+0.008
 Curious+0.007
 Everything+0.007
 Everywhere+0.007
 Sadly+0.007
Neg logit contributions
els-0.010
ps-0.009
 colored-0.008
 cream-0.008
ws-0.008
Token with
Token ID351
Feature activation+3.56e-02

Pos logit contributions
els+0.085
ps+0.069
ws+0.062
ries+0.056
 sticker+0.056
Neg logit contributions
 Int-0.141
 Curious-0.132
 Everywhere-0.128
 Sadly-0.128
 Exc-0.126
Token balloons
Token ID35485
Feature activation+1.11e-01

Pos logit contributions
els+0.059
ps+0.049
 on+0.045
cks+0.045
ws+0.045
Neg logit contributions
 Int-0.073
 Curious-0.067
 Everything-0.066
 Everywhere-0.065
 Sadly-0.064
Token and
Token ID290
Feature activation-2.02e-02

Pos logit contributions
els+0.150
ps+0.124
ws+0.121
cks+0.110
bones+0.105
Neg logit contributions
 Int-0.249
 Curious-0.233
 Everything-0.225
 Everywhere-0.224
ete-0.220
Token music
Token ID2647
Feature activation+8.20e-02

Pos logit contributions
 Int+0.046
 Curious+0.043
 Everything+0.041
 Everywhere+0.041
 Sadly+0.040
Neg logit contributions
els-0.029
ps-0.024
 inside-0.022
 on-0.021
ws-0.021
Token and
Token ID290
Feature activation-2.14e-02

Pos logit contributions
els+0.094
ps+0.064
ws+0.062
cks+0.061
ries+0.057
Neg logit contributions
 Int-0.209
 Everything-0.200
 Curious-0.197
 Everywhere-0.194
 Sadly-0.188
Token lots
Token ID6041
Feature activation+7.74e-02

Pos logit contributions
 Int+0.045
 Curious+0.042
 Everything+0.041
 Everywhere+0.040
 Sadly+0.039
Neg logit contributions
els-0.035
ps-0.029
 inside-0.026
 on-0.026
ws-0.026
Token of
Token ID286
Feature activation+3.19e-02

Pos logit contributions
els+0.156
ps+0.132
ws+0.130
cks+0.129
ries+0.117
Neg logit contributions
 Int-0.137
 Everything-0.126
 Curious-0.123
 Everywhere-0.122
 Sadly-0.114
Token likes
Token ID7832
Feature activation+3.18e-02

Pos logit contributions
 Int+0.100
 Curious+0.091
 Sadly+0.088
 Everything+0.087
 Everywhere+0.086
Neg logit contributions
els-0.081
 on-0.069
ps-0.068
 colored-0.068
 inside-0.068
Token to
Token ID284
Feature activation-7.20e-03

Pos logit contributions
els+0.058
ps+0.053
cks+0.048
ier+0.047
 colored+0.046
Neg logit contributions
 Int-0.057
 Everywhere-0.052
 Everything-0.051
 Curious-0.051
Of-0.047
Token look
Token ID804
Feature activation+4.81e-02

Pos logit contributions
 Int+0.012
 Curious+0.011
 Everywhere+0.011
 Everything+0.010
 Exc+0.010
Neg logit contributions
els-0.015
ps-0.012
ries-0.012
ishing-0.012
ier-0.012
Token at
Token ID379
Feature activation+2.41e-02

Pos logit contributions
els+0.080
ps+0.076
bones+0.073
cks+0.069
 colored+0.065
Neg logit contributions
 Int-0.090
 Curious-0.084
 Everywhere-0.082
Of-0.080
 Everything-0.079
Token the
Token ID262
Feature activation+6.28e-04

Pos logit contributions
els+0.038
ps+0.037
 cream+0.035
ier+0.034
 colored+0.034
Neg logit contributions
 Int-0.043
 Everywhere-0.042
 Everything-0.041
 Curious-0.038
 Exc-0.035
Token pictures
Token ID5986
Feature activation+9.85e-02

Pos logit contributions
els+0.001
ps+0.001
cks+0.001
ck+0.001
ws+0.001
Neg logit contributions
 Int-0.001
 Curious-0.001
 Everywhere-0.001
 Everything-0.001
Of-0.001
Token and
Token ID290
Feature activation-4.46e-03

Pos logit contributions
els+0.152
ps+0.143
ws+0.140
cks+0.137
ishing+0.135
Neg logit contributions
 She-0.193
 Everywhere-0.174
 Int-0.172
 Everything-0.170
 Her-0.167
Token the
Token ID262
Feature activation-7.48e-03

Pos logit contributions
 Int+0.009
 Curious+0.008
 Everything+0.008
 Everywhere+0.008
 Sadly+0.007
Neg logit contributions
els-0.008
ps-0.007
 cream-0.006
 on-0.006
 colored-0.006
Token words
Token ID2456
Feature activation+8.71e-02

Pos logit contributions
 Int+0.017
 Curious+0.016
 Everything+0.015
 Everywhere+0.015
Of+0.014
Neg logit contributions
els-0.011
ps-0.009
cks-0.008
ws-0.008
ck-0.008
Token on
Token ID319
Feature activation+2.51e-02

Pos logit contributions
els+0.138
ishing+0.126
ps+0.125
ws+0.122
cks+0.120
Neg logit contributions
 She-0.168
 Everything-0.153
 Her-0.152
 He-0.150
 Int-0.149
Token the
Token ID262
Feature activation-1.22e-02

Pos logit contributions
ps+0.038
els+0.038
ishing+0.036
ws+0.036
 inside+0.035
Neg logit contributions
 Int-0.045
 Everything-0.043
 Everywhere-0.043
 She-0.041
 Curious-0.041
Token mom
Token ID1995
Feature activation+1.33e-05

Pos logit contributions
 Int+0.002
 Curious+0.002
 Sadly+0.002
 Everything+0.002
 Everywhere+0.002
Neg logit contributions
els-0.002
ps-0.002
cks-0.002
 colored-0.002
 on-0.002
Token reminded
Token ID14516
Feature activation+1.97e-02

Pos logit contributions
els+0.000
 on+0.000
 inside+0.000
 colored+0.000
ps+0.000
Neg logit contributions
 Int-0.000
 Curious-0.000
 Everywhere-0.000
 Everything-0.000
 Sadly-0.000
Token him
Token ID683
Feature activation+1.77e-02

Pos logit contributions
els+0.031
 on+0.028
.+0.027
 inside+0.026
 cream+0.025
Neg logit contributions
 Int-0.044
 Curious-0.041
 Sadly-0.038
 Everywhere-0.038
 Everything-0.037
Token to
Token ID284
Feature activation+3.09e-04

Pos logit contributions
els+0.021
 on+0.019
 inside+0.018
.+0.018
ps+0.017
Neg logit contributions
 Int-0.046
 Curious-0.042
 Everywhere-0.039
 Sadly-0.039
 Everything-0.039
Token finish
Token ID5461
Feature activation+7.57e-02

Pos logit contributions
els+0.001
ps+0.001
 on+0.000
 inside+0.000
 colored+0.000
Neg logit contributions
 Int-0.001
 Curious-0.001
 Sadly-0.000
 Everything-0.000
 Everywhere-0.000
Token cleaning
Token ID12724
Feature activation+1.22e-01

Pos logit contributions
els+0.125
ier+0.110
bones+0.107
iest+0.106
ws+0.106
Neg logit contributions
 Int-0.121
 Unfortunately-0.109
 Everything-0.109
 Sadly-0.107
 Everywhere-0.106
Token up
Token ID510
Feature activation+6.98e-02

Pos logit contributions
els+0.181
 colored+0.150
cks+0.150
ps+0.149
ck+0.148
Neg logit contributions
 Int-0.238
 Everywhere-0.228
 Everything-0.221
 Curious-0.213
 Sadly-0.212
Token.
Token ID13
Feature activation-1.18e-01

Pos logit contributions
els+0.114
ps+0.096
 colored+0.092
cks+0.092
bones+0.090
Neg logit contributions
 Int-0.133
 Everywhere-0.124
 Sadly-0.121
 Everything-0.120
 Curious-0.118
Token Tim
Token ID5045
Feature activation-3.45e-02

Pos logit contributions
ete+0.169
 Int+0.163
mber+0.149
 Barkley+0.148
 Dolphin+0.144
Neg logit contributions
.-0.241
 inside-0.221
els-0.219
 on-0.213
 chocolate-0.210
Tokenmy
Token ID1820
Feature activation-2.63e-02

Pos logit contributions
 Int+0.075
 Curious+0.068
 Everything+0.066
 Everywhere+0.065
 Sadly+0.065
Neg logit contributions
els-0.056
 on-0.047
ps-0.046
 colored-0.046
 inside-0.046
Token picked
Token ID6497
Feature activation+6.03e-02

Pos logit contributions
 Int+0.052
 Everywhere+0.047
 Curious+0.046
 Nice+0.046
 Everything+0.045
Neg logit contributions
els-0.044
 on-0.042
 colored-0.042
 inside-0.041
.-0.040
Token't
Token ID470
Feature activation-8.06e-03

Pos logit contributions
 Int+0.003
 Curious+0.003
Of+0.003
mber+0.003
 Sadly+0.002
Neg logit contributions
 on-0.003
els-0.003
.-0.003
 inside-0.003
ps-0.003
Token know
Token ID760
Feature activation+3.23e-02

Pos logit contributions
 Int+0.012
 Curious+0.011
 Everything+0.011
 Sadly+0.011
 Everywhere+0.010
Neg logit contributions
els-0.018
ries-0.016
 cream-0.015
ps-0.015
 colored-0.015
Token how
Token ID703
Feature activation+1.33e-02

Pos logit contributions
els+0.052
ps+0.046
 colored+0.045
ws+0.044
 sticker+0.043
Neg logit contributions
 Int-0.064
 Everything-0.058
 Curious-0.058
 Everywhere-0.057
 Sadly-0.054
Token to
Token ID284
Feature activation+1.13e-02

Pos logit contributions
els+0.018
 on+0.015
 inside+0.015
ps+0.014
.+0.014
Neg logit contributions
 Int-0.032
 Curious-0.029
 Everything-0.028
 Sadly-0.028
 Everywhere-0.028
Token get
Token ID651
Feature activation+6.83e-02

Pos logit contributions
els+0.020
ps+0.016
 on+0.016
 cream+0.016
 colored+0.016
Neg logit contributions
 Int-0.022
 Curious-0.020
 Sadly-0.020
 Everything-0.019
 Everywhere-0.019
Token back
Token ID736
Feature activation+4.22e-02

Pos logit contributions
els+0.113
 colored+0.091
ps+0.089
bones+0.085
cks+0.084
Neg logit contributions
 Int-0.141
 Curious-0.129
 Everything-0.126
 Sadly-0.125
 Everywhere-0.124
Token home
Token ID1363
Feature activation+7.97e-02

Pos logit contributions
els+0.069
ps+0.059
bones+0.057
cks+0.055
 frost+0.055
Neg logit contributions
 Int-0.082
 Everywhere-0.079
 Curious-0.077
 Sadly-0.076
 Everything-0.074
Token.
Token ID13
Feature activation-1.14e-01

Pos logit contributions
els+0.090
ps+0.065
 colored+0.061
cks+0.058
ws+0.057
Neg logit contributions
 Int-0.209
 Curious-0.196
 Sadly-0.195
 Everywhere-0.190
 Everything-0.189
Token Just
Token ID2329
Feature activation-1.52e-02

Pos logit contributions
rolling+0.145
Da+0.138
 Int+0.137
Ag+0.132
 concession+0.130
Neg logit contributions
.-0.239
."-0.212
 inside-0.202
 on-0.200
els-0.192
Token when
Token ID618
Feature activation-5.05e-02

Pos logit contributions
 Int+0.028
 Curious+0.025
 Sadly+0.024
 Everywhere+0.024
 Everything+0.024
Neg logit contributions
els-0.030
ps-0.025
 cream-0.024
ries-0.023
 colored-0.023
Token he
Token ID339
Feature activation-2.40e-02

Pos logit contributions
 Int+0.100
 Curious+0.089
 Everywhere+0.085
 Everything+0.085
 Sadly+0.083
Neg logit contributions
els-0.086
 on-0.082
 inside-0.078
.-0.077
ps-0.074
Token mom
Token ID1995
Feature activation+2.14e-03

Pos logit contributions
 Int+0.043
 Curious+0.038
 Sadly+0.038
 Everything+0.037
 Everywhere+0.037
Neg logit contributions
els-0.030
 on-0.027
 inside-0.026
ps-0.025
 cream-0.025
Tokenmy
Token ID1820
Feature activation-4.80e-03

Pos logit contributions
els+0.005
ps+0.004
cks+0.004
 sticker+0.004
 cream+0.004
Neg logit contributions
 Int-0.003
 Everything-0.003
 Curious-0.003
 Sadly-0.003
 Everywhere-0.003
Token drove
Token ID10357
Feature activation+5.58e-02

Pos logit contributions
 Int+0.007
 Curious+0.006
 Everything+0.006
 Sadly+0.006
 Everywhere+0.006
Neg logit contributions
els-0.011
ps-0.010
 cream-0.009
 sticker-0.009
cks-0.009
Token to
Token ID284
Feature activation-1.28e-02

Pos logit contributions
els+0.116
ps+0.096
cks+0.095
ries+0.089
ws+0.088
Neg logit contributions
 Int-0.097
 Curious-0.088
 Everything-0.084
 Everywhere-0.083
 Sadly-0.083
Token the
Token ID262
Feature activation-3.10e-02

Pos logit contributions
 Int+0.033
 Curious+0.030
 Sadly+0.029
 Everything+0.029
 Everywhere+0.029
Neg logit contributions
els-0.015
 on-0.012
 inside-0.012
ps-0.012
 colored-0.012
Token pet
Token ID4273
Feature activation+5.70e-02

Pos logit contributions
 Int+0.071
 Curious+0.064
 Sadly+0.062
 Everywhere+0.061
 Everything+0.060
Neg logit contributions
els-0.041
 inside-0.040
 cream-0.039
 on-0.038
 sticker-0.037
Token shop
Token ID6128
Feature activation+6.57e-02

Pos logit contributions
els+0.086
 on+0.075
 inside+0.074
.+0.074
 cream+0.068
Neg logit contributions
 Int-0.127
 Curious-0.116
 Sadly-0.111
 Everywhere-0.110
 Everything-0.108
Token.
Token ID13
Feature activation-1.18e-01

Pos logit contributions
els+0.061
ps+0.052
ws+0.044
 colored+0.040
cks+0.039
Neg logit contributions
 Int-0.182
 Curious-0.173
 Sadly-0.170
 Everything-0.166
Of-0.163
Token The
Token ID383
Feature activation-4.53e-02

Pos logit contributions
 Int+0.145
 Curious+0.118
Sadly+0.114
 Sadly+0.114
ete+0.112
Neg logit contributions
els-0.278
 on-0.265
 inside-0.262
.-0.257
 colored-0.252
Token pet
Token ID4273
Feature activation+3.26e-02

Pos logit contributions
 Int+0.082
 Curious+0.078
 Everything+0.074
Of+0.072
 Sadly+0.070
Neg logit contributions
els-0.092
ps-0.079
cks-0.076
ws-0.074
 colored-0.071
Token shop
Token ID6128
Feature activation+4.22e-02

Pos logit contributions
els+0.051
 on+0.049
 inside+0.048
.+0.047
 cream+0.044
Neg logit contributions
 Int-0.068
 Curious-0.061
 Sadly-0.059
Sadly-0.059
 Everywhere-0.058
Token the
Token ID262
Feature activation-4.33e-02

Pos logit contributions
 Int+0.118
 Curious+0.104
 Everything+0.099
 Sadly+0.099
 Everywhere+0.097
Neg logit contributions
els-0.151
 on-0.134
 inside-0.132
ps-0.131
 colored-0.130
Token boy
Token ID2933
Feature activation-1.85e-02

Pos logit contributions
 Int+0.071
 Curious+0.065
 Everything+0.061
 Sadly+0.060
 Everywhere+0.059
Neg logit contributions
els-0.094
ps-0.082
cks-0.078
 on-0.077
 colored-0.077
Token always
Token ID1464
Feature activation+4.40e-03

Pos logit contributions
 Int+0.035
 Curious+0.032
 Everything+0.031
 Sadly+0.030
 Everywhere+0.030
Neg logit contributions
els-0.034
ps-0.029
 on-0.029
 inside-0.029
 colored-0.028
Token came
Token ID1625
Feature activation+2.83e-02

Pos logit contributions
els+0.010
ps+0.008
ries+0.008
 sticker+0.008
 cream+0.008
Neg logit contributions
 Int-0.007
 Curious-0.006
 Sadly-0.006
 Everything-0.006
Of-0.006
Token to
Token ID284
Feature activation-5.87e-03

Pos logit contributions
els+0.041
 on+0.034
 inside+0.033
ps+0.033
 colored+0.032
Neg logit contributions
 Int-0.066
 Curious-0.060
 Everything-0.058
 Sadly-0.058
 Everywhere-0.058
Token see
Token ID766
Feature activation+2.38e-02

Pos logit contributions
 Int+0.012
 Curious+0.011
 Everything+0.011
 Sadly+0.011
 Everywhere+0.011
Neg logit contributions
els-0.010
ps-0.008
 colored-0.008
 inside-0.007
 on-0.007
Token the
Token ID262
Feature activation-6.44e-03

Pos logit contributions
els+0.036
ps+0.032
 sticker+0.029
 colored+0.029
 cream+0.028
Neg logit contributions
 Int-0.049
 Everywhere-0.046
 Everything-0.046
 Curious-0.045
 Sadly-0.045
Token cows
Token ID22575
Feature activation+8.26e-02

Pos logit contributions
 Int+0.012
 Curious+0.011
 Everything+0.011
 Sadly+0.011
 Everywhere+0.010
Neg logit contributions
els-0.012
ps-0.010
 on-0.010
 colored-0.010
 inside-0.010
Token and
Token ID290
Feature activation-2.07e-02

Pos logit contributions
els+0.110
ps+0.086
 colored+0.085
 cream+0.082
 sticker+0.078
Neg logit contributions
 Int-0.195
 Curious-0.181
 Everything-0.178
 Sadly-0.177
 Everywhere-0.174
Token he
Token ID339
Feature activation-2.44e-02

Pos logit contributions
 Int+0.047
 Curious+0.042
 Sadly+0.041
 Everywhere+0.041
 Everything+0.041
Neg logit contributions
els-0.031
 on-0.026
 inside-0.026
ps-0.026
 colored-0.025
Token was
Token ID373
Feature activation+2.79e-03

Pos logit contributions
 Int+0.043
 Curious+0.038
 Everything+0.036
 Everywhere+0.035
 Sadly+0.035
Neg logit contributions
els-0.049
 on-0.044
 inside-0.043
ps-0.043
.-0.043
Token,
Token ID11
Feature activation-2.78e-02

Pos logit contributions
els+0.112
ws+0.095
ps+0.093
 sticker+0.093
bones+0.085
Neg logit contributions
 Int-0.153
 Curious-0.144
 Everything-0.141
 Everywhere-0.140
 Sadly-0.139
Token running
Token ID2491
Feature activation+7.42e-02

Pos logit contributions
 Int+0.059
 Curious+0.053
 Sadly+0.052
 Everything+0.051
 Everywhere+0.051
Neg logit contributions
els-0.045
 on-0.039
ps-0.038
 inside-0.038
 colored-0.037
Token from
Token ID422
Feature activation+7.91e-03

Pos logit contributions
els+0.105
ps+0.083
 colored+0.080
 on+0.080
 inside+0.079
Neg logit contributions
 Int-0.175
 Curious-0.162
 Everything-0.156
 Sadly-0.156
 Everywhere-0.155
Token place
Token ID1295
Feature activation+5.25e-02

Pos logit contributions
els+0.014
ps+0.012
ier+0.012
 sticker+0.011
cks+0.011
Neg logit contributions
 Int-0.015
 Everywhere-0.014
 Curious-0.014
 Everything-0.014
 Sadly-0.013
Token to
Token ID284
Feature activation-7.39e-03

Pos logit contributions
els+0.078
ps+0.064
ws+0.063
 stamp+0.062
 colored+0.062
Neg logit contributions
 Int-0.110
 Everywhere-0.105
 Everything-0.102
 Sadly-0.101
 Curious-0.100
Token place
Token ID1295
Feature activation+5.77e-02

Pos logit contributions
 Int+0.016
 Curious+0.015
 Everything+0.014
 Everywhere+0.014
 Sadly+0.014
Neg logit contributions
els-0.012
ps-0.010
 colored-0.009
 on-0.009
 inside-0.009
Token.
Token ID13
Feature activation-1.14e-01

Pos logit contributions
els+0.066
ps+0.053
 colored+0.047
 stamp+0.047
ws+0.047
Neg logit contributions
 Int-0.141
 Everywhere-0.134
 Everything-0.134
 Curious-0.130
 Sadly-0.130
Token One
Token ID1881
Feature activation-1.25e-02

Pos logit contributions
 Int+0.135
Da+0.130
Sadly+0.129
ete+0.125
mber+0.124
Neg logit contributions
.-0.247
 on-0.232
els-0.231
 inside-0.228
."-0.214
Token day
Token ID1110
Feature activation-3.56e-02

Pos logit contributions
 Int+0.033
 Curious+0.031
 Sadly+0.030
 Everything+0.030
 Everywhere+0.030
Neg logit contributions
els-0.013
 on-0.011
 inside-0.011
 colored-0.010
.-0.010
Token he
Token ID339
Feature activation-4.82e-02

Pos logit contributions
 Int+0.085
 Curious+0.079
 Everything+0.078
 Everywhere+0.074
 Sadly+0.074
Neg logit contributions
els-0.053
ps-0.046
ws-0.041
cks-0.039
 colored-0.038
Token was
Token ID373
Feature activation-7.85e-03

Pos logit contributions
 Int+0.104
 Everywhere+0.096
Of+0.094
 Curious+0.094
 Everything+0.094
Neg logit contributions
 on-0.071
 inside-0.070
els-0.068
 colored-0.065
.-0.065
Token They
Token ID1119
Feature activation-4.99e-02

Pos logit contributions
 Int+0.173
 Curious+0.150
Sadly+0.150
ete+0.147
 Sadly+0.146
Neg logit contributions
els-0.220
.-0.214
 on-0.212
 inside-0.206
 colored-0.188
Token saw
Token ID2497
Feature activation+1.05e-02

Pos logit contributions
 Int+0.095
 Curious+0.086
 Everything+0.083
 Sadly+0.083
 Everywhere+0.081
Neg logit contributions
els-0.092
 on-0.083
 colored-0.081
 inside-0.081
.-0.079
Token a
Token ID257
Feature activation+1.44e-02

Pos logit contributions
els+0.011
 on+0.010
 inside+0.009
 cream+0.009
 colored+0.009
Neg logit contributions
 Int-0.028
 Curious-0.025
 Sadly-0.025
ete-0.025
 Everywhere-0.024
Token big
Token ID1263
Feature activation+5.06e-02

Pos logit contributions
els+0.026
ps+0.023
ws+0.021
 colored+0.020
cks+0.020
Neg logit contributions
 Int-0.028
 Curious-0.026
 Everything-0.024
 Sadly-0.024
Of-0.023
Token pond
Token ID16723
Feature activation+8.36e-02

Pos logit contributions
els+0.096
ps+0.091
ws+0.083
ier+0.078
cks+0.078
Neg logit contributions
 Int-0.083
 Curious-0.076
 Everything-0.075
 Exc-0.071
 Sadly-0.070
Token with
Token ID351
Feature activation+2.44e-02

Pos logit contributions
els+0.126
ps+0.109
ws+0.099
 problems+0.094
ries+0.089
Neg logit contributions
 Int-0.179
 Curious-0.168
 Sadly-0.160
 She-0.157
 Everywhere-0.157
Token ducks
Token ID39694
Feature activation+8.40e-02

Pos logit contributions
els+0.034
 on+0.029
ps+0.028
 colored+0.028
 inside+0.028
Neg logit contributions
 Int-0.057
 Curious-0.052
 Sadly-0.051
 Everything-0.051
 Everywhere-0.050
Token and
Token ID290
Feature activation-2.32e-02

Pos logit contributions
els+0.116
ws+0.104
 problems+0.102
 sticker+0.102
ps+0.100
Neg logit contributions
 Int-0.159
 Curious-0.149
 She-0.148
 He-0.144
 Fro-0.144
Token fish
Token ID5916
Feature activation+9.27e-02

Pos logit contributions
 Int+0.055
 Sadly+0.051
 Everything+0.050
 Everywhere+0.050
 Curious+0.050
Neg logit contributions
els-0.030
 on-0.027
 colored-0.026
 inside-0.026
 cream-0.025
Token.
Token ID13
Feature activation-1.09e-01

Pos logit contributions
els+0.097
 colored+0.081
ishing+0.080
ps+0.079
 sticker+0.079
Neg logit contributions
 Int-0.192
 Fro-0.190
 He-0.186
 She-0.182
 Curious-0.182
Token They
Token ID1119
Feature activation-4.50e-02

Pos logit contributions
 Int+0.195
Sadly+0.183
rolling+0.181
ete+0.178
 Barkley+0.176
Neg logit contributions
.-0.182
els-0.176
 on-0.174
 inside-0.160
."-0.152
Token and
Token ID290
Feature activation-1.50e-02

Pos logit contributions
els+0.099
ps+0.083
cks+0.080
ws+0.077
bones+0.077
Neg logit contributions
 Int-0.137
 Curious-0.130
 Everything-0.125
 Sadly-0.119
 Everywhere-0.118
Token delicious
Token ID12625
Feature activation+9.34e-02

Pos logit contributions
 Int+0.031
 Curious+0.028
 Sadly+0.027
 Everything+0.027
 Everywhere+0.027
Neg logit contributions
els-0.025
ps-0.021
 on-0.021
 inside-0.021
 colored-0.021
Token.
Token ID13
Feature activation-9.85e-02

Pos logit contributions
els+0.117
ps+0.102
cks+0.096
ws+0.095
ck+0.081
Neg logit contributions
 Int-0.205
 Curious-0.199
 She-0.196
 So-0.196
 He-0.195
Token
Token ID198
Feature activation-6.23e-02

Pos logit contributions
rolling+0.175
 Int+0.174
Ag+0.164
Of+0.153
Sadly+0.151
Neg logit contributions
.-0.163
 on-0.155
els-0.144
."-0.136
cks-0.136
Token
Token ID198
Feature activation-8.43e-02

Pos logit contributions
 Int+0.126
 Curious+0.113
ishly+0.111
 Everything+0.108
Of+0.106
Neg logit contributions
.-0.098
 on-0.096
 inside-0.090
els-0.087
aks-0.082
TokenHe
Token ID1544
Feature activation-3.99e-02

Pos logit contributions
 Int+0.154
 Everything+0.139
 Curious+0.134
 Sadly+0.132
 Everywhere+0.131
Neg logit contributions
 on-0.148
els-0.135
aks-0.133
cks-0.131
.-0.126
Token thanked
Token ID26280
Feature activation+9.45e-03

Pos logit contributions
 Int+0.080
 Curious+0.072
 Everything+0.070
 Everywhere+0.069
 Sadly+0.068
Neg logit contributions
els-0.069
 on-0.062
 colored-0.062
 inside-0.061
 cream-0.059
Token the
Token ID262
Feature activation-2.60e-02

Pos logit contributions
els+0.009
 on+0.008
ps+0.007
.+0.007
 inside+0.007
Neg logit contributions
 Int-0.026
 Curious-0.024
 Sadly-0.024
 Everything-0.023
 Everywhere-0.023
Token old
Token ID1468
Feature activation+6.39e-03

Pos logit contributions
 Int+0.050
 Curious+0.047
 Everything+0.045
 Sadly+0.044
 Everywhere+0.043
Neg logit contributions
els-0.049
ps-0.041
 colored-0.039
 on-0.039
cks-0.039
Token lady
Token ID10846
Feature activation+4.78e-02

Pos logit contributions
els+0.014
ps+0.012
cks+0.012
ws+0.012
ishing+0.012
Neg logit contributions
 Int-0.010
 Curious-0.010
 Everything-0.009
 Sadly-0.009
Of-0.008
Token and
Token ID290
Feature activation-2.69e-02

Pos logit contributions
els+0.061
ps+0.050
 colored+0.048
 cream+0.045
cks+0.043
Neg logit contributions
 Int-0.115
 Curious-0.107
 Everything-0.104
 Sadly-0.104
 Everywhere-0.104
Token,
Token ID11
Feature activation+1.57e-02

Pos logit contributions
cks+0.198
els+0.197
ps+0.174
ws+0.156
ck+0.156
Neg logit contributions
hog-0.277
 Nice-0.276
 He-0.271
 Int-0.268
 Everywhere-0.267
Token but
Token ID475
Feature activation-1.39e-02

Pos logit contributions
els+0.027
ps+0.023
 colored+0.022
 on+0.022
 inside+0.022
Neg logit contributions
 Int-0.031
 Curious-0.029
 Everything-0.027
 Sadly-0.027
 Everywhere-0.027
Token a
Token ID257
Feature activation+2.83e-02

Pos logit contributions
 Int+0.024
 Curious+0.022
 Everything+0.022
 Everywhere+0.021
 Sadly+0.021
Neg logit contributions
els-0.027
ps-0.023
 colored-0.023
 cream-0.022
ries-0.022
Token responsibility
Token ID5798
Feature activation+9.98e-02

Pos logit contributions
els+0.052
ps+0.042
ries+0.042
cks+0.040
ws+0.039
Neg logit contributions
 Int-0.053
 Curious-0.052
 Sadly-0.052
 Everything-0.049
Of-0.049
Token.
Token ID13
Feature activation-9.58e-02

Pos logit contributions
els+0.142
ries+0.103
cks+0.103
 colored+0.103
ps+0.097
Neg logit contributions
 Int-0.216
 Everything-0.209
 Everywhere-0.203
 Curious-0.202
 Sadly-0.199
Token They
Token ID1119
Feature activation-3.55e-02

Pos logit contributions
 Int+0.149
 Curious+0.128
ete+0.121
rolling+0.119
 Everything+0.114
Neg logit contributions
els-0.195
.-0.182
 on-0.181
 inside-0.177
 sticker-0.171
Token also
Token ID635
Feature activation-3.06e-04

Pos logit contributions
 Int+0.074
 Curious+0.065
 Everything+0.063
 Everywhere+0.062
rolling+0.061
Neg logit contributions
els-0.058
 inside-0.056
.-0.055
 on-0.055
 colored-0.054
Token learned
Token ID4499
Feature activation+3.77e-02

Pos logit contributions
 Int+0.001
 Curious+0.000
 Everything+0.000
 Everywhere+0.000
 Sadly+0.000
Neg logit contributions
els-0.001
 colored-0.001
 on-0.001
 inside-0.001
ps-0.001
Token that
Token ID326
Feature activation-1.85e-03

Pos logit contributions
els+0.055
ps+0.045
 colored+0.043
 on+0.043
 inside+0.042
Neg logit contributions
 Int-0.086
 Curious-0.079
 Sadly-0.077
 Everything-0.077
 Everywhere-0.076
Token Fl
Token ID1610
Feature activation+1.62e-02

Pos logit contributions
 Int+0.003
 Curious+0.003
 Everything+0.003
 Sadly+0.003
 Everywhere+0.003
Neg logit contributions
els-0.003
ps-0.003
 cream-0.003
 colored-0.003
 inside-0.003
Tokenuffy
Token ID15352
Feature activation+3.19e-02

Pos logit contributions
els+0.036
ps+0.031
 on+0.031
 inside+0.031
 colored+0.030
Neg logit contributions
 Int-0.026
 Curious-0.023
 Everything-0.022
 Sadly-0.021
 Everywhere-0.021
Token She
Token ID1375
Feature activation-6.63e-02

Pos logit contributions
 Int+0.119
 Curious+0.092
 Sadly+0.087
ete+0.085
Sadly+0.085
Neg logit contributions
els-0.343
.-0.319
 on-0.313
 inside-0.311
 colored-0.310
Token had
Token ID550
Feature activation+9.50e-05

Pos logit contributions
 Int+0.113
 Curious+0.101
 Everything+0.096
 Sadly+0.095
 Everywhere+0.093
Neg logit contributions
els-0.136
 on-0.122
 colored-0.122
 inside-0.120
ps-0.118
Token a
Token ID257
Feature activation+1.33e-02

Pos logit contributions
els+0.000
 on+0.000
 colored+0.000
 inside+0.000
ps+0.000
Neg logit contributions
 Int-0.000
 Curious-0.000
 Sadly-0.000
 Everything-0.000
ete-0.000
Token warm
Token ID5814
Feature activation+6.98e-02

Pos logit contributions
els+0.021
ps+0.018
cks+0.016
ws+0.016
bones+0.016
Neg logit contributions
 Int-0.029
 Curious-0.027
 Sadly-0.026
 Everywhere-0.026
 Everything-0.026
Token blanket
Token ID18447
Feature activation+8.00e-02

Pos logit contributions
els+0.108
ps+0.097
cks+0.095
ws+0.089
bones+0.087
Neg logit contributions
 Int-0.145
 Curious-0.140
 Everywhere-0.139
 Everything-0.136
 Sadly-0.132
Token that
Token ID326
Feature activation-3.21e-02

Pos logit contributions
els+0.118
ps+0.096
ws+0.087
 colored+0.085
cks+0.084
Neg logit contributions
 Int-0.178
 Curious-0.166
 Everywhere-0.164
 Sadly-0.164
 Everything-0.160
Token she
Token ID673
Feature activation-4.55e-02

Pos logit contributions
 Int+0.061
 Curious+0.055
 Sadly+0.051
ete+0.051
 Everything+0.050
Neg logit contributions
els-0.057
 on-0.056
 inside-0.054
 colored-0.053
.-0.053
Token loved
Token ID6151
Feature activation+3.58e-02

Pos logit contributions
 Int+0.074
 Curious+0.065
 Everything+0.063
 Sadly+0.061
 Everywhere+0.060
Neg logit contributions
els-0.097
 on-0.087
 inside-0.086
 colored-0.086
ps-0.086
Token to
Token ID284
Feature activation-9.48e-03

Pos logit contributions
els+0.067
ps+0.059
cks+0.054
ws+0.054
 colored+0.052
Neg logit contributions
 Int-0.065
 Everywhere-0.059
 Curious-0.058
 Everything-0.058
 Sadly-0.057
Token sn
Token ID3013
Feature activation+7.51e-02

Pos logit contributions
 Int+0.018
 Curious+0.017
 Everywhere+0.016
 Everything+0.016
 Sadly+0.016
Neg logit contributions
els-0.017
ps-0.015
 cream-0.014
 on-0.014
 inside-0.014
Tokenuggle
Token ID33498
Feature activation+1.07e-01

Pos logit contributions
els+0.167
ps+0.151
 colored+0.146
 on+0.143
 inside+0.141
Neg logit contributions
 Int-0.114
 Curious-0.101
 Sadly-0.098
 Everything-0.097
 Everywhere-0.097
Token think
Token ID892
Feature activation-6.92e-03

Pos logit contributions
 Int+0.096
 Curious+0.087
 Sadly+0.084
 Everything+0.084
 Everywhere+0.083
Neg logit contributions
els-0.074
 on-0.062
ps-0.062
 inside-0.061
 colored-0.060
Token they
Token ID484
Feature activation-3.54e-02

Pos logit contributions
 Int+0.014
 Everything+0.013
 Everywhere+0.012
 Curious+0.012
 Sadly+0.012
Neg logit contributions
els-0.012
ps-0.010
 colored-0.010
ws-0.010
 cream-0.010
Token are
Token ID389
Feature activation-6.05e-04

Pos logit contributions
 Int+0.074
 Curious+0.067
 Sadly+0.064
 Everything+0.063
 Everywhere+0.063
Neg logit contributions
els-0.059
 on-0.051
 inside-0.051
ps-0.050
 colored-0.049
Token almost
Token ID2048
Feature activation+1.21e-02

Pos logit contributions
 Int+0.001
 Curious+0.001
 Everything+0.001
 Everywhere+0.001
 Sadly+0.001
Neg logit contributions
els-0.001
 colored-0.001
ries-0.001
ishing-0.001
ws-0.001
Token at
Token ID379
Feature activation+1.53e-02

Pos logit contributions
els+0.020
 colored+0.015
ries+0.014
 sticker+0.014
 paper+0.014
Neg logit contributions
 Int-0.025
 Curious-0.023
 Everything-0.022
ete-0.021
 Everywhere-0.021
Token the
Token ID262
Feature activation-1.60e-02

Pos logit contributions
els+0.027
 colored+0.021
ps+0.021
 cream+0.021
ries+0.020
Neg logit contributions
 Int-0.029
 Everywhere-0.027
 Everything-0.027
 Sadly-0.027
 Curious-0.027
Token park
Token ID3952
Feature activation+6.60e-02

Pos logit contributions
 Int+0.028
 Curious+0.026
 Everything+0.026
 Sadly+0.025
 Everywhere+0.025
Neg logit contributions
els-0.032
ps-0.027
ries-0.026
 colored-0.025
cks-0.025
Token.
Token ID13
Feature activation-1.13e-01

Pos logit contributions
els+0.061
ps+0.042
ws+0.036
 colored+0.036
cks+0.033
Neg logit contributions
 Int-0.181
 Curious-0.172
 Sadly-0.170
 Everything-0.169
 Everywhere-0.167
Token
Token ID198
Feature activation-7.36e-02

Pos logit contributions
 Int+0.188
ete+0.168
rolling+0.167
 Curious+0.164
Sadly+0.163
Neg logit contributions
.-0.190
 on-0.181
els-0.180
 inside-0.173
."-0.164
Token
Token ID198
Feature activation-9.38e-02

Pos logit contributions
 Int+0.124
 Curious+0.111
 Everything+0.105
 Sadly+0.101
Of+0.100
Neg logit contributions
els-0.141
.-0.135
 on-0.132
 inside-0.129
 colored-0.122
TokenBut
Token ID1537
Feature activation-6.50e-02

Pos logit contributions
 Int+0.186
 Curious+0.168
 Everything+0.163
 Everywhere+0.162
 Sadly+0.159
Neg logit contributions
els-0.160
 on-0.144
 inside-0.139
.-0.136
 colored-0.136
Token the
Token ID262
Feature activation-3.85e-02

Pos logit contributions
 Int+0.053
 Curious+0.049
 Sadly+0.048
 Everything+0.048
 Everywhere+0.047
Neg logit contributions
els-0.029
 on-0.023
 inside-0.023
ps-0.023
 colored-0.023
Token beach
Token ID10481
Feature activation+5.68e-02

Pos logit contributions
 Int+0.087
 Curious+0.079
 Sadly+0.077
 Everywhere+0.076
 Everything+0.076
Neg logit contributions
els-0.055
 inside-0.050
 cream-0.050
 on-0.050
 colored-0.048
Token with
Token ID351
Feature activation+3.73e-03

Pos logit contributions
els+0.062
ps+0.050
 colored+0.039
ws+0.038
 sticker+0.037
Neg logit contributions
 Int-0.151
 Curious-0.141
 Sadly-0.136
 Everywhere-0.134
 Everything-0.133
Token her
Token ID607
Feature activation-1.32e-02

Pos logit contributions
els+0.005
ps+0.004
 on+0.004
 inside+0.004
 colored+0.004
Neg logit contributions
 Int-0.009
 Curious-0.008
 Sadly-0.008
 Everything-0.008
 Everywhere-0.008
Token mom
Token ID1995
Feature activation+2.20e-02

Pos logit contributions
 Int+0.025
 Curious+0.023
 Sadly+0.022
 Everything+0.022
 Everywhere+0.021
Neg logit contributions
els-0.025
ps-0.021
 on-0.020
 colored-0.020
 inside-0.020
Token and
Token ID290
Feature activation-4.18e-02

Pos logit contributions
ps+0.036
els+0.035
bones+0.032
cks+0.030
ws+0.030
Neg logit contributions
 Int-0.042
 Everywhere-0.038
 So-0.038
 She-0.038
 Curious-0.038
Token dad
Token ID9955
Feature activation+3.08e-02

Pos logit contributions
 Int+0.094
 Curious+0.086
 Everywhere+0.084
 Sadly+0.083
 Everything+0.082
Neg logit contributions
 inside-0.060
els-0.060
 on-0.059
 cream-0.056
 colored-0.054
Token.
Token ID13
Feature activation-1.23e-01

Pos logit contributions
els+0.038
ps+0.036
cks+0.029
bones+0.029
ws+0.029
Neg logit contributions
 Int-0.073
 So-0.070
 Sadly-0.068
 Her-0.068
 Everywhere-0.068
Token She
Token ID1375
Feature activation-6.11e-02

Pos logit contributions
 Int+0.172
ete+0.149
Sadly+0.147
 Curious+0.143
 Sadly+0.140
Neg logit contributions
els-0.264
 inside-0.253
.-0.252
 on-0.249
 colored-0.241
Token saw
Token ID2497
Feature activation+1.08e-02

Pos logit contributions
 Int+0.134
 Everything+0.123
 Curious+0.123
 Everywhere+0.121
 Sadly+0.116
Neg logit contributions
 on-0.089
 inside-0.088
 colored-0.087
.-0.084
els-0.080
Token some
Token ID617
Feature activation+5.94e-02

Pos logit contributions
els+0.011
 on+0.010
 inside+0.010
 cream+0.009
 colored+0.009
Neg logit contributions
 Int-0.029
 Sadly-0.028
 Curious-0.027
 Unfortunately-0.026
rolling-0.026
Token many
Token ID867
Feature activation+7.02e-02

Pos logit contributions
 Int+0.015
 Curious+0.014
 Sadly+0.014
 Everything+0.014
 Everywhere+0.013
Neg logit contributions
els-0.010
 on-0.009
 inside-0.008
ps-0.008
 colored-0.008
Token dresses
Token ID27309
Feature activation+1.17e-01

Pos logit contributions
ps+0.095
els+0.091
ws+0.089
cks+0.083
bones+0.074
Neg logit contributions
 Int-0.149
 Curious-0.136
 Everywhere-0.131
 Everything-0.130
Of-0.128
Token in
Token ID287
Feature activation+1.74e-02

Pos logit contributions
els+0.171
ps+0.147
ws+0.147
cks+0.139
 harm+0.130
Neg logit contributions
 Int-0.238
 Everywhere-0.220
 Curious-0.219
 Everything-0.216
 She-0.216
Token the
Token ID262
Feature activation-2.06e-02

Pos logit contributions
els+0.027
ps+0.026
ws+0.024
ier+0.023
 cream+0.022
Neg logit contributions
 Int-0.035
 Everywhere-0.032
 Everything-0.032
 Curious-0.032
 Exc-0.030
Token store
Token ID3650
Feature activation+6.51e-02

Pos logit contributions
 Int+0.043
 Curious+0.039
 Sadly+0.037
 Everything+0.037
 Everywhere+0.037
Neg logit contributions
els-0.035
ps-0.029
 on-0.029
 inside-0.029
 colored-0.029
Token.
Token ID13
Feature activation-1.15e-01

Pos logit contributions
els+0.066
ps+0.052
ws+0.044
 colored+0.042
cks+0.042
Neg logit contributions
 Int-0.173
 Curious-0.162
 Sadly-0.161
 Everything-0.159
 Everywhere-0.158
Token Some
Token ID2773
Feature activation+2.89e-02

Pos logit contributions
 Int+0.200
Sadly+0.186
 Barkley+0.182
Da+0.180
ishly+0.179
Neg logit contributions
.-0.195
els-0.193
 on-0.185
 inside-0.182
 paper-0.166
Token were
Token ID547
Feature activation+2.20e-02

Pos logit contributions
els+0.043
ps+0.039
ws+0.036
cks+0.036
 inside+0.034
Neg logit contributions
 Int-0.062
 Curious-0.057
 Everything-0.056
 Everywhere-0.055
Of-0.053
Token red
Token ID2266
Feature activation+7.20e-02

Pos logit contributions
els+0.034
ps+0.030
ws+0.028
cks+0.027
ries+0.026
Neg logit contributions
 Int-0.043
 Curious-0.040
 Everything-0.039
 Everywhere-0.039
 Sadly-0.038
Token,
Token ID11
Feature activation-1.07e-02

Pos logit contributions
els+0.087
cks+0.085
ps+0.080
ws+0.077
ck+0.073
Neg logit contributions
 Int-0.156
 Everything-0.145
 Everywhere-0.144
 Curious-0.143
 She-0.142
Token some
Token ID617
Feature activation+4.58e-02

Pos logit contributions
 Int+0.025
 Curious+0.023
 Everything+0.022
 Sadly+0.022
 Everywhere+0.022
Neg logit contributions
els-0.015
ps-0.012
 on-0.012
 inside-0.012
 colored-0.012
Token the
Token ID262
Feature activation+1.41e-02

Pos logit contributions
els+0.112
ps+0.088
 sticker+0.087
ries+0.083
ier+0.083
Neg logit contributions
 Int-0.131
 Curious-0.122
 Everywhere-0.120
 Everything-0.119
 Sadly-0.114
Token sun
Token ID4252
Feature activation+7.12e-02

Pos logit contributions
els+0.025
ps+0.021
 on+0.020
 inside+0.020
 colored+0.020
Neg logit contributions
 Int-0.028
 Curious-0.026
 Everything-0.025
 Sadly-0.025
 Everywhere-0.024
Token and
Token ID290
Feature activation-1.16e-02

Pos logit contributions
els+0.088
ps+0.070
ws+0.068
 cream+0.062
cks+0.060
Neg logit contributions
 Int-0.167
 Curious-0.156
 Everything-0.155
 Everywhere-0.152
 Sadly-0.148
Token the
Token ID262
Feature activation-1.08e-02

Pos logit contributions
 Int+0.025
 Curious+0.024
 Everything+0.023
 Everywhere+0.022
 Sadly+0.022
Neg logit contributions
els-0.018
ps-0.015
 sticker-0.014
 cream-0.013
 on-0.013
Token waves
Token ID9813
Feature activation+7.99e-02

Pos logit contributions
 Int+0.021
 Curious+0.020
 Everything+0.019
 Sadly+0.019
 Everywhere+0.019
Neg logit contributions
els-0.020
ps-0.016
 on-0.015
 colored-0.015
 cream-0.015
Token.
Token ID13
Feature activation-1.11e-01

Pos logit contributions
els+0.112
ps+0.084
 cream+0.080
 colored+0.079
ries+0.078
Neg logit contributions
 Int-0.186
 Curious-0.173
 Everything-0.169
 Everywhere-0.166
 Sadly-0.166
Token The
Token ID383
Feature activation-4.29e-02

Pos logit contributions
 Int+0.142
mber+0.121
Of+0.119
ete+0.118
 Curious+0.117
Neg logit contributions
.-0.221
 on-0.209
els-0.209
 inside-0.205
."-0.195
Token gentle
Token ID10296
Feature activation+2.45e-02

Pos logit contributions
 Int+0.083
 Curious+0.078
 Everything+0.075
 Everywhere+0.074
 Sadly+0.073
Neg logit contributions
els-0.078
ps-0.064
 on-0.062
 cream-0.061
ries-0.061
Token old
Token ID1468
Feature activation-1.69e-02

Pos logit contributions
els+0.042
ps+0.035
 on+0.035
 inside+0.034
 colored+0.034
Neg logit contributions
 Int-0.050
 Curious-0.045
 Everything-0.044
 Sadly-0.044
 Everywhere-0.043
Token boat
Token ID8848
Feature activation+3.53e-02

Pos logit contributions
 Int+0.033
 Curious+0.030
 Everything+0.029
 Sadly+0.029
 Everywhere+0.028
Neg logit contributions
els-0.030
ps-0.026
 on-0.025
 colored-0.024
 inside-0.024
Token ate
Token ID15063
Feature activation+8.54e-02

Pos logit contributions
els+0.064
 on+0.057
ps+0.056
 inside+0.056
 colored+0.055
Neg logit contributions
 Int-0.067
 Curious-0.060
 Everything-0.058
 Sadly-0.057
 Everywhere-0.057
Token had
Token ID550
Feature activation-1.00e-02

Pos logit contributions
 Int+0.119
 Curious+0.106
 Everything+0.102
 Sadly+0.102
 Everywhere+0.100
Neg logit contributions
els-0.104
 on-0.094
 inside-0.092
ps-0.091
 colored-0.090
Token fun
Token ID1257
Feature activation+4.62e-02

Pos logit contributions
 Int+0.022
 Curious+0.020
 Everything+0.019
 Sadly+0.019
 Everywhere+0.019
Neg logit contributions
els-0.016
 on-0.014
ps-0.013
 inside-0.013
 colored-0.013
Token together
Token ID1978
Feature activation+4.47e-02

Pos logit contributions
els+0.075
 sticker+0.062
ws+0.061
ps+0.059
ries+0.057
Neg logit contributions
 Int-0.090
 Curious-0.082
 Sadly-0.080
 Everything-0.080
 As-0.079
Token.
Token ID13
Feature activation-1.10e-01

Pos logit contributions
els+0.062
ps+0.048
ws+0.045
ries+0.044
 sticker+0.043
Neg logit contributions
 Int-0.095
 Everywhere-0.092
 Sadly-0.090
ete-0.090
 couldn-0.089
Token
Token ID198
Feature activation-7.26e-02

Pos logit contributions
 Int+0.174
 Curious+0.149
ete+0.146
rolling+0.143
Sadly+0.136
Neg logit contributions
.-0.208
els-0.205
 on-0.199
 inside-0.193
."-0.186
Token
Token ID198
Feature activation-9.20e-02

Pos logit contributions
 Int+0.135
 Curious+0.121
 Everything+0.112
 Sadly+0.110
ishly+0.109
Neg logit contributions
els-0.130
 on-0.125
.-0.124
 inside-0.121
 colored-0.113
TokenAt
Token ID2953
Feature activation-7.84e-03

Pos logit contributions
 Int+0.169
 Curious+0.153
 Everything+0.145
 Everywhere+0.144
 Sadly+0.139
Neg logit contributions
els-0.170
 on-0.158
 inside-0.150
 colored-0.149
 cream-0.149
Token the
Token ID262
Feature activation-3.81e-02

Pos logit contributions
 Int+0.017
 Everywhere+0.015
 Curious+0.015
 Sadly+0.015
 Everything+0.015
Neg logit contributions
els-0.012
ps-0.010
 colored-0.009
ries-0.009
 cream-0.009
Token end
Token ID886
Feature activation-2.16e-04

Pos logit contributions
 Int+0.080
 Curious+0.073
 Sadly+0.070
 Everything+0.070
 Everywhere+0.069
Neg logit contributions
els-0.063
ps-0.053
 on-0.053
 inside-0.052
 colored-0.052
Token of
Token ID286
Feature activation-5.39e-03

Pos logit contributions
 Int+0.000
 Everything+0.000
 Curious+0.000
 Everywhere+0.000
 Sadly+0.000
Neg logit contributions
els-0.000
ps-0.000
ws-0.000
 colored-0.000
cks-0.000
Token the
Token ID262
Feature activation-2.42e-02

Pos logit contributions
 Int+0.010
 Everywhere+0.009
 Sadly+0.009
 Everything+0.009
 Curious+0.009
Neg logit contributions
els-0.010
ps-0.008
ws-0.008
ier-0.008
ater-0.008
Token.
Token ID13
Feature activation-1.15e-01

Pos logit contributions
els+0.103
ps+0.082
ws+0.069
 colored+0.069
cks+0.068
Neg logit contributions
 Int-0.197
ete-0.193
 Curious-0.187
 Sadly-0.186
 Everywhere-0.185
Token
Token ID198
Feature activation-6.40e-02

Pos logit contributions
 Int+0.197
 Curious+0.172
Sadly+0.168
rolling+0.163
 Sadly+0.158
Neg logit contributions
.-0.211
els-0.206
 on-0.201
 inside-0.193
."-0.180
Token
Token ID198
Feature activation-8.79e-02

Pos logit contributions
 Int+0.107
 Curious+0.096
 Sadly+0.090
 Everything+0.090
 Everywhere+0.085
Neg logit contributions
 on-0.123
els-0.122
.-0.119
 inside-0.114
 colored-0.109
TokenOne
Token ID3198
Feature activation-3.77e-03

Pos logit contributions
 Int+0.156
 Curious+0.139
 Sadly+0.134
 Everything+0.132
 Everywhere+0.129
Neg logit contributions
els-0.169
 on-0.158
 colored-0.153
.-0.151
 inside-0.150
Token day
Token ID1110
Feature activation-2.13e-02

Pos logit contributions
 Int+0.008
 Curious+0.008
 Everything+0.008
 Everywhere+0.008
 Sadly+0.007
Neg logit contributions
els-0.006
ps-0.005
 colored-0.004
ries-0.004
 cream-0.004
Token,
Token ID11
Feature activation-7.20e-02

Pos logit contributions
 Int+0.046
 Everything+0.044
 Curious+0.043
 Everywhere+0.041
 Sadly+0.040
Neg logit contributions
els-0.035
ps-0.031
cks-0.029
bones-0.028
ws-0.028
Token they
Token ID484
Feature activation-3.63e-02

Pos logit contributions
 Int+0.112
 Curious+0.102
 Everything+0.097
Of+0.094
 Everywhere+0.093
Neg logit contributions
els-0.161
ps-0.146
cks-0.137
ws-0.136
bones-0.134
Token saw
Token ID2497
Feature activation+7.03e-03

Pos logit contributions
 Int+0.075
 Curious+0.067
 Everywhere+0.066
 Everything+0.066
 Sadly+0.066
Neg logit contributions
els-0.060
 inside-0.057
 on-0.057
.-0.053
 colored-0.053
Token a
Token ID257
Feature activation+1.43e-02

Pos logit contributions
els+0.010
ps+0.008
 on+0.008
 colored+0.008
 inside+0.008
Neg logit contributions
 Int-0.016
 Curious-0.015
 Everything-0.015
 Sadly-0.015
 Everywhere-0.015
Token big
Token ID1263
Feature activation+5.63e-02

Pos logit contributions
els+0.028
ps+0.025
ws+0.023
ries+0.022
cks+0.022
Neg logit contributions
 Int-0.026
 Curious-0.024
 Everything-0.022
 Sadly-0.022
Of-0.021
Token boat
Token ID8848
Feature activation+8.73e-02

Pos logit contributions
els+0.109
ps+0.101
ws+0.093
cks+0.088
bones+0.087
Neg logit contributions
 Int-0.091
 Curious-0.086
 Everything-0.080
ete-0.078
 Exc-0.078
Token,
Token ID11
Feature activation-4.64e-02

Pos logit contributions
 Int+0.051
 Curious+0.048
 Sadly+0.046
 Everything+0.046
 Everywhere+0.046
Neg logit contributions
els-0.011
 on-0.008
 inside-0.007
ps-0.007
.-0.007
Token "
Token ID366
Feature activation-5.73e-02

Pos logit contributions
 Int+0.072
 Sadly+0.063
 Curious+0.062
 Everywhere+0.062
 Everything+0.060
Neg logit contributions
els-0.094
 on-0.093
.-0.090
 inside-0.089
ps-0.086
TokenHi
Token ID17250
Feature activation+6.54e-03

Pos logit contributions
 Sadly+0.104
rolling+0.101
 Int+0.101
ete+0.099
 Everywhere+0.097
Neg logit contributions
 colored-0.091
 coloured-0.084
 on-0.082
els-0.079
 cream-0.079
Token,
Token ID11
Feature activation-3.72e-02

Pos logit contributions
els+0.007
ps+0.006
 colored+0.005
 on+0.005
 cream+0.005
Neg logit contributions
 Int-0.018
 Curious-0.016
 Everything-0.016
 Sadly-0.016
 Everywhere-0.016
Token Tim
Token ID5045
Feature activation-1.26e-02

Pos logit contributions
 Int+0.076
 Sadly+0.070
 Curious+0.068
rolling+0.068
 Everywhere+0.068
Neg logit contributions
 on-0.058
 inside-0.057
.-0.054
els-0.051
 thing-0.050
Token!
Token ID0
Feature activation-7.56e-02

Pos logit contributions
 Int+0.028
 Curious+0.026
 He+0.025
 Sadly+0.025
 So+0.024
Neg logit contributions
els-0.018
ps-0.015
bones-0.014
ws-0.014
 sticker-0.014
Token I
Token ID314
Feature activation-1.24e-02

Pos logit contributions
Sadly+0.132
 Int+0.129
ete+0.124
Days+0.120
So+0.118
Neg logit contributions
 on-0.118
.-0.109
 inside-0.108
."-0.103
els-0.100
Token was
Token ID373
Feature activation+9.26e-03

Pos logit contributions
 Int+0.020
 Curious+0.018
 Sadly+0.018
 Everything+0.017
 Everywhere+0.017
Neg logit contributions
els-0.027
ps-0.023
ries-0.022
 cream-0.022
 colored-0.021
Token scared
Token ID12008
Feature activation+6.70e-02

Pos logit contributions
els+0.015
ps+0.013
 colored+0.012
 on+0.012
 inside+0.012
Neg logit contributions
 Int-0.020
 Curious-0.018
 Sadly-0.017
 Everything-0.017
 Everywhere-0.017
Token too
Token ID1165
Feature activation+4.62e-02

Pos logit contributions
els+0.089
 on+0.072
ps+0.071
 inside+0.071
 colored+0.068
Neg logit contributions
 Int-0.162
 Curious-0.150
 Sadly-0.145
 Everything-0.145
 Everywhere-0.143
Token,
Token ID11
Feature activation-2.69e-02

Pos logit contributions
els+0.051
ps+0.040
 colored+0.036
ws+0.035
bones+0.033
Neg logit contributions
 Int-0.125
 Curious-0.117
 Sadly-0.114
 Everywhere-0.110
ete-0.108