TOP ACTIVATIONS
MAX = 18.852

real treasure chest filled with gold coins and jewels .
treasure chest full of shiny gold coins ! Tim , Sally
shouldn 't have opened the battery without asking me first ."
treasure is not gold or jewels ," L ila says .
treasure chest filled with gold coins and jewels .
found a chest filled with gold and jewels . From that
to be gold is not gold , but something else that
chest full of shiny gold coins ! Tim , Sally ,
try to use the broken toilet . It 's a prank
a treasure chest full of gold and jewels ! Tim my
Maybe the treasure is not gold or jewels ," L ila
chest and found lots of gold coins and shiny jewels .
. We just need some glue and a new wheel .
. You are ruining my flour . You are not welcome
. We need a new battery ," Dad says .
treasure chests , full of gold and shiny coins . The
you should never remove your wallet from your pocket ."
found a chest filled with gold and jewels . From that
. I don 't need gold ice cream . I have
the place I buried my treasure ! â

MIN ACTIVATIONS
MIN = -15.804

say sorry first ." L ily did not want
to say sorry first ." L ily did not
play with us ." Tom comes back . He
have to say sorry first ." L ily did
not bother it ." He took the pig away
not sharing before ." Her friend was happy and
Leave me alone !" Their mom heard the noise
wants to play with us ." Tom comes back
to hurt us !" A man hears Tom and
You are foolish ." Sam takes his car and
we are spies ." They crawl into the box
you are happy ." The cater pillar seemed to
unhappy and angry ." Tom and Sam looked at
for not sharing before ." Her friend was happy
is not safe ." " Look , Tom ,
wants to hurt us !" A man hears Tom
birds they are ." " Maybe they are rob
funny and cute ." They walked back to their
cake and candy ." Mom sigh s . She
Throw it away ." She took the cookie and

INTERVAL 10.188 - 18.852
CONTAINS 5.506%

loved to play with her toys and draw pictures . One
. Stay away from the road . Come back here ."
She loved playing in the sand and collecting shells . Suddenly
! The girl chose a machine that said ' R ever
Ben ! It has a note for you too . It

INTERVAL 1.524 - 10.188
CONTAINS 34.118%

closes her eyes . She sleeps and dreams of purple things
," Lily says . She fills another cup with water from
sun was already shining high , and the water was sparkling
told her that it 's important to respect other people 's
in the bushes and she found a baby fox . The

INTERVAL -7.140 - 1.524
CONTAINS 45.602%

waved to the path . Suddenly , the path said hello
learned that being bald is not bad . He learned to
outside . " Let 's go to the library
soccer in the park . They both wanted to score a
, Sara and Lily decided to have a tea party in

INTERVAL -15.804 - -7.140
CONTAINS 14.774%

you can raise your arms ." L ily took
each other and laughed . But then a big
his friend , the cow . <|endoftext|> Once upon a time
to c uddle with Lily . One day , Lily and
The little boy was naughty . He knew he had done
Token real
Token ID1103
Feature activation+9.51e+00

Pos logit contributions
els+0.078
ps+0.066
ries+0.063
 on+0.061
ws+0.058
Neg logit contributions
ete-0.074
 Sadly-0.073
 Curious-0.072
Sadly-0.072
 Int-0.071
Token treasure
Token ID14068
Feature activation+1.71e+01

Pos logit contributions
els+0.170
ries+0.143
ps+0.140
ws+0.134
cks+0.133
Neg logit contributions
ete-0.137
 Curious-0.134
 Sadly-0.130
 Int-0.128
Of-0.125
Token chest
Token ID7721
Feature activation+1.59e+01

Pos logit contributions
els+0.207
ries+0.158
ps+0.158
ws+0.157
ishing+0.131
Neg logit contributions
ete-0.309
hog-0.307
 Sadly-0.290
 Barkley-0.287
 couldn-0.286
Token filled
Token ID5901
Feature activation+8.72e+00

Pos logit contributions
els+0.181
ws+0.133
ries+0.131
ps+0.123
 pret+0.097
Neg logit contributions
ete-0.317
 couldn-0.313
hog-0.305
 Sadly-0.294
 Curious-0.286
Token with
Token ID351
Feature activation+7.28e+00

Pos logit contributions
els+0.104
ps+0.088
ier+0.082
ries+0.075
 on+0.073
Neg logit contributions
Da-0.169
ete-0.169
 Sadly-0.165
From-0.165
 Everywhere-0.164
Token gold
Token ID3869
Feature activation+1.89e+01

Pos logit contributions
els+0.103
ps+0.083
ries+0.080
ier+0.078
 on+0.077
Neg logit contributions
ete-0.122
 Curious-0.121
 Everywhere-0.121
 Int-0.119
 Sadly-0.118
Token coins
Token ID10796
Feature activation+1.85e+01

Pos logit contributions
els+0.236
ws+0.196
ps+0.187
ries+0.185
 channel+0.183
Neg logit contributions
ete-0.278
hog-0.276
 He-0.271
He-0.250
pill-0.250
Token and
Token ID290
Feature activation+2.44e+00

Pos logit contributions
els+0.204
ws+0.161
ries+0.155
ps+0.149
 channel+0.141
Neg logit contributions
ete-0.340
 couldn-0.321
hog-0.309
 Barkley-0.307
rolling-0.305
Token jewels
Token ID42701
Feature activation+1.63e+01

Pos logit contributions
els+0.030
ps+0.024
 on+0.023
 inside+0.022
.+0.022
Neg logit contributions
 Curious-0.048
ete-0.048
 Int-0.047
 Sadly-0.047
Sadly-0.047
Token.
Token ID13
Feature activation-1.08e+01

Pos logit contributions
els+0.184
ws+0.144
ries+0.127
ps+0.123
 channel+0.114
Neg logit contributions
 couldn-0.296
ete-0.295
hog-0.293
Ag-0.285
 Curious-0.280
Token
Token ID220
Feature activation-6.72e+00

Pos logit contributions
ete+0.192
rolling+0.177
Harry+0.174
David+0.173
Sadly+0.166
Neg logit contributions
.-0.176
 on-0.165
 inside-0.163
."-0.147
els-0.132
Token treasure
Token ID14068
Feature activation+1.52e+01

Pos logit contributions
els+0.048
ps+0.042
 on+0.039
ries+0.039
 inside+0.038
Neg logit contributions
ete-0.042
 Sadly-0.042
 Curious-0.042
 Int-0.041
Sadly-0.041
Token chest
Token ID7721
Feature activation+1.38e+01

Pos logit contributions
els+0.260
ps+0.221
ws+0.221
ries+0.205
cks+0.190
Neg logit contributions
ete-0.234
 Sadly-0.212
 Barkley-0.212
 Curious-0.209
hog-0.208
Token full
Token ID1336
Feature activation+6.51e+00

Pos logit contributions
els+0.243
ps+0.202
ws+0.198
ries+0.195
 spills+0.168
Neg logit contributions
ete-0.220
 Sadly-0.204
 Curious-0.201
 Int-0.195
 couldn-0.194
Token of
Token ID286
Feature activation+5.39e+00

Pos logit contributions
els+0.127
ps+0.116
ws+0.109
ries+0.104
cks+0.102
Neg logit contributions
ete-0.088
 Everywhere-0.083
 Everything-0.079
 Int-0.078
 Curious-0.077
Token shiny
Token ID22441
Feature activation+1.40e+01

Pos logit contributions
els+0.075
ps+0.066
ier+0.061
ws+0.057
 on+0.057
Neg logit contributions
 Curious-0.092
ete-0.092
 Everywhere-0.092
 Everything-0.090
 Int-0.090
Token gold
Token ID3869
Feature activation+1.88e+01

Pos logit contributions
ps+0.196
cks+0.184
ws+0.184
els+0.177
ier+0.171
Neg logit contributions
 He-0.192
 Curious-0.182
 Everywhere-0.174
 From-0.171
ete-0.167
Token coins
Token ID10796
Feature activation+1.84e+01

Pos logit contributions
els+0.248
ws+0.231
ps+0.220
cks+0.212
ck+0.197
Neg logit contributions
ete-0.280
 He-0.276
hog-0.246
 Nut-0.239
 couldn-0.236
Token!
Token ID0
Feature activation-1.00e+01

Pos logit contributions
els+0.235
ws+0.202
ps+0.195
 spills+0.175
ries+0.172
Neg logit contributions
ete-0.329
 couldn-0.312
 Phil-0.294
 Barkley-0.293
 shouldn-0.286
Token Tim
Token ID5045
Feature activation-1.03e+00

Pos logit contributions
ete+0.176
Sadly+0.168
rolling+0.166
Days+0.163
Gary+0.161
Neg logit contributions
.-0.149
 on-0.138
 inside-0.138
."-0.128
els-0.114
Token,
Token ID11
Feature activation-2.27e+00

Pos logit contributions
ete+0.023
 Int+0.022
Sadly+0.022
 Everything+0.022
 Everywhere+0.022
Neg logit contributions
els-0.010
 on-0.010
 inside-0.010
.-0.009
ps-0.008
Token Sally
Token ID25737
Feature activation+2.93e-01

Pos logit contributions
ete+0.035
 Int+0.033
Sadly+0.033
 Sadly+0.033
 Everywhere+0.032
Neg logit contributions
els-0.038
 on-0.038
 inside-0.036
.-0.036
ps-0.033
Token shouldn
Token ID6584
Feature activation-1.58e+00

Pos logit contributions
els+0.013
 on+0.012
 inside+0.012
.+0.011
ps+0.011
Neg logit contributions
ete-0.013
 Int-0.012
 Curious-0.012
 Sadly-0.012
Sadly-0.012
Token't
Token ID470
Feature activation+5.29e-01

Pos logit contributions
Of+0.024
ete+0.024
 Int+0.023
 Curious+0.023
 Sadly+0.022
Neg logit contributions
 on-0.027
 inside-0.026
els-0.026
.-0.026
ps-0.024
Token have
Token ID423
Feature activation+2.98e+00

Pos logit contributions
els+0.009
ps+0.008
 on+0.008
ries+0.007
 inside+0.007
Neg logit contributions
ete-0.008
 Sadly-0.008
Sadly-0.008
 Everything-0.008
 Int-0.008
Token opened
Token ID4721
Feature activation+1.02e+01

Pos logit contributions
els+0.054
ps+0.046
 inside+0.042
 on+0.042
le+0.042
Neg logit contributions
ete-0.042
 Int-0.041
Sadly-0.041
 Sadly-0.040
Of-0.039
Token the
Token ID262
Feature activation+3.75e+00

Pos logit contributions
els+0.160
ps+0.139
ws+0.136
ier+0.132
 sticker+0.127
Neg logit contributions
 Everywhere-0.144
ete-0.143
 Everything-0.138
 Sadly-0.136
 Int-0.135
Token battery
Token ID6555
Feature activation+1.86e+01

Pos logit contributions
els+0.065
ries+0.057
cks+0.055
ws+0.052
ps+0.052
Neg logit contributions
 Sadly-0.057
 Everything-0.056
 Everywhere-0.055
 Curious-0.055
Sadly-0.053
Token without
Token ID1231
Feature activation+4.97e+00

Pos logit contributions
els+0.209
ps+0.161
ries+0.144
ws+0.136
 frost+0.133
Neg logit contributions
ete-0.397
 Sadly-0.377
 Everywhere-0.370
 Int-0.364
 Curious-0.361
Token asking
Token ID4737
Feature activation+7.87e+00

Pos logit contributions
els+0.080
le+0.066
packs+0.066
 on+0.064
 inside+0.063
Neg logit contributions
ete-0.078
 Everything-0.071
 Int-0.070
Sadly-0.068
 Sadly-0.068
Token me
Token ID502
Feature activation+6.96e+00

Pos logit contributions
els+0.087
ps+0.065
 on+0.064
 inside+0.062
ries+0.062
Neg logit contributions
ete-0.173
 Everything-0.159
 Everywhere-0.159
 Sadly-0.157
 Int-0.156
Token first
Token ID717
Feature activation+7.36e+00

Pos logit contributions
els+0.073
ps+0.056
 on+0.055
 inside+0.053
ries+0.049
Neg logit contributions
ete-0.154
 Sadly-0.145
 Int-0.143
 Curious-0.143
 Everywhere-0.143
Token."
Token ID526
Feature activation-1.13e+01

Pos logit contributions
els+0.073
ps+0.051
ries+0.048
 on+0.044
 inside+0.043
Neg logit contributions
ete-0.169
 Sadly-0.160
 Everything-0.160
 Everywhere-0.158
 Curious-0.157
Token treasure
Token ID14068
Feature activation+1.39e+01

Pos logit contributions
ete+0.006
 Int+0.006
 Curious+0.006
 Sadly+0.006
Sadly+0.006
Neg logit contributions
els-0.006
 on-0.005
 inside-0.005
ps-0.005
.-0.005
Token is
Token ID318
Feature activation+3.38e+00

Pos logit contributions
els+0.209
 on+0.170
ps+0.169
 inside+0.166
ries+0.162
Neg logit contributions
ete-0.224
 Int-0.211
 Sadly-0.206
 Curious-0.204
 Everywhere-0.203
Token not
Token ID407
Feature activation+4.04e+00

Pos logit contributions
els+0.061
ries+0.048
ps+0.045
le+0.045
ishing+0.042
Neg logit contributions
ete-0.047
 Everywhere-0.044
 Sadly-0.044
 Barkley-0.043
 Everything-0.043
Token gold
Token ID3869
Feature activation+1.82e+01

Pos logit contributions
els+0.076
ries+0.059
ps+0.057
le+0.054
 spills+0.052
Neg logit contributions
ete-0.053
 Barkley-0.051
 Int-0.050
 Everywhere-0.050
 Curious-0.049
Token or
Token ID393
Feature activation+4.39e+00

Pos logit contributions
els+0.227
ps+0.182
bones+0.167
cks+0.166
 colored+0.159
Neg logit contributions
ete-0.327
 He-0.293
 Everywhere-0.276
 Not-0.271
So-0.266
Token jewels
Token ID42701
Feature activation+1.85e+01

Pos logit contributions
els+0.065
ps+0.054
 on+0.053
ier+0.052
 inside+0.052
Neg logit contributions
ete-0.072
 Everywhere-0.070
 Int-0.070
 Curious-0.069
 Sadly-0.068
Token,"
Token ID553
Feature activation-8.96e+00

Pos logit contributions
els+0.260
ps+0.200
bones+0.177
ries+0.176
cks+0.173
Neg logit contributions
ete-0.328
 Not-0.295
-0.284
So-0.284
 He-0.282
Token L
Token ID406
Feature activation-4.43e+00

Pos logit contributions
ete+0.151
 Int+0.148
 Sadly+0.145
 Curious+0.145
Sadly+0.144
Neg logit contributions
els-0.139
 on-0.127
 inside-0.123
ps-0.120
.-0.119
Tokenila
Token ID10102
Feature activation-9.94e-01

Pos logit contributions
ete+0.088
 Int+0.087
 Sadly+0.086
Sadly+0.086
 Curious+0.085
Neg logit contributions
els-0.055
 on-0.049
 inside-0.047
.-0.046
ps-0.045
Token says
Token ID1139
Feature activation+1.60e-01

Pos logit contributions
 Int+0.017
ete+0.017
 Curious+0.017
 Sadly+0.016
 Everywhere+0.016
Neg logit contributions
els-0.014
 on-0.013
 inside-0.012
ps-0.012
ries-0.011
Token.
Token ID13
Feature activation-9.69e+00

Pos logit contributions
els+0.001
 colored+0.001
ries+0.001
 frost+0.001
ws+0.001
Neg logit contributions
ete-0.004
 Int-0.004
 Curious-0.003
 �-0.003
 Sadly-0.003
Token treasure
Token ID14068
Feature activation+1.71e+01

Pos logit contributions
els+0.170
ries+0.143
ps+0.140
ws+0.134
cks+0.133
Neg logit contributions
ete-0.137
 Curious-0.134
 Sadly-0.130
 Int-0.128
Of-0.125
Token chest
Token ID7721
Feature activation+1.59e+01

Pos logit contributions
els+0.207
ries+0.158
ps+0.158
ws+0.157
ishing+0.131
Neg logit contributions
ete-0.309
hog-0.307
 Sadly-0.290
 Barkley-0.287
 couldn-0.286
Token filled
Token ID5901
Feature activation+8.72e+00

Pos logit contributions
els+0.181
ws+0.133
ries+0.131
ps+0.123
 pret+0.097
Neg logit contributions
ete-0.317
 couldn-0.313
hog-0.305
 Sadly-0.294
 Curious-0.286
Token with
Token ID351
Feature activation+7.28e+00

Pos logit contributions
els+0.104
ps+0.088
ier+0.082
ries+0.075
 on+0.073
Neg logit contributions
Da-0.169
ete-0.169
 Sadly-0.165
From-0.165
 Everywhere-0.164
Token gold
Token ID3869
Feature activation+1.89e+01

Pos logit contributions
els+0.103
ps+0.083
ries+0.080
ier+0.078
 on+0.077
Neg logit contributions
ete-0.122
 Curious-0.121
 Everywhere-0.121
 Int-0.119
 Sadly-0.118
Token coins
Token ID10796
Feature activation+1.85e+01

Pos logit contributions
els+0.236
ws+0.196
ps+0.187
ries+0.185
 channel+0.183
Neg logit contributions
ete-0.278
hog-0.276
 He-0.271
He-0.250
pill-0.250
Token and
Token ID290
Feature activation+2.44e+00

Pos logit contributions
els+0.204
ws+0.161
ries+0.155
ps+0.149
 channel+0.141
Neg logit contributions
ete-0.340
 couldn-0.321
hog-0.309
 Barkley-0.307
rolling-0.305
Token jewels
Token ID42701
Feature activation+1.63e+01

Pos logit contributions
els+0.030
ps+0.024
 on+0.023
 inside+0.022
.+0.022
Neg logit contributions
 Curious-0.048
ete-0.048
 Int-0.047
 Sadly-0.047
Sadly-0.047
Token.
Token ID13
Feature activation-1.08e+01

Pos logit contributions
els+0.184
ws+0.144
ries+0.127
ps+0.123
 channel+0.114
Neg logit contributions
 couldn-0.296
ete-0.295
hog-0.293
Ag-0.285
 Curious-0.280
Token
Token ID220
Feature activation-6.72e+00

Pos logit contributions
ete+0.192
rolling+0.177
Harry+0.174
David+0.173
Sadly+0.166
Neg logit contributions
.-0.176
 on-0.165
 inside-0.163
."-0.147
els-0.132
Token
Token ID198
Feature activation-9.72e+00

Pos logit contributions
ete+0.103
mber+0.096
Sadly+0.095
 Curious+0.095
 Int+0.094
Neg logit contributions
.-0.115
 on-0.111
els-0.108
 inside-0.108
."-0.099
Token found
Token ID1043
Feature activation+3.30e+00

Pos logit contributions
ete+0.033
 Int+0.033
 Sadly+0.032
 Curious+0.032
Sadly+0.032
Neg logit contributions
els-0.034
 on-0.031
 inside-0.030
.-0.029
ps-0.029
Token a
Token ID257
Feature activation+3.52e+00

Pos logit contributions
els+0.048
 on+0.041
ps+0.040
 inside+0.039
.+0.038
Neg logit contributions
ete-0.058
 Int-0.056
 Curious-0.056
 Sadly-0.056
 Everywhere-0.055
Token chest
Token ID7721
Feature activation+1.36e+01

Pos logit contributions
els+0.056
ps+0.048
 on+0.047
ries+0.045
 inside+0.045
Neg logit contributions
ete-0.060
 Sadly-0.058
 Curious-0.058
 Int-0.058
Sadly-0.058
Token filled
Token ID5901
Feature activation+7.83e+00

Pos logit contributions
els+0.201
ries+0.161
ps+0.161
ws+0.155
 frost+0.133
Neg logit contributions
ete-0.254
 Curious-0.232
 Sadly-0.232
rolling-0.231
 Int-0.225
Token with
Token ID351
Feature activation+6.87e+00

Pos logit contributions
els+0.091
ps+0.076
ier+0.068
ries+0.065
 on+0.065
Neg logit contributions
ete-0.161
 Everywhere-0.155
 Int-0.155
 Sadly-0.154
 Curious-0.154
Token gold
Token ID3869
Feature activation+1.85e+01

Pos logit contributions
els+0.099
 on+0.083
ps+0.083
 inside+0.078
ries+0.078
Neg logit contributions
ete-0.121
 Curious-0.118
 Int-0.117
 Everywhere-0.116
 Sadly-0.116
Token and
Token ID290
Feature activation+2.12e+00

Pos logit contributions
els+0.223
ws+0.191
ps+0.185
ries+0.179
cks+0.170
Neg logit contributions
ete-0.327
 He-0.289
 Curious-0.280
hog-0.279
 Barkley-0.275
Token jewels
Token ID42701
Feature activation+1.62e+01

Pos logit contributions
els+0.029
 on+0.024
ps+0.024
 inside+0.023
.+0.023
Neg logit contributions
ete-0.039
 Int-0.039
 Curious-0.039
 Sadly-0.038
Sadly-0.038
Token.
Token ID13
Feature activation-9.35e+00

Pos logit contributions
els+0.176
ws+0.134
ps+0.123
ries+0.121
 spills+0.105
Neg logit contributions
ete-0.341
 Curious-0.320
rolling-0.319
 Int-0.313
 couldn-0.312
Token From
Token ID3574
Feature activation-4.14e-02

Pos logit contributions
Da+0.109
Sadly+0.103
Ag+0.101
Days+0.100
Mos+0.099
Neg logit contributions
.-0.184
 on-0.179
 inside-0.175
els-0.164
."-0.162
Token that
Token ID326
Feature activation-3.01e+00

Pos logit contributions
ete+0.001
 Int+0.001
 Sadly+0.001
 Curious+0.001
Sadly+0.001
Neg logit contributions
els-0.001
 on-0.001
 inside-0.001
.-0.001
ps-0.001
Token to
Token ID284
Feature activation+4.42e-01

Pos logit contributions
els+0.080
ps+0.070
 frost+0.067
ries+0.065
 sticker+0.064
Neg logit contributions
ete-0.090
 Everywhere-0.089
 Int-0.085
 Everything-0.085
 Sadly-0.084
Token be
Token ID307
Feature activation+3.46e+00

Pos logit contributions
els+0.007
 on+0.007
 inside+0.006
ps+0.006
.+0.006
Neg logit contributions
ete-0.007
 Int-0.007
 Sadly-0.007
Sadly-0.007
 Curious-0.007
Token gold
Token ID3869
Feature activation+1.71e+01

Pos logit contributions
els+0.065
ries+0.056
ps+0.054
 on+0.051
 frost+0.051
Neg logit contributions
 Sadly-0.044
 Curious-0.044
Sadly-0.044
 Int-0.043
 Everywhere-0.042
Token is
Token ID318
Feature activation+5.09e+00

Pos logit contributions
els+0.188
ps+0.158
 frost+0.145
bones+0.139
ries+0.137
Neg logit contributions
ete-0.337
 Everywhere-0.311
 Barkley-0.301
 Int-0.298
 Everything-0.297
Token not
Token ID407
Feature activation+5.51e+00

Pos logit contributions
els+0.084
ries+0.072
ps+0.068
 frost+0.065
 candles+0.062
Neg logit contributions
ete-0.076
 Everywhere-0.072
 Sadly-0.071
 Not-0.069
 shouldn-0.069
Token gold
Token ID3869
Feature activation+1.84e+01

Pos logit contributions
els+0.094
ries+0.078
ps+0.072
ips+0.070
 frost+0.069
Neg logit contributions
ete-0.072
 shouldn-0.072
 Everywhere-0.071
 Not-0.069
 Joey-0.068
Token,
Token ID11
Feature activation+2.42e+00

Pos logit contributions
els+0.228
ps+0.203
cks+0.193
bones+0.189
 frost+0.186
Neg logit contributions
ete-0.318
 Everywhere-0.271
 He-0.271
 Not-0.266
 shouldn-0.263
Token but
Token ID475
Feature activation-1.80e+00

Pos logit contributions
els+0.029
 on+0.026
 inside+0.025
.+0.024
ps+0.024
Neg logit contributions
ete-0.048
 Int-0.048
 Sadly-0.047
 Curious-0.047
Sadly-0.047
Token something
Token ID1223
Feature activation+3.44e+00

Pos logit contributions
ete+0.027
 Sadly+0.026
 Int+0.026
Sadly+0.025
 Everything+0.025
Neg logit contributions
els-0.030
ps-0.027
 on-0.026
 inside-0.025
.-0.025
Token else
Token ID2073
Feature activation+6.01e+00

Pos logit contributions
els+0.057
ps+0.049
 on+0.048
ries+0.047
 inside+0.046
Neg logit contributions
ete-0.053
 Int-0.050
 Everywhere-0.050
 Sadly-0.049
 Curious-0.049
Token that
Token ID326
Feature activation+2.02e+00

Pos logit contributions
els+0.094
ps+0.080
 on+0.078
ries+0.076
 frost+0.074
Neg logit contributions
ete-0.097
 Int-0.091
 Everything-0.089
 Everywhere-0.089
 Sadly-0.089
Token chest
Token ID7721
Feature activation+1.38e+01

Pos logit contributions
els+0.260
ps+0.221
ws+0.221
ries+0.205
cks+0.190
Neg logit contributions
ete-0.234
 Sadly-0.212
 Barkley-0.212
 Curious-0.209
hog-0.208
Token full
Token ID1336
Feature activation+6.51e+00

Pos logit contributions
els+0.243
ps+0.202
ws+0.198
ries+0.195
 spills+0.168
Neg logit contributions
ete-0.220
 Sadly-0.204
 Curious-0.201
 Int-0.195
 couldn-0.194
Token of
Token ID286
Feature activation+5.39e+00

Pos logit contributions
els+0.127
ps+0.116
ws+0.109
ries+0.104
cks+0.102
Neg logit contributions
ete-0.088
 Everywhere-0.083
 Everything-0.079
 Int-0.078
 Curious-0.077
Token shiny
Token ID22441
Feature activation+1.40e+01

Pos logit contributions
els+0.075
ps+0.066
ier+0.061
ws+0.057
 on+0.057
Neg logit contributions
 Curious-0.092
ete-0.092
 Everywhere-0.092
 Everything-0.090
 Int-0.090
Token gold
Token ID3869
Feature activation+1.88e+01

Pos logit contributions
ps+0.196
cks+0.184
ws+0.184
els+0.177
ier+0.171
Neg logit contributions
 He-0.192
 Curious-0.182
 Everywhere-0.174
 From-0.171
ete-0.167
Token coins
Token ID10796
Feature activation+1.84e+01

Pos logit contributions
els+0.248
ws+0.231
ps+0.220
cks+0.212
ck+0.197
Neg logit contributions
ete-0.280
 He-0.276
hog-0.246
 Nut-0.239
 couldn-0.236
Token!
Token ID0
Feature activation-1.00e+01

Pos logit contributions
els+0.235
ws+0.202
ps+0.195
 spills+0.175
ries+0.172
Neg logit contributions
ete-0.329
 couldn-0.312
 Phil-0.294
 Barkley-0.293
 shouldn-0.286
Token Tim
Token ID5045
Feature activation-1.03e+00

Pos logit contributions
ete+0.176
Sadly+0.168
rolling+0.166
Days+0.163
Gary+0.161
Neg logit contributions
.-0.149
 on-0.138
 inside-0.138
."-0.128
els-0.114
Token,
Token ID11
Feature activation-2.27e+00

Pos logit contributions
ete+0.023
 Int+0.022
Sadly+0.022
 Everything+0.022
 Everywhere+0.022
Neg logit contributions
els-0.010
 on-0.010
 inside-0.010
.-0.009
ps-0.008
Token Sally
Token ID25737
Feature activation+2.93e-01

Pos logit contributions
ete+0.035
 Int+0.033
Sadly+0.033
 Sadly+0.033
 Everywhere+0.032
Neg logit contributions
els-0.038
 on-0.038
 inside-0.036
.-0.036
ps-0.033
Token,
Token ID11
Feature activation-1.03e+00

Pos logit contributions
els+0.004
 on+0.004
 inside+0.003
.+0.003
ps+0.003
Neg logit contributions
ete-0.006
 Int-0.005
Sadly-0.005
 Sadly-0.005
 Curious-0.005
Token try
Token ID1949
Feature activation+5.73e+00

Pos logit contributions
els+0.070
 on+0.061
ps+0.060
 inside+0.059
.+0.057
Neg logit contributions
ete-0.069
 Int-0.068
 Sadly-0.068
 Everything-0.066
 Curious-0.066
Token to
Token ID284
Feature activation+3.37e+00

Pos logit contributions
els+0.100
ps+0.085
ries+0.082
 frost+0.079
ier+0.078
Neg logit contributions
 Int-0.078
ete-0.078
 Curious-0.076
 Everything-0.076
 Everywhere-0.075
Token use
Token ID779
Feature activation+1.05e+01

Pos logit contributions
els+0.063
ries+0.055
ps+0.052
 on+0.052
ws+0.049
Neg logit contributions
ete-0.044
rolling-0.041
 Int-0.040
 Curious-0.040
Sadly-0.039
Token the
Token ID262
Feature activation+4.38e+00

Pos logit contributions
els+0.151
ier+0.141
ries+0.130
ps+0.129
ishing+0.122
Neg logit contributions
ete-0.152
 Everywhere-0.150
 Everything-0.146
 Int-0.138
 Sadly-0.137
Token broken
Token ID5445
Feature activation+1.31e+01

Pos logit contributions
els+0.070
ries+0.060
ier+0.054
packs+0.052
 on+0.051
Neg logit contributions
 Curious-0.068
 Everything-0.068
 Int-0.067
Of-0.065
 Sadly-0.065
Token toilet
Token ID16146
Feature activation+1.83e+01

Pos logit contributions
els+0.174
ries+0.148
 on+0.137
ws+0.134
ier+0.133
Neg logit contributions
 Everything-0.204
 Curious-0.201
 Int-0.200
ete-0.190
 His-0.188
Token.
Token ID13
Feature activation-6.40e+00

Pos logit contributions
els+0.152
ps+0.111
ries+0.094
 frost+0.086
 colored+0.084
Neg logit contributions
ete-0.387
 Sadly-0.375
 Int-0.374
 Everywhere-0.371
 Curious-0.363
Token It
Token ID632
Feature activation+1.72e+00

Pos logit contributions
ete+0.082
rolling+0.073
mber+0.072
Sadly+0.071
Da+0.071
Neg logit contributions
els-0.119
 on-0.115
.-0.114
."-0.111
 inside-0.111
Token's
Token ID338
Feature activation+2.45e+00

Pos logit contributions
els+0.025
 on+0.023
 inside+0.022
.+0.022
ps+0.022
Neg logit contributions
ete-0.031
 Int-0.029
 Sadly-0.029
Sadly-0.029
 Curious-0.029
Token a
Token ID257
Feature activation+3.68e+00

Pos logit contributions
els+0.048
ries+0.041
ps+0.038
ier+0.038
le+0.038
Neg logit contributions
 Sadly-0.028
Sadly-0.028
 Everywhere-0.027
 Everything-0.027
ete-0.026
Token prank
Token ID33401
Feature activation+1.41e+01

Pos logit contributions
els+0.065
ps+0.054
ries+0.053
 on+0.052
 inside+0.050
Neg logit contributions
 Sadly-0.057
ete-0.056
Sadly-0.056
 Curious-0.055
 Int-0.055
Token a
Token ID257
Feature activation+3.06e+00

Pos logit contributions
els+0.045
 on+0.044
 inside+0.043
.+0.039
ps+0.038
Neg logit contributions
 Int-0.069
ete-0.069
Sadly-0.069
 Sadly-0.068
 Curious-0.068
Token treasure
Token ID14068
Feature activation+1.51e+01

Pos logit contributions
els+0.050
 on+0.045
 inside+0.044
ps+0.043
.+0.043
Neg logit contributions
ete-0.049
 Int-0.048
 Sadly-0.047
 Curious-0.047
Sadly-0.047
Token chest
Token ID7721
Feature activation+1.38e+01

Pos logit contributions
els+0.244
ps+0.208
ws+0.201
ries+0.194
 frost+0.174
Neg logit contributions
ete-0.245
 Barkley-0.210
 Sadly-0.209
 Int-0.208
 Curious-0.208
Token full
Token ID1336
Feature activation+6.93e+00

Pos logit contributions
els+0.212
ps+0.170
ries+0.166
ws+0.165
 problems+0.141
Neg logit contributions
ete-0.240
 Sadly-0.221
 Curious-0.218
rolling-0.217
 Int-0.215
Token of
Token ID286
Feature activation+5.83e+00

Pos logit contributions
els+0.119
ps+0.104
ries+0.095
ws+0.095
 on+0.091
Neg logit contributions
ete-0.112
 Everywhere-0.103
 Int-0.101
 Curious-0.100
 Sadly-0.099
Token gold
Token ID3869
Feature activation+1.83e+01

Pos logit contributions
els+0.083
ps+0.071
 on+0.067
ier+0.063
ries+0.063
Neg logit contributions
ete-0.104
 Curious-0.100
 Int-0.099
 Everywhere-0.097
 Sadly-0.096
Token and
Token ID290
Feature activation+1.54e+00

Pos logit contributions
els+0.241
ps+0.216
ws+0.215
 channel+0.204
cks+0.192
Neg logit contributions
ete-0.298
 Barkley-0.236
 couldn-0.232
 He-0.232
 Phil-0.230
Token jewels
Token ID42701
Feature activation+1.56e+01

Pos logit contributions
els+0.020
ps+0.016
 on+0.016
.+0.015
 inside+0.015
Neg logit contributions
ete-0.030
 Curious-0.029
 Int-0.029
 Sadly-0.029
Sadly-0.028
Token!
Token ID0
Feature activation-1.09e+01

Pos logit contributions
els+0.182
ws+0.143
ps+0.126
ries+0.119
 channel+0.116
Neg logit contributions
ete-0.312
 couldn-0.297
rolling-0.290
Ag-0.283
 Curious-0.282
Token Tim
Token ID5045
Feature activation-2.92e+00

Pos logit contributions
ete+0.193
mber+0.190
Da+0.177
Days+0.176
Fre+0.175
Neg logit contributions
.-0.169
 inside-0.159
 on-0.154
."-0.141
els-0.135
Tokenmy
Token ID1820
Feature activation-5.22e-01

Pos logit contributions
ete+0.057
 Int+0.056
Sadly+0.055
 Sadly+0.055
 Curious+0.055
Neg logit contributions
els-0.038
 on-0.035
 inside-0.033
.-0.033
ps-0.031
TokenMaybe
Token ID13300
Feature activation-9.31e-01

Pos logit contributions
 Int+0.072
ete+0.072
 Curious+0.070
Sadly+0.069
 Sadly+0.068
Neg logit contributions
els-0.176
 on-0.163
 inside-0.161
ps-0.158
.-0.157
Token the
Token ID262
Feature activation-3.81e-01

Pos logit contributions
ete+0.009
 Int+0.009
Of+0.008
 Curious+0.008
From+0.008
Neg logit contributions
els-0.021
ps-0.020
ws-0.018
 channel-0.018
ier-0.017
Token treasure
Token ID14068
Feature activation+1.39e+01

Pos logit contributions
ete+0.006
 Int+0.006
 Curious+0.006
 Sadly+0.006
Sadly+0.006
Neg logit contributions
els-0.006
 on-0.005
 inside-0.005
ps-0.005
.-0.005
Token is
Token ID318
Feature activation+3.38e+00

Pos logit contributions
els+0.209
 on+0.170
ps+0.169
 inside+0.166
ries+0.162
Neg logit contributions
ete-0.224
 Int-0.211
 Sadly-0.206
 Curious-0.204
 Everywhere-0.203
Token not
Token ID407
Feature activation+4.04e+00

Pos logit contributions
els+0.061
ries+0.048
ps+0.045
le+0.045
ishing+0.042
Neg logit contributions
ete-0.047
 Everywhere-0.044
 Sadly-0.044
 Barkley-0.043
 Everything-0.043
Token gold
Token ID3869
Feature activation+1.82e+01

Pos logit contributions
els+0.076
ries+0.059
ps+0.057
le+0.054
 spills+0.052
Neg logit contributions
ete-0.053
 Barkley-0.051
 Int-0.050
 Everywhere-0.050
 Curious-0.049
Token or
Token ID393
Feature activation+4.39e+00

Pos logit contributions
els+0.227
ps+0.182
bones+0.167
cks+0.166
 colored+0.159
Neg logit contributions
ete-0.327
 He-0.293
 Everywhere-0.276
 Not-0.271
So-0.266
Token jewels
Token ID42701
Feature activation+1.85e+01

Pos logit contributions
els+0.065
ps+0.054
 on+0.053
ier+0.052
 inside+0.052
Neg logit contributions
ete-0.072
 Everywhere-0.070
 Int-0.070
 Curious-0.069
 Sadly-0.068
Token,"
Token ID553
Feature activation-8.96e+00

Pos logit contributions
els+0.260
ps+0.200
bones+0.177
ries+0.176
cks+0.173
Neg logit contributions
ete-0.328
 Not-0.295
-0.284
So-0.284
 He-0.282
Token L
Token ID406
Feature activation-4.43e+00

Pos logit contributions
ete+0.151
 Int+0.148
 Sadly+0.145
 Curious+0.145
Sadly+0.144
Neg logit contributions
els-0.139
 on-0.127
 inside-0.123
ps-0.120
.-0.119
Tokenila
Token ID10102
Feature activation-9.94e-01

Pos logit contributions
ete+0.088
 Int+0.087
 Sadly+0.086
Sadly+0.086
 Curious+0.085
Neg logit contributions
els-0.055
 on-0.049
 inside-0.047
.-0.046
ps-0.045
Token chest
Token ID7721
Feature activation+1.26e+01

Pos logit contributions
els+0.026
ps+0.021
ries+0.021
 on+0.020
ier+0.019
Neg logit contributions
 Curious-0.028
ete-0.028
 Int-0.028
 Sadly-0.028
Sadly-0.027
Token and
Token ID290
Feature activation-2.19e+00

Pos logit contributions
els+0.156
ps+0.126
ries+0.113
ws+0.112
 on+0.112
Neg logit contributions
ete-0.256
 Sadly-0.250
 Int-0.247
 Curious-0.246
Sadly-0.238
Token found
Token ID1043
Feature activation+3.40e+00

Pos logit contributions
ete+0.038
 Int+0.037
 Sadly+0.037
 Curious+0.037
Sadly+0.036
Neg logit contributions
els-0.033
 on-0.030
 inside-0.028
ps-0.028
.-0.028
Token lots
Token ID6041
Feature activation+8.97e+00

Pos logit contributions
els+0.049
 on+0.042
ps+0.041
 inside+0.039
.+0.039
Neg logit contributions
ete-0.061
 Int-0.059
 Sadly-0.059
 Curious-0.059
Sadly-0.058
Token of
Token ID286
Feature activation+6.78e+00

Pos logit contributions
els+0.176
ps+0.168
ws+0.155
cks+0.147
 on+0.142
Neg logit contributions
ete-0.107
 Everywhere-0.099
 Curious-0.098
 Int-0.097
 From-0.096
Token gold
Token ID3869
Feature activation+1.80e+01

Pos logit contributions
els+0.097
ps+0.087
 on+0.079
ier+0.078
 cutter+0.074
Neg logit contributions
 Curious-0.115
 Int-0.114
ete-0.112
 Sadly-0.109
 Everywhere-0.108
Token coins
Token ID10796
Feature activation+1.77e+01

Pos logit contributions
 channel+0.226
els+0.225
ps+0.209
ws+0.207
 volume+0.199
Neg logit contributions
ete-0.280
 He-0.250
hog-0.232
He-0.231
 Nut-0.228
Token and
Token ID290
Feature activation+2.03e+00

Pos logit contributions
els+0.177
 channel+0.144
ps+0.141
ws+0.139
 pret+0.123
Neg logit contributions
ete-0.373
 couldn-0.329
rolling-0.320
 Phil-0.315
 Barkley-0.314
Token shiny
Token ID22441
Feature activation+1.40e+01

Pos logit contributions
els+0.025
ps+0.020
 on+0.019
 inside+0.018
.+0.017
Neg logit contributions
ete-0.040
 Curious-0.039
 Int-0.039
 Sadly-0.039
 Everywhere-0.038
Token jewels
Token ID42701
Feature activation+1.63e+01

Pos logit contributions
 channel+0.185
ps+0.176
els+0.172
 falling+0.168
ws+0.165
Neg logit contributions
 He-0.182
ete-0.169
 From-0.168
 Curious-0.167
 Everywhere-0.165
Token.
Token ID13
Feature activation-1.08e+01

Pos logit contributions
els+0.177
ws+0.141
ps+0.124
 channel+0.121
ries+0.111
Neg logit contributions
ete-0.331
 couldn-0.315
hog-0.300
Ag-0.299
rolling-0.298
Token.
Token ID13
Feature activation-4.49e+00

Pos logit contributions
els+0.110
 frost+0.081
ps+0.079
ries+0.075
 candles+0.070
Neg logit contributions
 Sadly-0.195
 Int-0.188
 Everywhere-0.188
ete-0.188
Sadly-0.187
Token We
Token ID775
Feature activation+1.63e-01

Pos logit contributions
ete+0.056
Da+0.052
 Int+0.051
Sadly+0.049
mber+0.049
Neg logit contributions
els-0.088
 on-0.083
 inside-0.082
.-0.081
ps-0.078
Token just
Token ID655
Feature activation+3.38e+00

Pos logit contributions
els+0.003
 on+0.002
ps+0.002
ries+0.002
 inside+0.002
Neg logit contributions
 Sadly-0.003
ete-0.003
Sadly-0.002
 Int-0.002
 Curious-0.002
Token need
Token ID761
Feature activation+5.95e+00

Pos logit contributions
els+0.062
ries+0.054
ps+0.053
 on+0.051
ier+0.050
Neg logit contributions
 Sadly-0.046
 Everything-0.045
 Int-0.044
ete-0.044
Sadly-0.043
Token some
Token ID617
Feature activation+1.01e+01

Pos logit contributions
els+0.105
le+0.094
ps+0.093
cks+0.090
ier+0.089
Neg logit contributions
Sadly-0.078
 Sadly-0.077
 Everything-0.077
ete-0.076
 Everywhere-0.076
Token glue
Token ID22749
Feature activation+1.80e+01

Pos logit contributions
els+0.136
ps+0.128
cks+0.127
bones+0.118
le+0.111
Neg logit contributions
 Everywhere-0.168
 Sadly-0.158
 Everything-0.158
Sadly-0.158
Of-0.158
Token and
Token ID290
Feature activation+4.20e+00

Pos logit contributions
els+0.261
cks+0.217
ps+0.215
bones+0.208
ries+0.205
Neg logit contributions
ete-0.267
 Everywhere-0.254
Of-0.242
 Sadly-0.241
 couldn-0.228
Token a
Token ID257
Feature activation+4.44e+00

Pos logit contributions
els+0.047
ps+0.037
 on+0.036
le+0.036
 inside+0.035
Neg logit contributions
Sadly-0.084
 Int-0.082
 Sadly-0.082
Of-0.080
 Everything-0.080
Token new
Token ID649
Feature activation+8.54e+00

Pos logit contributions
els+0.074
ps+0.064
ries+0.061
cks+0.057
le+0.056
Neg logit contributions
 Sadly-0.076
Of-0.073
Sadly-0.073
 Curious-0.072
 Int-0.071
Token wheel
Token ID7825
Feature activation+1.42e+01

Pos logit contributions
els+0.132
ps+0.117
cks+0.106
ries+0.101
 on+0.099
Neg logit contributions
 Curious-0.147
 Int-0.146
 Sadly-0.145
Of-0.143
ete-0.138
Token.
Token ID13
Feature activation-5.26e+00

Pos logit contributions
els+0.148
ps+0.102
ries+0.101
cks+0.094
 colored+0.085
Neg logit contributions
Of-0.279
ete-0.279
 Sadly-0.276
 Everywhere-0.273
 Int-0.268
Token.
Token ID13
Feature activation-6.88e+00

Pos logit contributions
els+0.113
ries+0.080
ws+0.069
 frost+0.065
 icing+0.062
Neg logit contributions
 Sadly-0.177
Sadly-0.175
 Everything-0.173
Luckily-0.173
 Curious-0.173
Token You
Token ID921
Feature activation+1.08e-01

Pos logit contributions
ete+0.091
 Int+0.085
Sadly+0.083
 Curious+0.082
 Sadly+0.082
Neg logit contributions
els-0.134
 on-0.124
.-0.120
 inside-0.119
ps-0.118
Token are
Token ID389
Feature activation+5.47e-01

Pos logit contributions
els+0.002
 on+0.001
 inside+0.001
ps+0.001
.+0.001
Neg logit contributions
ete-0.002
 Sadly-0.002
 Int-0.002
Sadly-0.002
 Curious-0.002
Token ruining
Token ID39842
Feature activation+7.94e+00

Pos logit contributions
els+0.010
le+0.009
ries+0.009
packs+0.008
ws+0.008
Neg logit contributions
Sadly-0.006
ete-0.006
 Not-0.006
 Sadly-0.006
 Everything-0.006
Token my
Token ID616
Feature activation+4.87e+00

Pos logit contributions
els+0.117
ier+0.109
ps+0.108
 on+0.107
le+0.099
Neg logit contributions
ete-0.117
 Everything-0.112
 Everywhere-0.111
 Int-0.110
Sadly-0.110
Token flour
Token ID10601
Feature activation+1.80e+01

Pos logit contributions
els+0.072
ier+0.066
le+0.065
ps+0.062
 on+0.059
Neg logit contributions
 Sadly-0.075
Sadly-0.073
rolling-0.071
ete-0.070
 Int-0.069
Token.
Token ID13
Feature activation-5.71e+00

Pos logit contributions
els+0.189
 colored+0.141
ps+0.140
bones+0.128
ws+0.128
Neg logit contributions
ete-0.323
 He-0.307
 Sadly-0.305
 She-0.302
 Fro-0.296
Token You
Token ID921
Feature activation+1.09e+00

Pos logit contributions
ete+0.068
 Int+0.065
 Curious+0.063
 Sadly+0.062
Sadly+0.062
Neg logit contributions
els-0.118
 on-0.110
 inside-0.107
.-0.106
ps-0.104
Token are
Token ID389
Feature activation+1.08e+00

Pos logit contributions
els+0.018
 on+0.015
 inside+0.015
ps+0.015
ries+0.015
Neg logit contributions
 Sadly-0.016
ete-0.016
Sadly-0.016
 Int-0.016
 Everything-0.015
Token not
Token ID407
Feature activation+2.06e+00

Pos logit contributions
els+0.019
le+0.017
packs+0.016
ries+0.016
.+0.015
Neg logit contributions
Sadly-0.013
ete-0.013
 Not-0.013
 Sadly-0.013
 Int-0.012
Token welcome
Token ID7062
Feature activation+6.26e+00

Pos logit contributions
els+0.035
le+0.029
ries+0.026
ps+0.026
 spills+0.025
Neg logit contributions
 Not-0.027
 Sadly-0.026
Sadly-0.026
Of-0.026
ete-0.026
Token.
Token ID13
Feature activation-5.88e+00

Pos logit contributions
els+0.148
ries+0.126
ps+0.105
 frost+0.105
ws+0.097
Neg logit contributions
 Everywhere-0.212
 Everything-0.208
ete-0.202
 Not-0.200
 Int-0.193
Token We
Token ID775
Feature activation-9.37e-01

Pos logit contributions
ete+0.050
 Int+0.045
Sadly+0.044
 Sadly+0.044
 Curious+0.043
Neg logit contributions
els-0.139
 on-0.133
 inside-0.129
.-0.128
ps-0.126
Token need
Token ID761
Feature activation+3.49e+00

Pos logit contributions
ete+0.014
 Sadly+0.014
Sadly+0.014
 Int+0.013
 Curious+0.013
Neg logit contributions
els-0.016
 on-0.014
ps-0.013
ries-0.013
 inside-0.013
Token a
Token ID257
Feature activation+3.54e+00

Pos logit contributions
els+0.062
ries+0.053
ps+0.052
et+0.052
ier+0.052
Neg logit contributions
 Everywhere-0.048
 Everything-0.048
ete-0.048
Sadly-0.047
 Sadly-0.047
Token new
Token ID649
Feature activation+8.11e+00

Pos logit contributions
els+0.070
ries+0.062
ps+0.056
cks+0.052
bones+0.050
Neg logit contributions
 Sadly-0.054
Sadly-0.053
 Curious-0.052
 Int-0.051
Of-0.050
Token battery
Token ID6555
Feature activation+1.80e+01

Pos logit contributions
els+0.152
ries+0.135
cks+0.121
ps+0.120
ier+0.114
Neg logit contributions
 Sadly-0.118
 Curious-0.117
 Everywhere-0.117
 Int-0.116
Sadly-0.113
Token,"
Token ID553
Feature activation-1.03e+01

Pos logit contributions
els+0.258
ries+0.211
cks+0.189
ps+0.185
ws+0.172
Neg logit contributions
ete-0.270
 Sadly-0.266
 Everywhere-0.265
hog-0.256
Of-0.255
Token Dad
Token ID17415
Feature activation+7.02e-01

Pos logit contributions
ete+0.173
Sadly+0.170
rolling+0.169
 Sadly+0.168
 Everywhere+0.161
Neg logit contributions
.-0.160
 on-0.158
els-0.157
 inside-0.147
 paper-0.137
Token says
Token ID1139
Feature activation+1.18e+00

Pos logit contributions
els+0.011
 on+0.010
 inside+0.010
ps+0.009
ries+0.009
Neg logit contributions
 Int-0.011
 Curious-0.011
ete-0.011
 Everything-0.010
 Sadly-0.010
Token.
Token ID13
Feature activation-9.90e+00

Pos logit contributions
els+0.013
ries+0.009
ws+0.008
 colored+0.008
 frost+0.007
Neg logit contributions
 Int-0.026
 Curious-0.026
 His-0.025
ete-0.025
 Sadly-0.024
Token
Token ID198
Feature activation-1.08e+01

Pos logit contributions
ete+0.144
rolling+0.137
Sadly+0.136
 Sadly+0.130
 Int+0.126
Neg logit contributions
.-0.175
 on-0.175
 inside-0.166
els-0.159
."-0.152
Token treasure
Token ID14068
Feature activation+1.55e+01

Pos logit contributions
els+0.057
 on+0.049
ps+0.049
 inside+0.048
.+0.047
Neg logit contributions
ete-0.065
 Int-0.063
 Curious-0.062
 Sadly-0.061
 Everywhere-0.061
Token chests
Token ID34572
Feature activation+1.21e+01

Pos logit contributions
els+0.183
ps+0.156
ws+0.143
ries+0.132
 frost+0.127
Neg logit contributions
ete-0.313
 Barkley-0.283
 Int-0.282
 Everywhere-0.281
 Curious-0.278
Token,
Token ID11
Feature activation+1.14e+00

Pos logit contributions
els+0.154
ps+0.116
ries+0.114
ws+0.107
 on+0.101
Neg logit contributions
ete-0.242
 Curious-0.230
 Int-0.230
 Everywhere-0.229
 Sadly-0.225
Token full
Token ID1336
Feature activation+6.10e+00

Pos logit contributions
els+0.016
 on+0.013
ps+0.013
 inside+0.013
.+0.012
Neg logit contributions
ete-0.021
 Int-0.021
 Curious-0.021
 Sadly-0.020
 Everywhere-0.020
Token of
Token ID286
Feature activation+5.09e+00

Pos logit contributions
els+0.088
 on+0.076
ps+0.075
 inside+0.074
.+0.069
Neg logit contributions
ete-0.110
 Int-0.106
 Curious-0.104
 Sadly-0.104
 Everywhere-0.103
Token gold
Token ID3869
Feature activation+1.79e+01

Pos logit contributions
els+0.072
 on+0.061
ps+0.061
 inside+0.058
.+0.057
Neg logit contributions
ete-0.090
 Int-0.088
 Curious-0.087
 Everywhere-0.086
 Sadly-0.086
Token and
Token ID290
Feature activation+2.34e+00

Pos logit contributions
els+0.238
ps+0.216
ws+0.207
cks+0.189
ries+0.183
Neg logit contributions
ete-0.294
 Barkley-0.247
 He-0.247
 Everywhere-0.242
 Nut-0.240
Token shiny
Token ID22441
Feature activation+1.36e+01

Pos logit contributions
els+0.032
ps+0.027
 on+0.025
 inside+0.025
ries+0.024
Neg logit contributions
ete-0.043
 Curious-0.042
 Int-0.042
 Everywhere-0.041
 Sadly-0.041
Token coins
Token ID10796
Feature activation+1.65e+01

Pos logit contributions
els+0.188
ps+0.186
cks+0.176
ws+0.174
ips+0.173
Neg logit contributions
ete-0.170
 He-0.169
 Curious-0.159
 Everywhere-0.157
 From-0.148
Token.
Token ID13
Feature activation-1.05e+01

Pos logit contributions
els+0.181
ws+0.147
ps+0.135
ries+0.127
 channel+0.122
Neg logit contributions
ete-0.326
 couldn-0.310
 He-0.290
 Barkley-0.287
 shouldn-0.282
Token The
Token ID383
Feature activation-3.31e+00

Pos logit contributions
Ag+0.164
ete+0.163
Days+0.160
rolling+0.158
Da+0.154
Neg logit contributions
.-0.178
 on-0.160
."-0.156
 inside-0.151
els-0.134
Token you
Token ID345
Feature activation-8.17e-01

Pos logit contributions
ete+0.031
 Int+0.031
 Sadly+0.030
Sadly+0.030
 Curious+0.030
Neg logit contributions
els-0.038
ps-0.034
 on-0.033
 inside-0.033
.-0.032
Token should
Token ID815
Feature activation-1.66e+00

Pos logit contributions
ete+0.013
 Int+0.013
 Curious+0.013
Sadly+0.013
 Sadly+0.013
Neg logit contributions
els-0.013
 on-0.012
 inside-0.012
.-0.012
ps-0.011
Token never
Token ID1239
Feature activation+2.88e+00

Pos logit contributions
ete+0.024
 Int+0.023
 Sadly+0.023
Sadly+0.023
 Curious+0.022
Neg logit contributions
els-0.030
 on-0.027
 inside-0.026
ps-0.026
.-0.025
Token remove
Token ID4781
Feature activation+1.11e+01

Pos logit contributions
els+0.055
ps+0.049
ries+0.048
 on+0.046
ier+0.044
Neg logit contributions
ete-0.039
 Int-0.038
 Curious-0.037
 Sadly-0.037
Of-0.036
Token your
Token ID534
Feature activation+6.37e+00

Pos logit contributions
 channel+0.173
ier+0.170
els+0.168
 colored+0.167
ps+0.161
Neg logit contributions
 Everywhere-0.122
 From-0.121
 Nut-0.121
ete-0.120
 Sadly-0.112
Token wallet
Token ID13008
Feature activation+1.79e+01

Pos logit contributions
els+0.122
ier+0.113
ps+0.112
le+0.109
 on+0.108
Neg logit contributions
ete-0.070
 Curious-0.067
 Sadly-0.064
 Int-0.064
 His-0.061
Token from
Token ID422
Feature activation+4.58e+00

Pos logit contributions
els+0.266
 colored+0.226
ps+0.209
 coloured+0.202
 frost+0.200
Neg logit contributions
ete-0.295
 Sadly-0.275
Sadly-0.269
 Everywhere-0.268
 Int-0.264
Token your
Token ID534
Feature activation+5.34e+00

Pos logit contributions
els+0.068
ier+0.062
ps+0.059
le+0.058
 on+0.055
Neg logit contributions
ete-0.065
 Sadly-0.062
 Everywhere-0.061
 Barkley-0.061
 Nut-0.060
Token pocket
Token ID10000
Feature activation+1.45e+01

Pos logit contributions
els+0.096
ps+0.088
 on+0.085
ier+0.084
ries+0.082
Neg logit contributions
 Sadly-0.074
ete-0.073
 Curious-0.067
Sadly-0.066
 Int-0.066
Token."
Token ID526
Feature activation-1.15e+01

Pos logit contributions
els+0.161
ps+0.123
ries+0.110
 colored+0.108
 frost+0.102
Neg logit contributions
ete-0.317
 Sadly-0.311
Sadly-0.299
 Everywhere-0.295
 Curious-0.289
Token
Token ID198
Feature activation-1.00e+01

Pos logit contributions
ete+0.178
mber+0.154
Sadly+0.153
rolling+0.152
 Int+0.150
Neg logit contributions
.-0.206
 on-0.193
els-0.192
 inside-0.189
."-0.174
Token found
Token ID1043
Feature activation+2.86e+00

Pos logit contributions
ete+0.041
 Int+0.040
 Sadly+0.039
 Curious+0.039
Sadly+0.039
Neg logit contributions
els-0.041
 on-0.038
 inside-0.036
ps-0.035
.-0.035
Token a
Token ID257
Feature activation+3.01e+00

Pos logit contributions
els+0.042
 on+0.036
ps+0.035
 inside+0.034
.+0.033
Neg logit contributions
ete-0.051
 Int-0.049
 Curious-0.049
 Sadly-0.048
 Everywhere-0.048
Token chest
Token ID7721
Feature activation+1.30e+01

Pos logit contributions
els+0.048
ps+0.041
 on+0.041
ries+0.039
 inside+0.039
Neg logit contributions
ete-0.051
 Sadly-0.049
 Curious-0.049
 Int-0.049
Sadly-0.049
Token filled
Token ID5901
Feature activation+7.25e+00

Pos logit contributions
els+0.195
ries+0.158
ps+0.157
ws+0.151
 frost+0.130
Neg logit contributions
ete-0.240
 Curious-0.220
 Sadly-0.218
rolling-0.218
 Int-0.212
Token with
Token ID351
Feature activation+6.33e+00

Pos logit contributions
els+0.084
ps+0.070
ier+0.063
ries+0.060
 on+0.060
Neg logit contributions
ete-0.150
 Int-0.144
 Everywhere-0.144
 Curious-0.143
 Sadly-0.143
Token gold
Token ID3869
Feature activation+1.79e+01

Pos logit contributions
els+0.091
 on+0.076
ps+0.076
 inside+0.072
ries+0.071
Neg logit contributions
ete-0.112
 Curious-0.109
 Int-0.108
 Everywhere-0.108
 Sadly-0.107
Token and
Token ID290
Feature activation+1.58e+00

Pos logit contributions
els+0.215
ws+0.185
ps+0.178
ries+0.173
cks+0.164
Neg logit contributions
ete-0.315
 He-0.278
hog-0.269
 Curious-0.268
 Barkley-0.266
Token jewels
Token ID42701
Feature activation+1.56e+01

Pos logit contributions
els+0.021
 on+0.018
ps+0.018
 inside+0.017
.+0.017
Neg logit contributions
ete-0.030
 Int-0.029
 Curious-0.029
 Sadly-0.029
Sadly-0.029
Token.
Token ID13
Feature activation-9.80e+00

Pos logit contributions
els+0.169
ws+0.128
ps+0.117
ries+0.116
 spills+0.101
Neg logit contributions
ete-0.328
 Curious-0.307
rolling-0.306
 couldn-0.301
 Int-0.300
Token From
Token ID3574
Feature activation-5.54e-01

Pos logit contributions
Da+0.115
Sadly+0.110
Ag+0.107
Days+0.107
Mos+0.105
Neg logit contributions
.-0.191
 on-0.185
 inside-0.180
els-0.169
."-0.167
Token that
Token ID326
Feature activation-3.65e+00

Pos logit contributions
ete+0.010
 Int+0.010
 Sadly+0.010
 Curious+0.010
Sadly+0.010
Neg logit contributions
els-0.008
 on-0.007
 inside-0.007
.-0.007
ps-0.007
Token.
Token ID13
Feature activation-4.52e+00

Pos logit contributions
els+0.098
ps+0.085
 inside+0.073
cks+0.068
ries+0.068
Neg logit contributions
Of-0.111
ete-0.110
 Int-0.110
 Everywhere-0.109
 Curious-0.108
Token I
Token ID314
Feature activation-1.81e-01

Pos logit contributions
ete+0.046
 Int+0.045
 Sadly+0.043
 Curious+0.043
Sadly+0.043
Neg logit contributions
els-0.100
 on-0.093
 inside-0.091
ps-0.090
.-0.089
Token don
Token ID836
Feature activation+9.77e-01

Pos logit contributions
ete+0.003
 Sadly+0.003
 Curious+0.002
 Int+0.002
Sadly+0.002
Neg logit contributions
els-0.003
ps-0.003
ries-0.003
 on-0.003
 inside-0.003
Token't
Token ID470
Feature activation-6.11e-02

Pos logit contributions
els+0.022
 on+0.020
 inside+0.020
ps+0.019
.+0.019
Neg logit contributions
ete-0.010
 Int-0.009
 Sadly-0.009
 Curious-0.009
Sadly-0.009
Token need
Token ID761
Feature activation+5.72e+00

Pos logit contributions
ete+0.001
 Sadly+0.001
Sadly+0.001
Of+0.001
 Everything+0.001
Neg logit contributions
els-0.001
ries-0.001
ps-0.001
 on-0.001
 inside-0.001
Token gold
Token ID3869
Feature activation+1.79e+01

Pos logit contributions
els+0.086
ps+0.077
packs+0.072
 inside+0.072
ier+0.071
Neg logit contributions
ete-0.088
 Everywhere-0.084
 Int-0.084
 Not-0.083
Of-0.083
Token ice
Token ID4771
Feature activation+1.65e+01

Pos logit contributions
els+0.190
ps+0.177
 colored+0.170
cks+0.165
 spills+0.155
Neg logit contributions
ete-0.305
 He-0.289
Of-0.286
 Everywhere-0.267
 His-0.267
Token cream
Token ID8566
Feature activation+1.57e+01

Pos logit contributions
els+0.199
ps+0.181
 envelop+0.149
cks+0.147
ening+0.147
Neg logit contributions
ete-0.278
Of-0.270
 He-0.246
Fin-0.241
oun-0.238
Token.
Token ID13
Feature activation-4.68e+00

Pos logit contributions
els+0.201
ps+0.171
cks+0.158
 colored+0.154
 coloured+0.134
Neg logit contributions
ete-0.285
Of-0.282
 He-0.266
 Everywhere-0.256
 Not-0.254
Token I
Token ID314
Feature activation+8.10e-02

Pos logit contributions
ete+0.048
 Int+0.046
Sadly+0.045
 Sadly+0.045
 Curious+0.045
Neg logit contributions
els-0.102
 on-0.096
 inside-0.094
.-0.093
ps-0.092
Token have
Token ID423
Feature activation+3.00e+00

Pos logit contributions
els+0.001
 on+0.001
ps+0.001
 inside+0.001
ries+0.001
Neg logit contributions
ete-0.001
 Sadly-0.001
 Int-0.001
 Curious-0.001
Sadly-0.001
Token the
Token ID262
Feature activation+2.48e+00

Pos logit contributions
els+0.031
 on+0.028
 inside+0.026
.+0.026
ps+0.025
Neg logit contributions
ete-0.054
 Int-0.053
 Sadly-0.052
 Curious-0.052
Sadly-0.052
Token place
Token ID1295
Feature activation+1.03e+01

Pos logit contributions
els+0.040
 on+0.034
 inside+0.033
ps+0.033
.+0.032
Neg logit contributions
ete-0.041
 Int-0.041
 Curious-0.041
 Sadly-0.040
Sadly-0.040
Token I
Token ID314
Feature activation+2.10e+00

Pos logit contributions
els+0.109
 on+0.083
ps+0.082
 inside+0.078
ries+0.071
Neg logit contributions
ete-0.227
 Int-0.218
 Sadly-0.217
 Curious-0.215
 Everywhere-0.213
Token buried
Token ID11694
Feature activation+1.29e+01

Pos logit contributions
els+0.037
 on+0.034
 inside+0.032
ps+0.032
.+0.032
Neg logit contributions
ete-0.031
 Int-0.030
 Sadly-0.030
 Curious-0.030
Sadly-0.030
Token my
Token ID616
Feature activation+6.79e+00

Pos logit contributions
els+0.148
ps+0.115
ries+0.099
ws+0.098
 sticker+0.094
Neg logit contributions
ete-0.262
 Everywhere-0.240
 Int-0.234
 Sadly-0.230
 Everything-0.229
Token treasure
Token ID14068
Feature activation+1.78e+01

Pos logit contributions
els+0.104
ps+0.085
ries+0.084
 on+0.081
ier+0.080
Neg logit contributions
ete-0.116
 Curious-0.108
 Everywhere-0.107
 Sadly-0.107
 Int-0.105
Token!
Token ID0
Feature activation-7.98e+00

Pos logit contributions
els+0.219
ps+0.163
ries+0.152
ws+0.144
cks+0.134
Neg logit contributions
ete-0.370
 Sadly-0.339
 Int-0.339
 Curious-0.337
Sadly-0.333
Tokenâ
Token ID22940
Feature activation-1.92e+00

Pos logit contributions
ete+0.123
rolling+0.118
Days+0.117
Sadly+0.115
Da+0.109
Neg logit contributions
.-0.124
 on-0.122
."-0.118
 inside-0.110
els-0.102
Token
Token ID26391
Feature activation-5.39e+00

Pos logit contributions
ete+0.019
 Int+0.019
 Sadly+0.018
 Curious+0.018
Sadly+0.018
Neg logit contributions
els-0.043
 on-0.040
 inside-0.039
.-0.038
ps-0.038
Token
Token ID220
Feature activation-3.45e+00

Pos logit contributions
ete+0.082
Da+0.076
 Everywhere+0.075
Sadly+0.075
Of+0.075
Neg logit contributions
.-0.094
 on-0.092
els-0.086
 inside-0.085
."-0.084
Token
Token ID198
Feature activation-8.20e+00

Pos logit contributions
ete+0.046
Sadly+0.046
Da+0.044
Of+0.043
 Sadly+0.043
Neg logit contributions
.-0.061
 on-0.060
."-0.057
 inside-0.057
els-0.057
Token say
Token ID910
Feature activation-4.95e-01

Pos logit contributions
ete+0.074
 Sadly+0.070
Sadly+0.070
 Int+0.069
 Curious+0.068
Neg logit contributions
els-0.077
 on-0.067
 inside-0.065
ps-0.064
ries-0.062
Token sorry
Token ID7926
Feature activation+9.96e-01

Pos logit contributions
ete+0.008
 Int+0.007
 Curious+0.007
 Everything+0.007
 Sadly+0.007
Neg logit contributions
els-0.008
 on-0.007
 inside-0.007
ps-0.007
.-0.007
Token first
Token ID717
Feature activation+1.45e+00

Pos logit contributions
els+0.013
ps+0.010
 on+0.010
 inside+0.009
ries+0.009
Neg logit contributions
ete-0.019
 Int-0.019
 Everything-0.019
 Curious-0.019
 Sadly-0.019
Token."
Token ID526
Feature activation-1.55e+01

Pos logit contributions
els+0.014
ries+0.008
ps+0.008
cks+0.007
ws+0.007
Neg logit contributions
ete-0.033
 Everything-0.032
 Sadly-0.032
 Everywhere-0.031
 Curious-0.031
Token
Token ID198
Feature activation-1.57e+01

Pos logit contributions
ete+0.231
Sadly+0.223
rolling+0.221
 Int+0.218
Days+0.214
Neg logit contributions
.-0.276
 on-0.253
."-0.235
els-0.234
 inside-0.232
Token
Token ID198
Feature activation-1.58e+01

Pos logit contributions
ete+0.250
 Int+0.246
rolling+0.243
 Curious+0.241
 Everything+0.237
Neg logit contributions
.-0.256
 on-0.226
els-0.222
 inside-0.213
."-0.212
TokenL
Token ID43
Feature activation-9.67e+00

Pos logit contributions
ete+0.249
 Int+0.246
 Everything+0.245
 Everywhere+0.244
 Curious+0.239
Neg logit contributions
.-0.237
els-0.224
 on-0.222
 inside-0.205
."-0.201
Tokenily
Token ID813
Feature activation-6.56e+00

Pos logit contributions
ete+0.138
Sadly+0.131
 Int+0.129
 Sadly+0.127
 Curious+0.124
Neg logit contributions
els-0.173
 on-0.164
 inside-0.156
ps-0.148
.-0.147
Token did
Token ID750
Feature activation-5.59e+00

Pos logit contributions
ete+0.130
 Everywhere+0.123
Sadly+0.121
rolling+0.121
 Int+0.121
Neg logit contributions
els-0.082
.-0.081
 on-0.080
 inside-0.077
ps-0.073
Token not
Token ID407
Feature activation-5.40e+00

Pos logit contributions
ete+0.067
 Int+0.066
 Sadly+0.064
 Curious+0.064
Sadly+0.063
Neg logit contributions
els-0.115
 on-0.105
 inside-0.102
ps-0.101
.-0.098
Token want
Token ID765
Feature activation-6.75e-01

Pos logit contributions
ete+0.064
Of+0.062
 Curious+0.061
 Int+0.060
 Everything+0.060
Neg logit contributions
els-0.116
ries-0.100
 on-0.092
ps-0.091
 inside-0.089
Token to
Token ID284
Feature activation-4.63e+00

Pos logit contributions
ete+0.004
 Sadly+0.004
 Int+0.004
Sadly+0.004
 Curious+0.004
Neg logit contributions
els-0.003
 on-0.003
ps-0.003
 inside-0.003
.-0.002
Token say
Token ID910
Feature activation-4.95e-01

Pos logit contributions
ete+0.074
 Sadly+0.070
Sadly+0.070
 Int+0.069
 Curious+0.068
Neg logit contributions
els-0.077
 on-0.067
 inside-0.065
ps-0.064
ries-0.062
Token sorry
Token ID7926
Feature activation+9.96e-01

Pos logit contributions
ete+0.008
 Int+0.007
 Curious+0.007
 Everything+0.007
 Sadly+0.007
Neg logit contributions
els-0.008
 on-0.007
 inside-0.007
ps-0.007
.-0.007
Token first
Token ID717
Feature activation+1.45e+00

Pos logit contributions
els+0.013
ps+0.010
 on+0.010
 inside+0.009
ries+0.009
Neg logit contributions
ete-0.019
 Int-0.019
 Everything-0.019
 Curious-0.019
 Sadly-0.019
Token."
Token ID526
Feature activation-1.55e+01

Pos logit contributions
els+0.014
ries+0.008
ps+0.008
cks+0.007
ws+0.007
Neg logit contributions
ete-0.033
 Everything-0.032
 Sadly-0.032
 Everywhere-0.031
 Curious-0.031
Token
Token ID198
Feature activation-1.57e+01

Pos logit contributions
ete+0.231
Sadly+0.223
rolling+0.221
 Int+0.218
Days+0.214
Neg logit contributions
.-0.276
 on-0.253
."-0.235
els-0.234
 inside-0.232
Token
Token ID198
Feature activation-1.58e+01

Pos logit contributions
ete+0.250
 Int+0.246
rolling+0.243
 Curious+0.241
 Everything+0.237
Neg logit contributions
.-0.256
 on-0.226
els-0.222
 inside-0.213
."-0.212
TokenL
Token ID43
Feature activation-9.67e+00

Pos logit contributions
ete+0.249
 Int+0.246
 Everything+0.245
 Everywhere+0.244
 Curious+0.239
Neg logit contributions
.-0.237
els-0.224
 on-0.222
 inside-0.205
."-0.201
Tokenily
Token ID813
Feature activation-6.56e+00

Pos logit contributions
ete+0.138
Sadly+0.131
 Int+0.129
 Sadly+0.127
 Curious+0.124
Neg logit contributions
els-0.173
 on-0.164
 inside-0.156
ps-0.148
.-0.147
Token did
Token ID750
Feature activation-5.59e+00

Pos logit contributions
ete+0.130
 Everywhere+0.123
Sadly+0.121
rolling+0.121
 Int+0.121
Neg logit contributions
els-0.082
.-0.081
 on-0.080
 inside-0.077
ps-0.073
Token not
Token ID407
Feature activation-5.40e+00

Pos logit contributions
ete+0.067
 Int+0.066
 Sadly+0.064
 Curious+0.064
Sadly+0.063
Neg logit contributions
els-0.115
 on-0.105
 inside-0.102
ps-0.101
.-0.098
Token play
Token ID711
Feature activation+2.01e+00

Pos logit contributions
ete+0.084
 Int+0.081
 Sadly+0.080
 Curious+0.079
Sadly+0.079
Neg logit contributions
els-0.080
 on-0.073
 inside-0.070
ps-0.069
.-0.067
Token with
Token ID351
Feature activation-6.67e-01

Pos logit contributions
els+0.026
ps+0.021
ries+0.019
ier+0.018
ws+0.018
Neg logit contributions
 Int-0.038
ete-0.037
 Sadly-0.037
 Curious-0.037
Sadly-0.037
Token us
Token ID514
Feature activation+4.73e+00

Pos logit contributions
ete+0.011
 Int+0.011
 Everywhere+0.010
 Sadly+0.010
 Everything+0.010
Neg logit contributions
els-0.010
 on-0.009
ps-0.009
 inside-0.009
ier-0.008
Token."
Token ID526
Feature activation-1.54e+01

Pos logit contributions
els+0.036
ps+0.025
ries+0.021
 frost+0.020
 inside+0.017
Neg logit contributions
ete-0.112
 Everywhere-0.110
 Int-0.109
 Curious-0.108
 Sadly-0.107
Token
Token ID198
Feature activation-1.52e+01

Pos logit contributions
ete+0.236
Sadly+0.229
rolling+0.221
 Int+0.215
Days+0.215
Neg logit contributions
.-0.274
 on-0.253
els-0.238
 inside-0.232
."-0.230
Token
Token ID198
Feature activation-1.56e+01

Pos logit contributions
ete+0.232
 Int+0.224
 Curious+0.221
rolling+0.221
 Sadly+0.217
Neg logit contributions
.-0.261
 on-0.241
els-0.241
 inside-0.227
."-0.219
TokenTom
Token ID13787
Feature activation-7.21e+00

Pos logit contributions
ete+0.245
 Everything+0.239
 Int+0.238
 Everywhere+0.235
 Curious+0.234
Neg logit contributions
.-0.236
els-0.234
 on-0.224
 inside-0.207
."-0.198
Token comes
Token ID2058
Feature activation-2.76e+00

Pos logit contributions
ete+0.159
Sadly+0.153
 Sadly+0.152
 Int+0.151
 Everywhere+0.149
Neg logit contributions
els-0.078
 on-0.071
.-0.068
 inside-0.066
ps-0.064
Token back
Token ID736
Feature activation+1.63e-02

Pos logit contributions
 Int+0.053
 Curious+0.053
ete+0.052
 Everywhere+0.050
rolling+0.050
Neg logit contributions
els-0.035
ries-0.028
ps-0.027
 frost-0.027
ier-0.026
Token.
Token ID13
Feature activation-1.27e+01

Pos logit contributions
els+0.000
ps+0.000
ws+0.000
 frost+0.000
ier+0.000
Neg logit contributions
 Curious-0.000
ete-0.000
 Everywhere-0.000
 He-0.000
 Int-0.000
Token He
Token ID679
Feature activation-6.00e+00

Pos logit contributions
ete+0.095
Sadly+0.085
 Int+0.079
 Sadly+0.077
rolling+0.074
Neg logit contributions
els-0.313
 on-0.307
.-0.306
 inside-0.299
ps-0.286
Token have
Token ID423
Feature activation-2.15e-01

Pos logit contributions
ete+0.013
 Int+0.012
 Sadly+0.012
 Curious+0.012
Sadly+0.012
Neg logit contributions
els-0.013
 on-0.011
 inside-0.011
ps-0.011
.-0.011
Token to
Token ID284
Feature activation-4.63e+00

Pos logit contributions
ete+0.004
 Sadly+0.004
 Int+0.004
Sadly+0.004
 Curious+0.004
Neg logit contributions
els-0.003
 on-0.003
ps-0.003
 inside-0.003
.-0.002
Token say
Token ID910
Feature activation-4.95e-01

Pos logit contributions
ete+0.074
 Sadly+0.070
Sadly+0.070
 Int+0.069
 Curious+0.068
Neg logit contributions
els-0.077
 on-0.067
 inside-0.065
ps-0.064
ries-0.062
Token sorry
Token ID7926
Feature activation+9.96e-01

Pos logit contributions
ete+0.008
 Int+0.007
 Curious+0.007
 Everything+0.007
 Sadly+0.007
Neg logit contributions
els-0.008
 on-0.007
 inside-0.007
ps-0.007
.-0.007
Token first
Token ID717
Feature activation+1.45e+00

Pos logit contributions
els+0.013
ps+0.010
 on+0.010
 inside+0.009
ries+0.009
Neg logit contributions
ete-0.019
 Int-0.019
 Everything-0.019
 Curious-0.019
 Sadly-0.019
Token."
Token ID526
Feature activation-1.55e+01

Pos logit contributions
els+0.014
ries+0.008
ps+0.008
cks+0.007
ws+0.007
Neg logit contributions
ete-0.033
 Everything-0.032
 Sadly-0.032
 Everywhere-0.031
 Curious-0.031
Token
Token ID198
Feature activation-1.57e+01

Pos logit contributions
ete+0.231
Sadly+0.223
rolling+0.221
 Int+0.218
Days+0.214
Neg logit contributions
.-0.276
 on-0.253
."-0.235
els-0.234
 inside-0.232
Token
Token ID198
Feature activation-1.58e+01

Pos logit contributions
ete+0.250
 Int+0.246
rolling+0.243
 Curious+0.241
 Everything+0.237
Neg logit contributions
.-0.256
 on-0.226
els-0.222
 inside-0.213
."-0.212
TokenL
Token ID43
Feature activation-9.67e+00

Pos logit contributions
ete+0.249
 Int+0.246
 Everything+0.245
 Everywhere+0.244
 Curious+0.239
Neg logit contributions
.-0.237
els-0.224
 on-0.222
 inside-0.205
."-0.201
Tokenily
Token ID813
Feature activation-6.56e+00

Pos logit contributions
ete+0.138
Sadly+0.131
 Int+0.129
 Sadly+0.127
 Curious+0.124
Neg logit contributions
els-0.173
 on-0.164
 inside-0.156
ps-0.148
.-0.147
Token did
Token ID750
Feature activation-5.59e+00

Pos logit contributions
ete+0.130
 Everywhere+0.123
Sadly+0.121
rolling+0.121
 Int+0.121
Neg logit contributions
els-0.082
.-0.081
 on-0.080
 inside-0.077
ps-0.073
Token not
Token ID407
Feature activation-4.67e+00

Pos logit contributions
ete+0.120
 Int+0.120
 Curious+0.117
 Sadly+0.114
Sadly+0.113
Neg logit contributions
els-0.112
 on-0.109
.-0.109
 inside-0.105
ps-0.098
Token bother
Token ID11393
Feature activation+3.25e+00

Pos logit contributions
ete+0.063
Sadly+0.063
 Sadly+0.063
 Int+0.062
 Curious+0.062
Neg logit contributions
els-0.088
ps-0.075
 on-0.073
ries-0.072
 inside-0.071
Token it
Token ID340
Feature activation+4.55e+00

Pos logit contributions
els+0.050
ps+0.044
 sticker+0.043
 on+0.043
ier+0.043
Neg logit contributions
ete-0.048
 Everywhere-0.047
 Everything-0.047
 Int-0.046
 Barkley-0.046
Token."
Token ID526
Feature activation-1.48e+01

Pos logit contributions
els+0.047
ps+0.031
 frost+0.031
ries+0.030
 icing+0.027
Neg logit contributions
 Sadly-0.097
ete-0.096
Sadly-0.096
 Everywhere-0.095
 Int-0.094
Token
Token ID198
Feature activation-1.51e+01

Pos logit contributions
ete+0.216
Sadly+0.206
rolling+0.206
 Int+0.201
Days+0.198
Neg logit contributions
.-0.266
 on-0.243
els-0.227
 inside-0.223
."-0.221
Token
Token ID198
Feature activation-1.55e+01

Pos logit contributions
ete+0.232
rolling+0.229
 Int+0.229
 Curious+0.227
 Everything+0.219
Neg logit contributions
.-0.251
els-0.223
 on-0.220
 inside-0.208
."-0.205
TokenHe
Token ID1544
Feature activation-6.68e+00

Pos logit contributions
ete+0.222
 Everything+0.221
 Int+0.221
 Everywhere+0.219
 Curious+0.216
Neg logit contributions
.-0.243
els-0.235
 on-0.223
 inside-0.209
."-0.201
Token took
Token ID1718
Feature activation-1.41e-01

Pos logit contributions
ete+0.119
 Int+0.114
 Sadly+0.112
 Everywhere+0.112
Sadly+0.112
Neg logit contributions
els-0.098
 on-0.092
 inside-0.088
.-0.087
ps-0.085
Token the
Token ID262
Feature activation-2.32e+00

Pos logit contributions
ete+0.002
 Int+0.002
 Curious+0.002
 Everywhere+0.002
 Everything+0.002
Neg logit contributions
els-0.002
 on-0.002
.-0.002
 inside-0.002
ps-0.002
Token pig
Token ID12967
Feature activation+7.74e+00

Pos logit contributions
 Curious+0.040
 Int+0.039
ete+0.039
 Sadly+0.038
Sadly+0.037
Neg logit contributions
els-0.034
 on-0.027
ps-0.027
 inside-0.027
.-0.026
Token away
Token ID1497
Feature activation+2.41e+00

Pos logit contributions
els+0.090
 on+0.078
 inside+0.074
ps+0.072
.+0.071
Neg logit contributions
ete-0.159
 Int-0.156
 Sadly-0.155
 Curious-0.154
Sadly-0.153
Token not
Token ID407
Feature activation-2.33e+00

Pos logit contributions
ete+0.022
Sadly+0.022
 Sadly+0.021
 Int+0.020
 Everything+0.020
Neg logit contributions
els-0.027
 on-0.023
 inside-0.022
ps-0.022
le-0.021
Token sharing
Token ID7373
Feature activation+5.08e+00

Pos logit contributions
 Sadly+0.033
ete+0.033
Sadly+0.033
 Everything+0.032
 Int+0.032
Neg logit contributions
els-0.042
ps-0.033
ries-0.032
le-0.031
 on-0.031
Token before
Token ID878
Feature activation-9.53e-01

Pos logit contributions
els+0.056
ps+0.045
 inside+0.038
ries+0.037
 frost+0.037
Neg logit contributions
ete-0.094
From-0.092
Fred-0.091
Of-0.091
 Everywhere-0.091
Token."
Token ID526
Feature activation-1.52e+01

Pos logit contributions
ete+0.018
 Sadly+0.017
Sadly+0.017
 Int+0.017
 Everything+0.017
Neg logit contributions
els-0.013
ps-0.010
 inside-0.010
 on-0.010
ws-0.009
Token
Token ID198
Feature activation-1.54e+01

Pos logit contributions
ete+0.222
Sadly+0.214
 Int+0.208
rolling+0.207
Days+0.201
Neg logit contributions
.-0.275
 on-0.259
 inside-0.238
els-0.237
."-0.235
Token
Token ID198
Feature activation-1.55e+01

Pos logit contributions
ete+0.241
 Int+0.240
rolling+0.235
 Curious+0.235
 Everything+0.227
Neg logit contributions
.-0.253
 on-0.225
els-0.214
 inside-0.211
."-0.206
TokenHer
Token ID9360
Feature activation-8.33e+00

Pos logit contributions
 Int+0.238
 Everything+0.235
 Everywhere+0.235
ete+0.235
 Curious+0.231
Neg logit contributions
.-0.233
 on-0.217
els-0.210
 inside-0.199
."-0.194
Token friend
Token ID1545
Feature activation-1.49e+00

Pos logit contributions
ete+0.128
 Int+0.125
 Sadly+0.123
 Curious+0.122
Sadly+0.122
Neg logit contributions
els-0.141
 on-0.130
 inside-0.126
ps-0.123
.-0.123
Token was
Token ID373
Feature activation-4.20e+00

Pos logit contributions
ete+0.027
Sadly+0.026
 Int+0.026
 Everywhere+0.026
 Sadly+0.026
Neg logit contributions
els-0.021
 on-0.020
.-0.020
 inside-0.019
ps-0.018
Token happy
Token ID3772
Feature activation+4.80e-01

Pos logit contributions
ete+0.059
 Int+0.057
 Sadly+0.056
 Curious+0.056
Sadly+0.056
Neg logit contributions
els-0.077
 on-0.071
 inside-0.069
ps-0.068
.-0.067
Token and
Token ID290
Feature activation-5.63e+00

Pos logit contributions
els+0.006
ps+0.005
 on+0.005
 inside+0.004
ws+0.004
Neg logit contributions
ete-0.010
 Int-0.010
 Sadly-0.009
 Curious-0.009
Sadly-0.009
Token Leave
Token ID17446
Feature activation+3.40e-02

Pos logit contributions
ete+0.128
rolling+0.114
Sadly+0.112
mber+0.110
Days+0.110
Neg logit contributions
els-0.189
 on-0.180
.-0.174
 inside-0.173
ps-0.165
Token me
Token ID502
Feature activation+1.51e+00

Pos logit contributions
els+0.001
 on+0.000
ps+0.000
 inside+0.000
.+0.000
Neg logit contributions
ete-0.001
 Int-0.001
 Sadly-0.001
Sadly-0.001
 Curious-0.001
Token alone
Token ID3436
Feature activation+4.26e+00

Pos logit contributions
els+0.023
ps+0.018
ries+0.018
 inside+0.017
 on+0.017
Neg logit contributions
 Int-0.026
 Sadly-0.025
Sadly-0.025
 Curious-0.025
ete-0.025
Token!"
Token ID2474
Feature activation-1.42e+01

Pos logit contributions
els+0.048
ries+0.033
ps+0.033
 frost+0.032
ws+0.032
Neg logit contributions
 Int-0.085
 Curious-0.084
 Sadly-0.084
ete-0.084
Sadly-0.084
Token
Token ID198
Feature activation-1.50e+01

Pos logit contributions
ete+0.217
rolling+0.210
Sadly+0.200
Days+0.200
 Int+0.190
Neg logit contributions
.-0.257
 on-0.242
 inside-0.220
els-0.220
."-0.212
Token
Token ID198
Feature activation-1.54e+01

Pos logit contributions
ete+0.229
rolling+0.216
 Int+0.214
 Curious+0.210
 Sadly+0.209
Neg logit contributions
.-0.255
 on-0.236
els-0.235
 inside-0.222
."-0.212
TokenTheir
Token ID14574
Feature activation-6.37e+00

Pos logit contributions
ete+0.223
 Everywhere+0.213
 Int+0.211
 Everything+0.209
 Curious+0.205
Neg logit contributions
els-0.256
.-0.255
 on-0.245
 inside-0.230
."-0.216
Token mom
Token ID1995
Feature activation-1.17e+00

Pos logit contributions
ete+0.104
 Int+0.103
 Sadly+0.100
 Curious+0.100
Sadly+0.099
Neg logit contributions
els-0.102
 on-0.094
 inside-0.091
.-0.090
ps-0.089
Token heard
Token ID2982
Feature activation-2.41e+00

Pos logit contributions
ete+0.020
 Int+0.020
 Sadly+0.020
 Curious+0.020
Sadly+0.020
Neg logit contributions
els-0.017
 on-0.016
 inside-0.015
ps-0.015
.-0.014
Token the
Token ID262
Feature activation-1.78e+00

Pos logit contributions
ete+0.050
 Int+0.049
 Sadly+0.049
Sadly+0.049
 Curious+0.048
Neg logit contributions
els-0.028
 on-0.025
 inside-0.024
.-0.023
ps-0.023
Token noise
Token ID7838
Feature activation+6.47e+00

Pos logit contributions
ete+0.033
 Int+0.032
 Sadly+0.032
Sadly+0.031
 Everywhere+0.031
Neg logit contributions
els-0.025
 on-0.023
 inside-0.022
.-0.022
ps-0.021
Token wants
Token ID3382
Feature activation-1.04e+00

Pos logit contributions
els+0.030
 on+0.028
 inside+0.027
.+0.026
ps+0.026
Neg logit contributions
ete-0.035
 Int-0.034
 Sadly-0.033
 Curious-0.033
Sadly-0.033
Token to
Token ID284
Feature activation-5.07e+00

Pos logit contributions
ete+0.018
 Int+0.017
 Curious+0.017
 Sadly+0.017
Sadly+0.017
Neg logit contributions
els-0.016
 on-0.013
ps-0.013
 inside-0.013
.-0.012
Token play
Token ID711
Feature activation+2.01e+00

Pos logit contributions
ete+0.084
 Int+0.081
 Sadly+0.080
 Curious+0.079
Sadly+0.079
Neg logit contributions
els-0.080
 on-0.073
 inside-0.070
ps-0.069
.-0.067
Token with
Token ID351
Feature activation-6.67e-01

Pos logit contributions
els+0.026
ps+0.021
ries+0.019
ier+0.018
ws+0.018
Neg logit contributions
 Int-0.038
ete-0.037
 Sadly-0.037
 Curious-0.037
Sadly-0.037
Token us
Token ID514
Feature activation+4.73e+00

Pos logit contributions
ete+0.011
 Int+0.011
 Everywhere+0.010
 Sadly+0.010
 Everything+0.010
Neg logit contributions
els-0.010
 on-0.009
ps-0.009
 inside-0.009
ier-0.008
Token."
Token ID526
Feature activation-1.54e+01

Pos logit contributions
els+0.036
ps+0.025
ries+0.021
 frost+0.020
 inside+0.017
Neg logit contributions
ete-0.112
 Everywhere-0.110
 Int-0.109
 Curious-0.108
 Sadly-0.107
Token
Token ID198
Feature activation-1.52e+01

Pos logit contributions
ete+0.236
Sadly+0.229
rolling+0.221
 Int+0.215
Days+0.215
Neg logit contributions
.-0.274
 on-0.253
els-0.238
 inside-0.232
."-0.230
Token
Token ID198
Feature activation-1.56e+01

Pos logit contributions
ete+0.232
 Int+0.224
 Curious+0.221
rolling+0.221
 Sadly+0.217
Neg logit contributions
.-0.261
 on-0.241
els-0.241
 inside-0.227
."-0.219
TokenTom
Token ID13787
Feature activation-7.21e+00

Pos logit contributions
ete+0.245
 Everything+0.239
 Int+0.238
 Everywhere+0.235
 Curious+0.234
Neg logit contributions
.-0.236
els-0.234
 on-0.224
 inside-0.207
."-0.198
Token comes
Token ID2058
Feature activation-2.76e+00

Pos logit contributions
ete+0.159
Sadly+0.153
 Sadly+0.152
 Int+0.151
 Everywhere+0.149
Neg logit contributions
els-0.078
 on-0.071
.-0.068
 inside-0.066
ps-0.064
Token back
Token ID736
Feature activation+1.63e-02

Pos logit contributions
 Int+0.053
 Curious+0.053
ete+0.052
 Everywhere+0.050
rolling+0.050
Neg logit contributions
els-0.035
ries-0.028
ps-0.027
 frost-0.027
ier-0.026
Token to
Token ID284
Feature activation-5.07e+00

Pos logit contributions
ete+0.018
 Int+0.017
 Curious+0.017
 Sadly+0.017
Sadly+0.017
Neg logit contributions
els-0.016
 on-0.013
ps-0.013
 inside-0.013
.-0.012
Token hurt
Token ID5938
Feature activation+3.42e+00

Pos logit contributions
ete+0.084
 Int+0.081
 Sadly+0.080
 Curious+0.079
Sadly+0.079
Neg logit contributions
els-0.080
 on-0.073
 inside-0.070
ps-0.069
.-0.067
Token us
Token ID514
Feature activation+4.03e+00

Pos logit contributions
els+0.052
 on+0.046
 inside+0.046
ps+0.045
ier+0.043
Neg logit contributions
ete-0.054
 Int-0.053
 Everywhere-0.052
Sadly-0.051
 Sadly-0.051
Token!"
Token ID2474
Feature activation-1.45e+01

Pos logit contributions
els+0.032
ps+0.021
 frost+0.019
ries+0.018
 colored+0.017
Neg logit contributions
ete-0.094
 Sadly-0.092
 Everywhere-0.091
 Int-0.090
 Curious-0.090
Token
Token ID198
Feature activation-1.53e+01

Pos logit contributions
ete+0.220
rolling+0.212
Sadly+0.211
 Int+0.203
 Curious+0.198
Neg logit contributions
.-0.253
 on-0.241
els-0.227
 inside-0.220
."-0.210
Token
Token ID198
Feature activation-1.54e+01

Pos logit contributions
ete+0.235
rolling+0.228
 Int+0.228
 Curious+0.225
 Sadly+0.220
Neg logit contributions
.-0.257
 on-0.235
els-0.230
 inside-0.220
."-0.214
TokenA
Token ID32
Feature activation-5.51e+00

Pos logit contributions
ete+0.228
 Int+0.223
 Everything+0.222
 Everywhere+0.220
 Curious+0.218
Neg logit contributions
.-0.242
els-0.237
 on-0.232
 inside-0.216
."-0.206
Token man
Token ID582
Feature activation+1.49e+00

Pos logit contributions
ete+0.088
 Int+0.086
 Curious+0.084
 Sadly+0.084
Sadly+0.084
Neg logit contributions
els-0.091
 on-0.082
 inside-0.079
ps-0.079
.-0.077
Token hears
Token ID23180
Feature activation-2.63e+00

Pos logit contributions
els+0.024
ps+0.022
 on+0.021
ier+0.021
 inside+0.021
Neg logit contributions
 Int-0.023
 Curious-0.022
 Sadly-0.021
ete-0.021
 Everything-0.020
Token Tom
Token ID4186
Feature activation+1.01e+00

Pos logit contributions
ete+0.054
 Int+0.053
 Sadly+0.053
Sadly+0.053
 Curious+0.052
Neg logit contributions
els-0.031
 on-0.028
.-0.027
 inside-0.027
ps-0.026
Token and
Token ID290
Feature activation-6.19e+00

Pos logit contributions
els+0.012
 on+0.010
 inside+0.009
ps+0.009
ries+0.009
Neg logit contributions
 Int-0.020
ete-0.020
 Curious-0.020
 Sadly-0.020
Sadly-0.019
Token You
Token ID921
Feature activation-4.93e+00

Pos logit contributions
ete+0.031
 Int+0.030
 Sadly+0.030
Sadly+0.030
 Curious+0.030
Neg logit contributions
els-0.019
 on-0.018
 inside-0.017
.-0.017
ps-0.016
Token are
Token ID389
Feature activation-4.16e+00

Pos logit contributions
ete+0.082
 Int+0.080
Sadly+0.079
 Sadly+0.079
 Curious+0.078
Neg logit contributions
els-0.075
 on-0.074
.-0.071
 inside-0.070
ps-0.066
Token foolish
Token ID19538
Feature activation+1.58e+00

Pos logit contributions
ete+0.061
 Everything+0.057
Sadly+0.056
 Int+0.056
 Sadly+0.056
Neg logit contributions
els-0.072
ries-0.059
 inside-0.058
 on-0.058
ps-0.057
Token."
Token ID526
Feature activation-1.38e+01

Pos logit contributions
els+0.015
ps+0.011
 on+0.011
 inside+0.010
ries+0.010
Neg logit contributions
ete-0.035
 Int-0.034
 Curious-0.034
 Sadly-0.033
 Everywhere-0.033
Token
Token ID198
Feature activation-1.48e+01

Pos logit contributions
ete+0.193
Sadly+0.186
 Int+0.178
rolling+0.176
Days+0.175
Neg logit contributions
.-0.257
 on-0.245
 inside-0.228
els-0.228
."-0.224
Token
Token ID198
Feature activation-1.54e+01

Pos logit contributions
ete+0.236
 Int+0.229
 Curious+0.226
rolling+0.226
ishly+0.223
Neg logit contributions
.-0.242
 on-0.220
els-0.213
 inside-0.204
."-0.202
TokenSam
Token ID16305
Feature activation-6.28e+00

Pos logit contributions
ete+0.241
 Everything+0.238
 Int+0.235
 Everywhere+0.233
 Curious+0.231
Neg logit contributions
.-0.230
els-0.222
 on-0.221
 inside-0.199
."-0.195
Token takes
Token ID2753
Feature activation-3.74e-01

Pos logit contributions
ete+0.113
 Int+0.109
Sadly+0.108
 Sadly+0.107
 Everywhere+0.106
Neg logit contributions
els-0.090
 on-0.085
.-0.083
 inside-0.080
ps-0.078
Token his
Token ID465
Feature activation-1.09e+00

Pos logit contributions
ete+0.005
 Int+0.005
 Curious+0.005
 Everything+0.005
 Everywhere+0.005
Neg logit contributions
els-0.007
 inside-0.006
ps-0.006
 on-0.006
ier-0.006
Token car
Token ID1097
Feature activation+8.85e+00

Pos logit contributions
ete+0.019
 Curious+0.018
 Int+0.018
 Sadly+0.017
 Everywhere+0.017
Neg logit contributions
els-0.016
ier-0.013
 inside-0.013
 on-0.013
ps-0.013
Token and
Token ID290
Feature activation-5.36e+00

Pos logit contributions
els+0.108
ps+0.085
ries+0.082
 frost+0.078
ws+0.075
Neg logit contributions
 Sadly-0.170
 Int-0.170
 Everywhere-0.168
ete-0.168
Sadly-0.165
Token we
Token ID356
Feature activation-4.33e+00

Pos logit contributions
ete+0.144
Sadly+0.134
rolling+0.131
 Sadly+0.131
 Int+0.127
Neg logit contributions
.-0.180
 on-0.176
els-0.173
 inside-0.169
."-0.153
Token are
Token ID389
Feature activation-1.72e+00

Pos logit contributions
ete+0.072
 Int+0.070
 Sadly+0.069
 Curious+0.069
Sadly+0.069
Neg logit contributions
els-0.068
 on-0.061
 inside-0.059
ps-0.058
.-0.057
Token spies
Token ID28801
Feature activation+9.77e+00

Pos logit contributions
ete+0.026
 Sadly+0.024
Sadly+0.024
 Int+0.023
 Everything+0.023
Neg logit contributions
els-0.030
ps-0.024
 on-0.023
ries-0.023
 inside-0.023
Token."
Token ID526
Feature activation-1.37e+01

Pos logit contributions
els+0.102
ps+0.073
ries+0.066
 on+0.063
 inside+0.059
Neg logit contributions
ete-0.211
 Sadly-0.201
 Int-0.199
Of-0.197
 Everywhere-0.197
Token
Token ID198
Feature activation-1.49e+01

Pos logit contributions
ete+0.203
Sadly+0.196
 Int+0.191
rolling+0.190
 Curious+0.183
Neg logit contributions
.-0.244
 on-0.231
els-0.219
 inside-0.214
."-0.210
Token
Token ID198
Feature activation-1.54e+01

Pos logit contributions
ete+0.231
 Int+0.224
 Curious+0.222
rolling+0.218
 Everything+0.215
Neg logit contributions
.-0.251
 on-0.230
els-0.230
 inside-0.216
."-0.212
TokenThey
Token ID2990
Feature activation-6.04e+00

Pos logit contributions
ete+0.242
 Everything+0.239
 Int+0.238
 Everywhere+0.236
 Curious+0.234
Neg logit contributions
els-0.231
.-0.230
 on-0.222
 inside-0.203
."-0.196
Token crawl
Token ID27318
Feature activation+3.66e+00

Pos logit contributions
ete+0.107
 Int+0.105
 Sadly+0.103
Sadly+0.102
 Curious+0.102
Neg logit contributions
els-0.088
 on-0.081
 inside-0.078
.-0.077
ps-0.076
Token into
Token ID656
Feature activation-9.46e-01

Pos logit contributions
els+0.047
ries+0.034
ps+0.033
 frost+0.033
cks+0.032
Neg logit contributions
 Everywhere-0.072
 Sadly-0.070
 Curious-0.070
ete-0.070
 Int-0.069
Token the
Token ID262
Feature activation-1.78e+00

Pos logit contributions
ete+0.015
 Curious+0.015
 Int+0.015
 Everywhere+0.015
 Sadly+0.015
Neg logit contributions
els-0.014
ier-0.012
ps-0.012
 on-0.012
 inside-0.011
Token box
Token ID3091
Feature activation+7.48e+00

Pos logit contributions
 Sadly+0.030
 Int+0.030
ete+0.030
 Curious+0.030
Sadly+0.030
Neg logit contributions
els-0.026
 on-0.022
ps-0.022
 inside-0.021
.-0.021
Token you
Token ID345
Feature activation-1.30e+00

Pos logit contributions
els+0.016
 on+0.013
ps+0.013
 inside+0.013
ries+0.012
Neg logit contributions
ete-0.024
 Int-0.024
 Sadly-0.023
 Curious-0.023
Sadly-0.023
Token are
Token ID389
Feature activation-5.02e-01

Pos logit contributions
ete+0.021
 Int+0.021
 Sadly+0.020
 Curious+0.020
Sadly+0.020
Neg logit contributions
els-0.021
 on-0.019
 inside-0.018
ps-0.018
.-0.017
Token happy
Token ID3772
Feature activation+2.49e+00

Pos logit contributions
ete+0.007
 Sadly+0.007
Sadly+0.006
 Curious+0.006
 Int+0.006
Neg logit contributions
els-0.009
ries-0.007
ps-0.007
 on-0.007
 inside-0.007
Token."
Token ID526
Feature activation-1.43e+01

Pos logit contributions
els+0.021
ps+0.014
 on+0.012
 inside+0.012
ries+0.011
Neg logit contributions
ete-0.058
 Int-0.058
 Curious-0.057
 Sadly-0.057
 Everywhere-0.056
Token
Token ID198
Feature activation-1.49e+01

Pos logit contributions
ete+0.202
Sadly+0.197
 Int+0.190
rolling+0.183
 Curious+0.182
Neg logit contributions
.-0.265
 on-0.253
els-0.242
 inside-0.236
."-0.229
Token
Token ID198
Feature activation-1.54e+01

Pos logit contributions
 Int+0.228
ete+0.228
 Curious+0.226
rolling+0.221
 Everything+0.216
Neg logit contributions
.-0.252
 on-0.229
els-0.224
 inside-0.215
."-0.209
TokenThe
Token ID464
Feature activation-7.05e+00

Pos logit contributions
 Int+0.236
 Everything+0.236
 Curious+0.233
ete+0.232
 Everywhere+0.229
Neg logit contributions
.-0.237
els-0.226
 on-0.222
 inside-0.201
."-0.199
Token cater
Token ID20825
Feature activation-7.95e-01

Pos logit contributions
ete+0.111
 Int+0.108
 Curious+0.106
 Sadly+0.106
Sadly+0.106
Neg logit contributions
els-0.116
 on-0.105
 inside-0.102
ps-0.100
.-0.098
Tokenpillar
Token ID41643
Feature activation+1.60e-01

Pos logit contributions
ete+0.006
 Int+0.006
 Sadly+0.005
 Curious+0.005
Sadly+0.005
Neg logit contributions
els-0.020
 on-0.019
 inside-0.018
.-0.018
ps-0.018
Token seemed
Token ID3947
Feature activation-6.86e-01

Pos logit contributions
els+0.003
 on+0.003
 inside+0.002
.+0.002
ps+0.002
Neg logit contributions
ete-0.002
 Int-0.002
 Sadly-0.002
 Curious-0.002
Sadly-0.002
Token to
Token ID284
Feature activation-4.31e+00

Pos logit contributions
ete+0.013
 Int+0.013
Sadly+0.013
 Sadly+0.013
 Curious+0.013
Neg logit contributions
 on-0.009
els-0.008
 inside-0.008
.-0.008
ps-0.007
Token unhappy
Token ID19283
Feature activation+4.71e+00

Pos logit contributions
els+0.018
 on+0.015
 inside+0.015
ps+0.015
ries+0.015
Neg logit contributions
 Int-0.013
ete-0.013
 Curious-0.013
 Sadly-0.013
Sadly-0.013
Token and
Token ID290
Feature activation-4.24e+00

Pos logit contributions
els+0.039
ps+0.025
 inside+0.024
ries+0.023
 on+0.023
Neg logit contributions
 Int-0.106
 Everything-0.106
ete-0.106
 Curious-0.105
Sadly-0.105
Token angry
Token ID7954
Feature activation+1.65e+00

Pos logit contributions
ete+0.065
 Int+0.065
 Sadly+0.064
 Curious+0.064
Sadly+0.063
Neg logit contributions
els-0.070
 on-0.062
 inside-0.061
ps-0.060
.-0.058
Token."
Token ID526
Feature activation-1.46e+01

Pos logit contributions
els+0.011
 inside+0.007
 on+0.007
ps+0.006
ries+0.006
Neg logit contributions
 Int-0.040
ete-0.040
 Curious-0.040
 Everything-0.040
 Sadly-0.039
Token
Token ID198
Feature activation-1.51e+01

Pos logit contributions
ete+0.214
Sadly+0.206
 Int+0.197
rolling+0.196
Days+0.193
Neg logit contributions
.-0.263
 on-0.249
els-0.233
 inside-0.231
."-0.227
Token
Token ID198
Feature activation-1.54e+01

Pos logit contributions
ete+0.234
 Int+0.227
 Curious+0.224
rolling+0.222
 Everything+0.218
Neg logit contributions
.-0.252
 on-0.227
els-0.223
 inside-0.216
."-0.210
TokenTom
Token ID13787
Feature activation-6.61e+00

Pos logit contributions
ete+0.241
 Everywhere+0.238
 Int+0.236
 Everything+0.235
 Curious+0.231
Neg logit contributions
.-0.229
els-0.216
 on-0.214
 inside-0.200
."-0.193
Token and
Token ID290
Feature activation-9.35e+00

Pos logit contributions
ete+0.137
Sadly+0.132
 Int+0.131
 Everywhere+0.131
 Sadly+0.131
Neg logit contributions
els-0.078
 on-0.074
.-0.071
ps-0.069
 inside-0.068
Token Sam
Token ID3409
Feature activation-4.54e+00

Pos logit contributions
ete+0.166
 Int+0.159
Sadly+0.158
 Sadly+0.157
 Curious+0.154
Neg logit contributions
els-0.136
 on-0.130
 inside-0.124
.-0.124
ps-0.119
Token looked
Token ID3114
Feature activation-1.89e+00

Pos logit contributions
ete+0.074
 Int+0.071
Sadly+0.071
 Sadly+0.070
 Everywhere+0.070
Neg logit contributions
els-0.071
 on-0.069
.-0.067
 inside-0.066
ps-0.064
Token at
Token ID379
Feature activation-3.41e+00

Pos logit contributions
 Int+0.039
ete+0.039
 Curious+0.039
 Sadly+0.038
 Everything+0.037
Neg logit contributions
els-0.022
ps-0.017
 on-0.016
ries-0.016
 sticker-0.015
Token for
Token ID329
Feature activation-1.54e+00

Pos logit contributions
ete+0.003
 Int+0.003
 Sadly+0.003
 Curious+0.003
Sadly+0.003
Neg logit contributions
els-0.001
 on-0.001
 inside-0.001
ps-0.001
.-0.001
Token not
Token ID407
Feature activation-2.33e+00

Pos logit contributions
ete+0.022
Sadly+0.022
 Sadly+0.021
 Int+0.020
 Everything+0.020
Neg logit contributions
els-0.027
 on-0.023
 inside-0.022
ps-0.022
le-0.021
Token sharing
Token ID7373
Feature activation+5.08e+00

Pos logit contributions
 Sadly+0.033
ete+0.033
Sadly+0.033
 Everything+0.032
 Int+0.032
Neg logit contributions
els-0.042
ps-0.033
ries-0.032
le-0.031
 on-0.031
Token before
Token ID878
Feature activation-9.53e-01

Pos logit contributions
els+0.056
ps+0.045
 inside+0.038
ries+0.037
 frost+0.037
Neg logit contributions
ete-0.094
From-0.092
Fred-0.091
Of-0.091
 Everywhere-0.091
Token."
Token ID526
Feature activation-1.52e+01

Pos logit contributions
ete+0.018
 Sadly+0.017
Sadly+0.017
 Int+0.017
 Everything+0.017
Neg logit contributions
els-0.013
ps-0.010
 inside-0.010
 on-0.010
ws-0.009
Token
Token ID198
Feature activation-1.54e+01

Pos logit contributions
ete+0.222
Sadly+0.214
 Int+0.208
rolling+0.207
Days+0.201
Neg logit contributions
.-0.275
 on-0.259
 inside-0.238
els-0.237
."-0.235
Token
Token ID198
Feature activation-1.55e+01

Pos logit contributions
ete+0.241
 Int+0.240
rolling+0.235
 Curious+0.235
 Everything+0.227
Neg logit contributions
.-0.253
 on-0.225
els-0.214
 inside-0.211
."-0.206
TokenHer
Token ID9360
Feature activation-8.33e+00

Pos logit contributions
 Int+0.238
 Everything+0.235
 Everywhere+0.235
ete+0.235
 Curious+0.231
Neg logit contributions
.-0.233
 on-0.217
els-0.210
 inside-0.199
."-0.194
Token friend
Token ID1545
Feature activation-1.49e+00

Pos logit contributions
ete+0.128
 Int+0.125
 Sadly+0.123
 Curious+0.122
Sadly+0.122
Neg logit contributions
els-0.141
 on-0.130
 inside-0.126
ps-0.123
.-0.123
Token was
Token ID373
Feature activation-4.20e+00

Pos logit contributions
ete+0.027
Sadly+0.026
 Int+0.026
 Everywhere+0.026
 Sadly+0.026
Neg logit contributions
els-0.021
 on-0.020
.-0.020
 inside-0.019
ps-0.018
Token happy
Token ID3772
Feature activation+4.80e-01

Pos logit contributions
ete+0.059
 Int+0.057
 Sadly+0.056
 Curious+0.056
Sadly+0.056
Neg logit contributions
els-0.077
 on-0.071
 inside-0.069
ps-0.068
.-0.067
Token is
Token ID318
Feature activation-3.30e+00

Pos logit contributions
 Int+0.020
 Curious+0.019
 Sadly+0.019
Sadly+0.019
ete+0.019
Neg logit contributions
els-0.016
 on-0.014
 inside-0.014
ps-0.013
.-0.013
Token not
Token ID407
Feature activation-2.93e+00

Pos logit contributions
ete+0.050
 Int+0.049
 Sadly+0.048
 Curious+0.048
Sadly+0.047
Neg logit contributions
els-0.056
ps-0.045
ries-0.044
 on-0.044
 inside-0.044
Token safe
Token ID3338
Feature activation+6.48e+00

Pos logit contributions
 Curious+0.039
 Int+0.039
ete+0.039
 Everything+0.038
Of+0.038
Neg logit contributions
els-0.054
ries-0.043
ps-0.041
le-0.040
 on-0.038
Token."
Token ID526
Feature activation-1.44e+01

Pos logit contributions
els+0.070
ps+0.046
 frost+0.043
ries+0.042
ws+0.041
Neg logit contributions
ete-0.134
 Sadly-0.133
 Everywhere-0.130
 Everything-0.128
 Curious-0.127
Token
Token ID198
Feature activation-1.50e+01

Pos logit contributions
ete+0.220
Sadly+0.210
rolling+0.208
Days+0.203
 Int+0.202
Neg logit contributions
.-0.257
 on-0.236
 inside-0.215
."-0.214
els-0.210
Token
Token ID198
Feature activation-1.54e+01

Pos logit contributions
ete+0.239
rolling+0.234
 Int+0.233
 Curious+0.230
ishly+0.224
Neg logit contributions
.-0.247
 on-0.220
els-0.210
 inside-0.206
."-0.202
Token"
Token ID1
Feature activation-1.21e+01

Pos logit contributions
ete+0.235
 Int+0.233
 Everything+0.231
 Everywhere+0.229
 Curious+0.228
Neg logit contributions
.-0.230
 on-0.216
els-0.215
 inside-0.200
."-0.192
TokenLook
Token ID8567
Feature activation-3.77e+00

Pos logit contributions
ete+0.161
 Sadly+0.144
rolling+0.143
 Int+0.139
 Everywhere+0.139
Neg logit contributions
els-0.225
 on-0.223
.-0.208
 inside-0.208
ps-0.205
Token,
Token ID11
Feature activation-4.94e+00

Pos logit contributions
ete+0.083
 Int+0.078
 Curious+0.075
 Everything+0.074
 Sadly+0.074
Neg logit contributions
els-0.039
ps-0.033
 on-0.029
 colored-0.028
 inside-0.027
Token Tom
Token ID4186
Feature activation-2.62e+00

Pos logit contributions
ete+0.083
 Int+0.081
Sadly+0.081
 Sadly+0.080
 Curious+0.079
Neg logit contributions
els-0.075
 on-0.072
 inside-0.069
.-0.067
ps-0.065
Token,
Token ID11
Feature activation-3.13e+00

Pos logit contributions
ete+0.055
 Int+0.053
 Sadly+0.053
 Curious+0.053
 Everything+0.050
Neg logit contributions
els-0.030
ps-0.023
 on-0.021
 inside-0.021
ries-0.021
Token wants
Token ID3382
Feature activation-1.04e+00

Pos logit contributions
els+0.025
 on+0.024
.+0.023
 inside+0.023
ps+0.022
Neg logit contributions
ete-0.026
 Int-0.025
 Sadly-0.025
 Curious-0.025
Sadly-0.025
Token to
Token ID284
Feature activation-5.07e+00

Pos logit contributions
ete+0.018
 Int+0.017
 Curious+0.017
 Sadly+0.017
Sadly+0.017
Neg logit contributions
els-0.016
 on-0.013
ps-0.013
 inside-0.013
.-0.012
Token hurt
Token ID5938
Feature activation+3.42e+00

Pos logit contributions
ete+0.084
 Int+0.081
 Sadly+0.080
 Curious+0.079
Sadly+0.079
Neg logit contributions
els-0.080
 on-0.073
 inside-0.070
ps-0.069
.-0.067
Token us
Token ID514
Feature activation+4.03e+00

Pos logit contributions
els+0.052
 on+0.046
 inside+0.046
ps+0.045
ier+0.043
Neg logit contributions
ete-0.054
 Int-0.053
 Everywhere-0.052
Sadly-0.051
 Sadly-0.051
Token!"
Token ID2474
Feature activation-1.45e+01

Pos logit contributions
els+0.032
ps+0.021
 frost+0.019
ries+0.018
 colored+0.017
Neg logit contributions
ete-0.094
 Sadly-0.092
 Everywhere-0.091
 Int-0.090
 Curious-0.090
Token
Token ID198
Feature activation-1.53e+01

Pos logit contributions
ete+0.220
rolling+0.212
Sadly+0.211
 Int+0.203
 Curious+0.198
Neg logit contributions
.-0.253
 on-0.241
els-0.227
 inside-0.220
."-0.210
Token
Token ID198
Feature activation-1.54e+01

Pos logit contributions
ete+0.235
rolling+0.228
 Int+0.228
 Curious+0.225
 Sadly+0.220
Neg logit contributions
.-0.257
 on-0.235
els-0.230
 inside-0.220
."-0.214
TokenA
Token ID32
Feature activation-5.51e+00

Pos logit contributions
ete+0.228
 Int+0.223
 Everything+0.222
 Everywhere+0.220
 Curious+0.218
Neg logit contributions
.-0.242
els-0.237
 on-0.232
 inside-0.216
."-0.206
Token man
Token ID582
Feature activation+1.49e+00

Pos logit contributions
ete+0.088
 Int+0.086
 Curious+0.084
 Sadly+0.084
Sadly+0.084
Neg logit contributions
els-0.091
 on-0.082
 inside-0.079
ps-0.079
.-0.077
Token hears
Token ID23180
Feature activation-2.63e+00

Pos logit contributions
els+0.024
ps+0.022
 on+0.021
ier+0.021
 inside+0.021
Neg logit contributions
 Int-0.023
 Curious-0.022
 Sadly-0.021
ete-0.021
 Everything-0.020
Token Tom
Token ID4186
Feature activation+1.01e+00

Pos logit contributions
ete+0.054
 Int+0.053
 Sadly+0.053
Sadly+0.053
 Curious+0.052
Neg logit contributions
els-0.031
 on-0.028
.-0.027
 inside-0.027
ps-0.026
Token birds
Token ID10087
Feature activation+6.12e+00

Pos logit contributions
ete+0.043
 Everywhere+0.042
 Everything+0.042
 Sadly+0.042
 Int+0.042
Neg logit contributions
els-0.036
 on-0.031
 inside-0.030
ps-0.030
ier-0.028
Token they
Token ID484
Feature activation-3.26e+00

Pos logit contributions
els+0.059
 on+0.044
ps+0.044
 inside+0.043
ries+0.038
Neg logit contributions
ete-0.135
 Int-0.133
 Sadly-0.132
 Curious-0.131
Sadly-0.130
Token are
Token ID389
Feature activation-7.08e-01

Pos logit contributions
ete+0.047
 Int+0.046
 Sadly+0.045
 Curious+0.045
Sadly+0.045
Neg logit contributions
els-0.058
 on-0.053
 inside-0.051
ps-0.050
.-0.050
Token."
Token ID526
Feature activation-1.48e+01

Pos logit contributions
ete+0.010
 Sadly+0.010
Sadly+0.010
 Int+0.010
 Curious+0.010
Neg logit contributions
els-0.012
 on-0.010
 inside-0.010
ps-0.010
ries-0.010
Token
Token ID198
Feature activation-1.49e+01

Pos logit contributions
ete+0.208
Sadly+0.207
 Int+0.199
rolling+0.194
 Curious+0.191
Neg logit contributions
.-0.269
 on-0.254
els-0.246
 inside-0.235
."-0.229
Token
Token ID198
Feature activation-1.53e+01

Pos logit contributions
ete+0.228
 Int+0.227
 Curious+0.224
rolling+0.220
 Sadly+0.219
Neg logit contributions
.-0.249
els-0.226
 on-0.222
 inside-0.209
."-0.206
Token"
Token ID1
Feature activation-1.21e+01

Pos logit contributions
 Everything+0.235
ete+0.233
 Int+0.233
 Everywhere+0.230
 Curious+0.229
Neg logit contributions
.-0.231
els-0.229
 on-0.213
 inside-0.198
."-0.193
TokenMaybe
Token ID13300
Feature activation-3.63e+00

Pos logit contributions
ete+0.155
 Sadly+0.144
rolling+0.131
 Int+0.130
 Everything+0.128
Neg logit contributions
els-0.231
 on-0.230
.-0.217
ps-0.210
 inside-0.210
Token they
Token ID484
Feature activation-9.03e-01

Pos logit contributions
 Int+0.047
 Curious+0.046
ete+0.046
Of+0.045
From+0.043
Neg logit contributions
els-0.065
ps-0.063
ws-0.057
 channel-0.056
ier-0.054
Token are
Token ID389
Feature activation-4.73e-01

Pos logit contributions
ete+0.015
 Int+0.014
 Sadly+0.014
 Curious+0.014
Sadly+0.014
Neg logit contributions
els-0.014
 on-0.013
ps-0.012
 inside-0.012
.-0.012
Token rob
Token ID3857
Feature activation+7.89e+00

Pos logit contributions
 Barkley+0.005
 Curious+0.005
 Sadly+0.005
 Int+0.005
Sadly+0.005
Neg logit contributions
els-0.009
ws-0.007
le-0.007
ries-0.007
ishing-0.007
Token funny
Token ID8258
Feature activation+4.20e+00

Pos logit contributions
ete+0.055
 Sadly+0.053
Sadly+0.052
 Int+0.052
 Curious+0.051
Neg logit contributions
els-0.060
 on-0.050
ps-0.050
 inside-0.049
ries-0.047
Token and
Token ID290
Feature activation-4.03e+00

Pos logit contributions
els+0.040
ps+0.026
 on+0.023
ries+0.023
 inside+0.023
Neg logit contributions
 Int-0.089
ete-0.089
 Curious-0.088
 Everything-0.086
 Sadly-0.086
Token cute
Token ID13779
Feature activation+4.86e+00

Pos logit contributions
ete+0.061
 Int+0.061
 Curious+0.060
 Sadly+0.060
Sadly+0.059
Neg logit contributions
els-0.068
 on-0.060
 inside-0.058
ps-0.057
.-0.056
Token."
Token ID526
Feature activation-1.47e+01

Pos logit contributions
els+0.047
ps+0.025
ries+0.025
ws+0.024
cks+0.023
Neg logit contributions
 Curious-0.105
 Everything-0.104
 Int-0.104
ete-0.102
 Sadly-0.101
Token
Token ID198
Feature activation-1.51e+01

Pos logit contributions
ete+0.223
Sadly+0.216
rolling+0.207
 Int+0.206
Days+0.204
Neg logit contributions
.-0.263
 on-0.246
 inside-0.227
els-0.226
."-0.225
Token
Token ID198
Feature activation-1.53e+01

Pos logit contributions
ete+0.239
 Int+0.232
rolling+0.230
 Curious+0.230
ishly+0.223
Neg logit contributions
.-0.249
 on-0.226
els-0.221
 inside-0.212
."-0.208
TokenThey
Token ID2990
Feature activation-6.35e+00

Pos logit contributions
ete+0.245
 Everything+0.241
 Int+0.240
 Everywhere+0.240
 Curious+0.236
Neg logit contributions
.-0.224
els-0.218
 on-0.212
 inside-0.196
."-0.188
Token walked
Token ID6807
Feature activation-2.02e-02

Pos logit contributions
ete+0.110
 Int+0.109
 Sadly+0.106
Sadly+0.106
 Curious+0.106
Neg logit contributions
els-0.094
 on-0.088
 inside-0.084
.-0.083
ps-0.081
Token back
Token ID736
Feature activation-1.12e+00

Pos logit contributions
ete+0.000
 Curious+0.000
 Int+0.000
 Sadly+0.000
 Everywhere+0.000
Neg logit contributions
els-0.000
ps-0.000
ries-0.000
 on-0.000
 frost-0.000
Token to
Token ID284
Feature activation-5.14e+00

Pos logit contributions
 Everywhere+0.015
 Sadly+0.015
 Int+0.015
 Curious+0.015
 Everything+0.014
Neg logit contributions
els-0.020
ps-0.018
ier-0.018
 frost-0.017
ries-0.016
Token their
Token ID511
Feature activation-2.91e+00

Pos logit contributions
ete+0.099
 Int+0.097
 Sadly+0.096
 Curious+0.095
 Everywhere+0.094
Neg logit contributions
els-0.068
 on-0.059
 inside-0.056
ps-0.056
.-0.053
Token cake
Token ID12187
Feature activation+1.12e+01

Pos logit contributions
ete+0.028
 Int+0.027
 Sadly+0.027
 Curious+0.027
Sadly+0.027
Neg logit contributions
els-0.024
 on-0.021
 inside-0.021
ps-0.021
.-0.020
Token and
Token ID290
Feature activation-2.44e+00

Pos logit contributions
els+0.134
ps+0.127
ws+0.097
cks+0.095
ening+0.095
Neg logit contributions
ete-0.205
Of-0.191
mber-0.187
So-0.185
 Everywhere-0.178
Token candy
Token ID18550
Feature activation+9.94e+00

Pos logit contributions
ete+0.044
 Int+0.043
 Sadly+0.043
Sadly+0.042
 Curious+0.042
Neg logit contributions
els-0.032
ps-0.027
 on-0.027
 inside-0.027
ier-0.024
Token."
Token ID526
Feature activation-1.48e+01

Pos logit contributions
els+0.111
ps+0.089
cks+0.070
ws+0.063
ries+0.062
Neg logit contributions
ete-0.200
Of-0.190
mber-0.185
So-0.183
 Sadly-0.182
Token
Token ID198
Feature activation-1.50e+01

Pos logit contributions
ete+0.227
Sadly+0.223
rolling+0.212
 Int+0.210
Days+0.209
Neg logit contributions
.-0.260
 on-0.242
els-0.226
 inside-0.220
."-0.219
Token
Token ID198
Feature activation-1.53e+01

Pos logit contributions
ete+0.228
 Int+0.222
 Curious+0.220
rolling+0.219
 Sadly+0.214
Neg logit contributions
.-0.252
els-0.238
 on-0.233
 inside-0.217
."-0.212
TokenMom
Token ID29252
Feature activation-3.97e+00

Pos logit contributions
ete+0.241
 Everything+0.235
 Int+0.234
 Everywhere+0.233
 Curious+0.231
Neg logit contributions
els-0.234
.-0.228
 on-0.218
 inside-0.201
."-0.191
Token sigh
Token ID19680
Feature activation+4.40e-01

Pos logit contributions
ete+0.074
 Int+0.074
 Curious+0.072
 Sadly+0.071
Sadly+0.070
Neg logit contributions
els-0.052
 on-0.046
 inside-0.045
ps-0.044
.-0.042
Tokens
Token ID82
Feature activation-1.42e+00

Pos logit contributions
els+0.006
ps+0.005
ries+0.005
 sticker+0.005
ws+0.005
Neg logit contributions
 Curious-0.008
 Int-0.008
 Sadly-0.008
 Everything-0.007
 From-0.007
Token.
Token ID13
Feature activation-1.19e+01

Pos logit contributions
 Curious+0.030
 Int+0.030
 Sadly+0.030
ete+0.029
Sadly+0.029
Neg logit contributions
els-0.016
ries-0.012
ws-0.010
ps-0.010
 inside-0.010
Token She
Token ID1375
Feature activation-5.15e+00

Pos logit contributions
ete+0.115
Sadly+0.106
 Int+0.093
 Sadly+0.092
rolling+0.092
Neg logit contributions
.-0.268
els-0.267
 on-0.263
 inside-0.258
."-0.234
Token Throw
Token ID22481
Feature activation+3.52e+00

Pos logit contributions
ete+0.022
 Sadly+0.021
 Int+0.021
Sadly+0.021
 Curious+0.021
Neg logit contributions
els-0.011
 on-0.011
 inside-0.010
.-0.010
ps-0.009
Token it
Token ID340
Feature activation+5.53e+00

Pos logit contributions
els+0.047
ps+0.040
 inside+0.039
 on+0.038
.+0.036
Neg logit contributions
ete-0.064
 Int-0.063
 Everything-0.061
Sadly-0.061
 Curious-0.061
Token away
Token ID1497
Feature activation+3.28e+00

Pos logit contributions
els+0.070
ries+0.051
ps+0.049
 frost+0.046
 inside+0.043
Neg logit contributions
ete-0.105
 Int-0.104
Sadly-0.104
 Everywhere-0.103
 Sadly-0.103
Token."
Token ID526
Feature activation-1.46e+01

Pos logit contributions
els+0.036
ps+0.024
ries+0.023
 on+0.022
 frost+0.022
Neg logit contributions
ete-0.069
 Sadly-0.068
 Everywhere-0.068
Sadly-0.067
 Int-0.066
Token
Token ID198
Feature activation-1.50e+01

Pos logit contributions
Sadly+0.216
ete+0.216
rolling+0.206
Days+0.202
 Int+0.202
Neg logit contributions
.-0.260
 on-0.244
els-0.226
."-0.220
 inside-0.219
Token
Token ID198
Feature activation-1.53e+01

Pos logit contributions
ete+0.230
rolling+0.221
 Int+0.221
 Curious+0.220
 Sadly+0.217
Neg logit contributions
.-0.253
els-0.234
 on-0.230
 inside-0.212
."-0.212
TokenShe
Token ID3347
Feature activation-7.31e+00

Pos logit contributions
ete+0.231
 Everything+0.226
 Everywhere+0.224
 Int+0.223
 Sadly+0.222
Neg logit contributions
els-0.237
.-0.237
 on-0.221
 inside-0.203
."-0.200
Token took
Token ID1718
Feature activation-4.77e-01

Pos logit contributions
ete+0.137
 Int+0.132
 Sadly+0.131
Sadly+0.130
 Curious+0.129
Neg logit contributions
els-0.100
 on-0.093
 inside-0.088
.-0.086
ps-0.085
Token the
Token ID262
Feature activation-2.59e+00

Pos logit contributions
 Int+0.008
ete+0.008
 Everything+0.008
 Everywhere+0.008
 Curious+0.008
Neg logit contributions
els-0.007
ier-0.006
ps-0.006
 on-0.006
 inside-0.006
Token cookie
Token ID19751
Feature activation+1.07e+01

Pos logit contributions
 Curious+0.046
ete+0.045
 Int+0.045
 Sadly+0.044
Sadly+0.043
Neg logit contributions
els-0.037
ps-0.030
 on-0.030
 inside-0.030
.-0.029
Token and
Token ID290
Feature activation-4.97e+00

Pos logit contributions
els+0.124
ps+0.104
ries+0.091
 colored+0.087
ws+0.087
Neg logit contributions
ete-0.212
 Sadly-0.204
Da-0.202
Sadly-0.201
 Int-0.201
Token loved
Token ID6151
Feature activation-4.48e-01

Pos logit contributions
ete+0.092
 Int+0.083
 Sadly+0.080
 Curious+0.079
 Everywhere+0.079
Neg logit contributions
 on-0.121
els-0.117
 inside-0.117
.-0.116
ps-0.106
Token to
Token ID284
Feature activation-2.36e+00

Pos logit contributions
ete+0.008
 Int+0.008
 Sadly+0.008
 Curious+0.008
Sadly+0.008
Neg logit contributions
els-0.006
 on-0.006
 inside-0.005
ps-0.005
.-0.005
Token play
Token ID711
Feature activation+4.22e+00

Pos logit contributions
ete+0.040
 Int+0.039
 Curious+0.038
 Sadly+0.038
Sadly+0.038
Neg logit contributions
els-0.037
 on-0.033
 inside-0.032
ps-0.032
.-0.031
Token with
Token ID351
Feature activation+2.35e+00

Pos logit contributions
els+0.045
ps+0.036
 on+0.031
ier+0.031
ries+0.030
Neg logit contributions
 Int-0.090
 Curious-0.088
ete-0.088
Sadly-0.088
 Sadly-0.088
Token her
Token ID607
Feature activation+8.74e-01

Pos logit contributions
ier+0.035
els+0.034
ps+0.034
 inside+0.032
ws+0.031
Neg logit contributions
Da-0.034
 Int-0.034
 Everywhere-0.033
 Curious-0.033
Sadly-0.033
Token toys
Token ID14958
Feature activation+1.08e+01

Pos logit contributions
els+0.009
ps+0.008
ws+0.008
ier+0.008
le+0.008
Neg logit contributions
 Curious-0.017
 Int-0.017
Of-0.017
 Sadly-0.017
 Everywhere-0.017
Token and
Token ID290
Feature activation-2.36e-01

Pos logit contributions
els+0.185
ps+0.177
cks+0.175
ws+0.169
bones+0.165
Neg logit contributions
 Everywhere-0.145
So-0.142
 Sadly-0.138
 She-0.135
 couldn-0.129
Token draw
Token ID3197
Feature activation+9.57e+00

Pos logit contributions
 Int+0.004
 Curious+0.004
ete+0.004
 Sadly+0.004
Sadly+0.004
Neg logit contributions
els-0.004
ps-0.003
 on-0.003
 inside-0.003
.-0.003
Token pictures
Token ID5986
Feature activation+1.08e+01

Pos logit contributions
els+0.102
ps+0.085
ws+0.081
 harm+0.072
 inside+0.071
Neg logit contributions
 Int-0.194
 Curious-0.191
 Everywhere-0.188
 Everything-0.187
Da-0.184
Token.
Token ID13
Feature activation-1.16e+01

Pos logit contributions
ps+0.132
els+0.127
ws+0.126
 spills+0.116
ishing+0.112
Neg logit contributions
 She-0.207
 Everywhere-0.196
 All-0.194
 Everyone-0.193
 He-0.190
Token One
Token ID1881
Feature activation-3.05e+00

Pos logit contributions
ete+0.114
Sadly+0.104
 Int+0.103
 Sadly+0.100
 Curious+0.099
Neg logit contributions
els-0.262
 on-0.251
.-0.247
 inside-0.245
ps-0.231
Token.
Token ID13
Feature activation-6.32e+00

Pos logit contributions
els+0.118
ries+0.079
 colored+0.069
ps+0.067
 frost+0.061
Neg logit contributions
ete-0.260
Sadly-0.260
 Sadly-0.260
 Not-0.253
 Everything-0.251
Token Stay
Token ID16160
Feature activation+7.67e+00

Pos logit contributions
ete+0.077
 Int+0.070
Sadly+0.069
 Curious+0.068
 Sadly+0.068
Neg logit contributions
els-0.129
 on-0.122
.-0.118
 inside-0.118
ps-0.115
Token away
Token ID1497
Feature activation+5.59e+00

Pos logit contributions
els+0.116
ries+0.095
ier+0.089
 frost+0.086
 icing+0.086
Neg logit contributions
 Sadly-0.127
 Curious-0.125
ete-0.124
Sadly-0.124
 Int-0.124
Token from
Token ID422
Feature activation+2.26e+00

Pos logit contributions
els+0.066
ps+0.049
 on+0.047
 frost+0.047
ries+0.046
Neg logit contributions
ete-0.113
 Sadly-0.111
Sadly-0.109
 Everywhere-0.109
 Everything-0.108
Token the
Token ID262
Feature activation+1.88e+00

Pos logit contributions
els+0.035
ier+0.030
le+0.029
ps+0.029
 sticker+0.028
Neg logit contributions
 Everything-0.032
Sadly-0.031
ete-0.031
 Sadly-0.031
 Int-0.031
Token road
Token ID2975
Feature activation+1.15e+01

Pos logit contributions
els+0.034
 on+0.028
ps+0.028
 inside+0.028
ries+0.027
Neg logit contributions
 Sadly-0.028
Sadly-0.027
 Int-0.027
 Curious-0.027
ete-0.026
Token.
Token ID13
Feature activation-7.17e+00

Pos logit contributions
els+0.124
ries+0.090
ps+0.077
 frost+0.077
 colored+0.072
Neg logit contributions
 Sadly-0.249
Sadly-0.241
 Everywhere-0.240
ete-0.236
From-0.229
Token Come
Token ID7911
Feature activation+1.87e+00

Pos logit contributions
ete+0.094
Da+0.082
 Int+0.082
mber+0.082
Sadly+0.082
Neg logit contributions
els-0.136
 on-0.132
.-0.127
 inside-0.125
ps-0.122
Token back
Token ID736
Feature activation+3.84e+00

Pos logit contributions
els+0.017
ps+0.011
ries+0.010
 colored+0.010
ier+0.009
Neg logit contributions
 Sadly-0.044
 Curious-0.043
 Int-0.043
Of-0.043
ete-0.043
Token here
Token ID994
Feature activation+6.83e+00

Pos logit contributions
els+0.061
ries+0.047
ps+0.046
 frost+0.045
 candles+0.042
Neg logit contributions
 Sadly-0.067
 Curious-0.065
 Int-0.064
Of-0.064
 Everywhere-0.063
Token."
Token ID526
Feature activation-1.20e+01

Pos logit contributions
els+0.080
ries+0.055
ps+0.050
 colored+0.045
bones+0.043
Neg logit contributions
 Sadly-0.148
ete-0.147
 Everywhere-0.144
Sadly-0.141
 Int-0.140
Token She
Token ID1375
Feature activation-4.41e+00

Pos logit contributions
ete+0.208
Sadly+0.196
 Int+0.188
 Barkley+0.181
 Sadly+0.181
Neg logit contributions
 on-0.172
.-0.172
 inside-0.172
els-0.169
."-0.145
Token loved
Token ID6151
Feature activation+2.16e-01

Pos logit contributions
ete+0.087
 Everywhere+0.086
Sadly+0.083
 Everything+0.082
 Int+0.082
Neg logit contributions
 inside-0.057
 on-0.057
.-0.057
 colored-0.050
els-0.044
Token playing
Token ID2712
Feature activation+6.04e+00

Pos logit contributions
els+0.003
 on+0.002
 inside+0.002
.+0.002
ps+0.002
Neg logit contributions
ete-0.005
Sadly-0.004
 Sadly-0.004
 Int-0.004
 Curious-0.004
Token in
Token ID287
Feature activation+2.19e+00

Pos logit contributions
els+0.069
ps+0.055
ier+0.048
ries+0.048
ws+0.046
Neg logit contributions
 Int-0.125
ete-0.123
 Curious-0.122
Da-0.117
Sadly-0.117
Token the
Token ID262
Feature activation+1.19e+00

Pos logit contributions
els+0.027
 on+0.024
ps+0.023
 inside+0.023
.+0.021
Neg logit contributions
ete-0.042
 Int-0.042
 Curious-0.041
 Sadly-0.041
 Everywhere-0.041
Token sand
Token ID6450
Feature activation+1.19e+01

Pos logit contributions
els+0.010
 on+0.010
 inside+0.009
.+0.009
ps+0.007
Neg logit contributions
ete-0.028
 Int-0.028
 Sadly-0.027
Sadly-0.027
 Curious-0.027
Token and
Token ID290
Feature activation-8.43e-01

Pos logit contributions
els+0.164
ps+0.142
ws+0.128
 envelop+0.124
 sticker+0.123
Neg logit contributions
ete-0.196
Lots-0.180
 Fro-0.180
 Sadly-0.179
 Everywhere-0.178
Token collecting
Token ID13157
Feature activation+9.27e+00

Pos logit contributions
ete+0.016
 Int+0.016
 Sadly+0.015
 Curious+0.015
Sadly+0.015
Neg logit contributions
els-0.011
 on-0.010
 inside-0.010
ps-0.009
.-0.009
Token shells
Token ID19679
Feature activation+1.44e+01

Pos logit contributions
els+0.133
ps+0.117
ier+0.113
 channel+0.106
 harm+0.105
Neg logit contributions
ete-0.151
 Int-0.147
 Curious-0.143
 Everywhere-0.136
 Fro-0.136
Token.
Token ID13
Feature activation-1.11e+01

Pos logit contributions
els+0.161
ps+0.154
 channel+0.138
ws+0.136
 harm+0.127
Neg logit contributions
ete-0.259
 Fro-0.239
 She-0.237
Da-0.232
 Everywhere-0.232
Token Suddenly
Token ID24975
Feature activation+1.60e-01

Pos logit contributions
ete+0.149
Sadly+0.136
Da+0.123
 Int+0.123
mber+0.122
Neg logit contributions
.-0.215
 inside-0.208
els-0.206
 on-0.206
."-0.185
Token!
Token ID0
Feature activation-1.10e+01

Pos logit contributions
els+0.063
ps+0.052
ries+0.047
 frost+0.046
ws+0.045
Neg logit contributions
ete-0.110
 Everywhere-0.104
 Curious-0.103
 Int-0.103
 Sadly-0.102
Token The
Token ID383
Feature activation-3.74e+00

Pos logit contributions
ete+0.168
Sadly+0.168
Days+0.163
rolling+0.154
Quick+0.153
Neg logit contributions
.-0.176
 on-0.161
 inside-0.156
."-0.145
els-0.137
Token girl
Token ID2576
Feature activation-2.46e-01

Pos logit contributions
 Curious+0.061
 Sadly+0.061
Sadly+0.061
 Int+0.060
 Everything+0.060
Neg logit contributions
els-0.060
 on-0.049
ps-0.049
ries-0.048
ws-0.048
Token chose
Token ID7690
Feature activation+2.36e+00

Pos logit contributions
ete+0.004
 Int+0.004
Sadly+0.004
 Everywhere+0.004
 Sadly+0.004
Neg logit contributions
els-0.004
 on-0.004
 inside-0.004
.-0.004
ps-0.003
Token a
Token ID257
Feature activation+1.71e+00

Pos logit contributions
els+0.030
 on+0.028
 inside+0.027
.+0.026
ps+0.025
Neg logit contributions
ete-0.047
 Int-0.046
Sadly-0.046
 Sadly-0.045
 Curious-0.045
Token machine
Token ID4572
Feature activation+1.06e+01

Pos logit contributions
els+0.029
ps+0.025
 on+0.024
ries+0.024
 inside+0.023
Neg logit contributions
 Int-0.026
 Curious-0.026
 Sadly-0.026
ete-0.026
Sadly-0.025
Token that
Token ID326
Feature activation-1.30e+00

Pos logit contributions
els+0.132
ps+0.107
 on+0.105
 inside+0.100
ries+0.096
Neg logit contributions
ete-0.210
 Int-0.203
 Sadly-0.201
 Curious-0.200
 Everywhere-0.199
Token said
Token ID531
Feature activation+8.14e-01

Pos logit contributions
ete+0.021
 Int+0.020
Sadly+0.020
 Curious+0.020
 Sadly+0.020
Neg logit contributions
els-0.021
 on-0.020
 inside-0.019
.-0.019
ps-0.018
Token '
Token ID705
Feature activation-5.08e+00

Pos logit contributions
els+0.012
 on+0.012
 inside+0.011
.+0.011
ps+0.011
Neg logit contributions
ete-0.014
 Int-0.014
 Sadly-0.014
Sadly-0.013
 Curious-0.013
TokenR
Token ID49
Feature activation-2.94e-01

Pos logit contributions
 Int+0.052
ete+0.052
 Curious+0.051
Of+0.049
 Sadly+0.049
Neg logit contributions
els-0.109
ps-0.101
 inside-0.100
 on-0.099
.-0.097
Tokenever
Token ID964
Feature activation+8.62e+00

Pos logit contributions
ete+0.005
 Int+0.004
 Sadly+0.004
Sadly+0.004
 Curious+0.004
Neg logit contributions
els-0.005
 on-0.005
 inside-0.005
ps-0.004
.-0.004
Token Ben
Token ID3932
Feature activation+2.47e+00

Pos logit contributions
Sadly+0.025
 Sadly+0.025
 Int+0.024
ete+0.023
 Curious+0.023
Neg logit contributions
 on-0.019
els-0.018
.-0.018
 inside-0.018
."-0.016
Token!
Token ID0
Feature activation-6.26e+00

Pos logit contributions
els+0.023
ps+0.019
ries+0.015
 on+0.015
 inside+0.014
Neg logit contributions
ete-0.056
 Sadly-0.054
 Curious-0.053
 Int-0.053
Sadly-0.053
Token It
Token ID632
Feature activation+2.04e+00

Pos logit contributions
ete+0.073
Sadly+0.070
 Sadly+0.066
 Int+0.064
Da+0.062
Neg logit contributions
els-0.126
 on-0.124
.-0.120
 inside-0.118
."-0.113
Token has
Token ID468
Feature activation+3.15e+00

Pos logit contributions
els+0.031
 on+0.027
 inside+0.027
ps+0.027
.+0.026
Neg logit contributions
 Int-0.034
 Sadly-0.033
 Curious-0.033
ete-0.032
Sadly-0.032
Token a
Token ID257
Feature activation+3.90e+00

Pos logit contributions
els+0.050
 on+0.042
 inside+0.042
ps+0.040
ries+0.040
Neg logit contributions
 Int-0.051
Sadly-0.050
 Everywhere-0.050
 Curious-0.050
 Sadly-0.049
Token note
Token ID3465
Feature activation+1.42e+01

Pos logit contributions
els+0.066
ps+0.056
ries+0.056
ws+0.052
 inside+0.051
Neg logit contributions
Sadly-0.061
 Sadly-0.061
 Curious-0.060
 Int-0.059
Of-0.057
Token for
Token ID329
Feature activation+3.42e+00

Pos logit contributions
els+0.195
ps+0.154
ries+0.148
ws+0.138
cks+0.123
Neg logit contributions
ete-0.240
mber-0.239
Of-0.234
 �-0.230
 Everything-0.229
Token you
Token ID345
Feature activation+4.92e+00

Pos logit contributions
els+0.060
 inside+0.053
 on+0.052
ps+0.051
ier+0.050
Neg logit contributions
ete-0.049
 Everything-0.044
 Int-0.044
 Everywhere-0.043
Sadly-0.043
Token too
Token ID1165
Feature activation+4.54e+00

Pos logit contributions
els+0.056
ps+0.045
ries+0.041
 on+0.039
 inside+0.039
Neg logit contributions
ete-0.103
 Sadly-0.098
 Everywhere-0.097
 Int-0.097
 Curious-0.097
Token.
Token ID13
Feature activation-6.70e+00

Pos logit contributions
els+0.057
ps+0.045
ries+0.043
 inside+0.037
le+0.037
Neg logit contributions
ete-0.090
 Sadly-0.090
Sadly-0.088
Lots-0.086
 Curious-0.086
Token It
Token ID632
Feature activation+1.79e+00

Pos logit contributions
ete+0.075
Sadly+0.071
 Int+0.070
 Sadly+0.069
 Curious+0.066
Neg logit contributions
els-0.140
 on-0.133
 inside-0.129
.-0.128
ps-0.124
Token closes
Token ID20612
Feature activation+4.94e+00

Pos logit contributions
 Int+0.072
 Curious+0.072
ete+0.072
 Sadly+0.071
 Everywhere+0.070
Neg logit contributions
els-0.067
ps-0.056
 on-0.056
 inside-0.056
.-0.053
Token her
Token ID607
Feature activation+2.25e+00

Pos logit contributions
els+0.066
ws+0.053
ries+0.052
ier+0.051
ps+0.049
Neg logit contributions
ete-0.081
 Everywhere-0.081
 Int-0.077
 From-0.077
 Curious-0.076
Token eyes
Token ID2951
Feature activation+6.73e+00

Pos logit contributions
els+0.039
ier+0.035
ries+0.034
ps+0.033
ws+0.033
Neg logit contributions
 Everywhere-0.031
 Sadly-0.030
ete-0.030
 Int-0.029
 Curious-0.029
Token.
Token ID13
Feature activation-1.19e+01

Pos logit contributions
els+0.080
ries+0.064
ws+0.055
ps+0.052
cks+0.051
Neg logit contributions
 Sadly-0.137
 Curious-0.136
 Everywhere-0.136
 Int-0.135
ete-0.135
Token She
Token ID1375
Feature activation-5.52e+00

Pos logit contributions
ete+0.143
Sadly+0.131
rolling+0.118
 Int+0.117
Days+0.116
Neg logit contributions
.-0.238
 on-0.229
els-0.225
 inside-0.222
."-0.218
Token sleeps
Token ID44263
Feature activation+4.67e+00

Pos logit contributions
ete+0.098
 Int+0.096
 Sadly+0.094
Sadly+0.094
 Curious+0.094
Neg logit contributions
els-0.080
 on-0.073
 inside-0.070
.-0.069
ps-0.068
Token and
Token ID290
Feature activation-3.54e+00

Pos logit contributions
els+0.056
ries+0.041
ps+0.038
ws+0.035
 inside+0.034
Neg logit contributions
Sadly-0.096
ete-0.096
 Sadly-0.096
 Everywhere-0.095
 Curious-0.095
Token dreams
Token ID10625
Feature activation+2.94e+00

Pos logit contributions
 Int+0.051
 Sadly+0.050
 Curious+0.050
 Everything+0.048
Sadly+0.048
Neg logit contributions
els-0.062
ries-0.051
ier-0.051
ps-0.050
 inside-0.050
Token of
Token ID286
Feature activation+6.08e-01

Pos logit contributions
els+0.040
ries+0.030
ps+0.029
 on+0.029
 inside+0.028
Neg logit contributions
 Sadly-0.055
 Int-0.055
ete-0.055
 Everywhere-0.055
Sadly-0.055
Token purple
Token ID14032
Feature activation+8.21e+00

Pos logit contributions
els+0.010
ier+0.008
le+0.008
 on+0.008
 inside+0.008
Neg logit contributions
 Everywhere-0.009
 Everything-0.009
 Sadly-0.009
ete-0.009
 Int-0.009
Token things
Token ID1243
Feature activation+6.71e+00

Pos logit contributions
els+0.100
ws+0.094
cks+0.092
ps+0.092
bones+0.082
Neg logit contributions
 Everywhere-0.125
pill-0.121
 Int-0.119
 Everything-0.119
 Curious-0.118
Token,"
Token ID553
Feature activation-1.01e+01

Pos logit contributions
els+0.026
ps+0.021
ries+0.020
bones+0.017
et+0.016
Neg logit contributions
 Sadly-0.041
Sadly-0.040
ete-0.039
 Not-0.039
Luckily-0.038
Token Lily
Token ID20037
Feature activation-1.52e+00

Pos logit contributions
ete+0.176
 Int+0.171
Sadly+0.168
 Sadly+0.168
 Curious+0.167
Neg logit contributions
els-0.152
 on-0.142
 inside-0.135
.-0.134
ps-0.127
Token says
Token ID1139
Feature activation-3.53e-01

Pos logit contributions
 Int+0.026
 Curious+0.026
ete+0.025
 Sadly+0.025
 Everywhere+0.024
Neg logit contributions
els-0.023
 on-0.020
 inside-0.020
ps-0.019
ries-0.018
Token.
Token ID13
Feature activation-1.03e+01

Pos logit contributions
 Int+0.007
 Curious+0.007
 Her+0.007
 His+0.007
 Sadly+0.007
Neg logit contributions
els-0.004
 colored-0.003
ries-0.003
ws-0.003
 frost-0.003
Token She
Token ID1375
Feature activation-3.04e+00

Pos logit contributions
ete+0.085
 Int+0.074
Sadly+0.074
 Sadly+0.073
 Curious+0.070
Neg logit contributions
els-0.250
 on-0.242
.-0.236
 inside-0.235
ps-0.228
Token fills
Token ID23816
Feature activation+7.93e+00

Pos logit contributions
 Int+0.060
 Curious+0.058
ete+0.058
 Everywhere+0.056
 Sadly+0.056
Neg logit contributions
els-0.037
 on-0.033
 inside-0.031
.-0.030
ps-0.029
Token another
Token ID1194
Feature activation+7.34e+00

Pos logit contributions
els+0.109
ier+0.092
 on+0.090
ps+0.089
 inside+0.087
Neg logit contributions
ete-0.139
 Int-0.137
 Curious-0.135
 Sadly-0.132
 Everywhere-0.131
Token cup
Token ID6508
Feature activation+1.23e+01

Pos logit contributions
els+0.127
ps+0.117
ier+0.112
ries+0.108
ws+0.107
Neg logit contributions
 Sadly-0.099
ete-0.095
 Curious-0.095
Sadly-0.095
 Int-0.088
Token with
Token ID351
Feature activation+5.49e+00

Pos logit contributions
els+0.173
ries+0.132
ps+0.130
 problems+0.126
cks+0.119
Neg logit contributions
 Sadly-0.220
Sadly-0.220
ete-0.216
From-0.210
 Everywhere-0.209
Token water
Token ID1660
Feature activation+1.14e+01

Pos logit contributions
els+0.073
ier+0.057
 inside+0.057
 on+0.057
ps+0.057
Neg logit contributions
ete-0.096
 Everywhere-0.094
 Curious-0.093
 Sadly-0.093
 Everything-0.093
Token from
Token ID422
Feature activation+3.12e+00

Pos logit contributions
els+0.137
ries+0.100
ps+0.099
 problems+0.098
ws+0.096
Neg logit contributions
ete-0.208
 Fro-0.198
 Sadly-0.196
 He-0.196
 Everywhere-0.195
Token sun
Token ID4252
Feature activation+8.41e+00

Pos logit contributions
ete+0.023
 Int+0.023
 Sadly+0.022
 Curious+0.022
Sadly+0.022
Neg logit contributions
els-0.023
 on-0.021
 inside-0.020
ps-0.020
.-0.020
Token was
Token ID373
Feature activation+2.47e+00

Pos logit contributions
els+0.153
ps+0.135
 on+0.134
 inside+0.132
.+0.129
Neg logit contributions
ete-0.116
 Int-0.113
 Curious-0.111
 Sadly-0.110
 Everything-0.108
Token already
Token ID1541
Feature activation+6.51e+00

Pos logit contributions
els+0.039
ps+0.032
ws+0.031
ries+0.030
 inside+0.029
Neg logit contributions
ete-0.041
 Int-0.040
 Everywhere-0.039
 Curious-0.039
rolling-0.038
Token shining
Token ID22751
Feature activation+1.18e+01

Pos logit contributions
els+0.107
ps+0.087
ws+0.086
 problems+0.080
cks+0.080
Neg logit contributions
 Int-0.103
ete-0.102
 Curious-0.100
 Everywhere-0.098
 Everything-0.098
Token high
Token ID1029
Feature activation+1.18e+01

Pos logit contributions
els+0.169
ps+0.144
ws+0.141
cks+0.129
ries+0.121
Neg logit contributions
ete-0.209
 Curious-0.206
 Int-0.205
 Everywhere-0.203
 Everything-0.198
Token,
Token ID11
Feature activation+1.84e+00

Pos logit contributions
els+0.152
ws+0.125
ps+0.122
 sticker+0.114
cks+0.111
Neg logit contributions
 Everywhere-0.214
ete-0.213
 Curious-0.212
 Int-0.208
 He-0.200
Token and
Token ID290
Feature activation-4.91e-01

Pos logit contributions
els+0.019
 on+0.016
 inside+0.015
.+0.015
ps+0.015
Neg logit contributions
ete-0.041
 Int-0.040
 Sadly-0.039
 Curious-0.039
Sadly-0.039
Token the
Token ID262
Feature activation-2.08e-01

Pos logit contributions
ete+0.009
 Int+0.009
 Sadly+0.009
Sadly+0.009
 Curious+0.009
Neg logit contributions
els-0.006
 on-0.006
 inside-0.006
.-0.006
ps-0.005
Token water
Token ID1660
Feature activation+8.86e+00

Pos logit contributions
ete+0.003
 Int+0.003
 Curious+0.003
 Sadly+0.003
 Everything+0.003
Neg logit contributions
els-0.003
 on-0.003
ps-0.003
 inside-0.003
.-0.003
Token was
Token ID373
Feature activation+2.12e+00

Pos logit contributions
els+0.150
 on+0.139
 inside+0.134
.+0.131
ps+0.131
Neg logit contributions
ete-0.135
 Int-0.132
 Sadly-0.129
 Curious-0.129
Sadly-0.128
Token sparkling
Token ID39072
Feature activation+1.30e+01

Pos logit contributions
els+0.037
ps+0.030
ries+0.029
 on+0.029
ws+0.029
Neg logit contributions
ete-0.032
 Int-0.031
 Curious-0.030
 Everywhere-0.030
 Sadly-0.030
Token told
Token ID1297
Feature activation+1.16e-01

Pos logit contributions
els+0.054
 on+0.050
 inside+0.048
.+0.047
ps+0.047
Neg logit contributions
ete-0.057
 Int-0.056
 Sadly-0.055
 Curious-0.055
Sadly-0.055
Token her
Token ID607
Feature activation+2.67e-01

Pos logit contributions
els+0.001
 on+0.001
.+0.001
 inside+0.001
ps+0.001
Neg logit contributions
ete-0.003
 Int-0.002
 Sadly-0.002
 Curious-0.002
Sadly-0.002
Token that
Token ID326
Feature activation-5.54e-01

Pos logit contributions
 on+0.003
els+0.003
 inside+0.003
.+0.003
ps+0.003
Neg logit contributions
 Int-0.005
ete-0.005
 Curious-0.005
Sadly-0.005
 Everywhere-0.005
Token it
Token ID340
Feature activation+3.32e+00

Pos logit contributions
 Int+0.008
 Everything+0.008
 Sadly+0.008
ete+0.008
Sadly+0.008
Neg logit contributions
els-0.009
ps-0.008
 on-0.007
 inside-0.007
.-0.007
Token's
Token ID338
Feature activation+1.42e+00

Pos logit contributions
els+0.055
 on+0.053
 inside+0.051
.+0.050
ps+0.048
Neg logit contributions
ete-0.054
 Int-0.051
 Curious-0.050
Sadly-0.050
 Sadly-0.049
Token important
Token ID1593
Feature activation+5.25e+00

Pos logit contributions
els+0.035
ries+0.031
cks+0.030
ps+0.030
bones+0.029
Neg logit contributions
Sadly-0.011
 Sadly-0.010
Luckily-0.009
From-0.009
 shouldn-0.009
Token to
Token ID284
Feature activation-1.36e+00

Pos logit contributions
els+0.090
ps+0.076
 on+0.074
ries+0.073
 inside+0.072
Neg logit contributions
 Sadly-0.078
ete-0.076
 Everything-0.076
Sadly-0.076
 Int-0.076
Token respect
Token ID2461
Feature activation+7.65e+00

Pos logit contributions
ete+0.019
Sadly+0.019
 Sadly+0.019
 Int+0.018
 Everything+0.018
Neg logit contributions
els-0.024
 on-0.022
 inside-0.021
ps-0.021
ries-0.021
Token other
Token ID584
Feature activation+5.99e+00

Pos logit contributions
els+0.100
ps+0.093
ier+0.083
 frost+0.082
ries+0.082
Neg logit contributions
 Everywhere-0.126
 Everything-0.123
ete-0.120
From-0.116
Sadly-0.116
Token people
Token ID661
Feature activation+9.83e+00

Pos logit contributions
els+0.084
ps+0.075
 on+0.074
 inside+0.072
ries+0.068
Neg logit contributions
ete-0.099
 Int-0.097
 Everything-0.097
Sadly-0.097
 Curious-0.096
Token's
Token ID338
Feature activation+6.07e+00

Pos logit contributions
els+0.188
ps+0.169
 on+0.160
 inside+0.159
ries+0.157
Neg logit contributions
ete-0.122
 Sadly-0.117
 Int-0.116
Sadly-0.114
 Curious-0.113
Token in
Token ID287
Feature activation+1.40e+00

Pos logit contributions
els+0.108
ps+0.086
ws+0.085
ries+0.076
 sticker+0.068
Neg logit contributions
 Int-0.203
 Curious-0.197
ete-0.194
 Everything-0.193
 Everywhere-0.188
Token the
Token ID262
Feature activation+1.69e+00

Pos logit contributions
els+0.021
ps+0.018
ier+0.018
ws+0.018
 on+0.017
Neg logit contributions
 Curious-0.024
 Int-0.023
Da-0.023
 Everywhere-0.022
 Sadly-0.022
Token bushes
Token ID37413
Feature activation+9.70e+00

Pos logit contributions
els+0.027
 on+0.025
 inside+0.024
.+0.024
ps+0.023
Neg logit contributions
ete-0.028
 Int-0.027
 Sadly-0.026
 Curious-0.026
Sadly-0.026
Token and
Token ID290
Feature activation-5.46e-01

Pos logit contributions
els+0.037
 on+0.021
 inside+0.017
ps+0.016
.+0.011
Neg logit contributions
ete-0.275
 Int-0.272
 Sadly-0.269
 Curious-0.269
Sadly-0.268
Token she
Token ID673
Feature activation-1.16e+00

Pos logit contributions
ete+0.010
Sadly+0.009
 Sadly+0.009
 Int+0.009
 Everywhere+0.009
Neg logit contributions
els-0.008
 on-0.008
 inside-0.008
.-0.008
ps-0.007
Token found
Token ID1043
Feature activation+3.01e+00

Pos logit contributions
ete+0.022
 Everywhere+0.022
Of+0.021
Sadly+0.021
 Sadly+0.021
Neg logit contributions
 on-0.015
.-0.015
 inside-0.015
els-0.015
ps-0.013
Token a
Token ID257
Feature activation+2.52e+00

Pos logit contributions
els+0.041
 on+0.036
 inside+0.034
ps+0.034
.+0.033
Neg logit contributions
ete-0.056
 Int-0.055
 Curious-0.054
 Sadly-0.054
Sadly-0.054
Token baby
Token ID5156
Feature activation+1.11e+01

Pos logit contributions
els+0.043
ps+0.037
 on+0.037
 inside+0.036
.+0.035
Neg logit contributions
ete-0.039
 Int-0.038
 Curious-0.038
 Sadly-0.038
Sadly-0.038
Token fox
Token ID21831
Feature activation+1.16e+01

Pos logit contributions
els+0.211
ws+0.202
ps+0.193
 on+0.173
cks+0.173
Neg logit contributions
 Curious-0.128
 Int-0.124
ete-0.123
Da-0.121
Bet-0.119
Token.
Token ID13
Feature activation-1.03e+01

Pos logit contributions
els+0.107
ps+0.082
ws+0.074
ries+0.072
 on+0.068
Neg logit contributions
ete-0.261
 Curious-0.254
 Int-0.253
 Sadly-0.251
 Barkley-0.246
Token The
Token ID383
Feature activation-3.27e+00

Pos logit contributions
ete+0.145
Sadly+0.141
Ag+0.133
Days+0.132
rolling+0.123
Neg logit contributions
.-0.197
 on-0.176
 inside-0.176
."-0.168
els-0.159
Token waved
Token ID26834
Feature activation+4.92e+00

Pos logit contributions
ete+0.035
 Int+0.035
Sadly+0.034
 Sadly+0.034
 Curious+0.034
Neg logit contributions
els-0.037
 on-0.035
 inside-0.034
.-0.033
ps-0.032
Token to
Token ID284
Feature activation-2.12e+00

Pos logit contributions
els+0.071
 on+0.062
 inside+0.059
ps+0.059
.+0.057
Neg logit contributions
ete-0.089
 Int-0.087
 Curious-0.086
 Sadly-0.085
Sadly-0.084
Token the
Token ID262
Feature activation+8.12e-03

Pos logit contributions
ete+0.040
 Int+0.039
 Sadly+0.039
 Curious+0.039
Sadly+0.038
Neg logit contributions
els-0.028
 on-0.026
 inside-0.024
ps-0.024
.-0.024
Token path
Token ID3108
Feature activation+1.01e+01

Pos logit contributions
els+0.000
 on+0.000
 inside+0.000
.+0.000
ps+0.000
Neg logit contributions
ete-0.000
 Int-0.000
 Sadly-0.000
Sadly-0.000
 Curious-0.000
Token.
Token ID13
Feature activation-1.13e+01

Pos logit contributions
els+0.084
ws+0.061
ps+0.060
 frost+0.056
ries+0.055
Neg logit contributions
ete-0.228
 Curious-0.226
 Int-0.226
 Sadly-0.223
 Everywhere-0.220
Token Suddenly
Token ID24975
Feature activation-4.91e-01

Pos logit contributions
ete+0.144
Sadly+0.140
Da+0.128
mber+0.127
Bet+0.124
Neg logit contributions
els-0.211
.-0.211
 on-0.206
 inside-0.204
ps-0.186
Token,
Token ID11
Feature activation-6.02e+00

Pos logit contributions
ete+0.011
 Int+0.011
 Sadly+0.011
 Curious+0.011
 Everything+0.011
Neg logit contributions
els-0.005
ps-0.004
 on-0.003
 inside-0.003
.-0.003
Token the
Token ID262
Feature activation-2.63e+00

Pos logit contributions
ete+0.100
 Int+0.100
 Curious+0.099
 Sadly+0.098
Sadly+0.096
Neg logit contributions
els-0.094
ps-0.080
 on-0.079
 inside-0.077
.-0.076
Token path
Token ID3108
Feature activation+8.33e+00

Pos logit contributions
ete+0.047
 Int+0.047
 Sadly+0.045
Sadly+0.045
 Everywhere+0.044
Neg logit contributions
els-0.036
 on-0.036
 inside-0.035
.-0.033
ps-0.032
Token said
Token ID531
Feature activation+8.86e-01

Pos logit contributions
els+0.140
 on+0.125
ps+0.122
 inside+0.122
.+0.119
Neg logit contributions
ete-0.127
 Int-0.125
 Curious-0.123
 Sadly-0.122
Sadly-0.121
Token hello
Token ID23748
Feature activation+6.25e+00

Pos logit contributions
 on+0.007
 inside+0.006
.+0.006
els+0.006
ps+0.005
Neg logit contributions
ete-0.022
Sadly-0.021
 Int-0.021
 Sadly-0.021
 Everywhere-0.021
Token learned
Token ID4499
Feature activation-1.40e+00

Pos logit contributions
ete+0.057
 Int+0.056
 Sadly+0.055
Sadly+0.055
 Curious+0.055
Neg logit contributions
els-0.038
 on-0.035
 inside-0.033
.-0.033
ps-0.032
Token that
Token ID326
Feature activation-1.90e+00

Pos logit contributions
ete+0.024
 Int+0.024
 Sadly+0.023
Sadly+0.023
 Everywhere+0.023
Neg logit contributions
els-0.020
 on-0.018
 inside-0.017
ps-0.017
.-0.016
Token being
Token ID852
Feature activation+1.25e+00

Pos logit contributions
ete+0.030
 Int+0.029
 Sadly+0.029
 Everything+0.029
Sadly+0.029
Neg logit contributions
els-0.031
ps-0.027
 on-0.026
 inside-0.026
.-0.025
Token bald
Token ID28273
Feature activation+7.25e+00

Pos logit contributions
els+0.023
 on+0.021
 inside+0.020
ps+0.020
.+0.020
Neg logit contributions
ete-0.017
 Int-0.017
 Sadly-0.017
 Curious-0.017
Sadly-0.017
Token is
Token ID318
Feature activation-5.64e-01

Pos logit contributions
els+0.102
ries+0.083
 inside+0.082
 frost+0.081
 on+0.081
Neg logit contributions
 Everywhere-0.119
 Sadly-0.119
 Everything-0.118
ete-0.116
Sadly-0.114
Token not
Token ID407
Feature activation+2.45e-01

Pos logit contributions
Sadly+0.007
ete+0.006
 Sadly+0.006
From+0.006
Of+0.006
Neg logit contributions
els-0.011
ries-0.010
le-0.009
ps-0.009
 inside-0.009
Token bad
Token ID2089
Feature activation+4.58e+00

Pos logit contributions
els+0.004
ries+0.003
ps+0.003
le+0.003
 inside+0.003
Neg logit contributions
 Everything-0.003
 Not-0.003
 Barkley-0.003
Sadly-0.003
 Sadly-0.003
Token.
Token ID13
Feature activation-1.12e+01

Pos logit contributions
els+0.053
cks+0.038
ps+0.038
ries+0.037
 frost+0.037
Neg logit contributions
 Not-0.090
 Everything-0.089
 Everywhere-0.087
ete-0.086
 He-0.085
Token He
Token ID679
Feature activation-3.34e+00

Pos logit contributions
ete+0.093
Sadly+0.085
 Int+0.084
rolling+0.081
 Curious+0.079
Neg logit contributions
els-0.262
.-0.251
 on-0.250
 inside-0.243
ps-0.236
Token learned
Token ID4499
Feature activation-1.95e+00

Pos logit contributions
ete+0.058
 Int+0.057
 Curious+0.055
 Sadly+0.055
Sadly+0.055
Neg logit contributions
els-0.050
 on-0.046
 inside-0.044
.-0.044
ps-0.043
Token to
Token ID284
Feature activation-4.26e+00

Pos logit contributions
ete+0.034
 Int+0.033
 Sadly+0.033
Sadly+0.033
 Curious+0.032
Neg logit contributions
els-0.029
 on-0.025
 inside-0.025
ps-0.024
.-0.023
Token outside
Token ID2354
Feature activation+7.15e+00

Pos logit contributions
els+0.079
ws+0.060
ps+0.060
ries+0.058
 uses+0.051
Neg logit contributions
 Int-0.095
 Everything-0.094
 Curious-0.094
 All-0.093
 Everywhere-0.092
Token.
Token ID13
Feature activation-1.10e+01

Pos logit contributions
els+0.072
ps+0.061
ws+0.058
cks+0.044
ries+0.043
Neg logit contributions
 Everywhere-0.148
 Sadly-0.147
ete-0.144
 Curious-0.141
 Int-0.140
Token
Token ID198
Feature activation-1.04e+01

Pos logit contributions
 Barkley+0.182
Nut+0.179
ete+0.178
rolling+0.177
Sadly+0.172
Neg logit contributions
.-0.179
 on-0.158
."-0.156
els-0.154
 inside-0.148
Token
Token ID198
Feature activation-1.25e+01

Pos logit contributions
ete+0.161
ishly+0.158
 Barkley+0.156
 Int+0.154
 Curious+0.153
Neg logit contributions
.-0.172
 on-0.164
els-0.154
 inside-0.153
."-0.143
Token"
Token ID1
Feature activation-9.33e+00

Pos logit contributions
 Int+0.195
ete+0.191
 Curious+0.190
 Sadly+0.190
 Barkley+0.189
Neg logit contributions
els-0.201
.-0.194
 on-0.193
 inside-0.176
."-0.161
TokenLet
Token ID5756
Feature activation-1.35e+00

Pos logit contributions
ete+0.097
 Int+0.094
 Sadly+0.091
 Curious+0.090
Sadly+0.090
Neg logit contributions
els-0.206
 on-0.194
 inside-0.189
.-0.186
ps-0.185
Token's
Token ID338
Feature activation-1.66e+00

Pos logit contributions
ete+0.025
Of+0.025
 Sadly+0.025
rolling+0.025
 Everywhere+0.025
Neg logit contributions
.-0.018
 on-0.018
els-0.017
 inside-0.017
."-0.015
Token go
Token ID467
Feature activation+4.08e+00

Pos logit contributions
ete+0.023
 Sadly+0.022
 Int+0.022
Sadly+0.022
 Curious+0.022
Neg logit contributions
els-0.030
ps-0.026
 on-0.026
ries-0.025
ier-0.025
Token to
Token ID284
Feature activation-9.78e-01

Pos logit contributions
els+0.054
ps+0.042
ries+0.039
 frost+0.037
ws+0.035
Neg logit contributions
ete-0.078
 Int-0.077
 Curious-0.077
 Sadly-0.077
rolling-0.077
Token the
Token ID262
Feature activation-2.97e-01

Pos logit contributions
ete+0.019
 Int+0.019
 Sadly+0.018
 Curious+0.018
 Everywhere+0.018
Neg logit contributions
els-0.013
 on-0.011
ps-0.010
 inside-0.010
.-0.010
Token library
Token ID5888
Feature activation+9.95e+00

Pos logit contributions
ete+0.004
 Int+0.004
 Curious+0.004
 Sadly+0.004
Sadly+0.004
Neg logit contributions
els-0.006
 on-0.005
 inside-0.005
.-0.005
ps-0.005
Token soccer
Token ID11783
Feature activation+7.60e+00

Pos logit contributions
els+0.021
ps+0.015
ries+0.015
ier+0.014
 inside+0.013
Neg logit contributions
 Int-0.038
Sadly-0.037
ete-0.037
 Sadly-0.037
 Curious-0.037
Token in
Token ID287
Feature activation-2.72e+00

Pos logit contributions
els+0.093
ps+0.076
ries+0.071
 inside+0.066
 frost+0.066
Neg logit contributions
ete-0.146
 Everywhere-0.142
 Int-0.141
Sadly-0.139
From-0.138
Token the
Token ID262
Feature activation-2.23e+00

Pos logit contributions
ete+0.048
 Everywhere+0.047
 Int+0.047
 Curious+0.046
 Everything+0.045
Neg logit contributions
els-0.039
ps-0.033
 on-0.032
ier-0.032
 inside-0.031
Token park
Token ID3952
Feature activation+4.36e+00

Pos logit contributions
ete+0.039
 Int+0.038
 Sadly+0.037
 Curious+0.037
Sadly+0.037
Neg logit contributions
els-0.032
 on-0.031
 inside-0.030
.-0.029
ps-0.028
Token.
Token ID13
Feature activation-1.24e+01

Pos logit contributions
els+0.034
ps+0.023
ries+0.020
ws+0.018
 inside+0.016
Neg logit contributions
ete-0.105
 Sadly-0.105
 Everywhere-0.103
 Curious-0.103
 Int-0.103
Token They
Token ID1119
Feature activation-5.20e+00

Pos logit contributions
ete+0.206
Sadly+0.184
rolling+0.175
mber+0.174
 Int+0.169
Neg logit contributions
.-0.199
 on-0.188
."-0.177
 inside-0.176
els-0.171
Token both
Token ID1111
Feature activation-3.22e+00

Pos logit contributions
ete+0.085
 Int+0.082
Sadly+0.082
 Sadly+0.081
 Curious+0.079
Neg logit contributions
 on-0.083
els-0.082
 inside-0.079
.-0.078
ps-0.072
Token wanted
Token ID2227
Feature activation-1.22e+00

Pos logit contributions
ete+0.043
 Int+0.042
 Sadly+0.041
Sadly+0.041
 Curious+0.041
Neg logit contributions
els-0.060
 on-0.056
 inside-0.055
.-0.054
ps-0.053
Token to
Token ID284
Feature activation-4.51e+00

Pos logit contributions
ete+0.019
 Int+0.018
 Everywhere+0.018
 Everything+0.018
 Sadly+0.018
Neg logit contributions
els-0.021
ps-0.018
 on-0.017
 inside-0.017
ries-0.016
Token score
Token ID4776
Feature activation+7.98e+00

Pos logit contributions
ete+0.079
 Int+0.078
 Sadly+0.077
Sadly+0.076
 Curious+0.076
Neg logit contributions
els-0.065
 on-0.061
 inside-0.059
.-0.059
ps-0.056
Token a
Token ID257
Feature activation+1.30e+00

Pos logit contributions
els+0.103
 inside+0.083
ps+0.081
ries+0.081
 on+0.079
Neg logit contributions
 Int-0.150
ete-0.149
 Everywhere-0.148
 Curious-0.147
Sadly-0.143
Token,
Token ID11
Feature activation-1.01e+01

Pos logit contributions
ete+0.107
 Int+0.104
 Curious+0.103
 Sadly+0.102
 Everything+0.101
Neg logit contributions
els-0.048
ps-0.037
 on-0.034
 inside-0.033
ws-0.032
Token Sara
Token ID24799
Feature activation-4.95e+00

Pos logit contributions
ete+0.168
 Int+0.165
Sadly+0.165
 Sadly+0.163
 Curious+0.160
Neg logit contributions
els-0.153
 on-0.149
 inside-0.142
.-0.140
ps-0.130
Token and
Token ID290
Feature activation-9.38e+00

Pos logit contributions
ete+0.089
 Int+0.088
Sadly+0.087
 Sadly+0.086
 Curious+0.085
Neg logit contributions
els-0.070
 on-0.066
 inside-0.064
.-0.063
ps-0.059
Token Lily
Token ID20037
Feature activation-3.94e+00

Pos logit contributions
Sadly+0.198
 Int+0.196
 Sadly+0.192
ete+0.191
rolling+0.190
Neg logit contributions
 on-0.110
 inside-0.105
els-0.103
.-0.100
ps-0.087
Token decided
Token ID3066
Feature activation-3.18e+00

Pos logit contributions
ete+0.066
Sadly+0.065
 Int+0.065
 Sadly+0.063
 Curious+0.062
Neg logit contributions
els-0.060
 on-0.059
 inside-0.058
.-0.058
ps-0.051
Token to
Token ID284
Feature activation-6.69e+00

Pos logit contributions
ete+0.065
 Int+0.063
Sadly+0.063
 Sadly+0.062
 Everywhere+0.062
Neg logit contributions
 on-0.037
els-0.036
 inside-0.035
.-0.035
ps-0.030
Token have
Token ID423
Feature activation-5.45e-01

Pos logit contributions
ete+0.116
 Int+0.114
 Sadly+0.112
Sadly+0.112
 Curious+0.111
Neg logit contributions
els-0.099
 on-0.092
 inside-0.088
.-0.087
ps-0.085
Token a
Token ID257
Feature activation-1.10e+00

Pos logit contributions
ete+0.010
 Int+0.010
 Sadly+0.010
 Curious+0.010
Sadly+0.010
Neg logit contributions
els-0.008
 on-0.007
 inside-0.007
ps-0.006
.-0.006
Token tea
Token ID8887
Feature activation+9.77e+00

Pos logit contributions
ete+0.017
 Int+0.017
 Sadly+0.016
 Curious+0.016
Sadly+0.016
Neg logit contributions
els-0.018
 on-0.017
 inside-0.016
ps-0.016
.-0.016
Token party
Token ID2151
Feature activation+8.22e+00

Pos logit contributions
els+0.185
ps+0.163
ws+0.150
cks+0.149
ries+0.149
Neg logit contributions
ete-0.129
Of-0.120
Da-0.118
 Sadly-0.118
 Everywhere-0.114
Token in
Token ID287
Feature activation-1.76e+00

Pos logit contributions
els+0.085
ps+0.066
ws+0.061
cks+0.051
ries+0.051
Neg logit contributions
ete-0.175
Da-0.168
Of-0.166
Bet-0.163
 Sadly-0.163
Token you
Token ID345
Feature activation+5.89e+00

Pos logit contributions
els+0.139
ps+0.114
 inside+0.112
 frost+0.108
ries+0.104
Neg logit contributions
ete-0.174
 Everywhere-0.172
 Int-0.171
 Curious-0.168
From-0.165
Token can
Token ID460
Feature activation+4.85e+00

Pos logit contributions
els+0.096
 on+0.079
ries+0.079
 inside+0.078
ps+0.077
Neg logit contributions
 Int-0.092
ete-0.091
 Curious-0.089
 Sadly-0.088
 Everywhere-0.088
Token raise
Token ID5298
Feature activation+1.24e+01

Pos logit contributions
els+0.075
 on+0.062
ries+0.062
 inside+0.061
ps+0.060
Neg logit contributions
ete-0.079
 Int-0.078
Sadly-0.077
 Sadly-0.077
 Everywhere-0.076
Token your
Token ID534
Feature activation+8.11e+00

Pos logit contributions
els+0.136
ps+0.105
ries+0.101
 inside+0.091
 frost+0.089
Neg logit contributions
 Everywhere-0.248
ete-0.246
 Int-0.238
 Sadly-0.235
Sadly-0.234
Token arms
Token ID5101
Feature activation+1.50e+01

Pos logit contributions
els+0.137
ries+0.123
ier+0.117
 colored+0.110
bones+0.109
Neg logit contributions
 Everywhere-0.116
 Curious-0.114
 Int-0.112
 Sadly-0.112
ete-0.109
Token."
Token ID526
Feature activation-1.06e+01

Pos logit contributions
els+0.163
ries+0.125
ps+0.111
bones+0.105
 frost+0.101
Neg logit contributions
 Everywhere-0.320
ete-0.310
 Int-0.303
 Sadly-0.303
 Curious-0.300
Token
Token ID198
Feature activation-1.04e+01

Pos logit contributions
ete+0.145
Sadly+0.138
 Int+0.132
rolling+0.128
 Curious+0.127
Neg logit contributions
 on-0.201
.-0.200
els-0.191
 inside-0.186
."-0.171
Token
Token ID198
Feature activation-1.20e+01

Pos logit contributions
ete+0.157
 Int+0.150
 Curious+0.148
 Sadly+0.144
Sadly+0.144
Neg logit contributions
.-0.184
 on-0.175
els-0.166
 inside-0.163
."-0.154
TokenL
Token ID43
Feature activation-4.15e+00

Pos logit contributions
ete+0.167
 Everything+0.161
 Int+0.160
 Everywhere+0.156
 Curious+0.155
Neg logit contributions
 on-0.226
els-0.216
.-0.214
 inside-0.196
 sticker-0.188
Tokenily
Token ID813
Feature activation-2.09e+00

Pos logit contributions
ete+0.065
 Int+0.063
Sadly+0.062
 Sadly+0.062
 Curious+0.061
Neg logit contributions
els-0.070
 on-0.064
 inside-0.062
.-0.060
ps-0.059
Token took
Token ID1718
Feature activation+3.06e+00

Pos logit contributions
ete+0.037
 Everywhere+0.036
 Barkley+0.035
Ag+0.035
rolling+0.034
Neg logit contributions
.-0.030
els-0.029
 on-0.029
 inside-0.029
ps-0.027
Token each
Token ID1123
Feature activation+1.16e+00

Pos logit contributions
ete+0.043
 Int+0.042
 Curious+0.041
 Sadly+0.041
Sadly+0.041
Neg logit contributions
els-0.036
 on-0.032
 inside-0.030
ps-0.030
.-0.029
Token other
Token ID584
Feature activation+4.93e+00

Pos logit contributions
els+0.015
ps+0.013
 inside+0.012
 on+0.012
ries+0.011
Neg logit contributions
ete-0.022
 Int-0.022
 Curious-0.022
Sadly-0.022
 Everywhere-0.021
Token and
Token ID290
Feature activation-2.46e+00

Pos logit contributions
els+0.063
ps+0.052
ries+0.049
 frost+0.047
ws+0.046
Neg logit contributions
 Everywhere-0.088
ete-0.087
 Curious-0.085
 Int-0.084
 Sadly-0.082
Token laughed
Token ID13818
Feature activation+3.52e+00

Pos logit contributions
ete+0.036
 Int+0.035
 Sadly+0.035
 Curious+0.035
Sadly+0.034
Neg logit contributions
els-0.044
ps-0.037
 on-0.037
 inside-0.036
ries-0.035
Token.
Token ID13
Feature activation-1.06e+01

Pos logit contributions
els+0.028
ries+0.017
ps+0.015
ws+0.014
 inside+0.012
Neg logit contributions
 Curious-0.086
 Int-0.086
ete-0.084
 Everywhere-0.083
From-0.083
Token
Token ID198
Feature activation-9.20e+00

Pos logit contributions
ete+0.135
Sadly+0.124
 Int+0.122
 Curious+0.119
 Sadly+0.118
Neg logit contributions
els-0.207
 on-0.204
.-0.201
 inside-0.195
ps-0.181
Token
Token ID198
Feature activation-1.10e+01

Pos logit contributions
ete+0.121
 Int+0.115
 Curious+0.113
Sadly+0.112
 Sadly+0.111
Neg logit contributions
 on-0.169
els-0.166
.-0.166
 inside-0.162
."-0.146
TokenBut
Token ID1537
Feature activation-5.45e+00

Pos logit contributions
ete+0.173
 Int+0.167
 Sadly+0.164
 Curious+0.163
 Everywhere+0.160
Neg logit contributions
els-0.180
 on-0.173
 inside-0.166
.-0.165
ps-0.153
Token then
Token ID788
Feature activation-3.66e+00

Pos logit contributions
ete+0.064
 Everything+0.062
 Int+0.062
 Sadly+0.062
Sadly+0.061
Neg logit contributions
els-0.107
ps-0.097
ws-0.091
ier-0.090
 sticker-0.088
Token a
Token ID257
Feature activation-1.03e+00

Pos logit contributions
ete+0.076
 Int+0.074
 Curious+0.073
 Sadly+0.073
 Everything+0.073
Neg logit contributions
els-0.042
ps-0.035
 on-0.034
 inside-0.033
.-0.030
Token big
Token ID1263
Feature activation+1.97e+00

Pos logit contributions
ete+0.017
 Int+0.017
 Sadly+0.016
Sadly+0.016
 Curious+0.016
Neg logit contributions
els-0.016
 on-0.015
 inside-0.014
.-0.014
ps-0.013
Token his
Token ID465
Feature activation+1.78e+00

Pos logit contributions
els+0.017
 on+0.015
 inside+0.015
ps+0.015
.+0.014
Neg logit contributions
ete-0.021
 Int-0.020
 Sadly-0.020
 Curious-0.020
Sadly-0.020
Token friend
Token ID1545
Feature activation+5.75e+00

Pos logit contributions
els+0.031
 on+0.027
ps+0.026
 inside+0.026
ier+0.025
Neg logit contributions
ete-0.026
 Sadly-0.025
 Curious-0.025
 Int-0.025
Sadly-0.024
Token,
Token ID11
Feature activation-3.17e+00

Pos logit contributions
els+0.057
ps+0.045
 on+0.041
 inside+0.041
ries+0.037
Neg logit contributions
ete-0.130
 Sadly-0.125
 Int-0.124
 Curious-0.124
 Everywhere-0.124
Token the
Token ID262
Feature activation-1.61e+00

Pos logit contributions
ete+0.062
 Int+0.060
Sadly+0.060
 Sadly+0.059
 Curious+0.058
Neg logit contributions
 on-0.042
.-0.039
 inside-0.038
els-0.038
ps-0.032
Token cow
Token ID9875
Feature activation+6.75e+00

Pos logit contributions
ete+0.027
 Int+0.026
 Sadly+0.026
 Curious+0.026
Sadly+0.026
Neg logit contributions
els-0.025
 on-0.023
 inside-0.022
.-0.022
ps-0.022
Token.
Token ID13
Feature activation-1.10e+01

Pos logit contributions
els+0.051
 on+0.041
 inside+0.038
ps+0.035
.+0.035
Neg logit contributions
ete-0.167
 Int-0.164
 Sadly-0.163
 Curious-0.162
Sadly-0.161
Token<|endoftext|>
Token ID50256
Feature activation-6.15e+00

Pos logit contributions
rolling+0.197
ete+0.192
mber+0.190
 Int+0.182
Quick+0.177
Neg logit contributions
.-0.176
 on-0.148
."-0.146
 inside-0.137
els-0.120
TokenOnce
Token ID7454
Feature activation-1.18e+01

Pos logit contributions
ete+0.058
 Int+0.056
 Sadly+0.055
 Curious+0.054
 Everywhere+0.053
Neg logit contributions
els-0.141
 on-0.129
ps-0.127
 inside-0.127
.-0.123
Token upon
Token ID2402
Feature activation-7.05e+00

Pos logit contributions
 Int+0.150
Sadly+0.149
ete+0.149
 Curious+0.147
 Sadly+0.144
Neg logit contributions
els-0.225
 on-0.220
.-0.215
 inside-0.212
ps-0.192
Token a
Token ID257
Feature activation-8.38e+00

Pos logit contributions
ete+0.163
 Sadly+0.153
Sadly+0.152
rolling+0.151
 Int+0.149
Neg logit contributions
 on-0.071
.-0.070
 inside-0.064
els-0.059
ps-0.052
Token time
Token ID640
Feature activation-6.75e+00

Pos logit contributions
ete+0.214
 Int+0.212
 Curious+0.210
 Sadly+0.207
 Everything+0.207
Neg logit contributions
els-0.049
 on-0.045
.-0.044
 inside-0.039
ps-0.031
Token to
Token ID284
Feature activation-1.48e+00

Pos logit contributions
els+0.017
 on+0.016
 inside+0.015
.+0.015
ps+0.014
Neg logit contributions
ete-0.025
 Int-0.025
Sadly-0.024
 Sadly-0.024
 Curious-0.024
Token c
Token ID269
Feature activation+5.60e+00

Pos logit contributions
ete+0.025
 Int+0.025
 Sadly+0.024
Sadly+0.024
 Curious+0.024
Neg logit contributions
els-0.023
 on-0.021
 inside-0.020
.-0.020
ps-0.019
Tokenuddle
Token ID24500
Feature activation+9.77e+00

Pos logit contributions
els+0.104
 on+0.094
.+0.092
 inside+0.092
ries+0.087
Neg logit contributions
 Int-0.080
 Curious-0.080
ete-0.078
 Everywhere-0.076
Sadly-0.075
Token with
Token ID351
Feature activation+2.63e+00

Pos logit contributions
els+0.100
ps+0.080
 on+0.078
 inside+0.078
.+0.069
Neg logit contributions
ete-0.209
 Int-0.205
 Sadly-0.203
Sadly-0.202
 Curious-0.202
Token Lily
Token ID20037
Feature activation+4.39e+00

Pos logit contributions
els+0.038
 on+0.034
 inside+0.033
ps+0.033
.+0.031
Neg logit contributions
ete-0.046
 Int-0.045
 Sadly-0.045
 Curious-0.044
Sadly-0.044
Token.
Token ID13
Feature activation-1.18e+01

Pos logit contributions
els+0.042
ps+0.036
ier+0.030
ws+0.029
ries+0.029
Neg logit contributions
 Sadly-0.094
 Everywhere-0.092
Da-0.091
ete-0.090
 Curious-0.090
Token One
Token ID1881
Feature activation-3.59e+00

Pos logit contributions
ete+0.197
Sadly+0.180
 Int+0.172
rolling+0.171
mber+0.167
Neg logit contributions
.-0.204
 on-0.187
."-0.176
 inside-0.173
els-0.169
Token day
Token ID1110
Feature activation-3.60e+00

Pos logit contributions
ete+0.081
 Int+0.079
 Sadly+0.078
Sadly+0.078
 Curious+0.078
Neg logit contributions
els-0.035
 on-0.033
 inside-0.030
.-0.030
ps-0.026
Token,
Token ID11
Feature activation-8.26e+00

Pos logit contributions
 Sadly+0.070
ete+0.069
 Int+0.069
 Everything+0.068
 Curious+0.068
Neg logit contributions
els-0.048
ps-0.042
ws-0.035
ries-0.033
cks-0.033
Token Lily
Token ID20037
Feature activation-3.53e+00

Pos logit contributions
ete+0.149
 Int+0.143
Sadly+0.140
 Sadly+0.140
 Curious+0.138
Neg logit contributions
 on-0.118
els-0.118
 inside-0.111
.-0.111
ps-0.098
Token and
Token ID290
Feature activation-7.99e+00

Pos logit contributions
ete+0.070
 Everywhere+0.067
Of+0.066
 Int+0.066
Sadly+0.065
Neg logit contributions
 on-0.046
 inside-0.046
.-0.045
els-0.043
 colored-0.036
TokenThe
Token ID464
Feature activation-5.38e+00

Pos logit contributions
ete+0.079
 Int+0.078
 Sadly+0.076
 Curious+0.076
Sadly+0.074
Neg logit contributions
els-0.124
 on-0.113
 inside-0.111
ps-0.110
.-0.108
Token little
Token ID1310
Feature activation-5.43e+00

Pos logit contributions
ete+0.085
 Int+0.084
 Sadly+0.083
Sadly+0.083
 Curious+0.081
Neg logit contributions
els-0.087
 on-0.083
 inside-0.079
.-0.078
ps-0.075
Token boy
Token ID2933
Feature activation-1.69e+00

Pos logit contributions
ete+0.083
 Int+0.080
 Curious+0.079
 Sadly+0.077
 Everywhere+0.077
Neg logit contributions
els-0.093
ps-0.083
 on-0.083
 inside-0.081
.-0.078
Token was
Token ID373
Feature activation-4.38e+00

Pos logit contributions
ete+0.027
 Int+0.027
Sadly+0.027
 Curious+0.026
 Sadly+0.026
Neg logit contributions
els-0.027
 on-0.025
 inside-0.025
.-0.024
ps-0.023
Token naughty
Token ID45128
Feature activation+3.89e+00

Pos logit contributions
ete+0.073
 Int+0.071
 Sadly+0.070
Sadly+0.070
 Curious+0.070
Neg logit contributions
els-0.068
 on-0.064
 inside-0.061
.-0.059
ps-0.059
Token.
Token ID13
Feature activation-1.12e+01

Pos logit contributions
els+0.030
ps+0.021
 on+0.020
 inside+0.019
ws+0.017
Neg logit contributions
ete-0.096
 Int-0.094
 Curious-0.093
 Sadly-0.092
 Everywhere-0.092
Token He
Token ID679
Feature activation-3.80e+00

Pos logit contributions
ete+0.100
Sadly+0.099
Of+0.094
Da+0.090
 Int+0.089
Neg logit contributions
els-0.253
.-0.252
 on-0.238
 inside-0.234
."-0.228
Token knew
Token ID2993
Feature activation-1.21e+00

Pos logit contributions
ete+0.057
 Int+0.055
Sadly+0.053
 Curious+0.053
 Sadly+0.052
Neg logit contributions
els-0.065
 on-0.064
.-0.062
 inside-0.061
ps-0.058
Token he
Token ID339
Feature activation-9.14e-01

Pos logit contributions
ete+0.018
 Int+0.018
 Sadly+0.018
 Curious+0.018
Sadly+0.017
Neg logit contributions
els-0.021
 on-0.019
 inside-0.018
.-0.018
ps-0.018
Token had
Token ID550
Feature activation+4.12e-01

Pos logit contributions
ete+0.015
 Int+0.015
Sadly+0.014
 Curious+0.014
Of+0.014
Neg logit contributions
 on-0.014
els-0.014
 inside-0.014
.-0.014
ps-0.012
Token done
Token ID1760
Feature activation+3.63e+00

Pos logit contributions
els+0.006
 on+0.006
 inside+0.005
ps+0.005
.+0.005
Neg logit contributions
ete-0.007
 Int-0.007
 Sadly-0.007
 Curious-0.007
Sadly-0.007