TOP ACTIVATIONS
MAX = 19.335

chest and found lots of gold coins and shiny jewels .
and found lots of gold coins and shiny jewels .
found a chest filled with gold and jewels . From that
real treasure chest filled with gold coins and jewels .
. We need a new battery ," Dad says .
. They hope to find jewels . " Look
very special box full of matches . The baby
treasure chest filled with gold coins and jewels .
her they could buy new batteries at the store . When
like it was made of gold ! The
fun until they saw a switch on the wall . It
good to have so much clutter . You can trip and
and tried to take the battery from his mouth . Max
her camera ran out of battery . L
his pocket and found some coins . He put the coins
. It 's made of gold and has a heart -
filled with lots of shiny coins . He had never seen
saw some more che wed threads in the house . Fl
that she could find more treasures if she kept looking .
. We just need some glue and a new wheel .

MIN ACTIVATIONS
MIN = -27.418

by one 1 , 2 ,
perfect bag now I 'm very
of the stream . <|endoftext|> M olly was a three -
had saved him . <|endoftext|> Charl otte and her mom were
was happy too . <|endoftext|> S ara liked to play dress
helped him fly . <|endoftext|> M um and Dad had a
it forever things are only available
the ocean again . <|endoftext|> M um looked at her daughter
made them happy . <|endoftext|> S ara and Ben were playing
in her heart . <|endoftext|> S ara and Ben like to
catch a star . <|endoftext|> S andy was a little girl
her first tire . <|endoftext|> M ummy and Daddy were talking
good summer day . <|endoftext|> S ara and Tom were very
went to bed . <|endoftext|> M um and Dad carefully took
the most successful . <|endoftext|> M andy was walking home from
room before dinner . <|endoftext|> S aman tha was a lazy
to be sailors . <|endoftext|> S ara liked cream very much
happy than ever . <|endoftext|> M um and Dad were taking
for her efforts . <|endoftext|> M ia was playing in the
who saved her . <|endoftext|> M um and Dad had a

INTERVAL 7.646 - 19.335
CONTAINS 10.696%

said . She put the flowers in the sack .
, a cold st eth oscope and a loud voice .
to always listen to your heart and never give up on
She asked him lots of questions and tried to make him
idea and quickly grabbed their hats . They were so lucky

INTERVAL -4.042 - 7.646
CONTAINS 63.930%

seen again . <|endoftext|> Today , Mom my and I went
. It was a peanut butter and jelly sandwich , which
a chance . How about we make a deal ? If
and practiced , and the day of the stage show came
tree and it lit up the night with its bright colours

INTERVAL -15.730 - -4.042
CONTAINS 25.127%

ie said and realized that the tw ig had reversed and
clouds appeared in the sky and it started to rain .
, who might know . B unny looked at
was so happy and proud . From then , the grand
lot . He was happy and he had a big smile

INTERVAL -27.418 - -15.730
CONTAINS 0.247%

as she did . <|endoftext|> S ara and Ben like to
it forever . L ily 's mom saw the
â Hello ! Where are you
at were all the leaves . She dreamed of picking up
notes laid on the ground . Cl ara
Token chest
Token ID7721
Feature activation+1.29e+01

Pos logit contributions
els+0.054
 colored+0.045
 on+0.044
ps+0.043
 frost+0.042
Neg logit contributions
 Int-0.060
 Everywhere-0.055
ete-0.054
 Curious-0.054
 Sadly-0.051
Token and
Token ID290
Feature activation+1.20e+00

Pos logit contributions
els+0.151
 sticker+0.119
 frost+0.116
ps+0.116
 harm+0.115
Neg logit contributions
 Int-0.228
ete-0.218
 Everywhere-0.211
 Fro-0.208
 Curious-0.206
Token found
Token ID1043
Feature activation+7.21e+00

Pos logit contributions
els+0.015
 on+0.014
 inside+0.013
 sticker+0.013
.+0.012
Neg logit contributions
 Int-0.020
ete-0.019
 Everywhere-0.019
 Fro-0.018
 Curious-0.018
Token lots
Token ID6041
Feature activation+1.13e+01

Pos logit contributions
els+0.093
 on+0.080
 sticker+0.079
 colored+0.074
 inside+0.074
Neg logit contributions
 Int-0.118
ete-0.112
 Everywhere-0.112
 Fro-0.110
 Everything-0.105
Token of
Token ID286
Feature activation+8.39e+00

Pos logit contributions
els+0.205
ps+0.192
ws+0.176
 cutter+0.176
 stamp+0.174
Neg logit contributions
 Int-0.110
 Everywhere-0.107
ete-0.106
 concession-0.103
pill-0.102
Token gold
Token ID3869
Feature activation+1.88e+01

Pos logit contributions
els+0.097
 cutter+0.087
ps+0.084
 on+0.081
ier+0.079
Neg logit contributions
 Int-0.141
 Curious-0.129
 Everywhere-0.129
 Fro-0.127
 concession-0.126
Token coins
Token ID10796
Feature activation+1.87e+01

Pos logit contributions
els+0.186
 harm+0.186
 channel+0.177
 matters+0.174
 brush+0.169
Neg logit contributions
ete-0.268
pill-0.251
 He-0.248
hog-0.246
 Fro-0.244
Token and
Token ID290
Feature activation+4.81e+00

Pos logit contributions
els+0.170
 sticker+0.140
 colored+0.133
 harm+0.132
 tape+0.131
Neg logit contributions
ete-0.335
 Fro-0.310
hog-0.294
 Int-0.289
 Barkley-0.282
Token shiny
Token ID22441
Feature activation+1.47e+01

Pos logit contributions
els+0.050
 on+0.040
ps+0.038
 inside+0.037
 sticker+0.037
Neg logit contributions
 Int-0.088
 Everywhere-0.085
 Fro-0.083
ete-0.083
 Curious-0.081
Token jewels
Token ID42701
Feature activation+1.68e+01

Pos logit contributions
 harm+0.164
 falling+0.162
 channel+0.159
 cutter+0.158
 service+0.154
Neg logit contributions
 He-0.178
pill-0.165
 Everywhere-0.163
 Int-0.163
 From-0.157
Token.
Token ID13
Feature activation-6.02e+00

Pos logit contributions
els+0.170
ws+0.127
 problems+0.121
 spills+0.114
 harm+0.114
Neg logit contributions
ete-0.287
hog-0.281
 Fro-0.273
 Int-0.268
 couldn-0.263
Token and
Token ID290
Feature activation+1.20e+00

Pos logit contributions
els+0.151
 sticker+0.119
 frost+0.116
ps+0.116
 harm+0.115
Neg logit contributions
 Int-0.228
ete-0.218
 Everywhere-0.211
 Fro-0.208
 Curious-0.206
Token found
Token ID1043
Feature activation+7.21e+00

Pos logit contributions
els+0.015
 on+0.014
 inside+0.013
 sticker+0.013
.+0.012
Neg logit contributions
 Int-0.020
ete-0.019
 Everywhere-0.019
 Fro-0.018
 Curious-0.018
Token lots
Token ID6041
Feature activation+1.13e+01

Pos logit contributions
els+0.093
 on+0.080
 sticker+0.079
 colored+0.074
 inside+0.074
Neg logit contributions
 Int-0.118
ete-0.112
 Everywhere-0.112
 Fro-0.110
 Everything-0.105
Token of
Token ID286
Feature activation+8.39e+00

Pos logit contributions
els+0.205
ps+0.192
ws+0.176
 cutter+0.176
 stamp+0.174
Neg logit contributions
 Int-0.110
 Everywhere-0.107
ete-0.106
 concession-0.103
pill-0.102
Token gold
Token ID3869
Feature activation+1.88e+01

Pos logit contributions
els+0.097
 cutter+0.087
ps+0.084
 on+0.081
ier+0.079
Neg logit contributions
 Int-0.141
 Curious-0.129
 Everywhere-0.129
 Fro-0.127
 concession-0.126
Token coins
Token ID10796
Feature activation+1.87e+01

Pos logit contributions
els+0.186
 harm+0.186
 channel+0.177
 matters+0.174
 brush+0.169
Neg logit contributions
ete-0.268
pill-0.251
 He-0.248
hog-0.246
 Fro-0.244
Token and
Token ID290
Feature activation+4.81e+00

Pos logit contributions
els+0.170
 sticker+0.140
 colored+0.133
 harm+0.132
 tape+0.131
Neg logit contributions
ete-0.335
 Fro-0.310
hog-0.294
 Int-0.289
 Barkley-0.282
Token shiny
Token ID22441
Feature activation+1.47e+01

Pos logit contributions
els+0.050
 on+0.040
ps+0.038
 inside+0.037
 sticker+0.037
Neg logit contributions
 Int-0.088
 Everywhere-0.085
 Fro-0.083
ete-0.083
 Curious-0.081
Token jewels
Token ID42701
Feature activation+1.68e+01

Pos logit contributions
 harm+0.164
 falling+0.162
 channel+0.159
 cutter+0.158
 service+0.154
Neg logit contributions
 He-0.178
pill-0.165
 Everywhere-0.163
 Int-0.163
 From-0.157
Token.
Token ID13
Feature activation-6.02e+00

Pos logit contributions
els+0.170
ws+0.127
 problems+0.121
 spills+0.114
 harm+0.114
Neg logit contributions
ete-0.287
hog-0.281
 Fro-0.273
 Int-0.268
 couldn-0.263
Token
Token ID198
Feature activation-5.09e+00

Pos logit contributions
mber+0.081
ete+0.081
Da+0.080
 Int+0.078
 concession+0.077
Neg logit contributions
 inside-0.091
.-0.090
 on-0.089
els-0.082
 colored-0.075
Token found
Token ID1043
Feature activation+6.74e+00

Pos logit contributions
els+0.009
 on+0.009
 inside+0.009
 sticker+0.008
.+0.008
Neg logit contributions
 Int-0.011
ete-0.010
 Everywhere-0.010
 Fro-0.010
Da-0.009
Token a
Token ID257
Feature activation+5.62e+00

Pos logit contributions
els+0.085
 on+0.075
 sticker+0.073
 inside+0.071
 frost+0.070
Neg logit contributions
 Int-0.111
 Everywhere-0.107
ete-0.106
 Fro-0.102
 Curious-0.099
Token chest
Token ID7721
Feature activation+1.44e+01

Pos logit contributions
els+0.081
 on+0.065
ps+0.065
ries+0.062
 inside+0.061
Neg logit contributions
 Int-0.089
ete-0.086
 Fro-0.083
mber-0.082
 Everywhere-0.082
Token filled
Token ID5901
Feature activation+9.07e+00

Pos logit contributions
els+0.193
ries+0.146
ps+0.146
 frost+0.141
ws+0.141
Neg logit contributions
ete-0.230
 Int-0.218
hog-0.212
 Fro-0.211
 Everywhere-0.206
Token with
Token ID351
Feature activation+9.17e+00

Pos logit contributions
els+0.094
 on+0.076
 sticker+0.076
ps+0.074
 inside+0.072
Neg logit contributions
 Int-0.168
 Everywhere-0.162
ete-0.161
 Fro-0.158
Da-0.156
Token gold
Token ID3869
Feature activation+1.85e+01

Pos logit contributions
els+0.113
 on+0.096
 sticker+0.093
 inside+0.090
ps+0.087
Neg logit contributions
 Int-0.150
 Everywhere-0.148
ete-0.143
 Fro-0.140
 Curious-0.138
Token and
Token ID290
Feature activation+4.08e+00

Pos logit contributions
els+0.193
 harm+0.168
 spills+0.158
 matters+0.154
ws+0.154
Neg logit contributions
ete-0.285
hog-0.266
 He-0.260
pill-0.259
 Int-0.254
Token jewels
Token ID42701
Feature activation+1.63e+01

Pos logit contributions
els+0.045
 on+0.038
 inside+0.036
 sticker+0.035
ps+0.034
Neg logit contributions
 Int-0.075
 Everywhere-0.071
ete-0.070
 Fro-0.069
 Curious-0.069
Token.
Token ID13
Feature activation-5.95e+00

Pos logit contributions
els+0.166
ws+0.119
 spills+0.116
 problems+0.110
 harm+0.107
Neg logit contributions
ete-0.289
 Int-0.285
hog-0.270
 Fro-0.269
 He-0.265
Token From
Token ID3574
Feature activation+2.97e+00

Pos logit contributions
Da+0.068
Mos+0.063
mber+0.062
 concession+0.062
 Fro+0.060
Neg logit contributions
 on-0.102
.-0.101
 inside-0.100
els-0.087
 sticker-0.085
Token that
Token ID326
Feature activation-6.62e-01

Pos logit contributions
els+0.037
 on+0.035
 inside+0.034
 sticker+0.032
.+0.031
Neg logit contributions
 Int-0.051
ete-0.047
 Everywhere-0.047
 Fro-0.046
Da-0.045
Token real
Token ID1103
Feature activation+1.03e+01

Pos logit contributions
els+0.096
ps+0.076
 on+0.073
ries+0.071
 colored+0.070
Neg logit contributions
 Int-0.100
 Fro-0.098
Da-0.094
ete-0.094
mber-0.093
Token treasure
Token ID14068
Feature activation+1.67e+01

Pos logit contributions
els+0.159
ries+0.125
ps+0.125
 on+0.123
bones+0.119
Neg logit contributions
 Fro-0.138
 Int-0.137
ete-0.131
mber-0.128
Da-0.125
Token chest
Token ID7721
Feature activation+1.56e+01

Pos logit contributions
els+0.169
 colored+0.122
 frost+0.118
ps+0.118
ries+0.114
Neg logit contributions
hog-0.285
 Fro-0.276
ete-0.273
 Int-0.272
 concession-0.263
Token filled
Token ID5901
Feature activation+9.26e+00

Pos logit contributions
els+0.159
ws+0.106
 problems+0.105
ries+0.104
 frost+0.100
Neg logit contributions
hog-0.275
ete-0.266
 Fro-0.260
 Int-0.255
 couldn-0.252
Token with
Token ID351
Feature activation+9.08e+00

Pos logit contributions
els+0.091
 on+0.075
 sticker+0.073
 inside+0.071
ps+0.069
Neg logit contributions
 Int-0.174
Da-0.167
 Everywhere-0.167
 Fro-0.167
ete-0.165
Token gold
Token ID3869
Feature activation+1.84e+01

Pos logit contributions
els+0.107
 cutter+0.086
 sticker+0.083
 on+0.083
ier+0.079
Neg logit contributions
 Int-0.146
 Fro-0.144
 Everywhere-0.143
ete-0.136
 concession-0.135
Token coins
Token ID10796
Feature activation+1.81e+01

Pos logit contributions
els+0.190
 harm+0.171
 colored+0.151
 matters+0.148
eless+0.148
Neg logit contributions
pill-0.272
hog-0.268
ete-0.251
 Fro-0.250
 He-0.247
Token and
Token ID290
Feature activation+4.20e+00

Pos logit contributions
els+0.177
 sticker+0.133
 harm+0.131
 spills+0.129
 colored+0.128
Neg logit contributions
ete-0.287
hog-0.282
 Fro-0.281
 Int-0.267
 Barkley-0.263
Token jewels
Token ID42701
Feature activation+1.58e+01

Pos logit contributions
els+0.042
 on+0.035
 inside+0.033
 sticker+0.031
ps+0.030
Neg logit contributions
 Int-0.080
 Everywhere-0.076
 Fro-0.075
ete-0.074
 Curious-0.074
Token.
Token ID13
Feature activation-6.86e+00

Pos logit contributions
els+0.163
ws+0.118
 harm+0.108
 spills+0.105
 colored+0.105
Neg logit contributions
hog-0.260
pill-0.245
 Fro-0.243
 Int-0.241
ete-0.240
Token
Token ID220
Feature activation-5.62e+00

Pos logit contributions
ete+0.119
Fre+0.111
 concession+0.110
rolling+0.110
ternally+0.110
Neg logit contributions
.-0.092
 on-0.088
 inside-0.088
,"-0.066
,-0.066
Token.
Token ID13
Feature activation-3.80e+00

Pos logit contributions
els+0.149
 frost+0.122
ries+0.120
 colored+0.113
 icing+0.109
Neg logit contributions
 Everywhere-0.205
 Int-0.192
 Everything-0.188
ete-0.182
 As-0.180
Token We
Token ID775
Feature activation+2.51e+00

Pos logit contributions
 Int+0.033
ete+0.031
 Fro+0.030
Da+0.029
 Everywhere+0.028
Neg logit contributions
els-0.075
 on-0.075
 inside-0.073
 sticker-0.070
.-0.070
Token need
Token ID761
Feature activation+6.80e+00

Pos logit contributions
els+0.036
 on+0.032
 inside+0.031
 sticker+0.030
 colored+0.030
Neg logit contributions
 Int-0.035
 Everywhere-0.033
ete-0.032
 Fro-0.031
 Sadly-0.031
Token a
Token ID257
Feature activation+5.20e+00

Pos logit contributions
els+0.108
ier+0.091
bones+0.090
ries+0.088
 colored+0.087
Neg logit contributions
 Everywhere-0.086
 Int-0.084
 Everything-0.079
ete-0.078
 As-0.074
Token new
Token ID649
Feature activation+9.09e+00

Pos logit contributions
els+0.094
ries+0.081
bones+0.073
ps+0.070
 colored+0.066
Neg logit contributions
 Int-0.072
 Sadly-0.067
 Curious-0.065
 Everywhere-0.064
Of-0.063
Token battery
Token ID6555
Feature activation+1.83e+01

Pos logit contributions
els+0.144
ries+0.121
bones+0.111
ps+0.106
ier+0.105
Neg logit contributions
 Int-0.133
 Everywhere-0.129
 Curious-0.119
 As-0.116
 Sadly-0.115
Token,"
Token ID553
Feature activation-6.35e+00

Pos logit contributions
els+0.233
ries+0.178
 colored+0.173
bones+0.172
ck+0.163
Neg logit contributions
 Everywhere-0.247
hog-0.246
ete-0.235
 Int-0.234
 Follow-0.230
Token Dad
Token ID17415
Feature activation+2.32e+00

Pos logit contributions
 Int+0.096
ete+0.093
 Everywhere+0.092
 Fro+0.089
rolling+0.088
Neg logit contributions
 on-0.088
.-0.084
els-0.084
 inside-0.083
 paper-0.080
Token says
Token ID1139
Feature activation+3.77e+00

Pos logit contributions
els+0.033
 on+0.032
 inside+0.031
 sticker+0.030
 colored+0.029
Neg logit contributions
 Int-0.035
ete-0.029
 Curious-0.029
 Everywhere-0.028
 Exc-0.028
Token.
Token ID13
Feature activation-6.35e+00

Pos logit contributions
els+0.034
 colored+0.026
 problems+0.024
 frost+0.023
 tape+0.021
Neg logit contributions
 Int-0.079
 Curious-0.072
ete-0.070
 As-0.069
 His-0.068
Token
Token ID198
Feature activation-7.08e+00

Pos logit contributions
ete+0.086
 Int+0.083
 Fro+0.082
rolling+0.081
 concession+0.080
Neg logit contributions
 on-0.099
.-0.096
 inside-0.094
 sticker-0.086
els-0.081
Token.
Token ID13
Feature activation-5.35e+00

Pos logit contributions
 sticker+0.155
 stamp+0.146
 comb+0.135
els+0.131
 frost+0.130
Neg logit contributions
 Int-0.209
 Everywhere-0.204
 Fro-0.194
 He-0.193
 Fre-0.188
Token They
Token ID1119
Feature activation+2.67e+00

Pos logit contributions
 Int+0.060
ete+0.055
 concession+0.055
mber+0.055
Da+0.054
Neg logit contributions
 on-0.094
 inside-0.093
.-0.089
els-0.088
 colored-0.081
Token hope
Token ID2911
Feature activation+8.93e+00

Pos logit contributions
els+0.035
 on+0.034
 inside+0.033
 sticker+0.030
.+0.030
Neg logit contributions
 Int-0.043
ete-0.040
 Everywhere-0.040
 Fro-0.039
Da-0.038
Token to
Token ID284
Feature activation+3.17e+00

Pos logit contributions
els+0.112
 on+0.099
 inside+0.095
 sticker+0.094
 colored+0.090
Neg logit contributions
 Int-0.149
ete-0.142
 Everywhere-0.142
 Fro-0.140
 Everything-0.133
Token find
Token ID1064
Feature activation+8.70e+00

Pos logit contributions
 on+0.040
els+0.040
 inside+0.039
.+0.036
 sticker+0.036
Neg logit contributions
 Int-0.053
ete-0.049
 Everywhere-0.049
 Fro-0.047
Da-0.047
Token jewels
Token ID42701
Feature activation+1.82e+01

Pos logit contributions
els+0.114
 on+0.095
 sticker+0.095
 cutter+0.092
 stamp+0.090
Neg logit contributions
 Int-0.134
 Everywhere-0.129
 Fro-0.126
ete-0.125
 Everything-0.122
Token.
Token ID13
Feature activation-4.95e+00

Pos logit contributions
els+0.201
 colored+0.148
ps+0.148
 frost+0.142
bones+0.140
Neg logit contributions
ete-0.318
 Int-0.314
 Fro-0.307
 Everywhere-0.307
Da-0.296
Token
Token ID198
Feature activation-4.75e+00

Pos logit contributions
 concession+0.071
ternally+0.070
 Int+0.069
pill+0.068
mber+0.068
Neg logit contributions
 on-0.071
 inside-0.071
.-0.070
els-0.059
 paper-0.054
Token
Token ID198
Feature activation-6.66e+00

Pos logit contributions
 Int+0.068
ishly+0.064
pill+0.062
 Fro+0.062
 Barkley+0.061
Neg logit contributions
 on-0.071
 inside-0.070
.-0.070
els-0.061
 sticker-0.059
Token"
Token ID1
Feature activation-3.58e+00

Pos logit contributions
 Int+0.087
 Fro+0.082
 Everywhere+0.080
ete+0.078
 Barkley+0.075
Neg logit contributions
 on-0.107
 inside-0.105
els-0.104
.-0.100
 colored-0.096
TokenLook
Token ID8567
Feature activation+4.03e+00

Pos logit contributions
 Int+0.040
ete+0.036
 Everywhere+0.035
 Fro+0.034
 Curious+0.032
Neg logit contributions
els-0.065
 on-0.064
 inside-0.063
 sticker-0.059
.-0.059
Token very
Token ID845
Feature activation+3.19e+00

Pos logit contributions
els+0.090
ps+0.077
ws+0.070
ries+0.069
ier+0.067
Neg logit contributions
 Int-0.070
 Curious-0.065
mber-0.065
 Fro-0.064
 concession-0.063
Token special
Token ID2041
Feature activation+9.26e+00

Pos logit contributions
els+0.052
ps+0.045
 on+0.045
ries+0.041
 inside+0.041
Neg logit contributions
 Int-0.042
ete-0.040
mber-0.038
 Curious-0.038
 Everywhere-0.037
Token box
Token ID3091
Feature activation+1.40e+01

Pos logit contributions
els+0.142
cks+0.114
ps+0.112
ws+0.110
ries+0.104
Neg logit contributions
mber-0.127
 Int-0.126
 Curious-0.123
 Everywhere-0.122
 Everything-0.121
Token full
Token ID1336
Feature activation+8.67e+00

Pos logit contributions
els+0.160
ps+0.135
ws+0.130
 problems+0.129
 spills+0.118
Neg logit contributions
 Int-0.234
ete-0.231
 Everywhere-0.221
hog-0.221
 Fro-0.217
Token of
Token ID286
Feature activation+7.13e+00

Pos logit contributions
els+0.151
ps+0.135
ws+0.127
 problems+0.127
 spills+0.125
Neg logit contributions
 Everywhere-0.101
 Int-0.099
ete-0.097
 Fro-0.089
Da-0.088
Token matches
Token ID7466
Feature activation+1.82e+01

Pos logit contributions
els+0.083
ier+0.075
ps+0.073
 on+0.071
ws+0.070
Neg logit contributions
 Everywhere-0.110
 Int-0.108
 Everything-0.105
 concession-0.101
 Curious-0.098
Token.
Token ID13
Feature activation-6.48e+00

Pos logit contributions
els+0.173
ws+0.154
 sticker+0.152
ps+0.140
 tape+0.128
Neg logit contributions
ete-0.284
 Fro-0.282
hog-0.275
 Barkley-0.270
ishly-0.267
Token
Token ID220
Feature activation-5.05e+00

Pos logit contributions
 Fro+0.090
ete+0.088
Da+0.081
Hop+0.080
rig+0.079
Neg logit contributions
.-0.099
 on-0.099
 inside-0.098
 frost-0.091
 sticker-0.087
Token
Token ID198
Feature activation-6.12e+00

Pos logit contributions
mber+0.071
 Int+0.070
 concession+0.070
Da+0.069
 Fro+0.069
Neg logit contributions
 on-0.073
.-0.073
 inside-0.073
els-0.067
 sticker-0.063
TokenThe
Token ID464
Feature activation-1.37e+00

Pos logit contributions
ete+0.084
ishly+0.083
 Fro+0.081
 Int+0.078
cient+0.074
Neg logit contributions
 on-0.098
.-0.089
 inside-0.089
 sticker-0.085
 frost-0.081
Token baby
Token ID5156
Feature activation+4.53e+00

Pos logit contributions
 Int+0.016
 Everywhere+0.016
 concession+0.015
 Curious+0.015
 Sadly+0.015
Neg logit contributions
els-0.024
ws-0.020
ps-0.020
 on-0.020
bones-0.019
Token treasure
Token ID14068
Feature activation+1.67e+01

Pos logit contributions
els+0.159
ries+0.125
ps+0.125
 on+0.123
bones+0.119
Neg logit contributions
 Fro-0.138
 Int-0.137
ete-0.131
mber-0.128
Da-0.125
Token chest
Token ID7721
Feature activation+1.56e+01

Pos logit contributions
els+0.169
 colored+0.122
 frost+0.118
ps+0.118
ries+0.114
Neg logit contributions
hog-0.285
 Fro-0.276
ete-0.273
 Int-0.272
 concession-0.263
Token filled
Token ID5901
Feature activation+9.26e+00

Pos logit contributions
els+0.159
ws+0.106
 problems+0.105
ries+0.104
 frost+0.100
Neg logit contributions
hog-0.275
ete-0.266
 Fro-0.260
 Int-0.255
 couldn-0.252
Token with
Token ID351
Feature activation+9.08e+00

Pos logit contributions
els+0.091
 on+0.075
 sticker+0.073
 inside+0.071
ps+0.069
Neg logit contributions
 Int-0.174
Da-0.167
 Everywhere-0.167
 Fro-0.167
ete-0.165
Token gold
Token ID3869
Feature activation+1.84e+01

Pos logit contributions
els+0.107
 cutter+0.086
 sticker+0.083
 on+0.083
ier+0.079
Neg logit contributions
 Int-0.146
 Fro-0.144
 Everywhere-0.143
ete-0.136
 concession-0.135
Token coins
Token ID10796
Feature activation+1.81e+01

Pos logit contributions
els+0.190
 harm+0.171
 colored+0.151
 matters+0.148
eless+0.148
Neg logit contributions
pill-0.272
hog-0.268
ete-0.251
 Fro-0.250
 He-0.247
Token and
Token ID290
Feature activation+4.20e+00

Pos logit contributions
els+0.177
 sticker+0.133
 harm+0.131
 spills+0.129
 colored+0.128
Neg logit contributions
ete-0.287
hog-0.282
 Fro-0.281
 Int-0.267
 Barkley-0.263
Token jewels
Token ID42701
Feature activation+1.58e+01

Pos logit contributions
els+0.042
 on+0.035
 inside+0.033
 sticker+0.031
ps+0.030
Neg logit contributions
 Int-0.080
 Everywhere-0.076
 Fro-0.075
ete-0.074
 Curious-0.074
Token.
Token ID13
Feature activation-6.86e+00

Pos logit contributions
els+0.163
ws+0.118
 harm+0.108
 spills+0.105
 colored+0.105
Neg logit contributions
hog-0.260
pill-0.245
 Fro-0.243
 Int-0.241
ete-0.240
Token
Token ID220
Feature activation-5.62e+00

Pos logit contributions
ete+0.119
Fre+0.111
 concession+0.110
rolling+0.110
ternally+0.110
Neg logit contributions
.-0.092
 on-0.088
 inside-0.088
,"-0.066
,-0.066
Token
Token ID198
Feature activation-6.27e+00

Pos logit contributions
mber+0.089
 concession+0.089
ete+0.086
 Int+0.085
 Fro+0.083
Neg logit contributions
.-0.079
 on-0.077
 inside-0.074
els-0.066
."-0.058
Token her
Token ID607
Feature activation+3.51e+00

Pos logit contributions
els+0.052
 on+0.051
 inside+0.049
.+0.046
 sticker+0.042
Neg logit contributions
 Int-0.098
ete-0.094
 Everywhere-0.092
 Fro-0.090
 Curious-0.089
Token they
Token ID484
Feature activation+5.11e+00

Pos logit contributions
els+0.043
 on+0.040
 inside+0.039
 sticker+0.036
 colored+0.035
Neg logit contributions
 Int-0.060
ete-0.057
 Everywhere-0.056
 Fro-0.055
Da-0.053
Token could
Token ID714
Feature activation+2.71e+00

Pos logit contributions
 on+0.085
els+0.084
 inside+0.083
.+0.078
 sticker+0.076
Neg logit contributions
 Int-0.065
ete-0.059
 Everywhere-0.059
 Fro-0.058
Da-0.056
Token buy
Token ID2822
Feature activation+1.18e+01

Pos logit contributions
els+0.037
 on+0.035
 inside+0.034
 sticker+0.032
 colored+0.031
Neg logit contributions
 Int-0.042
ete-0.040
 Everywhere-0.040
 Fro-0.038
 Sadly-0.037
Token new
Token ID649
Feature activation+1.17e+01

Pos logit contributions
els+0.168
ps+0.152
ge+0.147
 inside+0.146
 cutter+0.142
Neg logit contributions
 Fro-0.151
 Int-0.149
 Everywhere-0.142
ete-0.139
 Everything-0.135
Token batteries
Token ID13591
Feature activation+1.81e+01

Pos logit contributions
ier+0.123
ps+0.118
ge+0.118
bones+0.112
els+0.110
Neg logit contributions
 Int-0.192
 Everywhere-0.177
 Curious-0.172
 Barkley-0.172
 Exc-0.167
Token at
Token ID379
Feature activation+6.54e+00

Pos logit contributions
els+0.140
ps+0.094
 colored+0.094
 frost+0.093
 sticker+0.090
Neg logit contributions
 Everywhere-0.362
 Int-0.360
ete-0.356
 Fro-0.346
 concession-0.339
Token the
Token ID262
Feature activation+3.71e+00

Pos logit contributions
els+0.083
 frost+0.071
 inside+0.070
 on+0.070
ps+0.069
Neg logit contributions
 Everywhere-0.103
 Int-0.098
 Fro-0.096
 Sadly-0.091
ete-0.091
Token store
Token ID3650
Feature activation+1.20e+01

Pos logit contributions
els+0.056
 on+0.054
 inside+0.052
 sticker+0.049
.+0.048
Neg logit contributions
 Int-0.054
ete-0.049
 Everywhere-0.049
 Fro-0.048
Da-0.046
Token.
Token ID13
Feature activation-5.86e+00

Pos logit contributions
els+0.051
 on+0.035
 inside+0.030
 sticker+0.025
ps+0.025
Neg logit contributions
 Int-0.298
ete-0.286
 Everywhere-0.286
 Fro-0.283
 Sadly-0.276
Token When
Token ID1649
Feature activation+1.12e+00

Pos logit contributions
ete+0.086
 Fro+0.079
 Barkley+0.078
mber+0.078
 Int+0.078
Neg logit contributions
.-0.086
 on-0.084
 inside-0.084
els-0.074
 paper-0.074
Token like
Token ID588
Feature activation+5.61e+00

Pos logit contributions
els+0.023
 on+0.022
 inside+0.022
 sticker+0.020
.+0.019
Neg logit contributions
 Int-0.037
ete-0.035
 Everywhere-0.035
 Fro-0.034
Da-0.033
Token it
Token ID340
Feature activation+6.47e+00

Pos logit contributions
els+0.064
 on+0.061
 inside+0.059
 sticker+0.054
.+0.053
Neg logit contributions
 Int-0.100
ete-0.094
 Everywhere-0.094
 Fro-0.092
Da-0.089
Token was
Token ID373
Feature activation+5.62e+00

Pos logit contributions
els+0.096
 on+0.091
 inside+0.088
 sticker+0.084
.+0.080
Neg logit contributions
 Int-0.094
ete-0.087
 Everywhere-0.086
 Fro-0.084
Da-0.081
Token made
Token ID925
Feature activation+8.18e+00

Pos logit contributions
els+0.088
 sticker+0.074
 problems+0.070
 papers+0.068
 stickers+0.067
Neg logit contributions
 Fro-0.073
 Everywhere-0.072
ete-0.070
 Int-0.069
Da-0.068
Token of
Token ID286
Feature activation+7.23e+00

Pos logit contributions
els+0.103
 on+0.089
 inside+0.088
 sticker+0.086
 colored+0.081
Neg logit contributions
 Int-0.137
ete-0.130
 Everywhere-0.130
 Fro-0.125
 Everything-0.121
Token gold
Token ID3869
Feature activation+1.81e+01

Pos logit contributions
els+0.088
 inside+0.079
 on+0.078
 sticker+0.073
ier+0.068
Neg logit contributions
 Fro-0.116
 Int-0.115
ete-0.111
 Everywhere-0.111
Da-0.106
Token!
Token ID0
Feature activation-5.97e+00

Pos logit contributions
els+0.202
 harm+0.159
bones+0.153
 colored+0.153
 sticker+0.152
Neg logit contributions
ete-0.295
 Fro-0.286
pill-0.271
 Int-0.267
 concession-0.257
Token
Token ID220
Feature activation-4.76e+00

Pos logit contributions
 concession+0.086
Ag+0.081
Da+0.079
Mos+0.078
isky+0.078
Neg logit contributions
.-0.087
 on-0.085
 inside-0.080
 tape-0.069
 paper-0.066
Token
Token ID198
Feature activation-5.83e+00

Pos logit contributions
Da+0.067
mber+0.066
 Int+0.065
 concession+0.065
ete+0.062
Neg logit contributions
.-0.072
 on-0.071
 inside-0.069
els-0.062
 sticker-0.058
Token
Token ID198
Feature activation-6.80e+00

Pos logit contributions
ishly+0.090
 Int+0.088
ete+0.087
 concession+0.087
cient+0.086
Neg logit contributions
 on-0.079
.-0.074
 inside-0.073
 sticker-0.061
,"-0.056
TokenThe
Token ID464
Feature activation-1.98e+00

Pos logit contributions
 Fro+0.095
ete+0.090
ishly+0.090
 Int+0.089
cient+0.087
Neg logit contributions
 on-0.101
 inside-0.090
.-0.088
 sticker-0.084
 tape-0.083
Token fun
Token ID1257
Feature activation+8.18e+00

Pos logit contributions
 on+0.037
 inside+0.035
els+0.034
.+0.032
 sticker+0.031
Neg logit contributions
 Int-0.057
ete-0.055
 Everywhere-0.053
 Fro-0.052
mber-0.051
Token until
Token ID1566
Feature activation+2.60e+00

Pos logit contributions
els+0.116
 sticker+0.107
ier+0.097
 bills+0.097
 frost+0.095
Neg logit contributions
 Int-0.115
 Fro-0.108
 As-0.101
ext-0.101
rolling-0.099
Token they
Token ID484
Feature activation+3.93e+00

Pos logit contributions
els+0.041
ps+0.035
 sticker+0.035
 colored+0.033
 ring+0.033
Neg logit contributions
 Fro-0.035
 Int-0.035
ete-0.035
 Everywhere-0.033
 Everything-0.032
Token saw
Token ID2497
Feature activation+6.19e+00

Pos logit contributions
els+0.054
 on+0.053
 inside+0.051
 sticker+0.047
.+0.047
Neg logit contributions
 Int-0.061
ete-0.057
 Everywhere-0.057
 Fro-0.055
Da-0.053
Token a
Token ID257
Feature activation+6.50e+00

Pos logit contributions
els+0.067
 on+0.064
 inside+0.062
 sticker+0.056
.+0.055
Neg logit contributions
 Int-0.115
ete-0.108
 Everywhere-0.107
 Fro-0.106
Da-0.103
Token switch
Token ID5078
Feature activation+1.81e+01

Pos logit contributions
els+0.107
ps+0.092
 on+0.084
 inside+0.083
ws+0.082
Neg logit contributions
 Int-0.090
 Fro-0.087
mber-0.084
 concession-0.083
Da-0.082
Token on
Token ID319
Feature activation+6.26e+00

Pos logit contributions
els+0.157
ws+0.141
 sticker+0.127
 frost+0.117
 penn+0.113
Neg logit contributions
 Int-0.320
hog-0.303
 concession-0.291
rolling-0.290
 Everywhere-0.287
Token the
Token ID262
Feature activation+4.46e+00

Pos logit contributions
els+0.085
ier+0.079
 sticker+0.074
 on+0.073
 inside+0.072
Neg logit contributions
 Everywhere-0.091
 Int-0.086
 Fro-0.082
 Everything-0.078
Da-0.078
Token wall
Token ID3355
Feature activation+1.29e+01

Pos logit contributions
els+0.073
ck+0.060
ries+0.059
 on+0.058
 inside+0.057
Neg logit contributions
 Everywhere-0.058
 Int-0.057
 concession-0.053
 Everything-0.050
 Curious-0.049
Token.
Token ID13
Feature activation-4.72e+00

Pos logit contributions
els+0.102
 sticker+0.072
ws+0.071
 stamp+0.071
 stamps+0.064
Neg logit contributions
 Int-0.260
 Everywhere-0.244
hog-0.242
 As-0.233
 Curious-0.232
Token It
Token ID632
Feature activation+3.93e+00

Pos logit contributions
 concession+0.063
ternally+0.063
mber+0.062
liv+0.062
isky+0.061
Neg logit contributions
 on-0.072
 inside-0.071
.-0.067
 paper-0.060
 tape-0.059
Token good
Token ID922
Feature activation+8.81e+00

Pos logit contributions
els+0.101
bones+0.079
ries+0.074
 sticker+0.073
ps+0.072
Neg logit contributions
 Int-0.070
 Everywhere-0.066
 Everything-0.064
Da-0.064
 Sadly-0.062
Token to
Token ID284
Feature activation+2.64e+00

Pos logit contributions
els+0.145
ck+0.118
bones+0.118
 stamp+0.114
cks+0.112
Neg logit contributions
 As-0.095
 She-0.094
 Fro-0.094
 Everywhere-0.092
 Not-0.091
Token have
Token ID423
Feature activation+7.61e+00

Pos logit contributions
els+0.040
 on+0.037
ier+0.035
bones+0.035
 inside+0.035
Neg logit contributions
 Int-0.033
 Everywhere-0.031
 Everything-0.030
ete-0.030
 She-0.030
Token so
Token ID523
Feature activation+4.01e+00

Pos logit contributions
els+0.109
 ring+0.082
 stamp+0.081
bones+0.080
 lever+0.080
Neg logit contributions
 Int-0.099
 Everywhere-0.098
 Everything-0.094
 Sadly-0.093
 Not-0.090
Token much
Token ID881
Feature activation+1.09e+01

Pos logit contributions
els+0.068
 colored+0.058
ps+0.058
 ring+0.057
 inside+0.057
Neg logit contributions
 Int-0.042
 Sadly-0.041
 Everywhere-0.039
Lots-0.039
Sadly-0.039
Token clutter
Token ID45343
Feature activation+1.81e+01

Pos logit contributions
els+0.141
ck+0.117
bones+0.116
cks+0.107
 stamp+0.107
Neg logit contributions
 Everywhere-0.158
 Int-0.154
 She-0.150
 Everything-0.148
 As-0.145
Token.
Token ID13
Feature activation-4.67e+00

Pos logit contributions
els+0.179
ck+0.159
 stamp+0.152
 sticker+0.147
 colored+0.145
Neg logit contributions
 Everywhere-0.274
 concession-0.265
ete-0.253
 He-0.249
 She-0.249
Token You
Token ID921
Feature activation+4.10e+00

Pos logit contributions
 Int+0.049
ete+0.046
 Fro+0.043
 Everywhere+0.042
Da+0.041
Neg logit contributions
els-0.086
 on-0.086
 inside-0.083
.-0.079
 sticker-0.079
Token can
Token ID460
Feature activation+2.14e+00

Pos logit contributions
els+0.054
 on+0.048
 inside+0.047
 colored+0.044
 sticker+0.044
Neg logit contributions
 Int-0.066
 Everywhere-0.062
ete-0.061
 Sadly-0.059
 Fro-0.059
Token trip
Token ID5296
Feature activation+1.29e+01

Pos logit contributions
els+0.029
 on+0.028
 inside+0.027
 sticker+0.025
.+0.025
Neg logit contributions
 Int-0.034
ete-0.031
 Everywhere-0.031
 Fro-0.030
 Curious-0.029
Token and
Token ID290
Feature activation+3.09e+00

Pos logit contributions
els+0.157
 frost+0.121
 colored+0.119
 on+0.116
 sticker+0.116
Neg logit contributions
 Int-0.219
 Everywhere-0.214
ete-0.207
 concession-0.199
 Fro-0.197
Token and
Token ID290
Feature activation+8.13e-01

Pos logit contributions
els+0.111
 colored+0.093
 sticker+0.089
 frost+0.089
ier+0.087
Neg logit contributions
 Int-0.114
 Fro-0.113
ete-0.105
 Everywhere-0.105
 Sadly-0.103
Token tried
Token ID3088
Feature activation+6.10e+00

Pos logit contributions
els+0.013
 on+0.010
 inside+0.010
 sticker+0.010
ier+0.010
Neg logit contributions
 Int-0.012
 Fro-0.011
Da-0.010
 Curious-0.010
ete-0.010
Token to
Token ID284
Feature activation+3.23e+00

Pos logit contributions
els+0.100
 colored+0.085
ps+0.083
 frost+0.082
ier+0.082
Neg logit contributions
 Fro-0.082
 Int-0.078
 Everywhere-0.074
ete-0.072
Da-0.071
Token take
Token ID1011
Feature activation+9.82e+00

Pos logit contributions
els+0.045
 on+0.041
 inside+0.038
 sticker+0.036
 frost+0.036
Neg logit contributions
 Int-0.049
 Fro-0.047
ete-0.046
 Everywhere-0.044
Da-0.044
Token the
Token ID262
Feature activation+4.50e+00

Pos logit contributions
els+0.158
 colored+0.139
 inside+0.138
 frost+0.137
 on+0.136
Neg logit contributions
 Int-0.126
 Everywhere-0.125
 Fro-0.119
ete-0.117
 Everything-0.113
Token battery
Token ID6555
Feature activation+1.81e+01

Pos logit contributions
els+0.072
ries+0.061
bones+0.058
ck+0.058
 colored+0.056
Neg logit contributions
 Everywhere-0.063
 Int-0.061
 Curious-0.054
 Everything-0.052
 concession-0.051
Token from
Token ID422
Feature activation+4.99e+00

Pos logit contributions
els+0.182
 colored+0.156
 frost+0.151
 icing+0.133
 coloured+0.132
Neg logit contributions
 Fro-0.328
ete-0.325
 Everywhere-0.315
hog-0.314
 Int-0.313
Token his
Token ID465
Feature activation+5.22e+00

Pos logit contributions
els+0.084
 on+0.078
 inside+0.077
 sticker+0.075
 colored+0.074
Neg logit contributions
 Int-0.059
 Everywhere-0.057
 Fro-0.056
ete-0.055
Da-0.050
Token mouth
Token ID5422
Feature activation+1.22e+01

Pos logit contributions
els+0.073
 colored+0.064
 on+0.064
 inside+0.063
ier+0.062
Neg logit contributions
 Fro-0.076
 Everywhere-0.075
 Int-0.075
ete-0.071
 Sadly-0.070
Token.
Token ID13
Feature activation-4.34e+00

Pos logit contributions
els+0.091
 colored+0.067
 inside+0.063
 frost+0.062
 sticker+0.060
Neg logit contributions
 Fro-0.259
 Int-0.259
ete-0.257
 Everywhere-0.255
 Sadly-0.239
Token Max
Token ID5436
Feature activation+3.58e+00

Pos logit contributions
 Int+0.050
ete+0.049
mber+0.048
Da+0.046
 concession+0.045
Neg logit contributions
 on-0.074
 inside-0.072
.-0.072
els-0.069
 sticker-0.068
Token her
Token ID607
Feature activation+2.92e+00

Pos logit contributions
els+0.008
 on+0.007
 inside+0.007
 sticker+0.006
 colored+0.006
Neg logit contributions
 Int-0.016
 Everywhere-0.015
ete-0.015
 Fro-0.015
 Curious-0.015
Token camera
Token ID4676
Feature activation+1.08e+01

Pos logit contributions
els+0.039
 on+0.038
 inside+0.037
 sticker+0.034
.+0.033
Neg logit contributions
 Int-0.046
ete-0.043
 Everywhere-0.043
 Fro-0.042
 Curious-0.041
Token ran
Token ID4966
Feature activation+1.04e+01

Pos logit contributions
els+0.176
 inside+0.160
 on+0.159
 sticker+0.156
 frost+0.151
Neg logit contributions
 Int-0.139
 Everywhere-0.125
ete-0.121
 Fro-0.120
 Curious-0.117
Token out
Token ID503
Feature activation+8.73e+00

Pos logit contributions
els+0.087
 on+0.068
 sticker+0.066
 frost+0.064
 inside+0.063
Neg logit contributions
 Int-0.221
 Everywhere-0.210
 Fro-0.205
ete-0.204
Da-0.201
Token of
Token ID286
Feature activation+6.23e+00

Pos logit contributions
els+0.113
 frost+0.103
 sticker+0.099
 stamp+0.097
 colored+0.092
Neg logit contributions
 Int-0.135
 Everywhere-0.128
Da-0.125
 Fro-0.124
 Curious-0.121
Token battery
Token ID6555
Feature activation+1.80e+01

Pos logit contributions
ier+0.103
els+0.097
 sticker+0.087
ps+0.085
le+0.081
Neg logit contributions
Da-0.075
 Int-0.073
 Everywhere-0.073
 Fro-0.070
Bet-0.065
Token.
Token ID13
Feature activation-6.02e+00

Pos logit contributions
els+0.127
 frost+0.108
 sticker+0.103
 icing+0.092
 colored+0.088
Neg logit contributions
 Int-0.357
ete-0.350
 Fro-0.344
hog-0.337
 Everywhere-0.335
Token
Token ID220
Feature activation-4.94e+00

Pos logit contributions
ete+0.098
 concession+0.097
mber+0.086
isky+0.086
ules+0.085
Neg logit contributions
.-0.086
 on-0.086
 inside-0.082
 paper-0.066
 tape-0.065
Token
Token ID198
Feature activation-5.83e+00

Pos logit contributions
 concession+0.076
mber+0.074
ete+0.073
 Int+0.072
 Fro+0.071
Neg logit contributions
.-0.070
 on-0.070
 inside-0.067
els-0.062
 sticker-0.054
Token
Token ID198
Feature activation-7.24e+00

Pos logit contributions
ete+0.091
 concession+0.090
ishly+0.088
 Int+0.087
 Fro+0.085
Neg logit contributions
 on-0.084
.-0.081
 inside-0.077
els-0.063
,"-0.063
TokenL
Token ID43
Feature activation-3.61e+00

Pos logit contributions
ete+0.109
 Fro+0.107
 Int+0.100
 Everywhere+0.095
 Dod+0.095
Neg logit contributions
 on-0.108
 inside-0.104
.-0.101
els-0.092
 colored-0.091
Token his
Token ID465
Feature activation+6.11e+00

Pos logit contributions
els+0.096
ier+0.081
ps+0.075
ries+0.074
 stamp+0.070
Neg logit contributions
 Everywhere-0.087
 Int-0.085
 Curious-0.081
 Fro-0.076
ete-0.074
Token pocket
Token ID10000
Feature activation+1.37e+01

Pos logit contributions
els+0.072
 on+0.070
 inside+0.067
 sticker+0.061
.+0.061
Neg logit contributions
 Int-0.107
ete-0.101
 Everywhere-0.100
 Fro-0.098
Da-0.095
Token and
Token ID290
Feature activation+1.47e+00

Pos logit contributions
els+0.156
 inside+0.125
 on+0.124
ps+0.122
 colored+0.121
Neg logit contributions
 Int-0.243
ete-0.232
 Everywhere-0.232
 Sadly-0.225
 Fro-0.225
Token found
Token ID1043
Feature activation+7.28e+00

Pos logit contributions
els+0.022
 on+0.020
 inside+0.019
 sticker+0.019
.+0.018
Neg logit contributions
 Int-0.022
ete-0.020
 Fro-0.020
 Everywhere-0.019
 Curious-0.019
Token some
Token ID617
Feature activation+1.11e+01

Pos logit contributions
els+0.100
 on+0.084
 colored+0.082
 inside+0.080
 sticker+0.080
Neg logit contributions
 Int-0.115
 Everywhere-0.112
 Fro-0.109
ete-0.107
 Everything-0.104
Token coins
Token ID10796
Feature activation+1.80e+01

Pos logit contributions
els+0.127
ps+0.113
ws+0.105
cks+0.105
bones+0.103
Neg logit contributions
 Everywhere-0.164
 Int-0.158
 concession-0.157
ete-0.151
 Fro-0.151
Token.
Token ID13
Feature activation-5.03e+00

Pos logit contributions
els+0.176
 colored+0.143
 frost+0.140
 harm+0.137
 spills+0.135
Neg logit contributions
ete-0.312
 Fro-0.295
hog-0.273
 He-0.272
 Int-0.267
Token He
Token ID679
Feature activation+1.78e+00

Pos logit contributions
ete+0.059
Ag+0.055
ternally+0.054
Da+0.052
ust+0.052
Neg logit contributions
 on-0.091
.-0.086
 inside-0.084
 sticker-0.077
 tape-0.075
Token put
Token ID1234
Feature activation+1.11e+01

Pos logit contributions
els+0.021
 on+0.020
 inside+0.020
 sticker+0.018
 colored+0.018
Neg logit contributions
 Int-0.031
ete-0.030
 Everywhere-0.029
 Fro-0.029
Da-0.028
Token the
Token ID262
Feature activation+4.80e+00

Pos logit contributions
els+0.140
 harm+0.135
 spills+0.128
 colored+0.124
 code+0.123
Neg logit contributions
 Everywhere-0.156
 Fro-0.147
 Int-0.145
ete-0.143
 Everything-0.134
Token coins
Token ID10796
Feature activation+1.70e+01

Pos logit contributions
els+0.052
 harm+0.042
bones+0.039
ck+0.038
 inside+0.038
Neg logit contributions
 Int-0.077
 concession-0.076
ete-0.074
 Everywhere-0.074
 Fro-0.071
Token.
Token ID13
Feature activation-3.53e+00

Pos logit contributions
els+0.130
 colored+0.092
 frost+0.088
 on+0.085
 inside+0.085
Neg logit contributions
 Int-0.323
 Everywhere-0.318
ete-0.314
 Fro-0.312
Da-0.305
Token It
Token ID632
Feature activation+4.13e+00

Pos logit contributions
ete+0.056
liv+0.050
mber+0.049
 Fro+0.048
oph+0.048
Neg logit contributions
 on-0.047
 inside-0.046
.-0.044
."-0.042
 sticker-0.039
Token's
Token ID338
Feature activation+5.23e+00

Pos logit contributions
 on+0.053
 inside+0.051
els+0.050
.+0.047
 sticker+0.043
Neg logit contributions
ete-0.068
 Int-0.068
 Everywhere-0.065
 Fro-0.065
Da-0.064
Token made
Token ID925
Feature activation+7.43e+00

Pos logit contributions
els+0.088
 sticker+0.075
 frost+0.073
 colored+0.071
 inside+0.071
Neg logit contributions
 Int-0.062
 Everywhere-0.060
 Fro-0.059
 Barkley-0.055
Da-0.055
Token of
Token ID286
Feature activation+6.64e+00

Pos logit contributions
els+0.111
 on+0.108
 inside+0.105
 sticker+0.098
.+0.097
Neg logit contributions
 Int-0.107
ete-0.098
 Everywhere-0.097
 Fro-0.095
Da-0.092
Token gold
Token ID3869
Feature activation+1.80e+01

Pos logit contributions
els+0.079
 inside+0.069
 on+0.069
ier+0.066
 sticker+0.062
Neg logit contributions
 Fro-0.104
 Int-0.103
 Everywhere-0.101
Da-0.097
 Barkley-0.094
Token and
Token ID290
Feature activation+4.39e+00

Pos logit contributions
els+0.196
bones+0.163
 colored+0.156
 harm+0.150
ps+0.143
Neg logit contributions
 Int-0.283
ete-0.281
 Fro-0.274
 Everywhere-0.274
mber-0.268
Token has
Token ID468
Feature activation+6.63e+00

Pos logit contributions
els+0.051
 inside+0.041
 on+0.041
ier+0.039
ps+0.039
Neg logit contributions
 Int-0.075
 Everywhere-0.071
 Fro-0.071
ete-0.068
 Curious-0.067
Token a
Token ID257
Feature activation+6.57e+00

Pos logit contributions
els+0.084
 on+0.077
 inside+0.076
 sticker+0.069
 colored+0.066
Neg logit contributions
 Int-0.109
 Everywhere-0.103
ete-0.100
 Fro-0.100
Da-0.096
Token heart
Token ID2612
Feature activation+1.49e+01

Pos logit contributions
els+0.103
ries+0.084
ps+0.082
ier+0.076
 colored+0.075
Neg logit contributions
 Int-0.091
 Fro-0.090
 Sadly-0.087
ete-0.085
 Curious-0.085
Token-
Token ID12
Feature activation+5.28e+00

Pos logit contributions
els+0.139
 problems+0.093
 colored+0.090
 spills+0.090
 harm+0.087
Neg logit contributions
 Int-0.254
 Everywhere-0.246
 As-0.241
ickets-0.239
 She-0.236
Token filled
Token ID5901
Feature activation+9.22e+00

Pos logit contributions
els+0.087
 sticker+0.071
 frost+0.069
ries+0.067
 on+0.067
Neg logit contributions
 Fro-0.075
 Int-0.074
 Everywhere-0.072
ete-0.071
Da-0.067
Token with
Token ID351
Feature activation+8.54e+00

Pos logit contributions
els+0.094
 sticker+0.075
 on+0.074
 inside+0.074
ps+0.072
Neg logit contributions
 Int-0.173
ete-0.166
 Everywhere-0.165
 Fro-0.164
Da-0.161
Token lots
Token ID6041
Feature activation+1.19e+01

Pos logit contributions
els+0.105
 on+0.096
 inside+0.092
 sticker+0.090
ps+0.084
Neg logit contributions
 Int-0.140
 Everywhere-0.135
 Fro-0.132
ete-0.132
 Curious-0.126
Token of
Token ID286
Feature activation+7.64e+00

Pos logit contributions
els+0.210
ps+0.203
ws+0.194
 sticker+0.191
 on+0.184
Neg logit contributions
 Everywhere-0.117
 Int-0.115
pill-0.110
ete-0.107
 concession-0.100
Token shiny
Token ID22441
Feature activation+1.51e+01

Pos logit contributions
els+0.087
 on+0.080
 inside+0.076
 sticker+0.075
ps+0.074
Neg logit contributions
 Int-0.132
 Everywhere-0.126
 Curious-0.121
 Fro-0.120
ete-0.120
Token coins
Token ID10796
Feature activation+1.80e+01

Pos logit contributions
 sticker+0.163
ps+0.159
 harm+0.154
ck+0.152
ws+0.149
Neg logit contributions
pill-0.207
 Int-0.201
 Curious-0.191
 Everywhere-0.189
 He-0.182
Token.
Token ID13
Feature activation-5.71e+00

Pos logit contributions
els+0.154
 sticker+0.125
 frost+0.114
 colored+0.110
ps+0.107
Neg logit contributions
ete-0.338
 Int-0.319
 Fro-0.314
 Barkley-0.301
 Everywhere-0.301
Token He
Token ID679
Feature activation+1.45e+00

Pos logit contributions
 Fro+0.063
Da+0.058
ete+0.057
Bet+0.053
pill+0.053
Neg logit contributions
.-0.104
 inside-0.103
 on-0.101
els-0.094
 frost-0.091
Token had
Token ID550
Feature activation+4.65e+00

Pos logit contributions
 on+0.018
els+0.018
 inside+0.018
 colored+0.017
.+0.016
Neg logit contributions
 Int-0.024
 Everywhere-0.023
 Fro-0.022
ete-0.022
Da-0.022
Token never
Token ID1239
Feature activation+5.62e+00

Pos logit contributions
els+0.070
 on+0.066
 inside+0.064
 sticker+0.061
.+0.059
Neg logit contributions
 Int-0.066
ete-0.061
 Everywhere-0.061
 Fro-0.059
 Curious-0.057
Token seen
Token ID1775
Feature activation+1.02e+01

Pos logit contributions
els+0.096
 on+0.083
 sticker+0.082
 inside+0.081
ps+0.079
Neg logit contributions
 Int-0.071
ete-0.061
 Curious-0.061
 Fro-0.061
Da-0.059
Token saw
Token ID2497
Feature activation+4.92e+00

Pos logit contributions
els+0.067
 on+0.066
 inside+0.064
.+0.058
 sticker+0.058
Neg logit contributions
 Int-0.094
 Everywhere-0.087
ete-0.087
 Fro-0.085
Da-0.083
Token some
Token ID617
Feature activation+9.38e+00

Pos logit contributions
 on+0.066
els+0.065
 inside+0.064
.+0.059
 sticker+0.057
Neg logit contributions
 Int-0.078
ete-0.074
 Everywhere-0.073
Da-0.069
 Curious-0.069
Token more
Token ID517
Feature activation+9.96e+00

Pos logit contributions
els+0.119
 inside+0.106
 on+0.105
ps+0.096
 sticker+0.095
Neg logit contributions
 Int-0.148
 Fro-0.141
ete-0.139
 Everywhere-0.137
 concession-0.135
Token che
Token ID1125
Feature activation+9.87e+00

Pos logit contributions
els+0.108
ps+0.098
 on+0.093
 inside+0.089
 sticker+0.086
Neg logit contributions
 Int-0.166
pill-0.163
 Fro-0.156
 Everywhere-0.155
Da-0.150
Tokenwed
Token ID19103
Feature activation+1.42e+01

Pos logit contributions
 inside+0.111
 on+0.109
 sticker+0.103
.+0.101
els+0.095
Neg logit contributions
ete-0.154
 Fro-0.149
 Int-0.145
Da-0.144
pill-0.142
Token threads
Token ID14390
Feature activation+1.80e+01

Pos logit contributions
els+0.132
 sticker+0.121
 stamp+0.111
 frost+0.111
 colored+0.111
Neg logit contributions
 Fro-0.253
 Everywhere-0.247
 Int-0.245
ete-0.241
pill-0.227
Token in
Token ID287
Feature activation+6.09e+00

Pos logit contributions
els+0.195
 sticker+0.159
 frost+0.151
 colored+0.144
ps+0.143
Neg logit contributions
 Int-0.304
ete-0.294
 Everywhere-0.292
 Fro-0.289
Da-0.284
Token the
Token ID262
Feature activation+3.81e+00

Pos logit contributions
els+0.087
 inside+0.076
ier+0.071
 on+0.071
 colored+0.070
Neg logit contributions
 Fro-0.088
 Everywhere-0.083
 Int-0.082
Da-0.081
ete-0.078
Token house
Token ID2156
Feature activation+1.30e+01

Pos logit contributions
els+0.047
 on+0.044
 inside+0.043
 sticker+0.039
.+0.038
Neg logit contributions
 Int-0.066
 Everywhere-0.062
ete-0.061
 Fro-0.060
Da-0.059
Token.
Token ID13
Feature activation-5.29e+00

Pos logit contributions
els+0.097
 sticker+0.063
ps+0.058
 colored+0.057
 inside+0.055
Neg logit contributions
 Int-0.274
 Everywhere-0.264
 Sadly-0.261
ete-0.260
 Fro-0.258
Token Fl
Token ID1610
Feature activation-1.52e+00

Pos logit contributions
Harry+0.064
ete+0.064
David+0.061
Bet+0.061
mber+0.060
Neg logit contributions
 on-0.087
.-0.086
 inside-0.084
 chocolate-0.074
 sticker-0.074
Token that
Token ID326
Feature activation+2.58e+00

Pos logit contributions
 on+0.027
 inside+0.025
.+0.023
els+0.022
 colored+0.022
Neg logit contributions
 Int-0.034
ete-0.033
 Everywhere-0.030
Da-0.029
 Fro-0.029
Token she
Token ID673
Feature activation+2.32e+00

Pos logit contributions
 on+0.034
 inside+0.032
els+0.032
.+0.030
 sticker+0.028
Neg logit contributions
 Int-0.043
ete-0.041
 Everywhere-0.039
 Fro-0.038
Da-0.038
Token could
Token ID714
Feature activation+2.52e+00

Pos logit contributions
 on+0.033
 inside+0.032
.+0.030
els+0.029
 colored+0.028
Neg logit contributions
 Int-0.037
ete-0.035
 Everywhere-0.034
 Fro-0.034
Da-0.034
Token find
Token ID1064
Feature activation+9.10e+00

Pos logit contributions
 on+0.030
.+0.028
 inside+0.027
els+0.025
 sticker+0.024
Neg logit contributions
 Int-0.047
ete-0.044
mber-0.044
Da-0.044
 Sadly-0.042
Token more
Token ID517
Feature activation+1.00e+01

Pos logit contributions
els+0.104
 on+0.095
 inside+0.092
 sticker+0.089
 colored+0.083
Neg logit contributions
 Int-0.161
ete-0.152
 Everywhere-0.152
 Fro-0.150
Da-0.145
Token treasures
Token ID29561
Feature activation+1.80e+01

Pos logit contributions
ps+0.130
els+0.130
 spills+0.118
bones+0.113
 sticker+0.107
Neg logit contributions
 Everywhere-0.138
 Int-0.134
pill-0.127
mber-0.126
So-0.125
Token if
Token ID611
Feature activation+6.15e+00

Pos logit contributions
els+0.202
 frost+0.156
ps+0.154
 colored+0.152
 sticker+0.150
Neg logit contributions
ete-0.301
 Int-0.300
 Everywhere-0.296
 Fro-0.292
Da-0.281
Token she
Token ID673
Feature activation+5.77e+00

Pos logit contributions
els+0.096
 on+0.092
 inside+0.090
 sticker+0.085
 colored+0.082
Neg logit contributions
 Int-0.084
ete-0.078
 Everywhere-0.077
 Fro-0.076
Da-0.072
Token kept
Token ID4030
Feature activation+1.01e+01

Pos logit contributions
els+0.091
 on+0.088
 inside+0.085
 sticker+0.080
.+0.078
Neg logit contributions
 Int-0.078
ete-0.072
 Everywhere-0.071
 Fro-0.070
 Curious-0.067
Token looking
Token ID2045
Feature activation+1.18e+01

Pos logit contributions
els+0.129
 sticker+0.119
 frost+0.114
 tape+0.108
ier+0.108
Neg logit contributions
 Fro-0.145
 Barkley-0.141
 Int-0.140
ete-0.135
Ag-0.135
Token.
Token ID13
Feature activation-6.11e+00

Pos logit contributions
els+0.129
 sticker+0.100
ps+0.097
 frost+0.094
 colored+0.092
Neg logit contributions
 Int-0.209
ete-0.207
 Everywhere-0.207
 Fro-0.194
 Curious-0.191
Token.
Token ID13
Feature activation-2.97e+00

Pos logit contributions
els+0.108
 frost+0.090
 sticker+0.084
 colored+0.083
 stamp+0.080
Neg logit contributions
 Int-0.195
 Everywhere-0.188
 As-0.181
 Fro-0.180
ete-0.178
Token We
Token ID775
Feature activation+3.30e+00

Pos logit contributions
 Int+0.032
ete+0.031
Da+0.030
 Fro+0.029
mber+0.029
Neg logit contributions
els-0.053
 on-0.052
 inside-0.052
 sticker-0.049
.-0.049
Token just
Token ID655
Feature activation+5.96e+00

Pos logit contributions
els+0.049
 on+0.043
 colored+0.041
 sticker+0.041
 inside+0.040
Neg logit contributions
 Int-0.045
 Sadly-0.043
ete-0.043
 Everywhere-0.043
 Everything-0.041
Token need
Token ID761
Feature activation+8.72e+00

Pos logit contributions
els+0.093
ries+0.077
 on+0.077
 sticker+0.077
ps+0.074
Neg logit contributions
 Int-0.079
 Everything-0.072
 Everywhere-0.072
 Follow-0.071
 Sadly-0.071
Token some
Token ID617
Feature activation+1.10e+01

Pos logit contributions
els+0.140
bones+0.124
ier+0.121
ps+0.120
le+0.119
Neg logit contributions
 Int-0.099
 Everywhere-0.098
 Everything-0.092
ete-0.090
 As-0.089
Token glue
Token ID22749
Feature activation+1.79e+01

Pos logit contributions
els+0.123
bones+0.122
ps+0.112
cks+0.108
ck+0.103
Neg logit contributions
 Everywhere-0.173
 Int-0.166
 So-0.156
 concession-0.152
 Everything-0.151
Token and
Token ID290
Feature activation+5.90e+00

Pos logit contributions
els+0.223
bones+0.191
 colored+0.191
ck+0.181
 harm+0.180
Neg logit contributions
 Everywhere-0.240
ete-0.237
 Fro-0.229
 Int-0.221
ables-0.212
Token a
Token ID257
Feature activation+6.01e+00

Pos logit contributions
els+0.052
 on+0.041
 inside+0.040
bones+0.037
ier+0.037
Neg logit contributions
 Int-0.114
 Everywhere-0.107
 Everything-0.102
Of-0.101
Sadly-0.101
Token new
Token ID649
Feature activation+9.56e+00

Pos logit contributions
els+0.091
ps+0.075
ries+0.072
bones+0.071
cks+0.066
Neg logit contributions
 Int-0.093
 Sadly-0.088
Of-0.088
 Fro-0.084
 Curious-0.083
Token wheel
Token ID7825
Feature activation+1.46e+01

Pos logit contributions
els+0.130
ps+0.111
 on+0.099
bones+0.099
cks+0.098
Neg logit contributions
 Int-0.157
 Curious-0.142
 Everywhere-0.141
 Fro-0.139
Of-0.138
Token.
Token ID13
Feature activation-3.33e+00

Pos logit contributions
els+0.134
 colored+0.097
 inside+0.091
 frost+0.090
 on+0.088
Neg logit contributions
 Int-0.266
 Everywhere-0.260
ete-0.254
 Fro-0.254
 concession-0.249
Token by
Token ID416
Feature activation-9.92e+00

Pos logit contributions
 Int+0.072
ete+0.071
 Fro+0.068
 concession+0.066
mber+0.062
Neg logit contributions
 on-0.089
.-0.085
 inside-0.084
els-0.079
 sticker-0.074
Token one
Token ID530
Feature activation-5.95e+00

Pos logit contributions
 Int+0.173
 Everywhere+0.162
ete+0.162
 Fro+0.159
Da+0.154
Neg logit contributions
els-0.115
 on-0.109
 inside-0.106
 sticker-0.099
 colored-0.093
Token
Token ID6184
Feature activation-9.14e+00

Pos logit contributions
 Int+0.106
 Everywhere+0.102
ete+0.100
 Fro+0.096
 Curious+0.095
Neg logit contributions
els-0.064
 inside-0.055
 on-0.054
ps-0.052
 sticker-0.051
Token
Token ID95
Feature activation-1.10e+01

Pos logit contributions
 Int+0.107
ete+0.096
 Everywhere+0.095
 Fro+0.094
Da+0.094
Neg logit contributions
els-0.158
 on-0.153
 inside-0.151
.-0.145
 sticker-0.143
Token
Token ID26391
Feature activation-1.74e+01

Pos logit contributions
 Int+0.117
 Fro+0.112
ete+0.110
Da+0.109
 Everywhere+0.100
Neg logit contributions
 on-0.190
els-0.190
 inside-0.189
.-0.188
 sticker-0.180
Token
Token ID447
Feature activation-2.13e+01

Pos logit contributions
Da+0.225
 Int+0.221
ete+0.216
 concession+0.213
 Everywhere+0.213
Neg logit contributions
.-0.273
 on-0.263
 inside-0.238
 sticker-0.227
els-0.225
Token
Token ID250
Feature activation-1.44e+01

Pos logit contributions
 Int+0.296
ete+0.285
 Everywhere+0.284
Da+0.275
 Fro+0.275
Neg logit contributions
 on-0.299
els-0.279
 sticker-0.279
.-0.276
 inside-0.267
Token 1
Token ID352
Feature activation-3.18e+00

Pos logit contributions
 Int+0.145
ete+0.137
 Fro+0.130
 Everywhere+0.127
 concession+0.118
Neg logit contributions
 on-0.266
els-0.264
 inside-0.253
 sticker-0.247
.-0.244
Token,
Token ID11
Feature activation-7.73e+00

Pos logit contributions
 Int+0.063
ete+0.060
 Everywhere+0.060
 Fro+0.059
mber+0.057
Neg logit contributions
els-0.028
 inside-0.026
 on-0.024
 sticker-0.022
 colored-0.022
Token 2
Token ID362
Feature activation-3.74e+00

Pos logit contributions
 Int+0.100
ete+0.092
 Everywhere+0.090
 Fro+0.089
Da+0.086
Neg logit contributions
els-0.125
 on-0.124
 inside-0.120
.-0.112
 sticker-0.112
Token,
Token ID11
Feature activation-7.15e+00

Pos logit contributions
 Int+0.073
ete+0.070
 Everywhere+0.068
 Fro+0.067
mber+0.067
Neg logit contributions
els-0.033
 inside-0.032
 on-0.029
 sticker-0.028
ps-0.027
Token perfect
Token ID2818
Feature activation-4.69e+00

Pos logit contributions
 Int+0.212
ete+0.201
 Everywhere+0.200
 Fro+0.198
 Curious+0.188
Neg logit contributions
els-0.150
 on-0.140
 inside-0.136
 sticker-0.123
 colored-0.121
Token bag
Token ID6131
Feature activation+1.87e+00

Pos logit contributions
 Int+0.078
ete+0.077
 Everywhere+0.076
 Fro+0.076
mber+0.074
Neg logit contributions
els-0.056
 on-0.042
 inside-0.041
ps-0.040
 colored-0.038
Token
Token ID6184
Feature activation-6.01e+00

Pos logit contributions
els+0.014
 on+0.011
 inside+0.011
 sticker+0.010
 colored+0.009
Neg logit contributions
 Int-0.040
ete-0.039
 Everywhere-0.039
 Fro-0.038
 concession-0.037
Token
Token ID95
Feature activation-9.21e+00

Pos logit contributions
 Int+0.074
ete+0.068
 Everywhere+0.067
 Fro+0.065
Da+0.063
Neg logit contributions
els-0.102
 on-0.099
 inside-0.097
 sticker-0.091
.-0.091
Token
Token ID26391
Feature activation-1.53e+01

Pos logit contributions
 Int+0.106
 Fro+0.104
ete+0.102
Da+0.100
 Everywhere+0.094
Neg logit contributions
 on-0.154
 inside-0.151
els-0.149
.-0.148
 sticker-0.143
Token
Token ID447
Feature activation-1.95e+01

Pos logit contributions
 Int+0.206
ete+0.204
Da+0.203
 Everywhere+0.200
 Fro+0.194
Neg logit contributions
 on-0.233
.-0.232
 inside-0.209
 sticker-0.196
els-0.189
Token
Token ID250
Feature activation-1.37e+01

Pos logit contributions
 Int+0.304
ete+0.299
 Everywhere+0.295
 Fro+0.289
Da+0.288
Neg logit contributions
 on-0.246
 sticker-0.224
.-0.221
 inside-0.209
els-0.207
Token now
Token ID783
Feature activation-6.30e+00

Pos logit contributions
 Int+0.132
ete+0.121
 Fro+0.115
 Everywhere+0.115
Da+0.105
Neg logit contributions
els-0.263
 on-0.262
 inside-0.253
 sticker-0.245
.-0.240
Token I
Token ID314
Feature activation-7.29e+00

Pos logit contributions
 Int+0.088
ete+0.081
 Everywhere+0.081
 Fro+0.079
Da+0.076
Neg logit contributions
els-0.095
 on-0.094
 inside-0.091
.-0.085
 sticker-0.085
Token'm
Token ID1101
Feature activation-7.46e+00

Pos logit contributions
 Int+0.107
ete+0.099
 Everywhere+0.099
 Fro+0.095
 Curious+0.093
Neg logit contributions
els-0.108
 on-0.100
 inside-0.098
 sticker-0.092
 colored-0.090
Token very
Token ID845
Feature activation-9.13e+00

Pos logit contributions
 Int+0.113
ete+0.105
 Everywhere+0.104
 Fro+0.102
Da+0.099
Neg logit contributions
els-0.105
 on-0.102
 inside-0.099
 sticker-0.092
.-0.091
Token of
Token ID286
Feature activation-8.61e+00

Pos logit contributions
 Int+0.126
ete+0.120
 Everywhere+0.120
 Fro+0.118
 Curious+0.114
Neg logit contributions
els-0.042
 on-0.036
 inside-0.035
 sticker-0.032
 colored-0.029
Token the
Token ID262
Feature activation-1.04e+01

Pos logit contributions
 Int+0.143
 Everywhere+0.140
 Fro+0.136
ete+0.134
Da+0.132
Neg logit contributions
els-0.100
 on-0.087
 inside-0.086
 sticker-0.085
ier-0.078
Token stream
Token ID4269
Feature activation+4.55e-02

Pos logit contributions
 Int+0.151
ete+0.143
 Everywhere+0.143
 Fro+0.140
 concession+0.139
Neg logit contributions
els-0.148
 on-0.132
 inside-0.128
ps-0.120
 sticker-0.120
Token.
Token ID13
Feature activation-1.54e+01

Pos logit contributions
els+0.000
 on+0.000
 inside+0.000
 sticker+0.000
.+0.000
Neg logit contributions
 Int-0.001
ete-0.001
 Everywhere-0.001
 Fro-0.001
Da-0.001
Token<|endoftext|>
Token ID50256
Feature activation-1.27e+01

Pos logit contributions
ete+0.156
 Int+0.155
 concession+0.150
ternally+0.148
mber+0.146
Neg logit contributions
.-0.263
 on-0.259
 inside-0.248
els-0.232
 sticker-0.216
TokenM
Token ID44
Feature activation-1.94e+01

Pos logit contributions
 Int+0.152
ete+0.139
 Everywhere+0.136
 Fro+0.132
Da+0.127
Neg logit contributions
els-0.218
 on-0.216
 inside-0.210
.-0.198
 sticker-0.196
Tokenolly
Token ID5098
Feature activation-1.43e+01

Pos logit contributions
 Int+0.242
 Curious+0.203
 Everywhere+0.202
 Follow+0.196
Da+0.195
Neg logit contributions
els-0.338
 on-0.330
.-0.327
 inside-0.314
 frost-0.310
Token was
Token ID373
Feature activation-1.04e+01

Pos logit contributions
 Int+0.302
ete+0.286
 Everywhere+0.285
 Fro+0.280
Da+0.274
Neg logit contributions
els-0.115
 on-0.114
 inside-0.108
.-0.095
 sticker-0.090
Token a
Token ID257
Feature activation-6.23e+00

Pos logit contributions
 Int+0.200
ete+0.193
 Everywhere+0.186
mber+0.183
 Fro+0.183
Neg logit contributions
 on-0.112
 inside-0.107
els-0.101
.-0.092
 colored-0.086
Token three
Token ID1115
Feature activation-3.63e+00

Pos logit contributions
 Int+0.096
ete+0.089
 Everywhere+0.088
 Fro+0.085
Da+0.083
Neg logit contributions
els-0.086
 on-0.086
 inside-0.083
 sticker-0.077
.-0.077
Token-
Token ID12
Feature activation-1.08e+01

Pos logit contributions
 Int+0.051
ete+0.046
mber+0.046
 Everywhere+0.046
 Fro+0.045
Neg logit contributions
els-0.051
 inside-0.048
 on-0.048
 sticker-0.047
ps-0.046
Token had
Token ID550
Feature activation-1.37e+00

Pos logit contributions
els+0.014
 on+0.012
 inside+0.012
 sticker+0.012
 frost+0.012
Neg logit contributions
 Int-0.012
ete-0.011
 Everywhere-0.011
 Fro-0.010
 Curious-0.010
Token saved
Token ID7448
Feature activation+3.48e+00

Pos logit contributions
 Int+0.019
ete+0.018
 Everywhere+0.017
Da+0.017
 Fro+0.017
Neg logit contributions
els-0.022
 sticker-0.019
 inside-0.018
 on-0.018
 frost-0.018
Token him
Token ID683
Feature activation+4.25e+00

Pos logit contributions
els+0.048
 sticker+0.044
 on+0.044
 inside+0.043
 frost+0.042
Neg logit contributions
 Int-0.050
ete-0.050
 Fro-0.049
 Everywhere-0.048
Da-0.046
Token.
Token ID13
Feature activation-1.28e+01

Pos logit contributions
els+0.031
 frost+0.021
 colored+0.021
 sticker+0.021
 on+0.019
Neg logit contributions
 Int-0.092
ete-0.091
 Everywhere-0.089
 Fro-0.089
 Sadly-0.085
Token<|endoftext|>
Token ID50256
Feature activation-9.32e+00

Pos logit contributions
ternally+0.194
 Int+0.192
 concession+0.190
rolling+0.181
Da+0.180
Neg logit contributions
.-0.181
 on-0.157
 inside-0.144
."-0.133
 sticker-0.122
TokenCharl
Token ID24453
Feature activation-1.93e+01

Pos logit contributions
 Int+0.103
ete+0.093
 Everywhere+0.092
 Fro+0.089
Da+0.085
Neg logit contributions
els-0.169
 on-0.165
 inside-0.162
 sticker-0.153
.-0.151
Tokenotte
Token ID11404
Feature activation-1.07e+01

Pos logit contributions
 Int+0.246
 Everywhere+0.226
ete+0.217
 Fro+0.211
 Curious+0.209
Neg logit contributions
els-0.326
 on-0.320
 inside-0.310
.-0.287
 sticker-0.281
Token and
Token ID290
Feature activation-9.83e+00

Pos logit contributions
 Int+0.189
ete+0.177
 Everywhere+0.175
 Fro+0.172
Da+0.167
Neg logit contributions
els-0.125
 on-0.120
 inside-0.116
 sticker-0.106
.-0.105
Token her
Token ID607
Feature activation-9.42e+00

Pos logit contributions
rig+0.248
 Int+0.236
pill+0.233
ete+0.230
rolling+0.228
Neg logit contributions
 on-0.070
 inside-0.060
.-0.056
,-0.035
 the-0.031
Token mom
Token ID1995
Feature activation-3.50e+00

Pos logit contributions
 Int+0.181
Da+0.166
 Everywhere+0.162
ete+0.160
 Fro+0.159
Neg logit contributions
 on-0.109
 inside-0.106
.-0.100
 sticker-0.094
 paper-0.089
Token were
Token ID547
Feature activation-4.73e+00

Pos logit contributions
 Int+0.052
ete+0.048
 Everywhere+0.048
 Fro+0.047
Da+0.045
Neg logit contributions
els-0.050
 on-0.048
 inside-0.047
 sticker-0.044
.-0.043
Token was
Token ID373
Feature activation-1.14e+01

Pos logit contributions
 on+0.011
els+0.010
 inside+0.010
 sticker+0.010
 colored+0.010
Neg logit contributions
 Int-0.007
ete-0.007
 Fro-0.006
mber-0.006
 Everywhere-0.006
Token happy
Token ID3772
Feature activation-4.24e+00

Pos logit contributions
 Int+0.186
ete+0.173
 Everywhere+0.171
 Fro+0.168
Da+0.163
Neg logit contributions
els-0.147
 on-0.146
 inside-0.142
.-0.129
 sticker-0.129
Token too
Token ID1165
Feature activation-7.08e+00

Pos logit contributions
 Int+0.091
ete+0.086
 Everywhere+0.085
 Fro+0.084
Da+0.082
Neg logit contributions
els-0.034
 on-0.032
 inside-0.030
 sticker-0.026
.-0.025
Token.
Token ID13
Feature activation-1.57e+01

Pos logit contributions
 Int+0.166
ete+0.160
 Everywhere+0.158
 Fro+0.154
 Curious+0.152
Neg logit contributions
els-0.048
 on-0.032
 sticker-0.029
 inside-0.028
ps-0.027
Token<|endoftext|>
Token ID50256
Feature activation-1.27e+01

Pos logit contributions
 concession+0.189
ete+0.188
ternally+0.188
 Int+0.186
Mos+0.180
Neg logit contributions
.-0.259
 on-0.231
 inside-0.219
."-0.212
,"-0.197
TokenS
Token ID50
Feature activation-1.91e+01

Pos logit contributions
 Int+0.134
ete+0.120
 Everywhere+0.119
 Fro+0.115
Da+0.109
Neg logit contributions
els-0.236
 on-0.231
 inside-0.226
 sticker-0.214
.-0.212
Tokenara
Token ID3301
Feature activation-1.30e+01

Pos logit contributions
 Int+0.229
 Everywhere+0.199
 Curious+0.192
Da+0.185
 Fro+0.184
Neg logit contributions
els-0.345
 on-0.329
 inside-0.325
.-0.314
 sticker-0.304
Token liked
Token ID8288
Feature activation-7.22e+00

Pos logit contributions
 Int+0.249
ete+0.235
 Everywhere+0.234
 Fro+0.233
 Curious+0.224
Neg logit contributions
els-0.129
 on-0.114
 inside-0.109
 sticker-0.106
ps-0.097
Token to
Token ID284
Feature activation-8.51e+00

Pos logit contributions
 Int+0.138
ete+0.132
 Everywhere+0.129
 Fro+0.125
 Curious+0.124
Neg logit contributions
 on-0.074
els-0.073
 inside-0.070
.-0.065
 sticker-0.063
Token play
Token ID711
Feature activation-2.03e+00

Pos logit contributions
 Int+0.149
ete+0.139
 Everywhere+0.138
 Fro+0.136
Da+0.132
Neg logit contributions
els-0.100
 on-0.096
 inside-0.093
 sticker-0.085
.-0.084
Token dress
Token ID6576
Feature activation+2.94e+00

Pos logit contributions
 Int+0.039
 Fro+0.037
 Everywhere+0.036
ete+0.036
Da+0.035
Neg logit contributions
els-0.020
 sticker-0.016
 inside-0.016
 on-0.015
ps-0.015
Token helped
Token ID4193
Feature activation-7.92e+00

Pos logit contributions
 Fro+0.097
 Int+0.093
 Dod+0.089
ete+0.088
 Everywhere+0.087
Neg logit contributions
 on-0.099
.-0.094
 inside-0.093
els-0.092
 colored-0.088
Token him
Token ID683
Feature activation-3.97e+00

Pos logit contributions
 Int+0.135
ete+0.126
 Everywhere+0.125
 Fro+0.123
Da+0.119
Neg logit contributions
els-0.097
 on-0.093
 inside-0.090
 sticker-0.083
.-0.081
Token fly
Token ID6129
Feature activation+2.89e+00

Pos logit contributions
 Int+0.074
ete+0.070
 Everywhere+0.070
 Fro+0.068
 Curious+0.066
Neg logit contributions
els-0.042
 on-0.036
 inside-0.034
 sticker-0.034
 frost-0.033
Token.
Token ID13
Feature activation-1.52e+01

Pos logit contributions
els+0.022
 on+0.022
 inside+0.020
.+0.018
 sticker+0.017
Neg logit contributions
 Int-0.063
ete-0.059
 Everywhere-0.059
 Fro-0.058
 Curious-0.057
Token<|endoftext|>
Token ID50256
Feature activation-1.25e+01

Pos logit contributions
 Int+0.171
 concession+0.164
ete+0.159
Mos+0.157
ternally+0.155
Neg logit contributions
.-0.251
 on-0.244
 inside-0.231
els-0.224
 tape-0.206
TokenM
Token ID44
Feature activation-1.90e+01

Pos logit contributions
 Int+0.131
ete+0.117
 Everywhere+0.115
 Fro+0.112
Da+0.106
Neg logit contributions
els-0.235
 on-0.231
 inside-0.226
 sticker-0.214
.-0.213
Tokenum
Token ID388
Feature activation-8.48e+00

Pos logit contributions
 Int+0.241
 Curious+0.203
 Everywhere+0.202
Da+0.196
 Follow+0.193
Neg logit contributions
els-0.326
 on-0.320
.-0.317
 sticker-0.301
 frost-0.300
Token and
Token ID290
Feature activation-1.06e+01

Pos logit contributions
 Int+0.148
ete+0.139
 Everywhere+0.136
 Fro+0.136
 Curious+0.130
Neg logit contributions
els-0.097
 on-0.089
 inside-0.087
 sticker-0.085
ps-0.077
Token Dad
Token ID17415
Feature activation-7.50e+00

Pos logit contributions
 Int+0.204
ete+0.200
rig+0.199
 Everywhere+0.198
Sadly+0.198
Neg logit contributions
 on-0.128
 inside-0.115
.-0.094
els-0.085
 frost-0.083
Token had
Token ID550
Feature activation-5.20e+00

Pos logit contributions
 Int+0.114
ete+0.108
 Everywhere+0.105
 Fro+0.103
 Everything+0.100
Neg logit contributions
els-0.102
 on-0.091
 sticker-0.089
 inside-0.088
ps-0.085
Token a
Token ID257
Feature activation-4.43e+00

Pos logit contributions
 Int+0.095
ete+0.089
 Everywhere+0.089
 Fro+0.087
Da+0.085
Neg logit contributions
els-0.058
 on-0.056
 inside-0.054
 sticker-0.049
.-0.048
Token it
Token ID340
Feature activation+8.19e-01

Pos logit contributions
 Int+0.038
 Fro+0.037
ete+0.036
 Everywhere+0.035
 Barkley+0.035
Neg logit contributions
els-0.043
 sticker-0.041
ier-0.039
 on-0.039
 colored-0.039
Token forever
Token ID8097
Feature activation-1.66e+00

Pos logit contributions
els+0.010
bones+0.008
 frost+0.008
 colored+0.008
 sticker+0.008
Neg logit contributions
 Int-0.013
ete-0.013
 Everywhere-0.013
 Fro-0.012
Da-0.012
Token
Token ID6184
Feature activation-6.12e+00

Pos logit contributions
 Int+0.031
 Everywhere+0.030
ete+0.029
 Curious+0.028
 Fro+0.028
Neg logit contributions
els-0.017
ps-0.012
bones-0.012
 frost-0.012
 sticker-0.012
Token
Token ID95
Feature activation-9.22e+00

Pos logit contributions
 Int+0.075
ete+0.068
 Everywhere+0.067
 Fro+0.065
Da+0.063
Neg logit contributions
els-0.105
 on-0.102
 inside-0.100
 sticker-0.094
.-0.093
Token
Token ID26391
Feature activation-1.50e+01

Pos logit contributions
 Int+0.104
 Fro+0.101
ete+0.099
Da+0.096
 Everywhere+0.093
Neg logit contributions
 on-0.155
 inside-0.153
els-0.152
.-0.149
 sticker-0.148
Token
Token ID447
Feature activation-1.90e+01

Pos logit contributions
Da+0.208
ete+0.203
 Everywhere+0.202
 Int+0.201
 concession+0.199
Neg logit contributions
.-0.226
 on-0.223
 inside-0.193
 sticker-0.188
."-0.183
Token
Token ID250
Feature activation-1.26e+01

Pos logit contributions
 Int+0.303
ete+0.291
 Everywhere+0.290
 Fro+0.281
Da+0.277
Neg logit contributions
 on-0.237
 sticker-0.220
els-0.216
.-0.213
 inside-0.207
Token things
Token ID1243
Feature activation-1.74e+00

Pos logit contributions
 Int+0.116
ete+0.103
 Everywhere+0.101
 Fro+0.098
Da+0.091
Neg logit contributions
els-0.251
 on-0.246
 inside-0.241
 sticker-0.230
.-0.228
Token are
Token ID389
Feature activation-5.80e+00

Pos logit contributions
 Int+0.026
 Everywhere+0.025
ete+0.024
 Everything+0.023
 Fro+0.022
Neg logit contributions
els-0.024
 on-0.020
 sticker-0.020
 inside-0.020
 colored-0.020
Token only
Token ID691
Feature activation-2.83e+00

Pos logit contributions
 Everywhere+0.077
ete+0.074
 Int+0.072
 Fro+0.068
 Barkley+0.067
Neg logit contributions
els-0.089
 sticker-0.075
 frost-0.072
ries-0.071
 colored-0.070
Token available
Token ID1695
Feature activation+2.58e+00

Pos logit contributions
 Int+0.043
 Everywhere+0.042
ete+0.042
mber+0.040
 Fro+0.040
Neg logit contributions
els-0.037
 sticker-0.028
 frost-0.027
 inside-0.027
ries-0.027
Token the
Token ID262
Feature activation-7.29e+00

Pos logit contributions
 Int+0.036
 Everywhere+0.034
ete+0.034
 Fro+0.033
Da+0.033
Neg logit contributions
els-0.020
 sticker-0.017
 on-0.016
 inside-0.015
ps-0.015
Token ocean
Token ID9151
Feature activation+1.15e+00

Pos logit contributions
 Int+0.098
 Everywhere+0.090
ete+0.090
 Fro+0.088
Da+0.084
Neg logit contributions
els-0.115
 on-0.108
 inside-0.105
 sticker-0.100
.-0.098
Token again
Token ID757
Feature activation-1.89e+00

Pos logit contributions
els+0.007
 sticker+0.005
 on+0.005
 inside+0.005
 frost+0.004
Neg logit contributions
 Int-0.026
ete-0.025
 Everywhere-0.025
 Fro-0.025
 concession-0.024
Token.
Token ID13
Feature activation-1.47e+01

Pos logit contributions
 Int+0.041
ete+0.041
 Fro+0.040
Da+0.039
 Everywhere+0.039
Neg logit contributions
els-0.013
 sticker-0.010
ps-0.009
 frost-0.009
 tape-0.008
Token<|endoftext|>
Token ID50256
Feature activation-1.21e+01

Pos logit contributions
ternally+0.188
 concession+0.185
Mos+0.178
 Int+0.178
rig+0.170
Neg logit contributions
.-0.221
 on-0.206
 inside-0.195
."-0.174
els-0.166
TokenM
Token ID44
Feature activation-1.90e+01

Pos logit contributions
 Int+0.148
ete+0.135
 Everywhere+0.132
 Fro+0.129
Da+0.125
Neg logit contributions
 on-0.203
els-0.203
 inside-0.198
.-0.187
 sticker-0.183
Tokenum
Token ID388
Feature activation-8.51e+00

Pos logit contributions
 Int+0.236
 Curious+0.197
 Everywhere+0.193
Da+0.189
 Follow+0.188
Neg logit contributions
els-0.332
 on-0.326
.-0.324
 inside-0.308
 sticker-0.305
Token looked
Token ID3114
Feature activation-7.58e+00

Pos logit contributions
 Int+0.149
ete+0.141
 Everywhere+0.138
 Fro+0.137
Da+0.132
Neg logit contributions
els-0.098
 on-0.093
 inside-0.090
 sticker-0.084
.-0.080
Token at
Token ID379
Feature activation-5.06e+00

Pos logit contributions
 Int+0.178
 Everywhere+0.167
ete+0.166
Da+0.165
 Fro+0.163
Neg logit contributions
 inside-0.053
 on-0.052
els-0.043
.-0.039
 sticker-0.026
Token her
Token ID607
Feature activation-5.42e+00

Pos logit contributions
 Int+0.105
ete+0.099
 Everywhere+0.097
 Fro+0.096
Da+0.094
Neg logit contributions
els-0.044
 on-0.042
 inside-0.041
.-0.035
 sticker-0.035
Token daughter
Token ID4957
Feature activation-9.57e-01

Pos logit contributions
 Int+0.087
ete+0.080
 Everywhere+0.079
 Fro+0.078
Da+0.077
Neg logit contributions
els-0.074
 on-0.072
 inside-0.071
.-0.065
 sticker-0.064
Token made
Token ID925
Feature activation-6.40e+00

Pos logit contributions
els+0.037
 on+0.035
 inside+0.034
 sticker+0.032
.+0.031
Neg logit contributions
 Int-0.046
ete-0.042
 Fro-0.042
 Everywhere-0.042
Da-0.041
Token them
Token ID606
Feature activation-1.71e+00

Pos logit contributions
 Int+0.119
ete+0.111
 Everywhere+0.111
 Fro+0.109
Da+0.106
Neg logit contributions
els-0.069
 on-0.066
 inside-0.064
 sticker-0.058
.-0.057
Token happy
Token ID3772
Feature activation-2.05e+00

Pos logit contributions
 Int+0.028
 Fro+0.026
 Everywhere+0.026
ete+0.026
 Curious+0.025
Neg logit contributions
els-0.022
 inside-0.018
 on-0.018
 sticker-0.017
 colored-0.017
Token.
Token ID13
Feature activation-1.58e+01

Pos logit contributions
 Int+0.045
 Everywhere+0.042
ete+0.042
 Fro+0.042
 Curious+0.041
Neg logit contributions
els-0.015
 on-0.012
 inside-0.012
 sticker-0.011
 colored-0.009
Token<|endoftext|>
Token ID50256
Feature activation-1.25e+01

Pos logit contributions
 Int+0.190
ete+0.186
 concession+0.178
ternally+0.174
mber+0.170
Neg logit contributions
.-0.259
 on-0.249
 inside-0.234
els-0.228
."-0.214
TokenS
Token ID50
Feature activation-1.90e+01

Pos logit contributions
 Int+0.125
 Everywhere+0.111
ete+0.111
 Fro+0.107
Da+0.100
Neg logit contributions
els-0.240
 on-0.231
 inside-0.228
 sticker-0.217
 colored-0.213
Tokenara
Token ID3301
Feature activation-1.28e+01

Pos logit contributions
 Int+0.225
 Everywhere+0.195
 Curious+0.189
Da+0.181
 Fro+0.178
Neg logit contributions
els-0.347
 on-0.329
 inside-0.324
.-0.314
 frost-0.307
Token and
Token ID290
Feature activation-1.24e+01

Pos logit contributions
 Int+0.240
 Fro+0.226
ete+0.225
 Everywhere+0.224
 Curious+0.216
Neg logit contributions
els-0.131
 on-0.114
 inside-0.110
 sticker-0.108
ps-0.099
Token Ben
Token ID3932
Feature activation-1.07e+01

Pos logit contributions
rig+0.278
 Int+0.269
pill+0.268
ete+0.264
Sadly+0.261
Neg logit contributions
 on-0.129
 inside-0.114
.-0.108
,"-0.075
,-0.074
Token were
Token ID547
Feature activation-8.32e+00

Pos logit contributions
 Int+0.178
ete+0.166
 Everywhere+0.165
 Fro+0.161
Da+0.160
Neg logit contributions
 on-0.135
els-0.132
 inside-0.131
.-0.124
 sticker-0.116
Token playing
Token ID2712
Feature activation-3.15e-01

Pos logit contributions
 Int+0.167
ete+0.157
 Everywhere+0.154
mber+0.153
Da+0.150
Neg logit contributions
 on-0.094
 inside-0.085
els-0.078
.-0.075
 frost-0.064
Token in
Token ID287
Feature activation-5.53e+00

Pos logit contributions
 Int+0.029
 Everywhere+0.029
ete+0.028
 Fro+0.027
mber+0.026
Neg logit contributions
els-0.016
 sticker-0.014
 on-0.014
 inside-0.013
ps-0.013
Token her
Token ID607
Feature activation-5.93e+00

Pos logit contributions
 Int+0.089
 Everywhere+0.088
ete+0.085
 Fro+0.082
 Curious+0.079
Neg logit contributions
els-0.065
 sticker-0.055
ps-0.055
 inside-0.055
 on-0.054
Token heart
Token ID2612
Feature activation+1.90e+00

Pos logit contributions
 Int+0.094
 Everywhere+0.089
ete+0.088
 Fro+0.085
 Curious+0.083
Neg logit contributions
els-0.077
 on-0.072
 inside-0.070
 sticker-0.065
 colored-0.065
Token.
Token ID13
Feature activation-1.51e+01

Pos logit contributions
els+0.014
 on+0.010
 inside+0.010
 sticker+0.009
 frost+0.009
Neg logit contributions
 Int-0.040
 Everywhere-0.039
ete-0.039
mber-0.038
 Fro-0.037
Token<|endoftext|>
Token ID50256
Feature activation-1.26e+01

Pos logit contributions
ete+0.156
 concession+0.149
 Int+0.148
liv+0.142
ternally+0.141
Neg logit contributions
.-0.269
 on-0.259
 inside-0.254
els-0.245
 sticker-0.224
TokenS
Token ID50
Feature activation-1.90e+01

Pos logit contributions
 Int+0.146
ete+0.131
 Everywhere+0.131
 Fro+0.127
Da+0.121
Neg logit contributions
els-0.223
 on-0.215
 inside-0.211
 sticker-0.200
.-0.197
Tokenara
Token ID3301
Feature activation-1.32e+01

Pos logit contributions
 Int+0.213
 Everywhere+0.185
 Curious+0.176
 Fro+0.174
ete+0.174
Neg logit contributions
els-0.353
 on-0.338
 inside-0.333
.-0.320
 sticker-0.312
Token and
Token ID290
Feature activation-1.24e+01

Pos logit contributions
 Int+0.244
ete+0.228
 Everywhere+0.228
 Fro+0.228
 Curious+0.219
Neg logit contributions
els-0.138
 on-0.122
 inside-0.114
 sticker-0.114
 frost-0.105
Token Ben
Token ID3932
Feature activation-1.10e+01

Pos logit contributions
rig+0.287
 Int+0.279
Sadly+0.274
ete+0.273
pill+0.273
Neg logit contributions
 on-0.114
 inside-0.101
.-0.093
,"-0.062
,-0.060
Token like
Token ID588
Feature activation-6.68e+00

Pos logit contributions
 Int+0.178
ete+0.166
 Everywhere+0.164
 Fro+0.161
Da+0.161
Neg logit contributions
 on-0.144
 inside-0.142
els-0.141
.-0.134
 sticker-0.125
Token to
Token ID284
Feature activation-7.78e+00

Pos logit contributions
 Int+0.119
ete+0.111
 Everywhere+0.111
 Fro+0.109
Da+0.106
Neg logit contributions
els-0.077
 on-0.074
 inside-0.071
 sticker-0.065
.-0.064
Token catch
Token ID4929
Feature activation+2.10e+00

Pos logit contributions
 Int+0.086
ete+0.083
 Everywhere+0.083
 Fro+0.082
 Everything+0.076
Neg logit contributions
els-0.069
 on-0.062
 inside-0.061
 sticker-0.056
 frost-0.055
Token a
Token ID257
Feature activation-3.43e+00

Pos logit contributions
els+0.019
 on+0.018
 inside+0.017
 sticker+0.016
 colored+0.015
Neg logit contributions
 Int-0.042
ete-0.039
 Everywhere-0.039
 Fro-0.039
Da-0.038
Token star
Token ID3491
Feature activation+4.64e+00

Pos logit contributions
 Int+0.052
 Fro+0.051
ete+0.050
 Everywhere+0.049
Da+0.049
Neg logit contributions
els-0.049
 on-0.041
 inside-0.041
ps-0.040
 colored-0.039
Token.
Token ID13
Feature activation-1.49e+01

Pos logit contributions
els+0.038
ps+0.026
 colored+0.025
 frost+0.025
 inside+0.025
Neg logit contributions
 Fro-0.090
ete-0.090
 Int-0.090
 Everywhere-0.088
mber-0.086
Token<|endoftext|>
Token ID50256
Feature activation-1.22e+01

Pos logit contributions
 Int+0.172
 concession+0.166
ete+0.164
ternally+0.161
rolling+0.155
Neg logit contributions
.-0.243
 on-0.231
 inside-0.216
els-0.204
."-0.200
TokenS
Token ID50
Feature activation-1.90e+01

Pos logit contributions
 Int+0.131
ete+0.119
 Everywhere+0.115
 Fro+0.112
Da+0.107
Neg logit contributions
els-0.225
 on-0.223
 inside-0.218
.-0.206
 sticker-0.204
Tokenandy
Token ID10757
Feature activation-1.40e+01

Pos logit contributions
 Int+0.224
 Everywhere+0.190
 Curious+0.188
Da+0.179
 Follow+0.175
Neg logit contributions
els-0.348
 on-0.328
 inside-0.324
.-0.318
 frost-0.307
Token was
Token ID373
Feature activation-9.73e+00

Pos logit contributions
 Int+0.277
ete+0.260
 Everywhere+0.258
 Fro+0.253
Da+0.252
Neg logit contributions
 on-0.140
els-0.135
 inside-0.133
.-0.126
 colored-0.114
Token a
Token ID257
Feature activation-5.94e+00

Pos logit contributions
 Int+0.183
ete+0.180
mber+0.171
 Everywhere+0.170
 Fro+0.166
Neg logit contributions
 on-0.112
 inside-0.106
els-0.096
.-0.092
 colored-0.084
Token little
Token ID1310
Feature activation-6.50e+00

Pos logit contributions
 Int+0.095
ete+0.089
 Everywhere+0.088
 Fro+0.084
Da+0.083
Neg logit contributions
 on-0.080
els-0.077
 inside-0.077
.-0.072
 sticker-0.071
Token girl
Token ID2576
Feature activation-4.60e+00

Pos logit contributions
 Int+0.089
 Fro+0.081
ete+0.080
 Everywhere+0.080
 Curious+0.077
Neg logit contributions
els-0.101
 on-0.095
 inside-0.093
 sticker-0.088
ps-0.086
Token her
Token ID607
Feature activation-1.22e+01

Pos logit contributions
els+0.057
 sticker+0.047
 on+0.046
 colored+0.044
 frost+0.044
Neg logit contributions
 Int-0.110
ete-0.104
 Everywhere-0.103
 Fro-0.102
 Curious-0.099
Token first
Token ID717
Feature activation-6.70e+00

Pos logit contributions
 Int+0.208
ete+0.195
 Everywhere+0.194
 Fro+0.191
 Curious+0.184
Neg logit contributions
els-0.148
 on-0.141
 inside-0.137
 sticker-0.125
.-0.122
Token tire
Token ID15867
Feature activation-1.42e+00

Pos logit contributions
 Int+0.125
ete+0.119
 Everywhere+0.119
 Fro+0.117
 Curious+0.113
Neg logit contributions
els-0.071
 on-0.062
 inside-0.062
 sticker-0.055
 colored-0.053
Token.
Token ID13
Feature activation-1.54e+01

Pos logit contributions
 Int+0.025
ete+0.023
 Everywhere+0.023
 Fro+0.023
 Curious+0.022
Neg logit contributions
els-0.017
 on-0.016
 inside-0.015
 sticker-0.014
.-0.014
Token<|endoftext|>
Token ID50256
Feature activation-1.29e+01

Pos logit contributions
ete+0.165
 Int+0.153
 concession+0.148
 Fro+0.141
mber+0.140
Neg logit contributions
.-0.273
 on-0.271
 inside-0.255
els-0.238
 sticker-0.229
TokenM
Token ID44
Feature activation-1.89e+01

Pos logit contributions
 Int+0.146
ete+0.133
 Everywhere+0.129
 Fro+0.126
Da+0.121
Neg logit contributions
els-0.228
 on-0.228
 inside-0.222
.-0.210
 sticker-0.207
Tokenummy
Token ID13513
Feature activation-1.04e+01

Pos logit contributions
 Int+0.216
 Everywhere+0.179
 Curious+0.177
Da+0.167
 Follow+0.164
Neg logit contributions
els-0.350
 on-0.343
.-0.335
 inside-0.326
 sticker-0.319
Token and
Token ID290
Feature activation-1.12e+01

Pos logit contributions
 Int+0.200
ete+0.189
 Everywhere+0.187
 Fro+0.185
 Curious+0.179
Neg logit contributions
els-0.102
 on-0.095
 inside-0.091
 sticker-0.084
.-0.078
Token Daddy
Token ID26926
Feature activation-7.52e+00

Pos logit contributions
 Int+0.229
rig+0.228
Sadly+0.224
ete+0.224
 Everywhere+0.219
Neg logit contributions
 on-0.127
 inside-0.109
.-0.090
els-0.075
 colored-0.074
Token were
Token ID547
Feature activation-6.91e+00

Pos logit contributions
 Int+0.112
ete+0.106
 Everywhere+0.104
 Fro+0.103
 Everything+0.098
Neg logit contributions
els-0.105
 on-0.096
 inside-0.093
 sticker-0.092
ps-0.086
Token talking
Token ID3375
Feature activation+1.19e+00

Pos logit contributions
 Int+0.134
ete+0.126
 Everywhere+0.126
 Fro+0.122
Da+0.122
Neg logit contributions
 on-0.076
 inside-0.070
els-0.068
.-0.061
 frost-0.055
Token good
Token ID922
Feature activation-5.03e+00

Pos logit contributions
 Fro+0.103
 Int+0.103
mber+0.098
ete+0.096
 concession+0.096
Neg logit contributions
els-0.117
ps-0.094
ries-0.091
 on-0.088
 inside-0.087
Token summer
Token ID3931
Feature activation+8.65e-01

Pos logit contributions
 Fro+0.065
ete+0.061
 Int+0.059
mber+0.058
 Everywhere+0.057
Neg logit contributions
els-0.086
ries-0.069
ps-0.069
 on-0.067
bones-0.064
Token day
Token ID1110
Feature activation-1.61e+00

Pos logit contributions
els+0.012
ps+0.009
 inside+0.008
 sticker+0.008
 on+0.008
Neg logit contributions
 Int-0.013
mber-0.013
ete-0.013
 concession-0.013
 Everywhere-0.013
Token.
Token ID13
Feature activation-1.48e+01

Pos logit contributions
 Int+0.035
ete+0.034
 Everywhere+0.034
 Fro+0.033
mber+0.032
Neg logit contributions
els-0.012
 sticker-0.008
 inside-0.007
 on-0.007
 colored-0.007
Token<|endoftext|>
Token ID50256
Feature activation-1.19e+01

Pos logit contributions
ete+0.184
 concession+0.172
rolling+0.171
 Int+0.170
ternally+0.168
Neg logit contributions
.-0.235
 on-0.227
 inside-0.206
."-0.203
 sticker-0.196
TokenS
Token ID50
Feature activation-1.89e+01

Pos logit contributions
 Int+0.127
ete+0.113
 Everywhere+0.113
 Fro+0.109
Da+0.103
Neg logit contributions
els-0.223
 on-0.215
 inside-0.212
 sticker-0.201
.-0.197
Tokenara
Token ID3301
Feature activation-1.26e+01

Pos logit contributions
 Int+0.240
 Everywhere+0.207
 Curious+0.203
Da+0.194
 Fro+0.191
Neg logit contributions
els-0.332
 on-0.317
 inside-0.311
.-0.302
 sticker-0.290
Token and
Token ID290
Feature activation-1.25e+01

Pos logit contributions
 Int+0.244
ete+0.229
 Everywhere+0.228
 Fro+0.227
 Curious+0.220
Neg logit contributions
els-0.123
 on-0.109
 inside-0.104
 sticker-0.099
 frost-0.091
Token Tom
Token ID4186
Feature activation-9.89e+00

Pos logit contributions
rig+0.292
 Int+0.284
ete+0.283
pill+0.280
Sadly+0.277
Neg logit contributions
 on-0.118
 inside-0.102
.-0.094
,"-0.062
,-0.060
Token were
Token ID547
Feature activation-8.50e+00

Pos logit contributions
 Int+0.148
ete+0.138
 Everywhere+0.136
 Fro+0.133
Da+0.130
Neg logit contributions
els-0.141
 on-0.138
 inside-0.135
.-0.124
 sticker-0.124
Token very
Token ID845
Feature activation-6.75e+00

Pos logit contributions
 Int+0.169
ete+0.158
 Everywhere+0.156
mber+0.153
Da+0.151
Neg logit contributions
 on-0.097
 inside-0.088
els-0.081
.-0.077
 frost-0.067
Token went
Token ID1816
Feature activation-6.23e+00

Pos logit contributions
 Int+0.179
ete+0.169
 Everywhere+0.168
 Fro+0.165
Da+0.161
Neg logit contributions
els-0.083
 on-0.079
 inside-0.076
 sticker-0.067
.-0.066
Token to
Token ID284
Feature activation-8.28e+00

Pos logit contributions
 Int+0.135
ete+0.128
 Fro+0.128
 Everywhere+0.128
Da+0.122
Neg logit contributions
els-0.050
 sticker-0.036
 on-0.036
 inside-0.036
 frost-0.036
Token bed
Token ID3996
Feature activation-1.90e+00

Pos logit contributions
 Int+0.146
ete+0.136
 Everywhere+0.135
 Fro+0.133
Da+0.129
Neg logit contributions
els-0.097
 on-0.094
 inside-0.091
 sticker-0.083
.-0.082
Token.
Token ID13
Feature activation-1.54e+01

Pos logit contributions
 Int+0.040
ete+0.038
 Everywhere+0.037
 Fro+0.037
Da+0.036
Neg logit contributions
els-0.016
 on-0.014
 inside-0.013
 sticker-0.012
 colored-0.012
Token<|endoftext|>
Token ID50256
Feature activation-1.27e+01

Pos logit contributions
ete+0.176
ternally+0.174
 concession+0.168
isky+0.163
 Int+0.161
Neg logit contributions
.-0.253
 on-0.242
 inside-0.233
."-0.202
 sticker-0.199
TokenM
Token ID44
Feature activation-1.89e+01

Pos logit contributions
 Int+0.134
ete+0.121
 Everywhere+0.118
 Fro+0.115
Da+0.110
Neg logit contributions
els-0.237
 on-0.233
 inside-0.228
 sticker-0.215
.-0.215
Tokenum
Token ID388
Feature activation-8.33e+00

Pos logit contributions
 Int+0.240
 Curious+0.202
 Everywhere+0.200
Da+0.192
 Follow+0.191
Neg logit contributions
els-0.325
 on-0.321
.-0.313
 inside-0.301
 sticker-0.299
Token and
Token ID290
Feature activation-1.05e+01

Pos logit contributions
 Int+0.139
ete+0.131
 Everywhere+0.128
 Fro+0.127
Da+0.123
Neg logit contributions
els-0.101
 on-0.094
 inside-0.091
 sticker-0.089
ps-0.081
Token Dad
Token ID17415
Feature activation-7.26e+00

Pos logit contributions
 Int+0.195
ete+0.190
 Everywhere+0.188
Sadly+0.187
rig+0.186
Neg logit contributions
 on-0.133
 inside-0.122
.-0.101
els-0.096
 frost-0.091
Token carefully
Token ID7773
Feature activation-9.34e-01

Pos logit contributions
 Int+0.104
ete+0.098
 Everywhere+0.095
 Fro+0.093
 Everything+0.091
Neg logit contributions
els-0.105
 on-0.094
 sticker-0.092
 inside-0.091
ps-0.088
Token took
Token ID1718
Feature activation-7.51e-01

Pos logit contributions
 Int+0.012
 Fro+0.011
ete+0.011
 concession+0.011
 Exc+0.010
Neg logit contributions
els-0.014
ier-0.012
ps-0.012
 sticker-0.012
 thing-0.012
Token the
Token ID262
Feature activation-7.03e+00

Pos logit contributions
 Int+0.060
 Fro+0.055
 Everywhere+0.055
ete+0.054
 Curious+0.052
Neg logit contributions
els-0.046
 on-0.039
 inside-0.039
 sticker-0.037
 frost-0.036
Token most
Token ID749
Feature activation-2.35e+00

Pos logit contributions
 Int+0.118
 Fro+0.111
 Everywhere+0.110
ete+0.109
 concession+0.105
Neg logit contributions
els-0.084
 on-0.075
 inside-0.074
 sticker-0.066
 colored-0.065
Token successful
Token ID4388
Feature activation-1.02e-01

Pos logit contributions
 Int+0.035
ete+0.033
 Fro+0.033
 concession+0.032
rig+0.032
Neg logit contributions
els-0.031
 on-0.026
 inside-0.026
ps-0.025
 sticker-0.025
Token.
Token ID13
Feature activation-1.53e+01

Pos logit contributions
 Int+0.002
ete+0.002
mber+0.002
 Everywhere+0.002
 concession+0.002
Neg logit contributions
els-0.001
 on-0.001
 inside-0.001
ps-0.001
 sticker-0.001
Token<|endoftext|>
Token ID50256
Feature activation-1.24e+01

Pos logit contributions
 Int+0.170
ete+0.166
 concession+0.161
Da+0.156
mber+0.153
Neg logit contributions
.-0.256
 on-0.250
els-0.239
 inside-0.234
 sticker-0.223
TokenM
Token ID44
Feature activation-1.89e+01

Pos logit contributions
 Int+0.134
ete+0.120
 Everywhere+0.119
 Fro+0.115
Da+0.109
Neg logit contributions
els-0.227
 on-0.221
 inside-0.217
 sticker-0.206
.-0.203
Tokenandy
Token ID10757
Feature activation-1.37e+01

Pos logit contributions
 Int+0.246
 Everywhere+0.208
 Curious+0.208
Da+0.197
 Follow+0.195
Neg logit contributions
els-0.323
 on-0.316
.-0.310
 frost-0.297
 inside-0.297
Token was
Token ID373
Feature activation-8.98e+00

Pos logit contributions
 Int+0.258
ete+0.242
 Everywhere+0.241
 Fro+0.236
Da+0.233
Neg logit contributions
 on-0.145
els-0.144
 inside-0.138
.-0.130
 colored-0.120
Token walking
Token ID6155
Feature activation-3.03e+00

Pos logit contributions
 Int+0.159
ete+0.154
 Everywhere+0.148
mber+0.145
 Fro+0.145
Neg logit contributions
 on-0.109
 inside-0.104
els-0.101
.-0.092
 sticker-0.087
Token home
Token ID1363
Feature activation+1.43e-01

Pos logit contributions
 Int+0.064
 Everywhere+0.061
ete+0.061
 Fro+0.060
 Curious+0.058
Neg logit contributions
els-0.025
 on-0.022
 inside-0.021
 sticker-0.020
 colored-0.018
Token from
Token ID422
Feature activation-5.86e+00

Pos logit contributions
els+0.001
 sticker+0.001
 frost+0.001
ps+0.001
 colored+0.001
Neg logit contributions
 Int-0.003
 Fro-0.003
 Everywhere-0.003
 Curious-0.002
ete-0.002
Token room
Token ID2119
Feature activation-3.08e+00

Pos logit contributions
 Int+0.208
ete+0.195
 Everywhere+0.194
 Fro+0.191
 Curious+0.184
Neg logit contributions
els-0.148
 on-0.141
 inside-0.137
 sticker-0.125
.-0.122
Token before
Token ID878
Feature activation-7.11e+00

Pos logit contributions
 Int+0.063
 Everywhere+0.060
ete+0.059
 Fro+0.057
 Curious+0.057
Neg logit contributions
els-0.027
 on-0.019
 inside-0.019
 sticker-0.018
ps-0.017
Token dinner
Token ID8073
Feature activation+2.48e-02

Pos logit contributions
 Int+0.114
 Fro+0.106
 Everywhere+0.105
ete+0.104
 Everything+0.101
Neg logit contributions
els-0.094
 inside-0.076
 on-0.076
 sticker-0.075
ps-0.075
Token.
Token ID13
Feature activation-1.52e+01

Pos logit contributions
els+0.000
 on+0.000
 inside+0.000
 sticker+0.000
 colored+0.000
Neg logit contributions
 Int-0.000
ete-0.000
 Everywhere-0.000
 Fro-0.000
mber-0.000
Token<|endoftext|>
Token ID50256
Feature activation-1.24e+01

Pos logit contributions
ete+0.169
 Int+0.164
 concession+0.162
ternally+0.158
rolling+0.155
Neg logit contributions
.-0.259
 on-0.243
 inside-0.235
els-0.213
."-0.212
TokenS
Token ID50
Feature activation-1.89e+01

Pos logit contributions
 Int+0.140
ete+0.127
 Everywhere+0.125
 Fro+0.121
Da+0.116
Neg logit contributions
els-0.222
 on-0.218
 inside-0.213
 sticker-0.201
.-0.200
Tokenaman
Token ID10546
Feature activation-1.59e+01

Pos logit contributions
 Int+0.212
 Everywhere+0.182
 Curious+0.176
 Fro+0.170
Da+0.169
Neg logit contributions
els-0.354
 on-0.338
 inside-0.336
.-0.323
 frost-0.314
Tokentha
Token ID12898
Feature activation-1.37e+01

Pos logit contributions
 Int+0.249
 Everywhere+0.234
Da+0.218
 Curious+0.217
ete+0.217
Neg logit contributions
els-0.229
 on-0.219
 inside-0.216
.-0.203
 sticker-0.189
Token was
Token ID373
Feature activation-9.96e+00

Pos logit contributions
 Int+0.252
ete+0.238
 Everywhere+0.236
 Fro+0.233
 Curious+0.226
Neg logit contributions
els-0.143
 on-0.131
 inside-0.124
 sticker-0.121
ps-0.111
Token a
Token ID257
Feature activation-6.39e+00

Pos logit contributions
 Int+0.196
ete+0.191
 Everywhere+0.183
mber+0.181
 Fro+0.180
Neg logit contributions
 on-0.105
 inside-0.099
els-0.090
.-0.085
 colored-0.077
Token lazy
Token ID16931
Feature activation-1.55e+00

Pos logit contributions
 Int+0.101
ete+0.095
 Everywhere+0.095
 Fro+0.090
Da+0.088
Neg logit contributions
 on-0.087
 inside-0.083
els-0.082
.-0.078
 sticker-0.078
Token to
Token ID284
Feature activation-6.55e+00

Pos logit contributions
 Int+0.093
 Fro+0.089
 Everywhere+0.088
ete+0.087
Da+0.084
Neg logit contributions
els-0.070
 on-0.063
 inside-0.062
 sticker-0.060
 colored-0.055
Token be
Token ID307
Feature activation-3.01e+00

Pos logit contributions
 Int+0.104
ete+0.097
 Everywhere+0.097
 Fro+0.095
 Curious+0.091
Neg logit contributions
els-0.088
 on-0.084
 inside-0.082
 sticker-0.075
.-0.073
Token sailors
Token ID29996
Feature activation+3.16e+00

Pos logit contributions
 Fro+0.043
 Int+0.043
Da+0.042
ete+0.041
 Barkley+0.040
Neg logit contributions
els-0.042
 sticker-0.035
 spills-0.034
 frost-0.034
 on-0.033
Token.
Token ID13
Feature activation-1.42e+01

Pos logit contributions
els+0.028
 on+0.023
 inside+0.022
 sticker+0.021
 colored+0.020
Neg logit contributions
 Int-0.064
ete-0.061
 Everywhere-0.061
 Fro-0.060
Da-0.058
Token<|endoftext|>
Token ID50256
Feature activation-1.18e+01

Pos logit contributions
 Int+0.175
ete+0.171
mber+0.165
 concession+0.163
Da+0.161
Neg logit contributions
.-0.223
 on-0.217
els-0.206
 inside-0.205
 sticker-0.191
TokenS
Token ID50
Feature activation-1.89e+01

Pos logit contributions
 Int+0.110
 Everywhere+0.098
ete+0.095
 Fro+0.093
 Curious+0.086
Neg logit contributions
els-0.237
 on-0.223
 inside-0.221
 sticker-0.214
 colored-0.210
Tokenara
Token ID3301
Feature activation-1.24e+01

Pos logit contributions
 Int+0.237
 Everywhere+0.207
 Curious+0.201
Da+0.196
 Fro+0.196
Neg logit contributions
els-0.331
 on-0.314
 inside-0.307
.-0.294
 sticker-0.291
Token liked
Token ID8288
Feature activation-7.10e+00

Pos logit contributions
 Int+0.212
 Fro+0.201
 Everywhere+0.201
ete+0.200
 From+0.191
Neg logit contributions
els-0.139
 on-0.119
 sticker-0.117
 inside-0.112
ps-0.108
Token cream
Token ID8566
Feature activation+2.17e+00

Pos logit contributions
 Int+0.133
ete+0.126
 Everywhere+0.124
 Fro+0.121
 Curious+0.118
Neg logit contributions
els-0.075
 on-0.075
 inside-0.072
.-0.065
 sticker-0.065
Token very
Token ID845
Feature activation-5.80e+00

Pos logit contributions
els+0.018
 harm+0.015
 inside+0.015
ps+0.015
 problems+0.014
Neg logit contributions
 Int-0.041
 Fro-0.041
ete-0.039
 concession-0.039
 Everywhere-0.038
Token much
Token ID881
Feature activation+1.32e-01

Pos logit contributions
 Int+0.075
 Everywhere+0.073
ete+0.071
 Fro+0.066
mber+0.065
Neg logit contributions
els-0.090
 inside-0.085
 on-0.081
ps-0.079
 sticker-0.077
Token happy
Token ID3772
Feature activation-3.50e+00

Pos logit contributions
 Int+0.162
ete+0.152
 Fro+0.149
 Everywhere+0.148
Da+0.145
Neg logit contributions
 on-0.143
 inside-0.137
els-0.136
.-0.127
 sticker-0.123
Token than
Token ID621
Feature activation-5.92e+00

Pos logit contributions
 Int+0.079
ete+0.076
 Everywhere+0.076
 Fro+0.075
Da+0.073
Neg logit contributions
els-0.023
 on-0.018
 inside-0.017
 sticker-0.016
 frost-0.013
Token ever
Token ID1683
Feature activation-3.77e+00

Pos logit contributions
 Int+0.092
ete+0.089
 Everywhere+0.089
 Fro+0.087
 Everything+0.082
Neg logit contributions
els-0.078
 on-0.066
 inside-0.066
 sticker-0.066
 colored-0.065
Token.
Token ID13
Feature activation-1.49e+01

Pos logit contributions
 Int+0.061
 Everywhere+0.058
 Fro+0.057
ete+0.056
 Curious+0.055
Neg logit contributions
els-0.049
 sticker-0.039
 frost-0.038
 tape-0.037
 paper-0.037
Token<|endoftext|>
Token ID50256
Feature activation-1.22e+01

Pos logit contributions
 Int+0.205
 concession+0.194
ete+0.190
ternally+0.187
Mos+0.186
Neg logit contributions
.-0.226
 on-0.207
 inside-0.191
."-0.178
els-0.170
TokenM
Token ID44
Feature activation-1.89e+01

Pos logit contributions
 Int+0.143
ete+0.130
 Everywhere+0.127
 Fro+0.124
Da+0.120
Neg logit contributions
 on-0.211
els-0.211
 inside-0.205
.-0.194
 sticker-0.191
Tokenum
Token ID388
Feature activation-8.22e+00

Pos logit contributions
 Int+0.239
 Curious+0.199
 Everywhere+0.197
Da+0.191
 Follow+0.188
Neg logit contributions
 on-0.327
els-0.323
.-0.320
 inside-0.305
 frost-0.298
Token and
Token ID290
Feature activation-1.05e+01

Pos logit contributions
 Int+0.144
ete+0.136
 Everywhere+0.133
 Fro+0.133
Da+0.128
Neg logit contributions
els-0.094
 on-0.087
 inside-0.084
 sticker-0.081
 colored-0.074
Token Dad
Token ID17415
Feature activation-7.40e+00

Pos logit contributions
 Int+0.210
ete+0.206
 Everywhere+0.205
rig+0.204
Sadly+0.204
Neg logit contributions
 on-0.123
 inside-0.111
.-0.088
els-0.076
 frost-0.075
Token were
Token ID547
Feature activation-7.48e+00

Pos logit contributions
 Int+0.112
ete+0.107
 Everywhere+0.103
 Fro+0.101
 Everything+0.098
Neg logit contributions
els-0.102
 on-0.091
 sticker-0.089
 inside-0.088
ps-0.084
Token taking
Token ID2263
Feature activation+4.31e-01

Pos logit contributions
 Int+0.152
 Everywhere+0.144
ete+0.142
 Fro+0.141
Da+0.139
Neg logit contributions
 on-0.074
 inside-0.069
els-0.066
.-0.059
 sticker-0.052
Token for
Token ID329
Feature activation-5.86e+00

Pos logit contributions
 Int+0.085
ete+0.081
 Everywhere+0.080
 Fro+0.079
mber+0.075
Neg logit contributions
els-0.063
 on-0.050
 sticker-0.049
 inside-0.049
 frost-0.048
Token her
Token ID607
Feature activation-7.40e+00

Pos logit contributions
ete+0.086
 Int+0.086
 Fro+0.084
 Everywhere+0.081
Da+0.078
Neg logit contributions
els-0.079
 on-0.070
 sticker-0.068
 inside-0.068
 colored-0.067
Token efforts
Token ID4040
Feature activation+6.15e-01

Pos logit contributions
 Int+0.154
 Everywhere+0.147
ete+0.147
 Fro+0.145
 Curious+0.140
Neg logit contributions
els-0.062
 on-0.053
 inside-0.051
 sticker-0.044
 colored-0.044
Token.
Token ID13
Feature activation-1.58e+01

Pos logit contributions
els+0.005
 on+0.004
 inside+0.003
 sticker+0.003
 colored+0.003
Neg logit contributions
 Int-0.013
ete-0.013
 Everywhere-0.013
 Fro-0.012
Da-0.012
Token<|endoftext|>
Token ID50256
Feature activation-1.25e+01

Pos logit contributions
ete+0.181
 Int+0.176
 concession+0.172
 Fro+0.165
mber+0.160
Neg logit contributions
.-0.262
 on-0.256
 inside-0.245
els-0.234
 sticker-0.223
TokenM
Token ID44
Feature activation-1.89e+01

Pos logit contributions
 Int+0.145
ete+0.132
 Everywhere+0.130
 Fro+0.126
Da+0.121
Neg logit contributions
els-0.220
 on-0.215
 inside-0.211
 sticker-0.198
.-0.197
Tokenia
Token ID544
Feature activation-1.40e+01

Pos logit contributions
 Int+0.232
 Curious+0.193
 Everywhere+0.192
Da+0.185
 Follow+0.183
Neg logit contributions
els-0.329
 on-0.323
.-0.322
 inside-0.307
 frost-0.303
Token was
Token ID373
Feature activation-9.74e+00

Pos logit contributions
 Int+0.255
ete+0.239
 Everywhere+0.237
 Fro+0.233
Da+0.227
Neg logit contributions
els-0.155
 on-0.148
 inside-0.143
 sticker-0.131
.-0.126
Token playing
Token ID2712
Feature activation-1.31e+00

Pos logit contributions
 Int+0.178
ete+0.173
 Everywhere+0.167
 Fro+0.163
mber+0.163
Neg logit contributions
 on-0.113
 inside-0.107
els-0.104
.-0.094
 colored-0.088
Token in
Token ID287
Feature activation-5.26e+00

Pos logit contributions
 Int+0.030
ete+0.028
 Everywhere+0.028
 Fro+0.028
Da+0.027
Neg logit contributions
els-0.009
 on-0.008
 inside-0.008
 sticker-0.006
.-0.006
Token the
Token ID262
Feature activation-5.32e+00

Pos logit contributions
 Int+0.100
 Everywhere+0.094
 Fro+0.092
ete+0.092
Da+0.090
Neg logit contributions
els-0.054
 on-0.050
 inside-0.049
 sticker-0.045
.-0.043
Token who
Token ID508
Feature activation-6.92e+00

Pos logit contributions
 Int+0.060
ete+0.057
 Everywhere+0.055
 Fro+0.055
 Curious+0.052
Neg logit contributions
els-0.055
 on-0.052
 inside-0.051
 sticker-0.049
 colored-0.046
Token saved
Token ID7448
Feature activation+1.19e+00

Pos logit contributions
 Int+0.102
ete+0.094
 Everywhere+0.092
 Fro+0.092
Da+0.089
Neg logit contributions
els-0.101
 on-0.100
 inside-0.097
.-0.091
 sticker-0.089
Token her
Token ID607
Feature activation-3.02e+00

Pos logit contributions
els+0.014
 on+0.012
 inside+0.012
 sticker+0.012
 frost+0.011
Neg logit contributions
 Int-0.020
ete-0.020
 Fro-0.020
 Everywhere-0.020
Da-0.019
Token.
Token ID13
Feature activation-1.45e+01

Pos logit contributions
 Everywhere+0.053
ete+0.053
 Int+0.053
 Fro+0.052
 Sadly+0.050
Neg logit contributions
els-0.032
 on-0.025
 colored-0.024
 inside-0.024
 frost-0.024
Token<|endoftext|>
Token ID50256
Feature activation-1.15e+01

Pos logit contributions
 Int+0.165
 concession+0.158
ete+0.154
ternally+0.151
ext+0.149
Neg logit contributions
.-0.233
 on-0.220
 inside-0.211
els-0.198
."-0.192
TokenM
Token ID44
Feature activation-1.89e+01

Pos logit contributions
 Int+0.143
ete+0.131
 Everywhere+0.128
 Fro+0.125
Da+0.121
Neg logit contributions
els-0.191
 on-0.191
 inside-0.185
.-0.175
 sticker-0.172
Tokenum
Token ID388
Feature activation-8.29e+00

Pos logit contributions
 Int+0.259
 Curious+0.218
 Everywhere+0.211
 Follow+0.208
Da+0.207
Neg logit contributions
els-0.304
 on-0.304
.-0.302
 inside-0.283
 frost-0.283
Token and
Token ID290
Feature activation-1.00e+01

Pos logit contributions
 Int+0.149
ete+0.140
 Everywhere+0.138
 Fro+0.136
Da+0.132
Neg logit contributions
els-0.093
 on-0.089
 inside-0.086
 sticker-0.079
.-0.076
Token Dad
Token ID17415
Feature activation-6.90e+00

Pos logit contributions
 Int+0.210
 Everywhere+0.206
Sadly+0.205
ete+0.204
rig+0.202
Neg logit contributions
 on-0.104
 inside-0.094
.-0.071
els-0.063
 away-0.060
Token had
Token ID550
Feature activation-4.66e+00

Pos logit contributions
 Int+0.107
ete+0.102
 Everywhere+0.099
 Fro+0.098
 Everything+0.094
Neg logit contributions
els-0.093
 on-0.084
 inside-0.081
 sticker-0.081
 colored-0.076
Token a
Token ID257
Feature activation-4.18e+00

Pos logit contributions
 Int+0.087
ete+0.081
 Everywhere+0.081
 Fro+0.079
Da+0.077
Neg logit contributions
 on-0.050
els-0.050
 inside-0.048
.-0.043
 sticker-0.042
Token said
Token ID531
Feature activation+2.39e+00

Pos logit contributions
 Int+0.014
ete+0.013
 Everywhere+0.013
 Fro+0.013
Da+0.012
Neg logit contributions
els-0.010
 on-0.010
 inside-0.010
.-0.009
 sticker-0.009
Token.
Token ID13
Feature activation-7.85e+00

Pos logit contributions
els+0.019
 colored+0.014
 frost+0.013
 sticker+0.012
 spills+0.012
Neg logit contributions
 Int-0.052
ete-0.048
 Fro-0.048
 So-0.047
 Curious-0.047
Token She
Token ID1375
Feature activation-1.11e+00

Pos logit contributions
ete+0.095
 Int+0.090
 concession+0.087
mber+0.086
 Fro+0.083
Neg logit contributions
 on-0.133
.-0.130
 inside-0.126
els-0.124
 sticker-0.115
Token put
Token ID1234
Feature activation+8.19e+00

Pos logit contributions
 Int+0.020
ete+0.018
 Everywhere+0.018
 Fro+0.018
Da+0.017
Neg logit contributions
els-0.013
 on-0.013
 inside-0.012
 sticker-0.011
.-0.011
Token the
Token ID262
Feature activation+2.88e+00

Pos logit contributions
 harm+0.107
els+0.098
ge+0.096
 code+0.094
 problems+0.093
Neg logit contributions
 Fro-0.111
 Everywhere-0.111
 Int-0.108
ete-0.099
 Everything-0.099
Token flowers
Token ID12734
Feature activation+1.28e+01

Pos logit contributions
els+0.034
 inside+0.028
bones+0.028
ier+0.028
 harm+0.027
Neg logit contributions
 Int-0.045
 Fro-0.044
 Everywhere-0.044
 Curious-0.042
 Barkley-0.041
Token in
Token ID287
Feature activation+4.08e+00

Pos logit contributions
els+0.110
 harm+0.091
 problems+0.088
 sticker+0.086
 colored+0.085
Neg logit contributions
 Everywhere-0.232
 Int-0.231
 Fro-0.227
Da-0.226
ete-0.225
Token the
Token ID262
Feature activation+2.72e+00

Pos logit contributions
els+0.060
ier+0.056
ps+0.052
 inside+0.051
 harm+0.050
Neg logit contributions
 Everywhere-0.055
Da-0.054
 Fro-0.054
 Int-0.053
Bet-0.049
Token sack
Token ID23704
Feature activation+1.23e+01

Pos logit contributions
els+0.034
 on+0.033
 inside+0.032
 sticker+0.029
.+0.029
Neg logit contributions
 Int-0.046
ete-0.043
 Everywhere-0.043
 Fro-0.042
Da-0.041
Token.
Token ID13
Feature activation-6.54e+00

Pos logit contributions
els+0.136
ps+0.101
 colored+0.100
 frost+0.099
ries+0.097
Neg logit contributions
 Int-0.214
 Everywhere-0.212
 Sadly-0.204
hog-0.202
ete-0.198
Token
Token ID198
Feature activation-6.52e+00

Pos logit contributions
 Int+0.077
ete+0.075
 concession+0.068
 Fro+0.068
mber+0.067
Neg logit contributions
 on-0.112
els-0.108
 inside-0.107
.-0.105
 sticker-0.099
Token,
Token ID11
Feature activation+1.80e+00

Pos logit contributions
els+0.142
ries+0.106
 frost+0.105
 colored+0.100
 icing+0.095
Neg logit contributions
 Fro-0.202
hog-0.201
 Int-0.196
 Everywhere-0.192
ishly-0.190
Token a
Token ID257
Feature activation+3.01e+00

Pos logit contributions
els+0.022
 inside+0.020
 on+0.019
 sticker+0.018
 colored+0.017
Neg logit contributions
 Int-0.030
 Everywhere-0.028
 Fro-0.028
 Curious-0.027
ete-0.027
Token cold
Token ID4692
Feature activation+7.59e+00

Pos logit contributions
els+0.045
ries+0.035
ps+0.035
 colored+0.033
bones+0.033
Neg logit contributions
 Int-0.046
 Fro-0.044
 Curious-0.042
 Sadly-0.042
Da-0.040
Token st
Token ID336
Feature activation+5.69e+00

Pos logit contributions
els+0.114
 problems+0.085
ck+0.084
bones+0.082
ries+0.082
Neg logit contributions
 Int-0.109
 Fro-0.104
 Everywhere-0.099
 Curious-0.098
 Everything-0.096
Tokeneth
Token ID2788
Feature activation+6.11e+00

Pos logit contributions
els+0.093
 on+0.090
 inside+0.088
 sticker+0.086
.+0.085
Neg logit contributions
 Int-0.075
 Everywhere-0.068
ete-0.067
Da-0.067
 Fro-0.065
Tokenoscope
Token ID40326
Feature activation+1.30e+01

Pos logit contributions
els+0.103
 inside+0.086
 on+0.084
 problems+0.084
ries+0.084
Neg logit contributions
 Int-0.075
 Everywhere-0.066
 Fro-0.064
 Curious-0.064
 As-0.060
Token and
Token ID290
Feature activation+1.14e+00

Pos logit contributions
els+0.176
 colored+0.135
ries+0.131
ck+0.125
ws+0.125
Neg logit contributions
 She-0.189
 As-0.179
 Not-0.178
 Int-0.176
 Fro-0.176
Token a
Token ID257
Feature activation+2.60e+00

Pos logit contributions
els+0.015
 inside+0.013
 on+0.012
 sticker+0.011
ps+0.011
Neg logit contributions
 Int-0.019
 Everywhere-0.017
 Curious-0.017
 Fro-0.016
 Everything-0.016
Token loud
Token ID7812
Feature activation+6.75e+00

Pos logit contributions
els+0.040
ries+0.031
 inside+0.030
ps+0.030
 colored+0.029
Neg logit contributions
 Int-0.041
 Curious-0.037
 Fro-0.037
 Sadly-0.036
 Terr-0.035
Token voice
Token ID3809
Feature activation+8.72e+00

Pos logit contributions
els+0.105
 inside+0.091
 on+0.086
ries+0.082
 colored+0.081
Neg logit contributions
 Int-0.100
 Fro-0.087
 Curious-0.087
 Everything-0.085
 Everywhere-0.081
Token.
Token ID13
Feature activation-7.85e+00

Pos logit contributions
els+0.092
 penn+0.078
 half+0.076
 beads+0.075
 pieces+0.071
Neg logit contributions
 Int-0.144
 Her-0.129
 Curious-0.126
 His-0.124
 Everything-0.119
Token to
Token ID284
Feature activation-2.49e-02

Pos logit contributions
els+0.015
 on+0.015
 inside+0.014
 sticker+0.013
.+0.013
Neg logit contributions
 Int-0.025
ete-0.024
 Everywhere-0.023
 Fro-0.023
 Curious-0.022
Token always
Token ID1464
Feature activation+3.74e+00

Pos logit contributions
 Int+0.000
ete+0.000
 Everywhere+0.000
 Fro+0.000
Da+0.000
Neg logit contributions
els-0.000
 on-0.000
 inside-0.000
 sticker-0.000
.-0.000
Token listen
Token ID6004
Feature activation+6.58e+00

Pos logit contributions
els+0.067
ries+0.056
 frost+0.052
bones+0.051
ps+0.051
Neg logit contributions
 Int-0.044
ete-0.043
 Fro-0.043
 Everywhere-0.042
Of-0.042
Token to
Token ID284
Feature activation+2.00e+00

Pos logit contributions
els+0.097
 frost+0.081
 sticker+0.080
 colored+0.078
 on+0.078
Neg logit contributions
 Int-0.095
ete-0.094
 Everywhere-0.094
 Fro-0.087
Da-0.086
Token your
Token ID534
Feature activation+2.63e+00

Pos logit contributions
els+0.026
 on+0.025
 inside+0.024
 sticker+0.023
 colored+0.021
Neg logit contributions
 Int-0.031
ete-0.030
 Fro-0.030
 Everywhere-0.030
Da-0.028
Token heart
Token ID2612
Feature activation+1.09e+01

Pos logit contributions
els+0.048
 on+0.046
 inside+0.045
 colored+0.044
bones+0.042
Neg logit contributions
 Fro-0.024
ete-0.023
 Everywhere-0.022
 Sadly-0.021
 Int-0.020
Token and
Token ID290
Feature activation+3.27e+00

Pos logit contributions
els+0.110
 inside+0.085
 frost+0.085
 colored+0.084
 on+0.083
Neg logit contributions
 Int-0.196
 Everywhere-0.193
ete-0.193
 Fro-0.189
 Sadly-0.181
Token never
Token ID1239
Feature activation+3.87e+00

Pos logit contributions
els+0.045
 on+0.041
 inside+0.040
 sticker+0.037
 colored+0.037
Neg logit contributions
 Int-0.050
ete-0.047
 Everywhere-0.047
 Fro-0.046
Da-0.044
Token give
Token ID1577
Feature activation+7.71e+00

Pos logit contributions
els+0.066
ps+0.052
ries+0.052
 on+0.051
 colored+0.050
Neg logit contributions
 Int-0.050
 Fro-0.050
ete-0.045
Of-0.045
 Curious-0.044
Token up
Token ID510
Feature activation+8.67e+00

Pos logit contributions
els+0.090
 colored+0.083
 sticker+0.083
 on+0.078
 frost+0.078
Neg logit contributions
 Fro-0.118
ete-0.117
 Int-0.115
 Everywhere-0.108
 Dod-0.108
Token on
Token ID319
Feature activation+5.49e+00

Pos logit contributions
els+0.082
 frost+0.062
 sticker+0.060
 tape+0.060
 colored+0.059
Neg logit contributions
ete-0.159
 Fro-0.158
 Everywhere-0.152
 Int-0.148
 Not-0.144
Token She
Token ID1375
Feature activation+8.66e-01

Pos logit contributions
ete+0.041
 Int+0.040
 Fro+0.038
Da+0.038
mber+0.036
Neg logit contributions
 on-0.105
 inside-0.104
els-0.102
.-0.101
 sticker-0.099
Token asked
Token ID1965
Feature activation+4.18e+00

Pos logit contributions
els+0.010
 on+0.010
 inside+0.010
 sticker+0.009
.+0.009
Neg logit contributions
 Int-0.015
ete-0.014
 Everywhere-0.014
 Fro-0.014
Da-0.013
Token him
Token ID683
Feature activation+5.16e+00

Pos logit contributions
els+0.052
 sticker+0.044
 on+0.044
ps+0.043
 spills+0.043
Neg logit contributions
 Int-0.067
 Everywhere-0.066
ete-0.066
 Fro-0.064
 Everything-0.062
Token lots
Token ID6041
Feature activation+8.36e+00

Pos logit contributions
els+0.064
 on+0.055
 colored+0.053
 inside+0.053
 frost+0.052
Neg logit contributions
 Int-0.087
ete-0.084
 Everywhere-0.083
 Fro-0.083
 Curious-0.078
Token of
Token ID286
Feature activation+7.44e+00

Pos logit contributions
els+0.117
 on+0.112
 inside+0.110
 sticker+0.106
ps+0.105
Neg logit contributions
 Int-0.116
ete-0.113
 Everywhere-0.113
 Fro-0.108
 concession-0.103
Token questions
Token ID2683
Feature activation+1.14e+01

Pos logit contributions
els+0.112
 inside+0.099
 on+0.098
 cutter+0.093
 sticker+0.091
Neg logit contributions
 Int-0.103
 Everywhere-0.092
 Everything-0.090
 Curious-0.090
 Fro-0.089
Token and
Token ID290
Feature activation+2.41e+00

Pos logit contributions
els+0.124
 colored+0.101
 inside+0.101
 spills+0.099
ps+0.097
Neg logit contributions
 Int-0.198
 Everywhere-0.198
ete-0.186
 Everything-0.181
 Fro-0.179
Token tried
Token ID3088
Feature activation+7.00e+00

Pos logit contributions
els+0.035
 on+0.031
 inside+0.031
 sticker+0.029
ps+0.029
Neg logit contributions
 Int-0.036
 Everywhere-0.033
ete-0.033
 Fro-0.032
 Curious-0.032
Token to
Token ID284
Feature activation+3.72e+00

Pos logit contributions
els+0.102
 colored+0.087
ps+0.086
 frost+0.086
ier+0.085
Neg logit contributions
 Int-0.095
 Fro-0.095
 Everywhere-0.094
ete-0.088
 Everything-0.087
Token make
Token ID787
Feature activation+8.08e+00

Pos logit contributions
els+0.053
 on+0.047
 inside+0.046
 frost+0.043
ries+0.042
Neg logit contributions
 Int-0.056
 Everywhere-0.051
ete-0.050
 Curious-0.049
 Fro-0.049
Token him
Token ID683
Feature activation+9.80e+00

Pos logit contributions
els+0.109
 on+0.102
 inside+0.100
 sticker+0.095
 colored+0.092
Neg logit contributions
 Int-0.125
ete-0.117
 Fro-0.116
 Everywhere-0.115
 Everything-0.109
Token idea
Token ID2126
Feature activation+6.97e+00

Pos logit contributions
els+0.071
 on+0.058
 inside+0.056
ps+0.054
 frost+0.053
Neg logit contributions
 Int-0.080
 Fro-0.078
ete-0.077
 Everywhere-0.077
mber-0.073
Token and
Token ID290
Feature activation-1.45e+00

Pos logit contributions
els+0.083
 sticker+0.065
 frost+0.063
ps+0.061
 colored+0.061
Neg logit contributions
ete-0.112
 Int-0.112
 Everywhere-0.112
 Fro-0.110
 As-0.106
Token quickly
Token ID2952
Feature activation+3.16e+00

Pos logit contributions
 Int+0.026
 Everywhere+0.024
ete+0.024
Da+0.023
 Fro+0.022
Neg logit contributions
 on-0.018
 inside-0.017
.-0.016
els-0.015
 colored-0.014
Token grabbed
Token ID13176
Feature activation+7.01e+00

Pos logit contributions
els+0.053
 on+0.044
 sticker+0.043
 inside+0.043
ps+0.042
Neg logit contributions
 Int-0.041
ete-0.040
 Fro-0.038
mber-0.036
 Curious-0.036
Token their
Token ID511
Feature activation+3.85e+00

Pos logit contributions
els+0.085
ier+0.079
 sticker+0.079
 ring+0.077
 harm+0.076
Neg logit contributions
 Fro-0.104
ete-0.103
 Int-0.102
 Everywhere-0.100
 Everything-0.092
Token hats
Token ID23910
Feature activation+1.32e+01

Pos logit contributions
els+0.050
 on+0.045
 inside+0.044
 sticker+0.042
 harm+0.041
Neg logit contributions
 Int-0.061
ete-0.059
 Fro-0.057
 Everywhere-0.055
 concession-0.055
Token.
Token ID13
Feature activation-7.83e+00

Pos logit contributions
els+0.110
 colored+0.084
 harm+0.077
bones+0.077
 frost+0.075
Neg logit contributions
ete-0.262
 Fro-0.248
 Int-0.247
 Everywhere-0.246
ishly-0.235
Token They
Token ID1119
Feature activation+1.23e-01

Pos logit contributions
Da+0.100
Bet+0.097
 Int+0.096
ternally+0.096
ete+0.095
Neg logit contributions
.-0.116
 on-0.112
 inside-0.110
 paper-0.099
els-0.098
Token were
Token ID547
Feature activation+1.51e+00

Pos logit contributions
els+0.002
 on+0.002
 inside+0.001
 sticker+0.001
.+0.001
Neg logit contributions
 Int-0.002
ete-0.002
 Everywhere-0.002
 Fro-0.002
 Curious-0.002
Token so
Token ID523
Feature activation-8.56e-01

Pos logit contributions
els+0.024
 on+0.021
 sticker+0.021
 inside+0.021
.+0.020
Neg logit contributions
 Int-0.020
 Fro-0.019
ete-0.019
 Everywhere-0.019
 Curious-0.017
Token lucky
Token ID9670
Feature activation+6.24e+00

Pos logit contributions
 Int+0.012
 Everywhere+0.011
ete+0.011
 Fro+0.011
Da+0.011
Neg logit contributions
 on-0.013
els-0.013
 inside-0.012
.-0.012
 sticker-0.011
Token seen
Token ID1775
Feature activation+8.65e+00

Pos logit contributions
els+0.068
 on+0.057
 sticker+0.057
 frost+0.056
ps+0.054
Neg logit contributions
 Int-0.056
 Fro-0.052
ete-0.051
 Curious-0.049
Da-0.047
Token again
Token ID757
Feature activation+7.22e+00

Pos logit contributions
els+0.108
 on+0.097
 sticker+0.091
 colored+0.091
 inside+0.091
Neg logit contributions
 Int-0.146
 Fro-0.142
ete-0.137
 Everywhere-0.136
 Everything-0.129
Token.
Token ID13
Feature activation-5.78e+00

Pos logit contributions
els+0.071
 frost+0.054
 sticker+0.052
ps+0.052
 tape+0.050
Neg logit contributions
 Int-0.142
 Fro-0.141
ete-0.136
 Curious-0.134
 He-0.132
Token<|endoftext|>
Token ID50256
Feature activation-1.06e+00

Pos logit contributions
 concession+0.085
ternally+0.081
rolling+0.080
Ag+0.080
Da+0.079
Neg logit contributions
.-0.080
 inside-0.072
 on-0.071
."-0.059
 half-0.059
TokenToday
Token ID8888
Feature activation+3.64e+00

Pos logit contributions
 Int+0.011
ete+0.010
 Everywhere+0.010
 Fro+0.010
Da+0.009
Neg logit contributions
 on-0.019
els-0.019
 inside-0.019
.-0.018
 sticker-0.018
Token,
Token ID11
Feature activation-3.19e+00

Pos logit contributions
 on+0.034
.+0.032
 inside+0.031
 sticker+0.025
els+0.025
Neg logit contributions
 Int-0.075
Of-0.071
 Everywhere-0.071
Mos-0.070
Da-0.070
Token Mom
Token ID11254
Feature activation+2.45e+00

Pos logit contributions
 Int+0.054
ete+0.052
Of+0.052
rig+0.051
Mos+0.050
Neg logit contributions
 on-0.042
 inside-0.039
.-0.039
 paper-0.031
 colored-0.031
Tokenmy
Token ID1820
Feature activation+2.29e+00

Pos logit contributions
els+0.037
 on+0.035
 inside+0.033
 sticker+0.033
.+0.031
Neg logit contributions
 Int-0.034
ete-0.031
 Everywhere-0.031
 Fro-0.030
Da-0.029
Token and
Token ID290
Feature activation-1.57e+00

Pos logit contributions
els+0.029
 on+0.026
 inside+0.025
 sticker+0.025
 frost+0.023
Neg logit contributions
 Int-0.038
ete-0.035
 Everywhere-0.035
 Fro-0.035
Da-0.033
Token I
Token ID314
Feature activation+4.12e-01

Pos logit contributions
 Int+0.024
 Everywhere+0.023
ete+0.023
pill+0.023
rig+0.022
Neg logit contributions
 on-0.023
 inside-0.022
.-0.021
els-0.020
 frost-0.020
Token went
Token ID1816
Feature activation+6.69e+00

Pos logit contributions
els+0.006
 on+0.006
 inside+0.005
 sticker+0.005
 paper+0.005
Neg logit contributions
 Int-0.006
ete-0.005
 Everywhere-0.005
 Fro-0.005
 Curious-0.005
Token.
Token ID13
Feature activation-1.57e+01

Pos logit contributions
 Int+0.011
ete+0.010
 Everywhere+0.010
 Fro+0.010
mber+0.010
Neg logit contributions
els-0.003
 on-0.002
 inside-0.002
 sticker-0.002
 colored-0.002
Token It
Token ID632
Feature activation-7.05e+00

Pos logit contributions
ternally+0.197
ete+0.195
 Int+0.194
 concession+0.188
rig+0.183
Neg logit contributions
.-0.247
 on-0.237
 inside-0.218
."-0.203
,"-0.194
Token was
Token ID373
Feature activation-6.34e+00

Pos logit contributions
 Int+0.123
ete+0.120
 Everywhere+0.117
 Fro+0.112
Da+0.111
Neg logit contributions
 on-0.086
 inside-0.082
els-0.078
.-0.077
 colored-0.073
Token a
Token ID257
Feature activation-5.16e+00

Pos logit contributions
 Int+0.092
 Fro+0.089
 Everywhere+0.089
ete+0.088
Da+0.084
Neg logit contributions
els-0.092
 on-0.079
 inside-0.079
 sticker-0.078
ps-0.073
Token peanut
Token ID26636
Feature activation+2.50e+00

Pos logit contributions
 Int+0.082
ete+0.077
 Fro+0.077
 Everywhere+0.076
Da+0.074
Neg logit contributions
els-0.071
 on-0.063
 inside-0.062
ps-0.058
.-0.056
Token butter
Token ID9215
Feature activation+1.93e+00

Pos logit contributions
els+0.033
bones+0.028
ps+0.027
ws+0.025
 colored+0.025
Neg logit contributions
 Fro-0.039
ete-0.036
mber-0.036
olly-0.036
Da-0.036
Token and
Token ID290
Feature activation-5.60e+00

Pos logit contributions
els+0.019
 inside+0.014
 problems+0.014
 colored+0.014
ps+0.013
Neg logit contributions
 Fro-0.036
ete-0.035
 Int-0.034
Da-0.033
 Everywhere-0.033
Token jelly
Token ID27644
Feature activation+1.78e+00

Pos logit contributions
 Int+0.099
 Fro+0.093
ete+0.093
 Everywhere+0.092
Da+0.091
Neg logit contributions
els-0.064
 on-0.058
 inside-0.058
 sticker-0.052
.-0.051
Token sandwich
Token ID20433
Feature activation+2.86e+00

Pos logit contributions
els+0.022
 harm+0.016
 colored+0.016
ps+0.015
 inside+0.015
Neg logit contributions
ete-0.029
 Fro-0.029
mber-0.028
Da-0.027
 Int-0.027
Token,
Token ID11
Feature activation-5.28e+00

Pos logit contributions
els+0.026
 harm+0.017
 problems+0.016
 colored+0.015
 sticker+0.015
Neg logit contributions
 Fro-0.056
ete-0.055
 Int-0.054
 Everywhere-0.054
Da-0.053
Token which
Token ID543
Feature activation-3.08e+00

Pos logit contributions
 Int+0.099
ete+0.093
 Everywhere+0.092
 Fro+0.090
Da+0.088
Neg logit contributions
els-0.056
 on-0.055
 inside-0.053
 sticker-0.047
.-0.047
Token a
Token ID257
Feature activation+3.75e+00

Pos logit contributions
els+0.129
bones+0.107
 colored+0.105
ries+0.099
ge+0.098
Neg logit contributions
 Everywhere-0.105
ete-0.102
 Fro-0.100
 Int-0.096
Da-0.096
Token chance
Token ID2863
Feature activation+8.92e+00

Pos logit contributions
els+0.061
ries+0.050
ps+0.048
bones+0.047
 inside+0.045
Neg logit contributions
 Int-0.056
mber-0.055
 Everywhere-0.054
ete-0.054
 Fro-0.053
Token.
Token ID13
Feature activation-6.86e+00

Pos logit contributions
els+0.106
bones+0.086
 colored+0.083
ries+0.081
 frost+0.081
Neg logit contributions
 Everywhere-0.145
 Fro-0.140
ete-0.139
 As-0.133
 Int-0.132
Token How
Token ID1374
Feature activation-2.86e+00

Pos logit contributions
 Int+0.071
ete+0.067
mber+0.062
Da+0.061
 Fro+0.061
Neg logit contributions
els-0.125
 on-0.122
 inside-0.120
.-0.116
 sticker-0.116
Token about
Token ID546
Feature activation+1.59e-02

Pos logit contributions
 Int+0.046
ete+0.043
 Everywhere+0.042
 Fro+0.041
Da+0.040
Neg logit contributions
els-0.038
 on-0.037
 inside-0.036
 sticker-0.033
.-0.033
Token we
Token ID356
Feature activation+2.88e+00

Pos logit contributions
els+0.000
 inside+0.000
 on+0.000
ps+0.000
 sticker+0.000
Neg logit contributions
 Int-0.000
 Fro-0.000
ete-0.000
 Everywhere-0.000
 Everything-0.000
Token make
Token ID787
Feature activation+5.48e+00

Pos logit contributions
els+0.045
 on+0.039
 inside+0.039
 sticker+0.037
ps+0.037
Neg logit contributions
ete-0.039
 Int-0.038
 Fro-0.037
 Everywhere-0.036
 Sadly-0.033
Token a
Token ID257
Feature activation+3.85e+00

Pos logit contributions
els+0.071
 inside+0.064
 on+0.062
 sticker+0.059
 colored+0.058
Neg logit contributions
 Int-0.084
ete-0.083
 Fro-0.081
 Everywhere-0.078
Da-0.075
Token deal
Token ID1730
Feature activation+1.03e+01

Pos logit contributions
els+0.058
 inside+0.048
ps+0.048
 on+0.048
 colored+0.045
Neg logit contributions
 Int-0.054
ete-0.054
 Fro-0.054
mber-0.053
Da-0.051
Token?
Token ID30
Feature activation-5.97e+00

Pos logit contributions
els+0.119
 spills+0.101
ps+0.099
 problems+0.098
 colored+0.097
Neg logit contributions
 Fro-0.159
 Int-0.158
ete-0.157
 Everywhere-0.151
 concession-0.147
Token If
Token ID1002
Feature activation+5.19e-01

Pos logit contributions
 Int+0.060
ete+0.056
 Fro+0.052
 Everywhere+0.052
Da+0.051
Neg logit contributions
els-0.112
 on-0.111
 inside-0.108
 sticker-0.104
.-0.103
Token and
Token ID290
Feature activation+5.24e-01

Pos logit contributions
els+0.101
 frost+0.084
 inside+0.084
ries+0.079
 sticker+0.079
Neg logit contributions
 Int-0.125
 Everywhere-0.121
 Fro-0.114
ete-0.111
 Curious-0.108
Token practiced
Token ID19893
Feature activation+8.35e+00

Pos logit contributions
els+0.008
 inside+0.007
 sticker+0.007
 on+0.007
 frost+0.007
Neg logit contributions
 Int-0.007
 Everywhere-0.007
ete-0.006
 Fro-0.006
 Curious-0.006
Token,
Token ID11
Feature activation+1.77e+00

Pos logit contributions
els+0.114
 frost+0.098
ries+0.094
 inside+0.092
 sticker+0.089
Neg logit contributions
 Everywhere-0.132
 Int-0.131
 Fro-0.121
 As-0.117
ete-0.115
Token and
Token ID290
Feature activation+9.05e-01

Pos logit contributions
els+0.022
 inside+0.020
 on+0.019
 sticker+0.018
 frost+0.018
Neg logit contributions
 Int-0.031
 Everywhere-0.029
 Fro-0.027
ete-0.027
 Curious-0.027
Token the
Token ID262
Feature activation+9.54e-01

Pos logit contributions
els+0.014
 inside+0.011
 sticker+0.011
 frost+0.011
 on+0.011
Neg logit contributions
 Int-0.014
 Everywhere-0.013
ete-0.012
 Fro-0.012
 Curious-0.012
Token day
Token ID1110
Feature activation+6.80e+00

Pos logit contributions
els+0.014
 inside+0.012
 on+0.011
ries+0.011
 frost+0.011
Neg logit contributions
 Int-0.015
 Everywhere-0.014
 Curious-0.013
mber-0.013
 Everything-0.013
Token of
Token ID286
Feature activation+2.94e+00

Pos logit contributions
els+0.102
 inside+0.090
 on+0.089
ps+0.088
 sticker+0.087
Neg logit contributions
 Int-0.100
ete-0.093
 Everywhere-0.091
 Everything-0.088
 Fro-0.086
Token the
Token ID262
Feature activation+2.72e+00

Pos logit contributions
els+0.040
 inside+0.036
 sticker+0.035
 spills+0.034
 frost+0.033
Neg logit contributions
 Int-0.040
 Everywhere-0.040
 Fro-0.038
ete-0.038
 Everything-0.037
Token stage
Token ID3800
Feature activation+1.11e+01

Pos logit contributions
els+0.042
 inside+0.035
 on+0.033
ries+0.033
ps+0.032
Neg logit contributions
 Int-0.040
 Everywhere-0.036
mber-0.036
 concession-0.035
 Curious-0.034
Token show
Token ID905
Feature activation+1.09e+01

Pos logit contributions
els+0.135
 on+0.120
 inside+0.118
 sticker+0.111
 frost+0.108
Neg logit contributions
 Int-0.190
 Everywhere-0.179
ete-0.179
 Fro-0.172
 Curious-0.168
Token came
Token ID1625
Feature activation+8.04e+00

Pos logit contributions
els+0.103
 inside+0.087
 on+0.085
 sticker+0.083
 frost+0.080
Neg logit contributions
 Int-0.210
 Everywhere-0.204
ete-0.204
 Fro-0.195
 Everything-0.194
Token tree
Token ID5509
Feature activation+4.50e+00

Pos logit contributions
 Int+0.046
 Everywhere+0.043
 Curious+0.040
Da+0.039
ete+0.039
Neg logit contributions
els-0.043
 on-0.038
 inside-0.036
ps-0.036
 sticker-0.036
Token and
Token ID290
Feature activation-3.84e+00

Pos logit contributions
els+0.043
 on+0.037
 inside+0.035
 sticker+0.034
ps+0.032
Neg logit contributions
 Int-0.088
 Everywhere-0.083
ete-0.083
 Fro-0.082
Da-0.080
Token it
Token ID340
Feature activation-2.44e-01

Pos logit contributions
 Int+0.059
ete+0.055
 Everywhere+0.055
 Fro+0.053
Da+0.052
Neg logit contributions
els-0.053
 on-0.052
 inside-0.051
 sticker-0.046
.-0.046
Token lit
Token ID6578
Feature activation+2.82e+00

Pos logit contributions
 Int+0.004
ete+0.004
 Everywhere+0.004
 Fro+0.004
Da+0.003
Neg logit contributions
 on-0.003
els-0.003
 inside-0.003
.-0.003
 colored-0.003
Token up
Token ID510
Feature activation+3.37e+00

Pos logit contributions
els+0.035
 sticker+0.029
 on+0.029
ps+0.027
 inside+0.027
Neg logit contributions
 Int-0.049
 Everywhere-0.046
 Curious-0.044
ete-0.043
 Fro-0.043
Token the
Token ID262
Feature activation-1.81e+00

Pos logit contributions
els+0.034
 sticker+0.029
ps+0.026
 stamp+0.025
 frost+0.025
Neg logit contributions
ete-0.063
 Int-0.063
 Everywhere-0.061
 Fro-0.060
Da-0.059
Token night
Token ID1755
Feature activation+3.90e+00

Pos logit contributions
 Int+0.024
 Everywhere+0.023
 Curious+0.022
 Everything+0.021
 concession+0.021
Neg logit contributions
els-0.029
 on-0.024
 inside-0.023
ps-0.023
 sticker-0.023
Token with
Token ID351
Feature activation+7.04e-01

Pos logit contributions
els+0.067
ps+0.058
ws+0.056
 on+0.055
 sticker+0.055
Neg logit contributions
 Int-0.048
 Everywhere-0.045
 Everything-0.043
ete-0.043
 Curious-0.041
Token its
Token ID663
Feature activation-4.45e-02

Pos logit contributions
els+0.009
 on+0.008
 inside+0.008
 sticker+0.008
.+0.007
Neg logit contributions
 Int-0.012
ete-0.011
 Everywhere-0.011
 Fro-0.011
Da-0.010
Token bright
Token ID6016
Feature activation+4.89e+00

Pos logit contributions
 Int+0.001
 Everywhere+0.001
 Fro+0.001
ete+0.001
 Curious+0.001
Neg logit contributions
els-0.001
 on-0.001
 inside-0.001
 sticker-0.001
ps-0.000
Token colours
Token ID18915
Feature activation+8.27e+00

Pos logit contributions
els+0.070
 on+0.062
 inside+0.061
bones+0.059
 problems+0.058
Neg logit contributions
 Int-0.067
 Everywhere-0.066
 Curious-0.062
 Everything-0.059
ete-0.056
Tokenie
Token ID494
Feature activation-1.11e+00

Pos logit contributions
els+0.020
 on+0.019
 sticker+0.019
 inside+0.019
 frost+0.018
Neg logit contributions
 Int-0.020
ete-0.019
 Fro-0.019
 Everywhere-0.018
Da-0.018
Token said
Token ID531
Feature activation+1.36e-01

Pos logit contributions
 Int+0.014
ete+0.013
 Fro+0.013
 Everywhere+0.013
 Curious+0.013
Neg logit contributions
els-0.017
 on-0.016
 sticker-0.016
 inside-0.016
 frost-0.015
Token and
Token ID290
Feature activation-6.29e+00

Pos logit contributions
els+0.001
 sticker+0.001
 colored+0.001
 on+0.001
 inside+0.001
Neg logit contributions
 Int-0.003
 Everywhere-0.002
ete-0.002
 Fro-0.002
 Everything-0.002
Token realized
Token ID6939
Feature activation-2.34e+00

Pos logit contributions
 Int+0.098
ete+0.091
 Everywhere+0.090
 Fro+0.088
Da+0.085
Neg logit contributions
els-0.087
 on-0.084
 inside-0.082
 sticker-0.076
.-0.075
Token that
Token ID326
Feature activation-2.87e+00

Pos logit contributions
 Int+0.037
 Fro+0.035
 Everywhere+0.035
ete+0.034
Da+0.033
Neg logit contributions
els-0.030
 on-0.027
 inside-0.027
 sticker-0.027
 colored-0.026
Token the
Token ID262
Feature activation-4.42e+00

Pos logit contributions
 Int+0.041
 Fro+0.039
 Everywhere+0.038
Da+0.036
 Everything+0.036
Neg logit contributions
els-0.042
ps-0.038
 inside-0.038
 sticker-0.037
 on-0.037
Token tw
Token ID665
Feature activation-9.88e-01

Pos logit contributions
 Int+0.068
 Everywhere+0.063
 Fro+0.062
 Curious+0.061
 Sadly+0.059
Neg logit contributions
els-0.063
 on-0.054
 inside-0.054
 colored-0.050
 frost-0.049
Tokenig
Token ID328
Feature activation+5.52e-01

Pos logit contributions
 Int+0.019
ete+0.018
 Everywhere+0.018
 Fro+0.018
Da+0.017
Neg logit contributions
els-0.010
 on-0.009
 inside-0.009
 sticker-0.008
.-0.008
Token had
Token ID550
Feature activation-7.56e-01

Pos logit contributions
els+0.008
 on+0.008
 inside+0.008
.+0.007
 sticker+0.007
Neg logit contributions
 Int-0.008
ete-0.008
 Everywhere-0.007
 Fro-0.007
Da-0.007
Token reversed
Token ID17687
Feature activation+5.82e+00

Pos logit contributions
 Int+0.012
 Everywhere+0.011
ete+0.011
 Fro+0.011
Da+0.011
Neg logit contributions
els-0.010
 on-0.009
 inside-0.009
 sticker-0.009
 frost-0.008
Token and
Token ID290
Feature activation-2.74e+00

Pos logit contributions
els+0.060
 frost+0.054
 sticker+0.053
 stamp+0.047
 icing+0.047
Neg logit contributions
 Int-0.101
 Everywhere-0.101
 Fro-0.094
ete-0.093
 Everything-0.092
Token clouds
Token ID15114
Feature activation+1.97e+00

Pos logit contributions
els+0.008
 sticker+0.007
 on+0.007
 inside+0.007
 stamp+0.006
Neg logit contributions
 Int-0.009
 Everywhere-0.008
Da-0.008
 Everything-0.008
 Fro-0.008
Token appeared
Token ID4120
Feature activation-1.76e+00

Pos logit contributions
els+0.027
 sticker+0.025
 inside+0.023
 stickers+0.023
 on+0.022
Neg logit contributions
 Int-0.030
 Everywhere-0.026
 Curious-0.025
 Fro-0.025
ete-0.025
Token in
Token ID287
Feature activation-4.27e+00

Pos logit contributions
 Int+0.037
 Everywhere+0.035
ete+0.035
Da+0.034
 Fro+0.034
Neg logit contributions
els-0.015
 sticker-0.012
 on-0.011
 inside-0.011
 frost-0.011
Token the
Token ID262
Feature activation-5.91e+00

Pos logit contributions
 Int+0.066
 Everywhere+0.065
Da+0.063
 Curious+0.059
Bet+0.059
Neg logit contributions
els-0.056
 sticker-0.050
ier-0.048
 on-0.047
 inside-0.046
Token sky
Token ID6766
Feature activation+7.70e-01

Pos logit contributions
 Int+0.084
 Everywhere+0.077
ete+0.077
 Fro+0.074
Da+0.072
Neg logit contributions
els-0.090
 on-0.086
 inside-0.084
 sticker-0.079
.-0.077
Token and
Token ID290
Feature activation-5.44e+00

Pos logit contributions
els+0.005
 sticker+0.004
 on+0.003
 inside+0.003
ps+0.003
Neg logit contributions
 Int-0.017
ete-0.017
 Everywhere-0.016
 Fro-0.016
 Curious-0.016
Token it
Token ID340
Feature activation-3.22e+00

Pos logit contributions
 Int+0.096
ete+0.090
 Everywhere+0.089
 Fro+0.087
Da+0.085
Neg logit contributions
els-0.063
 on-0.061
 inside-0.059
 sticker-0.054
.-0.053
Token started
Token ID2067
Feature activation-3.29e+00

Pos logit contributions
 Int+0.052
ete+0.049
 Everywhere+0.049
 Fro+0.048
Da+0.046
Neg logit contributions
 on-0.042
els-0.041
 inside-0.040
.-0.037
 colored-0.036
Token to
Token ID284
Feature activation-5.32e+00

Pos logit contributions
 Int+0.055
 Fro+0.050
 Everywhere+0.050
ete+0.049
Da+0.048
Neg logit contributions
els-0.043
 on-0.038
 inside-0.037
 sticker-0.037
 frost-0.034
Token rain
Token ID6290
Feature activation+3.96e+00

Pos logit contributions
 Int+0.090
 Everywhere+0.082
 Fro+0.080
ete+0.080
 Curious+0.079
Neg logit contributions
els-0.069
 on-0.064
 inside-0.061
 sticker-0.057
.-0.054
Token.
Token ID13
Feature activation-1.34e+01

Pos logit contributions
els+0.017
 on+0.012
 inside+0.011
 sticker+0.010
 frost+0.007
Neg logit contributions
 Int-0.100
 Everywhere-0.094
 Fro-0.093
ete-0.093
 Curious-0.091
Token,
Token ID11
Feature activation+2.49e+00

Pos logit contributions
els+0.033
 on+0.029
 inside+0.027
 sticker+0.024
 colored+0.023
Neg logit contributions
 Int-0.112
ete-0.108
 Fro-0.107
 Everywhere-0.107
Da-0.103
Token who
Token ID508
Feature activation+1.80e+00

Pos logit contributions
 on+0.022
 inside+0.020
.+0.019
els+0.017
 sticker+0.014
Neg logit contributions
 Int-0.053
ete-0.052
 Everywhere-0.049
mber-0.049
Da-0.049
Token might
Token ID1244
Feature activation+3.84e+00

Pos logit contributions
els+0.026
 on+0.026
 inside+0.025
 sticker+0.023
.+0.023
Neg logit contributions
 Int-0.026
ete-0.024
 Everywhere-0.024
 Fro-0.023
Da-0.023
Token know
Token ID760
Feature activation+7.59e+00

Pos logit contributions
els+0.057
 on+0.049
 inside+0.048
 frost+0.045
 colored+0.044
Neg logit contributions
 Int-0.058
 Fro-0.055
 Everywhere-0.054
ete-0.054
Da-0.052
Token.
Token ID13
Feature activation-6.27e+00

Pos logit contributions
els+0.103
 sticker+0.095
 on+0.091
ps+0.091
 inside+0.090
Neg logit contributions
 Int-0.104
 Everywhere-0.103
ete-0.100
 Fro-0.098
 Everything-0.097
Token
Token ID198
Feature activation-5.42e+00

Pos logit contributions
ete+0.101
 concession+0.099
ternally+0.099
ust+0.095
annie+0.095
Neg logit contributions
.-0.079
 on-0.079
 inside-0.073
,-0.062
,"-0.060
Token
Token ID198
Feature activation-7.22e+00

Pos logit contributions
 Int+0.080
 concession+0.074
ete+0.074
ishly+0.071
mber+0.070
Neg logit contributions
 on-0.082
.-0.080
 inside-0.076
els-0.068
 sticker-0.064
TokenB
Token ID33
Feature activation-2.86e+00

Pos logit contributions
 Int+0.087
ete+0.085
 Fro+0.079
 Everywhere+0.077
 Everything+0.072
Neg logit contributions
 on-0.131
 inside-0.118
.-0.115
els-0.113
 colored-0.109
Tokenunny
Token ID16948
Feature activation+2.20e+00

Pos logit contributions
 Int+0.029
 Everywhere+0.026
 concession+0.025
ete+0.024
 sch+0.024
Neg logit contributions
 on-0.052
.-0.052
 inside-0.050
els-0.048
 sticker-0.045
Token looked
Token ID3114
Feature activation+4.88e+00

Pos logit contributions
 on+0.025
 inside+0.025
els+0.024
.+0.023
 colored+0.021
Neg logit contributions
 Int-0.039
 Everywhere-0.037
ete-0.037
 concession-0.035
 Fro-0.035
Token at
Token ID379
Feature activation+2.65e+00

Pos logit contributions
els+0.053
 sticker+0.045
 on+0.043
 frost+0.042
 inside+0.041
Neg logit contributions
 Int-0.092
 Everywhere-0.086
 Fro-0.085
ete-0.085
 Curious-0.083
Token was
Token ID373
Feature activation-1.50e-01

Pos logit contributions
 Int+0.003
 Everywhere+0.003
ete+0.003
 Fro+0.003
Da+0.002
Neg logit contributions
 on-0.002
els-0.002
 inside-0.002
.-0.002
 sticker-0.002
Token so
Token ID523
Feature activation-3.24e+00

Pos logit contributions
 Int+0.002
ete+0.002
 Everywhere+0.002
 Fro+0.002
Da+0.002
Neg logit contributions
 on-0.002
 inside-0.002
els-0.002
.-0.002
 sticker-0.002
Token happy
Token ID3772
Feature activation+3.14e+00

Pos logit contributions
 Int+0.045
ete+0.041
 Everywhere+0.041
 Fro+0.041
Da+0.040
Neg logit contributions
 on-0.050
els-0.048
 inside-0.048
.-0.045
 sticker-0.045
Token and
Token ID290
Feature activation-3.06e+00

Pos logit contributions
els+0.034
 on+0.030
 inside+0.029
 sticker+0.028
 colored+0.026
Neg logit contributions
 Int-0.058
ete-0.055
 Fro-0.054
 Everywhere-0.054
Da-0.052
Token proud
Token ID6613
Feature activation+3.49e+00

Pos logit contributions
 Int+0.048
 Everywhere+0.046
ete+0.045
Da+0.044
 Fro+0.043
Neg logit contributions
 on-0.044
 inside-0.041
.-0.039
 colored-0.035
els-0.035
Token.
Token ID13
Feature activation-9.67e+00

Pos logit contributions
els+0.047
ps+0.039
 sticker+0.039
 spills+0.037
ws+0.037
Neg logit contributions
 Int-0.055
ete-0.054
 Everywhere-0.053
 Fro-0.051
mber-0.050
Token From
Token ID3574
Feature activation-4.69e-01

Pos logit contributions
Da+0.126
 Fro+0.123
Hop+0.117
Mos+0.115
ete+0.115
Neg logit contributions
.-0.155
 on-0.150
 inside-0.135
els-0.132
 letters-0.126
Token then
Token ID788
Feature activation-1.68e+00

Pos logit contributions
 Int+0.008
ete+0.008
 Everywhere+0.008
 Fro+0.008
Da+0.007
Neg logit contributions
els-0.006
 on-0.005
 inside-0.005
 sticker-0.005
.-0.005
Token,
Token ID11
Feature activation-4.59e+00

Pos logit contributions
 Int+0.038
 Everywhere+0.037
 Everything+0.037
ete+0.036
 From+0.036
Neg logit contributions
els-0.010
 frost-0.007
 colored-0.007
ps-0.007
 sticker-0.006
Token the
Token ID262
Feature activation-3.94e+00

Pos logit contributions
 Int+0.070
ete+0.065
 Everywhere+0.065
 Fro+0.063
Da+0.061
Neg logit contributions
els-0.065
 on-0.062
 inside-0.061
 sticker-0.056
.-0.055
Token grand
Token ID4490
Feature activation-5.75e+00

Pos logit contributions
 Int+0.041
 Everywhere+0.037
ete+0.037
 Fro+0.035
 Curious+0.034
Neg logit contributions
els-0.075
 on-0.072
 inside-0.071
 sticker-0.066
.-0.066
Token lot
Token ID1256
Feature activation+1.74e+00

Pos logit contributions
els+0.064
 on+0.059
 inside+0.057
 sticker+0.052
 colored+0.052
Neg logit contributions
 Int-0.086
ete-0.082
 Everywhere-0.081
 Fro-0.080
Da-0.077
Token.
Token ID13
Feature activation-1.13e+01

Pos logit contributions
els+0.012
 frost+0.009
 sticker+0.009
 colored+0.009
 stamp+0.009
Neg logit contributions
 Everywhere-0.037
ete-0.036
 Int-0.035
 Fro-0.034
 concession-0.033
Token He
Token ID679
Feature activation-4.53e+00

Pos logit contributions
Da+0.115
ext+0.109
mber+0.108
ternally+0.106
ete+0.105
Neg logit contributions
.-0.196
 inside-0.193
 on-0.192
 sticker-0.169
 tape-0.168
Token was
Token ID373
Feature activation-2.68e+00

Pos logit contributions
 Int+0.074
 Everywhere+0.070
ete+0.069
 Fro+0.067
Da+0.067
Neg logit contributions
 on-0.058
 inside-0.057
els-0.056
.-0.051
 sticker-0.051
Token happy
Token ID3772
Feature activation+1.22e+00

Pos logit contributions
 Int+0.036
ete+0.033
 Everywhere+0.033
 Fro+0.032
Da+0.031
Neg logit contributions
 on-0.042
els-0.042
 inside-0.041
 sticker-0.038
.-0.038
Token and
Token ID290
Feature activation-4.57e+00

Pos logit contributions
els+0.015
 on+0.013
 inside+0.013
 sticker+0.012
 colored+0.012
Neg logit contributions
 Int-0.021
ete-0.020
 Fro-0.020
 Everywhere-0.020
Da-0.019
Token he
Token ID339
Feature activation-4.05e+00

Pos logit contributions
 Int+0.073
 Everywhere+0.069
ete+0.068
Da+0.068
pill+0.066
Neg logit contributions
 on-0.067
 inside-0.064
.-0.059
 sticker-0.052
 colored-0.051
Token had
Token ID550
Feature activation-2.18e+00

Pos logit contributions
 Int+0.067
 Everywhere+0.064
ete+0.063
Da+0.062
 Fro+0.061
Neg logit contributions
 on-0.053
 inside-0.051
.-0.048
 colored-0.044
 sticker-0.044
Token a
Token ID257
Feature activation-1.15e+00

Pos logit contributions
 Int+0.038
ete+0.035
 Everywhere+0.035
 Fro+0.035
Da+0.034
Neg logit contributions
 on-0.026
els-0.025
 inside-0.025
.-0.023
 sticker-0.022
Token big
Token ID1263
Feature activation+1.39e+00

Pos logit contributions
 Int+0.018
ete+0.017
 Everywhere+0.017
 Fro+0.017
Da+0.016
Neg logit contributions
els-0.016
 on-0.014
 inside-0.014
ps-0.013
 colored-0.013
Token smile
Token ID8212
Feature activation+5.57e+00

Pos logit contributions
els+0.020
 on+0.017
ps+0.017
 inside+0.017
ier+0.016
Neg logit contributions
 Int-0.021
 Everywhere-0.020
 Fro-0.020
ete-0.020
 Sadly-0.019
Token as
Token ID355
Feature activation-5.29e+00

Pos logit contributions
els+0.014
ps+0.010
bones+0.010
ries+0.010
 envelop+0.010
Neg logit contributions
 Int-0.011
 Everywhere-0.011
 As-0.011
 Not-0.010
 Fro-0.010
Token she
Token ID673
Feature activation-3.51e+00

Pos logit contributions
 Int+0.069
ete+0.066
 Everywhere+0.066
 Fro+0.065
Da+0.063
Neg logit contributions
els-0.084
 inside-0.073
 on-0.072
 sticker-0.071
ps-0.070
Token did
Token ID750
Feature activation-4.44e-03

Pos logit contributions
 Int+0.046
ete+0.040
 Everywhere+0.040
 Fro+0.039
 Curious+0.039
Neg logit contributions
els-0.058
 on-0.052
 inside-0.052
 sticker-0.050
 frost-0.048
Token.
Token ID13
Feature activation-1.34e+01

Pos logit contributions
 Int+0.000
 Everywhere+0.000
ete+0.000
 Everything+0.000
 Fro+0.000
Neg logit contributions
els-0.000
 frost-0.000
 sticker-0.000
 tape-0.000
 colored-0.000
Token<|endoftext|>
Token ID50256
Feature activation-9.40e+00

Pos logit contributions
 concession+0.198
ete+0.192
ternally+0.184
rolling+0.182
mber+0.176
Neg logit contributions
.-0.204
 inside-0.173
 on-0.169
."-0.154
els-0.145
TokenS
Token ID50
Feature activation-1.59e+01

Pos logit contributions
 Int+0.102
 Everywhere+0.091
ete+0.091
 Fro+0.088
Da+0.083
Neg logit contributions
els-0.174
 on-0.166
 inside-0.164
 sticker-0.157
 colored-0.154
Tokenara
Token ID3301
Feature activation-9.64e+00

Pos logit contributions
 Int+0.199
 Everywhere+0.176
 Curious+0.169
 Fro+0.167
ete+0.166
Neg logit contributions
els-0.276
 on-0.264
 inside-0.258
.-0.247
 sticker-0.243
Token and
Token ID290
Feature activation-9.93e+00

Pos logit contributions
 Int+0.171
 Everywhere+0.161
ete+0.161
 Fro+0.160
 Curious+0.153
Neg logit contributions
els-0.108
 on-0.096
 inside-0.090
 sticker-0.090
 frost-0.083
Token Ben
Token ID3932
Feature activation-7.89e+00

Pos logit contributions
rig+0.231
 Int+0.229
ete+0.227
pill+0.222
Sadly+0.220
Neg logit contributions
 on-0.082
 inside-0.073
.-0.071
els-0.044
 stamp-0.042
Token like
Token ID588
Feature activation-4.30e+00

Pos logit contributions
 Int+0.127
ete+0.118
 Everywhere+0.117
 Fro+0.115
Da+0.114
Neg logit contributions
 on-0.104
els-0.102
 inside-0.102
.-0.095
 sticker-0.090
Token to
Token ID284
Feature activation-5.92e+00

Pos logit contributions
 Int+0.081
ete+0.077
 Everywhere+0.075
 Fro+0.073
 Curious+0.072
Neg logit contributions
 on-0.046
els-0.044
 inside-0.043
.-0.040
 sticker-0.039
Token it
Token ID340
Feature activation-7.25e+00

Pos logit contributions
ete+0.074
 concession+0.069
ickets+0.067
Fre+0.066
mber+0.065
Neg logit contributions
.-0.116
 inside-0.108
 on-0.106
 paper-0.095
els-0.094
Token forever
Token ID8097
Feature activation-5.36e+00

Pos logit contributions
 Int+0.134
ete+0.126
 Everywhere+0.126
 Fro+0.124
 Curious+0.119
Neg logit contributions
els-0.078
 on-0.071
 inside-0.069
 sticker-0.064
 colored-0.060
Token.
Token ID13
Feature activation-1.64e+01

Pos logit contributions
 Int+0.111
ete+0.107
 Everywhere+0.106
 Fro+0.104
Da+0.101
Neg logit contributions
els-0.045
 on-0.036
 inside-0.034
 sticker-0.032
 frost-0.030
Token
Token ID198
Feature activation-1.52e+01

Pos logit contributions
 Int+0.194
 concession+0.188
ete+0.181
Mos+0.177
Da+0.175
Neg logit contributions
.-0.268
 on-0.253
 inside-0.240
els-0.228
."-0.217
Token
Token ID198
Feature activation-1.54e+01

Pos logit contributions
 Int+0.249
ishly+0.236
 concession+0.227
cient+0.226
rified+0.217
Neg logit contributions
.-0.197
 on-0.186
 inside-0.172
els-0.155
 sticker-0.144
TokenL
Token ID43
Feature activation-1.59e+01

Pos logit contributions
 Int+0.203
ishly+0.186
cient+0.183
 Everywhere+0.180
 Fro+0.178
Neg logit contributions
 on-0.222
.-0.222
 inside-0.200
els-0.196
 half-0.193
Tokenily
Token ID813
Feature activation-1.04e+01

Pos logit contributions
 Int+0.247
 Everywhere+0.226
ete+0.224
 Fro+0.219
 Curious+0.216
Neg logit contributions
els-0.225
 on-0.216
 inside-0.211
.-0.198
 sticker-0.195
Token's
Token ID338
Feature activation-8.71e+00

Pos logit contributions
 Int+0.187
 Everywhere+0.175
ete+0.175
 Fro+0.171
Da+0.170
Neg logit contributions
 on-0.120
 inside-0.116
els-0.115
.-0.110
 colored-0.105
Token mom
Token ID1995
Feature activation-4.83e+00

Pos logit contributions
 Int+0.152
 Everywhere+0.138
ete+0.137
Da+0.135
 Fro+0.134
Neg logit contributions
els-0.106
 on-0.105
 inside-0.103
.-0.096
 sticker-0.095
Token saw
Token ID2497
Feature activation-3.59e+00

Pos logit contributions
 Int+0.078
ete+0.073
 Everywhere+0.072
 Fro+0.071
Da+0.069
Neg logit contributions
els-0.064
 on-0.062
 inside-0.060
 sticker-0.055
.-0.055
Token the
Token ID262
Feature activation-4.04e+00

Pos logit contributions
 Int+0.056
 Fro+0.052
 Everywhere+0.052
Da+0.051
ete+0.050
Neg logit contributions
els-0.045
 sticker-0.043
 on-0.042
 inside-0.041
ps-0.039
Token
Token ID198
Feature activation-1.81e+01

Pos logit contributions
els+0.046
 on+0.044
 inside+0.043
 sticker+0.040
.+0.039
Neg logit contributions
 Int-0.053
ete-0.049
 Everywhere-0.049
 Fro-0.048
Da-0.046
Token
Token ID198
Feature activation-1.64e+01

Pos logit contributions
 Int+0.258
ishly+0.254
cient+0.245
 concession+0.242
ete+0.234
Neg logit contributions
 on-0.246
.-0.245
 inside-0.218
els-0.206
 half-0.203
Tokenâ
Token ID22940
Feature activation-1.59e+01

Pos logit contributions
 Int+0.236
ishly+0.228
cient+0.223
 Everywhere+0.218
 Fro+0.218
Neg logit contributions
 on-0.212
.-0.206
 inside-0.189
els-0.189
 half-0.185
Token
Token ID26391
Feature activation-1.91e+01

Pos logit contributions
 Int+0.146
ete+0.127
 Fro+0.127
 Everywhere+0.126
Da+0.118
Neg logit contributions
els-0.314
 on-0.309
 inside-0.305
.-0.291
 colored-0.287
Token
Token ID129
Feature activation-1.36e+01

Pos logit contributions
 Int+0.237
 Everywhere+0.235
Da+0.231
 concession+0.231
ete+0.219
Neg logit contributions
.-0.304
 on-0.301
 inside-0.282
els-0.265
 sticker-0.259
Token
Token ID241
Feature activation-1.68e+01

Pos logit contributions
 Int+0.166
ete+0.150
Da+0.150
 Everywhere+0.145
 Fro+0.143
Neg logit contributions
els-0.229
 inside-0.215
 on-0.213
.-0.210
ps-0.204
TokenHello
Token ID15496
Feature activation-3.00e+00

Pos logit contributions
 Int+0.187
ete+0.180
 Fro+0.169
 Everywhere+0.165
Da+0.158
Neg logit contributions
 on-0.304
 inside-0.293
els-0.286
.-0.284
 sticker-0.269
Token!
Token ID0
Feature activation-1.28e+01

Pos logit contributions
 Int+0.063
ete+0.060
 Fro+0.059
 Everywhere+0.059
 Curious+0.057
Neg logit contributions
els-0.025
 on-0.020
 inside-0.020
 sticker-0.020
 colored-0.019
Token Where
Token ID6350
Feature activation-7.52e+00

Pos logit contributions
 Int+0.132
ete+0.132
 concession+0.125
 Fro+0.123
 Everywhere+0.119
Neg logit contributions
 on-0.222
 inside-0.215
els-0.208
.-0.204
 sticker-0.196
Token are
Token ID389
Feature activation-4.22e+00

Pos logit contributions
 Int+0.137
 Everywhere+0.130
ete+0.130
Da+0.128
 Fro+0.128
Neg logit contributions
 on-0.086
els-0.081
 inside-0.080
.-0.075
 sticker-0.070
Token you
Token ID345
Feature activation-2.77e+00

Pos logit contributions
 Int+0.068
 Everywhere+0.066
ete+0.063
rig+0.063
 Curious+0.062
Neg logit contributions
 on-0.056
els-0.052
 inside-0.051
.-0.051
 paper-0.044
Token at
Token ID379
Feature activation-1.06e+01

Pos logit contributions
els+0.032
 on+0.028
 inside+0.028
ps+0.026
 sticker+0.025
Neg logit contributions
 Int-0.059
 Everywhere-0.055
ete-0.055
 Fro-0.055
Da-0.054
Token were
Token ID547
Feature activation-9.21e+00

Pos logit contributions
 Int+0.205
ete+0.193
 Everywhere+0.191
 Fro+0.188
Da+0.185
Neg logit contributions
els-0.108
 on-0.105
 inside-0.101
.-0.090
 sticker-0.089
Token all
Token ID477
Feature activation-6.49e+00

Pos logit contributions
 Int+0.147
ete+0.137
 Everywhere+0.136
 Fro+0.134
Da+0.129
Neg logit contributions
els-0.122
 on-0.116
 inside-0.113
 sticker-0.106
.-0.103
Token the
Token ID262
Feature activation-7.99e+00

Pos logit contributions
 Int+0.110
 Everywhere+0.103
ete+0.098
 Curious+0.097
 Everything+0.097
Neg logit contributions
els-0.079
 sticker-0.067
 on-0.062
 inside-0.061
 frost-0.059
Token leaves
Token ID5667
Feature activation+1.92e+00

Pos logit contributions
 Int+0.128
 Everywhere+0.120
ete+0.118
 Fro+0.115
 Curious+0.114
Neg logit contributions
els-0.098
 inside-0.087
 on-0.087
 sticker-0.084
ps-0.078
Token.
Token ID13
Feature activation-1.59e+01

Pos logit contributions
els+0.015
 sticker+0.011
 on+0.011
 inside+0.011
 colored+0.010
Neg logit contributions
 Int-0.041
ete-0.039
 Everywhere-0.039
 Fro-0.038
Da-0.037
Token She
Token ID1375
Feature activation-9.64e+00

Pos logit contributions
ete+0.173
 Int+0.169
 concession+0.165
ternally+0.162
mber+0.157
Neg logit contributions
.-0.271
 on-0.261
 inside-0.246
els-0.235
."-0.220
Token dreamed
Token ID27947
Feature activation-3.13e+00

Pos logit contributions
 Int+0.157
ete+0.147
 Everywhere+0.147
 Fro+0.142
Da+0.138
Neg logit contributions
els-0.125
 on-0.125
 inside-0.122
.-0.113
 colored-0.111
Token of
Token ID286
Feature activation-3.14e+00

Pos logit contributions
 Int+0.053
ete+0.050
 Everywhere+0.050
 Fro+0.049
Da+0.048
Neg logit contributions
els-0.038
 on-0.036
 inside-0.035
 sticker-0.033
.-0.031
Token picking
Token ID10868
Feature activation+3.03e+00

Pos logit contributions
 Int+0.053
 Fro+0.050
 Everywhere+0.050
ete+0.048
Da+0.047
Neg logit contributions
els-0.038
 on-0.034
 inside-0.034
 sticker-0.033
 colored-0.030
Token up
Token ID510
Feature activation+1.81e+00

Pos logit contributions
els+0.033
 sticker+0.030
 on+0.029
 inside+0.029
ps+0.026
Neg logit contributions
 Int-0.053
 Fro-0.050
 Everywhere-0.050
Da-0.049
ete-0.048
Token notes
Token ID4710
Feature activation+2.28e+00

Pos logit contributions
pill+0.020
 Int+0.020
 Everywhere+0.020
 concession+0.020
 Fro+0.020
Neg logit contributions
els-0.015
ps-0.014
 on-0.014
 sticker-0.014
 inside-0.013
Token laid
Token ID8104
Feature activation-1.10e+00

Pos logit contributions
els+0.021
 sticker+0.015
 inside+0.015
 frost+0.014
 on+0.014
Neg logit contributions
 Int-0.043
 Everywhere-0.043
ete-0.041
 Everything-0.039
mber-0.039
Token on
Token ID319
Feature activation-3.86e+00

Pos logit contributions
 Int+0.021
 Everywhere+0.021
ete+0.021
 Fro+0.020
Da+0.020
Neg logit contributions
els-0.009
 sticker-0.008
 inside-0.008
 on-0.007
 colored-0.007
Token the
Token ID262
Feature activation-7.71e+00

Pos logit contributions
 Everywhere+0.064
 Int+0.061
 Fro+0.059
ete+0.059
Da+0.058
Neg logit contributions
els-0.045
 inside-0.039
 sticker-0.037
 on-0.037
ier-0.036
Token ground
Token ID2323
Feature activation-3.46e-02

Pos logit contributions
 Int+0.121
 Everywhere+0.117
ete+0.114
 Fro+0.111
 concession+0.111
Neg logit contributions
els-0.100
 on-0.088
 inside-0.086
 sticker-0.078
 colored-0.077
Token.
Token ID13
Feature activation-1.58e+01

Pos logit contributions
 Int+0.001
 Everywhere+0.001
ete+0.001
 Fro+0.001
Da+0.001
Neg logit contributions
els-0.000
 sticker-0.000
 inside-0.000
 on-0.000
 frost-0.000
Token
Token ID220
Feature activation-1.54e+01

Pos logit contributions
 Int+0.162
ete+0.157
 concession+0.155
mber+0.145
ternally+0.144
Neg logit contributions
.-0.270
 on-0.268
 inside-0.255
els-0.249
 sticker-0.229
Token
Token ID198
Feature activation-1.46e+01

Pos logit contributions
 Int+0.224
 concession+0.213
mber+0.205
Da+0.204
 Fro+0.203
Neg logit contributions
.-0.222
 on-0.218
els-0.210
 inside-0.208
 sticker-0.179
Token
Token ID198
Feature activation-1.47e+01

Pos logit contributions
 Int+0.211
ishly+0.204
 concession+0.199
cient+0.199
ete+0.193
Neg logit contributions
.-0.197
 on-0.191
 inside-0.175
els-0.172
 sticker-0.155
TokenCl
Token ID2601
Feature activation-1.38e+01

Pos logit contributions
 Int+0.182
 Fro+0.171
cient+0.170
ishly+0.167
 Everywhere+0.163
Neg logit contributions
 on-0.217
.-0.209
 inside-0.203
els-0.195
 half-0.191
Tokenara
Token ID3301
Feature activation-1.02e+01

Pos logit contributions
 Int+0.169
 Everywhere+0.153
ete+0.149
 Fro+0.149
Da+0.142
Neg logit contributions
els-0.235
 on-0.230
 inside-0.225
.-0.214
 sticker-0.212