TOP ACTIVATIONS
MAX = 6.834

your fault and ask for mercy . I will spare you
, " Look at my steak ! It 's so good
very scared and begged for mercy . He said , '
" Do you want your toast rare or well - done
- you ' to his toast when he put it in
eat a new vegetable called broccoli . Lily had never seen
Tom cried and begged for mercy . They wished they had
her mom to make her toast with butter and jam .
friend Tommy asked her to cheat on a test . "
new , it was called oxygen . He was a bit
lion . He saw a steak lying in the grass .
asked his mum for some meat . But his mum said
, her mom gave her toast with butter on it .
was a young bear called Be art ie . Be art
saw a big plate of steak on the table . It
her mom my made her toast but forgot to warm up
butter to spread on his toast . He took it and
to eat jam on her toast every morning . One day
big brother to make her toast . At first
and wanted to eat her steak right away . She nodded

MIN ACTIVATIONS
MIN = -8.106

. He laughed with g lee . Mom
two friends laughed with g lee . They had found a
forth and laughed with g lee . When he tired of
The princess laughed with g lee . She pointed and pointed
hugged each other with g lee . After playing
toys and puzzles and cr ay ons . There are hats
knowing that even the t ini est and darkest of gardens
stopped . Kay lee was excited and said ,
a chain . Everywhere N yl a looked , she saw
her . Kay lee was too scared to move
jumped and laughed with g lee . As he
pies and laughed with g lee . But
the redd est and sh ini est apples they could find
and spark led with g lee . From then on ,
and it became the sh ini est tower in the whole
room and laughed with g lee . The little girl had
Sarah smiled with g lee , and started to wonder
all gigg led with g lee . Then ,
the girl laughed with g lee . The cookies were delicious
As the light ening f lic kered away , the man

INTERVAL 3.099 - 6.834
CONTAINS 0.723%

, her mom praised her for being so brave . Lily
said they had to be patient . They
, Tim and Spot went for a walk to the store
she saw the balloon was stuck in a tree . She
" This is for you , Sara . Do

INTERVAL -0.636 - 3.099
CONTAINS 72.301%

it was time to go home . The
Tim my climbed onto the chair , but it
upon a time there lived a very curious little girl .
looking for things to fix . She found a picture frame
mom my hugged him and said , " you don 't

INTERVAL -4.371 - -0.636
CONTAINS 26.884%

The boat came closer and closer until it
noticed he was missing and went to look for him .
ink ! E lla was so excited she ran
trains could not reach the town and everyone was unhappy .
her mom my and smiled as she sn ugg led into

INTERVAL -8.106 - -4.371
CONTAINS 0.092%

harder . So B izzy looked . She searched
filled with joy , watching how happy the ducks were !
" Thanks for teaching me how to improve ." <|endoftext|> There
his friends , and even how to be depend able .
find her favorite red cr ay on .
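
The four interval bounds listed above are consistent with splitting the range from MIN = -8.106 to MAX = 6.834 into four equal-width bins of width 3.735. The dashboard's actual binning code is not shown here, so the following is only a minimal sketch under that assumption; `interval_edges` and `bin_shares` are hypothetical helper names, not part of any tool.

```python
def interval_edges(lo, hi, n_bins=4):
    """Return n_bins + 1 equally spaced edges from lo to hi (assumed binning)."""
    width = (hi - lo) / n_bins
    # Append hi explicitly so the last edge is exact rather than accumulated.
    return [lo + i * width for i in range(n_bins)] + [hi]


def bin_shares(values, edges):
    """Percentage of values in each bin; the last bin includes its upper edge."""
    counts = [0] * (len(edges) - 1)
    for v in values:
        for i in range(len(counts)):
            is_last = i == len(counts) - 1
            if edges[i] <= v < edges[i + 1] or (is_last and v == edges[-1]):
                counts[i] += 1
                break
    total = len(values)
    if total == 0:
        return [0.0] * len(counts)
    return [100.0 * c / total for c in counts]


# MIN and MAX as reported in this dump.
edges = interval_edges(-8.106, 6.834)
print([round(e, 3) for e in edges])
```

Rounded to three decimals, the edges come out as -8.106, -4.371, -0.636, 3.099 and 6.834, matching the INTERVAL lines. The reported CONTAINS percentages (0.723, 72.301, 26.884, 0.092) sum to 100, which is what `bin_shares` would produce for a full set of activations.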
Token your
Token ID534
Feature activation+1.88e-01

Pos logit contributions
 unison+0.076
 thirteen+0.073
 Rover+0.072
 alas+0.072
 then+0.069
Neg logit contributions
 brushes-0.111
ies-0.111
 designs-0.107
 work-0.105
working-0.105
Token fault
Token ID8046
Feature activation+1.05e+00

Pos logit contributions
 Rover+0.008
 then+0.008
 mercy+0.008
 unison+0.008
 alas+0.008
Neg logit contributions
 brushes-0.009
 designs-0.009
ies-0.009
working-0.008
ini-0.008
Token and
Token ID290
Feature activation-4.71e-01

Pos logit contributions
 Rover+0.054
 thirteen+0.053
 unison+0.053
Mr+0.052
 alas+0.052
Neg logit contributions
ies-0.046
 brushes-0.043
 designs-0.042
 work-0.040
working-0.039
Token ask
Token ID1265
Feature activation+1.17e+00

Pos logit contributions
 brushes+0.028
 designs+0.027
ies+0.027
ini+0.027
ugs+0.026
Neg logit contributions
 Rover-0.017
 then-0.017
 thirteen-0.015
 mercy-0.015
 alas-0.015
Token for
Token ID329
Feature activation+3.91e+00

Pos logit contributions
 Rover+0.041
 thirteen+0.040
 unison+0.040
 alas+0.039
 then+0.039
Neg logit contributions
 brushes-0.067
ies-0.066
 designs-0.065
 work-0.064
working-0.062
Token mercy
Token ID17703
Feature activation+6.83e+00

Pos logit contributions
 thirteen+0.208
rified+0.204
 Rover+0.201
 unison+0.198
 alas+0.197
Neg logit contributions
 work-0.155
 brushes-0.155
 designs-0.146
ies-0.143
working-0.133
Token.
Token ID13
Feature activation-1.97e-01

Pos logit contributions
 Rover+0.352
 thirteen+0.342
 alas+0.336
Mr+0.336
tons+0.329
Neg logit contributions
ies-0.293
 brushes-0.270
packs-0.267
 designs-0.263
working-0.261
Token I
Token ID314
Feature activation+9.64e-01

Pos logit contributions
 brushes+0.010
 designs+0.009
working+0.009
ini+0.009
ugs+0.009
Neg logit contributions
 thirteen-0.007
 mercy-0.007
 unison-0.007
 louder-0.007
 alas-0.007
Token will
Token ID481
Feature activation+1.69e+00

Pos logit contributions
 then+0.042
 Rover+0.042
 mercy+0.038
 alas+0.037
 no+0.037
Neg logit contributions
 brushes-0.050
 designs-0.050
ies-0.047
ugs-0.046
ini-0.046
Token spare
Token ID13952
Feature activation+5.19e-01

Pos logit contributions
 Rover+0.081
 then+0.079
 thirteen+0.078
 alas+0.077
 unison+0.076
Neg logit contributions
 brushes-0.076
 designs-0.074
ies-0.073
working-0.070
 work-0.069
Token you
Token ID345
Feature activation+4.15e-01

Pos logit contributions
 unison+0.022
 Rover+0.022
 thirteen+0.022
 alas+0.021
Mr+0.021
Neg logit contributions
 brushes-0.025
ies-0.025
 work-0.025
 designs-0.025
working-0.023
Token,
Token ID11
Feature activation+8.92e-01

Pos logit contributions
 Rover+0.041
 then+0.039
 thirteen+0.039
 alas+0.038
 unison+0.038
Neg logit contributions
 brushes-0.048
ies-0.047
 designs-0.047
working-0.045
 work-0.044
Token "
Token ID366
Feature activation+1.36e+00

Pos logit contributions
 thirteen+0.032
 Rover+0.032
 mercy+0.032
 alas+0.031
 unison+0.031
Neg logit contributions
 brushes-0.049
 designs-0.047
ies-0.046
 work-0.045
working-0.043
TokenLook
Token ID8567
Feature activation+9.57e-02

Pos logit contributions
 mercy+0.051
 Rover+0.050
 alas+0.047
 then+0.046
tons+0.046
Neg logit contributions
 brushes-0.072
ini-0.071
rolled-0.069
 designs-0.068
ies-0.067
Token at
Token ID379
Feature activation+6.49e-01

Pos logit contributions
 Rover+0.004
 then+0.004
 thirteen+0.004
 alas+0.003
tons+0.003
Neg logit contributions
 brushes-0.005
 designs-0.005
ies-0.005
working-0.005
ini-0.005
Token my
Token ID616
Feature activation+2.18e+00

Pos logit contributions
 Rover+0.021
 thirteen+0.020
 unison+0.020
 then+0.020
 alas+0.020
Neg logit contributions
 brushes-0.038
 designs-0.037
ies-0.037
 work-0.036
working-0.036
Token steak
Token ID26320
Feature activation+6.79e+00

Pos logit contributions
 Rover+0.071
 thirteen+0.069
 then+0.068
 alas+0.066
 unison+0.065
Neg logit contributions
 brushes-0.130
 designs-0.130
ies-0.128
 work-0.124
working-0.123
Token!
Token ID0
Feature activation+7.51e-01

Pos logit contributions
 then+0.275
 Rover+0.271
 thirteen+0.251
tons+0.243
 alas+0.238
Neg logit contributions
 brushes-0.375
 designs-0.355
ini-0.345
ies-0.338
bel-0.335
Token It
Token ID632
Feature activation+1.79e-01

Pos logit contributions
 Rover+0.029
 mercy+0.028
 unison+0.028
 then+0.028
 alas+0.027
Neg logit contributions
 brushes-0.039
 designs-0.038
working-0.035
rolled-0.035
 work-0.035
Token's
Token ID338
Feature activation+1.41e+00

Pos logit contributions
 Rover+0.006
 then+0.006
 thirteen+0.006
 alas+0.005
 mercy+0.005
Neg logit contributions
 brushes-0.011
ies-0.010
 designs-0.010
working-0.010
ini-0.010
Token so
Token ID523
Feature activation-2.13e+00

Pos logit contributions
 Rover+0.048
 unison+0.047
rified+0.046
 mercy+0.046
 alas+0.045
Neg logit contributions
 brushes-0.080
 designs-0.079
 work-0.079
ies-0.076
 design-0.074
Token good
Token ID922
Feature activation+8.93e-01

Pos logit contributions
 brushes+0.091
ies+0.088
 designs+0.088
working+0.082
ini+0.082
Neg logit contributions
 Rover-0.109
 then-0.107
 thirteen-0.105
 alas-0.101
Mr-0.101
Token very
Token ID845
Feature activation-6.93e-01

Pos logit contributions
 Rover+0.004
 unison+0.004
 alas+0.004
 thirteen+0.004
tons+0.004
Neg logit contributions
 brushes-0.004
ies-0.004
 designs-0.004
 work-0.004
working-0.003
Token scared
Token ID12008
Feature activation+9.41e-02

Pos logit contributions
 brushes+0.037
 designs+0.036
bows+0.035
ies+0.034
ini+0.034
Neg logit contributions
 Rover-0.030
 then-0.029
 louder-0.027
 thirteen-0.027
 mercy-0.026
Token and
Token ID290
Feature activation+4.21e-01

Pos logit contributions
 unison+0.006
 thirteen+0.006
 alas+0.006
tons+0.006
Isa+0.006
Neg logit contributions
ies-0.003
 work-0.003
 brushes-0.002
 designs-0.002
 stuff-0.002
Token begged
Token ID32946
Feature activation+8.90e-01

Pos logit contributions
 Rover+0.020
 then+0.020
 thirteen+0.019
 unison+0.019
 alas+0.019
Neg logit contributions
 brushes-0.019
ies-0.018
 designs-0.018
working-0.017
 work-0.017
Token for
Token ID329
Feature activation+3.66e+00

Pos logit contributions
 unison+0.040
 thirteen+0.039
 alas+0.038
tons+0.037
 Rover+0.037
Neg logit contributions
 work-0.044
ies-0.042
 brushes-0.041
 designs-0.040
working-0.039
Token mercy
Token ID17703
Feature activation+6.72e+00

Pos logit contributions
 thirteen+0.224
rified+0.219
 unison+0.211
tons+0.204
 alas+0.201
Neg logit contributions
 work-0.132
 brushes-0.106
 designs-0.102
ies-0.100
 design-0.096
Token.
Token ID13
Feature activation+2.21e-01

Pos logit contributions
Mr+0.412
 Rover+0.408
 thirteen+0.396
 alas+0.392
 unison+0.382
Neg logit contributions
ies-0.234
 work-0.204
packs-0.196
working-0.188
 designs-0.176
Token He
Token ID679
Feature activation+1.39e+00

Pos logit contributions
 thirteen+0.011
 mercy+0.011
 unison+0.011
 then+0.010
 louder+0.010
Neg logit contributions
 brushes-0.009
rolled-0.009
 designs-0.008
 work-0.008
bel-0.008
Token said
Token ID531
Feature activation+2.26e-01

Pos logit contributions
 thirteen+0.073
 Rover+0.073
 unison+0.071
 alas+0.071
tons+0.069
Neg logit contributions
 brushes-0.055
ies-0.053
 designs-0.050
 work-0.049
bel-0.046
Token,
Token ID11
Feature activation+4.64e-01

Pos logit contributions
 unison+0.011
 thirteen+0.009
 Rover+0.009
Mr+0.009
rified+0.008
Neg logit contributions
ies-0.010
working-0.010
 brushes-0.010
 designs-0.010
 work-0.010
Token '
Token ID705
Feature activation-3.84e-02

Pos logit contributions
 unison+0.028
 mercy+0.026
 thirteen+0.026
rified+0.025
 course+0.024
Neg logit contributions
 brushes-0.013
 work-0.012
working-0.012
 designs-0.012
 design-0.011
Token "
Token ID366
Feature activation+2.62e+00

Pos logit contributions
 mercy+0.127
 thirteen+0.127
 Rover+0.125
 unison+0.125
 alas+0.119
Neg logit contributions
 brushes-0.130
 designs-0.125
ies-0.124
 work-0.123
working-0.118
TokenDo
Token ID5211
Feature activation+2.51e+00

Pos logit contributions
 Rover+0.089
 mercy+0.080
 thirteen+0.079
tons+0.076
 unison+0.075
Neg logit contributions
 brushes-0.157
ini-0.154
ies-0.152
bel-0.149
 designs-0.146
Token you
Token ID345
Feature activation+1.72e+00

Pos logit contributions
 Rover+0.073
 thirteen+0.070
 then+0.070
 unison+0.067
 alas+0.066
Neg logit contributions
 brushes-0.163
ies-0.156
 designs-0.155
working-0.150
 work-0.150
Token want
Token ID765
Feature activation+2.15e+00

Pos logit contributions
 Rover+0.077
 then+0.077
 thirteen+0.074
 alas+0.072
 unison+0.072
Neg logit contributions
 brushes-0.084
 designs-0.081
ies-0.081
ini-0.076
working-0.076
Token your
Token ID534
Feature activation+2.33e+00

Pos logit contributions
 unison+0.126
rified+0.122
 thirteen+0.119
 Rover+0.119
tons+0.118
Neg logit contributions
 brushes-0.081
 work-0.075
 designs-0.071
working-0.069
ade-0.067
Token toast
Token ID27805
Feature activation+6.52e+00

Pos logit contributions
 Rover+0.078
 then+0.077
 thirteen+0.071
 alas+0.070
 unison+0.069
Neg logit contributions
 brushes-0.139
 designs-0.136
ies-0.135
ini-0.130
working-0.130
Token rare
Token ID4071
Feature activation+1.65e+00

Pos logit contributions
 then+0.269
 Rover+0.268
 alas+0.248
 thirteen+0.246
tons+0.242
Neg logit contributions
 brushes-0.344
 designs-0.333
ies-0.325
ini-0.315
working-0.315
Token or
Token ID393
Feature activation+2.35e+00

Pos logit contributions
 Rover+0.079
 then+0.076
 thirteen+0.075
 unison+0.073
 alas+0.073
Neg logit contributions
 brushes-0.076
 designs-0.073
ies-0.073
working-0.069
 work-0.068
Token well
Token ID880
Feature activation+3.27e+00

Pos logit contributions
 Rover+0.100
 then+0.099
 thirteen+0.093
 alas+0.092
tons+0.090
Neg logit contributions
 brushes-0.120
 designs-0.117
ies-0.117
ini-0.112
working-0.111
Token-
Token ID12
Feature activation-4.16e+00

Pos logit contributions
 then+0.150
 Rover+0.146
tons+0.134
 thirteen+0.134
 alas+0.134
Neg logit contributions
 brushes-0.160
 designs-0.156
ini-0.151
ies-0.148
rolled-0.145
Tokendone
Token ID28060
Feature activation+1.37e+00

Pos logit contributions
 designs+0.211
 brushes+0.209
ies+0.206
rolled+0.196
packs+0.194
Neg logit contributions
 then-0.178
 Rover-0.176
 thirteen-0.169
Mr-0.166
 alas-0.165
Token-
Token ID12
Feature activation-3.71e+00

Pos logit contributions
 unison+0.030
 thirteen+0.029
 Rover+0.028
 then+0.028
 mercy+0.027
Neg logit contributions
 work-0.046
working-0.045
 brushes-0.045
 designs-0.045
ies-0.043
Tokenyou
Token ID5832
Feature activation+8.64e-01

Pos logit contributions
 brushes+0.184
 designs+0.179
ies+0.177
packs+0.164
 work+0.163
Neg logit contributions
 Rover-0.166
 then-0.162
 thirteen-0.157
 alas-0.154
Mr-0.152
Token'
Token ID6
Feature activation+2.26e+00

Pos logit contributions
 Rover+0.029
 then+0.028
 thirteen+0.027
 unison+0.027
 alas+0.027
Neg logit contributions
 brushes-0.050
 designs-0.049
ies-0.049
working-0.048
 work-0.047
Token to
Token ID284
Feature activation-7.15e-01

Pos logit contributions
 Rover+0.092
 then+0.090
 thirteen+0.086
 alas+0.085
 unison+0.083
Neg logit contributions
 brushes-0.120
 designs-0.116
ies-0.115
working-0.109
ini-0.108
Token his
Token ID465
Feature activation+1.00e+00

Pos logit contributions
 brushes+0.042
ies+0.041
 designs+0.040
bows+0.039
packs+0.039
Neg logit contributions
 then-0.027
 Rover-0.026
 mercy-0.023
 unison-0.023
 thirteen-0.023
Token toast
Token ID27805
Feature activation+6.50e+00

Pos logit contributions
 then+0.049
 Rover+0.049
 unison+0.045
 alas+0.045
 thirteen+0.045
Neg logit contributions
 brushes-0.045
 designs-0.042
ies-0.042
working-0.041
ini-0.041
Token when
Token ID618
Feature activation+1.42e+00

Pos logit contributions
 Rover+0.280
 thirteen+0.267
 then+0.266
 unison+0.259
 alas+0.257
Neg logit contributions
 brushes-0.324
ies-0.318
 designs-0.317
working-0.300
 work-0.298
Token he
Token ID339
Feature activation+2.23e+00

Pos logit contributions
 Rover+0.042
 then+0.042
 alas+0.038
 thirteen+0.037
tons+0.037
Neg logit contributions
 brushes-0.092
 designs-0.089
ies-0.089
ini-0.087
bows-0.085
Token put
Token ID1234
Feature activation+1.19e-01

Pos logit contributions
 then+0.108
 Rover+0.092
 thirteen+0.091
�+0.091
 no+0.090
Neg logit contributions
 brushes-0.106
ini-0.106
 designs-0.104
ugs-0.103
ies-0.100
Token it
Token ID340
Feature activation-3.15e-01

Pos logit contributions
 Rover+0.004
 then+0.004
 thirteen+0.004
 alas+0.004
 unison+0.004
Neg logit contributions
 brushes-0.007
 designs-0.007
ies-0.007
working-0.006
ini-0.006
Token in
Token ID287
Feature activation-1.74e-01

Pos logit contributions
 brushes+0.019
 designs+0.018
ini+0.018
ies+0.017
bows+0.017
Neg logit contributions
 then-0.012
 Rover-0.010
 alas-0.010
 more-0.010
 louder-0.010
Token eat
Token ID4483
Feature activation+3.06e+00

Pos logit contributions
 then+0.014
 Rover+0.014
 end+0.013
 no+0.013
 unison+0.013
Neg logit contributions
ies-0.014
ini-0.014
ugs-0.013
 designs-0.013
 brushes-0.013
Token a
Token ID257
Feature activation+5.36e-01

Pos logit contributions
 Rover+0.143
 then+0.138
 thirteen+0.137
 unison+0.136
 alas+0.134
Neg logit contributions
 brushes-0.145
 designs-0.139
ies-0.139
 work-0.131
working-0.130
Token new
Token ID649
Feature activation+1.76e+00

Pos logit contributions
 Rover+0.025
 then+0.024
 alas+0.023
 mercy+0.023
Mr+0.022
Neg logit contributions
 brushes-0.025
 designs-0.025
ies-0.024
working-0.023
ini-0.023
Token vegetable
Token ID20236
Feature activation+2.79e+00

Pos logit contributions
 then+0.075
 Rover+0.074
 more+0.069
 alas+0.069
 mercy+0.068
Neg logit contributions
 brushes-0.087
 designs-0.083
ies-0.082
bel-0.081
rolled-0.080
Token called
Token ID1444
Feature activation+1.08e+00

Pos logit contributions
 Rover+0.138
 thirteen+0.138
 unison+0.135
 alas+0.130
Mr+0.128
Neg logit contributions
ies-0.123
 brushes-0.122
 designs-0.119
 work-0.117
working-0.114
Token broccoli
Token ID44653
Feature activation+6.49e+00

Pos logit contributions
 Rover+0.042
 then+0.041
 thirteen+0.039
 alas+0.038
 unison+0.038
Neg logit contributions
 brushes-0.059
 designs-0.058
ies-0.058
working-0.055
ini-0.055
Token.
Token ID13
Feature activation+2.95e-01

Pos logit contributions
 Rover+0.303
 thirteen+0.296
 alas+0.291
 then+0.291
 unison+0.287
Neg logit contributions
 brushes-0.301
ies-0.297
 designs-0.293
working-0.281
 work-0.279
Token Lily
Token ID20037
Feature activation-1.07e+00

Pos logit contributions
 Rover+0.011
 then+0.011
 alas+0.010
 thirteen+0.010
 unison+0.010
Neg logit contributions
 brushes-0.016
 designs-0.016
ies-0.016
working-0.016
ini-0.015
Token had
Token ID550
Feature activation-9.99e-01

Pos logit contributions
 brushes+0.047
 designs+0.046
ies+0.045
ini+0.043
working+0.043
Neg logit contributions
 then-0.054
 Rover-0.053
 alas-0.049
Mr-0.049
tons-0.049
Token never
Token ID1239
Feature activation-1.08e+00

Pos logit contributions
ies+0.057
 brushes+0.056
 designs+0.056
ini+0.054
ugs+0.053
Neg logit contributions
 then-0.038
 Rover-0.037
 thirteen-0.035
 alas-0.034
 mercy-0.033
Token seen
Token ID1775
Feature activation-1.25e+00

Pos logit contributions
 designs+0.050
 brushes+0.049
rolled+0.048
ini+0.047
ies+0.047
Neg logit contributions
 then-0.053
 more-0.049
 Rover-0.049
Please-0.048
 mercy-0.048
Token Tom
Token ID4186
Feature activation-2.61e-01

Pos logit contributions
 alas+0.009
 unison+0.009
 unfortunately+0.008
Henry+0.008
 Amen+0.008
Neg logit contributions
 brushes-0.008
 work-0.007
working-0.006
ies-0.006
bows-0.006
Token cried
Token ID16896
Feature activation+5.87e-01

Pos logit contributions
 brushes+0.008
ies+0.008
 work+0.007
rolled+0.007
 designs+0.006
Neg logit contributions
 Rover-0.016
 thirteen-0.016
 alas-0.016
tons-0.015
Mr-0.015
Token and
Token ID290
Feature activation+6.50e-01

Pos logit contributions
 thirteen+0.032
 Rover+0.031
 alas+0.031
tons+0.031
 unison+0.030
Neg logit contributions
ies-0.023
 work-0.023
 brushes-0.022
 designs-0.022
working-0.020
Token begged
Token ID32946
Feature activation+5.18e-01

Pos logit contributions
 thirteen+0.036
 alas+0.035
 unison+0.034
 End+0.034
Mr+0.034
Neg logit contributions
 brushes-0.026
 work-0.022
ies-0.022
 designs-0.021
bel-0.020
Token for
Token ID329
Feature activation+3.36e+00

Pos logit contributions
 thirteen+0.029
rified+0.028
 alas+0.027
 unison+0.027
 Rover+0.027
Neg logit contributions
 work-0.021
ies-0.020
 brushes-0.019
 designs-0.018
packs-0.018
Token mercy
Token ID17703
Feature activation+6.28e+00

Pos logit contributions
 thirteen+0.193
rified+0.189
 alas+0.176
 unison+0.171
tons+0.171
Neg logit contributions
 work-0.142
 brushes-0.127
 designs-0.115
ies-0.110
 design-0.106
Token.
Token ID13
Feature activation-2.00e-02

Pos logit contributions
Mr+0.385
 Rover+0.385
 thirteen+0.379
 alas+0.369
rified+0.358
Neg logit contributions
ies-0.214
 work-0.187
packs-0.185
working-0.168
 brushes-0.160
Token They
Token ID1119
Feature activation+1.12e+00

Pos logit contributions
 brushes+0.001
bel+0.001
rolled+0.001
 designs+0.001
ies+0.001
Neg logit contributions
 thirteen-0.001
 mercy-0.001
 alas-0.001
 Rover-0.001
 course-0.001
Token wished
Token ID16555
Feature activation-8.94e-01

Pos logit contributions
 thirteen+0.065
 Rover+0.062
 alas+0.062
tons+0.061
 unison+0.060
Neg logit contributions
 brushes-0.041
ies-0.037
 work-0.036
 designs-0.033
rolled-0.032
Token they
Token ID484
Feature activation+9.82e-02

Pos logit contributions
 brushes+0.038
ies+0.038
 designs+0.036
 work+0.036
working+0.033
Neg logit contributions
 unison-0.045
 thirteen-0.044
rified-0.043
 Rover-0.043
 alas-0.042
Token had
Token ID550
Feature activation-1.54e+00

Pos logit contributions
 thirteen+0.005
 Rover+0.005
 unison+0.005
 alas+0.005
 louder+0.005
Neg logit contributions
 brushes-0.004
ies-0.003
 designs-0.003
 work-0.003
working-0.003
Token her
Token ID607
Feature activation-9.37e-02

Pos logit contributions
 Rover+0.087
 then+0.087
 louder+0.080
 mercy+0.080
 thirteen+0.080
Neg logit contributions
 brushes-0.078
 designs-0.076
ini-0.075
ies-0.075
working-0.071
Token mom
Token ID1995
Feature activation+1.02e+00

Pos logit contributions
ini+0.006
 brushes+0.005
rolled+0.005
 designs+0.005
bows+0.005
Neg logit contributions
 then-0.003
 Rover-0.003
 more-0.003
 mercy-0.003
 louder-0.003
Token to
Token ID284
Feature activation+1.15e+00

Pos logit contributions
 Rover+0.043
 then+0.043
 thirteen+0.041
 alas+0.040
 unison+0.039
Neg logit contributions
 brushes-0.052
 designs-0.051
ies-0.050
ini-0.048
working-0.048
Token make
Token ID787
Feature activation+2.46e+00

Pos logit contributions
 Rover+0.065
 then+0.063
 thirteen+0.063
 unison+0.062
 alas+0.062
Neg logit contributions
 brushes-0.043
 designs-0.040
ies-0.040
 work-0.038
working-0.037
Token her
Token ID607
Feature activation-1.17e+00

Pos logit contributions
 Rover+0.093
 then+0.092
 thirteen+0.086
 alas+0.084
 mercy+0.083
Neg logit contributions
 brushes-0.137
 designs-0.132
ies-0.132
working-0.128
ini-0.127
Token toast
Token ID27805
Feature activation+6.20e+00

Pos logit contributions
ini+0.068
rolled+0.067
 brushes+0.065
ies+0.065
 designs+0.065
Neg logit contributions
 then-0.043
 more-0.040
 Rover-0.039
 less-0.036
 no-0.036
Token with
Token ID351
Feature activation+1.90e+00

Pos logit contributions
 then+0.225
 Rover+0.213
 louder+0.199
 more+0.197
 alas+0.197
Neg logit contributions
 brushes-0.357
 designs-0.342
working-0.337
ini-0.336
ies-0.333
Token butter
Token ID9215
Feature activation+3.36e+00

Pos logit contributions
 Rover+0.083
 then+0.081
 thirteen+0.078
 alas+0.077
 unison+0.076
Neg logit contributions
 brushes-0.095
 designs-0.092
ies-0.092
working-0.088
ini-0.087
Token and
Token ID290
Feature activation-2.44e-01

Pos logit contributions
 Rover+0.140
 then+0.139
 thirteen+0.130
 alas+0.128
 unison+0.126
Neg logit contributions
 brushes-0.174
 designs-0.168
ies-0.164
working-0.159
ini-0.159
Token jam
Token ID16853
Feature activation+2.64e+00

Pos logit contributions
 brushes+0.013
ies+0.013
 designs+0.013
ini+0.012
working+0.012
Neg logit contributions
 then-0.010
 Rover-0.010
 thirteen-0.009
 alas-0.009
 mercy-0.009
Token.
Token ID13
Feature activation-5.58e-01

Pos logit contributions
 then+0.107
 Rover+0.106
 thirteen+0.099
 alas+0.097
 mercy+0.095
Neg logit contributions
 brushes-0.139
 designs-0.136
ies-0.132
ini-0.131
working-0.129
Token friend
Token ID1545
Feature activation-1.73e-01

Pos logit contributions
ugs+0.118
ini+0.117
working+0.116
 brushes+0.116
ies+0.115
Neg logit contributions
 then-0.078
 Rover-0.074
 no-0.067
 more-0.066
 end-0.064
Token Tommy
Token ID19919
Feature activation+7.68e-01

Pos logit contributions
 brushes+0.009
 designs+0.009
ies+0.009
ini+0.009
working+0.009
Neg logit contributions
 Rover-0.007
 then-0.007
 alas-0.006
 louder-0.006
tons-0.006
Token asked
Token ID1965
Feature activation+1.56e+00

Pos logit contributions
 then+0.033
 Rover+0.032
 louder+0.028
Mr+0.028
tons+0.028
Neg logit contributions
 brushes-0.041
 designs-0.039
ini-0.038
ies-0.038
ugs-0.037
Token her
Token ID607
Feature activation+1.16e+00

Pos logit contributions
 thirteen+0.080
 unison+0.077
 Rover+0.075
 alas+0.074
 then+0.073
Neg logit contributions
ies-0.069
 brushes-0.066
 designs-0.065
 work-0.065
packs-0.062
Token to
Token ID284
Feature activation+1.85e+00

Pos logit contributions
 then+0.046
 Rover+0.045
 louder+0.042
 mercy+0.042
 thirteen+0.041
Neg logit contributions
 brushes-0.063
ini-0.061
 designs-0.060
working-0.059
ies-0.059
Token cheat
Token ID22705
Feature activation+6.16e+00

Pos logit contributions
 then+0.087
 Rover+0.086
 mercy+0.080
 unison+0.079
 alas+0.079
Neg logit contributions
 brushes-0.087
ies-0.087
 designs-0.086
ini-0.084
ugs-0.083
Token on
Token ID319
Feature activation+1.28e+00

Pos logit contributions
 Rover+0.283
 thirteen+0.271
 then+0.270
 alas+0.264
 unison+0.264
Neg logit contributions
 brushes-0.296
ies-0.288
 designs-0.285
working-0.269
 work-0.267
Token a
Token ID257
Feature activation+9.42e-01

Pos logit contributions
 Rover+0.050
 then+0.048
 thirteen+0.045
 unison+0.045
 alas+0.045
Neg logit contributions
 brushes-0.072
ies-0.070
 designs-0.069
ini-0.067
working-0.066
Token test
Token ID1332
Feature activation+2.86e+00

Pos logit contributions
 Rover+0.053
 thirteen+0.053
 then+0.052
 alas+0.052
 unison+0.051
Neg logit contributions
 brushes-0.034
 designs-0.033
 work-0.033
working-0.032
ies-0.031
Token.
Token ID13
Feature activation+4.09e-02

Pos logit contributions
 Rover+0.120
 then+0.120
 thirteen+0.113
 alas+0.112
 unison+0.111
Neg logit contributions
 brushes-0.146
 designs-0.141
ies-0.140
ini-0.134
working-0.133
Token "
Token ID366
Feature activation+1.95e-01

Pos logit contributions
 Rover+0.001
 then+0.001
 �+0.001
 Please+0.001
 louder+0.001
Neg logit contributions
ies-0.003
ini-0.003
 designs-0.003
 brushes-0.003
working-0.003
Token new
Token ID649
Feature activation+7.86e-01

Pos logit contributions
 Rover+0.101
 then+0.099
 thirteen+0.098
 unison+0.096
 alas+0.096
Neg logit contributions
 brushes-0.068
 designs-0.065
ies-0.065
working-0.060
ini-0.059
Token,
Token ID11
Feature activation+5.03e-01

Pos logit contributions
 Rover+0.034
 then+0.033
 thirteen+0.032
 alas+0.031
 unison+0.031
Neg logit contributions
 brushes-0.040
 designs-0.039
ies-0.038
working-0.037
ini-0.036
Token it
Token ID340
Feature activation+8.37e-02

Pos logit contributions
 Rover+0.015
 thirteen+0.014
 unison+0.014
 then+0.014
 alas+0.013
Neg logit contributions
 brushes-0.032
 designs-0.031
ies-0.031
 work-0.030
working-0.030
Token was
Token ID373
Feature activation+7.63e-01

Pos logit contributions
 then+0.004
 Rover+0.004
 louder+0.004
Please+0.004
tons+0.004
Neg logit contributions
 brushes-0.004
 designs-0.004
ugs-0.004
ies-0.004
packs-0.004
Token called
Token ID1444
Feature activation+8.71e-01

Pos logit contributions
 Rover+0.030
 then+0.029
 thirteen+0.028
 alas+0.028
 unison+0.027
Neg logit contributions
 brushes-0.042
 designs-0.041
ies-0.040
working-0.038
ini-0.038
Token oxygen
Token ID11863
Feature activation+6.12e+00

Pos logit contributions
 Rover+0.033
 then+0.033
 thirteen+0.031
 alas+0.031
 unison+0.031
Neg logit contributions
 brushes-0.048
 designs-0.046
ies-0.046
working-0.044
 work-0.044
Token.
Token ID13
Feature activation+2.55e-02

Pos logit contributions
 thirteen+0.302
 Rover+0.301
 alas+0.295
Mr+0.290
 then+0.289
Neg logit contributions
ies-0.267
 brushes-0.256
 designs-0.256
 work-0.246
working-0.244
Token He
Token ID679
Feature activation+1.42e+00

Pos logit contributions
 Rover+0.001
 then+0.001
 thirteen+0.001
 mercy+0.001
 alas+0.001
Neg logit contributions
 brushes-0.001
 designs-0.001
ies-0.001
working-0.001
 work-0.001
Token was
Token ID373
Feature activation+5.81e-02

Pos logit contributions
 then+0.073
 Rover+0.071
 unison+0.065
 alas+0.064
tons+0.064
Neg logit contributions
 brushes-0.064
 designs-0.062
ies-0.060
bows-0.058
ugs-0.058
Token a
Token ID257
Feature activation-6.99e-01

Pos logit contributions
 Rover+0.002
 unison+0.002
 thirteen+0.002
 alas+0.002
 mercy+0.002
Neg logit contributions
 brushes-0.003
 designs-0.003
ies-0.003
 work-0.003
working-0.003
Token bit
Token ID1643
Feature activation+1.88e+00

Pos logit contributions
 work+0.024
 designs+0.023
working+0.023
 design+0.023
ies+0.022
Neg logit contributions
 thirteen-0.040
 Rover-0.039
rified-0.039
 alas-0.038
 mercy-0.038
Token lion
Token ID18744
Feature activation+2.14e+00

Pos logit contributions
 thirteen+0.064
 Rover+0.064
 alas+0.063
 then+0.063
 unison+0.062
Neg logit contributions
 brushes-0.071
ies-0.070
 designs-0.068
 work-0.066
working-0.066
Token.
Token ID13
Feature activation+3.26e-01

Pos logit contributions
 thirteen+0.090
 Rover+0.089
 alas+0.086
 unison+0.085
 mercy+0.085
Neg logit contributions
ies-0.108
 brushes-0.106
 designs-0.104
working-0.102
 work-0.098
Token He
Token ID679
Feature activation+1.85e+00

Pos logit contributions
 Rover+0.013
 then+0.012
 thirteen+0.012
 alas+0.012
 unison+0.012
Neg logit contributions
 brushes-0.018
 designs-0.017
ies-0.017
working-0.017
ini-0.016
Token saw
Token ID2497
Feature activation+9.43e-02

Pos logit contributions
 then+0.091
 Rover+0.087
Please+0.079
Mr+0.079
 thirteen+0.079
Neg logit contributions
 brushes-0.090
ugs-0.087
 designs-0.086
ini-0.085
ies-0.084
Token a
Token ID257
Feature activation+8.24e-01

Pos logit contributions
 Rover+0.004
 unison+0.004
tons+0.004
 alas+0.004
 thirteen+0.004
Neg logit contributions
 brushes-0.005
 designs-0.005
ies-0.004
working-0.004
 work-0.004
Token steak
Token ID26320
Feature activation+6.10e+00

Pos logit contributions
 Rover+0.050
 then+0.049
 thirteen+0.049
 alas+0.048
 unison+0.048
Neg logit contributions
 brushes-0.027
ies-0.026
 designs-0.026
working-0.024
 work-0.024
Token lying
Token ID9105
Feature activation+1.51e+00

Pos logit contributions
 Rover+0.300
 thirteen+0.293
 unison+0.293
 alas+0.286
 mercy+0.282
Neg logit contributions
ies-0.275
 designs-0.258
 brushes-0.258
 work-0.253
working-0.253
Token in
Token ID287
Feature activation+1.14e+00

Pos logit contributions
 Rover+0.070
 unison+0.067
 thirteen+0.067
 then+0.066
 mercy+0.066
Neg logit contributions
 brushes-0.070
ies-0.069
 designs-0.067
working-0.066
ini-0.064
Token the
Token ID262
Feature activation+1.33e+00

Pos logit contributions
 Rover+0.049
 then+0.047
tons+0.047
 thirteen+0.047
 alas+0.047
Neg logit contributions
 brushes-0.058
 designs-0.055
 work-0.054
ies-0.053
working-0.051
Token grass
Token ID8701
Feature activation+2.42e+00

Pos logit contributions
 unison+0.062
 Rover+0.062
 alas+0.062
 then+0.062
tons+0.061
Neg logit contributions
 brushes-0.061
 designs-0.056
ies-0.055
working-0.054
 work-0.054
Token.
Token ID13
Feature activation+3.88e-01

Pos logit contributions
 Rover+0.106
 then+0.104
 thirteen+0.100
 alas+0.099
 unison+0.098
Neg logit contributions
 brushes-0.120
 designs-0.116
ies-0.116
working-0.110
ini-0.109
Token asked
Token ID1965
Feature activation+1.06e+00

Pos logit contributions
 then+0.092
 Rover+0.086
Please+0.082
Mr+0.081
 no+0.078
Neg logit contributions
 brushes-0.093
 designs-0.092
ugs-0.088
bows-0.088
ini-0.086
Token his
Token ID465
Feature activation+1.91e+00

Pos logit contributions
 Rover+0.042
 then+0.041
 mercy+0.037
 thirteen+0.037
 alas+0.037
Neg logit contributions
 brushes-0.060
 designs-0.058
ies-0.058
ini-0.057
working-0.055
Token mum
Token ID25682
Feature activation+1.88e+00

Pos logit contributions
 Rover+0.080
 then+0.079
 mercy+0.074
 thirteen+0.074
 unison+0.072
Neg logit contributions
 brushes-0.100
ies-0.096
 designs-0.095
working-0.092
ini-0.091
Token for
Token ID329
Feature activation+4.78e+00

Pos logit contributions
 Rover+0.083
 unison+0.079
 then+0.079
 thirteen+0.079
 alas+0.078
Neg logit contributions
 brushes-0.093
ies-0.091
 designs-0.090
working-0.086
 work-0.085
Token some
Token ID617
Feature activation+1.34e+00

Pos logit contributions
rified+0.258
 thirteen+0.247
 unison+0.245
 Rover+0.238
tons+0.229
Neg logit contributions
 work-0.212
 brushes-0.196
 designs-0.183
 design-0.180
ies-0.175
Token meat
Token ID6174
Feature activation+6.09e+00

Pos logit contributions
 thirteen+0.055
 Rover+0.055
 unison+0.054
 alas+0.054
 then+0.053
Neg logit contributions
 brushes-0.071
 work-0.069
 designs-0.068
ies-0.068
working-0.066
Token.
Token ID13
Feature activation+6.59e-01

Pos logit contributions
 thirteen+0.297
 unison+0.294
 Rover+0.294
 alas+0.285
 then+0.280
Neg logit contributions
ies-0.273
 brushes-0.265
 designs-0.259
working-0.250
 work-0.248
Token But
Token ID887
Feature activation+2.59e+00

Pos logit contributions
 Rover+0.027
 then+0.026
 alas+0.024
 thirteen+0.024
 ï¿½+0.024
Neg logit contributions
ies-0.035
 brushes-0.035
 designs-0.035
ini-0.033
working-0.033
Token his
Token ID465
Feature activation-6.58e-01

Pos logit contributions
 unison+0.127
 thirteen+0.127
 Rover+0.124
 mercy+0.122
Mr+0.117
Neg logit contributions
ies-0.114
 brushes-0.113
 designs-0.108
 work-0.104
working-0.103
Token mum
Token ID25682
Feature activation-3.75e-01

Pos logit contributions
 brushes+0.033
ies+0.032
 designs+0.032
 work+0.031
working+0.030
Neg logit contributions
 Rover-0.029
 thirteen-0.028
 then-0.027
 alas-0.027
 unison-0.027
Token said
Token ID531
Feature activation+1.26e+00

Pos logit contributions
 brushes+0.018
 designs+0.017
ies+0.017
working+0.016
ini+0.016
Neg logit contributions
 Rover-0.017
 then-0.017
 thirteen-0.016
 alas-0.016
 unison-0.016
Token,
Token ID11
Feature activation+4.00e-01

Pos logit contributions
 brushes+0.031
 designs+0.031
ini+0.031
ies+0.030
bel+0.029
Neg logit contributions
 then-0.018
 Rover-0.018
 alas-0.016
tons-0.016
 thirteen-0.015
Token her
Token ID607
Feature activation-1.91e+00

Pos logit contributions
 then+0.018
 Rover+0.018
 alas+0.017
 thirteen+0.017
tons+0.017
Neg logit contributions
 brushes-0.019
 designs-0.019
ies-0.019
ini-0.018
working-0.018
Token mom
Token ID1995
Feature activation-2.28e-01

Pos logit contributions
 disl+0.118
 brushes+0.117
bows+0.115
rolled+0.115
ini+0.115
Neg logit contributions
 then-0.056
 Rover-0.056
 no-0.048
 end-0.047
 vet-0.047
Token gave
Token ID2921
Feature activation+1.05e+00

Pos logit contributions
 designs+0.012
 brushes+0.012
bel+0.012
bows+0.011
packs+0.011
Neg logit contributions
 then-0.010
 Rover-0.009
Mr-0.009
 unfortunately-0.008
 no-0.008
Token her
Token ID607
Feature activation-9.77e-01

Pos logit contributions
 Rover+0.056
 then+0.055
 thirteen+0.052
 alas+0.052
 mercy+0.051
Neg logit contributions
 brushes-0.043
 designs-0.041
ies-0.040
working-0.039
ini-0.039
Token toast
Token ID27805
Feature activation+6.09e+00

Pos logit contributions
ini+0.065
ies+0.064
 designs+0.064
rolled+0.063
 brushes+0.063
Neg logit contributions
 then-0.027
 Rover-0.025
 more-0.024
 no-0.021
 mercy-0.021
Token with
Token ID351
Feature activation+2.01e+00

Pos logit contributions
 then+0.251
 Rover+0.236
 alas+0.212
 more+0.212
 mercy+0.209
Neg logit contributions
 brushes-0.325
 designs-0.321
ies-0.312
ini-0.311
working-0.309
Token butter
Token ID9215
Feature activation+3.40e+00

Pos logit contributions
 Rover+0.091
 then+0.090
 thirteen+0.086
 alas+0.084
 unison+0.084
Neg logit contributions
 brushes-0.096
ies-0.094
 designs-0.093
working-0.089
ini-0.088
Token on
Token ID319
Feature activation-4.97e-01

Pos logit contributions
 then+0.127
 Rover+0.123
 thirteen+0.109
 alas+0.109
Mr+0.108
Neg logit contributions
 brushes-0.193
 designs-0.187
working-0.179
ini-0.179
ies-0.177
Token it
Token ID340
Feature activation+5.89e-01

Pos logit contributions
 brushes+0.023
 designs+0.022
ies+0.022
working+0.021
ini+0.021
Neg logit contributions
 Rover-0.024
 then-0.023
 thirteen-0.023
 alas-0.022
 unison-0.022
Token.
Token ID13
Feature activation-1.45e-01

Pos logit contributions
 then+0.022
 Rover+0.020
 alas+0.019
 unfortunately+0.018
 no+0.018
Neg logit contributions
 designs-0.034
 brushes-0.034
ies-0.033
ini-0.033
rolled-0.031
Token was
Token ID373
Feature activation+1.46e+00

Pos logit contributions
 brushes+0.091
 designs+0.089
ies+0.087
working+0.086
bel+0.082
Neg logit contributions
 then-0.091
 Rover-0.090
tons-0.084
 alas-0.083
 unison-0.081
Token a
Token ID257
Feature activation+5.96e-01

Pos logit contributions
 Rover+0.059
 then+0.057
 thirteen+0.057
 alas+0.055
 mercy+0.055
Neg logit contributions
 brushes-0.078
 designs-0.075
ies-0.074
working-0.071
 work-0.071
Token young
Token ID1862
Feature activation+6.98e-01

Pos logit contributions
 Rover+0.033
 then+0.032
 thirteen+0.032
 alas+0.031
 unison+0.031
Neg logit contributions
 brushes-0.023
 designs-0.022
ies-0.022
working-0.020
ini-0.020
Token bear
Token ID6842
Feature activation+8.92e-01

Pos logit contributions
 thirteen+0.033
 alas+0.033
 Rover+0.033
 unison+0.032
 then+0.032
Neg logit contributions
ies-0.032
 brushes-0.031
 designs-0.031
working-0.030
ini-0.029
Token called
Token ID1444
Feature activation+7.93e-01

Pos logit contributions
 Rover+0.034
 then+0.034
 thirteen+0.032
 alas+0.031
 unison+0.031
Neg logit contributions
 brushes-0.050
 designs-0.048
ies-0.048
working-0.045
ini-0.045
Token Be
Token ID1355
Feature activation+6.08e+00

Pos logit contributions
 then+0.031
 Rover+0.031
 thirteen+0.030
 unison+0.030
 alas+0.030
Neg logit contributions
 brushes-0.042
 designs-0.041
ies-0.041
working-0.039
 work-0.039
Tokenart
Token ID433
Feature activation+2.18e+00

Pos logit contributions
 Rover+0.262
 then+0.251
 louder+0.240
 thirteen+0.239
Mr+0.237
Neg logit contributions
 brushes-0.316
 designs-0.297
ies-0.287
working-0.280
ini-0.279
Tokenie
Token ID494
Feature activation+1.60e-01

Pos logit contributions
 Rover+0.099
 then+0.097
 thirteen+0.094
 alas+0.093
 unison+0.092
Neg logit contributions
 brushes-0.104
ies-0.101
 designs-0.101
working-0.096
ini-0.095
Token.
Token ID13
Feature activation+7.14e-01

Pos logit contributions
 thirteen+0.009
 alas+0.009
 Rover+0.009
 unison+0.008
tons+0.008
Neg logit contributions
ies-0.006
 designs-0.006
working-0.006
 work-0.005
 brushes-0.005
Token Be
Token ID1355
Feature activation+3.91e+00

Pos logit contributions
 Rover+0.027
 then+0.026
 thirteen+0.025
 alas+0.024
 unison+0.024
Neg logit contributions
 brushes-0.041
ies-0.039
 designs-0.039
working-0.037
ini-0.037
Tokenart
Token ID433
Feature activation+1.53e-01

Pos logit contributions
 then+0.137
 Rover+0.137
 louder+0.130
 no+0.124
Mr+0.124
Neg logit contributions
 brushes-0.232
 designs-0.217
 artwork-0.205
 work-0.204
bows-0.203
Token saw
Token ID2497
Feature activation+3.67e-01

Pos logit contributions
 then+0.029
 Rover+0.028
Mr+0.025
 louder+0.024
Please+0.024
Neg logit contributions
 designs-0.033
 brushes-0.033
working-0.032
ugs-0.032
ini-0.032
Token a
Token ID257
Feature activation+9.72e-01

Pos logit contributions
 Rover+0.014
 then+0.014
 thirteen+0.014
 alas+0.013
 unison+0.013
Neg logit contributions
 brushes-0.020
 designs-0.019
ies-0.019
working-0.018
ini-0.018
Token big
Token ID1263
Feature activation+1.13e+00

Pos logit contributions
 Rover+0.056
 then+0.055
 thirteen+0.054
 alas+0.053
 unison+0.052
Neg logit contributions
 brushes-0.035
ies-0.034
 designs-0.034
ini-0.031
working-0.031
Token plate
Token ID7480
Feature activation+2.77e+00

Pos logit contributions
 Rover+0.057
 alas+0.055
 thirteen+0.055
 then+0.054
 unison+0.054
Neg logit contributions
 brushes-0.049
ies-0.048
 designs-0.047
 work-0.045
working-0.043
Token of
Token ID286
Feature activation-4.65e-01

Pos logit contributions
 Rover+0.144
 thirteen+0.140
 unison+0.138
 alas+0.138
tons+0.135
Neg logit contributions
ies-0.114
 brushes-0.113
ade-0.104
working-0.103
 work-0.103
Token steak
Token ID26320
Feature activation+6.07e+00

Pos logit contributions
 brushes+0.021
 designs+0.020
ies+0.019
 work+0.018
working+0.018
Neg logit contributions
 Rover-0.023
 thirteen-0.023
 unison-0.023
 alas-0.023
 then-0.023
Token on
Token ID319
Feature activation+6.37e-01

Pos logit contributions
 Rover+0.271
 then+0.262
 thirteen+0.257
 alas+0.253
 unison+0.252
Neg logit contributions
 brushes-0.297
ies-0.289
 designs-0.287
working-0.272
 work-0.268
Token the
Token ID262
Feature activation+9.43e-01

Pos logit contributions
 Rover+0.027
 thirteen+0.026
 then+0.026
 alas+0.026
 mercy+0.025
Neg logit contributions
 brushes-0.033
 designs-0.031
 work-0.030
ies-0.030
working-0.029
Token table
Token ID3084
Feature activation+2.40e+00

Pos logit contributions
 alas+0.060
 unison+0.058
 thirteen+0.056
 louder+0.056
 unfortunately+0.056
Neg logit contributions
 brushes-0.029
 work-0.028
working-0.027
 designs-0.027
ies-0.025
Token.
Token ID13
Feature activation-4.96e-02

Pos logit contributions
 Rover+0.112
 then+0.108
 thirteen+0.106
 alas+0.105
 unison+0.104
Neg logit contributions
 brushes-0.113
ies-0.109
 designs-0.109
working-0.103
 work-0.101
Token It
Token ID632
Feature activation-4.79e-01

Pos logit contributions
ies+0.003
working+0.003
 designs+0.003
ini+0.003
bel+0.003
Neg logit contributions
 then-0.001
 ï¿½-0.001
 Rover-0.001
 Please-0.001
Mr-0.001
Token her
Token ID607
Feature activation-2.09e+00

Pos logit contributions
 then+0.009
 Rover+0.009
 alas+0.009
tons+0.008
 thirteen+0.008
Neg logit contributions
 brushes-0.009
 designs-0.009
ies-0.009
ini-0.009
working-0.008
Token mom
Token ID1995
Feature activation-3.69e-01

Pos logit contributions
rolled+0.132
ini+0.128
 designs+0.127
bows+0.127
 disl+0.125
Neg logit contributions
 then-0.064
 Rover-0.058
 no-0.054
 end-0.053
 more-0.052
Tokenmy
Token ID1820
Feature activation-1.21e+00

Pos logit contributions
 designs+0.020
bel+0.020
 brushes+0.020
rolled+0.019
packs+0.019
Neg logit contributions
 then-0.017
Mr-0.014
 Rover-0.014
 more-0.013
 no-0.013
Token made
Token ID925
Feature activation+1.19e+00

Pos logit contributions
 designs+0.065
 brushes+0.064
bel+0.063
packs+0.061
bows+0.061
Neg logit contributions
 then-0.056
 Rover-0.049
Mr-0.047
'-0.043
 unfortunately-0.043
Token her
Token ID607
Feature activation-1.15e+00

Pos logit contributions
 then+0.047
 Rover+0.045
 louder+0.043
 more+0.042
Mr+0.041
Neg logit contributions
 brushes-0.064
 designs-0.063
ies-0.062
working-0.062
ini-0.062
Token toast
Token ID27805
Feature activation+6.06e+00

Pos logit contributions
ini+0.075
ies+0.074
rolled+0.073
ugs+0.072
 designs+0.071
Neg logit contributions
 then-0.035
 more-0.033
 no-0.029
 Rover-0.028
 less-0.028
Token but
Token ID475
Feature activation+2.27e+00

Pos logit contributions
 then+0.214
 Rover+0.191
 more+0.186
 alas+0.181
 louder+0.178
Neg logit contributions
 brushes-0.358
 designs-0.345
working-0.343
ini-0.339
rolled-0.337
Token forgot
Token ID16453
Feature activation+1.51e+00

Pos logit contributions
 then+0.097
 Rover+0.091
 alas+0.088
tons+0.086
 unfortunately+0.086
Neg logit contributions
 brushes-0.119
 designs-0.117
ini-0.115
ies-0.114
working-0.112
Token to
Token ID284
Feature activation-1.40e+00

Pos logit contributions
 Rover+0.066
 then+0.065
 thirteen+0.062
 alas+0.061
 mercy+0.060
Neg logit contributions
 brushes-0.075
 designs-0.074
ies-0.073
working-0.071
ini-0.071
Token warm
Token ID5814
Feature activation+1.57e+00

Pos logit contributions
 brushes+0.058
 designs+0.057
ies+0.055
ini+0.053
working+0.052
Neg logit contributions
 then-0.073
 Rover-0.073
 thirteen-0.069
 alas-0.069
 unison-0.068
Token up
Token ID510
Feature activation+8.83e-01

Pos logit contributions
 Rover+0.070
 then+0.070
tons+0.062
 alas+0.062
 louder+0.062
Neg logit contributions
 brushes-0.079
 designs-0.076
working-0.075
ini-0.075
bel-0.075
Token butter
Token ID9215
Feature activation+3.51e+00

Pos logit contributions
 brushes+0.025
 designs+0.024
ies+0.023
working+0.022
ini+0.022
Neg logit contributions
 Rover-0.033
 then-0.032
 thirteen-0.031
 alas-0.031
 unison-0.031
Token to
Token ID284
Feature activation-3.33e-01

Pos logit contributions
 Rover+0.151
 then+0.148
 thirteen+0.141
 alas+0.139
 unison+0.138
Neg logit contributions
 brushes-0.179
 designs-0.173
ies-0.171
working-0.163
ini-0.162
Token spread
Token ID4104
Feature activation+5.82e-01

Pos logit contributions
ies+0.013
 brushes+0.013
 designs+0.013
bows+0.013
packs+0.013
Neg logit contributions
 then-0.019
 Rover-0.018
Mr-0.017
 alas-0.017
 unison-0.016
Token on
Token ID319
Feature activation-3.42e-01

Pos logit contributions
 Rover+0.025
 then+0.024
 thirteen+0.024
 unison+0.023
 alas+0.023
Neg logit contributions
 brushes-0.030
 designs-0.029
ies-0.029
working-0.027
 work-0.027
Token his
Token ID465
Feature activation+2.41e-01

Pos logit contributions
 brushes+0.017
 designs+0.016
 work+0.016
ies+0.016
 creations+0.015
Neg logit contributions
 Rover-0.015
 thirteen-0.014
 unison-0.014
 mercy-0.014
 alas-0.014
Token toast
Token ID27805
Feature activation+6.06e+00

Pos logit contributions
 Rover+0.012
 then+0.012
 thirteen+0.011
 alas+0.011
 unison+0.011
Neg logit contributions
 brushes-0.011
 designs-0.010
ies-0.010
working-0.010
ini-0.010
Token.
Token ID13
Feature activation+9.38e-02

Pos logit contributions
 Rover+0.288
 thirteen+0.282
 then+0.274
 unison+0.273
 alas+0.270
Neg logit contributions
 brushes-0.280
 designs-0.271
ies-0.271
working-0.256
 work-0.253
Token He
Token ID679
Feature activation+1.34e+00

Pos logit contributions
 Rover+0.004
 then+0.004
 thirteen+0.003
 alas+0.003
 unison+0.003
Neg logit contributions
 brushes-0.005
ies-0.005
 designs-0.005
working-0.005
ini-0.005
Token took
Token ID1718
Feature activation-4.10e-01

Pos logit contributions
 then+0.070
 Rover+0.060
Mr+0.055
 no+0.054
tons+0.054
Neg logit contributions
bows-0.063
 designs-0.060
ugs-0.060
working-0.060
ini-0.059
Token it
Token ID340
Feature activation+1.86e-01

Pos logit contributions
 brushes+0.026
ies+0.025
 designs+0.025
ini+0.024
bows+0.024
Neg logit contributions
 then-0.014
 Rover-0.013
 thirteen-0.012
 alas-0.012
 louder-0.012
Token and
Token ID290
Feature activation+4.67e-01

Pos logit contributions
 then+0.007
 Rover+0.007
 more+0.006
 louder+0.006
 alas+0.006
Neg logit contributions
 brushes-0.011
 designs-0.010
ini-0.010
 disl-0.010
ies-0.010
Token to
Token ID284
Feature activation-6.91e-01

Pos logit contributions
ini+0.020
ies+0.019
rolled+0.019
ouses+0.019
bows+0.019
Neg logit contributions
 then-0.012
 Rover-0.012
 more-0.011
Please-0.011
Mr-0.011
Token eat
Token ID4483
Feature activation+2.08e+00

Pos logit contributions
ies+0.040
ini+0.040
ugs+0.039
inks+0.039
packs+0.037
Neg logit contributions
 then-0.029
 end-0.025
 no-0.024
 more-0.024
 Rover-0.024
Token jam
Token ID16853
Feature activation+3.48e+00

Pos logit contributions
 Rover+0.092
 then+0.090
 thirteen+0.084
Mr+0.083
tons+0.082
Neg logit contributions
 brushes-0.102
 designs-0.098
ies-0.097
ini-0.096
working-0.096
Token on
Token ID319
Feature activation-1.49e-01

Pos logit contributions
 then+0.141
 Rover+0.141
 thirteen+0.129
 alas+0.128
tons+0.126
Neg logit contributions
 brushes-0.185
 designs-0.179
ies-0.175
ini-0.171
working-0.171
Token her
Token ID607
Feature activation-1.04e+00

Pos logit contributions
 brushes+0.007
 designs+0.007
ies+0.007
working+0.007
ini+0.007
Neg logit contributions
 Rover-0.007
 then-0.006
 thirteen-0.006
 alas-0.006
 unison-0.006
Token toast
Token ID27805
Feature activation+6.06e+00

Pos logit contributions
rolled+0.066
bows+0.064
ini+0.064
 designs+0.063
 brushes+0.063
Neg logit contributions
 then-0.030
 Rover-0.027
 more-0.025
Please-0.024
 no-0.023
Token every
Token ID790
Feature activation-1.82e+00

Pos logit contributions
 Rover+0.277
 then+0.270
 thirteen+0.262
 alas+0.257
 unison+0.257
Neg logit contributions
 brushes-0.291
 designs-0.281
ies-0.279
working-0.265
ini-0.261
Token morning
Token ID3329
Feature activation+2.42e-01

Pos logit contributions
 brushes+0.079
 designs+0.077
ies+0.074
ini+0.070
bows+0.070
Neg logit contributions
 then-0.097
 Rover-0.094
 end-0.088
 alas-0.087
Please-0.087
Token.
Token ID13
Feature activation+8.02e-02

Pos logit contributions
 Rover+0.011
 then+0.010
 thirteen+0.010
 alas+0.010
 unison+0.010
Neg logit contributions
 brushes-0.012
 designs-0.012
ies-0.012
working-0.011
ini-0.011
Token One
Token ID1881
Feature activation-6.61e-01

Pos logit contributions
 Rover+0.003
 then+0.003
 thirteen+0.003
 alas+0.003
 unison+0.003
Neg logit contributions
 brushes-0.005
ies-0.004
 designs-0.004
working-0.004
ini-0.004
Token day
Token ID1110
Feature activation-5.89e-01

Pos logit contributions
 designs+0.028
 brushes+0.028
rolled+0.026
ies+0.026
bel+0.025
Neg logit contributions
 then-0.036
 Rover-0.034
 end-0.033
 more-0.032
tons-0.031
Token big
Token ID1263
Feature activation+1.48e+00

Pos logit contributions
 brushes+0.025
 designs+0.024
ini+0.024
ies+0.024
rolled+0.023
Neg logit contributions
 then-0.019
 Rover-0.019
 alas-0.017
 thirteen-0.017
 mercy-0.017
Token brother
Token ID3956
Feature activation+2.62e+00

Pos logit contributions
 Rover+0.078
 then+0.076
 thirteen+0.074
 alas+0.074
 unison+0.074
Neg logit contributions
 brushes-0.061
ies-0.059
 designs-0.059
 work-0.055
working-0.055
Token to
Token ID284
Feature activation+7.73e-01

Pos logit contributions
 Rover+0.115
 then+0.111
 thirteen+0.111
 unison+0.110
 alas+0.109
Neg logit contributions
 brushes-0.128
ies-0.124
 designs-0.123
working-0.118
 work-0.116
Token make
Token ID787
Feature activation+2.01e+00

Pos logit contributions
 Rover+0.040
 then+0.039
 thirteen+0.039
 alas+0.038
 unison+0.038
Neg logit contributions
 brushes-0.032
 designs-0.030
ies-0.030
 work-0.029
working-0.029
Token her
Token ID607
Feature activation-1.12e+00

Pos logit contributions
 Rover+0.090
 then+0.088
 thirteen+0.083
 alas+0.082
 mercy+0.082
Neg logit contributions
 brushes-0.099
 designs-0.095
ies-0.095
working-0.092
ini-0.091
Token toast
Token ID27805
Feature activation+6.05e+00

Pos logit contributions
 brushes+0.065
 designs+0.063
ini+0.062
ies+0.062
rolled+0.062
Neg logit contributions
 then-0.040
 Rover-0.039
 thirteen-0.035
 alas-0.035
 louder-0.035
Token.
Token ID13
Feature activation+1.58e-01

Pos logit contributions
 then+0.219
 Rover+0.205
 louder+0.199
 more+0.195
 alas+0.194
Neg logit contributions
 brushes-0.352
 designs-0.336
ini-0.333
working-0.330
ies-0.326
Token
Token ID198
Feature activation+4.51e-01

Pos logit contributions
 Rover+0.005
 then+0.005
Mr+0.004
 alas+0.004
 thirteen+0.004
Neg logit contributions
 brushes-0.010
ies-0.010
 designs-0.010
working-0.009
ini-0.009
Token
Token ID198
Feature activation+8.66e-01

Pos logit contributions
 Rover+0.015
 then+0.014
 thirteen+0.013
 alas+0.013
 unison+0.013
Neg logit contributions
 brushes-0.027
 designs-0.027
ies-0.027
working-0.026
ini-0.025
TokenAt
Token ID2953
Feature activation-1.02e+00

Pos logit contributions
 Rover+0.033
 then+0.031
 thirteen+0.031
 mercy+0.030
 alas+0.030
Neg logit contributions
 brushes-0.049
ies-0.047
 designs-0.047
working-0.045
ini-0.045
Token first
Token ID717
Feature activation+3.85e-01

Pos logit contributions
 work+0.049
 brushes+0.048
ies+0.045
working+0.045
 designs+0.044
Neg logit contributions
 thirteen-0.048
 unison-0.047
 Rover-0.046
 alas-0.045
rified-0.045
Token and
Token ID290
Feature activation+6.14e-01

Pos logit contributions
 brushes+0.002
ies+0.002
 designs+0.002
working+0.002
 work+0.002
Neg logit contributions
 Rover-0.002
 thirteen-0.002
 unison-0.002
 alas-0.002
Mr-0.002
Token wanted
Token ID2227
Feature activation-1.09e+00

Pos logit contributions
 then+0.031
 Rover+0.029
Please+0.028
 Please+0.027
tons+0.027
Neg logit contributions
 designs-0.030
ies-0.029
ini-0.029
packs-0.028
 creations-0.028
Token to
Token ID284
Feature activation-3.59e-01

Pos logit contributions
 brushes+0.048
 designs+0.046
ies+0.046
working+0.044
 work+0.043
Neg logit contributions
 Rover-0.054
 then-0.053
 thirteen-0.052
 unison-0.051
 alas-0.051
Token eat
Token ID4483
Feature activation+2.47e+00

Pos logit contributions
ies+0.018
ini+0.018
ugs+0.017
 designs+0.017
 brushes+0.017
Neg logit contributions
 then-0.017
 Rover-0.016
 end-0.015
 no-0.015
 mercy-0.015
Token her
Token ID607
Feature activation-1.38e+00

Pos logit contributions
 Rover+0.118
 thirteen+0.116
 unison+0.114
 then+0.113
 alas+0.113
Neg logit contributions
 brushes-0.113
 designs-0.108
ies-0.107
 work-0.103
working-0.098
Token steak
Token ID26320
Feature activation+6.04e+00

Pos logit contributions
ini+0.071
rolled+0.069
 brushes+0.069
bel+0.068
bows+0.067
Neg logit contributions
 then-0.055
 Rover-0.054
 more-0.053
 steak-0.050
 no-0.048
Token right
Token ID826
Feature activation+2.33e+00

Pos logit contributions
 Rover+0.259
 then+0.252
 thirteen+0.244
 alas+0.239
 unison+0.238
Neg logit contributions
 brushes-0.308
 designs-0.298
ies-0.296
working-0.282
ini-0.279
Token away
Token ID1497
Feature activation+1.21e+00

Pos logit contributions
 Rover+0.095
 thirteen+0.092
 unison+0.089
 mercy+0.087
 alas+0.087
Neg logit contributions
ies-0.123
 brushes-0.122
 designs-0.118
working-0.114
 work-0.114
Token.
Token ID13
Feature activation+2.45e-01

Pos logit contributions
 Rover+0.057
 thirteen+0.055
 then+0.055
 unison+0.054
 alas+0.054
Neg logit contributions
 brushes-0.055
ies-0.054
 designs-0.053
working-0.051
 work-0.050
Token She
Token ID1375
Feature activation-2.24e-01

Pos logit contributions
 then+0.009
 Rover+0.008
 ï¿½+0.008
 alas+0.008
 Please+0.007
Neg logit contributions
ies-0.015
 designs-0.014
bel-0.014
working-0.014
ini-0.014
Token nodded
Token ID14464
Feature activation+1.06e+00

Pos logit contributions
 brushes+0.009
 designs+0.009
ies+0.009
ini+0.008
working+0.008
Neg logit contributions
 Rover-0.012
 then-0.012
 thirteen-0.011
 alas-0.011
 unison-0.011
Token.
Token ID13
Feature activation-7.33e-01

Pos logit contributions
 brushes+0.006
 designs+0.005
ies+0.005
ini+0.005
working+0.005
Neg logit contributions
 then-0.005
 Rover-0.005
 thirteen-0.004
 alas-0.004
 unison-0.004
Token He
Token ID679
Feature activation+8.19e-01

Pos logit contributions
 brushes+0.037
 designs+0.034
ies+0.033
packs+0.032
rolled+0.032
Neg logit contributions
 mercy-0.031
 Rover-0.030
 thirteen-0.030
 then-0.029
 alas-0.029
Token laughed
Token ID13818
Feature activation-2.64e-01

Pos logit contributions
 then+0.040
 Rover+0.039
 thirteen+0.037
 unison+0.036
 mercy+0.036
Neg logit contributions
 brushes-0.037
 designs-0.036
ies-0.036
ini-0.036
ugs-0.035
Token with
Token ID351
Feature activation-3.95e-01

Pos logit contributions
ies+0.011
 designs+0.011
 brushes+0.010
 work+0.010
bel+0.010
Neg logit contributions
 mercy-0.013
 unison-0.013
 thirteen-0.013
 Rover-0.013
 alas-0.013
Token g
Token ID308
Feature activation-8.11e-01

Pos logit contributions
 brushes+0.019
 designs+0.019
 work+0.018
ies+0.018
 creations+0.016
Neg logit contributions
 Rover-0.017
 thirteen-0.017
 alas-0.017
tons-0.017
 then-0.017
Tokenlee
Token ID7197
Feature activation-7.86e+00

Pos logit contributions
 brushes+0.035
 works+0.035
bel+0.034
 collect+0.033
packs+0.033
Neg logit contributions
der-0.039
 thirteen-0.037
aw-0.037
 then-0.036
 unison-0.036
Token.
Token ID13
Feature activation-6.32e-01

Pos logit contributions
 brushes+0.412
 designs+0.398
ies+0.391
ini+0.377
working+0.375
Neg logit contributions
 Rover-0.327
 then-0.324
 thirteen-0.303
 unison-0.296
 alas-0.296
Token
Token ID220
Feature activation-3.71e+00

Pos logit contributions
 brushes+0.035
 designs+0.034
ies+0.033
working+0.032
ini+0.032
Neg logit contributions
 Rover-0.024
 then-0.023
 thirteen-0.023
 alas-0.022
 unison-0.022
Token
Token ID198
Feature activation-7.26e-02

Pos logit contributions
 brushes+0.252
 designs+0.251
ies+0.250
working+0.249
 work+0.239
Neg logit contributions
 Rover-0.098
 then-0.088
Mr-0.083
 ï¿½-0.080
 thirteen-0.078
Token
Token ID198
Feature activation+4.76e-01

Pos logit contributions
 brushes+0.004
 designs+0.004
ies+0.004
ini+0.004
working+0.004
Neg logit contributions
 Rover-0.003
 then-0.003
 thirteen-0.002
 alas-0.002
 unison-0.002
TokenMom
Token ID29252
Feature activation-1.16e+00

Pos logit contributions
 Rover+0.016
 mercy+0.015
 thirteen+0.015
 then+0.015
rified+0.014
Neg logit contributions
 brushes-0.028
ini-0.027
 designs-0.026
ies-0.026
rolled-0.026
Token two
Token ID734
Feature activation-1.69e+00

Pos logit contributions
 brushes+0.044
 designs+0.043
ies+0.043
working+0.040
ini+0.040
Neg logit contributions
 Rover-0.040
 then-0.039
 thirteen-0.038
 alas-0.037
 unison-0.037
Token friends
Token ID2460
Feature activation-2.76e+00

Pos logit contributions
 brushes+0.073
ies+0.070
 designs+0.070
working+0.065
ini+0.064
Neg logit contributions
 Rover-0.086
 then-0.084
 thirteen-0.082
 alas-0.081
 unison-0.081
Token laughed
Token ID13818
Feature activation-5.33e-02

Pos logit contributions
 brushes+0.126
 designs+0.122
ies+0.119
ini+0.116
working+0.114
Neg logit contributions
 then-0.135
 Rover-0.133
 thirteen-0.127
 unison-0.124
 alas-0.123
Token with
Token ID351
Feature activation-1.55e-01

Pos logit contributions
ies+0.002
 designs+0.002
 brushes+0.002
 work+0.002
working+0.002
Neg logit contributions
 Rover-0.002
 mercy-0.002
 thirteen-0.002
 alas-0.002
Mr-0.002
Token g
Token ID308
Feature activation-6.84e-01

Pos logit contributions
 brushes+0.007
 designs+0.007
 work+0.007
ies+0.006
 artwork+0.006
Neg logit contributions
 alas-0.007
 thirteen-0.007
tons-0.007
 unfortunately-0.007
Mr-0.007
Tokenlee
Token ID7197
Feature activation-7.82e+00

Pos logit contributions
 brushes+0.032
 works+0.031
bel+0.031
packs+0.030
 collect+0.027
Neg logit contributions
 thirteen-0.030
der-0.030
 unison-0.028
 Rover-0.028
 then-0.028
Token.
Token ID13
Feature activation-7.77e-01

Pos logit contributions
 brushes+0.403
 designs+0.390
ies+0.388
working+0.370
ini+0.365
Neg logit contributions
 Rover-0.328
 then-0.319
 thirteen-0.309
 alas-0.303
 unison-0.302
Token They
Token ID1119
Feature activation+6.57e-01

Pos logit contributions
 brushes+0.037
 designs+0.036
ies+0.034
rolled+0.034
ugs+0.033
Neg logit contributions
 mercy-0.034
 Rover-0.034
 thirteen-0.034
 then-0.033
 alas-0.033
Token had
Token ID550
Feature activation-1.78e+00

Pos logit contributions
 Rover+0.031
 then+0.031
 thirteen+0.030
 alas+0.029
 unison+0.029
Neg logit contributions
 brushes-0.030
 designs-0.029
ies-0.029
working-0.027
ini-0.027
Token found
Token ID1043
Feature activation-2.12e+00

Pos logit contributions
 brushes+0.096
 designs+0.092
ies+0.089
 work+0.087
working+0.085
Neg logit contributions
 Rover-0.069
 alas-0.068
tons-0.067
 then-0.067
 mercy-0.066
Token a
Token ID257
Feature activation+9.25e-02

Pos logit contributions
 brushes+0.112
 designs+0.109
ies+0.108
 work+0.102
working+0.101
Neg logit contributions
 Rover-0.085
 then-0.082
 thirteen-0.081
 unison-0.080
 alas-0.080
Token forth
Token ID6071
Feature activation-3.32e-01

Pos logit contributions
 Rover+0.012
 then+0.012
 thirteen+0.011
 unison+0.011
 alas+0.011
Neg logit contributions
 brushes-0.015
 designs-0.014
ies-0.014
working-0.014
 work-0.013
Token and
Token ID290
Feature activation-4.64e-01

Pos logit contributions
ies+0.016
 brushes+0.016
 designs+0.015
 work+0.015
working+0.015
Neg logit contributions
 Rover-0.015
 unison-0.015
 mercy-0.014
 thirteen-0.014
Mr-0.014
Token laughed
Token ID13818
Feature activation-5.80e-01

Pos logit contributions
 brushes+0.022
 designs+0.021
ies+0.021
working+0.020
 work+0.020
Neg logit contributions
 Rover-0.021
 thirteen-0.020
 unison-0.020
 then-0.020
 alas-0.020
Token with
Token ID351
Feature activation-3.63e-01

Pos logit contributions
 designs+0.025
ies+0.024
 brushes+0.023
 work+0.023
bel+0.022
Neg logit contributions
 Rover-0.029
 thirteen-0.028
 mercy-0.028
Mr-0.027
 unison-0.027
Token g
Token ID308
Feature activation-9.12e-01

Pos logit contributions
 brushes+0.018
 designs+0.018
ies+0.017
 work+0.017
working+0.016
Neg logit contributions
 Rover-0.015
 then-0.015
 thirteen-0.015
 alas-0.015
tons-0.014
Tokenlee
Token ID7197
Feature activation-7.81e+00

Pos logit contributions
 brushes+0.039
 works+0.039
bel+0.039
packs+0.036
bows+0.035
Neg logit contributions
der-0.043
 thirteen-0.043
 unison-0.042
 Rover-0.042
aw-0.041
Token.
Token ID13
Feature activation-6.92e-01

Pos logit contributions
 brushes+0.364
ies+0.357
 designs+0.354
working+0.338
 work+0.329
Neg logit contributions
 Rover-0.360
 then-0.348
 thirteen-0.345
 alas-0.341
 unison-0.339
Token When
Token ID1649
Feature activation-7.56e-01

Pos logit contributions
 brushes+0.036
 designs+0.035
rolled+0.033
ies+0.033
working+0.033
Neg logit contributions
 mercy-0.027
 then-0.026
 Rover-0.026
 thirteen-0.026
 alas-0.026
Token he
Token ID339
Feature activation+1.03e+00

Pos logit contributions
 work+0.038
 brushes+0.037
ies+0.036
 designs+0.036
working+0.036
Neg logit contributions
 unison-0.030
 thirteen-0.029
 mercy-0.028
Did-0.027
 alas-0.027
Token tired
Token ID10032
Feature activation-3.82e-01

Pos logit contributions
 then+0.055
 Rover+0.055
 thirteen+0.052
 alas+0.051
 unison+0.051
Neg logit contributions
 brushes-0.043
 designs-0.042
ies-0.041
working-0.039
ini-0.038
Token of
Token ID286
Feature activation-2.33e+00

Pos logit contributions
 brushes+0.021
ies+0.020
 designs+0.020
working+0.019
 work+0.019
Neg logit contributions
 Rover-0.015
 then-0.014
 thirteen-0.014
 unison-0.014
 alas-0.014
TokenThe
Token ID464
Feature activation+1.82e-01

Pos logit contributions
 Rover+0.016
 mercy+0.016
 thirteen+0.015
 then+0.015
 alas+0.015
Neg logit contributions
 brushes-0.027
ies-0.025
ini-0.025
 designs-0.025
bel-0.024
Token princess
Token ID21752
Feature activation-7.17e-01

Pos logit contributions
 then+0.009
 Rover+0.008
 unison+0.008
 thirteen+0.008
 alas+0.008
Neg logit contributions
 brushes-0.008
ies-0.008
 designs-0.008
working-0.008
ugs-0.008
Token laughed
Token ID13818
Feature activation-1.10e-02

Pos logit contributions
 brushes+0.034
 designs+0.033
ugs+0.032
working+0.032
bows+0.031
Neg logit contributions
 then-0.036
 Rover-0.031
 unison-0.031
 alas-0.030
 thirteen-0.030
Token with
Token ID351
Feature activation-1.90e-01

Pos logit contributions
ies+0.000
 designs+0.000
 brushes+0.000
 work+0.000
bel+0.000
Neg logit contributions
 mercy-0.001
 Rover-0.001
 thirteen-0.001
 unison-0.001
Mr-0.001
Token g
Token ID308
Feature activation-7.52e-01

Pos logit contributions
 brushes+0.009
 designs+0.009
 work+0.009
ies+0.008
 artwork+0.008
Neg logit contributions
 thirteen-0.008
 Rover-0.008
 alas-0.008
Mr-0.008
tons-0.008
Tokenlee
Token ID7197
Feature activation-7.76e+00

Pos logit contributions
 works+0.037
bel+0.037
 brushes+0.034
packs+0.034
working+0.033
Neg logit contributions
der-0.032
 unison-0.030
 thirteen-0.030
aw-0.030
 then-0.029
Token.
Token ID13
Feature activation-6.13e-01

Pos logit contributions
 brushes+0.406
 designs+0.393
ies+0.391
working+0.373
ini+0.369
Neg logit contributions
 Rover-0.321
 then-0.313
 thirteen-0.301
 alas-0.296
 unison-0.295
Token She
Token ID1375
Feature activation-7.23e-01

Pos logit contributions
ies+0.034
 brushes+0.033
 designs+0.032
working+0.032
ini+0.032
Neg logit contributions
 Rover-0.024
 then-0.024
 thirteen-0.022
 ï¿½-0.022
 alas-0.022
Token pointed
Token ID6235
Feature activation+4.70e-01

Pos logit contributions
 brushes+0.034
 designs+0.033
ugs+0.033
ies+0.033
working+0.033
Neg logit contributions
 then-0.036
 Rover-0.033
 unison-0.031
 thirteen-0.030
 alas-0.030
Token and
Token ID290
Feature activation+3.47e-01

Pos logit contributions
 Rover+0.023
 thirteen+0.022
 unison+0.022
 mercy+0.022
 then+0.022
Neg logit contributions
 brushes-0.021
ies-0.020
 designs-0.020
 work-0.019
working-0.018
Token pointed
Token ID6235
Feature activation+6.48e-01

Pos logit contributions
 then+0.016
 Rover+0.015
 louder+0.014
 thirteen+0.013
 alas+0.013
Neg logit contributions
ini-0.017
 brushes-0.017
 designs-0.017
working-0.017
ies-0.017
Token hugged
Token ID41130
Feature activation-1.62e+00

Pos logit contributions
 brushes+0.054
 designs+0.053
ies+0.053
ini+0.052
working+0.051
Neg logit contributions
 then-0.050
 Rover-0.049
 unison-0.045
 thirteen-0.044
 louder-0.044
Token each
Token ID1123
Feature activation+2.86e-01

Pos logit contributions
 brushes+0.087
 designs+0.087
ies+0.086
 work+0.082
 creations+0.080
Neg logit contributions
 thirteen-0.065
 Rover-0.064
Mr-0.060
 mercy-0.060
 alas-0.060
Token other
Token ID584
Feature activation+7.07e-01

Pos logit contributions
 Rover+0.012
 thirteen+0.011
 then+0.011
 alas+0.011
tons+0.011
Neg logit contributions
 brushes-0.015
ies-0.015
 designs-0.015
 work-0.014
working-0.014
Token with
Token ID351
Feature activation+2.39e-01

Pos logit contributions
 Rover+0.035
 thirteen+0.034
 mercy+0.034
 alas+0.033
Mr+0.033
Neg logit contributions
ies-0.030
 designs-0.029
 brushes-0.029
 work-0.029
working-0.027
Token g
Token ID308
Feature activation-7.17e-01

Pos logit contributions
 thirteen+0.011
 Rover+0.011
 alas+0.011
tons+0.011
Mr+0.011
Neg logit contributions
 brushes-0.011
 designs-0.011
 work-0.010
ies-0.010
 artwork-0.010
Tokenlee
Token ID7197
Feature activation-7.76e+00

Pos logit contributions
 works+0.030
 brushes+0.030
bel+0.030
 collect+0.030
working+0.028
Neg logit contributions
der-0.033
 unison-0.033
 then-0.031
 thirteen-0.030
 louder-0.029
Token.
Token ID13
Feature activation-9.86e-01

Pos logit contributions
 brushes+0.383
 designs+0.371
ies+0.369
working+0.351
ini+0.345
Neg logit contributions
 Rover-0.342
 then-0.332
 thirteen-0.323
 alas-0.318
 unison-0.316
Token
Token ID198
Feature activation-1.67e-01

Pos logit contributions
 brushes+0.057
 designs+0.056
ies+0.056
working+0.053
ini+0.053
Neg logit contributions
 Rover-0.035
 then-0.034
 thirteen-0.033
 alas-0.032
 unison-0.032
Token
Token ID198
Feature activation+4.10e-01

Pos logit contributions
 brushes+0.010
 designs+0.009
ies+0.009
working+0.009
ini+0.009
Neg logit contributions
 Rover-0.006
 then-0.006
 thirteen-0.006
 alas-0.006
 mercy-0.005
TokenAfter
Token ID3260
Feature activation+1.90e+00

Pos logit contributions
 Rover+0.014
 thirteen+0.013
 mercy+0.013
tons+0.013
 then+0.013
Neg logit contributions
 brushes-0.024
 designs-0.023
ies-0.023
ini-0.022
bel-0.022
Token playing
Token ID2712
Feature activation-3.03e+00

Pos logit contributions
 Rover+0.074
 thirteen+0.074
 alas+0.070
Mr+0.070
 mercy+0.069
Neg logit contributions
 brushes-0.101
 designs-0.099
 work-0.095
ies-0.095
working-0.092
Token toys
Token ID14958
Feature activation-1.83e+00

Pos logit contributions
 brushes+0.056
 designs+0.053
ies+0.053
working+0.050
ini+0.049
Neg logit contributions
 Rover-0.068
 then-0.067
 thirteen-0.065
 alas-0.064
 unison-0.064
Token and
Token ID290
Feature activation-1.60e+00

Pos logit contributions
 brushes+0.099
 designs+0.097
ies+0.095
ini+0.095
working+0.092
Neg logit contributions
 then-0.072
 Rover-0.071
 thirteen-0.066
 unison-0.063
 alas-0.063
Token puzzles
Token ID24367
Feature activation-1.39e+00

Pos logit contributions
 brushes+0.065
ies+0.063
 designs+0.062
ini+0.060
working+0.060
Neg logit contributions
 Rover-0.084
 then-0.083
 thirteen-0.080
 unison-0.079
 alas-0.078
Token and
Token ID290
Feature activation-2.33e+00

Pos logit contributions
 brushes+0.070
 designs+0.068
ies+0.067
working+0.064
ini+0.064
Neg logit contributions
 Rover-0.060
 then-0.059
 thirteen-0.057
 alas-0.055
 unison-0.055
Token cr
Token ID1067
Feature activation-2.57e+00

Pos logit contributions
 brushes+0.095
ies+0.092
 designs+0.092
ini+0.090
working+0.088
Neg logit contributions
 then-0.121
 Rover-0.120
 thirteen-0.116
 unison-0.113
 mercy-0.112
Tokenay
Token ID323
Feature activation-7.66e+00

Pos logit contributions
working+0.123
 designs+0.120
ies+0.117
 brushes+0.115
 works+0.114
Neg logit contributions
 thirteen-0.111
 unison-0.105
 Rover-0.104
 then-0.103
Mr-0.103
Tokenons
Token ID684
Feature activation-3.54e+00

Pos logit contributions
working+0.264
ini+0.262
bel+0.257
rolled+0.254
 complexion+0.239
Neg logit contributions
 course-0.430
 then-0.415
 thirteen-0.408
 Rover-0.404
 more-0.403
Token.
Token ID13
Feature activation-1.23e+00

Pos logit contributions
 brushes+0.173
 designs+0.167
ies+0.167
working+0.158
 work+0.155
Neg logit contributions
 Rover-0.158
 then-0.154
 thirteen-0.150
 alas-0.148
 unison-0.147
Token There
Token ID1318
Feature activation-2.49e+00

Pos logit contributions
 brushes+0.061
 designs+0.058
ies+0.056
packs+0.054
working+0.053
Neg logit contributions
 Rover-0.053
 then-0.053
 mercy-0.052
 thirteen-0.052
 alas-0.052
Token are
Token ID389
Feature activation+1.52e+00

Pos logit contributions
 brushes+0.126
 designs+0.125
ini+0.123
ies+0.122
working+0.120
Neg logit contributions
 Rover-0.104
 then-0.103
 thirteen-0.096
 alas-0.095
tons-0.094
Token hats
Token ID23910
Feature activation-1.21e+00

Pos logit contributions
 Rover+0.076
 then+0.072
 alas+0.072
 unison+0.072
tons+0.070
Neg logit contributions
 brushes-0.068
 designs-0.064
ies-0.063
 work-0.059
working-0.058
Token knowing
Token ID6970
Feature activation-1.19e+00

Pos logit contributions
 brushes+0.001
 designs+0.001
ies+0.001
working+0.000
ini+0.000
Neg logit contributions
 Rover-0.000
 then-0.000
 thirteen-0.000
 alas-0.000
 unison-0.000
Token that
Token ID326
Feature activation-1.69e+00

Pos logit contributions
 brushes+0.071
ies+0.070
 designs+0.069
 work+0.068
working+0.065
Neg logit contributions
 thirteen-0.041
 Rover-0.038
 unison-0.038
Mr-0.037
rified-0.035
Token even
Token ID772
Feature activation-3.24e-01

Pos logit contributions
 brushes+0.083
 designs+0.080
ies+0.078
 work+0.075
working+0.075
Neg logit contributions
 Rover-0.075
 thirteen-0.074
 unison-0.073
 then-0.072
Mr-0.070
Token the
Token ID262
Feature activation-1.81e+00

Pos logit contributions
 brushes+0.020
 designs+0.019
ies+0.019
working+0.018
 work+0.018
Neg logit contributions
 Rover-0.011
 then-0.010
 thirteen-0.010
 unison-0.010
 alas-0.010
Token t
Token ID256
Feature activation-1.89e+00

Pos logit contributions
 brushes+0.075
ies+0.072
 designs+0.072
working+0.068
ini+0.067
Neg logit contributions
 Rover-0.094
 then-0.094
 thirteen-0.091
 alas-0.089
 unison-0.089
Tokenini
Token ID5362
Feature activation-7.64e+00

Pos logit contributions
 brushes+0.087
 works+0.083
 designs+0.082
 work+0.082
working+0.077
Neg logit contributions
 then-0.088
 alas-0.080
 Rover-0.079
Mr-0.078
tons-0.077
Tokenest
Token ID395
Feature activation-2.08e+00

Pos logit contributions
 brushes+0.332
packs+0.310
 designs+0.309
rolled+0.301
bel+0.296
Neg logit contributions
 then-0.403
 Rover-0.373
 course-0.368
 alas-0.362
rified-0.361
Token and
Token ID290
Feature activation-9.74e-01

Pos logit contributions
 brushes+0.091
 designs+0.087
working+0.086
ies+0.085
rolled+0.084
Neg logit contributions
 then-0.103
 Rover-0.099
 alas-0.095
 unison-0.095
 thirteen-0.094
Token darkest
Token ID35995
Feature activation-8.67e-02

Pos logit contributions
 brushes+0.039
ies+0.038
 designs+0.037
working+0.036
ugs+0.036
Neg logit contributions
 then-0.051
 Rover-0.051
 thirteen-0.049
 alas-0.048
tons-0.047
Token of
Token ID286
Feature activation-1.94e+00

Pos logit contributions
 brushes+0.005
 designs+0.004
ies+0.004
working+0.004
ini+0.004
Neg logit contributions
 then-0.004
 Rover-0.003
 unison-0.003
 alas-0.003
 thirteen-0.003
Token gardens
Token ID25476
Feature activation-2.39e+00

Pos logit contributions
 brushes+0.090
ies+0.086
 designs+0.085
working+0.083
ini+0.082
Neg logit contributions
 then-0.093
 Rover-0.093
 thirteen-0.087
 unison-0.086
 alas-0.086
Token stopped
Token ID5025
Feature activation-4.99e-01

Pos logit contributions
 then+0.075
 Rover+0.074
 thirteen+0.070
 alas+0.069
 louder+0.068
Neg logit contributions
 brushes-0.072
 designs-0.071
ini-0.070
ies-0.067
ugs-0.067
Token.
Token ID13
Feature activation-3.00e-01

Pos logit contributions
 brushes+0.025
ies+0.024
 designs+0.024
working+0.023
 work+0.022
Neg logit contributions
 Rover-0.022
 then-0.021
 thirteen-0.021
 unison-0.021
 alas-0.020
Token
Token ID198
Feature activation+2.91e-01

Pos logit contributions
ies+0.019
 designs+0.018
 brushes+0.017
ini+0.017
working+0.017
Neg logit contributions
 then-0.010
 Rover-0.010
 ï¿½-0.010
 alas-0.009
 thirteen-0.009
Token
Token ID198
Feature activation+7.44e-01

Pos logit contributions
 Rover+0.009
 then+0.009
 thirteen+0.008
 alas+0.008
Mr+0.008
Neg logit contributions
 designs-0.019
 brushes-0.019
ies-0.018
working-0.018
 creations-0.017
TokenKay
Token ID37247
Feature activation-3.91e+00

Pos logit contributions
 Rover+0.024
 then+0.024
 thirteen+0.022
 alas+0.022
 unison+0.021
Neg logit contributions
 designs-0.046
 brushes-0.046
ies-0.045
working-0.043
 creations-0.043
Tokenlee
Token ID7197
Feature activation-7.62e+00

Pos logit contributions
 designs+0.164
 brushes+0.162
working+0.155
 work+0.150
 works+0.150
Neg logit contributions
 then-0.213
 Rover-0.193
 thirteen-0.186
 unison-0.184
 alas-0.181
Token was
Token ID373
Feature activation-2.12e-01

Pos logit contributions
 brushes+0.335
 designs+0.321
ies+0.320
working+0.302
 work+0.296
Neg logit contributions
 Rover-0.378
 then-0.366
 thirteen-0.360
 alas-0.354
 unison-0.352
Token excited
Token ID6568
Feature activation-1.09e+00

Pos logit contributions
 brushes+0.012
ies+0.012
 designs+0.011
 work+0.011
rolled+0.011
Neg logit contributions
 Rover-0.007
 unison-0.007
Mr-0.007
 thirteen-0.007
 mercy-0.007
Token and
Token ID290
Feature activation-1.91e-02

Pos logit contributions
ies+0.050
 brushes+0.049
 designs+0.048
 work+0.047
working+0.045
Neg logit contributions
 Rover-0.053
 thirteen-0.051
 unison-0.051
 alas-0.049
Mr-0.049
Token said
Token ID531
Feature activation+1.77e-01

Pos logit contributions
 brushes+0.001
 designs+0.001
ini+0.001
ugs+0.001
ies+0.001
Neg logit contributions
 then-0.001
 Rover-0.001
 alas-0.001
 louder-0.001
 unfortunately-0.001
Token,
Token ID11
Feature activation+2.39e-01

Pos logit contributions
 Rover+0.008
 unison+0.007
 then+0.007
 thirteen+0.007
Mr+0.007
Neg logit contributions
 brushes-0.009
ies-0.008
 designs-0.008
working-0.008
 work-0.008
Token a
Token ID257
Feature activation+3.99e-01

Pos logit contributions
 brushes+0.024
 designs+0.023
ies+0.023
working+0.021
 work+0.021
Neg logit contributions
 Rover-0.027
 then-0.027
 thirteen-0.026
 unison-0.026
 alas-0.026
Token chain
Token ID6333
Feature activation-8.47e-01

Pos logit contributions
 Rover+0.020
 then+0.020
 mercy+0.019
 louder+0.019
tons+0.019
Neg logit contributions
ies-0.017
 brushes-0.016
 designs-0.016
ini-0.016
ugs-0.016
Token.
Token ID13
Feature activation+3.38e-01

Pos logit contributions
 brushes+0.047
 designs+0.046
ies+0.045
ini+0.044
rolled+0.043
Neg logit contributions
 then-0.034
 Rover-0.032
 louder-0.030
 more-0.030
 no-0.029
Token Everywhere
Token ID48201
Feature activation-2.11e+00

Pos logit contributions
 mercy+0.017
 thirteen+0.017
 Rover+0.017
 unison+0.016
 then+0.016
Neg logit contributions
 brushes-0.015
 designs-0.014
ies-0.014
 work-0.013
rolled-0.013
Token N
Token ID399
Feature activation-2.35e+00

Pos logit contributions
 brushes+0.088
ies+0.085
 designs+0.083
working+0.079
 work+0.078
Neg logit contributions
 thirteen-0.110
 unison-0.108
 Rover-0.108
 mercy-0.107
 alas-0.104
Tokenyl
Token ID2645
Feature activation-7.53e+00

Pos logit contributions
working+0.109
 designs+0.107
 work+0.106
 works+0.103
 creations+0.103
Neg logit contributions
 then-0.097
aw-0.092
Where-0.090
Please-0.089
 Please-0.087
Tokena
Token ID64
Feature activation-1.68e+00

Pos logit contributions
 designs+0.351
 brushes+0.340
 work+0.338
ugs+0.336
working+0.321
Neg logit contributions
 then-0.369
aw-0.309
Please-0.301
 course-0.298
 who-0.295
Token looked
Token ID3114
Feature activation-2.80e+00

Pos logit contributions
 designs+0.077
ugs+0.075
bows+0.075
 brushes+0.074
 disl+0.074
Neg logit contributions
 then-0.086
Please-0.073
 Rover-0.072
 Please-0.069
'-0.069
Token,
Token ID11
Feature activation-5.28e-01

Pos logit contributions
 brushes+0.162
 designs+0.161
ini+0.155
rolled+0.152
bows+0.152
Neg logit contributions
 then-0.106
 Rover-0.100
tons-0.091
 alas-0.089
Please-0.086
Token she
Token ID673
Feature activation-5.92e-01

Pos logit contributions
 brushes+0.028
ini+0.028
 designs+0.028
ies+0.027
working+0.027
Neg logit contributions
 then-0.021
 Rover-0.021
tons-0.019
 alas-0.019
 thirteen-0.019
Token saw
Token ID2497
Feature activation-3.44e-02

Pos logit contributions
 brushes+0.027
 designs+0.026
ies+0.025
ugs+0.024
bows+0.024
Neg logit contributions
 then-0.030
 Rover-0.028
 alas-0.027
tons-0.026
 thirteen-0.026
Token her
Token ID607
Feature activation-2.17e+00

Pos logit contributions
 Rover+0.040
 then+0.039
 thirteen+0.037
 alas+0.036
 unison+0.036
Neg logit contributions
 brushes-0.054
 designs-0.052
ies-0.052
working-0.050
ini-0.049
Token.
Token ID13
Feature activation-1.62e-01

Pos logit contributions
 brushes+0.114
 designs+0.110
ies+0.108
working+0.105
ini+0.104
Neg logit contributions
 Rover-0.091
 then-0.089
 thirteen-0.084
 alas-0.084
 unison-0.083
Token
Token ID198
Feature activation+1.16e-01

Pos logit contributions
 brushes+0.009
ies+0.009
 designs+0.009
working+0.008
ini+0.008
Neg logit contributions
 Rover-0.006
 then-0.006
 thirteen-0.006
 alas-0.006
 unison-0.006
Token
Token ID198
Feature activation+5.37e-01

Pos logit contributions
 Rover+0.004
 then+0.004
 thirteen+0.003
 alas+0.003
 unison+0.003
Neg logit contributions
 brushes-0.007
 designs-0.007
ies-0.007
working-0.007
ini-0.007
TokenKay
Token ID37247
Feature activation-3.98e+00

Pos logit contributions
 Rover+0.027
 then+0.026
 thirteen+0.025
 alas+0.025
 unison+0.025
Neg logit contributions
 brushes-0.024
 designs-0.023
ies-0.023
working-0.021
ini-0.021
Tokenlee
Token ID7197
Feature activation-7.52e+00

Pos logit contributions
 brushes+0.153
 designs+0.153
working+0.146
 works+0.139
 work+0.139
Neg logit contributions
 then-0.230
 Rover-0.213
 thirteen-0.202
 unison-0.200
 alas-0.198
Token was
Token ID373
Feature activation-1.71e-01

Pos logit contributions
 brushes+0.327
ies+0.314
 designs+0.311
working+0.295
 work+0.292
Neg logit contributions
 Rover-0.370
 thirteen-0.357
 then-0.354
 alas-0.350
 mercy-0.349
Token too
Token ID1165
Feature activation-1.30e+00

Pos logit contributions
 brushes+0.009
ies+0.009
 designs+0.009
 work+0.009
working+0.008
Neg logit contributions
 Rover-0.006
 thirteen-0.006
 unison-0.006
Mr-0.006
 alas-0.006
Token scared
Token ID12008
Feature activation-6.35e-01

Pos logit contributions
 brushes+0.069
 designs+0.068
 creations+0.063
ies+0.062
ini+0.061
Neg logit contributions
 Rover-0.059
 then-0.057
 louder-0.052
 thirteen-0.052
Mr-0.051
Token to
Token ID284
Feature activation-1.14e+00

Pos logit contributions
 brushes+0.026
ies+0.026
 designs+0.025
 work+0.024
working+0.024
Neg logit contributions
 Rover-0.033
 unison-0.033
 thirteen-0.032
 alas-0.031
 then-0.031
Token move
Token ID1445
Feature activation+9.96e-01

Pos logit contributions
 brushes+0.048
 designs+0.046
ies+0.045
working+0.043
ini+0.042
Neg logit contributions
 Rover-0.059
 then-0.058
 thirteen-0.056
 alas-0.056
 unison-0.055
Token jumped
Token ID11687
Feature activation+1.52e-01

Pos logit contributions
 then+0.038
 Rover+0.035
 unison+0.032
 thirteen+0.032
Please+0.031
Neg logit contributions
 brushes-0.038
 designs-0.037
ini-0.036
ugs-0.036
working-0.036
Token and
Token ID290
Feature activation+4.78e-01

Pos logit contributions
 Rover+0.007
 mercy+0.007
 unison+0.007
 alas+0.007
 thirteen+0.007
Neg logit contributions
 brushes-0.007
ies-0.007
 designs-0.007
 work-0.006
working-0.006
Token laughed
Token ID13818
Feature activation+1.42e-01

Pos logit contributions
 then+0.024
 Rover+0.022
 louder+0.021
 more+0.019
 thirteen+0.019
Neg logit contributions
ini-0.024
ugs-0.023
 brushes-0.023
 designs-0.023
ies-0.022
Token with
Token ID351
Feature activation+2.46e-01

Pos logit contributions
 Rover+0.007
 mercy+0.007
 thirteen+0.007
 unison+0.007
Mr+0.007
Neg logit contributions
 designs-0.006
ies-0.006
 work-0.006
 brushes-0.006
bel-0.005
Token g
Token ID308
Feature activation-4.64e-01

Pos logit contributions
 Rover+0.011
 thirteen+0.010
 then+0.010
 alas+0.010
tons+0.010
Neg logit contributions
 brushes-0.012
 designs-0.012
ies-0.012
 work-0.012
working-0.011
Tokenlee
Token ID7197
Feature activation-7.49e+00

Pos logit contributions
 brushes+0.021
packs+0.020
 works+0.020
bel+0.020
bows+0.018
Neg logit contributions
der-0.021
 thirteen-0.020
aw-0.020
 then-0.020
 unison-0.020
Token.
Token ID13
Feature activation-4.90e-01

Pos logit contributions
 brushes+0.366
 designs+0.355
ies+0.354
working+0.336
 work+0.329
Neg logit contributions
 Rover-0.333
 then-0.324
 thirteen-0.316
 alas-0.311
 unison-0.309
Token
Token ID198
Feature activation+3.22e-01

Pos logit contributions
 brushes+0.030
ies+0.030
 designs+0.029
working+0.028
ini+0.028
Neg logit contributions
 Rover-0.016
 then-0.016
 thirteen-0.015
 alas-0.014
 unison-0.014
Token
Token ID198
Feature activation+8.87e-01

Pos logit contributions
 Rover+0.012
 then+0.011
 thirteen+0.011
 alas+0.011
 unison+0.011
Neg logit contributions
 brushes-0.018
 designs-0.018
ies-0.018
working-0.017
ini-0.017
TokenAs
Token ID1722
Feature activation-2.67e-01

Pos logit contributions
 Rover+0.029
 mercy+0.027
 thirteen+0.026
 then+0.026
 unison+0.026
Neg logit contributions
 brushes-0.053
 designs-0.052
ies-0.051
working-0.051
ini-0.050
Token he
Token ID339
Feature activation+1.30e-01

Pos logit contributions
 work+0.012
working+0.012
ies+0.012
 brushes+0.012
 designs+0.012
Neg logit contributions
 unison-0.012
 thirteen-0.012
 mercy-0.012
 alas-0.011
rified-0.011
Token pies
Token ID47199
Feature activation-3.21e-02

Pos logit contributions
working+0.020
ini+0.019
 brushes+0.018
ouses+0.018
packs+0.018
Neg logit contributions
 then-0.023
 more-0.023
 louder-0.022
 thirteen-0.021
 less-0.021
Token and
Token ID290
Feature activation-5.95e-01

Pos logit contributions
 brushes+0.002
ini+0.002
 designs+0.002
working+0.002
ies+0.002
Neg logit contributions
 then-0.001
 Rover-0.001
 thirteen-0.001
 more-0.001
 louder-0.001
Token laughed
Token ID13818
Feature activation-1.78e-01

Pos logit contributions
ini+0.026
ugs+0.025
ies+0.024
working+0.024
 brushes+0.024
Neg logit contributions
 then-0.032
 thirteen-0.028
 Rover-0.028
 more-0.027
 louder-0.027
Token with
Token ID351
Feature activation-3.91e-02

Pos logit contributions
 brushes+0.009
 designs+0.009
ies+0.009
 work+0.008
working+0.008
Neg logit contributions
 Rover-0.008
 thirteen-0.007
 then-0.007
 alas-0.007
 mercy-0.007
Token g
Token ID308
Feature activation-5.04e-01

Pos logit contributions
ies+0.002
 brushes+0.002
working+0.002
 designs+0.002
ini+0.002
Neg logit contributions
 Rover-0.002
 then-0.002
 unison-0.002
 thirteen-0.001
 alas-0.001
Tokenlee
Token ID7197
Feature activation-7.48e+00

Pos logit contributions
bel+0.023
 works+0.022
packs+0.022
bows+0.022
gy+0.021
Neg logit contributions
der-0.022
 then-0.021
aw-0.020
 thirteen-0.020
 unison-0.020
Token.
Token ID13
Feature activation-5.20e-01

Pos logit contributions
 brushes+0.399
 designs+0.387
ini+0.378
ies+0.375
working+0.364
Neg logit contributions
 then-0.300
 Rover-0.297
 thirteen-0.270
 unison-0.265
 alas-0.260
Token
Token ID220
Feature activation-3.51e+00

Pos logit contributions
 brushes+0.029
 designs+0.028
ies+0.028
working+0.027
ini+0.027
Neg logit contributions
 Rover-0.020
 then-0.019
 thirteen-0.018
 alas-0.018
 unison-0.018
Token
Token ID198
Feature activation+4.77e-02

Pos logit contributions
 designs+0.258
ies+0.257
working+0.254
 brushes+0.250
ini+0.248
Neg logit contributions
 Rover-0.079
 then-0.073
Mr-0.064
 ï¿½-0.064
tons-0.056
Token
Token ID198
Feature activation+5.60e-01

Pos logit contributions
 Rover+0.002
 then+0.002
 thirteen+0.002
 alas+0.002
 mercy+0.002
Neg logit contributions
 brushes-0.003
 designs-0.003
ies-0.003
working-0.002
ini-0.002
TokenBut
Token ID1537
Feature activation+1.72e+00

Pos logit contributions
 mercy+0.022
 Rover+0.022
 thirteen+0.021
 alas+0.020
rified+0.019
Neg logit contributions
 brushes-0.030
 designs-0.028
ies-0.028
bel-0.028
working-0.028
Token the
Token ID262
Feature activation+2.97e-01

Pos logit contributions
 brushes+0.015
 designs+0.015
ies+0.015
working+0.015
ini+0.015
Neg logit contributions
 Rover-0.009
 then-0.009
 thirteen-0.008
 alas-0.008
 mercy-0.008
Token redd
Token ID14688
Feature activation+6.93e-01

Pos logit contributions
 Rover+0.013
 then+0.013
 thirteen+0.013
 alas+0.012
tons+0.012
Neg logit contributions
 brushes-0.015
 designs-0.014
ies-0.014
ini-0.013
working-0.013
Tokenest
Token ID395
Feature activation-1.02e+00

Pos logit contributions
 Rover+0.038
 then+0.038
 thirteen+0.037
 alas+0.036
 mercy+0.035
Neg logit contributions
 brushes-0.027
 designs-0.025
ies-0.024
working-0.024
ini-0.023
Token and
Token ID290
Feature activation-6.30e-01

Pos logit contributions
 brushes+0.052
 designs+0.050
ini+0.049
ies+0.049
working+0.048
Neg logit contributions
 then-0.043
 Rover-0.041
 thirteen-0.040
 alas-0.039
tons-0.038
Token sh
Token ID427
Feature activation-1.74e+00

Pos logit contributions
 brushes+0.029
 designs+0.029
ies+0.028
ugs+0.028
working+0.027
Neg logit contributions
 then-0.030
 Rover-0.029
 thirteen-0.027
Mr-0.027
 alas-0.027
Tokenini
Token ID5362
Feature activation-7.47e+00

Pos logit contributions
 brushes+0.073
 designs+0.071
 work+0.068
packs+0.067
 works+0.066
Neg logit contributions
 alas-0.085
aw-0.085
 thirteen-0.085
 then-0.084
Mr-0.083
Tokenest
Token ID395
Feature activation-1.98e+00

Pos logit contributions
 brushes+0.312
 designs+0.294
packs+0.277
bel+0.276
ies+0.275
Neg logit contributions
 then-0.397
 Rover-0.384
 thirteen-0.367
 alas-0.362
 mercy-0.359
Token apples
Token ID22514
Feature activation+5.65e-01

Pos logit contributions
 brushes+0.090
 designs+0.086
working+0.084
ies+0.084
ini+0.084
Neg logit contributions
 then-0.093
 Rover-0.092
 thirteen-0.087
 alas-0.086
 mercy-0.086
Token they
Token ID484
Feature activation+1.45e-01

Pos logit contributions
 Rover+0.023
 then+0.022
 thirteen+0.022
 alas+0.021
 unison+0.021
Neg logit contributions
 brushes-0.030
 designs-0.029
ies-0.029
working-0.027
ini-0.027
Token could
Token ID714
Feature activation-1.05e+00

Pos logit contributions
 then+0.006
Please+0.005
 Rover+0.005
 no+0.005
Mr+0.005
Neg logit contributions
 brushes-0.008
ugs-0.008
 designs-0.008
packs-0.008
ies-0.008
Token find
Token ID1064
Feature activation-1.31e+00

Pos logit contributions
 brushes+0.050
 designs+0.048
ini+0.047
ies+0.047
ugs+0.046
Neg logit contributions
 then-0.051
 Rover-0.045
 no-0.044
 end-0.044
 thirteen-0.044
Token and
Token ID290
Feature activation-9.16e-03

Pos logit contributions
 brushes+0.038
 designs+0.037
ies+0.037
working+0.035
 work+0.034
Neg logit contributions
 Rover-0.029
 then-0.027
 thirteen-0.027
 unison-0.026
 alas-0.026
Token spark
Token ID9009
Feature activation-2.74e+00

Pos logit contributions
 brushes+0.000
 designs+0.000
ies+0.000
working+0.000
ini+0.000
Neg logit contributions
 Rover-0.000
 then-0.000
 thirteen-0.000
 alas-0.000
tons-0.000
Tokenled
Token ID992
Feature activation-1.57e+00

Pos logit contributions
 brushes+0.101
 designs+0.094
ini+0.088
ies+0.087
working+0.087
Neg logit contributions
 then-0.157
 Rover-0.157
 alas-0.147
 thirteen-0.147
 unison-0.147
Token with
Token ID351
Feature activation+1.92e-01

Pos logit contributions
ies+0.073
 designs+0.073
 brushes+0.071
 work+0.069
bel+0.066
Neg logit contributions
 Rover-0.073
Mr-0.070
 thirteen-0.069
 unison-0.068
 mercy-0.067
Token g
Token ID308
Feature activation-5.21e-01

Pos logit contributions
 Rover+0.008
 then+0.008
 thirteen+0.007
 alas+0.007
 unison+0.007
Neg logit contributions
 brushes-0.010
 designs-0.010
ies-0.010
working-0.009
 work-0.009
Tokenlee
Token ID7197
Feature activation-7.46e+00

Pos logit contributions
 brushes+0.025
 works+0.023
bel+0.023
packs+0.022
working+0.022
Neg logit contributions
der-0.023
 unison-0.022
 thirteen-0.021
aw-0.021
 deeply-0.020
Token.
Token ID13
Feature activation-6.54e-01

Pos logit contributions
 brushes+0.376
 designs+0.362
ies+0.359
working+0.342
ini+0.341
Neg logit contributions
 Rover-0.324
 then-0.317
 thirteen-0.304
 alas-0.298
 unison-0.298
Token From
Token ID3574
Feature activation-1.15e+00

Pos logit contributions
 brushes+0.034
ies+0.034
 designs+0.033
working+0.032
ini+0.031
Neg logit contributions
 Rover-0.027
 then-0.027
 thirteen-0.025
 unison-0.025
 alas-0.025
Token then
Token ID788
Feature activation+8.96e-01

Pos logit contributions
 brushes+0.076
ies+0.076
 work+0.073
 designs+0.071
working+0.069
Neg logit contributions
 thirteen-0.033
 unison-0.029
 Rover-0.028
 alas-0.028
 mercy-0.027
Token on
Token ID319
Feature activation-1.24e+00

Pos logit contributions
 Rover+0.039
 then+0.038
 thirteen+0.037
 unison+0.037
 mercy+0.036
Neg logit contributions
 brushes-0.044
 designs-0.043
ies-0.043
working-0.041
 work-0.040
Token,
Token ID11
Feature activation-1.03e+00

Pos logit contributions
 brushes+0.056
ies+0.055
 designs+0.055
 work+0.052
working+0.052
Neg logit contributions
 thirteen-0.059
 Rover-0.058
 unison-0.057
 mercy-0.056
Mr-0.056
Token and
Token ID290
Feature activation-6.87e-01

Pos logit contributions
 brushes+0.017
 designs+0.017
ies+0.017
 work+0.016
working+0.016
Neg logit contributions
 Rover-0.018
 thirteen-0.017
 mercy-0.016
Mr-0.016
 alas-0.016
Token it
Token ID340
Feature activation-1.54e+00

Pos logit contributions
 brushes+0.036
ies+0.035
 designs+0.035
ini+0.033
working+0.033
Neg logit contributions
 then-0.028
 Rover-0.028
 thirteen-0.027
 alas-0.026
 unison-0.026
Token became
Token ID2627
Feature activation-1.07e+00

Pos logit contributions
ugs+0.078
 brushes+0.075
ini+0.074
ies+0.074
packs+0.074
Neg logit contributions
 then-0.071
 louder-0.063
 more-0.062
 thirteen-0.062
 no-0.061
Token the
Token ID262
Feature activation-5.56e-01

Pos logit contributions
 brushes+0.059
 designs+0.057
ies+0.057
working+0.054
 work+0.053
Neg logit contributions
 Rover-0.042
 then-0.040
 thirteen-0.039
 alas-0.038
 unison-0.038
Token sh
Token ID427
Feature activation-1.26e+00

Pos logit contributions
 brushes+0.025
 designs+0.024
 work+0.023
ies+0.023
 creations+0.022
Neg logit contributions
 Rover-0.029
 alas-0.026
 then-0.026
 unison-0.026
Mr-0.026
Tokenini
Token ID5362
Feature activation-7.42e+00

Pos logit contributions
 brushes+0.052
 designs+0.048
packs+0.048
rolled+0.043
 work+0.042
Neg logit contributions
 thirteen-0.066
 then-0.065
 alas-0.063
 unison-0.062
jected-0.062
Tokenest
Token ID395
Feature activation-1.47e+00

Pos logit contributions
 brushes+0.293
 designs+0.276
ies+0.266
working+0.256
packs+0.254
Neg logit contributions
 then-0.404
 Rover-0.395
 thirteen-0.386
 alas-0.378
 mercy-0.375
Token tower
Token ID10580
Feature activation-9.24e-01

Pos logit contributions
 brushes+0.072
 designs+0.069
ies+0.069
working+0.066
ini+0.065
Neg logit contributions
 then-0.065
 Rover-0.065
 thirteen-0.062
 unison-0.061
 alas-0.061
Token in
Token ID287
Feature activation-1.14e+00

Pos logit contributions
 brushes+0.049
 designs+0.047
ies+0.047
working+0.045
ini+0.045
Neg logit contributions
 then-0.037
 Rover-0.037
 thirteen-0.035
 alas-0.034
 unison-0.034
Token the
Token ID262
Feature activation+2.46e-01

Pos logit contributions
 brushes+0.062
 designs+0.060
ies+0.059
working+0.057
 work+0.057
Neg logit contributions
 Rover-0.045
 then-0.044
 thirteen-0.043
 alas-0.042
 mercy-0.041
Token whole
Token ID2187
Feature activation-1.67e+00

Pos logit contributions
 Rover+0.013
 then+0.012
 thirteen+0.012
 alas+0.012
 unison+0.012
Neg logit contributions
 brushes-0.010
 designs-0.010
ies-0.010
working-0.009
ini-0.009
Token room
Token ID2119
Feature activation-7.54e-01

Pos logit contributions
 then+0.060
 Rover+0.060
 unison+0.058
 thirteen+0.057
 alas+0.057
Neg logit contributions
 brushes-0.046
working-0.044
ies-0.043
 designs-0.043
bows-0.041
Token and
Token ID290
Feature activation+5.69e-01

Pos logit contributions
 brushes+0.040
 designs+0.039
ies+0.039
ini+0.038
working+0.038
Neg logit contributions
 then-0.030
 Rover-0.028
 thirteen-0.027
 unison-0.027
 alas-0.027
Token laughed
Token ID13818
Feature activation-1.10e-02

Pos logit contributions
 then+0.029
 Rover+0.027
 thirteen+0.025
 louder+0.025
 alas+0.024
Neg logit contributions
ini-0.026
ugs-0.026
ies-0.025
 brushes-0.025
 designs-0.025
Token with
Token ID351
Feature activation+1.84e-01

Pos logit contributions
 brushes+0.001
 designs+0.001
ies+0.001
working+0.000
 work+0.000
Neg logit contributions
 Rover-0.000
 thirteen-0.000
 then-0.000
 mercy-0.000
 alas-0.000
Token g
Token ID308
Feature activation-4.88e-01

Pos logit contributions
 Rover+0.007
 then+0.007
 thirteen+0.007
 unison+0.007
 alas+0.007
Neg logit contributions
 brushes-0.010
ies-0.010
 designs-0.010
working-0.009
ini-0.009
Tokenlee
Token ID7197
Feature activation-7.40e+00

Pos logit contributions
 works+0.022
 brushes+0.022
bel+0.021
 collect+0.021
packs+0.021
Neg logit contributions
der-0.022
 unison-0.021
 then-0.020
 thirteen-0.020
 Rover-0.019
Token.
Token ID13
Feature activation-4.75e-01

Pos logit contributions
 brushes+0.372
 designs+0.359
ies+0.355
working+0.338
ini+0.336
Neg logit contributions
 Rover-0.322
 then-0.316
 thirteen-0.302
 alas-0.297
 unison-0.296
Token The
Token ID383
Feature activation-6.51e-01

Pos logit contributions
 brushes+0.026
 designs+0.025
ies+0.024
working+0.023
ini+0.023
Neg logit contributions
 Rover-0.019
 then-0.018
 thirteen-0.018
 alas-0.017
 unison-0.017
Token little
Token ID1310
Feature activation-3.73e+00

Pos logit contributions
 brushes+0.030
 designs+0.029
ies+0.029
 work+0.028
working+0.027
Neg logit contributions
 Rover-0.031
 thirteen-0.030
 then-0.029
 alas-0.029
 unison-0.029
Token girl
Token ID2576
Feature activation-8.21e-01

Pos logit contributions
 brushes+0.186
 designs+0.178
ies+0.176
working+0.173
bel+0.170
Neg logit contributions
 then-0.161
 Rover-0.157
 thirteen-0.148
 alas-0.147
 unison-0.146
Token had
Token ID550
Feature activation-1.48e+00

Pos logit contributions
 brushes+0.037
 designs+0.036
ies+0.036
working+0.034
ini+0.034
Neg logit contributions
 then-0.040
 Rover-0.039
 thirteen-0.037
 unison-0.037
 alas-0.037
Token
Token ID198
Feature activation+8.36e-01

Pos logit contributions
 Rover+0.013
 then+0.013
 thirteen+0.012
 unison+0.012
 alas+0.012
Neg logit contributions
 brushes-0.025
 designs-0.024
ies-0.024
working-0.023
ini-0.023
TokenSarah
Token ID29284
Feature activation-2.91e+00

Pos logit contributions
 Rover+0.027
 then+0.026
 thirteen+0.025
 alas+0.025
 unison+0.025
Neg logit contributions
 brushes-0.051
 designs-0.050
ies-0.049
working-0.048
ini-0.047
Token smiled
Token ID13541
Feature activation+2.58e-01

Pos logit contributions
 brushes+0.139
 designs+0.137
ies+0.135
working+0.129
ini+0.128
Neg logit contributions
 then-0.137
 Rover-0.132
 thirteen-0.124
 unison-0.122
 alas-0.122
Token with
Token ID351
Feature activation+1.91e-01

Pos logit contributions
 thirteen+0.013
 Rover+0.013
 mercy+0.012
 course+0.012
 alas+0.012
Neg logit contributions
ies-0.011
 designs-0.011
 brushes-0.011
 work-0.010
bel-0.010
Token g
Token ID308
Feature activation-4.93e-01

Pos logit contributions
 Rover+0.008
 thirteen+0.008
 alas+0.008
tons+0.008
Mr+0.008
Neg logit contributions
 brushes-0.009
 designs-0.009
 work-0.009
ies-0.008
 artwork-0.008
Tokenlee
Token ID7197
Feature activation-7.38e+00

Pos logit contributions
bel+0.022
 brushes+0.020
 works+0.020
working+0.019
 collect+0.019
Neg logit contributions
 thirteen-0.023
 unison-0.023
der-0.023
 then-0.023
 Rover-0.022
Token,
Token ID11
Feature activation-2.50e-01

Pos logit contributions
 brushes+0.366
ies+0.354
 designs+0.352
working+0.337
 work+0.330
Neg logit contributions
 Rover-0.324
 thirteen-0.311
 then-0.309
 alas-0.304
 unison-0.302
Token and
Token ID290
Feature activation-6.95e-01

Pos logit contributions
 brushes+0.013
 designs+0.013
ies+0.013
working+0.012
 work+0.012
Neg logit contributions
 Rover-0.010
 thirteen-0.010
 then-0.010
 mercy-0.009
 unison-0.009
Token started
Token ID2067
Feature activation-1.20e+00

Pos logit contributions
 brushes+0.036
ini+0.036
 designs+0.036
ies+0.035
working+0.035
Neg logit contributions
 then-0.031
 Rover-0.028
 louder-0.026
 alas-0.025
 unfortunately-0.025
Token to
Token ID284
Feature activation-6.91e-01

Pos logit contributions
ini+0.060
 brushes+0.059
ies+0.059
 designs+0.057
bows+0.056
Neg logit contributions
 then-0.052
 Rover-0.051
 louder-0.048
 ï¿½-0.047
 thirteen-0.045
Token wonder
Token ID4240
Feature activation-1.69e-01

Pos logit contributions
ugs+0.030
ies+0.029
ini+0.028
 brushes+0.028
 designs+0.027
Neg logit contributions
 then-0.038
 Rover-0.035
 unison-0.034
tons-0.033
 thirteen-0.033
Token all
Token ID477
Feature activation-3.24e+00

Pos logit contributions
 then+0.065
 Rover+0.064
 thirteen+0.060
 unison+0.059
 alas+0.059
Neg logit contributions
 brushes-0.069
 designs-0.066
ies-0.065
ini-0.064
ugs-0.062
Token gigg
Token ID30442
Feature activation-1.77e+00

Pos logit contributions
 brushes+0.161
 designs+0.152
ugs+0.150
ini+0.149
ies+0.148
Neg logit contributions
 then-0.150
 Rover-0.144
 thirteen-0.134
 unison-0.133
 louder-0.133
Tokenled
Token ID992
Feature activation-1.83e+00

Pos logit contributions
 brushes+0.073
 designs+0.064
packs+0.062
 work+0.060
working+0.060
Neg logit contributions
 then-0.098
 Rover-0.093
 louder-0.093
 unison-0.092
rified-0.091
Token with
Token ID351
Feature activation+1.91e-01

Pos logit contributions
ies+0.085
 designs+0.083
 brushes+0.081
 work+0.078
working+0.076
Neg logit contributions
 Rover-0.085
 thirteen-0.083
Mr-0.081
 mercy-0.080
 alas-0.080
Token g
Token ID308
Feature activation-6.50e-01

Pos logit contributions
 alas+0.009
 thirteen+0.009
tons+0.008
 Rover+0.008
Mr+0.008
Neg logit contributions
 designs-0.009
 brushes-0.009
 work-0.009
ies-0.008
 creations-0.008
Tokenlee
Token ID7197
Feature activation-7.34e+00

Pos logit contributions
 brushes+0.028
 works+0.025
bel+0.025
packs+0.025
 collect+0.024
Neg logit contributions
 unison-0.031
der-0.030
 Rover-0.030
 then-0.030
aw-0.030
Token.
Token ID13
Feature activation-5.65e-01

Pos logit contributions
 brushes+0.368
 designs+0.355
ies+0.352
working+0.335
ini+0.333
Neg logit contributions
 Rover-0.321
 then-0.313
 thirteen-0.301
 unison-0.295
 alas-0.295
Token
Token ID198
Feature activation+3.97e-02

Pos logit contributions
 brushes+0.031
 designs+0.030
ies+0.030
working+0.029
ini+0.028
Neg logit contributions
 Rover-0.022
 then-0.021
 thirteen-0.021
 alas-0.020
 unison-0.020
Token
Token ID198
Feature activation+5.50e-01

Pos logit contributions
 Rover+0.001
 then+0.001
 thirteen+0.001
 alas+0.001
 unison+0.001
Neg logit contributions
 brushes-0.002
 designs-0.002
ies-0.002
working-0.002
ini-0.002
TokenThen
Token ID6423
Feature activation+9.81e-01

Pos logit contributions
 Rover+0.019
 mercy+0.018
 then+0.018
 thirteen+0.017
 unison+0.017
Neg logit contributions
 brushes-0.031
 designs-0.031
working-0.030
ini-0.030
ies-0.030
Token,
Token ID11
Feature activation-3.88e-01

Pos logit contributions
 thirteen+0.047
Mr+0.047
 unison+0.047
 Rover+0.046
 mercy+0.046
Neg logit contributions
ies-0.041
 designs-0.041
 work-0.041
 brushes-0.040
working-0.039
Token the
Token ID262
Feature activation-1.07e+00

Pos logit contributions
 brushes+0.005
 designs+0.005
ini+0.005
working+0.005
ies+0.005
Neg logit contributions
 then-0.004
 Rover-0.004
 alas-0.003
 unfortunately-0.003
 thirteen-0.003
Token girl
Token ID2576
Feature activation-4.30e-01

Pos logit contributions
 brushes+0.052
 designs+0.050
ies+0.050
working+0.047
ini+0.046
Neg logit contributions
 Rover-0.049
 then-0.048
 thirteen-0.047
 alas-0.046
 unison-0.046
Token laughed
Token ID13818
Feature activation+1.78e-01

Pos logit contributions
 brushes+0.020
 designs+0.019
ies+0.019
working+0.019
ini+0.019
Neg logit contributions
 then-0.021
 Rover-0.020
 unison-0.019
 thirteen-0.019
 alas-0.018
Token with
Token ID351
Feature activation+2.56e-01

Pos logit contributions
 thirteen+0.009
 Rover+0.009
Mr+0.009
 mercy+0.009
Bad+0.009
Neg logit contributions
ies-0.007
 designs-0.007
 work-0.007
 brushes-0.007
 creations-0.006
Token g
Token ID308
Feature activation-5.77e-01

Pos logit contributions
 thirteen+0.013
tons+0.013
 alas+0.013
Mr+0.012
 unfortunately+0.012
Neg logit contributions
 work-0.011
 brushes-0.011
 designs-0.011
 artwork-0.010
 creations-0.009
Tokenlee
Token ID7197
Feature activation-7.34e+00

Pos logit contributions
bel+0.029
 works+0.026
 brushes+0.026
bows+0.025
packs+0.024
Neg logit contributions
der-0.026
 thirteen-0.025
 unison-0.024
 then-0.023
 Rover-0.023
Token.
Token ID13
Feature activation-4.73e-01

Pos logit contributions
 brushes+0.371
 designs+0.359
ies+0.356
working+0.339
ini+0.336
Neg logit contributions
 Rover-0.316
 then-0.309
 thirteen-0.298
 alas-0.292
 unison-0.291
Token The
Token ID383
Feature activation-1.01e+00

Pos logit contributions
 brushes+0.029
ies+0.028
 designs+0.028
working+0.027
ini+0.026
Neg logit contributions
 Rover-0.016
 then-0.015
 thirteen-0.014
 unison-0.014
 alas-0.014
Token cookies
Token ID14746
Feature activation+1.21e+00

Pos logit contributions
 brushes+0.051
 designs+0.050
ies+0.049
working+0.047
ini+0.046
Neg logit contributions
 Rover-0.044
 then-0.043
 thirteen-0.042
 alas-0.041
 unison-0.041
Token were
Token ID547
Feature activation-1.90e-01

Pos logit contributions
 then+0.058
 Rover+0.056
 thirteen+0.054
 alas+0.054
 unison+0.053
Neg logit contributions
 brushes-0.055
 designs-0.053
ies-0.053
working-0.052
ini-0.051
Token delicious
Token ID12625
Feature activation+6.63e-01

Pos logit contributions
 brushes+0.011
 designs+0.010
ies+0.010
working+0.010
ini+0.010
Neg logit contributions
 Rover-0.007
 then-0.007
 thirteen-0.007
 alas-0.007
 unison-0.007
TokenAs
Token ID1722
Feature activation-2.77e-01

Pos logit contributions
 mercy+0.021
 thirteen+0.021
rified+0.020
 Rover+0.020
 louder+0.019
Neg logit contributions
ini-0.032
rolled-0.031
bel-0.031
 brushes-0.031
ade-0.030
Token the
Token ID262
Feature activation-1.63e+00

Pos logit contributions
 work+0.013
working+0.013
 brushes+0.013
ies+0.013
 designs+0.013
Neg logit contributions
 thirteen-0.012
 unison-0.011
 mercy-0.011
 Rover-0.011
 alas-0.010
Token light
Token ID1657
Feature activation-8.49e-01

Pos logit contributions
 brushes+0.077
ies+0.075
 designs+0.075
working+0.070
packs+0.070
Neg logit contributions
 then-0.076
 Rover-0.076
 thirteen-0.071
 unison-0.071
 alas-0.070
Tokenening
Token ID3101
Feature activation-1.46e+00

Pos logit contributions
 brushes+0.037
 designs+0.035
ugs+0.034
ies+0.034
ini+0.033
Neg logit contributions
 then-0.044
 Rover-0.041
 alas-0.040
 louder-0.039
 unfortunately-0.039
Token f
Token ID277
Feature activation-1.96e+00

Pos logit contributions
 brushes+0.075
 designs+0.072
ies+0.072
ini+0.069
working+0.068
Neg logit contributions
 Rover-0.061
 then-0.061
 alas-0.057
 thirteen-0.057
tons-0.056
Tokenlic
Token ID677
Feature activation-7.31e+00

Pos logit contributions
ies+0.086
 work+0.084
 designs+0.083
 brushes+0.082
packs+0.082
Neg logit contributions
rified-0.096
 thirteen-0.096
jected-0.089
 then-0.088
ified-0.088
Tokenkered
Token ID28970
Feature activation-1.50e-01

Pos logit contributions
 brushes+0.302
packs+0.283
ies+0.277
 designs+0.273
working+0.271
Neg logit contributions
 then-0.390
 unison-0.361
 thirteen-0.360
 Rover-0.355
 alas-0.348
Token away
Token ID1497
Feature activation-4.33e-01

Pos logit contributions
 brushes+0.008
 designs+0.008
ies+0.008
ini+0.007
working+0.007
Neg logit contributions
 Rover-0.006
 then-0.006
 thirteen-0.006
 alas-0.005
 louder-0.005
Token,
Token ID11
Feature activation-3.40e-01

Pos logit contributions
 brushes+0.023
 designs+0.023
ies+0.022
working+0.022
ini+0.021
Neg logit contributions
 Rover-0.017
 then-0.017
 thirteen-0.016
 alas-0.016
 unison-0.016
Token the
Token ID262
Feature activation-5.90e-01

Pos logit contributions
 brushes+0.018
 designs+0.017
ies+0.017
 work+0.017
working+0.017
Neg logit contributions
 thirteen-0.013
 Rover-0.013
 unison-0.013
 mercy-0.012
 then-0.012
Token man
Token ID582
Feature activation+9.37e-01

Pos logit contributions
 brushes+0.026
 designs+0.025
ies+0.025
working+0.024
 work+0.024
Neg logit contributions
 Rover-0.029
 then-0.028
 thirteen-0.028
 alas-0.027
 unison-0.027
Token,
Token ID11
Feature activation+3.17e-01

Pos logit contributions
 then+0.033
 Rover+0.032
 alas+0.029
 unfortunately+0.028
tons+0.028
Neg logit contributions
 brushes-0.056
 designs-0.054
ies-0.053
ini-0.053
working-0.052
Token her
Token ID607
Feature activation-1.75e+00

Pos logit contributions
 then+0.013
 Rover+0.013
 alas+0.012
 unfortunately+0.012
 louder+0.012
Neg logit contributions
 brushes-0.017
 designs-0.017
ies-0.017
ini-0.017
working-0.016
Token mom
Token ID1995
Feature activation-4.78e-01

Pos logit contributions
working+0.112
ini+0.108
bows+0.108
 works+0.106
 designs+0.105
Neg logit contributions
 then-0.054
 Rover-0.047
 no-0.046
 end-0.045
 more-0.044
Token praised
Token ID15342
Feature activation+5.93e-01

Pos logit contributions
 brushes+0.020
 designs+0.019
ies+0.019
working+0.018
ini+0.017
Neg logit contributions
 Rover-0.025
 then-0.025
 thirteen-0.024
 alas-0.024
 unison-0.024
Token her
Token ID607
Feature activation-1.39e+00

Pos logit contributions
 Rover+0.032
 then+0.032
 thirteen+0.030
 alas+0.030
 unison+0.030
Neg logit contributions
 brushes-0.024
 designs-0.023
ies-0.022
working-0.021
ini-0.021
Token for
Token ID329
Feature activation+3.77e+00

Pos logit contributions
 brushes+0.085
 designs+0.082
ies+0.080
ini+0.079
working+0.078
Neg logit contributions
 then-0.047
 Rover-0.046
 thirteen-0.043
 unison-0.042
 alas-0.042
Token being
Token ID852
Feature activation+1.57e+00

Pos logit contributions
 Rover+0.167
 thirteen+0.163
 unison+0.159
 then+0.155
 alas+0.154
Neg logit contributions
 brushes-0.182
 designs-0.178
ies-0.176
 work-0.174
working-0.162
Token so
Token ID523
Feature activation-1.46e+00

Pos logit contributions
 Rover+0.051
 unison+0.050
 thirteen+0.050
tons+0.047
 alas+0.046
Neg logit contributions
 brushes-0.093
 designs-0.089
ies-0.089
ini-0.087
bel-0.086
Token brave
Token ID14802
Feature activation-3.48e-01

Pos logit contributions
 designs+0.082
 brushes+0.080
packs+0.080
ies+0.079
 creations+0.077
Neg logit contributions
 then-0.058
 Rover-0.057
 louder-0.056
 mercy-0.054
 thirteen-0.052
Token.
Token ID13
Feature activation-4.02e-01

Pos logit contributions
 brushes+0.018
ies+0.017
 designs+0.017
working+0.017
 work+0.016
Neg logit contributions
 Rover-0.015
 then-0.014
 thirteen-0.014
 alas-0.014
 unison-0.014
Token Lily
Token ID20037
Feature activation-1.49e+00

Pos logit contributions
ies+0.022
 brushes+0.021
 designs+0.021
working+0.021
ini+0.021
Neg logit contributions
 then-0.016
 Rover-0.016
 �-0.015
 thirteen-0.014
Mr-0.014
Token said
Token ID531
Feature activation+1.72e+00

Pos logit contributions
 then+0.000
 Rover+0.000
 thirteen+0.000
tons+0.000
 alas+0.000
Neg logit contributions
 brushes-0.001
 designs-0.000
ies-0.000
ini-0.000
ugs-0.000
Token they
Token ID484
Feature activation+3.20e+00

Pos logit contributions
 unison+0.066
Mr+0.066
 Rover+0.063
 thirteen+0.063
 mercy+0.060
Neg logit contributions
 designs-0.088
 brushes-0.088
ies-0.088
 work-0.084
working-0.083
Token had
Token ID550
Feature activation+7.40e-01

Pos logit contributions
 then+0.139
 Rover+0.135
 thirteen+0.128
 unison+0.125
 alas+0.125
Neg logit contributions
 brushes-0.164
 designs-0.158
ies-0.158
ini-0.152
ugs-0.151
Token to
Token ID284
Feature activation-1.53e-01

Pos logit contributions
 Rover+0.038
 then+0.037
 thirteen+0.036
 alas+0.036
 unison+0.036
Neg logit contributions
 brushes-0.031
 designs-0.030
ies-0.030
 work-0.028
working-0.028
Token be
Token ID307
Feature activation+1.53e+00

Pos logit contributions
 brushes+0.006
 work+0.006
 designs+0.006
ies+0.006
working+0.005
Neg logit contributions
 Rover-0.008
 thirteen-0.008
 alas-0.008
 then-0.008
rified-0.008
Token patient
Token ID5827
Feature activation+3.82e+00

Pos logit contributions
 Rover+0.068
 then+0.066
 thirteen+0.065
 alas+0.064
 unison+0.064
Neg logit contributions
 brushes-0.075
 designs-0.073
ies-0.073
working-0.069
 work-0.068
Token.
Token ID13
Feature activation-5.92e-02

Pos logit contributions
 Rover+0.187
 thirteen+0.184
Mr+0.178
 alas+0.177
 mercy+0.176
Neg logit contributions
ies-0.168
 designs-0.161
 brushes-0.159
 work-0.159
working-0.155
Token
Token ID220
Feature activation-3.05e+00

Pos logit contributions
 brushes+0.003
 designs+0.003
ies+0.003
working+0.003
ini+0.003
Neg logit contributions
 Rover-0.002
 then-0.002
 thirteen-0.002
 alas-0.002
 unison-0.002
Token
Token ID198
Feature activation+5.83e-01

Pos logit contributions
 brushes+0.190
 designs+0.187
ies+0.185
working+0.180
ini+0.177
Neg logit contributions
 Rover-0.096
 then-0.092
 thirteen-0.085
 alas-0.084
 unison-0.083
Token
Token ID198
Feature activation+1.09e+00

Pos logit contributions
 Rover+0.023
 thirteen+0.022
 then+0.022
 alas+0.021
 unison+0.021
Neg logit contributions
 brushes-0.031
ies-0.030
 designs-0.030
working-0.029
rolled-0.028
TokenThey
Token ID2990
Feature activation+1.74e+00

Pos logit contributions
 Rover+0.042
 thirteen+0.040
tons+0.040
 mercy+0.039
 then+0.039
Neg logit contributions
 brushes-0.059
 designs-0.055
working-0.055
bel-0.055
rolled-0.055
Token,
Token ID11
Feature activation+5.39e-01

Pos logit contributions
 brushes+0.040
 designs+0.039
ini+0.039
ies+0.038
 creations+0.037
Neg logit contributions
 then-0.022
 Rover-0.022
 alas-0.019
tons-0.018
 unfortunately-0.018
Token Tim
Token ID5045
Feature activation-4.77e-01

Pos logit contributions
 then+0.026
 Rover+0.025
 alas+0.024
 unfortunately+0.023
tons+0.023
Neg logit contributions
 brushes-0.026
 designs-0.025
ies-0.025
ini-0.024
ugs-0.024
Token and
Token ID290
Feature activation+4.48e-01

Pos logit contributions
 designs+0.029
 brushes+0.029
bel+0.028
ugs+0.028
bows+0.027
Neg logit contributions
 then-0.019
 Rover-0.017
�-0.015
Mr-0.014
Please-0.014
Token Spot
Token ID15899
Feature activation+1.03e+00

Pos logit contributions
 Rover+0.021
 then+0.020
 mercy+0.019
Mr+0.019
 alas+0.019
Neg logit contributions
 brushes-0.022
 designs-0.021
ies-0.021
ini-0.020
working-0.020
Token went
Token ID1816
Feature activation-1.24e+00

Pos logit contributions
 Rover+0.059
 then+0.058
 unison+0.055
 thirteen+0.055
 alas+0.054
Neg logit contributions
 brushes-0.039
 designs-0.037
ies-0.036
ini-0.035
working-0.034
Token for
Token ID329
Feature activation+3.86e+00

Pos logit contributions
 brushes+0.063
 designs+0.061
ies+0.060
ini+0.058
working+0.057
Neg logit contributions
 Rover-0.055
 then-0.054
 mercy-0.050
 louder-0.049
 thirteen-0.049
Token a
Token ID257
Feature activation+4.41e-01

Pos logit contributions
 Rover+0.155
 thirteen+0.154
 then+0.151
 alas+0.149
 unison+0.145
Neg logit contributions
 brushes-0.202
 designs-0.195
ies-0.192
 work-0.190
working-0.186
Token walk
Token ID2513
Feature activation-1.25e+00

Pos logit contributions
 Rover+0.027
 then+0.027
 unison+0.025
tons+0.025
 alas+0.025
Neg logit contributions
 brushes-0.015
 designs-0.014
ies-0.014
ini-0.013
ugs-0.013
Token to
Token ID284
Feature activation+1.87e-01

Pos logit contributions
ies+0.056
 brushes+0.053
working+0.052
 designs+0.051
packs+0.051
Neg logit contributions
 thirteen-0.063
 Rover-0.061
 unison-0.059
 alas-0.058
 mercy-0.058
Token the
Token ID262
Feature activation+1.34e+00

Pos logit contributions
 Rover+0.007
 then+0.007
 unison+0.006
tons+0.006
 mercy+0.006
Neg logit contributions
 brushes-0.011
ies-0.011
 designs-0.011
ini-0.010
ugs-0.010
Token store
Token ID3650
Feature activation+9.83e-01

Pos logit contributions
 alas+0.074
 thirteen+0.074
 unison+0.073
 Rover+0.073
 mercy+0.072
Neg logit contributions
 brushes-0.050
 work-0.046
 designs-0.046
ies-0.044
working-0.044
Token she
Token ID673
Feature activation+2.59e-01

Pos logit contributions
 then+0.013
 Rover+0.012
 alas+0.012
tons+0.011
 unfortunately+0.011
Neg logit contributions
ini-0.017
 brushes-0.017
 designs-0.017
ies-0.017
ugs-0.017
Token saw
Token ID2497
Feature activation-5.03e-02

Pos logit contributions
 then+0.013
 Rover+0.012
 alas+0.011
 thirteen+0.011
 louder+0.011
Neg logit contributions
 designs-0.012
 brushes-0.012
ies-0.012
bows-0.012
ugs-0.012
Token the
Token ID262
Feature activation+6.85e-01

Pos logit contributions
 brushes+0.003
 designs+0.003
ies+0.003
working+0.003
ini+0.003
Neg logit contributions
 Rover-0.002
 then-0.002
 thirteen-0.002
 alas-0.002
 unison-0.002
Token balloon
Token ID21190
Feature activation+2.59e+00

Pos logit contributions
 Rover+0.033
 then+0.031
 thirteen+0.031
 alas+0.031
 unison+0.030
Neg logit contributions
 brushes-0.032
 designs-0.031
ies-0.031
 work-0.029
working-0.029
Token was
Token ID373
Feature activation+1.11e+00

Pos logit contributions
 Rover+0.111
 then+0.109
 thirteen+0.105
 alas+0.103
 unison+0.102
Neg logit contributions
 brushes-0.132
 designs-0.128
ies-0.127
working-0.121
ini-0.120
Token stuck
Token ID7819
Feature activation+3.31e+00

Pos logit contributions
 Rover+0.050
 then+0.048
 thirteen+0.047
 alas+0.046
 unison+0.046
Neg logit contributions
 brushes-0.055
 designs-0.053
ies-0.053
working-0.050
ini-0.049
Token in
Token ID287
Feature activation+9.67e-01

Pos logit contributions
 Rover+0.149
 then+0.142
 thirteen+0.141
 unison+0.139
 alas+0.138
Neg logit contributions
 brushes-0.160
ies-0.156
 designs-0.154
working-0.148
 work-0.146
Token a
Token ID257
Feature activation+1.29e+00

Pos logit contributions
 Rover+0.038
 then+0.037
 thirteen+0.036
 alas+0.035
 unison+0.035
Neg logit contributions
 brushes-0.053
 designs-0.051
ies-0.051
working-0.049
 work-0.048
Token tree
Token ID5509
Feature activation+1.26e+00

Pos logit contributions
 Rover+0.074
 then+0.072
tons+0.071
 thirteen+0.070
 unison+0.070
Neg logit contributions
 brushes-0.048
 designs-0.044
 work-0.043
ies-0.042
ade-0.040
Token.
Token ID13
Feature activation+1.22e-01

Pos logit contributions
 Rover+0.060
 then+0.057
 thirteen+0.057
 unison+0.056
 alas+0.056
Neg logit contributions
 brushes-0.059
ies-0.056
 designs-0.055
working-0.054
 work-0.053
Token She
Token ID1375
Feature activation-9.33e-02

Pos logit contributions
 Rover+0.005
 then+0.005
Mr+0.004
 alas+0.004
 thirteen+0.004
Neg logit contributions
ies-0.007
 designs-0.007
 brushes-0.007
working-0.007
ini-0.006
Token
Token ID198
Feature activation-4.68e-02

Pos logit contributions
 brushes+0.003
 designs+0.003
ies+0.003
working+0.002
 work+0.002
Neg logit contributions
 Rover-0.002
 then-0.002
 thirteen-0.002
 alas-0.002
 unison-0.002
Token
Token ID198
Feature activation+4.15e-01

Pos logit contributions
 brushes+0.003
 designs+0.003
ies+0.003
working+0.003
ini+0.003
Neg logit contributions
 Rover-0.002
 then-0.002
 thirteen-0.001
 alas-0.001
 unison-0.001
Token"
Token ID1
Feature activation-4.84e-03

Pos logit contributions
 mercy+0.017
 Rover+0.017
 unison+0.016
 alas+0.016
 thirteen+0.016
Neg logit contributions
 brushes-0.022
ies-0.021
bel-0.020
 designs-0.020
working-0.020
TokenThis
Token ID1212
Feature activation+5.09e-01

Pos logit contributions
 brushes+0.000
bows+0.000
 designs+0.000
bel+0.000
ini+0.000
Neg logit contributions
 mercy-0.000
 Rover-0.000
 alas-0.000
 then-0.000
 thirteen-0.000
Token is
Token ID318
Feature activation+1.92e+00

Pos logit contributions
 then+0.021
 Rover+0.020
 thirteen+0.019
 course+0.019
 unison+0.019
Neg logit contributions
ies-0.026
bel-0.026
 brushes-0.025
working-0.025
ugs-0.025
Token for
Token ID329
Feature activation+3.42e+00

Pos logit contributions
 Rover+0.075
 then+0.072
 thirteen+0.070
 alas+0.069
 unison+0.069
Neg logit contributions
 brushes-0.106
 designs-0.102
ies-0.101
working-0.096
 work-0.096
Token you
Token ID345
Feature activation+1.48e+00

Pos logit contributions
 Rover+0.100
 then+0.100
 thirteen+0.091
 unison+0.088
 alas+0.087
Neg logit contributions
 brushes-0.220
 designs-0.216
ies-0.215
ini-0.212
working-0.211
Token,
Token ID11
Feature activation+1.44e+00

Pos logit contributions
 then+0.057
 Rover+0.056
 thirteen+0.052
 alas+0.051
 unison+0.051
Neg logit contributions
 brushes-0.083
 designs-0.082
ini-0.079
ies-0.079
working-0.077
Token Sara
Token ID24799
Feature activation+1.33e+00

Pos logit contributions
 mercy+0.066
 alas+0.066
 thirteen+0.065
 Rover+0.064
 unison+0.063
Neg logit contributions
ies-0.067
 brushes-0.066
 work-0.060
 designs-0.060
bows-0.054
Token.
Token ID13
Feature activation+5.62e-01

Pos logit contributions
 Rover+0.057
 then+0.056
 thirteen+0.054
 alas+0.053
 unison+0.052
Neg logit contributions
 brushes-0.067
 designs-0.065
ies-0.064
working-0.061
ini-0.061
Token Do
Token ID2141
Feature activation+1.30e+00

Pos logit contributions
 Rover+0.019
 then+0.018
 thirteen+0.018
 alas+0.017
 unison+0.017
Neg logit contributions
 brushes-0.034
 designs-0.033
ies-0.032
working-0.031
ini-0.031
Token it
Token ID340
Feature activation-1.19e+00

Pos logit contributions
 brushes+0.024
 designs+0.023
ies+0.023
 work+0.022
working+0.021
Neg logit contributions
 Rover-0.019
 thirteen-0.018
 then-0.018
 alas-0.018
 mercy-0.018
Token was
Token ID373
Feature activation+9.75e-02

Pos logit contributions
 brushes+0.055
 designs+0.052
working+0.052
ies+0.052
ugs+0.051
Neg logit contributions
 then-0.058
 Rover-0.056
 thirteen-0.054
 louder-0.053
 unison-0.052
Token time
Token ID640
Feature activation-1.34e+00

Pos logit contributions
 Rover+0.005
 then+0.005
 thirteen+0.005
 alas+0.005
 unison+0.005
Neg logit contributions
 brushes-0.004
 designs-0.004
ies-0.004
working-0.004
ini-0.004
Token to
Token ID284
Feature activation-7.05e-01

Pos logit contributions
 brushes+0.064
 designs+0.062
ies+0.061
working+0.058
ini+0.057
Neg logit contributions
 Rover-0.062
 then-0.061
 thirteen-0.059
 alas-0.058
 unison-0.058
Token go
Token ID467
Feature activation+3.90e-01

Pos logit contributions
 brushes+0.029
 designs+0.028
ies+0.028
working+0.026
ini+0.026
Neg logit contributions
 Rover-0.037
 then-0.036
 thirteen-0.035
 alas-0.035
 unison-0.035
Token home
Token ID1363
Feature activation-3.36e-01

Pos logit contributions
 then+0.019
 Rover+0.019
tons+0.017
 louder+0.016
 more+0.016
Neg logit contributions
 brushes-0.018
 designs-0.018
working-0.017
ies-0.017
bel-0.017
Token.
Token ID13
Feature activation-2.80e-01

Pos logit contributions
 brushes+0.017
 designs+0.017
ies+0.016
working+0.016
ini+0.016
Neg logit contributions
 Rover-0.014
 then-0.014
 thirteen-0.013
 alas-0.013
 unison-0.013
Token
Token ID198
Feature activation+1.54e-01

Pos logit contributions
 brushes+0.016
 designs+0.015
ies+0.015
 work+0.014
working+0.014
Neg logit contributions
 Rover-0.010
 then-0.010
 thirteen-0.010
 mercy-0.010
 alas-0.010
Token
Token ID220
Feature activation-2.99e+00

Pos logit contributions
 thirteen+0.006
 Rover+0.006
 alas+0.006
tons+0.006
 mercy+0.006
Neg logit contributions
 brushes-0.008
ies-0.008
 designs-0.008
ade-0.008
ini-0.008
Token
Token ID198
Feature activation+6.16e-01

Pos logit contributions
ies+0.202
 designs+0.201
 brushes+0.200
working+0.198
ini+0.193
Neg logit contributions
 Rover-0.084
 then-0.076
 �-0.069
 unison-0.066
 thirteen-0.066
TokenThe
Token ID464
Feature activation+9.67e-02

Pos logit contributions
 mercy+0.026
 Rover+0.026
 thirteen+0.025
tons+0.025
 alas+0.023
Neg logit contributions
ade-0.031
 brushes-0.031
ini-0.029
ies-0.029
bel-0.028
Token
Token ID198
Feature activation+3.42e-01

Pos logit contributions
ies+0.014
 brushes+0.014
 designs+0.014
working+0.013
ini+0.013
Neg logit contributions
 Rover-0.007
 then-0.007
 alas-0.006
 thirteen-0.006
 unison-0.006
Token
Token ID198
Feature activation+7.44e-01

Pos logit contributions
 Rover+0.011
 then+0.011
 thirteen+0.010
 unison+0.010
 alas+0.010
Neg logit contributions
 brushes-0.021
 designs-0.020
ies-0.020
working-0.020
ini-0.019
TokenTim
Token ID14967
Feature activation-1.23e+00

Pos logit contributions
 Rover+0.030
 thirteen+0.028
 then+0.028
 mercy+0.028
 alas+0.028
Neg logit contributions
 brushes-0.040
ies-0.039
 designs-0.038
ini-0.037
 work-0.036
Tokenmy
Token ID1820
Feature activation-2.84e+00

Pos logit contributions
 designs+0.069
bel+0.068
 brushes+0.068
working+0.065
rolled+0.063
Neg logit contributions
 then-0.049
 end-0.041
 Rover-0.041
 louder-0.040
 more-0.039
Token climbed
Token ID19952
Feature activation-5.45e-01

Pos logit contributions
 brushes+0.127
 designs+0.126
ies+0.122
ini+0.118
working+0.117
Neg logit contributions
 then-0.146
 Rover-0.135
 alas-0.128
tons-0.127
 thirteen-0.126
Token onto
Token ID4291
Feature activation+1.11e+00

Pos logit contributions
 brushes+0.023
 designs+0.023
ies+0.022
ini+0.021
working+0.021
Neg logit contributions
 Rover-0.028
 then-0.028
 thirteen-0.027
 alas-0.026
tons-0.026
Token the
Token ID262
Feature activation+4.01e-01

Pos logit contributions
 Rover+0.041
 then+0.040
 thirteen+0.038
 alas+0.038
 unison+0.037
Neg logit contributions
 brushes-0.063
ies-0.061
 designs-0.061
working-0.058
ini-0.058
Token chair
Token ID5118
Feature activation+4.15e-01

Pos logit contributions
 then+0.020
 Rover+0.020
 thirteen+0.019
tons+0.019
 louder+0.019
Neg logit contributions
 brushes-0.017
 designs-0.017
ies-0.017
ini-0.016
working-0.016
Token,
Token ID11
Feature activation+6.34e-01

Pos logit contributions
 Rover+0.019
 thirteen+0.018
 unison+0.018
Mr+0.018
tons+0.018
Neg logit contributions
 brushes-0.020
ies-0.019
 designs-0.019
 work-0.018
working-0.018
Token but
Token ID475
Feature activation+1.81e+00

Pos logit contributions
 Rover+0.026
 then+0.025
 thirteen+0.025
 unison+0.025
 alas+0.024
Neg logit contributions
 brushes-0.033
 designs-0.032
ies-0.032
working-0.030
 work-0.030
Token it
Token ID340
Feature activation-2.81e-01

Pos logit contributions
 Rover+0.057
 thirteen+0.053
 then+0.053
 unison+0.052
 mercy+0.051
Neg logit contributions
 brushes-0.113
ies-0.110
 designs-0.110
working-0.106
 work-0.104
Token upon
Token ID2402
Feature activation+1.89e-02

Pos logit contributions
 designs+0.071
 brushes+0.071
bows+0.065
ini+0.065
working+0.064
Neg logit contributions
 then-0.104
 Rover-0.103
 alas-0.097
tons-0.096
 �-0.095
Token a
Token ID257
Feature activation+2.35e-01

Pos logit contributions
 Rover+0.001
 thirteen+0.001
 then+0.001
 alas+0.001
 unison+0.001
Neg logit contributions
 brushes-0.001
 designs-0.001
ies-0.001
 work-0.001
working-0.001
Token time
Token ID640
Feature activation-3.76e-01

Pos logit contributions
 Rover+0.014
 thirteen+0.014
 then+0.013
 alas+0.013
 mercy+0.013
Neg logit contributions
 brushes-0.008
ies-0.007
working-0.007
 work-0.007
 designs-0.007
Token there
Token ID612
Feature activation-1.77e+00

Pos logit contributions
 brushes+0.020
 designs+0.019
ies+0.019
working+0.018
ini+0.018
Neg logit contributions
 Rover-0.015
 then-0.015
 thirteen-0.014
 alas-0.014
 unison-0.014
Token lived
Token ID5615
Feature activation+1.44e+00

Pos logit contributions
 brushes+0.085
 designs+0.083
ies+0.081
working+0.081
bel+0.077
Neg logit contributions
 then-0.084
 Rover-0.083
tons-0.077
 alas-0.077
 unison-0.074
Token a
Token ID257
Feature activation+9.23e-01

Pos logit contributions
 thirteen+0.070
 Rover+0.068
 alas+0.068
 mercy+0.067
 then+0.066
Neg logit contributions
 brushes-0.062
 designs-0.058
ies-0.058
 work-0.056
packs-0.055
Token very
Token ID845
Feature activation+8.80e-01

Pos logit contributions
 Rover+0.051
 then+0.050
 thirteen+0.049
 alas+0.049
 unison+0.048
Neg logit contributions
 brushes-0.035
ies-0.034
 designs-0.034
working-0.031
ini-0.031
Token curious
Token ID11040
Feature activation+4.29e-01

Pos logit contributions
 then+0.036
 Rover+0.035
 End+0.035
 end+0.033
 louder+0.033
Neg logit contributions
 brushes-0.047
bows-0.046
packs-0.045
 designs-0.045
ugs-0.044
Token little
Token ID1310
Feature activation-1.34e+00

Pos logit contributions
 Rover+0.021
 then+0.020
 thirteen+0.019
 unison+0.019
tons+0.019
Neg logit contributions
 brushes-0.020
 designs-0.019
ies-0.019
working-0.018
bel-0.018
Token girl
Token ID2576
Feature activation+1.48e+00

Pos logit contributions
 brushes+0.068
ies+0.067
 designs+0.066
working+0.063
 work+0.062
Neg logit contributions
 Rover-0.056
 then-0.056
 thirteen-0.056
 alas-0.055
 unison-0.053
Token.
Token ID13
Feature activation+8.11e-01

Pos logit contributions
 then+0.055
 Rover+0.055
 unison+0.049
 thirteen+0.049
 alas+0.048
Neg logit contributions
 brushes-0.084
 designs-0.082
ies-0.080
working-0.077
ini-0.077
Token looking
Token ID2045
Feature activation-2.76e+00

Pos logit contributions
 brushes+0.067
 designs+0.065
ies+0.065
working+0.062
ini+0.061
Neg logit contributions
 Rover-0.051
 then-0.050
 thirteen-0.048
 alas-0.047
 unison-0.047
Token for
Token ID329
Feature activation+2.67e+00

Pos logit contributions
 brushes+0.172
 designs+0.165
ini+0.164
ies+0.162
working+0.160
Neg logit contributions
 then-0.088
 Rover-0.084
 louder-0.080
 thirteen-0.076
 alas-0.075
Token things
Token ID1243
Feature activation-7.91e-01

Pos logit contributions
 Rover+0.106
 then+0.105
 thirteen+0.099
 alas+0.099
 unison+0.098
Neg logit contributions
 brushes-0.143
ies-0.139
 designs-0.138
working-0.134
ini-0.132
Token to
Token ID284
Feature activation-1.07e+00

Pos logit contributions
 brushes+0.037
 designs+0.035
ies+0.035
working+0.033
ini+0.033
Neg logit contributions
 Rover-0.037
 then-0.037
 thirteen-0.035
 alas-0.035
 unison-0.035
Token fix
Token ID4259
Feature activation+1.33e+00

Pos logit contributions
ies+0.049
ugs+0.047
packs+0.046
 brushes+0.046
 designs+0.045
Neg logit contributions
 then-0.055
 alas-0.048
 more-0.048
 Rover-0.048
 end-0.048
Token.
Token ID13
Feature activation-2.86e-01

Pos logit contributions
 then+0.054
 Rover+0.052
 alas+0.048
 louder+0.048
 thirteen+0.048
Neg logit contributions
 brushes-0.069
working-0.067
 designs-0.067
ies-0.067
ini-0.066
Token She
Token ID1375
Feature activation-4.69e-01

Pos logit contributions
ies+0.019
ini+0.017
 designs+0.017
working+0.017
 works+0.017
Neg logit contributions
 then-0.010
 �-0.009
 Rover-0.009
 alas-0.008
 Please-0.008
Token found
Token ID1043
Feature activation-2.11e+00

Pos logit contributions
bows+0.021
ugs+0.020
ies+0.020
 brushes+0.020
 designs+0.020
Neg logit contributions
 then-0.025
 Rover-0.021
 alas-0.021
 unfortunately-0.020
 more-0.020
Token a
Token ID257
Feature activation-8.17e-02

Pos logit contributions
 brushes+0.118
ies+0.116
 designs+0.115
working+0.111
ini+0.110
Neg logit contributions
 Rover-0.078
 then-0.077
 thirteen-0.073
 alas-0.072
 mercy-0.070
Token picture
Token ID4286
Feature activation-7.86e-01

Pos logit contributions
 brushes+0.003
ies+0.003
 designs+0.003
working+0.002
ini+0.002
Neg logit contributions
 Rover-0.005
 then-0.005
 thirteen-0.005
 alas-0.005
 unison-0.005
Token frame
Token ID5739
Feature activation-7.36e-01

Pos logit contributions
 brushes+0.040
 designs+0.039
ies+0.039
working+0.037
ini+0.037
Neg logit contributions
 Rover-0.033
 then-0.033
 thirteen-0.031
 alas-0.031
 unison-0.031
Token mom
Token ID1995
Feature activation-1.46e+00

Pos logit contributions
 Rover+0.006
 then+0.006
 thirteen+0.005
 unison+0.005
 alas+0.005
Neg logit contributions
 brushes-0.006
ies-0.006
 designs-0.006
working-0.006
ini-0.006
Tokenmy
Token ID1820
Feature activation-2.43e+00

Pos logit contributions
 brushes+0.069
 designs+0.068
ies+0.066
working+0.064
ini+0.063
Neg logit contributions
 then-0.068
 Rover-0.067
 thirteen-0.063
 alas-0.063
 unison-0.063
Token hugged
Token ID41130
Feature activation-4.05e-01

Pos logit contributions
 brushes+0.120
 designs+0.117
ies+0.113
working+0.111
ini+0.109
Neg logit contributions
 then-0.110
 Rover-0.108
 thirteen-0.100
 unison-0.099
 alas-0.098
Token him
Token ID683
Feature activation+2.80e-01

Pos logit contributions
 brushes+0.022
 work+0.021
ies+0.021
 designs+0.020
bel+0.019
Neg logit contributions
 thirteen-0.016
 Rover-0.015
 unison-0.015
 alas-0.014
tons-0.014
Token and
Token ID290
Feature activation+6.89e-02

Pos logit contributions
 thirteen+0.014
 Rover+0.014
Did+0.013
Please+0.013
 unison+0.013
Neg logit contributions
ies-0.012
 brushes-0.011
 work-0.011
 designs-0.011
 creations-0.010
Token said
Token ID531
Feature activation+4.12e-01

Pos logit contributions
 then+0.003
 Rover+0.003
 mercy+0.003
 alas+0.003
 unison+0.003
Neg logit contributions
 brushes-0.004
 designs-0.004
ies-0.003
working-0.003
ini-0.003
Token,
Token ID11
Feature activation+2.81e-01

Pos logit contributions
 unison+0.019
 Rover+0.019
 thirteen+0.019
Mr+0.018
 alas+0.018
Neg logit contributions
 brushes-0.019
ies-0.018
 designs-0.018
working-0.017
 work-0.017
Token "
Token ID366
Feature activation+6.52e-01

Pos logit contributions
 Rover+0.011
 unison+0.011
 mercy+0.011
 thirteen+0.011
 alas+0.010
Neg logit contributions
 brushes-0.015
 designs-0.014
ies-0.014
 work-0.013
working-0.013
Tokenyou
Token ID5832
Feature activation+8.57e-01

Pos logit contributions
 Rover+0.023
 thirteen+0.022
 unison+0.021
 mercy+0.021
 then+0.021
Neg logit contributions
 brushes-0.037
ini-0.035
working-0.033
 designs-0.033
bel-0.033
Token don
Token ID836
Feature activation+1.36e+00

Pos logit contributions
 then+0.037
 no+0.034
 Rover+0.034
 mercy+0.033
 alas+0.032
Neg logit contributions
 brushes-0.044
ies-0.044
 designs-0.043
ugs-0.043
bel-0.043
Token't
Token ID470
Feature activation+3.20e-02

Pos logit contributions
 Rover+0.065
 then+0.063
 thirteen+0.062
 alas+0.061
 unison+0.061
Neg logit contributions
 brushes-0.061
 designs-0.059
ies-0.058
 work-0.056
working-0.056
Token
Token ID220
Feature activation-3.00e+00

Pos logit contributions
 brushes+0.008
ies+0.008
 designs+0.008
working+0.008
ini+0.008
Neg logit contributions
 Rover-0.006
 then-0.006
 thirteen-0.006
 alas-0.006
 unison-0.006
Token
Token ID198
Feature activation+2.30e-01

Pos logit contributions
 brushes+0.192
 designs+0.190
ies+0.187
working+0.184
ini+0.180
Neg logit contributions
 Rover-0.090
 then-0.086
Mr-0.077
 thirteen-0.077
 alas-0.076
Token
Token ID198
Feature activation+6.30e-01

Pos logit contributions
 then+0.006
 Rover+0.006
Mr+0.006
 alas+0.005
 �+0.005
Neg logit contributions
 designs-0.015
 brushes-0.015
ies-0.015
working-0.015
 creations-0.014
TokenThe
Token ID464
Feature activation+3.87e-01

Pos logit contributions
 then+0.024
 Rover+0.024
 thirteen+0.023
 alas+0.023
Mr+0.023
Neg logit contributions
 brushes-0.035
 designs-0.034
ies-0.034
working-0.032
 work-0.032
Token boat
Token ID8848
Feature activation-5.55e-01

Pos logit contributions
 Rover+0.018
 alas+0.018
 thirteen+0.017
 mercy+0.017
rified+0.017
Neg logit contributions
 designs-0.019
 brushes-0.018
ies-0.018
 work-0.018
ini-0.017
Token came
Token ID1625
Feature activation-2.15e+00

Pos logit contributions
 brushes+0.022
ies+0.022
 designs+0.021
 work+0.020
ini+0.020
Neg logit contributions
 Rover-0.030
 thirteen-0.029
 then-0.028
 alas-0.028
 mercy-0.028
Token closer
Token ID5699
Feature activation+1.36e+00

Pos logit contributions
 brushes+0.115
ies+0.115
 designs+0.112
 work+0.108
working+0.107
Neg logit contributions
 Rover-0.085
 then-0.080
 thirteen-0.080
 alas-0.078
 unison-0.078
Token and
Token ID290
Feature activation+1.28e+00

Pos logit contributions
 Rover+0.071
 unison+0.069
Mr+0.068
 mercy+0.068
 thirteen+0.067
Neg logit contributions
ies-0.057
 brushes-0.055
 work-0.054
 designs-0.054
working-0.049
Token closer
Token ID5699
Feature activation+2.27e-01

Pos logit contributions
 Rover+0.051
 then+0.051
 thirteen+0.048
 alas+0.047
 unison+0.046
Neg logit contributions
 brushes-0.069
 designs-0.067
ies-0.067
working-0.064
ini-0.063
Token until
Token ID1566
Feature activation+1.05e-01

Pos logit contributions
 Rover+0.011
 unison+0.011
 thirteen+0.011
Mr+0.011
 alas+0.011
Neg logit contributions
 brushes-0.010
ies-0.010
 designs-0.010
 work-0.009
working-0.009
Token it
Token ID340
Feature activation-3.68e-01

Pos logit contributions
 Rover+0.004
 then+0.004
 thirteen+0.004
 alas+0.004
 unison+0.003
Neg logit contributions
 brushes-0.006
 designs-0.006
ies-0.006
working-0.006
ini-0.006
Token noticed
Token ID6810
Feature activation+1.28e+00

Pos logit contributions
 Rover+0.027
 then+0.026
 thirteen+0.026
 alas+0.025
 unison+0.025
Neg logit contributions
 brushes-0.026
ies-0.025
 designs-0.025
working-0.024
 work-0.023
Token he
Token ID339
Feature activation+2.55e+00

Pos logit contributions
 thirteen+0.039
 Rover+0.037
 unison+0.037
 alas+0.035
tons+0.034
Neg logit contributions
 brushes-0.081
ies-0.080
 designs-0.079
 work-0.075
working-0.074
Token was
Token ID373
Feature activation+1.58e+00

Pos logit contributions
 then+0.126
 Rover+0.119
 alas+0.112
 louder+0.112
 unison+0.112
Neg logit contributions
 brushes-0.122
 designs-0.115
ies-0.111
ugs-0.109
working-0.108
Token missing
Token ID4814
Feature activation+6.72e-01

Pos logit contributions
 Rover+0.071
 thirteen+0.069
 then+0.069
 unison+0.068
 alas+0.068
Neg logit contributions
 brushes-0.075
 designs-0.073
ies-0.073
working-0.069
 work-0.069
Token and
Token ID290
Feature activation+8.33e-01

Pos logit contributions
 Rover+0.029
 then+0.028
 thirteen+0.028
 alas+0.027
 unison+0.027
Neg logit contributions
 brushes-0.034
 designs-0.033
ies-0.033
 work-0.031
working-0.031
Token went
Token ID1816
Feature activation-1.26e+00

Pos logit contributions
 then+0.040
 Rover+0.039
 louder+0.035
 alas+0.035
 mercy+0.035
Neg logit contributions
 brushes-0.040
 designs-0.038
ini-0.038
ugs-0.037
bows-0.037
Token to
Token ID284
Feature activation+6.23e-02

Pos logit contributions
 brushes+0.063
 designs+0.061
ies+0.060
ini+0.058
working+0.057
Neg logit contributions
 Rover-0.056
 then-0.055
 louder-0.052
 thirteen-0.051
 mercy-0.051
Token look
Token ID804
Feature activation-1.30e+00

Pos logit contributions
 Rover+0.003
 then+0.003
 mercy+0.003
 louder+0.002
 alas+0.002
Neg logit contributions
 brushes-0.003
ini-0.003
ies-0.003
ugs-0.003
 designs-0.003
Token for
Token ID329
Feature activation+3.04e+00

Pos logit contributions
ies+0.066
 brushes+0.066
 designs+0.065
 work+0.063
working+0.060
Neg logit contributions
 Rover-0.055
 thirteen-0.054
 unison-0.053
 alas-0.052
 then-0.051
Token him
Token ID683
Feature activation+7.48e-01

Pos logit contributions
 thirteen+0.128
 Rover+0.124
 then+0.120
 unison+0.118
 alas+0.118
Neg logit contributions
 brushes-0.156
 designs-0.155
ies-0.152
 work-0.148
 creations-0.139
Token.
Token ID13
Feature activation+1.12e-01

Pos logit contributions
 Rover+0.033
 then+0.032
 thirteen+0.031
 alas+0.030
 unison+0.030
Neg logit contributions
 brushes-0.037
 designs-0.036
ies-0.036
working-0.034
ini-0.034
Token ink
Token ID16882
Feature activation+3.31e-01

Pos logit contributions
 brushes+0.057
ies+0.055
 designs+0.055
working+0.052
ini+0.051
Neg logit contributions
 Rover-0.071
 then-0.070
 thirteen-0.067
 alas-0.066
 unison-0.066
Token!
Token ID0
Feature activation-7.34e-01

Pos logit contributions
 Rover+0.015
 then+0.015
 alas+0.014
 thirteen+0.014
 unison+0.014
Neg logit contributions
 brushes-0.016
 designs-0.015
ies-0.015
working-0.014
 work-0.014
Token
Token ID198
Feature activation+2.87e-01

Pos logit contributions
 brushes+0.037
 designs+0.036
ies+0.035
working+0.034
ini+0.034
Neg logit contributions
 Rover-0.031
 then-0.031
 thirteen-0.030
 unison-0.029
 mercy-0.029
Token
Token ID198
Feature activation+7.93e-01

Pos logit contributions
 Rover+0.010
 then+0.009
 thirteen+0.009
 alas+0.009
 unison+0.009
Neg logit contributions
 brushes-0.017
 designs-0.017
ies-0.017
working-0.016
ini-0.016
TokenE
Token ID36
Feature activation-2.36e+00

Pos logit contributions
 Rover+0.028
 then+0.027
 thirteen+0.026
 alas+0.026
 unison+0.025
Neg logit contributions
 brushes-0.047
 designs-0.045
ies-0.045
working-0.043
ini-0.043
Tokenlla
Token ID8466
Feature activation-9.63e-01

Pos logit contributions
 brushes+0.108
 designs+0.104
ies+0.099
ini+0.095
working+0.094
Neg logit contributions
 Rover-0.116
 then-0.114
 thirteen-0.109
tons-0.108
 alas-0.108
Token was
Token ID373
Feature activation-3.17e-02

Pos logit contributions
 brushes+0.043
 designs+0.043
working+0.041
bows+0.041
ini+0.041
Neg logit contributions
 then-0.049
 Rover-0.043
 thirteen-0.042
 unison-0.042
 louder-0.041
Token so
Token ID523
Feature activation-2.51e+00

Pos logit contributions
 brushes+0.002
 designs+0.002
ies+0.002
working+0.002
 work+0.002
Neg logit contributions
 Rover-0.001
 then-0.001
 thirteen-0.001
 alas-0.001
 unison-0.001
Token excited
Token ID6568
Feature activation-1.70e+00

Pos logit contributions
 designs+0.150
 brushes+0.148
ini+0.145
ies+0.145
bows+0.145
Neg logit contributions
 then-0.101
 no-0.085
 louder-0.083
�-0.082
 Rover-0.081
Token she
Token ID673
Feature activation-4.95e-01

Pos logit contributions
 brushes+0.099
 designs+0.097
ies+0.094
ini+0.092
working+0.092
Neg logit contributions
 Rover-0.061
 then-0.060
 thirteen-0.054
 alas-0.054
 mercy-0.053
Token ran
Token ID4966
Feature activation-1.12e+00

Pos logit contributions
packs+0.024
ugs+0.024
 brushes+0.024
bows+0.023
working+0.023
Neg logit contributions
 then-0.025
 Rover-0.022
 Please-0.021
Please-0.020
 louder-0.020
Token trains
Token ID13404
Feature activation-1.92e+00

Pos logit contributions
 brushes+0.084
 designs+0.079
ies+0.077
bows+0.077
working+0.076
Neg logit contributions
 then-0.050
 Rover-0.048
 alas-0.044
 mercy-0.043
 louder-0.043
Token could
Token ID714
Feature activation-1.21e+00

Pos logit contributions
 brushes+0.095
 designs+0.093
ini+0.092
ies+0.089
ugs+0.089
Neg logit contributions
 then-0.088
 Rover-0.084
 thirteen-0.077
 alas-0.076
 no-0.076
Token not
Token ID407
Feature activation-2.21e-01

Pos logit contributions
 brushes+0.057
 designs+0.056
ies+0.055
ugs+0.053
ini+0.053
Neg logit contributions
 then-0.058
 Rover-0.056
 mercy-0.053
 thirteen-0.053
 alas-0.052
Token reach
Token ID3151
Feature activation+7.82e-01

Pos logit contributions
 brushes+0.010
 designs+0.009
ies+0.009
ugs+0.009
ini+0.009
Neg logit contributions
 then-0.011
 Rover-0.011
 thirteen-0.010
 mercy-0.010
 unison-0.010
Token the
Token ID262
Feature activation+1.72e-02

Pos logit contributions
 Rover+0.026
 then+0.025
 thirteen+0.023
 alas+0.023
 louder+0.022
Neg logit contributions
 brushes-0.048
 designs-0.046
ies-0.046
ini-0.045
working-0.045
Token town
Token ID3240
Feature activation-1.21e+00

Pos logit contributions
 Rover+0.001
 alas+0.001
Mr+0.001
Did+0.001
 unison+0.001
Neg logit contributions
 work-0.001
 designs-0.001
 brushes-0.001
working-0.001
 design-0.001
Token and
Token ID290
Feature activation+1.75e-01

Pos logit contributions
 brushes+0.066
 designs+0.063
ies+0.062
working+0.060
ini+0.060
Neg logit contributions
 then-0.048
 Rover-0.047
 thirteen-0.045
 alas-0.044
 unison-0.044
Token everyone
Token ID2506
Feature activation-1.28e+00

Pos logit contributions
 then+0.009
 Rover+0.008
 alas+0.008
 thirteen+0.008
 mercy+0.008
Neg logit contributions
 brushes-0.008
ies-0.008
 designs-0.008
ini-0.008
ugs-0.008
Token was
Token ID373
Feature activation-2.08e-01

Pos logit contributions
 brushes+0.067
 designs+0.063
ies+0.062
ini+0.062
ugs+0.061
Neg logit contributions
 then-0.055
 Rover-0.052
 thirteen-0.049
 louder-0.048
 alas-0.047
Token unhappy
Token ID19283
Feature activation+7.64e-01

Pos logit contributions
 brushes+0.010
 designs+0.010
ies+0.010
working+0.009
ini+0.009
Neg logit contributions
 Rover-0.009
 then-0.009
 thirteen-0.009
 alas-0.009
 unison-0.009
Token.
Token ID13
Feature activation-4.90e-01

Pos logit contributions
 Rover+0.037
 thirteen+0.035
 then+0.034
 alas+0.034
 unison+0.034
Neg logit contributions
 brushes-0.034
ies-0.034
 designs-0.034
 work-0.032
working-0.032
Token her
Token ID607
Feature activation-2.10e+00

Pos logit contributions
 brushes+0.026
 designs+0.025
ies+0.024
ini+0.024
working+0.024
Neg logit contributions
 then-0.027
 Rover-0.027
 alas-0.025
 louder-0.025
Mr-0.025
Token mom
Token ID1995
Feature activation-7.53e-01

Pos logit contributions
working+0.136
bows+0.135
 brushes+0.135
ini+0.134
rolled+0.133
Neg logit contributions
 then-0.066
 more-0.055
 end-0.055
 no-0.054
 mercy-0.051
Tokenmy
Token ID1820
Feature activation-1.55e+00

Pos logit contributions
 brushes+0.037
 designs+0.036
rolled+0.035
working+0.035
ini+0.034
Neg logit contributions
 then-0.035
 more-0.031
 louder-0.030
 end-0.030
 Rover-0.030
Token and
Token ID290
Feature activation+9.32e-01

Pos logit contributions
 brushes+0.094
 designs+0.090
ini+0.089
ies+0.088
rolled+0.088
Neg logit contributions
 then-0.054
 Rover-0.046
 more-0.045
'-0.044
 alas-0.043
Token smiled
Token ID13541
Feature activation+2.12e-01

Pos logit contributions
 then+0.040
 Rover+0.036
 mercy+0.034
 unfortunately+0.034
 louder+0.033
Neg logit contributions
 brushes-0.049
ini-0.048
working-0.048
ugs-0.047
 designs-0.047
Token as
Token ID355
Feature activation-1.71e+00

Pos logit contributions
 Rover+0.010
 thirteen+0.010
 alas+0.009
 unison+0.009
 then+0.009
Neg logit contributions
ies-0.010
 designs-0.009
 brushes-0.009
 work-0.009
working-0.009
Token she
Token ID673
Feature activation-3.41e-01

Pos logit contributions
rolled+0.114
ini+0.113
ugs+0.111
ats+0.110
ade+0.109
Neg logit contributions
 no-0.051
 then-0.050
 Rover-0.047
 more-0.046
 louder-0.046
Token sn
Token ID3013
Feature activation-1.09e+00

Pos logit contributions
 brushes+0.016
 designs+0.016
ies+0.016
working+0.016
ini+0.015
Neg logit contributions
 then-0.016
 Rover-0.016
 unison-0.015
Mr-0.014
 thirteen-0.014
Tokenugg
Token ID6837
Feature activation-3.86e-01

Pos logit contributions
 designs+0.055
 brushes+0.053
working+0.049
 works+0.048
ies+0.048
Neg logit contributions
 then-0.051
 Rover-0.048
 alas-0.047
tons-0.045
Mr-0.044
Tokenled
Token ID992
Feature activation-3.01e-01

Pos logit contributions
 brushes+0.014
 designs+0.013
rolled+0.012
working+0.012
packs+0.012
Neg logit contributions
 then-0.023
 Rover-0.022
 louder-0.021
tons-0.021
Please-0.021
Token into
Token ID656
Feature activation+3.74e-01

Pos logit contributions
 brushes+0.014
 designs+0.014
ies+0.013
ini+0.013
working+0.013
Neg logit contributions
 Rover-0.014
 then-0.014
 thirteen-0.013
 alas-0.013
 unison-0.013
Token harder
Token ID7069
Feature activation+2.72e+00

Pos logit contributions
 brushes+0.121
ies+0.121
 designs+0.120
 work+0.115
working+0.112
Neg logit contributions
 Rover-0.098
Mr-0.091
 thirteen-0.090
 then-0.089
 unison-0.089
Token.
Token ID13
Feature activation-4.38e-01

Pos logit contributions
 then+0.108
 Rover+0.105
 thirteen+0.099
 alas+0.098
 mercy+0.096
Neg logit contributions
 brushes-0.146
 designs-0.142
ies-0.138
ini-0.137
rolled-0.135
Token
Token ID198
Feature activation-9.28e-02

Pos logit contributions
 brushes+0.023
 designs+0.022
ies+0.022
working+0.020
ini+0.020
Neg logit contributions
 Rover-0.018
 thirteen-0.018
 then-0.017
 mercy-0.017
 unison-0.017
Token
Token ID198
Feature activation+4.16e-01

Pos logit contributions
 brushes+0.005
 designs+0.005
ies+0.005
working+0.005
ini+0.005
Neg logit contributions
 Rover-0.003
 then-0.003
 thirteen-0.003
 alas-0.003
 unison-0.003
TokenSo
Token ID2396
Feature activation+3.39e-01

Pos logit contributions
 Rover+0.014
 thirteen+0.014
 mercy+0.013
 unison+0.013
 then+0.013
Neg logit contributions
 brushes-0.025
ies-0.024
ini-0.023
 designs-0.023
rolled-0.023
Token B
Token ID347
Feature activation-4.99e+00

Pos logit contributions
 Rover+0.015
 then+0.014
 thirteen+0.014
 alas+0.014
 unison+0.014
Neg logit contributions
 brushes-0.017
 designs-0.016
ies-0.016
working-0.016
ini-0.015
Tokenizzy
Token ID40593
Feature activation-3.60e+00

Pos logit contributions
 brushes+0.241
 designs+0.228
packs+0.226
bel+0.218
 creations+0.206
Neg logit contributions
 then-0.214
 Rover-0.212
Mr-0.208
rified-0.207
�-0.206
Token looked
Token ID3114
Feature activation-2.75e+00

Pos logit contributions
 brushes+0.173
 designs+0.170
ies+0.165
ugs+0.164
ini+0.163
Neg logit contributions
 then-0.176
 Rover-0.160
 thirteen-0.150
 alas-0.148
 unison-0.148
Token.
Token ID13
Feature activation-5.83e-01

Pos logit contributions
 brushes+0.122
 designs+0.117
ies+0.117
working+0.110
 work+0.108
Neg logit contributions
 Rover-0.137
 then-0.132
 thirteen-0.130
 unison-0.128
 alas-0.128
Token She
Token ID1375
Feature activation-5.84e-01

Pos logit contributions
ies+0.031
 brushes+0.031
 designs+0.030
working+0.029
ini+0.029
Neg logit contributions
 Rover-0.024
 then-0.023
 thirteen-0.022
 alas-0.022
tons-0.021
Token searched
Token ID16499
Feature activation-2.96e+00

Pos logit contributions
 brushes+0.026
 designs+0.025
ies+0.025
ugs+0.024
working+0.024
Neg logit contributions
 then-0.030
 Rover-0.028
 thirteen-0.027
 alas-0.027
 unison-0.026
Token filled
Token ID5901
Feature activation+6.41e-01

Pos logit contributions
 then+0.038
 Rover+0.035
 louder+0.034
 thirteen+0.032
 unison+0.032
Neg logit contributions
 brushes-0.037
ugs-0.034
packs-0.034
working-0.034
 designs-0.033
Token with
Token ID351
Feature activation+1.22e+00

Pos logit contributions
 thirteen+0.032
 Rover+0.031
 mercy+0.031
 unison+0.030
 alas+0.030
Neg logit contributions
 brushes-0.027
 designs-0.026
ies-0.026
 work-0.025
working-0.025
Token joy
Token ID8716
Feature activation+1.15e-01

Pos logit contributions
 Rover+0.059
 thirteen+0.058
 then+0.057
 alas+0.057
tons+0.056
Neg logit contributions
 brushes-0.055
 designs-0.053
ies-0.053
 work-0.050
ini-0.048
Token,
Token ID11
Feature activation-6.71e-01

Pos logit contributions
 Rover+0.004
 then+0.004
 thirteen+0.004
 unison+0.004
 alas+0.004
Neg logit contributions
 brushes-0.006
 designs-0.006
ies-0.006
ini-0.006
working-0.006
Token watching
Token ID4964
Feature activation-8.70e-01

Pos logit contributions
 brushes+0.036
 designs+0.034
ies+0.034
working+0.033
 work+0.032
Neg logit contributions
 Rover-0.027
 then-0.026
 thirteen-0.026
 alas-0.025
 unison-0.025
Token how
Token ID703
Feature activation-4.70e+00

Pos logit contributions
 designs+0.045
ies+0.044
 brushes+0.044
 work+0.043
working+0.041
Neg logit contributions
 thirteen-0.034
 alas-0.033
 Rover-0.033
 mercy-0.033
 unison-0.033
Token happy
Token ID3772
Feature activation-3.18e+00

Pos logit contributions
 brushes+0.253
 designs+0.242
ies+0.240
ugs+0.233
packs+0.233
Neg logit contributions
 Rover-0.194
 then-0.186
 louder-0.175
 mercy-0.169
Mr-0.169
Token the
Token ID262
Feature activation-2.58e+00

Pos logit contributions
 brushes+0.181
ies+0.178
 designs+0.178
 work+0.167
working+0.167
Neg logit contributions
 Rover-0.112
 thirteen-0.111
 then-0.109
 alas-0.107
 unison-0.107
Token ducks
Token ID39694
Feature activation-2.10e-01

Pos logit contributions
 brushes+0.138
 designs+0.132
ies+0.132
working+0.126
ini+0.124
Neg logit contributions
 Rover-0.105
 then-0.103
 thirteen-0.098
 unison-0.096
 alas-0.095
Token were
Token ID547
Feature activation-4.56e-01

Pos logit contributions
 brushes+0.011
 designs+0.010
ugs+0.010
ies+0.010
ini+0.010
Neg logit contributions
 then-0.010
 Rover-0.009
 thirteen-0.009
 alas-0.008
tons-0.008
Token!
Token ID0
Feature activation-9.58e-01

Pos logit contributions
 brushes+0.023
 designs+0.022
ies+0.022
working+0.021
ini+0.021
Neg logit contributions
 Rover-0.020
 then-0.019
 thirteen-0.019
 alas-0.018
 unison-0.018
Token "
Token ID366
Feature activation+5.73e-01

Pos logit contributions
 Rover+0.011
 mercy+0.010
 thirteen+0.010
 unison+0.010
 alas+0.010
Neg logit contributions
 brushes-0.013
 designs-0.013
ies-0.013
 work-0.012
working-0.012
TokenThanks
Token ID9690
Feature activation+2.25e-01

Pos logit contributions
 Rover+0.021
 mercy+0.020
 thirteen+0.019
 then+0.019
 alas+0.018
Neg logit contributions
 brushes-0.033
 designs-0.032
ini-0.032
working-0.031
ies-0.030
Token for
Token ID329
Feature activation+3.11e+00

Pos logit contributions
 Rover+0.010
 thirteen+0.009
 then+0.009
 unison+0.009
 alas+0.009
Neg logit contributions
 brushes-0.011
ies-0.011
 designs-0.011
 work-0.010
working-0.010
Token teaching
Token ID7743
Feature activation+5.90e-01

Pos logit contributions
 Rover+0.126
 thirteen+0.123
 unison+0.118
 alas+0.118
rified+0.116
Neg logit contributions
 brushes-0.157
 work-0.156
 designs-0.155
ies-0.150
working-0.146
Token me
Token ID502
Feature activation-2.27e+00

Pos logit contributions
 Rover+0.017
 then+0.016
 thirteen+0.016
 alas+0.016
 unison+0.016
Neg logit contributions
 brushes-0.038
ies-0.037
 designs-0.037
 work-0.036
working-0.034
Token how
Token ID703
Feature activation-4.49e+00

Pos logit contributions
 brushes+0.122
 designs+0.119
ies+0.119
working+0.113
 work+0.112
Neg logit contributions
 Rover-0.090
 then-0.087
 thirteen-0.085
 alas-0.083
 unison-0.083
Token to
Token ID284
Feature activation-2.81e+00

Pos logit contributions
ies+0.253
 brushes+0.248
ini+0.246
packs+0.243
 designs+0.242
Neg logit contributions
 Please-0.163
 then-0.161
 Rover-0.153
 more-0.152
 louder-0.151
Token improve
Token ID2987
Feature activation+9.37e-01

Pos logit contributions
 brushes+0.124
ies+0.121
 designs+0.118
packs+0.118
ugs+0.116
Neg logit contributions
 then-0.146
 Rover-0.136
tons-0.131
 mercy-0.130
 unison-0.130
Token."
Token ID526
Feature activation+5.88e-01

Pos logit contributions
 Rover+0.040
 thirteen+0.038
 then+0.038
 unison+0.037
 alas+0.037
Neg logit contributions
 brushes-0.047
 designs-0.047
ies-0.046
 work-0.044
working-0.043
Token<|endoftext|>
Token ID50256
Feature activation+9.83e-02

Pos logit contributions
 Rover+0.022
 then+0.021
 unison+0.020
 alas+0.020
 thirteen+0.020
Neg logit contributions
 brushes-0.034
 designs-0.033
ies-0.032
working-0.031
ini-0.031
TokenThere
Token ID1858
Feature activation-1.49e+00

Pos logit contributions
 Rover+0.004
 then+0.004
 thirteen+0.004
 alas+0.004
 unison+0.004
Neg logit contributions
 brushes-0.005
 designs-0.005
ies-0.005
working-0.005
ini-0.005
Token his
Token ID465
Feature activation-1.29e-01

Pos logit contributions
 Rover+0.032
 then+0.031
 thirteen+0.030
 alas+0.029
 unison+0.029
Neg logit contributions
 brushes-0.045
 designs-0.044
ies-0.044
working-0.042
ini-0.042
Token friends
Token ID2460
Feature activation-7.52e-01

Pos logit contributions
 brushes+0.006
ies+0.005
ini+0.005
ugs+0.005
 designs+0.005
Neg logit contributions
 Rover-0.007
 then-0.006
 mercy-0.006
 alas-0.006
 unison-0.006
Token,
Token ID11
Feature activation+3.73e-01

Pos logit contributions
 brushes+0.031
 designs+0.031
ies+0.030
working+0.029
 work+0.029
Neg logit contributions
 Rover-0.038
 thirteen-0.037
 mercy-0.037
Mr-0.036
 unison-0.036
Token and
Token ID290
Feature activation+5.01e-02

Pos logit contributions
 Rover+0.014
 then+0.014
 thirteen+0.014
 unison+0.013
 alas+0.013
Neg logit contributions
 brushes-0.020
 designs-0.020
ies-0.020
working-0.019
 work-0.019
Token even
Token ID772
Feature activation+1.00e+00

Pos logit contributions
 thirteen+0.002
 Rover+0.002
 unison+0.002
Mr+0.002
tons+0.002
Neg logit contributions
 brushes-0.003
 designs-0.003
ies-0.003
 work-0.003
working-0.003
Token how
Token ID703
Feature activation-5.06e+00

Pos logit contributions
 then+0.039
 Rover+0.039
 thirteen+0.035
 louder+0.035
 alas+0.035
Neg logit contributions
 brushes-0.056
ies-0.053
 designs-0.053
ini-0.053
ugs-0.052
Token to
Token ID284
Feature activation-3.02e+00

Pos logit contributions
 brushes+0.308
ugs+0.302
ini+0.295
 designs+0.289
bows+0.288
Neg logit contributions
 Rover-0.166
 then-0.162
 louder-0.157
 mercy-0.152
 more-0.151
Token be
Token ID307
Feature activation-7.06e-01

Pos logit contributions
 brushes+0.124
 designs+0.119
ies+0.118
ini+0.111
working+0.110
Neg logit contributions
 Rover-0.158
 then-0.158
 thirteen-0.151
 alas-0.149
 unison-0.149
Token depend
Token ID4745
Feature activation-1.48e+00

Pos logit contributions
 brushes+0.039
ugs+0.037
ini+0.037
ies+0.036
 designs+0.036
Neg logit contributions
 then-0.028
 more-0.028
 Rover-0.027
 louder-0.027
 less-0.026
Tokenable
Token ID540
Feature activation+4.58e-01

Pos logit contributions
 brushes+0.051
ies+0.050
 designs+0.049
working+0.047
 work+0.045
Neg logit contributions
 Rover-0.087
 thirteen-0.085
 then-0.085
 unison-0.083
 alas-0.082
Token.
Token ID13
Feature activation-3.06e-01

Pos logit contributions
 Rover+0.022
 thirteen+0.022
Mr+0.021
 unison+0.021
 alas+0.021
Neg logit contributions
 designs-0.020
ies-0.020
 brushes-0.020
 work-0.019
working-0.019
Token find
Token ID1064
Feature activation-1.18e+00

Pos logit contributions
 brushes+0.008
 designs+0.008
ies+0.008
ini+0.007
bel+0.007
Neg logit contributions
 then-0.010
 Rover-0.009
 thirteen-0.009
 alas-0.009
 unison-0.009
Token her
Token ID607
Feature activation-1.22e+00

Pos logit contributions
 brushes+0.067
 designs+0.065
ies+0.064
working+0.062
 work+0.061
Neg logit contributions
 Rover-0.045
 then-0.043
 thirteen-0.042
 unison-0.041
 alas-0.041
Token favorite
Token ID4004
Feature activation+5.22e-01

Pos logit contributions
rolled+0.045
ugs+0.045
ini+0.045
bel+0.044
ies+0.044
Neg logit contributions
 then-0.069
 no-0.065
 more-0.064
 end-0.062
 thirteen-0.059
Token red
Token ID2266
Feature activation-5.95e-01

Pos logit contributions
 then+0.029
 more+0.027
 end+0.026
 alas+0.026
 Rover+0.026
Neg logit contributions
 brushes-0.019
ies-0.019
bel-0.019
working-0.019
rolled-0.018
Token cr
Token ID1067
Feature activation+7.81e-01

Pos logit contributions
 brushes+0.023
 designs+0.023
ies+0.023
ini+0.022
working+0.022
Neg logit contributions
 then-0.031
 Rover-0.030
 thirteen-0.029
 alas-0.029
 unison-0.029
Tokenay
Token ID323
Feature activation-4.49e+00

Pos logit contributions
 then+0.039
 thirteen+0.038
 unison+0.038
Mr+0.037
 Rover+0.037
Neg logit contributions
 brushes-0.031
 designs-0.030
ies-0.030
working-0.030
 work-0.029
Tokenon
Token ID261
Feature activation-2.94e-01

Pos logit contributions
 brushes+0.258
bel+0.255
ies+0.251
 designs+0.251
rolled+0.248
Neg logit contributions
 then-0.158
 course-0.140
 no-0.136
 more-0.136
 end-0.131
Token.
Token ID13
Feature activation+1.52e-01

Pos logit contributions
 brushes+0.017
 designs+0.016
ies+0.016
rolled+0.016
ini+0.016
Neg logit contributions
 then-0.011
 Rover-0.010
 no-0.009
 alas-0.009
 unfortunately-0.009
Token
Token ID220
Feature activation-2.97e+00

Pos logit contributions
 Rover+0.005
 then+0.005
 alas+0.005
 thirteen+0.005
 unison+0.005
Neg logit contributions
 brushes-0.009
 designs-0.009
ies-0.009
working-0.009
ini-0.008
Token
Token ID198
Feature activation+6.47e-01

Pos logit contributions
 brushes+0.197
 designs+0.197
working+0.192
ies+0.192
 work+0.185
Neg logit contributions
 Rover-0.079
 then-0.078
Mr-0.070
 alas-0.069
 unison-0.066
Token
Token ID198
Feature activation+1.15e+00

Pos logit contributions
 Rover+0.023
 then+0.023
 thirteen+0.022
 alas+0.021
 unison+0.021
Neg logit contributions
 brushes-0.037
ies-0.036
 designs-0.036
working-0.034
ini-0.034