TOP ACTIVATIONS
MAX = 11.613

don 't get a sun burn ." Lily yawn ed and
want Lily to get sun burn ed . L
she knew the word ' burn '. Her friends were so
him from getting a sun burn . Tim my
to put on his sun burn . Tim my thought the
face and said " y uck , it 's disgusting !"
. " Y uck !" Lily said . "
an idea . He sn uck into Chris 's room and
the button . Tim sn uck out of his room and
the medicine . " Y uck !" he said . But
, y uck , y uck !" they say . "
help get the car unst uck . The rescue
accidentally tasted it . Y uck ! It tasted very bad
a bite . " Y uck !" she said , "
all the fur was unst uck ! The squirrel
's mom came to t uck her in and said ,
is bitter , like y uck . You should wear something
It is bitter . Y uck !" Mom laughs
filthy . " Y uck !" Mama said . "
. " Y uck , what is that ?"

MIN ACTIVATIONS
MIN = -12.668

wanted to play with something else . " Tom
is something that no one else can give you . Sadly
I want to buy something else with the money ," said
ignoring the plane like everyone else and the plane was so
box so that no one else could get into it .
to grab it , someone else was already taking it .
to your room . Or else , no story time for
cups and look for something else to drink . They see
maybe they belong to someone else ," Ben said . "
finding something can make someone else happy . <|endoftext|> L ily
would look for it somewhere else . He left the room
They decided to play something else . They did not want
" Can I have something else ?" Her mom
wanted to make sure nobody else found this clean treasure !
that there was no one else near her .
give it away to someone else . They used the money
away but he noticed something else on the island . It
he decided to collect something else instead . He saw a
ignore danger signs , or else we might end up with
He wanted to try something else , so he decided to

INTERVAL 5.543 - 11.613
CONTAINS 0.341%

so frustrated , he y anked the strong sun flower out
snow angels . M ama said , " Let 's
closer . M ama smiled . " He 's
she learned all about the soup kitchen . She saw her
went to bed with a belly full of cake and a

INTERVAL -0.527 - 5.543
CONTAINS 71.408%

m ister . We like to play with hoops , too
like they were giving you a special birthday present ."
stepped closer it felt a pinch . It was a nasty
hole together and looked inside . " Wow ,
crunch in his hand . He was so happy that he

INTERVAL -6.598 - -0.527
CONTAINS 28.047%

Dad say . " You are not behaving ! We are
never leave his toy computer outside again . And they all
, triumphant . Amy looked back and smiled
so helpful . From that day on ,
it . " Oh , Lily , you have a very

INTERVAL -12.668 - -6.598
CONTAINS 0.203%

big book with pictures of volcan oes . Vol can oes
The animals crowded around and gasped in amaz ement . Where
, " My name is Kate ." The triangle
They like to play with blocks . They have many blocks
pull and tug at the blocks . They made loud noises
Token don
Token ID836
Feature activation+5.62e-01

Pos logit contributions
uck+0.004
 Cow+0.004
oo+0.004
 Wolf+0.004
burn+0.004
Neg logit contributions
ents-0.004
 dice-0.004
 else-0.004
 contain-0.004
 clearly-0.004
Token't
Token ID470
Feature activation+7.35e-01

Pos logit contributions
 Cow+0.043
uck+0.043
 Wolf+0.041
eled+0.041
ivered+0.039
Neg logit contributions
ents-0.046
 else-0.046
 dice-0.044
 contain-0.043
 clearly-0.041
Token get
Token ID651
Feature activation-2.45e-01

Pos logit contributions
uck+0.068
 Cow+0.065
oo+0.064
 Wolf+0.064
ivered+0.061
Neg logit contributions
 else-0.052
ents-0.052
 dice-0.050
 contain-0.049
 clearly-0.045
Token a
Token ID257
Feature activation+7.26e-01

Pos logit contributions
 else+0.022
ents+0.021
 dice+0.020
 contain+0.019
 clearly+0.019
Neg logit contributions
uck-0.019
 Cow-0.018
oo-0.018
 Wolf-0.018
ivered-0.017
Token sun
Token ID4252
Feature activation+9.73e-01

Pos logit contributions
uck+0.050
 Cow+0.047
oo+0.047
 Wolf+0.046
ivered+0.044
Neg logit contributions
 else-0.069
 dice-0.067
ents-0.067
 contain-0.064
 clearly-0.060
Tokenburn
Token ID10899
Feature activation+1.16e+01

Pos logit contributions
uck+0.027
 Cow+0.025
 Wolf+0.023
oo+0.022
ivered+0.020
Neg logit contributions
ents-0.129
 else-0.128
 dice-0.126
 contain-0.124
 clearly-0.119
Token."
Token ID526
Feature activation+9.47e-01

Pos logit contributions
uck+0.888
 Cow+0.868
 Wolf+0.838
oo+0.835
P+0.833
Neg logit contributions
ents-0.957
 else-0.955
 dice-0.934
 contain-0.897
 clearly-0.827
Token Lily
Token ID20037
Feature activation-9.29e-01

Pos logit contributions
uck+0.080
oo+0.078
 Cow+0.078
 Wolf+0.078
burn+0.075
Neg logit contributions
 else-0.073
ents-0.073
 dice-0.071
 contain-0.065
 clearly-0.063
Token yawn
Token ID40908
Feature activation+4.76e+00

Pos logit contributions
 else+0.074
ents+0.072
 dice+0.069
 contain+0.066
 clearly+0.064
Neg logit contributions
uck-0.077
 Cow-0.075
 Wolf-0.074
oo-0.073
burn-0.071
Tokened
Token ID276
Feature activation+3.55e+00

Pos logit contributions
 Cow+0.241
uck+0.236
 Wolf+0.227
P+0.221
oo+0.221
Neg logit contributions
 else-0.521
ents-0.515
 dice-0.486
 clearly-0.483
 contain-0.477
Token and
Token ID290
Feature activation-6.73e-02

Pos logit contributions
uck+0.340
oo+0.326
 Cow+0.322
P+0.321
 Wolf+0.314
Neg logit contributions
ents-0.238
 else-0.237
 clearly-0.222
 contain-0.214
 dice-0.214
Token want
Token ID765
Feature activation-1.61e+00

Pos logit contributions
uck+0.070
 Cow+0.068
oo+0.067
 Wolf+0.066
burn+0.064
Neg logit contributions
ents-0.051
 else-0.051
 dice-0.051
 contain-0.046
 clearly-0.044
Token Lily
Token ID20037
Feature activation-9.32e-01

Pos logit contributions
ents+0.146
 dice+0.146
 else+0.144
 contain+0.139
 clearly+0.129
Neg logit contributions
uck-0.115
 Cow-0.111
 Wolf-0.108
oo-0.107
burn-0.101
Token to
Token ID284
Feature activation-6.64e-02

Pos logit contributions
 dice+0.086
ents+0.085
 else+0.082
 contain+0.079
 clearly+0.076
Neg logit contributions
uck-0.068
 Cow-0.064
oo-0.063
burn-0.062
 Wolf-0.062
Token get
Token ID651
Feature activation-2.46e-01

Pos logit contributions
 else+0.005
ents+0.005
 contain+0.005
 dice+0.004
 clearly+0.004
Neg logit contributions
uck-0.006
 Cow-0.006
 Wolf-0.006
P-0.006
oo-0.006
Token sun
Token ID4252
Feature activation+9.86e-01

Pos logit contributions
 else+0.022
ents+0.021
 dice+0.020
 contain+0.019
 clearly+0.018
Neg logit contributions
uck-0.019
ivered-0.018
 Cow-0.018
oo-0.018
 Wolf-0.018
Tokenburn
Token ID10899
Feature activation+1.16e+01

Pos logit contributions
uck+0.027
 Cow+0.024
 Wolf+0.023
oo+0.022
P+0.021
Neg logit contributions
ents-0.131
 else-0.129
 dice-0.126
 contain-0.124
 clearly-0.121
Tokened
Token ID276
Feature activation+3.55e+00

Pos logit contributions
 Amen+0.828
 Cow+0.811
P+0.791
 Wolf+0.782
uck+0.779
Neg logit contributions
ents-1.038
 else-1.020
 dice-0.950
ows-0.928
cases-0.914
Token.
Token ID13
Feature activation-1.15e-02

Pos logit contributions
uck+0.273
 Cow+0.261
oo+0.256
 Wolf+0.253
burn+0.242
Neg logit contributions
 else-0.303
ents-0.302
 dice-0.299
 contain-0.283
 clearly-0.267
Token
Token ID198
Feature activation-4.69e-01

Pos logit contributions
 else+0.001
ents+0.001
 dice+0.001
 clearly+0.001
 contain+0.001
Neg logit contributions
uck-0.001
burn-0.001
oo-0.001
 Cow-0.001
 Wolf-0.001
Token
Token ID198
Feature activation-4.64e-01

Pos logit contributions
ents+0.049
 dice+0.049
 else+0.048
 contain+0.047
 clearly+0.044
Neg logit contributions
uck-0.028
 Cow-0.026
oo-0.026
 Wolf-0.025
P-0.023
TokenL
Token ID43
Feature activation+2.13e-01

Pos logit contributions
 else+0.043
ents+0.041
 dice+0.041
 contain+0.038
 clearly+0.037
Neg logit contributions
uck-0.034
 Cow-0.033
 Wolf-0.033
oo-0.031
burn-0.031
Token she
Token ID673
Feature activation-1.76e+00

Pos logit contributions
ents+0.260
 dice+0.259
 else+0.258
 contain+0.244
 clearly+0.227
Neg logit contributions
uck-0.269
 Cow-0.256
oo-0.251
 Wolf-0.248
P-0.236
Token knew
Token ID2993
Feature activation-8.48e-02

Pos logit contributions
ents+0.162
 dice+0.162
 else+0.157
 contain+0.154
ooters+0.141
Neg logit contributions
uck-0.125
 Cow-0.118
 Wolf-0.115
oo-0.115
P-0.107
Token the
Token ID262
Feature activation-1.94e-01

Pos logit contributions
 dice+0.008
ents+0.008
 contain+0.007
 else+0.007
ooters+0.007
Neg logit contributions
 Cow-0.006
uck-0.006
 Wolf-0.006
oo-0.005
 wolf-0.005
Token word
Token ID1573
Feature activation-1.04e+00

Pos logit contributions
 else+0.018
 dice+0.017
ents+0.017
 contain+0.016
 clearly+0.016
Neg logit contributions
uck-0.014
oo-0.013
 Cow-0.013
ivered-0.013
burn-0.013
Token '
Token ID705
Feature activation-5.44e+00

Pos logit contributions
ents+0.096
 dice+0.094
 contain+0.091
 else+0.088
ooters+0.082
Neg logit contributions
uck-0.073
 Cow-0.070
 Wolf-0.070
oo-0.067
 wolf-0.063
Tokenburn
Token ID10899
Feature activation+1.16e+01

Pos logit contributions
 else+0.509
ents+0.499
 dice+0.491
 contain+0.470
 clearly+0.448
Neg logit contributions
uck-0.371
 Cow-0.359
 Wolf-0.347
oo-0.343
ivered-0.326
Token'.
Token ID4458
Feature activation+2.45e+00

Pos logit contributions
uck+0.843
 Cow+0.841
 Wolf+0.813
oo+0.811
P+0.809
Neg logit contributions
 else-0.979
ents-0.951
 dice-0.943
 contain-0.897
 wins-0.842
Token Her
Token ID2332
Feature activation-9.03e-01

Pos logit contributions
uck+0.154
 Cow+0.145
oo+0.142
 Wolf+0.140
burn+0.134
Neg logit contributions
 else-0.245
ents-0.244
 dice-0.241
 contain-0.230
 clearly-0.220
Token friends
Token ID2460
Feature activation+1.63e+00

Pos logit contributions
 else+0.086
ents+0.083
 dice+0.081
 contain+0.076
 clearly+0.076
Neg logit contributions
uck-0.063
 Cow-0.058
P-0.057
oo-0.057
 Wolf-0.056
Token were
Token ID547
Feature activation-2.71e-01

Pos logit contributions
uck+0.138
 Cow+0.133
oo+0.130
 Wolf+0.129
burn+0.124
Neg logit contributions
 else-0.127
ents-0.126
 dice-0.125
 contain-0.118
 clearly-0.111
Token so
Token ID523
Feature activation-1.24e+00

Pos logit contributions
 dice+0.023
ents+0.022
 else+0.021
 contain+0.021
ooters+0.019
Neg logit contributions
uck-0.022
 Cow-0.021
oo-0.021
 Wolf-0.020
ivered-0.019
Token him
Token ID683
Feature activation+2.65e+00

Pos logit contributions
uck+0.242
ivered+0.231
P+0.227
 Cow+0.225
 Amen+0.224
Neg logit contributions
 else-0.338
ents-0.331
 clearly-0.306
 dice-0.300
 contain-0.291
Token from
Token ID422
Feature activation-2.72e+00

Pos logit contributions
uck+0.209
 Cow+0.198
oo+0.198
burn+0.194
 Wolf+0.190
Neg logit contributions
 dice-0.227
ents-0.217
 else-0.214
 contain-0.196
 clearly-0.190
Token getting
Token ID1972
Feature activation+1.94e+00

Pos logit contributions
 else+0.216
ents+0.209
 dice+0.196
 clearly+0.194
 contain+0.187
Neg logit contributions
uck-0.230
ivered-0.214
P-0.210
 Cow-0.207
 Wolf-0.203
Token a
Token ID257
Feature activation+7.15e-01

Pos logit contributions
uck+0.160
ivered+0.155
oo+0.153
 Cow+0.150
over+0.150
Neg logit contributions
 else-0.161
ents-0.153
 clearly-0.136
 contain-0.135
 dice-0.132
Token sun
Token ID4252
Feature activation+9.70e-01

Pos logit contributions
uck+0.057
 Cow+0.054
oo+0.053
ivered+0.052
 Wolf+0.052
Neg logit contributions
 else-0.061
 dice-0.058
ents-0.058
 contain-0.055
 clearly-0.052
Tokenburn
Token ID10899
Feature activation+1.16e+01

Pos logit contributions
uck+0.043
 Cow+0.041
 Wolf+0.040
oo+0.038
P+0.037
Neg logit contributions
ents-0.112
 else-0.112
 dice-0.107
 contain-0.107
 clearly-0.103
Token.
Token ID13
Feature activation-2.95e-02

Pos logit contributions
uck+0.942
 Cow+0.906
oo+0.884
 Wolf+0.877
P+0.848
Neg logit contributions
 else-0.938
ents-0.931
 dice-0.909
 contain-0.871
 clearly-0.814
Token
Token ID198
Feature activation-4.79e-01

Pos logit contributions
 else+0.002
ents+0.002
 dice+0.002
 clearly+0.002
 contain+0.002
Neg logit contributions
uck-0.002
oo-0.002
burn-0.002
ivered-0.002
 Cow-0.002
Token
Token ID198
Feature activation-4.75e-01

Pos logit contributions
 else+0.047
ents+0.046
 dice+0.046
 contain+0.044
 clearly+0.042
Neg logit contributions
uck-0.032
 Cow-0.030
oo-0.029
 Wolf-0.029
burn-0.027
TokenTim
Token ID14967
Feature activation-7.45e-01

Pos logit contributions
 else+0.040
ents+0.039
 dice+0.038
 contain+0.035
 clearly+0.034
Neg logit contributions
uck-0.039
 Cow-0.037
 Wolf-0.037
oo-0.036
burn-0.035
Tokenmy
Token ID1820
Feature activation+9.40e-01

Pos logit contributions
ents+0.057
 else+0.055
 dice+0.051
 contain+0.048
 clearly+0.047
Neg logit contributions
uck-0.063
 Cow-0.062
 Wolf-0.062
oo-0.061
 Amen-0.059
Token to
Token ID284
Feature activation-3.07e-01

Pos logit contributions
uck+0.174
 Cow+0.160
oo+0.158
 Wolf+0.152
burn+0.150
Neg logit contributions
 else-0.197
ents-0.196
 dice-0.194
 contain-0.187
 clearly-0.179
Token put
Token ID1234
Feature activation-4.86e-01

Pos logit contributions
 else+0.023
 dice+0.022
ents+0.021
 contain+0.021
 clearly+0.018
Neg logit contributions
uck-0.029
 Wolf-0.027
 Cow-0.027
ivered-0.027
oo-0.026
Token on
Token ID319
Feature activation+2.59e+00

Pos logit contributions
 else+0.041
 dice+0.040
ents+0.039
 clearly+0.034
 contain+0.034
Neg logit contributions
ivered-0.038
 Cow-0.037
 Wolf-0.037
uck-0.036
oo-0.036
Token his
Token ID465
Feature activation+1.10e+00

Pos logit contributions
ivered+0.198
uck+0.198
 Cow+0.190
 Wolf+0.186
oo+0.183
Neg logit contributions
 dice-0.222
 else-0.219
ents-0.214
 clearly-0.195
 contain-0.190
Token sun
Token ID4252
Feature activation+7.13e-01

Pos logit contributions
uck+0.085
 Cow+0.081
oo+0.079
 Wolf+0.079
burn+0.075
Neg logit contributions
 else-0.094
 dice-0.093
ents-0.092
 contain-0.087
 clearly-0.082
Tokenburn
Token ID10899
Feature activation+1.13e+01

Pos logit contributions
uck+0.050
 Cow+0.048
 Wolf+0.048
oo+0.046
ivered+0.045
Neg logit contributions
ents-0.064
 else-0.064
 dice-0.063
 contain-0.060
 clearly-0.056
Token.
Token ID13
Feature activation-2.88e-01

Pos logit contributions
uck+0.971
 Cow+0.929
oo+0.921
 Wolf+0.916
ivered+0.889
Neg logit contributions
ents-0.854
 else-0.848
 dice-0.830
 contain-0.779
 clearly-0.727
Token Tim
Token ID5045
Feature activation-6.16e-01

Pos logit contributions
ents+0.024
 else+0.024
 dice+0.024
 contain+0.022
 clearly+0.021
Neg logit contributions
uck-0.022
ivered-0.021
oo-0.021
 Cow-0.021
 Wolf-0.020
Tokenmy
Token ID1820
Feature activation+6.74e-01

Pos logit contributions
ents+0.042
 else+0.041
 dice+0.038
 clearly+0.033
 counts+0.033
Neg logit contributions
 Cow-0.056
 Wolf-0.055
uck-0.055
over-0.054
oo-0.054
Token thought
Token ID1807
Feature activation+3.38e-01

Pos logit contributions
uck+0.049
 Cow+0.047
oo+0.046
 Wolf+0.046
burn+0.044
Neg logit contributions
 else-0.062
ents-0.060
 dice-0.059
 contain-0.056
 clearly-0.054
Token the
Token ID262
Feature activation-4.71e-01

Pos logit contributions
uck+0.018
oo+0.018
 Cow+0.017
 Wolf+0.016
 wolf+0.015
Neg logit contributions
 dice-0.036
 contain-0.036
ents-0.036
 else-0.034
ows-0.033
Token face
Token ID1986
Feature activation+5.40e+00

Pos logit contributions
 Amen+0.116
P+0.115
 Cow+0.114
 Wolf+0.113
burn+0.110
Neg logit contributions
 else-0.106
 contain-0.106
ents-0.100
 poses-0.096
 clearly-0.095
Token and
Token ID290
Feature activation-7.76e-02

Pos logit contributions
uck+0.438
 Cow+0.421
 Wolf+0.412
oo+0.411
P+0.393
Neg logit contributions
 else-0.445
ents-0.441
 dice-0.427
 contain-0.410
 clearly-0.391
Token said
Token ID531
Feature activation-9.57e-01

Pos logit contributions
 else+0.007
ents+0.006
 dice+0.006
 contain+0.006
 clearly+0.006
Neg logit contributions
uck-0.007
oo-0.006
 Wolf-0.006
 Cow-0.006
burn-0.006
Token "
Token ID366
Feature activation-6.21e-02

Pos logit contributions
 else+0.079
ents+0.069
 dice+0.063
 clearly+0.063
 contain+0.061
Neg logit contributions
burn-0.082
ivered-0.082
eled-0.081
uck-0.081
 Wolf-0.078
Tokeny
Token ID88
Feature activation+3.45e+00

Pos logit contributions
 else+0.007
ents+0.006
 dice+0.006
 contain+0.006
 clearly+0.006
Neg logit contributions
 Cow-0.003
 Wolf-0.003
uck-0.003
 Amen-0.003
 wolf-0.003
Tokenuck
Token ID1347
Feature activation+1.08e+01

Pos logit contributions
 Amen+0.293
eled+0.291
 Wolf+0.278
P+0.277
 Cow+0.273
Neg logit contributions
 else-0.245
ents-0.243
 dice-0.230
ows-0.217
 wins-0.214
Token,
Token ID11
Feature activation+6.63e-01

Pos logit contributions
 Cow+0.858
 Wolf+0.839
uck+0.839
 Amen+0.833
burn+0.827
Neg logit contributions
 else-0.894
ents-0.844
 dice-0.820
 contain-0.751
 clearly-0.740
Token it
Token ID340
Feature activation+1.49e+00

Pos logit contributions
uck+0.054
ivered+0.052
oo+0.052
 Cow+0.051
 Wolf+0.051
Neg logit contributions
 else-0.055
ents-0.053
 dice-0.051
 contain-0.048
 clearly-0.046
Token's
Token ID338
Feature activation-3.04e+00

Pos logit contributions
 Cow+0.153
oo+0.152
eled+0.150
uck+0.149
over+0.149
Neg logit contributions
 else-0.087
ents-0.079
 dice-0.074
 clearly-0.066
 wins-0.064
Token disgusting
Token ID23374
Feature activation+4.75e-01

Pos logit contributions
 else+0.224
ents+0.217
 dice+0.199
 clearly+0.188
 contain+0.184
Neg logit contributions
uck-0.269
eled-0.258
ivered-0.255
 Wolf-0.255
 Amen-0.254
Token!"
Token ID2474
Feature activation+4.92e-01

Pos logit contributions
 Amen+0.045
 Cow+0.043
eled+0.043
 Wolf+0.042
P+0.042
Neg logit contributions
 else-0.030
ents-0.027
 dice-0.027
 wins-0.026
ows-0.025
Token.
Token ID13
Feature activation-1.41e-02

Pos logit contributions
uck+0.032
 Cow+0.030
oo+0.030
 Wolf+0.029
burn+0.028
Neg logit contributions
 else-0.035
ents-0.034
 dice-0.034
 contain-0.032
 clearly-0.030
Token
Token ID198
Feature activation-4.74e-01

Pos logit contributions
 else+0.001
ents+0.001
 dice+0.001
 contain+0.001
 clearly+0.001
Neg logit contributions
uck-0.001
 Cow-0.001
oo-0.001
 Wolf-0.001
burn-0.001
Token
Token ID198
Feature activation-4.70e-01

Pos logit contributions
 dice+0.052
ents+0.051
 contain+0.051
 else+0.050
 clearly+0.046
Neg logit contributions
uck-0.026
 Cow-0.024
oo-0.024
 Wolf-0.023
P-0.021
Token"
Token ID1
Feature activation+6.48e-01

Pos logit contributions
 else+0.049
ents+0.049
 dice+0.048
 contain+0.047
 clearly+0.044
Neg logit contributions
uck-0.028
 Cow-0.026
oo-0.025
 Wolf-0.025
burn-0.023
TokenY
Token ID56
Feature activation+2.42e+00

Pos logit contributions
uck+0.040
oo+0.037
 Cow+0.036
 Wolf+0.035
ivered+0.034
Neg logit contributions
 else-0.069
 dice-0.067
ents-0.064
 contain-0.062
 clearly-0.060
Tokenuck
Token ID1347
Feature activation+1.08e+01

Pos logit contributions
 Cow+0.083
 Wolf+0.081
 Sleep+0.073
 Amen+0.073
 wolf+0.069
Neg logit contributions
ents-0.316
 dice-0.298
 else-0.297
 contain-0.280
ows-0.278
Token!"
Token ID2474
Feature activation+4.83e-01

Pos logit contributions
uck+0.828
 Cow+0.801
 Wolf+0.782
oo+0.771
burn+0.756
Neg logit contributions
 else-0.920
ents-0.911
 dice-0.893
 contain-0.835
 clearly-0.797
Token Lily
Token ID20037
Feature activation-9.29e-01

Pos logit contributions
uck+0.039
 Cow+0.038
oo+0.037
 Wolf+0.036
P+0.034
Neg logit contributions
ents-0.039
 else-0.038
 dice-0.038
 contain-0.037
 clearly-0.035
Token said
Token ID531
Feature activation-9.48e-01

Pos logit contributions
 dice+0.083
 contain+0.080
 else+0.077
ents+0.075
ooters+0.072
Neg logit contributions
uck-0.070
 Cow-0.066
oo-0.063
 Wolf-0.062
P-0.058
Token.
Token ID13
Feature activation-1.90e-02

Pos logit contributions
 else+0.081
 dice+0.081
ents+0.081
 contain+0.076
 clearly+0.072
Neg logit contributions
uck-0.074
 Cow-0.070
oo-0.069
 Wolf-0.068
burn-0.065
Token "
Token ID366
Feature activation-5.67e-02

Pos logit contributions
 else+0.002
ents+0.002
 dice+0.002
 contain+0.002
 clearly+0.002
Neg logit contributions
uck-0.001
 Cow-0.001
oo-0.001
 Wolf-0.001
burn-0.001
Token an
Token ID281
Feature activation+1.49e+00

Pos logit contributions
 else+0.115
ents+0.114
 dice+0.112
 contain+0.107
 clearly+0.102
Neg logit contributions
uck-0.072
 Cow-0.067
oo-0.067
 Wolf-0.065
burn-0.063
Token idea
Token ID2126
Feature activation+9.49e-02

Pos logit contributions
uck+0.093
oo+0.091
 Cow+0.088
 Wolf+0.085
ivered+0.081
Neg logit contributions
 else-0.150
ents-0.147
 dice-0.147
 contain-0.135
 clearly-0.131
Token.
Token ID13
Feature activation-1.98e-02

Pos logit contributions
uck+0.006
oo+0.006
 Wolf+0.005
 Cow+0.005
 wolf+0.005
Neg logit contributions
 dice-0.010
ents-0.010
 else-0.009
 contain-0.009
 clearly-0.008
Token He
Token ID679
Feature activation+9.90e-01

Pos logit contributions
ents+0.002
 dice+0.002
 else+0.002
 contain+0.002
 clearly+0.002
Neg logit contributions
uck-0.001
oo-0.001
 Cow-0.001
burn-0.001
ivered-0.001
Token sn
Token ID3013
Feature activation+2.63e+00

Pos logit contributions
uck+0.066
 Cow+0.063
 Wolf+0.061
oo+0.060
eled+0.057
Neg logit contributions
 dice-0.093
 else-0.091
ents-0.090
 contain-0.090
 clearly-0.083
Tokenuck
Token ID1347
Feature activation+1.08e+01

Pos logit contributions
 wolf+0.111
 Cow+0.103
 Wolf+0.103
 Amen+0.101
 chick+0.101
Neg logit contributions
ents-0.317
 dice-0.306
 else-0.306
ows-0.303
ooters-0.291
Token into
Token ID656
Feature activation-3.91e+00

Pos logit contributions
uck+0.864
 Cow+0.812
oo+0.803
 Wolf+0.791
burn+0.747
Neg logit contributions
ents-0.900
 else-0.893
 dice-0.876
 contain-0.835
 clearly-0.787
Token Chris
Token ID5180
Feature activation-4.01e+00

Pos logit contributions
 else+0.364
ents+0.362
 dice+0.359
 contain+0.341
 clearly+0.324
Neg logit contributions
uck-0.275
 Cow-0.260
oo-0.255
 Wolf-0.251
burn-0.240
Token's
Token ID338
Feature activation-3.04e+00

Pos logit contributions
 dice+0.335
 else+0.327
ents+0.326
 contain+0.314
 clearly+0.296
Neg logit contributions
uck-0.303
 Cow-0.291
 Wolf-0.288
oo-0.280
burn-0.264
Token room
Token ID2119
Feature activation-4.11e-01

Pos logit contributions
 else+0.270
ents+0.268
 dice+0.264
 contain+0.251
 clearly+0.237
Neg logit contributions
uck-0.225
oo-0.214
 Cow-0.214
 Wolf-0.205
ivered-0.199
Token and
Token ID290
Feature activation-6.73e-02

Pos logit contributions
 else+0.036
ents+0.036
 dice+0.036
 contain+0.034
 clearly+0.032
Neg logit contributions
uck-0.031
 Cow-0.029
oo-0.028
 Wolf-0.028
burn-0.026
Token the
Token ID262
Feature activation-1.95e-01

Pos logit contributions
ents+0.248
 else+0.244
 contain+0.244
 dice+0.242
ows+0.219
Neg logit contributions
uck-0.146
 Wolf-0.138
 Cow-0.135
oo-0.131
eled-0.123
Token button
Token ID4936
Feature activation-4.41e+00

Pos logit contributions
ents+0.018
 else+0.017
 contain+0.017
 dice+0.016
 clearly+0.015
Neg logit contributions
 Wolf-0.013
 wolf-0.013
uck-0.013
 Cow-0.012
 therm-0.012
Token.
Token ID13
Feature activation-2.07e-02

Pos logit contributions
ents+0.431
 dice+0.429
 contain+0.419
 else+0.406
ooters+0.369
Neg logit contributions
uck-0.265
oo-0.234
burn-0.227
 Wolf-0.223
 Cow-0.220
Token Tim
Token ID5045
Feature activation-3.47e-01

Pos logit contributions
 else+0.002
ents+0.002
 dice+0.002
 contain+0.002
 clearly+0.002
Neg logit contributions
uck-0.001
 Cow-0.001
oo-0.001
 Wolf-0.001
burn-0.001
Token sn
Token ID3013
Feature activation+2.63e+00

Pos logit contributions
 dice+0.033
 else+0.033
ents+0.032
 contain+0.032
 clearly+0.029
Neg logit contributions
uck-0.024
 Cow-0.022
 Wolf-0.022
oo-0.022
eled-0.021
Tokenuck
Token ID1347
Feature activation+1.08e+01

Pos logit contributions
 wolf+0.143
 Cow+0.136
 Wolf+0.134
 Amen+0.129
 chick+0.124
Neg logit contributions
 else-0.288
ents-0.285
ows-0.279
 dice-0.275
 contain-0.262
Token out
Token ID503
Feature activation-2.33e-01

Pos logit contributions
uck+0.839
oo+0.778
 Cow+0.773
 Wolf+0.764
eled+0.716
Neg logit contributions
ents-0.936
 else-0.909
 dice-0.901
 contain-0.861
 clearly-0.802
Token of
Token ID286
Feature activation-8.37e-01

Pos logit contributions
 else+0.020
 dice+0.020
ents+0.020
 contain+0.019
ooters+0.017
Neg logit contributions
uck-0.017
 Cow-0.017
oo-0.017
 Wolf-0.017
burn-0.015
Token his
Token ID465
Feature activation+1.34e+00

Pos logit contributions
 else+0.081
ents+0.081
 dice+0.080
 contain+0.076
 clearly+0.073
Neg logit contributions
uck-0.056
 Cow-0.053
oo-0.051
 Wolf-0.051
burn-0.048
Token room
Token ID2119
Feature activation-4.11e-01

Pos logit contributions
uck+0.078
 Wolf+0.077
 Cow+0.076
 fever+0.074
oo+0.072
Neg logit contributions
ents-0.138
 else-0.130
 dice-0.129
 contain-0.127
 clearly-0.123
Token and
Token ID290
Feature activation-7.27e-02

Pos logit contributions
 else+0.039
 dice+0.039
ents+0.038
 contain+0.036
ooters+0.034
Neg logit contributions
uck-0.027
 Cow-0.025
oo-0.025
 Wolf-0.024
burn-0.023
Token the
Token ID262
Feature activation-1.95e-01

Pos logit contributions
 else+0.248
ents+0.241
 dice+0.233
 contain+0.219
 clearly+0.217
Neg logit contributions
uck-0.228
oo-0.208
ivered-0.208
 Cow-0.205
P-0.204
Token medicine
Token ID9007
Feature activation+4.00e+00

Pos logit contributions
 else+0.017
 poses+0.016
 dice+0.015
 contain+0.015
 clearly+0.015
Neg logit contributions
eled-0.015
uck-0.014
oo-0.014
P-0.014
 Amen-0.014
Token.
Token ID13
Feature activation-2.07e-02

Pos logit contributions
uck+0.267
 Cow+0.257
 Wolf+0.255
oo+0.245
 wolf+0.236
Neg logit contributions
ents-0.381
 dice-0.373
 contain-0.354
 else-0.354
 clearly-0.332
Token "
Token ID366
Feature activation-6.13e-02

Pos logit contributions
ents+0.002
 else+0.002
 dice+0.002
 contain+0.002
 clearly+0.002
Neg logit contributions
uck-0.001
oo-0.001
 Cow-0.001
burn-0.001
 Wolf-0.001
TokenY
Token ID56
Feature activation+2.42e+00

Pos logit contributions
 else+0.006
ents+0.006
 dice+0.006
 clearly+0.005
 contain+0.005
Neg logit contributions
 Cow-0.004
uck-0.004
oo-0.004
 Wolf-0.004
 wolf-0.003
Tokenuck
Token ID1347
Feature activation+1.08e+01

Pos logit contributions
 Amen+0.081
 Cow+0.076
 wolf+0.068
 Wolf+0.068
 Sleep+0.065
Neg logit contributions
ents-0.329
 else-0.299
 dice-0.294
bars-0.284
ows-0.279
Token!"
Token ID2474
Feature activation+4.84e-01

Pos logit contributions
uck+0.894
 Cow+0.889
eled+0.862
 Wolf+0.861
burn+0.857
Neg logit contributions
 else-0.853
ents-0.829
 dice-0.800
 contain-0.724
 clearly-0.700
Token he
Token ID339
Feature activation-1.38e+00

Pos logit contributions
uck+0.043
 Cow+0.041
oo+0.041
 Wolf+0.040
burn+0.039
Neg logit contributions
 else-0.036
ents-0.036
 dice-0.035
 contain-0.033
 clearly-0.031
Token said
Token ID531
Feature activation-9.61e-01

Pos logit contributions
 else+0.097
ents+0.093
 dice+0.088
 contain+0.083
 clearly+0.079
Neg logit contributions
uck-0.131
 Cow-0.124
 Wolf-0.123
oo-0.123
burn-0.122
Token.
Token ID13
Feature activation-1.98e-02

Pos logit contributions
 else+0.076
ents+0.069
 contain+0.067
 dice+0.066
 clearly+0.065
Neg logit contributions
uck-0.084
P-0.081
 Cow-0.081
 Wolf-0.079
oo-0.078
Token But
Token ID887
Feature activation-6.46e-01

Pos logit contributions
 else+0.002
ents+0.002
 dice+0.002
 contain+0.002
 clearly+0.002
Neg logit contributions
uck-0.001
burn-0.001
oo-0.001
 Cow-0.001
ivered-0.001
Token,
Token ID11
Feature activation+6.62e-01

Pos logit contributions
uck+0.827
 Cow+0.805
 Wolf+0.788
oo+0.771
burn+0.766
Neg logit contributions
 else-0.916
ents-0.901
 dice-0.882
 contain-0.815
 clearly-0.784
Token y
Token ID331
Feature activation+1.58e+00

Pos logit contributions
uck+0.052
 Cow+0.050
oo+0.048
 Wolf+0.048
burn+0.046
Neg logit contributions
 else-0.056
ents-0.056
 dice-0.055
 contain-0.053
 clearly-0.050
Tokenuck
Token ID1347
Feature activation+1.08e+01

Pos logit contributions
 Amen+0.086
 Wolf+0.079
 Cow+0.077
 Sleep+0.077
 wolf+0.067
Neg logit contributions
ents-0.179
 else-0.167
 dice-0.165
 contain-0.152
hest-0.144
Token,
Token ID11
Feature activation+6.68e-01

Pos logit contributions
uck+0.848
 Cow+0.816
 Wolf+0.798
oo+0.794
burn+0.767
Neg logit contributions
 else-0.905
ents-0.894
 dice-0.880
 contain-0.824
 clearly-0.785
Token y
Token ID331
Feature activation+1.58e+00

Pos logit contributions
uck+0.050
 Cow+0.047
oo+0.047
 Wolf+0.046
ivered+0.045
Neg logit contributions
 else-0.060
ents-0.060
 dice-0.059
 contain-0.054
 clearly-0.052
Tokenuck
Token ID1347
Feature activation+1.08e+01

Pos logit contributions
 Amen+0.089
 Wolf+0.083
 Cow+0.081
 Sleep+0.081
 Pig+0.070
Neg logit contributions
ents-0.177
 else-0.166
 dice-0.162
 contain-0.151
hest-0.142
Token!"
Token ID2474
Feature activation+4.84e-01

Pos logit contributions
uck+0.877
 Cow+0.841
oo+0.824
 Wolf+0.820
burn+0.785
Neg logit contributions
 else-0.877
ents-0.871
 dice-0.858
 contain-0.809
 clearly-0.764
Token they
Token ID484
Feature activation-1.63e+00

Pos logit contributions
uck+0.047
 Cow+0.046
oo+0.045
 Wolf+0.044
burn+0.043
Neg logit contributions
 else-0.032
ents-0.032
 dice-0.031
 contain-0.029
 clearly-0.027
Token say
Token ID910
Feature activation-1.59e+00

Pos logit contributions
ents+0.103
 dice+0.103
 else+0.102
 contain+0.092
 clearly+0.088
Neg logit contributions
uck-0.159
 Cow-0.155
oo-0.151
 Wolf-0.149
burn-0.143
Token.
Token ID13
Feature activation-1.98e-02

Pos logit contributions
 else+0.112
ents+0.106
 dice+0.100
 contain+0.098
 clearly+0.094
Neg logit contributions
uck-0.146
 Cow-0.144
 Wolf-0.143
P-0.143
ivered-0.140
Token "
Token ID366
Feature activation-6.21e-02

Pos logit contributions
 else+0.001
ents+0.001
 dice+0.001
 clearly+0.001
 contain+0.001
Neg logit contributions
ivered-0.002
uck-0.002
burn-0.002
oo-0.002
eled-0.002
Token help
Token ID1037
Feature activation+1.13e+00

Pos logit contributions
ents+0.272
 dice+0.271
 else+0.261
 contain+0.240
ooters+0.228
Neg logit contributions
uck-0.260
 Cow-0.257
oo-0.254
 Wolf-0.250
burn-0.232
Token get
Token ID651
Feature activation-2.46e-01

Pos logit contributions
uck+0.087
 Cow+0.081
oo+0.080
 Wolf+0.079
ivered+0.078
Neg logit contributions
 else-0.098
ents-0.096
 dice-0.094
 contain-0.091
 clearly-0.087
Token the
Token ID262
Feature activation-1.80e-01

Pos logit contributions
 else+0.022
ents+0.021
 dice+0.020
 contain+0.019
 clearly+0.019
Neg logit contributions
uck-0.019
oo-0.018
 Cow-0.018
ivered-0.018
 Wolf-0.017
Token car
Token ID1097
Feature activation-1.42e+00

Pos logit contributions
 else+0.016
ents+0.016
 dice+0.016
 contain+0.015
 clearly+0.014
Neg logit contributions
uck-0.013
 Cow-0.012
oo-0.012
 Wolf-0.012
burn-0.011
Token unst
Token ID15014
Feature activation+1.15e+00

Pos logit contributions
 dice+0.139
 else+0.137
ents+0.135
 contain+0.128
 poses+0.123
Neg logit contributions
uck-0.091
burn-0.083
oo-0.082
 Cow-0.082
 Wolf-0.080
Tokenuck
Token ID1347
Feature activation+1.08e+01

Pos logit contributions
 Sleep+0.045
 Amen+0.041
 Cow+0.040
mare+0.038
 wolf+0.035
Neg logit contributions
ents-0.139
 dice-0.131
 else-0.122
ooters-0.122
 wins-0.120
Token.
Token ID13
Feature activation-1.90e-02

Pos logit contributions
uck+0.824
 Cow+0.784
oo+0.769
 Wolf+0.761
burn+0.724
Neg logit contributions
 else-0.927
ents-0.926
 dice-0.918
 contain-0.868
 clearly-0.817
Token
Token ID198
Feature activation-4.64e-01

Pos logit contributions
ents+0.002
 else+0.002
 dice+0.001
 contain+0.001
 clearly+0.001
Neg logit contributions
ivered-0.001
uck-0.001
oo-0.001
burn-0.001
eled-0.001
Token
Token ID198
Feature activation-4.65e-01

Pos logit contributions
 else+0.040
ents+0.038
 dice+0.036
 contain+0.035
 clearly+0.034
Neg logit contributions
uck-0.037
ivered-0.034
burn-0.034
 Cow-0.034
 Wolf-0.033
TokenThe
Token ID464
Feature activation+1.36e-02

Pos logit contributions
 else+0.040
ents+0.037
 contain+0.035
 dice+0.035
hest+0.033
Neg logit contributions
uck-0.036
 Cow-0.033
ivered-0.033
 Wolf-0.033
night-0.032
Token rescue
Token ID9992
Feature activation+1.64e+00

Pos logit contributions
uck+0.001
oo+0.001
 Cow+0.001
ivered+0.001
 Wolf+0.001
Neg logit contributions
 dice-0.001
 else-0.001
ents-0.001
 contain-0.001
 clearly-0.001
Token accidentally
Token ID14716
Feature activation+1.24e+00

Pos logit contributions
ents+0.007
 contain+0.007
 dice+0.007
 else+0.007
 clearly+0.006
Neg logit contributions
uck-0.005
 Cow-0.005
oo-0.005
 Wolf-0.004
P-0.004
Token tasted
Token ID29187
Feature activation+1.11e+00

Pos logit contributions
uck+0.104
 Cow+0.100
oo+0.098
 Wolf+0.098
ivered+0.095
Neg logit contributions
 else-0.100
ents-0.098
 dice-0.097
 contain-0.088
 clearly-0.085
Token it
Token ID340
Feature activation+1.50e+00

Pos logit contributions
uck+0.084
 Cow+0.084
oo+0.082
 Wolf+0.082
 Amen+0.080
Neg logit contributions
 else-0.099
ents-0.095
 dice-0.091
 clearly-0.086
 contain-0.085
Token.
Token ID13
Feature activation-2.26e-02

Pos logit contributions
uck+0.117
 Cow+0.110
oo+0.108
 Wolf+0.107
burn+0.103
Neg logit contributions
ents-0.128
 else-0.127
 dice-0.127
 contain-0.120
 clearly-0.112
Token Y
Token ID575
Feature activation+4.90e-01

Pos logit contributions
ents+0.002
 else+0.002
 dice+0.002
 clearly+0.002
 contain+0.002
Neg logit contributions
burn-0.002
oo-0.002
 Cow-0.002
ivered-0.002
 Amen-0.002
Tokenuck
Token ID1347
Feature activation+1.08e+01

Pos logit contributions
 Amen+0.017
 Cow+0.017
 wolf+0.016
 Wolf+0.016
 Sleep+0.012
Neg logit contributions
ents-0.064
 else-0.061
 dice-0.059
ows-0.056
 contain-0.056
Token!
Token ID0
Feature activation+6.92e-01

Pos logit contributions
uck+0.817
 Cow+0.809
 Wolf+0.790
oo+0.776
 Amen+0.767
Neg logit contributions
 else-0.943
ents-0.910
 dice-0.891
 contain-0.830
 clearly-0.808
Token It
Token ID632
Feature activation+1.46e+00

Pos logit contributions
burn+0.051
oo+0.051
ivered+0.051
 Cow+0.050
 Amen+0.050
Neg logit contributions
ents-0.061
 else-0.057
 dice-0.057
 contain-0.051
 clearly-0.050
Token tasted
Token ID29187
Feature activation+1.12e+00

Pos logit contributions
uck+0.101
 Cow+0.095
oo+0.093
 Wolf+0.092
burn+0.087
Neg logit contributions
ents-0.137
 else-0.137
 dice-0.136
 contain-0.130
 clearly-0.122
Token very
Token ID845
Feature activation-3.03e+00

Pos logit contributions
 Cow+0.104
 Wolf+0.103
 Amen+0.103
uck+0.102
burn+0.101
Neg logit contributions
 else-0.085
ents-0.074
 dice-0.070
 clearly-0.069
 contain-0.067
Token bad
Token ID2089
Feature activation+1.74e+00

Pos logit contributions
 else+0.265
ents+0.241
 dice+0.228
 clearly+0.221
 contain+0.218
Neg logit contributions
 Wolf-0.248
 Cow-0.244
burn-0.242
 Amen-0.236
uck-0.235
Token a
Token ID257
Feature activation+7.26e-01

Pos logit contributions
 Cow+0.152
uck+0.150
 Wolf+0.146
ivered+0.145
oo+0.145
Neg logit contributions
 else-0.193
ents-0.188
 dice-0.186
 contain-0.171
 clearly-0.169
Token bite
Token ID13197
Feature activation-4.25e-01

Pos logit contributions
uck+0.058
 Cow+0.054
oo+0.054
 Wolf+0.053
P+0.051
Neg logit contributions
ents-0.060
 dice-0.057
 else-0.056
 contain-0.054
 clearly-0.054
Token.
Token ID13
Feature activation-8.64e-03

Pos logit contributions
 dice+0.036
ents+0.036
 else+0.036
 contain+0.035
 clearly+0.032
Neg logit contributions
uck-0.033
 Cow-0.031
oo-0.031
 Wolf-0.030
burn-0.029
Token "
Token ID366
Feature activation-6.58e-02

Pos logit contributions
 else+0.001
ents+0.001
 dice+0.001
 contain+0.001
 clearly+0.001
Neg logit contributions
uck-0.001
 Cow-0.001
oo-0.001
 Wolf-0.001
burn-0.001
TokenY
Token ID56
Feature activation+2.42e+00

Pos logit contributions
 else+0.007
 dice+0.006
ents+0.006
 contain+0.006
 clearly+0.006
Neg logit contributions
 Cow-0.004
uck-0.004
oo-0.004
 Wolf-0.004
ivered-0.004
Tokenuck
Token ID1347
Feature activation+1.08e+01

Pos logit contributions
 Cow+0.080
 Wolf+0.077
 Amen+0.068
 wolf+0.065
 Sleep+0.065
Neg logit contributions
ents-0.324
 else-0.308
 dice-0.305
 contain-0.292
 clearly-0.277
Token!"
Token ID2474
Feature activation+4.87e-01

Pos logit contributions
 Cow+0.939
uck+0.934
 Wolf+0.918
oo+0.892
burn+0.892
Neg logit contributions
 else-0.803
ents-0.788
 dice-0.763
 contain-0.702
 clearly-0.663
Token she
Token ID673
Feature activation-1.76e+00

Pos logit contributions
uck+0.043
 Cow+0.041
oo+0.040
 Wolf+0.040
burn+0.039
Neg logit contributions
 else-0.037
ents-0.036
 dice-0.035
 contain-0.034
 clearly-0.032
Token said
Token ID531
Feature activation-9.52e-01

Pos logit contributions
 dice+0.137
ents+0.136
 else+0.135
 contain+0.129
 clearly+0.120
Neg logit contributions
uck-0.149
 Cow-0.143
oo-0.140
 Wolf-0.138
ivered-0.131
Token,
Token ID11
Feature activation+6.70e-01

Pos logit contributions
 else+0.075
ents+0.069
 contain+0.068
 dice+0.065
 clearly+0.064
Neg logit contributions
uck-0.081
 Cow-0.081
P-0.080
 Wolf-0.079
eled-0.078
Token "
Token ID366
Feature activation-6.57e-02

Pos logit contributions
uck+0.057
P+0.054
 Wolf+0.053
 Cow+0.053
oo+0.052
Neg logit contributions
ents-0.054
 else-0.053
 dice-0.049
 contain-0.048
 clearly-0.048
Token all
Token ID477
Feature activation+1.65e+00

Pos logit contributions
 else+0.362
ents+0.356
 dice+0.338
 clearly+0.327
 contain+0.321
Neg logit contributions
uck-0.449
ivered-0.421
P-0.409
oo-0.405
 Cow-0.401
Token the
Token ID262
Feature activation-1.88e-01

Pos logit contributions
uck+0.113
 Cow+0.107
 Wolf+0.107
oo+0.102
 wolf+0.099
Neg logit contributions
 dice-0.155
ents-0.153
 else-0.146
 contain-0.142
ows-0.135
Token fur
Token ID9230
Feature activation+5.32e+00

Pos logit contributions
ents+0.016
 else+0.016
 dice+0.016
 contain+0.015
 clearly+0.014
Neg logit contributions
uck-0.015
 Cow-0.014
 Wolf-0.014
oo-0.014
burn-0.013
Token was
Token ID373
Feature activation+3.50e-01

Pos logit contributions
uck+0.410
 Cow+0.392
oo+0.385
 Wolf+0.379
P+0.364
Neg logit contributions
 else-0.457
ents-0.450
 dice-0.442
 contain-0.421
 clearly-0.402
Token unst
Token ID15014
Feature activation+1.15e+00

Pos logit contributions
uck+0.027
 Cow+0.026
oo+0.026
 Wolf+0.025
burn+0.024
Neg logit contributions
 else-0.030
ents-0.030
 dice-0.029
 contain-0.028
 clearly-0.026
Tokenuck
Token ID1347
Feature activation+1.08e+01

Pos logit contributions
 Amen+0.056
 Sleep+0.055
 Cow+0.054
 wolf+0.053
mare+0.048
Neg logit contributions
ents-0.133
 dice-0.118
 else-0.113
ooters-0.112
 contain-0.111
Token!
Token ID0
Feature activation+6.95e-01

Pos logit contributions
uck+0.850
 Cow+0.814
oo+0.798
 Wolf+0.789
burn+0.756
Neg logit contributions
 else-0.902
ents-0.896
 dice-0.881
 contain-0.837
 clearly-0.793
Token
Token ID198
Feature activation-4.73e-01

Pos logit contributions
uck+0.048
oo+0.045
 Cow+0.045
burn+0.044
ivered+0.043
Neg logit contributions
ents-0.065
 else-0.065
 dice-0.063
 contain-0.060
 clearly-0.058
Token
Token ID198
Feature activation-4.74e-01

Pos logit contributions
ents+0.048
 else+0.048
 dice+0.048
 contain+0.045
 clearly+0.043
Neg logit contributions
uck-0.029
 Cow-0.027
oo-0.027
 Wolf-0.026
burn-0.024
TokenThe
Token ID464
Feature activation+6.66e-03

Pos logit contributions
 else+0.042
ents+0.041
 dice+0.041
 contain+0.039
 clearly+0.037
Neg logit contributions
uck-0.036
 Cow-0.034
oo-0.033
 Wolf-0.033
burn-0.032
Token squirrel
Token ID33039
Feature activation+5.50e-01

Pos logit contributions
uck+0.000
oo+0.000
ivered+0.000
burn+0.000
 Cow+0.000
Neg logit contributions
 else-0.001
ents-0.001
 dice-0.001
 contain-0.001
 clearly-0.001
Token's
Token ID338
Feature activation-3.04e+00

Pos logit contributions
 else+0.080
ents+0.074
 clearly+0.066
 counts+0.065
 dice+0.064
Neg logit contributions
uck-0.074
oo-0.073
 Cow-0.071
 Wolf-0.069
burn-0.067
Token mom
Token ID1995
Feature activation+3.43e-01

Pos logit contributions
 else+0.250
ents+0.228
 dice+0.221
 contain+0.214
 clearly+0.209
Neg logit contributions
uck-0.239
P-0.231
eled-0.230
ivered-0.229
 Cow-0.228
Token came
Token ID1625
Feature activation+2.16e+00

Pos logit contributions
uck+0.030
over+0.030
 Wolf+0.030
P+0.029
ivered+0.029
Neg logit contributions
 else-0.025
ents-0.025
 clearly-0.020
 dice-0.020
 contain-0.019
Token to
Token ID284
Feature activation-6.09e-02

Pos logit contributions
uck+0.187
 Cow+0.186
 Wolf+0.180
oo+0.179
ivered+0.177
Neg logit contributions
ents-0.159
 else-0.156
 dice-0.145
 contain-0.143
 clearly-0.138
Token t
Token ID256
Feature activation-1.70e+00

Pos logit contributions
 else+0.005
 contain+0.005
 dice+0.005
ents+0.005
 clearly+0.004
Neg logit contributions
uck-0.005
ivered-0.005
oo-0.005
 Wolf-0.005
 Cow-0.005
Tokenuck
Token ID1347
Feature activation+1.08e+01

Pos logit contributions
 else+0.198
ents+0.198
 dice+0.183
 contain+0.175
 clearly+0.173
Neg logit contributions
 Cow-0.084
 Wolf-0.082
 wolf-0.075
 Squirrel-0.071
 Pig-0.069
Token her
Token ID607
Feature activation+8.79e-01

Pos logit contributions
uck+0.736
ivered+0.733
 Cow+0.722
 Wolf+0.716
oo+0.709
Neg logit contributions
 else-1.010
ents-0.960
 dice-0.945
 contain-0.889
 clearly-0.882
Token in
Token ID287
Feature activation-5.15e-01

Pos logit contributions
uck+0.073
 Cow+0.070
oo+0.069
 Wolf+0.068
P+0.067
Neg logit contributions
 else-0.071
ents-0.070
 dice-0.067
 contain-0.065
 clearly-0.062
Token and
Token ID290
Feature activation-7.67e-02

Pos logit contributions
 else+0.036
 contain+0.034
 dice+0.033
ents+0.032
 clearly+0.032
Neg logit contributions
uck-0.049
ivered-0.048
oo-0.045
 Cow-0.043
eled-0.043
Token said
Token ID531
Feature activation-9.63e-01

Pos logit contributions
 else+0.006
ents+0.005
 dice+0.005
 contain+0.005
 clearly+0.005
Neg logit contributions
uck-0.007
oo-0.007
 Cow-0.006
 Wolf-0.006
ivered-0.006
Token,
Token ID11
Feature activation+6.53e-01

Pos logit contributions
 else+0.081
ents+0.071
 dice+0.066
 clearly+0.063
 contain+0.063
Neg logit contributions
eled-0.081
ivered-0.081
uck-0.080
burn-0.079
P-0.079
Token is
Token ID318
Feature activation-2.30e-01

Pos logit contributions
uck+0.114
 Cow+0.102
oo+0.102
 Wolf+0.101
burn+0.098
Neg logit contributions
 contain-0.132
ents-0.130
 dice-0.130
 else-0.124
 clearly-0.114
Token bitter
Token ID12922
Feature activation+3.39e+00

Pos logit contributions
 dice+0.020
ents+0.020
 else+0.020
 contain+0.020
 clearly+0.018
Neg logit contributions
uck-0.017
 Cow-0.017
oo-0.016
 Wolf-0.016
burn-0.015
Token,
Token ID11
Feature activation+6.62e-01

Pos logit contributions
uck+0.264
 Cow+0.247
oo+0.243
 Wolf+0.239
burn+0.231
Neg logit contributions
 dice-0.291
ents-0.289
 else-0.289
 contain-0.276
 clearly-0.256
Token like
Token ID588
Feature activation-2.83e+00

Pos logit contributions
uck+0.053
 Cow+0.052
oo+0.050
 Wolf+0.050
burn+0.048
Neg logit contributions
ents-0.053
 dice-0.053
 else-0.053
 contain-0.050
 clearly-0.046
Token y
Token ID331
Feature activation+1.58e+00

Pos logit contributions
 else+0.253
ents+0.252
 dice+0.249
 contain+0.236
 clearly+0.224
Neg logit contributions
uck-0.208
 Cow-0.198
oo-0.194
 Wolf-0.192
burn-0.183
Tokenuck
Token ID1347
Feature activation+1.08e+01

Pos logit contributions
 Amen+0.113
 Cow+0.108
 Wolf+0.107
 wolf+0.098
 Squirrel+0.097
Neg logit contributions
ents-0.148
 else-0.137
 dice-0.133
 contain-0.123
ows-0.119
Token.
Token ID13
Feature activation-1.95e-02

Pos logit contributions
uck+0.841
 Cow+0.803
oo+0.787
 Wolf+0.778
burn+0.742
Neg logit contributions
ents-0.904
 else-0.903
 dice-0.896
 contain-0.852
 clearly-0.800
Token You
Token ID921
Feature activation+2.16e+00

Pos logit contributions
ents+0.002
 else+0.002
 dice+0.002
 contain+0.002
 clearly+0.001
Neg logit contributions
uck-0.001
ivered-0.001
eled-0.001
burn-0.001
oo-0.001
Token should
Token ID815
Feature activation-6.14e-01

Pos logit contributions
uck+0.176
 Cow+0.171
oo+0.167
 Wolf+0.165
burn+0.156
Neg logit contributions
 dice-0.171
ents-0.169
 else-0.164
 contain-0.162
 clearly-0.152
Token wear
Token ID5806
Feature activation+3.66e-01

Pos logit contributions
 else+0.043
ents+0.041
 dice+0.040
 contain+0.039
 clearly+0.036
Neg logit contributions
uck-0.058
 Wolf-0.056
 Cow-0.055
ivered-0.055
eled-0.054
Token something
Token ID1223
Feature activation+3.34e+00

Pos logit contributions
uck+0.027
 Cow+0.025
 Wolf+0.025
oo+0.025
ivered+0.025
Neg logit contributions
 else-0.034
 dice-0.032
ents-0.032
 contain-0.030
 clearly-0.029
Token It
Token ID632
Feature activation+1.45e+00

Pos logit contributions
ents+0.002
 else+0.002
 dice+0.002
 clearly+0.002
 contain+0.002
Neg logit contributions
uck-0.002
ivered-0.002
 Cow-0.002
oo-0.002
 Wolf-0.002
Token is
Token ID318
Feature activation-2.30e-01

Pos logit contributions
uck+0.100
 Cow+0.089
oo+0.087
 Wolf+0.087
burn+0.081
Neg logit contributions
 contain-0.140
ents-0.140
 dice-0.137
 else-0.129
ooters-0.124
Token bitter
Token ID12922
Feature activation+3.39e+00

Pos logit contributions
ents+0.020
 else+0.020
 dice+0.020
 contain+0.019
 clearly+0.018
Neg logit contributions
uck-0.017
 Cow-0.017
oo-0.016
 Wolf-0.016
burn-0.015
Token.
Token ID13
Feature activation-1.73e-02

Pos logit contributions
uck+0.261
 Cow+0.245
oo+0.242
 Wolf+0.238
burn+0.230
Neg logit contributions
ents-0.292
 dice-0.291
 else-0.291
 contain-0.277
 clearly-0.257
Token Y
Token ID575
Feature activation+4.86e-01

Pos logit contributions
ents+0.002
 else+0.002
 dice+0.002
 clearly+0.001
 contain+0.001
Neg logit contributions
uck-0.001
ivered-0.001
burn-0.001
oo-0.001
 Cow-0.001
Tokenuck
Token ID1347
Feature activation+1.08e+01

Pos logit contributions
 wolf+0.016
 Cow+0.016
 Wolf+0.015
 Amen+0.014
eled+0.013
Neg logit contributions
ents-0.061
ows-0.060
 else-0.059
 dice-0.058
hest-0.055
Token!"
Token ID2474
Feature activation+4.86e-01

Pos logit contributions
uck+0.793
 Cow+0.752
oo+0.741
 Wolf+0.724
burn+0.690
Neg logit contributions
ents-0.954
 else-0.950
 dice-0.946
 contain-0.903
 clearly-0.848
Token
Token ID198
Feature activation-4.61e-01

Pos logit contributions
uck+0.035
 Cow+0.033
oo+0.033
 Wolf+0.032
burn+0.032
Neg logit contributions
 else-0.045
ents-0.044
 dice-0.043
 contain-0.041
 clearly-0.040
Token
Token ID198
Feature activation-4.71e-01

Pos logit contributions
 else+0.043
ents+0.042
 dice+0.041
 contain+0.039
 clearly+0.038
Neg logit contributions
uck-0.033
 Cow-0.031
 Wolf-0.030
oo-0.030
burn-0.030
TokenMom
Token ID29252
Feature activation+2.56e+00

Pos logit contributions
 else+0.053
ents+0.051
 dice+0.051
 contain+0.048
 clearly+0.046
Neg logit contributions
uck-0.025
 Cow-0.024
 Wolf-0.024
oo-0.023
burn-0.022
Token laughs
Token ID22051
Feature activation-5.79e+00

Pos logit contributions
uck+0.226
oo+0.200
 Cow+0.197
 Wolf+0.189
eled+0.183
Neg logit contributions
 contain-0.198
 dice-0.196
 else-0.193
ents-0.184
ooters-0.174
Token filthy
Token ID35677
Feature activation+1.51e+00

Pos logit contributions
uck+0.028
 Cow+0.027
oo+0.026
 Wolf+0.026
burn+0.025
Neg logit contributions
 else-0.028
ents-0.028
 dice-0.027
 contain-0.026
 clearly-0.025
Token.
Token ID13
Feature activation-2.41e-02

Pos logit contributions
uck+0.125
 Cow+0.122
oo+0.119
 Wolf+0.117
ivered+0.112
Neg logit contributions
 else-0.120
ents-0.119
 dice-0.117
 contain-0.110
 clearly-0.105
Token
Token ID198
Feature activation-4.74e-01

Pos logit contributions
 else+0.002
ents+0.002
 dice+0.002
 clearly+0.002
 maybe+0.002
Neg logit contributions
burn-0.002
ivered-0.002
oo-0.002
eled-0.002
uck-0.002
Token"
Token ID1
Feature activation+6.49e-01

Pos logit contributions
 else+0.041
ents+0.038
 dice+0.036
 clearly+0.033
 contain+0.033
Neg logit contributions
burn-0.037
uck-0.037
ivered-0.036
 Cow-0.036
oo-0.035
TokenY
Token ID56
Feature activation+2.42e+00

Pos logit contributions
uck+0.045
oo+0.043
ivered+0.041
 Amen+0.040
 Cow+0.040
Neg logit contributions
 else-0.063
 dice-0.057
ents-0.056
 contain-0.054
 clearly-0.053
Tokenuck
Token ID1347
Feature activation+1.08e+01

Pos logit contributions
 Cow+0.107
 Amen+0.103
 Wolf+0.100
 wolf+0.097
 Sleep+0.089
Neg logit contributions
ents-0.297
 dice-0.278
 else-0.273
ows-0.258
hest-0.255
Token!"
Token ID2474
Feature activation+4.86e-01

Pos logit contributions
uck+0.881
 Cow+0.868
 Wolf+0.847
oo+0.838
burn+0.828
Neg logit contributions
 else-0.866
ents-0.844
 dice-0.822
 contain-0.771
 clearly-0.730
Token Mama
Token ID40084
Feature activation+4.70e+00

Pos logit contributions
ivered+0.045
uck+0.042
oo+0.042
burn+0.042
eled+0.041
Neg logit contributions
 else-0.038
ents-0.032
 contain-0.030
 dice-0.029
 clearly-0.027
Token said
Token ID531
Feature activation-9.58e-01

Pos logit contributions
uck+0.416
 Cow+0.401
oo+0.392
 Wolf+0.389
burn+0.369
Neg logit contributions
 dice-0.349
 else-0.348
ents-0.343
 contain-0.327
 clearly-0.303
Token.
Token ID13
Feature activation-1.41e-02

Pos logit contributions
 else+0.075
ents+0.071
 contain+0.067
 dice+0.065
 clearly+0.063
Neg logit contributions
uck-0.083
P-0.079
 Cow-0.079
 Wolf-0.078
oo-0.078
Token "
Token ID366
Feature activation-6.70e-02

Pos logit contributions
 else+0.001
ents+0.001
 dice+0.001
 clearly+0.001
 contain+0.001
Neg logit contributions
ivered-0.001
burn-0.001
oo-0.001
eled-0.001
uck-0.001
Token.
Token ID13
Feature activation-2.41e-02

Pos logit contributions
uck+0.175
 Cow+0.162
oo+0.160
 Wolf+0.156
burn+0.150
Neg logit contributions
 dice-0.224
ents-0.223
 else-0.222
 contain-0.213
 clearly-0.194
Token
Token ID198
Feature activation-4.74e-01

Pos logit contributions
 else+0.002
ents+0.002
 dice+0.002
 contain+0.002
 clearly+0.002
Neg logit contributions
uck-0.002
oo-0.002
ivered-0.002
 Cow-0.002
burn-0.002
Token
Token ID198
Feature activation-4.74e-01

Pos logit contributions
 dice+0.051
ents+0.051
 contain+0.051
 else+0.050
 clearly+0.046
Neg logit contributions
uck-0.026
 Cow-0.024
oo-0.024
 Wolf-0.023
P-0.021
Token"
Token ID1
Feature activation+6.49e-01

Pos logit contributions
 else+0.049
ents+0.049
 dice+0.048
 contain+0.046
 clearly+0.044
Neg logit contributions
uck-0.029
 Cow-0.027
 Wolf-0.027
oo-0.026
burn-0.025
TokenY
Token ID56
Feature activation+2.42e+00

Pos logit contributions
uck+0.036
oo+0.033
 Cow+0.033
 Wolf+0.032
ivered+0.031
Neg logit contributions
 else-0.071
 dice-0.070
ents-0.069
 contain-0.066
 clearly-0.063
Tokenuck
Token ID1347
Feature activation+1.08e+01

Pos logit contributions
 Cow+0.064
 Wolf+0.061
 Amen+0.053
 wolf+0.051
eled+0.051
Neg logit contributions
ents-0.332
 dice-0.319
 else-0.317
 contain-0.302
ows-0.295
Token,
Token ID11
Feature activation+6.66e-01

Pos logit contributions
uck+0.881
 Cow+0.868
 Wolf+0.850
oo+0.829
burn+0.825
Neg logit contributions
 else-0.854
ents-0.842
 dice-0.824
 contain-0.763
 clearly-0.725
Token what
Token ID644
Feature activation-4.26e-01

Pos logit contributions
uck+0.054
 Cow+0.051
oo+0.050
 Wolf+0.050
burn+0.048
Neg logit contributions
 else-0.055
ents-0.055
 dice-0.054
 contain-0.051
 clearly-0.048
Token is
Token ID318
Feature activation-2.28e-01

Pos logit contributions
 else+0.034
ents+0.033
 dice+0.033
 contain+0.031
 clearly+0.029
Neg logit contributions
uck-0.036
 Cow-0.034
oo-0.034
 Wolf-0.034
burn-0.032
Token that
Token ID326
Feature activation+1.91e+00

Pos logit contributions
 else+0.018
 dice+0.017
ents+0.017
 contain+0.015
 clearly+0.015
Neg logit contributions
uck-0.019
ivered-0.019
 Wolf-0.018
 Cow-0.018
eled-0.018
Token?"
Token ID1701
Feature activation+2.59e+00

Pos logit contributions
uck+0.133
 Cow+0.128
 Wolf+0.127
eled+0.126
oo+0.124
Neg logit contributions
 else-0.175
ents-0.169
 dice-0.166
 contain-0.157
 clearly-0.153
Token wanted
Token ID2227
Feature activation+8.33e-01

Pos logit contributions
uck+0.080
 Cow+0.078
 Wolf+0.075
oo+0.075
P+0.071
Neg logit contributions
ents-0.119
 dice-0.117
 else-0.115
 contain-0.114
 clearly-0.104
Token to
Token ID284
Feature activation-3.59e-01

Pos logit contributions
uck+0.051
 Cow+0.051
 Wolf+0.050
oo+0.045
 wolf+0.045
Neg logit contributions
ents-0.083
 dice-0.083
 contain-0.080
 else-0.077
ows-0.077
Token play
Token ID711
Feature activation+2.65e-01

Pos logit contributions
ents+0.032
 dice+0.030
 else+0.029
cases+0.027
ows+0.027
Neg logit contributions
uck-0.027
 Cow-0.027
oo-0.026
 Wolf-0.026
burn-0.025
Token with
Token ID351
Feature activation+8.62e-01

Pos logit contributions
uck+0.018
oo+0.016
 Cow+0.016
 Wolf+0.016
P+0.015
Neg logit contributions
ents-0.026
 dice-0.025
 else-0.025
 contain-0.024
ooters-0.022
Token something
Token ID1223
Feature activation+3.04e+00

Pos logit contributions
uck+0.048
 Cow+0.046
oo+0.044
 Wolf+0.044
 wolf+0.040
Neg logit contributions
ents-0.091
 dice-0.090
 contain-0.088
 else-0.088
ooters-0.082
Token else
Token ID2073
Feature activation-1.26e+01

Pos logit contributions
P+0.305
uck+0.304
 wolf+0.299
 Betsy+0.298
time+0.294
Neg logit contributions
ents-0.166
 dice-0.159
bars-0.130
 contain-0.128
cases-0.119
Token.
Token ID13
Feature activation-3.30e-01

Pos logit contributions
 dice+1.411
ents+1.392
 contain+1.347
ooters+1.253
saw+1.237
Neg logit contributions
uck-0.643
 Wolf-0.595
 Cow-0.580
oo-0.545
over-0.542
Token
Token ID198
Feature activation-7.82e-01

Pos logit contributions
 else+0.033
ents+0.033
 dice+0.033
 contain+0.032
 clearly+0.030
Neg logit contributions
uck-0.021
 Cow-0.020
oo-0.019
 Wolf-0.019
P-0.018
Token
Token ID198
Feature activation-7.91e-01

Pos logit contributions
 dice+0.088
ents+0.087
 contain+0.084
 else+0.084
 clearly+0.078
Neg logit contributions
uck-0.041
oo-0.039
 Cow-0.038
 Wolf-0.035
P-0.032
Token"
Token ID1
Feature activation+3.19e-01

Pos logit contributions
ents+0.077
 else+0.076
 dice+0.076
 contain+0.073
 clearly+0.069
Neg logit contributions
uck-0.053
oo-0.049
 Cow-0.049
 Wolf-0.046
P-0.046
TokenTom
Token ID13787
Feature activation-1.69e+00

Pos logit contributions
uck+0.017
 Cow+0.016
oo+0.015
 Wolf+0.015
P+0.015
Neg logit contributions
ents-0.035
 else-0.035
 dice-0.035
 contain-0.033
 clearly-0.032
Token is
Token ID318
Feature activation-5.12e-01

Pos logit contributions
ents+0.243
 dice+0.238
 else+0.237
 contain+0.229
 clearly+0.211
Neg logit contributions
uck-0.203
 Cow-0.192
 Wolf-0.189
oo-0.187
burn-0.174
Token something
Token ID1223
Feature activation+3.06e+00

Pos logit contributions
 else+0.045
ents+0.045
 dice+0.044
 contain+0.042
 clearly+0.040
Neg logit contributions
uck-0.038
 Cow-0.037
oo-0.036
 Wolf-0.035
burn-0.034
Token that
Token ID326
Feature activation+1.61e+00

Pos logit contributions
uck+0.203
 Cow+0.198
 Wolf+0.197
 Sleep+0.187
 wolf+0.186
Neg logit contributions
ents-0.293
 dice-0.287
 contain-0.265
ooters-0.241
saw-0.236
Token no
Token ID645
Feature activation+2.82e+00

Pos logit contributions
uck+0.121
 Cow+0.114
 Wolf+0.111
oo+0.108
 Sleep+0.107
Neg logit contributions
ents-0.145
 dice-0.141
 contain-0.132
 else-0.130
 clearly-0.120
Token one
Token ID530
Feature activation+1.22e-02

Pos logit contributions
uck+0.206
 Cow+0.198
oo+0.193
 Wolf+0.192
burn+0.182
Neg logit contributions
ents-0.254
 else-0.250
 dice-0.250
 contain-0.238
 clearly-0.224
Token else
Token ID2073
Feature activation-1.26e+01

Pos logit contributions
uck+0.001
 Wolf+0.001
 Cow+0.001
oo+0.001
P+0.001
Neg logit contributions
ents-0.001
 dice-0.001
 contain-0.001
cases-0.001
ows-0.001
Token can
Token ID460
Feature activation-2.44e+00

Pos logit contributions
ents+1.033
 dice+1.025
 else+0.971
 contain+0.966
 clearly+0.882
Neg logit contributions
uck-1.031
 Cow-0.971
oo-0.952
 Wolf-0.952
 Sleep-0.891
Token give
Token ID1577
Feature activation-3.86e-01

Pos logit contributions
 dice+0.210
ents+0.208
 else+0.198
 contain+0.181
ows+0.174
Neg logit contributions
uck-0.195
oo-0.186
 Cow-0.184
 Wolf-0.179
burn-0.170
Token you
Token ID345
Feature activation-2.67e-01

Pos logit contributions
 else+0.033
ents+0.032
 dice+0.032
 contain+0.030
 clearly+0.029
Neg logit contributions
uck-0.030
 Cow-0.029
oo-0.028
 Wolf-0.028
burn-0.027
Token.
Token ID13
Feature activation-3.48e-01

Pos logit contributions
ents+0.026
 dice+0.025
 else+0.025
 contain+0.024
 clearly+0.023
Neg logit contributions
uck-0.018
 Cow-0.017
oo-0.016
 Wolf-0.016
burn-0.015
Token Sadly
Token ID27213
Feature activation+4.80e+00

Pos logit contributions
ents+0.034
 else+0.033
 dice+0.032
 clearly+0.030
 contain+0.030
Neg logit contributions
uck-0.023
oo-0.022
burn-0.021
ivered-0.021
 Cow-0.021
Token I
Token ID314
Feature activation-1.14e+00

Pos logit contributions
ents+0.210
 else+0.209
 dice+0.208
 contain+0.206
 clearly+0.187
Neg logit contributions
uck-0.176
 Cow-0.169
oo-0.164
 Wolf-0.163
 Sleep-0.151
Token want
Token ID765
Feature activation-1.89e+00

Pos logit contributions
ents+0.087
 dice+0.086
 else+0.086
 contain+0.080
 clearly+0.075
Neg logit contributions
uck-0.097
 Cow-0.094
oo-0.092
 Wolf-0.091
burn-0.086
Token to
Token ID284
Feature activation-3.64e-01

Pos logit contributions
 contain+0.180
ents+0.180
 dice+0.178
 else+0.174
ows+0.170
Neg logit contributions
uck-0.123
 Cow-0.116
 Wolf-0.113
oo-0.108
P-0.101
Token buy
Token ID2822
Feature activation+2.06e-01

Pos logit contributions
ents+0.028
 else+0.028
 dice+0.028
 contain+0.026
 clearly+0.024
Neg logit contributions
uck-0.031
 Cow-0.030
oo-0.029
 Wolf-0.029
burn-0.028
Token something
Token ID1223
Feature activation+3.04e+00

Pos logit contributions
uck+0.013
 Cow+0.012
oo+0.012
 Wolf+0.012
P+0.011
Neg logit contributions
ents-0.020
 dice-0.020
 else-0.020
 contain-0.020
 clearly-0.018
Token else
Token ID2073
Feature activation-1.26e+01

Pos logit contributions
uck+0.203
 Cow+0.192
oo+0.178
 Wolf+0.178
 sausage+0.177
Neg logit contributions
 dice-0.278
ents-0.274
 contain-0.258
ows-0.240
 counts-0.236
Token with
Token ID351
Feature activation+8.54e-01

Pos logit contributions
 dice+1.214
ents+1.187
 contain+1.154
 else+1.132
 clearly+1.071
Neg logit contributions
uck-0.838
 Cow-0.780
oo-0.746
 Wolf-0.737
 Sleep-0.692
Token the
Token ID262
Feature activation-5.03e-01

Pos logit contributions
uck+0.058
 Cow+0.056
oo+0.054
 Wolf+0.053
P+0.050
Neg logit contributions
ents-0.080
 else-0.079
 dice-0.079
 contain-0.076
 clearly-0.071
Token money
Token ID1637
Feature activation-2.74e+00

Pos logit contributions
ents+0.047
 else+0.046
 dice+0.045
 contain+0.043
 clearly+0.042
Neg logit contributions
uck-0.035
 Cow-0.034
 Wolf-0.033
oo-0.032
burn-0.031
Token,"
Token ID553
Feature activation-3.01e-01

Pos logit contributions
 dice+0.258
ents+0.248
 else+0.247
 contain+0.239
 clearly+0.234
Neg logit contributions
uck-0.180
 Cow-0.161
oo-0.156
 Wolf-0.149
over-0.144
Token said
Token ID531
Feature activation-1.29e+00

Pos logit contributions
 else+0.022
ents+0.019
 dice+0.018
 contain+0.017
 clearly+0.017
Neg logit contributions
uck-0.029
oo-0.028
burn-0.027
 Cow-0.027
ivered-0.027
Token ignoring
Token ID15482
Feature activation+2.48e+00

Pos logit contributions
uck+0.032
 Cow+0.031
oo+0.031
 Wolf+0.030
burn+0.029
Neg logit contributions
 dice-0.043
ents-0.042
 else-0.041
 contain-0.039
 clearly-0.037
Token the
Token ID262
Feature activation-4.83e-01

Pos logit contributions
uck+0.188
oo+0.176
ivered+0.175
 Cow+0.174
 Wolf+0.168
Neg logit contributions
 else-0.219
ents-0.213
 dice-0.206
 contain-0.198
 clearly-0.195
Token plane
Token ID6614
Feature activation-2.77e+00

Pos logit contributions
ents+0.039
 else+0.039
 dice+0.039
 contain+0.037
 clearly+0.034
Neg logit contributions
uck-0.039
 Cow-0.038
oo-0.037
 Wolf-0.037
burn-0.035
Token like
Token ID588
Feature activation-3.14e+00

Pos logit contributions
 dice+0.304
ents+0.292
 else+0.270
 contain+0.265
ooters+0.261
Neg logit contributions
uck-0.141
 Cow-0.140
 Wolf-0.139
oo-0.138
 wolf-0.127
Token everyone
Token ID2506
Feature activation-1.19e+00

Pos logit contributions
 else+0.283
ents+0.282
 dice+0.280
 contain+0.266
 clearly+0.251
Neg logit contributions
uck-0.226
 Cow-0.216
oo-0.211
 Wolf-0.209
burn-0.198
Token else
Token ID2073
Feature activation-1.26e+01

Pos logit contributions
 dice+0.114
ents+0.103
 contain+0.097
ooters+0.092
hest+0.089
Neg logit contributions
uck-0.082
 Wolf-0.079
 Cow-0.078
 wolf-0.077
oo-0.075
Token and
Token ID290
Feature activation-3.83e-01

Pos logit contributions
 dice+1.237
ents+1.193
 contain+1.144
 else+1.119
ooters+1.045
Neg logit contributions
uck-0.842
 Cow-0.786
 Wolf-0.782
oo-0.770
burn-0.725
Token the
Token ID262
Feature activation-5.03e-01

Pos logit contributions
 dice+0.036
ents+0.036
 else+0.035
 contain+0.034
 clearly+0.031
Neg logit contributions
uck-0.026
 Cow-0.025
 Wolf-0.024
oo-0.024
burn-0.023
Token plane
Token ID6614
Feature activation-2.79e+00

Pos logit contributions
ents+0.042
 else+0.042
 dice+0.042
 contain+0.040
 clearly+0.037
Neg logit contributions
uck-0.039
 Cow-0.038
oo-0.037
 Wolf-0.037
burn-0.035
Token was
Token ID373
Feature activation+2.12e-02

Pos logit contributions
 dice+0.270
ents+0.266
 else+0.254
 contain+0.248
ooters+0.229
Neg logit contributions
uck-0.182
 Cow-0.174
oo-0.173
 Wolf-0.168
burn-0.162
Token so
Token ID523
Feature activation-1.57e+00

Pos logit contributions
uck+0.002
 Cow+0.002
 Wolf+0.002
oo+0.002
P+0.001
Neg logit contributions
 else-0.002
ents-0.002
 dice-0.002
 contain-0.002
 clearly-0.002
Token box
Token ID3091
Feature activation+4.94e-01

Pos logit contributions
 dice+0.530
 contain+0.521
 else+0.513
ents+0.469
ooters+0.462
Neg logit contributions
uck-0.345
burn-0.290
oo-0.278
 Cow-0.270
over-0.269
Token so
Token ID523
Feature activation-1.53e+00

Pos logit contributions
uck+0.034
 Cow+0.031
oo+0.030
 Wolf+0.029
burn+0.028
Neg logit contributions
 dice-0.047
 else-0.046
ents-0.045
 contain-0.042
 clearly-0.040
Token that
Token ID326
Feature activation+1.61e+00

Pos logit contributions
ents+0.130
 else+0.130
 dice+0.129
 contain+0.123
 clearly+0.115
Neg logit contributions
uck-0.118
 Cow-0.112
oo-0.110
 Wolf-0.109
burn-0.103
Token no
Token ID645
Feature activation+2.82e+00

Pos logit contributions
uck+0.118
 Cow+0.112
 Wolf+0.109
oo+0.108
burn+0.102
Neg logit contributions
ents-0.145
 dice-0.143
 else-0.141
 contain-0.136
 clearly-0.125
Token one
Token ID530
Feature activation+1.22e-02

Pos logit contributions
uck+0.134
 Cow+0.130
 Wolf+0.128
oo+0.123
 wolf+0.121
Neg logit contributions
ents-0.316
 dice-0.305
 contain-0.301
 else-0.299
ows-0.295
Token else
Token ID2073
Feature activation-1.26e+01

Pos logit contributions
uck+0.001
 Wolf+0.001
 Cow+0.001
oo+0.001
 wolf+0.001
Neg logit contributions
 dice-0.001
ents-0.001
 contain-0.001
ooters-0.001
ows-0.001
Token could
Token ID714
Feature activation-3.60e+00

Pos logit contributions
 dice+1.052
ents+1.029
 contain+0.972
 else+0.954
 clearly+0.888
Neg logit contributions
uck-1.016
 Cow-0.944
 Wolf-0.927
oo-0.925
 Sleep-0.867
Token get
Token ID651
Feature activation-5.56e-01

Pos logit contributions
 dice+0.329
ents+0.312
ooters+0.284
 else+0.280
ows+0.278
Neg logit contributions
uck-0.250
oo-0.249
 Cow-0.240
 Wolf-0.239
burn-0.228
Token into
Token ID656
Feature activation-4.23e+00

Pos logit contributions
 else+0.051
ents+0.050
 dice+0.050
 contain+0.048
 clearly+0.045
Neg logit contributions
uck-0.040
 Cow-0.038
oo-0.037
 Wolf-0.037
burn-0.035
Token it
Token ID340
Feature activation+1.16e+00

Pos logit contributions
 else+0.396
ents+0.392
 dice+0.388
 contain+0.367
 clearly+0.355
Neg logit contributions
uck-0.298
 Cow-0.282
oo-0.277
 Wolf-0.270
burn-0.261
Token.
Token ID13
Feature activation-3.54e-01

Pos logit contributions
uck+0.074
 Cow+0.069
 Wolf+0.067
oo+0.066
burn+0.065
Neg logit contributions
 dice-0.113
ents-0.109
 else-0.106
 contain-0.105
ooters-0.096
Token to
Token ID284
Feature activation-3.53e-01

Pos logit contributions
uck+0.122
 Cow+0.113
 Wolf+0.110
oo+0.110
 wolf+0.101
Neg logit contributions
ents-0.162
 dice-0.158
 else-0.155
 contain-0.149
 clearly-0.142
Token grab
Token ID5552
Feature activation-2.76e+00

Pos logit contributions
ents+0.029
 dice+0.028
 else+0.028
 contain+0.026
 clearly+0.024
Neg logit contributions
uck-0.029
 Cow-0.028
oo-0.028
 Wolf-0.027
burn-0.026
Token it
Token ID340
Feature activation+1.19e+00

Pos logit contributions
 else+0.253
 dice+0.248
ents+0.247
 contain+0.233
 clearly+0.223
Neg logit contributions
uck-0.199
 Cow-0.190
oo-0.186
 Wolf-0.183
ivered-0.179
Token,
Token ID11
Feature activation+3.64e-01

Pos logit contributions
uck+0.072
 Cow+0.066
 Wolf+0.065
oo+0.065
burn+0.059
Neg logit contributions
ents-0.124
 dice-0.124
 else-0.122
 contain-0.117
 clearly-0.110
Token someone
Token ID2130
Feature activation+1.93e+00

Pos logit contributions
uck+0.031
 Cow+0.030
oo+0.029
 Wolf+0.029
burn+0.028
Neg logit contributions
 else-0.028
ents-0.028
 dice-0.028
 contain-0.026
 clearly-0.025
Token else
Token ID2073
Feature activation-1.26e+01

Pos logit contributions
 Cow+0.123
uck+0.121
 Wolf+0.119
 wolf+0.114
oo+0.110
Neg logit contributions
 dice-0.185
ents-0.178
 contain-0.164
ooters-0.163
 poses-0.158
Token was
Token ID373
Feature activation+3.97e-02

Pos logit contributions
 dice+1.157
ents+1.150
 else+1.118
 contain+1.087
 clearly+1.013
Neg logit contributions
uck-0.897
 Cow-0.852
 Wolf-0.834
oo-0.823
eled-0.763
Token already
Token ID1541
Feature activation+5.53e-01

Pos logit contributions
uck+0.003
 Cow+0.003
oo+0.003
 Wolf+0.003
burn+0.003
Neg logit contributions
 else-0.004
ents-0.004
 dice-0.003
 contain-0.003
 clearly-0.003
Token taking
Token ID2263
Feature activation+3.48e+00

Pos logit contributions
uck+0.043
 Cow+0.041
 Wolf+0.040
oo+0.039
burn+0.037
Neg logit contributions
 else-0.048
ents-0.047
 dice-0.046
 contain-0.044
 clearly-0.042
Token it
Token ID340
Feature activation+1.16e+00

Pos logit contributions
uck+0.218
 Cow+0.206
oo+0.200
ivered+0.197
burn+0.194
Neg logit contributions
 else-0.353
ents-0.345
 dice-0.345
 contain-0.323
 clearly-0.314
Token.
Token ID13
Feature activation-3.54e-01

Pos logit contributions
uck+0.077
 Cow+0.071
oo+0.070
 Wolf+0.070
burn+0.068
Neg logit contributions
 dice-0.112
ents-0.110
 else-0.108
 contain-0.102
 clearly-0.098
Token to
Token ID284
Feature activation-3.53e-01

Pos logit contributions
 else+0.027
ents+0.027
 dice+0.027
 contain+0.026
 clearly+0.025
Neg logit contributions
uck-0.022
 Cow-0.020
oo-0.020
 Wolf-0.020
burn-0.019
Token your
Token ID534
Feature activation+2.08e+00

Pos logit contributions
ents+0.036
 dice+0.035
 else+0.035
 contain+0.033
ows+0.032
Neg logit contributions
uck-0.021
 Cow-0.021
oo-0.020
 Wolf-0.020
 Sleep-0.019
Token room
Token ID2119
Feature activation-7.08e-01

Pos logit contributions
uck+0.126
 fever+0.116
 Wolf+0.115
 Cow+0.115
oo+0.113
Neg logit contributions
ents-0.213
 contain-0.204
 else-0.204
 dice-0.202
hest-0.183
Token.
Token ID13
Feature activation-3.22e-01

Pos logit contributions
 dice+0.062
 else+0.062
ents+0.062
 contain+0.059
 clearly+0.055
Neg logit contributions
uck-0.052
 Cow-0.049
oo-0.048
 Wolf-0.047
burn-0.046
Token Or
Token ID1471
Feature activation-7.46e+00

Pos logit contributions
ents+0.027
 dice+0.027
 else+0.027
 contain+0.025
 clearly+0.024
Neg logit contributions
uck-0.025
ivered-0.023
oo-0.023
eled-0.023
 Cow-0.023
Token else
Token ID2073
Feature activation-1.26e+01

Pos logit contributions
 dice+0.672
ents+0.667
 contain+0.619
hest+0.592
ows+0.591
Neg logit contributions
uck-0.501
 Cow-0.483
oo-0.480
 Wolf-0.455
 Sleep-0.423
Token,
Token ID11
Feature activation+3.56e-01

Pos logit contributions
 else+1.013
ents+1.003
 dice+0.989
 contain+0.934
 clearly+0.880
Neg logit contributions
uck-1.049
 Cow-1.007
oo-0.988
 Wolf-0.979
burn-0.939
Token no
Token ID645
Feature activation+2.81e+00

Pos logit contributions
uck+0.027
 Cow+0.026
 Wolf+0.026
oo+0.025
burn+0.024
Neg logit contributions
ents-0.030
 dice-0.030
 else-0.029
 contain-0.028
 clearly-0.027
Token story
Token ID1621
Feature activation+2.88e+00

Pos logit contributions
uck+0.214
 Cow+0.206
oo+0.203
 Wolf+0.200
burn+0.199
Neg logit contributions
ents-0.239
 dice-0.232
 else-0.231
 contain-0.225
 clearly-0.207
Token time
Token ID640
Feature activation+3.26e-01

Pos logit contributions
uck+0.221
oo+0.212
 Cow+0.199
burn+0.196
 Wolf+0.191
Neg logit contributions
ents-0.247
 dice-0.245
 else-0.241
 contain-0.237
 clearly-0.215
Token for
Token ID329
Feature activation-2.54e-01

Pos logit contributions
uck+0.024
oo+0.022
 Cow+0.021
 Wolf+0.020
burn+0.020
Neg logit contributions
 dice-0.029
ents-0.029
 else-0.028
 contain-0.027
 clearly-0.026
Token cups
Token ID14180
Feature activation-1.91e+00

Pos logit contributions
ents+0.042
 else+0.042
 dice+0.042
 contain+0.040
 clearly+0.038
Neg logit contributions
uck-0.033
 Cow-0.031
oo-0.030
 Wolf-0.030
burn-0.029
Token and
Token ID290
Feature activation-3.65e-01

Pos logit contributions
 dice+0.178
 else+0.175
ents+0.173
 contain+0.169
 clearly+0.162
Neg logit contributions
uck-0.135
 Cow-0.121
oo-0.119
 Wolf-0.116
burn-0.114
Token look
Token ID804
Feature activation+2.51e-01

Pos logit contributions
ents+0.033
 else+0.032
 dice+0.032
 contain+0.030
 clearly+0.029
Neg logit contributions
uck-0.027
 Cow-0.025
 Wolf-0.024
oo-0.024
burn-0.024
Token for
Token ID329
Feature activation-2.23e-01

Pos logit contributions
uck+0.015
 Cow+0.014
oo+0.014
 Wolf+0.013
 wolf+0.013
Neg logit contributions
 dice-0.026
ents-0.025
 contain-0.024
 else-0.024
ooters-0.022
Token something
Token ID1223
Feature activation+3.04e+00

Pos logit contributions
 else+0.020
ents+0.020
 dice+0.020
 contain+0.018
 clearly+0.018
Neg logit contributions
uck-0.017
 Cow-0.015
oo-0.015
 Wolf-0.015
ivered-0.015
Token else
Token ID2073
Feature activation-1.26e+01

Pos logit contributions
uck+0.357
 Cow+0.348
P+0.341
 Wolf+0.337
oo+0.333
Neg logit contributions
ents-0.130
 dice-0.129
 contain-0.105
cases-0.089
bars-0.088
Token to
Token ID284
Feature activation-3.77e-01

Pos logit contributions
 dice+1.299
ents+1.283
 contain+1.256
cases+1.138
hest+1.137
Neg logit contributions
uck-0.758
 Cow-0.695
 Wolf-0.676
oo-0.648
P-0.602
Token drink
Token ID4144
Feature activation+4.56e+00

Pos logit contributions
 else+0.033
 dice+0.031
ents+0.030
 contain+0.030
 clearly+0.028
Neg logit contributions
uck-0.030
 Wolf-0.029
 Cow-0.029
oo-0.028
ivered-0.027
Token.
Token ID13
Feature activation-3.41e-01

Pos logit contributions
uck+0.395
 Cow+0.390
 Wolf+0.388
oo+0.384
ivered+0.383
Neg logit contributions
 else-0.346
ents-0.327
 dice-0.321
 contain-0.303
 clearly-0.292
Token They
Token ID1119
Feature activation+1.70e+00

Pos logit contributions
ents+0.035
 else+0.035
 dice+0.035
 contain+0.033
 clearly+0.032
Neg logit contributions
uck-0.021
 Cow-0.020
oo-0.019
 Wolf-0.019
burn-0.017
Token see
Token ID766
Feature activation-1.88e+00

Pos logit contributions
uck+0.125
 Cow+0.123
 Wolf+0.118
 fever+0.115
oo+0.113
Neg logit contributions
ents-0.145
 dice-0.141
 else-0.132
ooters-0.127
cases-0.121
Token maybe
Token ID3863
Feature activation-6.36e+00

Pos logit contributions
uck+0.086
 Cow+0.084
 Wolf+0.081
oo+0.080
 Sleep+0.076
Neg logit contributions
ents-0.074
 dice-0.071
 else-0.071
 contain-0.069
 clearly-0.063
Token they
Token ID484
Feature activation-1.92e+00

Pos logit contributions
ents+0.483
 dice+0.474
 else+0.469
 contain+0.461
 clearly+0.425
Neg logit contributions
 Cow-0.539
uck-0.535
oo-0.508
 Wolf-0.505
burn-0.481
Token belong
Token ID5594
Feature activation-1.53e+00

Pos logit contributions
 dice+0.159
ents+0.154
 else+0.150
 contain+0.146
 clearly+0.139
Neg logit contributions
uck-0.147
 Cow-0.141
 Wolf-0.137
oo-0.135
burn-0.128
Token to
Token ID284
Feature activation-3.70e-01

Pos logit contributions
ents+0.146
 dice+0.145
 contain+0.138
 else+0.138
 clearly+0.128
Neg logit contributions
uck-0.106
 Cow-0.097
oo-0.094
 Wolf-0.093
P-0.090
Token someone
Token ID2130
Feature activation+1.93e+00

Pos logit contributions
ents+0.038
 dice+0.037
 else+0.036
 contain+0.036
ows+0.034
Neg logit contributions
 Cow-0.022
uck-0.021
oo-0.020
 Wolf-0.020
 Sleep-0.019
Token else
Token ID2073
Feature activation-1.26e+01

Pos logit contributions
uck+0.190
 Cow+0.178
oo+0.173
 wolf+0.173
 Betsy+0.171
Neg logit contributions
 dice-0.110
 contain-0.110
ents-0.102
 levers-0.094
ooters-0.087
Token,"
Token ID553
Feature activation-2.83e-01

Pos logit contributions
 dice+1.174
ents+1.151
 contain+1.135
 else+1.092
 clearly+1.039
Neg logit contributions
uck-0.890
 Cow-0.801
oo-0.790
 Wolf-0.768
over-0.737
Token Ben
Token ID3932
Feature activation-9.21e-01

Pos logit contributions
ents+0.023
 else+0.023
 dice+0.023
 contain+0.021
 clearly+0.020
Neg logit contributions
uck-0.023
 Cow-0.022
oo-0.022
 Wolf-0.022
burn-0.021
Token said
Token ID531
Feature activation-1.28e+00

Pos logit contributions
 dice+0.075
ents+0.074
 else+0.072
 contain+0.069
 clearly+0.064
Neg logit contributions
uck-0.076
 Cow-0.073
oo-0.071
 Wolf-0.070
ivered-0.067
Token.
Token ID13
Feature activation-3.48e-01

Pos logit contributions
ents+0.110
 dice+0.109
 else+0.108
 contain+0.102
 clearly+0.096
Neg logit contributions
uck-0.099
 Cow-0.094
oo-0.093
 Wolf-0.091
burn-0.087
Token "
Token ID366
Feature activation-3.97e-01

Pos logit contributions
 else+0.033
ents+0.033
 dice+0.033
 contain+0.031
 clearly+0.030
Neg logit contributions
uck-0.023
 Cow-0.022
oo-0.022
 Wolf-0.022
burn-0.021
Token finding
Token ID4917
Feature activation-2.92e+00

Pos logit contributions
uck+0.028
 Cow+0.027
 Wolf+0.026
oo+0.026
 Sleep+0.024
Neg logit contributions
ents-0.034
 dice-0.034
 else-0.033
 contain-0.033
 clearly-0.030
Token something
Token ID1223
Feature activation+3.06e+00

Pos logit contributions
ents+0.290
 dice+0.283
 else+0.279
 contain+0.276
 clearly+0.252
Neg logit contributions
uck-0.187
 Cow-0.180
 Wolf-0.174
oo-0.170
P-0.159
Token can
Token ID460
Feature activation-2.43e+00

Pos logit contributions
 Cow+0.257
uck+0.255
 wolf+0.253
 Wolf+0.252
 Bunny+0.242
Neg logit contributions
 dice-0.224
ents-0.223
 contain-0.213
bars-0.179
saw-0.170
Token make
Token ID787
Feature activation+6.88e-01

Pos logit contributions
 dice+0.228
ents+0.224
 else+0.198
 contain+0.196
ooters+0.190
Neg logit contributions
 Cow-0.171
uck-0.167
 Wolf-0.163
oo-0.162
 wolf-0.155
Token someone
Token ID2130
Feature activation+1.93e+00

Pos logit contributions
uck+0.045
 Cow+0.041
oo+0.041
 Wolf+0.040
burn+0.037
Neg logit contributions
 dice-0.068
ents-0.068
 contain-0.065
 else-0.065
 clearly-0.059
Token else
Token ID2073
Feature activation-1.26e+01

Pos logit contributions
uck+0.165
 Cow+0.160
 wolf+0.159
burn+0.157
 suffering+0.156
Neg logit contributions
 dice-0.143
ents-0.132
 contain-0.116
nine-0.107
bars-0.106
Token happy
Token ID3772
Feature activation-1.95e+00

Pos logit contributions
 dice+1.200
ents+1.140
 contain+1.069
 else+1.012
ooters+0.967
Neg logit contributions
uck-0.903
 Cow-0.833
 Wolf-0.795
oo-0.795
burn-0.772
Token.
Token ID13
Feature activation-3.32e-01

Pos logit contributions
ents+0.184
 dice+0.183
 else+0.175
 contain+0.172
 clearly+0.155
Neg logit contributions
uck-0.138
 Cow-0.126
oo-0.124
 Wolf-0.122
 Sleep-0.117
Token<|endoftext|>
Token ID50256
Feature activation+1.17e+00

Pos logit contributions
 else+0.032
ents+0.032
 dice+0.031
 contain+0.030
 clearly+0.028
Neg logit contributions
uck-0.022
 Cow-0.021
oo-0.021
 Wolf-0.020
burn-0.019
TokenL
Token ID43
Feature activation-1.20e-01

Pos logit contributions
uck+0.072
 Cow+0.072
ivered+0.071
oo+0.070
burn+0.069
Neg logit contributions
 else-0.113
 dice-0.112
ents-0.107
 contain-0.100
 clearly-0.098
Tokenily
Token ID813
Feature activation-4.30e+00

Pos logit contributions
 else+0.011
 dice+0.011
ents+0.010
 contain+0.010
 clearly+0.010
Neg logit contributions
uck-0.009
oo-0.008
 Cow-0.008
 Wolf-0.008
P-0.008
Token would
Token ID561
Feature activation-3.12e-01

Pos logit contributions
 else+0.136
ents+0.136
 dice+0.135
 contain+0.128
 clearly+0.120
Neg logit contributions
uck-0.134
 Cow-0.128
oo-0.126
 Wolf-0.125
burn-0.119
Token look
Token ID804
Feature activation+2.56e-01

Pos logit contributions
ents+0.025
 dice+0.025
 else+0.025
 contain+0.023
 clearly+0.022
Neg logit contributions
uck-0.025
 Cow-0.025
oo-0.024
 Wolf-0.024
burn-0.023
Token for
Token ID329
Feature activation-2.17e-01

Pos logit contributions
uck+0.015
 Cow+0.014
 Wolf+0.014
 wolf+0.014
oo+0.013
Neg logit contributions
ents-0.025
 dice-0.024
 contain-0.024
 else-0.023
ooters-0.022
Token it
Token ID340
Feature activation+1.18e+00

Pos logit contributions
ents+0.022
 else+0.022
 dice+0.022
 contain+0.021
 clearly+0.020
Neg logit contributions
uck-0.013
 Cow-0.013
 Wolf-0.012
oo-0.012
P-0.011
Token somewhere
Token ID7382
Feature activation-1.98e+00

Pos logit contributions
uck+0.075
 Cow+0.075
 Wolf+0.073
oo+0.071
 wolf+0.067
Neg logit contributions
 dice-0.111
ents-0.111
 contain-0.106
 else-0.100
saw-0.096
Token else
Token ID2073
Feature activation-1.26e+01

Pos logit contributions
 dice+0.133
ents+0.120
 contain+0.114
 Isabel+0.103
bars+0.101
Neg logit contributions
 Cow-0.172
 wolf-0.170
 Wolf-0.168
uck-0.165
 Betsy-0.163
Token.
Token ID13
Feature activation-3.30e-01

Pos logit contributions
ents+1.223
 dice+1.202
 contain+1.171
 else+1.114
 clearly+1.072
Neg logit contributions
uck-0.808
 Wolf-0.757
 Cow-0.755
oo-0.713
over-0.690
Token He
Token ID679
Feature activation+6.77e-01

Pos logit contributions
ents+0.035
 else+0.035
 dice+0.035
 contain+0.034
 clearly+0.032
Neg logit contributions
uck-0.018
 Cow-0.017
 Wolf-0.017
oo-0.016
 Sleep-0.015
Token left
Token ID1364
Feature activation+7.59e-01

Pos logit contributions
 Cow+0.046
uck+0.045
 Wolf+0.043
oo+0.042
eled+0.042
Neg logit contributions
 dice-0.063
ents-0.062
 contain-0.060
 else-0.058
ooters-0.056
Token the
Token ID262
Feature activation-5.19e-01

Pos logit contributions
uck+0.054
 Cow+0.051
oo+0.050
 Wolf+0.049
burn+0.047
Neg logit contributions
 else-0.070
ents-0.070
 dice-0.069
 contain-0.066
 clearly-0.063
Token room
Token ID2119
Feature activation-7.46e-01

Pos logit contributions
 else+0.048
ents+0.048
 dice+0.047
 contain+0.045
 clearly+0.042
Neg logit contributions
uck-0.037
 Cow-0.035
oo-0.034
 Wolf-0.034
burn-0.032
Token They
Token ID1119
Feature activation+1.75e+00

Pos logit contributions
 else+0.026
ents+0.026
 dice+0.026
 contain+0.024
 clearly+0.024
Neg logit contributions
uck-0.023
oo-0.022
ivered-0.021
 Cow-0.020
P-0.020
Token decided
Token ID3066
Feature activation+1.37e+00

Pos logit contributions
 Cow+0.107
uck+0.106
P+0.100
oo+0.099
 Wolf+0.097
Neg logit contributions
 dice-0.169
ents-0.167
 else-0.162
 contain-0.157
 clearly-0.152
Token to
Token ID284
Feature activation-3.64e-01

Pos logit contributions
uck+0.103
 Cow+0.098
oo+0.096
 Wolf+0.095
burn+0.091
Neg logit contributions
 dice-0.121
ents-0.121
 else-0.120
 contain-0.114
 clearly-0.106
Token play
Token ID711
Feature activation+2.60e-01

Pos logit contributions
 else+0.025
 dice+0.024
ents+0.024
 contain+0.023
 clearly+0.021
Neg logit contributions
uck-0.034
 Cow-0.033
 Wolf-0.032
ivered-0.032
oo-0.032
Token something
Token ID1223
Feature activation+3.04e+00

Pos logit contributions
uck+0.019
oo+0.017
 Cow+0.017
 Wolf+0.017
P+0.016
Neg logit contributions
ents-0.024
 else-0.023
 dice-0.023
 contain-0.023
 clearly-0.021
Token else
Token ID2073
Feature activation-1.26e+01

Pos logit contributions
P+0.310
time+0.296
uck+0.286
oo+0.274
Poor+0.264
Neg logit contributions
ents-0.192
 dice-0.157
bars-0.154
 levers-0.142
 contain-0.141
Token.
Token ID13
Feature activation-3.30e-01

Pos logit contributions
ents+1.447
 dice+1.429
 contain+1.390
ooters+1.353
cases+1.324
Neg logit contributions
uck-0.616
over-0.543
oo-0.489
P-0.456
 Wolf-0.451
Token They
Token ID1119
Feature activation+1.72e+00

Pos logit contributions
 else+0.031
ents+0.031
 dice+0.031
 contain+0.029
 clearly+0.028
Neg logit contributions
uck-0.023
oo-0.021
 Cow-0.021
ivered-0.020
 Wolf-0.020
Token did
Token ID750
Feature activation+1.53e+00

Pos logit contributions
 Cow+0.094
uck+0.090
P+0.090
eled+0.087
burn+0.085
Neg logit contributions
ents-0.173
 dice-0.171
 else-0.164
 contain-0.162
ooters-0.159
Token not
Token ID407
Feature activation+6.17e-01

Pos logit contributions
uck+0.102
oo+0.098
 Cow+0.093
 Wolf+0.087
burn+0.086
Neg logit contributions
ents-0.144
 dice-0.138
 contain-0.138
 else-0.135
bars-0.124
Token want
Token ID765
Feature activation-1.94e+00

Pos logit contributions
uck+0.049
 Cow+0.048
oo+0.047
 Wolf+0.047
burn+0.045
Neg logit contributions
ents-0.049
 dice-0.048
 else-0.047
cases-0.043
 clearly-0.042
Token "
Token ID366
Feature activation-3.49e-01

Pos logit contributions
uck+0.025
 Cow+0.024
oo+0.023
 Wolf+0.023
burn+0.022
Neg logit contributions
ents-0.038
 dice-0.037
 else-0.037
 contain-0.036
 clearly-0.034
TokenCan
Token ID6090
Feature activation+2.50e+00

Pos logit contributions
 else+0.037
ents+0.036
 dice+0.036
 contain+0.034
 clearly+0.033
Neg logit contributions
uck-0.020
 Cow-0.019
oo-0.019
 Wolf-0.019
ivered-0.018
Token I
Token ID314
Feature activation-1.15e+00

Pos logit contributions
uck+0.200
 Cow+0.190
 Wolf+0.183
oo+0.182
 Sleep+0.171
Neg logit contributions
 dice-0.209
ents-0.209
 contain-0.200
 else-0.193
ows-0.184
Token have
Token ID423
Feature activation-1.11e+00

Pos logit contributions
 else+0.085
ents+0.081
 dice+0.080
 contain+0.076
 clearly+0.070
Neg logit contributions
uck-0.103
 Cow-0.099
oo-0.098
 Wolf-0.098
ivered-0.096
Token something
Token ID1223
Feature activation+3.04e+00

Pos logit contributions
 else+0.101
ents+0.097
 dice+0.095
 contain+0.085
 clearly+0.084
Neg logit contributions
uck-0.085
ivered-0.083
oo-0.080
 Wolf-0.079
 Cow-0.079
Token else
Token ID2073
Feature activation-1.26e+01

Pos logit contributions
uck+0.228
 Cow+0.218
oo+0.213
P+0.201
 Wolf+0.201
Neg logit contributions
ents-0.264
 dice-0.263
 contain-0.246
bars-0.225
cases-0.225
Token?"
Token ID1701
Feature activation+2.28e+00

Pos logit contributions
ents+1.257
 dice+1.245
 else+1.233
 contain+1.195
 clearly+1.129
Neg logit contributions
uck-0.814
 Cow-0.759
oo-0.745
 Wolf-0.724
burn-0.681
Token
Token ID198
Feature activation-7.82e-01

Pos logit contributions
uck+0.156
 Cow+0.148
oo+0.145
 Wolf+0.144
burn+0.138
Neg logit contributions
 else-0.217
ents-0.213
 dice-0.211
 contain-0.201
 clearly-0.193
Token
Token ID198
Feature activation-7.91e-01

Pos logit contributions
 else+0.075
ents+0.074
 dice+0.073
 contain+0.069
 clearly+0.067
Neg logit contributions
uck-0.053
 Cow-0.050
 Wolf-0.049
oo-0.049
burn-0.046
TokenHer
Token ID9360
Feature activation-4.95e-01

Pos logit contributions
 else+0.079
ents+0.077
 dice+0.076
 contain+0.072
 clearly+0.069
Neg logit contributions
uck-0.052
 Cow-0.050
 Wolf-0.048
oo-0.048
burn-0.045
Token mom
Token ID1995
Feature activation+1.01e-03

Pos logit contributions
 dice+0.060
ents+0.060
 contain+0.059
 else+0.058
hest+0.055
Neg logit contributions
uck-0.020
 Cow-0.020
oo-0.019
 Wolf-0.018
burn-0.018
Token wanted
Token ID2227
Feature activation+8.39e-01

Pos logit contributions
 dice+0.174
ents+0.167
 contain+0.167
 else+0.166
ooters+0.156
Neg logit contributions
uck-0.117
 Cow-0.109
 Wolf-0.107
oo-0.105
eled-0.103
Token to
Token ID284
Feature activation-3.53e-01

Pos logit contributions
uck+0.054
 Cow+0.051
 Wolf+0.050
oo+0.047
 Sleep+0.044
Neg logit contributions
 dice-0.083
ents-0.081
 contain-0.079
 else-0.078
ows-0.074
Token make
Token ID787
Feature activation+6.98e-01

Pos logit contributions
ents+0.029
 dice+0.029
 else+0.028
 contain+0.025
ows+0.025
Neg logit contributions
uck-0.028
 Cow-0.027
oo-0.027
 Wolf-0.026
burn-0.025
Token sure
Token ID1654
Feature activation-7.51e-01

Pos logit contributions
uck+0.046
 Cow+0.042
oo+0.042
 Wolf+0.041
burn+0.038
Neg logit contributions
 dice-0.068
ents-0.068
 else-0.068
 contain-0.065
 clearly-0.060
Token nobody
Token ID8168
Feature activation-6.93e-02

Pos logit contributions
 dice+0.081
ents+0.079
ooters+0.072
 else+0.071
ows+0.071
Neg logit contributions
uck-0.041
 Cow-0.038
 Wolf-0.036
 Betsy-0.033
oo-0.032
Token else
Token ID2073
Feature activation-1.26e+01

Pos logit contributions
 dice+0.007
ents+0.007
ooters+0.006
cases+0.006
 levers+0.006
Neg logit contributions
uck-0.004
 Wolf-0.004
 Cow-0.004
 Sleep-0.004
oo-0.004
Token found
Token ID1043
Feature activation-3.79e-01

Pos logit contributions
 dice+1.307
ents+1.260
 contain+1.131
ooters+1.126
hest+1.062
Neg logit contributions
uck-0.755
 Cow-0.725
 Wolf-0.717
 Sleep-0.692
oo-0.678
Token this
Token ID428
Feature activation-3.14e+00

Pos logit contributions
 dice+0.040
ents+0.039
 contain+0.039
 else+0.038
ooters+0.036
Neg logit contributions
uck-0.021
 Cow-0.020
 Wolf-0.019
oo-0.019
 wolf-0.018
Token clean
Token ID3424
Feature activation+1.49e+00

Pos logit contributions
ents+0.320
 contain+0.294
 dice+0.293
 else+0.284
ooters+0.278
Neg logit contributions
uck-0.182
 Wolf-0.177
 wolf-0.177
 Cow-0.175
 Sleep-0.168
Token treasure
Token ID14068
Feature activation-3.05e+00

Pos logit contributions
uck+0.116
 Cow+0.110
oo+0.109
 Wolf+0.107
burn+0.102
Neg logit contributions
 else-0.128
ents-0.127
 dice-0.126
 contain-0.119
 clearly-0.113
Token!
Token ID0
Feature activation+3.63e-01

Pos logit contributions
ents+0.311
 dice+0.309
 contain+0.297
ooters+0.288
 else+0.287
Neg logit contributions
uck-0.160
 Wolf-0.148
 Cow-0.144
 wolf-0.142
 Sleep-0.139
Token that
Token ID326
Feature activation+1.62e+00

Pos logit contributions
uck+0.167
oo+0.154
 Cow+0.153
ivered+0.152
 Wolf+0.150
Neg logit contributions
 else-0.162
ents-0.158
 dice-0.148
 contain-0.142
 clearly-0.141
Token there
Token ID612
Feature activation+2.53e+00

Pos logit contributions
uck+0.129
 Cow+0.121
oo+0.121
 Wolf+0.118
ivered+0.115
Neg logit contributions
 else-0.137
ents-0.135
 dice-0.133
 contain-0.126
 clearly-0.120
Token was
Token ID373
Feature activation+5.76e-02

Pos logit contributions
uck+0.138
 Wolf+0.137
 Cow+0.137
oo+0.122
 Sleep+0.120
Neg logit contributions
ents-0.276
 dice-0.271
 contain-0.252
 else-0.243
ooters-0.236
Token no
Token ID645
Feature activation+2.83e+00

Pos logit contributions
uck+0.004
oo+0.004
ivered+0.004
 Cow+0.004
burn+0.004
Neg logit contributions
 else-0.005
ents-0.005
 dice-0.005
 clearly-0.005
 contain-0.005
Token one
Token ID530
Feature activation+1.96e-02

Pos logit contributions
uck+0.217
 Cow+0.202
oo+0.201
 Wolf+0.196
ivered+0.192
Neg logit contributions
 else-0.248
ents-0.243
 dice-0.242
 contain-0.229
 clearly-0.218
Token else
Token ID2073
Feature activation-1.26e+01

Pos logit contributions
 Wolf+0.001
 Cow+0.001
uck+0.001
 wolf+0.001
 Squirrel+0.001
Neg logit contributions
ents-0.002
 dice-0.002
 contain-0.002
cases-0.002
ows-0.002
Token near
Token ID1474
Feature activation-5.48e-01

Pos logit contributions
 dice+1.200
ents+1.170
 contain+1.131
 else+1.066
 clearly+1.018
Neg logit contributions
uck-0.847
 Cow-0.820
 Wolf-0.808
oo-0.772
 wolf-0.732
Token her
Token ID607
Feature activation+5.72e-01

Pos logit contributions
 else+0.047
ents+0.046
 dice+0.045
 contain+0.043
 clearly+0.041
Neg logit contributions
uck-0.043
oo-0.040
 Cow-0.039
ivered-0.039
 Wolf-0.038
Token.
Token ID13
Feature activation-3.32e-01

Pos logit contributions
 Cow+0.036
 Wolf+0.035
uck+0.035
oo+0.032
 wolf+0.032
Neg logit contributions
 dice-0.057
ents-0.056
 else-0.055
 contain-0.054
 clearly-0.049
Token
Token ID198
Feature activation-7.91e-01

Pos logit contributions
ents+0.029
 else+0.026
 dice+0.026
 clearly+0.024
 contain+0.024
Neg logit contributions
burn-0.026
ivered-0.025
eled-0.025
oo-0.024
 Amen-0.024
Token
Token ID198
Feature activation-7.98e-01

Pos logit contributions
ents+0.082
 dice+0.082
 else+0.081
 contain+0.078
 clearly+0.074
Neg logit contributions
uck-0.047
 Cow-0.044
oo-0.044
 Wolf-0.043
P-0.039
Token give
Token ID1577
Feature activation-3.54e-01

Pos logit contributions
 else+0.023
 contain+0.023
 dice+0.022
ents+0.022
 clearly+0.020
Neg logit contributions
uck-0.032
 Cow-0.031
ivered-0.030
 Wolf-0.030
P-0.030
Token it
Token ID340
Feature activation+1.20e+00

Pos logit contributions
 else+0.030
 dice+0.027
ents+0.027
 contain+0.026
 clearly+0.026
Neg logit contributions
uck-0.028
ivered-0.028
 Cow-0.027
oo-0.027
 Wolf-0.027
Token away
Token ID1497
Feature activation+2.43e+00

Pos logit contributions
uck+0.084
 Cow+0.079
oo+0.077
 Wolf+0.076
burn+0.072
Neg logit contributions
ents-0.112
 else-0.111
 dice-0.111
 contain-0.105
 clearly-0.101
Token to
Token ID284
Feature activation-3.64e-01

Pos logit contributions
uck+0.153
 Cow+0.153
 Wolf+0.146
oo+0.142
 Sleep+0.133
Neg logit contributions
 dice-0.237
ents-0.235
 else-0.223
 contain-0.210
ooters-0.202
Token someone
Token ID2130
Feature activation+1.93e+00

Pos logit contributions
 else+0.032
ents+0.031
 dice+0.031
 contain+0.030
 clearly+0.028
Neg logit contributions
uck-0.028
 Cow-0.027
oo-0.026
 Wolf-0.026
burn-0.025
Token else
Token ID2073
Feature activation-1.26e+01

Pos logit contributions
uck+0.193
 Cow+0.190
 Wolf+0.183
 wolf+0.181
 chick+0.179
Neg logit contributions
 dice-0.112
ents-0.102
 contain-0.084
ooters-0.081
nine-0.078
Token.
Token ID13
Feature activation-3.32e-01

Pos logit contributions
 dice+1.186
ents+1.178
 else+1.089
 contain+1.083
ooters+1.012
Neg logit contributions
uck-0.879
 Cow-0.826
oo-0.806
 Wolf-0.802
 Sleep-0.748
Token They
Token ID1119
Feature activation+1.72e+00

Pos logit contributions
ents+0.033
 else+0.033
 dice+0.033
 contain+0.031
 clearly+0.030
Neg logit contributions
uck-0.021
 Cow-0.020
oo-0.020
 Wolf-0.019
burn-0.019
Token used
Token ID973
Feature activation-3.19e+00

Pos logit contributions
uck+0.117
 Cow+0.115
 Wolf+0.110
oo+0.110
eled+0.104
Neg logit contributions
 dice-0.159
ents-0.159
 else-0.158
 contain-0.148
 clearly-0.142
Token the
Token ID262
Feature activation-5.12e-01

Pos logit contributions
 else+0.290
ents+0.288
 dice+0.285
 contain+0.271
 clearly+0.257
Neg logit contributions
uck-0.231
 Cow-0.220
oo-0.216
 Wolf-0.213
burn-0.203
Token money
Token ID1637
Feature activation-2.74e+00

Pos logit contributions
ents+0.047
 else+0.045
 dice+0.043
 contain+0.042
 clearly+0.041
Neg logit contributions
uck-0.037
 Cow-0.036
 Wolf-0.035
oo-0.034
burn-0.033
Token away
Token ID1497
Feature activation+2.44e+00

Pos logit contributions
uck+0.055
 Wolf+0.050
 Cow+0.049
oo+0.048
P+0.046
Neg logit contributions
ents-0.075
 else-0.072
 dice-0.072
 contain-0.067
 clearly-0.063
Token but
Token ID475
Feature activation-2.33e+00

Pos logit contributions
uck+0.161
 Cow+0.156
 Wolf+0.155
oo+0.148
 wolf+0.143
Neg logit contributions
 dice-0.233
 else-0.227
ents-0.226
 contain-0.212
ooters-0.205
Token he
Token ID339
Feature activation-1.67e+00

Pos logit contributions
ents+0.195
 dice+0.193
 else+0.191
 contain+0.180
 clearly+0.169
Neg logit contributions
uck-0.186
 Cow-0.180
 Wolf-0.176
oo-0.173
ivered-0.162
Token noticed
Token ID6810
Feature activation+2.25e+00

Pos logit contributions
 else+0.148
ents+0.144
 dice+0.140
 contain+0.133
 clearly+0.129
Neg logit contributions
uck-0.128
 Cow-0.121
oo-0.119
 Wolf-0.117
burn-0.116
Token something
Token ID1223
Feature activation+3.05e+00

Pos logit contributions
uck+0.170
oo+0.158
 Cow+0.158
burn+0.152
 Wolf+0.152
Neg logit contributions
 else-0.204
ents-0.199
 dice-0.194
 contain-0.181
 clearly-0.181
Token else
Token ID2073
Feature activation-1.26e+01

Pos logit contributions
uck+0.201
 wolf+0.200
 Wolf+0.200
 Cow+0.197
oo+0.186
Neg logit contributions
ents-0.284
 dice-0.281
 contain-0.255
ooters-0.246
 wins-0.235
Token on
Token ID319
Feature activation+2.53e+00

Pos logit contributions
 dice+1.234
ents+1.221
 contain+1.164
 else+1.119
ooters+1.066
Neg logit contributions
uck-0.814
 Cow-0.781
 Wolf-0.779
oo-0.739
 wolf-0.724
Token the
Token ID262
Feature activation-5.01e-01

Pos logit contributions
uck+0.235
ivered+0.219
oo+0.217
 Cow+0.210
burn+0.208
Neg logit contributions
 dice-0.187
 else-0.185
ents-0.180
 contain-0.168
 clearly-0.167
Token island
Token ID7022
Feature activation-2.07e+00

Pos logit contributions
ents+0.048
 else+0.047
 dice+0.046
 clearly+0.042
 contain+0.042
Neg logit contributions
 Cow-0.033
uck-0.032
 Wolf-0.032
 wolf-0.031
oo-0.030
Token.
Token ID13
Feature activation-3.41e-01

Pos logit contributions
 dice+0.194
ents+0.192
 else+0.187
 contain+0.180
ooters+0.166
Neg logit contributions
uck-0.140
 Cow-0.132
 Wolf-0.128
oo-0.127
 wolf-0.121
Token It
Token ID632
Feature activation+1.13e+00

Pos logit contributions
ents+0.031
 dice+0.030
 else+0.028
 contain+0.027
 clearly+0.026
Neg logit contributions
burn-0.026
uck-0.026
oo-0.026
ivered-0.025
P-0.024
Token he
Token ID339
Feature activation-1.66e+00

Pos logit contributions
 else+0.091
ents+0.077
 clearly+0.072
 dice+0.070
 contain+0.066
Neg logit contributions
uck-0.171
burn-0.163
ivered-0.161
oo-0.160
P-0.153
Token decided
Token ID3066
Feature activation+1.38e+00

Pos logit contributions
 else+0.145
ents+0.143
 dice+0.134
 contain+0.129
 clearly+0.125
Neg logit contributions
uck-0.131
oo-0.124
 Cow-0.120
 Wolf-0.120
burn-0.119
Token to
Token ID284
Feature activation-3.59e-01

Pos logit contributions
uck+0.120
oo+0.116
 Cow+0.115
 Wolf+0.113
ivered+0.112
Neg logit contributions
 else-0.107
ents-0.105
 dice-0.100
 contain-0.096
 clearly-0.092
Token collect
Token ID2824
Feature activation-2.67e-01

Pos logit contributions
 else+0.023
 contain+0.022
 dice+0.022
ents+0.021
 count+0.019
Neg logit contributions
uck-0.035
ivered-0.034
 Cow-0.034
eled-0.034
burn-0.034
Token something
Token ID1223
Feature activation+3.05e+00

Pos logit contributions
ents+0.026
 dice+0.026
 else+0.026
 contain+0.025
cases+0.023
Neg logit contributions
uck-0.016
 Cow-0.016
 Wolf-0.015
oo-0.014
P-0.014
Token else
Token ID2073
Feature activation-1.26e+01

Pos logit contributions
uck+0.231
P+0.210
 Cow+0.207
 Squirrel+0.203
 wolf+0.203
Neg logit contributions
 dice-0.241
ents-0.233
cases-0.211
 contain-0.209
ows-0.207
Token instead
Token ID2427
Feature activation+1.32e+00

Pos logit contributions
 dice+1.316
ents+1.299
 contain+1.222
cases+1.170
 else+1.160
Neg logit contributions
uck-0.711
 Cow-0.602
 Squirrel-0.569
 Wolf-0.568
eled-0.551
Token.
Token ID13
Feature activation-3.30e-01

Pos logit contributions
uck+0.102
 Cow+0.097
oo+0.097
 Wolf+0.095
burn+0.088
Neg logit contributions
 dice-0.114
ents-0.112
 else-0.111
 contain-0.105
 clearly-0.098
Token He
Token ID679
Feature activation+6.77e-01

Pos logit contributions
ents+0.033
 else+0.032
 dice+0.032
 contain+0.031
 clearly+0.029
Neg logit contributions
uck-0.021
oo-0.020
 Cow-0.020
burn-0.019
 Wolf-0.019
Token saw
Token ID2497
Feature activation+1.05e+00

Pos logit contributions
uck+0.046
 Cow+0.044
 Wolf+0.042
oo+0.042
P+0.040
Neg logit contributions
 else-0.063
 dice-0.063
ents-0.062
 contain-0.061
 clearly-0.057
Token a
Token ID257
Feature activation+3.97e-01

Pos logit contributions
uck+0.070
 Cow+0.066
oo+0.064
 Wolf+0.064
burn+0.060
Neg logit contributions
 else-0.101
ents-0.101
 dice-0.100
 contain-0.096
 clearly-0.091
Token ignore
Token ID8856
Feature activation-3.41e-01

Pos logit contributions
ents+0.028
 dice+0.028
 else+0.028
 contain+0.026
 clearly+0.024
Neg logit contributions
uck-0.029
 Cow-0.028
oo-0.027
 Wolf-0.027
burn-0.025
Token danger
Token ID3514
Feature activation+3.79e+00

Pos logit contributions
 else+0.029
ents+0.028
 dice+0.028
 contain+0.026
 clearly+0.025
Neg logit contributions
uck-0.027
 Cow-0.026
oo-0.025
 Wolf-0.025
ivered-0.024
Token signs
Token ID5895
Feature activation-4.98e+00

Pos logit contributions
uck+0.239
oo+0.219
 Wolf+0.218
 Cow+0.213
over+0.208
Neg logit contributions
 dice-0.372
ents-0.368
 else-0.346
 contain-0.342
ooters-0.317
Token,
Token ID11
Feature activation+3.69e-01

Pos logit contributions
 dice+0.447
ents+0.431
 else+0.429
 contain+0.410
 clearly+0.378
Neg logit contributions
uck-0.358
oo-0.330
 Cow-0.327
 Wolf-0.324
over-0.313
Token or
Token ID393
Feature activation-3.75e+00

Pos logit contributions
 Cow+0.030
uck+0.029
 Wolf+0.029
oo+0.028
 Sleep+0.026
Neg logit contributions
 dice-0.029
ents-0.029
 else-0.028
 contain-0.027
ows-0.025
Token else
Token ID2073
Feature activation-1.26e+01

Pos logit contributions
 dice+0.315
ents+0.313
 else+0.298
 contain+0.289
 clearly+0.272
Neg logit contributions
uck-0.285
 Cow-0.284
 Wolf-0.275
oo-0.266
 Sleep-0.253
Token we
Token ID356
Feature activation-2.53e+00

Pos logit contributions
ents+1.024
 dice+1.013
 else+1.011
 contain+0.956
 clearly+0.894
Neg logit contributions
uck-1.032
 Cow-0.988
oo-0.968
 Wolf-0.963
burn-0.906
Token might
Token ID1244
Feature activation-1.49e+00

Pos logit contributions
 else+0.197
ents+0.195
 dice+0.193
 contain+0.182
 clearly+0.171
Neg logit contributions
uck-0.215
 Cow-0.206
oo-0.202
 Wolf-0.200
burn-0.192
Token end
Token ID886
Feature activation-2.02e+00

Pos logit contributions
ents+0.115
 dice+0.114
 else+0.113
 contain+0.104
 clearly+0.098
Neg logit contributions
uck-0.128
 Cow-0.123
oo-0.121
 Wolf-0.120
burn-0.115
Token up
Token ID510
Feature activation-6.15e-01

Pos logit contributions
ents+0.175
 else+0.174
 dice+0.173
 contain+0.165
 clearly+0.153
Neg logit contributions
uck-0.154
 Cow-0.147
oo-0.144
 Wolf-0.143
burn-0.135
Token with
Token ID351
Feature activation+8.36e-01

Pos logit contributions
 else+0.051
ents+0.050
 dice+0.049
 contain+0.047
 clearly+0.045
Neg logit contributions
uck-0.050
 Cow-0.047
oo-0.047
 Wolf-0.046
burn-0.044
Token He
Token ID679
Feature activation+7.09e-01

Pos logit contributions
 else+0.032
ents+0.032
 dice+0.031
 contain+0.030
 clearly+0.029
Neg logit contributions
uck-0.016
 Cow-0.015
 Wolf-0.015
oo-0.015
burn-0.013
Token wanted
Token ID2227
Feature activation+8.33e-01

Pos logit contributions
uck+0.040
 Cow+0.039
 Wolf+0.038
eled+0.037
oo+0.037
Neg logit contributions
 dice-0.071
 contain-0.068
ents-0.065
ooters-0.064
ows-0.064
Token to
Token ID284
Feature activation-3.59e-01

Pos logit contributions
 Wolf+0.045
uck+0.045
 Cow+0.044
oo+0.039
 wolf+0.038
Neg logit contributions
 dice-0.088
ents-0.088
 contain-0.085
ows-0.084
 else-0.082
Token try
Token ID1949
Feature activation+2.67e-01

Pos logit contributions
ents+0.029
 dice+0.029
 else+0.027
cases+0.027
ows+0.027
Neg logit contributions
uck-0.029
oo-0.028
 Cow-0.027
 Wolf-0.027
 Squirrel-0.026
Token something
Token ID1223
Feature activation+3.05e+00

Pos logit contributions
uck+0.016
 Cow+0.016
oo+0.015
 Wolf+0.015
burn+0.014
Neg logit contributions
 dice-0.027
 contain-0.027
ents-0.026
 else-0.026
ows-0.024
Token else
Token ID2073
Feature activation-1.26e+01

Pos logit contributions
uck+0.185
 Wolf+0.167
 Cow+0.166
oo+0.163
 wolf+0.158
Neg logit contributions
 dice-0.304
ents-0.304
cases-0.261
 contain-0.258
ooters-0.256
Token,
Token ID11
Feature activation+3.54e-01

Pos logit contributions
 dice+1.475
ents+1.427
 contain+1.323
ooters+1.258
saw+1.229
Neg logit contributions
uck-0.605
 Wolf-0.576
 Cow-0.547
oo-0.511
 Squirrel-0.507
Token so
Token ID523
Feature activation-1.54e+00

Pos logit contributions
 Cow+0.027
 Wolf+0.026
uck+0.026
 Squirrel+0.025
 wolf+0.025
Neg logit contributions
ents-0.029
 dice-0.029
 else-0.027
ows-0.026
 contain-0.025
Token he
Token ID339
Feature activation-1.69e+00

Pos logit contributions
 else+0.115
ents+0.114
 dice+0.112
 contain+0.105
 clearly+0.099
Neg logit contributions
uck-0.138
 Cow-0.132
oo-0.130
 Wolf-0.129
burn-0.124
Token decided
Token ID3066
Feature activation+1.34e+00

Pos logit contributions
 dice+0.158
ents+0.154
 contain+0.152
 else+0.151
ows+0.140
Neg logit contributions
uck-0.113
 Cow-0.107
P-0.103
oo-0.103
 Wolf-0.102
Token to
Token ID284
Feature activation-3.95e-01

Pos logit contributions
uck+0.090
 Cow+0.084
 Wolf+0.082
oo+0.081
burn+0.075
Neg logit contributions
 dice-0.129
ents-0.127
 else-0.123
 contain-0.121
ows-0.112
Token so
Token ID523
Feature activation-1.36e+00

Pos logit contributions
uck+0.018
 Cow+0.018
oo+0.017
 Wolf+0.017
burn+0.016
Neg logit contributions
 else-0.020
ents-0.020
 dice-0.020
 contain-0.018
 clearly-0.017
Token frustrated
Token ID14718
Feature activation-1.01e+00

Pos logit contributions
 else+0.115
ents+0.108
 contain+0.099
 clearly+0.097
 dice+0.097
Neg logit contributions
 Cow-0.114
burn-0.114
uck-0.113
 Wolf-0.111
oo-0.108
Token,
Token ID11
Feature activation+5.37e-01

Pos logit contributions
 dice+0.103
ents+0.101
 contain+0.101
 else+0.097
ooters+0.091
Neg logit contributions
uck-0.062
 Wolf-0.058
oo-0.057
 Cow-0.056
 wolf-0.051
Token he
Token ID339
Feature activation-1.51e+00

Pos logit contributions
uck+0.051
 Cow+0.049
oo+0.048
 Wolf+0.048
burn+0.046
Neg logit contributions
 else-0.037
ents-0.036
 dice-0.036
 contain-0.034
 clearly-0.031
Token y
Token ID331
Feature activation+1.46e+00

Pos logit contributions
 else+0.138
ents+0.136
 dice+0.133
 contain+0.127
 clearly+0.121
Neg logit contributions
uck-0.111
 Cow-0.106
oo-0.104
 Wolf-0.102
burn-0.099
Tokenanked
Token ID14076
Feature activation+5.94e+00

Pos logit contributions
 Amen+0.101
 Cow+0.097
 Wolf+0.093
 Sleep+0.088
 wolf+0.088
Neg logit contributions
ents-0.150
 else-0.135
 contain-0.125
 clearly-0.120
 dice-0.118
Token the
Token ID262
Feature activation-3.36e-01

Pos logit contributions
uck+0.424
 Cow+0.403
oo+0.394
 Wolf+0.391
burn+0.368
Neg logit contributions
 else-0.546
ents-0.544
 dice-0.538
 contain-0.513
 clearly-0.484
Token strong
Token ID1913
Feature activation+1.00e+00

Pos logit contributions
 else+0.029
ents+0.028
 dice+0.028
 contain+0.027
 clearly+0.025
Neg logit contributions
uck-0.026
 Cow-0.025
oo-0.025
 Wolf-0.025
burn-0.024
Token sun
Token ID4252
Feature activation+8.16e-01

Pos logit contributions
uck+0.078
 Cow+0.075
 Wolf+0.075
oo+0.074
P+0.072
Neg logit contributions
ents-0.084
 else-0.082
 dice-0.082
 contain-0.079
 clearly-0.074
Tokenflower
Token ID25547
Feature activation+3.08e+00

Pos logit contributions
uck+0.047
 Cow+0.044
oo+0.043
 Wolf+0.043
P+0.040
Neg logit contributions
ents-0.085
 else-0.085
 dice-0.083
 contain-0.081
 clearly-0.077
Token out
Token ID503
Feature activation-3.90e-01

Pos logit contributions
uck+0.228
 Cow+0.212
 Wolf+0.207
oo+0.206
burn+0.199
Neg logit contributions
 dice-0.277
ents-0.274
 else-0.273
 contain-0.254
 clearly-0.244
Token snow
Token ID6729
Feature activation+7.25e-01

Pos logit contributions
uck+0.304
 Cow+0.293
 Wolf+0.285
oo+0.283
ivered+0.271
Neg logit contributions
 else-0.371
ents-0.364
 dice-0.360
 contain-0.336
 clearly-0.327
Token angels
Token ID21981
Feature activation-9.12e-01

Pos logit contributions
uck+0.060
oo+0.057
 Cow+0.056
 Wolf+0.055
burn+0.055
Neg logit contributions
 else-0.059
ents-0.058
 dice-0.058
 contain-0.055
 clearly-0.051
Token.
Token ID13
Feature activation-2.95e-02

Pos logit contributions
 dice+0.084
 else+0.081
ents+0.080
 contain+0.076
 clearly+0.071
Neg logit contributions
uck-0.064
oo-0.061
burn-0.061
 Cow-0.060
 Wolf-0.057
Token
Token ID198
Feature activation-4.79e-01

Pos logit contributions
ents+0.003
 else+0.003
 dice+0.003
 contain+0.002
 clearly+0.002
Neg logit contributions
uck-0.002
oo-0.002
ivered-0.002
P-0.002
burn-0.002
TokenM
Token ID44
Feature activation+5.48e+00

Pos logit contributions
 else+0.046
ents+0.045
 dice+0.044
 contain+0.042
 clearly+0.041
Neg logit contributions
uck-0.032
 Cow-0.030
 Wolf-0.030
oo-0.029
burn-0.029
Tokenama
Token ID1689
Feature activation+5.92e+00

Pos logit contributions
uck+0.173
 Cow+0.155
 Wolf+0.150
burn+0.139
 Sleep+0.137
Neg logit contributions
ents-0.715
 else-0.705
 dice-0.682
 contain-0.662
 clearly-0.653
Token said
Token ID531
Feature activation-9.66e-01

Pos logit contributions
uck+0.456
 Cow+0.446
oo+0.434
 Wolf+0.429
eled+0.400
Neg logit contributions
 dice-0.503
 else-0.492
ents-0.485
 contain-0.485
 clearly-0.442
Token,
Token ID11
Feature activation+6.58e-01

Pos logit contributions
 else+0.085
ents+0.078
 dice+0.076
 contain+0.074
 clearly+0.070
Neg logit contributions
uck-0.074
ivered-0.072
 Cow-0.072
burn-0.071
P-0.070
Token "
Token ID366
Feature activation-6.39e-02

Pos logit contributions
uck+0.057
ivered+0.054
burn+0.052
eled+0.052
oo+0.051
Neg logit contributions
 else-0.051
ents-0.051
 dice-0.048
 contain-0.045
 clearly-0.044
TokenLet
Token ID5756
Feature activation+3.76e+00

Pos logit contributions
 else+0.007
ents+0.007
 dice+0.007
 contain+0.006
 clearly+0.006
Neg logit contributions
uck-0.004
 Cow-0.004
oo-0.004
 Wolf-0.003
ivered-0.003
Token's
Token ID338
Feature activation-3.05e+00

Pos logit contributions
uck+0.386
 Cow+0.384
oo+0.379
 Wolf+0.376
ivered+0.373
Neg logit contributions
 else-0.224
ents-0.199
 dice-0.189
 contain-0.180
 clearly-0.168
Token closer
Token ID5699
Feature activation-3.55e+00

Pos logit contributions
ents+0.078
 else+0.077
 dice+0.074
 contain+0.072
 clearly+0.066
Neg logit contributions
 Cow-0.092
uck-0.090
ivered-0.090
oo-0.088
 Wolf-0.088
Token.
Token ID13
Feature activation-1.65e-01

Pos logit contributions
 dice+0.365
 else+0.346
ents+0.345
 contain+0.329
ooters+0.308
Neg logit contributions
uck-0.224
 Cow-0.202
oo-0.199
 Wolf-0.198
burn-0.178
Token
Token ID198
Feature activation-6.14e-01

Pos logit contributions
ents+0.012
 else+0.012
 clearly+0.011
 dice+0.011
 contain+0.010
Neg logit contributions
ivered-0.014
oo-0.014
burn-0.014
uck-0.014
P-0.013
Token
Token ID198
Feature activation-6.29e-01

Pos logit contributions
 else+0.056
ents+0.055
 dice+0.053
 contain+0.050
 clearly+0.049
Neg logit contributions
uck-0.044
 Cow-0.042
burn-0.041
oo-0.041
 Wolf-0.041
TokenM
Token ID44
Feature activation+5.33e+00

Pos logit contributions
 else+0.059
ents+0.056
 dice+0.052
 contain+0.050
 clearly+0.048
Neg logit contributions
uck-0.044
burn-0.043
oo-0.042
ivered-0.042
 Wolf-0.041
Tokenama
Token ID1689
Feature activation+5.77e+00

Pos logit contributions
 Sleep+0.241
 Amen+0.234
burn+0.222
Look+0.220
P+0.218
Neg logit contributions
ents-0.602
 else-0.560
 clearly-0.522
 contain-0.499
 dice-0.490
Token smiled
Token ID13541
Feature activation-2.54e+00

Pos logit contributions
uck+0.476
 Cow+0.459
oo+0.448
 Wolf+0.446
burn+0.423
Neg logit contributions
 else-0.458
 dice-0.455
ents-0.454
 contain-0.432
 clearly-0.402
Token.
Token ID13
Feature activation-1.82e-01

Pos logit contributions
 else+0.174
ents+0.173
 clearly+0.164
 contain+0.151
 dice+0.144
Neg logit contributions
uck-0.235
oo-0.231
 Cow-0.229
P-0.224
 Wolf-0.223
Token "
Token ID366
Feature activation-2.33e-01

Pos logit contributions
ents+0.015
 else+0.014
 clearly+0.013
 dice+0.013
 contain+0.012
Neg logit contributions
uck-0.015
oo-0.015
ivered-0.014
burn-0.014
P-0.014
TokenHe
Token ID1544
Feature activation-2.90e-01

Pos logit contributions
 else+0.023
ents+0.023
 dice+0.022
 contain+0.022
 clearly+0.021
Neg logit contributions
uck-0.015
oo-0.014
 Cow-0.014
 Wolf-0.014
ivered-0.013
Token's
Token ID338
Feature activation-3.22e+00

Pos logit contributions
 else+0.022
ents+0.022
 dice+0.022
 contain+0.020
 clearly+0.019
Neg logit contributions
uck-0.025
 Cow-0.024
oo-0.024
 Wolf-0.023
burn-0.023
Token she
Token ID673
Feature activation-1.83e+00

Pos logit contributions
uck+0.052
 Cow+0.050
oo+0.049
 Wolf+0.048
ivered+0.047
Neg logit contributions
 else-0.044
ents-0.044
 dice-0.043
 contain-0.041
 clearly-0.038
Token learned
Token ID4499
Feature activation-2.07e-02

Pos logit contributions
 dice+0.172
ents+0.168
 else+0.166
 contain+0.163
 clearly+0.150
Neg logit contributions
uck-0.126
 Cow-0.122
 Wolf-0.116
oo-0.116
eled-0.109
Token all
Token ID477
Feature activation+1.58e+00

Pos logit contributions
 else+0.002
ents+0.002
 dice+0.002
 contain+0.002
 clearly+0.002
Neg logit contributions
uck-0.001
 Cow-0.001
oo-0.001
 Wolf-0.001
burn-0.001
Token about
Token ID546
Feature activation-2.78e+00

Pos logit contributions
uck+0.152
 Cow+0.147
oo+0.144
 Wolf+0.143
burn+0.138
Neg logit contributions
ents-0.105
 else-0.105
 dice-0.103
 contain-0.096
 clearly-0.089
Token the
Token ID262
Feature activation-2.69e-01

Pos logit contributions
 else+0.248
ents+0.242
 dice+0.239
 contain+0.228
 clearly+0.221
Neg logit contributions
uck-0.209
 Cow-0.192
oo-0.192
 Wolf-0.187
ivered-0.184
Token soup
Token ID17141
Feature activation+5.74e+00

Pos logit contributions
 else+0.025
 dice+0.025
ents+0.024
 contain+0.023
 clearly+0.022
Neg logit contributions
uck-0.019
oo-0.018
 Cow-0.018
 Wolf-0.017
ivered-0.017
Token kitchen
Token ID9592
Feature activation+3.34e+00

Pos logit contributions
uck+0.388
 Cow+0.360
 Wolf+0.347
oo+0.345
 Sleep+0.332
Neg logit contributions
 dice-0.550
ents-0.546
 else-0.507
 contain-0.507
ooters-0.473
Token.
Token ID13
Feature activation-1.13e-01

Pos logit contributions
uck+0.232
 Cow+0.216
oo+0.212
 Wolf+0.207
burn+0.200
Neg logit contributions
ents-0.309
 dice-0.307
 else-0.300
 contain-0.289
 clearly-0.264
Token She
Token ID1375
Feature activation+1.43e+00

Pos logit contributions
ents+0.010
 else+0.010
 dice+0.010
 clearly+0.009
 contain+0.009
Neg logit contributions
uck-0.008
oo-0.008
ivered-0.007
burn-0.007
 Cow-0.007
Token saw
Token ID2497
Feature activation+1.28e+00

Pos logit contributions
uck+0.098
 Cow+0.094
oo+0.090
 Wolf+0.089
eled+0.085
Neg logit contributions
 dice-0.135
ents-0.132
 else-0.130
 contain-0.128
ooters-0.119
Token her
Token ID607
Feature activation+7.80e-01

Pos logit contributions
uck+0.091
 Cow+0.087
oo+0.085
 Wolf+0.084
burn+0.080
Neg logit contributions
 else-0.117
ents-0.117
 dice-0.115
 contain-0.110
 clearly-0.104
Token went
Token ID1816
Feature activation+1.89e+00

Pos logit contributions
 else+0.017
ents+0.017
 dice+0.016
 contain+0.016
 clearly+0.015
Neg logit contributions
uck-0.016
 Cow-0.016
oo-0.015
 Wolf-0.015
burn-0.015
Token to
Token ID284
Feature activation-1.74e-01

Pos logit contributions
uck+0.140
 Cow+0.135
oo+0.132
 Wolf+0.131
ivered+0.128
Neg logit contributions
ents-0.165
 else-0.164
 dice-0.162
 contain-0.155
 clearly-0.146
Token bed
Token ID3996
Feature activation+1.70e+00

Pos logit contributions
 else+0.015
ents+0.015
 dice+0.015
 contain+0.014
 clearly+0.013
Neg logit contributions
uck-0.013
 Cow-0.013
oo-0.013
 Wolf-0.012
burn-0.012
Token with
Token ID351
Feature activation+1.05e+00

Pos logit contributions
uck+0.129
 Cow+0.123
 Wolf+0.121
oo+0.118
P+0.116
Neg logit contributions
 else-0.148
ents-0.145
 dice-0.140
 contain-0.139
 clearly-0.133
Token a
Token ID257
Feature activation+6.17e-01

Pos logit contributions
uck+0.079
oo+0.074
 Cow+0.072
ivered+0.071
 Wolf+0.069
Neg logit contributions
 else-0.097
ents-0.091
 dice-0.091
 contain-0.090
 clearly-0.084
Token belly
Token ID19921
Feature activation+7.92e+00

Pos logit contributions
uck+0.050
 Cow+0.048
oo+0.047
 Wolf+0.047
burn+0.045
Neg logit contributions
ents-0.050
 else-0.050
 dice-0.049
 contain-0.047
 clearly-0.044
Token full
Token ID1336
Feature activation+3.68e+00

Pos logit contributions
uck+0.678
 Cow+0.643
oo+0.638
 Wolf+0.637
P+0.631
Neg logit contributions
ents-0.626
 else-0.612
 dice-0.579
 contain-0.576
 clearly-0.542
Token of
Token ID286
Feature activation-9.71e-01

Pos logit contributions
uck+0.352
 Cow+0.339
oo+0.334
 Wolf+0.332
burn+0.323
Neg logit contributions
 else-0.251
ents-0.246
 dice-0.239
 contain-0.234
 clearly-0.211
Token cake
Token ID12187
Feature activation-4.78e-01

Pos logit contributions
 else+0.081
ents+0.079
 dice+0.078
 contain+0.074
 clearly+0.070
Neg logit contributions
uck-0.079
 Cow-0.076
oo-0.075
 Wolf-0.074
burn-0.071
Token and
Token ID290
Feature activation-2.03e-01

Pos logit contributions
 dice+0.044
ents+0.043
 else+0.043
 contain+0.041
 clearly+0.039
Neg logit contributions
uck-0.033
 Cow-0.031
oo-0.030
burn-0.030
 Wolf-0.030
Token a
Token ID257
Feature activation+5.93e-01

Pos logit contributions
 else+0.019
ents+0.018
 dice+0.018
 contain+0.017
 clearly+0.016
Neg logit contributions
uck-0.015
 Cow-0.014
oo-0.014
 Wolf-0.013
burn-0.013
Token m
Token ID285
Feature activation+1.11e+00

Pos logit contributions
uck+0.036
 Cow+0.036
 Wolf+0.034
oo+0.034
burn+0.032
Neg logit contributions
 else-0.045
 dice-0.044
ents-0.044
 contain-0.043
 clearly-0.040
Tokenister
Token ID1694
Feature activation+2.40e+00

Pos logit contributions
uck+0.071
 Cow+0.069
 Sleep+0.069
P+0.068
 Wolf+0.068
Neg logit contributions
ents-0.105
 else-0.104
 dice-0.099
 contain-0.096
ooters-0.092
Token.
Token ID13
Feature activation-1.94e-01

Pos logit contributions
uck+0.177
 Cow+0.171
oo+0.167
 Wolf+0.165
burn+0.155
Neg logit contributions
 dice-0.213
 else-0.212
ents-0.211
 contain-0.204
 clearly-0.190
Token We
Token ID775
Feature activation-4.28e-02

Pos logit contributions
 else+0.014
ents+0.014
 dice+0.014
 clearly+0.013
 wins+0.012
Neg logit contributions
uck-0.017
P-0.016
ivered-0.016
eled-0.016
oo-0.016
Token like
Token ID588
Feature activation-3.02e+00

Pos logit contributions
 else+0.003
ents+0.003
 dice+0.003
 contain+0.003
 clearly+0.002
Neg logit contributions
uck-0.004
 Cow-0.004
oo-0.004
 Wolf-0.004
burn-0.004
Token to
Token ID284
Feature activation-2.50e-01

Pos logit contributions
ents+0.281
 contain+0.279
 dice+0.276
 else+0.273
 clearly+0.255
Neg logit contributions
uck-0.204
 Cow-0.202
 Wolf-0.191
oo-0.189
burn-0.177
Token play
Token ID711
Feature activation+3.73e-01

Pos logit contributions
 else+0.020
 dice+0.019
ents+0.019
 contain+0.018
 clearly+0.017
Neg logit contributions
uck-0.021
 Cow-0.020
 Wolf-0.020
oo-0.020
ivered-0.019
Token with
Token ID351
Feature activation+9.64e-01

Pos logit contributions
uck+0.028
 Cow+0.027
oo+0.026
 Wolf+0.026
burn+0.025
Neg logit contributions
ents-0.033
 else-0.033
 dice-0.032
 contain-0.031
 clearly-0.029
Token hoops
Token ID46730
Feature activation-8.28e+00

Pos logit contributions
uck+0.064
 Cow+0.063
oo+0.061
 Wolf+0.060
P+0.055
Neg logit contributions
ents-0.091
 contain-0.091
 dice-0.089
 else-0.087
 clearly-0.082
Token,
Token ID11
Feature activation+4.60e-01

Pos logit contributions
 dice+0.845
 contain+0.825
ents+0.822
 else+0.798
 clearly+0.793
Neg logit contributions
uck-0.453
P-0.387
oo-0.374
 Cow-0.374
 Sleep-0.357
Token too
Token ID1165
Feature activation-1.28e+00

Pos logit contributions
 Cow+0.040
 Wolf+0.036
 Squirrel+0.036
oo+0.035
 wolf+0.034
Neg logit contributions
 dice-0.035
ents-0.034
 contain-0.033
 else-0.033
hest-0.031
Token like
Token ID588
Feature activation-2.83e+00

Pos logit contributions
 else+0.251
ents+0.248
 dice+0.243
 contain+0.229
 clearly+0.222
Neg logit contributions
uck-0.250
 Cow-0.237
oo-0.234
 Wolf-0.231
burn-0.221
Token they
Token ID484
Feature activation-1.62e+00

Pos logit contributions
 else+0.254
ents+0.251
 dice+0.249
 contain+0.236
 clearly+0.225
Neg logit contributions
uck-0.208
 Cow-0.197
oo-0.194
 Wolf-0.191
burn-0.182
Token were
Token ID547
Feature activation-2.66e-01

Pos logit contributions
ents+0.133
 dice+0.133
 else+0.122
 contain+0.117
ooters+0.112
Neg logit contributions
uck-0.128
 Cow-0.121
oo-0.118
 Wolf-0.117
 Sleep-0.115
Token giving
Token ID3501
Feature activation+2.48e+00

Pos logit contributions
 dice+0.023
ents+0.023
 else+0.023
 contain+0.022
 clearly+0.020
Neg logit contributions
uck-0.019
 Cow-0.019
oo-0.018
 Wolf-0.018
burn-0.017
Token you
Token ID345
Feature activation+5.87e-02

Pos logit contributions
uck+0.186
 Cow+0.176
oo+0.172
 Wolf+0.171
burn+0.162
Neg logit contributions
ents-0.218
 else-0.217
 dice-0.215
 contain-0.205
 clearly-0.193
Token a
Token ID257
Feature activation+7.29e-01

Pos logit contributions
uck+0.004
 Cow+0.004
 Wolf+0.003
oo+0.003
burn+0.003
Neg logit contributions
ents-0.006
 dice-0.006
 else-0.006
 contain-0.005
 clearly-0.005
Token special
Token ID2041
Feature activation-3.64e+00

Pos logit contributions
uck+0.058
 Cow+0.055
 Wolf+0.054
oo+0.054
burn+0.052
Neg logit contributions
ents-0.061
 dice-0.059
 else-0.059
 contain-0.057
 clearly-0.053
Token birthday
Token ID10955
Feature activation-9.26e-01

Pos logit contributions
 else+0.323
 dice+0.320
ents+0.310
 contain+0.301
 clearly+0.283
Neg logit contributions
uck-0.268
 Cow-0.258
oo-0.252
 Wolf-0.247
ivered-0.242
Token present
Token ID1944
Feature activation-6.51e-01

Pos logit contributions
ents+0.087
 dice+0.084
 contain+0.082
 else+0.082
 clearly+0.078
Neg logit contributions
uck-0.063
 Cow-0.058
oo-0.057
burn-0.056
 Wolf-0.056
Token."
Token ID526
Feature activation+9.43e-01

Pos logit contributions
 dice+0.060
ents+0.060
 else+0.058
 contain+0.057
 clearly+0.054
Neg logit contributions
uck-0.045
 Cow-0.043
oo-0.042
 Wolf-0.041
burn-0.040
Token
Token ID198
Feature activation-4.83e-01

Pos logit contributions
uck+0.073
oo+0.070
ivered+0.070
 Cow+0.069
burn+0.069
Neg logit contributions
 else-0.080
ents-0.077
 dice-0.073
 contain-0.069
 clearly-0.069
Token stepped
Token ID10764
Feature activation-9.30e-01

Pos logit contributions
uck+0.020
 Cow+0.019
oo+0.019
 Wolf+0.019
burn+0.019
Neg logit contributions
 else-0.025
ents-0.025
 dice-0.024
 contain-0.023
 clearly-0.022
Token closer
Token ID5699
Feature activation-3.40e+00

Pos logit contributions
 else+0.051
ents+0.049
 dice+0.047
 contain+0.045
 clearly+0.041
Neg logit contributions
ivered-0.096
uck-0.096
 Cow-0.095
oo-0.095
burn-0.095
Token it
Token ID340
Feature activation+1.49e+00

Pos logit contributions
 dice+0.325
ents+0.321
 else+0.318
 contain+0.306
 clearly+0.283
Neg logit contributions
uck-0.235
 Cow-0.221
oo-0.215
 Wolf-0.214
 wolf-0.193
Token felt
Token ID2936
Feature activation+2.11e+00

Pos logit contributions
uck+0.103
 Cow+0.098
 Wolf+0.094
oo+0.094
P+0.088
Neg logit contributions
ents-0.139
 dice-0.139
 else-0.137
 contain-0.133
 clearly-0.123
Token a
Token ID257
Feature activation+7.21e-01

Pos logit contributions
uck+0.175
oo+0.167
 Cow+0.165
burn+0.164
 Wolf+0.161
Neg logit contributions
 else-0.181
ents-0.169
 dice-0.161
 contain-0.160
 clearly-0.157
Token pinch
Token ID25394
Feature activation+2.76e+00

Pos logit contributions
uck+0.062
 Cow+0.059
oo+0.059
 Wolf+0.057
burn+0.056
Neg logit contributions
 else-0.058
ents-0.055
 dice-0.054
 contain-0.053
 clearly-0.050
Token.
Token ID13
Feature activation-1.90e-02

Pos logit contributions
uck+0.218
 Cow+0.212
oo+0.205
 Wolf+0.203
 Amen+0.199
Neg logit contributions
 else-0.232
ents-0.223
 dice-0.217
 contain-0.212
 clearly-0.202
Token It
Token ID632
Feature activation+1.45e+00

Pos logit contributions
ents+0.001
 else+0.001
 clearly+0.001
ows+0.001
 dice+0.001
Neg logit contributions
burn-0.002
ivered-0.001
P-0.001
eled-0.001
uck-0.001
Token was
Token ID373
Feature activation+3.61e-01

Pos logit contributions
uck+0.092
 Cow+0.088
 Wolf+0.086
oo+0.083
ivered+0.078
Neg logit contributions
 dice-0.146
ents-0.144
 else-0.139
 contain-0.138
ooters-0.128
Token a
Token ID257
Feature activation+7.22e-01

Pos logit contributions
uck+0.027
oo+0.025
 Cow+0.025
 Wolf+0.024
burn+0.024
Neg logit contributions
 else-0.032
ents-0.032
 dice-0.031
 contain-0.029
 clearly-0.029
Token nasty
Token ID17166
Feature activation+2.37e+00

Pos logit contributions
uck+0.063
oo+0.059
 Cow+0.058
ivered+0.056
 Wolf+0.056
Neg logit contributions
 else-0.058
 dice-0.055
ents-0.053
 contain-0.051
 clearly-0.049
Token hole
Token ID7604
Feature activation+1.71e+00

Pos logit contributions
uck+0.003
 Cow+0.003
oo+0.003
 Wolf+0.003
ivered+0.003
Neg logit contributions
 dice-0.003
 else-0.003
ents-0.003
 contain-0.003
 clearly-0.002
Token together
Token ID1978
Feature activation-1.04e+00

Pos logit contributions
uck+0.124
oo+0.115
 Cow+0.115
 Wolf+0.110
ivered+0.107
Neg logit contributions
 dice-0.153
ents-0.153
 else-0.152
 contain-0.144
 clearly-0.135
Token and
Token ID290
Feature activation-3.30e-01

Pos logit contributions
 dice+0.098
ents+0.097
 else+0.094
 contain+0.089
ooters+0.085
Neg logit contributions
uck-0.068
 Cow-0.063
oo-0.062
 Wolf-0.061
 Sleep-0.058
Token looked
Token ID3114
Feature activation+2.01e-01

Pos logit contributions
ents+0.029
 dice+0.028
 else+0.027
 contain+0.026
ooters+0.025
Neg logit contributions
uck-0.023
 Cow-0.022
 Wolf-0.022
 Sleep-0.021
oo-0.021
Token inside
Token ID2641
Feature activation-2.85e+00

Pos logit contributions
uck+0.014
 Cow+0.013
 wolf+0.012
 Wolf+0.012
eled+0.012
Neg logit contributions
 dice-0.018
ents-0.018
 else-0.017
 contain-0.017
ooters-0.016
Token.
Token ID13
Feature activation-2.59e-01

Pos logit contributions
 dice+0.254
ents+0.254
 else+0.252
 contain+0.238
 clearly+0.221
Neg logit contributions
uck-0.206
 Cow-0.197
 Wolf-0.192
oo-0.191
burn-0.177
Token
Token ID198
Feature activation-6.95e-01

Pos logit contributions
 else+0.027
ents+0.027
 dice+0.027
 contain+0.026
 clearly+0.025
Neg logit contributions
uck-0.015
 Cow-0.014
 Wolf-0.013
oo-0.013
 Sleep-0.012
Token
Token ID198
Feature activation-6.93e-01

Pos logit contributions
 dice+0.074
ents+0.074
 contain+0.073
 else+0.073
 clearly+0.067
Neg logit contributions
uck-0.038
 Cow-0.036
oo-0.035
 Wolf-0.034
P-0.032
Token"
Token ID1
Feature activation+4.30e-01

Pos logit contributions
ents+0.068
 else+0.068
 dice+0.067
 contain+0.065
 clearly+0.062
Neg logit contributions
uck-0.044
 Cow-0.042
oo-0.041
 Wolf-0.040
P-0.038
TokenWow
Token ID22017
Feature activation+7.71e-02

Pos logit contributions
uck+0.024
 Cow+0.023
oo+0.022
 Wolf+0.022
ivered+0.020
Neg logit contributions
 else-0.046
 dice-0.046
ents-0.046
 contain-0.044
 clearly-0.042
Token,
Token ID11
Feature activation+4.78e-01

Pos logit contributions
uck+0.004
oo+0.004
 Cow+0.004
 Wolf+0.004
burn+0.003
Neg logit contributions
ents-0.008
 dice-0.008
 else-0.008
 contain-0.008
 clearly-0.008
Token crunch
Token ID23527
Feature activation+5.90e-01

Pos logit contributions
uck+0.305
oo+0.290
 Cow+0.285
P+0.275
 Wolf+0.274
Neg logit contributions
 else-0.276
 contain-0.275
ents-0.266
 dice-0.261
 clearly-0.245
Token in
Token ID287
Feature activation-5.15e-01

Pos logit contributions
uck+0.035
 Cow+0.034
 Wolf+0.033
oo+0.032
P+0.030
Neg logit contributions
 else-0.061
ents-0.059
 dice-0.058
 contain-0.057
 clearly-0.055
Token his
Token ID465
Feature activation+1.35e+00

Pos logit contributions
 else+0.038
 dice+0.038
 contain+0.037
ents+0.037
 clearly+0.035
Neg logit contributions
uck-0.048
ivered-0.045
oo-0.045
 Wolf-0.043
 Cow-0.042
Token hand
Token ID1021
Feature activation+2.54e+00

Pos logit contributions
uck+0.098
oo+0.094
 Cow+0.091
 Wolf+0.090
burn+0.087
Neg logit contributions
 else-0.121
ents-0.119
 dice-0.118
 contain-0.113
ows-0.104
Token.
Token ID13
Feature activation-2.15e-02

Pos logit contributions
uck+0.201
 Cow+0.192
oo+0.189
 Wolf+0.187
burn+0.179
Neg logit contributions
 else-0.211
ents-0.209
 dice-0.208
 contain-0.196
 clearly-0.185
Token He
Token ID679
Feature activation+1.00e+00

Pos logit contributions
 else+0.002
ents+0.002
 dice+0.002
 contain+0.002
 clearly+0.002
Neg logit contributions
uck-0.001
oo-0.001
 Cow-0.001
burn-0.001
 Wolf-0.001
Token was
Token ID373
Feature activation+3.47e-01

Pos logit contributions
uck+0.068
 Cow+0.066
oo+0.063
 Wolf+0.063
eled+0.061
Neg logit contributions
 dice-0.092
ents-0.091
 else-0.090
 contain-0.088
ooters-0.082
Token so
Token ID523
Feature activation-1.23e+00

Pos logit contributions
uck+0.030
 Cow+0.029
oo+0.029
 Wolf+0.028
burn+0.027
Neg logit contributions
 else-0.026
ents-0.026
 dice-0.026
 contain-0.024
 clearly-0.023
Token happy
Token ID3772
Feature activation-1.64e+00

Pos logit contributions
 else+0.089
ents+0.084
 contain+0.079
 dice+0.078
 clearly+0.074
Neg logit contributions
uck-0.117
 Cow-0.114
 Wolf-0.114
oo-0.110
burn-0.110
Token that
Token ID326
Feature activation+1.90e+00

Pos logit contributions
ents+0.141
 else+0.141
 dice+0.140
 contain+0.133
 clearly+0.124
Neg logit contributions
uck-0.126
 Cow-0.120
oo-0.118
 Wolf-0.117
burn-0.110
Token he
Token ID339
Feature activation-1.38e+00

Pos logit contributions
uck+0.171
 Cow+0.163
oo+0.162
 Wolf+0.159
burn+0.155
Neg logit contributions
 else-0.143
ents-0.139
 dice-0.138
 contain-0.131
 clearly-0.122
Token Dad
Token ID17415
Feature activation-1.16e+00

Pos logit contributions
 else+0.010
ents+0.010
 dice+0.010
 contain+0.009
 clearly+0.008
Neg logit contributions
uck-0.011
oo-0.011
ivered-0.011
 Cow-0.011
 Wolf-0.010
Token say
Token ID910
Feature activation-1.64e+00

Pos logit contributions
 dice+0.086
ents+0.083
 else+0.079
 contain+0.069
 clearly+0.069
Neg logit contributions
uck-0.103
 Cow-0.100
oo-0.098
 Wolf-0.097
 Amen-0.090
Token.
Token ID13
Feature activation-8.29e-02

Pos logit contributions
 else+0.123
ents+0.116
 dice+0.110
 contain+0.107
 clearly+0.103
Neg logit contributions
uck-0.145
 Cow-0.143
P-0.140
 Wolf-0.139
ivered-0.137
Token "
Token ID366
Feature activation-1.13e-01

Pos logit contributions
 else+0.006
 dice+0.006
ents+0.006
 clearly+0.005
ows+0.005
Neg logit contributions
uck-0.007
ivered-0.007
oo-0.007
eled-0.007
 Cow-0.007
TokenYou
Token ID1639
Feature activation+2.24e+00

Pos logit contributions
 else+0.012
ents+0.011
 dice+0.011
 contain+0.011
 clearly+0.010
Neg logit contributions
uck-0.007
 Cow-0.007
oo-0.006
 Wolf-0.006
burn-0.006
Token are
Token ID389
Feature activation-2.06e+00

Pos logit contributions
uck+0.199
 Cow+0.190
oo+0.188
 Wolf+0.186
burn+0.178
Neg logit contributions
ents-0.166
 else-0.164
 dice-0.163
 contain-0.155
 clearly-0.145
Token not
Token ID407
Feature activation+8.91e-01

Pos logit contributions
ents+0.190
 dice+0.188
 else+0.185
 contain+0.183
 clearly+0.170
Neg logit contributions
uck-0.144
 Cow-0.138
oo-0.136
 Wolf-0.136
P-0.128
Token behaving
Token ID37722
Feature activation+2.15e+00

Pos logit contributions
uck+0.066
 Cow+0.062
 Wolf+0.061
oo+0.060
ivered+0.057
Neg logit contributions
 else-0.081
ents-0.080
 dice-0.078
 contain-0.075
 clearly-0.071
Token!
Token ID0
Feature activation+6.14e-01

Pos logit contributions
uck+0.156
oo+0.146
 Cow+0.141
 Wolf+0.139
ivered+0.133
Neg logit contributions
ents-0.198
 dice-0.192
 else-0.187
 contain-0.185
 clearly-0.168
Token We
Token ID775
Feature activation+7.58e-02

Pos logit contributions
uck+0.042
 Cow+0.042
P+0.041
ivered+0.041
burn+0.040
Neg logit contributions
ents-0.054
 else-0.054
 dice-0.052
 wins-0.049
 clearly-0.048
Token are
Token ID389
Feature activation-2.07e+00

Pos logit contributions
uck+0.007
 Cow+0.007
oo+0.007
 Wolf+0.007
burn+0.006
Neg logit contributions
ents-0.005
 else-0.005
 dice-0.005
 contain-0.005
 clearly-0.005
Token never
Token ID1239
Feature activation+1.49e+00

Pos logit contributions
 else+0.011
ents+0.010
 contain+0.010
 dice+0.010
 clearly+0.009
Neg logit contributions
uck-0.014
 Cow-0.013
ivered-0.013
 Wolf-0.013
eled-0.012
Token leave
Token ID2666
Feature activation+1.55e+00

Pos logit contributions
uck+0.117
 Cow+0.108
 Wolf+0.106
oo+0.105
burn+0.104
Neg logit contributions
 dice-0.124
ents-0.121
 else-0.117
 contain-0.112
ows-0.112
Token his
Token ID465
Feature activation+1.28e+00

Pos logit contributions
uck+0.115
 Cow+0.109
oo+0.107
 Wolf+0.105
burn+0.101
Neg logit contributions
 else-0.140
ents-0.138
 dice-0.137
 contain-0.129
 clearly-0.124
Token toy
Token ID13373
Feature activation-1.25e+00

Pos logit contributions
uck+0.088
 Wolf+0.086
 Cow+0.084
 wolf+0.083
 fever+0.081
Neg logit contributions
ents-0.120
 contain-0.114
 else-0.112
 dice-0.108
ows-0.104
Token computer
Token ID3644
Feature activation-2.50e+00

Pos logit contributions
ents+0.114
 contain+0.109
 else+0.109
 dice+0.107
 clearly+0.099
Neg logit contributions
uck-0.091
 Wolf-0.084
 Cow-0.082
oo-0.082
burn-0.081
Token outside
Token ID2354
Feature activation-1.53e+00

Pos logit contributions
ents+0.221
 dice+0.219
 else+0.208
 contain+0.202
ooters+0.195
Neg logit contributions
uck-0.178
burn-0.156
 Wolf-0.154
oo-0.152
 Cow-0.151
Token again
Token ID757
Feature activation-1.03e+00

Pos logit contributions
ents+0.132
 dice+0.131
 else+0.130
 contain+0.123
 clearly+0.114
Neg logit contributions
uck-0.116
 Cow-0.109
 Wolf-0.108
oo-0.107
burn-0.101
Token.
Token ID13
Feature activation-5.83e-02

Pos logit contributions
 else+0.084
ents+0.084
 dice+0.083
 contain+0.078
 clearly+0.074
Neg logit contributions
uck-0.084
 Cow-0.080
oo-0.079
 Wolf-0.078
burn-0.074
Token And
Token ID843
Feature activation-1.07e+00

Pos logit contributions
 else+0.005
ents+0.005
 dice+0.005
 contain+0.005
 clearly+0.005
Neg logit contributions
uck-0.004
oo-0.004
 Cow-0.004
burn-0.004
ivered-0.004
Token they
Token ID484
Feature activation-1.68e+00

Pos logit contributions
 dice+0.093
ents+0.093
 else+0.089
 contain+0.084
ows+0.081
Neg logit contributions
uck-0.082
 Cow-0.079
 Wolf-0.077
oo-0.077
 Squirrel-0.071
Token all
Token ID477
Feature activation+1.62e+00

Pos logit contributions
 else+0.151
ents+0.148
 dice+0.145
 contain+0.139
 clearly+0.133
Neg logit contributions
uck-0.126
 Cow-0.119
oo-0.116
 Wolf-0.115
burn-0.110
Token,
Token ID11
Feature activation+4.68e-01

Pos logit contributions
uck+0.071
 Cow+0.067
oo+0.066
 Wolf+0.064
burn+0.062
Neg logit contributions
 dice-0.085
ents-0.084
 else-0.084
 contain-0.079
 clearly-0.075
Token triumphant
Token ID44005
Feature activation-2.24e+00

Pos logit contributions
uck+0.038
 Cow+0.036
oo+0.036
 Wolf+0.035
P+0.034
Neg logit contributions
 else-0.039
ents-0.038
 dice-0.037
 contain-0.036
 clearly-0.034
Token.
Token ID13
Feature activation-1.94e-01

Pos logit contributions
 else+0.180
ents+0.177
 dice+0.174
 contain+0.166
 clearly+0.160
Neg logit contributions
uck-0.183
 Cow-0.175
oo-0.173
 Wolf-0.169
burn-0.167
Token
Token ID220
Feature activation+1.15e+00

Pos logit contributions
ents+0.019
 else+0.019
 dice+0.018
 contain+0.018
 clearly+0.017
Neg logit contributions
uck-0.013
oo-0.012
 Cow-0.012
 Wolf-0.011
burn-0.011
Token
Token ID198
Feature activation-6.34e-01

Pos logit contributions
uck+0.072
 Cow+0.067
oo+0.066
 Wolf+0.064
burn+0.059
Neg logit contributions
 else-0.116
 dice-0.115
ents-0.115
 contain-0.111
 clearly-0.104
Token
Token ID198
Feature activation-6.11e-01

Pos logit contributions
 else+0.062
ents+0.062
 dice+0.061
 contain+0.058
 clearly+0.055
Neg logit contributions
uck-0.041
 Cow-0.039
oo-0.038
 Wolf-0.038
burn-0.036
TokenAmy
Token ID40797
Feature activation-3.49e+00

Pos logit contributions
 else+0.057
ents+0.056
 dice+0.056
 contain+0.053
 clearly+0.050
Neg logit contributions
uck-0.043
 Cow-0.041
oo-0.040
 Wolf-0.040
burn-0.038
Token looked
Token ID3114
Feature activation+3.22e-01

Pos logit contributions
 else+0.300
ents+0.298
 dice+0.294
 contain+0.279
 clearly+0.264
Neg logit contributions
uck-0.268
 Cow-0.256
oo-0.251
 Wolf-0.248
burn-0.237
Token back
Token ID736
Feature activation-2.06e+00

Pos logit contributions
uck+0.024
 Cow+0.022
 Wolf+0.022
oo+0.021
 wolf+0.021
Neg logit contributions
 dice-0.029
ents-0.028
 else-0.028
 contain-0.027
ooters-0.024
Token and
Token ID290
Feature activation-1.83e-01

Pos logit contributions
 else+0.194
 dice+0.193
ents+0.191
 contain+0.185
 clearly+0.170
Neg logit contributions
uck-0.145
 Cow-0.135
oo-0.133
 Wolf-0.131
burn-0.121
Token smiled
Token ID13541
Feature activation-2.49e+00

Pos logit contributions
 dice+0.017
ents+0.017
 else+0.016
 contain+0.016
 clearly+0.015
Neg logit contributions
uck-0.013
 Cow-0.013
 Wolf-0.012
oo-0.012
eled-0.011
Token so
Token ID523
Feature activation-1.37e+00

Pos logit contributions
 else+0.064
ents+0.060
 dice+0.057
 contain+0.055
 clearly+0.054
Neg logit contributions
uck-0.069
 Cow-0.066
oo-0.065
 Wolf-0.064
burn-0.063
Token helpful
Token ID7613
Feature activation+2.28e+00

Pos logit contributions
 else+0.122
ents+0.117
 dice+0.114
 contain+0.110
 clearly+0.106
Neg logit contributions
uck-0.107
 Cow-0.103
oo-0.101
 Wolf-0.100
burn-0.097
Token.
Token ID13
Feature activation-1.53e-01

Pos logit contributions
uck+0.196
 Cow+0.192
oo+0.189
 Wolf+0.184
ivered+0.182
Neg logit contributions
 else-0.172
ents-0.169
 dice-0.163
 contain-0.155
 clearly-0.149
Token
Token ID220
Feature activation+1.20e+00

Pos logit contributions
ents+0.015
 else+0.015
 dice+0.015
 contain+0.014
 clearly+0.013
Neg logit contributions
uck-0.010
 Cow-0.009
oo-0.009
 Wolf-0.009
burn-0.009
Token
Token ID198
Feature activation-5.80e-01

Pos logit contributions
uck+0.076
 Cow+0.071
oo+0.070
 Wolf+0.069
burn+0.064
Neg logit contributions
 else-0.119
ents-0.118
 dice-0.118
 contain-0.113
 clearly-0.107
Token
Token ID198
Feature activation-5.80e-01

Pos logit contributions
 else+0.057
ents+0.057
 dice+0.057
 contain+0.054
 clearly+0.052
Neg logit contributions
uck-0.037
 Cow-0.035
oo-0.034
 Wolf-0.034
burn-0.031
TokenFrom
Token ID4863
Feature activation+8.71e-01

Pos logit contributions
 else+0.053
ents+0.053
 dice+0.052
 contain+0.049
 clearly+0.047
Neg logit contributions
uck-0.042
 Cow-0.040
oo-0.039
 Wolf-0.038
burn-0.037
Token that
Token ID326
Feature activation+1.81e+00

Pos logit contributions
uck+0.074
 Cow+0.071
oo+0.070
 Wolf+0.069
burn+0.067
Neg logit contributions
 else-0.068
ents-0.067
 dice-0.066
 contain-0.063
 clearly-0.059
Token day
Token ID1110
Feature activation+1.09e+00

Pos logit contributions
uck+0.128
 Cow+0.120
ivered+0.119
oo+0.118
eled+0.115
Neg logit contributions
 else-0.163
 dice-0.158
 contain-0.152
ents-0.151
 clearly-0.141
Token on
Token ID319
Feature activation+2.76e+00

Pos logit contributions
uck+0.088
 Cow+0.087
ivered+0.087
 fever+0.086
eled+0.085
Neg logit contributions
 else-0.078
 dice-0.076
ents-0.073
 contain-0.071
 correctly-0.062
Token,
Token ID11
Feature activation+5.94e-01

Pos logit contributions
uck+0.203
 Cow+0.190
oo+0.187
burn+0.183
 Wolf+0.183
Neg logit contributions
 else-0.250
 dice-0.240
ents-0.238
 contain-0.228
 clearly-0.216
Token it
Token ID340
Feature activation+1.73e+00

Pos logit contributions
uck+0.209
 Cow+0.195
oo+0.193
 Wolf+0.186
P+0.181
Neg logit contributions
ents-0.245
 dice-0.242
 else-0.233
 contain-0.230
 clearly-0.215
Token.
Token ID13
Feature activation-1.71e-01

Pos logit contributions
uck+0.129
 Cow+0.118
oo+0.116
 Wolf+0.113
burn+0.110
Neg logit contributions
ents-0.153
 dice-0.152
 else-0.148
 contain-0.140
 clearly-0.132
Token "
Token ID366
Feature activation-2.68e-01

Pos logit contributions
 else+0.017
ents+0.017
 dice+0.017
 contain+0.016
 clearly+0.015
Neg logit contributions
uck-0.011
 Cow-0.010
oo-0.010
 Wolf-0.010
burn-0.010
TokenOh
Token ID5812
Feature activation+7.01e-01

Pos logit contributions
 else+0.028
 dice+0.028
ents+0.028
 contain+0.026
 clearly+0.025
Neg logit contributions
uck-0.016
 Cow-0.015
oo-0.014
 Wolf-0.014
ivered-0.013
Token,
Token ID11
Feature activation+4.10e-01

Pos logit contributions
uck+0.048
 Cow+0.046
oo+0.045
 Wolf+0.044
burn+0.042
Neg logit contributions
 else-0.067
ents-0.066
 dice-0.065
 contain-0.062
 clearly-0.059
Token Lily
Token ID20037
Feature activation-1.19e+00

Pos logit contributions
uck+0.033
 Cow+0.031
oo+0.031
 Wolf+0.030
ivered+0.029
Neg logit contributions
 else-0.034
ents-0.034
 dice-0.034
 contain-0.032
 clearly-0.030
Token,
Token ID11
Feature activation+4.19e-01

Pos logit contributions
ents+0.081
 else+0.080
 dice+0.073
 clearly+0.067
ows+0.066
Neg logit contributions
uck-0.103
 Cow-0.101
P-0.101
 Wolf-0.100
eled-0.100
Token you
Token ID345
Feature activation-1.85e-01

Pos logit contributions
uck+0.035
 Cow+0.034
oo+0.033
 Wolf+0.033
burn+0.032
Neg logit contributions
 else-0.033
ents-0.033
 dice-0.032
 contain-0.030
 clearly-0.029
Token have
Token ID423
Feature activation-1.03e+00

Pos logit contributions
 else+0.013
ents+0.013
 dice+0.013
 contain+0.012
 clearly+0.011
Neg logit contributions
uck-0.017
 Cow-0.017
oo-0.017
 Wolf-0.017
burn-0.016
Token a
Token ID257
Feature activation+5.02e-01

Pos logit contributions
 else+0.092
ents+0.092
 dice+0.090
 contain+0.086
 clearly+0.081
Neg logit contributions
uck-0.077
 Cow-0.073
oo-0.072
 Wolf-0.071
burn-0.068
Token very
Token ID845
Feature activation-3.25e+00

Pos logit contributions
uck+0.038
 Cow+0.036
oo+0.036
 Wolf+0.035
burn+0.034
Neg logit contributions
 else-0.044
ents-0.043
 dice-0.043
 contain-0.040
 clearly-0.038
Token big
Token ID1263
Feature activation+1.75e-01

Pos logit contributions
uck+0.056
 Cow+0.054
 Wolf+0.053
oo+0.053
burn+0.050
Neg logit contributions
ents-0.061
 else-0.059
 dice-0.059
 contain-0.057
 clearly-0.053
Token book
Token ID1492
Feature activation-2.13e+00

Pos logit contributions
uck+0.014
 Cow+0.014
oo+0.013
 Wolf+0.013
ivered+0.013
Neg logit contributions
 dice-0.014
 else-0.014
ents-0.014
 contain-0.013
 clearly-0.012
Token with
Token ID351
Feature activation+1.16e+00

Pos logit contributions
ents+0.211
 dice+0.205
 else+0.194
 contain+0.194
inations+0.193
Neg logit contributions
uck-0.132
oo-0.115
time-0.101
over-0.100
P-0.098
Token pictures
Token ID5986
Feature activation-2.95e+00

Pos logit contributions
 Cow+0.057
uck+0.054
 Wolf+0.054
 wolf+0.051
oo+0.051
Neg logit contributions
 contain-0.123
 else-0.120
ents-0.116
 dice-0.115
ows-0.111
Token of
Token ID286
Feature activation-8.58e-01

Pos logit contributions
 contain+0.303
 else+0.302
 dice+0.301
ents+0.288
rig+0.281
Neg logit contributions
uck-0.161
oo-0.142
 Cow-0.123
over-0.119
 Sleep-0.118
Token volcan
Token ID17516
Feature activation-7.68e+00

Pos logit contributions
 contain+0.088
ents+0.085
 else+0.083
 dice+0.082
ooters+0.079
Neg logit contributions
 Cow-0.049
 Wolf-0.045
 wolf-0.045
uck-0.041
 chick-0.041
Tokenoes
Token ID3028
Feature activation-3.39e+00

Pos logit contributions
 else+0.581
 dice+0.562
 contain+0.525
ents+0.492
 clearly+0.462
Neg logit contributions
uck-0.715
oo-0.689
burn-0.643
 Cow-0.628
ivered-0.610
Token.
Token ID13
Feature activation-4.45e-02

Pos logit contributions
 dice+0.316
ents+0.313
 else+0.309
 contain+0.306
 clearly+0.272
Neg logit contributions
uck-0.232
 Cow-0.225
oo-0.220
 Wolf-0.215
burn-0.199
Token Vol
Token ID4709
Feature activation-6.57e+00

Pos logit contributions
 else+0.004
ents+0.004
 dice+0.004
 contain+0.004
 clearly+0.004
Neg logit contributions
uck-0.003
 Cow-0.003
oo-0.003
 Wolf-0.003
burn-0.002
Tokencan
Token ID5171
Feature activation-1.20e+00

Pos logit contributions
 else+0.525
ents+0.522
 dice+0.516
 contain+0.486
 clearly+0.459
Neg logit contributions
uck-0.544
 Cow-0.523
oo-0.512
 Wolf-0.508
burn-0.487
Tokenoes
Token ID3028
Feature activation-3.40e+00

Pos logit contributions
 dice+0.104
 contain+0.101
 else+0.100
ents+0.099
 clearly+0.089
Neg logit contributions
uck-0.092
 Cow-0.089
oo-0.088
 Wolf-0.084
burn-0.079
TokenThe
Token ID464
Feature activation-1.63e-01

Pos logit contributions
 else+0.064
ents+0.059
 dice+0.058
 contain+0.056
hest+0.055
Neg logit contributions
burn-0.042
 Wolf-0.039
oo-0.038
 Cow-0.038
ivered-0.037
Token animals
Token ID4695
Feature activation-7.86e-01

Pos logit contributions
 else+0.016
 dice+0.016
ents+0.016
 contain+0.015
 clearly+0.015
Neg logit contributions
uck-0.010
 Cow-0.010
oo-0.010
 Wolf-0.009
burn-0.009
Token crowded
Token ID18012
Feature activation-2.64e+00

Pos logit contributions
 else+0.064
ents+0.064
 dice+0.063
 contain+0.059
 clearly+0.056
Neg logit contributions
uck-0.064
 Cow-0.061
oo-0.060
 Wolf-0.060
burn-0.057
Token around
Token ID1088
Feature activation-4.16e+00

Pos logit contributions
ents+0.212
 else+0.209
 dice+0.207
 contain+0.191
 clearly+0.178
Neg logit contributions
uck-0.221
 Cow-0.209
oo-0.205
 Wolf-0.205
eled-0.193
Token and
Token ID290
Feature activation-2.56e-01

Pos logit contributions
 else+0.350
 dice+0.341
ents+0.340
 contain+0.323
 clearly+0.309
Neg logit contributions
uck-0.332
 Cow-0.314
oo-0.309
 Wolf-0.303
burn-0.302
Token gasped
Token ID45236
Feature activation-7.86e+00

Pos logit contributions
ents+0.022
 dice+0.022
 else+0.022
 contain+0.021
 clearly+0.019
Neg logit contributions
uck-0.019
 Cow-0.019
 Wolf-0.018
oo-0.018
burn-0.017
Token in
Token ID287
Feature activation-7.00e-01

Pos logit contributions
 else+0.551
ents+0.517
 clearly+0.517
 contain+0.484
 dice+0.466
Neg logit contributions
uck-0.717
burn-0.712
 Cow-0.708
P-0.697
 Wolf-0.677
Token amaz
Token ID40642
Feature activation-2.04e+00

Pos logit contributions
 else+0.053
 dice+0.051
 contain+0.050
ents+0.048
 clearly+0.046
Neg logit contributions
uck-0.064
 Cow-0.061
oo-0.061
burn-0.060
ivered-0.059
Tokenement
Token ID972
Feature activation-1.29e+00

Pos logit contributions
 else+0.135
ents+0.131
 dice+0.120
 clearly+0.118
 contain+0.116
Neg logit contributions
 Cow-0.201
uck-0.196
 Wolf-0.194
burn-0.194
oo-0.186
Token.
Token ID13
Feature activation-2.26e-01

Pos logit contributions
 else+0.103
ents+0.103
 dice+0.099
 contain+0.096
 clearly+0.091
Neg logit contributions
uck-0.107
 Cow-0.103
 Wolf-0.100
oo-0.100
burn-0.096
Token Where
Token ID6350
Feature activation-4.78e-01

Pos logit contributions
ents+0.020
 dice+0.018
 else+0.018
 clearly+0.017
 contain+0.017
Neg logit contributions
burn-0.018
oo-0.017
ivered-0.017
uck-0.016
eled-0.016
Token,
Token ID11
Feature activation+4.23e-01

Pos logit contributions
 else+0.109
ents+0.103
 dice+0.102
 contain+0.098
 clearly+0.094
Neg logit contributions
uck-0.087
 Cow-0.085
 Wolf-0.083
oo-0.082
burn-0.081
Token "
Token ID366
Feature activation-3.17e-01

Pos logit contributions
uck+0.038
ivered+0.036
P+0.035
burn+0.034
oo+0.034
Neg logit contributions
 else-0.031
ents-0.031
 dice-0.029
 clearly-0.028
 contain-0.028
TokenMy
Token ID3666
Feature activation-2.27e+00

Pos logit contributions
 else+0.031
ents+0.030
 dice+0.030
 contain+0.029
 clearly+0.027
Neg logit contributions
uck-0.021
 Cow-0.020
oo-0.020
 Wolf-0.019
ivered-0.018
Token name
Token ID1438
Feature activation+9.43e-01

Pos logit contributions
 else+0.222
 dice+0.219
ents+0.218
 contain+0.208
 clearly+0.200
Neg logit contributions
uck-0.145
 Cow-0.136
oo-0.134
 Wolf-0.132
P-0.128
Token is
Token ID318
Feature activation-4.90e-01

Pos logit contributions
uck+0.079
 Cow+0.076
oo+0.074
 Wolf+0.074
burn+0.070
Neg logit contributions
 else-0.075
ents-0.074
 dice-0.073
 contain-0.068
 clearly-0.065
Token Kate
Token ID16693
Feature activation-8.38e+00

Pos logit contributions
ents+0.044
 dice+0.044
 else+0.043
 contain+0.043
 clearly+0.039
Neg logit contributions
uck-0.035
 Cow-0.035
 Wolf-0.033
oo-0.033
P-0.031
Token."
Token ID526
Feature activation+6.83e-01

Pos logit contributions
 dice+0.762
 else+0.749
ents+0.743
 contain+0.731
 clearly+0.669
Neg logit contributions
uck-0.598
 Cow-0.587
oo-0.571
 Wolf-0.558
burn-0.546
Token
Token ID220
Feature activation+1.04e+00

Pos logit contributions
uck+0.028
 Cow+0.026
 Wolf+0.022
oo+0.022
 Sleep+0.022
Neg logit contributions
 dice-0.082
 else-0.081
ents-0.080
 contain-0.079
 clearly-0.073
Token
Token ID198
Feature activation-7.47e-01

Pos logit contributions
uck+0.055
 Cow+0.048
oo+0.048
 Wolf+0.045
 Sleep+0.040
Neg logit contributions
 dice-0.119
 contain-0.116
 else-0.116
ents-0.115
 clearly-0.104
TokenThe
Token ID464
Feature activation-2.70e-01

Pos logit contributions
 contain+0.087
ents+0.085
 dice+0.085
 else+0.083
 wins+0.078
Neg logit contributions
uck-0.034
P-0.030
oo-0.029
 Cow-0.029
M-0.026
Token triangle
Token ID22950
Feature activation-7.40e+00

Pos logit contributions
ents+0.025
 contain+0.024
 else+0.023
cases+0.023
ows+0.023
Neg logit contributions
uck-0.020
P-0.017
 wolf-0.016
 Cow-0.016
 Wolf-0.016
Token They
Token ID1119
Feature activation+1.94e+00

Pos logit contributions
 else+0.012
ents+0.011
 dice+0.011
 contain+0.011
 clearly+0.010
Neg logit contributions
uck-0.008
 Cow-0.007
oo-0.007
 Wolf-0.007
burn-0.007
Token like
Token ID588
Feature activation-2.92e+00

Pos logit contributions
uck+0.178
 Cow+0.173
oo+0.171
 Wolf+0.165
P+0.164
Neg logit contributions
ents-0.137
 dice-0.131
 else-0.131
 contain-0.117
 clearly-0.114
Token to
Token ID284
Feature activation-1.39e-01

Pos logit contributions
ents+0.263
 else+0.259
 dice+0.257
 contain+0.245
 clearly+0.235
Neg logit contributions
uck-0.210
 Cow-0.206
oo-0.198
 Wolf-0.197
burn-0.189
Token play
Token ID711
Feature activation+4.82e-01

Pos logit contributions
 else+0.011
ents+0.011
 dice+0.011
 contain+0.010
 clearly+0.010
Neg logit contributions
uck-0.012
 Cow-0.011
oo-0.011
 Wolf-0.011
burn-0.010
Token with
Token ID351
Feature activation+1.09e+00

Pos logit contributions
uck+0.039
 Cow+0.038
 Wolf+0.038
oo+0.037
burn+0.036
Neg logit contributions
 else-0.040
 dice-0.039
ents-0.037
 contain-0.035
 clearly-0.034
Token blocks
Token ID7021
Feature activation-8.43e+00

Pos logit contributions
 Cow+0.077
uck+0.075
oo+0.074
 Wolf+0.073
P+0.070
Neg logit contributions
ents-0.098
 contain-0.097
 dice-0.092
 else-0.092
 clearly-0.087
Token.
Token ID13
Feature activation-8.23e-02

Pos logit contributions
 contain+1.012
 dice+0.986
ents+0.965
 levers+0.931
inations+0.924
Neg logit contributions
uck-0.332
P-0.303
oo-0.278
over-0.252
burn-0.241
Token They
Token ID1119
Feature activation+1.98e+00

Pos logit contributions
ents+0.009
 else+0.009
 dice+0.009
 contain+0.009
 clearly+0.008
Neg logit contributions
 Cow-0.004
uck-0.004
oo-0.004
 Sleep-0.004
 Wolf-0.004
Token have
Token ID423
Feature activation-8.64e-01

Pos logit contributions
P+0.102
 Cow+0.101
 Amen+0.100
 Betsy+0.096
 Pig+0.094
Neg logit contributions
ents-0.203
cases-0.190
 else-0.182
 clearly-0.182
rig-0.178
Token many
Token ID867
Feature activation+1.02e+00

Pos logit contributions
 contain+0.093
ents+0.092
 else+0.091
 clearly+0.090
 dice+0.087
Neg logit contributions
P-0.043
 Cow-0.040
uck-0.038
 Wolf-0.035
oo-0.035
Token blocks
Token ID7021
Feature activation-8.40e+00

Pos logit contributions
P+0.067
ined+0.061
eled+0.060
el+0.060
uck+0.059
Neg logit contributions
 contain-0.092
ents-0.082
nes-0.080
cases-0.075
 existed-0.075
Token pull
Token ID2834
Feature activation-2.89e+00

Pos logit contributions
ents+0.024
 else+0.024
 dice+0.023
 contain+0.022
 clearly+0.020
Neg logit contributions
uck-0.025
 Cow-0.024
oo-0.023
 Wolf-0.023
burn-0.022
Token and
Token ID290
Feature activation-3.17e-01

Pos logit contributions
ents+0.288
 else+0.283
 contain+0.280
 dice+0.273
ooters+0.256
Neg logit contributions
uck-0.186
 Cow-0.169
 Wolf-0.168
oo-0.164
eled-0.155
Token tug
Token ID27762
Feature activation-2.44e+00

Pos logit contributions
ents+0.029
ooters+0.027
 else+0.026
hest+0.026
rig+0.026
Neg logit contributions
uck-0.019
 Wolf-0.018
 Cow-0.017
 Mitt-0.017
eled-0.017
Token at
Token ID379
Feature activation-1.44e-01

Pos logit contributions
ents+0.237
 else+0.233
 dice+0.231
 contain+0.224
 clearly+0.210
Neg logit contributions
uck-0.163
oo-0.152
 Cow-0.151
 Wolf-0.146
P-0.141
Token the
Token ID262
Feature activation-4.53e-01

Pos logit contributions
ents+0.016
 else+0.015
ooters+0.015
 contain+0.015
 dice+0.015
Neg logit contributions
uck-0.007
 Cow-0.006
oo-0.006
 Wolf-0.006
eled-0.006
Token blocks
Token ID7021
Feature activation-8.62e+00

Pos logit contributions
 else+0.035
ents+0.035
 contain+0.034
nes+0.031
 dice+0.031
Neg logit contributions
uck-0.033
 wolf-0.032
 fever-0.031
 Wolf-0.030
 Cow-0.029
Token.
Token ID13
Feature activation-2.88e-01

Pos logit contributions
 dice+0.924
ents+0.886
inations+0.872
 contain+0.871
ooters+0.842
Neg logit contributions
uck-0.438
oo-0.353
 Wolf-0.312
 Cow-0.309
burn-0.297
Token They
Token ID1119
Feature activation+1.77e+00

Pos logit contributions
 dice+0.027
ents+0.026
 else+0.026
 contain+0.025
 clearly+0.024
Neg logit contributions
uck-0.020
 Cow-0.019
oo-0.019
 Wolf-0.019
burn-0.019
Token made
Token ID925
Feature activation+3.88e+00

Pos logit contributions
uck+0.093
eled+0.091
 Cow+0.091
ivered+0.086
 Wolf+0.085
Neg logit contributions
ents-0.170
 dice-0.164
ooters-0.164
 contain-0.160
 else-0.158
Token loud
Token ID7812
Feature activation+2.34e+00

Pos logit contributions
uck+0.181
oo+0.163
 Cow+0.152
 Wolf+0.142
eled+0.133
Neg logit contributions
 contain-0.432
ents-0.431
 else-0.425
 dice-0.418
ooters-0.400
Token noises
Token ID26782
Feature activation-1.17e+00

Pos logit contributions
uck+0.201
 Cow+0.190
oo+0.189
 Wolf+0.186
burn+0.178
Neg logit contributions
 else-0.180
ents-0.177
 dice-0.174
 contain-0.171
 clearly-0.153