TOP ACTIVATIONS
MAX = 12.272

want Lily to get sun burn ed . L
don 't get a sun burn ." Lily yawn ed and
him from getting a sun burn . Tim my
she knew the word ' burn '. Her friends were so
to put on his sun burn . Tim my thought the
," Lily says , sh ivering . " Let
inside . Lily was sh ivering and her mom wrapped her
inside . Lily was sh ivering and her mom wrapped her
! " Y uck !" the owl said .
. " Y uck ! My t eddy bear
around her and stopped sh ivering . Lily was so happy
w , gross . Y uck ." They looked
the juice and said y uck again . She wanted water
is bitter , like y uck . You should wear something
the fire and stopped sh ivering . " Did
. " Y uck ! I don 't like
Sara and Tom were sh ivering in their bed . They
noses . They say y uck . They throw the doll
. " Y uck , Ben , put it
. The tree stopped sh ivering and was happy . The

MIN ACTIVATIONS
MIN = -13.101

away . He rolled the dice and moved his pieces around
came and he rolled the dice . He moved his piece
Daddy took turns rolling the dice and moving their pieces around
But then Joe noticed something else . He saw a box
very slowly . He counted 1 , 2 , 3 ...
lonely because there was nobody else around . Suddenly , the
He wants to know what else he can mark with .
But then he sees something else in the garden . He
was so special that everybody else wanted to join in the
' s go find something else . Put the paper and
but there was no one else in the park .
the cars in order from 1 to 10 . Tim my
laser and play with something else . <|endoftext|> One day ,
, 3 , 2 , 1 ". They hear a big
" 1 , 2 , 3 ,
park tomorrow and let someone else have a turn ," she
went first . He counted 1 , 2 , 3 ,
her joy spread and everyone else began to feel light and
it off or play something else . She wanted to enjoy
'll go and find something else to do ."

INTERVAL 5.928 - 12.272
CONTAINS 0.304%

it made her face scr unch up !
the street whist ling , feeling too scared to smile .
named D ucky . D ucky loved to swim in the
whistle throughout the day until dinner . <|endoftext|> Once upon a
L ily felt better after drinking the hot cocoa and sn

INTERVAL -0.415 - 5.928
CONTAINS 69.244%

But Tim my was also anxious because he
the others . After the parade , the
at his wife and shrugged . Ben and Lily
bite and said , " M mm , this cookie is
house !" The cater pillar was so happy

INTERVAL -6.758 - -0.415
CONTAINS 30.261%

Ben looked for the button . He saw it
to Tim my and he grabbed it tightly . " It
sees the gum in Anna 's hair . She is not
story is that sometimes things are hard , but if you
heard something that made her stop and listen . Was it

INTERVAL -13.101 - -6.758
CONTAINS 0.191%

arrow . " That 's awesome !" Billy said , "
book . He had written 12 and 15 on one line
was a little girl named Anne . One day she was
Daisy !" J as mine : " Hi Daisy
! He often counted the stairs as he climbed them ,
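
The four INTERVAL ranges above look like equal-width bins spanning MIN to MAX. That is an inference from the printed numbers, not something the dump states. A minimal Python sketch under that assumption (the function name interval_edges and the bin count of 4 are illustrative choices, not part of the original tool) recomputes the bin edges and checks that the CONTAINS percentages add up to 100:

```python
# Sketch only: assumes the dashboard splits [MIN, MAX] into equal-width bins.
# interval_edges is a hypothetical helper, not a function from the original tool.

def interval_edges(lo, hi, n_bins=4):
    """Return the n_bins + 1 edges of equal-width bins covering [lo, hi]."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]

# Values copied from the MIN / MAX lines of the dump.
MIN_ACT = -13.101
MAX_ACT = 12.272

# Edges as printed on the INTERVAL lines, lowest to highest.
printed_edges = [-13.101, -6.758, -0.415, 5.928, 12.272]

# Percentages from the CONTAINS lines, lowest bin to highest bin.
contains_pct = [0.191, 30.261, 69.244, 0.304]

edges = interval_edges(MIN_ACT, MAX_ACT)

# Largest gap between a recomputed edge and the printed one.
max_gap = max(abs(a - b) for a, b in zip(edges, printed_edges))

for left, right, pct in zip(edges[:-1], edges[1:], contains_pct):
    print(f"{left:8.3f} to {right:8.3f}: {pct:6.3f}% of tokens")
print(f"largest edge mismatch: {max_gap:.4f}")
print(f"total coverage: {sum(contains_pct):.3f}%")
```

Because MIN and MAX are displayed rounded to three decimals, the recomputed edges may differ from the printed ones in the last digit. Nearly all tokens (about 99.5%) fall in the two middle bins, so the examples in the outermost intervals are rare extremes.
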
Token want
Token ID765
Feature activation-1.85e+00

Pos logit contributions
uck+0.107
burn+0.101
ivered+0.098
 Cow+0.097
 Wolf+0.096
Neg logit contributions
 dice-0.080
 contain-0.072
ents-0.069
 else-0.067
 clearly-0.061
Token Lily
Token ID20037
Feature activation-6.97e-01

Pos logit contributions
 dice+0.171
 contain+0.163
ents+0.153
 else+0.149
 clearly+0.139
Neg logit contributions
uck-0.145
burn-0.132
 Wolf-0.129
 Cow-0.128
oo-0.124
Token to
Token ID284
Feature activation-4.65e-02

Pos logit contributions
 dice+0.062
 contain+0.057
ents+0.055
 else+0.053
 clearly+0.050
Neg logit contributions
uck-0.058
burn-0.053
ivered-0.052
 Cow-0.051
 Wolf-0.051
Token get
Token ID651
Feature activation-1.04e-01

Pos logit contributions
 contain+0.003
 dice+0.003
 else+0.003
ents+0.003
 clearly+0.003
Neg logit contributions
uck-0.005
ivered-0.004
 Wolf-0.004
P-0.004
burn-0.004
Token sun
Token ID4252
Feature activation+8.58e-01

Pos logit contributions
 else+0.009
 dice+0.009
 contain+0.009
ents+0.008
 clearly+0.008
Neg logit contributions
uck-0.008
ivered-0.008
burn-0.007
 Wolf-0.007
oo-0.007
Tokenburn
Token ID10899
Feature activation+1.23e+01

Pos logit contributions
uck+0.025
ivered+0.021
 Wolf+0.018
P+0.017
 Ow+0.017
Neg logit contributions
 dice-0.114
 contain-0.113
ents-0.111
 else-0.107
 clearly-0.104
Tokened
Token ID276
Feature activation+3.99e+00

Pos logit contributions
uck+0.863
ivered+0.861
P+0.858
 Cow+0.831
 Wolf+0.829
Neg logit contributions
 dice-1.040
ents-1.024
 contain-0.997
 else-0.987
ows-0.957
Token.
Token ID13
Feature activation-1.09e-01

Pos logit contributions
uck+0.326
burn+0.296
ivered+0.295
 Wolf+0.286
 Cow+0.286
Neg logit contributions
 dice-0.349
 contain-0.331
ents-0.314
 else-0.311
 clearly-0.286
Token
Token ID198
Feature activation-1.35e-01

Pos logit contributions
 dice+0.009
 else+0.008
ents+0.008
 contain+0.008
 clearly+0.007
Neg logit contributions
burn-0.009
uck-0.009
ivered-0.009
 Wolf-0.008
oo-0.008
Token
Token ID198
Feature activation-1.42e-01

Pos logit contributions
 dice+0.014
 contain+0.013
ents+0.013
 else+0.012
 clearly+0.012
Neg logit contributions
uck-0.009
ivered-0.008
burn-0.008
 Cow-0.008
 Wolf-0.008
TokenL
Token ID43
Feature activation-6.48e-02

Pos logit contributions
 dice+0.013
 contain+0.012
 else+0.012
ents+0.011
 clearly+0.010
Neg logit contributions
uck-0.011
burn-0.011
ivered-0.010
 Wolf-0.010
 Cow-0.010
Token don
Token ID836
Feature activation+5.28e-01

Pos logit contributions
 dice+0.013
 contain+0.012
ents+0.012
 else+0.011
 clearly+0.010
Neg logit contributions
uck-0.014
burn-0.013
ivered-0.012
 Wolf-0.012
 Cow-0.012
Token't
Token ID470
Feature activation+1.05e+00

Pos logit contributions
uck+0.042
ivered+0.040
 Cow+0.039
burn+0.039
 Wolf+0.039
Neg logit contributions
 dice-0.043
 contain-0.042
ents-0.040
 else-0.039
 clearly-0.037
Token get
Token ID651
Feature activation-1.01e-01

Pos logit contributions
uck+0.102
ivered+0.094
burn+0.093
 Wolf+0.091
 Cow+0.091
Neg logit contributions
 dice-0.076
 contain-0.073
 else-0.067
ents-0.067
 clearly-0.060
Token a
Token ID257
Feature activation+5.49e-01

Pos logit contributions
 dice+0.009
 else+0.008
 contain+0.008
ents+0.008
 clearly+0.007
Neg logit contributions
uck-0.008
ivered-0.008
 Wolf-0.007
burn-0.007
 Ow-0.007
Token sun
Token ID4252
Feature activation+8.25e-01

Pos logit contributions
uck+0.039
ivered+0.036
burn+0.034
 Wolf+0.034
 Cow+0.033
Neg logit contributions
 dice-0.054
 contain-0.051
 else-0.050
ents-0.048
 clearly-0.045
Tokenburn
Token ID10899
Feature activation+1.23e+01

Pos logit contributions
uck+0.022
ivered+0.019
 Wolf+0.016
 Cow+0.015
 Ow+0.014
Neg logit contributions
 dice-0.114
 contain-0.112
ents-0.108
 else-0.106
 clearly-0.101
Token."
Token ID526
Feature activation+9.75e-01

Pos logit contributions
uck+0.994
ivered+0.940
 Cow+0.895
P+0.894
 Wolf+0.891
Neg logit contributions
 dice-1.031
 contain-0.988
ents-0.930
 else-0.915
ows-0.831
Token Lily
Token ID20037
Feature activation-6.91e-01

Pos logit contributions
uck+0.088
burn+0.085
ivered+0.085
 Wolf+0.082
oo+0.080
Neg logit contributions
 dice-0.075
 contain-0.069
ents-0.068
 else-0.067
 clearly-0.060
Token yawn
Token ID40908
Feature activation+4.60e+00

Pos logit contributions
 dice+0.053
 contain+0.051
 else+0.050
ents+0.049
 clearly+0.045
Neg logit contributions
uck-0.061
burn-0.058
 Wolf-0.056
ivered-0.055
 Cow-0.055
Tokened
Token ID276
Feature activation+3.95e+00

Pos logit contributions
uck+0.234
 Cow+0.209
burn+0.207
 Ow+0.207
 Wolf+0.206
Neg logit contributions
 dice-0.501
 contain-0.492
 else-0.485
ents-0.483
 clearly-0.465
Token and
Token ID290
Feature activation-1.11e-01

Pos logit contributions
uck+0.394
ivered+0.371
burn+0.359
P+0.357
oo+0.353
Neg logit contributions
 dice-0.257
 contain-0.255
ents-0.242
 else-0.237
 clearly-0.234
Token him
Token ID683
Feature activation+2.55e+00

Pos logit contributions
ivered+0.227
uck+0.226
P+0.205
burn+0.199
ivering+0.194
Neg logit contributions
 else-0.259
 dice-0.255
ents-0.255
 contain-0.247
 clearly-0.244
Token from
Token ID422
Feature activation-2.75e+00

Pos logit contributions
uck+0.221
burn+0.206
ivered+0.199
 Cow+0.195
 Wolf+0.194
Neg logit contributions
 dice-0.215
 contain-0.195
ents-0.187
 else-0.183
 clearly-0.168
Token getting
Token ID1972
Feature activation+2.41e+00

Pos logit contributions
 dice+0.208
 else+0.200
 contain+0.198
ents+0.194
 clearly+0.188
Neg logit contributions
uck-0.244
ivered-0.234
P-0.215
 Ow-0.210
burn-0.206
Token a
Token ID257
Feature activation+5.03e-01

Pos logit contributions
ivered+0.206
uck+0.206
 Ow+0.184
oo+0.183
 Wolf+0.182
Neg logit contributions
 else-0.187
 contain-0.179
ents-0.178
 dice-0.175
 clearly-0.164
Token sun
Token ID4252
Feature activation+8.78e-01

Pos logit contributions
uck+0.042
ivered+0.040
burn+0.037
oo+0.036
 Cow+0.036
Neg logit contributions
 dice-0.043
 contain-0.041
 else-0.040
ents-0.038
 clearly-0.035
Tokenburn
Token ID10899
Feature activation+1.23e+01

Pos logit contributions
uck+0.041
ivered+0.036
 Wolf+0.034
 Cow+0.033
 Ow+0.033
Neg logit contributions
 dice-0.103
 contain-0.103
ents-0.098
 else-0.097
 clearly-0.092
Token.
Token ID13
Feature activation+4.09e-02

Pos logit contributions
uck+1.065
ivered+0.985
 Cow+0.949
 Wolf+0.945
burn+0.944
Neg logit contributions
 dice-0.986
 contain-0.946
ents-0.889
 else-0.886
 clearly-0.797
Token
Token ID198
Feature activation-1.81e-01

Pos logit contributions
uck+0.003
burn+0.003
ivered+0.003
oo+0.003
 Wolf+0.003
Neg logit contributions
 dice-0.003
ents-0.003
 else-0.003
 contain-0.003
 clearly-0.003
Token
Token ID198
Feature activation-1.85e-01

Pos logit contributions
 dice+0.018
 contain+0.017
ents+0.016
 else+0.016
 clearly+0.015
Neg logit contributions
uck-0.013
burn-0.012
ivered-0.012
 Wolf-0.011
 Cow-0.011
TokenTim
Token ID14967
Feature activation-5.52e-01

Pos logit contributions
 dice+0.016
 contain+0.015
 else+0.014
ents+0.014
 clearly+0.012
Neg logit contributions
uck-0.016
burn-0.015
ivered-0.015
 Wolf-0.014
 Cow-0.014
Tokenmy
Token ID1820
Feature activation+6.41e-01

Pos logit contributions
 dice+0.039
ents+0.038
 contain+0.037
 else+0.037
 count+0.033
Neg logit contributions
uck-0.049
 Wolf-0.046
 Ow-0.046
 Cow-0.045
burn-0.045
Token she
Token ID673
Feature activation-1.50e+00

Pos logit contributions
 dice+0.227
 contain+0.213
ents+0.200
 else+0.194
 clearly+0.179
Neg logit contributions
uck-0.231
ivered-0.206
burn-0.205
 Cow-0.202
 Wolf-0.202
Token knew
Token ID2993
Feature activation+3.55e-02

Pos logit contributions
 dice+0.141
 contain+0.135
ents+0.126
 else+0.119
ooters+0.117
Neg logit contributions
uck-0.116
ivered-0.102
 Wolf-0.101
 Cow-0.100
burn-0.100
Token the
Token ID262
Feature activation-2.86e-01

Pos logit contributions
uck+0.003
 Wolf+0.002
 Cow+0.002
burn+0.002
oo+0.002
Neg logit contributions
 dice-0.003
 contain-0.003
ents-0.003
ooters-0.003
 else-0.003
Token word
Token ID1573
Feature activation-1.69e+00

Pos logit contributions
 dice+0.026
 contain+0.025
 else+0.024
ents+0.023
 clearly+0.022
Neg logit contributions
uck-0.022
ivered-0.020
burn-0.020
oo-0.019
 Cow-0.019
Token '
Token ID705
Feature activation-5.78e+00

Pos logit contributions
 dice+0.160
 contain+0.154
ents+0.145
ooters+0.133
inations+0.131
Neg logit contributions
uck-0.125
 Wolf-0.113
 Cow-0.110
burn-0.107
 wolf-0.104
Tokenburn
Token ID10899
Feature activation+1.23e+01

Pos logit contributions
 dice+0.548
 contain+0.523
 else+0.500
ents+0.498
 clearly+0.458
Neg logit contributions
uck-0.419
ivered-0.381
burn-0.378
 Cow-0.368
 Wolf-0.368
Token'.
Token ID4458
Feature activation+1.64e+00

Pos logit contributions
uck+0.955
ivered+0.905
burn+0.888
 Cow+0.866
 Wolf+0.866
Neg logit contributions
 dice-1.049
 contain-0.995
 else-0.947
ents-0.933
 wins-0.875
Token Her
Token ID2332
Feature activation-4.10e-01

Pos logit contributions
uck+0.110
burn+0.097
ivered+0.097
 Wolf+0.094
 Cow+0.094
Neg logit contributions
 dice-0.168
 contain-0.161
ents-0.153
 else-0.152
 clearly-0.142
Token friends
Token ID2460
Feature activation+2.06e+00

Pos logit contributions
 dice+0.039
 else+0.036
 contain+0.036
ents+0.035
 clearly+0.033
Neg logit contributions
uck-0.030
ivered-0.027
burn-0.027
P-0.026
 Cow-0.025
Token were
Token ID547
Feature activation+5.84e-02

Pos logit contributions
uck+0.183
burn+0.167
ivered+0.167
 Cow+0.162
 Wolf+0.162
Neg logit contributions
 dice-0.167
 contain-0.155
ents-0.147
 else-0.145
 clearly-0.132
Token so
Token ID523
Feature activation-1.36e+00

Pos logit contributions
uck+0.005
ivered+0.004
burn+0.004
 Cow+0.004
oo+0.004
Neg logit contributions
 dice-0.005
 contain-0.005
ents-0.004
ooters-0.004
 else-0.004
Token to
Token ID284
Feature activation-1.79e-01

Pos logit contributions
uck+0.179
burn+0.159
 Cow+0.150
oo+0.149
 Wolf+0.146
Neg logit contributions
 dice-0.198
 contain-0.191
ents-0.178
 else-0.177
 clearly-0.169
Token put
Token ID1234
Feature activation-3.02e-01

Pos logit contributions
 dice+0.014
 contain+0.013
 else+0.012
ents+0.012
 clearly+0.011
Neg logit contributions
uck-0.017
ivered-0.016
burn-0.016
 Wolf-0.015
 Cow-0.015
Token on
Token ID319
Feature activation+2.85e+00

Pos logit contributions
 dice+0.026
 else+0.023
 contain+0.023
ents+0.023
 wins+0.021
Neg logit contributions
ivered-0.025
uck-0.024
burn-0.023
 Wolf-0.023
ivering-0.023
Token his
Token ID465
Feature activation+9.94e-01

Pos logit contributions
ivered+0.230
uck+0.229
burn+0.209
 Wolf+0.204
ivering+0.202
Neg logit contributions
 dice-0.257
 contain-0.225
 else-0.223
ents-0.222
 clearly-0.207
Token sun
Token ID4252
Feature activation+6.16e-01

Pos logit contributions
uck+0.079
burn+0.072
ivered+0.072
 Wolf+0.070
 Cow+0.069
Neg logit contributions
 dice-0.090
 contain-0.085
 else-0.080
ents-0.080
 clearly-0.073
Tokenburn
Token ID10899
Feature activation+1.21e+01

Pos logit contributions
uck+0.046
ivered+0.043
 Wolf+0.041
 Cow+0.040
burn+0.040
Neg logit contributions
 dice-0.057
 contain-0.054
ents-0.052
 else-0.051
 clearly-0.046
Token.
Token ID13
Feature activation+9.43e-03

Pos logit contributions
uck+1.083
ivered+1.016
 Wolf+0.972
burn+0.964
 Cow+0.959
Neg logit contributions
 dice-0.941
 contain-0.882
ents-0.843
 else-0.825
 clearly-0.741
Token Tim
Token ID5045
Feature activation-4.41e-01

Pos logit contributions
uck+0.001
ivered+0.001
burn+0.001
 Wolf+0.001
P+0.001
Neg logit contributions
 dice-0.001
 contain-0.001
ents-0.001
 else-0.001
 clearly-0.001
Tokenmy
Token ID1820
Feature activation+7.61e-01

Pos logit contributions
 dice+0.028
ents+0.026
 else+0.025
 count+0.023
 contain+0.023
Neg logit contributions
uck-0.042
 Wolf-0.040
 Cow-0.040
over-0.039
burn-0.039
Token thought
Token ID1807
Feature activation+7.76e-01

Pos logit contributions
uck+0.060
burn+0.055
ivered+0.054
 Wolf+0.053
 Cow+0.052
Neg logit contributions
 dice-0.069
 contain-0.065
 else-0.063
ents-0.062
 clearly-0.057
Token the
Token ID262
Feature activation-3.19e-01

Pos logit contributions
uck+0.047
oo+0.042
 Wolf+0.040
 Cow+0.040
burn+0.039
Neg logit contributions
 dice-0.082
 contain-0.082
ents-0.074
ooters-0.070
 else-0.070
Token,"
Token ID553
Feature activation+3.44e-01

Pos logit contributions
uck+0.159
ivered+0.150
burn+0.149
 Cow+0.145
 Wolf+0.144
Neg logit contributions
 dice-0.126
 contain-0.118
 else-0.112
ents-0.111
 clearly-0.100
Token Lily
Token ID20037
Feature activation-6.96e-01

Pos logit contributions
uck+0.025
 Cow+0.022
burn+0.022
 Wolf+0.022
ivered+0.021
Neg logit contributions
 dice-0.033
 contain-0.032
ents-0.030
 else-0.029
 clearly-0.028
Token says
Token ID1139
Feature activation-4.05e+00

Pos logit contributions
 dice+0.052
 contain+0.051
ents+0.044
 else+0.042
inations+0.041
Neg logit contributions
uck-0.069
 Cow-0.060
burn-0.059
 Wolf-0.059
 fever-0.059
Token,
Token ID11
Feature activation+6.32e-01

Pos logit contributions
 dice+0.335
 contain+0.321
 else+0.306
ents+0.305
 clearly+0.277
Neg logit contributions
uck-0.342
ivered-0.321
burn-0.317
 Wolf-0.308
 Cow-0.306
Token sh
Token ID427
Feature activation+3.84e+00

Pos logit contributions
uck+0.059
 Ow+0.052
ivered+0.052
P+0.050
 Wolf+0.049
Neg logit contributions
 contain-0.048
 dice-0.046
ents-0.045
 else-0.045
 clearly-0.044
Tokenivering
Token ID35598
Feature activation+1.14e+01

Pos logit contributions
 Wolf+0.219
 wolf+0.211
burn+0.211
uck+0.209
 Cow+0.207
Neg logit contributions
 dice-0.367
 contain-0.367
 else-0.367
ents-0.361
 clearly-0.354
Token.
Token ID13
Feature activation+3.83e-01

Pos logit contributions
uck+1.160
ivered+1.148
 Wolf+1.127
P+1.109
burn+1.103
Neg logit contributions
 contain-0.620
 dice-0.614
 else-0.603
ows-0.592
 clearly-0.583
Token
Token ID198
Feature activation-2.65e-01

Pos logit contributions
uck+0.034
ivered+0.034
burn+0.032
eled+0.030
P+0.030
Neg logit contributions
 dice-0.029
ents-0.028
 else-0.027
 contain-0.026
 clearly-0.026
Token
Token ID198
Feature activation-2.61e-01

Pos logit contributions
 dice+0.025
 contain+0.023
ents+0.023
 else+0.023
 clearly+0.021
Neg logit contributions
uck-0.020
burn-0.018
ivered-0.018
 Wolf-0.017
 Cow-0.017
Token"
Token ID1
Feature activation+1.98e-01

Pos logit contributions
 dice+0.024
 contain+0.023
ents+0.022
 else+0.022
 clearly+0.020
Neg logit contributions
uck-0.020
burn-0.018
ivered-0.018
 Wolf-0.018
 Cow-0.018
TokenLet
Token ID5756
Feature activation+3.82e+00

Pos logit contributions
uck+0.014
ivered+0.013
burn+0.013
 Wolf+0.012
oo+0.012
Neg logit contributions
 dice-0.019
 else-0.019
 contain-0.018
ents-0.017
 clearly-0.017
Token inside
Token ID2641
Feature activation-2.12e+00

Pos logit contributions
uck+0.139
burn+0.124
 Wolf+0.118
 Cow+0.117
oo+0.117
Neg logit contributions
 dice-0.184
 contain-0.175
ents-0.168
 else-0.162
 clearly-0.154
Token.
Token ID13
Feature activation+1.53e-01

Pos logit contributions
 dice+0.202
 contain+0.186
ents+0.181
 else+0.173
ooters+0.164
Neg logit contributions
uck-0.161
burn-0.145
 Wolf-0.141
 Cow-0.141
ivered-0.140
Token Lily
Token ID20037
Feature activation-6.53e-01

Pos logit contributions
uck+0.012
burn+0.011
ivered+0.011
 Wolf+0.011
 Cow+0.011
Neg logit contributions
 dice-0.014
 contain-0.013
ents-0.013
 else-0.012
 clearly-0.011
Token was
Token ID373
Feature activation+3.53e-01

Pos logit contributions
 dice+0.058
 contain+0.055
ents+0.052
 else+0.052
 clearly+0.047
Neg logit contributions
uck-0.053
burn-0.048
ivered-0.048
 Wolf-0.046
 Cow-0.046
Token sh
Token ID427
Feature activation+3.91e+00

Pos logit contributions
uck+0.035
 Ow+0.032
burn+0.032
P+0.031
ivered+0.031
Neg logit contributions
 else-0.023
 dice-0.023
 contain-0.023
ents-0.022
 clearly-0.020
Tokenivering
Token ID35598
Feature activation+1.14e+01

Pos logit contributions
 Wolf+0.148
 Ow+0.148
 wolf+0.145
 Cow+0.143
 fever+0.124
Neg logit contributions
 contain-0.464
ents-0.461
 else-0.445
 dice-0.435
ows-0.429
Token and
Token ID290
Feature activation-5.16e-04

Pos logit contributions
uck+1.115
ivered+1.049
P+1.036
 Cow+1.025
burn+1.018
Neg logit contributions
 contain-0.723
 dice-0.706
 else-0.672
 clearly-0.657
ents-0.648
Token her
Token ID607
Feature activation+9.12e-01

Pos logit contributions
 else+0.000
 contain+0.000
 clearly+0.000
ents+0.000
 dice+0.000
Neg logit contributions
uck-0.000
P-0.000
ivered-0.000
burn-0.000
 Ow-0.000
Token mom
Token ID1995
Feature activation+3.06e-01

Pos logit contributions
uck+0.072
ivered+0.064
burn+0.062
P+0.061
 Cow+0.060
Neg logit contributions
 dice-0.084
 contain-0.078
 else-0.077
ents-0.074
 clearly-0.070
Token wrapped
Token ID12908
Feature activation+6.51e-01

Pos logit contributions
uck+0.028
ivered+0.025
 Ow+0.025
burn+0.025
P+0.025
Neg logit contributions
 else-0.023
 contain-0.023
 dice-0.022
ents-0.022
 clearly-0.020
Token her
Token ID607
Feature activation+9.25e-01

Pos logit contributions
uck+0.050
ivered+0.049
burn+0.046
ivering+0.045
 Cow+0.044
Neg logit contributions
 dice-0.057
 contain-0.054
 else-0.052
ents-0.049
 clearly-0.049
Token inside
Token ID2641
Feature activation-2.14e+00

Pos logit contributions
uck+0.133
burn+0.120
 Wolf+0.115
 Cow+0.114
oo+0.113
Neg logit contributions
 dice-0.179
 contain-0.170
ents-0.163
 else-0.158
 clearly-0.149
Token.
Token ID13
Feature activation+1.11e-01

Pos logit contributions
 dice+0.202
 contain+0.186
ents+0.182
 else+0.173
ooters+0.164
Neg logit contributions
uck-0.163
burn-0.147
 Wolf-0.143
 Cow-0.143
ivered-0.142
Token Lily
Token ID20037
Feature activation-6.44e-01

Pos logit contributions
uck+0.009
burn+0.008
ivered+0.008
 Wolf+0.008
 Cow+0.008
Neg logit contributions
 dice-0.010
 contain-0.010
ents-0.009
 else-0.009
 clearly-0.008
Token was
Token ID373
Feature activation+3.23e-01

Pos logit contributions
 dice+0.058
 contain+0.055
ents+0.052
 else+0.052
 clearly+0.048
Neg logit contributions
uck-0.051
burn-0.046
ivered-0.046
 Wolf-0.045
 Cow-0.045
Token sh
Token ID427
Feature activation+3.87e+00

Pos logit contributions
uck+0.032
 Ow+0.029
burn+0.029
ivered+0.029
P+0.028
Neg logit contributions
 dice-0.021
 else-0.021
 contain-0.021
ents-0.021
 clearly-0.019
Tokenivering
Token ID35598
Feature activation+1.13e+01

Pos logit contributions
 Wolf+0.136
 Ow+0.134
 wolf+0.132
 Cow+0.131
uck+0.116
Neg logit contributions
 contain-0.474
ents-0.468
 else-0.453
 dice-0.446
 clearly-0.435
Token and
Token ID290
Feature activation-2.65e-02

Pos logit contributions
uck+1.110
ivered+1.042
P+1.031
 Cow+1.016
burn+1.012
Neg logit contributions
 contain-0.731
 dice-0.711
 else-0.674
 clearly-0.660
ents-0.654
Token her
Token ID607
Feature activation+9.51e-01

Pos logit contributions
 else+0.002
 contain+0.002
 clearly+0.002
ents+0.002
 dice+0.002
Neg logit contributions
uck-0.003
P-0.002
ivered-0.002
burn-0.002
 Ow-0.002
Token mom
Token ID1995
Feature activation+2.95e-01

Pos logit contributions
uck+0.073
ivered+0.065
burn+0.063
P+0.063
 Cow+0.062
Neg logit contributions
 dice-0.088
 contain-0.083
 else-0.081
ents-0.078
 clearly-0.074
Token wrapped
Token ID12908
Feature activation+6.38e-01

Pos logit contributions
uck+0.026
ivered+0.023
 Ow+0.023
burn+0.023
P+0.023
Neg logit contributions
 dice-0.022
 contain-0.022
 else-0.022
ents-0.022
 clearly-0.020
Token her
Token ID607
Feature activation+9.63e-01

Pos logit contributions
uck+0.049
ivered+0.048
burn+0.045
ivering+0.044
P+0.043
Neg logit contributions
 dice-0.055
 contain-0.053
 else-0.051
ents-0.048
 clearly-0.048
Token!
Token ID0
Feature activation+3.11e-01

Pos logit contributions
uck+0.251
ivered+0.233
burn+0.232
 Cow+0.227
 Wolf+0.226
Neg logit contributions
 dice-0.232
 contain-0.218
ents-0.209
 else-0.206
 clearly-0.189
Token
Token ID198
Feature activation-4.18e-01

Pos logit contributions
uck+0.024
burn+0.023
ivered+0.022
 Wolf+0.020
 Cow+0.020
Neg logit contributions
 dice-0.029
 contain-0.027
ents-0.026
 else-0.026
 clearly-0.024
Token
Token ID198
Feature activation-4.18e-01

Pos logit contributions
 dice+0.043
 contain+0.041
ents+0.038
 else+0.038
 clearly+0.035
Neg logit contributions
uck-0.029
burn-0.025
ivered-0.025
 Cow-0.025
 Wolf-0.025
Token"
Token ID1
Feature activation+1.82e-02

Pos logit contributions
 dice+0.040
 contain+0.038
 else+0.037
ents+0.036
 clearly+0.033
Neg logit contributions
uck-0.031
burn-0.028
ivered-0.028
 Wolf-0.027
 Cow-0.027
TokenY
Token ID56
Feature activation+2.25e+00

Pos logit contributions
uck+0.001
ivered+0.001
 Wolf+0.001
oo+0.001
burn+0.001
Neg logit contributions
 dice-0.002
 else-0.002
 contain-0.002
ents-0.002
 clearly-0.002
Tokenuck
Token ID1347
Feature activation+1.13e+01

Pos logit contributions
 Wolf+0.050
 wolf+0.047
 Cow+0.047
 suffering+0.037
ivered+0.036
Neg logit contributions
ents-0.316
 dice-0.314
 contain-0.293
 else-0.292
hest-0.283
Token!"
Token ID2474
Feature activation+5.77e-02

Pos logit contributions
uck+1.022
burn+1.004
ivered+0.991
 Wolf+0.953
 Cow+0.947
Neg logit contributions
 dice-0.850
 contain-0.781
 else-0.771
ents-0.763
 clearly-0.667
Token the
Token ID262
Feature activation-3.15e-01

Pos logit contributions
uck+0.005
burn+0.005
ivered+0.005
 Cow+0.004
 Wolf+0.004
Neg logit contributions
 dice-0.005
 contain-0.005
ents-0.004
 else-0.004
 clearly-0.004
Token owl
Token ID39610
Feature activation+5.12e-03

Pos logit contributions
 dice+0.027
ents+0.026
 else+0.025
 contain+0.023
 clearly+0.022
Neg logit contributions
uck-0.026
ivered-0.025
burn-0.023
oo-0.022
 Ow-0.021
Token said
Token ID531
Feature activation-1.11e+00

Pos logit contributions
uck+0.000
ivered+0.000
 Cow+0.000
burn+0.000
oo+0.000
Neg logit contributions
 dice-0.000
 contain-0.000
ents-0.000
 else-0.000
ooters-0.000
Token.
Token ID13
Feature activation-1.89e-01

Pos logit contributions
 dice+0.091
 contain+0.087
ents+0.082
 else+0.082
 clearly+0.075
Neg logit contributions
uck-0.097
burn-0.090
ivered-0.089
 Wolf-0.086
 Cow-0.086
Token.
Token ID13
Feature activation-4.39e-02

Pos logit contributions
uck+0.232
burn+0.214
ivered+0.214
 Cow+0.208
 Wolf+0.208
Neg logit contributions
 dice-0.221
 contain-0.207
ents-0.196
 else-0.196
 clearly-0.178
Token
Token ID198
Feature activation-2.57e-01

Pos logit contributions
 dice+0.004
ents+0.004
 else+0.004
 contain+0.004
 clearly+0.003
Neg logit contributions
uck-0.003
burn-0.003
ivered-0.003
 Wolf-0.003
oo-0.003
Token
Token ID198
Feature activation-2.63e-01

Pos logit contributions
 dice+0.026
 contain+0.024
ents+0.023
 else+0.023
 clearly+0.021
Neg logit contributions
uck-0.018
burn-0.016
ivered-0.016
 Wolf-0.016
 Cow-0.016
Token"
Token ID1
Feature activation+1.69e-01

Pos logit contributions
 dice+0.025
 contain+0.023
 else+0.023
ents+0.023
 clearly+0.020
Neg logit contributions
uck-0.020
burn-0.018
ivered-0.018
 Wolf-0.018
 Cow-0.018
TokenY
Token ID56
Feature activation+2.21e+00

Pos logit contributions
uck+0.012
ivered+0.011
oo+0.010
burn+0.010
 Wolf+0.010
Neg logit contributions
 dice-0.017
 else-0.016
 contain-0.014
 clearly-0.014
ents-0.014
Tokenuck
Token ID1347
Feature activation+1.13e+01

Pos logit contributions
 Cow+0.073
 Wolf+0.070
 wolf+0.066
 suffering+0.063
 lamb+0.059
Neg logit contributions
 dice-0.284
ents-0.281
 contain-0.260
 else-0.260
hest-0.251
Token!
Token ID0
Feature activation+3.75e-01

Pos logit contributions
uck+0.981
burn+0.940
ivered+0.931
 Wolf+0.900
 Cow+0.897
Neg logit contributions
 dice-0.913
 contain-0.844
 else-0.818
ents-0.818
 clearly-0.724
Token My
Token ID2011
Feature activation-1.73e+00

Pos logit contributions
ivered+0.035
P+0.032
eled+0.031
burn+0.031
ivery+0.031
Neg logit contributions
 dice-0.026
ents-0.024
 wins-0.023
 else-0.022
 Lisa-0.021
Token t
Token ID256
Feature activation-1.25e+00

Pos logit contributions
 dice+0.150
 else+0.131
 contain+0.129
ents+0.125
 count+0.120
Neg logit contributions
uck-0.143
ivered-0.137
P-0.131
oo-0.122
burn-0.122
Tokeneddy
Token ID21874
Feature activation+4.24e+00

Pos logit contributions
 dice+0.129
ents+0.128
 else+0.121
 contain+0.117
bars+0.113
Neg logit contributions
uck-0.065
 Cow-0.064
ivered-0.063
P-0.061
 Wolf-0.061
Token bear
Token ID6842
Feature activation+2.25e+00

Pos logit contributions
uck+0.312
ivered+0.288
 Cow+0.276
burn+0.272
P+0.269
Neg logit contributions
 dice-0.383
 else-0.340
 counts-0.330
 count-0.327
ents-0.319
Token around
Token ID1088
Feature activation-4.05e+00

Pos logit contributions
uck+0.265
burn+0.230
ivered+0.230
 Wolf+0.224
 Cow+0.223
Neg logit contributions
 dice-0.279
 contain-0.258
ents-0.248
 else-0.243
ooters-0.227
Token her
Token ID607
Feature activation+8.52e-01

Pos logit contributions
 dice+0.385
 contain+0.363
 else+0.347
ents+0.345
 clearly+0.321
Neg logit contributions
uck-0.307
ivered-0.280
burn-0.279
 Cow-0.263
 Wolf-0.262
Token and
Token ID290
Feature activation-1.08e-01

Pos logit contributions
uck+0.071
burn+0.064
ivered+0.064
 Wolf+0.062
 Cow+0.062
Neg logit contributions
 dice-0.075
 contain-0.071
 else-0.067
ents-0.067
 clearly-0.061
Token stopped
Token ID5025
Feature activation-1.39e+00

Pos logit contributions
 dice+0.008
 contain+0.008
 else+0.008
ents+0.007
 clearly+0.007
Neg logit contributions
uck-0.010
ivered-0.009
burn-0.009
oo-0.009
 Wolf-0.009
Token sh
Token ID427
Feature activation+3.78e+00

Pos logit contributions
 else+0.111
 clearly+0.106
ents+0.106
 contain+0.102
 dice+0.101
Neg logit contributions
uck-0.117
ivered-0.114
burn-0.106
P-0.102
 Ow-0.099
Tokenivering
Token ID35598
Feature activation+1.13e+01

Pos logit contributions
 wolf+0.173
 Wolf+0.169
 Ow+0.159
 Cow+0.150
 sausage+0.145
Neg logit contributions
ows-0.409
ents-0.409
 contain-0.400
rig-0.392
 else-0.389
Token.
Token ID13
Feature activation-1.09e-01

Pos logit contributions
uck+1.163
ivered+1.109
P+1.108
 Cow+1.076
burn+1.067
Neg logit contributions
 contain-0.682
 dice-0.658
ents-0.607
 clearly-0.603
 else-0.601
Token Lily
Token ID20037
Feature activation-7.27e-01

Pos logit contributions
 dice+0.009
 contain+0.008
ents+0.008
 else+0.008
 clearly+0.007
Neg logit contributions
uck-0.009
ivered-0.009
burn-0.009
oo-0.008
 Wolf-0.008
Token was
Token ID373
Feature activation+1.85e-01

Pos logit contributions
 dice+0.062
 contain+0.058
ents+0.056
 else+0.055
 clearly+0.050
Neg logit contributions
uck-0.061
burn-0.056
ivered-0.056
 Wolf-0.054
 Cow-0.054
Token so
Token ID523
Feature activation-1.37e+00

Pos logit contributions
uck+0.020
burn+0.018
P+0.018
 Ow+0.018
 Wolf+0.018
Neg logit contributions
 else-0.011
 contain-0.011
ents-0.010
 clearly-0.010
 dice-0.010
Token happy
Token ID3772
Feature activation-1.45e+00

Pos logit contributions
 else+0.077
 contain+0.071
 clearly+0.060
ents+0.060
 dice+0.059
Neg logit contributions
uck-0.153
burn-0.151
 Wolf-0.149
 Cow-0.142
 wolf-0.141
Tokenw
Token ID86
Feature activation+7.99e+00

Pos logit contributions
uck+0.037
burn+0.036
ivered+0.034
 Wolf+0.033
 Cow+0.033
Neg logit contributions
 dice-0.121
 contain-0.114
 else-0.114
ents-0.110
 clearly-0.104
Token,
Token ID11
Feature activation+6.07e-01

Pos logit contributions
uck+0.665
P+0.664
ivered+0.657
 Wolf+0.656
burn+0.655
Neg logit contributions
 dice-0.564
ents-0.555
 else-0.550
ows-0.474
 contain-0.473
Token gross
Token ID10319
Feature activation+3.00e+00

Pos logit contributions
uck+0.044
ivered+0.040
burn+0.039
 Wolf+0.037
 Cow+0.037
Neg logit contributions
 dice-0.060
 contain-0.057
 else-0.055
ents-0.054
 clearly-0.050
Token.
Token ID13
Feature activation+7.79e-02

Pos logit contributions
eled+0.288
ivered+0.286
 Wolf+0.286
burn+0.285
 Amen+0.285
Neg logit contributions
 dice-0.174
 else-0.169
 wins-0.152
ents-0.150
ows-0.141
Token Y
Token ID575
Feature activation+2.29e-01

Pos logit contributions
ivered+0.007
burn+0.007
eled+0.007
ivering+0.007
 wolf+0.006
Neg logit contributions
 dice-0.006
 else-0.005
 wins-0.005
 contain-0.005
ents-0.005
Tokenuck
Token ID1347
Feature activation+1.13e+01

Pos logit contributions
 Cow+0.007
 Wolf+0.006
 wolf+0.006
eled+0.005
 fever+0.005
Neg logit contributions
 dice-0.028
ents-0.028
 else-0.028
 contain-0.027
hest-0.026
Token."
Token ID526
Feature activation+1.05e+00

Pos logit contributions
uck+0.984
burn+0.919
ivered+0.915
 Wolf+0.888
 Cow+0.886
Neg logit contributions
 dice-0.926
 contain-0.863
 else-0.834
ents-0.826
 clearly-0.748
Token
Token ID198
Feature activation-2.93e-01

Pos logit contributions
uck+0.076
burn+0.068
ivered+0.068
 Wolf+0.065
 Cow+0.065
Neg logit contributions
 dice-0.103
 contain-0.098
ents-0.094
 else-0.093
 clearly-0.086
Token
Token ID198
Feature activation-2.99e-01

Pos logit contributions
 dice+0.029
 contain+0.028
ents+0.027
 else+0.026
 clearly+0.025
Neg logit contributions
uck-0.020
burn-0.018
ivered-0.018
 Cow-0.018
 Wolf-0.018
TokenThey
Token ID2990
Feature activation+2.07e+00

Pos logit contributions
 dice+0.026
 contain+0.024
 else+0.023
ents+0.023
 clearly+0.021
Neg logit contributions
uck-0.025
burn-0.023
ivered-0.023
 Wolf-0.022
 Cow-0.022
Token looked
Token ID3114
Feature activation+1.92e-01

Pos logit contributions
uck+0.182
burn+0.166
ivered+0.166
 Wolf+0.160
oo+0.158
Neg logit contributions
 dice-0.172
 contain-0.165
 else-0.157
ents-0.156
 clearly-0.139
Token the
Token ID262
Feature activation-2.71e-01

Pos logit contributions
 dice+0.027
ents+0.025
 else+0.024
 contain+0.022
 clearly+0.020
Neg logit contributions
ivered-0.045
uck-0.042
ivering-0.041
ivery-0.041
burn-0.041
Token juice
Token ID13135
Feature activation+2.73e+00

Pos logit contributions
 dice+0.025
 contain+0.023
 else+0.022
ents+0.021
 clearly+0.019
Neg logit contributions
uck-0.021
ivered-0.021
 Wolf-0.021
burn-0.020
 Cow-0.020
Token and
Token ID290
Feature activation-2.20e-01

Pos logit contributions
uck+0.233
ivered+0.217
burn+0.214
 Wolf+0.211
 Cow+0.210
Neg logit contributions
 dice-0.228
 contain-0.214
ents-0.204
 else-0.202
 clearly-0.184
Token said
Token ID531
Feature activation-1.15e+00

Pos logit contributions
 dice+0.019
 contain+0.018
 else+0.018
ents+0.017
 clearly+0.016
Neg logit contributions
uck-0.018
ivered-0.017
burn-0.017
 Wolf-0.017
oo-0.017
Token y
Token ID331
Feature activation+1.42e+00

Pos logit contributions
 else+0.085
 dice+0.079
ents+0.073
 contain+0.069
 clearly+0.066
Neg logit contributions
ivered-0.108
burn-0.107
uck-0.102
eled-0.102
ivering-0.101
Tokenuck
Token ID1347
Feature activation+1.13e+01

Pos logit contributions
 Wolf+0.086
 Cow+0.081
 wolf+0.078
 Squirrel+0.075
burn+0.072
Neg logit contributions
ents-0.140
 dice-0.133
 else-0.129
 contain-0.127
 wins-0.117
Token again
Token ID757
Feature activation+4.37e-01

Pos logit contributions
uck+1.013
burn+0.947
ivered+0.947
 Wolf+0.915
 Cow+0.910
Neg logit contributions
 dice-0.902
 contain-0.843
 else-0.808
ents-0.808
 clearly-0.732
Token.
Token ID13
Feature activation-2.04e-01

Pos logit contributions
uck+0.033
burn+0.029
 Cow+0.029
ivered+0.029
oo+0.028
Neg logit contributions
 dice-0.042
 contain-0.039
ents-0.036
 else-0.036
ooters-0.033
Token She
Token ID1375
Feature activation+1.70e+00

Pos logit contributions
 dice+0.016
 else+0.015
ents+0.015
 contain+0.014
 clearly+0.014
Neg logit contributions
burn-0.018
ivered-0.018
uck-0.017
eled-0.017
ivering-0.016
Token wanted
Token ID2227
Feature activation+1.09e+00

Pos logit contributions
uck+0.135
burn+0.123
ivered+0.122
 Wolf+0.119
 Cow+0.118
Neg logit contributions
 dice-0.154
 contain-0.146
 else-0.139
ents-0.139
 clearly-0.127
Token water
Token ID1660
Feature activation+2.51e+00

Pos logit contributions
uck+0.090
ivered+0.082
burn+0.082
 Wolf+0.079
 Cow+0.079
Neg logit contributions
 dice-0.096
 contain-0.091
ents-0.087
 else-0.086
 clearly-0.079
Token is
Token ID318
Feature activation-2.46e-01

Pos logit contributions
uck+0.086
burn+0.077
ivered+0.074
 Wolf+0.073
 Cow+0.073
Neg logit contributions
 dice-0.094
 contain-0.094
ents-0.084
 else-0.080
ooters-0.077
Token bitter
Token ID12922
Feature activation+4.19e+00

Pos logit contributions
 dice+0.023
 contain+0.022
ents+0.020
 else+0.020
ooters+0.019
Neg logit contributions
uck-0.018
burn-0.017
 Cow-0.017
 Wolf-0.016
 fever-0.016
Token,
Token ID11
Feature activation+6.51e-01

Pos logit contributions
uck+0.356
burn+0.325
ivered+0.321
 Wolf+0.313
 Cow+0.313
Neg logit contributions
 dice-0.358
 contain-0.338
ents-0.318
 else-0.315
 clearly-0.288
Token like
Token ID588
Feature activation-3.06e+00

Pos logit contributions
uck+0.055
burn+0.051
ivered+0.050
 Cow+0.049
 Wolf+0.049
Neg logit contributions
 dice-0.055
 contain-0.052
ents-0.049
 else-0.048
 clearly-0.044
Token y
Token ID331
Feature activation+1.45e+00

Pos logit contributions
 dice+0.280
 contain+0.264
 else+0.253
ents+0.252
 clearly+0.231
Neg logit contributions
uck-0.241
ivered-0.220
burn-0.218
 Wolf-0.209
 Cow-0.208
Tokenuck
Token ID1347
Feature activation+1.13e+01

Pos logit contributions
 Wolf+0.100
 Cow+0.098
ivered+0.095
 wolf+0.094
 Squirrel+0.092
Neg logit contributions
ents-0.126
 dice-0.126
 contain-0.116
 else-0.113
hest-0.106
Token.
Token ID13
Feature activation+4.43e-02

Pos logit contributions
uck+0.954
burn+0.870
ivered+0.864
 Wolf+0.841
 Cow+0.840
Neg logit contributions
 dice-0.968
 contain-0.915
ents-0.864
 else-0.856
 clearly-0.784
Token You
Token ID921
Feature activation+1.78e+00

Pos logit contributions
uck+0.003
ivered+0.003
burn+0.003
eled+0.003
 Wolf+0.003
Neg logit contributions
 dice-0.004
 contain-0.004
ents-0.004
 else-0.004
 clearly-0.003
Token should
Token ID815
Feature activation-5.74e-01

Pos logit contributions
uck+0.155
burn+0.141
ivered+0.138
 Cow+0.138
 Wolf+0.138
Neg logit contributions
 dice-0.146
 contain-0.138
ents-0.127
 else-0.122
 clearly-0.117
Token wear
Token ID5806
Feature activation+7.65e-01

Pos logit contributions
 dice+0.039
 contain+0.038
 else+0.036
ents+0.034
 count+0.032
Neg logit contributions
uck-0.057
ivered-0.055
 Wolf-0.052
burn-0.052
eled-0.051
Token something
Token ID1223
Feature activation+3.36e+00

Pos logit contributions
uck+0.060
ivered+0.056
burn+0.055
 Wolf+0.053
 Cow+0.052
Neg logit contributions
 dice-0.070
 contain-0.065
 else-0.064
ents-0.063
 clearly-0.058
Token the
Token ID262
Feature activation-2.85e-01

Pos logit contributions
 dice+0.102
 contain+0.097
 else+0.096
 clearly+0.088
ents+0.087
Neg logit contributions
uck-0.144
ivered-0.141
burn-0.128
P-0.125
ivering-0.123
Token fire
Token ID2046
Feature activation+1.45e+00

Pos logit contributions
 dice+0.028
 contain+0.025
 else+0.025
ents+0.024
 clearly+0.023
Neg logit contributions
uck-0.021
ivered-0.020
burn-0.018
oo-0.018
P-0.017
Token and
Token ID290
Feature activation-1.74e-01

Pos logit contributions
uck+0.122
ivered+0.111
burn+0.111
 Wolf+0.107
 Cow+0.107
Neg logit contributions
 dice-0.123
 contain-0.116
ents-0.110
 else-0.109
 clearly-0.100
Token stopped
Token ID5025
Feature activation-1.37e+00

Pos logit contributions
 dice+0.015
 contain+0.014
 else+0.013
ents+0.013
 clearly+0.012
Neg logit contributions
uck-0.015
ivered-0.014
burn-0.013
oo-0.013
 Wolf-0.013
Token sh
Token ID427
Feature activation+3.79e+00

Pos logit contributions
 else+0.102
ents+0.099
 clearly+0.098
 dice+0.096
 contain+0.094
Neg logit contributions
uck-0.121
ivered-0.116
P-0.109
burn-0.108
 Ow-0.102
Tokenivering
Token ID35598
Feature activation+1.13e+01

Pos logit contributions
 Ow+0.164
 Wolf+0.162
 wolf+0.160
P+0.159
 Squirrel+0.155
Neg logit contributions
ows-0.403
ents-0.402
 clearly-0.383
 else-0.381
rig-0.374
Token.
Token ID13
Feature activation-1.92e-01

Pos logit contributions
uck+1.111
ivered+1.047
burn+1.015
P+1.014
 Cow+1.008
Neg logit contributions
 dice-0.752
 contain-0.744
ents-0.682
 else-0.682
 clearly-0.660
Token
Token ID198
Feature activation-3.66e-01

Pos logit contributions
 dice+0.015
 else+0.014
ents+0.013
 contain+0.012
 clearly+0.012
Neg logit contributions
ivered-0.018
uck-0.017
burn-0.017
oo-0.016
ivering-0.016
Token
Token ID198
Feature activation-3.62e-01

Pos logit contributions
 dice+0.037
 contain+0.036
ents+0.034
 else+0.033
 clearly+0.031
Neg logit contributions
uck-0.025
burn-0.022
ivered-0.022
 Cow-0.022
 Wolf-0.022
Token"
Token ID1
Feature activation+1.19e-01

Pos logit contributions
 dice+0.031
 else+0.030
 contain+0.028
ents+0.028
 clearly+0.025
Neg logit contributions
uck-0.030
burn-0.029
ivered-0.028
 Wolf-0.027
 Cow-0.026
TokenDid
Token ID11633
Feature activation+3.15e+00

Pos logit contributions
uck+0.009
ivered+0.008
oo+0.008
aged+0.007
 fever+0.007
Neg logit contributions
 dice-0.011
 else-0.010
 contain-0.010
 clearly-0.009
ents-0.009
Token.
Token ID13
Feature activation-1.04e-01

Pos logit contributions
 dice+0.028
ents+0.026
 contain+0.026
 else+0.026
 clearly+0.024
Neg logit contributions
uck-0.038
ivered-0.037
burn-0.036
 Wolf-0.035
 Cow-0.035
Token
Token ID198
Feature activation-2.90e-01

Pos logit contributions
 dice+0.008
 else+0.008
ents+0.008
 clearly+0.007
 contain+0.007
Neg logit contributions
ivered-0.009
burn-0.009
uck-0.008
eled-0.008
ivering-0.008
Token
Token ID198
Feature activation-2.96e-01

Pos logit contributions
 dice+0.028
 contain+0.026
ents+0.025
 else+0.025
 clearly+0.023
Neg logit contributions
uck-0.021
burn-0.020
ivered-0.019
 Wolf-0.019
 Cow-0.018
Token"
Token ID1
Feature activation+2.22e-01

Pos logit contributions
 dice+0.027
 else+0.025
 contain+0.024
ents+0.024
 clearly+0.021
Neg logit contributions
burn-0.023
uck-0.023
ivered-0.022
 Wolf-0.022
 Cow-0.021
TokenY
Token ID56
Feature activation+2.24e+00

Pos logit contributions
uck+0.016
ivered+0.016
oo+0.015
 Wolf+0.015
 Cow+0.014
Neg logit contributions
 dice-0.020
 else-0.019
 contain-0.017
 clearly-0.016
 hex-0.016
Tokenuck
Token ID1347
Feature activation+1.13e+01

Pos logit contributions
 wolf+0.066
 Wolf+0.066
 Cow+0.061
 suffering+0.056
 Squirrel+0.053
Neg logit contributions
ents-0.295
 dice-0.294
 else-0.272
 contain-0.270
hest-0.268
Token!
Token ID0
Feature activation+3.45e-01

Pos logit contributions
uck+1.005
burn+1.003
ivered+0.992
 Wolf+0.955
 Cow+0.946
Neg logit contributions
 dice-0.850
 else-0.773
 contain-0.770
ents-0.768
 clearly-0.665
Token I
Token ID314
Feature activation-1.09e+00

Pos logit contributions
ivered+0.032
eled+0.029
 wolf+0.029
burn+0.027
dd+0.027
Neg logit contributions
 dice-0.024
ents-0.023
 wins-0.022
 else-0.021
 hex-0.020
Token don
Token ID836
Feature activation+6.72e-01

Pos logit contributions
 else+0.086
 dice+0.084
 contain+0.080
 believe+0.072
ents+0.072
Neg logit contributions
burn-0.092
 Wolf-0.092
uck-0.091
 Cow-0.090
ivered-0.088
Token't
Token ID470
Feature activation+1.08e+00

Pos logit contributions
uck+0.052
ivered+0.051
burn+0.050
 Cow+0.049
 Wolf+0.048
Neg logit contributions
 dice-0.058
 contain-0.055
ents-0.053
 else-0.053
 clearly-0.048
Token like
Token ID588
Feature activation-3.12e+00

Pos logit contributions
uck+0.127
ivered+0.122
burn+0.122
 Wolf+0.120
 Cow+0.119
Neg logit contributions
 dice-0.053
 contain-0.053
 else-0.048
ents-0.043
 believe-0.040
Token Sara
Token ID24799
Feature activation-6.66e-01

Pos logit contributions
uck+0.064
ivered+0.062
burn+0.059
ivering+0.058
 Wolf+0.056
Neg logit contributions
 dice-0.050
 contain-0.046
 else-0.046
ents-0.042
 clearly-0.040
Token and
Token ID290
Feature activation+2.22e-02

Pos logit contributions
 dice+0.051
ents+0.047
 contain+0.047
 else+0.047
 clearly+0.042
Neg logit contributions
uck-0.059
burn-0.056
ivered-0.055
oo-0.053
 Wolf-0.053
Token Tom
Token ID4186
Feature activation-3.16e+00

Pos logit contributions
uck+0.003
ivered+0.003
ivering+0.002
burn+0.002
ivery+0.002
Neg logit contributions
 dice-0.001
 contain-0.001
ents-0.001
 else-0.001
 count-0.001
Token were
Token ID547
Feature activation+1.85e-01

Pos logit contributions
 dice+0.285
 contain+0.282
 else+0.268
ents+0.265
 clearly+0.244
Neg logit contributions
uck-0.246
burn-0.223
ivered-0.220
oo-0.213
 Wolf-0.209
Token sh
Token ID427
Feature activation+3.84e+00

Pos logit contributions
uck+0.015
 Ow+0.014
P+0.014
ivered+0.014
burn+0.014
Neg logit contributions
 else-0.015
ents-0.014
 dice-0.014
 contain-0.014
 clearly-0.013
Tokenivering
Token ID35598
Feature activation+1.13e+01

Pos logit contributions
 Ow+0.158
 Wolf+0.154
 wolf+0.149
 Cow+0.146
uck+0.135
Neg logit contributions
 contain-0.450
ents-0.449
 else-0.428
rig-0.414
 dice-0.412
Token in
Token ID287
Feature activation-3.53e-01

Pos logit contributions
uck+1.152
ivered+1.079
P+1.072
 Ow+1.031
eled+1.030
Neg logit contributions
 contain-0.691
 dice-0.639
 else-0.617
 clearly-0.616
ents-0.586
Token their
Token ID511
Feature activation-5.44e-01

Pos logit contributions
 dice+0.024
 contain+0.023
 else+0.021
 clearly+0.019
 count+0.019
Neg logit contributions
uck-0.037
ivered-0.036
burn-0.033
oo-0.032
P-0.032
Token bed
Token ID3996
Feature activation+1.68e+00

Pos logit contributions
 dice+0.040
 else+0.038
 contain+0.036
 seats+0.035
ents+0.033
Neg logit contributions
uck-0.051
ivered-0.049
burn-0.046
oo-0.046
P-0.044
Token.
Token ID13
Feature activation-8.18e-02

Pos logit contributions
uck+0.147
burn+0.135
ivered+0.134
 Wolf+0.131
 Cow+0.130
Neg logit contributions
 dice-0.135
 contain-0.130
ents-0.122
 else-0.122
 clearly-0.111
Token They
Token ID1119
Feature activation+1.72e+00

Pos logit contributions
 dice+0.007
 contain+0.007
ents+0.006
 else+0.006
 clearly+0.006
Neg logit contributions
uck-0.007
burn-0.007
ivered-0.006
P-0.006
 Wolf-0.006
Token noses
Token ID41760
Feature activation+2.33e+00

Pos logit contributions
 dice+0.071
 contain+0.067
ents+0.064
 else+0.063
 clearly+0.059
Neg logit contributions
uck-0.054
ivered-0.048
burn-0.048
 Wolf-0.047
 Cow-0.047
Token.
Token ID13
Feature activation+2.07e-01

Pos logit contributions
uck+0.202
ivered+0.185
burn+0.185
 Wolf+0.182
 Cow+0.181
Neg logit contributions
 dice-0.191
 contain-0.182
ents-0.173
 else-0.171
 clearly-0.157
Token They
Token ID1119
Feature activation+1.79e+00

Pos logit contributions
uck+0.016
burn+0.015
ivered+0.015
 Wolf+0.014
 Cow+0.014
Neg logit contributions
 dice-0.020
 contain-0.018
ents-0.018
 else-0.017
 clearly-0.016
Token say
Token ID910
Feature activation-1.94e+00

Pos logit contributions
uck+0.155
burn+0.142
ivered+0.140
 Cow+0.140
 fever+0.139
Neg logit contributions
 dice-0.145
 contain-0.131
ents-0.127
 else-0.122
ooters-0.115
Token y
Token ID331
Feature activation+1.40e+00

Pos logit contributions
 else+0.174
 dice+0.174
ents+0.162
 contain+0.160
 clearly+0.148
Neg logit contributions
ivered-0.147
uck-0.143
burn-0.136
eled-0.134
ivering-0.129
Tokenuck
Token ID1347
Feature activation+1.13e+01

Pos logit contributions
 Wolf+0.076
 Cow+0.072
 wolf+0.070
burn+0.067
 Sleep+0.067
Neg logit contributions
ents-0.147
 dice-0.136
 else-0.133
 contain-0.131
ows-0.126
Token.
Token ID13
Feature activation+1.91e-01

Pos logit contributions
uck+1.020
ivered+0.950
burn+0.949
 Wolf+0.916
 Cow+0.911
Neg logit contributions
 dice-0.898
 contain-0.840
ents-0.798
 else-0.797
 clearly-0.722
Token They
Token ID1119
Feature activation+1.77e+00

Pos logit contributions
uck+0.015
ivered+0.015
burn+0.015
 Wolf+0.013
ivering+0.013
Neg logit contributions
 dice-0.017
 contain-0.016
ents-0.015
 else-0.015
 clearly-0.014
Token throw
Token ID3714
Feature activation-4.88e-01

Pos logit contributions
uck+0.158
burn+0.145
ivered+0.143
 Cow+0.141
 Wolf+0.139
Neg logit contributions
 dice-0.141
 contain-0.130
ents-0.124
 else-0.121
 clearly-0.111
Token the
Token ID262
Feature activation-3.67e-01

Pos logit contributions
 dice+0.042
 contain+0.038
 else+0.038
ents+0.037
 clearly+0.034
Neg logit contributions
uck-0.041
ivered-0.039
burn-0.037
 Wolf-0.036
 Cow-0.036
Token doll
Token ID3654
Feature activation-3.66e+00

Pos logit contributions
 dice+0.033
 contain+0.029
 else+0.029
ents+0.028
 clearly+0.025
Neg logit contributions
uck-0.031
ivered-0.029
oo-0.027
burn-0.027
 Wolf-0.027
Token.
Token ID13
Feature activation-2.75e-02

Pos logit contributions
 dice+0.062
 contain+0.058
ents+0.054
 else+0.053
ooters+0.050
Neg logit contributions
uck-0.046
burn-0.041
 Cow-0.040
ivered-0.040
oo-0.039
Token
Token ID198
Feature activation-2.10e-01

Pos logit contributions
 dice+0.003
 contain+0.003
ents+0.003
 else+0.003
 clearly+0.002
Neg logit contributions
uck-0.002
burn-0.002
 Cow-0.002
ivered-0.002
 Wolf-0.002
Token
Token ID198
Feature activation-2.11e-01

Pos logit contributions
 dice+0.023
 contain+0.022
ents+0.021
 else+0.020
 clearly+0.019
Neg logit contributions
uck-0.013
 Cow-0.011
oo-0.011
ivered-0.011
 Wolf-0.011
Token"
Token ID1
Feature activation+2.17e-01

Pos logit contributions
 dice+0.022
 contain+0.021
ents+0.020
 else+0.020
 clearly+0.018
Neg logit contributions
uck-0.014
burn-0.013
ivered-0.012
 Cow-0.012
 Wolf-0.012
TokenY
Token ID56
Feature activation+2.19e+00

Pos logit contributions
uck+0.014
ivered+0.013
burn+0.012
oo+0.012
 Wolf+0.012
Neg logit contributions
 dice-0.024
 contain-0.022
 else-0.022
ents-0.020
 clearly-0.020
Tokenuck
Token ID1347
Feature activation+1.13e+01

Pos logit contributions
 Wolf+0.078
 Cow+0.076
ivered+0.075
burn+0.074
 wolf+0.071
Neg logit contributions
 dice-0.279
ents-0.267
 contain-0.260
 else-0.252
 count-0.244
Token,
Token ID11
Feature activation+5.35e-01

Pos logit contributions
uck+0.929
burn+0.859
ivered+0.852
 Wolf+0.824
 Cow+0.823
Neg logit contributions
 dice-0.988
 contain-0.929
ents-0.886
 else-0.882
 clearly-0.803
Token Ben
Token ID3932
Feature activation-1.97e-01

Pos logit contributions
uck+0.046
ivered+0.042
burn+0.042
 Wolf+0.041
 Cow+0.040
Neg logit contributions
 dice-0.045
 contain-0.043
ents-0.040
 else-0.040
 clearly-0.037
Token,
Token ID11
Feature activation+5.44e-01

Pos logit contributions
 dice+0.014
 else+0.013
ents+0.013
 contain+0.012
 counts+0.012
Neg logit contributions
uck-0.018
burn-0.017
ivered-0.017
eled-0.016
 Wolf-0.016
Token put
Token ID1234
Feature activation-1.97e-01

Pos logit contributions
uck+0.049
ivered+0.047
burn+0.046
 Wolf+0.044
 Cow+0.044
Neg logit contributions
 dice-0.044
 contain-0.041
 else-0.040
ents-0.038
 clearly-0.034
Token it
Token ID340
Feature activation+1.12e+00

Pos logit contributions
 dice+0.016
 else+0.015
 contain+0.014
ents+0.014
 wins+0.013
Neg logit contributions
ivered-0.017
burn-0.017
uck-0.016
 Wolf-0.016
 Cow-0.015
Token.
Token ID13
Feature activation+5.94e-04

Pos logit contributions
 dice+0.084
 contain+0.081
 else+0.077
ents+0.077
 clearly+0.070
Neg logit contributions
uck-0.091
burn-0.085
ivered-0.084
 Wolf-0.081
 Cow-0.081
Token The
Token ID383
Feature activation+1.10e+00

Pos logit contributions
uck+0.000
 Wolf+0.000
 Cow+0.000
burn+0.000
ivered+0.000
Neg logit contributions
 dice-0.000
 contain-0.000
 else-0.000
ents-0.000
inations-0.000
Token tree
Token ID5509
Feature activation-2.16e+00

Pos logit contributions
uck+0.097
burn+0.090
ivered+0.089
oo+0.085
 Cow+0.084
Neg logit contributions
 dice-0.093
 contain-0.086
 else-0.083
ents-0.082
 clearly-0.075
Token stopped
Token ID5025
Feature activation-1.35e+00

Pos logit contributions
 dice+0.196
 contain+0.184
ents+0.173
 else+0.171
 clearly+0.157
Neg logit contributions
uck-0.172
ivered-0.156
burn-0.155
 Wolf-0.151
 Cow-0.150
Token sh
Token ID427
Feature activation+3.85e+00

Pos logit contributions
ents+0.099
 contain+0.097
 dice+0.096
 clearly+0.095
 else+0.095
Neg logit contributions
uck-0.120
ivered-0.111
burn-0.106
P-0.105
 Ow-0.102
Tokenivering
Token ID35598
Feature activation+1.13e+01

Pos logit contributions
 Wolf+0.141
 Ow+0.133
 Cow+0.131
burn+0.130
 wolf+0.128
Neg logit contributions
ents-0.444
ows-0.432
 contain-0.431
 clearly-0.430
 dice-0.422
Token and
Token ID290
Feature activation-6.65e-02

Pos logit contributions
uck+1.109
ivered+1.044
burn+1.028
P+1.027
 Cow+1.006
Neg logit contributions
 contain-0.746
 dice-0.743
ents-0.686
 clearly-0.669
 else-0.668
Token was
Token ID373
Feature activation+2.76e-01

Pos logit contributions
 dice+0.005
 contain+0.005
 else+0.005
ents+0.005
 clearly+0.004
Neg logit contributions
uck-0.006
ivered-0.006
burn-0.006
oo-0.005
P-0.005
Token happy
Token ID3772
Feature activation-1.47e+00

Pos logit contributions
uck+0.025
burn+0.023
ivered+0.022
 Wolf+0.022
 Ow+0.022
Neg logit contributions
 contain-0.020
 dice-0.020
 else-0.020
ents-0.020
 clearly-0.018
Token.
Token ID13
Feature activation-2.44e-02

Pos logit contributions
 dice+0.126
 contain+0.118
ents+0.112
 else+0.110
 clearly+0.101
Neg logit contributions
uck-0.125
burn-0.114
ivered-0.113
 Wolf-0.111
 Cow-0.110
Token The
Token ID383
Feature activation+1.07e+00

Pos logit contributions
 dice+0.003
 contain+0.002
 else+0.002
ents+0.002
 clearly+0.002
Neg logit contributions
uck-0.002
 Wolf-0.001
 Cow-0.001
burn-0.001
ivered-0.001
Token away
Token ID1497
Feature activation+2.22e+00

Pos logit contributions
 dice+0.049
ents+0.046
 contain+0.046
 else+0.043
ooters+0.042
Neg logit contributions
uck-0.031
burn-0.026
ivered-0.026
 Wolf-0.025
oo-0.025
Token.
Token ID13
Feature activation-3.46e-01

Pos logit contributions
uck+0.124
 Wolf+0.120
 wolf+0.113
 Cow+0.107
 self+0.107
Neg logit contributions
 dice-0.238
 contain-0.226
ooters-0.213
ents-0.212
inations-0.206
Token He
Token ID679
Feature activation+5.78e-01

Pos logit contributions
 dice+0.035
 contain+0.034
ents+0.032
 else+0.032
 clearly+0.030
Neg logit contributions
uck-0.024
burn-0.021
ivered-0.021
 Wolf-0.020
 Cow-0.020
Token rolled
Token ID11686
Feature activation+9.69e-01

Pos logit contributions
uck+0.036
eled+0.032
 Cow+0.031
ivered+0.031
P+0.031
Neg logit contributions
 dice-0.058
 contain-0.057
ents-0.052
ooters-0.051
ows-0.051
Token the
Token ID262
Feature activation-6.14e-01

Pos logit contributions
uck+0.050
 Wolf+0.036
 Ow+0.035
 wolf+0.034
eled+0.033
Neg logit contributions
 contain-0.109
inations-0.105
ents-0.105
 else-0.104
 dice-0.102
Token dice
Token ID17963
Feature activation-1.31e+01

Pos logit contributions
 contain+0.052
ents+0.050
 dice+0.047
ooters+0.046
 clearly+0.045
Neg logit contributions
uck-0.049
P-0.044
 Ow-0.042
ivered-0.041
 wolf-0.041
Token and
Token ID290
Feature activation-2.96e-01

Pos logit contributions
 dice+1.359
 contain+1.314
inations+1.303
ooters+1.300
rig+1.265
Neg logit contributions
uck-0.802
burn-0.545
ivered-0.526
oo-0.498
 self-0.474
Token moved
Token ID3888
Feature activation-2.08e+00

Pos logit contributions
rig+0.030
 contain+0.029
ents+0.028
ows+0.027
 dice+0.027
Neg logit contributions
uck-0.018
burn-0.015
aged-0.014
eled-0.014
 Wolf-0.014
Token his
Token ID465
Feature activation+8.36e-01

Pos logit contributions
 contain+0.229
 dice+0.223
ents+0.221
inations+0.216
ooters+0.215
Neg logit contributions
uck-0.104
 Wolf-0.077
eled-0.076
oo-0.074
 Ow-0.069
Token pieces
Token ID5207
Feature activation-2.30e+00

Pos logit contributions
uck+0.051
 wing+0.049
 lung+0.049
 Ow+0.047
 wolf+0.047
Neg logit contributions
ents-0.077
 contain-0.076
 clearly-0.070
hest-0.069
cases-0.069
Token around
Token ID1088
Feature activation-4.24e+00

Pos logit contributions
 dice+0.236
 equations+0.202
 contain+0.200
 counts+0.198
ents+0.197
Neg logit contributions
uck-0.146
burn-0.118
ivered-0.111
oo-0.108
P-0.105
Token came
Token ID1625
Feature activation+1.70e+00

Pos logit contributions
 contain+0.359
 dice+0.352
ents+0.314
 else+0.306
inations+0.303
Neg logit contributions
uck-0.280
burn-0.242
ivered-0.224
 Wolf-0.223
oo-0.223
Token and
Token ID290
Feature activation-7.14e-02

Pos logit contributions
uck+0.129
burn+0.116
ivered+0.110
 Wolf+0.110
 Cow+0.107
Neg logit contributions
 dice-0.164
 contain-0.159
 else-0.148
ents-0.143
inations-0.135
Token he
Token ID339
Feature activation-1.44e+00

Pos logit contributions
 contain+0.007
ents+0.006
 dice+0.006
hest+0.006
rig+0.006
Neg logit contributions
uck-0.005
burn-0.005
 Mitt-0.004
 Cow-0.004
 Wolf-0.004
Token rolled
Token ID11686
Feature activation+1.00e+00

Pos logit contributions
 contain+0.160
 dice+0.150
ents+0.141
ooters+0.137
hest+0.135
Neg logit contributions
uck-0.082
burn-0.070
ivered-0.068
 Cow-0.066
 fever-0.065
Token the
Token ID262
Feature activation-2.68e-01

Pos logit contributions
uck+0.059
 Wolf+0.049
 Ow+0.048
 wolf+0.046
burn+0.044
Neg logit contributions
 contain-0.107
ents-0.102
inations-0.100
 dice-0.100
 else-0.097
Token dice
Token ID17963
Feature activation-1.31e+01

Pos logit contributions
 contain+0.025
ents+0.023
 clearly+0.023
 else+0.021
 believe+0.020
Neg logit contributions
uck-0.019
P-0.019
burn-0.018
 wolf-0.016
w-0.016
Token.
Token ID13
Feature activation+1.31e-01

Pos logit contributions
inations+1.394
 contain+1.361
rig+1.354
 dice+1.326
ooters+1.291
Neg logit contributions
uck-0.726
burn-0.451
ung-0.419
oo-0.379
 self-0.373
Token He
Token ID679
Feature activation+7.32e-01

Pos logit contributions
uck+0.009
burn+0.008
ivered+0.008
 Wolf+0.008
 Cow+0.008
Neg logit contributions
 dice-0.013
 contain-0.012
ents-0.012
 else-0.012
 clearly-0.011
Token moved
Token ID3888
Feature activation-2.06e+00

Pos logit contributions
uck+0.033
aged+0.027
 self+0.026
P+0.025
 suffered+0.025
Neg logit contributions
 contain-0.083
rig-0.075
ooters-0.073
ents-0.073
hips-0.070
Token his
Token ID465
Feature activation+1.07e+00

Pos logit contributions
 contain+0.246
inations+0.232
ooters+0.222
 dice+0.222
rig+0.220
Neg logit contributions
uck-0.095
 Wolf-0.075
 Ow-0.072
oo-0.069
eled-0.067
Token piece
Token ID3704
Feature activation-1.03e+00

Pos logit contributions
uck+0.052
 wing+0.047
 lung+0.045
 heart+0.043
 stomach+0.043
Neg logit contributions
 contain-0.114
 clearly-0.107
ents-0.107
hest-0.103
ount-0.102
Token Daddy
Token ID26926
Feature activation+9.63e-01

Pos logit contributions
 dice+0.011
 contain+0.011
 else+0.011
ents+0.011
 1+0.010
Neg logit contributions
uck-0.010
ivered-0.009
ivering-0.009
burn-0.008
ivery-0.008
Token took
Token ID1718
Feature activation+1.97e+00

Pos logit contributions
uck+0.074
burn+0.067
ivered+0.066
 Cow+0.064
oo+0.064
Neg logit contributions
 dice-0.089
 contain-0.085
ents-0.080
 else-0.080
 clearly-0.075
Token turns
Token ID4962
Feature activation-3.57e+00

Pos logit contributions
uck+0.158
burn+0.143
ivered+0.142
 Wolf+0.138
 Cow+0.138
Neg logit contributions
 dice-0.178
 contain-0.168
ents-0.160
 else-0.158
 clearly-0.146
Token rolling
Token ID10708
Feature activation+8.84e-01

Pos logit contributions
 contain+0.364
 dice+0.355
ents+0.331
 else+0.327
ooters+0.317
Neg logit contributions
uck-0.229
oo-0.190
burn-0.177
aged-0.176
 suffering-0.166
Token the
Token ID262
Feature activation-5.16e-01

Pos logit contributions
uck+0.049
 Wolf+0.036
 Ow+0.036
burn+0.035
oo+0.035
Neg logit contributions
 contain-0.101
ents-0.094
inations-0.093
 dice-0.091
 else-0.091
Token dice
Token ID17963
Feature activation-1.30e+01

Pos logit contributions
 contain+0.042
ents+0.040
 clearly+0.038
 else+0.036
ows+0.036
Neg logit contributions
uck-0.042
P-0.035
 sausage-0.033
 Ow-0.033
 lung-0.032
Token and
Token ID290
Feature activation-1.41e-01

Pos logit contributions
inations+1.424
 contain+1.411
 dice+1.363
rig+1.355
ooters+1.345
Neg logit contributions
uck-0.673
burn-0.359
over-0.359
oo-0.346
ung-0.329
Token moving
Token ID3867
Feature activation-5.55e-01

Pos logit contributions
 contain+0.014
rig+0.013
hest+0.012
ows+0.012
aly+0.012
Neg logit contributions
uck-0.009
burn-0.008
 Mitt-0.008
 Wolf-0.007
 self-0.007
Token their
Token ID511
Feature activation-8.57e-01

Pos logit contributions
 contain+0.063
ents+0.058
inations+0.057
 dice+0.057
rig+0.056
Neg logit contributions
uck-0.027
 Wolf-0.022
 wolf-0.020
oo-0.020
eled-0.020
Token pieces
Token ID5207
Feature activation-2.26e+00

Pos logit contributions
ents+0.069
 contain+0.067
 clearly+0.064
 else+0.061
 outage+0.060
Neg logit contributions
uck-0.060
 sausage-0.053
 pain-0.053
 fever-0.053
aged-0.052
Token around
Token ID1088
Feature activation-4.16e+00

Pos logit contributions
 dice+0.240
ents+0.211
engu+0.211
 contain+0.209
 counts+0.207
Neg logit contributions
uck-0.124
burn-0.100
ivered-0.095
oo-0.091
 wing-0.087
Token But
Token ID887
Feature activation-7.49e-01

Pos logit contributions
uck+0.067
burn+0.061
ivered+0.060
 Wolf+0.058
 Cow+0.058
Neg logit contributions
 dice-0.081
 contain-0.077
ents-0.073
 else-0.072
 clearly-0.067
Token then
Token ID788
Feature activation-2.61e+00

Pos logit contributions
 dice+0.064
 contain+0.060
ents+0.057
 else+0.056
 clearly+0.051
Neg logit contributions
uck-0.063
burn-0.058
ivered-0.057
 Wolf-0.056
 Cow-0.056
Token Joe
Token ID5689
Feature activation-2.13e+00

Pos logit contributions
 dice+0.224
 contain+0.211
 else+0.205
ents+0.199
 clearly+0.184
Neg logit contributions
uck-0.216
ivered-0.201
burn-0.198
 Wolf-0.187
oo-0.187
Token noticed
Token ID6810
Feature activation+2.12e+00

Pos logit contributions
 dice+0.188
 contain+0.177
ents+0.165
 else+0.163
 clearly+0.149
Neg logit contributions
uck-0.174
ivered-0.158
burn-0.157
 Wolf-0.155
 Cow-0.154
Token something
Token ID1223
Feature activation+2.82e+00

Pos logit contributions
uck+0.149
burn+0.134
ivered+0.133
 Wolf+0.128
 Cow+0.127
Neg logit contributions
 dice-0.211
 contain-0.201
ents-0.192
 else-0.192
 clearly-0.178
Token else
Token ID2073
Feature activation-1.21e+01

Pos logit contributions
uck+0.200
 wolf+0.183
 suffering+0.181
 Wolf+0.178
 Cow+0.173
Neg logit contributions
 dice-0.265
ents-0.238
 contain-0.233
ooters-0.219
 counts-0.214
Token.
Token ID13
Feature activation-7.60e-01

Pos logit contributions
 dice+1.126
 contain+1.028
ents+0.991
 else+0.928
ooters+0.894
Neg logit contributions
uck-0.939
ivered-0.842
burn-0.827
 Wolf-0.814
 Cow-0.813
Token He
Token ID679
Feature activation+4.83e-01

Pos logit contributions
 dice+0.073
 contain+0.069
ents+0.067
 else+0.066
 clearly+0.061
Neg logit contributions
uck-0.055
burn-0.052
ivered-0.051
oo-0.047
 Cow-0.047
Token saw
Token ID2497
Feature activation+8.85e-01

Pos logit contributions
uck+0.036
ivered+0.032
 Wolf+0.032
 Cow+0.032
burn+0.031
Neg logit contributions
 dice-0.046
 contain-0.044
ents-0.040
 else-0.039
ooters-0.038
Token a
Token ID257
Feature activation+2.61e-01

Pos logit contributions
uck+0.055
 Wolf+0.048
 Cow+0.047
ivered+0.046
burn+0.046
Neg logit contributions
 dice-0.094
 contain-0.091
ents-0.086
 else-0.083
ooters-0.080
Token box
Token ID3091
Feature activation-2.04e-01

Pos logit contributions
uck+0.021
burn+0.019
ivered+0.019
 Wolf+0.019
 Cow+0.019
Neg logit contributions
 dice-0.023
 contain-0.022
ents-0.021
 else-0.020
 clearly-0.019
Token very
Token ID845
Feature activation-4.06e+00

Pos logit contributions
 contain+1.017
 dice+0.993
 else+0.904
inations+0.901
 believe+0.869
Neg logit contributions
uck-0.657
burn-0.528
 Cow-0.519
 Wolf-0.517
oo-0.515
Token slowly
Token ID6364
Feature activation-7.30e-01

Pos logit contributions
 contain+0.361
 dice+0.351
ophone+0.331
 outage+0.327
inations+0.326
Neg logit contributions
uck-0.303
aged-0.250
oo-0.227
 self-0.226
um-0.222
Token.
Token ID13
Feature activation-3.45e-01

Pos logit contributions
 dice+0.071
 contain+0.070
 else+0.066
inations+0.062
ents+0.062
Neg logit contributions
uck-0.050
burn-0.040
ivered-0.039
aged-0.039
oo-0.038
Token He
Token ID679
Feature activation+4.48e-01

Pos logit contributions
 dice+0.036
 contain+0.036
 else+0.034
ents+0.033
 clearly+0.031
Neg logit contributions
uck-0.022
 Cow-0.019
 Wolf-0.019
 Ow-0.018
burn-0.018
Token counted
Token ID14789
Feature activation-9.85e+00

Pos logit contributions
uck+0.031
eled+0.027
aged+0.026
 self+0.026
M+0.025
Neg logit contributions
 contain-0.041
ooters-0.038
 dice-0.038
 outage-0.037
rig-0.035
Token 1
Token ID352
Feature activation-1.19e+01

Pos logit contributions
 contain+1.025
 dice+1.000
 else+0.910
inations+0.906
rig+0.883
Neg logit contributions
uck-0.652
oo-0.511
burn-0.507
 Cow-0.506
 Wolf-0.501
Token,
Token ID11
Feature activation+2.96e-01

Pos logit contributions
 contain+1.346
 dice+1.243
inations+1.189
 outage+1.187
rig+1.186
Neg logit contributions
uck-0.721
oo-0.527
aged-0.461
over-0.458
 Wolf-0.440
Token2
Token ID17
Feature activation-7.71e+00

Pos logit contributions
uck+0.027
 Cow+0.026
P+0.026
 Wolf+0.025
 wolf+0.024
Neg logit contributions
 contain-0.020
 dice-0.018
 else-0.018
inations-0.017
 outage-0.017
Token,
Token ID11
Feature activation+2.92e-01

Pos logit contributions
 contain+0.869
 dice+0.833
 else+0.765
rig+0.750
ooters+0.747
Neg logit contributions
uck-0.442
oo-0.369
ivered-0.328
 Cow-0.309
aged-0.307
Token3
Token ID18
Feature activation-5.72e+00

Pos logit contributions
uck+0.024
P+0.022
 Cow+0.022
 Wolf+0.021
M+0.020
Neg logit contributions
 contain-0.023
 dice-0.021
 else-0.021
 outage-0.020
inations-0.020
Token...
Token ID986
Feature activation-6.04e+00

Pos logit contributions
 contain+0.634
 dice+0.616
 else+0.572
ooters+0.561
rig+0.545
Neg logit contributions
uck-0.329
oo-0.271
ivered-0.248
over-0.230
burn-0.229
Token lonely
Token ID21757
Feature activation+1.98e+00

Pos logit contributions
burn+0.566
uck+0.566
ivered+0.527
 Wolf+0.515
 Ow+0.513
Neg logit contributions
 else-0.489
 contain-0.480
 dice-0.454
 clearly-0.442
ents-0.419
Token because
Token ID780
Feature activation+1.83e-01

Pos logit contributions
uck+0.173
burn+0.158
ivered+0.158
 Cow+0.153
 Wolf+0.153
Neg logit contributions
 dice-0.162
 contain-0.155
 else-0.146
ents-0.145
 clearly-0.133
Token there
Token ID612
Feature activation+2.99e+00

Pos logit contributions
uck+0.015
burn+0.014
 Wolf+0.014
 Cow+0.014
ivered+0.014
Neg logit contributions
 dice-0.016
 contain-0.015
ents-0.014
 else-0.013
 clearly-0.012
Token was
Token ID373
Feature activation+1.13e-01

Pos logit contributions
uck+0.222
ivered+0.198
burn+0.198
 Wolf+0.195
 Cow+0.193
Neg logit contributions
 dice-0.286
 contain-0.269
ents-0.259
 else-0.249
 clearly-0.232
Token nobody
Token ID8168
Feature activation-4.65e-01

Pos logit contributions
uck+0.008
ivered+0.007
burn+0.007
 Cow+0.007
 Wolf+0.007
Neg logit contributions
 dice-0.011
 contain-0.010
 else-0.010
ents-0.010
 clearly-0.009
Token else
Token ID2073
Feature activation-1.19e+01

Pos logit contributions
 dice+0.046
 contain+0.039
ents+0.038
 levers+0.035
ooters+0.034
Neg logit contributions
uck-0.034
burn-0.032
 Wolf-0.031
 suffering-0.031
ivered-0.031
Token around
Token ID1088
Feature activation-4.25e+00

Pos logit contributions
 dice+1.080
 contain+0.998
ents+0.949
 else+0.893
 clearly+0.844
Neg logit contributions
uck-0.949
burn-0.853
ivered-0.847
 Wolf-0.836
 Cow-0.831
Token.
Token ID13
Feature activation-4.53e-01

Pos logit contributions
 dice+0.353
 contain+0.342
 else+0.323
ents+0.315
 clearly+0.300
Neg logit contributions
uck-0.363
burn-0.331
ivered-0.330
 Cow-0.316
 Wolf-0.314
Token Suddenly
Token ID24975
Feature activation+1.49e+00

Pos logit contributions
 dice+0.046
 contain+0.044
ents+0.042
 else+0.042
 clearly+0.039
Neg logit contributions
uck-0.030
 Wolf-0.027
 Cow-0.026
ivered-0.026
burn-0.026
Token,
Token ID11
Feature activation+2.60e-01

Pos logit contributions
uck+0.112
burn+0.102
ivered+0.101
 Cow+0.096
 Wolf+0.096
Neg logit contributions
 dice-0.141
 contain-0.133
 else-0.127
ents-0.127
 clearly-0.117
Token the
Token ID262
Feature activation-4.35e-01

Pos logit contributions
uck+0.020
burn+0.018
ivered+0.018
 Wolf+0.018
 Cow+0.018
Neg logit contributions
 dice-0.024
 contain-0.023
ents-0.022
 else-0.021
 clearly-0.020
Token He
Token ID679
Feature activation+1.36e+00

Pos logit contributions
 dice+0.281
 contain+0.262
ents+0.246
 else+0.236
ooters+0.225
Neg logit contributions
uck-0.203
burn-0.189
 Cow-0.184
 Wolf-0.183
ivered-0.179
Token wants
Token ID3382
Feature activation-8.57e-01

Pos logit contributions
uck+0.100
ivered+0.088
 Wolf+0.087
 Cow+0.087
 fever+0.085
Neg logit contributions
 dice-0.126
 contain-0.122
ents-0.115
 else-0.109
ooters-0.105
Token to
Token ID284
Feature activation-3.06e-01

Pos logit contributions
 dice+0.082
 contain+0.079
ents+0.074
 else+0.070
ows+0.068
Neg logit contributions
uck-0.062
burn-0.056
 Cow-0.056
 Wolf-0.056
 wolf-0.053
Token know
Token ID760
Feature activation-3.02e+00

Pos logit contributions
 dice+0.025
ents+0.023
 contain+0.023
 else+0.022
 clearly+0.020
Neg logit contributions
uck-0.026
burn-0.024
 Cow-0.024
oo-0.023
 Wolf-0.023
Token what
Token ID644
Feature activation-9.70e-02

Pos logit contributions
 dice+0.301
 contain+0.298
rig+0.261
ents+0.260
hest+0.247
Neg logit contributions
 Cow-0.189
 Wolf-0.183
 wolf-0.177
uck-0.177
oo-0.167
Token else
Token ID2073
Feature activation-1.19e+01

Pos logit contributions
 contain+0.010
 dice+0.010
ents+0.010
ows+0.008
rig+0.008
Neg logit contributions
uck-0.006
 Wolf-0.006
 Cow-0.006
 wolf-0.006
 lung-0.005
Token he
Token ID339
Feature activation-1.60e+00

Pos logit contributions
 dice+1.127
 contain+1.072
ents+1.022
ooters+0.901
uits+0.897
Neg logit contributions
uck-0.892
 Wolf-0.776
 Cow-0.768
 wolf-0.746
burn-0.738
Token can
Token ID460
Feature activation-2.35e+00

Pos logit contributions
 dice+0.149
 contain+0.144
ents+0.134
ooters+0.123
 else+0.120
Neg logit contributions
uck-0.122
ivered-0.105
 Wolf-0.102
 Cow-0.102
 Ow-0.100
Token mark
Token ID1317
Feature activation-1.08e-01

Pos logit contributions
 dice+0.205
 contain+0.182
ents+0.180
 else+0.172
ooters+0.158
Neg logit contributions
uck-0.201
ivered-0.177
oo-0.174
burn-0.174
 Cow-0.171
Token with
Token ID351
Feature activation+6.05e-01

Pos logit contributions
 contain+0.011
 dice+0.011
ents+0.010
 else+0.010
ooters+0.010
Neg logit contributions
uck-0.007
burn-0.006
 Wolf-0.006
 Cow-0.006
 wolf-0.006
Token.
Token ID13
Feature activation+1.94e-01

Pos logit contributions
uck+0.040
 Wolf+0.034
 Cow+0.034
 Ow+0.034
burn+0.034
Neg logit contributions
 dice-0.060
 contain-0.060
ents-0.056
 else-0.053
ooters-0.052
TokenBut
Token ID1537
Feature activation+1.01e+00

Pos logit contributions
 dice+0.044
 contain+0.043
ents+0.041
 else+0.040
 clearly+0.037
Neg logit contributions
uck-0.042
burn-0.039
 Wolf-0.037
 Cow-0.037
ivered-0.037
Token then
Token ID788
Feature activation-2.72e+00

Pos logit contributions
uck+0.084
burn+0.076
 Cow+0.076
 Wolf+0.075
ivered+0.074
Neg logit contributions
 dice-0.087
 contain-0.082
ents-0.077
 else-0.074
ooters-0.070
Token he
Token ID339
Feature activation-1.64e+00

Pos logit contributions
 dice+0.237
 contain+0.223
 else+0.215
ents+0.211
 clearly+0.194
Neg logit contributions
uck-0.222
ivered-0.204
burn-0.203
 Wolf-0.193
 Cow-0.192
Token sees
Token ID7224
Feature activation-1.93e+00

Pos logit contributions
 dice+0.161
 contain+0.153
ents+0.143
 else+0.138
ooters+0.134
Neg logit contributions
uck-0.117
ivered-0.104
burn-0.101
 Wolf-0.100
 Cow-0.100
Token something
Token ID1223
Feature activation+3.09e+00

Pos logit contributions
 dice+0.203
 contain+0.196
ents+0.186
 else+0.181
 clearly+0.170
Neg logit contributions
uck-0.123
burn-0.107
ivered-0.107
 Wolf-0.106
 Cow-0.105
Token else
Token ID2073
Feature activation-1.19e+01

Pos logit contributions
uck+0.214
 wolf+0.205
 Wolf+0.194
 suffering+0.190
 sausage+0.185
Neg logit contributions
 dice-0.288
 contain-0.272
ents-0.272
ooters-0.242
 counts-0.235
Token in
Token ID287
Feature activation-4.34e-01

Pos logit contributions
 dice+1.142
 contain+1.065
ents+1.015
 else+0.949
ooters+0.928
Neg logit contributions
uck-0.883
ivered-0.769
burn-0.764
 Wolf-0.746
 Cow-0.736
Token the
Token ID262
Feature activation-1.16e-01

Pos logit contributions
 dice+0.037
 contain+0.035
ents+0.033
 else+0.033
 clearly+0.030
Neg logit contributions
uck-0.037
ivered-0.034
burn-0.034
 Cow-0.032
oo-0.032
Token garden
Token ID11376
Feature activation+8.77e-01

Pos logit contributions
 dice+0.011
 contain+0.010
ents+0.010
 else+0.009
 clearly+0.009
Neg logit contributions
uck-0.009
burn-0.008
ivered-0.008
 Wolf-0.008
 Cow-0.008
Token.
Token ID13
Feature activation+6.31e-02

Pos logit contributions
uck+0.071
burn+0.064
ivered+0.064
 Wolf+0.062
 Cow+0.062
Neg logit contributions
 dice-0.078
 contain-0.073
ents-0.069
 else-0.068
 clearly-0.063
Token He
Token ID679
Feature activation+6.48e-01

Pos logit contributions
uck+0.005
burn+0.004
ivered+0.004
oo+0.004
 Cow+0.004
Neg logit contributions
 dice-0.006
 contain-0.006
ents-0.006
 else-0.005
 clearly-0.005
Token was
Token ID373
Feature activation+8.00e-01

Pos logit contributions
uck+0.172
ivered+0.154
burn+0.152
oo+0.149
 Wolf+0.148
Neg logit contributions
 dice-0.219
 contain-0.207
ents-0.197
 else-0.197
 clearly-0.179
Token so
Token ID523
Feature activation-1.33e+00

Pos logit contributions
uck+0.063
burn+0.057
ivered+0.057
 Wolf+0.055
 Cow+0.055
Neg logit contributions
 dice-0.072
 contain-0.069
ents-0.065
 else-0.064
 clearly-0.059
Token special
Token ID2041
Feature activation-3.85e+00

Pos logit contributions
 dice+0.115
 contain+0.107
ents+0.103
 else+0.098
 clearly+0.090
Neg logit contributions
uck-0.113
ivered-0.102
burn-0.100
 Cow-0.098
 Wolf-0.098
Token that
Token ID326
Feature activation+2.17e+00

Pos logit contributions
 dice+0.390
 contain+0.361
ents+0.352
 else+0.330
ooters+0.318
Neg logit contributions
uck-0.275
 Wolf-0.237
 Cow-0.232
burn-0.230
oo-0.229
Token everybody
Token ID7288
Feature activation-3.91e-01

Pos logit contributions
uck+0.172
 Wolf+0.155
 Cow+0.155
burn+0.154
ivered+0.149
Neg logit contributions
 dice-0.195
 contain-0.181
ents-0.177
 else-0.163
ooters-0.158
Token else
Token ID2073
Feature activation-1.18e+01

Pos logit contributions
 dice+0.038
ents+0.033
 contain+0.033
ooters+0.031
 levers+0.029
Neg logit contributions
uck-0.029
ivered-0.026
 Wolf-0.025
 Cow-0.025
burn-0.025
Token wanted
Token ID2227
Feature activation+9.34e-01

Pos logit contributions
 dice+1.033
 contain+0.957
ents+0.915
 else+0.866
ooters+0.812
Neg logit contributions
uck-0.987
ivered-0.885
burn-0.884
 Wolf-0.870
 Cow-0.866
Token to
Token ID284
Feature activation-1.74e-01

Pos logit contributions
uck+0.067
 Wolf+0.061
 Cow+0.060
burn+0.058
oo+0.056
Neg logit contributions
 dice-0.091
 contain-0.085
ents-0.083
 else-0.076
ooters-0.074
Token join
Token ID4654
Feature activation-4.51e+00

Pos logit contributions
 dice+0.014
 contain+0.012
ents+0.012
 else+0.012
 clearly+0.011
Neg logit contributions
uck-0.016
burn-0.014
ivered-0.014
 Cow-0.014
 Wolf-0.014
Token in
Token ID287
Feature activation-5.44e-01

Pos logit contributions
 dice+0.408
 contain+0.389
 else+0.367
ents+0.365
 clearly+0.337
Neg logit contributions
uck-0.357
burn-0.327
ivered-0.324
 Wolf-0.312
 Cow-0.312
Token the
Token ID262
Feature activation-1.77e-01

Pos logit contributions
 dice+0.051
 contain+0.048
ents+0.047
 else+0.045
ooters+0.042
Neg logit contributions
uck-0.041
burn-0.036
 Wolf-0.036
 Cow-0.036
oo-0.035
Token™
Token ID8151
Feature activation+8.41e-01

Pos logit contributions
 dice+0.263
 contain+0.248
ents+0.238
 else+0.236
 clearly+0.216
Neg logit contributions
uck-0.207
burn-0.188
ivered-0.187
 Wolf-0.180
 Cow-0.180
Tokens
Token ID82
Feature activation-1.86e+00

Pos logit contributions
uck+0.072
ivered+0.063
burn+0.063
oo+0.062
 Cow+0.059
Neg logit contributions
 dice-0.074
 contain-0.070
ents-0.064
 else-0.063
 clearly-0.060
Token go
Token ID467
Feature activation-4.82e-01

Pos logit contributions
 dice+0.154
 contain+0.146
ents+0.137
 else+0.137
 clearly+0.125
Neg logit contributions
uck-0.160
burn-0.147
ivered-0.146
 Wolf-0.142
 Cow-0.142
Token find
Token ID1064
Feature activation-2.77e+00

Pos logit contributions
 dice+0.043
 contain+0.041
ents+0.040
 else+0.039
 clearly+0.036
Neg logit contributions
uck-0.039
burn-0.035
oo-0.034
 Cow-0.034
 Wolf-0.034
Token something
Token ID1223
Feature activation+2.90e+00

Pos logit contributions
 dice+0.284
 contain+0.269
ents+0.258
 else+0.258
 clearly+0.239
Neg logit contributions
uck-0.185
ivered-0.166
burn-0.166
 Wolf-0.157
 Cow-0.157
Token else
Token ID2073
Feature activation-1.18e+01

Pos logit contributions
uck+0.215
 Cow+0.194
 wolf+0.194
 sausage+0.191
 Wolf+0.191
Neg logit contributions
 dice-0.260
ents-0.246
 contain-0.233
cases-0.201
 Isabel-0.198
Token.
Token ID13
Feature activation-5.29e-01

Pos logit contributions
 dice+1.112
 contain+1.038
ents+0.999
 else+0.890
 clearly+0.886
Neg logit contributions
uck-0.914
 Cow-0.797
burn-0.786
 Wolf-0.784
ivered-0.774
Token Put
Token ID5930
Feature activation+1.56e+00

Pos logit contributions
 dice+0.052
 contain+0.050
 else+0.047
ents+0.047
 clearly+0.044
Neg logit contributions
uck-0.037
 Wolf-0.033
burn-0.033
 Cow-0.033
ivered-0.032
Token the
Token ID262
Feature activation-5.67e-01

Pos logit contributions
ivered+0.135
ivering+0.122
uck+0.121
ivery+0.121
burn+0.119
Neg logit contributions
 dice-0.135
 contain-0.113
 else-0.112
ents-0.100
 wins-0.100
Token paper
Token ID3348
Feature activation-4.41e+00

Pos logit contributions
 dice+0.054
 contain+0.050
 else+0.048
ents+0.047
 clearly+0.043
Neg logit contributions
uck-0.043
ivered-0.039
burn-0.039
 Cow-0.037
 Wolf-0.037
Token and
Token ID290
Feature activation-3.95e-01

Pos logit contributions
 dice+0.427
 contain+0.412
ents+0.391
 else+0.384
 clearly+0.368
Neg logit contributions
uck-0.318
burn-0.287
 Cow-0.267
ivered-0.267
 Wolf-0.263
Token but
Token ID475
Feature activation-1.36e+00

Pos logit contributions
 dice+0.159
 contain+0.147
ents+0.131
 else+0.128
ooters+0.127
Neg logit contributions
uck-0.151
 Cow-0.132
ivered-0.131
oo-0.130
 Wolf-0.129
Token there
Token ID612
Feature activation+3.21e+00

Pos logit contributions
 dice+0.115
 contain+0.108
ents+0.102
 else+0.101
 clearly+0.092
Neg logit contributions
uck-0.116
burn-0.106
ivered-0.105
 Wolf-0.103
 Cow-0.103
Token was
Token ID373
Feature activation+4.72e-01

Pos logit contributions
uck+0.238
ivered+0.210
burn+0.208
 Wolf+0.204
 Cow+0.204
Neg logit contributions
 dice-0.310
 contain-0.294
ents-0.281
 else-0.270
 clearly-0.254
Token no
Token ID645
Feature activation+2.90e+00

Pos logit contributions
uck+0.032
burn+0.029
ivered+0.029
 Wolf+0.028
 Cow+0.028
Neg logit contributions
 dice-0.048
 contain-0.045
ents-0.043
 else-0.043
 clearly-0.040
Token one
Token ID530
Feature activation+1.99e-01

Pos logit contributions
uck+0.225
burn+0.206
ivered+0.201
 Wolf+0.199
 Cow+0.198
Neg logit contributions
 dice-0.267
 contain-0.255
ents-0.241
 else-0.237
 clearly-0.219
Token else
Token ID2073
Feature activation-1.18e+01

Pos logit contributions
uck+0.015
 wolf+0.015
 Wolf+0.015
 suffering+0.014
 Cow+0.014
Neg logit contributions
 dice-0.018
 contain-0.016
ents-0.015
ooters-0.014
inations-0.014
Token in
Token ID287
Feature activation-3.65e-01

Pos logit contributions
 dice+1.076
 contain+0.993
ents+0.926
 else+0.839
ooters+0.838
Neg logit contributions
uck-0.940
 Wolf-0.836
ivered-0.828
 Cow-0.825
burn-0.823
Token the
Token ID262
Feature activation-3.48e-02

Pos logit contributions
 dice+0.032
 contain+0.030
ents+0.029
 else+0.029
 clearly+0.026
Neg logit contributions
uck-0.030
ivered-0.027
burn-0.027
 Cow-0.026
 Wolf-0.026
Token park
Token ID3952
Feature activation-1.20e+00

Pos logit contributions
 dice+0.003
 contain+0.003
ents+0.003
 else+0.003
 clearly+0.003
Neg logit contributions
uck-0.003
burn-0.002
ivered-0.002
 Wolf-0.002
 Cow-0.002
Token.
Token ID13
Feature activation-2.93e-01

Pos logit contributions
 dice+0.115
 contain+0.106
ents+0.102
 else+0.098
ooters+0.091
Neg logit contributions
uck-0.090
burn-0.081
ivered-0.079
 Cow-0.079
 Wolf-0.079
Token\n
Token ID198
Feature activation-3.44e-01

Pos logit contributions
 dice+0.029
 contain+0.028
ents+0.026
 else+0.026
 clearly+0.025
Neg logit contributions
uck-0.020
burn-0.018
ivered-0.018
 Wolf-0.018
 Cow-0.018
Token the
Token ID262
Feature activation-4.19e-01

Pos logit contributions
 contain+0.557
 dice+0.514
rig+0.493
inations+0.485
ents+0.481
Neg logit contributions
uck-0.223
 Wolf-0.190
 Cow-0.185
burn-0.180
 Ow-0.179
Token cars
Token ID5006
Feature activation-6.09e+00

Pos logit contributions
 contain+0.040
ents+0.037
 dice+0.037
 else+0.036
 clearly+0.034
Neg logit contributions
uck-0.030
 Ow-0.027
 Wolf-0.027
 wolf-0.027
burn-0.027
Token in
Token ID287
Feature activation-5.90e-01

Pos logit contributions
 dice+0.621
 contain+0.606
inations+0.579
rig+0.574
ents+0.566
Neg logit contributions
uck-0.345
oo-0.270
burn-0.269
ivered-0.213
 Wolf-0.212
Token order
Token ID1502
Feature activation-3.10e-01

Pos logit contributions
 contain+0.057
 dice+0.055
ents+0.053
 else+0.051
ows+0.049
Neg logit contributions
uck-0.039
 Wolf-0.037
burn-0.036
 Cow-0.035
 Ow-0.034
Token from
Token ID422
Feature activation-2.93e+00

Pos logit contributions
 contain+0.031
 dice+0.030
ents+0.027
inations+0.027
rig+0.027
Neg logit contributions
uck-0.020
oo-0.017
 Wolf-0.016
burn-0.015
 Cow-0.015
Token 1
Token ID352
Feature activation-1.18e+01

Pos logit contributions
 contain+0.287
 dice+0.287
ents+0.269
 else+0.258
ows+0.243
Neg logit contributions
uck-0.196
 Wolf-0.182
burn-0.179
 Cow-0.179
oo-0.172
Token to
Token ID284
Feature activation-2.75e-01

Pos logit contributions
 contain+1.138
 dice+1.037
 else+1.026
 clearly+1.024
inations+0.960
Neg logit contributions
uck-0.847
oo-0.745
burn-0.701
over-0.696
 Wolf-0.667
Token 10
Token ID838
Feature activation-7.81e+00

Pos logit contributions
 contain+0.022
ows+0.021
ents+0.021
 dice+0.020
cases+0.020
Neg logit contributions
 Wolf-0.019
 Cow-0.018
uck-0.018
 wolf-0.018
 Squirrel-0.018
Token.
Token ID13
Feature activation-6.73e-02

Pos logit contributions
 contain+0.825
 believe+0.729
 dice+0.687
rig+0.686
aly+0.670
Neg logit contributions
uck-0.425
P-0.390
over-0.381
oo-0.360
 Wolf-0.340
Token Tim
Token ID5045
Feature activation-5.08e-01

Pos logit contributions
 dice+0.007
 contain+0.007
 else+0.006
ents+0.006
 clearly+0.006
Neg logit contributions
uck-0.005
burn-0.004
ivered-0.004
 Wolf-0.004
 Cow-0.004
Tokenmy
Token ID1820
Feature activation+6.70e-01

Pos logit contributions
 dice+0.045
ents+0.041
 contain+0.041
 else+0.040
 clearly+0.037
Neg logit contributions
uck-0.041
burn-0.037
 Wolf-0.036
ivered-0.036
 Cow-0.036
Token laser
Token ID12855
Feature activation-2.80e+00

Pos logit contributions
 dice+0.001
 contain+0.000
ents+0.000
 else+0.000
 clearly+0.000
Neg logit contributions
uck-0.000
burn-0.000
ivered-0.000
 Wolf-0.000
 Cow-0.000
Token and
Token ID290
Feature activation-2.43e-01

Pos logit contributions
 dice+0.259
 contain+0.246
ents+0.233
 else+0.224
ooters+0.209
Neg logit contributions
uck-0.214
burn-0.196
 Wolf-0.189
ivered-0.188
 Cow-0.186
Token play
Token ID711
Feature activation+1.66e-01

Pos logit contributions
 dice+0.023
 contain+0.022
ents+0.021
 else+0.020
ooters+0.018
Neg logit contributions
uck-0.018
burn-0.016
 Cow-0.016
 Wolf-0.016
ivered-0.016
Token with
Token ID351
Feature activation+4.23e-01

Pos logit contributions
uck+0.013
ivered+0.011
burn+0.011
 Wolf+0.011
oo+0.011
Neg logit contributions
 dice-0.015
 contain-0.015
ents-0.014
 else-0.013
ooters-0.013
Token something
Token ID1223
Feature activation+2.95e+00

Pos logit contributions
uck+0.026
 Wolf+0.023
 Cow+0.023
burn+0.023
oo+0.022
Neg logit contributions
 contain-0.044
 dice-0.044
ents-0.041
 else-0.039
ooters-0.038
Token else
Token ID2073
Feature activation-1.18e+01

Pos logit contributions
uck+0.264
 wolf+0.257
 suffering+0.250
 sausage+0.244
 Wolf+0.243
Neg logit contributions
 dice-0.212
ents-0.189
 contain-0.188
 levers-0.151
ooters-0.150
Token.
Token ID13
Feature activation-5.44e-01

Pos logit contributions
 dice+1.147
 contain+1.081
ents+1.028
 else+0.921
ooters+0.915
Neg logit contributions
uck-0.861
burn-0.749
 Wolf-0.727
 Cow-0.727
ivered-0.721
Token<|endoftext|>
Token ID50256
Feature activation+1.19e+00

Pos logit contributions
 dice+0.056
 contain+0.054
 else+0.051
ents+0.051
 clearly+0.047
Neg logit contributions
uck-0.036
 Cow-0.032
 Wolf-0.032
burn-0.031
ivered-0.031
TokenOne
Token ID3198
Feature activation-1.84e+00

Pos logit contributions
uck+0.089
burn+0.084
ivered+0.083
 Cow+0.078
 Wolf+0.077
Neg logit contributions
 dice-0.112
 contain-0.103
 else-0.101
ents-0.099
 clearly-0.092
Token day
Token ID1110
Feature activation+6.67e-01

Pos logit contributions
 dice+0.199
 contain+0.188
ents+0.183
 else+0.178
 clearly+0.168
Neg logit contributions
uck-0.116
burn-0.102
ivered-0.101
 Wolf-0.097
 Cow-0.097
Token,
Token ID11
Feature activation+4.34e-01

Pos logit contributions
burn+0.062
uck+0.059
 Wolf+0.058
ivered+0.058
 Cow+0.058
Neg logit contributions
 dice-0.048
 else-0.045
 contain-0.043
ents-0.038
 count-0.036
Token,
Token ID11
Feature activation+4.31e-01

Pos logit contributions
 dice+0.310
 contain+0.299
ents+0.279
 else+0.277
 clearly+0.257
Neg logit contributions
uck-0.266
burn-0.239
ivered-0.238
oo-0.234
 Wolf-0.233
Token 3
Token ID513
Feature activation-4.07e-02

Pos logit contributions
uck+0.038
 Cow+0.036
burn+0.036
 Wolf+0.035
oo+0.035
Neg logit contributions
 contain-0.032
 dice-0.032
 else-0.030
ents-0.027
 clearly-0.026
Token,
Token ID11
Feature activation+4.13e-01

Pos logit contributions
 dice+0.004
 contain+0.004
ents+0.004
 else+0.003
 clearly+0.003
Neg logit contributions
uck-0.003
ivered-0.003
oo-0.003
burn-0.003
 Wolf-0.003
Token 2
Token ID362
Feature activation-8.48e+00

Pos logit contributions
uck+0.034
burn+0.031
 Cow+0.031
oo+0.030
 Wolf+0.030
Neg logit contributions
 dice-0.034
 contain-0.034
 else-0.032
ents-0.030
 clearly-0.028
Token,
Token ID11
Feature activation+3.87e-01

Pos logit contributions
 dice+0.806
 contain+0.791
ents+0.717
 else+0.716
 clearly+0.676
Neg logit contributions
uck-0.623
oo-0.549
burn-0.548
ivered-0.541
 Wolf-0.540
Token 1
Token ID352
Feature activation-1.18e+01

Pos logit contributions
uck+0.034
 Cow+0.032
burn+0.031
 Wolf+0.031
oo+0.031
Neg logit contributions
 dice-0.030
 contain-0.030
 else-0.027
ents-0.026
 clearly-0.024
Token".
Token ID1911
Feature activation+2.18e+00

Pos logit contributions
 dice+1.134
 contain+1.126
 else+1.051
ents+1.027
 clearly+0.981
Neg logit contributions
uck-0.859
oo-0.758
burn-0.751
 Wolf-0.729
 Cow-0.726
Token They
Token ID1119
Feature activation+1.40e+00

Pos logit contributions
uck+0.172
 Cow+0.148
 Wolf+0.148
ivered+0.146
oo+0.146
Neg logit contributions
 dice-0.202
 contain-0.192
ents-0.183
 else-0.180
 clearly-0.166
Token hear
Token ID3285
Feature activation+1.58e+00

Pos logit contributions
uck+0.098
burn+0.090
 Cow+0.087
 Ow+0.087
oo+0.087
Neg logit contributions
 dice-0.114
 else-0.107
ents-0.106
ooters-0.104
inations-0.100
Token a
Token ID257
Feature activation+3.73e-01

Pos logit contributions
uck+0.088
 Wolf+0.080
 Cow+0.078
oo+0.076
 Ow+0.074
Neg logit contributions
 dice-0.172
 contain-0.169
ents-0.155
ooters-0.153
 else-0.151
Token big
Token ID1263
Feature activation-2.37e-01

Pos logit contributions
uck+0.024
 Wolf+0.022
 wolf+0.022
 Cow+0.022
 fever+0.021
Neg logit contributions
 dice-0.037
ents-0.035
 contain-0.034
 wins-0.032
 counts-0.031
Token �
Token ID6184
Feature activation-3.75e-01

Pos logit contributions
uck+0.030
 Wolf+0.026
ivered+0.026
 wolf+0.025
 fever+0.025
Neg logit contributions
 dice-0.035
 contain-0.032
ents-0.030
ooters-0.029
 else-0.028
Token�
Token ID95
Feature activation-9.47e-01

Pos logit contributions
 dice+0.032
 contain+0.030
 else+0.029
ents+0.028
 clearly+0.026
Neg logit contributions
uck-0.032
oo-0.029
 Wolf-0.028
 Cow-0.028
ivered-0.028
Token€
Token ID26391
Feature activation-2.67e+00

Pos logit contributions
 dice+0.090
 contain+0.085
 else+0.083
ents+0.083
 clearly+0.074
Neg logit contributions
uck-0.072
oo-0.062
ivered-0.062
 Wolf-0.061
 Cow-0.060
Token�
Token ID447
Feature activation-8.90e+00

Pos logit contributions
 dice+0.274
 contain+0.264
ents+0.246
 else+0.245
 clearly+0.229
Neg logit contributions
uck-0.183
ivered-0.159
burn-0.157
oo-0.154
 Wolf-0.153
Token�
Token ID250
Feature activation+3.44e+00

Pos logit contributions
 dice+1.002
 contain+0.926
 else+0.911
ents+0.899
 clearly+0.847
Neg logit contributions
uck-0.476
burn-0.450
ivered-0.432
 Wolf-0.422
 Cow-0.419
Token 1
Token ID352
Feature activation-1.18e+01

Pos logit contributions
uck+0.224
burn+0.198
ivered+0.194
 Wolf+0.189
 Cow+0.187
Neg logit contributions
 dice-0.360
 contain-0.343
ents-0.329
 else-0.325
 clearly-0.302
Token,
Token ID11
Feature activation+2.51e-01

Pos logit contributions
 dice+1.118
 contain+1.086
 else+1.010
ents+1.003
 clearly+0.960
Neg logit contributions
uck-0.879
oo-0.766
 Cow-0.742
 Wolf-0.741
burn-0.737
Token 2
Token ID362
Feature activation-8.59e+00

Pos logit contributions
uck+0.023
 Cow+0.021
 Wolf+0.021
burn+0.021
oo+0.020
Neg logit contributions
 dice-0.018
 contain-0.017
 else-0.016
ents-0.015
 clearly-0.015
Token,
Token ID11
Feature activation+1.57e-01

Pos logit contributions
 dice+0.860
 contain+0.854
 else+0.764
ents+0.759
 clearly+0.742
Neg logit contributions
uck-0.576
oo-0.497
 Cow-0.489
 Wolf-0.477
P-0.473
Token 3
Token ID513
Feature activation-3.63e-03

Pos logit contributions
uck+0.014
 Cow+0.013
 Wolf+0.013
burn+0.013
oo+0.012
Neg logit contributions
 dice-0.011
 contain-0.011
 else-0.010
ents-0.010
 clearly-0.009
Token,
Token ID11
Feature activation+1.50e-01

Pos logit contributions
 dice+0.000
 contain+0.000
ents+0.000
 else+0.000
 clearly+0.000
Neg logit contributions
uck-0.000
oo-0.000
ivered-0.000
 Cow-0.000
 Wolf-0.000
Token park
Token ID3952
Feature activation-9.09e-01

Pos logit contributions
uck+0.047
burn+0.042
ivered+0.041
 Wolf+0.041
 Cow+0.041
Neg logit contributions
 dice-0.056
 contain-0.053
ents-0.051
 else-0.050
 clearly-0.046
Token tomorrow
Token ID9439
Feature activation-2.43e+00

Pos logit contributions
 dice+0.084
 contain+0.078
ents+0.076
 else+0.074
 clearly+0.068
Neg logit contributions
uck-0.071
burn-0.063
ivered-0.063
 Cow-0.061
 Wolf-0.060
Token and
Token ID290
Feature activation-2.28e-01

Pos logit contributions
 dice+0.204
 contain+0.193
ents+0.183
 else+0.181
 clearly+0.166
Neg logit contributions
uck-0.206
ivered-0.189
burn-0.189
 Wolf-0.183
 Cow-0.182
Token let
Token ID1309
Feature activation+8.78e-01

Pos logit contributions
 dice+0.021
 contain+0.019
ents+0.019
 else+0.018
ooters+0.017
Neg logit contributions
uck-0.018
burn-0.016
ivered-0.016
 Cow-0.016
 Wolf-0.016
Token someone
Token ID2130
Feature activation+1.88e+00

Pos logit contributions
uck+0.072
ivered+0.068
burn+0.066
 Cow+0.063
 Wolf+0.063
Neg logit contributions
 dice-0.075
 contain-0.070
 else-0.068
ents-0.067
 clearly-0.062
Token else
Token ID2073
Feature activation-1.18e+01

Pos logit contributions
uck+0.148
burn+0.134
 Cow+0.125
oo+0.123
P+0.120
Neg logit contributions
 dice-0.173
ents-0.150
 contain-0.139
 levers-0.136
 equations-0.133
Token have
Token ID423
Feature activation-9.79e-01

Pos logit contributions
 dice+1.061
 contain+0.947
ents+0.932
 else+0.862
 clearly+0.826
Neg logit contributions
uck-0.970
burn-0.869
ivered-0.850
 Cow-0.838
oo-0.825
Token a
Token ID257
Feature activation+4.89e-01

Pos logit contributions
 dice+0.087
 contain+0.083
ents+0.078
 else+0.077
 clearly+0.071
Neg logit contributions
uck-0.079
burn-0.071
ivered-0.071
 Cow-0.069
 Wolf-0.069
Token turn
Token ID1210
Feature activation-3.81e+00

Pos logit contributions
uck+0.039
burn+0.037
 Wolf+0.036
 Cow+0.036
 fever+0.035
Neg logit contributions
 dice-0.042
 contain-0.040
ents-0.039
 else-0.036
 clearly-0.035
Token,"
Token ID553
Feature activation-2.48e-01

Pos logit contributions
 dice+0.362
 contain+0.339
ents+0.318
 else+0.310
 clearly+0.288
Neg logit contributions
uck-0.290
burn-0.261
ivered-0.248
 Wolf-0.244
 Cow-0.243
Token she
Token ID673
Feature activation-1.71e+00

Pos logit contributions
 dice+0.021
 contain+0.020
 else+0.019
ents+0.018
 clearly+0.017
Neg logit contributions
uck-0.021
burn-0.020
ivered-0.020
 Wolf-0.019
 Cow-0.019
Token went
Token ID1816
Feature activation+1.71e+00

Pos logit contributions
 contain+0.023
 dice+0.023
hips+0.021
rig+0.021
ooters+0.021
Neg logit contributions
uck-0.020
aged-0.016
eled-0.016
ivered-0.015
P-0.015
Token first
Token ID717
Feature activation-3.33e+00

Pos logit contributions
uck+0.106
burn+0.076
 wolf+0.075
 Wolf+0.074
oo+0.074
Neg logit contributions
 contain-0.185
 dice-0.179
ents-0.177
 else-0.172
inations-0.166
Token.
Token ID13
Feature activation-1.08e-01

Pos logit contributions
 dice+0.377
 contain+0.356
ents+0.345
 else+0.334
inations+0.331
Neg logit contributions
uck-0.191
oo-0.136
 Wolf-0.125
 self-0.123
ivered-0.122
Token He
Token ID679
Feature activation+6.45e-01

Pos logit contributions
 dice+0.012
 contain+0.011
 else+0.011
ents+0.011
inations+0.010
Neg logit contributions
uck-0.007
 Cow-0.006
 Wolf-0.006
 Ow-0.005
burn-0.005
Token counted
Token ID14789
Feature activation-9.70e+00

Pos logit contributions
uck+0.051
aged+0.044
 Wolf+0.044
 self+0.043
eled+0.043
Neg logit contributions
 contain-0.053
 dice-0.051
ooters-0.047
hips-0.046
hest-0.045
Token 1
Token ID352
Feature activation-1.18e+01

Pos logit contributions
 contain+1.010
 dice+0.989
 else+0.883
inations+0.868
 believe+0.866
Neg logit contributions
uck-0.650
oo-0.530
 Cow-0.527
 Wolf-0.517
burn-0.515
Token,
Token ID11
Feature activation+5.34e-01

Pos logit contributions
 contain+1.400
 dice+1.306
inations+1.260
rig+1.252
nes+1.238
Neg logit contributions
uck-0.640
oo-0.455
over-0.376
aged-0.376
el-0.359
Token2
Token ID17
Feature activation-7.59e+00

Pos logit contributions
uck+0.051
 Cow+0.048
 Wolf+0.047
P+0.047
 wolf+0.046
Neg logit contributions
 contain-0.036
 dice-0.034
 else-0.031
inations-0.029
 outage-0.028
Token,
Token ID11
Feature activation+5.37e-01

Pos logit contributions
 contain+0.871
 dice+0.837
 else+0.773
rig+0.753
ooters+0.744
Neg logit contributions
uck-0.421
oo-0.359
ivered-0.303
 Cow-0.300
burn-0.290
Token3
Token ID18
Feature activation-5.61e+00

Pos logit contributions
uck+0.044
 Cow+0.040
 Wolf+0.039
P+0.038
 wolf+0.037
Neg logit contributions
 contain-0.044
 dice-0.042
 else-0.039
inations-0.038
 outage-0.037
Token,
Token ID11
Feature activation+4.95e-01

Pos logit contributions
 contain+0.614
 dice+0.601
 else+0.551
ooters+0.538
ents+0.530
Neg logit contributions
uck-0.330
oo-0.281
ivered-0.254
 Cow-0.241
burn-0.240
Token her
Token ID607
Feature activation+7.14e-01

Pos logit contributions
uck+0.030
ivered+0.028
burn+0.027
oo+0.026
 Cow+0.026
Neg logit contributions
 dice-0.026
 contain-0.024
 else-0.023
ents-0.023
 clearly-0.021
Token joy
Token ID8716
Feature activation-1.14e+00

Pos logit contributions
uck+0.053
burn+0.047
ivered+0.047
 Wolf+0.046
 Cow+0.045
Neg logit contributions
 dice-0.069
 contain-0.065
ents-0.063
 else-0.061
 clearly-0.056
Token spread
Token ID4104
Feature activation+1.86e+00

Pos logit contributions
 dice+0.105
 contain+0.101
ents+0.095
 else+0.095
 clearly+0.088
Neg logit contributions
uck-0.087
burn-0.078
ivered-0.078
 Wolf-0.076
 Cow-0.076
Token and
Token ID290
Feature activation-3.68e-01

Pos logit contributions
uck+0.114
burn+0.099
ivered+0.097
 Wolf+0.093
 Cow+0.092
Neg logit contributions
 dice-0.204
 contain-0.193
ents-0.186
 else-0.183
ooters-0.171
Token everyone
Token ID2506
Feature activation-1.19e+00

Pos logit contributions
 dice+0.033
ents+0.031
 contain+0.030
ooters+0.029
 else+0.027
Neg logit contributions
uck-0.028
burn-0.026
ivered-0.024
 Wolf-0.023
 self-0.023
Token else
Token ID2073
Feature activation-1.18e+01

Pos logit contributions
 dice+0.111
ents+0.096
ooters+0.093
 contain+0.092
aly+0.084
Neg logit contributions
uck-0.094
ivered-0.084
burn-0.082
eled-0.081
aged-0.079
Token began
Token ID2540
Feature activation-5.00e-01

Pos logit contributions
 dice+1.083
 contain+0.980
ents+0.955
ooters+0.886
 else+0.845
Neg logit contributions
uck-0.964
burn-0.841
ivered-0.835
 Wolf-0.787
oo-0.770
Token to
Token ID284
Feature activation-3.61e-01

Pos logit contributions
 dice+0.047
 contain+0.044
ents+0.041
 else+0.041
ooters+0.037
Neg logit contributions
uck-0.039
burn-0.036
ivered-0.034
oo-0.033
 Wolf-0.033
Token feel
Token ID1254
Feature activation+2.70e+00

Pos logit contributions
 dice+0.027
ents+0.024
 contain+0.023
 else+0.022
ooters+0.020
Neg logit contributions
uck-0.034
burn-0.033
ivered-0.032
oo-0.031
 Cow-0.031
Token light
Token ID1657
Feature activation-1.63e-01

Pos logit contributions
uck+0.212
burn+0.192
ivered+0.189
 Cow+0.185
 Wolf+0.185
Neg logit contributions
 dice-0.248
 contain-0.227
ents-0.220
 else-0.206
ooters-0.200
Token and
Token ID290
Feature activation-3.87e-01

Pos logit contributions
 dice+0.015
 contain+0.014
ents+0.013
 else+0.013
ooters+0.012
Neg logit contributions
uck-0.013
burn-0.012
ivered-0.012
 Wolf-0.011
 Cow-0.011
Token it
Token ID340
Feature activation+8.29e-01

Pos logit contributions
 dice+0.391
 contain+0.372
ents+0.352
 else+0.338
ooters+0.315
Neg logit contributions
uck-0.284
burn-0.258
ivered-0.242
 Wolf-0.237
 Cow-0.236
Token off
Token ID572
Feature activation+9.80e-01

Pos logit contributions
uck+0.056
burn+0.051
ivered+0.048
 Wolf+0.046
oo+0.046
Neg logit contributions
 dice-0.085
 contain-0.080
ents-0.078
 else-0.073
ooters-0.069
Token or
Token ID393
Feature activation-3.23e+00

Pos logit contributions
uck+0.069
burn+0.063
 Wolf+0.060
 Cow+0.060
ivered+0.060
Neg logit contributions
 dice-0.095
 contain-0.088
ents-0.086
 else-0.082
ooters-0.080
Token play
Token ID711
Feature activation+3.03e-01

Pos logit contributions
 dice+0.281
ents+0.261
 contain+0.254
hest+0.234
ooters+0.230
Neg logit contributions
uck-0.250
burn-0.235
 fever-0.223
 Cow-0.217
 Wolf-0.215
Token something
Token ID1223
Feature activation+3.01e+00

Pos logit contributions
uck+0.023
burn+0.020
ivered+0.020
oo+0.020
 Wolf+0.020
Neg logit contributions
 dice-0.028
 contain-0.027
ents-0.027
 else-0.024
ooters-0.024
Token else
Token ID2073
Feature activation-1.18e+01

Pos logit contributions
uck+0.291
 suffering+0.269
burn+0.266
P+0.263
 wolf+0.262
Neg logit contributions
 dice-0.202
ents-0.183
 contain-0.171
 1-0.148
ooters-0.142
Token.
Token ID13
Feature activation-2.84e-01

Pos logit contributions
 dice+1.235
 contain+1.146
ents+1.099
ooters+1.003
 levers+0.998
Neg logit contributions
uck-0.782
burn-0.668
ivered-0.622
 suffering-0.621
 Wolf-0.617
Token She
Token ID1375
Feature activation+1.48e+00

Pos logit contributions
 dice+0.030
 contain+0.028
ents+0.027
 else+0.027
 clearly+0.025
Neg logit contributions
uck-0.019
burn-0.016
ivered-0.016
 Wolf-0.016
 Cow-0.016
Token wanted
Token ID2227
Feature activation+9.27e-01

Pos logit contributions
uck+0.103
ivered+0.091
 Cow+0.088
burn+0.088
 Wolf+0.087
Neg logit contributions
 dice-0.146
 contain-0.141
ents-0.132
 else-0.124
ooters-0.123
Token to
Token ID284
Feature activation-1.98e-01

Pos logit contributions
uck+0.061
burn+0.055
 Wolf+0.055
 Cow+0.054
 fever+0.050
Neg logit contributions
 dice-0.094
 contain-0.092
ents-0.086
hest-0.081
ows-0.081
Token enjoy
Token ID2883
Feature activation+9.09e-01

Pos logit contributions
 dice+0.017
ents+0.015
 contain+0.014
 else+0.014
cases+0.013
Neg logit contributions
uck-0.017
burn-0.015
oo-0.015
 Cow-0.015
 Wolf-0.015
Token'll
Token ID1183
Feature activation+1.65e+00

Pos logit contributions
 dice+0.115
 contain+0.108
ents+0.101
 else+0.101
 clearly+0.091
Neg logit contributions
uck-0.135
burn-0.125
ivered-0.124
 Wolf-0.121
 Cow-0.121
Token go
Token ID467
Feature activation-3.51e-01

Pos logit contributions
uck+0.156
burn+0.144
ivered+0.143
 Wolf+0.140
 Cow+0.139
Neg logit contributions
 dice-0.124
 contain-0.117
ents-0.109
 else-0.109
 clearly-0.098
Token and
Token ID290
Feature activation-4.88e-01

Pos logit contributions
 dice+0.034
 contain+0.032
ents+0.031
 else+0.030
 clearly+0.028
Neg logit contributions
uck-0.026
burn-0.023
 Cow-0.022
ivered-0.022
 Wolf-0.022
Token find
Token ID1064
Feature activation-2.67e+00

Pos logit contributions
 dice+0.040
 contain+0.037
ents+0.035
 else+0.034
 clearly+0.031
Neg logit contributions
uck-0.043
burn-0.040
 Wolf-0.039
 Cow-0.039
ivered-0.039
Token something
Token ID1223
Feature activation+2.91e+00

Pos logit contributions
 dice+0.267
 contain+0.258
ents+0.243
 else+0.234
 clearly+0.221
Neg logit contributions
uck-0.186
burn-0.164
 Wolf-0.162
 Cow-0.162
ivered-0.157
Token else
Token ID2073
Feature activation-1.18e+01

Pos logit contributions
uck+0.208
 wolf+0.195
 suffering+0.191
 Cow+0.190
 Wolf+0.189
Neg logit contributions
 dice-0.266
ents-0.244
 contain-0.229
bars-0.210
saw-0.205
Token to
Token ID284
Feature activation-1.51e-01

Pos logit contributions
 dice+1.103
 contain+1.004
ents+0.970
 levers+0.857
ooters+0.841
Neg logit contributions
uck-0.909
 Cow-0.788
 Wolf-0.781
burn-0.758
 wolf-0.749
Token do
Token ID466
Feature activation+1.51e+00

Pos logit contributions
 dice+0.012
 contain+0.011
ents+0.011
 else+0.010
 clearly+0.010
Neg logit contributions
uck-0.014
burn-0.012
ivered-0.012
 Cow-0.012
 Wolf-0.012
Token."
Token ID526
Feature activation+7.09e-01

Pos logit contributions
uck+0.112
ivered+0.096
oo+0.095
burn+0.093
 Wolf+0.093
Neg logit contributions
 dice-0.144
 contain-0.136
ents-0.132
 else-0.121
ooters-0.117
Token
Token ID198
Feature activation-4.34e-01

Pos logit contributions
uck+0.054
burn+0.050
ivered+0.050
 Wolf+0.047
oo+0.046
Neg logit contributions
 dice-0.065
 contain-0.062
 else-0.059
ents-0.059
 clearly-0.054
Token
Token ID198
Feature activation-4.06e-01

Pos logit contributions
 dice+0.044
 contain+0.042
ents+0.040
 else+0.040
 clearly+0.037
Neg logit contributions
uck-0.030
burn-0.026
ivered-0.026
 Cow-0.025
 Wolf-0.025
Token it
Token ID340
Feature activation+1.11e+00

Pos logit contributions
uck+0.230
burn+0.207
ivered+0.204
 Wolf+0.201
 Cow+0.200
Neg logit contributions
 dice-0.253
 contain-0.239
ents-0.225
 else-0.222
 clearly-0.203
Token made
Token ID925
Feature activation+3.73e+00

Pos logit contributions
uck+0.088
burn+0.079
ivered+0.079
 Wolf+0.077
 Cow+0.076
Neg logit contributions
 dice-0.102
 contain-0.095
ents-0.091
 else-0.086
ooters-0.083
Token her
Token ID607
Feature activation+8.89e-01

Pos logit contributions
uck+0.230
burn+0.202
ivered+0.199
 Wolf+0.192
 Cow+0.192
Neg logit contributions
 dice-0.404
 contain-0.386
ents-0.370
 else-0.367
 clearly-0.343
Token face
Token ID1986
Feature activation+5.42e+00

Pos logit contributions
uck+0.070
burn+0.064
ivered+0.063
 Wolf+0.061
 Cow+0.061
Neg logit contributions
 dice-0.082
 contain-0.077
ents-0.073
 else-0.072
 clearly-0.066
Token scr
Token ID6040
Feature activation+4.25e-02

Pos logit contributions
uck+0.471
 Wolf+0.430
ivered+0.429
 Cow+0.426
burn+0.423
Neg logit contributions
 dice-0.430
 contain-0.424
 else-0.405
ents-0.401
 clearly-0.373
Tokenunch
Token ID3316
Feature activation+7.02e+00

Pos logit contributions
burn+0.002
 Wolf+0.002
uck+0.002
 Cow+0.002
 wolf+0.002
Neg logit contributions
ents-0.005
 dice-0.005
 contain-0.005
 else-0.004
 clearly-0.004
Token up
Token ID510
Feature activation-3.42e-01

Pos logit contributions
uck+0.451
ivered+0.418
burn+0.416
 Cow+0.409
 Wolf+0.408
Neg logit contributions
 dice-0.685
 contain-0.665
 else-0.647
ents-0.642
 clearly-0.618
Token!
Token ID0
Feature activation+2.97e-01

Pos logit contributions
 dice+0.028
 contain+0.027
ents+0.025
 else+0.025
 clearly+0.023
Neg logit contributions
uck-0.029
ivered-0.027
burn-0.027
 Wolf-0.026
 Cow-0.026
Token
Token ID220
Feature activation+5.78e-01

Pos logit contributions
uck+0.022
burn+0.019
ivered+0.019
 Wolf+0.019
 Cow+0.019
Neg logit contributions
 dice-0.029
 contain-0.027
ents-0.026
 else-0.026
 clearly-0.024
Token
Token ID198
Feature activation-2.77e-01

Pos logit contributions
uck+0.041
burn+0.036
ivered+0.036
 Wolf+0.035
 Cow+0.035
Neg logit contributions
 dice-0.057
 contain-0.055
ents-0.052
 else-0.052
 clearly-0.048
Token
Token ID198
Feature activation-2.88e-01

Pos logit contributions
 dice+0.028
 contain+0.027
ents+0.026
 else+0.025
 clearly+0.024
Neg logit contributions
uck-0.019
burn-0.017
ivered-0.017
 Cow-0.016
 Wolf-0.016
Token the
Token ID262
Feature activation-4.04e-01

Pos logit contributions
uck+0.131
 Wolf+0.107
oo+0.103
 Cow+0.101
 wolf+0.095
Neg logit contributions
 contain-0.233
 dice-0.231
ents-0.229
 else-0.222
inations-0.218
Token street
Token ID4675
Feature activation-2.86e+00

Pos logit contributions
 dice+0.034
 contain+0.034
 else+0.032
ents+0.031
inations+0.031
Neg logit contributions
uck-0.032
 Wolf-0.029
 wolf-0.028
 Cow-0.028
burn-0.028
Token whist
Token ID32262
Feature activation+2.72e+00

Pos logit contributions
 dice+0.295
 contain+0.278
 else+0.269
ents+0.267
inations+0.251
Neg logit contributions
uck-0.192
ivered-0.155
burn-0.154
 Wolf-0.154
oo-0.154
Tokenling
Token ID1359
Feature activation+7.67e+00

Pos logit contributions
uck+0.143
burn+0.125
ivered+0.119
 Cow+0.118
 Wolf+0.118
Neg logit contributions
 dice-0.320
 contain-0.307
ents-0.295
 else-0.294
 clearly-0.278
Token,
Token ID11
Feature activation+6.17e-01

Pos logit contributions
uck+0.576
ivered+0.507
 Wolf+0.505
burn+0.502
oo+0.497
Neg logit contributions
 dice-0.716
 contain-0.669
ents-0.664
 else-0.634
ooters-0.589
Token feeling
Token ID4203
Feature activation+6.23e+00

Pos logit contributions
uck+0.051
burn+0.047
ivered+0.046
 Wolf+0.046
 Cow+0.046
Neg logit contributions
 dice-0.053
 contain-0.050
ents-0.047
 else-0.047
ooters-0.042
Token too
Token ID1165
Feature activation-1.07e+00

Pos logit contributions
uck+0.541
burn+0.493
ivered+0.489
 Cow+0.469
 Wolf+0.466
Neg logit contributions
 dice-0.518
 contain-0.495
 else-0.476
ents-0.464
 clearly-0.437
Token scared
Token ID12008
Feature activation+5.85e-01

Pos logit contributions
 dice+0.097
 contain+0.091
ents+0.086
 else+0.086
 clearly+0.079
Neg logit contributions
uck-0.086
burn-0.078
ivered-0.077
 Wolf-0.075
 Cow-0.075
Token to
Token ID284
Feature activation-1.61e-01

Pos logit contributions
uck+0.048
burn+0.043
ivered+0.043
 Wolf+0.042
 Cow+0.042
Neg logit contributions
 dice-0.052
 contain-0.049
ents-0.046
 else-0.045
 clearly-0.041
Token smile
Token ID8212
Feature activation-2.03e-01

Pos logit contributions
 dice+0.013
 contain+0.012
ents+0.012
 else+0.011
 clearly+0.010
Neg logit contributions
uck-0.014
burn-0.013
ivered-0.013
 Wolf-0.013
 Cow-0.013
Token.
Token ID13
Feature activation-1.14e-01

Pos logit contributions
 dice+0.016
 contain+0.015
 else+0.014
ents+0.014
 clearly+0.013
Neg logit contributions
uck-0.018
ivered-0.017
burn-0.017
 Cow-0.016
 Wolf-0.016
Token named
Token ID3706
Feature activation+1.81e+00

Pos logit contributions
uck+0.425
ivered+0.404
 Ow+0.399
burn+0.392
 Wolf+0.391
Neg logit contributions
 else-0.389
 contain-0.368
 dice-0.356
ents-0.337
 clearly-0.311
Token D
Token ID360
Feature activation+5.55e+00

Pos logit contributions
ivered+0.131
uck+0.127
ivering+0.110
burn+0.109
eled+0.107
Neg logit contributions
 dice-0.175
 else-0.166
 contain-0.163
ents-0.162
 Saturn-0.148
Tokenucky
Token ID5309
Feature activation+7.41e+00

Pos logit contributions
uck+0.293
burn+0.284
ivered+0.273
 Wolf+0.267
aged+0.256
Neg logit contributions
 dice-0.599
 contain-0.583
 else-0.580
ents-0.567
ooters-0.524
Token.
Token ID13
Feature activation+3.78e-02

Pos logit contributions
uck+0.769
ivered+0.710
burn+0.700
 Wolf+0.660
 fever+0.649
Neg logit contributions
 dice-0.478
 else-0.460
 contain-0.448
ents-0.430
 clearly-0.392
Token D
Token ID360
Feature activation+5.56e+00

Pos logit contributions
ivered+0.003
burn+0.003
uck+0.003
aged+0.003
 fever+0.003
Neg logit contributions
 dice-0.003
 contain-0.003
ents-0.003
 wins-0.003
 else-0.003
Tokenucky
Token ID5309
Feature activation+7.42e+00

Pos logit contributions
burn+0.291
 Wolf+0.269
eled+0.257
 Cow+0.253
aged+0.247
Neg logit contributions
 else-0.591
 dice-0.574
 contain-0.558
rig-0.545
hest-0.543
Token loved
Token ID6151
Feature activation+1.38e+00

Pos logit contributions
uck+0.698
burn+0.659
ivered+0.624
 Wolf+0.604
 Ow+0.584
Neg logit contributions
 else-0.545
 dice-0.530
 contain-0.489
ents-0.457
 clearly-0.445
Token to
Token ID284
Feature activation-7.27e-02

Pos logit contributions
ivered+0.167
uck+0.157
ivering+0.155
ivery+0.154
over+0.145
Neg logit contributions
 dice-0.069
 else-0.064
ents-0.061
 contain-0.060
ooters-0.054
Token swim
Token ID9422
Feature activation+1.31e-01

Pos logit contributions
 contain+0.004
 dice+0.004
 count+0.003
 else+0.003
 believe+0.003
Neg logit contributions
uck-0.008
ivered-0.008
 Wolf-0.008
eled-0.008
burn-0.008
Token in
Token ID287
Feature activation-4.42e-01

Pos logit contributions
uck+0.011
burn+0.010
ivered+0.010
 Wolf+0.009
 Cow+0.009
Neg logit contributions
 dice-0.012
 contain-0.011
ents-0.010
 else-0.010
 clearly-0.010
Token the
Token ID262
Feature activation-2.98e-01

Pos logit contributions
 dice+0.037
 contain+0.034
 else+0.032
ents+0.032
 clearly+0.029
Neg logit contributions
uck-0.039
ivered-0.036
burn-0.036
 Wolf-0.034
 Cow-0.034
Token whistle
Token ID16121
Feature activation+3.83e+00

Pos logit contributions
 dice+0.009
 contain+0.009
 else+0.008
ents+0.008
 clearly+0.007
Neg logit contributions
uck-0.014
ivered-0.013
burn-0.013
 Wolf-0.013
 Cow-0.013
Token throughout
Token ID3690
Feature activation-1.12e+00

Pos logit contributions
uck+0.290
burn+0.253
ivered+0.253
 Wolf+0.249
oo+0.247
Neg logit contributions
 dice-0.361
 contain-0.342
ents-0.330
 else-0.310
ooters-0.298
Token the
Token ID262
Feature activation-4.22e-01

Pos logit contributions
 dice+0.104
 contain+0.098
ents+0.093
 else+0.093
 clearly+0.086
Neg logit contributions
uck-0.086
burn-0.078
ivered-0.078
 Wolf-0.075
 Cow-0.075
Token day
Token ID1110
Feature activation+7.58e-01

Pos logit contributions
 dice+0.043
 contain+0.041
 else+0.039
ents+0.039
 clearly+0.037
Neg logit contributions
uck-0.029
ivered-0.026
burn-0.026
 Cow-0.024
oo-0.024
Token until
Token ID1566
Feature activation-2.45e+00

Pos logit contributions
uck+0.052
burn+0.044
over+0.043
 Wolf+0.043
oo+0.043
Neg logit contributions
 dice-0.079
 contain-0.072
ents-0.068
 else-0.067
ooters-0.064
Token dinner
Token ID8073
Feature activation+6.48e+00

Pos logit contributions
 dice+0.189
 contain+0.183
 else+0.175
 clearly+0.163
ents+0.161
Neg logit contributions
uck-0.215
ivered-0.214
ivering-0.196
burn-0.196
 Cow-0.195
Token.
Token ID13
Feature activation-2.12e-01

Pos logit contributions
uck+0.435
burn+0.373
ivered+0.369
 Wolf+0.355
oo+0.355
Neg logit contributions
 dice-0.666
 contain-0.631
ents-0.621
 else-0.581
ooters-0.566
Token<|endoftext|>
Token ID50256
Feature activation+1.38e+00

Pos logit contributions
 dice+0.019
 contain+0.018
 else+0.017
ents+0.017
 clearly+0.016
Neg logit contributions
uck-0.017
burn-0.016
ivered-0.016
 Cow-0.014
oo-0.014
TokenOnce
Token ID7454
Feature activation-7.27e-01

Pos logit contributions
uck+0.101
burn+0.098
ivered+0.098
 fever+0.091
 Cow+0.089
Neg logit contributions
 dice-0.129
 contain-0.117
 else-0.116
ents-0.112
 counts-0.107
Token upon
Token ID2402
Feature activation-1.06e+00

Pos logit contributions
 dice+0.074
 contain+0.069
ents+0.065
 levers+0.061
 else+0.060
Neg logit contributions
uck-0.053
ivered-0.044
oo-0.043
 Cow-0.043
 Wolf-0.043
Token a
Token ID257
Feature activation+5.25e-01

Pos logit contributions
 dice+0.094
 contain+0.091
 else+0.089
ents+0.084
 clearly+0.080
Neg logit contributions
uck-0.086
burn-0.078
ivered-0.077
 Cow-0.075
 Wolf-0.075
TokenL
Token ID43
Feature activation-1.33e-01

Pos logit contributions
 dice+0.030
 contain+0.029
ents+0.028
 else+0.027
 clearly+0.025
Neg logit contributions
uck-0.023
ivered-0.021
burn-0.021
 Cow-0.020
 Wolf-0.020
Tokenily
Token ID813
Feature activation-3.39e+00

Pos logit contributions
 dice+0.011
 contain+0.010
ents+0.009
 else+0.009
 clearly+0.009
Neg logit contributions
uck-0.012
ivered-0.011
burn-0.011
oo-0.010
 Cow-0.010
Token felt
Token ID2936
Feature activation+2.89e+00

Pos logit contributions
 dice+0.250
 else+0.244
ents+0.238
 contain+0.233
 clearly+0.211
Neg logit contributions
uck-0.286
burn-0.282
 Wolf-0.276
 Ow-0.272
 Cow-0.269
Token better
Token ID1365
Feature activation-1.20e+00

Pos logit contributions
burn+0.315
uck+0.311
ivered+0.295
 Wolf+0.289
oo+0.288
Neg logit contributions
 else-0.186
 contain-0.167
 clearly-0.148
 believe-0.140
 dice-0.139
Token after
Token ID706
Feature activation-5.42e-01

Pos logit contributions
 contain+0.079
 else+0.075
ents+0.073
 dice+0.073
 clearly+0.068
Neg logit contributions
uck-0.115
ivered-0.113
 Ow-0.111
burn-0.110
 Wolf-0.105
Token drinking
Token ID7722
Feature activation+6.91e+00

Pos logit contributions
 else+0.040
 dice+0.040
ents+0.040
 clearly+0.039
 contain+0.039
Neg logit contributions
uck-0.049
ivered-0.045
P-0.045
burn-0.041
eled-0.041
Token the
Token ID262
Feature activation-2.61e-01

Pos logit contributions
ivered+0.633
uck+0.626
P+0.582
eled+0.574
burn+0.562
Neg logit contributions
 dice-0.483
 contain-0.476
 else-0.465
 clearly-0.427
ents-0.423
Token hot
Token ID3024
Feature activation+2.95e+00

Pos logit contributions
 dice+0.023
 else+0.022
 contain+0.022
ents+0.020
 seats+0.020
Neg logit contributions
uck-0.019
ivered-0.018
P-0.017
eled-0.016
 Cow-0.016
Token cocoa
Token ID35845
Feature activation+4.49e+00

Pos logit contributions
uck+0.202
ivered+0.202
P+0.182
 Cow+0.179
 Wolf+0.175
Neg logit contributions
 dice-0.270
 contain-0.253
 else-0.246
 counts-0.241
 count-0.230
Token and
Token ID290
Feature activation-1.23e-01

Pos logit contributions
uck+0.366
burn+0.335
ivered+0.325
 Wolf+0.320
 Cow+0.318
Neg logit contributions
 dice-0.400
 contain-0.377
ents-0.360
 else-0.353
 clearly-0.324
Token sn
Token ID3013
Feature activation+2.63e+00

Pos logit contributions
 dice+0.010
 else+0.010
 contain+0.009
ents+0.009
 clearly+0.009
Neg logit contributions
uck-0.011
ivered-0.010
P-0.010
burn-0.010
oo-0.009
Token
Token ID220
Feature activation+7.28e-01

Pos logit contributions
uck+0.017
burn+0.016
ivered+0.016
 Wolf+0.015
 Cow+0.015
Neg logit contributions
 dice-0.021
 contain-0.019
 else-0.019
ents-0.019
 clearly-0.017
Token
Token ID198
Feature activation-8.30e-02

Pos logit contributions
uck+0.046
ivered+0.039
 Cow+0.038
 Wolf+0.038
burn+0.038
Neg logit contributions
 dice-0.079
 contain-0.076
ents-0.070
 else-0.069
 clearly-0.065
Token
Token ID198
Feature activation-1.04e-01

Pos logit contributions
 dice+0.009
 contain+0.009
ents+0.008
 else+0.008
 clearly+0.008
Neg logit contributions
uck-0.005
 Cow-0.004
oo-0.004
 Wolf-0.004
ivered-0.004
TokenBut
Token ID1537
Feature activation+1.11e+00

Pos logit contributions
 dice+0.009
 contain+0.008
 else+0.008
ents+0.008
 clearly+0.007
Neg logit contributions
uck-0.009
burn-0.008
ivered-0.008
 Wolf-0.008
 Cow-0.008
Token Tim
Token ID5045
Feature activation-2.91e-01

Pos logit contributions
uck+0.120
ivered+0.115
burn+0.113
ivering+0.107
P+0.107
Neg logit contributions
 dice-0.067
 contain-0.059
 else-0.058
ents-0.058
 count-0.050
Tokenmy
Token ID1820
Feature activation+8.84e-01

Pos logit contributions
 dice+0.023
ents+0.021
 else+0.021
 contain+0.021
 counts+0.019
Neg logit contributions
uck-0.025
burn-0.023
 Wolf-0.023
 Cow-0.023
ivered-0.022
Token was
Token ID373
Feature activation+4.05e-01

Pos logit contributions
burn+0.078
uck+0.077
 Cow+0.071
 Ow+0.071
ivered+0.071
Neg logit contributions
 dice-0.065
 else-0.065
ents-0.062
 contain-0.061
 clearly-0.056
Token also
Token ID635
Feature activation-3.84e+00

Pos logit contributions
uck+0.031
ivered+0.027
 Wolf+0.027
burn+0.027
 Cow+0.027
Neg logit contributions
 dice-0.039
 contain-0.037
ents-0.034
 else-0.034
ooters-0.031
Token anxious
Token ID18116
Feature activation+1.52e+00

Pos logit contributions
 dice+0.313
 contain+0.296
 else+0.295
ents+0.281
 clearly+0.261
Neg logit contributions
uck-0.331
burn-0.322
ivered-0.303
 Cow-0.298
 Wolf-0.297
Token because
Token ID780
Feature activation+3.77e-01

Pos logit contributions
uck+0.121
burn+0.107
 Wolf+0.106
ivered+0.105
 Cow+0.104
Neg logit contributions
 dice-0.143
 contain-0.134
ents-0.124
 else-0.122
 clearly-0.112
Token he
Token ID339
Feature activation-1.31e+00

Pos logit contributions
uck+0.030
 Wolf+0.028
 Cow+0.028
 wolf+0.027
burn+0.027
Neg logit contributions
 dice-0.033
 contain-0.032
ents-0.029
 else-0.028
ows-0.028
Token the
Token ID262
Feature activation-1.79e-01

Pos logit contributions
uck+0.039
burn+0.035
ivered+0.035
 Wolf+0.034
 Cow+0.034
Neg logit contributions
 dice-0.048
 contain-0.045
ents-0.043
 else-0.043
 clearly-0.039
Token others
Token ID1854
Feature activation+1.62e+00

Pos logit contributions
 dice+0.017
 contain+0.016
ents+0.016
 else+0.015
 clearly+0.014
Neg logit contributions
uck-0.013
burn-0.012
ivered-0.011
 Wolf-0.011
 Cow-0.011
Token.
Token ID13
Feature activation-2.31e-01

Pos logit contributions
uck+0.114
burn+0.101
 Wolf+0.100
ivered+0.099
 Cow+0.098
Neg logit contributions
 dice-0.159
 contain-0.148
ents-0.140
 else-0.132
ooters-0.130
Token
Token ID220
Feature activation+4.78e-01

Pos logit contributions
 dice+0.024
 contain+0.023
 else+0.022
ents+0.022
 clearly+0.020
Neg logit contributions
uck-0.015
 Cow-0.013
 Wolf-0.013
burn-0.013
ivered-0.012
Token
Token ID198
Feature activation-3.93e-01

Pos logit contributions
uck+0.031
ivered+0.026
burn+0.026
 Cow+0.026
 Wolf+0.026
Neg logit contributions
 dice-0.051
 contain-0.049
ents-0.046
 else-0.045
 clearly-0.042
Token
Token ID198
Feature activation-3.83e-01

Pos logit contributions
 dice+0.042
 contain+0.040
ents+0.038
 else+0.037
 clearly+0.035
Neg logit contributions
uck-0.025
 Cow-0.021
ivered-0.021
oo-0.021
 Wolf-0.021
TokenAfter
Token ID3260
Feature activation+1.42e+00

Pos logit contributions
 dice+0.035
 contain+0.034
ents+0.032
 else+0.031
 clearly+0.029
Neg logit contributions
uck-0.029
burn-0.026
ivered-0.026
 Cow-0.026
 Wolf-0.026
Token the
Token ID262
Feature activation-2.34e-01

Pos logit contributions
uck+0.119
ivered+0.113
burn+0.109
 Wolf+0.101
 Cow+0.101
Neg logit contributions
 dice-0.125
 contain-0.117
 else-0.113
ents-0.110
 clearly-0.101
Token parade
Token ID16134
Feature activation-2.09e+00

Pos logit contributions
 dice+0.022
 contain+0.021
ents+0.020
 else+0.020
 clearly+0.018
Neg logit contributions
uck-0.018
burn-0.016
ivered-0.016
 Wolf-0.016
 Cow-0.016
Token,
Token ID11
Feature activation+5.02e-01

Pos logit contributions
 dice+0.218
 contain+0.203
ents+0.200
 else+0.191
ooters+0.181
Neg logit contributions
uck-0.141
ivered-0.120
burn-0.120
 Wolf-0.118
 Cow-0.117
Token the
Token ID262
Feature activation-2.87e-02

Pos logit contributions
uck+0.042
burn+0.038
ivered+0.038
 Wolf+0.037
 Cow+0.037
Neg logit contributions
 dice-0.043
 contain-0.041
ents-0.039
 else-0.038
 clearly-0.035
Token at
Token ID379
Feature activation+3.87e-01

Pos logit contributions
uck+0.011
ivered+0.009
 Wolf+0.009
 Cow+0.009
burn+0.009
Neg logit contributions
 dice-0.018
 contain-0.017
ents-0.016
 else-0.015
ooters-0.015
Token his
Token ID465
Feature activation+9.73e-01

Pos logit contributions
uck+0.030
burn+0.027
ivered+0.027
 Wolf+0.026
 Cow+0.026
Neg logit contributions
 dice-0.036
 contain-0.034
ents-0.032
 else-0.032
 clearly-0.029
Token wife
Token ID3656
Feature activation+1.13e+00

Pos logit contributions
uck+0.065
burn+0.058
ivered+0.057
 Wolf+0.056
 Cow+0.055
Neg logit contributions
 dice-0.100
 contain-0.096
ents-0.092
 else-0.091
 clearly-0.084
Token and
Token ID290
Feature activation-1.69e-01

Pos logit contributions
uck+0.081
burn+0.072
 Cow+0.071
 Wolf+0.071
ivered+0.070
Neg logit contributions
 dice-0.110
 contain-0.104
ents-0.095
 else-0.095
ooters-0.091
Token shrugged
Token ID28271
Feature activation+1.07e+00

Pos logit contributions
 dice+0.014
 contain+0.013
ents+0.012
 else+0.012
 clearly+0.011
Neg logit contributions
uck-0.015
burn-0.014
ivered-0.014
 Cow-0.014
 Wolf-0.013
Token.
Token ID13
Feature activation-1.53e-01

Pos logit contributions
uck+0.091
burn+0.084
ivered+0.084
 Wolf+0.080
 Cow+0.079
Neg logit contributions
 dice-0.088
 contain-0.086
 else-0.083
ents-0.080
 clearly-0.077
Token
Token ID198
Feature activation-4.46e-01

Pos logit contributions
 dice+0.014
 contain+0.013
ents+0.013
 else+0.013
 clearly+0.012
Neg logit contributions
uck-0.012
burn-0.011
ivered-0.011
oo-0.010
 Wolf-0.010
Token
Token ID198
Feature activation-4.34e-01

Pos logit contributions
 dice+0.042
 contain+0.040
 else+0.039
ents+0.039
 clearly+0.035
Neg logit contributions
uck-0.033
burn-0.030
ivered-0.030
 Wolf-0.028
 Cow-0.028
TokenBen
Token ID11696
Feature activation+6.29e-01

Pos logit contributions
 dice+0.041
 else+0.038
ents+0.038
 contain+0.038
 clearly+0.034
Neg logit contributions
uck-0.032
burn-0.031
ivered-0.029
 Wolf-0.028
 fever-0.027
Token and
Token ID290
Feature activation-1.68e-01

Pos logit contributions
uck+0.055
burn+0.051
ivered+0.050
 Wolf+0.049
 Cow+0.049
Neg logit contributions
 dice-0.051
 contain-0.048
ents-0.046
 else-0.046
 clearly-0.042
Token Lily
Token ID20037
Feature activation-8.44e-01

Pos logit contributions
 dice+0.013
 contain+0.012
 else+0.012
ents+0.011
 clearly+0.010
Neg logit contributions
uck-0.016
ivered-0.015
burn-0.015
oo-0.014
 Wolf-0.013
Token bite
Token ID13197
Feature activation-7.78e-02

Pos logit contributions
uck+0.041
burn+0.037
ivered+0.036
 Wolf+0.036
 Cow+0.035
Neg logit contributions
 dice-0.041
 contain-0.039
ents-0.037
 else-0.036
 clearly-0.034
Token and
Token ID290
Feature activation-1.50e-01

Pos logit contributions
 dice+0.006
 contain+0.006
 else+0.005
ents+0.005
 clearly+0.005
Neg logit contributions
uck-0.007
ivered-0.007
burn-0.007
 Cow-0.006
 Wolf-0.006
Token said
Token ID531
Feature activation-1.17e+00

Pos logit contributions
 dice+0.011
 contain+0.010
 else+0.010
ents+0.010
 clearly+0.009
Neg logit contributions
uck-0.014
ivered-0.013
burn-0.013
 Wolf-0.013
oo-0.013
Token,
Token ID11
Feature activation+6.01e-01

Pos logit contributions
 else+0.087
 dice+0.087
 contain+0.082
ents+0.080
 clearly+0.072
Neg logit contributions
ivered-0.105
burn-0.104
uck-0.103
 Wolf-0.098
eled-0.098
Token "
Token ID366
Feature activation-4.19e-01

Pos logit contributions
uck+0.057
ivered+0.054
burn+0.053
P+0.051
 Wolf+0.050
Neg logit contributions
 dice-0.044
 contain-0.042
ents-0.040
 else-0.039
 clearly-0.037
TokenM
Token ID44
Feature activation+5.12e+00

Pos logit contributions
 dice+0.039
 contain+0.038
 else+0.037
ents+0.035
 clearly+0.034
Neg logit contributions
uck-0.028
 Wolf-0.028
ivered-0.027
 Cow-0.027
oo-0.027
Tokenmm
Token ID3020
Feature activation+3.79e+00

Pos logit contributions
 Wolf+0.340
burn+0.329
uck+0.326
ivered+0.324
 wolf+0.324
Neg logit contributions
ents-0.464
 dice-0.463
 contain-0.437
 else-0.431
ooters-0.413
Token,
Token ID11
Feature activation+5.75e-01

Pos logit contributions
uck+0.307
ivered+0.294
 Wolf+0.291
burn+0.289
 Cow+0.280
Neg logit contributions
 dice-0.314
 else-0.301
ents-0.295
 contain-0.286
 clearly-0.261
Token this
Token ID428
Feature activation-3.10e+00

Pos logit contributions
uck+0.044
burn+0.040
 Cow+0.039
 Wolf+0.039
oo+0.038
Neg logit contributions
 dice-0.052
 contain-0.050
ents-0.047
 else-0.045
ooters-0.044
Token cookie
Token ID19751
Feature activation+1.57e+00

Pos logit contributions
 dice+0.269
 contain+0.241
 else+0.240
ents+0.230
 clearly+0.213
Neg logit contributions
uck-0.253
ivered-0.241
 Wolf-0.233
burn-0.233
 Cow-0.232
Token is
Token ID318
Feature activation-4.36e-01

Pos logit contributions
uck+0.132
burn+0.118
ivered+0.116
 Wolf+0.112
 Cow+0.112
Neg logit contributions
 dice-0.136
 contain-0.131
ents-0.124
 else-0.119
 clearly-0.112
Token house
Token ID2156
Feature activation+1.23e+00

Pos logit contributions
 dice+0.044
 contain+0.042
ents+0.040
 else+0.039
 clearly+0.037
Neg logit contributions
uck-0.031
burn-0.028
ivered-0.027
 Wolf-0.027
 Cow-0.027
Token!"
Token ID2474
Feature activation+1.16e-01

Pos logit contributions
uck+0.092
burn+0.079
ivered+0.077
oo+0.077
 Wolf+0.077
Neg logit contributions
 dice-0.116
 contain-0.110
ents-0.103
 else-0.102
ooters-0.096
Token
Token ID220
Feature activation+5.73e-01

Pos logit contributions
uck+0.007
 Cow+0.006
 Wolf+0.006
burn+0.006
ivered+0.006
Neg logit contributions
 dice-0.012
 contain-0.012
 else-0.011
ents-0.011
 clearly-0.011
Token
Token ID198
Feature activation-3.01e-01

Pos logit contributions
uck+0.039
ivered+0.033
 Wolf+0.032
burn+0.032
 Cow+0.032
Neg logit contributions
 dice-0.060
 contain-0.058
ents-0.054
 else-0.054
 clearly-0.050
Token
Token ID198
Feature activation-2.94e-01

Pos logit contributions
 dice+0.032
 contain+0.031
ents+0.030
 else+0.029
 clearly+0.027
Neg logit contributions
uck-0.019
 Cow-0.016
 Wolf-0.016
oo-0.016
ivered-0.016
TokenThe
Token ID464
Feature activation-2.74e-02

Pos logit contributions
 dice+0.029
 contain+0.027
ents+0.026
 else+0.025
ows+0.025
Neg logit contributions
uck-0.020
P-0.019
ivered-0.018
 Cow-0.017
burn-0.017
Token cater
Token ID20825
Feature activation+1.20e+00

Pos logit contributions
 dice+0.002
 contain+0.002
ents+0.002
 else+0.002
 clearly+0.002
Neg logit contributions
uck-0.003
ivered-0.002
burn-0.002
oo-0.002
 Wolf-0.002
Tokenpillar
Token ID41643
Feature activation-1.49e-02

Pos logit contributions
uck+0.077
burn+0.069
ivered+0.068
 Wolf+0.066
 Cow+0.065
Neg logit contributions
 dice-0.126
 contain-0.120
ents-0.115
 else-0.115
 clearly-0.107
Token was
Token ID373
Feature activation+2.80e-01

Pos logit contributions
 dice+0.001
 contain+0.001
ents+0.001
 else+0.001
ooters+0.001
Neg logit contributions
uck-0.001
ivered-0.001
 Cow-0.001
 Wolf-0.001
burn-0.001
Token so
Token ID523
Feature activation-1.36e+00

Pos logit contributions
uck+0.025
ivered+0.023
burn+0.023
 Wolf+0.022
 Cow+0.022
Neg logit contributions
 dice-0.023
 contain-0.021
ents-0.020
 else-0.020
ooters-0.018
Token happy
Token ID3772
Feature activation-1.59e+00

Pos logit contributions
 dice+0.124
 contain+0.112
ents+0.108
 else+0.102
ooters+0.097
Neg logit contributions
uck-0.112
ivered-0.102
oo-0.098
 Cow-0.097
 Wolf-0.096
Token
Token ID198
Feature activation-4.93e-01

Pos logit contributions
 dice+0.025
ents+0.023
 contain+0.023
 else+0.023
 clearly+0.021
Neg logit contributions
uck-0.021
burn-0.021
ivered-0.020
 Wolf-0.019
eled-0.019
Token
Token ID198
Feature activation-4.94e-01

Pos logit contributions
 dice+0.052
 contain+0.049
ents+0.047
 else+0.046
 clearly+0.043
Neg logit contributions
uck-0.033
burn-0.028
ivered-0.028
 Cow-0.028
 Wolf-0.028
TokenBen
Token ID11696
Feature activation+5.62e-01

Pos logit contributions
 dice+0.052
 contain+0.049
 else+0.048
ents+0.048
 clearly+0.044
Neg logit contributions
uck-0.032
burn-0.029
ivered-0.029
 Wolf-0.028
 Cow-0.027
Token looked
Token ID3114
Feature activation-8.29e-02

Pos logit contributions
uck+0.044
ivered+0.040
burn+0.039
 Wolf+0.038
 Cow+0.038
Neg logit contributions
 dice-0.052
 contain-0.049
ents-0.046
 else-0.045
ooters-0.042
Token for
Token ID329
Feature activation-5.99e-01

Pos logit contributions
 dice+0.009
 contain+0.009
ents+0.008
 else+0.008
ooters+0.008
Neg logit contributions
uck-0.005
ivered-0.004
 Wolf-0.004
 wolf-0.004
 Cow-0.004
Token the
Token ID262
Feature activation-4.84e-01

Pos logit contributions
 dice+0.060
 contain+0.057
ents+0.055
 else+0.053
ooters+0.050
Neg logit contributions
uck-0.042
 Wolf-0.037
 Cow-0.037
burn-0.037
oo-0.036
Token button
Token ID4936
Feature activation-4.41e+00

Pos logit contributions
ents+0.043
 dice+0.043
 contain+0.042
 else+0.037
ooters+0.036
Neg logit contributions
uck-0.037
burn-0.034
 Wolf-0.034
 wolf-0.033
 fever-0.033
Token.
Token ID13
Feature activation-2.95e-01

Pos logit contributions
 dice+0.474
 contain+0.471
ents+0.421
inations+0.417
rig+0.408
Neg logit contributions
uck-0.254
burn-0.217
 Wolf-0.207
oo-0.198
 Cow-0.194
Token He
Token ID679
Feature activation+4.86e-01

Pos logit contributions
 dice+0.031
 contain+0.029
ents+0.028
 else+0.028
 clearly+0.026
Neg logit contributions
uck-0.020
burn-0.017
ivered-0.017
 Wolf-0.017
 Cow-0.017
Token saw
Token ID2497
Feature activation+8.19e-01

Pos logit contributions
uck+0.031
 Cow+0.029
 Wolf+0.029
eled+0.028
ivered+0.028
Neg logit contributions
 dice-0.049
 contain-0.049
ooters-0.043
hest-0.042
ents-0.042
Token it
Token ID340
Feature activation+8.52e-01

Pos logit contributions
uck+0.052
 Wolf+0.045
 Cow+0.044
 Ow+0.043
burn+0.043
Neg logit contributions
 contain-0.086
 dice-0.085
ents-0.078
ooters-0.076
inations-0.075
Token to
Token ID284
Feature activation-6.76e-02

Pos logit contributions
 contain+0.272
 dice+0.269
inations+0.248
ents+0.242
 else+0.240
Neg logit contributions
uck-0.207
 Ow-0.166
P-0.154
burn-0.154
 Cow-0.153
Token Tim
Token ID5045
Feature activation-4.20e-01

Pos logit contributions
 dice+0.006
 contain+0.006
 else+0.006
ents+0.006
 clearly+0.005
Neg logit contributions
uck-0.005
ivered-0.005
burn-0.005
 Wolf-0.005
 fever-0.004
Tokenmy
Token ID1820
Feature activation+8.29e-01

Pos logit contributions
 dice+0.044
 contain+0.042
 else+0.039
ents+0.039
 clearly+0.037
Neg logit contributions
uck-0.028
burn-0.025
ivered-0.024
 Cow-0.024
 Wolf-0.023
Token and
Token ID290
Feature activation-9.99e-02

Pos logit contributions
uck+0.059
burn+0.049
 Cow+0.049
 Wolf+0.048
ivered+0.047
Neg logit contributions
 dice-0.085
 contain-0.079
ents-0.072
 else-0.072
ooters-0.070
Token he
Token ID339
Feature activation-1.51e+00

Pos logit contributions
 dice+0.009
 contain+0.008
ents+0.008
 else+0.007
ooters+0.007
Neg logit contributions
uck-0.008
burn-0.008
 Cow-0.008
 Wolf-0.007
ivered-0.007
Token grabbed
Token ID13176
Feature activation-1.67e+00

Pos logit contributions
 dice+0.140
 contain+0.135
ooters+0.122
ents+0.120
hest+0.114
Neg logit contributions
uck-0.119
 Cow-0.102
ivered-0.101
burn-0.099
oo-0.098
Token it
Token ID340
Feature activation+1.05e+00

Pos logit contributions
 dice+0.159
 contain+0.151
ents+0.144
 else+0.143
 clearly+0.132
Neg logit contributions
uck-0.125
burn-0.113
ivered-0.112
 Wolf-0.109
 Cow-0.109
Token tightly
Token ID17707
Feature activation+1.36e-01

Pos logit contributions
uck+0.072
burn+0.062
 Cow+0.058
 Wolf+0.058
oo+0.057
Neg logit contributions
 dice-0.108
 contain-0.101
ents-0.094
 else-0.094
inations-0.091
Token.
Token ID13
Feature activation-4.76e-02

Pos logit contributions
uck+0.011
burn+0.010
ivered+0.009
 Wolf+0.009
 Cow+0.009
Neg logit contributions
 dice-0.013
 contain-0.012
ents-0.011
 else-0.011
ooters-0.010
Token "
Token ID366
Feature activation-3.78e-01

Pos logit contributions
 dice+0.005
 contain+0.004
ents+0.004
 else+0.004
 clearly+0.004
Neg logit contributions
uck-0.004
burn-0.003
ivered-0.003
 Wolf-0.003
 Cow-0.003
TokenIt
Token ID1026
Feature activation+3.54e+00

Pos logit contributions
 dice+0.038
 contain+0.037
 else+0.036
ents+0.034
 clearly+0.032
Neg logit contributions
uck-0.025
ivered-0.024
burn-0.023
 Wolf-0.023
 Cow-0.023
Token sees
Token ID7224
Feature activation-1.64e+00

Pos logit contributions
uck+0.160
 Wolf+0.138
 Cow+0.137
burn+0.135
oo+0.133
Neg logit contributions
 contain-0.154
 dice-0.152
ents-0.132
 else-0.130
hest-0.122
Token the
Token ID262
Feature activation-3.34e-01

Pos logit contributions
 dice+0.147
 contain+0.144
ents+0.129
 else+0.126
ooters+0.123
Neg logit contributions
uck-0.132
burn-0.117
 Cow-0.115
 Wolf-0.115
 Ow-0.113
Token gum
Token ID27142
Feature activation+3.94e+00

Pos logit contributions
 dice+0.032
 else+0.028
ents+0.028
 contain+0.027
 clearly+0.025
Neg logit contributions
uck-0.025
ivered-0.024
 Wolf-0.024
 Cow-0.024
burn-0.023
Token in
Token ID287
Feature activation-4.59e-01

Pos logit contributions
uck+0.273
burn+0.239
ivered+0.234
 Wolf+0.224
oo+0.222
Neg logit contributions
 dice-0.398
 contain-0.387
ents-0.364
 else-0.357
 clearly-0.333
Token Anna
Token ID11735
Feature activation-4.15e+00

Pos logit contributions
 dice+0.035
 contain+0.031
ents+0.030
 else+0.030
 clearly+0.028
Neg logit contributions
uck-0.044
ivered-0.042
burn-0.040
oo-0.038
ivering-0.038
Token's
Token ID338
Feature activation-2.63e+00

Pos logit contributions
 dice+0.288
 contain+0.264
ents+0.257
 else+0.254
 clearly+0.223
Neg logit contributions
uck-0.408
burn-0.386
ivered-0.382
 Cow-0.378
 Wolf-0.377
Token hair
Token ID4190
Feature activation+1.44e+00

Pos logit contributions
 dice+0.261
 contain+0.242
ents+0.236
 else+0.234
 wins+0.211
Neg logit contributions
uck-0.187
burn-0.176
ivered-0.171
 Wolf-0.166
 Cow-0.165
Token.
Token ID13
Feature activation+1.84e-01

Pos logit contributions
uck+0.127
burn+0.118
 Wolf+0.117
ivered+0.117
 Cow+0.116
Neg logit contributions
 dice-0.115
 contain-0.107
ents-0.104
 else-0.102
 clearly-0.094
Token She
Token ID1375
Feature activation+1.80e+00

Pos logit contributions
burn+0.014
ivered+0.014
uck+0.013
eled+0.013
oo+0.012
Neg logit contributions
 dice-0.017
 else-0.016
ents-0.015
 contain-0.015
 clearly-0.015
Token is
Token ID318
Feature activation-2.06e-01

Pos logit contributions
uck+0.153
 Wolf+0.129
ivered+0.129
 Cow+0.129
burn+0.128
Neg logit contributions
 contain-0.158
 dice-0.156
ents-0.136
 else-0.135
ooters-0.127
Token not
Token ID407
Feature activation+8.60e-01

Pos logit contributions
 dice+0.017
 else+0.016
 contain+0.016
ents+0.015
 clearly+0.014
Neg logit contributions
uck-0.018
burn-0.017
ivered-0.017
 Wolf-0.016
 Cow-0.016
Token story
Token ID1621
Feature activation+3.15e+00

Pos logit contributions
 dice+0.020
 contain+0.019
ents+0.018
 else+0.018
 clearly+0.016
Neg logit contributions
uck-0.012
ivered-0.011
burn-0.011
 Cow-0.010
 Wolf-0.010
Token is
Token ID318
Feature activation-2.25e-01

Pos logit contributions
uck+0.206
oo+0.155
 Wolf+0.154
ivered+0.152
 Ow+0.144
Neg logit contributions
 contain-0.341
 dice-0.337
ents-0.323
inations-0.300
 else-0.297
Token that
Token ID326
Feature activation+2.30e+00

Pos logit contributions
 dice+0.020
 contain+0.019
ents+0.018
 else+0.018
 clearly+0.016
Neg logit contributions
uck-0.019
burn-0.017
ivered-0.017
 Wolf-0.016
 Cow-0.016
Token sometimes
Token ID3360
Feature activation+3.59e-01

Pos logit contributions
uck+0.178
burn+0.161
 Wolf+0.158
 Cow+0.156
ivered+0.155
Neg logit contributions
 dice-0.214
 contain-0.206
ents-0.194
 else-0.186
 clearly-0.173
Token things
Token ID1243
Feature activation+1.35e+00

Pos logit contributions
uck+0.028
burn+0.026
ivered+0.026
 Wolf+0.025
 Cow+0.025
Neg logit contributions
 dice-0.033
 contain-0.031
ents-0.029
 else-0.029
 clearly-0.027
Token are
Token ID389
Feature activation-1.57e+00

Pos logit contributions
uck+0.113
burn+0.099
 Wolf+0.097
 Cow+0.096
oo+0.095
Neg logit contributions
 dice-0.116
 contain-0.104
ents-0.104
 1-0.089
 else-0.089
Token hard
Token ID1327
Feature activation-8.61e-01

Pos logit contributions
 dice+0.157
 contain+0.150
ents+0.139
 else+0.132
 Isabel+0.127
Neg logit contributions
uck-0.103
 Cow-0.099
burn-0.098
 Wolf-0.097
 suffering-0.093
Token,
Token ID11
Feature activation+7.59e-01

Pos logit contributions
 dice+0.079
 contain+0.076
ents+0.071
 else+0.070
 clearly+0.064
Neg logit contributions
uck-0.068
burn-0.060
ivered-0.059
 Wolf-0.059
 Cow-0.058
Token but
Token ID475
Feature activation-1.61e+00

Pos logit contributions
uck+0.072
burn+0.067
 Wolf+0.066
 Cow+0.066
ivered+0.064
Neg logit contributions
 dice-0.055
 contain-0.052
ents-0.048
 else-0.045
ows-0.041
Token if
Token ID611
Feature activation-7.95e-01

Pos logit contributions
 dice+0.142
 contain+0.134
ents+0.129
 else+0.122
ows+0.114
Neg logit contributions
uck-0.133
burn-0.119
 Wolf-0.118
 Cow-0.117
oo-0.114
Token you
Token ID345
Feature activation-1.35e-01

Pos logit contributions
 dice+0.061
 contain+0.056
 else+0.055
ents+0.053
 clearly+0.048
Neg logit contributions
uck-0.074
ivered-0.070
burn-0.069
oo-0.065
 Cow-0.065
Token heard
Token ID2982
Feature activation+1.95e+00

Pos logit contributions
uck+0.103
ivered+0.088
 Wolf+0.086
 Cow+0.086
oo+0.083
Neg logit contributions
 dice-0.166
 contain-0.160
ents-0.151
 else-0.143
ooters-0.138
Token something
Token ID1223
Feature activation+3.17e+00

Pos logit contributions
uck+0.122
burn+0.105
 Wolf+0.105
 Cow+0.105
ivered+0.104
Neg logit contributions
 dice-0.210
 contain-0.198
ents-0.187
 else-0.184
 wins-0.173
Token that
Token ID326
Feature activation+2.16e+00

Pos logit contributions
uck+0.220
 wolf+0.208
 Wolf+0.203
oo+0.201
 Cow+0.200
Neg logit contributions
 dice-0.321
ents-0.286
 contain-0.277
 wins-0.265
inations-0.248
Token made
Token ID925
Feature activation+3.65e+00

Pos logit contributions
uck+0.151
 Cow+0.136
 Wolf+0.134
burn+0.130
ivered+0.129
Neg logit contributions
 dice-0.220
ents-0.202
 contain-0.200
 else-0.179
ooters-0.179
Token her
Token ID607
Feature activation+8.98e-01

Pos logit contributions
uck+0.227
burn+0.197
ivered+0.194
 Wolf+0.191
 Cow+0.191
Neg logit contributions
 dice-0.397
 contain-0.376
ents-0.360
 else-0.355
 clearly-0.331
Token stop
Token ID2245
Feature activation-9.48e-01

Pos logit contributions
uck+0.064
burn+0.058
ivered+0.057
 Wolf+0.056
 Cow+0.056
Neg logit contributions
 dice-0.090
 contain-0.082
ents-0.081
 else-0.078
 clearly-0.072
Token and
Token ID290
Feature activation-1.34e-01

Pos logit contributions
 dice+0.091
 contain+0.082
ents+0.077
 else+0.076
ooters+0.070
Neg logit contributions
uck-0.073
burn-0.067
ivered-0.065
 Cow-0.064
 Wolf-0.064
Token listen
Token ID6004
Feature activation+2.69e+00

Pos logit contributions
 dice+0.013
 contain+0.012
ents+0.011
 else+0.010
hest+0.010
Neg logit contributions
uck-0.010
 Wolf-0.009
 Cow-0.009
burn-0.009
ivered-0.009
Token.
Token ID13
Feature activation-1.25e-01

Pos logit contributions
uck+0.160
oo+0.133
 Wolf+0.131
 Cow+0.128
burn+0.127
Neg logit contributions
 dice-0.299
 contain-0.263
ents-0.258
ooters-0.240
 else-0.238
Token Was
Token ID8920
Feature activation-1.72e-01

Pos logit contributions
 dice+0.013
 contain+0.012
ents+0.012
 else+0.012
 clearly+0.011
Neg logit contributions
uck-0.008
burn-0.008
ivered-0.008
 Cow-0.007
 Wolf-0.007
Token it
Token ID340
Feature activation+1.04e+00

Pos logit contributions
 dice+0.017
 contain+0.016
ents+0.014
 else+0.013
 wins+0.013
Neg logit contributions
uck-0.013
oo-0.012
 Wolf-0.012
 Cow-0.012
burn-0.011
Token arrow
Token ID15452
Feature activation-3.81e+00

Pos logit contributions
 dice+0.098
 contain+0.097
ents+0.090
 else+0.088
inations+0.085
Neg logit contributions
uck-0.068
burn-0.060
 Wolf-0.060
 Cow-0.058
 wolf-0.057
Token.
Token ID13
Feature activation-1.11e-01

Pos logit contributions
 dice+0.380
 contain+0.354
 else+0.328
ents+0.327
ooters+0.313
Neg logit contributions
uck-0.271
burn-0.242
ivered-0.220
 Wolf-0.214
 wolf-0.212
Token "
Token ID366
Feature activation-4.49e-01

Pos logit contributions
 dice+0.008
 contain+0.007
ents+0.007
ooters+0.006
cases+0.006
Neg logit contributions
uck-0.010
ivered-0.010
burn-0.010
P-0.010
ivering-0.010
TokenThat
Token ID2504
Feature activation+1.82e+00

Pos logit contributions
 dice+0.048
 contain+0.046
ents+0.044
 else+0.044
 clearly+0.041
Neg logit contributions
uck-0.028
burn-0.025
ivered-0.025
 Wolf-0.024
 Cow-0.024
Token's
Token ID338
Feature activation-2.79e+00

Pos logit contributions
uck+0.158
burn+0.144
ivered+0.140
 Wolf+0.139
 Cow+0.138
Neg logit contributions
 dice-0.152
 contain-0.147
ents-0.138
 else-0.134
 clearly-0.124
Token awesome
Token ID7427
Feature activation-7.48e+00

Pos logit contributions
 dice+0.256
 contain+0.245
ents+0.236
 else+0.224
ooters+0.211
Neg logit contributions
uck-0.212
 Cow-0.191
burn-0.190
 Wolf-0.188
oo-0.188
Token!"
Token ID2474
Feature activation+7.11e-02

Pos logit contributions
 dice+0.543
 contain+0.469
 else+0.467
ents+0.465
 wins+0.418
Neg logit contributions
uck-0.715
ivered-0.685
burn-0.667
 Cow-0.651
 Wolf-0.644
Token Billy
Token ID15890
Feature activation+3.81e+00

Pos logit contributions
uck+0.005
ivered+0.005
burn+0.004
P+0.004
 fever+0.004
Neg logit contributions
 dice-0.007
 contain-0.007
ents-0.007
 else-0.006
 clearly-0.006
Token said
Token ID531
Feature activation-1.19e+00

Pos logit contributions
uck+0.325
ivered+0.294
burn+0.294
 Cow+0.290
 Wolf+0.286
Neg logit contributions
 dice-0.325
 contain-0.305
ents-0.283
 else-0.279
 clearly-0.258
Token,
Token ID11
Feature activation+4.89e-01

Pos logit contributions
 dice+0.098
 contain+0.097
 else+0.092
ents+0.088
 clearly+0.082
Neg logit contributions
uck-0.099
ivered-0.093
burn-0.093
 Wolf-0.089
 Cow-0.089
Token "
Token ID366
Feature activation-4.69e-01

Pos logit contributions
uck+0.045
ivered+0.042
burn+0.041
P+0.039
 Wolf+0.039
Neg logit contributions
 dice-0.038
 contain-0.037
ents-0.035
 else-0.034
 clearly-0.032
Token book
Token ID1492
Feature activation-2.98e+00

Pos logit contributions
 contain+0.563
ents+0.540
 dice+0.513
ooters+0.497
inations+0.497
Neg logit contributions
uck-0.434
 self-0.381
 wolf-0.378
P-0.375
 sausage-0.365
Token.
Token ID13
Feature activation-2.42e-02

Pos logit contributions
 dice+0.319
 contain+0.301
inations+0.288
ents+0.273
 else+0.271
Neg logit contributions
uck-0.166
burn-0.128
over-0.125
oo-0.122
 Wolf-0.120
Token He
Token ID679
Feature activation+6.37e-01

Pos logit contributions
 dice+0.002
 contain+0.002
 else+0.002
ents+0.002
 clearly+0.002
Neg logit contributions
uck-0.002
burn-0.001
 Wolf-0.001
 Cow-0.001
ivered-0.001
Token had
Token ID550
Feature activation-1.48e+00

Pos logit contributions
uck+0.035
eled+0.030
P+0.028
aged+0.028
 self+0.028
Neg logit contributions
 contain-0.069
ooters-0.065
 dice-0.065
hest-0.060
 outage-0.060
Token written
Token ID3194
Feature activation-5.54e+00

Pos logit contributions
 contain+0.152
 dice+0.149
rig+0.136
ows+0.133
 else+0.133
Neg logit contributions
uck-0.081
P-0.072
 Ow-0.072
 Wolf-0.069
burn-0.069
Token 12
Token ID1105
Feature activation-7.49e+00

Pos logit contributions
 contain+0.597
 dice+0.583
rig+0.532
inations+0.529
ents+0.526
Neg logit contributions
uck-0.331
 Wolf-0.281
 Cow-0.273
oo-0.267
burn-0.263
Token and
Token ID290
Feature activation-1.94e-01

Pos logit contributions
 contain+0.653
 dice+0.583
ooters+0.544
nes+0.534
 believe+0.531
Neg logit contributions
uck-0.593
oo-0.521
over-0.503
ivered-0.495
 Wolf-0.493
Token 15
Token ID1315
Feature activation-5.97e+00

Pos logit contributions
 contain+0.017
 dice+0.017
ents+0.015
ooters+0.015
rig+0.015
Neg logit contributions
uck-0.014
 Wolf-0.012
 self-0.012
 Cow-0.012
aged-0.012
Token on
Token ID319
Feature activation+2.89e+00

Pos logit contributions
 contain+0.564
 dice+0.528
ooters+0.480
rig+0.463
 else+0.458
Neg logit contributions
uck-0.410
over-0.368
oo-0.356
ivered-0.350
aged-0.350
Token one
Token ID530
Feature activation+9.05e-02

Pos logit contributions
uck+0.208
burn+0.186
ivered+0.184
 Wolf+0.180
 Cow+0.180
Neg logit contributions
 dice-0.281
 contain-0.270
ents-0.256
 else-0.254
 clearly-0.234
Token line
Token ID1627
Feature activation-1.80e+00

Pos logit contributions
uck+0.006
over+0.005
oo+0.005
 Wolf+0.005
 lung+0.005
Neg logit contributions
 dice-0.008
 contain-0.008
ents-0.008
nes-0.008
ooters-0.007
Token was
Token ID373
Feature activation+1.73e-01

Pos logit contributions
uck+0.220
burn+0.199
ivered+0.197
 Wolf+0.190
 Cow+0.190
Neg logit contributions
 dice-0.279
 contain-0.265
ents-0.252
 else-0.251
 clearly-0.232
Token a
Token ID257
Feature activation+4.30e-01

Pos logit contributions
uck+0.012
ivered+0.012
burn+0.012
 Wolf+0.011
 Cow+0.011
Neg logit contributions
 dice-0.017
 contain-0.016
 else-0.015
ents-0.015
 clearly-0.013
Token little
Token ID1310
Feature activation+2.89e+00

Pos logit contributions
uck+0.027
burn+0.024
ivered+0.024
 Wolf+0.023
 Cow+0.023
Neg logit contributions
 dice-0.046
 contain-0.044
ents-0.042
 else-0.042
 clearly-0.039
Token girl
Token ID2576
Feature activation+1.62e+00

Pos logit contributions
uck+0.186
ivered+0.171
burn+0.164
 Wolf+0.156
 Cow+0.154
Neg logit contributions
 dice-0.305
 contain-0.286
 else-0.277
ents-0.275
 clearly-0.252
Token named
Token ID3706
Feature activation+1.69e+00

Pos logit contributions
uck+0.098
burn+0.086
ivered+0.084
 Cow+0.082
 Wolf+0.082
Neg logit contributions
 dice-0.178
 contain-0.170
ents-0.162
 else-0.161
 clearly-0.151
Token Anne
Token ID15397
Feature activation-6.77e+00

Pos logit contributions
uck+0.132
burn+0.115
oo+0.114
 Cow+0.114
 Wolf+0.110
Neg logit contributions
 dice-0.160
 contain-0.152
 else-0.140
ents-0.139
 levers-0.132
Token.
Token ID13
Feature activation-3.66e-01

Pos logit contributions
 dice+0.529
 contain+0.508
 else+0.491
ents+0.477
 clearly+0.441
Neg logit contributions
uck-0.612
burn-0.553
ivered-0.552
 Wolf-0.541
 Cow-0.537
Token One
Token ID1881
Feature activation-1.07e+00

Pos logit contributions
 dice+0.038
 contain+0.036
ents+0.034
 else+0.034
ooters+0.031
Neg logit contributions
uck-0.024
burn-0.022
 Wolf-0.021
ivered-0.021
 Cow-0.021
Token day
Token ID1110
Feature activation+5.72e-01

Pos logit contributions
 dice+0.091
 contain+0.090
 else+0.087
ents+0.078
 clearly+0.077
Neg logit contributions
uck-0.085
burn-0.079
ivered-0.078
 Wolf-0.076
 Cow-0.076
Token she
Token ID673
Feature activation-1.64e+00

Pos logit contributions
uck+0.046
ivered+0.043
burn+0.042
 Cow+0.040
 Wolf+0.039
Neg logit contributions
 dice-0.051
 contain-0.047
 else-0.047
ents-0.045
 clearly-0.042
Token was
Token ID373
Feature activation+3.13e-01

Pos logit contributions
 dice+0.143
 else+0.135
 contain+0.133
ents+0.124
 clearly+0.118
Neg logit contributions
uck-0.131
burn-0.122
ivered-0.121
 Wolf-0.120
 Cow-0.118
Token Daisy
Token ID40355
Feature activation+1.76e+00

Pos logit contributions
 dice+0.049
 contain+0.046
 else+0.045
ents+0.045
 clearly+0.041
Neg logit contributions
uck-0.038
ivered-0.035
burn-0.034
 Wolf-0.032
 Cow-0.032
Token!"
Token ID2474
Feature activation-1.64e-01

Pos logit contributions
uck+0.148
ivered+0.135
burn+0.134
 Wolf+0.129
 Cow+0.128
Neg logit contributions
 dice-0.149
 contain-0.141
ents-0.135
 else-0.134
 clearly-0.122
Token\n
Token ID198
Feature activation-6.24e-01

Pos logit contributions
 dice+0.018
 contain+0.017
ents+0.016
 else+0.016
 clearly+0.015
Neg logit contributions
uck-0.011
 Cow-0.009
oo-0.009
burn-0.009
 Wolf-0.009
Token\n
Token ID198
Feature activation-6.19e-01

Pos logit contributions
 dice+0.066
 contain+0.064
ents+0.060
 else+0.059
 clearly+0.055
Neg logit contributions
uck-0.041
oo-0.035
 Cow-0.035
 Wolf-0.035
burn-0.035
TokenJ
Token ID41
Feature activation-2.72e+00

Pos logit contributions
 dice+0.067
 contain+0.065
ents+0.061
 else+0.058
 clearly+0.056
Neg logit contributions
uck-0.041
P-0.035
oo-0.034
 Cow-0.033
 Wolf-0.032
Tokenas
Token ID292
Feature activation-8.18e+00

Pos logit contributions
 dice+0.214
 contain+0.211
 else+0.186
 count+0.168
 clearly+0.167
Neg logit contributions
uck-0.268
oo-0.243
ivered-0.238
burn-0.231
P-0.217
Tokenmine
Token ID3810
Feature activation-9.62e-02

Pos logit contributions
 dice+0.682
 contain+0.655
ents+0.610
 else+0.606
 clearly+0.557
Neg logit contributions
uck-0.698
burn-0.640
ivered-0.631
 Wolf-0.619
 Cow-0.616
Token:
Token ID25
Feature activation-4.72e+00

Pos logit contributions
 dice+0.009
 contain+0.008
ents+0.007
 else+0.007
ooters+0.007
Neg logit contributions
uck-0.008
ivered-0.007
 Cow-0.007
 Wolf-0.007
oo-0.007
Token "
Token ID366
Feature activation-4.96e-01

Pos logit contributions
 dice+0.358
 contain+0.325
 else+0.320
ents+0.304
 clearly+0.300
Neg logit contributions
uck-0.434
ivered-0.416
burn-0.403
 fever-0.381
 Wolf-0.376
TokenHi
Token ID17250
Feature activation+2.94e+00

Pos logit contributions
 dice+0.052
 contain+0.050
 else+0.047
ents+0.047
 clearly+0.044
Neg logit contributions
uck-0.032
ivered-0.029
burn-0.028
 Wolf-0.028
 Cow-0.027
Token Daisy
Token ID40355
Feature activation+1.72e+00

Pos logit contributions
uck+0.216
ivered+0.206
burn+0.196
 fever+0.185
eled+0.184
Neg logit contributions
 dice-0.277
 else-0.255
 contain-0.253
ents-0.251
 clearly-0.229
Token!
Token ID0
Feature activation+2.05e-01

Pos logit contributions
 contain+1.087
 dice+1.045
 believe+0.955
rig+0.952
nes+0.942
Neg logit contributions
uck-0.499
oo-0.394
 Wolf-0.392
 Cow-0.390
 Ow-0.384
Token He
Token ID679
Feature activation+6.67e-01

Pos logit contributions
uck+0.013
 Wolf+0.012
 Cow+0.012
burn+0.012
 Ow+0.011
Neg logit contributions
 dice-0.021
 contain-0.021
ents-0.020
 else-0.019
 clearly-0.018
Token often
Token ID1690
Feature activation-8.80e-01

Pos logit contributions
uck+0.037
P+0.035
eled+0.034
 Wolf+0.033
 Cow+0.032
Neg logit contributions
 contain-0.070
 dice-0.067
ents-0.064
rig-0.062
ooters-0.061
Token counted
Token ID14789
Feature activation-9.71e+00

Pos logit contributions
 dice+0.084
 contain+0.080
ents+0.077
 else+0.073
hest+0.071
Neg logit contributions
uck-0.062
burn-0.056
ivered-0.055
 Cow-0.054
 Wolf-0.053
Token the
Token ID262
Feature activation-2.40e-01

Pos logit contributions
 contain+1.029
 dice+1.004
 else+0.912
 believe+0.892
inations+0.878
Neg logit contributions
uck-0.621
burn-0.505
 Cow-0.504
oo-0.500
 Wolf-0.493
Token stairs
Token ID16046
Feature activation-8.31e+00

Pos logit contributions
 contain+0.021
ents+0.020
 dice+0.019
 else+0.019
inations+0.018
Neg logit contributions
uck-0.017
 wolf-0.016
 Ow-0.016
 Wolf-0.016
 fever-0.016
Token as
Token ID355
Feature activation-3.45e+00

Pos logit contributions
 dice+0.864
 contain+0.821
ents+0.797
ooters+0.778
inations+0.770
Neg logit contributions
uck-0.512
ivered-0.390
oo-0.389
burn-0.376
 Wolf-0.363
Token he
Token ID339
Feature activation-1.47e+00

Pos logit contributions
 contain+0.338
 dice+0.313
ents+0.290
rig+0.278
 believe+0.277
Neg logit contributions
uck-0.240
 Wolf-0.224
 Cow-0.223
oo-0.217
burn-0.214
Token climbed
Token ID19952
Feature activation-2.94e+00

Pos logit contributions
 contain+0.120
 dice+0.114
ooters+0.101
ents+0.098
rig+0.096
Neg logit contributions
uck-0.130
eled-0.114
ivered-0.113
P-0.112
aged-0.108
Token them
Token ID606
Feature activation+2.75e-01

Pos logit contributions
 contain+0.336
 dice+0.316
 else+0.314
inations+0.311
 believe+0.302
Neg logit contributions
uck-0.159
 Wolf-0.129
burn-0.123
 Cow-0.122
 wolf-0.122
Token,
Token ID11
Feature activation+6.25e-01

Pos logit contributions
uck+0.014
aged+0.009
oo+0.008
 Wolf+0.008
burn+0.008
Neg logit contributions
 dice-0.032
 contain-0.030
inations-0.030
 else-0.028
ents-0.028