TOP ACTIVATIONS
MAX = 16.746

He used it to make waves and bubbles . He saw
runs to the fence and waves at the bull . The
them could really ride the waves , but they all had
learned that just because someone waves at you , it does
spend the day chasing the waves . Mom my
water . They saw the waves and the shells . They
face , and watching the waves . One
jog with Rex . Luna waves to them . She finishes
, he ran into big waves . The waves kept spl
the cat . The cat waves back with its tail .
She liked to ride the waves and feel the wind and
he could , but the waves kept getting higher and higher
tub . They make some waves with their hands . They
in the water . The waves felt gentle as they rolled
hand as they watched the waves crashing against the shore .
The boat floated across the waves . It passed lots of
picks up a stick and waves it . She makes a
a great afternoon watching the waves and the attractive ocean .
jump on it and make waves . They laugh and have
gentle seal playing in the waves . Jimmy was mesmer ized

MIN ACTIVATIONS
MIN = -16.782

saw that Mr . Wh isk ers was lost and she
family sh oo ed Wh isk ers away and told Max
a long time . Wh isk ers learned that sometimes it
wise old cat named Wh isk ers . Wh isk ers
scared . Mr . Wh isk ers said , " Don
Wh isk ers . Wh isk ers looked at Spot and
. After Wh isk ers got better , he
, and he and Wh isk ers played with the new
I 'm sorry , Wh isk ers . Let 's go
she could and pushed Wh isk ers out of the way
her pet cat , Wh isk ers . Lily loved to
a lazy cat named Wh isk ers . Wh isk ers
closed . Wh isk ers asked his friend ,
would heal soon . Wh isk ers was very happy to
lock on it . Wh isk ers tried to open the
Ch ir py and Wh isk ers became good friends ,
was heading straight towards Wh isk ers . She ran as
. One day , Wh isk ers saw a can of
One sunny day , Wh isk ers went for a walk
and held tight . Wh isk ers pushed him gently ,

INTERVAL 8.364 - 16.746
CONTAINS 0.443%

delight . M agg ie 's mum came out
" Sorry , sorry , sorry ." Their moms
tree heard them and felt sorry . It moved a little
and waited in line . Everyone was ready to pay ,
said , " I 'm sorry , my loves . I

INTERVAL -0.018 - 8.364
CONTAINS 49.189%

eyes and was looking at him curiously . M ummy smiled
cl apped and cheered for them . The band felt so
Look , mom ! The sunshine is back ! Can we
. The tiger was very hungry and wanted to eat something
One day Molly found a telephone on the

INTERVAL -8.400 - -0.018
CONTAINS 49.245%

. Joe wanted to check on the pie , but he
to hop back home , but his paw hurt too much
take things . Now , give it back to me before
their ice - cream . It is cold and sweet .
. But one day , a little pin got stuck in

INTERVAL -16.782 - -8.400
CONTAINS 1.124%

was . " Are we there yet ?" Lily
v ase inside . The v ase was very fragile and
around the village looking for cel ery to catch . One
was for dinner . Tim my guessed it was pizza ,
a hammer and a screw driver on a shelf .
Token He
Token ID679
Feature activation+2.30e+00

Pos logit contributions
 floats+0.127
 waves+0.127
 screams+0.122
ls+0.117
 cheers+0.117
Neg logit contributions
isk-0.105
ety-0.102
my-0.099
 whether-0.098
coat-0.093
Token used
Token ID973
Feature activation-2.79e+00

Pos logit contributions
 floats+0.143
 ourselves+0.142
 waves+0.136
ls+0.135
 screams+0.134
Neg logit contributions
my-0.108
 whether-0.100
isk-0.100
ety-0.099
coat-0.089
Token it
Token ID340
Feature activation-5.90e-01

Pos logit contributions
isk+0.165
ety+0.163
my+0.155
 whether+0.152
tr+0.148
Neg logit contributions
 waves-0.150
 floats-0.145
 screams-0.141
ls-0.135
 cheers-0.134
Token to
Token ID284
Feature activation-2.51e-01

Pos logit contributions
isk+0.028
ety+0.028
my+0.026
 whether+0.025
coat+0.024
Neg logit contributions
 waves-0.038
 floats-0.036
 screams-0.036
ls-0.036
 sinks-0.034
Token make
Token ID787
Feature activation-2.79e+00

Pos logit contributions
isk+0.014
ety+0.013
my+0.013
 whether+0.013
coat+0.012
Neg logit contributions
 floats-0.014
 waves-0.014
 screams-0.014
ls-0.013
 sinks-0.013
Token waves
Token ID9813
Feature activation+1.33e+01

Pos logit contributions
isk+0.150
my+0.149
 whether+0.148
ety+0.142
coat+0.129
Neg logit contributions
 floats-0.153
 waves-0.146
ls-0.143
 screams-0.141
 sinks-0.137
Token and
Token ID290
Feature activation+1.08e+00

Pos logit contributions
 floats+0.854
 screams+0.842
 waves+0.840
ls+0.819
 sinks+0.800
Neg logit contributions
my-0.593
isk-0.574
ety-0.572
 whether-0.569
coat-0.521
Token bubbles
Token ID25037
Feature activation+6.04e+00

Pos logit contributions
ls+0.050
 floats+0.049
 screams+0.048
ext+0.047
 waves+0.047
Neg logit contributions
 whether-0.065
my-0.064
isk-0.062
ety-0.062
coat-0.057
Token.
Token ID13
Feature activation-2.62e-02

Pos logit contributions
 floats+0.387
 screams+0.381
 waves+0.378
ls+0.369
 sinks+0.358
Neg logit contributions
my-0.263
isk-0.259
 whether-0.254
ety-0.248
coat-0.240
Token He
Token ID679
Feature activation+8.92e-01

Pos logit contributions
isk+0.001
ety+0.001
my+0.001
 whether+0.001
coat+0.001
Neg logit contributions
 floats-0.001
 waves-0.001
 screams-0.001
ls-0.001
 cheers-0.001
Token saw
Token ID2497
Feature activation-7.56e-01

Pos logit contributions
 ourselves+0.062
 screams+0.058
ls+0.058
ext+0.057
 floats+0.057
Neg logit contributions
my-0.036
ety-0.033
 whether-0.033
isk-0.031
 brushed-0.031
Token runs
Token ID4539
Feature activation+5.80e+00

Pos logit contributions
my+0.076
isk+0.072
 whether+0.072
ety+0.071
 pad+0.064
Neg logit contributions
 floats-0.083
 ourselves-0.081
 waves-0.078
ls-0.075
 screams-0.075
Token to
Token ID284
Feature activation-4.71e-02

Pos logit contributions
 waves+0.355
 screams+0.342
 floats+0.338
ls+0.333
 cheers+0.321
Neg logit contributions
isk-0.294
ety-0.285
my-0.275
 whether-0.266
tr-0.261
Token the
Token ID262
Feature activation+1.71e+00

Pos logit contributions
isk+0.003
ety+0.003
my+0.003
 whether+0.002
coat+0.002
Neg logit contributions
 waves-0.003
 floats-0.003
 screams-0.003
ls-0.002
 cheers-0.002
Token fence
Token ID13990
Feature activation+1.50e+00

Pos logit contributions
 floats+0.090
 waves+0.087
 screams+0.085
ls+0.085
 sinks+0.080
Neg logit contributions
my-0.096
isk-0.096
ety-0.095
 whether-0.093
tr-0.089
Token and
Token ID290
Feature activation+1.15e+00

Pos logit contributions
 waves+0.084
 floats+0.082
 screams+0.080
 cheers+0.076
ls+0.076
Neg logit contributions
isk-0.082
ety-0.082
 whether-0.077
my-0.076
tr-0.071
Token waves
Token ID9813
Feature activation+1.32e+01

Pos logit contributions
 floats+0.061
 waves+0.060
 screams+0.058
ls+0.057
 ourselves+0.056
Neg logit contributions
isk-0.064
ety-0.063
my-0.063
 whether-0.062
coat-0.058
Token at
Token ID379
Feature activation+1.60e+00

Pos logit contributions
 floats+0.754
 waves+0.753
 screams+0.726
ls+0.708
 cheers+0.683
Neg logit contributions
isk-0.708
ety-0.690
my-0.680
 whether-0.672
coat-0.625
Token the
Token ID262
Feature activation+1.26e+00

Pos logit contributions
 floats+0.104
 waves+0.103
 screams+0.099
ls+0.097
 sinks+0.095
Neg logit contributions
isk-0.073
ety-0.072
my-0.071
 whether-0.069
coat-0.063
Token bull
Token ID6473
Feature activation+6.02e+00

Pos logit contributions
ls+0.063
 floats+0.063
ext+0.061
 waves+0.058
 screams+0.057
Neg logit contributions
my-0.072
isk-0.070
ety-0.069
 whether-0.069
tr-0.069
Token.
Token ID13
Feature activation-1.44e-01

Pos logit contributions
 floats+0.382
 waves+0.377
 screams+0.363
ls+0.361
 sinks+0.347
Neg logit contributions
isk-0.280
my-0.279
ety-0.274
 whether-0.270
coat-0.245
Token The
Token ID383
Feature activation+6.02e-01

Pos logit contributions
isk+0.006
ety+0.006
my+0.006
 whether+0.006
coat+0.006
Neg logit contributions
 floats-0.010
 waves-0.010
 screams-0.009
ls-0.009
 cheers-0.009
Token them
Token ID606
Feature activation+5.19e+00

Pos logit contributions
 floats+0.256
 waves+0.250
 screams+0.244
ls+0.237
 sinks+0.229
Neg logit contributions
my-0.305
isk-0.304
 whether-0.300
ety-0.300
coat-0.270
Token could
Token ID714
Feature activation+8.01e-01

Pos logit contributions
ext+0.289
 floats+0.288
ls+0.281
 waves+0.280
 screams+0.278
Neg logit contributions
my-0.259
 whether-0.244
ety-0.236
isk-0.232
coat-0.211
Token really
Token ID1107
Feature activation+7.14e-01

Pos logit contributions
 floats+0.051
ls+0.049
 waves+0.047
 screams+0.046
 sinks+0.045
Neg logit contributions
my-0.038
 whether-0.038
isk-0.037
ety-0.035
coat-0.034
Token ride
Token ID6594
Feature activation-1.02e+00

Pos logit contributions
 floats+0.039
ls+0.038
 waves+0.037
 screams+0.037
ext+0.037
Neg logit contributions
my-0.036
 whether-0.036
ety-0.035
isk-0.034
coat-0.033
Token the
Token ID262
Feature activation+1.40e+00

Pos logit contributions
my+0.056
isk+0.056
ety+0.055
 whether+0.055
coat+0.051
Neg logit contributions
 waves-0.055
 floats-0.055
 screams-0.054
ls-0.051
 sinks-0.050
Token waves
Token ID9813
Feature activation+1.31e+01

Pos logit contributions
 floats+0.070
ls+0.069
 screams+0.069
ext+0.067
 sinks+0.066
Neg logit contributions
my-0.079
ety-0.073
coat-0.073
 whether-0.073
 pad-0.072
Token,
Token ID11
Feature activation-4.03e-01

Pos logit contributions
 floats+0.842
 waves+0.831
 screams+0.830
ls+0.802
 sinks+0.782
Neg logit contributions
my-0.600
ety-0.581
 whether-0.569
isk-0.569
coat-0.533
Token but
Token ID475
Feature activation-4.92e+00

Pos logit contributions
isk+0.017
my+0.017
 whether+0.017
ety+0.017
coat+0.015
Neg logit contributions
 floats-0.027
 waves-0.027
 screams-0.026
ls-0.026
 sinks-0.025
Token they
Token ID484
Feature activation+5.17e+00

Pos logit contributions
isk+0.279
my+0.279
ety+0.276
 whether+0.274
coat+0.249
Neg logit contributions
 floats-0.258
 waves-0.256
 screams-0.250
ls-0.241
 sinks-0.232
Token all
Token ID477
Feature activation+1.81e+00

Pos logit contributions
 floats+0.323
 waves+0.313
ls+0.306
 screams+0.306
 ourselves+0.296
Neg logit contributions
my-0.249
isk-0.244
 whether-0.235
ety-0.233
coat-0.207
Token had
Token ID550
Feature activation+4.03e-02

Pos logit contributions
 floats+0.108
ls+0.107
 waves+0.107
 screams+0.106
 ourselves+0.105
Neg logit contributions
my-0.089
ety-0.082
 whether-0.082
isk-0.081
coat-0.073
Token learned
Token ID4499
Feature activation+3.52e+00

Pos logit contributions
my+0.009
isk+0.008
ety+0.008
 whether+0.007
 brushed+0.007
Neg logit contributions
 ourselves-0.013
 floats-0.013
ls-0.012
 waves-0.012
 fishermen-0.012
Token that
Token ID326
Feature activation+7.69e-01

Pos logit contributions
 waves+0.199
 floats+0.194
 screams+0.191
ls+0.185
 cheers+0.178
Neg logit contributions
isk-0.193
ety-0.191
my-0.186
 whether-0.180
coat-0.176
Token just
Token ID655
Feature activation+1.20e+00

Pos logit contributions
 waves+0.043
 floats+0.043
 screams+0.041
ls+0.040
 sinks+0.039
Neg logit contributions
isk-0.043
ety-0.042
my-0.040
 whether-0.040
coat-0.039
Token because
Token ID780
Feature activation-6.78e+00

Pos logit contributions
 waves+0.071
 floats+0.069
 screams+0.068
 ourselves+0.065
 sinks+0.065
Neg logit contributions
isk-0.063
ety-0.060
coat-0.058
my-0.057
 whether-0.055
Token someone
Token ID2130
Feature activation-2.31e-01

Pos logit contributions
my+0.342
 whether+0.340
ety+0.321
isk+0.308
 though+0.285
Neg logit contributions
 floats-0.395
 screams-0.386
 waves-0.385
ls-0.367
 cheers-0.362
Token waves
Token ID9813
Feature activation+1.31e+01

Pos logit contributions
my+0.012
ety+0.011
 whether+0.011
isk+0.010
coat+0.009
Neg logit contributions
 waves-0.014
 floats-0.014
 screams-0.013
 ourselves-0.013
ls-0.013
Token at
Token ID379
Feature activation+1.54e+00

Pos logit contributions
 floats+0.755
 waves+0.754
 screams+0.741
ls+0.707
ext+0.700
Neg logit contributions
my-0.668
ety-0.639
 whether-0.634
isk-0.621
coat-0.585
Token you
Token ID345
Feature activation+6.83e+00

Pos logit contributions
 floats+0.081
 waves+0.079
 screams+0.077
ls+0.073
 cheers+0.072
Neg logit contributions
my-0.090
isk-0.090
ety-0.089
 whether-0.087
coat-0.078
Token,
Token ID11
Feature activation-5.21e-01

Pos logit contributions
 floats+0.444
 waves+0.436
 screams+0.420
ls+0.414
 sinks+0.398
Neg logit contributions
my-0.314
isk-0.311
 whether-0.307
ety-0.305
coat-0.266
Token it
Token ID340
Feature activation-1.41e+00

Pos logit contributions
 whether+0.018
isk+0.018
my+0.018
ety+0.018
coat+0.015
Neg logit contributions
 floats-0.039
 waves-0.038
 screams-0.038
ls-0.037
 sinks-0.036
Token does
Token ID857
Feature activation+1.30e+00

Pos logit contributions
my+0.064
ety+0.052
isk+0.051
 whether+0.051
c+0.050
Neg logit contributions
 floats-0.091
 moms-0.090
 fishermen-0.089
 waves-0.087
 ourselves-0.087
Token spend
Token ID4341
Feature activation-8.19e-01

Pos logit contributions
ls+0.015
 floats+0.014
 screams+0.014
 waves+0.013
 sinks+0.013
Neg logit contributions
my-0.010
 whether-0.010
isk-0.010
ety-0.009
coat-0.009
Token the
Token ID262
Feature activation+1.71e+00

Pos logit contributions
isk+0.045
ety+0.044
my+0.044
 whether+0.043
coat+0.041
Neg logit contributions
 floats-0.045
 waves-0.045
 screams-0.043
ls-0.042
 sinks-0.041
Token day
Token ID1110
Feature activation-6.69e-01

Pos logit contributions
 floats+0.093
 waves+0.089
 screams+0.088
ls+0.088
 sinks+0.085
Neg logit contributions
my-0.096
isk-0.091
ety-0.090
 whether-0.090
coat-0.090
Token chasing
Token ID20023
Feature activation+2.02e+00

Pos logit contributions
my+0.031
 whether+0.030
coat+0.029
ety+0.028
isk+0.028
Neg logit contributions
 floats-0.041
 waves-0.041
 screams-0.040
 sinks-0.038
 explodes-0.038
Token the
Token ID262
Feature activation+1.40e+00

Pos logit contributions
 floats+0.111
 waves+0.109
 screams+0.106
ls+0.102
 sinks+0.101
Neg logit contributions
isk-0.110
my-0.109
ety-0.108
 whether-0.107
coat-0.099
Token waves
Token ID9813
Feature activation+1.31e+01

Pos logit contributions
 floats+0.073
 screams+0.071
ls+0.070
 waves+0.069
 sinks+0.069
Neg logit contributions
my-0.078
ety-0.075
isk-0.074
coat-0.073
 whether-0.073
Token.
Token ID13
Feature activation-2.62e-02

Pos logit contributions
 floats+0.815
 waves+0.803
 screams+0.791
ls+0.767
 sinks+0.749
Neg logit contributions
my-0.622
isk-0.615
ety-0.613
 whether-0.601
coat-0.566
Token
Token ID220
Feature activation-1.61e+00

Pos logit contributions
isk+0.001
ety+0.001
my+0.001
 whether+0.001
coat+0.001
Neg logit contributions
 floats-0.001
 waves-0.001
 screams-0.001
ls-0.001
 cheers-0.001
Token
Token ID198
Feature activation+5.94e-01

Pos logit contributions
my+0.079
isk+0.078
ety+0.076
 whether+0.074
coat+0.067
Neg logit contributions
 floats-0.103
 waves-0.102
 screams-0.097
ls-0.093
 sinks-0.092
TokenMom
Token ID29252
Feature activation+7.17e-01

Pos logit contributions
 floats+0.032
 waves+0.031
 screams+0.030
ls+0.029
 cheers+0.029
Neg logit contributions
isk-0.034
ety-0.033
my-0.033
 whether-0.032
coat-0.030
Tokenmy
Token ID1820
Feature activation-1.16e+01

Pos logit contributions
 waves+0.063
 floats+0.062
 screams+0.061
ls+0.059
 cheers+0.059
Neg logit contributions
isk-0.016
ety-0.016
 whether-0.015
my-0.014
coat-0.012
Token water
Token ID1660
Feature activation+5.71e+00

Pos logit contributions
ls+0.100
 floats+0.098
 screams+0.096
ext+0.095
 sinks+0.091
Neg logit contributions
my-0.133
ety-0.127
 whether-0.122
isk-0.121
coat-0.120
Token.
Token ID13
Feature activation+4.28e-01

Pos logit contributions
 screams+0.349
 floats+0.348
ext+0.336
ls+0.334
 waves+0.331
Neg logit contributions
my-0.257
ety-0.246
isk-0.239
coat-0.234
 whether-0.234
Token They
Token ID1119
Feature activation+3.62e+00

Pos logit contributions
 floats+0.025
 waves+0.025
 screams+0.024
ls+0.023
 cheers+0.023
Neg logit contributions
isk-0.022
ety-0.022
my-0.021
 whether-0.021
coat-0.020
Token saw
Token ID2497
Feature activation-4.21e-01

Pos logit contributions
ls+0.223
ext+0.217
 screams+0.215
 floats+0.215
 ourselves+0.213
Neg logit contributions
my-0.171
isk-0.159
 whether-0.159
ety-0.158
 brushed-0.149
Token the
Token ID262
Feature activation+1.40e+00

Pos logit contributions
my+0.020
 whether+0.020
ety+0.019
isk+0.019
coat+0.017
Neg logit contributions
 floats-0.025
 screams-0.024
ls-0.024
 waves-0.024
ext-0.023
Token waves
Token ID9813
Feature activation+1.31e+01

Pos logit contributions
ext+0.065
 screams+0.062
text+0.061
ls+0.061
am+0.061
Neg logit contributions
my-0.083
ety-0.080
 whether-0.076
coat-0.074
tr-0.073
Token and
Token ID290
Feature activation+9.38e-01

Pos logit contributions
 floats+0.822
 screams+0.813
 waves+0.809
ls+0.800
 ourselves+0.775
Neg logit contributions
my-0.603
ety-0.576
 whether-0.567
isk-0.567
coat-0.528
Token the
Token ID262
Feature activation+1.18e+00

Pos logit contributions
ls+0.045
 floats+0.044
 screams+0.044
 waves+0.043
ext+0.043
Neg logit contributions
my-0.054
 whether-0.054
ety-0.054
isk-0.052
coat-0.048
Token shells
Token ID19679
Feature activation+8.41e-01

Pos logit contributions
ext+0.051
ls+0.045
text+0.044
am+0.044
 ourselves+0.043
Neg logit contributions
my-0.075
ety-0.071
 pad-0.067
coat-0.067
 pencil-0.067
Token.
Token ID13
Feature activation-1.89e-01

Pos logit contributions
 floats+0.052
 screams+0.052
ext+0.050
 waves+0.050
 ourselves+0.049
Neg logit contributions
my-0.037
ety-0.035
 whether-0.035
coat-0.033
isk-0.033
Token They
Token ID1119
Feature activation+3.08e+00

Pos logit contributions
isk+0.011
ety+0.011
my+0.011
 whether+0.011
coat+0.010
Neg logit contributions
 floats-0.010
 waves-0.010
 screams-0.010
ls-0.009
 sinks-0.009
Token face
Token ID1986
Feature activation+3.97e+00

Pos logit contributions
my+0.171
ety+0.163
 whether+0.159
tr+0.158
 pad+0.157
Neg logit contributions
 floats-0.162
ls-0.154
 sinks-0.149
 waves-0.149
 fishermen-0.146
Token,
Token ID11
Feature activation+5.10e-02

Pos logit contributions
 waves+0.226
 floats+0.225
 screams+0.215
ls+0.209
 cheers+0.204
Neg logit contributions
isk-0.217
ety-0.212
 whether-0.203
my-0.203
coat-0.190
Token and
Token ID290
Feature activation+1.27e+00

Pos logit contributions
 waves+0.003
 floats+0.003
 screams+0.003
 cheers+0.003
ls+0.003
Neg logit contributions
isk-0.003
ety-0.002
my-0.002
 whether-0.002
coat-0.002
Token watching
Token ID4964
Feature activation-1.84e+00

Pos logit contributions
 floats+0.082
 waves+0.081
 screams+0.079
ls+0.077
 sinks+0.075
Neg logit contributions
isk-0.058
my-0.057
ety-0.057
 whether-0.056
coat-0.051
Token the
Token ID262
Feature activation+1.40e+00

Pos logit contributions
isk+0.096
my+0.095
ety+0.095
 whether+0.093
coat+0.086
Neg logit contributions
 floats-0.105
 waves-0.104
 screams-0.101
ls-0.099
 sinks-0.096
Token waves
Token ID9813
Feature activation+1.31e+01

Pos logit contributions
 floats+0.064
ext+0.064
ls+0.064
 sinks+0.064
 screams+0.063
Neg logit contributions
my-0.083
ety-0.079
tr-0.076
 whether-0.075
isk-0.075
Token.
Token ID13
Feature activation-2.62e-02

Pos logit contributions
 floats+0.772
 waves+0.763
 screams+0.757
ls+0.741
 sinks+0.722
Neg logit contributions
my-0.656
ety-0.638
isk-0.637
 whether-0.621
coat-0.587
Token
Token ID220
Feature activation-1.61e+00

Pos logit contributions
isk+0.001
ety+0.001
my+0.001
 whether+0.001
coat+0.001
Neg logit contributions
 floats-0.002
 waves-0.002
 screams-0.002
 cheers-0.002
ls-0.002
Token
Token ID198
Feature activation+5.94e-01

Pos logit contributions
my+0.080
isk+0.079
ety+0.078
 whether+0.075
tr+0.068
Neg logit contributions
 floats-0.101
 waves-0.100
 screams-0.096
ls-0.092
 sinks-0.091
Token
Token ID198
Feature activation+5.50e-01

Pos logit contributions
 floats+0.029
 waves+0.028
 screams+0.028
ls+0.027
 cheers+0.026
Neg logit contributions
isk-0.036
ety-0.035
 whether-0.034
my-0.034
coat-0.033
TokenOne
Token ID3198
Feature activation-7.59e+00

Pos logit contributions
 floats+0.031
 waves+0.031
 screams+0.030
ls+0.029
 cheers+0.029
Neg logit contributions
isk-0.028
ety-0.028
my-0.027
 whether-0.027
coat-0.026
Token jog
Token ID48342
Feature activation+2.86e+00

Pos logit contributions
ls+0.197
 floats+0.195
bled+0.191
 ourselves+0.188
 sinks+0.187
Neg logit contributions
my-0.159
isk-0.158
ety-0.148
 whether-0.141
coat-0.140
Token with
Token ID351
Feature activation-2.93e+00

Pos logit contributions
 floats+0.175
 screams+0.174
 waves+0.171
 explodes+0.165
 sinks+0.163
Neg logit contributions
my-0.133
ety-0.131
isk-0.128
 whether-0.127
coat-0.125
Token Rex
Token ID17853
Feature activation+1.41e+00

Pos logit contributions
isk+0.161
my+0.157
ety+0.155
 whether+0.154
coat+0.140
Neg logit contributions
 floats-0.163
 waves-0.160
 screams-0.154
ls-0.152
 sinks-0.148
Token.
Token ID13
Feature activation-6.12e-02

Pos logit contributions
 floats+0.092
 waves+0.091
 screams+0.088
ls+0.085
 sinks+0.084
Neg logit contributions
my-0.063
isk-0.063
ety-0.062
 whether-0.061
coat-0.056
Token Luna
Token ID23694
Feature activation-4.66e+00

Pos logit contributions
isk+0.004
ety+0.004
my+0.004
 whether+0.003
coat+0.003
Neg logit contributions
 floats-0.003
 waves-0.003
 screams-0.003
ls-0.003
 sinks-0.003
Token waves
Token ID9813
Feature activation+1.30e+01

Pos logit contributions
my+0.239
 whether+0.225
isk+0.224
coat+0.212
ety+0.210
Neg logit contributions
bled-0.256
 ourselves-0.236
keys-0.236
ading-0.234
 fishermen-0.234
Token to
Token ID284
Feature activation-7.34e-01

Pos logit contributions
 floats+0.863
 waves+0.846
 screams+0.831
ls+0.804
 sinks+0.801
Neg logit contributions
my-0.573
ety-0.553
isk-0.542
 whether-0.541
coat-0.497
Token them
Token ID606
Feature activation+4.53e+00

Pos logit contributions
ety+0.047
my+0.045
isk+0.044
 whether+0.044
coat+0.044
Neg logit contributions
 waves-0.035
 floats-0.034
 screams-0.034
 cheers-0.031
ls-0.030
Token.
Token ID13
Feature activation-3.59e-02

Pos logit contributions
 floats+0.310
 waves+0.302
 screams+0.295
ls+0.293
 sinks+0.285
Neg logit contributions
my-0.185
isk-0.177
ety-0.177
 whether-0.176
coat-0.162
Token She
Token ID1375
Feature activation-2.79e+00

Pos logit contributions
isk+0.002
ety+0.002
my+0.002
 whether+0.002
coat+0.002
Neg logit contributions
 floats-0.002
 waves-0.002
 screams-0.002
ls-0.002
 cheers-0.002
Token finishes
Token ID20271
Feature activation+2.83e+00

Pos logit contributions
my+0.159
isk+0.149
 whether+0.148
ety+0.142
coat+0.132
Neg logit contributions
bled-0.142
 ourselves-0.138
ading-0.135
 fishermen-0.129
ls-0.128
Token,
Token ID11
Feature activation-4.34e-01

Pos logit contributions
isk+0.074
ety+0.071
my+0.066
coat+0.066
tr+0.065
Neg logit contributions
 waves-0.050
 floats-0.050
 screams-0.046
ls-0.046
 sinks-0.044
Token he
Token ID339
Feature activation+7.91e-01

Pos logit contributions
isk+0.027
ety+0.027
my+0.026
coat+0.025
 whether+0.025
Neg logit contributions
 waves-0.022
 floats-0.022
 screams-0.020
ls-0.020
 sinks-0.020
Token ran
Token ID4966
Feature activation+5.86e+00

Pos logit contributions
 ourselves+0.061
 screams+0.059
ags+0.058
 floats+0.057
 cheers+0.057
Neg logit contributions
my-0.028
ety-0.027
isk-0.025
 whether-0.022
 brushed-0.020
Token into
Token ID656
Feature activation+4.73e+00

Pos logit contributions
 floats+0.369
 waves+0.358
 screams+0.354
ls+0.340
 cheers+0.332
Neg logit contributions
isk-0.278
my-0.276
ety-0.274
 whether-0.272
coat-0.250
Token big
Token ID1263
Feature activation+4.19e+00

Pos logit contributions
 floats+0.244
 waves+0.240
 screams+0.231
ls+0.225
 sinks+0.217
Neg logit contributions
isk-0.278
ety-0.274
my-0.274
 whether-0.270
coat-0.252
Token waves
Token ID9813
Feature activation+1.30e+01

Pos logit contributions
bled+0.230
ext+0.216
 ourselves+0.213
izes+0.207
 screams+0.207
Neg logit contributions
my-0.231
 whether-0.200
ety-0.197
c-0.194
 pencil-0.190
Token.
Token ID13
Feature activation-5.48e-02

Pos logit contributions
 floats+0.858
 waves+0.841
 screams+0.826
ls+0.811
 sinks+0.788
Neg logit contributions
my-0.578
isk-0.575
ety-0.572
 whether-0.559
coat-0.518
Token The
Token ID383
Feature activation+7.57e-01

Pos logit contributions
isk+0.003
ety+0.003
 whether+0.003
my+0.003
coat+0.003
Neg logit contributions
 waves-0.003
 floats-0.003
 screams-0.003
 cheers-0.003
ls-0.003
Token waves
Token ID9813
Feature activation+1.31e+01

Pos logit contributions
 screams+0.039
aro+0.039
ls+0.039
ext+0.037
cks+0.037
Neg logit contributions
my-0.041
 one-0.038
 pen-0.036
 though-0.036
 while-0.036
Token kept
Token ID4030
Feature activation+4.27e-01

Pos logit contributions
 ourselves+0.884
 screams+0.838
ls+0.822
 waves+0.820
ext+0.818
Neg logit contributions
my-0.546
ety-0.502
coat-0.484
 whether-0.477
isk-0.474
Token spl
Token ID4328
Feature activation+6.38e+00

Pos logit contributions
 floats+0.025
 screams+0.024
 waves+0.024
ls+0.023
 sinks+0.022
Neg logit contributions
my-0.022
 whether-0.022
isk-0.021
ety-0.021
coat-0.019
Token the
Token ID262
Feature activation+1.23e+00

Pos logit contributions
 floats+0.081
 waves+0.080
 screams+0.077
ls+0.075
 sinks+0.073
Neg logit contributions
my-0.081
isk-0.080
 whether-0.078
ety-0.078
coat-0.068
Token cat
Token ID3797
Feature activation-4.19e+00

Pos logit contributions
 floats+0.069
 waves+0.069
 screams+0.067
 cheers+0.062
 sinks+0.061
Neg logit contributions
isk-0.069
ety-0.068
 whether-0.067
my-0.064
coat-0.060
Token.
Token ID13
Feature activation-5.73e-02

Pos logit contributions
isk+0.224
ety+0.223
 whether+0.215
my+0.215
coat+0.202
Neg logit contributions
 floats-0.239
 waves-0.237
 screams-0.231
ls-0.222
 cheers-0.218
Token The
Token ID383
Feature activation+7.29e-01

Pos logit contributions
ety+0.003
isk+0.003
my+0.003
 whether+0.003
tr+0.003
Neg logit contributions
 waves-0.003
 floats-0.003
 screams-0.003
ls-0.003
 sinks-0.003
Token cat
Token ID3797
Feature activation-4.19e+00

Pos logit contributions
 floats+0.042
 waves+0.041
 screams+0.039
 moms+0.037
 cheers+0.037
Neg logit contributions
ety-0.040
 whether-0.040
isk-0.039
my-0.036
coat-0.034
Token waves
Token ID9813
Feature activation+1.30e+01

Pos logit contributions
my+0.247
isk+0.240
 whether+0.230
ety+0.229
tr+0.223
Neg logit contributions
 ourselves-0.195
bled-0.187
 fishermen-0.184
 floats-0.184
ext-0.183
Token back
Token ID736
Feature activation+6.35e+00

Pos logit contributions
 floats+0.744
 waves+0.734
 screams+0.710
 ourselves+0.700
ls+0.699
Neg logit contributions
my-0.654
ety-0.620
isk-0.616
 whether-0.613
coat-0.564
Token with
Token ID351
Feature activation-2.90e+00

Pos logit contributions
 floats+0.404
 waves+0.401
 screams+0.391
 cheers+0.372
ls+0.371
Neg logit contributions
ety-0.292
my-0.290
isk-0.287
 whether-0.281
coat-0.259
Token its
Token ID663
Feature activation+1.70e+00

Pos logit contributions
my+0.142
isk+0.137
 whether+0.133
ety+0.133
 hers+0.122
Neg logit contributions
 floats-0.171
 waves-0.168
ls-0.165
 screams-0.162
 sinks-0.155
Token tail
Token ID7894
Feature activation+1.62e+00

Pos logit contributions
 floats+0.092
 waves+0.091
 screams+0.087
ls+0.085
 sinks+0.083
Neg logit contributions
isk-0.095
ety-0.094
my-0.094
 whether-0.092
coat-0.086
Token.
Token ID13
Feature activation-3.43e-02

Pos logit contributions
 floats+0.099
 waves+0.098
 screams+0.095
ls+0.093
 sinks+0.090
Neg logit contributions
isk-0.079
ety-0.078
my-0.077
 whether-0.076
coat-0.071
Token She
Token ID1375
Feature activation-2.81e+00

Pos logit contributions
isk+0.005
ety+0.004
 whether+0.004
ge+0.004
coat+0.004
Neg logit contributions
 Everyone-0.004
 floats-0.004
 waves-0.004
 Everybody-0.004
 cheers-0.004
Token liked
Token ID8288
Feature activation-1.89e+00

Pos logit contributions
my+0.105
ety+0.102
isk+0.099
 whether+0.097
coat+0.091
Neg logit contributions
 screams-0.199
 floats-0.199
 waves-0.197
ls-0.196
 ourselves-0.195
Token to
Token ID284
Feature activation-7.37e-01

Pos logit contributions
isk+0.091
ety+0.085
my+0.080
 whether+0.077
coat+0.077
Neg logit contributions
 waves-0.130
 floats-0.128
 screams-0.119
ls-0.116
 cheers-0.116
Token ride
Token ID6594
Feature activation-1.27e+00

Pos logit contributions
isk+0.045
my+0.044
ety+0.044
 whether+0.044
coat+0.041
Neg logit contributions
 floats-0.035
 screams-0.034
 waves-0.034
ls-0.034
 sinks-0.032
Token the
Token ID262
Feature activation+1.23e+00

Pos logit contributions
isk+0.077
ety+0.074
 whether+0.070
my+0.069
coat+0.068
Neg logit contributions
 waves-0.069
 floats-0.069
ls-0.066
 screams-0.064
 ourselves-0.061
Token waves
Token ID9813
Feature activation+1.30e+01

Pos logit contributions
 screams+0.064
 drowned+0.062
 drown+0.062
ext+0.062
 sinks+0.062
Neg logit contributions
my-0.067
 pencil-0.060
 whether-0.058
ety-0.057
 pad-0.057
Token and
Token ID290
Feature activation+9.09e-01

Pos logit contributions
 waves+0.738
 floats+0.738
 screams+0.695
ls+0.686
 cheers+0.664
Neg logit contributions
isk-0.716
ety-0.689
 whether-0.672
my-0.667
coat-0.618
Token feel
Token ID1254
Feature activation-1.55e+00

Pos logit contributions
 floats+0.050
 waves+0.049
ls+0.049
 screams+0.049
 sinks+0.046
Neg logit contributions
my-0.049
ety-0.048
isk-0.048
 whether-0.048
coat-0.044
Token the
Token ID262
Feature activation+1.25e+00

Pos logit contributions
my+0.088
 whether+0.088
isk+0.088
ety+0.087
coat+0.080
Neg logit contributions
 floats-0.081
 waves-0.079
 screams-0.079
ls-0.076
 sinks-0.074
Token wind
Token ID2344
Feature activation+3.74e+00

Pos logit contributions
ls+0.071
ext+0.064
 explodes+0.062
izes+0.061
 sinks+0.060
Neg logit contributions
my-0.071
ety-0.063
coat-0.063
 pencil-0.062
 screw-0.059
Token and
Token ID290
Feature activation+9.30e-01

Pos logit contributions
 floats+0.201
 waves+0.199
 screams+0.190
ls+0.185
 sinks+0.179
Neg logit contributions
isk-0.214
ety-0.211
 whether-0.207
my-0.206
coat-0.191
Token he
Token ID339
Feature activation+7.91e-01

Pos logit contributions
 whether+0.085
my+0.081
ety+0.077
isk+0.071
 though+0.069
Neg logit contributions
 floats-0.150
 screams-0.141
 waves-0.140
 cheers-0.134
ext-0.131
Token could
Token ID714
Feature activation+3.16e-01

Pos logit contributions
 floats+0.046
 waves+0.045
 screams+0.044
ls+0.042
 cheers+0.041
Neg logit contributions
isk-0.041
ety-0.041
my-0.041
 whether-0.040
coat-0.037
Token,
Token ID11
Feature activation-4.38e-01

Pos logit contributions
 floats+0.024
 waves+0.022
ls+0.021
 screams+0.021
 teams+0.020
Neg logit contributions
 whether-0.012
isk-0.012
my-0.012
ety-0.011
coat-0.011
Token but
Token ID475
Feature activation-4.88e+00

Pos logit contributions
isk+0.017
my+0.016
ety+0.016
 whether+0.016
coat+0.014
Neg logit contributions
 floats-0.032
 waves-0.031
 screams-0.031
ls-0.030
 sinks-0.029
Token the
Token ID262
Feature activation+1.24e+00

Pos logit contributions
my+0.230
 whether+0.228
isk+0.226
ety+0.217
tr+0.195
Neg logit contributions
 floats-0.314
 waves-0.302
 screams-0.289
ls-0.286
 cheers-0.278
Token waves
Token ID9813
Feature activation+1.30e+01

Pos logit contributions
ext+0.063
ls+0.060
cks+0.058
text+0.058
izes+0.057
Neg logit contributions
my-0.068
 one-0.060
ety-0.059
 screw-0.059
 whether-0.059
Token kept
Token ID4030
Feature activation+4.31e-01

Pos logit contributions
 floats+0.789
 waves+0.774
 screams+0.757
 ourselves+0.755
 cheers+0.727
Neg logit contributions
isk-0.628
my-0.623
ety-0.611
 whether-0.590
coat-0.581
Token getting
Token ID1972
Feature activation-3.11e+00

Pos logit contributions
 floats+0.023
 waves+0.023
 screams+0.022
ls+0.022
 sinks+0.021
Neg logit contributions
isk-0.024
ety-0.024
my-0.024
 whether-0.023
coat-0.022
Token higher
Token ID2440
Feature activation+6.20e-01

Pos logit contributions
ety+0.176
my+0.175
 whether+0.170
isk+0.169
coat+0.154
Neg logit contributions
 floats-0.170
 screams-0.163
 waves-0.162
ls-0.153
 cheers-0.151
Token and
Token ID290
Feature activation+9.30e-01

Pos logit contributions
 floats+0.043
 screams+0.041
 waves+0.041
 cheers+0.039
 sinks+0.039
Neg logit contributions
my-0.026
ety-0.025
 whether-0.025
isk-0.024
coat-0.022
Token higher
Token ID2440
Feature activation+6.35e-01

Pos logit contributions
 floats+0.050
 screams+0.049
 waves+0.048
ls+0.047
ext+0.045
Neg logit contributions
my-0.052
ety-0.052
 whether-0.051
isk-0.051
coat-0.045
Token tub
Token ID12202
Feature activation+5.77e+00

Pos logit contributions
ls+0.057
 screams+0.057
ext+0.056
 floats+0.054
keys+0.052
Neg logit contributions
my-0.075
ety-0.072
isk-0.069
coat-0.069
 screw-0.068
Token.
Token ID13
Feature activation-5.73e-02

Pos logit contributions
 floats+0.352
 waves+0.350
 screams+0.338
ls+0.330
 sinks+0.320
Neg logit contributions
isk-0.284
ety-0.280
my-0.276
 whether-0.272
coat-0.253
Token They
Token ID1119
Feature activation+3.25e+00

Pos logit contributions
ety+0.003
 whether+0.003
isk+0.003
my+0.003
tr+0.003
Neg logit contributions
 floats-0.003
 waves-0.003
 screams-0.003
ls-0.003
 ourselves-0.003
Token make
Token ID787
Feature activation-3.16e+00

Pos logit contributions
ls+0.201
 floats+0.200
 screams+0.199
ext+0.194
 waves+0.193
Neg logit contributions
my-0.152
isk-0.141
 whether-0.140
ety-0.140
coat-0.129
Token some
Token ID617
Feature activation-3.41e+00

Pos logit contributions
my+0.138
 whether+0.137
isk+0.136
ety+0.134
coat+0.120
Neg logit contributions
 floats-0.196
ls-0.191
 screams-0.190
 waves-0.187
 sinks-0.180
Token waves
Token ID9813
Feature activation+1.30e+01

Pos logit contributions
my+0.195
 whether+0.190
ety+0.189
isk+0.182
coat+0.181
Neg logit contributions
ext-0.157
 floats-0.155
 screams-0.154
ls-0.150
 waves-0.145
Token with
Token ID351
Feature activation-2.90e+00

Pos logit contributions
 floats+0.815
 waves+0.807
 screams+0.794
ls+0.771
 sinks+0.753
Neg logit contributions
isk-0.607
my-0.603
ety-0.603
 whether-0.584
coat-0.546
Token their
Token ID511
Feature activation+5.66e+00

Pos logit contributions
isk+0.183
ety+0.182
my+0.181
 whether+0.179
coat+0.168
Neg logit contributions
 floats-0.133
 waves-0.131
 screams-0.128
ls-0.124
 sinks-0.118
Token hands
Token ID2832
Feature activation+3.96e+00

Pos logit contributions
ext+0.281
izes+0.277
 screams+0.274
ls+0.268
 floats+0.265
Neg logit contributions
ety-0.307
my-0.299
coat-0.283
 pencil-0.281
isk-0.278
Token.
Token ID13
Feature activation-3.43e-02

Pos logit contributions
 waves+0.223
 floats+0.221
 screams+0.210
ls+0.205
 cheers+0.199
Neg logit contributions
isk-0.223
ety-0.214
 whether-0.207
my-0.206
coat-0.193
Token They
Token ID1119
Feature activation+3.28e+00

Pos logit contributions
ety+0.002
 whether+0.002
isk+0.002
my+0.002
tr+0.001
Neg logit contributions
 floats-0.002
 waves-0.002
 screams-0.002
ls-0.002
 ourselves-0.002
Token in
Token ID287
Feature activation+2.33e+00

Pos logit contributions
 floats+0.122
 screams+0.121
 waves+0.120
 explodes+0.115
 sinks+0.115
Neg logit contributions
my-0.118
ety-0.116
 whether-0.113
isk-0.112
coat-0.109
Token the
Token ID262
Feature activation+1.20e+00

Pos logit contributions
 floats+0.122
 waves+0.119
 screams+0.119
ls+0.118
 sinks+0.112
Neg logit contributions
my-0.131
 whether-0.128
ety-0.126
isk-0.125
coat-0.116
Token water
Token ID1660
Feature activation+5.00e+00

Pos logit contributions
keys+0.067
ext+0.062
 screams+0.060
ls+0.060
 explodes+0.059
Neg logit contributions
my-0.070
ety-0.060
 pad-0.053
coat-0.053
c-0.052
Token.
Token ID13
Feature activation-8.68e-02

Pos logit contributions
 floats+0.316
 waves+0.312
 screams+0.305
ls+0.297
 sinks+0.290
Neg logit contributions
isk-0.232
my-0.230
ety-0.229
 whether-0.223
coat-0.210
Token The
Token ID383
Feature activation+7.11e-01

Pos logit contributions
isk+0.004
ety+0.004
my+0.004
 whether+0.004
coat+0.004
Neg logit contributions
 floats-0.005
 waves-0.005
 screams-0.005
ls-0.005
 sinks-0.005
Token waves
Token ID9813
Feature activation+1.30e+01

Pos logit contributions
ls+0.037
ext+0.036
keys+0.035
 explodes+0.034
izes+0.033
Neg logit contributions
my-0.043
ety-0.037
 pad-0.035
 whether-0.034
coat-0.034
Token felt
Token ID2936
Feature activation+1.12e+00

Pos logit contributions
 floats+0.754
 waves+0.753
 screams+0.749
 ourselves+0.747
ls+0.739
Neg logit contributions
my-0.641
ety-0.621
isk-0.620
coat-0.606
 whether-0.600
Token gentle
Token ID10296
Feature activation+8.07e-01

Pos logit contributions
ls+0.074
 floats+0.074
 screams+0.072
 waves+0.070
 explodes+0.068
Neg logit contributions
 whether-0.047
my-0.046
ety-0.044
isk-0.043
coat-0.042
Token as
Token ID355
Feature activation-2.08e+00

Pos logit contributions
 floats+0.050
 waves+0.050
 screams+0.048
ls+0.047
 sinks+0.046
Neg logit contributions
isk-0.039
ety-0.038
my-0.038
 whether-0.037
coat-0.034
Token they
Token ID484
Feature activation+5.26e+00

Pos logit contributions
 whether+0.125
ety+0.120
my+0.118
 hers+0.109
 though+0.109
Neg logit contributions
 screams-0.105
 floats-0.102
 waves-0.100
 sinks-0.098
 explodes-0.096
Token rolled
Token ID11686
Feature activation+1.51e+00

Pos logit contributions
 floats+0.307
 screams+0.304
 waves+0.301
 ourselves+0.293
ls+0.292
Neg logit contributions
ety-0.268
isk-0.265
my-0.264
 whether-0.251
coat-0.245
Token hand
Token ID1021
Feature activation+1.39e+00

Pos logit contributions
my+0.002
isk+0.002
coat+0.001
ety+0.001
 whether+0.001
Neg logit contributions
 floats-0.002
ls-0.002
 waves-0.002
 screams-0.002
 sinks-0.002
Token as
Token ID355
Feature activation-2.11e+00

Pos logit contributions
 waves+0.078
 floats+0.077
 screams+0.075
ls+0.072
 cheers+0.071
Neg logit contributions
isk-0.077
ety-0.074
 whether-0.071
my-0.070
coat-0.066
Token they
Token ID484
Feature activation+5.24e+00

Pos logit contributions
 whether+0.114
my+0.113
ety+0.109
isk+0.106
coat+0.100
Neg logit contributions
 floats-0.117
 screams-0.116
 waves-0.115
 sinks-0.110
ls-0.106
Token watched
Token ID7342
Feature activation+1.56e+00

Pos logit contributions
 screams+0.342
 ourselves+0.338
 floats+0.332
ext+0.327
ls+0.323
Neg logit contributions
my-0.238
isk-0.229
ety-0.223
 whether-0.206
 pad-0.194
Token the
Token ID262
Feature activation+1.22e+00

Pos logit contributions
 floats+0.092
ls+0.089
 waves+0.089
 screams+0.088
 sinks+0.086
Neg logit contributions
my-0.077
 whether-0.077
isk-0.072
ety-0.070
 hers-0.066
Token waves
Token ID9813
Feature activation+1.30e+01

Pos logit contributions
cks+0.053
ext+0.052
inkle+0.051
izes+0.050
 explodes+0.049
Neg logit contributions
my-0.073
 pad-0.061
 whether-0.061
 jewelry-0.061
 library-0.060
Token crashing
Token ID21899
Feature activation+4.72e+00

Pos logit contributions
 screams+0.732
 floats+0.728
ls+0.715
 waves+0.711
 sinks+0.697
Neg logit contributions
my-0.681
ety-0.642
isk-0.636
 whether-0.628
coat-0.604
Token against
Token ID1028
Feature activation-8.96e-01

Pos logit contributions
 screams+0.281
 floats+0.277
 waves+0.267
ls+0.265
 cheers+0.262
Neg logit contributions
my-0.219
ety-0.213
coat-0.212
isk-0.208
tr-0.204
Token the
Token ID262
Feature activation+1.24e+00

Pos logit contributions
isk+0.054
my+0.054
ety+0.054
 whether+0.053
coat+0.049
Neg logit contributions
 floats-0.044
 waves-0.043
 screams-0.042
ls-0.041
 sinks-0.039
Token shore
Token ID15191
Feature activation+1.03e+01

Pos logit contributions
ext+0.056
ls+0.050
izes+0.049
inkle+0.048
 explodes+0.048
Neg logit contributions
my-0.081
 pad-0.072
 which-0.070
ety-0.069
 pencil-0.067
Token.
Token ID13
Feature activation-3.38e-02

Pos logit contributions
 floats+0.670
 screams+0.654
 waves+0.652
ls+0.640
 sinks+0.623
Neg logit contributions
my-0.455
isk-0.441
ety-0.437
 whether-0.430
coat-0.396
TokenThe
Token ID464
Feature activation+1.67e+00

Pos logit contributions
 floats+0.037
 waves+0.036
 screams+0.035
ls+0.034
 sinks+0.033
Neg logit contributions
isk-0.036
ety-0.035
my-0.035
 whether-0.034
coat-0.032
Token boat
Token ID8848
Feature activation+9.24e+00

Pos logit contributions
 ourselves+0.122
 screams+0.121
ls+0.120
cks+0.120
efully+0.117
Neg logit contributions
my-0.052
 screw-0.043
 thread-0.043
 while-0.043
 which-0.041
Token floated
Token ID29742
Feature activation+6.85e+00

Pos logit contributions
 ourselves+0.722
ags+0.680
 screams+0.678
ls+0.668
ext+0.666
Neg logit contributions
my-0.329
ety-0.271
isk-0.265
c-0.253
 whether-0.236
Token across
Token ID1973
Feature activation+4.65e-01

Pos logit contributions
 floats+0.410
 screams+0.401
 waves+0.391
ls+0.375
 cheers+0.370
Neg logit contributions
my-0.361
ety-0.351
 whether-0.344
isk-0.343
coat-0.322
Token the
Token ID262
Feature activation+1.22e+00

Pos logit contributions
 floats+0.028
 screams+0.027
ls+0.027
 waves+0.026
ext+0.026
Neg logit contributions
my-0.022
 whether-0.022
ety-0.021
isk-0.021
 though-0.018
Token waves
Token ID9813
Feature activation+1.30e+01

Pos logit contributions
 screams+0.071
ext+0.067
icks+0.066
cks+0.063
ls+0.063
Neg logit contributions
my-0.066
ety-0.065
L-0.054
c-0.053
 since-0.052
Token.
Token ID13
Feature activation-6.12e-02

Pos logit contributions
 floats+0.911
 screams+0.888
ls+0.885
 waves+0.882
 sinks+0.846
Neg logit contributions
my-0.534
ety-0.501
 whether-0.496
isk-0.493
coat-0.459
Token It
Token ID632
Feature activation-1.53e+00

Pos logit contributions
isk+0.003
ety+0.003
my+0.003
 whether+0.003
coat+0.003
Neg logit contributions
 floats-0.004
 waves-0.004
 screams-0.003
ls-0.003
 sinks-0.003
Token passed
Token ID3804
Feature activation-1.09e+00

Pos logit contributions
my+0.054
ety+0.050
isk+0.048
 whether+0.046
c+0.045
Neg logit contributions
 ourselves-0.115
ags-0.108
 screams-0.108
ls-0.108
 floats-0.105
Token lots
Token ID6041
Feature activation-1.87e+00

Pos logit contributions
my+0.055
ety+0.054
isk+0.053
 whether+0.053
coat+0.048
Neg logit contributions
 floats-0.067
 waves-0.065
 screams-0.064
ls-0.062
 sinks-0.061
Token of
Token ID286
Feature activation+4.18e+00

Pos logit contributions
 though+0.078
my+0.077
 whether+0.075
ety+0.070
 which+0.069
Neg logit contributions
 screams-0.120
 floats-0.118
 ourselves-0.117
ext-0.116
 waves-0.114
Token picks
Token ID11103
Feature activation-7.05e-01

Pos logit contributions
my+0.159
isk+0.150
 whether+0.148
ety+0.145
tr+0.134
Neg logit contributions
 ourselves-0.153
 floats-0.139
bled-0.134
 fishermen-0.133
ading-0.131
Token up
Token ID510
Feature activation+5.55e+00

Pos logit contributions
isk+0.051
ety+0.050
my+0.050
 whether+0.049
coat+0.047
Neg logit contributions
 floats-0.027
 waves-0.027
 screams-0.026
ls-0.024
 sinks-0.023
Token a
Token ID257
Feature activation-8.31e-01

Pos logit contributions
 waves+0.267
 floats+0.251
ls+0.239
 screams+0.237
 cheers+0.223
Neg logit contributions
isk-0.375
ety-0.360
 whether-0.355
tr-0.355
my-0.346
Token stick
Token ID4859
Feature activation-2.63e+00

Pos logit contributions
isk+0.057
ety+0.053
 whether+0.053
my+0.050
coat+0.046
Neg logit contributions
 floats-0.039
 waves-0.039
 screams-0.037
ls-0.036
 cheers-0.035
Token and
Token ID290
Feature activation+9.01e-01

Pos logit contributions
isk+0.171
ety+0.165
 whether+0.158
my+0.152
coat+0.147
Neg logit contributions
 waves-0.129
 screams-0.121
 floats-0.119
ls-0.117
 sinks-0.112
Token waves
Token ID9813
Feature activation+1.30e+01

Pos logit contributions
 floats+0.042
ls+0.039
 screams+0.039
 ourselves+0.039
 waves+0.039
Neg logit contributions
my-0.053
 whether-0.052
ety-0.051
isk-0.051
tr-0.048
Token it
Token ID340
Feature activation-1.28e+00

Pos logit contributions
 waves+0.730
 floats+0.723
 screams+0.698
ls+0.682
 cheers+0.660
Neg logit contributions
isk-0.741
ety-0.707
my-0.685
 whether-0.682
tr-0.648
Token.
Token ID13
Feature activation-5.76e-02

Pos logit contributions
my+0.069
ety+0.068
 whether+0.066
isk+0.066
tr+0.063
Neg logit contributions
 floats-0.067
 waves-0.065
 screams-0.064
 ourselves-0.062
 fishermen-0.062
Token She
Token ID1375
Feature activation-2.80e+00

Pos logit contributions
isk+0.002
ety+0.002
my+0.002
 whether+0.002
coat+0.002
Neg logit contributions
 waves-0.004
 screams-0.004
 floats-0.004
ls-0.004
 cheers-0.004
Token makes
Token ID1838
Feature activation+1.40e+00

Pos logit contributions
my+0.153
 whether+0.145
isk+0.141
ety+0.139
 brushed+0.132
Neg logit contributions
 ourselves-0.147
keys-0.132
bled-0.132
ading-0.131
 fishermen-0.131
Token a
Token ID257
Feature activation-7.85e-01

Pos logit contributions
 floats+0.091
 waves+0.088
ls+0.086
 screams+0.085
 sinks+0.082
Neg logit contributions
isk-0.062
my-0.062
ety-0.061
 whether-0.060
coat-0.054
Token a
Token ID257
Feature activation-8.24e-01

Pos logit contributions
 floats+0.012
ls+0.012
 waves+0.012
 screams+0.012
 sinks+0.011
Neg logit contributions
my-0.008
isk-0.008
 whether-0.008
ety-0.007
coat-0.007
Token great
Token ID1049
Feature activation+3.12e+00

Pos logit contributions
my+0.052
isk+0.051
 whether+0.051
ety+0.051
coat+0.048
Neg logit contributions
 floats-0.037
 waves-0.037
 screams-0.037
ls-0.036
 sinks-0.035
Token afternoon
Token ID6672
Feature activation+6.67e-01

Pos logit contributions
 floats+0.170
 waves+0.167
 screams+0.165
ls+0.162
 sinks+0.157
Neg logit contributions
my-0.173
isk-0.171
ety-0.167
 whether-0.167
coat-0.158
Token watching
Token ID4964
Feature activation-2.08e+00

Pos logit contributions
 screams+0.041
 floats+0.040
ls+0.040
 sinks+0.039
 waves+0.039
Neg logit contributions
my-0.030
 whether-0.030
isk-0.028
coat-0.028
ety-0.028
Token the
Token ID262
Feature activation+1.22e+00

Pos logit contributions
 whether+0.106
my+0.106
isk+0.101
ety+0.094
 hers+0.091
Neg logit contributions
 floats-0.116
ls-0.115
 screams-0.115
 waves-0.113
 sinks-0.110
Token waves
Token ID9813
Feature activation+1.30e+01

Pos logit contributions
cks+0.064
agged+0.063
ags+0.062
ext+0.060
 explodes+0.060
Neg logit contributions
my-0.066
 one-0.055
Today-0.053
 whether-0.053
 while-0.052
Token and
Token ID290
Feature activation+9.03e-01

Pos logit contributions
 floats+0.774
 screams+0.767
 waves+0.759
ls+0.745
 sinks+0.728
Neg logit contributions
my-0.647
isk-0.623
ety-0.611
 whether-0.608
coat-0.573
Token the
Token ID262
Feature activation+1.23e+00

Pos logit contributions
ls+0.049
 floats+0.048
 screams+0.047
 waves+0.046
 sinks+0.046
Neg logit contributions
my-0.050
 whether-0.049
isk-0.048
ety-0.046
coat-0.041
Token attractive
Token ID10966
Feature activation-6.98e-01

Pos logit contributions
ls+0.057
ext+0.056
 explodes+0.055
cks+0.055
 sinks+0.051
Neg logit contributions
my-0.077
 one-0.064
 whether-0.061
 pad-0.061
 pair-0.061
Token ocean
Token ID9151
Feature activation+7.43e+00

Pos logit contributions
my+0.047
 one+0.040
 whether+0.040
 color+0.039
 while+0.038
Neg logit contributions
 losers-0.029
 explodes-0.029
ext-0.027
keys-0.027
ls-0.027
Token.
Token ID13
Feature activation-3.38e-02

Pos logit contributions
 screams+0.444
 floats+0.433
ls+0.431
ext+0.427
 waves+0.424
Neg logit contributions
my-0.343
isk-0.332
 whether-0.318
ety-0.309
coat-0.298
Token jump
Token ID4391
Feature activation+8.64e-01

Pos logit contributions
ls+0.195
 floats+0.189
 screams+0.188
ext+0.185
 waves+0.185
Neg logit contributions
my-0.160
ety-0.147
 whether-0.146
isk-0.143
coat-0.137
Token on
Token ID319
Feature activation-2.18e-01

Pos logit contributions
 floats+0.056
 waves+0.055
 screams+0.055
ls+0.052
 ourselves+0.052
Neg logit contributions
my-0.038
isk-0.037
ety-0.036
 whether-0.036
coat-0.034
Token it
Token ID340
Feature activation-1.30e+00

Pos logit contributions
isk+0.013
ety+0.013
my+0.013
 whether+0.013
coat+0.012
Neg logit contributions
 floats-0.011
 waves-0.011
 screams-0.010
ls-0.010
 sinks-0.010
Token and
Token ID290
Feature activation+9.07e-01

Pos logit contributions
isk+0.074
ety+0.074
 whether+0.069
my+0.069
coat+0.064
Neg logit contributions
 waves-0.072
 floats-0.071
 screams-0.070
ls-0.067
 cheers-0.065
Token make
Token ID787
Feature activation-3.16e+00

Pos logit contributions
ls+0.054
ext+0.053
 floats+0.052
 fishermen+0.052
 screams+0.052
Neg logit contributions
my-0.043
 whether-0.040
ety-0.038
coat-0.037
tr-0.037
Token waves
Token ID9813
Feature activation+1.30e+01

Pos logit contributions
 whether+0.153
my+0.150
isk+0.141
ety+0.135
tr+0.132
Neg logit contributions
ls-0.171
 floats-0.169
 fishermen-0.159
 waves-0.158
ext-0.156
Token.
Token ID13
Feature activation-6.12e-02

Pos logit contributions
 floats+0.798
 waves+0.793
 screams+0.769
ls+0.750
 sinks+0.729
Neg logit contributions
isk-0.638
ety-0.628
my-0.624
 whether-0.612
coat-0.569
Token They
Token ID1119
Feature activation+3.25e+00

Pos logit contributions
ety+0.003
 whether+0.003
my+0.003
isk+0.003
coat+0.002
Neg logit contributions
 waves-0.004
 floats-0.004
ls-0.004
 screams-0.004
 ourselves-0.003
Token laugh
Token ID6487
Feature activation+2.59e+00

Pos logit contributions
ls+0.194
ext+0.185
 screams+0.182
 floats+0.182
 waves+0.178
Neg logit contributions
my-0.169
ety-0.153
 whether-0.152
isk-0.147
coat-0.143
Token and
Token ID290
Feature activation+9.09e-01

Pos logit contributions
 floats+0.173
 waves+0.171
 screams+0.168
ls+0.165
 sinks+0.161
Neg logit contributions
my-0.109
isk-0.108
 whether-0.107
ety-0.106
coat-0.096
Token have
Token ID423
Feature activation+1.00e+00

Pos logit contributions
ls+0.049
 floats+0.045
ext+0.045
 waves+0.045
 sinks+0.044
Neg logit contributions
my-0.051
 whether-0.048
ety-0.048
isk-0.047
coat-0.047
Token gentle
Token ID10296
Feature activation+7.91e-01

Pos logit contributions
isk+0.027
ety+0.027
my+0.026
 whether+0.026
coat+0.024
Neg logit contributions
 floats-0.024
 waves-0.024
 screams-0.023
ls-0.022
 cheers-0.022
Token seal
Token ID13810
Feature activation+6.75e+00

Pos logit contributions
 floats+0.033
 waves+0.033
 screams+0.031
ls+0.030
 sinks+0.029
Neg logit contributions
isk-0.054
ety-0.053
my-0.053
 whether-0.052
coat-0.050
Token playing
Token ID2712
Feature activation+5.58e-01

Pos logit contributions
 floats+0.392
 waves+0.388
 screams+0.378
ls+0.365
 sinks+0.358
Neg logit contributions
isk-0.347
ety-0.345
my-0.344
 whether-0.337
coat-0.312
Token in
Token ID287
Feature activation+2.36e+00

Pos logit contributions
 floats+0.033
 waves+0.033
 screams+0.032
ls+0.031
 sinks+0.030
Neg logit contributions
isk-0.028
ety-0.028
my-0.028
 whether-0.027
coat-0.025
Token the
Token ID262
Feature activation+1.22e+00

Pos logit contributions
 floats+0.117
 waves+0.115
 screams+0.111
ls+0.108
 sinks+0.104
Neg logit contributions
isk-0.144
ety-0.141
my-0.141
 whether-0.139
coat-0.130
Token waves
Token ID9813
Feature activation+1.30e+01

Pos logit contributions
keys+0.062
 screams+0.062
ls+0.059
ext+0.058
aro+0.057
Neg logit contributions
my-0.073
ety-0.066
 pad-0.063
 which-0.061
shirt-0.059
Token.
Token ID13
Feature activation-6.12e-02

Pos logit contributions
 floats+0.819
 waves+0.806
 screams+0.797
ls+0.775
 sinks+0.757
Neg logit contributions
my-0.610
ety-0.601
isk-0.601
 whether-0.585
coat-0.545
Token Jimmy
Token ID12963
Feature activation+6.40e+00

Pos logit contributions
isk+0.003
my+0.003
ety+0.003
 whether+0.003
coat+0.003
Neg logit contributions
 floats-0.004
 waves-0.004
ls-0.004
 screams-0.004
 ourselves-0.003
Token was
Token ID373
Feature activation+4.03e-01

Pos logit contributions
 ourselves+0.429
 floats+0.427
ls+0.424
 screams+0.409
 waves+0.407
Neg logit contributions
my-0.274
ety-0.254
isk-0.254
 whether-0.242
 pad-0.223
Token mesmer
Token ID46814
Feature activation+3.92e+00

Pos logit contributions
 floats+0.024
ls+0.024
 waves+0.023
 screams+0.022
 sinks+0.022
Neg logit contributions
my-0.021
 whether-0.020
isk-0.020
ety-0.020
coat-0.018
Tokenized
Token ID1143
Feature activation+6.61e+00

Pos logit contributions
 screams+0.148
 floats+0.144
 sinks+0.137
 cheers+0.136
 explodes+0.135
Neg logit contributions
ety-0.288
my-0.278
 whether-0.276
isk-0.267
 though-0.253
Token saw
Token ID2497
Feature activation-1.14e+00

Pos logit contributions
isk+0.143
ety+0.142
 whether+0.140
my+0.136
coat+0.134
Neg logit contributions
 waves-0.108
 floats-0.107
 screams-0.106
ls-0.101
 cheers-0.096
Token that
Token ID326
Feature activation-2.00e-01

Pos logit contributions
ety+0.074
tr+0.073
coat+0.072
isk+0.069
 pad+0.068
Neg logit contributions
 waves-0.053
 screams-0.050
 floats-0.048
ls-0.047
 cheers-0.045
Token Mr
Token ID1770
Feature activation-3.47e+00

Pos logit contributions
isk+0.013
tr+0.013
ety+0.012
 pad+0.012
coat+0.012
Neg logit contributions
 waves-0.009
 floats-0.009
ls-0.008
 cheers-0.008
 boat-0.007
Token.
Token ID13
Feature activation-5.28e-01

Pos logit contributions
 whether+0.219
coat+0.198
my+0.194
tr+0.192
kit+0.191
Neg logit contributions
ls-0.170
 screams-0.159
 waves-0.156
 floats-0.151
 ourselves-0.151
Token Wh
Token ID854
Feature activation-1.23e+01

Pos logit contributions
shirt+0.026
 whether+0.025
wn+0.023
my+0.023
ety+0.023
Neg logit contributions
 shore-0.027
 screams-0.027
 waves-0.026
 shark-0.026
 drown-0.026
Tokenisk
Token ID1984
Feature activation-1.68e+01

Pos logit contributions
 whether+0.196
 though+0.125
 hers+0.111
 pencil+0.107
Which+0.106
Neg logit contributions
ls-1.090
 screams-1.086
am-1.081
 waves-1.068
 ourselves-1.060
Tokeners
Token ID364
Feature activation+1.02e+00

Pos logit contributions
ety+1.102
isk+1.098
my+1.082
 whether+1.081
coat+1.026
Neg logit contributions
 floats-0.731
 waves-0.726
 screams-0.692
ls-0.680
 sinks-0.643
Token was
Token ID373
Feature activation-7.73e-02

Pos logit contributions
 floats+0.058
 waves+0.058
 screams+0.056
ls+0.054
 sinks+0.053
Neg logit contributions
isk-0.054
ety-0.053
my-0.052
 whether-0.052
coat-0.048
Token lost
Token ID2626
Feature activation+2.19e+00

Pos logit contributions
ety+0.005
isk+0.005
 whether+0.005
my+0.004
coat+0.004
Neg logit contributions
 waves-0.004
 screams-0.004
 floats-0.004
 sinks-0.003
 cheers-0.003
Token and
Token ID290
Feature activation+4.20e-01

Pos logit contributions
 waves+0.094
ls+0.084
 screams+0.083
 floats+0.082
 sinks+0.081
Neg logit contributions
isk-0.145
ety-0.140
tr-0.133
 whether-0.130
 gloss-0.126
Token she
Token ID673
Feature activation-3.19e+00

Pos logit contributions
 floats+0.026
 waves+0.026
 screams+0.025
ls+0.024
 sinks+0.024
Neg logit contributions
isk-0.020
ety-0.020
my-0.020
 whether-0.019
coat-0.018
Token family
Token ID1641
Feature activation+3.72e+00

Pos logit contributions
my+0.028
isk+0.027
ety+0.027
 whether+0.026
coat+0.026
Neg logit contributions
 floats-0.024
 waves-0.024
ls-0.023
 screams-0.023
 ourselves-0.023
Token sh
Token ID427
Feature activation-1.31e+00

Pos logit contributions
 floats+0.220
 waves+0.219
 screams+0.211
ls+0.205
 sinks+0.199
Neg logit contributions
isk-0.192
ety-0.189
my-0.186
 whether-0.184
coat-0.172
Tokenoo
Token ID2238
Feature activation-4.02e-01

Pos logit contributions
 whether+0.084
ety+0.082
isk+0.081
my+0.081
tr+0.078
Neg logit contributions
 floats-0.057
 waves-0.056
 screams-0.054
ls-0.052
 ourselves-0.050
Tokened
Token ID276
Feature activation+1.40e+00

Pos logit contributions
isk+0.023
my+0.022
ety+0.022
 whether+0.021
coat+0.021
Neg logit contributions
 floats-0.022
 waves-0.021
 screams-0.020
ls-0.020
 sinks-0.020
Token Wh
Token ID854
Feature activation-1.23e+01

Pos logit contributions
 waves+0.073
 floats+0.070
 screams+0.070
 cheers+0.067
ls+0.065
Neg logit contributions
ety-0.084
isk-0.082
my-0.077
 whether-0.076
tr-0.075
Tokenisk
Token ID1984
Feature activation-1.68e+01

Pos logit contributions
 whether+0.159
ety+0.077
 though+0.069
 hers+0.062
Which+0.058
Neg logit contributions
 screams-1.152
 floats-1.131
 waves-1.129
 ourselves-1.120
am-1.112
Tokeners
Token ID364
Feature activation+1.03e+00

Pos logit contributions
isk+1.072
ety+1.055
my+1.048
 whether+1.033
coat+0.977
Neg logit contributions
 floats-0.781
 waves-0.776
 screams-0.742
ls-0.716
 sinks-0.690
Token away
Token ID1497
Feature activation+2.36e+00

Pos logit contributions
 floats+0.064
 waves+0.063
 screams+0.061
ls+0.060
 sinks+0.058
Neg logit contributions
isk-0.050
ety-0.049
my-0.049
 whether-0.048
coat-0.044
Token and
Token ID290
Feature activation+4.20e-01

Pos logit contributions
 waves+0.126
 floats+0.124
 screams+0.124
ls+0.118
 sinks+0.115
Neg logit contributions
ety-0.134
isk-0.130
my-0.128
 whether-0.124
coat-0.119
Token told
Token ID1297
Feature activation-3.54e-01

Pos logit contributions
 floats+0.023
 waves+0.023
 screams+0.022
ls+0.021
 sinks+0.021
Neg logit contributions
isk-0.023
ety-0.023
my-0.023
 whether-0.023
coat-0.021
Token Max
Token ID5436
Feature activation+2.53e-01

Pos logit contributions
ety+0.023
isk+0.022
my+0.022
coat+0.022
 whether+0.021
Neg logit contributions
 floats-0.016
 waves-0.016
 screams-0.016
ls-0.015
 cheers-0.015
Token a
Token ID257
Feature activation-1.29e+00

Pos logit contributions
ety+0.261
isk+0.259
my+0.256
 whether+0.251
coat+0.240
Neg logit contributions
 waves-0.240
 floats-0.240
 screams-0.232
ls-0.225
 sinks-0.218
Token long
Token ID890
Feature activation-6.15e-01

Pos logit contributions
isk+0.077
ety+0.074
my+0.074
 whether+0.072
coat+0.067
Neg logit contributions
 screams-0.064
 floats-0.064
 waves-0.063
ls-0.061
 sinks-0.059
Token time
Token ID640
Feature activation+2.60e+00

Pos logit contributions
ety+0.045
 whether+0.044
isk+0.044
my+0.043
tr+0.040
Neg logit contributions
 waves-0.022
 screams-0.021
 floats-0.021
 sinks-0.019
 cheers-0.019
Token.
Token ID13
Feature activation-5.26e-01

Pos logit contributions
 floats+0.159
 waves+0.158
 screams+0.153
ls+0.149
 sinks+0.145
Neg logit contributions
isk-0.127
ety-0.125
my-0.125
 whether-0.123
coat-0.113
Token Wh
Token ID854
Feature activation-1.23e+01

Pos logit contributions
 whether+0.029
ety+0.029
my+0.028
isk+0.028
coat+0.028
Neg logit contributions
 floats-0.026
 screams-0.026
 waves-0.025
ls-0.024
 cheers-0.024
Tokenisk
Token ID1984
Feature activation-1.68e+01

Pos logit contributions
 whether+0.139
my+0.078
coat+0.075
ety+0.066
 hers+0.055
Neg logit contributions
ls-1.175
 floats-1.160
 screams-1.153
am-1.153
 waves-1.149
Tokeners
Token ID364
Feature activation+1.03e+00

Pos logit contributions
isk+1.051
my+1.019
ety+1.018
 whether+0.992
coat+0.937
Neg logit contributions
 floats-0.819
 waves-0.813
 screams-0.781
ls-0.744
 sinks-0.725
Token learned
Token ID4499
Feature activation+2.36e+00

Pos logit contributions
 floats+0.070
 waves+0.067
 ourselves+0.064
ls+0.064
 screams+0.063
Neg logit contributions
isk-0.048
my-0.048
ety-0.043
 whether-0.041
 brushed-0.039
Token that
Token ID326
Feature activation-2.03e-01

Pos logit contributions
 waves+0.106
 floats+0.101
 screams+0.101
ls+0.098
 ourselves+0.093
Neg logit contributions
ety-0.155
isk-0.154
my-0.151
coat-0.147
 whether-0.144
Token sometimes
Token ID3360
Feature activation-3.59e+00

Pos logit contributions
isk+0.012
ety+0.012
coat+0.011
my+0.011
 whether+0.011
Neg logit contributions
 waves-0.011
 floats-0.011
 screams-0.010
 sinks-0.010
ls-0.010
Token it
Token ID340
Feature activation-1.74e+00

Pos logit contributions
my+0.208
 whether+0.207
ety+0.200
isk+0.198
 though+0.176
Neg logit contributions
 floats-0.184
 waves-0.181
 screams-0.173
ls-0.168
 cheers-0.164
Token wise
Token ID10787
Feature activation+3.90e-01

Pos logit contributions
 floats+0.045
 waves+0.043
ls+0.042
 screams+0.042
 sinks+0.041
Neg logit contributions
my-0.039
isk-0.039
ety-0.038
 whether-0.037
coat-0.035
Token old
Token ID1468
Feature activation-3.60e-01

Pos logit contributions
 waves+0.018
 screams+0.017
 floats+0.017
ls+0.015
 cheers+0.015
Neg logit contributions
isk-0.026
 whether-0.025
ety-0.025
my-0.025
tr-0.023
Token cat
Token ID3797
Feature activation-4.66e+00

Pos logit contributions
isk+0.023
ety+0.022
 whether+0.022
my+0.021
tr+0.020
Neg logit contributions
 waves-0.018
 screams-0.016
 floats-0.016
 boat-0.015
 moms-0.015
Token named
Token ID3706
Feature activation-2.11e+00

Pos logit contributions
isk+0.265
ety+0.259
 whether+0.257
coat+0.247
tr+0.242
Neg logit contributions
 waves-0.230
 floats-0.226
ls-0.224
 screams-0.220
 boat-0.205
Token Wh
Token ID854
Feature activation-1.23e+01

Pos logit contributions
ety+0.114
tr+0.112
coat+0.112
 whether+0.108
my+0.107
Neg logit contributions
 waves-0.114
 screams-0.113
 floats-0.105
 cheers-0.103
ls-0.102
Tokenisk
Token ID1984
Feature activation-1.68e+01

Pos logit contributions
 whether+0.178
 though+0.127
 blonde+0.113
 brushed+0.111
 pencil+0.110
Neg logit contributions
 screams-1.085
 waves-1.079
ls-1.076
 ourselves-1.062
 floats-1.057
Tokeners
Token ID364
Feature activation+1.03e+00

Pos logit contributions
ety+1.128
isk+1.111
 whether+1.106
my+1.101
coat+1.055
Neg logit contributions
 floats-0.701
 waves-0.700
ls-0.660
 screams-0.660
 sinks-0.616
Token.
Token ID13
Feature activation-5.35e-01

Pos logit contributions
 waves+0.050
 floats+0.047
ls+0.047
 screams+0.047
 cheers+0.044
Neg logit contributions
ety-0.061
isk-0.060
 whether-0.058
tr-0.058
my-0.057
Token Wh
Token ID854
Feature activation-1.23e+01

Pos logit contributions
ety+0.027
my+0.026
 whether+0.026
isk+0.024
coat+0.024
Neg logit contributions
 floats-0.031
 screams-0.031
 waves-0.031
 cheers-0.028
 moms-0.028
Tokenisk
Token ID1984
Feature activation-1.68e+01

Pos logit contributions
 whether+0.149
ety+0.083
rings+0.083
 pencil+0.083
tr+0.080
Neg logit contributions
ls-1.141
 screams-1.135
 floats-1.126
 waves-1.120
am-1.110
Tokeners
Token ID364
Feature activation+1.04e+00

Pos logit contributions
ety+1.109
isk+1.101
my+1.088
 whether+1.088
coat+1.033
Neg logit contributions
 floats-0.723
 waves-0.717
 screams-0.681
ls-0.672
 sinks-0.635
Token scared
Token ID12008
Feature activation+2.95e+00

Pos logit contributions
isk+0.004
my+0.004
ety+0.004
 whether+0.004
coat+0.004
Neg logit contributions
 floats-0.004
 waves-0.004
 screams-0.004
ls-0.004
 sinks-0.004
Token.
Token ID13
Feature activation-5.48e-01

Pos logit contributions
 waves+0.153
 floats+0.150
 screams+0.148
ls+0.148
 cheers+0.137
Neg logit contributions
isk-0.181
ety-0.168
tr-0.161
my-0.161
 whether-0.160
Token Mr
Token ID1770
Feature activation-3.49e+00

Pos logit contributions
isk+0.028
 whether+0.027
ety+0.026
my+0.026
coat+0.025
Neg logit contributions
 screams-0.031
 floats-0.031
 waves-0.031
ls-0.029
 cheers-0.029
Token.
Token ID13
Feature activation-5.29e-01

Pos logit contributions
 whether+0.191
isk+0.188
my+0.184
ety+0.183
coat+0.178
Neg logit contributions
 waves-0.188
 floats-0.187
ls-0.182
 screams-0.182
 ourselves-0.170
Token Wh
Token ID854
Feature activation-1.23e+01

Pos logit contributions
 whether+0.021
ety+0.019
my+0.019
shirt+0.019
 brushed+0.018
Neg logit contributions
 waves-0.032
 floats-0.032
 screams-0.032
 sinks-0.032
 ourselves-0.031
Tokenisk
Token ID1984
Feature activation-1.68e+01

Pos logit contributions
 whether+0.086
my+0.040
ety+0.039
tr+0.012
coat+0.007
Neg logit contributions
 floats-1.225
 waves-1.223
 screams-1.215
ls-1.211
am-1.173
Tokeners
Token ID364
Feature activation+1.02e+00

Pos logit contributions
isk+1.056
ety+1.032
my+1.030
 whether+1.007
coat+0.952
Neg logit contributions
 floats-0.805
 waves-0.799
 screams-0.765
ls-0.732
 sinks-0.712
Token said
Token ID531
Feature activation+2.22e+00

Pos logit contributions
 floats+0.059
 waves+0.058
 screams+0.056
ls+0.055
 sinks+0.053
Neg logit contributions
isk-0.054
ety-0.053
my-0.052
 whether-0.051
coat-0.048
Token,
Token ID11
Feature activation-9.13e-01

Pos logit contributions
 waves+0.115
 floats+0.115
 screams+0.113
ls+0.111
 sinks+0.105
Neg logit contributions
isk-0.133
ety-0.130
my-0.126
 whether-0.122
coat-0.117
Token "
Token ID366
Feature activation-5.01e+00

Pos logit contributions
isk+0.047
ety+0.047
my+0.043
coat+0.041
 whether+0.040
Neg logit contributions
 waves-0.054
 screams-0.054
 floats-0.052
 cheers-0.051
ls-0.050
TokenDon
Token ID3987
Feature activation-4.61e+00

Pos logit contributions
isk+0.238
 whether+0.238
my+0.217
ety+0.217
tr+0.208
Neg logit contributions
 floats-0.296
 waves-0.296
 screams-0.288
ls-0.274
 ourselves-0.272
Token Wh
Token ID854
Feature activation-1.23e+01

Pos logit contributions
ety+0.114
tr+0.112
coat+0.112
 whether+0.108
my+0.107
Neg logit contributions
 waves-0.114
 screams-0.113
 floats-0.105
 cheers-0.103
ls-0.102
Tokenisk
Token ID1984
Feature activation-1.68e+01

Pos logit contributions
 whether+0.178
 though+0.127
 blonde+0.113
 brushed+0.111
 pencil+0.110
Neg logit contributions
 screams-1.085
 waves-1.079
ls-1.076
 ourselves-1.062
 floats-1.057
Tokeners
Token ID364
Feature activation+1.03e+00

Pos logit contributions
ety+1.128
isk+1.111
 whether+1.106
my+1.101
coat+1.055
Neg logit contributions
 floats-0.701
 waves-0.700
ls-0.660
 screams-0.660
 sinks-0.616
Token.
Token ID13
Feature activation-5.35e-01

Pos logit contributions
 waves+0.050
 floats+0.047
ls+0.047
 screams+0.047
 cheers+0.044
Neg logit contributions
ety-0.061
isk-0.060
 whether-0.058
tr-0.058
my-0.057
Token Wh
Token ID854
Feature activation-1.23e+01

Pos logit contributions
ety+0.027
my+0.026
 whether+0.026
isk+0.024
coat+0.024
Neg logit contributions
 floats-0.031
 screams-0.031
 waves-0.031
 cheers-0.028
 moms-0.028
Tokenisk
Token ID1984
Feature activation-1.68e+01

Pos logit contributions
 whether+0.149
ety+0.083
rings+0.083
 pencil+0.083
tr+0.080
Neg logit contributions
ls-1.141
 screams-1.135
 floats-1.126
 waves-1.120
am-1.110
Tokeners
Token ID364
Feature activation+1.04e+00

Pos logit contributions
ety+1.109
isk+1.101
my+1.088
 whether+1.088
coat+1.033
Neg logit contributions
 floats-0.723
 waves-0.717
 screams-0.681
ls-0.672
 sinks-0.635
Token looked
Token ID3114
Feature activation-2.70e+00

Pos logit contributions
 floats+0.063
 waves+0.062
 screams+0.060
ls+0.058
 sinks+0.057
Neg logit contributions
isk-0.053
my-0.052
ety-0.052
 whether-0.050
coat-0.047
Token at
Token ID379
Feature activation+1.03e+00

Pos logit contributions
my+0.156
isk+0.155
ety+0.155
 whether+0.153
coat+0.141
Neg logit contributions
 floats-0.142
 waves-0.140
 screams-0.135
ls-0.129
 cheers-0.127
Token Spot
Token ID15899
Feature activation+2.76e+00

Pos logit contributions
 waves+0.047
 floats+0.047
 screams+0.045
ls+0.044
 sinks+0.042
Neg logit contributions
ety-0.064
isk-0.064
 whether-0.063
my-0.063
coat-0.062
Token and
Token ID290
Feature activation+4.30e-01

Pos logit contributions
 waves+0.146
 floats+0.143
 screams+0.143
ls+0.137
 sinks+0.133
Neg logit contributions
isk-0.156
ety-0.154
 whether-0.149
my-0.146
coat-0.144
Token.
Token ID13
Feature activation-5.48e-01

Pos logit contributions
 waves+0.111
 floats+0.111
 screams+0.107
ls+0.103
 cheers+0.102
Neg logit contributions
ety-0.103
isk-0.103
my-0.099
 whether-0.097
coat-0.092
Token
Token ID198
Feature activation+1.92e-01

Pos logit contributions
ety+0.033
 whether+0.033
isk+0.033
my+0.032
coat+0.032
Neg logit contributions
 floats-0.025
 waves-0.025
 screams-0.025
 cheers-0.024
ls-0.023
Token
Token ID198
Feature activation+2.09e-01

Pos logit contributions
 floats+0.009
 waves+0.009
 screams+0.009
ls+0.009
 cheers+0.008
Neg logit contributions
isk-0.012
ety-0.012
my-0.011
 whether-0.011
coat-0.011
TokenAfter
Token ID3260
Feature activation-1.84e+00

Pos logit contributions
 floats+0.010
 waves+0.010
 screams+0.010
ls+0.010
 cheers+0.010
Neg logit contributions
ety-0.012
isk-0.012
my-0.011
 whether-0.011
coat-0.011
Token Wh
Token ID854
Feature activation-1.23e+01

Pos logit contributions
ety+0.109
isk+0.108
my+0.106
 whether+0.103
coat+0.100
Neg logit contributions
 floats-0.097
 waves-0.096
 screams-0.092
ls-0.088
 cheers-0.087
Tokenisk
Token ID1984
Feature activation-1.68e+01

Pos logit contributions
 whether+0.188
 hers+0.109
my+0.105
ety+0.100
 which+0.097
Neg logit contributions
am-1.109
 screams-1.093
ls-1.086
ars-1.084
ext-1.060
Tokeners
Token ID364
Feature activation+1.04e+00

Pos logit contributions
isk+1.086
ety+1.085
my+1.069
 whether+1.063
coat+1.007
Neg logit contributions
 floats-0.748
 waves-0.743
 screams-0.710
ls-0.687
 sinks-0.659
Token got
Token ID1392
Feature activation-3.72e+00

Pos logit contributions
 floats+0.062
 waves+0.061
 screams+0.059
ls+0.058
 sinks+0.056
Neg logit contributions
isk-0.054
my-0.053
ety-0.052
 whether-0.050
coat-0.047
Token better
Token ID1365
Feature activation-9.25e-01

Pos logit contributions
my+0.196
isk+0.196
ety+0.196
 whether+0.189
coat+0.177
Neg logit contributions
 floats-0.216
 waves-0.212
 screams-0.207
ls-0.200
 sinks-0.194
Token,
Token ID11
Feature activation-9.12e-01

Pos logit contributions
isk+0.044
my+0.044
ety+0.043
 whether+0.043
coat+0.040
Neg logit contributions
 floats-0.058
 waves-0.057
 screams-0.055
ls-0.054
 sinks-0.052
Token he
Token ID339
Feature activation+3.19e-01

Pos logit contributions
ety+0.039
isk+0.038
my+0.037
 whether+0.037
coat+0.036
Neg logit contributions
 floats-0.060
 waves-0.060
ls-0.057
 screams-0.056
 cheers-0.055
Token,
Token ID11
Feature activation-9.01e-01

Pos logit contributions
 waves+0.055
 floats+0.055
 screams+0.053
ls+0.051
 cheers+0.050
Neg logit contributions
isk-0.047
ety-0.046
my-0.045
 whether-0.044
coat-0.042
Token and
Token ID290
Feature activation+4.32e-01

Pos logit contributions
my+0.042
ety+0.042
isk+0.042
 whether+0.041
coat+0.038
Neg logit contributions
 floats-0.057
 waves-0.056
 screams-0.054
ls-0.053
 sinks-0.052
Token he
Token ID339
Feature activation+3.20e-01

Pos logit contributions
 floats+0.026
inkle+0.026
 screams+0.026
 ourselves+0.025
 explodes+0.025
Neg logit contributions
ety-0.020
my-0.020
isk-0.019
 whether-0.019
coat-0.017
Token and
Token ID290
Feature activation+4.34e-01

Pos logit contributions
 ourselves+0.020
 floats+0.019
 sinks+0.019
ading+0.018
ls+0.018
Neg logit contributions
my-0.015
isk-0.015
ety-0.014
 brushed-0.014
 whether-0.013
Token Wh
Token ID854
Feature activation-1.23e+01

Pos logit contributions
 floats+0.024
 waves+0.023
 screams+0.023
ls+0.022
 sinks+0.021
Neg logit contributions
isk-0.024
ety-0.024
my-0.024
 whether-0.023
coat-0.022
Tokenisk
Token ID1984
Feature activation-1.68e+01

Pos logit contributions
isk+0.042
ety+0.040
 whether+0.038
my+0.033
coat-0.013
Neg logit contributions
 floats-1.298
 waves-1.292
 screams-1.271
ls-1.251
 sinks-1.229
Tokeners
Token ID364
Feature activation+1.01e+00

Pos logit contributions
isk+1.062
ety+1.041
my+1.037
 whether+1.016
coat+0.961
Neg logit contributions
 floats-0.793
 waves-0.790
 screams-0.757
ls-0.728
 sinks-0.704
Token played
Token ID2826
Feature activation+3.92e-01

Pos logit contributions
 floats+0.058
 waves+0.057
 screams+0.056
ls+0.055
 sinks+0.054
Neg logit contributions
isk-0.054
my-0.053
ety-0.052
 whether-0.050
coat-0.049
Token with
Token ID351
Feature activation-3.40e+00

Pos logit contributions
 floats+0.026
 waves+0.025
 screams+0.025
ls+0.024
 sinks+0.023
Neg logit contributions
isk-0.017
 whether-0.017
my-0.017
ety-0.016
coat-0.015
Token the
Token ID262
Feature activation+7.44e-01

Pos logit contributions
isk+0.185
my+0.182
 whether+0.181
ety+0.179
coat+0.159
Neg logit contributions
 floats-0.184
 waves-0.183
 screams-0.180
ls-0.172
 sinks-0.165
Token new
Token ID649
Feature activation+1.02e+00

Pos logit contributions
 floats+0.042
 waves+0.041
 screams+0.041
ls+0.040
 sinks+0.039
Neg logit contributions
my-0.040
ety-0.038
isk-0.038
 whether-0.037
coat-0.036
TokenI
Token ID40
Feature activation+1.96e+00

Pos logit contributions
isk+0.267
 whether+0.262
ety+0.238
my+0.238
tr+0.228
Neg logit contributions
 waves-0.279
 floats-0.275
 screams-0.259
 cheers-0.253
 ourselves-0.245
Token'm
Token ID1101
Feature activation-1.47e+00

Pos logit contributions
 floats+0.112
 waves+0.112
 screams+0.108
ls+0.104
 sinks+0.102
Neg logit contributions
isk-0.104
ety-0.102
my-0.101
 whether-0.100
coat-0.093
Token sorry
Token ID7926
Feature activation+8.50e+00

Pos logit contributions
my+0.090
 whether+0.089
isk+0.088
ety+0.088
 hers+0.083
Neg logit contributions
 floats-0.077
 waves-0.072
ls-0.069
 screams-0.068
ars-0.064
Token,
Token ID11
Feature activation-9.23e-01

Pos logit contributions
 floats+0.461
 waves+0.461
 screams+0.445
ls+0.438
 ourselves+0.415
Neg logit contributions
isk-0.481
ety-0.467
my-0.457
 whether-0.450
coat-0.434
Token Wh
Token ID854
Feature activation-1.23e+01

Pos logit contributions
isk+0.046
ety+0.045
my+0.044
 whether+0.043
coat+0.041
Neg logit contributions
 floats-0.056
 waves-0.056
 screams-0.054
ls-0.052
 cheers-0.051
Tokenisk
Token ID1984
Feature activation-1.68e+01

Pos logit contributions
 whether+0.411
 though+0.328
my+0.312
kit+0.310
 brushed+0.308
Neg logit contributions
ls-0.907
 floats-0.890
 ourselves-0.889
 waves-0.876
 screams-0.872
Tokeners
Token ID364
Feature activation+1.02e+00

Pos logit contributions
isk+1.013
my+0.978
ety+0.969
 whether+0.932
coat+0.888
Neg logit contributions
 waves-0.863
 floats-0.862
 screams-0.828
 sinks-0.778
ls-0.773
Token.
Token ID13
Feature activation-5.42e-01

Pos logit contributions
 floats+0.063
 waves+0.063
 screams+0.060
ls+0.059
 sinks+0.058
Neg logit contributions
isk-0.050
my-0.050
ety-0.049
 whether-0.047
coat-0.044
Token Let
Token ID3914
Feature activation-1.19e+00

Pos logit contributions
isk+0.034
ety+0.034
my+0.033
 whether+0.033
coat+0.031
Neg logit contributions
 floats-0.026
 waves-0.025
 screams-0.024
ls-0.023
 cheers-0.022
Token's
Token ID338
Feature activation-4.80e-01

Pos logit contributions
my+0.056
isk+0.048
ety+0.048
 whether+0.041
c+0.038
Neg logit contributions
 floats-0.083
 waves-0.081
 fishermen-0.081
 sinks-0.079
 lighthouse-0.079
Token go
Token ID467
Feature activation+2.81e+00

Pos logit contributions
my+0.026
isk+0.025
ety+0.024
 whether+0.023
coat+0.022
Neg logit contributions
ls-0.030
 floats-0.029
 screams-0.028
 waves-0.027
 sinks-0.027
Token she
Token ID673
Feature activation-3.18e+00

Pos logit contributions
isk+0.112
ety+0.108
my+0.106
 whether+0.103
coat+0.097
Neg logit contributions
 floats-0.171
 waves-0.170
 screams-0.166
ls-0.165
 sinks-0.158
Token could
Token ID714
Feature activation-1.53e-01

Pos logit contributions
isk+0.166
ety+0.166
my+0.164
 whether+0.160
coat+0.150
Neg logit contributions
 floats-0.184
 waves-0.182
 screams-0.176
ls-0.170
 sinks-0.166
Token and
Token ID290
Feature activation+4.16e-01

Pos logit contributions
 whether+0.006
isk+0.006
my+0.006
ety+0.006
coat+0.006
Neg logit contributions
 floats-0.011
 waves-0.011
 screams-0.010
ls-0.010
 ourselves-0.010
Token pushed
Token ID7121
Feature activation-1.39e+00

Pos logit contributions
 waves+0.020
 screams+0.020
 floats+0.019
 sinks+0.018
 cheers+0.018
Neg logit contributions
ety-0.026
isk-0.025
my-0.025
 whether-0.024
tr-0.024
Token Wh
Token ID854
Feature activation-1.23e+01

Pos logit contributions
isk+0.085
ety+0.082
tr+0.080
coat+0.078
 whether+0.078
Neg logit contributions
 screams-0.069
 waves-0.069
ls-0.065
 floats-0.065
 cheers-0.063
Tokenisk
Token ID1984
Feature activation-1.68e+01

Pos logit contributions
 whether+0.156
 though+0.115
ety+0.085
my+0.084
 hers+0.076
Neg logit contributions
 screams-1.140
am-1.136
 waves-1.125
 ourselves-1.116
 floats-1.113
Tokeners
Token ID364
Feature activation+1.02e+00

Pos logit contributions
isk+1.077
ety+1.069
my+1.058
 whether+1.047
coat+0.989
Neg logit contributions
 floats-0.764
 waves-0.759
 screams-0.726
ls-0.705
 sinks-0.674
Token out
Token ID503
Feature activation+2.17e-01

Pos logit contributions
 waves+0.047
 floats+0.047
 screams+0.045
ls+0.043
 sinks+0.042
Neg logit contributions
isk-0.065
ety-0.065
my-0.064
 whether-0.063
coat-0.059
Token of
Token ID286
Feature activation+3.69e+00

Pos logit contributions
 waves+0.009
 floats+0.009
 screams+0.009
ls+0.008
 cheers+0.008
Neg logit contributions
isk-0.015
ety-0.015
my-0.015
 whether-0.014
coat-0.014
Token the
Token ID262
Feature activation+7.49e-01

Pos logit contributions
 waves+0.123
 cheers+0.108
 screams+0.106
 floats+0.103
 shore+0.103
Neg logit contributions
tr-0.291
ety-0.288
isk-0.272
my-0.269
 whether-0.268
Token way
Token ID835
Feature activation+3.31e+00

Pos logit contributions
bled+0.044
 ourselves+0.044
ext+0.044
 floats+0.043
keys+0.043
Neg logit contributions
my-0.039
 screw-0.035
 dictionary-0.033
 thread-0.032
coat-0.031
Token her
Token ID607
Feature activation-4.50e+00

Pos logit contributions
 floats+0.025
 waves+0.025
 screams+0.023
ls+0.022
 cheers+0.022
Neg logit contributions
isk-0.024
ety-0.022
coat-0.021
 whether-0.021
my-0.021
Token pet
Token ID4273
Feature activation-6.19e-01

Pos logit contributions
my+0.248
ety+0.242
isk+0.242
 whether+0.233
tr+0.232
Neg logit contributions
 floats-0.242
 waves-0.236
 screams-0.232
ls-0.231
 sinks-0.229
Token cat
Token ID3797
Feature activation-4.68e+00

Pos logit contributions
isk+0.035
ety+0.034
my+0.034
 whether+0.034
coat+0.032
Neg logit contributions
 floats-0.034
 waves-0.033
 screams-0.032
ls-0.031
 sinks-0.030
Token,
Token ID11
Feature activation-9.23e-01

Pos logit contributions
isk+0.253
ety+0.252
 whether+0.251
coat+0.238
my+0.236
Neg logit contributions
 floats-0.256
 waves-0.247
ls-0.235
 screams-0.229
 moms-0.218
Token Wh
Token ID854
Feature activation-1.23e+01

Pos logit contributions
isk+0.047
ety+0.046
 whether+0.045
my+0.044
coat+0.042
Neg logit contributions
 floats-0.055
 waves-0.054
 screams-0.052
ls-0.050
 cheers-0.049
Tokenisk
Token ID1984
Feature activation-1.68e+01

Pos logit contributions
 whether+0.180
 though+0.137
 Which+0.115
Which+0.111
 hers+0.097
Neg logit contributions
 ourselves-1.094
 screams-1.085
 floats-1.075
 waves-1.062
ext-1.048
Tokeners
Token ID364
Feature activation+1.02e+00

Pos logit contributions
ety+1.130
isk+1.122
 whether+1.119
my+1.107
coat+1.061
Neg logit contributions
 floats-0.699
 waves-0.684
ls-0.648
 screams-0.647
 ourselves-0.609
Token.
Token ID13
Feature activation-5.42e-01

Pos logit contributions
 floats+0.054
 waves+0.054
 screams+0.051
ls+0.049
 cheers+0.048
Neg logit contributions
ety-0.057
isk-0.056
 whether-0.055
my-0.055
coat-0.052
Token Lily
Token ID20037
Feature activation-5.17e+00

Pos logit contributions
 whether+0.029
ety+0.029
coat+0.027
isk+0.027
epad+0.027
Neg logit contributions
 floats-0.029
 waves-0.025
 moms-0.025
 Everyone-0.024
 cheers-0.024
Token loved
Token ID6151
Feature activation+9.41e-01

Pos logit contributions
my+0.238
ety+0.230
isk+0.230
 whether+0.217
coat+0.200
Neg logit contributions
 floats-0.338
 waves-0.337
 screams-0.327
ls-0.327
 sinks-0.319
Token to
Token ID284
Feature activation-1.21e+00

Pos logit contributions
 waves+0.057
 floats+0.056
 screams+0.052
 our+0.050
 moms+0.049
Neg logit contributions
isk-0.048
ety-0.047
my-0.045
coat-0.045
tr-0.044
Token a
Token ID257
Feature activation-1.29e+00

Pos logit contributions
isk+0.004
ety+0.004
my+0.004
 whether+0.004
coat+0.003
Neg logit contributions
 floats-0.005
 waves-0.005
 screams-0.005
ls-0.004
 sinks-0.004
Token lazy
Token ID16931
Feature activation+1.44e+00

Pos logit contributions
isk+0.079
ety+0.078
my+0.078
 whether+0.076
coat+0.072
Neg logit contributions
 floats-0.062
 waves-0.062
 screams-0.060
ls-0.058
 sinks-0.056
Token cat
Token ID3797
Feature activation-4.67e+00

Pos logit contributions
 floats+0.080
 waves+0.080
 screams+0.076
ls+0.073
 cheers+0.071
Neg logit contributions
isk-0.079
ety-0.078
 whether-0.077
my-0.076
coat-0.070
Token named
Token ID3706
Feature activation-2.10e+00

Pos logit contributions
isk+0.254
ety+0.252
 whether+0.242
coat+0.234
my+0.229
Neg logit contributions
 floats-0.249
 waves-0.244
ls-0.230
 screams-0.228
 whale-0.220
Token Wh
Token ID854
Feature activation-1.23e+01

Pos logit contributions
isk+0.100
 whether+0.098
my+0.097
ety+0.097
kit+0.096
Neg logit contributions
 screams-0.112
 floats-0.110
 our-0.110
 waves-0.108
 ourselves-0.104
Tokenisk
Token ID1984
Feature activation-1.68e+01

Pos logit contributions
 whether+0.196
 though+0.168
 Which+0.157
Which+0.142
 hers+0.140
Neg logit contributions
 screams-1.044
 ourselves-1.029
ls-1.022
ext-1.022
ars-1.019
Tokeners
Token ID364
Feature activation+1.03e+00

Pos logit contributions
ety+1.102
isk+1.098
 whether+1.084
my+1.082
coat+1.028
Neg logit contributions
 floats-0.729
 waves-0.724
 screams-0.687
ls-0.674
 sinks-0.639
Token.
Token ID13
Feature activation-5.34e-01

Pos logit contributions
 waves+0.035
 floats+0.034
 screams+0.032
 moms+0.031
 ourselves+0.031
Neg logit contributions
ety-0.070
coat-0.069
isk-0.067
 whether-0.067
my-0.065
Token Wh
Token ID854
Feature activation-1.23e+01

Pos logit contributions
epad+0.024
 whether+0.023
ety+0.023
 pad+0.022
 brushed+0.021
Neg logit contributions
 floats-0.033
 waves-0.031
 moms-0.030
 screams-0.030
 Everyone-0.029
Tokenisk
Token ID1984
Feature activation-1.68e+01

Pos logit contributions
 whether+0.166
rings+0.095
 brushed+0.095
 though+0.090
 hers+0.087
Neg logit contributions
ls-1.119
 screams-1.099
 floats-1.098
 waves-1.093
am-1.092
Tokeners
Token ID364
Feature activation+1.04e+00

Pos logit contributions
isk+1.089
ety+1.086
my+1.070
 whether+1.066
coat+1.011
Neg logit contributions
 floats-0.744
 waves-0.737
 screams-0.701
ls-0.685
 sinks-0.653
Token closed
Token ID4838
Feature activation-1.67e+00

Pos logit contributions
isk+0.005
ety+0.005
 whether+0.004
my+0.004
coat+0.004
Neg logit contributions
 waves-0.003
 screams-0.003
ls-0.003
 floats-0.003
 sinks-0.003
Token.
Token ID13
Feature activation-5.24e-01

Pos logit contributions
isk+0.089
ety+0.086
my+0.084
 whether+0.083
coat+0.077
Neg logit contributions
 waves-0.097
 floats-0.096
 screams-0.092
ls-0.091
 sinks-0.088
Token
Token ID198
Feature activation+2.06e-01

Pos logit contributions
 whether+0.028
ety+0.027
isk+0.027
my+0.027
coat+0.026
Neg logit contributions
 screams-0.027
 floats-0.027
 waves-0.027
ls-0.026
 sinks-0.025
Token
Token ID198
Feature activation+2.05e-01

Pos logit contributions
 screams+0.009
ls+0.009
 floats+0.009
 waves+0.009
ext+0.008
Neg logit contributions
ety-0.013
isk-0.013
 whether-0.013
my-0.013
coat-0.012
TokenWh
Token ID1199
Feature activation-1.29e+01

Pos logit contributions
 floats+0.012
ars+0.012
 screams+0.012
ls+0.012
 waves+0.012
Neg logit contributions
 whether-0.009
ety-0.009
my-0.008
isk-0.008
coat-0.008
Tokenisk
Token ID1984
Feature activation-1.68e+01

Pos logit contributions
 whether+0.179
ety+0.097
my+0.089
coat+0.083
tr+0.082
Neg logit contributions
ls-1.188
am-1.170
 screams-1.164
 floats-1.160
 waves-1.157
Tokeners
Token ID364
Feature activation+1.03e+00

Pos logit contributions
ety+1.095
isk+1.092
my+1.077
 whether+1.072
coat+1.019
Neg logit contributions
 floats-0.739
 waves-0.733
 screams-0.699
ls-0.683
 sinks-0.649
Token asked
Token ID1965
Feature activation+7.58e-01

Pos logit contributions
 waves+0.057
 floats+0.056
 screams+0.056
ls+0.053
 sinks+0.051
Neg logit contributions
isk-0.055
ety-0.055
 whether-0.054
my-0.052
coat-0.050
Token his
Token ID465
Feature activation-2.46e-01

Pos logit contributions
 our+0.038
ls+0.037
 waves+0.036
 moms+0.036
 screams+0.035
Neg logit contributions
ety-0.042
coat-0.041
 pad-0.039
tr-0.039
 gloss-0.038
Token friend
Token ID1545
Feature activation+1.45e+00

Pos logit contributions
isk+0.017
ety+0.017
my+0.017
 whether+0.016
coat+0.016
Neg logit contributions
 waves-0.010
 floats-0.010
 screams-0.010
ls-0.009
 cheers-0.009
Token,
Token ID11
Feature activation-9.19e-01

Pos logit contributions
ls+0.046
 moms+0.045
 waves+0.045
 sorry+0.043
 screams+0.041
Neg logit contributions
coat-0.106
tr-0.102
ety-0.100
kit-0.098
isk-0.097
Token would
Token ID561
Feature activation-3.25e+00

Pos logit contributions
isk+0.010
ety+0.010
 whether+0.009
my+0.009
coat+0.009
Neg logit contributions
 floats-0.011
 waves-0.011
 screams-0.010
ls-0.010
 sinks-0.010
Token heal
Token ID12035
Feature activation-4.68e+00

Pos logit contributions
isk+0.180
my+0.177
ety+0.176
 whether+0.173
coat+0.161
Neg logit contributions
 floats-0.179
 waves-0.177
 screams-0.170
ls-0.168
 sinks-0.160
Token soon
Token ID2582
Feature activation-9.41e-01

Pos logit contributions
ety+0.255
isk+0.253
my+0.244
 whether+0.242
coat+0.240
Neg logit contributions
 waves-0.252
 floats-0.250
 screams-0.244
 ourselves-0.239
ls-0.238
Token.
Token ID13
Feature activation-5.34e-01

Pos logit contributions
isk+0.047
ety+0.046
my+0.045
 whether+0.044
coat+0.041
Neg logit contributions
 floats-0.057
 waves-0.057
 screams-0.055
ls-0.053
 sinks-0.052
Token Wh
Token ID854
Feature activation-1.23e+01

Pos logit contributions
ety+0.027
 whether+0.027
isk+0.027
my+0.027
coat+0.026
Neg logit contributions
 floats-0.030
 waves-0.029
 screams-0.028
 cheers-0.027
ls-0.027
Tokenisk
Token ID1984
Feature activation-1.68e+01

Pos logit contributions
 whether+0.174
 brushed+0.107
coat+0.103
 hers+0.102
 pencil+0.102
Neg logit contributions
ls-1.108
am-1.104
 screams-1.084
 floats-1.065
ext-1.061
Tokeners
Token ID364
Feature activation+1.03e+00

Pos logit contributions
ety+1.107
isk+1.097
 whether+1.088
my+1.087
coat+1.032
Neg logit contributions
 floats-0.722
 waves-0.716
 screams-0.683
ls-0.667
 sinks-0.635
Token was
Token ID373
Feature activation-9.66e-02

Pos logit contributions
 floats+0.063
 waves+0.062
 screams+0.059
ls+0.059
 sinks+0.057
Neg logit contributions
isk-0.051
my-0.051
ety-0.049
 whether-0.047
coat-0.044
Token very
Token ID845
Feature activation-1.11e+00

Pos logit contributions
my+0.006
isk+0.006
 whether+0.006
ety+0.005
coat+0.005
Neg logit contributions
 floats-0.005
 waves-0.005
ls-0.005
 screams-0.005
 sinks-0.004
Token happy
Token ID3772
Feature activation+9.07e-01

Pos logit contributions
my+0.057
 whether+0.056
ety+0.054
isk+0.050
 though+0.050
Neg logit contributions
 floats-0.067
 waves-0.063
ls-0.061
 ourselves-0.058
 screams-0.057
Token to
Token ID284
Feature activation-1.22e+00

Pos logit contributions
 waves+0.051
 floats+0.050
 screams+0.049
ls+0.047
 cheers+0.047
Neg logit contributions
isk-0.050
ety-0.050
my-0.047
 whether-0.046
coat-0.046
Token lock
Token ID5793
Feature activation-5.39e+00

Pos logit contributions
isk+0.084
ety+0.082
my+0.081
 whether+0.080
coat+0.074
Neg logit contributions
 waves-0.059
 floats-0.058
 screams-0.057
ls-0.054
 sinks-0.052
Token on
Token ID319
Feature activation-6.80e-01

Pos logit contributions
isk+0.329
ety+0.317
 whether+0.306
my+0.304
coat+0.282
Neg logit contributions
 waves-0.276
 floats-0.272
 screams-0.262
ls-0.261
 sinks-0.246
Token it
Token ID340
Feature activation-1.75e+00

Pos logit contributions
ety+0.052
isk+0.050
coat+0.048
my+0.048
 whether+0.047
Neg logit contributions
 waves-0.025
 screams-0.024
 floats-0.024
 ourselves-0.022
 sinks-0.022
Token.
Token ID13
Feature activation-5.32e-01

Pos logit contributions
isk+0.111
ety+0.109
 whether+0.101
my+0.101
coat+0.100
Neg logit contributions
 waves-0.081
 screams-0.080
ls-0.078
 floats-0.077
 sinks-0.074
Token Wh
Token ID854
Feature activation-1.23e+01

Pos logit contributions
 whether+0.023
ety+0.023
isk+0.023
my+0.023
coat+0.021
Neg logit contributions
 floats-0.034
 waves-0.034
 screams-0.034
ls-0.032
 sinks-0.031
Tokenisk
Token ID1984
Feature activation-1.68e+01

Pos logit contributions
 whether+0.250
 hers+0.186
 brushed+0.183
 though+0.177
 blonde+0.174
Neg logit contributions
am-1.062
ls-1.052
ars-1.013
 screams-1.012
 floats-0.993
Tokeners
Token ID364
Feature activation+1.04e+00

Pos logit contributions
ety+1.144
 whether+1.124
isk+1.124
my+1.114
coat+1.072
Neg logit contributions
 floats-0.682
 waves-0.676
ls-0.642
 screams-0.640
 sinks-0.595
Token tried
Token ID3088
Feature activation-3.80e-01

Pos logit contributions
 floats+0.058
 waves+0.058
 screams+0.056
ls+0.055
 sinks+0.053
Neg logit contributions
isk-0.055
ety-0.054
my-0.053
 whether-0.053
coat-0.050
Token to
Token ID284
Feature activation-1.23e+00

Pos logit contributions
isk+0.024
ety+0.022
coat+0.022
tr+0.022
shirt+0.020
Neg logit contributions
 screams-0.019
 waves-0.018
ls-0.018
 sinks-0.017
 cheers-0.017
Token open
Token ID1280
Feature activation-4.18e-01

Pos logit contributions
ety+0.068
isk+0.067
my+0.065
 whether+0.065
coat+0.061
Neg logit contributions
 waves-0.067
 screams-0.066
 floats-0.065
 ourselves-0.061
ls-0.061
Token the
Token ID262
Feature activation+7.56e-01

Pos logit contributions
isk+0.032
ety+0.031
tr+0.031
coat+0.030
my+0.030
Neg logit contributions
 waves-0.015
 floats-0.015
ls-0.014
 screams-0.013
 sinks-0.013
Token Ch
Token ID609
Feature activation-6.21e+00

Pos logit contributions
isk+0.047
ety+0.046
my+0.046
 whether+0.045
coat+0.042
Neg logit contributions
 floats-0.052
 waves-0.052
 screams-0.050
ls-0.049
 sinks-0.047
Tokenir
Token ID343
Feature activation+1.43e+00

Pos logit contributions
isk+0.437
ety+0.422
my+0.412
 whether+0.399
coat+0.383
Neg logit contributions
 floats-0.275
 waves-0.274
 screams-0.252
 cheers-0.241
ls-0.235
Tokenpy
Token ID9078
Feature activation+3.04e-01

Pos logit contributions
 floats+0.080
 waves+0.079
 screams+0.075
ls+0.073
 sinks+0.072
Neg logit contributions
isk-0.079
ety-0.078
my-0.077
 whether-0.075
coat-0.070
Token and
Token ID290
Feature activation+4.40e-01

Pos logit contributions
 floats+0.020
 waves+0.019
 ourselves+0.019
 screams+0.019
ls+0.018
Neg logit contributions
isk-0.014
my-0.014
ety-0.014
 whether-0.013
coat-0.012
Token Wh
Token ID854
Feature activation-1.23e+01

Pos logit contributions
 waves+0.029
 floats+0.028
 screams+0.027
ls+0.027
 cheers+0.026
Neg logit contributions
coat-0.020
 whether-0.019
ety-0.019
my-0.019
tr-0.018
Tokenisk
Token ID1984
Feature activation-1.68e+01

Pos logit contributions
 whether+0.140
coat+0.086
ety+0.076
my+0.075
tr+0.060
Neg logit contributions
 floats-1.158
 screams-1.157
ls-1.149
 waves-1.146
 ourselves-1.134
Tokeners
Token ID364
Feature activation+1.03e+00

Pos logit contributions
isk+1.068
ety+1.051
my+1.044
 whether+1.028
coat+0.971
Neg logit contributions
 floats-0.783
 waves-0.778
 screams-0.744
ls-0.717
 sinks-0.691
Token became
Token ID2627
Feature activation+2.94e+00

Pos logit contributions
 floats+0.057
 waves+0.056
 screams+0.055
ls+0.054
 sinks+0.052
Neg logit contributions
isk-0.058
my-0.056
ety-0.055
 whether-0.054
coat-0.051
Token good
Token ID922
Feature activation+4.86e-01

Pos logit contributions
 floats+0.130
 waves+0.128
 screams+0.124
ls+0.119
 cheers+0.113
Neg logit contributions
isk-0.195
ety-0.194
my-0.193
 whether-0.191
coat-0.177
Token friends
Token ID2460
Feature activation+6.37e+00

Pos logit contributions
 floats+0.016
 waves+0.015
 screams+0.014
 sinks+0.013
ls+0.013
Neg logit contributions
ety-0.038
isk-0.038
my-0.038
 whether-0.037
coat-0.036
Token,
Token ID11
Feature activation-9.26e-01

Pos logit contributions
 floats+0.400
 waves+0.398
 screams+0.384
ls+0.374
 sinks+0.365
Neg logit contributions
isk-0.304
ety-0.298
my-0.298
 whether-0.292
coat-0.268
Token was
Token ID373
Feature activation-7.53e-02

Pos logit contributions
 waves+0.025
 floats+0.025
 screams+0.024
ls+0.023
 sinks+0.023
Neg logit contributions
isk-0.025
ety-0.024
my-0.024
 whether-0.023
coat-0.022
Token heading
Token ID9087
Feature activation+6.55e+00

Pos logit contributions
 whether+0.004
my+0.004
isk+0.004
ety+0.004
coat+0.004
Neg logit contributions
 floats-0.004
ls-0.004
 waves-0.004
 screams-0.004
 ourselves-0.004
Token straight
Token ID3892
Feature activation+2.07e+00

Pos logit contributions
 floats+0.369
 waves+0.359
 screams+0.352
ls+0.342
 sinks+0.330
Neg logit contributions
isk-0.355
ety-0.351
my-0.349
 whether-0.346
coat-0.325
Token towards
Token ID3371
Feature activation-1.83e-01

Pos logit contributions
 waves+0.129
 floats+0.128
 screams+0.124
ls+0.121
 sinks+0.118
Neg logit contributions
isk-0.100
ety-0.099
my-0.097
 whether-0.095
coat-0.088
Token Wh
Token ID854
Feature activation-1.23e+01

Pos logit contributions
isk+0.012
ety+0.012
my+0.012
 whether+0.012
tr+0.011
Neg logit contributions
 waves-0.008
 floats-0.008
 screams-0.008
ls-0.007
 cheers-0.007
Tokenisk
Token ID1984
Feature activation-1.68e+01

Pos logit contributions
 whether+0.338
ety+0.295
my+0.277
isk+0.269
 though+0.265
Neg logit contributions
 waves-0.974
 screams-0.968
 floats-0.960
 ourselves-0.939
am-0.923
Tokeners
Token ID364
Feature activation+1.03e+00

Pos logit contributions
isk+1.035
my+1.007
ety+1.001
 whether+0.977
coat+0.927
Neg logit contributions
 floats-0.836
 waves-0.825
 screams-0.791
ls-0.752
 sinks-0.739
Token.
Token ID13
Feature activation-5.37e-01

Pos logit contributions
 waves+0.061
 floats+0.060
 screams+0.058
ls+0.057
 sinks+0.055
Neg logit contributions
isk-0.052
ety-0.052
my-0.051
 whether-0.050
coat-0.046
Token She
Token ID1375
Feature activation-3.30e+00

Pos logit contributions
 whether+0.025
ety+0.024
isk+0.024
my+0.024
coat+0.022
Neg logit contributions
 floats-0.032
 screams-0.032
 waves-0.031
 cheers-0.030
ls-0.030
Token ran
Token ID4966
Feature activation+5.39e+00

Pos logit contributions
isk+0.161
my+0.159
ety+0.159
 whether+0.153
coat+0.143
Neg logit contributions
 floats-0.204
 waves-0.201
 screams-0.195
ls-0.192
 ourselves-0.190
Token as
Token ID355
Feature activation-2.55e+00

Pos logit contributions
 waves+0.313
 floats+0.309
 screams+0.301
ls+0.292
 cheers+0.284
Neg logit contributions
isk-0.283
ety-0.278
my-0.270
 whether-0.266
tr-0.251
Token.
Token ID13
Feature activation-5.49e-01

Pos logit contributions
isk+0.110
ety+0.108
my+0.107
 whether+0.106
coat+0.099
Neg logit contributions
 floats-0.116
 waves-0.115
 screams-0.112
ls-0.106
 cheers-0.105
Token One
Token ID1881
Feature activation-5.94e+00

Pos logit contributions
epad+0.033
 whether+0.031
 brushed+0.030
ge+0.030
pr+0.030
Neg logit contributions
 Everyone-0.024
 moms-0.024
 floats-0.023
 waves-0.022
 Everybody-0.020
Token day
Token ID1110
Feature activation-1.51e+00

Pos logit contributions
isk+0.452
 whether+0.441
rings+0.427
coat+0.420
kit+0.417
Neg logit contributions
 waves-0.194
 floats-0.189
 screams-0.172
ls-0.171
 shark-0.170
Token,
Token ID11
Feature activation-9.19e-01

Pos logit contributions
isk+0.118
ety+0.111
 whether+0.107
my+0.106
coat+0.102
Neg logit contributions
ls-0.058
 floats-0.055
 screams-0.051
 waves-0.049
 sinks-0.048
Token Wh
Token ID854
Feature activation-1.23e+01

Pos logit contributions
isk+0.045
 pad+0.044
epad+0.043
ety+0.043
my+0.042
Neg logit contributions
 floats-0.055
ls-0.054
 moms-0.052
 waves-0.051
 boat-0.049
Tokenisk
Token ID1984
Feature activation-1.68e+01

Pos logit contributions
 whether+0.228
 though+0.160
 hers+0.156
 blonde+0.154
 brushed+0.145
Neg logit contributions
ls-1.078
am-1.066
 screams-1.036
ars-1.034
ext-1.032
Tokeners
Token ID364
Feature activation+1.04e+00

Pos logit contributions
ety+1.110
isk+1.104
 whether+1.092
my+1.090
coat+1.037
Neg logit contributions
 floats-0.720
 waves-0.716
 screams-0.675
ls-0.669
 sinks-0.632
Token saw
Token ID2497
Feature activation-1.16e+00

Pos logit contributions
 floats+0.065
 waves+0.065
 screams+0.063
ls+0.061
 sinks+0.060
Neg logit contributions
isk-0.049
ety-0.048
my-0.048
 whether-0.047
coat-0.043
Token a
Token ID257
Feature activation-1.30e+00

Pos logit contributions
coat+0.076
isk+0.075
ety+0.074
 pad+0.073
 whether+0.070
Neg logit contributions
 waves-0.055
 screams-0.049
 floats-0.049
ls-0.048
 shark-0.046
Token can
Token ID460
Feature activation-1.88e+00

Pos logit contributions
isk+0.101
ety+0.098
 whether+0.098
my+0.096
coat+0.090
Neg logit contributions
 waves-0.046
 floats-0.045
ls-0.040
 screams-0.040
 shark-0.039
Token of
Token ID286
Feature activation+3.68e+00

Pos logit contributions
ety+0.129
isk+0.127
my+0.124
 whether+0.122
coat+0.115
Neg logit contributions
 screams-0.073
 floats-0.073
 waves-0.073
ls-0.072
 sinks-0.069
Token One
Token ID1881
Feature activation-5.96e+00

Pos logit contributions
epad+0.031
 whether+0.030
coat+0.029
 brushed+0.029
ety+0.028
Neg logit contributions
 Everyone-0.025
 moms-0.024
 floats-0.024
 waves-0.022
 Everybody-0.021
Token sunny
Token ID27737
Feature activation-8.78e-01

Pos logit contributions
isk+0.437
 whether+0.420
ety+0.409
rings+0.402
kit+0.400
Neg logit contributions
 waves-0.213
 floats-0.211
 screams-0.192
ls-0.183
 boat-0.178
Token day
Token ID1110
Feature activation-1.51e+00

Pos logit contributions
isk+0.057
ety+0.055
 whether+0.054
my+0.052
coat+0.050
Neg logit contributions
 waves-0.042
 floats-0.042
 screams-0.038
ls-0.038
 sinks-0.036
Token,
Token ID11
Feature activation-9.19e-01

Pos logit contributions
isk+0.116
ety+0.110
my+0.105
 whether+0.104
coat+0.101
Neg logit contributions
ls-0.060
 floats-0.058
 screams-0.055
 waves-0.052
 sinks-0.050
Token Wh
Token ID854
Feature activation-1.23e+01

Pos logit contributions
isk+0.049
ety+0.047
my+0.047
 pad+0.046
epad+0.045
Neg logit contributions
 floats-0.053
ls-0.051
 waves-0.050
 moms-0.048
 boat-0.047
Tokenisk
Token ID1984
Feature activation-1.68e+01

Pos logit contributions
 whether+0.174
 though+0.101
 hers+0.097
 brushed+0.096
my+0.092
Neg logit contributions
ls-1.124
am-1.101
 floats-1.101
 screams-1.093
 waves-1.091
Tokeners
Token ID364
Feature activation+1.04e+00

Pos logit contributions
isk+1.047
ety+1.020
my+1.018
 whether+0.995
coat+0.939
Neg logit contributions
 floats-0.812
 waves-0.807
 screams-0.778
ls-0.738
 cheers-0.723
Token went
Token ID1816
Feature activation+1.55e+00

Pos logit contributions
 floats+0.068
 waves+0.068
 screams+0.066
ls+0.064
 sinks+0.062
Neg logit contributions
isk-0.046
ety-0.045
my-0.045
 whether-0.044
coat-0.041
Token for
Token ID329
Feature activation-4.59e+00

Pos logit contributions
 waves+0.085
ls+0.082
 floats+0.078
 screams+0.077
 sinks+0.072
Neg logit contributions
isk-0.090
ety-0.089
 whether-0.084
my-0.082
coat-0.077
Token a
Token ID257
Feature activation-1.28e+00

Pos logit contributions
ety+0.274
my+0.266
isk+0.263
 whether+0.251
coat+0.247
Neg logit contributions
 waves-0.247
 screams-0.238
 floats-0.231
 sinks-0.217
ls-0.215
Token walk
Token ID2513
Feature activation-2.93e-02

Pos logit contributions
isk+0.081
my+0.077
 whether+0.073
ety+0.073
tr+0.069
Neg logit contributions
 floats-0.059
 waves-0.058
 boat-0.056
ls-0.056
 screams-0.053
Token and
Token ID290
Feature activation+4.57e-01

Pos logit contributions
isk+0.006
ety+0.006
 whether+0.005
my+0.005
tr+0.005
Neg logit contributions
 waves-0.006
 floats-0.005
 screams-0.005
ls-0.005
 cheers-0.005
Token held
Token ID2714
Feature activation-2.92e+00

Pos logit contributions
 waves+0.023
 floats+0.023
 screams+0.022
ls+0.021
 cheers+0.020
Neg logit contributions
isk-0.029
ety-0.027
 whether-0.027
my-0.027
tr-0.025
Token tight
Token ID5381
Feature activation+4.90e-02

Pos logit contributions
isk+0.166
my+0.164
ety+0.164
 whether+0.162
coat+0.152
Neg logit contributions
 floats-0.154
 waves-0.151
 screams-0.147
ls-0.140
 sinks-0.137
Token.
Token ID13
Feature activation-5.19e-01

Pos logit contributions
 waves+0.003
 screams+0.002
 floats+0.002
ls+0.002
 cheers+0.002
Neg logit contributions
isk-0.003
ety-0.003
my-0.003
 whether-0.003
tr-0.003
Token Wh
Token ID854
Feature activation-1.23e+01

Pos logit contributions
isk+0.029
ety+0.028
my+0.028
 whether+0.028
coat+0.026
Neg logit contributions
 floats-0.028
 waves-0.028
 screams-0.027
ls-0.027
 sinks-0.026
Tokenisk
Token ID1984
Feature activation-1.68e+01

Pos logit contributions
 whether+0.129
my+0.067
tr+0.056
 pencil+0.051
 brushed+0.049
Neg logit contributions
 floats-1.178
ls-1.171
 waves-1.163
 screams-1.154
am-1.151
Tokeners
Token ID364
Feature activation+1.04e+00

Pos logit contributions
ety+1.101
isk+1.098
my+1.085
 whether+1.081
coat+1.022
Neg logit contributions
 floats-0.732
 waves-0.725
 screams-0.688
ls-0.674
 sinks-0.639
Token pushed
Token ID7121
Feature activation-1.37e+00

Pos logit contributions
 floats+0.060
 waves+0.059
 screams+0.058
ls+0.056
 sinks+0.054
Neg logit contributions
isk-0.054
ety-0.054
my-0.053
 whether-0.052
coat-0.049
Token him
Token ID683
Feature activation+4.18e-01

Pos logit contributions
isk+0.085
ety+0.083
 whether+0.081
my+0.081
tr+0.078
Neg logit contributions
 waves-0.068
 screams-0.067
 floats-0.067
ls-0.063
 cheers-0.061
Token gently
Token ID15165
Feature activation-9.67e-01

Pos logit contributions
 screams+0.019
 waves+0.019
ls+0.018
 floats+0.017
 sinks+0.017
Neg logit contributions
isk-0.028
ety-0.027
my-0.025
 whether-0.025
coat-0.024
Token,
Token ID11
Feature activation-9.11e-01

Pos logit contributions
isk+0.065
ety+0.062
 whether+0.059
my+0.058
tr+0.053
Neg logit contributions
 waves-0.048
 screams-0.047
 floats-0.043
 cheers-0.042
ls-0.042
Token delight
Token ID10974
Feature activation+5.14e+00

Pos logit contributions
 whether+0.218
my+0.214
isk+0.203
 hers+0.201
ety+0.193
Neg logit contributions
ls-0.146
 floats-0.145
 sinks-0.130
 waves-0.127
 screams-0.126
Token.
Token ID13
Feature activation-4.67e-01

Pos logit contributions
 floats+0.320
 waves+0.317
 screams+0.307
ls+0.299
 sinks+0.293
Neg logit contributions
isk-0.247
ety-0.244
my-0.242
 whether-0.238
coat-0.219
Token
Token ID198
Feature activation+2.51e-01

Pos logit contributions
isk+0.022
ety+0.022
my+0.021
 whether+0.021
coat+0.019
Neg logit contributions
 floats-0.030
 waves-0.029
 screams-0.029
ls-0.028
 sinks-0.027
Token
Token ID198
Feature activation+2.51e-01

Pos logit contributions
 floats+0.017
 waves+0.016
 screams+0.015
 sinks+0.015
ls+0.015
Neg logit contributions
my-0.012
 whether-0.011
isk-0.011
ety-0.011
coat-0.009
TokenM
Token ID44
Feature activation-7.57e+00

Pos logit contributions
 floats+0.019
 waves+0.018
 screams+0.018
ls+0.017
 sinks+0.017
Neg logit contributions
my-0.009
ety-0.009
isk-0.009
 whether-0.009
coat-0.008
Tokenagg
Token ID9460
Feature activation+8.60e+00

Pos logit contributions
ety+0.491
my+0.482
 whether+0.462
isk+0.459
ine+0.437
Neg logit contributions
 floats-0.348
 sinks-0.315
 cheers-0.307
 waves-0.303
 teams-0.296
Tokenie
Token ID494
Feature activation-4.92e+00

Pos logit contributions
 floats+0.716
 waves+0.698
 screams+0.691
 sinks+0.681
 cheers+0.677
Neg logit contributions
my-0.250
ety-0.237
isk-0.221
 whether-0.212
coat-0.162
Token's
Token ID338
Feature activation-4.47e-01

Pos logit contributions
my+0.234
ety+0.219
isk+0.211
 whether+0.203
 pad+0.185
Neg logit contributions
 floats-0.316
 ourselves-0.307
 screams-0.300
ls-0.300
 sinks-0.299
Token mum
Token ID25682
Feature activation+6.90e+00

Pos logit contributions
my+0.025
 whether+0.024
tr+0.023
isk+0.023
 dictionary+0.023
Neg logit contributions
ls-0.023
 floats-0.023
 sinks-0.022
 ourselves-0.021
 explodes-0.021
Token came
Token ID1625
Feature activation-2.74e+00

Pos logit contributions
 waves+0.393
 floats+0.387
 screams+0.372
ls+0.358
 cheers+0.355
Neg logit contributions
isk-0.368
ety-0.362
 whether-0.353
tr-0.336
coat-0.335
Token out
Token ID503
Feature activation+2.50e-01

Pos logit contributions
isk+0.145
ety+0.141
 whether+0.136
my+0.135
coat+0.129
Neg logit contributions
 waves-0.160
 floats-0.156
 screams-0.153
ls-0.147
 cheers-0.145
Token "
Token ID366
Feature activation-4.62e+00

Pos logit contributions
ety+0.022
isk+0.022
my+0.021
 whether+0.020
coat+0.019
Neg logit contributions
 floats-0.036
 waves-0.036
 screams-0.035
ls-0.034
 cheers-0.033
TokenSorry
Token ID14385
Feature activation+4.28e+00

Pos logit contributions
ety+0.251
my+0.234
tr+0.224
isk+0.223
L+0.221
Neg logit contributions
 waves-0.259
 screams-0.244
ls-0.243
 floats-0.242
 cheers-0.236
Token,
Token ID11
Feature activation-5.26e-01

Pos logit contributions
 floats+0.264
 waves+0.262
 screams+0.255
ls+0.250
 sinks+0.245
Neg logit contributions
my-0.209
isk-0.204
ety-0.204
 whether-0.202
coat-0.181
Token sorry
Token ID7926
Feature activation+8.92e+00

Pos logit contributions
 whether+0.027
isk+0.026
my+0.026
tr+0.025
 screw+0.025
Neg logit contributions
ls-0.028
inkle-0.027
am-0.027
led-0.027
ext-0.027
Token,
Token ID11
Feature activation-5.06e-01

Pos logit contributions
 floats+0.573
 waves+0.561
 screams+0.551
 sinks+0.542
ls+0.539
Neg logit contributions
my-0.409
 whether-0.394
ety-0.393
isk-0.376
tr-0.341
Token sorry
Token ID7926
Feature activation+8.92e+00

Pos logit contributions
 whether+0.026
isk+0.026
my+0.025
tr+0.025
 screw+0.024
Neg logit contributions
ls-0.028
 waves-0.026
am-0.026
 floats-0.026
inkle-0.026
Token."
Token ID526
Feature activation+4.20e+00

Pos logit contributions
 floats+0.571
 waves+0.556
 screams+0.546
 sinks+0.537
ls+0.537
Neg logit contributions
my-0.414
 whether-0.397
ety-0.395
isk-0.379
tr-0.350
Token
Token ID198
Feature activation+6.18e-01

Pos logit contributions
 floats+0.303
 waves+0.299
ading+0.292
 screams+0.288
ls+0.286
Neg logit contributions
ety-0.145
my-0.139
 whether-0.138
L-0.125
isk-0.125
Token
Token ID198
Feature activation+6.27e-01

Pos logit contributions
 floats+0.051
 waves+0.050
 losers+0.048
ading+0.047
 sinks+0.046
Neg logit contributions
my-0.017
L-0.016
ety-0.014
 whether-0.014
isk-0.014
TokenTheir
Token ID14574
Feature activation+6.57e+00

Pos logit contributions
 floats+0.037
 waves+0.037
 screams+0.036
ls+0.035
 sinks+0.034
Neg logit contributions
L-0.031
my-0.031
ety-0.031
isk-0.029
 whether-0.029
Token moms
Token ID34015
Feature activation+8.35e+00

Pos logit contributions
inkle+0.245
vation+0.236
 floats+0.227
 ourselves+0.227
text+0.226
Neg logit contributions
ety-0.434
isk-0.426
my-0.425
coat-0.405
 whether-0.402
Token tree
Token ID5509
Feature activation-4.64e-02

Pos logit contributions
 floats+0.079
ls+0.075
 waves+0.074
 ourselves+0.073
 screams+0.073
Neg logit contributions
my-0.084
ety-0.082
isk-0.081
tr-0.080
 whether-0.078
Token heard
Token ID2982
Feature activation-1.18e+00

Pos logit contributions
isk+0.002
my+0.002
ety+0.002
 whether+0.002
coat+0.002
Neg logit contributions
 floats-0.003
 waves-0.003
 screams-0.003
ls-0.003
 ourselves-0.003
Token them
Token ID606
Feature activation+4.25e+00

Pos logit contributions
isk+0.061
my+0.059
 whether+0.057
ety+0.057
tr+0.051
Neg logit contributions
 floats-0.069
 waves-0.067
ls-0.065
 screams-0.064
 sinks-0.063
Token and
Token ID290
Feature activation+6.17e-01

Pos logit contributions
 waves+0.245
 screams+0.241
 floats+0.241
ls+0.227
 cheers+0.223
Neg logit contributions
isk-0.226
ety-0.222
 whether-0.214
my-0.211
coat-0.204
Token felt
Token ID2936
Feature activation+8.16e-01

Pos logit contributions
 waves+0.038
 floats+0.038
 screams+0.037
ls+0.036
 sinks+0.035
Neg logit contributions
isk-0.030
ety-0.030
my-0.029
 whether-0.029
coat-0.027
Token sorry
Token ID7926
Feature activation+8.66e+00

Pos logit contributions
 floats+0.039
 waves+0.035
ls+0.035
 ourselves+0.033
ext+0.032
Neg logit contributions
 whether-0.049
my-0.048
isk-0.047
 hers-0.045
ety-0.045
Token.
Token ID13
Feature activation-4.03e-01

Pos logit contributions
 waves+0.477
 screams+0.474
 floats+0.466
ls+0.451
 cheers+0.434
Neg logit contributions
isk-0.496
ety-0.487
coat-0.460
 whether-0.458
my-0.454
Token It
Token ID632
Feature activation-1.89e+00

Pos logit contributions
isk+0.021
ety+0.020
my+0.020
 whether+0.020
coat+0.018
Neg logit contributions
 floats-0.024
 waves-0.024
 screams-0.023
ls-0.022
 sinks-0.022
Token moved
Token ID3888
Feature activation+6.64e-01

Pos logit contributions
isk+0.108
ety+0.106
 whether+0.105
my+0.102
coat+0.099
Neg logit contributions
 waves-0.102
 screams-0.100
 floats-0.100
ls-0.094
 cheers-0.091
Token a
Token ID257
Feature activation-1.20e+00

Pos logit contributions
 waves+0.033
 floats+0.032
ls+0.032
 screams+0.031
 sinks+0.030
Neg logit contributions
isk-0.042
ety-0.040
 whether-0.038
my-0.038
coat-0.037
Token little
Token ID1310
Feature activation+2.25e+00

Pos logit contributions
isk+0.082
ety+0.079
 whether+0.078
my+0.078
coat+0.072
Neg logit contributions
 waves-0.051
 floats-0.051
 screams-0.050
ls-0.047
 sinks-0.046
Token and
Token ID290
Feature activation+7.45e-01

Pos logit contributions
isk+0.225
ety+0.218
 whether+0.204
tr+0.202
my+0.199
Neg logit contributions
 waves-0.245
 floats-0.235
 screams-0.225
ls-0.218
 cheers-0.215
Token waited
Token ID13488
Feature activation+1.75e+00

Pos logit contributions
 waves+0.046
 floats+0.045
 screams+0.040
ls+0.040
 sinks+0.039
Neg logit contributions
isk-0.038
ety-0.036
 whether-0.035
tr-0.034
coat-0.034
Token in
Token ID287
Feature activation+2.21e+00

Pos logit contributions
 waves+0.113
 floats+0.112
 screams+0.108
ls+0.105
 sinks+0.103
Neg logit contributions
isk-0.081
ety-0.080
my-0.078
 whether-0.075
coat-0.070
Token line
Token ID1627
Feature activation+2.54e+00

Pos logit contributions
 waves+0.116
 cheers+0.108
 screams+0.107
 floats+0.105
 ourselves+0.100
Neg logit contributions
ety-0.143
isk-0.135
coat-0.128
tr-0.126
my-0.124
Token.
Token ID13
Feature activation-1.96e-01

Pos logit contributions
 floats+0.173
 waves+0.168
 screams+0.165
ls+0.161
 sinks+0.158
Neg logit contributions
my-0.109
isk-0.107
ety-0.105
 whether-0.104
coat-0.097
Token Everyone
Token ID11075
Feature activation+9.01e+00

Pos logit contributions
ety+0.012
isk+0.012
my+0.011
 whether+0.011
coat+0.011
Neg logit contributions
 waves-0.011
 floats-0.010
 screams-0.010
 cheers-0.009
 sails-0.009
Token was
Token ID373
Feature activation+2.57e-01

Pos logit contributions
 floats+0.542
 waves+0.540
 screams+0.525
ls+0.510
 sinks+0.497
Neg logit contributions
isk-0.444
my-0.443
ety-0.442
 whether-0.432
coat-0.400
Token ready
Token ID3492
Feature activation+2.08e+00

Pos logit contributions
 floats+0.014
 waves+0.014
ls+0.013
 screams+0.013
 sinks+0.012
Neg logit contributions
my-0.015
isk-0.014
ety-0.014
 whether-0.014
coat-0.014
Token to
Token ID284
Feature activation-8.58e-01

Pos logit contributions
 waves+0.145
 floats+0.144
 screams+0.140
ls+0.136
 cheers+0.133
Neg logit contributions
isk-0.086
ety-0.084
my-0.081
 whether-0.078
coat-0.073
Token pay
Token ID1414
Feature activation-4.59e+00

Pos logit contributions
isk+0.041
my+0.040
 whether+0.039
ety+0.038
coat+0.036
Neg logit contributions
ls-0.053
 floats-0.053
 screams-0.052
 waves-0.051
 sinks-0.049
Token,
Token ID11
Feature activation-5.39e-01

Pos logit contributions
my+0.203
 whether+0.197
isk+0.185
ety+0.183
coat+0.159
Neg logit contributions
 floats-0.311
 waves-0.294
 screams-0.289
 sinks-0.283
ls-0.283
Token said
Token ID531
Feature activation+2.29e+00

Pos logit contributions
 waves+0.025
 floats+0.025
 screams+0.024
ls+0.023
 cheers+0.022
Neg logit contributions
isk-0.031
ety-0.031
my-0.030
 whether-0.030
coat-0.028
Token,
Token ID11
Feature activation-8.39e-01

Pos logit contributions
 floats+0.123
 waves+0.122
 screams+0.119
ls+0.117
 sinks+0.111
Neg logit contributions
isk-0.133
ety-0.131
my-0.128
 whether-0.124
coat-0.119
Token "
Token ID366
Feature activation-4.93e+00

Pos logit contributions
ety+0.040
isk+0.038
my+0.037
 whether+0.035
coat+0.034
Neg logit contributions
 waves-0.054
 screams-0.053
 floats-0.053
 cheers-0.050
ls-0.050
TokenI
Token ID40
Feature activation+2.04e+00

Pos logit contributions
ety+0.223
isk+0.220
my+0.219
 whether+0.213
coat+0.194
Neg logit contributions
 floats-0.322
 waves-0.321
 screams-0.310
ls-0.304
 sinks-0.296
Token'm
Token ID1101
Feature activation-1.39e+00

Pos logit contributions
 screams+0.085
 floats+0.085
 waves+0.082
 cheers+0.080
 sinks+0.078
Neg logit contributions
isk-0.132
ety-0.127
coat-0.126
 whether-0.123
my-0.121
Token sorry
Token ID7926
Feature activation+8.61e+00

Pos logit contributions
 whether+0.104
my+0.102
isk+0.098
ety+0.096
 hers+0.096
Neg logit contributions
 floats-0.055
 waves-0.053
ls-0.049
 screams-0.044
 sinks-0.041
Token,
Token ID11
Feature activation-8.15e-01

Pos logit contributions
 floats+0.510
 waves+0.506
 screams+0.487
ls+0.472
 sinks+0.464
Neg logit contributions
my-0.438
isk-0.438
ety-0.435
 whether-0.431
coat-0.389
Token my
Token ID616
Feature activation+4.09e+00

Pos logit contributions
 whether+0.033
my+0.032
ety+0.031
isk+0.030
 though+0.028
Neg logit contributions
 floats-0.054
 waves-0.053
am-0.053
ls-0.053
 explodes-0.052
Token loves
Token ID10408
Feature activation+3.77e+00

Pos logit contributions
 floats+0.181
 waves+0.179
 screams+0.170
ls+0.163
 sinks+0.157
Neg logit contributions
isk-0.275
ety-0.272
my-0.267
 whether-0.265
coat-0.251
Token.
Token ID13
Feature activation-4.20e-01

Pos logit contributions
 floats+0.243
 waves+0.241
 screams+0.232
ls+0.229
 sinks+0.223
Neg logit contributions
my-0.172
isk-0.168
 whether-0.166
ety-0.165
coat-0.152
Token I
Token ID314
Feature activation+5.13e+00

Pos logit contributions
isk+0.022
ety+0.021
my+0.021
 whether+0.021
coat+0.020
Neg logit contributions
 floats-0.025
 waves-0.024
 screams-0.024
ls-0.023
 sinks-0.022
Token eyes
Token ID2951
Feature activation+1.86e+00

Pos logit contributions
isk+0.292
ety+0.284
my+0.275
 whether+0.274
coat+0.257
Neg logit contributions
 waves-0.259
 floats-0.254
 screams-0.247
ls-0.235
 cheers-0.231
Token and
Token ID290
Feature activation+7.78e-01

Pos logit contributions
 waves+0.095
 floats+0.091
 screams+0.086
ls+0.084
 cheers+0.082
Neg logit contributions
isk-0.123
ety-0.117
 whether-0.109
my-0.107
tr-0.101
Token was
Token ID373
Feature activation+2.57e-01

Pos logit contributions
 waves+0.044
 floats+0.043
 screams+0.042
ls+0.040
 cheers+0.039
Neg logit contributions
isk-0.045
ety-0.044
my-0.042
 whether-0.041
coat-0.039
Token looking
Token ID2045
Feature activation-4.07e+00

Pos logit contributions
 waves+0.014
 floats+0.014
 screams+0.014
 cheers+0.013
ls+0.013
Neg logit contributions
isk-0.015
ety-0.015
my-0.014
 whether-0.014
coat-0.013
Token at
Token ID379
Feature activation+1.39e+00

Pos logit contributions
my+0.210
 whether+0.208
isk+0.208
ety+0.206
coat+0.188
Neg logit contributions
 floats-0.234
 waves-0.228
 screams-0.225
ls-0.218
 sinks-0.214
Token him
Token ID683
Feature activation+7.75e-01

Pos logit contributions
 floats+0.072
 waves+0.071
 screams+0.068
ls+0.066
 sinks+0.064
Neg logit contributions
isk-0.082
ety-0.081
my-0.079
 whether-0.078
coat-0.074
Token curiously
Token ID43211
Feature activation+1.86e+00

Pos logit contributions
 waves+0.045
 floats+0.045
 screams+0.043
ls+0.042
 sinks+0.041
Neg logit contributions
isk-0.040
ety-0.040
my-0.039
 whether-0.038
coat-0.036
Token.
Token ID13
Feature activation-1.70e-01

Pos logit contributions
 waves+0.105
 floats+0.103
 screams+0.100
ls+0.095
 sinks+0.094
Neg logit contributions
isk-0.104
ety-0.103
my-0.099
 whether-0.097
coat-0.092
Token M
Token ID337
Feature activation-3.77e+00

Pos logit contributions
isk+0.009
ety+0.009
my+0.009
 whether+0.008
coat+0.008
Neg logit contributions
 floats-0.010
 waves-0.010
 screams-0.010
ls-0.009
 sinks-0.009
Tokenummy
Token ID13513
Feature activation+3.41e+00

Pos logit contributions
isk+0.260
my+0.259
ety+0.259
 whether+0.247
coat+0.237
Neg logit contributions
 floats-0.163
 waves-0.157
 screams-0.149
ls-0.145
 sinks-0.144
Token smiled
Token ID13541
Feature activation+4.25e+00

Pos logit contributions
 waves+0.189
 floats+0.184
 screams+0.174
ls+0.168
 cheers+0.167
Neg logit contributions
isk-0.193
ety-0.189
 whether-0.184
coat-0.174
tr-0.174
Token cl
Token ID537
Feature activation+6.43e+00

Pos logit contributions
 ourselves+0.291
ags+0.279
ext+0.278
 floats+0.273
ls+0.268
Neg logit contributions
my-0.175
 brushed-0.146
ety-0.134
 whether-0.133
 though-0.130
Tokenapped
Token ID6320
Feature activation+6.91e+00

Pos logit contributions
ls+0.324
 screams+0.313
 waves+0.306
 sinks+0.295
 floats+0.292
Neg logit contributions
ety-0.381
my-0.380
isk-0.372
c-0.351
ress-0.341
Token and
Token ID290
Feature activation+4.16e-01

Pos logit contributions
 floats+0.466
 waves+0.452
 screams+0.447
ls+0.442
 sinks+0.432
Neg logit contributions
my-0.298
 whether-0.290
isk-0.283
ety-0.280
coat-0.249
Token cheered
Token ID34767
Feature activation+7.10e+00

Pos logit contributions
ls+0.019
ext+0.018
 sinks+0.017
 floats+0.017
inkle+0.017
Neg logit contributions
my-0.027
 whether-0.024
ety-0.024
isk-0.024
 brushed-0.023
Token for
Token ID329
Feature activation-4.57e+00

Pos logit contributions
 floats+0.464
 waves+0.459
 screams+0.447
ls+0.439
 sinks+0.429
Neg logit contributions
isk-0.319
my-0.314
ety-0.309
 whether-0.309
coat-0.277
Token them
Token ID606
Feature activation+4.02e+00

Pos logit contributions
isk+0.274
 whether+0.267
my+0.266
ety+0.264
coat+0.242
Neg logit contributions
 floats-0.233
 waves-0.230
 screams-0.222
ls-0.218
 sinks-0.210
Token.
Token ID13
Feature activation-5.33e-01

Pos logit contributions
 floats+0.264
 waves+0.257
 screams+0.251
ls+0.251
 sinks+0.244
Neg logit contributions
my-0.182
isk-0.176
 whether-0.174
ety-0.170
coat-0.157
Token The
Token ID383
Feature activation+2.63e-01

Pos logit contributions
isk+0.031
my+0.031
ety+0.031
 whether+0.030
tr+0.028
Neg logit contributions
 waves-0.027
 floats-0.027
ls-0.026
 screams-0.026
 sinks-0.024
Token band
Token ID4097
Feature activation-1.11e-01

Pos logit contributions
ls+0.014
 floats+0.013
 ourselves+0.013
 screams+0.013
am+0.013
Neg logit contributions
my-0.014
coat-0.014
isk-0.013
 thread-0.013
ety-0.013
Token felt
Token ID2936
Feature activation+6.35e-01

Pos logit contributions
my+0.005
isk+0.005
coat+0.005
ety+0.005
tr+0.005
Neg logit contributions
 waves-0.007
 floats-0.007
ls-0.007
 sinks-0.007
 screams-0.007
Token so
Token ID523
Feature activation+1.52e+00

Pos logit contributions
ls+0.044
 floats+0.043
 screams+0.041
 waves+0.040
 sinks+0.039
Neg logit contributions
 whether-0.027
my-0.026
 hers-0.024
 though-0.024
isk-0.023
TokenLook
Token ID8567
Feature activation-9.69e-01

Pos logit contributions
isk+0.260
 whether+0.248
ety+0.240
my+0.238
coat+0.228
Neg logit contributions
 floats-0.263
 waves-0.256
 screams-0.236
 cheers-0.233
 ourselves-0.231
Token,
Token ID11
Feature activation-4.64e-01

Pos logit contributions
my+0.047
ety+0.046
isk+0.045
 whether+0.045
coat+0.040
Neg logit contributions
 floats-0.060
 waves-0.060
 screams-0.059
 sinks-0.057
 cheers-0.056
Token mom
Token ID1995
Feature activation+2.73e+00

Pos logit contributions
isk+0.032
ety+0.031
my+0.031
 whether+0.031
coat+0.029
Neg logit contributions
 floats-0.020
 waves-0.019
 screams-0.018
ls-0.018
 cheers-0.017
Token!
Token ID0
Feature activation+1.76e+00

Pos logit contributions
 floats+0.125
 waves+0.125
 screams+0.111
 cheers+0.109
ls+0.108
Neg logit contributions
isk-0.178
 whether-0.166
ety-0.163
coat-0.159
tr-0.159
Token The
Token ID383
Feature activation+7.11e-01

Pos logit contributions
 floats+0.096
 waves+0.095
 screams+0.091
ls+0.088
 cheers+0.086
Neg logit contributions
isk-0.099
ety-0.097
my-0.096
 whether-0.095
coat-0.089
Token sunshine
Token ID34488
Feature activation+3.02e+00

Pos logit contributions
ls+0.041
ete+0.041
am+0.040
 fishermen+0.039
ext+0.039
Neg logit contributions
my-0.035
tr-0.034
ety-0.031
 thread-0.031
coat-0.030
Token is
Token ID318
Feature activation+2.80e+00

Pos logit contributions
 fishermen+0.167
ext+0.152
 victorious+0.152
 screams+0.151
 teams+0.151
Neg logit contributions
my-0.167
tr-0.152
isk-0.148
ety-0.146
coat-0.144
Token back
Token ID736
Feature activation+6.35e+00

Pos logit contributions
 screams+0.169
ls+0.167
 floats+0.167
 waves+0.164
 sinks+0.157
Neg logit contributions
my-0.143
 whether-0.134
ety-0.133
isk-0.129
tr-0.126
Token!
Token ID0
Feature activation+1.78e+00

Pos logit contributions
 floats+0.414
 screams+0.406
 waves+0.405
 cheers+0.392
 sinks+0.390
Neg logit contributions
my-0.313
ety-0.298
 whether-0.281
isk-0.261
coat-0.258
Token Can
Token ID1680
Feature activation-1.13e+00

Pos logit contributions
 waves+0.101
 floats+0.101
 screams+0.098
ls+0.096
 sinks+0.093
Neg logit contributions
isk-0.093
ety-0.093
my-0.092
 whether-0.091
coat-0.085
Token we
Token ID356
Feature activation+7.99e+00

Pos logit contributions
my+0.078
ety+0.076
 whether+0.073
isk+0.071
tr+0.068
Neg logit contributions
 waves-0.049
 screams-0.048
 fishermen-0.046
 floats-0.046
ls-0.044
Token.
Token ID13
Feature activation-4.87e-01

Pos logit contributions
my+0.157
ety+0.157
isk+0.156
 whether+0.154
coat+0.139
Neg logit contributions
 floats-0.216
 waves-0.215
 screams-0.209
ls-0.202
 sinks-0.198
Token The
Token ID383
Feature activation+3.06e-01

Pos logit contributions
isk+0.020
ety+0.020
 whether+0.018
my+0.018
coat+0.018
Neg logit contributions
 floats-0.035
 waves-0.034
 screams-0.033
 cheers-0.032
 sinks-0.032
Token tiger
Token ID26241
Feature activation+1.00e+00

Pos logit contributions
ls+0.018
 floats+0.018
 screams+0.017
 sinks+0.017
 waves+0.017
Neg logit contributions
my-0.016
isk-0.016
 whether-0.014
ety-0.014
tr-0.014
Token was
Token ID373
Feature activation-5.06e-02

Pos logit contributions
 waves+0.067
 floats+0.067
 screams+0.066
 ourselves+0.065
 cheers+0.063
Neg logit contributions
my-0.043
ety-0.041
isk-0.041
 whether-0.039
tr-0.036
Token very
Token ID845
Feature activation-1.08e+00

Pos logit contributions
isk+0.003
ety+0.003
my+0.003
 whether+0.003
coat+0.003
Neg logit contributions
 floats-0.003
 waves-0.003
 screams-0.002
ls-0.002
 sinks-0.002
Token hungry
Token ID14720
Feature activation+8.05e-01

Pos logit contributions
ety+0.052
my+0.051
isk+0.046
 whether+0.046
 though+0.041
Neg logit contributions
 ourselves-0.066
text-0.065
 floats-0.063
 waves-0.062
 screams-0.062
Token and
Token ID290
Feature activation+4.45e-01

Pos logit contributions
 waves+0.044
 floats+0.043
 screams+0.042
ls+0.040
 cheers+0.039
Neg logit contributions
isk-0.047
ety-0.045
my-0.043
 whether-0.042
tr-0.041
Token wanted
Token ID2227
Feature activation-1.35e+00

Pos logit contributions
 floats+0.030
 screams+0.029
 waves+0.029
ls+0.028
 cheers+0.028
Neg logit contributions
my-0.019
ety-0.019
isk-0.018
 whether-0.018
coat-0.016
Token to
Token ID284
Feature activation-1.20e+00

Pos logit contributions
isk+0.050
ety+0.048
my+0.048
 whether+0.046
coat+0.042
Neg logit contributions
 floats-0.100
 waves-0.099
 screams-0.096
ls-0.094
 sinks-0.092
Token eat
Token ID4483
Feature activation-3.99e+00

Pos logit contributions
isk+0.048
my+0.046
ety+0.043
 whether+0.042
coat+0.036
Neg logit contributions
 floats-0.087
ls-0.086
 waves-0.082
 screams-0.082
 cheers-0.079
Token something
Token ID1223
Feature activation-5.32e+00

Pos logit contributions
isk+0.198
ety+0.194
my+0.194
 whether+0.192
coat+0.175
Neg logit contributions
 floats-0.242
 waves-0.239
 screams-0.233
ls-0.226
 cheers-0.220
Token
Token ID220
Feature activation-2.03e+00

Pos logit contributions
isk+0.031
ety+0.030
 whether+0.028
coat+0.027
my+0.027
Neg logit contributions
 waves-0.030
 floats-0.029
 screams-0.029
 cheers-0.029
 sinks-0.027
Token
Token ID198
Feature activation+2.07e-01

Pos logit contributions
isk+0.120
ety+0.119
 whether+0.116
my+0.116
coat+0.110
Neg logit contributions
 floats-0.099
 waves-0.099
 screams-0.097
ls-0.094
 cheers-0.090
Token
Token ID198
Feature activation+1.97e-01

Pos logit contributions
 waves+0.010
 floats+0.009
 screams+0.009
ls+0.009
 cheers+0.009
Neg logit contributions
isk-0.013
ety-0.013
my-0.012
 whether-0.012
coat-0.012
TokenOne
Token ID3198
Feature activation-7.89e+00

Pos logit contributions
 waves+0.011
 floats+0.011
 screams+0.011
ls+0.011
 cheers+0.011
Neg logit contributions
isk-0.010
ety-0.010
my-0.009
 whether-0.009
coat-0.009
Token day
Token ID1110
Feature activation-1.51e+00

Pos logit contributions
isk+0.531
ety+0.502
 whether+0.493
my+0.475
coat+0.469
Neg logit contributions
 waves-0.364
 floats-0.362
 screams-0.339
ls-0.329
 cheers-0.313
Token Molly
Token ID30236
Feature activation+2.48e+00

Pos logit contributions
isk+0.091
ety+0.088
my+0.085
 whether+0.083
tr+0.080
Neg logit contributions
 waves-0.080
 floats-0.080
 screams-0.078
ls-0.077
 cheers-0.074
Token found
Token ID1043
Feature activation-3.40e+00

Pos logit contributions
 floats+0.161
 waves+0.157
 screams+0.154
ls+0.153
 ourselves+0.152
Neg logit contributions
my-0.113
ety-0.110
isk-0.109
 whether-0.104
coat-0.098
Token a
Token ID257
Feature activation-1.30e+00

Pos logit contributions
isk+0.208
ety+0.205
tr+0.182
 whether+0.181
coat+0.180
Neg logit contributions
 waves-0.186
 screams-0.170
 cheers-0.162
 floats-0.160
 ourselves-0.158
Token telephone
Token ID11426
Feature activation-4.80e+00

Pos logit contributions
isk+0.089
ety+0.084
 whether+0.080
my+0.079
coat+0.072
Neg logit contributions
 waves-0.060
 floats-0.058
 screams-0.053
 boat-0.052
 shark-0.052
Token on
Token ID319
Feature activation-6.67e-01

Pos logit contributions
ety+0.318
isk+0.316
 whether+0.290
my+0.280
coat+0.257
Neg logit contributions
 waves-0.233
 floats-0.223
 screams-0.215
 cheers-0.200
ls-0.197
Token the
Token ID262
Feature activation+7.41e-01

Pos logit contributions
ety+0.049
isk+0.048
 whether+0.045
my+0.045
coat+0.044
Neg logit contributions
 waves-0.027
 floats-0.025
 screams-0.025
 cheers-0.023
 ourselves-0.023
Token.
Token ID13
Feature activation-1.20e-01

Pos logit contributions
isk+0.206
my+0.204
ety+0.201
 whether+0.200
coat+0.183
Neg logit contributions
 floats-0.272
 waves-0.268
 screams-0.264
ls-0.255
 sinks-0.248
Token Joe
Token ID5689
Feature activation+3.57e+00

Pos logit contributions
isk+0.006
my+0.006
 whether+0.006
ety+0.006
tr+0.005
Neg logit contributions
 floats-0.007
 waves-0.007
ls-0.007
 screams-0.007
 sinks-0.007
Token wanted
Token ID2227
Feature activation-9.42e-01

Pos logit contributions
 floats+0.228
 waves+0.224
 ourselves+0.219
 screams+0.218
ls+0.217
Neg logit contributions
my-0.167
isk-0.157
ety-0.157
 whether-0.151
coat-0.138
Token to
Token ID284
Feature activation-7.85e-01

Pos logit contributions
isk+0.054
ety+0.051
coat+0.049
tr+0.048
my+0.047
Neg logit contributions
 waves-0.055
 floats-0.052
 screams-0.051
 cheers-0.049
ls-0.049
Token check
Token ID2198
Feature activation-7.18e+00

Pos logit contributions
my+0.034
 whether+0.033
isk+0.033
ety+0.031
 sew+0.028
Neg logit contributions
ls-0.054
 floats-0.052
 screams-0.050
 waves-0.049
 sinks-0.049
Token on
Token ID319
Feature activation-2.33e-01

Pos logit contributions
isk+0.432
ety+0.426
my+0.420
 whether+0.414
coat+0.393
Neg logit contributions
 floats-0.362
 waves-0.361
 screams-0.345
ls-0.336
 sinks-0.324
Token the
Token ID262
Feature activation+1.20e+00

Pos logit contributions
isk+0.015
ety+0.015
my+0.014
 whether+0.014
coat+0.014
Neg logit contributions
 waves-0.011
 floats-0.011
 screams-0.010
ls-0.010
 cheers-0.010
Token pie
Token ID2508
Feature activation-2.21e+00

Pos logit contributions
 floats+0.066
 waves+0.066
 screams+0.063
ls+0.062
 sinks+0.060
Neg logit contributions
isk-0.066
ety-0.065
my-0.065
 whether-0.063
coat-0.059
Token,
Token ID11
Feature activation-4.60e-01

Pos logit contributions
ety+0.127
isk+0.126
 whether+0.119
my+0.116
coat+0.112
Neg logit contributions
 waves-0.119
 floats-0.113
 screams-0.109
ls-0.109
 sinks-0.105
Token but
Token ID475
Feature activation-4.91e+00

Pos logit contributions
isk+0.015
ety+0.015
my+0.014
 whether+0.013
coat+0.012
Neg logit contributions
 waves-0.036
 floats-0.036
 screams-0.035
 cheers-0.034
 sinks-0.034
Token he
Token ID339
Feature activation+7.80e-01

Pos logit contributions
isk+0.276
my+0.274
ety+0.274
 whether+0.271
coat+0.248
Neg logit contributions
 floats-0.264
 waves-0.261
 screams-0.252
ls-0.245
 cheers-0.236
Token to
Token ID284
Feature activation-8.30e-01

Pos logit contributions
isk+0.000
ety+0.000
tr+0.000
coat+0.000
rings+0.000
Neg logit contributions
 screams-0.000
 waves-0.000
 cheers-0.000
ls-0.000
 shore-0.000
Token hop
Token ID1725
Feature activation-3.81e-01

Pos logit contributions
isk+0.046
ety+0.045
my+0.045
 whether+0.044
coat+0.041
Neg logit contributions
 floats-0.046
 waves-0.046
 screams-0.044
ls-0.043
 sinks-0.042
Token back
Token ID736
Feature activation+6.28e+00

Pos logit contributions
isk+0.025
ety+0.025
my+0.024
 whether+0.024
coat+0.023
Neg logit contributions
 waves-0.017
 floats-0.016
 screams-0.016
ls-0.016
 sinks-0.015
Token home
Token ID1363
Feature activation+1.53e+00

Pos logit contributions
 waves+0.316
ls+0.306
 floats+0.301
 screams+0.300
 sinks+0.278
Neg logit contributions
isk-0.386
ety-0.367
my-0.357
 whether-0.355
tr-0.349
Token,
Token ID11
Feature activation-5.06e-01

Pos logit contributions
 waves+0.081
 screams+0.081
 floats+0.079
ls+0.077
 cheers+0.074
Neg logit contributions
isk-0.085
ety-0.085
my-0.082
 whether-0.081
tr-0.077
Token but
Token ID475
Feature activation-4.95e+00

Pos logit contributions
isk+0.020
ety+0.019
tr+0.019
coat+0.017
my+0.017
Neg logit contributions
 screams-0.036
 waves-0.036
 sinks-0.034
 cheers-0.033
 floats-0.033
Token his
Token ID465
Feature activation+1.51e-01

Pos logit contributions
isk+0.266
ety+0.262
my+0.261
 whether+0.257
coat+0.237
Neg logit contributions
 floats-0.284
 waves-0.281
 screams-0.269
ls-0.262
 cheers-0.255
Token paw
Token ID41827
Feature activation-4.02e+00

Pos logit contributions
 waves+0.008
 screams+0.008
 floats+0.008
ls+0.008
 sinks+0.008
Neg logit contributions
isk-0.008
ety-0.008
 whether-0.008
my-0.008
coat-0.007
Token hurt
Token ID5938
Feature activation+3.15e+00

Pos logit contributions
isk+0.230
ety+0.225
 whether+0.223
my+0.220
tr+0.202
Neg logit contributions
 waves-0.212
 floats-0.209
 screams-0.207
ls-0.201
 sinks-0.193
Token too
Token ID1165
Feature activation-8.62e-01

Pos logit contributions
 screams+0.143
 waves+0.141
 sinks+0.126
ls+0.124
 ourselves+0.124
Neg logit contributions
isk-0.200
ety-0.198
 whether-0.193
coat-0.181
tr-0.180
Token much
Token ID881
Feature activation-2.30e+00

Pos logit contributions
isk+0.047
ety+0.046
my+0.046
 whether+0.045
coat+0.042
Neg logit contributions
 floats-0.048
 waves-0.048
 screams-0.046
ls-0.044
 sinks-0.043
Token take
Token ID1011
Feature activation-2.80e+00

Pos logit contributions
 floats+0.373
 waves+0.372
 screams+0.358
ls+0.349
 sinks+0.338
Neg logit contributions
isk-0.333
ety-0.327
my-0.323
 whether-0.319
coat-0.297
Token things
Token ID1243
Feature activation-4.78e+00

Pos logit contributions
isk+0.155
my+0.154
ety+0.154
 whether+0.151
coat+0.140
Neg logit contributions
 floats-0.151
 waves-0.150
 screams-0.145
ls-0.141
 sinks-0.136
Token.
Token ID13
Feature activation-5.33e-01

Pos logit contributions
my+0.259
ety+0.255
 whether+0.246
isk+0.242
coat+0.232
Neg logit contributions
 floats-0.278
 waves-0.272
 screams-0.267
 cheers-0.254
 sinks-0.245
Token Now
Token ID2735
Feature activation+4.12e+00

Pos logit contributions
isk+0.031
ety+0.030
my+0.030
 whether+0.029
coat+0.029
Neg logit contributions
 floats-0.027
 waves-0.027
 screams-0.026
 cheers-0.025
 ourselves-0.025
Token,
Token ID11
Feature activation-9.03e-01

Pos logit contributions
 floats+0.208
 waves+0.205
 screams+0.200
ls+0.194
 sinks+0.187
Neg logit contributions
isk-0.245
my-0.244
ety-0.242
 whether-0.240
coat-0.221
Token give
Token ID1577
Feature activation-4.85e+00

Pos logit contributions
isk+0.050
ety+0.049
my+0.047
coat+0.046
 whether+0.046
Neg logit contributions
 waves-0.049
 floats-0.049
 screams-0.046
 ourselves-0.045
ls-0.044
Token it
Token ID340
Feature activation-1.77e+00

Pos logit contributions
isk+0.339
ety+0.335
my+0.332
 whether+0.328
coat+0.312
Neg logit contributions
 floats-0.198
 waves-0.197
 screams-0.187
ls-0.180
 sinks-0.172
Token back
Token ID736
Feature activation+5.88e+00

Pos logit contributions
isk+0.089
ety+0.088
my+0.087
 whether+0.086
coat+0.080
Neg logit contributions
 floats-0.105
 waves-0.105
 screams-0.101
ls-0.099
 sinks-0.096
Token to
Token ID284
Feature activation-1.21e+00

Pos logit contributions
 waves+0.385
 floats+0.384
 screams+0.369
ls+0.366
 ourselves+0.360
Neg logit contributions
isk-0.257
ety-0.243
my-0.235
 whether-0.235
coat-0.220
Token me
Token ID502
Feature activation+4.66e+00

Pos logit contributions
isk+0.082
my+0.081
 whether+0.079
ety+0.078
tr+0.069
Neg logit contributions
 floats-0.056
 waves-0.054
ls-0.053
 screams-0.053
 sinks-0.050
Token before
Token ID878
Feature activation-5.71e+00

Pos logit contributions
 floats+0.254
 waves+0.252
 screams+0.243
ls+0.236
 sinks+0.228
Neg logit contributions
isk-0.260
ety-0.256
my-0.254
 whether-0.250
coat-0.234
Token their
Token ID511
Feature activation+5.51e+00

Pos logit contributions
isk+0.238
my+0.238
ety+0.236
 whether+0.233
coat+0.217
Neg logit contributions
 floats-0.159
 waves-0.158
 screams-0.152
ls-0.145
 sinks-0.140
Token ice
Token ID4771
Feature activation-1.85e+00

Pos logit contributions
ading+0.256
 victorious+0.252
izes+0.248
 floats+0.248
 explodes+0.247
Neg logit contributions
my-0.289
ety-0.283
 pencil-0.280
tr-0.279
isk-0.277
Token-
Token ID12
Feature activation-6.08e+00

Pos logit contributions
my+0.086
ety+0.086
isk+0.082
coat+0.079
tr+0.076
Neg logit contributions
 floats-0.114
 waves-0.112
 screams-0.112
 cheers-0.111
 explodes-0.107
Tokencream
Token ID36277
Feature activation-6.34e+00

Pos logit contributions
isk+0.314
ety+0.311
my+0.307
 whether+0.299
coat+0.280
Neg logit contributions
 floats-0.358
 waves-0.356
 screams-0.344
ls-0.333
 sinks-0.325
Token.
Token ID13
Feature activation-3.14e-01

Pos logit contributions
my+0.255
tr+0.230
ety+0.226
isk+0.220
 whether+0.214
Neg logit contributions
 floats-0.424
ext-0.414
 waves-0.412
 screams-0.410
ading-0.404
Token It
Token ID632
Feature activation-1.82e+00

Pos logit contributions
isk+0.020
ety+0.019
my+0.019
 whether+0.019
coat+0.018
Neg logit contributions
 floats-0.015
 waves-0.015
 screams-0.014
ls-0.014
 cheers-0.014
Token is
Token ID318
Feature activation+2.50e+00

Pos logit contributions
my+0.113
isk+0.108
ety+0.105
tr+0.105
 whether+0.102
Neg logit contributions
 floats-0.081
 fishermen-0.080
 waves-0.079
 ourselves-0.079
ls-0.077
Token cold
Token ID4692
Feature activation-2.06e-01

Pos logit contributions
 floats+0.133
 waves+0.129
 screams+0.127
ls+0.124
 cheers+0.119
Neg logit contributions
my-0.143
ety-0.140
 whether-0.139
isk-0.139
tr-0.128
Token and
Token ID290
Feature activation+5.61e-01

Pos logit contributions
my+0.010
isk+0.010
ety+0.010
 whether+0.009
coat+0.009
Neg logit contributions
 floats-0.013
 waves-0.013
 screams-0.013
ls-0.012
 sinks-0.012
Token sweet
Token ID6029
Feature activation-2.14e+00

Pos logit contributions
 floats+0.029
 waves+0.029
 screams+0.028
ls+0.028
ext+0.028
Neg logit contributions
my-0.031
ety-0.030
 whether-0.029
isk-0.029
coat-0.027
Token.
Token ID13
Feature activation-4.31e-01

Pos logit contributions
my+0.099
isk+0.098
ety+0.098
 whether+0.095
coat+0.087
Neg logit contributions
 floats-0.135
 waves-0.133
 screams-0.130
ls-0.127
 sinks-0.125
Token.
Token ID13
Feature activation-4.08e-01

Pos logit contributions
isk+0.117
my+0.115
ety+0.113
 whether+0.112
coat+0.106
Neg logit contributions
 waves-0.161
 floats-0.160
 screams-0.159
ls-0.155
 sinks-0.152
Token But
Token ID887
Feature activation-1.95e+00

Pos logit contributions
isk+0.019
ety+0.019
my+0.017
 whether+0.017
coat+0.017
Neg logit contributions
 floats-0.027
 waves-0.027
 cheers-0.025
 screams-0.025
 sinks-0.024
Token one
Token ID530
Feature activation-6.29e+00

Pos logit contributions
isk+0.103
ety+0.095
coat+0.094
my+0.088
 whether+0.086
Neg logit contributions
 floats-0.122
 waves-0.118
 sinks-0.114
 ourselves-0.111
 screams-0.109
Token day
Token ID1110
Feature activation-1.38e+00

Pos logit contributions
isk+0.443
ety+0.411
 whether+0.405
coat+0.395
my+0.388
Neg logit contributions
 floats-0.280
 waves-0.270
 screams-0.252
 shark-0.241
 whale-0.240
Token,
Token ID11
Feature activation-7.74e-01

Pos logit contributions
isk+0.090
ety+0.086
my+0.083
 whether+0.081
coat+0.079
Neg logit contributions
 floats-0.069
 waves-0.068
 screams-0.065
ls-0.064
 sinks-0.062
Token a
Token ID257
Feature activation-1.14e+00

Pos logit contributions
isk+0.046
ety+0.045
my+0.044
 whether+0.044
coat+0.042
Neg logit contributions
 floats-0.040
 waves-0.040
 screams-0.038
ls-0.037
 sinks-0.036
Token little
Token ID1310
Feature activation+2.32e+00

Pos logit contributions
isk+0.069
my+0.069
ety+0.068
 whether+0.067
coat+0.064
Neg logit contributions
 floats-0.055
 waves-0.055
 screams-0.053
ls-0.052
 ourselves-0.050
Token pin
Token ID6757
Feature activation-6.52e+00

Pos logit contributions
 floats+0.124
 waves+0.123
 screams+0.113
 cheers+0.108
ls+0.108
Neg logit contributions
isk-0.148
 whether-0.137
ety-0.134
my-0.128
tr-0.124
Token got
Token ID1392
Feature activation-3.57e+00

Pos logit contributions
my+0.296
isk+0.295
ety+0.294
 whether+0.280
coat+0.270
Neg logit contributions
 floats-0.410
 waves-0.408
 screams-0.397
 ourselves-0.388
 cheers-0.384
Token stuck
Token ID7819
Feature activation-3.32e-01

Pos logit contributions
ety+0.199
my+0.198
isk+0.194
 whether+0.193
coat+0.177
Neg logit contributions
 floats-0.195
 waves-0.191
 screams-0.187
ls-0.177
 cheers-0.176
Token in
Token ID287
Feature activation+2.04e+00

Pos logit contributions
isk+0.020
ety+0.019
 whether+0.019
my+0.018
tr+0.017
Neg logit contributions
 waves-0.017
 floats-0.017
 screams-0.016
ls-0.016
 sinks-0.015
Token was
Token ID373
Feature activation+1.11e-01

Pos logit contributions
 screams+0.217
 waves+0.216
 floats+0.214
ls+0.209
 sinks+0.205
Neg logit contributions
my-0.159
isk-0.157
ety-0.153
 whether-0.145
coat-0.144
Token.
Token ID13
Feature activation-3.55e-01

Pos logit contributions
 waves+0.007
 floats+0.007
 screams+0.006
ls+0.006
 sinks+0.006
Neg logit contributions
isk-0.006
ety-0.006
my-0.005
 whether-0.005
coat-0.005
Token
Token ID198
Feature activation+3.95e-01

Pos logit contributions
isk+0.018
my+0.018
ety+0.018
 whether+0.017
coat+0.016
Neg logit contributions
 waves-0.021
 floats-0.021
 screams-0.021
ls-0.020
 sinks-0.019
Token
Token ID198
Feature activation+3.89e-01

Pos logit contributions
 waves+0.023
 floats+0.023
 screams+0.023
ls+0.022
 sinks+0.021
Neg logit contributions
my-0.020
isk-0.020
ety-0.020
 whether-0.020
coat-0.018
Token"
Token ID1
Feature activation-2.15e+00

Pos logit contributions
 floats+0.024
 waves+0.023
ls+0.022
 screams+0.022
 sinks+0.022
Neg logit contributions
isk-0.019
ety-0.019
my-0.018
 whether-0.018
coat-0.016
TokenAre
Token ID8491
Feature activation-8.68e+00

Pos logit contributions
isk+0.137
 whether+0.131
ety+0.128
my+0.120
coat+0.119
Neg logit contributions
 floats-0.109
 waves-0.104
 cheers-0.095
 screams-0.089
 moms-0.088
Token we
Token ID356
Feature activation+7.71e+00

Pos logit contributions
my+0.622
ety+0.613
isk+0.606
 whether+0.600
 hers+0.535
Neg logit contributions
 waves-0.325
 screams-0.318
 floats-0.310
 explodes-0.284
 cheers-0.282
Token there
Token ID612
Feature activation-1.04e+00

Pos logit contributions
 screams+0.473
 floats+0.471
 waves+0.461
ls+0.454
 cheers+0.437
Neg logit contributions
my-0.388
ety-0.376
isk-0.375
 whether-0.367
coat-0.337
Token yet
Token ID1865
Feature activation-5.10e+00

Pos logit contributions
my+0.051
isk+0.051
ety+0.050
 whether+0.050
coat+0.045
Neg logit contributions
 floats-0.063
 waves-0.063
 screams-0.062
ls-0.059
 sinks-0.058
Token?"
Token ID1701
Feature activation+5.50e+00

Pos logit contributions
isk+0.292
ety+0.287
my+0.285
 whether+0.281
coat+0.264
Neg logit contributions
 floats-0.271
 waves-0.269
 screams-0.259
ls-0.252
 sinks-0.243
Token Lily
Token ID20037
Feature activation-4.98e+00

Pos logit contributions
 waves+0.369
 floats+0.368
 screams+0.356
ls+0.346
 sinks+0.339
Neg logit contributions
isk-0.232
ety-0.232
my-0.230
 whether-0.223
coat-0.203
Token v
Token ID410
Feature activation-9.77e+00

Pos logit contributions
 waves+0.037
 floats+0.037
 boat+0.034
 whale+0.032
ls+0.031
Neg logit contributions
isk-0.049
ety-0.047
 whether-0.046
my-0.041
Wh-0.039
Tokenase
Token ID589
Feature activation-5.67e+00

Pos logit contributions
 whether+0.457
ety+0.421
my+0.421
isk+0.399
 pad+0.359
Neg logit contributions
 floats-0.617
 waves-0.607
 screams-0.584
ls-0.579
 moms-0.561
Token inside
Token ID2641
Feature activation-1.09e+00

Pos logit contributions
isk+0.349
ety+0.341
 whether+0.332
my+0.321
coat+0.298
Neg logit contributions
 waves-0.281
 floats-0.276
ls-0.271
 screams-0.267
 cheers-0.245
Token.
Token ID13
Feature activation-1.20e-01

Pos logit contributions
isk+0.068
ety+0.067
coat+0.062
 whether+0.062
tr+0.061
Neg logit contributions
 waves-0.055
 floats-0.055
 screams-0.052
ls-0.051
 cheers-0.050
Token The
Token ID383
Feature activation+6.79e-01

Pos logit contributions
isk+0.008
ety+0.007
 whether+0.007
coat+0.007
my+0.007
Neg logit contributions
 waves-0.006
 floats-0.006
 screams-0.006
 cheers-0.005
ls-0.005
Token v
Token ID410
Feature activation-9.74e+00

Pos logit contributions
 floats+0.032
 waves+0.032
 screams+0.030
 moms+0.030
ls+0.028
Neg logit contributions
 whether-0.044
ety-0.043
isk-0.042
my-0.042
coat-0.037
Tokenase
Token ID589
Feature activation-5.64e+00

Pos logit contributions
 whether+0.460
my+0.422
ety+0.418
isk+0.401
 pad+0.368
Neg logit contributions
 floats-0.623
 waves-0.602
 screams-0.586
ls-0.581
 moms-0.568
Token was
Token ID373
Feature activation+3.52e-01

Pos logit contributions
my+0.254
ety+0.241
isk+0.239
 whether+0.223
tr+0.222
Neg logit contributions
 floats-0.368
 waves-0.364
 screams-0.356
 ourselves-0.353
 cheers-0.343
Token very
Token ID845
Feature activation-6.82e-01

Pos logit contributions
 waves+0.016
 floats+0.016
 screams+0.015
ls+0.014
 cheers+0.014
Neg logit contributions
isk-0.023
ety-0.023
 whether-0.022
my-0.022
coat-0.021
Token fragile
Token ID21049
Feature activation-1.88e+00

Pos logit contributions
ety+0.033
my+0.033
 whether+0.031
isk+0.029
coat+0.029
Neg logit contributions
 waves-0.041
 floats-0.041
 screams-0.040
 sinks-0.038
 cheers-0.038
Token and
Token ID290
Feature activation+8.72e-01

Pos logit contributions
isk+0.125
ety+0.119
 whether+0.112
my+0.110
tr+0.108
Neg logit contributions
 waves-0.091
 screams-0.086
 floats-0.084
ls-0.083
 ourselves-0.077
Token around
Token ID1088
Feature activation+3.39e+00

Pos logit contributions
 waves+0.072
 floats+0.072
 screams+0.069
ls+0.067
 sinks+0.065
Neg logit contributions
isk-0.070
ety-0.069
my-0.068
 whether-0.067
coat-0.063
Token the
Token ID262
Feature activation+1.06e+00

Pos logit contributions
 waves+0.177
 floats+0.173
 screams+0.167
ls+0.160
 sinks+0.158
Neg logit contributions
ety-0.203
isk-0.199
my-0.194
 whether-0.190
coat-0.188
Token village
Token ID7404
Feature activation+4.26e+00

Pos logit contributions
 floats+0.060
 screams+0.058
ls+0.058
 ourselves+0.055
ext+0.055
Neg logit contributions
my-0.058
ety-0.055
isk-0.053
 whether-0.050
 pad-0.049
Token looking
Token ID2045
Feature activation-4.10e+00

Pos logit contributions
 floats+0.269
 waves+0.267
 screams+0.260
ls+0.253
 sinks+0.246
Neg logit contributions
isk-0.199
ety-0.195
my-0.195
 whether-0.191
coat-0.176
Token for
Token ID329
Feature activation-4.27e+00

Pos logit contributions
my+0.132
 whether+0.131
isk+0.118
ety+0.118
 though+0.101
Neg logit contributions
 floats-0.319
 screams-0.312
 waves-0.304
 ourselves-0.298
 cheers-0.295
Token cel
Token ID18725
Feature activation-9.68e+00

Pos logit contributions
tr+0.258
coat+0.255
ety+0.253
isk+0.253
my+0.249
Neg logit contributions
 waves-0.235
 floats-0.211
 screams-0.209
ls-0.205
 sinks-0.201
Tokenery
Token ID1924
Feature activation-1.58e+00

Pos logit contributions
isk+0.542
my+0.529
ety+0.529
 whether+0.506
coat+0.484
Neg logit contributions
 floats-0.548
 waves-0.531
 screams-0.522
 cheers-0.491
 sinks-0.482
Token to
Token ID284
Feature activation-8.87e-01

Pos logit contributions
isk+0.079
ety+0.078
my+0.076
 whether+0.074
coat+0.070
Neg logit contributions
 waves-0.095
 floats-0.093
ls-0.090
 screams-0.090
 sinks-0.086
Token catch
Token ID4929
Feature activation+2.90e+00

Pos logit contributions
ety+0.049
my+0.048
isk+0.047
 whether+0.046
coat+0.046
Neg logit contributions
 waves-0.050
 floats-0.048
 screams-0.047
 sinks-0.045
 ourselves-0.044
Token.
Token ID13
Feature activation-1.96e-01

Pos logit contributions
 waves+0.159
 floats+0.159
 screams+0.153
ls+0.149
 sinks+0.144
Neg logit contributions
isk-0.161
ety-0.159
my-0.157
 whether-0.154
coat-0.147
Token One
Token ID1881
Feature activation-5.60e+00

Pos logit contributions
isk+0.010
ety+0.010
coat+0.010
 whether+0.010
my+0.009
Neg logit contributions
 waves-0.011
 floats-0.011
 cheers-0.010
 screams-0.010
 sinks-0.010
Token was
Token ID373
Feature activation-6.69e-02

Pos logit contributions
isk+0.522
ety+0.503
my+0.483
coat+0.466
 whether+0.465
Neg logit contributions
 floats-0.460
 waves-0.456
 screams-0.427
ls-0.424
 sinks-0.403
Token for
Token ID329
Feature activation-4.57e+00

Pos logit contributions
isk+0.004
ety+0.004
my+0.004
 whether+0.004
coat+0.004
Neg logit contributions
 waves-0.003
 floats-0.003
 screams-0.003
ls-0.003
 sinks-0.003
Token dinner
Token ID8073
Feature activation-3.25e-01

Pos logit contributions
ety+0.254
isk+0.250
my+0.245
 whether+0.233
coat+0.228
Neg logit contributions
 waves-0.253
 floats-0.252
 screams-0.244
ls-0.240
 ourselves-0.234
Token.
Token ID13
Feature activation-5.24e-01

Pos logit contributions
ety+0.017
isk+0.017
my+0.016
 whether+0.016
coat+0.015
Neg logit contributions
 waves-0.019
 floats-0.018
 screams-0.017
ls-0.017
 ourselves-0.017
Token Tim
Token ID5045
Feature activation-1.65e+00

Pos logit contributions
isk+0.032
ety+0.031
 whether+0.031
my+0.030
coat+0.029
Neg logit contributions
 floats-0.024
 waves-0.023
 screams-0.022
 moms-0.021
ls-0.021
Tokenmy
Token ID1820
Feature activation-1.19e+01

Pos logit contributions
 whether+0.031
rings+0.023
 hers+0.023
coat+0.022
isk+0.021
Neg logit contributions
 waves-0.147
 screams-0.144
am-0.142
 floats-0.135
 cheers-0.135
Token guessed
Token ID25183
Feature activation-3.69e+00

Pos logit contributions
isk+0.564
my+0.560
ety+0.545
 whether+0.531
coat+0.486
Neg logit contributions
 floats-0.754
 waves-0.735
 screams-0.709
ls-0.705
 ourselves-0.687
Token it
Token ID340
Feature activation-1.76e+00

Pos logit contributions
ety+0.223
isk+0.222
my+0.207
coat+0.203
tr+0.200
Neg logit contributions
 waves-0.193
 floats-0.188
 screams-0.184
ls-0.181
 sinks-0.171
Token was
Token ID373
Feature activation-8.52e-02

Pos logit contributions
ety+0.105
isk+0.104
coat+0.098
 whether+0.097
my+0.095
Neg logit contributions
 waves-0.088
 floats-0.087
 screams-0.085
ls-0.084
 sinks-0.079
Token pizza
Token ID14256
Feature activation-2.09e+00

Pos logit contributions
isk+0.005
ety+0.004
my+0.004
coat+0.004
 whether+0.004
Neg logit contributions
 waves-0.005
 floats-0.005
 screams-0.005
ls-0.005
 sinks-0.004
Token,
Token ID11
Feature activation-9.03e-01

Pos logit contributions
isk+0.107
ety+0.106
my+0.103
 whether+0.102
coat+0.095
Neg logit contributions
 waves-0.123
 floats-0.123
 screams-0.117
ls-0.115
 sinks-0.111
Token a
Token ID257
Feature activation-1.16e+00

Pos logit contributions
isk+0.051
ety+0.050
my+0.049
 whether+0.047
coat+0.046
Neg logit contributions
 waves-0.064
 floats-0.064
 screams-0.061
ls-0.060
 sinks-0.058
Token hammer
Token ID15554
Feature activation-9.87e+00

Pos logit contributions
isk+0.080
ety+0.077
 whether+0.075
my+0.073
coat+0.068
Neg logit contributions
 waves-0.053
 floats-0.052
 screams-0.050
ls-0.048
 boat-0.046
Token and
Token ID290
Feature activation+5.57e-01

Pos logit contributions
isk+0.597
ety+0.583
 whether+0.575
my+0.545
coat+0.520
Neg logit contributions
 waves-0.512
 floats-0.505
 screams-0.486
ls-0.483
 sinks-0.455
Token a
Token ID257
Feature activation-1.15e+00

Pos logit contributions
 waves+0.031
 floats+0.030
 screams+0.030
 sinks+0.027
 sails+0.027
Neg logit contributions
isk-0.036
ety-0.033
coat-0.033
my-0.031
tr-0.029
Token screw
Token ID9580
Feature activation-1.43e+01

Pos logit contributions
isk+0.070
ety+0.066
 whether+0.063
my+0.061
coat+0.055
Neg logit contributions
 waves-0.063
 floats-0.062
 screams-0.061
 boat-0.058
 sinks-0.058
Tokendriver
Token ID26230
Feature activation-9.68e+00

Pos logit contributions
isk+0.651
ety+0.643
 whether+0.623
my+0.603
coat+0.529
Neg logit contributions
 waves-0.955
 floats-0.948
ls-0.934
 screams-0.932
 sinks-0.900
Token on
Token ID319
Feature activation-5.25e-01

Pos logit contributions
isk+0.531
ety+0.523
my+0.507
 whether+0.505
coat+0.468
Neg logit contributions
 waves-0.542
 floats-0.540
 screams-0.524
ls-0.513
 sinks-0.493
Token a
Token ID257
Feature activation-1.15e+00

Pos logit contributions
ety+0.040
isk+0.039
my+0.038
coat+0.038
 whether+0.037
Neg logit contributions
 waves-0.020
 floats-0.019
 screams-0.018
 sinks-0.018
 ourselves-0.017
Token shelf
Token ID18316
Feature activation-1.02e+01

Pos logit contributions
isk+0.070
ety+0.069
 whether+0.064
my+0.061
coat+0.057
Neg logit contributions
 waves-0.059
 boat-0.059
 screams-0.058
 floats-0.056
 sinks-0.055
Token.
Token ID13
Feature activation-3.81e-01

Pos logit contributions
isk+0.571
ety+0.558
my+0.541
 whether+0.540
coat+0.500
Neg logit contributions
 waves-0.562
 floats-0.559
 screams-0.538
ls-0.528
 sinks-0.512
Token
Token ID198
Feature activation+3.68e-01

Pos logit contributions
isk+0.022
ety+0.022
my+0.022
 whether+0.021
coat+0.020
Neg logit contributions
 floats-0.020
 waves-0.020
 screams-0.019
ls-0.019
 sinks-0.018