TOP ACTIVATIONS
MAX = 16.379

her hair in a pony tail so she could look pretty
grandma 's house for safe keeping . When
, Emma decided a pony tail wasn 't so important after
with glasses and a pony tail . She is reading a
it to me for safe keeping . He told me to
has a purple band - aid . She puts it on
and put a band - aid on it .
quickly got a band - aid to make it better .
They get some band - aid s and cookies . They
gave Anna a band - aid and a candy . Anna
He needs a band - aid ." Lily says , "
gave her a band - aid and a sticker . He
, and a band - aid on his arm . She
and wrapped a band - aid around it .
I need a band - aid , but I can 't
He finds a band - aid in his pocket and he
and puts a band - aid on it . She kisses
get you a band - aid and a kiss ."
They need a band - aid and a hug . They
and put a band - aid on it . Mia felt

MIN ACTIVATIONS
MIN = -13.790

But the sleepy whale was too tired to play
, there was a big whale who lived in the ocean
, " Hi , pink whale ! You 're so pretty
the sailor invited the big whale . The whale said
! We can use the whale to explore the oceans ."
." The purple whale saw Tom and Lily and
too ?" The whale smiled and said , "
to the tank and the whale sw am over to them
excited to hear about the whale . " You
had hoped for a happy ending , but it didn 't
and laughed too . The whale was very happy .
One day , the whale saw a big , white
are cool . But this whale is not for playing .
, she saw a big whale stuck in the sand .
around the boat collecting the whale . The parents were worried
, there was a big whale . The whale was very
sad to have a bad ending to their day . The
whale !" The whale sees them and makes a
and too deep . The whale lives in the ocean ,
He said goodbye to the whale and went back to his

INTERVAL 8.837 - 16.379
CONTAINS 0.081%

family broke open their ac orns and shared them with everyone
it . M ittens wasn 't happy with this
she bought an ice - cream as a reward for catching
Teddy noticed that the band age was starting to come loose
went to the ice - cream shop with her mom .

INTERVAL 1.295 - 8.837
CONTAINS 23.067%

and wiped her tears . She put a band - aid
see what was inside the box . She ran
her sc ooter . She picked it up and went riding
Pink said , " I am lost and I don 't
Inside the frame was a picture of the happy family singing

INTERVAL -6.248 - 1.295
CONTAINS 75.459%

nice for the big bad man . He worked hard all
upon a time , there was a little girl called Lily
so much fun , they didn 't realize how dirty they
host and Ben is the guest . " Welcome
was . When they opened it , they saw

INTERVAL -13.790 - -6.248
CONTAINS 1.393%

could have happened to the bull ?". Sally said , "
â Dad , let 's fly
and the flowers . He waves to the birds and the
. He waits for the right moment and then d ashes
Jack put his arm around Joe and said " It 's
Token her
Token ID607
Feature activation+2.74e+00

Pos logit contributions
ons+0.026
aid+0.024
quet+0.023
tail+0.023
powers+0.022
Neg logit contributions
 beginning-0.019
 ending-0.019
 size-0.017
 crashing-0.017
sing-0.017
Token hair
Token ID4190
Feature activation+6.49e+00

Pos logit contributions
ons+0.208
aid+0.183
quet+0.177
tail+0.171
keeping+0.162
Neg logit contributions
 ending-0.150
 beginning-0.148
 size-0.137
sing-0.135
 camps-0.129
Token in
Token ID287
Feature activation-8.35e-01

Pos logit contributions
ons+0.504
aid+0.442
quet+0.418
tail+0.417
 exclaimed+0.387
Neg logit contributions
 ending-0.349
 beginning-0.335
sing-0.312
 size-0.312
 crashing-0.303
Token a
Token ID257
Feature activation+6.96e-01

Pos logit contributions
 ending+0.048
 beginning+0.047
 size+0.046
 end+0.043
 crashing+0.042
Neg logit contributions
ons-0.062
aid-0.057
quet-0.052
tail-0.051
powers-0.049
Token pony
Token ID26902
Feature activation+2.30e+00

Pos logit contributions
ons+0.054
aid+0.048
quet+0.045
keeping+0.044
tail+0.043
Neg logit contributions
 beginning-0.037
 ending-0.037
 size-0.034
 camps-0.034
 end-0.033
Tokentail
Token ID13199
Feature activation+1.64e+01

Pos logit contributions
ons+0.179
aid+0.172
quet+0.149
powers+0.145
keeping+0.141
Neg logit contributions
 beginning-0.121
sing-0.108
 end-0.106
 ending-0.106
 size-0.103
Token so
Token ID523
Feature activation+3.96e+00

Pos logit contributions
ons+1.374
aid+1.298
quet+1.204
tail+1.137
 stitches+1.105
Neg logit contributions
 ending-0.761
 beginning-0.746
 end-0.687
 size-0.672
sing-0.652
Token she
Token ID673
Feature activation+4.78e+00

Pos logit contributions
ons+0.209
aid+0.184
quet+0.162
tail+0.160
keeping+0.155
Neg logit contributions
 ending-0.319
 beginning-0.300
 size-0.290
sing-0.279
 end-0.275
Token could
Token ID714
Feature activation+3.64e+00

Pos logit contributions
ons+0.379
aid+0.339
quet+0.327
tail+0.326
keeping+0.310
Neg logit contributions
 ending-0.241
 beginning-0.229
 size-0.214
sing-0.209
 camps-0.206
Token look
Token ID804
Feature activation-1.93e+00

Pos logit contributions
ons+0.309
aid+0.270
keeping+0.259
quet+0.257
tail+0.246
Neg logit contributions
 ending-0.159
 beginning-0.159
 size-0.150
 end-0.150
sing-0.147
Token pretty
Token ID2495
Feature activation+6.91e-01

Pos logit contributions
 ending+0.087
 beginning+0.079
sing+0.075
 size+0.073
 crashing+0.072
Neg logit contributions
ons-0.169
aid-0.152
tail-0.145
quet-0.143
 exclaimed-0.134
Token grandma
Token ID49890
Feature activation+2.28e+00

Pos logit contributions
 ending+0.017
 beginning+0.016
ksh+0.015
 crashing+0.015
rant+0.015
Neg logit contributions
ons-0.027
aid-0.023
tail-0.023
quet-0.022
 stitches-0.021
Token's
Token ID338
Feature activation-1.91e+00

Pos logit contributions
ons+0.177
aid+0.166
quet+0.151
tail+0.143
powers+0.141
Neg logit contributions
 ending-0.119
 beginning-0.117
sing-0.107
 size-0.107
 camps-0.103
Token house
Token ID2156
Feature activation+2.38e+00

Pos logit contributions
 ending+0.108
 beginning+0.102
ksh+0.098
 crashing+0.097
rant+0.097
Neg logit contributions
ons-0.138
aid-0.121
tail-0.117
quet-0.111
 stitches-0.110
Token for
Token ID329
Feature activation-3.35e+00

Pos logit contributions
ons+0.191
aid+0.172
tail+0.160
quet+0.160
 exclaimed+0.147
Neg logit contributions
 ending-0.125
 beginning-0.117
 size-0.111
sing-0.109
 crashing-0.106
Token safe
Token ID3338
Feature activation+5.76e+00

Pos logit contributions
 ending+0.141
 beginning+0.137
 size+0.131
 end+0.120
 crashing+0.119
Neg logit contributions
ons-0.288
aid-0.273
quet-0.259
tail-0.256
chair-0.254
Tokenkeeping
Token ID19934
Feature activation+1.63e+01

Pos logit contributions
ons+0.374
aid+0.340
quet+0.312
tail+0.303
 exclaimed+0.270
Neg logit contributions
 ending-0.394
 beginning-0.370
 size-0.367
 crashing-0.348
sing-0.348
Token.
Token ID13
Feature activation+2.46e-01

Pos logit contributions
ons+1.352
aid+1.278
quet+1.171
tail+1.149
 stitches+1.073
Neg logit contributions
 ending-0.798
 beginning-0.754
 size-0.701
sing-0.694
 crashing-0.680
Token
Token ID220
Feature activation+3.45e+00

Pos logit contributions
ons+0.021
aid+0.019
quet+0.018
tail+0.018
odic+0.017
Neg logit contributions
 ending-0.012
ksh-0.011
 beginning-0.010
 crashing-0.010
sing-0.010
Token
Token ID198
Feature activation+8.04e-01

Pos logit contributions
ons+0.252
tail+0.233
aid+0.232
quet+0.227
 stitches+0.200
Neg logit contributions
 ending-0.171
 beginning-0.164
 crashing-0.164
sing-0.161
ksh-0.156
Token
Token ID198
Feature activation+7.85e-01

Pos logit contributions
ons+0.066
tail+0.063
quet+0.061
aid+0.060
 stitches+0.056
Neg logit contributions
 crashing-0.031
 ending-0.029
 camps-0.028
 them-0.027
 end-0.026
TokenWhen
Token ID2215
Feature activation+1.74e+00

Pos logit contributions
ons+0.058
tail+0.050
aid+0.050
quet+0.047
 stitches+0.045
Neg logit contributions
 ending-0.044
 beginning-0.041
 crashing-0.041
 size-0.038
 end-0.038
Token,
Token ID11
Feature activation+4.32e-01

Pos logit contributions
ons+0.070
aid+0.064
quet+0.058
tail+0.058
keeping+0.052
Neg logit contributions
 ending-0.064
 beginning-0.059
 size-0.056
sing-0.055
 crashing-0.054
Token Emma
Token ID18966
Feature activation+2.10e+00

Pos logit contributions
ons+0.029
aid+0.026
quet+0.024
tail+0.023
keeping+0.021
Neg logit contributions
 ending-0.030
 beginning-0.028
 size-0.026
 end-0.025
sing-0.025
Token decided
Token ID3066
Feature activation+3.62e-01

Pos logit contributions
ons+0.169
aid+0.156
quet+0.147
tail+0.143
keeping+0.132
Neg logit contributions
 ending-0.108
 beginning-0.101
 size-0.096
sing-0.090
 camps-0.088
Token a
Token ID257
Feature activation+4.87e-01

Pos logit contributions
ons+0.029
aid+0.026
tail+0.024
quet+0.024
bes+0.022
Neg logit contributions
 ending-0.020
 beginning-0.019
 size-0.017
 crashing-0.017
sing-0.016
Token pony
Token ID26902
Feature activation+2.14e+00

Pos logit contributions
ons+0.039
aid+0.034
tail+0.032
quet+0.032
keeping+0.030
Neg logit contributions
 ending-0.027
 beginning-0.025
sing-0.023
 size-0.023
 crashing-0.023
Tokentail
Token ID13199
Feature activation+1.62e+01

Pos logit contributions
ons+0.171
aid+0.158
quet+0.143
tail+0.137
bes+0.137
Neg logit contributions
 ending-0.112
 beginning-0.106
 size-0.095
 crashing-0.092
 end-0.092
Token wasn
Token ID2492
Feature activation-2.47e+00

Pos logit contributions
ons+1.335
aid+1.215
tail+1.145
quet+1.135
 stitches+1.047
Neg logit contributions
 ending-0.820
 beginning-0.755
 size-0.695
sing-0.688
 crashing-0.674
Token't
Token ID470
Feature activation+1.67e+00

Pos logit contributions
 beginning+0.161
 ending+0.158
 crashing+0.153
 end+0.148
sing+0.141
Neg logit contributions
ons-0.160
aid-0.141
tail-0.128
keeping-0.126
quet-0.126
Token so
Token ID523
Feature activation+3.70e+00

Pos logit contributions
ons+0.134
aid+0.118
tail+0.115
quet+0.113
bes+0.105
Neg logit contributions
 ending-0.090
 beginning-0.088
 size-0.080
 crashing-0.079
sing-0.074
Token important
Token ID1593
Feature activation+9.41e-02

Pos logit contributions
ons+0.308
aid+0.266
tail+0.264
quet+0.262
keeping+0.246
Neg logit contributions
 ending-0.194
 beginning-0.178
 size-0.160
 crashing-0.160
sing-0.156
Token after
Token ID706
Feature activation-5.02e-01

Pos logit contributions
ons+0.008
aid+0.007
tail+0.006
quet+0.006
 exclaimed+0.006
Neg logit contributions
 ending-0.005
 beginning-0.005
 size-0.004
 crashing-0.004
sing-0.004
Token with
Token ID351
Feature activation+6.09e-01

Pos logit contributions
 ending+0.019
 beginning+0.018
sing+0.017
 size+0.017
 crashing+0.017
Neg logit contributions
ons-0.024
aid-0.022
quet-0.020
tail-0.019
powers-0.018
Token glasses
Token ID15232
Feature activation+3.63e+00

Pos logit contributions
ons+0.046
aid+0.041
quet+0.039
tail+0.039
 exclaimed+0.035
Neg logit contributions
 ending-0.034
 beginning-0.032
sing-0.031
 size-0.030
 crashing-0.030
Token and
Token ID290
Feature activation-4.70e-01

Pos logit contributions
ons+0.273
aid+0.244
tail+0.228
quet+0.225
 exclaimed+0.207
Neg logit contributions
 ending-0.203
 beginning-0.196
sing-0.181
 size-0.180
 crashing-0.179
Token a
Token ID257
Feature activation-1.05e-01

Pos logit contributions
 ending+0.024
 beginning+0.022
sing+0.021
 size+0.021
 crashing+0.020
Neg logit contributions
ons-0.038
aid-0.035
quet-0.033
tail-0.031
keeping-0.030
Token pony
Token ID26902
Feature activation+1.42e+00

Pos logit contributions
 ending+0.005
 beginning+0.005
sing+0.005
 size+0.005
 crashing+0.005
Neg logit contributions
ons-0.008
aid-0.008
quet-0.007
tail-0.007
keeping-0.007
Tokentail
Token ID13199
Feature activation+1.56e+01

Pos logit contributions
ons+0.110
aid+0.101
quet+0.092
tail+0.086
 exclaimed+0.086
Neg logit contributions
 ending-0.075
 beginning-0.073
sing-0.068
 size-0.067
 crashing-0.066
Token.
Token ID13
Feature activation-6.54e-01

Pos logit contributions
ons+1.159
aid+1.030
tail+0.956
quet+0.956
 exclaimed+0.867
Neg logit contributions
 ending-0.911
 beginning-0.859
sing-0.814
 size-0.805
 crashing-0.792
Token She
Token ID1375
Feature activation+2.69e+00

Pos logit contributions
 ending+0.042
 beginning+0.040
sing+0.038
 size+0.037
 crashing+0.037
Neg logit contributions
ons-0.045
aid-0.039
tail-0.036
quet-0.036
 exclaimed-0.032
Token is
Token ID318
Feature activation-1.29e+00

Pos logit contributions
ons+0.190
aid+0.159
tail+0.155
quet+0.143
 stitches+0.141
Neg logit contributions
 ending-0.170
 beginning-0.158
sing-0.153
 crashing-0.149
ksh-0.148
Token reading
Token ID3555
Feature activation+5.92e+00

Pos logit contributions
 ending+0.084
 beginning+0.074
ksh+0.073
sing+0.073
 camps+0.072
Neg logit contributions
ons-0.086
aid-0.075
tail-0.072
quet-0.068
 exclaimed-0.066
Token a
Token ID257
Feature activation-1.57e-01

Pos logit contributions
ons+0.425
aid+0.373
tail+0.351
quet+0.345
 stitches+0.315
Neg logit contributions
 ending-0.358
 beginning-0.335
sing-0.323
 size-0.316
 crashing-0.314
Token it
Token ID340
Feature activation+8.83e-02

Pos logit contributions
ons+0.081
aid+0.073
quet+0.071
tail+0.070
bes+0.064
Neg logit contributions
 ending-0.058
 beginning-0.055
 size-0.052
sing-0.051
 crashing-0.051
Token to
Token ID284
Feature activation+3.84e-01

Pos logit contributions
ons+0.007
aid+0.006
quet+0.006
tail+0.006
bes+0.006
Neg logit contributions
 ending-0.004
 beginning-0.004
sing-0.004
 size-0.004
 mate-0.004
Token me
Token ID502
Feature activation-1.63e+00

Pos logit contributions
ons+0.033
aid+0.030
tail+0.029
quet+0.028
 exclaimed+0.026
Neg logit contributions
 ending-0.017
 beginning-0.016
sing-0.015
 size-0.015
 crashing-0.015
Token for
Token ID329
Feature activation-4.13e+00

Pos logit contributions
 ending+0.074
 beginning+0.072
 end+0.070
 size+0.070
sing+0.070
Neg logit contributions
ons-0.133
aid-0.118
quet-0.115
tail-0.111
keeping-0.108
Token safe
Token ID3338
Feature activation+4.99e+00

Pos logit contributions
 ending+0.208
 size+0.200
 beginning+0.200
 end+0.185
 crashing+0.185
Neg logit contributions
ons-0.322
aid-0.298
tail-0.292
quet-0.285
chair-0.276
Tokenkeeping
Token ID19934
Feature activation+1.55e+01

Pos logit contributions
ons+0.278
aid+0.257
quet+0.233
tail+0.224
odic+0.194
Neg logit contributions
 ending-0.381
 size-0.366
 beginning-0.351
sing-0.337
 crashing-0.336
Token.
Token ID13
Feature activation-7.11e-01

Pos logit contributions
ons+1.274
aid+1.173
quet+1.096
tail+1.080
 stitches+0.990
Neg logit contributions
 ending-0.770
 beginning-0.710
 size-0.700
sing-0.670
 crashing-0.646
Token He
Token ID679
Feature activation-7.03e-01

Pos logit contributions
 ending+0.034
 beginning+0.032
 crashing+0.030
ksh+0.030
sing+0.030
Neg logit contributions
ons-0.060
aid-0.052
quet-0.051
tail-0.049
bes-0.045
Token told
Token ID1297
Feature activation+2.60e+00

Pos logit contributions
 ending+0.040
 size+0.038
 beginning+0.036
 camps+0.036
 only+0.036
Neg logit contributions
ons-0.049
quet-0.045
aid-0.043
tail-0.042
keeping-0.041
Token me
Token ID502
Feature activation-1.73e+00

Pos logit contributions
ons+0.214
aid+0.193
tail+0.184
quet+0.182
bes+0.166
Neg logit contributions
 ending-0.131
 beginning-0.121
 size-0.114
sing-0.114
 crashing-0.111
Token to
Token ID284
Feature activation+3.01e-01

Pos logit contributions
 ending+0.091
 beginning+0.083
 end+0.081
 crashing+0.078
sing+0.077
Neg logit contributions
ons-0.130
quet-0.119
aid-0.118
tail-0.118
bes-0.109
Token has
Token ID468
Feature activation-3.71e-01

Pos logit contributions
 ending+0.225
 beginning+0.216
 size+0.209
 camps+0.206
 end+0.201
Neg logit contributions
ons-0.264
aid-0.238
quet-0.230
tail-0.229
keeping-0.218
Token a
Token ID257
Feature activation+5.75e-01

Pos logit contributions
 ending+0.018
 beginning+0.017
 size+0.016
 camps+0.015
 crashing+0.015
Neg logit contributions
ons-0.030
quet-0.028
aid-0.027
tail-0.027
chair-0.024
Token purple
Token ID14032
Feature activation-5.57e+00

Pos logit contributions
ons+0.038
aid+0.035
tail+0.031
quet+0.031
 stitches+0.028
Neg logit contributions
 ending-0.037
 beginning-0.035
ksh-0.034
sing-0.033
 size-0.033
Token band
Token ID4097
Feature activation+9.40e+00

Pos logit contributions
 ending+0.360
 beginning+0.339
ksh+0.333
sing+0.326
 crashing+0.316
Neg logit contributions
ons-0.366
aid-0.324
tail-0.305
quet-0.297
 stitches-0.269
Token-
Token ID12
Feature activation+4.27e+00

Pos logit contributions
ons+0.558
quet+0.454
aid+0.451
tail+0.425
 exclaimed+0.399
Neg logit contributions
 ending-0.659
 beginning-0.614
sing-0.606
 size-0.597
 crashing-0.589
Tokenaid
Token ID1698
Feature activation+1.37e+01

Pos logit contributions
ons+0.092
quet+0.046
tail+0.044
aid+0.029
keeping+0.028
Neg logit contributions
 ending-0.440
 beginning-0.432
ksh-0.425
 crashing-0.419
rant-0.417
Token.
Token ID13
Feature activation+1.11e-01

Pos logit contributions
ons+1.092
aid+0.967
tail+0.923
quet+0.918
bes+0.846
Neg logit contributions
 ending-0.718
 beginning-0.671
sing-0.629
 size-0.628
 crashing-0.618
Token She
Token ID1375
Feature activation+3.52e+00

Pos logit contributions
ons+0.007
aid+0.006
quet+0.005
tail+0.005
keeping+0.005
Neg logit contributions
 ending-0.008
 beginning-0.008
sing-0.007
 crashing-0.007
 size-0.007
Token puts
Token ID7584
Feature activation+5.69e-01

Pos logit contributions
ons+0.228
aid+0.198
tail+0.186
quet+0.185
keeping+0.167
Neg logit contributions
 ending-0.236
 beginning-0.223
sing-0.214
 size-0.213
 camps-0.209
Token it
Token ID340
Feature activation+8.50e-01

Pos logit contributions
ons+0.042
aid+0.037
tail+0.035
quet+0.034
 stitches+0.032
Neg logit contributions
 ending-0.034
 beginning-0.031
sing-0.030
 size-0.029
 crashing-0.029
Token on
Token ID319
Feature activation+2.40e+00

Pos logit contributions
ons+0.057
aid+0.051
tail+0.047
quet+0.046
 stitches+0.042
Neg logit contributions
 ending-0.055
 beginning-0.051
sing-0.050
 size-0.049
ksh-0.048
Token and
Token ID290
Feature activation+2.16e-01

Pos logit contributions
ons+0.558
aid+0.485
quet+0.483
tail+0.464
keeping+0.457
Neg logit contributions
 ending-0.288
 beginning-0.272
 size-0.256
sing-0.255
 crashing-0.246
Token put
Token ID1234
Feature activation+5.71e-01

Pos logit contributions
ons+0.016
aid+0.014
quet+0.014
tail+0.013
keeping+0.012
Neg logit contributions
 ending-0.012
 beginning-0.012
sing-0.011
 size-0.011
 crashing-0.011
Token a
Token ID257
Feature activation+6.55e-01

Pos logit contributions
ons+0.045
aid+0.040
tail+0.037
 stitches+0.035
quet+0.034
Neg logit contributions
 ending-0.033
sing-0.030
 beginning-0.030
 size-0.028
 crashing-0.027
Token band
Token ID4097
Feature activation+9.41e+00

Pos logit contributions
ons+0.039
keeping+0.030
tail+0.029
quet+0.029
bes+0.027
Neg logit contributions
 ending-0.048
 beginning-0.046
 crashing-0.046
 size-0.045
 end-0.045
Token-
Token ID12
Feature activation+4.23e+00

Pos logit contributions
ons+0.539
quet+0.436
tail+0.394
 exclaimed+0.388
ner+0.385
Neg logit contributions
 ending-0.629
 beginning-0.614
 crashing-0.601
sing-0.593
king-0.591
Tokenaid
Token ID1698
Feature activation+1.37e+01

Pos logit contributions
ons+0.079
quet+0.040
tail+0.039
bes+0.031
keeping+0.023
Neg logit contributions
 beginning-0.432
 ending-0.432
ksh-0.426
 crashing-0.420
 end-0.414
Token on
Token ID319
Feature activation+2.55e+00

Pos logit contributions
ons+0.966
aid+0.849
tail+0.795
quet+0.785
 exclaimed+0.713
Neg logit contributions
 ending-0.844
 beginning-0.795
sing-0.760
 size-0.749
 crashing-0.740
Token it
Token ID340
Feature activation+8.71e-01

Pos logit contributions
ons+0.170
aid+0.139
tail+0.137
quet+0.123
 stitches+0.122
Neg logit contributions
 ending-0.177
 beginning-0.162
sing-0.158
rated-0.156
ksh-0.154
Token.
Token ID13
Feature activation+7.90e-03

Pos logit contributions
ons+0.074
aid+0.065
quet+0.063
tail+0.061
keeping+0.058
Neg logit contributions
 ending-0.041
 beginning-0.040
 size-0.037
sing-0.037
 crashing-0.035
Token
Token ID198
Feature activation+6.90e-01

Pos logit contributions
ons+0.001
aid+0.001
tail+0.000
 exclaimed+0.000
quet+0.000
Neg logit contributions
 ending-0.000
 beginning-0.000
 size-0.000
sing-0.000
 crashing-0.000
Token
Token ID198
Feature activation+6.89e-01

Pos logit contributions
ons+0.051
tail+0.046
aid+0.046
quet+0.046
keeping+0.040
Neg logit contributions
 ending-0.036
 crashing-0.034
 camps-0.033
 beginning-0.033
sing-0.032
Token quickly
Token ID2952
Feature activation+4.50e+00

Pos logit contributions
 ending+0.097
 beginning+0.092
sing+0.087
 size+0.086
 crashing+0.085
Neg logit contributions
ons-0.111
aid-0.098
tail-0.092
quet-0.091
keeping-0.083
Token got
Token ID1392
Feature activation+1.47e+00

Pos logit contributions
ons+0.324
aid+0.281
tail+0.280
quet+0.271
keeping+0.252
Neg logit contributions
 ending-0.263
 beginning-0.248
 size-0.234
 crashing-0.231
sing-0.228
Token a
Token ID257
Feature activation+6.85e-01

Pos logit contributions
ons+0.118
quet+0.112
tail+0.106
aid+0.105
bes+0.102
Neg logit contributions
 beginning-0.071
 ending-0.063
 end-0.062
 them-0.061
 only-0.061
Token band
Token ID4097
Feature activation+9.35e+00

Pos logit contributions
ons+0.049
aid+0.044
tail+0.041
quet+0.040
 exclaimed+0.037
Neg logit contributions
 ending-0.041
 beginning-0.039
sing-0.037
 size-0.036
 crashing-0.036
Token-
Token ID12
Feature activation+4.16e+00

Pos logit contributions
ons+0.596
aid+0.495
quet+0.477
tail+0.474
powers+0.439
Neg logit contributions
 ending-0.607
 beginning-0.566
sing-0.549
 crashing-0.547
 size-0.544
Tokenaid
Token ID1698
Feature activation+1.36e+01

Pos logit contributions
ons+0.080
tail+0.053
quet+0.051
bes+0.050
keeping+0.044
Neg logit contributions
 beginning-0.396
ksh-0.395
 ending-0.391
king-0.385
 crashing-0.384
Token to
Token ID284
Feature activation+1.18e+00

Pos logit contributions
ons+1.171
aid+1.051
tail+1.007
quet+0.995
bes+0.948
Neg logit contributions
 ending-0.619
 beginning-0.577
 size-0.529
 crashing-0.527
sing-0.523
Token make
Token ID787
Feature activation+6.34e-02

Pos logit contributions
ons+0.094
pill+0.086
keeping+0.086
powers+0.086
quet+0.085
Neg logit contributions
 end-0.048
 size-0.045
 only-0.045
 crashing-0.044
 beginning-0.043
Token it
Token ID340
Feature activation+1.00e+00

Pos logit contributions
ons+0.004
tail+0.004
aid+0.004
quet+0.004
grand+0.004
Neg logit contributions
 them-0.003
 beginning-0.003
 ending-0.003
 end-0.003
 size-0.003
Token better
Token ID1365
Feature activation+2.76e+00

Pos logit contributions
ons+0.080
aid+0.069
quet+0.067
tail+0.066
keeping+0.064
Neg logit contributions
 ending-0.053
 beginning-0.049
 size-0.048
sing-0.047
 crashing-0.045
Token.
Token ID13
Feature activation+2.69e-01

Pos logit contributions
ons+0.230
aid+0.201
quet+0.187
tail+0.186
 exclaimed+0.173
Neg logit contributions
 ending-0.145
 size-0.131
 beginning-0.130
sing-0.123
 crashing-0.118
Token They
Token ID1119
Feature activation-7.82e-01

Pos logit contributions
ons+0.016
aid+0.014
quet+0.014
tail+0.014
 exclaimed+0.013
Neg logit contributions
 ending-0.008
 beginning-0.008
sing-0.007
 crashing-0.007
 size-0.007
Token get
Token ID651
Feature activation+7.08e-01

Pos logit contributions
 ending+0.042
 beginning+0.039
 size+0.036
sing+0.036
 end+0.036
Neg logit contributions
ons-0.062
aid-0.054
quet-0.053
tail-0.051
keeping-0.048
Token some
Token ID617
Feature activation-2.71e-01

Pos logit contributions
ons+0.060
quet+0.053
aid+0.053
tail+0.052
bes+0.048
Neg logit contributions
 ending-0.032
 beginning-0.031
 crashing-0.028
sing-0.028
 size-0.028
Token band
Token ID4097
Feature activation+9.34e+00

Pos logit contributions
 ending+0.018
 end+0.017
 beginning+0.017
 size+0.017
 camps+0.016
Neg logit contributions
ons-0.016
quet-0.014
tail-0.014
keeping-0.014
aid-0.014
Token-
Token ID12
Feature activation+4.20e+00

Pos logit contributions
ons+0.559
quet+0.474
ner+0.449
pill+0.437
tail+0.434
Neg logit contributions
 ending-0.577
king-0.548
sing-0.538
 mate-0.521
 end-0.518
Tokenaid
Token ID1698
Feature activation+1.36e+01

Pos logit contributions
ons+0.099
quet+0.067
tail+0.064
bes+0.057
 exclaimed+0.048
Neg logit contributions
 ending-0.398
ksh-0.395
 beginning-0.388
king-0.385
 end-0.381
Tokens
Token ID82
Feature activation+2.52e+00

Pos logit contributions
ons+1.068
aid+0.937
tail+0.927
quet+0.913
bes+0.850
Neg logit contributions
 ending-0.694
 beginning-0.634
 size-0.611
 crashing-0.604
sing-0.602
Token and
Token ID290
Feature activation+1.88e-01

Pos logit contributions
ons+0.214
quet+0.189
aid+0.188
tail+0.186
 exclaimed+0.170
Neg logit contributions
 ending-0.115
 beginning-0.108
 size-0.101
sing-0.101
 crashing-0.099
Token cookies
Token ID14746
Feature activation+7.80e+00

Pos logit contributions
ons+0.015
quet+0.014
aid+0.013
handed+0.013
keeping+0.012
Neg logit contributions
 ending-0.009
 size-0.008
 end-0.008
sing-0.008
 beginning-0.008
Token.
Token ID13
Feature activation+1.94e-01

Pos logit contributions
ons+0.605
aid+0.534
tail+0.511
quet+0.502
 exclaimed+0.460
Neg logit contributions
 ending-0.426
 beginning-0.398
 size-0.375
sing-0.375
 crashing-0.369
Token They
Token ID1119
Feature activation-8.14e-01

Pos logit contributions
ons+0.017
aid+0.015
tail+0.014
quet+0.014
 exclaimed+0.013
Neg logit contributions
 ending-0.009
 beginning-0.009
sing-0.008
 size-0.008
 crashing-0.008
Token gave
Token ID2921
Feature activation+1.72e+00

Pos logit contributions
ons+0.257
aid+0.225
tail+0.213
quet+0.210
keeping+0.191
Neg logit contributions
 ending-0.224
 beginning-0.211
sing-0.200
 size-0.199
 crashing-0.196
Token Anna
Token ID11735
Feature activation+1.55e+00

Pos logit contributions
ons+0.138
aid+0.121
tail+0.115
quet+0.111
 stitches+0.106
Neg logit contributions
 ending-0.094
 beginning-0.086
sing-0.083
 size-0.081
 crashing-0.080
Token a
Token ID257
Feature activation+6.38e-01

Pos logit contributions
ons+0.126
quet+0.110
aid+0.110
tail+0.106
keeping+0.099
Neg logit contributions
 ending-0.073
 beginning-0.072
 size-0.065
 end-0.065
ksh-0.065
Token band
Token ID4097
Feature activation+9.33e+00

Pos logit contributions
ons+0.048
quet+0.040
tail+0.040
aid+0.039
keeping+0.038
Neg logit contributions
 ending-0.036
 beginning-0.033
sing-0.033
 crashing-0.032
 size-0.031
Token-
Token ID12
Feature activation+4.23e+00

Pos logit contributions
ons+0.616
quet+0.488
aid+0.478
tail+0.473
 exclaimed+0.437
Neg logit contributions
 ending-0.604
sing-0.570
 beginning-0.564
 crashing-0.550
 size-0.541
Tokenaid
Token ID1698
Feature activation+1.36e+01

Pos logit contributions
ons+0.071
quet+0.022
tail+0.019
aid+0.010
bes+0.004
Neg logit contributions
 ending-0.468
 beginning-0.456
ksh-0.445
sing-0.443
 crashing-0.443
Token and
Token ID290
Feature activation+2.50e-01

Pos logit contributions
ons+1.083
aid+0.965
tail+0.913
quet+0.903
 exclaimed+0.831
Neg logit contributions
 ending-0.721
 beginning-0.672
sing-0.638
 size-0.627
 crashing-0.619
Token a
Token ID257
Feature activation+6.51e-01

Pos logit contributions
ons+0.021
quet+0.019
aid+0.018
keeping+0.018
tail+0.018
Neg logit contributions
 ending-0.012
 size-0.010
 beginning-0.010
 camps-0.010
sing-0.010
Token candy
Token ID18550
Feature activation+9.63e+00

Pos logit contributions
ons+0.046
aid+0.037
quet+0.037
tail+0.037
keeping+0.035
Neg logit contributions
 ending-0.041
 beginning-0.038
sing-0.038
 crashing-0.037
 size-0.036
Token.
Token ID13
Feature activation+3.95e-02

Pos logit contributions
ons+0.734
aid+0.634
tail+0.603
quet+0.596
 exclaimed+0.558
Neg logit contributions
 ending-0.548
 beginning-0.511
 size-0.485
sing-0.482
 crashing-0.474
Token Anna
Token ID11735
Feature activation+1.55e+00

Pos logit contributions
ons+0.003
aid+0.003
quet+0.003
tail+0.002
keeping+0.002
Neg logit contributions
 ending-0.002
 beginning-0.002
 crashing-0.002
sing-0.002
ksh-0.002
Token He
Token ID679
Feature activation+1.56e-02

Pos logit contributions
 ending+0.002
sing+0.002
 beginning+0.002
ksh+0.002
 crashing+0.002
Neg logit contributions
ons-0.004
aid-0.003
quet-0.003
tail-0.003
 exclaimed-0.003
Token needs
Token ID2476
Feature activation+1.92e+00

Pos logit contributions
ons+0.001
quet+0.001
aid+0.001
keeping+0.001
tail+0.001
Neg logit contributions
 ending-0.001
 beginning-0.001
 size-0.001
 only-0.001
 end-0.001
Token a
Token ID257
Feature activation+5.65e-01

Pos logit contributions
ons+0.153
quet+0.142
tail+0.136
aid+0.131
bes+0.125
Neg logit contributions
 ending-0.092
 beginning-0.090
 size-0.088
 end-0.084
 crashing-0.082
Token band
Token ID4097
Feature activation+9.30e+00

Pos logit contributions
ons+0.031
aid+0.026
tail+0.024
quet+0.024
 exclaimed+0.020
Neg logit contributions
 ending-0.044
 beginning-0.042
sing-0.041
 size-0.040
 crashing-0.040
Token-
Token ID12
Feature activation+4.17e+00

Pos logit contributions
ons+0.649
quet+0.562
tail+0.511
aid+0.502
 exclaimed+0.493
Neg logit contributions
 ending-0.570
 beginning-0.508
 size-0.504
sing-0.501
 mate-0.500
Tokenaid
Token ID1698
Feature activation+1.36e+01

Pos logit contributions
ons+0.083
quet+0.049
tail+0.036
bes+0.031
 exclaimed+0.022
Neg logit contributions
 ending-0.435
 beginning-0.424
 end-0.411
ksh-0.411
 crashing-0.409
Token."
Token ID526
Feature activation+4.22e+00

Pos logit contributions
ons+1.070
aid+0.950
tail+0.900
quet+0.893
 exclaimed+0.820
Neg logit contributions
 ending-0.733
 beginning-0.683
sing-0.647
 size-0.640
 crashing-0.630
Token Lily
Token ID20037
Feature activation+2.32e+00

Pos logit contributions
ons+0.409
quet+0.382
aid+0.363
tail+0.353
odic+0.339
Neg logit contributions
 ending-0.166
 Dad-0.151
sing-0.122
ksh-0.122
 end-0.119
Token says
Token ID1139
Feature activation+9.31e-01

Pos logit contributions
ons+0.142
aid+0.123
tail+0.113
quet+0.109
 exclaimed+0.101
Neg logit contributions
 ending-0.166
 beginning-0.158
sing-0.152
 size-0.149
 crashing-0.149
Token,
Token ID11
Feature activation+4.09e-01

Pos logit contributions
ons+0.059
aid+0.051
tail+0.048
quet+0.047
 exclaimed+0.042
Neg logit contributions
 ending-0.064
 beginning-0.061
sing-0.058
 size-0.058
 crashing-0.057
Token "
Token ID366
Feature activation-3.36e+00

Pos logit contributions
ons+0.034
aid+0.031
tail+0.029
quet+0.029
 stitches+0.027
Neg logit contributions
 ending-0.019
 camps-0.017
 size-0.017
 beginning-0.017
sing-0.016
Token gave
Token ID2921
Feature activation+1.53e+00

Pos logit contributions
 ending+0.005
 beginning+0.005
 size+0.005
sing+0.004
 whale+0.004
Neg logit contributions
ons-0.005
quet-0.005
tail-0.005
aid-0.004
keeping-0.004
Token her
Token ID607
Feature activation+2.38e+00

Pos logit contributions
ons+0.110
quet+0.102
aid+0.099
tail+0.097
bes+0.089
Neg logit contributions
 ending-0.086
 beginning-0.082
 end-0.078
 size-0.078
rant-0.073
Token a
Token ID257
Feature activation+4.68e-01

Pos logit contributions
ons+0.195
quet+0.175
tail+0.170
aid+0.167
keeping+0.160
Neg logit contributions
 ending-0.111
 beginning-0.106
 end-0.099
 size-0.097
 camps-0.092
Token band
Token ID4097
Feature activation+9.31e+00

Pos logit contributions
ons+0.035
tail+0.030
quet+0.030
keeping+0.029
bes+0.028
Neg logit contributions
 ending-0.027
 beginning-0.024
sing-0.023
 end-0.023
 crashing-0.023
Token-
Token ID12
Feature activation+4.21e+00

Pos logit contributions
ons+0.571
quet+0.465
ner+0.448
tail+0.430
pill+0.413
Neg logit contributions
 ending-0.615
sing-0.575
 mate-0.558
 beginning-0.552
 end-0.547
Tokenaid
Token ID1698
Feature activation+1.36e+01

Pos logit contributions
ons+0.087
quet+0.048
tail+0.046
bes+0.041
keeping+0.036
Neg logit contributions
 ending-0.421
 beginning-0.414
ksh-0.409
 end-0.403
 crashing-0.399
Token and
Token ID290
Feature activation+1.99e-01

Pos logit contributions
ons+1.197
tail+1.041
quet+1.038
aid+1.031
bes+0.987
Neg logit contributions
 ending-0.577
 beginning-0.521
 size-0.483
 end-0.481
sing-0.479
Token a
Token ID257
Feature activation+5.02e-01

Pos logit contributions
quet+0.016
ons+0.016
tail+0.015
keeping+0.015
ner+0.014
Neg logit contributions
 ending-0.008
 size-0.007
 only-0.007
 one-0.007
 whale-0.007
Token sticker
Token ID27735
Feature activation+6.56e+00

Pos logit contributions
ons+0.035
keeping+0.030
tail+0.028
quet+0.028
bes+0.026
Neg logit contributions
 ending-0.032
 beginning-0.029
sing-0.029
 crashing-0.029
 end-0.028
Token.
Token ID13
Feature activation+2.24e-01

Pos logit contributions
ons+0.523
aid+0.446
tail+0.446
quet+0.438
keeping+0.415
Neg logit contributions
 ending-0.331
 beginning-0.304
 size-0.289
 crashing-0.278
 end-0.277
Token He
Token ID679
Feature activation-6.35e-02

Pos logit contributions
ons+0.022
quet+0.020
tail+0.019
aid+0.019
keeping+0.017
Neg logit contributions
 ending-0.008
 Dad-0.007
 beginning-0.007
 crashing-0.006
ksh-0.006
Token,
Token ID11
Feature activation+3.63e-01

Pos logit contributions
ons+0.291
aid+0.260
tail+0.248
quet+0.248
keeping+0.223
Neg logit contributions
 ending-0.195
 beginning-0.188
 size-0.177
sing-0.174
 crashing-0.171
Token and
Token ID290
Feature activation+1.62e-01

Pos logit contributions
ons+0.030
aid+0.027
quet+0.025
tail+0.025
keeping+0.023
Neg logit contributions
 ending-0.018
 beginning-0.017
sing-0.016
 size-0.016
 crashing-0.016
Token a
Token ID257
Feature activation+5.65e-01

Pos logit contributions
ons+0.012
quet+0.011
aid+0.011
tail+0.010
keeping+0.010
Neg logit contributions
 ending-0.009
 beginning-0.008
 size-0.008
sing-0.008
 crashing-0.008
Token band
Token ID4097
Feature activation+9.32e+00

Pos logit contributions
ons+0.042
aid+0.036
quet+0.034
tail+0.034
keeping+0.032
Neg logit contributions
 ending-0.034
 beginning-0.032
 size-0.030
sing-0.030
 crashing-0.030
Token-
Token ID12
Feature activation+4.09e+00

Pos logit contributions
ons+0.442
quet+0.335
aid+0.330
tail+0.303
keeping+0.271
Neg logit contributions
 ending-0.773
 beginning-0.727
sing-0.723
 size-0.717
 crashing-0.715
Tokenaid
Token ID1698
Feature activation+1.36e+01

Pos logit contributions
ons+0.091
quet+0.053
tail+0.044
bes+0.039
keeping+0.038
Neg logit contributions
 beginning-0.404
 ending-0.404
ksh-0.399
 crashing-0.391
rant-0.387
Token on
Token ID319
Feature activation+2.63e+00

Pos logit contributions
ons+1.055
aid+0.939
tail+0.896
quet+0.895
bes+0.807
Neg logit contributions
 ending-0.726
 beginning-0.684
 size-0.645
sing-0.644
 crashing-0.634
Token his
Token ID465
Feature activation-3.45e-01

Pos logit contributions
ons+0.185
aid+0.160
tail+0.152
quet+0.148
 stitches+0.135
Neg logit contributions
 ending-0.164
 beginning-0.154
sing-0.149
 size-0.145
ksh-0.144
Token arm
Token ID3211
Feature activation+2.84e+00

Pos logit contributions
 ending+0.016
sing+0.015
 beginning+0.015
 size+0.015
 end+0.013
Neg logit contributions
ons-0.030
aid-0.027
quet-0.026
tail-0.024
 exclaimed-0.023
Token.
Token ID13
Feature activation+1.47e-01

Pos logit contributions
ons+0.230
quet+0.206
aid+0.205
tail+0.189
bes+0.182
Neg logit contributions
 ending-0.130
 beginning-0.116
 size-0.114
 end-0.108
 crashing-0.108
Token She
Token ID1375
Feature activation+3.44e+00

Pos logit contributions
ons+0.011
aid+0.010
quet+0.010
tail+0.009
 stitches+0.008
Neg logit contributions
 ending-0.008
 beginning-0.007
 crashing-0.007
sing-0.007
ksh-0.007
Token and
Token ID290
Feature activation+1.39e-01

Pos logit contributions
ons+0.028
aid+0.025
quet+0.024
keeping+0.024
tail+0.023
Neg logit contributions
 ending-0.011
 beginning-0.011
 size-0.010
sing-0.010
 end-0.009
Token wrapped
Token ID12908
Feature activation+2.73e+00

Pos logit contributions
ons+0.010
quet+0.009
tail+0.008
aid+0.008
ner+0.008
Neg logit contributions
 ending-0.008
 size-0.007
 beginning-0.007
 only-0.007
 captain-0.007
Token a
Token ID257
Feature activation+4.83e-01

Pos logit contributions
ons+0.185
aid+0.162
tail+0.148
quet+0.143
 stitches+0.136
Neg logit contributions
 ending-0.183
 beginning-0.170
sing-0.165
 camps-0.160
 size-0.159
Token band
Token ID4097
Feature activation+9.39e+00

Pos logit contributions
ons+0.034
tail+0.030
quet+0.028
keeping+0.028
aid+0.027
Neg logit contributions
 ending-0.027
 size-0.026
 crashing-0.025
 beginning-0.025
 end-0.025
Token-
Token ID12
Feature activation+4.18e+00

Pos logit contributions
ons+0.516
quet+0.401
aid+0.388
tail+0.363
 stitches+0.347
Neg logit contributions
 ending-0.692
 beginning-0.665
sing-0.647
 crashing-0.635
king-0.633
Tokenaid
Token ID1698
Feature activation+1.36e+01

Pos logit contributions
ons+0.080
quet+0.047
tail+0.044
bes+0.027
 exclaimed+0.024
Neg logit contributions
 ending-0.418
 beginning-0.415
ksh-0.413
rant-0.405
 crashing-0.402
Token around
Token ID1088
Feature activation-2.30e+00

Pos logit contributions
ons+0.982
aid+0.865
tail+0.813
quet+0.803
 exclaimed+0.730
Neg logit contributions
 ending-0.818
 beginning-0.770
sing-0.734
 size-0.724
 crashing-0.716
Token it
Token ID340
Feature activation+7.82e-01

Pos logit contributions
 ending+0.143
 beginning+0.134
sing+0.129
 camps+0.126
ksh+0.126
Neg logit contributions
ons-0.163
aid-0.141
tail-0.134
quet-0.130
keeping-0.120
Token.
Token ID13
Feature activation+2.04e-01

Pos logit contributions
ons+0.067
aid+0.059
quet+0.056
tail+0.055
keeping+0.053
Neg logit contributions
 ending-0.037
 beginning-0.036
 size-0.034
sing-0.033
 end-0.031
Token
Token ID198
Feature activation+7.78e-01

Pos logit contributions
ons+0.014
aid+0.012
quet+0.012
tail+0.011
 stitches+0.010
Neg logit contributions
 ending-0.013
 beginning-0.012
sing-0.012
 crashing-0.012
 size-0.012
Token
Token ID198
Feature activation+7.71e-01

Pos logit contributions
ons+0.062
tail+0.059
quet+0.058
aid+0.057
 stitches+0.051
Neg logit contributions
 ending-0.032
 crashing-0.031
 end-0.030
 them-0.029
 camps-0.029
Token I
Token ID314
Feature activation+1.13e+00

Pos logit contributions
ons+0.009
aid+0.008
quet+0.008
tail+0.007
keeping+0.007
Neg logit contributions
 ending-0.006
sing-0.006
 beginning-0.006
ksh-0.006
 crashing-0.006
Token need
Token ID761
Feature activation+3.23e+00

Pos logit contributions
ons+0.096
quet+0.084
aid+0.082
tail+0.079
keeping+0.076
Neg logit contributions
 ending-0.057
 beginning-0.051
sing-0.049
 size-0.048
 end-0.046
Token a
Token ID257
Feature activation+6.00e-01

Pos logit contributions
ons+0.252
quet+0.241
tail+0.237
bes+0.228
keeping+0.219
Neg logit contributions
 beginning-0.147
 ending-0.146
 end-0.141
 size-0.139
 only-0.131
Token band
Token ID4097
Feature activation+9.34e+00

Pos logit contributions
ons+0.044
quet+0.035
tail+0.035
aid+0.035
keeping+0.033
Neg logit contributions
 ending-0.039
 beginning-0.036
 size-0.034
sing-0.034
 crashing-0.034
Token-
Token ID12
Feature activation+4.21e+00

Pos logit contributions
ons+0.501
quet+0.412
pill+0.371
tail+0.348
ner+0.339
Neg logit contributions
 ending-0.708
 mate-0.651
 beginning-0.643
 end-0.637
 captain-0.636
Tokenaid
Token ID1698
Feature activation+1.36e+01

Pos logit contributions
ons+0.097
quet+0.064
bes+0.062
tail+0.051
keeping+0.046
Neg logit contributions
 ending-0.411
 beginning-0.407
 end-0.397
 crashing-0.388
ksh-0.384
Token,
Token ID11
Feature activation+4.57e-01

Pos logit contributions
ons+1.200
aid+1.055
quet+1.053
tail+1.038
bes+0.989
Neg logit contributions
 ending-0.586
 beginning-0.537
 size-0.496
sing-0.481
 end-0.480
Token but
Token ID475
Feature activation-3.60e+00

Pos logit contributions
ons+0.042
aid+0.038
quet+0.037
tail+0.036
keeping+0.034
Neg logit contributions
 ending-0.018
 beginning-0.017
sing-0.016
 crashing-0.015
 size-0.015
Token I
Token ID314
Feature activation+1.16e+00

Pos logit contributions
 ending+0.237
 beginning+0.224
 size+0.212
sing+0.210
 crashing+0.207
Neg logit contributions
ons-0.241
aid-0.211
quet-0.200
tail-0.199
keeping-0.180
Token can
Token ID460
Feature activation-3.15e-01

Pos logit contributions
ons+0.104
quet+0.093
aid+0.090
tail+0.086
keeping+0.085
Neg logit contributions
 ending-0.052
 beginning-0.046
sing-0.044
 size-0.044
 end-0.042
Token't
Token ID470
Feature activation+1.87e+00

Pos logit contributions
 end+0.016
 ending+0.016
 beginning+0.016
 only+0.016
 size+0.015
Neg logit contributions
ons-0.024
keeping-0.021
aid-0.020
quet-0.020
tail-0.019
Token He
Token ID679
Feature activation+9.70e-02

Pos logit contributions
ons+0.010
aid+0.009
quet+0.009
tail+0.009
keeping+0.008
Neg logit contributions
 ending-0.006
 beginning-0.005
sing-0.005
 crashing-0.005
 size-0.005
Token finds
Token ID7228
Feature activation-2.26e+00

Pos logit contributions
ons+0.007
aid+0.006
tail+0.006
quet+0.006
 exclaimed+0.005
Neg logit contributions
 ending-0.006
 beginning-0.005
sing-0.005
 size-0.005
 crashing-0.005
Token a
Token ID257
Feature activation+6.19e-01

Pos logit contributions
 ending+0.139
 beginning+0.129
sing+0.125
 size+0.121
 crashing+0.121
Neg logit contributions
ons-0.165
aid-0.145
tail-0.134
quet-0.131
 stitches-0.123
Token band
Token ID4097
Feature activation+9.33e+00

Pos logit contributions
ons+0.045
aid+0.040
tail+0.037
quet+0.037
keeping+0.034
Neg logit contributions
 ending-0.037
 beginning-0.035
sing-0.033
 size-0.033
 crashing-0.033
Token-
Token ID12
Feature activation+4.19e+00

Pos logit contributions
ons+0.586
aid+0.491
quet+0.469
tail+0.466
 exclaimed+0.426
Neg logit contributions
 ending-0.625
 beginning-0.585
sing-0.576
 crashing-0.573
 size-0.571
Tokenaid
Token ID1698
Feature activation+1.36e+01

Pos logit contributions
ons+0.102
tail+0.060
quet+0.058
bes+0.045
keeping+0.042
Neg logit contributions
 ending-0.412
 beginning-0.406
ksh-0.401
 crashing-0.396
rant-0.392
Token in
Token ID287
Feature activation-8.34e-01

Pos logit contributions
ons+1.012
aid+0.894
tail+0.841
quet+0.831
 exclaimed+0.759
Neg logit contributions
 ending-0.791
 beginning-0.741
sing-0.708
 size-0.696
 crashing-0.686
Token his
Token ID465
Feature activation-3.44e-01

Pos logit contributions
 ending+0.047
 beginning+0.044
sing+0.042
 size+0.041
 crashing+0.041
Neg logit contributions
ons-0.064
aid-0.057
tail-0.053
quet-0.053
 exclaimed-0.048
Token pocket
Token ID10000
Feature activation+9.35e+00

Pos logit contributions
sing+0.016
 ending+0.015
 beginning+0.015
 size+0.015
 end+0.014
Neg logit contributions
ons-0.029
quet-0.027
aid-0.026
ner-0.024
keeping-0.024
Token and
Token ID290
Feature activation+1.79e-01

Pos logit contributions
ons+0.863
aid+0.813
tail+0.769
quet+0.763
powers+0.704
Neg logit contributions
 beginning-0.335
 size-0.334
 ending-0.330
 end-0.297
 crashing-0.280
Token he
Token ID339
Feature activation+1.07e-01

Pos logit contributions
ons+0.014
quet+0.013
aid+0.012
tail+0.012
keeping+0.011
Neg logit contributions
 ending-0.009
 size-0.009
 beginning-0.009
 camps-0.008
sing-0.008
Token and
Token ID290
Feature activation+1.92e-01

Pos logit contributions
ons+0.104
aid+0.092
quet+0.089
tail+0.088
keeping+0.083
Neg logit contributions
 ending-0.061
 beginning-0.060
 size-0.056
sing-0.056
 crashing-0.054
Token puts
Token ID7584
Feature activation+5.15e-01

Pos logit contributions
ons+0.014
aid+0.013
quet+0.013
tail+0.012
keeping+0.012
Neg logit contributions
 ending-0.010
 size-0.010
 beginning-0.010
sing-0.010
 crashing-0.009
Token a
Token ID257
Feature activation+5.54e-01

Pos logit contributions
ons+0.037
aid+0.032
tail+0.029
 stitches+0.028
quet+0.027
Neg logit contributions
 ending-0.034
sing-0.030
 beginning-0.030
 camps-0.029
 size-0.028
Token band
Token ID4097
Feature activation+9.32e+00

Pos logit contributions
ons+0.038
keeping+0.032
tail+0.031
quet+0.031
aid+0.031
Neg logit contributions
 ending-0.034
 beginning-0.033
sing-0.032
 size-0.032
 crashing-0.032
Token-
Token ID12
Feature activation+4.17e+00

Pos logit contributions
ons+0.514
quet+0.409
tail+0.382
aid+0.373
ner+0.362
Neg logit contributions
 ending-0.642
 beginning-0.631
sing-0.619
 crashing-0.614
king-0.612
Tokenaid
Token ID1698
Feature activation+1.36e+01

Pos logit contributions
ons+0.070
quet+0.034
tail+0.034
bes+0.022
keeping+0.017
Neg logit contributions
 beginning-0.432
 ending-0.428
ksh-0.427
 crashing-0.419
rant-0.416
Token on
Token ID319
Feature activation+2.49e+00

Pos logit contributions
ons+0.984
aid+0.858
tail+0.820
quet+0.813
 exclaimed+0.739
Neg logit contributions
 ending-0.802
 beginning-0.763
sing-0.718
 size-0.716
 crashing-0.711
Token it
Token ID340
Feature activation+8.56e-01

Pos logit contributions
ons+0.169
aid+0.142
tail+0.136
quet+0.127
 stitches+0.120
Neg logit contributions
 ending-0.168
 beginning-0.155
sing-0.152
ksh-0.149
 camps-0.147
Token.
Token ID13
Feature activation+7.40e-02

Pos logit contributions
ons+0.074
aid+0.066
quet+0.063
tail+0.062
keeping+0.058
Neg logit contributions
 ending-0.039
 beginning-0.038
 size-0.036
sing-0.034
 end-0.032
Token She
Token ID1375
Feature activation+3.42e+00

Pos logit contributions
ons+0.005
aid+0.004
quet+0.004
tail+0.004
keeping+0.003
Neg logit contributions
 ending-0.005
 beginning-0.005
sing-0.005
 crashing-0.005
 size-0.005
Token kisses
Token ID36544
Feature activation+6.03e+00

Pos logit contributions
ons+0.226
aid+0.195
tail+0.180
quet+0.175
 stitches+0.163
Neg logit contributions
 ending-0.232
 beginning-0.219
sing-0.210
 crashing-0.206
 size-0.205
Token get
Token ID651
Feature activation+7.95e-01

Pos logit contributions
sing+0.036
 end+0.036
 beginning+0.035
 ending+0.035
 captain+0.035
Neg logit contributions
ons-0.067
keeping-0.057
quet-0.057
aid-0.054
bes-0.054
Token you
Token ID345
Feature activation+4.58e+00

Pos logit contributions
ons+0.054
quet+0.047
aid+0.047
tail+0.047
bes+0.042
Neg logit contributions
 ending-0.048
 beginning-0.047
 size-0.044
 crashing-0.043
sing-0.043
Token a
Token ID257
Feature activation+7.04e-01

Pos logit contributions
ons+0.378
quet+0.349
tail+0.332
aid+0.325
odic+0.314
Neg logit contributions
 beginning-0.197
 ending-0.194
 end-0.189
 size-0.183
sing-0.179
Token band
Token ID4097
Feature activation+9.32e+00

Pos logit contributions
ons+0.048
aid+0.042
tail+0.039
quet+0.038
 exclaimed+0.035
Neg logit contributions
 ending-0.046
 beginning-0.043
sing-0.041
 size-0.041
 crashing-0.040
Token-
Token ID12
Feature activation+4.21e+00

Pos logit contributions
ons+0.584
quet+0.470
tail+0.449
aid+0.446
pill+0.440
Neg logit contributions
 ending-0.626
 captain-0.572
sing-0.571
 mate-0.571
 end-0.565
Tokenaid
Token ID1698
Feature activation+1.36e+01

Pos logit contributions
ons+0.089
quet+0.048
tail+0.045
bes+0.041
keeping+0.037
Neg logit contributions
 ending-0.423
 beginning-0.414
ksh-0.410
 end-0.407
 crashing-0.403
Token and
Token ID290
Feature activation+2.42e-01

Pos logit contributions
ons+1.088
aid+0.958
tail+0.940
quet+0.929
bes+0.885
Neg logit contributions
 ending-0.684
 beginning-0.601
 size-0.589
 end-0.589
sing-0.572
Token a
Token ID257
Feature activation+7.40e-01

Pos logit contributions
ons+0.021
quet+0.019
keeping+0.018
handed+0.018
bes+0.017
Neg logit contributions
 ending-0.009
 captain-0.009
 only-0.009
 end-0.009
 size-0.009
Token kiss
Token ID9245
Feature activation+2.51e+00

Pos logit contributions
ons+0.051
aid+0.042
tail+0.041
quet+0.040
keeping+0.038
Neg logit contributions
 ending-0.049
 beginning-0.045
sing-0.044
 size-0.043
 crashing-0.043
Token."
Token ID526
Feature activation+4.21e+00

Pos logit contributions
ons+0.164
aid+0.142
quet+0.131
tail+0.131
 exclaimed+0.119
Neg logit contributions
 ending-0.167
 beginning-0.158
 size-0.150
sing-0.150
 crashing-0.148
Token
Token ID198
Feature activation+8.14e-01

Pos logit contributions
ons+0.364
quet+0.338
aid+0.333
tail+0.307
odic+0.292
Neg logit contributions
 ending-0.173
 Dad-0.173
sing-0.154
 father-0.154
 whale-0.150
Token They
Token ID1119
Feature activation-1.10e+00

Pos logit contributions
 ending+0.006
 beginning+0.006
sing+0.005
 size+0.005
 crashing+0.005
Neg logit contributions
ons-0.010
aid-0.009
tail-0.008
quet-0.008
 exclaimed-0.008
Token need
Token ID761
Feature activation+2.86e+00

Pos logit contributions
 ending+0.056
 beginning+0.052
 end+0.050
sing+0.049
 size+0.049
Neg logit contributions
ons-0.089
aid-0.077
quet-0.077
tail-0.073
keeping-0.069
Token a
Token ID257
Feature activation+5.26e-01

Pos logit contributions
ons+0.234
tail+0.210
quet+0.206
aid+0.204
bes+0.189
Neg logit contributions
 ending-0.139
 beginning-0.131
 size-0.127
 crashing-0.121
 end-0.118
Token band
Token ID4097
Feature activation+9.19e+00

Pos logit contributions
ons+0.037
aid+0.033
tail+0.030
quet+0.030
 stitches+0.027
Neg logit contributions
 ending-0.032
 beginning-0.031
sing-0.029
 size-0.029
 crashing-0.028
Token-
Token ID12
Feature activation+4.25e+00

Pos logit contributions
ons+0.508
quet+0.408
tail+0.370
ner+0.359
pill+0.358
Neg logit contributions
 ending-0.686
 mate-0.649
sing-0.629
 captain-0.626
 beginning-0.622
Tokenaid
Token ID1698
Feature activation+1.36e+01

Pos logit contributions
ons+0.085
tail+0.041
quet+0.040
bes+0.032
 exclaimed+0.021
Neg logit contributions
 ending-0.443
 beginning-0.431
 mate-0.422
 crashing-0.418
 end-0.417
Token and
Token ID290
Feature activation+1.22e-01

Pos logit contributions
ons+1.134
aid+0.996
tail+0.969
quet+0.964
bes+0.889
Neg logit contributions
 ending-0.661
 beginning-0.605
sing-0.573
 size-0.569
 crashing-0.555
Token a
Token ID257
Feature activation+5.92e-01

Pos logit contributions
ons+0.011
quet+0.010
tail+0.009
aid+0.009
ner+0.009
Neg logit contributions
 ending-0.005
 size-0.004
 beginning-0.004
sing-0.004
 end-0.004
Token hug
Token ID16225
Feature activation+1.50e+00

Pos logit contributions
ons+0.039
aid+0.032
tail+0.031
quet+0.031
keeping+0.028
Neg logit contributions
 ending-0.041
 beginning-0.038
sing-0.037
 size-0.036
 crashing-0.036
Token.
Token ID13
Feature activation+3.44e-02

Pos logit contributions
ons+0.101
aid+0.089
tail+0.084
quet+0.081
keeping+0.075
Neg logit contributions
 ending-0.096
 beginning-0.090
sing-0.089
 size-0.085
 camps-0.085
Token They
Token ID1119
Feature activation-9.95e-01

Pos logit contributions
ons+0.003
aid+0.003
quet+0.003
tail+0.003
 stitches+0.002
Neg logit contributions
 ending-0.002
 beginning-0.001
sing-0.001
 size-0.001
 crashing-0.001
Token and
Token ID290
Feature activation+2.99e-01

Pos logit contributions
ons+0.060
aid+0.052
quet+0.050
tail+0.050
keeping+0.047
Neg logit contributions
 ending-0.038
 beginning-0.036
sing-0.033
 size-0.033
 crashing-0.033
Token put
Token ID1234
Feature activation+5.01e-01

Pos logit contributions
ons+0.021
quet+0.019
tail+0.019
keeping+0.018
powers+0.017
Neg logit contributions
 ending-0.018
 size-0.016
 beginning-0.016
 only-0.015
 end-0.015
Token a
Token ID257
Feature activation+5.82e-01

Pos logit contributions
ons+0.034
aid+0.030
quet+0.029
tail+0.029
bes+0.026
Neg logit contributions
 ending-0.030
 beginning-0.030
 size-0.028
 crashing-0.027
sing-0.027
Token band
Token ID4097
Feature activation+9.36e+00

Pos logit contributions
ons+0.036
keeping+0.029
tail+0.028
quet+0.027
bes+0.025
Neg logit contributions
 ending-0.042
 beginning-0.040
 crashing-0.039
 end-0.039
 size-0.039
Token-
Token ID12
Feature activation+4.19e+00

Pos logit contributions
ons+0.527
quet+0.408
tail+0.375
pill+0.374
ner+0.364
Neg logit contributions
 ending-0.655
 beginning-0.638
 crashing-0.629
sing-0.617
king-0.614
Tokenaid
Token ID1698
Feature activation+1.36e+01

Pos logit contributions
ons+0.082
quet+0.042
tail+0.039
bes+0.032
keeping+0.025
Neg logit contributions
 beginning-0.429
 ending-0.429
ksh-0.421
 crashing-0.418
 end-0.413
Token on
Token ID319
Feature activation+2.63e+00

Pos logit contributions
ons+1.025
aid+0.877
tail+0.851
quet+0.847
bes+0.782
Neg logit contributions
 ending-0.759
 beginning-0.728
 crashing-0.676
 size-0.673
sing-0.673
Token it
Token ID340
Feature activation+8.07e-01

Pos logit contributions
ons+0.164
aid+0.139
tail+0.131
quet+0.127
 stitches+0.114
Neg logit contributions
 ending-0.186
 beginning-0.176
sing-0.169
ksh-0.167
 camps-0.166
Token.
Token ID13
Feature activation+1.85e-01

Pos logit contributions
ons+0.071
aid+0.063
quet+0.060
tail+0.058
keeping+0.056
Neg logit contributions
 ending-0.036
 beginning-0.035
 size-0.033
sing-0.031
 crashing-0.030
Token Mia
Token ID32189
Feature activation-4.20e-02

Pos logit contributions
ons+0.014
aid+0.012
quet+0.012
tail+0.012
keeping+0.011
Neg logit contributions
 ending-0.011
 beginning-0.010
sing-0.010
 size-0.009
 crashing-0.009
Token felt
Token ID2936
Feature activation+2.09e+00

Pos logit contributions
 ending+0.002
 size+0.002
 beginning+0.002
 end+0.002
 only+0.002
Neg logit contributions
ons-0.003
quet-0.003
aid-0.003
tail-0.003
keeping-0.003
Token
Token ID198
Feature activation-2.88e+00

Pos logit contributions
ons+0.136
aid+0.117
tail+0.112
quet+0.110
 exclaimed+0.101
Neg logit contributions
 ending-0.104
 beginning-0.097
sing-0.093
 crashing-0.093
 size-0.092
Token
Token ID198
Feature activation-2.00e+00

Pos logit contributions
 ending+0.144
 beginning+0.136
 crashing+0.133
 mate+0.132
 end+0.131
Neg logit contributions
ons-0.220
aid-0.196
tail-0.187
quet-0.187
 stitches-0.180
TokenBut
Token ID1537
Feature activation-3.17e+00

Pos logit contributions
 ending+0.101
 beginning+0.097
 crashing+0.095
 mate+0.094
 end+0.092
Neg logit contributions
ons-0.153
aid-0.134
tail-0.128
quet-0.124
 stitches-0.124
Token the
Token ID262
Feature activation-6.13e-01

Pos logit contributions
 ending+0.166
 beginning+0.155
sing+0.146
 size+0.144
 crashing+0.141
Neg logit contributions
ons-0.255
aid-0.228
tail-0.214
quet-0.213
 stitches-0.195
Token sleepy
Token ID42023
Feature activation-1.13e+00

Pos logit contributions
 ending+0.031
 beginning+0.030
ksh+0.029
sing+0.028
 crashing+0.028
Neg logit contributions
ons-0.047
aid-0.043
tail-0.040
quet-0.039
 stitches-0.037
Token whale
Token ID22206
Feature activation-1.21e+01

Pos logit contributions
 ending+0.056
 beginning+0.055
sing+0.052
ksh+0.051
 crashing+0.051
Neg logit contributions
ons-0.089
aid-0.079
tail-0.075
quet-0.074
 stitches-0.069
Token was
Token ID373
Feature activation-4.56e-01

Pos logit contributions
 ending+0.644
 beginning+0.597
 size+0.560
sing+0.558
 crashing+0.542
Neg logit contributions
ons-0.959
aid-0.874
quet-0.820
tail-0.817
keeping-0.749
Token too
Token ID1165
Feature activation+5.47e-01

Pos logit contributions
 ending+0.027
ksh+0.025
 beginning+0.024
sing+0.024
 size+0.024
Neg logit contributions
ons-0.032
aid-0.028
tail-0.026
quet-0.025
 exclaimed-0.024
Token tired
Token ID10032
Feature activation-1.25e-02

Pos logit contributions
ons+0.040
aid+0.035
tail+0.033
quet+0.033
 exclaimed+0.030
Neg logit contributions
 ending-0.032
 beginning-0.030
sing-0.029
 size-0.029
 crashing-0.028
Token to
Token ID284
Feature activation+5.36e-01

Pos logit contributions
 ending+0.001
 beginning+0.001
sing+0.001
ksh+0.001
 whale+0.001
Neg logit contributions
ons-0.001
aid-0.001
tail-0.001
quet-0.001
chair-0.001
Token play
Token ID711
Feature activation+2.63e+00

Pos logit contributions
ons+0.044
aid+0.039
quet+0.037
tail+0.036
keeping+0.034
Neg logit contributions
 ending-0.028
 beginning-0.026
sing-0.025
 size-0.024
 crashing-0.024
Token,
Token ID11
Feature activation-1.44e+00

Pos logit contributions
 ending+0.388
 beginning+0.367
 crashing+0.343
 size+0.340
sing+0.330
Neg logit contributions
ons-0.472
aid-0.444
tail-0.408
quet-0.404
 stitches-0.362
Token there
Token ID612
Feature activation-1.96e+00

Pos logit contributions
 ending+0.081
 beginning+0.077
sing+0.072
 size+0.071
 crashing+0.071
Neg logit contributions
ons-0.109
aid-0.098
quet-0.091
tail-0.090
 stitches-0.082
Token was
Token ID373
Feature activation-1.00e+00

Pos logit contributions
 ending+0.116
 beginning+0.109
sing+0.107
ksh+0.105
 crashing+0.104
Neg logit contributions
ons-0.145
aid-0.123
tail-0.117
quet-0.114
 exclaimed-0.109
Token a
Token ID257
Feature activation-6.06e-01

Pos logit contributions
 ending+0.058
 beginning+0.054
sing+0.052
 size+0.051
 crashing+0.050
Neg logit contributions
ons-0.075
aid-0.066
tail-0.062
quet-0.061
 exclaimed-0.056
Token big
Token ID1263
Feature activation-2.62e+00

Pos logit contributions
 ending+0.032
 beginning+0.031
ksh+0.030
rant+0.029
sing+0.029
Neg logit contributions
ons-0.045
aid-0.040
tail-0.039
quet-0.038
 exclaimed-0.035
Token whale
Token ID22206
Feature activation-1.18e+01

Pos logit contributions
 ending+0.152
 beginning+0.147
ksh+0.144
sing+0.138
 crashing+0.137
Neg logit contributions
ons-0.186
aid-0.159
tail-0.156
quet-0.150
 stitches-0.138
Token who
Token ID508
Feature activation-1.11e-01

Pos logit contributions
 ending+0.683
 beginning+0.648
sing+0.613
 crashing+0.608
 size+0.606
Neg logit contributions
ons-0.867
aid-0.785
tail-0.727
quet-0.720
 stitches-0.662
Token lived
Token ID5615
Feature activation+2.18e+00

Pos logit contributions
 ending+0.006
 beginning+0.006
sing+0.005
 size+0.005
 crashing+0.005
Neg logit contributions
ons-0.009
aid-0.008
tail-0.007
quet-0.007
 exclaimed-0.007
Token in
Token ID287
Feature activation-1.51e+00

Pos logit contributions
ons+0.163
aid+0.142
quet+0.133
tail+0.131
 exclaimed+0.123
Neg logit contributions
 ending-0.129
sing-0.112
ksh-0.111
 beginning-0.110
 size-0.106
Token the
Token ID262
Feature activation+6.03e-01

Pos logit contributions
 ending+0.091
 beginning+0.085
sing+0.081
 size+0.081
 crashing+0.079
Neg logit contributions
ons-0.109
aid-0.097
tail-0.091
quet-0.090
 exclaimed-0.082
Token ocean
Token ID9151
Feature activation-6.72e+00

Pos logit contributions
ons+0.049
aid+0.040
tail+0.040
handed+0.039
 exclaimed+0.038
Neg logit contributions
 ending-0.028
ksh-0.027
sing-0.027
king-0.025
 crashing-0.024
Token,
Token ID11
Feature activation-1.83e+00

Pos logit contributions
ons+0.047
aid+0.041
tail+0.038
quet+0.037
 exclaimed+0.035
Neg logit contributions
 ending-0.046
 beginning-0.044
sing-0.042
 size-0.042
 camps-0.041
Token "
Token ID366
Feature activation-5.01e+00

Pos logit contributions
 ending+0.078
 beginning+0.071
 size+0.066
sing+0.066
 camps+0.064
Neg logit contributions
ons-0.163
aid-0.147
tail-0.141
quet-0.139
 exclaimed-0.131
TokenHi
Token ID17250
Feature activation-7.18e+00

Pos logit contributions
 ending+0.292
 beginning+0.268
sing+0.266
 camps+0.262
 size+0.256
Neg logit contributions
ons-0.371
aid-0.347
tail-0.321
quet-0.311
 exclaimed-0.287
Token,
Token ID11
Feature activation-9.40e-01

Pos logit contributions
 ending+0.522
 beginning+0.504
sing+0.483
 size+0.475
 crashing+0.469
Neg logit contributions
ons-0.428
aid-0.350
tail-0.349
quet-0.333
 exclaimed-0.299
Token pink
Token ID11398
Feature activation-6.17e+00

Pos logit contributions
 ending+0.064
 beginning+0.061
 size+0.060
sing+0.059
ksh+0.059
Neg logit contributions
ons-0.059
aid-0.052
quet-0.047
tail-0.047
 exclaimed-0.042
Token whale
Token ID22206
Feature activation-1.18e+01

Pos logit contributions
 ending+0.349
 beginning+0.327
sing+0.311
 size+0.307
 crashing+0.303
Neg logit contributions
ons-0.469
aid-0.416
tail-0.392
quet-0.387
 exclaimed-0.355
Token!
Token ID0
Feature activation-1.47e+00

Pos logit contributions
 ending+0.767
 beginning+0.725
sing+0.697
 size+0.691
 crashing+0.683
Neg logit contributions
ons-0.786
aid-0.694
tail-0.638
quet-0.636
keeping-0.576
Token You
Token ID921
Feature activation-1.38e+00

Pos logit contributions
 ending+0.105
 size+0.098
 beginning+0.096
sing+0.091
 camps+0.091
Neg logit contributions
ons-0.087
tail-0.076
aid-0.076
quet-0.068
 exclaimed-0.065
Token're
Token ID821
Feature activation-2.33e+00

Pos logit contributions
 ending+0.080
 beginning+0.075
sing+0.071
 size+0.070
 crashing+0.069
Neg logit contributions
ons-0.104
aid-0.092
tail-0.086
quet-0.085
 exclaimed-0.078
Token so
Token ID523
Feature activation+3.30e+00

Pos logit contributions
 ending+0.162
 beginning+0.147
 camps+0.146
ksh+0.145
sing+0.143
Neg logit contributions
ons-0.143
aid-0.121
tail-0.113
quet-0.109
 exclaimed-0.105
Token pretty
Token ID2495
Feature activation+2.52e-02

Pos logit contributions
ons+0.265
aid+0.236
tail+0.226
quet+0.222
keeping+0.205
Neg logit contributions
 ending-0.173
 beginning-0.163
sing-0.157
 crashing-0.152
 size-0.152
Token the
Token ID262
Feature activation-2.14e+00

Pos logit contributions
 size+0.193
 beginning+0.191
 ending+0.189
 camps+0.188
sing+0.184
Neg logit contributions
ons-0.425
aid-0.417
quet-0.392
tail-0.376
odic-0.355
Token sailor
Token ID43272
Feature activation-7.42e+00

Pos logit contributions
 ending+0.114
 beginning+0.107
sing+0.102
ksh+0.099
 size+0.099
Neg logit contributions
ons-0.167
aid-0.147
tail-0.140
quet-0.138
 stitches-0.127
Token invited
Token ID9392
Feature activation-1.70e+00

Pos logit contributions
 ending+0.401
 beginning+0.385
 size+0.357
 crashing+0.357
sing+0.355
Neg logit contributions
ons-0.564
aid-0.522
quet-0.485
tail-0.476
 stitches-0.438
Token the
Token ID262
Feature activation-2.99e-01

Pos logit contributions
 ending+0.084
 beginning+0.078
 size+0.073
sing+0.072
 crashing+0.072
Neg logit contributions
ons-0.140
aid-0.127
tail-0.120
quet-0.119
bes-0.109
Token big
Token ID1263
Feature activation-2.70e+00

Pos logit contributions
 ending+0.015
 beginning+0.014
sing+0.014
ksh+0.013
 size+0.013
Neg logit contributions
ons-0.024
aid-0.021
quet-0.020
tail-0.020
 exclaimed-0.019
Token whale
Token ID22206
Feature activation-1.18e+01

Pos logit contributions
 ending+0.115
 beginning+0.107
sing+0.103
ksh+0.098
 size+0.096
Neg logit contributions
ons-0.240
aid-0.214
tail-0.203
quet-0.202
 stitches-0.189
Token.
Token ID13
Feature activation-9.44e-01

Pos logit contributions
 ending+0.651
 beginning+0.611
sing+0.578
 size+0.570
 crashing+0.565
Neg logit contributions
ons-0.902
aid-0.806
tail-0.760
quet-0.752
bes-0.685
Token The
Token ID383
Feature activation+8.83e-01

Pos logit contributions
 ending+0.056
 beginning+0.054
 crashing+0.051
sing+0.051
ksh+0.050
Neg logit contributions
ons-0.068
aid-0.061
tail-0.056
quet-0.056
 stitches-0.051
Token whale
Token ID22206
Feature activation-1.15e+01

Pos logit contributions
ons+0.077
aid+0.067
quet+0.064
tail+0.062
 exclaimed+0.062
Neg logit contributions
ksh-0.034
 ending-0.031
sing-0.030
rant-0.029
 beginning-0.028
Token said
Token ID531
Feature activation+2.90e+00

Pos logit contributions
 ending+0.650
 beginning+0.611
sing+0.575
 size+0.573
 crashing+0.561
Neg logit contributions
ons-0.865
aid-0.777
tail-0.737
quet-0.727
keeping-0.667
Token
Token ID6184
Feature activation+7.31e-01

Pos logit contributions
ons+0.196
aid+0.171
tail+0.157
quet+0.157
 exclaimed+0.146
Neg logit contributions
 ending-0.184
 beginning-0.174
sing-0.170
 size-0.168
ksh-0.166
Token!
Token ID0
Feature activation-1.33e+00

Pos logit contributions
 ending+0.084
 beginning+0.079
 size+0.076
sing+0.075
 crashing+0.074
Neg logit contributions
ons-0.083
aid-0.074
tail-0.069
quet-0.067
 stitches-0.060
Token We
Token ID775
Feature activation-2.50e+00

Pos logit contributions
 ending+0.080
 beginning+0.074
 size+0.070
sing+0.070
 camps+0.068
Neg logit contributions
ons-0.095
aid-0.084
tail-0.082
quet-0.079
 exclaimed-0.072
Token can
Token ID460
Feature activation-1.13e+00

Pos logit contributions
 ending+0.122
 beginning+0.110
sing+0.109
 size+0.104
ksh+0.103
Neg logit contributions
ons-0.208
aid-0.188
tail-0.179
quet-0.172
 exclaimed-0.164
Token use
Token ID779
Feature activation+1.34e+00

Pos logit contributions
 ending+0.057
 beginning+0.055
 crashing+0.052
 size+0.052
sing+0.052
Neg logit contributions
ons-0.091
aid-0.082
quet-0.076
tail-0.075
keeping-0.072
Token the
Token ID262
Feature activation+3.24e-01

Pos logit contributions
ons+0.096
aid+0.087
quet+0.081
tail+0.081
keeping+0.073
Neg logit contributions
 ending-0.078
 beginning-0.076
 size-0.071
sing-0.071
 crashing-0.071
Token whale
Token ID22206
Feature activation-1.18e+01

Pos logit contributions
ons+0.029
aid+0.023
 exclaimed+0.023
on+0.023
quet+0.023
Neg logit contributions
ksh-0.014
 ending-0.013
 camps-0.012
king-0.011
 crashing-0.011
Token to
Token ID284
Feature activation+1.73e-01

Pos logit contributions
 ending+0.581
 beginning+0.553
 mate+0.546
 size+0.544
sing+0.522
Neg logit contributions
ons-0.923
aid-0.848
quet-0.802
tail-0.782
 exclaimed-0.718
Token explore
Token ID7301
Feature activation+9.01e-01

Pos logit contributions
ons+0.015
aid+0.013
quet+0.012
tail+0.012
keeping+0.012
Neg logit contributions
 ending-0.008
 beginning-0.008
 crashing-0.007
sing-0.007
 end-0.007
Token the
Token ID262
Feature activation+3.03e-01

Pos logit contributions
ons+0.060
aid+0.052
tail+0.048
quet+0.048
 exclaimed+0.043
Neg logit contributions
 ending-0.060
 beginning-0.056
sing-0.054
 size-0.053
 crashing-0.052
Token oceans
Token ID23744
Feature activation-3.81e+00

Pos logit contributions
ons+0.021
handed+0.016
keeping+0.016
tail+0.016
quet+0.016
Neg logit contributions
sing-0.017
 ending-0.016
ksh-0.016
king-0.014
 beginning-0.014
Token."
Token ID526
Feature activation+3.55e+00

Pos logit contributions
 ending+0.232
 beginning+0.221
 size+0.214
sing+0.209
 crashing+0.207
Neg logit contributions
ons-0.266
aid-0.242
tail-0.218
quet-0.215
 stitches-0.200
Token."
Token ID526
Feature activation+3.54e+00

Pos logit contributions
ons+0.001
aid+0.001
tail+0.001
quet+0.001
 exclaimed+0.001
Neg logit contributions
 ending-0.001
 beginning-0.001
sing-0.001
 size-0.001
 crashing-0.001
Token
Token ID198
Feature activation-1.22e-02

Pos logit contributions
ons+0.280
aid+0.249
tail+0.248
quet+0.245
odic+0.216
Neg logit contributions
 ending-0.182
 beginning-0.159
 crashing-0.159
 end-0.154
 camps-0.152
Token
Token ID198
Feature activation-2.31e-02

Pos logit contributions
 ending+0.001
 end+0.001
 crashing+0.001
 camps+0.001
 beginning+0.000
Neg logit contributions
ons-0.001
tail-0.001
quet-0.001
aid-0.001
keeping-0.001
TokenThe
Token ID464
Feature activation+1.24e+00

Pos logit contributions
 ending+0.001
 end+0.001
 beginning+0.001
 crashing+0.001
 size+0.001
Neg logit contributions
ons-0.002
tail-0.002
aid-0.002
quet-0.002
keeping-0.002
Token purple
Token ID14032
Feature activation-6.30e+00

Pos logit contributions
ons+0.097
 exclaimed+0.076
aid+0.075
on+0.075
quet+0.070
Neg logit contributions
ksh-0.061
 crashing-0.054
king-0.053
rated-0.053
uding-0.052
Token whale
Token ID22206
Feature activation-1.18e+01

Pos logit contributions
ksh+0.263
 beginning+0.232
 ending+0.231
 crashing+0.230
 impossible+0.217
Neg logit contributions
ons-0.600
on-0.475
 exclaimed-0.474
quet-0.456
aid-0.450
Token saw
Token ID2497
Feature activation-4.55e+00

Pos logit contributions
 ending+0.792
 beginning+0.750
sing+0.721
 crashing+0.715
ksh+0.710
Neg logit contributions
ons-0.763
aid-0.658
tail-0.601
quet-0.595
 exclaimed-0.569
Token Tom
Token ID4186
Feature activation-3.33e+00

Pos logit contributions
 ending+0.254
sing+0.239
 beginning+0.234
ksh+0.230
 size+0.222
Neg logit contributions
ons-0.365
aid-0.321
quet-0.287
 exclaimed-0.277
 stitches-0.273
Token and
Token ID290
Feature activation-5.69e-01

Pos logit contributions
 ending+0.189
 beginning+0.174
sing+0.166
 size+0.164
ksh+0.163
Neg logit contributions
ons-0.253
aid-0.222
tail-0.212
quet-0.206
 exclaimed-0.192
Token Lily
Token ID20037
Feature activation+1.32e+00

Pos logit contributions
 ending+0.035
 beginning+0.033
sing+0.031
 size+0.031
 crashing+0.031
Neg logit contributions
ons-0.041
aid-0.036
tail-0.034
quet-0.034
keeping-0.030
Token and
Token ID290
Feature activation-5.80e-01

Pos logit contributions
ons+0.102
aid+0.091
tail+0.086
quet+0.085
 exclaimed+0.078
Neg logit contributions
 ending-0.073
 beginning-0.068
sing-0.065
 size-0.064
 crashing-0.063
Token too
Token ID1165
Feature activation+6.31e-01

Pos logit contributions
 ending+0.142
 beginning+0.135
 size+0.126
sing+0.123
 end+0.121
Neg logit contributions
ons-0.199
aid-0.180
tail-0.169
quet-0.168
keeping-0.159
Token?"
Token ID1701
Feature activation+1.17e+00

Pos logit contributions
ons+0.037
aid+0.031
quet+0.029
tail+0.029
 exclaimed+0.026
Neg logit contributions
 ending-0.046
 beginning-0.043
sing-0.042
ksh-0.042
 size-0.041
Token
Token ID198
Feature activation+7.63e-02

Pos logit contributions
quet+0.097
ons+0.096
aid+0.095
keeping+0.089
vern+0.085
Neg logit contributions
 ending-0.047
 beginning-0.042
 moon-0.042
 end-0.042
sing-0.041
Token
Token ID198
Feature activation+7.41e-02

Pos logit contributions
ons+0.006
quet+0.006
aid+0.006
tail+0.006
keeping+0.006
Neg logit contributions
 end-0.003
 ending-0.003
 them-0.003
 crashing-0.003
 beginning-0.002
TokenThe
Token ID464
Feature activation+1.36e+00

Pos logit contributions
ons+0.006
keeping+0.005
aid+0.005
tail+0.005
 stitches+0.005
Neg logit contributions
 end-0.004
 beginning-0.003
Can-0.003
 ending-0.003
 crashing-0.003
Token whale
Token ID22206
Feature activation-1.18e+01

Pos logit contributions
ons+0.116
 exclaimed+0.099
aid+0.097
quet+0.094
on+0.091
Neg logit contributions
ksh-0.058
sing-0.054
 crashing-0.052
king-0.050
 beginning-0.049
Token smiled
Token ID13541
Feature activation-2.98e-01

Pos logit contributions
 ending+0.798
 beginning+0.751
ksh+0.741
sing+0.738
 crashing+0.736
Neg logit contributions
ons-0.737
aid-0.641
tail-0.573
 exclaimed-0.571
quet-0.570
Token and
Token ID290
Feature activation-4.91e-01

Pos logit contributions
 ending+0.024
ators+0.024
works+0.024
sing+0.024
 whales+0.023
Neg logit contributions
aid-0.009
ons-0.008
 exclaimed-0.008
tail-0.007
 cried-0.007
Token said
Token ID531
Feature activation+2.69e+00

Pos logit contributions
 ending+0.035
 beginning+0.032
sing+0.031
 size+0.031
 crashing+0.030
Neg logit contributions
ons-0.031
aid-0.026
tail-0.025
quet-0.024
keeping-0.023
Token,
Token ID11
Feature activation-3.88e-01

Pos logit contributions
ons+0.172
aid+0.149
tail+0.139
quet+0.136
keeping+0.122
Neg logit contributions
 ending-0.184
 beginning-0.174
sing-0.167
 size-0.165
 crashing-0.163
Token "
Token ID366
Feature activation-4.00e+00

Pos logit contributions
 ending+0.021
 size+0.019
 beginning+0.019
 camps+0.018
sing+0.018
Neg logit contributions
ons-0.029
aid-0.026
quet-0.025
tail-0.024
 exclaimed-0.024
Token to
Token ID284
Feature activation+2.90e-01

Pos logit contributions
 ending+0.326
 beginning+0.306
sing+0.292
 size+0.288
 crashing+0.284
Neg logit contributions
ons-0.412
aid-0.366
tail-0.342
quet-0.340
 exclaimed-0.309
Token the
Token ID262
Feature activation+4.79e-01

Pos logit contributions
ons+0.019
aid+0.016
tail+0.015
quet+0.015
 stitches+0.014
Neg logit contributions
 ending-0.020
 beginning-0.019
sing-0.018
 size-0.018
 crashing-0.017
Token tank
Token ID6873
Feature activation-3.14e+00

Pos logit contributions
ons+0.036
 exclaimed+0.030
handed+0.028
aid+0.027
quet+0.027
Neg logit contributions
 ending-0.025
ksh-0.024
 beginning-0.023
sing-0.023
 crashing-0.023
Token and
Token ID290
Feature activation-5.64e-01

Pos logit contributions
 ending+0.174
 beginning+0.162
sing+0.154
 size+0.151
 crashing+0.150
Neg logit contributions
ons-0.243
aid-0.215
tail-0.203
quet-0.201
 exclaimed-0.185
Token the
Token ID262
Feature activation+4.29e-01

Pos logit contributions
 ending+0.030
 beginning+0.027
sing+0.026
 size+0.026
 crashing+0.025
Neg logit contributions
ons-0.045
aid-0.040
tail-0.038
quet-0.037
 exclaimed-0.035
Token whale
Token ID22206
Feature activation-1.18e+01

Pos logit contributions
ons+0.031
 exclaimed+0.026
aid+0.025
handed+0.024
 cand+0.024
Neg logit contributions
ksh-0.021
 ending-0.020
rated-0.020
OO-0.019
sing-0.019
Token sw
Token ID1509
Feature activation-7.73e+00

Pos logit contributions
 ending+0.702
 beginning+0.650
sing+0.631
ksh+0.621
 size+0.619
Neg logit contributions
ons-0.857
aid-0.734
tail-0.690
quet-0.685
 exclaimed-0.663
Tokenam
Token ID321
Feature activation+1.11e+00

Pos logit contributions
 ending+0.663
 beginning+0.611
 end+0.569
 crashing+0.568
 size+0.551
Neg logit contributions
ons-0.409
aid-0.316
tail-0.291
bes-0.282
 exclaimed-0.273
Token over
Token ID625
Feature activation-8.45e+00

Pos logit contributions
ons+0.062
 exclaimed+0.057
aid+0.055
keeping+0.053
 free+0.050
Neg logit contributions
 ending-0.078
sing-0.070
ksh-0.069
 size-0.066
ains-0.066
Token to
Token ID284
Feature activation+2.07e-01

Pos logit contributions
 ending+0.545
 beginning+0.507
sing+0.492
 size+0.477
ksh+0.472
Neg logit contributions
ons-0.585
aid-0.505
tail-0.479
quet-0.470
 stitches-0.430
Token them
Token ID606
Feature activation-8.49e+00

Pos logit contributions
ons+0.013
aid+0.011
tail+0.011
quet+0.010
 exclaimed+0.010
Neg logit contributions
 ending-0.014
 beginning-0.013
ksh-0.013
 size-0.013
sing-0.012
Token excited
Token ID6568
Feature activation-9.00e-01

Pos logit contributions
ons+0.133
aid+0.119
tail+0.109
quet+0.109
 exclaimed+0.101
Neg logit contributions
 ending-0.119
 beginning-0.112
sing-0.107
ksh-0.105
 size-0.105
Token to
Token ID284
Feature activation+2.34e-01

Pos logit contributions
 ending+0.062
ksh+0.060
sing+0.060
 beginning+0.059
 camps+0.059
Neg logit contributions
ons-0.055
aid-0.042
tail-0.041
quet-0.041
 exclaimed-0.039
Token hear
Token ID3285
Feature activation-2.11e+00

Pos logit contributions
ons+0.018
aid+0.016
tail+0.015
quet+0.015
 stitches+0.014
Neg logit contributions
 ending-0.013
 beginning-0.012
sing-0.012
 crashing-0.012
 size-0.011
Token about
Token ID546
Feature activation-1.98e+00

Pos logit contributions
 ending+0.134
 beginning+0.127
sing+0.120
 crashing+0.120
 size+0.119
Neg logit contributions
ons-0.144
aid-0.128
tail-0.119
quet-0.118
keeping-0.106
Token the
Token ID262
Feature activation+4.01e-01

Pos logit contributions
 ending+0.131
 beginning+0.123
sing+0.120
ksh+0.117
 size+0.117
Neg logit contributions
ons-0.132
aid-0.113
tail-0.107
quet-0.105
 stitches-0.095
Token whale
Token ID22206
Feature activation-1.18e+01

Pos logit contributions
ons+0.035
 exclaimed+0.028
on+0.026
odic+0.026
quet+0.026
Neg logit contributions
ksh-0.018
mo-0.015
 camps-0.015
 ending-0.014
 crashing-0.014
Token.
Token ID13
Feature activation-6.77e-01

Pos logit contributions
 ending+0.726
 beginning+0.675
sing+0.654
 size+0.644
ksh+0.638
Neg logit contributions
ons-0.832
aid-0.712
tail-0.681
quet-0.672
 exclaimed-0.616
Token
Token ID198
Feature activation-3.19e-02

Pos logit contributions
 ending+0.045
 beginning+0.043
sing+0.042
 size+0.040
 camps+0.040
Neg logit contributions
ons-0.044
aid-0.036
tail-0.035
quet-0.034
 exclaimed-0.033
Token
Token ID198
Feature activation-3.51e-02

Pos logit contributions
 ending+0.002
 beginning+0.002
 crashing+0.002
 camps+0.002
 end+0.002
Neg logit contributions
ons-0.002
aid-0.002
tail-0.002
quet-0.002
 stitches-0.002
Token"
Token ID1
Feature activation+1.02e+00

Pos logit contributions
 ending+0.003
 beginning+0.002
sing+0.002
 size+0.002
 crashing+0.002
Neg logit contributions
ons-0.002
aid-0.002
tail-0.002
quet-0.002
 exclaimed-0.001
TokenYou
Token ID1639
Feature activation-2.62e+00

Pos logit contributions
ons+0.070
aid+0.066
quet+0.057
odic+0.055
 exclaimed+0.055
Neg logit contributions
sing-0.076
 ending-0.068
 size-0.061
sed-0.057
 camps-0.057
Token had
Token ID550
Feature activation-3.67e+00

Pos logit contributions
 ending+0.087
 beginning+0.079
 crashing+0.075
ksh+0.074
sing+0.074
Neg logit contributions
ons-0.094
aid-0.083
tail-0.079
quet-0.078
bes-0.071
Token hoped
Token ID10719
Feature activation-3.16e+00

Pos logit contributions
 ending+0.198
 beginning+0.189
 size+0.177
 crashing+0.169
sing+0.167
Neg logit contributions
ons-0.285
aid-0.260
quet-0.250
tail-0.249
bes-0.224
Token for
Token ID329
Feature activation-5.43e+00

Pos logit contributions
 ending+0.191
 beginning+0.179
 size+0.165
 crashing+0.164
sing+0.160
Neg logit contributions
ons-0.223
aid-0.206
quet-0.194
tail-0.191
bes-0.178
Token a
Token ID257
Feature activation-1.26e+00

Pos logit contributions
 ending+0.199
 beginning+0.192
 size+0.172
 end+0.166
 crashing+0.162
Neg logit contributions
ons-0.505
aid-0.465
quet-0.444
tail-0.442
powers-0.426
Token happy
Token ID3772
Feature activation+7.14e-01

Pos logit contributions
 ending+0.062
 beginning+0.061
ksh+0.060
sing+0.058
rant+0.057
Neg logit contributions
ons-0.096
aid-0.088
tail-0.082
quet-0.081
 stitches-0.075
Token ending
Token ID7464
Feature activation-1.17e+01

Pos logit contributions
ons+0.054
aid+0.049
tail+0.047
quet+0.046
 stitches+0.043
Neg logit contributions
ksh-0.035
 beginning-0.034
 ending-0.034
rant-0.033
sing-0.033
Token,
Token ID11
Feature activation-5.33e-01

Pos logit contributions
 ending+0.608
 beginning+0.553
 size+0.522
sing+0.507
 crashing+0.501
Neg logit contributions
ons-0.962
aid-0.868
tail-0.823
quet-0.807
 exclaimed-0.746
Token but
Token ID475
Feature activation-4.49e+00

Pos logit contributions
 ending+0.026
 beginning+0.024
sing+0.023
 size+0.023
ksh+0.022
Neg logit contributions
ons-0.044
aid-0.039
tail-0.037
quet-0.037
 stitches-0.034
Token it
Token ID340
Feature activation+7.38e-02

Pos logit contributions
 ending+0.263
 beginning+0.241
 size+0.224
sing+0.221
 crashing+0.215
Neg logit contributions
ons-0.357
aid-0.313
quet-0.294
tail-0.289
bes-0.274
Token didn
Token ID1422
Feature activation+1.82e-01

Pos logit contributions
ons+0.006
aid+0.005
tail+0.005
quet+0.005
 exclaimed+0.005
Neg logit contributions
 ending-0.004
 beginning-0.004
sing-0.003
 size-0.003
 crashing-0.003
Token't
Token ID470
Feature activation+1.36e+00

Pos logit contributions
ons+0.011
aid+0.009
tail+0.008
quet+0.008
keeping+0.008
Neg logit contributions
 ending-0.013
 beginning-0.013
 size-0.012
sing-0.012
 crashing-0.012
Token and
Token ID290
Feature activation-4.98e-01

Pos logit contributions
 ending+0.020
sing+0.019
rant+0.019
ksh+0.019
rated+0.018
Neg logit contributions
ons-0.010
aid-0.009
quet-0.009
tail-0.009
 exclaimed-0.009
Token laughed
Token ID13818
Feature activation-1.47e+00

Pos logit contributions
 ending+0.024
 beginning+0.022
 crashing+0.021
 size+0.021
sing+0.021
Neg logit contributions
ons-0.043
aid-0.038
quet-0.036
tail-0.036
keeping-0.033
Token too
Token ID1165
Feature activation+5.59e-01

Pos logit contributions
 ending+0.114
sing+0.108
rant+0.105
rated+0.104
ksh+0.103
Neg logit contributions
ons-0.066
quet-0.059
aid-0.058
tail-0.055
 exclaimed-0.054
Token.
Token ID13
Feature activation-7.27e-01

Pos logit contributions
ons+0.037
aid+0.031
tail+0.030
quet+0.030
 exclaimed+0.027
Neg logit contributions
 ending-0.036
 beginning-0.034
sing-0.033
ksh-0.032
 size-0.032
Token The
Token ID383
Feature activation+7.73e-01

Pos logit contributions
 ending+0.044
 beginning+0.042
sing+0.040
 size+0.039
 crashing+0.039
Neg logit contributions
ons-0.052
aid-0.046
tail-0.043
quet-0.043
 exclaimed-0.039
Token whale
Token ID22206
Feature activation-1.17e+01

Pos logit contributions
ons+0.052
aid+0.042
 tree+0.039
 exclaimed+0.039
on+0.039
Neg logit contributions
ksh-0.046
sing-0.040
idget-0.039
OO-0.039
rant-0.038
Token was
Token ID373
Feature activation-5.20e-01

Pos logit contributions
 ending+0.674
 beginning+0.625
sing+0.616
ksh+0.597
 crashing+0.588
Neg logit contributions
ons-0.874
aid-0.764
tail-0.709
quet-0.702
 exclaimed-0.685
Token very
Token ID845
Feature activation+1.89e+00

Pos logit contributions
 ending+0.037
 beginning+0.034
sing+0.034
ksh+0.034
 size+0.033
Neg logit contributions
ons-0.031
aid-0.027
tail-0.024
quet-0.024
 exclaimed-0.022
Token happy
Token ID3772
Feature activation+1.05e+00

Pos logit contributions
ons+0.122
aid+0.105
tail+0.098
quet+0.097
 exclaimed+0.087
Neg logit contributions
 ending-0.130
 beginning-0.123
sing-0.118
 size-0.117
 crashing-0.115
Token.
Token ID13
Feature activation-7.58e-01

Pos logit contributions
ons+0.068
aid+0.056
quet+0.053
tail+0.053
 exclaimed+0.050
Neg logit contributions
 ending-0.069
 beginning-0.065
ksh-0.065
sing-0.064
rant-0.062
Token
Token ID220
Feature activation+2.55e+00

Pos logit contributions
 ending+0.047
 beginning+0.044
sing+0.042
 size+0.041
 crashing+0.041
Neg logit contributions
ons-0.054
aid-0.048
tail-0.045
quet-0.044
 exclaimed-0.040
Token
Token ID198
Feature activation-6.78e-01

Pos logit contributions
 ending+0.043
 beginning+0.040
 camps+0.040
 crashing+0.040
 end+0.039
Neg logit contributions
ons-0.066
aid-0.060
tail-0.058
quet-0.058
 stitches-0.053
TokenOne
Token ID3198
Feature activation-6.81e+00

Pos logit contributions
 ending+0.034
 beginning+0.034
 crashing+0.033
 end+0.032
 camps+0.031
Neg logit contributions
ons-0.052
aid-0.045
tail-0.042
 stitches-0.041
quet-0.041
Token day
Token ID1110
Feature activation-1.81e+00

Pos logit contributions
 ending+0.281
 beginning+0.252
sing+0.230
 size+0.228
 crashing+0.221
Neg logit contributions
ons-0.636
aid-0.576
tail-0.549
quet-0.546
keeping-0.512
Token,
Token ID11
Feature activation-6.32e-01

Pos logit contributions
 ending+0.100
 beginning+0.091
 crashing+0.084
 mate+0.084
 size+0.083
Neg logit contributions
ons-0.136
aid-0.129
quet-0.119
tail-0.116
keeping-0.111
Token the
Token ID262
Feature activation+2.28e-01

Pos logit contributions
 ending+0.037
 beginning+0.034
 size+0.032
sing+0.032
 camps+0.032
Neg logit contributions
ons-0.047
aid-0.043
quet-0.039
tail-0.038
bes-0.035
Token whale
Token ID22206
Feature activation-1.17e+01

Pos logit contributions
ons+0.018
aid+0.016
tail+0.015
quet+0.015
 stitches+0.014
Neg logit contributions
 ending-0.012
 beginning-0.011
ksh-0.011
sing-0.011
 crashing-0.010
Token saw
Token ID2497
Feature activation-4.60e+00

Pos logit contributions
 ending+0.650
 beginning+0.616
sing+0.610
ksh+0.596
 crashing+0.582
Neg logit contributions
ons-0.895
aid-0.757
tail-0.720
 exclaimed-0.702
quet-0.693
Token a
Token ID257
Feature activation-6.71e-02

Pos logit contributions
 ending+0.246
 beginning+0.231
sing+0.214
 size+0.214
 crashing+0.211
Neg logit contributions
ons-0.355
aid-0.322
tail-0.311
quet-0.307
bes-0.278
Token big
Token ID1263
Feature activation-2.14e+00

Pos logit contributions
 ending+0.004
ksh+0.004
 beginning+0.004
rant+0.004
sing+0.003
Neg logit contributions
ons-0.005
aid-0.004
quet-0.004
tail-0.004
 stitches-0.004
Token,
Token ID11
Feature activation+6.75e-02

Pos logit contributions
 ending+0.123
 beginning+0.117
ksh+0.117
sing+0.110
rant+0.107
Neg logit contributions
ons-0.154
aid-0.131
tail-0.130
quet-0.127
 stitches-0.116
Token white
Token ID2330
Feature activation-4.24e+00

Pos logit contributions
ons+0.005
aid+0.004
tail+0.004
quet+0.004
 stitches+0.004
Neg logit contributions
 ending-0.004
 beginning-0.004
 size-0.003
ksh-0.003
 camps-0.003
Token are
Token ID389
Feature activation-5.56e-01

Pos logit contributions
 ending+0.043
 beginning+0.041
 end+0.040
 size+0.038
 crashing+0.038
Neg logit contributions
ons-0.069
tail-0.063
aid-0.063
quet-0.063
bes-0.057
Token cool
Token ID3608
Feature activation-2.72e+00

Pos logit contributions
 ending+0.033
 beginning+0.031
sing+0.030
ksh+0.029
 camps+0.029
Neg logit contributions
ons-0.040
aid-0.036
tail-0.033
quet-0.032
 stitches-0.030
Token.
Token ID13
Feature activation-8.43e-01

Pos logit contributions
 ending+0.158
 beginning+0.149
 size+0.147
sing+0.141
 crashing+0.139
Neg logit contributions
ons-0.198
aid-0.175
quet-0.167
tail-0.165
bes-0.149
Token But
Token ID887
Feature activation-2.10e+00

Pos logit contributions
 ending+0.053
 beginning+0.050
 size+0.048
sing+0.047
 camps+0.047
Neg logit contributions
ons-0.057
aid-0.051
tail-0.049
quet-0.046
 stitches-0.043
Token this
Token ID428
Feature activation+6.61e-01

Pos logit contributions
ksh+0.176
 ending+0.176
 camps+0.168
sing+0.168
 beginning+0.167
Neg logit contributions
ons-0.095
aid-0.076
tail-0.063
 stitches-0.062
quet-0.061
Token whale
Token ID22206
Feature activation-1.17e+01

Pos logit contributions
ons+0.046
aid+0.035
 exclaimed+0.032
grand+0.031
quet+0.030
Neg logit contributions
ksh-0.043
sing-0.042
 crashing-0.041
 beginning-0.041
king-0.040
Token is
Token ID318
Feature activation-1.38e+00

Pos logit contributions
 ending+0.520
 beginning+0.466
 size+0.461
 end+0.439
 whale+0.435
Neg logit contributions
ons-0.975
aid-0.920
quet-0.909
tail-0.874
keeping-0.850
Token not
Token ID407
Feature activation-2.75e+00

Pos logit contributions
 ending+0.088
ksh+0.080
 camps+0.079
 beginning+0.078
sing+0.076
Neg logit contributions
ons-0.088
aid-0.080
tail-0.069
 stitches-0.069
 exclaimed-0.066
Token for
Token ID329
Feature activation-4.31e+00

Pos logit contributions
 ending+0.153
 beginning+0.142
sing+0.135
 size+0.132
 crashing+0.131
Neg logit contributions
ons-0.210
aid-0.187
tail-0.175
quet-0.173
 exclaimed-0.160
Token playing
Token ID2712
Feature activation+8.43e-01

Pos logit contributions
 ending+0.237
 beginning+0.222
sing+0.211
 size+0.207
 crashing+0.205
Neg logit contributions
ons-0.333
aid-0.295
tail-0.279
quet-0.275
 exclaimed-0.253
Token.
Token ID13
Feature activation-9.13e-01

Pos logit contributions
ons+0.058
aid+0.051
tail+0.048
quet+0.047
 exclaimed+0.043
Neg logit contributions
 ending-0.053
 beginning-0.050
sing-0.048
 size-0.047
 crashing-0.047
Token,
Token ID11
Feature activation-3.35e-01

Pos logit contributions
 ending+0.099
 beginning+0.091
sing+0.085
 size+0.085
 crashing+0.085
Neg logit contributions
ons-0.109
aid-0.100
quet-0.094
tail-0.092
keeping-0.088
Token she
Token ID673
Feature activation+3.71e+00

Pos logit contributions
 ending+0.027
 beginning+0.026
sing+0.025
 size+0.025
 crashing+0.025
Neg logit contributions
ons-0.017
aid-0.014
quet-0.013
tail-0.013
keeping-0.011
Token saw
Token ID2497
Feature activation-4.65e+00

Pos logit contributions
ons+0.273
aid+0.219
 exclaimed+0.213
tail+0.207
 stitches+0.196
Neg logit contributions
 ending-0.221
 beginning-0.216
sing-0.215
ksh-0.209
 crashing-0.207
Token a
Token ID257
Feature activation-2.22e-01

Pos logit contributions
 ending+0.286
 beginning+0.268
sing+0.259
 size+0.254
 crashing+0.251
Neg logit contributions
ons-0.336
aid-0.295
tail-0.272
quet-0.268
 exclaimed-0.248
Token big
Token ID1263
Feature activation-2.42e+00

Pos logit contributions
ksh+0.014
 ending+0.013
 beginning+0.013
rant+0.013
sing+0.013
Neg logit contributions
ons-0.015
aid-0.013
quet-0.012
tail-0.011
 exclaimed-0.011
Token whale
Token ID22206
Feature activation-1.17e+01

Pos logit contributions
ksh+0.137
 ending+0.136
 beginning+0.131
sing+0.126
rated+0.122
Neg logit contributions
ons-0.176
aid-0.144
tail-0.143
quet-0.141
 stitches-0.129
Token stuck
Token ID7819
Feature activation+8.09e-01

Pos logit contributions
 ending+0.748
ksh+0.683
sing+0.667
 beginning+0.667
 size+0.649
Neg logit contributions
ons-0.850
aid-0.687
tail-0.678
quet-0.670
 exclaimed-0.612
Token in
Token ID287
Feature activation-1.79e+00

Pos logit contributions
ons+0.051
aid+0.041
tail+0.041
quet+0.038
 stitches+0.037
Neg logit contributions
 ending-0.057
ksh-0.051
 camps-0.049
 whales-0.049
 whale-0.049
Token the
Token ID262
Feature activation+4.32e-01

Pos logit contributions
 ending+0.087
 beginning+0.087
 size+0.087
 end+0.081
 crashing+0.080
Neg logit contributions
ons-0.143
aid-0.136
tail-0.123
quet-0.122
powers-0.117
Token sand
Token ID6450
Feature activation-3.05e+00

Pos logit contributions
ons+0.029
aid+0.024
handed+0.022
tail+0.022
quet+0.022
Neg logit contributions
 ending-0.026
sing-0.025
ksh-0.024
king-0.024
rated-0.023
Token.
Token ID13
Feature activation-8.01e-01

Pos logit contributions
 ending+0.197
 beginning+0.182
sing+0.179
ksh+0.174
 crashing+0.169
Neg logit contributions
ons-0.212
aid-0.176
quet-0.168
tail-0.168
keeping-0.155
Token around
Token ID1088
Feature activation-3.24e+00

Pos logit contributions
 ending+0.137
 beginning+0.124
sing+0.122
 size+0.118
ksh+0.114
Neg logit contributions
ons-0.155
aid-0.140
tail-0.138
quet-0.126
keeping-0.124
Token the
Token ID262
Feature activation+4.60e-01

Pos logit contributions
 ending+0.230
ksh+0.208
 beginning+0.208
sing+0.206
rated+0.199
Neg logit contributions
ons-0.202
tail-0.160
aid-0.156
quet-0.151
 stitches-0.144
Token boat
Token ID8848
Feature activation-4.44e+00

Pos logit contributions
ons+0.035
 exclaimed+0.029
handed+0.028
on+0.026
keeping+0.026
Neg logit contributions
 ending-0.025
sing-0.022
ksh-0.022
 beginning-0.021
rated-0.020
Token collecting
Token ID13157
Feature activation+2.08e-01

Pos logit contributions
 ending+0.276
 beginning+0.258
sing+0.247
 size+0.243
 crashing+0.240
Neg logit contributions
ons-0.311
aid-0.271
tail-0.257
quet-0.252
 exclaimed-0.231
Token the
Token ID262
Feature activation+4.24e-01

Pos logit contributions
ons+0.018
aid+0.016
tail+0.015
quet+0.015
 exclaimed+0.014
Neg logit contributions
 ending-0.010
 beginning-0.009
 size-0.008
sing-0.008
 crashing-0.008
Token whale
Token ID22206
Feature activation-1.17e+01

Pos logit contributions
ons+0.035
aid+0.028
 exclaimed+0.028
 stitches+0.026
quet+0.026
Neg logit contributions
 ending-0.022
sing-0.019
ksh-0.019
 crashing-0.018
 beginning-0.018
Token.
Token ID13
Feature activation-6.89e-01

Pos logit contributions
 ending+0.630
 beginning+0.604
 size+0.574
sing+0.566
 crashing+0.561
Neg logit contributions
ons-0.893
aid-0.825
tail-0.758
quet-0.753
 stitches-0.679
Token The
Token ID383
Feature activation+6.58e-01

Pos logit contributions
 ending+0.046
 beginning+0.043
 size+0.042
sing+0.041
 camps+0.041
Neg logit contributions
ons-0.043
aid-0.036
tail-0.036
 exclaimed-0.034
quet-0.034
Token parents
Token ID3397
Feature activation-1.39e+00

Pos logit contributions
ons+0.052
 exclaimed+0.045
aid+0.043
on+0.041
chair+0.040
Neg logit contributions
ksh-0.028
rant-0.025
 ending-0.025
 crashing-0.023
king-0.023
Token were
Token ID547
Feature activation-4.21e-01

Pos logit contributions
 ending+0.086
 beginning+0.078
sing+0.078
ksh+0.076
 size+0.074
Neg logit contributions
ons-0.103
aid-0.089
quet-0.080
tail-0.080
 exclaimed-0.077
Token worried
Token ID7960
Feature activation-2.41e+00

Pos logit contributions
 ending+0.028
 beginning+0.026
ksh+0.025
sing+0.025
 size+0.025
Neg logit contributions
ons-0.027
aid-0.024
tail-0.022
quet-0.022
 exclaimed-0.021
Token,
Token ID11
Feature activation-9.24e-01

Pos logit contributions
 ending+0.392
 beginning+0.370
sing+0.359
 size+0.352
ksh+0.350
Neg logit contributions
ons-0.372
aid-0.317
tail-0.296
quet-0.292
 exclaimed-0.264
Token there
Token ID612
Feature activation-1.48e+00

Pos logit contributions
 ending+0.069
ksh+0.066
sing+0.065
 beginning+0.062
 size+0.062
Neg logit contributions
ons-0.051
aid-0.040
tail-0.038
quet-0.036
 exclaimed-0.035
Token was
Token ID373
Feature activation-7.17e-01

Pos logit contributions
 ending+0.090
sing+0.086
 beginning+0.086
ksh+0.085
 crashing+0.083
Neg logit contributions
ons-0.106
aid-0.087
tail-0.084
quet-0.081
 exclaimed-0.079
Token a
Token ID257
Feature activation-3.69e-01

Pos logit contributions
 ending+0.044
 beginning+0.041
sing+0.040
 size+0.039
 crashing+0.039
Neg logit contributions
ons-0.051
aid-0.044
tail-0.042
quet-0.040
 exclaimed-0.038
Token big
Token ID1263
Feature activation-2.39e+00

Pos logit contributions
 ending+0.021
 beginning+0.020
ksh+0.020
rant+0.020
 crashing+0.019
Neg logit contributions
ons-0.025
aid-0.022
tail-0.022
quet-0.021
 stitches-0.019
Token whale
Token ID22206
Feature activation-1.17e+01

Pos logit contributions
 ending+0.143
 beginning+0.139
ksh+0.139
sing+0.131
 crashing+0.131
Neg logit contributions
ons-0.164
aid-0.137
tail-0.137
quet-0.130
chair-0.120
Token.
Token ID13
Feature activation-6.19e-01

Pos logit contributions
 ending+0.683
 beginning+0.645
sing+0.611
 size+0.605
 crashing+0.601
Neg logit contributions
ons-0.858
aid-0.769
tail-0.717
quet-0.710
 stitches-0.649
Token The
Token ID383
Feature activation+1.06e+00

Pos logit contributions
 ending+0.040
 beginning+0.038
sing+0.036
 size+0.036
 crashing+0.035
Neg logit contributions
ons-0.042
aid-0.037
tail-0.034
quet-0.034
 exclaimed-0.031
Token whale
Token ID22206
Feature activation-1.15e+01

Pos logit contributions
ons+0.083
 exclaimed+0.069
aid+0.068
on+0.067
 tree+0.065
Neg logit contributions
ksh-0.049
king-0.041
 ending-0.040
 camps-0.038
sing-0.038
Token was
Token ID373
Feature activation-3.48e-02

Pos logit contributions
 ending+0.623
 beginning+0.581
sing+0.554
 size+0.543
 crashing+0.537
Neg logit contributions
ons-0.903
aid-0.799
tail-0.753
quet-0.744
 exclaimed-0.692
Token very
Token ID845
Feature activation+2.52e+00

Pos logit contributions
 ending+0.002
ksh+0.002
 beginning+0.002
 size+0.002
 camps+0.002
Neg logit contributions
ons-0.002
aid-0.002
tail-0.002
 exclaimed-0.002
quet-0.002
Token sad
Token ID6507
Feature activation+3.95e-01

Pos logit contributions
 ending+0.023
 beginning+0.022
sing+0.021
 crashing+0.020
 size+0.020
Neg logit contributions
ons-0.033
aid-0.029
quet-0.028
tail-0.028
keeping-0.025
Token to
Token ID284
Feature activation+1.91e-01

Pos logit contributions
ons+0.023
aid+0.020
tail+0.019
quet+0.018
 stitches+0.018
Neg logit contributions
 ending-0.026
 beginning-0.025
ksh-0.025
sing-0.025
rant-0.024
Token have
Token ID423
Feature activation+3.19e+00

Pos logit contributions
ons+0.015
aid+0.014
tail+0.013
quet+0.013
 exclaimed+0.012
Neg logit contributions
 ending-0.010
 beginning-0.009
sing-0.009
 size-0.009
 crashing-0.009
Token a
Token ID257
Feature activation-3.45e-01

Pos logit contributions
ons+0.257
aid+0.232
quet+0.222
tail+0.222
bes+0.201
Neg logit contributions
 ending-0.169
 beginning-0.157
 size-0.147
sing-0.141
 crashing-0.140
Token bad
Token ID2089
Feature activation-1.19e+00

Pos logit contributions
 ending+0.020
 beginning+0.020
ksh+0.020
 size+0.019
 crashing+0.019
Neg logit contributions
ons-0.023
aid-0.022
tail-0.019
quet-0.018
 stitches-0.018
Token ending
Token ID7464
Feature activation-1.17e+01

Pos logit contributions
ksh+0.043
ains+0.039
rant+0.036
 beginning+0.034
 crashing+0.033
Neg logit contributions
ons-0.108
aid-0.106
tail-0.098
 stitches-0.097
powers-0.095
Token to
Token ID284
Feature activation+1.72e-01

Pos logit contributions
 ending+0.669
 beginning+0.624
sing+0.588
 size+0.587
 crashing+0.579
Neg logit contributions
ons-0.883
aid-0.791
tail-0.744
quet-0.730
 exclaimed-0.665
Token their
Token ID511
Feature activation-5.62e+00

Pos logit contributions
ons+0.012
tail+0.010
aid+0.010
quet+0.010
keeping+0.009
Neg logit contributions
 ending-0.010
ksh-0.010
 beginning-0.009
sing-0.009
 camps-0.009
Token day
Token ID1110
Feature activation-1.76e+00

Pos logit contributions
 ending+0.281
 beginning+0.280
ksh+0.276
 crashing+0.270
rant+0.269
Neg logit contributions
ons-0.413
aid-0.366
tail-0.357
quet-0.341
 stitches-0.338
Token.
Token ID13
Feature activation-1.06e+00

Pos logit contributions
 ending+0.094
 beginning+0.088
 size+0.082
 crashing+0.080
sing+0.078
Neg logit contributions
ons-0.139
aid-0.134
tail-0.119
quet-0.119
bes-0.109
Token The
Token ID383
Feature activation+6.05e-01

Pos logit contributions
 ending+0.080
 beginning+0.077
 size+0.075
sing+0.075
 crashing+0.073
Neg logit contributions
ons-0.057
aid-0.047
tail-0.043
quet-0.042
 exclaimed-0.041
Token whale
Token ID22206
Feature activation-1.16e+01

Pos logit contributions
 ending+0.014
rated+0.013
ksh+0.013
 size+0.012
 beginning+0.012
Neg logit contributions
ons-0.029
aid-0.025
on-0.025
 exclaimed-0.024
quet-0.024
Token!"
Token ID2474
Feature activation+2.15e+00

Pos logit contributions
 ending+0.865
 beginning+0.822
sing+0.793
 size+0.789
 crashing+0.781
Neg logit contributions
ons-0.669
aid-0.579
tail-0.527
quet-0.519
 stitches-0.460
Token
Token ID198
Feature activation+3.41e-03

Pos logit contributions
ons+0.149
aid+0.129
quet+0.122
tail+0.122
keeping+0.108
Neg logit contributions
 ending-0.137
 beginning-0.129
sing-0.125
 crashing-0.122
 size-0.121
Token
Token ID198
Feature activation-1.06e-02

Pos logit contributions
ons+0.000
aid+0.000
tail+0.000
quet+0.000
keeping+0.000
Neg logit contributions
 ending-0.000
 crashing-0.000
 beginning-0.000
 end-0.000
 whale-0.000
TokenThe
Token ID464
Feature activation+1.26e+00

Pos logit contributions
 ending+0.001
 beginning+0.001
 crashing+0.001
 size+0.001
sing+0.001
Neg logit contributions
ons-0.001
aid-0.001
tail-0.001
quet-0.001
 stitches-0.000
Token whale
Token ID22206
Feature activation-1.17e+01

Pos logit contributions
ons+0.102
 exclaimed+0.089
on+0.081
aid+0.081
quet+0.079
Neg logit contributions
ksh-0.059
 ending-0.051
 camps-0.049
sing-0.049
rated-0.048
Token sees
Token ID7224
Feature activation-5.75e+00

Pos logit contributions
 ending+0.741
 beginning+0.691
ksh+0.686
sing+0.664
 crashing+0.646
Neg logit contributions
ons-0.807
aid-0.666
tail-0.629
 exclaimed-0.621
quet-0.598
Token them
Token ID606
Feature activation-8.40e+00

Pos logit contributions
 ending+0.295
sing+0.279
ksh+0.268
rated+0.262
 beginning+0.262
Neg logit contributions
ons-0.502
aid-0.427
 exclaimed-0.395
quet-0.384
 stitches-0.377
Token and
Token ID290
Feature activation-5.69e-01

Pos logit contributions
 ending+0.566
ksh+0.521
 camps+0.500
 beginning+0.490
sing+0.489
Neg logit contributions
ons-0.531
aid-0.437
tail-0.433
 exclaimed-0.419
quet-0.417
Token makes
Token ID1838
Feature activation-3.39e+00

Pos logit contributions
 ending+0.034
 beginning+0.032
ksh+0.031
sing+0.030
 camps+0.029
Neg logit contributions
ons-0.038
aid-0.033
 exclaimed-0.033
tail-0.033
 stitches-0.030
Token a
Token ID257
Feature activation-2.13e-01

Pos logit contributions
 ending+0.205
 beginning+0.190
sing+0.183
 size+0.180
 camps+0.178
Neg logit contributions
ons-0.245
aid-0.213
tail-0.196
quet-0.193
 exclaimed-0.191
Token and
Token ID290
Feature activation-3.92e-01

Pos logit contributions
 ending+0.082
sing+0.079
 beginning+0.077
ksh+0.075
 camps+0.074
Neg logit contributions
ons-0.063
aid-0.056
tail-0.052
keeping-0.048
 exclaimed-0.047
Token too
Token ID1165
Feature activation+6.63e-01

Pos logit contributions
 ending+0.025
 beginning+0.024
sing+0.023
 size+0.022
 crashing+0.022
Neg logit contributions
ons-0.027
aid-0.024
tail-0.022
quet-0.022
 exclaimed-0.020
Token deep
Token ID2769
Feature activation-2.07e+00

Pos logit contributions
ons+0.050
aid+0.044
tail+0.042
quet+0.041
 exclaimed+0.038
Neg logit contributions
 ending-0.038
 beginning-0.035
sing-0.034
 size-0.033
 crashing-0.032
Token.
Token ID13
Feature activation-5.84e-01

Pos logit contributions
 ending+0.130
 beginning+0.119
sing+0.117
 size+0.111
ksh+0.111
Neg logit contributions
ons-0.141
aid-0.120
tail-0.118
quet-0.111
keeping-0.109
Token The
Token ID383
Feature activation+8.15e-01

Pos logit contributions
 ending+0.034
 beginning+0.032
sing+0.030
 size+0.030
 crashing+0.030
Neg logit contributions
ons-0.043
aid-0.038
tail-0.036
quet-0.035
 exclaimed-0.032
Token whale
Token ID22206
Feature activation-1.17e+01

Pos logit contributions
ons+0.048
 stitches+0.038
aid+0.037
 exclaimed+0.037
 tree+0.035
Neg logit contributions
ksh-0.056
king-0.047
mo-0.046
 camps-0.046
rant-0.045
Token lives
Token ID3160
Feature activation-1.34e+00

Pos logit contributions
 ending+0.578
 beginning+0.539
sing+0.506
 size+0.505
 crashing+0.490
Neg logit contributions
ons-0.953
aid-0.872
quet-0.827
tail-0.822
keeping-0.760
Token in
Token ID287
Feature activation-1.69e+00

Pos logit contributions
 ending+0.070
 beginning+0.065
sing+0.062
 size+0.060
 crashing+0.060
Neg logit contributions
ons-0.107
aid-0.096
tail-0.091
quet-0.090
 exclaimed-0.083
Token the
Token ID262
Feature activation+4.47e-01

Pos logit contributions
 ending+0.085
 beginning+0.084
 size+0.082
 end+0.079
 whale+0.076
Neg logit contributions
ons-0.133
aid-0.127
quet-0.116
tail-0.115
 exclaimed-0.107
Token ocean
Token ID9151
Feature activation-6.97e+00

Pos logit contributions
ons+0.027
aid+0.022
 stitches+0.020
 exclaimed+0.020
tail+0.020
Neg logit contributions
ksh-0.028
king-0.027
sing-0.027
 ending-0.026
rated-0.024
Token,
Token ID11
Feature activation-3.48e-01

Pos logit contributions
 ending+0.440
 beginning+0.421
 size+0.403
sing+0.396
 crashing+0.387
Neg logit contributions
ons-0.478
aid-0.431
quet-0.395
tail-0.392
 stitches-0.350
Token He
Token ID679
Feature activation-5.61e-01

Pos logit contributions
 ending+0.025
 beginning+0.024
sing+0.023
 size+0.023
 crashing+0.023
Neg logit contributions
ons-0.031
aid-0.027
tail-0.025
quet-0.025
 exclaimed-0.023
Token said
Token ID531
Feature activation+2.88e+00

Pos logit contributions
 ending+0.031
 beginning+0.029
sing+0.027
 size+0.027
 camps+0.027
Neg logit contributions
ons-0.043
aid-0.039
quet-0.037
tail-0.037
keeping-0.033
Token goodbye
Token ID24829
Feature activation+3.77e-01

Pos logit contributions
ons+0.212
aid+0.186
tail+0.182
quet+0.177
keeping+0.166
Neg logit contributions
 ending-0.176
 beginning-0.161
 crashing-0.151
sing-0.147
 camps-0.146
Token to
Token ID284
Feature activation+3.20e-01

Pos logit contributions
ons+0.031
aid+0.027
quet+0.026
tail+0.026
 stitches+0.024
Neg logit contributions
 ending-0.020
 beginning-0.018
sing-0.018
 size-0.017
 crashing-0.016
Token the
Token ID262
Feature activation+5.33e-01

Pos logit contributions
ons+0.022
aid+0.019
tail+0.018
quet+0.017
 stitches+0.016
Neg logit contributions
 ending-0.021
 beginning-0.019
sing-0.019
 size-0.018
ksh-0.018
Token whale
Token ID22206
Feature activation-1.17e+01

Pos logit contributions
ons+0.042
 exclaimed+0.036
handed+0.034
grand+0.033
on+0.032
Neg logit contributions
ksh-0.028
sing-0.022
iful-0.022
king-0.022
rant-0.021
Token and
Token ID290
Feature activation-4.12e-01

Pos logit contributions
 ending+0.547
 beginning+0.505
sing+0.473
 size+0.463
 camps+0.456
Neg logit contributions
ons-1.003
aid-0.923
tail-0.864
quet-0.862
keeping-0.802
Token went
Token ID1816
Feature activation-1.04e+00

Pos logit contributions
 ending+0.020
 beginning+0.017
 captain+0.016
 camps+0.016
 whale+0.016
Neg logit contributions
ons-0.036
aid-0.032
quet-0.032
tail-0.031
powers-0.029
Token back
Token ID736
Feature activation-4.70e+00

Pos logit contributions
 ending+0.038
 beginning+0.036
 crashing+0.035
 camps+0.035
 mate+0.032
Neg logit contributions
ons-0.098
aid-0.088
quet-0.085
tail-0.084
bes-0.079
Token to
Token ID284
Feature activation+2.67e-01

Pos logit contributions
 camps+0.182
 ending+0.182
 beginning+0.173
sing+0.165
 crashing+0.164
Neg logit contributions
ons-0.422
aid-0.390
quet-0.383
tail-0.359
bes-0.341
Token his
Token ID465
Feature activation-1.01e+00

Pos logit contributions
ons+0.022
aid+0.021
quet+0.020
tail+0.019
powers+0.018
Neg logit contributions
 ending-0.014
 beginning-0.013
 crashing-0.012
sing-0.012
 camps-0.012
Token family
Token ID1641
Feature activation+4.10e-01

Pos logit contributions
ons+0.176
aid+0.153
tail+0.144
quet+0.144
 exclaimed+0.137
Neg logit contributions
 ending-0.138
 beginning-0.127
ksh-0.124
sing-0.124
 camps-0.121
Token broke
Token ID6265
Feature activation+2.02e+00

Pos logit contributions
ons+0.031
aid+0.028
tail+0.027
quet+0.026
keeping+0.024
Neg logit contributions
 ending-0.023
 beginning-0.021
 size-0.020
sing-0.020
 crashing-0.020
Token open
Token ID1280
Feature activation-1.28e-01

Pos logit contributions
ons+0.135
aid+0.110
keeping+0.104
 stitches+0.103
quet+0.103
Neg logit contributions
 ending-0.137
 beginning-0.120
sing-0.120
ksh-0.113
 end-0.112
Token their
Token ID511
Feature activation-4.75e+00

Pos logit contributions
 ending+0.008
 beginning+0.007
sing+0.007
 crashing+0.007
ksh+0.007
Neg logit contributions
ons-0.009
aid-0.008
tail-0.007
quet-0.007
 stitches-0.007
Token ac
Token ID936
Feature activation+5.52e+00

Pos logit contributions
 ending+0.260
 beginning+0.243
sing+0.230
 crashing+0.226
 size+0.225
Neg logit contributions
ons-0.368
aid-0.324
tail-0.308
quet-0.303
 stitches-0.281
Tokenorns
Token ID19942
Feature activation+1.04e+01

Pos logit contributions
 exclaimed+0.260
aid+0.239
tail+0.235
ons+0.223
ird+0.220
Neg logit contributions
et-0.353
 camps-0.352
 end-0.339
 beginning-0.333
 size-0.328
Token and
Token ID290
Feature activation+1.32e-01

Pos logit contributions
ons+0.823
aid+0.780
tail+0.715
quet+0.703
 stitches+0.646
Neg logit contributions
 ending-0.528
 beginning-0.516
 size-0.478
sing-0.472
 crashing-0.463
Token shared
Token ID4888
Feature activation+4.38e+00

Pos logit contributions
ons+0.010
aid+0.009
quet+0.009
tail+0.009
bes+0.008
Neg logit contributions
 ending-0.007
 beginning-0.007
 size-0.007
sing-0.007
 crashing-0.006
Token them
Token ID606
Feature activation-7.70e+00

Pos logit contributions
ons+0.417
aid+0.379
tail+0.361
quet+0.357
 exclaimed+0.336
Neg logit contributions
 ending-0.165
 beginning-0.149
sing-0.139
 size-0.133
 crashing-0.131
Token with
Token ID351
Feature activation+1.09e+00

Pos logit contributions
 ending+0.411
 size+0.391
 beginning+0.389
 end+0.370
sing+0.361
Neg logit contributions
ons-0.564
aid-0.512
tail-0.489
quet-0.472
keeping-0.431
Token everyone
Token ID2506
Feature activation+3.03e-03

Pos logit contributions
ons+0.102
aid+0.093
tail+0.089
quet+0.088
 exclaimed+0.082
Neg logit contributions
 ending-0.042
 beginning-0.038
sing-0.035
 size-0.035
 crashing-0.034
Token it
Token ID340
Feature activation+1.51e-01

Pos logit contributions
ons+0.048
aid+0.045
quet+0.044
tail+0.041
 exclaimed+0.039
Neg logit contributions
 beginning-0.033
 ending-0.033
 size-0.032
 crashing-0.031
 end-0.030
Token.
Token ID13
Feature activation-5.39e-01

Pos logit contributions
ons+0.012
aid+0.011
quet+0.010
tail+0.010
keeping+0.009
Neg logit contributions
 ending-0.008
 beginning-0.008
 size-0.007
sing-0.007
 crashing-0.007
Token
Token ID198
Feature activation+1.15e-01

Pos logit contributions
 ending+0.022
ksh+0.022
 beginning+0.021
sing+0.019
 only+0.019
Neg logit contributions
ons-0.047
quet-0.043
aid-0.043
keeping-0.040
odic-0.039
Token
Token ID198
Feature activation+1.09e-01

Pos logit contributions
ons+0.009
quet+0.009
aid+0.009
tail+0.008
keeping+0.008
Neg logit contributions
 ending-0.005
 end-0.004
 camps-0.004
 beginning-0.004
 crashing-0.004
TokenM
Token ID44
Feature activation-5.45e+00

Pos logit contributions
ons+0.008
keeping+0.007
aid+0.007
quet+0.007
tail+0.007
Neg logit contributions
 ending-0.006
 beginning-0.006
 end-0.005
iet-0.005
 crashing-0.005
Tokenittens
Token ID34978
Feature activation+9.59e+00

Pos logit contributions
 ending+0.476
 beginning+0.445
 size+0.435
 end+0.434
 camps+0.431
Neg logit contributions
ons-0.242
aid-0.183
keeping-0.177
tail-0.172
quet-0.164
Token wasn
Token ID2492
Feature activation-3.08e+00

Pos logit contributions
ons+0.759
aid+0.686
quet+0.649
tail+0.649
keeping+0.592
Neg logit contributions
 ending-0.505
 beginning-0.474
 size-0.440
sing-0.438
 crashing-0.426
Token't
Token ID470
Feature activation+9.64e-01

Pos logit contributions
 beginning+0.209
 ending+0.208
 crashing+0.199
 end+0.195
sing+0.191
Neg logit contributions
ons-0.196
aid-0.167
tail-0.156
keeping-0.152
quet-0.151
Token happy
Token ID3772
Feature activation+1.10e+00

Pos logit contributions
ons+0.077
aid+0.066
tail+0.065
quet+0.065
chair+0.060
Neg logit contributions
 ending-0.052
 beginning-0.050
 crashing-0.047
 size-0.046
sing-0.046
Token with
Token ID351
Feature activation+5.64e-01

Pos logit contributions
ons+0.081
aid+0.067
tail+0.065
quet+0.064
 exclaimed+0.061
Neg logit contributions
 ending-0.061
sing-0.060
ksh-0.059
 beginning-0.059
 camps-0.057
Token this
Token ID428
Feature activation+7.79e-01

Pos logit contributions
ons+0.040
aid+0.035
quet+0.033
tail+0.033
 exclaimed+0.030
Neg logit contributions
 ending-0.035
 beginning-0.033
 size-0.032
sing-0.031
 crashing-0.031
Token she
Token ID673
Feature activation+4.57e+00

Pos logit contributions
ons+0.026
aid+0.023
quet+0.021
tail+0.021
powers+0.019
Neg logit contributions
 ending-0.034
 beginning-0.032
 size-0.030
 crashing-0.029
sing-0.029
Token bought
Token ID5839
Feature activation+2.28e+00

Pos logit contributions
ons+0.391
aid+0.358
quet+0.349
tail+0.331
keeping+0.314
Neg logit contributions
 ending-0.211
 beginning-0.202
 size-0.186
sing-0.182
 crashing-0.176
Token an
Token ID281
Feature activation-2.31e+00

Pos logit contributions
ons+0.172
aid+0.156
tail+0.153
quet+0.150
bes+0.140
Neg logit contributions
 ending-0.121
 beginning-0.119
 size-0.108
 end-0.104
 crashing-0.103
Token ice
Token ID4771
Feature activation+3.39e-01

Pos logit contributions
 ending+0.133
 beginning+0.123
sing+0.118
 crashing+0.117
 size+0.116
Neg logit contributions
ons-0.170
aid-0.152
tail-0.143
quet-0.142
 exclaimed-0.130
Token-
Token ID12
Feature activation+4.14e+00

Pos logit contributions
ons+0.020
aid+0.017
tail+0.016
quet+0.016
 exclaimed+0.014
Neg logit contributions
 ending-0.025
 beginning-0.023
sing-0.022
 size-0.022
 crashing-0.022
Tokencream
Token ID36277
Feature activation+8.92e+00

Pos logit contributions
ons+0.201
tail+0.160
quet+0.155
aid+0.155
bes+0.141
Neg logit contributions
 ending-0.320
 beginning-0.315
ksh-0.304
 crashing-0.298
 size-0.297
Token as
Token ID355
Feature activation+1.49e+00

Pos logit contributions
ons+0.668
aid+0.586
tail+0.555
quet+0.544
bes+0.505
Neg logit contributions
 ending-0.515
 beginning-0.485
sing-0.455
 size-0.454
 crashing-0.450
Token a
Token ID257
Feature activation+5.91e-01

Pos logit contributions
ons+0.116
aid+0.104
tail+0.103
quet+0.100
bes+0.095
Neg logit contributions
 ending-0.079
 beginning-0.075
 size-0.070
 crashing-0.067
 camps-0.065
Token reward
Token ID6721
Feature activation+4.78e+00

Pos logit contributions
ons+0.041
aid+0.036
tail+0.034
quet+0.033
 exclaimed+0.030
Neg logit contributions
 ending-0.037
 beginning-0.035
sing-0.034
 size-0.033
 crashing-0.033
Token for
Token ID329
Feature activation-3.37e+00

Pos logit contributions
ons+0.372
aid+0.334
tail+0.312
quet+0.306
bes+0.284
Neg logit contributions
 ending-0.259
 beginning-0.244
 size-0.228
sing-0.226
 crashing-0.223
Token catching
Token ID16508
Feature activation-3.30e-01

Pos logit contributions
 beginning+0.185
 ending+0.182
 size+0.168
 crashing+0.165
 end+0.159
Neg logit contributions
ons-0.255
aid-0.240
powers-0.226
tail-0.220
quet-0.212
Token Teddy
Token ID29345
Feature activation+1.31e+00

Pos logit contributions
 ending+0.003
 beginning+0.003
sing+0.003
 size+0.003
 crashing+0.003
Neg logit contributions
ons-0.003
aid-0.003
tail-0.003
quet-0.003
 exclaimed-0.002
Token noticed
Token ID6810
Feature activation-4.29e+00

Pos logit contributions
ons+0.103
aid+0.093
tail+0.087
quet+0.087
 stitches+0.079
Neg logit contributions
 ending-0.069
 beginning-0.065
sing-0.061
 size-0.060
 crashing-0.059
Token that
Token ID326
Feature activation+4.62e-01

Pos logit contributions
 ending+0.282
 beginning+0.266
sing+0.257
 size+0.252
 crashing+0.249
Neg logit contributions
ons-0.290
aid-0.252
tail-0.233
quet-0.230
 stitches-0.210
Token the
Token ID262
Feature activation+6.68e-01

Pos logit contributions
ons+0.032
aid+0.028
tail+0.026
quet+0.026
 stitches+0.024
Neg logit contributions
 ending-0.029
 beginning-0.028
sing-0.027
 size-0.026
 crashing-0.026
Token band
Token ID4097
Feature activation+8.57e+00

Pos logit contributions
ons+0.037
aid+0.030
tail+0.028
quet+0.028
keeping+0.025
Neg logit contributions
 ending-0.053
 beginning-0.050
 size-0.048
sing-0.048
 crashing-0.048
Tokenage
Token ID496
Feature activation+9.21e+00

Pos logit contributions
ons+0.429
quet+0.366
tail+0.358
aid+0.356
keeping+0.317
Neg logit contributions
 ending-0.642
 beginning-0.606
 crashing-0.585
 size-0.582
sing-0.578
Token was
Token ID373
Feature activation-2.95e-01

Pos logit contributions
ons+0.760
aid+0.727
quet+0.695
tail+0.694
keeping+0.656
Neg logit contributions
 ending-0.426
 beginning-0.400
 size-0.368
 crashing-0.352
sing-0.344
Token starting
Token ID3599
Feature activation-3.72e+00

Pos logit contributions
 ending+0.018
ksh+0.017
sing+0.016
 beginning+0.016
 camps+0.016
Neg logit contributions
ons-0.020
aid-0.018
tail-0.016
quet-0.016
 stitches-0.016
Token to
Token ID284
Feature activation+4.12e-01

Pos logit contributions
 ending+0.162
 beginning+0.156
 size+0.154
sing+0.149
 crashing+0.145
Neg logit contributions
ons-0.326
aid-0.298
quet-0.288
tail-0.286
ner-0.255
Token come
Token ID1282
Feature activation-4.86e+00

Pos logit contributions
ons+0.033
aid+0.029
tail+0.027
quet+0.027
keeping+0.025
Neg logit contributions
 ending-0.023
 beginning-0.022
 size-0.021
 crashing-0.020
sing-0.020
Token loose
Token ID9155
Feature activation-1.12e+00

Pos logit contributions
 ending+0.282
 whale+0.248
 beginning+0.246
 whales+0.246
sing+0.240
Neg logit contributions
ons-0.340
tail-0.311
aid-0.303
quet-0.291
 stitches-0.285
Token went
Token ID1816
Feature activation-3.60e-01

Pos logit contributions
ons+0.387
aid+0.340
tail+0.322
quet+0.310
 exclaimed+0.302
Neg logit contributions
 ending-0.240
 beginning-0.227
sing-0.218
 crashing-0.211
ksh-0.207
Token to
Token ID284
Feature activation+1.08e+00

Pos logit contributions
 ending+0.017
 beginning+0.017
 camps+0.015
 crashing+0.015
sing+0.015
Neg logit contributions
ons-0.030
aid-0.027
quet-0.026
tail-0.025
 exclaimed-0.024
Token the
Token ID262
Feature activation+1.49e+00

Pos logit contributions
ons+0.090
aid+0.085
quet+0.082
tail+0.075
bes+0.074
Neg logit contributions
 ending-0.051
 beginning-0.048
 camps-0.046
 size-0.046
 end-0.046
Token ice
Token ID4771
Feature activation+3.42e-01

Pos logit contributions
ons+0.092
aid+0.083
tail+0.078
quet+0.074
 stitches+0.072
Neg logit contributions
 ending-0.099
sing-0.094
rated-0.090
ksh-0.089
 beginning-0.089
Token-
Token ID12
Feature activation+4.04e+00

Pos logit contributions
ons+0.021
aid+0.019
tail+0.017
quet+0.017
 exclaimed+0.015
Neg logit contributions
 ending-0.023
 beginning-0.022
 size-0.021
 camps-0.021
sing-0.021
Tokencream
Token ID36277
Feature activation+8.88e+00

Pos logit contributions
ons+0.203
quet+0.166
tail+0.165
aid+0.161
bes+0.143
Neg logit contributions
 ending-0.298
 beginning-0.298
ksh-0.287
 camps-0.284
rant-0.281
Token shop
Token ID6128
Feature activation+5.90e+00

Pos logit contributions
ons+0.599
aid+0.532
tail+0.489
quet+0.488
bes+0.453
Neg logit contributions
 ending-0.573
 beginning-0.550
 camps-0.518
sing-0.518
 size-0.516
Token with
Token ID351
Feature activation+1.40e+00

Pos logit contributions
ons+0.391
aid+0.337
tail+0.321
quet+0.314
 exclaimed+0.285
Neg logit contributions
 ending-0.392
 beginning-0.366
sing-0.351
 crashing-0.346
 size-0.344
Token her
Token ID607
Feature activation+2.75e+00

Pos logit contributions
ons+0.102
aid+0.095
quet+0.094
tail+0.090
bes+0.087
Neg logit contributions
 ending-0.075
 beginning-0.074
 end-0.072
 size-0.068
 names-0.066
Token mom
Token ID1995
Feature activation-3.70e+00

Pos logit contributions
ons+0.277
aid+0.254
tail+0.245
quet+0.245
keeping+0.230
Neg logit contributions
 ending-0.088
 beginning-0.078
 size-0.072
sing-0.070
 camps-0.068
Token.
Token ID13
Feature activation+2.07e-01

Pos logit contributions
 ending+0.229
sing+0.214
 beginning+0.203
 crashing+0.199
 size+0.193
Neg logit contributions
ons-0.274
aid-0.228
tail-0.220
quet-0.212
 exclaimed-0.199
Token and
Token ID290
Feature activation+2.97e-01

Pos logit contributions
ons+0.203
aid+0.184
tail+0.175
quet+0.172
 stitches+0.161
Neg logit contributions
 ending-0.118
 beginning-0.110
sing-0.102
 crashing-0.100
ksh-0.099
Token wiped
Token ID21122
Feature activation+2.96e+00

Pos logit contributions
ons+0.021
quet+0.020
tail+0.018
aid+0.018
keeping+0.017
Neg logit contributions
 ending-0.018
 size-0.016
 only-0.015
 beginning-0.015
sing-0.014
Token her
Token ID607
Feature activation+2.38e+00

Pos logit contributions
ons+0.207
quet+0.186
aid+0.180
tail+0.165
powers+0.158
Neg logit contributions
 ending-0.184
 beginning-0.171
 size-0.171
 end-0.168
 crashing-0.159
Token tears
Token ID10953
Feature activation-2.35e-01

Pos logit contributions
ons+0.200
quet+0.178
aid+0.176
tail+0.166
grand+0.157
Neg logit contributions
 ending-0.124
 beginning-0.111
 size-0.111
sing-0.109
 end-0.109
Token.
Token ID13
Feature activation+5.89e-02

Pos logit contributions
 ending+0.012
 beginning+0.011
 size+0.010
 crashing+0.010
sing+0.010
Neg logit contributions
ons-0.020
aid-0.017
quet-0.017
tail-0.016
bes-0.015
Token She
Token ID1375
Feature activation+3.30e+00

Pos logit contributions
ons+0.005
quet+0.004
aid+0.004
tail+0.004
odic+0.004
Neg logit contributions
 ending-0.003
 beginning-0.003
sing-0.003
 crashing-0.003
 size-0.003
Token put
Token ID1234
Feature activation+3.39e-01

Pos logit contributions
ons+0.237
quet+0.221
tail+0.209
keeping+0.204
aid+0.197
Neg logit contributions
 ending-0.183
 size-0.169
 beginning-0.165
 only-0.165
 camps-0.164
Token a
Token ID257
Feature activation+5.08e-01

Pos logit contributions
ons+0.024
quet+0.021
aid+0.020
tail+0.020
powers+0.019
Neg logit contributions
 ending-0.020
 beginning-0.019
 size-0.018
 end-0.018
 crashing-0.017
Token band
Token ID4097
Feature activation+9.22e+00

Pos logit contributions
ons+0.030
tail+0.024
keeping+0.023
quet+0.023
aid+0.021
Neg logit contributions
 ending-0.038
 beginning-0.036
 size-0.035
 end-0.035
 crashing-0.035
Token-
Token ID12
Feature activation+4.03e+00

Pos logit contributions
ons+0.568
quet+0.461
pill+0.439
 exclaimed+0.435
tail+0.415
Neg logit contributions
 ending-0.581
 beginning-0.569
 crashing-0.549
 end-0.548
king-0.540
Tokenaid
Token ID1698
Feature activation+1.33e+01

Pos logit contributions
ons+0.079
quet+0.041
tail+0.039
bes+0.034
keeping+0.025
Neg logit contributions
 ending-0.408
 beginning-0.406
ksh-0.397
 crashing-0.393
 end-0.390
Token see
Token ID766
Feature activation-2.25e-01

Pos logit contributions
ons+0.112
aid+0.097
pill+0.096
con+0.091
es+0.091
Neg logit contributions
 end-0.046
 ending-0.045
 captain-0.045
 size-0.044
 only-0.041
Token what
Token ID644
Feature activation-1.97e+00

Pos logit contributions
 ending+0.012
 beginning+0.012
sing+0.011
 size+0.011
 crashing+0.011
Neg logit contributions
ons-0.017
aid-0.015
tail-0.014
quet-0.014
keeping-0.013
Token was
Token ID373
Feature activation+4.36e-01

Pos logit contributions
 ending+0.097
 beginning+0.080
 would+0.078
 end+0.077
 only+0.076
Neg logit contributions
ons-0.155
aid-0.154
quet-0.147
tail-0.144
keeping-0.141
Token inside
Token ID2641
Feature activation-1.44e+00

Pos logit contributions
ons+0.038
aid+0.035
quet+0.033
tail+0.033
powers+0.030
Neg logit contributions
 ending-0.019
 beginning-0.018
 size-0.017
sing-0.016
 crashing-0.016
Token the
Token ID262
Feature activation+1.24e+00

Pos logit contributions
 ending+0.076
 beginning+0.074
 end+0.070
 size+0.068
 crashing+0.065
Neg logit contributions
ons-0.111
aid-0.108
quet-0.097
tail-0.096
powers-0.094
Token box
Token ID3091
Feature activation+3.35e+00

Pos logit contributions
ons+0.079
aid+0.067
quet+0.062
tail+0.062
 stitches+0.057
Neg logit contributions
 ending-0.085
 beginning-0.080
sing-0.079
 crashing-0.077
ksh-0.077
Token.
Token ID13
Feature activation+2.26e-01

Pos logit contributions
ons+0.262
aid+0.247
tail+0.223
quet+0.216
powers+0.210
Neg logit contributions
 ending-0.178
 beginning-0.173
 size-0.161
sing-0.153
 end-0.152
Token
Token ID198
Feature activation+9.63e-01

Pos logit contributions
ons+0.016
aid+0.014
quet+0.013
tail+0.012
keeping+0.011
Neg logit contributions
 ending-0.014
 beginning-0.014
ksh-0.013
sing-0.013
 camps-0.013
Token
Token ID198
Feature activation+9.64e-01

Pos logit contributions
ons+0.071
aid+0.063
tail+0.061
quet+0.061
keeping+0.056
Neg logit contributions
 ending-0.053
 beginning-0.048
 camps-0.047
 end-0.047
 crashing-0.047
TokenShe
Token ID3347
Feature activation+3.39e+00

Pos logit contributions
ons+0.072
aid+0.063
tail+0.059
quet+0.058
keeping+0.055
Neg logit contributions
 ending-0.056
 beginning-0.053
 size-0.049
 crashing-0.049
sing-0.049
Token ran
Token ID4966
Feature activation+1.49e+00

Pos logit contributions
ons+0.252
aid+0.222
tail+0.209
quet+0.204
 exclaimed+0.191
Neg logit contributions
 ending-0.199
 beginning-0.186
sing-0.178
 crashing-0.174
 size-0.173
Token her
Token ID607
Feature activation+6.73e-01

Pos logit contributions
ons+0.030
aid+0.027
quet+0.025
tail+0.025
 stitches+0.023
Neg logit contributions
 ending-0.021
 beginning-0.020
sing-0.019
 size-0.018
 crashing-0.018
Token sc
Token ID629
Feature activation+1.68e+00

Pos logit contributions
ons+0.052
aid+0.049
quet+0.045
tail+0.043
keeping+0.040
Neg logit contributions
 ending-0.036
 beginning-0.034
sing-0.032
 size-0.032
 camps-0.032
Tokenooter
Token ID25141
Feature activation+6.06e+00

Pos logit contributions
ons+0.114
aid+0.093
quet+0.089
tail+0.087
keeping+0.084
Neg logit contributions
 beginning-0.103
 ending-0.102
sing-0.101
 crashing-0.099
 mate-0.099
Token.
Token ID13
Feature activation-1.28e+00

Pos logit contributions
ons+0.504
aid+0.463
quet+0.433
tail+0.413
keeping+0.395
Neg logit contributions
 beginning-0.281
 ending-0.279
 size-0.252
 crashing-0.243
sing-0.241
Token She
Token ID1375
Feature activation+2.27e+00

Pos logit contributions
 ending+0.076
 beginning+0.072
ksh+0.069
 crashing+0.068
 camps+0.068
Neg logit contributions
ons-0.093
aid-0.084
quet-0.077
tail-0.073
 stitches-0.069
Token picked
Token ID6497
Feature activation+1.30e+00

Pos logit contributions
ons+0.180
aid+0.163
quet+0.157
tail+0.152
keeping+0.143
Neg logit contributions
 ending-0.117
 beginning-0.110
 size-0.104
sing-0.103
 crashing-0.102
Token it
Token ID340
Feature activation-1.86e-04

Pos logit contributions
ons+0.106
aid+0.094
tail+0.089
quet+0.087
 stitches+0.082
Neg logit contributions
 ending-0.068
 beginning-0.062
sing-0.061
 size-0.059
 crashing-0.057
Token up
Token ID510
Feature activation-5.45e+00

Pos logit contributions
 ending+0.000
 beginning+0.000
sing+0.000
 size+0.000
 crashing+0.000
Neg logit contributions
ons-0.000
aid-0.000
tail-0.000
quet-0.000
 exclaimed-0.000
Token and
Token ID290
Feature activation-5.05e-01

Pos logit contributions
 ending+0.276
 beginning+0.259
 size+0.245
sing+0.244
 crashing+0.237
Neg logit contributions
ons-0.447
aid-0.400
tail-0.377
quet-0.376
 stitches-0.343
Token went
Token ID1816
Feature activation-9.01e-01

Pos logit contributions
 ending+0.026
 beginning+0.025
 crashing+0.024
 size+0.024
sing+0.023
Neg logit contributions
ons-0.038
quet-0.034
aid-0.034
tail-0.031
keeping-0.030
Token riding
Token ID10311
Feature activation+5.31e+00

Pos logit contributions
 beginning+0.039
 ending+0.039
 crashing+0.037
 camps+0.035
 size+0.034
Neg logit contributions
ons-0.079
aid-0.071
quet-0.067
tail-0.066
 exclaimed-0.061
Token Pink
Token ID14657
Feature activation-2.84e-01

Pos logit contributions
ons+0.009
aid+0.007
quet+0.007
tail+0.007
keeping+0.006
Neg logit contributions
 ending-0.009
 beginning-0.008
sing-0.008
 size-0.008
 crashing-0.007
Token said
Token ID531
Feature activation+3.34e+00

Pos logit contributions
 ending+0.019
 beginning+0.018
sing+0.017
 size+0.017
 crashing+0.017
Neg logit contributions
ons-0.019
aid-0.016
tail-0.015
quet-0.015
keeping-0.013
Token,
Token ID11
Feature activation+2.62e-01

Pos logit contributions
ons+0.200
aid+0.173
tail+0.160
quet+0.158
keeping+0.140
Neg logit contributions
 ending-0.243
 beginning-0.230
sing-0.221
 size-0.218
 crashing-0.216
Token "
Token ID366
Feature activation-3.31e+00

Pos logit contributions
ons+0.022
aid+0.020
tail+0.019
quet+0.018
 exclaimed+0.018
Neg logit contributions
 ending-0.012
 size-0.011
 beginning-0.011
 camps-0.011
sing-0.011
TokenI
Token ID40
Feature activation-2.73e+00

Pos logit contributions
 ending+0.183
 beginning+0.170
sing+0.167
 size+0.161
 crashing+0.160
Neg logit contributions
ons-0.256
aid-0.229
tail-0.216
quet-0.213
 exclaimed-0.198
Token am
Token ID716
Feature activation+4.10e+00

Pos logit contributions
 ending+0.140
 beginning+0.127
sing+0.121
 size+0.118
 crashing+0.113
Neg logit contributions
ons-0.226
aid-0.203
quet-0.195
tail-0.190
keeping-0.177
Token lost
Token ID2626
Feature activation-5.50e-01

Pos logit contributions
ons+0.348
aid+0.317
quet+0.298
tail+0.297
keeping+0.279
Neg logit contributions
 ending-0.208
 beginning-0.205
sing-0.189
 size-0.180
 crashing-0.178
Token and
Token ID290
Feature activation+6.85e-02

Pos logit contributions
 ending+0.028
 beginning+0.026
 size+0.025
sing+0.024
 crashing+0.024
Neg logit contributions
ons-0.045
aid-0.041
quet-0.038
tail-0.038
 exclaimed-0.036
Token I
Token ID314
Feature activation+8.50e-01

Pos logit contributions
ons+0.006
aid+0.005
quet+0.005
tail+0.005
bes+0.005
Neg logit contributions
 ending-0.004
 beginning-0.003
sing-0.003
 crashing-0.003
 only-0.003
Token don
Token ID836
Feature activation+6.85e-01

Pos logit contributions
ons+0.074
aid+0.066
quet+0.065
tail+0.062
keeping+0.058
Neg logit contributions
 ending-0.040
 beginning-0.036
sing-0.034
 size-0.034
 crashing-0.032
Token't
Token ID470
Feature activation+1.54e+00

Pos logit contributions
aid+0.048
ons+0.047
keeping+0.045
tail+0.043
con+0.040
Neg logit contributions
 beginning-0.035
 end-0.034
 crashing-0.034
 ending-0.034
 only-0.030
Token Inside
Token ID14384
Feature activation+4.92e+00

Pos logit contributions
ons+0.018
aid+0.016
tail+0.015
quet+0.015
bes+0.014
Neg logit contributions
 ending-0.012
 beginning-0.011
ksh-0.011
 crashing-0.010
 camps-0.010
Token the
Token ID262
Feature activation+1.33e+00

Pos logit contributions
ons+0.352
aid+0.351
tail+0.320
quet+0.315
powers+0.295
Neg logit contributions
 ending-0.279
 beginning-0.265
 size-0.250
 camps-0.241
 end-0.241
Token frame
Token ID5739
Feature activation+1.00e+01

Pos logit contributions
ons+0.070
aid+0.067
tail+0.054
keeping+0.053
quet+0.052
Neg logit contributions
 beginning-0.106
 ending-0.106
 end-0.105
 size-0.102
 side-0.094
Token was
Token ID373
Feature activation+4.20e-01

Pos logit contributions
aid+0.766
ons+0.728
quet+0.672
tail+0.670
powers+0.655
Neg logit contributions
 ending-0.503
 beginning-0.493
 size-0.471
 end-0.433
sing-0.415
Token a
Token ID257
Feature activation+7.41e-01

Pos logit contributions
ons+0.032
aid+0.028
tail+0.027
quet+0.026
 exclaimed+0.024
Neg logit contributions
 ending-0.024
 beginning-0.022
sing-0.021
 size-0.021
 crashing-0.021
Token picture
Token ID4286
Feature activation+2.51e+00

Pos logit contributions
ons+0.060
aid+0.053
tail+0.049
quet+0.048
keeping+0.046
Neg logit contributions
 ending-0.041
 beginning-0.038
 size-0.035
sing-0.035
 crashing-0.034
Token of
Token ID286
Feature activation+3.97e+00

Pos logit contributions
ons+0.139
aid+0.125
tail+0.113
quet+0.107
keeping+0.101
Neg logit contributions
 ending-0.190
 beginning-0.181
 size-0.172
 crashing-0.170
sing-0.169
Token the
Token ID262
Feature activation+1.31e+00

Pos logit contributions
ons+0.309
aid+0.283
tail+0.267
quet+0.261
powers+0.240
Neg logit contributions
 ending-0.213
 beginning-0.203
 size-0.192
sing-0.185
 crashing-0.182
Token happy
Token ID3772
Feature activation+1.97e+00

Pos logit contributions
ons+0.095
aid+0.086
tail+0.078
quet+0.077
 exclaimed+0.071
Neg logit contributions
 ending-0.080
 beginning-0.076
 size-0.072
sing-0.071
 end-0.069
Token family
Token ID1641
Feature activation+6.30e-01

Pos logit contributions
ons+0.156
aid+0.143
tail+0.133
quet+0.129
 exclaimed+0.120
Neg logit contributions
 ending-0.109
 beginning-0.099
 size-0.096
sing-0.091
 crashing-0.090
Token singing
Token ID13777
Feature activation+6.82e-01

Pos logit contributions
ons+0.047
aid+0.042
tail+0.039
quet+0.038
bes+0.036
Neg logit contributions
 ending-0.037
 beginning-0.035
 size-0.032
sing-0.032
 crashing-0.032
Token nice
Token ID3621
Feature activation-1.54e+00

Pos logit contributions
 ending+0.007
 beginning+0.007
sing+0.007
 size+0.006
 crashing+0.006
Neg logit contributions
ons-0.009
aid-0.008
tail-0.007
quet-0.007
 exclaimed-0.007
Token for
Token ID329
Feature activation-3.63e+00

Pos logit contributions
 ending+0.088
 beginning+0.083
sing+0.078
 size+0.078
 crashing+0.076
Neg logit contributions
ons-0.116
aid-0.103
tail-0.097
quet-0.096
 exclaimed-0.087
Token the
Token ID262
Feature activation+9.49e-01

Pos logit contributions
 ending+0.218
 beginning+0.206
sing+0.194
 size+0.193
 crashing+0.190
Neg logit contributions
ons-0.266
aid-0.236
tail-0.223
quet-0.219
 exclaimed-0.199
Token big
Token ID1263
Feature activation-1.86e+00

Pos logit contributions
ons+0.068
aid+0.060
quet+0.054
tail+0.054
powers+0.051
Neg logit contributions
ksh-0.054
rant-0.054
 ending-0.053
sing-0.052
 camps-0.051
Token bad
Token ID2089
Feature activation-5.25e-01

Pos logit contributions
 ending+0.093
 beginning+0.093
ksh+0.092
rant+0.090
sing+0.090
Neg logit contributions
ons-0.146
aid-0.126
tail-0.123
quet-0.118
 stitches-0.112
Token man
Token ID582
Feature activation+3.39e-01

Pos logit contributions
ksh+0.031
ains+0.029
rant+0.029
king+0.026
sing+0.025
Neg logit contributions
ons-0.038
aid-0.033
tail-0.031
 stitches-0.031
quet-0.030
Token.
Token ID13
Feature activation-6.15e-02

Pos logit contributions
ons+0.027
aid+0.025
quet+0.023
tail+0.023
bes+0.021
Neg logit contributions
 ending-0.018
 beginning-0.017
 size-0.015
sing-0.015
 crashing-0.015
Token He
Token ID679
Feature activation-1.47e-01

Pos logit contributions
 ending+0.004
 beginning+0.003
sing+0.003
 size+0.003
 crashing+0.003
Neg logit contributions
ons-0.004
aid-0.004
tail-0.004
quet-0.004
 exclaimed-0.003
Token worked
Token ID3111
Feature activation+3.50e+00

Pos logit contributions
 ending+0.008
 beginning+0.008
 size+0.008
sing+0.007
 crashing+0.007
Neg logit contributions
ons-0.011
aid-0.010
tail-0.009
quet-0.009
keeping-0.008
Token hard
Token ID1327
Feature activation-2.73e-01

Pos logit contributions
ons+0.270
aid+0.241
tail+0.228
quet+0.225
 exclaimed+0.205
Neg logit contributions
 ending-0.193
 beginning-0.181
sing-0.171
 size-0.170
 crashing-0.167
Token all
Token ID477
Feature activation-1.12e+00

Pos logit contributions
 ending+0.014
 beginning+0.013
 size+0.012
 crashing+0.012
sing+0.011
Neg logit contributions
ons-0.022
aid-0.021
tail-0.020
quet-0.020
 stitches-0.018
Token upon
Token ID2402
Feature activation-6.30e+00

Pos logit contributions
 ending+0.071
 beginning+0.067
sing+0.064
 size+0.063
 crashing+0.062
Neg logit contributions
ons-0.095
aid-0.084
tail-0.079
quet-0.078
 exclaimed-0.072
Token a
Token ID257
Feature activation-1.88e+00

Pos logit contributions
 ending+0.322
 beginning+0.305
 size+0.274
sing+0.264
 crashing+0.262
Neg logit contributions
ons-0.519
aid-0.478
quet-0.447
tail-0.444
bes-0.406
Token time
Token ID640
Feature activation-6.04e+00

Pos logit contributions
 beginning+0.076
 ending+0.076
 crashing+0.075
ksh+0.075
sing+0.073
Neg logit contributions
ons-0.159
tail-0.140
aid-0.137
chair-0.128
powers-0.125
Token,
Token ID11
Feature activation-1.08e+00

Pos logit contributions
 ending+0.426
 beginning+0.401
sing+0.395
ksh+0.390
 size+0.384
Neg logit contributions
ons-0.376
aid-0.311
tail-0.292
quet-0.287
 exclaimed-0.260
Token there
Token ID612
Feature activation-1.65e+00

Pos logit contributions
 ending+0.082
ksh+0.079
sing+0.076
 beginning+0.073
 size+0.072
Neg logit contributions
ons-0.058
aid-0.044
tail-0.042
quet-0.040
on-0.040
Token was
Token ID373
Feature activation-7.74e-01

Pos logit contributions
 ending+0.103
ksh+0.101
sing+0.100
 beginning+0.098
 crashing+0.096
Neg logit contributions
ons-0.115
aid-0.092
tail-0.089
quet-0.086
 exclaimed-0.084
Token a
Token ID257
Feature activation-4.27e-01

Pos logit contributions
 ending+0.049
 beginning+0.045
sing+0.044
 size+0.043
 crashing+0.043
Neg logit contributions
ons-0.054
aid-0.046
tail-0.044
quet-0.042
 exclaimed-0.039
Token little
Token ID1310
Feature activation-1.14e+00

Pos logit contributions
 ending+0.024
ksh+0.024
 beginning+0.024
rant+0.023
sing+0.022
Neg logit contributions
ons-0.028
aid-0.025
tail-0.025
quet-0.024
 stitches-0.022
Token girl
Token ID2576
Feature activation-3.44e-01

Pos logit contributions
 ending+0.057
 beginning+0.053
sing+0.050
 crashing+0.049
ksh+0.048
Neg logit contributions
ons-0.091
aid-0.082
tail-0.080
quet-0.077
 stitches-0.073
Token called
Token ID1444
Feature activation+3.84e+00

Pos logit contributions
 ending+0.020
 beginning+0.019
 size+0.019
 whale+0.018
sing+0.018
Neg logit contributions
ons-0.024
aid-0.023
quet-0.021
tail-0.019
powers-0.019
Token Lily
Token ID20037
Feature activation+1.60e+00

Pos logit contributions
ons+0.275
aid+0.253
quet+0.228
tail+0.224
powers+0.205
Neg logit contributions
 ending-0.230
 beginning-0.217
sing-0.205
 size-0.202
 crashing-0.201
Token so
Token ID523
Feature activation+4.00e+00

Pos logit contributions
ons+0.121
aid+0.108
quet+0.103
tail+0.101
keeping+0.089
Neg logit contributions
 ending-0.101
 beginning-0.099
 crashing-0.091
 size-0.091
sing-0.091
Token much
Token ID881
Feature activation+3.80e+00

Pos logit contributions
ons+0.253
aid+0.232
quet+0.232
tail+0.225
keeping+0.216
Neg logit contributions
 ending-0.261
 beginning-0.232
 camps-0.231
sing-0.224
 end-0.223
Token fun
Token ID1257
Feature activation+7.07e-01

Pos logit contributions
ons+0.303
aid+0.270
tail+0.268
quet+0.262
keeping+0.242
Neg logit contributions
 ending-0.205
 beginning-0.180
 size-0.170
 crashing-0.168
sing-0.167
Token,
Token ID11
Feature activation+5.39e-01

Pos logit contributions
ons+0.050
aid+0.043
tail+0.041
quet+0.040
 exclaimed+0.036
Neg logit contributions
 ending-0.044
 beginning-0.042
sing-0.040
 size-0.039
 crashing-0.039
Token they
Token ID484
Feature activation-2.66e+00

Pos logit contributions
ons+0.040
aid+0.034
tail+0.032
 exclaimed+0.031
quet+0.031
Neg logit contributions
 ending-0.029
 beginning-0.028
ksh-0.027
rant-0.027
sing-0.026
Token didn
Token ID1422
Feature activation+7.95e-01

Pos logit contributions
 ending+0.157
 beginning+0.144
sing+0.141
 size+0.138
ksh+0.138
Neg logit contributions
ons-0.196
aid-0.171
tail-0.160
quet-0.154
 exclaimed-0.153
Token't
Token ID470
Feature activation+1.84e+00

Pos logit contributions
ons+0.044
aid+0.038
tail+0.034
quet+0.033
keeping+0.029
Neg logit contributions
 ending-0.061
 beginning-0.059
sing-0.056
 crashing-0.056
 size-0.056
Token realize
Token ID6537
Feature activation+8.01e-01

Pos logit contributions
ons+0.136
aid+0.128
tail+0.122
quet+0.115
keeping+0.109
Neg logit contributions
 ending-0.095
 beginning-0.092
 camps-0.090
ksh-0.089
sing-0.088
Token how
Token ID703
Feature activation+1.76e-01

Pos logit contributions
ons+0.054
aid+0.046
tail+0.042
quet+0.042
 exclaimed+0.039
Neg logit contributions
 ending-0.052
 beginning-0.050
sing-0.049
 size-0.047
ksh-0.047
Token dirty
Token ID11841
Feature activation+7.73e-01

Pos logit contributions
ons+0.012
aid+0.011
tail+0.010
quet+0.010
keeping+0.009
Neg logit contributions
 ending-0.011
 beginning-0.011
sing-0.010
 crashing-0.010
 size-0.010
Token they
Token ID484
Feature activation-2.64e+00

Pos logit contributions
ons+0.057
aid+0.047
tail+0.044
quet+0.043
 stitches+0.042
Neg logit contributions
 ending-0.045
 beginning-0.044
sing-0.043
ksh-0.041
 camps-0.041
Token host
Token ID2583
Feature activation-9.98e-01

Pos logit contributions
ons+0.087
aid+0.081
tail+0.081
quet+0.078
 stitches+0.072
Neg logit contributions
 ending-0.065
rated-0.063
ksh-0.062
rant-0.060
 crashing-0.060
Token and
Token ID290
Feature activation+3.50e-01

Pos logit contributions
 ending+0.059
 beginning+0.056
sing+0.053
 size+0.052
 crashing+0.052
Neg logit contributions
ons-0.073
aid-0.064
tail-0.060
quet-0.060
 exclaimed-0.054
Token Ben
Token ID3932
Feature activation-4.35e+00

Pos logit contributions
ons+0.032
aid+0.029
tail+0.028
quet+0.027
 exclaimed+0.025
Neg logit contributions
 ending-0.014
 beginning-0.013
sing-0.012
 size-0.012
 crashing-0.012
Token is
Token ID318
Feature activation-4.35e-01

Pos logit contributions
 ending+0.228
 beginning+0.212
 size+0.201
sing+0.198
 camps+0.193
Neg logit contributions
ons-0.344
aid-0.313
quet-0.294
tail-0.291
keeping-0.266
Token the
Token ID262
Feature activation+1.25e+00

Pos logit contributions
 ending+0.026
 beginning+0.025
sing+0.024
 size+0.023
 crashing+0.023
Neg logit contributions
ons-0.031
aid-0.028
tail-0.026
quet-0.026
 exclaimed-0.023
Token guest
Token ID8319
Feature activation-2.63e+00

Pos logit contributions
ons+0.085
aid+0.079
tail+0.077
quet+0.076
 stitches+0.069
Neg logit contributions
 ending-0.065
rated-0.063
ksh-0.062
sing-0.061
 beginning-0.060
Token.
Token ID13
Feature activation+7.72e-02

Pos logit contributions
 ending+0.155
 beginning+0.146
sing+0.138
 size+0.137
 crashing+0.135
Neg logit contributions
ons-0.192
aid-0.171
tail-0.159
quet-0.157
 exclaimed-0.144
Token
Token ID198
Feature activation+8.62e-01

Pos logit contributions
ons+0.006
aid+0.005
tail+0.005
quet+0.005
 exclaimed+0.004
Neg logit contributions
 ending-0.005
 beginning-0.004
sing-0.004
 size-0.004
 crashing-0.004
Token
Token ID198
Feature activation+8.48e-01

Pos logit contributions
ons+0.061
aid+0.054
quet+0.051
tail+0.051
 stitches+0.046
Neg logit contributions
 ending-0.051
 beginning-0.048
 crashing-0.047
 camps-0.046
 size-0.045
Token"
Token ID1
Feature activation+1.81e+00

Pos logit contributions
ons+0.052
aid+0.045
tail+0.042
quet+0.041
 exclaimed+0.037
Neg logit contributions
 ending-0.060
 beginning-0.057
sing-0.055
 size-0.055
 crashing-0.054
TokenWelcome
Token ID14618
Feature activation-3.37e+00

Pos logit contributions
ons+0.139
aid+0.131
quet+0.119
tail+0.116
 exclaimed+0.110
Neg logit contributions
sing-0.102
 ending-0.099
 beginning-0.091
 size-0.087
 crashing-0.084
Token was
Token ID373
Feature activation+2.01e-01

Pos logit contributions
ons+0.056
aid+0.046
tail+0.044
 exclaimed+0.042
quet+0.042
Neg logit contributions
 ending-0.051
 beginning-0.048
sing-0.048
 crashing-0.047
ksh-0.046
Token.
Token ID13
Feature activation+7.78e-02

Pos logit contributions
ons+0.014
aid+0.012
tail+0.011
quet+0.011
 exclaimed+0.011
Neg logit contributions
 ending-0.012
 beginning-0.011
ksh-0.011
sing-0.011
 crashing-0.011
Token
Token ID198
Feature activation+7.04e-01

Pos logit contributions
ons+0.006
aid+0.005
tail+0.005
quet+0.005
 stitches+0.004
Neg logit contributions
 ending-0.004
 beginning-0.004
sing-0.004
 size-0.004
 crashing-0.004
Token
Token ID198
Feature activation+7.06e-01

Pos logit contributions
ons+0.051
aid+0.047
tail+0.045
quet+0.043
 stitches+0.040
Neg logit contributions
 ending-0.039
 beginning-0.037
 crashing-0.036
 camps-0.035
sing-0.034
TokenWhen
Token ID2215
Feature activation+1.68e+00

Pos logit contributions
ons+0.044
aid+0.038
tail+0.036
quet+0.035
 stitches+0.031
Neg logit contributions
 ending-0.049
 beginning-0.047
sing-0.044
 size-0.044
 crashing-0.044
Token they
Token ID484
Feature activation-2.84e+00

Pos logit contributions
ons+0.117
aid+0.099
tail+0.095
quet+0.093
 exclaimed+0.086
Neg logit contributions
 ending-0.105
 beginning-0.099
sing-0.097
ksh-0.096
 size-0.094
Token opened
Token ID4721
Feature activation+2.93e+00

Pos logit contributions
 ending+0.167
 beginning+0.155
sing+0.154
 size+0.149
ksh+0.149
Neg logit contributions
ons-0.210
aid-0.183
tail-0.170
quet-0.168
 exclaimed-0.160
Token it
Token ID340
Feature activation+7.43e-01

Pos logit contributions
ons+0.181
aid+0.141
tail+0.140
quet+0.132
 stitches+0.125
Neg logit contributions
 ending-0.215
 beginning-0.198
sing-0.195
ksh-0.193
 crashing-0.188
Token,
Token ID11
Feature activation+3.57e-01

Pos logit contributions
ons+0.052
aid+0.048
tail+0.044
quet+0.043
keeping+0.040
Neg logit contributions
 ending-0.046
 beginning-0.043
 size-0.041
sing-0.041
 camps-0.040
Token they
Token ID484
Feature activation-2.86e+00

Pos logit contributions
ons+0.024
aid+0.020
tail+0.019
quet+0.018
 exclaimed+0.018
Neg logit contributions
 ending-0.023
 beginning-0.022
ksh-0.021
sing-0.021
 size-0.021
Token saw
Token ID2497
Feature activation-3.96e+00

Pos logit contributions
 ending+0.149
sing+0.134
 beginning+0.134
ksh+0.133
 size+0.127
Neg logit contributions
ons-0.233
aid-0.200
tail-0.188
 exclaimed-0.188
quet-0.184
Token could
Token ID714
Feature activation+3.33e+00

Pos logit contributions
 ending+0.339
 beginning+0.314
 size+0.295
sing+0.295
 crashing+0.288
Neg logit contributions
ons-0.454
aid-0.415
tail-0.390
quet-0.390
keeping-0.354
Token have
Token ID423
Feature activation+3.96e+00

Pos logit contributions
ons+0.219
aid+0.201
quet+0.180
tail+0.179
bes+0.162
Neg logit contributions
 ending-0.222
 beginning-0.207
sing-0.199
 size-0.195
 crashing-0.194
Token happened
Token ID3022
Feature activation-4.15e+00

Pos logit contributions
ons+0.235
aid+0.225
tail+0.213
quet+0.202
bes+0.183
Neg logit contributions
 ending-0.279
 beginning-0.262
 size-0.245
 crashing-0.243
 whale-0.241
Token to
Token ID284
Feature activation+8.66e-01

Pos logit contributions
 ending+0.292
ksh+0.274
sing+0.272
 beginning+0.272
 size+0.262
Neg logit contributions
ons-0.253
tail-0.201
aid-0.198
quet-0.190
 exclaimed-0.179
Token the
Token ID262
Feature activation+1.10e+00

Pos logit contributions
ons+0.064
aid+0.061
quet+0.055
tail+0.055
powers+0.052
Neg logit contributions
 ending-0.049
 beginning-0.046
 end-0.044
 whale-0.044
 crashing-0.043
Token bull
Token ID6473
Feature activation-7.66e+00

Pos logit contributions
ons+0.072
 tree+0.069
chair+0.067
tail+0.065
handed+0.061
Neg logit contributions
ksh-0.048
lin-0.047
 ending-0.045
rant-0.045
 mate-0.045
Token?".
Token ID43634
Feature activation-1.71e-01

Pos logit contributions
 ending+0.417
 beginning+0.402
sing+0.375
 size+0.367
 crashing+0.364
Neg logit contributions
ons-0.588
aid-0.547
quet-0.492
tail-0.482
bes-0.460
Token Sally
Token ID25737
Feature activation+8.14e-02

Pos logit contributions
 ending+0.009
 beginning+0.008
sing+0.008
 crashing+0.007
 whale+0.007
Neg logit contributions
ons-0.014
aid-0.013
quet-0.012
tail-0.012
bes-0.011
Token said
Token ID531
Feature activation+3.30e+00

Pos logit contributions
ons+0.006
aid+0.005
tail+0.005
quet+0.005
keeping+0.004
Neg logit contributions
 ending-0.005
 beginning-0.005
 size-0.005
sing-0.005
 crashing-0.005
Token,
Token ID11
Feature activation+3.99e-01

Pos logit contributions
ons+0.173
aid+0.143
tail+0.131
quet+0.130
 exclaimed+0.119
Neg logit contributions
 ending-0.258
 camps-0.245
ksh-0.244
 beginning-0.242
sing-0.240
Token "
Token ID366
Feature activation-3.19e+00

Pos logit contributions
ons+0.031
aid+0.027
quet+0.026
tail+0.026
 exclaimed+0.026
Neg logit contributions
 ending-0.021
 camps-0.019
 size-0.018
sing-0.018
ksh-0.018
Token
Token ID198
Feature activation+4.29e-01

Pos logit contributions
 ending+0.013
 beginning+0.013
sing+0.012
 size+0.012
rant+0.012
Neg logit contributions
ons-0.011
aid-0.009
tail-0.009
 exclaimed-0.009
quet-0.008
Token
Token ID198
Feature activation+4.10e-01

Pos logit contributions
ons+0.029
aid+0.026
tail+0.024
quet+0.024
 stitches+0.022
Neg logit contributions
 ending-0.027
 beginning-0.026
 crashing-0.025
 camps-0.024
 size-0.024
Tokenâ
Token ID22940
Feature activation-1.97e+00

Pos logit contributions
ons+0.025
aid+0.021
tail+0.020
quet+0.020
 exclaimed+0.018
Neg logit contributions
 ending-0.030
 beginning-0.028
sing-0.028
 size-0.027
ksh-0.026
Token
Token ID26391
Feature activation+4.45e-01

Pos logit contributions
 ending+0.109
 beginning+0.102
 size+0.097
sing+0.097
 camps+0.096
Neg logit contributions
ons-0.150
aid-0.135
tail-0.128
quet-0.125
 exclaimed-0.116
Token
Token ID129
Feature activation+3.44e+00

Pos logit contributions
ons+0.028
aid+0.024
tail+0.022
quet+0.022
 exclaimed+0.020
Neg logit contributions
 ending-0.031
 beginning-0.030
sing-0.028
 size-0.028
 crashing-0.028
Token
Token ID241
Feature activation-7.07e+00

Pos logit contributions
ons+0.274
tail+0.240
aid+0.239
quet+0.225
 exclaimed+0.222
Neg logit contributions
 ending-0.178
 beginning-0.157
 size-0.154
sing-0.149
 camps-0.146
TokenDad
Token ID46270
Feature activation-7.69e+00

Pos logit contributions
 ending+0.427
sing+0.426
ains+0.400
 beginning+0.396
 whale+0.396
Neg logit contributions
aid-0.445
ons-0.421
tail-0.398
 exclaimed-0.372
keeping-0.365
Token,
Token ID11
Feature activation+3.24e-02

Pos logit contributions
 ending+0.504
 beginning+0.484
 size+0.460
sing+0.452
 crashing+0.445
Neg logit contributions
ons-0.478
aid-0.439
quet-0.411
tail-0.404
keeping-0.370
Token let
Token ID1309
Feature activation-3.67e+00

Pos logit contributions
ons+0.002
aid+0.002
 stitches+0.002
 exclaimed+0.002
ird+0.002
Neg logit contributions
sing-0.002
king-0.002
ksh-0.002
 ending-0.002
 beginning-0.002
Token's
Token ID338
Feature activation-2.43e+00

Pos logit contributions
sing+0.209
 ending+0.207
 size+0.196
 beginning+0.191
king+0.183
Neg logit contributions
ons-0.281
aid-0.231
tail-0.226
 exclaimed-0.219
 stitches-0.214
Token fly
Token ID6129
Feature activation+2.21e+00

Pos logit contributions
 ending+0.088
 beginning+0.084
 size+0.075
sing+0.074
 crashing+0.074
Neg logit contributions
ons-0.231
aid-0.213
quet-0.199
tail-0.192
bes-0.188
Token and
Token ID290
Feature activation-2.93e-01

Pos logit contributions
ons+0.214
aid+0.187
tail+0.178
quet+0.175
 stitches+0.161
Neg logit contributions
 ending-0.184
 beginning-0.168
ksh-0.166
sing-0.165
 camps-0.160
Token the
Token ID262
Feature activation+6.61e-01

Pos logit contributions
 ending+0.019
 beginning+0.018
sing+0.017
 size+0.017
 crashing+0.016
Neg logit contributions
ons-0.020
aid-0.018
tail-0.016
quet-0.016
 exclaimed-0.015
Token flowers
Token ID12734
Feature activation+8.36e-01

Pos logit contributions
ons+0.043
aid+0.039
tail+0.036
 stitches+0.035
quet+0.034
Neg logit contributions
 ending-0.041
sing-0.039
 beginning-0.038
ksh-0.038
rant-0.037
Token.
Token ID13
Feature activation-3.54e-01

Pos logit contributions
ons+0.064
aid+0.057
tail+0.053
quet+0.053
 exclaimed+0.049
Neg logit contributions
 ending-0.046
 beginning-0.044
 crashing-0.041
 size-0.041
sing-0.041
Token He
Token ID679
Feature activation-3.94e-01

Pos logit contributions
 ending+0.021
 beginning+0.019
sing+0.018
 size+0.018
 crashing+0.018
Neg logit contributions
ons-0.026
aid-0.023
tail-0.022
quet-0.022
 exclaimed-0.020
Token waves
Token ID9813
Feature activation-6.65e+00

Pos logit contributions
 ending+0.024
 beginning+0.022
sing+0.022
ksh+0.021
 size+0.021
Neg logit contributions
ons-0.029
aid-0.025
tail-0.023
quet-0.022
 stitches-0.022
Token to
Token ID284
Feature activation+5.89e-01

Pos logit contributions
 ending+0.416
sing+0.384
 beginning+0.382
ksh+0.379
 camps+0.373
Neg logit contributions
ons-0.457
aid-0.417
tail-0.386
 exclaimed-0.351
quet-0.347
Token the
Token ID262
Feature activation+6.68e-01

Pos logit contributions
ons+0.036
aid+0.033
tail+0.031
 stitches+0.028
quet+0.027
Neg logit contributions
 ending-0.041
 beginning-0.038
ksh-0.037
sing-0.037
 size-0.036
Token birds
Token ID10087
Feature activation-2.20e+00

Pos logit contributions
ons+0.043
aid+0.039
tail+0.038
quet+0.036
 stitches+0.035
Neg logit contributions
 ending-0.041
sing-0.038
ksh-0.037
rant-0.037
 beginning-0.037
Token and
Token ID290
Feature activation-3.23e-01

Pos logit contributions
 ending+0.117
 beginning+0.112
sing+0.106
 crashing+0.105
 size+0.103
Neg logit contributions
ons-0.173
aid-0.155
quet-0.144
tail-0.144
keeping-0.132
Token the
Token ID262
Feature activation+6.60e-01

Pos logit contributions
 ending+0.019
 beginning+0.018
sing+0.017
 size+0.017
 crashing+0.017
Neg logit contributions
ons-0.024
aid-0.021
quet-0.020
tail-0.020
keeping-0.018
Token.
Token ID13
Feature activation-4.93e-01

Pos logit contributions
ons+0.025
aid+0.022
quet+0.021
tail+0.021
keeping+0.019
Neg logit contributions
 ending-0.018
 beginning-0.017
 size-0.016
sing-0.016
 crashing-0.015
Token He
Token ID679
Feature activation-3.71e-01

Pos logit contributions
 ending+0.022
 crashing+0.020
 beginning+0.020
ksh+0.020
sing+0.020
Neg logit contributions
ons-0.042
aid-0.037
quet-0.037
tail-0.035
keeping-0.033
Token waits
Token ID28364
Feature activation-2.10e+00

Pos logit contributions
 ending+0.022
 beginning+0.021
sing+0.020
 crashing+0.019
 size+0.019
Neg logit contributions
ons-0.027
aid-0.024
tail-0.023
quet-0.021
 exclaimed-0.021
Token for
Token ID329
Feature activation-3.77e+00

Pos logit contributions
 ending+0.110
ksh+0.102
 beginning+0.098
sing+0.094
 size+0.093
Neg logit contributions
ons-0.163
aid-0.143
quet-0.133
tail-0.133
 exclaimed-0.127
Token the
Token ID262
Feature activation+6.71e-01

Pos logit contributions
 ending+0.184
 beginning+0.169
 size+0.166
 end+0.163
 crashing+0.161
Neg logit contributions
ons-0.305
tail-0.276
aid-0.272
quet-0.271
bes-0.250
Token right
Token ID826
Feature activation-6.42e+00

Pos logit contributions
ons+0.048
aid+0.043
quet+0.040
tail+0.040
 stitches+0.037
Neg logit contributions
 ending-0.038
 beginning-0.036
ksh-0.036
sing-0.035
rant-0.034
Token moment
Token ID2589
Feature activation-5.04e+00

Pos logit contributions
ksh+0.297
 beginning+0.289
 ending+0.287
sing+0.274
 crashing+0.271
Neg logit contributions
ons-0.522
aid-0.481
tail-0.456
quet-0.436
keeping-0.432
Token and
Token ID290
Feature activation-2.92e-01

Pos logit contributions
 ending+0.288
 beginning+0.270
sing+0.256
 size+0.254
 crashing+0.249
Neg logit contributions
ons-0.378
aid-0.337
tail-0.316
quet-0.314
 stitches-0.286
Token then
Token ID788
Feature activation-6.87e-01

Pos logit contributions
 ending+0.016
 beginning+0.015
sing+0.015
 size+0.014
 crashing+0.014
Neg logit contributions
ons-0.022
aid-0.020
tail-0.019
quet-0.018
 exclaimed-0.017
Token d
Token ID288
Feature activation-5.64e+00

Pos logit contributions
 ending+0.039
 beginning+0.036
sing+0.034
 size+0.034
 crashing+0.034
Neg logit contributions
ons-0.052
aid-0.047
quet-0.044
tail-0.043
keeping-0.040
Tokenashes
Token ID7465
Feature activation-1.15e+00

Pos logit contributions
 ending+0.340
 beginning+0.308
 size+0.294
sing+0.292
 camps+0.280
Neg logit contributions
ons-0.425
aid-0.385
quet-0.354
tail-0.352
 exclaimed-0.321
Token Jack
Token ID3619
Feature activation-4.83e+00

Pos logit contributions
 ending+0.002
 beginning+0.002
sing+0.002
 size+0.002
 camps+0.002
Neg logit contributions
ons-0.002
aid-0.001
tail-0.001
quet-0.001
 exclaimed-0.001
Token put
Token ID1234
Feature activation+2.72e-01

Pos logit contributions
 ending+0.271
 beginning+0.257
 size+0.241
sing+0.239
 crashing+0.235
Neg logit contributions
ons-0.365
aid-0.328
quet-0.309
tail-0.307
keeping-0.280
Token his
Token ID465
Feature activation-4.01e-01

Pos logit contributions
ons+0.020
aid+0.018
tail+0.016
quet+0.016
 exclaimed+0.015
Neg logit contributions
 ending-0.016
 beginning-0.015
sing-0.015
 size-0.014
 crashing-0.014
Token arm
Token ID3211
Feature activation+2.58e+00

Pos logit contributions
 ending+0.022
 beginning+0.021
sing+0.020
 size+0.019
 crashing+0.019
Neg logit contributions
ons-0.031
aid-0.027
tail-0.026
quet-0.025
 exclaimed-0.023
Token around
Token ID1088
Feature activation-2.50e+00

Pos logit contributions
ons+0.223
quet+0.203
aid+0.198
keeping+0.194
grand+0.190
Neg logit contributions
 beginning-0.103
 ending-0.098
 crashing-0.089
 size-0.088
 end-0.087
Token Joe
Token ID5689
Feature activation-7.04e+00

Pos logit contributions
 ending+0.153
ksh+0.144
 beginning+0.143
 camps+0.141
sing+0.140
Neg logit contributions
ons-0.176
aid-0.144
tail-0.138
quet-0.133
keeping-0.129
Token and
Token ID290
Feature activation+1.40e-01

Pos logit contributions
 ending+0.403
 beginning+0.390
sing+0.365
 size+0.362
 crashing+0.357
Neg logit contributions
ons-0.519
aid-0.472
quet-0.440
tail-0.423
keeping-0.394
Token said
Token ID531
Feature activation+3.27e+00

Pos logit contributions
ons+0.010
aid+0.009
quet+0.008
tail+0.008
bes+0.007
Neg logit contributions
 ending-0.009
 beginning-0.008
 size-0.008
sing-0.008
 crashing-0.007
Token "
Token ID366
Feature activation-3.24e+00

Pos logit contributions
ons+0.221
aid+0.192
tail+0.181
quet+0.178
 stitches+0.160
Neg logit contributions
 ending-0.213
 beginning-0.201
sing-0.191
 size-0.189
 crashing-0.188
TokenIt
Token ID1026
Feature activation-1.60e+00

Pos logit contributions
 ending+0.196
sing+0.185
 camps+0.176
 beginning+0.174
 size+0.168
Neg logit contributions
ons-0.236
aid-0.215
quet-0.195
tail-0.195
 exclaimed-0.182
Token's
Token ID338
Feature activation-1.97e+00

Pos logit contributions
 ending+0.091
 beginning+0.085
 size+0.081
sing+0.079
 crashing+0.078
Neg logit contributions
ons-0.119
aid-0.108
tail-0.101
quet-0.100
keeping-0.093