TOP ACTIVATIONS
MAX = 73.976

checked for a fire extingu isher , but there wasn 't
She grabbed the fire extingu isher and put out the fire
. They thanked the Pol isher for all its kind work
're a very good pol isher ." Lily smiled and felt
her dad watched from the sidelines , his heart swelling with
oven and taken the las agna . But the
. They asked the Pol isher to make them look nice
his bear watched from the sidelines . Tim my was happy
not find his famous las agna . The chef
wished they had eaten w affles . She wished they had
the store was selling w affles for very cheap . She
they watched the pretend w affles fly high up into the
When the w affles were ready , Lily 's
on , the two w affles were never apart . Where
. She ate the w affles with the syrup and smiled
store and got the y umm iest ice cream they could
fed the baby duck w affles too . They all played
impatient . She wanted w affles for breakfast . She went
. It was even y umm ier this time ! The
our shorts and T - shirts under our warm clothes .

MIN ACTIVATIONS
MIN = -47.176

looked at it in awe . But then she realized it
? â asked Jill . â
and ran in the waves . As they were
y ummy fruits and veggies . One day , Lucky went
the handle was completely soaked . He pond ered what could
it . It was paint . It spl ashed on the
mean to make a mess . And I know you tried
little bird ie named Tweet . Tweet was very happy and
Max had listened to her . She wished Max was still
could help with the job . His dad said , "
does not like to dance . It likes to bite .
the bird became very sick . Lily tried to help it
that she loved to wear . One day , she went
wished they had a car . But they knew mom was
a branch and it broke . He was so surprised that
something soft and squ ishy . It was a pile of
them from the mean boy . They give Spot a treat
steps for him to walk . Tim my was
and said they were sorry . They felt bad for ruining
's skin is not boring . It is beautiful , like

INTERVAL 43.688 - 73.976
CONTAINS 1.537%

. A mer maid that lived in the deep
child visited him in the garage and asked him why he
play . Billy saw the razor and wanted to play with
play hide - and - seek and tag with her friends
pilot . The pilot waved back . Soon the

INTERVAL 13.400 - 43.688
CONTAINS 27.880%

<|endoftext|> Once there was a ch ubby little girl called Lucy
on the grass and just closed their eyes .
<|endoftext|> Ben and Lily were playing in the snow . They
A mber thought it was amazing . " Let 's play
into piles . Then , the mom cried out

INTERVAL -16.888 - 13.400
CONTAINS 46.142%

parents decided to spend more time with her and go on
et . She was curious about how it worked . The
The end . <|endoftext|> Once upon a time , there lived
he sped desperately to explore everything . Just
plant . I like mint too ." L ily

INTERVAL -47.176 - -16.888
CONTAINS 24.441%

you should act . Go to your room and stay there
until he reached his den . The bird gave
in their room until morning . S ara and
have to go home now ," her daddy explained .
soon ! <|endoftext|> Once upon a time , there was a
Token checked
Token ID10667
Feature activation+2.93e+01

Pos logit contributions
 Majesty+0.376
Ta+0.375
agna+0.358
 sidelines+0.357
 pulp+0.357
Neg logit contributions
.-0.425
,-0.399
 and-0.391
-0.342
 with-0.331
Token for
Token ID329
Feature activation-6.80e+00

Pos logit contributions
isher+1.554
heit+1.508
bek+1.503
wy+1.477
 sidelines+1.457
Neg logit contributions
.-0.591
,-0.522
 and-0.497
-0.422
 with-0.338
Token a
Token ID257
Feature activation-1.23e+01

Pos logit contributions
.+0.143
,+0.124
 and+0.122
+0.079
 with+0.073
Neg logit contributions
 Majesty-0.423
Ta-0.421
 sidelines-0.419
agna-0.415
umm-0.413
Token fire
Token ID2046
Feature activation+2.87e+01

Pos logit contributions
.+0.514
,+0.481
 and+0.468
+0.389
 to+0.388
Neg logit contributions
Ta-0.540
 Majesty-0.537
umm-0.528
 sidelines-0.518
agna-0.514
Token extingu
Token ID24995
Feature activation+3.54e+01

Pos logit contributions
umm+1.603
isher+1.559
agna+1.556
 Majesty+1.531
ses+1.494
Neg logit contributions
.-0.569
,-0.533
 and-0.507
-0.457
 the-0.369
Tokenisher
Token ID4828
Feature activation+7.40e+01

Pos logit contributions
 Majesty+0.303
Ta+0.302
 sidelines+0.230
 former+0.229
umm+0.218
Neg logit contributions
.-2.782
,-2.689
 and-2.647
 to-2.429
 with-2.386
Token,
Token ID11
Feature activation-2.80e+01

Pos logit contributions
isher+4.733
 Majesty+4.729
 supervision+4.671
cher+4.641
agna+4.631
Neg logit contributions
-0.125
.-0.063
,-0.056
 and+0.037
 the+0.198
Token but
Token ID475
Feature activation-9.01e+00

Pos logit contributions
.+0.822
,+0.745
 and+0.654
+0.545
 to+0.514
Neg logit contributions
 sidelines-1.457
 Majesty-1.441
Ta-1.430
agna-1.409
 former-1.406
Token there
Token ID612
Feature activation+6.38e-01

Pos logit contributions
.+0.232
,+0.201
 and+0.193
+0.141
 to+0.132
Neg logit contributions
Ta-0.519
 Majesty-0.517
 sidelines-0.510
agna-0.509
umm-0.507
Token wasn
Token ID2492
Feature activation+1.12e+01

Pos logit contributions
Ta+0.035
umm+0.035
agna+0.034
 Majesty+0.034
 pulp+0.034
Neg logit contributions
.-0.016
,-0.013
 and-0.013
-0.010
 to-0.009
Token't
Token ID470
Feature activation+1.19e+01

Pos logit contributions
Ta+0.446
 sidelines+0.434
agna+0.420
 Majesty+0.419
umm+0.416
Neg logit contributions
.-0.470
,-0.447
 and-0.434
 with-0.362
-0.350
Token She
Token ID1375
Feature activation+2.35e+00

Pos logit contributions
.+1.388
,+1.279
 and+1.245
 with+0.971
 to+0.956
Neg logit contributions
 Majesty-1.936
umm-1.915
Ta-1.882
 former-1.850
 majesty-1.828
Token grabbed
Token ID13176
Feature activation+2.35e+01

Pos logit contributions
Ta+0.111
 Majesty+0.110
umm+0.108
 sidelines+0.107
agna+0.106
Neg logit contributions
.-0.090
,-0.083
 and-0.081
-0.066
 to-0.066
Token the
Token ID262
Feature activation-1.65e+01

Pos logit contributions
Ta+1.512
 Majesty+1.467
 sidelines+1.466
umm+1.455
isher+1.444
Neg logit contributions
.-0.440
,-0.369
 and-0.342
-0.212
 to-0.192
Token fire
Token ID2046
Feature activation+2.79e+01

Pos logit contributions
.+0.775
,+0.727
 and+0.712
 to+0.601
+0.593
Neg logit contributions
Ta-0.665
 Majesty-0.662
umm-0.645
agna-0.627
 former-0.624
Token extingu
Token ID24995
Feature activation+3.46e+01

Pos logit contributions
umm+1.769
isher+1.720
 Majesty+1.718
agna+1.714
 sidelines+1.696
Neg logit contributions
.-0.455
,-0.366
 and-0.317
-0.270
 to-0.177
Tokenisher
Token ID4828
Feature activation+7.32e+01

Pos logit contributions
 Majesty+0.358
Ta+0.343
 sidelines+0.276
 former+0.271
 majesty+0.270
Neg logit contributions
.-2.664
,-2.574
 and-2.538
 to-2.314
 with-2.289
Token and
Token ID290
Feature activation-2.20e+01

Pos logit contributions
 sidelines+4.638
 Majesty+4.603
isher+4.582
inge+4.544
umm+4.519
Neg logit contributions
.-0.509
-0.270
,-0.253
 and-0.045
 the+0.193
Token put
Token ID1234
Feature activation+1.44e+01

Pos logit contributions
.+0.910
,+0.848
 and+0.823
+0.690
 to+0.680
Neg logit contributions
Ta-0.964
 Majesty-0.958
umm-0.937
 sidelines-0.925
agna-0.920
Token out
Token ID503
Feature activation+4.37e+00

Pos logit contributions
Ta+0.804
isher+0.796
 sidelines+0.784
 Majesty+0.777
umm+0.773
Neg logit contributions
.-0.336
,-0.311
 and-0.289
-0.212
 to-0.201
Token the
Token ID262
Feature activation-1.63e+01

Pos logit contributions
Ta+0.256
 Majesty+0.253
 sidelines+0.249
 former+0.248
isher+0.248
Neg logit contributions
.-0.089
,-0.082
 and-0.077
-0.057
 with-0.045
Token fire
Token ID2046
Feature activation+2.80e+01

Pos logit contributions
.+0.776
,+0.724
 and+0.712
 to+0.604
+0.597
Neg logit contributions
Ta-0.644
 Majesty-0.635
umm-0.622
agna-0.603
 sidelines-0.600
Token.
Token ID13
Feature activation-4.07e+01

Pos logit contributions
 Majesty+0.552
umm+0.550
 majesty+0.524
 former+0.521
osity+0.516
Neg logit contributions
.-0.045
,-0.041
-0.033
 and-0.016
 the+0.018
Token They
Token ID1119
Feature activation+1.57e+00

Pos logit contributions
.+1.496
,+1.388
 and+1.341
 to+1.081
 with+1.060
Neg logit contributions
Ta-1.979
 Majesty-1.972
umm-1.958
 sidelines-1.908
 former-1.901
Token thanked
Token ID26280
Feature activation+1.68e+01

Pos logit contributions
Ta+0.073
 Majesty+0.072
umm+0.071
 sidelines+0.070
agna+0.070
Neg logit contributions
.-0.060
,-0.056
 and-0.054
-0.045
 to-0.044
Token the
Token ID262
Feature activation-1.81e+01

Pos logit contributions
 former+0.993
 Majesty+0.988
 sidelines+0.979
umm+0.964
Ta+0.962
Neg logit contributions
.-0.308
,-0.264
 and-0.238
 to-0.148
-0.146
Token Pol
Token ID2165
Feature activation+3.29e+01

Pos logit contributions
.+0.868
,+0.815
 and+0.796
+0.676
 to+0.674
Neg logit contributions
Ta-0.706
 Majesty-0.695
umm-0.679
agna-0.663
 sidelines-0.663
Tokenisher
Token ID4828
Feature activation+7.16e+01

Pos logit contributions
Ta+0.544
 Majesty+0.539
umm+0.508
 sidelines+0.498
 former+0.477
Neg logit contributions
.-2.284
,-2.189
 and-2.157
 to-1.941
-1.938
Token for
Token ID329
Feature activation-9.02e+00

Pos logit contributions
 Majesty+4.659
isher+4.438
 majesty+4.436
 former+4.419
 sidelines+4.381
Neg logit contributions
.-0.462
,-0.260
-0.184
 and-0.082
 with+0.120
Token all
Token ID477
Feature activation+3.59e-01

Pos logit contributions
.+0.315
,+0.291
 and+0.282
+0.224
 to+0.222
Neg logit contributions
Ta-0.455
 Majesty-0.454
umm-0.446
 sidelines-0.441
agna-0.437
Token its
Token ID663
Feature activation+6.18e+00

Pos logit contributions
 Majesty+0.020
 majesty+0.019
 pulp+0.019
isher+0.019
 sidelines+0.019
Neg logit contributions
.-0.007
,-0.006
 and-0.006
-0.005
 with-0.004
Token kind
Token ID1611
Feature activation+1.60e+01

Pos logit contributions
 Majesty+0.247
Ta+0.244
umm+0.239
 former+0.238
 sidelines+0.234
Neg logit contributions
.-0.271
,-0.255
 and-0.250
-0.214
 with-0.208
Token work
Token ID670
Feature activation+2.37e+01

Pos logit contributions
 Majesty+0.677
umm+0.672
 majesty+0.645
 former+0.643
Ta+0.633
Neg logit contributions
.-0.592
,-0.558
 and-0.536
-0.490
 with-0.454
Token're
Token ID821
Feature activation+1.53e+01

Pos logit contributions
 Majesty+0.671
Ta+0.670
umm+0.662
 sidelines+0.653
agna+0.643
Neg logit contributions
.-0.553
,-0.516
 and-0.501
-0.408
 to-0.405
Token a
Token ID257
Feature activation-1.60e+01

Pos logit contributions
Ta+0.864
 Majesty+0.863
umm+0.846
agna+0.843
 sidelines+0.835
Neg logit contributions
.-0.423
,-0.376
 and-0.367
-0.267
 to-0.265
Token very
Token ID845
Feature activation-1.06e+01

Pos logit contributions
.+0.594
,+0.553
 and+0.536
+0.429
 to+0.427
Neg logit contributions
Ta-0.783
 Majesty-0.775
umm-0.764
 sidelines-0.755
 former-0.746
Token good
Token ID922
Feature activation+1.17e+01

Pos logit contributions
.+0.475
,+0.448
 and+0.432
 to+0.366
+0.363
Neg logit contributions
Ta-0.441
 Majesty-0.430
umm-0.428
 sidelines-0.421
 former-0.417
Token pol
Token ID755
Feature activation+2.68e+01

Pos logit contributions
 Majesty+0.298
umm+0.294
Ta+0.285
agna+0.275
 majesty+0.269
Neg logit contributions
.-0.684
,-0.653
 and-0.637
-0.582
 to-0.568
Tokenisher
Token ID4828
Feature activation+7.04e+01

Pos logit contributions
Ta+0.544
 Majesty+0.532
 sidelines+0.508
umm+0.507
 former+0.483
Neg logit contributions
.-1.779
,-1.700
 and-1.678
 to-1.498
 with-1.492
Token."
Token ID526
Feature activation-1.94e+01

Pos logit contributions
 Majesty+4.199
 majesty+4.053
umm+4.052
isher+4.000
agna+3.947
Neg logit contributions
.-0.950
,-0.846
-0.784
 and-0.745
 the-0.479
Token Lily
Token ID20037
Feature activation+7.48e+00

Pos logit contributions
.+0.530
,+0.510
 and+0.463
 to+0.381
 with+0.344
Neg logit contributions
 Majesty-1.016
Ta-1.007
 former-0.996
umm-0.983
 sidelines-0.980
Token smiled
Token ID13541
Feature activation+1.60e+01

Pos logit contributions
Ta+0.312
 Majesty+0.308
 majesty+0.292
umm+0.287
agna+0.287
Neg logit contributions
.-0.310
,-0.280
 and-0.267
 to-0.236
-0.233
Token and
Token ID290
Feature activation-2.46e+01

Pos logit contributions
 Majesty+0.942
umm+0.929
 former+0.929
 majesty+0.863
Ta+0.860
Neg logit contributions
.-0.149
,-0.087
-0.034
 and-0.007
 to-0.001
Token felt
Token ID2936
Feature activation+7.41e+00

Pos logit contributions
.+0.956
,+0.872
 and+0.847
 to+0.702
+0.695
Neg logit contributions
Ta-1.125
 Majesty-1.109
umm-1.061
 majesty-1.059
 sidelines-1.058
Token her
Token ID607
Feature activation-1.14e+00

Pos logit contributions
.+0.686
,+0.588
 and+0.569
+0.405
 to+0.391
Neg logit contributions
 Majesty-1.677
Ta-1.664
umm-1.643
 former-1.632
 sidelines-1.629
Token dad
Token ID9955
Feature activation+2.35e+01

Pos logit contributions
.+0.050
,+0.047
 and+0.046
+0.039
 to+0.038
Neg logit contributions
 Majesty-0.047
Ta-0.046
umm-0.045
 sidelines-0.045
agna-0.044
Token watched
Token ID7342
Feature activation+2.02e+01

Pos logit contributions
 Majesty+1.100
Ta+1.093
 sidelines+1.067
agna+1.060
umm+1.041
Neg logit contributions
.-0.793
,-0.692
 and-0.664
-0.573
 to-0.569
Token from
Token ID422
Feature activation-7.40e+00

Pos logit contributions
 sidelines+1.073
ocol+1.024
umm+1.015
ubb+1.003
 former+0.998
Neg logit contributions
.-0.433
,-0.380
-0.332
 and-0.329
 to-0.248
Token the
Token ID262
Feature activation-1.56e+01

Pos logit contributions
.+0.186
,+0.167
 and+0.158
+0.113
 to+0.110
Neg logit contributions
Ta-0.444
 Majesty-0.442
umm-0.435
 sidelines-0.434
 former-0.429
Token sidelines
Token ID29394
Feature activation+6.97e+01

Pos logit contributions
.+0.749
,+0.702
 and+0.690
 to+0.589
+0.579
Neg logit contributions
 Majesty-0.602
Ta-0.595
umm-0.595
agna-0.584
isher-0.578
Token,
Token ID11
Feature activation-2.79e+01

Pos logit contributions
 sidelines+4.633
 Majesty+4.590
 former+4.480
umm+4.441
icious+4.346
Neg logit contributions
.-0.444
,-0.266
 and-0.160
-0.140
 to+0.076
Token his
Token ID465
Feature activation+3.37e-04

Pos logit contributions
.+1.096
,+1.016
 and+0.966
+0.809
 to+0.804
Neg logit contributions
 Majesty-1.258
Ta-1.255
 sidelines-1.234
umm-1.212
 former-1.206
Token heart
Token ID2612
Feature activation+3.51e+01

Pos logit contributions
Ta+0.000
 Majesty+0.000
umm+0.000
 sidelines+0.000
agna+0.000
Neg logit contributions
.-0.000
,-0.000
 and-0.000
-0.000
 to-0.000
Token swelling
Token ID29844
Feature activation+5.61e+01

Pos logit contributions
 Majesty+1.543
Ta+1.518
 sidelines+1.502
 former+1.500
umm+1.476
Neg logit contributions
.-1.212
,-1.118
 and-1.069
-0.963
 to-0.884
Token with
Token ID351
Feature activation-6.71e+00

Pos logit contributions
 Majesty+3.416
 former+3.348
 sidelines+3.322
umm+3.318
mic+3.298
Neg logit contributions
.-0.699
,-0.603
 and-0.472
-0.472
 to-0.196
Token oven
Token ID14361
Feature activation+4.56e+01

Pos logit contributions
.+1.032
,+0.986
 and+0.968
+0.862
 to+0.849
Neg logit contributions
 Majesty-0.455
Ta-0.453
umm-0.447
agna-0.429
 sidelines-0.427
Token and
Token ID290
Feature activation-2.30e+01

Pos logit contributions
umm+3.021
agna+2.900
 former+2.872
 sidelines+2.857
isher+2.835
Neg logit contributions
.-0.421
,-0.290
-0.164
 and-0.154
 the+0.041
Token taken
Token ID2077
Feature activation+2.18e+01

Pos logit contributions
.+0.891
,+0.828
 and+0.805
+0.659
 to+0.654
Neg logit contributions
Ta-1.070
 Majesty-1.067
umm-1.048
 sidelines-1.031
agna-1.025
Token the
Token ID262
Feature activation-1.74e+01

Pos logit contributions
Ta+1.076
isher+1.071
umm+1.061
 Majesty+1.059
agna+1.032
Neg logit contributions
.-0.617
,-0.555
 and-0.541
-0.417
 with-0.374
Token las
Token ID39990
Feature activation+3.63e+01

Pos logit contributions
.+0.900
,+0.856
 and+0.837
+0.732
 to+0.721
Neg logit contributions
Ta-0.575
 Majesty-0.570
umm-0.565
 sidelines-0.548
agna-0.547
Tokenagna
Token ID48669
Feature activation+6.91e+01

Pos logit contributions
Ta+0.288
 sidelines+0.205
 Majesty+0.198
 former+0.177
umm+0.141
Neg logit contributions
.-2.865
,-2.754
 and-2.723
 to-2.479
 with-2.474
Token.
Token ID13
Feature activation-3.98e+01

Pos logit contributions
umm+4.113
agna+4.080
isher+4.077
 pulp+4.019
 Majesty+4.001
Neg logit contributions
.-0.566
,-0.565
-0.486
 and-0.391
 with-0.051
Token
Token ID198
Feature activation-3.10e+01

Pos logit contributions
.+1.194
,+1.101
 and+1.047
 to+0.789
 with+0.765
Neg logit contributions
Ta-2.191
 Majesty-2.169
umm-2.158
 sidelines-2.112
agna-2.107
Token
Token ID198
Feature activation-3.10e+01

Pos logit contributions
.+0.532
,+0.468
 and+0.405
 to+0.217
 with+0.182
Neg logit contributions
Ta-2.100
 sidelines-2.007
 Majesty-2.001
icious-1.967
agna-1.961
TokenBut
Token ID1537
Feature activation+2.14e+00

Pos logit contributions
.+1.142
,+1.052
 and+1.046
+0.890
 with+0.842
Neg logit contributions
 Majesty-1.532
umm-1.500
Ta-1.475
 majesty-1.449
 former-1.448
Token the
Token ID262
Feature activation-1.71e+01

Pos logit contributions
 Majesty+0.121
agna+0.119
 former+0.119
umm+0.118
 sidelines+0.118
Neg logit contributions
.-0.055
,-0.046
 and-0.045
-0.035
 to-0.033
Token.
Token ID13
Feature activation-4.38e+01

Pos logit contributions
.+0.009
,+0.007
 and+0.006
+0.003
 to+0.001
Neg logit contributions
 former-0.070
 Majesty-0.070
umm-0.070
agna-0.070
atmeal-0.070
Token They
Token ID1119
Feature activation-1.50e+00

Pos logit contributions
.+1.199
,+1.096
 and+1.047
 to+0.776
 with+0.743
Neg logit contributions
Ta-2.478
 Majesty-2.440
umm-2.439
 former-2.390
 sidelines-2.375
Token asked
Token ID1965
Feature activation-4.71e-01

Pos logit contributions
.+0.052
,+0.047
 and+0.046
+0.036
 to+0.036
Neg logit contributions
Ta-0.077
 Majesty-0.077
umm-0.076
 sidelines-0.075
agna-0.074
Token the
Token ID262
Feature activation-2.11e+01

Pos logit contributions
.+0.011
,+0.009
 and+0.009
+0.007
 to+0.006
Neg logit contributions
 Majesty-0.027
 former-0.026
Ta-0.026
agna-0.026
 majesty-0.026
Token Pol
Token ID2165
Feature activation+2.99e+01

Pos logit contributions
.+0.949
,+0.883
 and+0.863
+0.726
 to+0.721
Neg logit contributions
Ta-0.887
 Majesty-0.873
umm-0.853
 sidelines-0.837
agna-0.834
Tokenisher
Token ID4828
Feature activation+6.87e+01

Pos logit contributions
Ta+0.661
 Majesty+0.653
umm+0.630
 sidelines+0.616
agna+0.596
Neg logit contributions
.-1.910
,-1.825
 and-1.795
 to-1.600
-1.598
Token to
Token ID284
Feature activation-2.17e+01

Pos logit contributions
 Majesty+4.574
 majesty+4.403
isher+4.402
agna+4.341
 former+4.263
Neg logit contributions
.-0.544
,-0.272
 and-0.244
-0.141
 with+0.054
Token make
Token ID787
Feature activation+5.65e+00

Pos logit contributions
.+0.944
,+0.887
 and+0.865
+0.728
 to+0.721
Neg logit contributions
Ta-0.893
 Majesty-0.890
umm-0.868
 sidelines-0.861
 former-0.853
Token them
Token ID606
Feature activation+6.50e+00

Pos logit contributions
 Majesty+0.319
 former+0.305
umm+0.299
agna+0.298
Ta+0.298
Neg logit contributions
.-0.139
,-0.133
 and-0.128
-0.095
 to-0.092
Token look
Token ID804
Feature activation+1.24e+01

Pos logit contributions
 Majesty+0.299
 former+0.298
Ta+0.289
agna+0.286
 pulp+0.283
Neg logit contributions
.-0.211
,-0.198
 and-0.190
-0.169
 to-0.151
Token nice
Token ID3621
Feature activation+1.08e+01

Pos logit contributions
 former+0.508
 Majesty+0.503
agna+0.486
icious+0.473
umm+0.472
Neg logit contributions
.-0.421
,-0.400
 and-0.389
-0.367
 to-0.317
Token his
Token ID465
Feature activation-1.59e+00

Pos logit contributions
.+0.691
,+0.628
 and+0.604
+0.461
 to+0.455
Neg logit contributions
Ta-1.249
 Majesty-1.245
umm-1.226
 sidelines-1.210
agna-1.203
Token bear
Token ID6842
Feature activation+2.75e+01

Pos logit contributions
.+0.060
,+0.056
 and+0.054
 to+0.043
 with+0.043
Neg logit contributions
Ta-0.080
umm-0.078
 Majesty-0.076
icious-0.076
agna-0.076
Token watched
Token ID7342
Feature activation+1.88e+01

Pos logit contributions
 Majesty+1.439
 majesty+1.358
 former+1.343
 sidelines+1.328
agna+1.324
Neg logit contributions
.-0.742
,-0.652
 and-0.645
-0.548
 to-0.508
Token from
Token ID422
Feature activation-8.81e+00

Pos logit contributions
 sidelines+0.851
umm+0.811
ocol+0.796
isher+0.777
icious+0.772
Neg logit contributions
.-0.530
,-0.506
 and-0.463
-0.453
 to-0.371
Token the
Token ID262
Feature activation-1.70e+01

Pos logit contributions
.+0.239
,+0.215
 and+0.206
+0.150
 to+0.148
Neg logit contributions
Ta-0.516
 Majesty-0.515
umm-0.507
 sidelines-0.501
agna-0.498
Token sidelines
Token ID29394
Feature activation+6.83e+01

Pos logit contributions
.+0.820
,+0.761
 and+0.754
 to+0.644
 with+0.624
Neg logit contributions
 Majesty-0.666
Ta-0.658
umm-0.650
agna-0.634
wy-0.621
Token.
Token ID13
Feature activation-3.94e+01

Pos logit contributions
 sidelines+4.568
 Majesty+4.526
 former+4.438
umm+4.315
mic+4.296
Neg logit contributions
,-0.186
.-0.179
 and-0.052
+0.002
 to+0.242
Token Tim
Token ID5045
Feature activation+1.52e+01

Pos logit contributions
.+1.459
,+1.357
 and+1.301
 to+1.045
 with+1.030
Neg logit contributions
 Majesty-1.886
Ta-1.882
umm-1.878
 former-1.828
 sidelines-1.825
Tokenmy
Token ID1820
Feature activation+1.54e+01

Pos logit contributions
Ta+0.672
 Majesty+0.670
umm+0.648
 sidelines+0.638
 former+0.635
Neg logit contributions
.-0.614
,-0.567
 and-0.546
-0.462
 to-0.460
Token was
Token ID373
Feature activation-9.83e+00

Pos logit contributions
 Majesty+0.790
Ta+0.781
 majesty+0.745
 former+0.740
 sidelines+0.737
Neg logit contributions
.-0.494
,-0.429
 and-0.400
-0.335
 to-0.334
Token happy
Token ID3772
Feature activation+1.09e+01

Pos logit contributions
.+0.346
,+0.314
 and+0.302
+0.245
 to+0.242
Neg logit contributions
 Majesty-0.484
Ta-0.475
 sidelines-0.463
 former-0.460
umm-0.453
Token not
Token ID407
Feature activation+2.00e+00

Pos logit contributions
.+0.119
,+0.111
 and+0.105
+0.085
 to+0.084
Neg logit contributions
 Majesty-0.141
 sidelines-0.139
Ta-0.137
 former-0.136
 majesty-0.134
Token find
Token ID1064
Feature activation+1.32e+01

Pos logit contributions
agna+0.066
 Majesty+0.066
 former+0.065
atmeal+0.065
umm+0.065
Neg logit contributions
.-0.084
,-0.084
 and-0.080
-0.070
 with-0.069
Token his
Token ID465
Feature activation-3.18e+00

Pos logit contributions
 Majesty+0.667
isher+0.661
agna+0.646
umm+0.644
 pulp+0.639
Neg logit contributions
.-0.364
,-0.346
 and-0.345
-0.272
 with-0.249
Token famous
Token ID5863
Feature activation+2.12e+01

Pos logit contributions
.+0.159
,+0.154
 and+0.151
+0.135
 with+0.130
Neg logit contributions
 Majesty-0.103
umm-0.100
agna-0.097
 former-0.095
 majesty-0.094
Token las
Token ID39990
Feature activation+3.50e+01

Pos logit contributions
 Majesty+0.586
umm+0.582
agna+0.552
 majesty+0.526
Ta+0.526
Neg logit contributions
.-1.168
,-1.121
 and-1.104
-1.017
 with-0.974
Tokenagna
Token ID48669
Feature activation+6.78e+01

Pos logit contributions
Ta+0.276
 sidelines+0.207
 Majesty+0.196
 former+0.168
umm+0.144
Neg logit contributions
.-2.768
,-2.656
 and-2.621
 to-2.395
 with-2.382
Token.
Token ID13
Feature activation-4.10e+01

Pos logit contributions
agna+4.553
umm+4.453
 Majesty+4.422
wy+4.395
atmeal+4.390
Neg logit contributions
,-0.113
-0.066
.-0.026
 and-0.024
 with+0.307
Token
Token ID198
Feature activation-3.23e+01

Pos logit contributions
.+1.468
,+1.368
 and+1.317
 to+1.042
 with+1.027
Neg logit contributions
Ta-2.031
 Majesty-2.024
umm-2.016
 former-1.953
agna-1.953
Token
Token ID198
Feature activation-3.22e+01

Pos logit contributions
.+0.528
,+0.472
 and+0.415
 to+0.210
 with+0.173
Neg logit contributions
Ta-2.172
 Majesty-2.084
 sidelines-2.064
icious-2.041
 former-2.039
TokenThe
Token ID464
Feature activation+1.13e+00

Pos logit contributions
.+1.338
,+1.241
 and+1.231
+1.079
 with+1.025
Neg logit contributions
 Majesty-1.442
umm-1.422
Ta-1.386
 majesty-1.362
agna-1.358
Token chef
Token ID21221
Feature activation+4.80e+01

Pos logit contributions
Ta+0.034
 Majesty+0.033
umm+0.032
 sidelines+0.031
 former+0.031
Neg logit contributions
.-0.064
,-0.061
 and-0.060
 to-0.053
-0.053
Token wished
Token ID16555
Feature activation+2.23e+01

Pos logit contributions
Ta+0.119
 Majesty+0.117
 former+0.111
umm+0.111
agna+0.111
Neg logit contributions
.-0.162
,-0.154
 and-0.148
 to-0.129
-0.129
Token they
Token ID484
Feature activation-1.54e+00

Pos logit contributions
 sidelines+1.033
 Majesty+1.032
Ta+1.008
umm+1.002
 majesty+0.996
Neg logit contributions
.-0.691
,-0.628
 and-0.604
-0.475
 with-0.469
Token had
Token ID550
Feature activation+2.11e+00

Pos logit contributions
.+0.054
,+0.051
 and+0.048
+0.038
 to+0.038
Neg logit contributions
Ta-0.074
agna-0.072
 Majesty-0.071
umm-0.070
 sidelines-0.070
Token eaten
Token ID17065
Feature activation+3.37e+01

Pos logit contributions
Ta+0.091
 Majesty+0.090
agna+0.087
 former+0.086
umm+0.086
Neg logit contributions
.-0.086
,-0.081
 and-0.079
-0.065
 with-0.063
Token w
Token ID266
Feature activation+1.31e+01

Pos logit contributions
umm+1.728
Ta+1.698
 pulp+1.693
isher+1.682
wy+1.677
Neg logit contributions
.-0.835
,-0.784
 and-0.722
-0.575
 to-0.510
Tokenaffles
Token ID48501
Feature activation+6.75e+01

Pos logit contributions
Ta+0.114
 Majesty+0.108
umm+0.093
 former+0.083
 sidelines+0.083
Neg logit contributions
.-1.020
,-0.983
 and-0.972
 to-0.878
-0.876
Token.
Token ID13
Feature activation-3.80e+01

Pos logit contributions
agna+4.366
wy+4.327
atmeal+4.325
icious+4.259
affles+4.158
Neg logit contributions
.-0.182
,-0.132
-0.053
 and-0.008
 to+0.245
Token She
Token ID1375
Feature activation+3.48e+00

Pos logit contributions
.+1.104
,+1.024
 and+0.969
 to+0.730
 with+0.727
Neg logit contributions
Ta-2.073
 Majesty-2.035
 former-1.981
umm-1.977
 majesty-1.951
Token wished
Token ID16555
Feature activation+2.25e+01

Pos logit contributions
Ta+0.113
 Majesty+0.110
 former+0.105
agna+0.104
umm+0.103
Neg logit contributions
.-0.181
,-0.173
 and-0.166
 to-0.147
-0.146
Token they
Token ID484
Feature activation-1.39e+00

Pos logit contributions
 Majesty+1.031
 sidelines+1.027
umm+0.999
 majesty+0.994
agna+0.989
Neg logit contributions
.-0.700
,-0.636
 and-0.617
-0.484
 with-0.477
Token had
Token ID550
Feature activation+2.26e+00

Pos logit contributions
.+0.048
,+0.045
 and+0.043
+0.034
 to+0.034
Neg logit contributions
Ta-0.068
agna-0.066
 Majesty-0.066
umm-0.065
 sidelines-0.064
Token the
Token ID262
Feature activation-1.56e+01

Pos logit contributions
.+0.319
,+0.288
 and+0.281
+0.214
 to+0.210
Neg logit contributions
Ta-0.571
 Majesty-0.566
umm-0.558
agna-0.552
 former-0.548
Token store
Token ID3650
Feature activation+2.93e+01

Pos logit contributions
.+0.757
,+0.712
 and+0.695
+0.595
 to+0.591
Neg logit contributions
Ta-0.602
 Majesty-0.593
umm-0.577
 pulp-0.565
 former-0.560
Token was
Token ID373
Feature activation-8.54e+00

Pos logit contributions
umm+1.420
 Majesty+1.406
Ta+1.404
agna+1.395
 former+1.365
Neg logit contributions
.-0.948
,-0.881
 and-0.850
-0.692
 to-0.663
Token selling
Token ID6301
Feature activation+3.23e+01

Pos logit contributions
.+0.300
,+0.281
 and+0.272
+0.228
 with+0.211
Neg logit contributions
 Majesty-0.398
 former-0.395
Ta-0.393
agna-0.391
 sidelines-0.389
Token w
Token ID266
Feature activation+1.31e+01

Pos logit contributions
umm+1.494
 Majesty+1.490
agna+1.486
 former+1.482
isher+1.478
Neg logit contributions
.-0.959
,-0.918
 and-0.871
-0.766
 to-0.687
Tokenaffles
Token ID48501
Feature activation+6.75e+01

Pos logit contributions
Ta+0.114
 Majesty+0.108
umm+0.090
 sidelines+0.083
 former+0.082
Neg logit contributions
.-1.017
,-0.979
 and-0.966
 to-0.875
-0.875
Token for
Token ID329
Feature activation-6.56e+00

Pos logit contributions
atmeal+4.505
agna+4.491
wy+4.393
umm+4.370
affles+4.340
Neg logit contributions
,-0.142
.-0.108
 and-0.080
-0.017
 with+0.314
Token very
Token ID845
Feature activation-6.69e+00

Pos logit contributions
.+0.171
,+0.158
 and+0.155
+0.113
 to+0.109
Neg logit contributions
 Majesty-0.365
Ta-0.360
 sidelines-0.360
 former-0.358
agna-0.356
Token cheap
Token ID7026
Feature activation+3.14e+01

Pos logit contributions
.+0.234
,+0.218
 and+0.214
+0.170
 to+0.165
Neg logit contributions
 Majesty-0.333
Ta-0.328
umm-0.326
 former-0.322
 sidelines-0.322
Token.
Token ID13
Feature activation-3.79e+01

Pos logit contributions
affles+1.968
umm+1.962
 Majesty+1.923
isher+1.922
agna+1.909
Neg logit contributions
.-0.341
,-0.313
 and-0.286
-0.214
 with-0.111
Token She
Token ID1375
Feature activation+3.52e+00

Pos logit contributions
.+1.205
,+1.094
 and+1.050
+0.797
 with+0.796
Neg logit contributions
 Majesty-2.004
Ta-1.996
umm-1.987
 sidelines-1.942
 former-1.931
Token they
Token ID484
Feature activation-1.71e+00

Pos logit contributions
.+0.194
,+0.179
 and+0.176
+0.137
 with+0.123
Neg logit contributions
 Majesty-0.394
agna-0.384
 majesty-0.382
 pulp-0.377
umm-0.377
Token watched
Token ID7342
Feature activation+2.01e+01

Pos logit contributions
.+0.073
,+0.068
 and+0.067
+0.056
 to+0.055
Neg logit contributions
Ta-0.074
 Majesty-0.074
umm-0.073
 sidelines-0.071
agna-0.070
Token the
Token ID262
Feature activation-1.57e+01

Pos logit contributions
 Majesty+1.060
 sidelines+1.055
agna+1.036
affles+1.022
umm+1.022
Neg logit contributions
.-0.464
,-0.438
 and-0.418
-0.338
 to-0.298
Token pretend
Token ID16614
Feature activation+2.28e+01

Pos logit contributions
.+0.589
,+0.542
 and+0.528
+0.428
 to+0.423
Neg logit contributions
Ta-0.764
 Majesty-0.760
umm-0.742
agna-0.735
 former-0.730
Token w
Token ID266
Feature activation+1.30e+01

Pos logit contributions
 Majesty+0.771
umm+0.763
Ta+0.751
 sidelines+0.739
agna+0.730
Neg logit contributions
.-1.130
,-1.081
 and-1.055
-0.921
 with-0.901
Tokenaffles
Token ID48501
Feature activation+6.74e+01

Pos logit contributions
Ta+0.117
 Majesty+0.105
umm+0.092
 former+0.088
 sidelines+0.084
Neg logit contributions
.-1.005
,-0.967
 and-0.955
-0.867
 to-0.865
Token fly
Token ID6129
Feature activation+3.35e+01

Pos logit contributions
atmeal+4.217
agna+4.180
 sidelines+4.148
 Majesty+4.112
icious+4.102
Neg logit contributions
.-0.750
,-0.704
 and-0.612
-0.486
 to-0.280
Token high
Token ID1029
Feature activation+1.77e+01

Pos logit contributions
 Majesty+2.065
wy+2.047
umm+2.039
 sidelines+2.020
isher+2.017
Neg logit contributions
.-0.315
,-0.302
 and-0.251
-0.208
 with-0.044
Token up
Token ID510
Feature activation+1.29e+01

Pos logit contributions
umm+1.164
 former+1.115
 Majesty+1.112
isher+1.095
agna+1.089
Neg logit contributions
-0.127
.-0.126
,-0.125
 and-0.085
 the-0.004
Token into
Token ID656
Feature activation+3.35e+00

Pos logit contributions
 former+0.674
umm+0.673
 pulp+0.659
isher+0.642
iful+0.635
Neg logit contributions
,-0.179
.-0.170
-0.147
 and-0.145
 with-0.073
Token the
Token ID262
Feature activation-1.55e+01

Pos logit contributions
Ta+0.231
 Majesty+0.231
umm+0.228
 sidelines+0.227
 former+0.225
Neg logit contributions
.-0.052
,-0.044
 and-0.040
 to-0.018
-0.018
Token
Token ID198
Feature activation-2.95e+01

Pos logit contributions
.+0.844
,+0.749
 and+0.690
 with+0.455
 to+0.443
Neg logit contributions
Ta-2.241
umm-2.228
 Majesty-2.198
 sidelines-2.156
 majesty-2.127
Token
Token ID198
Feature activation-2.95e+01

Pos logit contributions
.+0.527
,+0.491
 and+0.412
 to+0.252
 with+0.241
Neg logit contributions
Ta-1.760
 sidelines-1.613
reath-1.612
affles-1.610
icious-1.595
TokenWhen
Token ID2215
Feature activation+9.72e+00

Pos logit contributions
.+1.360
,+1.281
 and+1.242
 to+1.054
+1.050
Neg logit contributions
Ta-1.181
 Majesty-1.161
umm-1.141
 sidelines-1.126
agna-1.112
Token the
Token ID262
Feature activation-1.57e+01

Pos logit contributions
umm+0.489
Ta+0.486
 Majesty+0.485
 pulp+0.485
 sidelines+0.484
Neg logit contributions
.-0.283
 and-0.255
,-0.241
-0.186
 with-0.181
Token w
Token ID266
Feature activation+1.29e+01

Pos logit contributions
.+0.700
,+0.657
 and+0.642
+0.541
 to+0.538
Neg logit contributions
Ta-0.644
 Majesty-0.639
umm-0.631
 sidelines-0.618
agna-0.611
Tokenaffles
Token ID48501
Feature activation+6.73e+01

Pos logit contributions
Ta+0.153
 Majesty+0.125
 former+0.118
bek+0.117
 pulp+0.115
Neg logit contributions
.-0.969
,-0.938
 and-0.927
 to-0.831
-0.828
Token were
Token ID547
Feature activation+2.85e+00

Pos logit contributions
atmeal+3.429
umm+3.411
Ta+3.411
 Majesty+3.368
agna+3.349
Neg logit contributions
.-2.011
,-1.705
 and-1.693
-1.383
 to-1.338
Token ready
Token ID3492
Feature activation+1.61e+01

Pos logit contributions
Ta+0.107
 Majesty+0.107
 sidelines+0.106
umm+0.105
agna+0.104
Neg logit contributions
.-0.128
,-0.119
 and-0.117
-0.102
 with-0.097
Token,
Token ID11
Feature activation-2.80e+01

Pos logit contributions
 Majesty+0.927
umm+0.925
 sidelines+0.913
 majesty+0.905
 pulp+0.898
Neg logit contributions
.-0.154
-0.071
 and-0.067
 with-0.018
,-0.003
Token Lily
Token ID20037
Feature activation+1.10e+01

Pos logit contributions
.+0.871
,+0.778
 and+0.769
+0.587
 to+0.576
Neg logit contributions
Ta-1.448
 Majesty-1.441
umm-1.424
 sidelines-1.422
agna-1.410
Token's
Token ID338
Feature activation-8.45e+00

Pos logit contributions
 Majesty+0.542
Ta+0.535
agna+0.524
 majesty+0.521
atmeal+0.519
Neg logit contributions
.-0.348
,-0.309
 and-0.280
-0.247
 with-0.233
Token on
Token ID319
Feature activation-2.11e+00

Pos logit contributions
Ta+0.289
umm+0.283
 sidelines+0.277
 Majesty+0.275
agna+0.274
Neg logit contributions
.-0.109
,-0.088
 and-0.087
 to-0.059
-0.057
Token,
Token ID11
Feature activation-2.82e+01

Pos logit contributions
.+0.020
 and+0.010
,+0.004
+0.002
 to+0.000
Neg logit contributions
 Majesty-0.143
 majesty-0.138
heit-0.137
agna-0.137
 sidelines-0.137
Token the
Token ID262
Feature activation-1.58e+01

Pos logit contributions
.+0.729
,+0.632
 and+0.611
+0.430
 to+0.430
Neg logit contributions
 Majesty-1.621
umm-1.616
Ta-1.602
 sidelines-1.597
agna-1.585
Token two
Token ID734
Feature activation+1.12e+01

Pos logit contributions
.+0.596
,+0.555
 and+0.538
+0.437
 to+0.427
Neg logit contributions
Ta-0.773
 Majesty-0.770
umm-0.752
agna-0.740
 sidelines-0.738
Token w
Token ID266
Feature activation+1.28e+01

Pos logit contributions
Ta+0.494
umm+0.493
 Majesty+0.492
 sidelines+0.481
agna+0.476
Neg logit contributions
.-0.448
,-0.412
 and-0.402
-0.340
 to-0.336
Tokenaffles
Token ID48501
Feature activation+6.73e+01

Pos logit contributions
Ta+0.154
 Majesty+0.131
 pulp+0.122
bek+0.117
agna+0.116
Neg logit contributions
.-0.957
,-0.925
 and-0.915
-0.818
 to-0.815
Token were
Token ID547
Feature activation+2.78e+00

Pos logit contributions
Ta+3.334
umm+3.333
 Majesty+3.327
agna+3.270
isher+3.259
Neg logit contributions
.-2.117
,-1.876
 and-1.792
-1.521
 to-1.409
Token never
Token ID1239
Feature activation+1.36e+01

Pos logit contributions
 Majesty+0.143
Ta+0.137
umm+0.136
agna+0.136
 majesty+0.136
Neg logit contributions
.-0.090
,-0.080
 and-0.078
-0.063
 to-0.059
Token apart
Token ID5475
Feature activation+2.57e+01

Pos logit contributions
 Majesty+0.512
Ta+0.506
umm+0.498
 sidelines+0.493
 former+0.492
Neg logit contributions
.-0.618
,-0.582
 and-0.573
-0.499
 with-0.479
Token.
Token ID13
Feature activation-3.82e+01

Pos logit contributions
umm+1.354
 former+1.322
ubb+1.310
 Majesty+1.302
isher+1.248
Neg logit contributions
.-0.531
,-0.478
-0.466
 and-0.458
 with-0.294
Token Where
Token ID6350
Feature activation+2.13e+01

Pos logit contributions
.+1.253
,+1.165
 and+1.115
 to+0.865
 with+0.861
Neg logit contributions
Ta-1.988
umm-1.977
 Majesty-1.966
 former-1.911
agna-1.903
Token.
Token ID13
Feature activation-3.83e+01

Pos logit contributions
umm+0.679
 Majesty+0.661
atmeal+0.640
 majesty+0.636
 sidelines+0.630
Neg logit contributions
,-0.079
.-0.053
-0.043
 and-0.033
 the+0.011
Token She
Token ID1375
Feature activation+3.06e+00

Pos logit contributions
.+1.091
,+0.992
 and+0.933
 with+0.695
 to+0.683
Neg logit contributions
 Majesty-2.088
umm-2.037
Ta-2.029
 majesty-2.013
 sidelines-1.985
Token ate
Token ID15063
Feature activation+2.97e+01

Pos logit contributions
 Majesty+0.127
Ta+0.126
agna+0.121
 majesty+0.120
 former+0.120
Neg logit contributions
.-0.130
,-0.119
 and-0.113
-0.098
 to-0.097
Token the
Token ID262
Feature activation-1.58e+01

Pos logit contributions
wy+1.519
 pulp+1.507
agna+1.506
affles+1.497
umm+1.487
Neg logit contributions
.-0.716
,-0.602
 and-0.543
 to-0.428
-0.421
Token w
Token ID266
Feature activation+1.28e+01

Pos logit contributions
.+0.638
,+0.596
 and+0.577
+0.479
 to+0.478
Neg logit contributions
Ta-0.698
 Majesty-0.689
umm-0.686
 sidelines-0.672
agna-0.664
Tokenaffles
Token ID48501
Feature activation+6.73e+01

Pos logit contributions
Ta+0.126
 Majesty+0.114
umm+0.096
 former+0.096
 sidelines+0.089
Neg logit contributions
.-0.979
,-0.944
 and-0.938
-0.843
 to-0.841
Token with
Token ID351
Feature activation-6.99e+00

Pos logit contributions
atmeal+4.231
umm+4.164
agna+4.072
wy+4.059
 sidelines+4.006
Neg logit contributions
.-0.829
,-0.570
-0.425
 and-0.404
 to-0.179
Token the
Token ID262
Feature activation-1.57e+01

Pos logit contributions
.+0.203
,+0.185
 and+0.177
 to+0.134
+0.132
Neg logit contributions
Ta-0.389
 Majesty-0.381
umm-0.378
 sidelines-0.374
 pulp-0.369
Token syrup
Token ID24208
Feature activation+5.34e+01

Pos logit contributions
.+0.716
,+0.673
 and+0.657
+0.556
 to+0.554
Neg logit contributions
Ta-0.631
 Majesty-0.629
umm-0.617
 sidelines-0.605
agna-0.600
Token and
Token ID290
Feature activation-2.13e+01

Pos logit contributions
atmeal+3.471
umm+3.460
wy+3.362
agna+3.304
 pulp+3.298
Neg logit contributions
.-0.343
,-0.196
-0.154
 and-0.049
 to+0.076
Token smiled
Token ID13541
Feature activation+1.94e+01

Pos logit contributions
.+0.859
,+0.796
 and+0.768
+0.647
 to+0.638
Neg logit contributions
 Majesty-0.943
Ta-0.941
umm-0.916
 sidelines-0.905
agna-0.902
Token store
Token ID3650
Feature activation+2.94e+01

Pos logit contributions
.+0.705
,+0.658
 and+0.643
 to+0.568
+0.544
Neg logit contributions
Ta-0.634
umm-0.621
 Majesty-0.619
isher-0.607
wy-0.602
Token and
Token ID290
Feature activation-2.10e+01

Pos logit contributions
 sidelines+1.854
 Majesty+1.830
 former+1.801
agna+1.767
bek+1.758
Neg logit contributions
.-0.257
,-0.140
-0.059
 and-0.046
 the+0.053
Token got
Token ID1392
Feature activation+8.31e+00

Pos logit contributions
.+0.888
,+0.829
 and+0.807
+0.674
 to+0.668
Neg logit contributions
Ta-0.910
 Majesty-0.905
umm-0.887
 sidelines-0.875
agna-0.869
Token the
Token ID262
Feature activation-1.54e+01

Pos logit contributions
Ta+0.486
 Majesty+0.482
umm+0.474
 sidelines+0.474
agna+0.472
Neg logit contributions
.-0.219
,-0.198
 and-0.189
-0.136
 to-0.127
Token y
Token ID331
Feature activation+1.94e+01

Pos logit contributions
.+0.706
,+0.659
 and+0.648
 to+0.557
+0.540
Neg logit contributions
Ta-0.632
 Majesty-0.630
umm-0.611
 sidelines-0.605
 former-0.604
Tokenumm
Token ID13929
Feature activation+6.72e+01

Pos logit contributions
umm+0.608
 Majesty+0.576
Ta+0.575
 sidelines+0.562
 former+0.548
Neg logit contributions
.-1.049
,-0.994
 and-0.981
-0.876
 to-0.859
Tokeniest
Token ID6386
Feature activation+4.23e+01

Pos logit contributions
umm+1.292
isher+1.151
atmeal+1.059
wy+1.010
Ta+0.950
Neg logit contributions
.-4.118
,-3.985
 and-3.951
-3.816
 to-3.584
Token ice
Token ID4771
Feature activation+3.58e+01

Pos logit contributions
umm+1.634
 Majesty+1.602
 former+1.561
 sidelines+1.540
Ta+1.522
Neg logit contributions
.-1.767
,-1.666
 and-1.624
-1.480
 to-1.408
Token cream
Token ID8566
Feature activation+5.02e+01

Pos logit contributions
umm+0.882
atmeal+0.872
isher+0.830
agna+0.789
wy+0.775
Neg logit contributions
.-1.906
,-1.878
 and-1.818
-1.730
 the-1.591
Token they
Token ID484
Feature activation-1.30e+00

Pos logit contributions
umm+3.262
agna+3.148
atmeal+3.115
isher+3.110
 pulp+3.077
Neg logit contributions
.-0.543
,-0.498
 and-0.429
-0.344
 with-0.113
Token could
Token ID714
Feature activation+2.40e-01

Pos logit contributions
.+0.046
,+0.043
 and+0.041
+0.033
 to+0.033
Neg logit contributions
Ta-0.063
 Majesty-0.063
umm-0.062
agna-0.061
 sidelines-0.061
Token fed
Token ID11672
Feature activation+3.29e+01

Pos logit contributions
 Majesty+1.276
 majesty+1.249
 pulp+1.227
Ta+1.224
agna+1.184
Neg logit contributions
.-0.888
,-0.765
 and-0.713
-0.629
 to-0.622
Token the
Token ID262
Feature activation-1.60e+01

Pos logit contributions
 pulp+1.774
 sidelines+1.727
Ta+1.726
wy+1.702
ubb+1.693
Neg logit contributions
.-0.602
,-0.487
 and-0.441
 to-0.298
-0.279
Token baby
Token ID5156
Feature activation+3.22e+01

Pos logit contributions
.+0.794
,+0.751
 and+0.736
+0.632
 to+0.628
Neg logit contributions
Ta-0.586
 Majesty-0.582
umm-0.568
 sidelines-0.560
agna-0.554
Token duck
Token ID22045
Feature activation+3.58e+01

Pos logit contributions
 pulp+1.252
 Majesty+1.230
wy+1.194
umm+1.194
 majesty+1.170
Neg logit contributions
.-1.115
,-1.066
 and-0.984
-0.906
 to-0.836
Token w
Token ID266
Feature activation+1.27e+01

Pos logit contributions
 Majesty+2.127
 pulp+2.101
 former+2.070
 majesty+2.064
wy+2.057
Neg logit contributions
.-0.458
,-0.393
 and-0.320
-0.280
 to-0.128
Tokenaffles
Token ID48501
Feature activation+6.72e+01

Pos logit contributions
 Majesty+0.265
Ta+0.263
umm+0.247
agna+0.240
 sidelines+0.238
Neg logit contributions
.-0.836
,-0.799
 and-0.788
-0.698
 to-0.696
Token too
Token ID1165
Feature activation-7.58e+00

Pos logit contributions
atmeal+4.072
wy+4.045
agna+4.005
 pulp+3.932
beans+3.901
Neg logit contributions
.-0.254
-0.236
,-0.208
 and-0.066
 to+0.208
Token.
Token ID13
Feature activation-3.83e+01

Pos logit contributions
+0.004
.+0.003
 and+0.003
,+0.001
 with-0.012
Neg logit contributions
 pulp-0.456
 majesty-0.454
 Majesty-0.452
icious-0.448
wy-0.442
Token They
Token ID1119
Feature activation+3.89e+00

Pos logit contributions
.+1.192
,+1.079
 and+1.035
 to+0.775
 with+0.773
Neg logit contributions
 Majesty-2.039
Ta-2.022
umm-2.021
 sidelines-1.971
 majesty-1.958
Token all
Token ID477
Feature activation+2.49e+00

Pos logit contributions
Ta+0.155
 Majesty+0.152
 pulp+0.150
agna+0.147
 sidelines+0.145
Neg logit contributions
.-0.166
,-0.154
 and-0.150
-0.130
 to-0.128
Token played
Token ID2826
Feature activation+2.35e+01

Pos logit contributions
Ta+0.089
 Majesty+0.085
 pulp+0.084
 majesty+0.081
wy+0.081
Neg logit contributions
.-0.116
,-0.111
 and-0.108
-0.094
 to-0.094
Token impatient
Token ID33440
Feature activation+2.48e+01

Pos logit contributions
.+0.282
,+0.255
 and+0.243
+0.189
 to+0.188
Neg logit contributions
Ta-0.478
 Majesty-0.477
umm-0.465
 sidelines-0.463
 former-0.462
Token.
Token ID13
Feature activation-3.85e+01

Pos logit contributions
umm+1.681
 sidelines+1.560
 Majesty+1.540
 former+1.533
isher+1.524
Neg logit contributions
.-0.094
-0.064
,-0.059
 and-0.018
 the+0.044
Token She
Token ID1375
Feature activation+2.98e+00

Pos logit contributions
.+1.225
,+1.130
 and+1.084
 to+0.842
 with+0.835
Neg logit contributions
umm-2.020
 Majesty-2.009
Ta-1.984
 former-1.946
 sidelines-1.938
Token wanted
Token ID2227
Feature activation+7.24e+00

Pos logit contributions
Ta+0.140
 Majesty+0.140
umm+0.137
 sidelines+0.135
agna+0.134
Neg logit contributions
.-0.112
,-0.104
 and-0.100
-0.082
 to-0.081
Token w
Token ID266
Feature activation+1.27e+01

Pos logit contributions
Ta+0.495
 Majesty+0.491
 sidelines+0.488
umm+0.486
 former+0.485
Neg logit contributions
.-0.101
,-0.081
 and-0.071
-0.031
 with-0.021
Tokenaffles
Token ID48501
Feature activation+6.72e+01

Pos logit contributions
Ta+0.153
 Majesty+0.144
umm+0.125
 pulp+0.122
 former+0.119
Neg logit contributions
.-0.953
,-0.922
 and-0.911
 to-0.818
 with-0.816
Token for
Token ID329
Feature activation-6.95e+00

Pos logit contributions
atmeal+3.758
agna+3.680
icious+3.678
affles+3.642
reath+3.579
Neg logit contributions
.-0.935
 and-0.765
,-0.711
-0.686
 with-0.304
Token breakfast
Token ID12607
Feature activation+3.88e+01

Pos logit contributions
.+0.260
,+0.244
 and+0.239
+0.204
 to+0.194
Neg logit contributions
 Majesty-0.301
affles-0.294
agna-0.293
 sidelines-0.293
 former-0.290
Token.
Token ID13
Feature activation-3.83e+01

Pos logit contributions
umm+2.505
agna+2.487
icious+2.475
atmeal+2.470
affles+2.392
Neg logit contributions
.-0.238
 and-0.169
,-0.156
-0.095
 to+0.018
Token She
Token ID1375
Feature activation+3.13e+00

Pos logit contributions
.+1.300
,+1.207
 and+1.160
 to+0.925
 with+0.922
Neg logit contributions
umm-1.880
 Majesty-1.831
Ta-1.797
 former-1.764
 sidelines-1.754
Token went
Token ID1816
Feature activation+7.48e+00

Pos logit contributions
Ta+0.152
 Majesty+0.152
umm+0.149
 sidelines+0.147
agna+0.146
Neg logit contributions
.-0.114
,-0.106
 and-0.103
-0.083
 to-0.082
Token.
Token ID13
Feature activation-3.80e+01

Pos logit contributions
 former+0.177
 Majesty+0.171
umm+0.168
 sidelines+0.168
 ounces+0.167
Neg logit contributions
,-0.005
.-0.002
 and-0.000
+0.000
 to+0.012
Token It
Token ID632
Feature activation+1.57e+01

Pos logit contributions
.+1.193
,+1.089
 and+1.041
 to+0.776
 with+0.770
Neg logit contributions
Ta-1.958
 Majesty-1.956
umm-1.929
 sidelines-1.911
agna-1.871
Token was
Token ID373
Feature activation-8.47e+00

Pos logit contributions
Ta+0.824
 Majesty+0.819
umm+0.808
 sidelines+0.795
agna+0.792
Neg logit contributions
.-0.504
,-0.462
 and-0.442
-0.347
 to-0.343
Token even
Token ID772
Feature activation+9.20e+00

Pos logit contributions
.+0.234
,+0.211
 and+0.200
+0.152
 to+0.144
Neg logit contributions
 Majesty-0.467
Ta-0.466
umm-0.463
 former-0.459
 sidelines-0.458
Token y
Token ID331
Feature activation+1.94e+01

Pos logit contributions
Ta+0.462
 Majesty+0.461
umm+0.454
 sidelines+0.446
agna+0.444
Neg logit contributions
.-0.327
,-0.301
 and-0.292
 to-0.232
-0.232
Tokenumm
Token ID13929
Feature activation+6.71e+01

Pos logit contributions
umm+0.601
Ta+0.527
icious+0.514
 Majesty+0.512
 sidelines+0.512
Neg logit contributions
.-1.060
,-1.010
 and-1.003
-0.902
 to-0.882
Tokenier
Token ID959
Feature activation+3.72e+01

Pos logit contributions
umm+1.587
atmeal+1.384
isher+1.369
wy+1.353
ocol+1.223
Neg logit contributions
.-3.775
 and-3.692
,-3.685
-3.596
 the-3.351
Token this
Token ID428
Feature activation+3.60e+00

Pos logit contributions
 Majesty+1.937
umm+1.889
 majesty+1.856
 former+1.806
agna+1.766
Neg logit contributions
.-0.764
-0.752
,-0.733
 and-0.676
 the-0.517
Token time
Token ID640
Feature activation+3.59e+00

Pos logit contributions
umm+0.150
 Majesty+0.148
 former+0.145
Ta+0.144
agna+0.142
Neg logit contributions
.-0.140
,-0.133
 and-0.126
-0.112
 with-0.106
Token!
Token ID0
Feature activation-2.13e+01

Pos logit contributions
 Majesty+0.235
umm+0.230
 majesty+0.226
agna+0.226
 ounces+0.225
Neg logit contributions
.-0.025
,-0.022
 and-0.020
-0.014
 with+0.002
Token The
Token ID383
Feature activation-3.25e+00

Pos logit contributions
.+0.587
,+0.531
 and+0.509
 to+0.368
 with+0.357
Neg logit contributions
Ta-1.214
 Majesty-1.195
umm-1.189
 sidelines-1.171
 former-1.150
Token our
Token ID674
Feature activation+1.40e+01

Pos logit contributions
Ta+1.379
 Majesty+1.360
isher+1.326
agna+1.324
 sidelines+1.318
Neg logit contributions
.-0.809
,-0.745
 and-0.708
-0.571
 to-0.532
Token shorts
Token ID22078
Feature activation+4.74e+01

Pos logit contributions
Ta+0.446
 Majesty+0.445
umm+0.432
 sidelines+0.423
agna+0.419
Neg logit contributions
.-0.747
,-0.709
 and-0.695
-0.607
 to-0.603
Token and
Token ID290
Feature activation-2.13e+01

Pos logit contributions
isher+3.031
agna+2.953
belt+2.930
wy+2.926
 Majesty+2.889
Neg logit contributions
.-0.313
,-0.264
-0.216
 and-0.116
 the-0.020
Token T
Token ID309
Feature activation+1.99e+01

Pos logit contributions
.+1.159
,+1.101
 and+1.078
+0.945
 to+0.939
Neg logit contributions
Ta-0.654
 Majesty-0.651
umm-0.627
 sidelines-0.617
agna-0.611
Token-
Token ID12
Feature activation+1.59e+01

Pos logit contributions
 Majesty+0.872
umm+0.839
 majesty+0.831
Ta+0.822
isher+0.817
Neg logit contributions
.-0.773
,-0.741
 and-0.715
-0.620
 with-0.605
Tokenshirts
Token ID23231
Feature activation+6.71e+01

Pos logit contributions
Ta+0.243
 Majesty+0.222
umm+0.206
 sidelines+0.200
agna+0.195
Neg logit contributions
.-1.109
,-1.075
 and-1.054
-0.951
 to-0.941
Token under
Token ID739
Feature activation+8.11e+00

Pos logit contributions
isher+4.102
agna+3.996
 Majesty+3.951
wy+3.932
 majesty+3.871
Neg logit contributions
,-0.332
.-0.229
-0.214
 and-0.085
 the+0.069
Token our
Token ID674
Feature activation+1.41e+01

Pos logit contributions
 Majesty+0.442
 sidelines+0.440
Ta+0.433
ocol+0.424
 majesty+0.424
Neg logit contributions
.-0.159
,-0.150
 and-0.138
-0.100
 to-0.096
Token warm
Token ID5814
Feature activation+1.89e+01

Pos logit contributions
 Majesty+0.450
Ta+0.442
umm+0.434
 sidelines+0.431
agna+0.422
Neg logit contributions
.-0.742
,-0.706
 and-0.693
-0.611
 to-0.598
Token clothes
Token ID8242
Feature activation+3.78e+01

Pos logit contributions
 Majesty+0.530
umm+0.506
 majesty+0.502
isher+0.490
 sidelines+0.478
Neg logit contributions
.-0.990
,-0.940
 and-0.924
-0.853
 with-0.829
Token.
Token ID13
Feature activation-3.80e+01

Pos logit contributions
 Majesty+2.315
agna+2.276
isher+2.272
 majesty+2.262
 former+2.252
Neg logit contributions
,-0.281
.-0.250
-0.203
 and-0.185
 the-0.061
Token looked
Token ID3114
Feature activation+5.46e+00

Pos logit contributions
 former+0.301
umm+0.300
 Majesty+0.296
 sidelines+0.294
mic+0.292
Neg logit contributions
.-0.063
,-0.038
-0.032
 and-0.026
 to-0.007
Token at
Token ID379
Feature activation-1.66e+01

Pos logit contributions
 Majesty+0.278
 sidelines+0.270
 pulp+0.270
 former+0.270
Ta+0.262
Neg logit contributions
.-0.144
,-0.133
 and-0.124
-0.112
 to-0.091
Token it
Token ID340
Feature activation-4.42e+00

Pos logit contributions
.+0.354
,+0.323
 and+0.296
+0.196
 with+0.184
Neg logit contributions
 Majesty-1.013
Ta-1.005
 sidelines-0.996
 former-0.984
umm-0.973
Token in
Token ID287
Feature activation-1.68e+01

Pos logit contributions
.+0.034
,+0.028
 and+0.020
+0.017
 to-0.001
Neg logit contributions
 Majesty-0.295
 former-0.293
 sidelines-0.285
icious-0.282
umm-0.281
Token awe
Token ID25030
Feature activation+2.81e+01

Pos logit contributions
.+0.415
,+0.379
 and+0.354
 to+0.260
+0.253
Neg logit contributions
Ta-0.948
 Majesty-0.943
 sidelines-0.938
 former-0.927
umm-0.924
Token.
Token ID13
Feature activation-4.67e+01

Pos logit contributions
umm+1.901
 Majesty+1.893
ocol+1.892
isher+1.881
ubb+1.839
Neg logit contributions
,-0.087
.-0.083
-0.041
 and-0.039
 with+0.077
Token But
Token ID887
Feature activation-6.99e+00

Pos logit contributions
.+1.562
,+1.441
 and+1.390
 to+1.086
 with+1.064
Neg logit contributions
 Majesty-2.392
Ta-2.365
umm-2.314
 sidelines-2.289
 former-2.276
Token then
Token ID788
Feature activation-3.30e+00

Pos logit contributions
.+0.194
,+0.168
 and+0.165
+0.127
 to+0.121
Neg logit contributions
 Majesty-0.391
Ta-0.380
umm-0.378
 former-0.376
 sidelines-0.375
Token she
Token ID673
Feature activation-6.49e+00

Pos logit contributions
.+0.068
 and+0.057
,+0.054
+0.040
 to+0.037
Neg logit contributions
 Majesty-0.203
Ta-0.192
 majesty-0.192
agna-0.191
 former-0.189
Token realized
Token ID6939
Feature activation+8.95e+00

Pos logit contributions
.+0.277
,+0.258
 and+0.250
+0.215
 to+0.212
Neg logit contributions
 Majesty-0.267
Ta-0.264
 former-0.253
umm-0.253
agna-0.252
Token it
Token ID340
Feature activation-3.25e+00

Pos logit contributions
 Majesty+0.497
 former+0.480
 sidelines+0.469
 pulp+0.467
Ta+0.467
Neg logit contributions
.-0.193
,-0.172
 and-0.166
-0.132
 with-0.111
Token?
Token ID30
Feature activation-2.03e+01

Pos logit contributions
 Majesty+0.708
Ta+0.698
umm+0.689
 sidelines+0.687
 former+0.675
Neg logit contributions
.-0.468
,-0.418
 and-0.408
-0.338
 to-0.329
Tokenâ
Token ID22940
Feature activation+1.76e+01

Pos logit contributions
.+0.855
,+0.797
 and+0.782
 to+0.654
 with+0.633
Neg logit contributions
 Majesty-0.856
Ta-0.851
umm-0.817
 sidelines-0.809
agna-0.807
Token
Token ID26391
Feature activation+2.34e+01

Pos logit contributions
Ta+0.364
 Majesty+0.359
umm+0.346
 sidelines+0.332
agna+0.325
Neg logit contributions
.-1.141
,-1.094
 and-1.076
 to-0.964
-0.963
Token asked
Token ID1965
Feature activation-3.92e+00

Pos logit contributions
 Majesty+1.386
 majesty+1.327
Ta+1.302
agna+1.282
 former+1.272
Neg logit contributions
.-0.530
 and-0.463
,-0.453
 to-0.336
 with-0.307
Token Jill
Token ID22772
Feature activation+2.28e+01

Pos logit contributions
.+0.101
 and+0.093
,+0.088
+0.070
 to+0.068
Neg logit contributions
 Majesty-0.207
 majesty-0.197
heit-0.195
isher-0.194
agna-0.192
Token.
Token ID13
Feature activation-4.67e+01

Pos logit contributions
 Majesty+1.469
 majesty+1.381
 former+1.380
agna+1.347
 supervision+1.341
Neg logit contributions
.-0.203
 and-0.140
,-0.139
-0.049
 with-0.048
Token
Token ID198
Feature activation-3.79e+01

Pos logit contributions
.+1.330
,+1.207
 and+1.171
 to+0.874
 with+0.840
Neg logit contributions
 Majesty-2.567
Ta-2.517
umm-2.493
 former-2.470
 sidelines-2.450
Token
Token ID198
Feature activation-3.78e+01

Pos logit contributions
.+0.798
,+0.724
 and+0.682
 to+0.425
 with+0.405
Neg logit contributions
Ta-2.413
 Majesty-2.331
 sidelines-2.293
 former-2.286
umm-2.263
Tokenâ
Token ID22940
Feature activation+1.84e+01

Pos logit contributions
.+1.692
,+1.590
 and+1.551
 to+1.303
+1.301
Neg logit contributions
Ta-1.540
 Majesty-1.526
umm-1.494
 sidelines-1.471
agna-1.456
Token
Token ID26391
Feature activation+2.44e+01

Pos logit contributions
Ta+0.340
 Majesty+0.335
umm+0.321
 sidelines+0.307
agna+0.301
Neg logit contributions
.-1.230
,-1.181
 and-1.162
-1.044
 to-1.044
Token
Token ID129
Feature activation+1.61e+01

Pos logit contributions
 Majesty+1.353
 majesty+1.297
agna+1.278
Ta+1.273
bek+1.244
Neg logit contributions
.-0.653
 and-0.575
,-0.565
 to-0.444
 with-0.419
Token and
Token ID290
Feature activation-3.02e+01

Pos logit contributions
Ta+1.444
 Majesty+1.417
agna+1.383
isher+1.347
 pulp+1.342
Neg logit contributions
.-1.212
,-1.041
 and-1.013
-0.880
 to-0.863
Token ran
Token ID4966
Feature activation+8.15e+00

Pos logit contributions
.+1.155
,+1.073
 and+1.043
 to+0.847
+0.847
Neg logit contributions
Ta-1.424
 Majesty-1.414
umm-1.400
 sidelines-1.369
agna-1.364
Token in
Token ID287
Feature activation-1.69e+01

Pos logit contributions
 sidelines+0.386
 Majesty+0.348
uries+0.339
 unnecessary+0.336
 former+0.333
Neg logit contributions
.-0.222
,-0.209
 and-0.197
-0.177
 with-0.145
Token the
Token ID262
Feature activation-2.46e+01

Pos logit contributions
.+0.281
,+0.257
 and+0.234
+0.161
 to+0.139
Neg logit contributions
 sidelines-0.998
 former-0.960
 Majesty-0.947
heit-0.918
 ounces-0.917
Token waves
Token ID9813
Feature activation+3.09e+01

Pos logit contributions
.+1.134
,+1.063
 and+1.040
+0.883
 to+0.872
Neg logit contributions
 Majesty-0.985
agna-0.985
umm-0.977
Ta-0.973
icious-0.948
Token.
Token ID13
Feature activation-4.67e+01

Pos logit contributions
 sidelines+2.102
 Majesty+2.043
isher+2.032
umm+1.988
 former+1.974
Neg logit contributions
.-0.087
,-0.078
 and-0.024
+0.003
 to+0.145
Token
Token ID198
Feature activation-3.79e+01

Pos logit contributions
.+1.491
,+1.369
 and+1.313
 to+1.007
 with+0.988
Neg logit contributions
 Majesty-2.467
Ta-2.467
umm-2.409
 sidelines-2.404
 former-2.371
Token
Token ID198
Feature activation-3.78e+01

Pos logit contributions
.+0.611
,+0.549
 and+0.471
 to+0.243
 with+0.218
Neg logit contributions
Ta-2.544
 Majesty-2.462
 sidelines-2.445
 former-2.397
 majesty-2.392
TokenAs
Token ID1722
Feature activation+5.95e-01

Pos logit contributions
.+1.523
,+1.416
 and+1.391
+1.215
 with+1.150
Neg logit contributions
umm-1.726
 Majesty-1.714
Ta-1.661
agna-1.646
 former-1.630
Token they
Token ID484
Feature activation-9.67e+00

Pos logit contributions
 Majesty+0.033
 sidelines+0.032
Ta+0.032
umm+0.032
 former+0.032
Neg logit contributions
.-0.016
,-0.014
 and-0.014
-0.010
 to-0.010
Token were
Token ID547
Feature activation-4.99e+00

Pos logit contributions
.+0.411
,+0.383
 and+0.373
+0.313
 to+0.312
Neg logit contributions
Ta-0.413
 Majesty-0.405
umm-0.397
 sidelines-0.395
agna-0.390
Token y
Token ID331
Feature activation+1.02e+01

Pos logit contributions
wy+0.529
ode+0.486
isher+0.476
Ta+0.470
ited+0.454
Neg logit contributions
,-0.670
.-0.604
 and-0.601
 with-0.557
 to-0.544
Tokenummy
Token ID13513
Feature activation+2.39e+01

Pos logit contributions
umm+0.322
 Majesty+0.272
 pulp+0.265
 sidelines+0.263
isher+0.258
Neg logit contributions
.-0.541
,-0.518
 and-0.510
-0.474
 to-0.449
Token fruits
Token ID15921
Feature activation+3.37e+01

Pos logit contributions
 Majesty+0.967
umm+0.964
 pulp+0.933
Ta+0.932
 majesty+0.925
Neg logit contributions
.-0.940
,-0.882
 and-0.850
-0.775
 with-0.717
Token and
Token ID290
Feature activation-3.02e+01

Pos logit contributions
umm+2.220
atmeal+2.186
 pulp+2.173
wy+2.127
agna+2.088
Neg logit contributions
.-0.182
,-0.120
 and-0.053
-0.037
 the+0.088
Token veggies
Token ID39425
Feature activation+4.23e+01

Pos logit contributions
.+1.497
,+1.415
 and+1.386
+1.190
 to+1.188
Neg logit contributions
Ta-1.082
 Majesty-1.074
umm-1.052
 sidelines-1.027
agna-1.019
Token.
Token ID13
Feature activation-4.67e+01

Pos logit contributions
atmeal+2.718
 pulp+2.658
wy+2.627
agna+2.622
umm+2.611
Neg logit contributions
.-0.146
,-0.133
 and-0.071
-0.010
 the+0.139
Token One
Token ID1881
Feature activation-1.07e+01

Pos logit contributions
.+1.419
,+1.297
 and+1.242
 to+0.953
 with+0.921
Neg logit contributions
 Majesty-2.541
Ta-2.508
umm-2.477
 sidelines-2.445
 former-2.438
Token day
Token ID1110
Feature activation+7.89e+00

Pos logit contributions
.+0.426
,+0.393
 and+0.384
+0.328
 to+0.320
Neg logit contributions
umm-0.445
 Majesty-0.440
Ta-0.440
 former-0.427
 pulp-0.425
Token,
Token ID11
Feature activation-3.61e+01

Pos logit contributions
 Majesty+0.550
Ta+0.546
 majesty+0.541
 sidelines+0.535
 pulp+0.534
Neg logit contributions
.-0.063
 and-0.035
,-0.015
+0.010
 to+0.017
Token Lucky
Token ID27963
Feature activation+2.35e+01

Pos logit contributions
.+1.111
,+0.999
 and+0.973
+0.737
 to+0.726
Neg logit contributions
 Majesty-1.932
Ta-1.922
umm-1.879
 sidelines-1.876
agna-1.867
Token went
Token ID1816
Feature activation-2.75e-01

Pos logit contributions
 Majesty+1.131
Ta+1.076
 majesty+1.068
agna+1.047
 former+1.030
Neg logit contributions
.-0.786
,-0.700
 and-0.669
-0.566
 with-0.563
Token the
Token ID262
Feature activation-2.46e+01

Pos logit contributions
 former+1.045
 Majesty+1.040
agna+1.022
isher+1.008
ocol+0.998
Neg logit contributions
.-0.212
,-0.185
 and-0.158
-0.146
 to-0.055
Token handle
Token ID5412
Feature activation+2.78e+01

Pos logit contributions
.+1.243
,+1.173
 and+1.152
 to+0.984
+0.981
Neg logit contributions
Ta-0.878
 Majesty-0.849
umm-0.846
agna-0.823
 sidelines-0.810
Token was
Token ID373
Feature activation-1.77e+01

Pos logit contributions
umm+1.787
isher+1.772
 sidelines+1.771
 Majesty+1.762
Ta+1.745
Neg logit contributions
.-0.322
,-0.275
 and-0.229
-0.155
 to-0.077
Token completely
Token ID3190
Feature activation+1.99e+01

Pos logit contributions
.+0.662
,+0.619
 and+0.601
+0.509
 to+0.478
Neg logit contributions
 Majesty-0.798
Ta-0.781
 sidelines-0.778
 former-0.774
umm-0.771
Token soaked
Token ID33657
Feature activation+2.73e+01

Pos logit contributions
 Majesty+0.764
 former+0.757
 sidelines+0.746
umm+0.741
 pulp+0.729
Neg logit contributions
.-0.831
,-0.789
 and-0.772
-0.696
 to-0.633
Token.
Token ID13
Feature activation-4.67e+01

Pos logit contributions
umm+1.773
isher+1.737
 sidelines+1.694
 Majesty+1.681
 former+1.677
Neg logit contributions
.-0.213
,-0.198
-0.181
 and-0.144
 to-0.049
Token He
Token ID679
Feature activation-3.54e+00

Pos logit contributions
.+1.594
,+1.474
 and+1.422
 to+1.124
 with+1.089
Neg logit contributions
 Majesty-2.354
Ta-2.348
umm-2.304
 sidelines-2.281
 former-2.273
Token pond
Token ID16723
Feature activation+3.13e+01

Pos logit contributions
.+0.129
,+0.119
 and+0.115
+0.094
 to+0.093
Neg logit contributions
Ta-0.171
 Majesty-0.170
umm-0.166
 sidelines-0.165
agna-0.164
Tokenered
Token ID1068
Feature activation+3.49e+01

Pos logit contributions
 Majesty+1.669
 sidelines+1.645
 pulp+1.642
Ta+1.637
isher+1.619
Neg logit contributions
.-0.827
,-0.715
 and-0.683
-0.580
 to-0.497
Token what
Token ID644
Feature activation-7.61e+00

Pos logit contributions
 sidelines+1.703
umm+1.699
 former+1.696
 Majesty+1.671
 pulp+1.669
Neg logit contributions
.-0.822
,-0.699
-0.662
 and-0.632
 with-0.453
Token could
Token ID714
Feature activation-7.97e+00

Pos logit contributions
.+0.196
,+0.173
 and+0.168
+0.120
 with+0.114
Neg logit contributions
Ta-0.438
umm-0.432
 Majesty-0.431
 sidelines-0.423
agna-0.422
Token it
Token ID340
Feature activation-4.33e+00

Pos logit contributions
.+0.796
,+0.751
 and+0.738
 to+0.636
+0.632
Neg logit contributions
Ta-0.525
 Majesty-0.515
umm-0.506
 sidelines-0.490
agna-0.487
Token.
Token ID13
Feature activation-4.71e+01

Pos logit contributions
.+0.103
,+0.094
 and+0.089
+0.071
 with+0.062
Neg logit contributions
 Majesty-0.241
 former-0.233
Ta-0.233
umm-0.232
 pulp-0.231
Token It
Token ID632
Feature activation+6.48e+00

Pos logit contributions
.+1.604
,+1.480
 and+1.431
 to+1.137
 with+1.097
Neg logit contributions
 Majesty-2.362
Ta-2.339
umm-2.284
 sidelines-2.267
 former-2.259
Token was
Token ID373
Feature activation-1.76e+01

Pos logit contributions
Ta+0.349
 Majesty+0.348
umm+0.343
 sidelines+0.338
agna+0.336
Neg logit contributions
.-0.202
,-0.184
 and-0.177
-0.137
 to-0.136
Token paint
Token ID7521
Feature activation+2.59e+01

Pos logit contributions
.+0.539
,+0.491
 and+0.470
+0.368
 to+0.356
Neg logit contributions
 Majesty-0.941
Ta-0.934
umm-0.919
 former-0.913
 sidelines-0.912
Token.
Token ID13
Feature activation-4.67e+01

Pos logit contributions
umm+1.729
icious+1.629
Ta+1.615
isher+1.610
agna+1.608
Neg logit contributions
.-0.273
,-0.210
 and-0.186
-0.175
 to-0.100
Token It
Token ID632
Feature activation+7.07e+00

Pos logit contributions
.+1.642
,+1.523
 and+1.466
 to+1.177
 with+1.138
Neg logit contributions
Ta-2.308
 Majesty-2.294
umm-2.237
 sidelines-2.212
 former-2.200
Token spl
Token ID4328
Feature activation+1.67e+01

Pos logit contributions
Ta+0.396
 Majesty+0.394
umm+0.389
 sidelines+0.383
agna+0.382
Neg logit contributions
.-0.204
,-0.186
 and-0.177
-0.134
 to-0.132
Tokenashed
Token ID5263
Feature activation+4.46e+01

Pos logit contributions
 Majesty+0.362
Ta+0.353
umm+0.335
 sidelines+0.330
agna+0.320
Neg logit contributions
.-1.083
,-1.034
 and-1.024
-0.909
 to-0.908
Token on
Token ID319
Feature activation-9.93e+00

Pos logit contributions
isher+2.551
umm+2.546
 pulp+2.530
Ta+2.465
 sidelines+2.451
Neg logit contributions
.-0.644
,-0.560
 and-0.452
-0.435
 to-0.285
Token the
Token ID262
Feature activation-2.35e+01

Pos logit contributions
.+0.173
,+0.150
 and+0.128
+0.084
 to+0.076
Neg logit contributions
 sidelines-0.600
 pulp-0.595
 Majesty-0.594
 former-0.586
umm-0.586
Token mean
Token ID1612
Feature activation+1.29e+01

Pos logit contributions
Ta+0.760
 Majesty+0.743
 sidelines+0.720
icious+0.712
agna+0.712
Neg logit contributions
.-0.612
,-0.559
 and-0.552
 to-0.441
 with-0.441
Token to
Token ID284
Feature activation-2.54e+01

Pos logit contributions
 Majesty+0.711
 majesty+0.675
 pulp+0.671
umm+0.669
heit+0.661
Neg logit contributions
.-0.304
,-0.277
 and-0.257
-0.211
 with-0.197
Token make
Token ID787
Feature activation+1.74e+00

Pos logit contributions
.+0.993
,+0.928
 and+0.906
+0.741
 to+0.734
Neg logit contributions
 Majesty-1.158
Ta-1.149
 sidelines-1.124
umm-1.119
 former-1.103
Token a
Token ID257
Feature activation-2.12e+01

Pos logit contributions
 Majesty+0.088
Ta+0.085
 sidelines+0.085
umm+0.084
 former+0.084
Neg logit contributions
.-0.057
,-0.054
 and-0.052
-0.041
 to-0.040
Token mess
Token ID2085
Feature activation+2.84e+01

Pos logit contributions
.+1.077
,+1.016
 and+0.998
 to+0.858
+0.853
Neg logit contributions
Ta-0.764
 Majesty-0.736
umm-0.731
agna-0.708
 sidelines-0.703
Token.
Token ID13
Feature activation-4.67e+01

Pos logit contributions
umm+1.844
isher+1.811
wy+1.796
icious+1.777
 sidelines+1.769
Neg logit contributions
.-0.133
,-0.126
-0.095
 and-0.076
 to+0.006
Token And
Token ID843
Feature activation-5.50e+00

Pos logit contributions
.+1.532
,+1.409
 and+1.361
 to+1.065
 with+1.027
Neg logit contributions
 Majesty-2.426
Ta-2.414
umm-2.350
 sidelines-2.328
 former-2.326
Token I
Token ID314
Feature activation+1.99e+00

Pos logit contributions
.+0.163
,+0.145
 and+0.141
+0.108
 to+0.104
Neg logit contributions
 Majesty-0.297
Ta-0.292
 former-0.286
umm-0.286
 sidelines-0.285
Token know
Token ID760
Feature activation+1.21e+01

Pos logit contributions
Ta+0.078
 Majesty+0.076
 former+0.075
agna+0.073
 majesty+0.072
Neg logit contributions
.-0.088
,-0.083
 and-0.080
 to-0.068
 with-0.068
Token you
Token ID345
Feature activation+3.52e+00

Pos logit contributions
 Majesty+0.625
 majesty+0.601
agna+0.581
 pulp+0.569
icious+0.566
Neg logit contributions
.-0.290
 and-0.280
,-0.252
-0.229
 to-0.195
Token tried
Token ID3088
Feature activation+6.49e+00

Pos logit contributions
 Majesty+0.146
Ta+0.145
agna+0.141
 majesty+0.138
 former+0.137
Neg logit contributions
.-0.140
,-0.131
 and-0.126
-0.111
 with-0.108
Token little
Token ID1310
Feature activation-4.91e+00

Pos logit contributions
.+1.178
,+1.070
 and+1.037
+0.915
 to+0.890
Neg logit contributions
 Majesty-1.130
Ta-1.106
umm-1.090
 majesty-1.081
agna-1.072
Token bird
Token ID6512
Feature activation+2.23e+01

Pos logit contributions
.+0.221
,+0.207
 and+0.202
+0.178
 to+0.173
Neg logit contributions
 Majesty-0.186
umm-0.180
Ta-0.177
 sidelines-0.176
 majesty-0.173
Tokenie
Token ID494
Feature activation+1.57e+01

Pos logit contributions
 Majesty+1.146
 majesty+1.086
 pulp+1.052
 former+1.051
Ta+1.047
Neg logit contributions
.-0.580
,-0.526
 and-0.503
-0.429
 to-0.407
Token named
Token ID3706
Feature activation-8.83e-01

Pos logit contributions
 Majesty+0.867
 majesty+0.827
Ta+0.799
 pulp+0.796
 former+0.794
Neg logit contributions
.-0.403
,-0.364
 and-0.343
-0.280
 to-0.273
Token Tweet
Token ID18752
Feature activation+2.31e+01

Pos logit contributions
.+0.037
,+0.035
 and+0.034
 with+0.029
 to+0.028
Neg logit contributions
 Majesty-0.036
Ta-0.035
 sidelines-0.034
 former-0.034
umm-0.034
Token.
Token ID13
Feature activation-4.67e+01

Pos logit contributions
 Majesty+0.681
wy+0.660
 former+0.648
 majesty+0.621
isher+0.602
Neg logit contributions
,-1.056
.-1.054
 and-0.998
 to-0.922
 with-0.915
Token Tweet
Token ID18752
Feature activation+2.34e+01

Pos logit contributions
.+2.205
,+2.057
 and+2.023
 to+1.757
 with+1.700
Neg logit contributions
 Majesty-1.773
umm-1.698
Ta-1.689
 former-1.664
 sidelines-1.633
Token was
Token ID373
Feature activation-1.70e+01

Pos logit contributions
 Majesty+0.759
Ta+0.733
wy+0.727
 former+0.717
 majesty+0.705
Neg logit contributions
.-1.077
,-1.001
 and-0.957
 to-0.913
-0.862
Token very
Token ID845
Feature activation-1.50e+01

Pos logit contributions
.+0.547
,+0.479
 and+0.466
 to+0.359
+0.359
Neg logit contributions
 Majesty-0.885
Ta-0.865
 former-0.847
 majesty-0.837
 sidelines-0.831
Token happy
Token ID3772
Feature activation+4.01e+00

Pos logit contributions
.+0.636
,+0.594
 and+0.580
+0.482
 to+0.481
Neg logit contributions
Ta-0.648
 Majesty-0.647
umm-0.632
 sidelines-0.622
 former-0.617
Token and
Token ID290
Feature activation-2.91e+01

Pos logit contributions
 Majesty+0.225
umm+0.216
 majesty+0.214
 former+0.212
 habitats+0.208
Neg logit contributions
.-0.045
,-0.034
-0.032
 and-0.022
 the-0.013
Token Max
Token ID5436
Feature activation+9.94e+00

Pos logit contributions
Ta+0.193
 Majesty+0.192
umm+0.188
 sidelines+0.185
agna+0.184
Neg logit contributions
.-0.184
,-0.172
 and-0.167
-0.139
 to-0.139
Token had
Token ID550
Feature activation-6.94e+00

Pos logit contributions
 Majesty+0.598
 majesty+0.570
 sidelines+0.562
Ta+0.558
 former+0.556
Neg logit contributions
.-0.179
,-0.160
 and-0.143
-0.109
 with-0.104
Token listened
Token ID16399
Feature activation+1.36e+01

Pos logit contributions
.+0.207
,+0.193
 and+0.183
+0.143
 with+0.136
Neg logit contributions
 Majesty-0.370
Ta-0.368
 former-0.358
 sidelines-0.357
umm-0.355
Token to
Token ID284
Feature activation-2.54e+01

Pos logit contributions
 former+0.821
 Majesty+0.785
 sidelines+0.782
mic+0.782
icious+0.750
Neg logit contributions
.-0.160
,-0.144
 and-0.106
-0.092
 with-0.043
Token her
Token ID607
Feature activation-9.91e+00

Pos logit contributions
.+0.746
,+0.690
 and+0.658
 to+0.495
+0.491
Neg logit contributions
 Majesty-1.372
 sidelines-1.353
Ta-1.337
 former-1.332
agna-1.301
Token.
Token ID13
Feature activation-4.67e+01

Pos logit contributions
.+0.213
,+0.192
 and+0.178
+0.138
 to+0.120
Neg logit contributions
 Majesty-0.578
 former-0.567
 majesty-0.542
agna-0.537
 sidelines-0.531
Token She
Token ID1375
Feature activation-5.18e+00

Pos logit contributions
.+1.552
,+1.445
 and+1.379
 to+1.082
 with+1.061
Neg logit contributions
 Majesty-2.369
Ta-2.353
 sidelines-2.296
 former-2.285
umm-2.277
Token wished
Token ID16555
Feature activation+1.40e+01

Pos logit contributions
.+0.196
,+0.181
 and+0.173
+0.143
 to+0.142
Neg logit contributions
Ta-0.242
 Majesty-0.242
 former-0.232
umm-0.231
agna-0.230
Token Max
Token ID5436
Feature activation+1.07e+01

Pos logit contributions
 sidelines+0.707
 Majesty+0.704
Ta+0.692
 former+0.688
umm+0.688
Neg logit contributions
.-0.385
,-0.335
 and-0.316
-0.263
 with-0.231
Token was
Token ID373
Feature activation-1.67e+01

Pos logit contributions
 Majesty+0.489
Ta+0.484
agna+0.475
 former+0.467
 majesty+0.464
Neg logit contributions
.-0.346
,-0.329
 and-0.307
-0.250
 with-0.240
Token still
Token ID991
Feature activation+2.31e+00

Pos logit contributions
.+0.616
,+0.577
 and+0.558
+0.460
 to+0.443
Neg logit contributions
 Majesty-0.789
Ta-0.778
umm-0.757
 sidelines-0.757
 former-0.756
Token could
Token ID714
Feature activation-9.05e+00

Pos logit contributions
.+0.537
,+0.478
 and+0.426
 to+0.251
 with+0.229
Neg logit contributions
Ta-1.721
 sidelines-1.606
 Majesty-1.551
vation-1.535
Whe-1.513
Token help
Token ID1037
Feature activation+1.85e+00

Pos logit contributions
.+0.346
,+0.328
 and+0.318
+0.266
 to+0.258
Neg logit contributions
 Majesty-0.391
 former-0.377
 sidelines-0.375
Ta-0.372
 majesty-0.366
Token with
Token ID351
Feature activation-1.59e+01

Pos logit contributions
 sidelines+0.102
 Majesty+0.101
 former+0.101
umm+0.101
ocol+0.099
Neg logit contributions
.-0.037
,-0.035
 and-0.033
-0.026
 to-0.020
Token the
Token ID262
Feature activation-2.46e+01

Pos logit contributions
.+0.400
,+0.365
 and+0.358
+0.267
 to+0.254
Neg logit contributions
 Majesty-0.912
 former-0.890
 sidelines-0.882
umm-0.877
Ta-0.876
Token job
Token ID1693
Feature activation+3.03e+01

Pos logit contributions
.+1.254
,+1.175
 and+1.160
 to+1.001
+0.971
Neg logit contributions
Ta-0.887
umm-0.843
 Majesty-0.841
agna-0.817
 sidelines-0.808
Token.
Token ID13
Feature activation-4.67e+01

Pos logit contributions
 Majesty+1.907
umm+1.894
 sidelines+1.888
agna+1.879
Ta+1.851
Neg logit contributions
.-0.452
,-0.390
 and-0.364
-0.250
 to-0.167
Token His
Token ID2399
Feature activation+4.89e+00

Pos logit contributions
.+1.436
,+1.322
 and+1.266
 to+0.972
 with+0.939
Neg logit contributions
 Majesty-2.511
Ta-2.510
umm-2.465
 former-2.432
 sidelines-2.422
Token dad
Token ID9955
Feature activation+1.52e+01

Pos logit contributions
Ta+0.223
 Majesty+0.221
umm+0.219
 sidelines+0.215
agna+0.213
Neg logit contributions
.-0.196
,-0.183
 and-0.178
-0.146
 to-0.146
Token said
Token ID531
Feature activation-8.05e+00

Pos logit contributions
 Majesty+0.712
Ta+0.700
agna+0.686
 majesty+0.675
isher+0.674
Neg logit contributions
.-0.526
,-0.466
 and-0.444
-0.385
 to-0.375
Token,
Token ID11
Feature activation-3.60e+01

Pos logit contributions
.+0.174
 and+0.143
,+0.130
+0.110
 with+0.090
Neg logit contributions
 Majesty-0.465
agna-0.452
 majesty-0.444
 pulp-0.436
 sidelines-0.431
Token "
Token ID366
Feature activation-1.67e+01

Pos logit contributions
.+1.092
,+0.967
 and+0.883
+0.688
 with+0.685
Neg logit contributions
 Majesty-1.845
agna-1.825
 sidelines-1.793
Ta-1.757
umm-1.752
Token does
Token ID857
Feature activation+8.82e+00

Pos logit contributions
isher+1.958
arians+1.909
agna+1.903
wy+1.902
 sidelines+1.897
Neg logit contributions
-0.176
.-0.152
,-0.115
 and-0.059
 to+0.020
Token not
Token ID407
Feature activation-3.78e+00

Pos logit contributions
 Majesty+0.454
 sidelines+0.431
 majesty+0.426
 former+0.425
Ta+0.412
Neg logit contributions
.-0.258
,-0.241
 and-0.235
-0.190
 with-0.176
Token like
Token ID588
Feature activation-1.21e+01

Pos logit contributions
.+0.165
,+0.157
 and+0.153
+0.131
 to+0.127
Neg logit contributions
 Majesty-0.150
Ta-0.147
 sidelines-0.145
umm-0.143
 former-0.143
Token to
Token ID284
Feature activation-2.54e+01

Pos logit contributions
.+0.351
,+0.323
 and+0.309
+0.241
 with+0.226
Neg logit contributions
 Majesty-0.661
 sidelines-0.643
Ta-0.639
 majesty-0.632
umm-0.631
Token dance
Token ID9280
Feature activation+2.81e+01

Pos logit contributions
.+1.101
,+1.040
 and+1.025
+0.874
 to+0.859
Neg logit contributions
 Majesty-1.027
Ta-1.004
 sidelines-0.993
 former-0.973
umm-0.973
Token.
Token ID13
Feature activation-4.67e+01

Pos logit contributions
 Majesty+1.806
 former+1.782
 sidelines+1.736
 majesty+1.735
isher+1.697
Neg logit contributions
.-0.237
,-0.207
 and-0.170
-0.129
 to-0.009
Token It
Token ID632
Feature activation+7.07e+00

Pos logit contributions
.+1.625
,+1.504
 and+1.459
 to+1.159
 with+1.124
Neg logit contributions
 Majesty-2.336
Ta-2.334
umm-2.272
 sidelines-2.245
 former-2.235
Token likes
Token ID7832
Feature activation+1.18e+01

Pos logit contributions
Ta+0.334
 Majesty+0.334
umm+0.323
 former+0.319
 sidelines+0.319
Neg logit contributions
.-0.262
,-0.244
 and-0.235
-0.193
 to-0.190
Token to
Token ID284
Feature activation-2.46e+01

Pos logit contributions
 Majesty+0.759
 sidelines+0.744
Ta+0.741
 majesty+0.737
 pulp+0.732
Neg logit contributions
.-0.196
,-0.171
 and-0.149
-0.093
 with-0.075
Token bite
Token ID13197
Feature activation+1.89e+01

Pos logit contributions
.+1.111
,+1.049
 and+1.030
+0.880
 to+0.874
Neg logit contributions
 Majesty-0.962
Ta-0.952
 sidelines-0.926
 former-0.909
umm-0.906
Token.
Token ID13
Feature activation-4.60e+01

Pos logit contributions
 pulp+1.170
umm+1.147
 Majesty+1.142
wy+1.140
Ta+1.138
Neg logit contributions
.-0.188
,-0.161
 and-0.120
-0.072
 to-0.040
Token the
Token ID262
Feature activation-2.46e+01

Pos logit contributions
 Majesty+0.620
agna+0.594
 sidelines+0.591
Ta+0.590
 former+0.589
Neg logit contributions
.-0.293
,-0.277
 and-0.265
-0.196
 to-0.193
Token bird
Token ID6512
Feature activation+2.23e+01

Pos logit contributions
.+1.243
,+1.173
 and+1.152
 to+0.984
+0.981
Neg logit contributions
Ta-0.878
 Majesty-0.849
umm-0.846
agna-0.823
 sidelines-0.810
Token became
Token ID2627
Feature activation+6.82e+00

Pos logit contributions
 Majesty+1.267
 former+1.215
 majesty+1.211
 sidelines+1.211
Ta+1.211
Neg logit contributions
.-0.475
,-0.439
 and-0.399
-0.309
 to-0.293
Token very
Token ID845
Feature activation-1.58e+01

Pos logit contributions
 Majesty+0.333
 former+0.323
Ta+0.320
 sidelines+0.316
umm+0.315
Neg logit contributions
.-0.230
,-0.214
 and-0.207
-0.171
 to-0.161
Token sick
Token ID6639
Feature activation+1.84e+01

Pos logit contributions
.+0.694
,+0.651
 and+0.636
+0.536
 to+0.532
Neg logit contributions
 Majesty-0.652
Ta-0.652
umm-0.636
 sidelines-0.626
 former-0.622
Token.
Token ID13
Feature activation-4.67e+01

Pos logit contributions
umm+1.284
 Majesty+1.250
 pulp+1.243
 sidelines+1.240
isher+1.232
Neg logit contributions
.-0.029
,-0.022
+0.003
 and+0.026
 with+0.087
Token Lily
Token ID20037
Feature activation+2.50e+00

Pos logit contributions
.+1.500
,+1.389
 and+1.332
 to+1.041
 with+1.009
Neg logit contributions
 Majesty-2.442
Ta-2.402
 former-2.361
umm-2.359
 sidelines-2.350
Token tried
Token ID3088
Feature activation+6.04e+00

Pos logit contributions
 Majesty+0.114
Ta+0.109
 majesty+0.107
 former+0.106
 pulp+0.105
Neg logit contributions
.-0.089
,-0.080
 and-0.074
-0.066
 to-0.066
Token to
Token ID284
Feature activation-2.46e+01

Pos logit contributions
 former+0.375
 sidelines+0.375
 Majesty+0.371
 pulp+0.367
 unnecessary+0.364
Neg logit contributions
.-0.095
,-0.072
 and-0.058
-0.036
 with-0.025
Token help
Token ID1037
Feature activation+2.76e+00

Pos logit contributions
.+1.091
,+1.028
 and+0.998
+0.853
 to+0.851
Neg logit contributions
 Majesty-0.983
Ta-0.976
 sidelines-0.951
 former-0.942
umm-0.936
Token it
Token ID340
Feature activation-3.25e+00

Pos logit contributions
 sidelines+0.140
agna+0.140
umm+0.138
icious+0.137
 Majesty+0.136
Neg logit contributions
.-0.064
,-0.054
 and-0.053
-0.043
 to-0.040
Token that
Token ID326
Feature activation-1.96e+01

Pos logit contributions
Ta+0.709
 Majesty+0.690
 sidelines+0.681
icious+0.673
ited+0.671
Neg logit contributions
.-0.709
,-0.629
 and-0.598
-0.545
 to-0.535
Token she
Token ID673
Feature activation-7.27e+00

Pos logit contributions
.+0.596
,+0.543
 and+0.527
+0.405
 to+0.396
Neg logit contributions
 Majesty-1.039
Ta-1.030
umm-1.016
 sidelines-1.000
 former-1.000
Token loved
Token ID6151
Feature activation+6.56e-01

Pos logit contributions
.+0.270
,+0.252
 and+0.243
+0.201
 to+0.199
Neg logit contributions
 Majesty-0.342
Ta-0.338
umm-0.326
 former-0.325
agna-0.324
Token to
Token ID284
Feature activation-2.54e+01

Pos logit contributions
 Majesty+0.040
 former+0.039
 sidelines+0.039
umm+0.039
 majesty+0.038
Neg logit contributions
.-0.012
,-0.011
 and-0.010
-0.007
 with-0.006
Token wear
Token ID5806
Feature activation+1.86e+01

Pos logit contributions
.+1.093
,+1.025
 and+1.010
+0.867
 to+0.855
Neg logit contributions
 Majesty-1.037
Ta-1.003
 former-0.991
 sidelines-0.983
 majesty-0.978
Token.
Token ID13
Feature activation-4.67e+01

Pos logit contributions
 Majesty+0.902
Ta+0.882
 former+0.882
 sidelines+0.881
 majesty+0.861
Neg logit contributions
.-0.504
,-0.463
 and-0.440
-0.389
 to-0.332
Token One
Token ID1881
Feature activation-1.07e+01

Pos logit contributions
.+1.604
,+1.496
 and+1.429
 to+1.146
 with+1.132
Neg logit contributions
 Majesty-2.346
Ta-2.322
umm-2.287
 former-2.267
 sidelines-2.246
Token day
Token ID1110
Feature activation+7.89e+00

Pos logit contributions
.+0.426
,+0.395
 and+0.383
+0.323
 to+0.317
Neg logit contributions
Ta-0.469
umm-0.455
 Majesty-0.451
 former-0.443
 sidelines-0.439
Token,
Token ID11
Feature activation-3.61e+01

Pos logit contributions
 Majesty+0.515
Ta+0.512
 majesty+0.508
 sidelines+0.501
 pulp+0.500
Neg logit contributions
.-0.073
 and-0.052
,-0.020
-0.007
 to+0.004
Token she
Token ID673
Feature activation-6.37e+00

Pos logit contributions
.+1.124
,+1.010
 and+0.982
+0.752
 to+0.738
Neg logit contributions
 Majesty-1.877
Ta-1.868
 sidelines-1.840
umm-1.833
agna-1.814
Token went
Token ID1816
Feature activation-2.75e-01

Pos logit contributions
.+0.229
,+0.209
 and+0.200
+0.165
 to+0.162
Neg logit contributions
 Majesty-0.308
Ta-0.307
 former-0.292
umm-0.291
agna-0.291
Token wished
Token ID16555
Feature activation+1.33e+01

Pos logit contributions
umm+1.974
 former+1.891
agna+1.863
 sidelines+1.856
 Majesty+1.849
Neg logit contributions
.-0.157
-0.133
,-0.084
 and+0.012
 the+0.088
Token they
Token ID484
Feature activation-1.06e+01

Pos logit contributions
 Majesty+0.733
 former+0.718
 majesty+0.706
 sidelines+0.702
agna+0.693
Neg logit contributions
.-0.289
,-0.253
 and-0.241
-0.194
 with-0.160
Token had
Token ID550
Feature activation-7.05e+00

Pos logit contributions
.+0.387
,+0.367
 and+0.354
+0.293
 with+0.290
Neg logit contributions
 Majesty-0.442
Ta-0.442
agna-0.438
 former-0.431
 sidelines-0.421
Token a
Token ID257
Feature activation-2.12e+01

Pos logit contributions
.+0.281
,+0.268
 and+0.260
+0.220
 with+0.211
Neg logit contributions
 Majesty-0.304
Ta-0.298
agna-0.289
 former-0.288
 majesty-0.285
Token car
Token ID1097
Feature activation+2.50e+01

Pos logit contributions
.+0.976
,+0.924
 and+0.898
 to+0.780
+0.757
Neg logit contributions
Ta-0.870
umm-0.865
 Majesty-0.837
 sidelines-0.818
icious-0.812
Token.
Token ID13
Feature activation-4.67e+01

Pos logit contributions
isher+1.624
 Majesty+1.621
agna+1.617
 sidelines+1.610
 former+1.610
Neg logit contributions
.-0.147
,-0.123
 and-0.082
-0.042
 with+0.049
Token But
Token ID887
Feature activation-6.99e+00

Pos logit contributions
.+1.548
,+1.434
 and+1.380
 to+1.105
 with+1.053
Neg logit contributions
 Majesty-2.382
Ta-2.364
umm-2.280
 former-2.277
 sidelines-2.273
Token they
Token ID484
Feature activation-9.96e+00

Pos logit contributions
.+0.190
,+0.166
 and+0.162
+0.122
 to+0.117
Neg logit contributions
 Majesty-0.398
Ta-0.392
umm-0.387
agna-0.386
 sidelines-0.385
Token knew
Token ID2993
Feature activation+4.15e+00

Pos logit contributions
.+0.370
,+0.342
 and+0.331
+0.271
 to+0.269
Neg logit contributions
Ta-0.474
 Majesty-0.472
umm-0.460
agna-0.456
 sidelines-0.456
Token mom
Token ID1995
Feature activation+9.74e+00

Pos logit contributions
 Majesty+0.219
 former+0.213
agna+0.212
 majesty+0.210
isher+0.206
Neg logit contributions
.-0.108
,-0.099
 and-0.097
-0.076
 with-0.069
Token was
Token ID373
Feature activation-1.65e+01

Pos logit contributions
wy+0.455
atmeal+0.449
agna+0.444
isher+0.443
 Majesty+0.439
Neg logit contributions
.-0.261
,-0.245
 and-0.220
-0.204
 with-0.187
Token a
Token ID257
Feature activation-2.12e+01

Pos logit contributions
 Majesty+1.656
 former+1.628
agna+1.623
umm+1.607
 sidelines+1.602
Neg logit contributions
.-0.061
,-0.040
 and-0.025
+0.018
 to+0.104
Token branch
Token ID8478
Feature activation+2.41e+01

Pos logit contributions
.+0.922
,+0.872
 and+0.849
 to+0.704
+0.701
Neg logit contributions
Ta-0.912
umm-0.892
 Majesty-0.890
agna-0.862
 sidelines-0.859
Token and
Token ID290
Feature activation-3.03e+01

Pos logit contributions
Ta+1.491
umm+1.479
 sidelines+1.473
 Majesty+1.473
isher+1.473
Neg logit contributions
.-0.229
,-0.178
 and-0.136
-0.102
 the-0.021
Token it
Token ID340
Feature activation-4.31e+00

Pos logit contributions
.+1.222
,+1.139
 and+1.112
 to+0.916
+0.913
Neg logit contributions
Ta-1.364
 Majesty-1.360
umm-1.352
 sidelines-1.316
agna-1.311
Token broke
Token ID6265
Feature activation+1.84e+01

Pos logit contributions
.+0.178
,+0.167
 and+0.162
+0.136
 to+0.134
Neg logit contributions
 Majesty-0.186
Ta-0.186
umm-0.180
 sidelines-0.177
 former-0.177
Token.
Token ID13
Feature activation-4.67e+01

Pos logit contributions
umm+1.154
 pulp+1.150
 former+1.136
 Majesty+1.123
isher+1.122
Neg logit contributions
,-0.188
.-0.172
 and-0.142
-0.115
 to-0.052
Token He
Token ID679
Feature activation-3.54e+00

Pos logit contributions
.+1.550
,+1.427
 and+1.377
 to+1.071
 with+1.047
Neg logit contributions
 Majesty-2.382
Ta-2.380
umm-2.310
 sidelines-2.305
 former-2.284
Token was
Token ID373
Feature activation-1.70e+01

Pos logit contributions
.+0.136
,+0.126
 and+0.122
+0.101
 to+0.100
Neg logit contributions
Ta-0.165
 Majesty-0.164
umm-0.159
 sidelines-0.158
agna-0.157
Token so
Token ID523
Feature activation-2.16e+01

Pos logit contributions
.+0.574
,+0.530
 and+0.510
+0.408
 to+0.395
Neg logit contributions
 Majesty-0.859
Ta-0.858
 sidelines-0.838
umm-0.832
 former-0.824
Token surprised
Token ID6655
Feature activation+8.70e+00

Pos logit contributions
.+0.966
,+0.908
 and+0.887
+0.747
 to+0.744
Neg logit contributions
Ta-0.879
 Majesty-0.875
umm-0.850
 sidelines-0.841
 former-0.833
Token that
Token ID326
Feature activation-1.85e+01

Pos logit contributions
umm+0.460
isher+0.448
 sidelines+0.439
 Majesty+0.435
rim+0.431
Neg logit contributions
.-0.103
,-0.091
-0.082
 and-0.063
 a-0.047
Token something
Token ID1223
Feature activation-1.49e+00

Pos logit contributions
umm+0.962
wy+0.959
isher+0.941
ubb+0.938
Ta+0.910
Neg logit contributions
.-1.209
,-1.163
 and-1.071
-1.033
 to-0.998
Token soft
Token ID2705
Feature activation+1.17e+01

Pos logit contributions
.+0.046
,+0.043
 and+0.042
+0.035
 with+0.032
Neg logit contributions
umm-0.070
 Majesty-0.068
Ta-0.068
 pulp-0.068
 former-0.067
Token and
Token ID290
Feature activation-3.03e+01

Pos logit contributions
umm+0.810
 Majesty+0.805
 pulp+0.798
 former+0.784
isher+0.783
Neg logit contributions
.-0.063
,-0.041
-0.020
 and-0.016
 to+0.025
Token squ
Token ID2809
Feature activation+1.37e+01

Pos logit contributions
.+1.346
,+1.262
 and+1.232
+1.035
 to+1.035
Neg logit contributions
Ta-1.247
 Majesty-1.241
umm-1.221
 sidelines-1.195
agna-1.189
Tokenishy
Token ID49785
Feature activation+4.76e+01

Pos logit contributions
Ta+0.256
 Majesty+0.254
umm+0.243
 sidelines+0.231
agna+0.226
Neg logit contributions
.-0.916
,-0.880
 and-0.865
-0.780
 to-0.777
Token.
Token ID13
Feature activation-4.67e+01

Pos logit contributions
umm+3.099
 pulp+3.085
 Majesty+3.080
isher+3.040
 sidelines+3.000
Neg logit contributions
.-0.409
,-0.382
-0.315
 and-0.292
 to-0.057
Token It
Token ID632
Feature activation+7.07e+00

Pos logit contributions
.+1.524
,+1.399
 and+1.346
 to+1.047
 with+1.022
Neg logit contributions
 Majesty-2.415
Ta-2.382
umm-2.338
 sidelines-2.315
 former-2.306
Token was
Token ID373
Feature activation-1.70e+01

Pos logit contributions
Ta+0.384
 Majesty+0.384
umm+0.377
 sidelines+0.371
agna+0.370
Neg logit contributions
.-0.216
,-0.197
 and-0.189
-0.145
 to-0.143
Token a
Token ID257
Feature activation-2.04e+01

Pos logit contributions
.+0.467
,+0.421
 and+0.402
+0.299
 to+0.290
Neg logit contributions
Ta-0.967
 Majesty-0.967
umm-0.951
 sidelines-0.940
 former-0.939
Token pile
Token ID14540
Feature activation+2.59e+01

Pos logit contributions
.+0.900
,+0.857
 and+0.829
 to+0.691
 with+0.684
Neg logit contributions
Ta-0.871
 Majesty-0.846
umm-0.843
 sidelines-0.814
agna-0.810
Token of
Token ID286
Feature activation-9.66e+00

Pos logit contributions
umm+1.435
isher+1.363
 sidelines+1.329
ubb+1.321
icious+1.314
Neg logit contributions
.-0.517
,-0.463
-0.436
 and-0.433
 to-0.322
Token them
Token ID606
Feature activation+2.63e+00

Pos logit contributions
 sidelines+0.907
agna+0.904
Ta+0.899
wy+0.891
 Majesty+0.887
Neg logit contributions
.-0.366
,-0.317
-0.269
 and-0.267
 to-0.222
Token from
Token ID422
Feature activation-1.64e+01

Pos logit contributions
 Majesty+0.167
 sidelines+0.162
 pulp+0.161
 majesty+0.161
 former+0.161
Neg logit contributions
.-0.042
,-0.038
 and-0.034
-0.024
 with-0.018
Token the
Token ID262
Feature activation-2.47e+01

Pos logit contributions
.+0.421
,+0.390
 and+0.370
+0.281
 to+0.264
Neg logit contributions
 Majesty-0.933
 sidelines-0.923
Ta-0.907
 former-0.900
umm-0.896
Token mean
Token ID1612
Feature activation+1.29e+01

Pos logit contributions
.+1.298
,+1.224
 and+1.203
 to+1.040
+1.025
Neg logit contributions
Ta-0.841
 Majesty-0.811
umm-0.807
agna-0.773
icious-0.764
Token boy
Token ID2933
Feature activation+1.92e+01

Pos logit contributions
 Majesty+0.496
umm+0.484
Ta+0.483
 sidelines+0.473
agna+0.466
Neg logit contributions
.-0.587
,-0.551
 and-0.539
-0.468
 to-0.458
Token.
Token ID13
Feature activation-4.67e+01

Pos logit contributions
 Majesty+1.299
agna+1.265
 supervision+1.262
 sidelines+1.260
 ounces+1.254
Neg logit contributions
.-0.059
,-0.047
 and-0.009
+0.023
 to+0.076
Token They
Token ID1119
Feature activation-4.39e+00

Pos logit contributions
.+1.507
,+1.387
 and+1.346
 to+1.061
 with+1.009
Neg logit contributions
 Majesty-2.428
Ta-2.392
umm-2.343
 sidelines-2.334
 majesty-2.314
Token give
Token ID1577
Feature activation+1.13e+01

Pos logit contributions
.+0.181
,+0.169
 and+0.164
+0.137
 to+0.136
Neg logit contributions
Ta-0.192
 Majesty-0.192
umm-0.187
 sidelines-0.185
agna-0.184
Token Spot
Token ID15899
Feature activation+1.43e+01

Pos logit contributions
Ta+0.641
 Majesty+0.637
umm+0.627
 sidelines+0.626
agna+0.622
Neg logit contributions
.-0.306
,-0.277
 and-0.263
-0.197
 with-0.185
Token a
Token ID257
Feature activation-2.03e+01

Pos logit contributions
wy+0.835
agna+0.824
Ta+0.823
 Majesty+0.818
isher+0.809
Neg logit contributions
.-0.254
,-0.235
 and-0.198
-0.141
 with-0.120
Token treat
Token ID2190
Feature activation+3.02e+01

Pos logit contributions
.+0.943
,+0.891
 and+0.869
 to+0.751
+0.736
Neg logit contributions
 Majesty-0.808
umm-0.803
Ta-0.800
agna-0.767
 sidelines-0.767
Token steps
Token ID4831
Feature activation+2.77e+01

Pos logit contributions
 Majesty+1.809
Ta+1.809
agna+1.782
 sidelines+1.773
 former+1.744
Neg logit contributions
.-0.880
,-0.745
 and-0.685
-0.635
 to-0.517
Token for
Token ID329
Feature activation-1.57e+01

Pos logit contributions
 sidelines+1.741
isher+1.639
Ta+1.631
vation+1.615
belt+1.604
Neg logit contributions
.-0.279
,-0.247
 and-0.196
-0.119
 with-0.081
Token him
Token ID683
Feature activation+2.81e+00

Pos logit contributions
.+0.401
,+0.369
 and+0.360
+0.267
 to+0.261
Neg logit contributions
 Majesty-0.876
 sidelines-0.876
 former-0.850
Ta-0.849
 majesty-0.839
Token to
Token ID284
Feature activation-2.54e+01

Pos logit contributions
 Majesty+0.199
 former+0.193
 majesty+0.192
 sidelines+0.187
agna+0.187
Neg logit contributions
.-0.016
,-0.011
 and-0.008
+0.001
 with+0.006
Token walk
Token ID2513
Feature activation+2.11e+01

Pos logit contributions
.+1.081
,+1.016
 and+0.995
+0.838
 to+0.832
Neg logit contributions
 Majesty-1.051
Ta-1.031
 sidelines-1.022
 former-1.006
umm-0.996
Token.
Token ID13
Feature activation-4.67e+01

Pos logit contributions
 former+1.260
 sidelines+1.256
belt+1.224
isher+1.222
immers+1.205
Neg logit contributions
.-0.119
,-0.104
 and-0.072
-0.045
 to+0.037
Token
Token ID198
Feature activation-3.79e+01

Pos logit contributions
.+1.543
,+1.426
 and+1.363
 to+1.072
 with+1.047
Neg logit contributions
 Majesty-2.406
Ta-2.391
umm-2.336
 sidelines-2.332
 former-2.312
Token
Token ID198
Feature activation-3.78e+01

Pos logit contributions
.+0.660
,+0.594
 and+0.520
 to+0.290
 with+0.266
Neg logit contributions
Ta-2.531
 Majesty-2.473
 sidelines-2.432
 former-2.414
 majesty-2.382
TokenTim
Token ID14967
Feature activation+2.37e+01

Pos logit contributions
.+1.605
,+1.493
 and+1.473
+1.294
 with+1.225
Neg logit contributions
umm-1.630
 Majesty-1.621
Ta-1.576
agna-1.555
 former-1.532
Tokenmy
Token ID1820
Feature activation+8.62e+00

Pos logit contributions
 Majesty+1.070
Ta+1.066
 sidelines+1.006
 former+0.999
 majesty+0.999
Neg logit contributions
.-0.870
,-0.788
 and-0.734
 to-0.639
-0.639
Token was
Token ID373
Feature activation-1.65e+01

Pos logit contributions
 Majesty+0.456
Ta+0.445
 majesty+0.429
 sidelines+0.427
 former+0.423
Neg logit contributions
.-0.260
,-0.222
 and-0.206
 to-0.172
-0.167
Token and
Token ID290
Feature activation-3.02e+01

Pos logit contributions
.+1.007
,+0.949
 and+0.927
+0.796
 to+0.791
Neg logit contributions
Ta-0.775
 Majesty-0.771
umm-0.753
 sidelines-0.740
agna-0.733
Token said
Token ID531
Feature activation-8.83e+00

Pos logit contributions
.+1.155
,+1.073
 and+1.043
 to+0.847
+0.847
Neg logit contributions
Ta-1.424
 Majesty-1.414
umm-1.400
 sidelines-1.369
agna-1.364
Token they
Token ID484
Feature activation-1.07e+01

Pos logit contributions
.+0.112
 and+0.098
,+0.075
+0.048
 with+0.033
Neg logit contributions
 Majesty-0.598
 majesty-0.570
agna-0.549
 pulp-0.545
 former-0.543
Token were
Token ID547
Feature activation-6.06e+00

Pos logit contributions
.+0.399
,+0.377
 and+0.363
+0.306
 with+0.297
Neg logit contributions
 Majesty-0.479
Ta-0.475
agna-0.462
 former-0.458
 pulp-0.454
Token sorry
Token ID7926
Feature activation+1.86e+01

Pos logit contributions
.+0.230
,+0.214
 and+0.209
+0.172
 to+0.169
Neg logit contributions
 Majesty-0.284
Ta-0.279
umm-0.274
 sidelines-0.273
agna-0.272
Token.
Token ID13
Feature activation-4.67e+01

Pos logit contributions
 Majesty+1.071
 former+1.041
umm+1.038
 majesty+1.036
icious+1.027
Neg logit contributions
.-0.101
,-0.095
 and-0.045
-0.041
 the-0.013
Token They
Token ID1119
Feature activation-4.39e+00

Pos logit contributions
.+1.347
,+1.232
 and+1.183
 to+0.889
 with+0.848
Neg logit contributions
 Majesty-2.544
Ta-2.515
 majesty-2.448
umm-2.439
 former-2.435
Token felt
Token ID2936
Feature activation+2.42e+00

Pos logit contributions
.+0.166
,+0.155
 and+0.150
+0.123
 to+0.122
Neg logit contributions
Ta-0.207
 Majesty-0.205
umm-0.200
 sidelines-0.199
agna-0.197
Token bad
Token ID2089
Feature activation+5.60e+00

Pos logit contributions
Ta+0.082
 Majesty+0.081
 sidelines+0.079
 former+0.077
 pulp+0.076
Neg logit contributions
.-0.117
,-0.111
 and-0.109
-0.095
 to-0.094
Token for
Token ID329
Feature activation-1.48e+01

Pos logit contributions
ses+0.339
umm+0.328
 Majesty+0.328
sed+0.323
 inconvenience+0.316
Neg logit contributions
,-0.022
.-0.021
-0.018
 the-0.013
 and-0.001
Token ruining
Token ID39842
Feature activation+2.91e+01

Pos logit contributions
.+0.584
,+0.550
 and+0.543
+0.459
 to+0.447
Neg logit contributions
 sidelines-0.624
 Majesty-0.622
agna-0.601
 former-0.590
 pulp-0.589
Token's
Token ID338
Feature activation-1.74e+01

Pos logit contributions
 Majesty+0.034
Ta+0.033
 majesty+0.032
agna+0.032
umm+0.032
Neg logit contributions
.-0.027
,-0.026
 and-0.025
-0.020
 with-0.019
Token skin
Token ID4168
Feature activation+3.11e+01

Pos logit contributions
.+0.754
,+0.710
 and+0.690
+0.583
 to+0.575
Neg logit contributions
 Majesty-0.724
Ta-0.713
umm-0.702
 sidelines-0.691
 former-0.690
Token is
Token ID318
Feature activation-7.72e+00

Pos logit contributions
atmeal+1.992
umm+1.989
 Majesty+1.985
wy+1.962
icious+1.955
Neg logit contributions
.-0.253
,-0.199
 and-0.152
-0.112
 to-0.022
Token not
Token ID407
Feature activation-3.78e+00

Pos logit contributions
.+0.287
,+0.270
 and+0.259
+0.220
 to+0.207
Neg logit contributions
 Majesty-0.348
 former-0.343
Ta-0.339
umm-0.336
 sidelines-0.335
Token boring
Token ID14262
Feature activation+1.93e+01

Pos logit contributions
.+0.129
,+0.124
 and+0.117
+0.101
 with+0.093
Neg logit contributions
 Majesty-0.177
 majesty-0.170
 former-0.170
 pulp-0.169
Ta-0.168
Token.
Token ID13
Feature activation-4.67e+01

Pos logit contributions
umm+1.171
 Majesty+1.141
 former+1.107
 inconvenience+1.093
 unnecessary+1.085
Neg logit contributions
.-0.168
-0.168
,-0.166
 and-0.155
 the-0.091
Token It
Token ID632
Feature activation+7.07e+00

Pos logit contributions
.+1.850
,+1.725
 and+1.674
 to+1.376
+1.349
Neg logit contributions
Ta-2.134
 Majesty-2.123
umm-2.082
 sidelines-2.039
agna-2.034
Token is
Token ID318
Feature activation-6.99e+00

Pos logit contributions
Ta+0.338
 Majesty+0.335
umm+0.328
 former+0.323
agna+0.322
Neg logit contributions
.-0.256
,-0.239
 and-0.229
-0.187
 to-0.185
Token beautiful
Token ID4950
Feature activation+1.35e+01

Pos logit contributions
.+0.248
,+0.232
 and+0.222
+0.182
 to+0.175
Neg logit contributions
 Majesty-0.335
Ta-0.334
 former-0.328
umm-0.327
agna-0.324
Token,
Token ID11
Feature activation-3.60e+01

Pos logit contributions
umm+0.923
 Majesty+0.906
 former+0.871
 majesty+0.868
 pulp+0.853
Neg logit contributions
.-0.051
-0.044
,-0.034
 and-0.003
 the+0.036
Token like
Token ID588
Feature activation-1.09e+01

Pos logit contributions
.+1.076
,+0.971
 and+0.918
+0.722
 to+0.701
Neg logit contributions
Ta-1.958
 Majesty-1.953
umm-1.933
 sidelines-1.901
 former-1.899
Token.
Token ID13
Feature activation-3.96e+01

Pos logit contributions
umm+2.072
atmeal+1.937
 Majesty+1.909
agna+1.885
 former+1.873
Neg logit contributions
,-0.046
.-0.019
 and-0.013
+0.054
 to+0.115
Token
Token ID198
Feature activation-3.09e+01

Pos logit contributions
.+1.383
,+1.284
 and+1.235
 to+0.984
 with+0.966
Neg logit contributions
Ta-1.958
 Majesty-1.928
umm-1.904
 sidelines-1.890
 former-1.861
Token
Token ID198
Feature activation-3.08e+01

Pos logit contributions
.+0.546
,+0.509
 and+0.439
 to+0.258
 with+0.206
Neg logit contributions
Ta-1.892
 sidelines-1.754
icious-1.735
 Majesty-1.719
bek-1.713
TokenA
Token ID32
Feature activation+9.58e+00

Pos logit contributions
.+1.317
,+1.233
 and+1.202
+1.007
 to+0.998
Neg logit contributions
Ta-1.330
 Majesty-1.328
umm-1.305
 sidelines-1.277
agna-1.270
Token mer
Token ID4017
Feature activation+3.43e+01

Pos logit contributions
Ta+0.503
 Majesty+0.501
umm+0.494
 sidelines+0.486
agna+0.484
Neg logit contributions
.-0.317
,-0.292
 and-0.281
-0.221
 to-0.219
Tokenmaid
Token ID23151
Feature activation+5.15e+01

Pos logit contributions
Ta+0.619
 sidelines+0.558
 Majesty+0.548
umm+0.544
agna+0.518
Neg logit contributions
.-2.362
,-2.279
 and-2.249
 to-2.011
 with-2.008
Token that
Token ID326
Feature activation-1.19e+01

Pos logit contributions
 Majesty+2.155
Ta+2.081
 majesty+2.074
 pulp+1.999
atmeal+1.944
Neg logit contributions
.-1.671
,-1.459
 and-1.431
-1.247
 to-1.237
Token lived
Token ID5615
Feature activation+1.85e+01

Pos logit contributions
.+0.327
,+0.292
 and+0.282
 to+0.203
+0.203
Neg logit contributions
Ta-0.682
 Majesty-0.675
umm-0.663
 sidelines-0.650
agna-0.649
Token in
Token ID287
Feature activation-9.12e+00

Pos logit contributions
 Majesty+1.047
 former+1.031
Ta+1.024
ubb+1.021
isher+1.016
Neg logit contributions
.-0.359
,-0.337
 and-0.290
-0.217
 to-0.174
Token the
Token ID262
Feature activation-1.68e+01

Pos logit contributions
.+0.156
,+0.136
 and+0.129
 to+0.076
+0.065
Neg logit contributions
Ta-0.588
 Majesty-0.571
 sidelines-0.564
 former-0.562
umm-0.558
Token deep
Token ID2769
Feature activation+1.55e+01

Pos logit contributions
.+0.902
,+0.856
 and+0.837
+0.728
 to+0.727
Neg logit contributions
Ta-0.549
 Majesty-0.541
umm-0.534
agna-0.524
 sidelines-0.518
Token child
Token ID1200
Feature activation+3.29e+01

Pos logit contributions
Ta+0.712
 Majesty+0.703
umm+0.683
isher+0.675
 pulp+0.663
Neg logit contributions
.-0.904
,-0.827
 and-0.815
-0.722
 to-0.700
Token visited
Token ID8672
Feature activation+2.68e+01

Pos logit contributions
Ta+1.316
 Majesty+1.307
 pulp+1.270
isher+1.248
 majesty+1.242
Neg logit contributions
.-1.201
 and-1.068
,-1.047
-0.956
 to-0.879
Token him
Token ID683
Feature activation+1.09e+01

Pos logit contributions
 sidelines+1.412
 Majesty+1.394
 former+1.394
Ta+1.376
isher+1.374
Neg logit contributions
.-0.490
,-0.416
 and-0.349
 with-0.224
-0.200
Token in
Token ID287
Feature activation-8.92e+00

Pos logit contributions
 former+0.691
 Majesty+0.665
agna+0.645
arians+0.643
 sidelines+0.640
Neg logit contributions
.-0.114
,-0.092
 and-0.065
-0.051
 the-0.011
Token the
Token ID262
Feature activation-1.66e+01

Pos logit contributions
.+0.159
,+0.152
 and+0.142
 to+0.093
 with+0.085
Neg logit contributions
 sidelines-0.490
Ta-0.487
 former-0.481
umm-0.469
agna-0.466
Token garage
Token ID15591
Feature activation+4.44e+01

Pos logit contributions
.+0.803
,+0.753
 and+0.736
+0.636
 to+0.632
Neg logit contributions
Ta-0.634
 Majesty-0.633
umm-0.615
agna-0.606
 sidelines-0.601
Token and
Token ID290
Feature activation-2.21e+01

Pos logit contributions
 former+2.899
 Majesty+2.867
 sidelines+2.818
umm+2.758
isher+2.729
Neg logit contributions
.-0.136
,-0.110
 and-0.021
+0.079
 the+0.173
Token asked
Token ID1965
Feature activation+4.12e+00

Pos logit contributions
.+0.812
,+0.744
 and+0.721
+0.584
 to+0.574
Neg logit contributions
 Majesty-1.070
Ta-1.067
 sidelines-1.034
umm-1.030
agna-1.027
Token him
Token ID683
Feature activation+1.10e+01

Pos logit contributions
 Majesty+0.196
agna+0.182
 majesty+0.181
ocol+0.176
heit+0.176
Neg logit contributions
.-0.113
 and-0.100
,-0.091
-0.077
 with-0.076
Token why
Token ID1521
Feature activation+6.85e+00

Pos logit contributions
 Majesty+0.652
 majesty+0.620
agna+0.617
 former+0.585
uries+0.567
Neg logit contributions
.-0.221
 and-0.189
,-0.164
-0.132
 with-0.106
Token he
Token ID339
Feature activation+1.10e-01

Pos logit contributions
 Majesty+0.333
agna+0.330
atmeal+0.328
 majesty+0.326
 sidelines+0.320
Neg logit contributions
.-0.164
 and-0.158
,-0.152
-0.125
 with-0.114
Token play
Token ID711
Feature activation+1.58e+01

Pos logit contributions
.+0.808
,+0.768
 and+0.744
+0.605
 with+0.598
Neg logit contributions
Ta-0.915
 sidelines-0.906
 Majesty-0.901
 former-0.867
agna-0.861
Token.
Token ID13
Feature activation-4.26e+01

Pos logit contributions
 sidelines+0.990
isher+0.986
affles+0.968
agna+0.958
ocol+0.958
Neg logit contributions
,-0.084
.-0.077
 and-0.056
-0.031
 to-0.003
Token Billy
Token ID15890
Feature activation+1.96e+01

Pos logit contributions
.+1.587
,+1.451
 and+1.409
 with+1.135
 to+1.118
Neg logit contributions
 sidelines-1.938
umm-1.918
Ta-1.901
 Majesty-1.897
agna-1.857
Token saw
Token ID2497
Feature activation+2.91e+00

Pos logit contributions
 Majesty+0.881
Ta+0.853
 sidelines+0.831
 majesty+0.827
heit+0.827
Neg logit contributions
.-0.687
,-0.600
 and-0.573
 to-0.487
-0.487
Token the
Token ID262
Feature activation-1.99e+01

Pos logit contributions
Ta+0.159
 sidelines+0.156
 Majesty+0.153
 pulp+0.153
isher+0.151
Neg logit contributions
.-0.076
,-0.066
 and-0.062
 with-0.045
-0.045
Token razor
Token ID26811
Feature activation+4.56e+01

Pos logit contributions
.+1.213
,+1.157
 and+1.147
 to+1.011
+1.004
Neg logit contributions
 Majesty-0.513
Ta-0.505
umm-0.490
 former-0.477
 sidelines-0.473
Token and
Token ID290
Feature activation-2.53e+01

Pos logit contributions
agna+2.875
umm+2.872
atmeal+2.872
 sidelines+2.868
isher+2.859
Neg logit contributions
.-0.349
-0.198
,-0.191
 and-0.082
 to+0.048
Token wanted
Token ID2227
Feature activation+3.45e+00

Pos logit contributions
.+0.942
,+0.863
 and+0.839
+0.687
 to+0.675
Neg logit contributions
 Majesty-1.175
Ta-1.152
 sidelines-1.131
 pulp-1.116
 majesty-1.111
Token to
Token ID284
Feature activation-2.05e+01

Pos logit contributions
Ta+0.214
 sidelines+0.213
 pulp+0.208
agna+0.207
 Majesty+0.206
Neg logit contributions
.-0.054
,-0.048
 and-0.043
-0.022
 with-0.018
Token play
Token ID711
Feature activation+1.63e+01

Pos logit contributions
.+0.826
,+0.771
 and+0.753
+0.621
 to+0.612
Neg logit contributions
Ta-0.913
 Majesty-0.904
umm-0.882
 sidelines-0.881
 former-0.864
Token with
Token ID351
Feature activation-1.08e+01

Pos logit contributions
 sidelines+0.964
icious+0.933
Ta+0.933
isher+0.933
agna+0.929
Neg logit contributions
.-0.228
,-0.200
 and-0.194
-0.124
 to-0.110
Token play
Token ID711
Feature activation+1.88e+01

Pos logit contributions
.+0.736
,+0.685
 and+0.669
+0.561
 to+0.551
Neg logit contributions
Ta-0.785
 Majesty-0.783
umm-0.756
 sidelines-0.754
 former-0.747
Token hide
Token ID7808
Feature activation+3.10e+01

Pos logit contributions
agna+1.046
 sidelines+1.041
Ta+1.040
 Majesty+1.036
atmeal+1.033
Neg logit contributions
.-0.371
,-0.308
 and-0.276
-0.213
 to-0.190
Token-
Token ID12
Feature activation+1.45e+01

Pos logit contributions
 Majesty+1.987
 former+1.945
 majesty+1.940
isher+1.930
agna+1.924
Neg logit contributions
.-0.375
,-0.278
-0.235
 and-0.170
 to-0.113
Tokenand
Token ID392
Feature activation+3.44e+01

Pos logit contributions
Ta+0.479
 Majesty+0.477
umm+0.466
 sidelines+0.454
agna+0.450
Neg logit contributions
.-0.756
,-0.717
 and-0.702
-0.610
 to-0.608
Token-
Token ID12
Feature activation+1.45e+01

Pos logit contributions
 Majesty+1.751
Ta+1.702
agna+1.690
 majesty+1.682
 sidelines+1.653
Neg logit contributions
.-1.062
,-0.958
 and-0.937
-0.792
 to-0.733
Tokenseek
Token ID36163
Feature activation+4.39e+01

Pos logit contributions
Ta+0.251
 Majesty+0.237
 sidelines+0.223
umm+0.214
agna+0.214
Neg logit contributions
.-0.968
,-0.932
 and-0.914
-0.825
 to-0.805
Token and
Token ID290
Feature activation-2.25e+01

Pos logit contributions
 former+2.147
 Majesty+2.120
immers+2.099
 majesty+2.084
iders+2.075
Neg logit contributions
.-0.435
-0.414
,-0.349
 and-0.266
 to-0.197
Token tag
Token ID7621
Feature activation+3.70e+01

Pos logit contributions
.+1.001
,+0.941
 and+0.917
+0.772
 to+0.770
Neg logit contributions
Ta-0.930
 Majesty-0.925
umm-0.910
 sidelines-0.889
agna-0.884
Token with
Token ID351
Feature activation-8.12e+00

Pos logit contributions
 Majesty+2.299
 majesty+2.259
 former+2.251
agna+2.246
 pulp+2.236
Neg logit contributions
-0.075
,-0.031
 and-0.025
.-0.023
 to+0.150
Token her
Token ID607
Feature activation-2.38e+00

Pos logit contributions
.+0.214
,+0.193
 and+0.185
+0.132
 to+0.131
Neg logit contributions
Ta-0.479
 Majesty-0.477
umm-0.469
 sidelines-0.464
agna-0.461
Token friends
Token ID2460
Feature activation+2.26e+01

Pos logit contributions
.+0.094
,+0.087
 and+0.086
+0.071
 to+0.070
Neg logit contributions
 Majesty-0.105
Ta-0.103
umm-0.102
 former-0.100
 majesty-0.100
Token pilot
Token ID8022
Feature activation+5.08e+01

Pos logit contributions
.+0.766
,+0.728
 and+0.708
+0.613
 to+0.609
Neg logit contributions
 Majesty-0.553
Ta-0.549
 sidelines-0.531
umm-0.531
agna-0.516
Token.
Token ID13
Feature activation-3.79e+01

Pos logit contributions
 Majesty+3.396
 sidelines+3.249
arians+3.210
 former+3.203
 majesty+3.191
Neg logit contributions
.-0.078
,-0.013
 and+0.003
+0.050
 to+0.254
Token
Token ID198
Feature activation-2.92e+01

Pos logit contributions
.+1.226
,+1.122
 and+1.075
 with+0.801
 to+0.781
Neg logit contributions
 Majesty-1.945
Ta-1.937
umm-1.907
 sidelines-1.899
 former-1.859
Token
Token ID198
Feature activation-2.92e+01

Pos logit contributions
.+0.511
,+0.436
 and+0.377
 to+0.190
 with+0.159
Neg logit contributions
Ta-1.904
 sidelines-1.815
 Majesty-1.801
 majesty-1.760
vation-1.752
TokenThe
Token ID464
Feature activation+4.15e+00

Pos logit contributions
.+1.124
,+1.045
 and+1.031
+0.873
 with+0.842
Neg logit contributions
 Majesty-1.388
umm-1.372
Ta-1.353
agna-1.321
 former-1.318
Token pilot
Token ID8022
Feature activation+5.09e+01

Pos logit contributions
 Majesty+0.151
Ta+0.151
umm+0.145
 sidelines+0.144
agna+0.142
Neg logit contributions
.-0.203
,-0.191
 and-0.187
-0.161
 to-0.160
Token waved
Token ID26834
Feature activation+2.62e+01

Pos logit contributions
 Majesty+2.337
Ta+2.313
 former+2.199
 majesty+2.199
isher+2.184
Neg logit contributions
.-1.766
,-1.564
 and-1.468
-1.296
 to-1.280
Token back
Token ID736
Feature activation+7.55e+00

Pos logit contributions
 Majesty+1.026
Ta+0.987
 sidelines+0.981
 reins+0.976
isher+0.967
Neg logit contributions
.-0.876
,-0.789
-0.718
 and-0.717
 with-0.608
Token.
Token ID13
Feature activation-3.78e+01

Pos logit contributions
 former+0.453
 Majesty+0.452
 reins+0.431
 sidelines+0.430
bek+0.429
Neg logit contributions
.-0.063
,-0.042
-0.032
 and-0.019
 a+0.003
Token Soon
Token ID15894
Feature activation+2.61e+01

Pos logit contributions
.+1.241
,+1.122
 and+1.076
 with+0.830
 to+0.819
Neg logit contributions
 Majesty-1.977
umm-1.934
Ta-1.922
 sidelines-1.890
 former-1.879
Token the
Token ID262
Feature activation-1.53e+01

Pos logit contributions
 Majesty+1.690
Ta+1.614
 majesty+1.607
umm+1.604
agna+1.593
Neg logit contributions
.-0.412
 and-0.283
,-0.222
-0.165
 with-0.126
Token<|endoftext|>
Token ID50256
Feature activation+6.10e-01

Pos logit contributions
.+1.313
,+1.219
 and+1.153
 to+0.901
 with+0.877
Neg logit contributions
Ta-2.092
umm-2.069
 Majesty-2.068
 sidelines-2.030
 former-2.029
TokenOnce
Token ID7454
Feature activation+1.61e+00

Pos logit contributions
Ta+0.025
umm+0.024
rob+0.023
Whe+0.023
roach+0.022
Neg logit contributions
,-0.020
.-0.020
 and-0.019
 to-0.016
 with-0.015
Token there
Token ID612
Feature activation-1.46e+00

Pos logit contributions
 Majesty+0.046
 majesty+0.045
isher+0.044
Ta+0.043
heit+0.042
Neg logit contributions
.-0.071
 and-0.067
,-0.063
 with-0.060
 to-0.055
Token was
Token ID373
Feature activation-1.07e+01

Pos logit contributions
.+0.038
 and+0.032
,+0.029
 to+0.026
 with+0.023
Neg logit contributions
Ta-0.070
 Majesty-0.069
umm-0.069
isher-0.067
liv-0.067
Token a
Token ID257
Feature activation-1.43e+01

Pos logit contributions
.+0.194
,+0.163
 and+0.143
 to+0.086
 with+0.077
Neg logit contributions
Ta-0.709
 Majesty-0.669
umm-0.665
 former-0.659
 sidelines-0.656
Token ch
Token ID442
Feature activation+1.67e+01

Pos logit contributions
.+0.491
,+0.452
 and+0.438
+0.347
 to+0.346
Neg logit contributions
Ta-0.723
 Majesty-0.721
umm-0.709
 sidelines-0.699
agna-0.695
Tokenubby
Token ID38393
Feature activation+4.32e+01

Pos logit contributions
Ta+0.388
agna+0.364
 Majesty+0.363
umm+0.353
 sidelines+0.344
Neg logit contributions
.-1.069
,-1.032
 and-1.009
 with-0.884
 to-0.879
Token little
Token ID1310
Feature activation+2.16e+00

Pos logit contributions
 Majesty+1.807
umm+1.763
Ta+1.737
 majesty+1.736
 sidelines+1.726
Neg logit contributions
.-1.695
,-1.543
 and-1.489
 to-1.326
-1.313
Token girl
Token ID2576
Feature activation+2.62e+01

Pos logit contributions
Ta+0.080
 Majesty+0.080
umm+0.078
 sidelines+0.076
agna+0.075
Neg logit contributions
.-0.103
,-0.097
 and-0.095
-0.082
 to-0.081
Token called
Token ID1444
Feature activation-5.66e-01

Pos logit contributions
 Majesty+0.623
bek+0.618
 majesty+0.595
 former+0.588
oise+0.585
Neg logit contributions
 and-0.977
.-0.977
,-0.948
-0.902
 to-0.859
Token Lucy
Token ID22162
Feature activation+2.49e+01

Pos logit contributions
.+0.024
 and+0.023
,+0.022
+0.018
 with+0.018
Neg logit contributions
 Majesty-0.022
Ta-0.021
 majesty-0.021
umm-0.021
 former-0.020
Token on
Token ID319
Feature activation-2.93e+00

Pos logit contributions
umm+0.526
 former+0.522
 sidelines+0.516
ubb+0.506
isher+0.502
Neg logit contributions
.-0.147
,-0.130
-0.111
 and-0.111
 with-0.048
Token the
Token ID262
Feature activation-1.66e+01

Pos logit contributions
.+0.052
,+0.047
 and+0.042
+0.029
 to+0.024
Neg logit contributions
 sidelines-0.172
 pulp-0.170
 former-0.168
agna-0.167
umm-0.166
Token grass
Token ID8701
Feature activation+2.82e+01

Pos logit contributions
.+0.813
,+0.770
 and+0.749
+0.650
 to+0.644
Neg logit contributions
Ta-0.602
 Majesty-0.596
umm-0.584
 sidelines-0.581
agna-0.566
Token and
Token ID290
Feature activation-2.21e+01

Pos logit contributions
umm+1.911
 former+1.896
 Majesty+1.853
 sidelines+1.843
isher+1.828
Neg logit contributions
.-0.043
,-0.009
 and+0.018
+0.064
 the+0.124
Token just
Token ID655
Feature activation+2.81e+00

Pos logit contributions
.+0.945
,+0.888
 and+0.867
 to+0.722
+0.717
Neg logit contributions
Ta-0.956
 Majesty-0.950
umm-0.939
 sidelines-0.915
agna-0.908
Token closed
Token ID4838
Feature activation+2.12e+01

Pos logit contributions
Ta+0.117
 Majesty+0.116
umm+0.113
 sidelines+0.111
 former+0.111
Neg logit contributions
.-0.118
,-0.110
 and-0.108
-0.093
 to-0.089
Token their
Token ID511
Feature activation+7.19e-01

Pos logit contributions
 Majesty+1.104
isher+1.099
 pulp+1.080
 former+1.080
 sidelines+1.070
Neg logit contributions
.-0.440
 and-0.417
,-0.406
-0.354
 with-0.270
Token eyes
Token ID2951
Feature activation+3.02e+01

Pos logit contributions
Ta+0.027
 Majesty+0.026
umm+0.026
 sidelines+0.025
agna+0.025
Neg logit contributions
.-0.035
,-0.033
 and-0.032
-0.028
 to-0.028
Token.
Token ID13
Feature activation-3.89e+01

Pos logit contributions
umm+1.978
 former+1.960
mic+1.870
agna+1.870
icious+1.863
Neg logit contributions
,-0.043
-0.023
.-0.018
 and-0.018
 the+0.104
Token
Token ID198
Feature activation-3.02e+01

Pos logit contributions
.+0.727
,+0.629
 and+0.575
 with+0.306
 to+0.302
Neg logit contributions
Ta-2.529
umm-2.468
 Majesty-2.466
 sidelines-2.454
agna-2.444
Token
Token ID198
Feature activation-3.02e+01

Pos logit contributions
.+0.564
,+0.510
 and+0.447
 to+0.262
 with+0.243
Neg logit contributions
Ta-1.809
 sidelines-1.700
affles-1.675
 Majesty-1.668
icious-1.664
Token<|endoftext|>
Token ID50256
Feature activation+2.83e+00

Pos logit contributions
.+1.173
,+1.095
 and+1.038
 to+0.802
 with+0.790
Neg logit contributions
Ta-2.020
 Majesty-1.996
umm-1.992
 former-1.946
 majesty-1.939
TokenBen
Token ID11696
Feature activation+3.64e+01

Pos logit contributions
Ta+0.098
rob+0.094
Whe+0.089
umm+0.089
atmeal+0.087
Neg logit contributions
,-0.110
.-0.109
 and-0.106
 to-0.093
 with-0.086
Token and
Token ID290
Feature activation-2.11e+01

Pos logit contributions
Ta+2.271
 Majesty+2.223
wy+2.222
 majesty+2.213
isher+2.178
Neg logit contributions
.-0.468
,-0.296
 and-0.173
 to-0.125
-0.122
Token Lily
Token ID20037
Feature activation+1.11e+01

Pos logit contributions
.+0.585
,+0.527
 and+0.494
+0.357
 with+0.357
Neg logit contributions
Ta-1.187
 Majesty-1.141
 sidelines-1.118
 pulp-1.113
umm-1.113
Token were
Token ID547
Feature activation+3.05e+00

Pos logit contributions
Ta+0.486
 Majesty+0.461
wy+0.454
agna+0.447
 pulp+0.444
Neg logit contributions
.-0.399
,-0.362
 and-0.337
 to-0.310
-0.305
Token playing
Token ID2712
Feature activation+1.87e+01

Pos logit contributions
 Majesty+0.137
Ta+0.133
 sidelines+0.130
 former+0.129
 majesty+0.128
Neg logit contributions
.-0.114
,-0.104
 and-0.101
-0.083
 to-0.082
Token in
Token ID287
Feature activation-7.68e+00

Pos logit contributions
 sidelines+1.057
Ta+1.050
isher+1.047
 Majesty+1.046
agna+1.008
Neg logit contributions
.-0.318
,-0.286
 and-0.257
-0.194
 to-0.167
Token the
Token ID262
Feature activation-1.54e+01

Pos logit contributions
.+0.141
,+0.123
 and+0.111
 to+0.063
+0.055
Neg logit contributions
Ta-0.478
 sidelines-0.474
 Majesty-0.455
isher-0.447
 pulp-0.446
Token snow
Token ID6729
Feature activation+3.46e+01

Pos logit contributions
.+0.756
,+0.713
 and+0.697
+0.598
 to+0.596
Neg logit contributions
Ta-0.562
 Majesty-0.558
umm-0.544
 sidelines-0.536
agna-0.527
Token.
Token ID13
Feature activation-3.79e+01

Pos logit contributions
umm+2.398
atmeal+2.387
 Majesty+2.289
isher+2.276
agna+2.266
Neg logit contributions
,-0.028
.-0.027
 and+0.002
+0.014
 to+0.092
Token They
Token ID1119
Feature activation+4.34e+00

Pos logit contributions
.+1.266
,+1.151
 and+1.106
 with+0.878
 to+0.870
Neg logit contributions
 Majesty-1.838
Ta-1.828
umm-1.826
 sidelines-1.814
agna-1.805
TokenA
Token ID32
Feature activation+6.72e+00

Pos logit contributions
.+1.696
,+1.604
 and+1.582
+1.397
 with+1.362
Neg logit contributions
 Majesty-1.181
umm-1.178
Ta-1.165
 sidelines-1.126
agna-1.122
Tokenmber
Token ID1916
Feature activation+4.39e+01

Pos logit contributions
Ta+0.092
 Majesty+0.089
umm+0.084
 sidelines+0.079
agna+0.075
Neg logit contributions
.-0.477
,-0.454
 and-0.450
 to-0.408
-0.407
Token thought
Token ID1807
Feature activation+1.02e+01

Pos logit contributions
Ta+2.089
 Majesty+2.079
 pulp+1.998
umm+1.979
 majesty+1.979
Neg logit contributions
.-1.381
,-1.198
 and-1.128
-1.004
 to-0.938
Token it
Token ID340
Feature activation+5.37e-01

Pos logit contributions
 Majesty+0.551
isher+0.551
Ta+0.549
reath+0.538
ocol+0.536
Neg logit contributions
.-0.212
,-0.170
 and-0.164
-0.152
 with-0.134
Token was
Token ID373
Feature activation-1.27e+01

Pos logit contributions
Ta+0.028
agna+0.028
umm+0.028
 Majesty+0.028
 former+0.027
Neg logit contributions
.-0.014
,-0.013
 and-0.012
-0.009
 to-0.009
Token amazing
Token ID4998
Feature activation+1.71e+01

Pos logit contributions
.+0.328
 and+0.303
,+0.300
+0.221
 to+0.220
Neg logit contributions
 Majesty-0.691
Ta-0.671
umm-0.670
agna-0.659
 majesty-0.652
Token.
Token ID13
Feature activation-4.20e+01

Pos logit contributions
umm+1.041
 Majesty+1.018
 former+0.958
isher+0.958
 majesty+0.954
Neg logit contributions
.-0.138
-0.131
,-0.101
 and-0.091
 with-0.016
Token "
Token ID366
Feature activation-1.27e+01

Pos logit contributions
.+1.369
,+1.244
 and+1.180
 with+0.903
 to+0.877
Neg logit contributions
 Majesty-2.198
umm-2.163
Ta-2.133
 former-2.061
 majesty-2.060
TokenLet
Token ID5756
Feature activation+2.37e+01

Pos logit contributions
.+0.572
 and+0.535
,+0.519
+0.485
 with+0.462
Neg logit contributions
 Majesty-0.528
 majesty-0.505
 sidelines-0.500
agna-0.485
umm-0.484
Token's
Token ID338
Feature activation-1.22e+01

Pos logit contributions
Ta+1.232
 Majesty+1.223
umm+1.166
icious+1.158
heit+1.156
Neg logit contributions
.-0.673
,-0.593
 and-0.577
 to-0.446
 with-0.444
Token play
Token ID711
Feature activation+1.66e+01

Pos logit contributions
.+0.509
,+0.474
 and+0.464
+0.392
 to+0.387
Neg logit contributions
 Majesty-0.522
Ta-0.513
umm-0.504
agna-0.497
 former-0.496
Token into
Token ID656
Feature activation-1.13e+00

Pos logit contributions
isher+2.153
agna+2.138
 pulp+2.109
umm+2.108
vell+2.108
Neg logit contributions
.-0.333
-0.271
,-0.204
 and-0.149
 to+0.012
Token piles
Token ID30935
Feature activation+3.56e+01

Pos logit contributions
.+0.040
,+0.038
 and+0.037
+0.030
 to+0.030
Neg logit contributions
 Majesty-0.052
Ta-0.051
 former-0.051
 sidelines-0.051
umm-0.051
Token.
Token ID13
Feature activation-4.24e+01

Pos logit contributions
umm+2.172
isher+2.150
ubb+2.094
 pulp+2.067
 sidelines+2.057
Neg logit contributions
.-0.130
-0.128
,-0.040
 and-0.001
 to+0.118
Token
Token ID198
Feature activation-3.36e+01

Pos logit contributions
.+1.305
,+1.203
 and+1.145
 to+0.851
 with+0.844
Neg logit contributions
Ta-2.277
umm-2.253
 Majesty-2.235
 sidelines-2.205
 former-2.190
Token
Token ID198
Feature activation-3.35e+01

Pos logit contributions
.+0.567
,+0.505
 and+0.433
 to+0.223
 with+0.204
Neg logit contributions
Ta-2.223
 sidelines-2.122
 Majesty-2.096
icious-2.085
 former-2.068
TokenThen
Token ID6423
Feature activation+1.55e+01

Pos logit contributions
.+1.400
,+1.304
 and+1.290
+1.110
 with+1.064
Neg logit contributions
 Majesty-1.490
umm-1.470
Ta-1.448
agna-1.412
 majesty-1.405
Token,
Token ID11
Feature activation-3.20e+01

Pos logit contributions
 Majesty+1.030
agna+0.993
 pulp+0.982
 majesty+0.981
umm+0.981
Neg logit contributions
.-0.202
 and-0.147
,-0.110
-0.080
 to-0.050
Token the
Token ID262
Feature activation-1.96e+01

Pos logit contributions
.+0.716
,+0.608
 and+0.594
+0.405
 to+0.373
Neg logit contributions
 Majesty-1.946
Ta-1.937
umm-1.912
 sidelines-1.901
agna-1.900
Token mom
Token ID1995
Feature activation+1.39e+01

Pos logit contributions
.+0.851
,+0.793
 and+0.777
 to+0.645
+0.630
Neg logit contributions
Ta-0.854
 Majesty-0.849
umm-0.827
agna-0.813
 sidelines-0.807
Token cried
Token ID16896
Feature activation+1.56e+01

Pos logit contributions
Ta+0.640
 Majesty+0.635
umm+0.624
agna+0.614
isher+0.607
Neg logit contributions
.-0.513
,-0.469
 and-0.446
-0.384
 to-0.367
Token out
Token ID503
Feature activation+1.31e+00

Pos logit contributions
Ta+0.852
umm+0.849
 sidelines+0.845
 former+0.839
mic+0.834
Neg logit contributions
.-0.252
,-0.213
 and-0.184
-0.175
 with-0.119
Token parents
Token ID3397
Feature activation+3.25e+01

Pos logit contributions
.+0.527
,+0.497
 and+0.485
+0.418
 to+0.414
Neg logit contributions
 Majesty-0.411
Ta-0.409
umm-0.400
 sidelines-0.391
agna-0.388
Token decided
Token ID3066
Feature activation+8.86e+00

Pos logit contributions
 Majesty+1.431
Ta+1.403
agna+1.372
umm+1.368
isher+1.355
Neg logit contributions
.-1.207
,-1.076
 and-1.025
-0.897
 to-0.870
Token to
Token ID284
Feature activation-1.88e+01

Pos logit contributions
 Majesty+0.560
 former+0.540
 sidelines+0.536
agna+0.532
 majesty+0.529
Neg logit contributions
.-0.141
,-0.110
 and-0.096
-0.066
 with-0.037
Token spend
Token ID4341
Feature activation+3.01e+01

Pos logit contributions
.+0.830
,+0.775
 and+0.758
+0.645
 to+0.638
Neg logit contributions
 Majesty-0.759
Ta-0.748
umm-0.739
 former-0.730
 sidelines-0.723
Token more
Token ID517
Feature activation+1.32e+00

Pos logit contributions
Ta+1.612
 Majesty+1.577
 former+1.556
 sidelines+1.549
wy+1.516
Neg logit contributions
.-0.783
,-0.699
 and-0.663
-0.504
 to-0.499
Token time
Token ID640
Feature activation+1.04e+00

Pos logit contributions
 Majesty+0.056
Ta+0.056
umm+0.054
 sidelines+0.054
 former+0.053
Neg logit contributions
.-0.051
,-0.048
 and-0.045
-0.041
 to-0.039
Token with
Token ID351
Feature activation-9.11e+00

Pos logit contributions
 Majesty+0.060
 sidelines+0.059
agna+0.058
Ta+0.058
 former+0.058
Neg logit contributions
.-0.023
,-0.021
 and-0.019
-0.016
 to-0.012
Token her
Token ID607
Feature activation-3.33e+00

Pos logit contributions
.+0.252
,+0.229
 and+0.220
+0.165
 to+0.157
Neg logit contributions
 Majesty-0.507
Ta-0.502
umm-0.490
 former-0.489
 sidelines-0.487
Token and
Token ID290
Feature activation-2.34e+01

Pos logit contributions
.+0.145
,+0.137
 and+0.133
+0.116
 to+0.112
Neg logit contributions
 Majesty-0.134
Ta-0.130
umm-0.128
 former-0.127
 sidelines-0.127
Token go
Token ID467
Feature activation+9.14e+00

Pos logit contributions
.+0.775
,+0.708
 and+0.685
+0.538
 to+0.525
Neg logit contributions
 Majesty-1.213
Ta-1.210
umm-1.187
 sidelines-1.172
 former-1.168
Token on
Token ID319
Feature activation-4.00e+00

Pos logit contributions
 sidelines+0.492
 Majesty+0.491
atmeal+0.481
 former+0.473
agna+0.462
Neg logit contributions
.-0.214
,-0.192
 and-0.179
-0.147
 with-0.119
Tokenet
Token ID316
Feature activation+3.79e+01

Pos logit contributions
 Majesty+1.120
Ta+1.114
umm+1.090
agna+1.033
 sidelines+1.029
Neg logit contributions
.-3.108
,-2.976
 and-2.922
-2.637
 to-2.612
Token.
Token ID13
Feature activation-4.49e+01

Pos logit contributions
isher+2.571
atmeal+2.546
umm+2.542
agna+2.532
 Majesty+2.525
Neg logit contributions
.-0.122
,-0.065
 and-0.045
-0.039
 to+0.133
Token She
Token ID1375
Feature activation-3.42e+00

Pos logit contributions
.+1.545
,+1.422
 and+1.372
 with+1.089
 to+1.047
Neg logit contributions
Ta-2.100
 Majesty-2.084
umm-2.046
 sidelines-2.003
 majesty-1.966
Token was
Token ID373
Feature activation-1.53e+01

Pos logit contributions
.+0.132
,+0.123
 and+0.120
+0.097
 to+0.097
Neg logit contributions
Ta-0.162
 Majesty-0.161
umm-0.160
 sidelines-0.157
agna-0.155
Token curious
Token ID11040
Feature activation+1.32e+01

Pos logit contributions
.+0.618
,+0.578
 and+0.560
+0.467
 to+0.459
Neg logit contributions
 Majesty-0.675
Ta-0.673
umm-0.652
 sidelines-0.649
agna-0.643
Token about
Token ID546
Feature activation-5.62e+00

Pos logit contributions
umm+0.831
 Majesty+0.797
 ounces+0.795
isher+0.784
 sidelines+0.784
Neg logit contributions
-0.030
.-0.023
,-0.011
 and+0.013
 with+0.031
Token how
Token ID703
Feature activation-6.79e+00

Pos logit contributions
.+0.133
,+0.122
 and+0.115
+0.084
 with+0.078
Neg logit contributions
 Majesty-0.322
 former-0.310
 pulp-0.309
 sidelines-0.309
agna-0.307
Token it
Token ID340
Feature activation-1.62e+00

Pos logit contributions
.+0.176
,+0.165
 and+0.160
+0.113
 with+0.112
Neg logit contributions
 Majesty-0.378
Ta-0.376
 pulp-0.369
umm-0.367
 sidelines-0.366
Token worked
Token ID3111
Feature activation+2.13e+01

Pos logit contributions
.+0.048
,+0.043
 and+0.041
+0.032
 to+0.031
Neg logit contributions
Ta-0.089
 Majesty-0.088
umm-0.087
agna-0.086
 sidelines-0.085
Token.
Token ID13
Feature activation-4.42e+01

Pos logit contributions
wy+1.334
 former+1.298
isher+1.287
agna+1.273
vell+1.261
Neg logit contributions
-0.077
.-0.014
,-0.012
 and-0.009
 to+0.087
Token The
Token ID383
Feature activation-9.51e+00

Pos logit contributions
.+1.174
,+1.065
 and+1.017
 with+0.728
 to+0.697
Neg logit contributions
Ta-2.474
 Majesty-2.473
 sidelines-2.387
umm-2.382
 former-2.348
Token The
Token ID383
Feature activation-6.87e+00

Pos logit contributions
.+0.855
,+0.751
 and+0.684
 to+0.434
 with+0.421
Neg logit contributions
 Majesty-2.595
Ta-2.589
umm-2.574
 sidelines-2.562
agna-2.558
Token end
Token ID886
Feature activation+3.14e+00

Pos logit contributions
.+0.278
,+0.258
 and+0.253
+0.201
 to+0.199
Neg logit contributions
Ta-0.321
 Majesty-0.315
umm-0.311
agna-0.311
 pulp-0.304
Token.
Token ID13
Feature activation-4.14e+01

Pos logit contributions
umm+0.204
roach+0.203
isher+0.201
mic+0.198
icious+0.198
Neg logit contributions
,-0.007
 and-0.006
.-0.004
+0.010
 with+0.011
Token<|endoftext|>
Token ID50256
Feature activation-4.32e-01

Pos logit contributions
.+0.662
,+0.571
 and+0.495
 to+0.256
 with+0.224
Neg logit contributions
Ta-2.777
umm-2.733
 Majesty-2.727
agna-2.722
 sidelines-2.703
TokenOnce
Token ID7454
Feature activation+5.62e-01

Pos logit contributions
,+0.013
.+0.013
 and+0.012
 to+0.011
 with+0.010
Neg logit contributions
Ta-0.018
umm-0.017
rob-0.017
Whe-0.017
roach-0.017
Token upon
Token ID2402
Feature activation+3.88e+00

Pos logit contributions
 Majesty+0.016
 majesty+0.016
isher+0.015
ubb+0.015
heit+0.015
Neg logit contributions
.-0.025
 and-0.024
,-0.022
 with-0.021
 to-0.019
Token a
Token ID257
Feature activation-1.53e+01

Pos logit contributions
Ta+0.227
 Majesty+0.209
 former+0.207
icious+0.207
 sidelines+0.205
Neg logit contributions
,-0.079
.-0.077
 and-0.064
 to-0.046
 with-0.041
Token time
Token ID640
Feature activation+3.04e-01

Pos logit contributions
.+0.567
,+0.521
 and+0.502
 to+0.430
 with+0.421
Neg logit contributions
Ta-0.687
 Majesty-0.678
icious-0.669
 former-0.663
umm-0.662
Token,
Token ID11
Feature activation-3.09e+01

Pos logit contributions
 Majesty+0.019
 majesty+0.019
Ta+0.018
ocol+0.018
heit+0.018
Neg logit contributions
.-0.003
 and-0.002
 to-0.000
 with-0.000
,-0.000
Token there
Token ID612
Feature activation-2.36e+00

Pos logit contributions
.+0.923
 and+0.793
,+0.762
 to+0.646
 with+0.544
Neg logit contributions
Ta-1.462
 Majesty-1.420
Uh-1.400
 pulp-1.383
isher-1.355
Token lived
Token ID5615
Feature activation+1.69e+01

Pos logit contributions
.+0.063
 and+0.052
,+0.050
 to+0.044
 with+0.036
Neg logit contributions
Ta-0.116
umm-0.115
 Majesty-0.112
agna-0.109
atmeal-0.109
Token he
Token ID339
Feature activation+8.38e-02

Pos logit contributions
.+0.802
,+0.693
 and+0.653
 to+0.520
+0.514
Neg logit contributions
 Majesty-1.538
 sidelines-1.500
umm-1.476
agna-1.473
Ta-1.468
Token sped
Token ID40424
Feature activation+4.59e+01

Pos logit contributions
Ta+0.004
 Majesty+0.004
umm+0.004
 sidelines+0.004
agna+0.004
Neg logit contributions
.-0.003
,-0.003
 and-0.003
-0.003
 to-0.003
Token desperately
Token ID16459
Feature activation+2.18e+01

Pos logit contributions
 sidelines+1.823
 Majesty+1.802
Ta+1.751
heit+1.736
isher+1.695
Neg logit contributions
.-1.564
,-1.440
 and-1.376
-1.207
 with-1.065
Token to
Token ID284
Feature activation-1.72e+01

Pos logit contributions
 sidelines+1.188
 Majesty+1.184
 former+1.146
heit+1.123
 unnecessary+1.123
Neg logit contributions
.-0.447
,-0.388
 and-0.354
-0.313
 with-0.238
Token explore
Token ID7301
Feature activation+3.30e+01

Pos logit contributions
.+0.417
,+0.381
 and+0.360
+0.249
 with+0.245
Neg logit contributions
 Majesty-1.026
 former-1.003
Ta-0.998
 sidelines-0.997
 majesty-0.969
Token everything
Token ID2279
Feature activation+1.31e+01

Pos logit contributions
 former+1.874
 Majesty+1.818
 sidelines+1.794
ocol+1.790
 conclusions+1.777
Neg logit contributions
.-0.274
,-0.263
 and-0.205
-0.154
 to-0.129
Token.
Token ID13
Feature activation-3.88e+01

Pos logit contributions
umm+0.730
 ounces+0.689
isher+0.686
 sidelines+0.682
 Majesty+0.675
Neg logit contributions
 and-0.190
.-0.187
,-0.185
-0.149
 to-0.103
Token
Token ID220
Feature activation-6.89e+00

Pos logit contributions
.+1.357
,+1.246
 and+1.206
 to+0.935
 with+0.924
Neg logit contributions
 Majesty-1.925
Ta-1.909
umm-1.874
 sidelines-1.854
 former-1.840
Token
Token ID198
Feature activation-3.00e+01

Pos logit contributions
.+0.116
,+0.112
 and+0.100
 to+0.057
 with+0.051
Neg logit contributions
 Majesty-0.417
Ta-0.410
 majesty-0.403
icious-0.400
 former-0.398
Token
Token ID198
Feature activation-3.00e+01

Pos logit contributions
.+0.548
,+0.475
 and+0.415
 to+0.239
 with+0.194
Neg logit contributions
Ta-1.885
 Majesty-1.812
 sidelines-1.791
icious-1.768
 former-1.759
TokenJust
Token ID5703
Feature activation+1.53e+01

Pos logit contributions
.+1.261
,+1.175
 and+1.173
+1.017
 with+0.983
Neg logit contributions
umm-1.314
 Majesty-1.313
Ta-1.276
isher-1.261
 pulp-1.247
Token plant
Token ID4618
Feature activation+3.29e+01

Pos logit contributions
.+0.770
,+0.729
 and+0.711
 to+0.609
+0.608
Neg logit contributions
Ta-0.589
 Majesty-0.573
umm-0.561
 sidelines-0.555
agna-0.549
Token.
Token ID13
Feature activation-4.15e+01

Pos logit contributions
 Majesty+2.039
umm+2.022
 majesty+1.983
agna+1.977
 former+1.976
Neg logit contributions
.-0.401
,-0.336
 and-0.318
-0.264
 to-0.121
Token I
Token ID314
Feature activation+7.02e+00

Pos logit contributions
.+1.799
,+1.692
 and+1.654
+1.424
 with+1.375
Neg logit contributions
Ta-1.728
 Majesty-1.722
 sidelines-1.685
umm-1.684
agna-1.635
Token like
Token ID588
Feature activation-6.30e+00

Pos logit contributions
Ta+0.290
 Majesty+0.288
 former+0.281
umm+0.276
agna+0.276
Neg logit contributions
.-0.297
,-0.278
 and-0.268
-0.226
 to-0.226
Token mint
Token ID21043
Feature activation+3.91e+01

Pos logit contributions
.+0.189
,+0.172
 and+0.164
+0.130
 with+0.120
Neg logit contributions
 Majesty-0.330
 pulp-0.321
umm-0.320
 sidelines-0.316
 former-0.316
Token too
Token ID1165
Feature activation-1.05e+01

Pos logit contributions
umm+1.863
 pulp+1.848
wy+1.826
affles+1.801
 fiber+1.796
Neg logit contributions
.-1.084
,-0.988
 and-0.948
-0.884
 to-0.751
Token."
Token ID526
Feature activation-1.89e+01

Pos logit contributions
.+0.223
 and+0.212
,+0.208
+0.161
 to+0.157
Neg logit contributions
 pulp-0.522
 Majesty-0.522
umm-0.517
 former-0.517
 ounces-0.510
Token
Token ID198
Feature activation-3.25e+01

Pos logit contributions
.+0.346
,+0.308
 and+0.274
 with+0.156
 in+0.155
Neg logit contributions
Ta-1.102
 sidelines-1.099
 Majesty-1.090
 majesty-1.054
 former-1.034
Token
Token ID198
Feature activation-3.24e+01

Pos logit contributions
.+0.555
,+0.552
 and+0.498
 to+0.296
 with+0.252
Neg logit contributions
Ta-1.974
 sidelines-1.892
 Majesty-1.884
 majesty-1.848
 former-1.831
TokenL
Token ID43
Feature activation+1.46e+01

Pos logit contributions
.+1.399
,+1.312
 and+1.277
 to+1.065
+1.065
Neg logit contributions
Ta-1.387
 Majesty-1.378
umm-1.350
 sidelines-1.331
agna-1.319
Tokenily
Token ID813
Feature activation+2.68e+01

Pos logit contributions
Ta+0.514
 Majesty+0.490
umm+0.481
isher+0.478
icious+0.470
Neg logit contributions
.-0.721
,-0.693
 and-0.664
 to-0.594
-0.581
Token you
Token ID345
Feature activation+8.32e+00

Pos logit contributions
.+0.139
,+0.130
 and+0.129
+0.104
 with+0.101
Neg logit contributions
 Majesty-0.170
 sidelines-0.165
Ta-0.164
umm-0.163
 majesty-0.162
Token should
Token ID815
Feature activation+7.19e+00

Pos logit contributions
Ta+0.348
 Majesty+0.345
 sidelines+0.337
agna+0.335
umm+0.335
Neg logit contributions
.-0.348
,-0.326
 and-0.317
-0.270
 to-0.266
Token act
Token ID719
Feature activation+2.88e+01

Pos logit contributions
 Majesty+0.284
 sidelines+0.280
agna+0.274
 former+0.272
 majesty+0.270
Neg logit contributions
.-0.289
,-0.273
 and-0.272
-0.236
 to-0.228
Token.
Token ID13
Feature activation-4.12e+01

Pos logit contributions
 Majesty+1.735
agna+1.689
 former+1.688
 majesty+1.660
 unnecessary+1.656
Neg logit contributions
.-0.445
,-0.391
 and-0.360
-0.317
 to-0.202
Token Go
Token ID1514
Feature activation+3.38e+01

Pos logit contributions
.+1.790
,+1.681
 and+1.650
+1.410
 to+1.373
Neg logit contributions
Ta-1.732
 Majesty-1.699
umm-1.672
 sidelines-1.665
agna-1.630
Token to
Token ID284
Feature activation-1.95e+01

Pos logit contributions
 Majesty+1.645
 sidelines+1.635
isher+1.634
agna+1.617
 former+1.604
Neg logit contributions
.-0.948
,-0.830
 and-0.777
-0.647
 the-0.583
Token your
Token ID534
Feature activation+8.42e+00

Pos logit contributions
.+0.655
,+0.601
 and+0.582
+0.459
 to+0.445
Neg logit contributions
 Majesty-1.003
Ta-0.990
 sidelines-0.974
 former-0.969
umm-0.964
Token room
Token ID2119
Feature activation+2.97e+01

Pos logit contributions
 Majesty+0.324
umm+0.311
Ta+0.307
 former+0.306
agna+0.305
Neg logit contributions
.-0.391
,-0.370
 and-0.361
-0.314
 to-0.304
Token and
Token ID290
Feature activation-2.41e+01

Pos logit contributions
 Majesty+1.939
 former+1.924
 majesty+1.867
agna+1.831
isher+1.829
Neg logit contributions
.-0.362
,-0.265
 and-0.208
-0.175
 the-0.055
Token stay
Token ID2652
Feature activation+2.32e+01

Pos logit contributions
.+0.947
,+0.884
 and+0.860
 to+0.708
+0.702
Neg logit contributions
Ta-1.123
 Majesty-1.112
umm-1.105
 sidelines-1.073
agna-1.068
Token there
Token ID612
Feature activation-2.22e+00

Pos logit contributions
 former+1.135
 Majesty+1.135
agna+1.077
 sidelines+1.069
 majesty+1.067
Neg logit contributions
.-0.699
,-0.652
 and-0.600
-0.538
 to-0.485
Token until
Token ID1566
Feature activation-3.02e+00

Pos logit contributions
 sidelines+0.714
 Majesty+0.648
uries+0.627
 unnecessary+0.627
 hind+0.615
Neg logit contributions
.-0.351
 and-0.300
-0.299
,-0.294
 with-0.223
Token he
Token ID339
Feature activation-1.19e+00

Pos logit contributions
.+0.092
 and+0.084
,+0.078
 with+0.062
+0.061
Neg logit contributions
 sidelines-0.153
Ta-0.146
 Majesty-0.144
umm-0.141
 pulp-0.140
Token reached
Token ID4251
Feature activation+1.69e+01

Pos logit contributions
.+0.048
,+0.043
 and+0.042
 to+0.035
+0.035
Neg logit contributions
 Majesty-0.050
Ta-0.049
umm-0.047
 majesty-0.047
agna-0.046
Token his
Token ID465
Feature activation-2.15e+00

Pos logit contributions
 Majesty+1.008
 majesty+0.955
 sidelines+0.943
 former+0.940
 pulp+0.926
Neg logit contributions
.-0.297
,-0.259
 and-0.257
-0.163
 with-0.149
Token den
Token ID2853
Feature activation+4.46e+01

Pos logit contributions
.+0.101
,+0.095
 and+0.094
+0.080
 with+0.079
Neg logit contributions
 Majesty-0.082
umm-0.080
Ta-0.079
 sidelines-0.078
 former-0.078
Token.
Token ID13
Feature activation-4.01e+01

Pos logit contributions
 sidelines+2.940
 former+2.897
 Majesty+2.873
umm+2.854
vation+2.759
Neg logit contributions
.-0.103
 and-0.069
,-0.059
-0.023
 with+0.224
Token
Token ID198
Feature activation-3.13e+01

Pos logit contributions
.+0.990
,+0.883
 and+0.831
 to+0.547
 with+0.542
Neg logit contributions
Ta-2.388
 Majesty-2.336
 sidelines-2.314
umm-2.314
agna-2.273
Token
Token ID198
Feature activation-3.13e+01

Pos logit contributions
.+0.527
,+0.448
 and+0.390
 to+0.189
 with+0.170
Neg logit contributions
Ta-2.081
 Majesty-2.003
 sidelines-1.993
icious-1.951
 majesty-1.940
TokenThe
Token ID464
Feature activation+2.03e+00

Pos logit contributions
.+1.483
,+1.399
 and+1.377
+1.200
 to+1.171
Neg logit contributions
 Majesty-1.199
umm-1.190
Ta-1.181
agna-1.136
 former-1.136
Token bird
Token ID6512
Feature activation+2.95e+01

Pos logit contributions
Ta+0.094
 Majesty+0.092
umm+0.091
agna+0.091
 former+0.089
Neg logit contributions
.-0.084
,-0.078
 and-0.077
 to-0.063
-0.062
Token gave
Token ID2921
Feature activation+1.09e+01

Pos logit contributions
 Majesty+1.493
Ta+1.474
 former+1.429
 sidelines+1.426
umm+1.420
Neg logit contributions
.-0.918
,-0.800
 and-0.766
-0.634
 to-0.614
Token in
Token ID287
Feature activation-1.68e+01

Pos logit contributions
 Majesty+0.903
 former+0.899
 sidelines+0.857
 majesty+0.849
isher+0.833
Neg logit contributions
.-0.431
,-0.415
 and-0.386
-0.319
 to-0.287
Token their
Token ID511
Feature activation-7.50e+00

Pos logit contributions
.+0.360
,+0.334
 and+0.317
+0.222
 to+0.212
Neg logit contributions
 former-0.959
 Majesty-0.956
 sidelines-0.949
 majesty-0.910
umm-0.896
Token room
Token ID2119
Feature activation+2.37e+01

Pos logit contributions
.+0.371
,+0.350
 and+0.342
 to+0.294
+0.292
Neg logit contributions
Ta-0.277
 Majesty-0.269
umm-0.269
 sidelines-0.261
agna-0.260
Token until
Token ID1566
Feature activation-9.67e+00

Pos logit contributions
 Majesty+1.651
 former+1.628
 majesty+1.591
 sidelines+1.562
agna+1.541
Neg logit contributions
.-0.113
,-0.086
 and-0.039
-0.023
 the+0.082
Token morning
Token ID3329
Feature activation+1.29e+01

Pos logit contributions
.+0.268
,+0.241
 and+0.237
+0.176
 to+0.169
Neg logit contributions
 Majesty-0.542
Ta-0.535
 sidelines-0.529
umm-0.527
agna-0.524
Token.
Token ID13
Feature activation-4.66e+01

Pos logit contributions
 Majesty+0.877
 majesty+0.854
vation+0.852
 sidelines+0.852
agna+0.837
Neg logit contributions
.-0.041
 and-0.030
,-0.022
+0.036
 with+0.051
Token
Token ID198
Feature activation-3.78e+01

Pos logit contributions
.+1.373
,+1.259
 and+1.199
 to+0.909
 with+0.877
Neg logit contributions
Ta-2.562
 Majesty-2.554
umm-2.494
 former-2.465
 sidelines-2.458
Token
Token ID198
Feature activation-3.76e+01

Pos logit contributions
.+0.988
,+0.914
 and+0.852
 to+0.616
 with+0.598
Neg logit contributions
Ta-2.222
 Majesty-2.173
 sidelines-2.132
 former-2.107
 majesty-2.088
TokenS
Token ID50
Feature activation+1.28e+01

Pos logit contributions
.+1.614
,+1.502
 and+1.481
+1.304
 with+1.229
Neg logit contributions
umm-1.610
 Majesty-1.604
Ta-1.557
agna-1.533
 sidelines-1.516
Tokenara
Token ID3301
Feature activation+3.00e+01

Pos logit contributions
Ta+0.350
 Majesty+0.347
umm+0.338
 sidelines+0.327
agna+0.324
Neg logit contributions
.-0.745
,-0.710
 and-0.697
-0.616
 to-0.614
Token and
Token ID290
Feature activation-2.90e+01

Pos logit contributions
 Majesty+1.714
Ta+1.702
 majesty+1.654
agna+1.608
 former+1.608
Neg logit contributions
.-0.762
,-0.657
 and-0.585
-0.473
 to-0.467
Token have
Token ID423
Feature activation+4.14e+00

Pos logit contributions
Ta+0.476
 Majesty+0.474
umm+0.458
 former+0.454
agna+0.453
Neg logit contributions
.-0.454
,-0.423
 and-0.410
-0.345
 to-0.343
Token to
Token ID284
Feature activation-2.17e+01

Pos logit contributions
 Majesty+0.291
Ta+0.289
 former+0.284
umm+0.282
agna+0.282
Neg logit contributions
.-0.058
,-0.047
 and-0.042
-0.017
 with-0.014
Token go
Token ID467
Feature activation+6.05e+00

Pos logit contributions
.+0.977
,+0.916
 and+0.897
+0.767
 to+0.761
Neg logit contributions
 Majesty-0.860
Ta-0.855
umm-0.829
 sidelines-0.822
 former-0.821
Token home
Token ID1363
Feature activation+2.02e+01

Pos logit contributions
 Majesty+0.324
 former+0.319
 sidelines+0.314
agna+0.305
mic+0.304
Neg logit contributions
.-0.136
,-0.119
 and-0.107
-0.093
 with-0.075
Token now
Token ID783
Feature activation+5.58e-01

Pos logit contributions
 former+1.178
 Majesty+1.136
agna+1.086
 majesty+1.073
arians+1.052
Neg logit contributions
.-0.355
,-0.301
 and-0.254
-0.243
 to-0.177
Token,"
Token ID553
Feature activation-1.89e+01

Pos logit contributions
 Majesty+0.034
 former+0.033
agna+0.032
 sidelines+0.032
umm+0.032
Neg logit contributions
.-0.008
,-0.007
 and-0.006
-0.005
 to-0.004
Token her
Token ID607
Feature activation-6.06e+00

Pos logit contributions
.+0.655
,+0.602
 and+0.592
 to+0.487
 with+0.468
Neg logit contributions
 Majesty-0.892
Ta-0.880
 sidelines-0.857
 former-0.847
 pulp-0.846
Token daddy
Token ID35822
Feature activation+2.45e+01

Pos logit contributions
.+0.285
,+0.269
 and+0.266
 to+0.228
+0.228
Neg logit contributions
 Majesty-0.227
Ta-0.214
umm-0.214
 former-0.212
 majesty-0.211
Token explained
Token ID4893
Feature activation+1.32e+01

Pos logit contributions
Ta+0.928
 Majesty+0.920
isher+0.900
agna+0.897
inge+0.893
Neg logit contributions
.-0.910
,-0.832
 and-0.826
 to-0.738
-0.721
Token.
Token ID13
Feature activation-4.28e+01

Pos logit contributions
isher+0.804
rier+0.799
 sidelines+0.789
 forehead+0.786
 former+0.781
Neg logit contributions
-0.032
,-0.013
 and-0.010
.-0.004
 to+0.061
Token
Token ID220
Feature activation-1.09e+01

Pos logit contributions
.+1.311
,+1.216
 and+1.150
 to+0.884
 with+0.847
Neg logit contributions
 Majesty-2.234
Ta-2.186
 sidelines-2.163
 former-2.149
umm-2.137
Token soon
Token ID2582
Feature activation+5.07e-01

Pos logit contributions
.+0.031
,+0.030
 and+0.024
+0.009
 to+0.005
Neg logit contributions
 former-0.252
 Majesty-0.246
 sidelines-0.234
bek-0.234
 lifetime-0.232
Token!
Token ID0
Feature activation-2.78e+01

Pos logit contributions
 Majesty+0.036
 sidelines+0.035
 majesty+0.035
umm+0.034
 pulp+0.034
Neg logit contributions
.-0.001
,-0.001
 and-0.000
+0.001
 with+0.002
Token<|endoftext|>
Token ID50256
Feature activation-3.32e+00

Pos logit contributions
.+0.642
,+0.576
 and+0.539
 to+0.357
 with+0.343
Neg logit contributions
Ta-1.691
 Majesty-1.675
umm-1.664
 sidelines-1.642
 former-1.634
TokenOnce
Token ID7454
Feature activation-2.25e+00

Pos logit contributions
,+0.090
.+0.090
 and+0.085
 to+0.072
 with+0.064
Neg logit contributions
Ta-0.152
umm-0.143
rob-0.143
Whe-0.140
roach-0.139
Token upon
Token ID2402
Feature activation+1.07e+00

Pos logit contributions
.+0.103
 and+0.098
,+0.094
 with+0.087
 to+0.081
Neg logit contributions
 Majesty-0.065
 majesty-0.062
isher-0.060
Ta-0.058
ubb-0.058
Token a
Token ID257
Feature activation-1.81e+01

Pos logit contributions
Ta+0.065
 Majesty+0.060
 former+0.059
 sidelines+0.059
icious+0.058
Neg logit contributions
,-0.022
.-0.021
 and-0.017
 to-0.012
 with-0.011
Token time
Token ID640
Feature activation-2.41e+00

Pos logit contributions
.+0.647
,+0.590
 and+0.568
 to+0.479
 with+0.463
Neg logit contributions
Ta-0.865
 Majesty-0.862
umm-0.842
icious-0.832
 former-0.828
Token,
Token ID11
Feature activation-3.36e+01

Pos logit contributions
.+0.028
 and+0.017
 to+0.004
 with+0.001
,+0.000
Neg logit contributions
 Majesty-0.150
 majesty-0.148
Ta-0.142
heit-0.140
umm-0.140
Token there
Token ID612
Feature activation-4.99e+00

Pos logit contributions
.+0.976
,+0.836
 and+0.828
 to+0.680
 with+0.565
Neg logit contributions
Ta-1.655
 Majesty-1.626
wy-1.602
 pulp-1.589
Uh-1.569
Token was
Token ID373
Feature activation-1.41e+01

Pos logit contributions
.+0.136
 and+0.112
,+0.109
 to+0.094
 with+0.082
Neg logit contributions
Ta-0.239
umm-0.230
 Majesty-0.229
atmeal-0.224
Uh-0.222
Token a
Token ID257
Feature activation-1.77e+01

Pos logit contributions
.+0.254
,+0.213
 and+0.186
 to+0.108
 with+0.093
Neg logit contributions
Ta-0.953
 Majesty-0.912
umm-0.901
 sidelines-0.897
 former-0.892