TOP ACTIVATIONS
MAX = 74.343

checked for a fire extingu isher , but there wasn 't
She grabbed the fire extingu isher and put out the fire
. They thanked the Pol isher for all its kind work
're a very good pol isher ." Lily smiled and felt
. They asked the Pol isher to make them look nice
her dad watched from the sidelines , his heart swelling with
oven and taken the las agna . But the
the store was selling w affles for very cheap . She
wished they had eaten w affles . She wished they had
When the w affles were ready , Lily 's
store and got the y umm iest ice cream they could
. She ate the w affles with the syrup and smiled
She loved to eat w affles for breakfast every day .
there was a yellow pol isher . It was very special
come and eat your w affles . They are hot and
just like a real b iker . With
enjoyed their y ummy w affles every morning . The end
they watched the pretend w affles fly high up into the
impatient . She wanted w affles for breakfast . She went
on , the two w affles were never apart . Where

MIN ACTIVATIONS
MIN = -47.346

girl called L izz ie . She liked to perform different
friendly rabbit named Ros ie . Ros ie warned
<|endoftext|> L ily loved ponies . She had a toy pony
- year - old child . The bar ber
little bird ie named Tweet . Tweet was very happy and
<|endoftext|> L ily was sad . She had a cold and
Lily who loved fair ies . She would go to the
. <|endoftext|> Sam loves basketball . He likes to play with
young girl named Ell a . Ell a was very excited
� T IM â € . It looked nice and he
sho el ace came undone . She stopped to tie it
Ben tries to blow bubbles . At first , he blows
ila thought for a moment . She liked Sam 's idea
old c udd ly toy . She picked it up and
curious little girl named Mary . Every day Mary would explore
child nodded his head eagerly . He was so happy to
clever ly avoided getting caught . He told all his mouse
a little girl named Lily . Lily said she made the
remember their j olly day . <|endoftext|> Once upon a time
<|endoftext|> Anna liked to dance . She danced every day in

INTERVAL 43.921 - 74.343
CONTAINS 1.454%

bounce and turn in the maze . They cheer and cl
Lily decided to choose the lions and gir aff es .
and the lucky four - leaf cl over . They all
they see a lot of machines and belts and gears .
he reached out and pl ucked the pet als from the

INTERVAL 13.499 - 43.921
CONTAINS 27.843%

. I don 't like sweet things . I prefer salty
shake and break into many pieces . Lily was scared and
and look for the tall tree with red leaves . The
way . She was very satisfied and kept thinking about the
He wanted to separate the deer from all of her friends

INTERVAL -16.923 - 13.499
CONTAINS 44.409%

there was a big gift for Sarah ! It was from
tried yawn ing , but that didn 't help either .
could open the box , he heard a voice saying ,
she saw a path that she had never seen before .
never trust him again . He wished he could have found

INTERVAL -47.346 - -16.923
CONTAINS 26.293%

a tall tower . " Look , our tower
it made John super sleepy . He went to the park
great frog . He liked to plan fun things to do
and dad with the grill . <|endoftext|> Once upon a time
to play with the dog and the ball . They ran
Token checked
Token ID10667
Feature activation+2.84e+01

Pos logit contributions
 Majesty+0.334
Ta+0.332
agna+0.317
 majesty+0.317
 pulp+0.315
Neg logit contributions
.-0.377
,-0.353
 and-0.345
-0.300
 with-0.292
Token for
Token ID329
Feature activation-7.46e+00

Pos logit contributions
isher+1.443
heit+1.405
bek+1.405
 Majesty+1.369
wy+1.368
Neg logit contributions
.-0.564
,-0.497
 and-0.469
-0.397
 with-0.323
Token a
Token ID257
Feature activation-1.46e+01

Pos logit contributions
.+0.155
,+0.134
 and+0.130
+0.083
 to+0.080
Neg logit contributions
 Majesty-0.455
Ta-0.451
 sidelines-0.448
agna-0.444
umm-0.441
Token fire
Token ID2046
Feature activation+2.78e+01

Pos logit contributions
.+0.604
,+0.566
 and+0.547
 to+0.460
+0.453
Neg logit contributions
Ta-0.625
 Majesty-0.624
umm-0.609
 sidelines-0.593
agna-0.593
Token extingu
Token ID24995
Feature activation+3.58e+01

Pos logit contributions
umm+1.505
agna+1.467
isher+1.450
 Majesty+1.449
 majesty+1.397
Neg logit contributions
.-0.552
,-0.516
 and-0.488
-0.435
 the-0.360
Tokenisher
Token ID4828
Feature activation+7.43e+01

Pos logit contributions
 Majesty+0.328
Ta+0.318
 sidelines+0.240
 majesty+0.240
agna+0.233
Neg logit contributions
.-2.733
,-2.638
 and-2.591
 to-2.383
 with-2.337
Token,
Token ID11
Feature activation-2.82e+01

Pos logit contributions
 Majesty+4.664
isher+4.604
agna+4.552
cher+4.549
 supervision+4.522
Neg logit contributions
-0.102
.-0.063
,-0.053
 and+0.050
 the+0.194
Token but
Token ID475
Feature activation-9.44e+00

Pos logit contributions
.+0.820
,+0.742
 and+0.649
+0.535
 to+0.516
Neg logit contributions
 sidelines-1.428
 Majesty-1.426
Ta-1.409
agna-1.388
 pulp-1.381
Token there
Token ID612
Feature activation+7.17e-02

Pos logit contributions
.+0.241
,+0.208
 and+0.199
+0.144
 to+0.139
Neg logit contributions
 Majesty-0.530
Ta-0.530
agna-0.519
 sidelines-0.519
umm-0.515
Token wasn
Token ID2492
Feature activation+1.05e+01

Pos logit contributions
Ta+0.004
umm+0.004
agna+0.004
 Majesty+0.004
 pulp+0.004
Neg logit contributions
.-0.002
,-0.001
 and-0.001
-0.001
 to-0.001
Token't
Token ID470
Feature activation+1.12e+01

Pos logit contributions
Ta+0.410
 sidelines+0.397
 Majesty+0.389
agna+0.386
umm+0.381
Neg logit contributions
.-0.433
,-0.411
 and-0.397
 with-0.332
 to-0.321
Token She
Token ID1375
Feature activation+2.53e+00

Pos logit contributions
.+1.339
,+1.232
 and+1.193
 with+0.934
 to+0.924
Neg logit contributions
 Majesty-1.867
umm-1.831
Ta-1.804
 majesty-1.764
 former-1.759
Token grabbed
Token ID13176
Feature activation+2.28e+01

Pos logit contributions
 Majesty+0.117
Ta+0.116
umm+0.113
 sidelines+0.112
agna+0.112
Neg logit contributions
.-0.095
,-0.088
 and-0.085
 to-0.069
-0.069
Token the
Token ID262
Feature activation-1.79e+01

Pos logit contributions
Ta+1.431
 Majesty+1.393
 sidelines+1.383
umm+1.373
agna+1.365
Neg logit contributions
.-0.421
,-0.352
 and-0.323
-0.196
 to-0.186
Token fire
Token ID2046
Feature activation+2.74e+01

Pos logit contributions
.+0.833
,+0.781
 and+0.763
 to+0.648
 with+0.633
Neg logit contributions
 Majesty-0.701
Ta-0.699
umm-0.675
agna-0.660
 majesty-0.652
Token extingu
Token ID24995
Feature activation+3.54e+01

Pos logit contributions
umm+1.689
 Majesty+1.650
agna+1.640
isher+1.627
 sidelines+1.616
Neg logit contributions
.-0.446
,-0.359
 and-0.308
-0.259
 to-0.179
Tokenisher
Token ID4828
Feature activation+7.37e+01

Pos logit contributions
 Majesty+0.379
Ta+0.356
 majesty+0.292
 sidelines+0.283
agna+0.277
Neg logit contributions
.-2.649
,-2.556
 and-2.515
 to-2.298
 with-2.270
Token and
Token ID290
Feature activation-2.23e+01

Pos logit contributions
 sidelines+4.551
 Majesty+4.549
isher+4.462
inge+4.450
umm+4.436
Neg logit contributions
.-0.516
,-0.259
-0.253
 and-0.043
 the+0.181
Token put
Token ID1234
Feature activation+1.45e+01

Pos logit contributions
.+0.902
,+0.840
 and+0.812
 to+0.676
+0.676
Neg logit contributions
Ta-0.956
 Majesty-0.956
umm-0.926
agna-0.914
 sidelines-0.913
Token out
Token ID503
Feature activation+3.25e+00

Pos logit contributions
Ta+0.783
isher+0.765
 Majesty+0.760
 sidelines+0.760
agna+0.752
Neg logit contributions
.-0.334
,-0.308
 and-0.284
-0.207
 to-0.202
Token the
Token ID262
Feature activation-1.79e+01

Pos logit contributions
Ta+0.186
 Majesty+0.185
 sidelines+0.181
 former+0.179
isher+0.178
Neg logit contributions
.-0.065
,-0.060
 and-0.056
-0.041
 with-0.033
Token fire
Token ID2046
Feature activation+2.74e+01

Pos logit contributions
.+0.843
,+0.785
 and+0.770
 to+0.658
 with+0.644
Neg logit contributions
Ta-0.685
 Majesty-0.680
umm-0.658
agna-0.641
 majesty-0.635
Token.
Token ID13
Feature activation-3.98e+01

Pos logit contributions
 Majesty+0.515
umm+0.510
 majesty+0.490
 former+0.481
 pulp+0.479
Neg logit contributions
.-0.043
,-0.038
-0.028
 and-0.013
 the+0.016
Token They
Token ID1119
Feature activation+2.04e+00

Pos logit contributions
.+1.439
,+1.333
 and+1.282
 to+1.042
 with+1.017
Neg logit contributions
 Majesty-1.894
Ta-1.890
umm-1.866
 sidelines-1.814
agna-1.813
Token thanked
Token ID26280
Feature activation+1.68e+01

Pos logit contributions
Ta+0.093
 Majesty+0.092
umm+0.089
agna+0.088
 sidelines+0.088
Neg logit contributions
.-0.076
,-0.071
 and-0.068
-0.056
 to-0.056
Token the
Token ID262
Feature activation-1.93e+01

Pos logit contributions
 Majesty+0.962
 former+0.958
 sidelines+0.947
umm+0.931
Ta+0.931
Neg logit contributions
.-0.305
,-0.261
 and-0.232
 to-0.149
-0.140
Token Pol
Token ID2165
Feature activation+3.35e+01

Pos logit contributions
.+0.912
,+0.854
 and+0.833
 to+0.708
+0.704
Neg logit contributions
Ta-0.733
 Majesty-0.726
umm-0.701
agna-0.689
 sidelines-0.684
Tokenisher
Token ID4828
Feature activation+7.25e+01

Pos logit contributions
 Majesty+0.562
Ta+0.558
umm+0.517
 sidelines+0.503
agna+0.490
Neg logit contributions
.-2.262
,-2.166
 and-2.129
 to-1.921
-1.906
Token for
Token ID329
Feature activation-9.22e+00

Pos logit contributions
 Majesty+4.598
 majesty+4.378
isher+4.314
 former+4.312
agna+4.293
Neg logit contributions
.-0.470
,-0.265
-0.180
 and-0.076
 with+0.109
Token all
Token ID477
Feature activation-3.65e-01

Pos logit contributions
.+0.317
,+0.292
 and+0.281
 to+0.223
+0.222
Neg logit contributions
 Majesty-0.455
Ta-0.453
umm-0.442
 sidelines-0.438
agna-0.436
Token its
Token ID663
Feature activation+5.39e+00

Pos logit contributions
.+0.007
,+0.006
 and+0.006
+0.005
 with+0.004
Neg logit contributions
 Majesty-0.020
 majesty-0.019
 pulp-0.019
 sidelines-0.019
 former-0.019
Token kind
Token ID1611
Feature activation+1.57e+01

Pos logit contributions
 Majesty+0.211
Ta+0.207
umm+0.201
 former+0.200
 majesty+0.200
Neg logit contributions
.-0.232
,-0.218
 and-0.213
-0.182
 with-0.178
Token work
Token ID670
Feature activation+2.34e+01

Pos logit contributions
 Majesty+0.655
umm+0.643
 majesty+0.624
 former+0.611
Ta+0.608
Neg logit contributions
.-0.571
,-0.537
 and-0.514
-0.467
 with-0.437
Token're
Token ID821
Feature activation+1.52e+01

Pos logit contributions
 Majesty+0.716
Ta+0.711
umm+0.701
 sidelines+0.690
agna+0.683
Neg logit contributions
.-0.580
,-0.540
 and-0.522
 to-0.423
-0.421
Token a
Token ID257
Feature activation-1.75e+01

Pos logit contributions
 Majesty+0.848
Ta+0.845
umm+0.825
agna+0.824
 sidelines+0.814
Neg logit contributions
.-0.412
,-0.366
 and-0.354
 to-0.258
-0.254
Token very
Token ID845
Feature activation-1.02e+01

Pos logit contributions
.+0.633
,+0.590
 and+0.569
 to+0.455
+0.449
Neg logit contributions
Ta-0.840
 Majesty-0.834
umm-0.815
 sidelines-0.807
agna-0.798
Token good
Token ID922
Feature activation+1.15e+01

Pos logit contributions
.+0.446
,+0.421
 and+0.403
 to+0.344
 with+0.339
Neg logit contributions
Ta-0.415
 Majesty-0.405
umm-0.401
 sidelines-0.393
 former-0.388
Token pol
Token ID755
Feature activation+2.78e+01

Pos logit contributions
 Majesty+0.292
umm+0.282
Ta+0.279
agna+0.267
 majesty+0.264
Neg logit contributions
.-0.666
,-0.635
 and-0.619
-0.560
 to-0.553
Tokenisher
Token ID4828
Feature activation+7.14e+01

Pos logit contributions
Ta+0.567
 Majesty+0.564
umm+0.527
 sidelines+0.523
agna+0.503
Neg logit contributions
.-1.786
,-1.705
 and-1.677
 to-1.502
 with-1.491
Token."
Token ID526
Feature activation-1.86e+01

Pos logit contributions
 Majesty+4.188
 majesty+4.045
umm+4.014
isher+3.926
agna+3.922
Neg logit contributions
.-0.949
,-0.841
-0.758
 and-0.731
 the-0.476
Token Lily
Token ID20037
Feature activation+7.31e+00

Pos logit contributions
.+0.500
,+0.480
 and+0.433
 to+0.361
 with+0.324
Neg logit contributions
 Majesty-0.954
Ta-0.941
 former-0.923
 majesty-0.918
umm-0.916
Token smiled
Token ID13541
Feature activation+1.62e+01

Pos logit contributions
Ta+0.299
 Majesty+0.297
 majesty+0.283
agna+0.275
umm+0.274
Neg logit contributions
.-0.296
,-0.268
 and-0.255
 to-0.226
-0.220
Token and
Token ID290
Feature activation-2.45e+01

Pos logit contributions
 Majesty+0.945
umm+0.925
 former+0.921
 majesty+0.867
Ta+0.861
Neg logit contributions
.-0.150
,-0.087
-0.028
 and-0.007
 to-0.002
Token felt
Token ID2936
Feature activation+7.03e+00

Pos logit contributions
.+0.930
,+0.846
 and+0.818
 to+0.683
+0.666
Neg logit contributions
Ta-1.095
 Majesty-1.085
 majesty-1.038
umm-1.027
 sidelines-1.023
Token.
Token ID13
Feature activation-4.23e+01

Pos logit contributions
.+0.008
,+0.006
 and+0.005
+0.003
 to+0.001
Neg logit contributions
 Majesty-0.061
 former-0.061
agna-0.060
umm-0.060
atmeal-0.060
Token They
Token ID1119
Feature activation-4.00e-01

Pos logit contributions
.+1.136
,+1.037
 and+0.984
 to+0.737
 with+0.701
Neg logit contributions
Ta-2.342
 Majesty-2.317
umm-2.298
 former-2.242
 sidelines-2.236
Token asked
Token ID1965
Feature activation+1.02e-01

Pos logit contributions
.+0.013
,+0.012
 and+0.012
 to+0.009
+0.009
Neg logit contributions
Ta-0.020
 Majesty-0.020
umm-0.019
agna-0.019
 sidelines-0.019
Token the
Token ID262
Feature activation-2.21e+01

Pos logit contributions
 Majesty+0.006
 former+0.005
agna+0.005
 majesty+0.005
Ta+0.005
Neg logit contributions
.-0.002
,-0.002
 and-0.002
-0.001
 to-0.001
Token Pol
Token ID2165
Feature activation+3.09e+01

Pos logit contributions
.+0.980
,+0.910
 and+0.887
 to+0.746
+0.743
Neg logit contributions
Ta-0.904
 Majesty-0.894
umm-0.865
agna-0.850
 sidelines-0.847
Tokenisher
Token ID4828
Feature activation+7.04e+01

Pos logit contributions
 Majesty+0.681
Ta+0.680
umm+0.645
 sidelines+0.626
agna+0.617
Neg logit contributions
.-1.920
,-1.832
 and-1.797
 to-1.606
-1.595
Token to
Token ID284
Feature activation-2.14e+01

Pos logit contributions
 Majesty+4.573
 majesty+4.404
isher+4.338
agna+4.320
 former+4.211
Neg logit contributions
.-0.550
,-0.270
 and-0.234
-0.132
 with+0.050
Token make
Token ID787
Feature activation+5.50e+00

Pos logit contributions
.+0.916
,+0.860
 and+0.835
 to+0.700
+0.699
Neg logit contributions
 Majesty-0.863
Ta-0.860
umm-0.832
 sidelines-0.826
agna-0.820
Token them
Token ID606
Feature activation+6.36e+00

Pos logit contributions
 Majesty+0.304
 former+0.287
agna+0.282
 majesty+0.282
umm+0.282
Neg logit contributions
.-0.134
,-0.128
 and-0.122
-0.090
 to-0.089
Token look
Token ID804
Feature activation+1.24e+01

Pos logit contributions
 Majesty+0.286
 former+0.281
Ta+0.274
agna+0.272
 pulp+0.269
Neg logit contributions
.-0.204
,-0.191
 and-0.183
-0.161
 to-0.146
Token nice
Token ID3621
Feature activation+1.03e+01

Pos logit contributions
 Majesty+0.495
 former+0.492
agna+0.475
umm+0.460
Ta+0.457
Neg logit contributions
.-0.415
,-0.394
 and-0.381
-0.357
 to-0.313
Token her
Token ID607
Feature activation-2.10e+00

Pos logit contributions
.+0.675
,+0.576
 and+0.553
+0.390
 to+0.386
Neg logit contributions
 Majesty-1.644
Ta-1.622
umm-1.598
 sidelines-1.583
 former-1.583
Token dad
Token ID9955
Feature activation+2.33e+01

Pos logit contributions
.+0.092
,+0.086
 and+0.083
+0.071
 to+0.071
Neg logit contributions
 Majesty-0.084
Ta-0.083
umm-0.080
 sidelines-0.079
agna-0.079
Token watched
Token ID7342
Feature activation+1.99e+01

Pos logit contributions
 Majesty+1.064
Ta+1.052
 sidelines+1.023
agna+1.021
 majesty+1.001
Neg logit contributions
.-0.773
,-0.672
 and-0.642
 to-0.556
-0.552
Token from
Token ID422
Feature activation-7.83e+00

Pos logit contributions
 sidelines+1.028
ocol+0.973
umm+0.972
ubb+0.954
 Majesty+0.952
Neg logit contributions
.-0.416
,-0.364
-0.312
 and-0.312
 to-0.238
Token the
Token ID262
Feature activation-1.71e+01

Pos logit contributions
.+0.194
,+0.173
 and+0.163
+0.116
 to+0.115
Neg logit contributions
Ta-0.458
 Majesty-0.458
umm-0.448
 sidelines-0.446
agna-0.441
Token sidelines
Token ID29394
Feature activation+6.97e+01

Pos logit contributions
.+0.806
,+0.755
 and+0.739
 to+0.635
+0.618
Neg logit contributions
 Majesty-0.645
Ta-0.633
umm-0.630
agna-0.622
 majesty-0.607
Token,
Token ID11
Feature activation-2.79e+01

Pos logit contributions
 sidelines+4.501
 Majesty+4.486
 former+4.334
umm+4.309
 majesty+4.240
Neg logit contributions
.-0.443
,-0.265
 and-0.150
-0.131
 to+0.061
Token his
Token ID465
Feature activation-1.65e+00

Pos logit contributions
.+1.076
,+0.995
 and+0.942
 to+0.790
+0.784
Neg logit contributions
 Majesty-1.241
Ta-1.230
 sidelines-1.205
umm-1.185
agna-1.171
Token heart
Token ID2612
Feature activation+3.43e+01

Pos logit contributions
.+0.076
,+0.071
 and+0.069
 to+0.059
+0.059
Neg logit contributions
 Majesty-0.063
Ta-0.063
umm-0.061
 sidelines-0.060
agna-0.060
Token swelling
Token ID29844
Feature activation+5.53e+01

Pos logit contributions
 Majesty+1.471
Ta+1.436
 sidelines+1.416
 former+1.408
 majesty+1.406
Neg logit contributions
.-1.165
,-1.074
 and-1.021
-0.919
 to-0.853
Token with
Token ID351
Feature activation-7.38e+00

Pos logit contributions
 Majesty+3.285
 former+3.185
 sidelines+3.169
umm+3.166
mic+3.156
Neg logit contributions
.-0.685
,-0.590
-0.454
 and-0.454
 to-0.201
Token oven
Token ID14361
Feature activation+4.46e+01

Pos logit contributions
.+1.076
,+1.027
 and+1.005
+0.891
 to+0.887
Neg logit contributions
 Majesty-0.474
Ta-0.468
umm-0.456
agna-0.441
 sidelines-0.437
Token and
Token ID290
Feature activation-2.33e+01

Pos logit contributions
umm+2.878
agna+2.771
 former+2.726
 sidelines+2.719
 Majesty+2.689
Neg logit contributions
.-0.413
,-0.285
-0.151
 and-0.148
 the+0.032
Token taken
Token ID2077
Feature activation+2.12e+01

Pos logit contributions
.+0.888
,+0.824
 and+0.797
 to+0.653
+0.648
Neg logit contributions
 Majesty-1.065
Ta-1.063
umm-1.038
 sidelines-1.019
agna-1.018
Token the
Token ID262
Feature activation-1.84e+01

Pos logit contributions
Ta+1.025
 Majesty+1.015
umm+1.007
isher+1.006
agna+0.984
Neg logit contributions
.-0.593
,-0.532
 and-0.515
-0.394
 with-0.359
Token las
Token ID39990
Feature activation+3.65e+01

Pos logit contributions
.+0.940
,+0.893
 and+0.870
+0.756
 to+0.755
Neg logit contributions
 Majesty-0.597
Ta-0.597
umm-0.582
agna-0.566
 sidelines-0.564
Tokenagna
Token ID48669
Feature activation+6.92e+01

Pos logit contributions
Ta+0.296
 Majesty+0.210
 sidelines+0.205
 former+0.169
 pulp+0.137
Neg logit contributions
.-2.813
,-2.700
 and-2.665
 to-2.431
 with-2.424
Token.
Token ID13
Feature activation-3.95e+01

Pos logit contributions
umm+4.031
agna+4.011
isher+3.961
 Majesty+3.950
 pulp+3.946
Neg logit contributions
.-0.559
,-0.553
-0.452
 and-0.373
 with-0.046
Token
Token ID198
Feature activation-3.02e+01

Pos logit contributions
.+1.165
,+1.071
 and+1.014
 to+0.772
 with+0.744
Neg logit contributions
Ta-2.125
 Majesty-2.115
umm-2.087
agna-2.046
 sidelines-2.041
Token
Token ID198
Feature activation-3.02e+01

Pos logit contributions
.+0.517
,+0.454
 and+0.389
 to+0.217
 with+0.180
Neg logit contributions
Ta-1.995
 Majesty-1.909
 sidelines-1.901
agna-1.864
 majesty-1.861
TokenBut
Token ID1537
Feature activation+2.55e+00

Pos logit contributions
.+1.089
,+1.001
 and+0.991
+0.839
 with+0.801
Neg logit contributions
 Majesty-1.471
umm-1.428
Ta-1.407
 majesty-1.392
agna-1.382
Token the
Token ID262
Feature activation-1.82e+01

Pos logit contributions
 Majesty+0.142
agna+0.139
umm+0.137
 former+0.137
Ta+0.137
Neg logit contributions
.-0.065
,-0.053
 and-0.052
-0.039
 to-0.038
Token the
Token ID262
Feature activation-1.74e+01

Pos logit contributions
.+0.330
,+0.297
 and+0.288
+0.218
 to+0.218
Neg logit contributions
Ta-0.581
 Majesty-0.579
umm-0.566
agna-0.563
 sidelines-0.554
Token store
Token ID3650
Feature activation+2.84e+01

Pos logit contributions
.+0.832
,+0.782
 and+0.761
 to+0.651
+0.649
Neg logit contributions
Ta-0.652
 Majesty-0.646
umm-0.622
 pulp-0.611
agna-0.603
Token was
Token ID373
Feature activation-9.48e+00

Pos logit contributions
umm+1.326
 Majesty+1.322
Ta+1.313
agna+1.307
 sidelines+1.267
Neg logit contributions
.-0.904
,-0.839
 and-0.806
-0.654
 to-0.636
Token selling
Token ID6301
Feature activation+3.21e+01

Pos logit contributions
.+0.327
,+0.307
 and+0.296
+0.247
 with+0.230
Neg logit contributions
 Majesty-0.431
 former-0.422
Ta-0.421
agna-0.421
 sidelines-0.416
Token w
Token ID266
Feature activation+1.24e+01

Pos logit contributions
 Majesty+1.434
umm+1.426
agna+1.426
 former+1.408
 pulp+1.406
Neg logit contributions
.-0.935
,-0.896
 and-0.845
-0.745
 to-0.675
Tokenaffles
Token ID48501
Feature activation+6.89e+01

Pos logit contributions
Ta+0.110
 Majesty+0.108
umm+0.087
 sidelines+0.080
agna+0.080
Neg logit contributions
.-0.937
,-0.901
 and-0.887
 to-0.807
-0.802
Token for
Token ID329
Feature activation-7.13e+00

Pos logit contributions
agna+4.454
atmeal+4.441
umm+4.313
wy+4.310
affles+4.248
Neg logit contributions
,-0.145
.-0.107
 and-0.073
-0.013
 with+0.310
Token very
Token ID845
Feature activation-7.13e+00

Pos logit contributions
.+0.182
,+0.168
 and+0.164
+0.119
 to+0.117
Neg logit contributions
 Majesty-0.388
Ta-0.379
 sidelines-0.379
agna-0.377
 former-0.376
Token cheap
Token ID7026
Feature activation+3.04e+01

Pos logit contributions
.+0.246
,+0.229
 and+0.223
+0.177
 to+0.174
Neg logit contributions
 Majesty-0.347
Ta-0.340
umm-0.337
 sidelines-0.332
 former-0.331
Token.
Token ID13
Feature activation-3.78e+01

Pos logit contributions
umm+1.837
affles+1.827
 Majesty+1.810
agna+1.791
isher+1.784
Neg logit contributions
.-0.329
,-0.303
 and-0.273
-0.205
 with-0.112
Token She
Token ID1375
Feature activation+3.42e+00

Pos logit contributions
.+1.184
,+1.073
 and+1.024
 to+0.784
 with+0.780
Neg logit contributions
 Majesty-1.959
Ta-1.941
umm-1.927
 sidelines-1.882
agna-1.877
Token wished
Token ID16555
Feature activation+2.27e+01

Pos logit contributions
Ta+0.099
 Majesty+0.099
agna+0.093
umm+0.093
 majesty+0.092
Neg logit contributions
.-0.134
,-0.127
 and-0.122
 to-0.107
-0.106
Token they
Token ID484
Feature activation-2.39e+00

Pos logit contributions
 Majesty+1.023
 sidelines+1.014
Ta+0.993
 majesty+0.988
umm+0.983
Neg logit contributions
.-0.694
,-0.631
 and-0.603
 with-0.472
-0.471
Token had
Token ID550
Feature activation+1.66e+00

Pos logit contributions
.+0.082
,+0.077
 and+0.073
+0.058
 to+0.058
Neg logit contributions
Ta-0.111
agna-0.109
 Majesty-0.107
umm-0.105
 pulp-0.104
Token eaten
Token ID17065
Feature activation+3.29e+01

Pos logit contributions
 Majesty+0.069
Ta+0.069
agna+0.067
 majesty+0.066
umm+0.065
Neg logit contributions
.-0.066
,-0.063
 and-0.061
-0.050
 with-0.049
Token w
Token ID266
Feature activation+1.27e+01

Pos logit contributions
umm+1.626
Ta+1.600
 pulp+1.599
agna+1.576
 former+1.568
Neg logit contributions
.-0.808
,-0.759
 and-0.693
-0.550
 to-0.498
Tokenaffles
Token ID48501
Feature activation+6.89e+01

Pos logit contributions
Ta+0.113
 Majesty+0.110
umm+0.092
agna+0.081
 sidelines+0.081
Neg logit contributions
.-0.966
,-0.931
 and-0.917
 to-0.832
-0.826
Token.
Token ID13
Feature activation-3.80e+01

Pos logit contributions
agna+4.323
atmeal+4.253
wy+4.241
icious+4.178
umm+4.088
Neg logit contributions
.-0.181
,-0.132
-0.047
 and+0.003
 to+0.234
Token She
Token ID1375
Feature activation+2.92e+00

Pos logit contributions
.+1.082
,+1.000
 and+0.941
 to+0.715
 with+0.706
Neg logit contributions
Ta-2.038
 Majesty-2.012
umm-1.941
 former-1.933
 majesty-1.930
Token wished
Token ID16555
Feature activation+2.28e+01

Pos logit contributions
Ta+0.093
 Majesty+0.092
agna+0.086
 majesty+0.086
umm+0.086
Neg logit contributions
.-0.149
,-0.142
 and-0.136
 to-0.121
-0.119
Token they
Token ID484
Feature activation-2.27e+00

Pos logit contributions
 Majesty+1.021
 sidelines+1.007
 majesty+0.985
umm+0.980
agna+0.975
Neg logit contributions
.-0.698
,-0.634
 and-0.611
-0.476
 with-0.476
Token had
Token ID550
Feature activation+1.77e+00

Pos logit contributions
.+0.078
,+0.073
 and+0.069
 to+0.055
+0.055
Neg logit contributions
Ta-0.108
agna-0.105
 Majesty-0.104
umm-0.103
 pulp-0.101
Token
Token ID198
Feature activation-2.93e+01

Pos logit contributions
.+0.829
,+0.734
 and+0.672
 with+0.446
 to+0.439
Neg logit contributions
Ta-2.178
umm-2.158
 Majesty-2.146
 sidelines-2.088
 majesty-2.078
Token
Token ID198
Feature activation-2.93e+01

Pos logit contributions
.+0.522
,+0.485
 and+0.404
 to+0.256
 with+0.242
Neg logit contributions
Ta-1.701
 sidelines-1.552
reath-1.540
affles-1.530
icious-1.525
TokenWhen
Token ID2215
Feature activation+1.01e+01

Pos logit contributions
.+1.320
,+1.241
 and+1.201
 to+1.024
+1.011
Neg logit contributions
Ta-1.147
 Majesty-1.139
umm-1.106
 sidelines-1.089
agna-1.083
Token the
Token ID262
Feature activation-1.78e+01

Pos logit contributions
 Majesty+0.499
umm+0.498
Ta+0.496
 pulp+0.494
agna+0.494
Neg logit contributions
.-0.288
 and-0.257
,-0.245
-0.186
 with-0.183
Token w
Token ID266
Feature activation+1.21e+01

Pos logit contributions
.+0.779
,+0.731
 and+0.710
 to+0.600
+0.596
Neg logit contributions
 Majesty-0.712
Ta-0.711
umm-0.692
 sidelines-0.677
agna-0.676
Tokenaffles
Token ID48501
Feature activation+6.88e+01

Pos logit contributions
Ta+0.151
 Majesty+0.127
 pulp+0.114
bek+0.114
 former+0.114
Neg logit contributions
.-0.885
,-0.855
 and-0.844
 to-0.758
-0.750
Token were
Token ID547
Feature activation+1.98e+00

Pos logit contributions
Ta+3.394
atmeal+3.391
umm+3.386
 Majesty+3.368
agna+3.338
Neg logit contributions
.-2.019
,-1.703
 and-1.683
-1.366
 to-1.347
Token ready
Token ID3492
Feature activation+1.50e+01

Pos logit contributions
 Majesty+0.074
Ta+0.073
 sidelines+0.072
umm+0.071
agna+0.071
Neg logit contributions
.-0.087
,-0.080
 and-0.079
-0.068
 to-0.065
Token,
Token ID11
Feature activation-2.83e+01

Pos logit contributions
 Majesty+0.844
umm+0.837
 majesty+0.826
 sidelines+0.825
 pulp+0.814
Neg logit contributions
.-0.141
-0.060
 and-0.058
 with-0.016
,-0.003
Token Lily
Token ID20037
Feature activation+1.01e+01

Pos logit contributions
.+0.872
,+0.778
 and+0.764
 to+0.580
+0.579
Neg logit contributions
 Majesty-1.432
Ta-1.431
umm-1.402
 sidelines-1.399
agna-1.394
Token's
Token ID338
Feature activation-9.37e+00

Pos logit contributions
 Majesty+0.491
Ta+0.482
agna+0.472
 majesty+0.471
 pulp+0.465
Neg logit contributions
.-0.313
,-0.278
 and-0.252
-0.218
 with-0.209
Token store
Token ID3650
Feature activation+2.83e+01

Pos logit contributions
.+0.779
,+0.726
 and+0.707
 to+0.630
 with+0.600
Neg logit contributions
Ta-0.692
 Majesty-0.680
umm-0.676
agna-0.654
isher-0.651
Token and
Token ID290
Feature activation-2.15e+01

Pos logit contributions
 sidelines+1.734
 Majesty+1.724
 former+1.679
agna+1.658
itor+1.644
Neg logit contributions
.-0.247
,-0.135
-0.050
 and-0.043
 the+0.047
Token got
Token ID1392
Feature activation+7.64e+00

Pos logit contributions
.+0.892
,+0.831
 and+0.806
 to+0.671
+0.669
Neg logit contributions
 Majesty-0.910
Ta-0.910
umm-0.883
 sidelines-0.871
agna-0.870
Token the
Token ID262
Feature activation-1.73e+01

Pos logit contributions
Ta+0.436
 Majesty+0.434
umm+0.424
agna+0.424
 sidelines+0.423
Neg logit contributions
.-0.198
,-0.179
 and-0.169
-0.121
 to-0.116
Token y
Token ID331
Feature activation+1.86e+01

Pos logit contributions
.+0.777
,+0.724
 and+0.709
 to+0.613
+0.588
Neg logit contributions
 Majesty-0.697
Ta-0.695
umm-0.669
 sidelines-0.662
agna-0.657
Tokenumm
Token ID13929
Feature activation+6.88e+01

Pos logit contributions
umm+0.571
 Majesty+0.550
Ta+0.544
 sidelines+0.528
agna+0.516
Neg logit contributions
.-0.983
,-0.929
 and-0.915
-0.813
 to-0.804
Tokeniest
Token ID6386
Feature activation+4.15e+01

Pos logit contributions
umm+1.312
isher+1.136
atmeal+1.057
wy+0.994
Ta+0.974
Neg logit contributions
.-4.097
,-3.961
 and-3.917
-3.775
 to-3.565
Token ice
Token ID4771
Feature activation+3.45e+01

Pos logit contributions
umm+1.561
 Majesty+1.549
 former+1.481
 sidelines+1.470
Ta+1.463
Neg logit contributions
.-1.710
,-1.611
 and-1.564
-1.416
 to-1.364
Token cream
Token ID8566
Feature activation+4.87e+01

Pos logit contributions
umm+0.827
atmeal+0.807
isher+0.760
agna+0.745
 pulp+0.711
Neg logit contributions
.-1.799
,-1.772
 and-1.710
-1.622
 to-1.502
Token they
Token ID484
Feature activation-1.72e+00

Pos logit contributions
umm+3.081
agna+2.981
atmeal+2.926
isher+2.911
 pulp+2.909
Neg logit contributions
.-0.526
,-0.482
 and-0.408
-0.322
 with-0.113
Token could
Token ID714
Feature activation-4.09e-01

Pos logit contributions
.+0.060
,+0.056
 and+0.053
+0.043
 to+0.043
Neg logit contributions
Ta-0.082
 Majesty-0.082
umm-0.080
agna-0.079
 sidelines-0.079
Token.
Token ID13
Feature activation-3.83e+01

Pos logit contributions
umm+0.592
 Majesty+0.581
 majesty+0.559
atmeal+0.554
 sidelines+0.549
Neg logit contributions
,-0.068
.-0.047
-0.034
 and-0.027
 the+0.010
Token She
Token ID1375
Feature activation+2.76e+00

Pos logit contributions
.+1.062
,+0.963
 and+0.901
 with+0.670
 to+0.665
Neg logit contributions
 Majesty-2.057
Ta-1.994
umm-1.993
 majesty-1.984
 sidelines-1.942
Token ate
Token ID15063
Feature activation+2.86e+01

Pos logit contributions
 Majesty+0.113
Ta+0.112
agna+0.108
 majesty+0.108
umm+0.106
Neg logit contributions
.-0.114
,-0.105
 and-0.100
 to-0.086
-0.085
Token the
Token ID262
Feature activation-1.76e+01

Pos logit contributions
agna+1.411
 pulp+1.411
wy+1.405
umm+1.386
Ta+1.385
Neg logit contributions
.-0.681
,-0.569
 and-0.508
 to-0.409
-0.391
Token w
Token ID266
Feature activation+1.22e+01

Pos logit contributions
.+0.702
,+0.655
 and+0.632
 to+0.528
+0.522
Neg logit contributions
Ta-0.755
 Majesty-0.751
umm-0.740
 sidelines-0.724
agna-0.719
Tokenaffles
Token ID48501
Feature activation+6.87e+01

Pos logit contributions
Ta+0.122
 Majesty+0.115
umm+0.093
 former+0.089
agna+0.088
Neg logit contributions
.-0.905
,-0.870
 and-0.863
 to-0.777
-0.773
Token with
Token ID351
Feature activation-8.02e+00

Pos logit contributions
atmeal+4.179
umm+4.123
agna+4.043
wy+3.987
 sidelines+3.959
Neg logit contributions
.-0.838
,-0.572
-0.418
 and-0.393
 to-0.190
Token the
Token ID262
Feature activation-1.76e+01

Pos logit contributions
.+0.229
,+0.208
 and+0.198
 to+0.151
 with+0.146
Neg logit contributions
Ta-0.436
 Majesty-0.430
umm-0.424
 sidelines-0.419
agna-0.414
Token syrup
Token ID24208
Feature activation+5.33e+01

Pos logit contributions
.+0.787
,+0.740
 and+0.719
 to+0.610
+0.607
Neg logit contributions
 Majesty-0.687
Ta-0.685
umm-0.667
 sidelines-0.653
agna-0.652
Token and
Token ID290
Feature activation-2.20e+01

Pos logit contributions
atmeal+3.340
umm+3.340
wy+3.219
agna+3.194
pron+3.190
Neg logit contributions
.-0.346
,-0.199
-0.153
 and-0.045
 to+0.059
Token smiled
Token ID13541
Feature activation+1.92e+01

Pos logit contributions
.+0.871
,+0.806
 and+0.773
+0.648
 to+0.647
Neg logit contributions
 Majesty-0.961
Ta-0.953
umm-0.924
agna-0.914
 majesty-0.913
Token She
Token ID1375
Feature activation+2.73e+00

Pos logit contributions
.+1.227
,+1.112
 and+1.093
 to+0.871
 with+0.830
Neg logit contributions
Ta-1.882
umm-1.868
 Majesty-1.816
 sidelines-1.790
 former-1.769
Token loved
Token ID6151
Feature activation+8.63e+00

Pos logit contributions
Ta+0.119
 Majesty+0.114
 majesty+0.111
agna+0.108
atmeal+0.106
Neg logit contributions
.-0.102
,-0.095
 and-0.091
 to-0.077
-0.074
Token to
Token ID284
Feature activation-1.77e+01

Pos logit contributions
 Majesty+0.606
Ta+0.606
umm+0.595
 sidelines+0.591
agna+0.589
Neg logit contributions
.-0.112
,-0.088
 and-0.077
-0.023
 with-0.019
Token eat
Token ID4483
Feature activation+2.18e+01

Pos logit contributions
.+0.702
,+0.653
 and+0.633
 to+0.524
+0.519
Neg logit contributions
 Majesty-0.784
Ta-0.782
umm-0.764
 sidelines-0.749
agna-0.748
Token w
Token ID266
Feature activation+1.18e+01

Pos logit contributions
umm+0.574
 pulp+0.574
affles+0.570
 Majesty+0.569
agna+0.564
Neg logit contributions
.-1.089
,-1.014
 and-0.987
-0.906
 to-0.882
Tokenaffles
Token ID48501
Feature activation+6.87e+01

Pos logit contributions
Ta+0.112
 Majesty+0.109
umm+0.093
agna+0.083
 former+0.083
Neg logit contributions
.-0.891
,-0.859
 and-0.846
 to-0.767
 with-0.765
Token for
Token ID329
Feature activation-7.88e+00

Pos logit contributions
atmeal+3.938
affles+3.874
agna+3.766
icious+3.673
wy+3.665
Neg logit contributions
.-0.698
,-0.578
-0.532
 and-0.512
 to-0.267
Token breakfast
Token ID12607
Feature activation+3.80e+01

Pos logit contributions
.+0.395
,+0.376
 and+0.367
+0.318
 to+0.317
Neg logit contributions
 Majesty-0.256
Ta-0.252
agna-0.244
umm-0.242
 sidelines-0.242
Token every
Token ID790
Feature activation-8.08e+00

Pos logit contributions
atmeal+2.318
agna+2.297
umm+2.263
icious+2.246
affles+2.235
Neg logit contributions
.-0.315
,-0.276
 and-0.254
-0.149
 to-0.094
Token day
Token ID1110
Feature activation+1.52e+01

Pos logit contributions
.+0.315
,+0.296
 and+0.287
+0.235
 with+0.234
Neg logit contributions
Ta-0.344
 Majesty-0.342
umm-0.337
 sidelines-0.330
agna-0.324
Token.
Token ID13
Feature activation-3.81e+01

Pos logit contributions
umm+0.920
atmeal+0.891
pron+0.884
agna+0.877
 sidelines+0.873
Neg logit contributions
,-0.044
 and-0.037
.-0.005
+0.018
 the+0.053
Token there
Token ID612
Feature activation-8.26e+00

Pos logit contributions
.+0.045
 and+0.027
 to+0.005
,+0.000
-0.005
Neg logit contributions
 Majesty-0.284
 majesty-0.278
Ta-0.268
umm-0.260
ubb-0.259
Token was
Token ID373
Feature activation-1.79e+01

Pos logit contributions
.+0.207
 and+0.176
,+0.170
 to+0.152
 with+0.124
Neg logit contributions
Ta-0.387
 Majesty-0.376
Uh-0.364
 majesty-0.363
umm-0.361
Token a
Token ID257
Feature activation-2.35e+01

Pos logit contributions
.+0.307
,+0.257
 and+0.232
 to+0.124
 with+0.116
Neg logit contributions
Ta-1.192
 Majesty-1.161
umm-1.138
 sidelines-1.130
 former-1.121
Token yellow
Token ID7872
Feature activation+1.06e+01

Pos logit contributions
.+0.776
,+0.725
 and+0.696
+0.536
 with+0.524
Neg logit contributions
Ta-1.227
 Majesty-1.209
umm-1.205
 sidelines-1.171
 former-1.166
Token pol
Token ID755
Feature activation+2.44e+01

Pos logit contributions
 Majesty+0.373
Ta+0.364
umm+0.357
 sidelines+0.350
 majesty+0.349
Neg logit contributions
.-0.496
,-0.465
 and-0.452
 to-0.397
-0.397
Tokenisher
Token ID4828
Feature activation+6.86e+01

Pos logit contributions
 Majesty+0.718
Ta+0.717
umm+0.689
 sidelines+0.674
agna+0.670
Neg logit contributions
.-1.340
,-1.272
 and-1.245
 to-1.091
-1.083
Token.
Token ID13
Feature activation-4.52e+01

Pos logit contributions
 Majesty+4.593
 majesty+4.444
atmeal+4.409
isher+4.397
umm+4.366
Neg logit contributions
.-0.196
,-0.183
 and-0.061
+0.002
 the+0.187
Token It
Token ID632
Feature activation+8.48e+00

Pos logit contributions
.+1.544
,+1.418
 and+1.367
 to+1.099
 with+1.057
Neg logit contributions
Ta-2.232
 Majesty-2.224
umm-2.196
 sidelines-2.132
agna-2.128
Token was
Token ID373
Feature activation-1.64e+01

Pos logit contributions
Ta+0.474
 Majesty+0.464
 majesty+0.450
umm+0.449
 former+0.447
Neg logit contributions
.-0.207
,-0.188
 and-0.172
 to-0.131
 with-0.116
Token very
Token ID845
Feature activation-1.36e+01

Pos logit contributions
.+0.520
,+0.468
 and+0.450
+0.353
 to+0.345
Neg logit contributions
Ta-0.837
 Majesty-0.834
umm-0.815
 former-0.801
 sidelines-0.800
Token special
Token ID2041
Feature activation+1.26e+01

Pos logit contributions
.+0.554
,+0.517
 and+0.499
 to+0.416
+0.412
Neg logit contributions
 Majesty-0.596
Ta-0.595
umm-0.579
agna-0.569
 sidelines-0.569
Token come
Token ID1282
Feature activation+7.72e+00

Pos logit contributions
.+1.068
,+0.983
 and+0.951
+0.774
 to+0.773
Neg logit contributions
 Majesty-1.329
Ta-1.322
umm-1.286
agna-1.274
 sidelines-1.269
Token and
Token ID290
Feature activation-2.20e+01

Pos logit contributions
 Majesty+0.374
 majesty+0.359
Ta+0.355
agna+0.353
 former+0.347
Neg logit contributions
.-0.218
,-0.191
 and-0.177
-0.142
 to-0.129
Token eat
Token ID4483
Feature activation+2.16e+01

Pos logit contributions
.+0.862
,+0.807
 and+0.778
 to+0.656
+0.638
Neg logit contributions
Ta-0.991
 Majesty-0.991
umm-0.984
agna-0.951
 sidelines-0.944
Token your
Token ID534
Feature activation+1.00e+01

Pos logit contributions
affles+0.861
 Majesty+0.858
agna+0.858
wy+0.842
icious+0.841
Neg logit contributions
.-0.713
,-0.656
 and-0.640
-0.566
 to-0.539
Token w
Token ID266
Feature activation+1.20e+01

Pos logit contributions
 Majesty+0.417
Ta+0.407
umm+0.405
 majesty+0.393
agna+0.393
Neg logit contributions
.-0.404
,-0.380
 and-0.370
-0.308
 to-0.307
Tokenaffles
Token ID48501
Feature activation+6.86e+01

Pos logit contributions
Ta+0.110
 Majesty+0.099
umm+0.084
 sidelines+0.078
agna+0.077
Neg logit contributions
.-0.912
,-0.877
 and-0.864
 to-0.784
-0.779
Token.
Token ID13
Feature activation-3.80e+01

Pos logit contributions
atmeal+3.774
wy+3.750
agna+3.710
 Majesty+3.705
icious+3.655
Neg logit contributions
.-1.084
,-0.981
 and-0.909
-0.855
 to-0.635
Token They
Token ID1119
Feature activation+3.41e+00

Pos logit contributions
.+1.514
,+1.414
 and+1.376
+1.155
 to+1.134
Neg logit contributions
Ta-1.669
 Majesty-1.653
umm-1.610
 sidelines-1.605
 former-1.571
Token are
Token ID389
Feature activation+6.42e+00

Pos logit contributions
 Majesty+0.154
Ta+0.154
umm+0.150
agna+0.148
 sidelines+0.148
Neg logit contributions
.-0.129
,-0.120
 and-0.116
 to-0.095
-0.095
Token hot
Token ID3024
Feature activation+1.82e+01

Pos logit contributions
 Majesty+0.302
Ta+0.294
umm+0.293
agna+0.292
 sidelines+0.289
Neg logit contributions
.-0.225
,-0.208
 and-0.199
-0.164
 to-0.159
Token and
Token ID290
Feature activation-2.18e+01

Pos logit contributions
umm+1.210
 Majesty+1.164
atmeal+1.143
wy+1.140
 majesty+1.135
Neg logit contributions
.-0.154
,-0.097
-0.060
 and-0.054
 to-0.015
Token just
Token ID655
Feature activation+2.91e+00

Pos logit contributions
 Majesty+2.880
umm+2.821
 sidelines+2.798
 majesty+2.767
agna+2.756
Neg logit contributions
.-0.559
,-0.410
 and-0.337
-0.331
 the-0.134
Token like
Token ID588
Feature activation-3.70e+00

Pos logit contributions
Ta+0.151
 Majesty+0.149
 former+0.143
 majesty+0.143
umm+0.143
Neg logit contributions
.-0.079
,-0.074
 and-0.068
-0.057
 with-0.052
Token a
Token ID257
Feature activation-1.45e+01

Pos logit contributions
.+0.085
,+0.076
 and+0.072
+0.051
 with+0.048
Neg logit contributions
 Majesty-0.213
Ta-0.211
 sidelines-0.208
umm-0.207
 majesty-0.204
Token real
Token ID1103
Feature activation+1.40e+01

Pos logit contributions
.+0.714
,+0.676
 and+0.657
 to+0.569
+0.561
Neg logit contributions
Ta-0.511
 Majesty-0.501
umm-0.492
agna-0.478
 sidelines-0.474
Token b
Token ID275
Feature activation+2.19e+01

Pos logit contributions
 Majesty+0.462
umm+0.438
 sidelines+0.428
Ta+0.427
 majesty+0.421
Neg logit contributions
.-0.667
,-0.633
 and-0.619
-0.559
 to-0.541
Tokeniker
Token ID18320
Feature activation+6.86e+01

Pos logit contributions
 Majesty+0.414
umm+0.404
Ta+0.402
 majesty+0.367
 sidelines+0.366
Neg logit contributions
.-1.401
,-1.349
 and-1.325
-1.204
 to-1.195
Token.
Token ID13
Feature activation-3.80e+01

Pos logit contributions
 Majesty+4.659
 majesty+4.517
agna+4.482
cher+4.433
atmeal+4.422
Neg logit contributions
.-0.300
,-0.195
 and-0.129
-0.068
 the+0.184
Token
Token ID220
Feature activation-6.07e+00

Pos logit contributions
.+0.952
,+0.856
 and+0.810
 to+0.580
 with+0.561
Neg logit contributions
Ta-2.171
 Majesty-2.148
umm-2.126
 majesty-2.091
agna-2.051
Token
Token ID198
Feature activation-2.90e+01

Pos logit contributions
.+0.090
,+0.082
 and+0.074
 to+0.035
 with+0.030
Neg logit contributions
 Majesty-0.400
Ta-0.386
 majesty-0.386
agna-0.382
umm-0.380
Token
Token ID198
Feature activation-2.90e+01

Pos logit contributions
.+0.496
,+0.430
 and+0.369
 to+0.204
 with+0.164
Neg logit contributions
Ta-1.944
 Majesty-1.851
 sidelines-1.832
 majesty-1.816
agna-1.803
TokenWith
Token ID3152
Feature activation+2.64e+01

Pos logit contributions
.+1.260
,+1.179
 and+1.153
+0.974
 to+0.969
Neg logit contributions
 Majesty-1.185
Ta-1.163
umm-1.147
agna-1.118
 sidelines-1.117
Token enjoyed
Token ID8359
Feature activation+2.66e+01

Pos logit contributions
 Majesty+0.364
Ta+0.350
 majesty+0.344
 former+0.342
umm+0.341
Neg logit contributions
.-0.344
,-0.319
 and-0.306
-0.261
 to-0.258
Token their
Token ID511
Feature activation+1.24e-01

Pos logit contributions
 Majesty+1.415
 former+1.366
agna+1.359
 sidelines+1.337
 majesty+1.335
Neg logit contributions
.-0.691
,-0.628
 and-0.590
-0.450
 to-0.428
Token y
Token ID331
Feature activation+1.81e+01

Pos logit contributions
Ta+0.005
 Majesty+0.005
umm+0.004
 sidelines+0.004
agna+0.004
Neg logit contributions
.-0.006
,-0.006
 and-0.005
 to-0.005
-0.005
Tokenummy
Token ID13513
Feature activation+3.27e+01

Pos logit contributions
umm+0.565
 Majesty+0.565
Ta+0.553
 sidelines+0.531
agna+0.525
Neg logit contributions
.-0.934
,-0.882
 and-0.866
-0.760
 to-0.756
Token w
Token ID266
Feature activation+1.18e+01

Pos logit contributions
 Majesty+1.147
umm+1.136
Ta+1.117
agna+1.086
 majesty+1.081
Neg logit contributions
.-1.490
,-1.387
 and-1.351
-1.201
 to-1.182
Tokenaffles
Token ID48501
Feature activation+6.86e+01

Pos logit contributions
Ta+0.139
 Majesty+0.124
umm+0.108
agna+0.105
 pulp+0.103
Neg logit contributions
.-0.869
,-0.834
 and-0.824
 to-0.742
-0.734
Token every
Token ID790
Feature activation-7.79e+00

Pos logit contributions
agna+4.154
atmeal+4.146
umm+4.043
 Majesty+4.032
wy+4.004
Neg logit contributions
.-0.660
,-0.547
-0.464
 and-0.416
 to-0.150
Token morning
Token ID3329
Feature activation+2.05e+01

Pos logit contributions
.+0.305
,+0.288
 and+0.279
+0.234
 with+0.231
Neg logit contributions
Ta-0.314
 Majesty-0.313
umm-0.311
 sidelines-0.304
 former-0.301
Token.
Token ID13
Feature activation-3.80e+01

Pos logit contributions
umm+1.371
agna+1.362
 Majesty+1.333
 former+1.329
 sidelines+1.327
Neg logit contributions
.-0.054
,-0.044
 and-0.009
+0.024
 with+0.100
Token The
Token ID383
Feature activation-4.27e+00

Pos logit contributions
.+0.707
,+0.627
 and+0.557
 to+0.343
 with+0.329
Neg logit contributions
Ta-2.392
umm-2.327
 Majesty-2.284
agna-2.239
 majesty-2.237
Token end
Token ID886
Feature activation+6.28e+00

Pos logit contributions
.+0.166
,+0.156
 and+0.151
+0.121
 to+0.117
Neg logit contributions
Ta-0.198
 Majesty-0.197
umm-0.192
agna-0.191
 sidelines-0.189
Token they
Token ID484
Feature activation-2.24e+00

Pos logit contributions
.+0.198
,+0.182
 and+0.178
+0.137
 with+0.125
Neg logit contributions
 Majesty-0.403
 majesty-0.391
agna-0.390
 pulp-0.383
Ta-0.382
Token watched
Token ID7342
Feature activation+1.97e+01

Pos logit contributions
.+0.094
,+0.088
 and+0.085
 to+0.071
+0.071
Neg logit contributions
 Majesty-0.094
Ta-0.094
umm-0.092
 sidelines-0.090
agna-0.090
Token the
Token ID262
Feature activation-1.75e+01

Pos logit contributions
 Majesty+1.013
 sidelines+1.001
agna+0.986
umm+0.968
ocol+0.958
Neg logit contributions
.-0.446
,-0.420
 and-0.398
-0.320
 to-0.288
Token pretend
Token ID16614
Feature activation+2.23e+01

Pos logit contributions
.+0.650
,+0.598
 and+0.580
 to+0.469
+0.467
Neg logit contributions
 Majesty-0.832
Ta-0.831
umm-0.804
agna-0.801
 majesty-0.788
Token w
Token ID266
Feature activation+1.23e+01

Pos logit contributions
 Majesty+0.734
umm+0.718
Ta+0.710
 sidelines+0.694
agna+0.690
Neg logit contributions
.-1.084
,-1.036
 and-1.007
-0.876
 to-0.865
Tokenaffles
Token ID48501
Feature activation+6.86e+01

Pos logit contributions
Ta+0.112
 Majesty+0.105
umm+0.088
agna+0.082
 former+0.081
Neg logit contributions
.-0.928
,-0.892
 and-0.879
 to-0.799
-0.795
Token fly
Token ID6129
Feature activation+3.25e+01

Pos logit contributions
atmeal+4.153
agna+4.142
 sidelines+4.092
 Majesty+4.082
umm+4.040
Neg logit contributions
.-0.743
,-0.698
 and-0.595
-0.472
 to-0.284
Token high
Token ID1029
Feature activation+1.69e+01

Pos logit contributions
 Majesty+1.959
umm+1.921
wy+1.914
 pulp+1.903
 sidelines+1.901
Neg logit contributions
.-0.302
,-0.289
 and-0.235
-0.193
 to-0.043
Token up
Token ID510
Feature activation+1.15e+01

Pos logit contributions
umm+1.079
 Majesty+1.037
 former+1.029
agna+1.012
isher+1.005
Neg logit contributions
.-0.117
,-0.116
-0.114
 and-0.077
 the-0.004
Token into
Token ID656
Feature activation+2.36e+00

Pos logit contributions
umm+0.588
 former+0.587
 pulp+0.577
 avail+0.556
isher+0.555
Neg logit contributions
,-0.157
.-0.151
 and-0.126
-0.126
 that-0.068
Token the
Token ID262
Feature activation-1.74e+01

Pos logit contributions
 Majesty+0.160
Ta+0.160
umm+0.157
 sidelines+0.156
agna+0.155
Neg logit contributions
.-0.036
,-0.030
 and-0.027
 to-0.012
-0.012
Token impatient
Token ID33440
Feature activation+2.37e+01

Pos logit contributions
.+0.313
,+0.283
 and+0.269
 to+0.209
+0.206
Neg logit contributions
 Majesty-0.530
Ta-0.529
umm-0.514
 sidelines-0.511
 former-0.507
Token.
Token ID13
Feature activation-3.88e+01

Pos logit contributions
umm+1.569
 sidelines+1.455
 Majesty+1.447
 former+1.425
isher+1.410
Neg logit contributions
.-0.090
,-0.055
-0.053
 and-0.013
 the+0.041
Token She
Token ID1375
Feature activation+2.78e+00

Pos logit contributions
.+1.217
,+1.121
 and+1.069
 to+0.840
 with+0.828
Neg logit contributions
 Majesty-1.983
umm-1.978
Ta-1.948
 former-1.895
 sidelines-1.895
Token wanted
Token ID2227
Feature activation+6.56e+00

Pos logit contributions
 Majesty+0.128
Ta+0.128
umm+0.124
agna+0.123
 sidelines+0.123
Neg logit contributions
.-0.102
,-0.095
 and-0.091
 to-0.075
-0.075
Token w
Token ID266
Feature activation+1.18e+01

Pos logit contributions
Ta+0.437
 Majesty+0.435
 sidelines+0.430
umm+0.429
 former+0.426
Neg logit contributions
.-0.090
,-0.072
 and-0.063
-0.026
 with-0.018
Tokenaffles
Token ID48501
Feature activation+6.85e+01

Pos logit contributions
Ta+0.146
 Majesty+0.141
umm+0.119
 pulp+0.116
agna+0.115
Neg logit contributions
.-0.862
,-0.832
 and-0.821
 to-0.739
 with-0.735
Token for
Token ID329
Feature activation-7.77e+00

Pos logit contributions
atmeal+3.701
agna+3.647
icious+3.611
affles+3.559
reath+3.511
Neg logit contributions
.-0.928
 and-0.748
,-0.699
-0.672
 the-0.300
Token breakfast
Token ID12607
Feature activation+3.78e+01

Pos logit contributions
.+0.286
,+0.268
 and+0.261
+0.222
 to+0.214
Neg logit contributions
 Majesty-0.329
agna-0.319
 sidelines-0.317
affles-0.315
 former-0.312
Token.
Token ID13
Feature activation-3.86e+01

Pos logit contributions
umm+2.371
agna+2.362
icious+2.330
atmeal+2.328
affles+2.239
Neg logit contributions
.-0.229
 and-0.157
,-0.149
-0.085
 to+0.011
Token She
Token ID1375
Feature activation+2.87e+00

Pos logit contributions
.+1.290
,+1.197
 and+1.145
 to+0.922
 with+0.915
Neg logit contributions
umm-1.835
 Majesty-1.802
Ta-1.757
 former-1.711
 sidelines-1.707
Token went
Token ID1816
Feature activation+6.97e+00

Pos logit contributions
 Majesty+0.137
Ta+0.137
umm+0.134
agna+0.131
 sidelines+0.131
Neg logit contributions
.-0.103
,-0.095
 and-0.092
 to-0.074
-0.074
Token on
Token ID319
Feature activation-3.24e+00

Pos logit contributions
Ta+0.238
umm+0.233
 sidelines+0.227
 Majesty+0.227
agna+0.226
Neg logit contributions
.-0.091
,-0.072
 and-0.071
 to-0.049
-0.046
Token,
Token ID11
Feature activation-2.86e+01

Pos logit contributions
.+0.031
 and+0.015
,+0.006
+0.003
 to+0.001
Neg logit contributions
 Majesty-0.214
 majesty-0.206
agna-0.204
heit-0.203
 sidelines-0.203
Token the
Token ID262
Feature activation-1.75e+01

Pos logit contributions
.+0.729
,+0.629
 and+0.604
 to+0.431
+0.420
Neg logit contributions
 Majesty-1.605
umm-1.589
Ta-1.576
 sidelines-1.569
agna-1.564
Token two
Token ID734
Feature activation+9.27e+00

Pos logit contributions
.+0.651
,+0.606
 and+0.585
+0.472
 to+0.468
Neg logit contributions
 Majesty-0.833
Ta-0.831
umm-0.806
agna-0.796
 majesty-0.791
Token w
Token ID266
Feature activation+1.21e+01

Pos logit contributions
 Majesty+0.399
Ta+0.398
umm+0.396
 sidelines+0.386
agna+0.383
Neg logit contributions
.-0.363
,-0.333
 and-0.323
 to-0.273
-0.273
Tokenaffles
Token ID48501
Feature activation+6.85e+01

Pos logit contributions
Ta+0.143
 Majesty+0.126
 pulp+0.113
agna+0.109
umm+0.107
Neg logit contributions
.-0.880
,-0.850
 and-0.838
 to-0.750
-0.748
Token were
Token ID547
Feature activation+1.83e+00

Pos logit contributions
 Majesty+3.300
Ta+3.288
umm+3.287
agna+3.239
atmeal+3.211
Neg logit contributions
.-2.096
,-1.842
 and-1.746
-1.491
 to-1.389
Token never
Token ID1239
Feature activation+1.30e+01

Pos logit contributions
 Majesty+0.093
 majesty+0.088
Ta+0.088
agna+0.087
umm+0.087
Neg logit contributions
.-0.058
,-0.052
 and-0.051
-0.041
 to-0.038
Token apart
Token ID5475
Feature activation+2.42e+01

Pos logit contributions
 Majesty+0.479
Ta+0.470
umm+0.459
 sidelines+0.454
 former+0.451
Neg logit contributions
.-0.581
,-0.546
 and-0.536
-0.465
 with-0.450
Token.
Token ID13
Feature activation-3.80e+01

Pos logit contributions
umm+1.246
 former+1.211
 Majesty+1.207
ubb+1.197
agna+1.143
Neg logit contributions
.-0.491
,-0.441
-0.424
 and-0.419
 with-0.270
Token Where
Token ID6350
Feature activation+2.20e+01

Pos logit contributions
.+1.225
,+1.138
 and+1.083
 to+0.848
 with+0.840
Neg logit contributions
Ta-1.933
 Majesty-1.921
umm-1.918
agna-1.851
 former-1.843
Token girl
Token ID2576
Feature activation+2.02e+01

Pos logit contributions
.+0.294
,+0.274
 and+0.267
+0.234
 to+0.230
Neg logit contributions
 Majesty-0.243
umm-0.232
Ta-0.230
 sidelines-0.227
 majesty-0.226
Token called
Token ID1444
Feature activation-8.25e+00

Pos logit contributions
 Majesty+1.032
 majesty+0.981
 pulp+0.941
Ta+0.938
 sidelines+0.926
Neg logit contributions
.-0.506
,-0.447
 and-0.440
-0.351
 to-0.347
Token L
Token ID406
Feature activation+8.21e+00

Pos logit contributions
.+0.337
 and+0.315
,+0.314
+0.261
 to+0.256
Neg logit contributions
 Majesty-0.325
Ta-0.307
 sidelines-0.306
 majesty-0.304
agna-0.301
Tokenizz
Token ID6457
Feature activation+4.46e+01

Pos logit contributions
Ta+0.169
 Majesty+0.166
 sidelines+0.150
 majesty+0.149
umm+0.148
Neg logit contributions
.-0.492
,-0.481
 and-0.465
 to-0.424
 with-0.418
Tokenie
Token ID494
Feature activation+1.76e+01

Pos logit contributions
 Majesty+1.397
isher+1.308
wy+1.284
 majesty+1.278
Ta+1.245
Neg logit contributions
.-1.973
,-1.949
 and-1.823
-1.683
 to-1.646
Token.
Token ID13
Feature activation-4.70e+01

Pos logit contributions
 Majesty+1.288
 majesty+1.224
 former+1.202
 sidelines+1.160
agna+1.157
Neg logit contributions
.-0.033
,-0.011
 and-0.010
+0.072
 to+0.078
Token She
Token ID1375
Feature activation-4.62e+00

Pos logit contributions
.+1.615
,+1.484
 and+1.437
 to+1.163
 with+1.120
Neg logit contributions
 Majesty-2.291
Ta-2.239
umm-2.200
 majesty-2.166
 former-2.166
Token liked
Token ID8288
Feature activation+9.36e-01

Pos logit contributions
.+0.142
,+0.127
 and+0.121
 to+0.096
+0.094
Neg logit contributions
 Majesty-0.237
Ta-0.234
 former-0.226
 majesty-0.226
agna-0.224
Token to
Token ID284
Feature activation-2.42e+01

Pos logit contributions
 Majesty+0.063
Ta+0.062
umm+0.061
 sidelines+0.061
 majesty+0.061
Neg logit contributions
.-0.013
,-0.011
 and-0.009
-0.004
 with-0.003
Token perform
Token ID1620
Feature activation+2.97e+01

Pos logit contributions
.+1.054
,+0.988
 and+0.960
 to+0.811
+0.806
Neg logit contributions
 Majesty-0.974
Ta-0.971
umm-0.944
 sidelines-0.926
agna-0.924
Token different
Token ID1180
Feature activation+3.23e+00

Pos logit contributions
 Majesty+1.696
 majesty+1.633
reath+1.603
 former+1.598
agna+1.595
Neg logit contributions
.-0.483
,-0.429
 and-0.400
-0.330
 to-0.260
Token friendly
Token ID8030
Feature activation+8.43e+00

Pos logit contributions
agna+2.133
 Majesty+2.121
 former+2.054
 supervision+2.054
uries+2.017
Neg logit contributions
.-0.339
,-0.245
 and-0.175
-0.145
 the-0.027
Token rabbit
Token ID22746
Feature activation+2.23e+01

Pos logit contributions
 Majesty+0.495
 majesty+0.473
umm+0.464
 sidelines+0.459
 former+0.457
Neg logit contributions
.-0.174
,-0.154
 and-0.141
-0.110
 with-0.101
Token named
Token ID3706
Feature activation-1.24e+00

Pos logit contributions
 Majesty+1.475
 majesty+1.425
 former+1.378
 sidelines+1.366
Ta+1.356
Neg logit contributions
.-0.193
,-0.150
 and-0.123
-0.043
 with-0.019
Token Ros
Token ID10018
Feature activation+2.10e+01

Pos logit contributions
.+0.049
,+0.047
 and+0.045
 with+0.037
 to+0.037
Neg logit contributions
 Majesty-0.049
Ta-0.048
 sidelines-0.047
 majesty-0.047
agna-0.046
Tokenie
Token ID494
Feature activation+1.74e+01

Pos logit contributions
 Majesty+0.669
 former+0.621
wy+0.612
 majesty+0.604
isher+0.586
Neg logit contributions
.-0.936
,-0.929
 and-0.885
 to-0.823
 with-0.807
Token.
Token ID13
Feature activation-4.70e+01

Pos logit contributions
 Majesty+1.249
 majesty+1.184
 former+1.125
 pulp+1.081
agna+1.078
Neg logit contributions
.-0.039
 and-0.036
,-0.016
 to+0.026
 with+0.044
Token
Token ID198
Feature activation-3.64e+01

Pos logit contributions
.+1.864
,+1.731
 and+1.692
 to+1.433
 with+1.369
Neg logit contributions
 Majesty-2.043
Ta-1.971
umm-1.963
 former-1.927
 majesty-1.882
Token
Token ID198
Feature activation-3.57e+01

Pos logit contributions
.+0.617
,+0.544
 and+0.478
 to+0.263
 with+0.240
Neg logit contributions
Ta-2.377
 Majesty-2.329
 former-2.262
 sidelines-2.240
 majesty-2.226
TokenRos
Token ID35740
Feature activation+3.81e+01

Pos logit contributions
.+1.506
,+1.410
 and+1.366
 to+1.148
+1.124
Neg logit contributions
Ta-1.488
 Majesty-1.481
umm-1.431
 sidelines-1.414
agna-1.403
Tokenie
Token ID494
Feature activation+1.80e+01

Pos logit contributions
 Majesty+1.291
Ta+1.266
 majesty+1.192
 former+1.185
umm+1.178
Neg logit contributions
.-1.835
,-1.722
 and-1.660
 to-1.491
-1.477
Token warned
Token ID7728
Feature activation+1.04e+01

Pos logit contributions
 Majesty+0.922
 majesty+0.866
Ta+0.863
 former+0.854
agna+0.825
Neg logit contributions
.-0.538
,-0.455
 and-0.430
 to-0.362
-0.356
Token<|endoftext|>
Token ID50256
Feature activation-3.40e+00

Pos logit contributions
 Majesty+0.792
Ta+0.790
 majesty+0.754
agna+0.751
 sidelines+0.749
Neg logit contributions
.-0.674
,-0.616
 and-0.612
 to-0.493
 with-0.490
TokenL
Token ID43
Feature activation+9.80e+00

Pos logit contributions
.+0.091
,+0.087
 and+0.083
 to+0.068
 with+0.057
Neg logit contributions
Ta-0.171
 Majesty-0.158
umm-0.156
 sidelines-0.155
Whe-0.153
Tokenily
Token ID813
Feature activation+2.10e+01

Pos logit contributions
Ta+0.316
 Majesty+0.307
 majesty+0.290
umm+0.282
 sidelines+0.279
Neg logit contributions
.-0.491
,-0.471
 and-0.451
 to-0.404
 with-0.398
Token loved
Token ID6151
Feature activation-8.96e-01

Pos logit contributions
 Majesty+1.324
 majesty+1.294
Ta+1.282
 pulp+1.251
 former+1.241
Neg logit contributions
.-0.166
,-0.099
 to-0.037
 and-0.012
 with-0.009
Token ponies
Token ID46370
Feature activation+4.12e+01

Pos logit contributions
.+0.018
,+0.016
 and+0.015
+0.010
 with+0.009
Neg logit contributions
 Majesty-0.055
Ta-0.055
 sidelines-0.054
umm-0.054
 majesty-0.053
Token.
Token ID13
Feature activation-4.69e+01

Pos logit contributions
atmeal+2.355
agna+2.247
ode+2.216
wy+2.208
 former+2.203
Neg logit contributions
.-0.221
-0.129
,-0.110
 and-0.073
 to-0.040
Token She
Token ID1375
Feature activation-4.51e+00

Pos logit contributions
.+1.494
,+1.374
 and+1.316
 to+1.036
 with+1.006
Neg logit contributions
 Majesty-2.396
umm-2.370
Ta-2.361
 majesty-2.284
agna-2.280
Token had
Token ID550
Feature activation-7.26e+00

Pos logit contributions
.+0.168
,+0.154
 and+0.145
 to+0.125
 with+0.123
Neg logit contributions
 Majesty-0.191
Ta-0.188
 majesty-0.186
agna-0.177
 pulp-0.176
Token a
Token ID257
Feature activation-2.16e+01

Pos logit contributions
.+0.169
,+0.152
 and+0.140
 with+0.093
 to+0.091
Neg logit contributions
Ta-0.430
 Majesty-0.429
 majesty-0.413
umm-0.413
agna-0.412
Token toy
Token ID13373
Feature activation+1.96e+01

Pos logit contributions
.+0.793
,+0.740
 and+0.717
 to+0.581
+0.569
Neg logit contributions
Ta-1.038
 Majesty-1.025
umm-1.012
 sidelines-0.993
agna-0.985
Token pony
Token ID26902
Feature activation+3.78e+01

Pos logit contributions
 Majesty+0.675
umm+0.648
 majesty+0.641
 sidelines+0.640
agna+0.631
Neg logit contributions
.-0.859
,-0.797
 and-0.779
-0.705
 to-0.668
Token-
Token ID12
Feature activation+8.48e+00

Pos logit contributions
 Majesty+0.392
umm+0.391
Ta+0.387
 sidelines+0.375
agna+0.372
Neg logit contributions
.-0.411
,-0.385
 and-0.373
 to-0.319
-0.317
Tokenyear
Token ID1941
Feature activation+4.06e+01

Pos logit contributions
 Majesty+0.255
Ta+0.255
umm+0.244
 sidelines+0.238
agna+0.237
Neg logit contributions
.-0.453
,-0.431
 and-0.420
 to-0.367
-0.366
Token-
Token ID12
Feature activation+6.36e+00

Pos logit contributions
 Majesty+1.931
umm+1.905
Ta+1.902
 former+1.862
isher+1.836
Neg logit contributions
.-1.323
,-1.210
 and-1.132
-0.982
 with-0.893
Tokenold
Token ID727
Feature activation+3.31e+01

Pos logit contributions
 Majesty+0.168
Ta+0.166
umm+0.161
agna+0.156
 sidelines+0.156
Neg logit contributions
.-0.364
,-0.347
 and-0.340
 to-0.302
-0.299
Token child
Token ID1200
Feature activation+2.45e+01

Pos logit contributions
 Majesty+1.696
 sidelines+1.606
Ta+1.599
umm+1.588
 majesty+1.586
Neg logit contributions
.-0.856
,-0.768
 and-0.709
-0.604
 with-0.537
Token.
Token ID13
Feature activation-4.69e+01

Pos logit contributions
 Majesty+1.483
 majesty+1.411
agna+1.408
umm+1.405
Ta+1.393
Neg logit contributions
.-0.378
,-0.305
 and-0.298
-0.210
 with-0.168
Token
Token ID198
Feature activation-3.61e+01

Pos logit contributions
.+1.590
,+1.467
 and+1.411
 to+1.141
 with+1.093
Neg logit contributions
 Majesty-2.291
Ta-2.256
umm-2.204
 sidelines-2.188
 former-2.170
Token
Token ID198
Feature activation-3.55e+01

Pos logit contributions
.+0.766
,+0.695
 and+0.628
 to+0.410
 with+0.381
Neg logit contributions
Ta-2.235
 Majesty-2.203
 sidelines-2.147
 former-2.126
umm-2.116
TokenThe
Token ID464
Feature activation-2.63e+00

Pos logit contributions
.+1.478
,+1.375
 and+1.343
+1.142
 to+1.121
Neg logit contributions
 Majesty-1.501
Ta-1.478
umm-1.475
agna-1.431
 sidelines-1.418
Token bar
Token ID2318
Feature activation+2.27e+01

Pos logit contributions
.+0.119
,+0.112
 and+0.110
 to+0.091
+0.091
Neg logit contributions
Ta-0.103
 Majesty-0.101
umm-0.101
agna-0.098
 majesty-0.096
Tokenber
Token ID527
Feature activation+4.83e+01

Pos logit contributions
 Majesty+0.395
Ta+0.375
umm+0.360
 majesty+0.349
agna+0.344
Neg logit contributions
.-1.465
,-1.401
 and-1.376
 to-1.249
-1.246
Token little
Token ID1310
Feature activation-6.56e+00

Pos logit contributions
.+1.178
,+1.071
 and+1.035
+0.903
 to+0.891
Neg logit contributions
 Majesty-1.134
Ta-1.105
 majesty-1.084
umm-1.084
agna-1.071
Token bird
Token ID6512
Feature activation+2.22e+01

Pos logit contributions
.+0.294
,+0.274
 and+0.267
+0.234
 to+0.230
Neg logit contributions
 Majesty-0.243
umm-0.232
Ta-0.230
 sidelines-0.227
 majesty-0.226
Tokenie
Token ID494
Feature activation+1.74e+01

Pos logit contributions
 Majesty+1.117
 majesty+1.059
 pulp+1.018
Ta+1.014
 former+1.010
Neg logit contributions
.-0.566
,-0.513
 and-0.487
-0.412
 to-0.398
Token named
Token ID3706
Feature activation-9.75e-01

Pos logit contributions
 Majesty+0.948
 majesty+0.905
Ta+0.868
 pulp+0.865
 former+0.857
Neg logit contributions
.-0.435
,-0.392
 and-0.366
-0.295
 to-0.295
Token Tweet
Token ID18752
Feature activation+2.20e+01

Pos logit contributions
.+0.040
,+0.038
 and+0.037
 with+0.031
 to+0.031
Neg logit contributions
 Majesty-0.039
Ta-0.037
 sidelines-0.037
 majesty-0.036
umm-0.036
Token.
Token ID13
Feature activation-4.69e+01

Pos logit contributions
 Majesty+0.640
wy+0.602
 former+0.595
 majesty+0.584
Ta+0.551
Neg logit contributions
,-0.977
.-0.975
 and-0.919
 to-0.855
 with-0.846
Token Tweet
Token ID18752
Feature activation+2.32e+01

Pos logit contributions
.+2.157
,+2.008
 and+1.968
 to+1.718
 with+1.656
Neg logit contributions
 Majesty-1.761
umm-1.667
Ta-1.665
 former-1.623
 majesty-1.617
Token was
Token ID373
Feature activation-1.81e+01

Pos logit contributions
 Majesty+0.741
Ta+0.708
wy+0.690
 majesty+0.690
 former+0.685
Neg logit contributions
.-1.039
,-0.964
 and-0.916
 to-0.884
-0.823
Token very
Token ID845
Feature activation-1.50e+01

Pos logit contributions
.+0.572
,+0.501
 and+0.484
 to+0.377
+0.370
Neg logit contributions
 Majesty-0.929
Ta-0.904
 majesty-0.879
 former-0.877
 sidelines-0.864
Token happy
Token ID3772
Feature activation+3.27e+00

Pos logit contributions
.+0.628
,+0.586
 and+0.569
 to+0.476
+0.472
Neg logit contributions
 Majesty-0.634
Ta-0.632
umm-0.615
 sidelines-0.604
agna-0.602
Token and
Token ID290
Feature activation-2.94e+01

Pos logit contributions
 Majesty+0.181
umm+0.172
 majesty+0.172
 former+0.169
 habitats+0.163
Neg logit contributions
.-0.036
,-0.027
-0.025
 and-0.017
 the-0.010
Token<|endoftext|>
Token ID50256
Feature activation-3.40e+00

Pos logit contributions
.+0.251
,+0.233
 and+0.226
+0.188
 to+0.184
Neg logit contributions
 Majesty-0.312
Ta-0.304
agna-0.302
 sidelines-0.298
 majesty-0.297
TokenL
Token ID43
Feature activation+9.80e+00

Pos logit contributions
.+0.091
,+0.087
 and+0.083
 to+0.068
 with+0.057
Neg logit contributions
Ta-0.171
 Majesty-0.158
umm-0.156
 sidelines-0.155
Whe-0.153
Tokenily
Token ID813
Feature activation+2.10e+01

Pos logit contributions
Ta+0.316
 Majesty+0.307
 majesty+0.290
umm+0.282
 sidelines+0.279
Neg logit contributions
.-0.491
,-0.471
 and-0.451
 to-0.404
 with-0.398
Token was
Token ID373
Feature activation-1.97e+01

Pos logit contributions
 Majesty+1.324
 majesty+1.294
Ta+1.282
 pulp+1.251
 former+1.241
Neg logit contributions
.-0.166
,-0.099
 to-0.037
 and-0.012
 with-0.009
Token sad
Token ID6507
Feature activation+8.60e+00

Pos logit contributions
.+0.724
,+0.672
 and+0.635
 to+0.528
+0.518
Neg logit contributions
 Majesty-0.899
Ta-0.890
 sidelines-0.856
 former-0.854
 majesty-0.845
Token.
Token ID13
Feature activation-4.68e+01

Pos logit contributions
 former+0.556
 sidelines+0.543
 Majesty+0.537
umm+0.536
pron+0.527
Neg logit contributions
.-0.039
,-0.032
-0.015
 and-0.011
 the+0.011
Token She
Token ID1375
Feature activation-4.53e+00

Pos logit contributions
.+1.535
,+1.407
 and+1.356
 to+1.077
 with+1.036
Neg logit contributions
 Majesty-2.351
Ta-2.336
umm-2.312
 sidelines-2.250
agna-2.242
Token had
Token ID550
Feature activation-7.02e+00

Pos logit contributions
.+0.162
,+0.151
 and+0.143
 to+0.119
+0.118
Neg logit contributions
Ta-0.208
 Majesty-0.208
agna-0.198
 majesty-0.198
umm-0.197
Token a
Token ID257
Feature activation-2.17e+01

Pos logit contributions
.+0.253
,+0.235
 and+0.226
+0.181
 to+0.180
Neg logit contributions
Ta-0.333
 Majesty-0.332
umm-0.322
 sidelines-0.318
agna-0.317
Token cold
Token ID4692
Feature activation+1.43e+01

Pos logit contributions
.+0.882
,+0.841
 and+0.810
 to+0.701
 with+0.650
Neg logit contributions
Ta-0.977
umm-0.962
 Majesty-0.944
agna-0.921
 sidelines-0.918
Token and
Token ID290
Feature activation-2.86e+01

Pos logit contributions
 Majesty+0.605
Ta+0.599
umm+0.590
 majesty+0.577
 sidelines+0.576
Neg logit contributions
.-0.561
,-0.517
 and-0.498
-0.433
 to-0.422
Token Lily
Token ID20037
Feature activation-3.72e-01

Pos logit contributions
.+0.048
,+0.045
 and+0.044
 to+0.035
 with+0.035
Neg logit contributions
 Majesty-0.054
Ta-0.053
 sidelines-0.052
 majesty-0.051
agna-0.050
Token who
Token ID508
Feature activation-1.05e+01

Pos logit contributions
,+0.002
.+0.001
 and+0.001
 to+0.000
 with-0.000
Neg logit contributions
 Majesty-0.023
 majesty-0.021
 former-0.020
ppa-0.020
icians-0.020
Token loved
Token ID6151
Feature activation+4.09e-01

Pos logit contributions
.+0.360
,+0.336
 and+0.323
 to+0.269
 with+0.253
Neg logit contributions
 Majesty-0.479
Ta-0.475
 former-0.458
 majesty-0.454
umm-0.450
Token fair
Token ID3148
Feature activation+1.68e+01

Pos logit contributions
 Majesty+0.025
Ta+0.025
umm+0.025
 sidelines+0.024
agna+0.024
Neg logit contributions
.-0.009
,-0.008
 and-0.007
-0.005
 with-0.004
Tokenies
Token ID444
Feature activation+2.97e+01

Pos logit contributions
 Majesty+0.962
umm+0.952
 majesty+0.935
 former+0.933
isher+0.910
Neg logit contributions
.-0.259
,-0.230
-0.217
 and-0.187
 the-0.126
Token.
Token ID13
Feature activation-4.67e+01

Pos logit contributions
 former+1.641
 Majesty+1.619
 groom+1.616
ode+1.612
 highlight+1.608
Neg logit contributions
-0.095
.-0.071
,-0.055
 and+0.017
 to+0.052
Token She
Token ID1375
Feature activation-4.28e+00

Pos logit contributions
.+1.520
,+1.406
 and+1.339
 to+1.074
 with+1.043
Neg logit contributions
 Majesty-2.369
Ta-2.329
umm-2.324
 former-2.254
 sidelines-2.254
Token would
Token ID561
Feature activation+1.18e-02

Pos logit contributions
.+0.161
,+0.146
 and+0.140
+0.118
 to+0.118
Neg logit contributions
 Majesty-0.186
Ta-0.180
 majesty-0.177
 former-0.173
agna-0.169
Token go
Token ID467
Feature activation+3.02e+00

Pos logit contributions
 Majesty+0.001
Ta+0.001
umm+0.000
 sidelines+0.000
agna+0.000
Neg logit contributions
.-0.000
,-0.000
 and-0.000
 to-0.000
-0.000
Token to
Token ID284
Feature activation-2.43e+01

Pos logit contributions
 Majesty+0.186
Ta+0.184
 sidelines+0.182
 former+0.180
 majesty+0.179
Neg logit contributions
.-0.054
,-0.041
 and-0.035
-0.022
 with-0.015
Token the
Token ID262
Feature activation-2.36e+01

Pos logit contributions
.+0.499
,+0.431
 and+0.400
+0.241
 to+0.240
Neg logit contributions
 Majesty-1.517
Ta-1.493
 sidelines-1.467
 former-1.463
 majesty-1.448
Token.
Token ID13
Feature activation-4.18e+01

Pos logit contributions
.+0.819
,+0.764
 and+0.751
 to+0.649
 with+0.641
Neg logit contributions
 Majesty-0.622
Ta-0.616
umm-0.597
agna-0.582
 majesty-0.579
Token<|endoftext|>
Token ID50256
Feature activation-2.89e+00

Pos logit contributions
.+1.330
,+1.217
 and+1.169
 to+0.924
 with+0.885
Neg logit contributions
 Majesty-2.118
Ta-2.081
umm-2.028
 sidelines-2.020
 majesty-2.009
TokenSam
Token ID16305
Feature activation+2.57e+01

Pos logit contributions
.+0.069
,+0.064
 and+0.064
 to+0.051
 with+0.045
Neg logit contributions
Ta-0.140
umm-0.124
Whe-0.124
 Majesty-0.124
rob-0.123
Token loves
Token ID10408
Feature activation+1.39e+01

Pos logit contributions
 Majesty+1.139
Ta+1.117
wy+1.109
 majesty+1.093
ited+1.071
Neg logit contributions
.-0.710
,-0.631
 to-0.560
 and-0.554
 with-0.549
Token basketball
Token ID9669
Feature activation+3.55e+01

Pos logit contributions
 Majesty+0.654
Ta+0.637
umm+0.631
 sidelines+0.629
 majesty+0.621
Neg logit contributions
.-0.485
,-0.449
 and-0.433
-0.353
 with-0.340
Token.
Token ID13
Feature activation-4.67e+01

Pos logit contributions
 sidelines+2.253
umm+2.226
isher+2.184
atmeal+2.164
Ta+2.163
Neg logit contributions
.-0.089
,-0.078
-0.027
 and+0.004
 to+0.103
Token He
Token ID679
Feature activation-2.96e+00

Pos logit contributions
.+1.587
,+1.460
 and+1.408
 to+1.132
 with+1.091
Neg logit contributions
 Majesty-2.300
Ta-2.290
umm-2.270
 sidelines-2.192
agna-2.186
Token likes
Token ID7832
Feature activation+1.22e+01

Pos logit contributions
.+0.120
,+0.110
 and+0.104
 to+0.091
+0.089
Neg logit contributions
 Majesty-0.120
Ta-0.116
 majesty-0.114
 sidelines-0.110
umm-0.109
Token to
Token ID284
Feature activation-2.44e+01

Pos logit contributions
 sidelines+0.723
 Majesty+0.719
Ta+0.702
umm+0.696
 majesty+0.692
Neg logit contributions
.-0.227
,-0.198
 and-0.169
-0.108
 with-0.093
Token play
Token ID711
Feature activation+1.19e+01

Pos logit contributions
.+1.080
,+1.015
 and+0.989
 to+0.840
+0.837
Neg logit contributions
 Majesty-0.951
Ta-0.946
umm-0.915
 sidelines-0.911
agna-0.896
Token with
Token ID351
Feature activation-1.51e+01

Pos logit contributions
 sidelines+0.491
 Majesty+0.467
reath+0.456
Ta+0.455
affles+0.454
Neg logit contributions
.-0.395
,-0.366
 and-0.345
-0.305
 to-0.300
Token young
Token ID1862
Feature activation+9.87e+00

Pos logit contributions
 Majesty+2.643
 former+2.546
 majesty+2.507
agna+2.501
 sidelines+2.479
Neg logit contributions
.-0.679
,-0.581
 and-0.537
-0.366
 to-0.355
Token girl
Token ID2576
Feature activation+1.98e+01

Pos logit contributions
 Majesty+0.410
umm+0.387
 majesty+0.380
 sidelines+0.379
 former+0.373
Neg logit contributions
.-0.382
,-0.357
 and-0.343
-0.302
 to-0.290
Token named
Token ID3706
Feature activation-1.32e+00

Pos logit contributions
 Majesty+1.135
 majesty+1.078
 former+1.036
Ta+1.035
agna+1.026
Neg logit contributions
.-0.384
,-0.331
 and-0.322
-0.234
 to-0.227
Token Ell
Token ID7122
Feature activation+2.14e+01

Pos logit contributions
.+0.053
,+0.050
 and+0.049
 to+0.040
+0.040
Neg logit contributions
 Majesty-0.054
Ta-0.052
 sidelines-0.051
 majesty-0.051
umm-0.050
Tokena
Token ID64
Feature activation+2.89e+01

Pos logit contributions
 Majesty+0.644
Ta+0.610
 majesty+0.597
 former+0.585
bek+0.584
Neg logit contributions
.-1.054
,-1.028
 and-0.982
 to-0.886
-0.847
Token.
Token ID13
Feature activation-4.67e+01

Pos logit contributions
 Majesty+2.006
 majesty+1.928
 former+1.811
agna+1.791
Ta+1.761
Neg logit contributions
.-0.083
,-0.061
 and-0.057
 to+0.006
 with+0.088
Token Ell
Token ID7122
Feature activation+2.30e+01

Pos logit contributions
.+1.594
,+1.463
 and+1.425
 to+1.149
 with+1.093
Neg logit contributions
 Majesty-2.309
Ta-2.266
umm-2.244
 majesty-2.185
agna-2.174
Tokena
Token ID64
Feature activation+2.92e+01

Pos logit contributions
 Majesty+0.714
Ta+0.711
 majesty+0.667
 former+0.661
umm+0.655
Neg logit contributions
.-1.161
,-1.112
 and-1.067
 to-0.958
-0.924
Token was
Token ID373
Feature activation-1.70e+01

Pos logit contributions
 Majesty+1.448
Ta+1.392
 majesty+1.381
 former+1.361
agna+1.335
Neg logit contributions
.-0.810
,-0.697
 and-0.639
 to-0.579
-0.526
Token very
Token ID845
Feature activation-1.44e+01

Pos logit contributions
.+0.503
,+0.447
 and+0.430
 to+0.323
+0.322
Neg logit contributions
 Majesty-0.918
Ta-0.900
umm-0.874
 former-0.873
 sidelines-0.871
Token excited
Token ID6568
Feature activation+6.38e+00

Pos logit contributions
.+0.607
,+0.570
 and+0.549
 to+0.461
+0.459
Neg logit contributions
Ta-0.610
 Majesty-0.607
umm-0.597
agna-0.583
 sidelines-0.581
Token�
Token ID241
Feature activation+1.86e+01

Pos logit contributions
 Majesty+0.515
Ta+0.502
 sidelines+0.473
umm+0.463
 majesty+0.453
Neg logit contributions
.-1.230
,-1.180
 and-1.149
 to-1.037
 with-1.009
TokenT
Token ID51
Feature activation+1.72e+01

Pos logit contributions
Ta+0.975
 Majesty+0.944
umm+0.941
 sidelines+0.939
 former+0.922
Neg logit contributions
.-0.516
,-0.494
 and-0.444
 to-0.340
 with-0.314
TokenIM
Token ID3955
Feature activation+2.74e+01

Pos logit contributions
 Majesty+0.430
umm+0.427
Ta+0.424
 former+0.398
agna+0.388
Neg logit contributions
.-0.972
,-0.942
 and-0.907
 to-0.822
-0.806
Tokenâ
Token ID22940
Feature activation+2.03e+01

Pos logit contributions
 Majesty+1.225
Ta+1.212
 sidelines+1.206
umm+1.187
agna+1.173
Neg logit contributions
.-0.908
,-0.831
 and-0.822
 to-0.680
-0.660
Token€
Token ID26391
Feature activation+2.72e+01

Pos logit contributions
Ta+0.365
 Majesty+0.364
umm+0.343
 sidelines+0.326
agna+0.323
Neg logit contributions
.-1.335
,-1.280
 and-1.257
 to-1.134
-1.125
Token.
Token ID13
Feature activation-4.66e+01

Pos logit contributions
 Majesty+1.332
 majesty+1.277
agna+1.247
Ta+1.243
umm+1.214
Neg logit contributions
.-0.866
 and-0.784
,-0.781
 to-0.636
 with-0.601
Token It
Token ID632
Feature activation+7.91e+00

Pos logit contributions
.+2.067
,+1.941
 and+1.886
 to+1.602
+1.579
Neg logit contributions
 Majesty-1.825
Ta-1.817
umm-1.769
 sidelines-1.731
agna-1.729
Token looked
Token ID3114
Feature activation+7.54e+00

Pos logit contributions
 Majesty+0.424
Ta+0.422
umm+0.413
agna+0.407
 sidelines+0.406
Neg logit contributions
.-0.233
,-0.212
 and-0.202
 to-0.155
-0.152
Token nice
Token ID3621
Feature activation+8.27e+00

Pos logit contributions
 Majesty+0.347
umm+0.337
 former+0.335
 sidelines+0.333
Ta+0.330
Neg logit contributions
.-0.230
,-0.207
 and-0.197
-0.175
 to-0.159
Token and
Token ID290
Feature activation-2.83e+01

Pos logit contributions
umm+0.518
 Majesty+0.517
 majesty+0.500
isher+0.492
 former+0.491
Neg logit contributions
.-0.041
,-0.027
-0.021
 and-0.003
 the+0.009
Token he
Token ID339
Feature activation-5.90e+00

Pos logit contributions
.+1.058
,+0.982
 and+0.949
 to+0.775
+0.765
Neg logit contributions
Ta-1.318
 Majesty-1.318
umm-1.288
agna-1.266
 sidelines-1.263
Token sho
Token ID7354
Feature activation+3.27e+01

Pos logit contributions
,+0.104
.+0.102
 and+0.091
 to+0.058
 with+0.054
Neg logit contributions
Ta-0.312
 Majesty-0.305
icious-0.304
agna-0.299
 majesty-0.298
Tokenel
Token ID417
Feature activation+2.62e+01

Pos logit contributions
Ta+1.020
 Majesty+1.008
isher+0.981
umm+0.980
 sidelines+0.945
Neg logit contributions
.-1.523
,-1.446
 and-1.405
 to-1.238
-1.222
Tokenace
Token ID558
Feature activation+3.53e+01

Pos logit contributions
 Majesty+1.007
 majesty+0.953
Ta+0.948
 pulp+0.932
 former+0.927
Neg logit contributions
.-1.054
,-1.007
 and-0.960
-0.874
 to-0.814
Token came
Token ID1625
Feature activation-1.64e+00

Pos logit contributions
 Majesty+2.228
 majesty+2.134
 pulp+2.113
agna+2.090
 sidelines+2.085
Neg logit contributions
.-0.475
,-0.404
 and-0.361
-0.253
 to-0.197
Token undone
Token ID45171
Feature activation+3.38e+01

Pos logit contributions
.+0.035
,+0.034
 and+0.029
+0.022
 with+0.018
Neg logit contributions
 Majesty-0.087
 former-0.085
 sidelines-0.084
Ta-0.083
 majesty-0.082
Token.
Token ID13
Feature activation-4.66e+01

Pos logit contributions
 Majesty+2.105
 sidelines+2.071
umm+2.063
 former+2.054
isher+2.050
Neg logit contributions
.-0.108
,-0.087
-0.039
 and+0.027
 the+0.079
Token She
Token ID1375
Feature activation-4.52e+00

Pos logit contributions
.+1.485
,+1.363
 and+1.313
 to+1.025
 with+0.997
Neg logit contributions
 Majesty-2.353
Ta-2.302
 sidelines-2.243
umm-2.241
 majesty-2.234
Token stopped
Token ID5025
Feature activation+6.13e+00

Pos logit contributions
.+0.157
,+0.143
 and+0.136
 to+0.111
+0.111
Neg logit contributions
 Majesty-0.217
Ta-0.216
 sidelines-0.206
 majesty-0.205
agna-0.205
Token to
Token ID284
Feature activation-2.40e+01

Pos logit contributions
 sidelines+0.263
 Majesty+0.254
 former+0.246
 pulp+0.236
Ta+0.235
Neg logit contributions
.-0.202
,-0.188
 and-0.166
-0.149
 with-0.135
Token tie
Token ID9839
Feature activation+2.88e+01

Pos logit contributions
.+1.025
,+0.963
 and+0.933
 to+0.785
+0.781
Neg logit contributions
 Majesty-0.975
Ta-0.968
 sidelines-0.934
umm-0.922
agna-0.915
Token it
Token ID340
Feature activation-2.06e+00

Pos logit contributions
Ta+1.544
 sidelines+1.530
 Majesty+1.508
 former+1.481
isher+1.477
Neg logit contributions
.-0.603
,-0.529
 and-0.490
-0.365
 to-0.325
TokenBen
Token ID11696
Feature activation+2.89e+01

Pos logit contributions
Ta+0.209
 Majesty+0.206
 majesty+0.196
umm+0.195
 pulp+0.195
Neg logit contributions
.-0.335
,-0.317
 and-0.308
-0.280
 to-0.274
Token tries
Token ID8404
Feature activation+2.13e+01

Pos logit contributions
 Majesty+1.702
 majesty+1.644
Ta+1.607
wy+1.580
 pulp+1.556
Neg logit contributions
.-0.471
,-0.408
 and-0.339
 to-0.268
 with-0.244
Token to
Token ID284
Feature activation-2.51e+01

Pos logit contributions
 sidelines+1.255
 Majesty+1.246
 former+1.201
 majesty+1.194
 pulp+1.193
Neg logit contributions
.-0.236
,-0.196
 and-0.171
-0.120
 with-0.040
Token blow
Token ID6611
Feature activation+2.28e+01

Pos logit contributions
.+1.031
,+0.962
 and+0.934
 to+0.779
+0.773
Neg logit contributions
 Majesty-1.067
Ta-1.065
umm-1.037
 sidelines-1.018
agna-1.017
Token bubbles
Token ID25037
Feature activation+3.21e+01

Pos logit contributions
isher+1.198
 Majesty+1.194
Ta+1.187
 sidelines+1.182
 pulp+1.177
Neg logit contributions
.-0.422
,-0.399
 and-0.353
-0.290
 to-0.277
Token.
Token ID13
Feature activation-4.66e+01

Pos logit contributions
 sidelines+1.850
wy+1.845
mic+1.805
isher+1.799
 pulp+1.796
Neg logit contributions
.-0.254
,-0.209
-0.202
 and-0.150
 to-0.059
Token At
Token ID1629
Feature activation+8.30e+00

Pos logit contributions
.+1.552
,+1.432
 and+1.385
 to+1.101
 with+1.053
Neg logit contributions
 Majesty-2.308
Ta-2.279
umm-2.223
 sidelines-2.212
 majesty-2.202
Token first
Token ID717
Feature activation-2.93e-01

Pos logit contributions
Ta+0.433
 sidelines+0.432
 Majesty+0.426
umm+0.416
agna+0.407
Neg logit contributions
.-0.243
,-0.216
 and-0.211
 to-0.158
-0.158
Token,
Token ID11
Feature activation-3.53e+01

Pos logit contributions
.+0.002
 and+0.001
,+0.000
-0.000
 to-0.000
Neg logit contributions
 Majesty-0.020
 majesty-0.019
 sidelines-0.019
umm-0.019
 pulp-0.019
Token he
Token ID339
Feature activation-6.45e+00

Pos logit contributions
.+1.020
,+0.908
 and+0.876
 to+0.657
+0.654
Neg logit contributions
 Majesty-1.885
Ta-1.848
agna-1.830
 sidelines-1.817
 majesty-1.802
Token blows
Token ID20385
Feature activation+3.12e+01

Pos logit contributions
.+0.300
,+0.283
 and+0.275
 to+0.235
+0.234
Neg logit contributions
 Majesty-0.239
Ta-0.239
umm-0.232
 sidelines-0.226
agna-0.226
Tokenila
Token ID10102
Feature activation+3.17e+01

Pos logit contributions
Ta+0.370
 Majesty+0.369
umm+0.357
 sidelines+0.349
agna+0.348
Neg logit contributions
.-0.459
,-0.433
 and-0.420
 to-0.361
-0.357
Token thought
Token ID1807
Feature activation+3.75e+00

Pos logit contributions
 Majesty+1.782
 majesty+1.739
Ta+1.731
wy+1.622
atmeal+1.612
Neg logit contributions
.-0.606
,-0.545
 and-0.470
 to-0.431
 with-0.357
Token for
Token ID329
Feature activation-1.66e+01

Pos logit contributions
 Majesty+0.200
 majesty+0.190
Ta+0.187
 sidelines+0.183
 former+0.183
Neg logit contributions
.-0.095
,-0.084
 and-0.083
-0.069
 with-0.063
Token a
Token ID257
Feature activation-2.37e+01

Pos logit contributions
.+0.302
,+0.256
 and+0.236
 to+0.134
+0.129
Neg logit contributions
Ta-1.097
 Majesty-1.097
umm-1.082
agna-1.065
 sidelines-1.065
Token moment
Token ID2589
Feature activation+1.61e+01

Pos logit contributions
.+1.058
,+0.993
 and+0.966
 to+0.818
+0.813
Neg logit contributions
 Majesty-0.924
Ta-0.922
umm-0.893
 sidelines-0.877
agna-0.876
Token.
Token ID13
Feature activation-4.66e+01

Pos logit contributions
 Majesty+0.913
pron+0.899
agna+0.868
 majesty+0.858
 former+0.856
Neg logit contributions
.-0.092
,-0.036
-0.034
 the+0.003
 and+0.004
Token She
Token ID1375
Feature activation-4.69e+00

Pos logit contributions
.+1.510
,+1.387
 and+1.341
 to+1.087
 with+1.036
Neg logit contributions
 Majesty-2.350
Ta-2.219
umm-2.202
 majesty-2.200
 sidelines-2.164
Token liked
Token ID8288
Feature activation+1.20e+00

Pos logit contributions
.+0.171
,+0.158
 and+0.151
 to+0.125
+0.123
Neg logit contributions
Ta-0.218
 Majesty-0.217
 majesty-0.206
agna-0.206
umm-0.205
Token Sam
Token ID3409
Feature activation+1.15e+01

Pos logit contributions
 Majesty+0.069
Ta+0.068
umm+0.067
 sidelines+0.067
 majesty+0.067
Neg logit contributions
.-0.030
,-0.026
 and-0.025
-0.018
 with-0.017
Token's
Token ID338
Feature activation-1.63e+01

Pos logit contributions
 Majesty+0.683
 majesty+0.654
 former+0.642
Ta+0.641
wy+0.641
Neg logit contributions
.-0.159
,-0.136
 and-0.118
-0.098
 to-0.073
Token idea
Token ID2126
Feature activation+1.94e+01

Pos logit contributions
.+0.703
,+0.666
 and+0.643
+0.563
 with+0.550
Neg logit contributions
 Majesty-0.611
Ta-0.585
 former-0.576
umm-0.575
 majesty-0.574
Token old
Token ID1468
Feature activation+9.95e+00

Pos logit contributions
umm+0.566
isher+0.431
icious+0.429
agna+0.423
 pulp+0.419
Neg logit contributions
.-1.018
,-0.984
 and-0.969
-0.901
 to-0.866
Token c
Token ID269
Feature activation+1.59e+01

Pos logit contributions
 Majesty+0.487
Ta+0.462
 majesty+0.462
umm+0.461
 former+0.461
Neg logit contributions
.-0.312
,-0.288
 and-0.271
-0.228
 to-0.218
Tokenudd
Token ID4185
Feature activation+3.33e+01

Pos logit contributions
 Majesty+0.325
Ta+0.307
umm+0.306
 former+0.288
 majesty+0.287
Neg logit contributions
.-0.998
,-0.964
 and-0.937
-0.849
 to-0.844
Tokenly
Token ID306
Feature activation+2.09e+01

Pos logit contributions
umm+0.920
Ta+0.856
isher+0.852
 Majesty+0.849
 sidelines+0.811
Neg logit contributions
.-1.729
,-1.674
 and-1.596
-1.503
 to-1.471
Token toy
Token ID13373
Feature activation+1.66e+01

Pos logit contributions
 Majesty+1.224
umm+1.192
Ta+1.188
 sidelines+1.186
 former+1.186
Neg logit contributions
.-0.426
,-0.372
 and-0.343
-0.273
 to-0.232
Token.
Token ID13
Feature activation-4.66e+01

Pos logit contributions
 Majesty+1.015
 majesty+0.982
umm+0.979
Ta+0.973
 former+0.969
Neg logit contributions
.-0.282
,-0.243
 and-0.230
-0.163
 with-0.131
Token She
Token ID1375
Feature activation-4.06e+00

Pos logit contributions
.+1.555
,+1.433
 and+1.370
 to+1.107
 with+1.058
Neg logit contributions
 Majesty-2.311
Ta-2.273
umm-2.222
 sidelines-2.203
 former-2.190
Token picked
Token ID6497
Feature activation+1.19e+01

Pos logit contributions
.+0.149
,+0.138
 and+0.132
 to+0.109
+0.108
Neg logit contributions
 Majesty-0.189
Ta-0.187
 majesty-0.179
umm-0.179
agna-0.179
Token it
Token ID340
Feature activation-2.50e+00

Pos logit contributions
 Majesty+0.658
Ta+0.654
agna+0.624
 sidelines+0.622
umm+0.621
Neg logit contributions
.-0.296
,-0.263
 and-0.241
-0.177
 to-0.171
Token up
Token ID510
Feature activation+4.56e+00

Pos logit contributions
.+0.084
,+0.073
 and+0.068
+0.066
 to+0.055
Neg logit contributions
isher-0.098
 Majesty-0.097
Ta-0.096
 former-0.096
agna-0.094
Token and
Token ID290
Feature activation-2.86e+01

Pos logit contributions
 former+0.284
isher+0.270
bek+0.270
 Majesty+0.268
 pulp+0.267
Neg logit contributions
.-0.034
,-0.018
-0.012
 and-0.003
 to+0.011
Token curious
Token ID11040
Feature activation+1.01e+01

Pos logit contributions
.+1.268
,+1.162
 and+1.113
 to+0.880
 with+0.870
Neg logit contributions
 Majesty-1.865
Ta-1.855
umm-1.814
 sidelines-1.796
agna-1.775
Token little
Token ID1310
Feature activation-7.40e+00

Pos logit contributions
 Majesty+0.644
umm+0.617
 sidelines+0.614
 majesty+0.610
 pulp+0.607
Neg logit contributions
.-0.109
,-0.090
 and-0.075
-0.049
 with-0.037
Token girl
Token ID2576
Feature activation+1.97e+01

Pos logit contributions
.+0.338
,+0.318
 and+0.311
+0.273
 to+0.267
Neg logit contributions
 Majesty-0.265
umm-0.251
Ta-0.249
 majesty-0.245
 sidelines-0.245
Token named
Token ID3706
Feature activation-1.31e+00

Pos logit contributions
 Majesty+1.181
 majesty+1.117
 former+1.084
agna+1.070
 pulp+1.061
Neg logit contributions
.-0.289
,-0.249
 and-0.239
-0.164
 with-0.149
Token Mary
Token ID5335
Feature activation+2.17e+01

Pos logit contributions
.+0.056
,+0.053
 and+0.052
 with+0.044
 to+0.043
Neg logit contributions
 Majesty-0.050
Ta-0.047
 sidelines-0.047
 majesty-0.046
agna-0.046
Token.
Token ID13
Feature activation-4.66e+01

Pos logit contributions
 Majesty+1.434
 majesty+1.365
 former+1.307
agna+1.289
 groom+1.289
Neg logit contributions
,-0.051
.-0.040
 and-0.025
 to+0.049
 with+0.050
Token Every
Token ID3887
Feature activation-2.91e+00

Pos logit contributions
.+1.598
,+1.469
 and+1.414
 to+1.149
 with+1.129
Neg logit contributions
 Majesty-2.292
Ta-2.213
umm-2.188
 majesty-2.158
 former-2.142
Token day
Token ID1110
Feature activation+8.50e+00

Pos logit contributions
.+0.117
,+0.109
 and+0.106
 with+0.090
+0.089
Neg logit contributions
 Majesty-0.118
Ta-0.118
umm-0.115
 sidelines-0.113
 former-0.112
Token Mary
Token ID5335
Feature activation+2.30e+01

Pos logit contributions
 Majesty+0.475
umm+0.462
 majesty+0.458
Ta+0.453
agna+0.441
Neg logit contributions
.-0.172
 and-0.139
,-0.123
-0.099
 with-0.087
Token would
Token ID561
Feature activation+2.37e-02

Pos logit contributions
 Majesty+1.030
Ta+0.995
 majesty+0.977
 former+0.976
agna+0.964
Neg logit contributions
.-0.798
,-0.736
 and-0.680
-0.590
 with-0.586
Token explore
Token ID7301
Feature activation+2.58e+01

Pos logit contributions
 Majesty+0.001
Ta+0.001
 sidelines+0.001
umm+0.001
 former+0.001
Neg logit contributions
.-0.001
,-0.001
 and-0.001
-0.001
 to-0.001
Token child
Token ID1200
Feature activation+2.52e+01

Pos logit contributions
umm+0.092
 Majesty+0.091
 sidelines+0.088
agna+0.088
Ta+0.087
Neg logit contributions
.-0.146
,-0.140
 and-0.135
-0.123
 with-0.119
Token nodded
Token ID14464
Feature activation+1.78e+01

Pos logit contributions
 Majesty+1.335
 majesty+1.268
Ta+1.262
 former+1.229
 pulp+1.229
Neg logit contributions
.-0.586
,-0.532
 and-0.502
-0.394
 to-0.385
Token his
Token ID465
Feature activation-1.06e+01

Pos logit contributions
 Majesty+1.067
 sidelines+1.026
 fascination+1.013
 former+1.010
 pulp+1.004
Neg logit contributions
.-0.130
,-0.081
-0.055
 and-0.043
 to+0.012
Token head
Token ID1182
Feature activation+2.55e+01

Pos logit contributions
.+0.551
,+0.521
 and+0.508
 to+0.448
+0.446
Neg logit contributions
 Majesty-0.335
Ta-0.312
 majesty-0.308
umm-0.304
 sidelines-0.300
Token eagerly
Token ID30130
Feature activation+1.25e+01

Pos logit contributions
 Majesty+1.538
 reins+1.514
 former+1.502
 sidelines+1.497
mic+1.493
Neg logit contributions
.-0.137
,-0.071
-0.036
 and-0.024
 the+0.021
Token.
Token ID13
Feature activation-4.66e+01

Pos logit contributions
 Majesty+0.806
 reins+0.774
 sidelines+0.770
umm+0.765
 majesty+0.763
Neg logit contributions
.-0.050
,-0.021
-0.005
 and-0.004
 the+0.031
Token He
Token ID679
Feature activation-2.71e+00

Pos logit contributions
.+1.325
,+1.204
 and+1.145
 to+0.864
 with+0.856
Neg logit contributions
 Majesty-2.499
Ta-2.423
umm-2.389
 sidelines-2.378
 majesty-2.370
Token was
Token ID373
Feature activation-1.64e+01

Pos logit contributions
.+0.102
,+0.094
 and+0.089
 to+0.074
+0.074
Neg logit contributions
 Majesty-0.120
Ta-0.119
 majesty-0.113
 former-0.113
agna-0.113
Token so
Token ID523
Feature activation-2.00e+01

Pos logit contributions
.+0.624
,+0.576
 and+0.555
+0.456
 to+0.454
Neg logit contributions
 Majesty-0.737
Ta-0.714
 sidelines-0.695
 majesty-0.690
umm-0.690
Token happy
Token ID3772
Feature activation+4.80e+00

Pos logit contributions
.+0.811
,+0.754
 and+0.736
 to+0.610
+0.599
Neg logit contributions
 Majesty-0.852
Ta-0.846
umm-0.816
 sidelines-0.810
 majesty-0.806
Token to
Token ID284
Feature activation-2.34e+01

Pos logit contributions
 Majesty+0.272
umm+0.265
 sidelines+0.262
 majesty+0.260
fer+0.253
Neg logit contributions
.-0.056
,-0.046
 and-0.036
-0.036
 the-0.010
Token clever
Token ID14169
Feature activation+1.43e+01

Pos logit contributions
.+0.179
,+0.166
 and+0.158
+0.124
 with+0.121
Neg logit contributions
 Majesty-0.292
Ta-0.290
umm-0.281
 sidelines-0.279
 majesty-0.278
Tokenly
Token ID306
Feature activation+2.12e+01

Pos logit contributions
 Majesty+0.723
umm+0.697
 majesty+0.689
 sidelines+0.684
Ta+0.678
Neg logit contributions
.-0.431
,-0.390
 and-0.373
-0.328
 with-0.295
Token avoided
Token ID13941
Feature activation+2.58e+01

Pos logit contributions
 Majesty+1.137
Ta+1.097
 sidelines+1.088
 majesty+1.088
umm+1.076
Neg logit contributions
.-0.573
,-0.519
 and-0.488
-0.400
 to-0.366
Token getting
Token ID1972
Feature activation+4.06e+00

Pos logit contributions
 Majesty+1.508
 sidelines+1.469
 former+1.458
umm+1.426
 majesty+1.422
Neg logit contributions
.-0.500
,-0.453
 and-0.416
-0.314
 to-0.257
Token caught
Token ID4978
Feature activation+1.42e+01

Pos logit contributions
 Majesty+0.194
Ta+0.190
 sidelines+0.188
umm+0.186
 former+0.186
Neg logit contributions
.-0.133
,-0.124
 and-0.119
-0.098
 with-0.090
Token.
Token ID13
Feature activation-4.66e+01

Pos logit contributions
 Majesty+0.813
umm+0.800
 sidelines+0.788
Ta+0.785
 former+0.784
Neg logit contributions
.-0.288
,-0.263
 and-0.235
-0.187
 to-0.170
Token He
Token ID679
Feature activation-3.04e+00

Pos logit contributions
.+1.504
,+1.387
 and+1.325
 to+1.051
 with+1.019
Neg logit contributions
 Majesty-2.369
Ta-2.327
umm-2.289
 sidelines-2.259
 former-2.255
Token told
Token ID1297
Feature activation-5.34e-01

Pos logit contributions
.+0.110
,+0.101
 and+0.097
 to+0.079
+0.079
Neg logit contributions
 Majesty-0.144
Ta-0.143
umm-0.139
 sidelines-0.137
agna-0.137
Token all
Token ID477
Feature activation-4.40e+00

Pos logit contributions
.+0.015
,+0.013
 and+0.013
+0.010
 to+0.009
Neg logit contributions
 Majesty-0.029
Ta-0.028
 sidelines-0.028
 majesty-0.028
umm-0.028
Token his
Token ID465
Feature activation-7.83e+00

Pos logit contributions
.+0.102
,+0.088
 and+0.081
+0.063
 with+0.059
Neg logit contributions
 Majesty-0.238
 sidelines-0.233
Ta-0.231
 majesty-0.230
 former-0.226
Token mouse
Token ID10211
Feature activation+2.79e+01

Pos logit contributions
.+0.317
,+0.296
 and+0.288
 to+0.240
+0.235
Neg logit contributions
Ta-0.345
 Majesty-0.339
umm-0.336
agna-0.329
 sidelines-0.329
Token a
Token ID257
Feature activation-2.25e+01

Pos logit contributions
 Majesty+1.042
 former+1.001
Ta+0.988
 sidelines+0.987
 majesty+0.970
Neg logit contributions
.-0.323
,-0.296
 and-0.281
-0.177
 to-0.177
Token little
Token ID1310
Feature activation-5.83e+00

Pos logit contributions
.+0.906
,+0.858
 and+0.823
 to+0.681
+0.678
Neg logit contributions
Ta-0.999
 Majesty-0.980
umm-0.979
agna-0.945
 sidelines-0.936
Token girl
Token ID2576
Feature activation+2.04e+01

Pos logit contributions
.+0.275
,+0.259
 and+0.252
 to+0.216
+0.214
Neg logit contributions
 Majesty-0.213
Ta-0.213
umm-0.207
 sidelines-0.202
agna-0.202
Token named
Token ID3706
Feature activation-1.92e-01

Pos logit contributions
 Majesty+0.769
 former+0.711
 majesty+0.700
Ta+0.692
 sidelines+0.682
Neg logit contributions
.-0.718
,-0.680
 and-0.670
-0.577
 to-0.573
Token Lily
Token ID20037
Feature activation+1.01e+00

Pos logit contributions
.+0.007
,+0.007
 and+0.007
 to+0.006
 with+0.005
Neg logit contributions
 Majesty-0.008
Ta-0.008
 sidelines-0.008
 majesty-0.008
umm-0.008
Token.
Token ID13
Feature activation-4.65e+01

Pos logit contributions
 Majesty+0.057
 former+0.054
 majesty+0.053
ppa+0.053
bek+0.052
Neg logit contributions
,-0.010
.-0.009
 and-0.008
 to-0.005
 with-0.005
Token Lily
Token ID20037
Feature activation+3.04e+00

Pos logit contributions
.+1.506
,+1.378
 and+1.334
 to+1.079
 with+1.018
Neg logit contributions
 Majesty-2.340
Ta-2.290
umm-2.268
 sidelines-2.237
 former-2.226
Token said
Token ID531
Feature activation-7.82e+00

Pos logit contributions
 Majesty+0.131
Ta+0.123
 majesty+0.122
 former+0.120
agna+0.115
Neg logit contributions
.-0.098
,-0.084
 and-0.077
 to-0.075
 with-0.068
Token she
Token ID673
Feature activation-5.94e+00

Pos logit contributions
.+0.087
 and+0.054
,+0.031
+0.011
 with-0.001
Neg logit contributions
 Majesty-0.549
 majesty-0.522
Ta-0.512
 pulp-0.509
 sidelines-0.506
Token made
Token ID925
Feature activation-3.40e+00

Pos logit contributions
.+0.196
,+0.180
 and+0.172
 to+0.136
+0.135
Neg logit contributions
 Majesty-0.299
Ta-0.297
umm-0.290
agna-0.287
 majesty-0.284
Token the
Token ID262
Feature activation-2.36e+01

Pos logit contributions
.+0.070
,+0.067
 and+0.060
+0.041
 to+0.036
Neg logit contributions
 Majesty-0.192
Ta-0.189
 former-0.186
 sidelines-0.184
 majesty-0.182
Token remember
Token ID3505
Feature activation+1.83e+01

Pos logit contributions
.+1.340
,+1.255
 and+1.225
 to+0.993
 with+0.968
Neg logit contributions
 Majesty-1.717
Ta-1.677
umm-1.676
 sidelines-1.623
agna-1.619
Token their
Token ID511
Feature activation-7.29e+00

Pos logit contributions
 Majesty+1.032
 former+0.987
 majesty+0.984
 sidelines+0.947
umm+0.932
Neg logit contributions
.-0.354
,-0.315
 and-0.312
-0.233
 with-0.200
Token j
Token ID474
Feature activation+3.86e+00

Pos logit contributions
.+0.354
,+0.334
 and+0.326
 to+0.281
+0.279
Neg logit contributions
Ta-0.256
 Majesty-0.255
umm-0.248
 sidelines-0.242
agna-0.242
Tokenolly
Token ID5098
Feature activation+3.38e+01

Pos logit contributions
 Majesty+0.102
umm+0.098
Ta+0.098
 sidelines+0.094
 majesty+0.094
Neg logit contributions
.-0.219
,-0.209
 and-0.205
-0.183
 to-0.182
Token day
Token ID1110
Feature activation+8.19e+00

Pos logit contributions
 Majesty+2.082
 majesty+2.007
umm+1.964
Ta+1.963
 sidelines+1.953
Neg logit contributions
.-0.600
,-0.522
 and-0.471
-0.333
 to-0.298
Token.
Token ID13
Feature activation-4.65e+01

Pos logit contributions
 Majesty+0.528
 sidelines+0.515
 majesty+0.507
agna+0.497
umm+0.497
Neg logit contributions
.-0.092
,-0.080
 and-0.076
-0.038
 to-0.028
Token<|endoftext|>
Token ID50256
Feature activation-3.73e+00

Pos logit contributions
.+1.256
,+1.142
 and+1.086
 to+0.822
 with+0.777
Neg logit contributions
 Majesty-2.607
Ta-2.555
umm-2.505
 majesty-2.496
 former-2.473
TokenOnce
Token ID7454
Feature activation-2.12e+00

Pos logit contributions
.+0.085
,+0.084
 and+0.079
 to+0.065
 that+0.057
Neg logit contributions
Ta-0.180
Whe-0.166
rob-0.166
 Majesty-0.163
umm-0.161
Token upon
Token ID2402
Feature activation-4.55e-02

Pos logit contributions
.+0.098
 and+0.091
,+0.089
 with+0.080
 to+0.077
Neg logit contributions
 Majesty-0.063
 majesty-0.060
Ta-0.058
ubb-0.055
 pulp-0.055
Token a
Token ID257
Feature activation-2.14e+01

Pos logit contributions
.+0.001
,+0.001
 and+0.001
 to+0.000
 with+0.000
Neg logit contributions
Ta-0.003
 Majesty-0.003
 former-0.003
 majesty-0.003
 sidelines-0.003
Token time
Token ID640
Feature activation-4.04e+00

Pos logit contributions
.+0.745
,+0.678
 and+0.652
 to+0.542
 with+0.527
Neg logit contributions
 Majesty-1.026
Ta-1.020
umm-0.993
 majesty-0.980
agna-0.972
Token<|endoftext|>
Token ID50256
Feature activation-3.40e+00

Pos logit contributions
Ta+0.160
 Majesty+0.159
umm+0.155
agna+0.153
 sidelines+0.153
Neg logit contributions
.-0.100
,-0.092
 and-0.087
 to-0.069
-0.068
TokenAnna
Token ID31160
Feature activation+2.98e+01

Pos logit contributions
.+0.091
,+0.087
 and+0.083
 to+0.068
 with+0.057
Neg logit contributions
Ta-0.171
 Majesty-0.158
umm-0.156
 sidelines-0.155
Whe-0.153
Token liked
Token ID8288
Feature activation-9.78e-01

Pos logit contributions
 Majesty+1.819
 majesty+1.796
Ta+1.762
 pulp+1.714
acket+1.698
Neg logit contributions
.-0.223
,-0.150
 to-0.061
 and-0.008
-0.004
Token to
Token ID284
Feature activation-2.60e+01

Pos logit contributions
.+0.014
,+0.012
 and+0.011
+0.005
 with+0.004
Neg logit contributions
 Majesty-0.065
Ta-0.064
 sidelines-0.063
 majesty-0.063
umm-0.063
Token dance
Token ID9280
Feature activation+2.82e+01

Pos logit contributions
.+1.113
,+1.043
 and+1.013
 to+0.854
+0.850
Neg logit contributions
 Majesty-1.053
Ta-1.049
umm-1.014
 sidelines-1.000
agna-0.996
Token.
Token ID13
Feature activation-4.65e+01

Pos logit contributions
 Majesty+1.755
 former+1.727
 majesty+1.706
 sidelines+1.695
pron+1.679
Neg logit contributions
.-0.157
,-0.097
 and-0.064
-0.053
 to+0.053
Token She
Token ID1375
Feature activation-4.35e+00

Pos logit contributions
.+1.439
,+1.315
 and+1.258
 to+0.972
 with+0.943
Neg logit contributions
 Majesty-2.435
Ta-2.418
umm-2.395
 majesty-2.327
agna-2.326
Token danced
Token ID39480
Feature activation+2.87e+01

Pos logit contributions
.+0.179
,+0.166
 and+0.158
 to+0.136
+0.135
Neg logit contributions
Ta-0.175
 Majesty-0.174
 majesty-0.167
 sidelines-0.161
agna-0.160
Token every
Token ID790
Feature activation-1.47e+01

Pos logit contributions
Ta+1.554
 Majesty+1.537
pron+1.497
 sidelines+1.494
mic+1.490
Neg logit contributions
.-0.398
,-0.279
-0.199
 and-0.184
 to-0.056
Token day
Token ID1110
Feature activation+8.91e+00

Pos logit contributions
.+0.559
,+0.520
 and+0.503
+0.411
 to+0.409
Neg logit contributions
Ta-0.667
 Majesty-0.665
umm-0.648
 sidelines-0.640
agna-0.631
Token in
Token ID287
Feature activation-1.57e+01

Pos logit contributions
 Majesty+0.528
 sidelines+0.528
mic+0.523
umm+0.506
 former+0.506
Neg logit contributions
.-0.064
,-0.043
-0.036
 and-0.028
 the+0.006
Token bounce
Token ID20110
Feature activation+3.44e+01

Pos logit contributions
.+1.151
,+1.084
 and+1.056
 to+0.906
+0.902
Neg logit contributions
 Majesty-0.888
Ta-0.886
umm-0.856
 sidelines-0.844
agna-0.839
Token and
Token ID290
Feature activation-2.43e+01

Pos logit contributions
 seventeen+1.708
 pulp+1.678
 spoiler+1.653
Goal+1.643
 sidelines+1.637
Neg logit contributions
.-0.436
,-0.385
-0.310
 and-0.308
 to-0.174
Token turn
Token ID1210
Feature activation+2.69e+01

Pos logit contributions
.+1.146
,+1.078
 and+1.053
+0.908
 to+0.903
Neg logit contributions
Ta-0.863
 Majesty-0.862
 sidelines-0.830
umm-0.819
agna-0.814
Token in
Token ID287
Feature activation-1.06e+01

Pos logit contributions
 sidelines+1.403
 pulp+1.393
isher+1.369
mic+1.363
ubb+1.362
Neg logit contributions
,-0.289
.-0.288
-0.239
 and-0.235
 with-0.111
Token the
Token ID262
Feature activation-1.98e+01

Pos logit contributions
.+0.093
,+0.092
 and+0.077
 to+0.052
+0.046
Neg logit contributions
 former-0.587
 sidelines-0.583
 avail-0.576
 Majesty-0.564
 spoiler-0.561
Token maze
Token ID31237
Feature activation+4.59e+01

Pos logit contributions
.+1.162
,+1.107
 and+1.084
 to+0.961
+0.957
Neg logit contributions
 Majesty-0.503
Ta-0.501
umm-0.479
 sidelines-0.463
agna-0.463
Token.
Token ID13
Feature activation-4.00e+01

Pos logit contributions
 Majesty+2.881
umm+2.859
 sidelines+2.835
 majesty+2.801
vation+2.780
Neg logit contributions
,-0.066
 and-0.038
.-0.007
+0.046
 to+0.191
Token They
Token ID1119
Feature activation+1.55e+00

Pos logit contributions
.+1.300
,+1.195
 and+1.122
 to+0.869
 with+0.863
Neg logit contributions
Ta-1.989
 Majesty-1.963
umm-1.927
 sidelines-1.918
agna-1.908
Token cheer
Token ID14042
Feature activation+3.09e+01

Pos logit contributions
 Majesty+0.067
Ta+0.067
umm+0.065
 sidelines+0.064
agna+0.064
Neg logit contributions
.-0.062
,-0.058
 and-0.056
 to-0.046
-0.046
Token and
Token ID290
Feature activation-2.40e+01

Pos logit contributions
 Majesty+1.648
core+1.628
rated+1.608
icious+1.604
immers+1.598
Neg logit contributions
.-0.316
,-0.266
-0.202
 and-0.127
 to-0.086
Token cl
Token ID537
Feature activation+1.72e+01

Pos logit contributions
.+1.146
,+1.080
 and+1.052
 to+0.904
+0.898
Neg logit contributions
 Majesty-0.871
Ta-0.869
umm-0.841
 sidelines-0.825
agna-0.822
Token Lily
Token ID20037
Feature activation+9.90e+00

Pos logit contributions
.+0.789
,+0.687
 and+0.682
+0.502
 to+0.477
Neg logit contributions
Ta-1.499
 Majesty-1.454
 sidelines-1.454
 pulp-1.436
umm-1.435
Token decided
Token ID3066
Feature activation+1.08e+01

Pos logit contributions
 Majesty+0.469
Ta+0.456
agna+0.445
 majesty+0.438
 former+0.434
Neg logit contributions
.-0.305
,-0.258
 and-0.239
-0.210
 with-0.210
Token to
Token ID284
Feature activation-1.74e+01

Pos logit contributions
 Majesty+0.624
 sidelines+0.594
 majesty+0.589
 pulp+0.579
agna+0.578
Neg logit contributions
.-0.175
,-0.135
 and-0.126
-0.097
 with-0.060
Token choose
Token ID3853
Feature activation+2.94e+01

Pos logit contributions
.+0.696
,+0.648
 and+0.628
 to+0.523
+0.520
Neg logit contributions
 Majesty-0.750
Ta-0.743
umm-0.725
agna-0.713
 sidelines-0.712
Token the
Token ID262
Feature activation-1.77e+01

Pos logit contributions
 Majesty+1.518
 former+1.516
agna+1.458
 majesty+1.442
 pulp+1.439
Neg logit contributions
.-0.630
,-0.600
 and-0.543
-0.428
 with-0.348
Token lions
Token ID31420
Feature activation+4.53e+01

Pos logit contributions
.+0.841
,+0.793
 and+0.773
 to+0.663
+0.659
Neg logit contributions
 Majesty-0.647
Ta-0.645
umm-0.625
 sidelines-0.611
agna-0.610
Token and
Token ID290
Feature activation-2.19e+01

Pos logit contributions
 Majesty+2.932
agna+2.892
 majesty+2.830
atmeal+2.828
arians+2.819
Neg logit contributions
.-0.267
,-0.151
 and-0.098
-0.058
 to+0.089
Token gir
Token ID37370
Feature activation+4.29e+01

Pos logit contributions
.+0.977
,+0.916
 and+0.891
 to+0.752
+0.752
Neg logit contributions
 Majesty-0.855
Ta-0.846
umm-0.821
 sidelines-0.810
agna-0.807
Tokenaff
Token ID2001
Feature activation+5.06e+01

Pos logit contributions
umm+0.726
 Majesty+0.718
Ta+0.687
 sidelines+0.666
 majesty+0.648
Neg logit contributions
.-2.731
,-2.622
 and-2.573
-2.376
 with-2.345
Tokenes
Token ID274
Feature activation+2.52e+01

Pos logit contributions
 Majesty+1.542
 pulp+1.485
wy+1.475
 majesty+1.463
Ta+1.456
Neg logit contributions
.-2.260
,-2.179
 and-2.109
-1.879
 to-1.864
Token.
Token ID13
Feature activation-3.80e+01

Pos logit contributions
 Majesty+1.572
 sidelines+1.562
 former+1.534
agna+1.531
 pulp+1.529
Neg logit contributions
.-0.125
,-0.106
 and-0.059
+0.001
 with+0.072
Token and
Token ID290
Feature activation-2.58e+01

Pos logit contributions
 Majesty+1.851
 pulp+1.829
 former+1.797
umm+1.797
 majesty+1.792
Neg logit contributions
.-0.398
,-0.344
 and-0.292
-0.233
 to-0.127
Token the
Token ID262
Feature activation-2.19e+01

Pos logit contributions
.+0.904
,+0.828
 and+0.799
+0.642
 to+0.639
Neg logit contributions
 Majesty-1.251
Ta-1.237
umm-1.202
 sidelines-1.192
 majesty-1.189
Token lucky
Token ID9670
Feature activation+1.66e+01

Pos logit contributions
.+1.051
,+0.981
 and+0.965
 to+0.827
 with+0.812
Neg logit contributions
Ta-0.810
umm-0.786
 Majesty-0.784
agna-0.770
 sidelines-0.756
Token four
Token ID1440
Feature activation+1.98e+01

Pos logit contributions
 Majesty+0.560
umm+0.545
Ta+0.534
 majesty+0.522
 pulp+0.513
Neg logit contributions
.-0.800
,-0.758
 and-0.739
-0.654
 to-0.639
Token-
Token ID12
Feature activation+1.08e+01

Pos logit contributions
 Majesty+0.914
 pulp+0.884
umm+0.882
 majesty+0.878
Ta+0.871
Neg logit contributions
.-0.653
,-0.610
 and-0.592
-0.510
 to-0.499
Tokenleaf
Token ID33201
Feature activation+5.59e+01

Pos logit contributions
Ta+0.289
 Majesty+0.280
umm+0.263
 majesty+0.256
agna+0.256
Neg logit contributions
.-0.606
,-0.586
 and-0.570
-0.498
 to-0.497
Token cl
Token ID537
Feature activation+1.49e+01

Pos logit contributions
 Majesty+2.275
Ta+2.246
umm+2.211
 sidelines+2.186
 majesty+2.136
Neg logit contributions
.-2.244
,-2.126
 and-2.064
-1.782
 to-1.756
Tokenover
Token ID2502
Feature activation+4.06e+01

Pos logit contributions
 Majesty+0.409
Ta+0.399
umm+0.395
 majesty+0.371
 sidelines+0.371
Neg logit contributions
.-0.842
,-0.806
 and-0.792
 to-0.702
-0.700
Token.
Token ID13
Feature activation-4.17e+01

Pos logit contributions
 Majesty+2.807
 sidelines+2.748
agna+2.721
umm+2.710
 majesty+2.701
Neg logit contributions
,-0.080
.-0.058
 and-0.022
+0.042
 to+0.217
Token They
Token ID1119
Feature activation+3.42e-01

Pos logit contributions
.+1.682
,+1.564
 and+1.514
 to+1.255
 with+1.244
Neg logit contributions
 Majesty-1.783
Ta-1.756
umm-1.743
 sidelines-1.700
 former-1.696
Token all
Token ID477
Feature activation-2.38e+00

Pos logit contributions
Ta+0.017
 Majesty+0.017
umm+0.017
 sidelines+0.017
agna+0.017
Neg logit contributions
.-0.011
,-0.010
 and-0.009
 to-0.007
-0.007
Token they
Token ID484
Feature activation-3.34e+00

Pos logit contributions
.+0.831
,+0.715
 and+0.683
 to+0.516
+0.506
Neg logit contributions
Ta-1.501
 Majesty-1.483
 pulp-1.459
 sidelines-1.446
umm-1.443
Token see
Token ID766
Feature activation+9.49e+00

Pos logit contributions
.+0.125
,+0.115
 and+0.111
 to+0.092
+0.091
Neg logit contributions
 Majesty-0.151
Ta-0.151
 sidelines-0.145
agna-0.145
 majesty-0.143
Token a
Token ID257
Feature activation-1.53e+01

Pos logit contributions
 Majesty+0.486
Ta+0.482
 pulp+0.473
agna+0.461
 majesty+0.460
Neg logit contributions
.-0.242
,-0.229
 and-0.212
 to-0.168
 with-0.158
Token lot
Token ID1256
Feature activation+1.60e+01

Pos logit contributions
.+0.597
,+0.558
 and+0.538
 to+0.442
+0.438
Neg logit contributions
Ta-0.711
 Majesty-0.697
umm-0.681
 sidelines-0.662
 majesty-0.659
Token of
Token ID286
Feature activation-3.55e+00

Pos logit contributions
Ta+0.830
umm+0.820
 sidelines+0.793
isher+0.787
 pulp+0.774
Neg logit contributions
.-0.360
,-0.348
 and-0.316
-0.273
 to-0.234
Token machines
Token ID8217
Feature activation+4.46e+01

Pos logit contributions
.+0.144
,+0.133
 and+0.131
 to+0.107
+0.105
Neg logit contributions
 Majesty-0.161
Ta-0.158
umm-0.156
agna-0.154
 majesty-0.152
Token and
Token ID290
Feature activation-2.24e+01

Pos logit contributions
 sidelines+2.878
agna+2.788
 inconvenience+2.741
umm+2.706
mic+2.700
Neg logit contributions
.-0.114
,-0.113
-0.048
 and-0.024
 to+0.138
Token belts
Token ID28617
Feature activation+5.03e+01

Pos logit contributions
.+1.246
,+1.185
 and+1.158
 to+1.019
+1.017
Neg logit contributions
 Majesty-0.626
Ta-0.624
umm-0.598
 sidelines-0.583
agna-0.582
Token and
Token ID290
Feature activation-2.23e+01

Pos logit contributions
 sidelines+3.291
 inconvenience+3.192
vation+3.181
mic+3.164
agna+3.133
Neg logit contributions
,-0.112
 and-0.045
.-0.041
+0.078
 to+0.205
Token gears
Token ID28713
Feature activation+4.99e+01

Pos logit contributions
.+1.183
,+1.121
 and+1.097
 to+0.959
+0.951
Neg logit contributions
 Majesty-0.689
Ta-0.688
umm-0.666
 sidelines-0.645
agna-0.644
Token.
Token ID13
Feature activation-3.84e+01

Pos logit contributions
 sidelines+3.233
agna+3.177
mic+3.167
isher+3.164
 Majesty+3.162
Neg logit contributions
,-0.115
.-0.085
 and-0.002
+0.014
 with+0.225
Token he
Token ID339
Feature activation-2.83e+00

Pos logit contributions
.+0.501
,+0.448
 and+0.448
+0.340
 to+0.332
Neg logit contributions
 Majesty-0.883
 majesty-0.835
 former-0.834
umm-0.830
Ta-0.828
Token reached
Token ID4251
Feature activation+1.51e+01

Pos logit contributions
.+0.109
,+0.101
 and+0.098
 to+0.080
+0.079
Neg logit contributions
 Majesty-0.130
Ta-0.129
umm-0.127
 sidelines-0.124
agna-0.124
Token out
Token ID503
Feature activation+6.08e-01

Pos logit contributions
 Majesty+0.669
 pulp+0.635
 sidelines+0.628
 majesty+0.623
Ta+0.622
Neg logit contributions
.-0.437
,-0.413
 and-0.386
-0.321
 with-0.302
Token and
Token ID290
Feature activation-2.47e+01

Pos logit contributions
 reins+0.028
ubb+0.027
ocol+0.027
 fascination+0.026
bek+0.026
Neg logit contributions
.-0.009
,-0.008
-0.007
 and-0.006
 with-0.003
Token pl
Token ID458
Feature activation+2.32e+01

Pos logit contributions
.+1.073
,+1.006
 and+0.976
+0.819
 to+0.818
Neg logit contributions
Ta-0.978
 Majesty-0.972
umm-0.932
agna-0.926
 sidelines-0.925
Tokenucked
Token ID17758
Feature activation+4.59e+01

Pos logit contributions
Ta+0.535
 Majesty+0.525
umm+0.500
agna+0.484
 sidelines+0.471
Neg logit contributions
.-1.408
,-1.346
 and-1.320
 to-1.172
-1.166
Token the
Token ID262
Feature activation-2.05e+01

Pos logit contributions
 pulp+2.256
 Majesty+2.241
isher+2.170
ubb+2.159
Ta+2.153
Neg logit contributions
.-0.971
,-0.881
 and-0.830
-0.679
 to-0.608
Token pet
Token ID4273
Feature activation+2.19e+01

Pos logit contributions
.+1.008
,+0.951
 and+0.928
 to+0.801
+0.794
Neg logit contributions
 Majesty-0.714
Ta-0.712
umm-0.690
 sidelines-0.674
agna-0.673
Tokenals
Token ID874
Feature activation+4.54e+01

Pos logit contributions
umm+0.379
Ta+0.358
 pulp+0.329
 sidelines+0.322
agna+0.321
Neg logit contributions
.-1.284
,-1.254
 and-1.222
-1.142
 to-1.098
Token from
Token ID422
Feature activation-1.08e+01

Pos logit contributions
 pulp+2.571
umm+2.551
ubb+2.516
 sidelines+2.464
ocol+2.424
Neg logit contributions
.-0.300
,-0.278
-0.240
 and-0.191
 with+0.039
Token the
Token ID262
Feature activation-2.03e+01

Pos logit contributions
.+0.204
,+0.191
 and+0.171
+0.119
 to+0.107
Neg logit contributions
 Majesty-0.649
 sidelines-0.641
 former-0.631
 pulp-0.631
Ta-0.629
Token.
Token ID13
Feature activation-3.79e+01

Pos logit contributions
 Majesty+0.633
 majesty+0.606
agna+0.598
umm+0.596
icious+0.595
Neg logit contributions
.-0.123
 and-0.089
,-0.085
-0.076
 with-0.047
Token I
Token ID314
Feature activation+9.84e+00

Pos logit contributions
.+1.507
,+1.420
 and+1.366
+1.178
 with+1.141
Neg logit contributions
 Majesty-1.651
Ta-1.638
 sidelines-1.600
umm-1.576
 former-1.564
Token don
Token ID836
Feature activation+1.66e+01

Pos logit contributions
Ta+0.413
 Majesty+0.410
agna+0.397
umm+0.395
 former+0.393
Neg logit contributions
.-0.395
,-0.368
 and-0.354
 to-0.296
-0.295
Token't
Token ID470
Feature activation+1.13e+01

Pos logit contributions
 sidelines+0.554
Ta+0.548
icious+0.543
umm+0.540
agna+0.519
Neg logit contributions
.-0.718
,-0.671
 and-0.652
 to-0.578
-0.538
Token like
Token ID588
Feature activation-3.72e+00

Pos logit contributions
Ta+0.466
 Majesty+0.465
umm+0.457
 sidelines+0.449
agna+0.446
Neg logit contributions
.-0.440
,-0.415
 and-0.402
-0.341
 to-0.340
Token sweet
Token ID6029
Feature activation+1.47e+01

Pos logit contributions
.+0.167
,+0.155
 and+0.153
+0.131
 with+0.127
Neg logit contributions
 Majesty-0.136
umm-0.128
 pulp-0.127
 majesty-0.126
agna-0.125
Token things
Token ID1243
Feature activation+2.32e+01

Pos logit contributions
umm+0.381
 Majesty+0.372
 pulp+0.345
 majesty+0.337
atmeal+0.327
Neg logit contributions
.-0.777
,-0.724
 and-0.721
-0.684
 to-0.648
Token.
Token ID13
Feature activation-3.78e+01

Pos logit contributions
agna+1.094
 Majesty+1.090
 majesty+1.064
umm+1.063
wy+1.059
Neg logit contributions
.-0.490
,-0.451
 and-0.450
-0.418
 the-0.315
Token I
Token ID314
Feature activation+9.97e+00

Pos logit contributions
.+1.504
,+1.406
 and+1.369
+1.160
 to+1.132
Neg logit contributions
 Majesty-1.649
Ta-1.646
 sidelines-1.593
umm-1.580
agna-1.558
Token prefer
Token ID4702
Feature activation+3.53e+01

Pos logit contributions
Ta+0.428
 Majesty+0.426
agna+0.413
 former+0.411
umm+0.411
Neg logit contributions
.-0.388
,-0.362
 and-0.347
 to-0.289
-0.288
Token salty
Token ID36021
Feature activation+3.77e+01

Pos logit contributions
 Majesty+1.572
 former+1.512
 pulp+1.505
 majesty+1.502
agna+1.476
Neg logit contributions
.-1.174
,-1.067
 and-1.020
-0.842
 with-0.772
Token shake
Token ID13279
Feature activation+2.70e+01

Pos logit contributions
.+0.810
,+0.763
 and+0.745
 to+0.637
+0.632
Neg logit contributions
 Majesty-0.622
Ta-0.618
umm-0.602
agna-0.587
 sidelines-0.585
Token and
Token ID290
Feature activation-2.19e+01

Pos logit contributions
umm+1.504
 former+1.485
 pulp+1.479
isher+1.451
 prope+1.439
Neg logit contributions
-0.113
,-0.081
.-0.068
 and+0.033
 to+0.042
Token break
Token ID2270
Feature activation+3.28e+01

Pos logit contributions
.+0.634
,+0.574
 and+0.549
 to+0.414
+0.408
Neg logit contributions
 Majesty-1.205
Ta-1.203
umm-1.180
agna-1.161
 sidelines-1.161
Token into
Token ID656
Feature activation+2.39e+00

Pos logit contributions
umm+1.992
 pulp+1.979
ubb+1.977
 ounces+1.955
 unnecessary+1.942
Neg logit contributions
,-0.132
 and-0.079
.-0.076
-0.038
 to+0.060
Token many
Token ID867
Feature activation+6.59e+00

Pos logit contributions
Ta+0.140
 Majesty+0.139
 pulp+0.137
umm+0.137
 former+0.136
Neg logit contributions
.-0.050
,-0.046
 and-0.042
 to-0.029
-0.028
Token pieces
Token ID5207
Feature activation+3.35e+01

Pos logit contributions
umm+0.204
 Majesty+0.201
Ta+0.199
 sidelines+0.195
 pulp+0.194
Neg logit contributions
.-0.315
,-0.301
 and-0.295
-0.266
 to-0.265
Token.
Token ID13
Feature activation-3.79e+01

Pos logit contributions
 sidelines+2.017
inge+1.971
inations+1.951
 pulp+1.945
vell+1.936
Neg logit contributions
,-0.119
-0.099
 and-0.048
.-0.013
 the+0.081
Token Lily
Token ID20037
Feature activation+1.03e+01

Pos logit contributions
.+1.191
,+1.082
 and+1.036
 to+0.800
+0.793
Neg logit contributions
 Majesty-1.932
Ta-1.922
umm-1.903
 sidelines-1.872
agna-1.857
Token was
Token ID373
Feature activation-9.33e+00

Pos logit contributions
Ta+0.520
 Majesty+0.517
agna+0.498
 pulp+0.495
 sidelines+0.490
Neg logit contributions
.-0.298
,-0.255
 and-0.231
-0.198
 to-0.194
Token scared
Token ID12008
Feature activation+1.29e+01

Pos logit contributions
.+0.369
,+0.336
 and+0.322
+0.280
 to+0.268
Neg logit contributions
 Majesty-0.375
Ta-0.368
 sidelines-0.367
agna-0.352
 former-0.351
Token and
Token ID290
Feature activation-2.17e+01

Pos logit contributions
 sidelines+0.627
umm+0.618
 unnecessary+0.615
uster+0.601
andel+0.591
Neg logit contributions
.-0.096
-0.084
,-0.053
 with-0.023
 a-0.019
Token and
Token ID290
Feature activation-2.68e+01

Pos logit contributions
 Majesty+0.869
 majesty+0.819
 former+0.817
isher+0.810
 sidelines+0.800
Neg logit contributions
.-0.257
,-0.220
 and-0.182
-0.178
 with-0.113
Token look
Token ID804
Feature activation+1.14e+01

Pos logit contributions
.+1.117
,+1.044
 and+1.012
 to+0.850
+0.839
Neg logit contributions
Ta-1.135
 Majesty-1.131
umm-1.106
agna-1.081
 sidelines-1.076
Token for
Token ID329
Feature activation-1.25e+01

Pos logit contributions
 Majesty+0.619
 majesty+0.588
agna+0.580
 former+0.579
 sidelines+0.576
Neg logit contributions
.-0.255
,-0.232
 and-0.224
-0.198
 to-0.155
Token the
Token ID262
Feature activation-2.24e+01

Pos logit contributions
.+0.431
,+0.402
 and+0.394
+0.321
 to+0.314
Neg logit contributions
 Majesty-0.590
 majesty-0.560
Ta-0.559
 former-0.554
 sidelines-0.551
Token tall
Token ID7331
Feature activation+1.62e+01

Pos logit contributions
.+1.038
,+0.962
 and+0.948
 to+0.803
 with+0.789
Neg logit contributions
Ta-0.879
 Majesty-0.851
agna-0.830
umm-0.829
 sidelines-0.823
Token tree
Token ID5509
Feature activation+2.31e+01

Pos logit contributions
 Majesty+0.595
umm+0.573
Ta+0.562
 majesty+0.554
 sidelines+0.554
Neg logit contributions
.-0.726
,-0.681
 and-0.662
-0.590
 to-0.576
Token with
Token ID351
Feature activation-1.24e+01

Pos logit contributions
 Majesty+1.384
 former+1.341
 majesty+1.338
agna+1.332
umm+1.308
Neg logit contributions
.-0.317
,-0.298
 and-0.255
-0.201
 to-0.111
Token red
Token ID2266
Feature activation+1.21e+01

Pos logit contributions
.+0.364
,+0.334
 and+0.320
+0.245
 to+0.238
Neg logit contributions
 Majesty-0.656
Ta-0.645
umm-0.631
 sidelines-0.627
 majesty-0.627
Token leaves
Token ID5667
Feature activation+2.88e+01

Pos logit contributions
 Majesty+0.395
umm+0.383
 pulp+0.375
 majesty+0.374
Ta+0.365
Neg logit contributions
.-0.565
,-0.531
 and-0.515
-0.479
 with-0.461
Token.
Token ID13
Feature activation-4.23e+01

Pos logit contributions
wy+1.626
 Majesty+1.617
isher+1.612
 pulp+1.606
umm+1.598
Neg logit contributions
,-0.250
.-0.250
-0.179
 and-0.177
 with-0.058
Token The
Token ID383
Feature activation-8.55e+00

Pos logit contributions
.+1.658
,+1.544
 and+1.497
+1.235
 to+1.235
Neg logit contributions
 Majesty-1.872
Ta-1.869
umm-1.823
 sidelines-1.792
agna-1.787
Token way
Token ID835
Feature activation+1.56e+01

Pos logit contributions
.+0.269
,+0.254
 and+0.249
+0.215
 to+0.213
Neg logit contributions
 Majesty-0.194
Ta-0.188
umm-0.184
 sidelines-0.182
agna-0.181
Token.
Token ID13
Feature activation-4.10e+01

Pos logit contributions
 sidelines+0.901
 Majesty+0.870
 spine+0.863
 footing+0.857
inge+0.850
Neg logit contributions
 and-0.057
.-0.047
,-0.041
+0.005
 a+0.025
Token She
Token ID1375
Feature activation+3.02e-01

Pos logit contributions
.+1.397
,+1.292
 and+1.235
 to+0.964
 with+0.959
Neg logit contributions
Ta-2.030
 Majesty-2.022
umm-1.985
 sidelines-1.934
 majesty-1.933
Token was
Token ID373
Feature activation-1.23e+01

Pos logit contributions
 Majesty+0.015
Ta+0.015
umm+0.015
 sidelines+0.015
agna+0.015
Neg logit contributions
.-0.010
,-0.009
 and-0.009
 to-0.007
-0.007
Token very
Token ID845
Feature activation-1.02e+01

Pos logit contributions
.+0.370
,+0.332
 and+0.316
+0.244
 to+0.239
Neg logit contributions
 Majesty-0.657
Ta-0.649
 sidelines-0.631
umm-0.628
 majesty-0.624
Token satisfied
Token ID11378
Feature activation+2.87e+01

Pos logit contributions
.+0.423
,+0.394
 and+0.383
 to+0.319
+0.318
Neg logit contributions
 Majesty-0.433
Ta-0.430
umm-0.418
 sidelines-0.412
agna-0.410
Token and
Token ID290
Feature activation-2.47e+01

Pos logit contributions
umm+1.630
 Majesty+1.600
 former+1.565
 majesty+1.556
fer+1.498
Neg logit contributions
.-0.178
,-0.139
-0.105
 and-0.067
 to+0.035
Token kept
Token ID4030
Feature activation+2.08e+01

Pos logit contributions
.+1.030
,+0.956
 and+0.932
+0.781
 to+0.779
Neg logit contributions
 Majesty-1.006
Ta-1.005
umm-0.969
 sidelines-0.959
 majesty-0.948
Token thinking
Token ID3612
Feature activation+1.90e+01

Pos logit contributions
 Majesty+0.992
Ta+0.972
 sidelines+0.954
agna+0.942
umm+0.940
Neg logit contributions
.-0.713
,-0.663
 and-0.630
-0.508
 to-0.485
Token about
Token ID546
Feature activation-2.48e+00

Pos logit contributions
umm+0.908
 Majesty+0.908
 sidelines+0.895
 former+0.888
Ta+0.883
Neg logit contributions
.-0.470
,-0.422
 and-0.406
-0.342
 to-0.276
Token the
Token ID262
Feature activation-2.02e+01

Pos logit contributions
.+0.063
,+0.059
 and+0.055
+0.041
 to+0.039
Neg logit contributions
 Majesty-0.137
 former-0.132
umm-0.131
 sidelines-0.130
 majesty-0.130
Token He
Token ID679
Feature activation+1.58e+00

Pos logit contributions
.+1.347
,+1.212
 and+1.169
 with+0.909
 to+0.901
Neg logit contributions
 Majesty-2.010
Ta-1.985
umm-1.973
 sidelines-1.948
agna-1.925
Token wanted
Token ID2227
Feature activation+3.87e+00

Pos logit contributions
 Majesty+0.073
Ta+0.072
agna+0.070
 majesty+0.069
 sidelines+0.069
Neg logit contributions
.-0.055
,-0.049
 and-0.047
 to-0.038
 with-0.038
Token to
Token ID284
Feature activation-2.00e+01

Pos logit contributions
Ta+0.261
 Majesty+0.260
 sidelines+0.255
 majesty+0.253
umm+0.253
Neg logit contributions
.-0.052
,-0.040
 and-0.035
-0.012
 with-0.010
Token separate
Token ID4553
Feature activation+2.98e+01

Pos logit contributions
.+0.837
,+0.782
 and+0.759
 to+0.635
+0.631
Neg logit contributions
 Majesty-0.840
Ta-0.839
umm-0.817
 sidelines-0.801
agna-0.800
Token the
Token ID262
Feature activation-2.05e+01

Pos logit contributions
umm+1.580
 sidelines+1.543
Ta+1.535
ocol+1.535
 pulp+1.533
Neg logit contributions
.-0.657
,-0.567
 and-0.525
-0.455
 to-0.404
Token deer
Token ID20096
Feature activation+3.69e+01

Pos logit contributions
.+1.048
,+0.995
 and+0.974
 to+0.839
+0.833
Neg logit contributions
Ta-0.696
 Majesty-0.688
umm-0.668
agna-0.658
 sidelines-0.648
Token from
Token ID422
Feature activation-1.07e+01

Pos logit contributions
 Majesty+2.019
 former+2.006
 sidelines+1.990
 pulp+1.985
umm+1.946
Neg logit contributions
.-0.540
,-0.392
-0.374
 and-0.345
 to-0.178
Token all
Token ID477
Feature activation-1.50e+00

Pos logit contributions
.+0.202
,+0.177
 and+0.167
+0.104
 to+0.101
Neg logit contributions
 Majesty-0.672
 sidelines-0.659
Ta-0.644
 former-0.644
 majesty-0.643
Token of
Token ID286
Feature activation-5.36e+00

Pos logit contributions
.+0.036
,+0.032
 and+0.031
+0.024
 to+0.023
Neg logit contributions
 sidelines-0.076
 Majesty-0.074
isher-0.074
Ta-0.073
umm-0.073
Token her
Token ID607
Feature activation-5.08e+00

Pos logit contributions
.+0.125
,+0.112
 and+0.106
 to+0.075
+0.070
Neg logit contributions
 Majesty-0.293
 sidelines-0.289
 majesty-0.285
 former-0.283
Ta-0.282
Token friends
Token ID2460
Feature activation+2.08e+01

Pos logit contributions
.+0.209
,+0.196
 and+0.190
+0.164
 with+0.160
Neg logit contributions
 Majesty-0.202
 majesty-0.192
 former-0.190
umm-0.189
Ta-0.186
Token there
Token ID612
Feature activation-2.47e+00

Pos logit contributions
.+0.869
,+0.792
 and+0.762
+0.611
 to+0.606
Neg logit contributions
Ta-1.161
 Majesty-1.131
 sidelines-1.094
umm-1.093
agna-1.091
Token was
Token ID373
Feature activation-1.22e+01

Pos logit contributions
.+0.061
 and+0.051
,+0.051
+0.039
 to+0.039
Neg logit contributions
Ta-0.125
umm-0.122
agna-0.122
 Majesty-0.121
bek-0.119
Token a
Token ID257
Feature activation-1.72e+01

Pos logit contributions
.+0.269
,+0.237
 and+0.225
+0.151
 to+0.143
Neg logit contributions
Ta-0.718
 Majesty-0.715
 former-0.700
umm-0.698
agna-0.692
Token big
Token ID1263
Feature activation+6.97e-01

Pos logit contributions
.+0.691
,+0.654
 and+0.629
 to+0.518
+0.516
Neg logit contributions
 Majesty-0.762
Ta-0.762
umm-0.743
isher-0.724
agna-0.723
Token gift
Token ID6979
Feature activation+3.44e+01

Pos logit contributions
 Majesty+0.030
Ta+0.030
umm+0.030
 sidelines+0.029
agna+0.029
Neg logit contributions
.-0.028
,-0.026
 and-0.025
 to-0.021
-0.021
Token for
Token ID329
Feature activation-9.93e+00

Pos logit contributions
umm+1.958
Ta+1.898
 sidelines+1.897
agna+1.890
 Majesty+1.863
Neg logit contributions
.-0.499
,-0.458
 and-0.426
-0.351
 to-0.233
Token Sarah
Token ID10490
Feature activation+2.15e+01

Pos logit contributions
.+0.404
,+0.377
 and+0.370
+0.308
 to+0.307
Neg logit contributions
Ta-0.401
 Majesty-0.399
 sidelines-0.396
umm-0.391
agna-0.383
Token!
Token ID0
Feature activation-2.37e+01

Pos logit contributions
 Majesty+1.395
 majesty+1.361
 former+1.360
agna+1.349
uries+1.347
Neg logit contributions
.-0.075
,-0.065
 and-0.020
+0.013
 with+0.072
Token It
Token ID632
Feature activation+1.28e+01

Pos logit contributions
.+0.952
,+0.881
 and+0.844
 with+0.698
 to+0.693
Neg logit contributions
Ta-0.996
umm-0.950
 sidelines-0.938
 Majesty-0.936
 former-0.918
Token was
Token ID373
Feature activation-1.19e+01

Pos logit contributions
Ta+0.705
 Majesty+0.681
umm+0.677
 former+0.669
agna+0.668
Neg logit contributions
.-0.314
,-0.277
 and-0.256
 to-0.190
-0.186
Token from
Token ID422
Feature activation-1.03e+01

Pos logit contributions
.+0.291
,+0.253
 and+0.239
+0.182
 to+0.168
Neg logit contributions
Ta-0.653
 Majesty-0.648
 former-0.643
umm-0.637
agna-0.634
Token tried
Token ID3088
Feature activation+9.08e+00

Pos logit contributions
.+0.068
,+0.063
 and+0.060
 to+0.049
+0.049
Neg logit contributions
 Majesty-0.088
Ta-0.087
umm-0.085
agna-0.084
 sidelines-0.083
Token yawn
Token ID40908
Feature activation+2.63e+01

Pos logit contributions
 Majesty+0.529
 former+0.524
 sidelines+0.517
 pulp+0.506
agna+0.504
Neg logit contributions
.-0.150
,-0.110
 and-0.097
-0.071
 with-0.053
Tokening
Token ID278
Feature activation+2.33e+01

Pos logit contributions
umm+1.272
agna+1.222
icious+1.202
wy+1.176
Ta+1.169
Neg logit contributions
.-0.665
,-0.555
 and-0.524
 the-0.470
-0.459
Token,
Token ID11
Feature activation-3.29e+01

Pos logit contributions
 Majesty+1.437
 sidelines+1.406
umm+1.394
agna+1.391
 former+1.387
Neg logit contributions
.-0.227
,-0.138
 and-0.107
-0.107
 to-0.024
Token but
Token ID475
Feature activation-1.43e+01

Pos logit contributions
.+0.866
,+0.757
 and+0.674
+0.525
 to+0.511
Neg logit contributions
 Majesty-1.820
Ta-1.804
 sidelines-1.788
agna-1.756
umm-1.749
Token that
Token ID326
Feature activation-1.58e+01

Pos logit contributions
.+0.359
,+0.316
 and+0.300
+0.213
 to+0.211
Neg logit contributions
 Majesty-0.826
Ta-0.824
umm-0.804
 sidelines-0.801
agna-0.800
Token didn
Token ID1422
Feature activation+7.03e+00

Pos logit contributions
.+0.467
,+0.424
 and+0.407
 to+0.313
+0.310
Neg logit contributions
 Majesty-0.825
Ta-0.823
umm-0.803
agna-0.790
 former-0.788
Token't
Token ID470
Feature activation+6.91e+00

Pos logit contributions
Ta+0.245
 sidelines+0.235
icious+0.228
umm+0.227
mic+0.225
Neg logit contributions
.-0.306
,-0.290
 and-0.285
 to-0.242
 with-0.240
Token help
Token ID1037
Feature activation+5.75e+00

Pos logit contributions
 Majesty+0.247
Ta+0.242
 sidelines+0.241
umm+0.236
 majesty+0.234
Neg logit contributions
.-0.289
,-0.278
 and-0.272
-0.239
 to-0.230
Token either
Token ID2035
Feature activation-5.60e-01

Pos logit contributions
 unnecessary+0.288
 Majesty+0.285
reath+0.283
 sidelines+0.281
icious+0.280
Neg logit contributions
.-0.101
 and-0.093
,-0.089
-0.077
 with-0.061
Token.
Token ID13
Feature activation-4.22e+01

Pos logit contributions
,+0.000
.+0.000
 and+0.000
-0.002
 to-0.003
Neg logit contributions
umm-0.037
 avail-0.036
 former-0.035
 ounces-0.035
 sidelines-0.035
Token could
Token ID714
Feature activation-5.51e+00

Pos logit contributions
.+0.196
,+0.179
 and+0.172
+0.145
 with+0.144
Neg logit contributions
Ta-0.228
 Majesty-0.220
agna-0.216
umm-0.212
 majesty-0.212
Token open
Token ID1280
Feature activation+1.07e+01

Pos logit contributions
.+0.206
,+0.192
 and+0.190
+0.167
 to+0.160
Neg logit contributions
 Majesty-0.202
agna-0.195
 majesty-0.190
 former-0.190
 sidelines-0.190
Token the
Token ID262
Feature activation-2.29e+01

Pos logit contributions
isher+0.533
 Majesty+0.509
ocol+0.505
 avail+0.501
agna+0.495
Neg logit contributions
.-0.237
,-0.202
 and-0.199
-0.181
 to-0.152
Token box
Token ID3091
Feature activation+2.86e+01

Pos logit contributions
.+1.110
,+1.049
 and+1.022
 to+0.883
+0.882
Neg logit contributions
Ta-0.794
 Majesty-0.791
umm-0.773
agna-0.754
 sidelines-0.753
Token,
Token ID11
Feature activation-3.38e+01

Pos logit contributions
umm+1.794
isher+1.729
 sidelines+1.723
agna+1.721
 Majesty+1.714
Neg logit contributions
.-0.145
 and-0.044
,-0.021
-0.017
 to+0.067
Token he
Token ID339
Feature activation-4.77e+00

Pos logit contributions
.+0.859
,+0.762
 and+0.721
 to+0.516
+0.512
Neg logit contributions
 Majesty-1.924
Ta-1.914
umm-1.876
agna-1.867
 sidelines-1.866
Token heard
Token ID2982
Feature activation+4.95e+00

Pos logit contributions
.+0.177
,+0.164
 and+0.158
+0.129
 to+0.129
Neg logit contributions
 Majesty-0.219
Ta-0.218
umm-0.211
agna-0.210
 sidelines-0.208
Token a
Token ID257
Feature activation-1.96e+01

Pos logit contributions
Ta+0.291
 Majesty+0.290
 sidelines+0.281
 former+0.278
agna+0.276
Neg logit contributions
.-0.095
,-0.087
 and-0.079
-0.052
 to-0.047
Token voice
Token ID3809
Feature activation+2.14e+01

Pos logit contributions
.+0.884
,+0.831
 and+0.807
 to+0.684
+0.680
Neg logit contributions
 Majesty-0.768
Ta-0.766
umm-0.747
agna-0.728
 sidelines-0.722
Token saying
Token ID2282
Feature activation+6.37e+00

Pos logit contributions
 sidelines+1.099
vation+1.083
Ta+1.083
mic+1.069
 stret+1.069
Neg logit contributions
.-0.301
,-0.296
 and-0.270
-0.239
 to-0.192
Token,
Token ID11
Feature activation-3.33e+01

Pos logit contributions
 Majesty+0.376
 majesty+0.357
agna+0.352
 pulp+0.335
ocol+0.335
Neg logit contributions
.-0.084
 and-0.080
,-0.057
-0.045
 with-0.037
Token she
Token ID673
Feature activation-8.69e-01

Pos logit contributions
.+0.958
 and+0.829
,+0.827
+0.639
 to+0.630
Neg logit contributions
Ta-1.446
 sidelines-1.417
 pulp-1.383
 Majesty-1.358
agna-1.350
Token saw
Token ID2497
Feature activation+4.27e+00

Pos logit contributions
.+0.032
,+0.029
 and+0.027
 to+0.023
+0.022
Neg logit contributions
Ta-0.040
 Majesty-0.040
agna-0.038
umm-0.037
 majesty-0.037
Token a
Token ID257
Feature activation-1.68e+01

Pos logit contributions
Ta+0.273
 Majesty+0.264
 pulp+0.256
 sidelines+0.256
 majesty+0.253
Neg logit contributions
.-0.080
,-0.068
 and-0.062
 with-0.036
 to-0.033
Token path
Token ID3108
Feature activation+4.00e+01

Pos logit contributions
.+0.599
,+0.570
 and+0.544
 to+0.433
+0.426
Neg logit contributions
Ta-0.831
 Majesty-0.823
umm-0.808
 sidelines-0.786
 former-0.785
Token that
Token ID326
Feature activation-1.33e+01

Pos logit contributions
 sidelines+1.996
Ta+1.880
inge+1.849
bek+1.833
umm+1.833
Neg logit contributions
.-0.794
,-0.749
 and-0.650
-0.648
 to-0.435
Token she
Token ID673
Feature activation-7.57e-01

Pos logit contributions
.+0.351
,+0.315
 and+0.300
 to+0.217
+0.214
Neg logit contributions
Ta-0.759
 Majesty-0.758
umm-0.743
 sidelines-0.732
agna-0.731
Token had
Token ID550
Feature activation-5.49e-01

Pos logit contributions
.+0.025
,+0.023
 and+0.022
 to+0.017
+0.017
Neg logit contributions
 Majesty-0.039
Ta-0.039
umm-0.038
 sidelines-0.038
agna-0.038
Token never
Token ID1239
Feature activation+1.13e+01

Pos logit contributions
.+0.022
,+0.021
 and+0.020
 with+0.016
+0.016
Neg logit contributions
Ta-0.022
 Majesty-0.022
 majesty-0.021
 former-0.020
 sidelines-0.020
Token seen
Token ID1775
Feature activation+2.29e+01

Pos logit contributions
Ta+0.367
 Majesty+0.347
umm+0.333
 sidelines+0.333
 former+0.328
Neg logit contributions
.-0.544
,-0.524
 and-0.505
-0.441
 with-0.434
Token before
Token ID878
Feature activation-5.09e-01

Pos logit contributions
 Majesty+0.964
Ta+0.958
umm+0.955
 former+0.906
atmeal+0.895
Neg logit contributions
.-0.701
,-0.680
 and-0.634
-0.548
 to-0.491
Token.
Token ID13
Feature activation-3.99e+01

Pos logit contributions
,+0.001
.+0.000
 and-0.000
-0.001
 with-0.002
Neg logit contributions
 sidelines-0.031
 former-0.030
umm-0.030
mic-0.029
 Majesty-0.029
Token never
Token ID1239
Feature activation+1.29e+01

Pos logit contributions
 Majesty+0.286
 sidelines+0.281
 former+0.279
Ta+0.279
 majesty+0.271
Neg logit contributions
.-0.268
,-0.251
 and-0.243
 to-0.202
-0.201
Token trust
Token ID3774
Feature activation+2.03e+01

Pos logit contributions
 Majesty+0.498
Ta+0.484
 sidelines+0.482
umm+0.474
agna+0.471
Neg logit contributions
.-0.528
,-0.499
 and-0.490
-0.417
 with-0.403
Token him
Token ID683
Feature activation+1.04e+01

Pos logit contributions
 Majesty+0.953
 sidelines+0.941
 unnecessary+0.915
 ounces+0.900
ubb+0.897
Neg logit contributions
.-0.508
,-0.477
 and-0.452
-0.388
 with-0.343
Token again
Token ID757
Feature activation+1.10e+00

Pos logit contributions
 Majesty+0.544
 former+0.535
agna+0.525
uries+0.518
umm+0.514
Neg logit contributions
.-0.177
,-0.177
 and-0.160
-0.146
 the-0.112
Token.
Token ID13
Feature activation-3.79e+01

Pos logit contributions
 former+0.077
 Majesty+0.073
umm+0.070
 ounces+0.069
 lifetime+0.069
Neg logit contributions
,-0.002
.-0.001
 and-0.001
+0.003
 with+0.005
Token He
Token ID679
Feature activation+4.43e+00

Pos logit contributions
.+1.393
,+1.323
 and+1.246
 with+1.009
 to+0.983
Neg logit contributions
umm-1.717
Ta-1.706
 Majesty-1.702
 former-1.659
agna-1.609
Token wished
Token ID16555
Feature activation+2.25e+01

Pos logit contributions
 Majesty+0.189
Ta+0.187
umm+0.180
agna+0.179
 former+0.179
Neg logit contributions
.-0.178
,-0.165
 and-0.157
 to-0.133
-0.133
Token he
Token ID339
Feature activation+9.77e-02

Pos logit contributions
 Majesty+1.096
 sidelines+1.085
 majesty+1.044
umm+1.041
 former+1.039
Neg logit contributions
.-0.595
,-0.499
 and-0.454
-0.385
 with-0.349
Token could
Token ID714
Feature activation-8.46e-01

Pos logit contributions
Ta+0.004
 Majesty+0.004
agna+0.004
umm+0.004
 majesty+0.004
Neg logit contributions
.-0.004
,-0.003
 and-0.003
-0.003
 to-0.003
Token have
Token ID423
Feature activation+8.86e+00

Pos logit contributions
.+0.035
,+0.033
 and+0.032
+0.027
 to+0.026
Neg logit contributions
 Majesty-0.033
 former-0.032
agna-0.031
umm-0.031
 sidelines-0.031
Token found
Token ID1043
Feature activation+1.44e+01

Pos logit contributions
 Majesty+0.359
Ta+0.357
umm+0.343
agna+0.341
 sidelines+0.339
Neg logit contributions
.-0.366
,-0.345
 and-0.335
-0.277
 to-0.274
Token a
Token ID257
Feature activation-1.48e+01

Pos logit contributions
 Majesty+0.186
Ta+0.182
 sidelines+0.179
agna+0.177
umm+0.177
Neg logit contributions
.-0.083
,-0.077
 and-0.070
-0.051
 to-0.049
Token tall
Token ID7331
Feature activation+2.09e+01

Pos logit contributions
.+0.578
,+0.540
 and+0.520
 to+0.424
+0.424
Neg logit contributions
Ta-0.678
 Majesty-0.662
umm-0.652
agna-0.639
isher-0.634
Token tower
Token ID10580
Feature activation+4.78e+01

Pos logit contributions
 Majesty+0.493
umm+0.462
 sidelines+0.448
 majesty+0.432
Ta+0.425
Neg logit contributions
.-1.207
,-1.141
 and-1.121
 to-1.034
-1.033
Token.
Token ID13
Feature activation-3.79e+01

Pos logit contributions
umm+3.126
 sidelines+3.013
 Majesty+3.002
 seventeen+2.997
agna+2.962
Neg logit contributions
,-0.136
.-0.070
 and-0.050
+0.009
 the+0.200
Token
Token ID198
Feature activation-2.93e+01

Pos logit contributions
.+0.920
,+0.816
 and+0.770
 to+0.548
 with+0.514
Neg logit contributions
Ta-2.144
 Majesty-2.117
 sidelines-2.105
umm-2.081
agna-2.080
Token
Token ID198
Feature activation-2.92e+01

Pos logit contributions
.+0.544
,+0.492
 and+0.406
 to+0.240
 with+0.213
Neg logit contributions
Ta-1.705
Uh-1.619
 sidelines-1.592
icious-1.587
agna-1.568
Token"
Token ID1
Feature activation-7.99e+00

Pos logit contributions
.+1.211
,+1.131
 and+1.097
 to+0.917
+0.909
Neg logit contributions
 Majesty-1.243
Ta-1.242
umm-1.209
 sidelines-1.187
agna-1.185
TokenLook
Token ID8567
Feature activation+3.79e+01

Pos logit contributions
.+0.400
 and+0.373
,+0.363
+0.350
 with+0.329
Neg logit contributions
 Majesty-0.295
 majesty-0.278
 former-0.258
agna-0.251
umm-0.249
Token,
Token ID11
Feature activation-2.84e+01

Pos logit contributions
 Majesty+2.566
agna+2.454
 majesty+2.432
heit+2.402
isher+2.344
Neg logit contributions
.-0.405
 and-0.283
,-0.181
-0.137
 to-0.077
Token our
Token ID674
Feature activation+1.33e+01

Pos logit contributions
.+0.917
,+0.856
 and+0.811
 to+0.642
+0.632
Neg logit contributions
Ta-1.465
 Majesty-1.462
umm-1.449
 sidelines-1.421
 former-1.405
Token tower
Token ID10580
Feature activation+4.79e+01

Pos logit contributions
 Majesty+0.338
umm+0.313
Ta+0.306
 majesty+0.302
 sidelines+0.294
Neg logit contributions
.-0.766
,-0.727
 and-0.716
-0.638
 to-0.635
Token it
Token ID340
Feature activation+1.40e+00

Pos logit contributions
umm+0.958
 sidelines+0.867
isher+0.858
atmeal+0.858
roach+0.855
Neg logit contributions
.-0.217
,-0.177
-0.157
 and-0.149
 to-0.097
Token made
Token ID925
Feature activation+8.83e-01

Pos logit contributions
 Majesty+0.062
Ta+0.062
umm+0.061
 sidelines+0.060
agna+0.060
Neg logit contributions
.-0.049
,-0.046
 and-0.043
-0.037
 with-0.035
Token John
Token ID1757
Feature activation+2.52e+01

Pos logit contributions
umm+0.037
 sidelines+0.035
 Majesty+0.035
Ta+0.035
icious+0.035
Neg logit contributions
.-0.031
,-0.029
 and-0.028
 to-0.023
-0.023
Token super
Token ID2208
Feature activation+1.84e+01

Pos logit contributions
 Majesty+1.033
Ta+1.006
 majesty+0.976
 former+0.974
 sidelines+0.971
Neg logit contributions
.-0.896
,-0.865
 and-0.806
-0.701
 to-0.661
Token sleepy
Token ID42023
Feature activation+1.94e+01

Pos logit contributions
 Majesty+0.722
Ta+0.709
umm+0.690
 sidelines+0.686
 majesty+0.683
Neg logit contributions
.-0.769
,-0.733
 and-0.707
-0.598
 to-0.590
Token.
Token ID13
Feature activation-4.03e+01

Pos logit contributions
umm+1.304
 Majesty+1.258
 former+1.230
agna+1.215
 sidelines+1.207
Neg logit contributions
,-0.027
.-0.013
 and+0.007
+0.015
 to+0.086
Token He
Token ID679
Feature activation+2.16e+00

Pos logit contributions
.+1.311
,+1.227
 and+1.157
 with+0.937
 to+0.930
Neg logit contributions
 Majesty-1.866
umm-1.855
Ta-1.821
 sidelines-1.785
agna-1.757
Token went
Token ID1816
Feature activation+5.27e+00

Pos logit contributions
 Majesty+0.091
Ta+0.091
umm+0.088
agna+0.087
 sidelines+0.086
Neg logit contributions
.-0.089
,-0.083
 and-0.080
 to-0.067
-0.067
Token to
Token ID284
Feature activation-1.93e+01

Pos logit contributions
 sidelines+0.286
 Majesty+0.274
 reins+0.269
Ta+0.264
 majesty+0.263
Neg logit contributions
.-0.108
,-0.091
 and-0.076
-0.067
 with-0.049
Token the
Token ID262
Feature activation-1.93e+01

Pos logit contributions
.+0.634
,+0.588
 and+0.555
 with+0.440
 to+0.431
Neg logit contributions
 Majesty-0.908
 sidelines-0.892
Ta-0.886
 former-0.864
 majesty-0.857
Token park
Token ID3952
Feature activation+2.61e+01

Pos logit contributions
.+0.918
,+0.860
 and+0.840
 to+0.729
+0.718
Neg logit contributions
 Majesty-0.709
Ta-0.708
umm-0.691
agna-0.674
 majesty-0.668
Token great
Token ID1049
Feature activation+1.47e+01

Pos logit contributions
.+0.573
,+0.528
 and+0.510
 to+0.405
+0.404
Neg logit contributions
Ta-0.806
 Majesty-0.805
umm-0.788
 sidelines-0.773
agna-0.772
Token frog
Token ID21264
Feature activation+3.53e+01

Pos logit contributions
Ta+0.582
 Majesty+0.577
umm+0.559
agna+0.551
 sidelines+0.551
Neg logit contributions
.-0.663
,-0.629
 and-0.612
 to-0.512
-0.507
Token.
Token ID13
Feature activation-3.97e+01

Pos logit contributions
rim+1.274
oise+1.233
 Majesty+1.231
 majesty+1.229
cker+1.225
Neg logit contributions
,-0.770
.-0.709
 and-0.708
 the-0.652
-0.610
Token He
Token ID679
Feature activation+2.75e+00

Pos logit contributions
.+1.297
,+1.193
 and+1.163
 to+0.942
 with+0.895
Neg logit contributions
Ta-1.929
 Majesty-1.922
umm-1.909
 majesty-1.837
 sidelines-1.836
Token liked
Token ID8288
Feature activation+7.16e+00

Pos logit contributions
Ta+0.110
 Majesty+0.109
 majesty+0.106
wy+0.101
 pulp+0.101
Neg logit contributions
.-0.106
,-0.097
 and-0.093
 to-0.080
-0.078
Token to
Token ID284
Feature activation-1.89e+01

Pos logit contributions
 Majesty+0.483
Ta+0.474
 sidelines+0.472
umm+0.469
 pulp+0.468
Neg logit contributions
.-0.098
,-0.076
 and-0.063
-0.025
 with-0.018
Token plan
Token ID1410
Feature activation+3.07e+01

Pos logit contributions
.+0.832
,+0.781
 and+0.758
 to+0.639
+0.632
Neg logit contributions
Ta-0.760
 Majesty-0.758
umm-0.745
agna-0.727
 sidelines-0.720
Token fun
Token ID1257
Feature activation+1.49e+01

Pos logit contributions
 unnecessary+1.401
 Majesty+1.379
 sidelines+1.354
agna+1.349
affles+1.341
Neg logit contributions
.-0.726
,-0.668
 and-0.629
-0.532
 with-0.408
Token things
Token ID1243
Feature activation+2.17e+01

Pos logit contributions
 Majesty+0.539
affles+0.529
 majesty+0.505
umm+0.499
 sidelines+0.492
Neg logit contributions
.-0.533
,-0.494
 and-0.476
-0.459
 to-0.406
Token to
Token ID284
Feature activation-1.88e+01

Pos logit contributions
vell+1.114
robat+1.081
 Majesty+1.073
atmeal+1.071
cher+1.064
Neg logit contributions
.-0.177
,-0.153
-0.140
 and-0.134
 the-0.048
Token do
Token ID466
Feature activation+1.05e+01

Pos logit contributions
.+0.773
,+0.722
 and+0.707
 to+0.589
+0.587
Neg logit contributions
 Majesty-0.789
Ta-0.773
 sidelines-0.755
 majesty-0.744
umm-0.744
Token and
Token ID290
Feature activation-2.17e+01

Pos logit contributions
atmeal+0.985
agna+0.978
umm+0.965
 Majesty+0.961
 former+0.951
Neg logit contributions
.-0.314
,-0.268
 and-0.219
-0.189
 to-0.118
Token dad
Token ID9955
Feature activation+2.31e+01

Pos logit contributions
.+0.817
,+0.754
 and+0.728
+0.594
 to+0.589
Neg logit contributions
 Majesty-0.998
Ta-0.988
umm-0.966
agna-0.953
 sidelines-0.950
Token with
Token ID351
Feature activation-7.71e+00

Pos logit contributions
agna+1.288
 Majesty+1.242
atmeal+1.240
umm+1.231
 former+1.198
Neg logit contributions
.-0.431
,-0.373
 and-0.340
-0.285
 to-0.183
Token the
Token ID262
Feature activation-1.74e+01

Pos logit contributions
.+0.173
,+0.153
 and+0.144
 with+0.097
+0.097
Neg logit contributions
Ta-0.459
 Majesty-0.455
 sidelines-0.446
agna-0.444
umm-0.443
Token grill
Token ID29901
Feature activation+4.21e+01

Pos logit contributions
.+0.937
,+0.888
 and+0.871
 to+0.763
+0.755
Neg logit contributions
 Majesty-0.538
Ta-0.537
umm-0.513
 sidelines-0.497
agna-0.495
Token.
Token ID13
Feature activation-3.78e+01

Pos logit contributions
umm+2.756
 Majesty+2.650
atmeal+2.649
agna+2.649
 former+2.642
Neg logit contributions
.-0.299
,-0.223
 and-0.157
-0.132
 with+0.060
Token<|endoftext|>
Token ID50256
Feature activation+3.33e+00

Pos logit contributions
.+0.639
,+0.570
 and+0.484
 to+0.273
 with+0.268
Neg logit contributions
Ta-2.417
umm-2.396
 Majesty-2.353
 former-2.293
 majesty-2.287
TokenOnce
Token ID7454
Feature activation+4.64e+00

Pos logit contributions
Ta+0.153
rob+0.147
Whe+0.144
acket+0.140
umm+0.137
Neg logit contributions
,-0.063
 and-0.056
.-0.056
 to-0.045
 that-0.041
Token upon
Token ID2402
Feature activation+6.85e+00

Pos logit contributions
 Majesty+0.133
 majesty+0.131
Ta+0.130
 sidelines+0.122
pron+0.119
Neg logit contributions
.-0.192
 and-0.182
,-0.172
 with-0.163
 to-0.147
Token a
Token ID257
Feature activation-1.46e+01

Pos logit contributions
Ta+0.382
 Majesty+0.345
atmeal+0.342
icious+0.340
 pulp+0.339
Neg logit contributions
,-0.140
.-0.133
 and-0.113
 to-0.080
 with-0.070
Token time
Token ID640
Feature activation+2.96e+00

Pos logit contributions
.+0.533
,+0.492
 and+0.472
 to+0.410
 with+0.401
Neg logit contributions
Ta-0.619
 Majesty-0.608
icious-0.596
umm-0.592
 former-0.584
Token to
Token ID284
Feature activation-1.94e+01

Pos logit contributions
Ta+0.271
 pulp+0.270
 sidelines+0.268
agna+0.264
 unnecessary+0.262
Neg logit contributions
.-0.077
,-0.066
 and-0.056
-0.034
 with-0.029
Token play
Token ID711
Feature activation+1.66e+01

Pos logit contributions
.+0.770
,+0.718
 and+0.695
 to+0.574
+0.570
Neg logit contributions
 Majesty-0.854
Ta-0.854
umm-0.830
 sidelines-0.818
agna-0.814
Token with
Token ID351
Feature activation-1.01e+01

Pos logit contributions
 sidelines+0.875
icious+0.842
Ta+0.834
agna+0.826
Whe+0.825
Neg logit contributions
.-0.270
,-0.239
 and-0.216
 to-0.169
-0.148
Token the
Token ID262
Feature activation-2.00e+01

Pos logit contributions
.+0.195
,+0.186
 and+0.170
+0.121
 to+0.118
Neg logit contributions
 sidelines-0.545
 pulp-0.543
Ta-0.535
agna-0.532
 Majesty-0.528
Token dog
Token ID3290
Feature activation+2.62e+01

Pos logit contributions
.+0.831
,+0.768
 and+0.764
 to+0.633
+0.619
Neg logit contributions
 Majesty-0.903
Ta-0.868
umm-0.861
agna-0.853
 former-0.847
Token and
Token ID290
Feature activation-2.39e+01

Pos logit contributions
 Majesty+1.519
 Indians+1.503
agna+1.496
wy+1.481
uries+1.478
Neg logit contributions
.-0.058
-0.025
,+0.008
 and+0.011
 to+0.110
Token the
Token ID262
Feature activation-1.99e+01

Pos logit contributions
.+0.911
,+0.849
 and+0.823
 to+0.681
+0.668
Neg logit contributions
 Majesty-1.111
Ta-1.105
umm-1.103
agna-1.063
 sidelines-1.053
Token ball
Token ID2613
Feature activation+3.24e+01

Pos logit contributions
.+0.779
,+0.714
 and+0.713
 to+0.585
+0.573
Neg logit contributions
 Majesty-0.942
umm-0.919
Ta-0.915
 former-0.903
agna-0.894
Token.
Token ID13
Feature activation-3.97e+01

Pos logit contributions
ses+2.090
 Majesty+2.081
 sidelines+2.053
 pulp+2.052
sed+2.048
Neg logit contributions
,-0.058
.-0.057
 and-0.030
+0.034
 with+0.150
Token They
Token ID1119
Feature activation+1.84e+00

Pos logit contributions
.+0.850
,+0.722
 and+0.671
 with+0.455
 to+0.427
Neg logit contributions
Ta-2.142
Uh-2.121
umm-2.100
 Majesty-2.093
 pulp-2.079
Token ran
Token ID4966
Feature activation+1.48e+01

Pos logit contributions
 Majesty+0.084
Ta+0.084
umm+0.082
 sidelines+0.081
agna+0.081
Neg logit contributions
.-0.070
,-0.065
 and-0.063
 to-0.052
-0.051