TOP ACTIVATIONS
MAX = 0.616

girl called L izz ie . She liked to perform different
little bird ie named Tweet . Tweet was very happy and
friendly rabbit named Ros ie . Ros ie warned
Ben tries to blow bubbles . At first , he blows
his tears and his nose . She smiled and kissed him
Lily who loved fair ies . She would go to the
sorry ," Ben sob bed . Mom my wiped
young girl named Ell a . Ell a was very excited
tum m ies rum ble . They look for something to
beautiful young girl named Elsa . She was very graceful and
S ara and Ben sighed . They looked at the dessert
Ben stayed outside . He saw a dog running
who liked to play games . They were very competitive and
cute little puppy named Max . Max loved to play with
saw a girl named Sue . Sue was holding a bag
pet him ," Ben says . " You 're
going outs ite with Mom . They began
L ily nodded . She had a bump on
curious little girl named Mary . Every day Mary would explore
ow !" they sob bed . The tree heard

MIN ACTIVATIONS
MIN = -1.112

her the card . " Ta - da ! We made
. She said , " Ta - da ! Here I
shook . " Uh - oh , Lily ,
to Tim my , " Uh oh , the roof has
. " Uh oh , we have a
big rain clouds . " Uh oh ," said Tim ,
shirts . " Uh - oh , our shirts
angry . " Uh - oh , what is
in and said , " Uh oh , you boys made
a . " Uh , what are you ?"
pieces . " Uh - oh , our missile
sofa . " Uh - oh , where did
open freezer and said " Uh oh !" He quickly shut
scared , she said " Uh - oh ", and ran
bends . " Uh - oh , this spoon
shake . " Uh - oh , what is
in the distance . " Uh oh , that cloud looks
filled the air . " Uh - oh ," said Jack
â € � � Uh - oh , â €
too . " Uh - oh !" Tom said

INTERVAL 0.184 - 0.616
CONTAINS 31.269%

lake was once again filled with warm and happy water .
the squirrel s . One day , a heavy
Mom leads Anna and Ben to the new room . It
coming from next to him . It was a little bird
. He knew he had to remove it , but he

INTERVAL -0.248 - 0.184
CONTAINS 43.843%

, Tim . Maybe we can ask for help ." Sam
again and again . They are not shy anymore . They
One day , his mom gave him a special
felt sorry and ashamed . They did not know about the
showed them all the pictures he had drawn in his notebook

INTERVAL -0.680 - -0.248
CONTAINS 24.191%

he said . ' Can I go play in the rain
every day . <|endoftext|> Once upon a time , there was
day on . The moral of the story is
. So she started to sway her arms and move her
saw that it was not smooth , but instead it was

INTERVAL -1.112 - -0.680
CONTAINS 0.697%

Amy sadly replied , " Okay Mom . I 'm sorry
a new map � � € � � the map of
Harry wanted to mail a parcel to his grandma . Sarah
completed its journey without any fears . Everyone celebrated and cheered
was time to put the mixture into the freezer . Lily
Token girl
Token ID2576
Feature activation-3.60e-01

Pos logit contributions
Ta+0.254
 Majesty+0.246
Uh+0.238
umm+0.235
 former+0.231
Neg logit contributions
.-0.283
,-0.271
 and-0.266
 to-0.227
 with-0.227
Token called
Token ID1444
Feature activation-4.13e-02

Pos logit contributions
 Majesty+1.273
Ta+1.257
 majesty+1.212
bek+1.184
 former+1.182
Neg logit contributions
.-0.647
,-0.607
 and-0.603
 to-0.473
 with-0.456
Token L
Token ID406
Feature activation-1.98e-01

Pos logit contributions
Ta+0.113
 Majesty+0.109
 former+0.103
agna+0.102
 majesty+0.102
Neg logit contributions
.-0.121
 and-0.118
,-0.116
 to-0.095
 with-0.095
Tokenizz
Token ID6457
Feature activation-7.28e-01

Pos logit contributions
Ta+0.329
 Majesty+0.269
icious+0.266
Uh+0.261
bek+0.260
Neg logit contributions
.-0.855
,-0.850
 and-0.829
 to-0.748
 with-0.738
Tokenie
Token ID494
Feature activation-2.05e-01

Pos logit contributions
Ta+1.704
 Majesty+1.691
 former+1.591
bek+1.574
wy+1.569
Neg logit contributions
,-2.167
.-2.118
 and-2.024
 to-1.808
 with-1.805
Token.
Token ID13
Feature activation+6.14e-01

Pos logit contributions
 Majesty+1.052
 former+1.007
 majesty+1.000
Ta+0.973
bek+0.960
Neg logit contributions
 and-0.029
,-0.027
.-0.027
 to+0.050
 with+0.055
Token She
Token ID1375
Feature activation-8.80e-02

Pos logit contributions
.+1.630
,+1.569
 and+1.530
 to+1.244
 with+1.213
Neg logit contributions
Ta-2.133
 Majesty-1.968
Uh-1.940
 former-1.932
icious-1.928
Token liked
Token ID8288
Feature activation-2.37e-01

Pos logit contributions
Ta+0.323
 Majesty+0.302
 former+0.299
Uh+0.298
bek+0.292
Neg logit contributions
.-0.206
,-0.195
 and-0.188
 to-0.150
 with-0.145
Token to
Token ID284
Feature activation+3.92e-01

Pos logit contributions
Ta+1.108
 Majesty+1.090
 former+1.077
umm+1.054
 majesty+1.052
Neg logit contributions
.-0.240
,-0.218
 and-0.183
 with-0.071
 in-0.039
Token perform
Token ID1620
Feature activation-4.62e-01

Pos logit contributions
.+1.217
,+1.178
 and+1.153
 to+0.973
 with+0.954
Neg logit contributions
Ta-1.178
 Majesty-1.079
 former-1.058
icious-1.046
Uh-1.045
Token different
Token ID1180
Feature activation-1.03e-01

Pos logit contributions
 Majesty+1.833
 former+1.778
Ta+1.775
 majesty+1.769
icious+1.748
Neg logit contributions
.-0.476
,-0.466
 and-0.442
 to-0.283
 with-0.183
Token little
Token ID1310
Feature activation-9.36e-02

Pos logit contributions
.+1.577
,+1.474
 and+1.433
 to+1.232
 with+1.190
Neg logit contributions
Ta-1.600
 Majesty-1.492
icious-1.483
Uh-1.437
agna-1.436
Token bird
Token ID6512
Feature activation-2.22e-01

Pos logit contributions
Ta+0.254
 Majesty+0.246
Uh+0.238
umm+0.235
 former+0.231
Neg logit contributions
.-0.283
,-0.271
 and-0.266
 to-0.227
 with-0.227
Tokenie
Token ID494
Feature activation-2.10e-01

Pos logit contributions
 Majesty+0.762
Ta+0.753
 majesty+0.722
 former+0.715
Uh+0.701
Neg logit contributions
.-0.426
,-0.408
 and-0.394
 to-0.323
 with-0.302
Token named
Token ID3706
Feature activation-1.55e-01

Pos logit contributions
 Majesty+0.788
Ta+0.778
 majesty+0.753
 former+0.736
bek+0.720
Neg logit contributions
.-0.381
,-0.364
 and-0.345
 to-0.278
 with-0.267
Token Tweet
Token ID18752
Feature activation-3.39e-01

Pos logit contributions
Ta+0.441
 Majesty+0.414
 former+0.401
agna+0.390
icious+0.389
Neg logit contributions
.-0.470
,-0.461
 and-0.450
 with-0.375
 to-0.374
Token.
Token ID13
Feature activation+6.11e-01

Pos logit contributions
Ta+0.775
 Majesty+0.770
 former+0.763
icious+0.724
wy+0.722
Neg logit contributions
,-1.022
.-0.993
 and-0.966
 to-0.880
 with-0.868
Token Tweet
Token ID18752
Feature activation-3.55e-01

Pos logit contributions
.+1.995
,+1.928
 and+1.892
 to+1.615
 with+1.578
Neg logit contributions
Ta-1.743
 Majesty-1.594
 former-1.555
Uh-1.554
icious-1.549
Token was
Token ID373
Feature activation+1.36e-01

Pos logit contributions
Ta+0.947
 Majesty+0.876
 former+0.860
Uh+0.853
icious+0.845
Neg logit contributions
.-1.057
,-1.010
 and-0.968
 to-0.904
 with-0.844
Token very
Token ID845
Feature activation+1.56e-01

Pos logit contributions
.+0.303
,+0.278
 and+0.271
 to+0.210
 with+0.200
Neg logit contributions
Ta-0.511
 Majesty-0.486
 former-0.477
Uh-0.468
icious-0.466
Token happy
Token ID3772
Feature activation-1.10e-01

Pos logit contributions
.+0.454
,+0.439
 and+0.428
 to+0.354
 with+0.347
Neg logit contributions
Ta-0.515
 Majesty-0.470
Uh-0.464
icious-0.462
umm-0.461
Token and
Token ID290
Feature activation+5.35e-01

Pos logit contributions
 Majesty+0.417
 former+0.401
umm+0.396
 majesty+0.395
 habitats+0.389
Neg logit contributions
.-0.078
,-0.066
 and-0.043
 the-0.025
 with-0.013
Token friendly
Token ID8030
Feature activation-2.92e-01

Pos logit contributions
agna+1.353
 former+1.323
 Majesty+1.322
bek+1.317
uries+1.311
Neg logit contributions
.-0.215
,-0.184
 and-0.144
 with-0.040
 the-0.034
Token rabbit
Token ID22746
Feature activation-3.86e-01

Pos logit contributions
 Majesty+1.246
Ta+1.206
 majesty+1.195
 former+1.183
umm+1.165
Neg logit contributions
.-0.350
,-0.329
 and-0.302
 with-0.201
 to-0.182
Token named
Token ID3706
Feature activation-1.59e-01

Pos logit contributions
 Majesty+1.793
Ta+1.754
 majesty+1.736
 former+1.723
bek+1.683
Neg logit contributions
.-0.223
,-0.206
 and-0.179
 with-0.039
 to-0.034
Token Ros
Token ID10018
Feature activation-3.80e-01

Pos logit contributions
Ta+0.453
 Majesty+0.424
 former+0.411
agna+0.403
 majesty+0.401
Neg logit contributions
.-0.462
,-0.453
 and-0.442
 with-0.366
 to-0.361
Tokenie
Token ID494
Feature activation-2.02e-01

Pos logit contributions
 Majesty+0.929
Ta+0.924
 former+0.916
icious+0.881
 majesty+0.851
Neg logit contributions
,-1.137
.-1.116
 and-1.089
 to-0.990
 with-0.969
Token.
Token ID13
Feature activation+6.11e-01

Pos logit contributions
 Majesty+1.028
 majesty+0.978
 former+0.959
Ta+0.924
agna+0.910
Neg logit contributions
 and-0.048
.-0.034
,-0.031
 to+0.013
 with+0.028
Token
Token ID198
Feature activation+2.04e-01

Pos logit contributions
.+1.781
,+1.720
 and+1.683
 to+1.403
 with+1.367
Neg logit contributions
Ta-1.955
 Majesty-1.798
 former-1.765
Uh-1.762
umm-1.752
Token
Token ID198
Feature activation+1.97e-01

Pos logit contributions
.+0.302
,+0.286
 and+0.269
 to+0.174
 with+0.165
Neg logit contributions
Ta-0.948
 Majesty-0.883
Uh-0.882
 former-0.876
icious-0.872
TokenRos
Token ID35740
Feature activation-6.93e-01

Pos logit contributions
.+0.616
,+0.594
 and+0.585
 to+0.488
 with+0.486
Neg logit contributions
Ta-0.595
 Majesty-0.552
umm-0.551
agna-0.544
 former-0.534
Tokenie
Token ID494
Feature activation-1.91e-01

Pos logit contributions
Ta+1.948
 Majesty+1.785
 former+1.737
icious+1.711
bek+1.696
Neg logit contributions
.-2.188
,-2.106
 and-2.037
 to-1.802
 with-1.753
Token warned
Token ID7728
Feature activation-3.53e-01

Pos logit contributions
Ta+0.666
 Majesty+0.653
 former+0.627
 majesty+0.612
Uh+0.604
Neg logit contributions
.-0.441
,-0.399
 and-0.383
 to-0.323
 with-0.307
TokenBen
Token ID11696
Feature activation-5.64e-01

Pos logit contributions
Ta+0.031
 Majesty+0.026
 former+0.026
Uh+0.026
 majesty+0.025
Neg logit contributions
.-0.041
,-0.040
 and-0.039
 to-0.034
 with-0.034
Token tries
Token ID8404
Feature activation-4.54e-01

Pos logit contributions
Ta+2.422
 Majesty+2.389
 majesty+2.314
Uh+2.264
icious+2.254
Neg logit contributions
.-0.581
,-0.545
 and-0.462
 to-0.341
 with-0.305
Token to
Token ID284
Feature activation+3.93e-01

Pos logit contributions
 sidelines+1.825
 Majesty+1.812
Ta+1.806
 former+1.805
 seventeen+1.765
Neg logit contributions
.-0.320
,-0.305
 and-0.275
 in-0.068
 with-0.065
Token blow
Token ID6611
Feature activation-3.84e-01

Pos logit contributions
.+1.150
,+1.114
 and+1.091
 to+0.904
 with+0.880
Neg logit contributions
Ta-1.250
 Majesty-1.143
Uh-1.123
 former-1.118
icious-1.115
Token bubbles
Token ID25037
Feature activation-3.81e-01

Pos logit contributions
Ta+1.462
heit+1.387
 Deal+1.372
Whe+1.365
 seventeen+1.365
Neg logit contributions
,-0.488
.-0.471
 and-0.436
 to-0.348
 with-0.273
Token.
Token ID13
Feature activation+6.08e-01

Pos logit contributions
icious+1.524
mic+1.508
Ta+1.506
wy+1.504
 sidelines+1.488
Neg logit contributions
,-0.184
.-0.182
 and-0.137
 to-0.064
 the+0.003
Token At
Token ID1629
Feature activation-3.93e-01

Pos logit contributions
.+1.580
,+1.520
 and+1.480
 to+1.194
 with+1.165
Neg logit contributions
Ta-2.169
 Majesty-1.998
Uh-1.975
icious-1.964
 former-1.962
Token first
Token ID717
Feature activation-7.66e-02

Pos logit contributions
Ta+1.544
 sidelines+1.439
Uh+1.432
 Majesty+1.396
 former+1.387
Neg logit contributions
.-0.773
,-0.713
 and-0.709
 with-0.517
 to-0.514
Token,
Token ID11
Feature activation+5.38e-01

Pos logit contributions
 Majesty+0.359
 majesty+0.354
Uh+0.349
heit+0.346
Ta+0.345
Neg logit contributions
.-0.038
 and-0.031
,-0.005
 to+0.004
 with+0.006
Token he
Token ID339
Feature activation+3.18e-02

Pos logit contributions
.+1.162
,+1.089
 and+1.066
 to+0.811
 with+0.773
Neg logit contributions
Ta-2.048
 Majesty-1.937
Uh-1.916
agna-1.912
bek-1.874
Token blows
Token ID20385
Feature activation-5.80e-01

Pos logit contributions
.+0.106
,+0.103
 and+0.101
 to+0.086
 with+0.085
Neg logit contributions
Ta-0.088
 Majesty-0.080
Uh-0.078
 former-0.077
icious-0.077
Token his
Token ID465
Feature activation+1.01e-01

Pos logit contributions
Ta+1.366
Uh+1.275
heit+1.235
umm+1.215
bek+1.212
Neg logit contributions
.-0.686
,-0.622
 and-0.571
 to-0.463
 with-0.415
Token tears
Token ID10953
Feature activation-3.92e-01

Pos logit contributions
.+0.343
,+0.332
 and+0.327
 to+0.279
 with+0.274
Neg logit contributions
Ta-0.282
 Majesty-0.253
icious-0.251
Uh-0.250
umm-0.249
Token and
Token ID290
Feature activation+5.49e-01

Pos logit contributions
Ta+1.665
ited+1.621
Uh+1.609
mic+1.601
 former+1.594
Neg logit contributions
.-0.194
,-0.145
 and-0.063
 to-0.051
 the+0.043
Token his
Token ID465
Feature activation+9.96e-02

Pos logit contributions
.+1.520
,+1.456
 and+1.418
 to+1.166
 with+1.135
Neg logit contributions
Ta-1.869
 Majesty-1.699
Uh-1.688
 former-1.665
icious-1.660
Token nose
Token ID9686
Feature activation-3.61e-01

Pos logit contributions
.+0.331
,+0.320
 and+0.314
 to+0.267
 with+0.262
Neg logit contributions
Ta-0.287
 Majesty-0.258
Uh-0.255
icious-0.254
umm-0.253
Token.
Token ID13
Feature activation+6.07e-01

Pos logit contributions
Ta+1.356
icious+1.270
mic+1.268
 former+1.261
Uh+1.259
Neg logit contributions
,-0.482
.-0.478
 and-0.438
 to-0.308
 with-0.288
Token She
Token ID1375
Feature activation-7.34e-02

Pos logit contributions
.+1.558
,+1.488
 and+1.440
 to+1.160
 with+1.149
Neg logit contributions
Ta-2.133
 Majesty-1.988
Uh-1.962
 former-1.962
umm-1.932
Token smiled
Token ID13541
Feature activation-2.72e-01

Pos logit contributions
Ta+0.245
 Majesty+0.224
Uh+0.221
 former+0.219
icious+0.219
Neg logit contributions
.-0.208
,-0.200
 and-0.195
 to-0.161
 with-0.157
Token and
Token ID290
Feature activation+5.45e-01

Pos logit contributions
 former+1.116
 Majesty+1.110
umm+1.081
Ta+1.070
 Deal+1.068
Neg logit contributions
.-0.125
,-0.112
 and-0.018
 to-0.003
 the+0.001
Token kissed
Token ID27946
Feature activation-4.33e-01

Pos logit contributions
.+1.619
,+1.565
 and+1.531
 to+1.273
 with+1.247
Neg logit contributions
Ta-1.748
 Majesty-1.596
Uh-1.574
icious-1.568
 former-1.565
Token him
Token ID683
Feature activation+3.55e-02

Pos logit contributions
Ta+1.351
agna+1.256
Uh+1.231
bek+1.221
 Majesty+1.212
Neg logit contributions
.-0.893
,-0.877
 and-0.835
 with-0.660
 to-0.652
Token Lily
Token ID20037
Feature activation-4.19e-02

Pos logit contributions
Ta+0.486
 Majesty+0.452
 former+0.439
Uh+0.434
bek+0.430
Neg logit contributions
.-0.434
,-0.423
 and-0.415
 to-0.337
 with-0.336
Token who
Token ID508
Feature activation+1.78e-03

Pos logit contributions
 Majesty+0.185
Ta+0.177
 majesty+0.175
 former+0.175
bek+0.174
Neg logit contributions
,-0.016
 and-0.008
.-0.008
 to-0.004
 with+0.002
Token loved
Token ID6151
Feature activation-1.76e-01

Pos logit contributions
.+0.005
,+0.004
 and+0.004
 to+0.004
 with+0.003
Neg logit contributions
Ta-0.006
 Majesty-0.005
 former-0.005
Uh-0.005
bek-0.005
Token fair
Token ID3148
Feature activation-2.07e-01

Pos logit contributions
Ta+0.787
 Majesty+0.760
 former+0.750
umm+0.741
agna+0.732
Neg logit contributions
.-0.249
,-0.237
 and-0.216
 with-0.132
 in-0.107
Tokenies
Token ID444
Feature activation-2.77e-01

Pos logit contributions
 former+0.811
 Majesty+0.805
umm+0.803
 majesty+0.787
Ta+0.763
Neg logit contributions
.-0.204
,-0.203
 and-0.162
 the-0.118
 to-0.112
Token.
Token ID13
Feature activation+6.07e-01

Pos logit contributions
 former+1.081
 highlight+1.063
 lifetime+1.056
 Majesty+1.023
sie+1.023
Neg logit contributions
,-0.056
.-0.037
 and-0.010
 to+0.013
 the+0.030
Token She
Token ID1375
Feature activation-9.32e-02

Pos logit contributions
.+1.562
,+1.500
 and+1.462
 to+1.175
 with+1.147
Neg logit contributions
Ta-2.178
 Majesty-2.005
Uh-1.984
icious-1.973
 former-1.969
Token would
Token ID561
Feature activation-2.36e-01

Pos logit contributions
Ta+0.281
 Majesty+0.263
 former+0.255
 majesty+0.250
Uh+0.247
Neg logit contributions
.-0.267
,-0.252
 and-0.244
 to-0.208
 with-0.204
Token go
Token ID467
Feature activation-5.45e-02

Pos logit contributions
Ta+0.743
 Majesty+0.689
 former+0.677
icious+0.663
Uh+0.663
Neg logit contributions
.-0.689
,-0.663
 and-0.645
 to-0.535
 with-0.525
Token to
Token ID284
Feature activation+3.98e-01

Pos logit contributions
Ta+0.242
 former+0.230
 Majesty+0.229
 sidelines+0.225
 Indians+0.222
Neg logit contributions
.-0.066
,-0.054
 and-0.046
 with-0.018
 the-0.014
Token the
Token ID262
Feature activation+2.90e-01

Pos logit contributions
.+0.588
,+0.544
 and+0.515
 with+0.313
 to+0.311
Neg logit contributions
Ta-1.806
 Majesty-1.727
 former-1.722
 sidelines-1.659
 majesty-1.646
Tokensorry
Token ID41599
Feature activation-5.67e-01

Pos logit contributions
.+0.671
,+0.650
 and+0.641
 to+0.578
 with+0.567
Neg logit contributions
Ta-0.339
 Majesty-0.308
 former-0.297
umm-0.295
Uh-0.293
Token,"
Token ID553
Feature activation+1.47e-01

Pos logit contributions
icious+2.212
Ta+2.200
 Majesty+2.191
umm+2.167
Uh+2.124
Neg logit contributions
.-0.765
 and-0.750
,-0.712
 to-0.554
 with-0.442
Token Ben
Token ID3932
Feature activation-1.31e-01

Pos logit contributions
.+0.374
,+0.357
 and+0.349
 to+0.285
 with+0.271
Neg logit contributions
Ta-0.509
 Majesty-0.465
Uh-0.464
bek-0.458
 former-0.453
Token sob
Token ID27355
Feature activation-4.46e-01

Pos logit contributions
Ta+0.417
 Majesty+0.396
 majesty+0.383
Uh+0.374
 former+0.374
Neg logit contributions
.-0.314
,-0.296
 and-0.282
 to-0.244
 with-0.232
Tokenbed
Token ID3077
Feature activation-3.46e-01

Pos logit contributions
 Majesty+0.926
Ta+0.885
 former+0.880
icious+0.826
umm+0.821
Neg logit contributions
.-1.612
 and-1.571
,-1.562
 to-1.414
 the-1.320
Token.
Token ID13
Feature activation+6.06e-01

Pos logit contributions
 Majesty+1.492
 former+1.460
 Deal+1.448
 majesty+1.441
icious+1.406
Neg logit contributions
.-0.095
,-0.084
 and-0.019
 to+0.027
 that+0.089
Token
Token ID198
Feature activation+2.40e-01

Pos logit contributions
.+1.273
,+1.221
 and+1.175
 to+0.898
 with+0.859
Neg logit contributions
Ta-2.384
 Majesty-2.255
Uh-2.225
 former-2.218
umm-2.181
Token
Token ID198
Feature activation+2.37e-01

Pos logit contributions
.+0.357
,+0.342
 and+0.321
 to+0.213
 with+0.194
Neg logit contributions
Ta-1.099
Uh-1.027
 Majesty-1.018
 former-1.013
icious-1.010
TokenMom
Token ID29252
Feature activation-3.89e-01

Pos logit contributions
.+0.724
,+0.697
 and+0.687
 with+0.572
 to+0.567
Neg logit contributions
Ta-0.736
 Majesty-0.684
umm-0.684
agna-0.665
 former-0.663
Tokenmy
Token ID1820
Feature activation-1.58e-01

Pos logit contributions
Ta+1.240
Uh+1.139
icious+1.077
bek+1.064
 Majesty+1.046
Neg logit contributions
.-1.023
,-0.956
 and-0.882
 with-0.791
 to-0.783
Token wiped
Token ID21122
Feature activation-3.76e-01

Pos logit contributions
Ta+0.520
Uh+0.483
 Majesty+0.466
icious+0.460
agna+0.458
Neg logit contributions
.-0.392
,-0.356
 and-0.330
 to-0.295
 with-0.290
Token young
Token ID1862
Feature activation-3.37e-01

Pos logit contributions
 Majesty+2.046
 former+2.031
Ta+2.013
bek+2.007
icious+1.975
Neg logit contributions
.-0.576
,-0.543
 and-0.516
 to-0.356
 with-0.299
Token girl
Token ID2576
Feature activation-3.56e-01

Pos logit contributions
 Majesty+1.027
Ta+1.011
 former+0.976
umm+0.972
bek+0.957
Neg logit contributions
.-0.839
,-0.811
 and-0.781
 to-0.646
 with-0.633
Token named
Token ID3706
Feature activation-1.57e-01

Pos logit contributions
 Majesty+1.416
Ta+1.393
 majesty+1.347
 former+1.338
bek+1.335
Neg logit contributions
.-0.494
,-0.459
 and-0.453
 to-0.321
 with-0.305
Token Ell
Token ID7122
Feature activation-3.96e-01

Pos logit contributions
Ta+0.459
 Majesty+0.428
 former+0.416
agna+0.406
bek+0.405
Neg logit contributions
.-0.462
,-0.451
 and-0.442
 to-0.363
 with-0.362
Tokena
Token ID64
Feature activation-4.11e-01

Pos logit contributions
Ta+0.937
 Majesty+0.860
bek+0.856
 former+0.834
Uh+0.820
Neg logit contributions
.-1.369
,-1.363
 and-1.315
 to-1.167
 with-1.118
Token.
Token ID13
Feature activation+6.06e-01

Pos logit contributions
 Majesty+2.031
 majesty+1.956
Ta+1.926
 former+1.899
icious+1.862
Neg logit contributions
,-0.100
 and-0.099
.-0.090
 to-0.010
 with+0.068
Token Ell
Token ID7122
Feature activation-4.10e-01

Pos logit contributions
.+1.593
,+1.534
 and+1.491
 to+1.205
 with+1.182
Neg logit contributions
Ta-2.138
 Majesty-1.960
Uh-1.942
icious-1.934
 former-1.928
Tokena
Token ID64
Feature activation-4.00e-01

Pos logit contributions
Ta+1.034
 Majesty+0.918
bek+0.903
 former+0.902
Uh+0.897
Neg logit contributions
.-1.450
,-1.420
 and-1.378
 to-1.213
 with-1.168
Token was
Token ID373
Feature activation+1.18e-01

Pos logit contributions
Ta+1.394
 Majesty+1.329
 former+1.296
icious+1.274
 majesty+1.267
Neg logit contributions
.-0.863
,-0.793
 and-0.746
 to-0.663
 with-0.583
Token very
Token ID845
Feature activation+1.45e-01

Pos logit contributions
.+0.247
,+0.230
 and+0.224
 to+0.167
 with+0.158
Neg logit contributions
Ta-0.467
 Majesty-0.444
 former-0.438
Uh-0.430
icious-0.429
Token excited
Token ID6568
Feature activation-1.12e-01

Pos logit contributions
.+0.430
,+0.417
 and+0.404
 to+0.337
 with+0.333
Neg logit contributions
Ta-0.472
 Majesty-0.427
Uh-0.425
umm-0.424
icious-0.422
Token tum
Token ID11814
Feature activation-4.21e-01

Pos logit contributions
Ta+1.391
 Majesty+1.349
 former+1.321
 majesty+1.279
agna+1.269
Neg logit contributions
.-1.185
,-1.152
 and-1.120
 to-0.917
 with-0.910
Tokenm
Token ID76
Feature activation-2.66e-01

Pos logit contributions
Ta+1.181
icious+1.067
Uh+1.067
wy+1.058
bek+1.044
Neg logit contributions
.-1.154
,-1.122
 and-1.075
 to-0.942
 with-0.901
Tokenies
Token ID444
Feature activation-2.78e-01

Pos logit contributions
Ta+0.559
icious+0.559
 former+0.538
Uh+0.528
 Majesty+0.523
Neg logit contributions
.-0.908
,-0.898
 and-0.866
 to-0.794
 with-0.742
Token rum
Token ID7440
Feature activation-4.38e-01

Pos logit contributions
Ta+1.064
agna+1.051
icious+1.050
 Majesty+1.049
 former+1.040
Neg logit contributions
.-0.329
,-0.320
 and-0.296
 to-0.218
 with-0.189
Tokenble
Token ID903
Feature activation-4.63e-01

Pos logit contributions
icious+0.913
Ta+0.901
 Majesty+0.886
 former+0.885
Uh+0.863
Neg logit contributions
.-1.525
,-1.510
 and-1.493
 to-1.352
 with-1.260
Token.
Token ID13
Feature activation+6.06e-01

Pos logit contributions
 former+2.048
icious+2.018
Ta+1.985
umm+1.976
 Majesty+1.973
Neg logit contributions
.-0.262
,-0.245
 and-0.193
 to-0.048
 the-0.013
Token They
Token ID1119
Feature activation-1.06e-01

Pos logit contributions
.+1.538
,+1.478
 and+1.439
 to+1.154
 with+1.126
Neg logit contributions
Ta-2.188
 Majesty-2.016
Uh-1.993
icious-1.984
 former-1.981
Token look
Token ID804
Feature activation-9.98e-02

Pos logit contributions
Ta+0.338
 Majesty+0.310
Uh+0.307
 former+0.306
icious+0.303
Neg logit contributions
.-0.303
,-0.293
 and-0.284
 to-0.239
 with-0.234
Token for
Token ID329
Feature activation+2.82e-01

Pos logit contributions
Ta+0.371
 Majesty+0.354
 former+0.354
Uh+0.351
icious+0.349
Neg logit contributions
.-0.176
,-0.157
 and-0.150
 to-0.118
 with-0.107
Token something
Token ID1223
Feature activation-4.03e-02

Pos logit contributions
.+0.597
,+0.569
 and+0.560
 with+0.417
 to+0.413
Neg logit contributions
Ta-1.089
 Majesty-1.022
 former-1.010
agna-0.994
Uh-0.992
Token to
Token ID284
Feature activation+3.93e-01

Pos logit contributions
Ta+0.142
agna+0.136
 former+0.135
 pulp+0.134
Uh+0.134
Neg logit contributions
.-0.068
,-0.066
 and-0.064
 with-0.045
 the-0.039
Token beautiful
Token ID4950
Feature activation-3.32e-01

Pos logit contributions
.+0.545
,+0.528
 and+0.516
 to+0.428
 with+0.419
Neg logit contributions
Ta-0.596
 Majesty-0.540
icious-0.534
umm-0.532
Uh-0.532
Token young
Token ID1862
Feature activation-3.59e-01

Pos logit contributions
Ta+0.843
 Majesty+0.827
 former+0.793
 majesty+0.773
umm+0.769
Neg logit contributions
.-1.050
,-1.010
 and-0.994
 to-0.864
 with-0.842
Token girl
Token ID2576
Feature activation-3.69e-01

Pos logit contributions
Ta+0.857
 Majesty+0.832
umm+0.791
Uh+0.787
 former+0.778
Neg logit contributions
.-1.191
,-1.155
 and-1.129
 to-0.987
 with-0.961
Token named
Token ID3706
Feature activation-1.77e-01

Pos logit contributions
Ta+1.446
 Majesty+1.434
bek+1.380
 majesty+1.378
 former+1.360
Neg logit contributions
.-0.478
,-0.467
 and-0.454
 to-0.318
 with-0.292
Token Elsa
Token ID19226
Feature activation-5.76e-01

Pos logit contributions
Ta+0.515
 Majesty+0.475
 former+0.457
bek+0.457
Uh+0.453
Neg logit contributions
.-0.542
,-0.526
 and-0.520
 to-0.429
 with-0.424
Token.
Token ID13
Feature activation+6.05e-01

Pos logit contributions
 Majesty+2.690
 majesty+2.595
 former+2.553
agna+2.549
Ta+2.535
Neg logit contributions
,-0.155
.-0.135
 and-0.113
 with-0.001
 to+0.015
Token She
Token ID1375
Feature activation-9.87e-02

Pos logit contributions
.+1.681
,+1.623
 and+1.582
 to+1.300
 with+1.266
Neg logit contributions
Ta-2.016
 Majesty-1.865
 former-1.830
Uh-1.818
umm-1.813
Token was
Token ID373
Feature activation+1.17e-01

Pos logit contributions
Ta+0.357
 Majesty+0.333
 former+0.328
Uh+0.325
bek+0.322
Neg logit contributions
.-0.233
,-0.219
 and-0.209
 to-0.171
 with-0.163
Token very
Token ID845
Feature activation+1.43e-01

Pos logit contributions
.+0.258
,+0.244
 and+0.236
 to+0.179
 with+0.173
Neg logit contributions
Ta-0.457
 Majesty-0.428
 former-0.421
Uh-0.419
icious-0.418
Token graceful
Token ID44363
Feature activation-3.89e-01

Pos logit contributions
.+0.437
,+0.423
 and+0.414
 to+0.346
 with+0.340
Neg logit contributions
Ta-0.446
 Majesty-0.405
Uh-0.400
icious-0.398
 former-0.397
Token and
Token ID290
Feature activation+5.17e-01

Pos logit contributions
umm+1.616
 Majesty+1.582
 former+1.564
arians+1.521
 majesty+1.500
Neg logit contributions
.-0.164
,-0.103
 and-0.029
 the-0.022
 to+0.003
TokenS
Token ID50
Feature activation-3.85e-01

Pos logit contributions
.+0.327
,+0.310
 and+0.303
 to+0.235
 with+0.235
Neg logit contributions
Ta-0.529
umm-0.498
 Majesty-0.497
agna-0.491
icious-0.487
Tokenara
Token ID3301
Feature activation-5.02e-01

Pos logit contributions
Ta+0.767
 Majesty+0.658
Uh+0.642
icious+0.638
 former+0.635
Neg logit contributions
.-1.613
,-1.573
 and-1.548
 to-1.369
 with-1.350
Token and
Token ID290
Feature activation+5.29e-01

Pos logit contributions
Ta+2.306
 Majesty+2.213
 majesty+2.148
Uh+2.134
 former+2.133
Neg logit contributions
.-0.615
,-0.536
 and-0.443
 to-0.339
 with-0.288
Token Ben
Token ID3932
Feature activation-1.50e-01

Pos logit contributions
.+1.357
,+1.294
 and+1.266
 to+1.021
 with+1.004
Neg logit contributions
Ta-1.883
 Majesty-1.751
Uh-1.708
 former-1.692
icious-1.679
Token sighed
Token ID21893
Feature activation-5.64e-01

Pos logit contributions
Ta+0.464
 Majesty+0.427
Uh+0.412
agna+0.410
icious+0.409
Neg logit contributions
.-0.411
,-0.396
 and-0.385
 to-0.338
 with-0.329
Token.
Token ID13
Feature activation+6.05e-01

Pos logit contributions
 former+2.485
icious+2.324
 gag+2.322
 Majesty+2.298
ksh+2.289
Neg logit contributions
,-0.107
.-0.060
 and-0.007
 to+0.033
 the+0.130
Token They
Token ID1119
Feature activation-1.18e-01

Pos logit contributions
.+1.487
,+1.428
 and+1.387
 to+1.108
 with+1.077
Neg logit contributions
Ta-2.230
 Majesty-2.071
Uh-2.048
 former-2.032
icious-2.027
Token looked
Token ID3114
Feature activation-1.91e-01

Pos logit contributions
Ta+0.391
 Majesty+0.356
Uh+0.354
 former+0.351
icious+0.351
Neg logit contributions
.-0.331
,-0.320
 and-0.312
 to-0.259
 with-0.253
Token at
Token ID379
Feature activation+2.65e-01

Pos logit contributions
Ta+0.748
 Majesty+0.721
Uh+0.713
 former+0.712
icious+0.694
Neg logit contributions
.-0.283
,-0.260
 and-0.229
 to-0.164
 with-0.161
Token the
Token ID262
Feature activation+2.91e-01

Pos logit contributions
.+0.485
,+0.470
 and+0.442
 to+0.327
 with+0.321
Neg logit contributions
Ta-1.104
 Majesty-1.033
 former-1.024
Uh-1.016
umm-1.002
Token dessert
Token ID23084
Feature activation-4.42e-01

Pos logit contributions
.+0.977
,+0.954
 and+0.930
 to+0.796
 with+0.782
Neg logit contributions
Ta-0.801
 Majesty-0.720
Uh-0.710
 former-0.705
icious-0.701
Token
Token ID198
Feature activation+1.78e-01

Pos logit contributions
.+1.298
,+1.247
 and+1.214
 to+0.977
 with+0.949
Neg logit contributions
Ta-1.831
 Majesty-1.695
Uh-1.675
 former-1.664
icious-1.657
Token
Token ID198
Feature activation+1.79e-01

Pos logit contributions
.+0.287
,+0.272
 and+0.258
 to+0.177
 with+0.167
Neg logit contributions
Ta-0.802
 Majesty-0.749
Uh-0.745
icious-0.740
 former-0.740
TokenBen
Token ID11696
Feature activation-5.33e-01

Pos logit contributions
.+0.533
,+0.508
 and+0.504
 with+0.420
 to+0.408
Neg logit contributions
Ta-0.563
umm-0.547
 Majesty-0.536
agna-0.526
 former-0.521
Token stayed
Token ID9658
Feature activation-4.47e-01

Pos logit contributions
Ta+2.431
 Majesty+2.304
Uh+2.261
icious+2.242
agna+2.228
Neg logit contributions
.-0.724
,-0.624
 and-0.545
 to-0.403
 with-0.363
Token outside
Token ID2354
Feature activation+1.32e-01

Pos logit contributions
 former+1.777
 Majesty+1.739
Ta+1.710
 sidelines+1.684
 Deal+1.652
Neg logit contributions
.-0.603
,-0.547
 and-0.466
 to-0.324
 that-0.203
Token.
Token ID13
Feature activation+6.05e-01

Pos logit contributions
.+0.050
,+0.030
 and+0.017
 to-0.015
 that-0.029
Neg logit contributions
 former-0.577
 Majesty-0.559
 sidelines-0.555
umm-0.547
 majesty-0.524
Token He
Token ID679
Feature activation-1.46e-01

Pos logit contributions
.+1.548
,+1.486
 and+1.449
 to+1.163
 with+1.134
Neg logit contributions
Ta-2.198
 Majesty-2.019
Uh-1.999
icious-1.994
 former-1.984
Token saw
Token ID2497
Feature activation-1.57e-01

Pos logit contributions
Ta+0.472
 Majesty+0.434
 former+0.424
Uh+0.423
icious+0.418
Neg logit contributions
.-0.419
,-0.404
 and-0.390
 to-0.327
 with-0.322
Token a
Token ID257
Feature activation+2.39e-01

Pos logit contributions
Ta+0.681
 Majesty+0.640
 former+0.623
Uh+0.622
 sidelines+0.616
Neg logit contributions
.-0.259
,-0.246
 and-0.229
 to-0.154
 with-0.152
Token dog
Token ID3290
Feature activation-1.95e-01

Pos logit contributions
.+0.606
,+0.588
 and+0.571
 to+0.455
 with+0.444
Neg logit contributions
Ta-0.876
 Majesty-0.804
icious-0.794
 former-0.793
umm-0.792
Token running
Token ID2491
Feature activation-1.21e-01

Pos logit contributions
 sidelines+0.756
Ta+0.739
icious+0.727
Uh+0.722
assing+0.713
Neg logit contributions
.-0.142
,-0.141
 and-0.121
 to-0.097
 the-0.047
Token who
Token ID508
Feature activation+7.30e-03

Pos logit contributions
 Majesty+0.825
 former+0.818
arians+0.806
agna+0.794
emic+0.791
Neg logit contributions
,-0.282
 and-0.250
.-0.245
 to-0.215
 the-0.170
Token liked
Token ID8288
Feature activation-2.29e-01

Pos logit contributions
.+0.021
,+0.021
 and+0.020
 to+0.017
 with+0.016
Neg logit contributions
Ta-0.022
 Majesty-0.020
 former-0.020
Uh-0.020
agna-0.020
Token to
Token ID284
Feature activation+4.02e-01

Pos logit contributions
Ta+1.040
 former+1.012
 Majesty+1.007
agna+0.993
 majesty+0.980
Neg logit contributions
,-0.228
.-0.227
 and-0.187
 with-0.077
 in-0.041
Token play
Token ID711
Feature activation-1.17e-01

Pos logit contributions
.+1.092
,+1.054
 and+1.030
 to+0.843
 with+0.827
Neg logit contributions
Ta-1.358
 Majesty-1.254
 former-1.238
icious-1.221
Uh-1.220
Token games
Token ID1830
Feature activation-3.83e-01

Pos logit contributions
Ta+0.423
agna+0.411
 former+0.410
 sidelines+0.403
icious+0.402
Neg logit contributions
,-0.160
.-0.157
 and-0.140
 to-0.102
!-0.071
Token.
Token ID13
Feature activation+6.05e-01

Pos logit contributions
uries+1.423
 former+1.419
arians+1.396
icious+1.391
inations+1.389
Neg logit contributions
,-0.132
.-0.103
 and-0.056
 to-0.050
 the-0.025
Token They
Token ID1119
Feature activation-9.94e-02

Pos logit contributions
.+1.534
,+1.470
 and+1.431
 to+1.142
 with+1.116
Neg logit contributions
Ta-2.213
 Majesty-2.040
Uh-2.014
icious-2.009
umm-2.001
Token were
Token ID547
Feature activation-8.33e-02

Pos logit contributions
Ta+0.333
 Majesty+0.301
Uh+0.300
 former+0.297
icious+0.296
Neg logit contributions
.-0.274
,-0.265
 and-0.257
 to-0.214
 with-0.209
Token very
Token ID845
Feature activation+1.55e-01

Pos logit contributions
Ta+0.297
 Majesty+0.277
icious+0.270
Uh+0.270
 former+0.270
Neg logit contributions
.-0.209
,-0.200
 and-0.193
 to-0.153
 with-0.148
Token competitive
Token ID7606
Feature activation-4.15e-01

Pos logit contributions
.+0.446
,+0.430
 and+0.420
 to+0.347
 with+0.340
Neg logit contributions
Ta-0.513
 Majesty-0.469
Uh-0.463
icious-0.460
 former-0.460
Token and
Token ID290
Feature activation+5.34e-01

Pos logit contributions
arians+1.617
core+1.604
umm+1.603
 habitats+1.598
Ta+1.597
Neg logit contributions
.-0.102
,-0.077
 to-0.012
 and+0.002
 the+0.056
Token cute
Token ID13779
Feature activation-4.10e-01

Pos logit contributions
 former+1.081
Looks+1.040
bek+1.035
 highlight+1.033
 fullest+1.029
Neg logit contributions
,-0.074
 and-0.055
.-0.026
 to-0.010
 the+0.030
Token little
Token ID1310
Feature activation-6.43e-02

Pos logit contributions
 Majesty+1.463
Ta+1.414
 former+1.393
umm+1.393
 majesty+1.389
Neg logit contributions
.-0.784
,-0.760
 and-0.717
 to-0.564
 with-0.563
Token puppy
Token ID26188
Feature activation-4.14e-01

Pos logit contributions
Ta+0.178
 Majesty+0.173
Uh+0.168
umm+0.167
 majesty+0.162
Neg logit contributions
.-0.186
,-0.180
 and-0.177
 to-0.150
 with-0.149
Token named
Token ID3706
Feature activation-1.59e-01

Pos logit contributions
 Majesty+1.787
Ta+1.763
 majesty+1.724
 former+1.690
bek+1.672
Neg logit contributions
.-0.395
,-0.364
 and-0.345
 to-0.199
 with-0.191
Token Max
Token ID5436
Feature activation-1.41e-01

Pos logit contributions
Ta+0.458
 Majesty+0.420
 former+0.410
agna+0.405
icious+0.403
Neg logit contributions
.-0.479
,-0.469
 and-0.458
 to-0.382
 with-0.379
Token.
Token ID13
Feature activation+6.05e-01

Pos logit contributions
 Majesty+0.640
 Indians+0.612
 former+0.607
itionally+0.607
 majesty+0.604
Neg logit contributions
,-0.031
 and-0.018
.-0.008
 with+0.021
 to+0.023
Token Max
Token ID5436
Feature activation-1.65e-01

Pos logit contributions
.+1.572
,+1.510
 and+1.475
 to+1.176
 with+1.159
Neg logit contributions
Ta-2.172
 Majesty-1.990
Uh-1.972
umm-1.960
icious-1.958
Token loved
Token ID6151
Feature activation-1.81e-01

Pos logit contributions
Ta+0.527
 Majesty+0.508
 former+0.488
 majesty+0.480
icious+0.478
Neg logit contributions
.-0.408
,-0.374
 and-0.351
 to-0.316
 with-0.296
Token to
Token ID284
Feature activation+3.95e-01

Pos logit contributions
Ta+0.849
 Majesty+0.820
 former+0.805
 sidelines+0.802
 Indians+0.795
Neg logit contributions
.-0.193
,-0.177
 and-0.146
 with-0.068
 in-0.047
Token play
Token ID711
Feature activation-1.20e-01

Pos logit contributions
.+1.120
,+1.079
 and+1.055
 to+0.877
 with+0.861
Neg logit contributions
Ta-1.278
 Majesty-1.177
 former-1.155
icious-1.139
agna-1.125
Token with
Token ID351
Feature activation+3.82e-01

Pos logit contributions
Ta+0.465
 sidelines+0.453
icious+0.449
Mic+0.444
agna+0.441
Neg logit contributions
.-0.155
,-0.137
 and-0.117
 to-0.088
 the-0.061
Token saw
Token ID2497
Feature activation-1.65e-01

Pos logit contributions
.+0.255
,+0.240
 and+0.232
 to+0.181
 with+0.167
Neg logit contributions
Ta-0.378
 Majesty-0.363
icious-0.344
 former-0.341
Uh-0.339
Token a
Token ID257
Feature activation+2.32e-01

Pos logit contributions
Ta+0.696
 Majesty+0.658
 former+0.646
Uh+0.633
 sidelines+0.630
Neg logit contributions
.-0.237
,-0.237
 and-0.214
 to-0.140
 with-0.136
Token girl
Token ID2576
Feature activation-3.61e-01

Pos logit contributions
.+0.636
,+0.618
 and+0.602
 to+0.489
 with+0.481
Neg logit contributions
Ta-0.802
 Majesty-0.732
icious-0.722
umm-0.721
 former-0.719
Token named
Token ID3706
Feature activation-1.75e-01

Pos logit contributions
Ta+1.199
 Majesty+1.147
 former+1.131
Uh+1.104
 sidelines+1.094
Neg logit contributions
.-0.712
,-0.704
 and-0.666
 to-0.555
 the-0.477
Token Sue
Token ID26089
Feature activation-2.48e-01

Pos logit contributions
Ta+0.546
 Majesty+0.505
 former+0.487
Uh+0.483
 majesty+0.479
Neg logit contributions
.-0.481
,-0.468
 and-0.467
 to-0.377
 with-0.370
Token.
Token ID13
Feature activation+6.05e-01

Pos logit contributions
 Majesty+1.035
Ta+1.028
ited+1.018
bek+1.002
 former+0.997
Neg logit contributions
,-0.081
 and-0.047
.-0.046
 to+0.001
 with+0.015
Token Sue
Token ID26089
Feature activation-2.57e-01

Pos logit contributions
.+1.766
,+1.677
 and+1.664
 to+1.395
 with+1.365
Neg logit contributions
Ta-1.833
Uh-1.689
 Majesty-1.678
umm-1.641
icious-1.631
Token was
Token ID373
Feature activation+1.26e-01

Pos logit contributions
Ta+0.792
 Majesty+0.738
Uh+0.730
icious+0.711
 majesty+0.707
Neg logit contributions
.-0.575
,-0.533
 and-0.502
 to-0.469
 with-0.425
Token holding
Token ID4769
Feature activation-1.80e-01

Pos logit contributions
.+0.284
,+0.266
 and+0.258
 to+0.198
 with+0.188
Neg logit contributions
Ta-0.480
 Majesty-0.450
 former-0.444
Uh-0.441
icious-0.438
Token a
Token ID257
Feature activation+2.41e-01

Pos logit contributions
Ta+0.781
 Majesty+0.733
 former+0.730
agna+0.716
icious+0.716
Neg logit contributions
.-0.286
,-0.267
 and-0.254
 to-0.175
 with-0.156
Token bag
Token ID6131
Feature activation-3.26e-01

Pos logit contributions
.+0.754
,+0.732
 and+0.716
 to+0.603
 with+0.592
Neg logit contributions
Ta-0.735
 Majesty-0.667
Uh-0.656
icious-0.656
umm-0.653
Token pet
Token ID4273
Feature activation-2.22e-01

Pos logit contributions
Ta+0.679
 Majesty+0.621
 former+0.618
icious+0.612
 sidelines+0.605
Neg logit contributions
.-0.602
,-0.581
 and-0.575
 with-0.453
 to-0.451
Token him
Token ID683
Feature activation+2.76e-02

Pos logit contributions
Ta+0.829
 Majesty+0.780
 sidelines+0.764
Uh+0.756
agna+0.752
Neg logit contributions
.-0.405
,-0.391
 and-0.376
 with-0.269
 to-0.268
Token,"
Token ID553
Feature activation+1.22e-01

Pos logit contributions
.+0.044
,+0.040
 and+0.040
 to+0.030
 with+0.029
Neg logit contributions
 Majesty-0.101
 former-0.099
Ta-0.097
icious-0.096
 majesty-0.096
Token Ben
Token ID3932
Feature activation-1.38e-01

Pos logit contributions
.+0.289
,+0.275
 and+0.268
 to+0.214
 with+0.205
Neg logit contributions
Ta-0.445
 Majesty-0.409
Uh-0.406
 former-0.401
bek-0.399
Token says
Token ID1139
Feature activation-2.34e-01

Pos logit contributions
Ta+0.425
 Majesty+0.406
 majesty+0.387
 former+0.377
Uh+0.377
Neg logit contributions
.-0.332
,-0.314
 and-0.289
 to-0.258
 with-0.251
Token.
Token ID13
Feature activation+6.04e-01

Pos logit contributions
cher+0.997
 Majesty+0.986
mic+0.981
fer+0.966
uer+0.965
Neg logit contributions
 and-0.054
,-0.026
.-0.009
 the+0.071
 that+0.074
Token
Token ID198
Feature activation+2.20e-01

Pos logit contributions
.+1.343
,+1.317
 and+1.226
 to+0.986
 in+0.967
Neg logit contributions
Ta-2.121
Uh-2.027
 Majesty-2.024
 Deal-1.953
 majesty-1.951
Token
Token ID198
Feature activation+2.16e-01

Pos logit contributions
,+0.354
.+0.342
 and+0.320
 to+0.241
 in+0.224
Neg logit contributions
Ta-0.928
 Majesty-0.856
Uh-0.845
 former-0.840
 majesty-0.836
Token"
Token ID1
Feature activation+3.69e-02

Pos logit contributions
.+0.614
,+0.594
 and+0.578
 to+0.478
 with+0.465
Neg logit contributions
Ta-0.724
 Majesty-0.662
Uh-0.655
icious-0.650
 former-0.649
TokenYou
Token ID1639
Feature activation-4.28e-01

Pos logit contributions
.+0.129
 and+0.124
,+0.120
 with+0.109
 to+0.103
Neg logit contributions
 Majesty-0.094
Ta-0.091
 former-0.089
heit-0.088
umm-0.087
Token're
Token ID821
Feature activation-5.09e-01

Pos logit contributions
Ta+1.206
 Majesty+1.085
Uh+1.064
icious+1.060
 former+1.057
Neg logit contributions
.-1.442
,-1.397
 and-1.370
 to-1.171
 with-1.152
Token going
Token ID1016
Feature activation-1.62e-01

Pos logit contributions
Ta+0.863
 Majesty+0.858
 former+0.848
 majesty+0.827
 sidelines+0.826
Neg logit contributions
.-0.273
,-0.267
 and-0.246
 with-0.146
 in-0.115
Token outs
Token ID12198
Feature activation-3.41e-01

Pos logit contributions
 former+0.608
 Majesty+0.607
Ta+0.598
 sidelines+0.595
 lifetime+0.594
Neg logit contributions
.-0.200
,-0.181
 and-0.166
 with-0.094
 the-0.082
Tokenite
Token ID578
Feature activation-6.03e-01

Pos logit contributions
 Majesty+1.641
umm+1.611
 majesty+1.602
Ta+1.587
 former+1.583
Neg logit contributions
.-0.131
 and-0.106
,-0.104
 with+0.025
 to+0.032
Token with
Token ID351
Feature activation+3.83e-01

Pos logit contributions
 Majesty+2.680
 former+2.668
Ta+2.661
vation+2.578
 majesty+2.567
Neg logit contributions
,-0.393
.-0.381
 and-0.333
 with-0.053
 to-0.038
Token Mom
Token ID11254
Feature activation-9.09e-02

Pos logit contributions
.+0.746
,+0.724
 and+0.699
 to+0.511
 with+0.499
Neg logit contributions
Ta-1.522
 Majesty-1.456
 former-1.436
umm-1.395
icious-1.394
Token.
Token ID13
Feature activation+6.04e-01

Pos logit contributions
 Majesty+0.402
icious+0.396
Ta+0.395
agna+0.393
 former+0.391
Neg logit contributions
,-0.041
.-0.035
 and-0.025
 with+0.001
 to+0.005
Token
Token ID220
Feature activation-1.54e-01

Pos logit contributions
.+1.440
,+1.380
 and+1.341
 to+1.057
 with+1.028
Neg logit contributions
Ta-2.275
 Majesty-2.104
Uh-2.081
icious-2.072
 former-2.069
Token
Token ID198
Feature activation+1.93e-01

Pos logit contributions
Ta+0.668
 Majesty+0.626
icious+0.624
 former+0.616
Uh+0.611
Neg logit contributions
,-0.230
.-0.228
 and-0.214
 to-0.142
 with-0.136
Token
Token ID198
Feature activation+1.88e-01

Pos logit contributions
.+0.302
,+0.289
 and+0.272
 to+0.183
 with+0.172
Neg logit contributions
Ta-0.873
 Majesty-0.812
Uh-0.810
icious-0.807
 former-0.805
TokenThey
Token ID2990
Feature activation-2.25e-01

Pos logit contributions
.+0.613
,+0.588
 and+0.585
 with+0.494
 to+0.493
Neg logit contributions
Ta-0.543
 Majesty-0.506
umm-0.499
agna-0.492
 former-0.481
Token began
Token ID2540
Feature activation-3.03e-01

Pos logit contributions
Ta+0.724
 Majesty+0.657
Uh+0.644
 former+0.643
bek+0.641
Neg logit contributions
.-0.638
,-0.611
 and-0.595
 to-0.500
 with-0.492
Token
Token ID198
Feature activation+2.24e-01

Pos logit contributions
.+1.211
,+1.211
 and+1.117
 to+0.886
 in+0.861
Neg logit contributions
Ta-2.128
Uh-2.117
 Deal-1.997
Really-1.951
 Majesty-1.949
Token
Token ID198
Feature activation+2.23e-01

Pos logit contributions
,+0.361
.+0.351
 and+0.343
 to+0.250
 in+0.245
Neg logit contributions
Ta-0.910
Uh-0.851
 Majesty-0.818
 Deal-0.816
Ah-0.814
TokenL
Token ID43
Feature activation-3.32e-01

Pos logit contributions
.+0.680
,+0.660
 and+0.644
 to+0.543
 with+0.525
Neg logit contributions
Ta-0.692
Uh-0.626
 Majesty-0.623
icious-0.614
bek-0.612
Tokenily
Token ID813
Feature activation-3.28e-01

Pos logit contributions
Ta+0.844
Uh+0.727
 Majesty+0.724
icious+0.723
umm+0.708
Neg logit contributions
.-1.169
,-1.153
 and-1.121
 to-0.994
 with-0.966
Token nodded
Token ID14464
Feature activation-4.37e-01

Pos logit contributions
Ta+0.957
Uh+0.881
 Majesty+0.852
agna+0.835
icious+0.833
Neg logit contributions
.-0.962
,-0.909
 and-0.854
 to-0.765
 with-0.741
Token.
Token ID13
Feature activation+6.04e-01

Pos logit contributions
 Deal+1.682
Uh+1.665
Ta+1.662
ivery+1.655
 Amen+1.651
Neg logit contributions
,-0.094
 that-0.019
 and-0.007
.-0.003
 to+0.015
Token She
Token ID1375
Feature activation-8.29e-02

Pos logit contributions
.+1.585
,+1.529
 and+1.478
 to+1.223
 with+1.199
Neg logit contributions
Ta-1.993
Uh-1.859
 Majesty-1.837
 Deal-1.797
umm-1.793
Token had
Token ID550
Feature activation-1.17e-01

Pos logit contributions
Ta+0.295
 Majesty+0.269
Uh+0.269
icious+0.265
 former+0.264
Neg logit contributions
.-0.215
,-0.207
 and-0.199
 to-0.163
 with-0.158
Token a
Token ID257
Feature activation+2.46e-01

Pos logit contributions
Ta+0.489
 former+0.453
 Majesty+0.452
Uh+0.448
icious+0.446
Neg logit contributions
.-0.225
,-0.217
 and-0.208
 with-0.151
 to-0.145
Token bump
Token ID13852
Feature activation-2.94e-01

Pos logit contributions
.+0.751
,+0.740
 and+0.716
 to+0.602
 with+0.573
Neg logit contributions
Ta-0.787
umm-0.724
icious-0.712
 Majesty-0.711
agna-0.699
Token on
Token ID319
Feature activation+2.91e-01

Pos logit contributions
Ta+1.003
Uh+0.979
ited+0.959
wy+0.958
isher+0.950
Neg logit contributions
,-0.471
.-0.430
 and-0.373
 the-0.272
 with-0.272
Token curious
Token ID11040
Feature activation-2.89e-01

Pos logit contributions
.+1.543
,+1.485
 and+1.444
 to+1.174
 with+1.154
Neg logit contributions
Ta-1.978
 Majesty-1.819
Uh-1.792
 former-1.788
icious-1.777
Token little
Token ID1310
Feature activation-5.68e-02

Pos logit contributions
 Majesty+1.293
Ta+1.267
 former+1.244
umm+1.237
 majesty+1.224
Neg logit contributions
.-0.180
,-0.171
 and-0.143
 with-0.066
 to-0.034
Token girl
Token ID2576
Feature activation-3.47e-01

Pos logit contributions
Ta+0.148
 Majesty+0.144
umm+0.137
Uh+0.136
 former+0.134
Neg logit contributions
.-0.176
,-0.171
 and-0.168
 with-0.143
 to-0.142
Token named
Token ID3706
Feature activation-1.52e-01

Pos logit contributions
 Majesty+1.454
Ta+1.401
 former+1.380
 majesty+1.378
bek+1.344
Neg logit contributions
.-0.365
,-0.346
 and-0.338
 with-0.212
 to-0.208
Token Mary
Token ID5335
Feature activation-3.62e-01

Pos logit contributions
Ta+0.411
 Majesty+0.386
 former+0.372
 sidelines+0.363
agna+0.363
Neg logit contributions
.-0.474
,-0.463
 and-0.459
 with-0.382
 to-0.380
Token.
Token ID13
Feature activation+6.04e-01

Pos logit contributions
 Majesty+1.696
 majesty+1.620
Ta+1.615
 former+1.598
bek+1.569
Neg logit contributions
,-0.087
 and-0.063
.-0.047
 to+0.041
 with+0.044
Token Every
Token ID3887
Feature activation-3.33e-01

Pos logit contributions
.+1.589
,+1.528
 and+1.489
 to+1.208
 with+1.181
Neg logit contributions
Ta-2.116
 Majesty-1.956
Uh-1.926
 former-1.917
icious-1.915
Token day
Token ID1110
Feature activation-1.10e-01

Pos logit contributions
Ta+0.984
 former+0.890
 Majesty+0.889
Uh+0.886
umm+0.872
Neg logit contributions
.-0.947
,-0.905
 and-0.886
 with-0.762
 to-0.722
Token Mary
Token ID5335
Feature activation-3.72e-01

Pos logit contributions
Ta+0.436
 Majesty+0.426
umm+0.415
Uh+0.412
 majesty+0.411
Neg logit contributions
.-0.161
 and-0.143
,-0.126
 with-0.091
 to-0.090
Token would
Token ID561
Feature activation-2.45e-01

Pos logit contributions
Ta+1.178
 Majesty+1.114
 former+1.099
Uh+1.058
agna+1.058
Neg logit contributions
.-0.960
,-0.923
 and-0.863
 with-0.745
 to-0.735
Token explore
Token ID7301
Feature activation-4.30e-01

Pos logit contributions
Ta+0.717
 Majesty+0.675
 former+0.664
icious+0.649
agna+0.637
Neg logit contributions
.-0.730
,-0.701
 and-0.681
 to-0.574
 with-0.566
Token ow
Token ID12334
Feature activation-5.37e-01

Pos logit contributions
.+1.652
,+1.576
 and+1.552
 to+1.318
 with+1.283
Neg logit contributions
Ta-1.590
Uh-1.450
 Majesty-1.439
icious-1.408
agna-1.396
Token!"
Token ID2474
Feature activation+6.98e-02

Pos logit contributions
 Majesty+2.300
umm+2.274
 pulp+2.271
Uh+2.270
icious+2.264
Neg logit contributions
 and-0.439
.-0.431
,-0.331
 to-0.277
 with-0.234
Token they
Token ID484
Feature activation+6.40e-02

Pos logit contributions
.+0.160
,+0.151
 and+0.148
 to+0.119
 with+0.114
Neg logit contributions
Ta-0.240
Uh-0.226
 Majesty-0.221
 majesty-0.215
 Deal-0.213
Token sob
Token ID27355
Feature activation-4.48e-01

Pos logit contributions
.+0.200
,+0.192
 and+0.190
 to+0.163
 with+0.160
Neg logit contributions
Ta-0.182
Uh-0.165
 Majesty-0.165
icious-0.159
agna-0.157
Tokenbed
Token ID3077
Feature activation-3.49e-01

Pos logit contributions
 Majesty+0.789
icious+0.716
Ta+0.697
 former+0.696
ivery+0.690
Neg logit contributions
.-1.601
 and-1.572
,-1.570
 to-1.473
 the-1.358
Token.
Token ID13
Feature activation+6.04e-01

Pos logit contributions
 Deal+1.383
ksh+1.348
arse+1.347
 disapproval+1.340
 gag+1.326
Neg logit contributions
,-0.045
 and-0.006
.-0.002
 to+0.006
 with+0.051
Token
Token ID198
Feature activation+2.32e-01

Pos logit contributions
.+1.066
,+1.013
 and+0.958
 to+0.696
 with+0.664
Neg logit contributions
Ta-2.496
Uh-2.392
 Majesty-2.312
 former-2.300
icious-2.290
Token
Token ID198
Feature activation+2.25e-01

Pos logit contributions
.+0.346
,+0.335
 and+0.314
 to+0.218
 with+0.196
Neg logit contributions
Ta-1.040
Uh-0.976
 Majesty-0.949
icious-0.947
 former-0.946
TokenThe
Token ID464
Feature activation-2.57e-01

Pos logit contributions
.+0.709
,+0.685
 and+0.672
 with+0.562
 to+0.558
Neg logit contributions
Ta-0.681
 Majesty-0.633
umm-0.632
 former-0.618
icious-0.613
Token tree
Token ID5509
Feature activation-1.83e-01

Pos logit contributions
Ta+0.799
 Majesty+0.726
icious+0.721
umm+0.715
bek+0.714
Neg logit contributions
.-0.795
,-0.763
 and-0.751
 to-0.628
 with-0.614
Token heard
Token ID2982
Feature activation-1.74e-01

Pos logit contributions
Ta+0.639
Uh+0.604
 Majesty+0.575
icious+0.571
 former+0.570
Neg logit contributions
.-0.439
,-0.417
 and-0.397
 to-0.331
 with-0.310
Token her
Token ID607
Feature activation+1.19e-01

Pos logit contributions
Ta+1.053
 Majesty+0.991
 majesty+0.955
agna+0.943
umm+0.941
Neg logit contributions
.-0.509
,-0.501
 and-0.481
 to-0.332
 with-0.327
Token the
Token ID262
Feature activation+2.97e-01

Pos logit contributions
,+0.189
.+0.187
 and+0.177
 with+0.117
 to+0.107
Neg logit contributions
 Majesty-0.463
Ta-0.459
 majesty-0.449
 former-0.445
agna-0.439
Token card
Token ID2657
Feature activation-4.35e-01

Pos logit contributions
.+1.025
,+0.997
 and+0.976
 to+0.836
 with+0.824
Neg logit contributions
Ta-0.803
 Majesty-0.721
Uh-0.708
 former-0.704
icious-0.703
Token.
Token ID13
Feature activation+5.90e-01

Pos logit contributions
 majesty+1.973
 Majesty+1.970
 pulp+1.913
Ta+1.913
agna+1.908
Neg logit contributions
,-0.119
.-0.064
 and-0.057
 the+0.093
 in+0.150
Token "
Token ID366
Feature activation+6.15e-03

Pos logit contributions
.+1.364
,+1.309
 and+1.265
 to+0.980
 with+0.957
Neg logit contributions
Ta-2.233
 Majesty-2.062
Uh-2.057
 former-2.031
agna-2.018
TokenTa
Token ID38586
Feature activation-1.11e+00

Pos logit contributions
.+0.022
 and+0.022
,+0.021
 with+0.019
 to+0.018
Neg logit contributions
 Majesty-0.015
 majesty-0.014
 former-0.014
umm-0.014
agna-0.013
Token-
Token ID12
Feature activation+1.62e-01

Pos logit contributions
 Majesty+4.263
Ta+4.082
 majesty+4.066
icious+4.015
agna+3.970
Neg logit contributions
.-2.073
 and-1.904
,-1.854
 to-1.597
 with-1.304
Tokenda
Token ID6814
Feature activation-6.32e-01

Pos logit contributions
.+0.702
,+0.687
 and+0.675
 to+0.596
 with+0.588
Neg logit contributions
Ta-0.303
 Majesty-0.248
Uh-0.241
bek-0.239
icious-0.239
Token!
Token ID0
Feature activation+2.92e-01

Pos logit contributions
 Majesty+2.199
 majesty+2.066
icious+2.045
agna+1.956
vation+1.947
Neg logit contributions
.-1.173
,-1.062
 and-1.054
 to-1.002
 the-0.830
Token We
Token ID775
Feature activation-3.88e-01

Pos logit contributions
.+0.972
,+0.948
 and+0.919
 to+0.786
 with+0.780
Neg logit contributions
Ta-0.814
 former-0.751
icious-0.746
Uh-0.739
 Majesty-0.739
Token made
Token ID925
Feature activation-4.55e-02

Pos logit contributions
Ta+1.393
 Majesty+1.292
umm+1.280
 former+1.277
Uh+1.274
Neg logit contributions
.-1.009
,-0.975
 and-0.953
 to-0.754
 with-0.744
Token.
Token ID13
Feature activation+5.84e-01

Pos logit contributions
 former+1.984
umm+1.962
agna+1.957
 sidelines+1.953
vation+1.946
Neg logit contributions
,-0.061
 and-0.028
.-0.017
 to+0.119
 the+0.126
Token She
Token ID1375
Feature activation-8.76e-02

Pos logit contributions
.+1.457
,+1.399
 and+1.360
 to+1.083
 with+1.059
Neg logit contributions
Ta-2.142
 Majesty-1.980
Uh-1.956
 former-1.947
icious-1.942
Token said
Token ID531
Feature activation-5.78e-02

Pos logit contributions
Ta+0.299
 Majesty+0.273
Uh+0.271
 former+0.268
icious+0.266
Neg logit contributions
.-0.238
,-0.228
 and-0.220
 to-0.182
 with-0.177
Token,
Token ID11
Feature activation+5.38e-01

Pos logit contributions
 Majesty+0.291
 majesty+0.282
Ta+0.277
agna+0.268
 seventeen+0.264
Neg logit contributions
.-0.031
 and-0.023
,-0.001
 with+0.010
 in+0.018
Token "
Token ID366
Feature activation+1.10e-02

Pos logit contributions
.+1.270
,+1.183
 and+1.154
 to+0.916
 with+0.904
Neg logit contributions
Ta-1.778
 Majesty-1.739
agna-1.714
Uh-1.688
 Deal-1.662
TokenTa
Token ID38586
Feature activation-1.10e+00

Pos logit contributions
.+0.035
 and+0.033
,+0.031
 with+0.028
 to+0.027
Neg logit contributions
 Majesty-0.032
 majesty-0.030
 former-0.030
agna-0.030
umm-0.030
Token-
Token ID12
Feature activation+1.63e-01

Pos logit contributions
 Majesty+4.241
 majesty+4.033
Ta+3.995
umm+3.975
icious+3.930
Neg logit contributions
.-1.967
 and-1.868
,-1.778
 to-1.487
 with-1.269
Tokenda
Token ID6814
Feature activation-6.27e-01

Pos logit contributions
.+0.718
,+0.702
 and+0.691
 to+0.614
 with+0.607
Neg logit contributions
Ta-0.288
 Majesty-0.243
Uh-0.236
icious-0.234
 former-0.234
Token!
Token ID0
Feature activation+2.92e-01

Pos logit contributions
 Majesty+2.411
icious+2.313
 majesty+2.293
vation+2.229
heit+2.225
Neg logit contributions
.-0.850
 and-0.800
,-0.769
 to-0.659
 with-0.546
Token Here
Token ID3423
Feature activation-7.62e-01

Pos logit contributions
.+0.978
,+0.954
 and+0.926
 to+0.795
 with+0.784
Neg logit contributions
Ta-0.807
icious-0.737
 former-0.736
Uh-0.732
 Majesty-0.726
Token I
Token ID314
Feature activation-2.74e-01

Pos logit contributions
Ta+3.336
 Majesty+3.295
 majesty+3.199
agna+3.122
heit+3.117
Neg logit contributions
.-0.726
 and-0.690
,-0.534
 to-0.416
 with-0.335
Token shook
Token ID14682
Feature activation-3.86e-01

Pos logit contributions
.+1.300
,+1.253
 and+1.220
 to+0.996
 with+0.974
Neg logit contributions
Ta-1.624
 Majesty-1.496
Uh-1.471
 former-1.467
icious-1.460
Token.
Token ID13
Feature activation+5.73e-01

Pos logit contributions
 Majesty+1.437
 former+1.427
heit+1.387
 sidelines+1.370
Ta+1.364
Neg logit contributions
,-0.478
.-0.466
 and-0.433
 to-0.291
 with-0.260
Token
Token ID198
Feature activation+1.91e-01

Pos logit contributions
.+1.252
,+1.197
 and+1.159
 to+0.892
 with+0.867
Neg logit contributions
Ta-2.206
 Majesty-2.068
Uh-2.047
 former-2.026
icious-2.011
Token
Token ID198
Feature activation+1.88e-01

Pos logit contributions
.+0.281
,+0.274
 and+0.257
 to+0.174
 with+0.164
Neg logit contributions
Ta-0.861
Uh-0.802
 Majesty-0.793
icious-0.791
 former-0.787
Token"
Token ID1
Feature activation+1.37e-02

Pos logit contributions
.+0.512
,+0.490
 and+0.480
 with+0.387
 to+0.385
Neg logit contributions
Ta-0.650
umm-0.611
 Majesty-0.611
 former-0.598
agna-0.597
TokenUh
Token ID34653
Feature activation-1.03e+00

Pos logit contributions
.+0.048
 and+0.047
,+0.045
 with+0.041
 to+0.039
Neg logit contributions
 Majesty-0.037
umm-0.034
 majesty-0.034
 former-0.033
Ta-0.032
Token-
Token ID12
Feature activation+1.55e-01

Pos logit contributions
 Majesty+3.869
Ta+3.829
bek+3.788
 majesty+3.750
heit+3.709
Neg logit contributions
 and-1.541
.-1.511
,-1.339
 to-1.096
 with-0.941
Tokenoh
Token ID1219
Feature activation-4.56e-01

Pos logit contributions
.+0.613
,+0.600
 and+0.590
 to+0.511
 with+0.508
Neg logit contributions
Ta-0.340
 Majesty-0.291
Uh-0.290
bek-0.287
icious-0.284
Token,
Token ID11
Feature activation+5.35e-01

Pos logit contributions
 Majesty+2.032
 majesty+1.928
heit+1.827
Ta+1.825
icious+1.819
Neg logit contributions
.-0.437
 and-0.433
,-0.349
 to-0.270
 with-0.230
Token Lily
Token ID20037
Feature activation-7.09e-02

Pos logit contributions
.+1.322
,+1.249
 and+1.229
 to+0.999
 with+0.965
Neg logit contributions
Ta-1.899
 Majesty-1.812
Uh-1.759
agna-1.750
icious-1.734
Token,
Token ID11
Feature activation+5.35e-01

Pos logit contributions
 Majesty+0.327
 majesty+0.317
bek+0.306
icious+0.304
Ta+0.302
Neg logit contributions
 and-0.022
.-0.022
,-0.013
 to-0.011
 with-0.006
Token to
Token ID284
Feature activation+3.91e-01

Pos logit contributions
 Majesty+0.324
Ta+0.319
 majesty+0.313
bek+0.306
agna+0.306
Neg logit contributions
.-0.043
 and-0.034
,-0.007
 with+0.011
 in+0.018
Token Tim
Token ID5045
Feature activation-1.14e-01

Pos logit contributions
.+0.909
,+0.866
 and+0.848
 to+0.674
 with+0.649
Neg logit contributions
Ta-1.469
 Majesty-1.374
icious-1.350
 former-1.343
Uh-1.339
Tokenmy
Token ID1820
Feature activation-1.88e-01

Pos logit contributions
 Majesty+0.205
Ta+0.204
icious+0.199
bek+0.195
 Deal+0.193
Neg logit contributions
.-0.369
 and-0.343
,-0.337
 with-0.318
 in-0.305
Token,
Token ID11
Feature activation+5.35e-01

Pos logit contributions
 Majesty+0.837
osity+0.803
agna+0.799
 majesty+0.778
icious+0.772
Neg logit contributions
.-0.105
 and-0.049
,-0.002
 in+0.005
 with+0.010
Token "
Token ID366
Feature activation+8.01e-03

Pos logit contributions
.+1.328
,+1.233
 and+1.191
 with+0.970
 to+0.935
Neg logit contributions
Ta-1.565
Uh-1.562
 Majesty-1.508
Really-1.507
bek-1.488
TokenUh
Token ID34653
Feature activation-1.03e+00

Pos logit contributions
.+0.028
 and+0.027
,+0.026
 with+0.023
 to+0.023
Neg logit contributions
 Majesty-0.022
 majesty-0.021
Ta-0.021
agna-0.020
 former-0.020
Token oh
Token ID11752
Feature activation-5.65e-01

Pos logit contributions
Ta+2.792
 Majesty+2.745
bek+2.692
 majesty+2.554
icious+2.548
Neg logit contributions
.-2.891
 and-2.878
,-2.734
 to-2.357
 with-2.242
Token,
Token ID11
Feature activation+5.36e-01

Pos logit contributions
 Majesty+2.404
Ta+2.385
Uh+2.338
 majesty+2.324
bek+2.309
Neg logit contributions
 and-0.534
.-0.515
,-0.347
 with-0.261
 in-0.182
Token the
Token ID262
Feature activation+2.96e-01

Pos logit contributions
.+1.194
,+1.124
 and+1.095
 to+0.844
 with+0.820
Neg logit contributions
Ta-2.091
 Majesty-1.954
Uh-1.927
icious-1.916
umm-1.911
Token roof
Token ID9753
Feature activation-5.58e-01

Pos logit contributions
.+1.206
,+1.175
 and+1.156
 to+1.017
 with+1.000
Neg logit contributions
Ta-0.634
 Majesty-0.543
icious-0.534
Uh-0.532
 former-0.527
Token has
Token ID468
Feature activation-2.93e-01

Pos logit contributions
Ta+1.742
Uh+1.720
umm+1.716
bek+1.704
icious+1.696
Neg logit contributions
.-1.150
,-1.107
 and-1.088
 to-0.882
 with-0.862
Token.
Token ID13
Feature activation+5.76e-01

Pos logit contributions
umm+2.209
Uh+2.188
roach+2.151
Ta+2.128
 pulp+2.069
Neg logit contributions
,-0.455
.-0.388
 and-0.352
 to-0.187
 with-0.179
Token
Token ID220
Feature activation-1.54e-01

Pos logit contributions
.+1.320
,+1.252
 and+1.223
 to+0.937
 with+0.920
Neg logit contributions
Ta-2.136
Uh-2.006
 Majesty-1.938
umm-1.928
 former-1.918
Token
Token ID198
Feature activation+2.01e-01

Pos logit contributions
Ta+0.636
Uh+0.591
icious+0.589
 Deal+0.578
 Majesty+0.573
Neg logit contributions
,-0.245
.-0.239
 and-0.224
 to-0.158
 with-0.145
Token
Token ID198
Feature activation+2.01e-01

Pos logit contributions
.+0.333
,+0.318
 and+0.302
 in+0.211
 to+0.209
Neg logit contributions
Ta-0.819
Uh-0.769
icious-0.731
bek-0.723
Looks-0.715
Token"
Token ID1
Feature activation+1.96e-02

Pos logit contributions
.+0.596
,+0.575
 and+0.561
 to+0.472
 with+0.453
Neg logit contributions
Ta-0.637
Uh-0.577
icious-0.568
 Majesty-0.563
bek-0.561
TokenUh
Token ID34653
Feature activation-1.03e+00

Pos logit contributions
 and+0.065
.+0.065
,+0.061
 with+0.057
 in+0.054
Neg logit contributions
 Majesty-0.060
 majesty-0.056
 former-0.052
heit-0.050
mic-0.049
Token oh
Token ID11752
Feature activation-5.68e-01

Pos logit contributions
Ta+3.272
bek+3.162
 Majesty+3.155
 majesty+3.025
icious+3.015
Neg logit contributions
.-2.484
 and-2.414
,-2.307
 to-1.888
 with-1.797
Token,
Token ID11
Feature activation+5.33e-01

Pos logit contributions
 Majesty+2.589
Ta+2.560
 majesty+2.542
Uh+2.477
bek+2.476
Neg logit contributions
.-0.415
 and-0.398
,-0.236
 with-0.109
 in-0.046
Token we
Token ID356
Feature activation-2.08e-01

Pos logit contributions
.+1.233
,+1.182
 and+1.146
 to+0.894
 with+0.870
Neg logit contributions
Ta-2.058
 Majesty-1.906
Uh-1.886
icious-1.878
 former-1.878
Token have
Token ID423
Feature activation-1.59e-01

Pos logit contributions
Ta+0.613
 Majesty+0.554
Uh+0.547
icious+0.543
 former+0.542
Neg logit contributions
.-0.669
,-0.648
 and-0.634
 to-0.537
 with-0.527
Token a
Token ID257
Feature activation+2.39e-01

Pos logit contributions
Ta+0.792
 former+0.757
 Majesty+0.756
Uh+0.749
agna+0.747
Neg logit contributions
.-0.142
,-0.130
 and-0.119
 with-0.043
 in-0.018
Token big
Token ID1263
Feature activation+1.15e-02

Pos logit contributions
Ta+0.708
 Majesty+0.654
Uh+0.653
 former+0.648
umm+0.636
Neg logit contributions
.-0.254
,-0.243
 and-0.234
 to-0.158
 with-0.153
Token rain
Token ID6290
Feature activation-3.33e-01

Pos logit contributions
.+0.029
,+0.026
 and+0.026
 to+0.023
 with+0.023
Neg logit contributions
umm-0.034
 Majesty-0.034
Ta-0.032
 pulp-0.032
Uh-0.032
Token clouds
Token ID15114
Feature activation-3.08e-01

Pos logit contributions
umm+1.309
storm+1.302
icious+1.241
drops+1.238
Ta+1.231
Neg logit contributions
,-0.374
.-0.353
 and-0.334
 to-0.260
 the-0.218
Token.
Token ID13
Feature activation+5.85e-01

Pos logit contributions
storm+1.315
 sidelines+1.271
umm+1.266
 seventeen+1.254
icious+1.250
Neg logit contributions
,-0.094
.-0.074
 and-0.040
 to-0.017
 the+0.047
Token "
Token ID366
Feature activation+2.16e-03

Pos logit contributions
.+1.538
,+1.482
 and+1.443
 to+1.166
 with+1.142
Neg logit contributions
Ta-2.062
 Majesty-1.898
Uh-1.880
 former-1.869
icious-1.867
TokenUh
Token ID34653
Feature activation-1.03e+00

Pos logit contributions
.+0.008
 and+0.007
,+0.007
 with+0.006
 to+0.006
Neg logit contributions
 Majesty-0.006
 majesty-0.005
agna-0.005
heit-0.005
Ta-0.005
Token oh
Token ID11752
Feature activation-5.66e-01

Pos logit contributions
Ta+3.749
 Majesty+3.680
 majesty+3.540
Uh+3.532
bek+3.519
Neg logit contributions
.-1.883
 and-1.862
,-1.745
 to-1.383
 with-1.266
Token,"
Token ID553
Feature activation+1.36e-01

Pos logit contributions
 Majesty+2.530
 majesty+2.473
Ta+2.427
Uh+2.409
 seventeen+2.380
Neg logit contributions
 and-0.433
.-0.409
,-0.255
 with-0.161
 the-0.082
Token said
Token ID531
Feature activation-5.67e-02

Pos logit contributions
.+0.299
,+0.282
 and+0.279
 to+0.218
 with+0.206
Neg logit contributions
Ta-0.508
Uh-0.474
 Majesty-0.466
bek-0.451
 former-0.451
Token Tim
Token ID5045
Feature activation-1.22e-01

Pos logit contributions
Ta+0.195
Uh+0.182
 Majesty+0.178
umm+0.175
bek+0.174
Neg logit contributions
 and-0.124
.-0.120
,-0.114
 to-0.085
 with-0.085
Token,
Token ID11
Feature activation+5.36e-01

Pos logit contributions
 Majesty+0.536
 supervision+0.521
icious+0.520
rated+0.520
bek+0.519
Neg logit contributions
 and-0.031
,-0.017
.-0.002
 to+0.017
 with+0.019
Token shirts
Token ID20858
Feature activation-5.32e-01

Pos logit contributions
.+0.217
,+0.214
 and+0.209
 with+0.183
 to+0.183
Neg logit contributions
Ta-0.111
 Majesty-0.098
agna-0.094
 sidelines-0.092
 former-0.091
Token.
Token ID13
Feature activation+5.90e-01

Pos logit contributions
uries+2.081
agna+2.044
wy+2.038
 former+2.020
belt+2.016
Neg logit contributions
,-0.159
 and-0.012
.-0.001
 the+0.034
 to+0.102
Token
Token ID198
Feature activation+2.20e-01

Pos logit contributions
.+1.386
,+1.335
 and+1.288
 to+0.998
 with+0.975
Neg logit contributions
Ta-2.199
Uh-2.028
 Majesty-2.000
icious-1.984
agna-1.981
Token
Token ID198
Feature activation+2.19e-01

Pos logit contributions
,+0.363
.+0.362
 and+0.335
 to+0.251
 in+0.248
Neg logit contributions
Ta-0.889
Uh-0.804
icious-0.800
 Deal-0.797
 Majesty-0.795
Token"
Token ID1
Feature activation+4.03e-02

Pos logit contributions
.+0.549
,+0.528
 and+0.511
 to+0.412
 with+0.394
Neg logit contributions
Ta-0.806
Uh-0.736
icious-0.732
 Majesty-0.730
bek-0.722
TokenUh
Token ID34653
Feature activation-1.03e+00

Pos logit contributions
.+0.145
 and+0.141
,+0.132
 with+0.123
 to+0.116
Neg logit contributions
 Majesty-0.109
 majesty-0.099
 former-0.097
umm-0.091
heit-0.090
Token-
Token ID12
Feature activation+1.58e-01

Pos logit contributions
Ta+3.663
 Majesty+3.609
bek+3.565
 majesty+3.509
icious+3.400
Neg logit contributions
 and-1.910
.-1.851
,-1.720
 to-1.400
 with-1.283
Tokenoh
Token ID1219
Feature activation-4.57e-01

Pos logit contributions
.+0.625
,+0.609
 and+0.599
 to+0.525
 with+0.517
Neg logit contributions
Ta-0.349
 Majesty-0.306
Uh-0.299
 former-0.297
icious-0.297
Token,
Token ID11
Feature activation+5.44e-01

Pos logit contributions
 Majesty+2.149
 majesty+2.076
agna+1.939
Ta+1.935
bek+1.917
Neg logit contributions
 and-0.309
.-0.271
,-0.176
 to-0.135
 with-0.098
Token our
Token ID674
Feature activation-2.36e-01

Pos logit contributions
.+1.447
,+1.363
 and+1.347
 to+1.106
 with+1.071
Neg logit contributions
Ta-1.845
 Majesty-1.734
Uh-1.702
agna-1.692
icious-1.676
Token shirts
Token ID20858
Feature activation-5.31e-01

Pos logit contributions
Ta+0.509
 Majesty+0.459
agna+0.433
icious+0.429
 former+0.427
Neg logit contributions
.-0.903
,-0.884
 and-0.865
 to-0.763
 with-0.755
Token angry
Token ID7954
Feature activation-2.56e-01

Pos logit contributions
Ta+0.658
Uh+0.638
 Majesty+0.632
heit+0.624
 sidelines+0.620
Neg logit contributions
.-0.368
,-0.334
 and-0.317
 to-0.253
 with-0.242
Token.
Token ID13
Feature activation+5.78e-01

Pos logit contributions
 Majesty+0.942
umm+0.928
ote+0.918
icious+0.915
 seventeen+0.907
Neg logit contributions
,-0.044
.-0.037
 the-0.037
 to+0.006
 and+0.013
Token
Token ID198
Feature activation+1.99e-01

Pos logit contributions
.+1.502
,+1.447
 and+1.408
 to+1.148
 with+1.116
Neg logit contributions
Ta-2.010
 Majesty-1.856
Uh-1.854
 former-1.811
icious-1.809
Token
Token ID198
Feature activation+1.96e-01

Pos logit contributions
.+0.287
,+0.276
 and+0.260
 to+0.168
 with+0.158
Neg logit contributions
Ta-0.913
Uh-0.854
 Majesty-0.843
icious-0.838
 former-0.836
Token"
Token ID1
Feature activation+2.10e-02

Pos logit contributions
.+0.490
,+0.466
 and+0.454
 with+0.359
 to+0.353
Neg logit contributions
Ta-0.720
umm-0.688
 Majesty-0.685
 former-0.675
agna-0.669
TokenUh
Token ID34653
Feature activation-1.03e+00

Pos logit contributions
.+0.083
 and+0.081
,+0.079
 with+0.071
 the+0.070
Neg logit contributions
 Majesty-0.047
 former-0.046
umm-0.044
Ta-0.044
 majesty-0.043
Token-
Token ID12
Feature activation+1.56e-01

Pos logit contributions
Ta+3.718
 Majesty+3.686
bek+3.623
 majesty+3.549
heit+3.521
Neg logit contributions
 and-1.789
.-1.780
,-1.617
 to-1.296
 with-1.158
Tokenoh
Token ID1219
Feature activation-4.57e-01

Pos logit contributions
.+0.626
,+0.612
 and+0.602
 to+0.523
 with+0.519
Neg logit contributions
Ta-0.336
Uh-0.287
 Majesty-0.287
bek-0.282
icious-0.279
Token,
Token ID11
Feature activation+5.34e-01

Pos logit contributions
 Majesty+2.110
 majesty+2.008
Ta+1.905
heit+1.894
agna+1.876
Neg logit contributions
 and-0.367
.-0.358
,-0.264
 to-0.193
 with-0.152
Token what
Token ID644
Feature activation-1.05e-02

Pos logit contributions
.+1.257
,+1.179
 and+1.166
 to+0.935
 with+0.901
Neg logit contributions
Ta-1.959
 Majesty-1.864
Uh-1.816
agna-1.802
icious-1.782
Token is
Token ID318
Feature activation-6.60e-04

Pos logit contributions
Ta+0.034
 Majesty+0.031
Uh+0.031
agna+0.030
 majesty+0.030
Neg logit contributions
.-0.028
,-0.027
 and-0.027
 with-0.022
 to-0.022
Token in
Token ID287
Feature activation+3.47e-01

Pos logit contributions
Ta+0.261
Uh+0.253
behavior+0.252
 sidelines+0.252
 Indians+0.250
Neg logit contributions
,-0.135
.-0.127
 and-0.100
 that-0.092
 with-0.068
Token and
Token ID290
Feature activation+5.24e-01

Pos logit contributions
,+0.181
.+0.162
 and+0.062
 in-0.042
 with-0.058
Neg logit contributions
 Deal-1.509
Uh-1.507
Ta-1.503
bek-1.465
 sidelines-1.463
Token said
Token ID531
Feature activation-6.47e-02

Pos logit contributions
.+1.412
,+1.349
 and+1.321
 to+1.054
 with+1.053
Neg logit contributions
Ta-1.729
Uh-1.587
bek-1.545
agna-1.526
 Majesty-1.522
Token,
Token ID11
Feature activation+5.39e-01

Pos logit contributions
 Majesty+0.308
agna+0.303
Ta+0.301
 majesty+0.299
 pulp+0.294
Neg logit contributions
.-0.035
 and-0.028
,-0.002
 with+0.006
 in+0.015
Token "
Token ID366
Feature activation+9.13e-03

Pos logit contributions
.+1.294
,+1.207
 and+1.176
 with+0.947
 to+0.899
Neg logit contributions
Ta-1.635
Uh-1.620
agna-1.620
bek-1.571
icious-1.571
TokenUh
Token ID34653
Feature activation-1.03e+00

Pos logit contributions
.+0.031
 and+0.031
,+0.030
 to+0.026
 with+0.026
Neg logit contributions
 Majesty-0.025
Ta-0.024
 majesty-0.024
 former-0.024
agna-0.024
Token oh
Token ID11752
Feature activation-5.63e-01

Pos logit contributions
Ta+3.004
bek+2.961
 Majesty+2.905
icious+2.784
Uh+2.747
Neg logit contributions
.-2.733
 and-2.711
,-2.590
 to-2.167
 with-2.067
Token,
Token ID11
Feature activation+5.38e-01

Pos logit contributions
 Majesty+2.463
Ta+2.446
bek+2.421
Uh+2.407
 majesty+2.392
Neg logit contributions
 and-0.409
.-0.399
,-0.228
 with-0.144
 in-0.058
Token you
Token ID345
Feature activation-1.12e-01

Pos logit contributions
.+1.288
,+1.221
 and+1.192
 to+0.939
 with+0.915
Neg logit contributions
Ta-2.021
 Majesty-1.875
Uh-1.858
icious-1.849
umm-1.843
Token boys
Token ID6510
Feature activation-3.55e-01

Pos logit contributions
Ta+0.339
 Majesty+0.307
Uh+0.303
icious+0.301
 former+0.300
Neg logit contributions
.-0.354
,-0.342
 and-0.335
 to-0.283
 with-0.278
Token made
Token ID925
Feature activation-3.22e-02

Pos logit contributions
 Majesty+1.492
icious+1.486
Ta+1.477
bek+1.462
agna+1.455
Neg logit contributions
 and-0.305
.-0.283
,-0.264
 with-0.149
 to-0.146
Tokena
Token ID64
Feature activation-3.87e-01

Pos logit contributions
Ta+1.090
 Majesty+0.977
icious+0.937
Uh+0.934
 former+0.930
Neg logit contributions
.-1.743
,-1.718
 and-1.691
 to-1.469
 with-1.420
Token.
Token ID13
Feature activation+5.79e-01

Pos logit contributions
 Majesty+1.807
 majesty+1.716
uer+1.698
Ta+1.685
bek+1.680
Neg logit contributions
 and-0.078
,-0.032
.-0.021
 to+0.061
 the+0.102
Token
Token ID220
Feature activation-1.51e-01

Pos logit contributions
.+1.163
,+1.096
 and+1.064
 to+0.765
 with+0.757
Neg logit contributions
Ta-2.236
Uh-2.095
 Majesty-2.058
agna-2.026
 former-2.003
Token
Token ID198
Feature activation+1.99e-01

Pos logit contributions
Ta+0.544
 Deal+0.519
 Majesty+0.518
icious+0.513
 former+0.507
Neg logit contributions
,-0.250
.-0.241
 and-0.241
 in-0.172
 with-0.170
Token"
Token ID1
Feature activation+2.19e-02

Pos logit contributions
.+0.309
,+0.290
 and+0.275
 to+0.180
 with+0.165
Neg logit contributions
Ta-0.903
Uh-0.832
 Majesty-0.828
icious-0.823
bek-0.820
TokenUh
Token ID34653
Feature activation-1.03e+00

Pos logit contributions
.+0.081
 and+0.077
,+0.075
 with+0.069
 to+0.067
Neg logit contributions
 Majesty-0.052
Ta-0.050
 majesty-0.049
agna-0.048
 former-0.048
Token,
Token ID11
Feature activation+5.34e-01

Pos logit contributions
Ta+3.167
 Majesty+3.069
bek+2.954
 majesty+2.907
icious+2.828
Neg logit contributions
.-2.723
 and-2.653
,-2.599
 to-2.117
 with-2.013
Token what
Token ID644
Feature activation-1.06e-02

Pos logit contributions
.+1.462
,+1.387
 and+1.360
 to+1.109
 with+1.083
Neg logit contributions
Ta-1.787
 Majesty-1.670
Uh-1.619
agna-1.617
icious-1.613
Token are
Token ID389
Feature activation-1.64e-01

Pos logit contributions
Ta+0.037
 Majesty+0.034
Uh+0.033
agna+0.033
icious+0.033
Neg logit contributions
.-0.026
,-0.025
 and-0.024
 to-0.019
 with-0.019
Token you
Token ID345
Feature activation-1.16e-01

Pos logit contributions
Ta+0.546
 Majesty+0.500
agna+0.493
Uh+0.490
umm+0.488
Neg logit contributions
.-0.442
 and-0.425
,-0.424
 with-0.328
 to-0.324
Token?"
Token ID1701
Feature activation-6.37e-02

Pos logit contributions
Ta+0.320
 Majesty+0.313
icious+0.296
 former+0.294
Uh+0.291
Neg logit contributions
.-0.358
 and-0.332
,-0.330
 to-0.284
 with-0.278
Token pieces
Token ID5207
Feature activation-2.88e-01

Pos logit contributions
.+0.137
,+0.136
 and+0.131
 to+0.104
 with+0.097
Neg logit contributions
Ta-0.228
 Majesty-0.217
Uh-0.216
 former-0.216
 pulp-0.209
Token.
Token ID13
Feature activation+5.78e-01

Pos logit contributions
 pulp+1.210
 ounces+1.199
inations+1.194
 former+1.193
Ta+1.190
Neg logit contributions
,-0.079
.-0.039
 and-0.019
 to+0.036
 the+0.056
Token
Token ID198
Feature activation+1.93e-01

Pos logit contributions
.+1.394
,+1.337
 and+1.299
 to+1.028
 with+1.001
Neg logit contributions
Ta-2.157
 Majesty-1.994
Uh-1.972
icious-1.962
 former-1.961
Token
Token ID198
Feature activation+1.89e-01

Pos logit contributions
.+0.288
,+0.280
 and+0.262
 to+0.172
 with+0.164
Neg logit contributions
Ta-0.878
Uh-0.815
 Majesty-0.808
 former-0.803
icious-0.803
Token"
Token ID1
Feature activation+1.61e-02

Pos logit contributions
.+0.553
,+0.528
 and+0.520
 with+0.429
 to+0.423
Neg logit contributions
Ta-0.611
umm-0.586
 Majesty-0.581
agna-0.566
 former-0.565
TokenUh
Token ID34653
Feature activation-1.03e+00

Pos logit contributions
.+0.057
 and+0.056
,+0.053
 with+0.050
 the+0.048
Neg logit contributions
 Majesty-0.042
 majesty-0.038
umm-0.038
 former-0.037
mic-0.036
Token-
Token ID12
Feature activation+1.54e-01

Pos logit contributions
Ta+3.812
 Majesty+3.755
bek+3.709
 majesty+3.628
Uh+3.593
Neg logit contributions
.-1.771
 and-1.762
,-1.591
 to-1.259
 with-1.144
Tokenoh
Token ID1219
Feature activation-4.57e-01

Pos logit contributions
.+0.612
,+0.597
 and+0.588
 to+0.511
 with+0.506
Neg logit contributions
Ta-0.342
 Majesty-0.295
Uh-0.292
icious-0.287
bek-0.286
Token,
Token ID11
Feature activation+5.38e-01

Pos logit contributions
 Majesty+2.028
 majesty+1.932
Ta+1.857
icious+1.815
agna+1.811
Neg logit contributions
.-0.479
 and-0.468
,-0.385
 to-0.298
 with-0.257
Token our
Token ID674
Feature activation-2.45e-01

Pos logit contributions
.+1.340
,+1.270
 and+1.250
 to+1.005
 with+0.976
Neg logit contributions
Ta-1.927
 Majesty-1.819
Uh-1.774
agna-1.761
icious-1.747
Token missile
Token ID10105
Feature activation-5.81e-01

Pos logit contributions
Ta+0.673
 Majesty+0.609
Uh+0.595
 former+0.593
icious+0.591
Neg logit contributions
.-0.833
,-0.809
 and-0.793
 to-0.677
 with-0.667
Token sofa
Token ID34902
Feature activation-5.06e-01

Pos logit contributions
.+1.021
,+1.003
 and+0.971
 with+0.850
 to+0.846
Neg logit contributions
Ta-0.737
 sidelines-0.686
Uh-0.640
umm-0.638
 Majesty-0.627
Token.
Token ID13
Feature activation+5.85e-01

Pos logit contributions
 sidelines+2.180
agna+2.120
umm+2.111
 ounces+2.093
 seventeen+2.087
Neg logit contributions
 and-0.047
,-0.036
.-0.003
 to+0.104
 the+0.126
Token
Token ID198
Feature activation+2.12e-01

Pos logit contributions
.+1.429
,+1.367
 and+1.317
 to+1.038
 with+1.012
Neg logit contributions
Ta-2.139
Uh-1.984
 former-1.943
 Majesty-1.940
agna-1.927
Token
Token ID198
Feature activation+2.12e-01

Pos logit contributions
.+0.349
,+0.333
 and+0.315
 to+0.220
 in+0.213
Neg logit contributions
Ta-0.891
Uh-0.829
icious-0.803
 Deal-0.798
 former-0.796
Token"
Token ID1
Feature activation+3.35e-02

Pos logit contributions
.+0.490
,+0.470
 and+0.456
 to+0.354
 with+0.348
Neg logit contributions
Ta-0.823
 Majesty-0.769
umm-0.758
 former-0.757
icious-0.753
TokenUh
Token ID34653
Feature activation-1.03e+00

Pos logit contributions
.+0.124
 and+0.122
,+0.114
 with+0.108
 the+0.107
Neg logit contributions
 Majesty-0.084
 former-0.078
 majesty-0.076
umm-0.071
osity-0.070
Token-
Token ID12
Feature activation+1.56e-01

Pos logit contributions
Ta+3.705
bek+3.637
 Majesty+3.623
 majesty+3.516
Uh+3.492
Neg logit contributions
 and-1.858
.-1.837
,-1.683
 to-1.377
 with-1.217
Tokenoh
Token ID1219
Feature activation-4.61e-01

Pos logit contributions
.+0.620
,+0.604
 and+0.594
 to+0.520
 with+0.513
Neg logit contributions
Ta-0.346
 Majesty-0.302
Uh-0.296
icious-0.293
 former-0.292
Token,
Token ID11
Feature activation+5.41e-01

Pos logit contributions
 Majesty+2.075
 majesty+2.000
Ta+1.906
bek+1.882
heit+1.874
Neg logit contributions
 and-0.419
.-0.408
,-0.297
 to-0.241
 with-0.198
Token where
Token ID810
Feature activation+3.84e-02

Pos logit contributions
.+1.073
,+1.005
 and+0.982
 to+0.732
 with+0.702
Neg logit contributions
Ta-2.248
 Majesty-2.115
Uh-2.086
agna-2.064
icious-2.064
Token did
Token ID750
Feature activation-2.28e-01

Pos logit contributions
.+0.095
 and+0.090
,+0.090
 with+0.071
 to+0.070
Neg logit contributions
Ta-0.134
 Majesty-0.124
Uh-0.122
agna-0.120
bek-0.119
Token open
Token ID1280
Feature activation-1.06e-01

Pos logit contributions
.+0.910
,+0.881
 and+0.865
 to+0.728
 with+0.716
Neg logit contributions
Ta-0.843
 Majesty-0.762
Uh-0.749
icious-0.745
 former-0.743
Token freezer
Token ID30967
Feature activation-5.76e-01

Pos logit contributions
Ta+0.229
umm+0.220
 former+0.213
 Majesty+0.212
Uh+0.208
Neg logit contributions
.-0.377
,-0.361
 and-0.356
 to-0.313
 with-0.306
Token and
Token ID290
Feature activation+5.11e-01

Pos logit contributions
umm+2.611
bek+2.516
agna+2.512
 seventeen+2.464
icious+2.436
Neg logit contributions
.-0.269
,-0.198
 and-0.117
 to+0.019
 the+0.046
Token said
Token ID531
Feature activation-6.73e-02

Pos logit contributions
.+1.334
,+1.279
 and+1.243
 to+1.001
 with+0.974
Neg logit contributions
Ta-1.794
 Majesty-1.650
Uh-1.640
 former-1.623
bek-1.612
Token "
Token ID366
Feature activation+1.21e-03

Pos logit contributions
 Majesty+0.325
Ta+0.316
 majesty+0.314
agna+0.313
 seventeen+0.309
Neg logit contributions
.-0.046
 and-0.039
,-0.020
 with+0.001
 to+0.008
TokenUh
Token ID34653
Feature activation-1.03e+00

Pos logit contributions
.+0.005
 and+0.005
,+0.004
 with+0.004
 to+0.004
Neg logit contributions
 Majesty-0.003
 majesty-0.003
Ta-0.003
agna-0.002
 former-0.002
Token oh
Token ID11752
Feature activation-5.67e-01

Pos logit contributions
 Majesty+3.694
bek+3.631
Ta+3.569
 majesty+3.522
icious+3.442
Neg logit contributions
.-1.968
 and-1.961
,-1.830
 to-1.450
 with-1.317
Token!"
Token ID2474
Feature activation+5.94e-02

Pos logit contributions
 Majesty+2.447
 majesty+2.376
Ta+2.359
bek+2.358
agna+2.326
Neg logit contributions
.-0.573
 and-0.566
,-0.432
 with-0.291
 in-0.209
Token He
Token ID679
Feature activation-1.47e-01

Pos logit contributions
.+0.143
,+0.137
 and+0.129
 in+0.105
 to+0.105
Neg logit contributions
Ta-0.191
Uh-0.180
umm-0.177
 Majesty-0.176
 former-0.174
Token quickly
Token ID2952
Feature activation-6.04e-03

Pos logit contributions
Ta+0.487
Uh+0.445
 Majesty+0.443
bek+0.437
icious+0.435
Neg logit contributions
.-0.408
,-0.390
 and-0.376
 to-0.313
 with-0.304
Token shut
Token ID4423
Feature activation-2.46e-01

Pos logit contributions
Ta+0.016
Uh+0.014
bek+0.014
agna+0.014
icious+0.014
Neg logit contributions
.-0.019
,-0.018
 and-0.018
 to-0.015
 with-0.015
Token scared
Token ID12008
Feature activation-1.95e-01

Pos logit contributions
.+0.431
,+0.415
 and+0.414
 to+0.356
 with+0.341
Neg logit contributions
Ta-0.376
 Majesty-0.338
 former-0.332
Uh-0.330
icious-0.329
Token,
Token ID11
Feature activation+5.23e-01

Pos logit contributions
umm+0.731
 sidelines+0.719
Uh+0.700
icious+0.681
uster+0.676
Neg logit contributions
.-0.103
,-0.093
 and-0.077
 with-0.048
 to-0.022
Token she
Token ID673
Feature activation+1.22e-02

Pos logit contributions
.+1.062
,+0.985
 and+0.905
 to+0.710
 with+0.687
Neg logit contributions
Ta-1.970
Uh-1.898
 sidelines-1.854
 Majesty-1.840
 former-1.832
Token said
Token ID531
Feature activation-7.10e-02

Pos logit contributions
.+0.036
,+0.035
 and+0.034
 to+0.028
 with+0.028
Neg logit contributions
Ta-0.038
 Majesty-0.035
Uh-0.035
icious-0.034
 former-0.034
Token "
Token ID366
Feature activation+5.18e-04

Pos logit contributions
 Majesty+0.265
 majesty+0.255
bek+0.254
agna+0.252
Ta+0.250
Neg logit contributions
.-0.108
 and-0.100
,-0.087
 with-0.062
 the-0.053
TokenUh
Token ID34653
Feature activation-1.03e+00

Pos logit contributions
.+0.002
 and+0.002
,+0.002
 with+0.001
 to+0.001
Neg logit contributions
Ta-0.001
 Majesty-0.001
 majesty-0.001
agna-0.001
 former-0.001
Token-
Token ID12
Feature activation+1.56e-01

Pos logit contributions
 Majesty+3.596
Ta+3.538
bek+3.475
 majesty+3.413
icious+3.345
Neg logit contributions
.-2.126
 and-2.109
,-2.018
 to-1.585
 with-1.463
Tokenoh
Token ID1219
Feature activation-4.63e-01

Pos logit contributions
.+0.604
,+0.588
 and+0.578
 to+0.507
 with+0.499
Neg logit contributions
Ta-0.361
 Majesty-0.319
 former-0.311
Uh-0.311
icious-0.310
Token",
Token ID1600
Feature activation-2.79e-01

Pos logit contributions
 Majesty+2.182
 majesty+2.087
Ta+2.042
agna+2.029
bek+2.022
Neg logit contributions
.-0.424
 and-0.408
,-0.336
 to-0.185
 with-0.171
Token and
Token ID290
Feature activation+5.13e-01

Pos logit contributions
Ta+1.076
 Majesty+1.067
icious+1.042
agna+1.038
 majesty+1.030
Neg logit contributions
.-0.356
,-0.313
 and-0.260
 to-0.197
 in-0.175
Token ran
Token ID4966
Feature activation-2.26e-01

Pos logit contributions
.+1.267
,+1.202
 and+1.177
 to+0.926
 with+0.898
Neg logit contributions
Ta-1.820
 Majesty-1.711
Uh-1.685
bek-1.673
 former-1.657
Token bends
Token ID44370
Feature activation-4.78e-01

Pos logit contributions
.+1.683
,+1.628
 and+1.595
 to+1.357
 with+1.335
Neg logit contributions
Ta-1.511
 Majesty-1.360
Uh-1.333
icious-1.331
 former-1.327
Token.
Token ID13
Feature activation+5.72e-01

Pos logit contributions
 Majesty+2.061
wy+2.018
heit+2.009
 pulp+2.006
 former+1.989
Neg logit contributions
,-0.286
.-0.225
 and-0.214
 with-0.065
 to-0.052
Token
Token ID198
Feature activation+1.89e-01

Pos logit contributions
.+1.503
,+1.447
 and+1.408
 to+1.141
 with+1.113
Neg logit contributions
Ta-2.012
 Majesty-1.853
Uh-1.831
 former-1.819
icious-1.818
Token
Token ID198
Feature activation+1.87e-01

Pos logit contributions
.+0.283
,+0.275
 and+0.253
 to+0.173
 with+0.161
Neg logit contributions
Ta-0.856
Uh-0.797
icious-0.786
 Majesty-0.785
 former-0.781
Token"
Token ID1
Feature activation+1.62e-02

Pos logit contributions
.+0.513
,+0.489
 and+0.484
 with+0.396
 to+0.384
Neg logit contributions
Ta-0.636
umm-0.615
 Majesty-0.611
 former-0.595
agna-0.593
TokenUh
Token ID34653
Feature activation-1.03e+00

Pos logit contributions
.+0.057
 and+0.056
,+0.053
 with+0.049
 in+0.046
Neg logit contributions
 Majesty-0.042
 former-0.038
 majesty-0.038
umm-0.037
heit-0.036
Token-
Token ID12
Feature activation+1.60e-01

Pos logit contributions
 Majesty+3.781
Ta+3.780
 majesty+3.689
bek+3.679
icious+3.562
Neg logit contributions
 and-1.711
.-1.696
,-1.495
 to-1.246
 with-1.077
Tokenoh
Token ID1219
Feature activation-4.52e-01

Pos logit contributions
.+0.631
,+0.615
 and+0.604
 to+0.529
 with+0.522
Neg logit contributions
Ta-0.354
 Majesty-0.309
Uh-0.303
icious-0.300
 former-0.299
Token,
Token ID11
Feature activation+5.35e-01

Pos logit contributions
 Majesty+2.129
 majesty+2.043
Ta+1.954
agna+1.919
 seventeen+1.915
Neg logit contributions
.-0.353
 and-0.337
,-0.249
 to-0.166
 with-0.132
Token this
Token ID428
Feature activation+4.27e-03

Pos logit contributions
.+1.349
,+1.279
 and+1.257
 to+1.019
 with+0.987
Neg logit contributions
Ta-1.909
 Majesty-1.801
Uh-1.753
agna-1.744
icious-1.734
Token spoon
Token ID24556
Feature activation-4.59e-01

Pos logit contributions
.+0.015
,+0.014
 and+0.014
 to+0.013
 with+0.012
Neg logit contributions
Ta-0.010
 Majesty-0.009
umm-0.009
Uh-0.009
agna-0.009
Token shake
Token ID13279
Feature activation-2.95e-01

Pos logit contributions
Ta+2.142
mic+2.105
 sidelines+2.087
Uh+2.076
vation+2.068
Neg logit contributions
,-0.364
.-0.294
 and-0.285
 to-0.149
 with-0.061
Token.
Token ID13
Feature activation+5.83e-01

Pos logit contributions
ksh+1.239
rated+1.226
umm+1.207
mic+1.207
Uh+1.203
Neg logit contributions
,-0.088
 and-0.017
.-0.006
 to+0.026
 with+0.068
Token
Token ID198
Feature activation+2.10e-01

Pos logit contributions
.+1.406
,+1.350
 and+1.307
 to+1.028
 with+1.006
Neg logit contributions
Ta-2.178
 Majesty-2.013
Uh-2.006
 former-1.975
umm-1.968
Token
Token ID198
Feature activation+2.09e-01

Pos logit contributions
.+0.332
,+0.325
 and+0.303
 to+0.221
 in+0.208
Neg logit contributions
Ta-0.886
Uh-0.817
 Majesty-0.807
icious-0.795
 majesty-0.789
Token"
Token ID1
Feature activation+3.21e-02

Pos logit contributions
.+0.449
,+0.429
 and+0.416
 to+0.313
 with+0.310
Neg logit contributions
Ta-0.841
 Majesty-0.793
umm-0.786
 former-0.786
icious-0.776
TokenUh
Token ID34653
Feature activation-1.03e+00

Pos logit contributions
.+0.115
 and+0.112
,+0.105
 the+0.098
 with+0.098
Neg logit contributions
 Majesty-0.085
 former-0.077
 majesty-0.077
umm-0.075
agna-0.071
Token-
Token ID12
Feature activation+1.58e-01

Pos logit contributions
Ta+3.705
 Majesty+3.621
bek+3.546
 majesty+3.496
Uh+3.461
Neg logit contributions
 and-1.884
.-1.858
,-1.725
 to-1.401
 with-1.271
Tokenoh
Token ID1219
Feature activation-4.58e-01

Pos logit contributions
.+0.628
,+0.612
 and+0.602
 to+0.527
 with+0.520
Neg logit contributions
Ta-0.347
 Majesty-0.302
Uh-0.296
icious-0.294
 former-0.293
Token,
Token ID11
Feature activation+5.40e-01

Pos logit contributions
 Majesty+2.111
 majesty+2.033
Ta+1.920
agna+1.887
bek+1.873
Neg logit contributions
 and-0.377
.-0.368
,-0.267
 to-0.209
 with-0.174
Token what
Token ID644
Feature activation-1.03e-02

Pos logit contributions
.+1.361
,+1.284
 and+1.267
 to+1.026
 with+0.994
Neg logit contributions
Ta-1.917
 Majesty-1.807
Uh-1.770
agna-1.752
icious-1.734
Token is
Token ID318
Feature activation+3.16e-05

Pos logit contributions
Ta+0.036
 Majesty+0.033
Uh+0.032
agna+0.032
 majesty+0.032
Neg logit contributions
.-0.025
,-0.024
 and-0.024
 with-0.019
 to-0.019
Token in
Token ID287
Feature activation+3.40e-01

Pos logit contributions
storm+1.507
Uh+1.488
plet+1.486
Ta+1.481
 sidelines+1.469
Neg logit contributions
,-0.322
.-0.258
 and-0.248
 to-0.131
 the-0.130
Token the
Token ID262
Feature activation+2.95e-01

Pos logit contributions
,+0.419
.+0.419
 and+0.368
 to+0.247
 with+0.214
Neg logit contributions
Ta-1.585
Uh-1.503
 former-1.446
umm-1.443
 sidelines-1.427
Token distance
Token ID5253
Feature activation-5.39e-01

Pos logit contributions
.+0.995
,+0.967
 and+0.947
 to+0.810
 with+0.797
Neg logit contributions
Ta-0.823
 Majesty-0.730
Uh-0.728
 former-0.718
umm-0.717
Token.
Token ID13
Feature activation+5.73e-01

Pos logit contributions
umm+1.934
 sidelines+1.882
 spoiler+1.841
 spine+1.840
 horizon+1.833
Neg logit contributions
,-0.204
 and-0.104
.-0.000
 to+0.002
 in+0.034
Token "
Token ID366
Feature activation+2.95e-03

Pos logit contributions
.+1.592
,+1.526
 and+1.489
 to+1.218
 with+1.193
Neg logit contributions
Ta-1.916
 Majesty-1.769
Uh-1.747
umm-1.727
 former-1.725
TokenUh
Token ID34653
Feature activation-1.03e+00

Pos logit contributions
.+0.010
 and+0.010
,+0.009
 with+0.009
 in+0.009
Neg logit contributions
 Majesty-0.008
 majesty-0.008
agna-0.007
 former-0.007
umm-0.007
Token oh
Token ID11752
Feature activation-5.64e-01

Pos logit contributions
Ta+3.814
 Majesty+3.671
bek+3.550
 majesty+3.521
Uh+3.520
Neg logit contributions
.-1.994
 and-1.930
,-1.843
 to-1.400
 with-1.316
Token,
Token ID11
Feature activation+5.31e-01

Pos logit contributions
 Majesty+2.396
Ta+2.373
 majesty+2.341
Uh+2.334
heit+2.260
Neg logit contributions
.-0.508
 and-0.503
,-0.345
 with-0.212
 the-0.142
Token that
Token ID326
Feature activation+3.08e-01

Pos logit contributions
.+1.662
,+1.604
 and+1.573
 to+1.323
 with+1.296
Neg logit contributions
Ta-1.610
 Majesty-1.464
Uh-1.443
icious-1.433
 former-1.428
Token cloud
Token ID6279
Feature activation-3.74e-01

Pos logit contributions
.+0.910
,+0.870
 and+0.866
 to+0.722
 with+0.711
Neg logit contributions
Ta-0.880
Uh-0.837
 Majesty-0.832
icious-0.805
umm-0.797
Token looks
Token ID3073
Feature activation-3.04e-01

Pos logit contributions
Ta+1.174
 Majesty+1.150
Uh+1.145
umm+1.135
icious+1.131
Neg logit contributions
.-0.830
 and-0.795
,-0.777
 to-0.648
 with-0.619
Token filled
Token ID5901
Feature activation-1.96e-01

Pos logit contributions
Uh+2.204
Ta+2.189
mic+2.168
umm+2.152
vation+2.123
Neg logit contributions
,-0.535
 and-0.523
.-0.502
 to-0.311
 with-0.256
Token the
Token ID262
Feature activation+2.85e-01

Pos logit contributions
Ta+0.759
bek+0.742
 former+0.733
Uh+0.722
Looks+0.719
Neg logit contributions
,-0.282
.-0.258
 and-0.244
 to-0.177
 with-0.108
Token air
Token ID1633
Feature activation-2.92e-01

Pos logit contributions
.+0.962
,+0.934
 and+0.915
 to+0.780
 with+0.768
Neg logit contributions
Ta-0.797
 Majesty-0.717
Uh-0.706
icious-0.702
 former-0.700
Token.
Token ID13
Feature activation+5.71e-01

Pos logit contributions
 former+1.331
 sidelines+1.305
icious+1.304
umm+1.298
mic+1.288
Neg logit contributions
,-0.088
 and-0.056
.-0.041
 to+0.030
 the+0.086
Token "
Token ID366
Feature activation-1.60e-03

Pos logit contributions
.+1.602
,+1.548
 and+1.510
 to+1.239
 with+1.216
Neg logit contributions
Ta-1.903
 Majesty-1.738
Uh-1.726
 former-1.713
icious-1.708
TokenUh
Token ID34653
Feature activation-1.03e+00

Pos logit contributions
 Majesty+0.005
 majesty+0.004
agna+0.004
umm+0.004
heit+0.004
Neg logit contributions
.-0.005
 and-0.005
,-0.005
 with-0.005
 to-0.004
Token-
Token ID12
Feature activation+1.59e-01

Pos logit contributions
Ta+3.667
 Majesty+3.566
bek+3.511
icious+3.401
 majesty+3.389
Neg logit contributions
.-2.127
 and-2.054
,-2.021
 to-1.557
 with-1.439
Tokenoh
Token ID1219
Feature activation-4.57e-01

Pos logit contributions
.+0.614
,+0.598
 and+0.587
 to+0.515
 with+0.507
Neg logit contributions
Ta-0.367
 Majesty-0.326
 former-0.316
Uh-0.316
icious-0.316
Token,"
Token ID553
Feature activation+1.22e-01

Pos logit contributions
 Majesty+1.818
Ta+1.747
 majesty+1.725
bek+1.685
icious+1.656
Neg logit contributions
.-0.753
 and-0.716
,-0.661
 to-0.535
 with-0.493
Token said
Token ID531
Feature activation-7.00e-02

Pos logit contributions
.+0.300
,+0.289
 and+0.283
 to+0.228
 with+0.217
Neg logit contributions
Ta-0.418
Uh-0.388
 Majesty-0.376
bek-0.370
 former-0.369
Token Jack
Token ID3619
Feature activation-2.34e-01

Pos logit contributions
Ta+0.178
 Majesty+0.159
Uh+0.159
bek+0.157
umm+0.155
Neg logit contributions
.-0.222
 and-0.220
,-0.214
 to-0.182
 with-0.179
Token
Token ID198
Feature activation+1.94e-01

Pos logit contributions
.+0.303
,+0.291
 and+0.268
 with+0.175
 to+0.175
Neg logit contributions
Ta-0.858
Uh-0.785
bek-0.765
icious-0.764
 Majesty-0.757
Tokenâ
Token ID22940
Feature activation-5.73e-01

Pos logit contributions
.+0.750
,+0.731
 and+0.718
 to+0.627
 with+0.618
Neg logit contributions
Ta-0.447
 Majesty-0.392
Uh-0.385
icious-0.382
 former-0.380
Token€
Token ID26391
Feature activation-7.77e-01

Pos logit contributions
Ta+0.933
 Majesty+0.799
Uh+0.772
 former+0.758
umm+0.751
Neg logit contributions
.-2.607
,-2.543
 and-2.510
 to-2.226
 with-2.207
Token�
Token ID129
Feature activation-5.63e-01

Pos logit contributions
Ta+2.769
 Majesty+2.624
agna+2.559
bek+2.534
 majesty+2.507
Neg logit contributions
.-1.777
,-1.659
 and-1.630
 to-1.311
 with-1.278
Token�
Token ID241
Feature activation-6.03e-01

Pos logit contributions
Ta+1.049
 Majesty+0.919
icious+0.883
 former+0.875
agna+0.862
Neg logit contributions
.-2.356
,-2.314
 and-2.265
 to-2.012
 with-1.984
TokenUh
Token ID34653
Feature activation-1.03e+00

Pos logit contributions
Ta+1.455
 Majesty+1.281
Uh+1.262
icious+1.250
 former+1.248
Neg logit contributions
.-2.257
,-2.200
 and-2.158
 to-1.875
 with-1.847
Token-
Token ID12
Feature activation+1.61e-01

Pos logit contributions
Ta+3.661
 Majesty+3.581
bek+3.543
 majesty+3.422
icious+3.389
Neg logit contributions
.-1.949
 and-1.935
,-1.831
 to-1.457
 with-1.283
Tokenoh
Token ID1219
Feature activation-4.55e-01

Pos logit contributions
.+0.629
,+0.612
 and+0.601
 to+0.529
 with+0.520
Neg logit contributions
Ta-0.362
 Majesty-0.321
Uh-0.313
 former-0.313
icious-0.312
Token,
Token ID11
Feature activation+5.27e-01

Pos logit contributions
 Majesty+2.187
 majesty+2.083
Ta+2.049
bek+2.012
agna+1.985
Neg logit contributions
.-0.309
 and-0.305
,-0.240
 to-0.101
 with-0.075
Tokenâ
Token ID22940
Feature activation-5.68e-01

Pos logit contributions
.+1.454
,+1.376
 and+1.341
 to+1.140
 with+1.093
Neg logit contributions
Ta-1.704
 Majesty-1.579
Uh-1.560
icious-1.557
bek-1.530
Token€
Token ID26391
Feature activation-7.72e-01

Pos logit contributions
Ta+0.732
 Majesty+0.577
Uh+0.553
 former+0.544
icious+0.542
Neg logit contributions
.-2.777
,-2.719
 and-2.681
 to-2.410
 with-2.386
Token too
Token ID1165
Feature activation+2.10e-01

Pos logit contributions
Ta+2.021
 former+2.005
 Majesty+1.989
icious+1.985
uries+1.975
Neg logit contributions
.-0.306
,-0.289
 and-0.274
 the-0.089
 with-0.073
Token.
Token ID13
Feature activation+5.85e-01

Pos logit contributions
 and+0.043
,+0.042
.+0.023
 with+0.008
 in-0.008
Neg logit contributions
rated-0.847
icious-0.843
heit-0.841
bek-0.836
Uh-0.835
Token
Token ID198
Feature activation+2.13e-01

Pos logit contributions
.+1.478
,+1.421
 and+1.369
 to+1.109
 with+1.063
Neg logit contributions
Ta-2.074
Uh-1.926
 Majesty-1.882
agna-1.851
icious-1.845
Token
Token ID198
Feature activation+2.12e-01

Pos logit contributions
.+0.325
,+0.319
 and+0.297
 to+0.211
 in+0.195
Neg logit contributions
Ta-0.928
Uh-0.866
icious-0.843
 Majesty-0.840
bek-0.837
Token"
Token ID1
Feature activation+3.36e-02

Pos logit contributions
.+0.490
,+0.467
 and+0.455
 with+0.348
 to+0.346
Neg logit contributions
Ta-0.827
 Majesty-0.782
umm-0.774
 former-0.771
icious-0.761
TokenUh
Token ID34653
Feature activation-1.03e+00

Pos logit contributions
.+0.134
 and+0.129
,+0.126
 with+0.114
 to+0.113
Neg logit contributions
 Majesty-0.079
 former-0.074
 majesty-0.071
umm-0.070
Ta-0.066
Token-
Token ID12
Feature activation+1.59e-01

Pos logit contributions
Ta+3.622
 Majesty+3.556
bek+3.505
 majesty+3.452
Uh+3.379
Neg logit contributions
 and-1.914
.-1.912
,-1.774
 to-1.433
 with-1.293
Tokenoh
Token ID1219
Feature activation-4.56e-01

Pos logit contributions
.+0.637
,+0.622
 and+0.612
 to+0.532
 with+0.528
Neg logit contributions
Ta-0.340
 Majesty-0.290
Uh-0.290
bek-0.285
icious-0.282
Token!"
Token ID2474
Feature activation+6.14e-02

Pos logit contributions
 Majesty+2.030
 majesty+1.953
Ta+1.874
bek+1.817
agna+1.811
Neg logit contributions
.-0.444
 and-0.429
,-0.341
 to-0.252
 with-0.228
Token Tom
Token ID4186
Feature activation-7.38e-02

Pos logit contributions
.+0.142
,+0.133
 and+0.133
 to+0.105
 with+0.100
Neg logit contributions
Ta-0.221
Uh-0.206
 Majesty-0.198
 sidelines-0.194
 majesty-0.192
Token said
Token ID531
Feature activation-5.89e-02

Pos logit contributions
Ta+0.304
Uh+0.302
Quick+0.279
ited+0.277
bek+0.277
Neg logit contributions
.-0.075
,-0.063
 and-0.038
 to-0.035
 the-0.031
Token lake
Token ID13546
Feature activation-5.15e-01

Pos logit contributions
.+0.871
,+0.842
 and+0.827
 to+0.682
 with+0.670
Neg logit contributions
Ta-0.899
 Majesty-0.814
bek-0.809
icious-0.802
Uh-0.795
Token was
Token ID373
Feature activation+1.07e-01

Pos logit contributions
Ta+1.870
Uh+1.811
umm+1.782
 former+1.776
 Majesty+1.773
Neg logit contributions
.-1.025
,-0.946
 and-0.893
 to-0.708
 with-0.685
Token once
Token ID1752
Feature activation+3.71e-02

Pos logit contributions
.+0.294
,+0.284
 and+0.278
 to+0.225
 with+0.222
Neg logit contributions
Ta-0.316
 former-0.309
Uh-0.303
icious-0.303
 Majesty-0.300
Token again
Token ID757
Feature activation+1.67e-01

Pos logit contributions
.+0.074
,+0.073
 and+0.069
 to+0.052
 with+0.050
Neg logit contributions
Ta-0.133
 former-0.131
 Majesty-0.125
Uh-0.124
icious-0.123
Token filled
Token ID5901
Feature activation-1.98e-01

Pos logit contributions
.+0.034
,+0.029
 and+0.026
 to-0.021
 with-0.036
Neg logit contributions
 former-0.817
 Majesty-0.796
 lifetime-0.781
icious-0.779
Ta-0.777
Token with
Token ID351
Feature activation+3.63e-01

Pos logit contributions
 former+0.782
bek+0.766
 fullest+0.730
Ta+0.725
umm+0.723
Neg logit contributions
,-0.211
.-0.197
 and-0.182
 to-0.114
 at-0.055
Token warm
Token ID5814
Feature activation-1.77e-01

Pos logit contributions
.+1.006
,+0.980
 and+0.945
 to+0.785
 with+0.773
Neg logit contributions
Ta-1.173
 former-1.081
 Majesty-1.068
Uh-1.061
umm-1.049
Token and
Token ID290
Feature activation+5.14e-01

Pos logit contributions
umm+0.525
Ta+0.494
 former+0.468
 pulp+0.459
Uh+0.459
Neg logit contributions
.-0.440
,-0.416
 and-0.406
 to-0.358
 with-0.341
Token happy
Token ID3772
Feature activation-1.30e-01

Pos logit contributions
.+1.573
,+1.517
 and+1.487
 to+1.251
 with+1.231
Neg logit contributions
Ta-1.613
 Majesty-1.479
icious-1.444
Uh-1.436
agna-1.435
Token water
Token ID1660
Feature activation-1.57e-01

Pos logit contributions
 former+0.565
umm+0.563
 Majesty+0.556
Ta+0.541
 majesty+0.537
Neg logit contributions
,-0.133
.-0.129
 and-0.122
 to-0.065
 with-0.062
Token.
Token ID13
Feature activation+5.74e-01

Pos logit contributions
umm+0.744
 Majesty+0.736
 former+0.733
Ta+0.730
 sidelines+0.723
Neg logit contributions
,-0.045
 and-0.030
.-0.029
 with+0.032
 to+0.044
Token the
Token ID262
Feature activation+2.91e-01

Pos logit contributions
.+1.141
,+1.089
 and+1.056
 to+0.817
 with+0.794
Neg logit contributions
Ta-1.979
 Majesty-1.837
Uh-1.817
 former-1.807
icious-1.806
Token squirrel
Token ID33039
Feature activation-3.71e-01

Pos logit contributions
.+0.959
,+0.927
 and+0.913
 to+0.772
 with+0.752
Neg logit contributions
Ta-0.844
icious-0.750
 Majesty-0.747
Uh-0.745
bek-0.742
Tokens
Token ID82
Feature activation+6.78e-02

Pos logit contributions
Ta+1.398
 Majesty+1.384
 former+1.380
mic+1.347
 sidelines+1.345
Neg logit contributions
.-0.487
,-0.461
 and-0.449
 with-0.300
 to-0.298
Token.
Token ID13
Feature activation+5.68e-01

Pos logit contributions
.+0.135
,+0.130
 and+0.127
 with+0.097
 to+0.093
Neg logit contributions
Ta-0.244
 Majesty-0.242
 former-0.239
 majesty-0.234
icious-0.228
Token
Token ID198
Feature activation+1.86e-01

Pos logit contributions
.+1.352
,+1.294
 and+1.259
 to+0.989
 with+0.964
Neg logit contributions
Ta-2.151
 Majesty-1.987
Uh-1.967
icious-1.960
 former-1.953
Token
Token ID198
Feature activation+1.86e-01

Pos logit contributions
.+0.285
,+0.272
 and+0.256
 to+0.174
 with+0.162
Neg logit contributions
Ta-0.841
Uh-0.781
 Majesty-0.780
icious-0.777
 former-0.774
TokenOne
Token ID3198
Feature activation-3.90e-01

Pos logit contributions
.+0.569
,+0.544
 and+0.543
 with+0.455
 to+0.448
Neg logit contributions
Ta-0.561
umm-0.549
 Majesty-0.538
agna-0.523
 former-0.513
Token day
Token ID1110
Feature activation-1.17e-01

Pos logit contributions
Ta+1.107
Uh+1.016
 former+0.995
bek+0.978
icious+0.968
Neg logit contributions
.-1.090
,-1.033
 and-1.026
 to-0.872
 with-0.849
Token,
Token ID11
Feature activation+5.30e-01

Pos logit contributions
Ta+0.569
Uh+0.554
 Majesty+0.551
 majesty+0.541
agna+0.537
Neg logit contributions
.-0.057
 and-0.042
,-0.003
 to+0.015
 with+0.026
Token a
Token ID257
Feature activation+2.41e-01

Pos logit contributions
.+0.886
,+0.795
 and+0.787
 to+0.526
 with+0.489
Neg logit contributions
Ta-2.258
 Majesty-2.146
Uh-2.132
bek-2.106
 former-2.099
Token heavy
Token ID4334
Feature activation-2.66e-01

Pos logit contributions
.+0.595
,+0.589
 and+0.559
 to+0.450
 with+0.431
Neg logit contributions
Ta-0.908
icious-0.830
 Majesty-0.829
umm-0.827
 former-0.826
TokenMom
Token ID29252
Feature activation-4.10e-01

Pos logit contributions
.+0.697
,+0.670
 and+0.669
 with+0.568
 to+0.550
Neg logit contributions
Ta-0.622
umm-0.604
 Majesty-0.603
 former-0.587
agna-0.573
Token leads
Token ID5983
Feature activation-4.00e-01

Pos logit contributions
Ta+1.314
 Majesty+1.151
Uh+1.145
icious+1.130
 majesty+1.126
Neg logit contributions
.-1.080
,-1.006
 and-0.921
 to-0.831
 with-0.819
Token Anna
Token ID11735
Feature activation-2.17e-01

Pos logit contributions
Ta+1.382
 sidelines+1.309
 Majesty+1.301
agna+1.275
bek+1.266
Neg logit contributions
.-0.763
,-0.724
 and-0.684
 with-0.517
 in-0.456
Token and
Token ID290
Feature activation+5.28e-01

Pos logit contributions
Ta+0.856
uries+0.833
 Majesty+0.825
agna+0.818
 supervision+0.813
Neg logit contributions
.-0.193
,-0.178
 and-0.118
 with-0.035
 the-0.032
Token Ben
Token ID3932
Feature activation-1.52e-01

Pos logit contributions
.+1.421
,+1.367
 and+1.330
 to+1.081
 with+1.070
Neg logit contributions
Ta-1.818
 Majesty-1.671
Uh-1.636
 former-1.620
agna-1.613
Token to
Token ID284
Feature activation+3.86e-01

Pos logit contributions
Ta+0.593
agna+0.565
 supervision+0.564
uries+0.560
 Majesty+0.554
Neg logit contributions
.-0.149
,-0.144
 and-0.130
 with-0.058
 the-0.046
Token the
Token ID262
Feature activation+2.97e-01

Pos logit contributions
.+0.565
,+0.539
 and+0.509
 to+0.313
 with+0.304
Neg logit contributions
Ta-1.759
 Majesty-1.647
 former-1.640
 sidelines-1.631
Uh-1.618
Token new
Token ID649
Feature activation-6.95e-02

Pos logit contributions
.+1.001
,+0.968
 and+0.947
 to+0.813
 with+0.799
Neg logit contributions
Ta-0.841
Uh-0.748
 Majesty-0.747
icious-0.746
umm-0.741
Token room
Token ID2119
Feature activation-2.32e-01

Pos logit contributions
Ta+0.164
 Majesty+0.164
 former+0.164
 majesty+0.154
umm+0.151
Neg logit contributions
.-0.215
,-0.211
 and-0.209
 to-0.177
 with-0.168
Token.
Token ID13
Feature activation+5.92e-01

Pos logit contributions
 former+1.019
bek+0.986
 Majesty+0.978
Ta+0.975
agna+0.973
Neg logit contributions
,-0.051
 and-0.020
.-0.001
 the+0.051
 to+0.068
Token It
Token ID632
Feature activation-2.15e-01

Pos logit contributions
.+1.536
,+1.481
 and+1.433
 to+1.152
 with+1.134
Neg logit contributions
Ta-2.090
Uh-1.915
 Majesty-1.912
 former-1.883
umm-1.880
Token coming
Token ID2406
Feature activation-1.13e-01

Pos logit contributions
Ta+0.958
Uh+0.897
vation+0.885
mic+0.871
 sidelines+0.870
Neg logit contributions
,-0.349
.-0.335
 and-0.328
 to-0.243
 with-0.213
Token from
Token ID422
Feature activation+2.33e-01

Pos logit contributions
Ta+0.386
bek+0.368
Uh+0.366
chie+0.352
ksh+0.350
Neg logit contributions
,-0.177
.-0.161
 and-0.155
 that-0.108
 with-0.103
Token next
Token ID1306
Feature activation+6.17e-02

Pos logit contributions
,+0.394
.+0.375
 and+0.349
 to+0.253
 with+0.242
Neg logit contributions
Ta-0.909
 sidelines-0.850
Uh-0.843
 former-0.832
bek-0.810
Token to
Token ID284
Feature activation+3.86e-01

Pos logit contributions
,+0.069
.+0.067
 and+0.065
 with+0.046
 in+0.035
Neg logit contributions
Ta-0.245
umm-0.237
bek-0.236
 Majesty-0.235
 sidelines-0.234
Token him
Token ID683
Feature activation+2.53e-02

Pos logit contributions
,+0.825
.+0.805
 and+0.790
 to+0.611
 with+0.595
Neg logit contributions
Ta-1.374
 Majesty-1.248
bek-1.246
Uh-1.227
icious-1.221
Token.
Token ID13
Feature activation+5.73e-01

Pos logit contributions
,+0.006
 and+0.005
.+0.000
 to-0.001
 the-0.001
Neg logit contributions
 Majesty-0.100
 former-0.100
bek-0.098
agna-0.098
uries-0.097
Token It
Token ID632
Feature activation-2.13e-01

Pos logit contributions
.+1.565
,+1.478
 and+1.445
 with+1.182
 in+1.178
Neg logit contributions
Ta-1.799
Uh-1.637
 Majesty-1.585
agna-1.582
 seventeen-1.574
Token was
Token ID373
Feature activation+1.09e-01

Pos logit contributions
Ta+0.840
Uh+0.799
 Majesty+0.764
icious+0.758
 former+0.753
Neg logit contributions
.-0.409
,-0.381
 and-0.373
 to-0.292
 with-0.266
Token a
Token ID257
Feature activation+2.41e-01

Pos logit contributions
.+0.176
,+0.164
 and+0.155
 to+0.106
 with+0.099
Neg logit contributions
Ta-0.464
Uh-0.438
 former-0.433
icious-0.428
 Majesty-0.427
Token little
Token ID1310
Feature activation-7.26e-02

Pos logit contributions
.+0.686
,+0.673
 and+0.652
 to+0.538
 with+0.526
Neg logit contributions
Ta-0.809
 Majesty-0.739
umm-0.733
icious-0.733
 former-0.723
Token bird
Token ID6512
Feature activation-2.15e-01

Pos logit contributions
Ta+0.196
Uh+0.179
 Majesty+0.176
umm+0.174
 former+0.173
Neg logit contributions
.-0.239
,-0.229
 and-0.226
 to-0.193
 with-0.189
Token.
Token ID13
Feature activation+5.71e-01

Pos logit contributions
,+0.004
 and+0.003
.+0.000
 with-0.006
 to-0.007
Neg logit contributions
 Majesty-0.109
 former-0.109
agna-0.107
 majesty-0.106
bek-0.105
Token He
Token ID679
Feature activation-1.55e-01

Pos logit contributions
.+1.547
,+1.491
 and+1.458
 to+1.186
 with+1.160
Neg logit contributions
Ta-1.940
 Majesty-1.766
Uh-1.752
umm-1.748
icious-1.735
Token knew
Token ID2993
Feature activation-2.98e-01

Pos logit contributions
Ta+0.521
 Majesty+0.475
Uh+0.470
 former+0.466
bek+0.463
Neg logit contributions
.-0.429
,-0.414
 and-0.401
 to-0.330
 with-0.324
Token he
Token ID339
Feature activation+2.71e-02

Pos logit contributions
 Majesty+1.084
Ta+1.064
 majesty+1.037
 former+1.023
agna+1.006
Neg logit contributions
.-0.531
,-0.482
 and-0.480
 with-0.341
 to-0.292
Token had
Token ID550
Feature activation-1.38e-01

Pos logit contributions
.+0.075
,+0.072
 and+0.070
 to+0.057
 with+0.056
Neg logit contributions
Ta-0.091
 Majesty-0.083
Uh-0.083
icious-0.082
agna-0.082
Token to
Token ID284
Feature activation+3.85e-01

Pos logit contributions
Ta+0.654
 Majesty+0.643
agna+0.628
 former+0.620
 majesty+0.620
Neg logit contributions
.-0.137
,-0.130
 and-0.119
 with-0.051
 in-0.024
Token remove
Token ID4781
Feature activation-4.74e-01

Pos logit contributions
.+1.206
,+1.170
 and+1.154
 to+0.990
 with+0.958
Neg logit contributions
Ta-1.098
 Majesty-1.015
 former-1.001
icious-0.990
agna-0.973
Token it
Token ID340
Feature activation+1.64e-01

Pos logit contributions
Ta+1.718
 ounces+1.714
 former+1.678
 Majesty+1.669
heit+1.664
Neg logit contributions
.-0.648
,-0.635
 and-0.588
 to-0.421
 with-0.354
Token,
Token ID11
Feature activation+5.29e-01

Pos logit contributions
,+0.118
.+0.112
 and+0.104
 to+0.049
 the+0.042
Neg logit contributions
 former-0.600
umm-0.588
rated-0.584
uries-0.582
 ounces-0.580
Token but
Token ID475
Feature activation+7.17e-02

Pos logit contributions
.+1.010
,+0.950
 and+0.856
 to+0.619
 with+0.611
Neg logit contributions
Ta-2.147
 former-2.006
Uh-2.000
 Majesty-1.988
agna-1.981
Token he
Token ID339
Feature activation+2.73e-02

Pos logit contributions
.+0.138
,+0.126
 and+0.123
 to+0.088
 with+0.084
Neg logit contributions
Ta-0.288
 Majesty-0.270
Uh-0.270
 former-0.268
bek-0.268
Token,
Token ID11
Feature activation+5.42e-01

Pos logit contributions
 Majesty+1.485
 majesty+1.449
heit+1.421
 ounces+1.412
 supervision+1.396
Neg logit contributions
 and-0.253
.-0.178
,-0.150
 to-0.134
 with-0.128
Token Tim
Token ID5045
Feature activation-1.12e-01

Pos logit contributions
.+1.370
,+1.290
 and+1.251
 to+1.039
 with+1.002
Neg logit contributions
Ta-1.915
 Majesty-1.798
Uh-1.778
icious-1.770
umm-1.745
Token.
Token ID13
Feature activation+5.81e-01

Pos logit contributions
 Majesty+0.480
wy+0.476
rated+0.470
 majesty+0.469
bek+0.450
Neg logit contributions
 and-0.037
,-0.023
 to-0.020
.-0.018
 with-0.015
Token Maybe
Token ID6674
Feature activation-4.48e-01

Pos logit contributions
.+1.832
,+1.773
 and+1.735
 to+1.462
 with+1.437
Neg logit contributions
Ta-1.766
 Majesty-1.595
Uh-1.581
icious-1.580
 former-1.568
Token we
Token ID356
Feature activation-1.95e-01

Pos logit contributions
Ta+1.430
 Majesty+1.380
Uh+1.337
 former+1.325
agna+1.316
Neg logit contributions
.-0.875
 and-0.852
,-0.839
 with-0.627
 to-0.626
Token can
Token ID460
Feature activation-1.56e-01

Pos logit contributions
Ta+0.626
 Majesty+0.575
Uh+0.573
agna+0.569
 former+0.567
Neg logit contributions
.-0.542
,-0.524
 and-0.509
 to-0.419
 with-0.418
Token ask
Token ID1265
Feature activation-3.19e-01

Pos logit contributions
Ta+0.406
 Majesty+0.401
 former+0.395
icious+0.380
agna+0.373
Neg logit contributions
.-0.439
,-0.429
 and-0.424
 to-0.361
 with-0.348
Token for
Token ID329
Feature activation+2.72e-01

Pos logit contributions
Ta+1.218
 Majesty+1.215
 former+1.205
 Deal+1.180
agna+1.175
Neg logit contributions
.-0.407
,-0.403
 and-0.398
 with-0.245
 to-0.221
Token help
Token ID1037
Feature activation-1.44e-01

Pos logit contributions
.+0.647
,+0.627
 and+0.617
 with+0.470
 to+0.469
Neg logit contributions
Ta-0.996
 Majesty-0.929
 former-0.912
Uh-0.897
agna-0.895
Token."
Token ID526
Feature activation+1.25e-01

Pos logit contributions
agna+0.548
 former+0.540
icious+0.539
 fiber+0.539
arians+0.539
Neg logit contributions
,-0.123
 and-0.111
.-0.093
 to-0.036
 with-0.033
Token Sam
Token ID3409
Feature activation-1.73e-01

Pos logit contributions
.+0.339
,+0.325
 and+0.323
 with+0.264
 to+0.259
Neg logit contributions
Ta-0.409
Uh-0.374
 Majesty-0.363
 former-0.360
 sidelines-0.350
Token again
Token ID757
Feature activation+1.66e-01

Pos logit contributions
bars+1.234
 salon+1.227
Whe+1.216
 sidelines+1.214
 seventeen+1.197
Neg logit contributions
,-0.056
.-0.011
 and+0.011
 to+0.065
 in+0.066
Token and
Token ID290
Feature activation+5.28e-01

Pos logit contributions
,+0.040
 and+0.021
.+0.000
 to-0.006
 with-0.025
Neg logit contributions
 former-0.677
bek-0.656
 lifetime-0.636
 Majesty-0.619
brance-0.614
Token again
Token ID757
Feature activation+1.66e-01

Pos logit contributions
.+1.325
,+1.263
 and+1.222
 to+0.977
 with+0.936
Neg logit contributions
Ta-1.867
 Majesty-1.691
 sidelines-1.677
Uh-1.674
agna-1.654
Token.
Token ID13
Feature activation+5.87e-01

Pos logit contributions
 and+0.055
,+0.047
.+0.000
 to-0.015
 with-0.021
Neg logit contributions
 former-0.708
 Majesty-0.706
 lifetime-0.682
 majesty-0.680
bek-0.679
Token They
Token ID1119
Feature activation-1.09e-01

Pos logit contributions
.+1.405
,+1.350
 and+1.310
 to+1.030
 with+1.004
Neg logit contributions
Ta-2.225
 Majesty-2.054
Uh-2.035
 former-2.023
icious-2.020
Token are
Token ID389
Feature activation-1.44e-01

Pos logit contributions
Ta+0.345
 Majesty+0.312
Uh+0.305
icious+0.304
 former+0.303
Neg logit contributions
.-0.315
,-0.304
 and-0.297
 to-0.250
 with-0.245
Token not
Token ID407
Feature activation-5.88e-02

Pos logit contributions
Ta+0.442
 Majesty+0.425
icious+0.411
 majesty+0.404
 former+0.404
Neg logit contributions
.-0.393
,-0.381
 and-0.371
 to-0.310
 with-0.289
Token shy
Token ID15800
Feature activation-3.32e-01

Pos logit contributions
Ta+0.155
 Majesty+0.154
Uh+0.148
 majesty+0.145
 former+0.144
Neg logit contributions
.-0.172
,-0.170
 and-0.166
 to-0.140
 with-0.139
Token anymore
Token ID7471
Feature activation-7.24e-02

Pos logit contributions
umm+0.956
 Majesty+0.883
ubb+0.857
 ounces+0.841
icious+0.833
Neg logit contributions
,-0.649
 and-0.633
.-0.605
 the-0.541
 with-0.533
Token.
Token ID13
Feature activation+5.88e-01

Pos logit contributions
 Majesty+0.289
 former+0.288
 majesty+0.282
 ounces+0.279
agna+0.278
Neg logit contributions
 and-0.024
,-0.022
 to-0.006
.-0.000
 with+0.003
Token They
Token ID1119
Feature activation-1.08e-01

Pos logit contributions
.+1.189
,+1.134
 and+1.095
 to+0.817
 with+0.789
Neg logit contributions
Ta-2.440
 Majesty-2.269
Uh-2.252
 former-2.238
icious-2.238
Token
Token ID198
Feature activation+1.99e-01

Pos logit contributions
Ta+0.596
icious+0.557
 Deal+0.553
Uh+0.550
 Majesty+0.544
Neg logit contributions
,-0.264
.-0.248
 and-0.241
 to-0.176
 with-0.161
Token
Token ID198
Feature activation+1.99e-01

Pos logit contributions
.+0.308
,+0.301
 and+0.281
 to+0.191
 with+0.176
Neg logit contributions
Ta-0.856
Uh-0.803
icious-0.782
 former-0.773
 Majesty-0.772
TokenOne
Token ID3198
Feature activation-3.82e-01

Pos logit contributions
.+0.655
,+0.631
 and+0.623
 to+0.525
 with+0.521
Neg logit contributions
Ta-0.585
 Majesty-0.535
umm-0.524
agna-0.516
 former-0.515
Token day
Token ID1110
Feature activation-1.10e-01

Pos logit contributions
Ta+1.141
Uh+1.006
bek+0.978
umm+0.965
 former+0.961
Neg logit contributions
.-1.083
,-1.042
 and-1.018
 to-0.858
 with-0.841
Token,
Token ID11
Feature activation+5.32e-01

Pos logit contributions
Ta+0.498
Uh+0.486
agna+0.471
 Majesty+0.467
Whe+0.466
Neg logit contributions
.-0.066
 and-0.046
,-0.001
 to-0.001
 with+0.012
Token his
Token ID465
Feature activation+8.27e-02

Pos logit contributions
.+1.002
,+0.885
 and+0.880
 to+0.637
 with+0.581
Neg logit contributions
Ta-2.151
Uh-2.043
agna-1.975
bek-1.970
 Majesty-1.949
Token mom
Token ID1995
Feature activation-7.95e-02

Pos logit contributions
.+0.211
,+0.203
 and+0.198
 to+0.159
 with+0.154
Neg logit contributions
Ta-0.303
 Majesty-0.279
Uh-0.277
icious-0.276
 former-0.274
Token gave
Token ID2921
Feature activation-2.37e-01

Pos logit contributions
Ta+0.236
Uh+0.220
icious+0.207
bek+0.206
 Majesty+0.205
Neg logit contributions
.-0.227
,-0.213
 and-0.200
 to-0.174
 with-0.174
Token him
Token ID683
Feature activation+2.57e-02

Pos logit contributions
Ta+0.849
 Majesty+0.765
Uh+0.764
agna+0.751
umm+0.750
Neg logit contributions
.-0.561
,-0.538
 and-0.512
 with-0.400
 to-0.383
Token a
Token ID257
Feature activation+2.43e-01

Pos logit contributions
.+0.037
,+0.036
 and+0.033
 with+0.021
 to+0.017
Neg logit contributions
Ta-0.114
 Majesty-0.103
 former-0.103
agna-0.103
icious-0.101
Token special
Token ID2041
Feature activation-2.40e-01

Pos logit contributions
.+0.638
,+0.620
 and+0.602
 to+0.497
 with+0.473
Neg logit contributions
Ta-0.862
 Majesty-0.801
icious-0.791
umm-0.789
 former-0.787
Token felt
Token ID2936
Feature activation-2.09e-01

Pos logit contributions
Ta+0.327
 Majesty+0.294
Uh+0.293
 former+0.290
icious+0.289
Neg logit contributions
.-0.307
,-0.297
 and-0.288
 to-0.243
 with-0.237
Token sorry
Token ID7926
Feature activation-4.27e-01

Pos logit contributions
Ta+0.609
 Majesty+0.549
Uh+0.533
 former+0.532
 sidelines+0.529
Neg logit contributions
.-0.594
,-0.576
 and-0.571
 to-0.482
 with-0.457
Token and
Token ID290
Feature activation+5.30e-01

Pos logit contributions
 Deal+1.332
 former+1.295
icious+1.254
 Majesty+1.218
mic+1.218
Neg logit contributions
,-0.399
.-0.376
 and-0.294
 with-0.289
 the-0.284
Token ashamed
Token ID22461
Feature activation-5.00e-01

Pos logit contributions
.+1.915
,+1.865
 and+1.830
 to+1.578
 with+1.558
Neg logit contributions
Ta-1.337
 Majesty-1.182
Uh-1.171
 former-1.150
icious-1.140
Token.
Token ID13
Feature activation+5.89e-01

Pos logit contributions
umm+1.910
Uh+1.808
icious+1.780
 Deal+1.778
Really+1.768
Neg logit contributions
,-0.191
 and-0.140
 the-0.024
.-0.005
 with+0.031
Token They
Token ID1119
Feature activation-1.03e-01

Pos logit contributions
.+1.490
,+1.433
 and+1.392
 to+1.119
 with+1.091
Neg logit contributions
Ta-2.118
 Majesty-1.969
Uh-1.955
 former-1.933
umm-1.912
Token did
Token ID750
Feature activation-2.24e-01

Pos logit contributions
Ta+0.325
 Majesty+0.297
Uh+0.292
icious+0.291
 former+0.290
Neg logit contributions
.-0.312
,-0.302
 and-0.295
 to-0.246
 with-0.241
Token not
Token ID407
Feature activation-5.84e-02

Pos logit contributions
Ta+0.599
 Majesty+0.588
 former+0.586
 Deal+0.580
icious+0.575
Neg logit contributions
,-0.594
.-0.584
 and-0.582
 to-0.470
 with-0.462
Token know
Token ID760
Feature activation-3.65e-01

Pos logit contributions
Ta+0.158
 Majesty+0.140
Uh+0.139
icious+0.138
 former+0.136
Neg logit contributions
.-0.191
,-0.188
 and-0.185
 to-0.158
 with-0.157
Token about
Token ID546
Feature activation+8.36e-02

Pos logit contributions
 Majesty+1.112
agna+1.085
Ta+1.069
 majesty+1.063
ubb+1.060
Neg logit contributions
 and-0.667
,-0.645
.-0.634
 to-0.514
 with-0.506
Token the
Token ID262
Feature activation+2.98e-01

Pos logit contributions
.+0.136
,+0.133
 and+0.128
 to+0.088
 with+0.084
Neg logit contributions
Ta-0.356
 Majesty-0.345
 former-0.340
agna-0.335
icious-0.333
Token showed
Token ID3751
Feature activation-2.70e-01

Pos logit contributions
.+0.082
,+0.079
 and+0.076
 to+0.065
 with+0.063
Neg logit contributions
Ta-0.082
 Majesty-0.075
 former-0.073
Uh-0.073
agna-0.072
Token them
Token ID606
Feature activation+1.25e-03

Pos logit contributions
Ta+1.034
 Majesty+1.009
 former+0.951
 majesty+0.951
 sidelines+0.933
Neg logit contributions
.-0.518
,-0.487
 and-0.462
 with-0.318
 to-0.312
Token all
Token ID477
Feature activation+1.71e-01

Pos logit contributions
.+0.002
,+0.002
 and+0.002
 with+0.001
 in+0.001
Neg logit contributions
Ta-0.005
 Majesty-0.004
 former-0.004
agna-0.004
 majesty-0.004
Token the
Token ID262
Feature activation+2.90e-01

Pos logit contributions
,+0.391
.+0.390
 and+0.364
 with+0.294
 to+0.277
Neg logit contributions
Ta-0.521
 former-0.460
 sidelines-0.451
bek-0.451
 Majesty-0.449
Token pictures
Token ID5986
Feature activation-2.59e-01

Pos logit contributions
.+0.868
,+0.839
 and+0.819
 to+0.683
 with+0.670
Neg logit contributions
Ta-0.922
 Majesty-0.840
Uh-0.828
icious-0.823
 former-0.823
Token he
Token ID339
Feature activation+2.70e-02

Pos logit contributions
Ta+1.062
umm+1.016
vation+1.003
agna+0.992
icious+0.991
Neg logit contributions
.-0.229
,-0.223
 and-0.176
 to-0.090
 with-0.075
Token had
Token ID550
Feature activation-1.39e-01

Pos logit contributions
.+0.075
,+0.073
 and+0.071
 to+0.058
 with+0.057
Neg logit contributions
Ta-0.089
 Majesty-0.081
Uh-0.080
 former-0.080
icious-0.079
Token drawn
Token ID7428
Feature activation-4.56e-01

Pos logit contributions
Ta+0.354
 Majesty+0.330
 former+0.325
agna+0.311
bek+0.310
Neg logit contributions
,-0.415
.-0.406
 and-0.390
 with-0.324
 to-0.322
Token in
Token ID287
Feature activation+3.37e-01

Pos logit contributions
Ta+1.948
bridge+1.933
 former+1.928
 Majesty+1.921
umm+1.915
Neg logit contributions
,-0.211
.-0.132
 and-0.097
 the+0.074
 to+0.088
Token his
Token ID465
Feature activation+8.24e-02

Pos logit contributions
,+0.573
.+0.572
 and+0.535
 to+0.404
 with+0.370
Neg logit contributions
Ta-1.312
 former-1.291
 Majesty-1.250
agna-1.224
Uh-1.224
Token notebook
Token ID20922
Feature activation-5.58e-01

Pos logit contributions
.+0.338
,+0.329
 and+0.324
 to+0.286
 with+0.283
Neg logit contributions
Ta-0.168
 Majesty-0.146
 former-0.141
Uh-0.140
umm-0.139
Token he
Token ID339
Feature activation+3.19e-02

Pos logit contributions
Ta+1.105
Uh+1.045
 Majesty+1.002
umm+0.990
bek+0.988
Neg logit contributions
.-0.732
,-0.693
 and-0.674
 to-0.552
 with-0.511
Token said
Token ID531
Feature activation-6.98e-02

Pos logit contributions
.+0.081
,+0.077
 and+0.076
 to+0.061
 with+0.059
Neg logit contributions
Ta-0.112
 Majesty-0.104
Uh-0.103
icious-0.102
bek-0.102
Token.
Token ID13
Feature activation+5.70e-01

Pos logit contributions
 Majesty+0.322
agna+0.313
bek+0.312
 former+0.311
Ta+0.309
Neg logit contributions
 and-0.032
.-0.024
,-0.023
 to+0.001
 with+0.012
Token '
Token ID705
Feature activation-2.64e-01

Pos logit contributions
.+1.496
,+1.439
 and+1.403
 to+1.147
 with+1.123
Neg logit contributions
Ta-1.834
 Majesty-1.722
Uh-1.711
umm-1.683
 former-1.680
TokenCan
Token ID6090
Feature activation-5.03e-01

Pos logit contributions
Ta+0.612
 Majesty+0.587
 majesty+0.553
 former+0.551
umm+0.537
Neg logit contributions
.-1.017
 and-0.974
,-0.971
 to-0.846
 with-0.844
Token I
Token ID314
Feature activation-2.77e-01

Pos logit contributions
Ta+1.565
 Majesty+1.433
Uh+1.417
icious+1.414
agna+1.407
Neg logit contributions
.-1.455
,-1.394
 and-1.369
 to-1.116
 with-1.100
Token go
Token ID467
Feature activation-6.99e-02

Pos logit contributions
Ta+0.770
icious+0.720
 former+0.708
agna+0.699
Uh+0.693
Neg logit contributions
.-0.790
,-0.759
 and-0.744
 to-0.611
 with-0.610
Token play
Token ID711
Feature activation-1.32e-01

Pos logit contributions
Ta+0.251
 former+0.240
 sidelines+0.239
agna+0.237
 Majesty+0.236
Neg logit contributions
.-0.131
,-0.122
 and-0.113
 with-0.079
 to-0.077
Token in
Token ID287
Feature activation+3.38e-01

Pos logit contributions
Ta+0.506
icious+0.492
agna+0.488
 former+0.487
 Majesty+0.484
Neg logit contributions
.-0.232
,-0.209
 and-0.201
 to-0.149
 with-0.121
Token the
Token ID262
Feature activation+2.92e-01

Pos logit contributions
.+0.505
,+0.434
 and+0.410
 to+0.297
 with+0.243
Neg logit contributions
Ta-1.330
 sidelines-1.312
 former-1.311
 Majesty-1.306
icious-1.287
Token rain
Token ID6290
Feature activation-3.17e-01

Pos logit contributions
.+1.022
,+0.993
 and+0.975
 to+0.836
 with+0.825
Neg logit contributions
Ta-0.769
 Majesty-0.691
Uh-0.678
 former-0.677
umm-0.674
Token every
Token ID790
Feature activation+1.19e-01

Pos logit contributions
,+0.175
.+0.172
 and+0.164
 to+0.118
 the+0.097
Neg logit contributions
 former-0.383
umm-0.365
 fullest-0.362
icious-0.362
 Majesty-0.359
Token day
Token ID1110
Feature activation-1.06e-01

Pos logit contributions
.+0.327
,+0.316
 and+0.305
 to+0.248
 with+0.247
Neg logit contributions
Ta-0.393
Uh-0.367
 former-0.352
umm-0.351
 Majesty-0.351
Token.
Token ID13
Feature activation+5.69e-01

Pos logit contributions
umm+0.466
mic+0.454
 lifetime+0.451
 former+0.451
agna+0.444
Neg logit contributions
,-0.031
 and-0.024
.-0.009
 to+0.019
 the+0.028
Token<|endoftext|>
Token ID50256
Feature activation-9.56e-02

Pos logit contributions
.+0.911
,+0.876
 and+0.817
 to+0.569
 with+0.538
Neg logit contributions
Ta-2.548
Uh-2.340
umm-2.326
 former-2.324
icious-2.323
TokenOnce
Token ID7454
Feature activation-3.20e-01

Pos logit contributions
Ta+0.322
Whe+0.288
Ah+0.287
Uh+0.284
rob+0.275
Neg logit contributions
,-0.174
 and-0.161
.-0.156
 to-0.128
 at-0.120
Token upon
Token ID2402
Feature activation-2.63e-01

Pos logit contributions
Ta+0.710
Uh+0.643
 Majesty+0.638
 majesty+0.632
 Deal+0.609
Neg logit contributions
.-0.948
 and-0.934
,-0.883
 with-0.809
 in-0.751
Token a
Token ID257
Feature activation+2.38e-01

Pos logit contributions
Ta+1.141
icious+1.023
Uh+1.022
Whe+1.003
Looks+0.998
Neg logit contributions
,-0.372
.-0.344
 and-0.312
 to-0.213
 with-0.185
Token time
Token ID640
Feature activation-8.52e-02

Pos logit contributions
.+0.599
,+0.575
 and+0.555
 to+0.465
 with+0.454
Neg logit contributions
Ta-0.818
icious-0.761
Uh-0.749
 Majesty-0.743
 former-0.740
Token,
Token ID11
Feature activation+5.30e-01

Pos logit contributions
Ta+0.393
 Majesty+0.388
 majesty+0.378
Uh+0.378
 Deal+0.373
Neg logit contributions
.-0.050
 and-0.033
 to-0.001
,-0.000
 with+0.007
Token there
Token ID612
Feature activation+7.38e-02

Pos logit contributions
.+1.065
 and+0.964
,+0.944
 to+0.745
 with+0.654
Neg logit contributions
Ta-2.019
Uh-1.953
bek-1.813
 Majesty-1.801
Looks-1.779
Token was
Token ID373
Feature activation+1.11e-01

Pos logit contributions
.+0.148
 and+0.134
,+0.127
 to+0.110
 with+0.097
Neg logit contributions
Ta-0.258
Uh-0.246
 Majesty-0.236
liv-0.234
umm-0.232
Token day
Token ID1110
Feature activation-1.09e-01

Pos logit contributions
.+0.832
,+0.805
 and+0.793
 to+0.664
 with+0.646
Neg logit contributions
Ta-0.934
bek-0.862
Uh-0.858
 Majesty-0.855
umm-0.854
Token on
Token ID319
Feature activation+2.71e-01

Pos logit contributions
 sidelines+0.486
icious+0.480
agna+0.479
arians+0.478
 former+0.476
Neg logit contributions
,-0.029
.-0.028
 and-0.026
 to+0.017
 with+0.018
Token.
Token ID13
Feature activation+5.65e-01

Pos logit contributions
.+0.401
,+0.389
 and+0.382
 to+0.274
 with+0.261
Neg logit contributions
 former-1.068
 Majesty-1.041
Ta-1.039
agna-1.020
bek-1.013
Token
Token ID198
Feature activation+1.89e-01

Pos logit contributions
.+1.231
,+1.176
 and+1.139
 to+0.873
 with+0.851
Neg logit contributions
Ta-2.237
 Majesty-2.083
Uh-2.059
 former-2.050
icious-2.046
Token
Token ID198
Feature activation+1.88e-01

Pos logit contributions
.+0.300
,+0.286
 and+0.270
 to+0.181
 with+0.174
Neg logit contributions
Ta-0.856
 Majesty-0.795
Uh-0.792
icious-0.787
 former-0.783
TokenThe
Token ID464
Feature activation-2.72e-01

Pos logit contributions
.+0.606
,+0.584
 and+0.579
 to+0.489
 with+0.483
Neg logit contributions
Ta-0.548
 Majesty-0.513
umm-0.508
 former-0.497
agna-0.493
Token moral
Token ID6573
Feature activation-4.95e-01

Pos logit contributions
Ta+0.855
 Majesty+0.778
Uh+0.769
icious+0.763
 former+0.762
Neg logit contributions
.-0.833
,-0.805
 and-0.788
 to-0.655
 with-0.644
Token of
Token ID286
Feature activation+1.91e-01

Pos logit contributions
Ta+2.132
Uh+1.967
 former+1.933
bek+1.926
umm+1.914
Neg logit contributions
.-0.798
,-0.749
 and-0.682
 with-0.458
 in-0.437
Token the
Token ID262
Feature activation+2.95e-01

Pos logit contributions
.+0.308
,+0.271
 and+0.256
 to+0.177
 with+0.169
Neg logit contributions
Ta-0.844
 Majesty-0.780
 former-0.774
bek-0.772
icious-0.771
Token story
Token ID1621
Feature activation-2.90e-01

Pos logit contributions
.+0.953
,+0.928
 and+0.907
 to+0.764
 with+0.749
Neg logit contributions
Ta-0.876
 Majesty-0.791
Uh-0.783
 former-0.776
icious-0.773
Token is
Token ID318
Feature activation-1.68e-02

Pos logit contributions
Ta+0.886
bek+0.864
 Majesty+0.852
 majesty+0.844
 former+0.843
Neg logit contributions
.-0.672
,-0.605
 and-0.599
 to-0.486
 with-0.482
Token.
Token ID13
Feature activation+5.66e-01

Pos logit contributions
 former+0.418
umm+0.410
 Majesty+0.408
agna+0.403
icious+0.391
Neg logit contributions
,-0.022
 and-0.019
.-0.017
 to-0.000
 the+0.006
Token So
Token ID1406
Feature activation-2.21e-01

Pos logit contributions
.+1.624
,+1.568
 and+1.533
 with+1.254
 to+1.246
Neg logit contributions
Ta-1.813
 Majesty-1.669
 former-1.638
umm-1.637
Uh-1.625
Token she
Token ID673
Feature activation+1.61e-02

Pos logit contributions
bek+0.884
 Majesty+0.877
Ta+0.871
 former+0.861
umm+0.858
Neg logit contributions
.-0.232
 and-0.195
,-0.132
 to-0.096
 with-0.072
Token started
Token ID2067
Feature activation-1.77e-01

Pos logit contributions
.+0.046
,+0.044
 and+0.043
 to+0.036
 with+0.035
Neg logit contributions
Ta-0.052
 Majesty-0.048
Uh-0.047
 former-0.047
icious-0.047
Token to
Token ID284
Feature activation+3.83e-01

Pos logit contributions
Ta+0.681
 Majesty+0.679
mic+0.673
 former+0.666
 sidelines+0.664
Neg logit contributions
.-0.210
,-0.202
 and-0.181
 with-0.104
 in-0.091
Token sway
Token ID20009
Feature activation-4.00e-01

Pos logit contributions
.+1.297
,+1.258
 and+1.234
 to+1.058
 with+1.042
Neg logit contributions
Ta-1.024
 Majesty-0.937
 former-0.918
icious-0.903
Uh-0.896
Token her
Token ID607
Feature activation+1.12e-01

Pos logit contributions
Ta+1.402
umm+1.391
ubb+1.358
mic+1.355
icious+1.348
Neg logit contributions
,-0.468
.-0.457
 and-0.407
 to-0.259
 with-0.246
Token arms
Token ID5101
Feature activation-3.57e-01

Pos logit contributions
.+0.414
,+0.406
 and+0.395
 to+0.347
 with+0.340
Neg logit contributions
Ta-0.236
 Majesty-0.229
 former-0.220
umm-0.217
 majesty-0.215
Token and
Token ID290
Feature activation+5.12e-01

Pos logit contributions
mic+1.310
Ta+1.288
 former+1.250
icious+1.228
arse+1.219
Neg logit contributions
,-0.367
.-0.365
 and-0.306
 the-0.199
 to-0.192
Token move
Token ID1445
Feature activation-2.53e-01

Pos logit contributions
.+1.573
,+1.517
 and+1.484
 to+1.238
 with+1.212
Neg logit contributions
Ta-1.526
 Majesty-1.411
 former-1.374
Uh-1.371
agna-1.333
Token her
Token ID607
Feature activation+1.12e-01

Pos logit contributions
Ta+0.924
 former+0.912
mic+0.906
ubb+0.902
Uh+0.895
Neg logit contributions
,-0.276
 and-0.247
.-0.246
 to-0.121
 with-0.119
Token saw
Token ID2497
Feature activation-1.58e-01

Pos logit contributions
.+1.479
,+1.413
 and+1.371
 to+1.142
 with+1.109
Neg logit contributions
Ta-1.678
 Majesty-1.538
Uh-1.519
agna-1.494
bek-1.494
Token that
Token ID326
Feature activation+3.06e-01

Pos logit contributions
Ta+0.698
 Majesty+0.652
Uh+0.637
 pulp+0.631
agna+0.616
Neg logit contributions
.-0.223
,-0.208
 and-0.198
 to-0.121
 with-0.121
Token it
Token ID340
Feature activation+1.63e-01

Pos logit contributions
.+0.494
,+0.457
 and+0.440
 to+0.297
 with+0.267
Neg logit contributions
Ta-1.260
 Majesty-1.214
Uh-1.195
 former-1.181
bek-1.177
Token was
Token ID373
Feature activation+1.09e-01

Pos logit contributions
.+0.305
,+0.290
 and+0.276
 to+0.206
 with+0.194
Neg logit contributions
Ta-0.681
Uh-0.641
icious-0.630
umm-0.625
 Majesty-0.625
Token not
Token ID407
Feature activation-6.39e-02

Pos logit contributions
.+0.266
,+0.257
 and+0.245
 to+0.194
 with+0.189
Neg logit contributions
Ta-0.365
Uh-0.351
 former-0.347
icious-0.347
umm-0.337
Token smooth
Token ID7209
Feature activation-3.16e-01

Pos logit contributions
Ta+0.208
icious+0.199
Uh+0.198
 former+0.198
agna+0.197
Neg logit contributions
.-0.146
,-0.146
 and-0.137
 to-0.108
 with-0.107
Token,
Token ID11
Feature activation+5.30e-01

Pos logit contributions
umm+1.172
 former+1.079
 Majesty+1.026
rated+1.025
icious+1.013
Neg logit contributions
.-0.425
,-0.419
 and-0.390
-0.330
 the-0.321
Token but
Token ID475
Feature activation+7.41e-02

Pos logit contributions
.+1.065
,+0.995
 and+0.874
 to+0.700
 with+0.664
Neg logit contributions
Ta-1.995
Uh-1.925
umm-1.854
icious-1.847
 former-1.835
Token instead
Token ID2427
Feature activation+3.67e-03

Pos logit contributions
.+0.134
,+0.123
 and+0.120
 to+0.084
 with+0.079
Neg logit contributions
Ta-0.312
Uh-0.293
 Majesty-0.291
bek-0.286
 former-0.286
Token it
Token ID340
Feature activation+1.63e-01

Pos logit contributions
.+0.006
 and+0.006
,+0.006
 to+0.004
 with+0.004
Neg logit contributions
Ta-0.015
 Majesty-0.014
Uh-0.014
umm-0.014
icious-0.014
Token was
Token ID373
Feature activation+1.09e-01

Pos logit contributions
.+0.325
,+0.309
 and+0.296
 to+0.223
 with+0.214
Neg logit contributions
Ta-0.668
Uh-0.623
icious-0.615
 Majesty-0.613
 former-0.612
TokenAmy
Token ID40797
Feature activation-6.42e-01

Pos logit contributions
.+0.563
,+0.538
 and+0.535
 with+0.456
 to+0.448
Neg logit contributions
Ta-0.513
umm-0.493
 Majesty-0.489
agna-0.473
 former-0.464
Token sadly
Token ID21098
Feature activation-2.44e-01

Pos logit contributions
Ta+2.223
 Majesty+2.105
 former+2.067
Uh+2.055
bek+2.019
Neg logit contributions
.-1.399
,-1.291
 and-1.204
 to-1.028
 with-0.989
Token replied
Token ID8712
Feature activation-3.44e-01

Pos logit contributions
Ta+0.657
 Majesty+0.609
 former+0.594
bek+0.584
Uh+0.580
Neg logit contributions
.-0.740
,-0.708
 and-0.700
 to-0.591
 with-0.589
Token,
Token ID11
Feature activation+5.15e-01

Pos logit contributions
 Majesty+1.444
Ta+1.366
 majesty+1.352
agna+1.344
 former+1.342
Neg logit contributions
.-0.296
 and-0.239
,-0.180
 with-0.054
 to-0.039
Token "
Token ID366
Feature activation-1.29e-02

Pos logit contributions
.+1.318
,+1.237
 and+1.191
 to+0.967
 with+0.966
Neg logit contributions
 Majesty-1.486
Ta-1.472
Uh-1.429
agna-1.415
 sidelines-1.410
TokenOkay
Token ID16454
Feature activation-8.28e-01

Pos logit contributions
 Majesty+0.036
Ta+0.035
 majesty+0.035
agna+0.035
umm+0.034
Neg logit contributions
.-0.041
 and-0.040
,-0.038
 with-0.035
 to-0.033
Token Mom
Token ID11254
Feature activation-1.29e-01

Pos logit contributions
 Majesty+4.033
 majesty+3.820
icious+3.749
umm+3.746
agna+3.745
Neg logit contributions
.-0.317
 and-0.268
,-0.080
 to+0.047
 with+0.104
Token.
Token ID13
Feature activation+5.74e-01

Pos logit contributions
 Majesty+0.314
umm+0.302
icious+0.297
 majesty+0.290
Ta+0.288
Neg logit contributions
.-0.359
 and-0.347
,-0.341
 with-0.304
 in-0.295
Token I
Token ID314
Feature activation-2.93e-01

Pos logit contributions
.+1.686
,+1.663
 and+1.608
 to+1.358
 with+1.336
Neg logit contributions
Ta-1.839
Uh-1.648
icious-1.638
 Majesty-1.636
 former-1.625
Token'm
Token ID1101
Feature activation-4.60e-01

Pos logit contributions
Ta+0.770
 Majesty+0.694
 former+0.693
Uh+0.688
icious+0.684
Neg logit contributions
.-0.996
,-0.967
 and-0.943
 to-0.815
 with-0.804
Token sorry
Token ID7926
Feature activation-4.30e-01

Pos logit contributions
Ta+1.180
 Majesty+1.144
icious+1.111
 former+1.095
agna+1.063
Neg logit contributions
.-1.498
,-1.443
 and-1.426
 to-1.199
 with-1.180
Token a
Token ID257
Feature activation+2.36e-01

Pos logit contributions
.+0.772
,+0.741
 and+0.724
 to+0.543
 with+0.541
Neg logit contributions
Ta-1.414
 Majesty-1.329
 former-1.313
agna-1.285
Uh-1.284
Token new
Token ID649
Feature activation-7.39e-02

Pos logit contributions
.+0.735
,+0.715
 and+0.697
 to+0.587
 with+0.572
Neg logit contributions
Ta-0.727
 Majesty-0.656
Uh-0.648
umm-0.648
icious-0.647
Token map
Token ID3975
Feature activation-4.97e-01

Pos logit contributions
Ta+0.200
 Majesty+0.190
 former+0.186
umm+0.179
Uh+0.178
Neg logit contributions
.-0.233
,-0.223
 and-0.221
 with-0.187
 to-0.186
Token �
Token ID6184
Feature activation-6.31e-01

Pos logit contributions
Ta+2.296
 Majesty+2.268
umm+2.244
 former+2.230
agna+2.229
Neg logit contributions
,-0.142
.-0.114
 and-0.084
 the+0.083
 with+0.108
Token�
Token ID95
Feature activation-6.49e-01

Pos logit contributions
Ta+1.197
 Majesty+1.021
Uh+0.987
icious+0.986
 former+0.981
Neg logit contributions
.-2.690
,-2.627
 and-2.590
 to-2.293
 with-2.261
Token€
Token ID26391
Feature activation-7.84e-01

Pos logit contributions
Ta+1.325
 Majesty+1.154
Uh+1.140
 former+1.118
icious+1.107
Neg logit contributions
.-2.697
,-2.618
 and-2.581
 to-2.263
 with-2.247
Token�
Token ID447
Feature activation-2.05e-01

Pos logit contributions
Ta+2.179
 Majesty+2.067
agna+1.981
bek+1.964
icious+1.959
Neg logit contributions
.-2.432
,-2.347
 and-2.315
 to-1.972
 with-1.941
Token�
Token ID250
Feature activation-4.42e-01

Pos logit contributions
Ta+0.446
 Majesty+0.402
 former+0.388
umm+0.385
Uh+0.379
Neg logit contributions
.-0.813
,-0.800
 and-0.782
 with-0.675
 to-0.669
Token the
Token ID262
Feature activation+2.87e-01

Pos logit contributions
Ta+0.964
 Majesty+0.934
agna+0.867
 majesty+0.863
umm+0.855
Neg logit contributions
.-1.726
 and-1.666
,-1.644
 to-1.461
 with-1.452
Token map
Token ID3975
Feature activation-4.87e-01

Pos logit contributions
.+1.071
,+1.043
 and+1.022
 to+0.877
 with+0.867
Neg logit contributions
Ta-0.710
Uh-0.629
icious-0.623
 former-0.614
umm-0.613
Token of
Token ID286
Feature activation+1.91e-01

Pos logit contributions
Ta+1.886
 Majesty+1.821
 majesty+1.766
bek+1.760
agna+1.755
Neg logit contributions
.-0.821
,-0.764
 and-0.728
 to-0.524
 with-0.504
Token Harry
Token ID5850
Feature activation-5.13e-01

Pos logit contributions
.+1.298
,+1.247
 and+1.214
 to+0.977
 with+0.949
Neg logit contributions
Ta-1.831
 Majesty-1.695
Uh-1.675
 former-1.664
icious-1.657
Token wanted
Token ID2227
Feature activation-2.53e-01

Pos logit contributions
Ta+1.755
 Majesty+1.693
 former+1.626
 majesty+1.599
Uh+1.595
Neg logit contributions
.-1.184
,-1.125
 and-1.063
 to-0.905
 with-0.876
Token to
Token ID284
Feature activation+3.71e-01

Pos logit contributions
Ta+1.179
 Majesty+1.098
 former+1.098
agna+1.091
Uh+1.082
Neg logit contributions
.-0.247
,-0.224
 and-0.198
 with-0.081
 in-0.051
Token mail
Token ID6920
Feature activation-2.94e-01

Pos logit contributions
.+1.109
,+1.077
 and+1.066
 to+0.883
 with+0.869
Neg logit contributions
Ta-1.114
 Majesty-1.027
 former-1.002
icious-0.992
Uh-0.978
Token a
Token ID257
Feature activation+2.29e-01

Pos logit contributions
Ta+1.081
agna+1.040
 Majesty+1.020
icious+1.004
bek+0.990
Neg logit contributions
.-0.472
,-0.444
 and-0.440
 to-0.292
 with-0.272
Token parcel
Token ID25232
Feature activation-7.39e-01

Pos logit contributions
.+0.832
,+0.806
 and+0.794
 to+0.683
 with+0.677
Neg logit contributions
Ta-0.555
 Majesty-0.500
Uh-0.490
 former-0.488
agna-0.486
Token to
Token ID284
Feature activation+3.67e-01

Pos logit contributions
belt+2.893
bek+2.883
agna+2.873
pron+2.852
cher+2.833
Neg logit contributions
.-0.480
,-0.422
 and-0.383
-0.077
 the-0.070
Token his
Token ID465
Feature activation+7.23e-02

Pos logit contributions
.+0.997
,+0.980
 and+0.954
 to+0.785
 with+0.753
Neg logit contributions
Ta-1.169
 former-1.081
 Majesty-1.054
icious-1.044
bek-1.044
Token grandma
Token ID49890
Feature activation-2.60e-01

Pos logit contributions
.+0.237
,+0.229
 and+0.226
 to+0.191
 with+0.189
Neg logit contributions
Ta-0.197
 Majesty-0.188
 former-0.185
agna-0.177
umm-0.176
Token.
Token ID13
Feature activation+5.66e-01

Pos logit contributions
agna+1.076
 Majesty+1.048
 former+1.048
uries+1.024
Ta+1.023
Neg logit contributions
.-0.207
,-0.199
 and-0.182
 to-0.106
 the-0.079
Token Sarah
Token ID10490
Feature activation-3.24e-01

Pos logit contributions
.+1.422
,+1.364
 and+1.320
 to+1.053
 with+1.039
Neg logit contributions
Ta-2.028
 Majesty-1.880
Uh-1.861
 former-1.849
umm-1.838
Token completed
Token ID5668
Feature activation-4.98e-01

Pos logit contributions
Ta+0.400
 Majesty+0.359
Uh+0.352
 former+0.352
icious+0.347
Neg logit contributions
.-0.457
,-0.443
 and-0.432
 with-0.362
 to-0.362
Token its
Token ID663
Feature activation-8.14e-02

Pos logit contributions
 former+1.835
 Majesty+1.767
bek+1.668
agna+1.658
mic+1.648
Neg logit contributions
.-0.770
,-0.746
 and-0.696
 to-0.484
 with-0.439
Token journey
Token ID7002
Feature activation-4.98e-01

Pos logit contributions
Ta+0.170
 former+0.155
 Majesty+0.155
icious+0.145
umm+0.145
Neg logit contributions
.-0.321
,-0.311
 and-0.308
 to-0.268
 with-0.268
Token without
Token ID1231
Feature activation+5.74e-02

Pos logit contributions
 former+2.198
 lifetime+2.167
vation+2.131
bek+2.129
 Majesty+2.108
Neg logit contributions
.-0.165
,-0.138
 and-0.095
 with+0.080
 to+0.093
Token any
Token ID597
Feature activation-1.50e-01

Pos logit contributions
.+0.158
,+0.154
 and+0.152
 with+0.123
 to+0.122
Neg logit contributions
Ta-0.180
 Majesty-0.172
 former-0.169
Uh-0.165
agna-0.164
Token fears
Token ID10251
Feature activation-7.02e-01

Pos logit contributions
Ta+0.339
 Majesty+0.315
 former+0.312
umm+0.304
Uh+0.302
Neg logit contributions
.-0.538
,-0.528
 and-0.522
 with-0.452
 to-0.450
Token.
Token ID13
Feature activation+5.69e-01

Pos logit contributions
mic+3.061
Ta+3.001
vation+2.996
 sidelines+2.990
 lifetime+2.975
Neg logit contributions
,-0.238
.-0.204
 and-0.162
 with+0.037
 to+0.140
Token Everyone
Token ID11075
Feature activation-3.89e-01

Pos logit contributions
.+1.605
,+1.546
 and+1.511
 to+1.241
 with+1.215
Neg logit contributions
Ta-1.913
 Majesty-1.752
Uh-1.730
icious-1.721
 former-1.717
Token celebrated
Token ID13943
Feature activation-5.24e-01

Pos logit contributions
Ta+1.328
bek+1.222
Uh+1.204
 Majesty+1.193
vation+1.157
Neg logit contributions
.-0.865
,-0.816
 and-0.761
 to-0.628
 with-0.583
Token and
Token ID290
Feature activation+5.11e-01

Pos logit contributions
 former+2.150
bek+2.102
 Majesty+2.097
agna+2.067
icious+2.063
Neg logit contributions
.-0.505
,-0.445
 and-0.356
 to-0.251
 with-0.107
Token cheered
Token ID34767
Feature activation-3.64e-01

Pos logit contributions
.+1.176
,+1.122
 and+1.090
 to+0.848
 with+0.824
Neg logit contributions
Ta-1.972
 Majesty-1.828
Uh-1.807
 former-1.800
icious-1.792
Token was
Token ID373
Feature activation+1.09e-01

Pos logit contributions
.+0.295
,+0.277
 and+0.266
 to+0.197
 with+0.188
Neg logit contributions
Ta-0.653
Uh-0.615
icious-0.609
umm-0.609
agna-0.607
Token time
Token ID640
Feature activation-8.74e-02

Pos logit contributions
.+0.273
 and+0.267
,+0.264
 to+0.217
 with+0.215
Neg logit contributions
Ta-0.329
icious-0.329
umm-0.321
Uh-0.320
 former-0.318
Token to
Token ID284
Feature activation+3.85e-01

Pos logit contributions
bek+0.309
Ta+0.304
agna+0.302
inations+0.299
 supervision+0.298
Neg logit contributions
.-0.101
,-0.098
 and-0.088
 with-0.070
 the-0.053
Token put
Token ID1234
Feature activation-1.20e-01

Pos logit contributions
.+1.108
,+1.063
 and+1.051
 to+0.913
 with+0.865
Neg logit contributions
Ta-1.182
icious-1.103
 Majesty-1.089
 former-1.087
agna-1.076
Token the
Token ID262
Feature activation+2.89e-01

Pos logit contributions
Ta+0.457
wy+0.444
agna+0.436
 pulp+0.433
umm+0.433
Neg logit contributions
,-0.181
.-0.179
 and-0.166
 to-0.120
 with-0.106
Token mixture
Token ID11710
Feature activation-6.82e-01

Pos logit contributions
.+0.989
,+0.961
 and+0.941
 to+0.805
 with+0.791
Neg logit contributions
Ta-0.794
 Majesty-0.711
Uh-0.702
icious-0.697
umm-0.695
Token into
Token ID656
Feature activation+9.33e-02

Pos logit contributions
Ta+2.369
umm+2.351
 pulp+2.337
wy+2.318
atmeal+2.309
Neg logit contributions
,-0.915
.-0.849
 and-0.820
 with-0.447
 the-0.436
Token the
Token ID262
Feature activation+2.89e-01

Pos logit contributions
,+0.127
.+0.126
 and+0.120
 to+0.079
 with+0.068
Neg logit contributions
Ta-0.401
 former-0.394
 Majesty-0.383
agna-0.381
icious-0.381
Token freezer
Token ID30967
Feature activation-5.75e-01

Pos logit contributions
.+1.050
,+1.024
 and+1.004
 to+0.868
 with+0.855
Neg logit contributions
Ta-0.728
 Majesty-0.648
Uh-0.638
umm-0.633
 former-0.633
Token.
Token ID13
Feature activation+5.70e-01

Pos logit contributions
umm+2.637
 former+2.577
agna+2.571
bek+2.565
belt+2.465
Neg logit contributions
,-0.156
.-0.052
 and-0.034
 the+0.082
 with+0.152
Token Lily
Token ID20037
Feature activation-7.79e-02

Pos logit contributions
.+1.361
,+1.300
 and+1.260
 to+0.978
 with+0.968
Neg logit contributions
Ta-2.117
 Majesty-1.963
Uh-1.941
 former-1.940
umm-1.930