TOP ACTIVATIONS
MAX = 4.095

I go play with Ben now ?" Lisa asks .
and sad . Suddenly , it heard a cheerful voice behind
playing in the park when it saw its special brush with
and Ben woke up when it was dark . They heard
bedroom door opened and a ghost appeared . The ghost was
writing in the notebook when it split into two ! She
Can I play with you now ?" he asked .
discover where it 's coming from !" So ,
Suddenly , she heard a voice from behind
apping its wings around when it saw a butterfly . It
home and play with Rose now ?" Her mom said ,
Can I have Big Red now ?" His mom
box . He turned around and saw a man with a
to travel to a far away place and brought Spot along
walking in the park when it saw a little bird .
want to play with me now ?" The gr umpy cat
Can I play with you later ?" Mia says , "
playing with her doll when it accidentally fell out of her
then , she fell off and hurt her knee .
all of a sudden , it heard a loud noise .

MIN ACTIVATIONS
MIN = -4.865

." They shared Ben 's ice - cream and
! They collected all the treasure and stored it
the luggage . They added all the luggage to the truck
Ben and Lily hugged grand pa . They said , "
for you ." Bill gave Bob the bread , and Bob
. Peter ate all the food and enjoyed the
. Together , they collected all the stones Jack wanted .
, Dad ." They hugged Dad and said , " You
The shop keeper gathered all of the items and told
Tom and his rabbit sold all the carrots . They were
were filled with fresh cuc umber and cheese . It made
The woman gave two yellow le mons to Sam
to do ! she gathered all her friends together to help
The pals gathered all of the ingredients for the
to help ! They gathered all of the supplies they would
ily and Ben gave the man some bread . They taped
sweet . He quickly collected all the berries he could find
tw igs . He gave them to the squirrel , who
ummy treats . They took all of the strawberries inside and
great dish . They gathered all the ingredients and mixed them

INTERVAL 1.855 - 4.095
CONTAINS 1.783%

a witch on the cover and many strange words inside .
You can go and play now . But first , let
walking to the store when she saw a little girl with
play with . One day , she took her hoop to
the bottom of the hill . She is dirty and has

INTERVAL -0.385 - 1.855
CONTAINS 65.426%

. She saw a small path that lead to a river
thought his sled was better and more fun . He did
Henry was so happy that he had an idea to test
was so happy . She then realized that the small writing
ahead . It was a big blue towel on the ground

INTERVAL -2.625 - -0.385
CONTAINS 32.401%

. Dave looked at Ms . Sarah with a
the mess and Lily learned to be more careful when playing
in the sunshine and eat y ummy carrots . One day
<|endoftext|> Tom and Sue are friends . They like to play
so she gave it some food and water . This was

INTERVAL -4.865 - -2.625
CONTAINS 0.390%

was so happy to make people happy . Next
look good . He was so proud of his work !
ave them . She hugged them . L ily
pancakes . They were so y ummy ! Lily was happy
help so he got two sp oons and two plates .
Token I
Token ID314
Feature activation+5.63e-01

Pos logit contributions
 bumped+0.824
 stub+0.811
 accidentally+0.750
 tri+0.678
 landed+0.673
Neg logit contributions
 pleasure-1.442
 generosity-1.429
 appreciation-1.332
ipp-1.310
 pride-1.300
Token go
Token ID467
Feature activation+2.48e+00

Pos logit contributions
 bumped+0.984
 stub+0.978
 accidentally+0.934
 tri+0.883
 landed+0.861
Neg logit contributions
 pleasure-0.943
 generosity-0.913
 appreciation-0.861
 pride-0.837
ipp-0.826
Token play
Token ID711
Feature activation+2.74e+00

Pos logit contributions
 stub+4.038
 bumped+3.906
 accidentally+3.823
Ro+3.640
 tri+3.620
Neg logit contributions
 pleasure-4.565
 generosity-4.374
 appreciation-4.139
 pride-4.109
 kindness-4.093
Token with
Token ID351
Feature activation+7.84e-01

Pos logit contributions
 bumped+4.649
 stub+4.641
 accidentally+4.084
 tri+4.064
 landed+3.839
Neg logit contributions
 pleasure-5.139
 generosity-4.989
 appreciation-4.752
 kindness-4.680
ipp-4.594
Token Ben
Token ID3932
Feature activation+1.70e+00

Pos logit contributions
 bumped+1.186
 stub+1.186
 accidentally+1.089
 tri+1.006
 landed+0.999
Neg logit contributions
 pleasure-1.496
 generosity-1.477
 appreciation-1.403
 pride-1.373
ipp-1.364
Token now
Token ID783
Feature activation+4.10e+00

Pos logit contributions
 bumped+1.292
 stub+1.273
Ro+1.003
 accidentally+0.951
 tri+0.911
Neg logit contributions
 generosity-4.558
 pleasure-4.539
 appreciation-4.342
ipp-4.327
 choices-4.230
Token?"
Token ID1701
Feature activation+7.14e-01

Pos logit contributions
 bumped+4.952
 stub+4.940
Ro+4.751
Raw+4.400
 tri+4.137
Neg logit contributions
 pleasure-10.428
 generosity-9.725
ish-9.306
 appreciation-9.264
ipp-9.243
Token Lisa
Token ID15378
Feature activation+3.68e-01

Pos logit contributions
 stub+1.045
 bumped+1.033
 accidentally+0.943
 landed+0.894
 tri+0.881
Neg logit contributions
 pleasure-1.377
 generosity-1.355
 pride-1.292
 appreciation-1.284
 kinds-1.225
Token asks
Token ID7893
Feature activation+6.76e-01

Pos logit contributions
 stub+0.511
 bumped+0.506
 accidentally+0.476
 seek+0.437
 tri+0.435
Neg logit contributions
 pleasure-0.726
 generosity-0.721
 appreciation-0.693
 pride-0.680
ipp-0.670
Token.
Token ID13
Feature activation+9.79e-01

Pos logit contributions
 stub+0.834
 bumped+0.817
 accidentally+0.762
Ro+0.697
 landed+0.693
Neg logit contributions
 generosity-1.463
 pleasure-1.458
 appreciation-1.379
 pride-1.378
ipp-1.363
Token
Token ID198
Feature activation+6.83e-01

Pos logit contributions
 bumped+1.539
 stub+1.514
 accidentally+1.403
 landed+1.349
 seek+1.336
Neg logit contributions
 pleasure-1.815
 generosity-1.737
 appreciation-1.665
 pride-1.655
ipp-1.608
Token and
Token ID290
Feature activation+1.48e+00

Pos logit contributions
 stub+1.454
 bumped+1.429
 accidentally+1.311
Ro+1.280
 seek+1.268
Neg logit contributions
 pleasure-1.694
 generosity-1.655
 appreciation-1.579
ipp-1.567
 choices-1.544
Token sad
Token ID6507
Feature activation+1.45e+00

Pos logit contributions
 stub+2.104
 bumped+2.029
 accidentally+1.915
 Ach+1.839
Ro+1.835
Neg logit contributions
 pleasure-2.841
 generosity-2.840
 appreciation-2.734
 pride-2.716
 choices-2.631
Token.
Token ID13
Feature activation+6.68e-01

Pos logit contributions
 stub+2.122
 bumped+2.010
 accidentally+1.878
Ro+1.783
 seek+1.754
Neg logit contributions
 pleasure-2.922
 generosity-2.908
 appreciation-2.805
 choices-2.773
ipp-2.677
Token Suddenly
Token ID24975
Feature activation+1.24e+00

Pos logit contributions
 bumped+1.295
 stub+1.290
 accidentally+1.196
 landed+1.149
 seek+1.149
Neg logit contributions
 pleasure-1.024
 generosity-0.962
 appreciation-0.902
ipp-0.884
 pride-0.879
Token,
Token ID11
Feature activation+1.74e+00

Pos logit contributions
 stub+2.001
 bumped+1.879
Ro+1.808
 accidentally+1.758
Please+1.746
Neg logit contributions
 pleasure-2.212
 generosity-2.133
 appreciation-2.038
 pride-1.981
ipp-1.948
Token it
Token ID340
Feature activation+3.94e+00

Pos logit contributions
 stub+2.986
 bumped+2.841
 accidentally+2.620
Ro+2.558
 tri+2.516
Neg logit contributions
 pleasure-3.094
 generosity-2.960
 appreciation-2.803
 pride-2.770
 kindness-2.702
Token heard
Token ID2982
Feature activation+1.26e+00

Pos logit contributions
Ro+5.562
?"+5.106
Who+5.090
 seeker+5.028
 university+4.991
Neg logit contributions
 pride-7.262
 appreciation-7.225
 generosity-6.835
ealing-6.665
 compassion-6.516
Token a
Token ID257
Feature activation+8.54e-01

Pos logit contributions
 stub+2.038
 bumped+2.014
 accidentally+1.875
 seek+1.837
 tri+1.737
Neg logit contributions
 pleasure-2.279
 generosity-2.250
 appreciation-2.207
 pride-2.163
 kindness-2.078
Token cheerful
Token ID37999
Feature activation+3.01e-01

Pos logit contributions
 bumped+1.256
 stub+1.252
 accidentally+1.177
?"+1.084
 tri+1.070
Neg logit contributions
 generosity-1.646
 pleasure-1.634
 appreciation-1.591
 pride-1.487
 kindness-1.472
Token voice
Token ID3809
Feature activation+1.86e+00

Pos logit contributions
 bumped+0.372
 stub+0.369
 accidentally+0.348
 tri+0.311
?"+0.307
Neg logit contributions
 pleasure-0.646
 generosity-0.644
 appreciation-0.619
 pride-0.593
 choices-0.586
Token behind
Token ID2157
Feature activation+1.04e+00

Pos logit contributions
 stub+2.772
 bumped+2.638
 accidentally+2.594
 seek+2.347
Ro+2.347
Neg logit contributions
 generosity-3.576
 appreciation-3.413
 pleasure-3.381
 kindness-3.286
 kinds-3.277
Token playing
Token ID2712
Feature activation+1.49e+00

Pos logit contributions
 bumped+1.888
Ro+1.876
 stub+1.874
Who+1.845
?"+1.775
Neg logit contributions
 pleasure-2.211
 generosity-2.107
 pride-2.073
 appreciation-2.041
ipp-2.022
Token in
Token ID287
Feature activation+8.27e-01

Pos logit contributions
 stub+2.103
 bumped+2.062
 accidentally+1.826
 landed+1.738
 rang+1.736
Neg logit contributions
 pleasure-2.900
 generosity-2.872
 appreciation-2.739
 pride-2.668
 kindness-2.656
Token the
Token ID262
Feature activation+1.32e+00

Pos logit contributions
 stub+1.344
 bumped+1.335
 accidentally+1.234
 tri+1.156
?"+1.147
Neg logit contributions
 pleasure-1.492
 generosity-1.438
 appreciation-1.395
 kinds-1.357
 pride-1.332
Token park
Token ID3952
Feature activation+1.01e+00

Pos logit contributions
 bumped+1.716
 stub+1.613
 accidentally+1.487
 tri+1.396
?"+1.385
Neg logit contributions
 pleasure-2.863
 generosity-2.695
 appreciation-2.676
 pride-2.646
ipp-2.635
Token when
Token ID618
Feature activation+1.66e+00

Pos logit contributions
 stub+1.348
 bumped+1.326
 accidentally+1.191
Ro+1.149
?"+1.125
Neg logit contributions
 pleasure-2.068
 generosity-2.020
 appreciation-1.917
 kinds-1.912
ipp-1.898
Token it
Token ID340
Feature activation+3.90e+00

Pos logit contributions
 stub+2.470
 bumped+2.424
Ro+2.217
 seek+2.181
 accidentally+2.169
Neg logit contributions
 pleasure-3.163
 generosity-2.991
 appreciation-2.821
 pride-2.817
ipp-2.743
Token saw
Token ID2497
Feature activation+1.41e+00

Pos logit contributions
?"+4.983
 university+4.978
Who+4.942
Ro+4.792
 seeker+4.783
Neg logit contributions
ipp-7.057
ealing-6.726
apa-6.596
 appreciation-6.586
leased-6.509
Token its
Token ID663
Feature activation+9.21e-01

Pos logit contributions
 stub+2.343
 bumped+2.292
 seek+2.108
 accidentally+2.085
 tri+2.000
Neg logit contributions
 pleasure-2.541
 generosity-2.477
 appreciation-2.386
 pride-2.325
 kindness-2.272
Token special
Token ID2041
Feature activation+9.82e-01

Pos logit contributions
 bumped+1.800
 stub+1.754
 accidentally+1.692
Ro+1.628
?"+1.605
Neg logit contributions
 pleasure-1.409
 generosity-1.357
 appreciation-1.275
 pride-1.212
 choices-1.199
Token brush
Token ID14093
Feature activation+9.61e-01

Pos logit contributions
 bumped+1.613
 stub+1.597
 accidentally+1.542
Ro+1.452
 tri+1.438
Neg logit contributions
 pleasure-1.770
 generosity-1.755
 appreciation-1.673
 choices-1.581
 pride-1.575
Token with
Token ID351
Feature activation-1.02e+00

Pos logit contributions
 bumped+1.181
 stub+1.176
Ro+1.082
 accidentally+1.059
Raw+1.000
Neg logit contributions
 generosity-2.020
 pleasure-2.016
 appreciation-1.925
ipp-1.899
 pride-1.872
Token and
Token ID290
Feature activation-5.56e-01

Pos logit contributions
 bumped+3.342
Ro+3.319
 stub+3.283
 Ach+3.125
 seek+3.121
Neg logit contributions
 pleasure-2.859
 generosity-2.816
 appreciation-2.772
 pride-2.720
ipp-2.685
Token Ben
Token ID3932
Feature activation+1.84e+00

Pos logit contributions
 pleasure+1.318
 generosity+1.295
 appreciation+1.242
ipp+1.220
 pride+1.214
Neg logit contributions
 bumped-0.580
 stub-0.579
 accidentally-0.535
 tri-0.475
?"-0.471
Token woke
Token ID19092
Feature activation+8.47e-01

Pos logit contributions
 stub+2.619
 bumped+2.495
Ro+2.459
 seek+2.458
?"+2.401
Neg logit contributions
 pleasure-3.302
 pride-3.216
 generosity-3.193
 appreciation-3.178
ipp-3.100
Token up
Token ID510
Feature activation+4.77e-01

Pos logit contributions
 stub+1.660
 bumped+1.643
 accidentally+1.528
 seek+1.504
Ro+1.473
Neg logit contributions
 pleasure-1.254
 generosity-1.192
ipp-1.154
 appreciation-1.142
 choices-1.115
Token when
Token ID618
Feature activation+7.60e-01

Pos logit contributions
 stub+0.715
 bumped+0.712
 accidentally+0.675
Ro+0.635
 landed+0.624
Neg logit contributions
 pleasure-0.900
 generosity-0.894
 appreciation-0.853
 pride-0.825
ipp-0.816
Token it
Token ID340
Feature activation+3.88e+00

Pos logit contributions
 stub+1.166
 bumped+1.153
 accidentally+1.054
 landed+1.026
?"+1.023
Neg logit contributions
 pleasure-1.388
 generosity-1.350
 appreciation-1.297
 pride-1.256
ipp-1.248
Token was
Token ID373
Feature activation+1.83e+00

Pos logit contributions
 stub+5.993
 seek+5.930
Ro+5.838
 seeker+5.793
Who+5.741
Neg logit contributions
ipp-6.232
 appreciation-6.199
 kinds-6.087
 pride-5.882
aying-5.846
Token dark
Token ID3223
Feature activation+1.44e+00

Pos logit contributions
 stub+3.201
 bumped+3.149
Ro+2.949
 accidentally+2.909
Who+2.883
Neg logit contributions
 pleasure-3.157
 pride-2.859
 generosity-2.826
 appreciation-2.813
ipp-2.811
Token.
Token ID13
Feature activation+6.12e-01

Pos logit contributions
 stub+1.988
 bumped+1.890
 accidentally+1.828
Ro+1.660
 tri+1.650
Neg logit contributions
 pleasure-2.875
 generosity-2.822
 kinds-2.796
ipp-2.731
 appreciation-2.709
Token They
Token ID1119
Feature activation+1.30e+00

Pos logit contributions
 bumped+1.196
 stub+1.190
 accidentally+1.102
 landed+1.079
?"+1.074
Neg logit contributions
 pleasure-0.918
 generosity-0.853
 appreciation-0.815
ipp-0.802
 pride-0.795
Token heard
Token ID2982
Feature activation+5.16e-01

Pos logit contributions
 stub+1.707
 bumped+1.608
?"+1.522
 accidentally+1.504
Ro+1.495
Neg logit contributions
 pleasure-2.558
 appreciation-2.511
 generosity-2.506
 pride-2.488
ipp-2.386
Token bedroom
Token ID14043
Feature activation+2.16e+00

Pos logit contributions
 bumped+3.080
 accidentally+2.984
 stub+2.920
 landed+2.856
?"+2.770
Neg logit contributions
 pleasure-2.791
 generosity-2.635
 pride-2.523
 appreciation-2.472
 choices-2.384
Token door
Token ID3420
Feature activation+2.32e+00

Pos logit contributions
Ro+2.466
 stub+2.344
 seek+2.299
?"+2.247
Raw+2.197
Neg logit contributions
ums-4.333
ipp-4.310
 generosity-4.278
 pleasure-4.238
 appreciation-4.194
Token opened
Token ID4721
Feature activation+1.06e+00

Pos logit contributions
Ro+3.253
 stub+3.215
 seek+3.013
Raw+2.937
 accidentally+2.922
Neg logit contributions
 generosity-4.210
 appreciation-4.176
 pleasure-4.139
 kinds-4.114
 pride-4.087
Token and
Token ID290
Feature activation+2.45e+00

Pos logit contributions
 stub+1.694
 bumped+1.624
 accidentally+1.552
Ro+1.500
 tri+1.455
Neg logit contributions
 pleasure-1.935
 appreciation-1.845
 generosity-1.829
ipp-1.794
 pride-1.768
Token a
Token ID257
Feature activation+1.74e+00

Pos logit contributions
 stub+4.019
 bumped+3.687
 accidentally+3.545
Ro+3.526
Raw+3.494
Neg logit contributions
 pleasure-4.394
 generosity-4.204
 pride-4.163
 appreciation-4.114
 kindness-3.888
Token ghost
Token ID10905
Feature activation+3.88e+00

Pos logit contributions
 bumped+3.196
 stub+3.136
 accidentally+2.964
 tri+2.882
 landed+2.870
Neg logit contributions
 pleasure-2.764
 generosity-2.635
 appreciation-2.576
 pride-2.413
 kindness-2.306
Token appeared
Token ID4120
Feature activation+1.86e+00

Pos logit contributions
Ro+5.100
 stub+5.077
 seek+5.024
Raw+4.985
 accidentally+4.792
Neg logit contributions
 appreciation-7.050
 generosity-6.957
 kinds-6.908
 pleasure-6.810
 sorts-6.736
Token.
Token ID13
Feature activation+1.02e+00

Pos logit contributions
 stub+2.701
 bumped+2.642
 accidentally+2.616
Ro+2.358
Raw+2.261
Neg logit contributions
 pleasure-3.675
 generosity-3.558
ipp-3.489
 appreciation-3.487
 pride-3.455
Token The
Token ID383
Feature activation+2.52e+00

Pos logit contributions
 bumped+1.978
 stub+1.955
 accidentally+1.889
 tri+1.829
 seek+1.764
Neg logit contributions
 pleasure-1.514
 generosity-1.429
 appreciation-1.375
 pride-1.341
 kinds-1.261
Token ghost
Token ID10905
Feature activation+2.19e+00

Pos logit contributions
 bumped+5.153
 accidentally+5.146
 stub+5.028
 tri+4.766
 landed+4.737
Neg logit contributions
 pleasure-3.762
 pride-3.524
 appreciation-3.458
 generosity-3.447
 kindness-3.198
Token was
Token ID373
Feature activation+1.38e+00

Pos logit contributions
 stub+2.844
Ro+2.706
 bumped+2.611
?"+2.539
 seek+2.493
Neg logit contributions
 generosity-4.112
 pleasure-4.008
 appreciation-3.998
 kinds-3.856
 pride-3.822
Token writing
Token ID3597
Feature activation+1.53e+00

Pos logit contributions
 bumped+1.292
 stub+1.287
Ro+1.261
 seek+1.199
 accidentally+1.195
Neg logit contributions
 pleasure-1.688
 generosity-1.600
ipp-1.569
 pride-1.553
 appreciation-1.546
Token in
Token ID287
Feature activation-1.01e-01

Pos logit contributions
 stub+2.429
 bumped+2.329
Ro+2.223
 accidentally+2.186
 tri+2.077
Neg logit contributions
 pleasure-2.804
 generosity-2.800
 pride-2.687
 appreciation-2.673
ipp-2.587
Token the
Token ID262
Feature activation+4.03e-01

Pos logit contributions
 pleasure+0.214
 generosity+0.210
 appreciation+0.203
 pride+0.197
ipp+0.193
Neg logit contributions
 stub-0.131
 bumped-0.131
 accidentally-0.123
 tri-0.112
?"-0.110
Token notebook
Token ID20922
Feature activation+1.72e+00

Pos logit contributions
 bumped+0.631
 stub+0.613
 accidentally+0.586
 tri+0.543
?"+0.533
Neg logit contributions
 pleasure-0.763
 generosity-0.739
 pride-0.709
 appreciation-0.707
ipp-0.675
Token when
Token ID618
Feature activation+1.87e+00

Pos logit contributions
 bumped+2.777
 stub+2.757
 accidentally+2.474
Ro+2.462
Who+2.431
Neg logit contributions
 pleasure-3.186
 generosity-3.097
ipp-2.999
 kinds-2.920
 appreciation-2.866
Token it
Token ID340
Feature activation+3.83e+00

Pos logit contributions
 stub+2.395
 bumped+2.384
Ro+2.219
?"+2.168
 accidentally+2.141
Neg logit contributions
 pleasure-3.879
 generosity-3.725
 appreciation-3.600
 pride-3.535
ipp-3.527
Token split
Token ID6626
Feature activation+1.51e+00

Pos logit contributions
Ro+5.791
?"+5.458
Raw+5.387
 stub+5.302
Who+5.277
Neg logit contributions
 generosity-6.465
 pride-6.408
 appreciation-6.334
ipp-6.279
 pleasure-6.131
Token into
Token ID656
Feature activation+4.41e-01

Pos logit contributions
 stub+2.190
 bumped+2.112
Ro+2.029
 accidentally+1.948
Raw+1.870
Neg logit contributions
 pleasure-2.940
ipp-2.860
 generosity-2.800
 appreciation-2.786
 pride-2.767
Token two
Token ID734
Feature activation-8.69e-01

Pos logit contributions
 stub+0.725
 bumped+0.722
 accidentally+0.674
?"+0.637
 tri+0.636
Neg logit contributions
 pleasure-0.782
 generosity-0.773
 appreciation-0.749
 pride-0.723
 kinds-0.701
Token!
Token ID0
Feature activation+2.64e-02

Pos logit contributions
 pleasure+1.552
 generosity+1.507
 appreciation+1.422
ipp+1.397
 pride+1.370
Neg logit contributions
 bumped-1.425
 stub-1.422
 accidentally-1.364
 tri-1.269
?"-1.251
Token She
Token ID1375
Feature activation+1.26e+00

Pos logit contributions
 bumped+0.041
 stub+0.041
 accidentally+0.038
 tri+0.036
?"+0.036
Neg logit contributions
 pleasure-0.049
 generosity-0.047
 appreciation-0.045
ipp-0.045
 pride-0.044
Token Can
Token ID1680
Feature activation+6.19e-02

Pos logit contributions
 pleasure+2.008
 generosity+1.984
 appreciation+1.904
ipp+1.866
 pride+1.846
Neg logit contributions
 stub-1.155
 bumped-1.141
 accidentally-1.082
?"-1.000
 tri-0.977
Token I
Token ID314
Feature activation-8.77e-02

Pos logit contributions
 bumped+0.073
 stub+0.072
 accidentally+0.066
 tri+0.061
 landed+0.059
Neg logit contributions
 pleasure-0.138
 generosity-0.137
 appreciation-0.129
ipp-0.127
 pride-0.126
Token play
Token ID711
Feature activation+1.80e+00

Pos logit contributions
 pleasure+0.162
 generosity+0.157
 appreciation+0.149
ipp+0.145
 pride+0.145
Neg logit contributions
 stub-0.138
 bumped-0.138
 accidentally-0.130
 tri-0.122
 landed-0.119
Token with
Token ID351
Feature activation+8.86e-02

Pos logit contributions
 bumped+2.844
 stub+2.843
 accidentally+2.530
 tri+2.503
 landed+2.271
Neg logit contributions
 pleasure-3.444
 generosity-3.421
 appreciation-3.240
 pride-3.131
 kindness-3.108
Token you
Token ID345
Feature activation+3.04e+00

Pos logit contributions
 stub+0.125
 bumped+0.124
 accidentally+0.114
 tri+0.107
 landed+0.102
Neg logit contributions
 pleasure-0.177
 generosity-0.175
 appreciation-0.166
ipp-0.163
 pride-0.163
Token now
Token ID783
Feature activation+3.82e+00

Pos logit contributions
 stub+4.395
 bumped+4.129
Ro+3.667
Raw+3.497
 tri+3.473
Neg logit contributions
 generosity-6.648
 pleasure-6.596
ipp-6.295
 kindness-6.206
izza-6.092
Token?"
Token ID1701
Feature activation+1.28e+00

Pos logit contributions
 stub+4.562
 bumped+4.473
Ro+3.890
Raw+3.761
 tri+3.706
Neg logit contributions
 pleasure-9.676
 generosity-9.197
ish-9.108
ipp-9.032
 pride-8.700
Token he
Token ID339
Feature activation+1.75e+00

Pos logit contributions
 bumped+1.872
 stub+1.818
 accidentally+1.642
 seek+1.613
 landed+1.579
Neg logit contributions
 pleasure-2.474
 generosity-2.445
 pride-2.317
 appreciation-2.309
ums-2.230
Token asked
Token ID1965
Feature activation+2.91e-01

Pos logit contributions
 stub+2.340
 bumped+2.327
 accidentally+2.139
storm+2.007
?"+2.005
Neg logit contributions
 generosity-3.440
 appreciation-3.321
 pleasure-3.231
 pride-3.226
 kinds-3.107
Token.
Token ID13
Feature activation+1.62e+00

Pos logit contributions
 stub+0.329
 bumped+0.328
 accidentally+0.302
 landed+0.273
 tri+0.272
Neg logit contributions
 pleasure-0.655
 generosity-0.653
 appreciation-0.620
 pride-0.615
ipp-0.614
Token
Token ID198
Feature activation+8.89e-01

Pos logit contributions
 bumped+2.834
 stub+2.741
 accidentally+2.590
 tri+2.542
 landed+2.471
Neg logit contributions
 pleasure-2.834
 generosity-2.649
 appreciation-2.593
 pride-2.545
ipp-2.417
Token discover
Token ID7073
Feature activation+1.06e+00

Pos logit contributions
 stub+0.661
 bumped+0.639
 accidentally+0.612
 tri+0.555
 landed+0.527
Neg logit contributions
 pleasure-1.093
 generosity-1.056
 appreciation-1.019
 pride-0.993
ipp-0.987
Token where
Token ID810
Feature activation+1.21e+00

Pos logit contributions
 stub+1.873
 bumped+1.822
 accidentally+1.693
 tri+1.614
 seek+1.552
Neg logit contributions
 pleasure-1.898
 generosity-1.836
 appreciation-1.766
 kindness-1.669
 kinds-1.667
Token it
Token ID340
Feature activation+1.33e+00

Pos logit contributions
 bumped+1.785
 stub+1.768
 accidentally+1.684
 tri+1.557
 seek+1.510
Neg logit contributions
 pleasure-2.310
 generosity-2.291
 appreciation-2.205
 pride-2.120
 choices-2.118
Token's
Token ID338
Feature activation+3.09e+00

Pos logit contributions
 stub+2.648
 bumped+2.514
 accidentally+2.429
 seek+2.329
 tri+2.286
Neg logit contributions
 pleasure-1.854
 generosity-1.824
 appreciation-1.704
 kinds-1.690
 choices-1.658
Token coming
Token ID2406
Feature activation+2.85e+00

Pos logit contributions
 stub+5.422
 bumped+5.237
Who+5.085
Ro+4.951
 accidentally+4.827
Neg logit contributions
 pleasure-5.659
 generosity-5.286
 decisions-5.106
 appreciation-5.059
 choices-5.055
Token from
Token ID422
Feature activation+3.81e+00

Pos logit contributions
 stub+5.391
 bumped+5.297
 accidentally+5.160
 tri+4.732
lf+4.617
Neg logit contributions
 pleasure-4.809
 appreciation-4.423
 generosity-4.338
 decisions-4.267
 choices-4.252
Token!"
Token ID2474
Feature activation+3.76e-01

Pos logit contributions
 bumped+5.625
 stub+5.610
 tri+4.823
 accidentally+4.799
Who+4.621
Neg logit contributions
 pleasure-8.799
 kinds-8.105
 generosity-7.952
 decisions-7.812
 sorts-7.796
Token
Token ID198
Feature activation+8.03e-01

Pos logit contributions
 stub+0.670
 bumped+0.669
 accidentally+0.627
 tri+0.600
 seek+0.584
Neg logit contributions
 pleasure-0.606
 generosity-0.591
 appreciation-0.568
 pride-0.543
ipp-0.539
Token
Token ID198
Feature activation+9.45e-01

Pos logit contributions
 bumped+1.267
 stub+1.228
 accidentally+1.158
 tri+1.105
 seek+1.065
Neg logit contributions
 pleasure-1.482
 appreciation-1.429
 generosity-1.426
ipp-1.376
 pride-1.346
TokenSo
Token ID2396
Feature activation-4.30e-01

Pos logit contributions
 bumped+1.257
 stub+1.235
 accidentally+1.090
 tri+1.044
?"+0.986
Neg logit contributions
 generosity-1.964
 pleasure-1.951
 appreciation-1.949
ipp-1.909
izza-1.831
Token,
Token ID11
Feature activation+6.87e-01

Pos logit contributions
 pleasure+0.925
 generosity+0.906
 appreciation+0.868
ipp+0.847
 pride+0.844
Neg logit contributions
 stub-0.545
 bumped-0.543
 accidentally-0.507
 tri-0.463
?"-0.457
Token
Token ID220
Feature activation-3.37e-02

Pos logit contributions
 bumped+2.641
 stub+2.640
 accidentally+2.479
 tri+2.415
 seek+2.338
Neg logit contributions
 pleasure-1.936
 generosity-1.801
 appreciation-1.736
 pride-1.699
ipp-1.582
Token
Token ID198
Feature activation+1.27e+00

Pos logit contributions
 pleasure+0.069
 generosity+0.067
 appreciation+0.065
ipp+0.064
 pride+0.063
Neg logit contributions
 bumped-0.046
 stub-0.045
 accidentally-0.043
 tri-0.039
 landed-0.038
Token
Token ID198
Feature activation+1.18e+00

Pos logit contributions
 bumped+2.099
 stub+2.037
 accidentally+1.968
 tri+1.879
 landed+1.778
Neg logit contributions
 pleasure-2.208
 appreciation-2.129
 generosity-2.090
 pride-2.046
ipp-2.003
TokenSuddenly
Token ID38582
Feature activation+1.62e+00

Pos logit contributions
 bumped+2.148
 stub+2.127
 accidentally+2.036
 tri+1.934
?"+1.886
Neg logit contributions
 pleasure-1.792
 generosity-1.776
 appreciation-1.758
 pride-1.678
ipp-1.644
Token,
Token ID11
Feature activation+1.61e+00

Pos logit contributions
 stub+2.710
 bumped+2.501
 accidentally+2.361
Ro+2.336
Please+2.248
Neg logit contributions
 pleasure-2.868
 generosity-2.743
 appreciation-2.617
 pride-2.568
 kindness-2.498
Token she
Token ID673
Feature activation+3.79e+00

Pos logit contributions
 stub+2.676
 bumped+2.506
 accidentally+2.366
 tri+2.267
Ro+2.264
Neg logit contributions
 pleasure-2.910
 generosity-2.798
 pride-2.638
 appreciation-2.634
 kindness-2.539
Token heard
Token ID2982
Feature activation+1.29e+00

Pos logit contributions
Ro+4.729
 seek+3.996
Raw+3.967
storm+3.748
jar+3.742
Neg logit contributions
 pride-8.113
 appreciation-7.794
apa-7.780
ipp-7.764
ealing-7.749
Token a
Token ID257
Feature activation+6.12e-01

Pos logit contributions
 stub+2.132
 bumped+2.080
 accidentally+1.985
 tri+1.830
 seek+1.817
Neg logit contributions
 pleasure-2.344
 generosity-2.324
 appreciation-2.275
 pride-2.223
 kindness-2.140
Token voice
Token ID3809
Feature activation+1.87e+00

Pos logit contributions
 stub+0.845
 bumped+0.842
 accidentally+0.795
?"+0.727
 tri+0.717
Neg logit contributions
 generosity-1.218
 pleasure-1.213
 appreciation-1.177
 pride-1.116
 choices-1.090
Token from
Token ID422
Feature activation+2.30e+00

Pos logit contributions
 stub+2.905
 accidentally+2.746
 bumped+2.713
 seek+2.445
Ro+2.430
Neg logit contributions
 generosity-3.521
 appreciation-3.398
 pleasure-3.347
 kinds-3.283
 kindness-3.212
Token behind
Token ID2157
Feature activation+1.41e+00

Pos logit contributions
 stub+3.634
 accidentally+3.512
 bumped+3.428
 tri+3.098
 Ach+2.914
Neg logit contributions
 pleasure-4.571
 generosity-4.499
 appreciation-4.359
 pride-4.311
 kinds-4.289
Tokenapping
Token ID5912
Feature activation+8.69e-01

Pos logit contributions
 generosity+3.039
 pleasure+3.026
 appreciation+2.766
 choices+2.643
 pride+2.624
Neg logit contributions
 stub-3.814
 bumped-3.718
 accidentally-3.527
 tri-3.445
?"-3.339
Token its
Token ID663
Feature activation-3.34e-01

Pos logit contributions
 bumped+1.239
 stub+1.233
 accidentally+1.134
Ro+1.075
Who+1.052
Neg logit contributions
 pleasure-1.676
 generosity-1.636
 appreciation-1.598
 pride-1.556
ipp-1.552
Token wings
Token ID12098
Feature activation+3.12e-01

Pos logit contributions
 pleasure+0.516
 generosity+0.502
 appreciation+0.472
 pride+0.455
ipp+0.454
Neg logit contributions
 bumped-0.623
 stub-0.621
 accidentally-0.597
?"-0.561
 tri-0.560
Token around
Token ID1088
Feature activation+1.22e+00

Pos logit contributions
 stub+0.443
 bumped+0.440
 accidentally+0.417
?"+0.387
Ro+0.385
Neg logit contributions
 pleasure-0.595
 generosity-0.582
 appreciation-0.556
ipp-0.551
 pride-0.547
Token when
Token ID618
Feature activation+1.91e+00

Pos logit contributions
 stub+1.981
 bumped+1.963
 accidentally+1.782
 landed+1.688
 tri+1.671
Neg logit contributions
 pleasure-2.218
 generosity-2.143
 appreciation-2.052
 kinds-1.991
 pride-1.990
Token it
Token ID340
Feature activation+3.76e+00

Pos logit contributions
 stub+2.898
 bumped+2.765
Ro+2.563
 seek+2.512
 accidentally+2.476
Neg logit contributions
 pleasure-3.595
 generosity-3.443
 pride-3.218
 appreciation-3.195
 kindness-3.134
Token saw
Token ID2497
Feature activation+7.82e-01

Pos logit contributions
 seeker+4.986
 university+4.588
Who+4.536
Ro+4.531
 stub+4.493
Neg logit contributions
 kinds-6.570
 appreciation-6.564
 generosity-6.538
ipp-6.503
 pride-6.395
Token a
Token ID257
Feature activation+1.14e+00

Pos logit contributions
 stub+1.203
 bumped+1.187
 accidentally+1.085
 seek+1.068
 tri+1.027
Neg logit contributions
 pleasure-1.496
 generosity-1.465
 appreciation-1.403
 pride-1.368
ipp-1.356
Token butterfly
Token ID35113
Feature activation+1.56e+00

Pos logit contributions
 bumped+1.958
 stub+1.912
 accidentally+1.785
 tri+1.697
 landed+1.675
Neg logit contributions
 pleasure-1.966
 generosity-1.887
 appreciation-1.791
 pride-1.695
ipp-1.656
Token.
Token ID13
Feature activation-2.16e-01

Pos logit contributions
 bumped+2.100
 stub+2.053
 accidentally+1.934
 seek+1.784
Ro+1.738
Neg logit contributions
 generosity-2.981
 pleasure-2.960
 appreciation-2.908
 kinds-2.790
ipp-2.763
Token It
Token ID632
Feature activation+1.87e+00

Pos logit contributions
 pleasure+0.348
 generosity+0.336
 appreciation+0.316
ipp+0.309
 pride+0.306
Neg logit contributions
 bumped-0.390
 stub-0.389
 accidentally-0.369
 tri-0.349
?"-0.344
Token home
Token ID1363
Feature activation+2.33e+00

Pos logit contributions
 stub+4.338
 bumped+4.128
 accidentally+4.011
Ro+3.924
 tri+3.890
Neg logit contributions
 pleasure-3.676
 generosity-3.506
 appreciation-3.349
 pride-3.309
 kindness-3.167
Token and
Token ID290
Feature activation+1.39e+00

Pos logit contributions
 stub+4.105
 bumped+4.019
 accidentally+3.675
Ro+3.618
 tri+3.513
Neg logit contributions
 pleasure-4.314
 generosity-4.221
 appreciation-3.995
ipp-3.974
 choices-3.846
Token play
Token ID711
Feature activation+2.89e+00

Pos logit contributions
 bumped+2.244
 stub+2.227
 accidentally+2.070
 tri+1.987
 landed+1.888
Neg logit contributions
 pleasure-2.572
 generosity-2.556
 appreciation-2.382
 pride-2.317
 kindness-2.283
Token with
Token ID351
Feature activation+5.51e-01

Pos logit contributions
 stub+5.697
 bumped+5.686
 tri+5.155
 accidentally+5.022
 landed+4.696
Neg logit contributions
 pleasure-4.933
 generosity-4.786
 appreciation-4.622
 kindness-4.441
 pride-4.419
Token Rose
Token ID8049
Feature activation+1.91e+00

Pos logit contributions
 stub+0.837
 bumped+0.832
 accidentally+0.761
 tri+0.720
 seek+0.699
Neg logit contributions
 pleasure-1.054
 generosity-1.044
 appreciation-0.992
ipp-0.972
 pride-0.969
Token now
Token ID783
Feature activation+3.75e+00

Pos logit contributions
 stub+2.368
 bumped+2.341
Ro+2.049
 accidentally+1.979
Raw+1.968
Neg logit contributions
 generosity-4.157
 pleasure-4.109
ipp-4.026
 appreciation-3.957
 kinds-3.867
Token?"
Token ID1701
Feature activation+1.32e+00

Pos logit contributions
 stub+4.393
 bumped+4.296
Ro+4.091
Raw+3.704
 tri+3.606
Neg logit contributions
 pleasure-9.744
 generosity-9.095
ipp-8.961
ish-8.845
 choices-8.720
Token Her
Token ID2332
Feature activation+1.37e+00

Pos logit contributions
 bumped+2.137
 stub+2.019
 landed+1.864
 accidentally+1.856
 tri+1.811
Neg logit contributions
 pleasure-2.462
 generosity-2.410
 appreciation-2.285
 pride-2.242
 kindness-2.164
Token mom
Token ID1995
Feature activation+1.59e-01

Pos logit contributions
 bumped+2.453
 stub+2.357
 accidentally+2.322
 landed+2.208
 seek+2.195
Neg logit contributions
 pleasure-2.304
 generosity-2.288
 appreciation-2.187
 pride-2.103
ums-2.035
Token said
Token ID531
Feature activation-1.03e+00

Pos logit contributions
 stub+0.232
 bumped+0.226
 accidentally+0.214
 seek+0.198
 tri+0.197
Neg logit contributions
 generosity-0.303
 pleasure-0.302
 appreciation-0.291
ipp-0.282
 pride-0.281
Token,
Token ID11
Feature activation+5.85e-01

Pos logit contributions
 pleasure+2.505
 generosity+2.448
 appreciation+2.356
ipp+2.315
 pride+2.303
Neg logit contributions
 bumped-1.014
 stub-1.007
 accidentally-0.933
?"-0.829
 tri-0.817
Token Can
Token ID1680
Feature activation+8.31e-02

Pos logit contributions
 pleasure+1.269
 generosity+1.244
 appreciation+1.191
ipp+1.168
 pride+1.163
Neg logit contributions
 bumped-0.616
 stub-0.615
 accidentally-0.573
 tri-0.513
?"-0.507
Token I
Token ID314
Feature activation+7.82e-01

Pos logit contributions
 bumped+0.092
 stub+0.091
 accidentally+0.084
 tri+0.075
 landed+0.073
Neg logit contributions
 pleasure-0.191
 generosity-0.189
 appreciation-0.179
ipp-0.176
 pride-0.175
Token have
Token ID423
Feature activation+4.66e-01

Pos logit contributions
 bumped+1.182
 stub+1.180
 accidentally+1.081
 tri+1.026
 landed+1.017
Neg logit contributions
 pleasure-1.506
 generosity-1.447
 appreciation-1.401
 pride-1.354
ipp-1.342
Token Big
Token ID4403
Feature activation-1.52e-01

Pos logit contributions
 stub+0.779
 bumped+0.771
 accidentally+0.721
 tri+0.694
 seek+0.665
Neg logit contributions
 pleasure-0.823
 generosity-0.801
 appreciation-0.757
 pride-0.744
ipp-0.716
Token Red
Token ID2297
Feature activation+1.13e+00

Pos logit contributions
 pleasure+0.271
 generosity+0.265
 appreciation+0.248
ipp+0.242
 pride+0.240
Neg logit contributions
 stub-0.247
 bumped-0.246
 accidentally-0.235
 tri-0.218
 landed-0.216
Token now
Token ID783
Feature activation+3.74e+00

Pos logit contributions
 stub+1.526
 bumped+1.513
 accidentally+1.375
Ro+1.287
 tri+1.281
Neg logit contributions
 pleasure-2.330
 generosity-2.329
ipp-2.178
 appreciation-2.155
 choices-2.111
Token?"
Token ID1701
Feature activation+1.57e+00

Pos logit contributions
 stub+5.077
 bumped+4.846
Ro+4.723
Raw+4.463
Who+4.247
Neg logit contributions
 pleasure-8.720
 generosity-8.268
ipp-7.948
 pride-7.704
 kindness-7.683
Token
Token ID198
Feature activation+8.75e-01

Pos logit contributions
 bumped+2.675
 stub+2.599
 accidentally+2.470
 landed+2.432
 seek+2.381
Neg logit contributions
 pleasure-2.696
 generosity-2.640
 appreciation-2.537
 pride-2.408
ums-2.386
Token
Token ID198
Feature activation+5.60e-01

Pos logit contributions
 bumped+1.369
 stub+1.335
 accidentally+1.288
 tri+1.206
 seek+1.162
Neg logit contributions
 pleasure-1.608
 generosity-1.579
 appreciation-1.563
 pride-1.486
ipp-1.477
TokenHis
Token ID6653
Feature activation+1.27e+00

Pos logit contributions
 bumped+0.889
 stub+0.886
 accidentally+0.846
 tri+0.781
?"+0.772
Neg logit contributions
 generosity-1.001
 pleasure-0.982
 appreciation-0.964
 pride-0.918
ipp-0.916
Token mom
Token ID1995
Feature activation-4.44e-01

Pos logit contributions
 bumped+2.163
 accidentally+2.126
 stub+2.108
 landed+1.976
Please+1.908
Neg logit contributions
 generosity-2.226
 pleasure-2.221
 appreciation-2.155
 pride-2.055
 kindness-1.966
Token box
Token ID3091
Feature activation+2.55e-01

Pos logit contributions
 accidentally+2.225
 stub+2.111
 bumped+2.051
 tri+1.899
 suddenly+1.789
Neg logit contributions
 pleasure-3.374
 generosity-3.350
 choices-3.215
 appreciation-3.211
 pride-3.184
Token.
Token ID13
Feature activation+1.67e+00

Pos logit contributions
 stub+0.288
 bumped+0.282
 accidentally+0.269
?"+0.245
 tri+0.239
Neg logit contributions
 pleasure-0.564
 generosity-0.560
 appreciation-0.541
 pride-0.526
ipp-0.522
Token He
Token ID679
Feature activation+2.87e+00

Pos logit contributions
 bumped+3.418
 stub+3.382
 accidentally+3.225
 landed+3.167
 seek+3.099
Neg logit contributions
 pleasure-2.384
 generosity-2.164
 appreciation-2.116
 pride-2.075
ums-2.014
Token turned
Token ID2900
Feature activation+3.26e-01

Pos logit contributions
?"+3.563
 seek+3.534
 university+3.462
 stub+3.434
storm+3.407
Neg logit contributions
 pride-5.323
 appreciation-5.283
 generosity-5.140
 kinds-5.019
 pleasure-4.838
Token around
Token ID1088
Feature activation+1.00e+00

Pos logit contributions
 stub+0.471
 bumped+0.460
 accidentally+0.453
 seek+0.417
?"+0.412
Neg logit contributions
 pleasure-0.615
 generosity-0.601
 appreciation-0.589
 pride-0.586
ipp-0.557
Token and
Token ID290
Feature activation+3.74e+00

Pos logit contributions
 stub+1.785
 bumped+1.736
 accidentally+1.668
 landed+1.542
Ro+1.525
Neg logit contributions
 pleasure-1.664
 generosity-1.652
 appreciation-1.583
 pride-1.533
 kindness-1.481
Token saw
Token ID2497
Feature activation+3.24e-01

Pos logit contributions
Ro+5.244
 stub+5.149
storm+5.080
?"+5.026
Raw+4.842
Neg logit contributions
 pride-7.283
 appreciation-6.873
 generosity-6.762
 kindness-6.621
 pleasure-6.559
Token a
Token ID257
Feature activation+1.81e+00

Pos logit contributions
 stub+0.445
 bumped+0.441
 accidentally+0.419
 tri+0.382
?"+0.377
Neg logit contributions
 pleasure-0.658
 generosity-0.649
 appreciation-0.621
 pride-0.614
ipp-0.594
Token man
Token ID582
Feature activation+3.66e-01

Pos logit contributions
 bumped+2.999
 stub+2.964
 accidentally+2.799
Please+2.740
?"+2.715
Neg logit contributions
 pleasure-3.123
 generosity-2.987
 appreciation-2.960
 pride-2.764
 kindness-2.648
Token with
Token ID351
Feature activation-3.64e-02

Pos logit contributions
 stub+0.440
 bumped+0.435
 accidentally+0.418
?"+0.370
 tri+0.361
Neg logit contributions
 generosity-0.774
 pleasure-0.774
 appreciation-0.745
 pride-0.719
ipp-0.711
Token a
Token ID257
Feature activation+1.13e+00

Pos logit contributions
 pleasure+0.074
 generosity+0.072
 appreciation+0.069
 pride+0.068
ipp+0.066
Neg logit contributions
 stub-0.050
 bumped-0.050
 accidentally-0.047
?"-0.043
 tri-0.043
Token to
Token ID284
Feature activation-6.12e-01

Pos logit contributions
 stub+2.955
Ro+2.792
 bumped+2.787
Raw+2.587
jar+2.541
Neg logit contributions
 generosity-3.571
 pleasure-3.390
 appreciation-3.225
 kindness-3.182
 choices-3.151
Token travel
Token ID3067
Feature activation+2.12e+00

Pos logit contributions
 pleasure+1.210
 generosity+1.184
 appreciation+1.126
ipp+1.100
 pride+1.094
Neg logit contributions
 bumped-0.875
 stub-0.874
 accidentally-0.827
 tri-0.761
?"-0.757
Token to
Token ID284
Feature activation+1.09e+00

Pos logit contributions
 stub+3.686
 accidentally+3.396
 bumped+3.351
Ro+3.195
 tri+3.105
Neg logit contributions
 generosity-3.632
 pleasure-3.613
 kinds-3.488
 appreciation-3.448
 pride-3.390
Token a
Token ID257
Feature activation+1.88e+00

Pos logit contributions
 stub+1.762
 bumped+1.752
 accidentally+1.617
 tri+1.529
 landed+1.458
Neg logit contributions
 pleasure-2.048
 generosity-1.984
 appreciation-1.892
 pride-1.829
 kindness-1.786
Token far
Token ID1290
Feature activation+1.71e+00

Pos logit contributions
 bumped+3.575
 stub+3.538
 accidentally+3.222
 tri+3.145
 landed+3.099
Neg logit contributions
 pleasure-2.975
 generosity-2.733
 appreciation-2.609
ipp-2.467
 pride-2.443
Tokenaway
Token ID8272
Feature activation+3.74e+00

Pos logit contributions
 stub+2.852
 bumped+2.750
 accidentally+2.622
 tri+2.522
 Ach+2.431
Neg logit contributions
 pleasure-3.011
 generosity-2.947
 appreciation-2.898
 kinds-2.782
 sorts-2.740
Token place
Token ID1295
Feature activation+1.28e+00

Pos logit contributions
 stub+7.046
 Ach+6.691
 bumped+6.521
 accidentally+6.315
 tri+6.060
Neg logit contributions
 pleasure-6.870
 appreciation-6.865
 kinds-6.773
 generosity-6.538
 sorts-6.495
Token and
Token ID290
Feature activation+1.80e+00

Pos logit contributions
 stub+1.738
 bumped+1.634
 accidentally+1.606
Ro+1.507
 tri+1.428
Neg logit contributions
 pleasure-2.677
 generosity-2.643
 kinds-2.546
 appreciation-2.542
 sorts-2.466
Token brought
Token ID3181
Feature activation-7.37e-01

Pos logit contributions
 stub+3.123
 bumped+3.041
 accidentally+2.844
 tri+2.740
 Ach+2.722
Neg logit contributions
 pleasure-3.266
 generosity-3.213
 appreciation-2.998
 pride-2.918
 kindness-2.845
Token Spot
Token ID15899
Feature activation-1.32e+00

Pos logit contributions
 pleasure+1.462
 generosity+1.433
 appreciation+1.361
ipp+1.335
 pride+1.319
Neg logit contributions
 bumped-1.059
 stub-1.056
 accidentally-0.999
 tri-0.920
?"-0.916
Token along
Token ID1863
Feature activation+1.09e+00

Pos logit contributions
 pleasure+3.092
 generosity+3.015
 appreciation+2.882
 pride+2.830
ipp+2.827
Neg logit contributions
 bumped-1.475
 stub-1.459
 accidentally-1.349
?"-1.248
 tri-1.222
Token walking
Token ID6155
Feature activation+6.31e-01

Pos logit contributions
 bumped+1.868
 stub+1.836
Ro+1.799
Who+1.763
 accidentally+1.758
Neg logit contributions
 pleasure-2.359
 generosity-2.245
ipp-2.228
 pride-2.166
 appreciation-2.160
Token in
Token ID287
Feature activation+6.99e-01

Pos logit contributions
 stub+0.907
 bumped+0.887
 accidentally+0.851
 tri+0.769
 landed+0.752
Neg logit contributions
 pleasure-1.233
 generosity-1.213
 appreciation-1.156
 pride-1.141
ipp-1.137
Token the
Token ID262
Feature activation+1.28e+00

Pos logit contributions
 stub+1.141
 bumped+1.126
 accidentally+1.067
 tri+0.978
?"+0.963
Neg logit contributions
 pleasure-1.275
 generosity-1.223
 appreciation-1.190
 kinds-1.141
 pride-1.140
Token park
Token ID3952
Feature activation+1.18e+00

Pos logit contributions
 bumped+1.611
 stub+1.545
 accidentally+1.448
 tri+1.309
?"+1.296
Neg logit contributions
 pleasure-2.807
 generosity-2.669
 appreciation-2.648
 pride-2.597
ipp-2.591
Token when
Token ID618
Feature activation+2.32e+00

Pos logit contributions
 stub+1.601
 bumped+1.557
 accidentally+1.443
Ro+1.410
 landed+1.348
Neg logit contributions
 pleasure-2.373
 generosity-2.295
 appreciation-2.200
ipp-2.199
 kinds-2.197
Token it
Token ID340
Feature activation+3.74e+00

Pos logit contributions
 stub+3.451
 bumped+3.368
Ro+3.157
?"+3.079
 seek+3.066
Neg logit contributions
 pleasure-4.330
 generosity-4.023
 pride-3.805
 appreciation-3.724
 kindness-3.710
Token saw
Token ID2497
Feature activation+1.42e+00

Pos logit contributions
?"+4.486
 seek+4.315
Ro+4.233
 stub+4.196
 university+4.106
Neg logit contributions
ipp-7.123
ealing-6.803
apa-6.666
 pleasure-6.598
 appreciation-6.446
Token a
Token ID257
Feature activation+1.27e+00

Pos logit contributions
 stub+2.415
 bumped+2.378
 accidentally+2.200
 seek+2.117
 tri+2.091
Neg logit contributions
 pleasure-2.484
 generosity-2.448
 appreciation-2.362
 pride-2.295
 kinds-2.263
Token little
Token ID1310
Feature activation+4.52e-01

Pos logit contributions
 bumped+2.161
 stub+2.128
 accidentally+1.980
 tri+1.874
?"+1.874
Neg logit contributions
 pleasure-2.187
 generosity-2.106
 appreciation-2.002
ipp-1.874
 pride-1.857
Token bird
Token ID6512
Feature activation+1.59e+00

Pos logit contributions
 bumped+0.678
 stub+0.671
 accidentally+0.634
 tri+0.582
?"+0.577
Neg logit contributions
 pleasure-0.859
 generosity-0.841
 appreciation-0.794
 pride-0.763
 choices-0.763
Token.
Token ID13
Feature activation+1.05e-01

Pos logit contributions
 bumped+2.105
 stub+1.998
 accidentally+1.940
Ro+1.812
 seek+1.796
Neg logit contributions
 generosity-3.032
 appreciation-2.982
 pleasure-2.973
 kinds-2.901
ipp-2.887
Token want
Token ID765
Feature activation+1.54e+00

Pos logit contributions
 pleasure+0.132
 generosity+0.126
 appreciation+0.119
 pride+0.117
ipp+0.115
Neg logit contributions
 stub-0.151
 bumped-0.151
 accidentally-0.144
 tri-0.135
 landed-0.134
Token to
Token ID284
Feature activation+1.04e+00

Pos logit contributions
 stub+2.321
 bumped+2.288
 accidentally+2.116
 tri+2.000
 seek+1.909
Neg logit contributions
 pleasure-3.002
 generosity-2.972
 appreciation-2.833
 pride-2.802
 kindness-2.756
Token play
Token ID711
Feature activation+2.37e+00

Pos logit contributions
 bumped+1.360
 stub+1.360
 accidentally+1.251
 tri+1.155
 landed+1.095
Neg logit contributions
 pleasure-2.194
 generosity-2.110
 appreciation-2.047
 pride-2.030
ipp-2.025
Token with
Token ID351
Feature activation+2.14e+00

Pos logit contributions
 stub+4.049
 bumped+3.963
 accidentally+3.513
 tri+3.513
 rang+3.254
Neg logit contributions
 pleasure-4.416
 generosity-4.235
 appreciation-4.117
 pride-3.993
 kindness-3.946
Token me
Token ID502
Feature activation+2.32e+00

Pos logit contributions
 stub+3.066
 bumped+3.002
 accidentally+2.566
 tri+2.529
jar+2.491
Neg logit contributions
 pleasure-4.230
 generosity-4.169
ipp-4.009
 appreciation-4.008
 pride-4.003
Token now
Token ID783
Feature activation+3.73e+00

Pos logit contributions
 stub+2.260
 bumped+2.141
 accidentally+1.842
Who+1.758
Ro+1.757
Neg logit contributions
 pleasure-5.905
ipp-5.796
 generosity-5.767
 appreciation-5.642
 kinds-5.536
Token?"
Token ID1701
Feature activation-2.17e-01

Pos logit contributions
 stub+4.427
 bumped+4.114
Ro+3.421
 tri+3.384
Raw+3.230
Neg logit contributions
 pleasure-9.478
ipp-8.993
 generosity-8.808
ish-8.639
 appreciation-8.531
Token The
Token ID383
Feature activation+1.18e+00

Pos logit contributions
 pleasure+0.383
 generosity+0.374
 appreciation+0.352
ipp+0.342
 pride+0.342
Neg logit contributions
 stub-0.359
 bumped-0.359
 accidentally-0.339
 tri-0.319
?"-0.312
Token gr
Token ID1036
Feature activation-1.15e+00

Pos logit contributions
 bumped+1.596
 stub+1.555
 accidentally+1.457
 tri+1.355
?"+1.345
Neg logit contributions
 generosity-2.419
 pleasure-2.405
 appreciation-2.263
 pride-2.208
 kindness-2.197
Tokenumpy
Token ID32152
Feature activation+5.27e-01

Pos logit contributions
 pleasure+1.783
 generosity+1.712
 appreciation+1.593
 pride+1.539
ipp+1.520
Neg logit contributions
 stub-2.188
 bumped-2.181
 accidentally-2.103
?"-2.004
 tri-1.974
Token cat
Token ID3797
Feature activation+7.32e-01

Pos logit contributions
 bumped+0.607
 stub+0.606
 accidentally+0.556
 tri+0.501
 landed+0.499
Neg logit contributions
 generosity-1.168
 pleasure-1.165
 appreciation-1.101
 choices-1.075
 pride-1.073
Token Can
Token ID1680
Feature activation+1.27e-01

Pos logit contributions
 pleasure+0.032
 generosity+0.031
 appreciation+0.029
 pride+0.029
ipp+0.029
Neg logit contributions
 bumped-0.016
 stub-0.015
 accidentally-0.014
 tri-0.013
 landed-0.013
Token I
Token ID314
Feature activation+3.64e-01

Pos logit contributions
 bumped+0.151
 stub+0.149
 accidentally+0.138
 tri+0.125
 landed+0.124
Neg logit contributions
 pleasure-0.281
 generosity-0.277
 appreciation-0.262
ipp-0.259
 pride-0.256
Token play
Token ID711
Feature activation+2.19e+00

Pos logit contributions
 bumped+0.602
 stub+0.597
 accidentally+0.569
 tri+0.532
 landed+0.524
Neg logit contributions
 pleasure-0.650
 generosity-0.620
 appreciation-0.595
 pride-0.574
ipp-0.571
Token with
Token ID351
Feature activation+1.40e-01

Pos logit contributions
 bumped+3.617
 stub+3.591
 accidentally+3.217
 tri+3.204
 landed+3.006
Neg logit contributions
 pleasure-4.170
 generosity-4.012
 appreciation-3.825
 kindness-3.725
 pride-3.685
Token you
Token ID345
Feature activation+3.06e+00

Pos logit contributions
 stub+0.202
 bumped+0.202
 accidentally+0.187
 tri+0.174
 landed+0.169
Neg logit contributions
 pleasure-0.276
 generosity-0.271
 appreciation-0.257
ipp-0.253
 pride-0.252
Token later
Token ID1568
Feature activation+3.72e+00

Pos logit contributions
 stub+4.391
 bumped+4.131
Ro+3.789
Raw+3.579
Who+3.495
Neg logit contributions
 pleasure-6.606
 generosity-6.515
ipp-6.249
 kindness-6.140
 appreciation-6.017
Token?"
Token ID1701
Feature activation+2.66e-01

Pos logit contributions
 stub+5.002
 bumped+4.588
Ro+4.224
 landed+4.091
Raw+3.998
Neg logit contributions
 pleasure-9.149
ipp-8.860
 generosity-8.251
ums-7.885
 appreciation-7.881
Token Mia
Token ID32189
Feature activation-1.30e+00

Pos logit contributions
 bumped+0.378
 stub+0.377
 accidentally+0.351
 tri+0.327
 landed+0.327
Neg logit contributions
 pleasure-0.531
 generosity-0.522
 appreciation-0.504
 pride-0.484
ipp-0.483
Token says
Token ID1139
Feature activation-1.37e+00

Pos logit contributions
 pleasure+2.352
 generosity+2.267
 appreciation+2.122
ipp+2.070
 pride+2.050
Neg logit contributions
 bumped-2.145
 stub-2.112
 accidentally-2.035
?"-1.961
 tri-1.918
Token,
Token ID11
Feature activation+1.14e+00

Pos logit contributions
 pleasure+3.208
 generosity+3.135
 appreciation+3.025
 pride+2.942
 kinds+2.938
Neg logit contributions
 bumped-1.486
 stub-1.476
 accidentally-1.372
?"-1.238
 tri-1.235
Token "
Token ID366
Feature activation-4.38e-01

Pos logit contributions
 stub+1.707
 bumped+1.699
 accidentally+1.581
Ro+1.484
?"+1.483
Neg logit contributions
 pleasure-2.126
 generosity-2.095
ipp-1.930
 appreciation-1.924
 pride-1.922
Token playing
Token ID2712
Feature activation+2.00e+00

Pos logit contributions
 stub+1.131
 bumped+1.112
 accidentally+1.018
Ro+0.972
 landed+0.970
Neg logit contributions
 pleasure-1.511
 generosity-1.496
 appreciation-1.429
 kinds-1.399
 choices-1.399
Token with
Token ID351
Feature activation+4.59e-01

Pos logit contributions
 stub+3.091
 bumped+3.017
 accidentally+2.645
 landed+2.645
 tri+2.578
Neg logit contributions
 generosity-3.539
 pleasure-3.516
 appreciation-3.445
 kinds-3.408
 sorts-3.385
Token her
Token ID607
Feature activation-2.22e-01

Pos logit contributions
 stub+0.630
 bumped+0.626
 accidentally+0.568
 tri+0.544
?"+0.536
Neg logit contributions
 pleasure-0.915
 generosity-0.904
 appreciation-0.875
 pride-0.857
ipp-0.851
Token doll
Token ID3654
Feature activation+1.29e+00

Pos logit contributions
 pleasure+0.384
 generosity+0.375
 appreciation+0.354
ipp+0.343
 pride+0.343
Neg logit contributions
 bumped-0.376
 stub-0.373
 accidentally-0.353
 tri-0.334
?"-0.330
Token when
Token ID618
Feature activation+2.18e+00

Pos logit contributions
 bumped+1.793
 stub+1.792
Ro+1.604
Raw+1.562
 accidentally+1.554
Neg logit contributions
 generosity-2.519
 pleasure-2.501
ipp-2.368
 appreciation-2.363
 choices-2.317
Token it
Token ID340
Feature activation+3.71e+00

Pos logit contributions
 stub+2.923
 bumped+2.888
Ro+2.754
?"+2.706
 accidentally+2.568
Neg logit contributions
 pleasure-4.420
 generosity-4.276
 pride-4.112
 appreciation-4.087
 kindness-3.919
Token accidentally
Token ID14716
Feature activation+2.78e+00

Pos logit contributions
Who+5.292
Ro+5.104
 spy+5.009
Please+5.008
?"+4.967
Neg logit contributions
ipp-6.504
 kinds-5.925
 appreciation-5.825
 pride-5.746
 generosity-5.741
Token fell
Token ID3214
Feature activation+1.64e+00

Pos logit contributions
Please+2.665
Ro+2.585
 seek+2.552
 stub+2.519
Who+2.503
Neg logit contributions
 pleasure-6.201
 appreciation-6.082
 pride-6.054
 generosity-6.024
ipp-5.909
Token out
Token ID503
Feature activation+1.01e+00

Pos logit contributions
 stub+2.694
 bumped+2.631
 accidentally+2.517
 seek+2.446
Who+2.406
Neg logit contributions
 pleasure-2.731
ipp-2.680
 pride-2.600
 appreciation-2.596
 generosity-2.573
Token of
Token ID286
Feature activation+1.86e+00

Pos logit contributions
 stub+2.285
 bumped+2.261
 accidentally+2.214
 tri+2.093
Ro+2.081
Neg logit contributions
 pleasure-1.089
 appreciation-1.075
 generosity-1.071
 kinds-1.021
 pride-1.011
Token her
Token ID607
Feature activation+8.09e-01

Pos logit contributions
 stub+2.906
 bumped+2.853
 accidentally+2.670
Ro+2.634
 tri+2.583
Neg logit contributions
 pleasure-3.584
 pride-3.412
 appreciation-3.375
 generosity-3.355
 kinds-3.120
Token then
Token ID788
Feature activation+1.75e+00

Pos logit contributions
Ro+4.848
 stub+4.840
Raw+4.746
 bumped+4.630
?"+4.314
Neg logit contributions
 pleasure-5.465
 generosity-5.078
 pride-4.930
ipp-4.835
 kindness-4.784
Token,
Token ID11
Feature activation+3.00e+00

Pos logit contributions
 stub+2.900
 bumped+2.752
Ro+2.637
Please+2.534
 accidentally+2.500
Neg logit contributions
 pleasure-2.950
 generosity-2.867
 appreciation-2.734
 pride-2.704
 choices-2.655
Token she
Token ID673
Feature activation+3.19e+00

Pos logit contributions
 stub+4.597
Ro+4.481
 bumped+4.429
Raw+4.388
?"+4.195
Neg logit contributions
 pleasure-5.363
 generosity-5.074
 pride-4.958
 kindness-4.891
 appreciation-4.718
Token fell
Token ID3214
Feature activation+2.55e+00

Pos logit contributions
Ro+3.579
jar+3.098
 seek+3.000
Raw+2.973
 Ach+2.863
Neg logit contributions
ipp-6.934
ealing-6.798
 pride-6.689
 pleasure-6.511
ish-6.466
Token off
Token ID572
Feature activation+2.03e+00

Pos logit contributions
 stub+4.272
 accidentally+4.209
Who+4.200
 bumped+4.165
Ro+4.073
Neg logit contributions
ipp-4.123
 pleasure-3.986
 pride-3.936
 appreciation-3.782
 kinds-3.735
Token and
Token ID290
Feature activation+3.71e+00

Pos logit contributions
 stub+3.750
 bumped+3.540
 accidentally+3.482
Ro+3.348
 tri+3.245
Neg logit contributions
 pleasure-3.040
 pride-3.007
 generosity-3.004
 appreciation-2.962
 kinds-2.943
Token hurt
Token ID5938
Feature activation+1.30e+00

Pos logit contributions
Ro+4.786
 spy+4.537
Raw+4.493
Who+4.410
storm+4.368
Neg logit contributions
 pride-7.416
 pleasure-6.851
 appreciation-6.701
 kinds-6.668
 generosity-6.637
Token her
Token ID607
Feature activation+8.76e-01

Pos logit contributions
 bumped+2.053
 stub+2.045
 accidentally+1.968
?"+1.782
Ro+1.754
Neg logit contributions
 pleasure-2.328
 generosity-2.274
 appreciation-2.269
 pride-2.240
ipp-2.230
Token knee
Token ID10329
Feature activation+1.72e+00

Pos logit contributions
 bumped+1.525
 accidentally+1.476
 stub+1.456
Ro+1.396
 tri+1.339
Neg logit contributions
 pleasure-1.503
 generosity-1.459
 appreciation-1.395
 choices-1.341
 pride-1.333
Token.
Token ID13
Feature activation+1.64e+00

Pos logit contributions
 stub+2.213
 bumped+2.207
Ro+2.165
 accidentally+2.152
Who+2.075
Neg logit contributions
 pleasure-3.252
 generosity-3.241
 appreciation-3.161
ipp-3.150
 choices-3.129
Token
Token ID198
Feature activation+1.28e+00

Pos logit contributions
 bumped+3.126
 stub+2.918
 seek+2.908
 accidentally+2.798
 tri+2.783
Neg logit contributions
 pleasure-2.502
 pride-2.219
 generosity-2.213
ipp-2.213
 appreciation-2.211
Token all
Token ID477
Feature activation-1.63e+00

Pos logit contributions
 stub+2.046
 bumped+1.922
 accidentally+1.805
Ro+1.705
 tri+1.702
Neg logit contributions
 pleasure-2.220
 generosity-2.122
 appreciation-2.037
 pride-1.958
 choices-1.925
Token of
Token ID286
Feature activation+3.06e-01

Pos logit contributions
 pleasure+3.424
 generosity+3.290
ipp+3.125
 appreciation+3.095
izza+3.051
Neg logit contributions
 bumped-2.274
 stub-2.270
 accidentally-2.150
?"-2.107
 landed-2.019
Token a
Token ID257
Feature activation+8.73e-01

Pos logit contributions
 stub+0.465
 bumped+0.459
 accidentally+0.428
 tri+0.398
?"+0.384
Neg logit contributions
 pleasure-0.594
 generosity-0.582
 appreciation-0.561
 pride-0.542
 kinds-0.527
Token sudden
Token ID4802
Feature activation+1.39e+00

Pos logit contributions
 bumped+1.422
 stub+1.407
 accidentally+1.330
?"+1.327
 landed+1.281
Neg logit contributions
 pleasure-1.499
 generosity-1.484
 appreciation-1.437
 pride-1.358
 kindness-1.288
Token,
Token ID11
Feature activation+1.74e+00

Pos logit contributions
 stub+2.314
 bumped+2.136
 accidentally+2.048
Please+1.969
Ro+1.962
Neg logit contributions
 pleasure-2.429
 generosity-2.382
 appreciation-2.251
 pride-2.200
 kindness-2.147
Token it
Token ID340
Feature activation+3.70e+00

Pos logit contributions
 stub+2.931
 bumped+2.778
 accidentally+2.545
Ro+2.461
 seek+2.417
Neg logit contributions
 pleasure-3.189
 generosity-3.018
 appreciation-2.855
 kindness-2.778
 pride-2.765
Token heard
Token ID2982
Feature activation+1.36e+00

Pos logit contributions
Ro+4.454
 university+4.421
?"+4.351
Who+4.337
 seeker+4.265
Neg logit contributions
 pride-6.999
 appreciation-6.996
 generosity-6.777
ealing-6.482
apa-6.443
Token a
Token ID257
Feature activation+6.29e-01

Pos logit contributions
 stub+2.296
 bumped+2.240
 accidentally+2.105
 seek+2.016
 tri+1.942
Neg logit contributions
 pleasure-2.453
 generosity-2.409
 appreciation-2.384
 pride-2.304
 kindness-2.230
Token loud
Token ID7812
Feature activation+2.41e-01

Pos logit contributions
 stub+0.904
 bumped+0.901
 accidentally+0.844
 tri+0.768
?"+0.765
Neg logit contributions
 pleasure-1.234
 generosity-1.234
 appreciation-1.191
 pride-1.118
 choices-1.103
Token noise
Token ID7838
Feature activation+1.90e+00

Pos logit contributions
 stub+0.281
 bumped+0.276
 accidentally+0.260
 tri+0.229
?"+0.221
Neg logit contributions
 pleasure-0.532
 generosity-0.528
 appreciation-0.508
 pride-0.489
ipp-0.488
Token.
Token ID13
Feature activation+4.81e-01

Pos logit contributions
 stub+2.924
 accidentally+2.730
 bumped+2.725
 seek+2.422
Ro+2.410
Neg logit contributions
 generosity-3.430
 kinds-3.360
 pleasure-3.350
 appreciation-3.325
 sorts-3.277
Token."
Token ID526
Feature activation-2.21e-01

Pos logit contributions
 stub+0.146
 bumped+0.145
 accidentally+0.136
 tri+0.125
 landed+0.120
Neg logit contributions
 pleasure-0.216
 generosity-0.214
 appreciation-0.205
 pride-0.199
ipp-0.199
Token
Token ID198
Feature activation+1.00e-01

Pos logit contributions
 pleasure+0.464
 generosity+0.453
 appreciation+0.432
ipp+0.425
 pride+0.424
Neg logit contributions
 bumped-0.295
 stub-0.291
 accidentally-0.273
 tri-0.250
 landed-0.246
Token
Token ID198
Feature activation-5.10e-01

Pos logit contributions
 bumped+0.141
 stub+0.137
 accidentally+0.131
 tri+0.120
 landed+0.115
Neg logit contributions
 pleasure-0.202
 generosity-0.199
 appreciation-0.191
ipp-0.187
 pride-0.186
TokenThey
Token ID2990
Feature activation-6.72e-01

Pos logit contributions
 pleasure+1.121
 generosity+1.102
 appreciation+1.054
ipp+1.033
 pride+1.027
Neg logit contributions
 bumped-0.625
 stub-0.623
 accidentally-0.584
 tri-0.528
?"-0.523
Token shared
Token ID4888
Feature activation-1.97e+00

Pos logit contributions
 pleasure+1.392
 generosity+1.362
 appreciation+1.298
ipp+1.266
 pride+1.259
Neg logit contributions
 bumped-0.904
 stub-0.901
 accidentally-0.852
 tri-0.780
?"-0.771
Token Ben
Token ID3932
Feature activation-4.87e+00

Pos logit contributions
 pleasure+4.218
 generosity+4.012
ipp+3.742
 appreciation+3.726
 pride+3.667
Neg logit contributions
 bumped-2.688
 stub-2.542
 accidentally-2.525
?"-2.393
 landed-2.352
Token's
Token ID338
Feature activation-2.94e+00

Pos logit contributions
 pleasure+14.113
umm+12.415
 generosity+12.181
ipp+11.934
omp+11.846
Neg logit contributions
 bumped-5.673
 landed-5.254
 saw-5.184
 broke-5.125
 heard-5.095
Token ice
Token ID4771
Feature activation-3.14e+00

Pos logit contributions
 pleasure+5.708
ipp+5.590
 generosity+5.525
ums+5.246
 pride+5.211
Neg logit contributions
 bumped-4.310
 stub-4.185
?"-4.007
 accidentally-3.870
 landed-3.793
Token-
Token ID12
Feature activation-3.37e+00

Pos logit contributions
 pleasure+7.656
 generosity+7.000
 pride+6.866
 appreciation+6.698
 equal+6.363
Neg logit contributions
 bumped-3.909
?"-3.635
 stub-3.381
 landed-3.322
 hit-3.280
Tokencream
Token ID36277
Feature activation-1.76e+00

Pos logit contributions
 pleasure+5.843
 generosity+5.212
 appreciation+5.086
 pride+4.852
 decisions+4.501
Neg logit contributions
 bumped-6.944
 stub-6.350
 landed-6.221
 accidentally-6.121
 broke-6.020
Token and
Token ID290
Feature activation-5.58e-01

Pos logit contributions
 pleasure+3.956
 generosity+3.770
 appreciation+3.654
 pride+3.561
ipp+3.542
Neg logit contributions
 bumped-2.090
 stub-1.971
 accidentally-1.908
?"-1.807
 suddenly-1.776
Token!
Token ID0
Feature activation-8.00e-01

Pos logit contributions
 pleasure+1.706
 generosity+1.674
 appreciation+1.608
ipp+1.578
 pride+1.570
Neg logit contributions
 bumped-0.712
 stub-0.709
 accidentally-0.656
 tri-0.580
?"-0.572
Token
Token ID198
Feature activation+3.38e-01

Pos logit contributions
 pleasure+1.335
 generosity+1.311
 appreciation+1.232
ipp+1.195
 pride+1.187
Neg logit contributions
 bumped-1.396
 stub-1.395
 accidentally-1.343
 tri-1.250
?"-1.243
Token
Token ID198
Feature activation-4.68e-01

Pos logit contributions
 bumped+0.479
 stub+0.477
 accidentally+0.444
 tri+0.417
 landed+0.401
Neg logit contributions
 pleasure-0.664
 generosity-0.643
 appreciation-0.632
ipp-0.610
 pride-0.603
TokenThey
Token ID2990
Feature activation-9.92e-01

Pos logit contributions
 pleasure+0.909
 generosity+0.891
 appreciation+0.849
ipp+0.828
 pride+0.823
Neg logit contributions
 stub-0.688
 bumped-0.688
 accidentally-0.649
 tri-0.600
?"-0.597
Token collected
Token ID7723
Feature activation-2.67e+00

Pos logit contributions
 pleasure+2.276
 generosity+2.218
 appreciation+2.118
ipp+2.079
 pride+2.058
Neg logit contributions
 bumped-1.153
 stub-1.127
 accidentally-1.079
 tri-0.967
 landed-0.949
Token all
Token ID477
Feature activation-4.86e+00

Pos logit contributions
 pleasure+5.805
 generosity+5.640
ipp+5.496
ums+5.408
izza+5.404
Neg logit contributions
 bumped-3.514
 landed-3.266
 accidentally-3.179
 stub-3.100
 suddenly-3.032
Token the
Token ID262
Feature activation-1.97e+00

Pos logit contributions
 generosity+15.536
 pleasure+15.530
ealing+15.454
izza+15.452
ums+14.993
Neg logit contributions
 landed-3.116
 bumped-2.680
 saw-2.626
?"-2.618
 behind-2.544
Token treasure
Token ID14068
Feature activation-1.73e+00

Pos logit contributions
 pleasure+3.421
 generosity+3.327
ipp+3.290
ums+3.162
 appreciation+3.128
Neg logit contributions
 bumped-3.357
 stub-3.229
 accidentally-3.168
?"-3.023
 landed-3.000
Token and
Token ID290
Feature activation-8.64e-01

Pos logit contributions
 pleasure+3.860
 generosity+3.701
 appreciation+3.587
ipp+3.578
 pride+3.512
Neg logit contributions
 bumped-2.081
 stub-1.984
 accidentally-1.948
 landed-1.798
?"-1.777
Token stored
Token ID8574
Feature activation-1.69e+00

Pos logit contributions
 pleasure+1.921
 generosity+1.873
 appreciation+1.796
ipp+1.772
 pride+1.748
Neg logit contributions
 bumped-1.042
 stub-1.028
 accidentally-0.979
 tri-0.880
 landed-0.866
Token it
Token ID340
Feature activation-1.20e+00

Pos logit contributions
 pleasure+3.787
 generosity+3.749
ipp+3.580
 appreciation+3.532
ums+3.511
Neg logit contributions
 bumped-1.995
 stub-1.845
 accidentally-1.825
 landed-1.713
?"-1.658
Token the
Token ID262
Feature activation-1.73e+00

Pos logit contributions
 pleasure+7.693
 generosity+7.655
izza+7.113
ipp+7.108
ums+7.103
Neg logit contributions
 bumped-3.011
?"-2.850
 stub-2.800
 accidentally-2.731
 landed-2.582
Token luggage
Token ID31488
Feature activation-5.09e-01

Pos logit contributions
 pleasure+3.229
 generosity+3.176
ipp+3.023
 appreciation+2.968
ums+2.912
Neg logit contributions
 bumped-2.771
 stub-2.667
 accidentally-2.599
?"-2.431
 landed-2.419
Token.
Token ID13
Feature activation-1.32e+00

Pos logit contributions
 pleasure+1.199
 generosity+1.179
 appreciation+1.130
ipp+1.108
 pride+1.103
Neg logit contributions
 stub-0.531
 bumped-0.530
 accidentally-0.491
 tri-0.436
?"-0.430
Token They
Token ID1119
Feature activation-1.69e+00

Pos logit contributions
 pleasure+2.248
 generosity+2.247
 appreciation+2.109
ipp+2.059
 pride+2.018
Neg logit contributions
 bumped-2.213
 stub-2.193
 accidentally-2.158
?"-1.989
 tri-1.969
Token added
Token ID2087
Feature activation-2.39e+00

Pos logit contributions
 pleasure+3.945
 generosity+3.807
 decisions+3.638
 appreciation+3.620
 choices+3.616
Neg logit contributions
 bumped-2.079
 accidentally-1.962
 stub-1.937
 tri-1.740
 landed-1.724
Token all
Token ID477
Feature activation-4.81e+00

Pos logit contributions
 pleasure+5.297
 generosity+5.194
ipp+4.949
 appreciation+4.868
ums+4.842
Neg logit contributions
 bumped-3.004
 accidentally-2.704
 landed-2.684
 stub-2.637
?"-2.624
Token the
Token ID262
Feature activation-2.45e+00

Pos logit contributions
 generosity+15.265
 pleasure+14.900
ums+14.841
ealing+14.616
izza+14.245
Neg logit contributions
?"-3.176
 bumped-3.116
 hit-2.795
 landed-2.792
 saw-2.641
Token luggage
Token ID31488
Feature activation-2.20e+00

Pos logit contributions
 pleasure+4.759
 generosity+4.692
ipp+4.642
ums+4.468
izza+4.351
Neg logit contributions
 bumped-3.838
 stub-3.524
 accidentally-3.493
 landed-3.357
?"-3.265
Token to
Token ID284
Feature activation-4.86e-01

Pos logit contributions
 pleasure+5.368
 generosity+5.167
 appreciation+5.023
ipp+4.948
 pride+4.907
Neg logit contributions
 bumped-2.341
 accidentally-2.112
 stub-2.078
 landed-1.886
 suddenly-1.879
Token the
Token ID262
Feature activation-5.46e-01

Pos logit contributions
 pleasure+0.964
 generosity+0.943
 appreciation+0.897
ipp+0.873
 pride+0.872
Neg logit contributions
 stub-0.699
 bumped-0.697
 accidentally-0.658
 tri-0.607
?"-0.601
Token truck
Token ID7779
Feature activation-1.18e+00

Pos logit contributions
 pleasure+1.158
 generosity+1.134
 appreciation+1.083
ipp+1.058
 pride+1.054
Neg logit contributions
 bumped-0.714
 stub-0.714
 accidentally-0.671
 tri-0.612
?"-0.608
TokenBen
Token ID11696
Feature activation-7.41e-01

Pos logit contributions
 pleasure+0.253
 generosity+0.251
 appreciation+0.243
ipp+0.238
 pride+0.236
Neg logit contributions
 bumped-0.101
 stub-0.099
 accidentally-0.090
 tri-0.079
 landed-0.078
Token and
Token ID290
Feature activation+1.63e+00

Pos logit contributions
 pleasure+1.481
 generosity+1.445
 appreciation+1.374
ipp+1.342
 pride+1.336
Neg logit contributions
 bumped-1.050
 stub-1.047
 accidentally-0.993
 tri-0.915
?"-0.909
Token Lily
Token ID20037
Feature activation-1.33e+00

Pos logit contributions
 bumped+1.695
 stub+1.660
?"+1.409
 landed+1.405
 accidentally+1.392
Neg logit contributions
 generosity-3.847
 pleasure-3.842
 appreciation-3.707
 pride-3.616
 kinds-3.501
Token hugged
Token ID41130
Feature activation-5.10e-01

Pos logit contributions
 pleasure+2.640
 generosity+2.550
 appreciation+2.405
ipp+2.339
 choices+2.337
Neg logit contributions
 bumped-1.963
 stub-1.923
 accidentally-1.849
 tri-1.727
 landed-1.695
Token grand
Token ID4490
Feature activation-2.57e+00

Pos logit contributions
 pleasure+0.938
 generosity+0.917
 appreciation+0.869
ipp+0.845
 pride+0.843
Neg logit contributions
 bumped-0.807
 stub-0.807
 accidentally-0.766
 tri-0.711
?"-0.705
Tokenpa
Token ID8957
Feature activation-4.65e+00

Pos logit contributions
 pleasure+4.109
 generosity+3.656
ipp+3.519
 appreciation+3.438
izza+3.344
Neg logit contributions
 bumped-4.762
?"-4.747
 stub-4.641
 accidentally-4.442
 tri-4.375
Token.
Token ID13
Feature activation-5.16e-01

Pos logit contributions
 pleasure+14.244
ipp+13.294
 generosity+13.109
omp+13.100
ums+13.076
Neg logit contributions
 behind-2.960
 bumped-2.869
 saw-2.682
 under-2.679
?"-2.634
Token They
Token ID1119
Feature activation-8.25e-01

Pos logit contributions
 pleasure+0.802
 generosity+0.779
 appreciation+0.731
ipp+0.707
 pride+0.704
Neg logit contributions
 bumped-0.963
 stub-0.963
 accidentally-0.922
 tri-0.866
?"-0.860
Token said
Token ID531
Feature activation-1.99e+00

Pos logit contributions
 pleasure+1.722
 generosity+1.681
 appreciation+1.601
ipp+1.566
 pride+1.553
Neg logit contributions
 bumped-1.101
 stub-1.093
 accidentally-1.038
 tri-0.949
?"-0.938
Token,
Token ID11
Feature activation-4.49e-01

Pos logit contributions
 pleasure+4.656
 generosity+4.502
 appreciation+4.313
ipp+4.307
 kinds+4.280
Neg logit contributions
 bumped-2.207
 stub-2.113
 accidentally-2.033
?"-1.919
 suddenly-1.831
Token "
Token ID366
Feature activation-1.57e+00

Pos logit contributions
 pleasure+0.962
 generosity+0.944
 appreciation+0.900
ipp+0.881
 pride+0.876
Neg logit contributions
 bumped-0.569
 stub-0.568
 accidentally-0.532
 tri-0.485
?"-0.479
Token for
Token ID329
Feature activation-1.60e-01

Pos logit contributions
 pleasure+0.833
 generosity+0.818
 appreciation+0.779
ipp+0.762
 pride+0.757
Neg logit contributions
 stub-0.534
 bumped-0.534
 accidentally-0.502
 tri-0.460
?"-0.448
Token you
Token ID345
Feature activation+2.25e-01

Pos logit contributions
 pleasure+0.324
 generosity+0.318
 appreciation+0.302
ipp+0.294
 pride+0.294
Neg logit contributions
 stub-0.225
 bumped-0.224
 accidentally-0.209
 tri-0.193
 landed-0.187
Token."
Token ID526
Feature activation-1.72e+00

Pos logit contributions
 stub+0.272
 bumped+0.270
 accidentally+0.249
 tri+0.227
 landed+0.220
Neg logit contributions
 pleasure-0.503
 generosity-0.490
 appreciation-0.468
ipp-0.457
 pride-0.456
Token Bill
Token ID3941
Feature activation-2.11e+00

Pos logit contributions
 pleasure+3.714
 generosity+3.677
 appreciation+3.477
 pride+3.423
ipp+3.404
Neg logit contributions
 bumped-2.101
 stub-2.070
 accidentally-2.037
?"-1.886
 tri-1.825
Token gave
Token ID2921
Feature activation-2.28e+00

Pos logit contributions
 pleasure+5.193
 generosity+4.921
 appreciation+4.722
 decisions+4.705
 choices+4.691
Neg logit contributions
 bumped-2.382
 accidentally-2.303
 stub-2.259
 landed-2.061
 tri-2.008
Token Bob
Token ID5811
Feature activation-4.63e+00

Pos logit contributions
 pleasure+4.699
 generosity+4.601
izza+4.280
 appreciation+4.257
ipp+4.207
Neg logit contributions
 bumped-3.305
 accidentally-3.075
 stub-3.056
 landed-2.984
?"-2.898
Token the
Token ID262
Feature activation-1.90e+00

Pos logit contributions
 pleasure+15.007
 generosity+13.769
 decisions+13.424
 appreciation+13.364
ums+13.326
Neg logit contributions
 bumped-3.274
 saw-2.833
 landed-2.809
 heard-2.639
 accidentally-2.589
Token bread
Token ID8509
Feature activation-3.36e+00

Pos logit contributions
 pleasure+3.027
 generosity+2.946
ipp+2.876
 appreciation+2.823
ums+2.775
Neg logit contributions
 stub-3.369
 bumped-3.359
 accidentally-3.249
?"-3.134
 landed-3.032
Token,
Token ID11
Feature activation-2.42e+00

Pos logit contributions
 pleasure+8.827
 generosity+8.198
ipp+7.993
 appreciation+7.945
 decisions+7.883
Neg logit contributions
 bumped-3.179
 stub-3.038
 accidentally-3.025
 landed-2.827
 suddenly-2.793
Token and
Token ID290
Feature activation-1.36e+00

Pos logit contributions
 pleasure+5.256
 generosity+4.952
ipp+4.874
 appreciation+4.862
 decisions+4.841
Neg logit contributions
 accidentally-3.252
 bumped-3.200
 stub-3.160
 landed-2.835
 tri-2.829
Token Bob
Token ID5811
Feature activation-2.42e+00

Pos logit contributions
 pleasure+3.042
 generosity+2.931
 appreciation+2.813
ipp+2.791
 pride+2.765
Neg logit contributions
 bumped-1.661
 stub-1.609
 accidentally-1.578
 landed-1.414
 tri-1.414
Token.
Token ID13
Feature activation-6.07e-01

Pos logit contributions
 pleasure+0.621
 generosity+0.606
 appreciation+0.576
ipp+0.563
 pride+0.557
Neg logit contributions
 stub-0.540
 bumped-0.540
 accidentally-0.512
 tri-0.476
?"-0.469
Token
Token ID198
Feature activation-7.25e-02

Pos logit contributions
 pleasure+1.180
 generosity+1.155
 appreciation+1.097
ipp+1.071
 pride+1.065
Neg logit contributions
 bumped-0.890
 stub-0.890
 accidentally-0.844
 tri-0.778
?"-0.773
Token
Token ID198
Feature activation-3.78e-01

Pos logit contributions
 pleasure+0.150
 generosity+0.145
 appreciation+0.141
ipp+0.137
 pride+0.137
Neg logit contributions
 bumped-0.097
 stub-0.097
 accidentally-0.090
 tri-0.084
 landed-0.082
TokenPeter
Token ID19727
Feature activation-1.10e+00

Pos logit contributions
 pleasure+0.675
 generosity+0.660
 appreciation+0.628
ipp+0.608
 pride+0.607
Neg logit contributions
 bumped-0.615
 stub-0.614
 accidentally-0.581
 tri-0.544
?"-0.540
Token ate
Token ID15063
Feature activation-1.76e+00

Pos logit contributions
 pleasure+2.632
 generosity+2.570
 appreciation+2.457
ipp+2.415
 pride+2.409
Neg logit contributions
 bumped-1.156
 stub-1.134
 accidentally-1.075
 tri-0.955
 landed-0.939
Token all
Token ID477
Feature activation-4.59e+00

Pos logit contributions
 pleasure+3.804
 generosity+3.794
 appreciation+3.589
ipp+3.577
ums+3.536
Neg logit contributions
 bumped-2.157
 accidentally-2.045
 stub-2.034
?"-1.925
 suddenly-1.860
Token the
Token ID262
Feature activation-3.20e+00

Pos logit contributions
 generosity+14.138
 pleasure+13.875
izza+13.605
ealing+13.561
ums+13.338
Neg logit contributions
?"-3.172
 bumped-2.815
 suddenly-2.687
 landed-2.673
 hit-2.436
Token food
Token ID2057
Feature activation-2.00e+00

Pos logit contributions
 pleasure+5.709
 generosity+5.550
ipp+5.525
ums+5.430
 appreciation+5.284
Neg logit contributions
 bumped-5.469
?"-5.249
 stub-5.202
 seek-5.188
 landed-5.187
Token and
Token ID290
Feature activation-9.98e-01

Pos logit contributions
 pleasure+4.603
 generosity+4.488
 appreciation+4.355
ipp+4.256
 pride+4.235
Neg logit contributions
 bumped-2.294
 stub-2.196
 accidentally-2.156
 suddenly-1.948
?"-1.926
Token enjoyed
Token ID8359
Feature activation-1.33e+00

Pos logit contributions
 pleasure+2.148
 generosity+2.091
 appreciation+2.001
ipp+1.987
 pride+1.951
Neg logit contributions
 bumped-1.259
 stub-1.245
 accidentally-1.193
 tri-1.075
 landed-1.058
Token the
Token ID262
Feature activation-2.95e+00

Pos logit contributions
 pleasure+2.904
 generosity+2.847
 appreciation+2.715
ipp+2.701
 pride+2.659
Neg logit contributions
 bumped-1.619
 stub-1.585
 accidentally-1.531
?"-1.374
 tri-1.371
Token.
Token ID13
Feature activation-1.07e+00

Pos logit contributions
 pleasure+5.406
 generosity+5.190
ipp+5.103
 appreciation+5.023
 pride+4.949
Neg logit contributions
 bumped-2.392
 stub-2.318
 accidentally-2.239
?"-2.130
 tri-2.039
Token Together
Token ID17083
Feature activation-2.51e+00

Pos logit contributions
 pleasure+1.934
 generosity+1.921
 appreciation+1.810
ipp+1.754
 pride+1.737
Neg logit contributions
 stub-1.676
 bumped-1.667
 accidentally-1.618
?"-1.487
 tri-1.482
Token,
Token ID11
Feature activation-2.27e+00

Pos logit contributions
 pleasure+6.349
 generosity+6.093
ipp+5.791
 appreciation+5.755
 kinds+5.733
Neg logit contributions
 bumped-2.540
 accidentally-2.499
 stub-2.348
 suddenly-2.313
 tri-2.181
Token they
Token ID484
Feature activation-1.55e+00

Pos logit contributions
 pleasure+5.439
 generosity+5.293
 appreciation+5.087
ipp+5.053
ums+5.003
Neg logit contributions
 accidentally-2.462
 bumped-2.420
 stub-2.321
 suddenly-2.196
 tri-2.086
Token collected
Token ID7723
Feature activation-2.75e+00

Pos logit contributions
 pleasure+3.573
 generosity+3.485
 appreciation+3.301
 choices+3.282
 decisions+3.278
Neg logit contributions
 bumped-1.875
 stub-1.806
 accidentally-1.751
 landed-1.615
 tri-1.593
Token all
Token ID477
Feature activation-4.55e+00

Pos logit contributions
 pleasure+6.419
 generosity+6.332
izza+6.020
ums+5.962
ipp+5.940
Neg logit contributions
 bumped-3.172
 accidentally-2.952
 landed-2.952
 stub-2.784
 suddenly-2.763
Token the
Token ID262
Feature activation-2.82e+00

Pos logit contributions
 pleasure+13.777
 generosity+13.704
izza+13.631
ealing+13.317
ums+13.186
Neg logit contributions
 landed-3.240
 bumped-2.958
 accidentally-2.805
 hit-2.741
 saw-2.723
Token stones
Token ID14966
Feature activation-2.31e+00

Pos logit contributions
 pleasure+5.638
 generosity+5.591
ums+5.484
ipp+5.452
izza+5.366
Neg logit contributions
 bumped-4.202
 accidentally-4.038
 stub-3.918
 landed-3.805
 seek-3.720
Token Jack
Token ID3619
Feature activation-1.77e+00

Pos logit contributions
 pleasure+5.444
 generosity+5.302
 appreciation+5.108
 pride+4.967
izza+4.957
Neg logit contributions
 bumped-2.597
 stub-2.495
 accidentally-2.474
 landed-2.302
 suddenly-2.266
Token wanted
Token ID2227
Feature activation-1.17e+00

Pos logit contributions
 pleasure+4.156
 generosity+4.077
 appreciation+3.948
ipp+3.870
 pride+3.834
Neg logit contributions
 stub-1.947
 bumped-1.942
 accidentally-1.880
 tri-1.672
 landed-1.663
Token.
Token ID13
Feature activation-1.79e+00

Pos logit contributions
 pleasure+2.737
 generosity+2.683
 appreciation+2.580
ipp+2.546
 pride+2.519
Neg logit contributions
 bumped-1.247
 stub-1.226
 accidentally-1.178
 tri-1.040
?"-1.030
Token,
Token ID11
Feature activation-1.50e+00

Pos logit contributions
 pleasure+2.977
 generosity+2.936
 appreciation+2.807
ipp+2.746
 pride+2.725
Neg logit contributions
 bumped-1.674
 stub-1.654
 accidentally-1.600
?"-1.562
 tri-1.459
Token Dad
Token ID17415
Feature activation-1.68e+00

Pos logit contributions
 pleasure+2.707
 generosity+2.695
 appreciation+2.532
ipp+2.506
 kinds+2.454
Neg logit contributions
 stub-2.298
 bumped-2.286
 accidentally-2.248
?"-2.168
 tri-2.083
Token."
Token ID526
Feature activation-1.32e+00

Pos logit contributions
 pleasure+4.378
 generosity+4.271
 appreciation+4.136
 pride+4.074
ipp+4.039
Neg logit contributions
 bumped-1.359
?"-1.294
 stub-1.272
 accidentally-1.223
 tri-1.050
Token They
Token ID1119
Feature activation-1.61e+00

Pos logit contributions
 pleasure+2.701
 generosity+2.659
 appreciation+2.510
ipp+2.447
 pride+2.434
Neg logit contributions
 stub-1.809
 bumped-1.791
 accidentally-1.750
?"-1.640
 tri-1.580
Token hugged
Token ID41130
Feature activation-1.62e+00

Pos logit contributions
 pleasure+3.800
 generosity+3.696
 appreciation+3.529
 decisions+3.503
 choices+3.489
Neg logit contributions
 bumped-1.808
 stub-1.731
 accidentally-1.669
 tri-1.528
 landed-1.517
Token Dad
Token ID17415
Feature activation-4.55e+00

Pos logit contributions
 pleasure+3.324
 generosity+3.212
 appreciation+3.012
ipp+2.972
izza+2.923
Neg logit contributions
 bumped-2.260
 stub-2.234
 accidentally-2.157
?"-2.053
 tri-1.981
Token and
Token ID290
Feature activation-9.01e-01

Pos logit contributions
 pleasure+14.434
 generosity+13.086
ums+12.720
 appreciation+12.676
omp+12.485
Neg logit contributions
 bumped-3.180
 saw-2.902
 woke-2.842
 heard-2.771
 under-2.756
Token said
Token ID531
Feature activation-2.55e+00

Pos logit contributions
 pleasure+1.917
 generosity+1.868
 appreciation+1.786
ipp+1.757
 pride+1.736
Neg logit contributions
 bumped-1.162
 stub-1.154
 accidentally-1.095
 tri-0.998
?"-0.987
Token,
Token ID11
Feature activation-8.51e-01

Pos logit contributions
 pleasure+6.321
 generosity+6.108
 sorts+5.847
 appreciation+5.837
 kinds+5.827
Neg logit contributions
 bumped-2.548
 stub-2.346
?"-2.269
 accidentally-2.229
 suddenly-2.060
Token "
Token ID366
Feature activation-2.11e+00

Pos logit contributions
 pleasure+1.888
 generosity+1.848
 appreciation+1.773
ipp+1.730
 pride+1.727
Neg logit contributions
 bumped-1.013
 stub-1.010
 accidentally-0.949
 tri-0.858
?"-0.852
TokenYou
Token ID1639
Feature activation-2.43e+00

Pos logit contributions
 pleasure+4.741
 generosity+4.696
 appreciation+4.530
 decisions+4.413
 pride+4.378
Neg logit contributions
 stub-2.450
 bumped-2.344
 accidentally-2.241
Please-2.217
Can-2.152
Token
Token ID198
Feature activation-1.63e+00

Pos logit contributions
 pleasure+2.427
 generosity+2.376
 appreciation+2.256
ipp+2.211
 pride+2.211
Neg logit contributions
 stub-1.235
 bumped-1.221
 accidentally-1.151
?"-1.062
 tri-1.030
TokenThe
Token ID464
Feature activation-1.17e+00

Pos logit contributions
 pleasure+3.292
 generosity+3.121
 appreciation+2.916
 pride+2.901
 choices+2.877
Neg logit contributions
 bumped-2.386
 stub-2.380
 accidentally-2.299
 suddenly-2.141
?"-2.139
Token shop
Token ID6128
Feature activation-2.20e+00

Pos logit contributions
 pleasure+2.426
 generosity+2.357
 appreciation+2.250
ipp+2.236
 pride+2.185
Neg logit contributions
 bumped-1.592
 stub-1.562
 accidentally-1.484
?"-1.376
 tri-1.339
Tokenkeeper
Token ID13884
Feature activation-1.90e+00

Pos logit contributions
 pleasure+4.287
 generosity+4.010
 appreciation+3.805
 pride+3.770
 decisions+3.758
Neg logit contributions
 bumped-3.498
 stub-3.351
 accidentally-3.315
 suddenly-3.123
 tri-3.070
Token gathered
Token ID9272
Feature activation-3.23e+00

Pos logit contributions
 pleasure+4.304
 generosity+4.078
 choices+3.895
 decisions+3.881
 appreciation+3.864
Neg logit contributions
 bumped-2.456
 accidentally-2.348
 stub-2.331
 suddenly-2.109
 tri-2.108
Token all
Token ID477
Feature activation-4.54e+00

Pos logit contributions
 pleasure+7.847
 generosity+7.712
izza+7.373
lette+7.269
ipp+7.261
Neg logit contributions
 bumped-3.629
 accidentally-3.267
 suddenly-3.212
 stub-3.179
 landed-3.121
Token of
Token ID286
Feature activation-2.25e+00

Pos logit contributions
 pleasure+14.036
 generosity+14.008
izza+13.712
ealing+13.566
lette+13.281
Neg logit contributions
 bumped-3.268
 landed-3.025
 accidentally-2.806
 hit-2.780
 stub-2.708
Token the
Token ID262
Feature activation-1.26e+00

Pos logit contributions
 pleasure+5.229
 generosity+5.153
izza+4.875
 appreciation+4.842
ealing+4.831
Neg logit contributions
 bumped-2.627
 accidentally-2.523
 stub-2.482
 suddenly-2.193
 tri-2.160
Token items
Token ID3709
Feature activation-2.44e+00

Pos logit contributions
 pleasure+2.455
 generosity+2.393
 appreciation+2.255
ipp+2.242
izza+2.204
Neg logit contributions
 bumped-1.913
 stub-1.872
 accidentally-1.785
 tri-1.652
 landed-1.649
Token and
Token ID290
Feature activation-1.08e+00

Pos logit contributions
 pleasure+6.114
 generosity+5.868
 appreciation+5.695
ipp+5.551
ums+5.527
Neg logit contributions
 bumped-2.493
 accidentally-2.253
 stub-2.226
 suddenly-2.151
 landed-2.095
Token told
Token ID1297
Feature activation-1.65e+00

Pos logit contributions
 pleasure+2.378
 generosity+2.316
 appreciation+2.211
ipp+2.169
 pride+2.154
Neg logit contributions
 bumped-1.349
 stub-1.322
 accidentally-1.279
 tri-1.149
 landed-1.137
Token Tom
Token ID4186
Feature activation-6.82e-02

Pos logit contributions
 stub+1.281
 bumped+1.197
 accidentally+1.091
 tri+1.026
 seek+1.002
Neg logit contributions
 pleasure-1.774
 generosity-1.709
 appreciation-1.643
 pride-1.602
ipp-1.584
Token and
Token ID290
Feature activation+2.79e-01

Pos logit contributions
 pleasure+0.134
 generosity+0.133
 appreciation+0.126
ipp+0.124
 pride+0.123
Neg logit contributions
 stub-0.095
 bumped-0.094
 accidentally-0.087
 tri-0.081
?"-0.079
Token his
Token ID465
Feature activation-1.13e+00

Pos logit contributions
 bumped+0.430
 stub+0.430
 accidentally+0.400
 tri+0.381
?"+0.366
Neg logit contributions
 generosity-0.518
 pleasure-0.516
 appreciation-0.491
 pride-0.475
 kinds-0.462
Token rabbit
Token ID22746
Feature activation-1.28e+00

Pos logit contributions
 pleasure+1.960
 generosity+1.898
 appreciation+1.800
ipp+1.782
 pride+1.745
Neg logit contributions
 stub-1.893
 bumped-1.884
 accidentally-1.818
?"-1.688
 tri-1.687
Token sold
Token ID2702
Feature activation-2.12e+00

Pos logit contributions
 pleasure+2.666
 generosity+2.589
 appreciation+2.477
ipp+2.451
 pride+2.398
Neg logit contributions
 bumped-1.727
 stub-1.683
 accidentally-1.632
 tri-1.479
 landed-1.479
Token all
Token ID477
Feature activation-4.52e+00

Pos logit contributions
 pleasure+4.361
 generosity+4.196
ipp+4.105
ums+3.999
 appreciation+3.972
Neg logit contributions
 bumped-2.825
 stub-2.703
 accidentally-2.675
?"-2.618
 landed-2.516
Token the
Token ID262
Feature activation-2.44e+00

Pos logit contributions
 pleasure+13.746
 generosity+13.517
ealing+13.318
ums+13.179
izza+12.834
Neg logit contributions
?"-3.278
 bumped-2.530
 accidentally-2.441
 hit-2.427
 landed-2.421
Token carrots
Token ID34397
Feature activation-1.30e+00

Pos logit contributions
 pleasure+3.742
 generosity+3.533
ipp+3.431
 appreciation+3.407
izza+3.374
Neg logit contributions
 bumped-4.739
 stub-4.670
 accidentally-4.633
?"-4.609
 landed-4.335
Token.
Token ID13
Feature activation-3.13e-01

Pos logit contributions
 pleasure+3.239
 generosity+3.149
 appreciation+3.041
ipp+2.979
 pride+2.973
Neg logit contributions
 bumped-1.231
 stub-1.202
 accidentally-1.138
?"-1.026
 landed-0.989
Token They
Token ID1119
Feature activation-1.16e+00

Pos logit contributions
 pleasure+0.535
 generosity+0.520
 appreciation+0.491
ipp+0.479
 pride+0.478
Neg logit contributions
 stub-0.537
 bumped-0.532
 accidentally-0.507
 tri-0.474
?"-0.470
Token were
Token ID547
Feature activation-2.45e+00

Pos logit contributions
 pleasure+2.303
 generosity+2.226
 appreciation+2.122
ipp+2.076
 decisions+2.044
Neg logit contributions
 bumped-1.732
 stub-1.686
 accidentally-1.653
 tri-1.505
 landed-1.501
Token were
Token ID547
Feature activation-3.53e+00

Pos logit contributions
 pleasure+4.511
 generosity+4.338
 decisions+4.187
 appreciation+4.137
 choices+4.117
Neg logit contributions
 bumped-2.799
 accidentally-2.723
 stub-2.675
 landed-2.456
?"-2.444
Token filled
Token ID5901
Feature activation-2.20e+00

Pos logit contributions
 generosity+8.232
izza+8.068
 pleasure+8.003
 decisions+7.628
apa+7.620
Neg logit contributions
 accidentally-4.304
 bumped-4.260
?"-4.185
 stub-4.026
 suddenly-3.972
Token with
Token ID351
Feature activation-3.63e+00

Pos logit contributions
 pleasure+5.923
 generosity+5.816
 pride+5.546
 appreciation+5.485
izza+5.377
Neg logit contributions
 bumped-1.647
 stub-1.615
 accidentally-1.613
?"-1.596
 suddenly-1.435
Token fresh
Token ID4713
Feature activation-2.86e+00

Pos logit contributions
ipp+6.416
 generosity+6.250
ums+6.223
 pleasure+6.197
izza+6.020
Neg logit contributions
 bumped-5.847
 accidentally-5.846
 suddenly-5.695
?"-5.634
 stub-5.620
Token cuc
Token ID38421
Feature activation-3.61e+00

Pos logit contributions
 pleasure+6.279
 generosity+6.179
 appreciation+5.929
ipp+5.844
izza+5.700
Neg logit contributions
 bumped-3.640
 accidentally-3.402
?"-3.355
 stub-3.288
 landed-3.162
Tokenumber
Token ID4494
Feature activation-4.52e+00

Pos logit contributions
 pleasure+7.145
 generosity+6.590
 appreciation+6.425
 pride+6.042
 kindness+5.993
Neg logit contributions
 bumped-6.060
 stub-6.007
 accidentally-5.691
?"-5.640
 seek-5.502
Token and
Token ID290
Feature activation-1.79e+00

Pos logit contributions
 pleasure+13.425
 pride+12.497
 generosity+12.273
 appreciation+12.229
 satisfaction+12.158
Neg logit contributions
 bumped-3.652
 accidentally-3.529
?"-3.183
 stub-3.078
 suddenly-2.987
Token cheese
Token ID9891
Feature activation-1.83e+00

Pos logit contributions
 pleasure+2.688
 generosity+2.544
ipp+2.460
 appreciation+2.425
ums+2.369
Neg logit contributions
 bumped-3.435
 stub-3.399
 accidentally-3.398
?"-3.129
 suddenly-3.124
Token.
Token ID13
Feature activation-2.08e+00

Pos logit contributions
 pleasure+4.493
 generosity+4.344
 appreciation+4.177
 pride+4.138
ipp+4.093
Neg logit contributions
 bumped-1.793
 stub-1.723
 accidentally-1.700
?"-1.527
 landed-1.477
Token It
Token ID632
Feature activation-2.14e+00

Pos logit contributions
 generosity+4.165
 pleasure+4.118
 appreciation+3.892
 pride+3.845
 choices+3.816
Neg logit contributions
 bumped-2.828
 stub-2.820
 accidentally-2.760
?"-2.681
 landed-2.535
Token made
Token ID925
Feature activation-2.92e+00

Pos logit contributions
 pleasure+5.955
 generosity+5.756
 appreciation+5.548
 pride+5.521
 choices+5.469
Neg logit contributions
 bumped-1.624
 accidentally-1.612
 stub-1.535
 landed-1.339
 suddenly-1.323
Token
Token ID198
Feature activation-4.57e-01

Pos logit contributions
 pleasure+1.645
 generosity+1.617
 appreciation+1.543
ipp+1.506
 pride+1.505
Neg logit contributions
 bumped-0.901
 stub-0.901
 accidentally-0.848
 tri-0.765
?"-0.764
Token
Token ID198
Feature activation-1.38e+00

Pos logit contributions
 pleasure+0.981
 generosity+0.962
 appreciation+0.921
ipp+0.900
 pride+0.896
Neg logit contributions
 bumped-0.578
 stub-0.578
 accidentally-0.540
 tri-0.492
?"-0.486
TokenThe
Token ID464
Feature activation-7.92e-02

Pos logit contributions
 pleasure+3.158
 generosity+3.037
 appreciation+2.871
ipp+2.854
 pride+2.840
Neg logit contributions
 bumped-1.653
 stub-1.629
 accidentally-1.599
 tri-1.427
 landed-1.408
Token woman
Token ID2415
Feature activation-1.50e+00

Pos logit contributions
 pleasure+0.170
 generosity+0.168
 appreciation+0.160
 pride+0.157
ipp+0.155
Neg logit contributions
 stub-0.102
 bumped-0.101
 accidentally-0.096
?"-0.087
 tri-0.086
Token gave
Token ID2921
Feature activation-2.24e+00

Pos logit contributions
 pleasure+3.315
 generosity+3.185
 appreciation+3.027
 pride+2.985
 choices+2.969
Neg logit contributions
 bumped-1.944
 stub-1.864
 accidentally-1.828
 tri-1.657
 landed-1.634
Token two
Token ID734
Feature activation-4.49e+00

Pos logit contributions
 pleasure+4.984
 generosity+4.860
 appreciation+4.604
izza+4.558
ipp+4.530
Neg logit contributions
 bumped-2.892
 accidentally-2.768
 stub-2.747
 landed-2.533
 tri-2.522
Token yellow
Token ID7872
Feature activation-2.38e+00

Pos logit contributions
 pleasure+11.377
 generosity+9.902
ipp+9.689
izza+9.655
 appreciation+9.548
Neg logit contributions
 bumped-6.216
 saw-5.910
 accidentally-5.873
 heard-5.552
 landed-5.469
Token le
Token ID443
Feature activation-2.85e+00

Pos logit contributions
 pleasure+4.586
 generosity+4.280
 appreciation+4.115
ipp+4.004
 pride+3.992
Neg logit contributions
 bumped-3.814
 accidentally-3.684
 stub-3.623
?"-3.361
 tri-3.352
Tokenmons
Token ID11567
Feature activation-3.32e+00

Pos logit contributions
 pleasure+5.912
 appreciation+5.566
 generosity+5.565
 pride+5.313
 decisions+5.067
Neg logit contributions
 bumped-4.310
 stub-4.015
 accidentally-3.942
 tri-3.858
?"-3.617
Token to
Token ID284
Feature activation-7.51e-01

Pos logit contributions
 pleasure+9.142
 generosity+8.625
 appreciation+8.462
 decisions+8.227
 pride+8.135
Neg logit contributions
 bumped-2.851
 accidentally-2.664
 stub-2.591
 landed-2.404
?"-2.339
Token Sam
Token ID3409
Feature activation-2.24e+00

Pos logit contributions
 pleasure+1.487
 generosity+1.453
 appreciation+1.384
ipp+1.351
 pride+1.342
Neg logit contributions
 bumped-1.073
 stub-1.069
 accidentally-1.019
 tri-0.934
?"-0.929
Token to
Token ID284
Feature activation-5.76e-01

Pos logit contributions
 pleasure+0.043
 generosity+0.042
 appreciation+0.040
ipp+0.039
 pride+0.039
Neg logit contributions
 stub-0.024
 bumped-0.023
 accidentally-0.022
 tri-0.020
Ro-0.020
Token do
Token ID466
Feature activation-1.37e-01

Pos logit contributions
 pleasure+1.073
 generosity+1.049
 appreciation+0.994
ipp+0.969
 pride+0.963
Neg logit contributions
 bumped-0.896
 stub-0.896
 accidentally-0.851
 tri-0.789
?"-0.783
Token!
Token ID0
Feature activation-1.99e+00

Pos logit contributions
 pleasure+0.313
 generosity+0.310
 appreciation+0.297
ipp+0.288
 pride+0.288
Neg logit contributions
 stub-0.152
 bumped-0.150
 accidentally-0.138
 tri-0.125
 landed-0.122
Token she
Token ID673
Feature activation-6.90e-01

Pos logit contributions
 generosity+4.144
 pleasure+4.120
 appreciation+3.937
ipp+3.774
 pride+3.771
Neg logit contributions
 stub-2.644
 accidentally-2.554
 bumped-2.553
?"-2.296
 tri-2.257
Token gathered
Token ID9272
Feature activation-3.42e+00

Pos logit contributions
 pleasure+1.508
 generosity+1.475
 appreciation+1.408
ipp+1.382
 pride+1.373
Neg logit contributions
 bumped-0.848
 stub-0.845
 accidentally-0.796
 tri-0.721
?"-0.714
Token all
Token ID477
Feature activation-4.46e+00

Pos logit contributions
 pleasure+8.248
 generosity+8.073
ipp+7.986
izza+7.960
ums+7.728
Neg logit contributions
 bumped-3.765
 accidentally-3.345
 suddenly-3.329
 stub-3.280
 landed-3.280
Token her
Token ID607
Feature activation-2.93e+00

Pos logit contributions
 pleasure+13.735
 generosity+13.591
izza+13.413
ealing+13.181
ums+12.911
Neg logit contributions
 bumped-2.882
?"-2.842
 landed-2.696
 accidentally-2.583
 stub-2.535
Token friends
Token ID2460
Feature activation-3.08e+00

Pos logit contributions
 pleasure+5.074
ipp+5.036
ums+4.767
 generosity+4.759
 pride+4.430
Neg logit contributions
 bumped-5.170
 stub-4.958
 accidentally-4.925
 landed-4.646
?"-4.593
Token together
Token ID1978
Feature activation-2.00e+00

Pos logit contributions
 pleasure+7.669
 generosity+7.289
ipp+7.167
ums+7.149
 appreciation+7.007
Neg logit contributions
 bumped-3.318
 accidentally-3.114
 stub-3.077
 suddenly-2.831
 tri-2.808
Token to
Token ID284
Feature activation-1.39e+00

Pos logit contributions
 pleasure+4.538
 generosity+4.372
ipp+4.213
 appreciation+4.168
 pride+4.125
Neg logit contributions
 bumped-2.395
 accidentally-2.280
 stub-2.274
 tri-2.039
 suddenly-2.036
Token help
Token ID1037
Feature activation-1.35e+00

Pos logit contributions
 pleasure+2.757
 generosity+2.675
ipp+2.558
 appreciation+2.538
 pride+2.492
Neg logit contributions
 bumped-1.977
 stub-1.934
 accidentally-1.901
?"-1.776
 seek-1.720
Token
Token ID198
Feature activation-5.08e-04

Pos logit contributions
 pleasure+2.044
 generosity+2.019
 appreciation+1.904
ipp+1.855
 pride+1.854
Neg logit contributions
 stub-1.364
 bumped-1.364
 accidentally-1.314
?"-1.204
 tri-1.191
Token
Token ID198
Feature activation-7.20e-01

Pos logit contributions
 pleasure+0.001
 generosity+0.001
 appreciation+0.001
ipp+0.001
 pride+0.001
Neg logit contributions
 bumped-0.001
 stub-0.001
 accidentally-0.001
 tri-0.001
 landed-0.001
TokenThe
Token ID464
Feature activation-9.59e-01

Pos logit contributions
 pleasure+1.751
 generosity+1.720
 appreciation+1.647
ipp+1.615
 pride+1.610
Neg logit contributions
 bumped-0.720
 stub-0.718
 accidentally-0.670
 tri-0.589
?"-0.583
Token pals
Token ID39898
Feature activation-7.00e-01

Pos logit contributions
 pleasure+1.951
 generosity+1.892
 appreciation+1.806
ipp+1.788
 pride+1.762
Neg logit contributions
 stub-1.349
 bumped-1.344
 accidentally-1.278
?"-1.180
 tri-1.166
Token gathered
Token ID9272
Feature activation-3.32e+00

Pos logit contributions
 pleasure+1.620
 generosity+1.590
 appreciation+1.522
ipp+1.493
 pride+1.484
Neg logit contributions
 bumped-0.771
 stub-0.766
 accidentally-0.718
 tri-0.641
?"-0.631
Token all
Token ID477
Feature activation-4.46e+00

Pos logit contributions
 pleasure+7.923
 generosity+7.766
ipp+7.572
ums+7.353
izza+7.198
Neg logit contributions
 bumped-4.081
 accidentally-3.722
 stub-3.506
 suddenly-3.454
 hit-3.426
Token of
Token ID286
Feature activation-2.83e+00

Pos logit contributions
 pleasure+13.830
 generosity+13.693
ealing+12.807
ums+12.743
izza+12.612
Neg logit contributions
?"-3.373
 bumped-3.079
 accidentally-3.003
 hit-2.935
 landed-2.757
Token the
Token ID262
Feature activation-2.53e+00

Pos logit contributions
 pleasure+6.942
 generosity+6.867
ipp+6.608
ums+6.573
 appreciation+6.431
Neg logit contributions
 accidentally-3.158
 bumped-2.867
 stub-2.641
?"-2.588
 suddenly-2.564
Token ingredients
Token ID9391
Feature activation-2.84e+00

Pos logit contributions
 pleasure+4.669
 generosity+4.441
ipp+4.217
 appreciation+4.132
 pride+4.078
Neg logit contributions
 bumped-4.425
 accidentally-4.363
 stub-4.311
?"-4.234
 suddenly-3.889
Token for
Token ID329
Feature activation-1.51e+00

Pos logit contributions
 pleasure+7.117
 generosity+6.717
 appreciation+6.444
 pride+6.371
ipp+6.320
Neg logit contributions
 bumped-3.159
 accidentally-3.018
 suddenly-2.786
?"-2.762
 stub-2.711
Token the
Token ID262
Feature activation-1.42e+00

Pos logit contributions
 pleasure+3.010
 generosity+2.916
 appreciation+2.784
ipp+2.773
 pride+2.698
Neg logit contributions
 bumped-2.143
 accidentally-2.121
 stub-2.073
 tri-1.885
 suddenly-1.872
Token to
Token ID284
Feature activation-5.69e-01

Pos logit contributions
 pleasure+3.661
 generosity+3.533
 appreciation+3.385
ipp+3.381
 pride+3.344
Neg logit contributions
 bumped-1.795
 stub-1.711
 accidentally-1.704
 tri-1.509
?"-1.507
Token help
Token ID1037
Feature activation-1.07e+00

Pos logit contributions
 pleasure+0.905
 generosity+0.881
 appreciation+0.827
ipp+0.802
 pride+0.797
Neg logit contributions
 bumped-1.040
 stub-1.040
 accidentally-0.995
 tri-0.934
?"-0.929
Token!
Token ID0
Feature activation-1.46e+00

Pos logit contributions
 pleasure+2.284
 generosity+2.215
 appreciation+2.116
ipp+2.090
 pride+2.067
Neg logit contributions
 bumped-1.382
 stub-1.359
 accidentally-1.324
?"-1.204
 tri-1.193
Token They
Token ID1119
Feature activation-1.42e+00

Pos logit contributions
 pleasure+2.498
 generosity+2.488
 appreciation+2.321
 pride+2.238
ipp+2.232
Neg logit contributions
 bumped-2.452
 stub-2.424
 accidentally-2.410
?"-2.250
 tri-2.200
Token gathered
Token ID9272
Feature activation-3.14e+00

Pos logit contributions
 pleasure+3.186
 generosity+3.074
 appreciation+2.920
ipp+2.879
 decisions+2.878
Neg logit contributions
 bumped-1.812
 accidentally-1.732
 stub-1.732
 tri-1.531
 landed-1.522
Token all
Token ID477
Feature activation-4.45e+00

Pos logit contributions
 pleasure+7.429
 generosity+7.242
ipp+7.108
ums+6.909
izza+6.740
Neg logit contributions
 bumped-3.846
 accidentally-3.419
 landed-3.224
 suddenly-3.220
 hit-3.174
Token of
Token ID286
Feature activation-2.53e+00

Pos logit contributions
 pleasure+13.916
 generosity+13.670
ums+13.005
ealing+12.977
izza+12.756
Neg logit contributions
?"-3.234
 bumped-3.056
 hit-2.916
 accidentally-2.785
 saw-2.670
Token the
Token ID262
Feature activation-1.77e+00

Pos logit contributions
 pleasure+6.146
 generosity+5.989
ums+5.789
ipp+5.785
ealing+5.713
Neg logit contributions
 accidentally-2.737
 bumped-2.718
 stub-2.432
?"-2.304
 tri-2.195
Token supplies
Token ID9416
Feature activation-2.29e+00

Pos logit contributions
 pleasure+3.330
 generosity+3.199
ipp+3.113
 appreciation+3.016
ums+2.965
Neg logit contributions
 bumped-2.895
 stub-2.784
 accidentally-2.745
?"-2.556
 tri-2.494
Token they
Token ID484
Feature activation-5.06e-01

Pos logit contributions
 pleasure+5.383
 generosity+5.163
 appreciation+5.010
ipp+4.991
ums+4.861
Neg logit contributions
 bumped-2.623
 accidentally-2.511
 stub-2.405
?"-2.294
 suddenly-2.253
Token would
Token ID561
Feature activation-1.05e+00

Pos logit contributions
 pleasure+1.127
 generosity+1.107
 appreciation+1.058
ipp+1.035
 pride+1.034
Neg logit contributions
 stub-0.597
 bumped-0.596
 accidentally-0.556
 tri-0.503
?"-0.498
Tokenily
Token ID813
Feature activation-1.27e+00

Pos logit contributions
 pleasure+0.003
 generosity+0.003
ipp+0.003
 appreciation+0.003
 pride+0.003
Neg logit contributions
 stub-0.002
 bumped-0.002
 accidentally-0.002
 tri-0.002
 landed-0.002
Token and
Token ID290
Feature activation-9.52e-02

Pos logit contributions
 pleasure+2.735
 generosity+2.630
 appreciation+2.500
 pride+2.439
ipp+2.422
Neg logit contributions
 bumped-1.714
 stub-1.674
 accidentally-1.618
 tri-1.482
 landed-1.453
Token Ben
Token ID3932
Feature activation-1.56e+00

Pos logit contributions
 pleasure+0.214
 generosity+0.213
 appreciation+0.202
ipp+0.199
 pride+0.199
Neg logit contributions
 stub-0.110
 bumped-0.109
 accidentally-0.100
?"-0.091
 tri-0.089
Token gave
Token ID2921
Feature activation-2.29e+00

Pos logit contributions
 pleasure+3.436
 generosity+3.328
 appreciation+3.157
 decisions+3.115
 choices+3.104
Neg logit contributions
 bumped-2.062
 stub-1.958
 accidentally-1.921
 landed-1.761
 tri-1.758
Token the
Token ID262
Feature activation-1.51e+00

Pos logit contributions
 pleasure+5.396
 generosity+5.272
ipp+4.985
izza+4.967
 appreciation+4.942
Neg logit contributions
 bumped-2.592
 stub-2.421
 accidentally-2.349
 landed-2.195
?"-2.192
Token man
Token ID582
Feature activation-4.44e+00

Pos logit contributions
 pleasure+3.498
 generosity+3.382
ipp+3.263
 appreciation+3.255
 pride+3.194
Neg logit contributions
 bumped-1.732
 stub-1.699
 accidentally-1.614
?"-1.483
 tri-1.453
Token some
Token ID617
Feature activation-1.72e+00

Pos logit contributions
 pleasure+13.991
 generosity+12.528
umm+12.297
ipp+12.260
omp+12.011
Neg logit contributions
 bumped-3.475
 heard-2.802
 saw-2.772
 behind-2.763
 broke-2.671
Token bread
Token ID8509
Feature activation-2.43e+00

Pos logit contributions
 pleasure+3.193
 generosity+3.052
ipp+2.977
 appreciation+2.902
ums+2.809
Neg logit contributions
 bumped-2.688
 stub-2.629
 accidentally-2.572
?"-2.426
 tri-2.371
Token.
Token ID13
Feature activation-7.02e-01

Pos logit contributions
 pleasure+6.265
 generosity+5.831
ipp+5.685
 appreciation+5.660
 pride+5.606
Neg logit contributions
 bumped-2.283
 stub-2.161
 accidentally-2.078
?"-1.998
 landed-1.913
Token They
Token ID1119
Feature activation-1.10e+00

Pos logit contributions
 pleasure+1.167
 generosity+1.140
 appreciation+1.072
ipp+1.043
 pride+1.035
Neg logit contributions
 bumped-1.226
 stub-1.225
 accidentally-1.176
 tri-1.096
?"-1.095
Token taped
Token ID36114
Feature activation-2.36e+00

Pos logit contributions
 pleasure+2.404
 generosity+2.329
 appreciation+2.225
ipp+2.171
 decisions+2.151
Neg logit contributions
 bumped-1.401
 stub-1.368
 accidentally-1.320
 tri-1.191
 landed-1.184
Token sweet
Token ID6029
Feature activation-1.52e+00

Pos logit contributions
 pleasure+3.634
 generosity+3.505
 appreciation+3.281
izza+3.249
ipp+3.248
Neg logit contributions
 bumped-3.651
 stub-3.558
 accidentally-3.519
 landed-3.252
 tri-3.231
Token.
Token ID13
Feature activation-1.26e+00

Pos logit contributions
 pleasure+3.631
 generosity+3.545
 appreciation+3.412
ipp+3.368
 pride+3.361
Neg logit contributions
 bumped-1.560
 stub-1.515
 accidentally-1.475
?"-1.333
 tri-1.282
Token He
Token ID679
Feature activation-4.52e-01

Pos logit contributions
 pleasure+2.351
 generosity+2.348
 appreciation+2.230
ipp+2.147
 pride+2.146
Neg logit contributions
 bumped-1.904
 stub-1.903
 accidentally-1.858
?"-1.690
 tri-1.684
Token quickly
Token ID2952
Feature activation-4.86e-01

Pos logit contributions
 pleasure+0.976
 generosity+0.959
 appreciation+0.916
ipp+0.897
 pride+0.892
Neg logit contributions
 stub-0.560
 bumped-0.560
 accidentally-0.523
 tri-0.475
?"-0.475
Token collected
Token ID7723
Feature activation-2.57e+00

Pos logit contributions
 pleasure+1.063
 generosity+1.044
 appreciation+0.999
ipp+0.977
 pride+0.974
Neg logit contributions
 stub-0.588
 bumped-0.587
 accidentally-0.548
?"-0.497
 tri-0.497
Token all
Token ID477
Feature activation-4.44e+00

Pos logit contributions
 pleasure+5.868
 generosity+5.762
izza+5.455
 appreciation+5.369
ums+5.313
Neg logit contributions
 bumped-3.214
 landed-2.975
 accidentally-2.975
 stub-2.892
?"-2.828
Token the
Token ID262
Feature activation-2.45e+00

Pos logit contributions
 pleasure+13.943
 generosity+13.778
izza+13.371
ealing+12.984
ums+12.860
Neg logit contributions
?"-3.248
 landed-2.986
 bumped-2.877
 accidentally-2.794
 saw-2.574
Token berries
Token ID36322
Feature activation-2.40e+00

Pos logit contributions
 pleasure+4.363
 generosity+4.260
izza+4.098
 appreciation+4.002
ipp+3.986
Neg logit contributions
 bumped-4.185
 accidentally-4.062
 stub-4.060
?"-3.852
 landed-3.833
Token he
Token ID339
Feature activation+3.20e-01

Pos logit contributions
 pleasure+6.048
 generosity+5.824
 appreciation+5.693
 pride+5.530
 decisions+5.431
Neg logit contributions
 bumped-2.448
 accidentally-2.351
 stub-2.251
 landed-2.132
?"-2.086
Token could
Token ID714
Feature activation+6.23e-02

Pos logit contributions
 stub+0.382
 bumped+0.375
 accidentally+0.339
?"+0.316
 tri+0.314
Neg logit contributions
 pleasure-0.703
 generosity-0.695
 appreciation-0.655
ipp-0.647
 pride-0.645
Token find
Token ID1064
Feature activation-1.52e+00

Pos logit contributions
 bumped+0.095
 stub+0.094
 accidentally+0.088
 tri+0.083
 landed+0.080
Neg logit contributions
 pleasure-0.118
 generosity-0.116
 appreciation-0.110
ipp-0.108
 pride-0.106
Token tw
Token ID665
Feature activation-1.90e+00

Pos logit contributions
 pleasure+0.496
 generosity+0.489
 appreciation+0.462
 pride+0.447
ipp+0.445
Neg logit contributions
 stub-0.412
 bumped-0.409
 accidentally-0.385
 tri-0.360
?"-0.358
Tokenigs
Token ID9235
Feature activation-1.24e+00

Pos logit contributions
 pleasure+3.098
 generosity+3.027
 appreciation+2.757
 pride+2.727
 choices+2.579
Neg logit contributions
 bumped-3.574
 stub-3.462
 accidentally-3.438
?"-3.318
 landed-3.254
Token.
Token ID13
Feature activation-7.20e-01

Pos logit contributions
 pleasure+3.045
 generosity+2.974
 appreciation+2.850
 pride+2.784
ipp+2.778
Neg logit contributions
 bumped-1.233
 stub-1.191
 accidentally-1.124
?"-0.994
 tri-0.989
Token He
Token ID679
Feature activation-8.49e-01

Pos logit contributions
 pleasure+1.271
 generosity+1.243
 appreciation+1.177
ipp+1.141
 pride+1.136
Neg logit contributions
 bumped-1.185
 stub-1.181
 accidentally-1.131
 tri-1.051
?"-1.046
Token gave
Token ID2921
Feature activation-8.52e-01

Pos logit contributions
 pleasure+1.911
 generosity+1.862
 appreciation+1.783
ipp+1.742
 pride+1.740
Neg logit contributions
 bumped-1.013
 stub-1.003
 accidentally-0.953
 tri-0.855
?"-0.842
Token them
Token ID606
Feature activation-4.43e+00

Pos logit contributions
 pleasure+1.800
 generosity+1.765
 appreciation+1.680
ipp+1.638
 pride+1.629
Neg logit contributions
 bumped-1.126
 stub-1.113
 accidentally-1.059
 tri-0.963
?"-0.961
Token to
Token ID284
Feature activation+4.01e-01

Pos logit contributions
 pleasure+12.897
ums+12.224
 generosity+12.179
ealing+11.865
 decisions+11.828
Neg logit contributions
 bumped-3.797
 broke-3.247
 hit-3.238
 landed-3.195
?"-3.150
Token the
Token ID262
Feature activation+9.82e-01

Pos logit contributions
 stub+0.612
 bumped+0.598
 accidentally+0.555
 tri+0.515
 seek+0.507
Neg logit contributions
 pleasure-0.779
 generosity-0.766
 appreciation-0.723
 pride-0.712
ipp-0.701
Token squirrel
Token ID33039
Feature activation-2.80e+00

Pos logit contributions
 bumped+1.502
 stub+1.446
 accidentally+1.411
?"+1.293
 tri+1.278
Neg logit contributions
 pleasure-1.922
 generosity-1.894
 appreciation-1.777
 pride-1.736
 choices-1.698
Token,
Token ID11
Feature activation-4.65e-01

Pos logit contributions
 pleasure+7.821
 generosity+7.293
 pride+7.110
 appreciation+7.093
ipp+6.924
Neg logit contributions
 accidentally-2.377
 bumped-2.363
 stub-2.205
?"-2.114
 tri-1.838
Token who
Token ID508
Feature activation-1.53e+00

Pos logit contributions
 pleasure+0.956
 generosity+0.938
 appreciation+0.893
ipp+0.869
 pride+0.868
Neg logit contributions
 stub-0.634
 bumped-0.634
 accidentally-0.595
 tri-0.546
?"-0.542
Tokenummy
Token ID13513
Feature activation-2.09e+00

Pos logit contributions
 pleasure+3.007
 generosity+2.902
 appreciation+2.589
 pride+2.378
izza+2.232
Neg logit contributions
 bumped-6.813
 stub-6.780
 accidentally-6.709
?"-6.498
 tri-6.399
Token treats
Token ID18432
Feature activation-7.18e-01

Pos logit contributions
 pleasure+3.531
 generosity+3.370
ipp+3.198
 appreciation+3.176
 pride+3.112
Neg logit contributions
 bumped-3.644
 stub-3.545
 accidentally-3.500
?"-3.405
 landed-3.266
Token.
Token ID13
Feature activation-1.63e+00

Pos logit contributions
 pleasure+1.657
 generosity+1.625
 appreciation+1.560
ipp+1.528
 pride+1.522
Neg logit contributions
 bumped-0.784
 stub-0.780
 accidentally-0.729
 tri-0.651
?"-0.648
Token They
Token ID1119
Feature activation-1.16e+00

Pos logit contributions
 generosity+2.906
 pleasure+2.856
 appreciation+2.694
ipp+2.617
 choices+2.608
Neg logit contributions
 stub-2.652
 accidentally-2.640
 bumped-2.580
?"-2.401
 tri-2.372
Token took
Token ID1718
Feature activation-3.12e+00

Pos logit contributions
 pleasure+2.335
 generosity+2.281
 appreciation+2.156
ipp+2.109
 choices+2.098
Neg logit contributions
 bumped-1.662
 stub-1.640
 accidentally-1.580
 tri-1.453
 landed-1.437
Token all
Token ID477
Feature activation-4.43e+00

Pos logit contributions
 generosity+7.219
 pleasure+6.976
ipp+6.614
ums+6.608
 appreciation+6.588
Neg logit contributions
 bumped-3.956
 accidentally-3.634
 stub-3.530
 landed-3.441
 suddenly-3.373
Token of
Token ID286
Feature activation-3.18e+00

Pos logit contributions
 generosity+13.193
 pleasure+12.834
izza+12.482
ealing+12.267
ums+12.045
Neg logit contributions
 bumped-3.738
 landed-3.448
 stub-3.277
?"-3.257
 suddenly-3.105
Token the
Token ID262
Feature activation-2.18e+00

Pos logit contributions
 generosity+8.114
 pleasure+8.043
 decisions+7.573
izza+7.541
ealing+7.495
Neg logit contributions
 accidentally-3.467
 bumped-3.269
 stub-2.974
 suddenly-2.926
?"-2.868
Token strawberries
Token ID36973
Feature activation-2.33e+00

Pos logit contributions
 pleasure+3.536
 generosity+3.445
 appreciation+3.226
izza+3.200
ipp+3.198
Neg logit contributions
 bumped-4.070
 stub-3.948
 accidentally-3.862
 landed-3.637
?"-3.632
Token inside
Token ID2641
Feature activation-1.33e+00

Pos logit contributions
 pleasure+5.439
 generosity+5.299
 appreciation+5.104
 pride+4.968
 decisions+4.908
Neg logit contributions
 bumped-2.667
 stub-2.568
 accidentally-2.530
 landed-2.356
?"-2.354
Token and
Token ID290
Feature activation-6.44e-01

Pos logit contributions
 pleasure+2.854
 generosity+2.807
 appreciation+2.658
ipp+2.633
 pride+2.608
Neg logit contributions
 bumped-1.671
 stub-1.638
 accidentally-1.612
?"-1.461
 tri-1.435
Token great
Token ID1049
Feature activation-1.91e+00

Pos logit contributions
 pleasure+0.844
 generosity+0.824
 appreciation+0.780
ipp+0.756
 pride+0.753
Neg logit contributions
 bumped-0.765
 stub-0.763
 accidentally-0.725
 tri-0.676
?"-0.670
Token dish
Token ID9433
Feature activation-1.64e+00

Pos logit contributions
 generosity+3.689
 pleasure+3.681
ipp+3.507
 pride+3.470
 appreciation+3.435
Neg logit contributions
 stub-2.738
 bumped-2.734
 accidentally-2.680
?"-2.519
 tri-2.422
Token.
Token ID13
Feature activation-8.08e-01

Pos logit contributions
 pleasure+3.569
 generosity+3.423
 appreciation+3.301
 pride+3.237
ipp+3.220
Neg logit contributions
 bumped-2.053
 stub-2.003
 accidentally-1.956
?"-1.920
 seek-1.763
Token They
Token ID1119
Feature activation-1.02e+00

Pos logit contributions
 pleasure+1.321
 generosity+1.289
 appreciation+1.210
ipp+1.183
 pride+1.168
Neg logit contributions
 bumped-1.439
 stub-1.433
 accidentally-1.385
 tri-1.290
?"-1.285
Token gathered
Token ID9272
Feature activation-3.21e+00

Pos logit contributions
 pleasure+2.365
 generosity+2.311
 appreciation+2.200
ipp+2.152
 pride+2.140
Neg logit contributions
 bumped-1.183
 stub-1.159
 accidentally-1.122
 tri-0.989
?"-0.970
Token all
Token ID477
Feature activation-4.42e+00

Pos logit contributions
 generosity+7.335
 pleasure+7.301
ipp+7.080
ums+6.998
izza+6.760
Neg logit contributions
 bumped-4.130
 accidentally-3.694
 landed-3.554
 suddenly-3.498
 stub-3.454
Token the
Token ID262
Feature activation-2.99e+00

Pos logit contributions
 generosity+13.454
 pleasure+13.239
ealing+12.512
ums+12.507
izza+12.406
Neg logit contributions
?"-3.817
 bumped-3.198
 accidentally-3.083
 landed-3.072
 hit-3.046
Token ingredients
Token ID9391
Feature activation-2.88e+00

Pos logit contributions
 pleasure+5.860
 generosity+5.693
ipp+5.314
 appreciation+5.223
ums+5.208
Neg logit contributions
 bumped-4.916
 accidentally-4.661
?"-4.624
 stub-4.515
 saw-4.078
Token and
Token ID290
Feature activation-1.29e+00

Pos logit contributions
 pleasure+7.109
 generosity+6.838
 appreciation+6.509
 pride+6.463
ipp+6.362
Neg logit contributions
 bumped-3.209
 accidentally-3.080
?"-2.893
 suddenly-2.860
 stub-2.745
Token mixed
Token ID7668
Feature activation-2.55e+00

Pos logit contributions
 pleasure+2.934
 generosity+2.839
 appreciation+2.722
ipp+2.694
 pride+2.647
Neg logit contributions
 bumped-1.559
 accidentally-1.498
 stub-1.487
 tri-1.302
 landed-1.291
Token them
Token ID606
Feature activation-2.85e+00

Pos logit contributions
 generosity+6.017
 pleasure+5.998
 pride+5.490
 appreciation+5.490
ums+5.319
Neg logit contributions
 bumped-3.019
?"-2.881
 accidentally-2.864
 suddenly-2.646
 landed-2.607
Token a
Token ID257
Feature activation+1.87e+00

Pos logit contributions
 pleasure+0.772
 generosity+0.757
 appreciation+0.721
 pride+0.703
ipp+0.700
Neg logit contributions
 stub-0.489
 bumped-0.487
 accidentally-0.456
 tri-0.417
?"-0.413
Token witch
Token ID16365
Feature activation+1.40e+00

Pos logit contributions
 bumped+3.165
 stub+3.057
 accidentally+2.813
 landed+2.775
 tri+2.750
Neg logit contributions
 pleasure-3.458
 generosity-3.318
 appreciation-3.119
 pride-3.087
 kindness-2.951
Token on
Token ID319
Feature activation+1.50e+00

Pos logit contributions
 stub+1.993
 bumped+1.975
 accidentally+1.843
Ro+1.752
Raw+1.674
Neg logit contributions
 generosity-2.666
 pleasure-2.653
 appreciation-2.476
 pride-2.429
 kinds-2.382
Token the
Token ID262
Feature activation+1.09e+00

Pos logit contributions
 stub+2.438
 bumped+2.335
 accidentally+2.190
 tri+2.130
 seek+1.999
Neg logit contributions
 pleasure-2.718
 generosity-2.621
 appreciation-2.567
 pride-2.512
 kinds-2.485
Token cover
Token ID3002
Feature activation+6.46e-01

Pos logit contributions
 bumped+1.582
 stub+1.557
 accidentally+1.497
 tri+1.344
?"+1.326
Neg logit contributions
 pleasure-2.180
 generosity-2.145
 pride-2.017
 appreciation-1.986
 choices-1.969
Token and
Token ID290
Feature activation+1.87e+00

Pos logit contributions
 stub+0.929
 bumped+0.919
 accidentally+0.866
 tri+0.798
Ro+0.793
Neg logit contributions
 pleasure-1.254
 generosity-1.243
 appreciation-1.177
ipp-1.160
 kinds-1.153
Token many
Token ID867
Feature activation+1.65e-01

Pos logit contributions
 stub+2.969
 bumped+2.798
 accidentally+2.595
?"+2.488
 tri+2.477
Neg logit contributions
 generosity-3.611
 pleasure-3.485
 appreciation-3.391
 pride-3.332
 kindness-3.223
Token strange
Token ID6283
Feature activation+5.99e-01

Pos logit contributions
 stub+0.288
 bumped+0.285
 accidentally+0.272
 tri+0.256
?"+0.253
Neg logit contributions
 pleasure-0.274
 generosity-0.272
 appreciation-0.256
 pride-0.250
 kinds-0.244
Token words
Token ID2456
Feature activation+4.30e-01

Pos logit contributions
 stub+1.179
 bumped+1.157
 accidentally+1.125
 tri+1.056
 landed+1.042
Neg logit contributions
 pleasure-0.876
 generosity-0.869
 appreciation-0.820
 choices-0.798
 kinds-0.786
Token inside
Token ID2641
Feature activation-4.45e-02

Pos logit contributions
 stub+0.514
 bumped+0.494
 accidentally+0.463
 tri+0.419
Ro+0.415
Neg logit contributions
 pleasure-0.950
 generosity-0.941
 appreciation-0.899
 pride-0.885
ipp-0.870
Token.
Token ID13
Feature activation-1.30e-02

Pos logit contributions
 pleasure+0.099
 generosity+0.098
 appreciation+0.094
 pride+0.092
ipp+0.091
Neg logit contributions
 stub-0.053
 bumped-0.052
 accidentally-0.048
 tri-0.043
 landed-0.042
Token You
Token ID921
Feature activation-1.72e+00

Pos logit contributions
 generosity+2.842
 pleasure+2.836
 appreciation+2.729
ipp+2.644
 pride+2.611
Neg logit contributions
 stub-2.039
 bumped-2.011
 accidentally-1.956
?"-1.863
 tri-1.779
Token can
Token ID460
Feature activation-6.34e-01

Pos logit contributions
 pleasure+3.553
 generosity+3.465
 appreciation+3.319
ums+3.270
ipp+3.267
Neg logit contributions
 bumped-2.498
 stub-2.366
 accidentally-2.363
?"-2.204
 landed-2.132
Token go
Token ID467
Feature activation+6.86e-01

Pos logit contributions
 pleasure+1.259
 generosity+1.233
 appreciation+1.172
ipp+1.143
 pride+1.139
Neg logit contributions
 bumped-0.907
 stub-0.907
 accidentally-0.859
 tri-0.789
?"-0.786
Token and
Token ID290
Feature activation+4.37e-01

Pos logit contributions
 stub+1.016
 bumped+0.983
 accidentally+0.934
 tri+0.882
Ro+0.860
Neg logit contributions
 pleasure-1.335
 generosity-1.283
 appreciation-1.221
ipp-1.217
 pride-1.205
Token play
Token ID711
Feature activation+1.88e+00

Pos logit contributions
 bumped+0.674
 stub+0.673
 accidentally+0.629
 tri+0.595
 landed+0.571
Neg logit contributions
 pleasure-0.821
 generosity-0.797
 appreciation-0.746
 pride-0.730
ipp-0.707
Token now
Token ID783
Feature activation+2.35e+00

Pos logit contributions
 stub+3.219
 bumped+3.099
 tri+2.808
 accidentally+2.778
 rang+2.590
Neg logit contributions
 pleasure-3.554
 generosity-3.398
 appreciation-3.287
 equal-3.231
 pride-3.191
Token.
Token ID13
Feature activation-8.59e-01

Pos logit contributions
 stub+3.872
 bumped+3.776
 accidentally+3.339
 tri+3.308
Ro+3.261
Neg logit contributions
 pleasure-4.601
 generosity-4.358
ipp-4.204
 choices-4.199
 pride-4.124
Token But
Token ID887
Feature activation+1.22e-01

Pos logit contributions
 pleasure+1.752
 generosity+1.731
 appreciation+1.654
ipp+1.618
 pride+1.602
Neg logit contributions
 stub-1.153
 bumped-1.146
 accidentally-1.089
?"-1.001
 tri-0.996
Token first
Token ID717
Feature activation+3.68e-01

Pos logit contributions
 stub+0.172
 bumped+0.171
 accidentally+0.159
 tri+0.145
 landed+0.143
Neg logit contributions
 pleasure-0.251
 generosity-0.245
 appreciation-0.232
ipp-0.226
 pride-0.224
Token,
Token ID11
Feature activation-1.31e-01

Pos logit contributions
 stub+0.531
 bumped+0.521
 accidentally+0.483
 tri+0.445
Ro+0.442
Neg logit contributions
 pleasure-0.748
 generosity-0.731
 appreciation-0.695
ipp-0.675
 choices-0.671
Token let
Token ID1309
Feature activation-7.36e-01

Pos logit contributions
 pleasure+0.277
 generosity+0.271
 appreciation+0.258
ipp+0.251
 pride+0.250
Neg logit contributions
 stub-0.174
 bumped-0.174
 accidentally-0.161
 tri-0.147
 landed-0.144
Token walking
Token ID6155
Feature activation+2.44e-01

Pos logit contributions
 stub+1.144
 bumped+1.134
 accidentally+1.063
Ro+1.052
?"+1.029
Neg logit contributions
 pleasure-1.518
 generosity-1.459
ipp-1.404
 pride-1.391
 appreciation-1.388
Token to
Token ID284
Feature activation+1.88e+00

Pos logit contributions
 stub+0.322
 bumped+0.317
 accidentally+0.300
 tri+0.273
?"+0.268
Neg logit contributions
 pleasure-0.504
 generosity-0.496
 appreciation-0.474
ipp-0.464
 pride-0.464
Token the
Token ID262
Feature activation+1.79e+00

Pos logit contributions
 bumped+3.282
 stub+3.239
 accidentally+2.879
 tri+2.823
 landed+2.759
Neg logit contributions
 pleasure-3.381
 generosity-3.187
 appreciation-3.018
 kindness-2.894
 pride-2.887
Token store
Token ID3650
Feature activation+1.38e-01

Pos logit contributions
 bumped+2.604
 stub+2.547
 accidentally+2.320
 tri+2.185
 landed+2.114
Neg logit contributions
 pleasure-3.591
 generosity-3.520
 appreciation-3.352
ipp-3.316
 pride-3.247
Token when
Token ID618
Feature activation+2.18e+00

Pos logit contributions
 stub+0.178
 bumped+0.176
 accidentally+0.162
Ro+0.151
 landed+0.150
Neg logit contributions
 pleasure-0.290
 generosity-0.287
 appreciation-0.272
ipp-0.269
 pride-0.264
Token she
Token ID673
Feature activation+2.46e+00

Pos logit contributions
 stub+2.855
 bumped+2.754
Ro+2.681
?"+2.592
 landed+2.542
Neg logit contributions
 pleasure-4.524
 generosity-4.346
 pride-4.117
 appreciation-4.036
ipp-4.025
Token saw
Token ID2497
Feature activation+9.40e-01

Pos logit contributions
Ro+2.289
 seek+2.210
 stub+2.083
Raw+2.061
 bumped+1.918
Neg logit contributions
 pleasure-5.596
ipp-5.447
apa-5.441
 generosity-5.338
 pride-5.327
Token a
Token ID257
Feature activation+7.61e-01

Pos logit contributions
 stub+1.441
 bumped+1.414
 accidentally+1.297
 seek+1.284
 tri+1.241
Neg logit contributions
 pleasure-1.762
 generosity-1.740
 appreciation-1.651
 pride-1.626
ipp-1.606
Token little
Token ID1310
Feature activation-1.44e-01

Pos logit contributions
 bumped+1.233
 stub+1.227
 accidentally+1.146
 tri+1.089
 landed+1.086
Neg logit contributions
 pleasure-1.349
 generosity-1.307
 appreciation-1.240
 pride-1.179
ipp-1.168
Token girl
Token ID2576
Feature activation+1.09e+00

Pos logit contributions
 pleasure+0.283
 generosity+0.278
 appreciation+0.263
 pride+0.255
ipp+0.254
Neg logit contributions
 bumped-0.206
 stub-0.206
 accidentally-0.194
 tri-0.179
?"-0.178
Token with
Token ID351
Feature activation-5.86e-01

Pos logit contributions
 bumped+1.359
 stub+1.322
 accidentally+1.225
Ro+1.215
 seek+1.148
Neg logit contributions
 pleasure-2.236
 generosity-2.233
 appreciation-2.151
ums-2.087
 pride-2.068
Token play
Token ID711
Feature activation+2.65e+00

Pos logit contributions
 stub+2.333
 bumped+2.331
 accidentally+2.194
 tri+2.058
 landed+1.997
Neg logit contributions
 pleasure-3.415
 generosity-3.298
 pride-3.222
 appreciation-3.220
 kinds-3.182
Token with
Token ID351
Feature activation+1.85e+00

Pos logit contributions
 stub+4.671
 bumped+4.608
 tri+4.171
 accidentally+4.103
 landed+4.004
Neg logit contributions
 pleasure-4.273
ipp-4.217
 generosity-4.182
 appreciation-4.172
ums-4.133
Token.
Token ID13
Feature activation+5.91e-01

Pos logit contributions
 stub+3.306
 bumped+3.216
 accidentally+2.983
 landed+2.941
 tri+2.910
Neg logit contributions
 generosity-2.954
 pleasure-2.909
 appreciation-2.879
 kinds-2.874
ipp-2.841
Token One
Token ID1881
Feature activation-6.45e-01

Pos logit contributions
 bumped+1.023
 stub+0.986
 accidentally+0.921
 tri+0.900
 landed+0.880
Neg logit contributions
 pleasure-1.058
 generosity-0.977
ipp-0.949
 appreciation-0.937
 pride-0.920
Token day
Token ID1110
Feature activation+9.64e-01

Pos logit contributions
 pleasure+1.131
 generosity+1.105
 appreciation+1.043
ipp+1.015
 pride+1.010
Neg logit contributions
 stub-1.076
 bumped-1.075
 accidentally-1.026
 tri-0.955
?"-0.952
Token,
Token ID11
Feature activation+2.38e+00

Pos logit contributions
 stub+1.487
 bumped+1.443
 accidentally+1.343
Ro+1.337
 seek+1.312
Neg logit contributions
 pleasure-1.731
 generosity-1.658
ipp-1.616
 appreciation-1.601
 pride-1.562
Token she
Token ID673
Feature activation+2.03e+00

Pos logit contributions
 stub+3.287
 bumped+3.267
Ro+2.996
Raw+2.929
 tri+2.852
Neg logit contributions
 pleasure-4.682
 generosity-4.455
 pride-4.289
 appreciation-4.239
ipp-4.213
Token took
Token ID1718
Feature activation+5.23e-02

Pos logit contributions
 bumped+2.394
 stub+2.343
 Ach+2.284
Ro+2.238
 seek+2.193
Neg logit contributions
 pleasure-3.896
 appreciation-3.892
 generosity-3.853
ipp-3.843
 kinds-3.819
Token her
Token ID607
Feature activation+6.05e-01

Pos logit contributions
 stub+0.070
 bumped+0.069
 accidentally+0.065
?"+0.060
 tri+0.060
Neg logit contributions
 pleasure-0.109
 generosity-0.105
 appreciation-0.102
 pride-0.101
ipp-0.099
Token hoop
Token ID44792
Feature activation+2.43e-01

Pos logit contributions
 bumped+0.833
 stub+0.815
 accidentally+0.750
?"+0.708
 tri+0.703
Neg logit contributions
 pleasure-1.233
 generosity-1.198
 appreciation-1.160
 pride-1.147
ipp-1.122
Token to
Token ID284
Feature activation+2.13e+00

Pos logit contributions
 stub+0.316
 bumped+0.314
 accidentally+0.296
?"+0.269
 tri+0.269
Neg logit contributions
 pleasure-0.494
 generosity-0.490
 appreciation-0.468
 pride-0.463
ipp-0.462
Token the
Token ID262
Feature activation+3.07e-01

Pos logit contributions
 stub+2.252
 bumped+2.169
 accidentally+2.149
 tri+1.933
?"+1.901
Neg logit contributions
 pleasure-2.446
 generosity-2.388
 appreciation-2.292
 pride-2.266
ipp-2.253
Token bottom
Token ID4220
Feature activation+6.52e-01

Pos logit contributions
 stub+0.405
 bumped+0.402
 accidentally+0.382
 tri+0.346
?"+0.333
Neg logit contributions
 pleasure-0.652
 generosity-0.632
 appreciation-0.608
ipp-0.595
 pride-0.594
Token of
Token ID286
Feature activation+1.28e+00

Pos logit contributions
 stub+1.414
 bumped+1.395
 accidentally+1.347
 tri+1.265
 landed+1.252
Neg logit contributions
 pleasure-0.776
 generosity-0.762
 appreciation-0.726
ipp-0.707
 kinds-0.688
Token the
Token ID262
Feature activation-2.89e-01

Pos logit contributions
 stub+2.215
 bumped+2.157
 accidentally+2.098
 tri+1.942
?"+1.878
Neg logit contributions
 pleasure-2.155
 generosity-2.108
 appreciation-2.068
 pride-2.003
 kinds-1.953
Token hill
Token ID12788
Feature activation-1.83e-01

Pos logit contributions
 pleasure+0.650
 generosity+0.636
 appreciation+0.612
ipp+0.598
 pride+0.595
Neg logit contributions
 stub-0.343
 bumped-0.341
 accidentally-0.321
 tri-0.288
?"-0.280
Token.
Token ID13
Feature activation+2.03e+00

Pos logit contributions
 pleasure+0.393
 generosity+0.387
 appreciation+0.371
ipp+0.364
 pride+0.360
Neg logit contributions
 stub-0.230
 bumped-0.227
 accidentally-0.213
 tri-0.193
?"-0.190
Token She
Token ID1375
Feature activation-1.51e-01

Pos logit contributions
 stub+4.314
 bumped+4.219
 accidentally+4.167
 seek+3.984
 tri+3.892
Neg logit contributions
 pleasure-2.749
 generosity-2.473
 appreciation-2.428
ums-2.382
ipp-2.318
Token is
Token ID318
Feature activation+8.10e-01

Pos logit contributions
 pleasure+0.286
 generosity+0.279
 appreciation+0.268
ipp+0.261
 pride+0.260
Neg logit contributions
 stub-0.223
 bumped-0.219
 accidentally-0.210
 seek-0.191
?"-0.191
Token dirty
Token ID11841
Feature activation+8.12e-01

Pos logit contributions
 stub+1.278
 bumped+1.251
 accidentally+1.231
Ro+1.180
?"+1.136
Neg logit contributions
 pleasure-1.427
 generosity-1.352
 pride-1.334
 appreciation-1.319
ipp-1.281
Token and
Token ID290
Feature activation+6.90e-01

Pos logit contributions
 stub+1.390
 bumped+1.355
 accidentally+1.297
Ro+1.216
 landed+1.181
Neg logit contributions
 pleasure-1.349
 generosity-1.326
ipp-1.254
 choices-1.250
 appreciation-1.244
Token has
Token ID468
Feature activation+2.64e-01

Pos logit contributions
 stub+1.025
 bumped+1.010
 accidentally+0.980
?"+0.894
Ro+0.880
Neg logit contributions
 pleasure-1.281
 generosity-1.268
 appreciation-1.200
 pride-1.191
 choices-1.168
Token.
Token ID13
Feature activation+9.06e-02

Pos logit contributions
 pleasure+0.808
 generosity+0.792
 appreciation+0.762
ipp+0.744
 pride+0.741
Neg logit contributions
 stub-0.408
 bumped-0.408
 accidentally-0.380
 tri-0.339
?"-0.335
Token She
Token ID1375
Feature activation+1.62e+00

Pos logit contributions
 bumped+0.134
 stub+0.134
 accidentally+0.125
?"+0.116
 tri+0.116
Neg logit contributions
 pleasure-0.177
 generosity-0.171
 appreciation-0.161
ipp-0.159
 pride-0.157
Token saw
Token ID2497
Feature activation-3.67e-01

Pos logit contributions
 stub+1.817
 bumped+1.672
 Ach+1.578
?"+1.550
 accidentally+1.508
Neg logit contributions
 pleasure-3.463
 generosity-3.455
 appreciation-3.392
ipp-3.260
 pride-3.252
Token a
Token ID257
Feature activation+6.37e-01

Pos logit contributions
 pleasure+0.761
 generosity+0.747
 appreciation+0.712
ipp+0.694
 pride+0.693
Neg logit contributions
 bumped-0.490
 stub-0.490
 accidentally-0.460
 tri-0.420
?"-0.417
Token small
Token ID1402
Feature activation-3.29e-01

Pos logit contributions
 bumped+1.066
 stub+1.061
 accidentally+0.995
 tri+0.935
 landed+0.927
Neg logit contributions
 pleasure-1.104
 generosity-1.065
 appreciation-1.017
 pride-0.950
ipp-0.950
Token path
Token ID3108
Feature activation+5.60e-01

Pos logit contributions
 pleasure+0.587
 generosity+0.574
 appreciation+0.543
ipp+0.525
 pride+0.523
Neg logit contributions
 bumped-0.530
 stub-0.530
 accidentally-0.503
 tri-0.467
?"-0.462
Token that
Token ID326
Feature activation+1.20e+00

Pos logit contributions
 stub+0.734
 bumped+0.708
 accidentally+0.683
Ro+0.619
?"+0.609
Neg logit contributions
 pleasure-1.130
 generosity-1.126
 appreciation-1.074
 kinds-1.044
ipp-1.042
Token lead
Token ID1085
Feature activation-6.68e-03

Pos logit contributions
 stub+1.422
 bumped+1.375
 accidentally+1.306
?"+1.238
Ro+1.185
Neg logit contributions
 pleasure-2.550
 generosity-2.507
 kinds-2.408
 appreciation-2.405
 pride-2.349
Token to
Token ID284
Feature activation+1.36e+00

Pos logit contributions
 pleasure+0.014
 generosity+0.014
 appreciation+0.014
 pride+0.013
ipp+0.013
Neg logit contributions
 stub-0.008
 bumped-0.008
 accidentally-0.008
?"-0.007
 tri-0.007
Token a
Token ID257
Feature activation+1.21e+00

Pos logit contributions
 stub+2.242
 bumped+2.139
 accidentally+2.031
?"+1.913
 tri+1.830
Neg logit contributions
 generosity-2.532
 pleasure-2.528
 appreciation-2.378
 kinds-2.287
 kindness-2.286
Token river
Token ID7850
Feature activation+7.94e-03

Pos logit contributions
 stub+1.999
 bumped+1.983
 accidentally+1.846
?"+1.733
 tri+1.732
Neg logit contributions
 pleasure-2.238
 generosity-2.102
 appreciation-2.044
 kindness-1.864
 pride-1.864
Token thought
Token ID1807
Feature activation-1.40e-01

Pos logit contributions
 pleasure+0.318
 generosity+0.312
 appreciation+0.299
ipp+0.295
 pride+0.293
Neg logit contributions
 stub-0.171
 bumped-0.168
 accidentally-0.159
?"-0.143
 tri-0.142
Token his
Token ID465
Feature activation+8.28e-02

Pos logit contributions
 pleasure+0.289
 generosity+0.283
 appreciation+0.270
ipp+0.264
 pride+0.264
Neg logit contributions
 stub-0.186
 bumped-0.185
 accidentally-0.172
 tri-0.158
?"-0.156
Token sled
Token ID46822
Feature activation+1.57e-01

Pos logit contributions
 stub+0.133
 bumped+0.131
 accidentally+0.127
 tri+0.116
 landed+0.113
Neg logit contributions
 pleasure-0.153
 generosity-0.146
 appreciation-0.141
ipp-0.136
 pride-0.135
Token was
Token ID373
Feature activation-2.84e-01

Pos logit contributions
 stub+0.223
 bumped+0.215
 accidentally+0.208
?"+0.190
 tri+0.190
Neg logit contributions
 pleasure-0.306
 generosity-0.301
ipp-0.285
 appreciation-0.284
 pride-0.278
Token better
Token ID1365
Feature activation+1.72e-01

Pos logit contributions
 pleasure+0.483
 generosity+0.467
 appreciation+0.441
ipp+0.429
 pride+0.428
Neg logit contributions
 stub-0.488
 bumped-0.485
 accidentally-0.462
 tri-0.432
?"-0.426
Token and
Token ID290
Feature activation+1.15e-01

Pos logit contributions
 stub+0.266
 bumped+0.262
 accidentally+0.246
Ro+0.228
 landed+0.227
Neg logit contributions
 pleasure-0.324
 generosity-0.318
 appreciation-0.302
ipp-0.294
 pride-0.290
Token more
Token ID517
Feature activation-1.59e-01

Pos logit contributions
 stub+0.157
 bumped+0.153
 accidentally+0.143
?"+0.133
 tri+0.131
Neg logit contributions
 pleasure-0.239
 generosity-0.237
 appreciation-0.224
 pride-0.218
ipp-0.215
Token fun
Token ID1257
Feature activation+1.67e-01

Pos logit contributions
 pleasure+0.312
 generosity+0.304
 appreciation+0.290
ipp+0.283
 pride+0.282
Neg logit contributions
 stub-0.232
 bumped-0.230
 accidentally-0.218
 tri-0.200
?"-0.197
Token.
Token ID13
Feature activation+9.66e-02

Pos logit contributions
 stub+0.208
 bumped+0.205
 accidentally+0.192
 tri+0.171
?"+0.171
Neg logit contributions
 pleasure-0.363
 generosity-0.358
 appreciation-0.342
 pride-0.333
ipp-0.332
Token He
Token ID679
Feature activation+2.36e-01

Pos logit contributions
 bumped+0.143
 stub+0.142
 accidentally+0.131
 tri+0.123
 landed+0.120
Neg logit contributions
 pleasure-0.191
 generosity-0.185
 appreciation-0.177
ipp-0.172
 pride-0.172
Token did
Token ID750
Feature activation+2.55e-02

Pos logit contributions
 stub+0.284
 bumped+0.274
 accidentally+0.259
?"+0.238
 seek+0.233
Neg logit contributions
 pleasure-0.508
 generosity-0.500
 appreciation-0.481
ipp-0.478
 pride-0.473
Token Henry
Token ID8616
Feature activation-1.42e+00

Pos logit contributions
 generosity+3.817
 pleasure+3.743
 appreciation+3.573
ipp+3.557
 choices+3.493
Neg logit contributions
 stub-2.448
 bumped-2.407
 accidentally-2.344
?"-2.274
 suddenly-2.113
Token was
Token ID373
Feature activation-9.41e-01

Pos logit contributions
 pleasure+3.588
 generosity+3.528
 appreciation+3.359
ipp+3.299
 choices+3.290
Neg logit contributions
 bumped-1.386
 stub-1.343
 accidentally-1.260
 tri-1.122
 landed-1.083
Token so
Token ID523
Feature activation-2.64e+00

Pos logit contributions
 pleasure+1.638
 generosity+1.621
 appreciation+1.517
ipp+1.484
izza+1.455
Neg logit contributions
 bumped-1.586
 stub-1.578
 accidentally-1.509
 tri-1.409
?"-1.392
Token happy
Token ID3772
Feature activation-6.60e-02

Pos logit contributions
 generosity+4.413
 pleasure+4.303
ipp+4.035
izza+4.014
 appreciation+3.930
Neg logit contributions
 bumped-4.930
 stub-4.836
 accidentally-4.572
 seek-4.419
 tri-4.398
Token that
Token ID326
Feature activation-8.16e-01

Pos logit contributions
 pleasure+0.141
 generosity+0.139
 appreciation+0.133
ipp+0.129
 pride+0.128
Neg logit contributions
 stub-0.084
 bumped-0.084
 accidentally-0.078
 tri-0.070
 landed-0.069
Token he
Token ID339
Feature activation+8.43e-01

Pos logit contributions
 pleasure+1.903
 generosity+1.869
 appreciation+1.794
ipp+1.768
 pride+1.750
Neg logit contributions
 bumped-0.887
 stub-0.886
 accidentally-0.827
?"-0.739
 tri-0.735
Token had
Token ID550
Feature activation+7.90e-02

Pos logit contributions
 stub+1.060
 bumped+1.014
 accidentally+0.948
?"+0.891
Ro+0.864
Neg logit contributions
 pleasure-1.763
 generosity-1.720
 pride-1.661
 appreciation-1.651
ipp-1.639
Token an
Token ID281
Feature activation-3.19e+00

Pos logit contributions
 stub+0.120
 bumped+0.118
 accidentally+0.112
?"+0.104
 tri+0.103
Neg logit contributions
 pleasure-0.151
 generosity-0.146
 appreciation-0.141
 pride-0.138
ipp-0.136
Token idea
Token ID2126
Feature activation+5.94e-01

Pos logit contributions
 generosity+5.147
ipp+5.039
 pleasure+5.011
ums+4.959
izza+4.900
Neg logit contributions
?"-5.568
 bumped-5.456
 accidentally-5.211
 stub-5.185
 hit-4.955
Token to
Token ID284
Feature activation+1.68e-01

Pos logit contributions
 stub+0.761
 bumped+0.735
 accidentally+0.701
Ro+0.647
 tri+0.620
Neg logit contributions
 pleasure-1.276
 generosity-1.254
 appreciation-1.217
 pride-1.168
ipp-1.151
Token test
Token ID1332
Feature activation-1.33e-01

Pos logit contributions
 stub+0.257
 bumped+0.252
 accidentally+0.240
 tri+0.221
 landed+0.217
Neg logit contributions
 pleasure-0.323
 generosity-0.314
 appreciation-0.301
 pride-0.291
ipp-0.289
Token was
Token ID373
Feature activation+4.47e-01

Pos logit contributions
 stub+1.154
 bumped+1.120
 accidentally+1.031
?"+1.012
Ro+1.005
Neg logit contributions
 generosity-1.887
 appreciation-1.870
 pleasure-1.864
ipp-1.827
 pride-1.807
Token so
Token ID523
Feature activation-1.58e-01

Pos logit contributions
 bumped+0.719
 stub+0.711
 accidentally+0.684
Ro+0.647
?"+0.635
Neg logit contributions
 pleasure-0.789
 generosity-0.756
 appreciation-0.734
 pride-0.724
ipp-0.700
Token happy
Token ID3772
Feature activation+8.24e-01

Pos logit contributions
 pleasure+0.209
 generosity+0.201
 appreciation+0.189
 pride+0.181
ipp+0.180
Neg logit contributions
 bumped-0.332
 stub-0.330
 accidentally-0.320
 tri-0.302
?"-0.300
Token.
Token ID13
Feature activation+2.54e-01

Pos logit contributions
 stub+1.293
 bumped+1.269
 accidentally+1.181
Ro+1.126
 landed+1.077
Neg logit contributions
 generosity-1.519
 pleasure-1.508
 appreciation-1.454
 choices-1.393
 kinds-1.371
Token She
Token ID1375
Feature activation+1.65e+00

Pos logit contributions
 bumped+0.384
 stub+0.378
 accidentally+0.354
 tri+0.334
 seek+0.328
Neg logit contributions
 pleasure-0.488
 generosity-0.467
 appreciation-0.447
ipp-0.444
 pride-0.444
Token then
Token ID788
Feature activation+1.43e+00

Pos logit contributions
 stub+2.688
 bumped+2.615
 accidentally+2.393
 seek+2.387
Ro+2.376
Neg logit contributions
 generosity-2.721
 pleasure-2.703
 pride-2.684
 appreciation-2.667
ipp-2.473
Token realized
Token ID6939
Feature activation+1.09e+00

Pos logit contributions
 stub+1.809
 bumped+1.709
 accidentally+1.623
Ro+1.614
 seek+1.573
Neg logit contributions
 generosity-2.864
 pleasure-2.862
 appreciation-2.789
 pride-2.789
ipp-2.658
Token that
Token ID326
Feature activation+4.17e-01

Pos logit contributions
 stub+1.820
 bumped+1.757
 accidentally+1.614
 seek+1.548
Ro+1.547
Neg logit contributions
 generosity-1.990
 pleasure-1.939
 appreciation-1.884
 pride-1.800
 kindness-1.781
Token the
Token ID262
Feature activation+2.60e-01

Pos logit contributions
 bumped+0.553
 stub+0.544
 accidentally+0.507
 tri+0.461
?"+0.459
Neg logit contributions
 generosity-0.872
 pleasure-0.870
 appreciation-0.820
 pride-0.802
 choices-0.786
Token small
Token ID1402
Feature activation+4.89e-01

Pos logit contributions
 bumped+0.371
 stub+0.361
 accidentally+0.346
 tri+0.318
?"+0.316
Neg logit contributions
 pleasure-0.528
 generosity-0.518
 appreciation-0.491
 pride-0.479
 choices-0.466
Token writing
Token ID3597
Feature activation+4.68e-01

Pos logit contributions
 bumped+0.581
 stub+0.568
 accidentally+0.533
 tri+0.484
?"+0.475
Neg logit contributions
 pleasure-1.088
 generosity-1.080
 appreciation-1.032
 kinds-0.997
 pride-0.997
Token ahead
Token ID4058
Feature activation-1.01e-01

Pos logit contributions
 stub+0.339
 bumped+0.337
 accidentally+0.319
?"+0.297
 tri+0.294
Neg logit contributions
 pleasure-0.421
 generosity-0.416
 appreciation-0.395
 pride-0.381
ipp-0.376
Token.
Token ID13
Feature activation+7.94e-01

Pos logit contributions
 pleasure+0.198
 generosity+0.196
 appreciation+0.188
 pride+0.181
ipp+0.181
Neg logit contributions
 stub-0.144
 bumped-0.143
 accidentally-0.135
 tri-0.124
?"-0.123
Token It
Token ID632
Feature activation+1.42e+00

Pos logit contributions
 bumped+1.445
 stub+1.406
 accidentally+1.328
 seek+1.291
 landed+1.283
Neg logit contributions
 pleasure-1.319
 generosity-1.217
 appreciation-1.156
 pride-1.150
ipp-1.148
Token was
Token ID373
Feature activation+4.37e-01

Pos logit contributions
 stub+1.792
 bumped+1.691
?"+1.599
 accidentally+1.551
Ro+1.469
Neg logit contributions
 generosity-2.906
 pleasure-2.873
 appreciation-2.850
 kinds-2.734
 pride-2.709
Token a
Token ID257
Feature activation+9.16e-01

Pos logit contributions
 stub+0.626
 bumped+0.623
 accidentally+0.577
?"+0.544
 tri+0.527
Neg logit contributions
 pleasure-0.870
 generosity-0.849
 appreciation-0.810
 pride-0.789
ipp-0.773
Token big
Token ID1263
Feature activation+2.64e-01

Pos logit contributions
 bumped+1.484
 stub+1.465
 accidentally+1.380
?"+1.318
 tri+1.294
Neg logit contributions
 pleasure-1.654
 generosity-1.590
 appreciation-1.512
 pride-1.433
 kindness-1.390
Token blue
Token ID4171
Feature activation+6.80e-01

Pos logit contributions
 stub+0.378
 bumped+0.376
 accidentally+0.352
 tri+0.323
 landed+0.317
Neg logit contributions
 pleasure-0.527
 generosity-0.512
 appreciation-0.485
 pride-0.470
 choices-0.459
Token towel
Token ID24808
Feature activation+3.45e-01

Pos logit contributions
 bumped+1.132
 stub+1.115
 accidentally+1.063
 tri+0.981
 landed+0.964
Neg logit contributions
 pleasure-1.209
 generosity-1.166
 appreciation-1.111
ipp-1.077
 pride-1.058
Token on
Token ID319
Feature activation-1.43e-01

Pos logit contributions
 stub+0.457
 bumped+0.457
 accidentally+0.425
 tri+0.387
?"+0.385
Neg logit contributions
 pleasure-0.696
 generosity-0.683
 appreciation-0.649
ipp-0.647
 pride-0.631
Token the
Token ID262
Feature activation+4.39e-02

Pos logit contributions
 pleasure+0.286
 generosity+0.280
 appreciation+0.268
 pride+0.261
ipp+0.261
Neg logit contributions
 stub-0.202
 bumped-0.200
 accidentally-0.187
 tri-0.174
?"-0.170
Token ground
Token ID2323
Feature activation+4.79e-01

Pos logit contributions
 bumped+0.054
 stub+0.053
 accidentally+0.050
 tri+0.045
?"+0.045
Neg logit contributions
 pleasure-0.097
 generosity-0.094
 appreciation-0.090
 pride-0.088
ipp-0.088
Token.
Token ID13
Feature activation-1.32e+00

Pos logit contributions
 pleasure+1.366
 generosity+1.340
 appreciation+1.281
ipp+1.255
 pride+1.248
Neg logit contributions
 bumped-0.758
 stub-0.756
 accidentally-0.710
 tri-0.642
?"-0.638
Token
Token ID198
Feature activation-7.14e-01

Pos logit contributions
 pleasure+2.612
 generosity+2.600
 appreciation+2.466
 pride+2.394
ipp+2.381
Neg logit contributions
 bumped-1.872
 stub-1.851
 accidentally-1.801
?"-1.679
 tri-1.610
Token
Token ID198
Feature activation-1.23e+00

Pos logit contributions
 pleasure+1.573
 generosity+1.545
 appreciation+1.473
ipp+1.441
 pride+1.437
Neg logit contributions
 bumped-0.864
 stub-0.863
 accidentally-0.810
?"-0.733
 tri-0.730
TokenDave
Token ID27984
Feature activation-1.04e+00

Pos logit contributions
 pleasure+2.469
 generosity+2.403
 appreciation+2.267
 pride+2.212
ipp+2.183
Neg logit contributions
 bumped-1.765
 stub-1.751
 accidentally-1.685
?"-1.560
 tri-1.523
Token looked
Token ID3114
Feature activation-9.55e-01

Pos logit contributions
 pleasure+2.507
 generosity+2.442
 appreciation+2.330
 pride+2.282
ipp+2.280
Neg logit contributions
 bumped-1.107
 stub-1.087
 accidentally-1.040
 tri-0.923
 landed-0.900
Token at
Token ID379
Feature activation-6.71e-01

Pos logit contributions
 pleasure+1.978
 generosity+1.953
 appreciation+1.845
ipp+1.826
 pride+1.780
Neg logit contributions
 bumped-1.295
 stub-1.276
 accidentally-1.220
 tri-1.111
?"-1.105
Token Ms
Token ID6997
Feature activation-1.68e-01

Pos logit contributions
 pleasure+1.395
 generosity+1.368
 appreciation+1.303
ipp+1.275
 pride+1.267
Neg logit contributions
 bumped-0.897
 stub-0.895
 accidentally-0.846
 tri-0.771
?"-0.769
Token.
Token ID13
Feature activation+1.24e+00

Pos logit contributions
 pleasure+0.386
 generosity+0.382
 appreciation+0.364
ipp+0.358
 pride+0.354
Neg logit contributions
 stub-0.190
 bumped-0.187
 accidentally-0.174
 tri-0.156
 landed-0.153
Token Sarah
Token ID10490
Feature activation-9.61e-01

Pos logit contributions
 stub+2.130
 bumped+2.083
 accidentally+1.964
 landed+1.923
?"+1.902
Neg logit contributions
 pleasure-2.196
 generosity-2.082
 appreciation-1.957
ipp-1.905
 pride-1.888
Token with
Token ID351
Feature activation-2.03e+00

Pos logit contributions
 pleasure+1.994
 generosity+1.947
 appreciation+1.858
ipp+1.817
 pride+1.808
Neg logit contributions
 bumped-1.297
 stub-1.284
 accidentally-1.231
 tri-1.125
?"-1.105
Token a
Token ID257
Feature activation-1.13e+00

Pos logit contributions
 pleasure+4.301
 generosity+4.155
ipp+4.130
 choices+3.983
izza+3.975
Neg logit contributions
 bumped-2.688
 stub-2.610
 accidentally-2.513
 landed-2.323
 suddenly-2.311
Token the
Token ID262
Feature activation+5.44e-01

Pos logit contributions
 pleasure+1.688
 generosity+1.647
 appreciation+1.570
ipp+1.536
 pride+1.527
Neg logit contributions
 bumped-1.072
 stub-1.062
 accidentally-1.011
 tri-0.921
?"-0.914
Token mess
Token ID2085
Feature activation-1.04e+00

Pos logit contributions
 bumped+0.671
 stub+0.656
 accidentally+0.628
 tri+0.569
Ro+0.568
Neg logit contributions
 pleasure-1.165
 generosity-1.150
 appreciation-1.119
 pride-1.090
 choices-1.085
Token and
Token ID290
Feature activation+1.45e-01

Pos logit contributions
 pleasure+2.138
 generosity+2.076
 appreciation+1.980
ipp+1.933
 pride+1.919
Neg logit contributions
 bumped-1.432
 stub-1.411
 accidentally-1.363
 tri-1.242
?"-1.233
Token Lily
Token ID20037
Feature activation-9.99e-01

Pos logit contributions
 stub+0.201
 bumped+0.198
 accidentally+0.183
?"+0.176
 tri+0.170
Neg logit contributions
 pleasure-0.291
 generosity-0.287
 appreciation-0.274
 pride-0.268
ipp-0.265
Token learned
Token ID4499
Feature activation-1.51e-01

Pos logit contributions
 pleasure+2.320
 generosity+2.274
 appreciation+2.165
 pride+2.106
ipp+2.105
Neg logit contributions
 bumped-1.142
 stub-1.130
 accidentally-1.068
 tri-0.955
 landed-0.927
Token to
Token ID284
Feature activation-4.16e-01

Pos logit contributions
 pleasure+0.310
 generosity+0.307
 appreciation+0.295
ipp+0.285
 pride+0.285
Neg logit contributions
 stub-0.207
 bumped-0.205
 accidentally-0.192
 tri-0.177
 landed-0.173
Token be
Token ID307
Feature activation-6.78e-01

Pos logit contributions
 pleasure+0.809
 generosity+0.792
 appreciation+0.756
ipp+0.736
 pride+0.731
Neg logit contributions
 stub-0.610
 bumped-0.610
 accidentally-0.576
 tri-0.531
?"-0.525
Token more
Token ID517
Feature activation-6.90e-01

Pos logit contributions
 pleasure+0.876
 generosity+0.846
 appreciation+0.780
ipp+0.754
 pride+0.744
Neg logit contributions
 bumped-1.443
 stub-1.440
 accidentally-1.391
 tri-1.317
?"-1.311
Token careful
Token ID8161
Feature activation-1.11e+00

Pos logit contributions
 pleasure+0.691
 generosity+0.661
 appreciation+0.593
ipp+0.565
 pride+0.557
Neg logit contributions
 bumped-1.682
 stub-1.681
 accidentally-1.630
 tri-1.554
?"-1.547
Token when
Token ID618
Feature activation+1.32e+00

Pos logit contributions
 pleasure+2.734
 generosity+2.665
 appreciation+2.559
ipp+2.524
 pride+2.508
Neg logit contributions
 bumped-1.073
 stub-1.072
 accidentally-0.989
?"-0.884
 tri-0.873
Token playing
Token ID2712
Feature activation+2.21e+00

Pos logit contributions
 stub+2.007
 bumped+1.987
Ro+1.846
 accidentally+1.777
?"+1.763
Neg logit contributions
 generosity-2.477
 pleasure-2.467
 appreciation-2.309
 choices-2.303
 decisions-2.241
Token in
Token ID287
Feature activation-7.07e-01

Pos logit contributions
 bumped+2.404
 stub+2.391
 accidentally+2.154
 tri+2.080
 landed+2.052
Neg logit contributions
 generosity-2.838
 pleasure-2.719
ipp-2.682
 appreciation-2.638
ums-2.605
Token the
Token ID262
Feature activation+3.93e-01

Pos logit contributions
 pleasure+1.491
 generosity+1.461
 appreciation+1.392
ipp+1.367
 pride+1.355
Neg logit contributions
 stub-0.929
 bumped-0.927
 accidentally-0.873
 tri-0.796
?"-0.795
Token sunshine
Token ID34488
Feature activation-9.22e-02

Pos logit contributions
 bumped+0.625
 stub+0.592
 accidentally+0.579
 tri+0.536
?"+0.515
Neg logit contributions
 pleasure-0.730
 generosity-0.698
 appreciation-0.672
ipp-0.667
 pride-0.663
Token and
Token ID290
Feature activation+6.75e-01

Pos logit contributions
 pleasure+0.209
 generosity+0.206
 appreciation+0.197
ipp+0.194
 pride+0.192
Neg logit contributions
 bumped-0.104
 stub-0.103
 accidentally-0.096
 tri-0.087
 landed-0.084
Token eat
Token ID4483
Feature activation-3.70e-01

Pos logit contributions
 bumped+1.022
 stub+1.006
 accidentally+0.956
 tri+0.893
?"+0.865
Neg logit contributions
 pleasure-1.262
 generosity-1.248
 appreciation-1.180
 pride-1.140
 kinds-1.113
Token y
Token ID331
Feature activation-2.14e+00

Pos logit contributions
 pleasure+0.502
 generosity+0.488
 appreciation+0.453
ipp+0.437
 pride+0.433
Neg logit contributions
 bumped-0.764
 stub-0.763
 accidentally-0.733
 tri-0.694
?"-0.683
Tokenummy
Token ID13513
Feature activation-1.98e+00

Pos logit contributions
 pleasure+1.964
 generosity+1.890
 appreciation+1.628
 pride+1.529
izza+1.394
Neg logit contributions
 stub-5.358
 bumped-5.310
 accidentally-5.174
?"-5.102
 tri-4.974
Token carrots
Token ID34397
Feature activation-7.50e-01

Pos logit contributions
 pleasure+3.269
 generosity+3.107
 appreciation+2.930
ipp+2.920
 pride+2.857
Neg logit contributions
 bumped-3.593
 stub-3.514
 accidentally-3.383
?"-3.380
 landed-3.202
Token.
Token ID13
Feature activation-4.69e-01

Pos logit contributions
 pleasure+1.836
 generosity+1.800
 appreciation+1.731
ipp+1.696
 pride+1.692
Neg logit contributions
 bumped-0.722
 stub-0.719
 accidentally-0.665
?"-0.588
 tri-0.582
Token One
Token ID1881
Feature activation-1.06e+00

Pos logit contributions
 pleasure+0.934
 generosity+0.912
 appreciation+0.868
ipp+0.851
 pride+0.843
Neg logit contributions
 bumped-0.669
 stub-0.665
 accidentally-0.630
 tri-0.580
?"-0.573
Token day
Token ID1110
Feature activation+7.74e-01

Pos logit contributions
 pleasure+1.968
 generosity+1.930
 appreciation+1.824
 pride+1.783
ipp+1.778
Neg logit contributions
 stub-1.692
 bumped-1.667
 accidentally-1.599
?"-1.504
 tri-1.479
Token<|endoftext|>
Token ID50256
Feature activation+9.62e-01

Pos logit contributions
 bumped+0.316
 stub+0.313
 accidentally+0.296
 tri+0.276
 landed+0.269
Neg logit contributions
 pleasure-0.379
 generosity-0.362
 appreciation-0.347
 pride-0.337
ipp-0.335
TokenTom
Token ID13787
Feature activation-3.77e-01

Pos logit contributions
 bumped+1.195
 stub+1.192
 accidentally+1.148
 tri+1.021
 seek+0.964
Neg logit contributions
 pleasure-2.045
 generosity-1.970
ipp-1.949
 appreciation-1.923
 pride-1.886
Token and
Token ID290
Feature activation-4.67e-01

Pos logit contributions
 pleasure+0.764
 generosity+0.748
 appreciation+0.713
ipp+0.700
 pride+0.695
Neg logit contributions
 stub-0.519
 bumped-0.517
 accidentally-0.486
 tri-0.446
?"-0.437
Token Sue
Token ID26089
Feature activation-1.05e+00

Pos logit contributions
 pleasure+0.986
 generosity+0.966
 appreciation+0.923
ipp+0.899
 pride+0.899
Neg logit contributions
 bumped-0.612
 stub-0.611
 accidentally-0.574
 tri-0.525
?"-0.517
Token are
Token ID389
Feature activation-1.92e+00

Pos logit contributions
 pleasure+2.011
 generosity+1.972
 appreciation+1.865
ipp+1.800
 pride+1.792
Neg logit contributions
 bumped-1.606
 stub-1.581
 accidentally-1.520
?"-1.427
 tri-1.409
Token friends
Token ID2460
Feature activation-9.40e-01

Pos logit contributions
 generosity+4.128
 pleasure+4.102
 appreciation+3.842
 decisions+3.742
izza+3.737
Neg logit contributions
 stub-2.484
 bumped-2.465
 accidentally-2.421
?"-2.328
 tri-2.207
Token.
Token ID13
Feature activation-6.51e-01

Pos logit contributions
 pleasure+2.357
 generosity+2.319
 appreciation+2.223
ipp+2.180
 pride+2.172
Neg logit contributions
 stub-0.857
 bumped-0.855
 accidentally-0.783
?"-0.710
 tri-0.681
Token They
Token ID1119
Feature activation-1.22e+00

Pos logit contributions
 pleasure+1.200
 generosity+1.175
 appreciation+1.112
ipp+1.084
 pride+1.078
Neg logit contributions
 bumped-1.020
 stub-1.018
 accidentally-0.970
 tri-0.898
?"-0.896
Token like
Token ID588
Feature activation-2.49e-01

Pos logit contributions
 pleasure+2.437
 generosity+2.407
 appreciation+2.290
ipp+2.209
 decisions+2.196
Neg logit contributions
 bumped-1.734
 stub-1.711
 accidentally-1.642
?"-1.553
 tri-1.507
Token to
Token ID284
Feature activation+5.22e-01

Pos logit contributions
 pleasure+0.560
 generosity+0.552
 appreciation+0.527
ipp+0.515
 pride+0.513
Neg logit contributions
 bumped-0.288
 stub-0.286
 accidentally-0.267
 tri-0.241
?"-0.236
Token play
Token ID711
Feature activation+1.94e+00

Pos logit contributions
 bumped+0.628
 stub+0.627
 accidentally+0.583
 tri+0.538
 landed+0.518
Neg logit contributions
 pleasure-1.146
 generosity-1.124
 appreciation-1.079
ipp-1.049
 pride-1.049
Token so
Token ID523
Feature activation+1.36e+00

Pos logit contributions
 pleasure+2.286
 generosity+2.248
 appreciation+2.137
ipp+2.127
 pride+2.072
Neg logit contributions
 stub-1.764
 bumped-1.749
 accidentally-1.694
 tri-1.561
?"-1.547
Token she
Token ID673
Feature activation+7.00e-01

Pos logit contributions
 bumped+1.745
 stub+1.726
 accidentally+1.557
 tri+1.473
?"+1.453
Neg logit contributions
 pleasure-2.906
 generosity-2.903
 appreciation-2.784
 pride-2.688
 kindness-2.636
Token gave
Token ID2921
Feature activation-5.10e-01

Pos logit contributions
 stub+0.854
 bumped+0.815
 accidentally+0.755
?"+0.713
 Ach+0.692
Neg logit contributions
 generosity-1.468
 pleasure-1.451
 appreciation-1.400
ipp-1.380
 pride-1.369
Token it
Token ID340
Feature activation-2.04e+00

Pos logit contributions
 pleasure+1.085
 generosity+1.063
 appreciation+1.016
ipp+0.992
 pride+0.990
Neg logit contributions
 bumped-0.656
 stub-0.656
 accidentally-0.617
 tri-0.562
?"-0.556
Token some
Token ID617
Feature activation-8.68e-01

Pos logit contributions
 pleasure+4.952
 generosity+4.826
ipp+4.625
ums+4.599
 appreciation+4.585
Neg logit contributions
 bumped-2.090
 stub-2.061
 accidentally-1.862
?"-1.781
 landed-1.696
Token food
Token ID2057
Feature activation-7.26e-01

Pos logit contributions
 pleasure+1.322
 generosity+1.279
 appreciation+1.201
ipp+1.185
 pride+1.152
Neg logit contributions
 bumped-1.630
 stub-1.629
 accidentally-1.563
?"-1.475
 tri-1.467
Token and
Token ID290
Feature activation+1.53e-01

Pos logit contributions
 pleasure+1.638
 generosity+1.604
 appreciation+1.538
ipp+1.509
 pride+1.500
Neg logit contributions
 bumped-0.834
 stub-0.831
 accidentally-0.777
 tri-0.700
?"-0.699
Token water
Token ID1660
Feature activation-6.66e-01

Pos logit contributions
 stub+0.248
 bumped+0.244
 accidentally+0.228
 tri+0.216
?"+0.213
Neg logit contributions
 generosity-0.276
 pleasure-0.276
 appreciation-0.259
 pride-0.251
ipp-0.243
Token.
Token ID13
Feature activation-4.18e-01

Pos logit contributions
 pleasure+1.570
 generosity+1.540
 appreciation+1.477
ipp+1.449
 pride+1.443
Neg logit contributions
 bumped-0.699
 stub-0.697
 accidentally-0.646
 tri-0.575
?"-0.571
Token This
Token ID770
Feature activation-8.97e-01

Pos logit contributions
 pleasure+0.817
 generosity+0.797
 appreciation+0.758
ipp+0.741
 pride+0.737
Neg logit contributions
 bumped-0.612
 stub-0.610
 accidentally-0.576
 tri-0.534
?"-0.528
Token was
Token ID373
Feature activation+2.98e-02

Pos logit contributions
 pleasure+1.916
 generosity+1.876
 appreciation+1.785
ipp+1.766
 pride+1.749
Neg logit contributions
 bumped-1.166
 stub-1.164
 accidentally-1.098
 tri-0.995
?"-0.993
Token was
Token ID373
Feature activation-9.07e-01

Pos logit contributions
 pleasure+0.952
 generosity+0.937
 appreciation+0.897
ipp+0.880
 pride+0.875
Neg logit contributions
 stub-0.502
 bumped-0.501
 accidentally-0.466
 tri-0.421
?"-0.420
Token so
Token ID523
Feature activation-2.01e+00

Pos logit contributions
 pleasure+1.545
 generosity+1.517
 appreciation+1.423
ipp+1.386
 pride+1.366
Neg logit contributions
 bumped-1.561
 stub-1.558
 accidentally-1.494
 tri-1.393
?"-1.379
Token happy
Token ID3772
Feature activation+3.58e-01

Pos logit contributions
 pleasure+3.111
 generosity+3.092
 appreciation+2.796
izza+2.781
 decisions+2.741
Neg logit contributions
 bumped-3.893
 stub-3.870
 accidentally-3.725
?"-3.505
 tri-3.472
Token to
Token ID284
Feature activation+1.37e+00

Pos logit contributions
 stub+0.521
 bumped+0.511
 accidentally+0.472
Ro+0.437
 landed+0.432
Neg logit contributions
 pleasure-0.703
 generosity-0.696
 appreciation-0.662
ipp-0.638
 choices-0.632
Token make
Token ID787
Feature activation-1.89e+00

Pos logit contributions
 stub+2.122
 bumped+2.110
 accidentally+2.010
 tri+1.907
Please+1.772
Neg logit contributions
 pleasure-2.596
 generosity-2.536
 pride-2.467
 appreciation-2.463
ipp-2.305
Token people
Token ID661
Feature activation-2.73e+00

Pos logit contributions
 pleasure+4.121
 generosity+4.011
ipp+3.855
izza+3.771
 appreciation+3.747
Neg logit contributions
 bumped-2.345
 stub-2.281
 accidentally-2.269
?"-2.041
 tri-2.020
Token happy
Token ID3772
Feature activation+2.26e-01

Pos logit contributions
 pleasure+4.890
 generosity+4.620
ipp+4.517
izza+4.431
apa+4.339
Neg logit contributions
 stub-4.699
 bumped-4.639
 accidentally-4.416
?"-4.369
 suddenly-4.181
Token.
Token ID13
Feature activation-1.68e+00

Pos logit contributions
 stub+0.303
 bumped+0.298
 accidentally+0.279
 tri+0.253
 landed+0.249
Neg logit contributions
 pleasure-0.476
 generosity-0.472
 appreciation-0.451
 pride-0.437
 choices-0.434
Token
Token ID198
Feature activation+1.22e-01

Pos logit contributions
 generosity+3.064
 pleasure+3.044
 appreciation+2.860
ipp+2.795
 pride+2.726
Neg logit contributions
 bumped-2.640
 stub-2.630
 accidentally-2.558
?"-2.401
 tri-2.343
Token
Token ID198
Feature activation-5.93e-01

Pos logit contributions
 bumped+0.168
 stub+0.167
 accidentally+0.158
 tri+0.146
 landed+0.141
Neg logit contributions
 pleasure-0.246
 generosity-0.239
 appreciation-0.232
ipp-0.227
 pride-0.224
TokenNext
Token ID10019
Feature activation-8.89e-01

Pos logit contributions
 pleasure+1.270
 generosity+1.245
 appreciation+1.189
ipp+1.163
 pride+1.157
Neg logit contributions
 bumped-0.758
 stub-0.758
 accidentally-0.712
 tri-0.648
?"-0.643
Token look
Token ID804
Feature activation-1.53e-01

Pos logit contributions
 pleasure+1.243
 generosity+1.217
 appreciation+1.159
ipp+1.133
 pride+1.127
Neg logit contributions
 bumped-0.840
 stub-0.839
 accidentally-0.793
 tri-0.727
?"-0.722
Token good
Token ID922
Feature activation-1.58e-01

Pos logit contributions
 pleasure+0.248
 generosity+0.239
 appreciation+0.226
 pride+0.220
ipp+0.218
Neg logit contributions
 stub-0.277
 bumped-0.274
 accidentally-0.260
 tri-0.247
?"-0.241
Token.
Token ID13
Feature activation-1.76e+00

Pos logit contributions
 pleasure+0.369
 generosity+0.363
 appreciation+0.348
ipp+0.341
 pride+0.340
Neg logit contributions
 stub-0.174
 bumped-0.171
 accidentally-0.158
 tri-0.142
 landed-0.137
Token He
Token ID679
Feature activation-3.22e-01

Pos logit contributions
 generosity+3.626
 pleasure+3.564
 appreciation+3.404
ipp+3.358
 pride+3.297
Neg logit contributions
 bumped-2.307
 stub-2.282
 accidentally-2.271
?"-2.098
 suddenly-2.012
Token was
Token ID373
Feature activation-1.44e+00

Pos logit contributions
 pleasure+0.702
 generosity+0.690
 appreciation+0.662
ipp+0.647
 pride+0.646
Neg logit contributions
 stub-0.390
 bumped-0.388
 accidentally-0.361
?"-0.329
 tri-0.328
Token so
Token ID523
Feature activation-2.79e+00

Pos logit contributions
 pleasure+2.575
 generosity+2.565
 appreciation+2.392
ipp+2.362
izza+2.358
Neg logit contributions
 bumped-2.384
 stub-2.342
 accidentally-2.290
 tri-2.109
 landed-2.075
Token proud
Token ID6613
Feature activation-6.03e-01

Pos logit contributions
 generosity+4.380
 pleasure+4.343
izza+4.022
 decisions+3.930
 appreciation+3.922
Neg logit contributions
 bumped-5.515
 stub-5.396
 accidentally-5.316
 tri-4.935
 suddenly-4.917
Token of
Token ID286
Feature activation-1.10e+00

Pos logit contributions
 pleasure+0.854
 generosity+0.829
 appreciation+0.772
ipp+0.746
 pride+0.740
Neg logit contributions
 bumped-1.203
 stub-1.202
 accidentally-1.156
 tri-1.091
?"-1.086
Token his
Token ID465
Feature activation-1.74e+00

Pos logit contributions
 pleasure+1.907
 generosity+1.850
 appreciation+1.735
ipp+1.721
 pride+1.685
Neg logit contributions
 bumped-1.853
 stub-1.819
 accidentally-1.777
?"-1.668
 tri-1.649
Token work
Token ID670
Feature activation-1.52e+00

Pos logit contributions
 pleasure+3.094
 generosity+3.025
ipp+2.934
 appreciation+2.868
 pride+2.797
Neg logit contributions
 bumped-2.759
 stub-2.741
 accidentally-2.692
?"-2.484
 tri-2.471
Token!
Token ID0
Feature activation-1.38e+00

Pos logit contributions
 pleasure+3.608
 generosity+3.526
 appreciation+3.370
ipp+3.350
 pride+3.285
Neg logit contributions
 bumped-1.605
 stub-1.572
 accidentally-1.495
 tri-1.324
?"-1.323
Tokenave
Token ID1015
Feature activation+1.98e-01

Pos logit contributions
 stub+0.290
 bumped+0.289
 accidentally+0.278
?"+0.266
 tri+0.262
Neg logit contributions
 pleasure-0.204
 generosity-0.201
 appreciation-0.187
 pride-0.186
ipp-0.182
Token them
Token ID606
Feature activation-2.01e+00

Pos logit contributions
 bumped+0.258
 stub+0.258
 accidentally+0.236
 tri+0.216
 seek+0.213
Neg logit contributions
 pleasure-0.420
 generosity-0.417
 appreciation-0.397
 pride-0.391
ipp-0.385
Token.
Token ID13
Feature activation+1.51e+00

Pos logit contributions
 pleasure+5.028
 generosity+4.770
 appreciation+4.634
ipp+4.571
 pride+4.476
Neg logit contributions
 bumped-1.980
 stub-1.914
 accidentally-1.875
?"-1.722
 tri-1.666
Token She
Token ID1375
Feature activation+8.45e-02

Pos logit contributions
 bumped+2.841
 stub+2.724
 accidentally+2.561
 tri+2.483
 landed+2.478
Neg logit contributions
 pleasure-2.576
 generosity-2.421
 pride-2.366
 appreciation-2.323
ums-2.187
Token hugged
Token ID41130
Feature activation+5.37e-01

Pos logit contributions
 stub+0.125
 bumped+0.123
 accidentally+0.115
 tri+0.107
?"+0.106
Neg logit contributions
 pleasure-0.160
 generosity-0.160
 appreciation-0.152
ipp-0.148
 pride-0.148
Token them
Token ID606
Feature activation-2.69e+00

Pos logit contributions
 stub+0.873
 bumped+0.868
 accidentally+0.806
 seek+0.768
 tri+0.760
Neg logit contributions
 pleasure-0.950
 generosity-0.948
 appreciation-0.895
 pride-0.893
ipp-0.867
Token.
Token ID13
Feature activation+1.64e+00

Pos logit contributions
 pleasure+6.675
 generosity+6.358
 appreciation+6.075
 decisions+5.994
 kinds+5.941
Neg logit contributions
 bumped-2.841
 stub-2.752
 accidentally-2.577
?"-2.572
 tri-2.395
Token
Token ID198
Feature activation+5.51e-01

Pos logit contributions
 bumped+3.139
 stub+3.002
 accidentally+2.817
 tri+2.756
 landed+2.731
Neg logit contributions
 pleasure-2.796
 generosity-2.600
 pride-2.566
 appreciation-2.496
ipp-2.358
Token
Token ID198
Feature activation+6.10e-01

Pos logit contributions
 bumped+0.826
 stub+0.804
 accidentally+0.762
 tri+0.705
 seek+0.682
Neg logit contributions
 pleasure-1.046
 generosity-1.044
 appreciation-1.006
ipp-0.978
 pride-0.975
TokenL
Token ID43
Feature activation+2.13e-01

Pos logit contributions
 bumped+0.964
 stub+0.952
 accidentally+0.888
 tri+0.823
 landed+0.802
Neg logit contributions
 generosity-1.103
 pleasure-1.073
 appreciation-1.059
ipp-1.032
 pride-1.022
Tokenily
Token ID813
Feature activation+1.62e-02

Pos logit contributions
 stub+0.330
 bumped+0.327
 accidentally+0.306
 tri+0.283
 landed+0.281
Neg logit contributions
 pleasure-0.400
 generosity-0.397
ipp-0.377
 appreciation-0.376
 pride-0.370
Token pancakes
Token ID47008
Feature activation-4.43e-01

Pos logit contributions
 pleasure+2.048
 generosity+1.999
 appreciation+1.890
ipp+1.864
 pride+1.813
Neg logit contributions
 bumped-2.125
 stub-2.119
 accidentally-2.026
?"-1.913
 tri-1.880
Token.
Token ID13
Feature activation-2.41e-01

Pos logit contributions
 pleasure+1.001
 generosity+0.984
 appreciation+0.942
ipp+0.925
 pride+0.919
Neg logit contributions
 stub-0.506
 bumped-0.505
 accidentally-0.470
 tri-0.423
?"-0.415
Token They
Token ID1119
Feature activation-8.26e-01

Pos logit contributions
 pleasure+0.443
 generosity+0.428
 appreciation+0.407
ipp+0.398
 pride+0.396
Neg logit contributions
 bumped-0.385
 stub-0.383
 accidentally-0.360
 tri-0.339
?"-0.334
Token were
Token ID547
Feature activation-2.07e+00

Pos logit contributions
 pleasure+1.604
 generosity+1.564
 appreciation+1.484
ipp+1.448
 pride+1.434
Neg logit contributions
 bumped-1.229
 stub-1.221
 accidentally-1.168
 tri-1.074
?"-1.066
Token so
Token ID523
Feature activation-3.10e+00

Pos logit contributions
 generosity+3.812
 pleasure+3.776
 appreciation+3.487
izza+3.470
ipp+3.437
Neg logit contributions
 bumped-3.298
 stub-3.275
 accidentally-3.241
?"-3.003
 landed-2.989
Token y
Token ID331
Feature activation-2.70e+00

Pos logit contributions
 pleasure+5.376
 generosity+5.318
 decisions+4.872
 appreciation+4.787
 choices+4.613
Neg logit contributions
 bumped-5.644
 stub-5.505
 accidentally-5.366
?"-5.159
 landed-4.980
Tokenummy
Token ID13513
Feature activation-1.79e+00

Pos logit contributions
 pleasure+2.922
 generosity+2.854
 appreciation+2.564
 pride+2.453
 kindness+2.145
Neg logit contributions
 stub-6.385
 accidentally-6.358
 bumped-6.353
?"-6.193
 tri-6.061
Token!
Token ID0
Feature activation-1.81e-01

Pos logit contributions
 pleasure+4.264
 generosity+4.164
 appreciation+4.009
ipp+3.989
 pride+3.970
Neg logit contributions
 bumped-1.819
 stub-1.737
 accidentally-1.720
?"-1.615
 tri-1.519
Token Lily
Token ID20037
Feature activation-6.94e-01

Pos logit contributions
 pleasure+0.337
 generosity+0.325
 appreciation+0.311
ipp+0.303
 pride+0.302
Neg logit contributions
 bumped-0.284
 stub-0.281
 accidentally-0.265
 tri-0.248
?"-0.244
Token was
Token ID373
Feature activation-1.46e+00

Pos logit contributions
 pleasure+1.485
 generosity+1.454
 appreciation+1.388
ipp+1.356
 pride+1.350
Neg logit contributions
 bumped-0.885
 stub-0.884
 accidentally-0.834
 tri-0.758
?"-0.750
Token happy
Token ID3772
Feature activation-2.62e-01

Pos logit contributions
 generosity+2.585
 pleasure+2.584
 appreciation+2.410
izza+2.373
ipp+2.350
Neg logit contributions
 stub-2.414
 bumped-2.412
 accidentally-2.316
 tri-2.144
?"-2.140
Token help
Token ID1037
Feature activation-2.53e-01

Pos logit contributions
 pleasure+1.187
 generosity+1.162
 appreciation+1.103
ipp+1.078
 pride+1.071
Neg logit contributions
 bumped-0.905
 stub-0.903
 accidentally-0.857
 tri-0.790
?"-0.786
Token so
Token ID523
Feature activation-4.36e-02

Pos logit contributions
 pleasure+0.512
 generosity+0.504
 appreciation+0.480
ipp+0.468
 pride+0.466
Neg logit contributions
 stub-0.352
 bumped-0.350
 accidentally-0.326
 tri-0.302
 landed-0.297
Token he
Token ID339
Feature activation+2.34e-01

Pos logit contributions
 pleasure+0.094
 generosity+0.092
 appreciation+0.088
ipp+0.087
 pride+0.085
Neg logit contributions
 stub-0.055
 bumped-0.055
 accidentally-0.051
 tri-0.047
 landed-0.046
Token got
Token ID1392
Feature activation-1.16e+00

Pos logit contributions
 stub+0.293
 bumped+0.283
 accidentally+0.260
 tri+0.243
?"+0.238
Neg logit contributions
 pleasure-0.498
 generosity-0.489
 appreciation-0.467
ipp-0.463
 pride-0.456
Token two
Token ID734
Feature activation-2.46e+00

Pos logit contributions
 pleasure+2.506
 generosity+2.451
 appreciation+2.331
ipp+2.304
 pride+2.254
Neg logit contributions
 bumped-1.521
 stub-1.483
 accidentally-1.422
?"-1.297
 tri-1.274
Token sp
Token ID599
Feature activation-2.91e+00

Pos logit contributions
 pleasure+4.624
 generosity+4.345
ipp+4.272
 appreciation+4.142
 pride+3.983
Neg logit contributions
 bumped-4.091
 accidentally-3.939
 stub-3.859
?"-3.575
 tri-3.546
Tokenoons
Token ID13022
Feature activation-1.27e+00

Pos logit contributions
 pleasure+5.549
 generosity+5.099
 appreciation+4.725
 pride+4.626
 kindness+4.454
Neg logit contributions
 bumped-5.248
?"-4.863
 stub-4.811
 accidentally-4.780
 tri-4.500
Token and
Token ID290
Feature activation-4.87e-01

Pos logit contributions
 pleasure+2.948
 generosity+2.863
 appreciation+2.744
ipp+2.686
 pride+2.680
Neg logit contributions
 bumped-1.433
 stub-1.378
 accidentally-1.331
?"-1.224
 tri-1.184
Token two
Token ID734
Feature activation-1.97e+00

Pos logit contributions
 pleasure+0.802
 generosity+0.784
 appreciation+0.737
ipp+0.715
 pride+0.712
Neg logit contributions
 stub-0.858
 bumped-0.856
 accidentally-0.815
 tri-0.765
?"-0.761
Token plates
Token ID13375
Feature activation-9.16e-01

Pos logit contributions
 pleasure+4.350
 generosity+4.150
 appreciation+3.964
ipp+3.958
 pride+3.858
Neg logit contributions
 bumped-2.615
 accidentally-2.498
 stub-2.480
 tri-2.215
 suddenly-2.188
Token.
Token ID13
Feature activation-4.29e-01

Pos logit contributions
 pleasure+2.199
 generosity+2.151
 appreciation+2.064
ipp+2.024
 pride+2.013
Neg logit contributions
 bumped-0.941
 stub-0.923
 accidentally-0.864
?"-0.767
 tri-0.765