TOP ACTIVATIONS
MAX = 4.095

I go play with Ben now ?" Lisa asks .
and sad . Suddenly , it heard a cheerful voice behind
playing in the park when it saw its special brush with
and Ben woke up when it was dark . They heard
bedroom door opened and a ghost appeared . The ghost was
writing in the notebook when it split into two ! She
Can I play with you now ?" he asked .
discover where it 's coming from !" So ,
Suddenly , she heard a voice from behind
apping its wings around when it saw a butterfly . It
home and play with Rose now ?" Her mom said ,
Can I have Big Red now ?" His mom
box . He turned around and saw a man with a
to travel to a far away place and brought Spot along
walking in the park when it saw a little bird .
want to play with me now ?" The gr umpy cat
Can I play with you later ?" Mia says , "
playing with her doll when it accidentally fell out of her
then , she fell off and hurt her knee .
all of a sudden , it heard a loud noise .

MIN ACTIVATIONS
MIN = -4.865

." They shared Ben 's ice - cream and
! They collected all the treasure and stored it
the luggage . They added all the luggage to the truck
Ben and Lily hugged grand pa . They said , "
for you ." Bill gave Bob the bread , and Bob
. Peter ate all the food and enjoyed the
. Together , they collected all the stones Jack wanted .
, Dad ." They hugged Dad and said , " You
The shop keeper gathered all of the items and told
Tom and his rabbit sold all the carrots . They were
were filled with fresh cuc umber and cheese . It made
The woman gave two yellow le mons to Sam
to do ! she gathered all her friends together to help
The pals gathered all of the ingredients for the
to help ! They gathered all of the supplies they would
ily and Ben gave the man some bread . They taped
sweet . He quickly collected all the berries he could find
tw igs . He gave them to the squirrel , who
ummy treats . They took all of the strawberries inside and
great dish . They gathered all the ingredients and mixed them

INTERVAL 1.855 - 4.095
CONTAINS 1.783%

here , but do not go near it . It might
, John sat in his arm chair . He wanted to
hold Anna . It broke and fell down . Anna fell
dirt behind . Suddenly , he heard his mom calling him
. As he flew , he could see all the bright

INTERVAL -0.385 - 1.855
CONTAINS 65.426%

watch the bird . It was new to her . She
. From that day on , Lily always stayed close to
and laughed . Sue did not care about the sign .
gave Max a big hug and told him he was very
She looks for another way down . She sees

INTERVAL -2.625 - -0.385
CONTAINS 32.401%

. " That looks dangerous , little one ," he said
You 're so pretty and you put the pieces together so
time , there was a little chicken named Ro oster .
, " Well , being a king isn 't easy .
From then on , the obedient nation always remembered

INTERVAL -4.865 - -2.625
CONTAINS 0.390%

The little girl loved all the books she saw .
a time , there were two best friends . They were
The two sisters held hands , walked back home
His mom was proud of him for being a good collector
not happy and sc olded John for ignoring her . She
Token I
Token ID314
Feature activation+5.63e-01

Pos logit contributions
 bumped+0.824
 stub+0.811
 accidentally+0.750
 tri+0.678
 landed+0.673
Neg logit contributions
 pleasure-1.442
 generosity-1.429
 appreciation-1.332
ipp-1.310
 pride-1.300
Token go
Token ID467
Feature activation+2.48e+00

Pos logit contributions
 bumped+0.984
 stub+0.978
 accidentally+0.934
 tri+0.883
 landed+0.861
Neg logit contributions
 pleasure-0.943
 generosity-0.913
 appreciation-0.861
 pride-0.837
ipp-0.826
Token play
Token ID711
Feature activation+2.74e+00

Pos logit contributions
 stub+4.038
 bumped+3.906
 accidentally+3.823
Ro+3.640
 tri+3.620
Neg logit contributions
 pleasure-4.565
 generosity-4.374
 appreciation-4.139
 pride-4.109
 kindness-4.093
Token with
Token ID351
Feature activation+7.84e-01

Pos logit contributions
 bumped+4.649
 stub+4.641
 accidentally+4.084
 tri+4.064
 landed+3.839
Neg logit contributions
 pleasure-5.139
 generosity-4.989
 appreciation-4.752
 kindness-4.680
ipp-4.594
Token Ben
Token ID3932
Feature activation+1.70e+00

Pos logit contributions
 bumped+1.186
 stub+1.186
 accidentally+1.089
 tri+1.006
 landed+0.999
Neg logit contributions
 pleasure-1.496
 generosity-1.477
 appreciation-1.403
 pride-1.373
ipp-1.364
Token now
Token ID783
Feature activation+4.10e+00

Pos logit contributions
 bumped+1.292
 stub+1.273
Ro+1.003
 accidentally+0.951
 tri+0.911
Neg logit contributions
 generosity-4.558
 pleasure-4.539
 appreciation-4.342
ipp-4.327
 choices-4.230
Token?"
Token ID1701
Feature activation+7.14e-01

Pos logit contributions
 bumped+4.952
 stub+4.940
Ro+4.751
Raw+4.400
 tri+4.137
Neg logit contributions
 pleasure-10.428
 generosity-9.725
ish-9.306
 appreciation-9.264
ipp-9.243
Token Lisa
Token ID15378
Feature activation+3.68e-01

Pos logit contributions
 stub+1.045
 bumped+1.033
 accidentally+0.943
 landed+0.894
 tri+0.881
Neg logit contributions
 pleasure-1.377
 generosity-1.355
 pride-1.292
 appreciation-1.284
 kinds-1.225
Token asks
Token ID7893
Feature activation+6.76e-01

Pos logit contributions
 stub+0.511
 bumped+0.506
 accidentally+0.476
 seek+0.437
 tri+0.435
Neg logit contributions
 pleasure-0.726
 generosity-0.721
 appreciation-0.693
 pride-0.680
ipp-0.670
Token.
Token ID13
Feature activation+9.79e-01

Pos logit contributions
 stub+0.834
 bumped+0.817
 accidentally+0.762
Ro+0.697
 landed+0.693
Neg logit contributions
 generosity-1.463
 pleasure-1.458
 appreciation-1.379
 pride-1.378
ipp-1.363
Token
Token ID198
Feature activation+6.83e-01

Pos logit contributions
 bumped+1.539
 stub+1.514
 accidentally+1.403
 landed+1.349
 seek+1.336
Neg logit contributions
 pleasure-1.815
 generosity-1.737
 appreciation-1.665
 pride-1.655
ipp-1.608
Token and
Token ID290
Feature activation+1.48e+00

Pos logit contributions
 stub+1.454
 bumped+1.429
 accidentally+1.311
Ro+1.280
 seek+1.268
Neg logit contributions
 pleasure-1.694
 generosity-1.655
 appreciation-1.579
ipp-1.567
 choices-1.544
Token sad
Token ID6507
Feature activation+1.45e+00

Pos logit contributions
 stub+2.104
 bumped+2.029
 accidentally+1.915
 Ach+1.839
Ro+1.835
Neg logit contributions
 pleasure-2.841
 generosity-2.840
 appreciation-2.734
 pride-2.716
 choices-2.631
Token.
Token ID13
Feature activation+6.68e-01

Pos logit contributions
 stub+2.122
 bumped+2.010
 accidentally+1.878
Ro+1.783
 seek+1.754
Neg logit contributions
 pleasure-2.922
 generosity-2.908
 appreciation-2.805
 choices-2.773
ipp-2.677
Token Suddenly
Token ID24975
Feature activation+1.24e+00

Pos logit contributions
 bumped+1.295
 stub+1.290
 accidentally+1.196
 landed+1.149
 seek+1.149
Neg logit contributions
 pleasure-1.024
 generosity-0.962
 appreciation-0.902
ipp-0.884
 pride-0.879
Token,
Token ID11
Feature activation+1.74e+00

Pos logit contributions
 stub+2.001
 bumped+1.879
Ro+1.808
 accidentally+1.758
Please+1.746
Neg logit contributions
 pleasure-2.212
 generosity-2.133
 appreciation-2.038
 pride-1.981
ipp-1.948
Token it
Token ID340
Feature activation+3.94e+00

Pos logit contributions
 stub+2.986
 bumped+2.841
 accidentally+2.620
Ro+2.558
 tri+2.516
Neg logit contributions
 pleasure-3.094
 generosity-2.960
 appreciation-2.803
 pride-2.770
 kindness-2.702
Token heard
Token ID2982
Feature activation+1.26e+00

Pos logit contributions
Ro+5.562
?"+5.106
Who+5.090
 seeker+5.028
 university+4.991
Neg logit contributions
 pride-7.262
 appreciation-7.225
 generosity-6.835
ealing-6.665
 compassion-6.516
Token a
Token ID257
Feature activation+8.54e-01

Pos logit contributions
 stub+2.038
 bumped+2.014
 accidentally+1.875
 seek+1.837
 tri+1.737
Neg logit contributions
 pleasure-2.279
 generosity-2.250
 appreciation-2.207
 pride-2.163
 kindness-2.078
Token cheerful
Token ID37999
Feature activation+3.01e-01

Pos logit contributions
 bumped+1.256
 stub+1.252
 accidentally+1.177
?"+1.084
 tri+1.070
Neg logit contributions
 generosity-1.646
 pleasure-1.634
 appreciation-1.591
 pride-1.487
 kindness-1.472
Token voice
Token ID3809
Feature activation+1.86e+00

Pos logit contributions
 bumped+0.372
 stub+0.369
 accidentally+0.348
 tri+0.311
?"+0.307
Neg logit contributions
 pleasure-0.646
 generosity-0.644
 appreciation-0.619
 pride-0.593
 choices-0.586
Token behind
Token ID2157
Feature activation+1.04e+00

Pos logit contributions
 stub+2.772
 bumped+2.638
 accidentally+2.594
 seek+2.347
Ro+2.347
Neg logit contributions
 generosity-3.576
 appreciation-3.413
 pleasure-3.381
 kindness-3.286
 kinds-3.277
Token playing
Token ID2712
Feature activation+1.49e+00

Pos logit contributions
 bumped+1.888
Ro+1.876
 stub+1.874
Who+1.845
?"+1.775
Neg logit contributions
 pleasure-2.211
 generosity-2.107
 pride-2.073
 appreciation-2.041
ipp-2.022
Token in
Token ID287
Feature activation+8.27e-01

Pos logit contributions
 stub+2.103
 bumped+2.062
 accidentally+1.826
 landed+1.738
 rang+1.736
Neg logit contributions
 pleasure-2.900
 generosity-2.872
 appreciation-2.739
 pride-2.668
 kindness-2.656
Token the
Token ID262
Feature activation+1.32e+00

Pos logit contributions
 stub+1.344
 bumped+1.335
 accidentally+1.234
 tri+1.156
?"+1.147
Neg logit contributions
 pleasure-1.492
 generosity-1.438
 appreciation-1.395
 kinds-1.357
 pride-1.332
Token park
Token ID3952
Feature activation+1.01e+00

Pos logit contributions
 bumped+1.716
 stub+1.613
 accidentally+1.487
 tri+1.396
?"+1.385
Neg logit contributions
 pleasure-2.863
 generosity-2.695
 appreciation-2.676
 pride-2.646
ipp-2.635
Token when
Token ID618
Feature activation+1.66e+00

Pos logit contributions
 stub+1.348
 bumped+1.326
 accidentally+1.191
Ro+1.149
?"+1.125
Neg logit contributions
 pleasure-2.068
 generosity-2.020
 appreciation-1.917
 kinds-1.912
ipp-1.898
Token it
Token ID340
Feature activation+3.90e+00

Pos logit contributions
 stub+2.470
 bumped+2.424
Ro+2.217
 seek+2.181
 accidentally+2.169
Neg logit contributions
 pleasure-3.163
 generosity-2.991
 appreciation-2.821
 pride-2.817
ipp-2.743
Token saw
Token ID2497
Feature activation+1.41e+00

Pos logit contributions
?"+4.983
 university+4.978
Who+4.942
Ro+4.792
 seeker+4.783
Neg logit contributions
ipp-7.057
ealing-6.726
apa-6.596
 appreciation-6.586
leased-6.509
Token its
Token ID663
Feature activation+9.21e-01

Pos logit contributions
 stub+2.343
 bumped+2.292
 seek+2.108
 accidentally+2.085
 tri+2.000
Neg logit contributions
 pleasure-2.541
 generosity-2.477
 appreciation-2.386
 pride-2.325
 kindness-2.272
Token special
Token ID2041
Feature activation+9.82e-01

Pos logit contributions
 bumped+1.800
 stub+1.754
 accidentally+1.692
Ro+1.628
?"+1.605
Neg logit contributions
 pleasure-1.409
 generosity-1.357
 appreciation-1.275
 pride-1.212
 choices-1.199
Token brush
Token ID14093
Feature activation+9.61e-01

Pos logit contributions
 bumped+1.613
 stub+1.597
 accidentally+1.542
Ro+1.452
 tri+1.438
Neg logit contributions
 pleasure-1.770
 generosity-1.755
 appreciation-1.673
 choices-1.581
 pride-1.575
Token with
Token ID351
Feature activation-1.02e+00

Pos logit contributions
 bumped+1.181
 stub+1.176
Ro+1.082
 accidentally+1.059
Raw+1.000
Neg logit contributions
 generosity-2.020
 pleasure-2.016
 appreciation-1.925
ipp-1.899
 pride-1.872
Token and
Token ID290
Feature activation-5.56e-01

Pos logit contributions
 bumped+3.342
Ro+3.319
 stub+3.283
 Ach+3.125
 seek+3.121
Neg logit contributions
 pleasure-2.859
 generosity-2.816
 appreciation-2.772
 pride-2.720
ipp-2.685
Token Ben
Token ID3932
Feature activation+1.84e+00

Pos logit contributions
 pleasure+1.318
 generosity+1.295
 appreciation+1.242
ipp+1.220
 pride+1.214
Neg logit contributions
 bumped-0.580
 stub-0.579
 accidentally-0.535
 tri-0.475
?"-0.471
Token woke
Token ID19092
Feature activation+8.47e-01

Pos logit contributions
 stub+2.619
 bumped+2.495
Ro+2.459
 seek+2.458
?"+2.401
Neg logit contributions
 pleasure-3.302
 pride-3.216
 generosity-3.193
 appreciation-3.178
ipp-3.100
Token up
Token ID510
Feature activation+4.77e-01

Pos logit contributions
 stub+1.660
 bumped+1.643
 accidentally+1.528
 seek+1.504
Ro+1.473
Neg logit contributions
 pleasure-1.254
 generosity-1.192
ipp-1.154
 appreciation-1.142
 choices-1.115
Token when
Token ID618
Feature activation+7.60e-01

Pos logit contributions
 stub+0.715
 bumped+0.712
 accidentally+0.675
Ro+0.635
 landed+0.624
Neg logit contributions
 pleasure-0.900
 generosity-0.894
 appreciation-0.853
 pride-0.825
ipp-0.816
Token it
Token ID340
Feature activation+3.88e+00

Pos logit contributions
 stub+1.166
 bumped+1.153
 accidentally+1.054
 landed+1.026
?"+1.023
Neg logit contributions
 pleasure-1.388
 generosity-1.350
 appreciation-1.297
 pride-1.256
ipp-1.248
Token was
Token ID373
Feature activation+1.83e+00

Pos logit contributions
 stub+5.993
 seek+5.930
Ro+5.838
 seeker+5.793
Who+5.741
Neg logit contributions
ipp-6.232
 appreciation-6.199
 kinds-6.087
 pride-5.882
aying-5.846
Token dark
Token ID3223
Feature activation+1.44e+00

Pos logit contributions
 stub+3.201
 bumped+3.149
Ro+2.949
 accidentally+2.909
Who+2.883
Neg logit contributions
 pleasure-3.157
 pride-2.859
 generosity-2.826
 appreciation-2.813
ipp-2.811
Token.
Token ID13
Feature activation+6.12e-01

Pos logit contributions
 stub+1.988
 bumped+1.890
 accidentally+1.828
Ro+1.660
 tri+1.650
Neg logit contributions
 pleasure-2.875
 generosity-2.822
 kinds-2.796
ipp-2.731
 appreciation-2.709
Token They
Token ID1119
Feature activation+1.30e+00

Pos logit contributions
 bumped+1.196
 stub+1.190
 accidentally+1.102
 landed+1.079
?"+1.074
Neg logit contributions
 pleasure-0.918
 generosity-0.853
 appreciation-0.815
ipp-0.802
 pride-0.795
Token heard
Token ID2982
Feature activation+5.16e-01

Pos logit contributions
 stub+1.707
 bumped+1.608
?"+1.522
 accidentally+1.504
Ro+1.495
Neg logit contributions
 pleasure-2.558
 appreciation-2.511
 generosity-2.506
 pride-2.488
ipp-2.386
Token bedroom
Token ID14043
Feature activation+2.16e+00

Pos logit contributions
 bumped+3.080
 accidentally+2.984
 stub+2.920
 landed+2.856
?"+2.770
Neg logit contributions
 pleasure-2.791
 generosity-2.635
 pride-2.523
 appreciation-2.472
 choices-2.384
Token door
Token ID3420
Feature activation+2.32e+00

Pos logit contributions
Ro+2.466
 stub+2.344
 seek+2.299
?"+2.247
Raw+2.197
Neg logit contributions
ums-4.333
ipp-4.310
 generosity-4.278
 pleasure-4.238
 appreciation-4.194
Token opened
Token ID4721
Feature activation+1.06e+00

Pos logit contributions
Ro+3.253
 stub+3.215
 seek+3.013
Raw+2.937
 accidentally+2.922
Neg logit contributions
 generosity-4.210
 appreciation-4.176
 pleasure-4.139
 kinds-4.114
 pride-4.087
Token and
Token ID290
Feature activation+2.45e+00

Pos logit contributions
 stub+1.694
 bumped+1.624
 accidentally+1.552
Ro+1.500
 tri+1.455
Neg logit contributions
 pleasure-1.935
 appreciation-1.845
 generosity-1.829
ipp-1.794
 pride-1.768
Token a
Token ID257
Feature activation+1.74e+00

Pos logit contributions
 stub+4.019
 bumped+3.687
 accidentally+3.545
Ro+3.526
Raw+3.494
Neg logit contributions
 pleasure-4.394
 generosity-4.204
 pride-4.163
 appreciation-4.114
 kindness-3.888
Token ghost
Token ID10905
Feature activation+3.88e+00

Pos logit contributions
 bumped+3.196
 stub+3.136
 accidentally+2.964
 tri+2.882
 landed+2.870
Neg logit contributions
 pleasure-2.764
 generosity-2.635
 appreciation-2.576
 pride-2.413
 kindness-2.306
Token appeared
Token ID4120
Feature activation+1.86e+00

Pos logit contributions
Ro+5.100
 stub+5.077
 seek+5.024
Raw+4.985
 accidentally+4.792
Neg logit contributions
 appreciation-7.050
 generosity-6.957
 kinds-6.908
 pleasure-6.810
 sorts-6.736
Token.
Token ID13
Feature activation+1.02e+00

Pos logit contributions
 stub+2.701
 bumped+2.642
 accidentally+2.616
Ro+2.358
Raw+2.261
Neg logit contributions
 pleasure-3.675
 generosity-3.558
ipp-3.489
 appreciation-3.487
 pride-3.455
Token The
Token ID383
Feature activation+2.52e+00

Pos logit contributions
 bumped+1.978
 stub+1.955
 accidentally+1.889
 tri+1.829
 seek+1.764
Neg logit contributions
 pleasure-1.514
 generosity-1.429
 appreciation-1.375
 pride-1.341
 kinds-1.261
Token ghost
Token ID10905
Feature activation+2.19e+00

Pos logit contributions
 bumped+5.153
 accidentally+5.146
 stub+5.028
 tri+4.766
 landed+4.737
Neg logit contributions
 pleasure-3.762
 pride-3.524
 appreciation-3.458
 generosity-3.447
 kindness-3.198
Token was
Token ID373
Feature activation+1.38e+00

Pos logit contributions
 stub+2.844
Ro+2.706
 bumped+2.611
?"+2.539
 seek+2.493
Neg logit contributions
 generosity-4.112
 pleasure-4.008
 appreciation-3.998
 kinds-3.856
 pride-3.822
Token writing
Token ID3597
Feature activation+1.53e+00

Pos logit contributions
 bumped+1.292
 stub+1.287
Ro+1.261
 seek+1.199
 accidentally+1.195
Neg logit contributions
 pleasure-1.688
 generosity-1.600
ipp-1.569
 pride-1.553
 appreciation-1.546
Token in
Token ID287
Feature activation-1.01e-01

Pos logit contributions
 stub+2.429
 bumped+2.329
Ro+2.223
 accidentally+2.186
 tri+2.077
Neg logit contributions
 pleasure-2.804
 generosity-2.800
 pride-2.687
 appreciation-2.673
ipp-2.587
Token the
Token ID262
Feature activation+4.03e-01

Pos logit contributions
 pleasure+0.214
 generosity+0.210
 appreciation+0.203
 pride+0.197
ipp+0.193
Neg logit contributions
 stub-0.131
 bumped-0.131
 accidentally-0.123
 tri-0.112
?"-0.110
Token notebook
Token ID20922
Feature activation+1.72e+00

Pos logit contributions
 bumped+0.631
 stub+0.613
 accidentally+0.586
 tri+0.543
?"+0.533
Neg logit contributions
 pleasure-0.763
 generosity-0.739
 pride-0.709
 appreciation-0.707
ipp-0.675
Token when
Token ID618
Feature activation+1.87e+00

Pos logit contributions
 bumped+2.777
 stub+2.757
 accidentally+2.474
Ro+2.462
Who+2.431
Neg logit contributions
 pleasure-3.186
 generosity-3.097
ipp-2.999
 kinds-2.920
 appreciation-2.866
Token it
Token ID340
Feature activation+3.83e+00

Pos logit contributions
 stub+2.395
 bumped+2.384
Ro+2.219
?"+2.168
 accidentally+2.141
Neg logit contributions
 pleasure-3.879
 generosity-3.725
 appreciation-3.600
 pride-3.535
ipp-3.527
Token split
Token ID6626
Feature activation+1.51e+00

Pos logit contributions
Ro+5.791
?"+5.458
Raw+5.387
 stub+5.302
Who+5.277
Neg logit contributions
 generosity-6.465
 pride-6.408
 appreciation-6.334
ipp-6.279
 pleasure-6.131
Token into
Token ID656
Feature activation+4.41e-01

Pos logit contributions
 stub+2.190
 bumped+2.112
Ro+2.029
 accidentally+1.948
Raw+1.870
Neg logit contributions
 pleasure-2.940
ipp-2.860
 generosity-2.800
 appreciation-2.786
 pride-2.767
Token two
Token ID734
Feature activation-8.69e-01

Pos logit contributions
 stub+0.725
 bumped+0.722
 accidentally+0.674
?"+0.637
 tri+0.636
Neg logit contributions
 pleasure-0.782
 generosity-0.773
 appreciation-0.749
 pride-0.723
 kinds-0.701
Token!
Token ID0
Feature activation+2.64e-02

Pos logit contributions
 pleasure+1.552
 generosity+1.507
 appreciation+1.422
ipp+1.397
 pride+1.370
Neg logit contributions
 bumped-1.425
 stub-1.422
 accidentally-1.364
 tri-1.269
?"-1.251
Token She
Token ID1375
Feature activation+1.26e+00

Pos logit contributions
 bumped+0.041
 stub+0.041
 accidentally+0.038
 tri+0.036
?"+0.036
Neg logit contributions
 pleasure-0.049
 generosity-0.047
 appreciation-0.045
ipp-0.045
 pride-0.044
Token Can
Token ID1680
Feature activation+6.19e-02

Pos logit contributions
 pleasure+2.008
 generosity+1.984
 appreciation+1.904
ipp+1.866
 pride+1.846
Neg logit contributions
 stub-1.155
 bumped-1.141
 accidentally-1.082
?"-1.000
 tri-0.977
Token I
Token ID314
Feature activation-8.77e-02

Pos logit contributions
 bumped+0.073
 stub+0.072
 accidentally+0.066
 tri+0.061
 landed+0.059
Neg logit contributions
 pleasure-0.138
 generosity-0.137
 appreciation-0.129
ipp-0.127
 pride-0.126
Token play
Token ID711
Feature activation+1.80e+00

Pos logit contributions
 pleasure+0.162
 generosity+0.157
 appreciation+0.149
ipp+0.145
 pride+0.145
Neg logit contributions
 stub-0.138
 bumped-0.138
 accidentally-0.130
 tri-0.122
 landed-0.119
Token with
Token ID351
Feature activation+8.86e-02

Pos logit contributions
 bumped+2.844
 stub+2.843
 accidentally+2.530
 tri+2.503
 landed+2.271
Neg logit contributions
 pleasure-3.444
 generosity-3.421
 appreciation-3.240
 pride-3.131
 kindness-3.108
Token you
Token ID345
Feature activation+3.04e+00

Pos logit contributions
 stub+0.125
 bumped+0.124
 accidentally+0.114
 tri+0.107
 landed+0.102
Neg logit contributions
 pleasure-0.177
 generosity-0.175
 appreciation-0.166
ipp-0.163
 pride-0.163
Token now
Token ID783
Feature activation+3.82e+00

Pos logit contributions
 stub+4.395
 bumped+4.129
Ro+3.667
Raw+3.497
 tri+3.473
Neg logit contributions
 generosity-6.648
 pleasure-6.596
ipp-6.295
 kindness-6.206
izza-6.092
Token?"
Token ID1701
Feature activation+1.28e+00

Pos logit contributions
 stub+4.562
 bumped+4.473
Ro+3.890
Raw+3.761
 tri+3.706
Neg logit contributions
 pleasure-9.676
 generosity-9.197
ish-9.108
ipp-9.032
 pride-8.700
Token he
Token ID339
Feature activation+1.75e+00

Pos logit contributions
 bumped+1.872
 stub+1.818
 accidentally+1.642
 seek+1.613
 landed+1.579
Neg logit contributions
 pleasure-2.474
 generosity-2.445
 pride-2.317
 appreciation-2.309
ums-2.230
Token asked
Token ID1965
Feature activation+2.91e-01

Pos logit contributions
 stub+2.340
 bumped+2.327
 accidentally+2.139
storm+2.007
?"+2.005
Neg logit contributions
 generosity-3.440
 appreciation-3.321
 pleasure-3.231
 pride-3.226
 kinds-3.107
Token.
Token ID13
Feature activation+1.62e+00

Pos logit contributions
 stub+0.329
 bumped+0.328
 accidentally+0.302
 landed+0.273
 tri+0.272
Neg logit contributions
 pleasure-0.655
 generosity-0.653
 appreciation-0.620
 pride-0.615
ipp-0.614
Token
Token ID198
Feature activation+8.89e-01

Pos logit contributions
 bumped+2.834
 stub+2.741
 accidentally+2.590
 tri+2.542
 landed+2.471
Neg logit contributions
 pleasure-2.834
 generosity-2.649
 appreciation-2.593
 pride-2.545
ipp-2.417
Token discover
Token ID7073
Feature activation+1.06e+00

Pos logit contributions
 stub+0.661
 bumped+0.639
 accidentally+0.612
 tri+0.555
 landed+0.527
Neg logit contributions
 pleasure-1.093
 generosity-1.056
 appreciation-1.019
 pride-0.993
ipp-0.987
Token where
Token ID810
Feature activation+1.21e+00

Pos logit contributions
 stub+1.873
 bumped+1.822
 accidentally+1.693
 tri+1.614
 seek+1.552
Neg logit contributions
 pleasure-1.898
 generosity-1.836
 appreciation-1.766
 kindness-1.669
 kinds-1.667
Token it
Token ID340
Feature activation+1.33e+00

Pos logit contributions
 bumped+1.785
 stub+1.768
 accidentally+1.684
 tri+1.557
 seek+1.510
Neg logit contributions
 pleasure-2.310
 generosity-2.291
 appreciation-2.205
 pride-2.120
 choices-2.118
Token's
Token ID338
Feature activation+3.09e+00

Pos logit contributions
 stub+2.648
 bumped+2.514
 accidentally+2.429
 seek+2.329
 tri+2.286
Neg logit contributions
 pleasure-1.854
 generosity-1.824
 appreciation-1.704
 kinds-1.690
 choices-1.658
Token coming
Token ID2406
Feature activation+2.85e+00

Pos logit contributions
 stub+5.422
 bumped+5.237
Who+5.085
Ro+4.951
 accidentally+4.827
Neg logit contributions
 pleasure-5.659
 generosity-5.286
 decisions-5.106
 appreciation-5.059
 choices-5.055
Token from
Token ID422
Feature activation+3.81e+00

Pos logit contributions
 stub+5.391
 bumped+5.297
 accidentally+5.160
 tri+4.732
lf+4.617
Neg logit contributions
 pleasure-4.809
 appreciation-4.423
 generosity-4.338
 decisions-4.267
 choices-4.252
Token!"
Token ID2474
Feature activation+3.76e-01

Pos logit contributions
 bumped+5.625
 stub+5.610
 tri+4.823
 accidentally+4.799
Who+4.621
Neg logit contributions
 pleasure-8.799
 kinds-8.105
 generosity-7.952
 decisions-7.812
 sorts-7.796
Token
Token ID198
Feature activation+8.03e-01

Pos logit contributions
 stub+0.670
 bumped+0.669
 accidentally+0.627
 tri+0.600
 seek+0.584
Neg logit contributions
 pleasure-0.606
 generosity-0.591
 appreciation-0.568
 pride-0.543
ipp-0.539
Token
Token ID198
Feature activation+9.45e-01

Pos logit contributions
 bumped+1.267
 stub+1.228
 accidentally+1.158
 tri+1.105
 seek+1.065
Neg logit contributions
 pleasure-1.482
 appreciation-1.429
 generosity-1.426
ipp-1.376
 pride-1.346
TokenSo
Token ID2396
Feature activation-4.30e-01

Pos logit contributions
 bumped+1.257
 stub+1.235
 accidentally+1.090
 tri+1.044
?"+0.986
Neg logit contributions
 generosity-1.964
 pleasure-1.951
 appreciation-1.949
ipp-1.909
izza-1.831
Token,
Token ID11
Feature activation+6.87e-01

Pos logit contributions
 pleasure+0.925
 generosity+0.906
 appreciation+0.868
ipp+0.847
 pride+0.844
Neg logit contributions
 stub-0.545
 bumped-0.543
 accidentally-0.507
 tri-0.463
?"-0.457
Token
Token ID220
Feature activation-3.37e-02

Pos logit contributions
 bumped+2.641
 stub+2.640
 accidentally+2.479
 tri+2.415
 seek+2.338
Neg logit contributions
 pleasure-1.936
 generosity-1.801
 appreciation-1.736
 pride-1.699
ipp-1.582
Token
Token ID198
Feature activation+1.27e+00

Pos logit contributions
 pleasure+0.069
 generosity+0.067
 appreciation+0.065
ipp+0.064
 pride+0.063
Neg logit contributions
 bumped-0.046
 stub-0.045
 accidentally-0.043
 tri-0.039
 landed-0.038
Token
Token ID198
Feature activation+1.18e+00

Pos logit contributions
 bumped+2.099
 stub+2.037
 accidentally+1.968
 tri+1.879
 landed+1.778
Neg logit contributions
 pleasure-2.208
 appreciation-2.129
 generosity-2.090
 pride-2.046
ipp-2.003
TokenSuddenly
Token ID38582
Feature activation+1.62e+00

Pos logit contributions
 bumped+2.148
 stub+2.127
 accidentally+2.036
 tri+1.934
?"+1.886
Neg logit contributions
 pleasure-1.792
 generosity-1.776
 appreciation-1.758
 pride-1.678
ipp-1.644
Token,
Token ID11
Feature activation+1.61e+00

Pos logit contributions
 stub+2.710
 bumped+2.501
 accidentally+2.361
Ro+2.336
Please+2.248
Neg logit contributions
 pleasure-2.868
 generosity-2.743
 appreciation-2.617
 pride-2.568
 kindness-2.498
Token she
Token ID673
Feature activation+3.79e+00

Pos logit contributions
 stub+2.676
 bumped+2.506
 accidentally+2.366
 tri+2.267
Ro+2.264
Neg logit contributions
 pleasure-2.910
 generosity-2.798
 pride-2.638
 appreciation-2.634
 kindness-2.539
Token heard
Token ID2982
Feature activation+1.29e+00

Pos logit contributions
Ro+4.729
 seek+3.996
Raw+3.967
storm+3.748
jar+3.742
Neg logit contributions
 pride-8.113
 appreciation-7.794
apa-7.780
ipp-7.764
ealing-7.749
Token a
Token ID257
Feature activation+6.12e-01

Pos logit contributions
 stub+2.132
 bumped+2.080
 accidentally+1.985
 tri+1.830
 seek+1.817
Neg logit contributions
 pleasure-2.344
 generosity-2.324
 appreciation-2.275
 pride-2.223
 kindness-2.140
Token voice
Token ID3809
Feature activation+1.87e+00

Pos logit contributions
 stub+0.845
 bumped+0.842
 accidentally+0.795
?"+0.727
 tri+0.717
Neg logit contributions
 generosity-1.218
 pleasure-1.213
 appreciation-1.177
 pride-1.116
 choices-1.090
Token from
Token ID422
Feature activation+2.30e+00

Pos logit contributions
 stub+2.905
 accidentally+2.746
 bumped+2.713
 seek+2.445
Ro+2.430
Neg logit contributions
 generosity-3.521
 appreciation-3.398
 pleasure-3.347
 kinds-3.283
 kindness-3.212
Token behind
Token ID2157
Feature activation+1.41e+00

Pos logit contributions
 stub+3.634
 accidentally+3.512
 bumped+3.428
 tri+3.098
 Ach+2.914
Neg logit contributions
 pleasure-4.571
 generosity-4.499
 appreciation-4.359
 pride-4.311
 kinds-4.289
Tokenapping
Token ID5912
Feature activation+8.69e-01

Pos logit contributions
 generosity+3.039
 pleasure+3.026
 appreciation+2.766
 choices+2.643
 pride+2.624
Neg logit contributions
 stub-3.814
 bumped-3.718
 accidentally-3.527
 tri-3.445
?"-3.339
Token its
Token ID663
Feature activation-3.34e-01

Pos logit contributions
 bumped+1.239
 stub+1.233
 accidentally+1.134
Ro+1.075
Who+1.052
Neg logit contributions
 pleasure-1.676
 generosity-1.636
 appreciation-1.598
 pride-1.556
ipp-1.552
Token wings
Token ID12098
Feature activation+3.12e-01

Pos logit contributions
 pleasure+0.516
 generosity+0.502
 appreciation+0.472
 pride+0.455
ipp+0.454
Neg logit contributions
 bumped-0.623
 stub-0.621
 accidentally-0.597
?"-0.561
 tri-0.560
Token around
Token ID1088
Feature activation+1.22e+00

Pos logit contributions
 stub+0.443
 bumped+0.440
 accidentally+0.417
?"+0.387
Ro+0.385
Neg logit contributions
 pleasure-0.595
 generosity-0.582
 appreciation-0.556
ipp-0.551
 pride-0.547
Token when
Token ID618
Feature activation+1.91e+00

Pos logit contributions
 stub+1.981
 bumped+1.963
 accidentally+1.782
 landed+1.688
 tri+1.671
Neg logit contributions
 pleasure-2.218
 generosity-2.143
 appreciation-2.052
 kinds-1.991
 pride-1.990
Token it
Token ID340
Feature activation+3.76e+00

Pos logit contributions
 stub+2.898
 bumped+2.765
Ro+2.563
 seek+2.512
 accidentally+2.476
Neg logit contributions
 pleasure-3.595
 generosity-3.443
 pride-3.218
 appreciation-3.195
 kindness-3.134
Token saw
Token ID2497
Feature activation+7.82e-01

Pos logit contributions
 seeker+4.986
 university+4.588
Who+4.536
Ro+4.531
 stub+4.493
Neg logit contributions
 kinds-6.570
 appreciation-6.564
 generosity-6.538
ipp-6.503
 pride-6.395
Token a
Token ID257
Feature activation+1.14e+00

Pos logit contributions
 stub+1.203
 bumped+1.187
 accidentally+1.085
 seek+1.068
 tri+1.027
Neg logit contributions
 pleasure-1.496
 generosity-1.465
 appreciation-1.403
 pride-1.368
ipp-1.356
Token butterfly
Token ID35113
Feature activation+1.56e+00

Pos logit contributions
 bumped+1.958
 stub+1.912
 accidentally+1.785
 tri+1.697
 landed+1.675
Neg logit contributions
 pleasure-1.966
 generosity-1.887
 appreciation-1.791
 pride-1.695
ipp-1.656
Token.
Token ID13
Feature activation-2.16e-01

Pos logit contributions
 bumped+2.100
 stub+2.053
 accidentally+1.934
 seek+1.784
Ro+1.738
Neg logit contributions
 generosity-2.981
 pleasure-2.960
 appreciation-2.908
 kinds-2.790
ipp-2.763
Token It
Token ID632
Feature activation+1.87e+00

Pos logit contributions
 pleasure+0.348
 generosity+0.336
 appreciation+0.316
ipp+0.309
 pride+0.306
Neg logit contributions
 bumped-0.390
 stub-0.389
 accidentally-0.369
 tri-0.349
?"-0.344
Token home
Token ID1363
Feature activation+2.33e+00

Pos logit contributions
 stub+4.338
 bumped+4.128
 accidentally+4.011
Ro+3.924
 tri+3.890
Neg logit contributions
 pleasure-3.676
 generosity-3.506
 appreciation-3.349
 pride-3.309
 kindness-3.167
Token and
Token ID290
Feature activation+1.39e+00

Pos logit contributions
 stub+4.105
 bumped+4.019
 accidentally+3.675
Ro+3.618
 tri+3.513
Neg logit contributions
 pleasure-4.314
 generosity-4.221
 appreciation-3.995
ipp-3.974
 choices-3.846
Token play
Token ID711
Feature activation+2.89e+00

Pos logit contributions
 bumped+2.244
 stub+2.227
 accidentally+2.070
 tri+1.987
 landed+1.888
Neg logit contributions
 pleasure-2.572
 generosity-2.556
 appreciation-2.382
 pride-2.317
 kindness-2.283
Token with
Token ID351
Feature activation+5.51e-01

Pos logit contributions
 stub+5.697
 bumped+5.686
 tri+5.155
 accidentally+5.022
 landed+4.696
Neg logit contributions
 pleasure-4.933
 generosity-4.786
 appreciation-4.622
 kindness-4.441
 pride-4.419
Token Rose
Token ID8049
Feature activation+1.91e+00

Pos logit contributions
 stub+0.837
 bumped+0.832
 accidentally+0.761
 tri+0.720
 seek+0.699
Neg logit contributions
 pleasure-1.054
 generosity-1.044
 appreciation-0.992
ipp-0.972
 pride-0.969
Token now
Token ID783
Feature activation+3.75e+00

Pos logit contributions
 stub+2.368
 bumped+2.341
Ro+2.049
 accidentally+1.979
Raw+1.968
Neg logit contributions
 generosity-4.157
 pleasure-4.109
ipp-4.026
 appreciation-3.957
 kinds-3.867
Token?"
Token ID1701
Feature activation+1.32e+00

Pos logit contributions
 stub+4.393
 bumped+4.296
Ro+4.091
Raw+3.704
 tri+3.606
Neg logit contributions
 pleasure-9.744
 generosity-9.095
ipp-8.961
ish-8.845
 choices-8.720
Token Her
Token ID2332
Feature activation+1.37e+00

Pos logit contributions
 bumped+2.137
 stub+2.019
 landed+1.864
 accidentally+1.856
 tri+1.811
Neg logit contributions
 pleasure-2.462
 generosity-2.410
 appreciation-2.285
 pride-2.242
 kindness-2.164
Token mom
Token ID1995
Feature activation+1.59e-01

Pos logit contributions
 bumped+2.453
 stub+2.357
 accidentally+2.322
 landed+2.208
 seek+2.195
Neg logit contributions
 pleasure-2.304
 generosity-2.288
 appreciation-2.187
 pride-2.103
ums-2.035
Token said
Token ID531
Feature activation-1.03e+00

Pos logit contributions
 stub+0.232
 bumped+0.226
 accidentally+0.214
 seek+0.198
 tri+0.197
Neg logit contributions
 generosity-0.303
 pleasure-0.302
 appreciation-0.291
ipp-0.282
 pride-0.281
Token,
Token ID11
Feature activation+5.85e-01

Pos logit contributions
 pleasure+2.505
 generosity+2.448
 appreciation+2.356
ipp+2.315
 pride+2.303
Neg logit contributions
 bumped-1.014
 stub-1.007
 accidentally-0.933
?"-0.829
 tri-0.817
Token Can
Token ID1680
Feature activation+8.31e-02

Pos logit contributions
 pleasure+1.269
 generosity+1.244
 appreciation+1.191
ipp+1.168
 pride+1.163
Neg logit contributions
 bumped-0.616
 stub-0.615
 accidentally-0.573
 tri-0.513
?"-0.507
Token I
Token ID314
Feature activation+7.82e-01

Pos logit contributions
 bumped+0.092
 stub+0.091
 accidentally+0.084
 tri+0.075
 landed+0.073
Neg logit contributions
 pleasure-0.191
 generosity-0.189
 appreciation-0.179
ipp-0.176
 pride-0.175
Token have
Token ID423
Feature activation+4.66e-01

Pos logit contributions
 bumped+1.182
 stub+1.180
 accidentally+1.081
 tri+1.026
 landed+1.017
Neg logit contributions
 pleasure-1.506
 generosity-1.447
 appreciation-1.401
 pride-1.354
ipp-1.342
Token Big
Token ID4403
Feature activation-1.52e-01

Pos logit contributions
 stub+0.779
 bumped+0.771
 accidentally+0.721
 tri+0.694
 seek+0.665
Neg logit contributions
 pleasure-0.823
 generosity-0.801
 appreciation-0.757
 pride-0.744
ipp-0.716
Token Red
Token ID2297
Feature activation+1.13e+00

Pos logit contributions
 pleasure+0.271
 generosity+0.265
 appreciation+0.248
ipp+0.242
 pride+0.240
Neg logit contributions
 stub-0.247
 bumped-0.246
 accidentally-0.235
 tri-0.218
 landed-0.216
Token now
Token ID783
Feature activation+3.74e+00

Pos logit contributions
 stub+1.526
 bumped+1.513
 accidentally+1.375
Ro+1.287
 tri+1.281
Neg logit contributions
 pleasure-2.330
 generosity-2.329
ipp-2.178
 appreciation-2.155
 choices-2.111
Token?"
Token ID1701
Feature activation+1.57e+00

Pos logit contributions
 stub+5.077
 bumped+4.846
Ro+4.723
Raw+4.463
Who+4.247
Neg logit contributions
 pleasure-8.720
 generosity-8.268
ipp-7.948
 pride-7.704
 kindness-7.683
Token
Token ID198
Feature activation+8.75e-01

Pos logit contributions
 bumped+2.675
 stub+2.599
 accidentally+2.470
 landed+2.432
 seek+2.381
Neg logit contributions
 pleasure-2.696
 generosity-2.640
 appreciation-2.537
 pride-2.408
ums-2.386
Token
Token ID198
Feature activation+5.60e-01

Pos logit contributions
 bumped+1.369
 stub+1.335
 accidentally+1.288
 tri+1.206
 seek+1.162
Neg logit contributions
 pleasure-1.608
 generosity-1.579
 appreciation-1.563
 pride-1.486
ipp-1.477
TokenHis
Token ID6653
Feature activation+1.27e+00

Pos logit contributions
 bumped+0.889
 stub+0.886
 accidentally+0.846
 tri+0.781
?"+0.772
Neg logit contributions
 generosity-1.001
 pleasure-0.982
 appreciation-0.964
 pride-0.918
ipp-0.916
Token mom
Token ID1995
Feature activation-4.44e-01

Pos logit contributions
 bumped+2.163
 accidentally+2.126
 stub+2.108
 landed+1.976
Please+1.908
Neg logit contributions
 generosity-2.226
 pleasure-2.221
 appreciation-2.155
 pride-2.055
 kindness-1.966
Token box
Token ID3091
Feature activation+2.55e-01

Pos logit contributions
 accidentally+2.225
 stub+2.111
 bumped+2.051
 tri+1.899
 suddenly+1.789
Neg logit contributions
 pleasure-3.374
 generosity-3.350
 choices-3.215
 appreciation-3.211
 pride-3.184
Token.
Token ID13
Feature activation+1.67e+00

Pos logit contributions
 stub+0.288
 bumped+0.282
 accidentally+0.269
?"+0.245
 tri+0.239
Neg logit contributions
 pleasure-0.564
 generosity-0.560
 appreciation-0.541
 pride-0.526
ipp-0.522
Token He
Token ID679
Feature activation+2.87e+00

Pos logit contributions
 bumped+3.418
 stub+3.382
 accidentally+3.225
 landed+3.167
 seek+3.099
Neg logit contributions
 pleasure-2.384
 generosity-2.164
 appreciation-2.116
 pride-2.075
ums-2.014
Token turned
Token ID2900
Feature activation+3.26e-01

Pos logit contributions
?"+3.563
 seek+3.534
 university+3.462
 stub+3.434
storm+3.407
Neg logit contributions
 pride-5.323
 appreciation-5.283
 generosity-5.140
 kinds-5.019
 pleasure-4.838
Token around
Token ID1088
Feature activation+1.00e+00

Pos logit contributions
 stub+0.471
 bumped+0.460
 accidentally+0.453
 seek+0.417
?"+0.412
Neg logit contributions
 pleasure-0.615
 generosity-0.601
 appreciation-0.589
 pride-0.586
ipp-0.557
Token and
Token ID290
Feature activation+3.74e+00

Pos logit contributions
 stub+1.785
 bumped+1.736
 accidentally+1.668
 landed+1.542
Ro+1.525
Neg logit contributions
 pleasure-1.664
 generosity-1.652
 appreciation-1.583
 pride-1.533
 kindness-1.481
Token saw
Token ID2497
Feature activation+3.24e-01

Pos logit contributions
Ro+5.244
 stub+5.149
storm+5.080
?"+5.026
Raw+4.842
Neg logit contributions
 pride-7.283
 appreciation-6.873
 generosity-6.762
 kindness-6.621
 pleasure-6.559
Token a
Token ID257
Feature activation+1.81e+00

Pos logit contributions
 stub+0.445
 bumped+0.441
 accidentally+0.419
 tri+0.382
?"+0.377
Neg logit contributions
 pleasure-0.658
 generosity-0.649
 appreciation-0.621
 pride-0.614
ipp-0.594
Token man
Token ID582
Feature activation+3.66e-01

Pos logit contributions
 bumped+2.999
 stub+2.964
 accidentally+2.799
Please+2.740
?"+2.715
Neg logit contributions
 pleasure-3.123
 generosity-2.987
 appreciation-2.960
 pride-2.764
 kindness-2.648
Token with
Token ID351
Feature activation-3.64e-02

Pos logit contributions
 stub+0.440
 bumped+0.435
 accidentally+0.418
?"+0.370
 tri+0.361
Neg logit contributions
 generosity-0.774
 pleasure-0.774
 appreciation-0.745
 pride-0.719
ipp-0.711
Token a
Token ID257
Feature activation+1.13e+00

Pos logit contributions
 pleasure+0.074
 generosity+0.072
 appreciation+0.069
 pride+0.068
ipp+0.066
Neg logit contributions
 stub-0.050
 bumped-0.050
 accidentally-0.047
?"-0.043
 tri-0.043
Token to
Token ID284
Feature activation-6.12e-01

Pos logit contributions
 stub+2.955
Ro+2.792
 bumped+2.787
Raw+2.587
jar+2.541
Neg logit contributions
 generosity-3.571
 pleasure-3.390
 appreciation-3.225
 kindness-3.182
 choices-3.151
Token travel
Token ID3067
Feature activation+2.12e+00

Pos logit contributions
 pleasure+1.210
 generosity+1.184
 appreciation+1.126
ipp+1.100
 pride+1.094
Neg logit contributions
 bumped-0.875
 stub-0.874
 accidentally-0.827
 tri-0.761
?"-0.757
Token to
Token ID284
Feature activation+1.09e+00

Pos logit contributions
 stub+3.686
 accidentally+3.396
 bumped+3.351
Ro+3.195
 tri+3.105
Neg logit contributions
 generosity-3.632
 pleasure-3.613
 kinds-3.488
 appreciation-3.448
 pride-3.390
Token a
Token ID257
Feature activation+1.88e+00

Pos logit contributions
 stub+1.762
 bumped+1.752
 accidentally+1.617
 tri+1.529
 landed+1.458
Neg logit contributions
 pleasure-2.048
 generosity-1.984
 appreciation-1.892
 pride-1.829
 kindness-1.786
Token far
Token ID1290
Feature activation+1.71e+00

Pos logit contributions
 bumped+3.575
 stub+3.538
 accidentally+3.222
 tri+3.145
 landed+3.099
Neg logit contributions
 pleasure-2.975
 generosity-2.733
 appreciation-2.609
ipp-2.467
 pride-2.443
Tokenaway
Token ID8272
Feature activation+3.74e+00

Pos logit contributions
 stub+2.852
 bumped+2.750
 accidentally+2.622
 tri+2.522
 Ach+2.431
Neg logit contributions
 pleasure-3.011
 generosity-2.947
 appreciation-2.898
 kinds-2.782
 sorts-2.740
Token place
Token ID1295
Feature activation+1.28e+00

Pos logit contributions
 stub+7.046
 Ach+6.691
 bumped+6.521
 accidentally+6.315
 tri+6.060
Neg logit contributions
 pleasure-6.870
 appreciation-6.865
 kinds-6.773
 generosity-6.538
 sorts-6.495
Token and
Token ID290
Feature activation+1.80e+00

Pos logit contributions
 stub+1.738
 bumped+1.634
 accidentally+1.606
Ro+1.507
 tri+1.428
Neg logit contributions
 pleasure-2.677
 generosity-2.643
 kinds-2.546
 appreciation-2.542
 sorts-2.466
Token brought
Token ID3181
Feature activation-7.37e-01

Pos logit contributions
 stub+3.123
 bumped+3.041
 accidentally+2.844
 tri+2.740
 Ach+2.722
Neg logit contributions
 pleasure-3.266
 generosity-3.213
 appreciation-2.998
 pride-2.918
 kindness-2.845
Token Spot
Token ID15899
Feature activation-1.32e+00

Pos logit contributions
 pleasure+1.462
 generosity+1.433
 appreciation+1.361
ipp+1.335
 pride+1.319
Neg logit contributions
 bumped-1.059
 stub-1.056
 accidentally-0.999
 tri-0.920
?"-0.916
Token along
Token ID1863
Feature activation+1.09e+00

Pos logit contributions
 pleasure+3.092
 generosity+3.015
 appreciation+2.882
 pride+2.830
ipp+2.827
Neg logit contributions
 bumped-1.475
 stub-1.459
 accidentally-1.349
?"-1.248
 tri-1.222
Token walking
Token ID6155
Feature activation+6.31e-01

Pos logit contributions
 bumped+1.868
 stub+1.836
Ro+1.799
Who+1.763
 accidentally+1.758
Neg logit contributions
 pleasure-2.359
 generosity-2.245
ipp-2.228
 pride-2.166
 appreciation-2.160
Token in
Token ID287
Feature activation+6.99e-01

Pos logit contributions
 stub+0.907
 bumped+0.887
 accidentally+0.851
 tri+0.769
 landed+0.752
Neg logit contributions
 pleasure-1.233
 generosity-1.213
 appreciation-1.156
 pride-1.141
ipp-1.137
Token the
Token ID262
Feature activation+1.28e+00

Pos logit contributions
 stub+1.141
 bumped+1.126
 accidentally+1.067
 tri+0.978
?"+0.963
Neg logit contributions
 pleasure-1.275
 generosity-1.223
 appreciation-1.190
 kinds-1.141
 pride-1.140
Token park
Token ID3952
Feature activation+1.18e+00

Pos logit contributions
 bumped+1.611
 stub+1.545
 accidentally+1.448
 tri+1.309
?"+1.296
Neg logit contributions
 pleasure-2.807
 generosity-2.669
 appreciation-2.648
 pride-2.597
ipp-2.591
Token when
Token ID618
Feature activation+2.32e+00

Pos logit contributions
 stub+1.601
 bumped+1.557
 accidentally+1.443
Ro+1.410
 landed+1.348
Neg logit contributions
 pleasure-2.373
 generosity-2.295
 appreciation-2.200
ipp-2.199
 kinds-2.197
Token it
Token ID340
Feature activation+3.74e+00

Pos logit contributions
 stub+3.451
 bumped+3.368
Ro+3.157
?"+3.079
 seek+3.066
Neg logit contributions
 pleasure-4.330
 generosity-4.023
 pride-3.805
 appreciation-3.724
 kindness-3.710
Token saw
Token ID2497
Feature activation+1.42e+00

Pos logit contributions
?"+4.486
 seek+4.315
Ro+4.233
 stub+4.196
 university+4.106
Neg logit contributions
ipp-7.123
ealing-6.803
apa-6.666
 pleasure-6.598
 appreciation-6.446
Token a
Token ID257
Feature activation+1.27e+00

Pos logit contributions
 stub+2.415
 bumped+2.378
 accidentally+2.200
 seek+2.117
 tri+2.091
Neg logit contributions
 pleasure-2.484
 generosity-2.448
 appreciation-2.362
 pride-2.295
 kinds-2.263
Token little
Token ID1310
Feature activation+4.52e-01

Pos logit contributions
 bumped+2.161
 stub+2.128
 accidentally+1.980
 tri+1.874
?"+1.874
Neg logit contributions
 pleasure-2.187
 generosity-2.106
 appreciation-2.002
ipp-1.874
 pride-1.857
Token bird
Token ID6512
Feature activation+1.59e+00

Pos logit contributions
 bumped+0.678
 stub+0.671
 accidentally+0.634
 tri+0.582
?"+0.577
Neg logit contributions
 pleasure-0.859
 generosity-0.841
 appreciation-0.794
 pride-0.763
 choices-0.763
Token.
Token ID13
Feature activation+1.05e-01

Pos logit contributions
 bumped+2.105
 stub+1.998
 accidentally+1.940
Ro+1.812
 seek+1.796
Neg logit contributions
 generosity-3.032
 appreciation-2.982
 pleasure-2.973
 kinds-2.901
ipp-2.887
Token want
Token ID765
Feature activation+1.54e+00

Pos logit contributions
 pleasure+0.132
 generosity+0.126
 appreciation+0.119
 pride+0.117
ipp+0.115
Neg logit contributions
 stub-0.151
 bumped-0.151
 accidentally-0.144
 tri-0.135
 landed-0.134
Token to
Token ID284
Feature activation+1.04e+00

Pos logit contributions
 stub+2.321
 bumped+2.288
 accidentally+2.116
 tri+2.000
 seek+1.909
Neg logit contributions
 pleasure-3.002
 generosity-2.972
 appreciation-2.833
 pride-2.802
 kindness-2.756
Token play
Token ID711
Feature activation+2.37e+00

Pos logit contributions
 bumped+1.360
 stub+1.360
 accidentally+1.251
 tri+1.155
 landed+1.095
Neg logit contributions
 pleasure-2.194
 generosity-2.110
 appreciation-2.047
 pride-2.030
ipp-2.025
Token with
Token ID351
Feature activation+2.14e+00

Pos logit contributions
 stub+4.049
 bumped+3.963
 accidentally+3.513
 tri+3.513
 rang+3.254
Neg logit contributions
 pleasure-4.416
 generosity-4.235
 appreciation-4.117
 pride-3.993
 kindness-3.946
Token me
Token ID502
Feature activation+2.32e+00

Pos logit contributions
 stub+3.066
 bumped+3.002
 accidentally+2.566
 tri+2.529
jar+2.491
Neg logit contributions
 pleasure-4.230
 generosity-4.169
ipp-4.009
 appreciation-4.008
 pride-4.003
Token now
Token ID783
Feature activation+3.73e+00

Pos logit contributions
 stub+2.260
 bumped+2.141
 accidentally+1.842
Who+1.758
Ro+1.757
Neg logit contributions
 pleasure-5.905
ipp-5.796
 generosity-5.767
 appreciation-5.642
 kinds-5.536
Token?"
Token ID1701
Feature activation-2.17e-01

Pos logit contributions
 stub+4.427
 bumped+4.114
Ro+3.421
 tri+3.384
Raw+3.230
Neg logit contributions
 pleasure-9.478
ipp-8.993
 generosity-8.808
ish-8.639
 appreciation-8.531
Token The
Token ID383
Feature activation+1.18e+00

Pos logit contributions
 pleasure+0.383
 generosity+0.374
 appreciation+0.352
ipp+0.342
 pride+0.342
Neg logit contributions
 stub-0.359
 bumped-0.359
 accidentally-0.339
 tri-0.319
?"-0.312
Token gr
Token ID1036
Feature activation-1.15e+00

Pos logit contributions
 bumped+1.596
 stub+1.555
 accidentally+1.457
 tri+1.355
?"+1.345
Neg logit contributions
 generosity-2.419
 pleasure-2.405
 appreciation-2.263
 pride-2.208
 kindness-2.197
Tokenumpy
Token ID32152
Feature activation+5.27e-01

Pos logit contributions
 pleasure+1.783
 generosity+1.712
 appreciation+1.593
 pride+1.539
ipp+1.520
Neg logit contributions
 stub-2.188
 bumped-2.181
 accidentally-2.103
?"-2.004
 tri-1.974
Token cat
Token ID3797
Feature activation+7.32e-01

Pos logit contributions
 bumped+0.607
 stub+0.606
 accidentally+0.556
 tri+0.501
 landed+0.499
Neg logit contributions
 generosity-1.168
 pleasure-1.165
 appreciation-1.101
 choices-1.075
 pride-1.073
Token Can
Token ID1680
Feature activation+1.27e-01

Pos logit contributions
 pleasure+0.032
 generosity+0.031
 appreciation+0.029
 pride+0.029
ipp+0.029
Neg logit contributions
 bumped-0.016
 stub-0.015
 accidentally-0.014
 tri-0.013
 landed-0.013
Token I
Token ID314
Feature activation+3.64e-01

Pos logit contributions
 bumped+0.151
 stub+0.149
 accidentally+0.138
 tri+0.125
 landed+0.124
Neg logit contributions
 pleasure-0.281
 generosity-0.277
 appreciation-0.262
ipp-0.259
 pride-0.256
Token play
Token ID711
Feature activation+2.19e+00

Pos logit contributions
 bumped+0.602
 stub+0.597
 accidentally+0.569
 tri+0.532
 landed+0.524
Neg logit contributions
 pleasure-0.650
 generosity-0.620
 appreciation-0.595
 pride-0.574
ipp-0.571
Token with
Token ID351
Feature activation+1.40e-01

Pos logit contributions
 bumped+3.617
 stub+3.591
 accidentally+3.217
 tri+3.204
 landed+3.006
Neg logit contributions
 pleasure-4.170
 generosity-4.012
 appreciation-3.825
 kindness-3.725
 pride-3.685
Token you
Token ID345
Feature activation+3.06e+00

Pos logit contributions
 stub+0.202
 bumped+0.202
 accidentally+0.187
 tri+0.174
 landed+0.169
Neg logit contributions
 pleasure-0.276
 generosity-0.271
 appreciation-0.257
ipp-0.253
 pride-0.252
Token later
Token ID1568
Feature activation+3.72e+00

Pos logit contributions
 stub+4.391
 bumped+4.131
Ro+3.789
Raw+3.579
Who+3.495
Neg logit contributions
 pleasure-6.606
 generosity-6.515
ipp-6.249
 kindness-6.140
 appreciation-6.017
Token?"
Token ID1701
Feature activation+2.66e-01

Pos logit contributions
 stub+5.002
 bumped+4.588
Ro+4.224
 landed+4.091
Raw+3.998
Neg logit contributions
 pleasure-9.149
ipp-8.860
 generosity-8.251
ums-7.885
 appreciation-7.881
Token Mia
Token ID32189
Feature activation-1.30e+00

Pos logit contributions
 bumped+0.378
 stub+0.377
 accidentally+0.351
 tri+0.327
 landed+0.327
Neg logit contributions
 pleasure-0.531
 generosity-0.522
 appreciation-0.504
 pride-0.484
ipp-0.483
Token says
Token ID1139
Feature activation-1.37e+00

Pos logit contributions
 pleasure+2.352
 generosity+2.267
 appreciation+2.122
ipp+2.070
 pride+2.050
Neg logit contributions
 bumped-2.145
 stub-2.112
 accidentally-2.035
?"-1.961
 tri-1.918
Token,
Token ID11
Feature activation+1.14e+00

Pos logit contributions
 pleasure+3.208
 generosity+3.135
 appreciation+3.025
 pride+2.942
 kinds+2.938
Neg logit contributions
 bumped-1.486
 stub-1.476
 accidentally-1.372
?"-1.238
 tri-1.235
Token "
Token ID366
Feature activation-4.38e-01

Pos logit contributions
 stub+1.707
 bumped+1.699
 accidentally+1.581
Ro+1.484
?"+1.483
Neg logit contributions
 pleasure-2.126
 generosity-2.095
ipp-1.930
 appreciation-1.924
 pride-1.922
Token playing
Token ID2712
Feature activation+2.00e+00

Pos logit contributions
 stub+1.131
 bumped+1.112
 accidentally+1.018
Ro+0.972
 landed+0.970
Neg logit contributions
 pleasure-1.511
 generosity-1.496
 appreciation-1.429
 kinds-1.399
 choices-1.399
Token with
Token ID351
Feature activation+4.59e-01

Pos logit contributions
 stub+3.091
 bumped+3.017
 accidentally+2.645
 landed+2.645
 tri+2.578
Neg logit contributions
 generosity-3.539
 pleasure-3.516
 appreciation-3.445
 kinds-3.408
 sorts-3.385
Token her
Token ID607
Feature activation-2.22e-01

Pos logit contributions
 stub+0.630
 bumped+0.626
 accidentally+0.568
 tri+0.544
?"+0.536
Neg logit contributions
 pleasure-0.915
 generosity-0.904
 appreciation-0.875
 pride-0.857
ipp-0.851
Token doll
Token ID3654
Feature activation+1.29e+00

Pos logit contributions
 pleasure+0.384
 generosity+0.375
 appreciation+0.354
ipp+0.343
 pride+0.343
Neg logit contributions
 bumped-0.376
 stub-0.373
 accidentally-0.353
 tri-0.334
?"-0.330
Token when
Token ID618
Feature activation+2.18e+00

Pos logit contributions
 bumped+1.793
 stub+1.792
Ro+1.604
Raw+1.562
 accidentally+1.554
Neg logit contributions
 generosity-2.519
 pleasure-2.501
ipp-2.368
 appreciation-2.363
 choices-2.317
Token it
Token ID340
Feature activation+3.71e+00

Pos logit contributions
 stub+2.923
 bumped+2.888
Ro+2.754
?"+2.706
 accidentally+2.568
Neg logit contributions
 pleasure-4.420
 generosity-4.276
 pride-4.112
 appreciation-4.087
 kindness-3.919
Token accidentally
Token ID14716
Feature activation+2.78e+00

Pos logit contributions
Who+5.292
Ro+5.104
 spy+5.009
Please+5.008
?"+4.967
Neg logit contributions
ipp-6.504
 kinds-5.925
 appreciation-5.825
 pride-5.746
 generosity-5.741
Token fell
Token ID3214
Feature activation+1.64e+00

Pos logit contributions
Please+2.665
Ro+2.585
 seek+2.552
 stub+2.519
Who+2.503
Neg logit contributions
 pleasure-6.201
 appreciation-6.082
 pride-6.054
 generosity-6.024
ipp-5.909
Token out
Token ID503
Feature activation+1.01e+00

Pos logit contributions
 stub+2.694
 bumped+2.631
 accidentally+2.517
 seek+2.446
Who+2.406
Neg logit contributions
 pleasure-2.731
ipp-2.680
 pride-2.600
 appreciation-2.596
 generosity-2.573
Token of
Token ID286
Feature activation+1.86e+00

Pos logit contributions
 stub+2.285
 bumped+2.261
 accidentally+2.214
 tri+2.093
Ro+2.081
Neg logit contributions
 pleasure-1.089
 appreciation-1.075
 generosity-1.071
 kinds-1.021
 pride-1.011
Token her
Token ID607
Feature activation+8.09e-01

Pos logit contributions
 stub+2.906
 bumped+2.853
 accidentally+2.670
Ro+2.634
 tri+2.583
Neg logit contributions
 pleasure-3.584
 pride-3.412
 appreciation-3.375
 generosity-3.355
 kinds-3.120
Token then
Token ID788
Feature activation+1.75e+00

Pos logit contributions
Ro+4.848
 stub+4.840
Raw+4.746
 bumped+4.630
?"+4.314
Neg logit contributions
 pleasure-5.465
 generosity-5.078
 pride-4.930
ipp-4.835
 kindness-4.784
Token,
Token ID11
Feature activation+3.00e+00

Pos logit contributions
 stub+2.900
 bumped+2.752
Ro+2.637
Please+2.534
 accidentally+2.500
Neg logit contributions
 pleasure-2.950
 generosity-2.867
 appreciation-2.734
 pride-2.704
 choices-2.655
Token she
Token ID673
Feature activation+3.19e+00

Pos logit contributions
 stub+4.597
Ro+4.481
 bumped+4.429
Raw+4.388
?"+4.195
Neg logit contributions
 pleasure-5.363
 generosity-5.074
 pride-4.958
 kindness-4.891
 appreciation-4.718
Token fell
Token ID3214
Feature activation+2.55e+00

Pos logit contributions
Ro+3.579
jar+3.098
 seek+3.000
Raw+2.973
 Ach+2.863
Neg logit contributions
ipp-6.934
ealing-6.798
 pride-6.689
 pleasure-6.511
ish-6.466
Token off
Token ID572
Feature activation+2.03e+00

Pos logit contributions
 stub+4.272
 accidentally+4.209
Who+4.200
 bumped+4.165
Ro+4.073
Neg logit contributions
ipp-4.123
 pleasure-3.986
 pride-3.936
 appreciation-3.782
 kinds-3.735
Token and
Token ID290
Feature activation+3.71e+00

Pos logit contributions
 stub+3.750
 bumped+3.540
 accidentally+3.482
Ro+3.348
 tri+3.245
Neg logit contributions
 pleasure-3.040
 pride-3.007
 generosity-3.004
 appreciation-2.962
 kinds-2.943
Token hurt
Token ID5938
Feature activation+1.30e+00

Pos logit contributions
Ro+4.786
 spy+4.537
Raw+4.493
Who+4.410
storm+4.368
Neg logit contributions
 pride-7.416
 pleasure-6.851
 appreciation-6.701
 kinds-6.668
 generosity-6.637
Token her
Token ID607
Feature activation+8.76e-01

Pos logit contributions
 bumped+2.053
 stub+2.045
 accidentally+1.968
?"+1.782
Ro+1.754
Neg logit contributions
 pleasure-2.328
 generosity-2.274
 appreciation-2.269
 pride-2.240
ipp-2.230
Token knee
Token ID10329
Feature activation+1.72e+00

Pos logit contributions
 bumped+1.525
 accidentally+1.476
 stub+1.456
Ro+1.396
 tri+1.339
Neg logit contributions
 pleasure-1.503
 generosity-1.459
 appreciation-1.395
 choices-1.341
 pride-1.333
Token.
Token ID13
Feature activation+1.64e+00

Pos logit contributions
 stub+2.213
 bumped+2.207
Ro+2.165
 accidentally+2.152
Who+2.075
Neg logit contributions
 pleasure-3.252
 generosity-3.241
 appreciation-3.161
ipp-3.150
 choices-3.129
Token
Token ID198
Feature activation+1.28e+00

Pos logit contributions
 bumped+3.126
 stub+2.918
 seek+2.908
 accidentally+2.798
 tri+2.783
Neg logit contributions
 pleasure-2.502
 pride-2.219
 generosity-2.213
ipp-2.213
 appreciation-2.211
Token all
Token ID477
Feature activation-1.63e+00

Pos logit contributions
 stub+2.046
 bumped+1.922
 accidentally+1.805
Ro+1.705
 tri+1.702
Neg logit contributions
 pleasure-2.220
 generosity-2.122
 appreciation-2.037
 pride-1.958
 choices-1.925
Token of
Token ID286
Feature activation+3.06e-01

Pos logit contributions
 pleasure+3.424
 generosity+3.290
ipp+3.125
 appreciation+3.095
izza+3.051
Neg logit contributions
 bumped-2.274
 stub-2.270
 accidentally-2.150
?"-2.107
 landed-2.019
Token a
Token ID257
Feature activation+8.73e-01

Pos logit contributions
 stub+0.465
 bumped+0.459
 accidentally+0.428
 tri+0.398
?"+0.384
Neg logit contributions
 pleasure-0.594
 generosity-0.582
 appreciation-0.561
 pride-0.542
 kinds-0.527
Token sudden
Token ID4802
Feature activation+1.39e+00

Pos logit contributions
 bumped+1.422
 stub+1.407
 accidentally+1.330
?"+1.327
 landed+1.281
Neg logit contributions
 pleasure-1.499
 generosity-1.484
 appreciation-1.437
 pride-1.358
 kindness-1.288
Token,
Token ID11
Feature activation+1.74e+00

Pos logit contributions
 stub+2.314
 bumped+2.136
 accidentally+2.048
Please+1.969
Ro+1.962
Neg logit contributions
 pleasure-2.429
 generosity-2.382
 appreciation-2.251
 pride-2.200
 kindness-2.147
Token it
Token ID340
Feature activation+3.70e+00

Pos logit contributions
 stub+2.931
 bumped+2.778
 accidentally+2.545
Ro+2.461
 seek+2.417
Neg logit contributions
 pleasure-3.189
 generosity-3.018
 appreciation-2.855
 kindness-2.778
 pride-2.765
Token heard
Token ID2982
Feature activation+1.36e+00

Pos logit contributions
Ro+4.454
 university+4.421
?"+4.351
Who+4.337
 seeker+4.265
Neg logit contributions
 pride-6.999
 appreciation-6.996
 generosity-6.777
ealing-6.482
apa-6.443
Token a
Token ID257
Feature activation+6.29e-01

Pos logit contributions
 stub+2.296
 bumped+2.240
 accidentally+2.105
 seek+2.016
 tri+1.942
Neg logit contributions
 pleasure-2.453
 generosity-2.409
 appreciation-2.384
 pride-2.304
 kindness-2.230
Token loud
Token ID7812
Feature activation+2.41e-01

Pos logit contributions
 stub+0.904
 bumped+0.901
 accidentally+0.844
 tri+0.768
?"+0.765
Neg logit contributions
 pleasure-1.234
 generosity-1.234
 appreciation-1.191
 pride-1.118
 choices-1.103
Token noise
Token ID7838
Feature activation+1.90e+00

Pos logit contributions
 stub+0.281
 bumped+0.276
 accidentally+0.260
 tri+0.229
?"+0.221
Neg logit contributions
 pleasure-0.532
 generosity-0.528
 appreciation-0.508
 pride-0.489
ipp-0.488
Token.
Token ID13
Feature activation+4.81e-01

Pos logit contributions
 stub+2.924
 accidentally+2.730
 bumped+2.725
 seek+2.422
Ro+2.410
Neg logit contributions
 generosity-3.430
 kinds-3.360
 pleasure-3.350
 appreciation-3.325
 sorts-3.277
Token."
Token ID526
Feature activation-2.21e-01

Pos logit contributions
 stub+0.146
 bumped+0.145
 accidentally+0.136
 tri+0.125
 landed+0.120
Neg logit contributions
 pleasure-0.216
 generosity-0.214
 appreciation-0.205
 pride-0.199
ipp-0.199
Token
Token ID198
Feature activation+1.00e-01

Pos logit contributions
 pleasure+0.464
 generosity+0.453
 appreciation+0.432
ipp+0.425
 pride+0.424
Neg logit contributions
 bumped-0.295
 stub-0.291
 accidentally-0.273
 tri-0.250
 landed-0.246
Token
Token ID198
Feature activation-5.10e-01

Pos logit contributions
 bumped+0.141
 stub+0.137
 accidentally+0.131
 tri+0.120
 landed+0.115
Neg logit contributions
 pleasure-0.202
 generosity-0.199
 appreciation-0.191
ipp-0.187
 pride-0.186
TokenThey
Token ID2990
Feature activation-6.72e-01

Pos logit contributions
 pleasure+1.121
 generosity+1.102
 appreciation+1.054
ipp+1.033
 pride+1.027
Neg logit contributions
 bumped-0.625
 stub-0.623
 accidentally-0.584
 tri-0.528
?"-0.523
Token shared
Token ID4888
Feature activation-1.97e+00

Pos logit contributions
 pleasure+1.392
 generosity+1.362
 appreciation+1.298
ipp+1.266
 pride+1.259
Neg logit contributions
 bumped-0.904
 stub-0.901
 accidentally-0.852
 tri-0.780
?"-0.771
Token Ben
Token ID3932
Feature activation-4.87e+00

Pos logit contributions
 pleasure+4.218
 generosity+4.012
ipp+3.742
 appreciation+3.726
 pride+3.667
Neg logit contributions
 bumped-2.688
 stub-2.542
 accidentally-2.525
?"-2.393
 landed-2.352
Token's
Token ID338
Feature activation-2.94e+00

Pos logit contributions
 pleasure+14.113
umm+12.415
 generosity+12.181
ipp+11.934
omp+11.846
Neg logit contributions
 bumped-5.673
 landed-5.254
 saw-5.184
 broke-5.125
 heard-5.095
Token ice
Token ID4771
Feature activation-3.14e+00

Pos logit contributions
 pleasure+5.708
ipp+5.590
 generosity+5.525
ums+5.246
 pride+5.211
Neg logit contributions
 bumped-4.310
 stub-4.185
?"-4.007
 accidentally-3.870
 landed-3.793
Token-
Token ID12
Feature activation-3.37e+00

Pos logit contributions
 pleasure+7.656
 generosity+7.000
 pride+6.866
 appreciation+6.698
 equal+6.363
Neg logit contributions
 bumped-3.909
?"-3.635
 stub-3.381
 landed-3.322
 hit-3.280
Tokencream
Token ID36277
Feature activation-1.76e+00

Pos logit contributions
 pleasure+5.843
 generosity+5.212
 appreciation+5.086
 pride+4.852
 decisions+4.501
Neg logit contributions
 bumped-6.944
 stub-6.350
 landed-6.221
 accidentally-6.121
 broke-6.020
Token and
Token ID290
Feature activation-5.58e-01

Pos logit contributions
 pleasure+3.956
 generosity+3.770
 appreciation+3.654
 pride+3.561
ipp+3.542
Neg logit contributions
 bumped-2.090
 stub-1.971
 accidentally-1.908
?"-1.807
 suddenly-1.776
Token!
Token ID0
Feature activation-8.00e-01

Pos logit contributions
 pleasure+1.706
 generosity+1.674
 appreciation+1.608
ipp+1.578
 pride+1.570
Neg logit contributions
 bumped-0.712
 stub-0.709
 accidentally-0.656
 tri-0.580
?"-0.572
Token
Token ID198
Feature activation+3.38e-01

Pos logit contributions
 pleasure+1.335
 generosity+1.311
 appreciation+1.232
ipp+1.195
 pride+1.187
Neg logit contributions
 bumped-1.396
 stub-1.395
 accidentally-1.343
 tri-1.250
?"-1.243
Token
Token ID198
Feature activation-4.68e-01

Pos logit contributions
 bumped+0.479
 stub+0.477
 accidentally+0.444
 tri+0.417
 landed+0.401
Neg logit contributions
 pleasure-0.664
 generosity-0.643
 appreciation-0.632
ipp-0.610
 pride-0.603
TokenThey
Token ID2990
Feature activation-9.92e-01

Pos logit contributions
 pleasure+0.909
 generosity+0.891
 appreciation+0.849
ipp+0.828
 pride+0.823
Neg logit contributions
 stub-0.688
 bumped-0.688
 accidentally-0.649
 tri-0.600
?"-0.597
Token collected
Token ID7723
Feature activation-2.67e+00

Pos logit contributions
 pleasure+2.276
 generosity+2.218
 appreciation+2.118
ipp+2.079
 pride+2.058
Neg logit contributions
 bumped-1.153
 stub-1.127
 accidentally-1.079
 tri-0.967
 landed-0.949
Token all
Token ID477
Feature activation-4.86e+00

Pos logit contributions
 pleasure+5.805
 generosity+5.640
ipp+5.496
ums+5.408
izza+5.404
Neg logit contributions
 bumped-3.514
 landed-3.266
 accidentally-3.179
 stub-3.100
 suddenly-3.032
Token the
Token ID262
Feature activation-1.97e+00

Pos logit contributions
 generosity+15.536
 pleasure+15.530
ealing+15.454
izza+15.452
ums+14.993
Neg logit contributions
 landed-3.116
 bumped-2.680
 saw-2.626
?"-2.618
 behind-2.544
Token treasure
Token ID14068
Feature activation-1.73e+00

Pos logit contributions
 pleasure+3.421
 generosity+3.327
ipp+3.290
ums+3.162
 appreciation+3.128
Neg logit contributions
 bumped-3.357
 stub-3.229
 accidentally-3.168
?"-3.023
 landed-3.000
Token and
Token ID290
Feature activation-8.64e-01

Pos logit contributions
 pleasure+3.860
 generosity+3.701
 appreciation+3.587
ipp+3.578
 pride+3.512
Neg logit contributions
 bumped-2.081
 stub-1.984
 accidentally-1.948
 landed-1.798
?"-1.777
Token stored
Token ID8574
Feature activation-1.69e+00

Pos logit contributions
 pleasure+1.921
 generosity+1.873
 appreciation+1.796
ipp+1.772
 pride+1.748
Neg logit contributions
 bumped-1.042
 stub-1.028
 accidentally-0.979
 tri-0.880
 landed-0.866
Token it
Token ID340
Feature activation-1.20e+00

Pos logit contributions
 pleasure+3.787
 generosity+3.749
ipp+3.580
 appreciation+3.532
ums+3.511
Neg logit contributions
 bumped-1.995
 stub-1.845
 accidentally-1.825
 landed-1.713
?"-1.658
Token the
Token ID262
Feature activation-1.73e+00

Pos logit contributions
 pleasure+7.693
 generosity+7.655
izza+7.113
ipp+7.108
ums+7.103
Neg logit contributions
 bumped-3.011
?"-2.850
 stub-2.800
 accidentally-2.731
 landed-2.582
Token luggage
Token ID31488
Feature activation-5.09e-01

Pos logit contributions
 pleasure+3.229
 generosity+3.176
ipp+3.023
 appreciation+2.968
ums+2.912
Neg logit contributions
 bumped-2.771
 stub-2.667
 accidentally-2.599
?"-2.431
 landed-2.419
Token.
Token ID13
Feature activation-1.32e+00

Pos logit contributions
 pleasure+1.199
 generosity+1.179
 appreciation+1.130
ipp+1.108
 pride+1.103
Neg logit contributions
 stub-0.531
 bumped-0.530
 accidentally-0.491
 tri-0.436
?"-0.430
Token They
Token ID1119
Feature activation-1.69e+00

Pos logit contributions
 pleasure+2.248
 generosity+2.247
 appreciation+2.109
ipp+2.059
 pride+2.018
Neg logit contributions
 bumped-2.213
 stub-2.193
 accidentally-2.158
?"-1.989
 tri-1.969
Token added
Token ID2087
Feature activation-2.39e+00

Pos logit contributions
 pleasure+3.945
 generosity+3.807
 decisions+3.638
 appreciation+3.620
 choices+3.616
Neg logit contributions
 bumped-2.079
 accidentally-1.962
 stub-1.937
 tri-1.740
 landed-1.724
Token all
Token ID477
Feature activation-4.81e+00

Pos logit contributions
 pleasure+5.297
 generosity+5.194
ipp+4.949
 appreciation+4.868
ums+4.842
Neg logit contributions
 bumped-3.004
 accidentally-2.704
 landed-2.684
 stub-2.637
?"-2.624
Token the
Token ID262
Feature activation-2.45e+00

Pos logit contributions
 generosity+15.265
 pleasure+14.900
ums+14.841
ealing+14.616
izza+14.245
Neg logit contributions
?"-3.176
 bumped-3.116
 hit-2.795
 landed-2.792
 saw-2.641
Token luggage
Token ID31488
Feature activation-2.20e+00

Pos logit contributions
 pleasure+4.759
 generosity+4.692
ipp+4.642
ums+4.468
izza+4.351
Neg logit contributions
 bumped-3.838
 stub-3.524
 accidentally-3.493
 landed-3.357
?"-3.265
Token to
Token ID284
Feature activation-4.86e-01

Pos logit contributions
 pleasure+5.368
 generosity+5.167
 appreciation+5.023
ipp+4.948
 pride+4.907
Neg logit contributions
 bumped-2.341
 accidentally-2.112
 stub-2.078
 landed-1.886
 suddenly-1.879
Token the
Token ID262
Feature activation-5.46e-01

Pos logit contributions
 pleasure+0.964
 generosity+0.943
 appreciation+0.897
ipp+0.873
 pride+0.872
Neg logit contributions
 stub-0.699
 bumped-0.697
 accidentally-0.658
 tri-0.607
?"-0.601
Token truck
Token ID7779
Feature activation-1.18e+00

Pos logit contributions
 pleasure+1.158
 generosity+1.134
 appreciation+1.083
ipp+1.058
 pride+1.054
Neg logit contributions
 bumped-0.714
 stub-0.714
 accidentally-0.671
 tri-0.612
?"-0.608
TokenBen
Token ID11696
Feature activation-7.41e-01

Pos logit contributions
 pleasure+0.253
 generosity+0.251
 appreciation+0.243
ipp+0.238
 pride+0.236
Neg logit contributions
 bumped-0.101
 stub-0.099
 accidentally-0.090
 tri-0.079
 landed-0.078
Token and
Token ID290
Feature activation+1.63e+00

Pos logit contributions
 pleasure+1.481
 generosity+1.445
 appreciation+1.374
ipp+1.342
 pride+1.336
Neg logit contributions
 bumped-1.050
 stub-1.047
 accidentally-0.993
 tri-0.915
?"-0.909
Token Lily
Token ID20037
Feature activation-1.33e+00

Pos logit contributions
 bumped+1.695
 stub+1.660
?"+1.409
 landed+1.405
 accidentally+1.392
Neg logit contributions
 generosity-3.847
 pleasure-3.842
 appreciation-3.707
 pride-3.616
 kinds-3.501
Token hugged
Token ID41130
Feature activation-5.10e-01

Pos logit contributions
 pleasure+2.640
 generosity+2.550
 appreciation+2.405
ipp+2.339
 choices+2.337
Neg logit contributions
 bumped-1.963
 stub-1.923
 accidentally-1.849
 tri-1.727
 landed-1.695
Token grand
Token ID4490
Feature activation-2.57e+00

Pos logit contributions
 pleasure+0.938
 generosity+0.917
 appreciation+0.869
ipp+0.845
 pride+0.843
Neg logit contributions
 bumped-0.807
 stub-0.807
 accidentally-0.766
 tri-0.711
?"-0.705
Tokenpa
Token ID8957
Feature activation-4.65e+00

Pos logit contributions
 pleasure+4.109
 generosity+3.656
ipp+3.519
 appreciation+3.438
izza+3.344
Neg logit contributions
 bumped-4.762
?"-4.747
 stub-4.641
 accidentally-4.442
 tri-4.375
Token.
Token ID13
Feature activation-5.16e-01

Pos logit contributions
 pleasure+14.244
ipp+13.294
 generosity+13.109
omp+13.100
ums+13.076
Neg logit contributions
 behind-2.960
 bumped-2.869
 saw-2.682
 under-2.679
?"-2.634
Token They
Token ID1119
Feature activation-8.25e-01

Pos logit contributions
 pleasure+0.802
 generosity+0.779
 appreciation+0.731
ipp+0.707
 pride+0.704
Neg logit contributions
 bumped-0.963
 stub-0.963
 accidentally-0.922
 tri-0.866
?"-0.860
Token said
Token ID531
Feature activation-1.99e+00

Pos logit contributions
 pleasure+1.722
 generosity+1.681
 appreciation+1.601
ipp+1.566
 pride+1.553
Neg logit contributions
 bumped-1.101
 stub-1.093
 accidentally-1.038
 tri-0.949
?"-0.938
Token,
Token ID11
Feature activation-4.49e-01

Pos logit contributions
 pleasure+4.656
 generosity+4.502
 appreciation+4.313
ipp+4.307
 kinds+4.280
Neg logit contributions
 bumped-2.207
 stub-2.113
 accidentally-2.033
?"-1.919
 suddenly-1.831
Token "
Token ID366
Feature activation-1.57e+00

Pos logit contributions
 pleasure+0.962
 generosity+0.944
 appreciation+0.900
ipp+0.881
 pride+0.876
Neg logit contributions
 bumped-0.569
 stub-0.568
 accidentally-0.532
 tri-0.485
?"-0.479
Token for
Token ID329
Feature activation-1.60e-01

Pos logit contributions
 pleasure+0.833
 generosity+0.818
 appreciation+0.779
ipp+0.762
 pride+0.757
Neg logit contributions
 stub-0.534
 bumped-0.534
 accidentally-0.502
 tri-0.460
?"-0.448
Token you
Token ID345
Feature activation+2.25e-01

Pos logit contributions
 pleasure+0.324
 generosity+0.318
 appreciation+0.302
ipp+0.294
 pride+0.294
Neg logit contributions
 stub-0.225
 bumped-0.224
 accidentally-0.209
 tri-0.193
 landed-0.187
Token."
Token ID526
Feature activation-1.72e+00

Pos logit contributions
 stub+0.272
 bumped+0.270
 accidentally+0.249
 tri+0.227
 landed+0.220
Neg logit contributions
 pleasure-0.503
 generosity-0.490
 appreciation-0.468
ipp-0.457
 pride-0.456
Token Bill
Token ID3941
Feature activation-2.11e+00

Pos logit contributions
 pleasure+3.714
 generosity+3.677
 appreciation+3.477
 pride+3.423
ipp+3.404
Neg logit contributions
 bumped-2.101
 stub-2.070
 accidentally-2.037
?"-1.886
 tri-1.825
Token gave
Token ID2921
Feature activation-2.28e+00

Pos logit contributions
 pleasure+5.193
 generosity+4.921
 appreciation+4.722
 decisions+4.705
 choices+4.691
Neg logit contributions
 bumped-2.382
 accidentally-2.303
 stub-2.259
 landed-2.061
 tri-2.008
Token Bob
Token ID5811
Feature activation-4.63e+00

Pos logit contributions
 pleasure+4.699
 generosity+4.601
izza+4.280
 appreciation+4.257
ipp+4.207
Neg logit contributions
 bumped-3.305
 accidentally-3.075
 stub-3.056
 landed-2.984
?"-2.898
Token the
Token ID262
Feature activation-1.90e+00

Pos logit contributions
 pleasure+15.007
 generosity+13.769
 decisions+13.424
 appreciation+13.364
ums+13.326
Neg logit contributions
 bumped-3.274
 saw-2.833
 landed-2.809
 heard-2.639
 accidentally-2.589
Token bread
Token ID8509
Feature activation-3.36e+00

Pos logit contributions
 pleasure+3.027
 generosity+2.946
ipp+2.876
 appreciation+2.823
ums+2.775
Neg logit contributions
 stub-3.369
 bumped-3.359
 accidentally-3.249
?"-3.134
 landed-3.032
Token,
Token ID11
Feature activation-2.42e+00

Pos logit contributions
 pleasure+8.827
 generosity+8.198
ipp+7.993
 appreciation+7.945
 decisions+7.883
Neg logit contributions
 bumped-3.179
 stub-3.038
 accidentally-3.025
 landed-2.827
 suddenly-2.793
Token and
Token ID290
Feature activation-1.36e+00

Pos logit contributions
 pleasure+5.256
 generosity+4.952
ipp+4.874
 appreciation+4.862
 decisions+4.841
Neg logit contributions
 accidentally-3.252
 bumped-3.200
 stub-3.160
 landed-2.835
 tri-2.829
Token Bob
Token ID5811
Feature activation-2.42e+00

Pos logit contributions
 pleasure+3.042
 generosity+2.931
 appreciation+2.813
ipp+2.791
 pride+2.765
Neg logit contributions
 bumped-1.661
 stub-1.609
 accidentally-1.578
 landed-1.414
 tri-1.414
Token.
Token ID13
Feature activation-6.07e-01

Pos logit contributions
 pleasure+0.621
 generosity+0.606
 appreciation+0.576
ipp+0.563
 pride+0.557
Neg logit contributions
 stub-0.540
 bumped-0.540
 accidentally-0.512
 tri-0.476
?"-0.469
Token
Token ID198
Feature activation-7.25e-02

Pos logit contributions
 pleasure+1.180
 generosity+1.155
 appreciation+1.097
ipp+1.071
 pride+1.065
Neg logit contributions
 bumped-0.890
 stub-0.890
 accidentally-0.844
 tri-0.778
?"-0.773
Token
Token ID198
Feature activation-3.78e-01

Pos logit contributions
 pleasure+0.150
 generosity+0.145
 appreciation+0.141
ipp+0.137
 pride+0.137
Neg logit contributions
 bumped-0.097
 stub-0.097
 accidentally-0.090
 tri-0.084
 landed-0.082
TokenPeter
Token ID19727
Feature activation-1.10e+00

Pos logit contributions
 pleasure+0.675
 generosity+0.660
 appreciation+0.628
ipp+0.608
 pride+0.607
Neg logit contributions
 bumped-0.615
 stub-0.614
 accidentally-0.581
 tri-0.544
?"-0.540
Token ate
Token ID15063
Feature activation-1.76e+00

Pos logit contributions
 pleasure+2.632
 generosity+2.570
 appreciation+2.457
ipp+2.415
 pride+2.409
Neg logit contributions
 bumped-1.156
 stub-1.134
 accidentally-1.075
 tri-0.955
 landed-0.939
Token all
Token ID477
Feature activation-4.59e+00

Pos logit contributions
 pleasure+3.804
 generosity+3.794
 appreciation+3.589
ipp+3.577
ums+3.536
Neg logit contributions
 bumped-2.157
 accidentally-2.045
 stub-2.034
?"-1.925
 suddenly-1.860
Token the
Token ID262
Feature activation-3.20e+00

Pos logit contributions
 generosity+14.138
 pleasure+13.875
izza+13.605
ealing+13.561
ums+13.338
Neg logit contributions
?"-3.172
 bumped-2.815
 suddenly-2.687
 landed-2.673
 hit-2.436
Token food
Token ID2057
Feature activation-2.00e+00

Pos logit contributions
 pleasure+5.709
 generosity+5.550
ipp+5.525
ums+5.430
 appreciation+5.284
Neg logit contributions
 bumped-5.469
?"-5.249
 stub-5.202
 seek-5.188
 landed-5.187
Token and
Token ID290
Feature activation-9.98e-01

Pos logit contributions
 pleasure+4.603
 generosity+4.488
 appreciation+4.355
ipp+4.256
 pride+4.235
Neg logit contributions
 bumped-2.294
 stub-2.196
 accidentally-2.156
 suddenly-1.948
?"-1.926
Token enjoyed
Token ID8359
Feature activation-1.33e+00

Pos logit contributions
 pleasure+2.148
 generosity+2.091
 appreciation+2.001
ipp+1.987
 pride+1.951
Neg logit contributions
 bumped-1.259
 stub-1.245
 accidentally-1.193
 tri-1.075
 landed-1.058
Token the
Token ID262
Feature activation-2.95e+00

Pos logit contributions
 pleasure+2.904
 generosity+2.847
 appreciation+2.715
ipp+2.701
 pride+2.659
Neg logit contributions
 bumped-1.619
 stub-1.585
 accidentally-1.531
?"-1.374
 tri-1.371
Token.
Token ID13
Feature activation-1.07e+00

Pos logit contributions
 pleasure+5.406
 generosity+5.190
ipp+5.103
 appreciation+5.023
 pride+4.949
Neg logit contributions
 bumped-2.392
 stub-2.318
 accidentally-2.239
?"-2.130
 tri-2.039
Token Together
Token ID17083
Feature activation-2.51e+00

Pos logit contributions
 pleasure+1.934
 generosity+1.921
 appreciation+1.810
ipp+1.754
 pride+1.737
Neg logit contributions
 stub-1.676
 bumped-1.667
 accidentally-1.618
?"-1.487
 tri-1.482
Token,
Token ID11
Feature activation-2.27e+00

Pos logit contributions
 pleasure+6.349
 generosity+6.093
ipp+5.791
 appreciation+5.755
 kinds+5.733
Neg logit contributions
 bumped-2.540
 accidentally-2.499
 stub-2.348
 suddenly-2.313
 tri-2.181
Token they
Token ID484
Feature activation-1.55e+00

Pos logit contributions
 pleasure+5.439
 generosity+5.293
 appreciation+5.087
ipp+5.053
ums+5.003
Neg logit contributions
 accidentally-2.462
 bumped-2.420
 stub-2.321
 suddenly-2.196
 tri-2.086
Token collected
Token ID7723
Feature activation-2.75e+00

Pos logit contributions
 pleasure+3.573
 generosity+3.485
 appreciation+3.301
 choices+3.282
 decisions+3.278
Neg logit contributions
 bumped-1.875
 stub-1.806
 accidentally-1.751
 landed-1.615
 tri-1.593
Token all
Token ID477
Feature activation-4.55e+00

Pos logit contributions
 pleasure+6.419
 generosity+6.332
izza+6.020
ums+5.962
ipp+5.940
Neg logit contributions
 bumped-3.172
 accidentally-2.952
 landed-2.952
 stub-2.784
 suddenly-2.763
Token the
Token ID262
Feature activation-2.82e+00

Pos logit contributions
 pleasure+13.777
 generosity+13.704
izza+13.631
ealing+13.317
ums+13.186
Neg logit contributions
 landed-3.240
 bumped-2.958
 accidentally-2.805
 hit-2.741
 saw-2.723
Token stones
Token ID14966
Feature activation-2.31e+00

Pos logit contributions
 pleasure+5.638
 generosity+5.591
ums+5.484
ipp+5.452
izza+5.366
Neg logit contributions
 bumped-4.202
 accidentally-4.038
 stub-3.918
 landed-3.805
 seek-3.720
Token Jack
Token ID3619
Feature activation-1.77e+00

Pos logit contributions
 pleasure+5.444
 generosity+5.302
 appreciation+5.108
 pride+4.967
izza+4.957
Neg logit contributions
 bumped-2.597
 stub-2.495
 accidentally-2.474
 landed-2.302
 suddenly-2.266
Token wanted
Token ID2227
Feature activation-1.17e+00

Pos logit contributions
 pleasure+4.156
 generosity+4.077
 appreciation+3.948
ipp+3.870
 pride+3.834
Neg logit contributions
 stub-1.947
 bumped-1.942
 accidentally-1.880
 tri-1.672
 landed-1.663
Token.
Token ID13
Feature activation-1.79e+00

Pos logit contributions
 pleasure+2.737
 generosity+2.683
 appreciation+2.580
ipp+2.546
 pride+2.519
Neg logit contributions
 bumped-1.247
 stub-1.226
 accidentally-1.178
 tri-1.040
?"-1.030
Token,
Token ID11
Feature activation-1.50e+00

Pos logit contributions
 pleasure+2.977
 generosity+2.936
 appreciation+2.807
ipp+2.746
 pride+2.725
Neg logit contributions
 bumped-1.674
 stub-1.654
 accidentally-1.600
?"-1.562
 tri-1.459
Token Dad
Token ID17415
Feature activation-1.68e+00

Pos logit contributions
 pleasure+2.707
 generosity+2.695
 appreciation+2.532
ipp+2.506
 kinds+2.454
Neg logit contributions
 stub-2.298
 bumped-2.286
 accidentally-2.248
?"-2.168
 tri-2.083
Token."
Token ID526
Feature activation-1.32e+00

Pos logit contributions
 pleasure+4.378
 generosity+4.271
 appreciation+4.136
 pride+4.074
ipp+4.039
Neg logit contributions
 bumped-1.359
?"-1.294
 stub-1.272
 accidentally-1.223
 tri-1.050
Token They
Token ID1119
Feature activation-1.61e+00

Pos logit contributions
 pleasure+2.701
 generosity+2.659
 appreciation+2.510
ipp+2.447
 pride+2.434
Neg logit contributions
 stub-1.809
 bumped-1.791
 accidentally-1.750
?"-1.640
 tri-1.580
Token hugged
Token ID41130
Feature activation-1.62e+00

Pos logit contributions
 pleasure+3.800
 generosity+3.696
 appreciation+3.529
 decisions+3.503
 choices+3.489
Neg logit contributions
 bumped-1.808
 stub-1.731
 accidentally-1.669
 tri-1.528
 landed-1.517
Token Dad
Token ID17415
Feature activation-4.55e+00

Pos logit contributions
 pleasure+3.324
 generosity+3.212
 appreciation+3.012
ipp+2.972
izza+2.923
Neg logit contributions
 bumped-2.260
 stub-2.234
 accidentally-2.157
?"-2.053
 tri-1.981
Token and
Token ID290
Feature activation-9.01e-01

Pos logit contributions
 pleasure+14.434
 generosity+13.086
ums+12.720
 appreciation+12.676
omp+12.485
Neg logit contributions
 bumped-3.180
 saw-2.902
 woke-2.842
 heard-2.771
 under-2.756
Token said
Token ID531
Feature activation-2.55e+00

Pos logit contributions
 pleasure+1.917
 generosity+1.868
 appreciation+1.786
ipp+1.757
 pride+1.736
Neg logit contributions
 bumped-1.162
 stub-1.154
 accidentally-1.095
 tri-0.998
?"-0.987
Token,
Token ID11
Feature activation-8.51e-01

Pos logit contributions
 pleasure+6.321
 generosity+6.108
 sorts+5.847
 appreciation+5.837
 kinds+5.827
Neg logit contributions
 bumped-2.548
 stub-2.346
?"-2.269
 accidentally-2.229
 suddenly-2.060
Token "
Token ID366
Feature activation-2.11e+00

Pos logit contributions
 pleasure+1.888
 generosity+1.848
 appreciation+1.773
ipp+1.730
 pride+1.727
Neg logit contributions
 bumped-1.013
 stub-1.010
 accidentally-0.949
 tri-0.858
?"-0.852
TokenYou
Token ID1639
Feature activation-2.43e+00

Pos logit contributions
 pleasure+4.741
 generosity+4.696
 appreciation+4.530
 decisions+4.413
 pride+4.378
Neg logit contributions
 stub-2.450
 bumped-2.344
 accidentally-2.241
Please-2.217
Can-2.152
Token
Token ID198
Feature activation-1.63e+00

Pos logit contributions
 pleasure+2.427
 generosity+2.376
 appreciation+2.256
ipp+2.211
 pride+2.211
Neg logit contributions
 stub-1.235
 bumped-1.221
 accidentally-1.151
?"-1.062
 tri-1.030
TokenThe
Token ID464
Feature activation-1.17e+00

Pos logit contributions
 pleasure+3.292
 generosity+3.121
 appreciation+2.916
 pride+2.901
 choices+2.877
Neg logit contributions
 bumped-2.386
 stub-2.380
 accidentally-2.299
 suddenly-2.141
?"-2.139
Token shop
Token ID6128
Feature activation-2.20e+00

Pos logit contributions
 pleasure+2.426
 generosity+2.357
 appreciation+2.250
ipp+2.236
 pride+2.185
Neg logit contributions
 bumped-1.592
 stub-1.562
 accidentally-1.484
?"-1.376
 tri-1.339
Tokenkeeper
Token ID13884
Feature activation-1.90e+00

Pos logit contributions
 pleasure+4.287
 generosity+4.010
 appreciation+3.805
 pride+3.770
 decisions+3.758
Neg logit contributions
 bumped-3.498
 stub-3.351
 accidentally-3.315
 suddenly-3.123
 tri-3.070
Token gathered
Token ID9272
Feature activation-3.23e+00

Pos logit contributions
 pleasure+4.304
 generosity+4.078
 choices+3.895
 decisions+3.881
 appreciation+3.864
Neg logit contributions
 bumped-2.456
 accidentally-2.348
 stub-2.331
 suddenly-2.109
 tri-2.108
Token all
Token ID477
Feature activation-4.54e+00

Pos logit contributions
 pleasure+7.847
 generosity+7.712
izza+7.373
lette+7.269
ipp+7.261
Neg logit contributions
 bumped-3.629
 accidentally-3.267
 suddenly-3.212
 stub-3.179
 landed-3.121
Token of
Token ID286
Feature activation-2.25e+00

Pos logit contributions
 pleasure+14.036
 generosity+14.008
izza+13.712
ealing+13.566
lette+13.281
Neg logit contributions
 bumped-3.268
 landed-3.025
 accidentally-2.806
 hit-2.780
 stub-2.708
Token the
Token ID262
Feature activation-1.26e+00

Pos logit contributions
 pleasure+5.229
 generosity+5.153
izza+4.875
 appreciation+4.842
ealing+4.831
Neg logit contributions
 bumped-2.627
 accidentally-2.523
 stub-2.482
 suddenly-2.193
 tri-2.160
Token items
Token ID3709
Feature activation-2.44e+00

Pos logit contributions
 pleasure+2.455
 generosity+2.393
 appreciation+2.255
ipp+2.242
izza+2.204
Neg logit contributions
 bumped-1.913
 stub-1.872
 accidentally-1.785
 tri-1.652
 landed-1.649
Token and
Token ID290
Feature activation-1.08e+00

Pos logit contributions
 pleasure+6.114
 generosity+5.868
 appreciation+5.695
ipp+5.551
ums+5.527
Neg logit contributions
 bumped-2.493
 accidentally-2.253
 stub-2.226
 suddenly-2.151
 landed-2.095
Token told
Token ID1297
Feature activation-1.65e+00

Pos logit contributions
 pleasure+2.378
 generosity+2.316
 appreciation+2.211
ipp+2.169
 pride+2.154
Neg logit contributions
 bumped-1.349
 stub-1.322
 accidentally-1.279
 tri-1.149
 landed-1.137
Token Tom
Token ID4186
Feature activation-6.82e-02

Pos logit contributions
 stub+1.281
 bumped+1.197
 accidentally+1.091
 tri+1.026
 seek+1.002
Neg logit contributions
 pleasure-1.774
 generosity-1.709
 appreciation-1.643
 pride-1.602
ipp-1.584
Token and
Token ID290
Feature activation+2.79e-01

Pos logit contributions
 pleasure+0.134
 generosity+0.133
 appreciation+0.126
ipp+0.124
 pride+0.123
Neg logit contributions
 stub-0.095
 bumped-0.094
 accidentally-0.087
 tri-0.081
?"-0.079
Token his
Token ID465
Feature activation-1.13e+00

Pos logit contributions
 bumped+0.430
 stub+0.430
 accidentally+0.400
 tri+0.381
?"+0.366
Neg logit contributions
 generosity-0.518
 pleasure-0.516
 appreciation-0.491
 pride-0.475
 kinds-0.462
Token rabbit
Token ID22746
Feature activation-1.28e+00

Pos logit contributions
 pleasure+1.960
 generosity+1.898
 appreciation+1.800
ipp+1.782
 pride+1.745
Neg logit contributions
 stub-1.893
 bumped-1.884
 accidentally-1.818
?"-1.688
 tri-1.687
Token sold
Token ID2702
Feature activation-2.12e+00

Pos logit contributions
 pleasure+2.666
 generosity+2.589
 appreciation+2.477
ipp+2.451
 pride+2.398
Neg logit contributions
 bumped-1.727
 stub-1.683
 accidentally-1.632
 tri-1.479
 landed-1.479
Token all
Token ID477
Feature activation-4.52e+00

Pos logit contributions
 pleasure+4.361
 generosity+4.196
ipp+4.105
ums+3.999
 appreciation+3.972
Neg logit contributions
 bumped-2.825
 stub-2.703
 accidentally-2.675
?"-2.618
 landed-2.516
Token the
Token ID262
Feature activation-2.44e+00

Pos logit contributions
 pleasure+13.746
 generosity+13.517
ealing+13.318
ums+13.179
izza+12.834
Neg logit contributions
?"-3.278
 bumped-2.530
 accidentally-2.441
 hit-2.427
 landed-2.421
Token carrots
Token ID34397
Feature activation-1.30e+00

Pos logit contributions
 pleasure+3.742
 generosity+3.533
ipp+3.431
 appreciation+3.407
izza+3.374
Neg logit contributions
 bumped-4.739
 stub-4.670
 accidentally-4.633
?"-4.609
 landed-4.335
Token.
Token ID13
Feature activation-3.13e-01

Pos logit contributions
 pleasure+3.239
 generosity+3.149
 appreciation+3.041
ipp+2.979
 pride+2.973
Neg logit contributions
 bumped-1.231
 stub-1.202
 accidentally-1.138
?"-1.026
 landed-0.989
Token They
Token ID1119
Feature activation-1.16e+00

Pos logit contributions
 pleasure+0.535
 generosity+0.520
 appreciation+0.491
ipp+0.479
 pride+0.478
Neg logit contributions
 stub-0.537
 bumped-0.532
 accidentally-0.507
 tri-0.474
?"-0.470
Token were
Token ID547
Feature activation-2.45e+00

Pos logit contributions
 pleasure+2.303
 generosity+2.226
 appreciation+2.122
ipp+2.076
 decisions+2.044
Neg logit contributions
 bumped-1.732
 stub-1.686
 accidentally-1.653
 tri-1.505
 landed-1.501
Token were
Token ID547
Feature activation-3.53e+00

Pos logit contributions
 pleasure+4.511
 generosity+4.338
 decisions+4.187
 appreciation+4.137
 choices+4.117
Neg logit contributions
 bumped-2.799
 accidentally-2.723
 stub-2.675
 landed-2.456
?"-2.444
Token filled
Token ID5901
Feature activation-2.20e+00

Pos logit contributions
 generosity+8.232
izza+8.068
 pleasure+8.003
 decisions+7.628
apa+7.620
Neg logit contributions
 accidentally-4.304
 bumped-4.260
?"-4.185
 stub-4.026
 suddenly-3.972
Token with
Token ID351
Feature activation-3.63e+00

Pos logit contributions
 pleasure+5.923
 generosity+5.816
 pride+5.546
 appreciation+5.485
izza+5.377
Neg logit contributions
 bumped-1.647
 stub-1.615
 accidentally-1.613
?"-1.596
 suddenly-1.435
Token fresh
Token ID4713
Feature activation-2.86e+00

Pos logit contributions
ipp+6.416
 generosity+6.250
ums+6.223
 pleasure+6.197
izza+6.020
Neg logit contributions
 bumped-5.847
 accidentally-5.846
 suddenly-5.695
?"-5.634
 stub-5.620
Token cuc
Token ID38421
Feature activation-3.61e+00

Pos logit contributions
 pleasure+6.279
 generosity+6.179
 appreciation+5.929
ipp+5.844
izza+5.700
Neg logit contributions
 bumped-3.640
 accidentally-3.402
?"-3.355
 stub-3.288
 landed-3.162
Tokenumber
Token ID4494
Feature activation-4.52e+00

Pos logit contributions
 pleasure+7.145
 generosity+6.590
 appreciation+6.425
 pride+6.042
 kindness+5.993
Neg logit contributions
 bumped-6.060
 stub-6.007
 accidentally-5.691
?"-5.640
 seek-5.502
Token and
Token ID290
Feature activation-1.79e+00

Pos logit contributions
 pleasure+13.425
 pride+12.497
 generosity+12.273
 appreciation+12.229
 satisfaction+12.158
Neg logit contributions
 bumped-3.652
 accidentally-3.529
?"-3.183
 stub-3.078
 suddenly-2.987
Token cheese
Token ID9891
Feature activation-1.83e+00

Pos logit contributions
 pleasure+2.688
 generosity+2.544
ipp+2.460
 appreciation+2.425
ums+2.369
Neg logit contributions
 bumped-3.435
 stub-3.399
 accidentally-3.398
?"-3.129
 suddenly-3.124
Token.
Token ID13
Feature activation-2.08e+00

Pos logit contributions
 pleasure+4.493
 generosity+4.344
 appreciation+4.177
 pride+4.138
ipp+4.093
Neg logit contributions
 bumped-1.793
 stub-1.723
 accidentally-1.700
?"-1.527
 landed-1.477
Token It
Token ID632
Feature activation-2.14e+00

Pos logit contributions
 generosity+4.165
 pleasure+4.118
 appreciation+3.892
 pride+3.845
 choices+3.816
Neg logit contributions
 bumped-2.828
 stub-2.820
 accidentally-2.760
?"-2.681
 landed-2.535
Token made
Token ID925
Feature activation-2.92e+00

Pos logit contributions
 pleasure+5.955
 generosity+5.756
 appreciation+5.548
 pride+5.521
 choices+5.469
Neg logit contributions
 bumped-1.624
 accidentally-1.612
 stub-1.535
 landed-1.339
 suddenly-1.323
Token
Token ID198
Feature activation-4.57e-01

Pos logit contributions
 pleasure+1.645
 generosity+1.617
 appreciation+1.543
ipp+1.506
 pride+1.505
Neg logit contributions
 bumped-0.901
 stub-0.901
 accidentally-0.848
 tri-0.765
?"-0.764
Token
Token ID198
Feature activation-1.38e+00

Pos logit contributions
 pleasure+0.981
 generosity+0.962
 appreciation+0.921
ipp+0.900
 pride+0.896
Neg logit contributions
 bumped-0.578
 stub-0.578
 accidentally-0.540
 tri-0.492
?"-0.486
TokenThe
Token ID464
Feature activation-7.92e-02

Pos logit contributions
 pleasure+3.158
 generosity+3.037
 appreciation+2.871
ipp+2.854
 pride+2.840
Neg logit contributions
 bumped-1.653
 stub-1.629
 accidentally-1.599
 tri-1.427
 landed-1.408
Token woman
Token ID2415
Feature activation-1.50e+00

Pos logit contributions
 pleasure+0.170
 generosity+0.168
 appreciation+0.160
 pride+0.157
ipp+0.155
Neg logit contributions
 stub-0.102
 bumped-0.101
 accidentally-0.096
?"-0.087
 tri-0.086
Token gave
Token ID2921
Feature activation-2.24e+00

Pos logit contributions
 pleasure+3.315
 generosity+3.185
 appreciation+3.027
 pride+2.985
 choices+2.969
Neg logit contributions
 bumped-1.944
 stub-1.864
 accidentally-1.828
 tri-1.657
 landed-1.634
Token two
Token ID734
Feature activation-4.49e+00

Pos logit contributions
 pleasure+4.984
 generosity+4.860
 appreciation+4.604
izza+4.558
ipp+4.530
Neg logit contributions
 bumped-2.892
 accidentally-2.768
 stub-2.747
 landed-2.533
 tri-2.522
Token yellow
Token ID7872
Feature activation-2.38e+00

Pos logit contributions
 pleasure+11.377
 generosity+9.902
ipp+9.689
izza+9.655
 appreciation+9.548
Neg logit contributions
 bumped-6.216
 saw-5.910
 accidentally-5.873
 heard-5.552
 landed-5.469
Token le
Token ID443
Feature activation-2.85e+00

Pos logit contributions
 pleasure+4.586
 generosity+4.280
 appreciation+4.115
ipp+4.004
 pride+3.992
Neg logit contributions
 bumped-3.814
 accidentally-3.684
 stub-3.623
?"-3.361
 tri-3.352
Tokenmons
Token ID11567
Feature activation-3.32e+00

Pos logit contributions
 pleasure+5.912
 appreciation+5.566
 generosity+5.565
 pride+5.313
 decisions+5.067
Neg logit contributions
 bumped-4.310
 stub-4.015
 accidentally-3.942
 tri-3.858
?"-3.617
Token to
Token ID284
Feature activation-7.51e-01

Pos logit contributions
 pleasure+9.142
 generosity+8.625
 appreciation+8.462
 decisions+8.227
 pride+8.135
Neg logit contributions
 bumped-2.851
 accidentally-2.664
 stub-2.591
 landed-2.404
?"-2.339
Token Sam
Token ID3409
Feature activation-2.24e+00

Pos logit contributions
 pleasure+1.487
 generosity+1.453
 appreciation+1.384
ipp+1.351
 pride+1.342
Neg logit contributions
 bumped-1.073
 stub-1.069
 accidentally-1.019
 tri-0.934
?"-0.929
Token to
Token ID284
Feature activation-5.76e-01

Pos logit contributions
 pleasure+0.043
 generosity+0.042
 appreciation+0.040
ipp+0.039
 pride+0.039
Neg logit contributions
 stub-0.024
 bumped-0.023
 accidentally-0.022
 tri-0.020
Ro-0.020
Token do
Token ID466
Feature activation-1.37e-01

Pos logit contributions
 pleasure+1.073
 generosity+1.049
 appreciation+0.994
ipp+0.969
 pride+0.963
Neg logit contributions
 bumped-0.896
 stub-0.896
 accidentally-0.851
 tri-0.789
?"-0.783
Token!
Token ID0
Feature activation-1.99e+00

Pos logit contributions
 pleasure+0.313
 generosity+0.310
 appreciation+0.297
ipp+0.288
 pride+0.288
Neg logit contributions
 stub-0.152
 bumped-0.150
 accidentally-0.138
 tri-0.125
 landed-0.122
Token she
Token ID673
Feature activation-6.90e-01

Pos logit contributions
 generosity+4.144
 pleasure+4.120
 appreciation+3.937
ipp+3.774
 pride+3.771
Neg logit contributions
 stub-2.644
 accidentally-2.554
 bumped-2.553
?"-2.296
 tri-2.257
Token gathered
Token ID9272
Feature activation-3.42e+00

Pos logit contributions
 pleasure+1.508
 generosity+1.475
 appreciation+1.408
ipp+1.382
 pride+1.373
Neg logit contributions
 bumped-0.848
 stub-0.845
 accidentally-0.796
 tri-0.721
?"-0.714
Token all
Token ID477
Feature activation-4.46e+00

Pos logit contributions
 pleasure+8.248
 generosity+8.073
ipp+7.986
izza+7.960
ums+7.728
Neg logit contributions
 bumped-3.765
 accidentally-3.345
 suddenly-3.329
 stub-3.280
 landed-3.280
Token her
Token ID607
Feature activation-2.93e+00

Pos logit contributions
 pleasure+13.735
 generosity+13.591
izza+13.413
ealing+13.181
ums+12.911
Neg logit contributions
 bumped-2.882
?"-2.842
 landed-2.696
 accidentally-2.583
 stub-2.535
Token friends
Token ID2460
Feature activation-3.08e+00

Pos logit contributions
 pleasure+5.074
ipp+5.036
ums+4.767
 generosity+4.759
 pride+4.430
Neg logit contributions
 bumped-5.170
 stub-4.958
 accidentally-4.925
 landed-4.646
?"-4.593
Token together
Token ID1978
Feature activation-2.00e+00

Pos logit contributions
 pleasure+7.669
 generosity+7.289
ipp+7.167
ums+7.149
 appreciation+7.007
Neg logit contributions
 bumped-3.318
 accidentally-3.114
 stub-3.077
 suddenly-2.831
 tri-2.808
Token to
Token ID284
Feature activation-1.39e+00

Pos logit contributions
 pleasure+4.538
 generosity+4.372
ipp+4.213
 appreciation+4.168
 pride+4.125
Neg logit contributions
 bumped-2.395
 accidentally-2.280
 stub-2.274
 tri-2.039
 suddenly-2.036
Token help
Token ID1037
Feature activation-1.35e+00

Pos logit contributions
 pleasure+2.757
 generosity+2.675
ipp+2.558
 appreciation+2.538
 pride+2.492
Neg logit contributions
 bumped-1.977
 stub-1.934
 accidentally-1.901
?"-1.776
 seek-1.720
Token
Token ID198
Feature activation-5.08e-04

Pos logit contributions
 pleasure+2.044
 generosity+2.019
 appreciation+1.904
ipp+1.855
 pride+1.854
Neg logit contributions
 stub-1.364
 bumped-1.364
 accidentally-1.314
?"-1.204
 tri-1.191
Token
Token ID198
Feature activation-7.20e-01

Pos logit contributions
 pleasure+0.001
 generosity+0.001
 appreciation+0.001
ipp+0.001
 pride+0.001
Neg logit contributions
 bumped-0.001
 stub-0.001
 accidentally-0.001
 tri-0.001
 landed-0.001
TokenThe
Token ID464
Feature activation-9.59e-01

Pos logit contributions
 pleasure+1.751
 generosity+1.720
 appreciation+1.647
ipp+1.615
 pride+1.610
Neg logit contributions
 bumped-0.720
 stub-0.718
 accidentally-0.670
 tri-0.589
?"-0.583
Token pals
Token ID39898
Feature activation-7.00e-01

Pos logit contributions
 pleasure+1.951
 generosity+1.892
 appreciation+1.806
ipp+1.788
 pride+1.762
Neg logit contributions
 stub-1.349
 bumped-1.344
 accidentally-1.278
?"-1.180
 tri-1.166
Token gathered
Token ID9272
Feature activation-3.32e+00

Pos logit contributions
 pleasure+1.620
 generosity+1.590
 appreciation+1.522
ipp+1.493
 pride+1.484
Neg logit contributions
 bumped-0.771
 stub-0.766
 accidentally-0.718
 tri-0.641
?"-0.631
Token all
Token ID477
Feature activation-4.46e+00

Pos logit contributions
 pleasure+7.923
 generosity+7.766
ipp+7.572
ums+7.353
izza+7.198
Neg logit contributions
 bumped-4.081
 accidentally-3.722
 stub-3.506
 suddenly-3.454
 hit-3.426
Token of
Token ID286
Feature activation-2.83e+00

Pos logit contributions
 pleasure+13.830
 generosity+13.693
ealing+12.807
ums+12.743
izza+12.612
Neg logit contributions
?"-3.373
 bumped-3.079
 accidentally-3.003
 hit-2.935
 landed-2.757
Token the
Token ID262
Feature activation-2.53e+00

Pos logit contributions
 pleasure+6.942
 generosity+6.867
ipp+6.608
ums+6.573
 appreciation+6.431
Neg logit contributions
 accidentally-3.158
 bumped-2.867
 stub-2.641
?"-2.588
 suddenly-2.564
Token ingredients
Token ID9391
Feature activation-2.84e+00

Pos logit contributions
 pleasure+4.669
 generosity+4.441
ipp+4.217
 appreciation+4.132
 pride+4.078
Neg logit contributions
 bumped-4.425
 accidentally-4.363
 stub-4.311
?"-4.234
 suddenly-3.889
Token for
Token ID329
Feature activation-1.51e+00

Pos logit contributions
 pleasure+7.117
 generosity+6.717
 appreciation+6.444
 pride+6.371
ipp+6.320
Neg logit contributions
 bumped-3.159
 accidentally-3.018
 suddenly-2.786
?"-2.762
 stub-2.711
Token the
Token ID262
Feature activation-1.42e+00

Pos logit contributions
 pleasure+3.010
 generosity+2.916
 appreciation+2.784
ipp+2.773
 pride+2.698
Neg logit contributions
 bumped-2.143
 accidentally-2.121
 stub-2.073
 tri-1.885
 suddenly-1.872
Token to
Token ID284
Feature activation-5.69e-01

Pos logit contributions
 pleasure+3.661
 generosity+3.533
 appreciation+3.385
ipp+3.381
 pride+3.344
Neg logit contributions
 bumped-1.795
 stub-1.711
 accidentally-1.704
 tri-1.509
?"-1.507
Token help
Token ID1037
Feature activation-1.07e+00

Pos logit contributions
 pleasure+0.905
 generosity+0.881
 appreciation+0.827
ipp+0.802
 pride+0.797
Neg logit contributions
 bumped-1.040
 stub-1.040
 accidentally-0.995
 tri-0.934
?"-0.929
Token!
Token ID0
Feature activation-1.46e+00

Pos logit contributions
 pleasure+2.284
 generosity+2.215
 appreciation+2.116
ipp+2.090
 pride+2.067
Neg logit contributions
 bumped-1.382
 stub-1.359
 accidentally-1.324
?"-1.204
 tri-1.193
Token They
Token ID1119
Feature activation-1.42e+00

Pos logit contributions
 pleasure+2.498
 generosity+2.488
 appreciation+2.321
 pride+2.238
ipp+2.232
Neg logit contributions
 bumped-2.452
 stub-2.424
 accidentally-2.410
?"-2.250
 tri-2.200
Token gathered
Token ID9272
Feature activation-3.14e+00

Pos logit contributions
 pleasure+3.186
 generosity+3.074
 appreciation+2.920
ipp+2.879
 decisions+2.878
Neg logit contributions
 bumped-1.812
 accidentally-1.732
 stub-1.732
 tri-1.531
 landed-1.522
Token all
Token ID477
Feature activation-4.45e+00

Pos logit contributions
 pleasure+7.429
 generosity+7.242
ipp+7.108
ums+6.909
izza+6.740
Neg logit contributions
 bumped-3.846
 accidentally-3.419
 landed-3.224
 suddenly-3.220
 hit-3.174
Token of
Token ID286
Feature activation-2.53e+00

Pos logit contributions
 pleasure+13.916
 generosity+13.670
ums+13.005
ealing+12.977
izza+12.756
Neg logit contributions
?"-3.234
 bumped-3.056
 hit-2.916
 accidentally-2.785
 saw-2.670
Token the
Token ID262
Feature activation-1.77e+00

Pos logit contributions
 pleasure+6.146
 generosity+5.989
ums+5.789
ipp+5.785
ealing+5.713
Neg logit contributions
 accidentally-2.737
 bumped-2.718
 stub-2.432
?"-2.304
 tri-2.195
Token supplies
Token ID9416
Feature activation-2.29e+00

Pos logit contributions
 pleasure+3.330
 generosity+3.199
ipp+3.113
 appreciation+3.016
ums+2.965
Neg logit contributions
 bumped-2.895
 stub-2.784
 accidentally-2.745
?"-2.556
 tri-2.494
Token they
Token ID484
Feature activation-5.06e-01

Pos logit contributions
 pleasure+5.383
 generosity+5.163
 appreciation+5.010
ipp+4.991
ums+4.861
Neg logit contributions
 bumped-2.623
 accidentally-2.511
 stub-2.405
?"-2.294
 suddenly-2.253
Token would
Token ID561
Feature activation-1.05e+00

Pos logit contributions
 pleasure+1.127
 generosity+1.107
 appreciation+1.058
ipp+1.035
 pride+1.034
Neg logit contributions
 stub-0.597
 bumped-0.596
 accidentally-0.556
 tri-0.503
?"-0.498
Tokenily
Token ID813
Feature activation-1.27e+00

Pos logit contributions
 pleasure+0.003
 generosity+0.003
ipp+0.003
 appreciation+0.003
 pride+0.003
Neg logit contributions
 stub-0.002
 bumped-0.002
 accidentally-0.002
 tri-0.002
 landed-0.002
Token and
Token ID290
Feature activation-9.52e-02

Pos logit contributions
 pleasure+2.735
 generosity+2.630
 appreciation+2.500
 pride+2.439
ipp+2.422
Neg logit contributions
 bumped-1.714
 stub-1.674
 accidentally-1.618
 tri-1.482
 landed-1.453
Token Ben
Token ID3932
Feature activation-1.56e+00

Pos logit contributions
 pleasure+0.214
 generosity+0.213
 appreciation+0.202
ipp+0.199
 pride+0.199
Neg logit contributions
 stub-0.110
 bumped-0.109
 accidentally-0.100
?"-0.091
 tri-0.089
Token gave
Token ID2921
Feature activation-2.29e+00

Pos logit contributions
 pleasure+3.436
 generosity+3.328
 appreciation+3.157
 decisions+3.115
 choices+3.104
Neg logit contributions
 bumped-2.062
 stub-1.958
 accidentally-1.921
 landed-1.761
 tri-1.758
Token the
Token ID262
Feature activation-1.51e+00

Pos logit contributions
 pleasure+5.396
 generosity+5.272
ipp+4.985
izza+4.967
 appreciation+4.942
Neg logit contributions
 bumped-2.592
 stub-2.421
 accidentally-2.349
 landed-2.195
?"-2.192
Token man
Token ID582
Feature activation-4.44e+00

Pos logit contributions
 pleasure+3.498
 generosity+3.382
ipp+3.263
 appreciation+3.255
 pride+3.194
Neg logit contributions
 bumped-1.732
 stub-1.699
 accidentally-1.614
?"-1.483
 tri-1.453
Token some
Token ID617
Feature activation-1.72e+00

Pos logit contributions
 pleasure+13.991
 generosity+12.528
umm+12.297
ipp+12.260
omp+12.011
Neg logit contributions
 bumped-3.475
 heard-2.802
 saw-2.772
 behind-2.763
 broke-2.671
Token bread
Token ID8509
Feature activation-2.43e+00

Pos logit contributions
 pleasure+3.193
 generosity+3.052
ipp+2.977
 appreciation+2.902
ums+2.809
Neg logit contributions
 bumped-2.688
 stub-2.629
 accidentally-2.572
?"-2.426
 tri-2.371
Token.
Token ID13
Feature activation-7.02e-01

Pos logit contributions
 pleasure+6.265
 generosity+5.831
ipp+5.685
 appreciation+5.660
 pride+5.606
Neg logit contributions
 bumped-2.283
 stub-2.161
 accidentally-2.078
?"-1.998
 landed-1.913
Token They
Token ID1119
Feature activation-1.10e+00

Pos logit contributions
 pleasure+1.167
 generosity+1.140
 appreciation+1.072
ipp+1.043
 pride+1.035
Neg logit contributions
 bumped-1.226
 stub-1.225
 accidentally-1.176
 tri-1.096
?"-1.095
Token taped
Token ID36114
Feature activation-2.36e+00

Pos logit contributions
 pleasure+2.404
 generosity+2.329
 appreciation+2.225
ipp+2.171
 decisions+2.151
Neg logit contributions
 bumped-1.401
 stub-1.368
 accidentally-1.320
 tri-1.191
 landed-1.184
Token sweet
Token ID6029
Feature activation-1.52e+00

Pos logit contributions
 pleasure+3.634
 generosity+3.505
 appreciation+3.281
izza+3.249
ipp+3.248
Neg logit contributions
 bumped-3.651
 stub-3.558
 accidentally-3.519
 landed-3.252
 tri-3.231
Token.
Token ID13
Feature activation-1.26e+00

Pos logit contributions
 pleasure+3.631
 generosity+3.545
 appreciation+3.412
ipp+3.368
 pride+3.361
Neg logit contributions
 bumped-1.560
 stub-1.515
 accidentally-1.475
?"-1.333
 tri-1.282
Token He
Token ID679
Feature activation-4.52e-01

Pos logit contributions
 pleasure+2.351
 generosity+2.348
 appreciation+2.230
ipp+2.147
 pride+2.146
Neg logit contributions
 bumped-1.904
 stub-1.903
 accidentally-1.858
?"-1.690
 tri-1.684
Token quickly
Token ID2952
Feature activation-4.86e-01

Pos logit contributions
 pleasure+0.976
 generosity+0.959
 appreciation+0.916
ipp+0.897
 pride+0.892
Neg logit contributions
 stub-0.560
 bumped-0.560
 accidentally-0.523
 tri-0.475
?"-0.475
Token collected
Token ID7723
Feature activation-2.57e+00

Pos logit contributions
 pleasure+1.063
 generosity+1.044
 appreciation+0.999
ipp+0.977
 pride+0.974
Neg logit contributions
 stub-0.588
 bumped-0.587
 accidentally-0.548
?"-0.497
 tri-0.497
Token all
Token ID477
Feature activation-4.44e+00

Pos logit contributions
 pleasure+5.868
 generosity+5.762
izza+5.455
 appreciation+5.369
ums+5.313
Neg logit contributions
 bumped-3.214
 landed-2.975
 accidentally-2.975
 stub-2.892
?"-2.828
Token the
Token ID262
Feature activation-2.45e+00

Pos logit contributions
 pleasure+13.943
 generosity+13.778
izza+13.371
ealing+12.984
ums+12.860
Neg logit contributions
?"-3.248
 landed-2.986
 bumped-2.877
 accidentally-2.794
 saw-2.574
Token berries
Token ID36322
Feature activation-2.40e+00

Pos logit contributions
 pleasure+4.363
 generosity+4.260
izza+4.098
 appreciation+4.002
ipp+3.986
Neg logit contributions
 bumped-4.185
 accidentally-4.062
 stub-4.060
?"-3.852
 landed-3.833
Token he
Token ID339
Feature activation+3.20e-01

Pos logit contributions
 pleasure+6.048
 generosity+5.824
 appreciation+5.693
 pride+5.530
 decisions+5.431
Neg logit contributions
 bumped-2.448
 accidentally-2.351
 stub-2.251
 landed-2.132
?"-2.086
Token could
Token ID714
Feature activation+6.23e-02

Pos logit contributions
 stub+0.382
 bumped+0.375
 accidentally+0.339
?"+0.316
 tri+0.314
Neg logit contributions
 pleasure-0.703
 generosity-0.695
 appreciation-0.655
ipp-0.647
 pride-0.645
Token find
Token ID1064
Feature activation-1.52e+00

Pos logit contributions
 bumped+0.095
 stub+0.094
 accidentally+0.088
 tri+0.083
 landed+0.080
Neg logit contributions
 pleasure-0.118
 generosity-0.116
 appreciation-0.110
ipp-0.108
 pride-0.106
Token tw
Token ID665
Feature activation-1.90e+00

Pos logit contributions
 pleasure+0.496
 generosity+0.489
 appreciation+0.462
 pride+0.447
ipp+0.445
Neg logit contributions
 stub-0.412
 bumped-0.409
 accidentally-0.385
 tri-0.360
?"-0.358
Tokenigs
Token ID9235
Feature activation-1.24e+00

Pos logit contributions
 pleasure+3.098
 generosity+3.027
 appreciation+2.757
 pride+2.727
 choices+2.579
Neg logit contributions
 bumped-3.574
 stub-3.462
 accidentally-3.438
?"-3.318
 landed-3.254
Token.
Token ID13
Feature activation-7.20e-01

Pos logit contributions
 pleasure+3.045
 generosity+2.974
 appreciation+2.850
 pride+2.784
ipp+2.778
Neg logit contributions
 bumped-1.233
 stub-1.191
 accidentally-1.124
?"-0.994
 tri-0.989
Token He
Token ID679
Feature activation-8.49e-01

Pos logit contributions
 pleasure+1.271
 generosity+1.243
 appreciation+1.177
ipp+1.141
 pride+1.136
Neg logit contributions
 bumped-1.185
 stub-1.181
 accidentally-1.131
 tri-1.051
?"-1.046
Token gave
Token ID2921
Feature activation-8.52e-01

Pos logit contributions
 pleasure+1.911
 generosity+1.862
 appreciation+1.783
ipp+1.742
 pride+1.740
Neg logit contributions
 bumped-1.013
 stub-1.003
 accidentally-0.953
 tri-0.855
?"-0.842
Token them
Token ID606
Feature activation-4.43e+00

Pos logit contributions
 pleasure+1.800
 generosity+1.765
 appreciation+1.680
ipp+1.638
 pride+1.629
Neg logit contributions
 bumped-1.126
 stub-1.113
 accidentally-1.059
 tri-0.963
?"-0.961
Token to
Token ID284
Feature activation+4.01e-01

Pos logit contributions
 pleasure+12.897
ums+12.224
 generosity+12.179
ealing+11.865
 decisions+11.828
Neg logit contributions
 bumped-3.797
 broke-3.247
 hit-3.238
 landed-3.195
?"-3.150
Token the
Token ID262
Feature activation+9.82e-01

Pos logit contributions
 stub+0.612
 bumped+0.598
 accidentally+0.555
 tri+0.515
 seek+0.507
Neg logit contributions
 pleasure-0.779
 generosity-0.766
 appreciation-0.723
 pride-0.712
ipp-0.701
Token squirrel
Token ID33039
Feature activation-2.80e+00

Pos logit contributions
 bumped+1.502
 stub+1.446
 accidentally+1.411
?"+1.293
 tri+1.278
Neg logit contributions
 pleasure-1.922
 generosity-1.894
 appreciation-1.777
 pride-1.736
 choices-1.698
Token,
Token ID11
Feature activation-4.65e-01

Pos logit contributions
 pleasure+7.821
 generosity+7.293
 pride+7.110
 appreciation+7.093
ipp+6.924
Neg logit contributions
 accidentally-2.377
 bumped-2.363
 stub-2.205
?"-2.114
 tri-1.838
Token who
Token ID508
Feature activation-1.53e+00

Pos logit contributions
 pleasure+0.956
 generosity+0.938
 appreciation+0.893
ipp+0.869
 pride+0.868
Neg logit contributions
 stub-0.634
 bumped-0.634
 accidentally-0.595
 tri-0.546
?"-0.542
Tokenummy
Token ID13513
Feature activation-2.09e+00

Pos logit contributions
 pleasure+3.007
 generosity+2.902
 appreciation+2.589
 pride+2.378
izza+2.232
Neg logit contributions
 bumped-6.813
 stub-6.780
 accidentally-6.709
?"-6.498
 tri-6.399
Token treats
Token ID18432
Feature activation-7.18e-01

Pos logit contributions
 pleasure+3.531
 generosity+3.370
ipp+3.198
 appreciation+3.176
 pride+3.112
Neg logit contributions
 bumped-3.644
 stub-3.545
 accidentally-3.500
?"-3.405
 landed-3.266
Token.
Token ID13
Feature activation-1.63e+00

Pos logit contributions
 pleasure+1.657
 generosity+1.625
 appreciation+1.560
ipp+1.528
 pride+1.522
Neg logit contributions
 bumped-0.784
 stub-0.780
 accidentally-0.729
 tri-0.651
?"-0.648
Token They
Token ID1119
Feature activation-1.16e+00

Pos logit contributions
 generosity+2.906
 pleasure+2.856
 appreciation+2.694
ipp+2.617
 choices+2.608
Neg logit contributions
 stub-2.652
 accidentally-2.640
 bumped-2.580
?"-2.401
 tri-2.372
Token took
Token ID1718
Feature activation-3.12e+00

Pos logit contributions
 pleasure+2.335
 generosity+2.281
 appreciation+2.156
ipp+2.109
 choices+2.098
Neg logit contributions
 bumped-1.662
 stub-1.640
 accidentally-1.580
 tri-1.453
 landed-1.437
Token all
Token ID477
Feature activation-4.43e+00

Pos logit contributions
 generosity+7.219
 pleasure+6.976
ipp+6.614
ums+6.608
 appreciation+6.588
Neg logit contributions
 bumped-3.956
 accidentally-3.634
 stub-3.530
 landed-3.441
 suddenly-3.373
Token of
Token ID286
Feature activation-3.18e+00

Pos logit contributions
 generosity+13.193
 pleasure+12.834
izza+12.482
ealing+12.267
ums+12.045
Neg logit contributions
 bumped-3.738
 landed-3.448
 stub-3.277
?"-3.257
 suddenly-3.105
Token the
Token ID262
Feature activation-2.18e+00

Pos logit contributions
 generosity+8.114
 pleasure+8.043
 decisions+7.573
izza+7.541
ealing+7.495
Neg logit contributions
 accidentally-3.467
 bumped-3.269
 stub-2.974
 suddenly-2.926
?"-2.868
Token strawberries
Token ID36973
Feature activation-2.33e+00

Pos logit contributions
 pleasure+3.536
 generosity+3.445
 appreciation+3.226
izza+3.200
ipp+3.198
Neg logit contributions
 bumped-4.070
 stub-3.948
 accidentally-3.862
 landed-3.637
?"-3.632
Token inside
Token ID2641
Feature activation-1.33e+00

Pos logit contributions
 pleasure+5.439
 generosity+5.299
 appreciation+5.104
 pride+4.968
 decisions+4.908
Neg logit contributions
 bumped-2.667
 stub-2.568
 accidentally-2.530
 landed-2.356
?"-2.354
Token and
Token ID290
Feature activation-6.44e-01

Pos logit contributions
 pleasure+2.854
 generosity+2.807
 appreciation+2.658
ipp+2.633
 pride+2.608
Neg logit contributions
 bumped-1.671
 stub-1.638
 accidentally-1.612
?"-1.461
 tri-1.435
Token great
Token ID1049
Feature activation-1.91e+00

Pos logit contributions
 pleasure+0.844
 generosity+0.824
 appreciation+0.780
ipp+0.756
 pride+0.753
Neg logit contributions
 bumped-0.765
 stub-0.763
 accidentally-0.725
 tri-0.676
?"-0.670
Token dish
Token ID9433
Feature activation-1.64e+00

Pos logit contributions
 generosity+3.689
 pleasure+3.681
ipp+3.507
 pride+3.470
 appreciation+3.435
Neg logit contributions
 stub-2.738
 bumped-2.734
 accidentally-2.680
?"-2.519
 tri-2.422
Token.
Token ID13
Feature activation-8.08e-01

Pos logit contributions
 pleasure+3.569
 generosity+3.423
 appreciation+3.301
 pride+3.237
ipp+3.220
Neg logit contributions
 bumped-2.053
 stub-2.003
 accidentally-1.956
?"-1.920
 seek-1.763
Token They
Token ID1119
Feature activation-1.02e+00

Pos logit contributions
 pleasure+1.321
 generosity+1.289
 appreciation+1.210
ipp+1.183
 pride+1.168
Neg logit contributions
 bumped-1.439
 stub-1.433
 accidentally-1.385
 tri-1.290
?"-1.285
Token gathered
Token ID9272
Feature activation-3.21e+00

Pos logit contributions
 pleasure+2.365
 generosity+2.311
 appreciation+2.200
ipp+2.152
 pride+2.140
Neg logit contributions
 bumped-1.183
 stub-1.159
 accidentally-1.122
 tri-0.989
?"-0.970
Token all
Token ID477
Feature activation-4.42e+00

Pos logit contributions
 generosity+7.335
 pleasure+7.301
ipp+7.080
ums+6.998
izza+6.760
Neg logit contributions
 bumped-4.130
 accidentally-3.694
 landed-3.554
 suddenly-3.498
 stub-3.454
Token the
Token ID262
Feature activation-2.99e+00

Pos logit contributions
 generosity+13.454
 pleasure+13.239
ealing+12.512
ums+12.507
izza+12.406
Neg logit contributions
?"-3.817
 bumped-3.198
 accidentally-3.083
 landed-3.072
 hit-3.046
Token ingredients
Token ID9391
Feature activation-2.88e+00

Pos logit contributions
 pleasure+5.860
 generosity+5.693
ipp+5.314
 appreciation+5.223
ums+5.208
Neg logit contributions
 bumped-4.916
 accidentally-4.661
?"-4.624
 stub-4.515
 saw-4.078
Token and
Token ID290
Feature activation-1.29e+00

Pos logit contributions
 pleasure+7.109
 generosity+6.838
 appreciation+6.509
 pride+6.463
ipp+6.362
Neg logit contributions
 bumped-3.209
 accidentally-3.080
?"-2.893
 suddenly-2.860
 stub-2.745
Token mixed
Token ID7668
Feature activation-2.55e+00

Pos logit contributions
 pleasure+2.934
 generosity+2.839
 appreciation+2.722
ipp+2.694
 pride+2.647
Neg logit contributions
 bumped-1.559
 accidentally-1.498
 stub-1.487
 tri-1.302
 landed-1.291
Token them
Token ID606
Feature activation-2.85e+00

Pos logit contributions
 generosity+6.017
 pleasure+5.998
 pride+5.490
 appreciation+5.490
ums+5.319
Neg logit contributions
 bumped-3.019
?"-2.881
 accidentally-2.864
 suddenly-2.646
 landed-2.607
Token here
Token ID994
Feature activation+7.30e-01

Pos logit contributions
 stub+4.750
 bumped+4.696
 accidentally+4.270
 Ach+4.038
 seek+4.006
Neg logit contributions
 pleasure-5.100
 generosity-4.872
 appreciation-4.671
 pride-4.580
 kindness-4.547
Token,
Token ID11
Feature activation+2.97e-01

Pos logit contributions
 stub+1.040
 bumped+1.030
 accidentally+0.951
 tri+0.885
 landed+0.853
Neg logit contributions
 pleasure-1.500
 generosity-1.460
 appreciation-1.392
 pride-1.347
ipp-1.339
Token but
Token ID475
Feature activation+6.50e-01

Pos logit contributions
 bumped+0.401
 stub+0.399
 accidentally+0.369
 tri+0.334
 landed+0.329
Neg logit contributions
 pleasure-0.621
 generosity-0.605
 appreciation-0.577
 pride-0.560
ipp-0.557
Token do
Token ID466
Feature activation+3.52e-02

Pos logit contributions
 stub+0.927
 bumped+0.920
 accidentally+0.847
 tri+0.767
 landed+0.751
Neg logit contributions
 pleasure-1.322
 generosity-1.302
 appreciation-1.229
 pride-1.185
ipp-1.181
Token not
Token ID407
Feature activation+1.00e+00

Pos logit contributions
 stub+0.049
 bumped+0.048
 accidentally+0.045
 tri+0.041
 landed+0.040
Neg logit contributions
 pleasure-0.072
 generosity-0.072
 appreciation-0.068
ipp-0.066
 pride-0.066
Token go
Token ID467
Feature activation+2.13e+00

Pos logit contributions
 bumped+1.562
 stub+1.548
 accidentally+1.462
 tri+1.342
 landed+1.341
Neg logit contributions
 pleasure-1.863
 generosity-1.827
 appreciation-1.742
 pride-1.676
 kindness-1.635
Token near
Token ID1474
Feature activation+8.16e-01

Pos logit contributions
 stub+3.241
 bumped+2.983
 accidentally+2.946
lf+2.762
Ro+2.761
Neg logit contributions
 pleasure-3.832
 generosity-3.775
 appreciation-3.686
 choices-3.644
 pride-3.627
Token it
Token ID340
Feature activation+2.52e+00

Pos logit contributions
 bumped+1.191
 stub+1.188
 accidentally+1.092
 tri+1.013
 seek+0.980
Neg logit contributions
 pleasure-1.619
 generosity-1.582
 appreciation-1.538
 pride-1.466
 decisions-1.464
Token.
Token ID13
Feature activation-1.45e-01

Pos logit contributions
 stub+3.863
 bumped+3.748
 accidentally+3.372
 tri+3.312
Who+3.215
Neg logit contributions
 pleasure-4.825
 generosity-4.744
 kinds-4.594
 appreciation-4.589
 equal-4.515
Token It
Token ID632
Feature activation-8.35e-01

Pos logit contributions
 pleasure+0.266
 generosity+0.256
 appreciation+0.243
 pride+0.236
ipp+0.236
Neg logit contributions
 bumped-0.235
 stub-0.232
 accidentally-0.222
 tri-0.206
 landed-0.203
Token might
Token ID1244
Feature activation+1.32e+00

Pos logit contributions
 pleasure+1.627
 generosity+1.590
 appreciation+1.506
ipp+1.473
 pride+1.466
Neg logit contributions
 bumped-1.238
 stub-1.227
 accidentally-1.169
 tri-1.082
?"-1.080
Token,
Token ID11
Feature activation-4.94e-01

Pos logit contributions
 pleasure+0.174
 generosity+0.170
 appreciation+0.163
ipp+0.158
 pride+0.158
Neg logit contributions
 stub-0.115
 bumped-0.112
 accidentally-0.105
 tri-0.097
 landed-0.096
Token John
Token ID1757
Feature activation+3.99e-01

Pos logit contributions
 pleasure+0.787
 generosity+0.765
 appreciation+0.719
ipp+0.696
 pride+0.693
Neg logit contributions
 stub-0.899
 bumped-0.897
 accidentally-0.857
 tri-0.806
?"-0.799
Token sat
Token ID3332
Feature activation+1.51e+00

Pos logit contributions
 stub+0.449
 bumped+0.444
 accidentally+0.398
 seek+0.375
Ro+0.374
Neg logit contributions
 pleasure-0.874
 generosity-0.860
 appreciation-0.830
ipp-0.819
 pride-0.810
Token in
Token ID287
Feature activation+3.27e-01

Pos logit contributions
 stub+2.259
 bumped+2.191
 accidentally+2.090
Ro+2.035
 tri+1.909
Neg logit contributions
 pleasure-2.840
 generosity-2.788
ipp-2.770
 appreciation-2.719
 choices-2.711
Token his
Token ID465
Feature activation-1.62e-01

Pos logit contributions
 stub+0.505
 bumped+0.500
 accidentally+0.475
 tri+0.440
 landed+0.429
Neg logit contributions
 pleasure-0.625
 generosity-0.608
 appreciation-0.585
 pride-0.567
 kinds-0.553
Token arm
Token ID3211
Feature activation+1.99e+00

Pos logit contributions
 pleasure+0.349
 generosity+0.340
 appreciation+0.326
ipp+0.318
 pride+0.317
Neg logit contributions
 bumped-0.206
 stub-0.203
 accidentally-0.194
 tri-0.176
?"-0.172
Tokenchair
Token ID16337
Feature activation+1.70e+00

Pos logit contributions
 bumped+2.630
 stub+2.549
 accidentally+2.526
Ro+2.403
Who+2.331
Neg logit contributions
 pleasure-4.050
 generosity-3.997
 kinds-3.969
 pride-3.884
ipp-3.853
Token.
Token ID13
Feature activation+1.40e-01

Pos logit contributions
 stub+2.606
 bumped+2.578
 accidentally+2.431
Ro+2.402
Raw+2.320
Neg logit contributions
 pleasure-3.131
 generosity-3.080
ipp-2.927
 appreciation-2.896
 kinds-2.892
Token He
Token ID679
Feature activation+2.27e+00

Pos logit contributions
 stub+0.247
 bumped+0.247
 accidentally+0.234
 tri+0.221
 landed+0.218
Neg logit contributions
 pleasure-0.237
 generosity-0.225
 appreciation-0.214
 pride-0.208
ipp-0.205
Token wanted
Token ID2227
Feature activation+4.81e-01

Pos logit contributions
Ro+3.049
 stub+2.976
 bumped+2.883
 seek+2.814
 Ach+2.724
Neg logit contributions
 pleasure-4.214
 generosity-4.144
 appreciation-4.036
 pride-4.035
ipp-3.963
Token to
Token ID284
Feature activation+1.13e+00

Pos logit contributions
 stub+0.620
 bumped+0.613
 accidentally+0.577
 tri+0.525
 landed+0.518
Neg logit contributions
 pleasure-1.027
 generosity-1.010
 appreciation-0.965
 pride-0.941
 kindness-0.917
Token hold
Token ID1745
Feature activation-7.65e-02

Pos logit contributions
 stub+2.851
 bumped+2.774
 accidentally+2.687
 tri+2.419
Please+2.270
Neg logit contributions
 pleasure-3.392
 generosity-3.359
 appreciation-3.166
 pride-3.111
 kindness-3.110
Token Anna
Token ID11735
Feature activation-6.44e-01

Pos logit contributions
 pleasure+0.161
 generosity+0.158
 appreciation+0.151
ipp+0.148
 pride+0.148
Neg logit contributions
 stub-0.099
 bumped-0.098
 accidentally-0.093
 tri-0.084
?"-0.083
Token.
Token ID13
Feature activation+8.26e-01

Pos logit contributions
 pleasure+1.364
 generosity+1.337
 appreciation+1.275
ipp+1.248
 pride+1.242
Neg logit contributions
 bumped-0.828
 stub-0.827
 accidentally-0.778
 tri-0.709
?"-0.703
Token It
Token ID632
Feature activation+1.92e+00

Pos logit contributions
 bumped+1.306
 stub+1.288
 accidentally+1.205
 tri+1.132
 landed+1.131
Neg logit contributions
 pleasure-1.572
 generosity-1.475
 appreciation-1.425
 pride-1.405
ipp-1.399
Token broke
Token ID6265
Feature activation+1.28e+00

Pos logit contributions
 stub+2.304
 accidentally+2.120
?"+2.099
 bumped+2.076
 seek+2.066
Neg logit contributions
 generosity-3.747
 appreciation-3.678
 kinds-3.630
 pleasure-3.609
 pride-3.534
Token and
Token ID290
Feature activation+2.32e+00

Pos logit contributions
 stub+1.848
 bumped+1.765
 accidentally+1.679
Ro+1.560
?"+1.524
Neg logit contributions
 pleasure-2.465
ipp-2.399
 generosity-2.385
 appreciation-2.373
 pride-2.320
Token fell
Token ID3214
Feature activation+1.86e+00

Pos logit contributions
 stub+3.253
 bumped+3.016
 accidentally+2.976
?"+2.952
 seek+2.899
Neg logit contributions
 generosity-4.473
 pride-4.347
 pleasure-4.341
 appreciation-4.307
 kindness-4.128
Token down
Token ID866
Feature activation+1.32e+00

Pos logit contributions
 stub+2.831
 accidentally+2.727
 bumped+2.665
 seek+2.565
Who+2.476
Neg logit contributions
 pleasure-3.284
 appreciation-3.190
ipp-3.185
 pride-3.183
 generosity-3.162
Token.
Token ID13
Feature activation+1.29e+00

Pos logit contributions
 stub+1.850
 bumped+1.697
 accidentally+1.668
 tri+1.495
?"+1.474
Neg logit contributions
 generosity-2.626
 pleasure-2.614
 appreciation-2.571
 pride-2.492
 choices-2.481
Token Anna
Token ID11735
Feature activation+9.49e-01

Pos logit contributions
 bumped+1.917
 stub+1.858
 accidentally+1.734
 seek+1.695
 landed+1.652
Neg logit contributions
 pleasure-2.574
 generosity-2.386
 appreciation-2.341
 pride-2.341
ipp-2.286
Token fell
Token ID3214
Feature activation+1.85e+00

Pos logit contributions
 stub+1.093
 bumped+1.060
 accidentally+1.024
?"+1.020
 seek+1.019
Neg logit contributions
 pleasure-1.902
 generosity-1.900
 appreciation-1.877
 pride-1.852
ipp-1.849
Token dirt
Token ID13647
Feature activation+7.01e-01

Pos logit contributions
 stub+0.083
 bumped+0.082
 accidentally+0.078
?"+0.073
 tri+0.073
Neg logit contributions
 pleasure-0.088
 generosity-0.086
 appreciation-0.083
 pride-0.081
 kinds-0.078
Token behind
Token ID2157
Feature activation+8.45e-01

Pos logit contributions
 stub+0.903
 bumped+0.861
 accidentally+0.834
Ro+0.760
?"+0.759
Neg logit contributions
 pleasure-1.429
 generosity-1.423
 appreciation-1.354
ipp-1.351
 pride-1.339
Token.
Token ID13
Feature activation-7.79e-01

Pos logit contributions
 stub+1.199
 bumped+1.178
 accidentally+1.103
 tri+1.030
Ro+1.004
Neg logit contributions
 pleasure-1.680
 generosity-1.621
 appreciation-1.576
 pride-1.550
 kinds-1.535
Token Suddenly
Token ID24975
Feature activation-2.45e-01

Pos logit contributions
 pleasure+1.314
 generosity+1.291
 appreciation+1.213
ipp+1.185
 pride+1.172
Neg logit contributions
 bumped-1.337
 stub-1.333
 accidentally-1.282
?"-1.191
 tri-1.191
Token,
Token ID11
Feature activation+2.61e-01

Pos logit contributions
 pleasure+0.512
 generosity+0.502
 appreciation+0.480
 pride+0.467
ipp+0.465
Neg logit contributions
 stub-0.322
 bumped-0.316
 accidentally-0.296
 tri-0.272
?"-0.269
Token he
Token ID339
Feature activation+1.95e+00

Pos logit contributions
 stub+0.386
 bumped+0.369
 accidentally+0.348
 tri+0.326
?"+0.319
Neg logit contributions
 pleasure-0.513
 generosity-0.501
 appreciation-0.480
 pride-0.467
ipp-0.455
Token heard
Token ID2982
Feature activation+5.43e-01

Pos logit contributions
Ro+1.670
?"+1.662
 stub+1.655
 bumped+1.530
 seek+1.475
Neg logit contributions
 pride-4.488
 appreciation-4.431
 pleasure-4.421
 generosity-4.407
ipp-4.361
Token his
Token ID465
Feature activation+6.59e-01

Pos logit contributions
 stub+0.801
 bumped+0.784
 accidentally+0.743
 tri+0.686
?"+0.678
Neg logit contributions
 pleasure-1.061
 generosity-1.044
 appreciation-1.012
 pride-0.989
 kindness-0.939
Token mom
Token ID1995
Feature activation+5.47e-02

Pos logit contributions
 bumped+1.053
 stub+1.030
 accidentally+1.012
 landed+0.946
?"+0.943
Neg logit contributions
 pleasure-1.238
 generosity-1.188
 appreciation-1.149
 pride-1.113
 choices-1.075
Token calling
Token ID4585
Feature activation+2.70e-01

Pos logit contributions
 stub+0.070
 bumped+0.070
 accidentally+0.065
?"+0.061
 tri+0.059
Neg logit contributions
 pleasure-0.113
 generosity-0.112
 appreciation-0.107
 pride-0.104
ipp-0.103
Token him
Token ID683
Feature activation+2.06e-01

Pos logit contributions
 stub+0.448
 bumped+0.440
 accidentally+0.422
 tri+0.391
 landed+0.387
Neg logit contributions
 pleasure-0.475
 generosity-0.466
 appreciation-0.447
 pride-0.433
ipp-0.418
Token.
Token ID13
Feature activation-1.09e-03

Pos logit contributions
 pleasure+1.447
 generosity+1.422
 appreciation+1.366
ipp+1.340
 pride+1.335
Neg logit contributions
 bumped-0.586
 stub-0.585
 accidentally-0.540
 tri-0.476
?"-0.470
Token As
Token ID1081
Feature activation+1.03e+00

Pos logit contributions
 pleasure+0.002
 generosity+0.002
 appreciation+0.002
ipp+0.002
 pride+0.002
Neg logit contributions
 bumped-0.002
 stub-0.002
 accidentally-0.002
 tri-0.002
?"-0.002
Token he
Token ID339
Feature activation+6.70e-01

Pos logit contributions
 stub+1.459
 bumped+1.397
 accidentally+1.280
?"+1.223
 landed+1.195
Neg logit contributions
 pleasure-2.057
 generosity-2.004
 appreciation-1.879
 pride-1.871
ipp-1.823
Token flew
Token ID13112
Feature activation+2.56e-01

Pos logit contributions
 stub+0.743
 bumped+0.686
 accidentally+0.672
?"+0.614
 tri+0.583
Neg logit contributions
 generosity-1.487
 pleasure-1.483
 appreciation-1.445
ipp-1.417
 kinds-1.403
Token,
Token ID11
Feature activation+9.70e-01

Pos logit contributions
 stub+0.373
 bumped+0.362
 accidentally+0.349
 tri+0.317
?"+0.311
Neg logit contributions
 pleasure-0.504
 generosity-0.486
 appreciation-0.466
 pride-0.459
ipp-0.457
Token he
Token ID339
Feature activation+2.18e+00

Pos logit contributions
 stub+1.331
 bumped+1.267
 accidentally+1.201
?"+1.154
 tri+1.102
Neg logit contributions
 pleasure-1.935
 generosity-1.894
 pride-1.768
 appreciation-1.758
 choices-1.722
Token could
Token ID714
Feature activation+3.57e-01

Pos logit contributions
 stub+2.037
?"+1.906
Ro+1.884
 Ach+1.880
 seek+1.868
Neg logit contributions
 generosity-4.737
 pleasure-4.737
ipp-4.701
 appreciation-4.641
 pride-4.584
Token see
Token ID766
Feature activation-3.14e-01

Pos logit contributions
 stub+0.519
 bumped+0.516
 accidentally+0.494
Ro+0.453
?"+0.450
Neg logit contributions
 pleasure-0.687
 generosity-0.661
 appreciation-0.635
ipp-0.620
 pride-0.616
Token all
Token ID477
Feature activation-2.44e+00

Pos logit contributions
 pleasure+0.584
 generosity+0.571
 appreciation+0.542
 pride+0.526
ipp+0.524
Neg logit contributions
 stub-0.487
 bumped-0.484
 accidentally-0.461
 tri-0.427
?"-0.422
Token the
Token ID262
Feature activation-1.03e+00

Pos logit contributions
 pleasure+5.665
 generosity+5.469
izza+5.333
ipp+5.172
 appreciation+5.102
Neg logit contributions
 bumped-3.017
 stub-2.953
 landed-2.799
 accidentally-2.773
?"-2.722
Token bright
Token ID6016
Feature activation-1.10e+00

Pos logit contributions
 pleasure+1.973
 generosity+1.931
 appreciation+1.828
ipp+1.812
 pride+1.769
Neg logit contributions
 bumped-1.538
 stub-1.527
 accidentally-1.454
 tri-1.349
?"-1.345
Token watch
Token ID2342
Feature activation+2.10e-01

Pos logit contributions
 stub+0.731
 bumped+0.719
 accidentally+0.687
 tri+0.640
 landed+0.605
Neg logit contributions
 pleasure-0.966
 generosity-0.948
 appreciation-0.899
 pride-0.867
ipp-0.846
Token the
Token ID262
Feature activation+3.58e-02

Pos logit contributions
 stub+0.315
 bumped+0.313
 accidentally+0.294
 tri+0.271
 seek+0.264
Neg logit contributions
 pleasure-0.401
 generosity-0.397
 appreciation-0.375
 pride-0.365
ipp-0.361
Token bird
Token ID6512
Feature activation-6.57e-01

Pos logit contributions
 bumped+0.052
 stub+0.051
 accidentally+0.049
 tri+0.045
?"+0.043
Neg logit contributions
 pleasure-0.071
 generosity-0.070
 appreciation-0.067
 pride-0.064
ipp-0.064
Token.
Token ID13
Feature activation-7.16e-02

Pos logit contributions
 pleasure+1.482
 generosity+1.452
 appreciation+1.390
ipp+1.363
 pride+1.357
Neg logit contributions
 bumped-0.758
 stub-0.757
 accidentally-0.708
 tri-0.636
?"-0.633
Token It
Token ID632
Feature activation+6.10e-01

Pos logit contributions
 pleasure+0.139
 generosity+0.133
 appreciation+0.127
ipp+0.124
 pride+0.124
Neg logit contributions
 bumped-0.109
 stub-0.108
 accidentally-0.102
 tri-0.095
 landed-0.093
Token was
Token ID373
Feature activation+2.15e-01

Pos logit contributions
 stub+0.707
 bumped+0.677
 accidentally+0.650
?"+0.586
 tri+0.577
Neg logit contributions
 generosity-1.318
 pleasure-1.309
 appreciation-1.272
 kinds-1.233
ipp-1.226
Token new
Token ID649
Feature activation-3.18e-01

Pos logit contributions
 stub+0.363
 bumped+0.362
 accidentally+0.342
 tri+0.318
?"+0.313
Neg logit contributions
 pleasure-0.373
 generosity-0.359
 appreciation-0.342
 pride-0.332
ipp-0.328
Token to
Token ID284
Feature activation+9.12e-01

Pos logit contributions
 pleasure+0.650
 generosity+0.638
 appreciation+0.610
ipp+0.590
 pride+0.589
Neg logit contributions
 stub-0.438
 bumped-0.437
 accidentally-0.414
 tri-0.376
?"-0.369
Token her
Token ID607
Feature activation+1.01e-01

Pos logit contributions
 stub+1.359
 bumped+1.336
 accidentally+1.250
 tri+1.152
Please+1.076
Neg logit contributions
 pleasure-1.849
 generosity-1.785
 appreciation-1.703
 pride-1.666
ipp-1.635
Token.
Token ID13
Feature activation+2.34e-01

Pos logit contributions
 bumped+0.156
 stub+0.155
 accidentally+0.147
 tri+0.135
 landed+0.133
Neg logit contributions
 pleasure-0.195
 generosity-0.190
 appreciation-0.180
 pride-0.173
ipp-0.171
Token She
Token ID1375
Feature activation+4.45e-01

Pos logit contributions
 bumped+0.395
 stub+0.388
 accidentally+0.365
 tri+0.345
 landed+0.340
Neg logit contributions
 pleasure-0.426
 generosity-0.402
 appreciation-0.383
ipp-0.374
 pride-0.373
Token.
Token ID13
Feature activation-1.06e+00

Pos logit contributions
 pleasure+1.975
 generosity+1.939
 appreciation+1.863
ipp+1.831
 pride+1.816
Neg logit contributions
 stub-0.835
 bumped-0.834
 accidentally-0.771
 tri-0.681
?"-0.679
Token From
Token ID3574
Feature activation+1.38e+00

Pos logit contributions
 pleasure+2.157
 generosity+2.130
 appreciation+2.014
ipp+1.956
 pride+1.949
Neg logit contributions
 stub-1.473
 bumped-1.451
 accidentally-1.395
?"-1.278
 tri-1.266
Token that
Token ID326
Feature activation-1.50e+00

Pos logit contributions
 stub+2.514
 bumped+2.465
 accidentally+2.330
Raw+2.173
Ro+2.169
Neg logit contributions
 generosity-2.281
 pleasure-2.246
 appreciation-2.158
 pride-2.090
ipp-2.073
Token day
Token ID1110
Feature activation-5.44e-02

Pos logit contributions
 pleasure+2.930
 generosity+2.860
 appreciation+2.694
ipp+2.677
 pride+2.638
Neg logit contributions
 stub-2.312
 bumped-2.268
 accidentally-2.163
?"-2.066
 tri-1.989
Token on
Token ID319
Feature activation+3.83e-01

Pos logit contributions
 pleasure+0.128
 generosity+0.125
 appreciation+0.121
ipp+0.120
 pride+0.119
Neg logit contributions
 stub-0.056
 bumped-0.055
 accidentally-0.051
 tri-0.045
?"-0.044
Token,
Token ID11
Feature activation-1.52e-02

Pos logit contributions
 stub+0.556
 bumped+0.552
 accidentally+0.505
Ro+0.479
 tri+0.479
Neg logit contributions
 pleasure-0.742
 generosity-0.726
 appreciation-0.692
ipp-0.688
 pride-0.672
Token Lily
Token ID20037
Feature activation-9.98e-01

Pos logit contributions
 pleasure+0.039
 generosity+0.039
 appreciation+0.037
ipp+0.037
 pride+0.037
Neg logit contributions
 stub-0.013
 bumped-0.013
 accidentally-0.011
 tri-0.010
?"-0.009
Token always
Token ID1464
Feature activation+5.07e-01

Pos logit contributions
 pleasure+2.047
 generosity+1.996
 appreciation+1.899
ipp+1.841
 pride+1.837
Neg logit contributions
 bumped-1.390
 stub-1.384
 accidentally-1.320
 tri-1.208
?"-1.192
Token stayed
Token ID9658
Feature activation-6.84e-02

Pos logit contributions
 stub+0.651
 bumped+0.632
 accidentally+0.602
?"+0.552
Ro+0.539
Neg logit contributions
 pleasure-1.068
 generosity-1.058
 appreciation-1.008
 pride-0.984
ipp-0.973
Token close
Token ID1969
Feature activation-8.55e-01

Pos logit contributions
 pleasure+0.124
 generosity+0.121
 appreciation+0.116
 pride+0.112
ipp+0.111
Neg logit contributions
 stub-0.110
 bumped-0.109
 accidentally-0.103
 tri-0.096
 landed-0.094
Token to
Token ID284
Feature activation+9.56e-01

Pos logit contributions
 pleasure+2.008
 generosity+1.975
 appreciation+1.892
ipp+1.854
 pride+1.844
Neg logit contributions
 bumped-0.910
 stub-0.909
 accidentally-0.842
 tri-0.754
?"-0.746
Token and
Token ID290
Feature activation+1.94e+00

Pos logit contributions
 pleasure+0.134
 generosity+0.133
 appreciation+0.127
ipp+0.123
 pride+0.122
Neg logit contributions
 stub-0.100
 bumped-0.099
 accidentally-0.093
 tri-0.085
?"-0.085
Token laughed
Token ID13818
Feature activation+1.04e+00

Pos logit contributions
 stub+2.878
 bumped+2.488
 Ach+2.381
jar+2.359
 seek+2.355
Neg logit contributions
 generosity-3.708
 kinds-3.563
 appreciation-3.475
 pleasure-3.434
 pride-3.373
Token.
Token ID13
Feature activation+1.47e-01

Pos logit contributions
 stub+1.477
 bumped+1.450
 accidentally+1.333
 seek+1.217
 tri+1.215
Neg logit contributions
 pleasure-2.063
 generosity-2.042
 appreciation-1.960
 pride-1.938
ipp-1.912
Token Sue
Token ID26089
Feature activation+6.24e-01

Pos logit contributions
 stub+0.227
 bumped+0.221
 accidentally+0.208
 tri+0.195
 landed+0.189
Neg logit contributions
 pleasure-0.277
 generosity-0.270
 appreciation-0.255
ipp-0.251
 pride-0.247
Token did
Token ID750
Feature activation+5.42e-01

Pos logit contributions
 stub+0.746
 bumped+0.697
 accidentally+0.636
 seek+0.618
Ro+0.606
Neg logit contributions
 generosity-1.309
 pleasure-1.285
 appreciation-1.254
ipp-1.253
 pride-1.222
Token not
Token ID407
Feature activation+1.17e-01

Pos logit contributions
 stub+0.766
 bumped+0.721
 accidentally+0.682
Ro+0.628
 tri+0.622
Neg logit contributions
 pleasure-1.087
 generosity-1.080
 appreciation-1.038
 pride-0.998
 choices-0.997
Token care
Token ID1337
Feature activation+7.60e-01

Pos logit contributions
 stub+0.208
 bumped+0.206
 accidentally+0.199
 tri+0.185
 landed+0.183
Neg logit contributions
 pleasure-0.190
 generosity-0.187
 appreciation-0.176
 pride-0.172
ipp-0.168
Token about
Token ID546
Feature activation+6.33e-01

Pos logit contributions
 stub+1.209
 bumped+1.188
 accidentally+1.099
 seek+1.051
 landed+1.044
Neg logit contributions
 pleasure-1.342
 generosity-1.335
 appreciation-1.273
ipp-1.264
 choices-1.241
Token the
Token ID262
Feature activation+5.01e-01

Pos logit contributions
 stub+0.991
 bumped+0.962
 accidentally+0.892
 tri+0.834
Ro+0.834
Neg logit contributions
 pleasure-1.201
 generosity-1.186
 appreciation-1.111
 choices-1.078
 pride-1.069
Token sign
Token ID1051
Feature activation+6.54e-01

Pos logit contributions
 stub+0.712
 bumped+0.695
 accidentally+0.676
 tri+0.613
?"+0.594
Neg logit contributions
 pleasure-1.009
 generosity-0.986
 appreciation-0.944
 choices-0.923
 pride-0.910
Token.
Token ID13
Feature activation+5.55e-02

Pos logit contributions
 stub+0.836
 bumped+0.806
 accidentally+0.751
Ro+0.686
 tri+0.686
Neg logit contributions
 generosity-1.383
 pleasure-1.382
 appreciation-1.311
 pride-1.288
 choices-1.280
Token gave
Token ID2921
Feature activation-1.35e-01

Pos logit contributions
 stub+0.999
 bumped+0.976
 accidentally+0.905
?"+0.885
 seek+0.866
Neg logit contributions
 generosity-1.251
 pleasure-1.239
 appreciation-1.206
 pride-1.182
ipp-1.134
Token Max
Token ID5436
Feature activation-3.27e+00

Pos logit contributions
 pleasure+0.286
 generosity+0.280
 appreciation+0.267
 pride+0.264
ipp+0.260
Neg logit contributions
 bumped-0.175
 stub-0.174
 accidentally-0.162
 tri-0.149
?"-0.148
Token a
Token ID257
Feature activation-7.48e-01

Pos logit contributions
 pleasure+8.894
 generosity+8.523
ums+8.482
ipp+8.247
izza+8.183
Neg logit contributions
 bumped-2.952
 stub-2.860
 accidentally-2.524
 landed-2.432
 tri-2.367
Token big
Token ID1263
Feature activation-1.17e+00

Pos logit contributions
 pleasure+1.412
 generosity+1.382
 appreciation+1.308
ipp+1.287
 pride+1.269
Neg logit contributions
 stub-1.138
 bumped-1.135
 accidentally-1.076
 tri-0.995
?"-0.990
Token hug
Token ID16225
Feature activation+4.29e-01

Pos logit contributions
 pleasure+1.853
 generosity+1.818
ipp+1.701
 appreciation+1.696
ums+1.639
Neg logit contributions
 stub-2.145
 bumped-2.144
 accidentally-2.032
?"-1.930
 tri-1.922
Token and
Token ID290
Feature activation+9.45e-01

Pos logit contributions
 stub+0.693
 bumped+0.690
 accidentally+0.657
Ro+0.609
 tri+0.604
Neg logit contributions
 pleasure-0.763
 generosity-0.751
 appreciation-0.704
 pride-0.692
ipp-0.681
Token told
Token ID1297
Feature activation-8.24e-02

Pos logit contributions
 stub+1.384
 bumped+1.360
 accidentally+1.235
?"+1.188
 tri+1.180
Neg logit contributions
 generosity-1.875
 pleasure-1.845
 appreciation-1.777
 pride-1.725
 kindness-1.673
Token him
Token ID683
Feature activation-1.41e+00

Pos logit contributions
 pleasure+0.140
 generosity+0.138
 appreciation+0.130
 pride+0.127
ipp+0.123
Neg logit contributions
 stub-0.142
 bumped-0.142
 accidentally-0.134
 tri-0.126
?"-0.125
Token he
Token ID339
Feature activation-4.89e-01

Pos logit contributions
 pleasure+3.288
 generosity+3.176
 appreciation+3.056
ipp+3.049
 pride+2.989
Neg logit contributions
 bumped-1.580
 stub-1.544
 accidentally-1.463
 tri-1.306
?"-1.293
Token was
Token ID373
Feature activation+1.92e-02

Pos logit contributions
 pleasure+1.145
 generosity+1.126
 appreciation+1.079
ipp+1.058
 pride+1.054
Neg logit contributions
 stub-0.522
 bumped-0.521
 accidentally-0.482
 tri-0.429
?"-0.428
Token very
Token ID845
Feature activation-1.40e-01

Pos logit contributions
 bumped+0.034
 stub+0.034
 accidentally+0.032
 tri+0.031
Ro+0.030
Neg logit contributions
 pleasure-0.032
 generosity-0.030
 appreciation-0.029
 pride-0.028
ipp-0.028
Token She
Token ID1375
Feature activation+3.75e-01

Pos logit contributions
 bumped+2.119
 stub+2.030
 accidentally+1.965
 seek+1.911
 landed+1.846
Neg logit contributions
 pleasure-2.336
ipp-2.108
 generosity-2.103
ums-2.068
 appreciation-2.064
Token looks
Token ID3073
Feature activation+5.00e-01

Pos logit contributions
 stub+0.617
 bumped+0.595
 accidentally+0.580
 seek+0.555
?"+0.527
Neg logit contributions
 pleasure-0.645
 generosity-0.624
 appreciation-0.602
 pride-0.592
ipp-0.586
Token for
Token ID329
Feature activation+7.95e-01

Pos logit contributions
 stub+0.804
 bumped+0.783
 accidentally+0.753
 seek+0.715
Ro+0.708
Neg logit contributions
 pleasure-0.900
 generosity-0.856
 appreciation-0.835
 pride-0.828
ipp-0.792
Token another
Token ID1194
Feature activation+1.19e+00

Pos logit contributions
 stub+1.168
 bumped+1.148
 accidentally+1.095
 tri+0.978
?"+0.976
Neg logit contributions
 pleasure-1.617
 generosity-1.578
 appreciation-1.487
 pride-1.478
 kindness-1.434
Token way
Token ID835
Feature activation+7.31e-01

Pos logit contributions
 stub+1.802
 bumped+1.784
 accidentally+1.721
 tri+1.593
Please+1.571
Neg logit contributions
 pleasure-2.308
 generosity-2.146
 appreciation-2.141
 kinds-1.984
 pride-1.976
Token down
Token ID866
Feature activation+1.80e+00

Pos logit contributions
 stub+0.953
 bumped+0.943
 accidentally+0.901
Ro+0.806
 seek+0.795
Neg logit contributions
 pleasure-1.523
 generosity-1.438
 appreciation-1.416
ipp-1.409
 pride-1.374
Token.
Token ID13
Feature activation+1.13e+00

Pos logit contributions
 stub+2.712
 bumped+2.480
 accidentally+2.462
Ro+2.320
 Ach+2.241
Neg logit contributions
 pleasure-3.496
 generosity-3.325
 appreciation-3.319
 kinds-3.294
 pride-3.248
Token
Token ID198
Feature activation+6.47e-01

Pos logit contributions
 bumped+1.809
 stub+1.750
 accidentally+1.682
 seek+1.583
 landed+1.578
Neg logit contributions
 pleasure-2.161
 generosity-1.967
ipp-1.948
 appreciation-1.907
ums-1.891
Token
Token ID198
Feature activation+3.37e-01

Pos logit contributions
 bumped+0.975
 stub+0.958
 accidentally+0.923
 tri+0.835
 seek+0.831
Neg logit contributions
 pleasure-1.234
 generosity-1.195
 appreciation-1.177
 pride-1.129
ipp-1.125
TokenShe
Token ID3347
Feature activation+6.57e-01

Pos logit contributions
 stub+0.471
 bumped+0.471
 accidentally+0.443
 tri+0.398
?"+0.395
Neg logit contributions
 pleasure-0.662
 generosity-0.653
 appreciation-0.635
ipp-0.630
 pride-0.612
Token sees
Token ID7224
Feature activation-2.01e-01

Pos logit contributions
 stub+0.882
 bumped+0.823
 accidentally+0.814
 seek+0.770
?"+0.726
Neg logit contributions
 pleasure-1.317
 generosity-1.285
 appreciation-1.252
 pride-1.239
ipp-1.218
Token.
Token ID13
Feature activation-1.24e+00

Pos logit contributions
 pleasure+2.437
 generosity+2.383
 appreciation+2.291
ipp+2.251
 pride+2.243
Neg logit contributions
 bumped-1.008
 stub-0.992
 accidentally-0.931
 tri-0.825
?"-0.817
Token "
Token ID366
Feature activation-7.57e-01

Pos logit contributions
 pleasure+2.389
 generosity+2.352
 appreciation+2.238
ipp+2.188
 pride+2.161
Neg logit contributions
 bumped-1.846
 stub-1.826
 accidentally-1.768
?"-1.626
 tri-1.621
TokenThat
Token ID2504
Feature activation-9.47e-01

Pos logit contributions
 pleasure+1.949
 generosity+1.914
 appreciation+1.840
ipp+1.811
 pride+1.802
Neg logit contributions
 bumped-0.641
 stub-0.640
 accidentally-0.582
 tri-0.500
?"-0.490
Token looks
Token ID3073
Feature activation-6.69e-01

Pos logit contributions
 pleasure+1.778
 generosity+1.739
 appreciation+1.646
ipp+1.615
 pride+1.609
Neg logit contributions
 bumped-1.459
 stub-1.454
 accidentally-1.391
?"-1.295
 tri-1.283
Token dangerous
Token ID4923
Feature activation-4.97e-01

Pos logit contributions
 pleasure+1.187
 generosity+1.162
 appreciation+1.098
ipp+1.072
 pride+1.063
Neg logit contributions
 bumped-1.089
 stub-1.086
 accidentally-1.038
 tri-0.966
?"-0.960
Token,
Token ID11
Feature activation-1.32e+00

Pos logit contributions
 pleasure+1.092
 generosity+1.071
 appreciation+1.024
ipp+1.002
 pride+0.997
Neg logit contributions
 stub-0.599
 bumped-0.598
 accidentally-0.559
 tri-0.506
?"-0.498
Token little
Token ID1310
Feature activation-1.31e+00

Pos logit contributions
 pleasure+2.550
 generosity+2.511
 appreciation+2.393
ipp+2.364
 pride+2.334
Neg logit contributions
 bumped-1.912
 stub-1.901
 accidentally-1.856
?"-1.731
 tri-1.700
Token one
Token ID530
Feature activation-1.64e-01

Pos logit contributions
 pleasure+2.074
 generosity+2.019
 appreciation+1.921
ipp+1.882
 pride+1.848
Neg logit contributions
 stub-2.353
 bumped-2.352
 accidentally-2.278
?"-2.164
 tri-2.122
Token,"
Token ID553
Feature activation-5.34e-01

Pos logit contributions
 pleasure+0.343
 generosity+0.336
 appreciation+0.321
 pride+0.310
ipp+0.310
Neg logit contributions
 stub-0.216
 bumped-0.215
 accidentally-0.202
 tri-0.185
 landed-0.179
Token he
Token ID339
Feature activation+6.90e-02

Pos logit contributions
 pleasure+1.190
 generosity+1.169
 appreciation+1.118
ipp+1.093
 pride+1.089
Neg logit contributions
 stub-0.628
 bumped-0.628
 accidentally-0.586
 tri-0.529
?"-0.523
Token said
Token ID531
Feature activation-5.00e-01

Pos logit contributions
 stub+0.086
 bumped+0.084
 accidentally+0.078
?"+0.072
 tri+0.071
Neg logit contributions
 generosity-0.146
 pleasure-0.146
 appreciation-0.140
ipp-0.135
 pride-0.135
TokenYou
Token ID1639
Feature activation-2.09e+00

Pos logit contributions
 pleasure+3.309
 generosity+3.277
 appreciation+3.152
 choices+3.064
 decisions+3.045
Neg logit contributions
 stub-1.869
 bumped-1.805
 accidentally-1.721
 suddenly-1.558
 tri-1.553
Token're
Token ID821
Feature activation-1.97e+00

Pos logit contributions
 generosity+3.601
 pleasure+3.554
ums+3.398
 appreciation+3.343
ipp+3.286
Neg logit contributions
 bumped-3.643
 accidentally-3.518
 stub-3.516
?"-3.376
 tri-3.259
Token so
Token ID523
Feature activation-3.15e+00

Pos logit contributions
 generosity+3.280
 pleasure+3.179
 appreciation+3.096
izza+3.014
ums+2.954
Neg logit contributions
 bumped-3.382
 stub-3.312
 accidentally-3.275
?"-3.273
 seek-3.081
Token pretty
Token ID2495
Feature activation-4.87e-01

Pos logit contributions
 generosity+5.295
 decisions+5.088
 appreciation+5.068
 choices+4.994
 pleasure+4.989
Neg logit contributions
 bumped-5.681
 stub-5.496
?"-5.468
 seek-5.408
 accidentally-5.397
Token and
Token ID290
Feature activation-1.26e+00

Pos logit contributions
 pleasure+1.063
 generosity+1.042
 appreciation+0.995
ipp+0.975
 pride+0.969
Neg logit contributions
 bumped-0.598
 stub-0.598
 accidentally-0.559
 tri-0.507
?"-0.498
Token you
Token ID345
Feature activation-1.35e+00

Pos logit contributions
 pleasure+2.423
 generosity+2.389
 appreciation+2.294
ipp+2.271
ums+2.212
Neg logit contributions
 bumped-1.804
 stub-1.800
 accidentally-1.729
?"-1.617
 seek-1.585
Token put
Token ID1234
Feature activation-1.11e+00

Pos logit contributions
 pleasure+2.469
 generosity+2.445
 appreciation+2.304
ipp+2.245
ums+2.234
Neg logit contributions
 bumped-2.137
 stub-2.087
 accidentally-2.026
?"-1.911
 tri-1.878
Token the
Token ID262
Feature activation-1.95e+00

Pos logit contributions
 pleasure+2.379
 generosity+2.356
 appreciation+2.237
ipp+2.200
 pride+2.154
Neg logit contributions
 bumped-1.397
 stub-1.368
 accidentally-1.313
?"-1.220
 landed-1.183
Token pieces
Token ID5207
Feature activation-8.64e-01

Pos logit contributions
 pleasure+3.341
 generosity+3.295
 appreciation+3.145
ipp+3.129
ums+3.035
Neg logit contributions
 stub-3.349
 bumped-3.343
 accidentally-3.167
 tri-2.960
?"-2.960
Token together
Token ID1978
Feature activation+2.64e-01

Pos logit contributions
 pleasure+1.855
 generosity+1.823
 appreciation+1.737
ipp+1.699
 pride+1.684
Neg logit contributions
 bumped-1.084
 stub-1.081
 accidentally-1.014
?"-0.932
 tri-0.921
Token so
Token ID523
Feature activation-7.47e-01

Pos logit contributions
 stub+0.360
 bumped+0.357
 accidentally+0.334
 tri+0.308
Ro+0.299
Neg logit contributions
 pleasure-0.551
 generosity-0.530
 appreciation-0.510
ipp-0.504
 pride-0.497
Token time
Token ID640
Feature activation+1.86e-02

Pos logit contributions
 generosity+2.454
 pleasure+2.452
 appreciation+2.307
 pride+2.268
ipp+2.250
Neg logit contributions
 stub-1.594
 bumped-1.577
 accidentally-1.506
?"-1.382
 tri-1.377
Token,
Token ID11
Feature activation+7.51e-01

Pos logit contributions
 stub+0.025
 bumped+0.025
 accidentally+0.024
 landed+0.022
 tri+0.022
Neg logit contributions
 pleasure-0.038
 generosity-0.036
 appreciation-0.035
ipp-0.034
 pride-0.034
Token there
Token ID612
Feature activation-2.35e-02

Pos logit contributions
 bumped+0.720
 stub+0.711
 accidentally+0.612
 tri+0.569
Ro+0.542
Neg logit contributions
 pleasure-1.843
 generosity-1.736
 appreciation-1.712
ipp-1.679
 pride-1.641
Token was
Token ID373
Feature activation-6.53e-01

Pos logit contributions
 pleasure+0.055
 generosity+0.054
 appreciation+0.052
ipp+0.051
 pride+0.050
Neg logit contributions
 bumped-0.025
 stub-0.025
 accidentally-0.022
 tri-0.020
?"-0.020
Token a
Token ID257
Feature activation+2.52e-01

Pos logit contributions
 pleasure+1.405
 generosity+1.381
 appreciation+1.318
ipp+1.289
 pride+1.283
Neg logit contributions
 stub-0.819
 bumped-0.818
 accidentally-0.770
 tri-0.698
?"-0.694
Token little
Token ID1310
Feature activation-9.92e-01

Pos logit contributions
 bumped+0.439
 stub+0.432
 accidentally+0.406
 landed+0.386
 tri+0.385
Neg logit contributions
 pleasure-0.421
 generosity-0.405
 appreciation-0.380
ipp-0.369
 pride-0.366
Token chicken
Token ID9015
Feature activation+6.65e-01

Pos logit contributions
 pleasure+2.072
 generosity+2.040
 appreciation+1.953
ipp+1.906
 pride+1.897
Neg logit contributions
 stub-1.317
 bumped-1.307
 accidentally-1.245
?"-1.140
 tri-1.135
Token named
Token ID3706
Feature activation-5.51e-01

Pos logit contributions
 bumped+0.728
 stub+0.703
 accidentally+0.657
 tri+0.583
 landed+0.572
Neg logit contributions
 generosity-1.505
 pleasure-1.499
 appreciation-1.456
ipp-1.421
ums-1.405
Token Ro
Token ID5564
Feature activation+3.90e-01

Pos logit contributions
 pleasure+1.132
 generosity+1.108
 appreciation+1.056
ipp+1.033
 pride+1.027
Neg logit contributions
 bumped-0.753
 stub-0.751
 accidentally-0.708
 tri-0.649
?"-0.643
Tokenoster
Token ID6197
Feature activation-1.04e-01

Pos logit contributions
 stub+0.682
 bumped+0.678
 accidentally+0.658
 landed+0.605
?"+0.605
Neg logit contributions
 pleasure-0.644
 generosity-0.622
ipp-0.584
 appreciation-0.576
izza-0.560
Token.
Token ID13
Feature activation+2.39e-01

Pos logit contributions
 pleasure+0.239
 generosity+0.237
 appreciation+0.227
ipp+0.224
 pride+0.220
Neg logit contributions
 stub-0.111
 bumped-0.110
 accidentally-0.102
 tri-0.090
 landed-0.088
Token,
Token ID11
Feature activation-1.82e-01

Pos logit contributions
 pleasure+0.858
 generosity+0.843
 appreciation+0.804
ipp+0.785
 pride+0.782
Neg logit contributions
 stub-0.540
 bumped-0.538
 accidentally-0.505
 tri-0.462
?"-0.456
Token "
Token ID366
Feature activation-1.13e+00

Pos logit contributions
 pleasure+0.380
 generosity+0.373
 appreciation+0.355
ipp+0.348
 pride+0.346
Neg logit contributions
 stub-0.239
 bumped-0.238
 accidentally-0.222
 tri-0.204
?"-0.202
TokenWell
Token ID5779
Feature activation-6.22e-01

Pos logit contributions
 pleasure+2.562
 generosity+2.517
 appreciation+2.408
ipp+2.347
 pride+2.344
Neg logit contributions
 stub-1.313
 bumped-1.297
 accidentally-1.220
 tri-1.098
 landed-1.074
Token,
Token ID11
Feature activation-1.18e+00

Pos logit contributions
 pleasure+1.242
 generosity+1.217
 appreciation+1.157
ipp+1.130
 pride+1.124
Neg logit contributions
 bumped-0.894
 stub-0.892
 accidentally-0.845
 tri-0.778
?"-0.774
Token being
Token ID852
Feature activation-7.64e-01

Pos logit contributions
 pleasure+2.675
 generosity+2.635
 appreciation+2.527
ipp+2.505
 pride+2.473
Neg logit contributions
 bumped-1.312
 stub-1.304
 accidentally-1.239
?"-1.131
 tri-1.119
Token a
Token ID257
Feature activation-4.86e-01

Pos logit contributions
 pleasure+1.198
 generosity+1.168
 appreciation+1.093
ipp+1.065
 pride+1.050
Neg logit contributions
 bumped-1.407
 stub-1.404
 accidentally-1.350
 tri-1.266
?"-1.261
Token king
Token ID5822
Feature activation-1.14e-01

Pos logit contributions
 pleasure+0.819
 generosity+0.797
 appreciation+0.752
ipp+0.728
 pride+0.727
Neg logit contributions
 stub-0.839
 bumped-0.839
 accidentally-0.801
 tri-0.749
?"-0.744
Token isn
Token ID2125
Feature activation-9.24e-01

Pos logit contributions
 pleasure+0.231
 generosity+0.226
 appreciation+0.215
ipp+0.210
 pride+0.209
Neg logit contributions
 stub-0.155
 bumped-0.155
 accidentally-0.145
 tri-0.133
 landed-0.130
Token't
Token ID470
Feature activation-9.11e-02

Pos logit contributions
 pleasure+2.399
 generosity+2.365
 appreciation+2.269
 pride+2.214
ipp+2.207
Neg logit contributions
 bumped-0.775
 stub-0.772
 accidentally-0.703
?"-0.613
 tri-0.606
Token easy
Token ID2562
Feature activation+4.31e-01

Pos logit contributions
 pleasure+0.168
 generosity+0.164
 appreciation+0.155
 pride+0.151
ipp+0.150
Neg logit contributions
 stub-0.143
 bumped-0.142
 accidentally-0.133
 tri-0.125
?"-0.122
Token.
Token ID13
Feature activation-1.12e+00

Pos logit contributions
 stub+0.572
 bumped+0.563
 accidentally+0.525
 landed+0.484
 tri+0.484
Neg logit contributions
 pleasure-0.906
 generosity-0.890
 appreciation-0.845
 choices-0.822
 pride-0.818
Token
Token ID198
Feature activation-3.50e-02

Pos logit contributions
 pleasure+2.627
 generosity+2.618
 appreciation+2.449
ipp+2.390
 choices+2.363
Neg logit contributions
 stub-2.103
 bumped-2.092
 accidentally-2.045
?"-1.909
 tri-1.855
Token
Token ID198
Feature activation-9.20e-01

Pos logit contributions
 pleasure+0.071
 generosity+0.070
 appreciation+0.067
ipp+0.065
 pride+0.065
Neg logit contributions
 bumped-0.048
 stub-0.047
 accidentally-0.045
 tri-0.041
 landed-0.040
TokenFrom
Token ID4863
Feature activation+8.00e-01

Pos logit contributions
 pleasure+1.707
 generosity+1.658
 appreciation+1.559
ipp+1.523
 pride+1.515
Neg logit contributions
 bumped-1.453
 stub-1.451
 accidentally-1.393
 tri-1.284
?"-1.281
Token then
Token ID788
Feature activation-4.79e-01

Pos logit contributions
 stub+1.307
 bumped+1.269
 accidentally+1.193
 tri+1.093
Ro+1.071
Neg logit contributions
 pleasure-1.479
 generosity-1.476
 appreciation-1.392
 pride-1.352
ipp-1.336
Token on
Token ID319
Feature activation-2.27e-01

Pos logit contributions
 pleasure+1.172
 generosity+1.153
 appreciation+1.109
ipp+1.089
 pride+1.085
Neg logit contributions
 stub-0.458
 bumped-0.458
 accidentally-0.421
 tri-0.368
?"-0.363
Token,
Token ID11
Feature activation-1.63e+00

Pos logit contributions
 pleasure+0.474
 generosity+0.465
 appreciation+0.444
ipp+0.434
 pride+0.431
Neg logit contributions
 stub-0.304
 bumped-0.301
 accidentally-0.280
 tri-0.258
 landed-0.253
Token the
Token ID262
Feature activation-8.36e-01

Pos logit contributions
 pleasure+3.563
 generosity+3.489
 appreciation+3.348
ipp+3.317
izza+3.256
Neg logit contributions
 bumped-2.016
 accidentally-1.966
 stub-1.948
 suddenly-1.740
?"-1.730
Token obedient
Token ID41652
Feature activation-1.24e+00

Pos logit contributions
 pleasure+1.577
 generosity+1.538
 appreciation+1.459
ipp+1.436
 pride+1.410
Neg logit contributions
 bumped-1.290
 stub-1.280
 accidentally-1.223
 tri-1.130
?"-1.130
Token nation
Token ID3277
Feature activation-1.44e+00

Pos logit contributions
 pleasure+2.497
 generosity+2.429
 appreciation+2.314
ipp+2.293
 pride+2.240
Neg logit contributions
 bumped-1.754
 stub-1.743
 accidentally-1.656
?"-1.531
 tri-1.526
Token always
Token ID1464
Feature activation+6.88e-01

Pos logit contributions
 pleasure+3.471
 generosity+3.352
 appreciation+3.193
ipp+3.157
izza+3.135
Neg logit contributions
 bumped-1.553
 stub-1.509
 accidentally-1.450
 tri-1.277
?"-1.271
Token remembered
Token ID12086
Feature activation-2.56e-01

Pos logit contributions
 stub+0.946
 bumped+0.914
 accidentally+0.864
Ro+0.821
?"+0.801
Neg logit contributions
 pleasure-1.407
 generosity-1.388
 appreciation-1.314
 pride-1.294
ipp-1.269
Token
Token ID198
Feature activation+5.03e-01

Pos logit contributions
 bumped+0.913
 stub+0.894
 accidentally+0.845
 tri+0.799
 landed+0.771
Neg logit contributions
 pleasure-1.160
 generosity-1.106
 appreciation-1.099
ipp-1.065
 pride-1.049
TokenThe
Token ID464
Feature activation+5.70e-01

Pos logit contributions
 bumped+0.775
 stub+0.771
 accidentally+0.724
?"+0.681
 tri+0.679
Neg logit contributions
 pleasure-0.910
 generosity-0.888
 appreciation-0.870
ipp-0.856
 pride-0.837
Token little
Token ID1310
Feature activation-6.19e-02

Pos logit contributions
 stub+0.974
 bumped+0.966
 accidentally+0.935
 tri+0.858
?"+0.850
Neg logit contributions
 pleasure-0.990
 generosity-0.980
 appreciation-0.908
 pride-0.897
 choices-0.877
Token girl
Token ID2576
Feature activation+6.32e-02

Pos logit contributions
 pleasure+0.116
 generosity+0.113
 appreciation+0.107
 pride+0.104
ipp+0.104
Neg logit contributions
 stub-0.096
 bumped-0.096
 accidentally-0.091
 tri-0.084
?"-0.084
Token loved
Token ID6151
Feature activation-3.09e-01

Pos logit contributions
 stub+0.071
 bumped+0.070
 accidentally+0.064
?"+0.060
 tri+0.057
Neg logit contributions
 pleasure-0.141
 generosity-0.140
 appreciation-0.134
ipp-0.132
 pride-0.131
Token all
Token ID477
Feature activation-2.95e+00

Pos logit contributions
 pleasure+0.652
 generosity+0.641
 appreciation+0.612
ipp+0.596
 pride+0.595
Neg logit contributions
 stub-0.404
 bumped-0.400
 accidentally-0.376
 tri-0.342
?"-0.342
Token the
Token ID262
Feature activation-1.56e+00

Pos logit contributions
 pleasure+7.672
 generosity+7.370
izza+7.248
ipp+7.078
apa+6.907
Neg logit contributions
 bumped-2.989
 accidentally-2.749
 stub-2.722
?"-2.639
 hall-2.561
Token books
Token ID3835
Feature activation-5.42e-01

Pos logit contributions
 pleasure+3.111
 generosity+3.026
ipp+2.908
 appreciation+2.881
izza+2.817
Neg logit contributions
 bumped-2.259
 stub-2.186
 accidentally-2.084
?"-1.955
 tri-1.936
Token she
Token ID673
Feature activation+1.22e+00

Pos logit contributions
 pleasure+1.179
 generosity+1.156
 appreciation+1.105
ipp+1.082
 pride+1.077
Neg logit contributions
 stub-0.665
 bumped-0.665
 accidentally-0.623
 tri-0.565
?"-0.559
Token saw
Token ID2497
Feature activation+2.79e-01

Pos logit contributions
 stub+1.454
 bumped+1.395
 accidentally+1.271
Ro+1.233
?"+1.204
Neg logit contributions
 pleasure-2.707
 generosity-2.655
 pride-2.524
 kinds-2.498
 appreciation-2.466
Token.
Token ID13
Feature activation-4.54e-01

Pos logit contributions
 stub+0.376
 bumped+0.368
 accidentally+0.348
 tri+0.315
?"+0.308
Neg logit contributions
 pleasure-0.578
 generosity-0.571
 appreciation-0.544
 pride-0.532
ipp-0.530
Token a
Token ID257
Feature activation-1.19e+00

Pos logit contributions
 bumped+1.101
 stub+1.071
 accidentally+1.015
 tri+0.950
 landed+0.926
Neg logit contributions
 pleasure-1.341
 generosity-1.285
 appreciation-1.228
 pride-1.212
ipp-1.187
Token time
Token ID640
Feature activation-5.74e-02

Pos logit contributions
 pleasure+2.466
 generosity+2.464
 appreciation+2.322
 pride+2.266
ipp+2.241
Neg logit contributions
 stub-1.621
 bumped-1.597
 accidentally-1.529
 tri-1.402
?"-1.391
Token,
Token ID11
Feature activation+9.25e-01

Pos logit contributions
 pleasure+0.118
 generosity+0.114
 appreciation+0.109
ipp+0.108
 pride+0.105
Neg logit contributions
 stub-0.078
 bumped-0.078
 accidentally-0.072
 tri-0.066
 landed-0.066
Token there
Token ID612
Feature activation+4.65e-01

Pos logit contributions
 bumped+0.918
 stub+0.900
 accidentally+0.787
 tri+0.730
Ro+0.715
Neg logit contributions
 pleasure-2.191
ipp-2.092
 generosity-2.066
 appreciation-2.044
 pride-1.974
Token were
Token ID547
Feature activation-1.68e+00

Pos logit contributions
 bumped+0.528
 stub+0.526
 accidentally+0.475
 tri+0.434
 seek+0.432
Neg logit contributions
 pleasure-1.040
 generosity-0.998
 appreciation-0.983
ipp-0.971
 kinds-0.946
Token two
Token ID734
Feature activation-3.23e+00

Pos logit contributions
 generosity+2.942
 pleasure+2.926
 appreciation+2.726
 choices+2.596
 pride+2.566
Neg logit contributions
 stub-2.860
 bumped-2.786
 accidentally-2.756
 tri-2.541
?"-2.535
Token best
Token ID1266
Feature activation-2.05e+00

Pos logit contributions
 pleasure+6.013
 generosity+5.584
 appreciation+5.323
izza+5.177
 pride+4.981
Neg logit contributions
 bumped-5.716
 accidentally-5.655
 stub-5.582
 tri-5.421
 saw-5.155
Token friends
Token ID2460
Feature activation-1.43e+00

Pos logit contributions
 pleasure+2.871
 generosity+2.724
 appreciation+2.477
ipp+2.445
 kinds+2.379
Neg logit contributions
 bumped-4.239
 stub-4.225
 accidentally-4.053
 tri-3.921
?"-3.850
Token.
Token ID13
Feature activation-2.25e-01

Pos logit contributions
 pleasure+3.668
 generosity+3.564
 appreciation+3.392
ipp+3.371
 pride+3.339
Neg logit contributions
 bumped-1.334
 stub-1.314
 accidentally-1.275
?"-1.136
 tri-1.103
Token They
Token ID1119
Feature activation-9.08e-01

Pos logit contributions
 pleasure+0.361
 generosity+0.350
 appreciation+0.330
ipp+0.321
 pride+0.317
Neg logit contributions
 bumped-0.409
 stub-0.407
 accidentally-0.387
 tri-0.366
?"-0.362
Token were
Token ID547
Feature activation-2.60e+00

Pos logit contributions
 pleasure+1.835
 generosity+1.801
 appreciation+1.709
ipp+1.657
 pride+1.647
Neg logit contributions
 bumped-1.293
 stub-1.284
 accidentally-1.231
 tri-1.123
?"-1.121
Token
Token ID198
Feature activation-8.06e-01

Pos logit contributions
 pleasure+1.501
 generosity+1.477
 appreciation+1.409
ipp+1.375
 pride+1.373
Neg logit contributions
 stub-0.841
 bumped-0.841
 accidentally-0.790
 tri-0.715
?"-0.712
Token
Token ID198
Feature activation-1.08e+00

Pos logit contributions
 pleasure+1.760
 generosity+1.727
 appreciation+1.642
ipp+1.611
 pride+1.602
Neg logit contributions
 stub-0.996
 bumped-0.991
 accidentally-0.934
 tri-0.844
?"-0.842
TokenThe
Token ID464
Feature activation-4.91e-01

Pos logit contributions
 pleasure+2.093
 generosity+2.032
 appreciation+1.904
ipp+1.866
 pride+1.861
Neg logit contributions
 stub-1.622
 bumped-1.612
 accidentally-1.553
 tri-1.425
 landed-1.404
Token two
Token ID734
Feature activation-2.41e+00

Pos logit contributions
 pleasure+0.871
 generosity+0.852
 appreciation+0.805
 pride+0.782
ipp+0.782
Neg logit contributions
 bumped-0.807
 stub-0.806
 accidentally-0.767
 tri-0.715
?"-0.711
Token sisters
Token ID15153
Feature activation-1.56e+00

Pos logit contributions
 pleasure+3.881
 generosity+3.515
 appreciation+3.270
izza+3.244
ipp+3.183
Neg logit contributions
 bumped-4.836
 accidentally-4.774
 stub-4.715
 tri-4.413
 suddenly-4.365
Token held
Token ID2714
Feature activation-2.68e+00

Pos logit contributions
 pleasure+3.484
 generosity+3.420
 appreciation+3.218
 choices+3.193
 decisions+3.172
Neg logit contributions
 bumped-1.923
 stub-1.892
 accidentally-1.785
 tri-1.650
 landed-1.628
Token hands
Token ID2832
Feature activation-2.97e+00

Pos logit contributions
 generosity+4.075
 pleasure+3.964
ipp+3.735
izza+3.717
 appreciation+3.522
Neg logit contributions
 bumped-5.157
 stub-5.060
 accidentally-4.883
 landed-4.703
 suddenly-4.684
Token,
Token ID11
Feature activation-2.28e+00

Pos logit contributions
 generosity+7.140
 pleasure+7.111
 decisions+6.696
 choices+6.679
 appreciation+6.665
Neg logit contributions
 bumped-3.482
 stub-3.425
 accidentally-3.237
 suddenly-3.145
 tri-2.993
Token walked
Token ID6807
Feature activation-4.16e-01

Pos logit contributions
 pleasure+5.135
 generosity+5.029
 decisions+4.847
 appreciation+4.820
 choices+4.783
Neg logit contributions
 bumped-2.961
 stub-2.897
 accidentally-2.837
 tri-2.612
 landed-2.501
Token back
Token ID736
Feature activation+2.37e-01

Pos logit contributions
 pleasure+0.814
 generosity+0.793
 appreciation+0.757
ipp+0.738
 pride+0.736
Neg logit contributions
 stub-0.610
 bumped-0.607
 accidentally-0.576
 tri-0.530
?"-0.527
Token home
Token ID1363
Feature activation-4.91e-01

Pos logit contributions
 stub+0.379
 bumped+0.372
 accidentally+0.357
 tri+0.328
Ro+0.327
Neg logit contributions
 pleasure-0.436
 generosity-0.420
 appreciation-0.406
ipp-0.392
 pride-0.389
Token His
Token ID2399
Feature activation-7.96e-01

Pos logit contributions
 pleasure+2.595
 generosity+2.544
 appreciation+2.383
ipp+2.324
 pride+2.311
Neg logit contributions
 stub-2.195
 bumped-2.173
 accidentally-2.133
?"-1.956
 tri-1.931
Token mom
Token ID1995
Feature activation-2.42e+00

Pos logit contributions
 pleasure+1.673
 generosity+1.638
 appreciation+1.558
ipp+1.541
 pride+1.517
Neg logit contributions
 stub-1.057
 bumped-1.055
 accidentally-0.994
 tri-0.908
?"-0.900
Token was
Token ID373
Feature activation-1.67e+00

Pos logit contributions
 pleasure+5.879
 generosity+5.578
 decisions+5.524
 choices+5.432
 appreciation+5.271
Neg logit contributions
 bumped-2.773
 accidentally-2.701
 stub-2.604
 landed-2.443
 suddenly-2.432
Token proud
Token ID6613
Feature activation-1.55e+00

Pos logit contributions
 pleasure+2.691
 generosity+2.685
izza+2.477
 appreciation+2.451
ipp+2.404
Neg logit contributions
 bumped-3.038
 stub-3.021
 accidentally-2.931
?"-2.796
 suddenly-2.743
Token of
Token ID286
Feature activation-9.99e-01

Pos logit contributions
 pleasure+2.544
 generosity+2.419
ipp+2.302
izza+2.268
 appreciation+2.251
Neg logit contributions
 stub-2.778
 bumped-2.762
 accidentally-2.706
?"-2.553
 tri-2.531
Token him
Token ID683
Feature activation-2.89e+00

Pos logit contributions
 pleasure+1.842
 generosity+1.797
 appreciation+1.704
ipp+1.653
 pride+1.643
Neg logit contributions
 bumped-1.580
 stub-1.569
 accidentally-1.526
?"-1.418
 tri-1.408
Token for
Token ID329
Feature activation-9.67e-01

Pos logit contributions
 pleasure+6.906
ipp+6.458
 generosity+6.457
 appreciation+6.425
izza+6.217
Neg logit contributions
 accidentally-3.102
 bumped-3.078
 stub-2.987
?"-2.832
 tri-2.813
Token being
Token ID852
Feature activation-1.55e+00

Pos logit contributions
 pleasure+1.913
 generosity+1.857
 appreciation+1.772
ipp+1.743
 pride+1.719
Neg logit contributions
 bumped-1.402
 stub-1.390
 accidentally-1.344
 tri-1.236
?"-1.215
Token a
Token ID257
Feature activation-2.12e+00

Pos logit contributions
 pleasure+2.844
 generosity+2.783
 appreciation+2.634
izza+2.606
ipp+2.581
Neg logit contributions
 bumped-2.526
 stub-2.494
 accidentally-2.437
?"-2.256
 tri-2.230
Token good
Token ID922
Feature activation-1.92e+00

Pos logit contributions
 pleasure+3.230
 generosity+3.182
ipp+3.018
ums+3.014
 appreciation+2.964
Neg logit contributions
 bumped-4.020
 stub-3.975
 accidentally-3.934
 seek-3.610
 tri-3.608
Token collector
Token ID22967
Feature activation-6.04e-01

Pos logit contributions
 pleasure+3.451
 generosity+3.368
 appreciation+3.196
ipp+3.184
izza+3.183
Neg logit contributions
 bumped-3.156
 stub-3.138
 accidentally-3.029
 tri-2.807
?"-2.788
Token not
Token ID407
Feature activation+4.54e-01

Pos logit contributions
 stub+0.030
 bumped+0.029
 accidentally+0.028
?"+0.026
 tri+0.026
Neg logit contributions
 pleasure-0.030
 generosity-0.029
 appreciation-0.028
 pride-0.027
ipp-0.026
Token happy
Token ID3772
Feature activation+1.67e-02

Pos logit contributions
 stub+0.907
 bumped+0.900
 accidentally+0.862
Ro+0.845
?"+0.821
Neg logit contributions
 pleasure-0.626
 generosity-0.605
 appreciation-0.573
 pride-0.561
ipp-0.536
Token and
Token ID290
Feature activation-7.70e-01

Pos logit contributions
 stub+0.021
 bumped+0.021
 accidentally+0.019
?"+0.017
 landed+0.017
Neg logit contributions
 pleasure-0.036
 generosity-0.036
 appreciation-0.034
 pride-0.033
ipp-0.033
Token sc
Token ID629
Feature activation-2.02e+00

Pos logit contributions
 pleasure+1.718
 generosity+1.681
 appreciation+1.608
ipp+1.579
 pride+1.563
Neg logit contributions
 bumped-0.928
 stub-0.920
 accidentally-0.868
 tri-0.785
?"-0.770
Tokenolded
Token ID48959
Feature activation-9.75e-01

Pos logit contributions
 pleasure+2.771
 generosity+2.697
 appreciation+2.479
 kinds+2.357
 pride+2.294
Neg logit contributions
 bumped-4.330
 stub-4.216
 accidentally-4.052
 tri-3.992
?"-3.815
Token John
Token ID1757
Feature activation-2.66e+00

Pos logit contributions
 pleasure+1.828
 generosity+1.784
 appreciation+1.692
ipp+1.650
 pride+1.640
Neg logit contributions
 bumped-1.504
 stub-1.480
 accidentally-1.431
 tri-1.329
?"-1.321
Token for
Token ID329
Feature activation+2.81e-01

Pos logit contributions
 pleasure+6.885
 generosity+6.574
ipp+6.375
 appreciation+6.375
 pride+6.271
Neg logit contributions
 bumped-2.521
 accidentally-2.307
 stub-2.302
 tri-2.072
?"-1.995
Token ignoring
Token ID15482
Feature activation-1.03e+00

Pos logit contributions
 stub+0.426
 bumped+0.416
 accidentally+0.391
?"+0.372
Ro+0.365
Neg logit contributions
 pleasure-0.531
 generosity-0.529
 appreciation-0.501
 pride-0.485
ipp-0.478
Token her
Token ID607
Feature activation-1.09e+00

Pos logit contributions
 pleasure+2.019
 generosity+1.970
 appreciation+1.862
ipp+1.842
 pride+1.809
Neg logit contributions
 bumped-1.507
 stub-1.486
 accidentally-1.430
?"-1.323
 tri-1.315
Token.
Token ID13
Feature activation-3.88e-01

Pos logit contributions
 pleasure+2.685
 generosity+2.617
 appreciation+2.513
ipp+2.504
 pride+2.468
Neg logit contributions
 bumped-1.044
 stub-1.039
 accidentally-0.963
 tri-0.844
?"-0.836
Token She
Token ID1375
Feature activation-2.22e-01

Pos logit contributions
 pleasure+0.745
 generosity+0.726
 appreciation+0.692
ipp+0.673
 pride+0.671
Neg logit contributions
 stub-0.580
 bumped-0.579
 accidentally-0.546
 tri-0.506
?"-0.500