TOP ACTIVATIONS
MAX = 4.512

bee named Buzz was flying around in the me adow .
little red bird was flying around when it spotted a big
a little frog was hopping through the pond . He wanted
bird flew down and started to pinch the popcorn out of
mosquito . He was sitting on the window sill and he
the distance she could see fluffy clouds of blue smoke .
bird outside . He started to chase after it , but
a frog . He loved to sing . He opened his
. Pat loved to lay under the warm sun . One
One day , it started to rain . The frog was
named Buzz y was flying around . She was looking for
and suddenly , she started to shrink ! Lily was so
. It had big , soft wings with lots of fur
very hot and he started to feel funny . Suddenly ,
boy went outside and saw white coal on the ground .
and hotter . Benny started to feel very hot and thirsty
. She started to play , and she moved
where he was and started to panic . He flew around
a sudden , he started to hear noises . He looked
sky and saw big , fluffy clouds . The

MIN ACTIVATIONS
MIN = -4.825

ceiling break ?" Her grandma explained , " Sometimes
was fixed , Lily asked her grandma , " Why did
open it . He asked his grandma if he could but
outside anymore . His grandma told him that it
was very happy and thanked her grandma for letting her unlock
so happy and she thanked her grandma for the y ummy
, please ?" Her grandma replied , " I
love you ." Her grandma said , " By
love you ." Her grandma sounded very happy and
. Tom smiled and thanked his mom for helping him send
you soon ?" Her grandma said , " Of
some . Her grandma said , " Of
went to the park with her mom . She was playing
the apples . Her grandma was shocked . She
and thirsty . They asked their grandma if they could have
ila felt better and thanked her grandma . She decided to
love you ." Her grandma sounded very happy and
that wasn 't hers . Her mom explained to her that
the cool space stuff . His grandma had to convince him
his knee . His grandma helped him up and

INTERVAL 2.178 - 4.512
CONTAINS 1.605%

She loved to play outside , but one day it was
best friends . They loved to play together .
through the forest when he saw a big , scary wolf
He laughed as it spun round and round .
and the big human began to laugh . The little human

INTERVAL -0.157 - 2.178
CONTAINS 54.796%

The frog jumped away and Tim my fell into the pond
he saw a mouse ! He ran to his mom and
the man played together for a while and had a lot
into a wet p uddle . Emma cried and tried to
. The fountain has water that goes up and down .

INTERVAL -2.491 - -0.157
CONTAINS 43.055%

. But even fair ies need to eat . Come on
my didn 't see Molly at first , because she was
He hoped someone would hear it . He also
her friends stayed in Slip for many days . They played
the party that she wanted to wear her new shoes .

INTERVAL -4.825 - -2.491
CONTAINS 0.543%

to the store , Lily 's mom my told her to
No , mine ! Mine !" Sam said .
couldn 't wait to tell her friends about the butterflies that
went on a walk with her mum and dad . They
to each other . <|endoftext|> Once upon a time , there
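
NOTE ON THE INTERVALS
The four INTERVAL ranges above look like equal-width bins spanning MIN to MAX. The sketch below checks that reading against the reported numbers. The equal-width assumption and the helper name equal_width_edges are illustrative and are not taken from the tool that produced this dump.

```python
# Minimal sketch, assuming the INTERVAL ranges are equal-width bins over [MIN, MAX].
# equal_width_edges is a hypothetical helper, not part of the dashboard's code.

def equal_width_edges(lo, hi, n_bins):
    """Return the n_bins + 1 edges of equal-width bins covering [lo, hi]."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]

# MIN and MAX as reported in the dump (already rounded to 3 decimals).
edges = equal_width_edges(-4.825, 4.512, 4)

# Interval boundaries as reported in the dump, lowest to highest.
reported = [-4.825, -2.491, -0.157, 2.178, 4.512]

# Largest disagreement between computed and reported edges.
max_gap = max(abs(e - r) for e, r in zip(edges, reported))

# CONTAINS percentages, lowest interval to highest.
shares = [0.543, 43.055, 54.796, 1.605]
total_share = sum(shares)

print(edges)
print(max_gap)
print(total_share)
```

Under this assumption the computed edges agree with the reported ones to within rounding, and the four CONTAINS percentages sum to roughly 100%. That is consistent with the intervals forming a four-bin histogram of all activations, where the two outer bins hold only the rare extreme values.
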
Token bee
Token ID20697
Feature activation+2.65e+00

Pos logit contributions
 nib+3.153
 hopping+2.953
 buzzing+2.932
ping+2.885
 prow+2.718
Neg logit contributions
 promise-1.348
 clerk-1.311
 grandparents-1.182
keeper-1.172
 cousin-1.168
Token named
Token ID3706
Feature activation+2.46e+00

Pos logit contributions
 nib+5.278
 hopping+4.910
 snowball+4.758
ping+4.708
 buzzing+4.615
Neg logit contributions
 promise-4.066
 grandparents-3.621
 cousin-3.549
olding-3.531
 grandmother-3.501
Token Buzz
Token ID15753
Feature activation+2.26e+00

Pos logit contributions
 nib+4.605
 hopping+4.460
 buzzing+4.334
ping+4.199
 snowball+3.954
Neg logit contributions
 promise-3.842
 cousin-3.774
 grandma-3.740
 clerk-3.643
 grandmother-3.592
Token was
Token ID373
Feature activation+2.78e+00

Pos logit contributions
 nib+3.762
 hopping+3.611
 buzzing+3.412
 snowball+3.285
ping+3.162
Neg logit contributions
 promise-4.378
 grandparents-3.844
 cousin-3.756
olding-3.756
 policy-3.643
Token flying
Token ID7348
Feature activation+2.44e+00

Pos logit contributions
 nib+6.026
 seaw+5.393
bee+5.312
 snowball+5.296
ping+5.209
Neg logit contributions
 promise-3.368
 cousin-3.357
 grandmother-3.313
 grandma-3.274
 grandparents-3.260
Token around
Token ID1088
Feature activation+4.51e+00

Pos logit contributions
 nib+4.817
 hopping+4.426
 buzzing+4.277
 gra+4.130
 snowball+4.111
Neg logit contributions
keeper-3.489
 promise-3.434
 wisely-3.406
 clerk-3.305
belt-3.261
Token in
Token ID287
Feature activation+3.38e+00

Pos logit contributions
latable+10.840
pill+10.415
 nib+10.388
 snowball+10.353
umping+10.018
Neg logit contributions
 grandma-4.678
 grandmother-4.512
 forward-4.482
 wisely-4.393
 grandparents-4.186
Token the
Token ID262
Feature activation+3.27e+00

Pos logit contributions
 nib+6.646
bee+6.477
ping+6.164
 hopping+5.860
icans+5.838
Neg logit contributions
 grandma-4.966
 grandparents-4.819
 promise-4.779
 gift-4.741
 grandmother-4.633
Token me
Token ID502
Feature activation+1.21e+00

Pos logit contributions
 nib+6.677
bee+5.760
ping+5.654
 hopping+5.583
 buzzing+5.392
Neg logit contributions
 promise-5.136
 grandmother-4.886
 grandparents-4.882
 clerk-4.849
keeper-4.831
Tokenadow
Token ID4584
Feature activation+2.74e+00

Pos logit contributions
 nib+1.981
 hopping+1.761
 buzzing+1.725
ping+1.586
 Buzz+1.585
Neg logit contributions
 promise-2.373
olding-2.253
belt-2.134
 wisely-2.133
 grandparents-2.129
Token.
Token ID13
Feature activation+3.80e+00

Pos logit contributions
 nib+5.941
 snowball+5.686
 buzzing+5.273
 gra+5.146
 hopping+5.120
Neg logit contributions
olding-3.635
 promise-3.594
keeper-3.570
belt-3.445
 wisely-3.400
Token little
Token ID1310
Feature activation+1.18e+00

Pos logit contributions
 nib+0.632
 hopping+0.591
 buzzing+0.587
ping+0.565
 Buzz+0.522
Neg logit contributions
 promise-0.508
 clerk-0.494
 grandparents-0.472
olding-0.466
 cousin-0.462
Token red
Token ID2266
Feature activation+1.69e+00

Pos logit contributions
 nib+2.943
 hopping+2.748
 buzzing+2.745
ping+2.699
 prow+2.535
Neg logit contributions
 promise-1.210
 clerk-1.199
 grandparents-1.088
 cousin-1.074
keeper-1.064
Token bird
Token ID6512
Feature activation+2.13e+00

Pos logit contributions
 nib+3.554
 buzzing+3.335
 hopping+3.278
ping+3.173
 Buzz+3.022
Neg logit contributions
 promise-2.557
 clerk-2.501
 cousin-2.360
keeper-2.325
 grandparents-2.301
Token was
Token ID373
Feature activation+2.40e+00

Pos logit contributions
 nib+3.908
 buzzing+3.606
 hopping+3.567
ping+3.328
 Buzz+3.290
Neg logit contributions
 promise-3.584
 cousin-3.287
 grandparents-3.240
olding-3.184
aila-3.152
Token flying
Token ID7348
Feature activation+2.35e+00

Pos logit contributions
 nib+5.152
 buzzing+4.502
bee+4.478
ping+4.468
 snowball+4.466
Neg logit contributions
 promise-3.096
 clerk-2.986
 cousin-2.920
 grandmother-2.880
aila-2.854
Token around
Token ID1088
Feature activation+4.46e+00

Pos logit contributions
 nib+4.685
 hopping+4.250
 buzzing+4.175
 gra+3.993
 Buzz+3.933
Neg logit contributions
keeper-3.417
 promise-3.352
belt-3.296
 wisely-3.261
 clerk-3.255
Token when
Token ID618
Feature activation+2.56e+00

Pos logit contributions
pill+10.539
 snowball+10.375
 nib+10.288
latable+10.174
bee+9.867
Neg logit contributions
 grandma-5.053
 grandmother-4.761
 forward-4.627
 grandparents-4.497
 wisely-4.371
Token it
Token ID340
Feature activation+1.45e+00

Pos logit contributions
 nib+5.462
ping+5.294
 buzzing+4.930
 snowball+4.683
 gra+4.674
Neg logit contributions
 grandmother-3.396
 grandparents-3.340
 grandma-3.278
belt-3.190
keeper-3.178
Token spotted
Token ID13489
Feature activation+2.47e+00

Pos logit contributions
 nib+2.805
 hopping+2.625
 buzzing+2.567
ping+2.509
 Buzz+2.375
Neg logit contributions
 promise-2.208
keeper-2.070
belt-2.017
 clerk-2.011
olded-1.956
Token a
Token ID257
Feature activation+7.89e-01

Pos logit contributions
 nib+4.472
 buzzing+3.994
 hopping+3.949
ping+3.896
bee+3.787
Neg logit contributions
 clerk-4.232
 grandma-4.127
 grandmother-4.077
keeper-4.020
 grandparents-3.990
Token big
Token ID1263
Feature activation+1.79e+00

Pos logit contributions
 nib+1.416
 hopping+1.320
ping+1.302
 buzzing+1.278
 Buzz+1.195
Neg logit contributions
 promise-1.361
 clerk-1.319
keeper-1.264
 grandparents-1.249
 grandmother-1.213
Token a
Token ID257
Feature activation-7.23e-02

Pos logit contributions
 nib+3.526
 buzzing+3.239
ping+3.181
 hopping+3.041
 gra+2.936
Neg logit contributions
 clerk-3.363
 promise-3.174
belt-3.155
keeper-3.145
aila-3.116
Token little
Token ID1310
Feature activation+8.73e-01

Pos logit contributions
 promise+0.115
 clerk+0.113
 grandparents+0.107
olding+0.106
keeper+0.105
Neg logit contributions
 nib-0.138
 hopping-0.129
 buzzing-0.128
ping-0.122
 Buzz-0.114
Token frog
Token ID21264
Feature activation+2.84e+00

Pos logit contributions
 nib+2.216
 buzzing+2.081
 hopping+2.078
ping+2.042
 prow+1.911
Neg logit contributions
 clerk-0.862
 promise-0.855
keeper-0.753
 grandparents-0.749
olding-0.735
Token was
Token ID373
Feature activation+2.58e+00

Pos logit contributions
 nib+5.866
ping+5.848
 buzzing+5.592
 hopping+5.374
 tweeting+5.316
Neg logit contributions
 promise-3.950
olded-3.472
 cousin-3.435
olding-3.405
 grandparents-3.331
Token hopping
Token ID47153
Feature activation+2.26e+00

Pos logit contributions
 nib+4.658
bee+4.088
 buzzing+4.029
ping+4.020
 snowball+3.962
Neg logit contributions
 promise-4.140
 clerk-4.070
 cousin-3.996
olded-3.956
olding-3.921
Token through
Token ID832
Feature activation+4.35e+00

Pos logit contributions
 nib+4.473
 buzzing+4.139
 hopping+3.996
 gra+3.850
ping+3.842
Neg logit contributions
 promise-3.383
olding-3.235
 clerk-3.216
keeper-3.165
olded-3.099
Token the
Token ID262
Feature activation+3.02e+00

Pos logit contributions
bee+10.294
ping+9.778
 nib+9.500
 gra+9.317
hing+9.173
Neg logit contributions
 grandma-4.917
 mom-4.666
 grandparents-4.498
 promise-4.432
 wisely-4.326
Token pond
Token ID16723
Feature activation+2.32e+00

Pos logit contributions
 nib+4.768
bee+4.187
ping+4.108
 buzzing+4.027
 hopping+3.724
Neg logit contributions
 promise-6.041
 clerk-5.851
keeper-5.835
 grandparents-5.818
ari-5.621
Token.
Token ID13
Feature activation+3.07e+00

Pos logit contributions
 nib+4.795
 snowball+4.545
 buzzing+4.381
ping+4.262
bee+4.241
Neg logit contributions
 promise-3.339
olding-3.242
 wisely-3.033
 clerk-2.987
olded-2.953
Token He
Token ID679
Feature activation+2.46e+00

Pos logit contributions
 buzzing+6.162
ping+5.941
 nib+5.743
 snowball+5.644
 tweeting+5.563
Neg logit contributions
 clerk-4.796
keeper-4.584
Of-4.516
 cousin-4.335
olded-4.228
Token wanted
Token ID2227
Feature activation+1.57e+00

Pos logit contributions
ping+5.284
 nib+5.060
 buzzing+5.050
 hopping+4.854
 lava+4.843
Neg logit contributions
 promise-3.009
olded-2.677
keeper-2.673
 grandparents-2.631
belt-2.624
Token bird
Token ID6512
Feature activation+2.77e+00

Pos logit contributions
 nib+2.508
 hopping+2.294
 buzzing+2.266
ping+2.254
 Buzz+2.116
Neg logit contributions
 promise-2.117
 clerk-2.065
 manager-1.876
 officer-1.858
 grandparents-1.839
Token flew
Token ID13112
Feature activation+1.54e+00

Pos logit contributions
 nib+4.896
 Buzz+4.524
 ponds+4.447
 hopping+4.427
 algae+4.312
Neg logit contributions
 promise-4.513
olded-4.282
 officer-4.076
olding-4.072
keeper-4.061
Token down
Token ID866
Feature activation+6.02e-01

Pos logit contributions
 nib+3.091
 hopping+2.770
 buzzing+2.738
 snowball+2.554
bee+2.554
Neg logit contributions
 clerk-2.318
keeper-2.242
 promise-2.238
olding-2.173
olded-2.172
Token and
Token ID290
Feature activation+1.58e+00

Pos logit contributions
 nib+1.065
 hopping+0.988
 buzzing+0.963
ping+0.923
 snowball+0.865
Neg logit contributions
 clerk-1.015
 promise-0.986
olding-0.955
keeper-0.953
 manager-0.946
Token started
Token ID2067
Feature activation+3.03e+00

Pos logit contributions
 nib+2.852
 hopping+2.800
ping+2.722
 buzzing+2.679
 ponds+2.585
Neg logit contributions
 clerk-2.372
keeper-2.344
olded-2.333
 promise-2.281
ounded-2.278
Token to
Token ID284
Feature activation+4.28e+00

Pos logit contributions
 nib+7.428
bee+7.419
ping+7.312
 snowball+7.094
 ponds+7.064
Neg logit contributions
 clerk-2.466
 wisely-2.419
olding-2.391
olded-2.366
 promise-2.342
Token pinch
Token ID25394
Feature activation+1.26e+00

Pos logit contributions
ping+8.426
 hopping+8.370
umping+8.212
pill+8.078
 ponds+7.910
Neg logit contributions
 promise-8.161
 officer-5.829
 reward-5.630
 cheaper-5.585
 forgive-5.397
Token the
Token ID262
Feature activation+1.35e+00

Pos logit contributions
 nib+2.108
 hopping+1.981
 buzzing+1.909
ping+1.794
 prow+1.723
Neg logit contributions
 promise-2.199
 clerk-2.195
 grandparents-2.118
 bills-2.112
 wisely-2.107
Token popcorn
Token ID38914
Feature activation+1.43e+00

Pos logit contributions
 nib+2.312
 hopping+2.199
 buzzing+2.098
 Buzz+1.957
ping+1.921
Neg logit contributions
 promise-2.547
 clerk-2.428
keeper-2.283
 bills-2.283
 manager-2.282
Token out
Token ID503
Feature activation+3.78e-01

Pos logit contributions
 nib+2.741
 hopping+2.632
 buzzing+2.475
ping+2.387
 tweeting+2.333
Neg logit contributions
 promise-2.342
 clerk-2.182
keeper-2.125
 wisely-2.076
 bills-2.066
Token of
Token ID286
Feature activation+1.50e+00

Pos logit contributions
 nib+0.871
 hopping+0.817
 buzzing+0.798
ping+0.755
 Buzz+0.738
Neg logit contributions
 promise-0.454
 clerk-0.436
olding-0.422
keeper-0.408
 wisely-0.406
Token mosquito
Token ID35446
Feature activation+2.20e+00

Pos logit contributions
 nib+3.488
 buzzing+3.329
 hopping+3.273
ping+3.267
 tweeting+3.016
Neg logit contributions
 promise-2.617
 clerk-2.507
 cousin-2.396
 grandparents-2.319
keeper-2.289
Token.
Token ID13
Feature activation+3.13e+00

Pos logit contributions
 nib+4.374
 hopping+4.067
ping+3.994
 buzzing+3.948
 ponds+3.911
Neg logit contributions
 promise-3.340
keeper-2.993
 cousin-2.924
olding-2.922
 clerk-2.902
Token He
Token ID679
Feature activation+2.75e+00

Pos logit contributions
 nib+5.540
ping+5.528
 buzzing+5.492
 hopping+5.444
 ponds+4.930
Neg logit contributions
 promise-5.077
 grandma-5.045
 cousin-4.991
Of-4.952
 clerk-4.835
Token was
Token ID373
Feature activation+2.85e+00

Pos logit contributions
 nib+5.656
ping+5.532
 hopping+5.520
 ponds+5.388
 lava+5.312
Neg logit contributions
 promise-3.741
olded-3.210
belt-3.154
Of-3.119
 wisely-3.064
Token sitting
Token ID5586
Feature activation+3.65e-01

Pos logit contributions
 nib+6.578
 ponds+6.068
growth+6.044
 seaw+5.910
ping+5.886
Neg logit contributions
 promise-3.215
olded-2.893
 cousin-2.822
 clerk-2.810
 grandma-2.758
Token on
Token ID319
Feature activation+4.28e+00

Pos logit contributions
 nib+0.669
 buzzing+0.608
 hopping+0.604
ping+0.551
 gra+0.544
Neg logit contributions
 promise-0.606
 clerk-0.591
belt-0.565
olding-0.565
 grandparents-0.556
Token the
Token ID262
Feature activation+2.69e+00

Pos logit contributions
 nib+9.313
ping+8.456
 hopping+8.362
bee+8.253
 tweeting+8.197
Neg logit contributions
 grandma-6.786
 clerk-5.857
 grandmother-5.787
 lesson-5.734
 bills-5.596
Token window
Token ID4324
Feature activation+1.19e+00

Pos logit contributions
 nib+4.602
 hopping+3.973
 Buzz+3.644
 buzzing+3.628
 prow+3.538
Neg logit contributions
 clerk-5.310
 promise-5.163
belt-4.928
 manager-4.917
keeper-4.904
Token sill
Token ID49276
Feature activation+3.28e-01

Pos logit contributions
 nib+2.232
 buzzing+2.086
 hopping+2.055
 snowball+1.919
 Buzz+1.910
Neg logit contributions
 promise-1.890
 clerk-1.845
olding-1.805
belt-1.788
keeper-1.784
Token and
Token ID290
Feature activation+3.03e+00

Pos logit contributions
 nib+0.588
 hopping+0.542
 buzzing+0.539
 Buzz+0.479
ping+0.473
Neg logit contributions
 promise-0.563
 clerk-0.555
olding-0.536
belt-0.535
 bills-0.524
Token he
Token ID339
Feature activation+2.25e+00

Pos logit contributions
 snowball+5.318
ping+5.198
 nib+5.178
 buzzing+4.928
latable+4.920
Neg logit contributions
 wisely-4.671
 promise-4.549
belt-4.442
 grandparents-4.348
keeper-4.318
Token the
Token ID262
Feature activation+1.14e+00

Pos logit contributions
 nib+0.517
 buzzing+0.459
 hopping+0.456
ping+0.438
 snowball+0.391
Neg logit contributions
 promise-0.749
 clerk-0.743
 grandparents-0.720
 grandmother-0.699
 bills-0.696
Token distance
Token ID5253
Feature activation+7.59e-01

Pos logit contributions
 nib+1.914
 hopping+1.685
 buzzing+1.678
ping+1.630
 Buzz+1.575
Neg logit contributions
 promise-2.200
 clerk-2.172
 grandparents-2.062
 grandmother-2.025
keeper-1.980
Token she
Token ID673
Feature activation-3.58e-01

Pos logit contributions
 nib+1.423
 hopping+1.285
 buzzing+1.271
ping+1.202
 snowball+1.166
Neg logit contributions
 clerk-1.248
 promise-1.245
keeper-1.190
olding-1.162
 grandparents-1.162
Token could
Token ID714
Feature activation+3.59e-01

Pos logit contributions
 promise+0.625
 clerk+0.614
olding+0.586
 grandparents+0.585
belt+0.579
Neg logit contributions
 nib-0.627
 hopping-0.583
 buzzing-0.581
ping-0.535
 Buzz-0.511
Token see
Token ID766
Feature activation+8.65e-01

Pos logit contributions
 nib+0.612
 hopping+0.563
 buzzing+0.561
ping+0.538
 snowball+0.511
Neg logit contributions
 promise-0.653
 clerk-0.618
olding-0.604
belt-0.597
 grandparents-0.592
Token fluffy
Token ID39145
Feature activation+4.26e+00

Pos logit contributions
 nib+1.473
 hopping+1.311
 buzzing+1.283
ping+1.224
 snowball+1.172
Neg logit contributions
 clerk-1.549
 promise-1.535
 grandparents-1.492
keeper-1.476
 wisely-1.469
Token clouds
Token ID15114
Feature activation+3.21e+00

Pos logit contributions
 nib+8.989
eh+8.340
 gra+8.334
 prow+7.536
 hopping+7.467
Neg logit contributions
 grandparents-6.227
 clerk-6.089
keeper-6.080
 grandmother-5.990
 grandma-5.963
Token of
Token ID286
Feature activation+2.65e+00

Pos logit contributions
 nib+6.896
 snowball+6.243
 Buzz+6.160
eh+6.097
 hopping+6.064
Neg logit contributions
 promise-4.405
keeper-4.035
 wisely-3.977
olding-3.971
 clerk-3.842
Token blue
Token ID4171
Feature activation+3.01e+00

Pos logit contributions
 nib+4.175
 hopping+3.474
ping+3.418
eh+3.345
bee+3.271
Neg logit contributions
 grandparents-5.077
 bills-4.975
 clerk-4.967
 grandma-4.942
 grandmother-4.848
Token smoke
Token ID7523
Feature activation+2.58e+00

Pos logit contributions
 nib+5.731
 buzzing+4.934
 gra+4.900
 hopping+4.829
 Buzz+4.587
Neg logit contributions
 grandparents-4.986
 wisely-4.927
 clerk-4.873
 bills-4.807
ari-4.750
Token.
Token ID13
Feature activation+1.51e+00

Pos logit contributions
 nib+5.592
 hopping+4.995
 buzzing+4.847
 snowball+4.817
 Buzz+4.771
Neg logit contributions
 clerk-3.371
keeper-3.297
 manager-3.239
 promise-3.211
 wisely-3.160
Token bird
Token ID6512
Feature activation+2.51e+00

Pos logit contributions
 nib+0.969
 hopping+0.913
 buzzing+0.888
ping+0.884
 Buzz+0.824
Neg logit contributions
 promise-0.816
 clerk-0.797
keeper-0.751
 grandparents-0.742
 manager-0.731
Token outside
Token ID2354
Feature activation+2.22e+00

Pos logit contributions
 nib+4.581
 Buzz+4.151
 hopping+4.065
 buzzing+4.010
 snowball+3.987
Neg logit contributions
 promise-4.417
olding-3.902
keeper-3.723
belt-3.622
olded-3.617
Token.
Token ID13
Feature activation+2.33e+00

Pos logit contributions
 nib+4.260
 buzzing+3.822
 hopping+3.814
ping+3.571
 snowball+3.535
Neg logit contributions
 promise-3.665
 clerk-3.370
 policy-3.344
keeper-3.320
belt-3.306
Token He
Token ID679
Feature activation+2.00e+00

Pos logit contributions
 nib+4.174
 buzzing+4.163
ping+3.936
 hopping+3.731
 snowball+3.517
Neg logit contributions
 clerk-4.178
keeper-4.020
olded-3.919
Of-3.763
 cousin-3.753
Token started
Token ID2067
Feature activation+3.00e+00

Pos logit contributions
ping+3.949
 nib+3.931
 buzzing+3.881
 hopping+3.861
 ponds+3.642
Neg logit contributions
 promise-2.882
keeper-2.717
olded-2.700
 clerk-2.628
 grandparents-2.628
Token to
Token ID284
Feature activation+4.25e+00

Pos logit contributions
 nib+7.462
ping+7.197
bee+7.135
 Buzz+7.067
Buzz+7.064
Neg logit contributions
 wisely-2.527
 promise-2.477
 clerk-2.357
olding-2.330
keeper-2.278
Token chase
Token ID15505
Feature activation+1.77e+00

Pos logit contributions
ping+8.607
pill+8.464
 hopping+8.181
umping+7.782
 lava+7.683
Neg logit contributions
 promise-8.368
 officer-5.872
 grandmother-5.720
 reward-5.643
 grandparents-5.642
Token after
Token ID706
Feature activation+2.47e+00

Pos logit contributions
 nib+2.965
 hopping+2.938
 buzzing+2.835
ping+2.690
 tweeting+2.650
Neg logit contributions
 clerk-3.014
 promise-2.981
keeper-2.903
 bills-2.860
 wisely-2.816
Token it
Token ID340
Feature activation+1.33e+00

Pos logit contributions
 nib+4.479
 buzzing+4.048
 hopping+4.014
ping+3.958
 gra+3.745
Neg logit contributions
 clerk-4.074
 bills-3.901
 wisely-3.816
keeper-3.791
 manager-3.768
Token,
Token ID11
Feature activation+2.13e+00

Pos logit contributions
 nib+2.462
 buzzing+2.291
 hopping+2.261
ping+2.174
 gra+2.150
Neg logit contributions
 promise-2.148
olding-1.963
 wisely-1.954
 clerk-1.951
keeper-1.887
Token but
Token ID475
Feature activation+6.46e-02

Pos logit contributions
 nib+4.392
 buzzing+4.081
 hopping+4.040
ping+4.027
 snowball+3.895
Neg logit contributions
 promise-2.961
 clerk-2.796
 wisely-2.743
olding-2.722
keeper-2.674
Token a
Token ID257
Feature activation+6.06e-01

Pos logit contributions
 nib+4.210
 buzzing+3.979
ping+3.939
 hopping+3.747
 prow+3.556
Neg logit contributions
 promise-4.420
 cousin-4.386
 clerk-4.384
 grandma-4.383
 grandparents-4.346
Token frog
Token ID21264
Feature activation+2.24e+00

Pos logit contributions
 nib+1.205
 hopping+1.146
ping+1.134
 buzzing+1.133
 tweeting+1.024
Neg logit contributions
 promise-0.919
 clerk-0.878
 grandparents-0.837
 cousin-0.823
olding-0.814
Token.
Token ID13
Feature activation+2.85e+00

Pos logit contributions
 nib+4.495
ping+4.358
 buzzing+4.250
 hopping+4.168
 lava+4.054
Neg logit contributions
 promise-3.298
olding-3.044
keeper-2.916
 grandparents-2.897
 cousin-2.886
Token He
Token ID679
Feature activation+3.13e+00

Pos logit contributions
 buzzing+5.316
ping+5.095
 nib+4.907
 hopping+4.873
 tweeting+4.521
Neg logit contributions
 promise-4.557
 cousin-4.500
 clerk-4.434
keeper-4.400
Of-4.383
Token loved
Token ID6151
Feature activation+3.10e+00

Pos logit contributions
ping+7.068
 buzzing+6.766
 nib+6.741
 tweeting+6.643
 lava+6.538
Neg logit contributions
 promise-3.679
 cousin-2.992
olded-2.971
belt-2.957
Of-2.910
Token to
Token ID284
Feature activation+4.23e+00

Pos logit contributions
 nib+7.989
ping+7.729
 buzzing+7.637
bee+7.423
 snowball+7.375
Neg logit contributions
 wisely-2.222
 bills-2.204
 grandparents-2.089
 promise-2.078
 clerk-2.044
Token sing
Token ID1702
Feature activation+1.86e+00

Pos logit contributions
ping+9.804
 buzzing+8.647
 hovering+8.572
umping+8.561
 perched+8.428
Neg logit contributions
 promise-7.512
 reward-6.340
 gift-5.318
 teacher-5.247
 forgive-5.033
Token.
Token ID13
Feature activation+1.63e+00

Pos logit contributions
 nib+3.501
 hopping+3.197
 buzzing+3.165
ping+2.972
 ponds+2.893
Neg logit contributions
 grandparents-2.828
 clerk-2.781
 promise-2.779
 wisely-2.764
 manager-2.700
Token He
Token ID679
Feature activation+1.53e+00

Pos logit contributions
 buzzing+2.953
 nib+2.824
ping+2.750
 hopping+2.724
 tweeting+2.501
Neg logit contributions
 promise-2.775
 clerk-2.635
 cousin-2.590
 grandparents-2.574
olding-2.534
Token opened
Token ID4721
Feature activation+2.11e+00

Pos logit contributions
 nib+3.014
 hopping+2.897
ping+2.893
 buzzing+2.805
 snowball+2.736
Neg logit contributions
 promise-2.229
 grandparents-2.012
 cousin-1.966
 wisely-1.929
 grandmother-1.908
Token his
Token ID465
Feature activation+1.64e-01

Pos logit contributions
 nib+3.397
 buzzing+3.270
 hopping+3.177
ping+3.054
bee+2.892
Neg logit contributions
 grandparents-3.797
 grandmother-3.686
 wisely-3.662
 grandma-3.656
 clerk-3.603
Token.
Token ID13
Feature activation+1.11e+00

Pos logit contributions
 nib+4.146
 buzzing+3.970
 hopping+3.959
ping+3.708
 tweeting+3.694
Neg logit contributions
 promise-3.132
olding-2.818
keeper-2.723
 wisely-2.720
 grandparents-2.690
Token Pat
Token ID3208
Feature activation+2.78e+00

Pos logit contributions
 nib+1.657
 buzzing+1.549
 hopping+1.492
ping+1.462
 tweeting+1.320
Neg logit contributions
 promise-2.173
 clerk-2.060
olding-2.019
 grandparents-1.999
belt-1.998
Token loved
Token ID6151
Feature activation+2.62e+00

Pos logit contributions
 nib+5.970
 buzzing+5.674
ping+5.668
 hopping+5.652
 tweeting+5.570
Neg logit contributions
 promise-3.906
 grandparents-3.268
keeper-3.262
 wisely-3.222
 clerk-3.193
Token to
Token ID284
Feature activation+3.95e+00

Pos logit contributions
 nib+6.439
 buzzing+6.166
ping+6.159
bee+5.932
 snowball+5.913
Neg logit contributions
 grandparents-2.300
 bills-2.289
 clerk-2.224
 wisely-2.171
 promise-2.109
Token lay
Token ID3830
Feature activation+1.30e+00

Pos logit contributions
ping+9.234
 buzzing+8.296
 perched+8.204
 hovering+8.092
ric+8.072
Neg logit contributions
 promise-6.709
 reward-5.954
 gift-4.833
 charity-4.674
 clerk-4.604
Token under
Token ID739
Feature activation+4.23e+00

Pos logit contributions
 nib+2.510
 buzzing+2.316
 hopping+2.240
ping+2.081
 prow+2.079
Neg logit contributions
 promise-1.975
belt-1.964
 clerk-1.944
 wisely-1.912
 grandparents-1.839
Token the
Token ID262
Feature activation+3.29e+00

Pos logit contributions
ping+8.342
 nib+8.052
bee+7.993
 buzzing+7.823
eh+7.781
Neg logit contributions
 grandma-6.344
 mom-6.201
 promise-5.973
 parents-5.960
 grandparents-5.803
Token warm
Token ID5814
Feature activation+2.95e+00

Pos logit contributions
 nib+6.547
 hopping+5.690
 perched+5.541
 buzzing+5.506
 prow+5.443
Neg logit contributions
 clerk-5.371
 promise-5.313
 grandparents-5.246
 grandmother-5.056
keeper-5.031
Token sun
Token ID4252
Feature activation+1.75e+00

Pos logit contributions
 nib+5.564
 buzzing+4.948
 hopping+4.846
 prow+4.749
cks+4.521
Neg logit contributions
 promise-4.836
 clerk-4.785
 grandparents-4.693
 grandmother-4.647
keeper-4.594
Token.
Token ID13
Feature activation+1.68e+00

Pos logit contributions
 nib+3.465
 buzzing+3.234
 hopping+3.196
 snowball+3.062
 prow+2.982
Neg logit contributions
 promise-2.474
 clerk-2.435
keeper-2.376
 wisely-2.372
belt-2.335
Token One
Token ID1881
Feature activation+1.46e+00

Pos logit contributions
 buzzing+2.984
 nib+2.970
ping+2.838
 hopping+2.759
 tweeting+2.580
Neg logit contributions
 promise-2.788
 grandparents-2.632
 clerk-2.621
olding-2.582
Of-2.579
TokenOne
Token ID3198
Feature activation+1.31e+00

Pos logit contributions
 nib+3.202
ping+3.014
 buzzing+3.012
 hopping+2.953
 tweeting+2.726
Neg logit contributions
 promise-2.008
 clerk-1.972
olding-1.924
belt-1.873
Of-1.857
Token day
Token ID1110
Feature activation+1.63e+00

Pos logit contributions
 nib+3.023
 hopping+2.850
 buzzing+2.825
ping+2.810
 tweeting+2.659
Neg logit contributions
 promise-1.669
olding-1.394
 policy-1.305
 clerk-1.297
 cousin-1.269
Token,
Token ID11
Feature activation+1.62e+00

Pos logit contributions
 nib+3.250
 buzzing+3.206
ping+3.182
 hopping+3.013
 gra+2.979
Neg logit contributions
 clerk-2.191
 promise-2.150
 policy-2.087
olded-2.068
 reward-2.036
Token it
Token ID340
Feature activation+1.04e+00

Pos logit contributions
ping+2.943
 nib+2.943
 buzzing+2.926
 hopping+2.799
 tweeting+2.595
Neg logit contributions
 clerk-2.514
 promise-2.394
keeper-2.352
 manager-2.333
Of-2.315
Token started
Token ID2067
Feature activation+2.61e+00

Pos logit contributions
 nib+1.930
 hopping+1.838
 buzzing+1.820
ping+1.730
 tweeting+1.657
Neg logit contributions
 promise-1.695
 clerk-1.547
keeper-1.529
olded-1.507
olding-1.501
Token to
Token ID284
Feature activation+4.22e+00

Pos logit contributions
 nib+6.656
bee+6.357
ping+6.279
 buzzing+6.193
 gra+6.162
Neg logit contributions
olding-2.114
 promise-2.110
 wisely-1.930
 manager-1.879
 clerk-1.864
Token rain
Token ID6290
Feature activation+2.53e+00

Pos logit contributions
ping+7.756
umping+7.541
pill+7.492
ez+7.382
 nib+7.344
Neg logit contributions
 promise-8.612
 manager-6.931
 cheaper-6.920
 reward-6.811
 punishment-6.760
Token.
Token ID13
Feature activation+2.61e+00

Pos logit contributions
 nib+5.292
 hopping+4.948
 buzzing+4.650
 Buzz+4.621
 gra+4.452
Neg logit contributions
 promise-3.676
 manager-3.668
 clerk-3.558
olding-3.543
keeper-3.473
Token The
Token ID383
Feature activation+1.20e+00

Pos logit contributions
 buzzing+4.662
 nib+4.514
ping+4.383
 hopping+4.232
 tweeting+4.151
Neg logit contributions
 clerk-4.684
olded-4.463
keeper-4.321
 manager-4.294
 cousin-4.175
Token frog
Token ID21264
Feature activation+1.18e+00

Pos logit contributions
 nib+1.952
 buzzing+1.718
 hopping+1.715
ping+1.633
 prow+1.518
Neg logit contributions
 promise-2.385
 clerk-2.256
 manager-2.148
olding-2.146
keeper-2.141
Token was
Token ID373
Feature activation+1.64e+00

Pos logit contributions
 nib+2.249
 hopping+2.140
 buzzing+2.110
ping+2.068
 tweeting+1.944
Neg logit contributions
 promise-1.898
olded-1.663
 clerk-1.657
keeper-1.651
olding-1.633
Token named
Token ID3706
Feature activation+2.53e+00

Pos logit contributions
 nib+5.666
 hopping+5.293
ping+5.225
 snowball+5.115
 buzzing+5.085
Neg logit contributions
 promise-4.095
 cousin-3.636
ari-3.543
olded-3.539
olding-3.529
Token Buzz
Token ID15753
Feature activation+2.31e+00

Pos logit contributions
 nib+4.396
 hopping+4.346
 buzzing+4.184
ping+4.057
 snowball+3.881
Neg logit contributions
 cousin-4.093
 promise-4.089
 grandma-4.028
 clerk-3.860
Of-3.810
Tokeny
Token ID88
Feature activation+3.06e+00

Pos logit contributions
 nib+4.544
 hopping+4.370
 buzzing+4.244
ping+4.147
 snowball+4.074
Neg logit contributions
 promise-3.529
 cousin-2.991
 grandparents-2.947
olding-2.888
ari-2.877
Token was
Token ID373
Feature activation+2.11e+00

Pos logit contributions
 nib+6.322
 buzzing+5.896
ping+5.880
 snowball+5.720
 hopping+5.683
Neg logit contributions
 cousin-4.408
 promise-4.107
 grandmother-4.091
 grandparents-4.066
 grandma-4.035
Token flying
Token ID7348
Feature activation+2.55e+00

Pos logit contributions
 nib+3.684
 snowball+3.310
bee+3.253
ping+3.232
 buzzing+3.221
Neg logit contributions
 promise-3.449
 clerk-3.275
 cousin-3.247
olding-3.243
olded-3.208
Token around
Token ID1088
Feature activation+4.21e+00

Pos logit contributions
 nib+5.171
 hopping+4.733
 buzzing+4.569
 gra+4.511
 snowball+4.413
Neg logit contributions
keeper-3.474
 wisely-3.471
 promise-3.318
 manager-3.226
ounded-3.222
Token.
Token ID13
Feature activation+3.60e+00

Pos logit contributions
 snowball+9.554
latable+9.531
pill+9.309
 nib+9.280
bee+9.174
Neg logit contributions
 wisely-4.188
 grandma-4.094
 grandmother-4.014
 forward-3.946
 grandparents-3.856
Token She
Token ID1375
Feature activation+1.45e+00

Pos logit contributions
 buzzing+6.466
 snowball+6.180
ping+6.144
ric+5.900
 tweeting+5.894
Neg logit contributions
keeper-5.999
Of-5.823
 promise-5.776
 clerk-5.641
 manager-5.488
Token was
Token ID373
Feature activation+1.81e+00

Pos logit contributions
 nib+2.847
 hopping+2.680
ping+2.674
 buzzing+2.653
 snowball+2.549
Neg logit contributions
 promise-2.039
 grandparents-1.894
belt-1.864
 clerk-1.855
keeper-1.842
Token looking
Token ID2045
Feature activation+1.51e+00

Pos logit contributions
 nib+3.537
 snowball+3.196
growth+3.077
 buzzing+3.071
 hopping+3.065
Neg logit contributions
 promise-2.606
 clerk-2.440
olding-2.386
 wisely-2.372
 cheaper-2.339
Token for
Token ID329
Feature activation+1.34e+00

Pos logit contributions
 nib+3.966
 buzzing+3.672
 hopping+3.647
 snowball+3.565
ping+3.527
Neg logit contributions
 promise-1.268
 clerk-1.122
olding-1.118
 grandparents-1.057
 wisely-1.015
Token and
Token ID290
Feature activation+7.02e-01

Pos logit contributions
 promise+0.325
 clerk+0.319
olding+0.307
keeper+0.304
 grandparents+0.303
Neg logit contributions
 nib-0.233
 hopping-0.216
 buzzing-0.214
ping-0.193
 snowball-0.181
Token suddenly
Token ID6451
Feature activation+9.81e-01

Pos logit contributions
 nib+1.254
 hopping+1.231
 buzzing+1.218
ping+1.163
 snowball+1.090
Neg logit contributions
 promise-1.147
 clerk-1.107
 bills-1.066
 grandparents-1.060
keeper-1.048
Token,
Token ID11
Feature activation+7.56e-01

Pos logit contributions
 nib+1.769
 hopping+1.675
 buzzing+1.649
ping+1.611
 snowball+1.536
Neg logit contributions
 promise-1.635
 clerk-1.569
 bills-1.522
 wisely-1.518
 grandparents-1.517
Token she
Token ID673
Feature activation+1.65e+00

Pos logit contributions
 nib+1.570
 hopping+1.459
 buzzing+1.457
ping+1.421
 snowball+1.330
Neg logit contributions
 clerk-1.088
 promise-1.047
 bills-0.993
 grandparents-0.983
keeper-0.983
Token started
Token ID2067
Feature activation+2.96e+00

Pos logit contributions
 nib+3.356
 hopping+3.274
 buzzing+3.266
ping+3.151
 gra+3.103
Neg logit contributions
 promise-2.288
 clerk-2.140
 grandparents-2.125
keeper-2.112
olded-2.109
Token to
Token ID284
Feature activation+4.18e+00

Pos logit contributions
ping+7.357
 nib+7.302
bee+7.274
 snowball+7.247
 Buzz+7.125
Neg logit contributions
 promise-2.367
 clerk-2.312
 grandparents-2.126
 grandmother-2.098
 wisely-2.087
Token shrink
Token ID22085
Feature activation+1.79e+00

Pos logit contributions
ping+9.349
pill+8.976
 hopping+8.961
 buzzing+8.570
eh+8.503
Neg logit contributions
 promise-8.414
 reward-5.466
 grandparents-5.361
 grandma-5.325
 choice-5.310
Token!
Token ID0
Feature activation+4.41e-01

Pos logit contributions
 nib+3.494
 hopping+3.336
 buzzing+3.106
 gra+3.005
 prow+2.978
Neg logit contributions
 promise-2.742
 clerk-2.646
 bills-2.634
 wisely-2.620
keeper-2.565
Token Lily
Token ID20037
Feature activation+1.87e+00

Pos logit contributions
 nib+0.702
 buzzing+0.657
 hopping+0.643
ping+0.606
 snowball+0.559
Neg logit contributions
 clerk-0.863
 promise-0.821
olding-0.811
olded-0.804
 grandparents-0.798
Token was
Token ID373
Feature activation+1.62e+00

Pos logit contributions
 hopping+3.634
 nib+3.633
 buzzing+3.553
ping+3.495
 gra+3.477
Neg logit contributions
 promise-2.765
olded-2.506
keeper-2.504
 clerk-2.457
 wisely-2.418
Token so
Token ID523
Feature activation+5.68e-01

Pos logit contributions
 nib+3.058
 hopping+2.843
 gra+2.801
 buzzing+2.785
ping+2.739
Neg logit contributions
 promise-2.476
 clerk-2.321
 wisely-2.257
olded-2.255
 manager-2.236
Token.
Token ID13
Feature activation+2.57e+00

Pos logit contributions
 nib+3.050
 hopping+2.838
 buzzing+2.697
ping+2.597
 snowball+2.511
Neg logit contributions
 promise-2.733
 wisely-2.514
keeper-2.469
belt-2.393
ounded-2.379
Token It
Token ID632
Feature activation+1.73e+00

Pos logit contributions
 buzzing+4.631
 nib+4.560
 hopping+4.358
ping+4.280
bee+4.006
Neg logit contributions
 clerk-4.437
 promise-4.300
 grandma-4.265
keeper-4.251
 cousin-4.227
Token had
Token ID550
Feature activation+1.70e+00

Pos logit contributions
 nib+3.409
 hopping+3.328
 buzzing+3.227
ping+3.222
 snowball+3.029
Neg logit contributions
 promise-2.517
keeper-2.202
belt-2.152
ounded-2.137
olded-2.111
Token big
Token ID1263
Feature activation+1.84e+00

Pos logit contributions
 nib+3.107
 hopping+2.804
 buzzing+2.743
 snowball+2.621
ping+2.578
Neg logit contributions
 promise-2.656
 clerk-2.602
 bills-2.585
 wisely-2.558
keeper-2.517
Token,
Token ID11
Feature activation+6.67e-01

Pos logit contributions
 nib+2.748
 buzzing+2.449
 hopping+2.404
 gra+2.337
ping+2.305
Neg logit contributions
 promise-3.636
 bills-3.413
 grandparents-3.327
 dollars-3.312
 wisely-3.309
Token soft
Token ID2705
Feature activation+4.18e+00

Pos logit contributions
 nib+1.136
 hopping+1.025
 buzzing+0.994
ping+0.961
 Buzz+0.900
Neg logit contributions
 promise-1.190
 clerk-1.169
belt-1.112
keeper-1.101
olding-1.091
Token wings
Token ID12098
Feature activation+1.89e+00

Pos logit contributions
 nib+8.943
 gra+8.466
 buzzing+7.847
 Larry+7.847
 prow+7.774
Neg logit contributions
 gift-5.624
 grandma-5.236
 reward-5.092
 promise-4.967
 grandmother-4.918
Token with
Token ID351
Feature activation+2.13e+00

Pos logit contributions
 nib+3.684
 hopping+3.288
 snowball+3.281
 Buzz+3.220
 buzzing+3.178
Neg logit contributions
 promise-2.864
 wisely-2.725
olding-2.718
olded-2.664
belt-2.644
Token lots
Token ID6041
Feature activation+8.54e-01

Pos logit contributions
 nib+3.744
 hopping+3.317
 buzzing+3.264
 snowball+3.256
 gra+3.080
Neg logit contributions
 wisely-3.355
 promise-3.330
 bills-3.324
 grandparents-3.215
olded-3.213
Token of
Token ID286
Feature activation+1.87e+00

Pos logit contributions
 nib+2.192
 hopping+2.073
 buzzing+2.054
ping+1.943
 gra+1.927
Neg logit contributions
 promise-0.749
 clerk-0.713
olding-0.705
 wisely-0.677
keeper-0.668
Token fur
Token ID9230
Feature activation+1.91e+00

Pos logit contributions
 nib+2.855
 hopping+2.601
 buzzing+2.454
 Buzz+2.272
ping+2.262
Neg logit contributions
 promise-3.617
 bills-3.506
 grandparents-3.340
 wisely-3.332
 dollars-3.295
Token very
Token ID845
Feature activation+7.80e-01

Pos logit contributions
 nib+4.487
 buzzing+4.057
ping+3.964
 hopping+3.899
bee+3.854
Neg logit contributions
 clerk-3.901
 promise-3.832
 cheaper-3.817
 grandparents-3.648
 grandmother-3.587
Token hot
Token ID3024
Feature activation+1.89e+00

Pos logit contributions
 nib+1.443
 hopping+1.343
 buzzing+1.323
ping+1.284
 prow+1.215
Neg logit contributions
 promise-1.298
 clerk-1.228
olding-1.211
 policy-1.156
 grandparents-1.154
Token and
Token ID290
Feature activation+1.06e+00

Pos logit contributions
 nib+3.413
 buzzing+3.094
 hopping+3.022
 gra+2.921
 prow+2.856
Neg logit contributions
 clerk-3.267
 policy-3.095
 promise-3.073
olding-3.058
 wisely-3.001
Token he
Token ID339
Feature activation+1.66e+00

Pos logit contributions
 nib+1.925
 buzzing+1.771
 hopping+1.739
ping+1.686
 gra+1.569
Neg logit contributions
 promise-1.745
 clerk-1.689
belt-1.652
 grandparents-1.616
 wisely-1.594
Token started
Token ID2067
Feature activation+3.19e+00

Pos logit contributions
 nib+3.506
 hopping+3.412
 buzzing+3.404
ping+3.336
 tweeting+3.160
Neg logit contributions
 promise-2.281
olded-2.043
 clerk-2.039
keeper-1.992
 grandparents-1.954
Token to
Token ID284
Feature activation+4.17e+00

Pos logit contributions
 nib+8.174
 Buzz+7.822
ping+7.812
bee+7.781
Buzz+7.731
Neg logit contributions
 promise-2.536
 wisely-2.507
 clerk-2.476
olding-2.305
 grandparents-2.251
Token feel
Token ID1254
Feature activation+1.65e+00

Pos logit contributions
ping+8.795
pill+8.653
 hopping+8.133
 buzzing+7.881
 nib+7.868
Neg logit contributions
 promise-8.209
 cheaper-5.542
 reward-5.326
 clerk-5.323
 punishment-5.317
Token funny
Token ID8258
Feature activation+3.18e+00

Pos logit contributions
 nib+3.390
ping+3.228
 hopping+3.179
 buzzing+3.150
bee+3.074
Neg logit contributions
 promise-2.431
 clerk-2.192
 wisely-2.092
olding-2.089
 cheaper-2.083
Token.
Token ID13
Feature activation+8.99e-01

Pos logit contributions
 nib+6.891
 gra+6.500
Buzz+6.259
 hopping+6.216
 buzzing+6.185
Neg logit contributions
 promise-4.168
 wisely-4.021
 clerk-3.984
olding-3.954
 manager-3.940
Token Suddenly
Token ID24975
Feature activation+9.93e-01

Pos logit contributions
 nib+1.476
 buzzing+1.445
 hopping+1.321
ping+1.305
 tweeting+1.232
Neg logit contributions
 clerk-1.677
 promise-1.619
olded-1.596
olding-1.580
keeper-1.568
Token,
Token ID11
Feature activation+8.38e-01

Pos logit contributions
 nib+1.849
 buzzing+1.760
ping+1.699
 hopping+1.658
 gra+1.609
Neg logit contributions
 clerk-1.551
 promise-1.541
 wisely-1.461
 grandparents-1.461
keeper-1.453
Token boy
Token ID2933
Feature activation+9.97e-01

Pos logit contributions
 promise+1.185
 clerk+1.165
olding+1.105
 grandparents+1.087
belt+1.084
Neg logit contributions
 nib-1.935
 hopping-1.835
 buzzing-1.822
ping-1.683
 snowball-1.638
Token went
Token ID1816
Feature activation+3.34e-01

Pos logit contributions
 nib+1.919
 buzzing+1.842
 hopping+1.804
ping+1.743
 tweeting+1.683
Neg logit contributions
 promise-1.580
 clerk-1.499
olded-1.447
 cousin-1.439
 grandparents-1.438
Token outside
Token ID2354
Feature activation+1.57e+00

Pos logit contributions
 nib+0.665
 buzzing+0.618
 hopping+0.605
ping+0.586
 Buzz+0.562
Neg logit contributions
 promise-0.496
 clerk-0.490
olding-0.468
 grandparents-0.458
 manager-0.458
Token and
Token ID290
Feature activation+9.41e-01

Pos logit contributions
 nib+3.549
 buzzing+3.453
ping+3.405
 hopping+3.329
bee+3.257
Neg logit contributions
 promise-1.821
olding-1.616
 grandparents-1.607
 clerk-1.594
 cousin-1.534
Token saw
Token ID2497
Feature activation+1.52e+00

Pos logit contributions
 nib+1.833
 buzzing+1.759
 hopping+1.736
ping+1.691
 snowball+1.614
Neg logit contributions
 promise-1.426
 clerk-1.357
keeper-1.306
 grandparents-1.305
olded-1.296
Token white
Token ID2330
Feature activation+4.17e+00

Pos logit contributions
 nib+2.468
 hopping+2.196
 buzzing+2.186
ping+2.124
bee+1.947
Neg logit contributions
 clerk-2.883
 promise-2.782
 grandparents-2.780
 manager-2.708
 bills-2.702
Token coal
Token ID5655
Feature activation+2.40e+00

Pos logit contributions
 nib+8.493
 gra+7.408
 hopping+7.207
cks+7.202
 buzzing+7.080
Neg logit contributions
 clerk-7.161
 grandma-6.809
 manager-6.720
 bills-6.672
keeper-6.600
Token on
Token ID319
Feature activation+2.74e+00

Pos logit contributions
 nib+4.898
 buzzing+4.464
 hopping+4.414
 Buzz+4.349
 gra+4.212
Neg logit contributions
 clerk-3.409
 bills-3.338
 promise-3.295
keeper-3.276
 manager-3.267
Token the
Token ID262
Feature activation+8.96e-01

Pos logit contributions
 nib+5.002
 hopping+4.957
 buzzing+4.776
ping+4.672
 tweeting+4.571
Neg logit contributions
 bills-4.678
 clerk-4.542
 grandma-4.495
 grandparents-4.383
 grandmother-4.261
Token ground
Token ID2323
Feature activation+1.76e+00

Pos logit contributions
 nib+1.310
 hopping+1.210
 buzzing+1.153
 Buzz+1.024
ping+1.021
Neg logit contributions
 promise-1.846
 clerk-1.835
 grandparents-1.761
 bills-1.755
belt-1.740
Token.
Token ID13
Feature activation+1.43e+00

Pos logit contributions
 nib+3.645
 buzzing+3.335
 hopping+3.297
 snowball+3.153
ping+3.137
Neg logit contributions
 promise-2.494
 clerk-2.480
keeper-2.418
olding-2.415
belt-2.349
Token and
Token ID290
Feature activation+1.82e-01

Pos logit contributions
 nib+3.277
 hopping+3.051
 buzzing+2.893
 gra+2.836
 prow+2.813
Neg logit contributions
 promise-2.976
 clerk-2.886
keeper-2.816
 bills-2.795
 wisely-2.782
Token hotter
Token ID37546
Feature activation+1.97e+00

Pos logit contributions
 nib+0.286
 hopping+0.265
 buzzing+0.262
ping+0.241
 prow+0.221
Neg logit contributions
 promise-0.345
 clerk-0.344
 grandparents-0.327
keeper-0.327
 bills-0.326
Token.
Token ID13
Feature activation+1.01e+00

Pos logit contributions
 nib+4.018
 hopping+3.759
 buzzing+3.564
 gra+3.556
 prow+3.521
Neg logit contributions
 promise-2.832
 clerk-2.710
 bills-2.697
keeper-2.650
 policy-2.649
Token Benny
Token ID44275
Feature activation+2.57e+00

Pos logit contributions
 nib+1.607
 buzzing+1.512
 hopping+1.435
ping+1.383
 snowball+1.275
Neg logit contributions
 clerk-1.963
 promise-1.839
olded-1.834
keeper-1.829
 grandparents-1.806
Token started
Token ID2067
Feature activation+3.66e+00

Pos logit contributions
ping+5.289
 nib+5.276
 hopping+5.179
 buzzing+5.133
 gra+5.093
Neg logit contributions
 promise-3.404
 clerk-3.238
keeper-3.219
olded-3.098
 grandparents-2.899
Token to
Token ID284
Feature activation+4.17e+00

Pos logit contributions
bee+10.027
ping+9.836
 nib+9.679
hing+9.642
 Buzz+9.563
Neg logit contributions
 promise-2.359
 wisely-2.219
olding-2.184
 clerk-2.062
olded-2.015
Token feel
Token ID1254
Feature activation+1.99e+00

Pos logit contributions
ping+8.705
pill+8.520
 hopping+7.822
umping+7.721
clear+7.699
Neg logit contributions
 promise-8.722
 forgive-5.759
 reward-5.754
 punishment-5.749
 grandmother-5.668
Token very
Token ID845
Feature activation+1.23e+00

Pos logit contributions
 nib+4.120
ping+3.991
 hopping+3.917
bee+3.860
 buzzing+3.826
Neg logit contributions
 promise-2.884
 clerk-2.509
 cheaper-2.431
olding-2.392
 wisely-2.376
Token hot
Token ID3024
Feature activation+1.60e+00

Pos logit contributions
 nib+2.299
 hopping+2.160
 buzzing+2.113
ping+2.056
 prow+1.989
Neg logit contributions
 promise-2.080
olding-1.885
 clerk-1.814
olded-1.762
 wisely-1.761
Token and
Token ID290
Feature activation+1.75e+00

Pos logit contributions
 nib+3.020
 hopping+2.743
 buzzing+2.742
 gra+2.635
 prow+2.580
Neg logit contributions
 clerk-2.587
 promise-2.440
olding-2.405
 policy-2.379
 wisely-2.372
Token thirsty
Token ID47124
Feature activation+1.50e+00

Pos logit contributions
 nib+3.158
 buzzing+2.971
 hopping+2.953
ping+2.886
bee+2.742
Neg logit contributions
 promise-2.996
 clerk-2.690
olded-2.594
keeper-2.580
 bills-2.576
Token.
Token ID13
Feature activation+1.46e+00

Pos logit contributions
 nib+2.699
 hopping+2.567
 buzzing+2.557
ping+2.537
 gra+2.344
Neg logit contributions
 promise-2.391
 clerk-2.216
keeper-2.159
olding-2.123
 cousin-2.103
Token↵
Token ID198
Feature activation+1.77e-01

Pos logit contributions
 nib+2.700
 buzzing+2.580
 hopping+2.454
ping+2.426
 snowball+2.205
Neg logit contributions
 clerk-2.483
Of-2.361
 grandparents-2.360
keeper-2.333
 promise-2.307
Token↵
Token ID198
Feature activation+1.65e+00

Pos logit contributions
 nib+0.341
 hopping+0.317
 buzzing+0.316
ping+0.298
 bees+0.277
Neg logit contributions
 promise-0.276
 clerk-0.273
 grandparents-0.263
olding-0.261
keeper-0.255
TokenShe
Token ID3347
Feature activation+1.17e+00

Pos logit contributions
 nib+3.562
 hopping+3.200
ping+3.138
 buzzing+3.120
 Buzz+2.926
Neg logit contributions
 clerk-2.416
 promise-2.289
olding-2.255
Of-2.237
belt-2.211
Token started
Token ID2067
Feature activation+3.57e+00

Pos logit contributions
 nib+2.066
 buzzing+2.064
 hopping+2.040
ping+2.013
 tweeting+1.864
Neg logit contributions
 promise-1.930
 clerk-1.810
olded-1.764
 grandparents-1.763
keeper-1.744
Token to
Token ID284
Feature activation+4.16e+00

Pos logit contributions
ping+9.515
bee+9.414
Buzz+9.379
 nib+9.332
 Buzz+9.303
Neg logit contributions
 wisely-2.144
 promise-2.093
 clerk-1.896
olding-1.820
 manager-1.805
Token play
Token ID711
Feature activation+1.90e+00

Pos logit contributions
ping+8.816
pill+8.568
 hopping+8.013
umping+7.827
 lava+7.814
Neg logit contributions
 promise-8.516
 grandparents-5.636
 reward-5.576
 forgive-5.473
 officer-5.444
Token,
Token ID11
Feature activation+2.44e+00

Pos logit contributions
 nib+3.622
ping+3.349
 hopping+3.321
 buzzing+3.266
 tweeting+3.147
Neg logit contributions
 promise-2.967
 clerk-2.765
 manager-2.685
 wisely-2.587
 responsibly-2.562
Token and
Token ID290
Feature activation+7.67e-01

Pos logit contributions
 nib+4.752
ping+4.494
 buzzing+4.418
 hopping+4.338
 snowball+4.284
Neg logit contributions
 promise-3.617
 clerk-3.332
 wisely-3.222
olding-3.205
Of-3.181
Token she
Token ID673
Feature activation+1.33e+00

Pos logit contributions
 nib+1.417
 hopping+1.342
 buzzing+1.330
ping+1.298
 snowball+1.216
Neg logit contributions
 promise-1.200
 clerk-1.185
keeper-1.124
 wisely-1.120
 bills-1.110
Token moved
Token ID3888
Feature activation+7.10e-01

Pos logit contributions
 nib+2.448
 buzzing+2.375
 hopping+2.356
ping+2.340
 tweeting+2.154
Neg logit contributions
 promise-2.097
 clerk-1.951
belt-1.900
olded-1.885
 manager-1.872
Token where
Token ID810
Feature activation+9.95e-01

Pos logit contributions
 nib+4.073
 buzzing+3.786
 hopping+3.727
ping+3.595
 gra+3.522
Neg logit contributions
 promise-2.034
 grandparents-2.032
 wisely-2.032
 bills-2.023
 clerk-1.978
Token he
Token ID339
Feature activation+8.17e-01

Pos logit contributions
 nib+2.052
 buzzing+1.931
 hopping+1.905
ping+1.884
 tweeting+1.730
Neg logit contributions
 promise-1.402
 clerk-1.364
 policy-1.322
 grandparents-1.311
 bills-1.309
Token was
Token ID373
Feature activation+1.40e+00

Pos logit contributions
 nib+1.534
 hopping+1.463
 buzzing+1.458
ping+1.361
 tweeting+1.303
Neg logit contributions
 promise-1.304
 clerk-1.203
 grandparents-1.169
keeper-1.148
 bills-1.148
Token and
Token ID290
Feature activation+2.50e+00

Pos logit contributions
 nib+2.053
 hopping+1.778
 buzzing+1.747
 snowball+1.726
 Buzz+1.644
Neg logit contributions
 promise-2.886
olding-2.670
 clerk-2.648
 grandparents-2.600
 policy-2.589
Token started
Token ID2067
Feature activation+2.20e+00

Pos logit contributions
 nib+5.237
 buzzing+4.968
 gra+4.867
ping+4.853
 snowball+4.833
Neg logit contributions
 promise-3.403
olded-3.173
keeper-3.079
 clerk-3.052
 wisely-3.005
Token to
Token ID284
Feature activation+4.16e+00

Pos logit contributions
 nib+5.294
ping+4.948
 Buzz+4.925
bee+4.777
 hopping+4.771
Neg logit contributions
 promise-2.069
 clerk-2.033
 wisely-2.030
olding-2.006
keeper-1.906
Token panic
Token ID13619
Feature activation+2.23e+00

Pos logit contributions
eh+8.570
ping+8.473
pill+8.420
 hopping+8.197
 nib+8.018
Neg logit contributions
 promise-7.532
 officer-5.069
 grandparents-4.954
ounded-4.874
 grandmother-4.868
Token.
Token ID13
Feature activation+1.37e+00

Pos logit contributions
 nib+4.918
 hopping+4.486
 gra+4.388
 buzzing+4.326
 tweeting+4.182
Neg logit contributions
 promise-2.942
 clerk-2.743
 manager-2.721
keeper-2.715
 wisely-2.645
Token He
Token ID679
Feature activation+1.12e+00

Pos logit contributions
 buzzing+2.712
 nib+2.694
 hopping+2.580
ping+2.533
 tweeting+2.416
Neg logit contributions
keeper-1.987
 clerk-1.980
 promise-1.917
 grandparents-1.832
 manager-1.822
Token flew
Token ID13112
Feature activation+2.97e-01

Pos logit contributions
 nib+2.028
 hopping+1.947
 buzzing+1.940
ping+1.859
 gra+1.774
Neg logit contributions
 promise-1.822
 clerk-1.652
olded-1.643
keeper-1.609
 grandparents-1.607
Token around
Token ID1088
Feature activation+1.56e+00

Pos logit contributions
 nib+0.565
 buzzing+0.528
 hopping+0.526
ping+0.488
 gra+0.466
Neg logit contributions
 promise-0.466
 clerk-0.459
keeper-0.437
 wisely-0.434
olding-0.434
Token a
Token ID257
Feature activation+1.71e-01

Pos logit contributions
 nib+3.707
 buzzing+3.447
ping+3.339
 hopping+3.240
 gra+3.073
Neg logit contributions
 grandparents-3.866
 promise-3.760
 grandma-3.728
 bills-3.693
 clerk-3.677
Token sudden
Token ID4802
Feature activation+1.78e+00

Pos logit contributions
 nib+0.299
 hopping+0.277
 buzzing+0.273
ping+0.265
 Buzz+0.245
Neg logit contributions
 promise-0.296
 clerk-0.287
 grandparents-0.276
keeper-0.272
olding-0.270
Token,
Token ID11
Feature activation+2.35e+00

Pos logit contributions
 nib+3.617
 buzzing+3.455
ping+3.409
 gra+3.288
 hopping+3.250
Neg logit contributions
 clerk-2.402
 promise-2.399
 bills-2.380
 policy-2.331
 wisely-2.309
Token he
Token ID339
Feature activation+9.99e-01

Pos logit contributions
 nib+5.296
ping+5.203
 buzzing+5.129
 hopping+4.825
 tweeting+4.766
Neg logit contributions
 clerk-2.770
belt-2.600
 bills-2.588
 grandma-2.583
 promise-2.562
Token started
Token ID2067
Feature activation+2.47e+00

Pos logit contributions
 nib+1.969
 buzzing+1.861
 hopping+1.825
ping+1.776
 Buzz+1.702
Neg logit contributions
 promise-1.457
 clerk-1.395
 grandparents-1.359
keeper-1.350
belt-1.343
Token to
Token ID284
Feature activation+4.16e+00

Pos logit contributions
 nib+6.242
ping+5.809
 Buzz+5.804
bee+5.798
Buzz+5.677
Neg logit contributions
 promise-2.139
 clerk-2.133
keeper-2.070
 manager-2.065
 grandparents-2.029
Token hear
Token ID3285
Feature activation+2.30e+00

Pos logit contributions
ping+8.906
 hopping+8.084
 nib+8.023
pill+7.926
 buzzing+7.818
Neg logit contributions
 promise-7.954
 officer-6.038
 manager-5.974
 clerk-5.936
 grandma-5.773
Token noises
Token ID26782
Feature activation+1.87e+00

Pos logit contributions
 nib+4.100
 hopping+3.709
 buzzing+3.562
ping+3.440
 tweeting+3.318
Neg logit contributions
 clerk-4.003
 grandparents-3.915
 manager-3.896
 bills-3.891
 grandma-3.853
Token.
Token ID13
Feature activation+2.11e+00

Pos logit contributions
 nib+3.816
 hopping+3.439
 Buzz+3.396
 buzzing+3.338
 gra+3.256
Neg logit contributions
 clerk-2.633
 promise-2.603
olding-2.592
 manager-2.574
 officer-2.471
Token He
Token ID679
Feature activation+1.32e+00

Pos logit contributions
 buzzing+3.959
 nib+3.951
ping+3.738
 hopping+3.616
bee+3.480
Neg logit contributions
 clerk-3.506
keeper-3.345
 manager-3.227
olded-3.224
belt-3.163
Token looked
Token ID3114
Feature activation+1.38e+00

Pos logit contributions
 nib+2.546
 hopping+2.410
 buzzing+2.387
ping+2.320
 Buzz+2.177
Neg logit contributions
 promise-2.006
 grandparents-1.889
 clerk-1.877
keeper-1.848
olded-1.838
Token sky
Token ID6766
Feature activation+1.39e+00

Pos logit contributions
 nib+1.053
 buzzing+0.942
 hopping+0.941
ping+0.874
 Buzz+0.858
Neg logit contributions
 promise-0.999
 clerk-0.977
 grandparents-0.923
keeper-0.908
 manager-0.896
Token and
Token ID290
Feature activation+1.01e+00

Pos logit contributions
 nib+2.397
 hopping+2.188
 buzzing+2.140
 snowball+2.089
 gra+2.070
Neg logit contributions
 promise-2.412
keeper-2.281
 clerk-2.269
 wisely-2.241
olding-2.227
Token saw
Token ID2497
Feature activation+1.19e+00

Pos logit contributions
 nib+1.935
 hopping+1.857
 buzzing+1.831
ping+1.771
 snowball+1.723
Neg logit contributions
 promise-1.521
 clerk-1.444
 grandparents-1.431
keeper-1.416
 wisely-1.384
Token big
Token ID1263
Feature activation+2.39e+00

Pos logit contributions
 nib+1.959
 hopping+1.722
 buzzing+1.684
ping+1.641
 gra+1.496
Neg logit contributions
 clerk-2.245
 promise-2.200
 grandparents-2.167
keeper-2.127
 bills-2.114
Token,
Token ID11
Feature activation+2.04e+00

Pos logit contributions
 nib+4.198
 hopping+3.645
 buzzing+3.606
 gra+3.577
eh+3.500
Neg logit contributions
 promise-4.238
 clerk-4.190
 grandparents-4.115
 cousin-4.041
 grandma-4.040
Token fluffy
Token ID39145
Feature activation+4.16e+00

Pos logit contributions
 nib+3.690
ping+3.261
 hopping+3.259
 Buzz+3.014
 buzzing+3.009
Neg logit contributions
 promise-3.509
 clerk-3.486
 cheaper-3.354
keeper-3.337
 manager-3.241
Token clouds
Token ID15114
Feature activation+2.87e+00

Pos logit contributions
 nib+9.062
 gra+8.556
eh+7.845
 prow+7.802
 hopping+7.711
Neg logit contributions
 grandma-6.430
 clerk-5.906
 cousin-5.845
keeper-5.830
 grandparents-5.819
Token.
Token ID13
Feature activation+1.50e+00

Pos logit contributions
 nib+6.051
 hopping+5.480
 Buzz+5.445
 gra+5.386
 snowball+5.361
Neg logit contributions
 promise-3.884
olding-3.700
keeper-3.578
 clerk-3.522
 manager-3.412
Token↵
Token ID198
Feature activation-2.87e-01

Pos logit contributions
 nib+3.121
 buzzing+2.983
ping+2.920
 hopping+2.864
bee+2.623
Neg logit contributions
 clerk-2.097
 promise-2.020
Of-2.015
olded-1.992
keeper-1.990
Token↵
Token ID198
Feature activation+1.84e+00

Pos logit contributions
 promise+0.482
 clerk+0.471
olding+0.452
 grandparents+0.450
keeper+0.442
Neg logit contributions
 nib-0.522
 hopping-0.489
 buzzing-0.486
ping-0.449
 Buzz-0.422
TokenThe
Token ID464
Feature activation+4.95e-02

Pos logit contributions
 nib+3.867
ping+3.631
 hopping+3.581
 buzzing+3.528
 tweeting+3.264
Neg logit contributions
 promise-2.506
 clerk-2.475
olding-2.409
Of-2.373
olded-2.305
Token ceiling
Token ID13387
Feature activation+1.42e-01

Pos logit contributions
 nib+1.111
 hopping+1.025
 buzzing+0.990
ping+0.956
 Buzz+0.876
Neg logit contributions
 promise-1.292
 clerk-1.233
 grandparents-1.194
 manager-1.172
 bills-1.171
Token break
Token ID2270
Feature activation+4.54e-02

Pos logit contributions
 nib+0.249
 hopping+0.231
 buzzing+0.227
ping+0.208
 Buzz+0.200
Neg logit contributions
 promise-0.257
 clerk-0.243
olding-0.234
belt-0.232
 grandparents-0.231
Token?"
Token ID1701
Feature activation-1.85e+00

Pos logit contributions
 nib+0.074
 hopping+0.068
 buzzing+0.067
ping+0.060
 Buzz+0.058
Neg logit contributions
 promise-0.087
 clerk-0.085
 grandparents-0.081
belt-0.081
olding-0.081
Token↵
Token ID198
Feature activation-1.98e+00

Pos logit contributions
 promise+3.634
 clerk+3.439
olding+3.370
 policy+3.332
belt+3.331
Neg logit contributions
 nib-2.977
 hopping-2.800
 buzzing-2.742
ping-2.439
 Buzz-2.334
Token↵
Token ID198
Feature activation-2.10e+00

Pos logit contributions
 promise+4.062
 clerk+3.956
 bills+3.879
belt+3.837
olding+3.835
Neg logit contributions
 nib-2.909
 hopping-2.726
 buzzing-2.714
ping-2.324
 snowball-2.247
TokenHer
Token ID9360
Feature activation-4.83e+00

Pos logit contributions
 promise+3.881
 clerk+3.759
 bills+3.738
 policy+3.658
belt+3.641
Neg logit contributions
 nib-3.419
 buzzing-3.279
 hopping-3.231
ping-2.815
 jumping-2.741
Token grandma
Token ID49890
Feature activation-1.00e+00

Pos logit contributions
 wisely+6.885
 responsibly+6.738
belt+6.448
bek+6.432
 policy+6.350
Neg logit contributions
 buzzing-10.935
 nib-10.641
 hopping-10.506
 snowball-10.024
 jumping-9.961
Token explained
Token ID4893
Feature activation-2.88e-01

Pos logit contributions
 promise+1.407
 clerk+1.376
olding+1.321
belt+1.302
 bills+1.298
Neg logit contributions
 nib-2.126
 hopping-2.001
 buzzing-1.986
ping-1.829
 Buzz-1.785
Token,
Token ID11
Feature activation+5.99e-01

Pos logit contributions
 promise+0.486
 clerk+0.484
olding+0.460
 grandparents+0.456
 bills+0.455
Neg logit contributions
 nib-0.525
 hopping-0.489
 buzzing-0.483
ping-0.445
 snowball-0.427
Token "
Token ID366
Feature activation-1.68e-01

Pos logit contributions
 nib+1.174
 hopping+1.066
 buzzing+1.061
 snowball+0.998
ping+0.996
Neg logit contributions
 clerk-0.941
 promise-0.893
olding-0.872
keeper-0.870
olded-0.863
TokenSometimes
Token ID15468
Feature activation+5.69e-01

Pos logit contributions
 clerk+0.277
 promise+0.275
olding+0.263
belt+0.260
keeper+0.260
Neg logit contributions
 nib-0.315
 hopping-0.294
 buzzing-0.288
ping-0.266
 Buzz-0.255
Token was
Token ID373
Feature activation-1.13e+00

Pos logit contributions
 promise+0.988
 clerk+0.973
olding+0.933
 grandparents+0.926
belt+0.920
Neg logit contributions
 nib-0.856
 hopping-0.794
 buzzing-0.788
ping-0.712
 Buzz-0.676
Token fixed
Token ID5969
Feature activation+2.86e-02

Pos logit contributions
 promise+1.877
 clerk+1.855
olding+1.772
 grandparents+1.756
keeper+1.745
Neg logit contributions
 nib-2.119
 hopping-2.000
 buzzing-1.974
ping-1.800
 Buzz-1.736
Token,
Token ID11
Feature activation-1.60e+00

Pos logit contributions
 nib+0.049
 hopping+0.045
 buzzing+0.044
ping+0.040
 Buzz+0.038
Neg logit contributions
 promise-0.052
 clerk-0.051
 grandparents-0.049
belt-0.049
olding-0.049
Token Lily
Token ID20037
Feature activation+5.69e-02

Pos logit contributions
 promise+2.954
olding+2.836
 clerk+2.812
belt+2.750
olded+2.734
Neg logit contributions
 nib-2.750
 hopping-2.595
 buzzing-2.564
 Buzz-2.293
ping-2.227
Token asked
Token ID1965
Feature activation-2.99e+00

Pos logit contributions
 nib+0.102
 hopping+0.096
 buzzing+0.096
ping+0.088
 Buzz+0.085
Neg logit contributions
 promise-0.097
 clerk-0.095
 grandparents-0.091
keeper-0.090
olded-0.089
Token her
Token ID607
Feature activation-4.72e+00

Pos logit contributions
 promise+7.892
belt+7.691
 clerk+7.488
olding+7.485
ounded+7.435
Neg logit contributions
 nib-3.054
 hopping-2.756
 buzzing-2.658
 Buzz-2.263
ping-2.197
Token grandma
Token ID49890
Feature activation-9.63e-01

Pos logit contributions
belt+8.407
 policy+8.294
ounded+7.964
bek+7.880
olding+7.824
Neg logit contributions
 nib-9.265
 buzzing-8.874
 hopping-8.512
 jumping-7.964
 snowball-7.874
Token,
Token ID11
Feature activation-1.84e+00

Pos logit contributions
 promise+1.572
 clerk+1.548
olding+1.478
belt+1.475
 grandparents+1.461
Neg logit contributions
 nib-1.813
 hopping-1.707
 buzzing-1.685
ping-1.532
 Buzz-1.496
Token "
Token ID366
Feature activation-2.05e+00

Pos logit contributions
 promise+3.811
 clerk+3.643
 bills+3.593
belt+3.575
 policy+3.550
Neg logit contributions
 nib-2.826
 hopping-2.707
 buzzing-2.630
 Buzz-2.244
ping-2.208
TokenWhy
Token ID5195
Feature activation-4.46e-01

Pos logit contributions
 promise+4.145
olding+3.898
 clerk+3.839
olded+3.821
 bills+3.788
Neg logit contributions
 nib-3.160
 buzzing-2.966
 hopping-2.956
ping-2.629
 snowball-2.524
Token did
Token ID750
Feature activation+9.56e-02

Pos logit contributions
 promise+0.703
 clerk+0.685
olding+0.652
 grandparents+0.650
 bills+0.642
Neg logit contributions
 nib-0.874
 hopping-0.821
 buzzing-0.814
ping-0.751
 Buzz-0.720
Token open
Token ID1280
Feature activation-8.56e-01

Pos logit contributions
 promise+0.270
 clerk+0.258
 grandparents+0.248
olding+0.246
 bills+0.244
Neg logit contributions
 nib-0.279
 hopping-0.263
 buzzing-0.262
ping-0.244
 snowball-0.230
Token it
Token ID340
Feature activation-1.18e+00

Pos logit contributions
 promise+1.819
 clerk+1.795
olding+1.740
 grandparents+1.713
belt+1.710
Neg logit contributions
 nib-1.196
 hopping-1.095
 buzzing-1.081
ping-0.950
 Buzz-0.903
Token.
Token ID13
Feature activation-2.58e-01

Pos logit contributions
 promise+2.332
 clerk+2.304
olding+2.233
 grandparents+2.213
olded+2.200
Neg logit contributions
 nib-1.840
 hopping-1.697
 buzzing-1.669
ping-1.475
 Buzz-1.427
Token He
Token ID679
Feature activation-1.02e+00

Pos logit contributions
 clerk+0.469
 promise+0.468
olding+0.445
 grandparents+0.445
keeper+0.442
Neg logit contributions
 nib-0.437
 buzzing-0.410
 hopping-0.407
ping-0.371
 Buzz-0.347
Token asked
Token ID1965
Feature activation-2.33e+00

Pos logit contributions
 promise+1.874
 clerk+1.844
olding+1.787
 grandparents+1.758
 bills+1.750
Neg logit contributions
 nib-1.733
 hopping-1.598
 buzzing-1.576
ping-1.402
 Buzz-1.361
Token his
Token ID465
Feature activation-4.68e+00

Pos logit contributions
 promise+6.314
olding+6.087
 clerk+6.068
belt+6.001
ounded+5.992
Neg logit contributions
 nib-2.165
 hopping-1.865
 buzzing-1.732
 Buzz-1.438
ping-1.383
Token grandma
Token ID49890
Feature activation-7.85e-01

Pos logit contributions
belt+7.753
olded+7.288
ounded+7.190
 responsibly+7.186
olding+7.058
Neg logit contributions
 nib-10.285
 hopping-9.536
 buzzing-9.406
 jumping-8.829
 carrot-8.715
Token if
Token ID611
Feature activation-1.75e+00

Pos logit contributions
 promise+1.200
 clerk+1.177
olding+1.120
 grandparents+1.108
belt+1.106
Neg logit contributions
 nib-1.557
 hopping-1.470
 buzzing-1.455
ping-1.333
 Buzz-1.295
Token he
Token ID339
Feature activation-2.92e+00

Pos logit contributions
 promise+3.369
olding+3.208
 clerk+3.165
belt+3.062
olded+3.049
Neg logit contributions
 nib-2.884
 hopping-2.728
 buzzing-2.641
 Buzz-2.388
 bees-2.298
Token could
Token ID714
Feature activation-1.16e+00

Pos logit contributions
 clerk+4.941
 promise+4.814
olding+4.731
belt+4.721
 grandparents+4.679
Neg logit contributions
 nib-5.706
 hopping-5.166
 buzzing-4.894
 Buzz-4.549
 crawling-4.405
Token but
Token ID475
Feature activation-2.10e+00

Pos logit contributions
 clerk+2.083
 promise+2.063
olding+1.992
 grandparents+1.979
 bills+1.971
Neg logit contributions
 nib-1.982
 hopping-1.836
 buzzing-1.810
ping-1.589
 Buzz-1.569
Token outside
Token ID2354
Feature activation-1.51e-01

Pos logit contributions
 nib+0.421
 hopping+0.389
 buzzing+0.388
ping+0.354
 tweeting+0.335
Neg logit contributions
 promise-0.439
 clerk-0.437
belt-0.418
olding-0.413
 manager-0.412
Token anymore
Token ID7471
Feature activation+1.69e-01

Pos logit contributions
 promise+0.228
 clerk+0.225
olding+0.212
 grandparents+0.211
belt+0.211
Neg logit contributions
 nib-0.298
 hopping-0.279
 buzzing-0.279
ping-0.257
 Buzz-0.246
Token.
Token ID13
Feature activation-9.76e-01

Pos logit contributions
 nib+0.297
 buzzing+0.278
 hopping+0.276
ping+0.255
 snowball+0.242
Neg logit contributions
 promise-0.289
 clerk-0.283
 grandparents-0.273
olding-0.272
 bills-0.269
Token↵
Token ID198
Feature activation-2.65e-01

Pos logit contributions
 promise+2.020
 clerk+1.975
olding+1.901
 grandparents+1.888
 bills+1.887
Neg logit contributions
 nib-1.435
 hopping-1.327
 buzzing-1.294
ping-1.156
 Buzz-1.118
Token↵
Token ID198
Feature activation-6.20e-01

Pos logit contributions
 promise+0.495
 clerk+0.488
olding+0.470
 grandparents+0.467
belt+0.459
Neg logit contributions
 nib-0.433
 hopping-0.401
 buzzing-0.399
ping-0.363
 Buzz-0.342
TokenHis
Token ID6653
Feature activation-4.65e+00

Pos logit contributions
 promise+1.113
 clerk+1.096
olding+1.049
 grandparents+1.042
 bills+1.034
Neg logit contributions
 nib-1.073
 hopping-1.001
 buzzing-0.994
ping-0.901
 Buzz-0.862
Token grandma
Token ID49890
Feature activation-6.82e-01

Pos logit contributions
 wisely+6.540
olded+6.511
 responsibly+6.492
ounded+6.474
olding+6.176
Neg logit contributions
 nib-10.968
 hopping-10.555
 snowball-10.215
 buzzing-10.161
 jumping-9.869
Token told
Token ID1297
Feature activation+4.69e-01

Pos logit contributions
 promise+1.081
 clerk+1.062
olding+1.014
 grandparents+1.003
 bills+0.997
Neg logit contributions
 nib-1.313
 hopping-1.234
 buzzing-1.225
ping-1.122
 Buzz-1.082
Token him
Token ID683
Feature activation+2.51e-01

Pos logit contributions
 nib+0.596
 buzzing+0.541
 hopping+0.534
ping+0.450
 tweeting+0.431
Neg logit contributions
 promise-1.060
 clerk-1.059
 bills-1.011
 grandparents-1.007
olding-1.006
Token that
Token ID326
Feature activation+7.11e-01

Pos logit contributions
 nib+0.497
 buzzing+0.462
 hopping+0.457
ping+0.431
 snowball+0.415
Neg logit contributions
 promise-0.386
 clerk-0.381
olding-0.362
 bills-0.359
keeper-0.359
Token it
Token ID340
Feature activation+1.43e-01

Pos logit contributions
 nib+1.310
 buzzing+1.214
 hopping+1.188
ping+1.150
 snowball+1.070
Neg logit contributions
 clerk-1.159
 promise-1.151
 grandparents-1.126
 bills-1.116
 policy-1.102
Token was
Token ID373
Feature activation-6.82e-01

Pos logit contributions
 promise+0.851
 clerk+0.837
olding+0.798
 grandparents+0.797
belt+0.789
Neg logit contributions
 nib-0.799
 hopping-0.746
 buzzing-0.741
ping-0.673
 Buzz-0.644
Token very
Token ID845
Feature activation-2.81e+00

Pos logit contributions
 promise+1.297
 clerk+1.279
olding+1.229
 grandparents+1.220
 bills+1.213
Neg logit contributions
 nib-1.106
 hopping-1.028
 buzzing-1.019
ping-0.916
 Buzz-0.873
Token happy
Token ID3772
Feature activation-4.00e-01

Pos logit contributions
 clerk+5.258
 bills+5.240
 grandparents+5.133
belt+5.122
 policy+5.083
Neg logit contributions
 nib-4.975
 buzzing-4.723
 hopping-4.637
 jumping-4.040
 Buzz-3.904
Token and
Token ID290
Feature activation-2.15e+00

Pos logit contributions
 promise+0.757
 clerk+0.748
olding+0.717
 grandparents+0.714
 bills+0.705
Neg logit contributions
 nib-0.651
 hopping-0.603
 buzzing-0.599
ping-0.543
 snowball-0.517
Token thanked
Token ID26280
Feature activation-2.25e+00

Pos logit contributions
 promise+3.866
 clerk+3.772
olding+3.737
 policy+3.653
 bills+3.644
Neg logit contributions
 nib-3.934
 hopping-3.747
 buzzing-3.662
ping-3.166
 jumping-3.163
Token her
Token ID607
Feature activation-4.64e+00

Pos logit contributions
 promise+5.313
olding+5.195
ounded+5.095
 clerk+5.092
olded+5.058
Neg logit contributions
 nib-2.752
 hopping-2.502
 buzzing-2.502
ping-2.137
 Buzz-2.033
Token grandma
Token ID49890
Feature activation-1.70e+00

Pos logit contributions
 policy+6.429
ounded+6.422
olding+6.347
belt+6.343
olded+6.331
Neg logit contributions
 nib-10.688
 buzzing-10.307
 hopping-9.924
 snowball-8.992
ping-8.952
Token for
Token ID329
Feature activation-1.68e+00

Pos logit contributions
 clerk+2.617
 promise+2.613
belt+2.532
 bills+2.474
olding+2.473
Neg logit contributions
 nib-3.373
 hopping-3.187
 buzzing-3.107
ping-2.869
 Buzz-2.783
Token letting
Token ID9616
Feature activation-2.33e+00

Pos logit contributions
 promise+3.851
 clerk+3.743
olding+3.710
belt+3.664
keeper+3.655
Neg logit contributions
 nib-2.201
 hopping-2.070
 buzzing-2.016
ping-1.736
 Buzz-1.618
Token her
Token ID607
Feature activation-1.85e+00

Pos logit contributions
 promise+5.788
olding+5.702
belt+5.641
 clerk+5.623
ounded+5.575
Neg logit contributions
 nib-2.450
 hopping-2.224
 buzzing-2.195
ping-1.787
 Buzz-1.747
Token unlock
Token ID12116
Feature activation-2.06e+00

Pos logit contributions
 clerk+3.525
 promise+3.419
olding+3.406
belt+3.390
olded+3.308
Neg logit contributions
 nib-3.091
 hopping-2.832
 buzzing-2.760
 Buzz-2.360
 gra-2.347
Token so
Token ID523
Feature activation-9.07e-01

Pos logit contributions
 promise+0.111
 clerk+0.108
olding+0.103
 grandparents+0.103
olded+0.102
Neg logit contributions
 nib-0.101
 hopping-0.093
 buzzing-0.093
ping-0.085
 snowball-0.081
Token happy
Token ID3772
Feature activation+6.85e-01

Pos logit contributions
 promise+1.332
 clerk+1.327
 grandparents+1.251
olding+1.251
 bills+1.249
Neg logit contributions
 nib-1.864
 hopping-1.755
 buzzing-1.754
ping-1.598
 Buzz-1.552
Token and
Token ID290
Feature activation-8.16e-01

Pos logit contributions
 nib+1.279
 buzzing+1.170
 hopping+1.169
ping+1.109
 snowball+1.089
Neg logit contributions
 clerk-1.112
 promise-1.103
 grandparents-1.066
olding-1.058
keeper-1.056
Token she
Token ID673
Feature activation+2.78e-01

Pos logit contributions
 promise+1.244
 clerk+1.224
olding+1.168
 grandparents+1.151
 bills+1.147
Neg logit contributions
 nib-1.634
 hopping-1.541
 buzzing-1.529
ping-1.398
 Buzz-1.353
Token thanked
Token ID26280
Feature activation-1.62e+00

Pos logit contributions
 nib+0.566
 hopping+0.543
 buzzing+0.538
ping+0.510
 Buzz+0.485
Neg logit contributions
 promise-0.406
 clerk-0.386
 grandparents-0.370
olded-0.367
keeper-0.362
Token her
Token ID607
Feature activation-4.64e+00

Pos logit contributions
 promise+3.812
olding+3.725
 clerk+3.719
belt+3.656
olded+3.638
Neg logit contributions
 nib-1.954
 hopping-1.776
 buzzing-1.722
ping-1.463
 Buzz-1.444
Token grandma
Token ID49890
Feature activation-1.71e+00

Pos logit contributions
 policy+6.646
olding+6.488
ounded+6.337
olded+6.270
belt+6.200
Neg logit contributions
 nib-10.824
 buzzing-10.159
 hopping-10.149
 snowball-9.291
 jumping-9.149
Token for
Token ID329
Feature activation-5.72e-01

Pos logit contributions
 clerk+2.502
 promise+2.486
belt+2.419
 bills+2.362
olding+2.355
Neg logit contributions
 nib-3.528
 hopping-3.341
 buzzing-3.256
ping-3.020
 Buzz-2.926
Token the
Token ID262
Feature activation-1.38e+00

Pos logit contributions
 promise+1.194
 clerk+1.179
olding+1.135
 grandparents+1.129
 bills+1.122
Neg logit contributions
 nib-0.817
 hopping-0.750
 buzzing-0.744
ping-0.659
 Buzz-0.623
Token y
Token ID331
Feature activation+2.29e-01

Pos logit contributions
 promise+2.177
 clerk+2.170
belt+2.123
olding+2.108
olded+2.103
Neg logit contributions
 nib-2.636
 buzzing-2.483
 hopping-2.482
ping-2.223
 Buzz-2.161
Tokenummy
Token ID13513
Feature activation-8.01e-01

Pos logit contributions
 nib+0.477
 hopping+0.455
 buzzing+0.452
ping+0.411
 Buzz+0.399
Neg logit contributions
 promise-0.336
 clerk-0.321
 grandparents-0.307
olding-0.305
 policy-0.303
Token,
Token ID11
Feature activation-2.19e+00

Pos logit contributions
 promise+1.117
 clerk+1.100
olding+1.055
 grandparents+1.047
 bills+1.041
Neg logit contributions
 nib-0.967
 hopping-0.898
 buzzing-0.891
ping-0.802
 Buzz-0.765
Token please
Token ID3387
Feature activation-5.16e-01

Pos logit contributions
 promise+3.318
 clerk+3.300
olding+3.259
belt+3.187
olded+3.178
Neg logit contributions
 nib-4.249
 buzzing-4.054
 hopping-4.033
ping-3.589
 snowball-3.586
Token?"
Token ID1701
Feature activation-1.66e+00

Pos logit contributions
 promise+1.037
 clerk+1.020
olding+0.981
 grandparents+0.975
 bills+0.968
Neg logit contributions
 nib-0.782
 hopping-0.723
 buzzing-0.718
ping-0.643
 Buzz-0.608
Token
Token ID198
Feature activation-1.71e+00

Pos logit contributions
 promise+3.375
 clerk+3.240
olding+3.200
belt+3.167
 bills+3.157
Neg logit contributions
 nib-2.536
 hopping-2.334
 buzzing-2.282
ping-2.021
 Buzz-1.971
Token
Token ID198
Feature activation-1.17e+00

Pos logit contributions
 promise+3.471
 clerk+3.374
 bills+3.315
belt+3.297
olding+3.275
Neg logit contributions
 nib-2.612
 hopping-2.416
 buzzing-2.386
 Buzz-2.093
ping-2.072
TokenHer
Token ID9360
Feature activation-4.64e+00

Pos logit contributions
 promise+2.014
 clerk+1.963
olding+1.893
 bills+1.893
belt+1.868
Neg logit contributions
 nib-2.103
 buzzing-1.966
 hopping-1.964
ping-1.761
 Buzz-1.690
Token grandma
Token ID49890
Feature activation-1.14e+00

Pos logit contributions
 responsibly+6.421
 wisely+6.283
belt+6.018
olded+5.931
olding+5.908
Neg logit contributions
 nib-10.798
 buzzing-10.666
 hopping-10.416
 snowball-10.008
 jumping-9.878
Token replied
Token ID8712
Feature activation-3.77e-01

Pos logit contributions
 promise+1.676
 clerk+1.641
olding+1.583
 grandparents+1.573
belt+1.570
Neg logit contributions
 nib-2.341
 hopping-2.186
 buzzing-2.171
ping-1.983
 Buzz-1.961
Token,
Token ID11
Feature activation+5.12e-01

Pos logit contributions
 promise+0.701
 clerk+0.694
olding+0.666
 grandparents+0.654
keeper+0.652
Neg logit contributions
 nib-0.624
 hopping-0.577
 buzzing-0.574
ping-0.519
 snowball-0.496
Token "
Token ID366
Feature activation-2.22e+00

Pos logit contributions
 nib+0.987
 buzzing+0.911
 hopping+0.906
ping+0.845
 snowball+0.842
Neg logit contributions
 clerk-0.819
 promise-0.775
olding-0.754
keeper-0.752
olded-0.745
TokenI
Token ID40
Feature activation-2.48e+00

Pos logit contributions
 promise+3.000
 grandparents+2.825
 bills+2.781
 clerk+2.763
 policy+2.755
Neg logit contributions
 nib-4.943
 buzzing-4.721
 hopping-4.644
 Buzz-4.196
ping-4.131
Token love
Token ID1842
Feature activation+2.14e-01

Pos logit contributions
 clerk+2.549
 bills+2.382
belt+2.359
 promise+2.356
 grandparents+2.351
Neg logit contributions
 nib-3.206
 buzzing-2.919
 hopping-2.888
 snowball-2.592
ping-2.577
Token you
Token ID345
Feature activation-1.32e+00

Pos logit contributions
 nib+0.432
 buzzing+0.405
 hopping+0.403
ping+0.376
 snowball+0.364
Neg logit contributions
 promise-0.315
 clerk-0.313
 grandparents-0.309
 bills-0.298
olding-0.289
Token."
Token ID526
Feature activation-1.85e+00

Pos logit contributions
 clerk+2.609
 promise+2.557
olding+2.500
 bills+2.470
belt+2.453
Neg logit contributions
 nib-2.055
 hopping-1.909
 buzzing-1.867
ping-1.682
 Buzz-1.610
Token
Token ID198
Feature activation-2.81e-01

Pos logit contributions
 promise+3.892
 clerk+3.731
belt+3.656
olding+3.646
 bills+3.608
Neg logit contributions
 nib-2.767
 hopping-2.589
 buzzing-2.467
 Buzz-2.175
ping-2.124
Token
Token ID198
Feature activation-1.59e+00

Pos logit contributions
 promise+0.495
 clerk+0.488
 grandparents+0.467
olding+0.466
keeper+0.459
Neg logit contributions
 nib-0.490
 hopping-0.456
 buzzing-0.453
ping-0.413
 Buzz-0.393
TokenHer
Token ID9360
Feature activation-4.63e+00

Pos logit contributions
 promise+3.012
 clerk+2.911
olding+2.851
 bills+2.833
belt+2.809
Neg logit contributions
 nib-2.595
 hopping-2.450
 buzzing-2.450
ping-2.140
 snowball-2.070
Token grandma
Token ID49890
Feature activation-6.27e-01

Pos logit contributions
 responsibly+6.214
 wisely+5.987
ounded+5.938
belt+5.913
olding+5.809
Neg logit contributions
 buzzing-10.725
 nib-10.641
 hopping-10.224
 snowball-9.803
 jumping-9.626
Token said
Token ID531
Feature activation-8.97e-01

Pos logit contributions
 promise+0.905
 clerk+0.888
olding+0.841
 grandparents+0.833
 bills+0.825
Neg logit contributions
 nib-1.296
 hopping-1.223
 buzzing-1.216
ping-1.122
 Buzz-1.084
Token,
Token ID11
Feature activation-9.72e-01

Pos logit contributions
 promise+1.753
 clerk+1.724
olding+1.661
 grandparents+1.657
belt+1.645
Neg logit contributions
 nib-1.408
 hopping-1.313
 buzzing-1.293
ping-1.150
 Buzz-1.118
Token "
Token ID366
Feature activation-1.74e+00

Pos logit contributions
 promise+1.829
 clerk+1.776
olding+1.715
 grandparents+1.704
 bills+1.697
Neg logit contributions
 nib-1.626
 hopping-1.522
 buzzing-1.502
ping-1.344
 Buzz-1.302
TokenBy
Token ID3886
Feature activation-3.08e-02

Pos logit contributions
 promise+3.356
 clerk+3.162
olding+3.109
 bills+3.105
 grandparents+3.077
Neg logit contributions
 nib-2.865
 hopping-2.721
 buzzing-2.694
ping-2.409
 Buzz-2.364
Token love
Token ID1842
Feature activation-1.88e-02

Pos logit contributions
 nib+0.469
 hopping+0.456
 buzzing+0.448
ping+0.423
 snowball+0.398
Neg logit contributions
 promise-0.409
 clerk-0.381
olding-0.372
 grandparents-0.369
olded-0.368
Token you
Token ID345
Feature activation+1.42e-01

Pos logit contributions
 promise+0.031
 clerk+0.031
 grandparents+0.030
 bills+0.030
olding+0.029
Neg logit contributions
 nib-0.034
 hopping-0.032
 buzzing-0.032
ping-0.029
 snowball-0.028
Token."
Token ID526
Feature activation-1.87e+00

Pos logit contributions
 nib+0.255
 hopping+0.238
 buzzing+0.237
ping+0.214
 snowball+0.210
Neg logit contributions
 promise-0.248
 clerk-0.237
 grandparents-0.229
olding-0.229
 bills-0.227
Token
Token ID198
Feature activation-6.71e-01

Pos logit contributions
 promise+3.894
 clerk+3.700
belt+3.673
olding+3.651
 bills+3.615
Neg logit contributions
 nib-2.787
 hopping-2.600
 buzzing-2.507
 Buzz-2.203
ping-2.188
Token
Token ID198
Feature activation-1.73e+00

Pos logit contributions
 promise+1.188
 clerk+1.169
olding+1.119
 grandparents+1.109
 bills+1.103
Neg logit contributions
 nib-1.174
 hopping-1.096
 buzzing-1.088
ping-0.987
 Buzz-0.947
TokenHer
Token ID9360
Feature activation-4.62e+00

Pos logit contributions
 promise+2.750
 clerk+2.620
 bills+2.570
olding+2.533
 policy+2.527
Neg logit contributions
 nib-3.338
 buzzing-3.190
 hopping-3.176
ping-2.830
 jumping-2.763
Token grandma
Token ID49890
Feature activation+1.61e-02

Pos logit contributions
 responsibly+6.895
belt+6.665
 wisely+6.616
ounded+6.556
 dollars+6.428
Neg logit contributions
 buzzing-10.282
 nib-10.249
 hopping-10.021
 jumping-9.706
 snowball-9.252
Token sounded
Token ID14846
Feature activation-1.55e+00

Pos logit contributions
 nib+0.034
 hopping+0.033
 buzzing+0.033
ping+0.031
 snowball+0.029
Neg logit contributions
 promise-0.022
 clerk-0.022
keeper-0.020
 grandparents-0.020
olding-0.020
Token very
Token ID845
Feature activation-1.53e+00

Pos logit contributions
 promise+2.683
 clerk+2.658
belt+2.614
 grandparents+2.591
 bills+2.567
Neg logit contributions
 nib-2.826
 buzzing-2.644
 hopping-2.624
ping-2.343
 Buzz-2.270
Token happy
Token ID3772
Feature activation+5.08e-02

Pos logit contributions
 clerk+2.374
 promise+2.349
belt+2.304
 bills+2.271
 grandparents+2.257
Neg logit contributions
 nib-2.992
 buzzing-2.828
 hopping-2.792
ping-2.524
 Buzz-2.474
Token and
Token ID290
Feature activation-1.16e+00

Pos logit contributions
 nib+0.078
 hopping+0.072
 buzzing+0.072
ping+0.065
 snowball+0.063
Neg logit contributions
 promise-0.100
 clerk-0.098
olding-0.095
 grandparents-0.094
keeper-0.094
Token.
Token ID13
Feature activation-6.99e-01

Pos logit contributions
 promise+1.842
 clerk+1.814
olding+1.747
 grandparents+1.738
 bills+1.724
Neg logit contributions
 nib-1.474
 hopping-1.371
 buzzing-1.358
ping-1.218
 Buzz-1.153
Token Tom
Token ID4186
Feature activation+1.13e-01

Pos logit contributions
 promise+1.502
 clerk+1.479
olding+1.427
 grandparents+1.419
 bills+1.411
Neg logit contributions
 nib-0.952
 hopping-0.873
 buzzing-0.860
ping-0.757
 Buzz-0.717
Token smiled
Token ID13541
Feature activation-6.08e-01

Pos logit contributions
 nib+0.225
 buzzing+0.214
 hopping+0.214
ping+0.199
 snowball+0.191
Neg logit contributions
 promise-0.173
 clerk-0.167
 grandparents-0.160
keeper-0.158
olded-0.156
Token and
Token ID290
Feature activation-5.52e-01

Pos logit contributions
 promise+1.259
 clerk+1.243
olding+1.197
 grandparents+1.190
 bills+1.182
Neg logit contributions
 nib-0.880
 hopping-0.810
 buzzing-0.803
ping-0.712
 Buzz-0.674
Token thanked
Token ID26280
Feature activation-1.52e+00

Pos logit contributions
 promise+0.745
 clerk+0.729
olding+0.686
 grandparents+0.682
 bills+0.674
Neg logit contributions
 nib-1.191
 hopping-1.128
 buzzing-1.122
ping-1.041
 Buzz-1.005
Token his
Token ID465
Feature activation-4.62e+00

Pos logit contributions
 promise+3.476
 clerk+3.396
olding+3.361
olded+3.297
ounded+3.283
Neg logit contributions
 nib-1.913
 hopping-1.732
 buzzing-1.671
ping-1.458
 Buzz-1.405
Token mom
Token ID1995
Feature activation-1.20e+00

Pos logit contributions
ounded+6.643
 wisely+6.592
 policy+6.553
olded+6.497
 responsibly+6.463
Neg logit contributions
 nib-10.156
 hopping-9.649
 buzzing-9.381
 snowball-9.014
 jumping-8.915
Token for
Token ID329
Feature activation-1.33e+00

Pos logit contributions
 promise+1.987
 clerk+1.969
olding+1.876
 bills+1.858
belt+1.857
Neg logit contributions
 nib-2.229
 hopping-2.107
 buzzing-2.062
ping-1.894
 Buzz-1.815
Token helping
Token ID5742
Feature activation-1.28e+00

Pos logit contributions
 promise+2.973
 clerk+2.912
olding+2.859
keeper+2.821
belt+2.814
Neg logit contributions
 nib-1.758
 hopping-1.647
 buzzing-1.599
ping-1.388
 Buzz-1.289
Token him
Token ID683
Feature activation-4.25e-01

Pos logit contributions
 promise+3.175
 clerk+3.123
olding+3.068
belt+3.029
olded+3.012
Neg logit contributions
 nib-1.343
 hopping-1.210
 buzzing-1.162
ping-0.996
 Buzz-0.901
Token send
Token ID3758
Feature activation-1.36e+00

Pos logit contributions
 promise+0.773
 clerk+0.754
olding+0.724
 grandparents+0.720
 bills+0.713
Neg logit contributions
 nib-0.726
 hopping-0.676
 buzzing-0.672
ping-0.610
 Buzz-0.584
Token you
Token ID345
Feature activation-1.08e+00

Pos logit contributions
 promise+2.448
 clerk+2.411
olding+2.344
belt+2.296
 bills+2.290
Neg logit contributions
 nib-1.739
 hopping-1.601
 buzzing-1.566
ping-1.400
 Buzz-1.368
Token soon
Token ID2582
Feature activation-2.55e-01

Pos logit contributions
 promise+1.914
 clerk+1.911
olding+1.831
 bills+1.808
belt+1.792
Neg logit contributions
 nib-1.876
 hopping-1.747
 buzzing-1.725
ping-1.551
 Buzz-1.502
Token?"
Token ID1701
Feature activation-1.62e+00

Pos logit contributions
 promise+0.499
 clerk+0.488
olding+0.469
 grandparents+0.469
belt+0.463
Neg logit contributions
 nib-0.394
 hopping-0.364
 buzzing-0.363
ping-0.326
 snowball-0.310
Token
Token ID198
Feature activation-8.69e-01

Pos logit contributions
 promise+3.310
 clerk+3.139
olding+3.072
belt+3.068
 bills+3.055
Neg logit contributions
 nib-2.527
 hopping-2.375
 buzzing-2.286
 Buzz-1.993
ping-1.993
Token
Token ID198
Feature activation-1.31e+00

Pos logit contributions
 promise+1.577
 clerk+1.548
olding+1.485
 bills+1.471
belt+1.471
Neg logit contributions
 nib-1.483
 hopping-1.386
 buzzing-1.374
ping-1.237
 Buzz-1.192
TokenHer
Token ID9360
Feature activation-4.57e+00

Pos logit contributions
 promise+2.273
 clerk+2.203
olding+2.130
 bills+2.127
belt+2.099
Neg logit contributions
 nib-2.362
 hopping-2.226
 buzzing-2.222
ping-1.975
 snowball-1.915
Token grandma
Token ID49890
Feature activation-3.96e-01

Pos logit contributions
 responsibly+6.031
 wisely+5.841
ounded+5.753
bek+5.700
belt+5.660
Neg logit contributions
 buzzing-10.734
 nib-10.563
 hopping-10.343
 snowball-9.872
 jumping-9.753
Token said
Token ID531
Feature activation-1.09e+00

Pos logit contributions
 promise+0.544
 clerk+0.535
olding+0.500
 grandparents+0.498
keeper+0.492
Neg logit contributions
 nib-0.843
 hopping-0.801
 buzzing-0.797
ping-0.739
 snowball-0.710
Token,
Token ID11
Feature activation-5.78e-01

Pos logit contributions
 promise+2.174
 clerk+2.133
 grandparents+2.065
olding+2.055
belt+2.050
Neg logit contributions
 nib-1.665
 hopping-1.557
 buzzing-1.525
ping-1.346
 Buzz-1.321
Token "
Token ID366
Feature activation-2.65e+00

Pos logit contributions
 promise+1.033
 clerk+1.019
olding+0.974
 grandparents+0.967
belt+0.960
Neg logit contributions
 nib-1.000
 hopping-0.932
 buzzing-0.926
ping-0.840
 Buzz-0.803
TokenOf
Token ID5189
Feature activation-2.25e+00

Pos logit contributions
 promise+3.084
 bills+2.791
belt+2.760
 grandparents+2.742
 policy+2.693
Neg logit contributions
 nib-6.504
 hopping-6.227
 buzzing-6.205
 Buzz-5.767
ping-5.596
Token some
Token ID617
Feature activation-2.37e-01

Pos logit contributions
 nib+0.391
 hopping+0.377
 buzzing+0.376
ping+0.347
 tweeting+0.319
Neg logit contributions
 promise-0.500
 clerk-0.486
 bills-0.477
 grandparents-0.470
 cheaper-0.460
Token.
Token ID13
Feature activation-1.62e+00

Pos logit contributions
 promise+0.534
 clerk+0.520
olding+0.504
 bills+0.502
 grandparents+0.500
Neg logit contributions
 nib-0.298
 hopping-0.276
 buzzing-0.275
ping-0.242
 Buzz-0.222
Token
Token ID220
Feature activation-2.10e+00

Pos logit contributions
 promise+3.285
olding+3.118
 clerk+3.114
belt+3.064
 bills+3.044
Neg logit contributions
 nib-2.455
 hopping-2.238
 buzzing-2.182
 Buzz-1.938
ping-1.930
Token
Token ID198
Feature activation-2.00e+00

Pos logit contributions
 promise+4.397
 clerk+4.295
 bills+4.184
belt+4.148
olding+4.089
Neg logit contributions
 nib-2.989
 hopping-2.837
 buzzing-2.730
 Buzz-2.490
ping-2.433
Token
Token ID198
Feature activation-1.41e+00

Pos logit contributions
 promise+4.178
 clerk+4.067
 bills+4.036
belt+4.031
olding+3.950
Neg logit contributions
 nib-2.935
 hopping-2.702
 buzzing-2.637
 Buzz-2.357
ping-2.252
TokenHer
Token ID9360
Feature activation-4.55e+00

Pos logit contributions
 promise+2.282
 clerk+2.183
 bills+2.142
olding+2.122
belt+2.108
Neg logit contributions
 nib-2.689
 buzzing-2.529
 hopping-2.524
ping-2.255
 Buzz-2.182
Token grandma
Token ID49890
Feature activation-1.19e+00

Pos logit contributions
 responsibly+6.912
belt+6.818
 wisely+6.798
olding+6.269
ounded+6.269
Neg logit contributions
 nib-10.214
 buzzing-9.737
 hopping-9.590
 jumping-9.094
 snowball-8.898
Token said
Token ID531
Feature activation-1.07e+00

Pos logit contributions
 promise+1.644
 clerk+1.598
olding+1.551
belt+1.546
 grandparents+1.527
Neg logit contributions
 nib-2.555
 hopping-2.383
 buzzing-2.367
ping-2.160
 Buzz-2.156
Token,
Token ID11
Feature activation-1.69e-01

Pos logit contributions
 promise+1.891
 clerk+1.842
 grandparents+1.782
olding+1.777
belt+1.759
Neg logit contributions
 nib-1.907
 hopping-1.788
 buzzing-1.765
ping-1.574
 Buzz-1.571
Token "
Token ID366
Feature activation-1.92e+00

Pos logit contributions
 clerk+0.288
 promise+0.285
olding+0.272
 grandparents+0.269
keeper+0.268
Neg logit contributions
 nib-0.305
 hopping-0.283
 buzzing-0.282
ping-0.261
 snowball-0.248
TokenOf
Token ID5189
Feature activation-1.74e+00

Pos logit contributions
 promise+2.874
 clerk+2.680
 bills+2.673
 grandparents+2.671
belt+2.651
Neg logit contributions
 nib-4.015
 buzzing-3.787
 hopping-3.723
ping-3.375
 Buzz-3.359
Token went
Token ID1816
Feature activation-2.44e-01

Pos logit contributions
 nib+0.834
 buzzing+0.781
 hopping+0.775
ping+0.720
 Buzz+0.701
Neg logit contributions
 promise-0.833
 clerk-0.817
 grandparents-0.792
 cousin-0.768
olded-0.767
Token to
Token ID284
Feature activation+4.14e-01

Pos logit contributions
 promise+0.386
 clerk+0.384
olding+0.365
 grandparents+0.361
belt+0.358
Neg logit contributions
 nib-0.466
 buzzing-0.433
 hopping-0.432
ping-0.401
 Buzz-0.386
Token the
Token ID262
Feature activation-9.18e-02

Pos logit contributions
 nib+0.627
 buzzing+0.589
 hopping+0.589
ping+0.558
 snowball+0.503
Neg logit contributions
 promise-0.818
 clerk-0.798
 grandparents-0.795
 cousin-0.769
 grandmother-0.764
Token park
Token ID3952
Feature activation-1.66e-01

Pos logit contributions
 clerk+0.170
 promise+0.169
 grandparents+0.162
keeper+0.159
olding+0.158
Neg logit contributions
 nib-0.152
 hopping-0.141
 buzzing-0.140
ping-0.130
 Buzz-0.122
Token with
Token ID351
Feature activation-2.07e+00

Pos logit contributions
 promise+0.269
 clerk+0.265
olding+0.254
 grandparents+0.249
keeper+0.249
Neg logit contributions
 nib-0.311
 buzzing-0.291
 hopping-0.291
ping-0.271
 snowball-0.259
Token her
Token ID607
Feature activation-4.54e+00

Pos logit contributions
 promise+5.328
olding+5.231
 clerk+5.108
belt+5.064
keeper+5.019
Neg logit contributions
 nib-2.200
 hopping-2.037
 buzzing-1.935
 Buzz-1.529
 jumping-1.525
Token mom
Token ID1995
Feature activation-6.58e-01

Pos logit contributions
olding+8.778
olded+8.093
 wisely+7.977
 responsibly+7.955
 promise+7.950
Neg logit contributions
 hopping-9.086
 nib-8.988
 buzzing-8.808
 jumping-8.225
 snowball-8.042
Token.
Token ID13
Feature activation+6.42e-02

Pos logit contributions
 promise+1.213
 clerk+1.194
olding+1.145
 grandparents+1.136
 bills+1.130
Neg logit contributions
 nib-1.104
 hopping-1.028
 buzzing-1.019
ping-0.921
 Buzz-0.879
Token She
Token ID1375
Feature activation-3.73e-02

Pos logit contributions
 nib+0.098
 buzzing+0.089
 hopping+0.089
ping+0.080
 snowball+0.076
Neg logit contributions
 clerk-0.129
 promise-0.126
 grandparents-0.122
olding-0.122
belt-0.121
Token was
Token ID373
Feature activation+8.89e-01

Pos logit contributions
 promise+0.062
 clerk+0.061
 grandparents+0.059
belt+0.058
olding+0.057
Neg logit contributions
 nib-0.068
 buzzing-0.064
 hopping-0.064
ping-0.060
 Buzz-0.057
Token playing
Token ID2712
Feature activation+1.53e+00

Pos logit contributions
 nib+1.520
 buzzing+1.370
 hopping+1.345
ping+1.291
 snowball+1.263
Neg logit contributions
 clerk-1.547
 promise-1.536
olded-1.466
olding-1.465
 grandparents-1.462
Token the
Token ID262
Feature activation-1.33e+00

Pos logit contributions
 promise+1.605
 clerk+1.582
olding+1.529
 grandparents+1.512
belt+1.508
Neg logit contributions
 nib-1.114
 hopping-1.025
 buzzing-1.013
ping-0.897
 Buzz-0.853
Token apples
Token ID22514
Feature activation-6.44e-01

Pos logit contributions
 promise+2.310
 clerk+2.264
olding+2.231
olded+2.225
belt+2.180
Neg logit contributions
 nib-2.372
 hopping-2.220
 buzzing-2.185
ping-1.944
 Buzz-1.899
Token.
Token ID13
Feature activation-1.30e+00

Pos logit contributions
 promise+1.214
 clerk+1.197
olding+1.149
 grandparents+1.141
 bills+1.133
Neg logit contributions
 nib-1.041
 hopping-0.967
 buzzing-0.959
ping-0.863
 Buzz-0.824
Token
Token ID198
Feature activation-6.29e-01

Pos logit contributions
 promise+2.763
 clerk+2.648
olding+2.589
belt+2.569
 bills+2.541
Neg logit contributions
 nib-1.881
 hopping-1.721
 buzzing-1.700
 Buzz-1.483
ping-1.476
Token
Token ID198
Feature activation-7.35e-01

Pos logit contributions
 promise+1.106
 clerk+1.089
olding+1.042
 grandparents+1.034
 bills+1.026
Neg logit contributions
 nib-1.105
 hopping-1.032
 buzzing-1.024
ping-0.930
 Buzz-0.892
TokenHer
Token ID9360
Feature activation-4.52e+00

Pos logit contributions
 promise+1.164
 clerk+1.138
olding+1.085
 grandparents+1.075
 bills+1.070
Neg logit contributions
 nib-1.430
 hopping-1.344
 buzzing-1.338
ping-1.222
 Buzz-1.180
Token grandma
Token ID49890
Feature activation-5.85e-01

Pos logit contributions
 responsibly+5.645
olding+5.529
 wisely+5.519
ounded+5.348
olded+5.134
Neg logit contributions
 nib-10.968
 buzzing-10.566
 hopping-10.327
 snowball-10.165
 jumping-9.572
Token was
Token ID373
Feature activation-1.35e+00

Pos logit contributions
 promise+0.918
 clerk+0.903
olding+0.857
 grandparents+0.851
 bills+0.843
Neg logit contributions
 nib-1.135
 hopping-1.068
 buzzing-1.061
ping-0.974
 Buzz-0.937
Token shocked
Token ID11472
Feature activation-8.27e-01

Pos logit contributions
 promise+2.275
 clerk+2.215
olding+2.145
 bills+2.137
 policy+2.127
Neg logit contributions
 nib-2.500
 buzzing-2.364
 hopping-2.363
ping-2.120
 jumping-2.028
Token.
Token ID13
Feature activation-2.23e-01

Pos logit contributions
 promise+1.644
 clerk+1.619
olding+1.552
belt+1.549
 grandparents+1.548
Neg logit contributions
 nib-1.274
 hopping-1.182
 buzzing-1.171
ping-1.051
 Buzz-0.995
Token She
Token ID1375
Feature activation-1.74e+00

Pos logit contributions
 clerk+0.480
 promise+0.476
olding+0.460
 grandparents+0.458
keeper+0.454
Neg logit contributions
 nib-0.306
 hopping-0.279
 buzzing-0.277
ping-0.245
 snowball-0.228
Token and
Token ID290
Feature activation-4.40e-01

Pos logit contributions
 promise+5.273
 clerk+5.195
olded+5.090
 grandparents+5.087
olding+5.069
Neg logit contributions
 nib-2.622
 hopping-2.423
 buzzing-2.414
ping-2.111
 bees-1.888
Token thirsty
Token ID47124
Feature activation+7.38e-02

Pos logit contributions
 promise+0.741
 clerk+0.723
olding+0.691
 grandparents+0.685
belt+0.682
Neg logit contributions
 nib-0.804
 hopping-0.754
 buzzing-0.748
ping-0.685
 Buzz-0.658
Token.
Token ID13
Feature activation-1.27e+00

Pos logit contributions
 nib+0.128
 hopping+0.119
 buzzing+0.118
ping+0.108
 Buzz+0.104
Neg logit contributions
 promise-0.131
 clerk-0.128
olding-0.122
 bills-0.121
keeper-0.120
Token They
Token ID1119
Feature activation-5.02e-01

Pos logit contributions
 promise+2.882
 clerk+2.759
olding+2.713
belt+2.679
 grandparents+2.677
Neg logit contributions
 nib-1.638
 hopping-1.489
 buzzing-1.449
ping-1.277
 Buzz-1.224
Token asked
Token ID1965
Feature activation-2.62e+00

Pos logit contributions
 promise+0.911
 clerk+0.894
olding+0.854
 grandparents+0.852
 bills+0.845
Neg logit contributions
 nib-0.849
 hopping-0.795
 buzzing-0.789
ping-0.716
 Buzz-0.683
Token their
Token ID511
Feature activation-4.52e+00

Pos logit contributions
 promise+5.755
olding+5.548
ounded+5.542
 clerk+5.501
belt+5.483
Neg logit contributions
 nib-3.621
 hopping-3.321
 buzzing-3.231
 Buzz-2.905
 bees-2.816
Token grandma
Token ID49890
Feature activation-6.78e-01

Pos logit contributions
olding+6.487
 responsibly+6.330
ounded+6.319
och+6.269
belt+6.232
Neg logit contributions
 nib-10.548
 buzzing-9.716
 hopping-9.621
 snowball-9.482
 snow-9.436
Token if
Token ID611
Feature activation-2.58e+00

Pos logit contributions
 promise+1.161
 clerk+1.143
olding+1.092
 grandparents+1.082
belt+1.076
Neg logit contributions
 nib-1.221
 hopping-1.143
 buzzing-1.133
ping-1.031
 Buzz-0.992
Token they
Token ID484
Feature activation-2.08e+00

Pos logit contributions
 promise+4.388
olding+4.281
ounded+4.023
 clerk+4.001
belt+3.979
Neg logit contributions
 nib-4.863
 hopping-4.647
 buzzing-4.544
 Buzz-4.200
 bees-4.089
Token could
Token ID714
Feature activation-3.62e-01

Pos logit contributions
 clerk+3.185
 promise+2.971
olding+2.971
belt+2.908
 grandparents+2.901
Neg logit contributions
 nib-4.454
 hopping-3.999
 buzzing-3.934
 Buzz-3.650
ping-3.496
Token have
Token ID423
Feature activation+2.68e-01

Pos logit contributions
 promise+0.676
 clerk+0.655
olding+0.629
 grandparents+0.628
belt+0.622
Neg logit contributions
 nib-0.599
 buzzing-0.560
 hopping-0.559
ping-0.509
 Buzz-0.480
Tokenila
Token ID10102
Feature activation+2.66e-01

Pos logit contributions
 promise+1.021
 clerk+1.002
 grandparents+0.937
olding+0.937
 bills+0.927
Neg logit contributions
 nib-1.693
 hopping-1.604
 buzzing-1.596
ping-1.483
 Buzz-1.431
Token felt
Token ID2936
Feature activation-4.05e-01

Pos logit contributions
 nib+0.538
 hopping+0.516
 buzzing+0.511
ping+0.475
 snowball+0.457
Neg logit contributions
 promise-0.402
 clerk-0.391
keeper-0.366
olded-0.365
olding-0.363
Token better
Token ID1365
Feature activation-1.73e+00

Pos logit contributions
 promise+0.549
 clerk+0.535
olding+0.504
 grandparents+0.496
 bills+0.493
Neg logit contributions
 nib-0.876
 hopping-0.829
 buzzing-0.824
ping-0.768
 Buzz-0.739
Token and
Token ID290
Feature activation-1.55e+00

Pos logit contributions
 clerk+3.863
 promise+3.841
belt+3.756
 bills+3.733
 grandparents+3.703
Neg logit contributions
 nib-2.257
 hopping-2.096
 buzzing-2.079
ping-1.783
 jumping-1.646
Token thanked
Token ID26280
Feature activation-1.58e+00

Pos logit contributions
 promise+2.369
 clerk+2.349
belt+2.297
 policy+2.287
 grandparents+2.283
Neg logit contributions
 nib-3.108
 hopping-2.918
 buzzing-2.899
ping-2.589
 Buzz-2.551
Token her
Token ID607
Feature activation-4.48e+00

Pos logit contributions
 promise+3.620
 clerk+3.523
olding+3.485
belt+3.428
olded+3.426
Neg logit contributions
 nib-1.985
 hopping-1.828
 buzzing-1.816
ping-1.556
 Buzz-1.443
Token grandma
Token ID49890
Feature activation-1.24e+00

Pos logit contributions
ounded+7.121
 policy+6.975
belt+6.833
olded+6.831
olding+6.616
Neg logit contributions
 nib-9.549
 buzzing-9.337
 hopping-8.880
 snowball-8.187
 jumping-8.168
Token.
Token ID13
Feature activation-1.57e+00

Pos logit contributions
 promise+2.260
 clerk+2.246
belt+2.168
olding+2.142
 grandparents+2.135
Neg logit contributions
 nib-2.076
 hopping-1.932
 buzzing-1.903
ping-1.737
 Buzz-1.656
Token She
Token ID1375
Feature activation-1.33e+00

Pos logit contributions
 promise+3.862
 clerk+3.701
olding+3.609
 bills+3.578
belt+3.563
Neg logit contributions
 nib-1.709
 hopping-1.588
 buzzing-1.524
ping-1.267
 Buzz-1.192
Token decided
Token ID3066
Feature activation-1.52e+00

Pos logit contributions
 promise+2.086
 clerk+2.072
olding+2.015
 policy+1.992
belt+1.975
Neg logit contributions
 nib-2.627
 hopping-2.416
 buzzing-2.403
ping-2.150
 Buzz-2.112
Token to
Token ID284
Feature activation+2.23e-01

Pos logit contributions
 promise+2.597
 clerk+2.477
olding+2.388
 grandparents+2.376
belt+2.368
Neg logit contributions
 nib-2.816
 hopping-2.694
 buzzing-2.650
ping-2.363
 Buzz-2.295
Token love
Token ID1842
Feature activation+3.04e-01

Pos logit contributions
 nib+0.332
 hopping+0.323
 buzzing+0.317
ping+0.299
 snowball+0.281
Neg logit contributions
 promise-0.295
 clerk-0.275
olding-0.269
 grandparents-0.265
olded-0.265
Token you
Token ID345
Feature activation+3.35e-01

Pos logit contributions
 nib+0.569
 hopping+0.525
 buzzing+0.522
ping+0.484
 snowball+0.468
Neg logit contributions
 promise-0.495
 clerk-0.491
 grandparents-0.481
 bills-0.474
 wisely-0.465
Token."
Token ID526
Feature activation-1.55e+00

Pos logit contributions
 nib+0.610
 hopping+0.572
 buzzing+0.570
ping+0.515
 snowball+0.506
Neg logit contributions
 promise-0.579
 clerk-0.546
olding-0.530
 grandparents-0.528
 bills-0.525
Token
Token ID198
Feature activation-5.94e-01

Pos logit contributions
 promise+3.043
 clerk+2.910
olding+2.865
belt+2.845
 bills+2.819
Neg logit contributions
 nib-2.453
 hopping-2.283
 buzzing-2.220
ping-1.982
 Buzz-1.969
Token
Token ID198
Feature activation-1.53e+00

Pos logit contributions
 promise+1.041
 clerk+1.025
olding+0.980
 grandparents+0.973
 bills+0.965
Neg logit contributions
 nib-1.048
 hopping-0.979
 buzzing-0.972
ping-0.884
 Buzz-0.846
TokenHer
Token ID9360
Feature activation-4.47e+00

Pos logit contributions
 promise+2.296
 clerk+2.180
 bills+2.125
olding+2.108
belt+2.077
Neg logit contributions
 nib-3.079
 buzzing-2.933
 hopping-2.931
ping-2.627
 snowball-2.582
Token grandma
Token ID49890
Feature activation+1.42e-01

Pos logit contributions
 responsibly+5.830
 wisely+5.582
belt+5.580
ounded+5.467
 dollars+5.433
Neg logit contributions
 buzzing-10.367
 nib-10.272
 hopping-10.070
 jumping-9.736
 snowball-9.419
Token sounded
Token ID14846
Feature activation-1.60e+00

Pos logit contributions
 nib+0.307
 hopping+0.297
 buzzing+0.295
ping+0.275
 snowball+0.264
Neg logit contributions
 promise-0.191
 clerk-0.189
keeper-0.175
 grandparents-0.173
olded-0.173
Token very
Token ID845
Feature activation-1.30e+00

Pos logit contributions
 promise+2.683
 clerk+2.666
belt+2.622
 grandparents+2.614
 bills+2.581
Neg logit contributions
 nib-2.968
 buzzing-2.790
 hopping-2.767
ping-2.473
 Buzz-2.399
Token happy
Token ID3772
Feature activation+1.20e-01

Pos logit contributions
 clerk+1.991
 promise+1.987
belt+1.913
 bills+1.891
 grandparents+1.891
Neg logit contributions
 nib-2.584
 buzzing-2.439
 hopping-2.420
ping-2.194
 Buzz-2.147
Token and
Token ID290
Feature activation-1.02e+00

Pos logit contributions
 nib+0.191
 hopping+0.176
 buzzing+0.176
ping+0.159
 snowball+0.154
Neg logit contributions
 promise-0.230
 clerk-0.226
olding-0.218
 grandparents-0.216
keeper-0.215
Token that
Token ID326
Feature activation-1.87e+00

Pos logit contributions
 clerk+4.036
 promise+4.008
belt+3.891
 grandparents+3.862
aila+3.765
Neg logit contributions
 nib-3.839
 hopping-3.670
 buzzing-3.669
ping-3.194
 jumping-3.101
Token wasn
Token ID2492
Feature activation-3.45e-01

Pos logit contributions
 promise+3.372
 clerk+3.329
olding+3.235
 grandparents+3.215
 bills+3.174
Neg logit contributions
 nib-3.243
 buzzing-3.049
 hopping-3.035
ping-2.701
 Buzz-2.668
Token't
Token ID470
Feature activation-4.04e-01

Pos logit contributions
 promise+0.595
 clerk+0.587
olding+0.560
 grandparents+0.553
belt+0.549
Neg logit contributions
 nib-0.620
 hopping-0.577
 buzzing-0.574
ping-0.525
 Buzz-0.499
Token hers
Token ID25144
Feature activation-2.25e+00

Pos logit contributions
 promise+0.675
 clerk+0.664
olding+0.633
 grandparents+0.628
 bills+0.627
Neg logit contributions
 nib-0.746
 hopping-0.697
 buzzing-0.692
ping-0.630
 Buzz-0.607
Token.
Token ID13
Feature activation-2.17e+00

Pos logit contributions
 promise+4.796
 clerk+4.706
olded+4.576
 grandparents+4.570
 bills+4.567
Neg logit contributions
 nib-3.107
 hopping-2.954
 buzzing-2.828
ping-2.502
 jumping-2.423
Token Her
Token ID2332
Feature activation-4.46e+00

Pos logit contributions
 promise+5.443
 clerk+5.155
 bills+5.084
 dollars+5.081
belt+5.077
Neg logit contributions
 nib-2.234
 hopping-2.097
 buzzing-1.928
ping-1.630
 Buzz-1.623
Token mom
Token ID1995
Feature activation-1.84e+00

Pos logit contributions
olding+5.659
ounded+5.613
olded+5.564
 wisely+5.412
 dollars+5.349
Neg logit contributions
 nib-10.547
 hopping-10.327
 buzzing-10.145
 jumping-9.524
 crawling-9.409
Token explained
Token ID4893
Feature activation-1.78e-01

Pos logit contributions
 promise+2.898
 policy+2.874
 clerk+2.867
belt+2.860
 bills+2.841
Neg logit contributions
 nib-3.573
 hopping-3.357
 buzzing-3.326
ping-2.970
 jumping-2.858
Token to
Token ID284
Feature activation-2.19e-01

Pos logit contributions
 clerk+0.281
 promise+0.281
olding+0.267
 bills+0.264
 grandparents+0.264
Neg logit contributions
 nib-0.344
 hopping-0.321
 buzzing-0.317
ping-0.293
 snowball-0.284
Token her
Token ID607
Feature activation-1.07e+00

Pos logit contributions
 promise+0.457
 clerk+0.450
 grandparents+0.433
olding+0.433
 bills+0.430
Neg logit contributions
 nib-0.319
 hopping-0.292
 buzzing-0.288
ping-0.254
 snowball-0.245
Token that
Token ID326
Feature activation-6.08e-01

Pos logit contributions
 promise+1.865
 clerk+1.827
olding+1.752
belt+1.739
 grandparents+1.737
Neg logit contributions
 nib-1.892
 hopping-1.786
 buzzing-1.772
ping-1.597
 Buzz-1.546
Token the
Token ID262
Feature activation-7.83e-01

Pos logit contributions
 nib+0.847
 buzzing+0.764
 hopping+0.759
ping+0.673
 Buzz+0.647
Neg logit contributions
 promise-1.048
 clerk-1.018
 grandparents-0.982
olding-0.980
 policy-0.975
Token cool
Token ID3608
Feature activation+5.52e-01

Pos logit contributions
 promise+1.387
 clerk+1.375
olding+1.316
belt+1.298
 grandparents+1.295
Neg logit contributions
 nib-1.377
 hopping-1.287
 buzzing-1.278
ping-1.159
 Buzz-1.108
Token space
Token ID2272
Feature activation-1.30e+00

Pos logit contributions
 nib+1.077
 hopping+1.000
 buzzing+0.996
ping+0.893
 Buzz+0.880
Neg logit contributions
 promise-0.869
 clerk-0.845
 bills-0.830
 grandparents-0.826
keeper-0.806
Token stuff
Token ID3404
Feature activation-7.14e-01

Pos logit contributions
 promise+2.484
 clerk+2.444
olding+2.364
 grandparents+2.328
 bills+2.315
Neg logit contributions
 nib-2.059
 hopping-1.936
 buzzing-1.913
ping-1.719
 Buzz-1.629
Token.
Token ID13
Feature activation-8.41e-01

Pos logit contributions
 promise+1.320
 clerk+1.300
olding+1.248
 grandparents+1.238
 bills+1.229
Neg logit contributions
 nib-1.183
 hopping-1.102
 buzzing-1.092
ping-0.986
 Buzz-0.942
Token His
Token ID2399
Feature activation-4.46e+00

Pos logit contributions
 promise+1.757
 clerk+1.723
olding+1.661
 grandparents+1.652
 bills+1.646
Neg logit contributions
 nib-1.222
 hopping-1.125
 buzzing-1.102
ping-0.981
 Buzz-0.941
Token grandma
Token ID49890
Feature activation-1.52e+00

Pos logit contributions
olding+6.373
olded+6.225
och+6.105
 clerk+6.023
ounded+5.877
Neg logit contributions
 nib-10.389
 hopping-10.124
 jumping-9.537
 buzzing-9.529
 snowball-9.276
Token had
Token ID550
Feature activation-1.33e+00

Pos logit contributions
 promise+2.630
 clerk+2.596
olding+2.546
 bills+2.516
belt+2.512
Neg logit contributions
 nib-2.745
 hopping-2.553
 buzzing-2.511
ping-2.242
 Buzz-2.206
Token to
Token ID284
Feature activation-4.96e-01

Pos logit contributions
 promise+2.544
 clerk+2.507
olding+2.427
belt+2.399
 grandparents+2.394
Neg logit contributions
 nib-2.173
 hopping-2.030
 buzzing-1.998
ping-1.759
 Buzz-1.677
Token convince
Token ID11508
Feature activation+2.46e-01

Pos logit contributions
 promise+0.821
 clerk+0.800
olding+0.762
 grandparents+0.757
 bills+0.751
Neg logit contributions
 nib-0.923
 hopping-0.867
 buzzing-0.862
ping-0.790
 Buzz-0.758
Token him
Token ID683
Feature activation-1.24e+00

Pos logit contributions
 nib+0.307
 buzzing+0.274
 hopping+0.269
ping+0.230
 snowball+0.223
Neg logit contributions
 promise-0.565
 clerk-0.563
olding-0.539
 grandparents-0.537
 bills-0.534
Token his
Token ID465
Feature activation-2.34e+00

Pos logit contributions
 nib+1.373
 buzzing+1.241
 hopping+1.208
 gra+1.115
ping+1.070
Neg logit contributions
 clerk-1.751
 promise-1.714
olding-1.662
 wisely-1.661
belt-1.657
Token knee
Token ID10329
Feature activation-1.58e-01

Pos logit contributions
 clerk+3.895
olded+3.781
olding+3.775
 promise+3.735
 wisely+3.651
Neg logit contributions
 nib-4.373
 hopping-4.110
 buzzing-4.062
 snowball-3.868
ping-3.807
Token.
Token ID13
Feature activation-7.93e-01

Pos logit contributions
 promise+0.288
 clerk+0.280
olding+0.270
belt+0.267
keeper+0.265
Neg logit contributions
 nib-0.268
 hopping-0.249
 buzzing-0.246
ping-0.223
 Buzz-0.218
Token
Token ID198
Feature activation-1.16e+00

Pos logit contributions
 promise+1.600
 clerk+1.566
olding+1.509
 grandparents+1.500
 bills+1.492
Neg logit contributions
 nib-1.203
 hopping-1.113
 buzzing-1.097
ping-0.981
 Buzz-0.940
Token
Token ID198
Feature activation-6.02e-01

Pos logit contributions
 promise+2.200
 clerk+2.156
 bills+2.067
olding+2.056
belt+2.042
Neg logit contributions
 nib-1.922
 hopping-1.798
 buzzing-1.768
ping-1.573
 Buzz-1.546
TokenHis
Token ID6653
Feature activation-4.45e+00

Pos logit contributions
 promise+1.084
 clerk+1.068
olding+1.023
 grandparents+1.015
belt+1.007
Neg logit contributions
 nib-1.043
 hopping-0.973
 buzzing-0.966
ping-0.875
 Buzz-0.838
Token grandma
Token ID49890
Feature activation-5.00e-01

Pos logit contributions
 responsibly+5.884
 wisely+5.881
olded+5.834
olding+5.793
ounded+5.770
Neg logit contributions
 nib-10.483
 hopping-10.203
 snowball-10.161
 buzzing-9.862
 jumping-9.654
Token helped
Token ID4193
Feature activation+4.26e-01

Pos logit contributions
 promise+0.875
 clerk+0.863
olding+0.822
 grandparents+0.818
 bills+0.810
Neg logit contributions
 nib-0.877
 hopping-0.819
 buzzing-0.814
ping-0.742
 Buzz-0.709
Token him
Token ID683
Feature activation+1.53e+00

Pos logit contributions
 nib+0.544
 hopping+0.500
 buzzing+0.497
ping+0.424
 tweeting+0.402
Neg logit contributions
 clerk-0.963
 promise-0.962
olding-0.913
 bills-0.910
keeper-0.907
Token up
Token ID510
Feature activation-8.01e-02

Pos logit contributions
 nib+2.818
 buzzing+2.719
 hopping+2.660
ping+2.580
 bees+2.441
Neg logit contributions
 promise-2.738
olding-2.405
 clerk-2.405
 wisely-2.363
keeper-2.311
Token and
Token ID290
Feature activation-1.42e+00

Pos logit contributions
 promise+0.157
 clerk+0.154
olding+0.148
 grandparents+0.146
belt+0.146
Neg logit contributions
 nib-0.125
 hopping-0.115
 buzzing-0.114
ping-0.104
 Buzz-0.098
Token She
Token ID1375
Feature activation+1.37e+00

Pos logit contributions
 nib+1.540
 buzzing+1.467
 hopping+1.405
ping+1.333
 snowball+1.256
Neg logit contributions
 clerk-1.776
 promise-1.774
olding-1.717
 cousin-1.680
 grandparents-1.664
Token loved
Token ID6151
Feature activation+1.88e+00

Pos logit contributions
 nib+2.698
 buzzing+2.516
 hopping+2.502
ping+2.425
 ponds+2.377
Neg logit contributions
 promise-2.055
 clerk-1.946
 grandparents-1.902
 cousin-1.866
aila-1.840
Token to
Token ID284
Feature activation+2.12e+00

Pos logit contributions
 nib+4.374
 buzzing+4.082
 hopping+4.043
ping+4.018
 snowball+3.865
Neg logit contributions
 grandparents-2.189
 clerk-2.105
 bills-2.050
 grandma-2.017
 cousin-1.956
Token play
Token ID711
Feature activation+1.27e+00

Pos logit contributions
ping+4.132
 buzzing+4.010
 nib+3.993
 hopping+3.902
 perched+3.685
Neg logit contributions
 promise-3.587
 grandparents-3.185
 reward-3.157
 clerk-3.076
 grandmother-3.052
Token outside
Token ID2354
Feature activation+2.18e+00

Pos logit contributions
 nib+2.100
 buzzing+1.959
 hopping+1.944
ping+1.902
 tweeting+1.742
Neg logit contributions
 clerk-2.255
 promise-2.241
 grandparents-2.122
 manager-2.096
 cousin-2.076
Token,
Token ID11
Feature activation+2.46e+00

Pos logit contributions
 nib+4.381
ping+4.106
 buzzing+4.050
 gra+3.893
 hopping+3.884
Neg logit contributions
 promise-3.033
 clerk-2.955
 grandparents-2.842
belt-2.820
 manager-2.810
Token but
Token ID475
Feature activation-7.66e-01

Pos logit contributions
 nib+5.215
ping+5.002
 buzzing+4.808
 hopping+4.664
 Buzz+4.556
Neg logit contributions
 promise-3.406
Of-3.047
belt-2.978
 clerk-2.919
 cheaper-2.804
Token one
Token ID530
Feature activation-1.61e-01

Pos logit contributions
 promise+1.447
 clerk+1.430
olding+1.373
 grandparents+1.360
 bills+1.352
Neg logit contributions
 nib-1.253
 hopping-1.166
 buzzing-1.153
ping-1.032
 Buzz-0.995
Token day
Token ID1110
Feature activation-1.84e-01

Pos logit contributions
 promise+0.226
 clerk+0.217
olding+0.207
 grandparents+0.204
 policy+0.201
Neg logit contributions
 nib-0.338
 hopping-0.320
 buzzing-0.318
ping-0.299
 Buzz-0.285
Token it
Token ID340
Feature activation+8.89e-01

Pos logit contributions
 promise+0.305
 clerk+0.302
olding+0.288
 grandparents+0.288
belt+0.284
Neg logit contributions
 nib-0.340
 buzzing-0.315
 hopping-0.314
ping-0.294
 snowball-0.279
Token was
Token ID373
Feature activation+1.41e+00

Pos logit contributions
 nib+1.610
 hopping+1.508
 buzzing+1.482
ping+1.407
 tweeting+1.344
Neg logit contributions
 promise-1.510
 clerk-1.462
 manager-1.386
keeper-1.378
belt-1.364
Token best
Token ID1266
Feature activation-9.52e-01

Pos logit contributions
 nib+1.523
 buzzing+1.424
 hopping+1.368
ping+1.334
 Buzz+1.298
Neg logit contributions
 promise-0.775
 clerk-0.752
 grandparents-0.736
olding-0.697
 cousin-0.695
Token friends
Token ID2460
Feature activation-4.49e-01

Pos logit contributions
 promise+1.076
 clerk+1.063
olding+0.982
belt+0.978
 bills+0.968
Neg logit contributions
 nib-2.275
 hopping-2.172
 buzzing-2.151
ping-1.996
 snowball-1.952
Token.
Token ID13
Feature activation+4.31e-01

Pos logit contributions
 promise+0.827
 clerk+0.813
olding+0.781
 grandparents+0.777
belt+0.768
Neg logit contributions
 nib-0.749
 hopping-0.693
 buzzing-0.691
ping-0.623
 Buzz-0.596
Token They
Token ID1119
Feature activation+7.30e-01

Pos logit contributions
 nib+0.689
 buzzing+0.644
 hopping+0.627
ping+0.577
 tweeting+0.529
Neg logit contributions
 promise-0.825
 clerk-0.816
 grandparents-0.799
olding-0.781
 policy-0.776
Token loved
Token ID6151
Feature activation+1.98e+00

Pos logit contributions
 nib+1.308
 buzzing+1.249
 hopping+1.245
ping+1.180
 snowball+1.130
Neg logit contributions
 promise-1.247
 clerk-1.173
 grandparents-1.152
olded-1.129
belt-1.110
Token to
Token ID284
Feature activation+2.42e+00

Pos logit contributions
 nib+4.663
 buzzing+4.351
 hopping+4.277
ping+4.276
 snowball+4.016
Neg logit contributions
 grandparents-2.327
 bills-2.135
 clerk-2.120
 grandma-2.087
 promise-2.074
Token play
Token ID711
Feature activation+1.35e+00

Pos logit contributions
ping+4.745
 buzzing+4.631
 nib+4.631
 hopping+4.507
 bees+4.113
Neg logit contributions
 promise-4.224
 grandparents-3.644
 reward-3.525
 grandmother-3.408
 cousin-3.394
Token together
Token ID1978
Feature activation+6.42e-01

Pos logit contributions
 nib+2.834
 buzzing+2.655
 hopping+2.645
ping+2.633
 tweeting+2.437
Neg logit contributions
 promise-1.844
 clerk-1.774
 grandparents-1.708
 cousin-1.663
 manager-1.656
Token.
Token ID13
Feature activation+7.41e-02

Pos logit contributions
 nib+1.161
 buzzing+1.082
 hopping+1.076
ping+1.017
 snowball+0.964
Neg logit contributions
 promise-1.071
 clerk-1.026
 grandparents-0.996
 wisely-0.985
olding-0.977
Token
Token ID220
Feature activation-6.89e-01

Pos logit contributions
 nib+0.112
 buzzing+0.105
 hopping+0.103
ping+0.094
 bees+0.085
Neg logit contributions
 promise-0.149
 clerk-0.145
 grandparents-0.142
olding-0.140
 bills-0.138
Token
Token ID198
Feature activation-4.51e-01

Pos logit contributions
 promise+1.215
 clerk+1.195
olding+1.143
 grandparents+1.133
 bills+1.128
Neg logit contributions
 nib-1.196
 hopping-1.118
 buzzing-1.109
ping-1.007
 Buzz-0.967
Token through
Token ID832
Feature activation+3.73e+00

Pos logit contributions
 nib+4.323
 buzzing+4.016
 gra+3.952
 hopping+3.846
 tweeting+3.821
Neg logit contributions
 promise-3.184
 wisely-3.182
 clerk-3.152
olding-3.133
 responsibly-3.038
Token the
Token ID262
Feature activation+2.74e+00

Pos logit contributions
bee+7.709
ping+7.429
 nib+7.354
 buzzing+7.223
 gra+7.144
Neg logit contributions
 grandma-5.123
 grandparents-5.006
 grandmother-4.858
 wisely-4.581
 bills-4.556
Token forest
Token ID8222
Feature activation+2.51e+00

Pos logit contributions
 nib+4.465
 buzzing+3.837
bee+3.818
ping+3.791
 hopping+3.781
Neg logit contributions
 promise-5.132
 clerk-5.120
 grandparents-5.031
keeper-4.849
 grandmother-4.802
Token when
Token ID618
Feature activation+2.19e+00

Pos logit contributions
 nib+4.834
 snowball+4.769
 buzzing+4.448
 gra+4.434
 Buzz+4.428
Neg logit contributions
olding-3.686
 promise-3.683
 wisely-3.588
 clerk-3.478
keeper-3.387
Token he
Token ID339
Feature activation+1.07e+00

Pos logit contributions
ping+4.781
 nib+4.738
 buzzing+4.602
 gra+4.305
bee+4.204
Neg logit contributions
 clerk-2.650
 grandmother-2.614
belt-2.602
 grandparents-2.582
keeper-2.564
Token saw
Token ID2497
Feature activation+2.34e+00

Pos logit contributions
 nib+2.019
 buzzing+1.999
 hopping+1.933
ping+1.932
 snowball+1.812
Neg logit contributions
 promise-1.595
 clerk-1.505
keeper-1.462
olded-1.437
 grandparents-1.430
Token a
Token ID257
Feature activation+9.89e-01

Pos logit contributions
 nib+4.056
 buzzing+3.773
ping+3.734
 hopping+3.615
bee+3.456
Neg logit contributions
 clerk-4.060
 grandparents-3.855
 grandma-3.844
 grandmother-3.827
keeper-3.811
Token big
Token ID1263
Feature activation+2.48e+00

Pos logit contributions
 nib+1.858
ping+1.777
 hopping+1.742
 buzzing+1.699
 Buzz+1.606
Neg logit contributions
 promise-1.599
 clerk-1.537
keeper-1.464
 grandparents-1.440
 cheaper-1.396
Token,
Token ID11
Feature activation+1.98e+00

Pos logit contributions
ping+4.216
 nib+4.197
 hopping+3.889
clear+3.873
 buzzing+3.864
Neg logit contributions
 promise-4.508
 clerk-4.122
 reward-3.878
 policy-3.862
keeper-3.861
Token scary
Token ID14343
Feature activation+3.22e+00

Pos logit contributions
 nib+3.748
ping+3.436
 hopping+3.414
 buzzing+3.274
 Buzz+3.212
Neg logit contributions
 promise-3.185
 clerk-3.151
 cheaper-3.010
keeper-2.900
 bills-2.838
Token wolf
Token ID17481
Feature activation+2.55e+00

Pos logit contributions
 nib+5.546
ping+5.270
clear+5.265
 gra+5.129
 ponds+5.058
Neg logit contributions
 promise-5.575
 cousin-5.243
 clerk-5.226
 grandmother-5.183
 policy-5.156
Token He
Token ID679
Feature activation+1.32e-01

Pos logit contributions
 nib+3.078
 buzzing+3.036
ping+2.868
 hopping+2.832
 tweeting+2.657
Neg logit contributions
 clerk-2.391
 promise-2.295
olding-2.269
keeper-2.268
olded-2.244
Token laughed
Token ID13818
Feature activation+1.97e+00

Pos logit contributions
 nib+0.209
 hopping+0.194
 buzzing+0.192
ping+0.179
 Buzz+0.170
Neg logit contributions
 promise-0.252
 clerk-0.242
 grandparents-0.234
belt-0.233
olding-0.232
Token as
Token ID355
Feature activation+1.09e+00

Pos logit contributions
 nib+3.763
 gra+3.499
 buzzing+3.415
 hopping+3.364
 ponds+3.313
Neg logit contributions
 clerk-2.961
 promise-2.909
 wisely-2.875
keeper-2.816
 grandparents-2.767
Token it
Token ID340
Feature activation+1.19e-01

Pos logit contributions
 nib+2.136
 buzzing+1.957
 hopping+1.939
ping+1.882
 tweeting+1.752
Neg logit contributions
 clerk-1.684
 promise-1.659
belt-1.598
keeper-1.590
 grandparents-1.573
Token spun
Token ID26843
Feature activation+1.70e+00

Pos logit contributions
 nib+0.173
 hopping+0.159
 buzzing+0.156
ping+0.142
 Buzz+0.138
Neg logit contributions
 promise-0.243
 clerk-0.235
belt-0.229
olding-0.227
olded-0.226
Token round
Token ID2835
Feature activation+2.45e+00

Pos logit contributions
 nib+3.257
 hopping+3.036
 buzzing+2.961
ping+2.824
 gra+2.808
Neg logit contributions
 clerk-2.602
 promise-2.572
 wisely-2.519
keeper-2.493
belt-2.474
Token and
Token ID290
Feature activation+5.62e-01

Pos logit contributions
 nib+4.893
 gra+4.620
 hopping+4.604
 buzzing+4.554
 tweeting+4.319
Neg logit contributions
 clerk-3.519
 wisely-3.500
keeper-3.362
 manager-3.286
 bills-3.262
Token round
Token ID2835
Feature activation+2.34e+00

Pos logit contributions
 nib+0.893
 hopping+0.811
 buzzing+0.804
ping+0.755
 snowball+0.708
Neg logit contributions
 promise-1.039
 clerk-1.031
belt-1.004
keeper-0.986
 wisely-0.985
Token.
Token ID13
Feature activation+1.88e+00

Pos logit contributions
 nib+4.920
 gra+4.687
 hopping+4.675
 buzzing+4.653
 tweeting+4.367
Neg logit contributions
 clerk-3.047
 wisely-3.023
keeper-2.885
 bills-2.878
 manager-2.819
Token
Token ID220
Feature activation-5.68e-01

Pos logit contributions
 nib+3.608
 buzzing+3.504
ping+3.290
 hopping+3.256
 tweeting+3.066
Neg logit contributions
 clerk-2.828
 promise-2.765
olding-2.731
 grandparents-2.708
olded-2.674
Token
Token ID198
Feature activation-3.66e-02

Pos logit contributions
 promise+0.987
 clerk+0.972
olding+0.930
 grandparents+0.923
 bills+0.914
Neg logit contributions
 nib-1.002
 hopping-0.935
 buzzing-0.930
ping-0.845
 Buzz-0.808
Token and
Token ID290
Feature activation-5.25e-01

Pos logit contributions
 nib+0.688
 hopping+0.644
 buzzing+0.630
ping+0.595
 snowball+0.578
Neg logit contributions
 promise-0.706
 clerk-0.684
keeper-0.662
olding-0.660
 manager-0.646
Token the
Token ID262
Feature activation-3.72e-03

Pos logit contributions
 promise+1.003
 clerk+0.990
olding+0.947
 grandparents+0.944
 bills+0.937
Neg logit contributions
 nib-0.839
 hopping-0.779
 buzzing-0.773
ping-0.696
 Buzz-0.659
Token big
Token ID1263
Feature activation+1.27e+00

Pos logit contributions
 promise+0.006
 clerk+0.006
 grandparents+0.006
 bills+0.006
olding+0.006
Neg logit contributions
 nib-0.007
 hopping-0.007
 buzzing-0.006
ping-0.006
 Buzz-0.006
Token human
Token ID1692
Feature activation-2.11e-01

Pos logit contributions
 nib+2.326
 hopping+2.144
 buzzing+2.078
ping+2.046
 Buzz+1.959
Neg logit contributions
 promise-2.227
 clerk-2.091
 grandparents-1.994
 grandmother-1.978
 cousin-1.952
Token began
Token ID2540
Feature activation+1.30e+00

Pos logit contributions
 promise+0.363
 clerk+0.354
olding+0.337
keeper+0.337
 grandparents+0.336
Neg logit contributions
 nib-0.374
 hopping-0.350
 buzzing-0.348
ping-0.320
 snowball-0.306
Token to
Token ID284
Feature activation+3.18e+00

Pos logit contributions
 nib+2.882
ping+2.681
 buzzing+2.669
 hopping+2.660
 snowball+2.608
Neg logit contributions
 promise-1.585
 clerk-1.492
olding-1.437
 wisely-1.424
keeper-1.419
Token laugh
Token ID6487
Feature activation+1.06e+00

Pos logit contributions
ping+5.897
 hopping+5.497
 nib+5.240
eh+5.154
pill+5.149
Neg logit contributions
 promise-6.564
 reward-4.840
 cheaper-4.701
 grandparents-4.634
 clerk-4.592
Token.
Token ID13
Feature activation+5.39e-01

Pos logit contributions
 nib+2.032
 hopping+1.876
 buzzing+1.859
ping+1.730
 gra+1.701
Neg logit contributions
 promise-1.648
 clerk-1.629
keeper-1.557
 wisely-1.538
 manager-1.534
Token The
Token ID383
Feature activation-3.71e-01

Pos logit contributions
 nib+0.884
 buzzing+0.828
 hopping+0.811
ping+0.765
 snowball+0.705
Neg logit contributions
 clerk-1.014
 promise-0.974
keeper-0.965
olded-0.957
olding-0.955
Token little
Token ID1310
Feature activation+2.62e-01

Pos logit contributions
 promise+0.630
 clerk+0.617
olding+0.586
 grandparents+0.584
 bills+0.579
Neg logit contributions
 nib-0.683
 hopping-0.636
 buzzing-0.628
ping-0.580
 Buzz-0.559
Token human
Token ID1692
Feature activation+5.04e-01

Pos logit contributions
 nib+0.511
 hopping+0.477
 buzzing+0.467
ping+0.443
 Buzz+0.427
Neg logit contributions
 promise-0.415
 clerk-0.407
 grandparents-0.377
keeper-0.375
olding-0.375
Token The
Token ID383
Feature activation+1.18e+00

Pos logit contributions
 nib+1.377
 buzzing+1.337
 hopping+1.288
ping+1.181
 tweeting+1.143
Neg logit contributions
 clerk-1.659
keeper-1.571
 promise-1.568
olded-1.548
 bills-1.538
Token frog
Token ID21264
Feature activation+2.88e-01

Pos logit contributions
 nib+1.263
 buzzing+1.103
ping+1.056
 hopping+1.056
 prow+0.939
Neg logit contributions
 promise-2.984
 clerk-2.827
 reward-2.704
 manager-2.685
keeper-2.675
Token jumped
Token ID11687
Feature activation+1.35e+00

Pos logit contributions
 nib+0.470
 hopping+0.446
 buzzing+0.440
ping+0.414
 tweeting+0.383
Neg logit contributions
 promise-0.530
 clerk-0.503
keeper-0.486
olding-0.482
olded-0.482
Token away
Token ID1497
Feature activation+8.20e-01

Pos logit contributions
 nib+2.609
 buzzing+2.430
 hopping+2.395
 gra+2.343
ping+2.331
Neg logit contributions
 promise-2.060
 clerk-2.025
 wisely-1.934
olding-1.904
keeper-1.897
Token and
Token ID290
Feature activation+2.33e-01

Pos logit contributions
 nib+1.344
 hopping+1.247
 buzzing+1.232
 gra+1.172
ping+1.150
Neg logit contributions
 promise-1.495
 clerk-1.441
olding-1.400
keeper-1.395
 wisely-1.385
Token Tim
Token ID5045
Feature activation+8.11e-01

Pos logit contributions
 nib+0.356
 hopping+0.337
 buzzing+0.335
ping+0.310
 snowball+0.283
Neg logit contributions
 promise-0.447
 clerk-0.438
keeper-0.418
olded-0.418
belt-0.415
Tokenmy
Token ID1820
Feature activation+5.12e-01

Pos logit contributions
 nib+1.671
 buzzing+1.604
 hopping+1.580
 Buzz+1.433
 gra+1.415
Neg logit contributions
 promise-1.206
 clerk-1.106
 grandparents-1.048
olding-1.047
belt-1.045
Token fell
Token ID3214
Feature activation+1.33e+00

Pos logit contributions
 nib+0.908
 hopping+0.868
 buzzing+0.868
ping+0.810
 Buzz+0.770
Neg logit contributions
 promise-0.867
 clerk-0.820
olded-0.784
keeper-0.780
olding-0.770
Token into
Token ID656
Feature activation+1.72e+00

Pos logit contributions
 nib+2.532
 buzzing+2.325
 hopping+2.263
 gra+2.224
 Buzz+2.122
Neg logit contributions
 promise-2.017
 wisely-1.982
 clerk-1.944
olding-1.915
belt-1.895
Token the
Token ID262
Feature activation+1.74e+00

Pos logit contributions
 nib+2.686
 hopping+2.549
 buzzing+2.524
ping+2.521
 tweeting+2.299
Neg logit contributions
 promise-3.170
 clerk-3.132
 bills-2.924
 grandparents-2.915
 policy-2.899
Token pond
Token ID16723
Feature activation+1.27e+00

Pos logit contributions
 nib+2.171
 hopping+1.901
 buzzing+1.859
ping+1.779
 Buzz+1.705
Neg logit contributions
 promise-4.268
 clerk-3.932
keeper-3.814
 manager-3.756
 cheaper-3.718
Token he
Token ID339
Feature activation+4.00e-01

Pos logit contributions
 nib+1.161
 buzzing+1.072
 hopping+1.038
ping+1.022
 gra+0.932
Neg logit contributions
 clerk-1.058
 promise-1.028
belt-1.000
 grandparents-0.994
 bills-0.987
Token saw
Token ID2497
Feature activation+6.51e-01

Pos logit contributions
 nib+0.733
 buzzing+0.700
 hopping+0.699
ping+0.663
 Buzz+0.628
Neg logit contributions
 promise-0.659
 clerk-0.630
 grandparents-0.603
olded-0.603
olding-0.597
Token a
Token ID257
Feature activation+1.47e-01

Pos logit contributions
 nib+1.013
 hopping+0.922
 buzzing+0.919
ping+0.833
 tweeting+0.771
Neg logit contributions
 clerk-1.302
 promise-1.242
 bills-1.234
 grandparents-1.230
keeper-1.216
Token mouse
Token ID10211
Feature activation+2.17e+00

Pos logit contributions
 nib+0.267
 hopping+0.249
 buzzing+0.245
ping+0.238
 Buzz+0.222
Neg logit contributions
 promise-0.250
 clerk-0.247
 grandparents-0.232
keeper-0.231
 manager-0.228
Token!
Token ID0
Feature activation+3.35e-01

Pos logit contributions
 nib+3.799
ping+3.605
 buzzing+3.444
 hopping+3.437
 Buzz+3.432
Neg logit contributions
 promise-3.793
 clerk-3.532
keeper-3.427
olding-3.378
 manager-3.299
Token He
Token ID679
Feature activation+1.70e+00

Pos logit contributions
 nib+0.513
 buzzing+0.480
 hopping+0.471
ping+0.434
 tweeting+0.397
Neg logit contributions
 clerk-0.676
 promise-0.655
keeper-0.640
olding-0.634
olded-0.630
Token ran
Token ID4966
Feature activation+2.60e-02

Pos logit contributions
 nib+3.277
ping+3.195
 hopping+3.184
 buzzing+3.179
 ponds+3.056
Neg logit contributions
 promise-2.499
keeper-2.466
 clerk-2.430
olded-2.396
 grandparents-2.290
Token to
Token ID284
Feature activation+2.95e-01

Pos logit contributions
 nib+0.047
 hopping+0.044
 buzzing+0.044
ping+0.041
 Buzz+0.039
Neg logit contributions
 clerk-0.043
 promise-0.043
olding-0.041
keeper-0.040
 manager-0.040
Token his
Token ID465
Feature activation-1.59e+00

Pos logit contributions
 nib+0.492
 hopping+0.470
 buzzing+0.467
ping+0.438
 snowball+0.405
Neg logit contributions
 promise-0.544
 clerk-0.525
 grandparents-0.497
olding-0.495
keeper-0.491
Token mom
Token ID1995
Feature activation+4.19e-01

Pos logit contributions
 clerk+1.636
 promise+1.622
olding+1.607
olded+1.571
 bills+1.550
Neg logit contributions
 nib-3.979
 hopping-3.776
 buzzing-3.717
 snowball-3.448
 jumping-3.381
Token and
Token ID290
Feature activation-1.24e+00

Pos logit contributions
 nib+0.661
 buzzing+0.618
 hopping+0.606
ping+0.557
 Buzz+0.536
Neg logit contributions
 promise-0.809
 clerk-0.797
olding-0.764
 grandparents-0.762
keeper-0.756
Token the
Token ID262
Feature activation-1.18e+00

Pos logit contributions
 nib+0.555
 buzzing+0.519
 hopping+0.512
ping+0.461
 snowball+0.428
Neg logit contributions
 clerk-0.864
 promise-0.858
 grandparents-0.831
 bills-0.817
keeper-0.815
Token man
Token ID582
Feature activation-6.91e-01

Pos logit contributions
 promise+1.377
 clerk+1.310
olding+1.276
 grandparents+1.244
olded+1.242
Neg logit contributions
 nib-2.813
 hopping-2.685
 buzzing-2.682
ping-2.472
 snowball-2.398
Token played
Token ID2826
Feature activation+1.16e+00

Pos logit contributions
 promise+1.249
 clerk+1.233
olding+1.181
 grandparents+1.173
belt+1.165
Neg logit contributions
 nib-1.178
 hopping-1.096
 buzzing-1.087
ping-0.982
 Buzz-0.941
Token together
Token ID1978
Feature activation+1.19e+00

Pos logit contributions
 nib+2.360
 buzzing+2.205
 hopping+2.191
ping+2.154
 tweeting+2.020
Neg logit contributions
 promise-1.700
 clerk-1.612
 manager-1.493
 wisely-1.488
olding-1.462
Token for
Token ID329
Feature activation+9.01e-01

Pos logit contributions
 nib+2.195
 buzzing+2.082
 hopping+2.047
ping+2.004
 snowball+1.932
Neg logit contributions
 promise-1.905
 clerk-1.778
 wisely-1.751
olding-1.705
 manager-1.672
Token a
Token ID257
Feature activation+2.80e-02

Pos logit contributions
 nib+1.507
 hopping+1.414
 buzzing+1.399
ping+1.326
 tweeting+1.251
Neg logit contributions
 promise-1.571
 clerk-1.510
 bills-1.490
 grandparents-1.484
 wisely-1.447
Token while
Token ID981
Feature activation-1.91e-01

Pos logit contributions
 nib+0.053
 hopping+0.050
 buzzing+0.050
ping+0.047
 Buzz+0.044
Neg logit contributions
 promise-0.045
 clerk-0.043
olding-0.041
 grandparents-0.041
keeper-0.040
Token and
Token ID290
Feature activation-9.12e-03

Pos logit contributions
 promise+0.349
 clerk+0.346
olding+0.331
 grandparents+0.330
 bills+0.326
Neg logit contributions
 nib-0.319
 buzzing-0.293
 hopping-0.292
ping-0.267
 Buzz-0.253
Token had
Token ID550
Feature activation+1.68e+00

Pos logit contributions
 promise+0.014
 clerk+0.014
 grandparents+0.013
 bills+0.013
olding+0.013
Neg logit contributions
 nib-0.018
 buzzing-0.017
 hopping-0.017
ping-0.015
 snowball-0.015
Token a
Token ID257
Feature activation-6.28e-01

Pos logit contributions
 nib+3.213
 buzzing+3.065
 hopping+3.025
ping+2.963
bee+2.818
Neg logit contributions
 clerk-2.579
 promise-2.538
 cheaper-2.465
 bills-2.456
 wisely-2.450
Token lot
Token ID1256
Feature activation-3.28e-01

Pos logit contributions
 promise+0.945
 clerk+0.928
olding+0.881
 grandparents+0.873
 bills+0.866
Neg logit contributions
 nib-1.258
 hopping-1.185
 buzzing-1.178
ping-1.083
 Buzz-1.045
Token into
Token ID656
Feature activation+1.01e+00

Pos logit contributions
 nib+1.967
 buzzing+1.789
 hopping+1.775
ping+1.649
 Buzz+1.632
Neg logit contributions
 clerk-1.557
 promise-1.515
belt-1.504
 wisely-1.494
keeper-1.466
Token a
Token ID257
Feature activation+6.74e-01

Pos logit contributions
 nib+1.482
 hopping+1.385
 buzzing+1.344
ping+1.334
 tweeting+1.175
Neg logit contributions
 clerk-2.019
 promise-1.983
 grandparents-1.908
 bills-1.891
 grandmother-1.868
Token wet
Token ID9583
Feature activation+2.37e+00

Pos logit contributions
 nib+1.164
 hopping+1.091
 buzzing+1.060
ping+1.058
 Buzz+0.980
Neg logit contributions
 promise-1.207
 clerk-1.174
keeper-1.105
 manager-1.081
 grandparents-1.079
Token p
Token ID279
Feature activation-4.88e-01

Pos logit contributions
 nib+4.260
 hopping+3.893
 buzzing+3.880
 tweeting+3.582
ping+3.574
Neg logit contributions
 promise-4.261
 clerk-4.250
belt-3.957
keeper-3.942
 policy-3.855
Tokenuddle
Token ID24500
Feature activation+8.92e-01

Pos logit contributions
 promise+0.576
 clerk+0.561
olding+0.527
 grandparents+0.515
belt+0.514
Neg logit contributions
 nib-1.143
 hopping-1.086
 buzzing-1.078
ping-1.006
 Buzz-0.981
Token.
Token ID13
Feature activation+2.79e-02

Pos logit contributions
 nib+1.658
 hopping+1.532
 buzzing+1.497
ping+1.376
 Buzz+1.357
Neg logit contributions
 clerk-1.479
 promise-1.453
olding-1.395
keeper-1.392
 manager-1.370
Token Emma
Token ID18966
Feature activation+1.61e+00

Pos logit contributions
 nib+0.047
 buzzing+0.044
 hopping+0.044
ping+0.041
 snowball+0.038
Neg logit contributions
 clerk-0.050
 promise-0.049
olding-0.048
 grandparents-0.048
keeper-0.047
Token cried
Token ID16896
Feature activation+1.86e-01

Pos logit contributions
 nib+3.074
 hopping+2.942
 buzzing+2.932
ping+2.876
 ponds+2.747
Neg logit contributions
 promise-2.548
 clerk-2.387
olded-2.333
keeper-2.333
 grandparents-2.249
Token and
Token ID290
Feature activation-4.30e-01

Pos logit contributions
 nib+0.315
 buzzing+0.288
 hopping+0.288
ping+0.263
 snowball+0.256
Neg logit contributions
 promise-0.334
 clerk-0.330
olding-0.317
 grandparents-0.309
olded-0.309
Token tried
Token ID3088
Feature activation+1.76e-01

Pos logit contributions
 promise+0.753
 clerk+0.739
olding+0.703
 grandparents+0.701
belt+0.695
Neg logit contributions
 nib-0.759
 hopping-0.708
 buzzing-0.705
ping-0.645
 Buzz-0.614
Token to
Token ID284
Feature activation+1.52e+00

Pos logit contributions
 nib+0.375
 buzzing+0.352
 hopping+0.349
ping+0.335
 Buzz+0.316
Neg logit contributions
 promise-0.239
 clerk-0.234
olding-0.222
 grandparents-0.217
 wisely-0.217
Token.
Token ID13
Feature activation+1.15e+00

Pos logit contributions
 nib+0.129
 hopping+0.119
 buzzing+0.118
ping+0.109
 snowball+0.106
Neg logit contributions
 promise-0.130
 clerk-0.126
olding-0.122
belt-0.120
keeper-0.119
Token The
Token ID383
Feature activation-3.63e-01

Pos logit contributions
 nib+2.042
 buzzing+2.032
 hopping+1.963
ping+1.950
 tweeting+1.733
Neg logit contributions
 promise-1.949
 clerk-1.901
olding-1.822
 grandparents-1.822
 cousin-1.816
Token fountain
Token ID29420
Feature activation-2.64e-01

Pos logit contributions
 promise+0.808
 clerk+0.793
olding+0.764
 grandparents+0.758
 bills+0.756
Neg logit contributions
 nib-0.473
 hopping-0.430
 buzzing-0.422
ping-0.374
 Buzz-0.351
Token has
Token ID468
Feature activation+1.05e+00

Pos logit contributions
 promise+0.443
 clerk+0.429
olding+0.411
 grandparents+0.406
keeper+0.404
Neg logit contributions
 nib-0.490
 hopping-0.459
 buzzing-0.453
ping-0.419
 Buzz-0.401
Token water
Token ID1660
Feature activation+1.32e+00

Pos logit contributions
 nib+1.780
 hopping+1.643
 buzzing+1.565
ping+1.482
 Buzz+1.435
Neg logit contributions
 promise-1.864
 clerk-1.860
 bills-1.846
 grandparents-1.774
keeper-1.758
Token that
Token ID326
Feature activation+9.48e-02

Pos logit contributions
 nib+2.358
 hopping+2.130
 buzzing+2.070
ping+1.953
 Buzz+1.935
Neg logit contributions
 promise-2.306
 clerk-2.253
 bills-2.172
keeper-2.162
olding-2.140
Token goes
Token ID2925
Feature activation+3.47e-01

Pos logit contributions
 nib+0.168
 hopping+0.158
 buzzing+0.155
ping+0.147
 snowball+0.138
Neg logit contributions
 promise-0.168
 clerk-0.159
 grandparents-0.153
 bills-0.153
olding-0.152
Token up
Token ID510
Feature activation+8.94e-01

Pos logit contributions
 nib+0.607
 hopping+0.549
 buzzing+0.544
ping+0.502
 Buzz+0.487
Neg logit contributions
 promise-0.610
 clerk-0.603
olding-0.578
belt-0.573
keeper-0.572
Token and
Token ID290
Feature activation+1.29e+00

Pos logit contributions
 nib+1.613
 hopping+1.477
 buzzing+1.452
ping+1.403
 gra+1.320
Neg logit contributions
 promise-1.515
 clerk-1.439
 wisely-1.424
olding-1.404
keeper-1.389
Token down
Token ID866
Feature activation+7.52e-01

Pos logit contributions
 nib+2.165
 hopping+1.998
 buzzing+1.955
ping+1.920
bee+1.801
Neg logit contributions
 promise-2.311
 clerk-2.176
 bills-2.162
 wisely-2.134
keeper-2.118
Token.
Token ID13
Feature activation+1.01e+00

Pos logit contributions
 nib+1.385
 hopping+1.281
 buzzing+1.256
ping+1.195
 Buzz+1.133
Neg logit contributions
 promise-1.244
 clerk-1.199
 wisely-1.159
olding-1.157
 bills-1.149
Token.
Token ID13
Feature activation+4.30e-01

Pos logit contributions
 nib+1.168
 hopping+1.092
 buzzing+1.072
ping+0.981
 tweeting+0.945
Neg logit contributions
 promise-1.139
 clerk-1.094
keeper-1.045
 grandparents-1.040
olding-1.033
Token But
Token ID887
Feature activation-1.10e-01

Pos logit contributions
 nib+0.777
 buzzing+0.719
 hopping+0.712
ping+0.664
 snowball+0.638
Neg logit contributions
 promise-0.740
 clerk-0.733
olding-0.716
 grandparents-0.700
olded-0.695
Token even
Token ID772
Feature activation+3.39e-02

Pos logit contributions
 promise+0.175
 clerk+0.167
olding+0.160
 grandparents+0.158
 bills+0.158
Neg logit contributions
 nib-0.215
 hopping-0.201
 buzzing-0.201
ping-0.188
 snowball-0.180
Token fair
Token ID3148
Feature activation+3.48e-01

Pos logit contributions
 nib+0.068
 hopping+0.064
 buzzing+0.063
ping+0.060
 snowball+0.057
Neg logit contributions
 promise-0.051
 clerk-0.049
olding-0.047
 bills-0.047
 grandparents-0.046
Tokenies
Token ID444
Feature activation+4.72e-01

Pos logit contributions
 nib+0.672
 hopping+0.623
 buzzing+0.617
ping+0.570
 snowball+0.554
Neg logit contributions
 promise-0.570
 clerk-0.541
 bills-0.522
olding-0.521
 grandparents-0.521
Token need
Token ID761
Feature activation-5.19e-01

Pos logit contributions
 nib+0.946
 hopping+0.890
 buzzing+0.868
ping+0.816
 snowball+0.803
Neg logit contributions
 promise-0.738
 clerk-0.674
olding-0.661
 grandparents-0.647
 wisely-0.646
Token to
Token ID284
Feature activation+9.50e-01

Pos logit contributions
 promise+0.904
 clerk+0.889
olding+0.850
 grandparents+0.844
 bills+0.839
Neg logit contributions
 nib-0.914
 hopping-0.853
 buzzing-0.848
ping-0.772
 Buzz-0.737
Token eat
Token ID4483
Feature activation+8.88e-01

Pos logit contributions
 nib+1.738
 hopping+1.661
 buzzing+1.625
ping+1.540
 snowball+1.504
Neg logit contributions
 promise-1.662
 clerk-1.467
 grandparents-1.449
 bills-1.433
olding-1.427
Token.
Token ID13
Feature activation+4.39e-01

Pos logit contributions
 nib+1.607
 hopping+1.501
 buzzing+1.479
ping+1.388
 prow+1.318
Neg logit contributions
 promise-1.542
 clerk-1.456
 bills-1.423
 wisely-1.418
 grandparents-1.407
Token Come
Token ID7911
Feature activation-1.88e-01

Pos logit contributions
 nib+0.773
 buzzing+0.724
 hopping+0.716
ping+0.665
 snowball+0.616
Neg logit contributions
 promise-0.772
 clerk-0.764
olding-0.740
 grandparents-0.731
olded-0.729
Token on
Token ID319
Feature activation+4.78e-02

Pos logit contributions
 promise+0.319
 clerk+0.311
 grandparents+0.295
olding+0.294
belt+0.291
Neg logit contributions
 nib-0.342
 buzzing-0.321
 hopping-0.319
ping-0.296
 Buzz-0.279
Tokenmy
Token ID1820
Feature activation+5.61e-01

Pos logit contributions
 nib+0.595
 buzzing+0.563
 hopping+0.556
ping+0.511
 Buzz+0.501
Neg logit contributions
 promise-0.431
 clerk-0.423
 grandparents-0.404
olding-0.396
olded-0.393
Token didn
Token ID1422
Feature activation+6.13e-01

Pos logit contributions
 nib+1.182
 buzzing+1.132
 hopping+1.121
ping+1.082
 snowball+1.018
Neg logit contributions
 clerk-0.768
 promise-0.754
keeper-0.719
 grandparents-0.710
olded-0.704
Token't
Token ID470
Feature activation-1.33e+00

Pos logit contributions
 nib+1.220
 buzzing+1.134
 hopping+1.128
ping+1.092
 bees+1.015
Neg logit contributions
 clerk-0.944
 promise-0.940
olding-0.876
 manager-0.864
 grandparents-0.856
Token see
Token ID766
Feature activation+1.10e+00

Pos logit contributions
 clerk+2.189
 promise+2.120
 grandparents+2.085
belt+2.075
keeper+2.052
Neg logit contributions
 nib-2.563
 hopping-2.386
 buzzing-2.371
ping-2.130
 Buzz-2.068
Token Molly
Token ID30236
Feature activation+9.97e-01

Pos logit contributions
 nib+1.920
 buzzing+1.743
 hopping+1.743
ping+1.590
 snowball+1.583
Neg logit contributions
 promise-1.921
 clerk-1.875
 grandparents-1.845
olding-1.812
 bills-1.794
Token at
Token ID379
Feature activation-9.41e-01

Pos logit contributions
 nib+1.688
 buzzing+1.549
 hopping+1.535
 snowball+1.467
ping+1.458
Neg logit contributions
 promise-1.827
 clerk-1.747
olding-1.697
keeper-1.651
 bills-1.633
Token first
Token ID717
Feature activation-8.92e-01

Pos logit contributions
 promise+1.980
 clerk+1.951
olding+1.897
belt+1.874
olded+1.874
Neg logit contributions
 nib-1.348
 hopping-1.241
 buzzing-1.225
ping-1.072
 Buzz-1.027
Token,
Token ID11
Feature activation+1.06e+00

Pos logit contributions
 promise+1.713
 clerk+1.693
olding+1.623
 grandparents+1.616
 bills+1.607
Neg logit contributions
 nib-1.424
 hopping-1.328
 buzzing-1.307
ping-1.167
 Buzz-1.123
Token because
Token ID780
Feature activation-5.29e-02

Pos logit contributions
 nib+2.037
 buzzing+1.831
 hopping+1.829
ping+1.756
 snowball+1.716
Neg logit contributions
 clerk-1.684
 promise-1.637
olding-1.548
keeper-1.535
 grandparents-1.529
Token she
Token ID673
Feature activation+1.50e-01

Pos logit contributions
 promise+0.090
 clerk+0.089
 grandparents+0.086
olding+0.084
 bills+0.083
Neg logit contributions
 nib-0.095
 buzzing-0.088
 hopping-0.087
ping-0.082
 snowball-0.077
Token was
Token ID373
Feature activation+7.37e-01

Pos logit contributions
 nib+0.280
 hopping+0.265
 buzzing+0.263
ping+0.248
 Buzz+0.232
Neg logit contributions
 promise-0.244
 clerk-0.238
 grandparents-0.225
olding-0.224
keeper-0.223
Token He
Token ID679
Feature activation+2.88e-01

Pos logit contributions
 nib+0.209
 buzzing+0.196
 hopping+0.194
ping+0.179
 tweeting+0.163
Neg logit contributions
 clerk-0.274
 promise-0.267
olding-0.259
keeper-0.259
 grandparents-0.256
Token hoped
Token ID10719
Feature activation-1.77e+00

Pos logit contributions
 nib+0.506
 hopping+0.483
 buzzing+0.474
ping+0.446
 Buzz+0.422
Neg logit contributions
 promise-0.490
 clerk-0.480
keeper-0.462
olded-0.459
 grandparents-0.458
Token someone
Token ID2130
Feature activation-1.12e+00

Pos logit contributions
 promise+3.958
 clerk+3.855
olding+3.766
olded+3.695
belt+3.693
Neg logit contributions
 nib-2.381
 hopping-2.196
 buzzing-2.135
 Buzz-1.855
ping-1.804
Token would
Token ID561
Feature activation-9.84e-01

Pos logit contributions
 promise+1.545
 clerk+1.538
olding+1.449
 grandparents+1.438
belt+1.418
Neg logit contributions
 nib-2.401
 hopping-2.267
 buzzing-2.259
ping-2.070
 Buzz-2.016
Token hear
Token ID3285
Feature activation-4.60e-01

Pos logit contributions
 clerk+1.735
 promise+1.718
olding+1.649
belt+1.640
 grandparents+1.638
Neg logit contributions
 nib-1.734
 hopping-1.610
 buzzing-1.608
ping-1.438
 Buzz-1.387
Token it
Token ID340
Feature activation-1.86e-01

Pos logit contributions
 promise+1.070
 clerk+1.059
olding+1.022
 grandparents+1.020
 bills+1.015
Neg logit contributions
 nib-0.544
 hopping-0.491
 buzzing-0.485
ping-0.415
 Buzz-0.386
Token.
Token ID13
Feature activation-1.31e+00

Pos logit contributions
 promise+0.334
 clerk+0.322
olding+0.310
 grandparents+0.308
belt+0.305
Neg logit contributions
 nib-0.321
 hopping-0.299
 buzzing-0.296
ping-0.270
 Buzz-0.260
Token
Token ID198
Feature activation-4.78e-01

Pos logit contributions
 promise+2.828
 clerk+2.736
olding+2.653
 grandparents+2.647
 bills+2.643
Neg logit contributions
 nib-1.863
 hopping-1.699
 buzzing-1.646
 Buzz-1.462
ping-1.423
Token
Token ID198
Feature activation-9.08e-01

Pos logit contributions
 promise+0.829
 clerk+0.817
olding+0.781
 grandparents+0.775
 bills+0.766
Neg logit contributions
 nib-0.852
 hopping-0.797
 buzzing-0.791
ping-0.724
 Buzz-0.689
TokenHe
Token ID1544
Feature activation-9.68e-01

Pos logit contributions
 promise+1.597
 clerk+1.564
 grandparents+1.498
olding+1.497
 bills+1.487
Neg logit contributions
 nib-1.610
 hopping-1.505
 buzzing-1.497
ping-1.341
 Buzz-1.300
Token also
Token ID635
Feature activation-6.84e-01

Pos logit contributions
 promise+1.904
 clerk+1.875
olding+1.816
 grandparents+1.798
 bills+1.784
Neg logit contributions
 nib-1.527
 hopping-1.397
 buzzing-1.389
ping-1.224
 Buzz-1.176
Token her
Token ID607
Feature activation-1.81e-01

Pos logit contributions
 nib+0.379
 hopping+0.352
 buzzing+0.351
ping+0.320
 snowball+0.308
Neg logit contributions
 clerk-0.442
 promise-0.440
 grandparents-0.419
 bills-0.418
olding-0.413
Token friends
Token ID2460
Feature activation-2.54e-01

Pos logit contributions
 promise+0.225
 clerk+0.217
 grandparents+0.205
olding+0.202
 bills+0.199
Neg logit contributions
 nib-0.412
 hopping-0.389
 buzzing-0.389
ping-0.365
 Buzz-0.351
Token stayed
Token ID9658
Feature activation-5.68e-01

Pos logit contributions
 promise+0.429
 clerk+0.420
olding+0.399
 grandparents+0.398
olded+0.394
Neg logit contributions
 nib-0.460
 hopping-0.431
 buzzing-0.429
ping-0.393
 snowball-0.377
Token in
Token ID287
Feature activation-1.11e+00

Pos logit contributions
 promise+0.989
 clerk+0.971
olding+0.928
 grandparents+0.922
belt+0.914
Neg logit contributions
 nib-1.011
 hopping-0.944
 buzzing-0.937
ping-0.854
 Buzz-0.819
Token Slip
Token ID49988
Feature activation-7.11e-02

Pos logit contributions
 promise+2.511
 clerk+2.500
olding+2.436
olded+2.421
belt+2.408
Neg logit contributions
 nib-1.400
 hopping-1.270
 buzzing-1.264
ping-1.044
 Buzz-1.005
Token for
Token ID329
Feature activation-6.25e-01

Pos logit contributions
 promise+0.132
 clerk+0.129
 grandparents+0.124
olding+0.123
 bills+0.122
Neg logit contributions
 nib-0.117
 hopping-0.108
 buzzing-0.108
ping-0.098
 Buzz-0.094
Token many
Token ID867
Feature activation-9.07e-01

Pos logit contributions
 promise+1.356
 clerk+1.339
olding+1.292
 grandparents+1.284
 bills+1.276
Neg logit contributions
 nib-0.839
 hopping-0.767
 buzzing-0.759
ping-0.666
 Buzz-0.627
Token days
Token ID1528
Feature activation-2.10e-01

Pos logit contributions
 promise+1.457
 clerk+1.451
olding+1.378
 grandparents+1.362
belt+1.360
Neg logit contributions
 nib-1.741
 hopping-1.642
 buzzing-1.628
ping-1.480
 snowball-1.419
Token.
Token ID13
Feature activation-1.84e-01

Pos logit contributions
 promise+0.396
 clerk+0.390
olding+0.376
 grandparents+0.374
belt+0.370
Neg logit contributions
 nib-0.338
 hopping-0.311
 buzzing-0.310
ping-0.281
 Buzz-0.269
Token They
Token ID1119
Feature activation-6.59e-01

Pos logit contributions
 promise+0.349
 clerk+0.349
olding+0.334
 grandparents+0.332
 bills+0.330
Neg logit contributions
 nib-0.293
 buzzing-0.272
 hopping-0.272
ping-0.245
 snowball-0.231
Token played
Token ID2826
Feature activation+1.05e+00

Pos logit contributions
 promise+1.150
 clerk+1.133
olding+1.084
 grandparents+1.074
 bills+1.067
Neg logit contributions
 nib-1.164
 hopping-1.087
 buzzing-1.079
ping-0.980
 Buzz-0.940
Token the
Token ID262
Feature activation-1.96e+00

Pos logit contributions
 nib+0.885
 hopping+0.778
 buzzing+0.777
ping+0.750
 snowball+0.709
Neg logit contributions
 promise-1.150
 clerk-1.138
 grandparents-1.123
 bills-1.095
olding-1.082
Token party
Token ID2151
Feature activation-1.80e+00

Pos logit contributions
olded+3.703
 clerk+3.668
 promise+3.627
keeper+3.592
olding+3.587
Neg logit contributions
 nib-3.163
 buzzing-3.114
 hopping-3.012
 jumping-2.618
ping-2.573
Token that
Token ID326
Feature activation-2.19e+00

Pos logit contributions
 promise+3.845
 clerk+3.683
olding+3.660
olded+3.591
 bills+3.589
Neg logit contributions
 nib-2.542
 hopping-2.411
 buzzing-2.397
ping-2.065
 jumping-1.962
Token she
Token ID673
Feature activation-3.85e-01

Pos logit contributions
 promise+4.400
olding+4.288
 clerk+4.211
olded+4.123
belt+4.091
Neg logit contributions
 nib-3.560
 hopping-3.453
 buzzing-3.365
 Buzz-2.902
 jumping-2.885
Token wanted
Token ID2227
Feature activation-1.20e+00

Pos logit contributions
 promise+0.647
 clerk+0.635
olding+0.603
 grandparents+0.602
belt+0.596
Neg logit contributions
 nib-0.704
 hopping-0.663
 buzzing-0.659
ping-0.605
 Buzz-0.578
Token to
Token ID284
Feature activation-2.42e-01

Pos logit contributions
 promise+1.984
 clerk+1.919
olding+1.855
belt+1.825
 grandparents+1.799
Neg logit contributions
 nib-2.298
 hopping-2.182
 buzzing-2.147
ping-1.938
 Buzz-1.890
Token wear
Token ID5806
Feature activation-1.17e+00

Pos logit contributions
 promise+0.408
 clerk+0.392
 grandparents+0.375
olding+0.372
belt+0.368
Neg logit contributions
 nib-0.447
 hopping-0.422
 buzzing-0.420
ping-0.389
 Buzz-0.368
Token her
Token ID607
Feature activation-2.89e+00

Pos logit contributions
 promise+2.633
 clerk+2.597
olding+2.535
keeper+2.485
 bills+2.474
Neg logit contributions
 nib-1.508
 hopping-1.394
 buzzing-1.371
ping-1.176
 Buzz-1.102
Token new
Token ID649
Feature activation-2.04e+00

Pos logit contributions
 clerk+5.756
olding+5.630
keeper+5.552
olded+5.519
 policy+5.407
Neg logit contributions
 nib-4.751
 hopping-4.612
 buzzing-4.546
 jumping-4.020
 snowball-3.936
Token shoes
Token ID10012
Feature activation-2.55e+00

Pos logit contributions
 clerk+3.973
olding+3.899
olded+3.897
 promise+3.832
 bills+3.816
Neg logit contributions
 nib-3.226
 hopping-3.061
 buzzing-3.017
 snowball-2.617
 jumping-2.593
Token.
Token ID13
Feature activation-6.12e-01

Pos logit contributions
 promise+5.645
 clerk+5.604
 bills+5.526
 grandparents+5.499
olding+5.454
Neg logit contributions
 nib-3.384
 hopping-3.169
 buzzing-2.972
ping-2.599
 jumping-2.488
Token to
Token ID284
Feature activation-8.95e-01

Pos logit contributions
 promise+2.909
olding+2.758
 clerk+2.750
 policy+2.654
belt+2.643
Neg logit contributions
 nib-3.167
 hopping-3.027
 buzzing-2.920
ping-2.602
 Buzz-2.599
Token the
Token ID262
Feature activation-2.25e+00

Pos logit contributions
 promise+2.054
 clerk+2.033
olding+1.984
belt+1.962
olded+1.951
Neg logit contributions
 nib-1.098
 hopping-0.984
 buzzing-0.971
ping-0.824
 Buzz-0.787
Token store
Token ID3650
Feature activation-2.98e+00

Pos logit contributions
 promise+3.777
olding+3.646
olded+3.639
 wisely+3.570
 policy+3.550
Neg logit contributions
 nib-4.160
 hopping-3.899
 buzzing-3.845
 snowball-3.365
 gra-3.331
Token,
Token ID11
Feature activation-1.01e+00

Pos logit contributions
 promise+7.220
olding+6.925
belt+6.834
Of+6.803
olded+6.787
Neg logit contributions
 hopping-3.323
 nib-3.321
 buzzing-3.111
 jumping-2.653
ping-2.472
Token Lily
Token ID20037
Feature activation-4.78e-02

Pos logit contributions
 promise+1.565
 clerk+1.511
olding+1.472
 bills+1.421
 grandparents+1.421
Neg logit contributions
 nib-2.008
 hopping-1.899
 buzzing-1.875
ping-1.699
 Buzz-1.688
Token's
Token ID338
Feature activation-2.79e+00

Pos logit contributions
 promise+0.081
 clerk+0.080
 grandparents+0.076
keeper+0.075
olding+0.075
Neg logit contributions
 nib-0.086
 buzzing-0.081
 hopping-0.081
ping-0.075
 Buzz-0.072
Token mom
Token ID1995
Feature activation-1.32e+00

Pos logit contributions
olding+2.929
olded+2.849
 wisely+2.833
 promise+2.827
ounded+2.766
Neg logit contributions
 nib-6.965
 hopping-6.787
 buzzing-6.483
 snowball-6.085
 jumping-6.066
Tokenmy
Token ID1820
Feature activation-7.91e-01

Pos logit contributions
 promise+2.413
 clerk+2.355
olding+2.296
 policy+2.273
 bills+2.271
Neg logit contributions
 nib-2.256
 hopping-2.104
 buzzing-2.059
ping-1.877
 snowball-1.771
Token told
Token ID1297
Feature activation+4.58e-01

Pos logit contributions
 promise+1.177
 clerk+1.150
olding+1.100
 grandparents+1.083
belt+1.080
Neg logit contributions
 nib-1.604
 hopping-1.509
 buzzing-1.497
ping-1.374
 Buzz-1.329
Token her
Token ID607
Feature activation-6.58e-01

Pos logit contributions
 nib+0.732
 hopping+0.672
 buzzing+0.665
ping+0.597
 snowball+0.578
Neg logit contributions
 clerk-0.896
 promise-0.891
 grandparents-0.850
 bills-0.844
olding-0.838
Token to
Token ID284
Feature activation-4.05e-01

Pos logit contributions
 promise+1.050
 clerk+1.031
olding+0.982
 grandparents+0.973
belt+0.965
Neg logit contributions
 nib-1.265
 hopping-1.189
 buzzing-1.181
ping-1.081
 Buzz-1.042
TokenNo
Token ID2949
Feature activation-1.15e-01

Pos logit contributions
 promise+1.951
 clerk+1.900
 grandparents+1.840
olding+1.837
belt+1.825
Neg logit contributions
 nib-2.198
 hopping-2.041
 buzzing-2.033
ping-1.819
 snowball-1.809
Token,
Token ID11
Feature activation-1.37e+00

Pos logit contributions
 promise+0.215
 clerk+0.207
 grandparents+0.198
olding+0.198
 bills+0.196
Neg logit contributions
 nib-0.195
 buzzing-0.182
 hopping-0.182
ping-0.167
 Buzz-0.157
Token mine
Token ID6164
Feature activation-1.59e-01

Pos logit contributions
 clerk+2.358
 promise+2.352
olding+2.281
 grandparents+2.280
belt+2.253
Neg logit contributions
 nib-2.425
 hopping-2.266
 buzzing-2.239
ping-2.005
 snowball-1.990
Token!
Token ID0
Feature activation-5.77e-01

Pos logit contributions
 promise+0.327
 clerk+0.318
olding+0.306
keeper+0.303
belt+0.303
Neg logit contributions
 nib-0.236
 hopping-0.219
 buzzing-0.216
ping-0.193
 Buzz-0.183
Token Mine
Token ID11517
Feature activation+1.14e-01

Pos logit contributions
 promise+0.919
 clerk+0.904
olding+0.861
 grandparents+0.852
 bills+0.846
Neg logit contributions
 nib-1.107
 hopping-1.041
 buzzing-1.035
ping-0.949
 Buzz-0.912
Token!"
Token ID2474
Feature activation-2.64e+00

Pos logit contributions
 nib+0.183
 hopping+0.171
 buzzing+0.170
ping+0.153
 Buzz+0.148
Neg logit contributions
 promise-0.223
 clerk-0.214
olding-0.205
 bills-0.203
olded-0.201
Token Sam
Token ID3409
Feature activation-3.19e-01

Pos logit contributions
 promise+5.884
olding+5.596
belt+5.541
 clerk+5.494
 wisely+5.492
Neg logit contributions
 nib-3.668
 hopping-3.386
 buzzing-3.316
 snowball-3.059
 Buzz-2.949
Token said
Token ID531
Feature activation-1.12e+00

Pos logit contributions
 promise+0.403
 clerk+0.396
olding+0.369
olded+0.362
 grandparents+0.361
Neg logit contributions
 nib-0.717
 hopping-0.685
 buzzing-0.678
ping-0.632
 Buzz-0.613
Token.
Token ID13
Feature activation-1.20e+00

Pos logit contributions
 promise+2.190
 clerk+2.140
 grandparents+2.089
olding+2.056
 bills+2.050
Neg logit contributions
 nib-1.750
 hopping-1.626
 buzzing-1.602
ping-1.445
 Buzz-1.380
Token
Token ID198
Feature activation-1.30e+00

Pos logit contributions
 promise+2.317
 clerk+2.197
olding+2.149
 grandparents+2.142
 bills+2.132
Neg logit contributions
 nib-1.940
 hopping-1.823
 buzzing-1.790
ping-1.583
 Buzz-1.569
Token
Token ID198
Feature activation-1.60e+00

Pos logit contributions
 promise+2.501
 clerk+2.451
belt+2.352
 bills+2.347
olding+2.339
Neg logit contributions
 nib-2.080
 hopping-1.951
 buzzing-1.922
ping-1.707
 snowball-1.682
Token couldn
Token ID3521
Feature activation+1.20e+00

Pos logit contributions
 nib+1.193
 hopping+1.121
 buzzing+1.118
ping+1.070
 snowball+1.047
Neg logit contributions
 promise-0.967
 clerk-0.927
 grandparents-0.895
keeper-0.891
olded-0.883
Token't
Token ID470
Feature activation-9.00e-01

Pos logit contributions
 nib+2.623
 buzzing+2.396
 hopping+2.372
ping+2.304
 snowball+2.156
Neg logit contributions
 promise-1.673
 clerk-1.673
 manager-1.551
keeper-1.533
olding-1.528
Token wait
Token ID4043
Feature activation+2.58e-01

Pos logit contributions
 promise+1.150
 clerk+1.148
 grandparents+1.074
olding+1.072
belt+1.064
Neg logit contributions
 nib-2.014
 hopping-1.892
 buzzing-1.887
ping-1.742
 Buzz-1.704
Token to
Token ID284
Feature activation+2.69e-01

Pos logit contributions
 nib+0.551
 buzzing+0.520
 hopping+0.517
ping+0.496
 Buzz+0.472
Neg logit contributions
 promise-0.343
 clerk-0.324
olding-0.317
 wisely-0.308
 grandparents-0.307
Token tell
Token ID1560
Feature activation-1.46e+00

Pos logit contributions
 nib+0.477
 buzzing+0.451
 hopping+0.450
ping+0.423
 snowball+0.401
Neg logit contributions
 promise-0.475
 clerk-0.438
 grandparents-0.426
olding-0.418
 bills-0.417
Token her
Token ID607
Feature activation-2.53e+00

Pos logit contributions
 promise+3.211
 clerk+3.184
olding+3.107
belt+3.107
olded+3.044
Neg logit contributions
 nib-1.935
 hopping-1.795
 buzzing-1.790
ping-1.529
 Buzz-1.492
Token friends
Token ID2460
Feature activation+4.09e-01

Pos logit contributions
belt+2.623
 clerk+2.483
olding+2.460
olded+2.418
ounded+2.328
Neg logit contributions
 nib-6.548
 hopping-6.371
 buzzing-6.331
 jumping-5.820
 bees-5.728
Token about
Token ID546
Feature activation+1.20e+00

Pos logit contributions
 nib+0.798
 buzzing+0.741
 hopping+0.739
ping+0.696
 snowball+0.682
Neg logit contributions
 promise-0.646
 clerk-0.621
olding-0.596
 grandparents-0.591
 bills-0.581
Token the
Token ID262
Feature activation-3.64e-02

Pos logit contributions
 nib+1.925
 hopping+1.703
 buzzing+1.689
ping+1.676
 snowball+1.551
Neg logit contributions
 promise-2.250
 grandparents-2.212
 clerk-2.211
 bills-2.147
 grandmother-2.126
Token butterflies
Token ID44571
Feature activation+8.03e-01

Pos logit contributions
 promise+0.076
 clerk+0.073
 grandparents+0.071
olding+0.070
 bills+0.069
Neg logit contributions
 nib-0.053
 hopping-0.048
 buzzing-0.047
ping-0.043
 snowball-0.041
Token that
Token ID326
Feature activation-1.13e+00

Pos logit contributions
 nib+1.425
 hopping+1.298
 buzzing+1.295
ping+1.239
 snowball+1.229
Neg logit contributions
 promise-1.384
 clerk-1.309
olding-1.256
keeper-1.256
 grandparents-1.252
Token went
Token ID1816
Feature activation+2.60e-01

Pos logit contributions
 nib+1.697
 hopping+1.608
 buzzing+1.604
ping+1.552
 snowball+1.475
Neg logit contributions
 promise-1.373
 clerk-1.334
 grandparents-1.272
keeper-1.265
olded-1.241
Token on
Token ID319
Feature activation+6.78e-01

Pos logit contributions
 nib+0.540
 buzzing+0.500
 hopping+0.493
ping+0.476
 Buzz+0.461
Neg logit contributions
 promise-0.362
 clerk-0.357
olding-0.341
 wisely-0.332
 grandparents-0.332
Token a
Token ID257
Feature activation+1.03e+00

Pos logit contributions
 nib+0.954
 buzzing+0.853
 hopping+0.848
ping+0.839
 snowball+0.751
Neg logit contributions
 promise-1.378
 clerk-1.375
 grandparents-1.362
 grandmother-1.317
 bills-1.306
Token walk
Token ID2513
Feature activation+1.20e+00

Pos logit contributions
 nib+1.823
ping+1.722
 hopping+1.674
 buzzing+1.666
 Buzz+1.563
Neg logit contributions
 promise-1.783
 clerk-1.650
 grandparents-1.644
 grandmother-1.560
 cousin-1.557
Token with
Token ID351
Feature activation-2.49e-01

Pos logit contributions
 nib+2.300
 buzzing+2.067
 hopping+2.046
 snowball+1.992
ping+1.970
Neg logit contributions
 promise-1.861
 clerk-1.856
olding-1.805
keeper-1.792
belt-1.762
Token her
Token ID607
Feature activation-2.95e+00

Pos logit contributions
 promise+0.520
 clerk+0.513
 grandparents+0.494
olding+0.490
 bills+0.487
Neg logit contributions
 nib-0.356
 hopping-0.322
 buzzing-0.322
ping-0.290
 snowball-0.274
Token mum
Token ID25682
Feature activation+8.88e-01

Pos logit contributions
olded+4.087
olding+4.082
 promise+3.883
 clerk+3.871
 dollars+3.772
Neg logit contributions
 hopping-6.781
 nib-6.733
 buzzing-6.660
 jumping-6.003
 bees-5.857
Token and
Token ID290
Feature activation-2.15e-01

Pos logit contributions
 nib+1.743
 buzzing+1.641
 hopping+1.596
ping+1.538
 snowball+1.532
Neg logit contributions
 clerk-1.336
 promise-1.331
 grandparents-1.284
keeper-1.249
olding-1.235
Token dad
Token ID9955
Feature activation-1.98e-01

Pos logit contributions
 promise+0.309
 clerk+0.305
 grandparents+0.289
olding+0.284
keeper+0.282
Neg logit contributions
 nib-0.442
 hopping-0.416
 buzzing-0.415
ping-0.389
 snowball-0.377
Token.
Token ID13
Feature activation+1.04e+00

Pos logit contributions
 promise+0.350
 clerk+0.344
olding+0.329
 grandparents+0.327
 bills+0.323
Neg logit contributions
 nib-0.346
 buzzing-0.322
 hopping-0.321
ping-0.295
 snowball-0.286
Token They
Token ID1119
Feature activation-2.98e-01

Pos logit contributions
 nib+1.724
 buzzing+1.641
 hopping+1.569
ping+1.495
 snowball+1.480
Neg logit contributions
 clerk-1.981
 promise-1.873
olding-1.850
 grandparents-1.809
keeper-1.807
Token to
Token ID284
Feature activation-6.25e-01

Pos logit contributions
 clerk+3.640
 promise+3.614
belt+3.537
olding+3.492
aila+3.446
Neg logit contributions
 nib-2.969
 hopping-2.836
 buzzing-2.712
ping-2.455
 Buzz-2.373
Token each
Token ID1123
Feature activation-1.45e+00

Pos logit contributions
 promise+1.240
 clerk+1.223
olding+1.176
 grandparents+1.168
 bills+1.160
Neg logit contributions
 nib-0.963
 hopping-0.890
 buzzing-0.883
ping-0.789
 Buzz-0.750
Token other
Token ID584
Feature activation-5.38e-01

Pos logit contributions
 clerk+2.061
 promise+2.056
belt+2.014
olding+1.977
olded+1.971
Neg logit contributions
 nib-3.053
 hopping-2.919
 buzzing-2.891
ping-2.661
 snowball-2.575
Token.
Token ID13
Feature activation-1.43e+00

Pos logit contributions
 promise+0.991
 clerk+0.972
olding+0.933
 grandparents+0.928
 bills+0.920
Neg logit contributions
 nib-0.901
 hopping-0.838
 buzzing-0.831
ping-0.751
 Buzz-0.718
Token<|endoftext|>
Token ID50256
Feature activation+9.17e-02

Pos logit contributions
 promise+2.792
 clerk+2.749
belt+2.659
olding+2.647
 bills+2.643
Neg logit contributions
 nib-2.290
 hopping-2.132
 buzzing-2.084
 Buzz-1.857
ping-1.841
TokenOnce
Token ID7454
Feature activation-2.50e+00

Pos logit contributions
 nib+0.173
 buzzing+0.163
 hopping+0.162
ping+0.150
 tweeting+0.141
Neg logit contributions
 promise-0.144
 clerk-0.142
 grandparents-0.136
olding-0.134
keeper-0.133
Token upon
Token ID2402
Feature activation+2.07e+00

Pos logit contributions
 clerk+4.685
 bills+4.543
 promise+4.481
belt+4.431
olded+4.376
Neg logit contributions
 nib-4.313
 hopping-4.193
 buzzing-3.964
 Buzz-3.665
 jumping-3.568
Token a
Token ID257
Feature activation+4.11e-01

Pos logit contributions
 nib+3.400
 buzzing+3.245
 hopping+3.197
ping+3.071
 tweeting+2.918
Neg logit contributions
 promise-3.826
 cousin-3.571
 grandma-3.523
 clerk-3.508
 grandparents-3.500
Token time
Token ID640
Feature activation-6.42e-02

Pos logit contributions
 nib+0.774
 hopping+0.735
 buzzing+0.732
ping+0.689
 tweeting+0.649
Neg logit contributions
 promise-0.678
 clerk-0.616
olding-0.615
 grandparents-0.602
 cousin-0.586
Token,
Token ID11
Feature activation-3.74e-02

Pos logit contributions
 promise+0.115
 clerk+0.112
olding+0.109
 grandparents+0.107
 policy+0.106
Neg logit contributions
 nib-0.110
 buzzing-0.102
 hopping-0.101
ping-0.094
 snowball-0.087
Token there
Token ID612
Feature activation+8.48e-01

Pos logit contributions
 promise+0.058
 clerk+0.056
olding+0.053
 grandparents+0.052
belt+0.052
Neg logit contributions
 nib-0.073
 buzzing-0.069
 hopping-0.069
ping-0.064
 tweeting-0.060