TOP ACTIVATIONS
MAX = 0.101

He knew that every time he blinked , he could always
he was happy . Now he was ready to explore and
it was gone . Now he could enjoy the beach without
to her and made sure he was safe . She said
before making decisions . If they had been a bit more
to the wagon driver as they left the park . <|endoftext|>
. John was glad that he had kept the peace just
he could . Each time he went , something even more
his new toy . Now he is extra careful not to
. They were calm as they opened the door and left
and Joe both laughed as they watched the pot in the
. The crane had helped them and they were grateful .
man had replied and helped him out of his trouble .
had ever been . Now he had a great pair of
George was glad that he was able to help ,
apped together . She wished he was still here to play
She knew that even though she was little , she could
feeling light and happy . He was a kid on a
had paid off . Now she had a new skill to
boy never saw the cushion he had just built ever again

MIN ACTIVATIONS
MIN = -0.293

giving him too many ch ocol ates ." The king learned
reward . It was ch ocol ates and sweets !
her mom my watched in amaz ement as the machine flew
't know the rh in oc eros was wild . Can
oll ip ops , ch ocol ates , g ummies ,
as he opened the ch ocol ates and started eating them
them and never threw a tant rum again . <|endoftext|> Once
John chose some carrots and cuc umbers to take home .
, cup cakes and ch ocol ates .
he would take the av oc ados and scatter them around
. Tim my watched with amaz ement as he saw stars
quiet compared to the noisy se ag ull s outside .
like lettuce , tomatoes and cuc umbers . He smiled as
not eat too many ch ocol ates and his friend learned
was a box of ch ocol ates . Lily loved ch
give to the rh in oc eros . Lily walked to
air . He watched with amaz ement as the z ig
go ! Everyone watched in amaz ement as it took off
ates . Lily loved ch ocol ates . She thanked Ben
egg ! He watched in amaz ement as the egg began

INTERVAL 0.003 - 0.101
CONTAINS 52.594%

content as he and his mom left the park . <|endoftext|>
and he and his friend were friends again . <|endoftext|> One
a big wheel and happy faces everywhere . He noticed a
She asked her mom if she could use the kitchen scale
happy and proud of their new baby . They

INTERVAL -0.096 - 0.003
CONTAINS 45.551%

Mom hears the noise . She comes to the living room
feathers . She wanted to make a nest for her and
and he couldn 't move anymore . Red was sad and
day , she saw a carpet in the hallway and decided
girl is surprised and happy she stayed brave .

INTERVAL -0.195 - -0.096
CONTAINS 1.835%

" Yes , mum ." M um then
to go home . They smiled and hugged each other ,
mess on the paper . Billy was very
, the poor rh in oc eros was never hungry again
pony again . <|endoftext|> Once upon a time , there was

INTERVAL -0.293 - -0.195
CONTAINS 0.019%

her mom my watched in amaz ement as the machine flew
John chose some carrots and cuc umbers to take home .
't know the rh in oc eros was wild . Can
reward . It was ch ocol ates and sweets !
, cup cakes and ch ocol ates .
Token He
Token ID679
Feature activation+8.69e-02

Pos logit contributions
 distance+0.101
 narrower+0.098
Uh+0.095
Hmm+0.089
 vengeance+0.089
Neg logit contributions
 disl-0.111
az-0.096
ach-0.089
uggets-0.088
ubby-0.085
Token knew
Token ID2993
Feature activation+4.74e-02

Pos logit contributions
 narrower+0.105
 distance+0.104
Oh+0.104
Uh+0.103
Hmm+0.099
Neg logit contributions
 disl-0.148
az-0.131
rou-0.126
 Majesty-0.124
ci-0.124
Token that
Token ID326
Feature activation+4.58e-02

Pos logit contributions
 distance+0.076
Oh+0.072
 narrower+0.072
Hmm+0.071
Uh+0.070
Neg logit contributions
 disl-0.069
ach-0.058
az-0.057
uggets-0.054
ubby-0.053
Token every
Token ID790
Feature activation+3.90e-02

Pos logit contributions
 distance+0.065
Uh+0.063
Hmm+0.063
Oh+0.062
 narrower+0.062
Neg logit contributions
 disl-0.073
ach-0.062
az-0.060
uggets-0.058
 Majesty-0.057
Token time
Token ID640
Feature activation+4.44e-02

Pos logit contributions
 narrower+0.050
 distance+0.050
Hmm+0.048
Oh+0.048
Uh+0.047
Neg logit contributions
 disl-0.065
az-0.057
ach-0.056
ubby-0.054
uggets-0.053
Token he
Token ID339
Feature activation+9.88e-02

Pos logit contributions
 distance+0.053
Uh+0.049
Oh+0.049
Hmm+0.049
 narrower+0.047
Neg logit contributions
 disl-0.083
ach-0.073
az-0.071
uggets-0.067
 Majesty-0.067
Token blinked
Token ID40360
Feature activation+1.65e-02

Pos logit contributions
 distance+0.101
Oh+0.097
 narrower+0.095
Uh+0.094
Hmm+0.093
Neg logit contributions
 disl-0.198
az-0.174
ach-0.172
ubby-0.164
 Majesty-0.162
Token,
Token ID11
Feature activation+6.63e-02

Pos logit contributions
 distance+0.025
 narrower+0.023
Oh+0.023
Hmm+0.022
To+0.022
Neg logit contributions
 disl-0.026
ach-0.021
az-0.021
uggets-0.021
ubby-0.020
Token he
Token ID339
Feature activation+9.18e-02

Pos logit contributions
 distance+0.084
Oh+0.079
 narrower+0.079
Uh+0.078
Hmm+0.078
Neg logit contributions
 disl-0.118
ach-0.102
az-0.101
uggets-0.096
ubby-0.095
Token could
Token ID714
Feature activation+4.82e-02

Pos logit contributions
Uh+0.092
 distance+0.091
Oh+0.091
 narrower+0.089
Hmm+0.087
Neg logit contributions
 disl-0.179
az-0.156
rou-0.151
 Majesty-0.151
ach-0.151
Token always
Token ID1464
Feature activation+5.50e-02

Pos logit contributions
 distance+0.056
To+0.048
 eventually+0.048
 narrower+0.048
Maybe+0.048
Neg logit contributions
 disl-0.093
ach-0.083
uggets-0.083
az-0.080
ubby-0.079
Token he
Token ID339
Feature activation+7.92e-02

Pos logit contributions
 distance+0.064
Oh+0.059
 narrower+0.058
Hmm+0.057
Uh+0.057
Neg logit contributions
 disl-0.105
ach-0.092
az-0.091
uggets-0.088
ubby-0.087
Token was
Token ID373
Feature activation+5.41e-02

Pos logit contributions
 distance+0.082
 narrower+0.078
Oh+0.077
Uh+0.076
Hmm+0.075
Neg logit contributions
 disl-0.157
az-0.137
ach-0.136
 Majesty-0.131
uggets-0.129
Token happy
Token ID3772
Feature activation+2.79e-02

Pos logit contributions
 distance+0.064
 eventually+0.054
Oh+0.053
 narrower+0.053
Less+0.052
Neg logit contributions
 disl-0.107
ach-0.098
az-0.091
uggets-0.090
ubby-0.087
Token.
Token ID13
Feature activation+6.99e-02

Pos logit contributions
 distance+0.041
Oh+0.039
 narrower+0.039
Uh+0.038
Hmm+0.038
Neg logit contributions
 disl-0.044
ach-0.037
az-0.037
uggets-0.035
ubby-0.034
Token Now
Token ID2735
Feature activation+6.33e-02

Pos logit contributions
 distance+0.086
 narrower+0.083
Uh+0.081
Hmm+0.076
Aw+0.075
Neg logit contributions
 disl-0.127
az-0.114
ach-0.109
uggets-0.106
 Majesty-0.104
Token he
Token ID339
Feature activation+9.84e-02

Pos logit contributions
 distance+0.082
Oh+0.077
 narrower+0.077
Uh+0.076
Hmm+0.075
Neg logit contributions
 disl-0.111
ach-0.096
az-0.095
uggets-0.091
ubby-0.089
Token was
Token ID373
Feature activation+6.69e-02

Pos logit contributions
 distance+0.092
 narrower+0.084
Oh+0.084
Uh+0.082
Hmm+0.081
Neg logit contributions
 disl-0.207
ach-0.184
az-0.183
uggets-0.176
ubby-0.174
Token ready
Token ID3492
Feature activation+5.23e-02

Pos logit contributions
 distance+0.077
 eventually+0.071
 drifted+0.065
 narrower+0.064
See+0.062
Neg logit contributions
 disl-0.137
ach-0.125
uggets-0.111
az-0.110
amb-0.107
Token to
Token ID284
Feature activation+6.19e-02

Pos logit contributions
 distance+0.083
Oh+0.078
 narrower+0.078
Hmm+0.077
Uh+0.076
Neg logit contributions
 disl-0.076
ach-0.064
az-0.063
uggets-0.060
ubby-0.059
Token explore
Token ID7301
Feature activation+3.98e-02

Pos logit contributions
 distance+0.076
To+0.067
 narrower+0.065
Oh+0.064
Hmm+0.064
Neg logit contributions
 disl-0.113
ach-0.101
az-0.100
uggets-0.098
ubby-0.096
Token and
Token ID290
Feature activation+6.94e-02

Pos logit contributions
 distance+0.060
 narrower+0.055
Oh+0.055
Hmm+0.054
To+0.053
Neg logit contributions
 disl-0.063
ach-0.053
az-0.052
uggets-0.051
ubby-0.049
Token it
Token ID340
Feature activation+7.23e-02

Pos logit contributions
 distance+0.045
Uh+0.044
 narrower+0.043
Hmm+0.043
Oh+0.043
Neg logit contributions
 disl-0.035
ach-0.028
az-0.028
uggets-0.026
 Majesty-0.025
Token was
Token ID373
Feature activation+5.15e-02

Pos logit contributions
 distance+0.083
Hmm+0.082
 narrower+0.082
Uh+0.081
Oh+0.077
Neg logit contributions
 disl-0.129
az-0.115
ach-0.112
ci-0.104
ubby-0.101
Token gone
Token ID3750
Feature activation+3.54e-02

Pos logit contributions
 distance+0.068
 narrower+0.060
Oh+0.060
To+0.058
Where+0.057
Neg logit contributions
 disl-0.092
ach-0.080
az-0.079
uggets-0.076
ubby-0.076
Token.
Token ID13
Feature activation+6.64e-02

Pos logit contributions
 distance+0.051
Oh+0.049
 narrower+0.048
Uh+0.048
Hmm+0.048
Neg logit contributions
 disl-0.056
ach-0.048
az-0.048
uggets-0.045
ubby-0.044
Token Now
Token ID2735
Feature activation+6.32e-02

Pos logit contributions
 narrower+0.095
Uh+0.090
 distance+0.089
Hmm+0.084
 darker+0.084
Neg logit contributions
 disl-0.108
az-0.092
ach-0.087
uggets-0.084
ums-0.079
Token he
Token ID339
Feature activation+9.83e-02

Pos logit contributions
 distance+0.101
Uh+0.099
 narrower+0.099
Oh+0.098
Hmm+0.097
Neg logit contributions
 disl-0.089
ach-0.076
az-0.074
 Majesty-0.068
uggets-0.066
Token could
Token ID714
Feature activation+4.88e-02

Pos logit contributions
 distance+0.085
Oh+0.076
 narrower+0.076
Hmm+0.074
Uh+0.074
Neg logit contributions
 disl-0.214
ach-0.192
az-0.190
uggets-0.184
ubby-0.181
Token enjoy
Token ID2883
Feature activation+3.35e-02

Pos logit contributions
 distance+0.040
 eventually+0.040
 drifted+0.032
Maybe+0.031
 lead+0.031
Neg logit contributions
 disl-0.111
ach-0.100
uggets-0.100
ubby-0.099
az-0.097
Token the
Token ID262
Feature activation+5.63e-02

Pos logit contributions
 distance+0.040
Oh+0.035
Who+0.035
 narrower+0.035
See+0.035
Neg logit contributions
 disl-0.060
ubby-0.055
az-0.053
ach-0.053
uggets-0.051
Token beach
Token ID10481
Feature activation+5.63e-02

Pos logit contributions
 distance+0.081
Oh+0.073
 narrower+0.071
Uh+0.070
Hmm+0.070
Neg logit contributions
 disl-0.093
uggets-0.079
ach-0.079
az-0.079
ubby-0.075
Token without
Token ID1231
Feature activation+3.69e-02

Pos logit contributions
 distance+0.075
 narrower+0.060
 eventually+0.060
 million+0.058
To+0.057
Neg logit contributions
 disl-0.103
ubby-0.090
az-0.087
ach-0.086
uggets-0.085
Token to
Token ID284
Feature activation+5.15e-02

Pos logit contributions
 distance+0.064
Oh+0.062
Uh+0.062
 narrower+0.061
Hmm+0.061
Neg logit contributions
 disl-0.054
ach-0.046
az-0.045
uggets-0.042
ubby-0.041
Token her
Token ID607
Feature activation+5.87e-02

Pos logit contributions
 distance+0.074
Oh+0.069
 narrower+0.069
Hmm+0.068
Uh+0.067
Neg logit contributions
 disl-0.083
ach-0.071
az-0.070
uggets-0.068
ubby-0.066
Token and
Token ID290
Feature activation+7.07e-02

Pos logit contributions
 distance+0.083
 narrower+0.076
Oh+0.075
Hmm+0.074
To+0.073
Neg logit contributions
 disl-0.096
az-0.082
ach-0.082
uggets-0.079
ubby-0.077
Token made
Token ID925
Feature activation+3.02e-02

Pos logit contributions
 distance+0.066
 narrower+0.057
Oh+0.057
To+0.055
Hmm+0.054
Neg logit contributions
 disl-0.149
ach-0.137
az-0.134
ubby-0.133
uggets-0.132
Token sure
Token ID1654
Feature activation+6.36e-02

Pos logit contributions
Ah+0.030
 other+0.029
See+0.028
 better+0.027
 distance+0.026
Neg logit contributions
 disl-0.061
ubby-0.056
ach-0.051
uggets-0.051
az-0.051
Token he
Token ID339
Feature activation+9.81e-02

Pos logit contributions
 distance+0.083
Oh+0.077
 narrower+0.077
To+0.074
Hmm+0.074
Neg logit contributions
 disl-0.111
ach-0.098
az-0.095
uggets-0.093
ubby-0.093
Token was
Token ID373
Feature activation+7.14e-02

Pos logit contributions
 distance+0.096
Oh+0.088
Uh+0.088
 narrower+0.086
Hmm+0.086
Neg logit contributions
 disl-0.200
az-0.179
ach-0.179
ubby-0.170
uggets-0.168
Token safe
Token ID3338
Feature activation+4.39e-02

Pos logit contributions
 distance+0.085
 narrower+0.077
 eventually+0.072
Oh+0.071
Where+0.071
Neg logit contributions
 disl-0.137
ach-0.121
az-0.115
uggets-0.113
ubby-0.109
Token.
Token ID13
Feature activation+7.31e-02

Pos logit contributions
 distance+0.051
 narrower+0.044
Where+0.042
Oh+0.042
To+0.042
Neg logit contributions
 disl-0.087
ubby-0.074
az-0.072
uggets-0.072
 majesty-0.072
Token She
Token ID1375
Feature activation+8.77e-02

Pos logit contributions
 distance+0.101
 narrower+0.093
Uh+0.092
Hmm+0.088
Oh+0.087
Neg logit contributions
 disl-0.122
az-0.106
ach-0.101
 Majesty-0.101
uggets-0.100
Token said
Token ID531
Feature activation+4.88e-02

Pos logit contributions
 distance+0.089
Oh+0.083
 narrower+0.083
Uh+0.082
Hmm+0.082
Neg logit contributions
 disl-0.175
az-0.155
ach-0.154
uggets-0.146
ubby-0.146
Token before
Token ID878
Feature activation+4.61e-02

Pos logit contributions
 distance+0.084
Oh+0.079
 narrower+0.079
Hmm+0.078
Uh+0.077
Neg logit contributions
 disl-0.081
ach-0.069
az-0.068
uggets-0.065
ubby-0.063
Token making
Token ID1642
Feature activation+3.08e-02

Pos logit contributions
 distance+0.068
 narrower+0.065
Oh+0.065
To+0.063
Hmm+0.063
Neg logit contributions
 disl-0.069
az-0.060
ach-0.059
uggets-0.057
ubby-0.056
Token decisions
Token ID5370
Feature activation+4.26e-02

Pos logit contributions
 narrower+0.034
 distance+0.034
 other+0.032
 million+0.032
Oh+0.032
Neg logit contributions
 disl-0.055
ach-0.048
uggets-0.048
az-0.048
ubby-0.048
Token.
Token ID13
Feature activation+7.06e-02

Pos logit contributions
 distance+0.065
 narrower+0.061
Oh+0.061
Hmm+0.059
Uh+0.059
Neg logit contributions
 disl-0.064
ach-0.055
az-0.054
uggets-0.052
ubby-0.051
Token If
Token ID1002
Feature activation+6.31e-02

Pos logit contributions
 distance+0.095
 narrower+0.091
Uh+0.087
Hmm+0.084
Oh+0.083
Neg logit contributions
 disl-0.121
ach-0.104
az-0.104
uggets-0.101
ubby-0.095
Token they
Token ID484
Feature activation+9.80e-02

Pos logit contributions
 distance+0.080
Oh+0.075
 narrower+0.075
Hmm+0.073
To+0.072
Neg logit contributions
 disl-0.112
az-0.097
ach-0.096
uggets-0.092
ubby-0.092
Token had
Token ID550
Feature activation+5.84e-02

Pos logit contributions
 distance+0.115
Oh+0.109
 narrower+0.109
Uh+0.106
Hmm+0.106
Neg logit contributions
 disl-0.184
ach-0.159
az-0.158
ubby-0.149
uggets-0.147
Token been
Token ID587
Feature activation+5.89e-02

Pos logit contributions
 distance+0.089
Hmm+0.086
Oh+0.085
Uh+0.085
 narrower+0.084
Neg logit contributions
 disl-0.089
ach-0.073
az-0.072
uggets-0.069
ubby-0.067
Token a
Token ID257
Feature activation+7.70e-02

Pos logit contributions
 distance+0.076
 narrower+0.071
Oh+0.070
Uh+0.067
To+0.067
Neg logit contributions
 disl-0.104
ach-0.091
az-0.088
uggets-0.086
 Majesty-0.084
Token bit
Token ID1643
Feature activation+5.38e-02

Pos logit contributions
 distance+0.091
Hmm+0.090
 narrower+0.089
Oh+0.088
Warning+0.088
Neg logit contributions
 disl-0.136
ach-0.115
az-0.115
ubby-0.113
 Majesty-0.112
Token more
Token ID517
Feature activation+4.96e-02

Pos logit contributions
 distance+0.068
Oh+0.064
Hmm+0.064
 narrower+0.063
Uh+0.063
Neg logit contributions
 disl-0.094
ach-0.082
az-0.082
ubby-0.078
uggets-0.078
Token to
Token ID284
Feature activation+5.41e-02

Pos logit contributions
 distance+0.020
 narrower+0.018
Oh+0.018
Hmm+0.018
To+0.018
Neg logit contributions
 disl-0.020
ach-0.017
az-0.017
uggets-0.016
ubby-0.016
Token the
Token ID262
Feature activation+6.66e-02

Pos logit contributions
 distance+0.082
Oh+0.076
 narrower+0.075
Hmm+0.074
Who+0.072
Neg logit contributions
 disl-0.083
ach-0.073
az-0.070
uggets-0.069
ubby-0.068
Token wagon
Token ID31860
Feature activation+6.90e-02

Pos logit contributions
 distance+0.094
Hmm+0.091
Oh+0.091
Uh+0.090
 narrower+0.090
Neg logit contributions
 disl-0.107
ach-0.090
az-0.088
ubby-0.084
 Majesty-0.083
Token driver
Token ID4639
Feature activation+5.48e-02

Pos logit contributions
Uh+0.115
Oh+0.114
Hmm+0.113
Ah+0.111
Aw+0.109
Neg logit contributions
 disl-0.086
ach-0.078
az-0.076
ci-0.072
 Majesty-0.068
Token as
Token ID355
Feature activation+4.15e-02

Pos logit contributions
 distance+0.085
Oh+0.082
Uh+0.081
 narrower+0.081
Hmm+0.081
Neg logit contributions
 disl-0.080
ach-0.069
az-0.067
uggets-0.064
ubby-0.062
Token they
Token ID484
Feature activation+9.78e-02

Pos logit contributions
 distance+0.049
Oh+0.045
 narrower+0.045
Hmm+0.044
Uh+0.043
Neg logit contributions
 disl-0.078
ach-0.068
az-0.068
uggets-0.065
ubby-0.064
Token left
Token ID1364
Feature activation+3.24e-02

Pos logit contributions
Hmm+0.125
Oh+0.123
Uh+0.118
 narrower+0.117
Warning+0.117
Neg logit contributions
 disl-0.147
az-0.144
rou-0.137
ach-0.133
ubby-0.132
Token the
Token ID262
Feature activation+6.62e-02

Pos logit contributions
 distance+0.049
Uh+0.049
Hmm+0.049
Oh+0.048
 narrower+0.048
Neg logit contributions
 disl-0.047
ach-0.041
az-0.040
uggets-0.038
ubby-0.036
Token park
Token ID3952
Feature activation+6.42e-02

Pos logit contributions
 distance+0.095
Hmm+0.092
Oh+0.092
 narrower+0.092
Uh+0.091
Neg logit contributions
 disl-0.103
ach-0.088
az-0.086
ubby-0.081
 Majesty-0.080
Token.
Token ID13
Feature activation+7.17e-02

Pos logit contributions
 distance+0.094
Oh+0.090
Uh+0.089
 narrower+0.089
Hmm+0.089
Neg logit contributions
 disl-0.100
ach-0.086
az-0.085
uggets-0.080
ubby-0.078
Token<|endoftext|>
Token ID50256
Feature activation-4.48e-03

Pos logit contributions
 distance+0.105
 narrower+0.104
Uh+0.101
Hmm+0.097
 brake+0.096
Neg logit contributions
 disl-0.115
az-0.097
ach-0.092
uggets-0.090
iddy-0.086
Token.
Token ID13
Feature activation+7.04e-02

Pos logit contributions
 distance+0.090
Oh+0.085
 narrower+0.085
Uh+0.084
Hmm+0.084
Neg logit contributions
 disl-0.073
ach-0.061
az-0.059
uggets-0.056
ubby-0.055
Token John
Token ID1757
Feature activation+7.09e-02

Pos logit contributions
 distance+0.095
 narrower+0.090
Uh+0.089
Hmm+0.087
Oh+0.087
Neg logit contributions
 disl-0.119
az-0.103
ach-0.102
uggets-0.095
 Majesty-0.093
Token was
Token ID373
Feature activation+5.17e-02

Pos logit contributions
 distance+0.076
Oh+0.072
 narrower+0.072
Uh+0.071
Hmm+0.070
Neg logit contributions
 disl-0.136
ach-0.122
az-0.121
ubby-0.114
 Majesty-0.113
Token glad
Token ID9675
Feature activation+3.71e-02

Pos logit contributions
 distance+0.067
 narrower+0.058
Oh+0.058
 eventually+0.056
Ah+0.055
Neg logit contributions
 disl-0.097
ach-0.084
az-0.079
uggets-0.079
ubby-0.076
Token that
Token ID326
Feature activation+5.28e-02

Pos logit contributions
 distance+0.050
Oh+0.047
 narrower+0.046
Hmm+0.046
Uh+0.045
Neg logit contributions
 disl-0.063
ach-0.055
az-0.054
uggets-0.052
ubby-0.051
Token he
Token ID339
Feature activation+9.77e-02

Pos logit contributions
Uh+0.067
 distance+0.065
Hmm+0.063
Oh+0.063
Finally+0.062
Neg logit contributions
 disl-0.091
ach-0.079
az-0.079
uggets-0.072
 Majesty-0.071
Token had
Token ID550
Feature activation+5.58e-02

Pos logit contributions
 distance+0.081
 narrower+0.080
Hmm+0.077
Uh+0.076
Oh+0.076
Neg logit contributions
 disl-0.207
az-0.188
ach-0.186
ubby-0.176
ci-0.172
Token kept
Token ID4030
Feature activation+3.87e-02

Pos logit contributions
Uh+0.076
Hmm+0.074
 distance+0.074
Oh+0.073
 narrower+0.072
Neg logit contributions
 disl-0.092
az-0.078
ach-0.078
uggets-0.073
 Majesty-0.071
Token the
Token ID262
Feature activation+6.75e-02

Pos logit contributions
 distance+0.054
Oh+0.050
 narrower+0.050
Hmm+0.048
To+0.048
Neg logit contributions
 disl-0.064
ach-0.055
az-0.054
uggets-0.053
ubby-0.052
Token peace
Token ID4167
Feature activation+6.66e-02

Pos logit contributions
 distance+0.085
 narrower+0.082
Oh+0.082
Hmm+0.082
Uh+0.081
Neg logit contributions
 disl-0.118
ach-0.102
az-0.100
ubby-0.096
uggets-0.095
Token just
Token ID655
Feature activation+7.11e-02

Pos logit contributions
 distance+0.101
Oh+0.094
 narrower+0.093
Uh+0.092
Hmm+0.091
Neg logit contributions
 disl-0.103
ach-0.087
az-0.087
uggets-0.083
ubby-0.081
Token he
Token ID339
Feature activation+9.06e-02

Pos logit contributions
 distance+0.035
 narrower+0.032
Oh+0.032
Hmm+0.030
Uh+0.030
Neg logit contributions
 disl-0.082
ach-0.073
az-0.072
uggets-0.070
ubby-0.069
Token could
Token ID714
Feature activation+6.27e-02

Pos logit contributions
Oh+0.099
 distance+0.097
Uh+0.097
Hmm+0.094
Less+0.092
Neg logit contributions
 disl-0.164
az-0.152
ach-0.149
ci-0.140
ere-0.140
Token.
Token ID13
Feature activation+7.40e-02

Pos logit contributions
 distance+0.085
 narrower+0.077
Oh+0.075
To+0.075
Hmm+0.074
Neg logit contributions
 disl-0.109
ach-0.093
az-0.091
uggets-0.090
ubby-0.088
Token Each
Token ID5501
Feature activation+5.35e-02

Pos logit contributions
 distance+0.084
Uh+0.079
 narrower+0.079
Oh+0.076
Hmm+0.074
Neg logit contributions
 disl-0.142
az-0.124
ach-0.120
uggets-0.117
 Majesty-0.115
Token time
Token ID640
Feature activation+4.61e-02

Pos logit contributions
 distance+0.066
Oh+0.062
 narrower+0.062
Hmm+0.061
Uh+0.061
Neg logit contributions
 disl-0.096
ach-0.084
az-0.083
uggets-0.079
ubby-0.079
Token he
Token ID339
Feature activation+9.77e-02

Pos logit contributions
 distance+0.055
Uh+0.054
Oh+0.053
Hmm+0.052
Finally+0.049
Neg logit contributions
 disl-0.082
ach-0.073
az-0.072
 Majesty-0.067
uggets-0.066
Token went
Token ID1816
Feature activation+6.16e-02

Pos logit contributions
 distance+0.098
Oh+0.097
Uh+0.093
Hmm+0.093
 narrower+0.093
Neg logit contributions
 disl-0.193
az-0.173
ach-0.171
ubby-0.162
 Majesty-0.161
Token,
Token ID11
Feature activation+6.98e-02

Pos logit contributions
Uh+0.099
Oh+0.098
Hmm+0.098
 distance+0.098
Aw+0.095
Neg logit contributions
 disl-0.083
ach-0.072
az-0.071
uggets-0.065
ubby-0.064
Token something
Token ID1223
Feature activation+6.23e-02

Pos logit contributions
 distance+0.075
Oh+0.070
 narrower+0.069
Uh+0.069
Hmm+0.069
Neg logit contributions
 disl-0.136
ach-0.120
az-0.119
uggets-0.114
ubby-0.112
Token even
Token ID772
Feature activation+4.51e-02

Pos logit contributions
Uh+0.088
Hmm+0.087
 distance+0.085
Oh+0.085
Warning+0.085
Neg logit contributions
 disl-0.101
ach-0.081
az-0.080
rou-0.072
 Majesty-0.071
Token more
Token ID517
Feature activation+3.84e-02

Pos logit contributions
 distance+0.049
Oh+0.045
 narrower+0.045
Hmm+0.045
Uh+0.044
Neg logit contributions
 disl-0.087
ach-0.077
az-0.076
uggets-0.073
ubby-0.072
Token his
Token ID465
Feature activation+6.68e-02

Pos logit contributions
 distance+0.071
Oh+0.067
 narrower+0.067
Uh+0.066
Hmm+0.066
Neg logit contributions
 disl-0.063
ach-0.053
az-0.052
uggets-0.049
ubby-0.048
Token new
Token ID649
Feature activation+4.66e-02

Pos logit contributions
 distance+0.099
 narrower+0.093
Oh+0.093
Uh+0.093
Hmm+0.092
Neg logit contributions
 disl-0.106
ach-0.089
az-0.088
ubby-0.085
uggets-0.084
Token toy
Token ID13373
Feature activation+6.81e-02

Pos logit contributions
 distance+0.061
Oh+0.056
 narrower+0.056
 million+0.055
See+0.055
Neg logit contributions
 disl-0.078
ach-0.069
az-0.067
uggets-0.066
 Majesty-0.062
Token.
Token ID13
Feature activation+6.67e-02

Pos logit contributions
 distance+0.114
Uh+0.112
Oh+0.110
 narrower+0.109
Hmm+0.109
Neg logit contributions
 disl-0.091
ach-0.078
az-0.076
ubby-0.072
uggets-0.072
Token Now
Token ID2735
Feature activation+6.55e-02

Pos logit contributions
 distance+0.088
 narrower+0.084
Uh+0.083
Hmm+0.082
Oh+0.082
Neg logit contributions
 disl-0.115
ach-0.099
az-0.099
uggets-0.094
ubby-0.091
Token he
Token ID339
Feature activation+9.76e-02

Pos logit contributions
 distance+0.108
Oh+0.103
 narrower+0.103
Uh+0.102
Hmm+0.102
Neg logit contributions
 disl-0.091
ach-0.077
az-0.075
uggets-0.070
ubby-0.069
Token is
Token ID318
Feature activation+7.18e-02

Pos logit contributions
 distance+0.092
 narrower+0.084
Oh+0.084
Uh+0.082
Hmm+0.082
Neg logit contributions
 disl-0.205
ach-0.182
az-0.182
uggets-0.173
ubby-0.172
Token extra
Token ID3131
Feature activation+5.59e-02

Pos logit contributions
 distance+0.086
 eventually+0.077
 narrower+0.074
 million+0.073
 drifted+0.073
Neg logit contributions
 disl-0.141
ach-0.126
uggets-0.114
az-0.114
uc-0.107
Token careful
Token ID8161
Feature activation+1.57e-02

Pos logit contributions
 distance+0.062
See+0.062
 narrower+0.060
Ah+0.059
To+0.059
Neg logit contributions
 disl-0.100
ach-0.091
az-0.089
uggets-0.084
 Majesty-0.081
Token not
Token ID407
Feature activation+5.73e-02

Pos logit contributions
 distance+0.020
 eventually+0.018
He+0.018
To+0.017
 narrower+0.017
Neg logit contributions
 disl-0.029
ubby-0.024
uc-0.024
az-0.024
 majesty-0.023
Token to
Token ID284
Feature activation+3.57e-02

Pos logit contributions
 distance+0.094
Oh+0.090
Hmm+0.089
 narrower+0.089
Uh+0.089
Neg logit contributions
 disl-0.080
ach-0.067
az-0.066
uggets-0.062
ubby-0.060
Token.
Token ID13
Feature activation+6.59e-02

Pos logit contributions
Uh+0.089
 distance+0.089
Oh+0.089
Hmm+0.088
 narrower+0.087
Neg logit contributions
 disl-0.069
ach-0.062
az-0.059
uggets-0.053
ubby-0.052
Token They
Token ID1119
Feature activation+8.03e-02

Pos logit contributions
 distance+0.095
 narrower+0.093
Uh+0.089
Hmm+0.087
 brake+0.084
Neg logit contributions
 disl-0.104
az-0.090
uggets-0.088
ach-0.088
ubby-0.081
Token were
Token ID547
Feature activation+5.39e-02

Pos logit contributions
Oh+0.097
Uh+0.093
Hmm+0.092
 narrower+0.092
Warning+0.090
Neg logit contributions
 disl-0.135
ach-0.118
az-0.118
rou-0.115
ere-0.114
Token calm
Token ID9480
Feature activation+2.40e-02

Pos logit contributions
 distance+0.069
 narrower+0.060
Oh+0.058
 eventually+0.058
 drifted+0.056
Neg logit contributions
 disl-0.103
ach-0.087
az-0.084
uggets-0.083
ubby-0.081
Token as
Token ID355
Feature activation+4.45e-02

Pos logit contributions
 distance+0.033
 narrower+0.028
Oh+0.027
Maybe+0.027
 eventually+0.027
Neg logit contributions
 disl-0.045
uggets-0.037
az-0.035
ach-0.035
 majesty-0.035
Token they
Token ID484
Feature activation+9.72e-02

Pos logit contributions
 distance+0.042
 narrower+0.039
Oh+0.037
To+0.035
Maybe+0.034
Neg logit contributions
 disl-0.095
ach-0.084
az-0.083
uggets-0.081
ubby-0.080
Token opened
Token ID4721
Feature activation+3.63e-02

Pos logit contributions
Oh+0.122
Uh+0.115
Hmm+0.115
Aw+0.110
Warning+0.110
Neg logit contributions
 disl-0.151
ere-0.140
ach-0.138
uc-0.138
rou-0.136
Token the
Token ID262
Feature activation+8.08e-02

Pos logit contributions
 distance+0.051
 narrower+0.045
Oh+0.044
Park+0.043
 darker+0.042
Neg logit contributions
 disl-0.064
ach-0.053
uggets-0.053
az-0.052
ubby-0.051
Token door
Token ID3420
Feature activation+7.40e-02

Pos logit contributions
Uh+0.100
Hmm+0.100
Ah+0.100
He+0.100
Aw+0.099
Neg logit contributions
 disl-0.130
 Majesty-0.116
az-0.116
ubby-0.113
 majesty-0.113
Token and
Token ID290
Feature activation+6.88e-02

Pos logit contributions
Uh+0.145
Oh+0.142
Hmm+0.141
Ah+0.141
Aw+0.140
Neg logit contributions
 disl-0.068
ach-0.068
az-0.065
rou-0.054
ci-0.053
Token left
Token ID1364
Feature activation+2.96e-02

Pos logit contributions
 distance+0.088
Oh+0.084
 narrower+0.084
Uh+0.083
Hmm+0.083
Neg logit contributions
 disl-0.118
ach-0.102
az-0.101
uggets-0.097
 Majesty-0.095
Token and
Token ID290
Feature activation+5.70e-02

Pos logit contributions
 distance+0.057
Oh+0.053
 narrower+0.053
Hmm+0.052
Uh+0.051
Neg logit contributions
 disl-0.090
ach-0.080
az-0.079
uggets-0.074
ubby-0.074
Token Joe
Token ID5689
Feature activation+6.45e-02

Pos logit contributions
 distance+0.097
Oh+0.092
 narrower+0.091
Uh+0.090
Hmm+0.090
Neg logit contributions
 disl-0.078
ach-0.065
az-0.064
uggets-0.061
ubby-0.059
Token both
Token ID1111
Feature activation+5.65e-02

Pos logit contributions
Oh+0.071
 narrower+0.070
 distance+0.069
Hmm+0.069
Uh+0.069
Neg logit contributions
 disl-0.116
az-0.104
ach-0.103
ere-0.096
 Majesty-0.096
Token laughed
Token ID13818
Feature activation+1.38e-02

Pos logit contributions
 distance+0.064
Oh+0.063
Hmm+0.063
 narrower+0.063
Warning+0.060
Neg logit contributions
 disl-0.102
az-0.093
ach-0.087
ubby-0.085
 majesty-0.084
Token as
Token ID355
Feature activation+4.71e-02

Pos logit contributions
 distance+0.022
Oh+0.021
 narrower+0.021
Hmm+0.020
Uh+0.020
Neg logit contributions
 disl-0.020
ach-0.017
az-0.017
uggets-0.016
ubby-0.015
Token they
Token ID484
Feature activation+9.70e-02

Pos logit contributions
 distance+0.048
 narrower+0.044
Oh+0.044
Hmm+0.042
Uh+0.042
Neg logit contributions
 disl-0.096
ach-0.085
az-0.084
uggets-0.082
ubby-0.080
Token watched
Token ID7342
Feature activation+3.29e-02

Pos logit contributions
Oh+0.123
 distance+0.123
Hmm+0.118
 narrower+0.118
Uh+0.115
Neg logit contributions
 disl-0.156
az-0.139
ach-0.137
ere-0.136
uc-0.135
Token the
Token ID262
Feature activation+7.69e-02

Pos logit contributions
 distance+0.041
 narrower+0.039
Oh+0.037
 eventually+0.037
 drifted+0.037
Neg logit contributions
 disl-0.059
az-0.048
 majesty-0.048
ubby-0.048
 Majesty-0.048
Token pot
Token ID1787
Feature activation+7.84e-02

Pos logit contributions
 distance+0.116
Oh+0.110
 narrower+0.109
Hmm+0.108
Uh+0.108
Neg logit contributions
 disl-0.119
ach-0.101
az-0.100
uggets-0.095
ubby-0.093
Token in
Token ID287
Feature activation+6.33e-02

Pos logit contributions
Oh+0.107
Hmm+0.106
Uh+0.105
 distance+0.104
Aw+0.102
Neg logit contributions
 disl-0.125
ach-0.111
az-0.111
iggle-0.100
ubby-0.099
Token the
Token ID262
Feature activation+7.33e-02

Pos logit contributions
Hmm+0.114
To+0.111
Uh+0.111
Finally+0.110
 narrower+0.110
Neg logit contributions
 disl-0.069
ach-0.055
az-0.055
uggets-0.050
ubby-0.050
Token.
Token ID13
Feature activation+6.06e-02

Pos logit contributions
 distance+0.061
Oh+0.058
 narrower+0.058
Uh+0.058
Hmm+0.058
Neg logit contributions
 disl-0.060
ach-0.051
az-0.050
uggets-0.048
ubby-0.047
Token The
Token ID383
Feature activation+5.52e-02

Pos logit contributions
 distance+0.090
 narrower+0.087
Uh+0.085
Hmm+0.084
Oh+0.084
Neg logit contributions
 disl-0.096
az-0.080
ach-0.080
uggets-0.076
ubby-0.073
Token crane
Token ID41175
Feature activation+6.35e-02

Pos logit contributions
 distance+0.085
Oh+0.081
 narrower+0.081
Hmm+0.080
Uh+0.079
Neg logit contributions
 disl-0.083
ach-0.070
az-0.069
ubby-0.066
uggets-0.064
Token had
Token ID550
Feature activation+4.58e-02

Pos logit contributions
 distance+0.070
Oh+0.070
 narrower+0.069
Hmm+0.069
Uh+0.068
Neg logit contributions
 disl-0.118
az-0.104
ach-0.104
 Majesty-0.098
ubby-0.097
Token helped
Token ID4193
Feature activation+4.12e-02

Pos logit contributions
Hmm+0.064
Uh+0.064
 distance+0.063
Oh+0.063
 narrower+0.060
Neg logit contributions
 disl-0.073
az-0.061
ach-0.060
uggets-0.056
 Majesty-0.056
Token them
Token ID606
Feature activation+9.70e-02

Pos logit contributions
 distance+0.064
 narrower+0.059
Oh+0.058
To+0.057
Hmm+0.057
Neg logit contributions
 disl-0.063
az-0.053
ach-0.053
uggets-0.050
ubby-0.050
Token and
Token ID290
Feature activation+6.58e-02

Pos logit contributions
 distance+0.141
 narrower+0.130
Oh+0.129
Hmm+0.127
Uh+0.126
Neg logit contributions
 disl-0.156
ach-0.134
az-0.133
uggets-0.127
ubby-0.125
Token they
Token ID484
Feature activation+8.44e-02

Pos logit contributions
 distance+0.070
 narrower+0.063
Oh+0.061
To+0.059
Uh+0.058
Neg logit contributions
 disl-0.132
ach-0.120
az-0.117
ubby-0.114
uggets-0.113
Token were
Token ID547
Feature activation+6.53e-02

Pos logit contributions
Oh+0.100
Uh+0.098
 distance+0.097
Hmm+0.096
 narrower+0.095
Neg logit contributions
 disl-0.152
az-0.130
ach-0.128
 Majesty-0.124
uggets-0.123
Token grateful
Token ID14066
Feature activation+1.36e-02

Pos logit contributions
 distance+0.078
 narrower+0.070
 eventually+0.068
See+0.066
Less+0.065
Neg logit contributions
 disl-0.130
ach-0.113
az-0.105
uggets-0.105
 Majesty-0.099
Token.
Token ID13
Feature activation+6.62e-02

Pos logit contributions
 distance+0.021
Oh+0.020
 narrower+0.020
To+0.019
Hmm+0.019
Neg logit contributions
 disl-0.020
ach-0.017
az-0.017
uggets-0.016
 Majesty-0.016
Token man
Token ID582
Feature activation+7.92e-02

Pos logit contributions
 distance+0.046
Oh+0.042
 narrower+0.042
Hmm+0.041
Uh+0.041
Neg logit contributions
 disl-0.081
ach-0.072
az-0.071
uggets-0.069
ubby-0.068
Token had
Token ID550
Feature activation+5.14e-02

Pos logit contributions
 narrower+0.081
Uh+0.081
 distance+0.079
Hmm+0.079
Oh+0.078
Neg logit contributions
 disl-0.151
az-0.134
ach-0.134
ci-0.128
ubby-0.127
Token replied
Token ID8712
Feature activation+4.48e-02

Pos logit contributions
Uh+0.074
Hmm+0.073
Oh+0.071
 distance+0.070
 narrower+0.069
Neg logit contributions
 disl-0.079
ach-0.068
az-0.068
uggets-0.062
ere-0.062
Token and
Token ID290
Feature activation+6.74e-02

Pos logit contributions
 distance+0.069
To+0.066
 narrower+0.065
Oh+0.064
He+0.064
Neg logit contributions
 disl-0.067
ach-0.055
uggets-0.054
az-0.053
ubby-0.049
Token helped
Token ID4193
Feature activation+3.65e-02

Pos logit contributions
 distance+0.067
Oh+0.061
 narrower+0.060
Hmm+0.057
Aw+0.057
Neg logit contributions
 disl-0.139
ach-0.126
az-0.122
uggets-0.119
ubby-0.118
Token him
Token ID683
Feature activation+9.70e-02

Pos logit contributions
 distance+0.046
To+0.042
Oh+0.041
Where+0.040
 narrower+0.040
Neg logit contributions
 disl-0.067
az-0.057
 majesty-0.056
 Majesty-0.056
ubby-0.055
Token out
Token ID503
Feature activation+5.75e-02

Pos logit contributions
 distance+0.134
Oh+0.126
 narrower+0.125
Uh+0.124
Hmm+0.124
Neg logit contributions
 disl-0.161
ach-0.139
az-0.137
uggets-0.131
ubby-0.129
Token of
Token ID286
Feature activation+4.09e-02

Pos logit contributions
 distance+0.088
Uh+0.087
Oh+0.087
Hmm+0.087
 narrower+0.085
Neg logit contributions
 disl-0.085
ach-0.071
az-0.070
uggets-0.065
ubby-0.065
Token his
Token ID465
Feature activation+6.80e-02

Pos logit contributions
 distance+0.060
Oh+0.054
 narrower+0.053
Uh+0.051
Aw+0.051
Neg logit contributions
 disl-0.069
ach-0.060
az-0.058
uggets-0.056
 Majesty-0.055
Token trouble
Token ID5876
Feature activation+5.25e-02

Pos logit contributions
Uh+0.101
Oh+0.101
Hmm+0.101
 distance+0.100
 narrower+0.100
Neg logit contributions
 disl-0.103
ach-0.086
az-0.084
ubby-0.080
 Majesty-0.080
Token.
Token ID13
Feature activation+7.07e-02

Pos logit contributions
 distance+0.076
 narrower+0.070
Oh+0.069
Hmm+0.067
Uh+0.067
Neg logit contributions
 disl-0.085
ach-0.072
az-0.072
uggets-0.069
ubby-0.068
Token had
Token ID550
Feature activation+2.95e-02

Pos logit contributions
 distance+0.034
Oh+0.028
To+0.025
Uh+0.025
 narrower+0.025
Neg logit contributions
 disl-0.171
ach-0.154
az-0.153
uggets-0.151
 Majesty-0.149
Token ever
Token ID1683
Feature activation+3.78e-02

Pos logit contributions
 distance+0.023
To+0.021
Oh+0.019
Where+0.018
Park+0.018
Neg logit contributions
 disl-0.068
 Majesty-0.058
az-0.058
ubby-0.057
 majesty-0.057
Token been
Token ID587
Feature activation+4.70e-02

Pos logit contributions
 distance+0.049
 narrower+0.047
Oh+0.047
Hmm+0.047
Uh+0.047
Neg logit contributions
 disl-0.065
ach-0.056
az-0.055
uggets-0.053
ubby-0.051
Token.
Token ID13
Feature activation+6.95e-02

Pos logit contributions
 distance+0.064
 narrower+0.056
Oh+0.056
To+0.056
 million+0.056
Neg logit contributions
 disl-0.081
ach-0.064
 Majesty-0.064
 majesty-0.064
az-0.064
Token Now
Token ID2735
Feature activation+6.10e-02

Pos logit contributions
 distance+0.065
 narrower+0.059
Uh+0.058
Oh+0.057
Hmm+0.056
Neg logit contributions
 disl-0.149
az-0.132
ach-0.131
uggets-0.126
ubby-0.124
Token he
Token ID339
Feature activation+9.70e-02

Pos logit contributions
 distance+0.090
Oh+0.086
 narrower+0.085
Uh+0.085
Hmm+0.084
Neg logit contributions
 disl-0.096
ach-0.082
az-0.081
uggets-0.076
ubby-0.074
Token had
Token ID550
Feature activation+5.15e-02

Pos logit contributions
 distance+0.092
Oh+0.083
 narrower+0.083
Uh+0.081
Hmm+0.081
Neg logit contributions
 disl-0.204
ach-0.181
az-0.180
uggets-0.174
ubby-0.171
Token a
Token ID257
Feature activation+6.12e-02

Pos logit contributions
 distance+0.078
Oh+0.073
 narrower+0.072
Hmm+0.071
To+0.070
Neg logit contributions
 disl-0.079
ach-0.068
az-0.066
uggets-0.063
ubby-0.063
Token great
Token ID1049
Feature activation+3.97e-02

Pos logit contributions
 distance+0.084
 narrower+0.081
Oh+0.080
Hmm+0.079
Uh+0.079
Neg logit contributions
 disl-0.100
ach-0.084
az-0.084
uggets-0.081
 Majesty-0.080
Token pair
Token ID5166
Feature activation+4.78e-02

Pos logit contributions
 distance+0.045
Oh+0.038
To+0.038
Hmm+0.037
 narrower+0.036
Neg logit contributions
 disl-0.078
ach-0.068
uggets-0.066
az-0.065
ubby-0.065
Token of
Token ID286
Feature activation+1.87e-02

Pos logit contributions
 distance+0.056
Oh+0.049
To+0.049
Hmm+0.047
 narrower+0.046
Neg logit contributions
 disl-0.094
ach-0.079
az-0.078
 Majesty-0.078
uggets-0.076
Token
Token ID198
Feature activation+5.01e-02

Pos logit contributions
 distance+0.049
 narrower+0.047
Uh+0.046
Hmm+0.045
Oh+0.045
Neg logit contributions
 disl-0.076
ach-0.068
az-0.066
uggets-0.064
ubby-0.062
TokenGeorge
Token ID20191
Feature activation+7.53e-02

Pos logit contributions
 distance+0.058
 narrower+0.055
 darker+0.050
Hmm+0.049
Aw+0.049
Neg logit contributions
 disl-0.086
ach-0.081
ere-0.072
ate-0.071
ubby-0.071
Token was
Token ID373
Feature activation+5.09e-02

Pos logit contributions
 distance+0.084
 narrower+0.082
Oh+0.081
Uh+0.080
Hmm+0.080
Neg logit contributions
 disl-0.138
az-0.123
ach-0.123
 Majesty-0.117
ubby-0.116
Token glad
Token ID9675
Feature activation+4.49e-02

Pos logit contributions
 distance+0.061
Oh+0.052
 narrower+0.051
Less+0.050
 eventually+0.049
Neg logit contributions
 disl-0.102
ach-0.092
az-0.087
uggets-0.086
 Majesty-0.083
Token that
Token ID326
Feature activation+5.57e-02

Pos logit contributions
 distance+0.075
 narrower+0.072
Uh+0.072
Hmm+0.072
Oh+0.072
Neg logit contributions
 disl-0.061
ach-0.051
az-0.050
uggets-0.047
ubby-0.046
Token he
Token ID339
Feature activation+9.67e-02

Pos logit contributions
Uh+0.080
Hmm+0.080
 distance+0.079
Oh+0.076
Finally+0.076
Neg logit contributions
 disl-0.084
az-0.069
ach-0.068
 Majesty-0.065
uggets-0.064
Token was
Token ID373
Feature activation+6.36e-02

Pos logit contributions
 distance+0.080
 narrower+0.076
Hmm+0.072
Uh+0.072
Oh+0.071
Neg logit contributions
 disl-0.205
az-0.188
ach-0.185
ubby-0.177
ere-0.171
Token able
Token ID1498
Feature activation+3.99e-02

Pos logit contributions
 distance+0.083
 narrower+0.076
Oh+0.075
To+0.073
Hmm+0.071
Neg logit contributions
 disl-0.114
ach-0.100
az-0.097
uggets-0.094
 Majesty-0.091
Token to
Token ID284
Feature activation+6.47e-02

Pos logit contributions
 distance+0.058
 narrower+0.054
Oh+0.054
To+0.052
Less+0.051
Neg logit contributions
 disl-0.064
ach-0.056
az-0.054
uggets-0.052
ubby-0.052
Token help
Token ID1037
Feature activation+4.45e-02

Pos logit contributions
 distance+0.072
 narrower+0.064
To+0.063
Oh+0.062
Hmm+0.062
Neg logit contributions
 disl-0.126
ach-0.112
az-0.111
uggets-0.110
ubby-0.107
Token,
Token ID11
Feature activation+7.44e-02

Pos logit contributions
 distance+0.073
Uh+0.071
Oh+0.071
 narrower+0.071
Hmm+0.071
Neg logit contributions
 disl-0.061
ach-0.051
az-0.050
uggets-0.047
ubby-0.046
Tokenapped
Token ID6320
Feature activation-3.83e-03

Pos logit contributions
 disl+0.001
 Majesty+0.001
 majesty+0.001
 turkey+0.001
ach+0.001
Neg logit contributions
 resumed-0.001
 drifted-0.001
See-0.001
He-0.001
Park-0.001
Token together
Token ID1978
Feature activation+4.46e-02

Pos logit contributions
 disl+0.006
az+0.005
ach+0.005
 Majesty+0.005
uggets+0.005
Neg logit contributions
 distance-0.005
Oh-0.005
 narrower-0.005
To-0.005
Maybe-0.005
Token.
Token ID13
Feature activation+6.83e-02

Pos logit contributions
 distance+0.071
Uh+0.070
Oh+0.070
Hmm+0.069
 narrower+0.069
Neg logit contributions
 disl-0.063
ach-0.053
az-0.052
uggets-0.049
ubby-0.047
Token She
Token ID1375
Feature activation+8.23e-02

Pos logit contributions
 distance+0.097
 narrower+0.093
Uh+0.091
Aw+0.087
Ah+0.086
Neg logit contributions
 disl-0.116
az-0.099
uggets-0.091
ach-0.090
 Majesty-0.088
Token wished
Token ID16555
Feature activation+3.59e-02

Pos logit contributions
 distance+0.072
Oh+0.064
 narrower+0.064
Hmm+0.062
Uh+0.062
Neg logit contributions
 disl-0.178
ach-0.162
az-0.158
uggets-0.156
ubby-0.152
Token he
Token ID339
Feature activation+9.66e-02

Pos logit contributions
 distance+0.038
Oh+0.035
 narrower+0.034
Hmm+0.033
Uh+0.033
Neg logit contributions
 disl-0.072
ach-0.064
az-0.063
uggets-0.061
ubby-0.060
Token was
Token ID373
Feature activation+7.10e-02

Pos logit contributions
 distance+0.104
Uh+0.098
Hmm+0.090
Oh+0.089
 narrower+0.088
Neg logit contributions
 disl-0.182
az-0.164
ach-0.158
ci-0.158
rou-0.153
Token still
Token ID991
Feature activation+5.32e-02

Pos logit contributions
 distance+0.088
 narrower+0.083
Oh+0.079
To+0.078
Where+0.078
Neg logit contributions
 disl-0.129
ach-0.117
az-0.112
uggets-0.108
ubby-0.105
Token here
Token ID994
Feature activation+5.73e-02

Pos logit contributions
 distance+0.066
 eventually+0.058
 better+0.057
 narrower+0.057
Where+0.054
Neg logit contributions
 disl-0.096
ach-0.089
az-0.080
amb-0.078
uggets-0.077
Token to
Token ID284
Feature activation+5.83e-02

Pos logit contributions
 distance+0.094
Oh+0.089
 narrower+0.089
Uh+0.089
Hmm+0.088
Neg logit contributions
 disl-0.080
ach-0.067
az-0.066
uggets-0.063
ubby-0.060
Token play
Token ID711
Feature activation+2.45e-02

Pos logit contributions
 distance+0.065
To+0.055
Oh+0.053
Who+0.052
 narrower+0.052
Neg logit contributions
 disl-0.111
az-0.102
ach-0.101
ubby-0.101
uggets-0.100
Token She
Token ID1375
Feature activation+8.13e-02

Pos logit contributions
 distance+0.068
Oh+0.062
 narrower+0.062
Hmm+0.060
Uh+0.060
Neg logit contributions
 disl-0.141
ach-0.125
az-0.124
uggets-0.120
ubby-0.118
Token knew
Token ID2993
Feature activation+4.39e-02

Pos logit contributions
 distance+0.075
Oh+0.068
 narrower+0.068
Hmm+0.066
Uh+0.066
Neg logit contributions
 disl-0.172
ach-0.154
az-0.152
uggets-0.147
ubby-0.145
Token that
Token ID326
Feature activation+4.28e-02

Pos logit contributions
 distance+0.067
Oh+0.065
 narrower+0.065
To+0.063
Where+0.062
Neg logit contributions
 disl-0.067
ach-0.059
uggets-0.055
az-0.055
ubby-0.055
Token even
Token ID772
Feature activation+3.43e-02

Pos logit contributions
 distance+0.060
Uh+0.057
Hmm+0.057
Oh+0.056
 narrower+0.056
Neg logit contributions
 disl-0.070
ach-0.058
az-0.058
uggets-0.055
 Majesty-0.054
Token though
Token ID996
Feature activation+4.20e-02

Pos logit contributions
 distance+0.049
Oh+0.047
 narrower+0.047
To+0.045
Where+0.045
Neg logit contributions
 disl-0.056
ach-0.051
az-0.048
uggets-0.048
 Majesty-0.047
Token she
Token ID673
Feature activation+9.64e-02

Pos logit contributions
 distance+0.047
 narrower+0.044
Oh+0.043
Hmm+0.042
Uh+0.041
Neg logit contributions
 disl-0.080
ach-0.072
az-0.070
uggets-0.069
ubby-0.067
Token was
Token ID373
Feature activation+6.57e-02

Pos logit contributions
 distance+0.086
Oh+0.078
 narrower+0.077
Hmm+0.076
Uh+0.075
Neg logit contributions
 disl-0.207
ach-0.186
az-0.183
uggets-0.179
ubby-0.175
Token little
Token ID1310
Feature activation+4.45e-02

Pos logit contributions
 distance+0.076
 narrower+0.074
 darker+0.066
Oh+0.065
 million+0.064
Neg logit contributions
 disl-0.126
ach-0.117
uggets-0.109
az-0.108
 Majesty-0.105
Token,
Token ID11
Feature activation+6.79e-02

Pos logit contributions
 distance+0.058
 narrower+0.053
Oh+0.053
Uh+0.051
Aw+0.050
Neg logit contributions
 disl-0.079
ach-0.070
az-0.069
uggets-0.068
ubby-0.063
Token she
Token ID673
Feature activation+8.93e-02

Pos logit contributions
 distance+0.072
Oh+0.067
 narrower+0.066
Hmm+0.065
Uh+0.064
Neg logit contributions
 disl-0.135
ach-0.120
az-0.118
uggets-0.114
ubby-0.113
Token could
Token ID714
Feature activation+4.15e-02

Pos logit contributions
 distance+0.079
Oh+0.071
 narrower+0.071
Uh+0.071
Hmm+0.069
Neg logit contributions
 disl-0.195
az-0.171
ach-0.169
ubby-0.163
 Majesty-0.161
Token feeling
Token ID4203
Feature activation+3.87e-02

Pos logit contributions
 distance+0.056
 narrower+0.052
Oh+0.052
To+0.051
Hmm+0.050
Neg logit contributions
 disl-0.049
ach-0.040
az-0.039
uggets-0.038
ubby-0.037
Token light
Token ID1657
Feature activation+2.01e-02

Pos logit contributions
 distance+0.050
Oh+0.045
 narrower+0.044
To+0.043
See+0.042
Neg logit contributions
 disl-0.070
ach-0.063
uggets-0.060
az-0.060
 Majesty-0.058
Token and
Token ID290
Feature activation+7.23e-02

Pos logit contributions
 distance+0.026
 narrower+0.024
 drifted+0.023
 million+0.022
 darker+0.022
Neg logit contributions
 disl-0.038
uggets-0.031
ach-0.030
 Majesty-0.030
 majesty-0.029
Token happy
Token ID3772
Feature activation+3.72e-02

Pos logit contributions
 distance+0.092
 narrower+0.083
Oh+0.083
Hmm+0.079
Ah+0.079
Neg logit contributions
 disl-0.128
ach-0.117
az-0.114
uggets-0.113
ubby-0.112
Token.
Token ID13
Feature activation+7.90e-02

Pos logit contributions
 distance+0.051
Oh+0.047
 narrower+0.047
To+0.045
Hmm+0.045
Neg logit contributions
 disl-0.063
ach-0.054
az-0.053
uggets-0.052
ubby-0.052
Token He
Token ID679
Feature activation+9.63e-02

Pos logit contributions
 distance+0.101
Uh+0.097
 narrower+0.097
Aw+0.092
Oh+0.089
Neg logit contributions
 disl-0.138
az-0.121
 Majesty-0.113
ach-0.112
 majesty-0.111
Token was
Token ID373
Feature activation+5.72e-02

Pos logit contributions
 distance+0.099
 narrower+0.095
Uh+0.093
Hmm+0.091
Oh+0.091
Neg logit contributions
 disl-0.192
az-0.172
ach-0.164
rou-0.161
 Majesty-0.159
Token a
Token ID257
Feature activation+6.20e-02

Pos logit contributions
 distance+0.066
 eventually+0.060
Oh+0.057
 narrower+0.056
Less+0.055
Neg logit contributions
 disl-0.115
ach-0.105
uggets-0.097
uc-0.093
az-0.093
Token kid
Token ID5141
Feature activation+5.53e-02

Pos logit contributions
Warning+0.087
 narrower+0.086
Hmm+0.085
 distance+0.084
Uh+0.083
Neg logit contributions
 disl-0.094
az-0.079
ubby-0.077
ach-0.077
 Majesty-0.077
Token on
Token ID319
Feature activation+3.94e-02

Pos logit contributions
 distance+0.082
Oh+0.075
 narrower+0.074
Hmm+0.072
To+0.072
Neg logit contributions
 disl-0.090
ach-0.075
az-0.073
uggets-0.071
ubby-0.069
Token a
Token ID257
Feature activation+5.93e-02

Pos logit contributions
 distance+0.056
Oh+0.051
 narrower+0.050
Less+0.048
To+0.048
Neg logit contributions
 disl-0.066
ach-0.057
uggets-0.056
az-0.054
ubby-0.053
Token had
Token ID550
Feature activation+4.70e-02

Pos logit contributions
 distance+0.079
Oh+0.073
Uh+0.072
 narrower+0.072
Hmm+0.072
Neg logit contributions
 disl-0.150
az-0.133
ach-0.130
 Majesty-0.123
ubby-0.122
Token paid
Token ID3432
Feature activation+4.29e-02

Pos logit contributions
 distance+0.058
Uh+0.054
Oh+0.054
Hmm+0.054
 narrower+0.054
Neg logit contributions
 disl-0.085
az-0.074
ach-0.072
uggets-0.068
 Majesty-0.067
Token off
Token ID572
Feature activation+3.47e-02

Pos logit contributions
 distance+0.050
Oh+0.046
 narrower+0.046
Uh+0.046
Hmm+0.045
Neg logit contributions
 disl-0.081
ach-0.071
az-0.070
uggets-0.068
ubby-0.066
Token.
Token ID13
Feature activation+6.70e-02

Pos logit contributions
 distance+0.047
To+0.044
Oh+0.043
 million+0.043
 narrower+0.042
Neg logit contributions
 disl-0.059
ach-0.049
uggets-0.049
az-0.047
ubby-0.046
Token Now
Token ID2735
Feature activation+6.46e-02

Pos logit contributions
 distance+0.081
 narrower+0.076
Uh+0.074
Oh+0.072
Hmm+0.071
Neg logit contributions
 disl-0.124
az-0.109
ach-0.106
uggets-0.103
 Majesty-0.101
Token she
Token ID673
Feature activation+9.63e-02

Pos logit contributions
 distance+0.093
Oh+0.087
 narrower+0.087
Hmm+0.085
Uh+0.085
Neg logit contributions
 disl-0.104
ach-0.089
az-0.088
uggets-0.085
ubby-0.083
Token had
Token ID550
Feature activation+5.47e-02

Pos logit contributions
 distance+0.082
Oh+0.074
 narrower+0.073
Hmm+0.071
Uh+0.071
Neg logit contributions
 disl-0.210
ach-0.190
az-0.185
uggets-0.184
ubby-0.178
Token a
Token ID257
Feature activation+6.39e-02

Pos logit contributions
 distance+0.080
Oh+0.075
 narrower+0.075
Hmm+0.073
Uh+0.073
Neg logit contributions
 disl-0.087
ach-0.075
az-0.073
uggets-0.071
ubby-0.069
Token new
Token ID649
Feature activation+3.66e-02

Pos logit contributions
 distance+0.086
 narrower+0.083
Oh+0.082
Hmm+0.082
Uh+0.081
Neg logit contributions
 disl-0.106
az-0.089
ach-0.088
 Majesty-0.086
 majesty-0.084
Token skill
Token ID5032
Feature activation+4.47e-02

Pos logit contributions
 distance+0.056
Oh+0.051
 narrower+0.050
Maybe+0.049
To+0.049
Neg logit contributions
 disl-0.057
ach-0.049
az-0.047
uggets-0.047
ubby-0.045
Token to
Token ID284
Feature activation+5.63e-02

Pos logit contributions
 distance+0.059
To+0.051
 eventually+0.051
 drifted+0.050
Oh+0.049
Neg logit contributions
 disl-0.081
ach-0.068
uggets-0.067
az-0.063
 Majesty-0.063
Token boy
Token ID2933
Feature activation+7.42e-02

Pos logit contributions
 distance+0.088
To+0.084
Park+0.084
Finally+0.084
 narrower+0.084
Neg logit contributions
 disl-0.084
az-0.069
ach-0.066
ubby-0.066
 Majesty-0.064
Token never
Token ID1239
Feature activation+4.39e-02

Pos logit contributions
Hmm+0.102
 narrower+0.101
Uh+0.099
Oh+0.099
Aw+0.097
Neg logit contributions
 disl-0.099
az-0.098
ach-0.090
gy-0.090
--0.090
Token saw
Token ID2497
Feature activation+3.83e-02

Pos logit contributions
Hmm+0.055
Finally+0.054
Oh+0.053
Uh+0.052
 distance+0.051
Neg logit contributions
 disl-0.078
az-0.067
ach-0.064
rou-0.061
 Majesty-0.060
Token the
Token ID262
Feature activation+7.10e-02

Pos logit contributions
 distance+0.056
Oh+0.052
 narrower+0.052
Hmm+0.050
Ah+0.049
Neg logit contributions
 disl-0.062
ach-0.054
az-0.052
uggets-0.051
ubby-0.049
Token cushion
Token ID33579
Feature activation+7.07e-02

Pos logit contributions
To+0.103
 distance+0.103
Park+0.103
Finally+0.102
Hmm+0.102
Neg logit contributions
 disl-0.098
az-0.085
ubby-0.080
ach-0.077
 Majesty-0.076
Token he
Token ID339
Feature activation+9.63e-02

Pos logit contributions
Ah+0.082
Hmm+0.082
Uh+0.082
Oh+0.081
Aw+0.080
Neg logit contributions
 disl-0.119
ach-0.108
az-0.107
rou-0.100
ubby-0.099
Token had
Token ID550
Feature activation+4.79e-02

Pos logit contributions
Uh+0.100
 narrower+0.100
 distance+0.099
Hmm+0.099
Oh+0.096
Neg logit contributions
 disl-0.181
ach-0.158
az-0.158
rou-0.151
ubby-0.145
Token just
Token ID655
Feature activation+6.02e-02

Pos logit contributions
 distance+0.063
Uh+0.063
Hmm+0.063
Oh+0.062
 narrower+0.060
Neg logit contributions
 disl-0.081
ach-0.070
az-0.070
uggets-0.066
 Majesty-0.063
Token built
Token ID3170
Feature activation+3.40e-02

Pos logit contributions
 distance+0.074
Uh+0.070
Oh+0.070
Hmm+0.069
 narrower+0.069
Neg logit contributions
 disl-0.109
ach-0.093
az-0.093
uggets-0.089
ubby-0.087
Token ever
Token ID1683
Feature activation+4.36e-02

Pos logit contributions
 distance+0.031
 narrower+0.027
Oh+0.026
To+0.025
Where+0.025
Neg logit contributions
 disl-0.075
ach-0.064
uggets-0.064
ubby-0.064
az-0.064
Token again
Token ID757
Feature activation+4.55e-02

Pos logit contributions
Hmm+0.042
Oh+0.040
 distance+0.040
Uh+0.039
Finally+0.039
Neg logit contributions
 disl-0.088
ach-0.076
az-0.075
 Majesty-0.071
uggets-0.071
Token giving
Token ID3501
Feature activation-1.03e-01

Pos logit contributions
 disl+0.115
az+0.107
ubby+0.105
ach+0.103
uggets+0.102
Neg logit contributions
 distance-0.103
 million-0.095
Oh-0.094
Less-0.094
See-0.091
Token him
Token ID683
Feature activation-4.66e-02

Pos logit contributions
 disl+0.194
ubby+0.168
az+0.168
uggets+0.163
ach+0.162
Neg logit contributions
 distance-0.116
Oh-0.111
Ah-0.104
To-0.103
See-0.102
Token too
Token ID1165
Feature activation-6.29e-02

Pos logit contributions
 disl+0.083
ach+0.076
az+0.074
 gol+0.072
ubby+0.071
Neg logit contributions
 distance-0.055
 better-0.047
Park-0.047
Less-0.047
To-0.047
Token many
Token ID867
Feature activation-7.83e-02

Pos logit contributions
 disl+0.137
ach+0.122
az+0.121
uggets+0.117
ubby+0.116
Neg logit contributions
 distance-0.055
Oh-0.049
 narrower-0.049
Uh-0.048
Hmm-0.048
Token ch
Token ID442
Feature activation-1.20e-01

Pos logit contributions
 disl+0.125
ach+0.108
az+0.107
uggets+0.103
ubby+0.100
Neg logit contributions
 distance-0.112
Oh-0.104
 narrower-0.104
Hmm-0.101
Uh-0.100
Tokenocol
Token ID4668
Feature activation-2.26e-01

Pos logit contributions
 disl+0.136
ci+0.101
amental+0.088
 Majesty+0.088
latable+0.086
Neg logit contributions
 distance-0.202
 drifted-0.194
 million-0.189
He-0.189
Park-0.187
Tokenates
Token ID689
Feature activation-7.24e-02

Pos logit contributions
 disl+0.360
 Majesty+0.249
 sport+0.240
latable+0.237
 majesty+0.233
Neg logit contributions
Less-0.311
Fore-0.304
To-0.302
Warning-0.301
Park-0.298
Token."
Token ID526
Feature activation-5.90e-02

Pos logit contributions
 disl+0.134
 Majesty+0.109
az+0.109
ach+0.107
 majesty+0.105
Neg logit contributions
 distance-0.090
To-0.081
 million-0.079
Oh-0.077
Park-0.076
Token The
Token ID383
Feature activation-5.22e-02

Pos logit contributions
 disl+0.119
ach+0.106
az+0.105
uggets+0.101
ubby+0.100
Neg logit contributions
 distance-0.060
Oh-0.055
 narrower-0.054
Hmm-0.054
Uh-0.054
Token king
Token ID5822
Feature activation-7.64e-02

Pos logit contributions
 disl+0.089
az+0.078
ach+0.078
uggets+0.074
ubby+0.071
Neg logit contributions
 distance-0.071
Oh-0.065
 narrower-0.064
Hmm-0.063
Uh-0.063
Token learned
Token ID4499
Feature activation-8.16e-02

Pos logit contributions
 disl+0.156
ach+0.139
uggets+0.135
az+0.134
ubby+0.129
Neg logit contributions
 distance-0.076
Oh-0.068
Hmm-0.067
Uh-0.067
 narrower-0.067
Token reward
Token ID6721
Feature activation-1.56e-01

Pos logit contributions
 disl+0.344
az+0.291
ach+0.288
uggets+0.280
uc+0.264
Neg logit contributions
 distance-0.268
Oh-0.230
 million-0.226
Uh-0.220
 narrower-0.211
Token.
Token ID13
Feature activation-7.88e-02

Pos logit contributions
 disl+0.261
az+0.223
ach+0.215
ubby+0.212
 Majesty+0.211
Neg logit contributions
 distance-0.212
Oh-0.195
To-0.195
Hmm-0.187
Maybe-0.187
Token It
Token ID632
Feature activation-5.78e-02

Pos logit contributions
 disl+0.153
ach+0.135
az+0.133
uggets+0.129
ubby+0.126
Neg logit contributions
 distance-0.086
Oh-0.080
 narrower-0.079
Hmm-0.078
Uh-0.078
Token was
Token ID373
Feature activation-7.56e-02

Pos logit contributions
 disl+0.115
ach+0.100
az+0.100
 Majesty+0.096
uggets+0.095
Neg logit contributions
 distance-0.061
Oh-0.056
 narrower-0.056
Uh-0.054
Hmm-0.054
Token ch
Token ID442
Feature activation-1.27e-01

Pos logit contributions
 disl+0.125
ach+0.107
az+0.106
uggets+0.099
ubby+0.098
Neg logit contributions
 distance-0.108
 narrower-0.100
Oh-0.100
Uh-0.097
Hmm-0.096
Tokenocol
Token ID4668
Feature activation-2.26e-01

Pos logit contributions
 disl+0.130
uggets+0.083
 Majesty+0.082
 majesty+0.078
ach+0.075
Neg logit contributions
 narrower-0.258
Uh-0.257
 distance-0.254
Oh-0.250
 million-0.248
Tokenates
Token ID689
Feature activation-7.78e-02

Pos logit contributions
 disl+0.333
 Majesty+0.221
az+0.214
 majesty+0.211
 sport+0.210
Neg logit contributions
Ah-0.346
Less-0.345
Warning-0.342
 narrower-0.339
To-0.339
Token and
Token ID290
Feature activation-4.46e-02

Pos logit contributions
 disl+0.142
az+0.113
 Majesty+0.109
ach+0.108
 majesty+0.107
Neg logit contributions
 distance-0.104
 narrower-0.092
 million-0.092
To-0.092
 eventually-0.091
Token sweets
Token ID42402
Feature activation-7.69e-02

Pos logit contributions
 disl+0.081
az+0.072
ach+0.072
ubby+0.069
uggets+0.068
Neg logit contributions
 distance-0.055
Oh-0.051
 narrower-0.050
Hmm-0.048
Uh-0.048
Token!
Token ID0
Feature activation-5.02e-02

Pos logit contributions
 disl+0.126
az+0.106
ach+0.104
ubby+0.102
uggets+0.101
Neg logit contributions
 distance-0.113
 narrower-0.102
Oh-0.102
To-0.101
Hmm-0.099
Token
Token ID198
Feature activation-6.58e-02

Pos logit contributions
 disl+0.089
ach+0.077
az+0.076
uggets+0.073
ubby+0.072
Neg logit contributions
 distance-0.064
 narrower-0.059
Oh-0.059
Uh-0.058
Hmm-0.058
Token her
Token ID607
Feature activation-1.55e-01

Pos logit contributions
 distance+0.050
Oh+0.046
 narrower+0.046
To+0.045
Uh+0.044
Neg logit contributions
 disl-0.076
ach-0.070
az-0.068
uggets-0.065
ubby-0.064
Token mom
Token ID1995
Feature activation-1.18e-01

Pos logit contributions
 disl+0.272
az+0.238
ach+0.228
ubby+0.212
uggets+0.210
Neg logit contributions
 distance-0.213
Oh-0.183
 drifted-0.177
 third-0.176
 million-0.175
Tokenmy
Token ID1820
Feature activation-1.21e-01

Pos logit contributions
 disl+0.219
ach+0.191
az+0.188
uggets+0.181
ubby+0.179
Neg logit contributions
 distance-0.141
Oh-0.129
 narrower-0.125
Hmm-0.125
Uh-0.124
Token watched
Token ID7342
Feature activation-1.27e-01

Pos logit contributions
 disl+0.230
ach+0.202
az+0.200
uggets+0.192
ubby+0.190
Neg logit contributions
 distance-0.137
Oh-0.123
To-0.121
 narrower-0.120
 drifted-0.120
Token in
Token ID287
Feature activation-7.89e-02

Pos logit contributions
 disl+0.229
az+0.195
 majesty+0.191
ubby+0.190
 Majesty+0.190
Neg logit contributions
 distance-0.147
Oh-0.142
To-0.137
Where-0.136
Who-0.136
Token amaz
Token ID40642
Feature activation-2.10e-01

Pos logit contributions
 disl+0.124
ach+0.105
az+0.104
uggets+0.100
ubby+0.099
Neg logit contributions
 distance-0.119
Oh-0.111
 narrower-0.109
Uh-0.107
Hmm-0.106
Tokenement
Token ID972
Feature activation-1.19e-01

Pos logit contributions
 disl+0.476
uggets+0.437
 majesty+0.412
gy+0.403
 Majesty+0.400
Neg logit contributions
Oh-0.132
See-0.128
 distance-0.125
 million-0.122
Finally-0.118
Token as
Token ID355
Feature activation-8.86e-02

Pos logit contributions
 disl+0.224
ubby+0.189
az+0.188
uggets+0.183
 Majesty+0.182
Neg logit contributions
 distance-0.129
To-0.119
 million-0.117
 drifted-0.116
 narrower-0.115
Token the
Token ID262
Feature activation-6.41e-02

Pos logit contributions
 disl+0.157
ach+0.137
az+0.136
uggets+0.131
ubby+0.129
Neg logit contributions
 distance-0.112
Oh-0.105
 narrower-0.104
Hmm-0.102
Uh-0.100
Token machine
Token ID4572
Feature activation-5.20e-02

Pos logit contributions
 disl+0.104
ach+0.089
az+0.087
 Majesty+0.082
ubby+0.081
Neg logit contributions
 distance-0.089
 narrower-0.086
Hmm-0.084
Oh-0.084
Uh-0.083
Token flew
Token ID13112
Feature activation-8.98e-02

Pos logit contributions
 disl+0.093
ach+0.081
az+0.081
ci+0.074
 Majesty+0.073
Neg logit contributions
 distance-0.062
Hmm-0.061
Oh-0.061
Uh-0.061
 narrower-0.059
Token't
Token ID470
Feature activation-1.36e-01

Pos logit contributions
Oh+0.031
Who+0.030
Maybe+0.029
To+0.029
Hmm+0.029
Neg logit contributions
 disl-0.039
ach-0.036
az-0.035
ubby-0.033
ishing-0.033
Token know
Token ID760
Feature activation-1.23e-01

Pos logit contributions
 disl+0.253
ach+0.230
az+0.228
uggets+0.223
ubby+0.216
Neg logit contributions
 distance-0.160
 narrower-0.138
Oh-0.136
To-0.135
Who-0.132
Token the
Token ID262
Feature activation-9.91e-02

Pos logit contributions
 disl+0.173
ach+0.150
az+0.147
ubby+0.141
uggets+0.141
Neg logit contributions
 distance-0.194
 narrower-0.185
Oh-0.183
To-0.179
Hmm-0.176
Token rh
Token ID9529
Feature activation-1.88e-01

Pos logit contributions
 disl+0.172
ach+0.150
az+0.149
uggets+0.143
ubby+0.139
Neg logit contributions
 distance-0.131
Oh-0.121
 narrower-0.120
Uh-0.117
Hmm-0.117
Tokenin
Token ID259
Feature activation-7.90e-02

Pos logit contributions
 disl+0.301
az+0.233
 Majesty+0.223
ach+0.217
 majesty+0.215
Neg logit contributions
 distance-0.292
Oh-0.279
Uh-0.272
Ah-0.270
 million-0.270
Tokenoc
Token ID420
Feature activation-2.06e-01

Pos logit contributions
 disl+0.107
ach+0.088
az+0.088
uggets+0.083
ubby+0.081
Neg logit contributions
 distance-0.134
Oh-0.127
 narrower-0.126
Hmm-0.125
Uh-0.125
Tokeneros
Token ID27498
Feature activation-8.26e-02

Pos logit contributions
 disl+0.224
uggets+0.162
iggle+0.154
aj+0.151
ubby+0.150
Neg logit contributions
 distance-0.365
Oh-0.361
 Moral-0.351
Less-0.351
Warning-0.351
Token was
Token ID373
Feature activation-7.60e-02

Pos logit contributions
 disl+0.169
ach+0.150
az+0.148
uggets+0.143
ubby+0.141
Neg logit contributions
 distance-0.082
Oh-0.075
 narrower-0.075
Hmm-0.073
Uh-0.073
Token wild
Token ID4295
Feature activation-1.11e-01

Pos logit contributions
 disl+0.141
ach+0.124
az+0.121
uggets+0.116
ubby+0.115
Neg logit contributions
 distance-0.095
 narrower-0.086
Oh-0.085
Uh-0.081
To-0.081
Token.
Token ID13
Feature activation-6.22e-02

Pos logit contributions
 disl+0.217
uggets+0.194
ubby+0.190
 majesty+0.189
 Majesty+0.187
Neg logit contributions
 distance-0.124
 narrower-0.104
Oh-0.099
 million-0.099
 darker-0.099
Token Can
Token ID1680
Feature activation-1.04e-01

Pos logit contributions
 disl+0.110
ach+0.095
az+0.094
uggets+0.091
 Majesty+0.088
Neg logit contributions
 distance-0.079
 narrower-0.075
Oh-0.072
Uh-0.072
Hmm-0.071
Tokenoll
Token ID692
Feature activation-1.12e-01

Pos logit contributions
 disl+0.149
ach+0.118
az+0.116
uggets+0.108
 Majesty+0.105
Neg logit contributions
 distance-0.197
 narrower-0.189
Oh-0.188
Hmm-0.188
Uh-0.186
Tokenip
Token ID541
Feature activation-6.83e-02

Pos logit contributions
 disl+0.184
ach+0.144
az+0.140
uggets+0.138
 Majesty+0.132
Neg logit contributions
 drifted-0.148
 million-0.146
 distance-0.145
Hmm-0.145
 narrower-0.144
Tokenops
Token ID2840
Feature activation-5.94e-02

Pos logit contributions
 disl+0.121
az+0.102
uggets+0.101
ach+0.101
ubby+0.097
Neg logit contributions
 distance-0.087
 narrower-0.080
Hmm-0.079
Oh-0.079
To-0.078
Token,
Token ID11
Feature activation-3.73e-02

Pos logit contributions
 disl+0.104
az+0.084
ach+0.083
uggets+0.080
 Majesty+0.079
Neg logit contributions
 distance-0.084
 narrower-0.074
 million-0.074
To-0.073
 drifted-0.072
Token ch
Token ID442
Feature activation-9.91e-02

Pos logit contributions
 disl+0.060
az+0.052
ach+0.051
ubby+0.048
uggets+0.048
Neg logit contributions
 distance-0.053
Oh-0.050
 narrower-0.050
Hmm-0.049
Uh-0.049
Tokenocol
Token ID4668
Feature activation-2.04e-01

Pos logit contributions
 disl+0.121
ci+0.093
uggets+0.080
 Majesty+0.077
ach+0.072
Neg logit contributions
He-0.172
 million-0.166
Ah-0.166
See-0.166
 narrower-0.166
Tokenates
Token ID689
Feature activation-5.86e-02

Pos logit contributions
 disl+0.326
 Majesty+0.216
 sport+0.211
 gol+0.211
az+0.207
Neg logit contributions
Ah-0.286
Less-0.282
See-0.281
Warning-0.277
To-0.276
Token,
Token ID11
Feature activation-3.96e-02

Pos logit contributions
 disl+0.094
az+0.077
ach+0.074
 Majesty+0.072
uggets+0.072
Neg logit contributions
 distance-0.086
 narrower-0.078
 million-0.077
To-0.076
Oh-0.075
Token g
Token ID308
Feature activation-1.11e-01

Pos logit contributions
 disl+0.064
az+0.055
ach+0.054
ubby+0.052
uggets+0.051
Neg logit contributions
 distance-0.058
Oh-0.054
 narrower-0.053
Hmm-0.052
Uh-0.052
Tokenummies
Token ID39578
Feature activation-5.62e-02

Pos logit contributions
 disl+0.146
ci+0.124
 gol+0.117
amental+0.106
 chefs+0.098
Neg logit contributions
 million-0.152
He-0.150
Finally-0.148
 drifted-0.147
lessly-0.142
Token,
Token ID11
Feature activation-3.54e-02

Pos logit contributions
 disl+0.096
ach+0.079
az+0.077
uggets+0.077
 Majesty+0.075
Neg logit contributions
 distance-0.076
 million-0.072
 drifted-0.066
He-0.066
To-0.065
Token as
Token ID355
Feature activation-5.27e-02

Pos logit contributions
 disl+0.097
ach+0.083
az+0.082
uggets+0.079
ubby+0.077
Neg logit contributions
 distance-0.086
 narrower-0.080
Oh-0.080
Hmm-0.079
Uh-0.078
Token he
Token ID339
Feature activation-4.45e-03

Pos logit contributions
 disl+0.113
az+0.101
ach+0.100
uggets+0.097
ubby+0.096
Neg logit contributions
 distance-0.047
 narrower-0.044
Oh-0.042
To-0.041
Ah-0.039
Token opened
Token ID4721
Feature activation-6.97e-02

Pos logit contributions
 disl+0.009
ach+0.008
az+0.008
ubby+0.008
 Majesty+0.008
Neg logit contributions
 distance-0.004
Oh-0.004
 narrower-0.004
Uh-0.004
Hmm-0.004
Token the
Token ID262
Feature activation-2.48e-02

Pos logit contributions
 disl+0.128
ach+0.106
uggets+0.105
az+0.105
ubby+0.104
Neg logit contributions
 distance-0.089
 narrower-0.078
Oh-0.077
Park-0.074
 million-0.074
Token ch
Token ID442
Feature activation-8.94e-02

Pos logit contributions
 disl+0.049
ach+0.043
az+0.043
ubby+0.041
 Majesty+0.041
Neg logit contributions
 distance-0.025
 narrower-0.024
Hmm-0.024
Oh-0.023
Uh-0.023
Tokenocol
Token ID4668
Feature activation-2.01e-01

Pos logit contributions
 disl+0.098
uggets+0.074
ach+0.063
 Majesty+0.059
ci+0.059
Neg logit contributions
 distance-0.171
 drifted-0.161
Uh-0.159
Ah-0.157
 narrower-0.156
Tokenates
Token ID689
Feature activation-3.90e-02

Pos logit contributions
 disl+0.285
 Majesty+0.189
az+0.185
 sport+0.181
 majesty+0.177
Neg logit contributions
 distance-0.320
Ah-0.312
To-0.311
Finally-0.308
Less-0.307
Token and
Token ID290
Feature activation-1.31e-02

Pos logit contributions
 disl+0.065
ach+0.055
az+0.054
uggets+0.052
ubby+0.051
Neg logit contributions
 distance-0.055
 narrower-0.051
Oh-0.050
Hmm-0.050
To-0.049
Token started
Token ID2067
Feature activation-5.12e-02

Pos logit contributions
 disl+0.025
az+0.023
ach+0.023
ubby+0.022
uggets+0.022
Neg logit contributions
 distance-0.015
Oh-0.013
 narrower-0.013
Hmm-0.013
Ah-0.013
Token eating
Token ID6600
Feature activation-6.70e-02

Pos logit contributions
 disl+0.082
ach+0.071
az+0.070
ubby+0.069
uggets+0.066
Neg logit contributions
 distance-0.073
 narrower-0.068
Oh-0.066
To-0.065
 drifted-0.064
Token them
Token ID606
Feature activation-7.55e-03

Pos logit contributions
 disl+0.111
az+0.095
ach+0.095
ubby+0.090
uggets+0.090
Neg logit contributions
 distance-0.093
Oh-0.086
 narrower-0.086
Hmm-0.084
Uh-0.083
Token them
Token ID606
Feature activation-3.26e-02

Pos logit contributions
 disl+0.100
az+0.086
ach+0.085
uggets+0.080
ubby+0.080
Neg logit contributions
 distance-0.096
Oh-0.091
 narrower-0.091
Hmm-0.089
Uh-0.088
Token and
Token ID290
Feature activation-2.65e-02

Pos logit contributions
 disl+0.056
az+0.049
ach+0.047
ubby+0.046
 Majesty+0.045
Neg logit contributions
 distance-0.044
To-0.040
 narrower-0.039
Oh-0.038
Park-0.037
Token never
Token ID1239
Feature activation-5.75e-02

Pos logit contributions
az+0.046
ubby+0.045
ach+0.045
 disl+0.044
uggets+0.040
Neg logit contributions
 distance-0.032
Maybe-0.029
To-0.029
 eventually-0.029
Oh-0.028
Token threw
Token ID9617
Feature activation-8.42e-02

Pos logit contributions
 disl+0.110
ach+0.099
az+0.098
uggets+0.094
ubby+0.093
Neg logit contributions
 distance-0.065
Oh-0.059
 narrower-0.059
Uh-0.057
Hmm-0.057
Token a
Token ID257
Feature activation-3.69e-02

Pos logit contributions
 disl+0.149
ach+0.129
az+0.128
uggets+0.123
ubby+0.121
Neg logit contributions
 distance-0.107
Oh-0.100
 narrower-0.099
Hmm-0.097
Uh-0.097
Token tant
Token ID24246
Feature activation-1.99e-01

Pos logit contributions
 disl+0.059
ach+0.050
az+0.050
uggets+0.047
 Majesty+0.046
Neg logit contributions
 distance-0.055
Oh-0.051
 narrower-0.050
Uh-0.050
Hmm-0.050
Tokenrum
Token ID6582
Feature activation-1.05e-01

Pos logit contributions
 disl+0.344
 Majesty+0.311
aj+0.307
 majesty+0.301
latable+0.298
Neg logit contributions
Warning-0.175
Less-0.168
 vengeance-0.163
talk-0.162
Ah-0.158
Token again
Token ID757
Feature activation-5.26e-02

Pos logit contributions
 disl+0.203
ach+0.170
az+0.166
uggets+0.166
ubby+0.163
Neg logit contributions
 distance-0.113
Oh-0.111
 narrower-0.107
Maybe-0.107
Where-0.106
Token.
Token ID13
Feature activation-3.28e-02

Pos logit contributions
 disl+0.093
ach+0.081
az+0.080
uggets+0.078
ubby+0.077
Neg logit contributions
 distance-0.068
 narrower-0.061
Oh-0.061
To-0.060
Hmm-0.058
Token<|endoftext|>
Token ID50256
Feature activation-1.06e-01

Pos logit contributions
 disl+0.064
ach+0.055
az+0.055
uggets+0.053
ubby+0.052
Neg logit contributions
 distance-0.037
 narrower-0.034
Oh-0.033
Uh-0.033
Hmm-0.033
TokenOnce
Token ID7454
Feature activation-1.12e-01

Pos logit contributions
 disl+0.230
ach+0.203
az+0.199
 Majesty+0.196
uggets+0.196
Neg logit contributions
 distance-0.098
Oh-0.089
To-0.084
 narrower-0.084
Hmm-0.082
Token John
Token ID1757
Feature activation-1.48e-01

Pos logit contributions
 distance+0.069
Oh+0.068
 narrower+0.067
Aw+0.066
Uh+0.065
Neg logit contributions
 disl-0.099
ach-0.085
az-0.084
 Majesty-0.080
ci-0.080
Token chose
Token ID7690
Feature activation-1.11e-01

Pos logit contributions
 disl+0.310
az+0.268
ach+0.265
uggets+0.263
ubby+0.258
Neg logit contributions
 distance-0.148
Oh-0.130
Uh-0.122
 drifted-0.121
Ah-0.121
Token some
Token ID617
Feature activation-9.61e-02

Pos logit contributions
 disl+0.189
az+0.160
ach+0.158
ubby+0.157
 Majesty+0.154
Neg logit contributions
 distance-0.155
Oh-0.142
 narrower-0.136
Where-0.135
To-0.134
Token carrots
Token ID34397
Feature activation-9.12e-02

Pos logit contributions
 disl+0.158
az+0.135
ach+0.135
uggets+0.129
 Majesty+0.127
Neg logit contributions
 distance-0.135
Oh-0.126
 narrower-0.124
Hmm-0.122
Uh-0.121
Token and
Token ID290
Feature activation-5.37e-02

Pos logit contributions
 disl+0.172
az+0.144
 Majesty+0.142
 majesty+0.136
ach+0.134
Neg logit contributions
 distance-0.118
To-0.106
Maybe-0.104
He-0.102
 eventually-0.101
Token cuc
Token ID38421
Feature activation-1.98e-01

Pos logit contributions
 disl+0.108
az+0.102
ach+0.098
ubby+0.096
uggets+0.092
Neg logit contributions
 distance-0.056
 eventually-0.050
 narrower-0.049
Oh-0.049
 drifted-0.048
Tokenumbers
Token ID17024
Feature activation-7.71e-02

Pos logit contributions
 disl+0.354
az+0.266
 majesty+0.242
 Majesty+0.240
ubby+0.227
Neg logit contributions
Finally-0.286
Hmm-0.276
To-0.270
Ah-0.268
Less-0.261
Token to
Token ID284
Feature activation-5.64e-02

Pos logit contributions
 disl+0.138
az+0.117
ach+0.114
 Majesty+0.112
ubby+0.111
Neg logit contributions
 distance-0.096
To-0.088
 narrower-0.087
Maybe-0.086
Oh-0.085
Token take
Token ID1011
Feature activation-8.17e-02

Pos logit contributions
 disl+0.107
az+0.105
rou+0.098
ach+0.096
ubby+0.095
Neg logit contributions
 distance-0.061
Who-0.053
To-0.053
 eventually-0.053
Ah-0.052
Token home
Token ID1363
Feature activation-8.45e-02

Pos logit contributions
 disl+0.145
az+0.124
ach+0.121
ubby+0.117
 Majesty+0.116
Neg logit contributions
 distance-0.103
Oh-0.096
To-0.095
Ah-0.095
Hmm-0.093
Token.
Token ID13
Feature activation-4.51e-02

Pos logit contributions
 disl+0.160
az+0.140
 Majesty+0.134
ubby+0.133
 majesty+0.130
Neg logit contributions
 distance-0.099
To-0.089
 narrower-0.085
 million-0.085
He-0.084
Token,
Token ID11
Feature activation-3.35e-02

Pos logit contributions
 disl+0.077
az+0.064
ach+0.063
 Majesty+0.060
uggets+0.060
Neg logit contributions
 distance-0.065
 narrower-0.059
Oh-0.058
To-0.057
Hmm-0.057
Token cup
Token ID6508
Feature activation-5.98e-02

Pos logit contributions
 disl+0.055
az+0.047
ach+0.047
ubby+0.045
 Majesty+0.043
Neg logit contributions
 distance-0.048
Oh-0.045
 narrower-0.044
Hmm-0.043
Uh-0.043
Tokencakes
Token ID37263
Feature activation-5.13e-02

Pos logit contributions
 disl+0.099
ach+0.082
az+0.081
uggets+0.079
 Majesty+0.078
Neg logit contributions
 distance-0.085
Oh-0.078
 narrower-0.078
Hmm-0.076
Uh-0.076
Token and
Token ID290
Feature activation-2.49e-02

Pos logit contributions
 disl+0.058
ach+0.050
az+0.047
uggets+0.044
ubby+0.041
Neg logit contributions
 distance-0.094
 narrower-0.092
Oh-0.092
Uh-0.092
Hmm-0.091
Token ch
Token ID442
Feature activation-9.67e-02

Pos logit contributions
 disl+0.043
az+0.037
ach+0.037
ubby+0.036
uggets+0.035
Neg logit contributions
 distance-0.033
Oh-0.031
 narrower-0.031
Hmm-0.030
Uh-0.030
Tokenocol
Token ID4668
Feature activation-1.96e-01

Pos logit contributions
 disl+0.111
ci+0.088
 Majesty+0.081
 majesty+0.069
az+0.068
Neg logit contributions
 drifted-0.164
See-0.159
Park-0.158
Ah-0.158
Aw-0.156
Tokenates
Token ID689
Feature activation-5.24e-02

Pos logit contributions
 disl+0.279
 Majesty+0.190
az+0.181
 majesty+0.174
ubby+0.171
Neg logit contributions
Ah-0.305
 narrower-0.302
To-0.301
Oh-0.300
Finally-0.300
Token.
Token ID13
Feature activation-3.05e-02

Pos logit contributions
 disl+0.096
az+0.081
ach+0.078
 Majesty+0.077
ubby+0.077
Neg logit contributions
 distance-0.065
 narrower-0.059
To-0.058
Oh-0.057
Park-0.056
Token
Token ID220
Feature activation-9.06e-02

Pos logit contributions
 disl+0.056
ach+0.049
az+0.048
uggets+0.047
 Majesty+0.045
Neg logit contributions
 distance-0.037
 narrower-0.035
Uh-0.034
Oh-0.033
Hmm-0.033
Token
Token ID198
Feature activation-4.43e-02

Pos logit contributions
 disl+0.201
ach+0.177
az+0.176
 Majesty+0.170
ubby+0.170
Neg logit contributions
 distance-0.072
Oh-0.067
To-0.066
 narrower-0.064
Finally-0.064
Token
Token ID198
Feature activation-3.64e-02

Pos logit contributions
 disl+0.080
ach+0.073
az+0.069
uggets+0.069
ubby+0.065
Neg logit contributions
 distance-0.053
 narrower-0.051
Uh-0.048
Oh-0.048
Hmm-0.048
Token he
Token ID339
Feature activation-8.17e-02

Pos logit contributions
 disl+0.314
az+0.276
ach+0.274
uggets+0.269
ubby+0.265
Neg logit contributions
 distance-0.162
Oh-0.147
 narrower-0.141
To-0.139
Hmm-0.138
Token would
Token ID561
Feature activation-8.89e-02

Pos logit contributions
 disl+0.177
ach+0.159
az+0.157
uggets+0.152
ubby+0.150
Neg logit contributions
 distance-0.070
Oh-0.063
 narrower-0.063
Hmm-0.061
Uh-0.061
Token take
Token ID1011
Feature activation-1.00e-01

Pos logit contributions
 disl+0.170
ach+0.152
az+0.151
uggets+0.149
ubby+0.144
Neg logit contributions
 distance-0.102
 narrower-0.088
To-0.087
Hmm-0.085
Oh-0.085
Token the
Token ID262
Feature activation-8.13e-02

Pos logit contributions
 disl+0.166
ach+0.140
az+0.140
uggets+0.136
ubby+0.133
Neg logit contributions
 distance-0.139
Oh-0.130
 narrower-0.129
Hmm-0.126
To-0.126
Token av
Token ID1196
Feature activation-1.58e-01

Pos logit contributions
 disl+0.134
ach+0.115
az+0.114
uggets+0.109
ubby+0.106
Neg logit contributions
 distance-0.114
Oh-0.106
 narrower-0.106
Hmm-0.104
Uh-0.104
Tokenoc
Token ID420
Feature activation-1.93e-01

Pos logit contributions
 disl+0.281
az+0.219
 majesty+0.217
 Majesty+0.212
uggets+0.209
Neg logit contributions
 distance-0.224
To-0.208
Finally-0.208
Hmm-0.205
 million-0.202
Tokenados
Token ID22484
Feature activation-7.16e-02

Pos logit contributions
 disl+0.230
uggets+0.172
az+0.160
iggle+0.158
ubby+0.157
Neg logit contributions
 distance-0.321
Less-0.316
Hmm-0.314
 million-0.313
Oh-0.310
Token and
Token ID290
Feature activation-3.68e-02

Pos logit contributions
 disl+0.120
az+0.100
ach+0.098
 Majesty+0.095
ubby+0.094
Neg logit contributions
 distance-0.105
To-0.095
 narrower-0.093
Oh-0.093
Hmm-0.092
Token scatter
Token ID41058
Feature activation-9.09e-02

Pos logit contributions
 disl+0.064
az+0.058
ach+0.057
ubby+0.055
uggets+0.054
Neg logit contributions
 distance-0.047
 narrower-0.043
Oh-0.043
Hmm-0.042
Uh-0.042
Token them
Token ID606
Feature activation-3.77e-02

Pos logit contributions
 disl+0.162
ach+0.140
az+0.139
uggets+0.134
ubby+0.131
Neg logit contributions
 distance-0.116
Oh-0.107
 narrower-0.106
Hmm-0.104
Uh-0.104
Token around
Token ID1088
Feature activation-8.42e-02

Pos logit contributions
 disl+0.060
ach+0.050
az+0.050
uggets+0.048
ubby+0.047
Neg logit contributions
 distance-0.056
 narrower-0.052
Oh-0.051
Hmm-0.050
To-0.049
Token.
Token ID13
Feature activation-1.17e-01

Pos logit contributions
 distance+0.062
 narrower+0.057
Oh+0.056
Where+0.053
Ah+0.053
Neg logit contributions
 disl-0.084
ach-0.072
uggets-0.066
az-0.066
 Majesty-0.066
Token Tim
Token ID5045
Feature activation-9.00e-02

Pos logit contributions
 disl+0.237
ach+0.212
az+0.211
uggets+0.202
ubby+0.201
Neg logit contributions
 distance-0.117
Oh-0.110
Hmm-0.105
Uh-0.105
 narrower-0.104
Tokenmy
Token ID1820
Feature activation-9.42e-02

Pos logit contributions
 disl+0.183
ach+0.163
uggets+0.160
az+0.155
 Majesty+0.149
Neg logit contributions
 distance-0.095
Oh-0.084
 narrower-0.082
Hmm-0.081
Maybe-0.080
Token watched
Token ID7342
Feature activation-1.15e-01

Pos logit contributions
 disl+0.204
ach+0.190
uggets+0.188
az+0.182
ubby+0.178
Neg logit contributions
 distance-0.078
 drifted-0.070
Maybe-0.068
 eventually-0.066
 narrower-0.066
Token with
Token ID351
Feature activation-8.54e-02

Pos logit contributions
 disl+0.212
 majesty+0.178
az+0.177
 Majesty+0.176
ubby+0.175
Neg logit contributions
 distance-0.125
Oh-0.119
 drifted-0.119
Who-0.119
 eventually-0.118
Token amaz
Token ID40642
Feature activation-1.93e-01

Pos logit contributions
ach+0.166
 disl+0.163
uc+0.153
uggets+0.150
az+0.149
Neg logit contributions
 distance-0.108
Oh-0.087
 narrower-0.086
Where-0.084
Aw-0.084
Tokenement
Token ID972
Feature activation-1.09e-01

Pos logit contributions
 disl+0.397
uggets+0.350
 majesty+0.336
 Majesty+0.334
ubby+0.330
Neg logit contributions
Oh-0.166
 distance-0.159
See-0.158
To-0.155
Finally-0.155
Token as
Token ID355
Feature activation-7.89e-02

Pos logit contributions
 disl+0.211
ubby+0.179
az+0.176
ere+0.174
uggets+0.173
Neg logit contributions
 distance-0.109
 drifted-0.103
 narrower-0.100
 eventually-0.100
He-0.099
Token he
Token ID339
Feature activation-1.77e-02

Pos logit contributions
 disl+0.160
ach+0.144
az+0.142
uggets+0.142
ubby+0.138
Neg logit contributions
 distance-0.079
 narrower-0.074
Oh-0.072
Ah-0.068
See-0.067
Token saw
Token ID2497
Feature activation-7.62e-02

Pos logit contributions
 disl+0.027
az+0.025
ach+0.025
rou+0.025
 Majesty+0.025
Neg logit contributions
 distance-0.024
Oh-0.023
Uh-0.022
Hmm-0.022
Less-0.021
Token stars
Token ID5788
Feature activation-6.00e-02

Pos logit contributions
 disl+0.119
ach+0.101
az+0.101
ubby+0.099
uggets+0.099
Neg logit contributions
 distance-0.114
Oh-0.108
 narrower-0.107
Ah-0.106
Hmm-0.103
Token quiet
Token ID5897
Feature activation-1.80e-01

Pos logit contributions
 distance+0.059
Uh+0.051
 narrower+0.050
 million+0.050
Oh+0.050
Neg logit contributions
 disl-0.073
ach-0.063
uggets-0.063
az-0.060
uc-0.057
Token compared
Token ID3688
Feature activation-1.69e-01

Pos logit contributions
 disl+0.333
uggets+0.293
ubby+0.290
az+0.288
ach+0.287
Neg logit contributions
 distance-0.223
 drifted-0.181
Oh-0.172
To-0.171
 million-0.166
Token to
Token ID284
Feature activation-8.34e-02

Pos logit contributions
 disl+0.296
ach+0.253
 Majesty+0.248
uggets+0.248
az+0.247
Neg logit contributions
 distance-0.228
Oh-0.204
 narrower-0.194
Where-0.191
To-0.190
Token the
Token ID262
Feature activation-9.19e-02

Pos logit contributions
 disl+0.135
ach+0.119
az+0.117
uggets+0.113
ubby+0.111
Neg logit contributions
 distance-0.118
Oh-0.108
 narrower-0.108
Hmm-0.105
To-0.104
Token noisy
Token ID31210
Feature activation-1.02e-01

Pos logit contributions
 disl+0.161
ach+0.140
az+0.138
uggets+0.133
ubby+0.130
Neg logit contributions
 distance-0.120
Oh-0.111
 narrower-0.110
Uh-0.108
Hmm-0.108
Token se
Token ID384
Feature activation-1.92e-01

Pos logit contributions
 disl+0.170
ach+0.149
uggets+0.148
az+0.147
ubby+0.140
Neg logit contributions
 distance-0.141
Oh-0.124
 narrower-0.121
Uh-0.120
 million-0.117
Tokenag
Token ID363
Feature activation-1.28e-01

Pos logit contributions
 disl+0.290
uggets+0.259
ach+0.255
 Majesty+0.243
 majesty+0.242
Neg logit contributions
 distance-0.261
Finally-0.244
To-0.241
Ah-0.234
Park-0.231
Tokenull
Token ID724
Feature activation-8.30e-02

Pos logit contributions
 disl+0.220
 Majesty+0.197
 majesty+0.187
uggets+0.186
ach+0.180
Neg logit contributions
 distance-0.162
To-0.149
He-0.146
 sounded-0.142
lessly-0.140
Tokens
Token ID82
Feature activation-7.94e-02

Pos logit contributions
 disl+0.150
uggets+0.135
ach+0.134
az+0.134
ubby+0.127
Neg logit contributions
 distance-0.100
To-0.086
 narrower-0.085
 million-0.085
 drifted-0.084
Token outside
Token ID2354
Feature activation-9.56e-02

Pos logit contributions
 disl+0.124
ach+0.106
az+0.105
uggets+0.101
ubby+0.098
Neg logit contributions
 distance-0.117
 narrower-0.109
Oh-0.109
Hmm-0.106
Uh-0.106
Token.
Token ID13
Feature activation-6.32e-02

Pos logit contributions
 disl+0.155
ach+0.133
az+0.131
uggets+0.127
ubby+0.124
Neg logit contributions
 distance-0.137
Oh-0.127
 narrower-0.126
Uh-0.122
Hmm-0.122
Token like
Token ID588
Feature activation-1.23e-01

Pos logit contributions
 disl+0.206
ach+0.176
az+0.174
uggets+0.168
ubby+0.164
Neg logit contributions
 distance-0.185
Oh-0.172
 narrower-0.172
Hmm-0.168
Uh-0.168
Token lettuce
Token ID39406
Feature activation-1.07e-01

Pos logit contributions
 disl+0.193
az+0.167
ach+0.167
ubby+0.158
uggets+0.158
Neg logit contributions
 distance-0.176
Oh-0.165
 narrower-0.163
To-0.157
Aw-0.157
Token,
Token ID11
Feature activation-6.59e-02

Pos logit contributions
 disl+0.176
az+0.155
ach+0.145
ubby+0.142
rou+0.142
Neg logit contributions
 distance-0.151
To-0.136
 million-0.135
Maybe-0.133
Oh-0.130
Token tomatoes
Token ID23972
Feature activation-8.28e-02

Pos logit contributions
 disl+0.103
az+0.098
ach+0.090
ubby+0.088
 Majesty+0.087
Neg logit contributions
 distance-0.092
Oh-0.087
 narrower-0.085
Hmm-0.084
 million-0.083
Token and
Token ID290
Feature activation-4.80e-02

Pos logit contributions
 disl+0.129
az+0.111
ach+0.105
 Majesty+0.101
ubby+0.101
Neg logit contributions
 distance-0.124
To-0.115
Maybe-0.112
Oh-0.110
 million-0.109
Token cuc
Token ID38421
Feature activation-1.91e-01

Pos logit contributions
az+0.073
 disl+0.073
ach+0.068
ubby+0.063
uff+0.062
Neg logit contributions
 distance-0.068
Oh-0.066
Ah-0.065
Who-0.065
Hmm-0.064
Tokenumbers
Token ID17024
Feature activation-7.94e-02

Pos logit contributions
 disl+0.319
az+0.247
 majesty+0.218
 Majesty+0.216
gy+0.206
Neg logit contributions
Finally-0.278
Hmm-0.272
To-0.271
Ah-0.260
Oh-0.254
Token.
Token ID13
Feature activation-5.99e-02

Pos logit contributions
 disl+0.145
az+0.126
ach+0.121
 Majesty+0.120
ubby+0.117
Neg logit contributions
 distance-0.095
To-0.086
Maybe-0.083
Oh-0.083
 narrower-0.082
Token He
Token ID679
Feature activation-3.46e-02

Pos logit contributions
 disl+0.120
ach+0.107
az+0.106
uggets+0.101
ubby+0.101
Neg logit contributions
 distance-0.061
Oh-0.057
Hmm-0.055
 narrower-0.055
Uh-0.055
Token smiled
Token ID13541
Feature activation-1.13e-01

Pos logit contributions
 disl+0.071
az+0.065
ach+0.065
uggets+0.063
ubby+0.061
Neg logit contributions
 distance-0.030
Maybe-0.029
 eventually-0.028
Hmm-0.028
To-0.027
Token as
Token ID355
Feature activation-6.20e-02

Pos logit contributions
 disl+0.238
az+0.202
affles+0.191
ocol+0.191
ach+0.189
Neg logit contributions
He-0.125
 distance-0.113
Maybe-0.105
Where-0.099
 louder-0.099
Token not
Token ID407
Feature activation-3.88e-02

Pos logit contributions
 disl+0.076
ach+0.070
az+0.069
uggets+0.069
ubby+0.066
Neg logit contributions
 distance-0.044
To-0.039
Hmm-0.039
Finally-0.038
 narrower-0.038
Token eat
Token ID4483
Feature activation-6.52e-02

Pos logit contributions
 disl+0.066
ach+0.057
az+0.055
uggets+0.053
ubby+0.052
Neg logit contributions
 distance-0.052
 narrower-0.050
Oh-0.050
Uh-0.049
Hmm-0.049
Token too
Token ID1165
Feature activation-3.51e-02

Pos logit contributions
 disl+0.110
az+0.094
ach+0.092
ubby+0.089
uggets+0.088
Neg logit contributions
 distance-0.088
 narrower-0.082
Oh-0.080
Hmm-0.078
Aw-0.077
Token many
Token ID867
Feature activation-4.62e-02

Pos logit contributions
 disl+0.078
ach+0.070
az+0.070
ubby+0.068
uggets+0.067
Neg logit contributions
 distance-0.028
Oh-0.026
Uh-0.026
 narrower-0.025
Hmm-0.025
Token ch
Token ID442
Feature activation-8.58e-02

Pos logit contributions
 disl+0.069
ach+0.058
az+0.057
uggets+0.052
ubby+0.052
Neg logit contributions
 distance-0.072
Uh-0.071
Oh-0.070
Hmm-0.070
 narrower-0.069
Tokenocol
Token ID4668
Feature activation-1.90e-01

Pos logit contributions
 disl+0.094
ci+0.074
 Majesty+0.062
uggets+0.061
ach+0.060
Neg logit contributions
 distance-0.144
 narrower-0.138
 drifted-0.138
He-0.137
 bud-0.134
Tokenates
Token ID689
Feature activation-3.82e-02

Pos logit contributions
 disl+0.273
 Majesty+0.187
az+0.184
 gol+0.173
 majesty+0.173
Neg logit contributions
To-0.290
 distance-0.288
Less-0.288
Warning-0.284
Park-0.281
Token and
Token ID290
Feature activation-7.46e-03

Pos logit contributions
 disl+0.066
az+0.056
ach+0.055
 Majesty+0.054
ubby+0.052
Neg logit contributions
 distance-0.051
 narrower-0.046
Oh-0.046
To-0.045
Hmm-0.045
Token his
Token ID465
Feature activation-3.39e-02

Pos logit contributions
 disl+0.013
az+0.013
ach+0.013
ubby+0.013
uggets+0.012
Neg logit contributions
 distance-0.008
 narrower-0.007
Oh-0.007
To-0.007
 eventually-0.007
Token friend
Token ID1545
Feature activation-1.73e-02

Pos logit contributions
 disl+0.052
az+0.045
ach+0.045
uggets+0.042
ubby+0.041
Neg logit contributions
 distance-0.051
Oh-0.046
 narrower-0.046
Hmm-0.045
Aw-0.045
Token learned
Token ID4499
Feature activation-5.43e-02

Pos logit contributions
 disl+0.035
ach+0.031
az+0.031
uggets+0.030
ubby+0.030
Neg logit contributions
 distance-0.017
Oh-0.016
 narrower-0.016
Hmm-0.015
Uh-0.015
Token was
Token ID373
Feature activation-5.27e-02

Pos logit contributions
 disl+0.021
gy+0.021
ci+0.020
az+0.019
ate+0.019
Neg logit contributions
 distance-0.026
Finally-0.024
Less-0.024
Oh-0.024
Hmm-0.023
Token a
Token ID257
Feature activation-3.86e-02

Pos logit contributions
 disl+0.067
ach+0.058
uggets+0.056
az+0.056
ubby+0.053
Neg logit contributions
 distance-0.089
Hmm-0.085
Oh-0.085
Uh-0.085
 narrower-0.083
Token box
Token ID3091
Feature activation-4.53e-02

Pos logit contributions
 disl+0.058
ach+0.050
uggets+0.047
az+0.047
 Majesty+0.047
Neg logit contributions
 distance-0.055
 narrower-0.054
Hmm-0.054
Oh-0.053
Less-0.052
Token of
Token ID286
Feature activation-6.55e-02

Pos logit contributions
 disl+0.080
az+0.062
 Majesty+0.062
ach+0.060
 majesty+0.060
Neg logit contributions
 distance-0.066
To-0.059
 narrower-0.059
 million-0.058
Oh-0.058
Token ch
Token ID442
Feature activation-9.80e-02

Pos logit contributions
 disl+0.095
ach+0.079
az+0.076
uggets+0.075
ubby+0.072
Neg logit contributions
 distance-0.103
 narrower-0.098
Oh-0.097
Hmm-0.097
Uh-0.097
Tokenocol
Token ID4668
Feature activation-1.90e-01

Pos logit contributions
 disl+0.101
 Majesty+0.063
ach+0.059
az+0.059
 majesty+0.056
Neg logit contributions
 distance-0.199
 narrower-0.197
Uh-0.194
Park-0.192
Hmm-0.189
Tokenates
Token ID689
Feature activation-4.61e-02

Pos logit contributions
 disl+0.251
 Majesty+0.166
az+0.165
ach+0.153
 majesty+0.153
Neg logit contributions
 distance-0.327
 narrower-0.317
Park-0.317
To-0.316
Warning-0.315
Token.
Token ID13
Feature activation-3.12e-02

Pos logit contributions
 disl+0.090
az+0.074
 Majesty+0.072
 majesty+0.070
ach+0.069
Neg logit contributions
 distance-0.056
 narrower-0.050
To-0.050
 million-0.048
Oh-0.048
Token Lily
Token ID20037
Feature activation-4.28e-02

Pos logit contributions
 disl+0.069
ach+0.062
az+0.061
uc+0.058
ubby+0.057
Neg logit contributions
Oh-0.025
 distance-0.025
Hmm-0.025
Ah-0.024
To-0.024
Token loved
Token ID6151
Feature activation-8.17e-02

Pos logit contributions
 disl+0.089
ach+0.080
az+0.078
uggets+0.076
ubby+0.076
Neg logit contributions
 distance-0.041
 narrower-0.037
Hmm-0.036
Oh-0.036
Maybe-0.036
Token ch
Token ID442
Feature activation-9.06e-02

Pos logit contributions
 disl+0.136
ach+0.118
ubby+0.117
az+0.116
 Majesty+0.107
Neg logit contributions
 distance-0.109
Oh-0.103
 narrower-0.102
Who-0.101
Where-0.101
Token give
Token ID1577
Feature activation-1.18e-01

Pos logit contributions
 disl+0.172
az+0.170
ach+0.167
ubby+0.162
uggets+0.160
Neg logit contributions
 distance-0.110
To-0.088
Oh-0.086
Who-0.086
 eventually-0.086
Token to
Token ID284
Feature activation-7.98e-02

Pos logit contributions
 disl+0.219
az+0.198
ach+0.194
ubby+0.193
 Majesty+0.188
Neg logit contributions
 distance-0.134
Oh-0.125
To-0.119
Ah-0.118
Where-0.117
Token the
Token ID262
Feature activation-6.82e-02

Pos logit contributions
ach+0.139
 disl+0.134
ubby+0.132
az+0.130
uggets+0.125
Neg logit contributions
 distance-0.095
 other-0.084
Less-0.080
 narrower-0.080
Park-0.079
Token rh
Token ID9529
Feature activation-1.67e-01

Pos logit contributions
 disl+0.114
ach+0.099
az+0.098
uggets+0.093
ubby+0.091
Neg logit contributions
 distance-0.094
Oh-0.088
 narrower-0.087
Hmm-0.086
Uh-0.086
Tokenin
Token ID259
Feature activation-6.27e-02

Pos logit contributions
 disl+0.264
az+0.210
 Majesty+0.196
ach+0.193
 majesty+0.189
Neg logit contributions
 distance-0.256
Oh-0.241
To-0.239
Ah-0.239
 million-0.238
Tokenoc
Token ID420
Feature activation-1.89e-01

Pos logit contributions
 disl+0.094
az+0.082
ach+0.080
uggets+0.075
ubby+0.075
Neg logit contributions
 distance-0.099
To-0.090
Oh-0.090
Hmm-0.089
Less-0.087
Tokeneros
Token ID27498
Feature activation-7.27e-02

Pos logit contributions
 disl+0.199
uggets+0.144
ubby+0.137
iggle+0.136
az+0.135
Neg logit contributions
 distance-0.339
Oh-0.329
 Moral-0.327
Less-0.327
Warning-0.327
Token.
Token ID13
Feature activation-5.98e-02

Pos logit contributions
 disl+0.122
ach+0.111
az+0.111
ubby+0.104
uggets+0.103
Neg logit contributions
 distance-0.095
To-0.087
 narrower-0.083
Less-0.082
Park-0.082
Token Lily
Token ID20037
Feature activation-6.53e-02

Pos logit contributions
 disl+0.113
ach+0.099
az+0.098
uggets+0.095
ubby+0.093
Neg logit contributions
 distance-0.069
Oh-0.063
 narrower-0.063
Uh-0.062
Hmm-0.062
Token walked
Token ID6807
Feature activation-8.26e-02

Pos logit contributions
 disl+0.141
uggets+0.135
ach+0.135
ubby+0.126
az+0.125
Neg logit contributions
 distance-0.051
Maybe-0.049
 drifted-0.046
 sounded-0.046
 seemed-0.045
Token to
Token ID284
Feature activation-4.87e-02

Pos logit contributions
 disl+0.125
ach+0.113
az+0.110
uggets+0.105
ubby+0.102
Neg logit contributions
Uh-0.115
 distance-0.115
Oh-0.114
Hmm-0.113
Aw-0.112
Token air
Token ID1633
Feature activation-1.69e-01

Pos logit contributions
Oh+0.032
 distance+0.032
Ah+0.031
 narrower+0.030
Hmm+0.030
Neg logit contributions
 disl-0.037
ach-0.033
az-0.033
uggets-0.032
ubby-0.032
Token.
Token ID13
Feature activation-1.01e-01

Pos logit contributions
 disl+0.299
ach+0.253
uggets+0.251
az+0.249
 Majesty+0.246
Neg logit contributions
 distance-0.231
 narrower-0.196
Oh-0.195
 drifted-0.193
 million-0.191
Token He
Token ID679
Feature activation-5.57e-02

Pos logit contributions
 disl+0.202
ach+0.182
az+0.180
uggets+0.173
ubby+0.172
Neg logit contributions
 distance-0.101
Oh-0.096
Hmm-0.092
Uh-0.091
 narrower-0.091
Token watched
Token ID7342
Feature activation-1.07e-01

Pos logit contributions
 disl+0.113
ach+0.101
az+0.099
uggets+0.098
ubby+0.095
Neg logit contributions
 distance-0.056
Oh-0.051
 narrower-0.051
Uh-0.050
Hmm-0.050
Token with
Token ID351
Feature activation-8.23e-02

Pos logit contributions
 disl+0.194
az+0.163
ach+0.159
ubby+0.156
uggets+0.155
Neg logit contributions
 distance-0.129
 narrower-0.119
Oh-0.119
Who-0.115
 drifted-0.115
Token amaz
Token ID40642
Feature activation-1.88e-01

Pos logit contributions
 disl+0.147
ach+0.134
az+0.128
uggets+0.123
uc+0.123
Neg logit contributions
 distance-0.114
Oh-0.099
 narrower-0.099
Aw-0.096
Where-0.095
Tokenement
Token ID972
Feature activation-1.06e-01

Pos logit contributions
 disl+0.388
uggets+0.341
 majesty+0.322
 Majesty+0.319
ubby+0.317
Neg logit contributions
Oh-0.164
 distance-0.164
Ah-0.154
See-0.154
Uh-0.154
Token as
Token ID355
Feature activation-7.53e-02

Pos logit contributions
 disl+0.199
ubby+0.167
az+0.167
uggets+0.163
ere+0.160
Neg logit contributions
 distance-0.115
 drifted-0.107
 narrower-0.104
 eventually-0.103
He-0.101
Token the
Token ID262
Feature activation-5.06e-02

Pos logit contributions
 disl+0.146
ach+0.129
az+0.127
uggets+0.124
ubby+0.121
Neg logit contributions
 distance-0.084
 narrower-0.077
Oh-0.077
Hmm-0.074
Uh-0.073
Token z
Token ID1976
Feature activation-1.11e-01

Pos logit contributions
 disl+0.083
ach+0.071
az+0.069
 Majesty+0.066
ubby+0.065
Neg logit contributions
 distance-0.066
 narrower-0.065
Hmm-0.065
Finally-0.064
Uh-0.064
Tokenig
Token ID328
Feature activation-6.00e-02

Pos logit contributions
 disl+0.179
uggets+0.144
ach+0.127
 shrimp+0.118
 chefs+0.117
Neg logit contributions
 drifted-0.152
 distance-0.150
 narrower-0.146
 brake-0.142
Ah-0.141
Token go
Token ID467
Feature activation-1.04e-01

Pos logit contributions
rou+0.247
az+0.246
 disl+0.245
ubby+0.240
ach+0.239
Neg logit contributions
 distance-0.130
 other-0.106
 eventually-0.104
 lead-0.103
Who-0.103
Token!
Token ID0
Feature activation-7.81e-02

Pos logit contributions
 disl+0.173
ach+0.145
az+0.143
uggets+0.138
 Majesty+0.137
Neg logit contributions
 distance-0.154
 narrower-0.135
Oh-0.134
To-0.133
Park-0.131
Token Everyone
Token ID11075
Feature activation-6.48e-02

Pos logit contributions
 disl+0.150
ach+0.132
az+0.131
uggets+0.126
ubby+0.124
Neg logit contributions
 distance-0.087
Oh-0.081
 narrower-0.079
Hmm-0.079
Uh-0.078
Token watched
Token ID7342
Feature activation-1.02e-01

Pos logit contributions
 disl+0.126
ach+0.111
az+0.110
uggets+0.106
ubby+0.105
Neg logit contributions
 distance-0.070
Oh-0.065
 narrower-0.065
Hmm-0.064
Uh-0.064
Token in
Token ID287
Feature activation-6.00e-02

Pos logit contributions
 disl+0.162
az+0.133
ach+0.132
uggets+0.130
ubby+0.126
Neg logit contributions
 distance-0.146
 narrower-0.138
Oh-0.137
To-0.133
Uh-0.132
Token amaz
Token ID40642
Feature activation-1.88e-01

Pos logit contributions
 disl+0.088
ach+0.075
az+0.073
uggets+0.069
ubby+0.068
Neg logit contributions
 distance-0.092
Hmm-0.089
 narrower-0.089
Oh-0.089
Uh-0.088
Tokenement
Token ID972
Feature activation-1.01e-01

Pos logit contributions
 disl+0.400
uggets+0.361
 majesty+0.335
 Majesty+0.330
ubby+0.329
Neg logit contributions
 distance-0.148
Oh-0.145
See-0.137
To-0.136
Finally-0.136
Token as
Token ID355
Feature activation-7.33e-02

Pos logit contributions
 disl+0.173
az+0.142
uggets+0.141
ubby+0.140
ach+0.136
Neg logit contributions
 distance-0.131
 narrower-0.119
 drifted-0.118
To-0.117
 eventually-0.117
Token it
Token ID340
Feature activation-3.36e-02

Pos logit contributions
 disl+0.126
ach+0.110
az+0.109
uggets+0.106
ubby+0.103
Neg logit contributions
 distance-0.096
Oh-0.090
 narrower-0.090
Hmm-0.087
Uh-0.086
Token took
Token ID1718
Feature activation-7.23e-02

Pos logit contributions
 disl+0.064
az+0.057
ach+0.056
 Majesty+0.053
ci+0.052
Neg logit contributions
 distance-0.036
Oh-0.035
Hmm-0.034
Uh-0.034
Less-0.032
Token off
Token ID572
Feature activation-6.75e-02

Pos logit contributions
 disl+0.104
ach+0.091
az+0.088
uggets+0.080
ubby+0.080
Neg logit contributions
 distance-0.114
Uh-0.110
 narrower-0.109
Oh-0.108
Hmm-0.108
Tokenates
Token ID689
Feature activation-4.61e-02

Pos logit contributions
 disl+0.251
 Majesty+0.166
az+0.165
ach+0.153
 majesty+0.153
Neg logit contributions
 distance-0.327
 narrower-0.317
Park-0.317
To-0.316
Warning-0.315
Token.
Token ID13
Feature activation-3.12e-02

Pos logit contributions
 disl+0.090
az+0.074
 Majesty+0.072
 majesty+0.070
ach+0.069
Neg logit contributions
 distance-0.056
 narrower-0.050
To-0.050
 million-0.048
Oh-0.048
Token Lily
Token ID20037
Feature activation-4.28e-02

Pos logit contributions
 disl+0.069
ach+0.062
az+0.061
uc+0.058
ubby+0.057
Neg logit contributions
Oh-0.025
 distance-0.025
Hmm-0.025
Ah-0.024
To-0.024
Token loved
Token ID6151
Feature activation-8.17e-02

Pos logit contributions
 disl+0.089
ach+0.080
az+0.078
uggets+0.076
ubby+0.076
Neg logit contributions
 distance-0.041
 narrower-0.037
Hmm-0.036
Oh-0.036
Maybe-0.036
Token ch
Token ID442
Feature activation-9.06e-02

Pos logit contributions
 disl+0.136
ach+0.118
ubby+0.117
az+0.116
 Majesty+0.107
Neg logit contributions
 distance-0.109
Oh-0.103
 narrower-0.102
Who-0.101
Where-0.101
Tokenocol
Token ID4668
Feature activation-1.86e-01

Pos logit contributions
 disl+0.111
ci+0.090
 Majesty+0.073
 gol+0.070
ach+0.070
Neg logit contributions
He-0.147
 million-0.145
 narrower-0.142
 drifted-0.141
 bud-0.140
Tokenates
Token ID689
Feature activation-4.69e-02

Pos logit contributions
 disl+0.262
 Majesty+0.177
az+0.173
 gol+0.168
 majesty+0.160
Neg logit contributions
 distance-0.290
Ah-0.290
To-0.289
Less-0.287
Warning-0.286
Token.
Token ID13
Feature activation-2.65e-02

Pos logit contributions
 disl+0.092
az+0.078
ach+0.075
 Majesty+0.075
ubby+0.074
Neg logit contributions
 distance-0.055
 narrower-0.049
He-0.048
To-0.047
 million-0.046
Token She
Token ID1375
Feature activation-6.86e-03

Pos logit contributions
 disl+0.060
ach+0.055
az+0.054
ubby+0.052
uggets+0.052
Neg logit contributions
 distance-0.019
Oh-0.019
Hmm-0.019
Maybe-0.017
To-0.017
Token thanked
Token ID26280
Feature activation-6.43e-02

Pos logit contributions
 disl+0.013
ach+0.012
az+0.012
uggets+0.011
ubby+0.011
Neg logit contributions
 distance-0.008
Oh-0.007
 narrower-0.007
Hmm-0.007
Uh-0.007
Token Ben
Token ID3932
Feature activation-2.33e-02

Pos logit contributions
 disl+0.116
az+0.097
ach+0.096
ubby+0.093
rou+0.091
Neg logit contributions
 distance-0.081
Oh-0.079
Who-0.078
Less-0.075
Hmm-0.075
Token egg
Token ID5935
Feature activation-1.20e-01

Pos logit contributions
 disl+0.186
ach+0.158
az+0.157
uggets+0.154
ubby+0.146
Neg logit contributions
 distance-0.157
Oh-0.143
Uh-0.140
 million-0.139
To-0.138
Token!
Token ID0
Feature activation-7.40e-02

Pos logit contributions
 disl+0.223
ach+0.181
az+0.181
uggets+0.177
 Majesty+0.174
Neg logit contributions
 distance-0.156
 million-0.137
 drifted-0.135
Oh-0.135
To-0.133
Token He
Token ID679
Feature activation-4.35e-02

Pos logit contributions
 disl+0.143
ach+0.126
az+0.125
uggets+0.120
ubby+0.119
Neg logit contributions
 distance-0.081
Oh-0.077
Hmm-0.074
Uh-0.074
 narrower-0.073
Token watched
Token ID7342
Feature activation-9.45e-02

Pos logit contributions
 disl+0.093
ach+0.083
az+0.082
uggets+0.080
ubby+0.079
Neg logit contributions
 distance-0.039
Oh-0.035
 narrower-0.035
Uh-0.034
Hmm-0.034
Token in
Token ID287
Feature activation-5.19e-02

Pos logit contributions
 disl+0.162
az+0.134
ach+0.129
uggets+0.126
ubby+0.126
Neg logit contributions
 distance-0.129
 narrower-0.119
Oh-0.119
To-0.115
Ah-0.115
Token amaz
Token ID40642
Feature activation-1.85e-01

Pos logit contributions
 disl+0.081
ach+0.070
az+0.069
uggets+0.065
ubby+0.064
Neg logit contributions
 distance-0.076
 narrower-0.072
Oh-0.072
Hmm-0.071
Uh-0.071
Tokenement
Token ID972
Feature activation-9.81e-02

Pos logit contributions
 disl+0.404
uggets+0.357
 majesty+0.335
ubby+0.330
 Majesty+0.329
Neg logit contributions
Oh-0.141
 distance-0.139
See-0.136
He-0.132
Ah-0.132
Token as
Token ID355
Feature activation-7.17e-02

Pos logit contributions
 disl+0.170
az+0.140
ubby+0.136
uggets+0.133
ere+0.129
Neg logit contributions
 distance-0.128
 narrower-0.117
 drifted-0.116
He-0.116
 eventually-0.115
Token the
Token ID262
Feature activation-4.64e-02

Pos logit contributions
 disl+0.139
ach+0.122
az+0.121
uggets+0.119
ubby+0.115
Neg logit contributions
 distance-0.080
Oh-0.074
 narrower-0.074
Ah-0.071
Hmm-0.070
Token egg
Token ID5935
Feature activation-5.22e-02

Pos logit contributions
 disl+0.064
ach+0.056
az+0.053
 Majesty+0.052
ubby+0.052
Neg logit contributions
 narrower-0.071
 distance-0.070
Hmm-0.069
Park-0.068
To-0.068
Token began
Token ID2540
Feature activation-6.91e-02

Pos logit contributions
 disl+0.099
ach+0.087
az+0.086
uggets+0.083
ubby+0.082
Neg logit contributions
 distance-0.059
Oh-0.055
 narrower-0.054
Hmm-0.054
Uh-0.054
Token content
Token ID2695
Feature activation-4.41e-02

Pos logit contributions
 distance+0.003
 drifted+0.002
Oh+0.002
 narrower+0.002
To+0.002
Neg logit contributions
 disl-0.006
ach-0.006
az-0.005
uggets-0.005
ubby-0.005
Token as
Token ID355
Feature activation-1.93e-02

Pos logit contributions
 disl+0.075
ubby+0.064
az+0.063
uggets+0.061
ach+0.061
Neg logit contributions
 distance-0.062
To-0.055
 drifted-0.055
 eventually-0.054
Oh-0.054
Token he
Token ID339
Feature activation+3.75e-02

Pos logit contributions
 disl+0.046
ach+0.041
az+0.041
ubby+0.040
uggets+0.040
Neg logit contributions
 distance-0.012
 narrower-0.011
Oh-0.010
To-0.010
 eventually-0.010
Token and
Token ID290
Feature activation+1.29e-02

Pos logit contributions
 distance+0.041
 narrower+0.040
Oh+0.039
Uh+0.039
Hmm+0.039
Neg logit contributions
 disl-0.068
ach-0.062
az-0.061
rou-0.058
uc-0.058
Token his
Token ID465
Feature activation-8.57e-05

Pos logit contributions
 distance+0.014
 drifted+0.012
Oh+0.012
 eventually+0.011
Aw+0.011
Neg logit contributions
 disl-0.025
ach-0.024
ubby-0.023
az-0.023
uggets-0.023
Token mom
Token ID1995
Feature activation+8.71e-03

Pos logit contributions
 disl+0.000
ach+0.000
az+0.000
uggets+0.000
ubby+0.000
Neg logit contributions
 distance-0.000
Oh-0.000
 narrower-0.000
Aw-0.000
 drifted-0.000
Token left
Token ID1364
Feature activation-2.61e-02

Pos logit contributions
 distance+0.009
Uh+0.009
 narrower+0.009
Oh+0.009
Hmm+0.009
Neg logit contributions
 disl-0.017
ach-0.015
az-0.015
uggets-0.014
ubby-0.014
Token the
Token ID262
Feature activation+5.84e-03

Pos logit contributions
 disl+0.042
ach+0.036
az+0.035
uggets+0.033
ubby+0.033
Neg logit contributions
 distance-0.039
Oh-0.035
 narrower-0.035
Hmm-0.034
Uh-0.034
Token park
Token ID3952
Feature activation+1.03e-03

Pos logit contributions
 distance+0.008
Oh+0.007
 narrower+0.007
Hmm+0.007
Uh+0.007
Neg logit contributions
 disl-0.010
ach-0.009
az-0.009
uggets-0.008
ubby-0.008
Token.
Token ID13
Feature activation+1.43e-02

Pos logit contributions
 distance+0.002
 narrower+0.001
Oh+0.001
To+0.001
Hmm+0.001
Neg logit contributions
 disl-0.002
ach-0.001
az-0.001
ubby-0.001
uggets-0.001
Token<|endoftext|>
Token ID50256
Feature activation-5.79e-02

Pos logit contributions
 distance+0.019
Oh+0.018
Hmm+0.017
Uh+0.017
 narrower+0.017
Neg logit contributions
 disl-0.025
ach-0.022
az-0.022
uggets-0.021
ubby-0.020
Token and
Token ID290
Feature activation+3.82e-02

Pos logit contributions
 distance+0.023
Oh+0.022
 narrower+0.022
Uh+0.022
Hmm+0.021
Neg logit contributions
 disl-0.020
ach-0.017
az-0.016
uggets-0.015
ubby-0.015
Token he
Token ID339
Feature activation+5.55e-02

Pos logit contributions
 distance+0.037
 narrower+0.033
Oh+0.033
To+0.031
Hmm+0.031
Neg logit contributions
 disl-0.079
ach-0.072
az-0.070
uggets-0.069
ubby-0.069
Token and
Token ID290
Feature activation+3.67e-02

Pos logit contributions
 distance+0.056
Oh+0.052
 narrower+0.052
Uh+0.051
Hmm+0.051
Neg logit contributions
 disl-0.112
az-0.100
ach-0.098
 Majesty-0.094
ubby-0.093
Token his
Token ID465
Feature activation+1.68e-02

Pos logit contributions
 distance+0.050
Oh+0.047
 narrower+0.046
Hmm+0.045
Uh+0.045
Neg logit contributions
 disl-0.062
ach-0.055
az-0.053
uggets-0.051
ubby-0.050
Token friend
Token ID1545
Feature activation+4.17e-02

Pos logit contributions
 distance+0.030
Oh+0.029
 narrower+0.029
Hmm+0.029
Uh+0.029
Neg logit contributions
 disl-0.021
ach-0.017
az-0.017
uggets-0.016
ubby-0.015
Token were
Token ID547
Feature activation+3.03e-02

Pos logit contributions
Uh+0.049
 distance+0.049
Oh+0.049
Hmm+0.048
 narrower+0.048
Neg logit contributions
 disl-0.075
ach-0.065
az-0.065
 Majesty-0.061
uggets-0.061
Token friends
Token ID2460
Feature activation+8.17e-03

Pos logit contributions
 distance+0.034
Oh+0.030
 narrower+0.030
Less+0.029
 eventually+0.029
Neg logit contributions
 disl-0.060
ach-0.054
az-0.050
uggets-0.050
ubby-0.048
Token again
Token ID757
Feature activation+1.51e-02

Pos logit contributions
 distance+0.010
Oh+0.009
 narrower+0.009
Hmm+0.009
Uh+0.009
Neg logit contributions
 disl-0.015
ach-0.013
az-0.013
uggets-0.012
ubby-0.012
Token.
Token ID13
Feature activation+4.12e-02

Pos logit contributions
 distance+0.021
 narrower+0.020
Oh+0.020
Hmm+0.019
Uh+0.019
Neg logit contributions
 disl-0.025
ach-0.021
az-0.021
uggets-0.020
ubby-0.020
Token<|endoftext|>
Token ID50256
Feature activation-3.48e-02

Pos logit contributions
 distance+0.052
 narrower+0.050
Uh+0.048
Hmm+0.047
Oh+0.047
Neg logit contributions
 disl-0.074
az-0.064
ach-0.063
uggets-0.062
ubby-0.060
TokenOne
Token ID3198
Feature activation+3.47e-03

Pos logit contributions
 disl+0.062
ach+0.056
az+0.056
uggets+0.052
ubby+0.052
Neg logit contributions
 distance-0.041
 narrower-0.039
Hmm-0.038
Uh-0.038
Oh-0.038
Token a
Token ID257
Feature activation+2.05e-02

Pos logit contributions
 disl+0.024
ach+0.020
ubby+0.020
az+0.020
uggets+0.020
Neg logit contributions
 distance-0.018
 narrower-0.017
Ah-0.017
Oh-0.017
Hmm-0.016
Token big
Token ID1263
Feature activation-8.33e-03

Pos logit contributions
 distance+0.029
Hmm+0.029
 narrower+0.028
Oh+0.028
Warning+0.028
Neg logit contributions
 disl-0.031
ach-0.027
az-0.025
 Majesty-0.025
uggets-0.024
Token wheel
Token ID7825
Feature activation+4.26e-03

Pos logit contributions
 disl+0.014
uggets+0.011
ach+0.011
az+0.011
 Majesty+0.010
Neg logit contributions
 distance-0.013
Uh-0.011
Oh-0.011
 narrower-0.011
 million-0.011
Token and
Token ID290
Feature activation+2.57e-02

Pos logit contributions
 distance+0.006
 narrower+0.006
Oh+0.006
To+0.005
Uh+0.005
Neg logit contributions
 disl-0.007
uggets-0.006
ach-0.006
 Majesty-0.006
az-0.006
Token happy
Token ID3772
Feature activation-1.11e-02

Pos logit contributions
 distance+0.027
Oh+0.025
 narrower+0.025
Uh+0.025
Hmm+0.024
Neg logit contributions
 disl-0.050
ach-0.044
az-0.044
uggets-0.042
 Majesty-0.041
Token faces
Token ID6698
Feature activation+2.98e-03

Pos logit contributions
 disl+0.018
ubby+0.015
az+0.015
uggets+0.015
ach+0.015
Neg logit contributions
 distance-0.016
Oh-0.014
 narrower-0.014
To-0.014
Who-0.014
Token everywhere
Token ID8347
Feature activation-2.43e-02

Pos logit contributions
 distance+0.005
Uh+0.005
Hmm+0.005
Oh+0.005
Ah+0.005
Neg logit contributions
 disl-0.004
ach-0.003
az-0.003
uggets-0.003
rou-0.003
Token.
Token ID13
Feature activation+2.86e-02

Pos logit contributions
 disl+0.032
ach+0.028
az+0.027
uggets+0.025
ubby+0.024
Neg logit contributions
 distance-0.041
Uh-0.041
Hmm-0.040
Oh-0.039
Aw-0.038
Token He
Token ID679
Feature activation+4.45e-02

Pos logit contributions
 distance+0.032
 narrower+0.030
Uh+0.030
Hmm+0.029
Oh+0.029
Neg logit contributions
 disl-0.055
az-0.048
ach-0.048
uggets-0.045
 Majesty-0.044
Token noticed
Token ID6810
Feature activation-1.03e-02

Pos logit contributions
 distance+0.049
Oh+0.045
 narrower+0.045
Hmm+0.044
Uh+0.044
Neg logit contributions
 disl-0.086
ach-0.077
az-0.077
 Majesty-0.072
uggets-0.072
Token a
Token ID257
Feature activation+2.77e-02

Pos logit contributions
 disl+0.016
ubby+0.013
uggets+0.013
ach+0.013
az+0.013
Neg logit contributions
 distance-0.016
 narrower-0.015
Oh-0.015
Ah-0.014
Where-0.014
Token She
Token ID1375
Feature activation+4.81e-02

Pos logit contributions
 distance+0.039
 narrower+0.037
Uh+0.036
Oh+0.035
Hmm+0.034
Neg logit contributions
 disl-0.065
az-0.058
ach-0.057
uggets-0.055
 Majesty-0.053
Token asked
Token ID1965
Feature activation+1.30e-02

Pos logit contributions
 distance+0.057
Oh+0.052
 narrower+0.051
Aw+0.049
Uh+0.049
Neg logit contributions
 disl-0.086
az-0.078
ach-0.078
uggets-0.073
ubby-0.073
Token her
Token ID607
Feature activation+1.92e-02

Pos logit contributions
 distance+0.022
Oh+0.021
 narrower+0.021
Uh+0.021
Hmm+0.020
Neg logit contributions
 disl-0.018
ach-0.015
az-0.015
uggets-0.014
ubby-0.014
Token mom
Token ID1995
Feature activation+2.46e-02

Pos logit contributions
 distance+0.024
Oh+0.023
 narrower+0.023
Hmm+0.022
To+0.022
Neg logit contributions
 disl-0.034
ach-0.030
az-0.029
uggets-0.028
ubby-0.028
Token if
Token ID611
Feature activation+4.31e-02

Pos logit contributions
 distance+0.029
To+0.026
 narrower+0.026
Where+0.026
Maybe+0.026
Neg logit contributions
 disl-0.048
ach-0.040
az-0.040
 majesty-0.038
 Majesty-0.038
Token she
Token ID673
Feature activation+6.03e-02

Pos logit contributions
 distance+0.031
Oh+0.029
 narrower+0.028
Hmm+0.027
To+0.026
Neg logit contributions
 disl-0.099
ach-0.091
az-0.091
ubby-0.088
uggets-0.088
Token could
Token ID714
Feature activation+3.68e-02

Pos logit contributions
 distance+0.070
 narrower+0.062
Uh+0.061
Oh+0.060
Park+0.060
Neg logit contributions
 disl-0.112
az-0.102
ach-0.100
uggets-0.096
ubby-0.096
Token use
Token ID779
Feature activation+2.59e-02

Pos logit contributions
 distance+0.054
Oh+0.053
Uh+0.052
 narrower+0.052
Aw+0.051
Neg logit contributions
 disl-0.056
ach-0.050
az-0.049
uggets-0.046
ubby-0.045
Token the
Token ID262
Feature activation+5.25e-02

Pos logit contributions
 distance+0.042
Oh+0.039
 narrower+0.039
Uh+0.039
Hmm+0.039
Neg logit contributions
 disl-0.037
ach-0.031
az-0.031
uggets-0.029
ubby-0.028
Token kitchen
Token ID9592
Feature activation+4.79e-02

Pos logit contributions
 distance+0.079
 narrower+0.075
Uh+0.075
Oh+0.075
Hmm+0.074
Neg logit contributions
 disl-0.078
ach-0.068
az-0.067
ubby-0.063
uggets-0.063
Token scale
Token ID5046
Feature activation+4.76e-02

Pos logit contributions
 distance+0.072
 narrower+0.068
Oh+0.068
Hmm+0.067
Uh+0.066
Neg logit contributions
 disl-0.074
ach-0.062
az-0.061
uggets-0.058
ubby-0.057
Token happy
Token ID3772
Feature activation+1.99e-04

Pos logit contributions
 distance+0.061
Oh+0.057
 narrower+0.057
Uh+0.056
Hmm+0.056
Neg logit contributions
 disl-0.067
ach-0.057
az-0.056
uggets-0.054
ubby-0.053
Token and
Token ID290
Feature activation+4.07e-02

Pos logit contributions
 distance+0.000
Oh+0.000
 narrower+0.000
Hmm+0.000
To+0.000
Neg logit contributions
 disl-0.000
ach-0.000
az-0.000
ubby-0.000
uggets-0.000
Token proud
Token ID6613
Feature activation-5.62e-03

Pos logit contributions
 distance+0.039
Oh+0.034
 narrower+0.034
Uh+0.033
To+0.033
Neg logit contributions
 disl-0.085
ach-0.078
az-0.076
ubby-0.074
uggets-0.074
Token of
Token ID286
Feature activation+5.57e-03

Pos logit contributions
 disl+0.008
ach+0.006
az+0.006
uggets+0.006
ubby+0.006
Neg logit contributions
 distance-0.010
Uh-0.009
Hmm-0.009
 narrower-0.009
Oh-0.009
Token their
Token ID511
Feature activation+4.72e-02

Pos logit contributions
 distance+0.009
 narrower+0.009
Uh+0.009
Oh+0.008
Hmm+0.008
Neg logit contributions
 disl-0.008
ach-0.007
az-0.007
uggets-0.006
ubby-0.006
Token new
Token ID649
Feature activation+2.35e-02

Pos logit contributions
 distance+0.076
Oh+0.072
Uh+0.072
 narrower+0.072
Hmm+0.072
Neg logit contributions
 disl-0.068
ach-0.057
az-0.056
uggets-0.053
ubby-0.053
Token baby
Token ID5156
Feature activation+4.17e-02

Pos logit contributions
 distance+0.034
Oh+0.032
 narrower+0.031
Hmm+0.031
Uh+0.031
Neg logit contributions
 disl-0.038
ach-0.033
az-0.032
uggets-0.031
ubby-0.030
Token.
Token ID13
Feature activation+4.58e-02

Pos logit contributions
Uh+0.071
 distance+0.070
Hmm+0.068
Oh+0.068
 narrower+0.067
Neg logit contributions
 disl-0.054
ach-0.046
az-0.045
uggets-0.044
ubby-0.042
Token
Token ID198
Feature activation+2.10e-02

Pos logit contributions
 distance+0.056
 narrower+0.053
Oh+0.052
Uh+0.051
Hmm+0.051
Neg logit contributions
 disl-0.084
ach-0.073
az-0.072
uggets-0.070
ubby-0.068
Token
Token ID198
Feature activation+3.22e-02

Pos logit contributions
 distance+0.025
 narrower+0.024
Uh+0.023
Hmm+0.023
Oh+0.023
Neg logit contributions
 disl-0.038
ach-0.034
az-0.033
uggets-0.032
ubby-0.031
TokenThey
Token ID2990
Feature activation+6.28e-02

Pos logit contributions
 distance+0.037
 narrower+0.033
 vengeance+0.030
Aw+0.030
Uh+0.030
Neg logit contributions
 disl-0.061
ach-0.054
ere-0.049
ubby-0.048
uggets-0.048
TokenMom
Token ID29252
Feature activation-4.58e-02

Pos logit contributions
 disl+0.075
ach+0.067
ubby+0.062
uggets+0.062
az+0.060
Neg logit contributions
 distance-0.051
 narrower-0.046
 vengeance-0.043
 drifted-0.043
Hmm-0.043
Token hears
Token ID23180
Feature activation-5.40e-02

Pos logit contributions
 disl+0.071
az+0.071
ubby+0.063
rou+0.063
ach+0.063
Neg logit contributions
 distance-0.062
Aw-0.060
Hmm-0.059
 narrower-0.059
Ah-0.057
Token the
Token ID262
Feature activation-2.18e-02

Pos logit contributions
 disl+0.099
ach+0.087
az+0.086
uggets+0.085
 Majesty+0.085
Neg logit contributions
 distance-0.062
Oh-0.060
Who-0.058
Where-0.057
Park-0.056
Token noise
Token ID7838
Feature activation-5.14e-02

Pos logit contributions
 disl+0.037
az+0.031
ach+0.031
ubby+0.029
 Majesty+0.029
Neg logit contributions
 distance-0.029
 narrower-0.028
Hmm-0.028
To-0.027
Aw-0.027
Token.
Token ID13
Feature activation-2.16e-02

Pos logit contributions
 disl+0.034
az+0.030
ach+0.028
iggle+0.024
?"+0.023
Neg logit contributions
Ah-0.118
Hmm-0.116
Aw-0.116
 narrower-0.113
Finally-0.112
Token She
Token ID1375
Feature activation+1.98e-03

Pos logit contributions
 disl+0.035
az+0.031
ach+0.028
uck+0.026
ubby+0.026
Neg logit contributions
 distance-0.030
 narrower-0.029
Aw-0.028
Less-0.028
Finally-0.028
Token comes
Token ID2058
Feature activation-4.40e-02

Pos logit contributions
 distance+0.002
Oh+0.002
 narrower+0.002
Uh+0.002
Hmm+0.002
Neg logit contributions
 disl-0.004
ach-0.004
az-0.004
uggets-0.003
ubby-0.003
Token to
Token ID284
Feature activation-2.59e-02

Pos logit contributions
 disl+0.061
ach+0.052
az+0.052
uggets+0.049
ubby+0.047
Neg logit contributions
 distance-0.071
Hmm-0.068
Oh-0.068
Uh-0.068
 narrower-0.068
Token the
Token ID262
Feature activation-6.55e-03

Pos logit contributions
 disl+0.039
ach+0.033
az+0.033
uggets+0.031
ubby+0.030
Neg logit contributions
 distance-0.040
Oh-0.038
 narrower-0.038
Uh-0.037
Hmm-0.037
Token living
Token ID2877
Feature activation-7.95e-02

Pos logit contributions
 disl+0.009
az+0.007
ach+0.007
 Majesty+0.007
 majesty+0.007
Neg logit contributions
 narrower-0.010
Hmm-0.010
 distance-0.010
Uh-0.010
Aw-0.010
Token room
Token ID2119
Feature activation-3.57e-02

Pos logit contributions
 disl+0.148
ach+0.122
uggets+0.115
 Majesty+0.114
az+0.111
Neg logit contributions
 distance-0.112
Oh-0.102
Where-0.097
Who-0.093
Uh-0.093
Token feathers
Token ID27400
Feature activation-3.34e-02

Pos logit contributions
 disl+0.030
ach+0.025
az+0.025
uggets+0.023
ubby+0.023
Neg logit contributions
 distance-0.037
Oh-0.035
Hmm-0.035
Uh-0.035
 narrower-0.035
Token.
Token ID13
Feature activation-1.08e-02

Pos logit contributions
 disl+0.076
 Majesty+0.065
 majesty+0.064
ach+0.063
az+0.063
Neg logit contributions
 distance-0.035
 narrower-0.026
He-0.026
 drifted-0.025
To-0.025
Token She
Token ID1375
Feature activation+5.40e-03

Pos logit contributions
 disl+0.016
ach+0.014
az+0.014
uggets+0.013
ubby+0.012
Neg logit contributions
 distance-0.017
 narrower-0.016
Oh-0.016
Uh-0.016
Hmm-0.016
Token wanted
Token ID2227
Feature activation-2.68e-02

Pos logit contributions
 distance+0.004
Ah+0.004
Maybe+0.004
Oh+0.004
To+0.004
Neg logit contributions
 disl-0.012
ach-0.011
az-0.011
uggets-0.011
 Majesty-0.011
Token to
Token ID284
Feature activation-7.03e-03

Pos logit contributions
 disl+0.034
ach+0.028
az+0.028
uggets+0.027
 Majesty+0.024
Neg logit contributions
 distance-0.047
Uh-0.045
Hmm-0.045
Oh-0.045
 narrower-0.045
Token make
Token ID787
Feature activation-2.27e-02

Pos logit contributions
 disl+0.015
rou+0.014
uggets+0.013
ach+0.013
az+0.013
Neg logit contributions
 distance-0.008
To-0.006
 lead-0.006
Who-0.006
 brake-0.006
Token a
Token ID257
Feature activation+1.88e-02

Pos logit contributions
 disl+0.041
ubby+0.039
uggets+0.037
ach+0.037
az+0.036
Neg logit contributions
Ah-0.024
 other-0.024
 narrower-0.023
 distance-0.023
See-0.023
Token nest
Token ID16343
Feature activation-2.87e-02

Pos logit contributions
 distance+0.027
Oh+0.025
 narrower+0.025
Uh+0.025
Hmm+0.025
Neg logit contributions
 disl-0.030
ach-0.026
az-0.026
uggets-0.024
ubby-0.024
Token for
Token ID329
Feature activation-1.89e-02

Pos logit contributions
 disl+0.060
 Majesty+0.052
ach+0.050
 majesty+0.050
ubby+0.050
Neg logit contributions
 distance-0.036
 thicker-0.028
 better-0.027
 narrower-0.027
To-0.027
Token her
Token ID607
Feature activation-1.30e-02

Pos logit contributions
 disl+0.027
ach+0.022
az+0.022
uggets+0.021
 Majesty+0.020
Neg logit contributions
 distance-0.031
Uh-0.030
Oh-0.030
Hmm-0.030
 narrower-0.029
Token and
Token ID290
Feature activation+1.83e-02

Pos logit contributions
 disl+0.018
ach+0.015
az+0.015
uggets+0.014
ubby+0.014
Neg logit contributions
 distance-0.022
 narrower-0.020
Oh-0.020
Hmm-0.020
Uh-0.020
Token and
Token ID290
Feature activation+1.73e-02

Pos logit contributions
 disl+0.027
ach+0.023
az+0.023
uggets+0.021
ubby+0.021
Neg logit contributions
 distance-0.028
Oh-0.027
Uh-0.027
Hmm-0.027
 narrower-0.027
Token he
Token ID339
Feature activation+2.43e-02

Pos logit contributions
 distance+0.020
Oh+0.018
Hmm+0.018
 narrower+0.018
Uh+0.018
Neg logit contributions
 disl-0.033
ach-0.028
az-0.028
 Majesty-0.027
uggets-0.027
Token couldn
Token ID3521
Feature activation-2.55e-02

Pos logit contributions
 distance+0.029
Oh+0.027
Hmm+0.027
Uh+0.026
 narrower+0.026
Neg logit contributions
 disl-0.044
az-0.039
ach-0.039
 Majesty-0.037
ubby-0.036
Token't
Token ID470
Feature activation+1.68e-03

Pos logit contributions
 disl+0.051
ach+0.045
uggets+0.044
az+0.044
 Majesty+0.043
Neg logit contributions
 distance-0.026
Oh-0.025
 narrower-0.024
Uh-0.024
To-0.024
Token move
Token ID1445
Feature activation-1.18e-02

Pos logit contributions
Oh+0.003
Uh+0.003
Aw+0.003
See+0.003
 million+0.002
Neg logit contributions
 disl-0.002
ach-0.002
az-0.002
?"-0.002
ubby-0.002
Token anymore
Token ID7471
Feature activation-2.02e-02

Pos logit contributions
 disl+0.018
ach+0.016
az+0.016
uggets+0.015
ubby+0.014
Neg logit contributions
 distance-0.017
Uh-0.017
Oh-0.017
Hmm-0.017
 narrower-0.016
Token.
Token ID13
Feature activation+9.90e-03

Pos logit contributions
 disl+0.028
ach+0.024
az+0.023
uggets+0.022
 Majesty+0.020
Neg logit contributions
 distance-0.033
Uh-0.033
Oh-0.032
Hmm-0.032
 narrower-0.031
Token Red
Token ID2297
Feature activation+3.00e-03

Pos logit contributions
 distance+0.011
 narrower+0.010
Uh+0.010
Hmm+0.010
Oh+0.010
Neg logit contributions
 disl-0.019
ach-0.016
az-0.016
uggets-0.015
 Majesty-0.015
Token was
Token ID373
Feature activation+8.87e-04

Pos logit contributions
 distance+0.003
 narrower+0.003
Oh+0.003
Uh+0.003
Hmm+0.003
Neg logit contributions
 disl-0.006
ach-0.006
az-0.006
uggets-0.005
ubby-0.005
Token sad
Token ID6507
Feature activation-2.64e-02

Pos logit contributions
 distance+0.001
 narrower+0.001
Oh+0.001
Uh+0.001
 darker+0.001
Neg logit contributions
 disl-0.002
ach-0.002
uggets-0.002
 majesty-0.002
az-0.002
Token and
Token ID290
Feature activation+2.61e-02

Pos logit contributions
 disl+0.037
ach+0.031
az+0.030
uggets+0.029
ubby+0.028
Neg logit contributions
 distance-0.044
Oh-0.042
 narrower-0.041
Hmm-0.041
Uh-0.041
Token day
Token ID1110
Feature activation-2.18e-02

Pos logit contributions
 distance+0.002
Oh+0.002
To+0.002
 narrower+0.002
Uh+0.002
Neg logit contributions
 disl-0.004
ach-0.004
az-0.003
uggets-0.003
 Majesty-0.003
Token,
Token ID11
Feature activation-7.78e-03

Pos logit contributions
 disl+0.029
ach+0.023
az+0.023
uggets+0.022
 Majesty+0.021
Neg logit contributions
 distance-0.038
Oh-0.037
 narrower-0.036
To-0.036
Uh-0.036
Token she
Token ID673
Feature activation+1.47e-02

Pos logit contributions
 disl+0.016
az+0.014
ach+0.014
ubby+0.013
uggets+0.013
Neg logit contributions
 distance-0.008
Oh-0.008
Hmm-0.008
 narrower-0.008
Uh-0.007
Token saw
Token ID2497
Feature activation-2.94e-02

Pos logit contributions
 distance+0.011
Maybe+0.010
To+0.010
 drifted+0.010
Oh+0.009
Neg logit contributions
 disl-0.035
uggets-0.031
ach-0.031
az-0.029
 Majesty-0.029
Token a
Token ID257
Feature activation+6.32e-03

Pos logit contributions
 disl+0.057
ach+0.055
ubby+0.051
az+0.051
uggets+0.051
Neg logit contributions
Oh-0.033
 distance-0.033
 other-0.033
Ah-0.033
Where-0.032
Token carpet
Token ID20710
Feature activation-2.06e-03

Pos logit contributions
 distance+0.009
Oh+0.009
 narrower+0.008
Uh+0.008
Hmm+0.008
Neg logit contributions
 disl-0.010
ach-0.009
az-0.009
uggets-0.008
ubby-0.008
Token in
Token ID287
Feature activation+4.48e-03

Pos logit contributions
 disl+0.003
ach+0.003
az+0.003
 Majesty+0.003
uggets+0.003
Neg logit contributions
 distance-0.003
 narrower-0.003
Oh-0.003
To-0.003
Uh-0.003
Token the
Token ID262
Feature activation+1.64e-02

Pos logit contributions
 distance+0.007
Oh+0.007
 narrower+0.006
Hmm+0.006
Uh+0.006
Neg logit contributions
 disl-0.007
ach-0.006
az-0.006
uggets-0.006
ubby-0.006
Token hallway
Token ID23959
Feature activation-1.91e-02

Pos logit contributions
 narrower+0.021
Uh+0.021
Hmm+0.021
Aw+0.021
 distance+0.020
Neg logit contributions
 disl-0.026
az-0.023
ach-0.023
uggets-0.021
ubby-0.021
Token and
Token ID290
Feature activation+2.08e-02

Pos logit contributions
 disl+0.029
ach+0.025
az+0.024
uggets+0.023
ubby+0.023
Neg logit contributions
 distance-0.029
Oh-0.027
 narrower-0.027
Hmm-0.027
Uh-0.027
Token decided
Token ID3066
Feature activation+5.32e-03

Pos logit contributions
 distance+0.023
 narrower+0.021
Oh+0.021
Uh+0.021
Hmm+0.021
Neg logit contributions
 disl-0.040
ach-0.035
az-0.035
uggets-0.033
 Majesty-0.033
Token girl
Token ID2576
Feature activation-3.59e-02

Pos logit contributions
 disl+0.141
ach+0.125
uggets+0.123
az+0.120
uc+0.114
Neg logit contributions
 distance-0.075
Oh-0.068
Uh-0.065
 narrower-0.064
Hmm-0.062
Token is
Token ID318
Feature activation-3.45e-02

Pos logit contributions
 disl+0.073
ach+0.064
az+0.063
uggets+0.062
ubby+0.061
Neg logit contributions
 distance-0.037
Oh-0.033
 narrower-0.033
Uh-0.032
Hmm-0.032
Token surprised
Token ID6655
Feature activation-8.17e-02

Pos logit contributions
 disl+0.067
ach+0.059
uggets+0.055
az+0.055
uc+0.054
Neg logit contributions
 distance-0.042
 eventually-0.036
 narrower-0.036
Oh-0.035
See-0.034
Token and
Token ID290
Feature activation-2.13e-02

Pos logit contributions
 disl+0.131
uggets+0.108
az+0.106
ach+0.105
 Majesty+0.105
Neg logit contributions
 distance-0.120
Oh-0.114
To-0.109
 narrower-0.109
Where-0.108
Token happy
Token ID3772
Feature activation-6.30e-02

Pos logit contributions
 disl+0.040
ach+0.038
uggets+0.038
az+0.037
ubby+0.036
Neg logit contributions
 distance-0.024
 narrower-0.021
Oh-0.021
Uh-0.020
See-0.020
Token she
Token ID673
Feature activation-1.08e-03

Pos logit contributions
 disl+0.092
ach+0.078
az+0.077
uggets+0.074
ubby+0.073
Neg logit contributions
 distance-0.099
Oh-0.094
 narrower-0.093
Hmm-0.091
Uh-0.091
Token stayed
Token ID9658
Feature activation-4.49e-02

Pos logit contributions
 disl+0.002
uggets+0.002
ach+0.002
az+0.002
ubby+0.002
Neg logit contributions
 distance-0.001
Oh-0.001
To-0.001
 narrower-0.001
Uh-0.001
Token brave
Token ID14802
Feature activation-6.13e-02

Pos logit contributions
 disl+0.086
uggets+0.069
az+0.069
ach+0.069
 majesty+0.068
Neg logit contributions
 distance-0.061
 eventually-0.049
 narrower-0.049
Where-0.049
Oh-0.049
Token.
Token ID13
Feature activation-1.25e-02

Pos logit contributions
 disl+0.118
uggets+0.098
ach+0.095
az+0.094
uc+0.092
Neg logit contributions
 distance-0.077
 narrower-0.067
Oh-0.066
 million-0.064
Where-0.063
Token
Token ID220
Feature activation-7.55e-02

Pos logit contributions
 disl+0.024
az+0.021
ach+0.021
uggets+0.020
 Majesty+0.020
Neg logit contributions
 distance-0.014
 narrower-0.013
Uh-0.013
Oh-0.013
Hmm-0.013
Token
Token ID198
Feature activation-3.02e-02

Pos logit contributions
 disl+0.173
ach+0.155
az+0.150
ubby+0.148
uggets+0.147
Neg logit contributions
 distance-0.052
To-0.051
Oh-0.049
Finally-0.049
Less-0.047
Token "
Token ID366
Feature activation-7.46e-02

Pos logit contributions
 disl+0.157
ach+0.138
az+0.137
uggets+0.130
ubby+0.130
Neg logit contributions
 distance-0.096
Oh-0.088
 narrower-0.087
Uh-0.085
Hmm-0.084
TokenYes
Token ID5297
Feature activation-9.60e-02

Pos logit contributions
 disl+0.173
ach+0.156
az+0.153
uggets+0.149
ubby+0.146
Neg logit contributions
 distance-0.054
 narrower-0.049
Uh-0.043
Oh-0.043
Hmm-0.043
Token,
Token ID11
Feature activation-7.10e-02

Pos logit contributions
 disl+0.131
uggets+0.108
ach+0.107
az+0.101
ubby+0.094
Neg logit contributions
 distance-0.157
Oh-0.153
 narrower-0.150
Hmm-0.149
Uh-0.148
Token mum
Token ID25682
Feature activation-8.69e-02

Pos logit contributions
 disl+0.134
ach+0.119
uggets+0.117
az+0.117
ishing+0.112
Neg logit contributions
 distance-0.082
Oh-0.078
Maybe-0.075
To-0.074
Who-0.074
Token."
Token ID526
Feature activation-4.70e-02

Pos logit contributions
 disl+0.153
uggets+0.132
ach+0.131
 gol+0.119
 Majesty+0.119
Neg logit contributions
 distance-0.101
 had-0.092
To-0.088
 million-0.088
 sounded-0.088
Token
Token ID220
Feature activation-1.08e-01

Pos logit contributions
 disl+0.092
ach+0.081
az+0.080
uggets+0.077
ubby+0.076
Neg logit contributions
 distance-0.051
Oh-0.047
 narrower-0.047
Hmm-0.046
Uh-0.046
Token
Token ID198
Feature activation-5.48e-02

Pos logit contributions
 disl+0.245
ach+0.217
az+0.213
ubby+0.207
 Majesty+0.206
Neg logit contributions
 distance-0.080
To-0.074
Oh-0.074
Finally-0.073
Less-0.070
Token
Token ID198
Feature activation-4.66e-02

Pos logit contributions
 disl+0.105
ach+0.094
az+0.091
uggets+0.090
ubby+0.087
Neg logit contributions
 distance-0.061
 narrower-0.058
Uh-0.055
Oh-0.055
Hmm-0.055
TokenM
Token ID44
Feature activation-7.73e-02

Pos logit contributions
 disl+0.100
ach+0.090
az+0.087
uggets+0.086
ubby+0.084
Neg logit contributions
 distance-0.042
 narrower-0.039
Oh-0.036
Uh-0.036
Hmm-0.036
Tokenum
Token ID388
Feature activation-6.38e-02

Pos logit contributions
 disl+0.102
az+0.078
ach+0.077
uggets+0.076
 Majesty+0.071
Neg logit contributions
 distance-0.131
Oh-0.126
Hmm-0.126
Uh-0.125
Finally-0.125
Token then
Token ID788
Feature activation-3.37e-02

Pos logit contributions
 disl+0.128
ach+0.112
az+0.111
uggets+0.109
 Majesty+0.106
Neg logit contributions
 distance-0.066
Oh-0.061
 narrower-0.060
Uh-0.060
Hmm-0.059
Token to
Token ID284
Feature activation-5.36e-02

Pos logit contributions
 disl+0.111
ach+0.093
az+0.092
uggets+0.087
ubby+0.085
Neg logit contributions
 distance-0.127
Oh-0.121
 narrower-0.120
Hmm-0.119
Uh-0.119
Token go
Token ID467
Feature activation-6.40e-02

Pos logit contributions
 disl+0.113
uggets+0.102
az+0.102
ach+0.101
uc+0.101
Neg logit contributions
 distance-0.054
 eventually-0.043
Park-0.041
 drifted-0.041
Less-0.040
Token home
Token ID1363
Feature activation-8.08e-02

Pos logit contributions
 disl+0.091
ach+0.077
az+0.076
uggets+0.072
ubby+0.070
Neg logit contributions
 distance-0.103
Oh-0.098
 narrower-0.098
Uh-0.097
Hmm-0.097
Token.
Token ID13
Feature activation-5.13e-02

Pos logit contributions
 disl+0.155
az+0.130
ach+0.130
ubby+0.129
uggets+0.126
Neg logit contributions
 distance-0.100
 narrower-0.082
To-0.081
 eventually-0.080
 million-0.080
Token They
Token ID1119
Feature activation-2.13e-02

Pos logit contributions
 disl+0.098
ach+0.086
az+0.085
uggets+0.083
ubby+0.081
Neg logit contributions
 distance-0.058
 narrower-0.053
Oh-0.053
Uh-0.052
Hmm-0.052
Token smiled
Token ID13541
Feature activation-1.03e-01

Pos logit contributions
 disl+0.039
ach+0.034
az+0.033
uggets+0.032
 Majesty+0.032
Neg logit contributions
Oh-0.023
 narrower-0.023
 distance-0.023
Uh-0.022
Hmm-0.022
Token and
Token ID290
Feature activation-2.76e-02

Pos logit contributions
 disl+0.177
az+0.147
ach+0.146
uggets+0.142
ubby+0.138
Neg logit contributions
 distance-0.147
 narrower-0.133
 drifted-0.130
Oh-0.129
Maybe-0.129
Token hugged
Token ID41130
Feature activation-8.22e-02

Pos logit contributions
 disl+0.058
ach+0.051
az+0.051
uggets+0.049
ubby+0.048
Neg logit contributions
 distance-0.026
Oh-0.024
 narrower-0.024
Uh-0.023
Hmm-0.023
Token each
Token ID1123
Feature activation-4.71e-02

Pos logit contributions
 disl+0.121
ach+0.101
az+0.101
uggets+0.095
ubby+0.095
Neg logit contributions
 distance-0.131
Oh-0.123
 narrower-0.121
Hmm-0.120
Uh-0.118
Token other
Token ID584
Feature activation-4.48e-02

Pos logit contributions
ubby+0.083
 disl+0.083
az+0.079
ach+0.077
 shrimp+0.075
Neg logit contributions
 narrower-0.048
Uh-0.046
Hmm-0.044
Aw-0.041
To-0.040
Token,
Token ID11
Feature activation-2.53e-02

Pos logit contributions
 disl+0.076
ach+0.065
az+0.065
uggets+0.062
ubby+0.061
Neg logit contributions
 distance-0.061
 narrower-0.056
Oh-0.056
Hmm-0.055
To-0.054
Token mess
Token ID2085
Feature activation-1.43e-01

Pos logit contributions
 disl+0.113
ach+0.096
az+0.093
 Majesty+0.091
uggets+0.090
Neg logit contributions
 distance-0.096
Hmm-0.094
 narrower-0.093
Oh-0.093
Warning-0.090
Token on
Token ID319
Feature activation-8.93e-02

Pos logit contributions
 disl+0.248
ach+0.208
uggets+0.208
 Majesty+0.207
az+0.202
Neg logit contributions
 distance-0.187
Oh-0.178
 million-0.176
 narrower-0.172
To-0.172
Token the
Token ID262
Feature activation-8.83e-02

Pos logit contributions
 disl+0.155
ach+0.133
uggets+0.130
az+0.128
ubby+0.128
Neg logit contributions
 distance-0.120
Oh-0.114
 million-0.112
To-0.106
 narrower-0.105
Token paper
Token ID3348
Feature activation-8.07e-02

Pos logit contributions
 disl+0.176
ach+0.156
az+0.155
uggets+0.150
ubby+0.148
Neg logit contributions
 distance-0.103
Oh-0.089
 narrower-0.084
 million-0.082
Hmm-0.081
Token.
Token ID13
Feature activation-7.31e-02

Pos logit contributions
 disl+0.140
ach+0.121
az+0.119
uggets+0.116
ubby+0.116
Neg logit contributions
 distance-0.109
Oh-0.096
 narrower-0.094
To-0.094
 million-0.094
Token
Token ID220
Feature activation-1.24e-01

Pos logit contributions
 disl+0.137
ach+0.120
az+0.119
uggets+0.115
ubby+0.113
Neg logit contributions
 distance-0.085
Oh-0.079
 narrower-0.078
Hmm-0.077
Uh-0.077
Token
Token ID198
Feature activation-7.20e-02

Pos logit contributions
 disl+0.260
ach+0.231
az+0.228
uggets+0.220
ubby+0.218
Neg logit contributions
 distance-0.118
Oh-0.108
 narrower-0.105
Hmm-0.104
Uh-0.103
Token
Token ID198
Feature activation-6.11e-02

Pos logit contributions
 disl+0.130
ach+0.114
uggets+0.109
az+0.109
ubby+0.103
Neg logit contributions
 distance-0.089
 narrower-0.086
Uh-0.081
Oh-0.079
Hmm-0.079
TokenBilly
Token ID49640
Feature activation-5.09e-02

Pos logit contributions
 disl+0.123
ach+0.109
uggets+0.103
az+0.099
ubby+0.099
Neg logit contributions
 distance-0.065
 narrower-0.061
Uh-0.054
 drifted-0.054
 vengeance-0.054
Token was
Token ID373
Feature activation-5.50e-02

Pos logit contributions
 disl+0.106
ach+0.095
az+0.094
uggets+0.092
ubby+0.090
Neg logit contributions
 distance-0.048
Oh-0.044
 narrower-0.044
Hmm-0.043
Uh-0.043
Token very
Token ID845
Feature activation-4.31e-02

Pos logit contributions
 disl+0.106
ach+0.094
az+0.089
uggets+0.087
 Majesty+0.086
Neg logit contributions
 distance-0.065
 narrower-0.059
Oh-0.059
Uh-0.057
Where-0.056
Token,
Token ID11
Feature activation+1.16e-02

Pos logit contributions
 disl+0.002
ach+0.002
az+0.002
uggets+0.002
 Majesty+0.002
Neg logit contributions
 distance-0.004
Oh-0.004
 narrower-0.004
Uh-0.004
Hmm-0.004
Token the
Token ID262
Feature activation-5.46e-03

Pos logit contributions
 distance+0.017
Oh+0.016
 narrower+0.016
Hmm+0.016
Uh+0.016
Neg logit contributions
 disl-0.018
ach-0.015
az-0.015
uggets-0.014
ubby-0.014
Token poor
Token ID3595
Feature activation-1.08e-02

Pos logit contributions
 disl+0.005
ach+0.004
az+0.004
ubby+0.004
 Majesty+0.004
Neg logit contributions
 distance-0.011
 narrower-0.011
Hmm-0.010
Oh-0.010
To-0.010
Token rh
Token ID9529
Feature activation-7.16e-02

Pos logit contributions
 disl+0.012
ach+0.010
az+0.010
uggets+0.009
ubby+0.009
Neg logit contributions
 distance-0.021
Oh-0.020
 narrower-0.020
Uh-0.019
Hmm-0.019
Tokenin
Token ID259
Feature activation+2.10e-02

Pos logit contributions
 disl+0.093
ach+0.074
az+0.074
uggets+0.070
 Majesty+0.068
Neg logit contributions
 distance-0.125
Oh-0.120
 narrower-0.119
Uh-0.118
Hmm-0.118
Tokenoc
Token ID420
Feature activation-9.78e-02

Pos logit contributions
 distance+0.042
 narrower+0.042
Oh+0.041
Hmm+0.040
Uh+0.040
Neg logit contributions
 disl-0.021
ach-0.016
az-0.015
ubby-0.014
 Majesty-0.013
Tokeneros
Token ID27498
Feature activation+6.15e-03

Pos logit contributions
 disl+0.090
uggets+0.064
az+0.061
ubby+0.058
ach+0.057
Neg logit contributions
 distance-0.189
Oh-0.187
 narrower-0.184
Hmm-0.184
Warning-0.182
Token was
Token ID373
Feature activation-6.38e-03

Pos logit contributions
 distance+0.006
Oh+0.006
 narrower+0.006
Uh+0.006
Hmm+0.006
Neg logit contributions
 disl-0.012
ach-0.011
az-0.011
uggets-0.010
ubby-0.010
Token never
Token ID1239
Feature activation-1.55e-02

Pos logit contributions
 disl+0.012
ach+0.011
az+0.010
uggets+0.010
uc+0.010
Neg logit contributions
 distance-0.008
 narrower-0.007
 eventually-0.007
Oh-0.007
See-0.006
Token hungry
Token ID14720
Feature activation-3.99e-02

Pos logit contributions
 disl+0.029
ach+0.027
uggets+0.025
az+0.025
amb+0.024
Neg logit contributions
 distance-0.020
 narrower-0.017
Oh-0.017
 million-0.017
Uh-0.016
Token again
Token ID757
Feature activation-1.36e-02

Pos logit contributions
 disl+0.082
az+0.069
ach+0.069
uggets+0.068
 Majesty+0.067
Neg logit contributions
 distance-0.044
 narrower-0.038
Oh-0.037
Where-0.036
To-0.036
Token pony
Token ID26902
Feature activation-5.94e-03

Pos logit contributions
 narrower+0.004
Uh+0.004
 distance+0.004
Hmm+0.004
To+0.004
Neg logit contributions
 disl-0.003
ach-0.002
az-0.002
ubby-0.002
 Majesty-0.002
Token again
Token ID757
Feature activation-2.41e-02

Pos logit contributions
 disl+0.012
az+0.011
ach+0.011
ubby+0.010
 majesty+0.010
Neg logit contributions
 distance-0.006
 narrower-0.005
To-0.005
Hmm-0.005
Oh-0.005
Token.
Token ID13
Feature activation+3.41e-04

Pos logit contributions
 disl+0.044
az+0.037
ach+0.037
uggets+0.036
 Majesty+0.035
Neg logit contributions
 distance-0.031
Oh-0.028
To-0.028
 narrower-0.028
Hmm-0.027
Token<|endoftext|>
Token ID50256
Feature activation-8.03e-02

Pos logit contributions
 distance+0.000
 narrower+0.000
Uh+0.000
Oh+0.000
Hmm+0.000
Neg logit contributions
 disl-0.001
az-0.001
ach-0.001
uggets-0.001
 Majesty-0.001
TokenOnce
Token ID7454
Feature activation-8.89e-02

Pos logit contributions
 disl+0.170
ach+0.150
az+0.147
uggets+0.144
 Majesty+0.143
Neg logit contributions
 distance-0.077
Oh-0.070
 narrower-0.067
Hmm-0.066
Uh-0.066
Token upon
Token ID2402
Feature activation-9.70e-02

Pos logit contributions
 disl+0.164
ach+0.141
 Majesty+0.132
uggets+0.131
az+0.130
Neg logit contributions
 distance-0.114
Oh-0.104
Less-0.100
To-0.099
Uh-0.099
Token a
Token ID257
Feature activation-6.53e-02

Pos logit contributions
 disl+0.195
az+0.181
ach+0.179
 Majesty+0.177
ubby+0.174
Neg logit contributions
He-0.114
Oh-0.112
 distance-0.112
To-0.112
Maybe-0.112
Token time
Token ID640
Feature activation-4.83e-02

Pos logit contributions
 disl+0.156
ach+0.136
 Majesty+0.136
uggets+0.136
az+0.135
Neg logit contributions
 distance-0.048
Finally-0.043
At-0.042
To-0.040
One-0.040
Token,
Token ID11
Feature activation-5.41e-02

Pos logit contributions
 disl+0.094
uggets+0.081
 Majesty+0.081
ans+0.078
ach+0.077
Neg logit contributions
At-0.051
Over-0.051
 distance-0.049
To-0.049
 had-0.049
Token there
Token ID612
Feature activation-4.64e-02

Pos logit contributions
 disl+0.112
ach+0.093
 Majesty+0.093
az+0.091
 majesty+0.089
Neg logit contributions
Oh-0.066
Over-0.063
Uh-0.062
Where-0.062
Maybe-0.061
Token was
Token ID373
Feature activation-4.87e-02

Pos logit contributions
 disl+0.111
ach+0.098
uggets+0.097
az+0.096
 Majesty+0.096
Neg logit contributions
 distance-0.033
Oh-0.030
Uh-0.028
Maybe-0.028
To-0.028
Token her
Token ID607
Feature activation-1.55e-01

Pos logit contributions
 distance+0.050
Oh+0.046
 narrower+0.046
To+0.045
Uh+0.044
Neg logit contributions
 disl-0.076
ach-0.070
az-0.068
uggets-0.065
ubby-0.064
Token mom
Token ID1995
Feature activation-1.18e-01

Pos logit contributions
 disl+0.272
az+0.238
ach+0.228
ubby+0.212
uggets+0.210
Neg logit contributions
 distance-0.213
Oh-0.183
 drifted-0.177
 third-0.176
 million-0.175
Tokenmy
Token ID1820
Feature activation-1.21e-01

Pos logit contributions
 disl+0.219
ach+0.191
az+0.188
uggets+0.181
ubby+0.179
Neg logit contributions
 distance-0.141
Oh-0.129
 narrower-0.125
Hmm-0.125
Uh-0.124
Token watched
Token ID7342
Feature activation-1.27e-01

Pos logit contributions
 disl+0.230
ach+0.202
az+0.200
uggets+0.192
ubby+0.190
Neg logit contributions
 distance-0.137
Oh-0.123
To-0.121
 narrower-0.120
 drifted-0.120
Token in
Token ID287
Feature activation-7.89e-02

Pos logit contributions
 disl+0.229
az+0.195
 majesty+0.191
ubby+0.190
 Majesty+0.190
Neg logit contributions
 distance-0.147
Oh-0.142
To-0.137
Where-0.136
Who-0.136
Token amaz
Token ID40642
Feature activation-2.10e-01

Pos logit contributions
 disl+0.124
ach+0.105
az+0.104
uggets+0.100
ubby+0.099
Neg logit contributions
 distance-0.119
Oh-0.111
 narrower-0.109
Uh-0.107
Hmm-0.106
Tokenement
Token ID972
Feature activation-1.19e-01

Pos logit contributions
 disl+0.476
uggets+0.437
 majesty+0.412
gy+0.403
 Majesty+0.400
Neg logit contributions
Oh-0.132
See-0.128
 distance-0.125
 million-0.122
Finally-0.118
Token as
Token ID355
Feature activation-8.86e-02

Pos logit contributions
 disl+0.224
ubby+0.189
az+0.188
uggets+0.183
 Majesty+0.182
Neg logit contributions
 distance-0.129
To-0.119
 million-0.117
 drifted-0.116
 narrower-0.115
Token the
Token ID262
Feature activation-6.41e-02

Pos logit contributions
 disl+0.157
ach+0.137
az+0.136
uggets+0.131
ubby+0.129
Neg logit contributions
 distance-0.112
Oh-0.105
 narrower-0.104
Hmm-0.102
Uh-0.100
Token machine
Token ID4572
Feature activation-5.20e-02

Pos logit contributions
 disl+0.104
ach+0.089
az+0.087
 Majesty+0.082
ubby+0.081
Neg logit contributions
 distance-0.089
 narrower-0.086
Hmm-0.084
Oh-0.084
Uh-0.083
Token flew
Token ID13112
Feature activation-8.98e-02

Pos logit contributions
 disl+0.093
ach+0.081
az+0.081
ci+0.074
 Majesty+0.073
Neg logit contributions
 distance-0.062
Hmm-0.061
Oh-0.061
Uh-0.061
 narrower-0.059
Token John
Token ID1757
Feature activation-1.48e-01

Pos logit contributions
 distance+0.069
Oh+0.068
 narrower+0.067
Aw+0.066
Uh+0.065
Neg logit contributions
 disl-0.099
ach-0.085
az-0.084
 Majesty-0.080
ci-0.080
Token chose
Token ID7690
Feature activation-1.11e-01

Pos logit contributions
 disl+0.310
az+0.268
ach+0.265
uggets+0.263
ubby+0.258
Neg logit contributions
 distance-0.148
Oh-0.130
Uh-0.122
 drifted-0.121
Ah-0.121
Token some
Token ID617
Feature activation-9.61e-02

Pos logit contributions
 disl+0.189
az+0.160
ach+0.158
ubby+0.157
 Majesty+0.154
Neg logit contributions
 distance-0.155
Oh-0.142
 narrower-0.136
Where-0.135
To-0.134
Token carrots
Token ID34397
Feature activation-9.12e-02

Pos logit contributions
 disl+0.158
az+0.135
ach+0.135
uggets+0.129
 Majesty+0.127
Neg logit contributions
 distance-0.135
Oh-0.126
 narrower-0.124
Hmm-0.122
Uh-0.121
Token and
Token ID290
Feature activation-5.37e-02

Pos logit contributions
 disl+0.172
az+0.144
 Majesty+0.142
 majesty+0.136
ach+0.134
Neg logit contributions
 distance-0.118
To-0.106
Maybe-0.104
He-0.102
 eventually-0.101
Token cuc
Token ID38421
Feature activation-1.98e-01

Pos logit contributions
 disl+0.108
az+0.102
ach+0.098
ubby+0.096
uggets+0.092
Neg logit contributions
 distance-0.056
 eventually-0.050
 narrower-0.049
Oh-0.049
 drifted-0.048
Tokenumbers
Token ID17024
Feature activation-7.71e-02

Pos logit contributions
 disl+0.354
az+0.266
 majesty+0.242
 Majesty+0.240
ubby+0.227
Neg logit contributions
Finally-0.286
Hmm-0.276
To-0.270
Ah-0.268
Less-0.261
Token to
Token ID284
Feature activation-5.64e-02

Pos logit contributions
 disl+0.138
az+0.117
ach+0.114
 Majesty+0.112
ubby+0.111
Neg logit contributions
 distance-0.096
To-0.088
 narrower-0.087
Maybe-0.086
Oh-0.085
Token take
Token ID1011
Feature activation-8.17e-02

Pos logit contributions
 disl+0.107
az+0.105
rou+0.098
ach+0.096
ubby+0.095
Neg logit contributions
 distance-0.061
Who-0.053
To-0.053
 eventually-0.053
Ah-0.052
Token home
Token ID1363
Feature activation-8.45e-02

Pos logit contributions
 disl+0.145
az+0.124
ach+0.121
ubby+0.117
 Majesty+0.116
Neg logit contributions
 distance-0.103
Oh-0.096
To-0.095
Ah-0.095
Hmm-0.093
Token.
Token ID13
Feature activation-4.51e-02

Pos logit contributions
 disl+0.160
az+0.140
 Majesty+0.134
ubby+0.133
 majesty+0.130
Neg logit contributions
 distance-0.099
To-0.089
 narrower-0.085
 million-0.085
He-0.084
Token't
Token ID470
Feature activation-1.36e-01

Pos logit contributions
Oh+0.031
Who+0.030
Maybe+0.029
To+0.029
Hmm+0.029
Neg logit contributions
 disl-0.039
ach-0.036
az-0.035
ubby-0.033
ishing-0.033
Token know
Token ID760
Feature activation-1.23e-01

Pos logit contributions
 disl+0.253
ach+0.230
az+0.228
uggets+0.223
ubby+0.216
Neg logit contributions
 distance-0.160
 narrower-0.138
Oh-0.136
To-0.135
Who-0.132
Token the
Token ID262
Feature activation-9.91e-02

Pos logit contributions
 disl+0.173
ach+0.150
az+0.147
ubby+0.141
uggets+0.141
Neg logit contributions
 distance-0.194
 narrower-0.185
Oh-0.183
To-0.179
Hmm-0.176
Token rh
Token ID9529
Feature activation-1.88e-01

Pos logit contributions
 disl+0.172
ach+0.150
az+0.149
uggets+0.143
ubby+0.139
Neg logit contributions
 distance-0.131
Oh-0.121
 narrower-0.120
Uh-0.117
Hmm-0.117
Tokenin
Token ID259
Feature activation-7.90e-02

Pos logit contributions
 disl+0.301
az+0.233
 Majesty+0.223
ach+0.217
 majesty+0.215
Neg logit contributions
 distance-0.292
Oh-0.279
Uh-0.272
Ah-0.270
 million-0.270
Tokenoc
Token ID420
Feature activation-2.06e-01

Pos logit contributions
 disl+0.107
ach+0.088
az+0.088
uggets+0.083
ubby+0.081
Neg logit contributions
 distance-0.134
Oh-0.127
 narrower-0.126
Hmm-0.125
Uh-0.125
Tokeneros
Token ID27498
Feature activation-8.26e-02

Pos logit contributions
 disl+0.224
uggets+0.162
iggle+0.154
aj+0.151
ubby+0.150
Neg logit contributions
 distance-0.365
Oh-0.361
 Moral-0.351
Less-0.351
Warning-0.351
Token was
Token ID373
Feature activation-7.60e-02

Pos logit contributions
 disl+0.169
ach+0.150
az+0.148
uggets+0.143
ubby+0.141
Neg logit contributions
 distance-0.082
Oh-0.075
 narrower-0.075
Hmm-0.073
Uh-0.073
Token wild
Token ID4295
Feature activation-1.11e-01

Pos logit contributions
 disl+0.141
ach+0.124
az+0.121
uggets+0.116
ubby+0.115
Neg logit contributions
 distance-0.095
 narrower-0.086
Oh-0.085
Uh-0.081
To-0.081
Token.
Token ID13
Feature activation-6.22e-02

Pos logit contributions
 disl+0.217
uggets+0.194
ubby+0.190
 majesty+0.189
 Majesty+0.187
Neg logit contributions
 distance-0.124
 narrower-0.104
Oh-0.099
 million-0.099
 darker-0.099
Token Can
Token ID1680
Feature activation-1.04e-01

Pos logit contributions
 disl+0.110
ach+0.095
az+0.094
uggets+0.091
 Majesty+0.088
Neg logit contributions
 distance-0.079
 narrower-0.075
Oh-0.072
Uh-0.072
Hmm-0.071
Token reward
Token ID6721
Feature activation-1.56e-01

Pos logit contributions
 disl+0.344
az+0.291
ach+0.288
uggets+0.280
uc+0.264
Neg logit contributions
 distance-0.268
Oh-0.230
 million-0.226
Uh-0.220
 narrower-0.211
Token.
Token ID13
Feature activation-7.88e-02

Pos logit contributions
 disl+0.261
az+0.223
ach+0.215
ubby+0.212
 Majesty+0.211
Neg logit contributions
 distance-0.212
Oh-0.195
To-0.195
Hmm-0.187
Maybe-0.187
Token It
Token ID632
Feature activation-5.78e-02

Pos logit contributions
 disl+0.153
ach+0.135
az+0.133
uggets+0.129
ubby+0.126
Neg logit contributions
 distance-0.086
Oh-0.080
 narrower-0.079
Hmm-0.078
Uh-0.078
Token was
Token ID373
Feature activation-7.56e-02

Pos logit contributions
 disl+0.115
ach+0.100
az+0.100
 Majesty+0.096
uggets+0.095
Neg logit contributions
 distance-0.061
Oh-0.056
 narrower-0.056
Uh-0.054
Hmm-0.054
Token ch
Token ID442
Feature activation-1.27e-01

Pos logit contributions
 disl+0.125
ach+0.107
az+0.106
uggets+0.099
ubby+0.098
Neg logit contributions
 distance-0.108
 narrower-0.100
Oh-0.100
Uh-0.097
Hmm-0.096
Tokenocol
Token ID4668
Feature activation-2.26e-01

Pos logit contributions
 disl+0.130
uggets+0.083
 Majesty+0.082
 majesty+0.078
ach+0.075
Neg logit contributions
 narrower-0.258
Uh-0.257
 distance-0.254
Oh-0.250
 million-0.248
Tokenates
Token ID689
Feature activation-7.78e-02

Pos logit contributions
 disl+0.333
 Majesty+0.221
az+0.214
 majesty+0.211
 sport+0.210
Neg logit contributions
Ah-0.346
Less-0.345
Warning-0.342
 narrower-0.339
To-0.339
Token and
Token ID290
Feature activation-4.46e-02

Pos logit contributions
 disl+0.142
az+0.113
 Majesty+0.109
ach+0.108
 majesty+0.107
Neg logit contributions
 distance-0.104
 narrower-0.092
 million-0.092
To-0.092
 eventually-0.091
Token sweets
Token ID42402
Feature activation-7.69e-02

Pos logit contributions
 disl+0.081
az+0.072
ach+0.072
ubby+0.069
uggets+0.068
Neg logit contributions
 distance-0.055
Oh-0.051
 narrower-0.050
Hmm-0.048
Uh-0.048
Token!
Token ID0
Feature activation-5.02e-02

Pos logit contributions
 disl+0.126
az+0.106
ach+0.104
ubby+0.102
uggets+0.101
Neg logit contributions
 distance-0.113
 narrower-0.102
Oh-0.102
To-0.101
Hmm-0.099
Token
Token ID198
Feature activation-6.58e-02

Pos logit contributions
 disl+0.089
ach+0.077
az+0.076
uggets+0.073
ubby+0.072
Neg logit contributions
 distance-0.064
 narrower-0.059
Oh-0.059
Uh-0.058
Hmm-0.058
Token,
Token ID11
Feature activation-3.35e-02

Pos logit contributions
 disl+0.077
az+0.064
ach+0.063
 Majesty+0.060
uggets+0.060
Neg logit contributions
 distance-0.065
 narrower-0.059
Oh-0.058
To-0.057
Hmm-0.057
Token cup
Token ID6508
Feature activation-5.98e-02

Pos logit contributions
 disl+0.055
az+0.047
ach+0.047
ubby+0.045
 Majesty+0.043
Neg logit contributions
 distance-0.048
Oh-0.045
 narrower-0.044
Hmm-0.043
Uh-0.043
Tokencakes
Token ID37263
Feature activation-5.13e-02

Pos logit contributions
 disl+0.099
ach+0.082
az+0.081
uggets+0.079
 Majesty+0.078
Neg logit contributions
 distance-0.085
Oh-0.078
 narrower-0.078
Hmm-0.076
Uh-0.076
Token and
Token ID290
Feature activation-2.49e-02

Pos logit contributions
 disl+0.058
ach+0.050
az+0.047
uggets+0.044
ubby+0.041
Neg logit contributions
 distance-0.094
 narrower-0.092
Oh-0.092
Uh-0.092
Hmm-0.091
Token ch
Token ID442
Feature activation-9.67e-02

Pos logit contributions
 disl+0.043
az+0.037
ach+0.037
ubby+0.036
uggets+0.035
Neg logit contributions
 distance-0.033
Oh-0.031
 narrower-0.031
Hmm-0.030
Uh-0.030
Tokenocol
Token ID4668
Feature activation-1.96e-01

Pos logit contributions
 disl+0.111
ci+0.088
 Majesty+0.081
 majesty+0.069
az+0.068
Neg logit contributions
 drifted-0.164
See-0.159
Park-0.158
Ah-0.158
Aw-0.156
Tokenates
Token ID689
Feature activation-5.24e-02

Pos logit contributions
 disl+0.279
 Majesty+0.190
az+0.181
 majesty+0.174
ubby+0.171
Neg logit contributions
Ah-0.305
 narrower-0.302
To-0.301
Oh-0.300
Finally-0.300
Token.
Token ID13
Feature activation-3.05e-02

Pos logit contributions
 disl+0.096
az+0.081
ach+0.078
 Majesty+0.077
ubby+0.077
Neg logit contributions
 distance-0.065
 narrower-0.059
To-0.058
Oh-0.057
Park-0.056
Token
Token ID220
Feature activation-9.06e-02

Pos logit contributions
 disl+0.056
ach+0.049
az+0.048
uggets+0.047
 Majesty+0.045
Neg logit contributions
 distance-0.037
 narrower-0.035
Uh-0.034
Oh-0.033
Hmm-0.033
Token
Token ID198
Feature activation-4.43e-02

Pos logit contributions
 disl+0.201
ach+0.177
az+0.176
 Majesty+0.170
ubby+0.170
Neg logit contributions
 distance-0.072
Oh-0.067
To-0.066
 narrower-0.064
Finally-0.064
Token
Token ID198
Feature activation-3.64e-02

Pos logit contributions
 disl+0.080
ach+0.073
az+0.069
uggets+0.069
ubby+0.065
Neg logit contributions
 distance-0.053
 narrower-0.051
Uh-0.048
Oh-0.048
Hmm-0.048