TOP ACTIVATIONS
MAX = 0.194

little boy named Tim was very excited . He was going
. And everyone was very happy . <|endoftext|> Far away
Fre ya . She was very excited because she was going
Those items are very special . We need to
Alex . Alex was a very obedient sw an and he
Lilly â s very own mini theater ! It
Tim . Tim was a very good baby . He loved
curious expression . Emma sat very still , not wanting to
named Dave . Dave was very big and he liked to
but Tim wasn 't being very nice . He was being
course , if you are very careful . â
named Benny . Benny was very restless and couldn 't sit
because I love you both very much . The baby is
named Benny . Benny was very scared of everything . He
Lily 's mom my was very smart and knew how to
Tim . Tim was a very tidy kid . He always
Wow , this is a very old letter . It 's
â re a very impatient girl , â
â s a very angry man up there .
too . She loved Lily very much . She asked mom

MIN ACTIVATIONS
MIN = -0.222

sc ooter . She tr und led along the sidewalk all
Every time it th und ered afterwards , the famous
The storm th und ered and roared above ,
that 's not nice . Taking things that aren 't yours
came from it . Sc utt les was so frightened ,
began screaming again . Sc utt les was so scared ,
be healthy and happy . Taking your pill is good for
's best for you . Taking a bath is good for
happier . Everyone agreed . Taking the medicine felt nice and
the bear was right . Taking the gold would have been
They walked home . On the way , they met
bird for the advice . Taking the camera , he and
learned an important lesson . Taking things without asking was wrong
his little nose and sc utt led away . From that
learned a hard lesson . Taking things that are not yours
share with your sister . Taking her dolls was very bad
too . Sc utt les was scared but he
Let 's call them . On the count of three .
, and a shelf . On the shelf , there is
. He screamed and sc utt led away .

INTERVAL 0.090 - 0.194
CONTAINS 8.255%

again ! <|endoftext|> Once upon a time there was a boy
upon a time , in a small town , there was
The water was cold , so Amy started to g iggle
. She loved her room very much . One day ,
day on , Tim my was more careful and had fun

INTERVAL -0.014 - 0.090
CONTAINS 45.502%

there was a tall temple . Many kids liked to play
they saw a big head . It was the hipp o
asked . " Yes , please , Lily .
cool , Lily . Look , it can w iggle !"
selfish . She learned that it is better to enjoy the

INTERVAL -0.118 - -0.014
CONTAINS 41.968%

time , there was a big truck . It was very
about this and kept demanding for somebody to help her clean
ila and Ben ran to the crystal . They saw the
and helpful is more important than having luxury things . <|endoftext|>
it every day . After a few weeks ,

INTERVAL -0.222 - -0.118
CONTAINS 4.276%

turns to kick it . The ball flew high in the
is mine ! â She tried to pick it ,
happily in the sky , always sharing with everyone and never
and Ben are friends . They like to play in the
made him very sad . He wished he had stopped loading
Token little
Token ID1310
Feature activation-6.94e-03

Pos logit contributions
 babys+0.183
 gloss+0.166
 haircut+0.164
 happened+0.163
 complexion+0.162
Neg logit contributions
 often-0.182
ales-0.178
mediately-0.176
rost-0.176
isk-0.173
Token boy
Token ID2933
Feature activation+7.87e-02

Pos logit contributions
ales+0.012
mediately+0.012
ining+0.011
urbed+0.011
reci+0.011
Neg logit contributions
 babys-0.008
 haircut-0.007
 happened-0.007
 gloss-0.007
 complexion-0.007
Token named
Token ID3706
Feature activation-2.46e-02

Pos logit contributions
 babys+0.111
 happened+0.097
 haircut+0.096
 complexion+0.089
 gloss+0.088
Neg logit contributions
ales-0.132
mediately-0.121
orting-0.119
isk-0.118
reci-0.118
Token Tim
Token ID5045
Feature activation-5.07e-02

Pos logit contributions
reci+0.037
ales+0.037
ining+0.036
mediately+0.035
orting+0.035
Neg logit contributions
 babys-0.037
 happened-0.032
 haircut-0.030
 jan-0.029
 hairc-0.028
Token was
Token ID373
Feature activation+1.24e-01

Pos logit contributions
ales+0.081
 itself+0.076
rost+0.074
orting+0.073
reci+0.073
Neg logit contributions
 babys-0.072
 happened-0.068
 haircut-0.059
 complexion-0.058
 cough-0.056
Token very
Token ID845
Feature activation+1.86e-01

Pos logit contributions
 babys+0.132
 haircut+0.108
 happened+0.101
 gloss+0.096
 complexion+0.093
Neg logit contributions
ales-0.255
reci-0.234
orting-0.232
mediately-0.230
rost-0.228
Token excited
Token ID6568
Feature activation+6.89e-03

Pos logit contributions
 babys+0.202
 haircut+0.169
 happened+0.156
 complexion+0.152
 gloss+0.150
Neg logit contributions
ales-0.374
orting-0.341
mediately-0.338
isk-0.337
urbed-0.336
Token.
Token ID13
Feature activation+5.43e-02

Pos logit contributions
 babys+0.011
 haircut+0.010
 happened+0.009
 gloss+0.009
 complexion+0.009
Neg logit contributions
ales-0.010
mediately-0.009
ining-0.009
orting-0.009
isk-0.009
Token He
Token ID679
Feature activation-1.29e-01

Pos logit contributions
 babys+0.093
 gloss+0.077
 haircut+0.077
 happened+0.075
 classroom+0.074
Neg logit contributions
wed-0.073
ales-0.073
urbed-0.071
ility-0.069
ck-0.067
Token was
Token ID373
Feature activation+1.22e-01

Pos logit contributions
ales+0.233
orting+0.221
ility+0.217
reci+0.209
mediately+0.208
Neg logit contributions
 babys-0.166
 happened-0.153
 haircut-0.146
 complexion-0.135
 gloss-0.128
Token going
Token ID1016
Feature activation-2.42e-02

Pos logit contributions
 babys+0.124
 haircut+0.103
 happened+0.096
 gloss+0.088
 complexion+0.086
Neg logit contributions
ales-0.256
reci-0.239
orting-0.239
mediately-0.233
urbed-0.231
Token
Token ID8151
Feature activation+5.45e-02

Pos logit contributions
 babys+0.221
 haircut+0.202
 happened+0.193
 gloss+0.191
 complexion+0.189
Neg logit contributions
ales-0.177
mediately-0.159
ining-0.157
isk-0.155
orting-0.154
Token.
Token ID13
Feature activation+5.15e-02

Pos logit contributions
 babys+0.077
 haircut+0.069
 happened+0.065
 gloss+0.064
 complexion+0.064
Neg logit contributions
ales-0.087
mediately-0.079
ining-0.078
isk-0.078
orting-0.077
Token And
Token ID843
Feature activation-5.65e-02

Pos logit contributions
 babys+0.081
 gloss+0.069
 haircut+0.068
 hairc+0.064
ird+0.064
Neg logit contributions
ales-0.081
wed-0.077
rant-0.072
ws-0.072
lated-0.072
Token everyone
Token ID2506
Feature activation+1.12e-02

Pos logit contributions
ales+0.100
rant+0.088
wed+0.088
ws+0.086
ility+0.085
Neg logit contributions
 babys-0.077
 haircut-0.073
 hairc-0.068
 happened-0.065
 gloss-0.065
Token was
Token ID373
Feature activation+1.23e-01

Pos logit contributions
 babys+0.015
 happened+0.015
 haircut+0.013
 photo+0.012
 hairc+0.012
Neg logit contributions
ales-0.020
rost-0.017
isk-0.017
mediately-0.017
orting-0.017
Token very
Token ID845
Feature activation+1.86e-01

Pos logit contributions
 babys+0.140
 haircut+0.115
 happened+0.114
 gloss+0.100
 complexion+0.097
Neg logit contributions
ales-0.245
orting-0.222
reci-0.220
rost-0.220
mediately-0.218
Token happy
Token ID3772
Feature activation-1.07e-02

Pos logit contributions
 babys+0.207
 haircut+0.174
 happened+0.167
 gloss+0.156
 complexion+0.156
Neg logit contributions
ales-0.368
isk-0.333
orting-0.331
mediately-0.331
urbed-0.329
Token.
Token ID13
Feature activation+4.99e-02

Pos logit contributions
ales+0.017
mediately+0.015
reci+0.015
isk+0.015
ining+0.015
Neg logit contributions
 babys-0.016
 haircut-0.015
 happened-0.014
 gloss-0.014
 complexion-0.013
Token<|endoftext|>
Token ID50256
Feature activation-3.91e-02

Pos logit contributions
 babys+0.075
 haircut+0.059
 gloss+0.058
ird+0.057
 happened+0.057
Neg logit contributions
ales-0.081
wed-0.081
ws-0.074
lated-0.073
ling-0.072
TokenFar
Token ID21428
Feature activation-1.88e-01

Pos logit contributions
ales+0.066
mediately+0.059
urbed+0.059
wed+0.058
isk+0.057
Neg logit contributions
 babys-0.054
 happened-0.049
 haircut-0.046
 gloss-0.044
 complexion-0.042
Token away
Token ID1497
Feature activation+1.12e-02

Pos logit contributions
ales+0.232
ility+0.230
hips+0.222
Days+0.218
rost+0.211
Neg logit contributions
 babys-0.290
 happened-0.278
 hairc-0.264
 rang-0.255
 haircut-0.247
Token Fre
Token ID4848
Feature activation-8.70e-02

Pos logit contributions
ales+0.039
reci+0.036
orting+0.036
wed+0.035
mediately+0.035
Neg logit contributions
 babys-0.032
 happened-0.027
 haircut-0.026
 complexion-0.024
 gloss-0.024
Tokenya
Token ID3972
Feature activation-2.07e-02

Pos logit contributions
isk+0.146
 Step+0.145
ales+0.145
 If+0.140
rost+0.140
Neg logit contributions
 happened-0.076
 veil-0.075
 hairc-0.073
 babys-0.073
 husband-0.071
Token.
Token ID13
Feature activation+5.69e-02

Pos logit contributions
ales+0.038
rost+0.034
orting+0.034
isk+0.034
mediately+0.034
Neg logit contributions
 babys-0.025
 happened-0.024
 haircut-0.022
 complexion-0.021
 photo-0.020
Token She
Token ID1375
Feature activation-1.28e-01

Pos logit contributions
 babys+0.053
 Lee+0.045
 happened+0.039
 complexion+0.038
 photo+0.037
Neg logit contributions
wed-0.118
ales-0.110
ches-0.109
lated-0.109
ck-0.107
Token was
Token ID373
Feature activation+1.23e-01

Pos logit contributions
ales+0.244
orting+0.218
ility+0.215
rost+0.215
reci+0.212
Neg logit contributions
 babys-0.156
 happened-0.150
 haircut-0.136
 gloss-0.126
 complexion-0.124
Token very
Token ID845
Feature activation+1.86e-01

Pos logit contributions
 babys+0.133
 haircut+0.112
 happened+0.103
 gloss+0.095
 hairc+0.091
Neg logit contributions
ales-0.256
orting-0.238
reci-0.234
rost-0.230
wed-0.226
Token excited
Token ID6568
Feature activation+4.96e-03

Pos logit contributions
 babys+0.210
 haircut+0.179
 happened+0.167
 complexion+0.161
 gloss+0.159
Neg logit contributions
ales-0.367
orting-0.333
isk-0.330
wed-0.327
reci-0.325
Token because
Token ID780
Feature activation-9.72e-03

Pos logit contributions
 babys+0.007
 haircut+0.007
 happened+0.006
 complexion+0.006
 gloss+0.006
Neg logit contributions
ales-0.008
reci-0.007
mediately-0.007
rost-0.007
isk-0.007
Token she
Token ID673
Feature activation-3.75e-02

Pos logit contributions
ales+0.022
isk+0.022
wed+0.021
ws+0.021
orting+0.021
Neg logit contributions
 babys-0.008
 haircut-0.007
 happened-0.006
 hairc-0.005
 photo-0.005
Token was
Token ID373
Feature activation+1.24e-01

Pos logit contributions
ales+0.061
orting+0.055
mediately+0.054
reci+0.053
urbed+0.053
Neg logit contributions
 babys-0.054
 haircut-0.049
 happened-0.048
 gloss-0.045
 complexion-0.045
Token going
Token ID1016
Feature activation-2.26e-02

Pos logit contributions
 babys+0.127
 haircut+0.109
 happened+0.101
 gloss+0.095
 complexion+0.093
Neg logit contributions
ales-0.254
orting-0.236
reci-0.234
mediately-0.234
rost-0.231
Token
Token ID129
Feature activation-8.11e-02

Pos logit contributions
 babys+0.207
 haircut+0.189
 happened+0.181
 gloss+0.179
 complexion+0.177
Neg logit contributions
ales-0.157
mediately-0.141
ining-0.140
isk-0.137
urbed-0.137
Token
Token ID241
Feature activation-9.65e-03

Pos logit contributions
ales+0.115
reci+0.100
rost+0.099
ility+0.098
wed+0.098
Neg logit contributions
 babys-0.131
 haircut-0.119
 gloss-0.114
 happened-0.112
 complexion-0.109
TokenThose
Token ID9627
Feature activation-1.07e-01

Pos logit contributions
ales+0.014
isk+0.012
orting+0.012
reci+0.012
mediately+0.012
Neg logit contributions
 babys-0.015
 haircut-0.014
 happened-0.014
 gloss-0.014
 complexion-0.013
Token items
Token ID3709
Feature activation+2.14e-02

Pos logit contributions
ales+0.156
reci+0.138
mediately+0.138
orting+0.138
ining+0.136
Neg logit contributions
 babys-0.164
 haircut-0.145
 complexion-0.140
 gloss-0.140
 happened-0.140
Token are
Token ID389
Feature activation+9.12e-02

Pos logit contributions
 babys+0.028
 happened+0.027
 haircut+0.026
 lipstick+0.024
 makeup+0.024
Neg logit contributions
urbed-0.033
mediately-0.033
isk-0.033
ales-0.032
ility-0.032
Token very
Token ID845
Feature activation+1.86e-01

Pos logit contributions
 babys+0.108
 haircut+0.094
 happened+0.092
 makeup+0.085
 lipstick+0.084
Neg logit contributions
ales-0.163
mediately-0.158
reci-0.156
orting-0.155
rost-0.154
Token special
Token ID2041
Feature activation-5.01e-02

Pos logit contributions
 babys+0.181
 haircut+0.159
 happened+0.148
 virus+0.141
 photo+0.136
Neg logit contributions
ales-0.363
isk-0.356
ility-0.355
urbed-0.349
orting-0.343
Token.
Token ID13
Feature activation+5.63e-02

Pos logit contributions
ales+0.078
mediately+0.074
isk+0.073
rost+0.071
reci+0.071
Neg logit contributions
 babys-0.072
 haircut-0.066
 happened-0.062
 gloss-0.060
 complexion-0.059
Token We
Token ID775
Feature activation-8.29e-02

Pos logit contributions
 babys+0.084
 gloss+0.074
ird+0.071
 haircut+0.069
 lipstick+0.066
Neg logit contributions
ales-0.088
wed-0.086
ling-0.080
lated-0.080
urbed-0.079
Token need
Token ID761
Feature activation-1.67e-02

Pos logit contributions
ales+0.139
orting+0.131
mediately+0.128
isk+0.127
 itself+0.125
Neg logit contributions
 babys-0.107
 happened-0.095
 haircut-0.095
 photo-0.094
ird-0.094
Token to
Token ID284
Feature activation+6.73e-03

Pos logit contributions
ales+0.030
isk+0.029
mediately+0.029
ining+0.027
reci+0.027
Neg logit contributions
 babys-0.021
 haircut-0.020
 happened-0.018
 hairc-0.017
 photo-0.017
Token Alex
Token ID4422
Feature activation-2.26e-02

Pos logit contributions
wed+0.036
reci+0.036
mediately+0.035
ales+0.034
orting+0.034
Neg logit contributions
 babys-0.033
 happened-0.030
 photo-0.026
 haircut-0.026
 hairc-0.025
Token.
Token ID13
Feature activation+5.38e-02

Pos logit contributions
ales+0.040
mediately+0.036
reci+0.036
orting+0.036
rost+0.035
Neg logit contributions
 babys-0.030
 happened-0.026
 haircut-0.026
 complexion-0.024
 gloss-0.023
Token Alex
Token ID4422
Feature activation-2.32e-02

Pos logit contributions
 babys+0.070
 happened+0.059
 tutor+0.058
 haircut+0.058
 classmate+0.058
Neg logit contributions
ales-0.093
wed-0.092
isk-0.083
lated-0.082
ws-0.081
Token was
Token ID373
Feature activation+1.21e-01

Pos logit contributions
ales+0.047
rost+0.043
orting+0.043
isk+0.042
ility+0.042
Neg logit contributions
 happened-0.023
 babys-0.023
 rang-0.019
 haircut-0.019
 complexion-0.018
Token a
Token ID257
Feature activation+1.33e-01

Pos logit contributions
 babys+0.121
 happened+0.096
 haircut+0.095
 gloss+0.083
 complexion+0.082
Neg logit contributions
ales-0.258
reci-0.239
orting-0.239
rost-0.236
wed-0.231
Token very
Token ID845
Feature activation+1.86e-01

Pos logit contributions
 babys+0.157
 haircut+0.139
 gloss+0.136
 happened+0.136
 complexion+0.127
Neg logit contributions
ales-0.225
 often-0.223
ining-0.215
mediately-0.214
 freely-0.211
Token obedient
Token ID41652
Feature activation-5.46e-02

Pos logit contributions
 babys+0.212
 haircut+0.180
 happened+0.171
 complexion+0.166
 gloss+0.159
Neg logit contributions
ales-0.364
isk-0.329
orting-0.329
urbed-0.327
wed-0.326
Token sw
Token ID1509
Feature activation-5.54e-02

Pos logit contributions
ales+0.112
wed+0.110
ws+0.108
ining+0.107
isk+0.106
Neg logit contributions
 babys-0.058
 haircut-0.050
 complexion-0.044
 photo-0.042
 happened-0.041
Tokenan
Token ID272
Feature activation+1.13e-02

Pos logit contributions
reci+0.057
ility+0.055
rost+0.055
 At+0.053
 Step+0.053
Neg logit contributions
 babys-0.099
 complexion-0.095
 haircut-0.091
 underwear-0.088
ouse-0.088
Token and
Token ID290
Feature activation-3.11e-02

Pos logit contributions
 babys+0.013
 haircut+0.011
 gloss+0.011
 happened+0.010
 complexion+0.010
Neg logit contributions
reci-0.021
ales-0.020
wed-0.020
ws-0.020
rost-0.020
Token he
Token ID339
Feature activation-4.88e-02

Pos logit contributions
ales+0.051
orting+0.046
rost+0.045
reci+0.045
mediately+0.044
Neg logit contributions
 babys-0.044
 happened-0.042
 haircut-0.040
 complexion-0.037
 gloss-0.037
Token Lilly
Token ID35134
Feature activation-3.79e-02

Pos logit contributions
 babys+0.126
 happened+0.115
 haircut+0.107
 gloss+0.098
 tooth+0.096
Neg logit contributions
ales-0.250
orting-0.233
reci-0.228
isk-0.227
rost-0.222
Tokenâ
Token ID22940
Feature activation-6.77e-02

Pos logit contributions
ales+0.063
rost+0.058
reci+0.057
mediately+0.057
orting+0.057
Neg logit contributions
 babys-0.053
 haircut-0.047
 happened-0.044
 gloss-0.044
 complexion-0.043
Token
Token ID26391
Feature activation+1.28e-01

Pos logit contributions
ales+0.124
angles+0.118
 often+0.115
urbed+0.114
rost+0.114
Neg logit contributions
 babys-0.077
 haircut-0.065
 gloss-0.062
ird-0.061
 happened-0.060
Token
Token ID8151
Feature activation+5.46e-02

Pos logit contributions
 babys+0.205
 haircut+0.187
 happened+0.179
 gloss+0.177
 complexion+0.174
Neg logit contributions
ales-0.179
mediately-0.161
ining-0.159
isk-0.157
orting-0.157
Tokens
Token ID82
Feature activation+4.47e-02

Pos logit contributions
 babys+0.074
 haircut+0.065
 happened+0.063
 gloss+0.062
 complexion+0.060
Neg logit contributions
ales-0.091
mediately-0.082
ining-0.081
reci-0.081
orting-0.081
Token very
Token ID845
Feature activation+1.85e-01

Pos logit contributions
 babys+0.059
 haircut+0.053
 happened+0.049
 gloss+0.048
 photo+0.046
Neg logit contributions
ales-0.080
orting-0.071
mediately-0.071
reci-0.071
wed-0.070
Token own
Token ID898
Feature activation-2.86e-02

Pos logit contributions
 babys+0.199
 haircut+0.173
 happened+0.162
 gloss+0.155
 complexion+0.150
Neg logit contributions
ales-0.374
orting-0.343
isk-0.339
ility-0.337
wed-0.331
Token mini
Token ID9927
Feature activation-5.28e-02

Pos logit contributions
ales+0.049
mediately+0.044
isk+0.044
wed+0.044
rost+0.043
Neg logit contributions
 babys-0.039
 haircut-0.036
 happened-0.034
 gloss-0.032
 hairc-0.031
Token theater
Token ID13766
Feature activation+2.09e-02

Pos logit contributions
ales+0.087
ining+0.079
 often+0.079
orting+0.078
mediately+0.078
Neg logit contributions
 babys-0.071
 haircut-0.061
 happened-0.060
 gloss-0.059
 complexion-0.058
Token!
Token ID0
Feature activation+4.13e-02

Pos logit contributions
 babys+0.029
 haircut+0.027
 happened+0.025
 gloss+0.024
 complexion+0.023
Neg logit contributions
ales-0.034
mediately-0.032
reci-0.032
wed-0.031
urbed-0.031
Token It
Token ID632
Feature activation-7.69e-02

Pos logit contributions
 babys+0.066
 gloss+0.057
 haircut+0.055
ird+0.052
 happened+0.051
Neg logit contributions
ales-0.065
wed-0.059
ws-0.056
angles-0.056
rost-0.055
Token Tim
Token ID5045
Feature activation-4.69e-02

Pos logit contributions
reci+0.035
ales+0.035
orting+0.034
wed+0.034
mediately+0.033
Neg logit contributions
 babys-0.031
 happened-0.026
 haircut-0.026
 gloss-0.024
 lipstick-0.023
Token.
Token ID13
Feature activation+5.38e-02

Pos logit contributions
ales+0.077
reci+0.069
orting+0.069
 itself+0.069
rost+0.068
Neg logit contributions
 babys-0.067
 happened-0.060
 haircut-0.057
 complexion-0.053
 tooth-0.052
Token Tim
Token ID5045
Feature activation-4.82e-02

Pos logit contributions
 babys+0.082
 gloss+0.070
 haircut+0.066
 lipstick+0.064
 Lee+0.064
Neg logit contributions
wed-0.085
ales-0.084
ws-0.077
ets-0.076
urbed-0.076
Token was
Token ID373
Feature activation+1.21e-01

Pos logit contributions
ales+0.090
ility+0.085
orting+0.084
rost+0.083
hips+0.081
Neg logit contributions
 babys-0.053
 happened-0.052
 complexion-0.043
 cough-0.043
 haircut-0.043
Token a
Token ID257
Feature activation+1.32e-01

Pos logit contributions
 babys+0.119
 haircut+0.100
 happened+0.093
 gloss+0.090
 complexion+0.088
Neg logit contributions
ales-0.250
mediately-0.230
reci-0.229
orting-0.228
ining-0.228
Token very
Token ID845
Feature activation+1.85e-01

Pos logit contributions
 gloss+0.177
 babys+0.176
 happened+0.170
 underwear+0.167
 complexion+0.157
Neg logit contributions
 often-0.200
 freely-0.173
rost-0.169
oster-0.168
ining-0.166
Token good
Token ID922
Feature activation+6.26e-03

Pos logit contributions
 babys+0.219
 haircut+0.189
 happened+0.177
 complexion+0.172
 gloss+0.169
Neg logit contributions
ales-0.353
urbed-0.318
orting-0.318
 itself-0.318
ility-0.318
Token baby
Token ID5156
Feature activation+8.64e-02

Pos logit contributions
 babys+0.008
 happened+0.007
 gloss+0.007
 haircut+0.007
 complexion+0.006
Neg logit contributions
ales-0.011
 often-0.010
mediately-0.010
rost-0.010
isk-0.010
Token.
Token ID13
Feature activation+5.04e-02

Pos logit contributions
 babys+0.142
 haircut+0.131
 happened+0.125
 gloss+0.124
 complexion+0.122
Neg logit contributions
ales-0.115
 often-0.103
mediately-0.101
orting-0.100
ining-0.100
Token He
Token ID679
Feature activation-1.32e-01

Pos logit contributions
 babys+0.082
 gloss+0.069
 haircut+0.066
 classroom+0.065
 happened+0.064
Neg logit contributions
wed-0.073
ales-0.071
urbed-0.068
ility-0.065
ws-0.065
Token loved
Token ID6151
Feature activation-1.46e-02

Pos logit contributions
ales+0.231
orting+0.209
ility+0.207
mediately+0.204
reci+0.204
Neg logit contributions
 babys-0.183
 happened-0.170
 haircut-0.159
 gloss-0.149
 cough-0.148
Token curious
Token ID11040
Feature activation-3.67e-02

Pos logit contributions
 babys+0.180
 gloss+0.153
 happened+0.152
 haircut+0.151
 complexion+0.148
Neg logit contributions
 often-0.199
ining-0.192
mediately-0.191
urbed-0.191
ales-0.185
Token expression
Token ID5408
Feature activation+1.25e-02

Pos logit contributions
ales+0.066
mediately+0.060
isk+0.060
ining+0.060
orting+0.059
Neg logit contributions
 babys-0.046
 haircut-0.042
 happened-0.038
 complexion-0.037
 gloss-0.037
Token.
Token ID13
Feature activation+5.45e-02

Pos logit contributions
 babys+0.018
 haircut+0.016
 happened+0.015
 gloss+0.015
 complexion+0.015
Neg logit contributions
ales-0.020
mediately-0.018
ining-0.018
orting-0.018
reci-0.017
Token Emma
Token ID18966
Feature activation-2.04e-02

Pos logit contributions
 babys+0.082
 gloss+0.070
 haircut+0.069
 happened+0.067
 complexion+0.063
Neg logit contributions
ales-0.090
wed-0.082
reci-0.078
ws-0.077
orting-0.077
Token sat
Token ID3332
Feature activation+5.41e-03

Pos logit contributions
ales+0.039
rost+0.036
orting+0.035
ility+0.034
isk+0.034
Neg logit contributions
 babys-0.024
 happened-0.021
 haircut-0.021
 gloss-0.020
 cough-0.019
Token very
Token ID845
Feature activation+1.85e-01

Pos logit contributions
 babys+0.007
 haircut+0.006
 complexion+0.006
 gloss+0.006
 lipstick+0.006
Neg logit contributions
ales-0.010
reci-0.009
rost-0.009
ility-0.008
urbed-0.008
Token still
Token ID991
Feature activation-6.42e-02

Pos logit contributions
 babys+0.335
 haircut+0.306
 happened+0.296
 complexion+0.294
 cough+0.290
Neg logit contributions
ales-0.232
isk-0.204
ility-0.192
orting-0.191
reci-0.190
Token,
Token ID11
Feature activation+5.77e-02

Pos logit contributions
ales+0.108
isk+0.095
rost+0.095
reci+0.094
angles+0.092
Neg logit contributions
 babys-0.092
 haircut-0.086
 happened-0.078
 gloss-0.077
 complexion-0.076
Token not
Token ID407
Feature activation+1.54e-02

Pos logit contributions
 babys+0.064
 haircut+0.060
 happened+0.053
 gloss+0.053
 underwear+0.052
Neg logit contributions
ales-0.120
reci-0.108
angles-0.106
rost-0.106
orting-0.105
Token wanting
Token ID10291
Feature activation-4.34e-02

Pos logit contributions
 babys+0.021
 haircut+0.020
 happened+0.018
 gloss+0.018
 complexion+0.018
Neg logit contributions
ales-0.026
orting-0.023
reci-0.023
mediately-0.023
ining-0.023
Token to
Token ID284
Feature activation+6.05e-03

Pos logit contributions
ales+0.076
mediately+0.070
ining+0.069
isk+0.068
reci+0.068
Neg logit contributions
 babys-0.058
 haircut-0.055
 happened-0.051
 gloss-0.049
 hairc-0.048
Token named
Token ID3706
Feature activation-1.91e-02

Pos logit contributions
 babys+0.051
 happened+0.046
 haircut+0.044
 gloss+0.042
 complexion+0.040
Neg logit contributions
ales-0.063
mediately-0.059
reci-0.058
rost-0.057
ining-0.056
Token Dave
Token ID9935
Feature activation-3.91e-02

Pos logit contributions
ales+0.029
reci+0.029
mediately+0.029
orting+0.029
ining+0.028
Neg logit contributions
 babys-0.027
 happened-0.024
 haircut-0.024
 gloss-0.023
 lipstick-0.021
Token.
Token ID13
Feature activation+5.55e-02

Pos logit contributions
ales+0.061
mediately+0.055
orting+0.054
reci+0.054
ining+0.053
Neg logit contributions
 babys-0.057
 haircut-0.051
 happened-0.051
 complexion-0.048
 gloss-0.048
Token Dave
Token ID9935
Feature activation-4.21e-02

Pos logit contributions
 babys+0.082
 happened+0.075
 haircut+0.073
 complexion+0.069
 photo+0.069
Neg logit contributions
ales-0.086
wed-0.081
isk-0.078
ches-0.076
ility-0.075
Token was
Token ID373
Feature activation+1.20e-01

Pos logit contributions
ales+0.081
orting+0.076
ility+0.075
rost+0.074
lated+0.073
Neg logit contributions
 babys-0.045
 happened-0.045
 haircut-0.038
 complexion-0.037
 rang-0.036
Token very
Token ID845
Feature activation+1.85e-01

Pos logit contributions
 babys+0.122
 haircut+0.101
 happened+0.096
 gloss+0.090
 complexion+0.084
Neg logit contributions
ales-0.253
reci-0.234
orting-0.232
rost-0.228
wed-0.226
Token big
Token ID1263
Feature activation-1.61e-02

Pos logit contributions
 babys+0.200
 haircut+0.168
 happened+0.160
 complexion+0.153
 gloss+0.149
Neg logit contributions
ales-0.374
orting-0.339
isk-0.336
wed-0.333
ility-0.333
Token and
Token ID290
Feature activation-3.01e-02

Pos logit contributions
ales+0.022
mediately+0.020
isk+0.019
rost+0.019
ining+0.019
Neg logit contributions
 babys-0.027
 haircut-0.025
 happened-0.024
 complexion-0.023
 gloss-0.023
Token he
Token ID339
Feature activation-4.89e-02

Pos logit contributions
ales+0.055
orting+0.049
wed+0.049
ility+0.048
urbed+0.048
Neg logit contributions
 babys-0.037
 happened-0.035
 haircut-0.033
 gloss-0.032
 complexion-0.030
Token liked
Token ID8288
Feature activation-3.63e-02

Pos logit contributions
ales+0.092
orting+0.085
ility+0.083
reci+0.082
urbed+0.079
Neg logit contributions
 babys-0.061
 happened-0.058
 haircut-0.054
 gloss-0.049
 complexion-0.048
Token to
Token ID284
Feature activation+4.18e-03

Pos logit contributions
ales+0.066
mediately+0.063
reci+0.061
orting+0.061
isk+0.060
Neg logit contributions
 babys-0.045
 haircut-0.041
 happened-0.040
 hairc-0.036
 gloss-0.035
Token but
Token ID475
Feature activation-9.43e-02

Pos logit contributions
ales+0.226
orting+0.206
isk+0.200
reci+0.200
ility+0.200
Neg logit contributions
 babys-0.180
 haircut-0.162
 happened-0.161
 gloss-0.152
 complexion-0.150
Token Tim
Token ID5045
Feature activation-5.38e-02

Pos logit contributions
ales+0.161
orting+0.146
reci+0.146
ining+0.144
isk+0.143
Neg logit contributions
 babys-0.129
 happened-0.119
 haircut-0.118
 gloss-0.108
 hairc-0.103
Token wasn
Token ID2492
Feature activation+9.68e-02

Pos logit contributions
ales+0.073
ility+0.065
orting+0.064
reci+0.064
ining+0.064
Neg logit contributions
 babys-0.088
 happened-0.081
 haircut-0.081
 gloss-0.077
 complexion-0.074
Token't
Token ID470
Feature activation+1.93e-02

Pos logit contributions
 babys+0.129
 haircut+0.115
 happened+0.108
 gloss+0.107
 complexion+0.105
Neg logit contributions
ales-0.163
mediately-0.149
ining-0.148
isk-0.146
orting-0.146
Token being
Token ID852
Feature activation-2.02e-02

Pos logit contributions
 babys+0.028
 haircut+0.025
 gloss+0.024
 happened+0.024
 complexion+0.023
Neg logit contributions
ales-0.030
mediately-0.027
ining-0.027
urbed-0.027
isk-0.027
Token very
Token ID845
Feature activation+1.85e-01

Pos logit contributions
ales+0.040
mediately+0.037
ining+0.037
urbed+0.037
isk+0.037
Neg logit contributions
 babys-0.021
 haircut-0.018
 happened-0.016
 gloss-0.016
 complexion-0.016
Token nice
Token ID3621
Feature activation-2.30e-02

Pos logit contributions
 haircut+0.126
 babys+0.126
 happened+0.107
 liar+0.107
 mad+0.098
Neg logit contributions
ales-0.420
rant-0.398
ility-0.395
stack-0.392
ining-0.389
Token.
Token ID13
Feature activation+5.08e-02

Pos logit contributions
ales+0.031
mediately+0.028
ining+0.028
orting+0.027
isk+0.027
Neg logit contributions
 babys-0.038
 haircut-0.034
 happened-0.033
 gloss-0.033
 complexion-0.032
Token He
Token ID679
Feature activation-1.34e-01

Pos logit contributions
 babys+0.084
 gloss+0.073
 happened+0.070
 haircut+0.069
 lipstick+0.067
Neg logit contributions
ales-0.074
wed-0.069
ility-0.066
rant-0.066
reci-0.066
Token was
Token ID373
Feature activation+1.18e-01

Pos logit contributions
ales+0.251
ility+0.230
orting+0.229
reci+0.226
urbed+0.220
Neg logit contributions
 babys-0.161
 happened-0.148
 haircut-0.141
 gloss-0.133
 cough-0.127
Token being
Token ID852
Feature activation-2.25e-02

Pos logit contributions
 babys+0.111
 haircut+0.092
 happened+0.085
 gloss+0.079
 cough+0.074
Neg logit contributions
ales-0.261
reci-0.240
orting-0.237
rost-0.230
angles-0.225
Token course
Token ID1781
Feature activation-1.69e-03

Pos logit contributions
ales+0.053
isk+0.042
wed+0.042
mediately+0.041
rant+0.041
Neg logit contributions
 babys-0.143
 haircut-0.138
 happened-0.132
 hairc-0.130
 panc-0.125
Token,
Token ID11
Feature activation+6.42e-02

Pos logit contributions
ales+0.003
wed+0.003
reci+0.003
mediately+0.003
isk+0.003
Neg logit contributions
 babys-0.002
 haircut-0.002
 happened-0.002
 gloss-0.002
 hairc-0.002
Token if
Token ID611
Feature activation-3.13e-02

Pos logit contributions
 haircut+0.083
 babys+0.082
 gloss+0.070
 hairc+0.070
 happened+0.066
Neg logit contributions
ales-0.116
wed-0.111
rant-0.109
reci-0.108
ws-0.107
Token you
Token ID345
Feature activation-2.08e-03

Pos logit contributions
ales+0.048
ining+0.043
mediately+0.043
isk+0.043
reci+0.042
Neg logit contributions
 babys-0.048
 haircut-0.045
 happened-0.043
 hairc-0.040
 gloss-0.040
Token are
Token ID389
Feature activation+9.07e-02

Pos logit contributions
ales+0.003
mediately+0.003
ining+0.003
lated+0.003
orting+0.003
Neg logit contributions
 babys-0.003
 haircut-0.002
 happened-0.002
ird-0.002
 cough-0.002
Token very
Token ID845
Feature activation+1.85e-01

Pos logit contributions
 babys+0.108
 haircut+0.098
 happened+0.087
 hairc+0.081
 cough+0.081
Neg logit contributions
ales-0.173
orting-0.159
reci-0.159
rost-0.157
mediately-0.156
Token careful
Token ID8161
Feature activation-5.72e-02

Pos logit contributions
 babys+0.203
 haircut+0.183
 happened+0.163
 hairc+0.159
 complexion+0.154
Neg logit contributions
ales-0.366
isk-0.344
orting-0.335
ility-0.329
 itself-0.329
Token.
Token ID13
Feature activation+5.44e-02

Pos logit contributions
ales+0.092
reci+0.085
isk+0.085
rost+0.083
mediately+0.083
Neg logit contributions
 babys-0.079
 haircut-0.073
 happened-0.069
 gloss-0.067
 hairc-0.065
Tokenâ
Token ID22940
Feature activation-7.94e-02

Pos logit contributions
 babys+0.081
 gloss+0.074
 haircut+0.072
ird+0.070
 hairc+0.066
Neg logit contributions
ales-0.085
wed-0.081
urbed-0.076
ling-0.076
rant-0.075
Token
Token ID26391
Feature activation+1.24e-01

Pos logit contributions
urbed+0.165
ales+0.164
 tunnels+0.160
angles+0.159
 often+0.156
Neg logit contributions
 babys-0.054
 haircut-0.051
ird-0.043
 gloss-0.041
 virus-0.040
Token
Token ID198
Feature activation-1.96e-02

Pos logit contributions
 babys+0.167
 haircut+0.149
 happened+0.141
 gloss+0.139
 complexion+0.136
Neg logit contributions
ales-0.207
mediately-0.190
urbed-0.186
ining-0.185
reci-0.185
Token named
Token ID3706
Feature activation-1.65e-02

Pos logit contributions
 babys+0.193
 haircut+0.173
 happened+0.165
 gloss+0.164
 complexion+0.159
Neg logit contributions
ales-0.222
mediately-0.205
ining-0.203
reci-0.200
 often-0.200
Token Benny
Token ID44275
Feature activation-5.35e-02

Pos logit contributions
mediately+0.026
ales+0.026
wed+0.025
reci+0.025
ining+0.025
Neg logit contributions
 babys-0.024
 happened-0.021
 haircut-0.020
 gloss-0.019
 complexion-0.019
Token.
Token ID13
Feature activation+5.11e-02

Pos logit contributions
ales+0.083
mediately+0.075
ining+0.074
orting+0.074
reci+0.073
Neg logit contributions
 babys-0.079
 haircut-0.070
 happened-0.069
 gloss-0.066
 complexion-0.066
Token Benny
Token ID44275
Feature activation-6.13e-02

Pos logit contributions
 babys+0.072
 gloss+0.064
 violin+0.059
 underwear+0.057
 makeup+0.057
Neg logit contributions
ales-0.085
wed-0.082
ck-0.074
ws-0.074
rant-0.074
Token was
Token ID373
Feature activation+1.19e-01

Pos logit contributions
ales+0.115
rost+0.108
ility+0.105
orting+0.104
mediately+0.100
Neg logit contributions
 babys-0.070
 happened-0.067
 complexion-0.058
 cough-0.056
 gloss-0.056
Token very
Token ID845
Feature activation+1.85e-01

Pos logit contributions
 babys+0.134
 haircut+0.113
 happened+0.109
 gloss+0.105
 complexion+0.102
Neg logit contributions
ales-0.234
reci-0.212
orting-0.211
rost-0.209
mediately-0.208
Token restless
Token ID42537
Feature activation-4.38e-02

Pos logit contributions
 babys+0.211
 haircut+0.178
 happened+0.172
 complexion+0.166
 gloss+0.163
Neg logit contributions
ales-0.361
orting-0.326
isk-0.323
reci-0.320
ility-0.320
Token and
Token ID290
Feature activation-3.19e-02

Pos logit contributions
ales+0.063
mediately+0.057
reci+0.056
rost+0.055
ining+0.055
Neg logit contributions
 babys-0.070
 haircut-0.064
 happened-0.063
 gloss-0.060
 complexion-0.060
Token couldn
Token ID3521
Feature activation+1.98e-02

Pos logit contributions
ales+0.060
orting+0.054
ility+0.053
reci+0.052
angles+0.052
Neg logit contributions
 babys-0.040
 happened-0.037
 haircut-0.034
 cough-0.032
 nightmare-0.032
Token't
Token ID470
Feature activation+1.93e-02

Pos logit contributions
 babys+0.028
 haircut+0.025
 complexion+0.024
 gloss+0.023
 happened+0.023
Neg logit contributions
ales-0.031
ining-0.028
mediately-0.028
isk-0.028
orting-0.028
Token sit
Token ID1650
Feature activation-4.36e-02

Pos logit contributions
 babys+0.032
 haircut+0.030
 happened+0.028
 gloss+0.028
 complexion+0.028
Neg logit contributions
ales-0.026
mediately-0.023
ining-0.023
isk-0.023
orting-0.023
Token because
Token ID780
Feature activation-7.71e-03

Pos logit contributions
 babys+0.127
 haircut+0.114
 happened+0.112
 gloss+0.110
 complexion+0.108
Neg logit contributions
ales-0.128
mediately-0.114
 often-0.112
ining-0.112
urbed-0.112
Token I
Token ID314
Feature activation-3.21e-02

Pos logit contributions
ales+0.012
isk+0.011
mediately+0.011
wed+0.011
orting+0.011
Neg logit contributions
 babys-0.011
 haircut-0.010
 happened-0.010
 gloss-0.010
 hairc-0.009
Token love
Token ID1842
Feature activation-1.16e-02

Pos logit contributions
ales+0.055
mediately+0.050
orting+0.049
ining+0.049
urbed+0.049
Neg logit contributions
 babys-0.043
 haircut-0.038
 happened-0.037
 gloss-0.035
 complexion-0.035
Token you
Token ID345
Feature activation-1.85e-03

Pos logit contributions
ales+0.017
mediately+0.016
reci+0.015
wed+0.015
isk+0.015
Neg logit contributions
 babys-0.018
 haircut-0.017
 happened-0.016
 hairc-0.016
 gloss-0.016
Token both
Token ID1111
Feature activation-1.32e-02

Pos logit contributions
ales+0.003
mediately+0.003
ining+0.003
isk+0.003
reci+0.003
Neg logit contributions
 babys-0.003
 haircut-0.002
 happened-0.002
 gloss-0.002
 complexion-0.002
Token very
Token ID845
Feature activation+1.85e-01

Pos logit contributions
ales+0.020
mediately+0.018
isk+0.018
orting+0.018
rost+0.018
Neg logit contributions
 babys-0.020
 haircut-0.018
 happened-0.017
 gloss-0.016
 complexion-0.016
Token much
Token ID881
Feature activation+3.85e-02

Pos logit contributions
 babys+0.162
 haircut+0.145
 happened+0.130
 complexion+0.126
 hairc+0.122
Neg logit contributions
ales-0.398
isk-0.371
mediately-0.365
orting-0.363
wed-0.359
Token.
Token ID13
Feature activation+4.77e-02

Pos logit contributions
 babys+0.059
 haircut+0.054
 happened+0.052
 gloss+0.050
 complexion+0.050
Neg logit contributions
ales-0.058
mediately-0.052
isk-0.052
ining-0.051
orting-0.050
Token The
Token ID383
Feature activation-1.35e-01

Pos logit contributions
 babys+0.070
 gloss+0.062
 haircut+0.060
ird+0.056
 panc+0.053
Neg logit contributions
ales-0.081
wed-0.076
lated-0.068
angles-0.068
urbed-0.068
Token baby
Token ID5156
Feature activation+8.45e-02

Pos logit contributions
ales+0.223
mediately+0.210
 often+0.209
urbed+0.208
reci+0.205
Neg logit contributions
 babys-0.166
 haircut-0.149
 happened-0.147
 gloss-0.146
 complexion-0.145
Token is
Token ID318
Feature activation+1.06e-01

Pos logit contributions
 babys+0.120
 haircut+0.107
 happened+0.103
 gloss+0.100
 complexion+0.097
Neg logit contributions
ales-0.138
mediately-0.124
orting-0.123
isk-0.123
ining-0.122
Token named
Token ID3706
Feature activation-2.34e-02

Pos logit contributions
 babys+0.024
 happened+0.021
 haircut+0.019
 tooth+0.018
 complexion+0.018
Neg logit contributions
ales-0.040
angles-0.039
lated-0.039
wed-0.038
rost-0.038
Token Benny
Token ID44275
Feature activation-5.99e-02

Pos logit contributions
reci+0.038
wed+0.038
mediately+0.037
orting+0.037
ales+0.036
Neg logit contributions
 babys-0.032
 happened-0.027
 stub-0.024
 gloss-0.023
 lost-0.023
Token.
Token ID13
Feature activation+5.32e-02

Pos logit contributions
ales+0.108
rost+0.098
mediately+0.098
orting+0.098
reci+0.097
Neg logit contributions
 babys-0.074
 happened-0.065
 complexion-0.061
 haircut-0.061
 cough-0.057
Token Benny
Token ID44275
Feature activation-6.18e-02

Pos logit contributions
 babys+0.070
 gloss+0.061
 violin+0.059
 underwear+0.056
 tut+0.055
Neg logit contributions
ales-0.093
wed-0.092
rost-0.085
ws-0.084
ck-0.082
Token was
Token ID373
Feature activation+1.22e-01

Pos logit contributions
ales+0.119
rost+0.112
ility+0.109
orting+0.108
angles+0.104
Neg logit contributions
 babys-0.066
 happened-0.065
 complexion-0.057
 cough-0.054
 rang-0.053
Token very
Token ID845
Feature activation+1.85e-01

Pos logit contributions
 babys+0.138
 haircut+0.115
 happened+0.112
 gloss+0.107
 complexion+0.104
Neg logit contributions
ales-0.239
reci-0.219
orting-0.217
rost-0.214
wed-0.210
Token scared
Token ID12008
Feature activation+8.80e-03

Pos logit contributions
 babys+0.199
 haircut+0.162
 happened+0.158
 complexion+0.155
 gloss+0.147
Neg logit contributions
ales-0.375
orting-0.342
isk-0.338
ility-0.336
wed-0.333
Token of
Token ID286
Feature activation-9.25e-02

Pos logit contributions
 babys+0.013
 happened+0.012
 haircut+0.011
 gloss+0.011
 complexion+0.010
Neg logit contributions
ales-0.014
isk-0.013
wed-0.013
rost-0.013
mediately-0.013
Token everything
Token ID2279
Feature activation+6.01e-03

Pos logit contributions
ales+0.140
mediately+0.128
rost+0.126
wed+0.125
angles+0.122
Neg logit contributions
 babys-0.144
 happened-0.138
 haircut-0.132
 nightmare-0.125
 virus-0.125
Token.
Token ID13
Feature activation+5.24e-02

Pos logit contributions
 happened+0.009
 babys+0.008
 haircut+0.008
 underwear+0.007
 nightmare+0.007
Neg logit contributions
ales-0.009
isk-0.009
rost-0.009
mediately-0.008
angles-0.008
Token He
Token ID679
Feature activation-1.29e-01

Pos logit contributions
 babys+0.082
 gloss+0.071
 happened+0.066
 underwear+0.065
 nightmare+0.065
Neg logit contributions
ales-0.081
wed-0.077
ws-0.072
rost-0.070
rant-0.070
Token Lily
Token ID20037
Feature activation-5.12e-03

Pos logit contributions
ales+0.145
reci+0.130
orting+0.129
ining+0.128
isk+0.127
Neg logit contributions
 babys-0.134
 haircut-0.123
 happened-0.121
 gloss-0.113
 hairc-0.108
Token's
Token ID338
Feature activation+7.57e-02

Pos logit contributions
ales+0.008
mediately+0.007
ining+0.007
isk+0.007
orting+0.007
Neg logit contributions
 babys-0.007
 haircut-0.007
 happened-0.006
 gloss-0.006
 complexion-0.006
Token mom
Token ID1995
Feature activation+1.18e-01

Pos logit contributions
 babys+0.081
 haircut+0.067
 happened+0.067
 gloss+0.066
 complexion+0.065
Neg logit contributions
ales-0.139
mediately-0.132
rost-0.131
 often-0.131
isk-0.130
Tokenmy
Token ID1820
Feature activation+2.83e-02

Pos logit contributions
 babys+0.132
 happened+0.118
 haircut+0.111
 gloss+0.104
 cough+0.097
Neg logit contributions
ales-0.232
 itself-0.219
urbed-0.213
rost-0.213
ining-0.212
Token was
Token ID373
Feature activation+1.26e-01

Pos logit contributions
 babys+0.032
 happened+0.029
 haircut+0.027
 gloss+0.025
 cough+0.024
Neg logit contributions
ales-0.055
rost-0.050
 itself-0.050
urbed-0.050
mediately-0.050
Token very
Token ID845
Feature activation+1.85e-01

Pos logit contributions
 babys+0.141
 haircut+0.124
 happened+0.117
 gloss+0.117
 complexion+0.115
Neg logit contributions
ales-0.232
mediately-0.218
ining-0.216
 often-0.215
isk-0.215
Token smart
Token ID4451
Feature activation-4.58e-02

Pos logit contributions
 babys+0.197
 haircut+0.176
 gloss+0.170
 happened+0.167
 complexion+0.161
Neg logit contributions
ales-0.339
 often-0.326
mediately-0.326
ining-0.322
reci-0.314
Token and
Token ID290
Feature activation-2.60e-02

Pos logit contributions
ales+0.053
mediately+0.046
ining+0.045
orting+0.045
 often+0.044
Neg logit contributions
 babys-0.085
 haircut-0.078
 happened-0.075
 gloss-0.075
 complexion-0.074
Token knew
Token ID2993
Feature activation+6.95e-02

Pos logit contributions
ales+0.043
urbed+0.038
rost+0.038
ility+0.038
orting+0.038
Neg logit contributions
 babys-0.037
 haircut-0.034
 happened-0.033
 gloss-0.032
 complexion-0.030
Token how
Token ID703
Feature activation+5.64e-02

Pos logit contributions
 babys+0.098
 haircut+0.091
 happened+0.089
 gloss+0.085
 hairc+0.080
Neg logit contributions
ales-0.115
reci-0.104
mediately-0.104
ining-0.102
rost-0.101
Token to
Token ID284
Feature activation+4.89e-03

Pos logit contributions
 babys+0.074
 haircut+0.067
 happened+0.066
 gloss+0.063
 complexion+0.061
Neg logit contributions
ales-0.098
mediately-0.090
isk-0.089
rost-0.087
orting-0.087
Token Tim
Token ID5045
Feature activation-4.48e-02

Pos logit contributions
reci+0.022
mediately+0.022
ales+0.021
wed+0.021
ining+0.021
Neg logit contributions
 babys-0.021
 happened-0.019
 haircut-0.018
 hairc-0.017
 gloss-0.017
Token.
Token ID13
Feature activation+4.93e-02

Pos logit contributions
ales+0.073
reci+0.067
rost+0.066
orting+0.065
ining+0.065
Neg logit contributions
 babys-0.061
 happened-0.056
 haircut-0.051
 gloss-0.049
 complexion-0.049
Token Tim
Token ID5045
Feature activation-4.95e-02

Pos logit contributions
 babys+0.076
 gloss+0.065
 Lee+0.063
 Sorry+0.061
 happened+0.061
Neg logit contributions
wed-0.075
ales-0.073
ches-0.067
urbed-0.067
reci-0.066
Token was
Token ID373
Feature activation+1.19e-01

Pos logit contributions
ales+0.090
ility+0.085
rost+0.084
urbed+0.083
reci+0.082
Neg logit contributions
 babys-0.057
 happened-0.055
 gloss-0.047
 complexion-0.046
 haircut-0.046
Token a
Token ID257
Feature activation+1.29e-01

Pos logit contributions
 babys+0.112
 haircut+0.093
 happened+0.087
 gloss+0.084
 complexion+0.080
Neg logit contributions
ales-0.251
mediately-0.230
reci-0.230
orting-0.230
ining-0.229
Token very
Token ID845
Feature activation+1.85e-01

Pos logit contributions
 babys+0.171
 gloss+0.168
 happened+0.164
 underwear+0.158
 haircut+0.154
Neg logit contributions
 often-0.192
 freely-0.171
rost-0.169
oster-0.167
ining-0.162
Token tidy
Token ID43044
Feature activation-1.11e-01

Pos logit contributions
 babys+0.208
 haircut+0.177
 happened+0.167
 complexion+0.162
 gloss+0.158
Neg logit contributions
ales-0.361
urbed-0.329
ility-0.327
 itself-0.327
orting-0.325
Token kid
Token ID5141
Feature activation+7.66e-02

Pos logit contributions
ales+0.197
ining+0.180
mediately+0.180
reci+0.180
urbed+0.179
Neg logit contributions
 babys-0.142
 haircut-0.126
 complexion-0.114
 happened-0.113
 gloss-0.110
Token.
Token ID13
Feature activation+4.97e-02

Pos logit contributions
 babys+0.102
 haircut+0.089
 happened+0.083
 gloss+0.083
 complexion+0.080
Neg logit contributions
ales-0.134
mediately-0.124
reci-0.124
rost-0.123
ining-0.122
Token He
Token ID679
Feature activation-1.30e-01

Pos logit contributions
 babys+0.080
 gloss+0.065
 haircut+0.062
 classroom+0.062
 lipstick+0.062
Neg logit contributions
wed-0.074
ales-0.071
urbed-0.068
ws-0.066
ility-0.066
Token always
Token ID1464
Feature activation-1.21e-01

Pos logit contributions
ales+0.237
ility+0.213
orting+0.212
urbed+0.209
reci+0.207
Neg logit contributions
 babys-0.173
 happened-0.159
 haircut-0.149
 gloss-0.139
 complexion-0.138
TokenWow
Token ID22017
Feature activation-4.31e-02

Pos logit contributions
 babys+0.084
 haircut+0.076
 happened+0.075
 gloss+0.075
 complexion+0.072
Neg logit contributions
ales-0.049
wed-0.042
mediately-0.042
urbed-0.041
ining-0.041
Token,
Token ID11
Feature activation+5.60e-02

Pos logit contributions
ales+0.059
ining+0.055
mediately+0.054
 often+0.054
 itself+0.053
Neg logit contributions
 babys-0.072
 haircut-0.063
 virus-0.060
 gloss-0.060
 complexion-0.059
Token this
Token ID428
Feature activation-2.48e-02

Pos logit contributions
 babys+0.089
 haircut+0.080
 happened+0.077
 gloss+0.076
 complexion+0.076
Neg logit contributions
ales-0.078
mediately-0.071
ining-0.070
isk-0.069
orting-0.069
Token is
Token ID318
Feature activation+1.05e-01

Pos logit contributions
ales+0.032
mediately+0.029
ining+0.029
 often+0.028
reci+0.028
Neg logit contributions
 babys-0.042
 haircut-0.037
 gloss-0.036
 complexion-0.036
 happened-0.035
Token a
Token ID257
Feature activation+1.30e-01

Pos logit contributions
 babys+0.141
 haircut+0.125
 complexion+0.118
 gloss+0.117
 happened+0.114
Neg logit contributions
ales-0.173
mediately-0.161
ining-0.161
 often-0.158
 itself-0.157
Token very
Token ID845
Feature activation+1.84e-01

Pos logit contributions
 babys+0.181
 complexion+0.163
 happened+0.160
 gloss+0.159
burn+0.150
Neg logit contributions
 often-0.181
 freely-0.165
rost-0.163
 course-0.156
et-0.155
Token old
Token ID1468
Feature activation-4.93e-02

Pos logit contributions
 babys+0.248
 haircut+0.216
 happened+0.209
 gloss+0.208
 complexion+0.202
Neg logit contributions
ales-0.299
mediately-0.278
ining-0.276
 often-0.276
rost-0.272
Token letter
Token ID3850
Feature activation+4.33e-02

Pos logit contributions
ales+0.079
 often+0.073
mediately+0.073
 itself+0.072
urbed+0.072
Neg logit contributions
 babys-0.064
 haircut-0.055
 gloss-0.055
 happened-0.054
 complexion-0.052
Token.
Token ID13
Feature activation+4.76e-02

Pos logit contributions
 babys+0.068
 haircut+0.061
 happened+0.059
 gloss+0.058
 complexion+0.057
Neg logit contributions
ales-0.062
mediately-0.056
ining-0.056
orting-0.055
isk-0.055
Token It
Token ID632
Feature activation-7.98e-02

Pos logit contributions
 babys+0.078
 haircut+0.069
 gloss+0.068
 happened+0.065
 hairc+0.063
Neg logit contributions
ales-0.069
wed-0.061
isk-0.060
urbed-0.059
mediately-0.059
Token's
Token ID338
Feature activation+7.57e-02

Pos logit contributions
ales+0.133
orting+0.118
isk+0.115
mediately+0.115
urbed+0.114
Neg logit contributions
 babys-0.115
 happened-0.108
 haircut-0.103
 gloss-0.096
 cough-0.094
Tokenâ
Token ID22940
Feature activation-7.45e-02

Pos logit contributions
ales+0.138
orting+0.126
rost+0.125
isk+0.123
mediately+0.121
Neg logit contributions
 babys-0.095
 happened-0.088
 haircut-0.084
 cough-0.079
ird-0.078
Token
Token ID26391
Feature activation+1.30e-01

Pos logit contributions
ales+0.148
urbed+0.146
 often+0.145
angles+0.143
 tunnels+0.141
Neg logit contributions
 babys-0.061
 gloss-0.054
 haircut-0.053
 happened-0.051
 lipstick-0.047
Token
Token ID8151
Feature activation+5.00e-02

Pos logit contributions
 babys+0.184
 haircut+0.164
 happened+0.156
 gloss+0.154
 complexion+0.152
Neg logit contributions
ales-0.208
mediately-0.189
ining-0.188
isk-0.186
orting-0.185
Tokenre
Token ID260
Feature activation+9.67e-02

Pos logit contributions
 babys+0.068
 haircut+0.060
 happened+0.058
 gloss+0.057
 complexion+0.055
Neg logit contributions
ales-0.082
mediately-0.075
ining-0.074
reci-0.074
urbed-0.074
Token a
Token ID257
Feature activation+1.36e-01

Pos logit contributions
 babys+0.094
 happened+0.077
 very+0.075
 haircut+0.073
 supposed+0.067
Neg logit contributions
ales-0.198
rost-0.187
wed-0.180
reci-0.179
angles-0.179
Token very
Token ID845
Feature activation+1.84e-01

Pos logit contributions
 babys+0.186
 happened+0.170
 complexion+0.167
 gloss+0.164
 underwear+0.156
Neg logit contributions
 often-0.197
 freely-0.183
ining-0.181
urbed-0.180
mediately-0.178
Token impatient
Token ID33440
Feature activation+3.60e-03

Pos logit contributions
 babys+0.214
 haircut+0.189
 happened+0.173
 gloss+0.170
 complexion+0.165
Neg logit contributions
ales-0.352
isk-0.316
mediately-0.316
orting-0.316
urbed-0.315
Token girl
Token ID2576
Feature activation+7.06e-02

Pos logit contributions
 babys+0.004
 haircut+0.004
 happened+0.004
 gloss+0.004
 complexion+0.004
Neg logit contributions
ales-0.006
mediately-0.006
ining-0.006
orting-0.006
urbed-0.006
Token,
Token ID11
Feature activation+5.86e-02

Pos logit contributions
 babys+0.092
 haircut+0.083
 happened+0.080
 gloss+0.077
 complexion+0.073
Neg logit contributions
ales-0.122
mediately-0.111
rost-0.110
reci-0.110
isk-0.109
Tokenâ
Token ID22940
Feature activation-7.70e-02

Pos logit contributions
 babys+0.080
 haircut+0.078
 gloss+0.070
 happened+0.069
 hairc+0.066
Neg logit contributions
ales-0.104
wed-0.093
rant-0.090
reci-0.090
ility-0.089
Token
Token ID26391
Feature activation+1.30e-01

Pos logit contributions
ales+0.172
angles+0.167
urbed+0.166
 tunnels+0.164
 often+0.161
Neg logit contributions
 babys-0.047
 haircut-0.042
 gloss-0.037
 happened-0.035
 virus-0.033
Tokenâ
Token ID22940
Feature activation-7.82e-02

Pos logit contributions
ales+0.269
mediately+0.240
isk+0.239
rost+0.236
ining+0.236
Neg logit contributions
 babys-0.257
 haircut-0.235
 happened-0.229
 gloss-0.218
 complexion-0.211
Token
Token ID26391
Feature activation+1.29e-01

Pos logit contributions
urbed+0.154
ales+0.152
angles+0.151
 often+0.149
rost+0.146
Neg logit contributions
 babys-0.065
 haircut-0.062
 gloss-0.056
 happened-0.053
ird-0.052
Token
Token ID8151
Feature activation+5.16e-02

Pos logit contributions
 babys+0.167
 haircut+0.147
 happened+0.138
 gloss+0.137
 complexion+0.136
Neg logit contributions
ales-0.218
ining-0.201
isk-0.199
mediately-0.198
orting-0.197
Tokens
Token ID82
Feature activation+4.55e-02

Pos logit contributions
 babys+0.072
 haircut+0.063
 happened+0.061
 gloss+0.060
 complexion+0.057
Neg logit contributions
ales-0.084
mediately-0.076
ining-0.075
reci-0.075
urbed-0.075
Token a
Token ID257
Feature activation+1.41e-01

Pos logit contributions
 babys+0.061
 haircut+0.056
 happened+0.055
 gloss+0.050
 hairc+0.049
Neg logit contributions
ales-0.078
isk-0.070
rost-0.070
reci-0.069
orting-0.069
Token very
Token ID845
Feature activation+1.84e-01

Pos logit contributions
 babys+0.182
 happened+0.166
 complexion+0.163
 gloss+0.161
 haircut+0.153
Neg logit contributions
 often-0.202
ales-0.194
mediately-0.192
 freely-0.190
ining-0.190
Token angry
Token ID7954
Feature activation+5.33e-02

Pos logit contributions
 babys+0.229
 haircut+0.202
 happened+0.190
 gloss+0.187
 complexion+0.184
Neg logit contributions
ales-0.326
mediately-0.299
ining-0.296
isk-0.294
orting-0.294
Token man
Token ID582
Feature activation+5.47e-02

Pos logit contributions
 babys+0.068
 gloss+0.058
 happened+0.058
 haircut+0.057
 complexion+0.056
Neg logit contributions
ales-0.088
isk-0.080
mediately-0.080
orting-0.080
rost-0.079
Token up
Token ID510
Feature activation+7.52e-03

Pos logit contributions
 babys+0.072
 haircut+0.064
 happened+0.064
 gloss+0.058
 hairc+0.055
Neg logit contributions
ales-0.094
mediately-0.088
reci-0.088
rost-0.086
urbed-0.085
Token there
Token ID612
Feature activation-9.47e-02

Pos logit contributions
 babys+0.010
 haircut+0.009
 happened+0.009
 virus+0.008
 complexion+0.008
Neg logit contributions
ales-0.013
isk-0.012
rost-0.012
mediately-0.011
reci-0.011
Token.
Token ID13
Feature activation+5.03e-02

Pos logit contributions
ales+0.170
reci+0.157
mediately+0.157
wed+0.156
urbed+0.156
Neg logit contributions
 babys-0.115
 happened-0.104
 haircut-0.102
 gloss-0.089
 photo-0.087
Token too
Token ID1165
Feature activation+1.30e-01

Pos logit contributions
 babys+0.096
 happened+0.088
 haircut+0.087
 gloss+0.087
 complexion+0.085
Neg logit contributions
ales-0.066
 often-0.062
isk-0.059
 itself-0.058
orting-0.058
Token.
Token ID13
Feature activation+4.88e-02

Pos logit contributions
 babys+0.180
 haircut+0.161
 happened+0.154
 gloss+0.149
 complexion+0.148
Neg logit contributions
ales-0.212
mediately-0.193
ining-0.191
isk-0.190
orting-0.190
Token She
Token ID1375
Feature activation-1.35e-01

Pos logit contributions
 babys+0.079
 gloss+0.067
 haircut+0.066
 happened+0.062
ird+0.061
Neg logit contributions
ales-0.076
wed-0.070
lated-0.065
ility-0.064
rant-0.064
Token loved
Token ID6151
Feature activation-1.36e-02

Pos logit contributions
ales+0.248
orting+0.222
ility+0.220
rost+0.217
mediately+0.216
Neg logit contributions
 babys-0.173
 happened-0.158
 haircut-0.155
 cough-0.141
 gloss-0.137
Token Lily
Token ID20037
Feature activation-1.69e-02

Pos logit contributions
ales+0.023
mediately+0.022
reci+0.021
orting+0.021
ining+0.021
Neg logit contributions
 babys-0.019
 haircut-0.018
 happened-0.016
 hairc-0.016
 gloss-0.016
Token very
Token ID845
Feature activation+1.84e-01

Pos logit contributions
ales+0.027
mediately+0.024
rost+0.024
ining+0.024
reci+0.024
Neg logit contributions
 babys-0.024
 haircut-0.022
 gloss-0.021
 happened-0.021
 complexion-0.020
Token much
Token ID881
Feature activation+3.51e-02

Pos logit contributions
 babys+0.226
 haircut+0.200
 happened+0.188
 gloss+0.186
 complexion+0.180
Neg logit contributions
ales-0.328
mediately-0.302
ining-0.301
 often-0.299
urbed-0.296
Token.
Token ID13
Feature activation+4.91e-02

Pos logit contributions
 babys+0.048
 haircut+0.043
 happened+0.040
 complexion+0.040
 gloss+0.039
Neg logit contributions
ales-0.059
mediately-0.054
isk-0.053
reci-0.053
ining-0.053
Token She
Token ID1375
Feature activation-1.32e-01

Pos logit contributions
 babys+0.072
 gloss+0.059
 haircut+0.058
 lipstick+0.055
 makeup+0.054
Neg logit contributions
ales-0.084
wed-0.080
urbed-0.074
ling-0.072
ried-0.072
Token asked
Token ID1965
Feature activation-2.86e-02

Pos logit contributions
ales+0.224
orting+0.199
ility+0.199
mediately+0.198
urbed+0.197
Neg logit contributions
 babys-0.185
 happened-0.167
 haircut-0.165
 gloss-0.154
 complexion-0.153
Token mom
Token ID1995
Feature activation+1.13e-01

Pos logit contributions
ales+0.055
mediately+0.054
ining+0.053
reci+0.051
urbed+0.050
Neg logit contributions
 babys-0.034
 haircut-0.029
 happened-0.026
 hairc-0.025
 gloss-0.025
Token sc
Token ID629
Feature activation+7.97e-02

Pos logit contributions
 babys+0.069
 gloss+0.060
 virus+0.060
 happened+0.059
 haircut+0.056
Neg logit contributions
 often-0.087
 At-0.081
ales-0.081
 itself-0.081
 freely-0.080
Tokenooter
Token ID25141
Feature activation+4.62e-02

Pos logit contributions
 babys+0.132
 haircut+0.120
 gloss+0.113
 complexion+0.113
 photo+0.104
Neg logit contributions
ales-0.108
ining-0.102
urbed-0.101
 itself-0.098
mediately-0.098
Token.
Token ID13
Feature activation+4.52e-02

Pos logit contributions
 babys+0.072
 haircut+0.064
 happened+0.061
 gloss+0.061
 complexion+0.060
Neg logit contributions
ales-0.066
mediately-0.060
 often-0.060
ining-0.059
urbed-0.059
Token She
Token ID1375
Feature activation-1.41e-01

Pos logit contributions
 babys+0.068
 haircut+0.061
 gloss+0.056
 hairc+0.055
 photo+0.054
Neg logit contributions
ales-0.070
orting-0.065
wed-0.064
reci-0.064
cut-0.064
Token tr
Token ID491
Feature activation+3.33e-02

Pos logit contributions
ales+0.223
mediately+0.199
orting+0.199
isk+0.197
rost+0.197
Neg logit contributions
 babys-0.206
 haircut-0.187
 happened-0.180
 gloss-0.175
 complexion-0.173
Tokenund
Token ID917
Feature activation-2.22e-01

Pos logit contributions
 babys+0.059
 haircut+0.055
 happened+0.053
 gloss+0.052
 virus+0.051
Neg logit contributions
ales-0.042
ining-0.037
mediately-0.037
wed-0.036
isk-0.036
Tokenled
Token ID992
Feature activation-4.28e-02

Pos logit contributions
ales+0.251
ility+0.241
isk+0.240
rost+0.239
orting+0.236
Neg logit contributions
 happened-0.381
 babys-0.365
 haircut-0.360
 complexion-0.349
 gloss-0.339
Token along
Token ID1863
Feature activation-6.49e-02

Pos logit contributions
ales+0.070
isk+0.064
mediately+0.063
rost+0.063
orting+0.063
Neg logit contributions
 babys-0.060
 haircut-0.054
 happened-0.052
 gloss-0.050
 complexion-0.049
Token the
Token ID262
Feature activation-1.49e-02

Pos logit contributions
ales+0.102
reci+0.092
rost+0.092
orting+0.090
mediately+0.089
Neg logit contributions
 babys-0.101
 haircut-0.096
 happened-0.091
 hairc-0.086
 photo-0.085
Token sidewalk
Token ID24058
Feature activation-5.15e-03

Pos logit contributions
ales+0.025
wed+0.023
mediately+0.023
reci+0.023
orting+0.023
Neg logit contributions
 babys-0.021
 haircut-0.019
 photo-0.017
 complexion-0.016
 virus-0.016
Token all
Token ID477
Feature activation-5.21e-02

Pos logit contributions
ales+0.008
reci+0.007
rost+0.007
mediately+0.007
orting+0.007
Neg logit contributions
 babys-0.008
 haircut-0.007
 happened-0.007
 gloss-0.007
 complexion-0.007
Token
Token ID198
Feature activation-3.31e-02

Pos logit contributions
ales+0.068
ility+0.062
et+0.061
wed+0.061
rant+0.061
Neg logit contributions
 babys-0.034
 happened-0.029
 haircut-0.028
+.-0.026
 complexion-0.026
TokenEvery
Token ID6109
Feature activation-9.76e-02

Pos logit contributions
ales+0.063
rant+0.056
wed+0.055
et+0.054
ale+0.053
Neg logit contributions
 babys-0.034
+.-0.034
odic-0.033
 haircut-0.033
 jan-0.032
Token time
Token ID640
Feature activation+2.23e-02

Pos logit contributions
ales+0.175
isk+0.154
orting+0.152
wed+0.151
reci+0.149
Neg logit contributions
 babys-0.129
 haircut-0.121
 happened-0.120
 nightmare-0.113
 virus-0.110
Token it
Token ID340
Feature activation+4.32e-02

Pos logit contributions
 babys+0.035
 happened+0.034
 haircut+0.031
 hairc+0.030
 nightmare+0.028
Neg logit contributions
ales-0.035
isk-0.032
mediately-0.031
wed-0.031
rant-0.031
Token th
Token ID294
Feature activation-5.90e-02

Pos logit contributions
 happened+0.050
 babys+0.047
 haircut+0.041
 rang+0.040
 nightmare+0.038
Neg logit contributions
ales-0.086
orting-0.078
rost-0.078
mediately-0.076
angles-0.075
Tokenund
Token ID917
Feature activation-2.20e-01

Pos logit contributions
rost+0.067
ales+0.065
 often+0.063
reci+0.060
mediately+0.060
Neg logit contributions
 babys-0.097
 happened-0.095
 gloss-0.092
 complexion-0.090
burn-0.086
Tokenered
Token ID1068
Feature activation-3.49e-02

Pos logit contributions
isk+0.312
ility+0.303
ales+0.302
reci+0.288
rost+0.284
Neg logit contributions
 happened-0.330
 babys-0.275
 haircut-0.269
 complexion-0.268
 lipstick-0.265
Token afterwards
Token ID12979
Feature activation-6.95e-03

Pos logit contributions
isk+0.074
ales+0.070
rost+0.067
wed+0.067
mediately+0.066
Neg logit contributions
 happened-0.031
 very-0.028
 babys-0.028
 ghost-0.024
 nos-0.023
Token,
Token ID11
Feature activation+4.59e-02

Pos logit contributions
ales+0.017
isk+0.016
wed+0.016
ws+0.016
mediately+0.016
Neg logit contributions
 babys-0.005
 happened-0.005
 haircut-0.004
 virus-0.003
 tooth-0.003
Token the
Token ID262
Feature activation-1.63e-02

Pos logit contributions
 babys+0.062
 happened+0.061
 haircut+0.057
 hairc+0.056
 trouble+0.054
Neg logit contributions
ales-0.079
wed-0.077
isk-0.075
mediately-0.071
rant-0.071
Token famous
Token ID5863
Feature activation-6.21e-02

Pos logit contributions
ales+0.033
wed+0.031
rost+0.030
ining+0.030
mediately+0.030
Neg logit contributions
 babys-0.019
 haircut-0.015
 virus-0.015
 happened-0.015
 nightmare-0.015
Token
Token ID198
Feature activation-3.21e-02

Pos logit contributions
 babys+0.075
 happened+0.067
 gloss+0.060
ird+0.059
uts+0.056
Neg logit contributions
ales-0.140
rant-0.136
urbed-0.133
et-0.132
wed-0.131
Token
Token ID198
Feature activation-3.18e-02

Pos logit contributions
ales+0.069
wed+0.062
et+0.062
ility+0.062
rant+0.061
Neg logit contributions
 babys-0.029
 happened-0.023
 gloss-0.022
odic-0.022
 panc-0.021
TokenThe
Token ID464
Feature activation-1.26e-01

Pos logit contributions
ales+0.060
rant+0.054
et+0.052
wed+0.052
ws+0.051
Neg logit contributions
 babys-0.035
odic-0.034
 gloss-0.032
Grand-0.030
+.-0.030
Token storm
Token ID6388
Feature activation-1.33e-02

Pos logit contributions
ales+0.230
ining+0.204
orting+0.203
rost+0.202
mediately+0.200
Neg logit contributions
 babys-0.167
 haircut-0.143
 happened-0.139
 gloss-0.134
 tooth-0.131
Token th
Token ID294
Feature activation-5.85e-02

Pos logit contributions
ales+0.030
rost+0.027
ws+0.026
isk+0.025
orting+0.025
Neg logit contributions
 happened-0.015
 rang-0.012
 babys-0.011
 hurt-0.010
 ghost-0.009
Tokenund
Token ID917
Feature activation-2.19e-01

Pos logit contributions
rost+0.079
 often+0.072
 elsewhere+0.068
hips+0.067
 closely+0.065
Neg logit contributions
 happened-0.086
 babys-0.084
 gloss-0.082
 complexion-0.078
odic-0.077
Tokenered
Token ID1068
Feature activation-3.47e-02

Pos logit contributions
rost+0.331
ility+0.322
isk+0.319
utt+0.306
ales+0.305
Neg logit contributions
 happened-0.301
 babys-0.252
 complexion-0.245
 lipstick-0.240
 trouble-0.225
Token and
Token ID290
Feature activation-3.97e-02

Pos logit contributions
isk+0.072
rost+0.069
ales+0.068
reci+0.064
ws+0.064
Neg logit contributions
 happened-0.032
 very-0.032
 babys-0.028
 scared-0.026
 shocked-0.026
Token roared
Token ID45615
Feature activation+3.38e-02

Pos logit contributions
ales+0.064
orting+0.059
acking+0.058
rost+0.057
reci+0.057
Neg logit contributions
 happened-0.056
 babys-0.055
 rang-0.048
 haircut-0.047
 nightmare-0.047
Token above
Token ID2029
Feature activation-1.18e-01

Pos logit contributions
 happened+0.038
 babys+0.038
 very+0.034
 gloss+0.033
 haircut+0.033
Neg logit contributions
ales-0.061
reci-0.057
rost-0.056
angles-0.055
wed-0.055
Token,
Token ID11
Feature activation+4.78e-02

Pos logit contributions
ales+0.239
isk+0.228
mediately+0.221
rost+0.219
wed+0.216
Neg logit contributions
 happened-0.137
 babys-0.112
 very-0.106
 nightmare-0.103
 ghost-0.097
Token that
Token ID326
Feature activation-7.84e-02

Pos logit contributions
 haircut+0.071
 babys+0.066
 makeup+0.059
 lipstick+0.058
 happened+0.056
Neg logit contributions
ales-0.107
acking-0.097
wed-0.095
rant-0.094
mediately-0.093
Token's
Token ID338
Feature activation+7.58e-02

Pos logit contributions
ales+0.131
isk+0.117
mediately+0.116
ility+0.116
urbed+0.116
Neg logit contributions
 babys-0.109
 haircut-0.101
 happened-0.099
 gloss-0.091
 photo-0.090
Token not
Token ID407
Feature activation+8.39e-03

Pos logit contributions
 babys+0.108
 complexion+0.097
 haircut+0.095
 gloss+0.093
 happened+0.087
Neg logit contributions
ales-0.115
 itself-0.112
mediately-0.112
isk-0.111
ining-0.110
Token nice
Token ID3621
Feature activation-3.06e-02

Pos logit contributions
 babys+0.011
 haircut+0.010
 happened+0.009
 gloss+0.009
 complexion+0.009
Neg logit contributions
ales-0.014
mediately-0.013
reci-0.013
orting-0.013
urbed-0.013
Token.
Token ID13
Feature activation+4.83e-02

Pos logit contributions
ales+0.048
mediately+0.044
ining+0.043
isk+0.043
reci+0.043
Neg logit contributions
 babys-0.045
 haircut-0.041
 happened-0.039
 gloss-0.038
 complexion-0.037
Token Taking
Token ID20879
Feature activation-2.11e-01

Pos logit contributions
 babys+0.066
 gloss+0.060
 lipstick+0.056
 haircut+0.054
 makeup+0.054
Neg logit contributions
ales-0.082
wed-0.076
urbed-0.075
Years-0.074
ling-0.073
Token things
Token ID1243
Feature activation+2.98e-02

Pos logit contributions
ales+0.351
isk+0.339
rost+0.337
mediately+0.335
ining+0.333
Neg logit contributions
 haircut-0.267
 babys-0.262
 hairc-0.252
 happened-0.245
 lipstick-0.241
Token that
Token ID326
Feature activation-7.78e-02

Pos logit contributions
 babys+0.041
 happened+0.038
 haircut+0.038
 lipstick+0.034
 photo+0.034
Neg logit contributions
ales-0.047
mediately-0.046
isk-0.044
rost-0.044
ining-0.043
Token aren
Token ID3588
Feature activation+8.81e-03

Pos logit contributions
ales+0.135
urbed+0.127
mediately+0.127
ining+0.124
ility+0.123
Neg logit contributions
 babys-0.102
 happened-0.092
 haircut-0.092
 photo-0.085
 wallet-0.082
Token't
Token ID470
Feature activation+1.35e-02

Pos logit contributions
 babys+0.007
 happened+0.006
 haircut+0.005
 gloss+0.005
 hurt+0.004
Neg logit contributions
ales-0.019
mediately-0.018
rost-0.018
urbed-0.018
wed-0.018
Token yours
Token ID12431
Feature activation-1.55e-02

Pos logit contributions
 babys+0.018
 happened+0.018
 haircut+0.017
 photo+0.015
 virus+0.015
Neg logit contributions
ales-0.021
mediately-0.020
reci-0.020
orting-0.020
isk-0.020
Token came
Token ID1625
Feature activation+5.17e-02

Pos logit contributions
 happened+0.076
 babys+0.067
 haircut+0.060
 gloss+0.056
 hurt+0.054
Neg logit contributions
ales-0.107
mediately-0.100
reci-0.098
isk-0.098
 Later-0.097
Token from
Token ID422
Feature activation-3.88e-02

Pos logit contributions
 babys+0.062
 happened+0.058
 haircut+0.049
 hurt+0.048
burn+0.048
Neg logit contributions
ales-0.090
reci-0.086
wed-0.084
rost-0.084
orting-0.083
Token it
Token ID340
Feature activation+3.97e-02

Pos logit contributions
ales+0.060
reci+0.058
orting+0.058
ining+0.057
isk+0.056
Neg logit contributions
 happened-0.053
 babys-0.052
 haircut-0.046
 trouble-0.045
 nightmare-0.045
Token.
Token ID13
Feature activation+4.14e-02

Pos logit contributions
 happened+0.028
 babys+0.024
 very+0.023
 nightmare+0.021
 hurt+0.021
Neg logit contributions
ales-0.088
rost-0.087
reci-0.085
angles-0.085
mediately-0.082
Token Sc
Token ID1446
Feature activation-2.54e-02

Pos logit contributions
 babys+0.066
 happened+0.054
 haircut+0.053
 gloss+0.053
 Lee+0.051
Neg logit contributions
ales-0.066
wed-0.064
ility-0.060
reci-0.059
ws-0.059
Tokenutt
Token ID15318
Feature activation-2.11e-01

Pos logit contributions
 often+0.033
 areas+0.032
 directions+0.031
 rivers+0.031
 valleys+0.031
Neg logit contributions
 babys-0.035
 gloss-0.032
ird-0.031
 haircut-0.031
 happened-0.030
Tokenles
Token ID829
Feature activation-2.91e-02

Pos logit contributions
rost+0.391
ility+0.365
 Along+0.365
 If+0.362
 Whenever+0.357
Neg logit contributions
 happened-0.242
 hurt-0.179
 rang-0.174
 bumped-0.159
 babys-0.158
Token was
Token ID373
Feature activation+1.19e-01

Pos logit contributions
ales+0.055
rost+0.053
reci+0.052
angles+0.051
isk+0.050
Neg logit contributions
 happened-0.033
 babys-0.032
 hurt-0.028
 rang-0.028
 shocked-0.028
Token so
Token ID523
Feature activation+1.50e-01

Pos logit contributions
 babys+0.130
 happened+0.120
 haircut+0.106
 gloss+0.101
 nightmare+0.094
Neg logit contributions
ales-0.234
reci-0.222
angles-0.216
orting-0.215
rost-0.212
Token frightened
Token ID24776
Feature activation-1.20e-03

Pos logit contributions
 happened+0.074
 babys+0.061
 mad+0.052
 shocked+0.049
 scared+0.048
Neg logit contributions
ales-0.399
isk-0.367
angles-0.367
ility-0.365
orting-0.364
Token,
Token ID11
Feature activation+4.99e-02

Pos logit contributions
wed+0.002
ws+0.002
angles+0.002
reci+0.002
ales+0.002
Neg logit contributions
 happened-0.001
 babys-0.001
 ghost-0.001
 haircut-0.001
 gloss-0.001
Token began
Token ID2540
Feature activation+7.77e-02

Pos logit contributions
ales+0.075
reci+0.069
rost+0.068
angles+0.068
mediately+0.064
Neg logit contributions
 happened-0.044
 babys-0.042
 hurt-0.040
 crying-0.039
 scared-0.037
Token screaming
Token ID14788
Feature activation-3.19e-02

Pos logit contributions
 babys+0.074
 happened+0.063
 haircut+0.061
 hairc+0.053
 crying+0.053
Neg logit contributions
ales-0.169
rost-0.159
mediately-0.159
angles-0.159
reci-0.157
Token again
Token ID757
Feature activation+1.61e-02

Pos logit contributions
ales+0.059
mediately+0.055
reci+0.055
rost+0.055
angles+0.055
Neg logit contributions
 babys-0.036
 happened-0.033
 haircut-0.032
 gloss-0.030
 complexion-0.030
Token.
Token ID13
Feature activation+3.91e-02

Pos logit contributions
 babys+0.014
 happened+0.013
 haircut+0.013
 complexion+0.011
 nightmare+0.011
Neg logit contributions
ales-0.033
rost-0.032
angles-0.032
mediately-0.031
reci-0.031
Token Sc
Token ID1446
Feature activation-2.58e-02

Pos logit contributions
 babys+0.062
 gloss+0.055
 jan+0.052
 happened+0.051
 haircut+0.050
Neg logit contributions
ales-0.058
wed-0.055
rant-0.055
reci-0.055
angles-0.054
Tokenutt
Token ID15318
Feature activation-2.10e-01

Pos logit contributions
 often+0.033
 areas+0.032
 directions+0.032
 valleys+0.032
 rivers+0.032
Neg logit contributions
 babys-0.033
 gloss-0.032
ird-0.031
 wallet-0.029
 happened-0.029
Tokenles
Token ID829
Feature activation-3.05e-02

Pos logit contributions
rost+0.363
 If+0.334
 Along+0.331
ility+0.325
 Step+0.323
Neg logit contributions
 happened-0.260
 hurt-0.203
 rang-0.203
 babys-0.194
 bumped-0.186
Token was
Token ID373
Feature activation+1.18e-01

Pos logit contributions
ales+0.056
rost+0.054
isk+0.051
angles+0.050
mediately+0.050
Neg logit contributions
 happened-0.036
 babys-0.035
 hurt-0.032
 rang-0.031
 shocked-0.030
Token so
Token ID523
Feature activation+1.51e-01

Pos logit contributions
 babys+0.120
 happened+0.110
 haircut+0.096
 hurt+0.095
 nightmare+0.094
Neg logit contributions
ales-0.248
angles-0.232
reci-0.231
rost-0.223
orting-0.222
Token scared
Token ID12008
Feature activation+2.45e-04

Pos logit contributions
 happened+0.057
 babys+0.055
 mad+0.046
 scared+0.043
 shocked+0.043
Neg logit contributions
ales-0.415
angles-0.384
isk-0.380
rost-0.373
orting-0.370
Token,
Token ID11
Feature activation+4.70e-02

Pos logit contributions
 happened+0.000
 babys+0.000
 haircut+0.000
 ghost+0.000
 very+0.000
Neg logit contributions
ws-0.000
wed-0.000
isk-0.000
angles-0.000
reci-0.000
Token be
Token ID307
Feature activation+4.49e-02

Pos logit contributions
 babys+0.010
 haircut+0.009
 happened+0.008
 gloss+0.008
 complexion+0.008
Neg logit contributions
ales-0.008
mediately-0.007
ining-0.007
urbed-0.007
reci-0.007
Token healthy
Token ID5448
Feature activation-4.77e-02

Pos logit contributions
 babys+0.072
 haircut+0.065
 happened+0.063
 gloss+0.062
 complexion+0.061
Neg logit contributions
ales-0.061
ining-0.058
 often-0.058
mediately-0.056
isk-0.055
Token and
Token ID290
Feature activation-3.97e-02

Pos logit contributions
ales+0.056
mediately+0.049
 often+0.048
ining+0.048
orting+0.048
Neg logit contributions
 babys-0.087
 haircut-0.080
 happened-0.077
 gloss-0.077
 complexion-0.076
Token happy
Token ID3772
Feature activation-1.55e-02

Pos logit contributions
ales+0.067
mediately+0.059
ining+0.059
urbed+0.059
orting+0.059
Neg logit contributions
 babys-0.054
 haircut-0.050
 happened-0.047
 gloss-0.046
 complexion-0.045
Token.
Token ID13
Feature activation+3.69e-02

Pos logit contributions
ales+0.020
 often+0.018
mediately+0.018
ining+0.018
orting+0.018
Neg logit contributions
 babys-0.026
 haircut-0.023
 gloss-0.022
 happened-0.022
 complexion-0.022
Token Taking
Token ID20879
Feature activation-2.09e-01

Pos logit contributions
 babys+0.057
 haircut+0.051
 gloss+0.050
 happened+0.046
 lipstick+0.046
Neg logit contributions
ales-0.056
wed-0.050
ining-0.049
urbed-0.049
isk-0.048
Token your
Token ID534
Feature activation-2.67e-02

Pos logit contributions
ales+0.339
mediately+0.309
ining+0.306
isk+0.304
orting+0.302
Neg logit contributions
 babys-0.291
 haircut-0.263
 happened-0.250
 gloss-0.246
 complexion-0.239
Token pill
Token ID9582
Feature activation+4.47e-02

Pos logit contributions
ales+0.039
reci+0.038
rost+0.038
urbed+0.036
isk+0.036
Neg logit contributions
 babys-0.036
 complexion-0.033
 gloss-0.032
 haircut-0.032
 happened-0.031
Token is
Token ID318
Feature activation+1.05e-01

Pos logit contributions
 babys+0.069
 haircut+0.065
 happened+0.063
 gloss+0.062
 underwear+0.059
Neg logit contributions
ales-0.061
mediately-0.058
urbed-0.057
ility-0.056
reci-0.056
Token good
Token ID922
Feature activation+1.86e-03

Pos logit contributions
 babys+0.143
 haircut+0.133
 happened+0.128
 gloss+0.121
 cough+0.115
Neg logit contributions
ales-0.175
reci-0.156
mediately-0.156
urbed-0.153
ining-0.153
Token for
Token ID329
Feature activation-5.48e-02

Pos logit contributions
 babys+0.004
 haircut+0.003
 gloss+0.003
 complexion+0.003
 happened+0.003
Neg logit contributions
ales-0.002
orting-0.002
wed-0.002
 often-0.002
mediately-0.002
Token's
Token ID338
Feature activation+7.81e-02

Pos logit contributions
 often+0.013
ales+0.013
 itself+0.012
rost+0.012
reci+0.012
Neg logit contributions
 babys-0.025
 complexion-0.023
 haircut-0.022
 gloss-0.022
 jan-0.021
Token best
Token ID1266
Feature activation+6.68e-02

Pos logit contributions
 babys+0.131
 haircut+0.115
 complexion+0.114
 gloss+0.113
 virus+0.106
Neg logit contributions
ales-0.100
wed-0.091
mediately-0.090
 often-0.090
 itself-0.090
Token for
Token ID329
Feature activation-5.45e-02

Pos logit contributions
 babys+0.146
 complexion+0.128
 gloss+0.127
nam+0.127
 panc+0.124
Neg logit contributions
ales-0.049
 often-0.046
 itself-0.043
rost-0.043
wed-0.042
Token you
Token ID345
Feature activation-4.98e-03

Pos logit contributions
ales+0.089
mediately+0.087
reci+0.084
isk+0.083
urbed+0.081
Neg logit contributions
 babys-0.072
 haircut-0.070
 happened-0.065
 lipstick-0.062
 hairc-0.061
Token.
Token ID13
Feature activation+3.84e-02

Pos logit contributions
ales+0.007
mediately+0.007
ining+0.006
isk+0.006
orting+0.006
Neg logit contributions
 babys-0.008
 haircut-0.007
 happened-0.007
 gloss-0.007
 complexion-0.007
Token Taking
Token ID20879
Feature activation-2.09e-01

Pos logit contributions
 babys+0.058
 haircut+0.052
 gloss+0.051
 happened+0.047
 hairc+0.046
Neg logit contributions
ales-0.061
wed-0.053
ining-0.052
urbed-0.052
isk-0.052
Token a
Token ID257
Feature activation+1.30e-01

Pos logit contributions
ales+0.348
mediately+0.318
ining+0.315
isk+0.314
orting+0.312
Neg logit contributions
 babys-0.281
 haircut-0.253
 happened-0.240
 gloss-0.236
 complexion-0.229
Token bath
Token ID7837
Feature activation+1.08e-02

Pos logit contributions
 babys+0.194
 complexion+0.175
 gloss+0.174
 happened+0.173
 haircut+0.168
Neg logit contributions
reci-0.165
urbed-0.163
ales-0.162
rost-0.161
ches-0.156
Token is
Token ID318
Feature activation+1.04e-01

Pos logit contributions
 babys+0.017
 haircut+0.015
 happened+0.014
 gloss+0.014
 complexion+0.014
Neg logit contributions
ales-0.016
mediately-0.015
ining-0.014
reci-0.014
urbed-0.014
Token good
Token ID922
Feature activation+1.84e-03

Pos logit contributions
 babys+0.146
 haircut+0.135
 happened+0.130
 gloss+0.124
 cough+0.116
Neg logit contributions
ales-0.171
reci-0.155
mediately-0.154
ining-0.150
urbed-0.150
Token for
Token ID329
Feature activation-5.43e-02

Pos logit contributions
 babys+0.003
 haircut+0.003
 gloss+0.003
 happened+0.003
 complexion+0.003
Neg logit contributions
ales-0.002
orting-0.002
mediately-0.002
wed-0.002
 often-0.002
Token happier
Token ID23030
Feature activation-2.45e-02

Pos logit contributions
 babys+0.025
 happened+0.022
 haircut+0.021
 cough+0.019
 panc+0.019
Neg logit contributions
ales-0.046
isk-0.042
rost-0.041
angles-0.040
mediately-0.040
Token.
Token ID13
Feature activation+4.09e-02

Pos logit contributions
ales+0.043
isk+0.041
mediately+0.039
rost+0.039
reci+0.037
Neg logit contributions
 babys-0.030
 happened-0.029
 haircut-0.028
 complexion-0.026
 panc-0.025
Token Everyone
Token ID11075
Feature activation-8.27e-02

Pos logit contributions
 babys+0.061
 gloss+0.052
 haircut+0.052
 happened+0.049
 panc+0.048
Neg logit contributions
ales-0.069
wed-0.063
isk-0.061
ck-0.058
ws-0.058
Token agreed
Token ID4987
Feature activation+7.16e-02

Pos logit contributions
ales+0.157
isk+0.146
rost+0.137
orting+0.136
mediately+0.131
Neg logit contributions
 babys-0.101
 happened-0.098
 haircut-0.093
 panc-0.084
 gloss-0.083
Token.
Token ID13
Feature activation+4.04e-02

Pos logit contributions
 haircut+0.081
 babys+0.078
 happened+0.078
 panc+0.067
 photo+0.066
Neg logit contributions
ales-0.150
isk-0.135
reci-0.131
mediately-0.131
rost-0.128
Token Taking
Token ID20879
Feature activation-2.09e-01

Pos logit contributions
 babys+0.051
 gloss+0.044
 haircut+0.040
 photo+0.040
 happened+0.039
Neg logit contributions
ales-0.076
wed-0.069
isk-0.068
ck-0.065
cut-0.065
Token the
Token ID262
Feature activation-1.86e-02

Pos logit contributions
ales+0.373
isk+0.357
rost+0.353
orting+0.341
utt+0.335
Neg logit contributions
 happened-0.237
 babys-0.228
 haircut-0.224
 hairc-0.205
 gloss-0.199
Token medicine
Token ID9007
Feature activation+3.78e-02

Pos logit contributions
ales+0.034
ining+0.031
mediately+0.031
urbed+0.030
rost+0.030
Neg logit contributions
 babys-0.024
 haircut-0.021
 happened-0.020
 gloss-0.019
 virus-0.018
Token felt
Token ID2936
Feature activation+6.47e-02

Pos logit contributions
 babys+0.050
 happened+0.048
 haircut+0.047
 gloss+0.044
 photo+0.041
Neg logit contributions
ales-0.064
mediately-0.060
rost-0.059
isk-0.058
reci-0.057
Token nice
Token ID3621
Feature activation-2.75e-02

Pos logit contributions
 babys+0.071
 happened+0.067
 haircut+0.061
 gloss+0.056
 panc+0.054
Neg logit contributions
ales-0.131
isk-0.120
rost-0.115
angles-0.113
mediately-0.109
Token and
Token ID290
Feature activation-3.73e-02

Pos logit contributions
ales+0.046
mediately+0.043
isk+0.043
rost+0.042
reci+0.041
Neg logit contributions
 happened-0.037
 babys-0.036
 haircut-0.035
 gloss-0.033
 panc-0.032
Token the
Token ID262
Feature activation-1.75e-02

Pos logit contributions
ales+0.134
rant+0.120
mediately+0.118
lated+0.117
ining+0.117
Neg logit contributions
 happened-0.108
 babys-0.106
 haircut-0.096
 hairc-0.096
 tooth-0.092
Token bear
Token ID6842
Feature activation+4.69e-02

Pos logit contributions
ales+0.034
ining+0.029
orting+0.029
wed+0.029
angles+0.029
Neg logit contributions
 babys-0.023
 haircut-0.019
 tooth-0.019
 happened-0.019
 wallet-0.018
Token was
Token ID373
Feature activation+1.21e-01

Pos logit contributions
 happened+0.053
 babys+0.045
 tooth+0.039
 hurt+0.038
 wallet+0.036
Neg logit contributions
ales-0.098
ility-0.089
orting-0.089
angles-0.088
rost-0.087
Token right
Token ID826
Feature activation-4.07e-02

Pos logit contributions
 babys+0.111
 happened+0.108
 very+0.099
 hurt+0.099
 mad+0.095
Neg logit contributions
ales-0.269
angles-0.248
reci-0.243
orting-0.238
rost-0.236
Token.
Token ID13
Feature activation+4.25e-02

Pos logit contributions
ales+0.082
reci+0.079
isk+0.076
angles+0.073
ility+0.073
Neg logit contributions
 happened-0.039
 babys-0.039
 haircut-0.033
 wallet-0.033
 tooth-0.031
Token Taking
Token ID20879
Feature activation-2.09e-01

Pos logit contributions
 babys+0.053
 A+0.045
 ashamed+0.044
 Lee+0.043
 nervous+0.043
Neg logit contributions
wed-0.075
ales-0.075
ck-0.073
angles-0.071
rant-0.070
Token the
Token ID262
Feature activation-1.79e-02

Pos logit contributions
rost+0.389
ales+0.386
isk+0.385
ide+0.384
stack+0.376
Neg logit contributions
 happened-0.224
 trouble-0.194
 tooth-0.191
 nightmare-0.180
 hairc-0.175
Token gold
Token ID3869
Feature activation-3.25e-02

Pos logit contributions
ales+0.033
isk+0.030
orting+0.030
mediately+0.030
ining+0.030
Neg logit contributions
 babys-0.022
 happened-0.019
 haircut-0.019
 photo-0.018
 tooth-0.017
Token would
Token ID561
Feature activation+5.13e-02

Pos logit contributions
ales+0.056
isk+0.055
mediately+0.055
ining+0.053
iona+0.053
Neg logit contributions
 happened-0.039
 babys-0.038
 wallet-0.036
 tooth-0.034
 haircut-0.034
Token have
Token ID423
Feature activation-6.54e-02

Pos logit contributions
 babys+0.065
 hurt+0.060
 haircut+0.058
 cough+0.057
 happened+0.051
Neg logit contributions
ales-0.097
mediately-0.090
reci-0.088
orting-0.087
angles-0.087
Token been
Token ID587
Feature activation+6.54e-02

Pos logit contributions
ales+0.131
mediately+0.127
rost+0.124
orting+0.122
rant+0.121
Neg logit contributions
 happened-0.078
 babys-0.061
 trouble-0.057
 haircut-0.056
 tooth-0.054
Token
Token ID198
Feature activation-3.93e-02

Pos logit contributions
ales+0.057
urbed+0.050
ility+0.050
mediately+0.049
ining+0.049
Neg logit contributions
 babys-0.064
 haircut-0.057
 happened-0.056
 gloss-0.054
 complexion-0.054
TokenThey
Token ID2990
Feature activation-1.13e-01

Pos logit contributions
ales+0.065
ility+0.059
urbed+0.059
rant+0.057
ining+0.056
Neg logit contributions
 babys-0.058
 haircut-0.052
 gloss-0.049
 happened-0.048
 complexion-0.047
Token walked
Token ID6807
Feature activation-5.59e-02

Pos logit contributions
ales+0.177
mediately+0.160
ining+0.160
urbed+0.159
reci+0.158
Neg logit contributions
 babys-0.169
 haircut-0.151
 complexion-0.142
 happened-0.141
 gloss-0.137
Token home
Token ID1363
Feature activation+7.53e-02

Pos logit contributions
ales+0.100
ility+0.095
reci+0.094
urbed+0.094
isk+0.094
Neg logit contributions
 babys-0.070
 haircut-0.062
 complexion-0.055
 happened-0.054
 hairc-0.053
Token.
Token ID13
Feature activation+2.91e-02

Pos logit contributions
 babys+0.134
 haircut+0.126
 gloss+0.124
 happened+0.122
 complexion+0.117
Neg logit contributions
ales-0.085
ining-0.078
 often-0.077
isk-0.077
mediately-0.077
Token On
Token ID1550
Feature activation-2.09e-01

Pos logit contributions
 babys+0.058
 haircut+0.054
 happened+0.052
 gloss+0.052
 complexion+0.051
Neg logit contributions
ales-0.030
mediately-0.025
ining-0.025
orting-0.025
isk-0.025
Token the
Token ID262
Feature activation-2.28e-02

Pos logit contributions
ales+0.264
reci+0.232
mediately+0.232
ining+0.230
urbed+0.229
Neg logit contributions
 babys-0.377
 haircut-0.347
 happened-0.326
 gloss-0.318
 panc-0.317
Token way
Token ID835
Feature activation+4.32e-02

Pos logit contributions
ales+0.036
mediately+0.033
 often+0.033
rost+0.032
orting+0.032
Neg logit contributions
 babys-0.031
 happened-0.029
 gloss-0.028
 haircut-0.027
 complexion-0.027
Token,
Token ID11
Feature activation+4.36e-02

Pos logit contributions
 babys+0.057
 haircut+0.049
 happened+0.048
 gloss+0.045
 complexion+0.045
Neg logit contributions
ales-0.075
mediately-0.068
reci-0.068
isk-0.068
ining-0.068
Token they
Token ID484
Feature activation-4.56e-02

Pos logit contributions
 babys+0.069
 haircut+0.062
 happened+0.060
 gloss+0.059
 complexion+0.058
Neg logit contributions
ales-0.063
mediately-0.057
ining-0.056
isk-0.055
orting-0.055
Token met
Token ID1138
Feature activation+4.67e-03

Pos logit contributions
ales+0.073
mediately+0.067
ining+0.066
urbed+0.065
orting+0.065
Neg logit contributions
 babys-0.066
 haircut-0.058
 happened-0.056
 gloss-0.054
 complexion-0.054
Token bird
Token ID6512
Feature activation+1.96e-02

Pos logit contributions
ales+0.043
mediately+0.040
ining+0.040
orting+0.039
isk+0.039
Neg logit contributions
 babys-0.020
 haircut-0.017
 happened-0.015
 gloss-0.015
 complexion-0.015
Token for
Token ID329
Feature activation-5.46e-02

Pos logit contributions
 babys+0.019
 haircut+0.017
 tooth+0.016
 gloss+0.015
 very+0.015
Neg logit contributions
ales-0.038
reci-0.038
lated-0.037
wed-0.037
ining-0.036
Token the
Token ID262
Feature activation-1.99e-02

Pos logit contributions
ales+0.104
reci+0.096
mediately+0.092
ches+0.090
wed+0.089
Neg logit contributions
 babys-0.075
 haircut-0.063
 happened-0.059
 crying-0.057
 phone-0.056
Token advice
Token ID5608
Feature activation-6.63e-03

Pos logit contributions
ales+0.039
mediately+0.036
ining+0.036
isk+0.035
orting+0.035
Neg logit contributions
 babys-0.021
 haircut-0.018
 happened-0.017
 gloss-0.017
 complexion-0.016
Token.
Token ID13
Feature activation+3.80e-02

Pos logit contributions
ales+0.013
isk+0.013
reci+0.013
mediately+0.013
wed+0.013
Neg logit contributions
 babys-0.006
 haircut-0.005
 happened-0.005
 photo-0.005
 gloss-0.005
Token Taking
Token ID20879
Feature activation-2.09e-01

Pos logit contributions
 babys+0.056
 gloss+0.050
 Lee+0.047
 classmate+0.047
 tutor+0.046
Neg logit contributions
ales-0.062
wed-0.056
ck-0.053
asting-0.052
et-0.052
Token the
Token ID262
Feature activation-1.92e-02

Pos logit contributions
rost+0.395
ales+0.394
isk+0.388
ide+0.372
orting+0.372
Neg logit contributions
 happened-0.211
 haircut-0.181
 babys-0.174
 photo-0.170
 tooth-0.170
Token camera
Token ID4676
Feature activation+5.51e-02

Pos logit contributions
ales+0.038
reci+0.036
mediately+0.035
ining+0.035
orting+0.035
Neg logit contributions
 babys-0.018
 haircut-0.016
 gloss-0.015
 complexion-0.014
 hairc-0.013
Token,
Token ID11
Feature activation+4.82e-02

Pos logit contributions
 babys+0.063
 happened+0.062
 haircut+0.059
 gloss+0.054
 photo+0.054
Neg logit contributions
ales-0.099
rost-0.095
reci-0.094
isk-0.094
mediately-0.092
Token he
Token ID339
Feature activation-5.15e-02

Pos logit contributions
 babys+0.052
 happened+0.045
 daddy+0.044
 haircut+0.043
 gloss+0.042
Neg logit contributions
ales-0.100
rost-0.093
wed-0.091
isk-0.089
ide-0.087
Token and
Token ID290
Feature activation-3.91e-02

Pos logit contributions
ales+0.096
rost+0.088
orting+0.088
ility+0.086
reci+0.085
Neg logit contributions
 babys-0.060
 happened-0.058
 rang-0.054
 haircut-0.053
 gloss-0.049
Token learned
Token ID4499
Feature activation-1.61e-03

Pos logit contributions
ales+0.007
orting+0.006
mediately+0.006
reci+0.006
rost+0.006
Neg logit contributions
 babys-0.005
 haircut-0.005
 happened-0.005
 gloss-0.004
 hairc-0.004
Token an
Token ID281
Feature activation+7.90e-02

Pos logit contributions
ales+0.002
mediately+0.002
ining+0.002
 itself+0.002
orting+0.002
Neg logit contributions
 babys-0.002
 haircut-0.002
 complexion-0.002
 gloss-0.002
 happened-0.002
Token important
Token ID1593
Feature activation-5.37e-02

Pos logit contributions
 babys+0.097
 haircut+0.092
 happened+0.084
 virus+0.082
 gloss+0.081
Neg logit contributions
ales-0.134
rost-0.124
isk-0.122
orting-0.121
ility-0.121
Token lesson
Token ID11483
Feature activation+6.61e-02

Pos logit contributions
ales+0.069
 often+0.066
ining+0.066
mediately+0.065
 itself+0.065
Neg logit contributions
 babys-0.088
 gloss-0.076
 underwear-0.075
 complexion-0.075
 haircut-0.075
Token.
Token ID13
Feature activation+4.02e-02

Pos logit contributions
 babys+0.088
 happened+0.085
 haircut+0.083
 gloss+0.077
 virus+0.075
Neg logit contributions
ales-0.109
reci-0.101
mediately-0.101
rost-0.099
urbed-0.098
Token Taking
Token ID20879
Feature activation-2.09e-01

Pos logit contributions
 babys+0.059
 gloss+0.052
 haircut+0.050
 happened+0.049
 hairc+0.047
Neg logit contributions
ales-0.066
wed-0.058
rant-0.057
rost-0.056
et-0.056
Token things
Token ID1243
Feature activation+3.46e-02

Pos logit contributions
ales+0.353
rost+0.341
isk+0.328
utt+0.321
orting+0.316
Neg logit contributions
 haircut-0.257
 happened-0.256
 babys-0.251
 hairc-0.237
 virus-0.228
Token without
Token ID1231
Feature activation-2.39e-02

Pos logit contributions
 happened+0.039
 babys+0.037
 haircut+0.034
 gloss+0.030
 makeup+0.029
Neg logit contributions
ales-0.068
rost-0.065
mediately-0.062
isk-0.061
rant-0.061
Token asking
Token ID4737
Feature activation-6.77e-02

Pos logit contributions
ales+0.043
orting+0.040
rost+0.039
mediately+0.039
oster+0.038
Neg logit contributions
 babys-0.030
 happened-0.028
 haircut-0.027
 hairc-0.026
 lipstick-0.024
Token was
Token ID373
Feature activation+1.17e-01

Pos logit contributions
ales+0.114
rost+0.106
mediately+0.105
utt+0.103
reci+0.101
Neg logit contributions
 babys-0.088
 happened-0.080
 haircut-0.080
 makeup-0.076
 lipstick-0.076
Token wrong
Token ID2642
Feature activation+2.33e-02

Pos logit contributions
 babys+0.147
 haircut+0.136
 happened+0.134
 gloss+0.119
 virus+0.116
Neg logit contributions
ales-0.213
reci-0.193
orting-0.189
rost-0.188
ining-0.188
Token his
Token ID465
Feature activation+3.81e-02

Pos logit contributions
ales+0.081
isk+0.076
reci+0.074
rost+0.074
orting+0.071
Neg logit contributions
 babys-0.047
 happened-0.045
 very-0.044
 haircut-0.043
 blush-0.041
Token little
Token ID1310
Feature activation-1.39e-02

Pos logit contributions
 babys+0.044
 haircut+0.040
 tooth+0.037
 happened+0.036
 wallet+0.035
Neg logit contributions
ales-0.074
reci-0.068
orting-0.068
mediately-0.067
rost-0.067
Token nose
Token ID9686
Feature activation+9.22e-02

Pos logit contributions
ales+0.025
ining+0.023
orting+0.023
isk+0.023
rost+0.023
Neg logit contributions
 babys-0.017
 haircut-0.016
 gloss-0.014
 happened-0.014
 complexion-0.014
Token and
Token ID290
Feature activation-3.82e-02

Pos logit contributions
 babys+0.118
 happened+0.113
 gloss+0.112
 haircut+0.111
 complexion+0.103
Neg logit contributions
ales-0.160
rost-0.151
reci-0.150
isk-0.144
ws-0.141
Token sc
Token ID629
Feature activation+8.12e-02

Pos logit contributions
ales+0.064
orting+0.057
reci+0.057
rost+0.057
mediately+0.056
Neg logit contributions
 babys-0.055
 haircut-0.050
 happened-0.049
 gloss-0.048
 complexion-0.046
Tokenutt
Token ID15318
Feature activation-2.09e-01

Pos logit contributions
 babys+0.123
 gloss+0.121
 haircut+0.115
 complexion+0.115
ird+0.112
Neg logit contributions
reci-0.099
rant-0.098
rost-0.098
ination-0.097
urbed-0.097
Tokenled
Token ID992
Feature activation-4.34e-02

Pos logit contributions
rost+0.387
 If+0.357
ility+0.356
isk+0.347
hips+0.340
Neg logit contributions
 happened-0.225
 complexion-0.176
 babys-0.170
 underwear-0.167
 bl-0.166
Token away
Token ID1497
Feature activation+1.02e-02

Pos logit contributions
ales+0.073
isk+0.071
rost+0.068
reci+0.067
orting+0.067
Neg logit contributions
 very-0.054
 happened-0.052
 babys-0.051
 haircut-0.050
 blush-0.048
Token.
Token ID13
Feature activation+4.08e-02

Pos logit contributions
 babys+0.013
 haircut+0.012
 happened+0.012
 complexion+0.011
 gloss+0.011
Neg logit contributions
ales-0.018
reci-0.017
rost-0.017
mediately-0.017
wed-0.016
Token From
Token ID3574
Feature activation-1.45e-01

Pos logit contributions
 babys+0.054
 gloss+0.044
 haircut+0.041
 Lee+0.039
 classmate+0.039
Neg logit contributions
ales-0.078
wed-0.073
rant-0.068
ws-0.068
et-0.067
Token that
Token ID326
Feature activation-7.60e-02

Pos logit contributions
ales+0.228
reci+0.213
isk+0.213
rost+0.211
ility+0.210
Neg logit contributions
 babys-0.202
 happened-0.195
 haircut-0.191
 lipstick-0.178
 complexion-0.178
Token learned
Token ID4499
Feature activation+1.64e-03

Pos logit contributions
ales+0.246
orting+0.231
 itself+0.227
mediately+0.226
ility+0.225
Neg logit contributions
 babys-0.180
 haircut-0.160
 happened-0.155
 cough-0.152
 rang-0.144
Token a
Token ID257
Feature activation+1.23e-01

Pos logit contributions
 babys+0.003
 haircut+0.002
 happened+0.002
 gloss+0.002
 complexion+0.002
Neg logit contributions
ales-0.002
mediately-0.002
ining-0.002
reci-0.002
isk-0.002
Token hard
Token ID1327
Feature activation+3.75e-03

Pos logit contributions
 babys+0.191
 haircut+0.169
 gloss+0.166
 happened+0.164
 complexion+0.161
Neg logit contributions
ales-0.171
 often-0.163
ining-0.161
orting-0.159
mediately-0.158
Token lesson
Token ID11483
Feature activation+6.32e-02

Pos logit contributions
 babys+0.005
 haircut+0.005
 gloss+0.005
 happened+0.004
 complexion+0.004
Neg logit contributions
ales-0.006
mediately-0.005
ining-0.005
orting-0.005
reci-0.005
Token.
Token ID13
Feature activation+4.01e-02

Pos logit contributions
 happened+0.073
 babys+0.072
 haircut+0.064
 virus+0.062
 hairc+0.060
Neg logit contributions
ales-0.112
mediately-0.110
reci-0.109
urbed-0.106
isk-0.105
Token Taking
Token ID20879
Feature activation-2.09e-01

Pos logit contributions
 babys+0.068
 happened+0.054
 Lee+0.054
 haircut+0.053
 gloss+0.053
Neg logit contributions
ales-0.059
wed-0.054
et-0.051
ling-0.050
ches-0.050
Token things
Token ID1243
Feature activation+3.03e-02

Pos logit contributions
ales+0.365
isk+0.364
rost+0.359
ide+0.351
urbed+0.345
Neg logit contributions
 haircut-0.235
 happened-0.234
 hairc-0.223
 babys-0.212
 trouble-0.208
Token that
Token ID326
Feature activation-7.91e-02

Pos logit contributions
 happened+0.039
 babys+0.034
 haircut+0.032
 underwear+0.030
 hurt+0.030
Neg logit contributions
ales-0.055
mediately-0.052
rost-0.052
rant-0.050
isk-0.050
Token are
Token ID389
Feature activation+8.90e-02

Pos logit contributions
ales+0.135
urbed+0.134
mediately+0.126
rant+0.125
ining+0.125
Neg logit contributions
 babys-0.105
 happened-0.104
 haircut-0.093
 virus-0.088
 cough-0.088
Token not
Token ID407
Feature activation+1.08e-02

Pos logit contributions
 happened+0.078
 babys+0.075
 very+0.064
 haircut+0.063
 hurt+0.062
Neg logit contributions
ales-0.192
urbed-0.179
acking-0.178
reci-0.177
rost-0.171
Token yours
Token ID12431
Feature activation-1.59e-02

Pos logit contributions
 happened+0.013
 babys+0.012
 haircut+0.012
 virus+0.010
 tooth+0.010
Neg logit contributions
ales-0.019
acking-0.018
mediately-0.018
isk-0.018
rost-0.018
Token share
Token ID2648
Feature activation-9.01e-02

Pos logit contributions
 babys+0.003
 haircut+0.003
 cough+0.002
 complexion+0.002
 happened+0.002
Neg logit contributions
ales-0.003
reci-0.003
orting-0.003
urbed-0.003
mediately-0.003
Token with
Token ID351
Feature activation-8.67e-02

Pos logit contributions
ales+0.124
mediately+0.113
reci+0.110
isk+0.109
urbed+0.109
Neg logit contributions
 babys-0.147
 haircut-0.138
 happened-0.129
 hairc-0.125
 gloss-0.125
Token your
Token ID534
Feature activation-2.90e-02

Pos logit contributions
ales+0.131
mediately+0.129
reci+0.128
utt+0.127
ining+0.126
Neg logit contributions
 babys-0.122
 haircut-0.113
 happened-0.107
 hairc-0.106
 tooth-0.105
Token sister
Token ID6621
Feature activation+1.13e-01

Pos logit contributions
ales+0.051
mediately+0.047
ining+0.046
rost+0.046
 often+0.046
Neg logit contributions
 babys-0.034
 haircut-0.030
 gloss-0.030
 happened-0.029
 complexion-0.028
Token.
Token ID13
Feature activation+4.14e-02

Pos logit contributions
 babys+0.165
 haircut+0.148
 happened+0.141
 gloss+0.139
 complexion+0.137
Neg logit contributions
ales-0.176
mediately-0.159
ining-0.157
reci-0.156
orting-0.156
Token Taking
Token ID20879
Feature activation-2.08e-01

Pos logit contributions
 babys+0.065
 haircut+0.056
 gloss+0.055
 virus+0.051
 lipstick+0.051
Neg logit contributions
ales-0.064
wed-0.058
urbed-0.057
ining-0.056
reci-0.056
Token her
Token ID607
Feature activation+5.62e-02

Pos logit contributions
ales+0.337
rost+0.312
mediately+0.311
isk+0.311
ining+0.309
Neg logit contributions
 babys-0.282
 haircut-0.270
 happened-0.252
 gloss-0.243
 hairc-0.239
Token dolls
Token ID36062
Feature activation+2.05e-02

Pos logit contributions
 babys+0.075
 gloss+0.065
 haircut+0.065
 happened+0.063
 complexion+0.063
Neg logit contributions
ales-0.089
mediately-0.082
reci-0.081
 often-0.081
ining-0.081
Token was
Token ID373
Feature activation+1.17e-01

Pos logit contributions
 babys+0.030
 haircut+0.028
 happened+0.026
 gloss+0.024
 photo+0.024
Neg logit contributions
ales-0.032
mediately-0.029
urbed-0.029
rost-0.029
reci-0.029
Token very
Token ID845
Feature activation+1.74e-01

Pos logit contributions
 babys+0.153
 haircut+0.137
 happened+0.131
 gloss+0.125
 complexion+0.120
Neg logit contributions
ales-0.205
mediately-0.186
reci-0.185
orting-0.183
ining-0.183
Token bad
Token ID2089
Feature activation+1.72e-02

Pos logit contributions
 babys+0.209
 haircut+0.186
 happened+0.174
 complexion+0.167
 gloss+0.166
Neg logit contributions
ales-0.324
isk-0.294
ining-0.290
orting-0.290
mediately-0.290
Token too
Token ID1165
Feature activation+1.41e-01

Pos logit contributions
 babys+0.071
 happened+0.064
 gloss+0.058
 haircut+0.057
 underwear+0.054
Neg logit contributions
ales-0.128
mediately-0.121
angles-0.120
rost-0.120
reci-0.119
Token.
Token ID13
Feature activation+3.72e-02

Pos logit contributions
 babys+0.133
 happened+0.122
 haircut+0.116
 cough+0.113
 nightmare+0.110
Neg logit contributions
ales-0.283
mediately-0.274
angles-0.272
wed-0.271
reci-0.265
Token
Token ID198
Feature activation-3.44e-02

Pos logit contributions
 babys+0.050
 gloss+0.039
 happened+0.036
 jan+0.034
ird+0.033
Neg logit contributions
ales-0.067
wed-0.065
rant-0.064
angles-0.064
ility-0.063
Token
Token ID198
Feature activation-3.40e-02

Pos logit contributions
ales+0.069
rant+0.063
ility+0.063
angles+0.062
wed+0.061
Neg logit contributions
 babys-0.039
 happened-0.032
 gloss-0.029
+.-0.027
 stub-0.027
TokenSc
Token ID3351
Feature activation-5.44e-02

Pos logit contributions
ales+0.060
rant+0.058
ility+0.054
angles+0.053
ws+0.051
Neg logit contributions
 babys-0.046
 gloss-0.042
Mat-0.037
 jan-0.037
 hurt-0.037
Tokenutt
Token ID15318
Feature activation-2.08e-01

Pos logit contributions
 areas+0.068
 ponds+0.068
 directions+0.066
 rivers+0.066
 often+0.066
Neg logit contributions
 babys-0.072
 gloss-0.069
ouse-0.065
ird-0.065
 haircut-0.064
Tokenles
Token ID829
Feature activation-2.99e-02

Pos logit contributions
rost+0.344
ility+0.323
 Along+0.321
 If+0.320
 Step+0.318
Neg logit contributions
 happened-0.251
 rang-0.204
 hurt-0.203
 babys-0.202
 bumped-0.184
Token was
Token ID373
Feature activation+1.18e-01

Pos logit contributions
ales+0.056
rost+0.053
angles+0.050
mediately+0.049
reci+0.049
Neg logit contributions
 babys-0.036
 happened-0.034
 hurt-0.031
 shocked-0.029
 very-0.029
Token scared
Token ID12008
Feature activation+3.19e-04

Pos logit contributions
 babys+0.125
 happened+0.107
 haircut+0.099
 gloss+0.092
 nightmare+0.092
Neg logit contributions
ales-0.243
reci-0.226
angles-0.224
orting-0.220
rost-0.218
Token but
Token ID475
Feature activation-8.42e-02

Pos logit contributions
 happened+0.000
 babys+0.000
 crying+0.000
 very+0.000
 nightmare+0.000
Neg logit contributions
ws-0.001
ales-0.001
angles-0.001
wed-0.001
isk-0.001
Token he
Token ID339
Feature activation-4.94e-02

Pos logit contributions
ales+0.166
angles+0.151
ws+0.149
wed+0.147
reci+0.143
Neg logit contributions
 babys-0.106
 very-0.093
 happened-0.093
 hurt-0.088
 scared-0.085
Token Let
Token ID3914
Feature activation-8.94e-02

Pos logit contributions
 Sorry+0.036
 babys+0.035
 Lee+0.033
 seeker+0.030
 haircut+0.029
Neg logit contributions
ales-0.065
wed-0.064
asting-0.060
urbed-0.060
reci-0.059
Token's
Token ID338
Feature activation+7.94e-02

Pos logit contributions
ales+0.108
mediately+0.095
orting+0.095
reci+0.093
 them+0.093
Neg logit contributions
 babys-0.161
 complexion-0.150
 haircut-0.150
 gloss-0.143
 liar-0.139
Token call
Token ID869
Feature activation-9.62e-03

Pos logit contributions
 babys+0.107
 haircut+0.101
 cough+0.091
 happened+0.087
 complexion+0.084
Neg logit contributions
ales-0.134
reci-0.125
urbed-0.124
orting-0.124
mediately-0.124
Token them
Token ID606
Feature activation+2.28e-02

Pos logit contributions
ales+0.012
mediately+0.011
ining+0.011
urbed+0.011
reci+0.011
Neg logit contributions
 babys-0.017
 haircut-0.016
 happened-0.015
 gloss-0.014
 photo-0.014
Token.
Token ID13
Feature activation+3.31e-02

Pos logit contributions
 babys+0.036
 haircut+0.032
 happened+0.030
 gloss+0.029
 hairc+0.029
Neg logit contributions
ales-0.033
mediately-0.032
urbed-0.031
ining-0.031
reci-0.030
Token On
Token ID1550
Feature activation-2.08e-01

Pos logit contributions
 babys+0.049
 haircut+0.042
 gloss+0.042
 cough+0.040
 happened+0.039
Neg logit contributions
ales-0.052
wed-0.049
urbed-0.047
ining-0.046
reci-0.045
Token the
Token ID262
Feature activation-2.07e-02

Pos logit contributions
ales+0.289
reci+0.282
mediately+0.272
orting+0.270
wed+0.267
Neg logit contributions
 babys-0.322
 haircut-0.309
 panc-0.293
 photo-0.286
 happened-0.278
Token count
Token ID954
Feature activation+1.53e-02

Pos logit contributions
ales+0.027
ining+0.025
mediately+0.024
reci+0.024
urbed+0.024
Neg logit contributions
 babys-0.036
 haircut-0.033
 happened-0.031
 gloss-0.031
 complexion-0.030
Token of
Token ID286
Feature activation-1.08e-01

Pos logit contributions
 babys+0.020
 haircut+0.020
 hairc+0.018
 happened+0.018
 panc+0.018
Neg logit contributions
isk-0.023
mediately-0.023
urbed-0.023
wed-0.023
ales-0.023
Token three
Token ID1115
Feature activation+3.13e-02

Pos logit contributions
mediately+0.167
reci+0.162
ales+0.159
ining+0.158
urbed+0.155
Neg logit contributions
 babys-0.146
 haircut-0.138
 photo-0.133
 panc-0.131
 lipstick-0.131
Token.
Token ID13
Feature activation+3.63e-02

Pos logit contributions
 babys+0.041
 haircut+0.039
 happened+0.036
 hairc+0.036
 panc+0.035
Neg logit contributions
ales-0.049
mediately-0.048
isk-0.048
ility-0.047
urbed-0.047
Token,
Token ID11
Feature activation+4.46e-02

Pos logit contributions
 babys+0.036
 haircut+0.033
 happened+0.030
 complexion+0.029
 gloss+0.028
Neg logit contributions
ales-0.057
reci-0.054
urbed-0.052
mediately-0.052
wed-0.051
Token and
Token ID290
Feature activation-4.06e-02

Pos logit contributions
 babys+0.056
 haircut+0.050
 happened+0.047
 gloss+0.046
 complexion+0.045
Neg logit contributions
ales-0.079
mediately-0.072
reci-0.072
ining-0.072
urbed-0.072
Token a
Token ID257
Feature activation+1.30e-01

Pos logit contributions
ales+0.064
 often+0.061
 itself+0.060
mediately+0.059
ining+0.058
Neg logit contributions
 babys-0.055
 haircut-0.046
 gloss-0.045
 complexion-0.045
 happened-0.045
Token shelf
Token ID18316
Feature activation+1.93e-02

Pos logit contributions
 babys+0.161
 happened+0.159
 gloss+0.152
 haircut+0.146
 complexion+0.145
Neg logit contributions
 often-0.181
rost-0.172
ales-0.170
reci-0.169
 At-0.168
Token.
Token ID13
Feature activation+3.51e-02

Pos logit contributions
 babys+0.024
 haircut+0.022
 happened+0.020
 complexion+0.020
 gloss+0.019
Neg logit contributions
ales-0.035
reci-0.033
mediately-0.033
urbed-0.033
ining-0.032
Token On
Token ID1550
Feature activation-2.08e-01

Pos logit contributions
 babys+0.072
 haircut+0.067
 happened+0.065
 gloss+0.064
 complexion+0.064
Neg logit contributions
ales-0.033
mediately-0.028
ining-0.028
reci-0.027
orting-0.027
Token the
Token ID262
Feature activation-1.87e-02

Pos logit contributions
ales+0.315
reci+0.292
orting+0.292
mediately+0.287
isk+0.283
Neg logit contributions
 babys-0.326
 haircut-0.300
 happened-0.281
 photo-0.270
 panc-0.270
Token shelf
Token ID18316
Feature activation+1.95e-02

Pos logit contributions
ales+0.029
mediately+0.027
ining+0.027
isk+0.026
reci+0.026
Neg logit contributions
 babys-0.028
 haircut-0.025
 happened-0.023
 gloss-0.023
 complexion-0.023
Token,
Token ID11
Feature activation+4.69e-02

Pos logit contributions
 babys+0.023
 haircut+0.021
 happened+0.020
 photo+0.018
 hairc+0.017
Neg logit contributions
ales-0.037
urbed-0.035
reci-0.035
isk-0.034
mediately-0.034
Token there
Token ID612
Feature activation-9.85e-02

Pos logit contributions
 babys+0.073
 haircut+0.066
 happened+0.063
 gloss+0.062
 complexion+0.061
Neg logit contributions
ales-0.069
mediately-0.062
ining-0.061
isk-0.061
orting-0.060
Token is
Token ID318
Feature activation+1.04e-01

Pos logit contributions
ales+0.175
urbed+0.161
mediately+0.159
ility+0.158
isk+0.156
Neg logit contributions
 babys-0.126
 happened-0.118
 haircut-0.118
 photo-0.104
 hairc-0.101
Token.
Token ID13
Feature activation+3.90e-02

Pos logit contributions
reci+0.024
ales+0.023
mediately+0.023
isk+0.022
orting+0.022
Neg logit contributions
 babys-0.016
 happened-0.015
 haircut-0.014
 hairc-0.014
 gloss-0.013
Token He
Token ID679
Feature activation-1.35e-01

Pos logit contributions
 babys+0.059
 haircut+0.055
 happened+0.054
 cough+0.053
 hairc+0.051
Neg logit contributions
ales-0.058
wed-0.056
reci-0.056
angles-0.056
ility-0.054
Token screamed
Token ID25421
Feature activation+4.49e-02

Pos logit contributions
ales+0.260
orting+0.253
ility+0.238
reci+0.237
hips+0.233
Neg logit contributions
 babys-0.150
 happened-0.143
 cough-0.134
 haircut-0.130
 tooth-0.127
Token and
Token ID290
Feature activation-3.79e-02

Pos logit contributions
 babys+0.061
 haircut+0.053
 cough+0.052
 gloss+0.052
 happened+0.052
Neg logit contributions
ales-0.078
reci-0.075
mediately-0.071
angles-0.070
wed-0.070
Token sc
Token ID629
Feature activation+7.99e-02

Pos logit contributions
ales+0.072
reci+0.067
orting+0.067
mediately+0.064
angles+0.064
Neg logit contributions
 babys-0.046
 happened-0.041
 haircut-0.040
 cough-0.038
 complexion-0.036
Tokenutt
Token ID15318
Feature activation-2.08e-01

Pos logit contributions
 babys+0.125
 gloss+0.119
 haircut+0.114
 complexion+0.110
ird+0.109
Neg logit contributions
reci-0.099
ining-0.098
ination-0.098
 often-0.094
urbed-0.094
Tokenled
Token ID992
Feature activation-4.25e-02

Pos logit contributions
rost+0.324
ility+0.321
reci+0.309
 Along+0.309
 If+0.308
Neg logit contributions
 happened-0.269
 babys-0.227
 gloss-0.216
 complexion-0.214
 rang-0.211
Token away
Token ID1497
Feature activation+1.07e-02

Pos logit contributions
reci+0.074
ales+0.072
angles+0.068
isk+0.066
orting+0.065
Neg logit contributions
 very-0.052
 babys-0.051
 haircut-0.050
 cough-0.049
 happened-0.048
Token.
Token ID13
Feature activation+3.63e-02

Pos logit contributions
 babys+0.014
 haircut+0.012
 happened+0.012
 complexion+0.011
 gloss+0.011
Neg logit contributions
ales-0.019
reci-0.018
mediately-0.018
wed-0.018
angles-0.017
Token
Token ID198
Feature activation-3.70e-02

Pos logit contributions
 babys+0.051
 happened+0.048
 haircut+0.043
 cough+0.042
 hairc+0.040
Neg logit contributions
ales-0.059
wed-0.057
reci-0.056
angles-0.056
ility-0.054
Token
Token ID198
Feature activation-3.65e-02

Pos logit contributions
ales+0.075
ility+0.068
wed+0.067
et+0.067
reci+0.067
Neg logit contributions
 babys-0.041
 happened-0.033
 haircut-0.030
 gloss-0.028
+.-0.027
Token again
Token ID757
Feature activation+1.68e-02

Pos logit contributions
 happened+0.045
 haircut+0.043
 babys+0.043
 very+0.039
 gloss+0.038
Neg logit contributions
ales-0.082
rost-0.078
wed-0.075
reci-0.075
mediately-0.074
Token!
Token ID0
Feature activation+3.06e-02

Pos logit contributions
 babys+0.020
 haircut+0.019
 happened+0.018
 complexion+0.017
 gloss+0.017
Neg logit contributions
ales-0.029
rost-0.027
mediately-0.027
isk-0.027
reci-0.027
Token<|endoftext|>
Token ID50256
Feature activation-4.67e-02

Pos logit contributions
 babys+0.039
 happened+0.034
 gloss+0.033
 haircut+0.032
 photo+0.030
Neg logit contributions
ales-0.053
wed-0.051
urbed-0.048
rant-0.048
ility-0.048
TokenOnce
Token ID7454
Feature activation-1.38e-02

Pos logit contributions
ales+0.093
wed+0.085
urbed+0.084
mediately+0.083
isk+0.081
Neg logit contributions
 babys-0.050
 happened-0.049
 haircut-0.039
 gloss-0.039
 photo-0.038
Token upon
Token ID2402
Feature activation-9.06e-02

Pos logit contributions
ales+0.021
isk+0.019
mediately+0.018
urbed+0.018
orting+0.018
Neg logit contributions
 babys-0.021
 happened-0.020
 haircut-0.018
 complexion-0.018
 photo-0.017
Token a
Token ID257
Feature activation+1.39e-01

Pos logit contributions
 itself+0.136
ales+0.132
 often+0.130
ining+0.128
mediately+0.127
Neg logit contributions
 babys-0.139
 gloss-0.115
 haircut-0.115
 complexion-0.114
 virus-0.111
Token time
Token ID640
Feature activation+2.52e-02

Pos logit contributions
 babys+0.221
 haircut+0.204
 happened+0.198
 gloss+0.189
 complexion+0.189
Neg logit contributions
ales-0.198
mediately-0.174
orting-0.172
 itself-0.171
reci-0.171
Token there
Token ID612
Feature activation-9.72e-02

Pos logit contributions
 babys+0.039
 haircut+0.036
 gloss+0.034
 complexion+0.034
 virus+0.033
Neg logit contributions
ales-0.035
ining-0.032
 often-0.032
rost-0.032
orting-0.032
Token was
Token ID373
Feature activation+1.21e-01

Pos logit contributions
ales+0.160
mediately+0.144
orting+0.143
urbed+0.142
reci+0.142
Neg logit contributions
 babys-0.135
 haircut-0.120
 happened-0.118
 gloss-0.112
 complexion-0.109
Token a
Token ID257
Feature activation+1.39e-01

Pos logit contributions
 babys+0.198
 haircut+0.183
 virus+0.170
 complexion+0.169
 gloss+0.167
Neg logit contributions
 itself-0.163
ales-0.162
 often-0.157
 At-0.156
mediately-0.151
Token boy
Token ID2933
Feature activation+8.04e-02

Pos logit contributions
 babys+0.219
 happened+0.204
 gloss+0.203
 haircut+0.200
 hairc+0.192
Neg logit contributions
 often-0.170
ales-0.165
mediately-0.162
rost-0.161
isk-0.157
Token upon
Token ID2402
Feature activation-9.33e-02

Pos logit contributions
ales+0.028
mediately+0.025
urbed+0.024
isk+0.024
orting+0.024
Neg logit contributions
 babys-0.034
 happened-0.031
 haircut-0.030
 complexion-0.029
 gloss-0.028
Token a
Token ID257
Feature activation+1.36e-01

Pos logit contributions
 itself+0.123
 often+0.117
ining+0.112
 them+0.110
mediately+0.110
Neg logit contributions
 babys-0.166
 gloss-0.143
 haircut-0.140
 complexion-0.137
 virus-0.137
Token time
Token ID640
Feature activation+2.17e-02

Pos logit contributions
 babys+0.228
 haircut+0.209
 happened+0.202
 gloss+0.197
 complexion+0.195
Neg logit contributions
ales-0.181
mediately-0.160
ining-0.157
orting-0.156
urbed-0.156
Token,
Token ID11
Feature activation+4.79e-02

Pos logit contributions
 babys+0.036
 haircut+0.034
 gloss+0.033
 complexion+0.032
 virus+0.032
Neg logit contributions
ales-0.027
ining-0.026
 often-0.026
rost-0.025
orting-0.025
Token in
Token ID287
Feature activation-9.12e-02

Pos logit contributions
 babys+0.068
 happened+0.064
 haircut+0.060
 complexion+0.057
 gloss+0.056
Neg logit contributions
ales-0.078
orting-0.068
isk-0.067
reci-0.067
urbed-0.067
Token a
Token ID257
Feature activation+1.36e-01

Pos logit contributions
ales+0.222
orting+0.208
reci+0.208
urbed+0.205
wed+0.204
Neg logit contributions
 babys-0.054
 haircut-0.050
 happened-0.045
 tooth-0.036
 cough-0.033
Token small
Token ID1402
Feature activation-6.26e-02

Pos logit contributions
 babys+0.148
 haircut+0.130
 happened+0.122
 complexion+0.115
 photo+0.114
Neg logit contributions
ales-0.274
orting-0.246
ility-0.242
urbed-0.242
wed-0.239
Token town
Token ID3240
Feature activation-4.79e-02

Pos logit contributions
ales+0.097
mediately+0.087
orting+0.086
ining+0.086
isk+0.085
Neg logit contributions
 babys-0.095
 haircut-0.085
 complexion-0.079
 happened-0.079
 gloss-0.077
Token,
Token ID11
Feature activation+4.77e-02

Pos logit contributions
ales+0.085
urbed+0.077
isk+0.077
reci+0.077
mediately+0.076
Neg logit contributions
 babys-0.062
 haircut-0.054
 happened-0.052
 complexion-0.050
 gloss-0.048
Token there
Token ID612
Feature activation-9.98e-02

Pos logit contributions
 babys+0.064
 happened+0.061
 haircut+0.056
 complexion+0.054
 photo+0.053
Neg logit contributions
ales-0.081
isk-0.072
urbed-0.072
orting-0.071
wed-0.071
Token was
Token ID373
Feature activation+1.18e-01

Pos logit contributions
ales+0.138
mediately+0.126
ining+0.126
 often+0.123
isk+0.123
Neg logit contributions
 babys-0.161
 haircut-0.146
 gloss-0.139
 complexion-0.137
 happened-0.135
TokenThe
Token ID464
Feature activation-1.27e-01

Pos logit contributions
ales+0.052
reci+0.045
orting+0.045
rost+0.044
wed+0.044
Neg logit contributions
 babys-0.063
 haircut-0.060
 happened-0.059
 gloss-0.058
 complexion-0.055
Token water
Token ID1660
Feature activation+4.45e-02

Pos logit contributions
ales+0.165
mediately+0.152
ining+0.148
orting+0.147
rost+0.147
Neg logit contributions
 babys-0.209
 haircut-0.192
 happened-0.187
 gloss-0.185
 complexion-0.184
Token was
Token ID373
Feature activation+1.17e-01

Pos logit contributions
 babys+0.053
 happened+0.048
 gloss+0.042
 haircut+0.042
 underwear+0.039
Neg logit contributions
ales-0.087
reci-0.079
isk-0.078
ility-0.078
urbed-0.077
Token cold
Token ID4692
Feature activation+1.26e-02

Pos logit contributions
 babys+0.138
 haircut+0.123
 gloss+0.114
 happened+0.113
 complexion+0.113
Neg logit contributions
ales-0.209
mediately-0.198
ining-0.195
 often-0.195
isk-0.191
Token,
Token ID11
Feature activation+4.47e-02

Pos logit contributions
 babys+0.022
 haircut+0.020
 happened+0.020
 gloss+0.020
 complexion+0.019
Neg logit contributions
ales-0.016
mediately-0.014
ining-0.014
isk-0.013
reci-0.013
Token so
Token ID523
Feature activation+1.51e-01

Pos logit contributions
 babys+0.049
 happened+0.046
 haircut+0.044
 cough+0.043
 gloss+0.043
Neg logit contributions
ales-0.085
isk-0.080
reci-0.079
ility-0.078
rost-0.078
Token Amy
Token ID14235
Feature activation-3.52e-02

Pos logit contributions
 babys+0.245
 haircut+0.222
 happened+0.212
 gloss+0.210
 complexion+0.208
Neg logit contributions
ales-0.210
mediately-0.190
ining-0.187
isk-0.184
orting-0.184
Token started
Token ID2067
Feature activation+8.75e-02

Pos logit contributions
ales+0.058
orting+0.053
reci+0.053
mediately+0.052
isk+0.052
Neg logit contributions
 babys-0.049
 haircut-0.044
 happened-0.043
 gloss-0.042
 complexion-0.041
Token to
Token ID284
Feature activation+7.72e-03

Pos logit contributions
 babys+0.133
 haircut+0.119
 happened+0.115
 gloss+0.114
 complexion+0.113
Neg logit contributions
ales-0.127
mediately-0.115
 often-0.114
ining-0.114
urbed-0.113
Token g
Token ID308
Feature activation+8.69e-02

Pos logit contributions
 babys+0.012
 happened+0.011
 haircut+0.011
 gloss+0.011
 complexion+0.011
Neg logit contributions
ales-0.011
mediately-0.010
 often-0.010
isk-0.010
ining-0.009
Tokeniggle
Token ID24082
Feature activation-7.12e-02

Pos logit contributions
 babys+0.188
 haircut+0.172
 virus+0.166
 gloss+0.164
 happened+0.164
Neg logit contributions
ales-0.082
ining-0.069
urbed-0.065
wed-0.065
 often-0.065
Token.
Token ID13
Feature activation+4.30e-02

Pos logit contributions
 babys+0.058
 happened+0.053
 haircut+0.052
 gloss+0.049
 complexion+0.048
Neg logit contributions
ales-0.074
urbed-0.067
mediately-0.067
rost-0.067
isk-0.066
Token She
Token ID1375
Feature activation-1.39e-01

Pos logit contributions
 babys+0.065
 haircut+0.059
 happened+0.058
 gloss+0.056
 lipstick+0.054
Neg logit contributions
ales-0.064
urbed-0.062
ility-0.059
reci-0.058
ining-0.057
Token loved
Token ID6151
Feature activation-1.64e-02

Pos logit contributions
ales+0.175
 often+0.158
mediately+0.158
ining+0.157
wed+0.154
Neg logit contributions
 babys-0.240
 haircut-0.219
 gloss-0.209
 happened-0.208
 complexion-0.204
Token her
Token ID607
Feature activation+5.74e-02

Pos logit contributions
ales+0.025
 often+0.022
ining+0.022
mediately+0.022
isk+0.022
Neg logit contributions
 babys-0.025
 haircut-0.022
 gloss-0.021
 happened-0.021
 complexion-0.021
Token room
Token ID2119
Feature activation+6.06e-02

Pos logit contributions
 gloss+0.077
 babys+0.076
 liar+0.076
 happened+0.073
burn+0.073
Neg logit contributions
 often-0.079
 At-0.076
ales-0.076
ining-0.072
mediately-0.071
Token very
Token ID845
Feature activation+1.73e-01

Pos logit contributions
 babys+0.079
 haircut+0.070
 happened+0.066
 gloss+0.065
 complexion+0.064
Neg logit contributions
ales-0.104
mediately-0.095
ining-0.094
isk-0.093
orting-0.093
Token much
Token ID881
Feature activation+3.58e-02

Pos logit contributions
 babys+0.278
 gloss+0.242
 happened+0.241
 haircut+0.239
burn+0.237
Neg logit contributions
 often-0.255
ales-0.228
rost-0.218
ining-0.214
 freely-0.211
Token.
Token ID13
Feature activation+4.36e-02

Pos logit contributions
 babys+0.062
 haircut+0.055
 happened+0.054
 gloss+0.053
 complexion+0.052
Neg logit contributions
ales-0.046
 often-0.041
orting-0.041
ining-0.040
mediately-0.040
Token One
Token ID1881
Feature activation-9.04e-02

Pos logit contributions
 babys+0.067
 haircut+0.061
 happened+0.059
 gloss+0.058
 lipstick+0.055
Neg logit contributions
ales-0.064
urbed-0.060
ility-0.057
wed-0.057
ining-0.056
Token day
Token ID1110
Feature activation+5.83e-02

Pos logit contributions
 often+0.071
ales+0.069
mediately+0.067
rost+0.065
 At+0.063
Neg logit contributions
 babys-0.191
 gloss-0.177
 underwear-0.176
 haircut-0.174
 complexion-0.173
Token,
Token ID11
Feature activation+4.71e-02

Pos logit contributions
 babys+0.126
 poet+0.119
 virus+0.116
 haircut+0.112
 underwear+0.112
Neg logit contributions
 At-0.047
 often-0.047
ales-0.045
 until-0.045
 them-0.042
Token day
Token ID1110
Feature activation+5.58e-02

Pos logit contributions
ales+0.137
isk+0.117
rant+0.116
mediately+0.115
wed+0.114
Neg logit contributions
 babys-0.100
 happened-0.090
 haircut-0.088
 photo-0.084
 virus-0.083
Token on
Token ID319
Feature activation-5.57e-02

Pos logit contributions
 babys+0.080
 haircut+0.072
 gloss+0.066
 happened+0.066
 complexion+0.066
Neg logit contributions
ales-0.087
orting-0.081
mediately-0.080
ining-0.079
 often-0.079
Token,
Token ID11
Feature activation+4.80e-02

Pos logit contributions
ales+0.111
mediately+0.102
wed+0.100
reci+0.100
ility+0.100
Neg logit contributions
 babys-0.065
 happened-0.057
 haircut-0.055
 gloss-0.049
 complexion-0.049
Token Tim
Token ID5045
Feature activation-6.02e-02

Pos logit contributions
 babys+0.068
 happened+0.065
 haircut+0.058
 hairc+0.055
 gloss+0.055
Neg logit contributions
ales-0.082
rant-0.077
ining-0.075
ility-0.074
ws-0.074
Tokenmy
Token ID1820
Feature activation+4.17e-02

Pos logit contributions
ales+0.087
urbed+0.076
rost+0.076
ining+0.076
 itself+0.076
Neg logit contributions
 babys-0.094
 happened-0.085
 haircut-0.083
 gloss-0.082
 tooth-0.079
Token was
Token ID373
Feature activation+1.18e-01

Pos logit contributions
 babys+0.061
 happened+0.059
 gloss+0.054
 haircut+0.053
 cough+0.052
Neg logit contributions
ales-0.067
rost-0.059
mediately-0.058
ility-0.058
rant-0.056
Token more
Token ID517
Feature activation+4.15e-02

Pos logit contributions
 babys+0.175
 haircut+0.160
 happened+0.151
 gloss+0.150
 complexion+0.144
Neg logit contributions
ales-0.187
reci-0.164
mediately-0.162
orting-0.161
rost-0.161
Token careful
Token ID8161
Feature activation-5.06e-02

Pos logit contributions
 babys+0.067
 haircut+0.062
 happened+0.059
 gloss+0.058
 complexion+0.058
Neg logit contributions
ales-0.059
isk-0.052
mediately-0.052
rost-0.051
ining-0.051
Token and
Token ID290
Feature activation-4.05e-02

Pos logit contributions
ales+0.091
reci+0.090
rant+0.085
isk+0.085
ws+0.081
Neg logit contributions
 babys-0.056
 happened-0.055
 haircut-0.052
 gloss-0.051
 very-0.049
Token had
Token ID550
Feature activation-2.09e-03

Pos logit contributions
ales+0.064
reci+0.056
rant+0.055
urbed+0.054
ility+0.054
Neg logit contributions
 happened-0.061
 babys-0.059
 haircut-0.057
 gloss-0.054
 complexion-0.051
Token fun
Token ID1257
Feature activation+9.24e-03

Pos logit contributions
ales+0.004
rant+0.004
reci+0.004
mediately+0.004
rost+0.004
Neg logit contributions
 babys-0.002
 happened-0.002
 haircut-0.002
 gloss-0.002
 hairc-0.002
Token there
Token ID612
Feature activation-9.69e-02

Pos logit contributions
 babys+0.059
 happened+0.058
 haircut+0.051
 photo+0.050
 complexion+0.050
Neg logit contributions
ales-0.083
urbed-0.074
isk-0.074
ility-0.074
orting-0.072
Token was
Token ID373
Feature activation+1.18e-01

Pos logit contributions
ales+0.146
mediately+0.131
ining+0.129
orting+0.129
urbed+0.129
Neg logit contributions
 babys-0.147
 haircut-0.133
 happened-0.129
 gloss-0.124
 complexion-0.123
Token a
Token ID257
Feature activation+1.37e-01

Pos logit contributions
 babys+0.194
 gloss+0.183
 haircut+0.179
 complexion+0.178
ouse+0.175
Neg logit contributions
 At-0.157
 often-0.154
 In-0.146
mediately-0.144
 itself-0.143
Token tall
Token ID7331
Feature activation-4.86e-02

Pos logit contributions
 gloss+0.220
 babys+0.215
 happened+0.211
 underwear+0.206
 haircut+0.201
Neg logit contributions
 often-0.161
 freely-0.142
rost-0.141
mediately-0.136
ining-0.135
Token temple
Token ID12505
Feature activation-6.43e-03

Pos logit contributions
ales+0.084
mediately+0.077
ining+0.076
isk+0.076
wed+0.075
Neg logit contributions
 babys-0.064
 haircut-0.057
 happened-0.052
 complexion-0.052
 gloss-0.051
Token.
Token ID13
Feature activation+4.37e-02

Pos logit contributions
ales+0.009
mediately+0.008
isk+0.008
urbed+0.008
reci+0.008
Neg logit contributions
 babys-0.011
 haircut-0.010
 happened-0.009
 gloss-0.009
 complexion-0.009
Token Many
Token ID4650
Feature activation-7.01e-02

Pos logit contributions
 babys+0.086
 haircut+0.079
 happened+0.078
 gloss+0.075
 complexion+0.074
Neg logit contributions
ales-0.046
urbed-0.040
wed-0.040
isk-0.040
orting-0.039
Token kids
Token ID3988
Feature activation+5.83e-02

Pos logit contributions
ales+0.080
 often+0.075
reci+0.072
ining+0.070
mediately+0.070
Neg logit contributions
 babys-0.127
 gloss-0.117
 haircut-0.117
burn-0.113
 complexion-0.109
Token liked
Token ID8288
Feature activation-3.57e-02

Pos logit contributions
 babys+0.077
 happened+0.073
 haircut+0.067
 complexion+0.063
 photo+0.063
Neg logit contributions
ales-0.100
urbed-0.094
isk-0.093
mediately-0.092
rost-0.090
Token to
Token ID284
Feature activation+7.66e-03

Pos logit contributions
ales+0.065
mediately+0.064
isk+0.062
reci+0.062
orting+0.061
Neg logit contributions
 babys-0.042
 haircut-0.038
 happened-0.037
 hairc-0.034
 photo-0.033
Token play
Token ID711
Feature activation-4.07e-02

Pos logit contributions
 babys+0.010
 haircut+0.010
 happened+0.009
 complexion+0.009
 photo+0.008
Neg logit contributions
ales-0.013
urbed-0.012
orting-0.012
mediately-0.012
isk-0.012
Token they
Token ID484
Feature activation-5.50e-02

Pos logit contributions
 babys+0.090
 haircut+0.079
 happened+0.078
 gloss+0.073
 cough+0.070
Neg logit contributions
ales-0.096
wed-0.087
reci-0.086
orting-0.084
isk-0.084
Token saw
Token ID2497
Feature activation-2.28e-02

Pos logit contributions
ales+0.094
mediately+0.086
orting+0.085
ining+0.084
rost+0.083
Neg logit contributions
 babys-0.075
 happened-0.068
 haircut-0.064
 complexion-0.060
 cough-0.060
Token a
Token ID257
Feature activation+1.30e-01

Pos logit contributions
ales+0.040
mediately+0.037
ining+0.037
 itself+0.037
isk+0.036
Neg logit contributions
 babys-0.028
 haircut-0.025
 gloss-0.023
 complexion-0.023
 happened-0.022
Token big
Token ID1263
Feature activation-1.87e-02

Pos logit contributions
 babys+0.174
 gloss+0.168
 happened+0.161
 complexion+0.158
 haircut+0.156
Neg logit contributions
 often-0.175
rost-0.168
 freely-0.167
mediately-0.166
isk-0.164
Token head
Token ID1182
Feature activation+6.71e-02

Pos logit contributions
ales+0.028
mediately+0.025
ining+0.025
orting+0.025
isk+0.024
Neg logit contributions
 babys-0.028
 haircut-0.025
 happened-0.024
 gloss-0.024
 complexion-0.024
Token.
Token ID13
Feature activation+4.59e-02

Pos logit contributions
 babys+0.097
 haircut+0.087
 happened+0.084
 gloss+0.082
 complexion+0.080
Neg logit contributions
ales-0.107
mediately-0.097
reci-0.097
ining-0.096
isk-0.095
Token It
Token ID632
Feature activation-8.21e-02

Pos logit contributions
 babys+0.085
 haircut+0.075
 gloss+0.073
 happened+0.073
 complexion+0.071
Neg logit contributions
ales-0.058
wed-0.052
reci-0.050
urbed-0.049
mediately-0.049
Token was
Token ID373
Feature activation+1.18e-01

Pos logit contributions
ales+0.112
mediately+0.105
ining+0.104
 often+0.103
isk+0.102
Neg logit contributions
 babys-0.131
 haircut-0.121
 complexion-0.114
 gloss-0.112
 virus-0.109
Token the
Token ID262
Feature activation-1.60e-02

Pos logit contributions
 babys+0.149
 haircut+0.136
 complexion+0.132
 gloss+0.129
 happened+0.122
Neg logit contributions
ales-0.195
 itself-0.184
 often-0.184
mediately-0.183
rost-0.182
Token hipp
Token ID18568
Feature activation+2.19e-02

Pos logit contributions
ales+0.023
mediately+0.022
 often+0.022
rost+0.022
cut+0.021
Neg logit contributions
 babys-0.022
 happened-0.020
 gloss-0.020
 complexion-0.019
 haircut-0.019
Tokeno
Token ID78
Feature activation-4.51e-03

Pos logit contributions
 babys+0.032
 haircut+0.027
 gloss+0.027
 happened+0.027
 complexion+0.026
Neg logit contributions
ales-0.034
urbed-0.032
mediately-0.032
reci-0.031
ining-0.031
Token asked
Token ID1965
Feature activation-2.43e-02

Pos logit contributions
 often+0.021
ales+0.021
mediately+0.020
ining+0.020
ches+0.019
Neg logit contributions
 babys-0.038
 haircut-0.036
 virus-0.035
 photo-0.034
 liar-0.034
Token.
Token ID13
Feature activation+3.98e-02

Pos logit contributions
ales+0.033
mediately+0.030
orting+0.029
urbed+0.029
 itself+0.029
Neg logit contributions
 babys-0.039
 haircut-0.036
 gloss-0.034
 happened-0.034
 complexion-0.034
Token
Token ID198
Feature activation-3.65e-02

Pos logit contributions
 babys+0.071
 haircut+0.065
 happened+0.063
 gloss+0.062
 complexion+0.062
Neg logit contributions
ales-0.049
mediately-0.043
ining-0.042
isk-0.042
orting-0.042
Token
Token ID198
Feature activation-3.67e-02

Pos logit contributions
ales+0.062
mediately+0.056
ility+0.056
urbed+0.055
wed+0.055
Neg logit contributions
 babys-0.052
 happened-0.043
 haircut-0.042
 gloss-0.039
 complexion-0.039
Token"
Token ID1
Feature activation-9.58e-03

Pos logit contributions
ales+0.056
wed+0.049
ility+0.049
rant+0.048
isk+0.048
Neg logit contributions
 babys-0.059
 haircut-0.053
 guitar-0.048
 shirt-0.047
 gloss-0.047
TokenYes
Token ID5297
Feature activation-7.32e-03

Pos logit contributions
ales+0.012
mediately+0.011
rost+0.011
orting+0.011
 often+0.011
Neg logit contributions
 babys-0.017
 haircut-0.015
 happened-0.015
 complexion-0.014
 gloss-0.014
Token,
Token ID11
Feature activation+4.56e-02

Pos logit contributions
ales+0.014
mediately+0.013
reci+0.013
ining+0.013
wed+0.013
Neg logit contributions
 babys-0.009
 haircut-0.008
 happened-0.007
 gloss-0.007
 panc-0.007
Token please
Token ID3387
Feature activation+2.63e-02

Pos logit contributions
 babys+0.058
 haircut+0.054
 happened+0.048
 panc+0.048
 gloss+0.048
Neg logit contributions
ales-0.088
wed-0.083
mediately-0.081
ining-0.080
acking-0.080
Token,
Token ID11
Feature activation+4.54e-02

Pos logit contributions
 babys+0.030
 haircut+0.027
 happened+0.025
 gloss+0.024
 cough+0.023
Neg logit contributions
ales-0.049
urbed-0.047
mediately-0.046
reci-0.045
ining-0.045
Token Lily
Token ID20037
Feature activation-1.99e-02

Pos logit contributions
 babys+0.053
 haircut+0.051
 gloss+0.043
 panc+0.043
 tooth+0.042
Neg logit contributions
ales-0.094
acking-0.089
wed-0.088
ining-0.087
mediately-0.086
Token.
Token ID13
Feature activation+3.93e-02

Pos logit contributions
ales+0.038
mediately+0.036
ility+0.036
urbed+0.035
reci+0.035
Neg logit contributions
 babys-0.022
 happened-0.020
 haircut-0.019
 complexion-0.018
 gloss-0.017
Token cool
Token ID3608
Feature activation-4.63e-02

Pos logit contributions
 babys+0.132
 haircut+0.119
 complexion+0.117
 gloss+0.113
 happened+0.111
Neg logit contributions
ales-0.109
 itself-0.098
mediately-0.098
 often-0.098
isk-0.097
Token,
Token ID11
Feature activation+4.58e-02

Pos logit contributions
ales+0.068
mediately+0.061
 itself+0.060
urbed+0.059
 often+0.059
Neg logit contributions
 babys-0.073
 haircut-0.064
 gloss-0.061
 happened-0.061
 complexion-0.060
Token Lily
Token ID20037
Feature activation-1.85e-02

Pos logit contributions
 babys+0.064
 haircut+0.060
 happened+0.056
 gloss+0.055
 underwear+0.054
Neg logit contributions
ales-0.080
reci-0.071
wed-0.070
mediately-0.070
ining-0.070
Token.
Token ID13
Feature activation+3.78e-02

Pos logit contributions
ales+0.030
mediately+0.027
reci+0.027
ining+0.027
orting+0.027
Neg logit contributions
 babys-0.026
 haircut-0.023
 happened-0.022
 gloss-0.022
 complexion-0.021
Token Look
Token ID6803
Feature activation+2.72e-02

Pos logit contributions
 babys+0.062
 gloss+0.055
 haircut+0.055
 happened+0.052
 complexion+0.051
Neg logit contributions
ales-0.053
ining-0.048
mediately-0.047
 often-0.047
wed-0.047
Token,
Token ID11
Feature activation+4.50e-02

Pos logit contributions
 babys+0.048
 haircut+0.041
 complexion+0.040
 virus+0.040
 gloss+0.040
Neg logit contributions
ales-0.035
orting-0.032
ining-0.031
 often-0.031
urbed-0.031
Token it
Token ID340
Feature activation+4.09e-02

Pos logit contributions
 babys+0.074
 haircut+0.068
 happened+0.065
 gloss+0.064
 complexion+0.063
Neg logit contributions
ales-0.061
mediately-0.055
ining-0.054
isk-0.053
orting-0.053
Token can
Token ID460
Feature activation+4.01e-02

Pos logit contributions
 babys+0.062
 haircut+0.056
 happened+0.053
 gloss+0.053
 complexion+0.053
Neg logit contributions
ales-0.061
mediately-0.055
ining-0.055
isk-0.054
orting-0.054
Token w
Token ID266
Feature activation+2.47e-02

Pos logit contributions
 babys+0.057
 haircut+0.051
 happened+0.049
 complexion+0.049
 gloss+0.048
Neg logit contributions
ales-0.063
mediately-0.057
rost-0.056
isk-0.056
ining-0.056
Tokeniggle
Token ID24082
Feature activation-7.04e-02

Pos logit contributions
 babys+0.041
 haircut+0.039
 happened+0.037
 gloss+0.037
 complexion+0.037
Neg logit contributions
ales-0.030
mediately-0.027
urbed-0.027
ining-0.027
reci-0.027
Token!"
Token ID2474
Feature activation+8.05e-02

Pos logit contributions
ales+0.101
mediately+0.091
ining+0.090
isk+0.089
orting+0.089
Neg logit contributions
 babys-0.111
 haircut-0.101
 happened-0.096
 gloss-0.095
 complexion-0.094
Token selfish
Token ID20363
Feature activation-3.85e-02

Pos logit contributions
ales+0.084
isk+0.074
rost+0.073
ide+0.072
ining+0.072
Neg logit contributions
 babys-0.048
 happened-0.047
 haircut-0.044
 cough-0.043
 hurt-0.040
Token.
Token ID13
Feature activation+3.32e-02

Pos logit contributions
ales+0.068
mediately+0.062
isk+0.062
rost+0.061
rant+0.061
Neg logit contributions
 babys-0.048
 happened-0.045
 haircut-0.044
 gloss-0.042
 hairc-0.040
Token She
Token ID1375
Feature activation-1.42e-01

Pos logit contributions
 babys+0.050
 haircut+0.044
 happened+0.043
 gloss+0.042
 complexion+0.041
Neg logit contributions
ales-0.052
wed-0.046
mediately-0.046
orting-0.045
isk-0.045
Token learned
Token ID4499
Feature activation-1.06e-03

Pos logit contributions
ales+0.199
mediately+0.176
ining+0.173
urbed+0.172
orting+0.172
Neg logit contributions
 babys-0.233
 haircut-0.211
 happened-0.205
 gloss-0.201
 complexion-0.198
Token that
Token ID326
Feature activation-7.88e-02

Pos logit contributions
ales+0.001
 itself+0.001
 often+0.001
orting+0.001
mediately+0.001
Neg logit contributions
 babys-0.002
 haircut-0.002
 complexion-0.002
 gloss-0.002
 happened-0.002
Token it
Token ID340
Feature activation+4.02e-02

Pos logit contributions
ales+0.121
mediately+0.108
ining+0.107
isk+0.106
urbed+0.104
Neg logit contributions
 babys-0.122
 haircut-0.111
 happened-0.108
 gloss-0.104
 virus-0.100
Token is
Token ID318
Feature activation+1.01e-01

Pos logit contributions
 happened+0.047
 babys+0.044
 gloss+0.038
 virus+0.036
 cough+0.034
Neg logit contributions
ales-0.078
rost-0.071
isk-0.069
urbed-0.068
mediately-0.067
Token better
Token ID1365
Feature activation+2.33e-02

Pos logit contributions
 happened+0.094
 babys+0.092
 haircut+0.080
 very+0.074
 virus+0.072
Neg logit contributions
ales-0.217
urbed-0.191
acking-0.190
rost-0.188
isk-0.187
Token to
Token ID284
Feature activation+6.48e-03

Pos logit contributions
 happened+0.021
 babys+0.020
 haircut+0.018
 virus+0.016
 wallet+0.016
Neg logit contributions
ales-0.049
rant-0.046
ility-0.046
urbed-0.045
mediately-0.045
Token enjoy
Token ID2883
Feature activation-1.09e-01

Pos logit contributions
 babys+0.008
 haircut+0.008
 cough+0.008
 hurt+0.007
 happened+0.007
Neg logit contributions
ales-0.012
angles-0.011
mediately-0.010
urbed-0.010
ining-0.010
Token the
Token ID262
Feature activation-2.08e-02

Pos logit contributions
ales+0.192
mediately+0.185
reci+0.174
utt+0.173
wed+0.172
Neg logit contributions
 babys-0.142
 haircut-0.129
 happened-0.125
 hairc-0.118
 makeup-0.117
Token time
Token ID640
Feature activation+2.21e-02

Pos logit contributions
 babys+0.231
 haircut+0.212
 happened+0.205
 gloss+0.200
 complexion+0.198
Neg logit contributions
ales-0.183
mediately-0.162
ining-0.159
orting-0.158
reci-0.158
Token,
Token ID11
Feature activation+4.67e-02

Pos logit contributions
 babys+0.037
 haircut+0.035
 gloss+0.034
 complexion+0.032
 virus+0.032
Neg logit contributions
ales-0.027
ining-0.026
 often-0.026
orting-0.025
rost-0.025
Token there
Token ID612
Feature activation-9.77e-02

Pos logit contributions
 babys+0.063
 happened+0.061
 haircut+0.056
 complexion+0.053
 gloss+0.053
Neg logit contributions
ales-0.079
isk-0.069
orting-0.069
urbed-0.068
reci-0.068
Token was
Token ID373
Feature activation+1.19e-01

Pos logit contributions
ales+0.135
ining+0.126
mediately+0.126
 often+0.124
isk+0.123
Neg logit contributions
 babys-0.155
 haircut-0.141
 gloss-0.135
 complexion-0.133
 virus-0.129
Token a
Token ID257
Feature activation+1.38e-01

Pos logit contributions
 babys+0.213
 haircut+0.196
 complexion+0.186
 gloss+0.186
 virus+0.184
Neg logit contributions
 At-0.144
 often-0.141
 itself-0.140
 In-0.136
ales-0.135
Token big
Token ID1263
Feature activation-2.57e-02

Pos logit contributions
 babys+0.215
 gloss+0.204
 happened+0.203
 haircut+0.197
 hairc+0.192
Neg logit contributions
 often-0.171
rost-0.158
ales-0.155
mediately-0.155
isk-0.153
Token truck
Token ID7779
Feature activation+6.78e-02

Pos logit contributions
ales+0.040
mediately+0.036
ining+0.036
orting+0.036
isk+0.036
Neg logit contributions
 babys-0.037
 haircut-0.033
 happened-0.032
 gloss-0.032
 complexion-0.031
Token.
Token ID13
Feature activation+4.16e-02

Pos logit contributions
 babys+0.106
 haircut+0.096
 happened+0.091
 complexion+0.090
 gloss+0.089
Neg logit contributions
ales-0.100
mediately-0.090
reci-0.090
urbed-0.089
ining-0.088
Token It
Token ID632
Feature activation-7.81e-02

Pos logit contributions
 babys+0.081
 haircut+0.076
 happened+0.074
 gloss+0.071
 complexion+0.071
Neg logit contributions
ales-0.044
orting-0.039
wed-0.038
isk-0.038
urbed-0.038
Token was
Token ID373
Feature activation+1.19e-01

Pos logit contributions
ales+0.145
orting+0.132
ility+0.131
urbed+0.131
mediately+0.127
Neg logit contributions
 babys-0.100
 happened-0.090
 haircut-0.083
 complexion-0.075
 rang-0.074
Token very
Token ID845
Feature activation+1.73e-01

Pos logit contributions
 babys+0.128
 haircut+0.112
 gloss+0.107
 complexion+0.103
 happened+0.102
Neg logit contributions
ales-0.226
mediately-0.213
 often-0.211
ining-0.209
rost-0.208
Token about
Token ID546
Feature activation-7.65e-02

Pos logit contributions
ales+0.028
mediately+0.026
reci+0.025
rost+0.025
ining+0.025
Neg logit contributions
 babys-0.027
 haircut-0.025
 happened-0.025
 gloss-0.024
 complexion-0.023
Token this
Token ID428
Feature activation-1.81e-02

Pos logit contributions
ales+0.136
mediately+0.125
reci+0.122
wed+0.120
orting+0.119
Neg logit contributions
 babys-0.109
 haircut-0.101
 happened-0.096
 makeup-0.089
 hairc-0.089
Token and
Token ID290
Feature activation-4.06e-02

Pos logit contributions
ales+0.030
mediately+0.027
rost+0.027
ining+0.027
isk+0.026
Neg logit contributions
 babys-0.026
 haircut-0.023
 happened-0.023
 gloss-0.022
 photo-0.021
Token kept
Token ID4030
Feature activation-2.28e-02

Pos logit contributions
ales+0.081
rost+0.074
reci+0.073
orting+0.072
ility+0.071
Neg logit contributions
 babys-0.046
 happened-0.043
 haircut-0.041
 gloss-0.040
 underwear-0.035
Token demanding
Token ID11334
Feature activation-6.19e-02

Pos logit contributions
ales+0.043
mediately+0.040
reci+0.039
orting+0.039
rost+0.039
Neg logit contributions
 babys-0.028
 haircut-0.024
 happened-0.023
 gloss-0.022
 panc-0.021
Token for
Token ID329
Feature activation-5.44e-02

Pos logit contributions
ales+0.105
mediately+0.097
isk+0.094
ining+0.093
reci+0.093
Neg logit contributions
 babys-0.086
 haircut-0.081
 gloss-0.078
 happened-0.075
 hairc-0.072
Token somebody
Token ID8276
Feature activation+5.02e-02

Pos logit contributions
ales+0.093
reci+0.092
mediately+0.091
ining+0.088
acking+0.088
Neg logit contributions
 babys-0.071
 haircut-0.071
 hairc-0.063
 lipstick-0.063
 happened-0.060
Token to
Token ID284
Feature activation+6.31e-03

Pos logit contributions
 babys+0.059
 haircut+0.054
 happened+0.052
 gloss+0.049
 complexion+0.046
Neg logit contributions
ales-0.093
mediately-0.087
reci-0.085
rost-0.085
ining-0.085
Token help
Token ID1037
Feature activation+4.41e-03

Pos logit contributions
 babys+0.009
 haircut+0.009
 cough+0.008
 happened+0.008
 lipstick+0.008
Neg logit contributions
ales-0.010
ining-0.010
reci-0.010
urbed-0.010
mediately-0.010
Token her
Token ID607
Feature activation+5.77e-02

Pos logit contributions
 babys+0.005
 haircut+0.005
 happened+0.004
 gloss+0.004
 lipstick+0.004
Neg logit contributions
ales-0.008
mediately-0.008
reci-0.008
ining-0.008
wed-0.008
Token clean
Token ID3424
Feature activation-9.82e-02

Pos logit contributions
 babys+0.078
 haircut+0.072
 gloss+0.066
 happened+0.065
 complexion+0.063
Neg logit contributions
ales-0.099
reci-0.091
mediately-0.090
orting-0.089
rost-0.089
Tokenila
Token ID10102
Feature activation-3.24e-02

Pos logit contributions
ales+0.094
mediately+0.084
ining+0.084
orting+0.083
wed+0.082
Neg logit contributions
 babys-0.106
 haircut-0.096
 gloss-0.090
 complexion-0.089
 happened-0.089
Token and
Token ID290
Feature activation-4.33e-02

Pos logit contributions
ales+0.045
mediately+0.040
ining+0.040
isk+0.039
orting+0.039
Neg logit contributions
 babys-0.053
 haircut-0.048
 happened-0.046
 gloss-0.045
 complexion-0.045
Token Ben
Token ID3932
Feature activation-2.48e-02

Pos logit contributions
ales+0.040
 itself+0.034
isk+0.033
orting+0.033
rost+0.033
Neg logit contributions
 babys-0.088
 haircut-0.081
 virus-0.079
 complexion-0.079
 happened-0.079
Token ran
Token ID4966
Feature activation-2.19e-02

Pos logit contributions
ales+0.038
mediately+0.035
wed+0.035
urbed+0.035
ining+0.035
Neg logit contributions
 babys-0.035
 haircut-0.032
 gloss-0.030
 virus-0.029
 happened-0.029
Token to
Token ID284
Feature activation+7.66e-03

Pos logit contributions
ales+0.035
mediately+0.032
reci+0.032
ining+0.032
isk+0.032
Neg logit contributions
 babys-0.031
 haircut-0.028
 happened-0.026
 gloss-0.026
 complexion-0.025
Token the
Token ID262
Feature activation-2.15e-02

Pos logit contributions
 babys+0.014
 haircut+0.013
 happened+0.013
 gloss+0.013
 complexion+0.012
Neg logit contributions
ales-0.009
mediately-0.008
ining-0.008
reci-0.007
urbed-0.007
Token crystal
Token ID15121
Feature activation+5.59e-02

Pos logit contributions
ales+0.025
 itself+0.024
mediately+0.024
rost+0.023
 often+0.023
Neg logit contributions
 babys-0.037
 haircut-0.034
 gloss-0.033
 happened-0.033
 complexion-0.032
Token.
Token ID13
Feature activation+3.06e-02

Pos logit contributions
 babys+0.079
 haircut+0.073
 complexion+0.068
 gloss+0.066
 tooth+0.066
Neg logit contributions
ales-0.089
reci-0.085
mediately-0.084
ining-0.083
urbed-0.082
Token They
Token ID1119
Feature activation-1.38e-01

Pos logit contributions
 babys+0.055
 happened+0.050
 haircut+0.048
 cough+0.047
 complexion+0.047
Neg logit contributions
ales-0.039
urbed-0.035
isk-0.034
ility-0.033
wed-0.032
Token saw
Token ID2497
Feature activation-2.42e-02

Pos logit contributions
ales+0.228
mediately+0.206
orting+0.204
ining+0.204
reci+0.204
Neg logit contributions
 babys-0.193
 haircut-0.170
 happened-0.164
 complexion-0.159
 gloss-0.157
Token the
Token ID262
Feature activation-2.20e-02

Pos logit contributions
ales+0.026
 itself+0.024
mediately+0.023
orting+0.022
reci+0.022
Neg logit contributions
 babys-0.047
 haircut-0.043
 gloss-0.041
 complexion-0.041
 virus-0.040
Token and
Token ID290
Feature activation-3.58e-02

Pos logit contributions
ales+0.078
rost+0.072
mediately+0.069
isk+0.069
rant+0.069
Neg logit contributions
 happened-0.061
 babys-0.060
 haircut-0.054
 gloss-0.053
 virus-0.051
Token helpful
Token ID7613
Feature activation-6.83e-02

Pos logit contributions
ales+0.076
rost+0.070
ide+0.068
ale+0.066
urbed+0.066
Neg logit contributions
 babys-0.033
 happened-0.032
 haircut-0.031
 hurt-0.030
nam-0.029
Token is
Token ID318
Feature activation+1.03e-01

Pos logit contributions
ales+0.117
rost+0.108
mediately+0.105
isk+0.105
rant+0.103
Neg logit contributions
 babys-0.088
 happened-0.087
 haircut-0.081
 gloss-0.075
 virus-0.074
Token more
Token ID517
Feature activation+4.08e-02

Pos logit contributions
 happened+0.115
 babys+0.113
 haircut+0.104
 gloss+0.093
 very+0.090
Neg logit contributions
ales-0.203
rost-0.185
acking-0.173
mediately-0.173
urbed-0.173
Token important
Token ID1593
Feature activation-5.68e-02

Pos logit contributions
 babys+0.054
 happened+0.050
 haircut+0.049
 complexion+0.046
 gloss+0.046
Neg logit contributions
ales-0.069
isk-0.064
rost-0.063
mediately-0.061
urbed-0.060
Token than
Token ID621
Feature activation-7.75e-02

Pos logit contributions
ales+0.079
mediately+0.071
isk+0.069
rost+0.069
ining+0.069
Neg logit contributions
 babys-0.092
 haircut-0.085
 happened-0.082
 gloss-0.080
 complexion-0.079
Token having
Token ID1719
Feature activation-1.06e-01

Pos logit contributions
ales+0.129
mediately+0.120
ining+0.118
orting+0.116
reci+0.116
Neg logit contributions
 babys-0.109
 haircut-0.099
 happened-0.096
 hairc-0.089
 gloss-0.087
Token luxury
Token ID13064
Feature activation-4.46e-02

Pos logit contributions
ales+0.187
mediately+0.172
rost+0.170
isk+0.168
ining+0.167
Neg logit contributions
 babys-0.134
 haircut-0.120
 happened-0.117
 gloss-0.110
 complexion-0.107
Token things
Token ID1243
Feature activation+3.12e-02

Pos logit contributions
ales+0.068
mediately+0.063
ining+0.062
isk+0.061
rost+0.061
Neg logit contributions
 babys-0.066
 haircut-0.060
 happened-0.058
 gloss-0.056
 complexion-0.055
Token.
Token ID13
Feature activation+4.05e-02

Pos logit contributions
 happened+0.040
 babys+0.039
 haircut+0.037
 gloss+0.035
 virus+0.033
Neg logit contributions
ales-0.053
mediately-0.050
rost-0.049
isk-0.049
urbed-0.047
Token<|endoftext|>
Token ID50256
Feature activation-4.20e-02

Pos logit contributions
 babys+0.056
 gloss+0.049
 happened+0.046
 haircut+0.045
 complexion+0.043
Neg logit contributions
ales-0.072
wed-0.063
rant-0.062
urbed-0.061
ility-0.060
Token it
Token ID340
Feature activation+4.39e-02

Pos logit contributions
isk+0.017
ales+0.016
mediately+0.016
oster+0.016
aling+0.015
Neg logit contributions
 very-0.006
 happened-0.006
 something-0.006
 photo-0.006
 hairc-0.006
Token every
Token ID790
Feature activation-5.64e-02

Pos logit contributions
 babys+0.062
 gloss+0.059
 haircut+0.059
 very+0.058
 panc+0.057
Neg logit contributions
ales-0.063
reci-0.061
mediately-0.060
rost-0.058
ility-0.057
Token day
Token ID1110
Feature activation+5.78e-02

Pos logit contributions
ales+0.090
isk+0.082
orting+0.082
ining+0.080
mediately+0.079
Neg logit contributions
 babys-0.082
 haircut-0.075
 happened-0.074
 photo-0.071
 panc-0.069
Token.
Token ID13
Feature activation+4.03e-02

Pos logit contributions
 babys+0.056
 haircut+0.049
 panc+0.049
 gloss+0.048
 complexion+0.047
Neg logit contributions
isk-0.113
mediately-0.113
reci-0.111
ales-0.110
rost-0.107
Token
Token ID198
Feature activation-3.54e-02

Pos logit contributions
 babys+0.058
 gloss+0.051
 happened+0.049
 haircut+0.047
 panc+0.044
Neg logit contributions
ales-0.069
urbed-0.062
ility-0.061
wed-0.060
mediately-0.059
Token
Token ID198
Feature activation-3.49e-02

Pos logit contributions
ales+0.053
urbed+0.047
mediately+0.047
ility+0.046
wed+0.045
Neg logit contributions
 babys-0.054
 happened-0.049
 haircut-0.048
 gloss-0.046
 complexion-0.045
TokenAfter
Token ID3260
Feature activation-1.09e-01

Pos logit contributions
ales+0.053
ility+0.043
ried+0.043
utt+0.043
ws+0.042
Neg logit contributions
 babys-0.052
 gloss-0.049
 haircut-0.049
 happened-0.048
 lipstick-0.046
Token a
Token ID257
Feature activation+1.33e-01

Pos logit contributions
ales+0.189
mediately+0.174
ining+0.171
reci+0.169
urbed+0.169
Neg logit contributions
 babys-0.143
 haircut-0.126
 happened-0.121
 gloss-0.118
 complexion-0.115
Token few
Token ID1178
Feature activation-5.56e-02

Pos logit contributions
 babys+0.227
 haircut+0.201
 gloss+0.199
 underwear+0.198
 happened+0.195
Neg logit contributions
ales-0.156
 often-0.153
rost-0.146
ining-0.143
urbed-0.142
Token weeks
Token ID2745
Feature activation-2.75e-02

Pos logit contributions
ales+0.064
isk+0.057
mediately+0.056
rost+0.055
ining+0.055
Neg logit contributions
 babys-0.102
 happened-0.096
 haircut-0.095
 gloss-0.090
 complexion-0.090
Token,
Token ID11
Feature activation+4.65e-02

Pos logit contributions
ales+0.053
mediately+0.052
ining+0.051
urbed+0.051
reci+0.049
Neg logit contributions
 babys-0.030
 happened-0.028
 haircut-0.024
 gloss-0.022
ird-0.021
Token turns
Token ID4962
Feature activation+9.38e-02

Pos logit contributions
ales+0.081
isk+0.074
mediately+0.073
orting+0.071
urbed+0.071
Neg logit contributions
 babys-0.101
 haircut-0.095
 happened-0.092
 gloss-0.089
 complexion-0.087
Token to
Token ID284
Feature activation+6.85e-03

Pos logit contributions
 babys+0.110
 haircut+0.101
 happened+0.100
 gloss+0.095
 hairc+0.088
Neg logit contributions
ales-0.172
mediately-0.160
isk-0.160
reci-0.158
rost-0.156
Token kick
Token ID4829
Feature activation-8.66e-02

Pos logit contributions
 haircut+0.009
 babys+0.009
 shirt+0.008
 underwear+0.008
 lipstick+0.008
Neg logit contributions
ales-0.011
reci-0.010
urbed-0.010
rost-0.010
ility-0.010
Token it
Token ID340
Feature activation+4.09e-02

Pos logit contributions
ales+0.119
isk+0.112
mediately+0.107
rost+0.106
ining+0.105
Neg logit contributions
 babys-0.135
 haircut-0.130
 happened-0.127
 hairc-0.124
 photo-0.124
Token.
Token ID13
Feature activation+4.22e-02

Pos logit contributions
 babys+0.044
 haircut+0.043
 gloss+0.042
 happened+0.040
 underwear+0.037
Neg logit contributions
ales-0.074
reci-0.070
isk-0.070
rost-0.069
mediately-0.068
Token The
Token ID383
Feature activation-1.39e-01

Pos logit contributions
 babys+0.069
 haircut+0.063
 gloss+0.061
 lipstick+0.060
 happened+0.060
Neg logit contributions
ales-0.060
ility-0.057
cut-0.057
urbed-0.056
reci-0.056
Token ball
Token ID2613
Feature activation+4.14e-02

Pos logit contributions
ales+0.205
mediately+0.198
 itself+0.190
orting+0.187
reci+0.186
Neg logit contributions
 babys-0.210
 haircut-0.182
 happened-0.174
 complexion-0.173
 gloss-0.172
Token flew
Token ID13112
Feature activation-5.16e-02

Pos logit contributions
 babys+0.056
 happened+0.052
 haircut+0.051
 gloss+0.049
 complexion+0.046
Neg logit contributions
ales-0.070
isk-0.062
reci-0.062
ility-0.061
orting-0.061
Token high
Token ID1029
Feature activation-9.60e-02

Pos logit contributions
ales+0.084
reci+0.082
isk+0.078
rost+0.077
ility+0.076
Neg logit contributions
 babys-0.062
 very-0.062
 happened-0.061
 lipstick-0.056
 photo-0.055
Token in
Token ID287
Feature activation-9.17e-02

Pos logit contributions
ales+0.136
reci+0.131
isk+0.127
rost+0.124
ility+0.124
Neg logit contributions
 babys-0.140
 haircut-0.133
 happened-0.129
 complexion-0.123
 gloss-0.122
Token the
Token ID262
Feature activation-1.86e-02

Pos logit contributions
ales+0.130
orting+0.123
reci+0.123
isk+0.118
rost+0.116
Neg logit contributions
 babys-0.142
 haircut-0.132
 photo-0.130
 happened-0.127
 lipstick-0.124
Token is
Token ID318
Feature activation+1.02e-01

Pos logit contributions
 babys+0.031
 haircut+0.027
 happened+0.026
 complexion+0.026
 gloss+0.026
Neg logit contributions
ales-0.028
ining-0.026
mediately-0.025
 often-0.025
orting-0.025
Token mine
Token ID6164
Feature activation+2.09e-02

Pos logit contributions
 babys+0.137
 haircut+0.123
 complexion+0.115
 virus+0.109
 gloss+0.109
Neg logit contributions
ales-0.164
 itself-0.162
 often-0.159
ining-0.157
mediately-0.154
Token!
Token ID0
Feature activation+3.51e-02

Pos logit contributions
 babys+0.040
 haircut+0.035
 gloss+0.034
 happened+0.034
 virus+0.033
Neg logit contributions
ales-0.024
 itself-0.023
 often-0.023
ining-0.022
mediately-0.020
Tokenâ
Token ID22940
Feature activation-8.24e-02

Pos logit contributions
 babys+0.068
 haircut+0.063
 happened+0.060
 complexion+0.060
 gloss+0.058
Neg logit contributions
ales-0.037
mediately-0.034
ining-0.033
reci-0.033
orting-0.033
Token
Token ID26391
Feature activation+1.30e-01

Pos logit contributions
ales+0.139
urbed+0.138
 often+0.132
rost+0.129
angles+0.127
Neg logit contributions
 babys-0.094
 haircut-0.087
 gloss-0.085
 happened-0.085
 lipstick-0.083
Token She
Token ID1375
Feature activation-1.38e-01

Pos logit contributions
 babys+0.214
 haircut+0.194
 gloss+0.180
 happened+0.180
 complexion+0.176
Neg logit contributions
ales-0.179
ining-0.165
isk-0.164
orting-0.160
mediately-0.157
Token tried
Token ID3088
Feature activation+4.18e-02

Pos logit contributions
 often+0.167
ales+0.156
mediately+0.153
ining+0.152
wed+0.151
Neg logit contributions
 babys-0.233
 haircut-0.213
 virus-0.206
 complexion-0.201
burn-0.201
Token to
Token ID284
Feature activation+8.90e-03

Pos logit contributions
 babys+0.060
 haircut+0.054
 happened+0.051
 gloss+0.051
 complexion+0.049
Neg logit contributions
ales-0.068
mediately-0.062
ining-0.061
reci-0.061
isk-0.060
Token pick
Token ID2298
Feature activation-8.03e-02

Pos logit contributions
 babys+0.015
 haircut+0.014
 complexion+0.013
 gloss+0.013
 photo+0.013
Neg logit contributions
ales-0.012
orting-0.011
reci-0.011
urbed-0.011
ining-0.011
Token it
Token ID340
Feature activation+4.48e-02

Pos logit contributions
ales+0.107
mediately+0.097
isk+0.096
ining+0.096
rost+0.095
Neg logit contributions
 babys-0.133
 haircut-0.124
 gloss-0.119
 happened-0.118
 lipstick-0.115
Token,
Token ID11
Feature activation+4.67e-02

Pos logit contributions
 babys+0.061
 haircut+0.057
 gloss+0.054
 lipstick+0.053
 happened+0.053
Neg logit contributions
ales-0.070
reci-0.069
rost-0.065
ility-0.065
urbed-0.065
Token happily
Token ID18177
Feature activation+6.15e-03

Pos logit contributions
ales+0.099
rost+0.093
reci+0.090
isk+0.090
ility+0.087
Neg logit contributions
 very-0.068
 babys-0.062
 happened-0.058
 haircut-0.056
 dizz-0.054
Token in
Token ID287
Feature activation-9.06e-02

Pos logit contributions
 babys+0.008
 haircut+0.007
 happened+0.007
 gloss+0.007
 complexion+0.007
Neg logit contributions
ales-0.011
rost-0.011
mediately-0.010
lated-0.010
angles-0.010
Token the
Token ID262
Feature activation-1.68e-02

Pos logit contributions
ales+0.149
mediately+0.134
orting+0.130
ining+0.125
rant+0.125
Neg logit contributions
 babys-0.139
 haircut-0.130
 photo-0.120
 happened-0.118
 tooth-0.116
Token sky
Token ID6766
Feature activation+1.90e-02

Pos logit contributions
ales+0.031
isk+0.028
rost+0.027
wed+0.027
ining+0.026
Neg logit contributions
 babys-0.022
 haircut-0.019
 happened-0.018
 photo-0.018
 nightmare-0.018
Token,
Token ID11
Feature activation+4.81e-02

Pos logit contributions
 babys+0.017
 happened+0.016
 very+0.016
 haircut+0.016
 complexion+0.016
Neg logit contributions
ales-0.039
rost-0.037
reci-0.037
mediately-0.036
lated-0.035
Token always
Token ID1464
Feature activation-1.24e-01

Pos logit contributions
 babys+0.040
 crying+0.038
 smiling+0.038
 happened+0.037
 very+0.036
Neg logit contributions
ales-0.111
rost-0.101
rog-0.100
lated-0.099
mediately-0.097
Token sharing
Token ID7373
Feature activation-1.16e-01

Pos logit contributions
ales+0.217
rost+0.193
orting+0.188
isk+0.188
reci+0.187
Neg logit contributions
 babys-0.162
 happened-0.155
 haircut-0.150
 complexion-0.138
 gloss-0.134
Token with
Token ID351
Feature activation-8.37e-02

Pos logit contributions
ales+0.180
mediately+0.162
rost+0.156
reci+0.155
wed+0.154
Neg logit contributions
 babys-0.169
 haircut-0.162
 happened-0.161
 hairc-0.150
 gloss-0.149
Token everyone
Token ID2506
Feature activation+1.54e-02

Pos logit contributions
ales+0.135
mediately+0.133
orting+0.128
rant+0.123
reci+0.123
Neg logit contributions
 babys-0.116
 happened-0.109
 hairc-0.104
 haircut-0.102
 tooth-0.098
Token and
Token ID290
Feature activation-4.07e-02

Pos logit contributions
 babys+0.022
 happened+0.022
 haircut+0.021
 hairc+0.019
 gloss+0.019
Neg logit contributions
ales-0.025
isk-0.023
mediately-0.022
rost-0.022
wed-0.022
Token never
Token ID1239
Feature activation-9.11e-02

Pos logit contributions
ales+0.069
rost+0.062
mediately+0.061
rant+0.060
orting+0.060
Neg logit contributions
 babys-0.057
 happened-0.055
 haircut-0.051
 hairc-0.048
 complexion-0.047
Token and
Token ID290
Feature activation-3.58e-02

Pos logit contributions
ales+0.034
urbed+0.032
ility+0.032
mediately+0.031
lated+0.030
Neg logit contributions
 babys-0.031
 happened-0.031
 haircut-0.028
 complexion-0.027
 cough-0.026
Token Ben
Token ID3932
Feature activation-2.79e-02

Pos logit contributions
ales+0.039
mediately+0.035
 often+0.034
 itself+0.034
ining+0.034
Neg logit contributions
 babys-0.068
 haircut-0.062
 gloss-0.061
 complexion-0.059
 happened-0.058
Token are
Token ID389
Feature activation+8.68e-02

Pos logit contributions
ales+0.050
urbed+0.047
mediately+0.047
ility+0.046
orting+0.045
Neg logit contributions
 babys-0.033
 happened-0.033
 haircut-0.030
 complexion-0.027
 cough-0.027
Token friends
Token ID2460
Feature activation+3.65e-02

Pos logit contributions
 babys+0.116
 haircut+0.105
 gloss+0.100
 happened+0.098
 complexion+0.095
Neg logit contributions
ales-0.144
 often-0.133
mediately-0.132
ining-0.131
orting-0.131
Token.
Token ID13
Feature activation+4.86e-02

Pos logit contributions
 babys+0.048
 haircut+0.042
 complexion+0.040
 happened+0.038
 panc+0.038
Neg logit contributions
ales-0.064
isk-0.058
mediately-0.058
reci-0.058
ining-0.057
Token They
Token ID1119
Feature activation-1.34e-01

Pos logit contributions
 babys+0.097
 haircut+0.089
 happened+0.086
 complexion+0.085
 gloss+0.084
Neg logit contributions
ales-0.051
urbed-0.044
ining-0.044
mediately-0.044
wed-0.043
Token like
Token ID588
Feature activation-8.89e-02

Pos logit contributions
ales+0.239
urbed+0.229
mediately+0.220
ining+0.217
ility+0.215
Neg logit contributions
 babys-0.169
 haircut-0.148
 happened-0.148
 complexion-0.145
 cough-0.139
Token to
Token ID284
Feature activation+3.43e-03

Pos logit contributions
ales+0.125
 often+0.112
reci+0.109
ining+0.109
mediately+0.109
Neg logit contributions
 babys-0.140
 haircut-0.124
 gloss-0.122
 happened-0.119
 complexion-0.118
Token play
Token ID711
Feature activation-4.41e-02

Pos logit contributions
 babys+0.007
 haircut+0.006
 happened+0.006
 gloss+0.006
 virus+0.006
Neg logit contributions
ales-0.004
 often-0.003
isk-0.003
mediately-0.003
orting-0.003
Token in
Token ID287
Feature activation-9.21e-02

Pos logit contributions
ales+0.060
mediately+0.054
urbed+0.053
rost+0.053
ining+0.053
Neg logit contributions
 babys-0.072
 haircut-0.067
 happened-0.064
 gloss-0.063
 complexion-0.062
Token the
Token ID262
Feature activation-1.78e-02

Pos logit contributions
ales+0.089
mediately+0.076
ining+0.074
orting+0.073
urbed+0.073
Neg logit contributions
 babys-0.190
 haircut-0.176
 happened-0.170
 gloss-0.168
 complexion-0.167
Token made
Token ID925
Feature activation-1.69e-02

Pos logit contributions
 babys+0.032
 happened+0.030
 haircut+0.029
 gloss+0.027
 complexion+0.026
Neg logit contributions
ales-0.045
reci-0.040
orting-0.040
isk-0.039
mediately-0.039
Token him
Token ID683
Feature activation+5.30e-02

Pos logit contributions
ales+0.025
reci+0.023
ility+0.023
orting+0.022
rant+0.022
Neg logit contributions
 happened-0.026
 babys-0.026
 haircut-0.023
 trouble-0.022
 nightmare-0.022
Token very
Token ID845
Feature activation+1.74e-01

Pos logit contributions
 babys+0.070
 happened+0.060
 complexion+0.058
 haircut+0.057
 gloss+0.056
Neg logit contributions
 often-0.084
 itself-0.082
ales-0.081
mediately-0.080
wed-0.078
Token sad
Token ID6507
Feature activation-2.74e-02

Pos logit contributions
 babys+0.165
 haircut+0.139
 happened+0.127
 gloss+0.125
 complexion+0.122
Neg logit contributions
ales-0.360
mediately-0.339
ining-0.333
urbed-0.333
reci-0.332
Token.
Token ID13
Feature activation+4.20e-02

Pos logit contributions
ales+0.043
mediately+0.039
reci+0.038
isk+0.038
ining+0.037
Neg logit contributions
 babys-0.041
 haircut-0.038
 happened-0.036
 gloss-0.035
 complexion-0.035
Token He
Token ID679
Feature activation-1.35e-01

Pos logit contributions
 babys+0.067
 happened+0.056
 gloss+0.054
 haircut+0.053
 lipstick+0.050
Neg logit contributions
ales-0.067
wed-0.062
reci-0.060
angles-0.060
ility-0.060
Token wished
Token ID16555
Feature activation+2.42e-02

Pos logit contributions
ales+0.232
orting+0.207
mediately+0.206
ining+0.205
isk+0.203
Neg logit contributions
 babys-0.181
 haircut-0.160
 happened-0.157
 gloss-0.150
 complexion-0.146
Token he
Token ID339
Feature activation-5.25e-02

Pos logit contributions
 babys+0.031
 happened+0.030
 haircut+0.027
 gloss+0.024
 hairc+0.024
Neg logit contributions
ales-0.047
reci-0.044
mediately-0.043
ining-0.042
wed-0.041
Token had
Token ID550
Feature activation-5.55e-03

Pos logit contributions
ales+0.090
orting+0.079
mediately+0.079
reci+0.077
isk+0.077
Neg logit contributions
 babys-0.072
 happened-0.070
 haircut-0.065
 gloss-0.063
 nightmare-0.059
Token stopped
Token ID5025
Feature activation+6.70e-02

Pos logit contributions
ales+0.007
mediately+0.006
ining+0.006
 often+0.006
wed+0.006
Neg logit contributions
 babys-0.010
 haircut-0.009
 gloss-0.008
 complexion-0.008
 happened-0.008
Token loading
Token ID11046
Feature activation-1.54e-01

Pos logit contributions
 babys+0.080
 haircut+0.069
 happened+0.066
 gloss+0.063
 cough+0.059
Neg logit contributions
ales-0.127
mediately-0.117
ining-0.116
rost-0.115
reci-0.114