TOP ACTIVATIONS
MAX = 7.311

careful , Lily . Cater pill ars can be tricky .
know , Lily . Cater pill ars need leaves and air
Did you know that cater pill ars turn into beautiful butterflies
saw lots of colourful cater pill ars and an old lady
ily liked to keep cater pill ars in a jar .
put it back . Cater pill ars need to eat leaves
but he also knew cater pill ars were not pets .
Lily , no . Cater pill ars are not for keeping
and watched as the cater pill ars grew bigger and bigger
and lady bugs and cater pill ars ." Tim
. Soon , more cater pill ars arrived . They also
, " I love cater pill ars ! You can be
said , " C ater pill ars are not meant to
you , Lily . Cater pill ars can leave and never
more of the hairy cater pill ars . <|endoftext|> Once there
he noticed some other cater pill ars stretching towards it too
. She saw two cater pill ars crawling around on a
tell the mom my cater pill ars ! ''
They saw the wood cut ter coming to the woods
his friends , the wood cut ter was able to part

MIN ACTIVATIONS
MIN = -5.854

that the kitten was un har med ! The two hugged
that the kitten was un har med ! The two hugged
tree , he was un har med , but he felt
still be healthy and un har med . Suddenly , her
managed to get away un har med . From that day
around in z ig z ags . When Zig
sure his z ig z ags were neat and tidy .
enjoyed writing z ig z ags . He wrote z ig
He wrote z ig z ags on walls , floors ,
lines to z ig z ags . One day
so many z ig z ags ," he said nervously .
Jack was turning the r ags , he looked out the
writing his z ig z ags , he heard a funny
She found some old r ags which were dirty . She
mom bought some new r ags . When they got back
The dog w ags its tail . It l
it was difficult finding r ags and water . But Bob
him to turn the r ags she kept in the closet
grabbed one of the r ags and started to spin it
and showed her the r ags . She said that she

INTERVAL 4.020 - 7.311
CONTAINS 0.004%

the machine was a shred der , that could cut things
have to find the pasta aisle ." L ily
and it was the y umm iest ice cream she ever
. She loved to eat av oc ados because they were
The ocean was a sad der place without Fin 's happy

INTERVAL 0.729 - 4.020
CONTAINS 14.486%

happy and cl apped even louder . The end . <|endoftext|>
. She said : " Don 't worry , I 'll
We found them in the box !" " No
couldn 't have finished this without you ." Tim my smiled
her . " Wow , Lily , you were

INTERVAL -2.563 - 0.729
CONTAINS 84.942%

here . Maybe it likes to splash and play with its
L ily picked one of the flowers and gave it to
little to understand , so he just listened to M ummy
see a sign that says " tem ple ". They ask
sure we will continue to hold hands for all the seasons

INTERVAL -5.854 - -2.563
CONTAINS 0.568%

there was a little boy named Tim . Tim was a
They knew they wouldn 't forget this special day . <|endoftext|>
there was a little girl named Lily . Lily loved going
there was a little girl named Lily . She had a
with his mom and they both had lots of fun !
Token careful
Token ID8161
Feature activation-1.04e+00

Pos logit contributions
 She+0.004
 means+0.003
 ornament+0.003
 Everyday+0.003
otion+0.003
Neg logit contributions
 lifetime-0.003
 injuries-0.003
uc-0.003
bek-0.003
uddled-0.002
Token,
Token ID11
Feature activation+3.80e-01

Pos logit contributions
 injuries+0.047
 lifetime+0.043
uc+0.042
har+0.042
ppers+0.041
Neg logit contributions
 She-0.040
 Everyday-0.031
 means-0.030
 ornament-0.030
 asks-0.028
Token Lily
Token ID20037
Feature activation+7.98e-01

Pos logit contributions
 She+0.015
 Everyday+0.012
 ornament+0.012
 means+0.011
otion+0.010
Neg logit contributions
 injuries-0.017
 lifetime-0.017
uc-0.016
bek-0.015
har-0.015
Token.
Token ID13
Feature activation+1.69e-01

Pos logit contributions
 She+0.032
 ornament+0.024
 Everyday+0.024
 means+0.023
otion+0.023
Neg logit contributions
 lifetime-0.036
 injuries-0.035
uc-0.034
bek-0.033
uddled-0.032
Token Cater
Token ID40742
Feature activation+1.10e+00

Pos logit contributions
 She+0.005
 ornament+0.005
 means+0.004
 idea+0.004
igator+0.004
Neg logit contributions
 lifetime-0.008
 injuries-0.008
uc-0.008
har-0.008
bek-0.007
Tokenpill
Token ID27215
Feature activation+6.29e+00

Pos logit contributions
 She+0.043
 ornament+0.033
 Everyday+0.032
 means+0.031
otion+0.030
Neg logit contributions
 lifetime-0.052
uc-0.048
 injuries-0.047
bek-0.046
uddled-0.046
Tokenars
Token ID945
Feature activation-2.72e+00

Pos logit contributions
 She+0.421
 means+0.354
 ornament+0.350
 Everyday+0.346
otion+0.331
Neg logit contributions
 lifetime-0.139
 injuries-0.124
uc-0.112
bek-0.107
har-0.100
Token can
Token ID460
Feature activation+7.39e-01

Pos logit contributions
 lifetime+0.144
 injuries+0.141
uc+0.138
har+0.131
bek+0.130
Neg logit contributions
 She-0.093
 ornament-0.063
 Everyday-0.060
 means-0.058
otion-0.053
Token be
Token ID307
Feature activation-1.32e+00

Pos logit contributions
 She+0.030
 ornament+0.023
otion+0.023
 means+0.023
 Everyday+0.022
Neg logit contributions
 injuries-0.033
 lifetime-0.029
har-0.029
uc-0.028
eds-0.027
Token tricky
Token ID17198
Feature activation-1.34e-02

Pos logit contributions
 lifetime+0.060
 injuries+0.057
uc+0.055
bek+0.053
har+0.052
Neg logit contributions
 She-0.058
 means-0.044
 ornament-0.043
 Everyday-0.042
otion-0.040
Token.
Token ID13
Feature activation+1.44e-01

Pos logit contributions
 lifetime+0.001
 injuries+0.001
uc+0.001
bek+0.001
uddled+0.001
Neg logit contributions
 She-0.001
 ornament-0.000
 Everyday-0.000
otion-0.000
 means-0.000
Token know
Token ID760
Feature activation+4.71e-01

Pos logit contributions
 She+0.063
 ornament+0.047
 means+0.045
 Everyday+0.044
otion+0.041
Neg logit contributions
 lifetime-0.099
 injuries-0.098
bek-0.090
uc-0.090
oy-0.090
Token,
Token ID11
Feature activation+2.90e-01

Pos logit contributions
 She+0.019
 means+0.014
 ornament+0.013
 Everyday+0.013
otion+0.012
Neg logit contributions
 lifetime-0.023
 injuries-0.022
uc-0.021
bek-0.021
har-0.020
Token Lily
Token ID20037
Feature activation+6.28e-01

Pos logit contributions
 She+0.012
 Everyday+0.009
 means+0.009
 ornament+0.009
otion+0.008
Neg logit contributions
 injuries-0.013
 lifetime-0.012
bek-0.011
edy-0.011
uc-0.011
Token.
Token ID13
Feature activation+5.55e-02

Pos logit contributions
 She+0.028
 Everyday+0.023
 ornament+0.022
otion+0.021
 means+0.021
Neg logit contributions
 lifetime-0.025
 injuries-0.024
uc-0.023
bek-0.022
ounced-0.021
Token Cater
Token ID40742
Feature activation+1.13e+00

Pos logit contributions
 She+0.002
 ornament+0.001
 idea+0.001
otion+0.001
 Everyday+0.001
Neg logit contributions
 lifetime-0.002
 injuries-0.002
uc-0.002
bek-0.002
har-0.002
Tokenpill
Token ID27215
Feature activation+6.28e+00

Pos logit contributions
 She+0.044
 ornament+0.035
 Everyday+0.033
 means+0.032
otion+0.032
Neg logit contributions
 lifetime-0.052
uc-0.048
 injuries-0.047
bek-0.045
ounced-0.045
Tokenars
Token ID945
Feature activation-2.78e+00

Pos logit contributions
 She+0.406
 means+0.339
 ornament+0.332
 Everyday+0.330
otion+0.316
Neg logit contributions
 lifetime-0.156
 injuries-0.142
uc-0.129
bek-0.124
har-0.116
Token need
Token ID761
Feature activation-5.40e-02

Pos logit contributions
 lifetime+0.151
 injuries+0.147
uc+0.144
har+0.136
ounced+0.136
Neg logit contributions
 She-0.087
 ornament-0.059
 Everyday-0.057
 means-0.055
otion-0.050
Token leaves
Token ID5667
Feature activation+6.18e-01

Pos logit contributions
 lifetime+0.003
 injuries+0.003
bek+0.003
uc+0.003
har+0.003
Neg logit contributions
 She-0.002
 means-0.001
 ornament-0.001
 Everyday-0.001
 It-0.001
Token and
Token ID290
Feature activation-6.09e-02

Pos logit contributions
 She+0.026
 ornament+0.021
 Everyday+0.020
otion+0.020
 means+0.019
Neg logit contributions
 injuries-0.025
 lifetime-0.025
uc-0.024
eds-0.023
ounced-0.022
Token air
Token ID1633
Feature activation+1.00e-01

Pos logit contributions
 injuries+0.003
 lifetime+0.003
uc+0.002
ounced+0.002
oy+0.002
Neg logit contributions
 She-0.003
otion-0.002
 ornament-0.002
 means-0.002
 Everyday-0.002
Token Did
Token ID7731
Feature activation+1.95e+00

Pos logit contributions
 lifetime+0.001
bek+0.001
 injuries+0.001
uc+0.001
har+0.001
Neg logit contributions
 She-0.001
 ornament-0.001
 means-0.001
otion-0.000
 Everyday-0.000
Token you
Token ID345
Feature activation+2.20e-01

Pos logit contributions
 She+0.095
 means+0.076
 ornament+0.076
 Everyday+0.075
otion+0.072
Neg logit contributions
 lifetime-0.071
 injuries-0.069
uc-0.063
bek-0.060
ounced-0.059
Token know
Token ID760
Feature activation+5.06e-01

Pos logit contributions
 She+0.009
 means+0.006
otion+0.006
 ornament+0.006
 Everyday+0.006
Neg logit contributions
 lifetime-0.010
 injuries-0.010
uc-0.009
har-0.009
oy-0.009
Token that
Token ID326
Feature activation-7.47e-01

Pos logit contributions
 She+0.024
 means+0.018
 Everyday+0.018
 ornament+0.017
otion+0.016
Neg logit contributions
 lifetime-0.021
 injuries-0.020
uc-0.019
bek-0.018
har-0.018
Token cater
Token ID20825
Feature activation+1.51e+00

Pos logit contributions
 lifetime+0.034
 injuries+0.031
bek+0.028
uc+0.028
har+0.027
Neg logit contributions
 She-0.031
 Everyday-0.023
otion-0.022
 ornament-0.020
 means-0.020
Tokenpill
Token ID27215
Feature activation+6.19e+00

Pos logit contributions
 She+0.063
 ornament+0.049
 means+0.047
 Everyday+0.046
otion+0.044
Neg logit contributions
 lifetime-0.067
uc-0.062
 injuries-0.059
ounced-0.058
eds-0.058
Tokenars
Token ID945
Feature activation-2.84e+00

Pos logit contributions
 She+0.413
 means+0.347
 ornament+0.341
 Everyday+0.337
otion+0.324
Neg logit contributions
 lifetime-0.142
 injuries-0.122
uc-0.113
bek-0.108
har-0.101
Token turn
Token ID1210
Feature activation-1.07e+00

Pos logit contributions
 lifetime+0.140
 injuries+0.132
uc+0.132
eds+0.123
bek+0.122
Neg logit contributions
 She-0.108
 ornament-0.076
 Everyday-0.074
 means-0.071
otion-0.069
Token into
Token ID656
Feature activation+7.36e-01

Pos logit contributions
 lifetime+0.052
uc+0.049
 injuries+0.049
ounced+0.046
eds+0.046
Neg logit contributions
 She-0.042
 ornament-0.029
 Everyday-0.029
otion-0.028
 means-0.028
Token beautiful
Token ID4950
Feature activation+5.21e-03

Pos logit contributions
 She+0.029
 means+0.020
otion+0.020
 Everyday+0.020
 ornament+0.019
Neg logit contributions
 lifetime-0.036
 injuries-0.035
uc-0.033
har-0.031
bek-0.031
Token butterflies
Token ID44571
Feature activation+9.73e-02

Pos logit contributions
 She+0.000
 means+0.000
otion+0.000
 Everyday+0.000
 ornament+0.000
Neg logit contributions
 lifetime-0.000
 injuries-0.000
uc-0.000
bek-0.000
har-0.000
Token saw
Token ID2497
Feature activation-1.27e+00

Pos logit contributions
 She+0.005
 means+0.004
 Everyday+0.004
 ornament+0.004
+0.003
Neg logit contributions
 lifetime-0.004
 injuries-0.004
uc-0.004
bek-0.004
har-0.004
Token lots
Token ID6041
Feature activation+5.71e-01

Pos logit contributions
 lifetime+0.058
 injuries+0.057
uddled+0.053
uc+0.053
ounced+0.051
Neg logit contributions
 She-0.048
 means-0.037
 Everyday-0.035
 ornament-0.034
otion-0.034
Token of
Token ID286
Feature activation+1.18e+00

Pos logit contributions
 She+0.021
 means+0.016
otion+0.014
 Everyday+0.014
 ornament+0.014
Neg logit contributions
 lifetime-0.025
uddled-0.025
 injuries-0.024
uc-0.023
uddle-0.022
Token colourful
Token ID44799
Feature activation+9.53e-01

Pos logit contributions
 She+0.054
 means+0.043
otion+0.041
 Everyday+0.041
 ornament+0.040
Neg logit contributions
 lifetime-0.047
 injuries-0.044
uc-0.042
ounced-0.039
uddled-0.039
Token cater
Token ID20825
Feature activation+1.68e+00

Pos logit contributions
 She+0.040
 means+0.031
 Everyday+0.030
otion+0.030
 ornament+0.029
Neg logit contributions
 lifetime-0.042
uc-0.040
 injuries-0.039
uddled-0.037
ounced-0.037
Tokenpill
Token ID27215
Feature activation+6.17e+00

Pos logit contributions
 She+0.069
 means+0.055
 ornament+0.052
 Everyday+0.052
otion+0.048
Neg logit contributions
 lifetime-0.077
uc-0.071
 injuries-0.070
ounced-0.068
uddled-0.067
Tokenars
Token ID945
Feature activation-2.68e+00

Pos logit contributions
 She+0.399
 means+0.336
 ornament+0.328
 Everyday+0.326
otion+0.310
Neg logit contributions
 lifetime-0.153
 injuries-0.135
uc-0.125
bek-0.120
har-0.112
Token and
Token ID290
Feature activation-1.18e-01

Pos logit contributions
 lifetime+0.118
 injuries+0.109
uc+0.105
uddled+0.101
ounced+0.101
Neg logit contributions
 She-0.115
 means-0.091
 ornament-0.089
 Everyday-0.088
 It-0.078
Token an
Token ID281
Feature activation-8.31e-01

Pos logit contributions
 lifetime+0.006
 injuries+0.005
uc+0.005
bek+0.005
ounced+0.005
Neg logit contributions
 She-0.005
otion-0.004
 means-0.003
 ornament-0.003
 Everyday-0.003
Token old
Token ID1468
Feature activation+2.29e+00

Pos logit contributions
 lifetime+0.039
 injuries+0.036
uc+0.036
bek+0.035
har+0.034
Neg logit contributions
 She-0.033
 means-0.025
otion-0.023
 ornament-0.022
 Everyday-0.022
Token lady
Token ID10846
Feature activation+2.90e-01

Pos logit contributions
 She+0.090
 means+0.067
 Everyday+0.065
 ornament+0.063
otion+0.062
Neg logit contributions
 lifetime-0.113
 injuries-0.106
uc-0.103
bek-0.100
ounced-0.097
Tokenily
Token ID813
Feature activation+4.09e-01

Pos logit contributions
 lifetime+0.006
 injuries+0.006
uc+0.006
bek+0.006
uddled+0.006
Neg logit contributions
 She-0.004
 means-0.003
 ornament-0.003
 Everyday-0.003
otion-0.003
Token liked
Token ID8288
Feature activation+1.45e-02

Pos logit contributions
 She+0.015
 ornament+0.012
otion+0.012
 Everyday+0.011
 seem+0.010
Neg logit contributions
 lifetime-0.018
 injuries-0.017
uc-0.017
uddled-0.017
oy-0.017
Token to
Token ID284
Feature activation-7.03e-02

Pos logit contributions
 She+0.001
 means+0.001
 Everyday+0.001
otion+0.001
 ornament+0.001
Neg logit contributions
 lifetime-0.001
 injuries-0.001
uc-0.001
uddled-0.000
bek-0.000
Token keep
Token ID1394
Feature activation-1.57e-01

Pos logit contributions
 lifetime+0.002
 injuries+0.002
uc+0.002
oy+0.002
inations+0.002
Neg logit contributions
 She-0.003
 means-0.003
otion-0.003
 ornament-0.003
 Everyday-0.002
Token cater
Token ID20825
Feature activation+1.81e+00

Pos logit contributions
 injuries+0.006
 lifetime+0.006
uc+0.006
eds+0.005
ounced+0.005
Neg logit contributions
 She-0.008
 Everyday-0.006
 means-0.006
 ornament-0.006
otion-0.006
Tokenpill
Token ID27215
Feature activation+6.16e+00

Pos logit contributions
 She+0.075
 means+0.056
 ornament+0.054
 Everyday+0.053
otion+0.049
Neg logit contributions
 lifetime-0.086
 injuries-0.081
uc-0.080
bek-0.076
uddled-0.075
Tokenars
Token ID945
Feature activation-2.44e+00

Pos logit contributions
 She+0.411
 means+0.349
 ornament+0.340
 Everyday+0.338
otion+0.324
Neg logit contributions
 lifetime-0.140
 injuries-0.123
uc-0.111
bek-0.105
har-0.100
Token in
Token ID287
Feature activation+4.76e-01

Pos logit contributions
 lifetime+0.108
 injuries+0.103
uc+0.101
uddled+0.095
oy+0.095
Neg logit contributions
 She-0.098
 means-0.081
otion-0.073
 Everyday-0.073
 ornament-0.068
Token a
Token ID257
Feature activation-4.42e-01

Pos logit contributions
 She+0.013
 means+0.009
 ornament+0.008
otion+0.007
 Everyday+0.007
Neg logit contributions
 injuries-0.028
 lifetime-0.028
uc-0.026
bek-0.025
har-0.025
Token jar
Token ID17379
Feature activation+1.80e+00

Pos logit contributions
 lifetime+0.023
 injuries+0.022
uc+0.021
bek+0.021
har+0.020
Neg logit contributions
 She-0.016
 means-0.012
 ornament-0.011
 Everyday-0.011
otion-0.010
Token.
Token ID13
Feature activation+1.09e-01

Pos logit contributions
 She+0.078
 means+0.063
 ornament+0.059
 Everyday+0.058
otion+0.057
Neg logit contributions
 lifetime-0.078
 injuries-0.074
uc-0.070
uddled-0.066
bek-0.065
Token put
Token ID1234
Feature activation-1.21e-01

Pos logit contributions
 injuries+0.009
 lifetime+0.008
har+0.008
uc+0.007
bek+0.007
Neg logit contributions
 She-0.009
 means-0.007
 ornament-0.007
otion-0.007
 Everyday-0.006
Token it
Token ID340
Feature activation+4.25e-01

Pos logit contributions
 lifetime+0.007
 injuries+0.007
uc+0.007
bek+0.007
har+0.007
Neg logit contributions
 She-0.004
 means-0.002
 ornament-0.002
 Everyday-0.002
otion-0.002
Token back
Token ID736
Feature activation-1.15e+00

Pos logit contributions
 She+0.016
 ornament+0.011
 means+0.011
 Everyday+0.010
 she+0.010
Neg logit contributions
 lifetime-0.022
 injuries-0.021
uc-0.020
bek-0.020
uddled-0.019
Token.
Token ID13
Feature activation+1.38e-01

Pos logit contributions
 lifetime+0.063
 injuries+0.060
uc+0.058
bek+0.057
har+0.055
Neg logit contributions
 She-0.040
 means-0.028
 ornament-0.026
 Everyday-0.025
 she-0.023
Token Cater
Token ID40742
Feature activation+1.39e+00

Pos logit contributions
 She+0.004
 ornament+0.004
igator+0.003
 means+0.003
 idea+0.003
Neg logit contributions
 injuries-0.007
 lifetime-0.007
bek-0.006
har-0.006
uc-0.006
Tokenpill
Token ID27215
Feature activation+6.14e+00

Pos logit contributions
 She+0.055
 ornament+0.044
 Everyday+0.041
 means+0.039
otion+0.038
Neg logit contributions
 lifetime-0.064
uc-0.060
 injuries-0.059
uddled-0.057
ounced-0.056
Tokenars
Token ID945
Feature activation-2.40e+00

Pos logit contributions
 She+0.405
 means+0.339
 ornament+0.333
 Everyday+0.330
otion+0.316
Neg logit contributions
 lifetime-0.144
 injuries-0.129
uc-0.117
bek-0.113
har-0.106
Token need
Token ID761
Feature activation+1.53e-01

Pos logit contributions
 lifetime+0.121
 injuries+0.119
uc+0.116
har+0.109
eds+0.109
Neg logit contributions
 She-0.084
 ornament-0.059
 Everyday-0.058
 means-0.055
otion-0.052
Token to
Token ID284
Feature activation-2.21e-01

Pos logit contributions
 She+0.006
 means+0.004
 ornament+0.004
 Everyday+0.004
otion+0.004
Neg logit contributions
 lifetime-0.008
 injuries-0.007
uc-0.007
bek-0.007
har-0.007
Token eat
Token ID4483
Feature activation-3.19e-02

Pos logit contributions
 injuries+0.010
 lifetime+0.010
uc+0.009
har+0.009
bek+0.009
Neg logit contributions
 She-0.009
 means-0.007
 Everyday-0.007
 ornament-0.007
otion-0.007
Token leaves
Token ID5667
Feature activation+8.10e-01

Pos logit contributions
 lifetime+0.002
 injuries+0.002
uc+0.002
bek+0.002
har+0.002
Neg logit contributions
 She-0.001
 means-0.001
 ornament-0.001
 Everyday-0.001
otion-0.001
Token but
Token ID475
Feature activation-1.01e+00

Pos logit contributions
 She+0.021
 ornament+0.016
 means+0.016
 Everyday+0.016
otion+0.015
Neg logit contributions
 injuries-0.022
 lifetime-0.022
uc-0.019
bek-0.019
ounced-0.019
Token he
Token ID339
Feature activation-3.42e-01

Pos logit contributions
 injuries+0.042
 lifetime+0.039
ounced+0.035
uddled+0.035
bek+0.035
Neg logit contributions
 She-0.041
 Everyday-0.034
 ornament-0.034
 means-0.032
otion-0.031
Token also
Token ID635
Feature activation-8.22e-01

Pos logit contributions
 lifetime+0.017
 injuries+0.017
uc+0.016
bek+0.016
har+0.015
Neg logit contributions
 She-0.013
 means-0.010
 ornament-0.009
 Everyday-0.009
otion-0.008
Token knew
Token ID2993
Feature activation-1.53e-01

Pos logit contributions
 lifetime+0.037
 injuries+0.035
uc+0.034
bek+0.033
har+0.032
Neg logit contributions
 She-0.037
 means-0.028
 ornament-0.027
 Everyday-0.027
otion-0.025
Token cater
Token ID20825
Feature activation+1.70e+00

Pos logit contributions
 lifetime+0.008
 injuries+0.008
uc+0.008
bek+0.007
uddled+0.007
Neg logit contributions
 She-0.005
 means-0.004
 ornament-0.004
 Everyday-0.004
otion-0.003
Tokenpill
Token ID27215
Feature activation+6.14e+00

Pos logit contributions
 She+0.069
 ornament+0.052
 means+0.051
 Everyday+0.050
otion+0.045
Neg logit contributions
 lifetime-0.079
uc-0.075
 injuries-0.074
ounced-0.071
uddled-0.070
Tokenars
Token ID945
Feature activation-2.43e+00

Pos logit contributions
 She+0.380
 means+0.314
 Everyday+0.306
 ornament+0.304
otion+0.292
Neg logit contributions
 lifetime-0.169
 injuries-0.159
uc-0.145
bek-0.141
har-0.133
Token were
Token ID547
Feature activation+1.42e-01

Pos logit contributions
 lifetime+0.117
 injuries+0.113
uc+0.108
bek+0.105
uddled+0.102
Neg logit contributions
 She-0.097
 means-0.071
 ornament-0.070
 Everyday-0.069
otion-0.063
Token not
Token ID407
Feature activation+7.02e-01

Pos logit contributions
 She+0.006
 means+0.004
 Everyday+0.004
 ornament+0.004
otion+0.004
Neg logit contributions
 lifetime-0.007
 injuries-0.006
uc-0.006
bek-0.006
har-0.006
Token pets
Token ID17252
Feature activation-9.04e-01

Pos logit contributions
 She+0.030
 Everyday+0.023
 means+0.022
 ornament+0.021
otion+0.021
Neg logit contributions
 lifetime-0.031
 injuries-0.030
uc-0.029
ounced-0.027
bek-0.027
Token.
Token ID13
Feature activation+1.83e-02

Pos logit contributions
 injuries+0.035
 lifetime+0.035
uc+0.032
uddled+0.029
eds+0.029
Neg logit contributions
 She-0.041
 Everyday-0.032
 ornament-0.031
 means-0.030
otion-0.030
Token Lily
Token ID20037
Feature activation+6.49e-01

Pos logit contributions
 She+0.020
 means+0.014
 Everyday+0.014
 ornament+0.014
otion+0.013
Neg logit contributions
 injuries-0.024
 lifetime-0.024
uc-0.021
bek-0.021
ounced-0.020
Token,
Token ID11
Feature activation+5.23e-01

Pos logit contributions
 She+0.023
 Everyday+0.016
 ornament+0.016
 means+0.016
otion+0.015
Neg logit contributions
 lifetime-0.033
 injuries-0.032
uc-0.031
bek-0.030
uddled-0.029
Token no
Token ID645
Feature activation-1.92e-01

Pos logit contributions
 She+0.023
 Everyday+0.017
 means+0.017
 ornament+0.017
otion+0.015
Neg logit contributions
 injuries-0.023
 lifetime-0.022
uc-0.020
bek-0.020
ounced-0.019
Token.
Token ID13
Feature activation+1.68e-01

Pos logit contributions
 lifetime+0.011
 injuries+0.010
uc+0.010
bek+0.010
har+0.009
Neg logit contributions
 She-0.007
 means-0.005
 ornament-0.004
 Everyday-0.004
otion-0.004
Token Cater
Token ID40742
Feature activation+1.35e+00

Pos logit contributions
 She+0.006
 ornament+0.005
 idea+0.005
 Everyday+0.005
 means+0.005
Neg logit contributions
uc-0.007
 injuries-0.007
 lifetime-0.007
har-0.006
bek-0.006
Tokenpill
Token ID27215
Feature activation+6.09e+00

Pos logit contributions
 She+0.053
 ornament+0.042
 Everyday+0.040
 means+0.038
otion+0.037
Neg logit contributions
 lifetime-0.061
uc-0.059
 injuries-0.057
ounced-0.054
uddled-0.054
Tokenars
Token ID945
Feature activation-2.40e+00

Pos logit contributions
 She+0.391
 means+0.325
 ornament+0.318
 Everyday+0.316
otion+0.304
Neg logit contributions
 lifetime-0.155
 injuries-0.141
uc-0.129
bek-0.124
har-0.117
Token are
Token ID389
Feature activation+5.16e-01

Pos logit contributions
 lifetime+0.138
 injuries+0.135
uc+0.131
bek+0.127
ounced+0.125
Neg logit contributions
 She-0.071
 ornament-0.044
 means-0.044
 Everyday-0.043
otion-0.037
Token not
Token ID407
Feature activation+8.40e-01

Pos logit contributions
 She+0.022
 means+0.016
 Everyday+0.016
 ornament+0.015
otion+0.015
Neg logit contributions
 lifetime-0.024
 injuries-0.023
uc-0.022
bek-0.021
ounced-0.021
Token for
Token ID329
Feature activation+6.84e-01

Pos logit contributions
 She+0.036
 Everyday+0.029
 means+0.026
otion+0.026
 ornament+0.025
Neg logit contributions
 lifetime-0.036
 injuries-0.035
uc-0.033
ounced-0.031
oy-0.031
Token keeping
Token ID5291
Feature activation-3.89e-02

Pos logit contributions
 She+0.030
 means+0.022
 ornament+0.021
 Everyday+0.021
otion+0.020
Neg logit contributions
 lifetime-0.032
 injuries-0.030
uc-0.029
bek-0.028
har-0.027
Token and
Token ID290
Feature activation-1.70e-01

Pos logit contributions
 She+0.020
 means+0.015
 ornament+0.015
 Everyday+0.014
otion+0.013
Neg logit contributions
 lifetime-0.029
 injuries-0.028
uc-0.026
bek-0.026
uddled-0.025
Token watched
Token ID7342
Feature activation-5.87e-01

Pos logit contributions
 lifetime+0.007
 injuries+0.007
uc+0.007
bek+0.007
har+0.006
Neg logit contributions
 She-0.008
 means-0.006
 ornament-0.006
 Everyday-0.006
otion-0.005
Token as
Token ID355
Feature activation-7.34e-01

Pos logit contributions
 lifetime+0.029
 injuries+0.029
uc+0.027
bek+0.026
har+0.026
Neg logit contributions
 She-0.022
 means-0.016
 Everyday-0.015
 ornament-0.015
otion-0.014
Token the
Token ID262
Feature activation+1.92e-01

Pos logit contributions
 lifetime+0.038
 injuries+0.037
uc+0.035
bek+0.034
uddled+0.033
Neg logit contributions
 She-0.027
 means-0.020
 Everyday-0.019
 ornament-0.018
otion-0.017
Token cater
Token ID20825
Feature activation+1.62e+00

Pos logit contributions
 She+0.006
 means+0.004
 ornament+0.004
 Everyday+0.004
otion+0.004
Neg logit contributions
 lifetime-0.011
 injuries-0.010
uc-0.010
bek-0.010
har-0.010
Tokenpill
Token ID27215
Feature activation+6.04e+00

Pos logit contributions
 She+0.062
 means+0.046
 Everyday+0.045
 ornament+0.044
otion+0.040
Neg logit contributions
 lifetime-0.080
 injuries-0.075
uc-0.074
uddled-0.071
bek-0.071
Tokenars
Token ID945
Feature activation-2.60e+00

Pos logit contributions
 She+0.348
 means+0.280
 Everyday+0.273
 ornament+0.272
+0.271
Neg logit contributions
 lifetime-0.182
 injuries-0.182
uc-0.164
bek-0.160
har-0.150
Token grew
Token ID6348
Feature activation+1.17e+00

Pos logit contributions
 lifetime+0.119
 injuries+0.114
uc+0.109
bek+0.105
uddled+0.103
Neg logit contributions
 She-0.111
 means-0.083
 ornament-0.082
 Everyday-0.081
otion-0.074
Token bigger
Token ID5749
Feature activation+2.06e+00

Pos logit contributions
 She+0.045
 means+0.033
 Everyday+0.032
 ornament+0.032
otion+0.030
Neg logit contributions
 lifetime-0.058
 injuries-0.056
uc-0.054
har-0.051
bek-0.051
Token and
Token ID290
Feature activation-1.85e-01

Pos logit contributions
 She+0.081
 Everyday+0.059
 means+0.059
 ornament+0.058
otion+0.057
Neg logit contributions
 lifetime-0.100
 injuries-0.098
uc-0.091
bek-0.087
har-0.086
Token bigger
Token ID5749
Feature activation+2.07e+00

Pos logit contributions
 lifetime+0.009
 injuries+0.008
bek+0.008
uc+0.008
har+0.008
Neg logit contributions
 She-0.008
 means-0.006
 Everyday-0.005
otion-0.005
 ornament-0.005
Token and
Token ID290
Feature activation+2.77e-01

Pos logit contributions
 She+0.001
 Everyday+0.001
 ornament+0.001
 means+0.001
otion+0.001
Neg logit contributions
 injuries-0.002
 lifetime-0.002
uc-0.001
eds-0.001
uddled-0.001
Token lady
Token ID10846
Feature activation+1.07e-01

Pos logit contributions
 She+0.012
 means+0.010
otion+0.009
 Everyday+0.009
 ornament+0.009
Neg logit contributions
 lifetime-0.012
 injuries-0.011
uc-0.010
ounced-0.010
eds-0.009
Tokenbugs
Token ID32965
Feature activation-6.28e-01

Pos logit contributions
 She+0.005
 Everyday+0.004
 means+0.004
 ornament+0.004
otion+0.003
Neg logit contributions
 lifetime-0.005
 injuries-0.004
uc-0.004
bek-0.004
ounced-0.004
Token and
Token ID290
Feature activation+2.57e-01

Pos logit contributions
 injuries+0.031
 lifetime+0.031
uc+0.030
uddled+0.028
ounced+0.028
Neg logit contributions
 She-0.022
 Everyday-0.016
 ornament-0.016
 means-0.015
otion-0.015
Token cater
Token ID20825
Feature activation+1.71e+00

Pos logit contributions
 She+0.011
otion+0.009
 Everyday+0.008
 ornament+0.008
 means+0.008
Neg logit contributions
 lifetime-0.011
 injuries-0.011
uc-0.010
ounced-0.009
bek-0.009
Tokenpill
Token ID27215
Feature activation+6.03e+00

Pos logit contributions
 She+0.069
 means+0.052
 ornament+0.051
 Everyday+0.050
otion+0.046
Neg logit contributions
 lifetime-0.081
 injuries-0.079
uc-0.076
ounced-0.072
uddled-0.071
Tokenars
Token ID945
Feature activation-2.51e+00

Pos logit contributions
 She+0.386
 means+0.321
 ornament+0.315
 Everyday+0.313
otion+0.299
Neg logit contributions
 lifetime-0.153
 injuries-0.140
uc-0.127
bek-0.124
har-0.116
Token."
Token ID526
Feature activation+6.23e-01

Pos logit contributions
 lifetime+0.114
 injuries+0.110
uc+0.106
bek+0.100
uddled+0.099
Neg logit contributions
 She-0.104
 ornament-0.078
 Everyday-0.078
 means-0.077
otion-0.072
Token
Token ID198
Feature activation+3.14e-01

Pos logit contributions
 She+0.020
 ornament+0.017
 means+0.017
 Everyday+0.015
igator+0.015
Neg logit contributions
 lifetime-0.030
 injuries-0.030
uc-0.028
bek-0.027
har-0.027
Token
Token ID198
Feature activation+3.00e-01

Pos logit contributions
 She+0.009
 means+0.005
 Everyday+0.004
 ornament+0.004
 It+0.004
Neg logit contributions
 lifetime-0.021
 injuries-0.019
bek-0.019
uc-0.019
har-0.018
TokenTim
Token ID14967
Feature activation-3.37e-01

Pos logit contributions
 She+0.015
 means+0.011
 ornament+0.011
 Everyday+0.011
+0.011
Neg logit contributions
 lifetime-0.013
 injuries-0.011
uc-0.011
bek-0.010
uddled-0.010
Token.
Token ID13
Feature activation-9.19e-03

Pos logit contributions
 lifetime+0.005
 injuries+0.005
uc+0.005
bek+0.005
har+0.005
Neg logit contributions
 She-0.004
 means-0.003
 ornament-0.002
 she-0.002
 Everyday-0.002
Token Soon
Token ID15894
Feature activation+5.91e-01

Pos logit contributions
 lifetime+0.001
 injuries+0.001
uc+0.000
bek+0.000
har+0.000
Neg logit contributions
 She-0.000
 ornament-0.000
 means-0.000
 Everyday-0.000
otion-0.000
Token,
Token ID11
Feature activation+3.73e-01

Pos logit contributions
 She+0.021
 ornament+0.015
 means+0.015
 Everyday+0.014
otion+0.013
Neg logit contributions
 lifetime-0.031
 injuries-0.030
uc-0.028
ounced-0.027
uddled-0.027
Token more
Token ID517
Feature activation+1.37e-01

Pos logit contributions
 She+0.016
 ornament+0.012
 means+0.012
 Everyday+0.012
otion+0.011
Neg logit contributions
 lifetime-0.017
 injuries-0.016
uc-0.015
bek-0.014
uddled-0.014
Token cater
Token ID20825
Feature activation+1.67e+00

Pos logit contributions
 She+0.006
 means+0.004
 Everyday+0.004
 ornament+0.004
otion+0.004
Neg logit contributions
 lifetime-0.006
 injuries-0.006
uc-0.006
bek-0.005
uddled-0.005
Tokenpill
Token ID27215
Feature activation+6.01e+00

Pos logit contributions
 She+0.065
 means+0.048
 ornament+0.047
 Everyday+0.047
otion+0.042
Neg logit contributions
 lifetime-0.083
 injuries-0.078
uc-0.077
uddled-0.073
ounced-0.072
Tokenars
Token ID945
Feature activation-2.47e+00

Pos logit contributions
 She+0.336
 means+0.264
+0.259
 ornament+0.258
 she+0.257
Neg logit contributions
 lifetime-0.191
 injuries-0.191
uc-0.177
bek-0.173
ounced-0.162
Token arrived
Token ID5284
Feature activation-6.07e-01

Pos logit contributions
 lifetime+0.126
 injuries+0.120
uc+0.115
bek+0.113
har+0.110
Neg logit contributions
 She-0.094
 means-0.068
 ornament-0.066
 Everyday-0.065
otion-0.059
Token.
Token ID13
Feature activation-1.27e-01

Pos logit contributions
 lifetime+0.034
 injuries+0.033
bek+0.032
uc+0.031
har+0.031
Neg logit contributions
 She-0.021
 means-0.013
 Everyday-0.012
 ornament-0.012
 she-0.012
Token They
Token ID1119
Feature activation-1.12e-01

Pos logit contributions
 lifetime+0.006
 injuries+0.006
uc+0.006
har+0.006
ounced+0.006
Neg logit contributions
 She-0.004
 ornament-0.003
 means-0.003
igator-0.003
otion-0.003
Token also
Token ID635
Feature activation-8.42e-01

Pos logit contributions
 lifetime+0.005
 injuries+0.005
uc+0.005
bek+0.005
har+0.004
Neg logit contributions
 She-0.005
 means-0.004
 ornament-0.004
 Everyday-0.003
otion-0.003
Token,
Token ID11
Feature activation+3.91e-01

Pos logit contributions
 lifetime+0.006
uc+0.005
uddled+0.005
 injuries+0.005
ounced+0.005
Neg logit contributions
 She-0.004
 Everyday-0.004
 means-0.003
 ornament-0.003
igator-0.003
Token "
Token ID366
Feature activation-5.13e-01

Pos logit contributions
 She+0.015
 ornament+0.012
 means+0.012
 Everyday+0.012
igator+0.011
Neg logit contributions
 lifetime-0.018
 injuries-0.016
uc-0.016
uddled-0.016
ounced-0.015
TokenI
Token ID40
Feature activation-2.83e-01

Pos logit contributions
 lifetime+0.022
 injuries+0.020
bek+0.020
uc+0.019
har+0.019
Neg logit contributions
 She-0.022
 means-0.018
 ornament-0.018
otion-0.017
 Everyday-0.017
Token love
Token ID1842
Feature activation-8.22e-01

Pos logit contributions
 lifetime+0.013
 injuries+0.012
uc+0.012
bek+0.011
ounced+0.011
Neg logit contributions
 She-0.012
 ornament-0.009
 Everyday-0.009
otion-0.008
 means-0.008
Token cater
Token ID20825
Feature activation+1.79e+00

Pos logit contributions
 lifetime+0.036
 injuries+0.034
uc+0.033
bek+0.031
ounced+0.030
Neg logit contributions
 She-0.037
 means-0.029
 Everyday-0.028
 ornament-0.028
otion-0.027
Tokenpill
Token ID27215
Feature activation+6.00e+00

Pos logit contributions
 She+0.069
 means+0.052
 ornament+0.051
 Everyday+0.049
otion+0.046
Neg logit contributions
 lifetime-0.088
uc-0.082
 injuries-0.082
ounced-0.077
uddled-0.076
Tokenars
Token ID945
Feature activation-2.60e+00

Pos logit contributions
 She+0.371
 means+0.306
 ornament+0.299
 Everyday+0.298
otion+0.285
Neg logit contributions
 lifetime-0.166
 injuries-0.154
uc-0.141
bek-0.137
har-0.129
Token!
Token ID0
Feature activation-6.71e-01

Pos logit contributions
 lifetime+0.108
uc+0.102
 injuries+0.102
bek+0.093
oy+0.092
Neg logit contributions
 She-0.117
 means-0.091
 Everyday-0.088
 ornament-0.087
otion-0.083
Token You
Token ID921
Feature activation+9.91e-01

Pos logit contributions
 lifetime+0.028
bek+0.025
 injuries+0.025
har+0.024
uc+0.024
Neg logit contributions
 She-0.025
 ornament-0.023
 means-0.021
otion-0.020
 idea-0.020
Token can
Token ID460
Feature activation+8.89e-01

Pos logit contributions
 She+0.039
otion+0.030
 ornament+0.028
 means+0.027
 Everyday+0.027
Neg logit contributions
 lifetime-0.046
uc-0.041
 injuries-0.039
oy-0.039
har-0.038
Token be
Token ID307
Feature activation-1.17e+00

Pos logit contributions
 She+0.037
 means+0.034
otion+0.032
 ornament+0.030
 Everyday+0.028
Neg logit contributions
 lifetime-0.034
 injuries-0.033
uc-0.032
oy-0.031
har-0.030
Token said
Token ID531
Feature activation-9.37e-02

Pos logit contributions
 She+0.010
 means+0.007
 she+0.007
 ornament+0.007
 Everyday+0.007
Neg logit contributions
 lifetime-0.008
 injuries-0.008
uc-0.007
bek-0.007
har-0.007
Token,
Token ID11
Feature activation+3.99e-01

Pos logit contributions
 lifetime+0.005
 injuries+0.005
uc+0.005
uddled+0.005
ounced+0.005
Neg logit contributions
 She-0.003
 Everyday-0.002
 ornament-0.002
 means-0.002
igator-0.002
Token "
Token ID366
Feature activation-4.28e-01

Pos logit contributions
 She+0.014
 means+0.011
 ornament+0.011
 Everyday+0.011
igator+0.010
Neg logit contributions
 injuries-0.019
 lifetime-0.019
uc-0.018
uddled-0.018
bek-0.017
TokenC
Token ID34
Feature activation+1.66e-01

Pos logit contributions
 lifetime+0.020
 injuries+0.018
bek+0.018
uc+0.018
har+0.017
Neg logit contributions
 She-0.018
 means-0.014
 ornament-0.014
 Everyday-0.014
otion-0.013
Tokenater
Token ID729
Feature activation-4.19e-01

Pos logit contributions
 She+0.008
 means+0.006
 ornament+0.006
 Everyday+0.006
otion+0.006
Neg logit contributions
 lifetime-0.007
 injuries-0.006
uc-0.006
bek-0.006
har-0.006
Tokenpill
Token ID27215
Feature activation+5.97e+00

Pos logit contributions
uc+0.017
 lifetime+0.016
uddled+0.016
ounced+0.016
 injuries+0.015
Neg logit contributions
 She-0.017
 Everyday-0.014
 ornament-0.013
 means-0.013
 wonder-0.012
Tokenars
Token ID945
Feature activation-2.56e+00

Pos logit contributions
 She+0.392
 means+0.329
 ornament+0.323
 Everyday+0.321
otion+0.306
Neg logit contributions
 lifetime-0.141
 injuries-0.126
uc-0.115
bek-0.111
har-0.103
Token are
Token ID389
Feature activation+3.03e-01

Pos logit contributions
 lifetime+0.146
 injuries+0.142
uc+0.138
bek+0.134
har+0.132
Neg logit contributions
 She-0.078
 means-0.051
 ornament-0.051
 Everyday-0.050
otion-0.042
Token not
Token ID407
Feature activation+7.17e-01

Pos logit contributions
 She+0.012
 means+0.008
 ornament+0.008
 Everyday+0.008
+0.007
Neg logit contributions
 lifetime-0.016
 injuries-0.015
uc-0.015
bek-0.014
uddled-0.014
Token meant
Token ID4001
Feature activation+8.17e-01

Pos logit contributions
 She+0.028
 Everyday+0.021
 means+0.020
 ornament+0.020
otion+0.019
Neg logit contributions
 lifetime-0.035
 injuries-0.034
uc-0.033
bek-0.032
ounced-0.031
Token to
Token ID284
Feature activation-2.89e-01

Pos logit contributions
 She+0.034
 means+0.026
 Everyday+0.025
 ornament+0.025
otion+0.023
Neg logit contributions
 lifetime-0.038
 injuries-0.037
uc-0.035
bek-0.033
ounced-0.033
Token you
Token ID345
Feature activation+7.81e-02

Pos logit contributions
 She+0.021
 means+0.017
 ornament+0.016
 Everyday+0.016
otion+0.015
Neg logit contributions
 lifetime-0.020
 injuries-0.019
uc-0.018
bek-0.017
har-0.017
Token,
Token ID11
Feature activation+3.65e-01

Pos logit contributions
 She+0.003
 means+0.002
 ornament+0.002
 Everyday+0.002
otion+0.002
Neg logit contributions
 lifetime-0.004
 injuries-0.004
uc-0.003
bek-0.003
uddled-0.003
Token Lily
Token ID20037
Feature activation+6.29e-01

Pos logit contributions
 She+0.015
 means+0.012
 Everyday+0.011
 ornament+0.011
igator+0.010
Neg logit contributions
 injuries-0.016
 lifetime-0.015
uc-0.014
bek-0.014
har-0.013
Token.
Token ID13
Feature activation+9.65e-03

Pos logit contributions
 She+0.024
 ornament+0.017
 means+0.017
 Everyday+0.017
otion+0.016
Neg logit contributions
 lifetime-0.032
 injuries-0.030
uc-0.029
bek-0.028
uddled-0.027
Token Cater
Token ID40742
Feature activation+1.21e+00

Pos logit contributions
 She+0.000
 ornament+0.000
 idea+0.000
 means+0.000
otion+0.000
Neg logit contributions
 lifetime-0.000
 injuries-0.000
bek-0.000
har-0.000
uc-0.000
Tokenpill
Token ID27215
Feature activation+5.95e+00

Pos logit contributions
 She+0.047
 ornament+0.036
 means+0.034
 Everyday+0.034
otion+0.032
Neg logit contributions
 lifetime-0.058
uc-0.054
 injuries-0.054
uddled-0.052
bek-0.051
Tokenars
Token ID945
Feature activation-2.67e+00

Pos logit contributions
 She+0.370
 means+0.302
 Everyday+0.296
 ornament+0.295
otion+0.284
Neg logit contributions
 lifetime-0.164
 injuries-0.155
uc-0.143
bek-0.136
har-0.128
Token can
Token ID460
Feature activation+7.02e-01

Pos logit contributions
 lifetime+0.154
 injuries+0.148
uc+0.144
bek+0.139
har+0.138
Neg logit contributions
 She-0.081
 ornament-0.051
 means-0.051
 Everyday-0.049
otion-0.043
Token leave
Token ID2666
Feature activation-8.76e-01

Pos logit contributions
 She+0.030
 means+0.024
 ornament+0.023
otion+0.022
 Everyday+0.022
Neg logit contributions
 injuries-0.030
 lifetime-0.028
har-0.027
uc-0.026
edy-0.025
Token and
Token ID290
Feature activation-1.03e-02

Pos logit contributions
 lifetime+0.045
 injuries+0.043
uc+0.041
bek+0.041
har+0.040
Neg logit contributions
 She-0.033
 means-0.024
 ornament-0.023
 Everyday-0.023
otion-0.020
Token never
Token ID1239
Feature activation-8.90e-01

Pos logit contributions
 lifetime+0.000
 injuries+0.000
har+0.000
uc+0.000
bek+0.000
Neg logit contributions
 She-0.000
 means-0.000
 Everyday-0.000
otion-0.000
 ornament-0.000
Token more
Token ID517
Feature activation+1.43e-01

Pos logit contributions
 She+0.016
 her+0.010
 she+0.010
 herself+0.009
 ornament+0.008
Neg logit contributions
 lifetime-0.035
bek-0.034
uc-0.034
ounced-0.033
uddled-0.033
Token of
Token ID286
Feature activation+1.14e+00

Pos logit contributions
 She+0.006
 means+0.005
 ornament+0.005
 Everyday+0.004
otion+0.004
Neg logit contributions
 lifetime-0.007
 injuries-0.006
uc-0.006
bek-0.006
har-0.006
Token the
Token ID262
Feature activation+1.55e-01

Pos logit contributions
 She+0.044
 means+0.031
 Everyday+0.030
 ornament+0.030
otion+0.028
Neg logit contributions
 lifetime-0.058
 injuries-0.057
uc-0.053
bek-0.051
har-0.050
Token hairy
Token ID43873
Feature activation+8.11e-02

Pos logit contributions
 She+0.006
 ornament+0.004
 means+0.004
 Everyday+0.004
+0.004
Neg logit contributions
 lifetime-0.008
 injuries-0.008
uc-0.008
bek-0.007
uddled-0.007
Token cater
Token ID20825
Feature activation+1.65e+00

Pos logit contributions
 She+0.003
 means+0.002
 ornament+0.002
 Everyday+0.002
+0.002
Neg logit contributions
 lifetime-0.004
 injuries-0.004
uc-0.004
bek-0.004
har-0.004
Tokenpill
Token ID27215
Feature activation+5.93e+00

Pos logit contributions
 She+0.064
 means+0.049
 ornament+0.046
 Everyday+0.046
otion+0.043
Neg logit contributions
 lifetime-0.082
 injuries-0.076
uc-0.074
bek-0.072
har-0.071
Tokenars
Token ID945
Feature activation-2.56e+00

Pos logit contributions
 She+0.363
 means+0.293
 ornament+0.289
 Everyday+0.288
+0.284
Neg logit contributions
 lifetime-0.159
 injuries-0.158
uc-0.145
bek-0.138
ounced-0.130
Token.
Token ID13
Feature activation-3.25e-02

Pos logit contributions
 lifetime+0.121
 injuries+0.114
uc+0.108
bek+0.105
uddled+0.102
Neg logit contributions
 She-0.105
 means-0.080
 ornament-0.078
 Everyday-0.077
otion-0.071
Token<|endoftext|>
Token ID50256
Feature activation-1.34e+00

Pos logit contributions
 lifetime+0.002
 injuries+0.002
bek+0.002
uc+0.002
har+0.001
Neg logit contributions
 She-0.001
 ornament-0.001
 means-0.001
igator-0.001
otion-0.001
TokenOnce
Token ID7454
Feature activation-7.60e-01

Pos logit contributions
 lifetime+0.060
 injuries+0.055
uc+0.053
bek+0.051
uddled+0.051
Neg logit contributions
 She-0.055
 ornament-0.044
 means-0.043
 Everyday-0.041
otion-0.039
Token there
Token ID612
Feature activation+5.23e-01

Pos logit contributions
 lifetime+0.024
 injuries+0.022
uc+0.021
bek+0.020
ounced+0.020
Neg logit contributions
 She-0.041
 means-0.035
 ornament-0.034
 Everyday-0.033
-0.031
Token he
Token ID339
Feature activation-4.21e-01

Pos logit contributions
 She+0.003
 means+0.002
 ornament+0.002
 Everyday+0.002
otion+0.002
Neg logit contributions
 lifetime-0.004
 injuries-0.004
uc-0.003
bek-0.003
har-0.003
Token noticed
Token ID6810
Feature activation+1.62e-01

Pos logit contributions
 lifetime+0.017
 injuries+0.016
uc+0.015
bek+0.015
har+0.014
Neg logit contributions
 She-0.021
 means-0.016
 ornament-0.016
 Everyday-0.016
otion-0.015
Token some
Token ID617
Feature activation+1.25e+00

Pos logit contributions
 She+0.006
 means+0.005
 ornament+0.004
 Everyday+0.004
otion+0.004
Neg logit contributions
 lifetime-0.008
 injuries-0.008
uc-0.007
bek-0.007
har-0.007
Token other
Token ID584
Feature activation-9.83e-01

Pos logit contributions
 She+0.050
 means+0.036
 ornament+0.035
 Everyday+0.035
otion+0.032
Neg logit contributions
 lifetime-0.062
 injuries-0.059
uc-0.056
bek-0.056
har-0.054
Token cater
Token ID20825
Feature activation+1.53e+00

Pos logit contributions
 lifetime+0.046
 injuries+0.044
uc+0.042
bek+0.041
har+0.040
Neg logit contributions
 She-0.042
 means-0.031
 Everyday-0.030
 ornament-0.029
otion-0.028
Tokenpill
Token ID27215
Feature activation+5.93e+00

Pos logit contributions
 She+0.065
 means+0.051
 Everyday+0.049
 ornament+0.047
otion+0.045
Neg logit contributions
 lifetime-0.070
uc-0.066
 injuries-0.065
uddled-0.063
ounced-0.062
Tokenars
Token ID945
Feature activation-2.74e+00

Pos logit contributions
 She+0.386
 means+0.323
 ornament+0.316
 Everyday+0.314
otion+0.300
Neg logit contributions
 lifetime-0.146
 injuries-0.131
uc-0.120
bek-0.115
har-0.108
Token stretching
Token ID20880
Feature activation+4.13e-02

Pos logit contributions
 lifetime+0.132
 injuries+0.127
uc+0.119
uddled+0.114
ounced+0.114
Neg logit contributions
 She-0.110
 means-0.082
 ornament-0.080
 Everyday-0.078
otion-0.070
Token towards
Token ID3371
Feature activation-6.67e-01

Pos logit contributions
 She+0.002
 means+0.001
 ornament+0.001
 Everyday+0.001
otion+0.001
Neg logit contributions
 lifetime-0.002
 injuries-0.002
uc-0.002
bek-0.002
har-0.002
Token it
Token ID340
Feature activation+2.92e-01

Pos logit contributions
 lifetime+0.033
 injuries+0.032
uc+0.031
bek+0.030
har+0.029
Neg logit contributions
 She-0.026
 means-0.019
 ornament-0.018
 Everyday-0.018
otion-0.017
Token too
Token ID1165
Feature activation-1.65e-01

Pos logit contributions
 She+0.013
 ornament+0.010
otion+0.010
 means+0.010
 Everyday+0.010
Neg logit contributions
 lifetime-0.012
 injuries-0.011
uc-0.010
bek-0.010
har-0.010
Token.
Token ID13
Feature activation-9.70e-02

Pos logit contributions
 She+0.087
 means+0.065
 ornament+0.063
 Everyday+0.062
otion+0.058
Neg logit contributions
 lifetime-0.104
 injuries-0.099
uc-0.095
bek-0.092
har-0.090
Token She
Token ID1375
Feature activation+5.79e-01

Pos logit contributions
 lifetime+0.006
uc+0.006
uddled+0.006
 injuries+0.006
ounced+0.006
Neg logit contributions
 ornament-0.001
otion-0.001
 means-0.001
igator-0.001
 angel-0.001
Token saw
Token ID2497
Feature activation-1.35e+00

Pos logit contributions
 She+0.024
 means+0.017
 Everyday+0.017
 ornament+0.016
+0.016
Neg logit contributions
 lifetime-0.029
 injuries-0.028
uc-0.026
bek-0.026
har-0.025
Token two
Token ID734
Feature activation+4.36e-01

Pos logit contributions
 lifetime+0.062
 injuries+0.059
uc+0.057
uddled+0.055
ounced+0.054
Neg logit contributions
 She-0.051
 means-0.038
 Everyday-0.037
 ornament-0.036
otion-0.036
Token cater
Token ID20825
Feature activation+1.66e+00

Pos logit contributions
 She+0.018
 means+0.013
 ornament+0.013
 Everyday+0.012
otion+0.012
Neg logit contributions
 lifetime-0.021
 injuries-0.020
uc-0.019
bek-0.018
ounced-0.018
Tokenpill
Token ID27215
Feature activation+5.84e+00

Pos logit contributions
 She+0.067
 means+0.052
 ornament+0.051
 Everyday+0.050
otion+0.046
Neg logit contributions
 lifetime-0.077
 injuries-0.073
uc-0.073
uddled-0.070
ounced-0.069
Tokenars
Token ID945
Feature activation-2.78e+00

Pos logit contributions
 She+0.369
 means+0.304
 Everyday+0.298
 ornament+0.298
otion+0.285
Neg logit contributions
 lifetime-0.152
 injuries-0.143
uc-0.130
bek-0.125
har-0.118
Token crawling
Token ID34499
Feature activation-4.30e-01

Pos logit contributions
 lifetime+0.134
 injuries+0.127
uc+0.123
uddled+0.120
ounced+0.119
Neg logit contributions
 She-0.105
 means-0.080
 ornament-0.079
 Everyday-0.076
 It-0.069
Token around
Token ID1088
Feature activation+3.94e-01

Pos logit contributions
 lifetime+0.019
uc+0.019
 injuries+0.018
uddled+0.018
ounced+0.018
Neg logit contributions
 She-0.016
 ornament-0.012
 means-0.012
 Everyday-0.012
otion-0.012
Token on
Token ID319
Feature activation+1.66e-01

Pos logit contributions
 She+0.014
 means+0.010
 ornament+0.010
 Everyday+0.010
otion+0.009
Neg logit contributions
 lifetime-0.020
 injuries-0.020
uc-0.019
bek-0.018
har-0.018
Token a
Token ID257
Feature activation-7.20e-01

Pos logit contributions
 She+0.006
 means+0.005
otion+0.005
 ornament+0.005
igator+0.005
Neg logit contributions
 injuries-0.008
 lifetime-0.007
uc-0.007
bek-0.006
har-0.006
Token tell
Token ID1560
Feature activation-4.64e-01

Pos logit contributions
 injuries+0.016
 lifetime+0.016
har+0.014
uc+0.014
bek+0.014
Neg logit contributions
 She-0.016
 means-0.013
otion-0.012
 ornament-0.012
 Everyday-0.012
Token the
Token ID262
Feature activation+4.57e-02

Pos logit contributions
 lifetime+0.020
 injuries+0.019
uc+0.018
bek+0.018
har+0.017
Neg logit contributions
 She-0.021
 means-0.016
 ornament-0.016
 Everyday-0.016
otion-0.015
Token mom
Token ID1995
Feature activation-5.11e-01

Pos logit contributions
 She+0.002
 means+0.001
otion+0.001
 Everyday+0.001
 ornament+0.001
Neg logit contributions
 lifetime-0.002
 injuries-0.002
uc-0.002
bek-0.002
ounced-0.002
Tokenmy
Token ID1820
Feature activation+9.66e-01

Pos logit contributions
 lifetime+0.023
uc+0.022
 injuries+0.021
bek+0.021
ounced+0.020
Neg logit contributions
 She-0.020
 Everyday-0.015
 ornament-0.015
 means-0.015
otion-0.015
Token cater
Token ID20825
Feature activation+1.53e+00

Pos logit contributions
 She+0.039
 ornament+0.029
 Everyday+0.029
 means+0.028
otion+0.027
Neg logit contributions
 lifetime-0.045
 injuries-0.042
uc-0.041
bek-0.040
uddled-0.038
Tokenpill
Token ID27215
Feature activation+5.63e+00

Pos logit contributions
 She+0.061
 means+0.045
 Everyday+0.043
 ornament+0.043
otion+0.040
Neg logit contributions
 lifetime-0.074
 injuries-0.070
uc-0.070
bek-0.067
ounced-0.065
Tokenars
Token ID945
Feature activation-2.78e+00

Pos logit contributions
 She+0.337
 means+0.276
 ornament+0.270
 Everyday+0.268
otion+0.256
Neg logit contributions
 lifetime-0.167
 injuries-0.156
uc-0.144
bek-0.139
har-0.132
Token!
Token ID0
Feature activation-8.66e-01

Pos logit contributions
 lifetime+0.130
 injuries+0.123
uc+0.119
bek+0.116
har+0.112
Neg logit contributions
 She-0.116
 means-0.086
 ornament-0.085
 Everyday-0.084
otion-0.078
Token''
Token ID7061
Feature activation+8.49e-02

Pos logit contributions
 lifetime+0.043
bek+0.040
 injuries+0.038
har+0.038
uc+0.038
Neg logit contributions
 She-0.029
 ornament-0.025
 means-0.024
otion-0.023
 Everyday-0.021
Token
Token ID198
Feature activation-5.45e-02

Pos logit contributions
 She+0.003
 means+0.002
 ornament+0.002
 Everyday+0.002
otion+0.002
Neg logit contributions
 lifetime-0.005
 injuries-0.005
uc-0.004
bek-0.004
har-0.004
Token
Token ID198
Feature activation-7.76e-02

Pos logit contributions
 lifetime+0.003
 injuries+0.003
uc+0.003
bek+0.003
ounced+0.003
Neg logit contributions
 She-0.001
 Everyday-0.001
 means-0.001
 ornament-0.001
-0.001
Token
Token ID198
Feature activation-1.08e-01

Pos logit contributions
 lifetime+0.003
 injuries+0.003
uc+0.002
bek+0.002
har+0.002
Neg logit contributions
 She-0.002
 means-0.001
 ornament-0.001
 Everyday-0.001
otion-0.001
TokenThey
Token ID2990
Feature activation-3.04e-01

Pos logit contributions
 lifetime+0.005
 injuries+0.005
uc+0.004
bek+0.004
har+0.004
Neg logit contributions
 She-0.005
 means-0.004
 ornament-0.004
 Everyday-0.004
otion-0.003
Token saw
Token ID2497
Feature activation-1.27e+00

Pos logit contributions
 lifetime+0.013
 injuries+0.013
bek+0.012
uc+0.012
har+0.012
Neg logit contributions
 She-0.013
 means-0.010
 ornament-0.010
otion-0.009
 Everyday-0.009
Token the
Token ID262
Feature activation+1.68e-01

Pos logit contributions
 injuries+0.059
 lifetime+0.056
uc+0.053
uddled+0.052
ounced+0.050
Neg logit contributions
 She-0.051
 means-0.037
 ornament-0.036
 Everyday-0.036
otion-0.034
Token wood
Token ID4898
Feature activation+3.41e-01

Pos logit contributions
 She+0.007
 means+0.005
 Everyday+0.005
otion+0.005
+0.004
Neg logit contributions
 lifetime-0.008
 injuries-0.008
uc-0.007
har-0.007
uddled-0.007
Tokencut
Token ID8968
Feature activation+5.44e+00

Pos logit contributions
 She+0.014
 Everyday+0.010
 means+0.009
 It+0.009
 Add+0.009
Neg logit contributions
 injuries-0.015
 lifetime-0.015
uc-0.013
uddled-0.013
eds-0.013
Tokenter
Token ID353
Feature activation+5.77e-01

Pos logit contributions
 She+0.261
 Everyday+0.200
 means+0.189
 ornament+0.189
 It+0.180
Neg logit contributions
 lifetime-0.218
 injuries-0.215
uc-0.205
bek-0.195
uddled-0.193
Token coming
Token ID2406
Feature activation+3.33e-01

Pos logit contributions
 She+0.025
 means+0.018
 Everyday+0.018
 ornament+0.017
otion+0.016
Neg logit contributions
 lifetime-0.027
 injuries-0.026
uc-0.024
ounced-0.023
uddled-0.023
Token to
Token ID284
Feature activation-1.01e-01

Pos logit contributions
 She+0.012
 means+0.009
 Everyday+0.009
 ornament+0.009
otion+0.008
Neg logit contributions
 lifetime-0.017
 injuries-0.016
har-0.015
uc-0.015
bek-0.015
Token the
Token ID262
Feature activation+1.82e-01

Pos logit contributions
 lifetime+0.005
 injuries+0.005
uc+0.005
bek+0.005
har+0.005
Neg logit contributions
 She-0.004
 means-0.003
 Everyday-0.003
 ornament-0.003
otion-0.002
Token woods
Token ID16479
Feature activation+2.42e-01

Pos logit contributions
 She+0.007
 means+0.005
 Everyday+0.005
otion+0.005
 ornament+0.005
Neg logit contributions
 lifetime-0.009
 injuries-0.009
bek-0.008
har-0.008
uc-0.008
Token his
Token ID465
Feature activation-5.34e-01

Pos logit contributions
 She+0.040
 means+0.028
 ornament+0.027
 Everyday+0.026
otion+0.024
Neg logit contributions
 lifetime-0.061
 injuries-0.059
uc-0.056
bek-0.055
har-0.054
Token friends
Token ID2460
Feature activation-9.10e-02

Pos logit contributions
 lifetime+0.025
 injuries+0.024
uc+0.023
bek+0.023
uddled+0.022
Neg logit contributions
 She-0.022
 means-0.017
 ornament-0.017
 Everyday-0.016
-0.015
Token,
Token ID11
Feature activation+1.52e-01

Pos logit contributions
 lifetime+0.005
 injuries+0.005
uc+0.005
bek+0.004
har+0.004
Neg logit contributions
 She-0.003
 means-0.002
 ornament-0.002
 Everyday-0.002
otion-0.002
Token the
Token ID262
Feature activation+4.28e-01

Pos logit contributions
 She+0.006
 means+0.005
 ornament+0.005
 Everyday+0.005
otion+0.004
Neg logit contributions
 lifetime-0.007
 injuries-0.007
uc-0.006
bek-0.006
har-0.006
Token wood
Token ID4898
Feature activation+2.99e-01

Pos logit contributions
 She+0.017
 means+0.012
 Everyday+0.011
otion+0.011
 ornament+0.011
Neg logit contributions
 lifetime-0.022
 injuries-0.021
uc-0.020
bek-0.019
har-0.019
Tokencut
Token ID8968
Feature activation+5.28e+00

Pos logit contributions
 She+0.013
 means+0.009
 Everyday+0.009
otion+0.009
 ornament+0.009
Neg logit contributions
 lifetime-0.014
 injuries-0.013
uc-0.012
uddled-0.012
bek-0.012
Tokenter
Token ID353
Feature activation+5.47e-01

Pos logit contributions
 She+0.241
 Everyday+0.180
 means+0.176
 ornament+0.175
otion+0.165
Neg logit contributions
 lifetime-0.227
 injuries-0.217
uc-0.210
bek-0.203
uddled-0.196
Token was
Token ID373
Feature activation-8.63e-02

Pos logit contributions
 She+0.025
 means+0.019
 ornament+0.019
 Everyday+0.018
otion+0.018
Neg logit contributions
 lifetime-0.024
 injuries-0.022
uc-0.022
bek-0.021
ounced-0.021
Token able
Token ID1498
Feature activation-1.29e+00

Pos logit contributions
 lifetime+0.003
 injuries+0.003
uc+0.003
bek+0.003
har+0.003
Neg logit contributions
 She-0.004
 means-0.003
 ornament-0.003
 Everyday-0.003
otion-0.003
Token to
Token ID284
Feature activation-2.75e-01

Pos logit contributions
 lifetime+0.064
 injuries+0.060
uc+0.058
bek+0.058
har+0.056
Neg logit contributions
 She-0.052
 means-0.037
 ornament-0.036
 Everyday-0.036
 she-0.033
Token part
Token ID636
Feature activation-1.89e+00

Pos logit contributions
 injuries+0.011
 lifetime+0.011
har+0.010
uc+0.009
bek+0.009
Neg logit contributions
 She-0.013
 means-0.010
otion-0.010
 Everyday-0.009
 It-0.009
Token that
Token ID326
Feature activation-9.66e-01

Pos logit contributions
 lifetime+0.035
 injuries+0.034
uc+0.033
bek+0.032
har+0.031
Neg logit contributions
 She-0.021
 means-0.014
 ornament-0.013
 Everyday-0.013
 It-0.011
Token the
Token ID262
Feature activation+6.20e-02

Pos logit contributions
 lifetime+0.046
 injuries+0.043
bek+0.038
har+0.036
uc+0.036
Neg logit contributions
 She-0.039
otion-0.030
 means-0.030
 Everyday-0.028
 ornament-0.028
Token kitten
Token ID42143
Feature activation-4.38e-01

Pos logit contributions
 She+0.002
 means+0.002
otion+0.002
 Everyday+0.002
 It+0.001
Neg logit contributions
 lifetime-0.003
 injuries-0.003
uc-0.003
har-0.003
bek-0.003
Token was
Token ID373
Feature activation-3.11e-01

Pos logit contributions
 lifetime+0.020
 injuries+0.019
uc+0.018
bek+0.018
har+0.018
Neg logit contributions
 She-0.019
 means-0.014
 ornament-0.014
 Everyday-0.013
otion-0.013
Token un
Token ID555
Feature activation-1.56e-01

Pos logit contributions
 lifetime+0.014
 injuries+0.013
uc+0.012
bek+0.012
har+0.012
Neg logit contributions
 She-0.014
 means-0.011
 ornament-0.010
 Everyday-0.010
otion-0.010
Tokenhar
Token ID9869
Feature activation-5.85e+00

Pos logit contributions
 lifetime+0.006
 injuries+0.006
uc+0.005
bek+0.005
uddled+0.005
Neg logit contributions
 She-0.008
 means-0.006
 ornament-0.006
 Everyday-0.006
-0.006
Tokenmed
Token ID1150
Feature activation-8.41e-01

Pos logit contributions
 lifetime+0.225
 injuries+0.209
bek+0.200
uc+0.185
har+0.182
Neg logit contributions
 She-0.290
 means-0.237
 ornament-0.226
-0.222
 Everyday-0.217
Token!
Token ID0
Feature activation-1.04e+00

Pos logit contributions
 lifetime+0.040
 injuries+0.039
uc+0.037
bek+0.036
har+0.035
Neg logit contributions
 She-0.034
 means-0.025
 ornament-0.025
 Everyday-0.024
otion-0.023
Token The
Token ID383
Feature activation-1.25e+00

Pos logit contributions
 lifetime+0.059
 injuries+0.054
bek+0.053
har+0.052
uc+0.052
Neg logit contributions
 She-0.029
 ornament-0.023
 means-0.023
otion-0.021
 Everyday-0.019
Token two
Token ID734
Feature activation+3.43e-01

Pos logit contributions
 lifetime+0.067
 injuries+0.065
uc+0.058
har+0.058
bek+0.057
Neg logit contributions
 She-0.046
 means-0.033
otion-0.031
 Everyday-0.028
 It-0.028
Token hugged
Token ID41130
Feature activation-2.15e+00

Pos logit contributions
 She+0.014
 means+0.011
 ornament+0.010
 Everyday+0.010
otion+0.010
Neg logit contributions
 lifetime-0.016
 injuries-0.016
uc-0.015
uddled-0.014
ounced-0.014
Token that
Token ID326
Feature activation-9.66e-01

Pos logit contributions
 lifetime+0.035
 injuries+0.034
uc+0.033
bek+0.032
har+0.031
Neg logit contributions
 She-0.021
 means-0.014
 ornament-0.013
 Everyday-0.013
 It-0.011
Token the
Token ID262
Feature activation+6.20e-02

Pos logit contributions
 lifetime+0.046
 injuries+0.043
bek+0.038
har+0.036
uc+0.036
Neg logit contributions
 She-0.039
otion-0.030
 means-0.030
 Everyday-0.028
 ornament-0.028
Token kitten
Token ID42143
Feature activation-4.38e-01

Pos logit contributions
 She+0.002
 means+0.002
otion+0.002
 Everyday+0.002
 It+0.001
Neg logit contributions
 lifetime-0.003
 injuries-0.003
uc-0.003
har-0.003
bek-0.003
Token was
Token ID373
Feature activation-3.11e-01

Pos logit contributions
 lifetime+0.020
 injuries+0.019
uc+0.018
bek+0.018
har+0.018
Neg logit contributions
 She-0.019
 means-0.014
 ornament-0.014
 Everyday-0.013
otion-0.013
Token un
Token ID555
Feature activation-1.56e-01

Pos logit contributions
 lifetime+0.014
 injuries+0.013
uc+0.012
bek+0.012
har+0.012
Neg logit contributions
 She-0.014
 means-0.011
 ornament-0.010
 Everyday-0.010
otion-0.010
Tokenhar
Token ID9869
Feature activation-5.85e+00

Pos logit contributions
 lifetime+0.006
 injuries+0.006
uc+0.005
bek+0.005
uddled+0.005
Neg logit contributions
 She-0.008
 means-0.006
 ornament-0.006
 Everyday-0.006
-0.006
Tokenmed
Token ID1150
Feature activation-8.41e-01

Pos logit contributions
 lifetime+0.225
 injuries+0.209
bek+0.200
uc+0.185
har+0.182
Neg logit contributions
 She-0.290
 means-0.237
 ornament-0.226
-0.222
 Everyday-0.217
Token!
Token ID0
Feature activation-1.04e+00

Pos logit contributions
 lifetime+0.040
 injuries+0.039
uc+0.037
bek+0.036
har+0.035
Neg logit contributions
 She-0.034
 means-0.025
 ornament-0.025
 Everyday-0.024
otion-0.023
Token The
Token ID383
Feature activation-1.25e+00

Pos logit contributions
 lifetime+0.059
 injuries+0.054
bek+0.053
har+0.052
uc+0.052
Neg logit contributions
 She-0.029
 ornament-0.023
 means-0.023
otion-0.021
 Everyday-0.019
Token two
Token ID734
Feature activation+3.43e-01

Pos logit contributions
 lifetime+0.067
 injuries+0.065
uc+0.058
har+0.058
bek+0.057
Neg logit contributions
 She-0.046
 means-0.033
otion-0.031
 Everyday-0.028
 It-0.028
Token hugged
Token ID41130
Feature activation-2.15e+00

Pos logit contributions
 She+0.014
 means+0.011
 ornament+0.010
 Everyday+0.010
otion+0.010
Neg logit contributions
 lifetime-0.016
 injuries-0.016
uc-0.015
uddled-0.014
ounced-0.014
Token tree
Token ID5509
Feature activation+8.61e-01

Pos logit contributions
 She+0.003
 means+0.002
otion+0.002
 Everyday+0.002
 ornament+0.002
Neg logit contributions
 lifetime-0.007
 injuries-0.006
bek-0.006
uc-0.006
har-0.006
Token,
Token ID11
Feature activation+2.09e-01

Pos logit contributions
 She+0.029
 means+0.023
otion+0.023
 ornament+0.023
 Everyday+0.021
Neg logit contributions
 lifetime-0.044
 injuries-0.041
har-0.038
uc-0.037
bek-0.037
Token he
Token ID339
Feature activation-4.53e-01

Pos logit contributions
 She+0.008
 means+0.006
 Everyday+0.006
 ornament+0.006
otion+0.005
Neg logit contributions
 lifetime-0.010
 injuries-0.010
bek-0.009
uc-0.009
har-0.009
Token was
Token ID373
Feature activation-5.82e-02

Pos logit contributions
 lifetime+0.020
 injuries+0.019
uc+0.018
bek+0.018
har+0.017
Neg logit contributions
 She-0.020
 means-0.016
 ornament-0.015
 Everyday-0.015
-0.014
Token un
Token ID555
Feature activation+2.57e-01

Pos logit contributions
 lifetime+0.003
 injuries+0.003
har+0.003
uc+0.003
ounced+0.003
Neg logit contributions
 She-0.002
 means-0.002
 ornament-0.002
 Everyday-0.002
otion-0.002
Tokenhar
Token ID9869
Feature activation-5.74e+00

Pos logit contributions
 She+0.018
 ornament+0.015
 means+0.014
 Everyday+0.014
+0.014
Neg logit contributions
 lifetime-0.005
 injuries-0.005
uc-0.004
bek-0.004
uddled-0.004
Tokenmed
Token ID1150
Feature activation-5.36e-01

Pos logit contributions
 lifetime+0.245
 injuries+0.219
bek+0.217
edy+0.209
cle+0.200
Neg logit contributions
 She-0.245
 means-0.202
-0.200
 ornament-0.196
bird-0.184
Token,
Token ID11
Feature activation+2.45e-01

Pos logit contributions
 lifetime+0.028
 injuries+0.027
uc+0.026
bek+0.025
har+0.025
Neg logit contributions
 She-0.019
 means-0.014
 ornament-0.013
 Everyday-0.013
otion-0.012
Token but
Token ID475
Feature activation-1.23e+00

Pos logit contributions
 She+0.010
 means+0.007
 ornament+0.007
 Everyday+0.007
otion+0.007
Neg logit contributions
 lifetime-0.012
 injuries-0.012
uc-0.011
bek-0.011
har-0.011
Token he
Token ID339
Feature activation-4.51e-01

Pos logit contributions
 lifetime+0.061
 injuries+0.058
bek+0.054
har+0.053
uc+0.053
Neg logit contributions
 She-0.049
 ornament-0.035
 means-0.035
 Everyday-0.034
otion-0.032
Token felt
Token ID2936
Feature activation-7.04e-02

Pos logit contributions
 lifetime+0.020
 injuries+0.019
uc+0.018
bek+0.018
har+0.017
Neg logit contributions
 She-0.020
 means-0.016
 ornament-0.015
 Everyday-0.015
-0.014
Token still
Token ID991
Feature activation-1.01e+00

Pos logit contributions
 lifetime+0.010
 injuries+0.009
har+0.008
uc+0.008
bek+0.008
Neg logit contributions
 She-0.010
 means-0.008
otion-0.008
 ornament-0.007
 Everyday-0.007
Token be
Token ID307
Feature activation-1.26e+00

Pos logit contributions
 lifetime+0.049
 injuries+0.047
har+0.042
uc+0.042
bek+0.041
Neg logit contributions
 She-0.039
 means-0.030
otion-0.028
 Everyday-0.028
 ornament-0.027
Token healthy
Token ID5448
Feature activation-8.13e-01

Pos logit contributions
 lifetime+0.062
 injuries+0.059
bek+0.058
uc+0.056
uddled+0.055
Neg logit contributions
 She-0.050
 means-0.036
 ornament-0.036
 Everyday-0.034
 she-0.032
Token and
Token ID290
Feature activation-9.80e-02

Pos logit contributions
 lifetime+0.037
 injuries+0.036
uc+0.032
bek+0.030
har+0.030
Neg logit contributions
 She-0.034
otion-0.026
 Everyday-0.026
 means-0.025
 ornament-0.025
Token un
Token ID555
Feature activation+3.31e-01

Pos logit contributions
 lifetime+0.004
 injuries+0.004
uc+0.004
bek+0.004
har+0.004
Neg logit contributions
 She-0.004
 means-0.003
 Everyday-0.003
 ornament-0.003
otion-0.003
Tokenhar
Token ID9869
Feature activation-5.58e+00

Pos logit contributions
 She+0.019
 means+0.016
 ornament+0.016
 Everyday+0.015
+0.014
Neg logit contributions
 lifetime-0.010
 injuries-0.010
uc-0.008
bek-0.008
uddled-0.008
Tokenmed
Token ID1150
Feature activation-4.69e-01

Pos logit contributions
 lifetime+0.205
 injuries+0.195
bek+0.178
uc+0.169
har+0.167
Neg logit contributions
 She-0.291
 means-0.239
 ornament-0.231
 Everyday-0.221
-0.214
Token.
Token ID13
Feature activation-1.81e-02

Pos logit contributions
 lifetime+0.021
 injuries+0.020
uc+0.019
bek+0.018
har+0.018
Neg logit contributions
 She-0.020
 means-0.015
 Everyday-0.015
 ornament-0.015
otion-0.014
Token Suddenly
Token ID24975
Feature activation-1.57e-01

Pos logit contributions
 lifetime+0.001
 injuries+0.001
uc+0.001
bek+0.001
har+0.001
Neg logit contributions
 She-0.001
 means-0.000
 ornament-0.000
 Everyday-0.000
otion-0.000
Token,
Token ID11
Feature activation+3.16e-01

Pos logit contributions
 lifetime+0.009
 injuries+0.009
uc+0.008
bek+0.008
ounced+0.008
Neg logit contributions
 She-0.005
 ornament-0.003
 Everyday-0.003
 means-0.003
igator-0.003
Token her
Token ID607
Feature activation+3.28e-01

Pos logit contributions
 She+0.011
 means+0.008
 ornament+0.008
 Everyday+0.008
otion+0.007
Neg logit contributions
 lifetime-0.016
 injuries-0.016
uc-0.015
bek-0.015
har-0.015
Token managed
Token ID5257
Feature activation-1.43e+00

Pos logit contributions
 lifetime+0.010
 injuries+0.010
uc+0.010
bek+0.009
har+0.009
Neg logit contributions
 She-0.009
 means-0.007
-0.007
 Everyday-0.006
 ornament-0.006
Token to
Token ID284
Feature activation-2.24e-01

Pos logit contributions
 lifetime+0.072
 injuries+0.067
uc+0.066
bek+0.065
har+0.063
Neg logit contributions
 She-0.058
 means-0.042
 Everyday-0.039
 ornament-0.039
 she-0.037
Token get
Token ID651
Feature activation+8.95e-01

Pos logit contributions
 lifetime+0.011
 injuries+0.011
uc+0.010
bek+0.010
har+0.010
Neg logit contributions
 She-0.009
 means-0.007
 ornament-0.006
 Everyday-0.006
otion-0.006
Token away
Token ID1497
Feature activation-1.10e+00

Pos logit contributions
 She+0.033
 means+0.023
 ornament+0.022
 Everyday+0.021
+0.019
Neg logit contributions
 lifetime-0.048
 injuries-0.046
uc-0.044
bek-0.044
uddled-0.042
Token un
Token ID555
Feature activation+2.88e-01

Pos logit contributions
 lifetime+0.065
 injuries+0.064
har+0.059
uc+0.059
bek+0.059
Neg logit contributions
 She-0.031
 means-0.021
 ornament-0.020
 Everyday-0.020
otion-0.018
Tokenhar
Token ID9869
Feature activation-5.56e+00

Pos logit contributions
 She+0.018
 means+0.015
 ornament+0.015
 Everyday+0.015
+0.014
Neg logit contributions
 lifetime-0.008
 injuries-0.007
uc-0.006
bek-0.006
uddled-0.006
Tokenmed
Token ID1150
Feature activation-4.41e-01

Pos logit contributions
 lifetime+0.220
 injuries+0.203
bek+0.193
uc+0.178
edy+0.176
Neg logit contributions
 She-0.273
 means-0.224
 ornament-0.211
-0.210
 Everyday-0.201
Token.
Token ID13
Feature activation+2.14e-02

Pos logit contributions
 lifetime+0.021
 injuries+0.020
uc+0.019
bek+0.019
har+0.019
Neg logit contributions
 She-0.018
 means-0.013
 ornament-0.013
 Everyday-0.013
otion-0.012
Token From
Token ID3574
Feature activation+7.19e-01

Pos logit contributions
 She+0.001
 means+0.000
 ornament+0.000
 Everyday+0.000
otion+0.000
Neg logit contributions
 lifetime-0.001
 injuries-0.001
uc-0.001
bek-0.001
har-0.001
Token that
Token ID326
Feature activation-5.18e-01

Pos logit contributions
 She+0.028
 means+0.021
 Everyday+0.020
 ornament+0.020
 she+0.018
Neg logit contributions
 lifetime-0.037
 injuries-0.035
uc-0.034
bek-0.033
har-0.032
Token day
Token ID1110
Feature activation-1.32e+00

Pos logit contributions
 lifetime+0.021
bek+0.016
 injuries+0.015
cle+0.014
har+0.014
Neg logit contributions
 She-0.025
otion-0.020
 means-0.020
 ornament-0.020
 herself-0.018
Token around
Token ID1088
Feature activation+3.77e-01

Pos logit contributions
 lifetime+0.015
 injuries+0.014
uc+0.013
bek+0.013
har+0.013
Neg logit contributions
 She-0.015
 means-0.012
 ornament-0.011
 Everyday-0.011
otion-0.011
Token in
Token ID287
Feature activation+1.32e-01

Pos logit contributions
 She+0.013
 means+0.009
 ornament+0.009
 Everyday+0.009
otion+0.008
Neg logit contributions
 lifetime-0.020
 injuries-0.020
uc-0.018
bek-0.018
har-0.018
Token z
Token ID1976
Feature activation+7.74e-01

Pos logit contributions
 She+0.004
 means+0.003
 ornament+0.003
 Everyday+0.003
otion+0.003
Neg logit contributions
 lifetime-0.007
 injuries-0.007
uc-0.007
bek-0.007
har-0.006
Tokenig
Token ID328
Feature activation-1.71e+00

Pos logit contributions
 She+0.038
otion+0.029
 means+0.028
 Everyday+0.028
 she+0.027
Neg logit contributions
 lifetime-0.032
 injuries-0.027
bek-0.027
har-0.025
cle-0.024
Tokenz
Token ID89
Feature activation+8.98e-01

Pos logit contributions
 lifetime+0.052
 injuries+0.049
uc+0.044
bek+0.044
har+0.041
Neg logit contributions
 She-0.101
 means-0.082
 ornament-0.080
 Everyday-0.079
otion-0.076
Tokenags
Token ID3775
Feature activation-5.31e+00

Pos logit contributions
 She+0.045
 Everyday+0.036
 her+0.035
 idea+0.034
 means+0.034
Neg logit contributions
 lifetime-0.038
bek-0.034
har-0.032
 injuries-0.031
cle-0.030
Token.
Token ID13
Feature activation-6.93e-02

Pos logit contributions
 lifetime+0.298
 injuries+0.285
uc+0.275
bek+0.270
har+0.264
Neg logit contributions
 She-0.177
 means-0.121
 ornament-0.115
 Everyday-0.114
otion-0.102
Token
Token ID198
Feature activation-1.22e-01

Pos logit contributions
 lifetime+0.004
 injuries+0.003
har+0.003
uc+0.003
ounced+0.003
Neg logit contributions
 She-0.002
 ornament-0.002
 means-0.002
otion-0.001
 Everyday-0.001
Token
Token ID198
Feature activation-1.27e-01

Pos logit contributions
 lifetime+0.007
 injuries+0.007
uc+0.007
bek+0.006
har+0.006
Neg logit contributions
 She-0.004
 means-0.003
 ornament-0.002
 Everyday-0.002
otion-0.002
TokenWhen
Token ID2215
Feature activation+5.14e-01

Pos logit contributions
 lifetime+0.005
 injuries+0.005
uc+0.005
bek+0.004
har+0.004
Neg logit contributions
 She-0.006
 means-0.005
 ornament-0.005
 Everyday-0.005
otion-0.005
Token Zig
Token ID24992
Feature activation-1.92e+00

Pos logit contributions
 She+0.020
 means+0.014
 Everyday+0.013
 ornament+0.013
 she+0.012
Neg logit contributions
 lifetime-0.027
 injuries-0.026
uc-0.025
bek-0.024
har-0.024
Token sure
Token ID1654
Feature activation-5.17e-01

Pos logit contributions
 She+0.021
otion+0.016
 Everyday+0.016
 means+0.015
 ornament+0.015
Neg logit contributions
 lifetime-0.023
 injuries-0.022
uc-0.021
har-0.020
eds-0.019
Token his
Token ID465
Feature activation-5.41e-01

Pos logit contributions
 lifetime+0.024
 injuries+0.023
uc+0.021
bek+0.021
har+0.020
Neg logit contributions
 She-0.021
 Everyday-0.016
 means-0.015
 ornament-0.015
otion-0.015
Token z
Token ID1976
Feature activation+5.48e-01

Pos logit contributions
 lifetime+0.023
 injuries+0.022
uc+0.021
bek+0.021
har+0.020
Neg logit contributions
 She-0.025
 means-0.019
 Everyday-0.019
 ornament-0.018
otion-0.018
Tokenig
Token ID328
Feature activation-1.79e+00

Pos logit contributions
 She+0.027
 ornament+0.023
 means+0.022
 Everyday+0.021
otion+0.020
Neg logit contributions
 lifetime-0.021
 injuries-0.018
bek-0.018
uddled-0.016
ounced-0.016
Tokenz
Token ID89
Feature activation+7.61e-01

Pos logit contributions
 lifetime+0.076
uc+0.071
 injuries+0.068
uddled+0.067
har+0.066
Neg logit contributions
 She-0.080
 Everyday-0.061
otion-0.060
 means-0.059
 ornament-0.059
Tokenags
Token ID3775
Feature activation-5.31e+00

Pos logit contributions
 She+0.035
 means+0.027
 ornament+0.027
 Everyday+0.026
otion+0.024
Neg logit contributions
 lifetime-0.034
 injuries-0.031
bek-0.030
uc-0.029
har-0.029
Token were
Token ID547
Feature activation+2.31e-01

Pos logit contributions
 lifetime+0.259
 injuries+0.242
uc+0.235
bek+0.233
har+0.229
Neg logit contributions
 She-0.208
 means-0.151
 Everyday-0.148
 ornament-0.146
otion-0.143
Token neat
Token ID15049
Feature activation+3.06e-02

Pos logit contributions
 She+0.009
 means+0.007
 ornament+0.007
 Everyday+0.007
otion+0.006
Neg logit contributions
 lifetime-0.011
 injuries-0.011
uc-0.010
bek-0.010
har-0.010
Token and
Token ID290
Feature activation-9.52e-02

Pos logit contributions
 She+0.001
otion+0.001
 Everyday+0.001
 means+0.001
 ornament+0.001
Neg logit contributions
 lifetime-0.001
 injuries-0.001
uc-0.001
har-0.001
uddled-0.001
Token tidy
Token ID43044
Feature activation-3.21e-01

Pos logit contributions
 lifetime+0.004
 injuries+0.004
uc+0.004
bek+0.004
har+0.004
Neg logit contributions
 She-0.004
 means-0.003
 ornament-0.003
 Everyday-0.003
otion-0.003
Token.
Token ID13
Feature activation+5.03e-02

Pos logit contributions
 lifetime+0.014
 injuries+0.013
uc+0.012
bek+0.012
har+0.012
Neg logit contributions
 She-0.014
otion-0.011
 means-0.011
 Everyday-0.010
 ornament-0.010
Token enjoyed
Token ID8359
Feature activation-7.01e-01

Pos logit contributions
 She+0.022
 means+0.017
 ornament+0.016
 Everyday+0.016
otion+0.015
Neg logit contributions
 lifetime-0.022
 injuries-0.021
uc-0.020
bek-0.019
har-0.019
Token writing
Token ID3597
Feature activation+1.28e+00

Pos logit contributions
 lifetime+0.030
 injuries+0.028
uc+0.027
bek+0.026
har+0.025
Neg logit contributions
 She-0.033
 means-0.026
 ornament-0.025
 Everyday-0.025
otion-0.023
Token z
Token ID1976
Feature activation+6.93e-01

Pos logit contributions
 She+0.045
 means+0.032
 ornament+0.030
 Everyday+0.030
 It+0.027
Neg logit contributions
 lifetime-0.070
 injuries-0.067
bek-0.064
uc-0.064
har-0.063
Tokenig
Token ID328
Feature activation-1.71e+00

Pos logit contributions
 She+0.032
 means+0.025
 Everyday+0.025
otion+0.025
 ornament+0.024
Neg logit contributions
 lifetime-0.027
bek-0.024
 injuries-0.024
har-0.021
cle-0.021
Tokenz
Token ID89
Feature activation+7.51e-01

Pos logit contributions
 lifetime+0.070
 injuries+0.065
uc+0.063
bek+0.061
har+0.059
Neg logit contributions
 She-0.083
 means-0.065
 ornament-0.063
 Everyday-0.062
otion-0.059
Tokenags
Token ID3775
Feature activation-5.30e+00

Pos logit contributions
 She+0.037
 her+0.031
 Everyday+0.031
 idea+0.030
 means+0.029
Neg logit contributions
 lifetime-0.031
bek-0.028
har-0.026
 injuries-0.026
ounced-0.024
Token.
Token ID13
Feature activation+2.39e-01

Pos logit contributions
 lifetime+0.309
 injuries+0.299
uc+0.286
bek+0.284
har+0.277
Neg logit contributions
 She-0.169
 means-0.111
 ornament-0.104
 Everyday-0.103
 It-0.089
Token He
Token ID679
Feature activation+4.71e-01

Pos logit contributions
 She+0.005
 ornament+0.004
 means+0.004
otion+0.003
igator+0.003
Neg logit contributions
 lifetime-0.014
 injuries-0.014
bek-0.013
har-0.013
uc-0.013
Token wrote
Token ID2630
Feature activation+3.31e-01

Pos logit contributions
 She+0.022
 means+0.017
 ornament+0.016
 Everyday+0.016
otion+0.015
Neg logit contributions
 lifetime-0.020
 injuries-0.019
uc-0.018
bek-0.018
har-0.017
Token z
Token ID1976
Feature activation+6.63e-01

Pos logit contributions
 She+0.011
 means+0.008
 Everyday+0.007
 ornament+0.007
 It+0.006
Neg logit contributions
 lifetime-0.019
 injuries-0.018
bek-0.017
uc-0.017
har-0.017
Tokenig
Token ID328
Feature activation-1.72e+00

Pos logit contributions
 She+0.029
 means+0.024
 Everyday+0.022
otion+0.022
 It+0.022
Neg logit contributions
 lifetime-0.029
 injuries-0.026
bek-0.024
cle-0.023
har-0.022
Token He
Token ID679
Feature activation+4.71e-01

Pos logit contributions
 She+0.005
 ornament+0.004
 means+0.004
otion+0.003
igator+0.003
Neg logit contributions
 lifetime-0.014
 injuries-0.014
bek-0.013
har-0.013
uc-0.013
Token wrote
Token ID2630
Feature activation+3.31e-01

Pos logit contributions
 She+0.022
 means+0.017
 ornament+0.016
 Everyday+0.016
otion+0.015
Neg logit contributions
 lifetime-0.020
 injuries-0.019
uc-0.018
bek-0.018
har-0.017
Token z
Token ID1976
Feature activation+6.63e-01

Pos logit contributions
 She+0.011
 means+0.008
 Everyday+0.007
 ornament+0.007
 It+0.006
Neg logit contributions
 lifetime-0.019
 injuries-0.018
bek-0.017
uc-0.017
har-0.017
Tokenig
Token ID328
Feature activation-1.72e+00

Pos logit contributions
 She+0.029
 means+0.024
 Everyday+0.022
otion+0.022
 It+0.022
Neg logit contributions
 lifetime-0.029
 injuries-0.026
bek-0.024
cle-0.023
har-0.022
Tokenz
Token ID89
Feature activation+7.43e-01

Pos logit contributions
 lifetime+0.069
 injuries+0.064
uc+0.062
bek+0.060
uddled+0.058
Neg logit contributions
 She-0.084
 means-0.065
 ornament-0.064
 Everyday-0.063
otion-0.060
Tokenags
Token ID3775
Feature activation-5.30e+00

Pos logit contributions
 She+0.036
 means+0.029
 Everyday+0.029
 her+0.029
 idea+0.028
Neg logit contributions
 lifetime-0.033
bek-0.029
 injuries-0.028
har-0.028
cle-0.026
Token on
Token ID319
Feature activation+3.09e-01

Pos logit contributions
 lifetime+0.309
 injuries+0.302
uc+0.287
bek+0.283
har+0.277
Neg logit contributions
 She-0.169
 means-0.112
 ornament-0.104
 Everyday-0.102
 It-0.091
Token walls
Token ID7714
Feature activation+1.73e+00

Pos logit contributions
 She+0.010
 means+0.007
 ornament+0.007
 Everyday+0.007
otion+0.006
Neg logit contributions
 lifetime-0.017
 injuries-0.017
uc-0.016
bek-0.016
har-0.015
Token,
Token ID11
Feature activation+3.87e-01

Pos logit contributions
 She+0.060
 means+0.045
 ornament+0.042
otion+0.041
 Everyday+0.040
Neg logit contributions
 lifetime-0.089
 injuries-0.084
uc-0.082
oy-0.078
bek-0.078
Token floors
Token ID18570
Feature activation+1.06e+00

Pos logit contributions
 She+0.014
 means+0.011
otion+0.010
 ornament+0.010
 Everyday+0.010
Neg logit contributions
 lifetime-0.019
 injuries-0.018
uc-0.017
har-0.016
bek-0.016
Token,
Token ID11
Feature activation+3.82e-01

Pos logit contributions
 She+0.033
 means+0.022
 ornament+0.021
 Everyday+0.021
otion+0.018
Neg logit contributions
 lifetime-0.061
 injuries-0.059
uc-0.057
bek-0.055
uddled-0.054
Token lines
Token ID3951
Feature activation+6.95e-01

Pos logit contributions
 lifetime+0.055
 injuries+0.054
uc+0.051
bek+0.049
har+0.048
Neg logit contributions
 She-0.047
 means-0.034
 ornament-0.034
 Everyday-0.033
otion-0.032
Token to
Token ID284
Feature activation-1.44e-01

Pos logit contributions
 She+0.025
 ornament+0.020
 means+0.019
 Everyday+0.019
otion+0.018
Neg logit contributions
 lifetime-0.031
 injuries-0.030
uc-0.029
oy-0.028
uddled-0.028
Token z
Token ID1976
Feature activation+7.91e-01

Pos logit contributions
 lifetime+0.007
 injuries+0.006
uc+0.006
bek+0.006
har+0.006
Neg logit contributions
 She-0.006
 means-0.004
otion-0.004
 Everyday-0.004
 ornament-0.004
Tokenig
Token ID328
Feature activation-1.62e+00

Pos logit contributions
 She+0.041
 means+0.033
 ornament+0.032
 Everyday+0.031
otion+0.031
Neg logit contributions
 lifetime-0.031
 injuries-0.027
bek-0.026
har-0.024
ounced-0.024
Tokenz
Token ID89
Feature activation+7.78e-01

Pos logit contributions
uc+0.060
 lifetime+0.056
uddled+0.053
 injuries+0.053
eds+0.051
Neg logit contributions
 She-0.075
 Everyday-0.062
 ornament-0.062
 wonder-0.061
igator-0.059
Tokenags
Token ID3775
Feature activation-5.27e+00

Pos logit contributions
 She+0.038
 means+0.030
 Everyday+0.030
 ornament+0.029
otion+0.028
Neg logit contributions
 lifetime-0.033
bek-0.029
 injuries-0.029
har-0.028
ounced-0.027
Token.
Token ID13
Feature activation+1.19e-01

Pos logit contributions
 lifetime+0.270
 injuries+0.253
uc+0.250
bek+0.245
har+0.237
Neg logit contributions
 She-0.184
 Everyday-0.131
 means-0.128
 ornament-0.128
otion-0.123
Token
Token ID198
Feature activation-2.17e-01

Pos logit contributions
 She+0.004
 means+0.003
 ornament+0.003
otion+0.002
 Everyday+0.002
Neg logit contributions
 lifetime-0.007
 injuries-0.006
uc-0.006
bek-0.006
har-0.006
Token
Token ID198
Feature activation-2.48e-01

Pos logit contributions
 lifetime+0.013
 injuries+0.013
uc+0.012
bek+0.012
har+0.012
Neg logit contributions
 She-0.006
 means-0.004
 Everyday-0.004
 ornament-0.004
otion-0.003
TokenOne
Token ID3198
Feature activation-6.12e-01

Pos logit contributions
 lifetime+0.011
 injuries+0.011
uc+0.010
bek+0.010
har+0.010
Neg logit contributions
 She-0.011
 means-0.008
 ornament-0.008
 Everyday-0.008
otion-0.007
Token day
Token ID1110
Feature activation-1.57e+00

Pos logit contributions
 lifetime+0.033
 injuries+0.032
uc+0.030
bek+0.030
har+0.029
Neg logit contributions
 She-0.022
 means-0.015
 ornament-0.015
 Everyday-0.015
otion-0.013
Token so
Token ID523
Feature activation-2.43e-01

Pos logit contributions
 She+0.016
 means+0.011
 ornament+0.011
 Everyday+0.011
otion+0.010
Neg logit contributions
 lifetime-0.020
 injuries-0.019
uc-0.018
bek-0.018
har-0.018
Token many
Token ID867
Feature activation+5.70e-01

Pos logit contributions
 lifetime+0.012
 injuries+0.011
uc+0.010
bek+0.010
har+0.010
Neg logit contributions
 She-0.010
 means-0.007
 ornament-0.007
 Everyday-0.007
otion-0.007
Token z
Token ID1976
Feature activation+6.28e-01

Pos logit contributions
 She+0.026
 means+0.020
 ornament+0.019
 Everyday+0.019
otion+0.018
Neg logit contributions
 lifetime-0.025
 injuries-0.024
uc-0.023
bek-0.022
har-0.022
Tokenig
Token ID328
Feature activation-1.69e+00

Pos logit contributions
 She+0.033
 means+0.027
 ornament+0.027
otion+0.027
 Everyday+0.026
Neg logit contributions
 lifetime-0.022
 injuries-0.019
bek-0.018
uc-0.017
har-0.017
Tokenz
Token ID89
Feature activation+8.43e-01

Pos logit contributions
 lifetime+0.070
 injuries+0.064
uc+0.064
uddled+0.060
har+0.059
Neg logit contributions
 She-0.080
 Everyday-0.063
 means-0.061
 ornament-0.061
 It-0.056
Tokenags
Token ID3775
Feature activation-5.26e+00

Pos logit contributions
 She+0.041
 means+0.033
 ornament+0.032
 Everyday+0.032
otion+0.030
Neg logit contributions
 lifetime-0.035
 injuries-0.031
bek-0.030
uc-0.029
har-0.029
Token,"
Token ID553
Feature activation+7.67e-01

Pos logit contributions
 lifetime+0.244
 injuries+0.222
uc+0.222
bek+0.216
har+0.216
Neg logit contributions
 She-0.212
 means-0.155
 ornament-0.155
 Everyday-0.155
otion-0.149
Token he
Token ID339
Feature activation-2.32e-01

Pos logit contributions
 She+0.030
otion+0.026
 ornament+0.025
 Everyday+0.023
 means+0.023
Neg logit contributions
 lifetime-0.034
 injuries-0.031
uc-0.030
bek-0.030
cle-0.029
Token said
Token ID531
Feature activation-1.92e-01

Pos logit contributions
 lifetime+0.010
 injuries+0.009
uc+0.009
bek+0.009
ounced+0.008
Neg logit contributions
 She-0.011
 ornament-0.008
 means-0.008
 Everyday-0.008
otion-0.008
Token nervously
Token ID43324
Feature activation+4.38e-01

Pos logit contributions
 lifetime+0.008
 injuries+0.008
uc+0.008
har+0.007
bek+0.007
Neg logit contributions
 She-0.008
 means-0.007
 Everyday-0.006
 ornament-0.006
otion-0.006
Token.
Token ID13
Feature activation-2.07e-01

Pos logit contributions
 She+0.020
 ornament+0.017
 Everyday+0.016
otion+0.016
 means+0.016
Neg logit contributions
 injuries-0.016
 lifetime-0.016
uc-0.015
har-0.014
bek-0.014
Token Jack
Token ID3619
Feature activation-2.06e-02

Pos logit contributions
 She+0.012
 means+0.009
 ornament+0.008
 Everyday+0.008
otion+0.008
Neg logit contributions
 lifetime-0.015
 injuries-0.014
uc-0.013
bek-0.013
har-0.013
Token was
Token ID373
Feature activation+2.37e-02

Pos logit contributions
 lifetime+0.001
 injuries+0.001
uc+0.001
bek+0.001
uddled+0.001
Neg logit contributions
 She-0.001
 ornament-0.001
otion-0.001
 means-0.001
 Everyday-0.001
Token turning
Token ID6225
Feature activation-1.40e+00

Pos logit contributions
 She+0.001
 means+0.001
 Everyday+0.001
 ornament+0.001
igator+0.001
Neg logit contributions
 injuries-0.001
 lifetime-0.001
uc-0.001
uddled-0.001
har-0.001
Token the
Token ID262
Feature activation+8.51e-02

Pos logit contributions
 injuries+0.061
uc+0.059
 lifetime+0.059
uddled+0.053
ounced+0.052
Neg logit contributions
 She-0.055
otion-0.042
 means-0.042
 Everyday-0.041
 ornament-0.040
Token r
Token ID374
Feature activation+1.43e+00

Pos logit contributions
 She+0.003
 means+0.002
 ornament+0.002
 Everyday+0.002
 she+0.002
Neg logit contributions
 lifetime-0.004
 injuries-0.004
bek-0.004
uc-0.004
ounced-0.004
Tokenags
Token ID3775
Feature activation-5.19e+00

Pos logit contributions
 She+0.061
 means+0.046
 ornament+0.045
 Everyday+0.044
+0.041
Neg logit contributions
 lifetime-0.065
 injuries-0.063
uc-0.062
bek-0.059
uddled-0.058
Token,
Token ID11
Feature activation+2.36e-01

Pos logit contributions
 lifetime+0.242
 injuries+0.232
uc+0.228
bek+0.217
uddled+0.215
Neg logit contributions
 She-0.193
 Everyday-0.144
 ornament-0.142
 means-0.141
otion-0.136
Token he
Token ID339
Feature activation-4.11e-01

Pos logit contributions
 She+0.009
 means+0.007
 ornament+0.007
 Everyday+0.007
otion+0.007
Neg logit contributions
 lifetime-0.011
 injuries-0.011
uc-0.010
bek-0.010
uddled-0.009
Token looked
Token ID3114
Feature activation-1.05e+00

Pos logit contributions
 lifetime+0.018
 injuries+0.017
bek+0.016
uc+0.016
har+0.015
Neg logit contributions
 She-0.020
 means-0.015
 Everyday-0.014
 ornament-0.014
-0.014
Token out
Token ID503
Feature activation+1.57e-01

Pos logit contributions
 lifetime+0.065
 injuries+0.062
uc+0.060
bek+0.060
har+0.058
Neg logit contributions
 She-0.032
 means-0.019
 ornament-0.018
 Everyday-0.018
 she-0.016
Token the
Token ID262
Feature activation+1.35e-01

Pos logit contributions
 She+0.005
 means+0.004
 ornament+0.004
 Everyday+0.003
otion+0.003
Neg logit contributions
 lifetime-0.008
 injuries-0.008
uc-0.008
bek-0.008
har-0.007
Token writing
Token ID3597
Feature activation+9.66e-01

Pos logit contributions
 lifetime+0.028
 injuries+0.027
uc+0.025
bek+0.025
har+0.024
Neg logit contributions
 She-0.023
 means-0.017
 ornament-0.017
 Everyday-0.017
otion-0.015
Token his
Token ID465
Feature activation-6.87e-01

Pos logit contributions
 She+0.035
 Everyday+0.024
 means+0.024
 ornament+0.023
otion+0.022
Neg logit contributions
 lifetime-0.049
 injuries-0.048
uc-0.046
bek-0.044
uddled-0.044
Token z
Token ID1976
Feature activation+6.57e-01

Pos logit contributions
 lifetime+0.033
 injuries+0.032
bek+0.031
uc+0.030
ounced+0.029
Neg logit contributions
 She-0.027
 ornament-0.021
 means-0.021
 Everyday-0.019
-0.018
Tokenig
Token ID328
Feature activation-1.66e+00

Pos logit contributions
 She+0.032
 ornament+0.027
 means+0.025
 Everyday+0.025
otion+0.023
Neg logit contributions
 lifetime-0.025
bek-0.022
 injuries-0.022
cle-0.020
uc-0.020
Tokenz
Token ID89
Feature activation+8.68e-01

Pos logit contributions
 lifetime+0.078
 injuries+0.072
uc+0.072
uddled+0.068
har+0.066
Neg logit contributions
 She-0.069
 means-0.052
 Everyday-0.051
 ornament-0.050
otion-0.049
Tokenags
Token ID3775
Feature activation-5.19e+00

Pos logit contributions
 She+0.042
 means+0.033
 ornament+0.033
 Everyday+0.032
otion+0.030
Neg logit contributions
 lifetime-0.037
 injuries-0.033
bek-0.032
har-0.031
uc-0.031
Token,
Token ID11
Feature activation+3.04e-01

Pos logit contributions
 lifetime+0.263
 injuries+0.248
uc+0.242
har+0.235
bek+0.234
Neg logit contributions
 She-0.183
 means-0.132
 Everyday-0.129
otion-0.128
 ornament-0.125
Token he
Token ID339
Feature activation-2.50e-01

Pos logit contributions
 She+0.011
 means+0.009
 ornament+0.009
 Everyday+0.009
otion+0.008
Neg logit contributions
 lifetime-0.014
 injuries-0.014
uc-0.012
bek-0.012
har-0.012
Token heard
Token ID2982
Feature activation-1.02e+00

Pos logit contributions
 lifetime+0.011
 injuries+0.010
uc+0.010
bek+0.010
har+0.009
Neg logit contributions
 She-0.011
 means-0.009
 ornament-0.009
 Everyday-0.008
otion-0.008
Token a
Token ID257
Feature activation-4.61e-01

Pos logit contributions
 lifetime+0.053
 injuries+0.051
uc+0.047
har+0.046
bek+0.046
Neg logit contributions
 She-0.036
 means-0.026
 ornament-0.025
 Everyday-0.024
otion-0.024
Token funny
Token ID8258
Feature activation+4.05e-02

Pos logit contributions
 lifetime+0.026
 injuries+0.025
uc+0.025
bek+0.024
uddled+0.024
Neg logit contributions
 She-0.015
 means-0.010
 ornament-0.010
 Everyday-0.009
-0.008
Token She
Token ID1375
Feature activation+6.01e-01

Pos logit contributions
 lifetime+0.011
 injuries+0.010
uc+0.009
bek+0.009
har+0.009
Neg logit contributions
 ornament-0.006
 She-0.006
igator-0.005
 means-0.005
 angel-0.005
Token found
Token ID1043
Feature activation-3.79e-01

Pos logit contributions
 She+0.027
 means+0.021
 ornament+0.020
 Everyday+0.020
+0.018
Neg logit contributions
 lifetime-0.027
 injuries-0.025
uc-0.024
bek-0.024
har-0.023
Token some
Token ID617
Feature activation+1.39e+00

Pos logit contributions
 injuries+0.019
 lifetime+0.019
uc+0.018
uddled+0.017
ounced+0.017
Neg logit contributions
 She-0.013
 means-0.010
 Everyday-0.009
 ornament-0.009
otion-0.009
Token old
Token ID1468
Feature activation+2.26e+00

Pos logit contributions
 She+0.055
 means+0.040
 Everyday+0.038
 ornament+0.038
otion+0.037
Neg logit contributions
 lifetime-0.067
 injuries-0.065
uc-0.063
uddled-0.060
bek-0.059
Token r
Token ID374
Feature activation+1.51e+00

Pos logit contributions
 She+0.097
 Everyday+0.071
 means+0.070
otion+0.069
 ornament+0.068
Neg logit contributions
 lifetime-0.103
 injuries-0.100
uc-0.096
bek-0.090
uddled-0.089
Tokenags
Token ID3775
Feature activation-5.18e+00

Pos logit contributions
 She+0.080
 means+0.065
 ornament+0.063
 Everyday+0.063
+0.060
Neg logit contributions
 lifetime-0.052
 injuries-0.052
uc-0.051
uddled-0.047
bek-0.046
Token which
Token ID543
Feature activation-1.31e+00

Pos logit contributions
 lifetime+0.267
 injuries+0.261
uc+0.257
bek+0.249
uddled+0.247
Neg logit contributions
 She-0.170
 means-0.124
 Everyday-0.123
 ornament-0.115
otion-0.110
Token were
Token ID547
Feature activation+1.26e-01

Pos logit contributions
 lifetime+0.055
 injuries+0.051
uc+0.050
bek+0.048
uddled+0.047
Neg logit contributions
 She-0.055
 Everyday-0.042
otion-0.042
 ornament-0.042
 means-0.041
Token dirty
Token ID11841
Feature activation+9.74e-02

Pos logit contributions
 She+0.005
 means+0.004
otion+0.004
 Everyday+0.004
 ornament+0.004
Neg logit contributions
 lifetime-0.006
 injuries-0.006
uc-0.005
ounced-0.005
har-0.005
Token.
Token ID13
Feature activation-1.42e-01

Pos logit contributions
 She+0.004
 Everyday+0.003
 means+0.003
otion+0.003
 ornament+0.003
Neg logit contributions
 injuries-0.004
 lifetime-0.004
uc-0.004
uddled-0.004
less-0.004
Token She
Token ID1375
Feature activation+6.26e-01

Pos logit contributions
 lifetime+0.007
 injuries+0.007
uc+0.007
uddled+0.007
bek+0.007
Neg logit contributions
 ornament-0.004
 She-0.003
igator-0.003
 means-0.003
 angel-0.003
Token mom
Token ID1995
Feature activation-2.47e-01

Pos logit contributions
 lifetime+0.003
 injuries+0.003
uc+0.003
bek+0.003
har+0.002
Neg logit contributions
 She-0.002
 means-0.002
 ornament-0.002
 Everyday-0.002
otion-0.002
Token bought
Token ID5839
Feature activation-6.87e-01

Pos logit contributions
 lifetime+0.011
 injuries+0.011
uc+0.010
bek+0.010
har+0.010
Neg logit contributions
 She-0.011
 means-0.008
 ornament-0.008
 Everyday-0.008
otion-0.007
Token some
Token ID617
Feature activation+1.35e+00

Pos logit contributions
 lifetime+0.040
 injuries+0.039
uc+0.038
bek+0.037
uddled+0.036
Neg logit contributions
 She-0.020
 means-0.013
 ornament-0.012
 Everyday-0.012
otion-0.011
Token new
Token ID649
Feature activation-8.25e-01

Pos logit contributions
 She+0.054
 means+0.040
 ornament+0.038
 Everyday+0.038
otion+0.035
Neg logit contributions
 lifetime-0.066
 injuries-0.063
uc-0.061
bek-0.059
har-0.058
Token r
Token ID374
Feature activation+1.40e+00

Pos logit contributions
 lifetime+0.037
 injuries+0.035
uc+0.034
bek+0.032
uddled+0.032
Neg logit contributions
 She-0.036
 means-0.027
 Everyday-0.026
 ornament-0.026
otion-0.025
Tokenags
Token ID3775
Feature activation-5.16e+00

Pos logit contributions
 She+0.072
 means+0.057
 ornament+0.056
 Everyday+0.055
otion+0.052
Neg logit contributions
 lifetime-0.054
 injuries-0.050
uc-0.046
bek-0.046
har-0.044
Token.
Token ID13
Feature activation-3.73e-04

Pos logit contributions
 lifetime+0.286
 injuries+0.274
uc+0.264
bek+0.260
har+0.254
Neg logit contributions
 She-0.176
 means-0.120
 ornament-0.115
 Everyday-0.114
otion-0.102
Token When
Token ID1649
Feature activation+4.33e-01

Pos logit contributions
 lifetime+0.000
 injuries+0.000
uc+0.000
bek+0.000
uddled+0.000
Neg logit contributions
 She-0.000
 ornament-0.000
 means-0.000
igator-0.000
otion-0.000
Token they
Token ID484
Feature activation-6.68e-01

Pos logit contributions
 She+0.014
 she+0.008
 means+0.008
 Everyday+0.008
 her+0.007
Neg logit contributions
 lifetime-0.025
 injuries-0.025
uc-0.024
har-0.024
ounced-0.024
Token got
Token ID1392
Feature activation-1.37e-01

Pos logit contributions
 lifetime+0.029
 injuries+0.028
uc+0.026
bek+0.026
har+0.025
Neg logit contributions
 She-0.031
 means-0.023
 ornament-0.023
 Everyday-0.023
otion-0.021
Token back
Token ID736
Feature activation-1.28e+00

Pos logit contributions
 lifetime+0.006
 injuries+0.006
uc+0.006
bek+0.006
har+0.005
Neg logit contributions
 She-0.006
 means-0.005
 ornament-0.004
 Everyday-0.004
otion-0.004
Token
Token ID198
Feature activation-2.50e-01

Pos logit contributions
 She+0.015
 ornament+0.014
 means+0.013
otion+0.013
 Everyday+0.011
Neg logit contributions
 lifetime-0.024
 injuries-0.023
uc-0.022
bek-0.021
har-0.021
Token
Token ID198
Feature activation-2.95e-01

Pos logit contributions
 lifetime+0.014
 injuries+0.013
uc+0.013
bek+0.013
har+0.013
Neg logit contributions
 She-0.008
 means-0.006
 ornament-0.005
 Everyday-0.005
otion-0.005
TokenThe
Token ID464
Feature activation-1.56e+00

Pos logit contributions
 lifetime+0.012
 injuries+0.011
bek+0.010
uc+0.010
har+0.010
Neg logit contributions
 She-0.014
 means-0.012
 ornament-0.011
 Everyday-0.011
otion-0.011
Token dog
Token ID3290
Feature activation-9.56e-01

Pos logit contributions
 lifetime+0.082
 injuries+0.079
uc+0.075
bek+0.073
har+0.072
Neg logit contributions
 She-0.057
 means-0.040
 Everyday-0.038
 ornament-0.036
otion-0.036
Token w
Token ID266
Feature activation-7.23e-01

Pos logit contributions
 lifetime+0.038
 injuries+0.038
ounced+0.036
uc+0.036
uddled+0.035
Neg logit contributions
 She-0.041
 ornament-0.033
otion-0.031
 Everyday-0.031
 means-0.030
Tokenags
Token ID3775
Feature activation-5.15e+00

Pos logit contributions
 lifetime+0.025
 injuries+0.021
bek+0.021
har+0.019
uc+0.019
Neg logit contributions
 She-0.042
 means-0.034
 Everyday-0.033
 ornament-0.032
otion-0.032
Token its
Token ID663
Feature activation-1.34e+00

Pos logit contributions
 lifetime+0.268
 injuries+0.259
uc+0.249
bek+0.246
har+0.241
Neg logit contributions
 She-0.184
 means-0.132
 ornament-0.126
 Everyday-0.126
otion-0.116
Token tail
Token ID7894
Feature activation-1.25e+00

Pos logit contributions
 lifetime+0.061
 injuries+0.057
uc+0.056
bek+0.055
uddled+0.053
Neg logit contributions
 She-0.060
 ornament-0.044
 means-0.044
 Everyday-0.043
 It-0.040
Token.
Token ID13
Feature activation+1.10e-01

Pos logit contributions
 lifetime+0.061
 injuries+0.059
uc+0.055
uddled+0.055
ounced+0.054
Neg logit contributions
 She-0.045
 means-0.035
 ornament-0.033
otion-0.033
 Everyday-0.032
Token It
Token ID632
Feature activation-4.73e-01

Pos logit contributions
 She+0.002
 ornament+0.002
 means+0.002
otion+0.001
 Everyday+0.001
Neg logit contributions
 lifetime-0.007
 injuries-0.007
bek-0.006
uc-0.006
har-0.006
Token l
Token ID300
Feature activation-1.22e+00

Pos logit contributions
 lifetime+0.019
 injuries+0.018
ounced+0.018
uc+0.017
uddled+0.017
Neg logit contributions
 She-0.021
 ornament-0.017
otion-0.016
 Everyday-0.016
 means-0.015
Token it
Token ID340
Feature activation+4.26e-01

Pos logit contributions
 She+0.010
 ornament+0.008
 means+0.008
otion+0.007
 Everyday+0.007
Neg logit contributions
 lifetime-0.012
 injuries-0.012
bek-0.011
uc-0.011
har-0.011
Token was
Token ID373
Feature activation+6.60e-03

Pos logit contributions
 She+0.020
 ornament+0.014
 Everyday+0.014
 means+0.014
otion+0.014
Neg logit contributions
 lifetime-0.019
 injuries-0.018
uc-0.017
uddled-0.017
har-0.016
Token difficult
Token ID2408
Feature activation-1.44e-01

Pos logit contributions
 She+0.000
 means+0.000
 ornament+0.000
 Everyday+0.000
otion+0.000
Neg logit contributions
 lifetime-0.000
 injuries-0.000
uc-0.000
bek-0.000
har-0.000
Token finding
Token ID4917
Feature activation+5.76e-01

Pos logit contributions
 lifetime+0.006
 injuries+0.006
uc+0.006
bek+0.006
uddled+0.005
Neg logit contributions
 She-0.006
 ornament-0.005
 means-0.005
 Everyday-0.005
otion-0.004
Token r
Token ID374
Feature activation+1.48e+00

Pos logit contributions
 She+0.019
 means+0.013
 ornament+0.012
 Everyday+0.011
 It+0.011
Neg logit contributions
 lifetime-0.034
 injuries-0.031
bek-0.031
uc-0.031
ounced-0.030
Tokenags
Token ID3775
Feature activation-5.14e+00

Pos logit contributions
 She+0.066
 means+0.050
 Everyday+0.048
 ornament+0.048
otion+0.045
Neg logit contributions
 lifetime-0.068
 injuries-0.063
bek-0.060
uc-0.059
har-0.056
Token and
Token ID290
Feature activation-2.39e-02

Pos logit contributions
 lifetime+0.280
 injuries+0.265
uc+0.254
bek+0.250
har+0.243
Neg logit contributions
 She-0.189
 means-0.134
 ornament-0.125
 Everyday-0.119
 she-0.116
Token water
Token ID1660
Feature activation-4.78e-01

Pos logit contributions
 lifetime+0.001
 injuries+0.001
uc+0.001
bek+0.001
har+0.001
Neg logit contributions
 She-0.001
 means-0.001
 ornament-0.001
 Everyday-0.001
otion-0.001
Token.
Token ID13
Feature activation-1.64e-02

Pos logit contributions
 lifetime+0.023
 injuries+0.022
uc+0.021
bek+0.020
uddled+0.020
Neg logit contributions
 She-0.019
 means-0.014
 ornament-0.014
 Everyday-0.014
otion-0.013
Token But
Token ID887
Feature activation-2.18e-01

Pos logit contributions
 lifetime+0.001
 injuries+0.001
har+0.001
ounced+0.001
uc+0.001
Neg logit contributions
 She-0.001
 ornament-0.001
otion-0.000
 means-0.000
 angel-0.000
Token Bob
Token ID5811
Feature activation-1.73e+00

Pos logit contributions
 lifetime+0.010
 injuries+0.009
bek+0.009
har+0.008
oy+0.008
Neg logit contributions
 She-0.009
 ornament-0.007
otion-0.007
 means-0.007
 Everyday-0.006
Token him
Token ID683
Feature activation+1.12e-01

Pos logit contributions
 lifetime+0.045
 injuries+0.042
uc+0.041
bek+0.040
har+0.040
Neg logit contributions
 She-0.033
 means-0.024
 ornament-0.022
 Everyday-0.022
 she-0.020
Token to
Token ID284
Feature activation-1.88e-01

Pos logit contributions
 She+0.005
 means+0.003
 ornament+0.003
 Everyday+0.003
otion+0.003
Neg logit contributions
 lifetime-0.005
 injuries-0.005
uc-0.005
bek-0.005
har-0.004
Token turn
Token ID1210
Feature activation-7.40e-01

Pos logit contributions
 injuries+0.008
 lifetime+0.008
uc+0.007
bek+0.007
har+0.007
Neg logit contributions
 She-0.008
 means-0.006
otion-0.006
 ornament-0.006
 Everyday-0.006
Token the
Token ID262
Feature activation+2.11e-01

Pos logit contributions
 injuries+0.035
 lifetime+0.033
uc+0.032
har+0.030
bek+0.029
Neg logit contributions
 She-0.029
otion-0.021
 Everyday-0.021
 ornament-0.020
 means-0.020
Token r
Token ID374
Feature activation+1.48e+00

Pos logit contributions
 She+0.008
 means+0.006
 Everyday+0.006
 ornament+0.006
otion+0.005
Neg logit contributions
 lifetime-0.011
 injuries-0.010
uc-0.010
bek-0.009
har-0.009
Tokenags
Token ID3775
Feature activation-5.14e+00

Pos logit contributions
 She+0.062
 means+0.045
 ornament+0.045
 Everyday+0.044
+0.042
Neg logit contributions
 lifetime-0.069
 injuries-0.067
uc-0.067
uddled-0.063
har-0.062
Token she
Token ID673
Feature activation+1.24e-01

Pos logit contributions
 lifetime+0.242
 injuries+0.232
uc+0.224
bek+0.219
har+0.213
Neg logit contributions
 She-0.206
 Everyday-0.151
 means-0.150
 ornament-0.148
otion-0.142
Token kept
Token ID4030
Feature activation-5.35e-01

Pos logit contributions
 She+0.005
 means+0.004
+0.004
 Everyday+0.003
 ornament+0.003
Neg logit contributions
 lifetime-0.006
 injuries-0.006
bek-0.006
uc-0.006
har-0.005
Token in
Token ID287
Feature activation+3.72e-01

Pos logit contributions
 injuries+0.025
 lifetime+0.025
uc+0.023
har+0.022
bek+0.022
Neg logit contributions
 She-0.022
 Everyday-0.016
 means-0.016
 ornament-0.015
otion-0.015
Token the
Token ID262
Feature activation+2.45e-01

Pos logit contributions
 She+0.014
 means+0.011
 Everyday+0.010
 ornament+0.010
igator+0.010
Neg logit contributions
 injuries-0.019
 lifetime-0.018
uc-0.017
bek-0.016
har-0.016
Token closet
Token ID21615
Feature activation-5.09e-01

Pos logit contributions
 She+0.010
 means+0.007
 Everyday+0.007
 ornament+0.007
otion+0.007
Neg logit contributions
 lifetime-0.012
 injuries-0.012
uc-0.011
bek-0.011
har-0.010
Token grabbed
Token ID13176
Feature activation-1.47e+00

Pos logit contributions
 lifetime+0.056
 injuries+0.053
uc+0.051
bek+0.050
har+0.049
Neg logit contributions
 She-0.050
 means-0.037
 ornament-0.036
 Everyday-0.036
otion-0.033
Token one
Token ID530
Feature activation+5.55e-01

Pos logit contributions
 lifetime+0.079
 injuries+0.078
uc+0.077
bek+0.073
har+0.071
Neg logit contributions
 She-0.048
 means-0.033
 Everyday-0.031
 ornament-0.031
otion-0.029
Token of
Token ID286
Feature activation+1.18e+00

Pos logit contributions
 She+0.019
 means+0.014
 Everyday+0.013
 ornament+0.012
otion+0.012
Neg logit contributions
 lifetime-0.030
 injuries-0.029
uc-0.028
bek-0.027
har-0.027
Token the
Token ID262
Feature activation+8.05e-02

Pos logit contributions
 She+0.035
 means+0.022
 ornament+0.021
 Everyday+0.020
otion+0.018
Neg logit contributions
 lifetime-0.071
 injuries-0.069
uc-0.066
bek-0.065
har-0.064
Token r
Token ID374
Feature activation+1.42e+00

Pos logit contributions
 She+0.003
 means+0.003
 Everyday+0.003
 ornament+0.003
otion+0.002
Neg logit contributions
 lifetime-0.004
 injuries-0.004
uc-0.003
bek-0.003
har-0.003
Tokenags
Token ID3775
Feature activation-5.13e+00

Pos logit contributions
 She+0.073
 means+0.057
 ornament+0.056
 Everyday+0.055
otion+0.052
Neg logit contributions
 lifetime-0.056
 injuries-0.051
bek-0.048
uc-0.047
har-0.045
Token and
Token ID290
Feature activation+6.28e-02

Pos logit contributions
 lifetime+0.268
 injuries+0.256
uc+0.253
bek+0.246
har+0.243
Neg logit contributions
 She-0.174
 means-0.127
 Everyday-0.121
 ornament-0.120
otion-0.112
Token started
Token ID2067
Feature activation-1.69e-01

Pos logit contributions
 She+0.003
 means+0.002
 ornament+0.002
 Everyday+0.002
otion+0.002
Neg logit contributions
 lifetime-0.003
 injuries-0.003
uc-0.002
bek-0.002
har-0.002
Token to
Token ID284
Feature activation-2.08e-01

Pos logit contributions
 lifetime+0.007
 injuries+0.007
uc+0.007
bek+0.006
har+0.006
Neg logit contributions
 She-0.008
 means-0.006
 ornament-0.006
 Everyday-0.006
otion-0.005
Token spin
Token ID7906
Feature activation-4.62e-02

Pos logit contributions
 lifetime+0.008
 injuries+0.008
uc+0.008
har+0.008
bek+0.007
Neg logit contributions
 She-0.009
 means-0.008
otion-0.007
 ornament-0.007
 Everyday-0.007
Token it
Token ID340
Feature activation+3.96e-01

Pos logit contributions
 lifetime+0.002
 injuries+0.002
uc+0.002
har+0.002
uddled+0.002
Neg logit contributions
 She-0.002
 means-0.001
 Everyday-0.001
otion-0.001
 ornament-0.001
Token and
Token ID290
Feature activation-1.61e-01

Pos logit contributions
 lifetime+0.023
 injuries+0.022
uc+0.021
bek+0.020
har+0.020
Neg logit contributions
 She-0.015
 means-0.011
 ornament-0.011
otion-0.011
 Everyday-0.010
Token showed
Token ID3751
Feature activation+5.50e-01

Pos logit contributions
 lifetime+0.008
 injuries+0.008
uc+0.007
bek+0.007
har+0.007
Neg logit contributions
 She-0.007
 means-0.005
 ornament-0.005
 Everyday-0.005
 she-0.004
Token her
Token ID607
Feature activation+3.51e-01

Pos logit contributions
 She+0.016
 ornament+0.011
 means+0.010
 Everyday+0.010
otion+0.009
Neg logit contributions
uc-0.028
 injuries-0.028
 lifetime-0.027
uddled-0.027
eds-0.025
Token the
Token ID262
Feature activation+7.04e-02

Pos logit contributions
 She+0.014
 means+0.010
 ornament+0.009
 Everyday+0.009
otion+0.009
Neg logit contributions
 lifetime-0.017
 injuries-0.017
uc-0.016
bek-0.015
uddled-0.015
Token r
Token ID374
Feature activation+1.44e+00

Pos logit contributions
 She+0.003
 means+0.002
 Everyday+0.002
 ornament+0.002
otion+0.002
Neg logit contributions
 lifetime-0.003
 injuries-0.003
uc-0.003
uddled-0.003
har-0.003
Tokenags
Token ID3775
Feature activation-5.13e+00

Pos logit contributions
 She+0.076
 means+0.061
 ornament+0.059
 Everyday+0.059
+0.056
Neg logit contributions
 lifetime-0.051
 injuries-0.049
uc-0.048
uddled-0.045
bek-0.045
Token.
Token ID13
Feature activation-4.45e-02

Pos logit contributions
 lifetime+0.278
 injuries+0.268
uc+0.261
bek+0.253
har+0.249
Neg logit contributions
 She-0.171
 means-0.121
 Everyday-0.114
 ornament-0.113
otion-0.104
Token She
Token ID1375
Feature activation+6.49e-01

Pos logit contributions
 lifetime+0.002
uc+0.002
uddled+0.002
 injuries+0.002
ounced+0.002
Neg logit contributions
 ornament-0.001
igator-0.001
 means-0.001
 angel-0.001
 She-0.001
Token said
Token ID531
Feature activation-1.86e-01

Pos logit contributions
 She+0.028
 means+0.021
 ornament+0.021
 Everyday+0.020
otion+0.019
Neg logit contributions
 lifetime-0.030
 injuries-0.028
uc-0.027
bek-0.027
har-0.026
Token that
Token ID326
Feature activation-4.79e-01

Pos logit contributions
 lifetime+0.008
uc+0.008
uddled+0.008
 injuries+0.007
bek+0.007
Neg logit contributions
 She-0.007
 means-0.006
 Everyday-0.006
 ornament-0.005
igator-0.005
Token she
Token ID673
Feature activation+2.34e-02

Pos logit contributions
 lifetime+0.022
 injuries+0.021
uc+0.020
uddled+0.018
bek+0.018
Neg logit contributions
 She-0.017
igator-0.013
 Everyday-0.012
 ornament-0.012
otion-0.012
Token the
Token ID262
Feature activation+1.00e-01

Pos logit contributions
 lifetime+0.018
 injuries+0.018
uc+0.016
bek+0.016
har+0.015
Neg logit contributions
 She-0.013
 ornament-0.009
 Everyday-0.009
 means-0.009
otion-0.008
Token machine
Token ID4572
Feature activation+4.98e-01

Pos logit contributions
 She+0.004
 means+0.003
 ornament+0.003
 Everyday+0.002
otion+0.002
Neg logit contributions
 lifetime-0.005
 injuries-0.005
uc-0.005
bek-0.005
ounced-0.005
Token was
Token ID373
Feature activation-2.63e-02

Pos logit contributions
 She+0.021
 ornament+0.015
 Everyday+0.015
 means+0.015
otion+0.014
Neg logit contributions
 lifetime-0.023
 injuries-0.022
uc-0.021
bek-0.020
uddled-0.020
Token a
Token ID257
Feature activation-6.55e-01

Pos logit contributions
 lifetime+0.001
 injuries+0.001
uc+0.001
bek+0.001
uddled+0.001
Neg logit contributions
 She-0.001
 means-0.001
 ornament-0.001
 Everyday-0.001
 she-0.001
Token shred
Token ID21163
Feature activation-2.19e-01

Pos logit contributions
 lifetime+0.034
 injuries+0.033
uc+0.032
bek+0.031
uddled+0.031
Neg logit contributions
 She-0.023
 means-0.017
 ornament-0.017
 Everyday-0.015
-0.015
Tokender
Token ID1082
Feature activation+4.78e+00

Pos logit contributions
 lifetime+0.010
 injuries+0.009
uc+0.009
bek+0.009
har+0.008
Neg logit contributions
 She-0.010
 means-0.008
 ornament-0.007
 Everyday-0.007
-0.007
Token,
Token ID11
Feature activation+6.13e-01

Pos logit contributions
 She+0.212
 ornament+0.160
 Everyday+0.157
 means+0.149
otion+0.146
Neg logit contributions
 lifetime-0.207
 injuries-0.202
uc-0.183
bek-0.176
uddled-0.174
Token that
Token ID326
Feature activation-2.76e-01

Pos logit contributions
 She+0.023
 means+0.017
 ornament+0.016
 Everyday+0.016
otion+0.015
Neg logit contributions
 lifetime-0.031
 injuries-0.030
uc-0.028
bek-0.028
har-0.027
Token could
Token ID714
Feature activation+6.88e-01

Pos logit contributions
 lifetime+0.012
 injuries+0.011
uc+0.010
bek+0.010
ounced+0.010
Neg logit contributions
 She-0.012
 ornament-0.009
 Everyday-0.009
otion-0.009
 means-0.008
Token cut
Token ID2005
Feature activation-1.16e-01

Pos logit contributions
 She+0.028
 means+0.021
 Everyday+0.021
 ornament+0.020
otion+0.020
Neg logit contributions
 injuries-0.032
 lifetime-0.030
har-0.028
uc-0.028
bek-0.027
Token things
Token ID1243
Feature activation+6.83e-01

Pos logit contributions
 lifetime+0.006
 injuries+0.006
uc+0.005
bek+0.005
har+0.005
Neg logit contributions
 She-0.004
 ornament-0.003
 Everyday-0.003
 means-0.003
otion-0.003
Token have
Token ID423
Feature activation-8.40e-01

Pos logit contributions
 lifetime+0.006
 injuries+0.006
uc+0.005
bek+0.005
har+0.005
Neg logit contributions
 She-0.006
 ornament-0.004
 means-0.004
otion-0.004
 Everyday-0.004
Token to
Token ID284
Feature activation-1.51e-01

Pos logit contributions
 lifetime+0.047
bek+0.047
uc+0.046
har+0.044
uddled+0.044
Neg logit contributions
 She-0.027
 means-0.018
-0.017
 she-0.016
 her-0.016
Token find
Token ID1064
Feature activation-1.62e-01

Pos logit contributions
 injuries+0.007
 lifetime+0.006
har+0.006
bek+0.006
uc+0.006
Neg logit contributions
 She-0.007
 means-0.005
otion-0.005
 ornament-0.005
 Everyday-0.005
Token the
Token ID262
Feature activation+2.48e-01

Pos logit contributions
 lifetime+0.009
bek+0.009
 injuries+0.009
uc+0.009
uddled+0.008
Neg logit contributions
 She-0.005
 means-0.004
 ornament-0.004
 Everyday-0.003
 her-0.003
Token pasta
Token ID26296
Feature activation+9.30e-01

Pos logit contributions
 She+0.010
 means+0.007
 Everyday+0.007
 ornament+0.007
otion+0.007
Neg logit contributions
 lifetime-0.012
 injuries-0.012
uc-0.011
bek-0.011
har-0.011
Token aisle
Token ID28960
Feature activation+4.23e+00

Pos logit contributions
 She+0.039
 Everyday+0.029
 means+0.028
 ornament+0.028
otion+0.026
Neg logit contributions
 lifetime-0.043
 injuries-0.040
uc-0.039
bek-0.037
uddled-0.037
Token."
Token ID526
Feature activation+4.53e-01

Pos logit contributions
 She+0.154
 means+0.109
 ornament+0.103
 Everyday+0.102
+0.092
Neg logit contributions
 lifetime-0.224
 injuries-0.216
uc-0.208
bek-0.205
har-0.199
Token
Token ID198
Feature activation+2.74e-02

Pos logit contributions
 She+0.014
 ornament+0.010
 means+0.010
 Everyday+0.009
otion+0.009
Neg logit contributions
 lifetime-0.025
 injuries-0.024
uc-0.023
bek-0.023
har-0.023
Token
Token ID198
Feature activation-2.74e-02

Pos logit contributions
 She+0.001
 means+0.000
 Everyday+0.000
 ornament+0.000
 It+0.000
Neg logit contributions
 lifetime-0.002
 injuries-0.002
uc-0.002
bek-0.002
ounced-0.002
TokenL
Token ID43
Feature activation-4.15e-02

Pos logit contributions
 lifetime+0.001
 injuries+0.001
uc+0.001
bek+0.001
ounced+0.001
Neg logit contributions
 She-0.001
 means-0.001
 ornament-0.001
 Everyday-0.001
-0.001
Tokenily
Token ID813
Feature activation+1.79e-01

Pos logit contributions
 lifetime+0.002
 injuries+0.002
uc+0.002
bek+0.002
uddled+0.002
Neg logit contributions
 She-0.001
 means-0.001
 Everyday-0.001
 ornament-0.001
-0.001
Token and
Token ID290
Feature activation+1.48e-01

Pos logit contributions
 lifetime+0.007
 injuries+0.007
uc+0.007
bek+0.007
har+0.007
Neg logit contributions
 She-0.004
 means-0.003
 Everyday-0.003
 ornament-0.003
otion-0.002
Token it
Token ID340
Feature activation+5.70e-01

Pos logit contributions
 She+0.006
 means+0.005
 Everyday+0.005
 ornament+0.005
otion+0.005
Neg logit contributions
 lifetime-0.007
 injuries-0.006
uc-0.006
bek-0.006
uddled-0.006
Token was
Token ID373
Feature activation+2.55e-02

Pos logit contributions
 She+0.024
 means+0.018
 Everyday+0.018
 ornament+0.018
otion+0.017
Neg logit contributions
 lifetime-0.026
 injuries-0.025
uc-0.024
bek-0.023
uddled-0.023
Token the
Token ID262
Feature activation+3.98e-01

Pos logit contributions
 She+0.001
 means+0.001
 Everyday+0.001
 ornament+0.001
otion+0.001
Neg logit contributions
 lifetime-0.001
 injuries-0.001
uc-0.001
bek-0.001
har-0.001
Token y
Token ID331
Feature activation+3.33e-01

Pos logit contributions
 She+0.016
 means+0.012
 ornament+0.012
 Everyday+0.011
+0.010
Neg logit contributions
 lifetime-0.020
 injuries-0.019
uc-0.018
bek-0.018
har-0.017
Tokenumm
Token ID13929
Feature activation+4.23e+00

Pos logit contributions
 She+0.014
 means+0.011
 ornament+0.010
 Everyday+0.010
otion+0.009
Neg logit contributions
 lifetime-0.016
 injuries-0.015
uc-0.014
bek-0.014
har-0.014
Tokeniest
Token ID6386
Feature activation+1.62e+00

Pos logit contributions
 She+0.220
 Everyday+0.175
 means+0.167
 ornament+0.160
 It+0.160
Neg logit contributions
 lifetime-0.153
 injuries-0.151
uc-0.147
bek-0.137
uddled-0.136
Token ice
Token ID4771
Feature activation-3.51e-01

Pos logit contributions
 She+0.065
 means+0.048
 ornament+0.048
 Everyday+0.046
 she+0.042
Neg logit contributions
 lifetime-0.079
 injuries-0.075
uc-0.073
bek-0.072
har-0.069
Token cream
Token ID8566
Feature activation-7.80e-02

Pos logit contributions
 lifetime+0.013
 injuries+0.012
uc+0.011
bek+0.011
har+0.010
Neg logit contributions
 She-0.019
 means-0.015
 ornament-0.015
 Everyday-0.014
 she-0.014
Token she
Token ID673
Feature activation+4.47e-01

Pos logit contributions
 lifetime+0.004
 injuries+0.004
uc+0.003
bek+0.003
har+0.003
Neg logit contributions
 She-0.003
 means-0.002
 ornament-0.002
 Everyday-0.002
otion-0.002
Token ever
Token ID1683
Feature activation-3.58e-01

Pos logit contributions
 She+0.017
 means+0.013
+0.012
 she+0.012
 ornament+0.012
Neg logit contributions
 injuries-0.022
 lifetime-0.022
uc-0.021
bek-0.020
uddled-0.020
Token.
Token ID13
Feature activation+1.51e-01

Pos logit contributions
 She+0.052
 means+0.039
 ornament+0.037
 Everyday+0.037
otion+0.034
Neg logit contributions
 lifetime-0.061
 injuries-0.058
uc-0.055
bek-0.054
har-0.053
Token She
Token ID1375
Feature activation+9.14e-01

Pos logit contributions
 means+0.002
 ornament+0.001
igator+0.001
 idea+0.001
 She+0.001
Neg logit contributions
 lifetime-0.010
 injuries-0.009
aj-0.009
bek-0.009
uddled-0.009
Token loved
Token ID6151
Feature activation-8.14e-01

Pos logit contributions
 She+0.042
 means+0.032
 ornament+0.031
 Everyday+0.031
+0.029
Neg logit contributions
 lifetime-0.040
 injuries-0.038
uc-0.036
bek-0.036
har-0.034
Token to
Token ID284
Feature activation-1.73e-01

Pos logit contributions
 lifetime+0.033
 injuries+0.031
uc+0.031
bek+0.028
ounced+0.028
Neg logit contributions
 She-0.037
 means-0.030
otion-0.029
 Everyday-0.029
 ornament-0.028
Token eat
Token ID4483
Feature activation-1.56e-01

Pos logit contributions
 lifetime+0.005
 injuries+0.005
uc+0.005
oy+0.004
cle+0.004
Neg logit contributions
 She-0.009
 means-0.008
otion-0.008
 ornament-0.007
 Everyday-0.007
Token av
Token ID1196
Feature activation+4.40e+00

Pos logit contributions
uc+0.006
 injuries+0.005
 lifetime+0.005
uddled+0.005
oy+0.005
Neg logit contributions
 She-0.007
 Everyday-0.006
 means-0.006
otion-0.006
 ornament-0.006
Tokenoc
Token ID420
Feature activation+2.43e+00

Pos logit contributions
 She+0.243
otion+0.199
 means+0.197
 Everyday+0.189
 ornament+0.187
Neg logit contributions
 lifetime-0.149
uc-0.134
 injuries-0.130
bek-0.127
oy-0.120
Tokenados
Token ID22484
Feature activation+2.18e+00

Pos logit contributions
 She+0.101
 means+0.073
 Everyday+0.072
otion+0.069
 ornament+0.068
Neg logit contributions
 lifetime-0.114
uc-0.112
 injuries-0.111
bek-0.103
ounced-0.102
Token because
Token ID780
Feature activation-2.41e+00

Pos logit contributions
 She+0.094
otion+0.076
 means+0.075
 Everyday+0.074
 ornament+0.073
Neg logit contributions
 lifetime-0.089
uc-0.084
 injuries-0.080
bek-0.075
uddled-0.074
Token they
Token ID484
Feature activation-3.52e-01

Pos logit contributions
 lifetime+0.084
uc+0.072
bek+0.072
 injuries+0.070
 brand+0.066
Neg logit contributions
 She-0.088
 cry-0.084
 repeats-0.079
 means-0.078
 shrug-0.078
Token were
Token ID547
Feature activation+3.93e-01

Pos logit contributions
 lifetime+0.014
uc+0.013
bek+0.012
 injuries+0.012
ounced+0.011
Neg logit contributions
 She-0.016
 means-0.012
otion-0.012
 Everyday-0.012
 ornament-0.012
Token The
Token ID383
Feature activation-9.04e-01

Pos logit contributions
 She+0.003
 Everyday+0.002
 It+0.002
 means+0.002
 He+0.001
Neg logit contributions
 lifetime-0.005
 injuries-0.005
uc-0.005
bek-0.005
har-0.005
Token ocean
Token ID9151
Feature activation+3.99e-01

Pos logit contributions
 lifetime+0.051
uc+0.051
bek+0.051
uddled+0.049
ounced+0.049
Neg logit contributions
 She-0.026
 Everyday-0.019
 means-0.017
 ornament-0.017
 answer-0.017
Token was
Token ID373
Feature activation+2.47e-03

Pos logit contributions
 She+0.013
 means+0.010
 Everyday+0.009
 ornament+0.009
+0.009
Neg logit contributions
 lifetime-0.022
 injuries-0.021
uc-0.020
bek-0.020
har-0.019
Token a
Token ID257
Feature activation-6.57e-01

Pos logit contributions
 She+0.000
 means+0.000
 Everyday+0.000
 ornament+0.000
+0.000
Neg logit contributions
 lifetime-0.000
 injuries-0.000
uc-0.000
bek-0.000
uddled-0.000
Token sad
Token ID6507
Feature activation+1.18e+00

Pos logit contributions
uc+0.039
uddled+0.037
ounced+0.035
aj+0.035
eds+0.035
Neg logit contributions
 She-0.017
 ornament-0.013
-0.013
 idea-0.012
 baby-0.012
Tokender
Token ID1082
Feature activation+4.61e+00

Pos logit contributions
 She+0.049
 means+0.037
 ornament+0.036
 Everyday+0.035
otion+0.033
Neg logit contributions
 lifetime-0.056
 injuries-0.053
uc-0.049
bek-0.049
har-0.047
Token place
Token ID1295
Feature activation-1.80e+00

Pos logit contributions
 She+0.176
 ornament+0.127
 Everyday+0.119
 means+0.118
otion+0.114
Neg logit contributions
 lifetime-0.235
 injuries-0.221
uc-0.203
bek-0.200
uddled-0.199
Token without
Token ID1231
Feature activation+1.07e+00

Pos logit contributions
 lifetime+0.098
 injuries+0.094
uc+0.092
bek+0.090
har+0.087
Neg logit contributions
 She-0.063
 means-0.044
 ornament-0.041
 Everyday-0.041
 she-0.037
Token Fin
Token ID4463
Feature activation-6.69e-01

Pos logit contributions
 She+0.036
 means+0.025
 Everyday+0.022
 ornament+0.022
 her+0.022
Neg logit contributions
 lifetime-0.060
bek-0.057
uc-0.057
uddled-0.056
har-0.055
Token's
Token ID338
Feature activation+6.03e-02

Pos logit contributions
 lifetime+0.038
 injuries+0.036
bek+0.036
uc+0.036
har+0.035
Neg logit contributions
 She-0.022
 means-0.015
-0.013
 Everyday-0.013
 ornament-0.013
Token happy
Token ID3772
Feature activation+3.84e-01

Pos logit contributions
 She+0.002
 answer+0.002
 means+0.002
 Everyday+0.002
 idea+0.002
Neg logit contributions
 lifetime-0.003
bek-0.003
uc-0.003
har-0.003
uddled-0.003
Token happy
Token ID3772
Feature activation+3.07e-01

Pos logit contributions
 She+0.006
 means+0.005
otion+0.005
 Everyday+0.005
 ornament+0.005
Neg logit contributions
 lifetime-0.005
 injuries-0.005
har-0.005
ounced-0.005
uddled-0.004
Token and
Token ID290
Feature activation-3.02e-01

Pos logit contributions
 She+0.011
 means+0.007
 ornament+0.007
 Everyday+0.007
otion+0.006
Neg logit contributions
 lifetime-0.017
 injuries-0.016
uc-0.016
bek-0.015
har-0.015
Token cl
Token ID537
Feature activation+1.83e-01

Pos logit contributions
 lifetime+0.014
 injuries+0.013
uc+0.013
bek+0.012
har+0.012
Neg logit contributions
 She-0.013
 means-0.010
 Everyday-0.010
 ornament-0.010
 she-0.009
Tokenapped
Token ID6320
Feature activation+1.08e+00

Pos logit contributions
 She+0.009
 means+0.007
 ornament+0.007
 Everyday+0.007
otion+0.007
Neg logit contributions
 lifetime-0.007
 injuries-0.007
uc-0.006
bek-0.006
har-0.006
Token even
Token ID772
Feature activation-1.81e+00

Pos logit contributions
 She+0.047
 means+0.035
 ornament+0.034
 Everyday+0.034
otion+0.031
Neg logit contributions
 lifetime-0.051
 injuries-0.048
uc-0.046
bek-0.045
har-0.044
Token louder
Token ID27089
Feature activation+1.06e+00

Pos logit contributions
 lifetime+0.082
 injuries+0.077
uc+0.072
bek+0.072
har+0.070
Neg logit contributions
 She-0.081
 means-0.060
 ornament-0.059
 Everyday-0.059
otion-0.057
Token.
Token ID13
Feature activation-3.35e-02

Pos logit contributions
 She+0.045
 means+0.034
 ornament+0.033
 Everyday+0.033
otion+0.031
Neg logit contributions
 lifetime-0.049
 injuries-0.047
uc-0.044
bek-0.043
har-0.042
Token The
Token ID383
Feature activation-1.14e+00

Pos logit contributions
 lifetime+0.002
 injuries+0.002
uc+0.002
bek+0.002
har+0.002
Neg logit contributions
 She-0.001
 means-0.001
 Everyday-0.001
 ornament-0.001
 It-0.001
Token end
Token ID886
Feature activation-7.03e-01

Pos logit contributions
 lifetime+0.056
 injuries+0.054
uc+0.051
bek+0.050
har+0.049
Neg logit contributions
 She-0.047
 means-0.034
 Everyday-0.032
 ornament-0.032
otion-0.030
Token.
Token ID13
Feature activation-3.03e-02

Pos logit contributions
 lifetime+0.043
uc+0.041
 injuries+0.041
bek+0.041
ounced+0.040
Neg logit contributions
 She-0.023
 means-0.016
-0.014
 It-0.012
 Everyday-0.012
Token<|endoftext|>
Token ID50256
Feature activation-1.52e+00

Pos logit contributions
 lifetime+0.002
 injuries+0.002
uc+0.002
bek+0.002
ounced+0.002
Neg logit contributions
 She-0.001
 means-0.000
 Everyday-0.000
 It-0.000
 ornament-0.000
Token.
Token ID13
Feature activation-3.37e-02

Pos logit contributions
 She+0.019
 means+0.015
 ornament+0.015
 Everyday+0.014
otion+0.014
Neg logit contributions
 lifetime-0.021
 injuries-0.021
uc-0.019
uddled-0.018
bek-0.018
Token She
Token ID1375
Feature activation+7.45e-01

Pos logit contributions
 lifetime+0.002
bek+0.002
har+0.002
cle+0.002
 injuries+0.002
Neg logit contributions
 ornament-0.000
igator-0.000
otion-0.000
 idea-0.000
 means-0.000
Token said
Token ID531
Feature activation-3.65e-01

Pos logit contributions
 She+0.031
 means+0.023
 Everyday+0.022
 ornament+0.021
+0.020
Neg logit contributions
 lifetime-0.037
 injuries-0.035
uc-0.033
bek-0.033
har-0.031
Token:
Token ID25
Feature activation-1.79e+00

Pos logit contributions
 injuries+0.015
uc+0.015
 lifetime+0.015
uddled+0.015
ounced+0.014
Neg logit contributions
 She-0.013
 Everyday-0.012
 ornament-0.011
 means-0.011
 idea-0.010
Token "
Token ID366
Feature activation-7.41e-01

Pos logit contributions
 lifetime+0.081
 injuries+0.075
uc+0.075
bek+0.073
har+0.070
Neg logit contributions
 She-0.065
 ornament-0.057
 means-0.054
igator-0.052
 Everyday-0.052
TokenDon
Token ID3987
Feature activation+2.23e+00

Pos logit contributions
 lifetime+0.027
 injuries+0.026
bek+0.025
har+0.024
uc+0.024
Neg logit contributions
 She-0.037
 means-0.030
 ornament-0.030
 Everyday-0.030
otion-0.029
Token't
Token ID470
Feature activation+1.77e+00

Pos logit contributions
 She+0.058
 ornament+0.042
 means+0.036
 Everyday+0.035
otion+0.031
Neg logit contributions
 lifetime-0.127
 injuries-0.124
bek-0.120
har-0.120
uc-0.118
Token worry
Token ID5490
Feature activation+1.38e+00

Pos logit contributions
 She+0.075
 means+0.062
otion+0.059
 ornament+0.059
 Everyday+0.058
Neg logit contributions
 injuries-0.075
 lifetime-0.074
har-0.070
uc-0.066
bek-0.065
Token,
Token ID11
Feature activation+8.03e-02

Pos logit contributions
 She+0.050
 means+0.038
 Everyday+0.038
 ornament+0.037
otion+0.035
Neg logit contributions
 lifetime-0.065
 injuries-0.064
uc-0.059
bek-0.059
har-0.057
Token I
Token ID314
Feature activation-6.21e-01

Pos logit contributions
 She+0.004
 means+0.003
 Everyday+0.003
 ornament+0.003
otion+0.003
Neg logit contributions
 lifetime-0.003
 injuries-0.003
bek-0.003
har-0.003
cle-0.003
Token'll
Token ID1183
Feature activation-5.22e-01

Pos logit contributions
 lifetime+0.027
 injuries+0.025
uc+0.024
bek+0.024
har+0.024
Neg logit contributions
 She-0.028
 ornament-0.022
 means-0.021
 Everyday-0.020
otion-0.019
TokenWe
Token ID1135
Feature activation-3.47e-01

Pos logit contributions
 lifetime+0.011
 injuries+0.011
bek+0.011
uc+0.011
har+0.011
Neg logit contributions
 She-0.018
 Everyday-0.015
 ornament-0.014
 means-0.014
 idea-0.014
Token found
Token ID1043
Feature activation-1.59e-01

Pos logit contributions
 injuries+0.017
 lifetime+0.017
uc+0.016
ounced+0.016
bek+0.016
Neg logit contributions
 She-0.013
 Everyday-0.009
 means-0.009
 ornament-0.009
otion-0.008
Token them
Token ID606
Feature activation-1.70e-01

Pos logit contributions
 lifetime+0.011
 injuries+0.010
bek+0.010
uc+0.010
har+0.010
Neg logit contributions
 She-0.004
 means-0.003
 ornament-0.002
 It-0.002
 Everyday-0.002
Token in
Token ID287
Feature activation+5.25e-01

Pos logit contributions
 lifetime+0.008
 injuries+0.008
uc+0.007
bek+0.007
har+0.007
Neg logit contributions
 She-0.007
 means-0.005
 ornament-0.005
 Everyday-0.005
otion-0.005
Token the
Token ID262
Feature activation+2.87e-01

Pos logit contributions
 She+0.018
 means+0.012
 ornament+0.011
 Everyday+0.011
otion+0.011
Neg logit contributions
 lifetime-0.029
 injuries-0.028
uc-0.027
bek-0.026
har-0.026
Token box
Token ID3091
Feature activation+1.56e+00

Pos logit contributions
 She+0.010
 ornament+0.007
 means+0.006
 Everyday+0.006
 It+0.006
Neg logit contributions
 lifetime-0.016
uc-0.016
uddled-0.015
 injuries-0.015
ounced-0.015
Token!"
Token ID2474
Feature activation+8.67e-01

Pos logit contributions
 She+0.070
 means+0.053
 ornament+0.052
 Everyday+0.052
otion+0.049
Neg logit contributions
 lifetime-0.067
 injuries-0.063
uc-0.060
bek-0.059
har-0.058
Token
Token ID198
Feature activation+2.47e-01

Pos logit contributions
 She+0.029
 ornament+0.027
 means+0.025
igator+0.024
 Everyday+0.023
Neg logit contributions
 lifetime-0.039
 injuries-0.039
uddled-0.036
uc-0.036
har-0.035
Token
Token ID198
Feature activation+2.31e-01

Pos logit contributions
 She+0.008
 means+0.006
 ornament+0.005
 Everyday+0.005
otion+0.005
Neg logit contributions
 lifetime-0.014
 injuries-0.013
uc-0.013
bek-0.012
har-0.012
Token"
Token ID1
Feature activation-5.16e-01

Pos logit contributions
 She+0.011
 means+0.009
 ornament+0.009
 Everyday+0.009
otion+0.008
Neg logit contributions
 lifetime-0.009
 injuries-0.009
uc-0.008
har-0.008
bek-0.008
TokenNo
Token ID2949
Feature activation+1.60e+00

Pos logit contributions
 lifetime+0.021
 injuries+0.020
uc+0.019
bek+0.019
har+0.018
Neg logit contributions
 She-0.025
 means-0.019
 ornament-0.019
 Everyday-0.019
otion-0.017
Token couldn
Token ID3521
Feature activation+3.61e-01

Pos logit contributions
 lifetime+0.025
 injuries+0.023
uc+0.023
bek+0.022
har+0.021
Neg logit contributions
 She-0.022
 ornament-0.018
otion-0.016
 Everyday-0.016
 means-0.016
Token't
Token ID470
Feature activation+1.90e+00

Pos logit contributions
 She+0.014
 wonder+0.012
 idea+0.012
 ornament+0.011
 Everyday+0.010
Neg logit contributions
 injuries-0.013
 lifetime-0.012
uddled-0.012
uc-0.012
ar-0.012
Token have
Token ID423
Feature activation-9.78e-01

Pos logit contributions
 She+0.086
 means+0.070
 ornament+0.065
 Everyday+0.063
otion+0.061
Neg logit contributions
 injuries-0.079
 lifetime-0.078
har-0.069
uc-0.069
oy-0.068
Token finished
Token ID5201
Feature activation-1.45e+00

Pos logit contributions
 lifetime+0.054
 injuries+0.052
uc+0.050
bek+0.050
har+0.048
Neg logit contributions
 She-0.034
 means-0.023
 ornament-0.022
 Everyday-0.022
-0.019
Token this
Token ID428
Feature activation+7.49e-01

Pos logit contributions
 injuries+0.062
 lifetime+0.062
uc+0.058
har+0.054
ounced+0.052
Neg logit contributions
 She-0.065
 Everyday-0.049
 means-0.047
 ornament-0.047
otion-0.046
Token without
Token ID1231
Feature activation+9.26e-01

Pos logit contributions
 She+0.034
 means+0.024
 Everyday+0.023
otion+0.023
 It+0.022
Neg logit contributions
 lifetime-0.034
 injuries-0.032
uc-0.030
har-0.029
bek-0.028
Token you
Token ID345
Feature activation+2.14e-01

Pos logit contributions
 She+0.038
 means+0.028
 ornament+0.027
 Everyday+0.027
otion+0.025
Neg logit contributions
 lifetime-0.045
 injuries-0.042
uc-0.041
bek-0.040
uddled-0.039
Token."
Token ID526
Feature activation+1.63e-01

Pos logit contributions
 She+0.009
 means+0.006
 ornament+0.006
 Everyday+0.006
otion+0.005
Neg logit contributions
 lifetime-0.011
 injuries-0.010
uc-0.010
bek-0.010
har-0.009
Token Tim
Token ID5045
Feature activation+2.56e-01

Pos logit contributions
 She+0.005
 means+0.003
 Everyday+0.003
 It+0.003
 ornament+0.003
Neg logit contributions
 lifetime-0.010
 injuries-0.010
uc-0.009
bek-0.009
ounced-0.009
Tokenmy
Token ID1820
Feature activation+1.17e+00

Pos logit contributions
 She+0.009
 means+0.006
 ornament+0.006
 Everyday+0.006
otion+0.005
Neg logit contributions
 lifetime-0.014
 injuries-0.013
uc-0.013
bek-0.013
har-0.012
Token smiled
Token ID13541
Feature activation-7.37e-01

Pos logit contributions
 She+0.053
 means+0.041
 ornament+0.040
 Everyday+0.039
otion+0.039
Neg logit contributions
 lifetime-0.050
 injuries-0.047
uc-0.046
bek-0.045
uddled-0.044
Token her
Token ID607
Feature activation+7.19e-01

Pos logit contributions
 injuries+0.011
 lifetime+0.010
uc+0.010
har+0.009
bek+0.009
Neg logit contributions
 She-0.007
 means-0.005
 ornament-0.005
 Everyday-0.005
otion-0.004
Token.
Token ID13
Feature activation+1.81e-01

Pos logit contributions
 She+0.030
 means+0.023
 Everyday+0.022
otion+0.021
 ornament+0.021
Neg logit contributions
 lifetime-0.033
 injuries-0.032
uc-0.030
har-0.029
bek-0.029
Token
Token ID198
Feature activation+2.02e-02

Pos logit contributions
 ornament+0.005
 She+0.005
igator+0.005
 means+0.004
otion+0.004
Neg logit contributions
 lifetime-0.009
 injuries-0.008
uc-0.008
uddled-0.008
ounced-0.008
Token
Token ID198
Feature activation+9.62e-03

Pos logit contributions
 She+0.001
 ornament+0.001
 means+0.001
 Everyday+0.000
otion+0.000
Neg logit contributions
 lifetime-0.001
 injuries-0.001
uc-0.001
har-0.001
bek-0.001
Token"
Token ID1
Feature activation-6.98e-01

Pos logit contributions
 She+0.000
 ornament+0.000
 means+0.000
 Everyday+0.000
otion+0.000
Neg logit contributions
 lifetime-0.000
 injuries-0.000
bek-0.000
har-0.000
uc-0.000
TokenWow
Token ID22017
Feature activation+1.75e+00

Pos logit contributions
 lifetime+0.029
 injuries+0.028
bek+0.026
uc+0.026
har+0.025
Neg logit contributions
 She-0.033
 means-0.025
 Everyday-0.025
 ornament-0.025
otion-0.023
Token,
Token ID11
Feature activation+5.22e-01

Pos logit contributions
 She+0.053
 means+0.035
 Everyday+0.033
 ornament+0.033
otion+0.028
Neg logit contributions
 lifetime-0.102
 injuries-0.099
uc-0.095
bek-0.093
ounced-0.091
Token Lily
Token ID20037
Feature activation+7.52e-01

Pos logit contributions
 She+0.020
 Everyday+0.016
 means+0.015
 ornament+0.014
otion+0.013
Neg logit contributions
 injuries-0.024
 lifetime-0.023
bek-0.021
uc-0.021
oy-0.020
Token,
Token ID11
Feature activation+5.54e-01

Pos logit contributions
 She+0.024
 means+0.016
 ornament+0.015
 Everyday+0.015
otion+0.013
Neg logit contributions
 lifetime-0.043
 injuries-0.042
uc-0.040
bek-0.039
har-0.039
Token you
Token ID345
Feature activation+3.34e-01

Pos logit contributions
 She+0.023
 Everyday+0.019
 means+0.018
 ornament+0.017
 Every+0.015
Neg logit contributions
 injuries-0.024
 lifetime-0.022
uc-0.021
bek-0.020
ounced-0.020
Token were
Token ID547
Feature activation+2.68e-01

Pos logit contributions
 She+0.010
 means+0.006
 ornament+0.006
 Everyday+0.006
otion+0.005
Neg logit contributions
 lifetime-0.020
 injuries-0.019
uc-0.019
bek-0.018
har-0.018
Token here
Token ID994
Feature activation+3.75e-01

Pos logit contributions
 She+0.032
 Everyday+0.024
 means+0.024
 ornament+0.023
otion+0.022
Neg logit contributions
 lifetime-0.035
 injuries-0.034
uc-0.032
ounced-0.030
har-0.030
Token.
Token ID13
Feature activation+1.98e-01

Pos logit contributions
 She+0.014
 means+0.010
 ornament+0.010
 Everyday+0.010
otion+0.009
Neg logit contributions
 lifetime-0.019
 injuries-0.018
uc-0.017
bek-0.016
har-0.016
Token Maybe
Token ID6674
Feature activation+1.48e+00

Pos logit contributions
 She+0.007
 ornament+0.006
 means+0.005
otion+0.005
 Everyday+0.005
Neg logit contributions
 lifetime-0.009
 injuries-0.009
bek-0.008
har-0.008
uc-0.008
Token it
Token ID340
Feature activation+3.82e-01

Pos logit contributions
 She+0.054
 means+0.039
 ornament+0.037
 Everyday+0.036
 she+0.033
Neg logit contributions
 lifetime-0.078
 injuries-0.075
uc-0.073
bek-0.072
har-0.070
Token likes
Token ID7832
Feature activation+6.26e-01

Pos logit contributions
 She+0.016
 ornament+0.012
 Everyday+0.012
otion+0.012
 It+0.010
Neg logit contributions
 lifetime-0.017
 injuries-0.016
ounced-0.015
uddled-0.015
uc-0.015
Token to
Token ID284
Feature activation-2.05e-01

Pos logit contributions
 She+0.026
 means+0.019
 ornament+0.019
 Everyday+0.018
otion+0.017
Neg logit contributions
 lifetime-0.030
 injuries-0.029
uc-0.027
bek-0.027
har-0.026
Token splash
Token ID22870
Feature activation-5.94e-02

Pos logit contributions
 injuries+0.009
 lifetime+0.009
uc+0.008
har+0.008
oy+0.008
Neg logit contributions
 She-0.009
 means-0.007
 ornament-0.007
 Everyday-0.007
otion-0.007
Token and
Token ID290
Feature activation+3.53e-01

Pos logit contributions
 lifetime+0.003
 injuries+0.003
uc+0.003
bek+0.003
har+0.003
Neg logit contributions
 She-0.002
 ornament-0.001
 Everyday-0.001
 means-0.001
otion-0.001
Token play
Token ID711
Feature activation+4.55e-02

Pos logit contributions
 She+0.016
 means+0.012
 Everyday+0.011
 ornament+0.011
otion+0.011
Neg logit contributions
 lifetime-0.016
 injuries-0.015
uc-0.014
har-0.014
bek-0.013
Token with
Token ID351
Feature activation+1.33e+00

Pos logit contributions
 She+0.002
 means+0.001
 Everyday+0.001
 ornament+0.001
otion+0.001
Neg logit contributions
 lifetime-0.002
 injuries-0.002
uc-0.002
bek-0.002
har-0.002
Token its
Token ID663
Feature activation-1.38e+00

Pos logit contributions
 She+0.053
 means+0.039
 ornament+0.038
 Everyday+0.037
otion+0.034
Neg logit contributions
 lifetime-0.066
 injuries-0.063
uc-0.060
bek-0.059
har-0.058
TokenL
Token ID43
Feature activation-1.87e-01

Pos logit contributions
 lifetime+0.003
 injuries+0.003
bek+0.003
uc+0.002
har+0.002
Neg logit contributions
 She-0.004
 means-0.003
 ornament-0.003
 Everyday-0.003
otion-0.003
Tokenily
Token ID813
Feature activation+7.39e-02

Pos logit contributions
 lifetime+0.010
 injuries+0.009
uc+0.009
bek+0.009
uddled+0.009
Neg logit contributions
 She-0.007
 means-0.005
 Everyday-0.005
 ornament-0.004
-0.004
Token picked
Token ID6497
Feature activation-1.19e+00

Pos logit contributions
 She+0.003
 ornament+0.003
otion+0.003
 means+0.002
 Everyday+0.002
Neg logit contributions
 lifetime-0.003
 injuries-0.003
uddled-0.003
uc-0.003
ounced-0.003
Token one
Token ID530
Feature activation+5.46e-01

Pos logit contributions
uc+0.047
 injuries+0.043
 lifetime+0.043
eds+0.042
ishing+0.041
Neg logit contributions
 She-0.045
otion-0.036
 Everyday-0.035
case-0.034
 seem-0.034
Token of
Token ID286
Feature activation+1.33e+00

Pos logit contributions
 She+0.021
 means+0.015
 Everyday+0.015
otion+0.014
 seem+0.013
Neg logit contributions
 lifetime-0.026
 injuries-0.025
uc-0.024
bek-0.023
har-0.023
Token the
Token ID262
Feature activation+4.78e-02

Pos logit contributions
 She+0.041
 means+0.030
otion+0.026
 Everyday+0.026
 ornament+0.026
Neg logit contributions
 injuries-0.071
 lifetime-0.070
uc-0.068
bek-0.065
har-0.063
Token flowers
Token ID12734
Feature activation+2.89e-01

Pos logit contributions
 She+0.002
 means+0.001
 Everyday+0.001
otion+0.001
 ornament+0.001
Neg logit contributions
 lifetime-0.003
 injuries-0.003
uc-0.002
bek-0.002
har-0.002
Token and
Token ID290
Feature activation+1.51e-01

Pos logit contributions
 She+0.011
 means+0.010
 Everyday+0.009
otion+0.009
 asks+0.009
Neg logit contributions
 lifetime-0.012
 injuries-0.011
uc-0.011
uddled-0.010
ppers-0.010
Token gave
Token ID2921
Feature activation+6.02e-01

Pos logit contributions
 She+0.007
otion+0.005
 Everyday+0.005
 means+0.005
 ornament+0.005
Neg logit contributions
 lifetime-0.007
 injuries-0.006
uc-0.006
bek-0.006
ounced-0.006
Token it
Token ID340
Feature activation+3.00e-01

Pos logit contributions
 She+0.021
 Everyday+0.015
 means+0.015
otion+0.014
 ornament+0.014
Neg logit contributions
 lifetime-0.030
 injuries-0.030
uc-0.029
bek-0.027
ounced-0.027
Token to
Token ID284
Feature activation-1.63e-01

Pos logit contributions
 She+0.012
 means+0.008
 ornament+0.008
 Everyday+0.008
 It+0.007
Neg logit contributions
 lifetime-0.015
 injuries-0.015
bek-0.014
uc-0.014
har-0.014
Token little
Token ID1310
Feature activation+1.63e+00

Pos logit contributions
 injuries+0.007
har+0.007
 lifetime+0.007
ounced+0.007
uc+0.006
Neg logit contributions
 She-0.007
 ornament-0.006
 means-0.006
 Everyday-0.005
otion-0.005
Token to
Token ID284
Feature activation-3.83e-01

Pos logit contributions
 She+0.065
 means+0.047
 ornament+0.046
 Everyday+0.046
otion+0.043
Neg logit contributions
 lifetime-0.080
 injuries-0.078
uc-0.072
bek-0.070
uddled-0.068
Token understand
Token ID1833
Feature activation-1.54e-01

Pos logit contributions
 injuries+0.017
 lifetime+0.015
har+0.014
oy+0.014
edy+0.013
Neg logit contributions
 She-0.016
 ornament-0.013
 means-0.013
otion-0.013
 Everyday-0.012
Token,
Token ID11
Feature activation+2.22e-01

Pos logit contributions
 lifetime+0.006
 injuries+0.006
uc+0.005
bek+0.005
har+0.005
Neg logit contributions
 She-0.007
 means-0.006
 ornament-0.006
 Everyday-0.006
otion-0.005
Token so
Token ID523
Feature activation-2.11e-01

Pos logit contributions
 She+0.009
 ornament+0.007
 means+0.007
otion+0.007
 Everyday+0.007
Neg logit contributions
 lifetime-0.010
 injuries-0.010
bek-0.008
har-0.008
uc-0.008
Token he
Token ID339
Feature activation-4.99e-01

Pos logit contributions
 lifetime+0.011
 injuries+0.011
uc+0.010
bek+0.010
har+0.010
Neg logit contributions
 She-0.007
 means-0.006
 ornament-0.006
 Everyday-0.005
otion-0.005
Token just
Token ID655
Feature activation-1.45e+00

Pos logit contributions
 lifetime+0.023
 injuries+0.022
uc+0.021
bek+0.021
har+0.020
Neg logit contributions
 She-0.022
 means-0.016
 Everyday-0.015
 ornament-0.015
-0.014
Token listened
Token ID16399
Feature activation-1.49e+00

Pos logit contributions
 lifetime+0.062
 injuries+0.058
uc+0.056
bek+0.054
har+0.053
Neg logit contributions
 She-0.068
 means-0.052
 ornament-0.051
 Everyday-0.050
otion-0.047
Token to
Token ID284
Feature activation-3.72e-01

Pos logit contributions
 lifetime+0.069
 injuries+0.065
uc+0.062
bek+0.061
har+0.059
Neg logit contributions
 She-0.063
 means-0.048
 ornament-0.047
 Everyday-0.046
otion-0.044
Token M
Token ID337
Feature activation-1.88e-01

Pos logit contributions
 lifetime+0.018
 injuries+0.018
uc+0.016
bek+0.016
har+0.016
Neg logit contributions
 She-0.013
 means-0.010
 ornament-0.010
 Everyday-0.009
otion-0.009
Tokenummy
Token ID13513
Feature activation+1.76e+00

Pos logit contributions
 lifetime+0.008
 injuries+0.008
uc+0.008
uddled+0.007
bek+0.007
Neg logit contributions
 She-0.008
 means-0.007
 ornament-0.006
 Everyday-0.006
 It-0.006
Token see
Token ID766
Feature activation-7.09e-01

Pos logit contributions
 She+0.008
otion+0.007
 ornament+0.007
 means+0.007
 avocado+0.006
Neg logit contributions
 lifetime-0.007
 injuries-0.007
edy-0.007
oy-0.007
bek-0.007
Token a
Token ID257
Feature activation-5.09e-01

Pos logit contributions
 lifetime+0.035
 injuries+0.034
uc+0.032
bek+0.031
uddled+0.031
Neg logit contributions
 She-0.027
 means-0.020
 ornament-0.019
 Everyday-0.019
otion-0.018
Token sign
Token ID1051
Feature activation-2.81e-01

Pos logit contributions
 lifetime+0.027
 injuries+0.026
uc+0.025
bek+0.024
har+0.024
Neg logit contributions
 She-0.018
 means-0.013
 ornament-0.012
 Everyday-0.012
otion-0.011
Token that
Token ID326
Feature activation-2.96e-01

Pos logit contributions
 lifetime+0.013
 injuries+0.012
uc+0.012
bek+0.011
uddled+0.011
Neg logit contributions
 She-0.011
 ornament-0.008
 Everyday-0.008
 means-0.008
otion-0.008
Token says
Token ID1139
Feature activation+5.13e-01

Pos logit contributions
 lifetime+0.014
uc+0.013
 injuries+0.012
ounced+0.012
bek+0.012
Neg logit contributions
 She-0.011
 ornament-0.009
 Everyday-0.008
otion-0.008
igator-0.007
Token "
Token ID366
Feature activation-3.80e-01

Pos logit contributions
 She+0.017
 means+0.011
 ornament+0.010
 Everyday+0.010
otion+0.010
Neg logit contributions
 lifetime-0.029
 injuries-0.028
uc-0.027
bek-0.027
har-0.026
Tokentem
Token ID11498
Feature activation-8.53e-01

Pos logit contributions
 lifetime+0.016
 injuries+0.015
uc+0.014
bek+0.014
har+0.013
Neg logit contributions
 She-0.018
 means-0.014
 ornament-0.014
 Everyday-0.014
otion-0.013
Tokenple
Token ID1154
Feature activation-5.77e-01

Pos logit contributions
 lifetime+0.037
 injuries+0.034
uc+0.032
bek+0.032
uddled+0.031
Neg logit contributions
 She-0.040
 means-0.031
 Everyday-0.030
 ornament-0.030
otion-0.028
Token".
Token ID1911
Feature activation+2.01e-01

Pos logit contributions
 lifetime+0.026
uc+0.024
 injuries+0.023
bek+0.023
har+0.022
Neg logit contributions
 She-0.025
 ornament-0.018
 Everyday-0.018
 means-0.018
otion-0.017
Token They
Token ID1119
Feature activation+7.03e-02

Pos logit contributions
 ornament+0.006
 She+0.006
igator+0.005
otion+0.005
 means+0.005
Neg logit contributions
 injuries-0.009
uc-0.009
 lifetime-0.009
uddled-0.009
aj-0.008
Token ask
Token ID1265
Feature activation+3.99e-01

Pos logit contributions
 She+0.003
 ornament+0.002
otion+0.002
 means+0.002
 avocado+0.002
Neg logit contributions
 lifetime-0.003
 injuries-0.003
bek-0.003
oy-0.003
har-0.002
Token sure
Token ID1654
Feature activation-8.22e-01

Pos logit contributions
 lifetime+0.042
 injuries+0.037
uc+0.036
har+0.035
bek+0.035
Neg logit contributions
 She-0.053
 means-0.043
 ornament-0.042
 Everyday-0.041
otion-0.038
Token we
Token ID356
Feature activation-3.00e-01

Pos logit contributions
 lifetime+0.040
 injuries+0.039
uc+0.037
bek+0.036
har+0.035
Neg logit contributions
 She-0.034
 means-0.025
 ornament-0.024
 Everyday-0.023
otion-0.021
Token will
Token ID481
Feature activation+7.44e-02

Pos logit contributions
 lifetime+0.014
 injuries+0.013
uc+0.013
bek+0.012
ounced+0.012
Neg logit contributions
 She-0.013
 ornament-0.009
 means-0.009
 Everyday-0.009
otion-0.008
Token continue
Token ID2555
Feature activation-2.11e+00

Pos logit contributions
 She+0.003
 means+0.002
 ornament+0.002
 Everyday+0.002
otion+0.002
Neg logit contributions
 lifetime-0.004
 injuries-0.003
uc-0.003
har-0.003
bek-0.003
Token to
Token ID284
Feature activation-4.72e-01

Pos logit contributions
 lifetime+0.100
 injuries+0.096
uc+0.092
bek+0.092
ounced+0.089
Neg logit contributions
 She-0.087
 means-0.066
 ornament-0.062
 Everyday-0.061
-0.058
Token hold
Token ID1745
Feature activation-1.98e+00

Pos logit contributions
 lifetime+0.020
 injuries+0.020
har+0.017
uc+0.016
bek+0.016
Neg logit contributions
 She-0.022
 means-0.017
otion-0.017
 Everyday-0.016
 ornament-0.016
Token hands
Token ID2832
Feature activation-1.33e+00

Pos logit contributions
 lifetime+0.098
 injuries+0.092
bek+0.088
uc+0.087
ounced+0.084
Neg logit contributions
 She-0.081
 means-0.061
 ornament-0.058
 Everyday-0.057
-0.055
Token for
Token ID329
Feature activation+6.60e-01

Pos logit contributions
 lifetime+0.066
 injuries+0.062
uc+0.060
bek+0.057
har+0.057
Neg logit contributions
 She-0.052
 means-0.038
 ornament-0.037
 Everyday-0.037
otion-0.036
Token all
Token ID477
Feature activation+2.56e-01

Pos logit contributions
 She+0.024
 means+0.018
 ornament+0.017
 Everyday+0.016
+0.016
Neg logit contributions
 lifetime-0.034
uc-0.032
bek-0.032
ounced-0.031
uddled-0.031
Token the
Token ID262
Feature activation+5.05e-01

Pos logit contributions
 She+0.010
otion+0.007
 ornament+0.007
 means+0.007
 Everyday+0.007
Neg logit contributions
 lifetime-0.013
 injuries-0.012
uc-0.012
bek-0.011
eds-0.011
Token seasons
Token ID7028
Feature activation+2.18e-01

Pos logit contributions
 She+0.022
 means+0.017
 Everyday+0.016
 ornament+0.016
otion+0.015
Neg logit contributions
 lifetime-0.023
 injuries-0.022
uc-0.021
bek-0.020
har-0.020
Token there
Token ID612
Feature activation+6.44e-01

Pos logit contributions
 She+0.024
 Everyday+0.019
 means+0.019
 ornament+0.018
otion+0.016
Neg logit contributions
 lifetime-0.023
 injuries-0.022
uc-0.022
ounced-0.022
bek-0.021
Token was
Token ID373
Feature activation+3.40e-01

Pos logit contributions
 She+0.027
 ornament+0.020
 means+0.020
 Everyday+0.019
otion+0.018
Neg logit contributions
 lifetime-0.029
 injuries-0.027
uc-0.027
uddled-0.026
bek-0.026
Token a
Token ID257
Feature activation-1.72e-01

Pos logit contributions
 She+0.013
 means+0.011
otion+0.010
 Everyday+0.010
 ornament+0.010
Neg logit contributions
 lifetime-0.014
 injuries-0.014
uc-0.013
uddled-0.013
oy-0.013
Token little
Token ID1310
Feature activation+2.04e+00

Pos logit contributions
 lifetime+0.010
 injuries+0.009
uc+0.009
bek+0.009
ounced+0.009
Neg logit contributions
 She-0.006
 ornament-0.004
 means-0.004
 Everyday-0.003
 she-0.003
Token boy
Token ID2933
Feature activation+1.78e+00

Pos logit contributions
 She+0.078
 Everyday+0.062
 means+0.062
otion+0.058
 seem+0.055
Neg logit contributions
 lifetime-0.095
uc-0.090
 injuries-0.089
bek-0.086
har-0.082
Token named
Token ID3706
Feature activation-3.44e+00

Pos logit contributions
 She+0.088
 ornament+0.066
 means+0.066
 Everyday+0.062
 she+0.061
Neg logit contributions
 lifetime-0.078
 injuries-0.072
bek-0.067
uc-0.066
uddled-0.066
Token Tim
Token ID5045
Feature activation+3.98e-01

Pos logit contributions
 lifetime+0.187
 injuries+0.177
uc+0.170
bek+0.169
har+0.165
Neg logit contributions
 She-0.131
 means-0.089
 Everyday-0.087
 ornament-0.085
 It-0.080
Token.
Token ID13
Feature activation+1.80e-01

Pos logit contributions
 She+0.014
 means+0.010
 ornament+0.009
 Everyday+0.009
 It+0.008
Neg logit contributions
 lifetime-0.023
 injuries-0.021
uc-0.021
bek-0.020
har-0.020
Token Tim
Token ID5045
Feature activation+4.26e-01

Pos logit contributions
 She+0.007
 means+0.006
 ornament+0.006
otion+0.005
 Everyday+0.005
Neg logit contributions
 lifetime-0.008
 injuries-0.008
uc-0.007
bek-0.007
har-0.007
Token was
Token ID373
Feature activation+3.11e-01

Pos logit contributions
 She+0.021
 means+0.016
 Everyday+0.016
 ornament+0.016
otion+0.016
Neg logit contributions
 lifetime-0.016
 injuries-0.015
uc-0.015
bek-0.014
har-0.014
Token a
Token ID257
Feature activation-2.12e-01

Pos logit contributions
 She+0.012
 means+0.011
 Everyday+0.010
otion+0.009
 ornament+0.009
Neg logit contributions
 injuries-0.013
 lifetime-0.013
uc-0.012
har-0.011
ounced-0.011
Token They
Token ID1119
Feature activation-1.01e-01

Pos logit contributions
 She+0.000
 ornament+0.000
 means+0.000
 Everyday+0.000
otion+0.000
Neg logit contributions
 lifetime-0.000
 injuries-0.000
uc-0.000
bek-0.000
har-0.000
Token knew
Token ID2993
Feature activation-1.75e-01

Pos logit contributions
 lifetime+0.005
 injuries+0.005
uc+0.004
bek+0.004
har+0.004
Neg logit contributions
 She-0.004
 means-0.003
 ornament-0.003
 Everyday-0.003
otion-0.003
Token they
Token ID484
Feature activation-4.78e-01

Pos logit contributions
 lifetime+0.009
 injuries+0.009
uc+0.008
bek+0.008
har+0.008
Neg logit contributions
 She-0.007
 means-0.005
 ornament-0.005
 Everyday-0.005
otion-0.004
Token wouldn
Token ID3636
Feature activation+1.95e-01

Pos logit contributions
 lifetime+0.023
 injuries+0.022
uc+0.021
bek+0.020
har+0.020
Neg logit contributions
 She-0.020
 means-0.015
 Everyday-0.014
 ornament-0.014
-0.014
Token't
Token ID470
Feature activation+1.95e+00

Pos logit contributions
 She+0.006
 ornament+0.005
 means+0.004
igator+0.004
 Everyday+0.004
Neg logit contributions
 injuries-0.010
 lifetime-0.010
har-0.009
uddled-0.009
uc-0.009
Token forget
Token ID6044
Feature activation-3.03e+00

Pos logit contributions
 She+0.082
 means+0.064
 ornament+0.063
 Everyday+0.060
otion+0.059
Neg logit contributions
 lifetime-0.088
 injuries-0.086
har-0.079
uc-0.075
bek-0.073
Token this
Token ID428
Feature activation+9.04e-01

Pos logit contributions
 lifetime+0.168
 injuries+0.160
uc+0.156
bek+0.154
har+0.150
Neg logit contributions
 She-0.106
 means-0.075
 ornament-0.068
 Everyday-0.068
-0.063
Token special
Token ID2041
Feature activation-1.85e-02

Pos logit contributions
 She+0.031
 means+0.023
 ornament+0.021
 Everyday+0.021
+0.020
Neg logit contributions
 lifetime-0.048
 injuries-0.047
uc-0.047
bek-0.045
ounced-0.045
Token day
Token ID1110
Feature activation-1.33e+00

Pos logit contributions
 lifetime+0.001
 injuries+0.001
uc+0.001
bek+0.001
ounced+0.001
Neg logit contributions
 She-0.001
 means-0.001
 ornament-0.000
 Everyday-0.000
 idea-0.000
Token.
Token ID13
Feature activation-1.95e-02

Pos logit contributions
 lifetime+0.073
 injuries+0.070
uc+0.069
bek+0.068
ounced+0.067
Neg logit contributions
 She-0.047
 means-0.032
 she-0.030
-0.029
 Everyday-0.029
Token<|endoftext|>
Token ID50256
Feature activation-1.47e+00

Pos logit contributions
 lifetime+0.001
 injuries+0.001
uc+0.001
bek+0.001
har+0.001
Neg logit contributions
 She-0.001
 means-0.000
 ornament-0.000
 Everyday-0.000
otion-0.000
Token there
Token ID612
Feature activation+4.45e-01

Pos logit contributions
 She+0.021
 means+0.016
 Everyday+0.016
 ornament+0.016
 seem+0.014
Neg logit contributions
 lifetime-0.021
 injuries-0.020
uc-0.020
ounced-0.020
bek-0.020
Token was
Token ID373
Feature activation+2.13e-01

Pos logit contributions
 She+0.019
 ornament+0.015
 means+0.014
 Everyday+0.014
otion+0.013
Neg logit contributions
 lifetime-0.019
 injuries-0.018
uc-0.017
uddled-0.017
bek-0.016
Token a
Token ID257
Feature activation-1.88e-01

Pos logit contributions
 She+0.008
 means+0.007
otion+0.006
 Everyday+0.006
 ornament+0.006
Neg logit contributions
 lifetime-0.009
 injuries-0.009
uc-0.008
uddled-0.008
oy-0.008
Token little
Token ID1310
Feature activation+1.93e+00

Pos logit contributions
 lifetime+0.011
 injuries+0.010
uc+0.010
bek+0.010
ounced+0.010
Neg logit contributions
 She-0.006
 ornament-0.004
 means-0.004
 Everyday-0.004
 she-0.003
Token girl
Token ID2576
Feature activation+2.02e+00

Pos logit contributions
 She+0.074
 Everyday+0.058
 means+0.057
otion+0.053
 seem+0.050
Neg logit contributions
 lifetime-0.092
uc-0.086
 injuries-0.085
bek-0.083
har-0.080
Token named
Token ID3706
Feature activation-3.58e+00

Pos logit contributions
 She+0.104
 ornament+0.079
 means+0.079
 Everyday+0.075
 she+0.074
Neg logit contributions
 lifetime-0.083
 injuries-0.076
uc-0.072
bek-0.071
uddled-0.069
Token Lily
Token ID20037
Feature activation+9.99e-01

Pos logit contributions
 lifetime+0.208
 injuries+0.199
uc+0.192
bek+0.190
har+0.185
Neg logit contributions
 She-0.115
 means-0.076
 ornament-0.073
 Everyday-0.072
otion-0.063
Token.
Token ID13
Feature activation+1.40e-01

Pos logit contributions
 She+0.040
 means+0.029
 ornament+0.028
 Everyday+0.027
+0.024
Neg logit contributions
 lifetime-0.051
 injuries-0.049
uc-0.047
bek-0.046
har-0.045
Token Lily
Token ID20037
Feature activation+9.88e-01

Pos logit contributions
 means+0.001
 ornament+0.001
 She+0.001
igator+0.000
otion+0.000
Neg logit contributions
 lifetime-0.009
 injuries-0.009
uc-0.009
uddled-0.009
aj-0.009
Token loved
Token ID6151
Feature activation-5.04e-01

Pos logit contributions
 She+0.049
 means+0.039
 Everyday+0.038
 ornament+0.038
otion+0.037
Neg logit contributions
 lifetime-0.037
 injuries-0.035
uc-0.033
bek-0.032
uddled-0.032
Token going
Token ID1016
Feature activation+1.17e+00

Pos logit contributions
 lifetime+0.021
 injuries+0.020
uc+0.020
bek+0.018
uddled+0.018
Neg logit contributions
 She-0.023
 means-0.018
 Everyday-0.017
otion-0.017
 ornament-0.017
Token there
Token ID612
Feature activation+4.51e-01

Pos logit contributions
 She+0.020
 means+0.016
 Everyday+0.016
 ornament+0.015
otion+0.014
Neg logit contributions
 lifetime-0.022
 injuries-0.021
uc-0.021
bek-0.020
ounced-0.020
Token was
Token ID373
Feature activation+3.02e-01

Pos logit contributions
 She+0.019
 ornament+0.015
 means+0.015
 Everyday+0.014
otion+0.014
Neg logit contributions
 lifetime-0.019
 injuries-0.018
uc-0.017
uddled-0.017
bek-0.016
Token a
Token ID257
Feature activation-1.64e-01

Pos logit contributions
 She+0.011
 means+0.010
otion+0.009
 ornament+0.008
 Everyday+0.008
Neg logit contributions
 lifetime-0.013
 injuries-0.013
uc-0.012
uddled-0.012
oy-0.012
Token little
Token ID1310
Feature activation+1.94e+00

Pos logit contributions
 lifetime+0.009
 injuries+0.009
uc+0.009
bek+0.009
ounced+0.008
Neg logit contributions
 She-0.005
 ornament-0.004
 means-0.003
 Everyday-0.003
 she-0.003
Token girl
Token ID2576
Feature activation+2.02e+00

Pos logit contributions
 She+0.074
 Everyday+0.058
 means+0.058
otion+0.055
 seem+0.051
Neg logit contributions
 lifetime-0.091
uc-0.086
 injuries-0.084
bek-0.082
har-0.079
Token named
Token ID3706
Feature activation-3.47e+00

Pos logit contributions
 She+0.104
 ornament+0.079
 means+0.078
 Everyday+0.075
 she+0.074
Neg logit contributions
 lifetime-0.084
 injuries-0.078
uc-0.073
bek-0.072
uddled-0.070
Token Lily
Token ID20037
Feature activation+1.01e+00

Pos logit contributions
 lifetime+0.196
 injuries+0.188
uc+0.181
bek+0.178
har+0.174
Neg logit contributions
 She-0.113
 means-0.078
 ornament-0.074
 Everyday-0.073
otion-0.064
Token.
Token ID13
Feature activation+2.32e-01

Pos logit contributions
 She+0.039
 means+0.028
 ornament+0.027
 Everyday+0.026
+0.024
Neg logit contributions
 lifetime-0.054
 injuries-0.051
uc-0.049
bek-0.048
har-0.047
Token She
Token ID1375
Feature activation+7.81e-01

Pos logit contributions
 means+0.002
 ornament+0.002
 She+0.001
igator+0.001
otion+0.001
Neg logit contributions
 lifetime-0.015
 injuries-0.015
uc-0.014
uddled-0.014
aj-0.014
Token had
Token ID550
Feature activation-1.81e+00

Pos logit contributions
 She+0.036
 means+0.027
 ornament+0.026
 Everyday+0.025
 she+0.024
Neg logit contributions
 lifetime-0.036
 injuries-0.035
bek-0.032
uc-0.032
har-0.031
Token a
Token ID257
Feature activation-1.84e-01

Pos logit contributions
 lifetime+0.094
 injuries+0.091
uc+0.086
bek+0.082
uddled+0.082
Neg logit contributions
 She-0.061
 means-0.049
 Everyday-0.044
otion-0.043
 ornament-0.043
Token with
Token ID351
Feature activation+9.44e-01

Pos logit contributions
 She+0.029
 means+0.021
 ornament+0.020
 Everyday+0.020
otion+0.020
Neg logit contributions
 lifetime-0.032
 injuries-0.030
uc-0.028
uddled-0.028
bek-0.028
Token his
Token ID465
Feature activation-6.59e-01

Pos logit contributions
 She+0.032
 means+0.020
 Everyday+0.020
 ornament+0.020
 she+0.018
Neg logit contributions
 lifetime-0.054
 injuries-0.051
bek-0.050
uc-0.050
har-0.048
Token mom
Token ID1995
Feature activation-2.95e-01

Pos logit contributions
bek+0.034
 lifetime+0.033
uddled+0.032
uc+0.032
har+0.031
Neg logit contributions
 She-0.022
 ornament-0.016
 baby-0.015
 she-0.015
 idea-0.015
Token and
Token ID290
Feature activation-9.17e-02

Pos logit contributions
 lifetime+0.017
 injuries+0.016
uc+0.016
bek+0.015
har+0.015
Neg logit contributions
 She-0.010
 means-0.006
 ornament-0.006
 Everyday-0.006
otion-0.005
Token they
Token ID484
Feature activation-5.36e-01

Pos logit contributions
 lifetime+0.004
 injuries+0.004
uc+0.004
bek+0.004
ounced+0.003
Neg logit contributions
 She-0.004
 means-0.003
 Everyday-0.003
 ornament-0.003
otion-0.003
Token both
Token ID1111
Feature activation-2.60e+00

Pos logit contributions
 lifetime+0.026
 injuries+0.026
uc+0.024
bek+0.023
har+0.023
Neg logit contributions
 She-0.022
 Everyday-0.015
-0.015
 ornament-0.015
 means-0.015
Token had
Token ID550
Feature activation-2.05e+00

Pos logit contributions
 lifetime+0.111
 injuries+0.109
uc+0.102
bek+0.099
uddled+0.096
Neg logit contributions
 She-0.122
 means-0.092
 Everyday-0.090
 ornament-0.088
-0.086
Token lots
Token ID6041
Feature activation+5.10e-01

Pos logit contributions
 injuries+0.126
bek+0.126
 lifetime+0.124
cle+0.123
uc+0.123
Neg logit contributions
 She-0.056
-0.035
 her-0.032
 It-0.031
 Everyday-0.030
Token of
Token ID286
Feature activation+1.30e+00

Pos logit contributions
 She+0.009
 Everyday+0.003
 means+0.003
 ornament+0.002
+0.002
Neg logit contributions
 lifetime-0.037
bek-0.036
 injuries-0.036
uc-0.035
har-0.034
Token fun
Token ID1257
Feature activation-5.27e-01

Pos logit contributions
 She+0.048
 idea+0.039
 Everyday+0.037
 means+0.034
 her+0.034
Neg logit contributions
bek-0.070
 lifetime-0.065
uc-0.064
cle-0.063
har-0.063
Token!
Token ID0
Feature activation-8.17e-01

Pos logit contributions
 lifetime+0.029
bek+0.028
 injuries+0.028
har+0.027
ounced+0.027
Neg logit contributions
 She-0.017
 means-0.013
 she-0.011
 sometimes-0.011
 Everyday-0.011