TOP ACTIVATIONS
MAX = 3.244

! I 'm the best gol fer in the world !"
! I 'm the best gol fer in the world and
a strange , raspberry - col oured object hiding under a
. The strong mer maid thought about the promise
water . The mer maid smiled at Melody .
want to help . B amb i felt sad and scared
, and she wanted the mer maid to come too .
castle so much . The mer maid said , " I
Peter wasn 't the greatest sur fer , but he loved
Jack asked the mer maid why the waves were
Finally , B amb i found a kind hunter
again . He said the mer maid and a dolphin were
. She was the happiest ostr ich in the world .
dog as a yellow lab rador . Lily got excited and
demands , crab ," the mer maid said . " I
sur fer . Whenever the sur fer balanced on the board
pond ?" The gol fer replied , " I
, with all its different col ours , shapes and textures
from the trap . B amb i was very happy and
reef with many fish and cor als . Anna wanted to

MIN ACTIVATIONS
MIN = -2.647

rocket inside and climbs in too . He pret
modest and don 't eat too many pr unes . They
says : " Dr ink some water . It
" I think we ate too much l oll ipop ,"
olded him when he took too long . One
water . " Dr ink some water , Lily
make us dirty and wet too . Look , he has
" You mixed too much , my sweet ,"
. They should not go too far from their cabin .
nightmare if we stay out too long . â €
you okay ? You ate too fast . You need to
He likes hugs and kisses too . But be gentle and
need because if we take too much , somebody else can
please . I like books too . Can I see that
next door will bloom again too . <|endoftext|> Once upon a
cake . The oven was too hot and the cake burned
Sure , I like buttons too . Come in , come
will be quiet and careful too ," Tom said .
told them not to go too close to the edge or
But we should not stay too long . They have a

INTERVAL 1.771 - 3.244
CONTAINS 0.027%

. The oct opus gave El sie a band age and
alone . Cr ab saw Sara and Ben .
that day on , the ostr ich realized that being different
we are here !" Tom and Lily said . " We
said Sarah . " I wish we could find your avocado

INTERVAL 0.298 - 1.771
CONTAINS 23.547%

. But then , her mom told her a
if he could go inside . The guard
big rain came . The rain was so strong that it
a store . The store had very colorful walls and a
named Lily . She loved l oll ip ops , especially

INTERVAL -1.174 - 0.298
CONTAINS 75.767%

looking for her lost toy . Tim showed her where he
his teeth to tie the string around the tree . The
a good time ." Tom did not listen .
The bird says , " Hello , Anna and Ben
was a brave boy . He loved to play with his

INTERVAL -2.647 - -1.174
CONTAINS 0.659%

Lu cy was very excited because she 'd never seen a
Max , the dog from next door . He sees the
Mom and Dad . I don 't need a mustache to
Sara ." <|endoftext|> Anna and Ben were twins who liked to
't want to get wet too . Tim
Token!
Token ID0
Feature activation-2.01e-02

Pos logit contributions
 mom+0.021
Mom+0.020
 owner+0.018
 slowly+0.018
 moms+0.018
Neg logit contributions
pole-0.034
rily-0.032
cient-0.031
izer-0.031
iche-0.030
Token I
Token ID314
Feature activation-5.02e-01

Pos logit contributions
 mom+0.001
 slowly+0.001
Mom+0.001
 owner+0.001
 soon+0.001
Neg logit contributions
rily-0.001
pole-0.001
iche-0.001
izer-0.001
 cuc-0.001
Token'm
Token ID1101
Feature activation+4.60e-01

Pos logit contributions
 mom+0.016
Mom+0.016
 herself+0.014
 words+0.014
 moms+0.014
Neg logit contributions
iche-0.021
pole-0.021
venture-0.020
cient-0.020
izer-0.020
Token the
Token ID262
Feature activation-5.73e-03

Pos logit contributions
rily+0.025
pole+0.023
iche+0.022
izer+0.022
cient+0.022
Neg logit contributions
 mom-0.013
 slowly-0.011
 soon-0.011
 brave-0.011
 owner-0.011
Token best
Token ID1266
Feature activation+2.97e-01

Pos logit contributions
 mom+0.000
 slowly+0.000
Mom+0.000
 soon+0.000
 care+0.000
Neg logit contributions
rily-0.000
pole-0.000
iche-0.000
izer-0.000
 cuc-0.000
Token gol
Token ID43791
Feature activation+3.24e+00

Pos logit contributions
rily+0.013
pole+0.011
iche+0.011
ifying+0.011
cient+0.011
Neg logit contributions
 mom-0.011
 owner-0.010
 slowly-0.010
 soon-0.010
 care-0.010
Tokenfer
Token ID2232
Feature activation+2.05e+00

Pos logit contributions
rily+0.131
ifying+0.111
iche+0.111
ulp+0.110
ksh+0.108
Neg logit contributions
 mom-0.130
Mom-0.126
 owner-0.126
 next-0.124
 brave-0.123
Token in
Token ID287
Feature activation+9.86e-01

Pos logit contributions
rily+0.122
pole+0.115
iche+0.112
izer+0.112
cient+0.110
Neg logit contributions
 mom-0.041
 slowly-0.036
Mom-0.035
 soon-0.034
 care-0.034
Token the
Token ID262
Feature activation+6.16e-01

Pos logit contributions
rily+0.062
pole+0.058
iche+0.057
izer+0.056
cient+0.056
Neg logit contributions
 mom-0.019
 slowly-0.016
 soon-0.015
 care-0.015
Mom-0.015
Token world
Token ID995
Feature activation+8.90e-01

Pos logit contributions
rily+0.029
pole+0.027
iche+0.026
izer+0.026
cient+0.025
Neg logit contributions
 mom-0.020
 slowly-0.019
Mom-0.019
 soon-0.018
 care-0.018
Token!"
Token ID2474
Feature activation+4.20e-01

Pos logit contributions
rily+0.047
pole+0.043
iche+0.042
izer+0.042
cient+0.041
Neg logit contributions
 mom-0.024
 slowly-0.022
 soon-0.022
Mom-0.021
 care-0.021
Token!
Token ID0
Feature activation+7.97e-02

Pos logit contributions
 mom+0.006
Mom+0.005
 slowly+0.005
 owner+0.005
 soon+0.005
Neg logit contributions
pole-0.011
rily-0.011
izer-0.010
iche-0.010
cient-0.010
Token I
Token ID314
Feature activation-8.35e-02

Pos logit contributions
rily+0.003
pole+0.003
iche+0.003
izer+0.003
ifying+0.003
Neg logit contributions
 mom-0.003
 slowly-0.003
Mom-0.003
 soon-0.003
 care-0.003
Token'm
Token ID1101
Feature activation+7.00e-01

Pos logit contributions
 mom+0.002
Mom+0.002
 slowly+0.002
 owner+0.002
 people+0.002
Neg logit contributions
rily-0.004
iche-0.004
pole-0.004
ifying-0.004
venture-0.004
Token the
Token ID262
Feature activation+1.37e-01

Pos logit contributions
rily+0.040
pole+0.037
izer+0.036
iche+0.036
cient+0.035
Neg logit contributions
 mom-0.018
 brave-0.016
 slowly-0.016
 now-0.015
 soon-0.015
Token best
Token ID1266
Feature activation+2.91e-01

Pos logit contributions
rily+0.006
pole+0.006
iche+0.006
izer+0.006
 cuc+0.006
Neg logit contributions
 mom-0.004
 slowly-0.004
Mom-0.004
 soon-0.004
 care-0.004
Token gol
Token ID43791
Feature activation+3.19e+00

Pos logit contributions
rily+0.012
pole+0.011
iche+0.011
izer+0.010
cient+0.010
Neg logit contributions
 mom-0.011
 slowly-0.011
Mom-0.010
 soon-0.010
 owner-0.010
Tokenfer
Token ID2232
Feature activation+1.87e+00

Pos logit contributions
rily+0.128
ifying+0.110
iche+0.110
ulp+0.109
ksh+0.107
Neg logit contributions
 mom-0.127
 owner-0.124
Mom-0.123
 next-0.122
 brave-0.121
Token in
Token ID287
Feature activation+1.13e+00

Pos logit contributions
rily+0.110
pole+0.103
iche+0.101
izer+0.101
cient+0.098
Neg logit contributions
 mom-0.039
 slowly-0.034
Mom-0.033
 soon-0.033
 care-0.033
Token the
Token ID262
Feature activation+5.91e-01

Pos logit contributions
rily+0.069
pole+0.065
iche+0.063
izer+0.063
cient+0.062
Neg logit contributions
 mom-0.023
 slowly-0.020
 care-0.019
 soon-0.019
Mom-0.019
Token world
Token ID995
Feature activation+8.18e-01

Pos logit contributions
rily+0.028
pole+0.026
iche+0.025
izer+0.025
cient+0.025
Neg logit contributions
 mom-0.019
 slowly-0.017
Mom-0.017
 care-0.017
 soon-0.017
Token and
Token ID290
Feature activation+3.26e-01

Pos logit contributions
rily+0.043
pole+0.040
iche+0.039
izer+0.039
cient+0.038
Neg logit contributions
 mom-0.022
 slowly-0.020
 soon-0.020
Mom-0.019
 care-0.019
Token a
Token ID257
Feature activation+5.62e-02

Pos logit contributions
 care+0.000
Mom+0.000
OK+0.000
 bud+0.000
 moms+0.000
Neg logit contributions
rily-0.000
 cuc-0.000
pole-0.000
eh-0.000
cient-0.000
Token strange
Token ID6283
Feature activation-1.10e-01

Pos logit contributions
rily+0.003
 cuc+0.003
iche+0.003
pole+0.002
ifying+0.002
Neg logit contributions
 care-0.002
 soon-0.002
Mom-0.001
 slowly-0.001
 herself-0.001
Token,
Token ID11
Feature activation-1.88e-01

Pos logit contributions
Mom+0.003
 mom+0.003
 soon+0.003
 care+0.003
 slowly+0.003
Neg logit contributions
rily-0.005
pole-0.005
iche-0.005
izer-0.005
 cuc-0.005
Token raspberry
Token ID38973
Feature activation-6.18e-01

Pos logit contributions
 mom+0.005
Mom+0.005
 care+0.005
 moms+0.005
 soon+0.005
Neg logit contributions
rily-0.009
pole-0.009
iche-0.009
izer-0.008
 cuc-0.008
Token-
Token ID12
Feature activation+8.30e-01

Pos logit contributions
 care+0.020
Mom+0.019
 mom+0.019
 love+0.018
 soon+0.017
Neg logit contributions
rily-0.027
 cuc-0.027
pole-0.026
eh-0.026
iche-0.025
Tokencol
Token ID4033
Feature activation+2.98e+00

Pos logit contributions
rily+0.026
pole+0.025
eh+0.024
 cuc+0.024
iche+0.023
Neg logit contributions
 mom-0.041
 moms-0.037
 soon-0.036
 owner-0.036
 brave-0.036
Tokenoured
Token ID8167
Feature activation+4.43e-01

Pos logit contributions
rily+0.071
pole+0.069
izer+0.064
iche+0.059
venture+0.056
Neg logit contributions
 mom-0.154
 slowly-0.149
 care-0.146
Mom-0.144
 soon-0.144
Token object
Token ID2134
Feature activation+6.18e-01

Pos logit contributions
rily+0.018
 cuc+0.017
pole+0.017
iche+0.016
ifying+0.016
Neg logit contributions
 mom-0.016
Mom-0.016
 care-0.016
 slowly-0.015
 soon-0.015
Token hiding
Token ID11816
Feature activation+2.75e-01

Pos logit contributions
rily+0.033
pole+0.031
ifying+0.029
iche+0.029
izer+0.029
Neg logit contributions
 mom-0.016
Mom-0.015
 care-0.015
 slowly-0.014
 soon-0.014
Token under
Token ID739
Feature activation+8.37e-01

Pos logit contributions
rily+0.010
pole+0.008
ifying+0.008
izer+0.008
 cuc+0.007
Neg logit contributions
 mom-0.012
 care-0.012
Mom-0.012
 moms-0.011
 owner-0.011
Token a
Token ID257
Feature activation+4.07e-02

Pos logit contributions
rily+0.045
 cuc+0.041
ifying+0.040
izer+0.039
pole+0.039
Neg logit contributions
 owner-0.018
 moms-0.018
 soon-0.017
OK-0.017
Mom-0.017
Token.
Token ID13
Feature activation+3.16e-01

Pos logit contributions
 care+0.032
Mom+0.031
 mom+0.031
 moms+0.031
 words+0.030
Neg logit contributions
rily-0.075
pole-0.067
ifying-0.067
iche-0.064
 cuc-0.062
Token
Token ID220
Feature activation+4.64e-01

Pos logit contributions
rily+0.018
pole+0.017
iche+0.016
izer+0.016
cient+0.016
Neg logit contributions
 mom-0.007
 slowly-0.006
Mom-0.006
 care-0.006
 soon-0.006
Token
Token ID198
Feature activation+1.55e-01

Pos logit contributions
rily+0.027
pole+0.026
iche+0.025
izer+0.025
cient+0.025
Neg logit contributions
 mom-0.009
 slowly-0.008
Mom-0.008
 soon-0.008
 care-0.008
TokenThe
Token ID464
Feature activation-1.04e-02

Pos logit contributions
rily+0.008
pole+0.007
izer+0.007
iche+0.007
venture+0.007
Neg logit contributions
 mom-0.004
 care-0.004
 owner-0.004
 slowly-0.004
 soon-0.004
Token strong
Token ID1913
Feature activation+9.97e-01

Pos logit contributions
 mom+0.000
 slowly+0.000
 owner+0.000
 soon+0.000
Tom+0.000
Neg logit contributions
rily-0.000
 cuc-0.000
pole-0.000
iche-0.000
venture-0.000
Token mer
Token ID4017
Feature activation+2.95e+00

Pos logit contributions
rily+0.047
pole+0.042
 cuc+0.039
melon+0.039
iche+0.038
Neg logit contributions
Mom-0.030
 mom-0.029
Tom-0.029
 owner-0.029
 slowly-0.028
Tokenmaid
Token ID23151
Feature activation+1.11e+00

Pos logit contributions
rily+0.085
iche+0.080
izer+0.080
pole+0.080
ifying+0.078
Neg logit contributions
 mom-0.142
 slowly-0.135
 soon-0.131
 owner-0.129
 care-0.129
Token thought
Token ID1807
Feature activation+1.41e-01

Pos logit contributions
rily+0.061
pole+0.057
iche+0.055
izer+0.055
cient+0.055
Neg logit contributions
 mom-0.027
Mom-0.024
 slowly-0.024
 care-0.023
 owner-0.023
Token about
Token ID546
Feature activation+3.69e-01

Pos logit contributions
rily+0.007
pole+0.007
venture+0.006
iche+0.006
 cuc+0.006
Neg logit contributions
Mom-0.004
 mom-0.004
 owner-0.003
 slowly-0.003
 care-0.003
Token the
Token ID262
Feature activation-2.23e-01

Pos logit contributions
rily+0.017
 cuc+0.016
pole+0.016
 films+0.016
cient+0.015
Neg logit contributions
Mom-0.011
 owner-0.010
 mom-0.009
 slowly-0.009
 moms-0.009
Token promise
Token ID6991
Feature activation+6.79e-02

Pos logit contributions
Mom+0.008
 mom+0.008
 slowly+0.008
 soon+0.007
Tom+0.007
Neg logit contributions
rily-0.009
 cuc-0.008
pole-0.008
iche-0.008
cient-0.008
Token water
Token ID1660
Feature activation-4.30e-02

Pos logit contributions
rily+0.046
 cuc+0.045
ifying+0.042
pole+0.042
iche+0.040
Neg logit contributions
 mom-0.032
Mom-0.032
 soon-0.030
 owner-0.030
 slowly-0.029
Token.
Token ID13
Feature activation+3.76e-01

Pos logit contributions
 mom+0.001
 slowly+0.001
Mom+0.001
 soon+0.001
 care+0.001
Neg logit contributions
rily-0.003
pole-0.003
iche-0.003
izer-0.003
cient-0.002
Token
Token ID198
Feature activation+3.72e-01

Pos logit contributions
rily+0.020
pole+0.018
iche+0.018
izer+0.018
cient+0.017
Neg logit contributions
 mom-0.010
 slowly-0.009
Mom-0.009
 care-0.009
 soon-0.009
Token
Token ID198
Feature activation+6.16e-01

Pos logit contributions
rily+0.016
pole+0.016
iche+0.015
venture+0.015
izer+0.015
Neg logit contributions
 mom-0.011
 care-0.010
 bud-0.010
 slowly-0.010
 brave-0.010
TokenThe
Token ID464
Feature activation-3.72e-01

Pos logit contributions
pole+0.029
rily+0.029
venture+0.027
ifying+0.027
izer+0.027
Neg logit contributions
 mom-0.018
 care-0.016
 love-0.015
 hard-0.015
 Mom-0.014
Token mer
Token ID4017
Feature activation+2.93e+00

Pos logit contributions
Tom+0.016
Mom+0.014
 soon+0.014
Ann+0.014
Mac+0.014
Neg logit contributions
rily-0.012
 cuc-0.011
melon-0.010
 films-0.010
pole-0.010
Tokenmaid
Token ID23151
Feature activation+5.47e-01

Pos logit contributions
rily+0.087
iche+0.083
pole+0.083
eh+0.082
izer+0.080
Neg logit contributions
 mom-0.135
 slowly-0.130
 soon-0.124
 owner-0.124
 care-0.124
Token smiled
Token ID13541
Feature activation+1.09e-02

Pos logit contributions
rily+0.028
pole+0.027
 cuc+0.025
venture+0.024
cient+0.024
Neg logit contributions
 mom-0.014
Mom-0.014
 care-0.014
 hard-0.013
 owner-0.013
Token at
Token ID379
Feature activation+2.87e-02

Pos logit contributions
rily+0.001
pole+0.001
ifying+0.001
 cuc+0.001
cient+0.000
Neg logit contributions
 mom-0.000
 care-0.000
Mom-0.000
 moms-0.000
Tom-0.000
Token Melody
Token ID38906
Feature activation-5.05e-03

Pos logit contributions
rily+0.001
pole+0.001
izer+0.001
cient+0.001
ifying+0.001
Neg logit contributions
Mom-0.001
 mom-0.001
 owner-0.001
 cre-0.001
 slowly-0.001
Token.
Token ID13
Feature activation+8.17e-02

Pos logit contributions
 mom+0.000
Mom+0.000
 slowly+0.000
 care+0.000
 owner+0.000
Neg logit contributions
rily-0.000
pole-0.000
iche-0.000
izer-0.000
venture-0.000
Token want
Token ID765
Feature activation+8.34e-01

Pos logit contributions
 mom+0.004
Mom+0.004
 moms+0.003
 slowly+0.003
 owner+0.003
Neg logit contributions
pole-0.007
rily-0.007
iche-0.006
izer-0.006
 cuc-0.006
Token to
Token ID284
Feature activation+4.10e-01

Pos logit contributions
rily+0.040
pole+0.037
 cuc+0.037
izer+0.036
melon+0.036
Neg logit contributions
Mom-0.022
 mom-0.020
 slowly-0.019
 brave-0.019
 soon-0.018
Token help
Token ID1037
Feature activation+7.16e-02

Pos logit contributions
rily+0.017
izer+0.016
pole+0.016
cient+0.015
iche+0.015
Neg logit contributions
Mom-0.013
 mom-0.013
 owner-0.012
 slowly-0.012
 hard-0.012
Token.
Token ID13
Feature activation+5.41e-01

Pos logit contributions
rily+0.004
pole+0.004
izer+0.003
cient+0.003
iche+0.003
Neg logit contributions
 mom-0.002
Mom-0.002
 slowly-0.002
 owner-0.002
 care-0.002
Token B
Token ID347
Feature activation+1.52e+00

Pos logit contributions
rily+0.025
pole+0.023
iche+0.022
izer+0.022
cient+0.022
Neg logit contributions
 mom-0.018
 slowly-0.017
Mom-0.016
 care-0.016
 soon-0.016
Tokenamb
Token ID4131
Feature activation+2.90e+00

Pos logit contributions
rily+0.049
pole+0.045
iche+0.043
izer+0.042
ifying+0.041
Neg logit contributions
 mom-0.071
 slowly-0.068
Mom-0.067
 soon-0.066
 care-0.066
Tokeni
Token ID72
Feature activation-1.14e-01

Pos logit contributions
rily+0.138
pole+0.128
iche+0.124
izer+0.123
cient+0.121
Neg logit contributions
 mom-0.093
 slowly-0.086
Mom-0.085
 soon-0.084
 care-0.083
Token felt
Token ID2936
Feature activation+4.86e-01

Pos logit contributions
 mom+0.003
 care+0.003
Mom+0.003
 love+0.003
 Mom+0.003
Neg logit contributions
rily-0.005
pole-0.005
cient-0.005
venture-0.005
eh-0.005
Token sad
Token ID6507
Feature activation+7.33e-01

Pos logit contributions
rily+0.024
pole+0.022
iche+0.022
izer+0.021
cient+0.021
Neg logit contributions
 mom-0.014
Mom-0.013
 slowly-0.013
 soon-0.012
 owner-0.012
Token and
Token ID290
Feature activation+7.25e-01

Pos logit contributions
rily+0.038
pole+0.035
iche+0.034
izer+0.033
 cuc+0.033
Neg logit contributions
Mom-0.018
 mom-0.018
 care-0.016
 slowly-0.016
 moms-0.016
Token scared
Token ID12008
Feature activation+4.75e-01

Pos logit contributions
rily+0.040
pole+0.037
iche+0.036
izer+0.036
cient+0.035
Neg logit contributions
 mom-0.018
 slowly-0.017
Mom-0.016
 soon-0.016
 care-0.016
Token,
Token ID11
Feature activation+2.37e-01

Pos logit contributions
rily+0.025
pole+0.024
iche+0.023
izer+0.023
venture+0.022
Neg logit contributions
 mom-0.008
Mom-0.007
 slowly-0.007
 care-0.007
 owner-0.006
Token and
Token ID290
Feature activation+3.82e-01

Pos logit contributions
rily+0.011
pole+0.011
iche+0.010
izer+0.010
cient+0.010
Neg logit contributions
 mom-0.007
 slowly-0.007
Mom-0.007
 care-0.007
 soon-0.007
Token she
Token ID673
Feature activation+4.48e-01

Pos logit contributions
rily+0.018
pole+0.018
cient+0.017
 cuc+0.017
iche+0.017
Neg logit contributions
Mom-0.011
 mom-0.009
 owner-0.009
Tom-0.008
 moms-0.008
Token wanted
Token ID2227
Feature activation+4.04e-01

Pos logit contributions
pole+0.021
rily+0.021
cient+0.021
venture+0.020
 cuc+0.019
Neg logit contributions
Mom-0.012
 mom-0.011
 care-0.011
 words-0.010
Tom-0.010
Token the
Token ID262
Feature activation+2.89e-02

Pos logit contributions
 cuc+0.013
 pic+0.012
rily+0.012
 films+0.012
 Wed+0.012
Neg logit contributions
Mom-0.015
OK-0.014
 cre-0.013
Tom-0.013
 bud-0.012
Token mer
Token ID4017
Feature activation+2.90e+00

Pos logit contributions
rily+0.001
pole+0.001
 cuc+0.001
izer+0.001
cient+0.001
Neg logit contributions
Mom-0.001
 mom-0.001
 slowly-0.001
 soon-0.001
Tom-0.001
Tokenmaid
Token ID23151
Feature activation+1.06e+00

Pos logit contributions
rily+0.085
izer+0.082
ifying+0.082
pole+0.081
iche+0.081
Neg logit contributions
 mom-0.134
 slowly-0.128
 soon-0.124
 care-0.122
 owner-0.122
Token to
Token ID284
Feature activation+2.00e-01

Pos logit contributions
rily+0.064
pole+0.061
iche+0.058
cient+0.058
ifying+0.058
Neg logit contributions
 mom-0.019
Mom-0.017
 slowly-0.016
 care-0.015
 soon-0.015
Token come
Token ID1282
Feature activation+3.66e-01

Pos logit contributions
rily+0.010
pole+0.010
izer+0.010
iche+0.010
 cuc+0.009
Neg logit contributions
 mom-0.005
Mom-0.005
 slowly-0.005
 owner-0.004
 soon-0.004
Token too
Token ID1165
Feature activation-1.44e+00

Pos logit contributions
rily+0.020
pole+0.018
izer+0.017
ifying+0.017
iche+0.017
Neg logit contributions
 mom-0.009
 care-0.008
Mom-0.008
 moms-0.008
Tom-0.008
Token.
Token ID13
Feature activation+3.16e-01

Pos logit contributions
 care+0.032
Mom+0.031
 mom+0.031
 moms+0.031
 words+0.030
Neg logit contributions
rily-0.075
pole-0.067
ifying-0.067
iche-0.064
 cuc-0.062
Token castle
Token ID16669
Feature activation+6.82e-01

Pos logit contributions
rily+0.009
pole+0.009
iche+0.008
izer+0.008
cient+0.008
Neg logit contributions
 mom-0.006
 slowly-0.006
Mom-0.005
 soon-0.005
 care-0.005
Token so
Token ID523
Feature activation+1.17e-01

Pos logit contributions
rily+0.040
pole+0.038
iche+0.036
izer+0.036
ifying+0.036
Neg logit contributions
 mom-0.014
Mom-0.013
 slowly-0.012
 care-0.012
 owner-0.012
Token much
Token ID881
Feature activation-3.38e-01

Pos logit contributions
rily+0.006
pole+0.005
cient+0.005
iche+0.005
ifying+0.005
Neg logit contributions
Mom-0.003
 care-0.003
 mom-0.003
 owner-0.003
 moms-0.003
Token.
Token ID13
Feature activation-2.25e-01

Pos logit contributions
Mom+0.007
 mom+0.007
 owner+0.006
 slowly+0.006
 moms+0.006
Neg logit contributions
rily-0.019
pole-0.018
izer-0.017
iche-0.017
 cuc-0.017
Token The
Token ID383
Feature activation-3.21e-01

Pos logit contributions
 mom+0.007
 slowly+0.007
Mom+0.006
 soon+0.006
 care+0.006
Neg logit contributions
rily-0.011
pole-0.010
izer-0.010
iche-0.010
ifying-0.010
Token mer
Token ID4017
Feature activation+2.88e+00

Pos logit contributions
 mom+0.016
 owner+0.016
Mom+0.016
 slowly+0.016
 soon+0.016
Neg logit contributions
rily-0.007
pole-0.006
 cuc-0.006
venture-0.006
iche-0.005
Tokenmaid
Token ID23151
Feature activation+8.29e-01

Pos logit contributions
rily+0.082
iche+0.078
izer+0.078
pole+0.077
ifying+0.077
Neg logit contributions
 mom-0.137
 slowly-0.131
 soon-0.126
 care-0.125
 owner-0.125
Token said
Token ID531
Feature activation+2.88e-01

Pos logit contributions
rily+0.043
pole+0.041
cient+0.038
iche+0.038
 cuc+0.037
Neg logit contributions
 mom-0.022
Mom-0.021
 care-0.020
 owner-0.019
 slowly-0.019
Token,
Token ID11
Feature activation+4.29e-01

Pos logit contributions
rily+0.016
pole+0.015
 cuc+0.015
cient+0.014
melon+0.014
Neg logit contributions
Mom-0.007
 care-0.006
 owner-0.006
 mom-0.006
 moms-0.006
Token "
Token ID366
Feature activation-3.39e-01

Pos logit contributions
rily+0.021
pole+0.019
iche+0.019
izer+0.019
cient+0.018
Neg logit contributions
 mom-0.013
 slowly-0.012
Mom-0.012
 care-0.012
 soon-0.012
TokenI
Token ID40
Feature activation-3.76e-01

Pos logit contributions
 mom+0.009
 slowly+0.008
 care+0.008
 soon+0.007
 owner+0.007
Neg logit contributions
rily-0.018
pole-0.017
iche-0.017
izer-0.017
ifying-0.016
Token Peter
Token ID5613
Feature activation-3.20e-01

Pos logit contributions
 mom+0.001
Mom+0.001
 slowly+0.001
 soon+0.001
 care+0.001
Neg logit contributions
rily-0.003
pole-0.003
iche-0.003
izer-0.003
 cuc-0.003
Token wasn
Token ID2492
Feature activation-2.93e-01

Pos logit contributions
 mom+0.007
Mom+0.007
 care+0.007
 owner+0.006
 hard+0.006
Neg logit contributions
rily-0.017
pole-0.017
venture-0.016
cient-0.015
izer-0.015
Token't
Token ID470
Feature activation+2.48e-01

Pos logit contributions
 mom+0.007
 slowly+0.006
Mom+0.006
 soon+0.006
 brave+0.006
Neg logit contributions
rily-0.017
pole-0.016
iche-0.016
izer-0.016
 cuc-0.015
Token the
Token ID262
Feature activation+7.27e-03

Pos logit contributions
rily+0.013
pole+0.012
izer+0.012
iche+0.012
cient+0.012
Neg logit contributions
 mom-0.007
 slowly-0.006
 soon-0.006
 care-0.006
 brave-0.006
Token greatest
Token ID6000
Feature activation+4.35e-01

Pos logit contributions
rily+0.000
pole+0.000
izer+0.000
iche+0.000
 cuc+0.000
Neg logit contributions
 mom-0.000
Mom-0.000
 slowly-0.000
 soon-0.000
 care-0.000
Token sur
Token ID969
Feature activation+2.88e+00

Pos logit contributions
rily+0.019
pole+0.018
izer+0.018
iche+0.017
 cuc+0.017
Neg logit contributions
 mom-0.014
Mom-0.014
 slowly-0.013
 soon-0.013
 owner-0.013
Tokenfer
Token ID2232
Feature activation+1.37e+00

Pos logit contributions
rily+0.109
 cuc+0.094
iche+0.093
ifying+0.092
ulp+0.092
Neg logit contributions
 mom-0.119
 soon-0.117
 slowly-0.115
 owner-0.114
 brave-0.114
Token,
Token ID11
Feature activation-4.17e-02

Pos logit contributions
rily+0.083
pole+0.079
iche+0.076
izer+0.076
cient+0.075
Neg logit contributions
 mom-0.025
 slowly-0.022
Mom-0.022
 care-0.021
 soon-0.020
Token but
Token ID475
Feature activation-6.75e-01

Pos logit contributions
 mom+0.001
Mom+0.001
 slowly+0.001
 care+0.001
 owner+0.001
Neg logit contributions
rily-0.002
pole-0.002
izer-0.002
iche-0.002
venture-0.002
Token he
Token ID339
Feature activation+2.85e-01

Pos logit contributions
Mom+0.016
 mom+0.015
 owner+0.014
 moms+0.014
 slowly+0.014
Neg logit contributions
rily-0.035
pole-0.034
venture-0.032
ifying-0.032
 cuc-0.031
Token loved
Token ID6151
Feature activation+1.14e-01

Pos logit contributions
rily+0.016
pole+0.015
venture+0.015
 cuc+0.014
cient+0.014
Neg logit contributions
Mom-0.005
 mom-0.005
 care-0.005
 owner-0.004
 moms-0.004
Token
Token ID198
Feature activation+1.49e-01

Pos logit contributions
rily+0.007
izer+0.006
pole+0.006
ifying+0.006
iche+0.006
Neg logit contributions
 mom-0.004
 slowly-0.004
 Mom-0.004
Mom-0.004
 soon-0.004
Token
Token ID198
Feature activation+1.59e-01

Pos logit contributions
rily+0.008
pole+0.007
iche+0.007
izer+0.007
venture+0.007
Neg logit contributions
 mom-0.004
 care-0.003
 slowly-0.003
 owner-0.003
 soon-0.003
TokenJack
Token ID14295
Feature activation-7.61e-01

Pos logit contributions
rily+0.008
pole+0.008
iche+0.008
izer+0.008
 cuc+0.007
Neg logit contributions
 mom-0.004
 slowly-0.004
 care-0.004
 owner-0.004
 soon-0.004
Token asked
Token ID1965
Feature activation-5.81e-01

Pos logit contributions
 mom+0.019
Mom+0.018
 care+0.018
 moms+0.017
 owner+0.017
Neg logit contributions
rily-0.040
pole-0.039
cient-0.035
venture-0.035
 cuc-0.035
Token the
Token ID262
Feature activation-1.84e-01

Pos logit contributions
Mom+0.018
 owner+0.016
OK+0.016
 moms+0.016
Tom+0.015
Neg logit contributions
rily-0.026
pole-0.024
 cuc-0.024
arium-0.022
cient-0.022
Token mer
Token ID4017
Feature activation+2.86e+00

Pos logit contributions
 soon+0.009
 mom+0.009
 slowly+0.009
 care+0.009
 owner+0.009
Neg logit contributions
rily-0.004
pole-0.003
 cuc-0.003
cient-0.003
iche-0.003
Tokenmaid
Token ID23151
Feature activation+7.92e-01

Pos logit contributions
rily+0.080
pole+0.072
iche+0.071
izer+0.071
ifying+0.068
Neg logit contributions
 mom-0.143
 slowly-0.137
 soon-0.133
Mom-0.133
 owner-0.132
Token why
Token ID1521
Feature activation-1.54e+00

Pos logit contributions
rily+0.041
pole+0.040
cient+0.038
 cuc+0.037
melon+0.036
Neg logit contributions
Mom-0.022
Tom-0.019
 care-0.019
No-0.019
 mom-0.018
Token the
Token ID262
Feature activation-2.56e-01

Pos logit contributions
Mom+0.048
 owner+0.043
 care+0.041
 moms+0.040
 mom+0.039
Neg logit contributions
rily-0.071
pole-0.066
 cuc-0.064
cient-0.063
melon-0.063
Token waves
Token ID9813
Feature activation+1.70e+00

Pos logit contributions
Mom+0.013
 mom+0.013
 owner+0.013
 slowly+0.013
 soon+0.013
Neg logit contributions
rily-0.006
 cuc-0.005
pole-0.005
iche-0.005
venture-0.005
Token were
Token ID547
Feature activation+3.04e-02

Pos logit contributions
rily+0.097
pole+0.091
izer+0.090
iche+0.090
 cuc+0.087
Neg logit contributions
 mom-0.038
 slowly-0.035
 soon-0.033
 care-0.032
Mom-0.032
Token
Token ID198
Feature activation+5.50e-01

Pos logit contributions
rily+0.035
pole+0.033
iche+0.032
izer+0.032
ifying+0.031
Neg logit contributions
 mom-0.011
 slowly-0.010
Mom-0.010
 soon-0.009
 owner-0.009
Token
Token ID198
Feature activation+6.26e-01

Pos logit contributions
rily+0.029
pole+0.027
iche+0.027
izer+0.027
venture+0.026
Neg logit contributions
 mom-0.014
 care-0.012
 slowly-0.012
 owner-0.012
Mom-0.012
TokenFinally
Token ID11158
Feature activation+7.30e-01

Pos logit contributions
rily+0.028
pole+0.027
venture+0.025
izer+0.025
eh+0.025
Neg logit contributions
 mom-0.021
 care-0.019
 moms-0.018
 Mom-0.018
 words-0.018
Token,
Token ID11
Feature activation+6.90e-01

Pos logit contributions
rily+0.048
pole+0.045
izer+0.044
iche+0.043
 cuc+0.043
Neg logit contributions
 mom-0.009
Mom-0.008
 slowly-0.008
 owner-0.007
 care-0.007
Token B
Token ID347
Feature activation+1.28e+00

Pos logit contributions
rily+0.038
pole+0.036
iche+0.035
izer+0.035
venture+0.034
Neg logit contributions
 mom-0.018
 soon-0.017
 slowly-0.016
Mom-0.016
 care-0.015
Tokenamb
Token ID4131
Feature activation+2.84e+00

Pos logit contributions
rily+0.041
pole+0.037
iche+0.036
izer+0.035
ifying+0.034
Neg logit contributions
 mom-0.060
 slowly-0.057
Mom-0.057
 soon-0.056
 care-0.056
Tokeni
Token ID72
Feature activation-1.10e-01

Pos logit contributions
rily+0.136
pole+0.128
iche+0.124
izer+0.123
ifying+0.120
Neg logit contributions
 mom-0.089
 slowly-0.083
Mom-0.081
 soon-0.080
 care-0.080
Token found
Token ID1043
Feature activation+2.69e-01

Pos logit contributions
 mom+0.003
Mom+0.003
 care+0.003
 love+0.003
 hard+0.003
Neg logit contributions
rily-0.005
pole-0.005
izer-0.005
iche-0.005
cient-0.005
Token a
Token ID257
Feature activation+3.78e-01

Pos logit contributions
rily+0.014
pole+0.012
izer+0.012
 cuc+0.012
cient+0.012
Neg logit contributions
Mom-0.007
 mom-0.007
 slowly-0.006
 bud-0.006
 moms-0.006
Token kind
Token ID1611
Feature activation-1.53e-01

Pos logit contributions
rily+0.020
pole+0.019
izer+0.019
iche+0.019
 cuc+0.018
Neg logit contributions
 mom-0.009
Mom-0.009
 slowly-0.008
 care-0.008
 soon-0.008
Token hunter
Token ID19177
Feature activation+4.80e-01

Pos logit contributions
Mom+0.004
 soon+0.004
 mom+0.004
 slowly+0.004
 moms+0.004
Neg logit contributions
rily-0.008
pole-0.007
izer-0.007
 cuc-0.007
iche-0.007
Token again
Token ID757
Feature activation-6.07e-01

Pos logit contributions
Mom+0.009
 mom+0.008
 care+0.008
 moms+0.007
 slowly+0.007
Neg logit contributions
rily-0.013
pole-0.013
venture-0.012
ifying-0.011
iche-0.011
Token.
Token ID13
Feature activation+1.39e-02

Pos logit contributions
Mom+0.014
 mom+0.013
 moms+0.012
 care+0.011
 owner+0.011
Neg logit contributions
rily-0.032
pole-0.030
ifying-0.029
 cuc-0.029
iche-0.028
Token He
Token ID679
Feature activation-7.52e-02

Pos logit contributions
rily+0.001
pole+0.001
iche+0.001
izer+0.001
cient+0.001
Neg logit contributions
 mom-0.000
Mom-0.000
 slowly-0.000
 soon-0.000
 care-0.000
Token said
Token ID531
Feature activation-1.35e-01

Pos logit contributions
Mom+0.002
 care+0.002
 mom+0.002
No+0.002
Tom+0.002
Neg logit contributions
rily-0.004
pole-0.004
venture-0.003
 cuc-0.003
cient-0.003
Token the
Token ID262
Feature activation-2.97e-01

Pos logit contributions
Mom+0.004
 moms+0.004
Tom+0.004
 care+0.004
 love+0.004
Neg logit contributions
rily-0.006
pole-0.006
 cuc-0.006
cient-0.005
venture-0.005
Token mer
Token ID4017
Feature activation+2.84e+00

Pos logit contributions
Mom+0.010
 slowly+0.010
 soon+0.010
 mom+0.009
Tom+0.009
Neg logit contributions
rily-0.012
 cuc-0.011
cient-0.011
pole-0.011
venture-0.011
Tokenmaid
Token ID23151
Feature activation+1.04e+00

Pos logit contributions
rily+0.079
pole+0.073
iche+0.071
izer+0.070
ifying+0.068
Neg logit contributions
 mom-0.142
 slowly-0.135
 soon-0.131
 owner-0.131
Mom-0.131
Token and
Token ID290
Feature activation+2.43e-01

Pos logit contributions
rily+0.056
pole+0.056
cient+0.053
venture+0.052
ifying+0.050
Neg logit contributions
Mom-0.026
 mom-0.022
 care-0.021
Tom-0.021
 moms-0.020
Token a
Token ID257
Feature activation+6.83e-01

Pos logit contributions
rily+0.012
pole+0.012
 cuc+0.011
cient+0.011
venture+0.011
Neg logit contributions
Mom-0.006
 care-0.006
 mom-0.005
 hard-0.005
 love-0.005
Token dolphin
Token ID44136
Feature activation+2.21e-01

Pos logit contributions
rily+0.035
pole+0.033
 cuc+0.033
iche+0.033
cient+0.032
Neg logit contributions
Mom-0.016
 mom-0.016
 slowly-0.016
 soon-0.015
 care-0.015
Token were
Token ID547
Feature activation+1.25e-01

Pos logit contributions
rily+0.013
pole+0.012
iche+0.012
izer+0.012
cient+0.012
Neg logit contributions
 mom-0.004
 slowly-0.004
Mom-0.004
 soon-0.004
 care-0.004
Token.
Token ID13
Feature activation-4.81e-01

Pos logit contributions
rily+0.024
iche+0.021
pole+0.021
izer+0.021
ifying+0.021
Neg logit contributions
 mom-0.012
 slowly-0.011
 owner-0.010
 soon-0.010
 brave-0.010
Token She
Token ID1375
Feature activation-8.87e-01

Pos logit contributions
 mom+0.015
 slowly+0.013
Mom+0.013
 care+0.013
 owner+0.013
Neg logit contributions
rily-0.024
pole-0.022
iche-0.022
izer-0.022
ifying-0.021
Token was
Token ID373
Feature activation-6.14e-02

Pos logit contributions
 mom+0.025
 slowly+0.023
Mom+0.023
 care+0.022
 soon+0.022
Neg logit contributions
rily-0.045
pole-0.042
iche-0.041
izer-0.041
cient-0.040
Token the
Token ID262
Feature activation-2.35e-01

Pos logit contributions
 mom+0.001
 brave+0.001
 hard+0.001
 owner+0.001
 slowly+0.001
Neg logit contributions
rily-0.004
iche-0.004
izer-0.004
pole-0.004
ifying-0.004
Token happiest
Token ID49414
Feature activation+4.18e-01

Pos logit contributions
 mom+0.007
 slowly+0.007
 care+0.006
 soon+0.006
Mom+0.006
Neg logit contributions
pole-0.011
rily-0.010
izer-0.010
cient-0.010
iche-0.010
Token ostr
Token ID42387
Feature activation+2.84e+00

Pos logit contributions
rily+0.016
pole+0.015
izer+0.014
iche+0.014
cient+0.014
Neg logit contributions
 mom-0.016
 slowly-0.016
Mom-0.015
 care-0.015
 soon-0.015
Tokenich
Token ID488
Feature activation+1.16e+00

Pos logit contributions
rily+0.091
pole+0.082
iche+0.077
izer+0.077
cient+0.074
Neg logit contributions
 mom-0.134
 slowly-0.128
Mom-0.126
 care-0.125
 soon-0.125
Token in
Token ID287
Feature activation+1.13e-01

Pos logit contributions
rily+0.064
pole+0.059
iche+0.058
izer+0.058
cient+0.057
Neg logit contributions
 mom-0.029
 slowly-0.027
 soon-0.026
Mom-0.025
 care-0.025
Token the
Token ID262
Feature activation+1.48e-01

Pos logit contributions
rily+0.008
pole+0.007
iche+0.007
izer+0.007
cient+0.007
Neg logit contributions
 mom-0.002
 slowly-0.001
 care-0.001
 soon-0.001
Mom-0.001
Token world
Token ID995
Feature activation+2.67e-01

Pos logit contributions
rily+0.006
pole+0.006
iche+0.006
izer+0.006
cient+0.005
Neg logit contributions
 mom-0.005
 slowly-0.005
 care-0.005
 soon-0.005
Mom-0.005
Token.
Token ID13
Feature activation-2.07e-01

Pos logit contributions
rily+0.016
pole+0.015
iche+0.015
venture+0.015
izer+0.015
Neg logit contributions
 mom-0.005
Mom-0.004
 slowly-0.004
 care-0.004
 owner-0.004
Token dog
Token ID3290
Feature activation+4.55e-01

Pos logit contributions
 mom+0.003
 slowly+0.003
Mom+0.003
 soon+0.003
 care+0.003
Neg logit contributions
rily-0.007
pole-0.006
izer-0.006
iche-0.006
cient-0.006
Token as
Token ID355
Feature activation-3.51e-01

Pos logit contributions
rily+0.024
pole+0.023
cient+0.022
izer+0.022
venture+0.022
Neg logit contributions
Mom-0.011
 mom-0.011
 care-0.010
 slowly-0.010
 moms-0.009
Token a
Token ID257
Feature activation+3.65e-01

Pos logit contributions
Mom+0.008
 mom+0.007
 care+0.007
 slowly+0.007
 owner+0.007
Neg logit contributions
rily-0.019
pole-0.018
 cuc-0.017
iche-0.017
izer-0.017
Token yellow
Token ID7872
Feature activation+6.65e-01

Pos logit contributions
rily+0.017
pole+0.016
izer+0.016
 cuc+0.016
iche+0.016
Neg logit contributions
Mom-0.010
 mom-0.010
 soon-0.010
 slowly-0.010
 care-0.010
Token lab
Token ID2248
Feature activation+7.42e-01

Pos logit contributions
rily+0.032
pole+0.031
izer+0.030
iche+0.029
ifying+0.029
Neg logit contributions
 mom-0.018
Mom-0.018
 soon-0.018
 slowly-0.018
 care-0.017
Tokenrador
Token ID40368
Feature activation+2.83e+00

Pos logit contributions
rily+0.041
pole+0.040
izer+0.037
 cuc+0.037
venture+0.037
Neg logit contributions
 mom-0.017
Mom-0.016
 care-0.015
 slowly-0.015
 soon-0.014
Token.
Token ID13
Feature activation+3.26e-02

Pos logit contributions
rily+0.159
pole+0.149
iche+0.147
izer+0.145
ifying+0.142
Neg logit contributions
 mom-0.067
 slowly-0.059
 care-0.057
 owner-0.057
Mom-0.057
Token Lily
Token ID20037
Feature activation-6.98e-01

Pos logit contributions
rily+0.002
pole+0.002
izer+0.002
iche+0.002
ifying+0.002
Neg logit contributions
 mom-0.001
 slowly-0.001
Mom-0.001
 owner-0.001
 soon-0.001
Token got
Token ID1392
Feature activation-2.98e-01

Pos logit contributions
 care+0.014
 mom+0.014
Mom+0.013
 love+0.012
 hard+0.012
Neg logit contributions
rily-0.038
pole-0.037
cient-0.035
 cuc-0.034
venture-0.034
Token excited
Token ID6568
Feature activation+8.57e-01

Pos logit contributions
Mom+0.009
Tom+0.009
 mom+0.009
 soon+0.008
 owner+0.008
Neg logit contributions
rily-0.014
pole-0.012
cient-0.012
 cuc-0.012
izer-0.011
Token and
Token ID290
Feature activation-9.39e-02

Pos logit contributions
rily+0.048
pole+0.045
izer+0.042
iche+0.042
 cuc+0.042
Neg logit contributions
 mom-0.020
Mom-0.020
 slowly-0.017
 owner-0.017
 moms-0.017
Token demands
Token ID8665
Feature activation-3.16e-01

Pos logit contributions
rily+0.022
pole+0.020
izer+0.020
iche+0.020
 cuc+0.020
Neg logit contributions
Mom-0.012
 slowly-0.011
 soon-0.011
 mom-0.010
Tom-0.010
Token,
Token ID11
Feature activation+1.07e-01

Pos logit contributions
 mom+0.008
Mom+0.008
 slowly+0.008
 care+0.007
 owner+0.007
Neg logit contributions
rily-0.016
pole-0.015
iche-0.014
cient-0.014
izer-0.014
Token crab
Token ID32202
Feature activation+8.57e-02

Pos logit contributions
rily+0.006
pole+0.005
izer+0.005
iche+0.005
 cuc+0.005
Neg logit contributions
 mom-0.003
Mom-0.003
 slowly-0.003
 care-0.003
 soon-0.003
Token,"
Token ID553
Feature activation-1.19e+00

Pos logit contributions
rily+0.005
pole+0.004
iche+0.004
cient+0.004
izer+0.004
Neg logit contributions
 mom-0.002
Mom-0.002
 slowly-0.002
 care-0.002
 owner-0.002
Token the
Token ID262
Feature activation+3.53e-01

Pos logit contributions
Mom+0.029
 mom+0.028
 owner+0.027
 love+0.027
 moms+0.026
Neg logit contributions
rily-0.061
 cuc-0.057
pole-0.057
cient-0.055
izer-0.051
Token mer
Token ID4017
Feature activation+2.83e+00

Pos logit contributions
rily+0.011
 cuc+0.010
pole+0.009
melon+0.009
cient+0.008
Neg logit contributions
 soon-0.016
Mom-0.015
 slowly-0.015
 mom-0.015
Tom-0.015
Tokenmaid
Token ID23151
Feature activation+1.84e-01

Pos logit contributions
rily+0.068
iche+0.066
izer+0.066
ifying+0.066
pole+0.065
Neg logit contributions
 mom-0.145
 slowly-0.140
 owner-0.134
 soon-0.134
 care-0.133
Token said
Token ID531
Feature activation-6.39e-01

Pos logit contributions
rily+0.009
pole+0.009
cient+0.008
 cuc+0.008
iche+0.008
Neg logit contributions
 mom-0.005
Mom-0.005
 care-0.005
 love-0.004
 owner-0.004
Token.
Token ID13
Feature activation+4.36e-01

Pos logit contributions
Mom+0.013
 mom+0.012
 moms+0.012
 care+0.011
 love+0.011
Neg logit contributions
rily-0.038
pole-0.035
 cuc-0.033
cient-0.033
izer-0.033
Token "
Token ID366
Feature activation-6.62e-01

Pos logit contributions
rily+0.023
pole+0.022
iche+0.022
izer+0.022
ifying+0.021
Neg logit contributions
 mom-0.012
 slowly-0.011
Mom-0.010
 soon-0.010
 owner-0.010
TokenI
Token ID40
Feature activation-3.52e-01

Pos logit contributions
 mom+0.019
 slowly+0.016
 owner+0.016
 hard+0.016
 care+0.016
Neg logit contributions
rily-0.034
pole-0.032
iche-0.031
izer-0.031
ifying-0.030
Token sur
Token ID969
Feature activation+2.64e+00

Pos logit contributions
rily+0.005
izer+0.004
 cuc+0.004
cient+0.004
pole+0.004
Neg logit contributions
Mom-0.006
 mom-0.006
 soon-0.006
 owner-0.005
 care-0.005
Tokenfer
Token ID2232
Feature activation+2.72e-01

Pos logit contributions
rily+0.099
ifying+0.085
pole+0.085
 cuc+0.085
iche+0.082
Neg logit contributions
 mom-0.110
 slowly-0.107
 soon-0.107
Mom-0.105
 next-0.104
Token.
Token ID13
Feature activation+9.43e-02

Pos logit contributions
rily+0.015
pole+0.014
iche+0.014
izer+0.014
cient+0.014
Neg logit contributions
 mom-0.006
 slowly-0.006
Mom-0.005
 soon-0.005
 care-0.005
Token Whenever
Token ID21326
Feature activation-7.70e-03

Pos logit contributions
rily+0.004
pole+0.004
cient+0.004
iche+0.004
ifying+0.004
Neg logit contributions
 mom-0.003
 care-0.003
 slowly-0.003
 soon-0.003
Mom-0.003
Token the
Token ID262
Feature activation+4.93e-01

Pos logit contributions
Mom+0.000
 owner+0.000
 cre+0.000
 love+0.000
 moms+0.000
Neg logit contributions
arium-0.000
cient-0.000
 films-0.000
rily-0.000
venture-0.000
Token sur
Token ID969
Feature activation+2.82e+00

Pos logit contributions
rily+0.015
 cuc+0.015
izer+0.014
 cinema+0.014
venture+0.014
Neg logit contributions
 soon-0.019
 mom-0.019
Mom-0.019
 slowly-0.018
 care-0.018
Tokenfer
Token ID2232
Feature activation+1.46e+00

Pos logit contributions
rily+0.103
pole+0.089
 cuc+0.088
iche+0.088
ifying+0.088
Neg logit contributions
 mom-0.119
 slowly-0.116
 soon-0.116
Mom-0.115
 now-0.112
Token balanced
Token ID12974
Feature activation+5.74e-01

Pos logit contributions
pole+0.069
rily+0.069
venture+0.065
cient+0.064
iche+0.064
Neg logit contributions
Mom-0.040
 mom-0.040
 care-0.035
 slowly-0.034
 owner-0.034
Token on
Token ID319
Feature activation+5.28e-01

Pos logit contributions
rily+0.030
pole+0.028
izer+0.027
iche+0.026
venture+0.026
Neg logit contributions
 mom-0.015
Mom-0.015
 slowly-0.014
 care-0.014
 owner-0.013
Token the
Token ID262
Feature activation+6.81e-01

Pos logit contributions
rily+0.024
ifying+0.021
izer+0.021
pole+0.021
 films+0.020
Neg logit contributions
Mom-0.016
 moms-0.014
 owner-0.014
 soon-0.013
OK-0.013
Token board
Token ID3096
Feature activation+8.83e-02

Pos logit contributions
rily+0.030
pole+0.027
izer+0.027
iche+0.026
venture+0.026
Neg logit contributions
 mom-0.023
Mom-0.022
 soon-0.022
 slowly-0.022
 owner-0.021
Token pond
Token ID16723
Feature activation+1.04e-01

Pos logit contributions
rily+0.029
pole+0.027
iche+0.026
izer+0.026
cient+0.026
Neg logit contributions
 mom-0.022
 slowly-0.020
Mom-0.020
 care-0.019
 soon-0.019
Token?"
Token ID1701
Feature activation-2.18e-01

Pos logit contributions
rily+0.005
pole+0.004
iche+0.004
izer+0.004
cient+0.004
Neg logit contributions
 mom-0.004
 slowly-0.004
Mom-0.003
 soon-0.003
 owner-0.003
Token
Token ID198
Feature activation+8.74e-02

Pos logit contributions
 mom+0.008
Mom+0.008
 slowly+0.007
 soon+0.007
 love+0.007
Neg logit contributions
rily-0.009
pole-0.008
izer-0.008
iche-0.008
cient-0.008
Token
Token ID198
Feature activation-2.73e-02

Pos logit contributions
izer+0.004
rily+0.004
pole+0.004
iche+0.004
venture+0.003
Neg logit contributions
 bud-0.002
 mom-0.002
 luck-0.002
 care-0.002
 soon-0.002
TokenThe
Token ID464
Feature activation+3.64e-01

Pos logit contributions
 mom+0.001
 luck+0.001
 bud+0.001
 love+0.001
 hard+0.001
Neg logit contributions
izer-0.001
pole-0.001
venture-0.001
eh-0.001
iche-0.001
Token gol
Token ID43791
Feature activation+2.82e+00

Pos logit contributions
rily+0.013
pole+0.012
izer+0.011
iche+0.011
 cuc+0.011
Neg logit contributions
 mom-0.016
 slowly-0.015
 soon-0.015
Mom-0.015
 care-0.015
Tokenfer
Token ID2232
Feature activation+6.05e-01

Pos logit contributions
rily+0.104
iche+0.091
ifying+0.088
ulp+0.088
pole+0.086
Neg logit contributions
 mom-0.121
 owner-0.118
Mom-0.117
 slowly-0.115
 soon-0.115
Token replied
Token ID8712
Feature activation-1.57e-01

Pos logit contributions
pole+0.030
rily+0.029
cient+0.027
venture+0.027
iche+0.026
Neg logit contributions
Mom-0.016
 mom-0.016
 care-0.015
 moms-0.014
 love-0.014
Token,
Token ID11
Feature activation+6.13e-01

Pos logit contributions
Mom+0.002
 mom+0.002
 care+0.002
 slowly+0.002
 love+0.002
Neg logit contributions
rily-0.010
pole-0.010
cient-0.009
izer-0.009
iche-0.009
Token "
Token ID366
Feature activation-2.05e-01

Pos logit contributions
rily+0.034
iche+0.032
izer+0.032
pole+0.032
ifying+0.031
Neg logit contributions
 mom-0.016
 slowly-0.014
Mom-0.013
 soon-0.013
 brave-0.013
TokenI
Token ID40
Feature activation-2.27e-01

Pos logit contributions
 mom+0.006
 care+0.005
 slowly+0.005
 hard+0.005
 soon+0.005
Neg logit contributions
rily-0.010
pole-0.010
iche-0.010
ifying-0.010
izer-0.010
Token,
Token ID11
Feature activation+8.83e-02

Pos logit contributions
Mom+0.003
 mom+0.003
 care+0.003
 moms+0.003
 owner+0.003
Neg logit contributions
rily-0.007
pole-0.006
 cuc-0.006
ifying-0.006
izer-0.006
Token with
Token ID351
Feature activation-1.72e-01

Pos logit contributions
rily+0.005
pole+0.004
iche+0.004
izer+0.004
cient+0.004
Neg logit contributions
 mom-0.002
 slowly-0.002
Mom-0.002
 soon-0.002
 care-0.002
Token all
Token ID477
Feature activation-5.56e-02

Pos logit contributions
Mom+0.004
 mom+0.004
 owner+0.004
 slowly+0.004
 soon+0.004
Neg logit contributions
rily-0.009
pole-0.008
 cuc-0.008
ifying-0.008
ulp-0.008
Token its
Token ID663
Feature activation-6.06e-01

Pos logit contributions
 mom+0.001
 slowly+0.001
 care+0.001
Mom+0.001
 soon+0.001
Neg logit contributions
rily-0.003
pole-0.003
iche-0.003
izer-0.003
cient-0.003
Token different
Token ID1180
Feature activation+4.52e-01

Pos logit contributions
Mom+0.017
 mom+0.016
Tom+0.015
 slowly+0.015
 soon+0.015
Neg logit contributions
rily-0.028
iche-0.028
pole-0.028
 cuc-0.028
cient-0.027
Tokencol
Token ID4033
Feature activation+2.82e+00

Pos logit contributions
rily+0.022
 cuc+0.020
pole+0.020
iche+0.020
izer+0.019
Neg logit contributions
Mom-0.013
 mom-0.013
Tom-0.012
 owner-0.012
 slowly-0.012
Tokenours
Token ID4662
Feature activation-1.75e-02

Pos logit contributions
rily+0.077
pole+0.073
izer+0.068
iche+0.064
venture+0.061
Neg logit contributions
 mom-0.139
 care-0.134
 slowly-0.133
Mom-0.130
 love-0.130
Token,
Token ID11
Feature activation+3.49e-01

Pos logit contributions
 mom+0.000
Mom+0.000
 owner+0.000
 brave+0.000
 moms+0.000
Neg logit contributions
rily-0.001
 cuc-0.001
pole-0.001
melon-0.001
ifying-0.001
Token shapes
Token ID15268
Feature activation+4.06e-01

Pos logit contributions
rily+0.019
pole+0.018
iche+0.017
izer+0.017
 cuc+0.017
Neg logit contributions
 mom-0.008
 slowly-0.008
Mom-0.008
 care-0.007
 soon-0.007
Token and
Token ID290
Feature activation+1.08e-01

Pos logit contributions
rily+0.024
pole+0.022
iche+0.021
izer+0.021
 cuc+0.021
Neg logit contributions
 mom-0.009
Mom-0.008
 slowly-0.007
 care-0.007
 owner-0.007
Token textures
Token ID20028
Feature activation+2.20e-01

Pos logit contributions
 cuc+0.005
rily+0.005
pole+0.005
iche+0.004
ifying+0.004
Neg logit contributions
Mom-0.003
 mom-0.003
 owner-0.003
 slowly-0.003
Tom-0.003
Token from
Token ID422
Feature activation+8.66e-01

Pos logit contributions
rily+0.003
iche+0.002
ifying+0.002
pole+0.002
ulp+0.002
Neg logit contributions
 mom-0.001
 slowly-0.001
 care-0.001
 brave-0.001
 soon-0.001
Token the
Token ID262
Feature activation-1.97e-01

Pos logit contributions
rily+0.048
pole+0.045
izer+0.044
iche+0.044
cient+0.043
Neg logit contributions
 mom-0.020
Mom-0.018
 slowly-0.018
 care-0.017
 owner-0.017
Token trap
Token ID12840
Feature activation-1.73e-01

Pos logit contributions
 mom+0.009
Mom+0.008
 slowly+0.008
 soon+0.008
 care+0.008
Neg logit contributions
rily-0.007
pole-0.006
izer-0.006
iche-0.006
cient-0.006
Token.
Token ID13
Feature activation+2.54e-01

Pos logit contributions
 mom+0.003
 slowly+0.002
 soon+0.002
 owner+0.002
 care+0.002
Neg logit contributions
rily-0.011
iche-0.010
pole-0.010
izer-0.010
ifying-0.010
Token B
Token ID347
Feature activation+1.33e+00

Pos logit contributions
rily+0.013
iche+0.012
pole+0.012
izer+0.012
eh+0.012
Neg logit contributions
 mom-0.008
 slowly-0.007
 soon-0.007
Mom-0.007
 care-0.007
Tokenamb
Token ID4131
Feature activation+2.82e+00

Pos logit contributions
rily+0.042
pole+0.038
iche+0.037
izer+0.036
ifying+0.035
Neg logit contributions
 mom-0.063
 slowly-0.060
Mom-0.059
 soon-0.059
 care-0.059
Tokeni
Token ID72
Feature activation-4.26e-01

Pos logit contributions
rily+0.135
pole+0.126
iche+0.122
izer+0.122
ifying+0.118
Neg logit contributions
 mom-0.089
 slowly-0.083
Mom-0.081
 soon-0.080
 care-0.080
Token was
Token ID373
Feature activation-4.43e-01

Pos logit contributions
 mom+0.010
Mom+0.009
 care+0.008
 Mom+0.008
 hard+0.008
Neg logit contributions
rily-0.023
pole-0.022
cient-0.021
izer-0.021
 cuc-0.021
Token very
Token ID845
Feature activation+2.05e-01

Pos logit contributions
 mom+0.014
 slowly+0.013
Mom+0.013
 care+0.013
 soon+0.013
Neg logit contributions
rily-0.021
pole-0.019
iche-0.019
izer-0.019
cient-0.018
Token happy
Token ID3772
Feature activation+5.70e-01

Pos logit contributions
rily+0.010
pole+0.009
iche+0.009
izer+0.009
cient+0.009
Neg logit contributions
 mom-0.006
Mom-0.005
 slowly-0.005
 care-0.005
 soon-0.005
Token and
Token ID290
Feature activation+9.98e-02

Pos logit contributions
rily+0.030
pole+0.027
 cuc+0.027
izer+0.026
melon+0.025
Neg logit contributions
Mom-0.015
 mom-0.013
 moms-0.013
 care-0.012
 slowly-0.012
Token reef
Token ID25088
Feature activation-1.75e-01

Pos logit contributions
 mom+0.002
 care+0.001
 slowly+0.001
Mom+0.001
 soon+0.001
Neg logit contributions
rily-0.003
pole-0.003
iche-0.003
izer-0.003
 cuc-0.003
Token with
Token ID351
Feature activation+9.92e-01

Pos logit contributions
 slowly+0.003
 next+0.003
 mom+0.003
 now+0.002
 soon+0.002
Neg logit contributions
rily-0.012
ksh-0.011
iche-0.011
izer-0.011
cient-0.011
Token many
Token ID867
Feature activation+1.96e-01

Pos logit contributions
rily+0.055
pole+0.052
 cuc+0.050
iche+0.050
ifying+0.049
Neg logit contributions
 mom-0.023
 slowly-0.021
Mom-0.021
 care-0.020
 soon-0.020
Token fish
Token ID5916
Feature activation-1.54e-01

Pos logit contributions
rily+0.009
 cuc+0.008
pole+0.008
 films+0.008
iche+0.008
Neg logit contributions
Mom-0.006
 mom-0.006
 owner-0.006
 soon-0.006
Tom-0.005
Token and
Token ID290
Feature activation+7.26e-01

Pos logit contributions
 mom+0.004
 slowly+0.004
 soon+0.004
 owner+0.004
 brave+0.003
Neg logit contributions
rily-0.008
pole-0.008
izer-0.008
iche-0.008
cient-0.007
Token cor
Token ID1162
Feature activation+2.81e+00

Pos logit contributions
rily+0.035
pole+0.033
 cuc+0.033
iche+0.032
ifying+0.031
Neg logit contributions
 mom-0.021
Mom-0.020
 slowly-0.019
 owner-0.019
 care-0.019
Tokenals
Token ID874
Feature activation+1.00e-01

Pos logit contributions
rily+0.113
pole+0.103
izer+0.101
iche+0.101
venture+0.094
Neg logit contributions
 care-0.106
 mom-0.105
 slowly-0.104
 love-0.101
 brave-0.099
Token.
Token ID13
Feature activation+6.66e-01

Pos logit contributions
rily+0.007
izer+0.006
iche+0.006
pole+0.006
venture+0.006
Neg logit contributions
 mom-0.001
 slowly-0.001
 soon-0.001
 owner-0.001
 hard-0.001
Token Anna
Token ID11735
Feature activation-4.16e-02

Pos logit contributions
pole+0.038
izer+0.037
rily+0.037
cient+0.036
eh+0.036
Neg logit contributions
 mom-0.018
 Mom-0.016
Mom-0.015
 slowly-0.015
 owner-0.013
Token wanted
Token ID2227
Feature activation+2.22e-01

Pos logit contributions
 mom+0.001
 care+0.001
Mom+0.001
 hard+0.001
 owner+0.001
Neg logit contributions
rily-0.002
pole-0.002
 cuc-0.002
iche-0.002
venture-0.002
Token to
Token ID284
Feature activation-1.63e-01

Pos logit contributions
rily+0.012
pole+0.011
 cuc+0.011
iche+0.011
melon+0.011
Neg logit contributions
Mom-0.005
 mom-0.005
 slowly-0.004
 owner-0.004
 care-0.004
Token rocket
Token ID10701
Feature activation-9.57e-02

Pos logit contributions
rily+0.008
pole+0.008
izer+0.007
iche+0.007
ifying+0.007
Neg logit contributions
 mom-0.005
 slowly-0.004
Mom-0.004
 soon-0.004
 care-0.004
Token inside
Token ID2641
Feature activation-2.62e-01

Pos logit contributions
 mom+0.002
 slowly+0.002
 soon+0.002
Mom+0.002
 care+0.002
Neg logit contributions
rily-0.005
pole-0.005
iche-0.005
izer-0.005
cient-0.005
Token and
Token ID290
Feature activation-1.04e-01

Pos logit contributions
 mom+0.004
Mom+0.004
 slowly+0.004
 care+0.004
 owner+0.004
Neg logit contributions
rily-0.016
pole-0.015
iche-0.015
izer-0.015
ifying-0.015
Token climbs
Token ID34332
Feature activation-4.14e-01

Pos logit contributions
 mom+0.003
Mom+0.003
 owner+0.003
 slowly+0.003
 care+0.003
Neg logit contributions
rily-0.005
pole-0.005
 cuc-0.004
iche-0.004
izer-0.004
Token in
Token ID287
Feature activation-4.12e-01

Pos logit contributions
 mom+0.012
Mom+0.011
 care+0.011
 owner+0.011
 brave+0.011
Neg logit contributions
rily-0.021
pole-0.019
izer-0.018
venture-0.018
iche-0.018
Token too
Token ID1165
Feature activation-2.65e+00

Pos logit contributions
 moms+0.009
 mom+0.009
Mom+0.009
 owner+0.009
 brave+0.008
Neg logit contributions
rily-0.022
izer-0.019
pole-0.019
ifying-0.019
 stret-0.018
Token.
Token ID13
Feature activation-4.35e-01

Pos logit contributions
 moms+0.066
Mom+0.064
 mom+0.062
 care+0.060
 owner+0.056
Neg logit contributions
rily-0.129
pole-0.115
ifying-0.109
umbs-0.108
arium-0.103
Token
Token ID198
Feature activation-7.09e-01

Pos logit contributions
 mom+0.011
Mom+0.010
 slowly+0.010
 soon+0.010
 owner+0.010
Neg logit contributions
rily-0.024
pole-0.022
izer-0.021
iche-0.021
cient-0.021
Token
Token ID198
Feature activation-2.44e-01

Pos logit contributions
 mom+0.018
 care+0.017
 slowly+0.017
 brave+0.016
 soon+0.016
Neg logit contributions
rily-0.035
pole-0.033
iche-0.033
izer-0.033
venture-0.032
TokenHe
Token ID1544
Feature activation-2.50e-01

Pos logit contributions
 mom+0.005
 care+0.004
 slowly+0.004
 owner+0.004
 hard+0.004
Neg logit contributions
rily-0.013
pole-0.013
iche-0.013
eh-0.013
izer-0.013
Token pret
Token ID2181
Feature activation-3.50e-01

Pos logit contributions
 mom+0.009
Mom+0.009
 owner+0.008
 care+0.008
 soon+0.008
Neg logit contributions
rily-0.010
pole-0.009
venture-0.009
 cuc-0.009
iche-0.009
Token modest
Token ID12949
Feature activation-5.73e-01

Pos logit contributions
Mom+0.013
 mom+0.012
 slowly+0.011
 care+0.011
No+0.011
Neg logit contributions
rily-0.026
iche-0.025
pole-0.025
 cuc-0.024
eh-0.024
Token and
Token ID290
Feature activation+1.73e-01

Pos logit contributions
Mom+0.013
 mom+0.012
 slowly+0.011
Tom+0.011
 owner+0.011
Neg logit contributions
rily-0.032
pole-0.028
 cuc-0.028
iche-0.028
ifying-0.027
Token don
Token ID836
Feature activation-1.86e+00

Pos logit contributions
rily+0.008
iche+0.008
 cuc+0.008
pole+0.008
cient+0.008
Neg logit contributions
Mom-0.005
 mom-0.005
 owner-0.004
 slowly-0.004
Tom-0.004
Token't
Token ID470
Feature activation-3.04e-01

Pos logit contributions
 mom+0.042
Mom+0.038
 slowly+0.038
 soon+0.035
 hard+0.034
Neg logit contributions
rily-0.110
pole-0.103
izer-0.100
iche-0.100
 cuc-0.099
Token eat
Token ID4483
Feature activation-8.02e-01

Pos logit contributions
 mom+0.009
Mom+0.009
 slowly+0.008
 owner+0.008
 moms+0.008
Neg logit contributions
pole-0.014
iche-0.014
rily-0.014
 cuc-0.013
melon-0.012
Token too
Token ID1165
Feature activation-2.58e+00

Pos logit contributions
 mom+0.021
Mom+0.021
 owner+0.020
 slowly+0.020
 care+0.020
Neg logit contributions
rily-0.041
iche-0.037
pole-0.037
 cuc-0.037
izer-0.036
Token many
Token ID867
Feature activation-1.69e-01

Pos logit contributions
 mom+0.062
Mom+0.061
Mac+0.058
Jenn+0.058
 owner+0.057
Neg logit contributions
rily-0.122
iche-0.115
eh-0.110
 ours-0.110
ifying-0.110
Token pr
Token ID778
Feature activation+1.20e+00

Pos logit contributions
Mom+0.005
 mom+0.005
 owner+0.005
 slowly+0.005
No+0.004
Neg logit contributions
rily-0.008
 cuc-0.008
iche-0.007
pole-0.007
eh-0.007
Tokenunes
Token ID4015
Feature activation-2.38e-01

Pos logit contributions
rily+0.034
pole+0.030
iche+0.028
izer+0.028
cient+0.026
Neg logit contributions
 mom-0.062
 slowly-0.059
Mom-0.058
 soon-0.058
 care-0.058
Token.
Token ID13
Feature activation-9.51e-02

Pos logit contributions
 mom+0.004
 slowly+0.004
Mom+0.003
 care+0.003
 soon+0.003
Neg logit contributions
rily-0.015
pole-0.014
iche-0.014
izer-0.014
cient-0.013
Token They
Token ID1119
Feature activation+3.49e-01

Pos logit contributions
 mom+0.004
 slowly+0.003
Mom+0.003
 soon+0.003
 care+0.003
Neg logit contributions
rily-0.004
pole-0.004
iche-0.003
izer-0.003
cient-0.003
Token says
Token ID1139
Feature activation-2.93e-01

Pos logit contributions
 mom+0.001
 slowly+0.001
Mom+0.001
 soon+0.000
 care+0.000
Neg logit contributions
rily-0.001
pole-0.001
iche-0.001
izer-0.001
cient-0.001
Token:
Token ID25
Feature activation-7.20e-02

Pos logit contributions
Mom+0.006
 owner+0.006
 slowly+0.006
 care+0.005
 soon+0.005
Neg logit contributions
rily-0.016
 cuc-0.015
cient-0.015
iche-0.014
pole-0.014
Token
Token ID198
Feature activation-7.32e-01

Pos logit contributions
 mom+0.002
 slowly+0.002
Mom+0.002
 soon+0.002
 care+0.002
Neg logit contributions
rily-0.004
pole-0.004
izer-0.004
iche-0.004
cient-0.003
Token
Token ID198
Feature activation-5.46e-01

Pos logit contributions
 luck+0.024
 care+0.023
 brave+0.023
 soon+0.022
 slowly+0.022
Neg logit contributions
rily-0.026
izer-0.026
iche-0.026
umbs-0.024
 stret-0.024
Token"
Token ID1
Feature activation-9.92e-01

Pos logit contributions
 luck+0.017
 words+0.015
 mom+0.015
 care+0.015
 brave+0.014
Neg logit contributions
eh-0.019
ifying-0.019
iche-0.019
ults-0.019
cks-0.018
TokenDr
Token ID6187
Feature activation-2.56e+00

Pos logit contributions
 mom+0.029
 slowly+0.027
 care+0.026
 hard+0.026
 brave+0.026
Neg logit contributions
rily-0.047
pole-0.044
izer-0.044
iche-0.044
 cuc-0.043
Tokenink
Token ID676
Feature activation-2.29e-01

Pos logit contributions
 mom+0.092
 slowly+0.086
 soon+0.085
 owner+0.084
 care+0.084
Neg logit contributions
rily-0.106
iche-0.098
izer-0.098
pole-0.095
ifying-0.092
Token some
Token ID617
Feature activation-4.34e-01

Pos logit contributions
 mom+0.005
Mom+0.005
 slowly+0.005
 soon+0.005
 care+0.005
Neg logit contributions
rily-0.013
pole-0.012
izer-0.012
iche-0.012
cient-0.011
Token water
Token ID1660
Feature activation+2.70e-01

Pos logit contributions
Mom+0.014
 mom+0.013
 slowly+0.013
 soon+0.013
 owner+0.013
Neg logit contributions
rily-0.019
 cuc-0.018
ifying-0.018
iche-0.017
pole-0.017
Token.
Token ID13
Feature activation-1.99e-01

Pos logit contributions
rily+0.015
pole+0.014
iche+0.013
izer+0.013
cient+0.013
Neg logit contributions
 mom-0.007
 slowly-0.006
Mom-0.006
 soon-0.006
 care-0.006
Token It
Token ID632
Feature activation-4.74e-01

Pos logit contributions
 mom+0.007
 slowly+0.006
Mom+0.006
 soon+0.006
 care+0.006
Neg logit contributions
rily-0.009
pole-0.008
iche-0.008
izer-0.008
cient-0.008
Token"
Token ID1
Feature activation-1.24e+00

Pos logit contributions
 mom+0.007
 care+0.007
 luck+0.007
 bud+0.007
 words+0.006
Neg logit contributions
izer-0.013
eh-0.013
pole-0.013
rily-0.013
ifying-0.013
TokenI
Token ID40
Feature activation-6.85e-01

Pos logit contributions
 mom+0.038
 care+0.035
 hard+0.035
 slowly+0.034
 owner+0.034
Neg logit contributions
rily-0.057
iche-0.052
izer-0.052
ifying-0.051
pole-0.051
Token think
Token ID892
Feature activation-3.45e-01

Pos logit contributions
Mom+0.022
 mom+0.020
 owner+0.020
 slowly+0.020
 words+0.020
Neg logit contributions
iche-0.027
ifying-0.025
cient-0.025
venture-0.025
eh-0.024
Token we
Token ID356
Feature activation+1.36e-01

Pos logit contributions
Mom+0.008
 mom+0.008
 care+0.008
 slowly+0.008
 owner+0.008
Neg logit contributions
rily-0.018
pole-0.017
iche-0.016
venture-0.016
 cuc-0.016
Token ate
Token ID15063
Feature activation-5.66e-01

Pos logit contributions
ifying+0.006
iche+0.006
rily+0.006
cient+0.006
pole+0.006
Neg logit contributions
 mom-0.004
Mom-0.004
 owner-0.004
 slowly-0.004
 next-0.004
Token too
Token ID1165
Feature activation-2.55e+00

Pos logit contributions
 mom+0.015
Mom+0.015
 owner+0.015
 care+0.014
 slowly+0.014
Neg logit contributions
rily-0.028
pole-0.025
iche-0.025
ifying-0.024
 cuc-0.024
Token much
Token ID881
Feature activation-7.71e-01

Pos logit contributions
Mom+0.073
 owner+0.073
 stitches+0.073
 mom+0.072
Sp+0.072
Neg logit contributions
rily-0.096
 ours-0.095
umbs-0.090
eh-0.089
pole-0.085
Token l
Token ID300
Feature activation+1.32e-01

Pos logit contributions
 mom+0.021
Mom+0.020
 owner+0.019
 slowly+0.018
 brave+0.018
Neg logit contributions
rily-0.037
iche-0.036
 cuc-0.035
pole-0.035
izer-0.034
Tokenoll
Token ID692
Feature activation+1.43e-01

Pos logit contributions
rily+0.004
pole+0.003
ifying+0.003
iche+0.003
izer+0.003
Neg logit contributions
 mom-0.007
Mom-0.006
 slowly-0.006
 care-0.006
 soon-0.006
Tokenipop
Token ID42800
Feature activation-8.44e-01

Pos logit contributions
rily+0.005
pole+0.004
cient+0.004
iche+0.004
venture+0.004
Neg logit contributions
Mom-0.006
 mom-0.006
 owner-0.006
 slowly-0.006
 care-0.006
Token,"
Token ID553
Feature activation-9.30e-01

Pos logit contributions
 mom+0.023
Mom+0.023
 care+0.021
 slowly+0.020
 owner+0.020
Neg logit contributions
rily-0.043
pole-0.041
iche-0.037
umbs-0.037
venture-0.037
Tokenolded
Token ID48959
Feature activation+4.59e-03

Pos logit contributions
rily+0.012
pole+0.012
 cuc+0.011
iche+0.011
izer+0.011
Neg logit contributions
 mom-0.006
Mom-0.006
 slowly-0.006
 soon-0.006
 owner-0.006
Token him
Token ID683
Feature activation+1.32e-02

Pos logit contributions
rily+0.000
pole+0.000
izer+0.000
cient+0.000
iche+0.000
Neg logit contributions
 mom-0.000
Mom-0.000
 slowly-0.000
 soon-0.000
 owner-0.000
Token when
Token ID618
Feature activation-1.18e+00

Pos logit contributions
rily+0.001
pole+0.001
izer+0.001
venture+0.001
cient+0.001
Neg logit contributions
Mom-0.000
 mom-0.000
 slowly-0.000
 care-0.000
 owner-0.000
Token he
Token ID339
Feature activation-1.10e-02

Pos logit contributions
Mom+0.033
 slowly+0.029
 care+0.028
 owner+0.028
 mom+0.028
Neg logit contributions
rily-0.054
pole-0.051
venture-0.051
 cuc-0.050
iche-0.050
Token took
Token ID1718
Feature activation-5.66e-01

Pos logit contributions
Mom+0.000
 mom+0.000
 care+0.000
No+0.000
 moms+0.000
Neg logit contributions
pole-0.000
venture-0.000
cient-0.000
rily-0.000
iche-0.000
Token too
Token ID1165
Feature activation-2.55e+00

Pos logit contributions
Mom+0.015
 owner+0.014
 mom+0.014
 bud+0.013
 moms+0.013
Neg logit contributions
rily-0.028
izer-0.026
venture-0.025
pole-0.025
 stret-0.025
Token long
Token ID890
Feature activation+8.93e-02

Pos logit contributions
 moms+0.071
Mac+0.070
 mom+0.070
Mom+0.069
Jenn+0.065
Neg logit contributions
ifying-0.091
 ours-0.090
eh-0.090
umbs-0.084
rily-0.084
Token.
Token ID13
Feature activation-2.27e-01

Pos logit contributions
rily+0.005
pole+0.005
venture+0.005
izer+0.005
iche+0.005
Neg logit contributions
 mom-0.002
Mom-0.002
 slowly-0.002
 owner-0.001
 moms-0.001
Token
Token ID198
Feature activation-2.12e-01

Pos logit contributions
 mom+0.006
 slowly+0.006
Mom+0.006
 care+0.006
 soon+0.006
Neg logit contributions
rily-0.011
pole-0.011
iche-0.010
izer-0.010
cient-0.010
Token
Token ID198
Feature activation+2.40e-01

Pos logit contributions
 mom+0.005
 care+0.005
 slowly+0.005
 soon+0.005
 brave+0.004
Neg logit contributions
rily-0.011
pole-0.011
iche-0.010
izer-0.010
venture-0.010
TokenOne
Token ID3198
Feature activation-6.20e-01

Pos logit contributions
rily+0.012
pole+0.012
izer+0.011
iche+0.011
venture+0.011
Neg logit contributions
 mom-0.006
 care-0.006
 slowly-0.006
 hard-0.005
 soon-0.005
Token water
Token ID1660
Feature activation+5.14e-01

Pos logit contributions
rily+0.031
 cuc+0.029
ifying+0.028
iche+0.028
ulp+0.027
Neg logit contributions
 owner-0.027
 soon-0.026
 mom-0.026
 slowly-0.026
Mom-0.026
Token.
Token ID13
Feature activation-3.26e-01

Pos logit contributions
rily+0.031
pole+0.030
izer+0.029
iche+0.029
 cuc+0.028
Neg logit contributions
 mom-0.009
Mom-0.008
 slowly-0.008
 care-0.008
 soon-0.008
Token
Token ID198
Feature activation-5.29e-01

Pos logit contributions
 mom+0.008
 slowly+0.007
Mom+0.007
 soon+0.007
 care+0.007
Neg logit contributions
rily-0.018
pole-0.017
iche-0.017
izer-0.017
cient-0.016
Token
Token ID198
Feature activation-2.50e-01

Pos logit contributions
 mom+0.013
 care+0.013
 slowly+0.012
 owner+0.012
 brave+0.012
Neg logit contributions
rily-0.027
iche-0.025
pole-0.025
izer-0.024
venture-0.023
Token"
Token ID1
Feature activation-1.02e+00

Pos logit contributions
 mom+0.005
 care+0.005
 words+0.005
 luck+0.005
 hard+0.005
Neg logit contributions
rily-0.013
eh-0.013
pole-0.012
iche-0.012
izer-0.012
TokenDr
Token ID6187
Feature activation-2.54e+00

Pos logit contributions
 mom+0.034
 slowly+0.032
 soon+0.031
 owner+0.030
 brave+0.030
Neg logit contributions
rily-0.045
iche-0.042
izer-0.042
pole-0.042
ifying-0.041
Tokenink
Token ID676
Feature activation+2.33e-02

Pos logit contributions
 mom+0.069
 slowly+0.061
 soon+0.061
 owner+0.061
Mom+0.059
Neg logit contributions
rily-0.128
izer-0.120
iche-0.120
pole-0.117
umbs-0.113
Token some
Token ID617
Feature activation-1.70e-01

Pos logit contributions
rily+0.001
pole+0.001
iche+0.001
izer+0.001
cient+0.001
Neg logit contributions
 mom-0.001
 slowly-0.001
Mom-0.001
 soon-0.001
 care-0.001
Token water
Token ID1660
Feature activation+6.20e-02

Pos logit contributions
 mom+0.005
Mom+0.005
 slowly+0.005
 owner+0.005
 soon+0.005
Neg logit contributions
rily-0.008
iche-0.007
ifying-0.007
 cuc-0.007
pole-0.007
Token,
Token ID11
Feature activation+5.07e-01

Pos logit contributions
rily+0.003
pole+0.003
izer+0.003
iche+0.003
cient+0.003
Neg logit contributions
 mom-0.001
Mom-0.001
 slowly-0.001
 owner-0.001
 soon-0.001
Token Lily
Token ID20037
Feature activation-1.10e+00

Pos logit contributions
rily+0.025
pole+0.024
ifying+0.023
 cuc+0.023
iche+0.023
Neg logit contributions
 slowly-0.013
 care-0.013
 owner-0.012
 mom-0.012
Mom-0.012
Token make
Token ID787
Feature activation+4.35e-02

Pos logit contributions
 mom+0.001
Mom+0.001
 slowly+0.000
 owner+0.000
 soon+0.000
Neg logit contributions
rily-0.001
pole-0.001
izer-0.001
iche-0.001
ifying-0.001
Token us
Token ID514
Feature activation+1.20e-01

Pos logit contributions
rily+0.002
pole+0.002
izer+0.002
iche+0.002
cient+0.002
Neg logit contributions
Mom-0.001
 slowly-0.001
 bud-0.001
 owner-0.001
 mom-0.001
Token dirty
Token ID11841
Feature activation-8.88e-01

Pos logit contributions
rily+0.006
pole+0.006
iche+0.006
izer+0.006
ifying+0.005
Neg logit contributions
 mom-0.003
Mom-0.003
 slowly-0.003
 owner-0.003
 soon-0.003
Token and
Token ID290
Feature activation+2.57e-01

Pos logit contributions
 mom+0.023
Mom+0.023
 care+0.021
 moms+0.021
 owner+0.021
Neg logit contributions
rily-0.047
pole-0.042
izer-0.040
 stret-0.039
iche-0.039
Token wet
Token ID9583
Feature activation-9.81e-01

Pos logit contributions
rily+0.013
pole+0.012
iche+0.012
ifying+0.012
 cuc+0.011
Neg logit contributions
 mom-0.007
Mom-0.007
 slowly-0.006
 owner-0.006
 soon-0.006
Token too
Token ID1165
Feature activation-2.54e+00

Pos logit contributions
 mom+0.024
Mom+0.022
 care+0.021
 slowly+0.021
 owner+0.021
Neg logit contributions
rily-0.053
pole-0.048
izer-0.047
 stret-0.045
iche-0.045
Token.
Token ID13
Feature activation+7.58e-02

Pos logit contributions
 mom+0.067
Mom+0.064
 moms+0.062
 owner+0.061
 care+0.060
Neg logit contributions
rily-0.122
pole-0.114
iche-0.112
ifying-0.110
umbs-0.106
Token Look
Token ID6803
Feature activation+1.15e-01

Pos logit contributions
rily+0.003
pole+0.003
izer+0.003
iche+0.003
ifying+0.003
Neg logit contributions
 mom-0.003
 slowly-0.003
 soon-0.002
Mom-0.002
 care-0.002
Token,
Token ID11
Feature activation+7.10e-02

Pos logit contributions
rily+0.007
pole+0.006
cient+0.006
venture+0.006
iche+0.006
Neg logit contributions
Mom-0.002
 owner-0.002
 mom-0.002
 slowly-0.002
 care-0.002
Token he
Token ID339
Feature activation-1.28e-02

Pos logit contributions
rily+0.004
pole+0.003
ifying+0.003
izer+0.003
 cuc+0.003
Neg logit contributions
 slowly-0.002
Mom-0.002
 mom-0.002
 care-0.002
 owner-0.002
Token has
Token ID468
Feature activation+2.93e-01

Pos logit contributions
 mom+0.000
 slowly+0.000
 soon+0.000
 care+0.000
Mom+0.000
Neg logit contributions
rily-0.001
pole-0.001
izer-0.001
iche-0.001
 cuc-0.001
Token
Token ID198
Feature activation+6.17e-02

Pos logit contributions
rily+0.020
pole+0.018
iche+0.018
izer+0.018
 cuc+0.017
Neg logit contributions
 care-0.009
 mom-0.008
 slowly-0.008
 soon-0.008
 love-0.008
Token
Token ID198
Feature activation+2.91e-01

Pos logit contributions
rily+0.003
iche+0.003
izer+0.002
pole+0.002
ulp+0.002
Neg logit contributions
 care-0.002
 bud-0.002
 slowly-0.002
 brave-0.002
 luck-0.002
Token"
Token ID1
Feature activation-9.56e-01

Pos logit contributions
rily+0.015
iche+0.014
izer+0.014
eh+0.013
pole+0.013
Neg logit contributions
 words-0.006
 care-0.006
 love-0.006
 luck-0.006
 hard-0.005
TokenYou
Token ID1639
Feature activation-2.23e-01

Pos logit contributions
 mom+0.027
 care+0.027
 slowly+0.026
 soon+0.025
 hard+0.025
Neg logit contributions
rily-0.044
ifying-0.042
iche-0.041
pole-0.041
izer-0.040
Token mixed
Token ID7668
Feature activation-8.33e-01

Pos logit contributions
 mom+0.006
 slowly+0.006
Mom+0.006
 owner+0.006
 soon+0.006
Neg logit contributions
rily-0.010
pole-0.010
ifying-0.010
iche-0.010
cient-0.010
Token too
Token ID1165
Feature activation-2.52e+00

Pos logit contributions
 owner+0.023
 mom+0.022
 brave+0.021
Mom+0.021
 moms+0.021
Neg logit contributions
rily-0.041
iche-0.036
 cuc-0.036
pole-0.036
ays-0.035
Token much
Token ID881
Feature activation-1.05e+00

Pos logit contributions
Mac+0.072
 owner+0.070
 mom+0.065
 moms+0.065
Jay+0.065
Neg logit contributions
 ours-0.093
rily-0.093
eh-0.087
iche-0.086
umbs-0.086
Token,
Token ID11
Feature activation-2.73e-01

Pos logit contributions
 mom+0.032
 owner+0.031
 brave+0.030
Mom+0.029
 slowly+0.029
Neg logit contributions
iche-0.044
 cuc-0.044
rily-0.043
ays-0.041
ulp-0.040
Token my
Token ID616
Feature activation-3.90e-01

Pos logit contributions
 care+0.007
 slowly+0.007
Mom+0.007
 mom+0.007
 owner+0.007
Neg logit contributions
rily-0.014
pole-0.013
 cuc-0.012
iche-0.012
izer-0.012
Token sweet
Token ID6029
Feature activation-5.44e-01

Pos logit contributions
 slowly+0.012
Mom+0.012
 soon+0.011
 care+0.011
Tom+0.011
Neg logit contributions
rily-0.016
iche-0.016
 cuc-0.016
cient-0.015
ifying-0.015
Token,"
Token ID553
Feature activation-5.95e-01

Pos logit contributions
 slowly+0.019
Mom+0.018
 soon+0.018
 brave+0.018
 hard+0.017
Neg logit contributions
rily-0.023
iche-0.022
pole-0.021
izer-0.021
 cuc-0.020
Token.
Token ID13
Feature activation-1.39e-01

Pos logit contributions
 mom+0.021
Mom+0.020
 care+0.019
 brave+0.017
 slowly+0.017
Neg logit contributions
rily-0.050
pole-0.047
izer-0.046
ifying-0.045
venture-0.045
Token They
Token ID1119
Feature activation+2.57e-01

Pos logit contributions
 mom+0.005
 slowly+0.005
Mom+0.005
 soon+0.005
 care+0.005
Neg logit contributions
rily-0.006
pole-0.006
iche-0.005
izer-0.005
cient-0.005
Token should
Token ID815
Feature activation+1.02e-01

Pos logit contributions
pole+0.010
rily+0.010
cient+0.010
ifying+0.010
 cuc+0.010
Neg logit contributions
 mom-0.007
Mom-0.007
 things-0.006
 hard-0.006
 words-0.006
Token not
Token ID407
Feature activation-1.93e-01

Pos logit contributions
rily+0.005
pole+0.005
izer+0.005
iche+0.005
melon+0.005
Neg logit contributions
 slowly-0.002
 mom-0.002
Mom-0.002
 owner-0.002
 soon-0.002
Token go
Token ID467
Feature activation-5.44e-01

Pos logit contributions
 mom+0.006
Mom+0.005
 slowly+0.005
 brave+0.005
 hard+0.005
Neg logit contributions
rily-0.009
pole-0.009
iche-0.009
izer-0.008
ifying-0.008
Token too
Token ID1165
Feature activation-2.51e+00

Pos logit contributions
 mom+0.015
Mom+0.015
 slowly+0.014
 care+0.014
 owner+0.013
Neg logit contributions
rily-0.027
pole-0.025
iche-0.024
ifying-0.024
izer-0.024
Token far
Token ID1290
Feature activation-6.05e-01

Pos logit contributions
Mac+0.084
Jenn+0.082
Mom+0.080
 words+0.080
 mom+0.080
Neg logit contributions
rily-0.083
ifying-0.081
arium-0.079
 ours-0.078
eh-0.075
Token from
Token ID422
Feature activation-1.10e-01

Pos logit contributions
 mom+0.015
Mom+0.014
 slowly+0.013
 owner+0.013
 care+0.013
Neg logit contributions
rily-0.033
pole-0.030
izer-0.029
ifying-0.029
iche-0.029
Token their
Token ID511
Feature activation+3.04e-01

Pos logit contributions
Mom+0.003
 mom+0.003
 care+0.003
 owner+0.003
 slowly+0.003
Neg logit contributions
rily-0.005
izer-0.005
iche-0.005
cient-0.005
 films-0.005
Token cabin
Token ID9351
Feature activation-9.55e-01

Pos logit contributions
 ranch+0.009
rily+0.008
 stret+0.008
 films+0.008
 cinema+0.008
Neg logit contributions
No-0.013
Mom-0.013
Tom-0.013
Mac-0.012
 herself-0.012
Token.
Token ID13
Feature activation-1.93e-01

Pos logit contributions
Mom+0.020
 mom+0.020
 care+0.018
 love+0.016
 moms+0.016
Neg logit contributions
rily-0.054
pole-0.051
izer-0.048
ifying-0.048
cient-0.047
Token nightmare
Token ID17123
Feature activation+3.47e-03

Pos logit contributions
 mom+0.009
Mom+0.009
 slowly+0.009
 soon+0.008
 care+0.008
Neg logit contributions
rily-0.018
pole-0.018
iche-0.018
izer-0.018
ifying-0.018
Token if
Token ID611
Feature activation-6.99e-01

Pos logit contributions
rily+0.000
pole+0.000
iche+0.000
izer+0.000
venture+0.000
Neg logit contributions
 mom-0.000
Mom-0.000
 slowly-0.000
 care-0.000
 owner-0.000
Token we
Token ID356
Feature activation+2.26e-01

Pos logit contributions
Mom+0.023
 owner+0.021
 mom+0.021
 slowly+0.021
 moms+0.019
Neg logit contributions
rily-0.028
venture-0.027
pole-0.027
iche-0.026
cient-0.026
Token stay
Token ID2652
Feature activation-3.54e-01

Pos logit contributions
rily+0.010
cient+0.010
pole+0.010
ifying+0.009
iche+0.009
Neg logit contributions
Mom-0.007
 mom-0.007
Tom-0.006
 owner-0.006
 hard-0.006
Token out
Token ID503
Feature activation+4.54e-01

Pos logit contributions
 mom+0.010
Mom+0.009
 slowly+0.009
 care+0.009
 owner+0.009
Neg logit contributions
rily-0.018
pole-0.017
izer-0.016
iche-0.016
ifying-0.016
Token too
Token ID1165
Feature activation-2.50e+00

Pos logit contributions
rily+0.025
pole+0.024
iche+0.023
izer+0.023
ifying+0.023
Neg logit contributions
 mom-0.011
 slowly-0.009
Mom-0.009
 care-0.009
 owner-0.009
Token long
Token ID890
Feature activation-1.61e-01

Pos logit contributions
Mom+0.055
 moms+0.053
 care+0.053
 mom+0.051
Mac+0.050
Neg logit contributions
rily-0.125
ifying-0.117
pole-0.115
 battlefield-0.108
cient-0.107
Token.
Token ID13
Feature activation+2.35e-01

Pos logit contributions
 mom+0.003
Mom+0.003
 slowly+0.003
 care+0.003
 owner+0.003
Neg logit contributions
rily-0.009
pole-0.009
iche-0.009
izer-0.009
ifying-0.008
Tokenâ
Token ID22940
Feature activation+9.44e-01

Pos logit contributions
rily+0.011
pole+0.010
izer+0.009
iche+0.009
cient+0.009
Neg logit contributions
 mom-0.009
Mom-0.008
 Mom-0.008
 soon-0.008
 slowly-0.008
Token€
Token ID26391
Feature activation+1.49e-01

Pos logit contributions
rily+0.036
pole+0.031
 cuc+0.030
ifying+0.030
iche+0.029
Neg logit contributions
 mom-0.039
 care-0.039
 slowly-0.038
Mom-0.038
 soon-0.037
Token
Token ID198
Feature activation+3.31e-02

Pos logit contributions
rily+0.006
pole+0.005
iche+0.005
izer+0.005
cient+0.005
Neg logit contributions
 mom-0.005
 slowly-0.005
Mom-0.005
 owner-0.005
 care-0.005
Token you
Token ID345
Feature activation+6.59e-02

Pos logit contributions
rily+0.012
pole+0.012
iche+0.011
ifying+0.011
venture+0.011
Neg logit contributions
 mom-0.007
 slowly-0.007
Mom-0.007
 owner-0.007
 care-0.007
Token okay
Token ID8788
Feature activation-9.81e-01

Pos logit contributions
rily+0.003
pole+0.003
iche+0.003
izer+0.003
ifying+0.003
Neg logit contributions
 mom-0.002
 slowly-0.002
Mom-0.002
 care-0.002
 owner-0.002
Token?
Token ID30
Feature activation+5.35e-01

Pos logit contributions
 slowly+0.034
 mom+0.034
 care+0.034
Mom+0.034
 owner+0.033
Neg logit contributions
rily-0.040
pole-0.036
iche-0.036
izer-0.035
cient-0.034
Token You
Token ID921
Feature activation+1.15e-01

Pos logit contributions
rily+0.022
pole+0.020
iche+0.019
izer+0.019
ifying+0.019
Neg logit contributions
 mom-0.021
 slowly-0.020
Mom-0.019
 soon-0.019
 care-0.019
Token ate
Token ID15063
Feature activation-6.56e-01

Pos logit contributions
rily+0.005
pole+0.005
iche+0.005
cient+0.005
ifying+0.005
Neg logit contributions
 mom-0.003
Mom-0.003
 slowly-0.003
 owner-0.003
 soon-0.003
Token too
Token ID1165
Feature activation-2.50e+00

Pos logit contributions
 mom+0.017
 owner+0.016
Mom+0.016
 slowly+0.016
 care+0.015
Neg logit contributions
rily-0.034
pole-0.030
iche-0.030
 cuc-0.030
ifying-0.029
Token fast
Token ID3049
Feature activation-4.93e-01

Pos logit contributions
 mom+0.083
 bud+0.078
 owner+0.078
 moms+0.076
Mom+0.075
Neg logit contributions
eh-0.088
umbs-0.087
rily-0.083
 ours-0.083
iche-0.081
Token.
Token ID13
Feature activation+4.17e-01

Pos logit contributions
 mom+0.013
Mom+0.013
 owner+0.012
 slowly+0.012
 moms+0.011
Neg logit contributions
rily-0.024
pole-0.022
 cuc-0.022
izer-0.022
iche-0.022
Token You
Token ID921
Feature activation+4.72e-02

Pos logit contributions
rily+0.017
pole+0.015
iche+0.015
izer+0.015
cient+0.014
Neg logit contributions
 mom-0.017
Mom-0.015
 slowly-0.015
 care-0.015
 soon-0.015
Token need
Token ID761
Feature activation-4.67e-01

Pos logit contributions
rily+0.002
pole+0.002
iche+0.002
cient+0.002
ifying+0.002
Neg logit contributions
 mom-0.001
Mom-0.001
 slowly-0.001
 owner-0.001
 soon-0.001
Token to
Token ID284
Feature activation+6.15e-01

Pos logit contributions
Mom+0.010
 mom+0.010
 slowly+0.010
 brave+0.010
 soon+0.010
Neg logit contributions
 cuc-0.022
iche-0.021
rily-0.021
ulp-0.021
ifying-0.020
Token He
Token ID679
Feature activation-3.39e-01

Pos logit contributions
rily+0.006
pole+0.006
izer+0.006
cient+0.006
ifying+0.006
Neg logit contributions
 slowly-0.008
 mom-0.008
 soon-0.008
 hard-0.008
Mom-0.008
Token likes
Token ID7832
Feature activation-2.19e-01

Pos logit contributions
Mom+0.009
 mom+0.008
 slowly+0.007
 soon+0.007
 Mom+0.007
Neg logit contributions
rily-0.017
pole-0.017
cient-0.016
venture-0.016
iche-0.016
Token hugs
Token ID40657
Feature activation-6.46e-01

Pos logit contributions
Mom+0.006
 slowly+0.006
 mom+0.005
 brave+0.005
 soon+0.005
Neg logit contributions
rily-0.010
 cuc-0.010
pole-0.010
 films-0.009
cient-0.009
Token and
Token ID290
Feature activation-3.68e-01

Pos logit contributions
 mom+0.017
 slowly+0.016
 brave+0.016
 hard+0.016
 care+0.016
Neg logit contributions
rily-0.029
pole-0.029
izer-0.026
cient-0.026
 films-0.026
Token kisses
Token ID36544
Feature activation-1.27e+00

Pos logit contributions
Mom+0.010
 mom+0.009
 slowly+0.009
 soon+0.009
 owner+0.008
Neg logit contributions
rily-0.017
 cuc-0.016
pole-0.016
izer-0.016
cient-0.016
Token too
Token ID1165
Feature activation-2.48e+00

Pos logit contributions
 brave+0.033
 bud+0.032
 mom+0.031
 nos+0.031
 moms+0.030
Neg logit contributions
rily-0.056
pole-0.054
ifying-0.051
umbs-0.048
cient-0.048
Token.
Token ID13
Feature activation-2.89e-01

Pos logit contributions
 mom+0.065
Mom+0.064
 moms+0.063
 care+0.062
 owner+0.059
Neg logit contributions
rily-0.112
pole-0.105
iche-0.102
ifying-0.102
venture-0.099
Token But
Token ID887
Feature activation-9.64e-01

Pos logit contributions
 mom+0.009
 slowly+0.009
 soon+0.009
Mom+0.009
 care+0.009
Neg logit contributions
rily-0.013
pole-0.012
izer-0.012
iche-0.012
ifying-0.011
Token be
Token ID307
Feature activation-1.17e+00

Pos logit contributions
Mom+0.026
 care+0.025
 slowly+0.024
 owner+0.024
 mom+0.023
Neg logit contributions
pole-0.043
rily-0.040
 cuc-0.038
cient-0.037
venture-0.036
Token gentle
Token ID10296
Feature activation-1.34e+00

Pos logit contributions
 mom+0.022
 care+0.019
 slowly+0.018
 moms+0.018
Mom+0.018
Neg logit contributions
rily-0.067
pole-0.066
iche-0.064
eh-0.063
izer-0.063
Token and
Token ID290
Feature activation-9.11e-02

Pos logit contributions
 mom+0.034
 care+0.032
Mom+0.032
 moms+0.030
 bud+0.030
Neg logit contributions
rily-0.066
pole-0.063
izer-0.059
iche-0.058
 films-0.057
Token need
Token ID761
Feature activation-2.82e-01

Pos logit contributions
 mom+0.004
Mom+0.004
 slowly+0.004
 owner+0.004
 soon+0.003
Neg logit contributions
rily-0.008
pole-0.007
iche-0.007
cient-0.007
ifying-0.007
Token because
Token ID780
Feature activation-8.60e-01

Pos logit contributions
 mom+0.005
 slowly+0.004
Mom+0.004
 care+0.004
 soon+0.004
Neg logit contributions
rily-0.018
pole-0.017
iche-0.016
izer-0.016
cient-0.016
Token if
Token ID611
Feature activation-1.16e+00

Pos logit contributions
 mom+0.023
Mom+0.022
 slowly+0.021
 care+0.021
 owner+0.020
Neg logit contributions
rily-0.043
pole-0.040
iche-0.039
ifying-0.038
 cuc-0.038
Token we
Token ID356
Feature activation-6.97e-02

Pos logit contributions
Mom+0.039
 mom+0.034
 slowly+0.034
 owner+0.034
 care+0.034
Neg logit contributions
rily-0.048
pole-0.044
cient-0.044
 cuc-0.043
venture-0.042
Token take
Token ID1011
Feature activation-5.37e-01

Pos logit contributions
Mom+0.002
 mom+0.002
 owner+0.002
 slowly+0.002
Tom+0.002
Neg logit contributions
rily-0.003
cient-0.003
pole-0.003
ifying-0.003
iche-0.003
Token too
Token ID1165
Feature activation-2.47e+00

Pos logit contributions
Mom+0.012
 mom+0.011
 owner+0.010
 bud+0.010
Tom+0.009
Neg logit contributions
rily-0.029
iche-0.027
izer-0.027
venture-0.026
pole-0.026
Token much
Token ID881
Feature activation-8.90e-01

Pos logit contributions
 mom+0.064
Mac+0.063
Mom+0.063
Jenn+0.062
 moms+0.061
Neg logit contributions
ifying-0.106
rily-0.104
 ours-0.102
iche-0.094
pole-0.093
Token,
Token ID11
Feature activation-3.03e-01

Pos logit contributions
 mom+0.017
 slowly+0.015
Mom+0.014
 soon+0.014
 owner+0.014
Neg logit contributions
rily-0.054
pole-0.051
iche-0.050
izer-0.049
 cuc-0.048
Token somebody
Token ID8276
Feature activation+9.94e-01

Pos logit contributions
Mom+0.007
 mom+0.007
 slowly+0.007
 care+0.007
 owner+0.006
Neg logit contributions
rily-0.015
pole-0.014
ifying-0.014
iche-0.014
 cuc-0.014
Token else
Token ID2073
Feature activation+4.08e-02

Pos logit contributions
rily+0.055
pole+0.053
iche+0.051
cient+0.050
venture+0.050
Neg logit contributions
 mom-0.022
Mom-0.021
 slowly-0.020
 owner-0.019
 soon-0.018
Token can
Token ID460
Feature activation-2.09e-01

Pos logit contributions
rily+0.002
pole+0.002
cient+0.002
venture+0.002
iche+0.002
Neg logit contributions
 mom-0.001
Mom-0.001
 slowly-0.001
 owner-0.001
 moms-0.001
Token please
Token ID3387
Feature activation-1.48e+00

Pos logit contributions
rily+0.013
pole+0.013
 cuc+0.012
izer+0.012
iche+0.012
Neg logit contributions
 slowly-0.008
 care-0.008
 mom-0.008
Mom-0.008
 soon-0.008
Token.
Token ID13
Feature activation+7.54e-02

Pos logit contributions
Mom+0.051
No+0.048
 slowly+0.047
 mom+0.046
 moms+0.046
Neg logit contributions
pole-0.054
rily-0.049
cient-0.049
 cuc-0.048
izer-0.047
Token I
Token ID314
Feature activation-6.44e-01

Pos logit contributions
rily+0.003
pole+0.003
ifying+0.003
izer+0.002
cient+0.002
Neg logit contributions
 mom-0.003
 slowly-0.003
 care-0.003
 soon-0.003
Mom-0.003
Token like
Token ID588
Feature activation-5.09e-01

Pos logit contributions
Mom+0.022
 mom+0.021
 moms+0.020
 Mom+0.019
 slowly+0.019
Neg logit contributions
iche-0.024
venture-0.024
pole-0.023
 gra-0.022
izer-0.022
Token books
Token ID3835
Feature activation-1.27e+00

Pos logit contributions
 mom+0.015
 slowly+0.013
Mom+0.013
 care+0.013
 soon+0.013
Neg logit contributions
rily-0.026
pole-0.024
iche-0.024
izer-0.024
ifying-0.023
Token too
Token ID1165
Feature activation-2.46e+00

Pos logit contributions
 mom+0.030
 care+0.027
Mom+0.026
 moms+0.025
 owner+0.025
Neg logit contributions
rily-0.068
pole-0.066
iche-0.061
venture-0.060
ifying-0.060
Token.
Token ID13
Feature activation+2.78e-01

Pos logit contributions
 mom+0.055
 care+0.052
Mom+0.051
 slowly+0.049
 moms+0.049
Neg logit contributions
rily-0.127
pole-0.122
iche-0.117
venture-0.114
umbs-0.114
Token Can
Token ID1680
Feature activation-2.36e-01

Pos logit contributions
rily+0.011
pole+0.010
iche+0.010
izer+0.010
ifying+0.010
Neg logit contributions
 mom-0.011
 slowly-0.010
Mom-0.010
 care-0.010
 soon-0.010
Token I
Token ID314
Feature activation-1.07e+00

Pos logit contributions
 slowly+0.009
 care+0.009
 brave+0.009
 hard+0.009
 owner+0.009
Neg logit contributions
pole-0.008
rily-0.007
cient-0.007
ifying-0.007
venture-0.007
Token see
Token ID766
Feature activation+8.95e-01

Pos logit contributions
 slowly+0.027
 mom+0.027
 hard+0.027
Mom+0.027
 things+0.026
Neg logit contributions
pole-0.047
iche-0.045
ifying-0.044
cient-0.044
rily-0.044
Token that
Token ID326
Feature activation-7.34e-01

Pos logit contributions
rily+0.050
pole+0.048
iche+0.046
cient+0.045
izer+0.045
Neg logit contributions
 mom-0.019
 slowly-0.018
Mom-0.018
 care-0.017
 owner-0.017
Token next
Token ID1306
Feature activation-1.31e+00

Pos logit contributions
rily+0.011
pole+0.011
venture+0.010
iche+0.010
ifying+0.010
Neg logit contributions
Mom-0.004
 mom-0.004
Tom-0.004
 owner-0.004
No-0.004
Token door
Token ID3420
Feature activation-1.29e-01

Pos logit contributions
 mom+0.047
Mom+0.044
 slowly+0.043
 owner+0.042
 soon+0.042
Neg logit contributions
rily-0.056
pole-0.052
iche-0.051
izer-0.050
venture-0.049
Token will
Token ID481
Feature activation-1.14e+00

Pos logit contributions
Mom+0.003
 mom+0.003
 care+0.002
 owner+0.002
Tom+0.002
Neg logit contributions
rily-0.007
pole-0.007
venture-0.006
ifying-0.006
cient-0.006
Token bloom
Token ID29955
Feature activation-6.15e-01

Pos logit contributions
 mom+0.030
Mom+0.030
 moms+0.028
 owner+0.028
 slowly+0.027
Neg logit contributions
rily-0.055
pole-0.052
iche-0.049
izer-0.048
 cuc-0.048
Token again
Token ID757
Feature activation-1.09e+00

Pos logit contributions
 mom+0.013
Mom+0.011
 care+0.011
 owner+0.011
 slowly+0.010
Neg logit contributions
rily-0.036
pole-0.034
iche-0.032
ifying-0.032
izer-0.032
Token too
Token ID1165
Feature activation-2.44e+00

Pos logit contributions
 mom+0.021
Mom+0.020
 moms+0.019
 care+0.019
 owner+0.018
Neg logit contributions
rily-0.060
pole-0.056
iche-0.055
izer-0.054
ifying-0.054
Token.
Token ID13
Feature activation-5.10e-02

Pos logit contributions
 mom+0.051
Mom+0.050
 moms+0.049
 owner+0.048
 care+0.047
Neg logit contributions
rily-0.134
pole-0.120
iche-0.119
ifying-0.118
ulp-0.115
Token<|endoftext|>
Token ID50256
Feature activation-4.94e-01

Pos logit contributions
 mom+0.001
Mom+0.001
 slowly+0.001
 soon+0.001
 owner+0.001
Neg logit contributions
rily-0.003
pole-0.003
iche-0.003
izer-0.003
cient-0.003
TokenOnce
Token ID7454
Feature activation+2.98e-01

Pos logit contributions
 mom+0.016
 slowly+0.015
Mom+0.015
 soon+0.014
 care+0.014
Neg logit contributions
rily-0.024
pole-0.022
iche-0.021
izer-0.021
cient-0.020
Token upon
Token ID2402
Feature activation+7.01e-01

Pos logit contributions
rily+0.015
pole+0.014
izer+0.013
iche+0.013
ifying+0.013
Neg logit contributions
 mom-0.010
 slowly-0.009
Mom-0.008
 soon-0.008
 hard-0.008
Token a
Token ID257
Feature activation+7.87e-01

Pos logit contributions
rily+0.048
pole+0.046
iche+0.044
izer+0.044
cient+0.044
Neg logit contributions
 mom-0.009
 slowly-0.007
Mom-0.007
 soon-0.006
 care-0.006
Token cake
Token ID12187
Feature activation+5.14e-01

Pos logit contributions
rily+0.038
pole+0.036
iche+0.034
izer+0.034
 cuc+0.033
Neg logit contributions
 mom-0.027
Mom-0.026
 slowly-0.025
 care-0.025
 owner-0.025
Token.
Token ID13
Feature activation-5.92e-01

Pos logit contributions
rily+0.030
izer+0.028
pole+0.028
iche+0.027
cient+0.027
Neg logit contributions
 mom-0.011
 slowly-0.010
 soon-0.010
 now-0.010
 owner-0.009
Token The
Token ID383
Feature activation-5.90e-01

Pos logit contributions
 mom+0.017
Mom+0.015
 slowly+0.015
 soon+0.014
 Mom+0.014
Neg logit contributions
rily-0.031
pole-0.029
izer-0.029
iche-0.028
cient-0.028
Token oven
Token ID14361
Feature activation+2.76e-01

Pos logit contributions
 mom+0.017
 slowly+0.016
 care+0.016
Mom+0.016
 soon+0.016
Neg logit contributions
rily-0.028
pole-0.027
iche-0.026
 cuc-0.026
izer-0.026
Token was
Token ID373
Feature activation-8.27e-01

Pos logit contributions
rily+0.015
pole+0.015
iche+0.014
venture+0.014
melon+0.014
Neg logit contributions
 mom-0.006
Mom-0.006
 care-0.006
 slowly-0.005
 owner-0.005
Token too
Token ID1165
Feature activation-2.43e+00

Pos logit contributions
 mom+0.020
 care+0.018
Mom+0.018
 slowly+0.018
 soon+0.017
Neg logit contributions
rily-0.045
pole-0.042
iche-0.041
izer-0.040
ifying-0.040
Token hot
Token ID3024
Feature activation-1.20e-01

Pos logit contributions
 care+0.054
 mom+0.054
Mom+0.051
 owner+0.050
 moms+0.049
Neg logit contributions
rily-0.127
iche-0.121
pole-0.115
eh-0.114
ifying-0.113
Token and
Token ID290
Feature activation-6.25e-01

Pos logit contributions
 mom+0.002
 slowly+0.002
 soon+0.002
 now+0.002
 care+0.002
Neg logit contributions
rily-0.008
pole-0.007
iche-0.007
izer-0.007
cient-0.007
Token the
Token ID262
Feature activation+6.93e-02

Pos logit contributions
Mom+0.016
 care+0.016
 owner+0.015
 mom+0.015
Tom+0.015
Neg logit contributions
rily-0.030
pole-0.027
ifying-0.027
 cuc-0.027
iche-0.027
Token cake
Token ID12187
Feature activation+4.86e-01

Pos logit contributions
rily+0.003
pole+0.003
iche+0.003
izer+0.003
 cuc+0.003
Neg logit contributions
 mom-0.002
 slowly-0.002
Mom-0.002
 care-0.002
 soon-0.002
Token burned
Token ID11544
Feature activation-6.65e-01

Pos logit contributions
rily+0.028
pole+0.027
iche+0.026
ifying+0.025
izer+0.025
Neg logit contributions
 mom-0.009
Mom-0.009
 care-0.008
 slowly-0.008
 owner-0.008
TokenSure
Token ID19457
Feature activation-1.05e-01

Pos logit contributions
 mom+0.018
 slowly+0.016
 soon+0.016
 care+0.015
 now+0.015
Neg logit contributions
rily-0.039
pole-0.038
 cuc-0.037
ifying-0.037
iche-0.036
Token,
Token ID11
Feature activation+5.83e-01

Pos logit contributions
Mom+0.003
 slowly+0.003
Sp+0.002
 owner+0.002
 soon+0.002
Neg logit contributions
pole-0.004
rily-0.004
 cuc-0.004
izer-0.004
ulp-0.004
Token I
Token ID314
Feature activation-3.77e-01

Pos logit contributions
rily+0.028
pole+0.026
 cuc+0.025
iche+0.025
izer+0.025
Neg logit contributions
 mom-0.017
 slowly-0.016
Mom-0.016
 care-0.016
 owner-0.016
Token like
Token ID588
Feature activation-3.82e-01

Pos logit contributions
 mom+0.012
Mom+0.011
 slowly+0.011
 moms+0.011
 owner+0.010
Neg logit contributions
iche-0.016
pole-0.015
venture-0.015
rily-0.015
izer-0.015
Token buttons
Token ID12163
Feature activation-2.83e-01

Pos logit contributions
 mom+0.011
Mom+0.011
 slowly+0.011
 owner+0.011
 soon+0.011
Neg logit contributions
rily-0.017
pole-0.016
 cuc-0.016
izer-0.015
iche-0.015
Token too
Token ID1165
Feature activation-2.43e+00

Pos logit contributions
Mom+0.008
 mom+0.008
 owner+0.007
 moms+0.007
 slowly+0.007
Neg logit contributions
rily-0.014
pole-0.013
ays-0.012
ifying-0.012
umbs-0.012
Token.
Token ID13
Feature activation+2.52e-01

Pos logit contributions
 mom+0.069
Mom+0.067
 care+0.066
 moms+0.066
 owner+0.063
Neg logit contributions
rily-0.109
pole-0.102
 ours-0.095
iche-0.095
ifying-0.094
Token Come
Token ID7911
Feature activation-1.93e-01

Pos logit contributions
rily+0.010
pole+0.009
ifying+0.009
izer+0.009
iche+0.009
Neg logit contributions
 mom-0.010
 slowly-0.009
 soon-0.009
 care-0.009
 owner-0.009
Token in
Token ID287
Feature activation+1.88e-01

Pos logit contributions
 mom+0.005
 slowly+0.005
Mom+0.005
 care+0.004
 owner+0.004
Neg logit contributions
rily-0.010
pole-0.010
izer-0.009
iche-0.009
ifying-0.009
Token,
Token ID11
Feature activation-2.83e-02

Pos logit contributions
rily+0.011
pole+0.010
izer+0.010
iche+0.010
ifying+0.010
Neg logit contributions
 mom-0.004
Mom-0.003
 slowly-0.003
 owner-0.003
 care-0.003
Token come
Token ID1282
Feature activation-3.79e-02

Pos logit contributions
 mom+0.001
 slowly+0.001
Mom+0.001
 owner+0.001
 care+0.001
Neg logit contributions
rily-0.001
pole-0.001
iche-0.001
izer-0.001
 cuc-0.001
Token will
Token ID481
Feature activation-4.10e-01

Pos logit contributions
Mom+0.007
 mom+0.007
 slowly+0.007
No+0.007
 next+0.007
Neg logit contributions
cient-0.009
pole-0.009
iche-0.009
 cuc-0.009
venture-0.008
Token be
Token ID307
Feature activation-3.45e-01

Pos logit contributions
Mom+0.010
 mom+0.009
No+0.009
 slowly+0.009
 owner+0.009
Neg logit contributions
pole-0.019
rily-0.018
izer-0.018
iche-0.018
cient-0.018
Token quiet
Token ID5897
Feature activation-9.60e-01

Pos logit contributions
 mom+0.007
 slowly+0.006
Mom+0.006
 care+0.006
 soon+0.005
Neg logit contributions
rily-0.021
pole-0.020
iche-0.019
izer-0.019
 cuc-0.018
Token and
Token ID290
Feature activation+3.87e-01

Pos logit contributions
Mom+0.024
 mom+0.024
 moms+0.023
 care+0.022
No+0.021
Neg logit contributions
rily-0.050
pole-0.045
ifying-0.042
iche-0.041
venture-0.041
Token careful
Token ID8161
Feature activation-1.07e+00

Pos logit contributions
pole+0.019
rily+0.019
cient+0.019
 cuc+0.019
ifying+0.019
Neg logit contributions
Mom-0.009
 mom-0.009
 owner-0.007
Tom-0.007
 slowly-0.007
Token too
Token ID1165
Feature activation-2.42e+00

Pos logit contributions
 mom+0.024
 care+0.024
 moms+0.023
Mom+0.023
Tom+0.022
Neg logit contributions
rily-0.055
pole-0.052
iche-0.047
venture-0.046
 films-0.046
Token,"
Token ID553
Feature activation-7.37e-01

Pos logit contributions
Mom+0.061
 care+0.057
No+0.056
 mom+0.055
 moms+0.053
Neg logit contributions
rily-0.119
pole-0.108
ifying-0.104
iche-0.103
cient-0.102
Token Tom
Token ID4186
Feature activation+5.88e-01

Pos logit contributions
 care+0.017
Mom+0.017
 slowly+0.017
 soon+0.016
 mom+0.016
Neg logit contributions
rily-0.036
pole-0.034
 cuc-0.034
cient-0.034
iche-0.033
Token said
Token ID531
Feature activation+1.54e-01

Pos logit contributions
rily+0.031
pole+0.031
izer+0.029
iche+0.028
 cuc+0.028
Neg logit contributions
 mom-0.014
Mom-0.013
 care-0.012
 slowly-0.012
 hard-0.012
Token.
Token ID13
Feature activation+2.53e-01

Pos logit contributions
rily+0.010
pole+0.009
iche+0.009
izer+0.009
cient+0.009
Neg logit contributions
 mom-0.002
Mom-0.002
 slowly-0.002
 care-0.002
 soon-0.002
Token
Token ID198
Feature activation-3.95e-01

Pos logit contributions
rily+0.013
pole+0.012
izer+0.012
iche+0.012
 cuc+0.012
Neg logit contributions
 mom-0.006
 care-0.006
 slowly-0.006
 soon-0.006
 love-0.006
Token told
Token ID1297
Feature activation-3.34e-01

Pos logit contributions
pole+0.002
rily+0.002
cient+0.002
venture+0.002
iche+0.002
Neg logit contributions
 mom-0.001
Mom-0.001
 owner-0.001
 slowly-0.001
 moms-0.001
Token them
Token ID606
Feature activation-2.07e-01

Pos logit contributions
Mom+0.011
 owner+0.011
 slowly+0.010
 mom+0.010
 brave+0.010
Neg logit contributions
rily-0.014
 films-0.012
iche-0.012
pole-0.012
izer-0.012
Token not
Token ID407
Feature activation-1.18e+00

Pos logit contributions
Mom+0.007
Tom+0.006
OK+0.005
No+0.005
Mac+0.005
Neg logit contributions
rily-0.009
cient-0.008
 films-0.008
venture-0.008
pole-0.008
Token to
Token ID284
Feature activation+6.33e-02

Pos logit contributions
 mom+0.024
Mom+0.023
 slowly+0.021
 soon+0.019
 owner+0.019
Neg logit contributions
rily-0.065
pole-0.063
iche-0.063
izer-0.061
ifying-0.060
Token go
Token ID467
Feature activation-4.73e-01

Pos logit contributions
iche+0.003
pole+0.003
izer+0.003
rily+0.003
cient+0.003
Neg logit contributions
 mom-0.002
Mom-0.002
 slowly-0.002
 brave-0.001
 owner-0.001
Token too
Token ID1165
Feature activation-2.42e+00

Pos logit contributions
 mom+0.014
Mom+0.013
 care+0.012
 slowly+0.012
 owner+0.012
Neg logit contributions
rily-0.023
pole-0.021
iche-0.021
izer-0.021
ifying-0.020
Token close
Token ID1969
Feature activation-2.66e-02

Pos logit contributions
Mac+0.073
Jenn+0.072
 mom+0.068
Mom+0.068
 words+0.066
Neg logit contributions
rily-0.086
 ours-0.083
ifying-0.083
arium-0.080
iche-0.079
Token to
Token ID284
Feature activation-1.48e-01

Pos logit contributions
Mom+0.001
 mom+0.001
 care+0.000
 brave+0.000
 owner+0.000
Neg logit contributions
rily-0.001
pole-0.001
ifying-0.001
izer-0.001
iche-0.001
Token the
Token ID262
Feature activation+1.95e-01

Pos logit contributions
Mom+0.004
 mom+0.003
 owner+0.003
 slowly+0.003
 care+0.003
Neg logit contributions
rily-0.008
izer-0.007
iche-0.007
pole-0.007
cient-0.007
Token edge
Token ID5743
Feature activation+1.57e-01

Pos logit contributions
rily+0.007
 cuc+0.007
cient+0.007
iche+0.007
ifying+0.007
Neg logit contributions
Mom-0.007
 soon-0.007
 care-0.007
 mom-0.007
 slowly-0.007
Token or
Token ID393
Feature activation-1.01e-02

Pos logit contributions
rily+0.009
pole+0.008
iche+0.008
izer+0.008
ifying+0.008
Neg logit contributions
 mom-0.004
Mom-0.003
 slowly-0.003
 care-0.003
 owner-0.003
Token But
Token ID887
Feature activation-6.47e-01

Pos logit contributions
rily+0.008
pole+0.008
izer+0.007
iche+0.007
venture+0.007
Neg logit contributions
 mom-0.008
Mom-0.007
 slowly-0.007
 soon-0.007
 owner-0.007
Token we
Token ID356
Feature activation-3.80e-01

Pos logit contributions
Mom+0.020
 care+0.020
 slowly+0.019
 owner+0.019
Tom+0.019
Neg logit contributions
rily-0.027
pole-0.026
 cuc-0.025
cient-0.025
melon-0.024
Token should
Token ID815
Feature activation+3.07e-01

Pos logit contributions
Mom+0.011
 mom+0.010
 owner+0.010
 slowly+0.009
No+0.009
Neg logit contributions
pole-0.017
cient-0.017
iche-0.017
rily-0.017
ifying-0.016
Token not
Token ID407
Feature activation+6.27e-02

Pos logit contributions
pole+0.015
rily+0.015
iche+0.015
melon+0.014
izer+0.014
Neg logit contributions
Mom-0.007
 slowly-0.006
 mom-0.006
 hard-0.006
 owner-0.006
Token stay
Token ID2652
Feature activation-1.98e-01

Pos logit contributions
pole+0.003
iche+0.003
cient+0.003
izer+0.003
melon+0.003
Neg logit contributions
 mom-0.002
Mom-0.002
 hard-0.002
 slowly-0.002
 brave-0.001
Token too
Token ID1165
Feature activation-2.42e+00

Pos logit contributions
 mom+0.006
Mom+0.006
 care+0.005
Tom+0.005
 slowly+0.005
Neg logit contributions
rily-0.010
izer-0.009
pole-0.009
iche-0.009
ifying-0.008
Token long
Token ID890
Feature activation-1.96e-01

Pos logit contributions
Mac+0.049
 care+0.048
No+0.046
Mom+0.046
 mom+0.045
Neg logit contributions
rily-0.124
pole-0.115
iche-0.111
 ours-0.110
ifying-0.110
Token.
Token ID13
Feature activation+2.03e-01

Pos logit contributions
 mom+0.004
Mom+0.004
 slowly+0.004
 care+0.003
 owner+0.003
Neg logit contributions
rily-0.011
pole-0.011
iche-0.010
izer-0.010
cient-0.010
Token They
Token ID1119
Feature activation+8.88e-01

Pos logit contributions
rily+0.008
pole+0.008
izer+0.007
iche+0.007
 cuc+0.007
Neg logit contributions
 mom-0.008
Mom-0.007
 slowly-0.007
 soon-0.007
 owner-0.007
Token have
Token ID423
Feature activation+2.89e-01

Pos logit contributions
rily+0.040
pole+0.040
cient+0.039
iche+0.038
venture+0.037
Neg logit contributions
Mom-0.027
 mom-0.027
 slowly-0.024
 owner-0.024
Tom-0.023
Token a
Token ID257
Feature activation+2.16e-01

Pos logit contributions
rily+0.015
pole+0.014
ifying+0.014
iche+0.014
cient+0.014
Neg logit contributions
Mom-0.007
 mom-0.006
 slowly-0.006
 soon-0.005
Tom-0.005
Token.
Token ID13
Feature activation-6.24e-02

Pos logit contributions
Mom+0.000
 moms+0.000
Tom+0.000
 owner+0.000
 mom+0.000
Neg logit contributions
rily-0.000
pole-0.000
 cuc-0.000
cient-0.000
 gra-0.000
Token The
Token ID383
Feature activation-2.56e-01

Pos logit contributions
Mom+0.002
 mom+0.002
 owner+0.002
Tom+0.002
 soon+0.002
Neg logit contributions
rily-0.003
pole-0.002
cient-0.002
 cuc-0.002
venture-0.002
Token oct
Token ID19318
Feature activation+1.73e+00

Pos logit contributions
 slowly+0.013
 soon+0.013
 care+0.013
 mom+0.013
Mom+0.013
Neg logit contributions
 cuc-0.005
rily-0.005
izer-0.004
iche-0.004
melon-0.004
Tokenopus
Token ID25790
Feature activation+5.41e-01

Pos logit contributions
rily+0.048
pole+0.043
iche+0.042
izer+0.041
cient+0.039
Neg logit contributions
 mom-0.090
 slowly-0.085
 care-0.084
 soon-0.084
Mom-0.084
Token gave
Token ID2921
Feature activation+2.55e-01

Pos logit contributions
rily+0.029
pole+0.028
venture+0.026
cient+0.026
 cuc+0.026
Neg logit contributions
Mom-0.012
 mom-0.012
 care-0.011
 love-0.010
Tom-0.010
Token El
Token ID2574
Feature activation+2.02e+00

Pos logit contributions
rily+0.012
pole+0.010
 cuc+0.009
cient+0.009
venture+0.009
Neg logit contributions
Mom-0.009
 owner-0.009
Tom-0.009
 mom-0.009
 moms-0.008
Tokensie
Token ID44524
Feature activation-1.05e-01

Pos logit contributions
rily+0.053
pole+0.052
iche+0.050
izer+0.049
ifying+0.048
Neg logit contributions
 mom-0.100
 slowly-0.100
 soon-0.098
 brave-0.097
 care-0.097
Token a
Token ID257
Feature activation+1.48e-01

Pos logit contributions
 mom+0.002
Mom+0.002
 slowly+0.002
 soon+0.002
Tom+0.002
Neg logit contributions
rily-0.006
pole-0.006
izer-0.005
iche-0.005
 cuc-0.005
Token band
Token ID4097
Feature activation+1.63e-02

Pos logit contributions
rily+0.010
pole+0.009
iche+0.009
izer+0.009
ays+0.009
Neg logit contributions
 mom-0.002
 hard-0.002
 slowly-0.002
 brave-0.002
 care-0.002
Tokenage
Token ID496
Feature activation-3.14e-01

Pos logit contributions
rily+0.001
pole+0.001
izer+0.001
iche+0.001
venture+0.001
Neg logit contributions
 mom-0.000
Mom-0.000
 slowly-0.000
 soon-0.000
 owner-0.000
Token and
Token ID290
Feature activation+5.17e-03

Pos logit contributions
 mom+0.006
Mom+0.006
 slowly+0.005
 care+0.005
 soon+0.005
Neg logit contributions
rily-0.018
pole-0.017
izer-0.017
iche-0.017
ifying-0.016
Token alone
Token ID3436
Feature activation+1.13e-02

Pos logit contributions
 mom+0.007
 slowly+0.006
Mom+0.006
 care+0.006
 soon+0.006
Neg logit contributions
rily-0.017
pole-0.016
iche-0.015
izer-0.015
ifying-0.015
Token.
Token ID13
Feature activation+3.97e-01

Pos logit contributions
rily+0.001
pole+0.001
ifying+0.001
iche+0.001
izer+0.001
Neg logit contributions
 mom-0.000
Mom-0.000
 care-0.000
 moms-0.000
 owner-0.000
Token
Token ID198
Feature activation-9.28e-02

Pos logit contributions
rily+0.019
pole+0.018
iche+0.017
izer+0.017
ifying+0.017
Neg logit contributions
 mom-0.012
 slowly-0.011
Mom-0.011
 soon-0.011
 care-0.011
Token
Token ID198
Feature activation+3.95e-01

Pos logit contributions
 bud+0.003
 care+0.003
 luck+0.003
 slowly+0.003
 mom+0.003
Neg logit contributions
rily-0.004
izer-0.004
iche-0.003
ulp-0.003
venture-0.003
TokenCr
Token ID13916
Feature activation-4.28e-01

Pos logit contributions
rily+0.018
ifying+0.018
iche+0.017
izer+0.017
eh+0.017
Neg logit contributions
 care-0.010
 words-0.010
 mom-0.009
 luck-0.009
 bud-0.009
Tokenab
Token ID397
Feature activation+1.85e+00

Pos logit contributions
 slowly+0.014
Mom+0.014
 mom+0.014
 soon+0.013
 owner+0.013
Neg logit contributions
rily-0.020
pole-0.018
 cuc-0.017
izer-0.017
cient-0.017
Token saw
Token ID2497
Feature activation+3.85e-01

Pos logit contributions
rily+0.090
pole+0.084
iche+0.082
izer+0.081
cient+0.079
Neg logit contributions
 mom-0.057
 slowly-0.053
Mom-0.052
 soon-0.051
 care-0.051
Token Sara
Token ID24799
Feature activation-7.28e-01

Pos logit contributions
rily+0.019
 cuc+0.016
melon+0.016
iche+0.016
ays+0.016
Neg logit contributions
 care-0.010
Mom-0.010
 bud-0.010
 slowly-0.010
 soon-0.010
Token and
Token ID290
Feature activation+1.73e+00

Pos logit contributions
 mom+0.018
 slowly+0.016
Mom+0.016
 soon+0.015
 care+0.015
Neg logit contributions
rily-0.040
pole-0.038
iche-0.037
izer-0.037
cient-0.036
Token Ben
Token ID3932
Feature activation-5.87e-01

Pos logit contributions
rily+0.090
pole+0.084
iche+0.082
izer+0.082
cient+0.080
Neg logit contributions
 mom-0.048
 slowly-0.044
Mom-0.043
 soon-0.042
 care-0.042
Token.
Token ID13
Feature activation-1.45e-01

Pos logit contributions
 mom+0.017
 slowly+0.016
 soon+0.015
 care+0.015
Mom+0.015
Neg logit contributions
rily-0.030
pole-0.028
iche-0.027
izer-0.027
 cuc-0.026
Token that
Token ID326
Feature activation+2.63e-01

Pos logit contributions
 mom+0.008
 care+0.008
Mom+0.007
 love+0.007
 slowly+0.007
Neg logit contributions
rily-0.018
pole-0.017
izer-0.017
cient-0.016
venture-0.016
Token day
Token ID1110
Feature activation+8.28e-01

Pos logit contributions
rily+0.017
ifying+0.015
pole+0.015
iche+0.015
ksh+0.015
Neg logit contributions
 mom-0.006
 hard-0.005
 soon-0.005
Mom-0.005
 next-0.005
Token on
Token ID319
Feature activation+9.58e-01

Pos logit contributions
rily+0.050
pole+0.046
iche+0.045
 cuc+0.044
izer+0.044
Neg logit contributions
 mom-0.019
 slowly-0.017
Mom-0.017
 soon-0.016
 hard-0.016
Token,
Token ID11
Feature activation+3.95e-01

Pos logit contributions
rily+0.066
pole+0.063
izer+0.061
iche+0.061
cient+0.060
Neg logit contributions
 mom-0.010
 slowly-0.008
Mom-0.008
 care-0.007
 soon-0.007
Token the
Token ID262
Feature activation-5.98e-02

Pos logit contributions
rily+0.027
pole+0.025
izer+0.025
iche+0.025
ifying+0.025
Neg logit contributions
 mom-0.006
 slowly-0.005
 soon-0.005
 care-0.004
 brave-0.004
Token ostr
Token ID42387
Feature activation+2.07e+00

Pos logit contributions
 slowly+0.003
 mom+0.003
 care+0.003
 soon+0.003
Mom+0.003
Neg logit contributions
rily-0.002
pole-0.002
izer-0.002
iche-0.002
cient-0.002
Tokenich
Token ID488
Feature activation-6.31e-04

Pos logit contributions
rily+0.054
pole+0.052
cient+0.047
izer+0.046
iche+0.045
Neg logit contributions
 care-0.106
 mom-0.106
 slowly-0.104
 soon-0.101
 hard-0.101
Token realized
Token ID6939
Feature activation-1.88e-01

Pos logit contributions
 mom+0.000
Mom+0.000
 care+0.000
 slowly+0.000
 love+0.000
Neg logit contributions
rily-0.000
pole-0.000
izer-0.000
cient-0.000
iche-0.000
Token that
Token ID326
Feature activation-4.16e-01

Pos logit contributions
Mom+0.004
 mom+0.004
 care+0.004
 slowly+0.004
 love+0.004
Neg logit contributions
rily-0.010
pole-0.010
 cuc-0.009
cient-0.009
izer-0.009
Token being
Token ID852
Feature activation+8.61e-02

Pos logit contributions
Mom+0.011
 mom+0.010
 slowly+0.010
 care+0.010
 moms+0.010
Neg logit contributions
rily-0.020
pole-0.020
 cuc-0.019
cient-0.019
iche-0.018
Token different
Token ID1180
Feature activation+4.02e-01

Pos logit contributions
rily+0.005
pole+0.005
iche+0.005
izer+0.005
 cuc+0.004
Neg logit contributions
 mom-0.002
Mom-0.002
 slowly-0.002
 soon-0.001
 care-0.001
Token we
Token ID356
Feature activation+6.77e-02

Pos logit contributions
rily+0.064
pole+0.062
ifying+0.060
venture+0.060
izer+0.060
Neg logit contributions
 slowly-0.024
Mom-0.022
 soon-0.021
 owner-0.021
 care-0.021
Token are
Token ID389
Feature activation+3.38e-01

Pos logit contributions
iche+0.002
ifying+0.002
pole+0.002
rily+0.002
venture+0.002
Neg logit contributions
 slowly-0.002
Mom-0.002
 mom-0.002
 owner-0.002
 next-0.002
Token here
Token ID994
Feature activation+1.09e-01

Pos logit contributions
rily+0.017
pole+0.016
izer+0.016
iche+0.015
 cuc+0.015
Neg logit contributions
 mom-0.011
 brave-0.010
 soon-0.010
 now-0.010
 slowly-0.009
Token!"
Token ID2474
Feature activation-5.13e-01

Pos logit contributions
rily+0.005
pole+0.005
ifying+0.005
iche+0.005
venture+0.005
Neg logit contributions
Mom-0.003
 care-0.003
 slowly-0.003
 mom-0.003
 owner-0.003
Token Tom
Token ID4186
Feature activation+4.22e-01

Pos logit contributions
 mom+0.011
 slowly+0.010
Mom+0.010
 care+0.010
 soon+0.010
Neg logit contributions
rily-0.029
pole-0.027
iche-0.026
izer-0.026
 cuc-0.026
Token and
Token ID290
Feature activation+1.82e+00

Pos logit contributions
rily+0.024
pole+0.023
izer+0.021
iche+0.021
cient+0.021
Neg logit contributions
 mom-0.009
Mom-0.008
 care-0.008
 slowly-0.008
 love-0.008
Token Lily
Token ID20037
Feature activation+2.82e-01

Pos logit contributions
rily+0.097
pole+0.092
izer+0.089
iche+0.089
 cuc+0.087
Neg logit contributions
 mom-0.044
 slowly-0.042
Mom-0.041
 soon-0.040
 care-0.040
Token said
Token ID531
Feature activation-3.11e-01

Pos logit contributions
rily+0.017
pole+0.016
iche+0.015
izer+0.015
cient+0.015
Neg logit contributions
 mom-0.006
 slowly-0.005
Mom-0.005
 care-0.005
 soon-0.005
Token.
Token ID13
Feature activation+3.69e-01

Pos logit contributions
 mom+0.005
Mom+0.004
 slowly+0.004
 care+0.004
 soon+0.004
Neg logit contributions
rily-0.020
pole-0.018
iche-0.018
izer-0.018
cient-0.018
Token "
Token ID366
Feature activation-7.76e-01

Pos logit contributions
rily+0.019
pole+0.018
iche+0.017
izer+0.017
cient+0.017
Neg logit contributions
 mom-0.011
 slowly-0.010
Mom-0.009
 soon-0.009
 care-0.009
TokenWe
Token ID1135
Feature activation-1.66e-02

Pos logit contributions
 mom+0.025
Mom+0.023
 slowly+0.023
 soon+0.023
 care+0.022
Neg logit contributions
rily-0.037
pole-0.034
iche-0.033
izer-0.033
cient-0.033
Token said
Token ID531
Feature activation+1.24e-01

Pos logit contributions
 bud+0.028
Mom+0.027
 care+0.027
 love+0.027
 slowly+0.026
Neg logit contributions
rily-0.037
pole-0.037
 cuc-0.033
cient-0.032
izer-0.031
Token Sarah
Token ID10490
Feature activation+4.26e-02

Pos logit contributions
rily+0.005
pole+0.005
izer+0.005
cient+0.005
iche+0.005
Neg logit contributions
Mom-0.004
 love-0.004
 slowly-0.004
 care-0.004
 mom-0.004
Token.
Token ID13
Feature activation+3.75e-01

Pos logit contributions
rily+0.002
pole+0.002
cient+0.002
 cuc+0.002
venture+0.002
Neg logit contributions
Mom-0.001
No-0.001
 love-0.001
 moms-0.001
 care-0.001
Token "
Token ID366
Feature activation+1.21e-01

Pos logit contributions
rily+0.018
pole+0.017
iche+0.016
izer+0.016
cient+0.016
Neg logit contributions
 mom-0.011
 slowly-0.011
 care-0.010
Mom-0.010
 soon-0.010
TokenI
Token ID40
Feature activation+2.45e-01

Pos logit contributions
rily+0.006
pole+0.006
iche+0.006
izer+0.006
ifying+0.005
Neg logit contributions
 mom-0.003
 slowly-0.003
 care-0.003
Mom-0.003
 soon-0.003
Token wish
Token ID4601
Feature activation+1.84e+00

Pos logit contributions
iche+0.011
rily+0.010
cient+0.010
venture+0.010
pole+0.010
Neg logit contributions
Mom-0.007
 mom-0.007
 moms-0.007
 owner-0.006
 slowly-0.006
Token we
Token ID356
Feature activation+5.46e-01

Pos logit contributions
rily+0.076
pole+0.072
cient+0.068
iche+0.067
venture+0.067
Neg logit contributions
Mom-0.064
 slowly-0.062
 mom-0.062
 care-0.061
 owner-0.060
Token could
Token ID714
Feature activation+5.94e-01

Pos logit contributions
rily+0.027
pole+0.026
ifying+0.026
venture+0.025
cient+0.025
Neg logit contributions
 mom-0.014
Mom-0.014
 slowly-0.013
 owner-0.013
 moms-0.012
Token find
Token ID1064
Feature activation+7.05e-01

Pos logit contributions
pole+0.025
rily+0.024
iche+0.022
izer+0.022
cient+0.022
Neg logit contributions
Mom-0.018
No-0.017
 hard-0.017
 slowly-0.016
 mom-0.016
Token your
Token ID534
Feature activation+1.65e-01

Pos logit contributions
rily+0.042
pole+0.040
iche+0.038
izer+0.038
 cuc+0.038
Neg logit contributions
 mom-0.013
 slowly-0.012
Mom-0.012
 soon-0.011
 owner-0.011
Token avocado
Token ID40377
Feature activation+1.91e-01

Pos logit contributions
rily+0.006
pole+0.006
iche+0.005
izer+0.005
 cuc+0.005
Neg logit contributions
 mom-0.007
 slowly-0.006
Mom-0.006
 soon-0.006
 care-0.006
Token.
Token ID13
Feature activation+6.00e-01

Pos logit contributions
rily+0.033
pole+0.031
venture+0.030
iche+0.030
izer+0.029
Neg logit contributions
Mom-0.016
 mom-0.015
 slowly-0.014
 owner-0.013
 moms-0.013
Token
Token ID198
Feature activation+3.37e-01

Pos logit contributions
rily+0.030
pole+0.029
iche+0.028
cient+0.027
izer+0.027
Neg logit contributions
 mom-0.015
 slowly-0.015
 care-0.015
 soon-0.014
 brave-0.014
Token
Token ID198
Feature activation+6.28e-01

Pos logit contributions
rily+0.019
pole+0.018
iche+0.018
izer+0.017
cient+0.017
Neg logit contributions
 mom-0.007
 slowly-0.007
 care-0.006
Mom-0.006
 soon-0.006
TokenBut
Token ID1537
Feature activation-1.47e-01

Pos logit contributions
rily+0.031
pole+0.031
iche+0.029
eh+0.029
venture+0.029
Neg logit contributions
 mom-0.016
 care-0.016
 slowly-0.014
 love-0.014
 luck-0.014
Token then
Token ID788
Feature activation+3.13e-01

Pos logit contributions
 care+0.003
 slowly+0.003
 love+0.003
Mom+0.003
 brave+0.003
Neg logit contributions
rily-0.007
pole-0.007
 cuc-0.006
iche-0.006
venture-0.006
Token,
Token ID11
Feature activation+5.16e-01

Pos logit contributions
 cuc+0.010
cient+0.010
 Freddie+0.010
eh+0.010
venture+0.010
Neg logit contributions
 owner-0.009
 value-0.009
Mom-0.009
 instructions-0.009
 slowly-0.009
Token her
Token ID607
Feature activation+3.06e-01

Pos logit contributions
rily+0.027
pole+0.025
 cuc+0.025
iche+0.024
ifying+0.024
Neg logit contributions
 slowly-0.011
 care-0.011
Mom-0.011
 owner-0.010
 mom-0.010
Token mom
Token ID1995
Feature activation+1.20e-01

Pos logit contributions
rily+0.017
izer+0.017
iche+0.016
ifying+0.016
 stret+0.016
Neg logit contributions
 care-0.005
 soon-0.004
 slowly-0.004
 herself-0.003
 it-0.003
Token told
Token ID1297
Feature activation-1.87e-01

Pos logit contributions
pole+0.005
rily+0.005
iche+0.005
cient+0.005
venture+0.005
Neg logit contributions
 care-0.004
 mom-0.003
 love-0.003
 moms-0.003
 hard-0.003
Token her
Token ID607
Feature activation-3.86e-01

Pos logit contributions
 mom+0.003
 slowly+0.003
Mom+0.003
 care+0.003
 soon+0.003
Neg logit contributions
rily-0.012
pole-0.011
iche-0.011
izer-0.011
cient-0.011
Token a
Token ID257
Feature activation-2.08e-01

Pos logit contributions
Mom+0.011
 slowly+0.010
Mac+0.010
OK+0.009
Tom+0.009
Neg logit contributions
rily-0.017
venture-0.016
ulp-0.016
cient-0.015
iche-0.015
Token if
Token ID611
Feature activation-5.28e-01

Pos logit contributions
rily+0.017
pole+0.016
cient+0.015
izer+0.015
venture+0.015
Neg logit contributions
 mom-0.006
Mom-0.006
 care-0.005
 slowly-0.005
 love-0.005
Token he
Token ID339
Feature activation+9.12e-01

Pos logit contributions
Mom+0.017
 love+0.015
 care+0.014
 owner+0.014
 mom+0.014
Neg logit contributions
rily-0.022
cient-0.021
 cuc-0.021
pole-0.020
venture-0.020
Token could
Token ID714
Feature activation-3.75e-01

Pos logit contributions
pole+0.031
venture+0.029
rily+0.028
cient+0.026
arium+0.026
Neg logit contributions
Mom-0.034
 mom-0.031
 things-0.031
 hard-0.030
 care-0.030
Token go
Token ID467
Feature activation-3.18e-01

Pos logit contributions
Mom+0.011
 mom+0.011
 hard+0.010
 care+0.010
Tom+0.010
Neg logit contributions
pole-0.016
rily-0.016
izer-0.015
iche-0.015
cient-0.015
Token inside
Token ID2641
Feature activation+2.72e-01

Pos logit contributions
 mom+0.009
Mom+0.008
 moms+0.008
 owner+0.008
 words+0.007
Neg logit contributions
rily-0.016
pole-0.015
izer-0.014
venture-0.014
ifying-0.014
Token.
Token ID13
Feature activation+5.20e-01

Pos logit contributions
rily+0.016
pole+0.015
izer+0.015
iche+0.015
ifying+0.015
Neg logit contributions
 mom-0.005
Mom-0.004
 care-0.004
 slowly-0.004
 owner-0.004
Token
Token ID220
Feature activation+3.20e-01

Pos logit contributions
rily+0.026
pole+0.024
iche+0.024
izer+0.023
cient+0.023
Neg logit contributions
 mom-0.015
 slowly-0.014
Mom-0.014
 soon-0.014
 care-0.014
Token
Token ID198
Feature activation+5.75e-01

Pos logit contributions
rily+0.019
pole+0.018
iche+0.017
izer+0.017
cient+0.017
Neg logit contributions
 mom-0.007
 slowly-0.006
Mom-0.006
 care-0.006
 soon-0.006
Token
Token ID198
Feature activation+7.21e-01

Pos logit contributions
rily+0.027
pole+0.026
izer+0.026
iche+0.026
cient+0.025
Neg logit contributions
 mom-0.016
 care-0.014
 love-0.014
 slowly-0.014
 brave-0.014
TokenThe
Token ID464
Feature activation-6.63e-01

Pos logit contributions
rily+0.036
pole+0.035
izer+0.034
cient+0.034
 cuc+0.033
Neg logit contributions
 mom-0.019
 care-0.017
 love-0.016
 hard-0.016
 slowly-0.016
Token guard
Token ID4860
Feature activation+9.69e-01

Pos logit contributions
 soon+0.015
 care+0.015
Mom+0.014
 slowly+0.014
 love+0.013
Neg logit contributions
pole-0.031
cient-0.030
venture-0.030
rily-0.029
 cuc-0.029
Token big
Token ID1263
Feature activation-2.49e-01

Pos logit contributions
rily+0.000
pole+0.000
izer+0.000
iche+0.000
venture+0.000
Neg logit contributions
 mom-0.000
Mom-0.000
 care-0.000
 owner-0.000
 slowly-0.000
Token rain
Token ID6290
Feature activation+3.55e-01

Pos logit contributions
 mom+0.008
 slowly+0.007
Mom+0.007
 care+0.007
 soon+0.007
Neg logit contributions
rily-0.012
pole-0.011
iche-0.011
izer-0.011
cient-0.010
Token came
Token ID1625
Feature activation+2.54e-01

Pos logit contributions
rily+0.022
izer+0.020
 cuc+0.020
pole+0.020
cient+0.020
Neg logit contributions
 mom-0.006
 slowly-0.006
 soon-0.006
 now-0.005
 next-0.005
Token.
Token ID13
Feature activation-3.40e-02

Pos logit contributions
rily+0.013
pole+0.012
iche+0.012
izer+0.012
cient+0.012
Neg logit contributions
 mom-0.007
 slowly-0.007
 soon-0.006
Mom-0.006
 owner-0.006
Token The
Token ID383
Feature activation-3.21e-01

Pos logit contributions
 mom+0.001
 slowly+0.001
Mom+0.001
 soon+0.001
 owner+0.001
Neg logit contributions
rily-0.002
pole-0.002
izer-0.002
iche-0.002
cient-0.002
Token rain
Token ID6290
Feature activation+6.87e-01

Pos logit contributions
 care+0.010
 slowly+0.009
 soon+0.009
Mom+0.009
 mom+0.009
Neg logit contributions
rily-0.013
iche-0.013
pole-0.013
 cuc-0.013
melon-0.013
Token was
Token ID373
Feature activation-5.83e-01

Pos logit contributions
pole+0.032
rily+0.031
venture+0.030
iche+0.030
melon+0.029
Neg logit contributions
Mom-0.018
 mom-0.018
 care-0.016
Mac-0.015
 moms-0.015
Token so
Token ID523
Feature activation-8.25e-01

Pos logit contributions
 mom+0.015
 care+0.014
Mom+0.014
 slowly+0.013
 owner+0.013
Neg logit contributions
rily-0.030
pole-0.029
iche-0.027
izer-0.027
ifying-0.027
Token strong
Token ID1913
Feature activation-6.42e-02

Pos logit contributions
 owner+0.021
 mom+0.021
 care+0.021
Mom+0.020
 words+0.019
Neg logit contributions
rily-0.039
iche-0.036
ifying-0.036
pole-0.035
ulp-0.034
Token that
Token ID326
Feature activation-6.57e-01

Pos logit contributions
 mom+0.001
Mom+0.001
 owner+0.001
 care+0.001
 slowly+0.001
Neg logit contributions
rily-0.004
pole-0.004
izer-0.003
iche-0.003
 cuc-0.003
Token it
Token ID340
Feature activation+8.42e-02

Pos logit contributions
Mom+0.014
 owner+0.013
 moms+0.013
 care+0.012
 bud+0.012
Neg logit contributions
rily-0.033
pole-0.032
ifying-0.030
 cuc-0.030
iche-0.030
Token a
Token ID257
Feature activation-9.14e-02

Pos logit contributions
 slowly+0.004
 care+0.004
Mom+0.004
OK+0.004
 soon+0.004
Neg logit contributions
rily-0.008
 cuc-0.007
pole-0.007
izer-0.007
eh-0.007
Token store
Token ID3650
Feature activation-3.85e-02

Pos logit contributions
 care+0.003
 soon+0.003
 slowly+0.002
Mom+0.002
 herself+0.002
Neg logit contributions
rily-0.004
 cuc-0.004
ifying-0.004
iche-0.004
pole-0.004
Token.
Token ID13
Feature activation+3.12e-01

Pos logit contributions
 mom+0.001
 slowly+0.001
 care+0.001
Mom+0.001
 soon+0.001
Neg logit contributions
rily-0.002
pole-0.002
izer-0.002
iche-0.002
ifying-0.002
Token The
Token ID383
Feature activation+3.19e-02

Pos logit contributions
rily+0.017
pole+0.016
izer+0.016
iche+0.016
cient+0.015
Neg logit contributions
 mom-0.008
 slowly-0.007
Mom-0.007
 owner-0.007
 care-0.007
Token store
Token ID3650
Feature activation+1.91e-01

Pos logit contributions
rily+0.001
pole+0.001
cient+0.001
 cuc+0.001
iche+0.001
Neg logit contributions
 soon-0.001
 slowly-0.001
 care-0.001
 mom-0.001
Mom-0.001
Token had
Token ID550
Feature activation+3.56e-01

Pos logit contributions
pole+0.010
rily+0.009
melon+0.009
cient+0.009
venture+0.008
Neg logit contributions
 care-0.005
 love-0.004
 soon-0.004
 slowly-0.004
 hard-0.004
Token very
Token ID845
Feature activation+6.23e-01

Pos logit contributions
rily+0.017
pole+0.016
ifying+0.016
 cuc+0.015
umbs+0.015
Neg logit contributions
 slowly-0.008
 soon-0.008
 mom-0.008
 care-0.008
Mom-0.008
Token colorful
Token ID20239
Feature activation+3.66e-02

Pos logit contributions
rily+0.025
eh+0.023
 ours+0.021
 cuc+0.021
pole+0.020
Neg logit contributions
 care-0.021
Mac-0.019
 soon-0.018
Mom-0.018
 Rat-0.018
Token walls
Token ID7714
Feature activation+5.63e-01

Pos logit contributions
rily+0.002
 cuc+0.001
pole+0.001
 films+0.001
 ranch+0.001
Neg logit contributions
Mom-0.001
 soon-0.001
 mom-0.001
 slowly-0.001
 brave-0.001
Token and
Token ID290
Feature activation+4.23e-01

Pos logit contributions
rily+0.031
pole+0.030
ifying+0.028
iche+0.027
izer+0.027
Neg logit contributions
 mom-0.013
Mom-0.012
 care-0.012
 moms-0.011
 slowly-0.011
Token a
Token ID257
Feature activation+3.86e-01

Pos logit contributions
rily+0.016
 cuc+0.016
pole+0.015
ifying+0.014
 films+0.014
Neg logit contributions
 care-0.013
Mom-0.012
 bud-0.012
Mac-0.011
 respect-0.011
Token named
Token ID3706
Feature activation-9.29e-02

Pos logit contributions
 mom+0.001
 slowly+0.001
Mom+0.001
 soon+0.001
 owner+0.001
Neg logit contributions
rily-0.001
pole-0.001
iche-0.001
izer-0.001
ifying-0.001
Token Lily
Token ID20037
Feature activation-3.74e-01

Pos logit contributions
 mom+0.002
Mom+0.002
 slowly+0.002
 soon+0.002
 care+0.002
Neg logit contributions
rily-0.005
pole-0.005
izer-0.005
iche-0.005
ifying-0.005
Token.
Token ID13
Feature activation-5.44e-02

Pos logit contributions
 mom+0.004
 slowly+0.003
Mom+0.003
 soon+0.003
 love+0.003
Neg logit contributions
rily-0.027
pole-0.025
iche-0.025
ifying-0.025
izer-0.025
Token She
Token ID1375
Feature activation-5.10e-01

Pos logit contributions
 mom+0.001
 slowly+0.001
 care+0.001
 soon+0.001
Mom+0.001
Neg logit contributions
rily-0.003
pole-0.003
iche-0.003
izer-0.003
ifying-0.003
Token loved
Token ID6151
Feature activation+1.85e-01

Pos logit contributions
 mom+0.011
 slowly+0.010
Mom+0.009
 care+0.009
 soon+0.009
Neg logit contributions
rily-0.029
pole-0.028
iche-0.027
izer-0.027
venture-0.026
Token l
Token ID300
Feature activation+3.05e-01

Pos logit contributions
rily+0.012
pole+0.012
izer+0.011
iche+0.011
ifying+0.011
Neg logit contributions
 mom-0.003
 slowly-0.002
 care-0.002
Mom-0.002
 owner-0.002
Tokenoll
Token ID692
Feature activation-3.05e-02

Pos logit contributions
rily+0.009
pole+0.007
umbs+0.007
iche+0.007
eh+0.007
Neg logit contributions
 mom-0.016
 soon-0.015
 slowly-0.015
Mom-0.015
 care-0.015
Tokenip
Token ID541
Feature activation-2.14e-01

Pos logit contributions
 mom+0.001
Mom+0.001
 slowly+0.001
 owner+0.001
 soon+0.001
Neg logit contributions
rily-0.001
pole-0.001
cient-0.001
iche-0.001
venture-0.001
Tokenops
Token ID2840
Feature activation-4.45e-01

Pos logit contributions
 mom+0.008
Mom+0.008
 slowly+0.008
 soon+0.008
 Mom+0.008
Neg logit contributions
rily-0.008
pole-0.007
cient-0.007
iche-0.007
izer-0.007
Token,
Token ID11
Feature activation+4.52e-01

Pos logit contributions
 mom+0.008
 slowly+0.007
Mom+0.007
 soon+0.006
 care+0.006
Neg logit contributions
rily-0.027
pole-0.026
iche-0.025
izer-0.025
cient-0.025
Token especially
Token ID2592
Feature activation+3.42e-02

Pos logit contributions
rily+0.027
izer+0.025
ifying+0.025
cient+0.024
iche+0.024
Neg logit contributions
 mom-0.012
 slowly-0.010
 soon-0.010
Mom-0.009
 love-0.009
Token looking
Token ID2045
Feature activation+7.04e-02

Pos logit contributions
rily+0.017
pole+0.016
iche+0.015
izer+0.015
cient+0.015
Neg logit contributions
 mom-0.009
 slowly-0.008
Mom-0.008
 care-0.008
 soon-0.008
Token for
Token ID329
Feature activation-1.36e-01

Pos logit contributions
rily+0.004
pole+0.003
iche+0.003
izer+0.003
cient+0.003
Neg logit contributions
 mom-0.002
 slowly-0.002
Mom-0.002
 soon-0.002
 care-0.002
Token her
Token ID607
Feature activation-3.12e-01

Pos logit contributions
Mom+0.004
OK+0.004
 slowly+0.003
 soon+0.003
No+0.003
Neg logit contributions
rily-0.006
 cuc-0.006
izer-0.006
ulp-0.006
pole-0.005
Token lost
Token ID2626
Feature activation+5.38e-01

Pos logit contributions
Mom+0.007
 mom+0.007
 slowly+0.007
 care+0.007
 soon+0.006
Neg logit contributions
rily-0.017
pole-0.016
izer-0.015
ifying-0.015
iche-0.015
Token toy
Token ID13373
Feature activation+1.74e-01

Pos logit contributions
rily+0.025
pole+0.022
izer+0.022
 stret+0.021
 cuc+0.021
Neg logit contributions
Mom-0.018
 care-0.017
 soon-0.017
 slowly-0.016
 mom-0.016
Token.
Token ID13
Feature activation+8.76e-02

Pos logit contributions
rily+0.011
pole+0.010
iche+0.010
izer+0.010
cient+0.010
Neg logit contributions
 mom-0.003
 slowly-0.003
Mom-0.003
 care-0.003
 soon-0.003
Token Tim
Token ID5045
Feature activation-5.71e-01

Pos logit contributions
rily+0.004
pole+0.004
cient+0.004
iche+0.004
izer+0.004
Neg logit contributions
 mom-0.003
 slowly-0.002
Mom-0.002
 care-0.002
 soon-0.002
Token showed
Token ID3751
Feature activation-4.83e-01

Pos logit contributions
Mom+0.015
 care+0.015
 mom+0.015
No+0.015
 moms+0.015
Neg logit contributions
pole-0.027
rily-0.026
cient-0.025
venture-0.024
eh-0.023
Token her
Token ID607
Feature activation-6.97e-01

Pos logit contributions
Mom+0.012
 soon+0.011
 owner+0.011
OK+0.011
 slowly+0.010
Neg logit contributions
rily-0.025
pole-0.023
izer-0.022
cient-0.021
venture-0.020
Token where
Token ID810
Feature activation+2.55e-01

Pos logit contributions
 mom+0.014
Mom+0.014
 slowly+0.013
 soon+0.013
 care+0.013
Neg logit contributions
rily-0.040
pole-0.038
izer-0.037
iche-0.036
cient-0.036
Token he
Token ID339
Feature activation+7.76e-02

Pos logit contributions
rily+0.010
venture+0.010
pole+0.010
melon+0.010
 cuc+0.010
Neg logit contributions
Mom-0.008
 slowly-0.006
 bud-0.006
 owner-0.006
 moms-0.006
Token his
Token ID465
Feature activation-5.99e-01

Pos logit contributions
Mom+0.001
 mom+0.001
 owner+0.001
 soon+0.001
 moms+0.001
Neg logit contributions
rily-0.002
 cuc-0.002
pole-0.002
izer-0.001
melon-0.001
Token teeth
Token ID9941
Feature activation-3.56e-01

Pos logit contributions
Mom+0.016
 soon+0.015
 slowly+0.015
 mom+0.015
Tom+0.015
Neg logit contributions
rily-0.029
pole-0.027
izer-0.027
cient-0.027
 cuc-0.027
Token to
Token ID284
Feature activation-2.20e-01

Pos logit contributions
Mom+0.010
 mom+0.010
 moms+0.009
Tom+0.008
 Mom+0.008
Neg logit contributions
rily-0.018
pole-0.017
ifying-0.016
 cuc-0.016
venture-0.016
Token tie
Token ID9839
Feature activation-1.66e-01

Pos logit contributions
Mom+0.006
 mom+0.006
 moms+0.006
 owner+0.006
Tom+0.005
Neg logit contributions
rily-0.010
pole-0.010
izer-0.010
 cuc-0.009
iche-0.009
Token the
Token ID262
Feature activation+2.64e-01

Pos logit contributions
Mom+0.005
 mom+0.004
 moms+0.004
Tom+0.004
 bud+0.004
Neg logit contributions
rily-0.008
pole-0.007
ifying-0.007
melon-0.007
 cuc-0.007
Token string
Token ID4731
Feature activation+1.19e-01

Pos logit contributions
rily+0.011
pole+0.011
izer+0.011
ifying+0.010
cient+0.010
Neg logit contributions
 soon-0.008
Mom-0.008
 slowly-0.008
 mom-0.008
 care-0.008
Token around
Token ID1088
Feature activation+4.99e-01

Pos logit contributions
rily+0.007
pole+0.006
ifying+0.006
izer+0.006
melon+0.006
Neg logit contributions
 mom-0.003
Mom-0.003
 care-0.002
 moms-0.002
 love-0.002
Token the
Token ID262
Feature activation+2.51e-01

Pos logit contributions
rily+0.027
pole+0.025
izer+0.025
 cuc+0.025
iche+0.024
Neg logit contributions
Mom-0.011
 mom-0.011
 moms-0.010
 owner-0.010
 care-0.010
Token tree
Token ID5509
Feature activation+2.85e-01

Pos logit contributions
rily+0.010
pole+0.010
izer+0.010
iche+0.010
cient+0.010
Neg logit contributions
 care-0.008
 slowly-0.008
 mom-0.008
 soon-0.008
Mom-0.008
Token.
Token ID13
Feature activation-1.08e-01

Pos logit contributions
rily+0.017
pole+0.016
iche+0.016
izer+0.015
cient+0.015
Neg logit contributions
 mom-0.005
Mom-0.005
 care-0.005
 slowly-0.004
 soon-0.004
Token The
Token ID383
Feature activation-4.84e-01

Pos logit contributions
 mom+0.003
 slowly+0.003
Mom+0.003
 soon+0.003
 owner+0.003
Neg logit contributions
rily-0.006
pole-0.005
iche-0.005
izer-0.005
ifying-0.005
Token a
Token ID257
Feature activation-2.70e-01

Pos logit contributions
Mom+0.002
 mom+0.002
 slowly+0.001
 owner+0.001
 brave+0.001
Neg logit contributions
rily-0.004
pole-0.003
iche-0.003
 cuc-0.003
venture-0.003
Token good
Token ID922
Feature activation-1.77e-01

Pos logit contributions
 mom+0.007
Mom+0.006
 slowly+0.006
 soon+0.006
 care+0.006
Neg logit contributions
rily-0.014
pole-0.013
iche-0.013
izer-0.013
 cuc-0.013
Token time
Token ID640
Feature activation-3.37e-01

Pos logit contributions
 mom+0.005
Mom+0.005
 slowly+0.005
 soon+0.004
 owner+0.004
Neg logit contributions
rily-0.009
pole-0.008
izer-0.008
iche-0.008
 cuc-0.008
Token."
Token ID526
Feature activation-6.33e-01

Pos logit contributions
 mom+0.009
Mom+0.009
 slowly+0.008
 owner+0.008
 care+0.008
Neg logit contributions
rily-0.017
pole-0.016
iche-0.015
ifying-0.015
venture-0.015
Token
Token ID198
Feature activation-6.12e-01

Pos logit contributions
 mom+0.014
 slowly+0.013
Mom+0.012
 care+0.012
 soon+0.012
Neg logit contributions
rily-0.036
pole-0.034
iche-0.033
izer-0.033
cient-0.032
Token
Token ID198
Feature activation-2.37e-01

Pos logit contributions
 mom+0.015
 care+0.014
 slowly+0.014
 soon+0.013
 brave+0.013
Neg logit contributions
rily-0.032
pole-0.030
izer-0.029
iche-0.029
venture-0.028
TokenTom
Token ID13787
Feature activation-2.52e-01

Pos logit contributions
 mom+0.006
 care+0.005
 hard+0.005
 slowly+0.005
 now+0.005
Neg logit contributions
rily-0.012
pole-0.012
izer-0.012
eh-0.011
iche-0.011
Token did
Token ID750
Feature activation+6.69e-02

Pos logit contributions
 mom+0.007
Mom+0.007
 care+0.006
 owner+0.006
 soon+0.006
Neg logit contributions
rily-0.012
pole-0.012
venture-0.011
iche-0.011
izer-0.011
Token not
Token ID407
Feature activation+5.11e-01

Pos logit contributions
rily+0.004
pole+0.004
iche+0.004
izer+0.004
venture+0.004
Neg logit contributions
 mom-0.001
Mom-0.001
 slowly-0.001
 owner-0.001
 care-0.001
Token listen
Token ID6004
Feature activation-4.28e-01

Pos logit contributions
pole+0.020
iche+0.019
rily+0.019
izer+0.018
venture+0.018
Neg logit contributions
Mom-0.017
 mom-0.017
 moms-0.016
 slowly-0.016
 owner-0.016
Token.
Token ID13
Feature activation-8.79e-01

Pos logit contributions
Mom+0.013
 moms+0.011
Mac+0.011
 care+0.010
 love+0.010
Neg logit contributions
rily-0.021
pole-0.017
 stret-0.017
ulp-0.016
melon-0.016
Token
Token ID198
Feature activation-2.72e-01

Pos logit contributions
 mom+0.013
 care+0.012
 slowly+0.011
 owner+0.011
 brave+0.011
Neg logit contributions
rily-0.028
pole-0.026
izer-0.025
iche-0.025
ifying-0.024
TokenThe
Token ID464
Feature activation-5.55e-01

Pos logit contributions
 mom+0.007
 care+0.007
 hard+0.006
 owner+0.006
 luck+0.006
Neg logit contributions
rily-0.014
pole-0.013
izer-0.013
iche-0.013
ifying-0.013
Token bird
Token ID6512
Feature activation+4.92e-01

Pos logit contributions
 care+0.017
 soon+0.016
 slowly+0.015
 mom+0.014
 love+0.014
Neg logit contributions
pole-0.024
izer-0.023
rily-0.023
 cuc-0.023
cient-0.023
Token says
Token ID1139
Feature activation-2.30e-01

Pos logit contributions
rily+0.025
pole+0.023
iche+0.022
izer+0.022
cient+0.022
Neg logit contributions
 mom-0.014
 slowly-0.013
Mom-0.013
 care-0.013
 soon-0.013
Token,
Token ID11
Feature activation+4.64e-01

Pos logit contributions
Mom+0.004
 care+0.003
 mom+0.003
 owner+0.003
 slowly+0.003
Neg logit contributions
rily-0.014
pole-0.013
 cuc-0.013
cient-0.013
melon-0.012
Token "
Token ID366
Feature activation-4.23e-01

Pos logit contributions
rily+0.026
pole+0.024
iche+0.024
izer+0.024
venture+0.023
Neg logit contributions
 mom-0.012
 slowly-0.011
Mom-0.010
 soon-0.010
 owner-0.010
TokenHello
Token ID15496
Feature activation+1.20e-01

Pos logit contributions
 mom+0.011
 slowly+0.011
Mom+0.011
 care+0.010
 soon+0.010
Neg logit contributions
rily-0.022
pole-0.021
iche-0.020
izer-0.020
cient-0.020
Token,
Token ID11
Feature activation+1.06e+00

Pos logit contributions
rily+0.007
pole+0.007
ifying+0.007
izer+0.007
cient+0.007
Neg logit contributions
Mom-0.002
 mom-0.002
 care-0.002
 slowly-0.002
 owner-0.002
Token Anna
Token ID11735
Feature activation-4.98e-01

Pos logit contributions
rily+0.057
pole+0.054
iche+0.052
izer+0.052
cient+0.051
Neg logit contributions
 mom-0.027
 slowly-0.025
Mom-0.024
 care-0.024
 soon-0.024
Token and
Token ID290
Feature activation+9.33e-01

Pos logit contributions
 mom+0.010
Mom+0.009
 care+0.009
 slowly+0.009
 owner+0.008
Neg logit contributions
rily-0.029
pole-0.028
izer-0.027
iche-0.026
 cuc-0.026
Token Ben
Token ID3932
Feature activation-8.95e-01

Pos logit contributions
rily+0.048
pole+0.045
iche+0.043
izer+0.043
cient+0.042
Neg logit contributions
 mom-0.026
 slowly-0.024
Mom-0.024
 soon-0.023
 care-0.023
Token was
Token ID373
Feature activation+8.75e-01

Pos logit contributions
rily+0.008
pole+0.008
venture+0.007
izer+0.007
iche+0.007
Neg logit contributions
 care-0.003
 owner-0.003
 slowly-0.003
 mom-0.003
Mom-0.003
Token a
Token ID257
Feature activation-7.62e-02

Pos logit contributions
rily+0.067
pole+0.063
cient+0.063
iche+0.061
izer+0.060
Neg logit contributions
 mom-0.007
Mom-0.005
 soon-0.004
 love-0.004
 hard-0.004
Token brave
Token ID14802
Feature activation+2.69e-01

Pos logit contributions
 mom+0.002
 care+0.002
 slowly+0.002
 soon+0.002
Mom+0.002
Neg logit contributions
rily-0.004
pole-0.004
iche-0.004
izer-0.004
 cuc-0.004
Token boy
Token ID2933
Feature activation+1.15e-01

Pos logit contributions
rily+0.013
pole+0.012
izer+0.012
iche+0.011
 cuc+0.011
Neg logit contributions
 mom-0.008
Mom-0.008
 slowly-0.008
 soon-0.008
 care-0.008
Token.
Token ID13
Feature activation-4.18e-01

Pos logit contributions
rily+0.005
pole+0.004
iche+0.004
izer+0.004
ifying+0.004
Neg logit contributions
 mom-0.005
 slowly-0.004
 soon-0.004
 care-0.004
Mom-0.004
Token He
Token ID679
Feature activation-9.09e-01

Pos logit contributions
 mom+0.011
 slowly+0.010
Mom+0.010
 soon+0.010
 care+0.010
Neg logit contributions
rily-0.022
pole-0.021
iche-0.020
izer-0.020
cient-0.020
Token loved
Token ID6151
Feature activation+1.26e-01

Pos logit contributions
 mom+0.028
Mom+0.025
 moms+0.024
 care+0.024
 owner+0.024
Neg logit contributions
venture-0.040
eh-0.039
pole-0.039
rily-0.039
 cuc-0.038
Token to
Token ID284
Feature activation-7.68e-01

Pos logit contributions
rily+0.008
pole+0.008
izer+0.008
iche+0.008
ifying+0.008
Neg logit contributions
 mom-0.002
 slowly-0.002
 care-0.002
 soon-0.001
 owner-0.001
Token play
Token ID711
Feature activation-8.67e-03

Pos logit contributions
 mom+0.021
 slowly+0.019
Mom+0.019
 care+0.019
 soon+0.018
Neg logit contributions
rily-0.040
pole-0.038
iche-0.037
izer-0.036
ifying-0.036
Token with
Token ID351
Feature activation-1.12e-01

Pos logit contributions
 mom+0.000
 slowly+0.000
 brave+0.000
 soon+0.000
 next+0.000
Neg logit contributions
rily-0.001
pole-0.000
ifying-0.000
iche-0.000
cient-0.000
Token his
Token ID465
Feature activation-8.72e-01

Pos logit contributions
 mom+0.002
 slowly+0.002
Mom+0.002
 care+0.002
 soon+0.002
Neg logit contributions
rily-0.007
pole-0.006
iche-0.006
izer-0.006
ifying-0.006
TokenLu
Token ID25596
Feature activation+6.54e-01

Pos logit contributions
rily+0.005
pole+0.005
iche+0.005
izer+0.005
venture+0.005
Neg logit contributions
 mom-0.003
 slowly-0.003
 care-0.003
 owner-0.003
 soon-0.003
Tokency
Token ID948
Feature activation-3.34e-01

Pos logit contributions
rily+0.025
pole+0.022
izer+0.021
ifying+0.021
iche+0.021
Neg logit contributions
 mom-0.027
 slowly-0.025
 owner-0.025
Mom-0.025
 soon-0.025
Token was
Token ID373
Feature activation-4.29e-01

Pos logit contributions
 care+0.009
 things+0.008
Mom+0.008
 hard+0.008
 moms+0.008
Neg logit contributions
rily-0.016
pole-0.015
venture-0.014
 cuc-0.014
cient-0.013
Token very
Token ID845
Feature activation+1.93e-01

Pos logit contributions
 mom+0.013
Mom+0.012
 slowly+0.012
 care+0.011
 soon+0.011
Neg logit contributions
rily-0.021
pole-0.020
iche-0.019
izer-0.019
cient-0.019
Token excited
Token ID6568
Feature activation+5.62e-01

Pos logit contributions
rily+0.009
pole+0.008
iche+0.008
cient+0.008
 cuc+0.008
Neg logit contributions
Mom-0.006
 mom-0.005
Tom-0.005
 care-0.005
 slowly-0.005
Token because
Token ID780
Feature activation-1.23e+00

Pos logit contributions
rily+0.033
pole+0.031
iche+0.030
izer+0.029
venture+0.029
Neg logit contributions
 mom-0.011
Mom-0.011
 slowly-0.010
 care-0.009
 owner-0.009
Token she
Token ID673
Feature activation-7.23e-01

Pos logit contributions
 care+0.035
 moms+0.031
Mom+0.030
Mac+0.030
OK+0.030
Neg logit contributions
rily-0.055
pole-0.049
ifying-0.047
 album-0.046
arium-0.046
Token'd
Token ID1549
Feature activation+2.45e-01

Pos logit contributions
 moms+0.023
 mom+0.022
 things+0.021
Mom+0.021
 people+0.020
Neg logit contributions
venture-0.028
pole-0.028
rily-0.027
cient-0.027
ults-0.026
Token never
Token ID1239
Feature activation+1.40e-01

Pos logit contributions
rily+0.012
pole+0.011
venture+0.011
 cuc+0.011
izer+0.011
Neg logit contributions
Mom-0.006
 mom-0.006
 moms-0.005
 slowly-0.005
 owner-0.005
Token seen
Token ID1775
Feature activation+7.63e-01

Pos logit contributions
pole+0.005
rily+0.005
venture+0.005
iche+0.005
eh+0.005
Neg logit contributions
 mom-0.005
Mom-0.005
 slowly-0.005
 moms-0.005
 owner-0.005
Token a
Token ID257
Feature activation-8.51e-01

Pos logit contributions
rily+0.040
pole+0.038
izer+0.035
 cuc+0.035
iche+0.035
Neg logit contributions
Mom-0.020
 mom-0.018
 slowly-0.018
 care-0.017
 owner-0.017
Token Max
Token ID5436
Feature activation-4.16e-01

Pos logit contributions
 mom+0.009
 slowly+0.008
Mom+0.007
 soon+0.007
 owner+0.007
Neg logit contributions
rily-0.025
pole-0.023
iche-0.023
cient-0.022
izer-0.022
Token,
Token ID11
Feature activation-1.50e-01

Pos logit contributions
 mom+0.007
 slowly+0.006
 soon+0.006
 brave+0.005
 now+0.005
Neg logit contributions
rily-0.027
iche-0.026
ifying-0.025
 cuc-0.025
pole-0.025
Token the
Token ID262
Feature activation+5.40e-02

Pos logit contributions
 mom+0.004
 slowly+0.004
Mom+0.004
 care+0.004
 soon+0.004
Neg logit contributions
rily-0.008
pole-0.007
izer-0.007
iche-0.007
ifying-0.007
Token dog
Token ID3290
Feature activation-1.72e-01

Pos logit contributions
rily+0.002
pole+0.002
 cuc+0.002
izer+0.002
iche+0.002
Neg logit contributions
 care-0.002
 slowly-0.002
 soon-0.002
Mom-0.002
 mom-0.002
Token from
Token ID422
Feature activation-9.80e-02

Pos logit contributions
 mom+0.003
Mom+0.003
 slowly+0.003
 care+0.003
 soon+0.003
Neg logit contributions
rily-0.010
pole-0.010
iche-0.010
izer-0.010
cient-0.009
Token next
Token ID1306
Feature activation-1.34e+00

Pos logit contributions
 mom+0.002
Mom+0.002
 slowly+0.002
 care+0.002
 owner+0.002
Neg logit contributions
rily-0.005
pole-0.005
izer-0.005
 cuc-0.005
iche-0.005
Token door
Token ID3420
Feature activation-5.40e-01

Pos logit contributions
 mom+0.032
Mom+0.030
 slowly+0.029
 soon+0.029
 care+0.028
Neg logit contributions
rily-0.071
pole-0.068
izer-0.066
iche-0.065
ifying-0.064
Token.
Token ID13
Feature activation-3.92e-01

Pos logit contributions
Mom+0.011
 mom+0.010
 care+0.010
 love+0.009
 slowly+0.009
Neg logit contributions
rily-0.031
pole-0.030
ifying-0.028
cient-0.027
izer-0.027
Token He
Token ID679
Feature activation-5.97e-01

Pos logit contributions
 mom+0.010
 slowly+0.009
Mom+0.009
 owner+0.009
 soon+0.009
Neg logit contributions
rily-0.022
pole-0.020
iche-0.020
izer-0.019
cient-0.019
Token sees
Token ID7224
Feature activation+1.60e-01

Pos logit contributions
Mom+0.021
 mom+0.020
 owner+0.019
 care+0.019
 soon+0.019
Neg logit contributions
rily-0.024
pole-0.023
cient-0.023
venture-0.022
 cuc-0.022
Token the
Token ID262
Feature activation-2.92e-01

Pos logit contributions
rily+0.008
pole+0.008
izer+0.007
iche+0.007
cient+0.007
Neg logit contributions
Mom-0.004
 slowly-0.004
 mom-0.004
 soon-0.004
 care-0.004
Token Mom
Token ID11254
Feature activation-6.50e-01

Pos logit contributions
rily+0.020
pole+0.019
 cuc+0.018
iche+0.018
izer+0.018
Neg logit contributions
 slowly-0.002
 care-0.002
 mom-0.002
Mom-0.002
 soon-0.002
Token and
Token ID290
Feature activation+9.77e-01

Pos logit contributions
 care+0.015
 slowly+0.015
 mom+0.015
No+0.014
Mom+0.014
Neg logit contributions
rily-0.033
pole-0.032
izer-0.029
iche-0.029
 cuc-0.029
Token Dad
Token ID17415
Feature activation-5.77e-01

Pos logit contributions
rily+0.064
pole+0.061
izer+0.060
iche+0.060
ifying+0.058
Neg logit contributions
 mom-0.017
Mom-0.013
 slowly-0.013
 owner-0.012
 soon-0.012
Token.
Token ID13
Feature activation+4.67e-01

Pos logit contributions
Mom+0.015
 mom+0.014
 slowly+0.013
No+0.013
 moms+0.013
Neg logit contributions
rily-0.028
pole-0.027
iche-0.025
venture-0.025
 cuc-0.025
Token I
Token ID314
Feature activation-4.02e-01

Pos logit contributions
rily+0.019
pole+0.017
iche+0.017
izer+0.016
ifying+0.016
Neg logit contributions
 mom-0.018
 slowly-0.017
Mom-0.017
 soon-0.017
 care-0.017
Token don
Token ID836
Feature activation-1.31e+00

Pos logit contributions
 mom+0.010
Mom+0.009
 moms+0.009
 people+0.008
 slowly+0.008
Neg logit contributions
iche-0.019
ksh-0.018
venture-0.018
ifying-0.018
izer-0.017
Token't
Token ID470
Feature activation+2.51e-01

Pos logit contributions
 mom+0.024
Mom+0.023
 slowly+0.021
 now+0.019
 next+0.019
Neg logit contributions
rily-0.084
pole-0.078
 cuc-0.077
iche-0.077
izer-0.077
Token need
Token ID761
Feature activation-9.22e-02

Pos logit contributions
iche+0.010
pole+0.009
izer+0.009
rily+0.009
venture+0.009
Neg logit contributions
 slowly-0.008
 mom-0.008
Mom-0.008
 moms-0.008
 owner-0.008
Token a
Token ID257
Feature activation+2.28e-01

Pos logit contributions
Mom+0.002
 slowly+0.002
 owner+0.002
 mom+0.002
 soon+0.002
Neg logit contributions
rily-0.004
 cuc-0.004
iche-0.004
pole-0.004
ulp-0.004
Token mustache
Token ID49303
Feature activation-5.50e-01

Pos logit contributions
rily+0.010
pole+0.009
iche+0.009
izer+0.009
 cuc+0.009
Neg logit contributions
 mom-0.008
 slowly-0.007
Mom-0.007
 soon-0.007
 care-0.007
Token to
Token ID284
Feature activation+2.60e-01

Pos logit contributions
 mom+0.012
Mom+0.012
 slowly+0.011
 care+0.011
 brave+0.010
Neg logit contributions
rily-0.030
pole-0.029
iche-0.028
izer-0.028
ifying-0.027
Token Sara
Token ID24799
Feature activation-1.79e-01

Pos logit contributions
Mom+0.002
 care+0.002
 mom+0.002
 slowly+0.002
 hard+0.002
Neg logit contributions
rily-0.003
pole-0.003
izer-0.003
 cuc-0.003
cient-0.003
Token."
Token ID526
Feature activation-1.03e-02

Pos logit contributions
 mom+0.005
 slowly+0.004
Mom+0.004
 soon+0.004
 care+0.004
Neg logit contributions
rily-0.009
pole-0.009
iche-0.009
izer-0.009
cient-0.008
Token<|endoftext|>
Token ID50256
Feature activation-1.77e-01

Pos logit contributions
 mom+0.000
 slowly+0.000
 care+0.000
Mom+0.000
 soon+0.000
Neg logit contributions
rily-0.001
pole-0.001
iche-0.001
izer-0.001
 cuc-0.001
TokenAnna
Token ID31160
Feature activation-8.22e-01

Pos logit contributions
 mom+0.005
Mom+0.005
 slowly+0.005
 soon+0.005
 care+0.005
Neg logit contributions
rily-0.009
pole-0.009
iche-0.008
cient-0.008
izer-0.008
Token and
Token ID290
Feature activation-2.17e-01

Pos logit contributions
 mom+0.021
Mom+0.021
 moms+0.018
 slowly+0.018
 hard+0.018
Neg logit contributions
rily-0.039
izer-0.037
pole-0.037
venture-0.036
 cuc-0.035
Token Ben
Token ID3932
Feature activation-1.22e+00

Pos logit contributions
 mom+0.004
 care+0.003
 love+0.003
 Mom+0.003
Mom+0.003
Neg logit contributions
rily-0.015
pole-0.014
iche-0.014
izer-0.014
ksh-0.014
Token were
Token ID547
Feature activation-3.43e-01

Pos logit contributions
 mom+0.034
Mom+0.031
 slowly+0.031
 owner+0.030
 soon+0.029
Neg logit contributions
rily-0.059
pole-0.056
izer-0.054
iche-0.054
venture-0.054
Token twins
Token ID20345
Feature activation+1.56e-01

Pos logit contributions
 mom+0.008
 hard+0.006
 friends+0.006
 brave+0.006
 Mom+0.006
Neg logit contributions
rily-0.023
pole-0.022
izer-0.021
ifying-0.020
 cuc-0.020
Token who
Token ID508
Feature activation+1.61e-01

Pos logit contributions
rily+0.008
pole+0.008
iche+0.008
cient+0.008
ifying+0.008
Neg logit contributions
 mom-0.003
 slowly-0.003
Mom-0.003
 care-0.003
 soon-0.003
Token liked
Token ID8288
Feature activation+4.01e-01

Pos logit contributions
rily+0.008
pole+0.007
iche+0.007
venture+0.007
cient+0.007
Neg logit contributions
 mom-0.004
Mom-0.004
 slowly-0.004
 owner-0.004
 bud-0.004
Token to
Token ID284
Feature activation-9.61e-01

Pos logit contributions
rily+0.027
pole+0.026
izer+0.025
iche+0.025
ifying+0.025
Neg logit contributions
 mom-0.006
 slowly-0.004
 owner-0.004
 soon-0.004
 care-0.004
Token't
Token ID470
Feature activation+2.95e-01

Pos logit contributions
rily+0.009
pole+0.009
iche+0.009
izer+0.008
ifying+0.008
Neg logit contributions
 mom-0.006
 slowly-0.005
 owner-0.005
 care-0.005
Mom-0.005
Token want
Token ID765
Feature activation+1.75e-01

Pos logit contributions
iche+0.012
pole+0.011
rily+0.010
venture+0.010
ulp+0.010
Neg logit contributions
 mom-0.010
Mom-0.010
 moms-0.010
 slowly-0.010
 owner-0.009
Token to
Token ID284
Feature activation+7.17e-01

Pos logit contributions
rily+0.008
pole+0.008
melon+0.008
iche+0.008
ulp+0.007
Neg logit contributions
Mom-0.005
 slowly-0.004
 mom-0.004
 brave-0.004
OK-0.004
Token get
Token ID651
Feature activation+3.02e-01

Pos logit contributions
iche+0.029
rily+0.027
pole+0.027
ulp+0.027
melon+0.026
Neg logit contributions
 owner-0.021
Mom-0.021
 words-0.021
 slowly-0.020
 hard-0.020
Token wet
Token ID9583
Feature activation+2.32e-01

Pos logit contributions
rily+0.015
iche+0.014
pole+0.014
ifying+0.014
venture+0.014
Neg logit contributions
Mom-0.007
 mom-0.007
 owner-0.007
 slowly-0.007
Tom-0.007
Token too
Token ID1165
Feature activation-1.55e+00

Pos logit contributions
rily+0.014
pole+0.013
iche+0.013
izer+0.013
ifying+0.013
Neg logit contributions
 mom-0.004
 slowly-0.004
Mom-0.004
 care-0.004
 owner-0.004
Token.
Token ID13
Feature activation+1.26e-01

Pos logit contributions
 mom+0.031
Mom+0.030
 owner+0.028
 slowly+0.027
 moms+0.026
Neg logit contributions
rily-0.085
ifying-0.078
pole-0.077
iche-0.077
ulp-0.075
Token
Token ID220
Feature activation+3.85e-01

Pos logit contributions
rily+0.007
pole+0.006
izer+0.006
iche+0.006
ifying+0.006
Neg logit contributions
 mom-0.004
 slowly-0.003
Mom-0.003
 soon-0.003
 owner-0.003
Token
Token ID198
Feature activation+5.11e-01

Pos logit contributions
rily+0.024
pole+0.022
izer+0.022
iche+0.022
 cuc+0.022
Neg logit contributions
 mom-0.007
Mom-0.006
 slowly-0.006
 owner-0.006
 soon-0.006
Token
Token ID198
Feature activation+4.83e-01

Pos logit contributions
rily+0.030
pole+0.028
iche+0.027
izer+0.027
cient+0.027
Neg logit contributions
 mom-0.011
 slowly-0.009
Mom-0.009
 care-0.009
 soon-0.009
TokenTim
Token ID14967
Feature activation-2.64e-02

Pos logit contributions
rily+0.026
pole+0.025
izer+0.024
iche+0.024
venture+0.024
Neg logit contributions
 mom-0.012
 care-0.010
 slowly-0.010
 soon-0.010
 owner-0.010