TOP ACTIVATIONS
MAX = 0.105

. As Daisy ’ s gaze passed by ,
. When Lily ’ s mom picked her up
and Jack didn ’ t give up . They
restless and couldn ’ t get to sleep .
Jack and Emma ’ s mom gave them cel
but he didn ’ t know which way to
not to ruin them . We all need to use things
bad things happen , but we can always find a way
. She watched the rain drops fall and thought they were
't taste good , but we can always ask for help
was raining . The rain drops soaked the ship and made
the rain . The rain drops made a clear sound as
. All of the rain drops kissed the oak so it
excited and couldn ’ t wait to have fun
veil . When the loud voices come , it 's like
really hard . The rain drops were big and strong ,
She told him , " we don 't buy things from
Tom ’ s friend , Sarah ,
for the next day . We should be generous and share
can still happen . But we can always learn from our

MIN ACTIVATIONS
MIN = -0.102

Mark 's belt gl owed even brighter . They sat in
came back home empty - handed and told Lily the news
was astonished and cl apped even louder . Now it was
the garden looks much brighter even on cloudy days ". <|endoftext|>
listened and it gl owed even brighter . Bobby smiled and
. He whispered even more to the spark and
ase and it turned out even better than the old one
hall together and it looked much better . They both felt
boy 's dreams all seemed possible . Every day he started
barking . His owner spoke even more softly this time ,
looked y ummy and tasted even better . The girl was
and it gl owed an even brighter purple than before .
butterfly and the painting looked even brighter . Pete was so
, Lily 's room looked much better . Her mom said
the calendar . It looks even better than before ."
gray soup and it tasted even better than it sm elt
that the book case looked even better with the yellow paint
! She thought it looked even better than before . The
broken tray and it looked even more beautiful than before .
- the bow gl owed even brighter .

INTERVAL 0.053 - 0.105
CONTAINS 0.665%

and laughing together . As they went , Bob heard a
gathering flowers , and when they had finished , they proudly
to the family . The kids learned lots of new things
have to listen first . We have a song for you
the corn . But , you have to be fast !"

INTERVAL 0.002 - 0.053
CONTAINS 45.106%

“ Oh no , I 'm sorry !
. He saw Mia still swimming and smiled .
room , and she was screaming ! She had found Sam
good dad . You support me and our baby ."
do something amazing !", he said . Mom my

INTERVAL -0.050 - 0.002
CONTAINS 53.928%

make my kingdom better ?" The little girl thought hard
Lily with the block . He struck her on the head
best friends ." Mom hugged Lily and Anna and
to play with toys . One day , they find a
M olly was so happy ! She got a new

INTERVAL -0.102 - -0.050
CONTAINS 0.302%

playing with her brother was even better than playing with her
Finally , they arrived back home . Their son ran
was a great game . <|endoftext|> Tom my was a clumsy
was scared because he thought he broke his toy car .
the submarine smiled and waved back . Suddenly ,
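
The four interval boundaries listed above are consistent with splitting the range from MIN (-0.102) to MAX (0.105) into four equal-width bins. This is an inference from the printed numbers, not something the dump states, and the helper name `equal_width_edges` is hypothetical. A minimal Python sketch under that assumption:

```python
def equal_width_edges(lo, hi, n_bins):
    """Return n_bins + 1 evenly spaced bin edges from lo to hi (assumed binning rule)."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]


# MIN and MAX as printed in the dump (already rounded to three decimals).
edges = equal_width_edges(-0.102, 0.105, 4)

# Pair consecutive edges into (left, right) intervals, lowest bin first.
intervals = list(zip(edges[:-1], edges[1:]))
for left, right in intervals:
    print(f"{left:.4f} to {right:.4f}")
```

Because MIN and MAX are themselves rounded in the dump, the computed inner edges should only be expected to match the listed ones (-0.050, 0.002, 0.053) to within about 0.001.
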
Token.
Token ID13
Feature activation+8.12e-03

Pos logit contributions
keeper+0.032
eddy+0.029
lan+0.028
ber+0.028
minded+0.027
Neg logit contributions
hing-0.046
oned-0.045
 blush-0.042
 urge-0.040
 start-0.040
Token As
Token ID1081
Feature activation-2.08e-02

Pos logit contributions
oned+0.016
hing+0.016
 blush+0.015
 grin+0.014
 ease+0.014
Neg logit contributions
keeper-0.013
ber-0.013
lan-0.012
 mysteries-0.012
eddy-0.012
Token Daisy
Token ID40355
Feature activation+2.81e-02

Pos logit contributions
keeper+0.036
eddy+0.034
ray+0.033
ber+0.033
ney+0.033
Neg logit contributions
oned-0.037
 hardly-0.035
hing-0.035
 blush-0.035
 ease-0.035
Tokenâ
Token ID22940
Feature activation+1.84e-02

Pos logit contributions
hing+0.056
oned+0.053
 blush+0.051
 grin+0.048
 urge+0.046
Neg logit contributions
keeper-0.047
 mysteries-0.042
ber-0.042
lan-0.041
minded-0.041
Token
Token ID26391
Feature activation+3.42e-02

Pos logit contributions
hing+0.027
 blush+0.026
oned+0.026
 grin+0.025
 hardly+0.023
Neg logit contributions
keeper-0.036
minded-0.035
ray-0.034
lan-0.033
 Cream-0.032
Token
Token ID8151
Feature activation+1.05e-01

Pos logit contributions
hing+0.071
oned+0.068
 blush+0.064
 grin+0.062
 urge+0.059
Neg logit contributions
keeper-0.054
lan-0.048
 mysteries-0.048
minded-0.047
ber-0.047
Tokens
Token ID82
Feature activation+2.14e-02

Pos logit contributions
hing+0.207
 blush+0.195
oned+0.192
 grin+0.182
 urge+0.181
Neg logit contributions
keeper-0.181
o-0.161
ber-0.156
enn-0.154
eddy-0.150
Token gaze
Token ID17841
Feature activation+1.14e-02

Pos logit contributions
hing+0.049
oned+0.047
 blush+0.045
 grin+0.043
 urge+0.042
Neg logit contributions
keeper-0.030
lan-0.025
eddy-0.025
minded-0.025
ber-0.025
Token passed
Token ID3804
Feature activation-2.08e-02

Pos logit contributions
hing+0.023
oned+0.022
 blush+0.021
 grin+0.020
 urge+0.019
Neg logit contributions
keeper-0.019
lan-0.017
 mysteries-0.017
ber-0.016
minded-0.016
Token by
Token ID416
Feature activation+8.81e-03

Pos logit contributions
lan+0.035
keeper+0.035
ber+0.035
eddy+0.034
 Cream+0.032
Neg logit contributions
hing-0.038
oned-0.036
 blush-0.034
 grin-0.033
 ease-0.032
Token,
Token ID11
Feature activation+1.43e-02

Pos logit contributions
hing+0.017
oned+0.017
 blush+0.016
 grin+0.015
 ease+0.015
Neg logit contributions
keeper-0.015
ber-0.013
eddy-0.013
lan-0.013
minded-0.013
Token.
Token ID13
Feature activation+9.48e-03

Pos logit contributions
hing+0.021
oned+0.021
 blush+0.020
 grin+0.019
 urge+0.019
Neg logit contributions
keeper-0.016
 mysteries-0.014
lan-0.014
ber-0.014
minded-0.014
Token When
Token ID1649
Feature activation+1.94e-03

Pos logit contributions
oned+0.017
 hardly+0.015
 blush+0.015
urance+0.015
 grin+0.014
Neg logit contributions
 mysteries-0.017
lan-0.017
ber-0.016
ind-0.016
 cave-0.016
Token Lily
Token ID20037
Feature activation+1.44e-02

Pos logit contributions
oned+0.004
 blush+0.004
hing+0.004
 hardly+0.004
 start+0.004
Neg logit contributions
keeper-0.003
minded-0.003
 mysteries-0.003
ber-0.003
ray-0.003
Tokenâ
Token ID22940
Feature activation+1.42e-02

Pos logit contributions
hing+0.025
oned+0.024
 blush+0.022
 grin+0.021
 urge+0.021
Neg logit contributions
keeper-0.027
lan-0.025
 mysteries-0.025
eddy-0.025
minded-0.025
Token
Token ID26391
Feature activation+2.20e-02

Pos logit contributions
hing+0.022
 blush+0.020
oned+0.020
 ease+0.019
 hardly+0.019
Neg logit contributions
keeper-0.027
minded-0.026
ray-0.026
 Cream-0.025
eddy-0.024
Token
Token ID8151
Feature activation+1.00e-01

Pos logit contributions
hing+0.045
oned+0.044
 blush+0.041
 grin+0.040
 hardly+0.038
Neg logit contributions
keeper-0.035
lan-0.031
 mysteries-0.031
minded-0.031
eddy-0.030
Tokens
Token ID82
Feature activation+1.40e-02

Pos logit contributions
hing+0.224
oned+0.209
 blush+0.204
 grin+0.196
 urge+0.192
Neg logit contributions
keeper-0.149
ber-0.125
eddy-0.125
lan-0.122
o-0.121
Token mom
Token ID1995
Feature activation-9.06e-03

Pos logit contributions
hing+0.032
oned+0.031
 blush+0.030
 grin+0.029
 urge+0.028
Neg logit contributions
keeper-0.019
lan-0.017
 mysteries-0.016
minded-0.016
ber-0.016
Token picked
Token ID6497
Feature activation-2.55e-02

Pos logit contributions
keeper+0.014
lan+0.013
 mysteries+0.013
minded+0.013
ber+0.013
Neg logit contributions
hing-0.018
oned-0.018
 blush-0.017
 grin-0.016
mine-0.016
Token her
Token ID607
Feature activation-1.58e-02

Pos logit contributions
keeper+0.049
lan+0.047
ber+0.046
ney+0.045
 mysteries+0.045
Neg logit contributions
hing-0.043
oned-0.042
 blush-0.040
mine-0.037
 ease-0.036
Token up
Token ID510
Feature activation-2.24e-02

Pos logit contributions
lan+0.029
keeper+0.028
minded+0.027
 mysteries+0.027
eddy+0.026
Neg logit contributions
oned-0.026
hing-0.026
 blush-0.025
 urge-0.024
 grin-0.023
Token and
Token ID290
Feature activation+3.18e-02

Pos logit contributions
keeper+0.030
 mysteries+0.027
lan+0.027
minded+0.027
eddy+0.026
Neg logit contributions
hing-0.035
oned-0.033
 blush-0.032
 grin-0.031
mine-0.029
Token Jack
Token ID3619
Feature activation+3.03e-02

Pos logit contributions
hing+0.080
oned+0.078
 blush+0.076
 grin+0.073
 urge+0.071
Neg logit contributions
keeper-0.036
 mysteries-0.030
ber-0.030
eddy-0.030
lan-0.030
Token didn
Token ID1422
Feature activation+3.69e-02

Pos logit contributions
hing+0.058
oned+0.057
 blush+0.052
 AW+0.049
 grin+0.048
Neg logit contributions
keeper-0.054
ber-0.047
 who-0.046
lan-0.045
minded-0.045
Tokenâ
Token ID22940
Feature activation+1.83e-02

Pos logit contributions
hing+0.060
 blush+0.055
oned+0.053
 grin+0.053
 thrill+0.051
Neg logit contributions
keeper-0.069
eddy-0.065
 late-0.064
 Cream-0.060
 who-0.059
Token
Token ID26391
Feature activation+3.45e-02

Pos logit contributions
hing+0.027
 blush+0.027
oned+0.026
 grin+0.025
 ease+0.025
Neg logit contributions
keeper-0.034
minded-0.034
ray-0.033
 Cream-0.031
lan-0.031
Token
Token ID8151
Feature activation+9.63e-02

Pos logit contributions
hing+0.056
oned+0.054
 blush+0.050
 grin+0.048
 hardly+0.046
Neg logit contributions
keeper-0.069
 mysteries-0.063
minded-0.063
lan-0.062
eddy-0.062
Tokent
Token ID83
Feature activation+3.04e-02

Pos logit contributions
hing+0.175
 blush+0.159
oned+0.159
 grin+0.148
 urge+0.144
Neg logit contributions
keeper-0.185
ber-0.163
o-0.161
eddy-0.159
lan-0.157
Token give
Token ID1577
Feature activation-2.63e-02

Pos logit contributions
oned+0.043
hing+0.042
 AW+0.042
 ease+0.037
rell+0.035
Neg logit contributions
 who-0.061
keeper-0.059
minded-0.056
lan-0.054
o-0.054
Token up
Token ID510
Feature activation-3.16e-02

Pos logit contributions
keeper+0.054
ney+0.051
ber+0.051
lan+0.051
eddy+0.050
Neg logit contributions
 blush-0.036
 start-0.034
oned-0.034
hing-0.034
 ease-0.033
Token.
Token ID13
Feature activation-3.37e-04

Pos logit contributions
keeper+0.054
eddy+0.050
 mysteries+0.048
lan+0.048
ray+0.048
Neg logit contributions
hing-0.057
oned-0.057
 blush-0.056
 grin-0.055
 urge-0.051
Token They
Token ID1119
Feature activation+3.09e-02

Pos logit contributions
 mysteries+0.001
ber+0.001
ind+0.001
keeper+0.001
lan+0.001
Neg logit contributions
oned-0.001
 grin-0.001
 blush-0.001
hing-0.001
 hardly-0.001
Token restless
Token ID42537
Feature activation+1.51e-02

Pos logit contributions
hing+0.050
oned+0.048
 blush+0.046
 grin+0.043
 AW+0.042
Neg logit contributions
keeper-0.040
ber-0.035
lan-0.035
 mysteries-0.035
minded-0.034
Token and
Token ID290
Feature activation-8.40e-03

Pos logit contributions
hing+0.035
oned+0.033
 blush+0.032
 grin+0.031
 urge+0.031
Neg logit contributions
keeper-0.020
lan-0.018
ber-0.017
 mysteries-0.017
eddy-0.017
Token couldn
Token ID3521
Feature activation+1.75e-02

Pos logit contributions
keeper+0.014
 mysteries+0.013
lan+0.013
eddy+0.013
minded+0.013
Neg logit contributions
hing-0.016
oned-0.016
 blush-0.015
 grin-0.014
 urge-0.014
Tokenâ
Token ID22940
Feature activation+2.47e-02

Pos logit contributions
hing+0.021
oned+0.019
 blush+0.018
 grin+0.016
 contain+0.015
Neg logit contributions
keeper-0.043
eddy-0.040
ber-0.039
lan-0.039
 mysteries-0.039
Token
Token ID26391
Feature activation+3.95e-02

Pos logit contributions
 blush+0.035
 grin+0.033
hing+0.032
oned+0.030
 crash+0.028
Neg logit contributions
minded-0.048
ray-0.046
keeper-0.044
 late-0.044
clock-0.043
Token
Token ID8151
Feature activation+9.35e-02

Pos logit contributions
hing+0.072
oned+0.069
 blush+0.064
 grin+0.061
 hardly+0.061
Neg logit contributions
keeper-0.072
 mysteries-0.065
minded-0.065
lan-0.064
eddy-0.063
Tokent
Token ID83
Feature activation+3.04e-02

Pos logit contributions
hing+0.157
oned+0.146
 blush+0.141
 grin+0.132
 urge+0.126
Neg logit contributions
keeper-0.189
ber-0.169
eddy-0.167
lan-0.166
minded-0.163
Token get
Token ID651
Feature activation-3.29e-02

Pos logit contributions
hing+0.041
oned+0.040
 AW+0.036
 blush+0.035
rell+0.033
Neg logit contributions
keeper-0.066
 who-0.063
lan-0.061
ber-0.061
minded-0.060
Token to
Token ID284
Feature activation+1.50e-02

Pos logit contributions
keeper+0.057
 mysteries+0.053
eddy+0.053
ber+0.053
minded+0.051
Neg logit contributions
hing-0.057
oned-0.056
 start-0.055
 blush-0.055
 urge-0.053
Token sleep
Token ID3993
Feature activation+4.68e-03

Pos logit contributions
hing+0.025
oned+0.024
 blush+0.022
 grin+0.020
 AW+0.020
Neg logit contributions
keeper-0.029
 mysteries-0.027
lan-0.027
minded-0.026
eddy-0.026
Token.
Token ID13
Feature activation+3.02e-03

Pos logit contributions
hing+0.011
oned+0.011
 blush+0.010
 grin+0.010
 urge+0.010
Neg logit contributions
keeper-0.006
 mysteries-0.005
lan-0.005
minded-0.005
eddy-0.005
Token Jack
Token ID3619
Feature activation-2.09e-03

Pos logit contributions
hing+0.021
oned+0.021
 blush+0.020
 start+0.019
 grin+0.019
Neg logit contributions
keeper-0.019
ber-0.018
minded-0.017
eddy-0.017
lan-0.017
Token and
Token ID290
Feature activation+5.13e-02

Pos logit contributions
keeper+0.003
 mysteries+0.003
minded+0.003
lan+0.003
eddy+0.003
Neg logit contributions
hing-0.004
oned-0.004
 blush-0.004
 grin-0.004
 hardly-0.004
Token Emma
Token ID18966
Feature activation+4.33e-02

Pos logit contributions
hing+0.094
oned+0.092
 blush+0.089
 grin+0.083
 urge+0.082
Neg logit contributions
keeper-0.091
lan-0.085
ber-0.084
minded-0.081
 mysteries-0.081
Tokenâ
Token ID22940
Feature activation+2.62e-02

Pos logit contributions
hing+0.085
oned+0.084
 blush+0.075
ivated+0.072
 AW+0.072
Neg logit contributions
keeper-0.075
ber-0.067
lan-0.065
 who-0.064
o-0.063
Token
Token ID26391
Feature activation+4.75e-02

Pos logit contributions
 blush+0.037
hing+0.036
 hardly+0.035
 grin+0.034
oned+0.034
Neg logit contributions
ray-0.049
minded-0.048
keeper-0.048
 Cream-0.045
o-0.045
Token
Token ID8151
Feature activation+9.24e-02

Pos logit contributions
hing+0.099
oned+0.094
 blush+0.090
 grin+0.085
 respond+0.083
Neg logit contributions
keeper-0.076
lan-0.067
ber-0.067
 mysteries-0.066
minded-0.066
Tokens
Token ID82
Feature activation+3.01e-02

Pos logit contributions
hing+0.234
oned+0.220
 blush+0.218
 grin+0.206
 AW+0.205
Neg logit contributions
keeper-0.110
ber-0.091
eddy-0.090
o-0.090
lan-0.089
Token mom
Token ID1995
Feature activation+6.90e-03

Pos logit contributions
hing+0.069
oned+0.066
 blush+0.064
 grin+0.061
 urge+0.060
Neg logit contributions
keeper-0.042
 mysteries-0.036
lan-0.036
minded-0.036
ber-0.035
Token gave
Token ID2921
Feature activation-2.65e-02

Pos logit contributions
hing+0.014
oned+0.013
 blush+0.012
 grin+0.012
 AW+0.011
Neg logit contributions
keeper-0.012
lan-0.010
ber-0.010
eddy-0.010
minded-0.010
Token them
Token ID606
Feature activation+8.00e-03

Pos logit contributions
keeper+0.056
ber+0.053
lan+0.051
 mysteries+0.050
ney+0.050
Neg logit contributions
hing-0.043
oned-0.041
 blush-0.039
mine-0.038
 ease-0.037
Token cel
Token ID18725
Feature activation-1.07e-02

Pos logit contributions
hing+0.016
 blush+0.015
oned+0.015
 start+0.014
 grin+0.013
Neg logit contributions
keeper-0.013
ber-0.012
 mysteries-0.011
minded-0.011
eddy-0.011
Token but
Token ID475
Feature activation+6.98e-03

Pos logit contributions
hing+0.023
oned+0.023
 blush+0.022
 grin+0.021
 urge+0.021
Neg logit contributions
keeper-0.021
eddy-0.019
lan-0.019
ber-0.019
minded-0.019
Token he
Token ID339
Feature activation-1.26e-02

Pos logit contributions
hing+0.013
oned+0.013
 blush+0.012
 grin+0.011
 urge+0.011
Neg logit contributions
keeper-0.012
lan-0.011
minded-0.011
eddy-0.011
 mysteries-0.011
Token didn
Token ID1422
Feature activation+3.25e-02

Pos logit contributions
keeper+0.025
lan+0.024
 mysteries+0.023
eddy+0.023
minded+0.023
Neg logit contributions
hing-0.019
oned-0.019
 blush-0.018
 hardly-0.018
 grin-0.017
Tokenâ
Token ID22940
Feature activation+8.93e-03

Pos logit contributions
hing+0.045
 blush+0.041
oned+0.041
 grin+0.041
 thrill+0.036
Neg logit contributions
keeper-0.072
eddy-0.067
 late-0.064
 mysteries-0.063
ber-0.063
Token
Token ID26391
Feature activation+1.95e-02

Pos logit contributions
 blush+0.012
oned+0.012
hing+0.012
 grin+0.012
 leap+0.011
Neg logit contributions
minded-0.017
ray-0.017
keeper-0.017
eddy-0.016
 late-0.016
Token
Token ID8151
Feature activation+9.18e-02

Pos logit contributions
hing+0.036
oned+0.034
 blush+0.032
 grin+0.031
 hardly+0.029
Neg logit contributions
keeper-0.036
 mysteries-0.032
lan-0.032
minded-0.032
eddy-0.031
Tokent
Token ID83
Feature activation+2.14e-02

Pos logit contributions
hing+0.157
oned+0.145
 blush+0.142
 grin+0.135
 urge+0.132
Neg logit contributions
keeper-0.182
ber-0.163
eddy-0.162
lan-0.158
o-0.157
Token know
Token ID760
Feature activation-2.29e-02

Pos logit contributions
hing+0.030
oned+0.029
 blush+0.027
 AW+0.026
 grin+0.025
Neg logit contributions
keeper-0.044
 who-0.043
minded-0.042
ber-0.041
 mysteries-0.041
Token which
Token ID543
Feature activation-4.41e-02

Pos logit contributions
keeper+0.040
minded+0.036
eddy+0.036
lan+0.036
ney+0.035
Neg logit contributions
hing-0.043
 blush-0.041
oned-0.038
 ease-0.037
 start-0.037
Token way
Token ID835
Feature activation-6.32e-03

Pos logit contributions
keeper+0.076
minded+0.069
eddy+0.069
lan+0.069
ber+0.067
Neg logit contributions
hing-0.085
 blush-0.078
oned-0.077
 urge-0.077
 start-0.075
Token to
Token ID284
Feature activation+2.73e-02

Pos logit contributions
keeper+0.008
lan+0.007
minded+0.007
ber+0.007
 mysteries+0.007
Neg logit contributions
hing-0.015
oned-0.015
 blush-0.014
 grin-0.014
 urge-0.013
Token not
Token ID407
Feature activation+1.34e-02

Pos logit contributions
hing+0.009
oned+0.008
 blush+0.008
 grin+0.008
 urge+0.007
Neg logit contributions
keeper-0.005
 mysteries-0.004
lan-0.004
minded-0.004
ber-0.004
Token to
Token ID284
Feature activation+3.02e-02

Pos logit contributions
hing+0.038
oned+0.037
 blush+0.035
 grin+0.034
 urge+0.034
Neg logit contributions
keeper-0.011
lan-0.008
 mysteries-0.008
ber-0.008
minded-0.008
Token ruin
Token ID16866
Feature activation-1.65e-03

Pos logit contributions
hing+0.042
oned+0.040
 blush+0.036
 grin+0.034
 AW+0.032
Neg logit contributions
keeper-0.067
 mysteries-0.063
minded-0.062
lan-0.062
ber-0.062
Token them
Token ID606
Feature activation+1.96e-02

Pos logit contributions
keeper+0.003
eddy+0.003
lan+0.003
ber+0.003
minded+0.003
Neg logit contributions
hing-0.003
 ease-0.003
 start-0.003
 blush-0.003
oned-0.003
Token.
Token ID13
Feature activation+3.95e-03

Pos logit contributions
hing+0.051
oned+0.050
 blush+0.048
 grin+0.046
 urge+0.045
Neg logit contributions
keeper-0.020
 mysteries-0.017
lan-0.017
ber-0.016
minded-0.016
Token We
Token ID775
Feature activation+9.16e-02

Pos logit contributions
 hardly+0.006
 grin+0.006
oned+0.005
 ease+0.005
urance+0.005
Neg logit contributions
ind-0.008
 mysteries-0.007
keeper-0.007
ups-0.007
minded-0.007
Token all
Token ID477
Feature activation+3.37e-02

Pos logit contributions
hing+0.185
oned+0.179
 blush+0.169
 grin+0.160
 urge+0.153
Neg logit contributions
keeper-0.151
 mysteries-0.132
ber-0.131
minded-0.131
lan-0.131
Token need
Token ID761
Feature activation-7.24e-03

Pos logit contributions
oned+0.046
hing+0.046
 blush+0.042
 grin+0.037
 AW+0.036
Neg logit contributions
keeper-0.075
 mysteries-0.068
minded-0.068
ber-0.068
 who-0.067
Token to
Token ID284
Feature activation+2.17e-02

Pos logit contributions
keeper+0.009
eddy+0.008
ber+0.008
minded+0.007
lan+0.007
Neg logit contributions
hing-0.017
oned-0.017
 ease-0.016
 blush-0.016
 start-0.016
Token use
Token ID779
Feature activation-1.85e-02

Pos logit contributions
hing+0.033
oned+0.032
 blush+0.029
 AW+0.027
 grin+0.027
Neg logit contributions
keeper-0.043
 who-0.040
 mysteries-0.040
minded-0.039
lan-0.039
Token things
Token ID1243
Feature activation+2.15e-02

Pos logit contributions
keeper+0.031
ber+0.027
eddy+0.027
lan+0.027
minded+0.026
Neg logit contributions
hing-0.038
oned-0.035
 blush-0.034
 ease-0.034
 start-0.033
Token bad
Token ID2089
Feature activation+1.97e-02

Pos logit contributions
hing+0.037
oned+0.035
 blush+0.033
 urge+0.033
 grin+0.032
Neg logit contributions
keeper-0.025
lan-0.022
ber-0.022
eddy-0.022
minded-0.021
Token things
Token ID1243
Feature activation+3.92e-02

Pos logit contributions
hing+0.048
oned+0.046
 urge+0.045
 blush+0.044
 realize+0.043
Neg logit contributions
keeper-0.025
ber-0.020
eddy-0.020
lan-0.020
elia-0.018
Token happen
Token ID1645
Feature activation+4.11e-03

Pos logit contributions
hing+0.077
oned+0.076
 blush+0.072
 grin+0.069
mine+0.066
Neg logit contributions
keeper-0.064
 mysteries-0.059
minded-0.056
 who-0.054
ber-0.054
Token,
Token ID11
Feature activation+1.46e-02

Pos logit contributions
oned+0.006
hing+0.006
're+0.006
 start+0.005
 urge+0.005
Neg logit contributions
keeper-0.008
ber-0.008
lan-0.007
eddy-0.007
 Cream-0.007
Token but
Token ID475
Feature activation+8.91e-03

Pos logit contributions
hing+0.027
oned+0.026
 blush+0.025
 urge+0.024
 realize+0.023
Neg logit contributions
keeper-0.027
ber-0.025
lan-0.024
eddy-0.024
minded-0.023
Token we
Token ID356
Feature activation+9.11e-02

Pos logit contributions
hing+0.018
oned+0.017
 blush+0.016
 urge+0.016
 respond+0.016
Neg logit contributions
keeper-0.014
ber-0.013
lan-0.013
eddy-0.013
minded-0.012
Token can
Token ID460
Feature activation+4.84e-02

Pos logit contributions
hing+0.182
oned+0.175
 blush+0.166
 grin+0.158
 AW+0.150
Neg logit contributions
keeper-0.152
 mysteries-0.134
minded-0.132
lan-0.132
ber-0.131
Token always
Token ID1464
Feature activation+1.31e-03

Pos logit contributions
hing+0.084
oned+0.083
 blush+0.078
 grin+0.072
 AW+0.071
Neg logit contributions
keeper-0.092
minded-0.082
eddy-0.080
 mysteries-0.080
lan-0.080
Token find
Token ID1064
Feature activation-2.24e-02

Pos logit contributions
hing+0.002
oned+0.002
 blush+0.002
 grin+0.002
 urge+0.001
Neg logit contributions
keeper-0.003
 mysteries-0.003
lan-0.003
minded-0.003
ber-0.003
Token a
Token ID257
Feature activation-1.30e-02

Pos logit contributions
keeper+0.039
ber+0.037
eddy+0.033
lan+0.033
o+0.032
Neg logit contributions
hing-0.043
oned-0.040
 ease-0.040
 blush-0.040
 urge-0.038
Token way
Token ID835
Feature activation-8.36e-03

Pos logit contributions
keeper+0.024
minded+0.024
lan+0.024
eddy+0.024
ber+0.023
Neg logit contributions
 urge-0.019
 start-0.018
hing-0.018
 realize-0.018
oned-0.018
Token.
Token ID13
Feature activation+8.91e-03

Pos logit contributions
hing+0.007
oned+0.007
 blush+0.007
 grin+0.007
 urge+0.006
Neg logit contributions
keeper-0.004
lan-0.003
minded-0.003
ber-0.003
eddy-0.003
Token She
Token ID1375
Feature activation-7.37e-03

Pos logit contributions
oned+0.018
hing+0.018
 blush+0.018
 grin+0.016
 urge+0.016
Neg logit contributions
keeper-0.014
lan-0.013
ber-0.012
 mysteries-0.012
minded-0.012
Token watched
Token ID7342
Feature activation+3.63e-02

Pos logit contributions
keeper+0.014
lan+0.013
 mysteries+0.012
minded+0.012
eddy+0.012
Neg logit contributions
hing-0.013
oned-0.012
 blush-0.012
 grin-0.011
 urge-0.011
Token the
Token ID262
Feature activation+1.76e-02

Pos logit contributions
 AW+0.091
hing+0.090
 grin+0.089
oned+0.087
 contain+0.086
Neg logit contributions
 who-0.033
eddy-0.028
 by-0.028
 to-0.027
 God-0.027
Token rain
Token ID6290
Feature activation+4.53e-02

Pos logit contributions
hing+0.026
oned+0.026
 blush+0.023
 grin+0.022
mine+0.021
Neg logit contributions
keeper-0.038
lan-0.035
 mysteries-0.035
eddy-0.035
minded-0.034
Tokendrops
Token ID49253
Feature activation+9.05e-02

Pos logit contributions
oned+0.088
hing+0.086
 AW+0.082
 grin+0.082
 ease+0.082
Neg logit contributions
keeper-0.067
 who-0.062
ups-0.060
o-0.058
 boy-0.058
Token fall
Token ID2121
Feature activation+2.29e-02

Pos logit contributions
oned+0.182
 AW+0.173
urance+0.171
hing+0.170
Ta+0.168
Neg logit contributions
 to-0.120
 who-0.115
 by-0.110
 again-0.109
 too-0.102
Token and
Token ID290
Feature activation-1.69e-02

Pos logit contributions
hing+0.052
oned+0.052
 blush+0.049
 grin+0.048
 contain+0.046
Neg logit contributions
keeper-0.029
eddy-0.026
 mysteries-0.024
minded-0.024
lan-0.024
Token thought
Token ID1807
Feature activation+2.80e-03

Pos logit contributions
keeper+0.024
 mysteries+0.021
lan+0.021
minded+0.021
ber+0.020
Neg logit contributions
hing-0.038
oned-0.037
 blush-0.033
 grin-0.033
 AW-0.032
Token they
Token ID484
Feature activation+1.25e-02

Pos logit contributions
hing+0.007
oned+0.007
 grin+0.006
 blush+0.006
 AW+0.006
Neg logit contributions
keeper-0.003
 mysteries-0.003
lan-0.003
eddy-0.002
 who-0.002
Token were
Token ID547
Feature activation+1.71e-02

Pos logit contributions
oned+0.023
hing+0.021
 grin+0.020
 AW+0.019
 sidelines+0.019
Neg logit contributions
keeper-0.022
 mysteries-0.021
 who-0.021
ray-0.020
minded-0.020
Token't
Token ID470
Feature activation+3.93e-02

Pos logit contributions
 blush+0.054
 grin+0.053
oned+0.051
hing+0.051
 thrill+0.046
Neg logit contributions
keeper-0.080
eddy-0.077
ups-0.073
 mysteries-0.072
 late-0.072
Token taste
Token ID6938
Feature activation+7.39e-03

Pos logit contributions
oned+0.055
hing+0.054
 blush+0.047
 grin+0.045
 AW+0.045
Neg logit contributions
keeper-0.084
 mysteries-0.077
minded-0.077
 who-0.076
 boy-0.075
Token good
Token ID922
Feature activation+1.08e-03

Pos logit contributions
hing+0.014
oned+0.013
 blush+0.013
 grin+0.012
 urge+0.012
Neg logit contributions
keeper-0.013
 mysteries-0.012
lan-0.012
ber-0.012
minded-0.011
Token,
Token ID11
Feature activation+9.98e-03

Pos logit contributions
hing+0.002
oned+0.002
 blush+0.002
 grin+0.002
 urge+0.002
Neg logit contributions
keeper-0.002
ber-0.001
lan-0.001
 mysteries-0.001
eddy-0.001
Token but
Token ID475
Feature activation+1.11e-02

Pos logit contributions
hing+0.017
 blush+0.017
oned+0.017
 urge+0.015
 hardly+0.015
Neg logit contributions
keeper-0.019
ber-0.017
lan-0.017
ray-0.017
minded-0.017
Token we
Token ID356
Feature activation+8.99e-02

Pos logit contributions
hing+0.021
oned+0.021
 blush+0.020
 urge+0.020
 start+0.019
Neg logit contributions
keeper-0.018
ber-0.016
lan-0.016
eddy-0.016
minded-0.016
Token can
Token ID460
Feature activation+4.55e-02

Pos logit contributions
hing+0.178
oned+0.171
 blush+0.162
 grin+0.154
 urge+0.147
Neg logit contributions
keeper-0.151
 mysteries-0.134
lan-0.133
minded-0.133
ber-0.132
Token always
Token ID1464
Feature activation+4.57e-04

Pos logit contributions
hing+0.078
oned+0.076
 blush+0.070
 grin+0.066
 AW+0.066
Neg logit contributions
keeper-0.087
minded-0.077
eddy-0.077
lan-0.077
 mysteries-0.076
Token ask
Token ID1265
Feature activation-1.33e-02

Pos logit contributions
hing+0.001
oned+0.001
 blush+0.001
 grin+0.000
 urge+0.000
Neg logit contributions
keeper-0.001
 mysteries-0.001
lan-0.001
ber-0.001
minded-0.001
Token for
Token ID329
Feature activation-2.13e-02

Pos logit contributions
keeper+0.024
ney+0.022
enn+0.021
eddy+0.021
ber+0.021
Neg logit contributions
oned-0.024
 blush-0.023
hing-0.023
 start-0.022
 hardly-0.021
Token help
Token ID1037
Feature activation+1.85e-02

Pos logit contributions
keeper+0.053
eddy+0.051
ney+0.050
ber+0.049
enn+0.048
Neg logit contributions
 blush-0.024
 start-0.023
hing-0.022
oned-0.022
 ease-0.021
Token was
Token ID373
Feature activation+1.37e-02

Pos logit contributions
hing+0.014
oned+0.013
 blush+0.013
 grin+0.012
 respond+0.012
Neg logit contributions
keeper-0.010
 mysteries-0.009
eddy-0.009
lan-0.009
ber-0.009
Token raining
Token ID43079
Feature activation+2.55e-02

Pos logit contributions
hing+0.031
oned+0.031
 blush+0.029
 grin+0.028
mine+0.027
Neg logit contributions
keeper-0.018
 cave-0.017
 late-0.016
 coal-0.016
eddy-0.016
Token.
Token ID13
Feature activation+1.57e-03

Pos logit contributions
oned+0.070
hing+0.069
 blush+0.068
 grin+0.067
 respond+0.064
Neg logit contributions
keeper-0.021
eddy-0.017
 mysteries-0.016
 late-0.016
lan-0.015
Token The
Token ID383
Feature activation+4.19e-03

Pos logit contributions
oned+0.002
 blush+0.002
hing+0.002
 hardly+0.002
mine+0.002
Neg logit contributions
ber-0.003
minded-0.003
keeper-0.003
 mysteries-0.003
ind-0.003
Token rain
Token ID6290
Feature activation+4.02e-02

Pos logit contributions
hing+0.008
 urge+0.008
 start+0.008
oned+0.007
 blush+0.007
Neg logit contributions
keeper-0.007
minded-0.006
ber-0.006
elia-0.006
enn-0.006
Tokendrops
Token ID49253
Feature activation+8.98e-02

Pos logit contributions
oned+0.086
hing+0.084
 blush+0.081
 respond+0.080
 grin+0.078
Neg logit contributions
keeper-0.060
 boy-0.054
o-0.052
ups-0.052
eddy-0.051
Token soaked
Token ID33657
Feature activation+3.52e-02

Pos logit contributions
oned+0.181
 respond+0.174
 grin+0.173
urance+0.170
hing+0.163
Neg logit contributions
 who-0.122
 to-0.113
s-0.110
o-0.104
 by-0.103
Token the
Token ID262
Feature activation+9.76e-03

Pos logit contributions
hing+0.078
oned+0.075
 blush+0.072
 grin+0.069
 urge+0.066
Neg logit contributions
keeper-0.051
 mysteries-0.044
lan-0.044
minded-0.043
eddy-0.043
Token ship
Token ID4074
Feature activation-5.17e-03

Pos logit contributions
hing+0.018
 urge+0.016
 start+0.015
oned+0.015
 blush+0.015
Neg logit contributions
keeper-0.018
ber-0.016
minded-0.016
enn-0.016
elia-0.016
Token and
Token ID290
Feature activation-3.39e-03

Pos logit contributions
keeper+0.006
 who+0.005
 cave+0.004
 mysteries+0.004
 coal+0.004
Neg logit contributions
oned-0.013
hing-0.013
 blush-0.012
 grin-0.012
 AW-0.011
Token made
Token ID925
Feature activation+2.96e-02

Pos logit contributions
keeper+0.005
minded+0.004
lan+0.004
 mysteries+0.004
ber+0.004
Neg logit contributions
hing-0.007
oned-0.007
 blush-0.007
 grin-0.006
 urge-0.006
Token the
Token ID262
Feature activation-1.37e-03

Pos logit contributions
hing+0.035
oned+0.033
 blush+0.032
 grin+0.031
 urge+0.029
Neg logit contributions
keeper-0.020
 mysteries-0.017
lan-0.017
minded-0.017
ber-0.017
Token rain
Token ID6290
Feature activation+2.27e-02

Pos logit contributions
keeper+0.003
 mysteries+0.002
lan+0.002
eddy+0.002
minded+0.002
Neg logit contributions
hing-0.002
oned-0.002
 blush-0.002
 grin-0.002
mine-0.002
Token.
Token ID13
Feature activation+1.24e-02

Pos logit contributions
hing+0.065
oned+0.064
 blush+0.061
 grin+0.060
 respond+0.059
Neg logit contributions
keeper-0.017
lan-0.013
minded-0.013
eddy-0.013
ber-0.013
Token The
Token ID383
Feature activation+9.26e-03

Pos logit contributions
hing+0.029
oned+0.029
 blush+0.027
 grin+0.026
 urge+0.025
Neg logit contributions
keeper-0.016
 mysteries-0.014
lan-0.014
ber-0.014
minded-0.014
Token rain
Token ID6290
Feature activation+3.90e-02

Pos logit contributions
hing+0.016
oned+0.016
 grin+0.014
 blush+0.014
mine+0.013
Neg logit contributions
keeper-0.017
eddy-0.016
 mysteries-0.016
lan-0.016
ups-0.016
Tokendrops
Token ID49253
Feature activation+8.92e-02

Pos logit contributions
oned+0.090
hing+0.088
 ease+0.085
 respond+0.084
 grin+0.084
Neg logit contributions
keeper-0.048
ups-0.045
 boy-0.043
o-0.040
eddy-0.040
Token made
Token ID925
Feature activation+2.89e-02

Pos logit contributions
oned+0.189
urance+0.177
 grin+0.170
 respond+0.169
 Deal+0.167
Neg logit contributions
s-0.113
 who-0.113
 to-0.106
st-0.098
ups-0.097
Token a
Token ID257
Feature activation+9.21e-03

Pos logit contributions
oned+0.061
hing+0.060
 grin+0.058
 contain+0.055
 AW+0.055
Neg logit contributions
keeper-0.039
eddy-0.038
 mysteries-0.036
 late-0.036
ups-0.035
Token clear
Token ID1598
Feature activation+1.81e-02

Pos logit contributions
hing+0.020
oned+0.019
 blush+0.017
mine+0.017
 ease+0.016
Neg logit contributions
keeper-0.015
 cave-0.012
 mysteries-0.012
eddy-0.012
 boy-0.012
Token sound
Token ID2128
Feature activation+4.56e-02

Pos logit contributions
oned+0.035
hing+0.035
 blush+0.031
ivated+0.030
 ease+0.030
Neg logit contributions
keeper-0.030
 cave-0.027
eddy-0.026
 mysteries-0.026
 boy-0.026
Token as
Token ID355
Feature activation-3.57e-02

Pos logit contributions
oned+0.124
hing+0.118
 grin+0.115
 AW+0.115
 ease+0.114
Neg logit contributions
 who-0.038
keeper-0.031
 to-0.029
eddy-0.026
--0.026
Token.
Token ID13
Feature activation+8.13e-04

Pos logit contributions
oned+0.077
hing+0.077
 blush+0.074
 grin+0.074
 AW+0.073
Neg logit contributions
keeper-0.021
eddy-0.017
ups-0.016
minded-0.016
 who-0.016
Token All
Token ID1439
Feature activation+1.33e-02

Pos logit contributions
oned+0.001
hing+0.001
 blush+0.001
 hardly+0.001
 grin+0.001
Neg logit contributions
keeper-0.002
ber-0.002
minded-0.002
 mysteries-0.002
ind-0.002
Token of
Token ID286
Feature activation+1.93e-02

Pos logit contributions
hing+0.028
oned+0.026
 blush+0.025
 urge+0.024
 grin+0.024
Neg logit contributions
keeper-0.022
lan-0.019
minded-0.019
ber-0.019
 mysteries-0.019
Token the
Token ID262
Feature activation+1.21e-02

Pos logit contributions
hing+0.040
oned+0.036
 blush+0.035
 ease+0.034
 hardly+0.033
Neg logit contributions
keeper-0.035
ber-0.031
minded-0.030
lan-0.029
eddy-0.029
Token rain
Token ID6290
Feature activation+4.16e-02

Pos logit contributions
hing+0.026
oned+0.025
 blush+0.024
 urge+0.023
 grin+0.023
Neg logit contributions
keeper-0.018
ber-0.016
lan-0.016
eddy-0.016
minded-0.016
Tokendrops
Token ID49253
Feature activation+8.92e-02

Pos logit contributions
oned+0.083
hing+0.079
 blush+0.076
 grin+0.076
 respond+0.074
Neg logit contributions
keeper-0.065
ups-0.061
o-0.058
 who-0.057
 mysteries-0.057
Token kissed
Token ID27946
Feature activation+1.09e-02

Pos logit contributions
oned+0.175
 grin+0.162
hing+0.160
urance+0.160
 respond+0.158
Neg logit contributions
 who-0.126
 to-0.121
keeper-0.114
ups-0.112
st-0.110
Token the
Token ID262
Feature activation+9.83e-03

Pos logit contributions
hing+0.024
oned+0.022
 blush+0.021
mine+0.020
 grin+0.020
Neg logit contributions
keeper-0.017
minded-0.015
ber-0.015
 mysteries-0.015
lan-0.015
Token oak
Token ID26210
Feature activation-2.20e-02

Pos logit contributions
hing+0.019
oned+0.018
 blush+0.018
 grin+0.017
 urge+0.016
Neg logit contributions
keeper-0.017
ber-0.015
minded-0.015
lan-0.015
 mysteries-0.015
Token so
Token ID523
Feature activation-1.57e-02

Pos logit contributions
keeper+0.032
lan+0.029
 mysteries+0.029
minded+0.029
eddy+0.029
Neg logit contributions
hing-0.048
oned-0.045
 blush-0.043
 grin-0.042
 urge-0.041
Token it
Token ID340
Feature activation+4.33e-03

Pos logit contributions
keeper+0.025
minded+0.023
 mysteries+0.022
eddy+0.022
ber+0.022
Neg logit contributions
hing-0.032
oned-0.030
 blush-0.029
 grin-0.028
 hardly-0.028
Token excited
Token ID6568
Feature activation-7.81e-03

Pos logit contributions
hing+0.057
oned+0.054
 blush+0.051
 AW+0.049
 grin+0.048
Neg logit contributions
keeper-0.032
 who-0.030
ber-0.029
o-0.029
 mysteries-0.029
Token and
Token ID290
Feature activation-2.05e-02

Pos logit contributions
keeper+0.010
ber+0.008
minded+0.008
 mysteries+0.008
lan+0.008
Neg logit contributions
hing-0.018
oned-0.018
 blush-0.017
 urge-0.017
 hardly-0.016
Token couldn
Token ID3521
Feature activation+1.23e-02

Pos logit contributions
keeper+0.033
lan+0.030
ber+0.029
 mysteries+0.029
minded+0.029
Neg logit contributions
hing-0.043
oned-0.041
 blush-0.038
 grin-0.036
 respond-0.035
Tokenâ
Token ID22940
Feature activation+1.79e-02

Pos logit contributions
hing+0.017
 blush+0.016
oned+0.015
 grin+0.015
 thrill+0.013
Neg logit contributions
keeper-0.028
eddy-0.025
o-0.024
ber-0.024
 Cream-0.024
Token
Token ID26391
Feature activation+2.19e-02

Pos logit contributions
 blush+0.026
 grin+0.025
hing+0.025
oned+0.023
 crash+0.022
Neg logit contributions
minded-0.032
ray-0.032
keeper-0.030
 Cream-0.030
ney-0.030
Token
Token ID8151
Feature activation+8.91e-02

Pos logit contributions
hing+0.046
oned+0.044
 blush+0.042
 grin+0.040
 respond+0.039
Neg logit contributions
keeper-0.034
 mysteries-0.030
lan-0.030
ber-0.030
eddy-0.030
Tokent
Token ID83
Feature activation+2.22e-02

Pos logit contributions
hing+0.152
oned+0.142
 blush+0.137
 grin+0.130
 urge+0.124
Neg logit contributions
keeper-0.177
ber-0.158
eddy-0.157
lan-0.156
minded-0.152
Token wait
Token ID4043
Feature activation+1.21e-02

Pos logit contributions
hing+0.031
 AW+0.030
oned+0.027
rell+0.026
ivery+0.025
Neg logit contributions
 who-0.047
keeper-0.045
minded-0.044
lan-0.043
ber-0.043
Token to
Token ID284
Feature activation+1.67e-03

Pos logit contributions
hing+0.034
oned+0.033
 blush+0.032
 grin+0.031
 urge+0.030
Neg logit contributions
keeper-0.010
lan-0.008
 mysteries-0.008
ber-0.007
minded-0.007
Token have
Token ID423
Feature activation-2.00e-02

Pos logit contributions
hing+0.002
oned+0.002
 blush+0.002
ivery+0.002
 AW+0.002
Neg logit contributions
keeper-0.003
 mysteries-0.003
 who-0.003
lan-0.003
minded-0.003
Token fun
Token ID1257
Feature activation+3.31e-02

Pos logit contributions
keeper+0.033
ber+0.029
eddy+0.027
minded+0.027
lan+0.027
Neg logit contributions
hing-0.041
oned-0.039
 blush-0.039
 grin-0.036
 ease-0.036
Token veil
Token ID30615
Feature activation-2.52e-02

Pos logit contributions
keeper+0.017
minded+0.015
ber+0.015
lan+0.015
eddy+0.015
Neg logit contributions
hing-0.018
oned-0.017
 blush-0.017
 grin-0.017
 urge-0.016
Token.
Token ID13
Feature activation-1.73e-04

Pos logit contributions
keeper+0.029
 mysteries+0.026
lan+0.026
minded+0.026
eddy+0.025
Neg logit contributions
hing-0.061
oned-0.059
 blush-0.057
 grin-0.055
 urge-0.053
Token When
Token ID1649
Feature activation+8.08e-04

Pos logit contributions
keeper+0.000
 mysteries+0.000
ind+0.000
lan+0.000
minded+0.000
Neg logit contributions
oned-0.000
hing-0.000
 blush-0.000
 grin-0.000
 hardly-0.000
Token the
Token ID262
Feature activation+1.02e-02

Pos logit contributions
hing+0.001
 blush+0.001
oned+0.001
 start+0.001
 hardly+0.001
Neg logit contributions
keeper-0.002
eddy-0.001
minded-0.001
ber-0.001
lan-0.001
Token loud
Token ID7812
Feature activation+4.93e-02

Pos logit contributions
hing+0.019
 blush+0.018
oned+0.018
 urge+0.017
 grin+0.017
Neg logit contributions
keeper-0.017
eddy-0.016
minded-0.016
ber-0.015
 mysteries-0.015
Token voices
Token ID10839
Feature activation+8.90e-02

Pos logit contributions
hing+0.092
oned+0.087
 blush+0.084
 grin+0.080
 urge+0.077
Neg logit contributions
keeper-0.088
minded-0.079
lan-0.079
 mysteries-0.079
ber-0.079
Token come
Token ID1282
Feature activation+2.37e-02

Pos logit contributions
oned+0.135
hing+0.126
 blush+0.125
 grin+0.113
ivery+0.113
Neg logit contributions
keeper-0.183
 mysteries-0.157
 who-0.156
minded-0.155
ber-0.154
Token,
Token ID11
Feature activation+1.18e-02

Pos logit contributions
hing+0.048
oned+0.046
 blush+0.043
 grin+0.041
 urge+0.040
Neg logit contributions
keeper-0.039
 mysteries-0.035
lan-0.035
ber-0.034
minded-0.034
Token it
Token ID340
Feature activation+8.42e-03

Pos logit contributions
hing+0.021
 blush+0.020
oned+0.020
 start+0.019
 grin+0.019
Neg logit contributions
keeper-0.020
eddy-0.020
minded-0.019
lan-0.019
ber-0.019
Token's
Token ID338
Feature activation+1.51e-02

Pos logit contributions
hing+0.017
oned+0.016
 blush+0.016
 grin+0.015
 urge+0.015
Neg logit contributions
keeper-0.013
lan-0.012
eddy-0.012
 mysteries-0.012
ber-0.012
Token like
Token ID588
Feature activation-1.25e-02

Pos logit contributions
hing+0.028
oned+0.027
 blush+0.025
 grin+0.024
 urge+0.023
Neg logit contributions
keeper-0.027
lan-0.024
 mysteries-0.024
minded-0.024
ber-0.024
Token really
Token ID1107
Feature activation-2.73e-02

Pos logit contributions
oned+0.075
hing+0.075
 grin+0.073
 blush+0.072
 respond+0.070
Neg logit contributions
keeper-0.016
eddy-0.013
ups-0.012
o-0.011
minded-0.011
Token hard
Token ID1327
Feature activation+6.40e-03

Pos logit contributions
keeper+0.043
 mysteries+0.038
lan+0.038
minded+0.037
ber+0.037
Neg logit contributions
hing-0.057
oned-0.055
 blush-0.052
 grin-0.050
 respond-0.048
Token.
Token ID13
Feature activation+8.52e-03

Pos logit contributions
hing+0.016
oned+0.016
 blush+0.015
 urge+0.014
 grin+0.014
Neg logit contributions
keeper-0.008
ber-0.006
lan-0.006
minded-0.006
 mysteries-0.006
Token The
Token ID383
Feature activation+1.38e-02

Pos logit contributions
hing+0.020
oned+0.018
 blush+0.017
 grin+0.017
 respond+0.017
Neg logit contributions
keeper-0.012
eddy-0.010
 mysteries-0.010
minded-0.010
lan-0.010
Token rain
Token ID6290
Feature activation+4.04e-02

Pos logit contributions
hing+0.024
oned+0.023
 blush+0.020
 grin+0.020
mine+0.019
Neg logit contributions
keeper-0.027
 boy-0.024
lan-0.024
 mysteries-0.024
ups-0.024
Tokendrops
Token ID49253
Feature activation+8.90e-02

Pos logit contributions
oned+0.096
hing+0.094
 respond+0.092
 grin+0.091
 blush+0.089
Neg logit contributions
keeper-0.047
ups-0.044
 boy-0.044
ney-0.041
o-0.040
Token were
Token ID547
Feature activation+1.90e-02

Pos logit contributions
oned+0.189
 grin+0.179
 respond+0.178
urance+0.177
 Deal+0.167
Neg logit contributions
 to-0.108
s-0.108
 who-0.108
 by-0.100
ups-0.100
Token big
Token ID1263
Feature activation+1.11e-03

Pos logit contributions
hing+0.048
oned+0.047
 grin+0.044
mine+0.043
 blush+0.042
Neg logit contributions
 late-0.020
keeper-0.019
 coal-0.018
 who-0.017
 to-0.017
Token and
Token ID290
Feature activation-1.76e-02

Pos logit contributions
hing+0.003
oned+0.002
 blush+0.002
 grin+0.002
 urge+0.002
Neg logit contributions
keeper-0.001
lan-0.001
ber-0.001
minded-0.001
 mysteries-0.001
Token strong
Token ID1913
Feature activation+1.92e-02

Pos logit contributions
keeper+0.028
 mysteries+0.025
lan+0.024
ber+0.024
eddy+0.024
Neg logit contributions
hing-0.037
oned-0.036
 blush-0.033
 grin-0.032
mine-0.030
Token,
Token ID11
Feature activation+9.08e-03

Pos logit contributions
hing+0.045
oned+0.044
 blush+0.042
 grin+0.040
 urge+0.039
Neg logit contributions
keeper-0.025
 mysteries-0.021
lan-0.021
minded-0.021
ber-0.021
Token She
Token ID1375
Feature activation-2.87e-02

Pos logit contributions
keeper+0.014
 mysteries+0.013
eddy+0.013
lan+0.013
ber+0.013
Neg logit contributions
oned-0.013
hing-0.013
 blush-0.013
 grin-0.012
 urge-0.012
Token told
Token ID1297
Feature activation-1.08e-02

Pos logit contributions
lan+0.049
keeper+0.049
 mysteries+0.049
minded+0.049
eddy+0.048
Neg logit contributions
hing-0.047
oned-0.046
 blush-0.046
 urge-0.044
 hardly-0.044
Token him
Token ID683
Feature activation-2.27e-02

Pos logit contributions
keeper+0.018
 mysteries+0.015
lan+0.015
ber+0.015
eddy+0.015
Neg logit contributions
hing-0.022
oned-0.021
 blush-0.020
 grin-0.019
 urge-0.018
Token,
Token ID11
Feature activation+5.44e-03

Pos logit contributions
eddy+0.030
keeper+0.030
minded+0.028
ind+0.028
lan+0.027
Neg logit contributions
hing-0.047
oned-0.047
 blush-0.047
 hardly-0.045
 grin-0.044
Token "
Token ID366
Feature activation-5.71e-03

Pos logit contributions
hing+0.011
oned+0.011
 blush+0.011
 grin+0.010
 hardly+0.010
Neg logit contributions
keeper-0.008
eddy-0.008
 mysteries-0.008
lan-0.008
minded-0.008
Tokenwe
Token ID732
Feature activation+8.87e-02

Pos logit contributions
keeper+0.010
 cave+0.009
 mysteries+0.009
lan+0.009
eddy+0.009
Neg logit contributions
hing-0.010
oned-0.010
mine-0.009
 blush-0.009
 hardly-0.009
Token don
Token ID836
Feature activation+3.93e-02

Pos logit contributions
hing+0.150
oned+0.142
 blush+0.133
 grin+0.126
 urge+0.119
Neg logit contributions
keeper-0.175
 mysteries-0.158
lan-0.158
ber-0.157
minded-0.157
Token't
Token ID470
Feature activation+2.90e-02

Pos logit contributions
 blush+0.059
 grin+0.057
hing+0.055
oned+0.055
 thrill+0.053
Neg logit contributions
keeper-0.076
eddy-0.073
 who-0.069
 Cream-0.069
ups-0.069
Token buy
Token ID2822
Feature activation-5.21e-02

Pos logit contributions
hing+0.041
oned+0.038
 blush+0.035
 grin+0.032
 AW+0.031
Neg logit contributions
keeper-0.065
minded-0.059
ber-0.058
 mysteries-0.058
eddy-0.058
Token things
Token ID1243
Feature activation+1.44e-02

Pos logit contributions
keeper+0.094
ney+0.094
enn+0.090
ber+0.090
lan+0.087
Neg logit contributions
hing-0.084
 blush-0.084
 start-0.082
 ease-0.082
oned-0.080
Token from
Token ID422
Feature activation+3.90e-04

Pos logit contributions
hing+0.032
oned+0.031
 blush+0.029
 start+0.027
're+0.027
Neg logit contributions
keeper-0.020
lan-0.018
 mysteries-0.018
minded-0.017
ber-0.017
Token
Token ID198
Feature activation-1.28e-03

Pos logit contributions
hing+0.019
oned+0.018
 blush+0.017
 grin+0.017
 urge+0.016
Neg logit contributions
keeper-0.010
 mysteries-0.009
lan-0.009
ber-0.009
minded-0.009
Token
Token ID198
Feature activation-3.55e-03

Pos logit contributions
keeper+0.001
minded+0.001
lan+0.001
 mysteries+0.001
ber+0.001
Neg logit contributions
hing-0.003
oned-0.003
 blush-0.003
 grin-0.003
 hardly-0.003
TokenTom
Token ID13787
Feature activation+1.90e-03

Pos logit contributions
keeper+0.005
ber+0.005
 mysteries+0.004
ups+0.004
lan+0.004
Neg logit contributions
hing-0.008
 blush-0.007
oned-0.007
 grin-0.007
 respond-0.007
Tokenâ
Token ID22940
Feature activation+6.58e-03

Pos logit contributions
hing+0.003
oned+0.003
're+0.003
 urge+0.003
 blush+0.003
Neg logit contributions
keeper-0.003
lan-0.003
 mysteries-0.003
eddy-0.003
minded-0.003
Token
Token ID26391
Feature activation+2.85e-02

Pos logit contributions
 blush+0.010
hing+0.010
oned+0.010
 grin+0.009
 ease+0.009
Neg logit contributions
keeper-0.012
minded-0.012
ray-0.011
eddy-0.011
o-0.011
Token
Token ID8151
Feature activation+8.87e-02

Pos logit contributions
hing+0.063
oned+0.061
 blush+0.058
 grin+0.056
 urge+0.054
Neg logit contributions
keeper-0.041
 mysteries-0.035
lan-0.035
minded-0.035
ber-0.035
Tokens
Token ID82
Feature activation+7.54e-03

Pos logit contributions
hing+0.213
 blush+0.203
oned+0.198
 grin+0.192
 AW+0.189
Neg logit contributions
keeper-0.114
o-0.104
ber-0.095
eddy-0.094
enn-0.092
Token friend
Token ID1545
Feature activation+1.22e-02

Pos logit contributions
hing+0.019
oned+0.018
 blush+0.017
 grin+0.017
 urge+0.017
Neg logit contributions
keeper-0.009
lan-0.008
minded-0.007
 mysteries-0.007
eddy-0.007
Token,
Token ID11
Feature activation+2.09e-02

Pos logit contributions
hing+0.019
oned+0.019
 blush+0.018
 grin+0.017
 urge+0.016
Neg logit contributions
keeper-0.024
 mysteries-0.023
lan-0.023
eddy-0.023
minded-0.022
Token Sarah
Token ID10490
Feature activation+1.64e-02

Pos logit contributions
hing+0.031
 blush+0.029
oned+0.029
 grin+0.027
're+0.027
Neg logit contributions
keeper-0.041
minded-0.041
ind-0.041
eddy-0.041
lan-0.040
Token,
Token ID11
Feature activation+2.65e-02

Pos logit contributions
hing+0.027
oned+0.025
 blush+0.023
 grin+0.022
mine+0.022
Neg logit contributions
keeper-0.032
 mysteries-0.030
minded-0.030
eddy-0.030
lan-0.030
Token for
Token ID329
Feature activation+2.18e-02

Pos logit contributions
eddy+0.013
keeper+0.013
 mysteries+0.013
lan+0.013
minded+0.013
Neg logit contributions
hing-0.018
oned-0.018
 blush-0.017
 contain-0.016
mine-0.016
Token the
Token ID262
Feature activation+2.53e-02

Pos logit contributions
hing+0.041
oned+0.040
 blush+0.038
 ease+0.037
 hardly+0.037
Neg logit contributions
keeper-0.040
eddy-0.036
ber-0.035
enn-0.033
minded-0.033
Token next
Token ID1306
Feature activation-1.35e-02

Pos logit contributions
hing+0.050
oned+0.048
 blush+0.048
 urge+0.046
 grin+0.046
Neg logit contributions
keeper-0.040
eddy-0.037
ber-0.036
lan-0.036
minded-0.036
Token day
Token ID1110
Feature activation-1.65e-02

Pos logit contributions
keeper+0.029
eddy+0.029
minded+0.028
ber+0.027
us+0.027
Neg logit contributions
 urge-0.017
 blush-0.016
oned-0.016
 start-0.016
 run-0.015
Token.
Token ID13
Feature activation+3.15e-03

Pos logit contributions
keeper+0.026
 mysteries+0.025
eddy+0.025
ind+0.024
ray+0.024
Neg logit contributions
oned-0.028
hing-0.028
 start-0.027
 contain-0.027
're-0.026
Token We
Token ID775
Feature activation+8.85e-02

Pos logit contributions
hing+0.005
oned+0.005
 hardly+0.005
're+0.005
 grin+0.005
Neg logit contributions
ind-0.006
keeper-0.006
ber-0.005
eddy-0.005
 mysteries-0.005
Token should
Token ID815
Feature activation+1.48e-02

Pos logit contributions
hing+0.174
oned+0.170
 blush+0.160
 grin+0.152
 AW+0.145
Neg logit contributions
keeper-0.148
 mysteries-0.130
ber-0.128
minded-0.128
lan-0.128
Token be
Token ID307
Feature activation+3.03e-02

Pos logit contributions
hing+0.026
oned+0.025
 blush+0.022
ivated+0.021
 AW+0.021
Neg logit contributions
keeper-0.027
 who-0.025
 mysteries-0.024
lan-0.024
minded-0.024
Token generous
Token ID14431
Feature activation-1.05e-02

Pos logit contributions
hing+0.058
oned+0.056
 blush+0.054
 grin+0.053
 urge+0.051
Neg logit contributions
keeper-0.051
eddy-0.047
lan-0.045
ber-0.044
minded-0.044
Token and
Token ID290
Feature activation-2.93e-02

Pos logit contributions
keeper+0.016
ind+0.016
lan+0.016
 mysteries+0.016
eddy+0.015
Neg logit contributions
hing-0.018
oned-0.017
 urge-0.016
 blush-0.016
 laugh-0.016
Token share
Token ID2648
Feature activation-1.03e-02

Pos logit contributions
keeper+0.069
eddy+0.064
ber+0.064
 mysteries+0.063
minded+0.063
Neg logit contributions
hing-0.037
oned-0.035
 blush-0.033
 grin-0.031
 urge-0.029
Token can
Token ID460
Feature activation+1.95e-02

Pos logit contributions
hing+0.053
oned+0.052
 blush+0.050
 grin+0.048
mine+0.045
Neg logit contributions
keeper-0.038
 mysteries-0.034
minded-0.033
ber-0.032
lan-0.032
Token still
Token ID991
Feature activation+8.15e-03

Pos logit contributions
hing+0.028
oned+0.027
 blush+0.025
 grin+0.023
 AW+0.021
Neg logit contributions
keeper-0.043
 mysteries-0.040
minded-0.039
lan-0.038
eddy-0.038
Token happen
Token ID1645
Feature activation+2.45e-03

Pos logit contributions
hing+0.012
oned+0.011
 blush+0.010
 grin+0.010
mine+0.009
Neg logit contributions
keeper-0.018
 mysteries-0.017
minded-0.016
lan-0.016
ber-0.016
Token.
Token ID13
Feature activation-5.08e-03

Pos logit contributions
oned+0.004
hing+0.004
 start+0.004
're+0.004
 happen+0.004
Neg logit contributions
keeper-0.004
ber-0.004
eddy-0.004
enn-0.004
lan-0.004
Token But
Token ID887
Feature activation+1.87e-02

Pos logit contributions
ind+0.010
ber+0.010
 mysteries+0.010
ishes+0.009
ups+0.009
Neg logit contributions
're-0.007
oned-0.007
 blush-0.006
hing-0.006
mine-0.006
Token we
Token ID356
Feature activation+8.85e-02

Pos logit contributions
oned+0.033
 blush+0.032
hing+0.031
 urge+0.030
 start+0.030
Neg logit contributions
keeper-0.032
eddy-0.031
lan-0.030
ups-0.030
minded-0.029
Token can
Token ID460
Feature activation+4.23e-02

Pos logit contributions
hing+0.186
oned+0.181
 blush+0.172
 grin+0.162
 AW+0.156
Neg logit contributions
keeper-0.138
 mysteries-0.120
minded-0.119
ber-0.118
lan-0.117
Token always
Token ID1464
Feature activation-3.95e-03

Pos logit contributions
hing+0.074
oned+0.073
 blush+0.069
 grin+0.064
 AW+0.064
Neg logit contributions
keeper-0.079
minded-0.070
eddy-0.069
ber-0.069
lan-0.068
Token learn
Token ID2193
Feature activation-2.67e-02

Pos logit contributions
keeper+0.009
 mysteries+0.008
minded+0.008
ber+0.008
lan+0.008
Neg logit contributions
hing-0.006
oned-0.005
 blush-0.005
 grin-0.005
 AW-0.004
Token from
Token ID422
Feature activation-1.40e-02

Pos logit contributions
keeper+0.044
lan+0.043
ber+0.043
o+0.041
ney+0.041
Neg logit contributions
hing-0.046
 blush-0.043
oned-0.042
 start-0.042
 respond-0.041
Token our
Token ID674
Feature activation+3.63e-03

Pos logit contributions
keeper+0.026
eddy+0.024
enn+0.023
ney+0.023
ray+0.022
Neg logit contributions
hing-0.023
oned-0.023
 start-0.022
 blush-0.022
 ease-0.022
Token Mark
Token ID2940
Feature activation-2.85e-03

Pos logit contributions
keeper+0.006
minded+0.006
 mysteries+0.006
eddy+0.006
lan+0.006
Neg logit contributions
hing-0.006
oned-0.005
 blush-0.005
 grin-0.005
 start-0.005
Token's
Token ID338
Feature activation+9.63e-03

Pos logit contributions
keeper+0.005
 mysteries+0.005
minded+0.004
lan+0.004
eddy+0.004
Neg logit contributions
hing-0.005
oned-0.005
 blush-0.005
 grin-0.005
mine-0.005
Token belt
Token ID10999
Feature activation-1.43e-02

Pos logit contributions
hing+0.019
 blush+0.019
oned+0.018
 grin+0.018
 urge+0.018
Neg logit contributions
keeper-0.015
minded-0.014
 mysteries-0.013
lan-0.013
eddy-0.013
Token gl
Token ID1278
Feature activation+5.55e-03

Pos logit contributions
keeper+0.019
ber+0.016
lan+0.016
 mysteries+0.016
eddy+0.016
Neg logit contributions
hing-0.034
oned-0.032
 blush-0.031
 grin-0.029
 respond-0.029
Tokenowed
Token ID6972
Feature activation+1.20e-02

Pos logit contributions
hing+0.009
oned+0.008
 blush+0.008
 grin+0.008
 urge+0.008
Neg logit contributions
keeper-0.011
eddy-0.010
ber-0.010
lan-0.010
minded-0.010
Token even
Token ID772
Feature activation-8.63e-02

Pos logit contributions
hing+0.027
oned+0.026
 blush+0.025
 grin+0.024
 urge+0.023
Neg logit contributions
keeper-0.017
lan-0.015
 mysteries-0.015
ber-0.014
minded-0.014
Token brighter
Token ID26083
Feature activation-2.54e-02

Pos logit contributions
lan+0.130
keeper+0.129
eddy+0.127
 mysteries+0.125
ber+0.123
Neg logit contributions
hing-0.167
 blush-0.159
oned-0.157
 start-0.151
 grin-0.147
Token.
Token ID13
Feature activation+6.32e-03

Pos logit contributions
keeper+0.037
ray+0.035
minded+0.033
 mysteries+0.033
enn+0.032
Neg logit contributions
hing-0.051
oned-0.049
 grin-0.048
 blush-0.048
 start-0.045
Token They
Token ID1119
Feature activation+4.09e-02

Pos logit contributions
oned+0.011
 hardly+0.010
hing+0.010
 blush+0.010
 grin+0.009
Neg logit contributions
ind-0.011
keeper-0.011
ber-0.011
 mysteries-0.011
minded-0.011
Token sat
Token ID3332
Feature activation-6.56e-03

Pos logit contributions
hing+0.070
oned+0.069
 blush+0.061
 grin+0.058
 AW+0.058
Neg logit contributions
keeper-0.080
ber-0.072
lan-0.072
 mysteries-0.070
minded-0.070
Token in
Token ID287
Feature activation+1.73e-02

Pos logit contributions
keeper+0.012
 mysteries+0.012
lan+0.012
ray+0.011
minded+0.011
Neg logit contributions
hing-0.011
oned-0.011
 grin-0.010
 blush-0.010
mine-0.009
Token came
Token ID1625
Feature activation+6.03e-03

Pos logit contributions
keeper+0.042
lan+0.038
eddy+0.038
 mysteries+0.037
minded+0.037
Neg logit contributions
hing-0.047
oned-0.045
 blush-0.043
 grin-0.041
 urge-0.039
Token back
Token ID736
Feature activation-4.52e-02

Pos logit contributions
hing+0.010
oned+0.010
 blush+0.009
 grin+0.008
 respond+0.008
Neg logit contributions
keeper-0.012
 mysteries-0.011
lan-0.011
minded-0.011
ber-0.011
Token home
Token ID1363
Feature activation-3.51e-02

Pos logit contributions
keeper+0.061
lan+0.057
 mysteries+0.055
minded+0.055
ber+0.052
Neg logit contributions
oned-0.097
hing-0.094
 blush-0.091
 grin-0.088
 laugh-0.087
Token empty
Token ID6565
Feature activation-1.91e-02

Pos logit contributions
keeper+0.045
 mysteries+0.044
minded+0.043
lan+0.042
eddy+0.041
Neg logit contributions
oned-0.077
hing-0.076
 blush-0.072
 grin-0.071
 urge-0.069
Token-
Token ID12
Feature activation-3.21e-02

Pos logit contributions
keeper+0.027
minded+0.025
ber+0.025
lan+0.024
eddy+0.024
Neg logit contributions
hing-0.039
oned-0.039
 blush-0.038
 grin-0.036
 urge-0.035
Tokenhanded
Token ID13638
Feature activation-8.45e-02

Pos logit contributions
 mysteries+0.048
 cave+0.046
keeper+0.045
 cel+0.045
lan+0.044
Neg logit contributions
hing-0.062
oned-0.061
mine-0.056
 blush-0.055
ivated-0.050
Token and
Token ID290
Feature activation-3.38e-02

Pos logit contributions
 mysteries+0.126
lan+0.121
minded+0.115
eddy+0.111
ind+0.111
Neg logit contributions
oned-0.158
hing-0.156
 blush-0.149
 grin-0.148
mine-0.146
Token told
Token ID1297
Feature activation-2.48e-03

Pos logit contributions
keeper+0.060
eddy+0.056
lan+0.056
minded+0.055
ber+0.055
Neg logit contributions
hing-0.059
 blush-0.058
oned-0.057
 grin-0.054
 urge-0.052
Token Lily
Token ID20037
Feature activation-1.19e-02

Pos logit contributions
keeper+0.005
ber+0.005
eddy+0.005
lan+0.005
 mysteries+0.005
Neg logit contributions
hing-0.003
oned-0.003
 blush-0.003
 grin-0.003
 ease-0.003
Token the
Token ID262
Feature activation-1.47e-02

Pos logit contributions
keeper+0.017
lan+0.016
eddy+0.016
minded+0.016
 mysteries+0.016
Neg logit contributions
hing-0.025
oned-0.024
 blush-0.024
 grin-0.022
 realize-0.022
Token news
Token ID1705
Feature activation-2.28e-03

Pos logit contributions
keeper+0.023
minded+0.023
eddy+0.022
ber+0.021
lan+0.021
Neg logit contributions
hing-0.028
 blush-0.027
oned-0.026
 grin-0.026
 urge-0.025
Token was
Token ID373
Feature activation+2.10e-02

Pos logit contributions
hing+0.012
oned+0.012
 blush+0.011
 grin+0.011
 urge+0.010
Neg logit contributions
keeper-0.011
 mysteries-0.010
ber-0.010
lan-0.010
minded-0.010
Token astonished
Token ID40962
Feature activation-1.30e-02

Pos logit contributions
hing+0.044
oned+0.042
 blush+0.040
 AW+0.037
 grin+0.037
Neg logit contributions
keeper-0.034
ber-0.029
lan-0.028
 mysteries-0.028
 who-0.028
Token and
Token ID290
Feature activation-3.06e-02

Pos logit contributions
keeper+0.020
ber+0.019
 mysteries+0.019
ray+0.019
lan+0.019
Neg logit contributions
hing-0.025
oned-0.023
 grin-0.022
 urge-0.022
ivated-0.022
Token cl
Token ID537
Feature activation+2.43e-02

Pos logit contributions
keeper+0.054
eddy+0.050
lan+0.049
 mysteries+0.049
ber+0.048
Neg logit contributions
hing-0.057
oned-0.054
 blush-0.052
 grin-0.049
 urge-0.047
Tokenapped
Token ID6320
Feature activation+2.04e-02

Pos logit contributions
hing+0.044
oned+0.042
 blush+0.040
 urge+0.037
 grin+0.037
Neg logit contributions
keeper-0.045
ber-0.040
minded-0.040
eddy-0.040
lan-0.040
Token even
Token ID772
Feature activation-8.34e-02

Pos logit contributions
hing+0.048
oned+0.047
 blush+0.044
 grin+0.042
 urge+0.041
Neg logit contributions
keeper-0.027
lan-0.023
 mysteries-0.022
minded-0.022
ber-0.022
Token louder
Token ID27089
Feature activation+9.30e-05

Pos logit contributions
keeper+0.116
eddy+0.108
lan+0.105
ber+0.104
 mysteries+0.101
Neg logit contributions
hing-0.188
oned-0.177
 blush-0.172
 grin-0.166
 urge-0.161
Token.
Token ID13
Feature activation-3.97e-03

Pos logit contributions
hing+0.000
oned+0.000
 blush+0.000
 grin+0.000
 urge+0.000
Neg logit contributions
keeper-0.000
 mysteries-0.000
lan-0.000
ber-0.000
minded-0.000
Token Now
Token ID2735
Feature activation+2.14e-02

Pos logit contributions
keeper+0.007
 mysteries+0.007
eddy+0.007
ber+0.007
ind+0.007
Neg logit contributions
oned-0.007
hing-0.007
 blush-0.006
 hardly-0.006
 grin-0.006
Token it
Token ID340
Feature activation+1.01e-02

Pos logit contributions
hing+0.048
oned+0.046
 blush+0.044
 grin+0.042
 urge+0.041
Neg logit contributions
keeper-0.030
lan-0.026
 mysteries-0.026
eddy-0.026
ber-0.026
Token was
Token ID373
Feature activation+6.11e-03

Pos logit contributions
hing+0.023
oned+0.023
 blush+0.021
 grin+0.020
 urge+0.020
Neg logit contributions
keeper-0.014
 mysteries-0.012
ber-0.012
lan-0.012
minded-0.012
Token the
Token ID262
Feature activation+2.10e-03

Pos logit contributions
hing+0.034
oned+0.033
 blush+0.031
 grin+0.030
 start+0.030
Neg logit contributions
keeper-0.026
eddy-0.024
 mysteries-0.024
lan-0.023
ber-0.023
Token garden
Token ID11376
Feature activation-6.51e-03

Pos logit contributions
hing+0.004
 blush+0.004
oned+0.004
 grin+0.004
 urge+0.004
Neg logit contributions
keeper-0.003
minded-0.003
eddy-0.003
lan-0.003
ber-0.003
Token looks
Token ID3073
Feature activation+3.66e-03

Pos logit contributions
keeper+0.009
eddy+0.009
lan+0.009
minded+0.008
ber+0.008
Neg logit contributions
hing-0.014
oned-0.013
 blush-0.012
 urge-0.012
're-0.012
Token much
Token ID881
Feature activation-6.87e-02

Pos logit contributions
hing+0.006
oned+0.006
 blush+0.006
 grin+0.006
 urge+0.006
Neg logit contributions
keeper-0.007
ber-0.006
eddy-0.006
lan-0.006
minded-0.006
Token brighter
Token ID26083
Feature activation-1.97e-02

Pos logit contributions
ber+0.113
keeper+0.111
eddy+0.108
lan+0.107
enn+0.105
Neg logit contributions
hing-0.127
 blush-0.122
oned-0.117
 urge-0.116
 start-0.115
Token even
Token ID772
Feature activation-8.31e-02

Pos logit contributions
keeper+0.033
ray+0.029
eddy+0.028
enn+0.028
ber+0.028
Neg logit contributions
hing-0.036
oned-0.034
 blush-0.032
 start-0.032
 grin-0.032
Token on
Token ID319
Feature activation+1.59e-02

Pos logit contributions
eddy+0.148
keeper+0.145
enn+0.142
ber+0.136
ipt+0.134
Neg logit contributions
hing-0.143
 blush-0.131
 start-0.127
 urge-0.123
oned-0.123
Token cloudy
Token ID40026
Feature activation+1.91e-02

Pos logit contributions
hing+0.032
oned+0.030
 ease+0.028
 start+0.028
 blush+0.028
Neg logit contributions
keeper-0.027
ber-0.025
minded-0.023
eddy-0.023
ray-0.022
Token days
Token ID1528
Feature activation+1.24e-02

Pos logit contributions
hing+0.043
oned+0.040
 blush+0.038
 urge+0.036
 grin+0.035
Neg logit contributions
keeper-0.029
ber-0.025
minded-0.024
lan-0.023
eddy-0.023
Token".
Token ID1911
Feature activation+4.56e-03

Pos logit contributions
hing+0.029
oned+0.028
 blush+0.025
 grin+0.024
mine+0.024
Neg logit contributions
keeper-0.017
 mysteries-0.015
ber-0.014
minded-0.014
lan-0.014
Token<|endoftext|>
Token ID50256
Feature activation-3.72e-02

Pos logit contributions
oned+0.008
 grin+0.008
hing+0.007
 hardly+0.007
're+0.007
Neg logit contributions
 mysteries-0.008
ind-0.008
lan-0.008
minded-0.008
ber-0.008
Token listened
Token ID16399
Feature activation-1.62e-03

Pos logit contributions
keeper+0.015
minded+0.014
 mysteries+0.014
eddy+0.014
lan+0.014
Neg logit contributions
hing-0.016
oned-0.014
 blush-0.014
 grin-0.014
 urge-0.013
Token and
Token ID290
Feature activation-4.03e-02

Pos logit contributions
keeper+0.002
 mysteries+0.002
ber+0.002
eddy+0.002
lan+0.002
Neg logit contributions
hing-0.004
oned-0.004
 blush-0.003
 grin-0.003
 urge-0.003
Token it
Token ID340
Feature activation-3.90e-03

Pos logit contributions
eddy+0.077
keeper+0.073
lan+0.069
 mysteries+0.069
ber+0.069
Neg logit contributions
hing-0.070
oned-0.065
 grin-0.061
 blush-0.061
 start-0.060
Token gl
Token ID1278
Feature activation+2.13e-03

Pos logit contributions
keeper+0.007
lan+0.006
 mysteries+0.006
eddy+0.006
ber+0.006
Neg logit contributions
hing-0.007
oned-0.007
 blush-0.006
 grin-0.006
 urge-0.006
Tokenowed
Token ID6972
Feature activation+5.76e-03

Pos logit contributions
hing+0.003
oned+0.003
 blush+0.003
 grin+0.003
 urge+0.003
Neg logit contributions
keeper-0.005
ber-0.004
lan-0.004
minded-0.004
eddy-0.004
Token even
Token ID772
Feature activation-8.26e-02

Pos logit contributions
hing+0.013
oned+0.012
 blush+0.011
 grin+0.011
 urge+0.011
Neg logit contributions
keeper-0.009
 mysteries-0.008
ber-0.008
lan-0.008
minded-0.008
Token brighter
Token ID26083
Feature activation-2.57e-02

Pos logit contributions
eddy+0.142
lan+0.137
keeper+0.133
enn+0.131
ber+0.131
Neg logit contributions
hing-0.153
 blush-0.136
oned-0.136
 start-0.134
 urge-0.131
Token.
Token ID13
Feature activation-9.11e-03

Pos logit contributions
keeper+0.038
ray+0.035
eddy+0.035
ber+0.033
 mysteries+0.033
Neg logit contributions
hing-0.052
oned-0.050
 grin-0.049
 blush-0.047
 urge-0.046
Token Bobby
Token ID17155
Feature activation-2.62e-03

Pos logit contributions
ind+0.018
 mysteries+0.017
ber+0.017
ups+0.017
eddy+0.017
Neg logit contributions
oned-0.015
hing-0.014
 grin-0.013
 ease-0.013
 hardly-0.013
Token smiled
Token ID13541
Feature activation+1.23e-02

Pos logit contributions
keeper+0.004
 mysteries+0.004
lan+0.004
eddy+0.004
minded+0.004
Neg logit contributions
hing-0.005
oned-0.005
 blush-0.004
 grin-0.004
mine-0.004
Token and
Token ID290
Feature activation-2.67e-02

Pos logit contributions
hing+0.025
oned+0.024
 grin+0.023
 blush+0.022
 urge+0.021
Neg logit contributions
keeper-0.019
 mysteries-0.018
lan-0.017
ber-0.017
minded-0.017
Token.
Token ID13
Feature activation-2.59e-03

Pos logit contributions
hing+0.008
oned+0.008
 blush+0.007
 grin+0.007
 urge+0.007
Neg logit contributions
keeper-0.005
eddy-0.004
lan-0.004
ber-0.004
minded-0.004
Token
Token ID198
Feature activation-5.39e-03

Pos logit contributions
ind+0.006
 mysteries+0.005
ishes+0.005
ber+0.005
ups+0.005
Neg logit contributions
oned-0.003
 grin-0.003
 ease-0.003
 hardly-0.003
hing-0.003
Token
Token ID198
Feature activation-4.90e-03

Pos logit contributions
keeper+0.009
 mysteries+0.008
minded+0.008
lan+0.008
ishes+0.008
Neg logit contributions
hing-0.010
oned-0.010
 hardly-0.009
 grin-0.009
 ease-0.009
TokenHe
Token ID1544
Feature activation-2.73e-02

Pos logit contributions
keeper+0.008
 mysteries+0.008
eddy+0.008
lan+0.008
minded+0.008
Neg logit contributions
hing-0.010
oned-0.009
 hardly-0.008
 blush-0.008
 grin-0.008
Token whispered
Token ID25029
Feature activation-3.56e-03

Pos logit contributions
keeper+0.054
eddy+0.052
minded+0.051
lan+0.050
 mysteries+0.050
Neg logit contributions
hing-0.043
oned-0.040
 blush-0.039
 grin-0.037
 hardly-0.036
Token even
Token ID772
Feature activation-8.20e-02

Pos logit contributions
keeper+0.004
 mysteries+0.004
eddy+0.004
minded+0.004
lan+0.004
Neg logit contributions
hing-0.008
oned-0.008
 blush-0.007
 grin-0.007
 respond-0.007
Token more
Token ID517
Feature activation-2.04e-02

Pos logit contributions
eddy+0.145
keeper+0.139
lan+0.137
 Cream+0.136
enn+0.135
Neg logit contributions
hing-0.149
oned-0.132
 respond-0.131
 start-0.130
 urge-0.128
Token to
Token ID284
Feature activation+4.97e-03

Pos logit contributions
keeper+0.037
eddy+0.035
minded+0.033
lan+0.033
ber+0.032
Neg logit contributions
hing-0.038
oned-0.034
 blush-0.034
 grin-0.033
 respond-0.033
Token the
Token ID262
Feature activation+3.03e-03

Pos logit contributions
hing+0.008
oned+0.008
 blush+0.007
 grin+0.007
 urge+0.007
Neg logit contributions
keeper-0.010
lan-0.009
 mysteries-0.009
ber-0.009
minded-0.009
Token spark
Token ID9009
Feature activation+9.24e-03

Pos logit contributions
hing+0.005
 blush+0.005
 urge+0.005
 grin+0.005
 thrill+0.005
Neg logit contributions
keeper-0.005
eddy-0.005
o-0.005
minded-0.005
ups-0.005
Token and
Token ID290
Feature activation-2.61e-02

Pos logit contributions
hing+0.018
oned+0.016
 blush+0.015
 grin+0.015
 urge+0.015
Neg logit contributions
keeper-0.016
eddy-0.015
lan-0.015
 mysteries-0.015
minded-0.014
Tokenase
Token ID589
Feature activation-2.68e-02

Pos logit contributions
hing+0.004
oned+0.004
 blush+0.004
mine+0.003
 grin+0.003
Neg logit contributions
keeper-0.004
 mysteries-0.004
minded-0.003
lan-0.003
ber-0.003
Token and
Token ID290
Feature activation-1.59e-02

Pos logit contributions
keeper+0.031
minded+0.027
 mysteries+0.027
lan+0.027
eddy+0.026
Neg logit contributions
hing-0.066
oned-0.064
 blush-0.061
 grin-0.059
 urge-0.057
Token it
Token ID340
Feature activation-2.88e-03

Pos logit contributions
keeper+0.028
eddy+0.025
ber+0.025
minded+0.025
lan+0.024
Neg logit contributions
hing-0.030
 blush-0.028
oned-0.028
 grin-0.026
 start-0.025
Token turned
Token ID2900
Feature activation-6.88e-05

Pos logit contributions
keeper+0.004
 mysteries+0.004
lan+0.003
minded+0.003
eddy+0.003
Neg logit contributions
oned-0.006
hing-0.006
 blush-0.006
 grin-0.006
 respond-0.006
Token out
Token ID503
Feature activation-2.78e-02

Pos logit contributions
keeper+0.000
ber+0.000
minded+0.000
 mysteries+0.000
lan+0.000
Neg logit contributions
hing-0.000
oned-0.000
 blush-0.000
 grin-0.000
 start-0.000
Token even
Token ID772
Feature activation-8.08e-02

Pos logit contributions
keeper+0.042
ber+0.042
enn+0.039
eddy+0.038
o+0.038
Neg logit contributions
hing-0.055
 blush-0.053
oned-0.053
 start-0.052
 grin-0.050
Token better
Token ID1365
Feature activation-4.63e-02

Pos logit contributions
keeper+0.136
ber+0.134
eddy+0.133
enn+0.131
lan+0.130
Neg logit contributions
hing-0.145
 blush-0.142
 start-0.134
oned-0.129
 begin-0.126
Token than
Token ID621
Feature activation-9.58e-03

Pos logit contributions
keeper+0.071
eddy+0.061
ray+0.060
enn+0.060
minded+0.059
Neg logit contributions
hing-0.091
 blush-0.089
oned-0.086
 grin-0.086
 start-0.085
Token the
Token ID262
Feature activation+6.79e-03

Pos logit contributions
keeper+0.017
eddy+0.015
ber+0.015
minded+0.015
ney+0.015
Neg logit contributions
hing-0.017
 blush-0.017
oned-0.017
 ease-0.016
 start-0.015
Token old
Token ID1468
Feature activation-1.58e-02

Pos logit contributions
hing+0.013
 blush+0.012
oned+0.012
 urge+0.011
 grin+0.011
Neg logit contributions
keeper-0.011
minded-0.011
ber-0.010
eddy-0.010
lan-0.010
Token one
Token ID530
Feature activation-2.92e-02

Pos logit contributions
minded+0.026
keeper+0.026
 mysteries+0.024
lan+0.024
ber+0.024
Neg logit contributions
 blush-0.028
hing-0.028
oned-0.027
 urge-0.026
 grin-0.026
Token hall
Token ID6899
Feature activation-1.67e-03

Pos logit contributions
hing+0.012
oned+0.011
 blush+0.010
 grin+0.010
mine+0.010
Neg logit contributions
keeper-0.009
 mysteries-0.008
minded-0.008
ber-0.008
lan-0.008
Token together
Token ID1978
Feature activation+6.98e-03

Pos logit contributions
keeper+0.002
 mysteries+0.002
lan+0.002
eddy+0.002
minded+0.002
Neg logit contributions
hing-0.004
oned-0.004
 blush-0.004
 grin-0.004
 urge-0.003
Token and
Token ID290
Feature activation-7.96e-03

Pos logit contributions
hing+0.019
oned+0.018
 blush+0.017
 grin+0.016
 AW+0.016
Neg logit contributions
keeper-0.007
 mysteries-0.006
ber-0.006
lan-0.006
minded-0.006
Token it
Token ID340
Feature activation-4.01e-03

Pos logit contributions
keeper+0.014
 mysteries+0.013
lan+0.013
minded+0.013
eddy+0.012
Neg logit contributions
hing-0.015
oned-0.014
 blush-0.014
 grin-0.013
 urge-0.012
Token looked
Token ID3114
Feature activation-3.94e-03

Pos logit contributions
keeper+0.004
 mysteries+0.004
ups+0.003
ber+0.003
minded+0.003
Neg logit contributions
hing-0.011
oned-0.010
 blush-0.010
 grin-0.010
 respond-0.009
Token much
Token ID881
Feature activation-8.07e-02

Pos logit contributions
keeper+0.007
 mysteries+0.006
lan+0.006
ber+0.006
minded+0.006
Neg logit contributions
hing-0.008
oned-0.008
 blush-0.007
 grin-0.007
 urge-0.007
Token better
Token ID1365
Feature activation-5.60e-02

Pos logit contributions
keeper+0.126
ber+0.122
lan+0.120
 mysteries+0.114
eddy+0.112
Neg logit contributions
hing-0.163
 blush-0.153
oned-0.151
 grin-0.146
 urge-0.142
Token.
Token ID13
Feature activation+2.15e-03

Pos logit contributions
keeper+0.091
eddy+0.080
 mysteries+0.078
ray+0.077
minded+0.075
Neg logit contributions
oned-0.099
 blush-0.097
hing-0.096
 urge-0.095
 start-0.095
Token They
Token ID1119
Feature activation+3.63e-02

Pos logit contributions
oned+0.004
 blush+0.004
hing+0.004
 grin+0.004
 hardly+0.004
Neg logit contributions
keeper-0.003
ber-0.003
 mysteries-0.003
minded-0.003
lan-0.003
Token both
Token ID1111
Feature activation+1.29e-02

Pos logit contributions
oned+0.071
hing+0.071
 AW+0.063
 blush+0.061
 respond+0.061
Neg logit contributions
keeper-0.062
 who-0.056
ber-0.055
lan-0.054
 mysteries-0.054
Token felt
Token ID2936
Feature activation+1.66e-02

Pos logit contributions
oned+0.028
hing+0.028
 AW+0.024
 respond+0.024
ivated+0.024
Neg logit contributions
keeper-0.019
 who-0.018
ber-0.017
lan-0.017
o-0.016
Token boy
Token ID2933
Feature activation+5.86e-03

Pos logit contributions
keeper+0.014
eddy+0.014
enn+0.013
minded+0.013
o+0.013
Neg logit contributions
 blush-0.020
hing-0.020
oned-0.019
 urge-0.019
 grin-0.019
Token's
Token ID338
Feature activation+2.24e-03

Pos logit contributions
hing+0.010
oned+0.010
 blush+0.009
 grin+0.009
 hardly+0.008
Neg logit contributions
keeper-0.011
 mysteries-0.010
eddy-0.010
lan-0.010
minded-0.010
Token dreams
Token ID10625
Feature activation+2.89e-02

Pos logit contributions
hing+0.004
oned+0.004
 blush+0.004
 grin+0.004
 urge+0.004
Neg logit contributions
keeper-0.004
minded-0.004
lan-0.003
eddy-0.003
 mysteries-0.003
Token all
Token ID477
Feature activation-5.56e-03

Pos logit contributions
oned+0.053
hing+0.052
 blush+0.051
 AW+0.047
 grin+0.047
Neg logit contributions
keeper-0.051
ber-0.042
 mysteries-0.042
 who-0.042
ups-0.042
Token seemed
Token ID3947
Feature activation-1.60e-02

Pos logit contributions
keeper+0.010
 mysteries+0.009
minded+0.009
lan+0.008
ber+0.008
Neg logit contributions
oned-0.010
hing-0.010
 blush-0.010
 grin-0.009
 AW-0.009
Token possible
Token ID1744
Feature activation-8.02e-02

Pos logit contributions
keeper+0.018
eddy+0.017
ber+0.016
lan+0.016
minded+0.016
Neg logit contributions
hing-0.038
oned-0.037
 blush-0.036
 grin-0.035
 respond-0.034
Token.
Token ID13
Feature activation+8.57e-04

Pos logit contributions
eddy+0.125
keeper+0.124
minded+0.113
ind+0.112
lan+0.111
Neg logit contributions
oned-0.145
hing-0.142
 start-0.139
're-0.135
 hardly-0.135
Token Every
Token ID3887
Feature activation-2.91e-02

Pos logit contributions
oned+0.001
hing+0.001
 blush+0.001
 grin+0.001
 ease+0.001
Neg logit contributions
ind-0.002
 mysteries-0.002
ber-0.002
lan-0.001
keeper-0.001
Token day
Token ID1110
Feature activation+1.26e-02

Pos logit contributions
keeper+0.057
eddy+0.054
minded+0.054
lan+0.053
ber+0.052
Neg logit contributions
hing-0.047
oned-0.045
 grin-0.042
 blush-0.042
 urge-0.042
Token he
Token ID339
Feature activation-1.80e-02

Pos logit contributions
hing+0.024
oned+0.023
 blush+0.022
 grin+0.021
 urge+0.021
Neg logit contributions
keeper-0.021
eddy-0.020
ber-0.019
 mysteries-0.019
minded-0.019
Token started
Token ID2067
Feature activation-3.14e-03

Pos logit contributions
keeper+0.033
eddy+0.031
lan+0.030
 mysteries+0.030
minded+0.030
Neg logit contributions
hing-0.033
oned-0.030
 blush-0.029
 grin-0.027
 urge-0.027
Token barking
Token ID48961
Feature activation+1.53e-02

Pos logit contributions
hing+0.018
oned+0.017
 blush+0.016
 grin+0.016
 urge+0.015
Neg logit contributions
keeper-0.015
 mysteries-0.014
ber-0.014
lan-0.013
eddy-0.013
Token.
Token ID13
Feature activation-1.55e-03

Pos logit contributions
hing+0.039
oned+0.038
 blush+0.036
 grin+0.035
 urge+0.034
Neg logit contributions
keeper-0.017
 mysteries-0.014
lan-0.014
minded-0.014
ber-0.014
Token His
Token ID2399
Feature activation-1.88e-02

Pos logit contributions
eddy+0.003
keeper+0.003
 mysteries+0.003
ind+0.003
ber+0.003
Neg logit contributions
oned-0.003
 blush-0.003
 hardly-0.003
 grin-0.002
hing-0.002
Token owner
Token ID4870
Feature activation-1.81e-02

Pos logit contributions
keeper+0.026
minded+0.025
lan+0.024
eddy+0.024
 mysteries+0.023
Neg logit contributions
hing-0.040
 blush-0.039
 grin-0.038
oned-0.038
 urge-0.037
Token spoke
Token ID5158
Feature activation-2.25e-02

Pos logit contributions
keeper+0.033
lan+0.031
 mysteries+0.030
eddy+0.030
minded+0.030
Neg logit contributions
hing-0.033
oned-0.032
 blush-0.030
 grin-0.028
 urge-0.027
Token even
Token ID772
Feature activation-7.96e-02

Pos logit contributions
keeper+0.027
lan+0.027
 mysteries+0.026
eddy+0.026
ber+0.025
Neg logit contributions
hing-0.051
oned-0.051
 blush-0.048
 grin-0.047
 respond-0.046
Token more
Token ID517
Feature activation-2.08e-02

Pos logit contributions
lan+0.140
eddy+0.138
keeper+0.136
 Cream+0.129
ber+0.126
Neg logit contributions
hing-0.146
oned-0.135
 blush-0.129
 realize-0.127
 respond-0.126
Token softly
Token ID26625
Feature activation-2.80e-02

Pos logit contributions
keeper+0.033
lan+0.029
eddy+0.029
minded+0.028
ber+0.027
Neg logit contributions
hing-0.044
oned-0.042
 blush-0.040
 grin-0.039
 urge-0.038
Token this
Token ID428
Feature activation-1.20e-02

Pos logit contributions
keeper+0.039
 mysteries+0.036
eddy+0.035
lan+0.035
minded+0.034
Neg logit contributions
hing-0.062
oned-0.060
 blush-0.057
 grin-0.055
 urge-0.053
Token time
Token ID640
Feature activation-5.22e-03

Pos logit contributions
keeper+0.026
lan+0.024
minded+0.024
eddy+0.024
ber+0.024
Neg logit contributions
hing-0.018
oned-0.017
 blush-0.016
 grin-0.015
 urge-0.015
Token,
Token ID11
Feature activation-1.80e-03

Pos logit contributions
keeper+0.008
eddy+0.007
 mysteries+0.007
lan+0.007
minded+0.006
Neg logit contributions
hing-0.011
oned-0.011
 blush-0.011
 grin-0.010
 urge-0.010
Token looked
Token ID3114
Feature activation-3.22e-03

Pos logit contributions
keeper+0.000
lan+0.000
 mysteries+0.000
minded+0.000
ber+0.000
Neg logit contributions
hing-0.000
oned-0.000
 blush-0.000
 grin-0.000
 urge-0.000
Token y
Token ID331
Feature activation+1.76e-04

Pos logit contributions
keeper+0.005
lan+0.005
ber+0.005
 mysteries+0.005
eddy+0.005
Neg logit contributions
hing-0.006
oned-0.006
 blush-0.006
 grin-0.005
 urge-0.005
Tokenummy
Token ID13513
Feature activation-2.58e-02

Pos logit contributions
hing+0.000
oned+0.000
 blush+0.000
 grin+0.000
 urge+0.000
Neg logit contributions
keeper-0.000
 mysteries-0.000
ber-0.000
lan-0.000
minded-0.000
Token and
Token ID290
Feature activation-3.95e-02

Pos logit contributions
keeper+0.042
minded+0.040
lan+0.040
 mysteries+0.039
eddy+0.039
Neg logit contributions
 blush-0.046
hing-0.046
 grin-0.045
oned-0.045
 urge-0.044
Token tasted
Token ID29187
Feature activation-6.35e-03

Pos logit contributions
keeper+0.069
eddy+0.065
minded+0.064
ber+0.064
lan+0.064
Neg logit contributions
hing-0.071
 blush-0.071
oned-0.068
 grin-0.065
 urge-0.062
Token even
Token ID772
Feature activation-7.96e-02

Pos logit contributions
keeper+0.010
ber+0.009
lan+0.009
 mysteries+0.009
minded+0.009
Neg logit contributions
hing-0.013
oned-0.013
 blush-0.012
 grin-0.012
 urge-0.011
Token better
Token ID1365
Feature activation-4.33e-02

Pos logit contributions
lan+0.129
eddy+0.128
ber+0.127
keeper+0.126
enn+0.124
Neg logit contributions
hing-0.145
 blush-0.144
oned-0.135
 urge-0.133
 itch-0.130
Token.
Token ID13
Feature activation-3.01e-04

Pos logit contributions
keeper+0.068
eddy+0.063
 mysteries+0.059
ray+0.059
enn+0.059
Neg logit contributions
 blush-0.079
oned-0.077
 urge-0.075
 grin-0.075
hing-0.075
Token The
Token ID383
Feature activation-6.92e-03

Pos logit contributions
keeper+0.001
ber+0.001
lan+0.001
 mysteries+0.001
ind+0.001
Neg logit contributions
 blush-0.000
oned-0.000
hing-0.000
 grin-0.000
 urge-0.000
Token girl
Token ID2576
Feature activation+1.63e-02

Pos logit contributions
keeper+0.008
eddy+0.007
lan+0.007
ber+0.007
 mysteries+0.007
Neg logit contributions
 blush-0.016
hing-0.016
oned-0.016
 urge-0.015
 grin-0.015
Token was
Token ID373
Feature activation+1.41e-02

Pos logit contributions
hing+0.029
oned+0.028
 blush+0.028
 grin+0.027
 urge+0.026
Neg logit contributions
keeper-0.028
 mysteries-0.027
eddy-0.027
minded-0.026
lan-0.026
Token and
Token ID290
Feature activation-2.69e-02

Pos logit contributions
keeper+0.004
ber+0.003
ray+0.003
minded+0.003
enn+0.003
Neg logit contributions
hing-0.003
 blush-0.003
 grin-0.003
 urge-0.003
 laugh-0.002
Token it
Token ID340
Feature activation-3.59e-03

Pos logit contributions
keeper+0.047
eddy+0.043
minded+0.043
ber+0.042
lan+0.042
Neg logit contributions
hing-0.050
 blush-0.046
oned-0.046
 grin-0.044
 urge-0.043
Token gl
Token ID1278
Feature activation+5.39e-03

Pos logit contributions
keeper+0.006
 mysteries+0.005
lan+0.005
ber+0.005
minded+0.005
Neg logit contributions
hing-0.007
oned-0.007
 blush-0.007
 grin-0.006
 urge-0.006
Tokenowed
Token ID6972
Feature activation+1.25e-02

Pos logit contributions
hing+0.009
 blush+0.009
oned+0.008
 grin+0.008
 urge+0.008
Neg logit contributions
keeper-0.011
ber-0.010
eddy-0.010
lan-0.010
minded-0.009
Token an
Token ID281
Feature activation+1.00e-02

Pos logit contributions
hing+0.027
oned+0.026
 blush+0.024
 grin+0.023
 urge+0.022
Neg logit contributions
keeper-0.019
ber-0.017
 mysteries-0.017
minded-0.017
lan-0.017
Token even
Token ID772
Feature activation-7.95e-02

Pos logit contributions
hing+0.024
oned+0.023
 blush+0.023
 grin+0.021
 AW+0.021
Neg logit contributions
keeper-0.013
eddy-0.011
lan-0.011
minded-0.011
 mysteries-0.011
Token brighter
Token ID26083
Feature activation-2.01e-02

Pos logit contributions
eddy+0.139
enn+0.135
keeper+0.135
lan+0.132
ber+0.129
Neg logit contributions
hing-0.135
 start-0.120
 urge-0.118
 blush-0.116
 contain-0.115
Token purple
Token ID14032
Feature activation+9.88e-03

Pos logit contributions
keeper+0.032
minded+0.030
ray+0.029
eddy+0.028
ber+0.028
Neg logit contributions
hing-0.039
 urge-0.037
 grin-0.037
oned-0.036
 thrill-0.036
Token than
Token ID621
Feature activation-2.17e-02

Pos logit contributions
hing+0.021
oned+0.019
 blush+0.019
 urge+0.019
 grin+0.018
Neg logit contributions
keeper-0.015
minded-0.014
ber-0.013
 mysteries-0.013
eddy-0.013
Token before
Token ID878
Feature activation+6.04e-03

Pos logit contributions
keeper+0.040
eddy+0.038
minded+0.036
ber+0.035
enn+0.035
Neg logit contributions
hing-0.038
oned-0.036
 blush-0.036
 ease-0.033
 hardly-0.033
Token.
Token ID13
Feature activation-1.29e-03

Pos logit contributions
oned+0.013
hing+0.013
 blush+0.012
 hardly+0.012
 start+0.012
Neg logit contributions
keeper-0.009
eddy-0.008
ber-0.008
minded-0.008
lan-0.007
Token butterfly
Token ID35113
Feature activation+1.53e-02

Pos logit contributions
hing+0.044
oned+0.042
 blush+0.040
 grin+0.038
 urge+0.037
Neg logit contributions
keeper-0.041
ber-0.037
eddy-0.037
 mysteries-0.037
lan-0.037
Token and
Token ID290
Feature activation-1.08e-03

Pos logit contributions
hing+0.037
oned+0.036
 blush+0.034
 grin+0.033
 urge+0.032
Neg logit contributions
keeper-0.019
lan-0.016
 mysteries-0.016
ber-0.016
minded-0.016
Token the
Token ID262
Feature activation+1.21e-02

Pos logit contributions
keeper+0.002
eddy+0.002
ber+0.002
 mysteries+0.002
lan+0.002
Neg logit contributions
hing-0.002
oned-0.002
 blush-0.002
 grin-0.002
 urge-0.002
Token painting
Token ID12036
Feature activation+6.88e-03

Pos logit contributions
hing+0.024
oned+0.023
 blush+0.022
 grin+0.021
 urge+0.020
Neg logit contributions
keeper-0.021
 mysteries-0.018
lan-0.018
ber-0.018
minded-0.018
Token looked
Token ID3114
Feature activation-1.99e-04

Pos logit contributions
hing+0.015
oned+0.014
 blush+0.013
 grin+0.013
 urge+0.012
Neg logit contributions
keeper-0.011
lan-0.009
 mysteries-0.009
ber-0.009
minded-0.009
Token even
Token ID772
Feature activation-7.90e-02

Pos logit contributions
keeper+0.000
ber+0.000
lan+0.000
 mysteries+0.000
minded+0.000
Neg logit contributions
hing-0.000
oned-0.000
 blush-0.000
 grin-0.000
 urge-0.000
Token brighter
Token ID26083
Feature activation-2.86e-02

Pos logit contributions
keeper+0.120
ber+0.116
eddy+0.113
lan+0.111
enn+0.106
Neg logit contributions
hing-0.165
 blush-0.157
oned-0.152
 grin-0.146
 start-0.142
Token.
Token ID13
Feature activation+2.32e-03

Pos logit contributions
keeper+0.046
ray+0.043
ber+0.041
enn+0.041
eddy+0.039
Neg logit contributions
hing-0.053
 grin-0.051
 blush-0.050
oned-0.050
 start-0.048
Token Pete
Token ID17542
Feature activation+1.21e-02

Pos logit contributions
oned+0.004
hing+0.004
 grin+0.004
 ease+0.004
 hardly+0.004
Neg logit contributions
keeper-0.004
ber-0.004
 mysteries-0.004
ind-0.004
eddy-0.004
Token was
Token ID373
Feature activation+2.60e-02

Pos logit contributions
hing+0.025
oned+0.024
 blush+0.023
 grin+0.022
 urge+0.021
Neg logit contributions
keeper-0.019
 mysteries-0.017
lan-0.017
ber-0.017
minded-0.017
Token so
Token ID523
Feature activation-3.33e-03

Pos logit contributions
hing+0.064
oned+0.061
 blush+0.058
 AW+0.057
mine+0.054
Neg logit contributions
keeper-0.033
 who-0.028
ber-0.027
lan-0.026
 mysteries-0.025
Token,
Token ID11
Feature activation+1.13e-02

Pos logit contributions
keeper+0.012
lan+0.011
ber+0.011
eddy+0.010
minded+0.010
Neg logit contributions
hing-0.013
oned-0.013
 blush-0.012
 grin-0.012
 start-0.011
Token Lily
Token ID20037
Feature activation+6.97e-03

Pos logit contributions
hing+0.024
oned+0.024
 blush+0.024
 grin+0.022
 start+0.021
Neg logit contributions
keeper-0.017
minded-0.015
ber-0.015
 mysteries-0.015
lan-0.015
Token's
Token ID338
Feature activation+1.18e-02

Pos logit contributions
hing+0.015
oned+0.014
 blush+0.014
 grin+0.013
 urge+0.013
Neg logit contributions
keeper-0.011
 mysteries-0.009
lan-0.009
minded-0.009
ber-0.009
Token room
Token ID2119
Feature activation+5.28e-03

Pos logit contributions
hing+0.031
oned+0.030
 blush+0.029
 grin+0.028
mine+0.027
Neg logit contributions
keeper-0.013
 mysteries-0.010
eddy-0.010
ber-0.010
lan-0.010
Token looked
Token ID3114
Feature activation+2.42e-03

Pos logit contributions
hing+0.014
oned+0.013
 blush+0.012
 grin+0.012
 respond+0.012
Neg logit contributions
keeper-0.006
 mysteries-0.005
ber-0.004
eddy-0.004
lan-0.004
Token much
Token ID881
Feature activation-7.90e-02

Pos logit contributions
hing+0.005
oned+0.005
 blush+0.005
 grin+0.004
 urge+0.004
Neg logit contributions
keeper-0.004
 mysteries-0.003
lan-0.003
minded-0.003
ber-0.003
Token better
Token ID1365
Feature activation-5.34e-02

Pos logit contributions
keeper+0.117
ber+0.114
lan+0.113
 mysteries+0.106
minded+0.105
Neg logit contributions
hing-0.163
 blush-0.156
oned-0.152
 grin-0.147
 urge-0.144
Token.
Token ID13
Feature activation+7.22e-03

Pos logit contributions
keeper+0.090
 mysteries+0.081
minded+0.080
eddy+0.077
lan+0.077
Neg logit contributions
 blush-0.087
 start-0.085
 urge-0.084
hing-0.084
oned-0.083
Token Her
Token ID2332
Feature activation-6.25e-03

Pos logit contributions
oned+0.011
hing+0.010
 blush+0.010
 grin+0.009
 urge+0.009
Neg logit contributions
keeper-0.015
ber-0.014
 mysteries-0.014
lan-0.014
minded-0.014
Token mom
Token ID1995
Feature activation-5.14e-03

Pos logit contributions
keeper+0.006
eddy+0.005
 mysteries+0.005
ber+0.005
lan+0.004
Neg logit contributions
hing-0.018
oned-0.017
 blush-0.016
mine-0.015
 AW-0.015
Token said
Token ID531
Feature activation-1.04e-02

Pos logit contributions
keeper+0.008
 mysteries+0.007
lan+0.007
minded+0.007
ber+0.007
Neg logit contributions
hing-0.011
oned-0.010
 blush-0.010
 grin-0.009
 urge-0.009
Token the
Token ID262
Feature activation+1.18e-02

Pos logit contributions
keeper+0.062
ney+0.058
ber+0.056
enn+0.055
lan+0.054
Neg logit contributions
hing-0.046
 start-0.045
 blush-0.044
 ease-0.042
oned-0.040
Token calendar
Token ID11845
Feature activation-2.35e-02

Pos logit contributions
hing+0.020
oned+0.019
 blush+0.017
 grin+0.016
mine+0.015
Neg logit contributions
keeper-0.024
 mysteries-0.022
lan-0.022
minded-0.021
ber-0.021
Token.
Token ID13
Feature activation-8.83e-03

Pos logit contributions
keeper+0.031
lan+0.030
eddy+0.029
 mysteries+0.029
minded+0.029
Neg logit contributions
hing-0.050
oned-0.048
 blush-0.045
 start-0.045
mine-0.044
Token It
Token ID632
Feature activation-2.13e-02

Pos logit contributions
keeper+0.015
 mysteries+0.014
ber+0.014
ups+0.014
ind+0.014
Neg logit contributions
hing-0.017
oned-0.016
 blush-0.015
're-0.014
 grin-0.014
Token looks
Token ID3073
Feature activation-1.15e-02

Pos logit contributions
keeper+0.031
lan+0.028
minded+0.028
ber+0.027
eddy+0.027
Neg logit contributions
hing-0.047
oned-0.043
 blush-0.043
 start-0.040
're-0.040
Token even
Token ID772
Feature activation-7.88e-02

Pos logit contributions
keeper+0.022
ber+0.020
lan+0.019
eddy+0.019
minded+0.019
Neg logit contributions
hing-0.020
oned-0.019
 blush-0.019
 grin-0.017
 urge-0.017
Token better
Token ID1365
Feature activation-5.64e-02

Pos logit contributions
keeper+0.134
ber+0.126
eddy+0.124
lan+0.123
enn+0.122
Neg logit contributions
hing-0.145
 blush-0.142
oned-0.131
 start-0.130
 begin-0.123
Token than
Token ID621
Feature activation-1.82e-02

Pos logit contributions
keeper+0.106
eddy+0.091
ipt+0.091
enn+0.090
ray+0.087
Neg logit contributions
 start-0.080
oned-0.076
 hardly-0.075
 urge-0.074
 happen-0.073
Token before
Token ID878
Feature activation+1.55e-03

Pos logit contributions
keeper+0.031
eddy+0.028
ber+0.027
minded+0.027
enn+0.027
Neg logit contributions
hing-0.034
oned-0.033
 blush-0.033
 ease-0.031
 start-0.031
Token."
Token ID526
Feature activation-5.55e-03

Pos logit contributions
oned+0.003
hing+0.003
 start+0.003
 urge+0.003
 blush+0.003
Neg logit contributions
keeper-0.002
minded-0.002
ber-0.002
eddy-0.002
ray-0.002
Token
Token ID198
Feature activation-1.24e-02

Pos logit contributions
ind+0.011
 mysteries+0.010
minded+0.010
elia+0.010
keeper+0.009
Neg logit contributions
 grin-0.009
 blush-0.009
oned-0.008
 hardly-0.008
 realize-0.008
Token gray
Token ID12768
Feature activation+1.85e-02

Pos logit contributions
hing+0.033
 blush+0.033
 urge+0.032
oned+0.031
 grin+0.030
Neg logit contributions
keeper-0.031
ber-0.030
minded-0.029
 mysteries-0.029
eddy-0.028
Token soup
Token ID17141
Feature activation-6.53e-03

Pos logit contributions
hing+0.038
 blush+0.036
oned+0.036
 grin+0.034
 urge+0.034
Neg logit contributions
keeper-0.029
minded-0.027
ber-0.027
 mysteries-0.026
lan-0.025
Token and
Token ID290
Feature activation-9.81e-03

Pos logit contributions
keeper+0.009
minded+0.008
 mysteries+0.008
ber+0.008
lan+0.008
Neg logit contributions
hing-0.014
oned-0.014
 blush-0.013
 grin-0.013
 urge-0.012
Token it
Token ID340
Feature activation-5.70e-03

Pos logit contributions
keeper+0.017
minded+0.016
eddy+0.016
ber+0.015
enn+0.015
Neg logit contributions
hing-0.018
 blush-0.017
oned-0.017
 start-0.016
 hardly-0.016
Token tasted
Token ID29187
Feature activation-1.86e-03

Pos logit contributions
keeper+0.009
minded+0.008
 mysteries+0.008
lan+0.008
ber+0.008
Neg logit contributions
hing-0.012
oned-0.011
 blush-0.010
 grin-0.010
 urge-0.010
Token even
Token ID772
Feature activation-7.86e-02

Pos logit contributions
keeper+0.003
ber+0.003
minded+0.003
lan+0.003
 mysteries+0.003
Neg logit contributions
hing-0.004
oned-0.003
 blush-0.003
 grin-0.003
 urge-0.003
Token better
Token ID1365
Feature activation-4.53e-02

Pos logit contributions
ber+0.138
keeper+0.136
enn+0.133
eddy+0.131
lan+0.129
Neg logit contributions
hing-0.137
 blush-0.134
 start-0.122
 urge-0.121
oned-0.121
Token than
Token ID621
Feature activation-1.34e-02

Pos logit contributions
keeper+0.069
eddy+0.060
minded+0.057
ray+0.057
enn+0.056
Neg logit contributions
hing-0.088
 blush-0.088
oned-0.087
 grin-0.086
 start-0.085
Token it
Token ID340
Feature activation-1.60e-02

Pos logit contributions
keeper+0.025
eddy+0.023
ber+0.023
enn+0.022
ney+0.022
Neg logit contributions
hing-0.023
 ease-0.023
 blush-0.022
oned-0.022
 start-0.021
Token sm
Token ID895
Feature activation+1.11e-02

Pos logit contributions
keeper+0.024
minded+0.021
eddy+0.021
 mysteries+0.021
lan+0.021
Neg logit contributions
hing-0.035
oned-0.033
 blush-0.032
 grin-0.030
 hardly-0.029
Tokenelt
Token ID2120
Feature activation-3.37e-02

Pos logit contributions
hing+0.027
oned+0.026
 blush+0.025
mine+0.024
hered+0.023
Neg logit contributions
keeper-0.014
 mysteries-0.012
ber-0.011
 cave-0.011
 whose-0.011
Token that
Token ID326
Feature activation-3.66e-02

Pos logit contributions
hing+0.007
oned+0.007
 blush+0.007
 grin+0.007
 urge+0.006
Neg logit contributions
keeper-0.003
lan-0.002
 mysteries-0.002
ber-0.002
minded-0.002
Token the
Token ID262
Feature activation-7.42e-03

Pos logit contributions
keeper+0.065
eddy+0.059
ray+0.058
enn+0.058
ber+0.057
Neg logit contributions
hing-0.066
 blush-0.062
oned-0.061
 start-0.060
 laugh-0.059
Token book
Token ID1492
Feature activation-1.35e-02

Pos logit contributions
keeper+0.010
eddy+0.010
lan+0.010
minded+0.009
enn+0.009
Neg logit contributions
 blush-0.016
hing-0.015
 urge-0.015
oned-0.015
 grin-0.015
Tokencase
Token ID7442
Feature activation-3.64e-02

Pos logit contributions
keeper+0.019
lan+0.018
minded+0.017
eddy+0.017
 mysteries+0.017
Neg logit contributions
hing-0.029
oned-0.028
 blush-0.027
 grin-0.026
 urge-0.025
Token looked
Token ID3114
Feature activation-3.75e-03

Pos logit contributions
keeper+0.052
lan+0.048
minded+0.047
eddy+0.046
 mysteries+0.046
Neg logit contributions
hing-0.079
oned-0.074
 blush-0.072
 grin-0.069
 urge-0.068
Token even
Token ID772
Feature activation-7.85e-02

Pos logit contributions
keeper+0.007
ber+0.006
lan+0.006
eddy+0.006
minded+0.006
Neg logit contributions
hing-0.007
oned-0.006
 blush-0.006
 grin-0.006
 urge-0.006
Token better
Token ID1365
Feature activation-5.07e-02

Pos logit contributions
lan+0.130
keeper+0.130
ber+0.129
eddy+0.127
enn+0.125
Neg logit contributions
hing-0.143
 blush-0.138
oned-0.129
 start-0.128
 urge-0.124
Token with
Token ID351
Feature activation-7.44e-03

Pos logit contributions
keeper+0.082
eddy+0.075
enn+0.069
ind+0.069
ipt+0.069
Neg logit contributions
hing-0.086
oned-0.084
 hardly-0.084
 start-0.084
 blush-0.083
Token the
Token ID262
Feature activation-5.37e-03

Pos logit contributions
keeper+0.014
ber+0.013
eddy+0.012
enn+0.012
minded+0.012
Neg logit contributions
hing-0.013
 ease-0.013
 blush-0.012
oned-0.012
 start-0.012
Token yellow
Token ID7872
Feature activation-3.38e-03

Pos logit contributions
keeper+0.009
eddy+0.008
lan+0.008
minded+0.008
ber+0.008
Neg logit contributions
 blush-0.010
hing-0.010
 urge-0.010
oned-0.009
 grin-0.009
Token paint
Token ID7521
Feature activation-4.92e-03

Pos logit contributions
keeper+0.005
minded+0.004
lan+0.004
ber+0.004
 mysteries+0.004
Neg logit contributions
hing-0.008
oned-0.007
 blush-0.007
 grin-0.007
 urge-0.007
Token!
Token ID0
Feature activation+3.53e-03

Pos logit contributions
ind+0.010
 mysteries+0.010
keeper+0.010
minded+0.010
eddy+0.010
Neg logit contributions
oned-0.011
 blush-0.010
hing-0.010
 grin-0.010
 hardly-0.010
Token She
Token ID1375
Feature activation-1.93e-02

Pos logit contributions
oned+0.005
 blush+0.005
 grin+0.005
hing+0.005
 hardly+0.004
Neg logit contributions
 mysteries-0.007
ber-0.007
ind-0.007
minded-0.007
lan-0.007
Token thought
Token ID1807
Feature activation-3.54e-03

Pos logit contributions
keeper+0.035
lan+0.031
 mysteries+0.031
ber+0.031
eddy+0.030
Neg logit contributions
hing-0.036
oned-0.035
 blush-0.032
 grin-0.030
 respond-0.029
Token it
Token ID340
Feature activation-1.47e-02

Pos logit contributions
keeper+0.005
 mysteries+0.004
lan+0.004
ber+0.004
minded+0.004
Neg logit contributions
hing-0.008
oned-0.008
 blush-0.008
 grin-0.007
 urge-0.007
Token looked
Token ID3114
Feature activation-5.60e-03

Pos logit contributions
keeper+0.018
 mysteries+0.015
minded+0.015
lan+0.015
ber+0.015
Neg logit contributions
hing-0.036
oned-0.035
 blush-0.033
 grin-0.032
 urge-0.031
Token even
Token ID772
Feature activation-7.85e-02

Pos logit contributions
keeper+0.009
 mysteries+0.008
minded+0.008
lan+0.008
ber+0.008
Neg logit contributions
hing-0.011
oned-0.011
 blush-0.010
 grin-0.010
mine-0.009
Token better
Token ID1365
Feature activation-4.66e-02

Pos logit contributions
keeper+0.121
ber+0.114
lan+0.113
eddy+0.111
enn+0.103
Neg logit contributions
hing-0.165
 blush-0.155
oned-0.153
 grin-0.143
 urge-0.143
Token than
Token ID621
Feature activation-1.61e-02

Pos logit contributions
keeper+0.077
eddy+0.072
lan+0.067
 mysteries+0.064
ray+0.064
Neg logit contributions
oned-0.084
hing-0.084
 grin-0.081
 blush-0.081
 hardly-0.081
Token before
Token ID878
Feature activation+1.20e-02

Pos logit contributions
keeper+0.029
eddy+0.026
ber+0.026
minded+0.025
ney+0.025
Neg logit contributions
hing-0.029
 blush-0.029
oned-0.029
 ease-0.028
 hardly-0.027
Token.
Token ID13
Feature activation+2.85e-03

Pos logit contributions
hing+0.026
oned+0.026
 blush+0.024
 grin+0.024
 hardly+0.023
Neg logit contributions
keeper-0.018
ber-0.016
minded-0.015
eddy-0.015
lan-0.015
Token The
Token ID383
Feature activation+6.55e-03

Pos logit contributions
oned+0.004
 blush+0.004
 grin+0.004
 hardly+0.004
 ease+0.004
Neg logit contributions
ind-0.006
 mysteries-0.006
ber-0.006
keeper-0.006
lan-0.006
Token broken
Token ID5445
Feature activation-1.46e-04

Pos logit contributions
hing+0.045
oned+0.043
 blush+0.040
 grin+0.038
mine+0.036
Neg logit contributions
keeper-0.042
lan-0.037
 mysteries-0.037
ber-0.037
minded-0.036
Token tray
Token ID26473
Feature activation-7.61e-03

Pos logit contributions
keeper+0.000
 mysteries+0.000
lan+0.000
ber+0.000
minded+0.000
Neg logit contributions
hing-0.000
oned-0.000
 blush-0.000
 grin-0.000
 urge-0.000
Token and
Token ID290
Feature activation-1.22e-02

Pos logit contributions
keeper+0.010
 mysteries+0.009
lan+0.009
minded+0.009
ber+0.009
Neg logit contributions
hing-0.017
oned-0.017
 blush-0.016
 grin-0.015
mine-0.015
Token it
Token ID340
Feature activation-3.59e-03

Pos logit contributions
keeper+0.021
eddy+0.020
lan+0.020
minded+0.020
 mysteries+0.019
Neg logit contributions
hing-0.021
 blush-0.021
oned-0.020
 start-0.019
 grin-0.019
Token looked
Token ID3114
Feature activation-2.01e-03

Pos logit contributions
keeper+0.005
 mysteries+0.005
ber+0.005
lan+0.005
minded+0.005
Neg logit contributions
hing-0.008
oned-0.008
 blush-0.007
 grin-0.007
 respond-0.007
Token even
Token ID772
Feature activation-7.84e-02

Pos logit contributions
keeper+0.004
ber+0.003
lan+0.003
eddy+0.003
minded+0.003
Neg logit contributions
hing-0.004
oned-0.004
 blush-0.003
 grin-0.003
 urge-0.003
Token more
Token ID517
Feature activation-3.40e-02

Pos logit contributions
keeper+0.117
ber+0.115
lan+0.114
eddy+0.113
enn+0.106
Neg logit contributions
hing-0.158
 blush-0.153
oned-0.150
 start-0.141
 urge-0.138
Token beautiful
Token ID4950
Feature activation-1.11e-02

Pos logit contributions
keeper+0.059
ber+0.054
minded+0.053
 mysteries+0.053
lan+0.053
Neg logit contributions
hing-0.061
oned-0.058
 blush-0.058
 grin-0.054
 urge-0.054
Token than
Token ID621
Feature activation-1.06e-02

Pos logit contributions
keeper+0.017
enn+0.016
ray+0.015
o+0.015
ber+0.015
Neg logit contributions
hing-0.020
 blush-0.020
oned-0.020
 grin-0.018
 urge-0.018
Token before
Token ID878
Feature activation+1.02e-02

Pos logit contributions
keeper+0.018
eddy+0.017
minded+0.016
ber+0.016
enn+0.016
Neg logit contributions
hing-0.019
 blush-0.019
oned-0.019
 ease-0.018
mine-0.017
Token.
Token ID13
Feature activation+4.28e-03

Pos logit contributions
oned+0.023
hing+0.022
 blush+0.022
 grin+0.021
 hardly+0.020
Neg logit contributions
keeper-0.014
minded-0.012
ber-0.012
lan-0.012
eddy-0.012
Token -
Token ID532
Feature activation-1.42e-03

Pos logit contributions
keeper+0.038
eddy+0.033
lan+0.032
 mysteries+0.032
ber+0.032
Neg logit contributions
hing-0.069
oned-0.068
 blush-0.064
 grin-0.062
 urge-0.060
Token the
Token ID262
Feature activation+6.98e-03

Pos logit contributions
keeper+0.002
minded+0.002
lan+0.002
ber+0.002
eddy+0.002
Neg logit contributions
hing-0.003
oned-0.003
 blush-0.003
 grin-0.003
 urge-0.003
Token bow
Token ID9563
Feature activation-5.39e-03

Pos logit contributions
hing+0.012
oned+0.011
 blush+0.010
ivated+0.010
mine+0.009
Neg logit contributions
keeper-0.014
 mysteries-0.013
eddy-0.012
 boy-0.012
 who-0.012
Token gl
Token ID1278
Feature activation+8.07e-03

Pos logit contributions
keeper+0.009
 mysteries+0.007
 who+0.007
ber+0.007
eddy+0.006
Neg logit contributions
hing-0.011
oned-0.011
 blush-0.010
ivery-0.010
ivated-0.010
Tokenowed
Token ID6972
Feature activation+1.41e-02

Pos logit contributions
hing+0.013
oned+0.012
 blush+0.012
 urge+0.011
 grin+0.011
Neg logit contributions
keeper-0.016
eddy-0.015
ber-0.015
lan-0.014
minded-0.014
Token even
Token ID772
Feature activation-7.82e-02

Pos logit contributions
hing+0.032
oned+0.031
 blush+0.029
 grin+0.028
 urge+0.027
Neg logit contributions
keeper-0.020
lan-0.017
 mysteries-0.017
eddy-0.017
minded-0.017
Token brighter
Token ID26083
Feature activation-2.41e-02

Pos logit contributions
keeper+0.118
eddy+0.116
lan+0.115
ber+0.113
enn+0.106
Neg logit contributions
hing-0.159
 blush-0.149
oned-0.149
 start-0.139
 grin-0.138
Token.
Token ID13
Feature activation+3.16e-03

Pos logit contributions
keeper+0.035
ray+0.031
eddy+0.031
minded+0.031
ber+0.030
Neg logit contributions
hing-0.050
oned-0.048
 blush-0.047
 grin-0.047
 start-0.045
Token
Token ID220
Feature activation+6.02e-03

Pos logit contributions
oned+0.007
hing+0.007
 blush+0.006
 grin+0.006
 hardly+0.006
Neg logit contributions
keeper-0.005
minded-0.004
ber-0.004
lan-0.004
 mysteries-0.004
Token
Token ID198
Feature activation+6.01e-03

Pos logit contributions
oned+0.015
hing+0.015
 blush+0.014
 grin+0.013
 ease+0.013
Neg logit contributions
keeper-0.007
 mysteries-0.006
lan-0.006
minded-0.006
ber-0.006
Token
Token ID198
Feature activation+4.83e-03

Pos logit contributions
hing+0.018
oned+0.017
 blush+0.016
 grin+0.016
 urge+0.016
Neg logit contributions
keeper-0.004
eddy-0.003
lan-0.003
 mysteries-0.003
ber-0.003
Token and
Token ID290
Feature activation-2.08e-03

Pos logit contributions
hing+0.046
oned+0.045
 blush+0.043
 grin+0.041
 respond+0.040
Neg logit contributions
keeper-0.024
 mysteries-0.020
lan-0.020
minded-0.020
ber-0.020
Token laughing
Token ID14376
Feature activation+3.75e-02

Pos logit contributions
keeper+0.004
lan+0.004
 mysteries+0.004
minded+0.003
ber+0.003
Neg logit contributions
hing-0.004
oned-0.004
 blush-0.003
 grin-0.003
 AW-0.003
Token together
Token ID1978
Feature activation+1.75e-02

Pos logit contributions
hing+0.091
oned+0.087
 blush+0.083
 grin+0.081
 urge+0.079
Neg logit contributions
keeper-0.047
ber-0.040
eddy-0.039
 mysteries-0.039
lan-0.039
Token.
Token ID13
Feature activation+1.35e-02

Pos logit contributions
hing+0.044
oned+0.043
 blush+0.040
 grin+0.039
 urge+0.038
Neg logit contributions
keeper-0.021
eddy-0.018
 mysteries-0.017
minded-0.017
ber-0.017
Token As
Token ID1081
Feature activation-1.35e-02

Pos logit contributions
hing+0.027
oned+0.027
 blush+0.025
 grin+0.024
 urge+0.024
Neg logit contributions
keeper-0.021
eddy-0.019
ber-0.019
minded-0.019
 mysteries-0.019
Token they
Token ID484
Feature activation+6.58e-02

Pos logit contributions
keeper+0.027
eddy+0.026
ray+0.024
minded+0.024
ber+0.023
Neg logit contributions
 hardly-0.022
hing-0.021
 start-0.020
oned-0.020
 barely-0.020
Token went
Token ID1816
Feature activation+1.37e-02

Pos logit contributions
oned+0.127
hing+0.125
 blush+0.119
 grin+0.115
 AW+0.113
Neg logit contributions
keeper-0.098
 mysteries-0.096
o-0.095
 who-0.094
 cave-0.092
Token,
Token ID11
Feature activation+2.29e-02

Pos logit contributions
hing+0.029
oned+0.028
 blush+0.027
 grin+0.025
mine+0.025
Neg logit contributions
keeper-0.021
 mysteries-0.019
lan-0.018
minded-0.018
ber-0.018
Token Bob
Token ID5811
Feature activation-1.35e-02

Pos logit contributions
hing+0.046
oned+0.045
 hardly+0.042
 blush+0.041
 urge+0.041
Neg logit contributions
keeper-0.037
eddy-0.034
minded-0.033
 mysteries-0.032
ber-0.032
Token heard
Token ID2982
Feature activation+2.29e-02

Pos logit contributions
keeper+0.026
eddy+0.025
 mysteries+0.024
lan+0.023
minded+0.023
Neg logit contributions
hing-0.022
oned-0.021
 hardly-0.019
're-0.019
 urge-0.019
Token a
Token ID257
Feature activation+1.18e-02

Pos logit contributions
hing+0.044
oned+0.041
 blush+0.040
 respond+0.038
 start+0.038
Neg logit contributions
keeper-0.039
ber-0.036
eddy-0.036
minded-0.034
enn-0.034
Token gathering
Token ID11228
Feature activation-2.17e-02

Pos logit contributions
keeper+0.003
 mysteries+0.002
lan+0.002
minded+0.002
ber+0.002
Neg logit contributions
hing-0.009
oned-0.008
 blush-0.008
 grin-0.008
mine-0.007
Token flowers
Token ID12734
Feature activation+3.45e-02

Pos logit contributions
keeper+0.035
ber+0.033
eddy+0.032
lan+0.031
enn+0.030
Neg logit contributions
hing-0.044
 blush-0.042
oned-0.041
 grin-0.038
 ease-0.038
Token,
Token ID11
Feature activation+1.54e-02

Pos logit contributions
hing+0.077
oned+0.074
 blush+0.071
 grin+0.067
mine+0.066
Neg logit contributions
keeper-0.048
ber-0.042
eddy-0.042
lan-0.042
minded-0.042
Token and
Token ID290
Feature activation+3.11e-04

Pos logit contributions
hing+0.029
oned+0.028
 blush+0.026
 grin+0.025
 urge+0.024
Neg logit contributions
keeper-0.027
lan-0.024
 mysteries-0.024
ber-0.024
minded-0.024
Token when
Token ID618
Feature activation+1.08e-02

Pos logit contributions
hing+0.001
oned+0.001
 blush+0.001
 grin+0.000
 urge+0.000
Neg logit contributions
keeper-0.001
eddy-0.001
ber-0.001
minded-0.001
lan-0.001
Token they
Token ID484
Feature activation+5.40e-02

Pos logit contributions
 blush+0.021
hing+0.021
oned+0.021
 start+0.020
 hardly+0.020
Neg logit contributions
keeper-0.017
eddy-0.015
minded-0.015
ray-0.015
ber-0.014
Token had
Token ID550
Feature activation-5.51e-03

Pos logit contributions
hing+0.092
oned+0.091
 AW+0.080
 blush+0.079
 grin+0.077
Neg logit contributions
keeper-0.104
 mysteries-0.094
minded-0.092
ber-0.092
lan-0.091
Token finished
Token ID5201
Feature activation-1.41e-02

Pos logit contributions
keeper+0.010
ber+0.009
lan+0.009
 mysteries+0.008
minded+0.008
Neg logit contributions
hing-0.011
oned-0.010
 blush-0.010
 grin-0.009
 urge-0.009
Token,
Token ID11
Feature activation+1.84e-02

Pos logit contributions
keeper+0.025
ber+0.022
eddy+0.022
lan+0.022
minded+0.022
Neg logit contributions
hing-0.027
oned-0.026
 blush-0.025
 grin-0.023
 urge-0.022
Token they
Token ID484
Feature activation+3.19e-02

Pos logit contributions
hing+0.037
 blush+0.036
oned+0.036
 grin+0.033
 start+0.033
Neg logit contributions
keeper-0.028
eddy-0.026
ber-0.025
ray-0.025
minded-0.025
Token proudly
Token ID23017
Feature activation-2.62e-02

Pos logit contributions
hing+0.059
oned+0.058
 blush+0.052
 grin+0.050
 AW+0.050
Neg logit contributions
keeper-0.058
ber-0.051
lan-0.051
 mysteries-0.051
minded-0.051
Token to
Token ID284
Feature activation+1.13e-02

Pos logit contributions
hing+0.026
oned+0.025
 blush+0.023
 grin+0.022
 urge+0.022
Neg logit contributions
keeper-0.018
 mysteries-0.016
lan-0.016
minded-0.016
eddy-0.016
Token the
Token ID262
Feature activation+1.59e-02

Pos logit contributions
hing+0.017
oned+0.016
 blush+0.015
 grin+0.014
 urge+0.013
Neg logit contributions
keeper-0.024
lan-0.022
ber-0.022
eddy-0.022
 mysteries-0.022
Token family
Token ID1641
Feature activation+3.68e-03

Pos logit contributions
hing+0.034
oned+0.033
 blush+0.032
 grin+0.030
 urge+0.030
Neg logit contributions
keeper-0.024
lan-0.021
 mysteries-0.021
ber-0.021
eddy-0.020
Token.
Token ID13
Feature activation+1.57e-04

Pos logit contributions
hing+0.008
oned+0.008
 blush+0.008
 grin+0.007
 urge+0.007
Neg logit contributions
keeper-0.005
 mysteries-0.005
lan-0.005
eddy-0.005
minded-0.004
Token The
Token ID383
Feature activation+1.21e-03

Pos logit contributions
hing+0.000
oned+0.000
 blush+0.000
 grin+0.000
rell+0.000
Neg logit contributions
 mysteries-0.000
ind-0.000
keeper-0.000
ber-0.000
lan-0.000
Token kids
Token ID3988
Feature activation+6.32e-02

Pos logit contributions
hing+0.003
oned+0.003
 blush+0.003
 grin+0.002
 AW+0.002
Neg logit contributions
keeper-0.002
 mysteries-0.001
minded-0.001
ber-0.001
lan-0.001
Token learned
Token ID4499
Feature activation-1.94e-02

Pos logit contributions
hing+0.113
oned+0.110
 blush+0.102
 grin+0.096
 AW+0.094
Neg logit contributions
keeper-0.119
ber-0.105
minded-0.103
lan-0.103
 mysteries-0.103
Token lots
Token ID6041
Feature activation+4.98e-03

Pos logit contributions
keeper+0.031
eddy+0.030
ber+0.029
ney+0.028
lan+0.028
Neg logit contributions
hing-0.037
oned-0.036
 blush-0.034
 hardly-0.034
 realize-0.033
Token of
Token ID286
Feature activation+9.69e-04

Pos logit contributions
hing+0.008
oned+0.007
 blush+0.006
 grin+0.006
 respond+0.006
Neg logit contributions
keeper-0.010
lan-0.010
ber-0.009
minded-0.009
eddy-0.009
Token new
Token ID649
Feature activation-8.86e-03

Pos logit contributions
hing+0.002
oned+0.002
 blush+0.002
 grin+0.002
 urge+0.002
Neg logit contributions
keeper-0.001
lan-0.001
eddy-0.001
ber-0.001
minded-0.001
Token things
Token ID1243
Feature activation+1.76e-02

Pos logit contributions
keeper+0.011
lan+0.009
eddy+0.009
ber+0.009
minded+0.008
Neg logit contributions
hing-0.022
oned-0.021
 blush-0.020
 grin-0.020
 urge-0.019
Token have
Token ID423
Feature activation-2.09e-02

Pos logit contributions
hing+0.059
oned+0.056
 blush+0.052
 grin+0.050
 respond+0.046
Neg logit contributions
keeper-0.071
 mysteries-0.064
ber-0.063
lan-0.063
minded-0.063
Token to
Token ID284
Feature activation+1.29e-02

Pos logit contributions
keeper+0.030
eddy+0.029
minded+0.025
ber+0.025
oc+0.025
Neg logit contributions
 start-0.043
 blush-0.041
 hardly-0.041
're-0.040
oned-0.039
Token listen
Token ID6004
Feature activation+4.67e-04

Pos logit contributions
hing+0.023
oned+0.022
 blush+0.019
 AW+0.019
rell+0.018
Neg logit contributions
keeper-0.023
 mysteries-0.022
 who-0.021
lan-0.021
minded-0.021
Token first
Token ID717
Feature activation-1.07e-02

Pos logit contributions
oned+0.001
hing+0.001
 start+0.001
 blush+0.001
 hardly+0.001
Neg logit contributions
keeper-0.001
 mysteries-0.001
eddy-0.000
lan-0.000
minded-0.000
Token.
Token ID13
Feature activation-1.28e-02

Pos logit contributions
keeper+0.015
 mysteries+0.014
lan+0.013
eddy+0.013
minded+0.013
Neg logit contributions
hing-0.024
oned-0.023
 blush-0.021
 grin-0.020
 start-0.020
Token We
Token ID775
Feature activation+6.20e-02

Pos logit contributions
keeper+0.025
 mysteries+0.023
by+0.023
 cave+0.022
lan+0.021
Neg logit contributions
oned-0.022
hing-0.020
 blush-0.020
urance-0.018
're-0.018
Token have
Token ID423
Feature activation-1.63e-02

Pos logit contributions
oned+0.099
hing+0.099
 blush+0.091
 grin+0.087
ivated+0.081
Neg logit contributions
keeper-0.120
 who-0.110
 mysteries-0.110
ber-0.108
ray-0.108
Token a
Token ID257
Feature activation-9.59e-03

Pos logit contributions
eddy+0.025
keeper+0.024
oc+0.021
ind+0.021
ber+0.020
Neg logit contributions
 start-0.033
're-0.030
 hardly-0.030
 blush-0.030
oned-0.029
Token song
Token ID3496
Feature activation+2.87e-02

Pos logit contributions
keeper+0.019
minded+0.018
lan+0.017
eddy+0.017
ber+0.017
Neg logit contributions
hing-0.015
oned-0.015
 blush-0.014
 grin-0.014
 urge-0.013
Token for
Token ID329
Feature activation+9.28e-03

Pos logit contributions
hing+0.071
oned+0.069
 blush+0.066
 grin+0.063
 urge+0.061
Neg logit contributions
keeper-0.033
 mysteries-0.029
minded-0.028
lan-0.028
eddy-0.027
Token you
Token ID345
Feature activation+2.13e-02

Pos logit contributions
oned+0.019
hing+0.019
 blush+0.018
 start+0.017
 ease+0.017
Neg logit contributions
keeper-0.016
ber-0.014
eddy-0.013
minded-0.013
ney-0.013
Token the
Token ID262
Feature activation+4.97e-03

Pos logit contributions
keeper+0.009
eddy+0.009
ney+0.008
ber+0.008
lan+0.008
Neg logit contributions
hing-0.010
oned-0.010
 blush-0.010
 start-0.009
 ease-0.009
Token corn
Token ID11676
Feature activation-2.55e-02

Pos logit contributions
hing+0.012
oned+0.012
 urge+0.011
 blush+0.011
 start+0.011
Neg logit contributions
keeper-0.006
eddy-0.005
minded-0.005
ber-0.005
lan-0.005
Token.
Token ID13
Feature activation-5.18e-03

Pos logit contributions
keeper+0.037
eddy+0.036
lan+0.034
minded+0.034
 mysteries+0.033
Neg logit contributions
hing-0.054
oned-0.051
 blush-0.048
 start-0.047
 urge-0.047
Token But
Token ID887
Feature activation+1.28e-02

Pos logit contributions
keeper+0.010
 mysteries+0.009
ind+0.009
ups+0.009
ber+0.009
Neg logit contributions
hing-0.009
oned-0.009
 blush-0.008
're-0.008
 hardly-0.007
Token,
Token ID11
Feature activation+1.99e-02

Pos logit contributions
hing+0.020
oned+0.019
 start+0.018
 blush+0.018
 urge+0.018
Neg logit contributions
keeper-0.026
eddy-0.025
lan-0.024
minded-0.024
ber-0.023
Token you
Token ID345
Feature activation+6.12e-02

Pos logit contributions
hing+0.032
oned+0.030
 blush+0.029
 start+0.028
 urge+0.028
Neg logit contributions
keeper-0.040
eddy-0.039
lan-0.037
minded-0.037
 mysteries-0.036
Token have
Token ID423
Feature activation-1.05e-02

Pos logit contributions
hing+0.108
oned+0.104
 blush+0.097
 grin+0.091
 AW+0.086
Neg logit contributions
keeper-0.116
 mysteries-0.104
ber-0.103
minded-0.103
lan-0.103
Token to
Token ID284
Feature activation+1.52e-02

Pos logit contributions
eddy+0.014
keeper+0.014
ind+0.012
ber+0.012
lan+0.012
Neg logit contributions
 start-0.022
're-0.022
 hardly-0.022
oned-0.022
 blush-0.021
Token be
Token ID307
Feature activation+1.43e-02

Pos logit contributions
hing+0.025
oned+0.024
 blush+0.021
rell+0.020
 AW+0.020
Neg logit contributions
keeper-0.028
minded-0.027
 who-0.026
 mysteries-0.026
lan-0.025
Token fast
Token ID3049
Feature activation+3.87e-03

Pos logit contributions
hing+0.030
oned+0.029
 blush+0.027
 grin+0.026
 urge+0.025
Neg logit contributions
keeper-0.023
lan-0.020
eddy-0.020
 mysteries-0.019
ber-0.019
Token!"
Token ID2474
Feature activation+4.85e-03

Pos logit contributions
hing+0.008
oned+0.008
 blush+0.007
 urge+0.007
 grin+0.007
Neg logit contributions
keeper-0.006
lan-0.005
eddy-0.005
 mysteries-0.005
ber-0.005
Tokenâ
Token ID22940
Feature activation-4.24e-03

Pos logit contributions
keeper+0.003
eddy+0.003
ber+0.003
 who+0.003
enn+0.003
Neg logit contributions
hing-0.008
 blush-0.007
oned-0.007
 grin-0.007
 ease-0.007
Token
Token ID26391
Feature activation+1.29e-02

Pos logit contributions
keeper+0.009
minded+0.009
eddy+0.009
ber+0.009
lan+0.008
Neg logit contributions
hing-0.006
oned-0.005
 blush-0.005
 grin-0.005
 ease-0.005
Token
Token ID129
Feature activation+4.41e-03

Pos logit contributions
hing+0.028
oned+0.027
 blush+0.026
 grin+0.025
 urge+0.024
Neg logit contributions
keeper-0.019
 mysteries-0.017
lan-0.017
minded-0.017
ber-0.016
Token
Token ID241
Feature activation-6.51e-03

Pos logit contributions
hing+0.008
 blush+0.007
oned+0.007
 grin+0.007
're+0.006
Neg logit contributions
keeper-0.008
ind-0.008
minded-0.008
 mysteries-0.008
ber-0.008
TokenOh
Token ID5812
Feature activation+3.22e-02

Pos logit contributions
keeper+0.010
 mysteries+0.009
minded+0.009
lan+0.009
ber+0.008
Neg logit contributions
hing-0.014
oned-0.014
 blush-0.013
 grin-0.012
 urge-0.012
Token no
Token ID645
Feature activation+4.99e-03

Pos logit contributions
hing+0.070
oned+0.067
 blush+0.062
 grin+0.060
 respond+0.058
Neg logit contributions
keeper-0.048
 who-0.042
 mysteries-0.042
ber-0.042
 whose-0.042
Token,
Token ID11
Feature activation+1.76e-02

Pos logit contributions
hing+0.011
oned+0.010
 blush+0.010
 grin+0.009
 urge+0.009
Neg logit contributions
keeper-0.008
lan-0.007
 mysteries-0.007
ber-0.007
minded-0.007
Token I
Token ID314
Feature activation+4.93e-02

Pos logit contributions
hing+0.040
oned+0.038
 blush+0.037
 grin+0.035
 urge+0.035
Neg logit contributions
keeper-0.025
lan-0.021
minded-0.021
ber-0.021
eddy-0.021
Token'm
Token ID1101
Feature activation+2.79e-02

Pos logit contributions
hing+0.092
oned+0.089
 blush+0.079
 grin+0.075
 sidelines+0.075
Neg logit contributions
keeper-0.088
ber-0.079
ray-0.078
 mysteries-0.078
minded-0.077
Token sorry
Token ID7926
Feature activation+4.76e-03

Pos logit contributions
hing+0.056
oned+0.051
 blush+0.048
 grin+0.046
mine+0.045
Neg logit contributions
keeper-0.048
ber-0.042
 who-0.042
lan-0.042
 mysteries-0.042
Token!
Token ID0
Feature activation-7.52e-03

Pos logit contributions
hing+0.008
oned+0.008
 blush+0.008
 urge+0.007
 grin+0.007
Neg logit contributions
keeper-0.009
lan-0.008
minded-0.008
 mysteries-0.008
ber-0.008
Token.
Token ID13
Feature activation+5.89e-03

Pos logit contributions
hing+0.013
oned+0.013
 blush+0.012
 grin+0.012
 urge+0.012
Neg logit contributions
keeper-0.007
eddy-0.006
 mysteries-0.006
ber-0.006
lan-0.006
Token He
Token ID679
Feature activation-2.31e-02

Pos logit contributions
oned+0.013
hing+0.012
 blush+0.012
 grin+0.011
 hardly+0.011
Neg logit contributions
keeper-0.008
 mysteries-0.008
eddy-0.008
minded-0.008
ber-0.008
Token saw
Token ID2497
Feature activation-3.08e-02

Pos logit contributions
keeper+0.044
eddy+0.043
lan+0.041
minded+0.040
 mysteries+0.040
Neg logit contributions
hing-0.039
oned-0.036
 blush-0.034
 hardly-0.033
 grin-0.033
Token Mia
Token ID32189
Feature activation+1.15e-02

Pos logit contributions
keeper+0.045
eddy+0.040
ber+0.040
lan+0.039
minded+0.039
Neg logit contributions
hing-0.067
oned-0.063
 blush-0.062
 grin-0.058
 ease-0.057
Token still
Token ID991
Feature activation-1.27e-02

Pos logit contributions
hing+0.022
oned+0.021
 blush+0.020
 grin+0.019
 AW+0.019
Neg logit contributions
keeper-0.020
ber-0.017
 mysteries-0.017
lan-0.017
minded-0.017
Token swimming
Token ID14899
Feature activation+1.12e-02

Pos logit contributions
keeper+0.024
ber+0.022
 who+0.022
 mysteries+0.021
minded+0.021
Neg logit contributions
hing-0.022
oned-0.021
 blush-0.020
 AW-0.018
mine-0.018
Token and
Token ID290
Feature activation-1.88e-03

Pos logit contributions
hing+0.025
oned+0.023
 blush+0.022
 grin+0.021
 urge+0.021
Neg logit contributions
keeper-0.016
lan-0.015
eddy-0.014
 mysteries-0.014
minded-0.014
Token smiled
Token ID13541
Feature activation+3.96e-03

Pos logit contributions
keeper+0.003
 mysteries+0.003
lan+0.003
eddy+0.003
minded+0.003
Neg logit contributions
hing-0.004
oned-0.004
 blush-0.003
 grin-0.003
 urge-0.003
Token.
Token ID13
Feature activation+4.74e-03

Pos logit contributions
hing+0.009
oned+0.008
 blush+0.008
 grin+0.008
 urge+0.008
Neg logit contributions
keeper-0.005
 mysteries-0.005
lan-0.005
minded-0.005
ber-0.005
Token
Token ID198
Feature activation-1.59e-03

Pos logit contributions
oned+0.011
hing+0.010
 blush+0.010
 grin+0.010
 hardly+0.010
Neg logit contributions
keeper-0.006
 mysteries-0.006
eddy-0.006
ind-0.006
minded-0.006
Token
Token ID198
Feature activation-3.69e-03

Pos logit contributions
keeper+0.002
 mysteries+0.002
minded+0.002
lan+0.001
eddy+0.001
Neg logit contributions
hing-0.004
oned-0.004
 blush-0.004
 hardly-0.004
 grin-0.004
Token room
Token ID2119
Feature activation-8.80e-03

Pos logit contributions
 blush+0.015
oned+0.015
hing+0.014
 urge+0.014
 start+0.014
Neg logit contributions
keeper-0.010
lan-0.010
ber-0.010
minded-0.009
eddy-0.009
Token,
Token ID11
Feature activation+8.82e-03

Pos logit contributions
keeper+0.011
 mysteries+0.009
lan+0.009
eddy+0.009
minded+0.009
Neg logit contributions
hing-0.021
oned-0.020
 blush-0.019
 grin-0.019
 urge-0.018
Token and
Token ID290
Feature activation-4.65e-03

Pos logit contributions
hing+0.019
oned+0.018
 blush+0.017
 grin+0.016
 urge+0.016
Neg logit contributions
keeper-0.013
lan-0.012
eddy-0.012
minded-0.012
 mysteries-0.012
Token she
Token ID673
Feature activation-1.66e-02

Pos logit contributions
keeper+0.007
eddy+0.007
lan+0.007
 mysteries+0.007
minded+0.007
Neg logit contributions
hing-0.009
oned-0.009
 blush-0.009
 grin-0.008
 hardly-0.008
Token was
Token ID373
Feature activation+1.53e-02

Pos logit contributions
keeper+0.029
lan+0.028
minded+0.027
eddy+0.027
 mysteries+0.027
Neg logit contributions
hing-0.030
oned-0.029
 blush-0.028
 grin-0.026
 hardly-0.025
Token screaming
Token ID14788
Feature activation+3.14e-02

Pos logit contributions
hing+0.032
oned+0.030
 blush+0.028
 AW+0.027
 grin+0.027
Neg logit contributions
keeper-0.025
ber-0.022
 mysteries-0.021
lan-0.021
minded-0.021
Token!
Token ID0
Feature activation+5.92e-04

Pos logit contributions
hing+0.079
oned+0.076
 blush+0.073
 grin+0.071
 urge+0.068
Neg logit contributions
keeper-0.036
 mysteries-0.030
lan-0.030
minded-0.030
ber-0.029
Token She
Token ID1375
Feature activation-1.86e-02

Pos logit contributions
oned+0.001
hing+0.001
 blush+0.001
 grin+0.001
 hardly+0.001
Neg logit contributions
keeper-0.001
eddy-0.001
 mysteries-0.001
ber-0.001
lan-0.001
Token had
Token ID550
Feature activation-9.63e-03

Pos logit contributions
keeper+0.034
lan+0.031
minded+0.031
eddy+0.031
 mysteries+0.031
Neg logit contributions
hing-0.034
oned-0.032
 blush-0.031
 grin-0.029
 urge-0.028
Token found
Token ID1043
Feature activation-3.59e-02

Pos logit contributions
keeper+0.016
 mysteries+0.015
lan+0.014
ber+0.014
minded+0.014
Neg logit contributions
hing-0.019
oned-0.018
 blush-0.017
 grin-0.017
 urge-0.016
Token Sam
Token ID3409
Feature activation-6.53e-03

Pos logit contributions
keeper+0.062
ber+0.057
lan+0.056
eddy+0.056
minded+0.055
Neg logit contributions
hing-0.069
oned-0.065
 blush-0.064
 ease-0.060
 grin-0.057
Token good
Token ID922
Feature activation+9.36e-03

Pos logit contributions
keeper+0.049
minded+0.044
 mysteries+0.043
lan+0.043
ber+0.042
Neg logit contributions
hing-0.046
oned-0.043
 blush-0.042
 grin-0.040
 urge-0.039
Token dad
Token ID9955
Feature activation-3.51e-02

Pos logit contributions
hing+0.024
oned+0.024
 blush+0.023
 grin+0.022
 urge+0.022
Neg logit contributions
keeper-0.010
 mysteries-0.008
lan-0.008
minded-0.008
ber-0.008
Token.
Token ID13
Feature activation-3.60e-03

Pos logit contributions
 mysteries+0.057
keeper+0.056
lan+0.053
 surroundings+0.052
Really+0.052
Neg logit contributions
hing-0.065
mine-0.059
 are-0.057
're-0.057
 laugh-0.056
Token You
Token ID921
Feature activation+4.35e-02

Pos logit contributions
keeper+0.006
 mysteries+0.006
ups+0.005
ber+0.005
minded+0.005
Neg logit contributions
hing-0.007
oned-0.007
 blush-0.007
 grin-0.006
 urge-0.006
Token support
Token ID1104
Feature activation+8.22e-03

Pos logit contributions
oned+0.068
 sidelines+0.054
hing+0.054
 grin+0.051
 blush+0.050
Neg logit contributions
 who-0.089
keeper-0.089
ray-0.087
 mysteries-0.087
minded-0.083
Token me
Token ID502
Feature activation+1.38e-02

Pos logit contributions
hing+0.015
 blush+0.015
oned+0.015
mine+0.014
 start+0.014
Neg logit contributions
keeper-0.014
lan-0.013
ber-0.013
eddy-0.012
ney-0.012
Token and
Token ID290
Feature activation-2.37e-02

Pos logit contributions
hing+0.028
oned+0.027
 blush+0.025
mine+0.024
 start+0.024
Neg logit contributions
keeper-0.021
 mysteries-0.019
lan-0.019
eddy-0.018
Really-0.018
Token our
Token ID674
Feature activation+1.25e-02

Pos logit contributions
keeper+0.040
eddy+0.037
minded+0.036
lan+0.035
enn+0.035
Neg logit contributions
 blush-0.043
hing-0.043
oned-0.040
 grin-0.040
 laugh-0.040
Token baby
Token ID5156
Feature activation+1.66e-03

Pos logit contributions
hing+0.023
 blush+0.023
oned+0.023
 urge+0.022
 grin+0.021
Neg logit contributions
keeper-0.020
lan-0.019
minded-0.019
ber-0.018
 mysteries-0.017
Token."
Token ID526
Feature activation-1.63e-02

Pos logit contributions
hing+0.003
 blush+0.003
oned+0.003
 urge+0.003
mine+0.003
Neg logit contributions
keeper-0.002
lan-0.002
minded-0.002
 mysteries-0.002
eddy-0.002
Token
Token ID198
Feature activation-1.46e-02

Pos logit contributions
keeper+0.026
 mysteries+0.025
ind+0.024
ber+0.024
ney+0.023
Neg logit contributions
 blush-0.032
oned-0.032
hing-0.030
 grin-0.030
 ease-0.028
Token do
Token ID466
Feature activation-2.07e-03

Pos logit contributions
hing+0.030
oned+0.029
 blush+0.027
 grin+0.025
 AW+0.024
Neg logit contributions
keeper-0.037
 mysteries-0.035
lan-0.033
minded-0.033
 cave-0.032
Token something
Token ID1223
Feature activation+3.09e-03

Pos logit contributions
keeper+0.003
eddy+0.003
lan+0.003
ber+0.003
minded+0.003
Neg logit contributions
hing-0.004
oned-0.004
 blush-0.004
 grin-0.004
 hardly-0.004
Token amazing
Token ID4998
Feature activation+4.66e-03

Pos logit contributions
hing+0.006
oned+0.006
 blush+0.006
 grin+0.005
 urge+0.005
Neg logit contributions
keeper-0.005
eddy-0.005
minded-0.005
lan-0.005
ber-0.005
Token!",
Token ID40754
Feature activation+1.74e-03

Pos logit contributions
hing+0.009
oned+0.009
 blush+0.008
 grin+0.008
 respond+0.008
Neg logit contributions
keeper-0.008
lan-0.007
eddy-0.007
minded-0.007
ber-0.007
Token he
Token ID339
Feature activation-1.57e-02

Pos logit contributions
hing+0.003
 blush+0.003
oned+0.003
 shout+0.003
 grin+0.003
Neg logit contributions
eddy-0.003
keeper-0.003
 mysteries-0.003
lan-0.003
ind-0.003
Token said
Token ID531
Feature activation+9.99e-03

Pos logit contributions
keeper+0.026
 mysteries+0.023
eddy+0.023
lan+0.023
minded+0.023
Neg logit contributions
hing-0.031
oned-0.030
 blush-0.029
 grin-0.027
 urge-0.026
Token.
Token ID13
Feature activation-1.85e-02

Pos logit contributions
hing+0.024
oned+0.023
 blush+0.022
 grin+0.022
 urge+0.021
Neg logit contributions
keeper-0.012
 mysteries-0.011
lan-0.010
eddy-0.010
minded-0.010
Token
Token ID198
Feature activation-1.51e-02

Pos logit contributions
 mysteries+0.029
ind+0.028
minded+0.026
lan+0.025
 cave+0.025
Neg logit contributions
oned-0.039
hing-0.035
 hardly-0.035
 blush-0.034
 grin-0.034
Token
Token ID198
Feature activation-1.66e-02

Pos logit contributions
 mysteries+0.020
keeper+0.019
minded+0.019
lan+0.019
ishes+0.019
Neg logit contributions
oned-0.033
hing-0.033
 hardly-0.030
 grin-0.029
 blush-0.029
TokenMom
Token ID29252
Feature activation-2.61e-02

Pos logit contributions
keeper+0.027
 mysteries+0.026
lan+0.025
eddy+0.025
minded+0.025
Neg logit contributions
hing-0.032
oned-0.031
 blush-0.028
 grin-0.027
 hardly-0.027
Tokenmy
Token ID1820
Feature activation-2.17e-02

Pos logit contributions
keeper+0.039
 mysteries+0.035
lan+0.035
minded+0.034
ber+0.034
Neg logit contributions
hing-0.056
oned-0.054
 blush-0.051
 grin-0.049
 urge-0.047
Token make
Token ID787
Feature activation+2.40e-02

Pos logit contributions
oned+0.038
hing+0.036
 AW+0.035
 blush+0.033
 grin+0.032
Neg logit contributions
keeper-0.041
 who-0.039
 mysteries-0.038
minded-0.037
 cel-0.037
Token my
Token ID616
Feature activation+2.22e-02

Pos logit contributions
hing+0.058
oned+0.055
 blush+0.053
 grin+0.049
 urge+0.049
Neg logit contributions
keeper-0.032
ber-0.027
lan-0.027
minded-0.026
eddy-0.026
Token kingdom
Token ID13239
Feature activation+3.50e-03

Pos logit contributions
hing+0.046
oned+0.044
 blush+0.043
 grin+0.041
 urge+0.041
Neg logit contributions
keeper-0.034
eddy-0.031
lan-0.030
ber-0.030
minded-0.030
Token better
Token ID1365
Feature activation-2.98e-02

Pos logit contributions
hing+0.007
oned+0.007
 blush+0.006
 grin+0.006
mine+0.006
Neg logit contributions
keeper-0.006
 mysteries-0.005
minded-0.005
ber-0.005
lan-0.005
Token?"
Token ID1701
Feature activation-4.86e-05

Pos logit contributions
keeper+0.046
lan+0.041
 mysteries+0.039
minded+0.039
eddy+0.038
Neg logit contributions
hing-0.062
oned-0.059
 blush-0.056
 urge-0.055
 grin-0.055
Token
Token ID198
Feature activation-1.07e-02

Pos logit contributions
ind+0.000
ber+0.000
 mysteries+0.000
eddy+0.000
lan+0.000
Neg logit contributions
 blush-0.000
hing-0.000
oned-0.000
 grin-0.000
mine-0.000
TokenThe
Token ID464
Feature activation-9.28e-03

Pos logit contributions
 mysteries+0.015
keeper+0.015
ber+0.014
lan+0.014
ishes+0.013
Neg logit contributions
hing-0.023
oned-0.022
 blush-0.021
mine-0.021
 grin-0.020
Token little
Token ID1310
Feature activation-1.81e-02

Pos logit contributions
eddy+0.012
keeper+0.012
enn+0.012
 mysteries+0.011
lan+0.011
Neg logit contributions
 blush-0.019
oned-0.019
hing-0.019
 urge-0.018
 grin-0.018
Token girl
Token ID2576
Feature activation+1.73e-02

Pos logit contributions
eddy+0.023
enn+0.022
lan+0.022
 mysteries+0.022
ber+0.022
Neg logit contributions
 blush-0.038
 urge-0.037
hing-0.037
oned-0.036
 grin-0.036
Token thought
Token ID1807
Feature activation-1.86e-02

Pos logit contributions
 blush+0.027
hing+0.027
oned+0.026
 grin+0.026
 hardly+0.024
Neg logit contributions
lan-0.033
eddy-0.033
 mysteries-0.033
keeper-0.032
minded-0.031
Token hard
Token ID1327
Feature activation-4.68e-04

Pos logit contributions
keeper+0.025
 mysteries+0.021
lan+0.021
minded+0.021
ber+0.021
Neg logit contributions
hing-0.043
oned-0.042
 blush-0.040
 grin-0.038
mine-0.037
Token Lily
Token ID20037
Feature activation-1.27e-02

Pos logit contributions
hing+0.010
oned+0.009
 blush+0.009
mine+0.008
 grin+0.008
Neg logit contributions
keeper-0.008
lan-0.007
ber-0.007
 mysteries-0.007
eddy-0.007
Token with
Token ID351
Feature activation-3.55e-03

Pos logit contributions
keeper+0.013
 who+0.011
ber+0.011
 whose+0.010
o+0.010
Neg logit contributions
hing-0.032
oned-0.031
 blush-0.030
 grin-0.029
 hardly-0.028
Token the
Token ID262
Feature activation+1.05e-02

Pos logit contributions
keeper+0.004
 mysteries+0.004
 who+0.003
 coal+0.003
minded+0.003
Neg logit contributions
hing-0.009
oned-0.009
 grin-0.008
 blush-0.008
ivery-0.008
Token block
Token ID2512
Feature activation-2.23e-02

Pos logit contributions
hing+0.026
oned+0.025
 blush+0.023
ivated+0.022
mine+0.022
Neg logit contributions
keeper-0.014
 mysteries-0.011
lan-0.011
ber-0.011
 boy-0.010
Token.
Token ID13
Feature activation+7.12e-03

Pos logit contributions
keeper+0.023
ber+0.018
lan+0.017
 mysteries+0.017
minded+0.017
Neg logit contributions
hing-0.059
oned-0.058
 blush-0.055
 grin-0.053
 respond-0.052
Token He
Token ID679
Feature activation-2.81e-02

Pos logit contributions
oned+0.016
hing+0.015
 blush+0.015
 hardly+0.014
 grin+0.014
Neg logit contributions
 mysteries-0.010
keeper-0.010
ber-0.009
lan-0.009
minded-0.009
Token struck
Token ID7425
Feature activation-3.26e-04

Pos logit contributions
keeper+0.058
 mysteries+0.052
lan+0.052
ber+0.052
minded+0.052
Neg logit contributions
hing-0.045
oned-0.043
 blush-0.040
 grin-0.038
 urge-0.036
Token her
Token ID607
Feature activation-1.32e-02

Pos logit contributions
keeper+0.001
lan+0.000
 mysteries+0.000
ber+0.000
minded+0.000
Neg logit contributions
hing-0.001
oned-0.001
 blush-0.001
 grin-0.001
 urge-0.001
Token on
Token ID319
Feature activation+2.58e-02

Pos logit contributions
keeper+0.019
ber+0.016
eddy+0.016
 mysteries+0.016
lan+0.016
Neg logit contributions
hing-0.030
oned-0.029
 blush-0.027
 grin-0.026
 respond-0.025
Token the
Token ID262
Feature activation+1.90e-02

Pos logit contributions
hing+0.052
oned+0.051
 blush+0.050
 grin+0.046
mine+0.046
Neg logit contributions
keeper-0.041
ber-0.038
lan-0.038
minded-0.037
 mysteries-0.036
Token head
Token ID1182
Feature activation+5.89e-03

Pos logit contributions
hing+0.040
oned+0.038
 blush+0.034
ivated+0.034
 respond+0.033
Neg logit contributions
keeper-0.032
lan-0.027
eddy-0.026
ber-0.026
 mysteries-0.026
Token best
Token ID1266
Feature activation+8.00e-03

Pos logit contributions
hing+0.039
oned+0.037
 blush+0.035
 grin+0.032
 urge+0.031
Neg logit contributions
keeper-0.055
 mysteries-0.050
lan-0.050
ber-0.050
minded-0.050
Token friends
Token ID2460
Feature activation+2.83e-02

Pos logit contributions
hing+0.020
oned+0.019
 blush+0.018
 grin+0.018
 urge+0.017
Neg logit contributions
keeper-0.010
 mysteries-0.008
minded-0.008
lan-0.008
ber-0.008
Token."
Token ID526
Feature activation-1.04e-02

Pos logit contributions
hing+0.067
oned+0.062
 blush+0.060
're+0.057
 grin+0.057
Neg logit contributions
keeper-0.038
lan-0.032
eddy-0.031
 mysteries-0.031
minded-0.031
Token
Token ID198
Feature activation-7.07e-03

Pos logit contributions
 mysteries+0.019
ind+0.017
keeper+0.016
minded+0.016
ber+0.016
Neg logit contributions
 blush-0.019
oned-0.018
 grin-0.018
 shout-0.016
hing-0.016
Token
Token ID198
Feature activation-5.77e-03

Pos logit contributions
keeper+0.010
 mysteries+0.009
minded+0.009
lan+0.009
ber+0.009
Neg logit contributions
hing-0.016
oned-0.015
 blush-0.014
 grin-0.014
 ease-0.013
TokenMom
Token ID29252
Feature activation-2.13e-02

Pos logit contributions
keeper+0.010
 mysteries+0.009
minded+0.009
lan+0.009
ber+0.009
Neg logit contributions
hing-0.011
oned-0.011
 blush-0.010
 grin-0.009
mine-0.009
Token hugged
Token ID41130
Feature activation-6.23e-03

Pos logit contributions
keeper+0.040
 mysteries+0.040
lan+0.039
minded+0.037
ber+0.037
Neg logit contributions
hing-0.037
 blush-0.034
oned-0.033
 grin-0.032
mine-0.030
Token Lily
Token ID20037
Feature activation-1.39e-02

Pos logit contributions
keeper+0.012
ber+0.011
lan+0.011
 mysteries+0.011
ney+0.011
Neg logit contributions
oned-0.011
hing-0.011
 blush-0.010
mine-0.010
 start-0.009
Token and
Token ID290
Feature activation-1.34e-02

Pos logit contributions
 mysteries+0.022
lan+0.022
keeper+0.021
ber+0.019
minded+0.019
Neg logit contributions
hing-0.027
oned-0.026
 blush-0.026
 grin-0.026
 laugh-0.024
Token Anna
Token ID11735
Feature activation+2.44e-02

Pos logit contributions
keeper+0.024
 mysteries+0.022
eddy+0.022
ind+0.022
ber+0.022
Neg logit contributions
 blush-0.024
hing-0.023
oned-0.022
 grin-0.022
 start-0.020
Token and
Token ID290
Feature activation+3.21e-02

Pos logit contributions
hing+0.052
oned+0.051
 blush+0.050
 grin+0.048
 start+0.047
Neg logit contributions
keeper-0.035
 mysteries-0.033
lan-0.032
eddy-0.031
ber-0.031
Token to
Token ID284
Feature activation+1.46e-02

Pos logit contributions
keeper+0.009
minded+0.007
ber+0.007
elia+0.007
 mysteries+0.006
Neg logit contributions
hing-0.019
 blush-0.018
oned-0.018
 laugh-0.017
 start-0.017
Token play
Token ID711
Feature activation+2.53e-02

Pos logit contributions
oned+0.030
hing+0.028
ivated+0.028
 AW+0.028
aned+0.026
Neg logit contributions
 who-0.019
lan-0.019
o-0.019
 coal-0.018
 cave-0.018
Token with
Token ID351
Feature activation-1.11e-02

Pos logit contributions
hing+0.055
oned+0.053
 blush+0.050
 grin+0.047
 urge+0.046
Neg logit contributions
keeper-0.039
lan-0.033
 mysteries-0.033
ber-0.033
minded-0.033
Token toys
Token ID14958
Feature activation+2.07e-02

Pos logit contributions
keeper+0.023
minded+0.020
ber+0.019
lan+0.019
ray+0.019
Neg logit contributions
hing-0.019
 blush-0.018
 laugh-0.016
oned-0.016
 start-0.016
Token.
Token ID13
Feature activation+9.76e-03

Pos logit contributions
hing+0.042
 blush+0.040
oned+0.040
 start+0.038
mine+0.037
Neg logit contributions
keeper-0.031
lan-0.028
eddy-0.027
 mysteries-0.027
minded-0.027
Token One
Token ID1881
Feature activation-3.26e-02

Pos logit contributions
hing+0.017
oned+0.017
 blush+0.016
 grin+0.015
 urge+0.014
Neg logit contributions
keeper-0.019
minded-0.017
ber-0.017
 mysteries-0.017
lan-0.017
Token day
Token ID1110
Feature activation+9.26e-03

Pos logit contributions
keeper+0.056
 mysteries+0.050
lan+0.050
minded+0.049
ber+0.049
Neg logit contributions
hing-0.064
oned-0.062
 blush-0.058
 grin-0.055
 urge-0.053
Token,
Token ID11
Feature activation+8.01e-03

Pos logit contributions
hing+0.021
oned+0.020
 blush+0.019
 grin+0.018
ivery+0.018
Neg logit contributions
keeper-0.013
lan-0.012
ber-0.012
 mysteries-0.011
eddy-0.011
Token they
Token ID484
Feature activation+4.07e-02

Pos logit contributions
hing+0.020
oned+0.020
 blush+0.019
 grin+0.018
 urge+0.017
Neg logit contributions
keeper-0.009
lan-0.008
 mysteries-0.008
ber-0.008
minded-0.008
Token find
Token ID1064
Feature activation-1.83e-02

Pos logit contributions
hing+0.063
oned+0.063
 blush+0.054
 grin+0.053
 AW+0.053
Neg logit contributions
keeper-0.082
 who-0.077
 mysteries-0.077
minded-0.076
 boy-0.076
Token a
Token ID257
Feature activation-1.85e-02

Pos logit contributions
keeper+0.029
minded+0.025
ber+0.025
lan+0.024
 mysteries+0.024
Neg logit contributions
hing-0.038
oned-0.037
 blush-0.035
 urge-0.033
 grin-0.032
Token
Token ID198
Feature activation-1.22e-02

Pos logit contributions
hing+0.051
oned+0.050
 blush+0.048
 grin+0.046
 respond+0.045
Neg logit contributions
keeper-0.024
eddy-0.020
 mysteries-0.020
lan-0.020
ber-0.020
TokenM
Token ID44
Feature activation-1.24e-02

Pos logit contributions
keeper+0.019
 mysteries+0.017
minded+0.017
lan+0.017
eddy+0.016
Neg logit contributions
oned-0.026
hing-0.025
 blush-0.023
 grin-0.022
 hardly-0.022
Tokenolly
Token ID5098
Feature activation+7.54e-03

Pos logit contributions
keeper+0.023
 mysteries+0.023
minded+0.022
eddy+0.020
lan+0.020
Neg logit contributions
oned-0.020
hing-0.020
 blush-0.020
 grin-0.019
 hardly-0.018
Token was
Token ID373
Feature activation+2.07e-02

Pos logit contributions
hing+0.015
oned+0.014
 blush+0.014
 grin+0.013
 urge+0.013
Neg logit contributions
keeper-0.013
 mysteries-0.011
lan-0.011
minded-0.011
eddy-0.011
Token so
Token ID523
Feature activation-3.00e-03

Pos logit contributions
hing+0.050
oned+0.046
 blush+0.043
 AW+0.043
mine+0.042
Neg logit contributions
keeper-0.027
 who-0.023
ber-0.023
 late-0.022
 mysteries-0.022
Token happy
Token ID3772
Feature activation-3.05e-03

Pos logit contributions
keeper+0.005
ber+0.004
 late+0.004
 who+0.004
lan+0.004
Neg logit contributions
hing-0.007
oned-0.006
 blush-0.006
mine-0.006
rell-0.006
Token!
Token ID0
Feature activation+9.26e-03

Pos logit contributions
keeper+0.004
minded+0.004
 mysteries+0.004
ber+0.004
lan+0.004
Neg logit contributions
oned-0.006
 blush-0.006
hing-0.006
 grin-0.006
 hardly-0.006
Token She
Token ID1375
Feature activation-1.28e-02

Pos logit contributions
oned+0.015
 blush+0.014
hing+0.014
 grin+0.013
 hardly+0.013
Neg logit contributions
keeper-0.018
 mysteries-0.018
minded-0.017
lan-0.017
ber-0.017
Token got
Token ID1392
Feature activation-1.72e-02

Pos logit contributions
keeper+0.023
 mysteries+0.021
lan+0.021
minded+0.021
ber+0.021
Neg logit contributions
hing-0.023
oned-0.022
 blush-0.021
 grin-0.020
 urge-0.019
Token a
Token ID257
Feature activation-1.97e-02

Pos logit contributions
keeper+0.026
 mysteries+0.024
ber+0.024
lan+0.023
eddy+0.023
Neg logit contributions
hing-0.034
 blush-0.034
oned-0.034
 grin-0.031
 start-0.030
Token new
Token ID649
Feature activation-1.08e-02

Pos logit contributions
keeper+0.032
 mysteries+0.028
lan+0.028
ber+0.027
minded+0.027
Neg logit contributions
hing-0.041
oned-0.039
 blush-0.037
 grin-0.035
 urge-0.034
Token playing
Token ID2712
Feature activation+2.01e-02

Pos logit contributions
keeper+0.049
lan+0.044
ber+0.044
eddy+0.043
minded+0.042
Neg logit contributions
hing-0.052
oned-0.050
 blush-0.050
 grin-0.045
 urge-0.045
Token with
Token ID351
Feature activation-1.59e-03

Pos logit contributions
hing+0.049
oned+0.047
 blush+0.045
 grin+0.043
 urge+0.042
Neg logit contributions
keeper-0.025
lan-0.021
ber-0.021
 mysteries-0.021
minded-0.021
Token her
Token ID607
Feature activation-4.37e-03

Pos logit contributions
keeper+0.003
ber+0.003
lan+0.002
eddy+0.002
minded+0.002
Neg logit contributions
hing-0.003
oned-0.003
 blush-0.003
 ease-0.003
 grin-0.003
Token brother
Token ID3956
Feature activation+5.55e-03

Pos logit contributions
keeper+0.007
minded+0.006
lan+0.006
eddy+0.006
ber+0.006
Neg logit contributions
hing-0.009
oned-0.008
 blush-0.008
 grin-0.008
 urge-0.008
Token was
Token ID373
Feature activation+2.29e-02

Pos logit contributions
hing+0.011
oned+0.011
 blush+0.010
 grin+0.010
 urge+0.009
Neg logit contributions
keeper-0.009
lan-0.008
eddy-0.008
minded-0.008
 mysteries-0.008
Token even
Token ID772
Feature activation-5.83e-02

Pos logit contributions
hing+0.048
oned+0.046
 blush+0.044
 grin+0.042
 urge+0.041
Neg logit contributions
keeper-0.036
lan-0.032
ber-0.031
eddy-0.031
 mysteries-0.031
Token better
Token ID1365
Feature activation-3.44e-02

Pos logit contributions
keeper+0.090
lan+0.086
eddy+0.085
ber+0.082
enn+0.080
Neg logit contributions
hing-0.117
 blush-0.112
oned-0.109
 urge-0.104
 realize-0.102
Token than
Token ID621
Feature activation-2.53e-03

Pos logit contributions
keeper+0.053
lan+0.045
eddy+0.045
elia+0.042
ray+0.042
Neg logit contributions
oned-0.068
hing-0.067
 urge-0.066
 blush-0.065
 realize-0.064
Token playing
Token ID2712
Feature activation+2.05e-02

Pos logit contributions
keeper+0.005
eddy+0.005
ber+0.005
lan+0.005
minded+0.005
Neg logit contributions
hing-0.004
oned-0.004
 blush-0.004
 ease-0.003
 grin-0.003
Token with
Token ID351
Feature activation-2.71e-03

Pos logit contributions
hing+0.053
oned+0.051
 blush+0.049
 grin+0.047
 urge+0.046
Neg logit contributions
keeper-0.022
 mysteries-0.018
lan-0.018
minded-0.018
ber-0.018
Token her
Token ID607
Feature activation-4.41e-03

Pos logit contributions
keeper+0.005
ber+0.004
lan+0.004
minded+0.004
eddy+0.004
Neg logit contributions
hing-0.005
oned-0.005
 blush-0.005
 ease-0.005
 grin-0.005
Token
Token ID198
Feature activation+2.26e-03

Pos logit contributions
oned+0.005
hing+0.005
 blush+0.004
 hardly+0.004
 ease+0.004
Neg logit contributions
keeper-0.004
 mysteries-0.004
ber-0.004
minded-0.004
lan-0.003
TokenFinally
Token ID11158
Feature activation-1.32e-03

Pos logit contributions
oned+0.004
hing+0.004
 blush+0.004
 hardly+0.004
mine+0.004
Neg logit contributions
keeper-0.004
 mysteries-0.003
lan-0.003
ber-0.003
minded-0.003
Token,
Token ID11
Feature activation+1.44e-02

Pos logit contributions
keeper+0.002
lan+0.002
 mysteries+0.002
ber+0.002
minded+0.002
Neg logit contributions
hing-0.003
oned-0.003
 blush-0.002
 grin-0.002
 respond-0.002
Token they
Token ID484
Feature activation+3.32e-02

Pos logit contributions
hing+0.030
oned+0.029
 blush+0.028
 grin+0.026
 urge+0.025
Neg logit contributions
keeper-0.023
 mysteries-0.020
ber-0.020
minded-0.020
eddy-0.020
Token arrived
Token ID5284
Feature activation-2.70e-02

Pos logit contributions
hing+0.065
oned+0.062
 blush+0.056
 AW+0.053
 respond+0.053
Neg logit contributions
keeper-0.061
ber-0.052
lan-0.051
 who-0.050
 mysteries-0.050
Token back
Token ID736
Feature activation-5.63e-02

Pos logit contributions
keeper+0.047
 mysteries+0.044
lan+0.043
minded+0.042
eddy+0.042
Neg logit contributions
oned-0.049
hing-0.049
 blush-0.045
 grin-0.045
 urge-0.043
Token home
Token ID1363
Feature activation-4.48e-02

Pos logit contributions
keeper+0.083
 mysteries+0.076
lan+0.076
minded+0.074
ber+0.068
Neg logit contributions
oned-0.116
hing-0.111
 blush-0.105
 start-0.104
 grin-0.104
Token.
Token ID13
Feature activation-3.44e-03

Pos logit contributions
keeper+0.055
 mysteries+0.053
lan+0.051
minded+0.051
eddy+0.049
Neg logit contributions
hing-0.102
oned-0.101
 blush-0.095
 grin-0.094
 urge-0.091
Token Their
Token ID5334
Feature activation+6.04e-04

Pos logit contributions
ber+0.006
keeper+0.006
 mysteries+0.006
ind+0.006
lan+0.006
Neg logit contributions
oned-0.006
 blush-0.006
hing-0.005
 grin-0.005
 hardly-0.005
Token son
Token ID3367
Feature activation-3.93e-03

Pos logit contributions
hing+0.001
oned+0.001
 blush+0.001
 grin+0.001
 urge+0.001
Neg logit contributions
keeper-0.001
lan-0.001
ber-0.001
minded-0.001
eddy-0.001
Token ran
Token ID4966
Feature activation+1.73e-02

Pos logit contributions
 mysteries+0.007
minded+0.007
lan+0.007
eddy+0.007
keeper+0.006
Neg logit contributions
hing-0.007
oned-0.007
 grin-0.007
 blush-0.007
 laugh-0.006
Token was
Token ID373
Feature activation-1.72e-02

Pos logit contributions
keeper+0.035
lan+0.031
minded+0.030
ber+0.030
 mysteries+0.030
Neg logit contributions
hing-0.058
oned-0.055
 blush-0.053
 grin-0.050
 urge-0.049
Token a
Token ID257
Feature activation-2.12e-02

Pos logit contributions
keeper+0.033
minded+0.030
ber+0.030
eddy+0.029
lan+0.029
Neg logit contributions
hing-0.030
 blush-0.028
oned-0.027
 start-0.026
 urge-0.026
Token great
Token ID1049
Feature activation-2.23e-03

Pos logit contributions
keeper+0.039
minded+0.036
lan+0.034
eddy+0.034
ber+0.034
Neg logit contributions
hing-0.038
oned-0.035
 blush-0.035
 grin-0.034
 urge-0.033
Token game
Token ID983
Feature activation-1.81e-03

Pos logit contributions
keeper+0.005
 mysteries+0.005
lan+0.005
ber+0.005
minded+0.005
Neg logit contributions
hing-0.003
oned-0.003
 blush-0.003
 grin-0.002
 urge-0.002
Token.
Token ID13
Feature activation-2.02e-02

Pos logit contributions
keeper+0.002
minded+0.002
eddy+0.002
 mysteries+0.002
lan+0.002
Neg logit contributions
hing-0.004
oned-0.004
 blush-0.004
 grin-0.004
 urge-0.004
Token<|endoftext|>
Token ID50256
Feature activation-6.44e-02

Pos logit contributions
keeper+0.043
ind+0.041
ber+0.040
minded+0.040
 mysteries+0.040
Neg logit contributions
oned-0.030
hing-0.028
 blush-0.027
 AW-0.025
 ease-0.024
TokenTom
Token ID13787
Feature activation-6.46e-03

Pos logit contributions
keeper+0.094
minded+0.080
ber+0.078
lan+0.076
ind+0.073
Neg logit contributions
hing-0.151
oned-0.139
hered-0.132
mine-0.129
 blush-0.128
Tokenmy
Token ID1820
Feature activation-4.52e-03

Pos logit contributions
keeper+0.010
lan+0.009
 mysteries+0.008
minded+0.008
ber+0.008
Neg logit contributions
hing-0.014
oned-0.013
 blush-0.013
 grin-0.012
 urge-0.012
Token was
Token ID373
Feature activation-9.61e-03

Pos logit contributions
keeper+0.008
lan+0.007
 mysteries+0.007
minded+0.007
ber+0.007
Neg logit contributions
hing-0.009
oned-0.008
 blush-0.008
 grin-0.008
 urge-0.007
Token a
Token ID257
Feature activation-4.32e-02

Pos logit contributions
keeper+0.014
ber+0.013
 who+0.013
lan+0.012
 whose+0.012
Neg logit contributions
hing-0.022
oned-0.020
 blush-0.019
 grin-0.018
 AW-0.018
Token clumsy
Token ID39120
Feature activation+3.56e-03

Pos logit contributions
keeper+0.081
ups+0.081
minded+0.076
 mysteries+0.075
ray+0.074
Neg logit contributions
 urge-0.071
hing-0.066
 grin-0.066
 laugh-0.065
 start-0.064
Token was
Token ID373
Feature activation+1.88e-02

Pos logit contributions
hing+0.008
oned+0.008
 blush+0.008
 grin+0.007
 urge+0.007
Neg logit contributions
keeper-0.006
lan-0.006
 mysteries-0.006
minded-0.006
ber-0.006
Token scared
Token ID12008
Feature activation+1.30e-02

Pos logit contributions
hing+0.042
oned+0.039
 blush+0.037
mine+0.036
 AW+0.036
Neg logit contributions
keeper-0.028
 who-0.025
ber-0.024
 mysteries-0.023
 cave-0.023
Token because
Token ID780
Feature activation-9.70e-03

Pos logit contributions
hing+0.024
oned+0.023
 blush+0.022
 urge+0.021
 realize+0.021
Neg logit contributions
keeper-0.024
 mysteries-0.022
ber-0.022
lan-0.022
minded-0.021
Token he
Token ID339
Feature activation-2.83e-02

Pos logit contributions
keeper+0.014
minded+0.014
lan+0.013
eddy+0.012
ber+0.012
Neg logit contributions
hing-0.021
oned-0.019
 blush-0.019
 hardly-0.019
 crash-0.018
Token thought
Token ID1807
Feature activation-1.32e-02

Pos logit contributions
keeper+0.054
lan+0.049
 mysteries+0.048
minded+0.048
ber+0.048
Neg logit contributions
hing-0.050
oned-0.047
 blush-0.045
 grin-0.042
 urge-0.041
Token he
Token ID339
Feature activation-5.21e-02

Pos logit contributions
keeper+0.017
minded+0.015
lan+0.015
ber+0.015
 mysteries+0.015
Neg logit contributions
hing-0.031
oned-0.030
 blush-0.029
 grin-0.028
 urge-0.027
Token broke
Token ID6265
Feature activation-2.65e-02

Pos logit contributions
keeper+0.091
lan+0.086
minded+0.083
ber+0.080
 mysteries+0.080
Neg logit contributions
hing-0.099
 blush-0.091
oned-0.090
 hardly-0.085
're-0.085
Token his
Token ID465
Feature activation-1.19e-02

Pos logit contributions
keeper+0.057
lan+0.054
ber+0.054
minded+0.052
elia+0.050
Neg logit contributions
hing-0.035
 start-0.034
 crash-0.034
 ease-0.033
 blush-0.031
Token toy
Token ID13373
Feature activation-2.49e-02

Pos logit contributions
keeper+0.015
 mysteries+0.013
ber+0.013
eddy+0.012
lan+0.012
Neg logit contributions
hing-0.029
oned-0.028
 blush-0.027
 grin-0.025
mine-0.025
Token car
Token ID1097
Feature activation-3.77e-02

Pos logit contributions
keeper+0.029
 mysteries+0.026
lan+0.025
minded+0.025
ber+0.024
Neg logit contributions
hing-0.062
oned-0.059
 blush-0.056
 grin-0.054
 urge-0.053
Token.
Token ID13
Feature activation+8.78e-03

Pos logit contributions
keeper+0.042
lan+0.038
 mysteries+0.037
minded+0.036
ber+0.036
Neg logit contributions
hing-0.095
oned-0.092
 blush-0.087
 grin-0.083
 urge-0.083
Token the
Token ID262
Feature activation+1.48e-02

Pos logit contributions
hing+0.034
oned+0.033
 blush+0.031
 grin+0.030
 urge+0.029
Neg logit contributions
keeper-0.022
lan-0.019
ber-0.019
minded-0.019
eddy-0.019
Token submarine
Token ID24927
Feature activation-5.82e-03

Pos logit contributions
hing+0.029
oned+0.027
 start+0.026
 urge+0.026
 blush+0.025
Neg logit contributions
keeper-0.025
ber-0.023
eddy-0.023
lan-0.022
minded-0.021
Token smiled
Token ID13541
Feature activation+6.10e-03

Pos logit contributions
keeper+0.011
lan+0.010
ber+0.010
 mysteries+0.010
minded+0.010
Neg logit contributions
hing-0.010
oned-0.010
 blush-0.009
 grin-0.008
 AW-0.008
Token and
Token ID290
Feature activation-3.48e-02

Pos logit contributions
hing+0.013
oned+0.013
 blush+0.012
 grin+0.012
 urge+0.012
Neg logit contributions
keeper-0.009
 mysteries-0.008
lan-0.008
ber-0.008
minded-0.008
Token waved
Token ID26834
Feature activation-9.31e-04

Pos logit contributions
keeper+0.067
eddy+0.061
 mysteries+0.060
lan+0.060
ber+0.060
Neg logit contributions
hing-0.061
oned-0.057
 blush-0.054
 grin-0.052
 urge-0.049
Token back
Token ID736
Feature activation-5.43e-02

Pos logit contributions
keeper+0.002
 mysteries+0.002
ber+0.002
eddy+0.002
lan+0.002
Neg logit contributions
hing-0.002
oned-0.002
 blush-0.001
 grin-0.001
 urge-0.001
Token.
Token ID13
Feature activation-1.76e-03

Pos logit contributions
keeper+0.084
eddy+0.077
minded+0.073
 Cream+0.073
 mysteries+0.072
Neg logit contributions
oned-0.106
hing-0.104
 blush-0.098
 start-0.095
 grin-0.094
Token
Token ID198
Feature activation-1.56e-03

Pos logit contributions
keeper+0.003
ind+0.003
 mysteries+0.003
ber+0.003
ups+0.003
Neg logit contributions
oned-0.003
hing-0.003
 ease-0.003
 grin-0.003
're-0.003
Token
Token ID198
Feature activation-1.15e-03

Pos logit contributions
keeper+0.002
minded+0.002
 mysteries+0.002
lan+0.002
ber+0.002
Neg logit contributions
hing-0.004
oned-0.004
 blush-0.004
 grin-0.003
 hardly-0.003
TokenSuddenly
Token ID38582
Feature activation-1.44e-03

Pos logit contributions
keeper+0.002
 mysteries+0.001
lan+0.001
ber+0.001
minded+0.001
Neg logit contributions
hing-0.003
oned-0.002
 blush-0.002
 grin-0.002
 urge-0.002
Token,
Token ID11
Feature activation+1.94e-02

Pos logit contributions
keeper+0.003
eddy+0.003
ber+0.003
minded+0.003
 mysteries+0.002
Neg logit contributions
hing-0.002
oned-0.002
 blush-0.002
 grin-0.002
 start-0.002