TOP ACTIVATIONS
MAX = 8.716

kept crying and throwing a tant rum . Finally
sad . She threw a tant rum and cried loudly .
to cry and threw a tant rum . She didn 't
. He kept throwing a tant rum . His mom was
listen . She threw a tant rum in the store .
and started to throw a tant rum . Suddenly , she
her feet and threw a tant rum . She grabbed the
She cried and threw a tant rum . Her mom tried
and so she threw a tant rum when her parents said
stubborn . She throws a tant rum . She kicks and
surprise . He threw a tant rum . He kicked and
his feet and threw a tant rum . "
them and never threw a tant rum again . <|endoftext|> Once
temper and often threw temper tant rums . One day ,
sad . He has a bru ise on his leg .
arm . She had a bru ise on her leg .
. He had a big bru ise on his head from
hurt . Ben had a bru ise on his leg and
on his head and a bru ise on his arm .
. You just have a bru ise . You can go

MIN ACTIVATIONS
MIN = -6.871

, mum and dad told Maria that they had an exciting
there was a girl called Min nie . She was very
of the items and told Min nie the cost . Min
Lily assured her that L izzy was very friendly and loved
day Mia 's mom introduced Mia to a diamond . It
dad made sure to encourage Maria to try all of the
today , so she showed Emily her new purple bag .
was getting frustrated and told Julie that it was time to
home a big hoop for Maria . She was so excited
a baby squirrel called T izzy . She was very small
of something y ummy told Bella it was ready . She
hospital and the doctor said Tina had eaten a bad avocado
opened the bag and showed Emily the big box inside .
day , the wagon told Mia , " Thank you for
They were so proud of Maria . She had so much
M um showed Maria how to count as she
friendly , and he taught Ted lots of new things .
somewhere . She told L izzy to stay put and not
age . The vet said Luna would heal soon , but
't you invite your friend Emily over to play with you

INTERVAL 4.819 to 8.716
CONTAINS 0.006%

she walked she heard a sound . It was a little
far . She started to regret her decision to go so
He then heard a voice coming from the dust cloud
. Max had a big heart and he was very kind
street , she heard a sound . It was not a

INTERVAL 0.922 to 4.819
CONTAINS 19.322%

very helpful . One night , the stars shone bright in
inside . The mom was sad because the home was still
back down and continued to gaze at the night sky ,
named Lily . She loved to visit her grandma 's house
. They curled up on the rug and hugged their t

INTERVAL -2.975 to 0.922
CONTAINS 79.854%

the pile of leaves even bigger . Mia was happy to
had made a mess and he should have cleaned it up
The next day , the little girl came back
he loved to give it c udd les .
fell asleep and dream t of the bird she saw earlier

INTERVAL -6.871 to -2.975
CONTAINS 0.818%

clean block of ice and decided he wanted to melt it
cliff that held the start of it all . <|endoftext|> Once
punished . She said that Lily had to go to the
Lily 's mom my said they were going to a festival
spotted a rock . He decided he wanted to keep it
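
The four INTERVAL headers above look like equal-width bins spanning MIN to MAX. The dump does not say how the edges were chosen, so equal-width binning is an assumption inferred from the printed numbers. A minimal sketch under that assumption (the names `bin_edges` and `N_BINS` are hypothetical, not taken from the dashboard) recomputes the edges and checks them against the reported ones:

```python
# Assumption: the INTERVAL edges are four equal-width bins spanning MIN..MAX.
# All names below are illustrative; none come from the dashboard itself.
MIN_ACT = -6.871  # "MIN = -6.871" as printed in the dump
MAX_ACT = 8.716   # "MAX = 8.716" as printed in the dump
N_BINS = 4        # the dump shows four intervals


def bin_edges(lo, hi, n):
    """Return n + 1 evenly spaced edges from lo to hi."""
    width = (hi - lo) / n
    return [lo + i * width for i in range(n + 1)]


edges = bin_edges(MIN_ACT, MAX_ACT, N_BINS)

# Edges as printed in the INTERVAL headers, lowest to highest.
reported_edges = [-6.871, -2.975, 0.922, 4.819, 8.716]
# "CONTAINS" percentages, lowest interval first.
reported_share = [0.818, 79.854, 19.322, 0.006]

# Largest gap between a recomputed edge and the printed edge.
max_edge_error = max(abs(a - b) for a, b in zip(edges, reported_edges))
# The four shares should account for all tokens.
total_share = sum(reported_share)

print([round(e, 3) for e in edges])
print(f"max edge error: {max_edge_error:.4f}, total share: {total_share:.3f}%")
```

Under this assumption the recomputed edges match the printed ones to within about 0.001, which is consistent with MIN and MAX having been rounded to three decimals, and the four CONTAINS percentages sum to 100%.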
Token kept
Token ID4030
Feature activation+4.93e-02

Pos logit contributions
 pic+0.016
 from+0.016
sc+0.015
 so+0.015
 off+0.015
Neg logit contributions
joy-0.017
 nods-0.017
 nodded-0.016
 thrill-0.016
 relief-0.016
Token crying
Token ID13774
Feature activation+4.15e+00

Pos logit contributions
joy+0.002
 relief+0.001
 thrill+0.001
 disl+0.001
 sorry+0.001
Neg logit contributions
's-0.001
-0.001
sc-0.001
ches-0.001
 called-0.001
Token and
Token ID290
Feature activation-5.59e-01

Pos logit contributions
joy+0.212
 thrill+0.202
 disl+0.197
 relief+0.195
 nods+0.193
Neg logit contributions
's-0.019
 so-0.013
-0.013
 called-0.013
 and-0.011
Token throwing
Token ID9644
Feature activation+5.79e-01

Pos logit contributions
's+0.012
 so+0.012
 called+0.012
 and+0.011
+0.010
Neg logit contributions
joy-0.020
 thrill-0.019
 disl-0.019
 Majesty-0.018
 nods-0.018
Token a
Token ID257
Feature activation-1.16e-01

Pos logit contributions
joy+0.023
 thrill+0.022
 relief+0.022
 disl+0.021
 nods+0.021
Neg logit contributions
's-0.008
 called-0.007
-0.007
 so-0.007
 and-0.006
Token tant
Token ID24246
Feature activation+8.72e+00

Pos logit contributions
's+0.003
 called+0.003
 so+0.003
+0.003
 pic+0.003
Neg logit contributions
joy-0.003
 thrill-0.003
 relief-0.003
 disl-0.003
 nods-0.003
Tokenrum
Token ID6582
Feature activation+1.65e+00

Pos logit contributions
joy+0.176
 thrill+0.154
 disl+0.147
 relief+0.139
 nods+0.137
Neg logit contributions
's-0.310
 so-0.299
 and-0.296
 called-0.296
-0.295
Token.
Token ID13
Feature activation+3.47e-01

Pos logit contributions
joy+0.078
 thrill+0.074
 disl+0.073
 relief+0.071
 nods+0.071
Neg logit contributions
's-0.014
 so-0.012
 and-0.012
 called-0.011
-0.011
Token
Token ID198
Feature activation-2.50e-01

Pos logit contributions
joy+0.013
 thrill+0.012
 disl+0.012
 relief+0.011
 nods+0.011
Neg logit contributions
's-0.007
 so-0.006
 and-0.006
 called-0.006
-0.006
Token
Token ID198
Feature activation-6.71e-02

Pos logit contributions
's+0.006
 so+0.005
 pic+0.005
 unw+0.005
 separate+0.005
Neg logit contributions
joy-0.007
 disl-0.006
 thrill-0.006
 relief-0.006
 nods-0.006
TokenFinally
Token ID11158
Feature activation-8.26e-01

Pos logit contributions
 pic+0.002
 so+0.002
 from+0.002
's+0.001
 into+0.001
Neg logit contributions
joy-0.002
 disl-0.002
Sorry-0.002
 thrill-0.002
 relief-0.002
Token sad
Token ID6507
Feature activation+3.61e+00

Pos logit contributions
's+0.014
 so+0.014
 called+0.014
 and+0.013
+0.013
Neg logit contributions
joy-0.020
 thrill-0.019
 disl-0.018
 relief-0.017
 Majesty-0.017
Token.
Token ID13
Feature activation+1.88e-01

Pos logit contributions
joy+0.176
 thrill+0.168
 disl+0.165
 Majesty+0.161
 relief+0.160
Neg logit contributions
's-0.028
 and-0.026
 so-0.026
 called-0.022
-0.020
Token She
Token ID1375
Feature activation-2.30e-01

Pos logit contributions
joy+0.007
 thrill+0.006
 disl+0.006
 nods+0.006
 relief+0.006
Neg logit contributions
's-0.004
 so-0.004
 called-0.004
 and-0.004
-0.004
Token threw
Token ID9617
Feature activation+9.15e-01

Pos logit contributions
 pic+0.005
's+0.005
 from+0.005
+0.005
 into+0.005
Neg logit contributions
joy-0.006
 nods-0.006
 disl-0.006
 relief-0.006
 thrill-0.006
Token a
Token ID257
Feature activation+2.58e-01

Pos logit contributions
joy+0.039
 thrill+0.037
 relief+0.036
 disl+0.035
 nods+0.035
Neg logit contributions
's-0.011
 called-0.010
-0.010
 so-0.009
 pic-0.009
Token tant
Token ID24246
Feature activation+8.56e+00

Pos logit contributions
joy+0.008
 thrill+0.007
 disl+0.007
 relief+0.007
 nods+0.007
Neg logit contributions
's-0.007
 so-0.006
 called-0.006
-0.006
 and-0.006
Tokenrum
Token ID6582
Feature activation+2.34e+00

Pos logit contributions
joy+0.184
 thrill+0.163
 disl+0.151
 relief+0.146
 nods+0.144
Neg logit contributions
's-0.296
 so-0.286
 and-0.285
 called-0.284
-0.279
Token and
Token ID290
Feature activation+5.37e-01

Pos logit contributions
joy+0.123
 thrill+0.120
 disl+0.119
 relief+0.115
 Majesty+0.114
Neg logit contributions
's-0.009
 and-0.008
 so-0.007
 called-0.004
-0.003
Token cried
Token ID16896
Feature activation+2.38e+00

Pos logit contributions
joy+0.019
 thrill+0.017
 disl+0.017
 relief+0.016
 nods+0.016
Neg logit contributions
's-0.011
-0.010
 so-0.010
 and-0.010
 called-0.010
Token loudly
Token ID23112
Feature activation+9.50e-01

Pos logit contributions
joy+0.108
 thrill+0.102
 disl+0.099
 relief+0.098
 nods+0.097
Neg logit contributions
's-0.025
 so-0.021
-0.021
 called-0.021
 and-0.020
Token.
Token ID13
Feature activation+1.60e-01

Pos logit contributions
joy+0.045
 thrill+0.043
 disl+0.042
 relief+0.041
 nods+0.041
Neg logit contributions
's-0.008
 so-0.007
 and-0.007
 called-0.006
-0.006
Token to
Token ID284
Feature activation+7.02e-01

Pos logit contributions
sc+0.005
+0.005
ches+0.005
's+0.005
 called+0.005
Neg logit contributions
joy-0.021
 disl-0.018
 nods-0.017
 thrill-0.017
 grin-0.017
Token cry
Token ID3960
Feature activation+3.13e+00

Pos logit contributions
joy+0.020
 thrill+0.018
 disl+0.017
 relief+0.016
 nods+0.016
Neg logit contributions
's-0.020
 so-0.019
 and-0.019
 called-0.019
-0.019
Token and
Token ID290
Feature activation-2.46e-01

Pos logit contributions
joy+0.155
 thrill+0.147
 disl+0.143
 relief+0.142
 nods+0.141
Neg logit contributions
's-0.018
-0.014
 called-0.013
 so-0.013
 and-0.011
Token threw
Token ID9617
Feature activation+6.91e-01

Pos logit contributions
's+0.004
 so+0.004
 called+0.004
 and+0.004
+0.004
Neg logit contributions
joy-0.010
 thrill-0.009
 disl-0.009
 relief-0.009
 nods-0.008
Token a
Token ID257
Feature activation+8.87e-02

Pos logit contributions
joy+0.030
 thrill+0.028
 disl+0.027
 relief+0.027
 nods+0.026
Neg logit contributions
's-0.009
 called-0.008
 so-0.008
-0.008
 and-0.007
Token tant
Token ID24246
Feature activation+8.47e+00

Pos logit contributions
joy+0.003
 thrill+0.003
 disl+0.003
 relief+0.003
 nods+0.003
Neg logit contributions
's-0.002
 so-0.002
 called-0.002
-0.002
 and-0.002
Tokenrum
Token ID6582
Feature activation+1.94e+00

Pos logit contributions
joy+0.176
 thrill+0.154
 disl+0.146
 relief+0.139
 nods+0.137
Neg logit contributions
's-0.298
 so-0.287
 and-0.285
 called-0.284
-0.283
Token.
Token ID13
Feature activation+5.61e-03

Pos logit contributions
joy+0.093
 thrill+0.088
 disl+0.087
 relief+0.085
 nods+0.084
Neg logit contributions
's-0.016
 so-0.014
 and-0.013
 called-0.013
-0.012
Token She
Token ID1375
Feature activation+8.50e-02

Pos logit contributions
joy+0.000
 thrill+0.000
 disl+0.000
 relief+0.000
 nods+0.000
Neg logit contributions
's-0.000
 so-0.000
 called-0.000
 and-0.000
-0.000
Token didn
Token ID1422
Feature activation+5.85e-01

Pos logit contributions
joy+0.002
 nods+0.002
 thrill+0.002
 disl+0.002
 sincere+0.002
Neg logit contributions
 pic-0.002
 from-0.002
-0.002
 so-0.002
sc-0.002
Token't
Token ID470
Feature activation+2.20e-01

Pos logit contributions
 sorry+0.007
 Okay+0.006
 thrill+0.006
severe+0.005
 disl+0.005
Neg logit contributions
 pic-0.022
 called-0.021
mary-0.020
 way-0.019
 lin-0.019
Token.
Token ID13
Feature activation-5.92e-01

Pos logit contributions
's+0.020
 pic+0.020
 called+0.020
+0.019
 lin+0.018
Neg logit contributions
joy-0.072
 disl-0.064
 thrill-0.063
 nods-0.062
 relief-0.061
Token He
Token ID679
Feature activation+7.14e-01

Pos logit contributions
 and+0.014
 so+0.014
's+0.014
 called+0.013
+0.013
Neg logit contributions
joy-0.019
 thrill-0.018
 disl-0.017
 Majesty-0.017
 tweet-0.017
Token kept
Token ID4030
Feature activation+8.54e-02

Pos logit contributions
joy+0.023
 thrill+0.022
 nods+0.021
 disl+0.020
 relief+0.020
Neg logit contributions
's-0.016
 so-0.015
-0.015
 off-0.015
 closely-0.014
Token throwing
Token ID9644
Feature activation+6.59e-01

Pos logit contributions
joy+0.003
 thrill+0.003
 disl+0.002
 relief+0.002
 nods+0.002
Neg logit contributions
's-0.002
-0.002
 called-0.002
 so-0.002
ches-0.002
Token a
Token ID257
Feature activation+5.13e-01

Pos logit contributions
joy+0.027
 thrill+0.025
 disl+0.024
 relief+0.024
 nods+0.024
Neg logit contributions
's-0.010
 so-0.009
 called-0.009
-0.009
 and-0.009
Token tant
Token ID24246
Feature activation+8.43e+00

Pos logit contributions
joy+0.019
 disl+0.018
 thrill+0.018
 Majesty+0.017
 nods+0.017
Neg logit contributions
's-0.010
 and-0.009
 so-0.009
 called-0.009
-0.009
Tokenrum
Token ID6582
Feature activation+2.00e+00

Pos logit contributions
joy+0.190
 thrill+0.168
 disl+0.152
 relief+0.151
 nods+0.148
Neg logit contributions
's-0.286
 and-0.276
 so-0.274
 called-0.273
-0.268
Token.
Token ID13
Feature activation+2.31e-01

Pos logit contributions
joy+0.095
 thrill+0.090
 disl+0.088
 relief+0.086
 nods+0.086
Neg logit contributions
's-0.017
 so-0.014
 called-0.013
 and-0.013
-0.013
Token His
Token ID2399
Feature activation+1.39e+00

Pos logit contributions
joy+0.007
 thrill+0.007
 disl+0.007
 tweet+0.007
 Majesty+0.007
Neg logit contributions
 and-0.005
 so-0.005
's-0.005
 called-0.005
 to-0.005
Token mom
Token ID1995
Feature activation+1.12e+00

Pos logit contributions
joy+0.061
 disl+0.059
viously+0.058
 thrill+0.057
reetings+0.056
Neg logit contributions
 and-0.018
's-0.017
 so-0.015
 to-0.015
 shelf-0.015
Token was
Token ID373
Feature activation-1.51e-01

Pos logit contributions
joy+0.037
 thrill+0.035
 nods+0.034
 relief+0.033
 disl+0.032
Neg logit contributions
 pic-0.021
 so-0.021
's-0.021
 off-0.021
-0.021
Token listen
Token ID6004
Feature activation-1.64e+00

Pos logit contributions
joy+0.027
 Majesty+0.023
ses+0.023
Really+0.021
 thrill+0.021
Neg logit contributions
 and-0.019
's-0.019
 called-0.018
 so-0.018
-0.018
Token.
Token ID13
Feature activation-8.00e-01

Pos logit contributions
's+0.013
 pic+0.012
+0.012
 called+0.012
 unw+0.010
Neg logit contributions
joy-0.076
 disl-0.069
 thrill-0.068
 nods-0.068
 relief-0.066
Token She
Token ID1375
Feature activation+1.89e-01

Pos logit contributions
's+0.017
 and+0.017
 so+0.016
+0.016
 called+0.016
Neg logit contributions
joy-0.028
 thrill-0.027
 disl-0.025
 relief-0.025
 nods-0.024
Token threw
Token ID9617
Feature activation+2.38e-01

Pos logit contributions
joy+0.006
 nods+0.005
 thrill+0.005
 disl+0.005
 relief+0.005
Neg logit contributions
's-0.004
 so-0.004
 off-0.004
-0.004
 into-0.004
Token a
Token ID257
Feature activation+1.48e-01

Pos logit contributions
joy+0.010
 thrill+0.009
 relief+0.009
 disl+0.009
 nods+0.009
Neg logit contributions
's-0.003
 called-0.003
-0.003
 so-0.003
 and-0.003
Token tant
Token ID24246
Feature activation+8.32e+00

Pos logit contributions
joy+0.004
 thrill+0.004
 disl+0.004
 relief+0.004
 nods+0.004
Neg logit contributions
's-0.004
 so-0.004
 called-0.004
-0.004
 and-0.004
Tokenrum
Token ID6582
Feature activation+2.10e+00

Pos logit contributions
joy+0.196
 thrill+0.176
 relief+0.159
 disl+0.157
 Majesty+0.155
Neg logit contributions
's-0.272
 and-0.265
 called-0.262
 so-0.262
 from-0.255
Token in
Token ID287
Feature activation+6.23e-01

Pos logit contributions
joy+0.109
 thrill+0.105
 disl+0.104
 relief+0.101
 Majesty+0.100
Neg logit contributions
's-0.010
 and-0.009
 so-0.008
 called-0.006
-0.005
Token the
Token ID262
Feature activation+2.74e+00

Pos logit contributions
joy+0.024
 thrill+0.022
 relief+0.021
 disl+0.020
 sorry+0.020
Neg logit contributions
's-0.010
ches-0.010
 closely-0.009
mary-0.009
 pic-0.009
Token store
Token ID3650
Feature activation-9.40e-01

Pos logit contributions
joy+0.106
 thrill+0.098
 disl+0.097
 Majesty+0.093
 nods+0.093
Neg logit contributions
's-0.049
 so-0.045
 and-0.044
 called-0.044
-0.043
Token.
Token ID13
Feature activation-3.29e-02

Pos logit contributions
+0.009
's+0.008
 pic+0.007
ches+0.007
At+0.006
Neg logit contributions
joy-0.043
 thrill-0.039
 nods-0.038
 winners-0.037
 disl-0.037
Token and
Token ID290
Feature activation-6.51e-01

Pos logit contributions
joy+0.155
 thrill+0.142
 nods+0.139
 disl+0.138
 relief+0.138
Neg logit contributions
-0.016
's-0.016
 pic-0.015
 unw-0.014
 called-0.013
Token started
Token ID2067
Feature activation-9.52e-01

Pos logit contributions
's+0.013
+0.013
 pic+0.013
 so+0.013
 and+0.013
Neg logit contributions
joy-0.021
 disl-0.019
 thrill-0.019
 nods-0.019
 relief-0.019
Token to
Token ID284
Feature activation+6.00e-01

Pos logit contributions
+0.011
sc+0.010
's+0.010
ches+0.010
 pic+0.009
Neg logit contributions
joy-0.042
 disl-0.037
 sorry-0.035
 nods-0.035
 sincere-0.034
Token throw
Token ID3714
Feature activation+6.84e-01

Pos logit contributions
joy+0.012
 thrill+0.011
 disl+0.011
 relief+0.010
 nods+0.010
Neg logit contributions
's-0.020
 so-0.020
 called-0.019
-0.019
 and-0.019
Token a
Token ID257
Feature activation-3.33e-02

Pos logit contributions
joy+0.026
 thrill+0.024
 relief+0.024
 nods+0.023
viously+0.023
Neg logit contributions
's-0.010
 called-0.010
-0.010
At-0.009
 pic-0.009
Token tant
Token ID24246
Feature activation+8.23e+00

Pos logit contributions
's+0.001
 called+0.001
+0.001
 so+0.001
 and+0.001
Neg logit contributions
joy-0.001
 thrill-0.001
 relief-0.001
 disl-0.001
 nods-0.001
Tokenrum
Token ID6582
Feature activation+1.86e+00

Pos logit contributions
joy+0.160
 thrill+0.140
 disl+0.136
 relief+0.126
 nods+0.124
Neg logit contributions
's-0.297
 so-0.288
-0.284
 called-0.283
 and-0.283
Token.
Token ID13
Feature activation+9.50e-02

Pos logit contributions
joy+0.085
 thrill+0.079
 disl+0.076
 nods+0.076
 relief+0.076
Neg logit contributions
's-0.018
-0.015
 called-0.015
 so-0.015
 pic-0.014
Token Suddenly
Token ID24975
Feature activation+6.58e-01

Pos logit contributions
joy+0.003
 thrill+0.003
 disl+0.003
 relief+0.003
 nods+0.003
Neg logit contributions
's-0.002
 so-0.002
 called-0.002
 and-0.002
-0.002
Token,
Token ID11
Feature activation+6.82e-01

Pos logit contributions
joy+0.032
 disl+0.031
 thrill+0.031
 relief+0.030
 Majesty+0.030
Neg logit contributions
's-0.005
 and-0.005
 so-0.004
 called-0.004
-0.004
Token she
Token ID673
Feature activation-3.71e-01

Pos logit contributions
joy+0.032
 disl+0.032
 Majesty+0.031
 nods+0.030
 thrill+0.030
Neg logit contributions
 so-0.008
 and-0.007
 called-0.007
's-0.007
 from-0.006
Token her
Token ID607
Feature activation+1.51e+00

Pos logit contributions
joy+0.020
 thrill+0.018
 disl+0.018
 relief+0.018
 nods+0.018
Neg logit contributions
's-0.007
-0.006
 called-0.006
 so-0.005
 pic-0.005
Token feet
Token ID3625
Feature activation+5.37e-01

Pos logit contributions
joy+0.049
 thrill+0.045
 nods+0.041
 relief+0.041
ses+0.041
Neg logit contributions
's-0.035
-0.035
 called-0.035
 so-0.034
ches-0.032
Token and
Token ID290
Feature activation+7.54e-01

Pos logit contributions
joy+0.023
 thrill+0.023
 grin+0.021
 sorry+0.021
 vibration+0.021
Neg logit contributions
 pic-0.006
 lin-0.004
's-0.004
-0.003
 called-0.003
Token threw
Token ID9617
Feature activation+3.30e-01

Pos logit contributions
joy+0.027
 thrill+0.025
 disl+0.025
 relief+0.024
 nods+0.024
Neg logit contributions
's-0.015
 so-0.014
 and-0.014
 called-0.014
-0.014
Token a
Token ID257
Feature activation+4.97e-02

Pos logit contributions
joy+0.013
 thrill+0.013
 relief+0.012
 disl+0.012
 nods+0.012
Neg logit contributions
's-0.005
 called-0.004
-0.004
 so-0.004
 and-0.004
Token tant
Token ID24246
Feature activation+8.22e+00

Pos logit contributions
joy+0.001
 thrill+0.001
 disl+0.001
 relief+0.001
 nods+0.001
Neg logit contributions
's-0.001
 so-0.001
 and-0.001
 called-0.001
-0.001
Tokenrum
Token ID6582
Feature activation+1.90e+00

Pos logit contributions
joy+0.163
 thrill+0.142
 disl+0.135
 relief+0.127
 nods+0.126
Neg logit contributions
's-0.296
 so-0.286
 and-0.283
 called-0.283
-0.282
Token.
Token ID13
Feature activation+3.47e-01

Pos logit contributions
joy+0.090
 thrill+0.085
 disl+0.084
 relief+0.082
 nods+0.081
Neg logit contributions
's-0.016
 so-0.014
 and-0.013
 called-0.013
-0.013
Token She
Token ID1375
Feature activation+4.46e-01

Pos logit contributions
joy+0.011
 thrill+0.010
 disl+0.010
 relief+0.010
 nods+0.010
Neg logit contributions
's-0.008
 so-0.008
 and-0.008
 called-0.008
-0.008
Token grabbed
Token ID13176
Feature activation+1.64e+00

Pos logit contributions
joy+0.012
 thrill+0.012
 nods+0.012
 disl+0.011
 relief+0.011
Neg logit contributions
-0.010
's-0.010
 from-0.010
 pic-0.010
 so-0.010
Token the
Token ID262
Feature activation+5.45e-01

Pos logit contributions
joy+0.064
 thrill+0.060
 relief+0.058
 disl+0.056
viously+0.056
Neg logit contributions
's-0.026
-0.023
 called-0.022
 pic-0.022
 so-0.021
Token She
Token ID1375
Feature activation-1.93e-01

Pos logit contributions
's+0.003
 so+0.003
 called+0.002
 and+0.002
+0.002
Neg logit contributions
joy-0.004
 thrill-0.003
 disl-0.003
 relief-0.003
 nods-0.003
Token cried
Token ID16896
Feature activation+1.92e+00

Pos logit contributions
sc+0.004
 pic+0.004
 into+0.004
 off+0.004
 from+0.004
Neg logit contributions
 nods-0.005
 nodded-0.005
joy-0.005
 thrill-0.004
 relief-0.004
Token and
Token ID290
Feature activation+1.12e-01

Pos logit contributions
joy+0.099
 thrill+0.094
 disl+0.092
 relief+0.091
 nods+0.091
Neg logit contributions
's-0.008
-0.005
 so-0.005
 called-0.005
 and-0.004
Token threw
Token ID9617
Feature activation+2.57e-01

Pos logit contributions
joy+0.004
 thrill+0.004
 disl+0.004
 relief+0.004
 nods+0.003
Neg logit contributions
's-0.002
 so-0.002
 and-0.002
 called-0.002
-0.002
Token a
Token ID257
Feature activation-2.06e-02

Pos logit contributions
joy+0.011
 thrill+0.010
 disl+0.010
 relief+0.010
 nods+0.010
Neg logit contributions
's-0.004
 called-0.003
-0.003
 so-0.003
 and-0.003
Token tant
Token ID24246
Feature activation+8.14e+00

Pos logit contributions
's+0.001
 so+0.000
 and+0.000
 called+0.000
+0.000
Neg logit contributions
joy-0.001
 thrill-0.001
 disl-0.001
 nods-0.001
 relief-0.001
Tokenrum
Token ID6582
Feature activation+1.89e+00

Pos logit contributions
joy+0.177
 thrill+0.157
 disl+0.146
 relief+0.142
 nods+0.140
Neg logit contributions
's-0.279
 so-0.269
 and-0.268
 called-0.267
-0.262
Token.
Token ID13
Feature activation+2.67e-01

Pos logit contributions
joy+0.094
 thrill+0.090
 disl+0.089
 relief+0.086
 nods+0.085
Neg logit contributions
's-0.014
 and-0.012
 so-0.012
 called-0.010
-0.009
Token Her
Token ID2332
Feature activation+7.26e-01

Pos logit contributions
joy+0.010
 thrill+0.009
 disl+0.009
 relief+0.008
 nods+0.008
Neg logit contributions
's-0.006
 so-0.005
 and-0.005
 called-0.005
-0.005
Token mom
Token ID1995
Feature activation+9.44e-01

Pos logit contributions
joy+0.031
 thrill+0.030
viously+0.030
 disl+0.029
reetings+0.028
Neg logit contributions
's-0.009
 and-0.009
 so-0.009
 called-0.008
 to-0.008
Token tried
Token ID3088
Feature activation-1.38e+00

Pos logit contributions
joy+0.030
 thrill+0.028
 nods+0.028
 relief+0.027
 sincere+0.026
Neg logit contributions
 pic-0.019
 closely-0.018
 shelf-0.017
-0.017
 so-0.017
Token and
Token ID290
Feature activation-1.97e-01

Pos logit contributions
joy+0.055
 thrill+0.049
 nods+0.048
 relief+0.046
 winners+0.046
Neg logit contributions
-0.010
 lin-0.010
's-0.009
 unw-0.009
At-0.009
Token so
Token ID523
Feature activation+8.30e-01

Pos logit contributions
's+0.005
 so+0.004
 and+0.004
+0.004
 called+0.004
Neg logit contributions
joy-0.006
 thrill-0.006
 disl-0.006
 relief-0.006
 nods-0.006
Token she
Token ID673
Feature activation-1.06e+00

Pos logit contributions
joy+0.034
 thrill+0.032
 disl+0.031
viously+0.030
ses+0.030
Neg logit contributions
's-0.015
 and-0.014
 so-0.013
 called-0.013
-0.013
Token threw
Token ID9617
Feature activation+9.29e-01

Pos logit contributions
 from+0.022
 closely+0.022
 pic+0.022
+0.021
 so+0.021
Neg logit contributions
 nods-0.029
 thrill-0.028
joy-0.028
 relief-0.027
 nodded-0.027
Token a
Token ID257
Feature activation+1.57e-01

Pos logit contributions
joy+0.038
 thrill+0.036
 relief+0.035
 disl+0.034
 nods+0.034
Neg logit contributions
's-0.013
 called-0.012
-0.012
 so-0.011
 and-0.011
Token tant
Token ID24246
Feature activation+8.14e+00

Pos logit contributions
joy+0.006
 thrill+0.005
 disl+0.005
 relief+0.005
 nods+0.005
Neg logit contributions
's-0.003
 so-0.003
 called-0.003
-0.003
 and-0.003
Tokenrum
Token ID6582
Feature activation+2.25e+00

Pos logit contributions
joy+0.168
 thrill+0.147
 disl+0.140
 relief+0.132
 nods+0.131
Neg logit contributions
's-0.287
 so-0.278
 and-0.275
 called-0.275
-0.272
Token when
Token ID618
Feature activation-9.46e-01

Pos logit contributions
joy+0.107
 thrill+0.102
 disl+0.100
 relief+0.098
 nods+0.097
Neg logit contributions
's-0.019
 so-0.016
 and-0.016
 called-0.015
-0.015
Token her
Token ID607
Feature activation+3.32e-01

Pos logit contributions
's+0.014
 so+0.013
 and+0.013
 called+0.013
+0.012
Neg logit contributions
 disl-0.039
joy-0.039
 thrill-0.038
 Majesty-0.037
 nods-0.036
Token parents
Token ID3397
Feature activation+2.19e+00

Pos logit contributions
joy+0.012
 thrill+0.011
 disl+0.011
 relief+0.011
 nods+0.011
Neg logit contributions
's-0.006
 so-0.006
 and-0.006
 called-0.006
-0.006
Token said
Token ID531
Feature activation-1.70e+00

Pos logit contributions
joy+0.063
 thrill+0.058
 nods+0.055
 relief+0.053
 ah+0.053
Neg logit contributions
 closely-0.047
 pic-0.045
 so-0.044
 shelf-0.044
 off-0.044
Token stubborn
Token ID24444
Feature activation+6.39e-01

Pos logit contributions
joy+0.028
 thrill+0.026
 disl+0.026
 Majesty+0.024
 relief+0.024
Neg logit contributions
's-0.020
 so-0.020
 called-0.020
 and-0.019
-0.018
Token.
Token ID13
Feature activation+4.12e-02

Pos logit contributions
joy+0.030
 thrill+0.027
 nods+0.027
 disl+0.026
 relief+0.026
Neg logit contributions
's-0.006
 called-0.005
-0.005
 so-0.005
 unw-0.004
Token She
Token ID1375
Feature activation+3.97e-01

Pos logit contributions
joy+0.001
 thrill+0.001
 disl+0.001
 relief+0.001
 Majesty+0.001
Neg logit contributions
's-0.001
 and-0.001
 so-0.001
-0.001
 called-0.001
Token throws
Token ID12542
Feature activation+5.44e-01

Pos logit contributions
joy+0.012
 thrill+0.011
 disl+0.011
 relief+0.010
 Majesty+0.010
Neg logit contributions
's-0.011
 and-0.010
 so-0.010
-0.010
 called-0.010
Token a
Token ID257
Feature activation+2.40e-01

Pos logit contributions
joy+0.022
 thrill+0.021
 relief+0.020
 nods+0.020
 disl+0.020
Neg logit contributions
's-0.007
 called-0.007
-0.007
 so-0.006
 and-0.006
Token tant
Token ID24246
Feature activation+7.94e+00

Pos logit contributions
joy+0.007
 thrill+0.006
 disl+0.006
 nods+0.006
 relief+0.006
Neg logit contributions
's-0.006
 so-0.006
 called-0.006
-0.006
 and-0.006
Tokenrum
Token ID6582
Feature activation+1.89e+00

Pos logit contributions
joy+0.186
 thrill+0.166
 relief+0.151
 disl+0.147
 nods+0.145
Neg logit contributions
's-0.263
 and-0.254
 so-0.251
 called-0.250
 from-0.244
Token.
Token ID13
Feature activation+3.11e-01

Pos logit contributions
joy+0.093
 thrill+0.089
 disl+0.088
 relief+0.086
 Majesty+0.084
Neg logit contributions
's-0.014
 and-0.012
 so-0.012
 called-0.010
-0.010
Token She
Token ID1375
Feature activation+6.94e-01

Pos logit contributions
joy+0.011
 thrill+0.011
 disl+0.010
 relief+0.010
 Majesty+0.010
Neg logit contributions
's-0.006
 and-0.006
 so-0.006
-0.006
 called-0.006
Token kicks
Token ID17364
Feature activation+6.94e-01

Pos logit contributions
joy+0.019
 thrill+0.018
 disl+0.017
 nods+0.017
 relief+0.017
Neg logit contributions
's-0.019
 so-0.018
 called-0.018
-0.017
 and-0.017
Token and
Token ID290
Feature activation+1.40e+00

Pos logit contributions
joy+0.029
 thrill+0.027
 nods+0.026
 relief+0.026
 disl+0.025
Neg logit contributions
's-0.009
 called-0.008
 pic-0.008
-0.008
 unw-0.007
Token surprise
Token ID5975
Feature activation-8.57e-01

Pos logit contributions
 and+0.004
's+0.004
 so+0.004
 called+0.003
 from+0.003
Neg logit contributions
joy-0.006
 disl-0.006
 thrill-0.006
 nods-0.006
osity-0.006
Token.
Token ID13
Feature activation-1.04e-01

Pos logit contributions
+0.013
ches+0.013
At+0.013
mary+0.012
On+0.012
Neg logit contributions
joy-0.031
 thrill-0.029
 relief-0.028
 nods-0.028
 winners-0.027
Token He
Token ID679
Feature activation+9.86e-01

Pos logit contributions
 and+0.002
's+0.002
 so+0.002
+0.002
 called+0.002
Neg logit contributions
joy-0.003
 thrill-0.003
 disl-0.003
 Majesty-0.003
 tweet-0.003
Token threw
Token ID9617
Feature activation+2.22e-01

Pos logit contributions
joy+0.027
 nods+0.026
 thrill+0.025
 relief+0.024
ses+0.024
Neg logit contributions
's-0.024
 off-0.024
-0.024
 so-0.023
 from-0.023
Token a
Token ID257
Feature activation+6.10e-03

Pos logit contributions
joy+0.009
 thrill+0.009
 relief+0.008
 disl+0.008
 nods+0.008
Neg logit contributions
's-0.003
 called-0.003
-0.003
 so-0.003
 and-0.003
Token tant
Token ID24246
Feature activation+7.87e+00

Pos logit contributions
joy+0.000
 thrill+0.000
 disl+0.000
 relief+0.000
 nods+0.000
Neg logit contributions
's-0.000
 so-0.000
 and-0.000
 called-0.000
-0.000
Tokenrum
Token ID6582
Feature activation+2.01e+00

Pos logit contributions
joy+0.180
 thrill+0.160
 relief+0.145
 disl+0.144
 nods+0.141
Neg logit contributions
's-0.264
 and-0.256
 so-0.253
 called-0.252
-0.246
Token.
Token ID13
Feature activation+2.04e-01

Pos logit contributions
joy+0.098
 thrill+0.093
 disl+0.092
 relief+0.090
 nods+0.089
Neg logit contributions
's-0.015
 and-0.013
 so-0.013
 called-0.012
-0.011
Token He
Token ID679
Feature activation+1.20e+00

Pos logit contributions
joy+0.007
 thrill+0.006
 disl+0.006
 Majesty+0.006
 relief+0.006
Neg logit contributions
 and-0.005
's-0.005
 so-0.005
 called-0.004
-0.004
Token kicked
Token ID12165
Feature activation+7.79e-01

Pos logit contributions
joy+0.032
 thrill+0.031
 nods+0.030
ses+0.030
 liar+0.029
Neg logit contributions
-0.028
's-0.027
 closely-0.027
 so-0.027
 off-0.027
Token and
Token ID290
Feature activation+1.22e+00

Pos logit contributions
joy+0.032
 thrill+0.030
 relief+0.029
 nods+0.028
 disl+0.028
Neg logit contributions
's-0.010
 called-0.010
 pic-0.010
-0.010
mary-0.009
Token his
Token ID465
Feature activation+2.09e+00

Pos logit contributions
joy+0.129
 thrill+0.119
 relief+0.114
 nods+0.113
 disl+0.112
Neg logit contributions
's-0.063
-0.060
 so-0.056
 called-0.056
 pic-0.054
Token feet
Token ID3625
Feature activation+4.20e-01

Pos logit contributions
joy+0.063
 thrill+0.059
 nods+0.054
ses+0.054
 relief+0.054
Neg logit contributions
's-0.052
-0.051
 called-0.051
 so-0.050
 and-0.047
Token and
Token ID290
Feature activation+1.34e+00

Pos logit contributions
joy+0.016
 thrill+0.016
 sorry+0.015
 grin+0.014
 vibration+0.014
Neg logit contributions
 pic-0.006
 lin-0.005
mary-0.004
At-0.004
 called-0.003
Token threw
Token ID9617
Feature activation+1.54e-01

Pos logit contributions
joy+0.051
 thrill+0.048
 disl+0.047
 relief+0.045
 Majesty+0.045
Neg logit contributions
's-0.025
 so-0.023
 called-0.023
 and-0.023
-0.022
Token a
Token ID257
Feature activation+2.19e-01

Pos logit contributions
joy+0.006
 thrill+0.006
 relief+0.005
 disl+0.005
 nods+0.005
Neg logit contributions
's-0.002
 called-0.002
-0.002
 so-0.002
 and-0.002
Token tant
Token ID24246
Feature activation+7.21e+00

Pos logit contributions
joy+0.008
 thrill+0.007
 disl+0.007
 relief+0.007
 Majesty+0.007
Neg logit contributions
's-0.004
 so-0.004
 and-0.004
-0.004
 called-0.004
Tokenrum
Token ID6582
Feature activation+1.48e+00

Pos logit contributions
joy+0.163
 thrill+0.146
 relief+0.133
 disl+0.132
 nods+0.128
Neg logit contributions
's-0.242
 and-0.233
 so-0.231
 called-0.230
-0.225
Token.
Token ID13
Feature activation+7.07e-02

Pos logit contributions
joy+0.071
 thrill+0.068
 disl+0.067
 relief+0.065
 nods+0.065
Neg logit contributions
's-0.012
 so-0.010
 and-0.010
 called-0.009
-0.009
Token
Token ID198
Feature activation-6.63e-01

Pos logit contributions
joy+0.003
 thrill+0.003
 disl+0.003
 relief+0.002
 Majesty+0.002
Neg logit contributions
's-0.001
 so-0.001
 and-0.001
 called-0.001
-0.001
Token
Token ID198
Feature activation-5.29e-01

Pos logit contributions
's+0.013
 so+0.013
 called+0.012
 off+0.012
 from+0.012
Neg logit contributions
joy-0.023
 thrill-0.021
 disl-0.021
 nods-0.020
 relief-0.020
Token"
Token ID1
Feature activation+1.18e+00

Pos logit contributions
's+0.010
 so+0.009
 called+0.009
 and+0.009
 from+0.008
Neg logit contributions
joy-0.020
 thrill-0.018
 disl-0.018
 relief-0.017
 nods-0.017
Token them
Token ID606
Feature activation-8.53e-01

Pos logit contributions
's+0.023
+0.021
 called+0.020
 so+0.020
 and+0.020
Neg logit contributions
joy-0.050
 thrill-0.046
 disl-0.046
 relief-0.045
 Majesty-0.044
Token and
Token ID290
Feature activation-8.53e-01

Pos logit contributions
's+0.009
 called+0.008
+0.008
 lin+0.007
At+0.007
Neg logit contributions
joy-0.036
 thrill-0.033
 disl-0.033
 winners-0.032
 relief-0.032
Token never
Token ID1239
Feature activation+8.72e-01

Pos logit contributions
's+0.013
 and+0.012
 so+0.012
 called+0.012
 from+0.011
Neg logit contributions
joy-0.037
 thrill-0.035
viously-0.033
 nods-0.032
 relief-0.032
Token threw
Token ID9617
Feature activation-9.10e-01

Pos logit contributions
joy+0.034
 thrill+0.032
 Majesty+0.030
 disl+0.029
 Amen+0.029
Neg logit contributions
's-0.018
 so-0.016
 and-0.016
 called-0.016
 to-0.016
Token a
Token ID257
Feature activation-9.44e-01

Pos logit contributions
's+0.014
 called+0.013
+0.013
 so+0.012
 and+0.012
Neg logit contributions
joy-0.036
 thrill-0.034
 disl-0.033
 relief-0.033
 nods-0.032
Token tant
Token ID24246
Feature activation+6.90e+00

Pos logit contributions
's+0.021
 so+0.020
 and+0.020
+0.019
 called+0.019
Neg logit contributions
joy-0.032
 thrill-0.029
 disl-0.029
 nods-0.028
 relief-0.028
Tokenrum
Token ID6582
Feature activation+5.45e-01

Pos logit contributions
joy+0.175
 thrill+0.163
 relief+0.150
 nods+0.142
 disl+0.135
Neg logit contributions
's-0.213
 and-0.205
 called-0.200
 so-0.200
 from-0.195
Token again
Token ID757
Feature activation+2.00e-01

Pos logit contributions
joy+0.026
 thrill+0.024
 disl+0.024
 relief+0.023
 nods+0.023
Neg logit contributions
's-0.005
 so-0.004
 and-0.004
 called-0.004
-0.004
Token.
Token ID13
Feature activation-4.97e-01

Pos logit contributions
joy+0.009
 disl+0.008
 thrill+0.008
 relief+0.008
 nods+0.008
Neg logit contributions
's-0.002
-0.002
 called-0.002
 so-0.002
 closely-0.002
Token<|endoftext|>
Token ID50256
Feature activation+2.54e+00

Pos logit contributions
's+0.009
 so+0.008
 and+0.008
+0.008
 called+0.008
Neg logit contributions
joy-0.019
 thrill-0.018
 disl-0.017
 relief-0.017
 nods-0.017
TokenOnce
Token ID7454
Feature activation-1.31e+00

Pos logit contributions
joy+0.097
 thrill+0.089
 disl+0.086
 nods+0.085
 relief+0.085
Neg logit contributions
's-0.044
 so-0.041
 called-0.041
-0.039
 and-0.039
Token temper
Token ID4124
Feature activation+2.51e+00

Pos logit contributions
's+0.008
 and+0.008
 so+0.007
 called+0.007
 to+0.007
Neg logit contributions
joy-0.011
 thrill-0.010
 disl-0.010
 relief-0.009
 nods-0.009
Token and
Token ID290
Feature activation-1.88e-01

Pos logit contributions
joy+0.115
 thrill+0.109
 disl+0.108
 nods+0.104
 relief+0.104
Neg logit contributions
's-0.022
-0.019
 so-0.017
 called-0.017
 closely-0.015
Token often
Token ID1690
Feature activation+8.80e-01

Pos logit contributions
's+0.003
 so+0.003
 called+0.003
 and+0.003
+0.003
Neg logit contributions
joy-0.007
 thrill-0.007
 disl-0.007
 relief-0.007
 nods-0.007
Token threw
Token ID9617
Feature activation+4.37e-01

Pos logit contributions
joy+0.033
 thrill+0.031
 disl+0.030
 relief+0.029
 nods+0.029
Neg logit contributions
's-0.017
 called-0.016
 so-0.016
 and-0.016
-0.015
Token temper
Token ID4124
Feature activation+2.25e+00

Pos logit contributions
joy+0.017
 thrill+0.016
 disl+0.015
 relief+0.015
 nods+0.015
Neg logit contributions
's-0.008
 called-0.007
-0.007
 so-0.007
 and-0.007
Token tant
Token ID24246
Feature activation+6.48e+00

Pos logit contributions
joy+0.082
 thrill+0.076
 disl+0.075
 nods+0.072
 relief+0.072
Neg logit contributions
's-0.043
 so-0.040
 called-0.039
-0.039
 and-0.038
Tokenrums
Token ID45241
Feature activation+1.70e+00

Pos logit contributions
joy+0.161
 thrill+0.152
 relief+0.137
 nods+0.129
ses+0.124
Neg logit contributions
's-0.204
 and-0.195
 so-0.193
 called-0.193
 off-0.187
Token.
Token ID13
Feature activation+6.84e-02

Pos logit contributions
joy+0.071
 thrill+0.067
 disl+0.066
 nods+0.064
 Majesty+0.064
Neg logit contributions
's-0.018
 so-0.017
 called-0.017
 pic-0.016
-0.016
Token One
Token ID1881
Feature activation-7.23e-01

Pos logit contributions
joy+0.003
 thrill+0.002
 disl+0.002
 relief+0.002
 Majesty+0.002
Neg logit contributions
's-0.001
 so-0.001
 and-0.001
-0.001
 called-0.001
Token day
Token ID1110
Feature activation+1.69e+00

Pos logit contributions
's+0.013
 so+0.012
+0.012
 called+0.012
 off+0.012
Neg logit contributions
joy-0.027
 thrill-0.025
 disl-0.024
 relief-0.023
 nods-0.023
Token,
Token ID11
Feature activation+9.58e-01

Pos logit contributions
joy+0.078
 disl+0.075
 thrill+0.075
ses+0.074
 Majesty+0.073
Neg logit contributions
's-0.018
 and-0.017
 so-0.017
-0.015
 called-0.014
Token sad
Token ID6507
Feature activation+2.97e+00

Pos logit contributions
joy+0.027
 thrill+0.025
 disl+0.025
 Majesty+0.024
 nods+0.024
Neg logit contributions
's-0.020
 so-0.019
 called-0.019
 and-0.019
-0.018
Token.
Token ID13
Feature activation-2.69e-01

Pos logit contributions
joy+0.139
 thrill+0.132
 disl+0.130
 nods+0.128
 relief+0.127
Neg logit contributions
's-0.025
-0.021
 so-0.021
 called-0.020
 and-0.017
Token He
Token ID679
Feature activation+4.38e-01

Pos logit contributions
 and+0.006
 so+0.005
 called+0.005
's+0.005
 to+0.005
Neg logit contributions
joy-0.010
 thrill-0.009
viously-0.009
 Majesty-0.009
 disl-0.009
Token has
Token ID468
Feature activation-2.05e-01

Pos logit contributions
joy+0.014
 thrill+0.012
 disl+0.012
 relief+0.011
 Majesty+0.011
Neg logit contributions
's-0.012
 and-0.011
 so-0.011
-0.011
 called-0.011
Token a
Token ID257
Feature activation+1.26e+00

Pos logit contributions
's+0.004
 so+0.004
 and+0.004
 called+0.004
+0.004
Neg logit contributions
joy-0.008
 thrill-0.007
 disl-0.007
 relief-0.007
 nods-0.007
Token bru
Token ID18145
Feature activation+6.45e+00

Pos logit contributions
joy+0.048
 thrill+0.045
 disl+0.045
viously+0.044
 Majesty+0.043
Neg logit contributions
's-0.022
 and-0.022
 so-0.021
 called-0.021
 to-0.020
Tokenise
Token ID786
Feature activation+1.66e+00

Pos logit contributions
joy+0.203
 thrill+0.190
 relief+0.178
 disl+0.176
 nods+0.172
Neg logit contributions
's-0.167
 and-0.157
 called-0.155
 so-0.154
-0.147
Token on
Token ID319
Feature activation+7.45e-01

Pos logit contributions
joy+0.080
 thrill+0.076
 disl+0.075
 relief+0.073
 nods+0.073
Neg logit contributions
's-0.014
 and-0.012
 so-0.012
 called-0.011
-0.010
Token his
Token ID465
Feature activation+1.96e+00

Pos logit contributions
joy+0.027
 thrill+0.025
 disl+0.025
 relief+0.024
 nods+0.023
Neg logit contributions
's-0.015
-0.014
 called-0.013
 so-0.013
 and-0.013
Token leg
Token ID1232
Feature activation+2.81e+00

Pos logit contributions
joy+0.045
 thrill+0.042
 relief+0.040
ses+0.040
 disl+0.038
Neg logit contributions
-0.061
 called-0.059
's-0.059
 pic-0.057
At-0.057
Token.
Token ID13
Feature activation-1.33e-02

Pos logit contributions
joy+0.123
 thrill+0.117
 disl+0.115
 nods+0.113
 relief+0.112
Neg logit contributions
's-0.027
-0.025
 so-0.023
 called-0.022
 pic-0.022
Token arm
Token ID3211
Feature activation+1.52e+00

Pos logit contributions
joy+0.026
 thrill+0.026
 relief+0.023
ses+0.023
 grin+0.023
Neg logit contributions
-0.045
 called-0.045
 pic-0.045
At-0.044
sc-0.044
Token.
Token ID13
Feature activation+2.29e-01

Pos logit contributions
joy+0.068
 thrill+0.064
 disl+0.063
 relief+0.062
 nods+0.062
Neg logit contributions
's-0.015
-0.013
 so-0.013
 called-0.013
 pic-0.011
Token She
Token ID1375
Feature activation+5.13e-01

Pos logit contributions
 thrill+0.009
joy+0.009
 Majesty+0.008
 disl+0.008
confidence+0.008
Neg logit contributions
 and-0.004
 so-0.004
's-0.004
 called-0.003
-0.003
Token had
Token ID550
Feature activation+2.80e-01

Pos logit contributions
joy+0.017
 thrill+0.016
 disl+0.015
 nods+0.015
 relief+0.015
Neg logit contributions
's-0.011
 so-0.011
-0.011
 and-0.010
 called-0.010
Token a
Token ID257
Feature activation+1.33e+00

Pos logit contributions
joy+0.011
 thrill+0.010
 disl+0.010
 relief+0.010
 nods+0.010
Neg logit contributions
's-0.005
 so-0.005
 and-0.004
 called-0.004
-0.004
Token bru
Token ID18145
Feature activation+6.42e+00

Pos logit contributions
joy+0.046
 disl+0.044
 thrill+0.042
 Majesty+0.042
viously+0.042
Neg logit contributions
 and-0.028
's-0.028
 so-0.027
 to-0.026
 called-0.026
Tokenise
Token ID786
Feature activation+1.50e+00

Pos logit contributions
joy+0.202
 thrill+0.189
 relief+0.175
 disl+0.175
 nods+0.171
Neg logit contributions
's-0.166
 and-0.157
 called-0.154
 so-0.153
-0.146
Token on
Token ID319
Feature activation+6.19e-01

Pos logit contributions
joy+0.072
 thrill+0.068
 disl+0.067
 relief+0.065
 nods+0.065
Neg logit contributions
's-0.013
 so-0.011
 and-0.010
 called-0.010
-0.010
Token her
Token ID607
Feature activation+1.23e+00

Pos logit contributions
joy+0.024
 thrill+0.023
 disl+0.022
 relief+0.022
ses+0.021
Neg logit contributions
's-0.010
-0.010
 called-0.009
 so-0.009
sc-0.008
Token leg
Token ID1232
Feature activation+2.73e+00

Pos logit contributions
joy+0.026
 thrill+0.026
 relief+0.024
ses+0.023
 grin+0.023
Neg logit contributions
-0.038
 called-0.038
 pic-0.038
's-0.037
sc-0.037
Token.
Token ID13
Feature activation+1.61e-01

Pos logit contributions
joy+0.119
 thrill+0.113
 disl+0.110
 nods+0.108
 relief+0.108
Neg logit contributions
's-0.027
-0.025
 so-0.024
 called-0.023
 pic-0.023
Token.
Token ID13
Feature activation+5.30e-01

Pos logit contributions
joy+0.005
 thrill+0.004
 disl+0.004
 relief+0.004
 nods+0.004
Neg logit contributions
's-0.001
-0.001
 called-0.001
 so-0.001
 and-0.001
Token He
Token ID679
Feature activation+5.76e-01

Pos logit contributions
joy+0.016
 thrill+0.015
 disl+0.014
 relief+0.014
 nods+0.014
Neg logit contributions
's-0.014
 so-0.013
 and-0.013
 called-0.013
-0.013
Token had
Token ID550
Feature activation+5.49e-02

Pos logit contributions
joy+0.017
 thrill+0.016
 nods+0.016
 relief+0.016
 disl+0.015
Neg logit contributions
's-0.012
 so-0.012
 from-0.012
-0.012
 pic-0.012
Token a
Token ID257
Feature activation+8.16e-01

Pos logit contributions
joy+0.003
 thrill+0.002
 disl+0.002
 relief+0.002
 nods+0.002
Neg logit contributions
's-0.001
 so-0.000
 called-0.000
 and-0.000
-0.000
Token big
Token ID1263
Feature activation+8.54e-01

Pos logit contributions
joy+0.031
 disl+0.029
 thrill+0.029
 nods+0.028
viously+0.028
Neg logit contributions
's-0.014
 so-0.014
 and-0.013
-0.013
 called-0.013
Token bru
Token ID18145
Feature activation+6.29e+00

Pos logit contributions
 disl+0.024
joy+0.024
osity+0.024
viously+0.022
 thrill+0.022
Neg logit contributions
 and-0.023
's-0.022
 so-0.022
 called-0.021
-0.020
Tokenise
Token ID786
Feature activation+1.56e+00

Pos logit contributions
joy+0.201
 thrill+0.189
 relief+0.177
 disl+0.174
 nods+0.173
Neg logit contributions
's-0.161
 and-0.152
 so-0.149
 called-0.148
-0.142
Token on
Token ID319
Feature activation+6.63e-01

Pos logit contributions
joy+0.073
 thrill+0.070
 disl+0.069
 relief+0.067
 nods+0.067
Neg logit contributions
's-0.014
 so-0.012
 and-0.012
 called-0.012
-0.011
Token his
Token ID465
Feature activation+1.57e+00

Pos logit contributions
joy+0.025
 thrill+0.023
 disl+0.023
 relief+0.022
 nods+0.022
Neg logit contributions
's-0.012
-0.011
 called-0.011
 so-0.011
 and-0.011
Token head
Token ID1182
Feature activation+2.04e+00

Pos logit contributions
joy+0.036
 thrill+0.035
 relief+0.032
ses+0.031
 grin+0.030
Neg logit contributions
 called-0.049
-0.048
's-0.047
sc-0.046
At-0.046
Token from
Token ID422
Feature activation-2.63e-01

Pos logit contributions
joy+0.098
 thrill+0.093
 disl+0.091
 relief+0.089
 nods+0.089
Neg logit contributions
's-0.015
 so-0.012
 called-0.011
-0.011
 and-0.010
Token hurt
Token ID5938
Feature activation+1.94e+00

Pos logit contributions
joy+0.079
 disl+0.074
 thrill+0.074
 Majesty+0.073
 nods+0.071
Neg logit contributions
's-0.044
 and-0.044
 so-0.042
 called-0.041
 from-0.040
Token.
Token ID13
Feature activation-4.26e-01

Pos logit contributions
joy+0.081
 thrill+0.073
 relief+0.071
viously+0.070
 disl+0.070
Neg logit contributions
's-0.024
 pic-0.024
 unw-0.022
-0.021
 called-0.021
Token Ben
Token ID3932
Feature activation-1.42e+00

Pos logit contributions
's+0.007
 and+0.007
 so+0.007
+0.007
 called+0.007
Neg logit contributions
joy-0.016
 thrill-0.016
 disl-0.015
 Majesty-0.015
 relief-0.015
Token had
Token ID550
Feature activation+3.74e-02

Pos logit contributions
's+0.032
 so+0.030
 and+0.030
 called+0.029
+0.029
Neg logit contributions
joy-0.049
 thrill-0.045
 disl-0.044
 relief-0.042
 Majesty-0.042
Token a
Token ID257
Feature activation+1.15e+00

Pos logit contributions
joy+0.001
 thrill+0.001
 disl+0.001
 relief+0.001
 nods+0.001
Neg logit contributions
's-0.001
 so-0.001
 and-0.001
 called-0.001
-0.001
Token bru
Token ID18145
Feature activation+6.21e+00

Pos logit contributions
joy+0.038
 disl+0.037
osity+0.035
viously+0.035
folk+0.035
Neg logit contributions
's-0.025
 and-0.025
 so-0.024
 very-0.023
 really-0.023
Tokenise
Token ID786
Feature activation+1.55e+00

Pos logit contributions
joy+0.193
 thrill+0.179
 disl+0.169
 relief+0.167
 nods+0.164
Neg logit contributions
's-0.161
 and-0.152
 so-0.150
 called-0.150
-0.145
Token on
Token ID319
Feature activation+4.87e-01

Pos logit contributions
joy+0.074
 thrill+0.070
 disl+0.068
 relief+0.067
 nods+0.067
Neg logit contributions
's-0.012
 so-0.010
 called-0.010
-0.010
 and-0.009
Token his
Token ID465
Feature activation+1.70e+00

Pos logit contributions
joy+0.019
 thrill+0.018
 disl+0.018
 relief+0.018
 nods+0.017
Neg logit contributions
's-0.008
-0.007
 called-0.007
 so-0.007
 and-0.006
Token leg
Token ID1232
Feature activation+2.78e+00

Pos logit contributions
joy+0.047
 thrill+0.045
 relief+0.043
ses+0.042
 nods+0.041
Neg logit contributions
-0.044
 called-0.043
's-0.043
 pic-0.042
 closely-0.041
Token and
Token ID290
Feature activation+4.48e-01

Pos logit contributions
joy+0.126
 thrill+0.120
 disl+0.117
 nods+0.116
 relief+0.115
Neg logit contributions
's-0.021
-0.019
 pic-0.018
 so-0.018
 called-0.017
Token on
Token ID319
Feature activation-2.98e-01

Pos logit contributions
joy+0.087
 thrill+0.082
 disl+0.081
 relief+0.079
 nods+0.079
Neg logit contributions
's-0.021
 so-0.019
 and-0.018
 called-0.018
-0.018
Token his
Token ID465
Feature activation+1.95e+00

Pos logit contributions
's+0.005
+0.004
 so+0.004
 called+0.004
 and+0.004
Neg logit contributions
joy-0.012
 thrill-0.011
 disl-0.011
 relief-0.010
 nods-0.010
Token head
Token ID1182
Feature activation+1.64e+00

Pos logit contributions
joy+0.060
 thrill+0.056
 relief+0.053
 disl+0.053
 nods+0.052
Neg logit contributions
's-0.046
-0.045
 called-0.044
 so-0.044
 and-0.043
Token and
Token ID290
Feature activation+7.20e-01

Pos logit contributions
joy+0.073
 thrill+0.069
 nods+0.066
 disl+0.066
 relief+0.065
Neg logit contributions
 pic-0.014
's-0.011
 called-0.011
-0.010
 unw-0.010
Token a
Token ID257
Feature activation+1.06e+00

Pos logit contributions
joy+0.030
 thrill+0.028
 disl+0.028
 nods+0.027
 Majesty+0.027
Neg logit contributions
's-0.011
 so-0.010
 called-0.010
 and-0.010
-0.009
Token bru
Token ID18145
Feature activation+6.13e+00

Pos logit contributions
joy+0.039
 disl+0.037
 Majesty+0.036
viously+0.035
 thrill+0.035
Neg logit contributions
's-0.020
 and-0.020
 so-0.019
 called-0.019
-0.018
Tokenise
Token ID786
Feature activation+1.41e+00

Pos logit contributions
joy+0.192
 thrill+0.178
 disl+0.168
 relief+0.166
 nods+0.164
Neg logit contributions
's-0.158
 and-0.149
 called-0.147
 so-0.146
-0.141
Token on
Token ID319
Feature activation+2.24e-01

Pos logit contributions
joy+0.068
 thrill+0.064
 disl+0.063
 relief+0.062
 nods+0.062
Neg logit contributions
's-0.012
 so-0.010
 and-0.009
 called-0.009
-0.009
Token his
Token ID465
Feature activation+1.61e+00

Pos logit contributions
joy+0.009
 thrill+0.008
 disl+0.008
 relief+0.008
 nods+0.008
Neg logit contributions
's-0.004
-0.004
 so-0.004
 called-0.003
 and-0.003
Token arm
Token ID3211
Feature activation+1.15e+00

Pos logit contributions
joy+0.050
 thrill+0.046
 disl+0.045
 relief+0.043
 nods+0.043
Neg logit contributions
's-0.040
 so-0.038
 called-0.038
 and-0.038
-0.038
Token.
Token ID13
Feature activation-2.15e-01

Pos logit contributions
joy+0.053
 thrill+0.051
 disl+0.049
 relief+0.049
 nods+0.048
Neg logit contributions
's-0.010
 so-0.009
-0.009
 called-0.009
 and-0.008
Token.
Token ID13
Feature activation-8.29e-01

Pos logit contributions
's+0.010
 pic+0.010
At+0.010
 called+0.009
+0.009
Neg logit contributions
joy-0.023
 thrill-0.021
 disl-0.021
 relief-0.020
 Majesty-0.020
Token You
Token ID921
Feature activation+2.18e+00

Pos logit contributions
's+0.024
+0.023
 so+0.023
 and+0.022
 called+0.022
Neg logit contributions
joy-0.023
 thrill-0.021
 disl-0.020
 relief-0.019
 nods-0.019
Token just
Token ID655
Feature activation+3.26e+00

Pos logit contributions
joy+0.085
 thrill+0.073
 disl+0.071
 Majesty+0.070
 relief+0.070
Neg logit contributions
's-0.050
 and-0.046
-0.046
 so-0.045
 called-0.045
Token have
Token ID423
Feature activation-8.28e-01

Pos logit contributions
joy+0.121
 thrill+0.108
 Majesty+0.100
 disl+0.099
Sorry+0.097
Neg logit contributions
's-0.078
 called-0.074
-0.073
 so-0.073
 and-0.072
Token a
Token ID257
Feature activation-1.28e-01

Pos logit contributions
's+0.007
+0.006
 called+0.006
 so+0.006
 and+0.005
Neg logit contributions
joy-0.039
 thrill-0.037
 disl-0.036
 relief-0.035
 nods-0.035
Token bru
Token ID18145
Feature activation+5.99e+00

Pos logit contributions
's+0.003
 so+0.003
 and+0.002
+0.002
 called+0.002
Neg logit contributions
joy-0.005
 thrill-0.004
 disl-0.004
 nods-0.004
 relief-0.004
Tokenise
Token ID786
Feature activation+1.13e+00

Pos logit contributions
joy+0.186
 thrill+0.174
 relief+0.163
 disl+0.162
 nods+0.158
Neg logit contributions
's-0.155
 and-0.146
 so-0.145
 called-0.144
-0.140
Token.
Token ID13
Feature activation-4.02e-01

Pos logit contributions
joy+0.050
 thrill+0.047
 disl+0.046
 relief+0.045
 nods+0.044
Neg logit contributions
's-0.014
 so-0.012
 called-0.012
 and-0.012
-0.012
Token You
Token ID921
Feature activation+2.34e+00

Pos logit contributions
's+0.012
+0.011
 so+0.011
 called+0.011
 and+0.011
Neg logit contributions
joy-0.011
 thrill-0.010
 disl-0.009
 relief-0.009
 nods-0.009
Token can
Token ID460
Feature activation+2.02e-01

Pos logit contributions
joy+0.094
 thrill+0.078
 disl+0.077
 Majesty+0.076
 relief+0.075
Neg logit contributions
's-0.054
-0.050
 and-0.050
 so-0.049
 called-0.048
Token go
Token ID467
Feature activation-4.38e-01

Pos logit contributions
joy+0.007
 thrill+0.007
 disl+0.007
 relief+0.006
 nods+0.006
Neg logit contributions
's-0.004
 so-0.004
 and-0.004
 called-0.004
-0.004
Token,
Token ID11
Feature activation+9.04e-01

Pos logit contributions
joy+0.083
 thrill+0.079
 disl+0.078
 relief+0.076
 Majesty+0.075
Neg logit contributions
's-0.017
 and-0.015
 so-0.014
 called-0.014
-0.013
Token mum
Token ID25682
Feature activation+1.43e+00

Pos logit contributions
joy+0.033
 thrill+0.030
 disl+0.030
 relief+0.029
 nods+0.029
Neg logit contributions
's-0.018
 so-0.016
-0.016
 called-0.016
 and-0.016
Token and
Token ID290
Feature activation-1.36e+00

Pos logit contributions
joy+0.044
 nods+0.043
 disl+0.041
 relief+0.038
 thrill+0.038
Neg logit contributions
sc-0.028
 off-0.028
 closely-0.028
 so-0.027
 into-0.027
Token dad
Token ID9955
Feature activation+2.41e+00

Pos logit contributions
's+0.027
 so+0.027
 and+0.026
 called+0.026
 from+0.024
Neg logit contributions
joy-0.049
 thrill-0.048
 disl-0.046
 relief-0.045
ses-0.044
Token told
Token ID1297
Feature activation-9.19e-01

Pos logit contributions
joy+0.076
 thrill+0.072
 nods+0.072
 relief+0.069
viously+0.069
Neg logit contributions
ches-0.045
 closely-0.045
 pic-0.044
's-0.044
 shelf-0.043
Token Maria
Token ID14200
Feature activation-6.87e+00

Pos logit contributions
's+0.015
+0.015
 called+0.014
sc+0.014
 so+0.014
Neg logit contributions
joy-0.034
 disl-0.032
 thrill-0.032
 relief-0.032
 nods-0.031
Token that
Token ID326
Feature activation-4.49e-01

Pos logit contributions
's+0.082
 called+0.082
 unw+0.081
 pic+0.079
sc+0.079
Neg logit contributions
joy-0.270
 disl-0.261
 nods-0.250
 thrill-0.248
 Majesty-0.244
Token they
Token ID484
Feature activation-2.60e+00

Pos logit contributions
's+0.007
 so+0.007
 called+0.007
 and+0.007
+0.007
Neg logit contributions
joy-0.018
 thrill-0.016
 disl-0.016
 relief-0.016
 nods-0.016
Token had
Token ID550
Feature activation-1.41e+00

Pos logit contributions
's+0.048
 so+0.044
 and+0.043
+0.043
 called+0.043
Neg logit contributions
joy-0.105
 thrill-0.096
 disl-0.094
 Majesty-0.093
 relief-0.091
Token an
Token ID281
Feature activation-1.08e+00

Pos logit contributions
 closely+0.016
's+0.016
+0.015
ches+0.015
 called+0.014
Neg logit contributions
joy-0.060
 thrill-0.054
 disl-0.054
 relief-0.053
 winners-0.052
Token exciting
Token ID7895
Feature activation+7.18e-01

Pos logit contributions
sc+0.033
ches+0.030
's+0.030
At+0.029
mary+0.029
Neg logit contributions
 relief-0.026
 sorry-0.024
 thrill-0.024
 disl-0.023
 Majesty-0.023
Token there
Token ID612
Feature activation+6.18e-01

Pos logit contributions
's+0.004
 called+0.003
 so+0.003
 and+0.003
+0.003
Neg logit contributions
joy-0.013
 thrill-0.012
 disl-0.012
 nods-0.012
 relief-0.012
Token was
Token ID373
Feature activation+2.43e-02

Pos logit contributions
joy+0.026
 thrill+0.024
 disl+0.023
 nods+0.023
 relief+0.023
Neg logit contributions
's-0.007
 so-0.007
 off-0.007
 called-0.007
 into-0.007
Token a
Token ID257
Feature activation-1.75e+00

Pos logit contributions
joy+0.001
 thrill+0.001
 relief+0.001
 disl+0.001
 nods+0.001
Neg logit contributions
's-0.000
ches-0.000
 off-0.000
 called-0.000
 closely-0.000
Token girl
Token ID2576
Feature activation+1.24e+00

Pos logit contributions
's+0.031
 so+0.028
+0.027
 called+0.027
 and+0.027
Neg logit contributions
joy-0.067
 thrill-0.063
 disl-0.061
 relief-0.060
 nods-0.059
Token called
Token ID1444
Feature activation-1.51e+00

Pos logit contributions
joy+0.031
 Amen+0.026
 disl+0.026
 relief+0.025
 sincere+0.025
Neg logit contributions
 off-0.034
ches-0.032
 pic-0.032
sc-0.031
 closely-0.030
Token Min
Token ID1855
Feature activation-6.59e+00

Pos logit contributions
's+0.036
 closely+0.033
 and+0.033
 so+0.033
 off+0.032
Neg logit contributions
joy-0.047
 disl-0.041
 thrill-0.040
 relief-0.040
ses-0.038
Tokennie
Token ID11952
Feature activation-3.41e+00

Pos logit contributions
 off+0.135
 so+0.133
 pic+0.130
's+0.129
+0.126
Neg logit contributions
joy-0.216
 thrill-0.194
 nods-0.191
 disl-0.190
 relief-0.189
Token.
Token ID13
Feature activation+2.00e-01

Pos logit contributions
 pic+0.052
sc+0.049
 off+0.049
ches+0.049
At+0.047
Neg logit contributions
joy-0.130
 nods-0.113
 disl-0.113
 thrill-0.111
 relief-0.111
Token She
Token ID1375
Feature activation-2.40e-01

Pos logit contributions
joy+0.006
 thrill+0.006
 disl+0.005
 nods+0.005
 relief+0.005
Neg logit contributions
's-0.005
 so-0.005
 called-0.005
-0.005
 and-0.005
Token was
Token ID373
Feature activation+7.68e-01

Pos logit contributions
 off+0.006
 closely+0.005
's+0.005
 into+0.005
 from+0.005
Neg logit contributions
joy-0.007
 disl-0.006
 thrill-0.006
 nods-0.006
 relief-0.006
Token very
Token ID845
Feature activation+1.72e-01

Pos logit contributions
joy+0.029
 sorry+0.026
 thrill+0.026
 relief+0.025
 winners+0.025
Neg logit contributions
ches-0.013
's-0.013
sc-0.012
 closely-0.012
 off-0.012
Token of
Token ID286
Feature activation-2.79e+00

Pos logit contributions
 convey+0.005
 called+0.005
 shelf+0.005
 pic+0.005
's+0.005
Neg logit contributions
joy-0.011
 nods-0.010
 thrill-0.010
viously-0.010
 relief-0.010
Token the
Token ID262
Feature activation-2.91e-01

Pos logit contributions
's+0.051
+0.047
 called+0.042
 so+0.040
 closely+0.039
Neg logit contributions
joy-0.102
 thrill-0.095
 relief-0.090
 nods-0.088
 ah-0.086
Token items
Token ID3709
Feature activation-1.14e+00

Pos logit contributions
's+0.006
+0.006
 called+0.006
 so+0.005
 from+0.005
Neg logit contributions
joy-0.010
 thrill-0.009
 nods-0.009
 disl-0.009
 relief-0.008
Token and
Token ID290
Feature activation-7.06e-01

Pos logit contributions
 pic+0.014
's+0.013
ches+0.012
 called+0.012
During+0.012
Neg logit contributions
joy-0.047
 thrill-0.042
 nods-0.042
 winners-0.040
 grin-0.040
Token told
Token ID1297
Feature activation-7.32e-01

Pos logit contributions
's+0.011
+0.010
 so+0.010
ches+0.010
 pic+0.010
Neg logit contributions
joy-0.026
 thrill-0.024
 nods-0.023
 relief-0.023
 ah-0.023
Token Min
Token ID1855
Feature activation-6.49e+00

Pos logit contributions
's+0.012
 called+0.011
+0.011
 so+0.011
 and+0.010
Neg logit contributions
joy-0.028
 thrill-0.026
 relief-0.026
 disl-0.026
 nods-0.025
Tokennie
Token ID11952
Feature activation-4.41e+00

Pos logit contributions
's+0.124
 so+0.121
 called+0.119
 off+0.116
 pic+0.115
Neg logit contributions
joy-0.227
 thrill-0.207
 disl-0.205
 nods-0.203
 relief-0.200
Token the
Token ID262
Feature activation-2.26e+00

Pos logit contributions
's+0.061
 so+0.056
 called+0.055
 and+0.054
+0.054
Neg logit contributions
joy-0.183
 thrill-0.171
 disl-0.169
 nods-0.164
 relief-0.164
Token cost
Token ID1575
Feature activation-1.28e+00

Pos logit contributions
's+0.047
 so+0.044
 and+0.043
 called+0.043
+0.043
Neg logit contributions
joy-0.080
 thrill-0.074
 disl-0.073
 relief-0.070
 nods-0.070
Token.
Token ID13
Feature activation-3.44e-02

Pos logit contributions
's+0.021
 pic+0.019
 called+0.018
 closely+0.017
ches+0.017
Neg logit contributions
joy-0.049
 thrill-0.044
viously-0.043
 winners-0.043
 disl-0.043
Token Min
Token ID1855
Feature activation-4.10e+00

Pos logit contributions
's+0.001
 so+0.001
 and+0.001
 called+0.001
+0.001
Neg logit contributions
joy-0.001
 thrill-0.001
 disl-0.001
 relief-0.001
 nods-0.001
Token Lily
Token ID20037
Feature activation-1.28e+00

Pos logit contributions
's+0.006
+0.006
 and+0.006
 so+0.006
 called+0.006
Neg logit contributions
joy-0.013
 thrill-0.012
 disl-0.012
 relief-0.012
 nods-0.012
Token assured
Token ID13933
Feature activation-6.06e-01

Pos logit contributions
's+0.027
 so+0.026
+0.025
 called+0.025
 off+0.025
Neg logit contributions
joy-0.042
 thrill-0.039
 disl-0.038
 nods-0.038
 relief-0.037
Token her
Token ID607
Feature activation-2.60e+00

Pos logit contributions
's+0.011
+0.010
sc+0.010
 so+0.010
At+0.010
Neg logit contributions
joy-0.021
 disl-0.019
 relief-0.019
 thrill-0.019
 nods-0.019
Token that
Token ID326
Feature activation-1.16e+00

Pos logit contributions
's+0.052
 so+0.050
 called+0.049
+0.049
 and+0.048
Neg logit contributions
joy-0.092
 thrill-0.084
 disl-0.084
 relief-0.081
 nods-0.081
Token L
Token ID406
Feature activation-4.67e+00

Pos logit contributions
's+0.021
 so+0.020
 called+0.019
+0.019
 and+0.019
Neg logit contributions
joy-0.043
 thrill-0.040
 disl-0.039
 nods-0.038
 relief-0.038
Tokenizzy
Token ID40593
Feature activation-6.47e+00

Pos logit contributions
's+0.135
 so+0.132
 separate+0.131
 off+0.129
+0.127
Neg logit contributions
joy-0.118
 disl-0.096
 thrill-0.095
 nods-0.094
iness-0.093
Token was
Token ID373
Feature activation-1.86e+00

Pos logit contributions
's+0.088
 so+0.083
 called+0.081
+0.079
 and+0.078
Neg logit contributions
joy-0.264
 thrill-0.246
 disl-0.246
 nods-0.239
 relief-0.236
Token very
Token ID845
Feature activation-1.20e+00

Pos logit contributions
's+0.035
 so+0.033
 and+0.032
 called+0.032
+0.031
Neg logit contributions
joy-0.071
 thrill-0.066
 disl-0.064
 nods-0.063
 relief-0.062
Token friendly
Token ID8030
Feature activation+7.71e-02

Pos logit contributions
's+0.030
 so+0.028
 called+0.028
 and+0.028
+0.028
Neg logit contributions
joy-0.038
 thrill-0.035
 disl-0.034
 relief-0.033
 nods-0.033
Token and
Token ID290
Feature activation-1.43e+00

Pos logit contributions
joy+0.004
 thrill+0.003
 nods+0.003
 disl+0.003
 relief+0.003
Neg logit contributions
's-0.001
-0.000
 called-0.000
 unw-0.000
At-0.000
Token loved
Token ID6151
Feature activation-1.37e+00

Pos logit contributions
's+0.029
 so+0.028
 called+0.028
 and+0.027
+0.026
Neg logit contributions
joy-0.057
 thrill-0.052
 Majesty-0.047
 relief-0.047
 vibration-0.046
Token day
Token ID1110
Feature activation+1.52e+00

Pos logit contributions
's+0.008
 so+0.008
 called+0.007
+0.007
 and+0.007
Neg logit contributions
joy-0.018
 thrill-0.017
 disl-0.017
 relief-0.016
 nods-0.016
Token Mia
Token ID32189
Feature activation-2.82e+00

Pos logit contributions
joy+0.068
 thrill+0.064
 disl+0.063
 relief+0.062
 nods+0.061
Neg logit contributions
's-0.018
 so-0.016
 and-0.015
 called-0.015
-0.015
Token's
Token ID338
Feature activation-8.04e-01

Pos logit contributions
 shelf+0.054
 off+0.053
sc+0.052
 so+0.052
 lin+0.052
Neg logit contributions
joy-0.087
 nods-0.080
 disl-0.075
 nodded-0.073
 thrill-0.072
Token mom
Token ID1995
Feature activation+1.13e+00

Pos logit contributions
's+0.018
 so+0.017
 called+0.017
+0.017
 and+0.017
Neg logit contributions
joy-0.026
 thrill-0.024
 disl-0.023
 relief-0.023
 Majesty-0.023
Token introduced
Token ID5495
Feature activation-1.98e+00

Pos logit contributions
joy+0.031
 nods+0.029
 nodded+0.027
 disl+0.027
 grin+0.026
Neg logit contributions
sc-0.024
 closely-0.024
 pic-0.024
 shelf-0.024
 off-0.023
Token Mia
Token ID32189
Feature activation-6.19e+00

Pos logit contributions
's+0.033
+0.031
 called+0.030
ches+0.029
At+0.028
Neg logit contributions
joy-0.077
 disl-0.067
 nods-0.064
 relief-0.064
viously-0.064
Token to
Token ID284
Feature activation-7.83e-01

Pos logit contributions
 lin+0.065
 unw+0.062
 convey+0.061
At+0.060
 pic+0.059
Neg logit contributions
joy-0.249
 disl-0.213
 nods-0.211
 tweet-0.209
 Majesty-0.204
Token a
Token ID257
Feature activation-1.90e+00

Pos logit contributions
's+0.012
+0.011
 called+0.010
ches+0.010
sc+0.010
Neg logit contributions
joy-0.032
 thrill-0.028
 disl-0.028
 relief-0.028
 Majesty-0.027
Token diamond
Token ID15291
Feature activation+1.65e+00

Pos logit contributions
's+0.037
 off+0.034
 into+0.033
ches+0.033
 and+0.033
Neg logit contributions
joy-0.067
 thrill-0.060
 disl-0.058
 relief-0.057
 Majesty-0.056
Token.
Token ID13
Feature activation-6.28e-01

Pos logit contributions
joy+0.065
 thrill+0.060
 nods+0.058
 Majesty+0.057
 disl+0.057
Neg logit contributions
-0.021
's-0.020
At-0.018
 closely-0.018
 so-0.017
Token It
Token ID632
Feature activation+1.89e+00

Pos logit contributions
's+0.014
 so+0.013
 called+0.013
 and+0.013
+0.013
Neg logit contributions
joy-0.021
 thrill-0.019
 disl-0.018
 nods-0.018
 relief-0.018
Token dad
Token ID9955
Feature activation+2.21e+00

Pos logit contributions
's+0.029
 so+0.028
 and+0.028
 called+0.027
+0.026
Neg logit contributions
joy-0.046
 thrill-0.044
 disl-0.043
 relief-0.041
 nods-0.040
Token made
Token ID925
Feature activation-1.82e+00

Pos logit contributions
joy+0.076
 nods+0.071
 thrill+0.071
 grin+0.068
 relief+0.067
Neg logit contributions
's-0.038
 so-0.036
 closely-0.036
 shelf-0.035
 pic-0.035
Token sure
Token ID1654
Feature activation-1.89e+00

Pos logit contributions
 shelf+0.030
's+0.030
 called+0.030
At+0.029
+0.029
Neg logit contributions
joy-0.065
 relief-0.060
 thrill-0.060
 disl-0.057
 nods-0.057
Token to
Token ID284
Feature activation-1.64e-01

Pos logit contributions
's+0.022
 called+0.020
+0.020
 so+0.019
 and+0.019
Neg logit contributions
joy-0.082
 thrill-0.076
 disl-0.075
 relief-0.074
 nods-0.073
Token encourage
Token ID7898
Feature activation+9.00e-01

Pos logit contributions
's+0.003
sc+0.003
ches+0.003
 closely+0.003
+0.003
Neg logit contributions
 thrill-0.005
joy-0.005
 relief-0.005
 disl-0.005
 grin-0.005
Token Maria
Token ID14200
Feature activation-6.11e+00

Pos logit contributions
joy+0.033
 relief+0.031
 thrill+0.031
 disl+0.030
 nods+0.029
Neg logit contributions
's-0.015
 called-0.013
-0.013
At-0.013
 so-0.012
Token to
Token ID284
Feature activation-4.63e-01

Pos logit contributions
 pic+0.050
 convey+0.047
 lin+0.046
 shelf+0.046
At+0.043
Neg logit contributions
joy-0.253
 disl-0.233
 nods-0.233
 thrill-0.230
 Majesty-0.228
Token try
Token ID1949
Feature activation-1.34e+00

Pos logit contributions
's+0.009
 so+0.008
 called+0.008
+0.008
 closely+0.008
Neg logit contributions
joy-0.016
 thrill-0.015
 disl-0.015
 relief-0.015
 Majesty-0.014
Token all
Token ID477
Feature activation-9.07e-01

Pos logit contributions
 called+0.018
's+0.018
sc+0.018
+0.017
At+0.017
Neg logit contributions
joy-0.054
 disl-0.046
 thrill-0.045
 nods-0.043
 relief-0.043
Token of
Token ID286
Feature activation-3.45e+00

Pos logit contributions
 called+0.013
 convey+0.013
 unw+0.012
's+0.012
ches+0.012
Neg logit contributions
joy-0.034
 disl-0.032
 thrill-0.032
 nods-0.031
 relief-0.031
Token the
Token ID262
Feature activation-9.38e-01

Pos logit contributions
's+0.057
+0.052
 called+0.049
At+0.047
sc+0.047
Neg logit contributions
joy-0.134
 thrill-0.122
 relief-0.121
 disl-0.120
 Majesty-0.113
Token today
Token ID1909
Feature activation-1.64e+00

Pos logit contributions
joy+0.045
 thrill+0.041
 disl+0.041
 relief+0.040
 nods+0.040
Neg logit contributions
's-0.010
-0.009
 so-0.008
 called-0.008
ches-0.008
Token,
Token ID11
Feature activation-3.17e-01

Pos logit contributions
ches+0.020
's+0.018
 closely+0.018
+0.017
 off+0.017
Neg logit contributions
joy-0.067
 thrill-0.060
 relief-0.058
 nods-0.058
 grin-0.057
Token so
Token ID523
Feature activation-8.58e-01

Pos logit contributions
's+0.005
 off+0.005
ches+0.005
 closely+0.005
+0.005
Neg logit contributions
joy-0.012
 thrill-0.011
 disl-0.010
 relief-0.010
 nods-0.010
Token she
Token ID673
Feature activation-2.47e+00

Pos logit contributions
's+0.019
ches+0.018
sc+0.017
 shelf+0.016
 called+0.016
Neg logit contributions
joy-0.029
 thrill-0.026
 relief-0.026
 disl-0.025
 nods-0.025
Token showed
Token ID3751
Feature activation-9.10e-01

Pos logit contributions
's+0.044
 closely+0.042
 shelf+0.042
 so+0.041
 off+0.041
Neg logit contributions
joy-0.081
 nods-0.075
 disl-0.073
 thrill-0.073
 relief-0.071
Token Emily
Token ID17608
Feature activation-6.07e+00

Pos logit contributions
's+0.016
+0.015
 shelf+0.015
 called+0.015
sc+0.015
Neg logit contributions
joy-0.032
 relief-0.029
 disl-0.028
viously-0.028
 nods-0.028
Token her
Token ID607
Feature activation-2.51e+00

Pos logit contributions
sc+0.109
 pic+0.106
 lin+0.103
At+0.098
 called+0.098
Neg logit contributions
joy-0.211
 nods-0.177
 disl-0.176
 grin-0.172
 wow-0.171
Token new
Token ID649
Feature activation-3.13e-01

Pos logit contributions
's+0.052
 called+0.049
ches+0.048
+0.047
 so+0.047
Neg logit contributions
joy-0.086
 thrill-0.076
 disl-0.076
 relief-0.074
 nods-0.074
Token purple
Token ID14032
Feature activation-1.67e+00

Pos logit contributions
's+0.006
 so+0.006
 called+0.006
+0.006
 and+0.006
Neg logit contributions
joy-0.011
 thrill-0.010
 disl-0.010
 nods-0.010
 relief-0.010
Token bag
Token ID6131
Feature activation-1.53e+00

Pos logit contributions
's+0.036
 and+0.034
 so+0.034
 called+0.033
+0.033
Neg logit contributions
joy-0.058
 thrill-0.054
 disl-0.052
 relief-0.051
 nods-0.050
Token.
Token ID13
Feature activation-6.09e-01

Pos logit contributions
 pic+0.022
's+0.018
+0.018
 closely+0.018
ches+0.017
Neg logit contributions
joy-0.060
 thrill-0.054
 grin-0.052
 nods-0.052
 relief-0.051
Token was
Token ID373
Feature activation-1.07e-01

Pos logit contributions
joy+0.044
 nods+0.040
 thrill+0.039
 relief+0.038
 disl+0.038
Neg logit contributions
 pic-0.028
's-0.027
 so-0.027
-0.027
 off-0.026
Token getting
Token ID1972
Feature activation-3.44e-01

Pos logit contributions
's+0.002
+0.002
ches+0.002
 and+0.002
 pic+0.002
Neg logit contributions
joy-0.004
 thrill-0.003
 relief-0.003
 sorry-0.003
 disl-0.003
Token frustrated
Token ID14718
Feature activation+1.78e+00

Pos logit contributions
's+0.008
 pic+0.008
+0.007
ches+0.007
 shelf+0.007
Neg logit contributions
joy-0.011
 sorry-0.010
 relief-0.009
 thrill-0.009
 sincere-0.009
Token and
Token ID290
Feature activation-1.51e+00

Pos logit contributions
joy+0.073
 thrill+0.066
 nods+0.064
 relief+0.064
 sorry+0.063
Neg logit contributions
 pic-0.022
 unw-0.020
 lin-0.019
mary-0.019
's-0.019
Token told
Token ID1297
Feature activation-5.07e-01

Pos logit contributions
's+0.031
 so+0.029
 and+0.028
+0.028
 called+0.028
Neg logit contributions
joy-0.053
 thrill-0.049
 disl-0.048
 relief-0.047
 nods-0.047
Token Julie
Token ID21946
Feature activation-6.06e+00

Pos logit contributions
's+0.009
+0.009
 called+0.008
 so+0.008
 pic+0.008
Neg logit contributions
joy-0.018
 thrill-0.017
 relief-0.016
 disl-0.016
 nods-0.016
Token that
Token ID326
Feature activation-4.09e-01

Pos logit contributions
's+0.076
 called+0.072
 so+0.069
+0.068
 and+0.066
Neg logit contributions
joy-0.250
 disl-0.234
 thrill-0.231
 nods-0.225
 relief-0.223
Token it
Token ID340
Feature activation-1.92e-01

Pos logit contributions
's+0.007
 so+0.007
 and+0.007
 called+0.007
+0.007
Neg logit contributions
joy-0.016
 thrill-0.015
 disl-0.014
 relief-0.014
 nods-0.014
Token was
Token ID373
Feature activation-2.26e+00

Pos logit contributions
 so+0.002
 unw+0.002
 called+0.002
 pic+0.002
 off+0.002
Neg logit contributions
joy-0.008
 disl-0.007
 nods-0.007
 thrill-0.007
ses-0.007
Token time
Token ID640
Feature activation-1.04e+00

Pos logit contributions
's+0.040
+0.036
 closely+0.035
 and+0.034
 unw+0.034
Neg logit contributions
joy-0.084
 thrill-0.078
 relief-0.076
 disl-0.075
 sorry-0.073
Token to
Token ID284
Feature activation+4.84e-01

Pos logit contributions
's+0.011
+0.010
 pic+0.010
 unw+0.009
ches+0.009
Neg logit contributions
joy-0.045
 thrill-0.041
 disl-0.040
ses-0.040
 relief-0.039
Token home
Token ID1363
Feature activation+1.96e+00

Pos logit contributions
 shelf+0.015
 called+0.014
 closely+0.014
At+0.014
mary+0.014
Neg logit contributions
joy-0.029
 relief-0.025
 nods-0.025
 thrill-0.024
 sorry-0.023
Token a
Token ID257
Feature activation-2.05e+00

Pos logit contributions
joy+0.082
 thrill+0.074
 disl+0.073
 relief+0.071
 nods+0.071
Neg logit contributions
's-0.026
-0.024
 called-0.022
 closely-0.022
ches-0.022
Token big
Token ID1263
Feature activation-1.46e+00

Pos logit contributions
's+0.036
 so+0.033
 called+0.033
+0.032
 off+0.032
Neg logit contributions
joy-0.077
 thrill-0.071
 relief-0.068
 disl-0.067
 nods-0.066
Token hoop
Token ID44792
Feature activation-1.36e+00

Pos logit contributions
's+0.031
 so+0.029
+0.028
 called+0.028
 off+0.027
Neg logit contributions
joy-0.051
 thrill-0.046
 relief-0.045
 disl-0.044
 nods-0.044
Token for
Token ID329
Feature activation-1.14e+00

Pos logit contributions
 pic+0.019
At+0.016
 so+0.016
's+0.016
 closely+0.015
Neg logit contributions
joy-0.050
 thrill-0.048
ses-0.046
 nods-0.046
 winners-0.046
Token Maria
Token ID14200
Feature activation-6.04e+00

Pos logit contributions
's+0.025
+0.022
 and+0.021
 closely+0.021
ches+0.020
Neg logit contributions
joy-0.040
 thrill-0.037
 relief-0.036
 disl-0.035
 Majesty-0.034
Token.
Token ID13
Feature activation-4.97e-01

Pos logit contributions
 pic+0.088
At+0.082
 lin+0.082
On+0.077
 off+0.076
Neg logit contributions
joy-0.201
 nods-0.190
 disl-0.186
 grin-0.181
 beard-0.181
Token She
Token ID1375
Feature activation-2.88e-01

Pos logit contributions
's+0.012
 so+0.012
 called+0.012
+0.011
 and+0.011
Neg logit contributions
joy-0.015
 thrill-0.014
 disl-0.013
 nods-0.013
 relief-0.013
Token was
Token ID373
Feature activation+5.99e-01

Pos logit contributions
 off+0.006
 so+0.005
 from+0.005
's+0.005
 into+0.005
Neg logit contributions
 nods-0.009
joy-0.009
 disl-0.009
 thrill-0.008
 Majesty-0.008
Token so
Token ID523
Feature activation-3.21e-02

Pos logit contributions
joy+0.026
 sorry+0.024
 thrill+0.024
 winners+0.023
 relief+0.023
Neg logit contributions
's-0.006
sc-0.005
 pic-0.005
ches-0.005
At-0.005
Token excited
Token ID6568
Feature activation-1.18e+00

Pos logit contributions
's+0.001
 so+0.001
 and+0.001
 called+0.001
+0.001
Neg logit contributions
joy-0.001
 thrill-0.001
 disl-0.001
 relief-0.001
 nods-0.001
Token a
Token ID257
Feature activation-1.74e+00

Pos logit contributions
's+0.001
ches+0.001
 off+0.001
 shelf+0.001
 closely+0.001
Neg logit contributions
joy-0.002
 thrill-0.002
 relief-0.002
 disl-0.002
 nods-0.002
Token baby
Token ID5156
Feature activation+1.04e+00

Pos logit contributions
's+0.030
 so+0.028
 called+0.027
 and+0.027
+0.027
Neg logit contributions
joy-0.068
 thrill-0.063
 disl-0.061
 relief-0.060
 nods-0.060
Token squirrel
Token ID33039
Feature activation+1.51e+00

Pos logit contributions
joy+0.035
 thrill+0.032
 disl+0.031
 relief+0.030
 nods+0.030
Neg logit contributions
's-0.021
-0.020
 so-0.020
 off-0.020
 closely-0.019
Token called
Token ID1444
Feature activation-1.70e+00

Pos logit contributions
joy+0.051
 thrill+0.047
 disl+0.046
 nods+0.045
 sincere+0.045
Neg logit contributions
 pic-0.027
 off-0.025
-0.025
 unw-0.025
 lin-0.024
Token T
Token ID309
Feature activation-3.73e+00

Pos logit contributions
's+0.040
 so+0.038
 and+0.037
 called+0.037
+0.037
Neg logit contributions
joy-0.055
 thrill-0.049
 disl-0.048
 relief-0.047
 Majesty-0.046
Tokenizzy
Token ID40593
Feature activation-6.00e+00

Pos logit contributions
 so+0.097
's+0.096
 called+0.095
+0.093
 off+0.090
Neg logit contributions
joy-0.111
 thrill-0.091
 nods-0.091
 Majesty-0.090
 disl-0.089
Token.
Token ID13
Feature activation-1.50e-01

Pos logit contributions
 off+0.083
 pic+0.079
+0.079
At+0.077
's+0.077
Neg logit contributions
joy-0.237
 disl-0.207
 Majesty-0.206
 thrill-0.204
 nods-0.200
Token She
Token ID1375
Feature activation-4.63e-01

Pos logit contributions
's+0.004
+0.004
ches+0.004
 closely+0.003
 off+0.003
Neg logit contributions
joy-0.005
 thrill-0.004
 nods-0.004
 disl-0.004
 relief-0.004
Token was
Token ID373
Feature activation+4.36e-01

Pos logit contributions
 off+0.011
's+0.011
 closely+0.011
 so+0.011
 from+0.010
Neg logit contributions
joy-0.013
 disl-0.012
 thrill-0.012
 nods-0.012
 relief-0.011
Token very
Token ID845
Feature activation-1.37e-01

Pos logit contributions
joy+0.018
 thrill+0.016
 sorry+0.016
ses+0.016
 relief+0.016
Neg logit contributions
's-0.006
ches-0.005
 closely-0.005
-0.005
sc-0.005
Token small
Token ID1402
Feature activation-8.57e-01

Pos logit contributions
's+0.003
 so+0.003
 called+0.003
 and+0.003
+0.003
Neg logit contributions
joy-0.004
 thrill-0.004
 disl-0.004
 relief-0.004
 nods-0.004
Token of
Token ID286
Feature activation-1.91e+00

Pos logit contributions
joy+0.015
 thrill+0.015
 relief+0.015
 nods+0.014
 grin+0.014
Neg logit contributions
's-0.006
 closely-0.006
 called-0.006
 convey-0.006
 unw-0.006
Token something
Token ID1223
Feature activation+2.05e-01

Pos logit contributions
's+0.021
 so+0.020
 and+0.019
 called+0.018
+0.018
Neg logit contributions
joy-0.086
 thrill-0.081
 disl-0.080
 nods-0.078
 Majesty-0.077
Token y
Token ID331
Feature activation-1.93e+00

Pos logit contributions
joy+0.007
 relief+0.006
 thrill+0.006
 disl+0.006
 nods+0.006
Neg logit contributions
's-0.004
ches-0.003
 called-0.003
On-0.003
At-0.003
Tokenummy
Token ID13513
Feature activation-2.14e+00

Pos logit contributions
's+0.048
 called+0.046
 so+0.045
+0.044
At+0.043
Neg logit contributions
joy-0.058
 thrill-0.053
 relief-0.050
viously-0.050
 nods-0.050
Token told
Token ID1297
Feature activation-1.04e+00

Pos logit contributions
's+0.026
 called+0.024
 so+0.023
 closely+0.023
+0.023
Neg logit contributions
joy-0.086
 thrill-0.080
 relief-0.080
 disl-0.079
 nods-0.079
Token Bella
Token ID39304
Feature activation-5.98e+00

Pos logit contributions
's+0.022
+0.020
 called+0.019
 so+0.019
At+0.019
Neg logit contributions
joy-0.035
 thrill-0.032
 relief-0.032
 disl-0.031
 nods-0.029
Token it
Token ID340
Feature activation-1.46e+00

Pos logit contributions
 called+0.063
 pic+0.061
At+0.061
's+0.060
 convey+0.058
Neg logit contributions
joy-0.243
 disl-0.232
 nods-0.221
 thrill-0.220
 relief-0.217
Token was
Token ID373
Feature activation-2.62e+00

Pos logit contributions
 pic+0.020
At+0.019
 off+0.019
 called+0.019
 convey+0.019
Neg logit contributions
joy-0.052
 nods-0.049
 disl-0.048
 relief-0.048
 thrill-0.047
Token ready
Token ID3492
Feature activation-3.75e+00

Pos logit contributions
sc+0.041
's+0.040
 convey+0.038
 aisle+0.037
At+0.037
Neg logit contributions
joy-0.097
 relief-0.091
 sorry-0.087
 thrill-0.085
 disl-0.084
Token.
Token ID13
Feature activation-2.07e-01

Pos logit contributions
ches+0.056
's+0.054
At+0.054
sc+0.053
 convey+0.052
Neg logit contributions
joy-0.139
 relief-0.118
 thrill-0.117
 grin-0.115
ment-0.115
Token She
Token ID1375
Feature activation-1.15e-01

Pos logit contributions
's+0.005
 so+0.004
 and+0.004
 called+0.004
+0.004
Neg logit contributions
joy-0.007
 thrill-0.006
 disl-0.006
 relief-0.006
 nods-0.006
Token hospital
Token ID4436
Feature activation+1.29e+00

Pos logit contributions
's+0.006
 so+0.006
 and+0.006
 called+0.006
+0.006
Neg logit contributions
joy-0.012
 thrill-0.012
 disl-0.011
 relief-0.011
 nods-0.011
Token and
Token ID290
Feature activation-7.15e-01

Pos logit contributions
joy+0.059
 thrill+0.054
 disl+0.053
 relief+0.053
 nods+0.052
Neg logit contributions
's-0.011
-0.010
 so-0.009
 called-0.009
 pic-0.009
Token the
Token ID262
Feature activation-2.83e-01

Pos logit contributions
's+0.012
 so+0.011
 called+0.011
 and+0.011
+0.010
Neg logit contributions
joy-0.028
 thrill-0.027
 disl-0.026
 relief-0.025
 nods-0.025
Token doctor
Token ID6253
Feature activation+1.95e+00

Pos logit contributions
's+0.007
 so+0.007
 and+0.006
 called+0.006
+0.006
Neg logit contributions
joy-0.009
 disl-0.009
 thrill-0.009
 nods-0.008
osity-0.008
Token said
Token ID531
Feature activation-1.91e+00

Pos logit contributions
joy+0.075
 thrill+0.070
 disl+0.068
 relief+0.066
 nods+0.066
Neg logit contributions
's-0.036
 so-0.033
 called-0.033
 and-0.032
-0.032
Token Tina
Token ID35903
Feature activation-5.95e+00

Pos logit contributions
's+0.034
 so+0.032
 and+0.031
+0.031
 called+0.030
Neg logit contributions
joy-0.074
 thrill-0.071
 disl-0.068
 nods-0.066
 relief-0.066
Token had
Token ID550
Feature activation-2.11e+00

Pos logit contributions
's+0.089
 so+0.081
 and+0.079
+0.078
 called+0.078
Neg logit contributions
joy-0.245
 thrill-0.230
 disl-0.225
 relief-0.219
 nods-0.218
Token eaten
Token ID17065
Feature activation-4.72e-01

Pos logit contributions
's+0.029
 so+0.026
 called+0.026
 and+0.026
+0.025
Neg logit contributions
joy-0.088
 thrill-0.082
 disl-0.081
 relief-0.079
 nods-0.078
Token a
Token ID257
Feature activation-7.16e-01

Pos logit contributions
's+0.007
+0.007
 called+0.006
 closely+0.006
 and+0.006
Neg logit contributions
joy-0.019
 thrill-0.017
 relief-0.017
 disl-0.017
 nods-0.016
Token bad
Token ID2089
Feature activation+2.29e+00

Pos logit contributions
's+0.013
 so+0.013
 and+0.012
+0.012
 called+0.011
Neg logit contributions
joy-0.027
 thrill-0.025
 disl-0.025
 nods-0.025
 Majesty-0.024
Token avocado
Token ID40377
Feature activation-8.90e-01

Pos logit contributions
joy+0.069
 disl+0.066
 thrill+0.065
osity+0.063
 nods+0.062
Neg logit contributions
 and-0.059
's-0.057
 so-0.055
 to-0.053
 really-0.051
Token opened
Token ID4721
Feature activation-2.78e+00

Pos logit contributions
joy+0.020
 thrill+0.018
 nods+0.018
 disl+0.017
 relief+0.017
Neg logit contributions
's-0.020
 so-0.019
 called-0.018
-0.018
 off-0.018
Token the
Token ID262
Feature activation+4.56e-02

Pos logit contributions
's+0.046
 called+0.045
sc+0.045
+0.044
 pic+0.043
Neg logit contributions
joy-0.107
 nods-0.089
 relief-0.087
 thrill-0.085
 disl-0.083
Token bag
Token ID6131
Feature activation-1.31e-01

Pos logit contributions
joy+0.001
 beard+0.001
 thrill+0.001
 relief+0.001
 nods+0.001
Neg logit contributions
ide-0.001
 called-0.001
ches-0.001
During-0.001
 often-0.001
Token and
Token ID290
Feature activation-7.88e-01

Pos logit contributions
 pic+0.001
+0.001
ches+0.001
 separate+0.001
's+0.001
Neg logit contributions
joy-0.006
 nods-0.005
 thrill-0.005
 grin-0.005
 nodded-0.005
Token showed
Token ID3751
Feature activation-1.44e+00

Pos logit contributions
's+0.013
 so+0.013
+0.012
 and+0.012
 called+0.012
Neg logit contributions
joy-0.030
 thrill-0.027
 nods-0.026
 disl-0.026
 relief-0.026
Token Emily
Token ID17608
Feature activation-5.95e+00

Pos logit contributions
 called+0.029
's+0.029
 pic+0.029
On+0.028
At+0.028
Neg logit contributions
joy-0.045
 relief-0.041
 nods-0.040
 thrill-0.040
 disl-0.040
Token the
Token ID262
Feature activation-2.76e+00

Pos logit contributions
 pic+0.100
 called+0.099
sc+0.098
ches+0.089
At+0.088
Neg logit contributions
joy-0.215
 disl-0.193
 nods-0.192
 thrill-0.183
 grin-0.182
Token big
Token ID1263
Feature activation-2.88e+00

Pos logit contributions
ches+0.062
 called+0.060
 off+0.057
+0.056
 into+0.056
Neg logit contributions
joy-0.093
 thrill-0.077
 relief-0.074
 beard-0.073
 Majesty-0.071
Token box
Token ID3091
Feature activation-2.86e+00

Pos logit contributions
's+0.059
+0.057
 called+0.056
At+0.055
 off+0.055
Neg logit contributions
joy-0.100
 thrill-0.087
 nods-0.086
 grin-0.083
 relief-0.082
Token inside
Token ID2641
Feature activation-7.12e-01

Pos logit contributions
 pic+0.051
ches+0.041
During+0.036
At+0.036
sc+0.035
Neg logit contributions
joy-0.104
 grin-0.096
 thrill-0.095
 nods-0.095
 sincere-0.090
Token.
Token ID13
Feature activation-7.25e-01

Pos logit contributions
ches+0.009
's+0.009
+0.008
 pic+0.008
sc+0.008
Neg logit contributions
joy-0.029
 thrill-0.026
 relief-0.025
 nods-0.025
 disl-0.024
Token day
Token ID1110
Feature activation+7.89e-01

Pos logit contributions
joy+0.002
 thrill+0.002
 relief+0.002
 disl+0.002
 nods+0.002
Neg logit contributions
's-0.001
 so-0.001
 called-0.001
 into-0.001
 closely-0.001
Token,
Token ID11
Feature activation+1.07e+00

Pos logit contributions
joy+0.033
 thrill+0.030
 nods+0.028
 relief+0.028
 disl+0.028
Neg logit contributions
's-0.010
ches-0.010
-0.009
 called-0.009
 closely-0.009
Token the
Token ID262
Feature activation-4.17e-01

Pos logit contributions
joy+0.045
 disl+0.042
 thrill+0.042
 Majesty+0.040
 relief+0.040
Neg logit contributions
's-0.016
 so-0.015
 and-0.015
 called-0.014
-0.014
Token wagon
Token ID31860
Feature activation-9.75e-01

Pos logit contributions
 and+0.007
 so+0.007
's+0.007
 called+0.007
+0.007
Neg logit contributions
joy-0.016
 disl-0.016
 Majesty-0.015
 thrill-0.015
ses-0.014
Token told
Token ID1297
Feature activation-6.64e-01

Pos logit contributions
's+0.017
 so+0.016
+0.016
 called+0.015
 closely+0.015
Neg logit contributions
joy-0.036
 thrill-0.034
 nods-0.033
 relief-0.032
 disl-0.032
Token Mia
Token ID32189
Feature activation-5.94e+00

Pos logit contributions
's+0.013
+0.012
 so+0.012
 called+0.012
 and+0.012
Neg logit contributions
joy-0.024
 thrill-0.022
 disl-0.022
 relief-0.022
 nods-0.021
Token,
Token ID11
Feature activation-7.45e-01

Pos logit contributions
sc+0.047
 pic+0.047
At+0.045
 shelf+0.044
+0.043
Neg logit contributions
joy-0.260
 nods-0.241
 disl-0.241
 thrill-0.235
 relief-0.232
Token "
Token ID366
Feature activation+4.36e-01

Pos logit contributions
's+0.015
 and+0.014
 so+0.014
 called+0.014
+0.013
Neg logit contributions
joy-0.027
 thrill-0.026
 disl-0.025
 Majesty-0.024
 relief-0.024
TokenThank
Token ID10449
Feature activation+8.32e-01

Pos logit contributions
joy+0.013
 disl+0.012
 thrill+0.012
 Majesty+0.012
ses+0.012
Neg logit contributions
's-0.012
-0.011
 and-0.011
 so-0.011
 called-0.011
Token you
Token ID345
Feature activation+9.51e-01

Pos logit contributions
joy+0.031
 thrill+0.029
 disl+0.028
 relief+0.028
 nods+0.027
Neg logit contributions
's-0.015
 so-0.014
 and-0.014
 called-0.014
-0.014
Token for
Token ID329
Feature activation-2.92e-02

Pos logit contributions
joy+0.034
 sorry+0.031
 relief+0.030
 tweet+0.030
 thrill+0.029
Neg logit contributions
 unw-0.017
At-0.017
 closely-0.017
On-0.016
 pic-0.015
Token They
Token ID1119
Feature activation-6.31e-01

Pos logit contributions
's+0.000
 and+0.000
+0.000
 so+0.000
 called+0.000
Neg logit contributions
joy-0.001
 thrill-0.001
 disl-0.001
 liar-0.001
 relief-0.001
Token were
Token ID547
Feature activation+4.89e-01

Pos logit contributions
's+0.013
 so+0.012
 called+0.012
 and+0.012
+0.012
Neg logit contributions
joy-0.023
 thrill-0.021
 disl-0.021
 Majesty-0.021
 relief-0.020
Token so
Token ID523
Feature activation+1.03e-02

Pos logit contributions
joy+0.020
 thrill+0.019
 relief+0.018
 disl+0.018
 winners+0.018
Neg logit contributions
's-0.007
-0.006
ches-0.006
 called-0.006
 and-0.006
Token proud
Token ID6613
Feature activation+6.01e-01

Pos logit contributions
joy+0.000
 thrill+0.000
 disl+0.000
 relief+0.000
 nods+0.000
Neg logit contributions
's-0.000
 so-0.000
 and-0.000
-0.000
 called-0.000
Token of
Token ID286
Feature activation-2.25e+00

Pos logit contributions
joy+0.024
 thrill+0.022
 disl+0.021
 nods+0.021
 relief+0.021
Neg logit contributions
's-0.008
ches-0.008
-0.007
 shelf-0.007
 unw-0.007
Token Maria
Token ID14200
Feature activation-5.91e+00

Pos logit contributions
's+0.055
 so+0.052
 and+0.051
 called+0.051
+0.051
Neg logit contributions
joy-0.071
 thrill-0.066
 disl-0.064
 relief-0.061
 nods-0.061
Token.
Token ID13
Feature activation+3.46e-01

Pos logit contributions
At+0.056
On+0.055
sc+0.054
 pic+0.052
During+0.050
Neg logit contributions
joy-0.253
 grin-0.226
 nods-0.225
 relief-0.225
 thrill-0.223
Token She
Token ID1375
Feature activation-6.04e-01

Pos logit contributions
joy+0.014
 thrill+0.014
 disl+0.013
 liar+0.013
 relief+0.013
Neg logit contributions
's-0.006
-0.005
 and-0.005
 so-0.005
 called-0.005
Token had
Token ID550
Feature activation+1.50e-01

Pos logit contributions
's+0.012
 off+0.011
 closely+0.011
 so+0.011
 into+0.011
Neg logit contributions
joy-0.020
 thrill-0.019
 disl-0.019
 nods-0.019
 relief-0.018
Token so
Token ID523
Feature activation+1.37e+00

Pos logit contributions
joy+0.005
 thrill+0.005
 disl+0.005
 relief+0.004
 nods+0.004
Neg logit contributions
's-0.003
 and-0.003
 closely-0.003
-0.003
 from-0.003
Token much
Token ID881
Feature activation+7.21e-01

Pos logit contributions
joy+0.065
 Majesty+0.062
 disl+0.062
 thrill+0.062
osity+0.060
Neg logit contributions
 so-0.012
's-0.012
-0.012
 and-0.011
 really-0.010
Token
Token ID198
Feature activation-1.03e+00

Pos logit contributions
's+0.012
 so+0.011
 and+0.011
 called+0.011
+0.011
Neg logit contributions
joy-0.027
 thrill-0.026
 disl-0.025
 relief-0.024
 nods-0.024
Token
Token ID198
Feature activation-7.13e-01

Pos logit contributions
's+0.019
 so+0.018
 called+0.017
 off+0.017
 from+0.016
Neg logit contributions
joy-0.037
 disl-0.034
 thrill-0.034
 nods-0.033
 relief-0.033
TokenM
Token ID44
Feature activation-2.60e+00

Pos logit contributions
's+0.017
 so+0.017
 called+0.016
 off+0.016
 and+0.016
Neg logit contributions
joy-0.022
 disl-0.020
 thrill-0.019
 nods-0.019
 relief-0.019
Tokenum
Token ID388
Feature activation+1.09e+00

Pos logit contributions
's+0.050
 and+0.047
 so+0.046
 called+0.046
+0.046
Neg logit contributions
joy-0.096
 thrill-0.090
 disl-0.087
 relief-0.085
 nods-0.083
Token showed
Token ID3751
Feature activation-1.53e+00

Pos logit contributions
joy+0.045
 nods+0.043
 nodded+0.042
 grin+0.041
 thrill+0.041
Neg logit contributions
 pic-0.013
sc-0.012
 off-0.011
 so-0.010
 shelf-0.010
Token Maria
Token ID14200
Feature activation-5.90e+00

Pos logit contributions
's+0.039
 called+0.037
 so+0.037
+0.037
 and+0.036
Neg logit contributions
joy-0.044
 thrill-0.040
 disl-0.039
 nods-0.038
 relief-0.038
Token how
Token ID703
Feature activation-1.76e-01

Pos logit contributions
sc+0.109
 pic+0.108
 called+0.106
 lin+0.099
 shelf+0.099
Neg logit contributions
joy-0.199
 nods-0.187
 disl-0.181
 thrill-0.174
 Majesty-0.171
Token to
Token ID284
Feature activation-2.69e-02

Pos logit contributions
's+0.001
 called+0.001
 so+0.001
 and+0.001
+0.001
Neg logit contributions
joy-0.009
 disl-0.008
 thrill-0.008
 relief-0.008
 nods-0.008
Token count
Token ID954
Feature activation-2.52e+00

Pos logit contributions
's+0.001
 so+0.001
 pic+0.001
 from+0.000
 off+0.000
Neg logit contributions
 thrill-0.001
 disl-0.001
joy-0.001
 nods-0.001
 relief-0.001
Token as
Token ID355
Feature activation-9.28e-01

Pos logit contributions
 pic+0.040
's+0.038
ches+0.038
+0.037
 lin+0.037
Neg logit contributions
joy-0.090
 nods-0.081
 thrill-0.078
ment-0.075
 winners-0.074
Token she
Token ID673
Feature activation-1.29e+00

Pos logit contributions
's+0.015
 called+0.013
 so+0.013
+0.013
 and+0.012
Neg logit contributions
joy-0.036
 thrill-0.034
 disl-0.033
 nods-0.032
 relief-0.032
Token friendly
Token ID8030
Feature activation+1.05e+00

Pos logit contributions
joy+0.012
 thrill+0.011
 disl+0.011
 nods+0.010
ses+0.010
Neg logit contributions
 so-0.010
's-0.009
 called-0.009
 really-0.009
-0.009
Token,
Token ID11
Feature activation+6.53e-01

Pos logit contributions
joy+0.052
 thrill+0.047
 nods+0.046
 relief+0.046
 disl+0.046
Neg logit contributions
's-0.007
-0.006
 called-0.005
ches-0.005
At-0.004
Token and
Token ID290
Feature activation-4.87e-01

Pos logit contributions
joy+0.028
 thrill+0.026
 relief+0.025
 disl+0.024
 nods+0.024
Neg logit contributions
's-0.008
ches-0.007
-0.006
 called-0.006
 off-0.006
Token he
Token ID339
Feature activation+4.04e-01

Pos logit contributions
's+0.011
 so+0.010
 called+0.010
 and+0.010
+0.010
Neg logit contributions
joy-0.017
 thrill-0.016
 disl-0.015
 relief-0.015
 Majesty-0.015
Token taught
Token ID7817
Feature activation-7.89e-01

Pos logit contributions
joy+0.014
 thrill+0.012
 disl+0.012
 nods+0.012
 Majesty+0.012
Neg logit contributions
's-0.008
 so-0.008
-0.008
 off-0.008
 and-0.007
Token Ted
Token ID11396
Feature activation-5.89e+00

Pos logit contributions
's+0.017
 so+0.015
 called+0.015
+0.015
 and+0.015
Neg logit contributions
joy-0.027
 thrill-0.025
 disl-0.025
 relief-0.024
 nods-0.024
Token lots
Token ID6041
Feature activation-3.11e+00

Pos logit contributions
's+0.086
 called+0.078
 so+0.078
+0.076
 and+0.076
Neg logit contributions
joy-0.240
 thrill-0.224
 disl-0.221
 nods-0.214
 relief-0.213
Token of
Token ID286
Feature activation-1.83e+00

Pos logit contributions
's+0.049
 unw+0.046
 separate+0.046
 lin+0.045
+0.045
Neg logit contributions
joy-0.114
 thrill-0.108
 disl-0.106
 nods-0.104
 relief-0.101
Token new
Token ID649
Feature activation+9.57e-01

Pos logit contributions
's+0.041
 and+0.041
 so+0.041
 called+0.040
+0.036
Neg logit contributions
joy-0.062
 thrill-0.057
 disl-0.054
 Majesty-0.054
ses-0.053
Token things
Token ID1243
Feature activation-8.89e-01

Pos logit contributions
joy+0.031
 thrill+0.029
 disl+0.027
 relief+0.027
 nods+0.026
Neg logit contributions
's-0.023
 and-0.023
 so-0.022
 called-0.022
-0.021
Token.
Token ID13
Feature activation-2.73e-01

Pos logit contributions
's+0.009
 called+0.008
+0.008
 so+0.008
 and+0.007
Neg logit contributions
joy-0.039
 disl-0.036
 thrill-0.036
 nods-0.035
 relief-0.035
Token somewhere
Token ID7382
Feature activation+1.35e+00

Pos logit contributions
+0.015
 pic+0.015
 lin+0.015
mary+0.015
's+0.015
Neg logit contributions
joy-0.037
 thrill-0.030
 sorry-0.030
 relief-0.030
 nods-0.029
Token.
Token ID13
Feature activation-6.96e-02

Pos logit contributions
joy+0.046
 thrill+0.043
 relief+0.042
 disl+0.040
 sorry+0.040
Neg logit contributions
's-0.023
ches-0.022
mary-0.022
 lin-0.022
-0.021
Token She
Token ID1375
Feature activation-6.72e-02

Pos logit contributions
 pic+0.002
's+0.002
ches+0.002
+0.002
 from+0.002
Neg logit contributions
joy-0.002
 nods-0.002
 disl-0.002
 thrill-0.002
 Thank-0.002
Token told
Token ID1297
Feature activation-1.83e-01

Pos logit contributions
's+0.001
 off+0.001
sc+0.001
 pic+0.001
 closely+0.001
Neg logit contributions
joy-0.002
 nods-0.002
 nodded-0.002
 thrill-0.002
 relief-0.002
Token L
Token ID406
Feature activation-3.88e+00

Pos logit contributions
's+0.004
 pic+0.004
+0.004
sc+0.004
mary+0.004
Neg logit contributions
joy-0.005
 thrill-0.005
 disl-0.005
 relief-0.005
 hey-0.005
Tokenizzy
Token ID40593
Feature activation-5.89e+00

Pos logit contributions
's+0.123
 separate+0.118
 so+0.118
 off+0.117
 pic+0.116
Neg logit contributions
joy-0.088
 thrill-0.072
iness-0.070
 disl-0.068
 nods-0.066
Token to
Token ID284
Feature activation+2.76e-01

Pos logit contributions
's+0.070
 called+0.067
 so+0.063
+0.063
 unw+0.061
Neg logit contributions
joy-0.244
 disl-0.230
 thrill-0.225
 nods-0.222
 Majesty-0.218
Token stay
Token ID2652
Feature activation+1.58e+00

Pos logit contributions
joy+0.010
 thrill+0.010
 disl+0.009
 relief+0.009
 nods+0.009
Neg logit contributions
's-0.005
 so-0.005
 called-0.004
-0.004
 into-0.004
Token put
Token ID1234
Feature activation-8.02e-01

Pos logit contributions
joy+0.065
 relief+0.058
 thrill+0.058
 sorry+0.055
 nods+0.055
Neg logit contributions
's-0.023
-0.023
ches-0.020
sc-0.019
 pic-0.018
Token and
Token ID290
Feature activation-4.05e-01

Pos logit contributions
's+0.011
+0.011
 called+0.009
 closely+0.009
At+0.009
Neg logit contributions
joy-0.032
 thrill-0.030
 relief-0.030
 nods-0.029
 disl-0.029
Token not
Token ID407
Feature activation+1.06e+00

Pos logit contributions
's+0.007
 so+0.007
 called+0.007
 and+0.007
+0.006
Neg logit contributions
joy-0.017
 thrill-0.015
 disl-0.015
viously-0.014
ses-0.014
Tokenage
Token ID496
Feature activation+3.40e+00

Pos logit contributions
joy+0.149
 thrill+0.133
Really+0.129
 Majesty+0.129
viously+0.125
Neg logit contributions
 and-0.090
's-0.088
 called-0.083
 so-0.082
 to-0.080
Token.
Token ID13
Feature activation-9.85e-01

Pos logit contributions
joy+0.161
 thrill+0.153
 disl+0.150
 relief+0.147
 nods+0.146
Neg logit contributions
's-0.028
 so-0.024
 called-0.023
-0.023
 and-0.022
Token The
Token ID383
Feature activation+2.67e-01

Pos logit contributions
's+0.017
 so+0.016
 and+0.016
+0.015
 called+0.015
Neg logit contributions
joy-0.039
 thrill-0.036
 disl-0.036
 Majesty-0.034
 relief-0.034
Token vet
Token ID20202
Feature activation+2.22e+00

Pos logit contributions
joy+0.009
 disl+0.009
osity+0.008
 thrill+0.008
 nods+0.008
Neg logit contributions
's-0.006
 and-0.006
 called-0.006
 so-0.006
 from-0.006
Token said
Token ID531
Feature activation-1.61e+00

Pos logit contributions
joy+0.089
 thrill+0.083
 disl+0.082
 relief+0.079
 nods+0.079
Neg logit contributions
's-0.036
 so-0.033
 and-0.032
 called-0.032
-0.032
Token Luna
Token ID23694
Feature activation-5.85e+00

Pos logit contributions
's+0.026
 so+0.024
 called+0.024
 and+0.024
+0.024
Neg logit contributions
joy-0.064
 thrill-0.059
 disl-0.058
 relief-0.056
 nods-0.056
Token would
Token ID561
Feature activation-1.63e+00

Pos logit contributions
's+0.081
 so+0.077
 called+0.077
+0.074
 and+0.072
Neg logit contributions
joy-0.236
 thrill-0.220
 disl-0.219
 nods-0.214
 relief-0.212
Token heal
Token ID12035
Feature activation+1.00e+00

Pos logit contributions
's+0.032
 so+0.030
 called+0.029
+0.029
 and+0.029
Neg logit contributions
joy-0.059
 thrill-0.055
 disl-0.054
 relief-0.052
 nods-0.052
Token soon
Token ID2582
Feature activation-4.19e+00

Pos logit contributions
joy+0.040
 thrill+0.037
 disl+0.037
 relief+0.036
 nods+0.036
Neg logit contributions
's-0.016
 so-0.015
 called-0.014
-0.014
 and-0.014
Token,
Token ID11
Feature activation-5.32e-01

Pos logit contributions
+0.042
's+0.041
 closely+0.040
At+0.039
 pic+0.039
Neg logit contributions
joy-0.170
 disl-0.159
 thrill-0.158
 relief-0.156
 nods-0.154
Token but
Token ID475
Feature activation-6.35e-01

Pos logit contributions
's+0.006
+0.005
 called+0.005
 so+0.005
 off+0.005
Neg logit contributions
joy-0.023
 thrill-0.022
 disl-0.021
 relief-0.021
 nods-0.021
Token't
Token ID470
Feature activation-6.06e-01

Pos logit contributions
 sorry+0.019
 disl+0.019
joy+0.018
Sorry+0.017
 Amen+0.017
Neg logit contributions
 pic-0.031
 called-0.031
 unw-0.030
 closely-0.030
mary-0.030
Token you
Token ID345
Feature activation+1.93e+00

Pos logit contributions
sc+0.015
ches+0.015
mary+0.014
At+0.014
 called+0.014
Neg logit contributions
joy-0.016
 disl-0.016
 sorry-0.014
 thrill-0.014
 Majesty-0.014
Token invite
Token ID14037
Feature activation-2.11e+00

Pos logit contributions
 thrill+0.057
 disl+0.055
joy+0.054
 relief+0.054
 nod+0.053
Neg logit contributions
's-0.039
 pic-0.039
sc-0.037
 so-0.037
 into-0.036
Token your
Token ID534
Feature activation-3.30e-02

Pos logit contributions
+0.034
's+0.033
sc+0.031
 lin+0.031
 pic+0.029
Neg logit contributions
joy-0.074
 relief-0.070
 disl-0.069
 thrill-0.069
 sorry-0.068
Token friend
Token ID1545
Feature activation+1.33e+00

Pos logit contributions
's+0.001
 called+0.001
+0.001
ches+0.001
 closely+0.001
Neg logit contributions
joy-0.001
 thrill-0.001
 disl-0.001
 Majesty-0.001
 relief-0.001
Token Emily
Token ID17608
Feature activation-5.80e+00

Pos logit contributions
joy+0.042
 nods+0.038
 disl+0.038
 yeah+0.037
 sorry+0.036
Neg logit contributions
 closely-0.026
sc-0.024
-0.024
 unw-0.024
 convey-0.023
Token over
Token ID625
Feature activation-7.86e-01

Pos logit contributions
 lin+0.087
 closely+0.086
 convey+0.083
 unw+0.083
 pic+0.081
Neg logit contributions
joy-0.206
 disl-0.177
 nods-0.174
 tweet-0.171
 sorry-0.170
Token to
Token ID284
Feature activation-1.58e-01

Pos logit contributions
's+0.010
ches+0.010
+0.010
 unw+0.009
 closely+0.009
Neg logit contributions
joy-0.034
 disl-0.028
 nods-0.028
 relief-0.027
 thrill-0.027
Token play
Token ID711
Feature activation-1.01e+00

Pos logit contributions
's+0.003
+0.003
sc+0.003
ches+0.003
 closely+0.003
Neg logit contributions
joy-0.005
 disl-0.005
 thrill-0.005
 relief-0.005
 Majesty-0.004
Token with
Token ID351
Feature activation-2.96e+00

Pos logit contributions
+0.012
's+0.012
 closely+0.010
 called+0.010
 off+0.010
Neg logit contributions
joy-0.042
 disl-0.038
 thrill-0.038
 nods-0.037
 relief-0.037
Token you
Token ID345
Feature activation+7.72e-01

Pos logit contributions
+0.051
's+0.050
 unw+0.045
 closely+0.045
 so+0.045
Neg logit contributions
joy-0.104
 disl-0.095
 nods-0.090
 relief-0.090
 thrill-0.089
Token she
Token ID673
Feature activation+1.14e-02

Pos logit contributions
's+0.056
ches+0.051
+0.049
sc+0.048
 so+0.048
Neg logit contributions
joy-0.055
 thrill-0.047
 relief-0.043
 tweet-0.039
 disl-0.039
Token walked
Token ID6807
Feature activation-5.37e-02

Pos logit contributions
joy+0.000
 thrill+0.000
 nods+0.000
 relief+0.000
 disl+0.000
Neg logit contributions
's-0.000
 so-0.000
-0.000
 off-0.000
 from-0.000
Token she
Token ID673
Feature activation+8.64e-03

Pos logit contributions
 and+0.001
's+0.001
 so+0.001
 called+0.000
+0.000
Neg logit contributions
joy-0.002
 disl-0.002
 thrill-0.002
 Majesty-0.002
 relief-0.002
Token heard
Token ID2982
Feature activation+5.91e-01

Pos logit contributions
joy+0.000
 thrill+0.000
 disl+0.000
 relief+0.000
 nods+0.000
Neg logit contributions
's-0.000
 called-0.000
 so-0.000
 and-0.000
-0.000
Token a
Token ID257
Feature activation+8.08e-01

Pos logit contributions
joy+0.026
 thrill+0.025
 disl+0.024
 Majesty+0.024
 nods+0.023
Neg logit contributions
's-0.008
 so-0.007
 and-0.007
 called-0.007
-0.006
Token sound
Token ID2128
Feature activation+4.87e+00

Pos logit contributions
joy+0.027
 thrill+0.024
 disl+0.024
 Majesty+0.023
 nods+0.023
Neg logit contributions
's-0.019
 so-0.019
 and-0.018
 called-0.018
-0.018
Token.
Token ID13
Feature activation+6.44e-01

Pos logit contributions
joy+0.223
 thrill+0.212
 disl+0.206
 relief+0.203
 nods+0.201
Neg logit contributions
's-0.045
 so-0.038
-0.037
 called-0.037
 and-0.034
Token It
Token ID632
Feature activation+2.21e+00

Pos logit contributions
joy+0.022
 thrill+0.022
 disl+0.021
 winners+0.020
 Majesty+0.020
Neg logit contributions
's-0.015
 and-0.014
-0.014
 so-0.014
 called-0.013
Token was
Token ID373
Feature activation-3.29e-01

Pos logit contributions
joy+0.112
 thrill+0.103
 disl+0.102
 Majesty+0.102
 winners+0.099
Neg logit contributions
's-0.021
 called-0.017
 so-0.017
 and-0.016
-0.016
Token a
Token ID257
Feature activation-6.74e-01

Pos logit contributions
's+0.005
+0.005
ches+0.005
 closely+0.004
 off+0.004
Neg logit contributions
joy-0.013
 thrill-0.012
 relief-0.012
 disl-0.011
 nods-0.011
Token little
Token ID1310
Feature activation+3.84e-01

Pos logit contributions
's+0.013
 so+0.013
 called+0.013
 and+0.012
+0.012
Neg logit contributions
joy-0.025
 disl-0.023
 thrill-0.023
 nods-0.023
 Majesty-0.022
Token far
Token ID1290
Feature activation-2.12e-01

Pos logit contributions
joy+0.090
 thrill+0.084
 disl+0.081
 relief+0.080
 nods+0.078
Neg logit contributions
's-0.034
 called-0.029
 so-0.029
-0.028
 and-0.027
Token.
Token ID13
Feature activation+6.00e-01

Pos logit contributions
's+0.003
 pic+0.003
+0.002
 called+0.002
 lin+0.002
Neg logit contributions
joy-0.009
 thrill-0.008
 relief-0.008
 nods-0.008
 disl-0.008
Token She
Token ID1375
Feature activation+1.75e-01

Pos logit contributions
joy+0.021
 thrill+0.020
 disl+0.019
 relief+0.019
 nods+0.018
Neg logit contributions
's-0.013
 so-0.012
 and-0.012
-0.012
 called-0.012
Token started
Token ID2067
Feature activation-5.73e-01

Pos logit contributions
joy+0.006
 thrill+0.006
 disl+0.005
 nods+0.005
 relief+0.005
Neg logit contributions
's-0.004
 so-0.003
 called-0.003
 and-0.003
-0.003
Token to
Token ID284
Feature activation+7.54e-01

Pos logit contributions
's+0.005
 called+0.004
+0.004
 so+0.004
 shelf+0.003
Neg logit contributions
joy-0.027
 disl-0.025
 thrill-0.025
 nods-0.024
 relief-0.024
Token regret
Token ID13721
Feature activation+4.98e+00

Pos logit contributions
joy+0.024
 thrill+0.021
 disl+0.020
 relief+0.019
 nods+0.019
Neg logit contributions
's-0.020
 and-0.019
 so-0.019
-0.019
 called-0.019
Token her
Token ID607
Feature activation-9.47e-01

Pos logit contributions
joy+0.189
 thrill+0.176
 disl+0.172
 relief+0.167
 nods+0.165
Neg logit contributions
's-0.089
 so-0.081
 called-0.081
-0.081
 and-0.079
Token decision
Token ID2551
Feature activation+2.07e+00

Pos logit contributions
's+0.024
 so+0.022
 called+0.022
+0.022
 and+0.022
Neg logit contributions
joy-0.029
 thrill-0.027
 disl-0.026
 relief-0.026
 nods-0.025
Token to
Token ID284
Feature activation+4.75e-01

Pos logit contributions
joy+0.092
 thrill+0.083
 relief+0.081
 nods+0.080
 disl+0.079
Neg logit contributions
 pic-0.018
's-0.017
 lin-0.016
 called-0.015
 unw-0.014
Token go
Token ID467
Feature activation-8.58e-01

Pos logit contributions
joy+0.018
 thrill+0.016
 disl+0.016
 nods+0.016
 Majesty+0.015
Neg logit contributions
's-0.009
 so-0.009
 and-0.009
 called-0.008
-0.008
Token so
Token ID523
Feature activation+1.35e-01

Pos logit contributions
's+0.015
+0.013
 lin+0.013
 shelf+0.013
 unw+0.013
Neg logit contributions
joy-0.032
 thrill-0.028
 relief-0.027
 disl-0.026
 sorry-0.026
Token
Token ID198
Feature activation-2.71e-01

Pos logit contributions
's+0.013
 so+0.012
 called+0.011
 and+0.011
 off+0.011
Neg logit contributions
joy-0.027
 thrill-0.025
 disl-0.025
 relief-0.024
 nods-0.024
TokenHe
Token ID1544
Feature activation+6.22e-01

Pos logit contributions
's+0.005
 so+0.005
 and+0.005
 called+0.005
 off+0.005
Neg logit contributions
joy-0.010
 thrill-0.009
 disl-0.009
 relief-0.008
 nods-0.008
Token then
Token ID788
Feature activation+8.02e-01

Pos logit contributions
joy+0.021
 nods+0.019
 thrill+0.019
 disl+0.018
 Majesty+0.018
Neg logit contributions
's-0.012
 so-0.012
 shelf-0.011
 from-0.011
 pic-0.011
Token heard
Token ID2982
Feature activation+1.27e+00

Pos logit contributions
joy+0.030
 thrill+0.028
 disl+0.027
 relief+0.027
 Majesty+0.026
Neg logit contributions
's-0.016
 called-0.015
 and-0.015
 so-0.014
-0.014
Token a
Token ID257
Feature activation+5.33e-01

Pos logit contributions
joy+0.054
 thrill+0.051
 disl+0.050
 nods+0.049
 relief+0.049
Neg logit contributions
's-0.017
 so-0.016
 and-0.016
 called-0.015
-0.015
Token voice
Token ID3809
Feature activation+4.89e+00

Pos logit contributions
joy+0.016
 thrill+0.014
 disl+0.014
 Majesty+0.013
ses+0.013
Neg logit contributions
's-0.015
 so-0.015
 and-0.014
-0.014
 called-0.014
Token coming
Token ID2406
Feature activation+8.09e-01

Pos logit contributions
joy+0.211
 thrill+0.199
 disl+0.195
 relief+0.192
 nods+0.192
Neg logit contributions
's-0.057
 so-0.050
-0.048
 called-0.046
 and-0.046
Token from
Token ID422
Feature activation-1.15e+00

Pos logit contributions
joy+0.037
 thrill+0.034
 relief+0.033
 disl+0.032
 tweet+0.032
Neg logit contributions
's-0.007
 pic-0.007
ches-0.005
At-0.005
 shelf-0.005
Token the
Token ID262
Feature activation-1.35e-01

Pos logit contributions
 and+0.011
 so+0.010
 called+0.010
's+0.009
 from+0.009
Neg logit contributions
joy-0.053
 thrill-0.051
 disl-0.051
 Majesty-0.050
 nods-0.049
Token dust
Token ID8977
Feature activation-1.66e+00

Pos logit contributions
 so+0.002
's+0.002
 and+0.002
+0.002
 called+0.002
Neg logit contributions
joy-0.005
 disl-0.005
 nods-0.005
 thrill-0.005
 Majesty-0.005
Token cloud
Token ID6279
Feature activation-1.21e+00

Pos logit contributions
's+0.020
+0.018
 so+0.018
 called+0.017
 and+0.016
Neg logit contributions
joy-0.070
 thrill-0.066
 disl-0.063
 relief-0.063
 nods-0.063
Token.
Token ID13
Feature activation-6.33e-01

Pos logit contributions
 so+0.032
's+0.032
 off+0.030
 pic+0.030
+0.030
Neg logit contributions
joy-0.125
 disl-0.110
 nods-0.108
 thrill-0.107
 sincere-0.105
Token Max
Token ID5436
Feature activation-1.79e+00

Pos logit contributions
's+0.012
 so+0.011
 called+0.011
+0.011
 and+0.011
Neg logit contributions
joy-0.023
 thrill-0.021
 disl-0.021
 nods-0.020
 relief-0.020
Token had
Token ID550
Feature activation-1.65e-01

Pos logit contributions
 off+0.033
 from+0.031
 into+0.031
sc+0.031
 lin+0.030
Neg logit contributions
joy-0.062
 disl-0.057
 nods-0.057
 thrill-0.053
 sincere-0.052
Token a
Token ID257
Feature activation+6.71e-01

Pos logit contributions
's+0.002
+0.002
 closely+0.002
 and+0.002
 called+0.002
Neg logit contributions
joy-0.007
 thrill-0.006
 disl-0.006
 relief-0.006
 nods-0.006
Token big
Token ID1263
Feature activation+6.30e-01

Pos logit contributions
joy+0.028
 thrill+0.026
 disl+0.026
 relief+0.025
 nods+0.025
Neg logit contributions
's-0.010
 so-0.009
 and-0.009
 called-0.009
-0.008
Token heart
Token ID2612
Feature activation+4.94e+00

Pos logit contributions
joy+0.024
 disl+0.023
 thrill+0.023
osity+0.022
 relief+0.022
Neg logit contributions
 and-0.011
's-0.011
 so-0.010
 called-0.010
 really-0.009
Token and
Token ID290
Feature activation-4.18e-02

Pos logit contributions
joy+0.231
 thrill+0.219
 disl+0.212
 nods+0.209
 relief+0.207
Neg logit contributions
's-0.038
-0.034
 so-0.030
 called-0.029
 pic-0.027
Token he
Token ID339
Feature activation+5.77e-01

Pos logit contributions
's+0.001
 so+0.001
 called+0.001
 and+0.001
+0.001
Neg logit contributions
joy-0.002
 thrill-0.002
 disl-0.001
 relief-0.001
 Majesty-0.001
Token was
Token ID373
Feature activation+6.20e-01

Pos logit contributions
joy+0.018
 disl+0.017
 thrill+0.017
 nods+0.017
 Majesty+0.016
Neg logit contributions
's-0.012
-0.012
 off-0.012
 so-0.012
 from-0.012
Token very
Token ID845
Feature activation+3.64e-02

Pos logit contributions
joy+0.027
 thrill+0.025
 disl+0.025
 relief+0.024
 nods+0.024
Neg logit contributions
's-0.007
-0.006
 and-0.006
 called-0.006
 so-0.006
Token kind
Token ID1611
Feature activation+1.13e+00

Pos logit contributions
joy+0.001
 thrill+0.001
 relief+0.001
 disl+0.001
 nods+0.001
Neg logit contributions
's-0.001
 so-0.001
 called-0.001
 and-0.001
 really-0.001
Token street
Token ID4675
Feature activation+6.34e-01

Pos logit contributions
joy+0.020
 thrill+0.020
 nods+0.018
 Majesty+0.018
 disl+0.018
Neg logit contributions
's-0.013
ches-0.013
-0.013
 called-0.012
 unw-0.012
Token,
Token ID11
Feature activation+1.27e+00

Pos logit contributions
joy+0.030
 thrill+0.028
 disl+0.028
 relief+0.027
 nods+0.027
Neg logit contributions
's-0.006
 so-0.005
 and-0.005
 called-0.005
-0.005
Token she
Token ID673
Feature activation-1.08e-01

Pos logit contributions
joy+0.052
 disl+0.050
 thrill+0.049
 Majesty+0.048
 nods+0.047
Neg logit contributions
 so-0.021
's-0.020
 and-0.020
-0.018
 called-0.018
Token heard
Token ID2982
Feature activation+1.07e+00

Pos logit contributions
's+0.003
 called+0.003
 and+0.003
 so+0.003
+0.002
Neg logit contributions
joy-0.003
 thrill-0.003
 disl-0.003
 relief-0.003
 nods-0.003
Token a
Token ID257
Feature activation+8.09e-01

Pos logit contributions
joy+0.050
 thrill+0.047
 disl+0.047
 Majesty+0.046
ses+0.046
Neg logit contributions
's-0.012
 so-0.012
 and-0.011
 called-0.010
-0.010
Token sound
Token ID2128
Feature activation+4.95e+00

Pos logit contributions
joy+0.027
 thrill+0.024
 disl+0.024
 Majesty+0.023
 nods+0.023
Neg logit contributions
's-0.019
 so-0.019
 and-0.018
 called-0.018
-0.018
Token.
Token ID13
Feature activation+6.51e-01

Pos logit contributions
joy+0.227
 thrill+0.215
 disl+0.210
 relief+0.206
 nods+0.204
Neg logit contributions
's-0.049
 so-0.043
 called-0.041
-0.041
 and-0.041
Token It
Token ID632
Feature activation+2.24e+00

Pos logit contributions
joy+0.023
 thrill+0.022
 disl+0.021
 Majesty+0.021
 winners+0.020
Neg logit contributions
's-0.015
 so-0.014
 and-0.014
-0.014
 called-0.013
Token was
Token ID373
Feature activation-2.69e-01

Pos logit contributions
joy+0.110
 thrill+0.102
 disl+0.100
 Majesty+0.100
 winners+0.097
Neg logit contributions
's-0.023
 called-0.020
 so-0.020
 and-0.019
-0.019
Token not
Token ID407
Feature activation-3.50e-02

Pos logit contributions
's+0.004
+0.004
ches+0.004
sc+0.004
 closely+0.004
Neg logit contributions
joy-0.010
 thrill-0.010
 relief-0.009
 disl-0.009
 nods-0.009
Token a
Token ID257
Feature activation-8.52e-01

Pos logit contributions
's+0.000
 so+0.000
 and+0.000
 called+0.000
+0.000
Neg logit contributions
joy-0.001
 thrill-0.001
 disl-0.001
 nods-0.001
 Majesty-0.001
Token very
Token ID845
Feature activation+9.78e-01

Pos logit contributions
joy+0.004
 thrill+0.003
 disl+0.003
 Majesty+0.003
 relief+0.003
Neg logit contributions
's-0.002
 so-0.002
 called-0.002
 and-0.001
-0.001
Token helpful
Token ID7613
Feature activation+7.46e-01

Pos logit contributions
joy+0.037
 thrill+0.033
 disl+0.031
viously+0.031
 relief+0.031
Neg logit contributions
 called-0.021
's-0.021
 so-0.020
 really-0.020
 very-0.020
Token.
Token ID13
Feature activation+3.10e-01

Pos logit contributions
joy+0.034
 thrill+0.032
 disl+0.031
 relief+0.031
 nods+0.031
Neg logit contributions
's-0.007
-0.006
 so-0.006
 called-0.006
 and-0.006
Token One
Token ID1881
Feature activation-6.57e-01

Pos logit contributions
joy+0.010
 thrill+0.009
 nods+0.009
 disl+0.008
 yeah+0.008
Neg logit contributions
's-0.007
 so-0.007
-0.007
 off-0.007
 from-0.007
Token night
Token ID1755
Feature activation+1.61e+00

Pos logit contributions
's+0.011
 so+0.011
 called+0.010
+0.010
 and+0.010
Neg logit contributions
joy-0.025
 thrill-0.023
 disl-0.022
 relief-0.022
 nods-0.022
Token,
Token ID11
Feature activation+9.67e-01

Pos logit contributions
joy+0.073
 thrill+0.069
 disl+0.067
 relief+0.066
 nods+0.066
Neg logit contributions
's-0.017
 so-0.015
 and-0.015
 called-0.015
-0.014
Token the
Token ID262
Feature activation+3.62e-03

Pos logit contributions
joy+0.033
 thrill+0.031
 disl+0.029
 nods+0.029
 relief+0.028
Neg logit contributions
's-0.020
-0.020
 so-0.018
ches-0.018
 called-0.018
Token stars
Token ID5788
Feature activation+1.60e+00

Pos logit contributions
joy+0.000
 disl+0.000
 thrill+0.000
 Majesty+0.000
 yeah+0.000
Neg logit contributions
's-0.000
 and-0.000
 so-0.000
 called-0.000
 from-0.000
Token shone
Token ID44193
Feature activation+7.72e-01

Pos logit contributions
joy+0.061
 disl+0.056
 thrill+0.055
 Majesty+0.054
 relief+0.053
Neg logit contributions
's-0.031
 and-0.030
 called-0.029
 so-0.029
-0.027
Token bright
Token ID6016
Feature activation+2.57e-01

Pos logit contributions
joy+0.032
 thrill+0.030
 disl+0.029
 relief+0.029
 nods+0.029
Neg logit contributions
's-0.010
-0.009
 called-0.009
 so-0.009
 and-0.009
Token in
Token ID287
Feature activation+2.47e-01

Pos logit contributions
joy+0.011
 thrill+0.011
 nods+0.010
 relief+0.010
 vibration+0.010
Neg logit contributions
's-0.003
-0.003
 called-0.002
ches-0.002
 pic-0.002
Token inside
Token ID2641
Feature activation-1.24e-01

Pos logit contributions
joy+0.001
 nods+0.001
 thrill+0.001
 relief+0.001
 disl+0.001
Neg logit contributions
 pic-0.000
 closely-0.000
mary-0.000
On-0.000
ool-0.000
Token.
Token ID13
Feature activation-5.21e-01

Pos logit contributions
's+0.001
+0.001
 so+0.001
 called+0.001
 and+0.001
Neg logit contributions
joy-0.006
 thrill-0.005
 disl-0.005
 relief-0.005
 nods-0.005
Token The
Token ID383
Feature activation-7.95e-02

Pos logit contributions
's+0.013
 so+0.012
 and+0.012
 called+0.012
+0.012
Neg logit contributions
joy-0.016
 thrill-0.015
 disl-0.015
 relief-0.014
 nods-0.014
Token mom
Token ID1995
Feature activation+6.91e-01

Pos logit contributions
's+0.002
 and+0.002
 so+0.001
+0.001
 called+0.001
Neg logit contributions
joy-0.003
 disl-0.003
 thrill-0.003
 nods-0.003
 Majesty-0.003
Token was
Token ID373
Feature activation+5.03e-02

Pos logit contributions
joy+0.025
 nods+0.024
 nodded+0.023
 thrill+0.022
 understands+0.022
Neg logit contributions
 pic-0.009
 closely-0.009
sc-0.008
 off-0.008
-0.008
Token sad
Token ID6507
Feature activation+2.94e+00

Pos logit contributions
joy+0.002
 thrill+0.002
 relief+0.002
 disl+0.002
 nods+0.002
Neg logit contributions
's-0.001
-0.001
 and-0.001
 called-0.001
 so-0.001
Token because
Token ID780
Feature activation-1.48e+00

Pos logit contributions
joy+0.138
 thrill+0.131
 disl+0.127
 relief+0.126
 nods+0.125
Neg logit contributions
's-0.026
-0.021
 so-0.021
 called-0.021
 and-0.019
Token the
Token ID262
Feature activation-6.99e-01

Pos logit contributions
's+0.026
 so+0.024
 called+0.024
+0.024
 and+0.024
Neg logit contributions
joy-0.056
 thrill-0.052
 disl-0.051
 relief-0.050
 nods-0.049
Token home
Token ID1363
Feature activation+2.67e+00

Pos logit contributions
's+0.013
 so+0.012
 and+0.012
 called+0.011
+0.011
Neg logit contributions
joy-0.027
 disl-0.025
 thrill-0.025
 nods-0.024
 Majesty-0.024
Token was
Token ID373
Feature activation-1.58e+00

Pos logit contributions
joy+0.112
 thrill+0.105
 disl+0.102
 nods+0.101
 relief+0.101
Neg logit contributions
's-0.032
-0.030
 so-0.029
 called-0.028
 closely-0.027
Token still
Token ID991
Feature activation+3.71e-01

Pos logit contributions
's+0.029
+0.027
sc+0.025
ches+0.024
 closely+0.024
Neg logit contributions
joy-0.057
 thrill-0.054
 relief-0.054
 disl-0.051
 sorry-0.051
Token back
Token ID736
Feature activation+2.06e+00

Pos logit contributions
 disl+0.029
 thrill+0.028
 Majesty+0.027
 liar+0.027
ses+0.026
Neg logit contributions
 and-0.009
 so-0.008
 into-0.008
 in-0.007
 off-0.007
Token down
Token ID866
Feature activation+1.43e+00

Pos logit contributions
joy+0.091
 thrill+0.084
 nods+0.082
 relief+0.080
 disl+0.079
Neg logit contributions
's-0.021
-0.019
 called-0.017
 so-0.017
 pic-0.017
Token and
Token ID290
Feature activation+6.94e-02

Pos logit contributions
joy+0.065
 thrill+0.057
 nods+0.056
 relief+0.055
 grin+0.053
Neg logit contributions
's-0.016
-0.014
ches-0.013
 pic-0.013
 called-0.011
Token continued
Token ID3767
Feature activation-2.01e+00

Pos logit contributions
joy+0.003
 disl+0.003
 thrill+0.002
 liar+0.002
 Majesty+0.002
Neg logit contributions
's-0.001
 and-0.001
 so-0.001
 called-0.001
-0.001
Token to
Token ID284
Feature activation+8.78e-01

Pos logit contributions
's+0.025
+0.023
 called+0.021
 so+0.021
ches+0.021
Neg logit contributions
joy-0.088
 nods-0.075
 disl-0.074
 thrill-0.074
 relief-0.073
Token gaze
Token ID17841
Feature activation+1.35e+00

Pos logit contributions
joy+0.032
 disl+0.028
 thrill+0.028
ses+0.028
 liar+0.027
Neg logit contributions
's-0.020
 and-0.019
-0.019
 so-0.018
 called-0.018
Token at
Token ID379
Feature activation+1.36e-01

Pos logit contributions
joy+0.060
 thrill+0.057
 disl+0.056
 relief+0.054
 nods+0.054
Neg logit contributions
's-0.016
 so-0.015
 and-0.015
 called-0.014
-0.013
Token the
Token ID262
Feature activation+2.79e-01

Pos logit contributions
joy+0.006
 thrill+0.006
 disl+0.006
 relief+0.005
 nods+0.005
Neg logit contributions
's-0.002
 so-0.001
-0.001
 called-0.001
 and-0.001
Token night
Token ID1755
Feature activation-1.79e-01

Pos logit contributions
joy+0.009
 disl+0.009
 yeah+0.008
ses+0.008
 thrill+0.008
Neg logit contributions
's-0.007
 and-0.007
 so-0.006
-0.006
 called-0.006
Token sky
Token ID6766
Feature activation+5.52e-01

Pos logit contributions
's+0.005
 so+0.005
 called+0.005
+0.005
 and+0.005
Neg logit contributions
joy-0.005
 thrill-0.004
 disl-0.004
 relief-0.004
 nods-0.004
Token,
Token ID11
Feature activation+1.17e+00

Pos logit contributions
joy+0.024
 thrill+0.023
 nods+0.022
 disl+0.022
 relief+0.022
Neg logit contributions
's-0.006
-0.005
 called-0.005
 so-0.005
 pic-0.004
Token named
Token ID3706
Feature activation+1.71e-01

Pos logit contributions
joy+0.035
 Amen+0.033
 disl+0.031
 relief+0.031
 sincere+0.031
Neg logit contributions
 off-0.038
ches-0.036
 pic-0.035
sc-0.035
 closely-0.035
Token Lily
Token ID20037
Feature activation-2.54e+00

Pos logit contributions
joy+0.005
 disl+0.005
 relief+0.005
 thrill+0.005
 sorry+0.005
Neg logit contributions
's-0.004
-0.003
 and-0.003
ches-0.003
 so-0.003
Token.
Token ID13
Feature activation-2.81e-01

Pos logit contributions
 off+0.036
 pic+0.036
 so+0.034
ches+0.034
sc+0.034
Neg logit contributions
joy-0.094
 disl-0.089
 thrill-0.086
 nods-0.085
 relief-0.084
Token She
Token ID1375
Feature activation-2.07e-01

Pos logit contributions
's+0.007
 so+0.006
 called+0.006
+0.006
 and+0.006
Neg logit contributions
joy-0.009
 thrill-0.008
 disl-0.008
 nods-0.008
 relief-0.008
Token loved
Token ID6151
Feature activation-4.37e-01

Pos logit contributions
's+0.005
 so+0.005
 off+0.005
+0.005
 closely+0.005
Neg logit contributions
joy-0.006
 disl-0.006
 thrill-0.006
 nods-0.005
 relief-0.005
Token to
Token ID284
Feature activation+1.33e+00

Pos logit contributions
+0.006
ches+0.006
sc+0.005
's+0.005
 closely+0.005
Neg logit contributions
joy-0.018
 disl-0.017
 nods-0.016
 thrill-0.015
 tweet-0.015
Token visit
Token ID3187
Feature activation-3.26e-01

Pos logit contributions
 disl+0.039
joy+0.039
 thrill+0.038
 nods+0.037
 relief+0.036
Neg logit contributions
's-0.031
 so-0.029
-0.029
 closely-0.028
 into-0.028
Token her
Token ID607
Feature activation+3.66e-01

Pos logit contributions
sc+0.008
+0.008
ches+0.007
's+0.006
 lin+0.006
Neg logit contributions
joy-0.010
 relief-0.009
 nods-0.009
 disl-0.009
 thrill-0.008
Token grandma
Token ID49890
Feature activation+2.61e-01

Pos logit contributions
joy+0.010
 Majesty+0.008
 disl+0.008
 nods+0.008
 thrill+0.008
Neg logit contributions
-0.011
sc-0.011
ches-0.010
's-0.010
 so-0.010
Token's
Token ID338
Feature activation-1.50e+00

Pos logit contributions
joy+0.008
 Amen+0.008
 writer+0.007
 nod+0.007
 nods+0.007
Neg logit contributions
sc-0.004
ula-0.004
ool-0.004
Days-0.004
 slippery-0.004
Token house
Token ID2156
Feature activation-6.59e-01

Pos logit contributions
+0.037
ches+0.037
's+0.035
 off+0.034
 called+0.034
Neg logit contributions
joy-0.047
 disl-0.040
 Majesty-0.040
 thrill-0.040
 relief-0.039
Token.
Token ID13
Feature activation+2.15e-01

Pos logit contributions
joy+0.054
 thrill+0.050
 relief+0.049
 vibration+0.046
 grin+0.045
Neg logit contributions
's-0.019
-0.018
At-0.018
ches-0.017
 unw-0.016
Token They
Token ID1119
Feature activation-1.63e-01

Pos logit contributions
joy+0.007
 thrill+0.006
 disl+0.006
 nods+0.006
 relief+0.006
Neg logit contributions
's-0.005
-0.004
 called-0.004
 so-0.004
 and-0.004
Token curled
Token ID41108
Feature activation-8.63e-01

Pos logit contributions
's+0.003
 so+0.003
 and+0.003
+0.003
 called+0.003
Neg logit contributions
joy-0.006
 thrill-0.005
 relief-0.005
 nods-0.005
 disl-0.005
Token up
Token ID510
Feature activation+5.99e-01

Pos logit contributions
's+0.008
 so+0.007
 and+0.007
 called+0.007
+0.006
Neg logit contributions
joy-0.040
 thrill-0.038
 disl-0.038
 relief-0.037
 nods-0.036
Token on
Token ID319
Feature activation-7.89e-01

Pos logit contributions
joy+0.025
 thrill+0.023
 nods+0.022
 relief+0.022
 sorry+0.021
Neg logit contributions
's-0.008
-0.008
ches-0.006
 lin-0.006
At-0.006
Token the
Token ID262
Feature activation+1.19e+00

Pos logit contributions
's+0.014
+0.014
ches+0.013
sc+0.013
 called+0.013
Neg logit contributions
joy-0.029
 relief-0.026
 thrill-0.026
 nods-0.024
 afar-0.024
Token rug
Token ID14477
Feature activation-2.61e+00

Pos logit contributions
 thrill+0.040
joy+0.039
 relief+0.037
 nods+0.036
 vibration+0.035
Neg logit contributions
's-0.023
-0.023
 so-0.022
 called-0.022
 and-0.021
Token and
Token ID290
Feature activation+3.71e-01

Pos logit contributions
+0.028
 pic+0.027
 lin+0.027
At+0.027
During+0.025
Neg logit contributions
joy-0.097
 thrill-0.095
 grin-0.089
 relief-0.089
 nods-0.089
Token hugged
Token ID41130
Feature activation+9.83e-01

Pos logit contributions
joy+0.015
 thrill+0.014
 disl+0.014
 Majesty+0.013
 relief+0.013
Neg logit contributions
's-0.006
 so-0.006
 and-0.006
 called-0.006
-0.006
Token their
Token ID511
Feature activation+1.19e-02

Pos logit contributions
joy+0.046
 thrill+0.044
 disl+0.043
 Majesty+0.042
 relief+0.042
Neg logit contributions
's-0.009
 so-0.008
 and-0.008
 called-0.008
-0.007
Token t
Token ID256
Feature activation-4.47e-01

Pos logit contributions
joy+0.000
 thrill+0.000
 disl+0.000
 relief+0.000
 nods+0.000
Neg logit contributions
's-0.000
 so-0.000
 and-0.000
 called-0.000
-0.000
Token the
Token ID262
Feature activation+1.97e-02

Pos logit contributions
's+0.023
+0.021
 called+0.021
 so+0.020
 and+0.020
Neg logit contributions
joy-0.062
 thrill-0.059
 disl-0.058
 relief-0.056
 nods-0.055
Token pile
Token ID14540
Feature activation-2.42e+00

Pos logit contributions
joy+0.001
 disl+0.001
 Amen+0.001
 thrill+0.001
 relief+0.001
Neg logit contributions
 and-0.000
's-0.000
 so-0.000
 called-0.000
 really-0.000
Token of
Token ID286
Feature activation-2.07e+00

Pos logit contributions
 pic+0.037
ches+0.031
's+0.031
 called+0.030
At+0.030
Neg logit contributions
 thrill-0.087
joy-0.087
 sorry-0.083
 relief-0.083
 grin-0.082
Token leaves
Token ID5667
Feature activation-1.53e+00

Pos logit contributions
's+0.039
 so+0.036
+0.035
 called+0.035
 and+0.035
Neg logit contributions
joy-0.077
 thrill-0.072
 disl-0.070
 relief-0.068
 nods-0.068
Token even
Token ID772
Feature activation-1.68e+00

Pos logit contributions
 pic+0.022
's+0.018
At+0.018
 called+0.018
During+0.017
Neg logit contributions
 thrill-0.054
 disl-0.052
joy-0.052
 nod-0.050
 grin-0.050
Token bigger
Token ID5749
Feature activation-1.27e+00

Pos logit contributions
's+0.031
+0.028
 so+0.027
 called+0.027
 and+0.027
Neg logit contributions
joy-0.061
 thrill-0.058
 disl-0.056
 relief-0.055
 nods-0.055
Token.
Token ID13
Feature activation+2.30e-03

Pos logit contributions
's+0.014
+0.013
 called+0.012
ches+0.012
 closely+0.011
Neg logit contributions
joy-0.055
 thrill-0.051
 disl-0.051
 nods-0.049
 relief-0.048
Token Mia
Token ID32189
Feature activation-2.31e+00

Pos logit contributions
joy+0.000
 thrill+0.000
 disl+0.000
 relief+0.000
 nods+0.000
Neg logit contributions
's-0.000
 so-0.000
 and-0.000
 called-0.000
-0.000
Token was
Token ID373
Feature activation+4.08e-01

Pos logit contributions
 off+0.040
 pic+0.039
 so+0.038
 closely+0.037
 shelf+0.036
Neg logit contributions
joy-0.079
 nods-0.075
 thrill-0.072
 disl-0.072
 nodded-0.068
Token happy
Token ID3772
Feature activation+1.31e+00

Pos logit contributions
joy+0.016
 thrill+0.015
 disl+0.015
 relief+0.015
 nods+0.014
Neg logit contributions
's-0.007
 so-0.006
 and-0.006
-0.006
 called-0.006
Token to
Token ID284
Feature activation+1.12e+00

Pos logit contributions
joy+0.065
 thrill+0.061
 disl+0.060
 relief+0.059
 nods+0.059
Neg logit contributions
's-0.009
 so-0.008
 and-0.007
 called-0.007
-0.007
Token had
Token ID550
Feature activation-1.46e+00

Pos logit contributions
's+0.031
 and+0.027
 called+0.026
 so+0.025
+0.025
Neg logit contributions
joy-0.111
 thrill-0.106
 Majesty-0.103
 disl-0.103
 relief-0.099
Token made
Token ID925
Feature activation-1.72e+00

Pos logit contributions
's+0.018
 so+0.017
 called+0.016
 and+0.015
 to+0.014
Neg logit contributions
joy-0.065
 thrill-0.064
 disl-0.061
 relief-0.060
 Majesty-0.060
Token a
Token ID257
Feature activation+7.76e-02

Pos logit contributions
's+0.024
 called+0.022
+0.022
 so+0.021
 and+0.021
Neg logit contributions
joy-0.071
 thrill-0.067
 disl-0.065
 relief-0.065
 nods-0.063
Token mess
Token ID2085
Feature activation+2.83e+00

Pos logit contributions
joy+0.003
 thrill+0.003
 disl+0.003
 nods+0.003
 relief+0.003
Neg logit contributions
's-0.002
 so-0.001
 and-0.001
 called-0.001
-0.001
Token and
Token ID290
Feature activation+5.41e-01

Pos logit contributions
joy+0.129
 thrill+0.120
 disl+0.118
 relief+0.117
 nods+0.115
Neg logit contributions
's-0.025
 pic-0.021
 called-0.021
-0.020
 so-0.019
Token he
Token ID339
Feature activation-4.57e-02

Pos logit contributions
joy+0.022
 thrill+0.020
 disl+0.020
 Majesty+0.019
 nods+0.019
Neg logit contributions
's-0.009
 so-0.009
 and-0.009
 called-0.009
-0.008
Token should
Token ID815
Feature activation-1.40e+00

Pos logit contributions
's+0.001
 so+0.001
 and+0.001
 called+0.001
+0.001
Neg logit contributions
joy-0.002
 thrill-0.002
 disl-0.002
 relief-0.001
 nods-0.001
Token have
Token ID423
Feature activation-5.86e-01

Pos logit contributions
's+0.024
 and+0.023
 so+0.021
+0.021
 called+0.021
Neg logit contributions
joy-0.062
 thrill-0.054
 Majesty-0.051
ses-0.051
 disl-0.051
Token cleaned
Token ID20750
Feature activation-8.84e-01

Pos logit contributions
's+0.013
 so+0.012
 called+0.012
 and+0.012
+0.011
Neg logit contributions
joy-0.020
 thrill-0.019
 disl-0.018
 relief-0.018
 nods-0.018
Token it
Token ID340
Feature activation-9.18e-01

Pos logit contributions
's+0.013
 called+0.013
At+0.013
+0.013
 pic+0.012
Neg logit contributions
joy-0.034
 relief-0.031
 thrill-0.031
 disl-0.031
 nods-0.029
Token up
Token ID510
Feature activation-1.22e+00

Pos logit contributions
 pic+0.019
 called+0.017
At+0.017
 lin+0.016
 aisle+0.016
Neg logit contributions
 sorry-0.028
 relief-0.028
 thrill-0.027
joy-0.026
 disl-0.026
Token
Token ID198
Feature activation-2.67e-01

Pos logit contributions
joy+0.001
 thrill+0.000
 disl+0.000
 relief+0.000
 nods+0.000
Neg logit contributions
's-0.000
 so-0.000
 called-0.000
 and-0.000
-0.000
Token
Token ID198
Feature activation-7.76e-02

Pos logit contributions
 separate+0.007
's+0.006
 so+0.006
 into+0.006
 off+0.006
Neg logit contributions
joy-0.007
 disl-0.006
 nods-0.006
 relief-0.006
 thrill-0.006
TokenThe
Token ID464
Feature activation-4.26e-02

Pos logit contributions
 into+0.002
 off+0.002
's+0.002
 called+0.002
 so+0.002
Neg logit contributions
Sorry-0.002
Really-0.002
Thank-0.002
Hi-0.002
 disl-0.002
Token next
Token ID1306
Feature activation-2.34e+00

Pos logit contributions
's+0.001
 so+0.001
+0.001
 called+0.001
 off+0.001
Neg logit contributions
joy-0.001
 thrill-0.001
 relief-0.001
 disl-0.001
 nods-0.001
Token day
Token ID1110
Feature activation+9.67e-01

Pos logit contributions
's+0.043
 so+0.040
+0.039
 called+0.039
 and+0.037
Neg logit contributions
joy-0.088
 thrill-0.082
 disl-0.079
 relief-0.078
 nods-0.078
Token,
Token ID11
Feature activation+4.41e-01

Pos logit contributions
joy+0.043
 thrill+0.040
 disl+0.039
 relief+0.039
 nods+0.038
Neg logit contributions
's-0.011
 so-0.010
 called-0.009
-0.009
 and-0.009
Token the
Token ID262
Feature activation-3.48e-01

Pos logit contributions
joy+0.018
 thrill+0.016
 disl+0.016
 Majesty+0.016
 nods+0.016
Neg logit contributions
's-0.007
 so-0.007
 and-0.007
 called-0.007
-0.007
Token little
Token ID1310
Feature activation-1.65e+00

Pos logit contributions
's+0.006
 so+0.005
+0.005
 called+0.005
 and+0.005
Neg logit contributions
joy-0.014
 thrill-0.013
 disl-0.012
 relief-0.012
 nods-0.012
Token girl
Token ID2576
Feature activation+6.06e-01

Pos logit contributions
's+0.053
 and+0.052
 so+0.051
 called+0.050
+0.048
Neg logit contributions
joy-0.041
 disl-0.039
 thrill-0.038
 Majesty-0.035
 relief-0.034
Token came
Token ID1625
Feature activation+1.21e+00

Pos logit contributions
joy+0.020
 nods+0.018
 Majesty+0.016
 nodded+0.016
 grin+0.016
Neg logit contributions
 so-0.012
 closely-0.011
 shelf-0.011
 off-0.011
sc-0.011
Token back
Token ID736
Feature activation+1.52e+00

Pos logit contributions
joy+0.044
 nods+0.035
 thrill+0.034
viously+0.034
 tweet+0.033
Neg logit contributions
 shelf-0.024
 pic-0.024
 convey-0.023
 unw-0.022
 lin-0.022
Token he
Token ID339
Feature activation+4.17e-01

Pos logit contributions
joy+0.010
 thrill+0.009
 disl+0.009
 relief+0.008
 nods+0.008
Neg logit contributions
's-0.005
 so-0.005
 and-0.005
 called-0.005
-0.005
Token loved
Token ID6151
Feature activation-7.96e-01

Pos logit contributions
joy+0.010
 disl+0.010
 nods+0.010
 thrill+0.009
 sincere+0.009
Neg logit contributions
 off-0.011
's-0.011
 so-0.010
 closely-0.010
-0.010
Token to
Token ID284
Feature activation+1.12e+00

Pos logit contributions
+0.010
's+0.010
ches+0.008
sc+0.008
 closely+0.007
Neg logit contributions
joy-0.035
 disl-0.032
 relief-0.030
 thrill-0.030
 Majesty-0.030
Token give
Token ID1577
Feature activation-1.19e-02

Pos logit contributions
joy+0.038
 thrill+0.035
 disl+0.035
 relief+0.033
 nods+0.033
Neg logit contributions
's-0.025
 so-0.023
-0.023
 and-0.022
 called-0.022
Token it
Token ID340
Feature activation-1.14e+00

Pos logit contributions
's+0.000
 so+0.000
 called+0.000
+0.000
 and+0.000
Neg logit contributions
joy-0.000
 thrill-0.000
 disl-0.000
 relief-0.000
 nods-0.000
Token c
Token ID269
Feature activation+8.21e-02

Pos logit contributions
's+0.016
+0.015
 called+0.015
 so+0.014
 closely+0.014
Neg logit contributions
joy-0.045
 thrill-0.042
 disl-0.041
 relief-0.041
 nods-0.040
Tokenudd
Token ID4185
Feature activation-1.69e+00

Pos logit contributions
joy+0.002
 thrill+0.002
 disl+0.002
 nods+0.002
 relief+0.002
Neg logit contributions
's-0.002
 so-0.002
-0.002
 called-0.002
 and-0.002
Tokenles
Token ID829
Feature activation-5.88e-01

Pos logit contributions
's+0.056
+0.056
 called+0.054
 so+0.053
 pic+0.052
Neg logit contributions
joy-0.034
 thrill-0.033
 disl-0.032
 relief-0.029
viously-0.029
Token.
Token ID13
Feature activation+6.20e-01

Pos logit contributions
's+0.006
+0.006
At+0.006
 called+0.005
 pic+0.005
Neg logit contributions
joy-0.025
 thrill-0.023
 disl-0.023
 relief-0.022
 winners-0.022
Token\n
Token ID198
Feature activation-2.38e-01

Pos logit contributions
joy+0.024
 thrill+0.022
 disl+0.022
 relief+0.021
 nods+0.021
Neg logit contributions
's-0.011
 so-0.010
 called-0.010
 and-0.010
-0.010
Token\n
Token ID198
Feature activation+3.43e-02

Pos logit contributions
's+0.005
 off+0.005
 so+0.005
+0.005
 into+0.005
Neg logit contributions
joy-0.008
 thrill-0.007
 nods-0.007
 disl-0.007
 relief-0.006
Token fell
Token ID3214
Feature activation+5.38e-01

Pos logit contributions
sc+0.010
 pic+0.010
 closely+0.010
 into+0.010
 from+0.010
Neg logit contributions
 nods-0.012
joy-0.012
 nodded-0.012
 grin-0.011
 thrill-0.011
Token asleep
Token ID16039
Feature activation+2.46e+00

Pos logit contributions
joy+0.014
 thrill+0.013
 nods+0.012
 relief+0.012
 disl+0.011
Neg logit contributions
's-0.016
-0.015
 called-0.015
 so-0.014
 pic-0.014
Token and
Token ID290
Feature activation+1.87e-01

Pos logit contributions
joy+0.108
 disl+0.106
 thrill+0.104
 Majesty+0.101
 nods+0.100
Neg logit contributions
 and-0.029
's-0.028
 so-0.028
 called-0.026
 from-0.024
Token dream
Token ID4320
Feature activation+1.19e+00

Pos logit contributions
joy+0.007
 thrill+0.007
 disl+0.007
 Majesty+0.006
 nods+0.006
Neg logit contributions
's-0.004
 so-0.003
 called-0.003
 and-0.003
-0.003
Tokent
Token ID83
Feature activation-3.35e-01

Pos logit contributions
joy+0.040
 thrill+0.037
 disl+0.035
 relief+0.034
 nods+0.034
Neg logit contributions
's-0.027
 so-0.025
-0.025
 called-0.025
 and-0.024
Token of
Token ID286
Feature activation-1.70e+00

Pos logit contributions
's+0.006
+0.005
 called+0.005
 and+0.005
 unw+0.005
Neg logit contributions
joy-0.012
 thrill-0.012
 disl-0.011
 relief-0.011
 vibration-0.011
Token the
Token ID262
Feature activation-1.24e-01

Pos logit contributions
's+0.029
 so+0.026
 called+0.026
+0.026
 and+0.026
Neg logit contributions
joy-0.067
 thrill-0.062
 disl-0.061
 relief-0.059
 nods-0.059
Token bird
Token ID6512
Feature activation+4.59e-01

Pos logit contributions
's+0.002
 and+0.002
 so+0.002
+0.002
 called+0.002
Neg logit contributions
joy-0.005
 disl-0.005
osity-0.004
viously-0.004
 nods-0.004
Token she
Token ID673
Feature activation-2.54e+00

Pos logit contributions
joy+0.018
 thrill+0.017
 disl+0.016
 relief+0.016
 nods+0.016
Neg logit contributions
's-0.007
-0.007
 so-0.006
 called-0.006
 off-0.006
Token saw
Token ID2497
Feature activation-2.67e-01

Pos logit contributions
's+0.041
 so+0.039
 called+0.037
 and+0.037
+0.037
Neg logit contributions
joy-0.099
 thrill-0.093
 disl-0.091
 relief-0.089
 nods-0.088
Token earlier
Token ID2961
Feature activation-1.46e-01

Pos logit contributions
's+0.003
+0.003
 so+0.003
 called+0.003
 and+0.002
Neg logit contributions
joy-0.012
 thrill-0.011
 disl-0.011
 relief-0.011
 nods-0.011
Token clean
Token ID3424
Feature activation-2.20e-01

Pos logit contributions
joy+0.006
 disl+0.006
 thrill+0.006
 nods+0.006
viously+0.006
Neg logit contributions
's-0.002
 so-0.002
 and-0.002
 called-0.002
-0.002
Token block
Token ID2512
Feature activation+3.20e-01

Pos logit contributions
's+0.004
 and+0.004
 so+0.004
 called+0.004
+0.004
Neg logit contributions
joy-0.008
 thrill-0.008
 disl-0.007
 nods-0.007
 relief-0.007
Token of
Token ID286
Feature activation-4.53e-01

Pos logit contributions
 thrill+0.012
joy+0.012
 vibration+0.011
 relief+0.011
 grin+0.010
Neg logit contributions
 pic-0.005
mary-0.004
ches-0.004
's-0.004
At-0.004
Token ice
Token ID4771
Feature activation+6.38e-01

Pos logit contributions
 so+0.007
 and+0.007
's+0.007
 called+0.007
 from+0.006
Neg logit contributions
joy-0.019
 Majesty-0.017
 nods-0.017
 disl-0.017
 thrill-0.017
Token and
Token ID290
Feature activation+1.68e-01

Pos logit contributions
 thrill+0.018
joy+0.018
 vibration+0.016
 disl+0.016
ment+0.016
Neg logit contributions
-0.015
 pic-0.014
's-0.013
 called-0.013
ches-0.013
Token decided
Token ID3066
Feature activation-3.43e+00

Pos logit contributions
joy+0.007
 thrill+0.006
 disl+0.006
 nods+0.006
 relief+0.006
Neg logit contributions
's-0.003
 so-0.003
 called-0.003
 and-0.003
-0.003
Token he
Token ID339
Feature activation-1.97e+00

Pos logit contributions
's+0.017
 so+0.013
 and+0.011
 called+0.011
+0.011
Neg logit contributions
joy-0.176
 thrill-0.167
 disl-0.164
 relief-0.161
 nods-0.160
Token wanted
Token ID2227
Feature activation-5.43e-01

Pos logit contributions
 pic+0.041
's+0.040
 so+0.039
 from+0.038
 and+0.038
Neg logit contributions
joy-0.061
 disl-0.056
 nods-0.056
 thrill-0.055
viously-0.052
Token to
Token ID284
Feature activation+9.77e-02

Pos logit contributions
ches+0.007
sc+0.006
+0.006
mary+0.006
's+0.006
Neg logit contributions
joy-0.023
 disl-0.020
 relief-0.019
 thrill-0.019
 forgiveness-0.019
Token melt
Token ID16867
Feature activation+1.05e+00

Pos logit contributions
joy+0.003
 thrill+0.003
 disl+0.003
 relief+0.003
 nods+0.003
Neg logit contributions
's-0.002
 so-0.002
 called-0.002
 and-0.002
-0.002
Token it
Token ID340
Feature activation+2.05e-02

Pos logit contributions
joy+0.042
 thrill+0.039
 relief+0.038
 disl+0.037
 nods+0.037
Neg logit contributions
's-0.016
-0.015
 called-0.014
At-0.013
 pic-0.013
Token cliff
Token ID19516
Feature activation-3.35e-01

Pos logit contributions
joy+0.029
 thrill+0.027
ses+0.027
 disl+0.027
viously+0.026
Neg logit contributions
 and-0.023
's-0.023
 so-0.022
 called-0.020
 from-0.020
Token that
Token ID326
Feature activation+1.08e+00

Pos logit contributions
's+0.004
 called+0.003
 so+0.003
+0.003
 off+0.003
Neg logit contributions
joy-0.014
 thrill-0.013
 disl-0.013
 nods-0.013
 relief-0.013
Token held
Token ID2714
Feature activation-9.67e-02

Pos logit contributions
joy+0.043
 thrill+0.040
 disl+0.040
 relief+0.039
 nods+0.038
Neg logit contributions
's-0.017
 so-0.016
 and-0.015
 called-0.015
-0.015
Token the
Token ID262
Feature activation-4.90e-01

Pos logit contributions
's+0.001
 called+0.001
 so+0.001
+0.001
 and+0.001
Neg logit contributions
joy-0.004
 thrill-0.004
 relief-0.004
 disl-0.004
 nods-0.004
Token start
Token ID923
Feature activation-1.68e+00

Pos logit contributions
's+0.010
 so+0.010
 and+0.010
+0.010
 called+0.009
Neg logit contributions
joy-0.017
 disl-0.016
 thrill-0.016
 nods-0.015
 relief-0.015
Token of
Token ID286
Feature activation-3.03e+00

Pos logit contributions
's+0.018
 called+0.018
 pic+0.017
 unw+0.016
 closely+0.016
Neg logit contributions
joy-0.069
 thrill-0.064
 relief-0.060
 winners-0.060
 nods-0.060
Token it
Token ID340
Feature activation-6.93e-01

Pos logit contributions
's+0.047
+0.042
 called+0.040
 so+0.039
ches+0.038
Neg logit contributions
joy-0.124
 thrill-0.115
 relief-0.113
 disl-0.111
 nods-0.106
Token all
Token ID477
Feature activation-8.31e-01

Pos logit contributions
 called+0.009
's+0.008
 lin+0.007
+0.007
 so+0.007
Neg logit contributions
joy-0.028
 thrill-0.026
 relief-0.025
 disl-0.025
 nods-0.025
Token.
Token ID13
Feature activation+7.87e-01

Pos logit contributions
's+0.012
 called+0.011
 so+0.011
+0.010
 and+0.010
Neg logit contributions
joy-0.034
 thrill-0.032
 disl-0.031
 relief-0.031
 nods-0.031
Token<|endoftext|>
Token ID50256
Feature activation+2.26e+00

Pos logit contributions
joy+0.031
 thrill+0.030
 disl+0.029
 relief+0.028
 nods+0.028
Neg logit contributions
's-0.013
 so-0.012
-0.012
 and-0.012
 called-0.011
TokenOnce
Token ID7454
Feature activation-1.52e+00

Pos logit contributions
joy+0.066
 thrill+0.061
 disl+0.060
 relief+0.057
 Majesty+0.056
Neg logit contributions
's-0.061
 so-0.058
 and-0.058
-0.057
 called-0.057
Token punished
Token ID16851
Feature activation+7.89e-01

Pos logit contributions
joy+0.092
 thrill+0.085
 disl+0.082
 nods+0.079
 relief+0.079
Neg logit contributions
's-0.062
 so-0.060
 called-0.058
 and-0.058
-0.056
Token.
Token ID13
Feature activation-9.25e-01

Pos logit contributions
joy+0.035
 thrill+0.033
 disl+0.032
 relief+0.032
 nods+0.032
Neg logit contributions
's-0.008
-0.007
 called-0.007
 so-0.007
 and-0.006
Token She
Token ID1375
Feature activation-2.27e-01

Pos logit contributions
's+0.016
 so+0.016
 and+0.015
 called+0.015
+0.014
Neg logit contributions
joy-0.036
 thrill-0.035
 disl-0.033
 Majesty-0.032
 relief-0.032
Token said
Token ID531
Feature activation-1.00e+00

Pos logit contributions
 pic+0.005
 closely+0.005
+0.005
's+0.005
 shelf+0.005
Neg logit contributions
 nods-0.006
joy-0.006
 relief-0.006
 sincere-0.006
 thrill-0.006
Token that
Token ID326
Feature activation+1.16e-01

Pos logit contributions
's+0.016
 so+0.015
 and+0.015
+0.014
 called+0.013
Neg logit contributions
joy-0.042
 thrill-0.041
 Majesty-0.038
 relief-0.037
 disl-0.037
Token Lily
Token ID20037
Feature activation-3.38e+00

Pos logit contributions
joy+0.005
 thrill+0.005
 disl+0.004
 relief+0.004
 Majesty+0.004
Neg logit contributions
's-0.002
 so-0.002
 and-0.002
 called-0.002
-0.002
Token had
Token ID550
Feature activation-1.64e+00

Pos logit contributions
 so+0.049
's+0.048
 called+0.046
 off+0.045
 closely+0.045
Neg logit contributions
joy-0.129
 thrill-0.121
 disl-0.121
 nods-0.119
 relief-0.116
Token to
Token ID284
Feature activation+1.14e+00

Pos logit contributions
's+0.015
 so+0.013
+0.013
 and+0.013
 called+0.012
Neg logit contributions
joy-0.077
 thrill-0.072
 disl-0.071
 relief-0.069
 nods-0.069
Token go
Token ID467
Feature activation-1.32e+00

Pos logit contributions
 thrill+0.034
 disl+0.034
joy+0.034
 relief+0.033
 nods+0.031
Neg logit contributions
's-0.026
 so-0.024
 into-0.024
 closely-0.023
 and-0.023
Token to
Token ID284
Feature activation+1.28e-01

Pos logit contributions
+0.018
's+0.017
 pic+0.016
 lin+0.016
 closely+0.015
Neg logit contributions
joy-0.052
 sorry-0.047
 winners-0.044
 disl-0.044
 relief-0.044
Token the
Token ID262
Feature activation+1.43e-01

Pos logit contributions
joy+0.005
 disl+0.004
 thrill+0.004
 relief+0.004
 Majesty+0.004
Neg logit contributions
's-0.003
-0.002
ches-0.002
 called-0.002
 so-0.002
Token Lily
Token ID20037
Feature activation-1.22e+00

Pos logit contributions
joy+0.036
 thrill+0.034
 disl+0.033
 relief+0.032
 nods+0.032
Neg logit contributions
's-0.016
 so-0.015
 and-0.014
 called-0.014
-0.014
Token's
Token ID338
Feature activation-2.65e-01

Pos logit contributions
ula+0.020
sc+0.019
 off+0.019
 closely+0.019
ches+0.018
Neg logit contributions
joy-0.040
 nods-0.038
 disl-0.036
 nodded-0.035
 understands-0.035
Token mom
Token ID1995
Feature activation+1.42e+00

Pos logit contributions
's+0.006
+0.006
ches+0.006
 so+0.006
 called+0.006
Neg logit contributions
joy-0.009
 thrill-0.008
 nods-0.007
 Majesty-0.007
 disl-0.007
Tokenmy
Token ID1820
Feature activation-1.46e+00

Pos logit contributions
joy+0.032
 nods+0.031
 nodded+0.030
Sorry+0.028
 ashamed+0.028
Neg logit contributions
 closely-0.037
 pic-0.036
sc-0.036
 shelf-0.035
 convey-0.034
Token said
Token ID531
Feature activation-1.99e+00

Pos logit contributions
 closely+0.032
 pic+0.032
sc+0.030
 shelf+0.030
 off+0.030
Neg logit contributions
joy-0.039
 nods-0.037
 nodded-0.035
 disl-0.035
 ashamed-0.034
Token they
Token ID484
Feature activation-3.27e+00

Pos logit contributions
's+0.024
 so+0.022
 called+0.022
 and+0.021
+0.021
Neg logit contributions
joy-0.086
 thrill-0.081
 disl-0.080
 relief-0.077
 nods-0.077
Token were
Token ID547
Feature activation-2.21e+00

Pos logit contributions
's+0.060
 so+0.055
 and+0.053
+0.053
 called+0.052
Neg logit contributions
joy-0.130
 thrill-0.122
 disl-0.121
 Majesty-0.117
 relief-0.115
Token going
Token ID1016
Feature activation-1.65e+00

Pos logit contributions
ches+0.045
's+0.043
+0.043
sc+0.042
 closely+0.039
Neg logit contributions
joy-0.077
 sorry-0.071
 thrill-0.066
 sincere-0.066
 winners-0.065
Token to
Token ID284
Feature activation-8.05e-01

Pos logit contributions
 closely+0.028
+0.026
ide+0.025
 shelf+0.024
ches+0.024
Neg logit contributions
joy-0.062
 winners-0.050
 nods-0.049
 sorry-0.048
ancer-0.047
Token a
Token ID257
Feature activation-1.39e+00

Pos logit contributions
ches+0.017
 closely+0.016
sc+0.015
's+0.015
 called+0.014
Neg logit contributions
joy-0.026
 disl-0.026
 relief-0.025
 thrill-0.025
 winners-0.023
Token festival
Token ID10876
Feature activation-5.78e-01

Pos logit contributions
 off+0.031
ches+0.031
 into+0.031
's+0.030
 from+0.030
Neg logit contributions
joy-0.042
 thrill-0.036
 relief-0.035
 disl-0.035
 Majesty-0.035
Token spotted
Token ID13489
Feature activation+1.22e+00

Pos logit contributions
joy+0.042
 thrill+0.039
 disl+0.038
 relief+0.037
 nods+0.036
Neg logit contributions
's-0.019
 so-0.017
 and-0.017
-0.017
 called-0.017
Token a
Token ID257
Feature activation+3.00e-01

Pos logit contributions
joy+0.054
 thrill+0.052
 disl+0.050
 nods+0.050
 Majesty+0.049
Neg logit contributions
 so-0.016
's-0.015
 and-0.015
 called-0.014
-0.013
Token rock
Token ID3881
Feature activation+2.18e+00

Pos logit contributions
joy+0.012
 disl+0.011
 thrill+0.011
 nods+0.011
viously+0.010
Neg logit contributions
's-0.005
 so-0.005
 and-0.005
-0.005
 called-0.005
Token.
Token ID13
Feature activation+5.91e-01

Pos logit contributions
joy+0.091
 thrill+0.084
 disl+0.080
 nods+0.079
 vibration+0.077
Neg logit contributions
 pic-0.024
's-0.024
At-0.021
-0.021
 called-0.021
Token He
Token ID679
Feature activation+9.84e-01

Pos logit contributions
joy+0.017
 thrill+0.015
 disl+0.015
 relief+0.014
 nods+0.014
Neg logit contributions
's-0.016
 so-0.015
 called-0.015
 and-0.015
-0.015
Token decided
Token ID3066
Feature activation-3.14e+00

Pos logit contributions
joy+0.033
 thrill+0.030
 nods+0.030
 disl+0.029
 relief+0.028
Neg logit contributions
's-0.018
 pic-0.018
 so-0.018
 from-0.018
 off-0.017
Token he
Token ID339
Feature activation-1.46e+00

Pos logit contributions
's+0.021
 called+0.017
 so+0.016
+0.015
 and+0.014
Neg logit contributions
joy-0.154
 thrill-0.143
 disl-0.143
 nods-0.139
 relief-0.138
Token wanted
Token ID2227
Feature activation-9.82e-02

Pos logit contributions
's+0.030
 pic+0.030
 shelf+0.028
 so+0.028
 lin+0.028
Neg logit contributions
joy-0.045
 nods-0.042
 disl-0.041
 thrill-0.041
 Majesty-0.039
Token to
Token ID284
Feature activation+2.97e-01

Pos logit contributions
's+0.001
 lin+0.001
+0.001
ches+0.001
mary+0.001
Neg logit contributions
joy-0.004
 disl-0.004
 thrill-0.004
 relief-0.004
 nods-0.004
Token keep
Token ID1394
Feature activation+1.42e+00

Pos logit contributions
joy+0.010
 thrill+0.010
 disl+0.009
 relief+0.009
 Majesty+0.009
Neg logit contributions
's-0.006
 so-0.005
 called-0.005
 and-0.005
-0.005
Token it
Token ID340
Feature activation+4.10e-01

Pos logit contributions
joy+0.055
 thrill+0.050
 disl+0.049
 relief+0.048
 nods+0.047
Neg logit contributions
's-0.024
-0.022
 called-0.021
 so-0.020
 and-0.020